KR20210150487A - Gene Therapy for Lysosomal Disorders - Google Patents
Gene Therapy for Lysosomal Disorders
- Publication number
- KR20210150487A (application number KR1020217036238A)
- Authority
- KR
- South Korea
- Prior art keywords
- nucleic acid
- raav
- gene
- protein
- disease
- Prior art date
- 208000015439 Lysosomal storage disease Diseases 0.000 title claims description 33
- 238000001415 gene therapy Methods 0.000 title description 17
- 150000007523 nucleic acids Chemical class 0.000 claims abstract description 353
- 108090000623 proteins and genes Proteins 0.000 claims abstract description 323
- 102000039446 nucleic acids Human genes 0.000 claims abstract description 304
- 108020004707 nucleic acids Proteins 0.000 claims abstract description 304
- 230000014509 gene expression Effects 0.000 claims abstract description 214
- 230000002401 inhibitory effect Effects 0.000 claims abstract description 125
- 210000003169 central nervous system Anatomy 0.000 claims abstract description 47
- 238000000034 method Methods 0.000 claims abstract description 46
- 208000015114 central nervous system disease Diseases 0.000 claims abstract description 38
- 208000037265 diseases, disorders, signs and symptoms Diseases 0.000 claims abstract description 34
- 108700019146 Transgenes Proteins 0.000 claims abstract description 31
- 201000010099 disease Diseases 0.000 claims abstract description 26
- 102000004169 proteins and genes Human genes 0.000 claims description 84
- 101000997662 Homo sapiens Lysosomal acid glucosylceramidase Proteins 0.000 claims description 82
- 102100033342 Lysosomal acid glucosylceramidase Human genes 0.000 claims description 82
- 238000002347 injection Methods 0.000 claims description 72
- 239000007924 injection Substances 0.000 claims description 72
- 230000008685 targeting Effects 0.000 claims description 61
- 102100026882 Alpha-synuclein Human genes 0.000 claims description 50
- 101000834898 Homo sapiens Alpha-synuclein Proteins 0.000 claims description 50
- 102100026232 Transmembrane protein 106B Human genes 0.000 claims description 47
- 101000834926 Homo sapiens Transmembrane protein 106B Proteins 0.000 claims description 45
- 102100040243 Microtubule-associated protein tau Human genes 0.000 claims description 43
- 102100020983 Lysosome membrane protein 2 Human genes 0.000 claims description 34
- 102100037632 Progranulin Human genes 0.000 claims description 34
- 108090000565 Capsid Proteins Proteins 0.000 claims description 32
- 102100023321 Ceruloplasmin Human genes 0.000 claims description 32
- 102100028496 Galactocerebrosidase Human genes 0.000 claims description 27
- 102100033499 Interleukin-34 Human genes 0.000 claims description 24
- 108091026890 Coding region Proteins 0.000 claims description 23
- 241000702421 Dependoparvovirus Species 0.000 claims description 23
- 101000795117 Homo sapiens Triggering receptor expressed on myeloid cells 2 Proteins 0.000 claims description 23
- 102100029678 Triggering receptor expressed on myeloid cells 2 Human genes 0.000 claims description 23
- 102100022721 40S ribosomal protein S25 Human genes 0.000 claims description 21
- 102100027346 GTP cyclohydrolase 1 Human genes 0.000 claims description 21
- 101000678929 Homo sapiens 40S ribosomal protein S25 Proteins 0.000 claims description 21
- 101000862581 Homo sapiens GTP cyclohydrolase 1 Proteins 0.000 claims description 20
- 108091005488 SCARB2 Proteins 0.000 claims description 19
- 101000860395 Homo sapiens Galactocerebrosidase Proteins 0.000 claims description 18
- 102100026263 Sphingomyelin phosphodiesterase Human genes 0.000 claims description 17
- 230000002093 peripheral effect Effects 0.000 claims description 17
- 102100021633 Cathepsin B Human genes 0.000 claims description 16
- 101000898449 Homo sapiens Cathepsin B Proteins 0.000 claims description 16
- 101000854862 Homo sapiens Vacuolar protein sorting-associated protein 35 Proteins 0.000 claims description 16
- 102100027814 Non-lysosomal glucosylceramidase Human genes 0.000 claims description 16
- 102100020822 Vacuolar protein sorting-associated protein 35 Human genes 0.000 claims description 16
- 101000859679 Homo sapiens Non-lysosomal glucosylceramidase Proteins 0.000 claims description 15
- 101000785978 Homo sapiens Sphingomyelin phosphodiesterase Proteins 0.000 claims description 15
- 208000032859 Synucleinopathies Diseases 0.000 claims description 14
- 208000034799 Tauopathies Diseases 0.000 claims description 13
- 230000004770 neurodegeneration Effects 0.000 claims description 13
- 101000712725 Homo sapiens Ras-related protein Rab-7L1 Proteins 0.000 claims description 12
- 208000015122 neurodegenerative disease Diseases 0.000 claims description 12
- 238000007913 intrathecal administration Methods 0.000 claims description 11
- 102100033100 Ras-related protein Rab-7L1 Human genes 0.000 claims description 8
- 241000702423 Adeno-associated virus - 2 Species 0.000 claims description 7
- 238000010253 intravenous injection Methods 0.000 claims description 6
- 101000998132 Homo sapiens Interleukin-34 Proteins 0.000 claims description 5
- 241000124008 Mammalia Species 0.000 claims description 3
- 101000891579 Homo sapiens Microtubule-associated protein tau Proteins 0.000 claims description 2
- 101710114165 Progranulin Proteins 0.000 claims 1
- 108010007100 Pulmonary Surfactant-Associated Protein A Proteins 0.000 claims 1
- 102100027773 Pulmonary surfactant-associated protein A2 Human genes 0.000 claims 1
- 208000015872 Gaucher disease Diseases 0.000 abstract description 64
- 208000018737 Parkinson disease Diseases 0.000 abstract description 59
- 239000000203 mixture Substances 0.000 abstract description 38
- 239000000047 product Substances 0.000 description 164
- 108020004414 DNA Proteins 0.000 description 141
- 239000013598 vector Substances 0.000 description 121
- 235000018102 proteins Nutrition 0.000 description 80
- 108091032973 (ribonucleotides)n+m Proteins 0.000 description 79
- 239000013608 rAAV vector Substances 0.000 description 74
- 210000004027 cell Anatomy 0.000 description 67
- 238000010586 diagram Methods 0.000 description 58
- 241000699670 Mus sp. Species 0.000 description 50
- 239000002679 microRNA Substances 0.000 description 50
- 108091028043 Nucleic acid sequence Proteins 0.000 description 49
- 108010012809 Progranulins Proteins 0.000 description 49
- 102000019204 Progranulins Human genes 0.000 description 48
- 108091070501 miRNA Proteins 0.000 description 45
- 101710115937 Microtubule-associated protein tau Proteins 0.000 description 42
- 239000013612 plasmid Substances 0.000 description 41
- 102100036197 Prosaposin Human genes 0.000 description 37
- 101710152403 Prosaposin Proteins 0.000 description 37
- 108090000765 processed proteins & peptides Proteins 0.000 description 36
- 230000002452 interceptive effect Effects 0.000 description 35
- 230000035772 mutation Effects 0.000 description 33
- 201000011240 Frontotemporal dementia Diseases 0.000 description 31
- 239000002773 nucleotide Substances 0.000 description 31
- 125000003729 nucleotide group Chemical group 0.000 description 31
- 230000000694 effects Effects 0.000 description 30
- 239000000546 pharmaceutical excipient Substances 0.000 description 26
- 210000004556 brain Anatomy 0.000 description 23
- 201000002832 Lewy body dementia Diseases 0.000 description 22
- 210000004962 mammalian cell Anatomy 0.000 description 21
- 101100095198 Homo sapiens SCARB2 gene Proteins 0.000 description 20
- 108020004684 Internal Ribosome Entry Sites Proteins 0.000 description 20
- 238000009825 accumulation Methods 0.000 description 20
- 230000035508 accumulation Effects 0.000 description 20
- 102000040650 (ribonucleotides)n+m Human genes 0.000 description 19
- 101710181549 Interleukin-34 Proteins 0.000 description 19
- 108010050848 glycylleucine Proteins 0.000 description 19
- 208000024827 Alzheimer disease Diseases 0.000 description 18
- 241000282412 Homo Species 0.000 description 18
- 208000009829 Lewy Body Disease Diseases 0.000 description 18
- 125000003275 alpha amino acid group Chemical group 0.000 description 18
- 210000000349 chromosome Anatomy 0.000 description 18
- 210000005260 human cell Anatomy 0.000 description 18
- 208000024891 symptom Diseases 0.000 description 18
- 238000003556 assay Methods 0.000 description 17
- VPZXBVLAVMBEQI-UHFFFAOYSA-N glycyl-DL-alpha-alanine Natural products OC(=O)C(C)NC(=O)CN VPZXBVLAVMBEQI-UHFFFAOYSA-N 0.000 description 17
- 239000003981 vehicle Substances 0.000 description 16
- 238000010172 mouse model Methods 0.000 description 15
- 108010061238 threonyl-glycine Proteins 0.000 description 15
- 102000003802 alpha-Synuclein Human genes 0.000 description 14
- 108090000185 alpha-Synuclein Proteins 0.000 description 14
- 238000000338 in vitro Methods 0.000 description 14
- 239000013607 AAV vector Substances 0.000 description 13
- YBAFDPFAUTYYRW-UHFFFAOYSA-N N-L-alpha-glutamyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CCC(O)=O YBAFDPFAUTYYRW-UHFFFAOYSA-N 0.000 description 13
- 239000012634 fragment Substances 0.000 description 13
- 108010057821 leucylproline Proteins 0.000 description 13
- 230000002132 lysosomal effect Effects 0.000 description 13
- 101000834253 Gallus gallus Actin, cytoplasmic 1 Proteins 0.000 description 12
- 239000000758 substrate Substances 0.000 description 12
- 108010047495 alanylglycine Proteins 0.000 description 11
- 108010038633 aspartylglutamate Proteins 0.000 description 11
- 101150003696 gba-1 gene Proteins 0.000 description 11
- 108010034529 leucyl-lysine Proteins 0.000 description 11
- 230000001718 repressive effect Effects 0.000 description 11
- 102220341636 rs780972896 Human genes 0.000 description 11
- 108020004459 Small interfering RNA Proteins 0.000 description 10
- 108010047857 aspartylglycine Proteins 0.000 description 10
- 230000002950 deficient Effects 0.000 description 10
- 102000004190 Enzymes Human genes 0.000 description 9
- 108090000790 Enzymes Proteins 0.000 description 9
- 108010042681 Galactosylceramidase Proteins 0.000 description 9
- 208000020916 Gaucher disease type II Diseases 0.000 description 9
- 208000028735 Gaucher disease type III Diseases 0.000 description 9
- SITLTJHOQZFJGG-UHFFFAOYSA-N N-L-alpha-glutamyl-L-valine Natural products CC(C)C(C(O)=O)NC(=O)C(N)CCC(O)=O SITLTJHOQZFJGG-UHFFFAOYSA-N 0.000 description 9
- KZNQNBZMBZJQJO-UHFFFAOYSA-N N-glycyl-L-proline Natural products NCC(=O)N1CCCC1C(O)=O KZNQNBZMBZJQJO-UHFFFAOYSA-N 0.000 description 9
- 108010079364 N-glycylalanine Proteins 0.000 description 9
- 238000004458 analytical method Methods 0.000 description 9
- 108010089804 glycyl-threonine Proteins 0.000 description 9
- 108010051242 phenylalanylserine Proteins 0.000 description 9
- 108010026333 seryl-proline Proteins 0.000 description 9
- 238000002965 ELISA Methods 0.000 description 8
- JBCLFWXMTIKCCB-UHFFFAOYSA-N H-Gly-Phe-OH Natural products NCC(=O)NC(C(O)=O)CC1=CC=CC=C1 JBCLFWXMTIKCCB-UHFFFAOYSA-N 0.000 description 8
- 108010065920 Insulin Lispro Proteins 0.000 description 8
- PMGDADKJMCOXHX-UHFFFAOYSA-N L-Arginyl-L-glutamin-acetat Natural products NC(=N)NCCCC(N)C(=O)NC(CCC(N)=O)C(O)=O PMGDADKJMCOXHX-UHFFFAOYSA-N 0.000 description 8
- 108091027967 Small hairpin RNA Proteins 0.000 description 8
- 101150078881 TMEM106B gene Proteins 0.000 description 8
- KOSRFJWDECSPRO-UHFFFAOYSA-N alpha-L-glutamyl-L-glutamic acid Natural products OC(=O)CCC(N)C(=O)NC(CCC(O)=O)C(O)=O KOSRFJWDECSPRO-UHFFFAOYSA-N 0.000 description 8
- 210000002798 bone marrow cell Anatomy 0.000 description 8
- 108010004073 cysteinylcysteine Proteins 0.000 description 8
- 208000035475 disorder Diseases 0.000 description 8
- 108010078144 glutaminyl-glycine Proteins 0.000 description 8
- 108010037850 glycylvaline Proteins 0.000 description 8
- 239000000463 material Substances 0.000 description 8
- 230000002018 overexpression Effects 0.000 description 8
- VCSABYLVNWQYQE-UHFFFAOYSA-N Ala-Lys-Lys Natural products NCCCCC(NC(=O)C(N)C)C(=O)NC(CCCCN)C(O)=O VCSABYLVNWQYQE-UHFFFAOYSA-N 0.000 description 7
- 208000020322 Gaucher disease type I Diseases 0.000 description 7
- 101000584785 Homo sapiens Ras-related protein Rab-7a Proteins 0.000 description 7
- 238000011529 RT qPCR Methods 0.000 description 7
- 102100030019 Ras-related protein Rab-7a Human genes 0.000 description 7
- 108010060035 arginylproline Proteins 0.000 description 7
- 108010092854 aspartyllysine Proteins 0.000 description 7
- 210000001175 cerebrospinal fluid Anatomy 0.000 description 7
- 108010055341 glutamyl-glutamic acid Proteins 0.000 description 7
- 108010049041 glutamylalanine Proteins 0.000 description 7
- XBGGUPMXALFZOT-UHFFFAOYSA-N glycyl-L-tyrosine hemihydrate Natural products NCC(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 XBGGUPMXALFZOT-UHFFFAOYSA-N 0.000 description 7
- 238000012417 linear regression Methods 0.000 description 7
- 150000002632 lipids Chemical class 0.000 description 7
- 108010064235 lysylglycine Proteins 0.000 description 7
- 230000002981 neuropathic effect Effects 0.000 description 7
- 230000001105 regulatory effect Effects 0.000 description 7
- 239000004055 small Interfering RNA Substances 0.000 description 7
- 238000010361 transduction Methods 0.000 description 7
- 230000026683 transduction Effects 0.000 description 7
- 230000003612 virological effect Effects 0.000 description 7
- 241000880493 Leptailurus serval Species 0.000 description 6
- SBANPBVRHYIMRR-UHFFFAOYSA-N Leu-Ser-Pro Natural products CC(C)CC(N)C(=O)NC(CO)C(=O)N1CCCC1C(O)=O SBANPBVRHYIMRR-UHFFFAOYSA-N 0.000 description 6
- 241001465754 Metazoa Species 0.000 description 6
- XMBSYZWANAQXEV-UHFFFAOYSA-N N-alpha-L-glutamyl-L-phenylalanine Natural products OC(=O)CCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 XMBSYZWANAQXEV-UHFFFAOYSA-N 0.000 description 6
- 108010069020 alanyl-prolyl-glycine Proteins 0.000 description 6
- 108010087924 alanylproline Proteins 0.000 description 6
- 108010008355 arginyl-glutamine Proteins 0.000 description 6
- 108010040443 aspartyl-aspartic acid Proteins 0.000 description 6
- 230000003542 behavioural effect Effects 0.000 description 6
- POQRWMRXUOPCLD-GZXCKHLVSA-N beta-D-glucosyl-N-(tetracosanoyl)sphingosine Chemical compound CCCCCCCCCCCCCCCCCCCCCCCC(=O)N[C@H]([C@H](O)\C=C\CCCCCCCCCCCCC)CO[C@@H]1O[C@H](CO)[C@@H](O)[C@H](O)[C@H]1O POQRWMRXUOPCLD-GZXCKHLVSA-N 0.000 description 6
- 230000000295 complement effect Effects 0.000 description 6
- 239000002299 complementary DNA Substances 0.000 description 6
- 108010016616 cysteinylglycine Proteins 0.000 description 6
- 108010063718 gamma-glutamylaspartic acid Proteins 0.000 description 6
- XKUKSGPZAADMRA-UHFFFAOYSA-N glycyl-glycyl-glycine Natural products NCC(=O)NCC(=O)NCC(O)=O XKUKSGPZAADMRA-UHFFFAOYSA-N 0.000 description 6
- 108010081551 glycylphenylalanine Proteins 0.000 description 6
- 238000003384 imaging method Methods 0.000 description 6
- 108010016686 methionyl-alanyl-serine Proteins 0.000 description 6
- 108010012581 phenylalanylglutamate Proteins 0.000 description 6
- 238000012360 testing method Methods 0.000 description 6
- 238000001890 transfection Methods 0.000 description 6
- 241000701447 unidentified baculovirus Species 0.000 description 6
- VHAQSYHSDKERBS-XPUUQOCRSA-N Ala-Val-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O VHAQSYHSDKERBS-XPUUQOCRSA-N 0.000 description 5
- GMFAGHNRXPSSJS-SRVKXCTJSA-N Arg-Leu-Gln Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O GMFAGHNRXPSSJS-SRVKXCTJSA-N 0.000 description 5
- 108090000712 Cathepsin B Proteins 0.000 description 5
- 102000004225 Cathepsin B Human genes 0.000 description 5
- BMHBJCVEXUBGFI-BIIVOSGPSA-N Cys-Cys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CS)NC(=O)[C@H](CS)N)C(=O)O BMHBJCVEXUBGFI-BIIVOSGPSA-N 0.000 description 5
- DMYACXMQUABZIQ-NRPADANISA-N Glu-Ser-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O DMYACXMQUABZIQ-NRPADANISA-N 0.000 description 5
- 101000934372 Homo sapiens Macrosialin Proteins 0.000 description 5
- 239000012097 Lipofectamine 2000 Substances 0.000 description 5
- 101150070547 MAPT gene Proteins 0.000 description 5
- 102100025136 Macrosialin Human genes 0.000 description 5
- 208000001089 Multiple system atrophy Diseases 0.000 description 5
- AJHCSUXXECOXOY-UHFFFAOYSA-N N-glycyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)CN)C(O)=O)=CNC2=C1 AJHCSUXXECOXOY-UHFFFAOYSA-N 0.000 description 5
- 108010002311 N-glycylglutamic acid Proteins 0.000 description 5
- -1 PRGN Proteins 0.000 description 5
- IWNOFCGBMSFTBC-CIUDSAMLSA-N Pro-Ala-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O IWNOFCGBMSFTBC-CIUDSAMLSA-N 0.000 description 5
- 108010005233 alanylglutamic acid Proteins 0.000 description 5
- 108010070944 alanylhistidine Proteins 0.000 description 5
- 108010069205 aspartyl-phenylalanine Proteins 0.000 description 5
- 230000000875 corresponding effect Effects 0.000 description 5
- 108010048367 enhanced green fluorescent protein Proteins 0.000 description 5
- 230000006870 function Effects 0.000 description 5
- HPAIKDPJURGQLN-UHFFFAOYSA-N glycyl-L-histidyl-L-phenylalanine Natural products C=1C=CC=CC=1CC(C(O)=O)NC(=O)C(NC(=O)CN)CC1=CN=CN1 HPAIKDPJURGQLN-UHFFFAOYSA-N 0.000 description 5
- 108010025306 histidylleucine Proteins 0.000 description 5
- 238000005462 in vivo assay Methods 0.000 description 5
- 238000007914 intraventricular administration Methods 0.000 description 5
- 108010003700 lysyl aspartic acid Proteins 0.000 description 5
- 238000004519 manufacturing process Methods 0.000 description 5
- 210000000274 microglia Anatomy 0.000 description 5
- 239000008194 pharmaceutical composition Substances 0.000 description 5
- 108010090894 prolylleucine Proteins 0.000 description 5
- 238000003753 real-time PCR Methods 0.000 description 5
- 108010048397 seryl-lysyl-leucine Proteins 0.000 description 5
- 239000000243 solution Substances 0.000 description 5
- 239000000126 substance Substances 0.000 description 5
- 238000006467 substitution reaction Methods 0.000 description 5
- 210000001519 tissue Anatomy 0.000 description 5
- 108010080629 tryptophan-leucine Proteins 0.000 description 5
- 239000013603 viral vector Substances 0.000 description 5
- YXHLJMWYDTXDHS-IRFLANFNSA-N 7-aminoactinomycin D Chemical compound C[C@H]1OC(=O)[C@H](C(C)C)N(C)C(=O)CN(C)C(=O)[C@@H]2CCCN2C(=O)[C@@H](C(C)C)NC(=O)[C@H]1NC(=O)C1=C(N)C(=O)C(C)=C2OC(C(C)=C(N)C=C3C(=O)N[C@@H]4C(=O)N[C@@H](C(N5CCC[C@H]5C(=O)N(C)CC(=O)N(C)[C@@H](C(C)C)C(=O)O[C@@H]4C)=O)C(C)C)=C3N=C21 YXHLJMWYDTXDHS-IRFLANFNSA-N 0.000 description 4
- 108700012813 7-aminoactinomycin D Proteins 0.000 description 4
- CCDFBRZVTDDJNM-GUBZILKMSA-N Ala-Leu-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O CCDFBRZVTDDJNM-GUBZILKMSA-N 0.000 description 4
- VCSABYLVNWQYQE-SRVKXCTJSA-N Ala-Lys-Lys Chemical compound NCCCC[C@H](NC(=O)[C@@H](N)C)C(=O)N[C@@H](CCCCN)C(O)=O VCSABYLVNWQYQE-SRVKXCTJSA-N 0.000 description 4
- KUFVXLQLDHJVOG-SHGPDSBTSA-N Ala-Thr-Thr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](C)N)O KUFVXLQLDHJVOG-SHGPDSBTSA-N 0.000 description 4
- YNSGXDWWPCGGQS-YUMQZZPRSA-N Arg-Gly-Gln Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)NCC(=O)N[C@@H](CCC(N)=O)C(O)=O YNSGXDWWPCGGQS-YUMQZZPRSA-N 0.000 description 4
- UHFUZWSZQKMDSX-DCAQKATOSA-N Arg-Leu-Asn Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N UHFUZWSZQKMDSX-DCAQKATOSA-N 0.000 description 4
- 108091026821 Artificial microRNA Proteins 0.000 description 4
- IBLAOXSULLECQZ-IUKAMOBKSA-N Asn-Ile-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CC(N)=O IBLAOXSULLECQZ-IUKAMOBKSA-N 0.000 description 4
- 101150035856 CTSB gene Proteins 0.000 description 4
- 101100282794 Caenorhabditis elegans gba-2 gene Proteins 0.000 description 4
- ISWAQPWFWKGCAL-ACZMJKKPSA-N Cys-Cys-Glu Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCC(O)=O)C(O)=O ISWAQPWFWKGCAL-ACZMJKKPSA-N 0.000 description 4
- 238000012286 ELISA Assay Methods 0.000 description 4
- 101150004665 GCH1 gene Proteins 0.000 description 4
- 208000034826 Genetic Predisposition to Disease Diseases 0.000 description 4
- UHPAZODVFFYEEL-QWRGUYRKSA-N Gly-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)CN UHPAZODVFFYEEL-QWRGUYRKSA-N 0.000 description 4
- 101150027225 Il34 gene Proteins 0.000 description 4
- IBMVEYRWAWIOTN-UHFFFAOYSA-N L-Leucyl-L-Arginyl-L-Proline Natural products CC(C)CC(N)C(=O)NC(CCCN=C(N)N)C(=O)N1CCCC1C(O)=O IBMVEYRWAWIOTN-UHFFFAOYSA-N 0.000 description 4
- CIVKXGPFXDIQBV-WDCWCFNPSA-N Leu-Gln-Thr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O CIVKXGPFXDIQBV-WDCWCFNPSA-N 0.000 description 4
- HQVDJTYKCMIWJP-YUMQZZPRSA-N Lys-Asn-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O HQVDJTYKCMIWJP-YUMQZZPRSA-N 0.000 description 4
- 208000026072 Motor neurone disease Diseases 0.000 description 4
- 108010066427 N-valyltryptophan Proteins 0.000 description 4
- 208000002537 Neuronal Ceroid-Lipofuscinoses Diseases 0.000 description 4
- 239000012124 Opti-MEM Substances 0.000 description 4
- IFMDQWDAJUMMJC-DCAQKATOSA-N Pro-Ala-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O IFMDQWDAJUMMJC-DCAQKATOSA-N 0.000 description 4
- 101150002602 Psap gene Proteins 0.000 description 4
- 108010079005 RDV peptide Proteins 0.000 description 4
- 101150110423 SNCA gene Proteins 0.000 description 4
- BTKUIVBNGBFTTP-WHFBIAKZSA-N Ser-Ala-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)NCC(O)=O BTKUIVBNGBFTTP-WHFBIAKZSA-N 0.000 description 4
- 101150118355 Smpd1 gene Proteins 0.000 description 4
- DGOJNGCGEYOBKN-BWBBJGPYSA-N Thr-Cys-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CS)C(=O)O)N)O DGOJNGCGEYOBKN-BWBBJGPYSA-N 0.000 description 4
- VRUFCJZQDACGLH-UVOCVTCTSA-N Thr-Leu-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O VRUFCJZQDACGLH-UVOCVTCTSA-N 0.000 description 4
- KZTLZZQTJMCGIP-ZJDVBMNYSA-N Thr-Val-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KZTLZZQTJMCGIP-ZJDVBMNYSA-N 0.000 description 4
- 101150035098 VPS35 gene Proteins 0.000 description 4
- LYERIXUFCYVFFX-GVXVVHGQSA-N Val-Leu-Glu Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N LYERIXUFCYVFFX-GVXVVHGQSA-N 0.000 description 4
- UZFNHAXYMICTBU-DZKIICNBSA-N Val-Phe-Gln Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N UZFNHAXYMICTBU-DZKIICNBSA-N 0.000 description 4
- 206010002026 amyotrophic lateral sclerosis Diseases 0.000 description 4
- 108010069926 arginyl-glycyl-serine Proteins 0.000 description 4
- 108010077245 asparaginyl-proline Proteins 0.000 description 4
- 238000003776 cleavage reaction Methods 0.000 description 4
- 208000010877 cognitive disease Diseases 0.000 description 4
- 230000002596 correlated effect Effects 0.000 description 4
- FSXRLASFHBWESK-UHFFFAOYSA-N dipeptide phenylalanyl-tyrosine Natural products C=1C=C(O)C=CC=1CC(C(O)=O)NC(=O)C(N)CC1=CC=CC=C1 FSXRLASFHBWESK-UHFFFAOYSA-N 0.000 description 4
- 101150022753 galc gene Proteins 0.000 description 4
- 101150073411 gba-2 gene Proteins 0.000 description 4
- 230000030279 gene silencing Effects 0.000 description 4
- 230000002068 genetic effect Effects 0.000 description 4
- 108010013768 glutamyl-aspartyl-proline Proteins 0.000 description 4
- 108010015792 glycyllysine Proteins 0.000 description 4
- 108010092114 histidylphenylalanine Proteins 0.000 description 4
- 238000001727 in vivo Methods 0.000 description 4
- 238000001361 intraarterial administration Methods 0.000 description 4
- 108010044374 isoleucyl-tyrosine Proteins 0.000 description 4
- 210000003734 kidney Anatomy 0.000 description 4
- 210000003712 lysosome Anatomy 0.000 description 4
- 230000001868 lysosomic effect Effects 0.000 description 4
- 108010009298 lysylglutamic acid Proteins 0.000 description 4
- 230000004048 modification Effects 0.000 description 4
- 238000012986 modification Methods 0.000 description 4
- 208000005264 motor neuron disease Diseases 0.000 description 4
- 201000008051 neuronal ceroid lipofuscinosis Diseases 0.000 description 4
- 238000004806 packaging method and process Methods 0.000 description 4
- 108091007428 primary miRNA Proteins 0.000 description 4
- 201000002212 progressive supranuclear palsy Diseases 0.000 description 4
- 108010031719 prolyl-serine Proteins 0.000 description 4
- 108010053725 prolylvaline Proteins 0.000 description 4
- 230000007017 scission Effects 0.000 description 4
- 108010071207 serylmethionine Proteins 0.000 description 4
- 230000004083 survival effect Effects 0.000 description 4
- 102000013498 tau Proteins Human genes 0.000 description 4
- 108010026424 tau Proteins Proteins 0.000 description 4
- 238000002560 therapeutic procedure Methods 0.000 description 4
- 239000012096 transfection reagent Substances 0.000 description 4
- 238000012546 transfer Methods 0.000 description 4
- 230000032258 transport Effects 0.000 description 4
- 108010038745 tryptophylglycine Proteins 0.000 description 4
- 108010045269 tryptophyltryptophan Proteins 0.000 description 4
- 108010003137 tyrosyltyrosine Proteins 0.000 description 4
- 108020005345 3' Untranslated Regions Proteins 0.000 description 3
- 241000972680 Adeno-associated virus - 6 Species 0.000 description 3
- KIUYPHAMDKDICO-WHFBIAKZSA-N Ala-Asp-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O KIUYPHAMDKDICO-WHFBIAKZSA-N 0.000 description 3
- HXNNRBHASOSVPG-GUBZILKMSA-N Ala-Glu-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O HXNNRBHASOSVPG-GUBZILKMSA-N 0.000 description 3
- AWZKCUCQJNTBAD-SRVKXCTJSA-N Ala-Leu-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCCN AWZKCUCQJNTBAD-SRVKXCTJSA-N 0.000 description 3
- YCTIYBUTCKNOTI-UWJYBYFXSA-N Ala-Tyr-Asp Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N YCTIYBUTCKNOTI-UWJYBYFXSA-N 0.000 description 3
- UVTGNSWSRSCPLP-UHFFFAOYSA-N Arg-Tyr Natural products NC(CCNC(=N)N)C(=O)NC(Cc1ccc(O)cc1)C(=O)O UVTGNSWSRSCPLP-UHFFFAOYSA-N 0.000 description 3
- QCVXMEHGFUMKCO-YUMQZZPRSA-N Asp-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC(O)=O QCVXMEHGFUMKCO-YUMQZZPRSA-N 0.000 description 3
- PGUYEUCYVNZGGV-QWRGUYRKSA-N Asp-Gly-Tyr Chemical compound OC(=O)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 PGUYEUCYVNZGGV-QWRGUYRKSA-N 0.000 description 3
- KPSHWSWFPUDEGF-FXQIFTODSA-N Asp-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CC(O)=O KPSHWSWFPUDEGF-FXQIFTODSA-N 0.000 description 3
- MGSVBZIBCCKGCY-ZLUOBGJFSA-N Asp-Ser-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O MGSVBZIBCCKGCY-ZLUOBGJFSA-N 0.000 description 3
- YIDFBWRHIYOYAA-LKXGYXEUSA-N Asp-Ser-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O YIDFBWRHIYOYAA-LKXGYXEUSA-N 0.000 description 3
- 206010012289 Dementia Diseases 0.000 description 3
- 101150024624 GRN gene Proteins 0.000 description 3
- 108700028146 Genetic Enhancer Elements Proteins 0.000 description 3
- MADFVRSKEIEZHZ-DCAQKATOSA-N Gln-Gln-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCC(=O)N)N MADFVRSKEIEZHZ-DCAQKATOSA-N 0.000 description 3
- FITIQFSXXBKFFM-NRPADANISA-N Gln-Val-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O FITIQFSXXBKFFM-NRPADANISA-N 0.000 description 3
- CUXJIASLBRJOFV-LAEOZQHASA-N Glu-Gly-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H]([C@@H](C)CC)C(O)=O CUXJIASLBRJOFV-LAEOZQHASA-N 0.000 description 3
- QXDXIXFSFHUYAX-MNXVOIDGSA-N Glu-Ile-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCC(O)=O QXDXIXFSFHUYAX-MNXVOIDGSA-N 0.000 description 3
- VSRCAOIHMGCIJK-SRVKXCTJSA-N Glu-Leu-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O VSRCAOIHMGCIJK-SRVKXCTJSA-N 0.000 description 3
- JBRBACJPBZNFMF-YUMQZZPRSA-N Gly-Ala-Lys Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCCN JBRBACJPBZNFMF-YUMQZZPRSA-N 0.000 description 3
- LJPIRKICOISLKN-WHFBIAKZSA-N Gly-Ala-Ser Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O LJPIRKICOISLKN-WHFBIAKZSA-N 0.000 description 3
- LCNXZQROPKFGQK-WHFBIAKZSA-N Gly-Asp-Ser Chemical compound NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O LCNXZQROPKFGQK-WHFBIAKZSA-N 0.000 description 3
- MIIVFRCYJABHTQ-ONGXEEELSA-N Gly-Leu-Val Chemical compound [H]NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O MIIVFRCYJABHTQ-ONGXEEELSA-N 0.000 description 3
- BMWFDYIYBAFROD-WPRPVWTQSA-N Gly-Pro-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)CN BMWFDYIYBAFROD-WPRPVWTQSA-N 0.000 description 3
- DNAZKGFYFRGZIH-QWRGUYRKSA-N Gly-Tyr-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CC=C(O)C=C1 DNAZKGFYFRGZIH-QWRGUYRKSA-N 0.000 description 3
- 102100031181 Glyceraldehyde-3-phosphate dehydrogenase Human genes 0.000 description 3
- VUUFXXGKMPLKNH-BZSNNMDCSA-N His-Phe-His Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)NC(=O)[C@H](CC3=CN=CN3)N VUUFXXGKMPLKNH-BZSNNMDCSA-N 0.000 description 3
- 101100426014 Homo sapiens TREM2 gene Proteins 0.000 description 3
- YCKPUHHMCFSUMD-IUKAMOBKSA-N Ile-Thr-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N YCKPUHHMCFSUMD-IUKAMOBKSA-N 0.000 description 3
- LZDNBBYBDGBADK-UHFFFAOYSA-N L-valyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)C(C)C)C(O)=O)=CNC2=C1 LZDNBBYBDGBADK-UHFFFAOYSA-N 0.000 description 3
- STAVRDQLZOTNKJ-RHYQMDGZSA-N Leu-Arg-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O STAVRDQLZOTNKJ-RHYQMDGZSA-N 0.000 description 3
- WGNOPSQMIQERPK-UHFFFAOYSA-N Leu-Asn-Pro Natural products CC(C)CC(N)C(=O)NC(CC(=O)N)C(=O)N1CCCC1C(=O)O WGNOPSQMIQERPK-UHFFFAOYSA-N 0.000 description 3
- PVMPDMIKUVNOBD-CIUDSAMLSA-N Leu-Asp-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O PVMPDMIKUVNOBD-CIUDSAMLSA-N 0.000 description 3
- IASQBRJGRVXNJI-YUMQZZPRSA-N Leu-Cys-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CS)C(=O)NCC(O)=O IASQBRJGRVXNJI-YUMQZZPRSA-N 0.000 description 3
- CQGSYZCULZMEDE-UHFFFAOYSA-N Leu-Gln-Pro Natural products CC(C)CC(N)C(=O)NC(CCC(N)=O)C(=O)N1CCCC1C(O)=O CQGSYZCULZMEDE-UHFFFAOYSA-N 0.000 description 3
- KUEVMUXNILMJTK-JYJNAYRXSA-N Leu-Gln-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 KUEVMUXNILMJTK-JYJNAYRXSA-N 0.000 description 3
- HYIFFZAQXPUEAU-QWRGUYRKSA-N Leu-Gly-Leu Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC(C)C HYIFFZAQXPUEAU-QWRGUYRKSA-N 0.000 description 3
- IAJFFZORSWOZPQ-SRVKXCTJSA-N Leu-Leu-Asn Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O IAJFFZORSWOZPQ-SRVKXCTJSA-N 0.000 description 3
- RXGLHDWAZQECBI-SRVKXCTJSA-N Leu-Leu-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O RXGLHDWAZQECBI-SRVKXCTJSA-N 0.000 description 3
- BMVFXOQHDQZAQU-DCAQKATOSA-N Leu-Pro-Asp Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(=O)O)C(=O)O)N BMVFXOQHDQZAQU-DCAQKATOSA-N 0.000 description 3
- SBANPBVRHYIMRR-GARJFASQSA-N Leu-Ser-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CO)C(=O)N1CCC[C@@H]1C(=O)O)N SBANPBVRHYIMRR-GARJFASQSA-N 0.000 description 3
- CNWDWAMPKVYJJB-NUTKFTJISA-N Leu-Trp-Ala Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](N)CC(C)C)C(=O)N[C@@H](C)C(O)=O)=CNC2=C1 CNWDWAMPKVYJJB-NUTKFTJISA-N 0.000 description 3
- AXVIGSRGTMNSJU-YESZJQIVSA-N Leu-Tyr-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N2CCC[C@@H]2C(=O)O)N AXVIGSRGTMNSJU-YESZJQIVSA-N 0.000 description 3
- AIMGJYMCTAABEN-GVXVVHGQSA-N Leu-Val-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O AIMGJYMCTAABEN-GVXVVHGQSA-N 0.000 description 3
- MPOHDJKRBLVGCT-CIUDSAMLSA-N Lys-Ala-Asn Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCCCN)N MPOHDJKRBLVGCT-CIUDSAMLSA-N 0.000 description 3
- ZXEUFAVXODIPHC-GUBZILKMSA-N Lys-Glu-Asn Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O ZXEUFAVXODIPHC-GUBZILKMSA-N 0.000 description 3
- LCMWVZLBCUVDAZ-IUCAKERBSA-N Lys-Gly-Glu Chemical compound [NH3+]CCCC[C@H]([NH3+])C(=O)NCC(=O)N[C@H](C([O-])=O)CCC([O-])=O LCMWVZLBCUVDAZ-IUCAKERBSA-N 0.000 description 3
- PDIDTSZKKFEDMB-UWVGGRQHSA-N Lys-Pro-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O PDIDTSZKKFEDMB-UWVGGRQHSA-N 0.000 description 3
- HKXSZKJMDBHOTG-CIUDSAMLSA-N Lys-Ser-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CCCCN HKXSZKJMDBHOTG-CIUDSAMLSA-N 0.000 description 3
- WXHHTBVYQOSYSL-FXQIFTODSA-N Met-Ala-Ser Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O WXHHTBVYQOSYSL-FXQIFTODSA-N 0.000 description 3
- 108700011259 MicroRNAs Proteins 0.000 description 3
- BQVUABVGYYSDCJ-UHFFFAOYSA-N Nalpha-L-Leucyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)CC(C)C)C(O)=O)=CNC2=C1 BQVUABVGYYSDCJ-UHFFFAOYSA-N 0.000 description 3
- 208000021320 Nasu-Hakola disease Diseases 0.000 description 3
- APJPXSFJBMMOLW-KBPBESRZSA-N Phe-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC1=CC=CC=C1 APJPXSFJBMMOLW-KBPBESRZSA-N 0.000 description 3
- CMHTUJQZQXFNTQ-OEAJRASXSA-N Phe-Leu-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC1=CC=CC=C1)N)O CMHTUJQZQXFNTQ-OEAJRASXSA-N 0.000 description 3
- BONHGTUEEPIMPM-AVGNSLFASA-N Phe-Ser-Glu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(O)=O BONHGTUEEPIMPM-AVGNSLFASA-N 0.000 description 3
- KIZQGKLMXKGDIV-BQBZGAKWSA-N Pro-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1 KIZQGKLMXKGDIV-BQBZGAKWSA-N 0.000 description 3
- BNBBNGZZKQUWCD-IUCAKERBSA-N Pro-Arg-Gly Chemical compound NC(N)=NCCC[C@@H](C(=O)NCC(O)=O)NC(=O)[C@@H]1CCCN1 BNBBNGZZKQUWCD-IUCAKERBSA-N 0.000 description 3
- CYQQWUPHIZVCNY-GUBZILKMSA-N Pro-Arg-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O CYQQWUPHIZVCNY-GUBZILKMSA-N 0.000 description 3
- YFNOUBWUIIJQHF-LPEHRKFASA-N Pro-Asp-Pro Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC(=O)O)C(=O)N2CCC[C@@H]2C(=O)O YFNOUBWUIIJQHF-LPEHRKFASA-N 0.000 description 3
- CLNJSLSHKJECME-BQBZGAKWSA-N Pro-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)[C@@H]1CCCN1 CLNJSLSHKJECME-BQBZGAKWSA-N 0.000 description 3
- YXHYJEPDKSYPSQ-AVGNSLFASA-N Pro-Leu-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H]1CCCN1 YXHYJEPDKSYPSQ-AVGNSLFASA-N 0.000 description 3
- FYKUEXMZYFIZKA-DCAQKATOSA-N Pro-Pro-Gln Chemical compound [H]N1CCC[C@H]1C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O FYKUEXMZYFIZKA-DCAQKATOSA-N 0.000 description 3
- FIDNSJUXESUDOV-JYJNAYRXSA-N Pro-Tyr-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(O)=O FIDNSJUXESUDOV-JYJNAYRXSA-N 0.000 description 3
- ZMLRZBWCXPQADC-TUAOUCFPSA-N Pro-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@@H]2CCCN2 ZMLRZBWCXPQADC-TUAOUCFPSA-N 0.000 description 3
- 101150108283 RpS25 gene Proteins 0.000 description 3
- WDXYVIIVDIDOSX-DCAQKATOSA-N Ser-Arg-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CO)CCCN=C(N)N WDXYVIIVDIDOSX-DCAQKATOSA-N 0.000 description 3
- BPMRXBZYPGYPJN-WHFBIAKZSA-N Ser-Gly-Asn Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O BPMRXBZYPGYPJN-WHFBIAKZSA-N 0.000 description 3
- VGQVAVQWKJLIRM-FXQIFTODSA-N Ser-Ser-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O VGQVAVQWKJLIRM-FXQIFTODSA-N 0.000 description 3
- 101150085127 TREM2 gene Proteins 0.000 description 3
- AMXMBCAXAZUCFA-RHYQMDGZSA-N Thr-Leu-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O AMXMBCAXAZUCFA-RHYQMDGZSA-N 0.000 description 3
- VPRHDRKAPYZMHL-SZMVWBNQSA-N Trp-Leu-Glu Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O)=CNC2=C1 VPRHDRKAPYZMHL-SZMVWBNQSA-N 0.000 description 3
- 208000007930 Type C Niemann-Pick Disease Diseases 0.000 description 3
- DWAMXBFJNZIHMC-KBPBESRZSA-N Tyr-Leu-Gly Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O DWAMXBFJNZIHMC-KBPBESRZSA-N 0.000 description 3
- BYAKMYBZADCNMN-JYJNAYRXSA-N Tyr-Lys-Gln Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(O)=O BYAKMYBZADCNMN-JYJNAYRXSA-N 0.000 description 3
- VENKIVFKIPGEJN-NHCYSSNCSA-N Val-Met-Glu Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N VENKIVFKIPGEJN-NHCYSSNCSA-N 0.000 description 3
- 241000700605 Viruses Species 0.000 description 3
- 108010044940 alanylglutamine Proteins 0.000 description 3
- 108010043240 arginyl-leucyl-glycine Proteins 0.000 description 3
- 108010062796 arginyllysine Proteins 0.000 description 3
- 108010068265 aspartyltyrosine Proteins 0.000 description 3
- 230000006399 behavior Effects 0.000 description 3
- 230000008499 blood brain barrier function Effects 0.000 description 3
- 210000001218 blood-brain barrier Anatomy 0.000 description 3
- 230000037396 body weight Effects 0.000 description 3
- 230000001413 cellular effect Effects 0.000 description 3
- 239000003153 chemical reaction reagent Substances 0.000 description 3
- 150000001875 compounds Chemical class 0.000 description 3
- 108010069495 cysteinyltyrosine Proteins 0.000 description 3
- 230000003247 decreasing effect Effects 0.000 description 3
- 230000007812 deficiency Effects 0.000 description 3
- 230000006735 deficit Effects 0.000 description 3
- 231100000673 dose–response relationship Toxicity 0.000 description 3
- 238000011156 evaluation Methods 0.000 description 3
- 239000013604 expression vector Substances 0.000 description 3
- 108020004445 glyceraldehyde-3-phosphate dehydrogenase Proteins 0.000 description 3
- 108010067216 glycyl-glycyl-glycine Proteins 0.000 description 3
- 108010010147 glycylglutamine Proteins 0.000 description 3
- 108010077515 glycylproline Proteins 0.000 description 3
- 108010087823 glycyltyrosine Proteins 0.000 description 3
- 108010040030 histidinoalanine Proteins 0.000 description 3
- 108010085325 histidylproline Proteins 0.000 description 3
- 239000003112 inhibitor Substances 0.000 description 3
- 108010078274 isoleucylvaline Proteins 0.000 description 3
- 231100000225 lethality Toxicity 0.000 description 3
- 108010044311 leucyl-glycyl-glycine Proteins 0.000 description 3
- 108010073472 leucyl-prolyl-proline Proteins 0.000 description 3
- 108010012058 leucyltyrosine Proteins 0.000 description 3
- 210000004185 liver Anatomy 0.000 description 3
- 108010017391 lysylvaline Proteins 0.000 description 3
- 239000003550 marker Substances 0.000 description 3
- 108020004999 messenger RNA Proteins 0.000 description 3
- 108010005942 methionylglycine Proteins 0.000 description 3
- 208000033808 peripheral neuropathy Diseases 0.000 description 3
- 108010070409 phenylalanyl-glycyl-glycine Proteins 0.000 description 3
- 208000031334 polycystic lipomembranous osteodysplasia with sclerosing leukoencephaly Diseases 0.000 description 3
- 108010025826 prolyl-leucyl-arginine Proteins 0.000 description 3
- 108010093296 prolyl-prolyl-alanine Proteins 0.000 description 3
- 108010015796 prolylisoleucine Proteins 0.000 description 3
- 230000002829 reductive effect Effects 0.000 description 3
- 239000000523 sample Substances 0.000 description 3
- 108010048818 seryl-histidine Proteins 0.000 description 3
- 239000002924 silencing RNA Substances 0.000 description 3
- 210000000952 spleen Anatomy 0.000 description 3
- 238000001262 western blot Methods 0.000 description 3
- GJLXVWOMRRWCIB-MERZOTPQSA-N (2S)-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-acetamido-5-(diaminomethylideneamino)pentanoyl]amino]-3-(4-hydroxyphenyl)propanoyl]amino]-3-(4-hydroxyphenyl)propanoyl]amino]-5-(diaminomethylideneamino)pentanoyl]amino]-3-(1H-indol-3-yl)propanoyl]amino]-6-aminohexanoyl]amino]-6-aminohexanoyl]amino]-6-aminohexanoyl]amino]-6-aminohexanoyl]amino]-6-aminohexanoyl]amino]-6-aminohexanoyl]amino]-6-aminohexanamide Chemical compound C([C@H](NC(=O)[C@H](CCCN=C(N)N)NC(=O)C)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(N)=O)C1=CC=C(O)C=C1 GJLXVWOMRRWCIB-MERZOTPQSA-N 0.000 description 2
- AXFMEGAFCUULFV-BLFANLJRSA-N (2s)-2-[[(2s)-1-[(2s,3r)-2-amino-3-methylpentanoyl]pyrrolidine-2-carbonyl]amino]pentanedioic acid Chemical compound CC[C@@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O AXFMEGAFCUULFV-BLFANLJRSA-N 0.000 description 2
- HXUVTXPOZRFMOY-NSHDSACASA-N 2-[[(2s)-2-[[2-[(2-aminoacetyl)amino]acetyl]amino]-3-phenylpropanoyl]amino]acetic acid Chemical compound NCC(=O)NCC(=O)N[C@H](C(=O)NCC(O)=O)CC1=CC=CC=C1 HXUVTXPOZRFMOY-NSHDSACASA-N 0.000 description 2
- HVCOBJNICQPDBP-UHFFFAOYSA-N 3-[3-[3,5-dihydroxy-6-methyl-4-(3,4,5-trihydroxy-6-methyloxan-2-yl)oxyoxan-2-yl]oxydecanoyloxy]decanoic acid;hydrate Chemical compound O.OC1C(OC(CC(=O)OC(CCCCCCC)CC(O)=O)CCCCCCC)OC(C)C(O)C1OC1C(O)C(O)C(O)C(C)O1 HVCOBJNICQPDBP-UHFFFAOYSA-N 0.000 description 2
- BUANFPRKJKJSRR-ACZMJKKPSA-N Ala-Ala-Gln Chemical compound C[C@H]([NH3+])C(=O)N[C@@H](C)C(=O)N[C@H](C([O-])=O)CCC(N)=O BUANFPRKJKJSRR-ACZMJKKPSA-N 0.000 description 2
- RLMISHABBKUNFO-WHFBIAKZSA-N Ala-Ala-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O RLMISHABBKUNFO-WHFBIAKZSA-N 0.000 description 2
- YYSWCHMLFJLLBJ-ZLUOBGJFSA-N Ala-Ala-Ser Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O YYSWCHMLFJLLBJ-ZLUOBGJFSA-N 0.000 description 2
- NHCPCLJZRSIDHS-ZLUOBGJFSA-N Ala-Asp-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O NHCPCLJZRSIDHS-ZLUOBGJFSA-N 0.000 description 2
- GSCLWXDNIMNIJE-ZLUOBGJFSA-N Ala-Asp-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O GSCLWXDNIMNIJE-ZLUOBGJFSA-N 0.000 description 2
- RCQRKPUXJAGEEC-ZLUOBGJFSA-N Ala-Cys-Cys Chemical compound C[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@@H](CS)C(O)=O RCQRKPUXJAGEEC-ZLUOBGJFSA-N 0.000 description 2
- HMRWQTHUDVXMGH-GUBZILKMSA-N Ala-Glu-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN HMRWQTHUDVXMGH-GUBZILKMSA-N 0.000 description 2
- VGPWRRFOPXVGOH-BYPYZUCNSA-N Ala-Gly-Gly Chemical compound C[C@H](N)C(=O)NCC(=O)NCC(O)=O VGPWRRFOPXVGOH-BYPYZUCNSA-N 0.000 description 2
- QHASENCZLDHBGX-ONGXEEELSA-N Ala-Gly-Phe Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 QHASENCZLDHBGX-ONGXEEELSA-N 0.000 description 2
- LTSBJNNXPBBNDT-HGNGGELXSA-N Ala-His-Gln Chemical compound N[C@@H](C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(N)=O)C(=O)O LTSBJNNXPBBNDT-HGNGGELXSA-N 0.000 description 2
- YHKANGMVQWRMAP-DCAQKATOSA-N Ala-Leu-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N YHKANGMVQWRMAP-DCAQKATOSA-N 0.000 description 2
- HHRAXZAYZFFRAM-CIUDSAMLSA-N Ala-Leu-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O HHRAXZAYZFFRAM-CIUDSAMLSA-N 0.000 description 2
- SUMYEVXWCAYLLJ-GUBZILKMSA-N Ala-Leu-Gln Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O SUMYEVXWCAYLLJ-GUBZILKMSA-N 0.000 description 2
- MNZHHDPWDWQJCQ-YUMQZZPRSA-N Ala-Leu-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O MNZHHDPWDWQJCQ-YUMQZZPRSA-N 0.000 description 2
- WUHJHHGYVVJMQE-BJDJZHNGSA-N Ala-Leu-Ile Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WUHJHHGYVVJMQE-BJDJZHNGSA-N 0.000 description 2
- MEFILNJXAVSUTO-JXUBOQSCSA-N Ala-Leu-Thr Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MEFILNJXAVSUTO-JXUBOQSCSA-N 0.000 description 2
- MDNAVFBZPROEHO-UHFFFAOYSA-N Ala-Lys-Val Natural products CC(C)C(C(O)=O)NC(=O)C(NC(=O)C(C)N)CCCCN MDNAVFBZPROEHO-UHFFFAOYSA-N 0.000 description 2
- OMDNCNKNEGFOMM-BQBZGAKWSA-N Ala-Met-Gly Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCSC)C(=O)NCC(O)=O OMDNCNKNEGFOMM-BQBZGAKWSA-N 0.000 description 2
- PEIBBAXIKUAYGN-UBHSHLNASA-N Ala-Phe-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=CC=C1 PEIBBAXIKUAYGN-UBHSHLNASA-N 0.000 description 2
- BTRULDJUUVGRNE-DCAQKATOSA-N Ala-Pro-Lys Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(O)=O BTRULDJUUVGRNE-DCAQKATOSA-N 0.000 description 2
- BHTBAVZSZCQZPT-GUBZILKMSA-N Ala-Pro-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](C)N BHTBAVZSZCQZPT-GUBZILKMSA-N 0.000 description 2
- KLALXKYLOMZDQT-ZLUOBGJFSA-N Ala-Ser-Asn Chemical compound C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC(N)=O KLALXKYLOMZDQT-ZLUOBGJFSA-N 0.000 description 2
- RMAWDDRDTRSZIR-ZLUOBGJFSA-N Ala-Ser-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O RMAWDDRDTRSZIR-ZLUOBGJFSA-N 0.000 description 2
- MMLHRUJLOUSRJX-CIUDSAMLSA-N Ala-Ser-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCCN MMLHRUJLOUSRJX-CIUDSAMLSA-N 0.000 description 2
- LSMDIAAALJJLRO-XQXXSGGOSA-N Ala-Thr-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O LSMDIAAALJJLRO-XQXXSGGOSA-N 0.000 description 2
- QOIGKCBMXUCDQU-KDXUFGMBSA-N Ala-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](C)N)O QOIGKCBMXUCDQU-KDXUFGMBSA-N 0.000 description 2
- BOKLLPVAQDSLHC-FXQIFTODSA-N Ala-Val-Cys Chemical compound C[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CS)C(=O)O)N BOKLLPVAQDSLHC-FXQIFTODSA-N 0.000 description 2
- REWSWYIDQIELBE-FXQIFTODSA-N Ala-Val-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O REWSWYIDQIELBE-FXQIFTODSA-N 0.000 description 2
- 108700028369 Alleles Proteins 0.000 description 2
- VXXHDZKEQNGXNU-QXEWZRGKSA-N Arg-Asp-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CCCN=C(N)N VXXHDZKEQNGXNU-QXEWZRGKSA-N 0.000 description 2
- MZRBYBIQTIKERR-GUBZILKMSA-N Arg-Glu-Gln Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O MZRBYBIQTIKERR-GUBZILKMSA-N 0.000 description 2
- JQFJNGVSGOUQDH-XIRDDKMYSA-N Arg-Glu-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CCC(O)=O)NC(=O)[C@H](CCCN=C(N)N)N)C(O)=O)=CNC2=C1 JQFJNGVSGOUQDH-XIRDDKMYSA-N 0.000 description 2
- WVNFNPGXYADPPO-BQBZGAKWSA-N Arg-Gly-Ser Chemical compound NC(N)=NCCC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O WVNFNPGXYADPPO-BQBZGAKWSA-N 0.000 description 2
- LLUGJARLJCGLAR-CYDGBPFRSA-N Arg-Ile-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N LLUGJARLJCGLAR-CYDGBPFRSA-N 0.000 description 2
- YBZMTKUDWXZLIX-UWVGGRQHSA-N Arg-Leu-Gly Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O YBZMTKUDWXZLIX-UWVGGRQHSA-N 0.000 description 2
- GSUFZRURORXYTM-STQMWFEESA-N Arg-Phe-Gly Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CC=CC=C1 GSUFZRURORXYTM-STQMWFEESA-N 0.000 description 2
- FVBZXNSRIDVYJS-AVGNSLFASA-N Arg-Pro-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CCCN=C(N)N FVBZXNSRIDVYJS-AVGNSLFASA-N 0.000 description 2
- VUGWHBXPMAHEGZ-SRVKXCTJSA-N Arg-Pro-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CCCN=C(N)N VUGWHBXPMAHEGZ-SRVKXCTJSA-N 0.000 description 2
- FRBAHXABMQXSJQ-FXQIFTODSA-N Arg-Ser-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O FRBAHXABMQXSJQ-FXQIFTODSA-N 0.000 description 2
- LYJXHXGPWDTLKW-HJGDQZAQSA-N Arg-Thr-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N)O LYJXHXGPWDTLKW-HJGDQZAQSA-N 0.000 description 2
- ACRYGQFHAQHDSF-ZLUOBGJFSA-N Asn-Asn-Asn Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O ACRYGQFHAQHDSF-ZLUOBGJFSA-N 0.000 description 2
- AYZAWXAPBAYCHO-CIUDSAMLSA-N Asn-Asn-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC(=O)N)N AYZAWXAPBAYCHO-CIUDSAMLSA-N 0.000 description 2
- XSGBIBGAMKTHMY-WHFBIAKZSA-N Asn-Asp-Gly Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O XSGBIBGAMKTHMY-WHFBIAKZSA-N 0.000 description 2
- XWFPGQVLOVGSLU-CIUDSAMLSA-N Asn-Gln-Arg Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N XWFPGQVLOVGSLU-CIUDSAMLSA-N 0.000 description 2
- GNKVBRYFXYWXAB-WDSKDSINSA-N Asn-Glu-Gly Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O GNKVBRYFXYWXAB-WDSKDSINSA-N 0.000 description 2
- BKDDABUWNKGZCK-XHNCKOQMSA-N Asn-Glu-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC(=O)N)N)C(=O)O BKDDABUWNKGZCK-XHNCKOQMSA-N 0.000 description 2
- RAQMSGVCGSJKCL-FOHZUACHSA-N Asn-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC(N)=O RAQMSGVCGSJKCL-FOHZUACHSA-N 0.000 description 2
- WIDVAWAQBRAKTI-YUMQZZPRSA-N Asn-Leu-Gly Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O WIDVAWAQBRAKTI-YUMQZZPRSA-N 0.000 description 2
- NUCUBYIUPVYGPP-XIRDDKMYSA-N Asn-Leu-Trp Chemical compound CC(C)C[C@H](NC(=O)[C@@H](N)CC(N)=O)C(=O)N[C@@H](Cc1c[nH]c2ccccc12)C(O)=O NUCUBYIUPVYGPP-XIRDDKMYSA-N 0.000 description 2
- NCFJQJRLQJEECD-NHCYSSNCSA-N Asn-Leu-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O NCFJQJRLQJEECD-NHCYSSNCSA-N 0.000 description 2
- HNXWVVHIGTZTBO-LKXGYXEUSA-N Asn-Ser-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC(N)=O HNXWVVHIGTZTBO-LKXGYXEUSA-N 0.000 description 2
- JBDLMLZNDRLDIX-HJGDQZAQSA-N Asn-Thr-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O JBDLMLZNDRLDIX-HJGDQZAQSA-N 0.000 description 2
- DAYDURRBMDCCFL-AAEUAGOBSA-N Asn-Trp-Gly Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CC(=O)N)N DAYDURRBMDCCFL-AAEUAGOBSA-N 0.000 description 2
- JZLFYAAGGYMRIK-BYULHYEWSA-N Asn-Val-Asp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O JZLFYAAGGYMRIK-BYULHYEWSA-N 0.000 description 2
- XEDQMTWEYFBOIK-ACZMJKKPSA-N Asp-Ala-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O XEDQMTWEYFBOIK-ACZMJKKPSA-N 0.000 description 2
- IXIWEFWRKIUMQX-DCAQKATOSA-N Asp-Arg-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@@H](N)CC(O)=O IXIWEFWRKIUMQX-DCAQKATOSA-N 0.000 description 2
- XYBJLTKSGFBLCS-QXEWZRGKSA-N Asp-Arg-Val Chemical compound NC(N)=NCCC[C@@H](C(=O)N[C@@H](C(C)C)C(O)=O)NC(=O)[C@@H](N)CC(O)=O XYBJLTKSGFBLCS-QXEWZRGKSA-N 0.000 description 2
- UQBGYPFHWFZMCD-ZLUOBGJFSA-N Asp-Asn-Asn Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O UQBGYPFHWFZMCD-ZLUOBGJFSA-N 0.000 description 2
- UGKZHCBLMLSANF-CIUDSAMLSA-N Asp-Asn-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O UGKZHCBLMLSANF-CIUDSAMLSA-N 0.000 description 2
- UGIBTKGQVWFTGX-BIIVOSGPSA-N Asp-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC(=O)O)N)C(=O)O UGIBTKGQVWFTGX-BIIVOSGPSA-N 0.000 description 2
- KIJLEFNHWSXHRU-NUMRIWBASA-N Asp-Gln-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KIJLEFNHWSXHRU-NUMRIWBASA-N 0.000 description 2
- UFAQGGZUXVLONR-AVGNSLFASA-N Asp-Gln-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC(=O)O)N)O UFAQGGZUXVLONR-AVGNSLFASA-N 0.000 description 2
- OGTCOKZFOJIZFG-CIUDSAMLSA-N Asp-His-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(O)=O)C(O)=O OGTCOKZFOJIZFG-CIUDSAMLSA-N 0.000 description 2
- UJGRZQYSNYTCAX-SRVKXCTJSA-N Asp-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC(O)=O UJGRZQYSNYTCAX-SRVKXCTJSA-N 0.000 description 2
- IVPNEDNYYYFAGI-GARJFASQSA-N Asp-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)O)N IVPNEDNYYYFAGI-GARJFASQSA-N 0.000 description 2
- IDDMGSKZQDEDGA-SRVKXCTJSA-N Asp-Phe-Asn Chemical compound OC(=O)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(N)=O)C(O)=O)CC1=CC=CC=C1 IDDMGSKZQDEDGA-SRVKXCTJSA-N 0.000 description 2
- HICVMZCGVFKTPM-BQBZGAKWSA-N Asp-Pro-Gly Chemical compound OC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O HICVMZCGVFKTPM-BQBZGAKWSA-N 0.000 description 2
- JSNWZMFSLIWAHS-HJGDQZAQSA-N Asp-Thr-Leu Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)O)NC(=O)[C@H](CC(=O)O)N)O JSNWZMFSLIWAHS-HJGDQZAQSA-N 0.000 description 2
- JDDYEZGPYBBPBN-JRQIVUDYSA-N Asp-Thr-Tyr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O JDDYEZGPYBBPBN-JRQIVUDYSA-N 0.000 description 2
- XQFLFQWOBXPMHW-NHCYSSNCSA-N Asp-Val-His Chemical compound N[C@@H](CC(=O)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)O XQFLFQWOBXPMHW-NHCYSSNCSA-N 0.000 description 2
- ZUNMTUPRQMWMHX-LSJOCFKGSA-N Asp-Val-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C(C)C)C(O)=O ZUNMTUPRQMWMHX-LSJOCFKGSA-N 0.000 description 2
- 241001203868 Autographa californica Species 0.000 description 2
- 108091003079 Bovine Serum Albumin Proteins 0.000 description 2
- 208000028698 Cognitive impairment Diseases 0.000 description 2
- QADHATDBZXHRCA-ACZMJKKPSA-N Cys-Gln-Asn Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CS)N QADHATDBZXHRCA-ACZMJKKPSA-N 0.000 description 2
- UUOYKFNULIOCGJ-GUBZILKMSA-N Cys-Glu-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CS)N UUOYKFNULIOCGJ-GUBZILKMSA-N 0.000 description 2
- UCSXXFRXHGUXCQ-SRVKXCTJSA-N Cys-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CS)N UCSXXFRXHGUXCQ-SRVKXCTJSA-N 0.000 description 2
- KJJASVYBTKRYSN-FXQIFTODSA-N Cys-Pro-Asp Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CS)N)C(=O)N[C@@H](CC(=O)O)C(=O)O KJJASVYBTKRYSN-FXQIFTODSA-N 0.000 description 2
- TXCCRYAZQBUCOV-CIUDSAMLSA-N Cys-Pro-Gln Chemical compound [H]N[C@@H](CS)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O TXCCRYAZQBUCOV-CIUDSAMLSA-N 0.000 description 2
- TXGDWPBLUFQODU-XGEHTFHBSA-N Cys-Pro-Thr Chemical compound [H]N[C@@H](CS)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O TXGDWPBLUFQODU-XGEHTFHBSA-N 0.000 description 2
- 102000053602 DNA Human genes 0.000 description 2
- NUMFTVCBONFQIQ-DRZSPHRISA-N Gln-Ala-Phe Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O NUMFTVCBONFQIQ-DRZSPHRISA-N 0.000 description 2
- PRBLYKYHAJEABA-SRVKXCTJSA-N Gln-Arg-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O PRBLYKYHAJEABA-SRVKXCTJSA-N 0.000 description 2
- LJEPDHWNQXPXMM-NHCYSSNCSA-N Gln-Arg-Val Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(O)=O LJEPDHWNQXPXMM-NHCYSSNCSA-N 0.000 description 2
- ULXXDWZMMSQBDC-ACZMJKKPSA-N Gln-Asp-Asp Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N ULXXDWZMMSQBDC-ACZMJKKPSA-N 0.000 description 2
- KYFSMWLWHYZRNW-ACZMJKKPSA-N Gln-Asp-Cys Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CS)C(=O)O)N KYFSMWLWHYZRNW-ACZMJKKPSA-N 0.000 description 2
- WQWMZOIPXWSZNE-WDSKDSINSA-N Gln-Asp-Gly Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O WQWMZOIPXWSZNE-WDSKDSINSA-N 0.000 description 2
- NKCZYEDZTKOFBG-GUBZILKMSA-N Gln-Gln-Arg Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O NKCZYEDZTKOFBG-GUBZILKMSA-N 0.000 description 2
- KVXVVDFOZNYYKZ-DCAQKATOSA-N Gln-Gln-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O KVXVVDFOZNYYKZ-DCAQKATOSA-N 0.000 description 2
- IVCOYUURLWQDJQ-LPEHRKFASA-N Gln-Gln-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCC(=O)N)N)C(=O)O IVCOYUURLWQDJQ-LPEHRKFASA-N 0.000 description 2
- VOLVNCMGXWDDQY-LPEHRKFASA-N Gln-Glu-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCC(=O)N)N)C(=O)O VOLVNCMGXWDDQY-LPEHRKFASA-N 0.000 description 2
- NSORZJXKUQFEKL-JGVFFNPUSA-N Gln-Gly-Pro Chemical compound C1C[C@@H](N(C1)C(=O)CNC(=O)[C@H](CCC(=O)N)N)C(=O)O NSORZJXKUQFEKL-JGVFFNPUSA-N 0.000 description 2
- DRNMNLKUUKKPIA-HTUGSXCWSA-N Gln-Phe-Thr Chemical compound C[C@@H](O)[C@H](NC(=O)[C@H](Cc1ccccc1)NC(=O)[C@@H](N)CCC(N)=O)C(O)=O DRNMNLKUUKKPIA-HTUGSXCWSA-N 0.000 description 2
- XQDGOJPVMSWZSO-SRVKXCTJSA-N Gln-Pro-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CCC(=O)N)N XQDGOJPVMSWZSO-SRVKXCTJSA-N 0.000 description 2
- MFORDNZDKAVNSR-SRVKXCTJSA-N Gln-Pro-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CCC(N)=O MFORDNZDKAVNSR-SRVKXCTJSA-N 0.000 description 2
- SYZZMPFLOLSMHL-XHNCKOQMSA-N Gln-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CCC(=O)N)N)C(=O)O SYZZMPFLOLSMHL-XHNCKOQMSA-N 0.000 description 2
- JKDBRTNMYXYLHO-JYJNAYRXSA-N Gln-Tyr-Leu Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C(O)=O)CC1=CC=C(O)C=C1 JKDBRTNMYXYLHO-JYJNAYRXSA-N 0.000 description 2
- UBRQJXFDVZNYJP-AVGNSLFASA-N Gln-Tyr-Ser Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CCC(=O)N)N)O UBRQJXFDVZNYJP-AVGNSLFASA-N 0.000 description 2
- VEYGCDYMOXHJLS-GVXVVHGQSA-N Gln-Val-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O VEYGCDYMOXHJLS-GVXVVHGQSA-N 0.000 description 2
- HUWSBFYAGXCXKC-CIUDSAMLSA-N Glu-Ala-Met Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCSC)C(O)=O HUWSBFYAGXCXKC-CIUDSAMLSA-N 0.000 description 2
- SBCYJMOOHUDWDA-NUMRIWBASA-N Glu-Asp-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SBCYJMOOHUDWDA-NUMRIWBASA-N 0.000 description 2
- OWVURWCRZZMAOZ-XHNCKOQMSA-N Glu-Cys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CS)NC(=O)[C@H](CCC(=O)O)N)C(=O)O OWVURWCRZZMAOZ-XHNCKOQMSA-N 0.000 description 2
- UMIRPYLZFKOEOH-YVNDNENWSA-N Glu-Gln-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O UMIRPYLZFKOEOH-YVNDNENWSA-N 0.000 description 2
- WPLGNDORMXTMQS-FXQIFTODSA-N Glu-Gln-Ser Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O WPLGNDORMXTMQS-FXQIFTODSA-N 0.000 description 2
- NUSWUSKZRCGFEX-FXQIFTODSA-N Glu-Glu-Cys Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CS)C(O)=O NUSWUSKZRCGFEX-FXQIFTODSA-N 0.000 description 2
- AUTNXSQEVVHSJK-YVNDNENWSA-N Glu-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCC(O)=O AUTNXSQEVVHSJK-YVNDNENWSA-N 0.000 description 2
- PHONAZGUEGIOEM-GLLZPBPUSA-N Glu-Glu-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PHONAZGUEGIOEM-GLLZPBPUSA-N 0.000 description 2
- HRBYTAIBKPNZKQ-AVGNSLFASA-N Glu-Lys-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCC(O)=O HRBYTAIBKPNZKQ-AVGNSLFASA-N 0.000 description 2
- ITVBKCZZLJUUHI-HTUGSXCWSA-N Glu-Phe-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ITVBKCZZLJUUHI-HTUGSXCWSA-N 0.000 description 2
- BIYNPVYAZOUVFQ-CIUDSAMLSA-N Glu-Pro-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O BIYNPVYAZOUVFQ-CIUDSAMLSA-N 0.000 description 2
- GPSHCSTUYOQPAI-JHEQGTHGSA-N Glu-Thr-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O GPSHCSTUYOQPAI-JHEQGTHGSA-N 0.000 description 2
- PMSDOVISAARGAV-FHWLQOOXSA-N Glu-Tyr-Phe Chemical compound C([C@H](NC(=O)[C@H](CCC(O)=O)N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=C(O)C=C1 PMSDOVISAARGAV-FHWLQOOXSA-N 0.000 description 2
- QXUPRMQJDWJDFR-NRPADANISA-N Glu-Val-Ser Chemical compound CC(C)[C@H](NC(=O)[C@@H](N)CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O QXUPRMQJDWJDFR-NRPADANISA-N 0.000 description 2
- WGYHAAXZWPEBDQ-IFFSRLJSSA-N Glu-Val-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O WGYHAAXZWPEBDQ-IFFSRLJSSA-N 0.000 description 2
- SOYWRINXUSUWEQ-DLOVCJGASA-N Glu-Val-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCC(O)=O SOYWRINXUSUWEQ-DLOVCJGASA-N 0.000 description 2
- 108010017544 Glucosylceramidase Proteins 0.000 description 2
- OGCIHJPYKVSMTE-YUMQZZPRSA-N Gly-Arg-Glu Chemical compound [H]NCC(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O OGCIHJPYKVSMTE-YUMQZZPRSA-N 0.000 description 2
- RJIVPOXLQFJRTG-LURJTMIESA-N Gly-Arg-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)CN)CCCN=C(N)N RJIVPOXLQFJRTG-LURJTMIESA-N 0.000 description 2
- OCQUNKSFDYDXBG-QXEWZRGKSA-N Gly-Arg-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCN=C(N)N OCQUNKSFDYDXBG-QXEWZRGKSA-N 0.000 description 2
- CIMULJZTTOBOPN-WHFBIAKZSA-N Gly-Asn-Asn Chemical compound NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O CIMULJZTTOBOPN-WHFBIAKZSA-N 0.000 description 2
- JVWPPCWUDRJGAE-YUMQZZPRSA-N Gly-Asn-Leu Chemical compound [H]NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O JVWPPCWUDRJGAE-YUMQZZPRSA-N 0.000 description 2
- XQHSBNVACKQWAV-WHFBIAKZSA-N Gly-Asp-Asn Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O XQHSBNVACKQWAV-WHFBIAKZSA-N 0.000 description 2
- XBWMTPAIUQIWKA-BYULHYEWSA-N Gly-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)CN XBWMTPAIUQIWKA-BYULHYEWSA-N 0.000 description 2
- FZQLXNIMCPJVJE-YUMQZZPRSA-N Gly-Asp-Leu Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O FZQLXNIMCPJVJE-YUMQZZPRSA-N 0.000 description 2
- ZRZILYKEJBMFHY-BQBZGAKWSA-N Gly-Asp-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)CN ZRZILYKEJBMFHY-BQBZGAKWSA-N 0.000 description 2
- LXXLEUBUOMCAMR-NKWVEPMBSA-N Gly-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)CN)C(=O)O LXXLEUBUOMCAMR-NKWVEPMBSA-N 0.000 description 2
- NMROINAYXCACKF-WHFBIAKZSA-N Gly-Cys-Cys Chemical compound NCC(=O)N[C@@H](CS)C(=O)N[C@@H](CS)C(O)=O NMROINAYXCACKF-WHFBIAKZSA-N 0.000 description 2
- XTQFHTHIAKKCTM-YFKPBYRVSA-N Gly-Glu-Gly Chemical compound NCC(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O XTQFHTHIAKKCTM-YFKPBYRVSA-N 0.000 description 2
- BEQGFMIBZFNROK-JGVFFNPUSA-N Gly-Glu-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)O)NC(=O)CN)C(=O)O BEQGFMIBZFNROK-JGVFFNPUSA-N 0.000 description 2
- XPJBQTCXPJNIFE-ZETCQYMHSA-N Gly-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)CN XPJBQTCXPJNIFE-ZETCQYMHSA-N 0.000 description 2
- YWAQATDNEKZFFK-BYPYZUCNSA-N Gly-Gly-Ser Chemical compound NCC(=O)NCC(=O)N[C@@H](CO)C(O)=O YWAQATDNEKZFFK-BYPYZUCNSA-N 0.000 description 2
- INLIXXRWNUKVCF-JTQLQIEISA-N Gly-Gly-Tyr Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 INLIXXRWNUKVCF-JTQLQIEISA-N 0.000 description 2
- SWQALSGKVLYKDT-UHFFFAOYSA-N Gly-Ile-Ala Natural products NCC(=O)NC(C(C)CC)C(=O)NC(C)C(O)=O SWQALSGKVLYKDT-UHFFFAOYSA-N 0.000 description 2
- YTSVAIMKVLZUDU-YUMQZZPRSA-N Gly-Leu-Asp Chemical compound [H]NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O YTSVAIMKVLZUDU-YUMQZZPRSA-N 0.000 description 2
- UUYBFNKHOCJCHT-VHSXEESVSA-N Gly-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)CN UUYBFNKHOCJCHT-VHSXEESVSA-N 0.000 description 2
- LHYJCVCQPWRMKZ-WEDXCCLWSA-N Gly-Leu-Thr Chemical compound [H]NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LHYJCVCQPWRMKZ-WEDXCCLWSA-N 0.000 description 2
- VBOBNHSVQKKTOT-YUMQZZPRSA-N Gly-Lys-Ala Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O VBOBNHSVQKKTOT-YUMQZZPRSA-N 0.000 description 2
- VEPBEGNDJYANCF-QWRGUYRKSA-N Gly-Lys-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCCN VEPBEGNDJYANCF-QWRGUYRKSA-N 0.000 description 2
- WNZOCXUOGVYYBJ-CDMKHQONSA-N Gly-Phe-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)CN)O WNZOCXUOGVYYBJ-CDMKHQONSA-N 0.000 description 2
- SSFWXSNOKDZNHY-QXEWZRGKSA-N Gly-Pro-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)CN SSFWXSNOKDZNHY-QXEWZRGKSA-N 0.000 description 2
- GAAHQHNCMIAYEX-UWVGGRQHSA-N Gly-Pro-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)CN GAAHQHNCMIAYEX-UWVGGRQHSA-N 0.000 description 2
- IRJWAYCXIYUHQE-WHFBIAKZSA-N Gly-Ser-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CO)NC(=O)CN IRJWAYCXIYUHQE-WHFBIAKZSA-N 0.000 description 2
- LBDXVCBAJJNJNN-WHFBIAKZSA-N Gly-Ser-Cys Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@@H](CS)C(O)=O LBDXVCBAJJNJNN-WHFBIAKZSA-N 0.000 description 2
- WNGHUXFWEWTKAO-YUMQZZPRSA-N Gly-Ser-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)CN WNGHUXFWEWTKAO-YUMQZZPRSA-N 0.000 description 2
- FFJQHWKSGAWSTJ-BFHQHQDPSA-N Gly-Thr-Ala Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O FFJQHWKSGAWSTJ-BFHQHQDPSA-N 0.000 description 2
- NVTPVQLIZCOJFK-FOHZUACHSA-N Gly-Thr-Asp Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(O)=O NVTPVQLIZCOJFK-FOHZUACHSA-N 0.000 description 2
- JQFILXICXLDTRR-FBCQKBJTSA-N Gly-Thr-Gly Chemical compound NCC(=O)N[C@@H]([C@H](O)C)C(=O)NCC(O)=O JQFILXICXLDTRR-FBCQKBJTSA-N 0.000 description 2
- LLWQVJNHMYBLLK-CDMKHQONSA-N Gly-Thr-Phe Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O LLWQVJNHMYBLLK-CDMKHQONSA-N 0.000 description 2
- NWOSHVVPKDQKKT-RYUDHWBXSA-N Gly-Tyr-Gln Chemical compound [H]NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O NWOSHVVPKDQKKT-RYUDHWBXSA-N 0.000 description 2
- RIYIFUFFFBIOEU-KBPBESRZSA-N Gly-Tyr-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CC=C(O)C=C1 RIYIFUFFFBIOEU-KBPBESRZSA-N 0.000 description 2
- PNUFMLXHOLFRLD-KBPBESRZSA-N Gly-Tyr-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CC=C(O)C=C1 PNUFMLXHOLFRLD-KBPBESRZSA-N 0.000 description 2
- MUGLKCQHTUFLGF-WPRPVWTQSA-N Gly-Val-Met Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)CN MUGLKCQHTUFLGF-WPRPVWTQSA-N 0.000 description 2
- BNMRSWQOHIQTFL-JSGCOSHPSA-N Gly-Val-Phe Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 BNMRSWQOHIQTFL-JSGCOSHPSA-N 0.000 description 2
- AFMOTCMSEBITOE-YEPSODPASA-N Gly-Val-Thr Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O AFMOTCMSEBITOE-YEPSODPASA-N 0.000 description 2
- 229930186217 Glycolipid Natural products 0.000 description 2
- BDHUXUFYNUOUIT-SRVKXCTJSA-N His-Asp-Lys Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N BDHUXUFYNUOUIT-SRVKXCTJSA-N 0.000 description 2
- UVUIXIVPKVMONA-CIUDSAMLSA-N His-Cys-Cys Chemical compound SC[C@@H](C(O)=O)NC(=O)[C@H](CS)NC(=O)[C@@H](N)CC1=CN=CN1 UVUIXIVPKVMONA-CIUDSAMLSA-N 0.000 description 2
- HVCRQRQPIIRNLY-IUCAKERBSA-N His-Gln-Gly Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)NCC(=O)O)N HVCRQRQPIIRNLY-IUCAKERBSA-N 0.000 description 2
- HYWZHNUGAYVEEW-KKUMJFAQSA-N His-Phe-Ser Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC2=CN=CN2)N HYWZHNUGAYVEEW-KKUMJFAQSA-N 0.000 description 2
- YEKYGQZUBCRNGH-DCAQKATOSA-N His-Pro-Ser Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CC2=CN=CN2)N)C(=O)N[C@@H](CO)C(=O)O YEKYGQZUBCRNGH-DCAQKATOSA-N 0.000 description 2
- CUEQQFOGARVNHU-VGDYDELISA-N His-Ser-Ile Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O CUEQQFOGARVNHU-VGDYDELISA-N 0.000 description 2
- 101100368517 Homo sapiens SNCA gene Proteins 0.000 description 2
- 102000004157 Hydrolases Human genes 0.000 description 2
- 108090000604 Hydrolases Proteins 0.000 description 2
- LQSBBHNVAVNZSX-GHCJXIJMSA-N Ile-Ala-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](CC(=O)N)C(=O)O)N LQSBBHNVAVNZSX-GHCJXIJMSA-N 0.000 description 2
- UKTUOMWSJPXODT-GUDRVLHUSA-N Ile-Asn-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N1CCC[C@@H]1C(=O)O)N UKTUOMWSJPXODT-GUDRVLHUSA-N 0.000 description 2
- CCHSQWLCOOZREA-GMOBBJLQSA-N Ile-Asp-Met Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCSC)C(=O)O)N CCHSQWLCOOZREA-GMOBBJLQSA-N 0.000 description 2
- AWTDTFXPVCTHAK-BJDJZHNGSA-N Ile-Cys-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CCCCN)C(=O)O)N AWTDTFXPVCTHAK-BJDJZHNGSA-N 0.000 description 2
- GECLQMBTZCPAFY-PEFMBERDSA-N Ile-Gln-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N GECLQMBTZCPAFY-PEFMBERDSA-N 0.000 description 2
- WNQKUUQIVDDAFA-ZPFDUUQYSA-N Ile-Gln-Met Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCSC)C(=O)O)N WNQKUUQIVDDAFA-ZPFDUUQYSA-N 0.000 description 2
- MTFVYKQRLXYAQN-LAEOZQHASA-N Ile-Glu-Gly Chemical compound [H]N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O MTFVYKQRLXYAQN-LAEOZQHASA-N 0.000 description 2
- NZOCIWKZUVUNDW-ZKWXMUAHSA-N Ile-Gly-Ala Chemical compound CC[C@H](C)[C@H](N)C(=O)NCC(=O)N[C@@H](C)C(O)=O NZOCIWKZUVUNDW-ZKWXMUAHSA-N 0.000 description 2
- CDGLBYSAZFIIJO-RCOVLWMOSA-N Ile-Gly-Gly Chemical compound CC[C@H](C)[C@H]([NH3+])C(=O)NCC(=O)NCC([O-])=O CDGLBYSAZFIIJO-RCOVLWMOSA-N 0.000 description 2
- YNMQUIVKEFRCPH-QSFUFRPTSA-N Ile-Ile-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(=O)O)N YNMQUIVKEFRCPH-QSFUFRPTSA-N 0.000 description 2
- KLBVGHCGHUNHEA-BJDJZHNGSA-N Ile-Leu-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(=O)O)N KLBVGHCGHUNHEA-BJDJZHNGSA-N 0.000 description 2
- TWYOYAKMLHWMOJ-ZPFDUUQYSA-N Ile-Leu-Asn Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O TWYOYAKMLHWMOJ-ZPFDUUQYSA-N 0.000 description 2
- GAZGFPOZOLEYAJ-YTFOTSKYSA-N Ile-Leu-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N GAZGFPOZOLEYAJ-YTFOTSKYSA-N 0.000 description 2
- TVYWVSJGSHQWMT-AJNGGQMLSA-N Ile-Leu-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)O)N TVYWVSJGSHQWMT-AJNGGQMLSA-N 0.000 description 2
- DSDPLOODKXISDT-XUXIUFHCSA-N Ile-Leu-Val Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O DSDPLOODKXISDT-XUXIUFHCSA-N 0.000 description 2
- XLXPYSDGMXTTNQ-DKIMLUQUSA-N Ile-Phe-Leu Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](Cc1ccccc1)C(=O)N[C@@H](CC(C)C)C(O)=O XLXPYSDGMXTTNQ-DKIMLUQUSA-N 0.000 description 2
- XLXPYSDGMXTTNQ-UHFFFAOYSA-N Ile-Phe-Leu Natural products CCC(C)C(N)C(=O)NC(C(=O)NC(CC(C)C)C(O)=O)CC1=CC=CC=C1 XLXPYSDGMXTTNQ-UHFFFAOYSA-N 0.000 description 2
- MLSUZXHSNRBDCI-CYDGBPFRSA-N Ile-Pro-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)O)N MLSUZXHSNRBDCI-CYDGBPFRSA-N 0.000 description 2
- YKZAMJXNJUWFIK-JBDRJPRFSA-N Ile-Ser-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(=O)O)N YKZAMJXNJUWFIK-JBDRJPRFSA-N 0.000 description 2
- JZNVOBUNTWNZPW-GHCJXIJMSA-N Ile-Ser-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(=O)O)C(=O)O)N JZNVOBUNTWNZPW-GHCJXIJMSA-N 0.000 description 2
- QGXQHJQPAPMACW-PPCPHDFISA-N Ile-Thr-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)O)N QGXQHJQPAPMACW-PPCPHDFISA-N 0.000 description 2
- BCISUQVFDGYZBO-QSFUFRPTSA-N Ile-Val-Asp Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC(O)=O BCISUQVFDGYZBO-QSFUFRPTSA-N 0.000 description 2
- 108091092195 Intron Proteins 0.000 description 2
- PWWVAXIEGOYWEE-UHFFFAOYSA-N Isophenergan Chemical compound C1=CC=C2N(CC(C)N(C)C)C3=CC=CC=C3SC2=C1 PWWVAXIEGOYWEE-UHFFFAOYSA-N 0.000 description 2
- HGCNKOLVKRAVHD-UHFFFAOYSA-N L-Met-L-Phe Natural products CSCCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 HGCNKOLVKRAVHD-UHFFFAOYSA-N 0.000 description 2
- SITWEMZOJNKJCH-UHFFFAOYSA-N L-alanine-L-arginine Natural products CC(N)C(=O)NC(C(O)=O)CCCNC(N)=N SITWEMZOJNKJCH-UHFFFAOYSA-N 0.000 description 2
- CZCSUZMIRKFFFA-CIUDSAMLSA-N Leu-Ala-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(O)=O CZCSUZMIRKFFFA-CIUDSAMLSA-N 0.000 description 2
- MJOZZTKJZQFKDK-GUBZILKMSA-N Leu-Ala-Gln Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCC(N)=O MJOZZTKJZQFKDK-GUBZILKMSA-N 0.000 description 2
- WSGXUIQTEZDVHJ-GARJFASQSA-N Leu-Ala-Pro Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N1CCC[C@@H]1C(O)=O WSGXUIQTEZDVHJ-GARJFASQSA-N 0.000 description 2
- HBJZFCIVFIBNSV-DCAQKATOSA-N Leu-Arg-Asn Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC(N)=O)C(O)=O HBJZFCIVFIBNSV-DCAQKATOSA-N 0.000 description 2
- HASRFYOMVPJRPU-SRVKXCTJSA-N Leu-Arg-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCC(O)=O)C(O)=O HASRFYOMVPJRPU-SRVKXCTJSA-N 0.000 description 2
- CUXRXAIAVYLVFD-ULQDDVLXSA-N Leu-Arg-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 CUXRXAIAVYLVFD-ULQDDVLXSA-N 0.000 description 2
- IGUOAYLTQJLPPD-DCAQKATOSA-N Leu-Asn-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N IGUOAYLTQJLPPD-DCAQKATOSA-N 0.000 description 2
- RFUBXQQFJFGJFV-GUBZILKMSA-N Leu-Asn-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O RFUBXQQFJFGJFV-GUBZILKMSA-N 0.000 description 2
- WGNOPSQMIQERPK-GARJFASQSA-N Leu-Asn-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N1CCC[C@@H]1C(=O)O)N WGNOPSQMIQERPK-GARJFASQSA-N 0.000 description 2
- TWQIYNGNYNJUFM-NHCYSSNCSA-N Leu-Asn-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O TWQIYNGNYNJUFM-NHCYSSNCSA-N 0.000 description 2
- ZURHXHNAEJJRNU-CIUDSAMLSA-N Leu-Asp-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O ZURHXHNAEJJRNU-CIUDSAMLSA-N 0.000 description 2
- ZDSNOSQHMJBRQN-SRVKXCTJSA-N Leu-Asp-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N ZDSNOSQHMJBRQN-SRVKXCTJSA-N 0.000 description 2
- YSKSXVKQLLBVEX-SZMVWBNQSA-N Leu-Gln-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CC(C)C)C(O)=O)=CNC2=C1 YSKSXVKQLLBVEX-SZMVWBNQSA-N 0.000 description 2
- HQUXQAMSWFIRET-AVGNSLFASA-N Leu-Glu-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN HQUXQAMSWFIRET-AVGNSLFASA-N 0.000 description 2
- AVEGDIAXTDVBJS-XUXIUFHCSA-N Leu-Ile-Arg Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O AVEGDIAXTDVBJS-XUXIUFHCSA-N 0.000 description 2
- USLNHQZCDQJBOV-ZPFDUUQYSA-N Leu-Ile-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O USLNHQZCDQJBOV-ZPFDUUQYSA-N 0.000 description 2
- HGFGEMSVBMCFKK-MNXVOIDGSA-N Leu-Ile-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(O)=O HGFGEMSVBMCFKK-MNXVOIDGSA-N 0.000 description 2
- SEMUSFOBZGKBGW-YTFOTSKYSA-N Leu-Ile-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O SEMUSFOBZGKBGW-YTFOTSKYSA-N 0.000 description 2
- QNBVTHNJGCOVFA-AVGNSLFASA-N Leu-Leu-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCC(O)=O QNBVTHNJGCOVFA-AVGNSLFASA-N 0.000 description 2
- YOKVEHGYYQEQOP-QWRGUYRKSA-N Leu-Leu-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O YOKVEHGYYQEQOP-QWRGUYRKSA-N 0.000 description 2
- LXKNSJLSGPNHSK-KKUMJFAQSA-N Leu-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)O)N LXKNSJLSGPNHSK-KKUMJFAQSA-N 0.000 description 2
- RZXLZBIUTDQHJQ-SRVKXCTJSA-N Leu-Lys-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O RZXLZBIUTDQHJQ-SRVKXCTJSA-N 0.000 description 2
- FKQPWMZLIIATBA-AJNGGQMLSA-N Leu-Lys-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O FKQPWMZLIIATBA-AJNGGQMLSA-N 0.000 description 2
- ZDBMWELMUCLUPL-QEJZJMRPSA-N Leu-Phe-Ala Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C)C(O)=O)CC1=CC=CC=C1 ZDBMWELMUCLUPL-QEJZJMRPSA-N 0.000 description 2
- DRWMRVFCKKXHCH-BZSNNMDCSA-N Leu-Phe-Leu Chemical compound CC(C)C[C@H]([NH3+])C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C([O-])=O)CC1=CC=CC=C1 DRWMRVFCKKXHCH-BZSNNMDCSA-N 0.000 description 2
- UHNQRAFSEBGZFZ-YESZJQIVSA-N Leu-Phe-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N2CCC[C@@H]2C(=O)O)N UHNQRAFSEBGZFZ-YESZJQIVSA-N 0.000 description 2
- RRVCZCNFXIFGRA-DCAQKATOSA-N Leu-Pro-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O RRVCZCNFXIFGRA-DCAQKATOSA-N 0.000 description 2
- UCBPDSYUVAAHCD-UWVGGRQHSA-N Leu-Pro-Gly Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O UCBPDSYUVAAHCD-UWVGGRQHSA-N 0.000 description 2
- YRRCOJOXAJNSAX-IHRRRGAJSA-N Leu-Pro-Lys Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)O)N YRRCOJOXAJNSAX-IHRRRGAJSA-N 0.000 description 2
- PWPBLZXWFXJFHE-RHYQMDGZSA-N Leu-Pro-Thr Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O PWPBLZXWFXJFHE-RHYQMDGZSA-N 0.000 description 2
- JDBQSGMJBMPNFT-AVGNSLFASA-N Leu-Pro-Val Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O JDBQSGMJBMPNFT-AVGNSLFASA-N 0.000 description 2
- IZPVWNSAVUQBGP-CIUDSAMLSA-N Leu-Ser-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O IZPVWNSAVUQBGP-CIUDSAMLSA-N 0.000 description 2
- MVHXGBZUJLWZOH-BJDJZHNGSA-N Leu-Ser-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O MVHXGBZUJLWZOH-BJDJZHNGSA-N 0.000 description 2
- ZJZNLRVCZWUONM-JXUBOQSCSA-N Leu-Thr-Ala Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O ZJZNLRVCZWUONM-JXUBOQSCSA-N 0.000 description 2
- VDIARPPNADFEAV-WEDXCCLWSA-N Leu-Thr-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O VDIARPPNADFEAV-WEDXCCLWSA-N 0.000 description 2
- LJBVRCDPWOJOEK-PPCPHDFISA-N Leu-Thr-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O LJBVRCDPWOJOEK-PPCPHDFISA-N 0.000 description 2
- ILDSIMPXNFWKLH-KATARQTJSA-N Leu-Thr-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O ILDSIMPXNFWKLH-KATARQTJSA-N 0.000 description 2
- AIQWYVFNBNNOLU-RHYQMDGZSA-N Leu-Thr-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O AIQWYVFNBNNOLU-RHYQMDGZSA-N 0.000 description 2
- YWFZWQKWNDOWPA-XIRDDKMYSA-N Leu-Trp-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC(N)=O)C(O)=O YWFZWQKWNDOWPA-XIRDDKMYSA-N 0.000 description 2
- HQBOMRTVKVKFMN-WDSOQIARSA-N Leu-Trp-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](C(C)C)C(O)=O HQBOMRTVKVKFMN-WDSOQIARSA-N 0.000 description 2
- MVJRBCJCRYGCKV-GVXVVHGQSA-N Leu-Val-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O MVJRBCJCRYGCKV-GVXVVHGQSA-N 0.000 description 2
- AAKRWBIIGKPOKQ-ONGXEEELSA-N Leu-Val-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O AAKRWBIIGKPOKQ-ONGXEEELSA-N 0.000 description 2
- FDBTVENULFNTAL-XQQFMLRXSA-N Leu-Val-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N FDBTVENULFNTAL-XQQFMLRXSA-N 0.000 description 2
- VKVDRTGWLVZJOM-DCAQKATOSA-N Leu-Val-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O VKVDRTGWLVZJOM-DCAQKATOSA-N 0.000 description 2
- XFIHDSBIPWEYJJ-YUMQZZPRSA-N Lys-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H](N)CCCCN XFIHDSBIPWEYJJ-YUMQZZPRSA-N 0.000 description 2
- QUCDKEKDPYISNX-HJGDQZAQSA-N Lys-Asn-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O QUCDKEKDPYISNX-HJGDQZAQSA-N 0.000 description 2
- YVMQJGWLHRWMDF-MNXVOIDGSA-N Lys-Gln-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCCCN)N YVMQJGWLHRWMDF-MNXVOIDGSA-N 0.000 description 2
- IZJGPPIGYTVXLB-FQUUOJAGSA-N Lys-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCCN)N IZJGPPIGYTVXLB-FQUUOJAGSA-N 0.000 description 2
- HVAUKHLDSDDROB-KKUMJFAQSA-N Lys-Lys-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O HVAUKHLDSDDROB-KKUMJFAQSA-N 0.000 description 2
- HYSVGEAWTGPMOA-IHRRRGAJSA-N Lys-Pro-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O HYSVGEAWTGPMOA-IHRRRGAJSA-N 0.000 description 2
- YRNRVKTYDSLKMD-KKUMJFAQSA-N Lys-Ser-Tyr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O YRNRVKTYDSLKMD-KKUMJFAQSA-N 0.000 description 2
- JHNOXVASMSXSNB-WEDXCCLWSA-N Lys-Thr-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O JHNOXVASMSXSNB-WEDXCCLWSA-N 0.000 description 2
- RMOKGALPSPOYKE-KATARQTJSA-N Lys-Thr-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O RMOKGALPSPOYKE-KATARQTJSA-N 0.000 description 2
- VHTOGMKQXXJOHG-RHYQMDGZSA-N Lys-Thr-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O VHTOGMKQXXJOHG-RHYQMDGZSA-N 0.000 description 2
- 108010009254 Lysosomal-Associated Membrane Protein 1 Proteins 0.000 description 2
- 102100035133 Lysosome-associated membrane glycoprotein 1 Human genes 0.000 description 2
- 108010052285 Membrane Proteins Proteins 0.000 description 2
- 102000018697 Membrane Proteins Human genes 0.000 description 2
- YRAWWKUTNBILNT-FXQIFTODSA-N Met-Ala-Ala Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O YRAWWKUTNBILNT-FXQIFTODSA-N 0.000 description 2
- QZPXMHVKPHJNTR-DCAQKATOSA-N Met-Leu-Asn Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O QZPXMHVKPHJNTR-DCAQKATOSA-N 0.000 description 2
- PHKBGZKVOJCIMZ-SRVKXCTJSA-N Met-Pro-Arg Chemical compound CSCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(O)=O PHKBGZKVOJCIMZ-SRVKXCTJSA-N 0.000 description 2
- 102000009664 Microtubule-Associated Proteins Human genes 0.000 description 2
- 108010020004 Microtubule-Associated Proteins Proteins 0.000 description 2
- 208000016285 Movement disease Diseases 0.000 description 2
- 241000699666 Mus <mouse, genus> Species 0.000 description 2
- PESQCPHRXOFIPX-UHFFFAOYSA-N N-L-methionyl-L-tyrosine Natural products CSCCC(N)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 PESQCPHRXOFIPX-UHFFFAOYSA-N 0.000 description 2
- 108010047562 NGR peptide Proteins 0.000 description 2
- 101100342977 Neurospora crassa (strain ATCC 24698 / 74-OR23-1A / CBS 708.71 / DSM 1257 / FGSC 987) leu-1 gene Proteins 0.000 description 2
- MQVFHOPCKNTHGT-MELADBBJSA-N Phe-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC2=CC=CC=C2)N)C(=O)O MQVFHOPCKNTHGT-MELADBBJSA-N 0.000 description 2
- PDUVELWDJZOUEI-IHRRRGAJSA-N Phe-Cys-Arg Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O PDUVELWDJZOUEI-IHRRRGAJSA-N 0.000 description 2
- ZBYHVSHBZYHQBW-SRVKXCTJSA-N Phe-Cys-Asp Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(=O)O)C(=O)O)N ZBYHVSHBZYHQBW-SRVKXCTJSA-N 0.000 description 2
- UNLYPPYNDXHGDG-IHRRRGAJSA-N Phe-Gln-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 UNLYPPYNDXHGDG-IHRRRGAJSA-N 0.000 description 2
- YYKZDTVQHTUKDW-RYUDHWBXSA-N Phe-Gly-Gln Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)NCC(=O)N[C@@H](CCC(=O)N)C(=O)O)N YYKZDTVQHTUKDW-RYUDHWBXSA-N 0.000 description 2
- OVJMCXAPGFDGMG-HKUYNNGSSA-N Phe-Gly-Trp Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)NCC(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O OVJMCXAPGFDGMG-HKUYNNGSSA-N 0.000 description 2
- HQCSLJFGZYOXHW-KKUMJFAQSA-N Phe-His-Cys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)N[C@@H](CS)C(=O)O)N HQCSLJFGZYOXHW-KKUMJFAQSA-N 0.000 description 2
- VZFPYFRVHMSSNA-JURCDPSOSA-N Phe-Ile-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CC1=CC=CC=C1 VZFPYFRVHMSSNA-JURCDPSOSA-N 0.000 description 2
- IWZRODDWOSIXPZ-IRXDYDNUSA-N Phe-Phe-Gly Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(=O)NCC(O)=O)C1=CC=CC=C1 IWZRODDWOSIXPZ-IRXDYDNUSA-N 0.000 description 2
- GKRCCTYAGQPMMP-IHRRRGAJSA-N Phe-Ser-Met Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCSC)C(O)=O GKRCCTYAGQPMMP-IHRRRGAJSA-N 0.000 description 2
- PTDAGKJHZBGDKD-OEAJRASXSA-N Phe-Thr-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N)O PTDAGKJHZBGDKD-OEAJRASXSA-N 0.000 description 2
- BPIMVBKDLSBKIJ-FCLVOEFKSA-N Phe-Thr-Phe Chemical compound C([C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 BPIMVBKDLSBKIJ-FCLVOEFKSA-N 0.000 description 2
- AGTHXWTYCLLYMC-FHWLQOOXSA-N Phe-Tyr-Glu Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)N[C@@H](CCC(O)=O)C(O)=O)C1=CC=CC=C1 AGTHXWTYCLLYMC-FHWLQOOXSA-N 0.000 description 2
- DXWNFNOPBYAFRM-IHRRRGAJSA-N Phe-Val-Cys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N DXWNFNOPBYAFRM-IHRRRGAJSA-N 0.000 description 2
- FCCBQBZXIAZNIG-LSJOCFKGSA-N Pro-Ala-His Chemical compound C[C@H](NC(=O)[C@@H]1CCCN1)C(=O)N[C@@H](Cc1cnc[nH]1)C(O)=O FCCBQBZXIAZNIG-LSJOCFKGSA-N 0.000 description 2
- XQLBWXHVZVBNJM-FXQIFTODSA-N Pro-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1 XQLBWXHVZVBNJM-FXQIFTODSA-N 0.000 description 2
- SSSFPISOZOLQNP-GUBZILKMSA-N Pro-Arg-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O SSSFPISOZOLQNP-GUBZILKMSA-N 0.000 description 2
- VCYJKOLZYPYGJV-AVGNSLFASA-N Pro-Arg-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O VCYJKOLZYPYGJV-AVGNSLFASA-N 0.000 description 2
- MGDFPGCFVJFITQ-CIUDSAMLSA-N Pro-Glu-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O MGDFPGCFVJFITQ-CIUDSAMLSA-N 0.000 description 2
- FRKBNXCFJBPJOL-GUBZILKMSA-N Pro-Glu-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O FRKBNXCFJBPJOL-GUBZILKMSA-N 0.000 description 2
- NMELOOXSGDRBRU-YUMQZZPRSA-N Pro-Glu-Gly Chemical compound OC(=O)CNC(=O)[C@H](CCC(=O)O)NC(=O)[C@@H]1CCCN1 NMELOOXSGDRBRU-YUMQZZPRSA-N 0.000 description 2
- FKLSMYYLJHYPHH-UWVGGRQHSA-N Pro-Gly-Leu Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O FKLSMYYLJHYPHH-UWVGGRQHSA-N 0.000 description 2
- FMLRRBDLBJLJIK-DCAQKATOSA-N Pro-Leu-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H]1CCCN1 FMLRRBDLBJLJIK-DCAQKATOSA-N 0.000 description 2
- BRJGUPWVFXKBQI-XUXIUFHCSA-N Pro-Leu-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O BRJGUPWVFXKBQI-XUXIUFHCSA-N 0.000 description 2
- DRKAXLDECUGLFE-ULQDDVLXSA-N Pro-Leu-Phe Chemical compound CC(C)C[C@H](NC(=O)[C@@H]1CCCN1)C(=O)N[C@@H](Cc1ccccc1)C(O)=O DRKAXLDECUGLFE-ULQDDVLXSA-N 0.000 description 2
- MHHQQZIFLWFZGR-DCAQKATOSA-N Pro-Lys-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O MHHQQZIFLWFZGR-DCAQKATOSA-N 0.000 description 2
- VGVCNKSUVSZEIE-IHRRRGAJSA-N Pro-Phe-Asn Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(N)=O)C(O)=O VGVCNKSUVSZEIE-IHRRRGAJSA-N 0.000 description 2
- KDBHVPXBQADZKY-GUBZILKMSA-N Pro-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 KDBHVPXBQADZKY-GUBZILKMSA-N 0.000 description 2
- NAIPAPCKKRCMBL-JYJNAYRXSA-N Pro-Pro-Phe Chemical compound C([C@@H](C(=O)O)NC(=O)[C@H]1N(CCC1)C(=O)[C@H]1NCCC1)C1=CC=CC=C1 NAIPAPCKKRCMBL-JYJNAYRXSA-N 0.000 description 2
- FDMCIBSQRKFSTJ-RHYQMDGZSA-N Pro-Thr-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O FDMCIBSQRKFSTJ-RHYQMDGZSA-N 0.000 description 2
- GZNYIXWOIUFLGO-ZJDVBMNYSA-N Pro-Thr-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GZNYIXWOIUFLGO-ZJDVBMNYSA-N 0.000 description 2
- VVAWNPIOYXAMAL-KJEVXHAQSA-N Pro-Thr-Tyr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O VVAWNPIOYXAMAL-KJEVXHAQSA-N 0.000 description 2
- YIPFBJGBRCJJJD-FHWLQOOXSA-N Pro-Trp-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@@H]3CCCN3 YIPFBJGBRCJJJD-FHWLQOOXSA-N 0.000 description 2
- ZYJMLBCDFPIGNL-JYJNAYRXSA-N Pro-Tyr-Arg Chemical compound NC(=N)NCCC[C@H](NC(=O)[C@H](Cc1ccc(O)cc1)NC(=O)[C@@H]1CCCN1)C(O)=O ZYJMLBCDFPIGNL-JYJNAYRXSA-N 0.000 description 2
- IMNVAOPEMFDAQD-NHCYSSNCSA-N Pro-Val-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O IMNVAOPEMFDAQD-NHCYSSNCSA-N 0.000 description 2
- 208000028017 Psychotic disease Diseases 0.000 description 2
- 238000002123 RNA extraction Methods 0.000 description 2
- 102000000574 RNA-Induced Silencing Complex Human genes 0.000 description 2
- 108010016790 RNA-Induced Silencing Complex Proteins 0.000 description 2
- 102000017852 Saposin Human genes 0.000 description 2
- 108050007079 Saposin Proteins 0.000 description 2
- MMGJPDWSIOAGTH-ACZMJKKPSA-N Ser-Ala-Gln Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(O)=O MMGJPDWSIOAGTH-ACZMJKKPSA-N 0.000 description 2
- YQHZVYJAGWMHES-ZLUOBGJFSA-N Ser-Ala-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O YQHZVYJAGWMHES-ZLUOBGJFSA-N 0.000 description 2
- JPIDMRXXNMIVKY-VZFHVOOUSA-N Ser-Ala-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O JPIDMRXXNMIVKY-VZFHVOOUSA-N 0.000 description 2
- HBZBPFLJNDXRAY-FXQIFTODSA-N Ser-Ala-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O HBZBPFLJNDXRAY-FXQIFTODSA-N 0.000 description 2
- YUSRGTQIPCJNHQ-CIUDSAMLSA-N Ser-Arg-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O YUSRGTQIPCJNHQ-CIUDSAMLSA-N 0.000 description 2
- FTVRVZNYIYWJGB-ACZMJKKPSA-N Ser-Asp-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O FTVRVZNYIYWJGB-ACZMJKKPSA-N 0.000 description 2
- MMAPOBOTRUVNKJ-ZLUOBGJFSA-N Ser-Asp-Ser Chemical compound C([C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CO)N)C(=O)O MMAPOBOTRUVNKJ-ZLUOBGJFSA-N 0.000 description 2
- HEQPKICPPDOSIN-SRVKXCTJSA-N Ser-Asp-Tyr Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 HEQPKICPPDOSIN-SRVKXCTJSA-N 0.000 description 2
- RFBKULCUBJAQFT-BIIVOSGPSA-N Ser-Cys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CS)NC(=O)[C@H](CO)N)C(=O)O RFBKULCUBJAQFT-BIIVOSGPSA-N 0.000 description 2
- GWMXFEMMBHOKDX-AVGNSLFASA-N Ser-Gln-Phe Chemical compound OC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 GWMXFEMMBHOKDX-AVGNSLFASA-N 0.000 description 2
- SMIDBHKWSYUBRZ-ACZMJKKPSA-N Ser-Glu-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O SMIDBHKWSYUBRZ-ACZMJKKPSA-N 0.000 description 2
- HJEBZBMOTCQYDN-ACZMJKKPSA-N Ser-Glu-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O HJEBZBMOTCQYDN-ACZMJKKPSA-N 0.000 description 2
- AEGUWTFAQQWVLC-BQBZGAKWSA-N Ser-Gly-Arg Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CCCNC(N)=N)C(O)=O AEGUWTFAQQWVLC-BQBZGAKWSA-N 0.000 description 2
- IOVHBRCQOGWAQH-ZKWXMUAHSA-N Ser-Gly-Ile Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H]([C@@H](C)CC)C(O)=O IOVHBRCQOGWAQH-ZKWXMUAHSA-N 0.000 description 2
- XXXAXOWMBOKTRN-XPUUQOCRSA-N Ser-Gly-Val Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O XXXAXOWMBOKTRN-XPUUQOCRSA-N 0.000 description 2
- RIAKPZVSNBBNRE-BJDJZHNGSA-N Ser-Ile-Leu Chemical compound OC[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O RIAKPZVSNBBNRE-BJDJZHNGSA-N 0.000 description 2
- GJFYFGOEWLDQGW-GUBZILKMSA-N Ser-Leu-Gln Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CO)N GJFYFGOEWLDQGW-GUBZILKMSA-N 0.000 description 2
- HEUVHBXOVZONPU-BJDJZHNGSA-N Ser-Leu-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O HEUVHBXOVZONPU-BJDJZHNGSA-N 0.000 description 2
- VMLONWHIORGALA-SRVKXCTJSA-N Ser-Leu-Leu Chemical compound CC(C)C[C@@H](C([O-])=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H]([NH3+])CO VMLONWHIORGALA-SRVKXCTJSA-N 0.000 description 2
- KCGIREHVWRXNDH-GARJFASQSA-N Ser-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CO)N KCGIREHVWRXNDH-GARJFASQSA-N 0.000 description 2
- YUJLIIRMIAGMCQ-CIUDSAMLSA-N Ser-Leu-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O YUJLIIRMIAGMCQ-CIUDSAMLSA-N 0.000 description 2
- XKFJENWJGHMDLI-QWRGUYRKSA-N Ser-Phe-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)NCC(O)=O XKFJENWJGHMDLI-QWRGUYRKSA-N 0.000 description 2
- WNDUPCKKKGSKIQ-CIUDSAMLSA-N Ser-Pro-Gln Chemical compound OC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O WNDUPCKKKGSKIQ-CIUDSAMLSA-N 0.000 description 2
- AZWNCEBQZXELEZ-FXQIFTODSA-N Ser-Pro-Ser Chemical compound OC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O AZWNCEBQZXELEZ-FXQIFTODSA-N 0.000 description 2
- XGQKSRGHEZNWIS-IHRRRGAJSA-N Ser-Pro-Tyr Chemical compound N[C@@H](CO)C(=O)N1CCC[C@H]1C(=O)N[C@@H](Cc1ccc(O)cc1)C(O)=O XGQKSRGHEZNWIS-IHRRRGAJSA-N 0.000 description 2
- DKGRNFUXVTYRAS-UBHSHLNASA-N Ser-Ser-Trp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O DKGRNFUXVTYRAS-UBHSHLNASA-N 0.000 description 2
- WUXCHQZLUHBSDJ-LKXGYXEUSA-N Ser-Thr-Asp Chemical compound OC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CC(O)=O)C(O)=O WUXCHQZLUHBSDJ-LKXGYXEUSA-N 0.000 description 2
- OQSQCUWQOIHECT-YJRXYDGGSA-N Ser-Tyr-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OQSQCUWQOIHECT-YJRXYDGGSA-N 0.000 description 2
- YEDSOSIKVUMIJE-DCAQKATOSA-N Ser-Val-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O YEDSOSIKVUMIJE-DCAQKATOSA-N 0.000 description 2
- SIEBDTCABMZCLF-XGEHTFHBSA-N Ser-Val-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SIEBDTCABMZCLF-XGEHTFHBSA-N 0.000 description 2
- 108020004682 Single-Stranded DNA Proteins 0.000 description 2
- 101710201924 Sphingomyelin phosphodiesterase 1 Proteins 0.000 description 2
- 101710095280 Sphingomyelinase C 1 Proteins 0.000 description 2
- LVHHEVGYAZGXDE-KDXUFGMBSA-N Thr-Ala-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C)C(=O)N1CCC[C@@H]1C(=O)O)N)O LVHHEVGYAZGXDE-KDXUFGMBSA-N 0.000 description 2
- VFEHSAJCWWHDBH-RHYQMDGZSA-N Thr-Arg-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O VFEHSAJCWWHDBH-RHYQMDGZSA-N 0.000 description 2
- WFUAUEQXPVNAEF-ZJDVBMNYSA-N Thr-Arg-Thr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H]([C@@H](C)O)C(O)=O)CCCN=C(N)N WFUAUEQXPVNAEF-ZJDVBMNYSA-N 0.000 description 2
- YLXAMFZYJTZXFH-OLHMAJIHSA-N Thr-Asn-Asp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N)O YLXAMFZYJTZXFH-OLHMAJIHSA-N 0.000 description 2
- JEDIEMIJYSRUBB-FOHZUACHSA-N Thr-Asp-Gly Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O JEDIEMIJYSRUBB-FOHZUACHSA-N 0.000 description 2
- GCXFWAZRHBRYEM-NUMRIWBASA-N Thr-Gln-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC(=O)N)C(=O)O)N)O GCXFWAZRHBRYEM-NUMRIWBASA-N 0.000 description 2
- NIEWSKWFURSECR-FOHZUACHSA-N Thr-Gly-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O NIEWSKWFURSECR-FOHZUACHSA-N 0.000 description 2
- FQPDRTDDEZXCEC-SVSWQMSJSA-N Thr-Ile-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O FQPDRTDDEZXCEC-SVSWQMSJSA-N 0.000 description 2
- IMDMLDSVUSMAEJ-HJGDQZAQSA-N Thr-Leu-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O IMDMLDSVUSMAEJ-HJGDQZAQSA-N 0.000 description 2
- FLPZMPOZGYPBEN-PPCPHDFISA-N Thr-Leu-Ile Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O FLPZMPOZGYPBEN-PPCPHDFISA-N 0.000 description 2
- YOOAQCZYZHGUAZ-KATARQTJSA-N Thr-Leu-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O YOOAQCZYZHGUAZ-KATARQTJSA-N 0.000 description 2
- WNQJTLATMXYSEL-OEAJRASXSA-N Thr-Phe-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O WNQJTLATMXYSEL-OEAJRASXSA-N 0.000 description 2
- NDXSOKGYKCGYKT-VEVYYDQMSA-N Thr-Pro-Asp Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O NDXSOKGYKCGYKT-VEVYYDQMSA-N 0.000 description 2
- XZUBGOYOGDRYFC-XGEHTFHBSA-N Thr-Ser-Met Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCSC)C(O)=O XZUBGOYOGDRYFC-XGEHTFHBSA-N 0.000 description 2
- ZMYCLHFLHRVOEA-HEIBUPTGSA-N Thr-Thr-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O ZMYCLHFLHRVOEA-HEIBUPTGSA-N 0.000 description 2
- GRIUMVXCJDKVPI-IZPVPAKOSA-N Thr-Thr-Tyr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O GRIUMVXCJDKVPI-IZPVPAKOSA-N 0.000 description 2
- AKHDFZHUPGVFEJ-YEPSODPASA-N Thr-Val-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O AKHDFZHUPGVFEJ-YEPSODPASA-N 0.000 description 2
- IQFYYKKMVGJFEH-XLPZGREQSA-N Thymidine Chemical compound O=C1NC(=O)C(C)=CN1[C@@H]1O[C@H](CO)[C@@H](O)C1 IQFYYKKMVGJFEH-XLPZGREQSA-N 0.000 description 2
- 101710175911 Transmembrane protein 106B Proteins 0.000 description 2
- 206010044565 Tremor Diseases 0.000 description 2
- BRBCKMMXKONBAA-KWBADKCTSA-N Trp-Ala-Ala Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O)=CNC2=C1 BRBCKMMXKONBAA-KWBADKCTSA-N 0.000 description 2
- NMCBVGFGWSIGSB-NUTKFTJISA-N Trp-Ala-Leu Chemical compound C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N NMCBVGFGWSIGSB-NUTKFTJISA-N 0.000 description 2
- GTNCSPKYWCJZAC-XIRDDKMYSA-N Trp-Asp-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N GTNCSPKYWCJZAC-XIRDDKMYSA-N 0.000 description 2
- XZLHHHYSWIYXHD-XIRDDKMYSA-N Trp-Gln-Arg Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O XZLHHHYSWIYXHD-XIRDDKMYSA-N 0.000 description 2
- MDDYTWOFHZFABW-SZMVWBNQSA-N Trp-Gln-Leu Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O)=CNC2=C1 MDDYTWOFHZFABW-SZMVWBNQSA-N 0.000 description 2
- YRSOERSDNRSCBC-XIRDDKMYSA-N Trp-His-Cys Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC3=CN=CN3)C(=O)N[C@@H](CS)C(=O)O)N YRSOERSDNRSCBC-XIRDDKMYSA-N 0.000 description 2
- CXPJPTFWKXNDKV-NUTKFTJISA-N Trp-Leu-Ala Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O)=CNC2=C1 CXPJPTFWKXNDKV-NUTKFTJISA-N 0.000 description 2
- UJRIVCPPPMYCNA-HOCLYGCPSA-N Trp-Leu-Gly Chemical compound CC(C)C[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N UJRIVCPPPMYCNA-HOCLYGCPSA-N 0.000 description 2
- TVOGEPLDNYTAHD-CQDKDKBSSA-N Tyr-Ala-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 TVOGEPLDNYTAHD-CQDKDKBSSA-N 0.000 description 2
- CYDVHRFXDMDMGX-KKUMJFAQSA-N Tyr-Asn-His Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N)O CYDVHRFXDMDMGX-KKUMJFAQSA-N 0.000 description 2
- RIJPHPUJRLEOAK-JYJNAYRXSA-N Tyr-Gln-His Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N)O RIJPHPUJRLEOAK-JYJNAYRXSA-N 0.000 description 2
- HVHJYXDXRIWELT-RYUDHWBXSA-N Tyr-Glu-Gly Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O HVHJYXDXRIWELT-RYUDHWBXSA-N 0.000 description 2
- KCPFDGNYAMKZQP-KBPBESRZSA-N Tyr-Gly-Leu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O KCPFDGNYAMKZQP-KBPBESRZSA-N 0.000 description 2
- GGXUDPQWAWRINY-XEGUGMAKSA-N Tyr-Ile-Gly Chemical compound OC(=O)CNC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 GGXUDPQWAWRINY-XEGUGMAKSA-N 0.000 description 2
- QSFJHIRIHOJRKS-ULQDDVLXSA-N Tyr-Leu-Arg Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O QSFJHIRIHOJRKS-ULQDDVLXSA-N 0.000 description 2
- MVFQLSPDMMFCMW-KKUMJFAQSA-N Tyr-Leu-Asn Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O MVFQLSPDMMFCMW-KKUMJFAQSA-N 0.000 description 2
- KSCVLGXNQXKUAR-JYJNAYRXSA-N Tyr-Leu-Glu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O KSCVLGXNQXKUAR-JYJNAYRXSA-N 0.000 description 2
- KHCSOLAHNLOXJR-BZSNNMDCSA-N Tyr-Leu-Leu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O KHCSOLAHNLOXJR-BZSNNMDCSA-N 0.000 description 2
- HSBZWINKRYZCSQ-KKUMJFAQSA-N Tyr-Lys-Asp Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O HSBZWINKRYZCSQ-KKUMJFAQSA-N 0.000 description 2
- HZDQUVQEVVYDDA-ACRUOGEOSA-N Tyr-Tyr-Leu Chemical compound C([C@@H](C(=O)N[C@@H](CC(C)C)C(O)=O)NC(=O)[C@@H](N)CC=1C=CC(O)=CC=1)C1=CC=C(O)C=C1 HZDQUVQEVVYDDA-ACRUOGEOSA-N 0.000 description 2
- HZWPGKAKGYJWCI-ULQDDVLXSA-N Tyr-Val-Leu Chemical compound CC(C)C[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)Cc1ccc(O)cc1)C(C)C)C(O)=O HZWPGKAKGYJWCI-ULQDDVLXSA-N 0.000 description 2
- DRTQHJPVMGBUCF-XVFCMESISA-N Uridine Chemical compound O[C@@H]1[C@H](O)[C@@H](CO)O[C@H]1N1C(=O)NC(=O)C=C1 DRTQHJPVMGBUCF-XVFCMESISA-N 0.000 description 2
- DDRBQONWVBDQOY-GUBZILKMSA-N Val-Ala-Arg Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O DDRBQONWVBDQOY-GUBZILKMSA-N 0.000 description 2
- QPZMOUMNTGTEFR-ZKWXMUAHSA-N Val-Asn-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](C(C)C)N QPZMOUMNTGTEFR-ZKWXMUAHSA-N 0.000 description 2
- VLOYGOZDPGYWFO-LAEOZQHASA-N Val-Asp-Glu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O VLOYGOZDPGYWFO-LAEOZQHASA-N 0.000 description 2
- SZTTYWIUCGSURQ-AUTRQRHGSA-N Val-Glu-Glu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O SZTTYWIUCGSURQ-AUTRQRHGSA-N 0.000 description 2
- VCAWFLIWYNMHQP-UKJIMTQDSA-N Val-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](C(C)C)N VCAWFLIWYNMHQP-UKJIMTQDSA-N 0.000 description 2
- NXRAUQGGHPCJIB-RCOVLWMOSA-N Val-Gly-Asn Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O NXRAUQGGHPCJIB-RCOVLWMOSA-N 0.000 description 2
- FEXILLGKGGTLRI-NHCYSSNCSA-N Val-Leu-Asn Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](C(C)C)N FEXILLGKGGTLRI-NHCYSSNCSA-N 0.000 description 2
- HGJRMXOWUWVUOA-GVXVVHGQSA-N Val-Leu-Gln Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](C(C)C)N HGJRMXOWUWVUOA-GVXVVHGQSA-N 0.000 description 2
- ZHQWPWQNVRCXAX-XQQFMLRXSA-N Val-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](C(C)C)N ZHQWPWQNVRCXAX-XQQFMLRXSA-N 0.000 description 2
- BTWMICVCQLKKNR-DCAQKATOSA-N Val-Leu-Ser Chemical compound CC(C)[C@H]([NH3+])C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C([O-])=O BTWMICVCQLKKNR-DCAQKATOSA-N 0.000 description 2
- KTEZUXISLQTDDQ-NHCYSSNCSA-N Val-Lys-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(=O)O)C(=O)O)N KTEZUXISLQTDDQ-NHCYSSNCSA-N 0.000 description 2
- JAKHAONCJJZVHT-DCAQKATOSA-N Val-Lys-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)O)N JAKHAONCJJZVHT-DCAQKATOSA-N 0.000 description 2
- MGVYZTPLGXPVQB-CYDGBPFRSA-N Val-Met-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](C(C)C)N MGVYZTPLGXPVQB-CYDGBPFRSA-N 0.000 description 2
- VNGKMNPAENRGDC-JYJNAYRXSA-N Val-Phe-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C(C)C)CC1=CC=CC=C1 VNGKMNPAENRGDC-JYJNAYRXSA-N 0.000 description 2
- VCIYTVOBLZHFSC-XHSDSOJGSA-N Val-Phe-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N2CCC[C@@H]2C(=O)O)N VCIYTVOBLZHFSC-XHSDSOJGSA-N 0.000 description 2
- DEGUERSKQBRZMZ-FXQIFTODSA-N Val-Ser-Ala Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O DEGUERSKQBRZMZ-FXQIFTODSA-N 0.000 description 2
- RYHUIHUOYRNNIE-NRPADANISA-N Val-Ser-Gln Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N RYHUIHUOYRNNIE-NRPADANISA-N 0.000 description 2
- UGFMVXRXULGLNO-XPUUQOCRSA-N Val-Ser-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O UGFMVXRXULGLNO-XPUUQOCRSA-N 0.000 description 2
- VHIZXDZMTDVFGX-DCAQKATOSA-N Val-Ser-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](C(C)C)N VHIZXDZMTDVFGX-DCAQKATOSA-N 0.000 description 2
- QTPQHINADBYBNA-DCAQKATOSA-N Val-Ser-Lys Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCCN QTPQHINADBYBNA-DCAQKATOSA-N 0.000 description 2
- PZTZYZUTCPZWJH-FXQIFTODSA-N Val-Ser-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)O)N PZTZYZUTCPZWJH-FXQIFTODSA-N 0.000 description 2
- CEKSLIVSNNGOKH-KZVJFYERSA-N Val-Thr-Ala Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C)C(=O)O)NC(=O)[C@H](C(C)C)N)O CEKSLIVSNNGOKH-KZVJFYERSA-N 0.000 description 2
- DLRZGNXCXUGIDG-KKHAAJSZSA-N Val-Thr-Asp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N)O DLRZGNXCXUGIDG-KKHAAJSZSA-N 0.000 description 2
- JAIZPWVHPQRYOU-ZJDVBMNYSA-N Val-Thr-Thr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](C(C)C)N)O JAIZPWVHPQRYOU-ZJDVBMNYSA-N 0.000 description 2
- LNWSJGJCLFUNTN-ZOBUZTSGSA-N Val-Trp-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CC(=O)N)C(=O)O)N LNWSJGJCLFUNTN-ZOBUZTSGSA-N 0.000 description 2
- JXWGBRRVTRAZQA-ULQDDVLXSA-N Val-Tyr-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](C(C)C)N JXWGBRRVTRAZQA-ULQDDVLXSA-N 0.000 description 2
- AEFJNECXZCODJM-UWVGGRQHSA-N Val-Val-Gly Chemical compound CC(C)[C@H]([NH3+])C(=O)N[C@@H](C(C)C)C(=O)NCC([O-])=O AEFJNECXZCODJM-UWVGGRQHSA-N 0.000 description 2
- 238000007792 addition Methods 0.000 description 2
- 108010024078 alanyl-glycyl-serine Proteins 0.000 description 2
- 108010070783 alanyltyrosine Proteins 0.000 description 2
- 230000004075 alteration Effects 0.000 description 2
- 235000001014 amino acid Nutrition 0.000 description 2
- 108010013835 arginine glutamate Proteins 0.000 description 2
- 108010052670 arginyl-glutamyl-glutamic acid Proteins 0.000 description 2
- 108010072041 arginyl-glycyl-aspartic acid Proteins 0.000 description 2
- 108010068380 arginylarginine Proteins 0.000 description 2
- 210000004436 artificial bacterial chromosome Anatomy 0.000 description 2
- 108010093581 aspartyl-proline Proteins 0.000 description 2
- 230000001580 bacterial effect Effects 0.000 description 2
- 238000013320 baculovirus expression vector system Methods 0.000 description 2
- 238000012742 biochemical analysis Methods 0.000 description 2
- 230000004071 biological effect Effects 0.000 description 2
- 210000001715 carotid artery Anatomy 0.000 description 2
- 239000000969 carrier Substances 0.000 description 2
- 239000013592 cell lysate Substances 0.000 description 2
- 230000001149 cognitive effect Effects 0.000 description 2
- 230000001054 cortical effect Effects 0.000 description 2
- 230000007547 defect Effects 0.000 description 2
- 230000007850 degeneration Effects 0.000 description 2
- 238000012217 deletion Methods 0.000 description 2
- 230000037430 deletion Effects 0.000 description 2
- 238000011161 development Methods 0.000 description 2
- 230000018109 developmental process Effects 0.000 description 2
- 239000003085 diluting agent Substances 0.000 description 2
- 108010054813 diprotin B Proteins 0.000 description 2
- VYFYYTLLBUKUHU-UHFFFAOYSA-N dopamine Chemical compound NCCC1=CC=C(O)C(O)=C1 VYFYYTLLBUKUHU-UHFFFAOYSA-N 0.000 description 2
- 239000003937 drug carrier Substances 0.000 description 2
- 230000004064 dysfunction Effects 0.000 description 2
- 239000003623 enhancer Substances 0.000 description 2
- 230000002255 enzymatic effect Effects 0.000 description 2
- 230000005284 excitation Effects 0.000 description 2
- 239000012091 fetal bovine serum Substances 0.000 description 2
- 238000000799 fluorescence microscopy Methods 0.000 description 2
- 238000002825 functional assay Methods 0.000 description 2
- 230000004927 fusion Effects 0.000 description 2
- 210000004602 germ cell Anatomy 0.000 description 2
- 108010085059 glutamyl-arginyl-proline Proteins 0.000 description 2
- 150000002339 glycosphingolipids Chemical class 0.000 description 2
- 108010027668 glycyl-alanyl-valine Proteins 0.000 description 2
- 108010026364 glycyl-glycyl-leucine Proteins 0.000 description 2
- 108010010096 glycyl-glycyl-tyrosine Proteins 0.000 description 2
- 108010066198 glycyl-leucyl-phenylalanine Proteins 0.000 description 2
- 108010082286 glycyl-seryl-alanine Proteins 0.000 description 2
- 108010059898 glycyl-tyrosyl-lysine Proteins 0.000 description 2
- 108010084389 glycyltryptophan Proteins 0.000 description 2
- 239000001963 growth medium Substances 0.000 description 2
- 108010036413 histidylglycine Proteins 0.000 description 2
- 108010028295 histidylhistidine Proteins 0.000 description 2
- 102000045630 human GBA Human genes 0.000 description 2
- 229960003444 immunosuppressant agent Drugs 0.000 description 2
- 230000001861 immunosuppressant effect Effects 0.000 description 2
- 239000003018 immunosuppressive agent Substances 0.000 description 2
- 230000006872 improvement Effects 0.000 description 2
- 210000004263 induced pluripotent stem cell Anatomy 0.000 description 2
- 208000015181 infectious disease Diseases 0.000 description 2
- 238000001802 infusion Methods 0.000 description 2
- 239000004615 ingredient Substances 0.000 description 2
- 239000007928 intraperitoneal injection Substances 0.000 description 2
- 238000001990 intravenous administration Methods 0.000 description 2
- 108010090333 leucyl-lysyl-proline Proteins 0.000 description 2
- 108010030617 leucyl-phenylalanyl-valine Proteins 0.000 description 2
- 230000000670 limiting effect Effects 0.000 description 2
- 230000004807 localization Effects 0.000 description 2
- 230000033001 locomotion Effects 0.000 description 2
- 239000006166 lysate Substances 0.000 description 2
- 108010044348 lysyl-glutamyl-aspartic acid Proteins 0.000 description 2
- 108010054155 lysyllysine Proteins 0.000 description 2
- 238000005259 measurement Methods 0.000 description 2
- 239000002609 medium Substances 0.000 description 2
- 230000004060 metabolic process Effects 0.000 description 2
- 108010068488 methionylphenylalanine Proteins 0.000 description 2
- 210000001616 monocyte Anatomy 0.000 description 2
- 239000000178 monomer Substances 0.000 description 2
- 230000007659 motor function Effects 0.000 description 2
- 239000013642 negative control Substances 0.000 description 2
- 238000000424 optical density measurement Methods 0.000 description 2
- 239000002245 particle Substances 0.000 description 2
- 108010084525 phenylalanyl-phenylalanyl-glycine Proteins 0.000 description 2
- 108010024654 phenylalanyl-prolyl-alanine Proteins 0.000 description 2
- 108010084572 phenylalanyl-valine Proteins 0.000 description 2
- 108010073025 phenylalanylphenylalanine Proteins 0.000 description 2
- 229920000729 poly(L-lysine) polymer Polymers 0.000 description 2
- 238000003752 polymerase chain reaction Methods 0.000 description 2
- 230000001124 posttranscriptional effect Effects 0.000 description 2
- 230000001323 posttranslational effect Effects 0.000 description 2
- 239000002243 precursor Substances 0.000 description 2
- 108010014614 prolyl-glycyl-proline Proteins 0.000 description 2
- 108700042769 prolyl-leucyl-glycine Proteins 0.000 description 2
- 108010077112 prolyl-proline Proteins 0.000 description 2
- 108010079317 prolyl-tyrosine Proteins 0.000 description 2
- 108010004914 prolylarginine Proteins 0.000 description 2
- 108010070643 prolylglutamic acid Proteins 0.000 description 2
- 108010029020 prolylglycine Proteins 0.000 description 2
- 238000010814 radioimmunoprecipitation assay Methods 0.000 description 2
- 108020003175 receptors Proteins 0.000 description 2
- 102000005962 receptors Human genes 0.000 description 2
- 230000009467 reduction Effects 0.000 description 2
- 230000006942 regulation of dendrite morphogenesis Effects 0.000 description 2
- 230000002441 reversible effect Effects 0.000 description 2
- 239000013609 scAAV vector Substances 0.000 description 2
- 210000002966 serum Anatomy 0.000 description 2
- 210000000278 spinal cord Anatomy 0.000 description 2
- UCSJYZPVAKXKNQ-HZYVHMACSA-N streptomycin Chemical compound CN[C@H]1[C@H](O)[C@@H](O)[C@H](CO)O[C@H]1O[C@@H]1[C@](C=O)(O)[C@H](C)O[C@H]1O[C@@H]1[C@@H](NC(N)=N)[C@H](O)[C@@H](NC(N)=N)[C@H](O)[C@H]1O UCSJYZPVAKXKNQ-HZYVHMACSA-N 0.000 description 2
- 210000002504 synaptic vesicle Anatomy 0.000 description 2
- 230000002123 temporal effect Effects 0.000 description 2
- 230000001225 therapeutic effect Effects 0.000 description 2
- 238000011200 topical administration Methods 0.000 description 2
- 230000002103 transcriptional effect Effects 0.000 description 2
- 108010015666 tryptophyl-leucyl-glutamic acid Proteins 0.000 description 2
- 108010084932 tryptophyl-proline Proteins 0.000 description 2
- 108010044292 tryptophyltyrosine Proteins 0.000 description 2
- 108010051110 tyrosyl-lysine Proteins 0.000 description 2
- 238000011144 upstream manufacturing Methods 0.000 description 2
- 238000010200 validation analysis Methods 0.000 description 2
- 108010073969 valyllysine Proteins 0.000 description 2
- HHJTWTPUPVQKNA-SKXACSAKSA-N (2r,3s,4s,5s,6r)-2-[(e,3r)-2-amino-3-hydroxyoctadec-4-enoxy]-6-(hydroxymethyl)oxane-3,4,5-triol Chemical compound CCCCCCCCCCCCC\C=C\[C@@H](O)C(N)CO[C@@H]1O[C@H](CO)[C@@H](O)[C@H](O)[C@@H]1O HHJTWTPUPVQKNA-SKXACSAKSA-N 0.000 description 1
- PKOHVHWNGUHYRE-ZFWWWQNUSA-N (2s)-1-[2-[[(2s)-2-amino-3-(1h-indol-3-yl)propanoyl]amino]acetyl]pyrrolidine-2-carboxylic acid Chemical compound O=C([C@H](CC=1C2=CC=CC=C2NC=1)N)NCC(=O)N1CCC[C@H]1C(O)=O PKOHVHWNGUHYRE-ZFWWWQNUSA-N 0.000 description 1
- BRPMXFSTKXXNHF-IUCAKERBSA-N (2s)-1-[2-[[(2s)-pyrrolidine-2-carbonyl]amino]acetyl]pyrrolidine-2-carboxylic acid Chemical compound OC(=O)[C@@H]1CCCN1C(=O)CNC(=O)[C@H]1NCCC1 BRPMXFSTKXXNHF-IUCAKERBSA-N 0.000 description 1
- PQFMROVJTOPVDF-JBDRJPRFSA-N (2s)-2-[[(2s)-2-[[(2s)-2-[[(2s)-2-amino-3-carboxypropanoyl]amino]-3-carboxypropanoyl]amino]-4-carboxybutanoyl]amino]butanedioic acid Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O PQFMROVJTOPVDF-JBDRJPRFSA-N 0.000 description 1
- WDVIDPRACNGFPP-QWRGUYRKSA-N (2s)-2-[[(2s)-6-amino-2-[[2-[(2-aminoacetyl)amino]acetyl]amino]hexanoyl]amino]-5-(diaminomethylideneamino)pentanoic acid Chemical compound NCC(=O)NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O WDVIDPRACNGFPP-QWRGUYRKSA-N 0.000 description 1
- CUVSTAMIHSSVKL-UWVGGRQHSA-N (4s)-4-[(2-aminoacetyl)amino]-5-[[(2s)-6-amino-1-(carboxymethylamino)-1-oxohexan-2-yl]amino]-5-oxopentanoic acid Chemical compound NCCCC[C@@H](C(=O)NCC(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)CN CUVSTAMIHSSVKL-UWVGGRQHSA-N 0.000 description 1
- PIDRBUDUWHBYSR-UHFFFAOYSA-N 1-[2-[[2-[(2-amino-4-methylpentanoyl)amino]-4-methylpentanoyl]amino]-4-methylpentanoyl]pyrrolidine-2-carboxylic acid Chemical compound CC(C)CC(N)C(=O)NC(CC(C)C)C(=O)NC(CC(C)C)C(=O)N1CCCC1C(O)=O PIDRBUDUWHBYSR-UHFFFAOYSA-N 0.000 description 1
- 102100027831 14-3-3 protein theta Human genes 0.000 description 1
- 101150028074 2 gene Proteins 0.000 description 1
- XWTNPSHCJMZAHQ-QMMMGPOBSA-N 2-[[2-[[2-[[(2s)-2-amino-4-methylpentanoyl]amino]acetyl]amino]acetyl]amino]acetic acid Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)NCC(=O)NCC(O)=O XWTNPSHCJMZAHQ-QMMMGPOBSA-N 0.000 description 1
- 102100037563 40S ribosomal protein S2 Human genes 0.000 description 1
- 108020003589 5' Untranslated Regions Proteins 0.000 description 1
- 241001655883 Adeno-associated virus - 1 Species 0.000 description 1
- 241000202702 Adeno-associated virus - 3 Species 0.000 description 1
- 241000580270 Adeno-associated virus - 4 Species 0.000 description 1
- 241001634120 Adeno-associated virus - 5 Species 0.000 description 1
- 241001164823 Adeno-associated virus - 7 Species 0.000 description 1
- 241001164825 Adeno-associated virus - 8 Species 0.000 description 1
- 241000649045 Adeno-associated virus 10 Species 0.000 description 1
- HHGYNJRJIINWAK-FXQIFTODSA-N Ala-Ala-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N HHGYNJRJIINWAK-FXQIFTODSA-N 0.000 description 1
- FJVAQLJNTSUQPY-CIUDSAMLSA-N Ala-Ala-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCCN FJVAQLJNTSUQPY-CIUDSAMLSA-N 0.000 description 1
- JBVSSSZFNTXJDX-YTLHQDLWSA-N Ala-Ala-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@H](C)N JBVSSSZFNTXJDX-YTLHQDLWSA-N 0.000 description 1
- KQFRUSHJPKXBMB-BHDSKKPTSA-N Ala-Ala-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](C)NC(=O)[C@@H](N)C)C(O)=O)=CNC2=C1 KQFRUSHJPKXBMB-BHDSKKPTSA-N 0.000 description 1
- SVBXIUDNTRTKHE-CIUDSAMLSA-N Ala-Arg-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O SVBXIUDNTRTKHE-CIUDSAMLSA-N 0.000 description 1
- JBGSZRYCXBPWGX-BQBZGAKWSA-N Ala-Arg-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)[C@@H](N)C)CCCN=C(N)N JBGSZRYCXBPWGX-BQBZGAKWSA-N 0.000 description 1
- LWUWMHIOBPTZBA-DCAQKATOSA-N Ala-Arg-Lys Chemical compound NC(=N)NCCC[C@H](NC(=O)[C@@H](N)C)C(=O)N[C@@H](CCCCN)C(O)=O LWUWMHIOBPTZBA-DCAQKATOSA-N 0.000 description 1
- TTXMOJWKNRJWQJ-FXQIFTODSA-N Ala-Arg-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CCCN=C(N)N TTXMOJWKNRJWQJ-FXQIFTODSA-N 0.000 description 1
- LBJYAILUMSUTAM-ZLUOBGJFSA-N Ala-Asn-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O LBJYAILUMSUTAM-ZLUOBGJFSA-N 0.000 description 1
- HGRBNYQIMKTUNT-XVYDVKMFSA-N Ala-Asn-His Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N HGRBNYQIMKTUNT-XVYDVKMFSA-N 0.000 description 1
- STACJSVFHSEZJV-GHCJXIJMSA-N Ala-Asn-Ile Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O STACJSVFHSEZJV-GHCJXIJMSA-N 0.000 description 1
- NXSFUECZFORGOG-CIUDSAMLSA-N Ala-Asn-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O NXSFUECZFORGOG-CIUDSAMLSA-N 0.000 description 1
- XCVRVWZTXPCYJT-BIIVOSGPSA-N Ala-Asn-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N1CCC[C@@H]1C(=O)O)N XCVRVWZTXPCYJT-BIIVOSGPSA-N 0.000 description 1
- FXKNPWNXPQZLES-ZLUOBGJFSA-N Ala-Asn-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O FXKNPWNXPQZLES-ZLUOBGJFSA-N 0.000 description 1
- YSMPVONNIWLJML-FXQIFTODSA-N Ala-Asp-Pro Chemical compound C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(O)=O YSMPVONNIWLJML-FXQIFTODSA-N 0.000 description 1
- BUDNAJYVCUHLSV-ZLUOBGJFSA-N Ala-Asp-Ser Chemical compound C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O BUDNAJYVCUHLSV-ZLUOBGJFSA-N 0.000 description 1
- NFDVJAKFMXHJEQ-HERUPUMHSA-N Ala-Asp-Trp Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N NFDVJAKFMXHJEQ-HERUPUMHSA-N 0.000 description 1
- IKKVASZHTMKJIR-ZKWXMUAHSA-N Ala-Asp-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O IKKVASZHTMKJIR-ZKWXMUAHSA-N 0.000 description 1
- DECCMEWNXSNSDO-ZLUOBGJFSA-N Ala-Cys-Ala Chemical compound C[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@@H](C)C(O)=O DECCMEWNXSNSDO-ZLUOBGJFSA-N 0.000 description 1
- KRHRBKYBJXMYBB-WHFBIAKZSA-N Ala-Cys-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CS)C(=O)NCC(O)=O KRHRBKYBJXMYBB-WHFBIAKZSA-N 0.000 description 1
- VIGKUFXFTPWYER-BIIVOSGPSA-N Ala-Cys-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CS)C(=O)N1CCC[C@@H]1C(=O)O)N VIGKUFXFTPWYER-BIIVOSGPSA-N 0.000 description 1
- CXQODNIBUNQWAS-CIUDSAMLSA-N Ala-Gln-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N CXQODNIBUNQWAS-CIUDSAMLSA-N 0.000 description 1
- NKJBKNVQHBZUIX-ACZMJKKPSA-N Ala-Gln-Asp Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O NKJBKNVQHBZUIX-ACZMJKKPSA-N 0.000 description 1
- ZODMADSIQZZBSQ-FXQIFTODSA-N Ala-Gln-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O ZODMADSIQZZBSQ-FXQIFTODSA-N 0.000 description 1
- MVBWLRJESQOQTM-ACZMJKKPSA-N Ala-Gln-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O MVBWLRJESQOQTM-ACZMJKKPSA-N 0.000 description 1
- YIGLXQRFQVWFEY-NRPADANISA-N Ala-Gln-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O YIGLXQRFQVWFEY-NRPADANISA-N 0.000 description 1
- KXEVYGKATAMXJJ-ACZMJKKPSA-N Ala-Glu-Asp Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O KXEVYGKATAMXJJ-ACZMJKKPSA-N 0.000 description 1
- PAIHPOGPJVUFJY-WDSKDSINSA-N Ala-Glu-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O PAIHPOGPJVUFJY-WDSKDSINSA-N 0.000 description 1
- BVSGPHDECMJBDE-HGNGGELXSA-N Ala-Glu-His Chemical compound C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N BVSGPHDECMJBDE-HGNGGELXSA-N 0.000 description 1
- GGNHBHYDMUDXQB-KBIXCLLPSA-N Ala-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@H](C)N GGNHBHYDMUDXQB-KBIXCLLPSA-N 0.000 description 1
- FBHOPGDGELNWRH-DRZSPHRISA-N Ala-Glu-Phe Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O FBHOPGDGELNWRH-DRZSPHRISA-N 0.000 description 1
- ZVFVBBGVOILKPO-WHFBIAKZSA-N Ala-Gly-Ala Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@@H](C)C(O)=O ZVFVBBGVOILKPO-WHFBIAKZSA-N 0.000 description 1
- LJFNNUBZSZCZFN-WHFBIAKZSA-N Ala-Gly-Cys Chemical compound N[C@@H](C)C(=O)NCC(=O)N[C@@H](CS)C(=O)O LJFNNUBZSZCZFN-WHFBIAKZSA-N 0.000 description 1
- WMYJZJRILUVVRG-WDSKDSINSA-N Ala-Gly-Gln Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(N)=O WMYJZJRILUVVRG-WDSKDSINSA-N 0.000 description 1
- LMFXXZPPZDCPTA-ZKWXMUAHSA-N Ala-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@H](C)N LMFXXZPPZDCPTA-ZKWXMUAHSA-N 0.000 description 1
- SIGTYDNEPYEXGK-ZANVPECISA-N Ala-Gly-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)CNC(=O)[C@@H](N)C)C(O)=O)=CNC2=C1 SIGTYDNEPYEXGK-ZANVPECISA-N 0.000 description 1
- NIZKGBJVCMRDKO-KWQFWETISA-N Ala-Gly-Tyr Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 NIZKGBJVCMRDKO-KWQFWETISA-N 0.000 description 1
- ANGAOPNEPIDLPO-XVYDVKMFSA-N Ala-His-Cys Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CS)C(=O)O)N ANGAOPNEPIDLPO-XVYDVKMFSA-N 0.000 description 1
- HUUOZYZWNCXTFK-INTQDDNPSA-N Ala-His-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N2CCC[C@@H]2C(=O)O)N HUUOZYZWNCXTFK-INTQDDNPSA-N 0.000 description 1
- CBCCCLMNOBLBSC-XVYDVKMFSA-N Ala-His-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(O)=O CBCCCLMNOBLBSC-XVYDVKMFSA-N 0.000 description 1
- NJWJSLCQEDMGNC-MBLNEYKQSA-N Ala-His-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](C)N)O NJWJSLCQEDMGNC-MBLNEYKQSA-N 0.000 description 1
- NYDBKUNVSALYPX-NAKRPEOUSA-N Ala-Ile-Arg Chemical compound C[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CCCN=C(N)N NYDBKUNVSALYPX-NAKRPEOUSA-N 0.000 description 1
- IFKQPMZRDQZSHI-GHCJXIJMSA-N Ala-Ile-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O IFKQPMZRDQZSHI-GHCJXIJMSA-N 0.000 description 1
- VNYMOTCMNHJGTG-JBDRJPRFSA-N Ala-Ile-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O VNYMOTCMNHJGTG-JBDRJPRFSA-N 0.000 description 1
- LXAARTARZJJCMB-CIQUZCHMSA-N Ala-Ile-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LXAARTARZJJCMB-CIQUZCHMSA-N 0.000 description 1
- LBYMZCVBOKYZNS-CIUDSAMLSA-N Ala-Leu-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O LBYMZCVBOKYZNS-CIUDSAMLSA-N 0.000 description 1
- OYJCVIGKMXUVKB-GARJFASQSA-N Ala-Leu-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N OYJCVIGKMXUVKB-GARJFASQSA-N 0.000 description 1
- SOBIAADAMRHGKH-CIUDSAMLSA-N Ala-Leu-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O SOBIAADAMRHGKH-CIUDSAMLSA-N 0.000 description 1
- QUIGLPSHIFPEOV-CIUDSAMLSA-N Ala-Lys-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O QUIGLPSHIFPEOV-CIUDSAMLSA-N 0.000 description 1
- SDZRIBWEVVRDQI-CIUDSAMLSA-N Ala-Lys-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O SDZRIBWEVVRDQI-CIUDSAMLSA-N 0.000 description 1
- XHNLCGXYBXNRIS-BJDJZHNGSA-N Ala-Lys-Ile Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O XHNLCGXYBXNRIS-BJDJZHNGSA-N 0.000 description 1
- PMQXMXAASGFUDX-SRVKXCTJSA-N Ala-Lys-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](C)N)CCCCN PMQXMXAASGFUDX-SRVKXCTJSA-N 0.000 description 1
- MDNAVFBZPROEHO-DCAQKATOSA-N Ala-Lys-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O MDNAVFBZPROEHO-DCAQKATOSA-N 0.000 description 1
- XUCHENWTTBFODJ-FXQIFTODSA-N Ala-Met-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C)C(O)=O XUCHENWTTBFODJ-FXQIFTODSA-N 0.000 description 1
- XSTZMVAYYCJTNR-DCAQKATOSA-N Ala-Met-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(O)=O XSTZMVAYYCJTNR-DCAQKATOSA-N 0.000 description 1
- 108010011667 Ala-Phe-Ala Proteins 0.000 description 1
- BFMIRJBURUXDRG-DLOVCJGASA-N Ala-Phe-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=CC=C1 BFMIRJBURUXDRG-DLOVCJGASA-N 0.000 description 1
- KYDYGANDJHFBCW-DRZSPHRISA-N Ala-Phe-Gln Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N KYDYGANDJHFBCW-DRZSPHRISA-N 0.000 description 1
- ZBLQIYPCUWZSRZ-QEJZJMRPSA-N Ala-Phe-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](C)N)CC1=CC=CC=C1 ZBLQIYPCUWZSRZ-QEJZJMRPSA-N 0.000 description 1
- MAZZQZWCCYJQGZ-GUBZILKMSA-N Ala-Pro-Arg Chemical compound [H]N[C@@H](C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(O)=O MAZZQZWCCYJQGZ-GUBZILKMSA-N 0.000 description 1
- WQLDNOCHHRISMS-NAKRPEOUSA-N Ala-Pro-Ile Chemical compound [H]N[C@@H](C)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WQLDNOCHHRISMS-NAKRPEOUSA-N 0.000 description 1
- XWFWAXPOLRTDFZ-FXQIFTODSA-N Ala-Pro-Ser Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O XWFWAXPOLRTDFZ-FXQIFTODSA-N 0.000 description 1
- YHBDGLZYNIARKJ-GUBZILKMSA-N Ala-Pro-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](C)N YHBDGLZYNIARKJ-GUBZILKMSA-N 0.000 description 1
- DCVYRWFAMZFSDA-ZLUOBGJFSA-N Ala-Ser-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O DCVYRWFAMZFSDA-ZLUOBGJFSA-N 0.000 description 1
- YYAVDNKUWLAFCV-ACZMJKKPSA-N Ala-Ser-Gln Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(O)=O YYAVDNKUWLAFCV-ACZMJKKPSA-N 0.000 description 1
- NHWYNIZWLJYZAG-XVYDVKMFSA-N Ala-Ser-His Chemical compound C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N NHWYNIZWLJYZAG-XVYDVKMFSA-N 0.000 description 1
- HOVPGJUNRLMIOZ-CIUDSAMLSA-N Ala-Ser-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@H](C)N HOVPGJUNRLMIOZ-CIUDSAMLSA-N 0.000 description 1
- PEEYDECOOVQKRZ-DLOVCJGASA-N Ala-Ser-Phe Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O PEEYDECOOVQKRZ-DLOVCJGASA-N 0.000 description 1
- NZGRHTKZFSVPAN-BIIVOSGPSA-N Ala-Ser-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CO)C(=O)N1CCC[C@@H]1C(=O)O)N NZGRHTKZFSVPAN-BIIVOSGPSA-N 0.000 description 1
- WQKAQKZRDIZYNV-VZFHVOOUSA-N Ala-Ser-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O WQKAQKZRDIZYNV-VZFHVOOUSA-N 0.000 description 1
- SYIFFFHSXBNPMC-UWJYBYFXSA-N Ala-Ser-Tyr Chemical compound C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N SYIFFFHSXBNPMC-UWJYBYFXSA-N 0.000 description 1
- ARHJJAAWNWOACN-FXQIFTODSA-N Ala-Ser-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O ARHJJAAWNWOACN-FXQIFTODSA-N 0.000 description 1
- YMIYZAOBQDRCPP-UHFFFAOYSA-N Ala-Thr-Cys-Cys Chemical compound CC(N)C(=O)NC(C(O)C)C(=O)NC(CS)C(=O)NC(CS)C(O)=O YMIYZAOBQDRCPP-UHFFFAOYSA-N 0.000 description 1
- HCBKAOZYACJUEF-XQXXSGGOSA-N Ala-Thr-Gln Chemical compound N[C@@H](C)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CCC(N)=O)C(=O)O HCBKAOZYACJUEF-XQXXSGGOSA-N 0.000 description 1
- WNHNMKOFKCHKKD-BFHQHQDPSA-N Ala-Thr-Gly Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O WNHNMKOFKCHKKD-BFHQHQDPSA-N 0.000 description 1
- IOFVWPYSRSCWHI-JXUBOQSCSA-N Ala-Thr-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)O)NC(=O)[C@H](C)N IOFVWPYSRSCWHI-JXUBOQSCSA-N 0.000 description 1
- IETUUAHKCHOQHP-KZVJFYERSA-N Ala-Thr-Val Chemical compound CC(C)[C@H](NC(=O)[C@@H](NC(=O)[C@H](C)N)[C@@H](C)O)C(O)=O IETUUAHKCHOQHP-KZVJFYERSA-N 0.000 description 1
- XMIAMUXIMWREBJ-HERUPUMHSA-N Ala-Trp-Asn Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CC(=O)N)C(=O)O)N XMIAMUXIMWREBJ-HERUPUMHSA-N 0.000 description 1
- IDLBLNBDLCTPGC-HERUPUMHSA-N Ala-Trp-Cys Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CS)C(=O)O)N IDLBLNBDLCTPGC-HERUPUMHSA-N 0.000 description 1
- IEAUDUOCWNPZBR-LKTVYLICSA-N Ala-Trp-Gln Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N IEAUDUOCWNPZBR-LKTVYLICSA-N 0.000 description 1
- UBTKNYUAMYRMKE-GOPGUHFVSA-N Ala-Trp-His Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CC3=CN=CN3)C(=O)O)N UBTKNYUAMYRMKE-GOPGUHFVSA-N 0.000 description 1
- QDGMZAOSMNGBLP-MRFFXTKBSA-N Ala-Trp-Tyr Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CC3=CC=C(C=C3)O)C(=O)O)N QDGMZAOSMNGBLP-MRFFXTKBSA-N 0.000 description 1
- KLKARCOHVHLAJP-UWJYBYFXSA-N Ala-Tyr-Cys Chemical compound C[C@H](N)C(=O)N[C@@H](Cc1ccc(O)cc1)C(=O)N[C@@H](CS)C(O)=O KLKARCOHVHLAJP-UWJYBYFXSA-N 0.000 description 1
- BHFOJPDOQPWJRN-XDTLVQLUSA-N Ala-Tyr-Gln Chemical compound C[C@H](N)C(=O)N[C@@H](Cc1ccc(O)cc1)C(=O)N[C@@H](CCC(N)=O)C(O)=O BHFOJPDOQPWJRN-XDTLVQLUSA-N 0.000 description 1
- BGGAIXWIZCIFSG-XDTLVQLUSA-N Ala-Tyr-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O BGGAIXWIZCIFSG-XDTLVQLUSA-N 0.000 description 1
- QRIYOHQJRDHFKF-UWJYBYFXSA-N Ala-Tyr-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=C(O)C=C1 QRIYOHQJRDHFKF-UWJYBYFXSA-N 0.000 description 1
- YEBZNKPPOHFZJM-BPNCWPANSA-N Ala-Tyr-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(O)=O YEBZNKPPOHFZJM-BPNCWPANSA-N 0.000 description 1
- XSLGWYYNOSUMRM-ZKWXMUAHSA-N Ala-Val-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O XSLGWYYNOSUMRM-ZKWXMUAHSA-N 0.000 description 1
- YJHKTAMKPGFJCT-NRPADANISA-N Ala-Val-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O YJHKTAMKPGFJCT-NRPADANISA-N 0.000 description 1
- XKHLBBQNPSOGPI-GUBZILKMSA-N Ala-Val-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](C)N XKHLBBQNPSOGPI-GUBZILKMSA-N 0.000 description 1
- NLYYHIKRBRMAJV-AEJSXWLSSA-N Ala-Val-Pro Chemical compound C[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N NLYYHIKRBRMAJV-AEJSXWLSSA-N 0.000 description 1
- OMSKGWFGWCQFBD-KZVJFYERSA-N Ala-Val-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OMSKGWFGWCQFBD-KZVJFYERSA-N 0.000 description 1
- 208000019901 Anxiety disease Diseases 0.000 description 1
- VBFJESQBIWCWRL-DCAQKATOSA-N Arg-Ala-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCNC(N)=N VBFJESQBIWCWRL-DCAQKATOSA-N 0.000 description 1
- DBKNLHKEVPZVQC-LPEHRKFASA-N Arg-Ala-Pro Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](C)C(=O)N1CCC[C@@H]1C(O)=O DBKNLHKEVPZVQC-LPEHRKFASA-N 0.000 description 1
- GIVATXIGCXFQQA-FXQIFTODSA-N Arg-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCN=C(N)N GIVATXIGCXFQQA-FXQIFTODSA-N 0.000 description 1
- OLDOLPWZEMHNIA-PJODQICGSA-N Arg-Ala-Trp Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O OLDOLPWZEMHNIA-PJODQICGSA-N 0.000 description 1
- UXJCMQFPDWCHKX-DCAQKATOSA-N Arg-Arg-Glu Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCC(O)=O)C(O)=O UXJCMQFPDWCHKX-DCAQKATOSA-N 0.000 description 1
- JTKLCCFLSLCCST-SZMVWBNQSA-N Arg-Arg-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CCCN=C(N)N)N)C(O)=O)=CNC2=C1 JTKLCCFLSLCCST-SZMVWBNQSA-N 0.000 description 1
- NONSEUUPKITYQT-BQBZGAKWSA-N Arg-Asn-Gly Chemical compound C(C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)NCC(=O)O)N)CN=C(N)N NONSEUUPKITYQT-BQBZGAKWSA-N 0.000 description 1
- QPOARHANPULOTM-GMOBBJLQSA-N Arg-Asn-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCCN=C(N)N)N QPOARHANPULOTM-GMOBBJLQSA-N 0.000 description 1
- RWCLSUOSKWTXLA-FXQIFTODSA-N Arg-Asp-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O RWCLSUOSKWTXLA-FXQIFTODSA-N 0.000 description 1
- PQWTZSNVWSOFFK-FXQIFTODSA-N Arg-Asp-Asn Chemical compound C(C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N)CN=C(N)N PQWTZSNVWSOFFK-FXQIFTODSA-N 0.000 description 1
- DXQIQUIQYAGRCC-CIUDSAMLSA-N Arg-Asp-Gln Chemical compound C(C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N)CN=C(N)N DXQIQUIQYAGRCC-CIUDSAMLSA-N 0.000 description 1
- YFBGNGASPGRWEM-DCAQKATOSA-N Arg-Asp-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCCN=C(N)N)N YFBGNGASPGRWEM-DCAQKATOSA-N 0.000 description 1
- OTCJMMRQBVDQRK-DCAQKATOSA-N Arg-Asp-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O OTCJMMRQBVDQRK-DCAQKATOSA-N 0.000 description 1
- SQKPKIJVWHAWNF-DCAQKATOSA-N Arg-Asp-Lys Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(O)=O SQKPKIJVWHAWNF-DCAQKATOSA-N 0.000 description 1
- YSUVMPICYVWRBX-VEVYYDQMSA-N Arg-Asp-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O YSUVMPICYVWRBX-VEVYYDQMSA-N 0.000 description 1
- ASQYTJJWAMDISW-BPUTZDHNSA-N Arg-Asp-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCCN=C(N)N)N ASQYTJJWAMDISW-BPUTZDHNSA-N 0.000 description 1
- DQNLFLGFZAUIOW-FXQIFTODSA-N Arg-Cys-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CS)C(=O)N[C@@H](C)C(O)=O DQNLFLGFZAUIOW-FXQIFTODSA-N 0.000 description 1
- YUGFLWBWAJFGKY-BQBZGAKWSA-N Arg-Cys-Gly Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CS)C(=O)NCC(O)=O YUGFLWBWAJFGKY-BQBZGAKWSA-N 0.000 description 1
- XTGGTAWGUFXJSV-NAKRPEOUSA-N Arg-Cys-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CCCN=C(N)N)N XTGGTAWGUFXJSV-NAKRPEOUSA-N 0.000 description 1
- JVMKBJNSRZWDBO-FXQIFTODSA-N Arg-Cys-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CS)C(=O)N[C@@H](CO)C(O)=O JVMKBJNSRZWDBO-FXQIFTODSA-N 0.000 description 1
- JUWQNWXEGDYCIE-YUMQZZPRSA-N Arg-Gln-Gly Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O JUWQNWXEGDYCIE-YUMQZZPRSA-N 0.000 description 1
- VNFWDYWTSHFRRG-SRVKXCTJSA-N Arg-Gln-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O VNFWDYWTSHFRRG-SRVKXCTJSA-N 0.000 description 1
- QAODJPUKWNNNRP-DCAQKATOSA-N Arg-Glu-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O QAODJPUKWNNNRP-DCAQKATOSA-N 0.000 description 1
- PNQWAUXQDBIJDY-GUBZILKMSA-N Arg-Glu-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O PNQWAUXQDBIJDY-GUBZILKMSA-N 0.000 description 1
- PBSOQGZLPFVXPU-YUMQZZPRSA-N Arg-Glu-Gly Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O PBSOQGZLPFVXPU-YUMQZZPRSA-N 0.000 description 1
- UFBURHXMKFQVLM-CIUDSAMLSA-N Arg-Glu-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O UFBURHXMKFQVLM-CIUDSAMLSA-N 0.000 description 1
- AUFHLLPVPSMEOG-YUMQZZPRSA-N Arg-Gly-Glu Chemical compound NC(N)=NCCC[C@H](N)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O AUFHLLPVPSMEOG-YUMQZZPRSA-N 0.000 description 1
- RFXXUWGNVRJTNQ-QXEWZRGKSA-N Arg-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CCCN=C(N)N)N RFXXUWGNVRJTNQ-QXEWZRGKSA-N 0.000 description 1
- OQCWXQJLCDPRHV-UWVGGRQHSA-N Arg-Gly-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O OQCWXQJLCDPRHV-UWVGGRQHSA-N 0.000 description 1
- SYAUZLVLXCDRSH-IUCAKERBSA-N Arg-Gly-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CCCN=C(N)N)N SYAUZLVLXCDRSH-IUCAKERBSA-N 0.000 description 1
- VRZDJJWOFXMFRO-ZFWWWQNUSA-N Arg-Gly-Trp Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)NCC(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O VRZDJJWOFXMFRO-ZFWWWQNUSA-N 0.000 description 1
- NKNILFJYKKHBKE-WPRPVWTQSA-N Arg-Gly-Val Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O NKNILFJYKKHBKE-WPRPVWTQSA-N 0.000 description 1
- NVCIXQYNWYTLDO-IHRRRGAJSA-N Arg-His-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CCCN=C(N)N)N NVCIXQYNWYTLDO-IHRRRGAJSA-N 0.000 description 1
- HCIUUZGFTDTEGM-NAKRPEOUSA-N Arg-Ile-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N HCIUUZGFTDTEGM-NAKRPEOUSA-N 0.000 description 1
- OOIMKQRCPJBGPD-XUXIUFHCSA-N Arg-Ile-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O OOIMKQRCPJBGPD-XUXIUFHCSA-N 0.000 description 1
- GXXWTNKNFFKTJB-NAKRPEOUSA-N Arg-Ile-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O GXXWTNKNFFKTJB-NAKRPEOUSA-N 0.000 description 1
- ZDBWKBCKYJGKGP-DCAQKATOSA-N Arg-Leu-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O ZDBWKBCKYJGKGP-DCAQKATOSA-N 0.000 description 1
- LVMUGODRNHFGRA-AVGNSLFASA-N Arg-Leu-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O LVMUGODRNHFGRA-AVGNSLFASA-N 0.000 description 1
- WMEVEPXNCMKNGH-IHRRRGAJSA-N Arg-Leu-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N WMEVEPXNCMKNGH-IHRRRGAJSA-N 0.000 description 1
- COXMUHNBYCVVRG-DCAQKATOSA-N Arg-Leu-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O COXMUHNBYCVVRG-DCAQKATOSA-N 0.000 description 1
- JEOCWTUOMKEEMF-RHYQMDGZSA-N Arg-Leu-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O JEOCWTUOMKEEMF-RHYQMDGZSA-N 0.000 description 1
- NGTYEHIRESTSRX-UWVGGRQHSA-N Arg-Lys-Gly Chemical compound NCCCC[C@@H](C(=O)NCC(O)=O)NC(=O)[C@@H](N)CCCN=C(N)N NGTYEHIRESTSRX-UWVGGRQHSA-N 0.000 description 1
- NPAVRDPEFVKELR-DCAQKATOSA-N Arg-Lys-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O NPAVRDPEFVKELR-DCAQKATOSA-N 0.000 description 1
- PAPSMOYMQDWIOR-AVGNSLFASA-N Arg-Lys-Val Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O PAPSMOYMQDWIOR-AVGNSLFASA-N 0.000 description 1
- OMKZPCPZEFMBIT-SRVKXCTJSA-N Arg-Met-Arg Chemical compound NC(=N)NCCC[C@H](N)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O OMKZPCPZEFMBIT-SRVKXCTJSA-N 0.000 description 1
- HIMXTOIXVXWHTB-DCAQKATOSA-N Arg-Met-Gln Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N HIMXTOIXVXWHTB-DCAQKATOSA-N 0.000 description 1
- VIINVRPKMUZYOI-DCAQKATOSA-N Arg-Met-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCC(O)=O)C(O)=O VIINVRPKMUZYOI-DCAQKATOSA-N 0.000 description 1
- YLVGUOGAFAJMKP-JYJNAYRXSA-N Arg-Met-Tyr Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N YLVGUOGAFAJMKP-JYJNAYRXSA-N 0.000 description 1
- NIELFHOLFTUZME-HJWJTTGWSA-N Arg-Phe-Ile Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O NIELFHOLFTUZME-HJWJTTGWSA-N 0.000 description 1
- UGZUVYDKAYNCII-ULQDDVLXSA-N Arg-Phe-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O UGZUVYDKAYNCII-ULQDDVLXSA-N 0.000 description 1
- AOHKLEBWKMKITA-IHRRRGAJSA-N Arg-Phe-Ser Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N AOHKLEBWKMKITA-IHRRRGAJSA-N 0.000 description 1
- WKPXXXUSUHAXDE-SRVKXCTJSA-N Arg-Pro-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCN=C(N)N)C(O)=O WKPXXXUSUHAXDE-SRVKXCTJSA-N 0.000 description 1
- DNBMCNQKNOKOSD-DCAQKATOSA-N Arg-Pro-Gln Chemical compound NC(N)=NCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O DNBMCNQKNOKOSD-DCAQKATOSA-N 0.000 description 1
- ATABBWFGOHKROJ-GUBZILKMSA-N Arg-Pro-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O ATABBWFGOHKROJ-GUBZILKMSA-N 0.000 description 1
- OWSMKCJUBAPHED-JYJNAYRXSA-N Arg-Pro-Tyr Chemical compound NC(N)=NCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 OWSMKCJUBAPHED-JYJNAYRXSA-N 0.000 description 1
- VENMDXUVHSKEIN-GUBZILKMSA-N Arg-Ser-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O VENMDXUVHSKEIN-GUBZILKMSA-N 0.000 description 1
- AUIJUTGLPVHIRT-FXQIFTODSA-N Arg-Ser-Cys Chemical compound C(C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)O)N)CN=C(N)N AUIJUTGLPVHIRT-FXQIFTODSA-N 0.000 description 1
- DNLQVHBBMPZUGJ-BQBZGAKWSA-N Arg-Ser-Gly Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O DNLQVHBBMPZUGJ-BQBZGAKWSA-N 0.000 description 1
- URAUIUGLHBRPMF-NAKRPEOUSA-N Arg-Ser-Ile Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O URAUIUGLHBRPMF-NAKRPEOUSA-N 0.000 description 1
- KMFPQTITXUKJOV-DCAQKATOSA-N Arg-Ser-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O KMFPQTITXUKJOV-DCAQKATOSA-N 0.000 description 1
- FBXMCPLCVYUWBO-BPUTZDHNSA-N Arg-Ser-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCCN=C(N)N)N FBXMCPLCVYUWBO-BPUTZDHNSA-N 0.000 description 1
- BECXEHHOZNFFFX-IHRRRGAJSA-N Arg-Ser-Tyr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O BECXEHHOZNFFFX-IHRRRGAJSA-N 0.000 description 1
- RYQSYXFGFOTJDJ-RHYQMDGZSA-N Arg-Thr-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O RYQSYXFGFOTJDJ-RHYQMDGZSA-N 0.000 description 1
- DDBMKOCQWNFDBH-RHYQMDGZSA-N Arg-Thr-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N)O DDBMKOCQWNFDBH-RHYQMDGZSA-N 0.000 description 1
- OGZBJJLRKQZRHL-KJEVXHAQSA-N Arg-Thr-Tyr Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 OGZBJJLRKQZRHL-KJEVXHAQSA-N 0.000 description 1
- XRNXPIGJPQHCPC-RCWTZXSCSA-N Arg-Thr-Val Chemical compound CC(C)[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)CCCNC(N)=N)[C@@H](C)O)C(O)=O XRNXPIGJPQHCPC-RCWTZXSCSA-N 0.000 description 1
- FSPQNLYOFCXUCE-BPUTZDHNSA-N Arg-Trp-Asn Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N FSPQNLYOFCXUCE-BPUTZDHNSA-N 0.000 description 1
- CTAPSNCVKPOOSM-KKUMJFAQSA-N Arg-Tyr-Gln Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O CTAPSNCVKPOOSM-KKUMJFAQSA-N 0.000 description 1
- LFWOQHSQNCKXRU-UFYCRDLUSA-N Arg-Tyr-Phe Chemical compound C([C@H](NC(=O)[C@H](CCCN=C(N)N)N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=C(O)C=C1 LFWOQHSQNCKXRU-UFYCRDLUSA-N 0.000 description 1
- QHUOOCKNNURZSL-IHRRRGAJSA-N Arg-Tyr-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(O)=O QHUOOCKNNURZSL-IHRRRGAJSA-N 0.000 description 1
- IZSMEUDYADKZTJ-KJEVXHAQSA-N Arg-Tyr-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O IZSMEUDYADKZTJ-KJEVXHAQSA-N 0.000 description 1
- PSUXEQYPYZLNER-QXEWZRGKSA-N Arg-Val-Asn Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O PSUXEQYPYZLNER-QXEWZRGKSA-N 0.000 description 1
- QTAIIXQCOPUNBQ-QXEWZRGKSA-N Arg-Val-Asp Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O QTAIIXQCOPUNBQ-QXEWZRGKSA-N 0.000 description 1
- ULBHWNVWSCJLCO-NHCYSSNCSA-N Arg-Val-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCN=C(N)N ULBHWNVWSCJLCO-NHCYSSNCSA-N 0.000 description 1
- VYZBPPBKFCHCIS-WPRPVWTQSA-N Arg-Val-Gly Chemical compound OC(=O)CNC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCN=C(N)N VYZBPPBKFCHCIS-WPRPVWTQSA-N 0.000 description 1
- XEOXPCNONWHHSW-AVGNSLFASA-N Arg-Val-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N XEOXPCNONWHHSW-AVGNSLFASA-N 0.000 description 1
- WTUZDHWWGUQEKN-SRVKXCTJSA-N Arg-Val-Met Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCSC)C(O)=O WTUZDHWWGUQEKN-SRVKXCTJSA-N 0.000 description 1
- CPTXATAOUQJQRO-GUBZILKMSA-N Arg-Val-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O CPTXATAOUQJQRO-GUBZILKMSA-N 0.000 description 1
- QLSRIZIDQXDQHK-RCWTZXSCSA-N Arg-Val-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O QLSRIZIDQXDQHK-RCWTZXSCSA-N 0.000 description 1
- SWLOHUMCUDRTCL-ZLUOBGJFSA-N Asn-Ala-Asn Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N SWLOHUMCUDRTCL-ZLUOBGJFSA-N 0.000 description 1
- HZPSDHRYYIORKR-WHFBIAKZSA-N Asn-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H](N)CC(N)=O HZPSDHRYYIORKR-WHFBIAKZSA-N 0.000 description 1
- IARGXWMWRFOQPG-GCJQMDKQSA-N Asn-Ala-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O IARGXWMWRFOQPG-GCJQMDKQSA-N 0.000 description 1
- QEYJFBMTSMLPKZ-ZKWXMUAHSA-N Asn-Ala-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O QEYJFBMTSMLPKZ-ZKWXMUAHSA-N 0.000 description 1
- IOTKDTZEEBZNCM-UGYAYLCHSA-N Asn-Asn-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O IOTKDTZEEBZNCM-UGYAYLCHSA-N 0.000 description 1
- KXFCBAHYSLJCCY-ZLUOBGJFSA-N Asn-Asn-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O KXFCBAHYSLJCCY-ZLUOBGJFSA-N 0.000 description 1
- PIWWUBYJNONVTJ-ZLUOBGJFSA-N Asn-Asp-Asn Chemical compound C([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N)C(=O)N PIWWUBYJNONVTJ-ZLUOBGJFSA-N 0.000 description 1
- UGXVKHRDGLYFKR-CIUDSAMLSA-N Asn-Asp-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC(N)=O UGXVKHRDGLYFKR-CIUDSAMLSA-N 0.000 description 1
- ZPMNECSEJXXNBE-CIUDSAMLSA-N Asn-Cys-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(O)=O ZPMNECSEJXXNBE-CIUDSAMLSA-N 0.000 description 1
- QGNXYDHVERJIAY-ACZMJKKPSA-N Asn-Gln-Cys Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC(=O)N)N QGNXYDHVERJIAY-ACZMJKKPSA-N 0.000 description 1
- KUYKVGODHGHFDI-ACZMJKKPSA-N Asn-Gln-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O KUYKVGODHGHFDI-ACZMJKKPSA-N 0.000 description 1
- DDPXDCKYWDGZAL-BQBZGAKWSA-N Asn-Gly-Arg Chemical compound NC(=O)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCN=C(N)N DDPXDCKYWDGZAL-BQBZGAKWSA-N 0.000 description 1
- OLVIPTLKNSAYRJ-YUMQZZPRSA-N Asn-Gly-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CC(=O)N)N OLVIPTLKNSAYRJ-YUMQZZPRSA-N 0.000 description 1
- JQSWHKKUZMTOIH-QWRGUYRKSA-N Asn-Gly-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CC(=O)N)N JQSWHKKUZMTOIH-QWRGUYRKSA-N 0.000 description 1
- UDSVWSUXKYXSTR-QWRGUYRKSA-N Asn-Gly-Tyr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O UDSVWSUXKYXSTR-QWRGUYRKSA-N 0.000 description 1
- AITGTTNYKAWKDR-CIUDSAMLSA-N Asn-His-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(O)=O AITGTTNYKAWKDR-CIUDSAMLSA-N 0.000 description 1
- PTSDPWIHOYMRGR-UGYAYLCHSA-N Asn-Ile-Asn Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O PTSDPWIHOYMRGR-UGYAYLCHSA-N 0.000 description 1
- XVBDDUPJVQXDSI-PEFMBERDSA-N Asn-Ile-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CC(=O)N)N XVBDDUPJVQXDSI-PEFMBERDSA-N 0.000 description 1
- YYSYDIYQTUPNQQ-SXTJYALSSA-N Asn-Ile-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O YYSYDIYQTUPNQQ-SXTJYALSSA-N 0.000 description 1
- LTZIRYMWOJHRCH-GUDRVLHUSA-N Asn-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)N)N LTZIRYMWOJHRCH-GUDRVLHUSA-N 0.000 description 1
- SEKBHZJLARBNPB-GHCJXIJMSA-N Asn-Ile-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O SEKBHZJLARBNPB-GHCJXIJMSA-N 0.000 description 1
- PNHQRQTVBRDIEF-CIUDSAMLSA-N Asn-Leu-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC(=O)N)N PNHQRQTVBRDIEF-CIUDSAMLSA-N 0.000 description 1
- HFPXZWPUVFVNLL-GUBZILKMSA-N Asn-Leu-Gln Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O HFPXZWPUVFVNLL-GUBZILKMSA-N 0.000 description 1
- HDHZCEDPLTVHFZ-GUBZILKMSA-N Asn-Leu-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O HDHZCEDPLTVHFZ-GUBZILKMSA-N 0.000 description 1
- GLWFAWNYGWBMOC-SRVKXCTJSA-N Asn-Leu-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O GLWFAWNYGWBMOC-SRVKXCTJSA-N 0.000 description 1
- YVXRYLVELQYAEQ-SRVKXCTJSA-N Asn-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)N)N YVXRYLVELQYAEQ-SRVKXCTJSA-N 0.000 description 1
- FHETWELNCBMRMG-HJGDQZAQSA-N Asn-Leu-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O FHETWELNCBMRMG-HJGDQZAQSA-N 0.000 description 1
- FTSAJSADJCMDHH-CIUDSAMLSA-N Asn-Lys-Asp Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CC(=O)N)N FTSAJSADJCMDHH-CIUDSAMLSA-N 0.000 description 1
- NYGILGUOUOXGMJ-YUMQZZPRSA-N Asn-Lys-Gly Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O NYGILGUOUOXGMJ-YUMQZZPRSA-N 0.000 description 1
- JWKDQOORUCYUIW-ZPFDUUQYSA-N Asn-Lys-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JWKDQOORUCYUIW-ZPFDUUQYSA-N 0.000 description 1
- ORJQQZIXTOYGGH-SRVKXCTJSA-N Asn-Lys-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O ORJQQZIXTOYGGH-SRVKXCTJSA-N 0.000 description 1
- HGGIYWURFPGLIU-FXQIFTODSA-N Asn-Met-Cys Chemical compound SC[C@@H](C(O)=O)NC(=O)[C@H](CCSC)NC(=O)[C@@H](N)CC(N)=O HGGIYWURFPGLIU-FXQIFTODSA-N 0.000 description 1
- CDGHMJJJHYKMPA-DLOVCJGASA-N Asn-Phe-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](CC(=O)N)N CDGHMJJJHYKMPA-DLOVCJGASA-N 0.000 description 1
- BSBNNPICFPXDNH-SRVKXCTJSA-N Asn-Phe-Cys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC(=O)N)N BSBNNPICFPXDNH-SRVKXCTJSA-N 0.000 description 1
- PPCORQFLAZWUNO-QWRGUYRKSA-N Asn-Phe-Gly Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CC(=O)N)N PPCORQFLAZWUNO-QWRGUYRKSA-N 0.000 description 1
- RVHGJNGNKGDCPX-KKUMJFAQSA-N Asn-Phe-Lys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)N)N RVHGJNGNKGDCPX-KKUMJFAQSA-N 0.000 description 1
- ZJIFRAPZHAGLGR-MELADBBJSA-N Asn-Phe-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)NC(=O)[C@H](CC(=O)N)N)C(=O)O ZJIFRAPZHAGLGR-MELADBBJSA-N 0.000 description 1
- BKFXFUPYETWGGA-XVSYOHENSA-N Asn-Phe-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O BKFXFUPYETWGGA-XVSYOHENSA-N 0.000 description 1
- UOUHBHOBGDCQPQ-IHPCNDPISA-N Asn-Phe-Trp Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CNC3=CC=CC=C32)C(=O)O)NC(=O)[C@H](CC(=O)N)N UOUHBHOBGDCQPQ-IHPCNDPISA-N 0.000 description 1
- JTXVXGXTRXMOFJ-FXQIFTODSA-N Asn-Pro-Asn Chemical compound NC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O JTXVXGXTRXMOFJ-FXQIFTODSA-N 0.000 description 1
- YUOXLJYVSZYPBJ-CIUDSAMLSA-N Asn-Pro-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O YUOXLJYVSZYPBJ-CIUDSAMLSA-N 0.000 description 1
- GKKUBLFXKRDMFC-BQBZGAKWSA-N Asn-Pro-Gly Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O GKKUBLFXKRDMFC-BQBZGAKWSA-N 0.000 description 1
- VCJCPARXDBEGNE-GUBZILKMSA-N Asn-Pro-Pro Chemical compound NC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 VCJCPARXDBEGNE-GUBZILKMSA-N 0.000 description 1
- VHQSGALUSWIYOD-QXEWZRGKSA-N Asn-Pro-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O VHQSGALUSWIYOD-QXEWZRGKSA-N 0.000 description 1
- GZXOUBTUAUAVHD-ACZMJKKPSA-N Asn-Ser-Glu Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O GZXOUBTUAUAVHD-ACZMJKKPSA-N 0.000 description 1
- UGXYFDQFLVCDFC-CIUDSAMLSA-N Asn-Ser-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC(N)=O UGXYFDQFLVCDFC-CIUDSAMLSA-N 0.000 description 1
- VLDRQOHCMKCXLY-SRVKXCTJSA-N Asn-Ser-Phe Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O VLDRQOHCMKCXLY-SRVKXCTJSA-N 0.000 description 1
- MYTHOBCLNIOFBL-SRVKXCTJSA-N Asn-Ser-Tyr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O MYTHOBCLNIOFBL-SRVKXCTJSA-N 0.000 description 1
- NCXTYSVDWLAQGZ-ZKWXMUAHSA-N Asn-Ser-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC(N)=O NCXTYSVDWLAQGZ-ZKWXMUAHSA-N 0.000 description 1
- HPASIOLTWSNMFB-OLHMAJIHSA-N Asn-Thr-Asp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(O)=O HPASIOLTWSNMFB-OLHMAJIHSA-N 0.000 description 1
- XIDSGDJNUJRUHE-VEVYYDQMSA-N Asn-Thr-Met Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCSC)C(O)=O XIDSGDJNUJRUHE-VEVYYDQMSA-N 0.000 description 1
- UXHYOWXTJLBEPG-GSSVUCPTSA-N Asn-Thr-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O UXHYOWXTJLBEPG-GSSVUCPTSA-N 0.000 description 1
- KZYSHAMXEBPJBD-JRQIVUDYSA-N Asn-Thr-Tyr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O KZYSHAMXEBPJBD-JRQIVUDYSA-N 0.000 description 1
- RDLYUKRPEJERMM-XIRDDKMYSA-N Asn-Trp-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC(C)C)C(O)=O RDLYUKRPEJERMM-XIRDDKMYSA-N 0.000 description 1
- JPPLRQVZMZFOSX-UWJYBYFXSA-N Asn-Tyr-Ala Chemical compound NC(=O)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C)C(O)=O)CC1=CC=C(O)C=C1 JPPLRQVZMZFOSX-UWJYBYFXSA-N 0.000 description 1
- SKQTXVZTCGSRJS-SRVKXCTJSA-N Asn-Tyr-Asp Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CC(=O)N)N)O SKQTXVZTCGSRJS-SRVKXCTJSA-N 0.000 description 1
- BEHQTVDBCLSCBY-CFMVVWHZSA-N Asn-Tyr-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O BEHQTVDBCLSCBY-CFMVVWHZSA-N 0.000 description 1
- QNNBHTFDFFFHGC-KKUMJFAQSA-N Asn-Tyr-Lys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)N)N)O QNNBHTFDFFFHGC-KKUMJFAQSA-N 0.000 description 1
- XLDMSQYOYXINSZ-QXEWZRGKSA-N Asn-Val-Arg Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N XLDMSQYOYXINSZ-QXEWZRGKSA-N 0.000 description 1
- MJIJBEYEHBKTIM-BYULHYEWSA-N Asn-Val-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N MJIJBEYEHBKTIM-BYULHYEWSA-N 0.000 description 1
- JNCRAQVYJZGIOW-QSFUFRPTSA-N Asn-Val-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JNCRAQVYJZGIOW-QSFUFRPTSA-N 0.000 description 1
- XOQYDFCQPWAMSA-KKHAAJSZSA-N Asn-Val-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XOQYDFCQPWAMSA-KKHAAJSZSA-N 0.000 description 1
- DBLPNHGKMDHWNZ-UHFFFAOYSA-N Asp Gly Arg Asn Chemical compound OC(=O)CC(N)C(=O)NCC(=O)NC(CCCN=C(N)N)C(=O)NC(CC(N)=O)C(O)=O DBLPNHGKMDHWNZ-UHFFFAOYSA-N 0.000 description 1
- KRXIWXCXOARFNT-ZLUOBGJFSA-N Asp-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC(O)=O KRXIWXCXOARFNT-ZLUOBGJFSA-N 0.000 description 1
- XBQSLMACWDXWLJ-GHCJXIJMSA-N Asp-Ala-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O XBQSLMACWDXWLJ-GHCJXIJMSA-N 0.000 description 1
- NECWUSYTYSIFNC-DLOVCJGASA-N Asp-Ala-Phe Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 NECWUSYTYSIFNC-DLOVCJGASA-N 0.000 description 1
- BLQBMRNMBAYREH-UWJYBYFXSA-N Asp-Ala-Tyr Chemical compound N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O BLQBMRNMBAYREH-UWJYBYFXSA-N 0.000 description 1
- RGKKALNPOYURGE-ZKWXMUAHSA-N Asp-Ala-Val Chemical compound N[C@@H](CC(=O)O)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)O RGKKALNPOYURGE-ZKWXMUAHSA-N 0.000 description 1
- GVPSCJQLUGIKAM-GUBZILKMSA-N Asp-Arg-Arg Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O GVPSCJQLUGIKAM-GUBZILKMSA-N 0.000 description 1
- QHAJMRDEWNAIBQ-FXQIFTODSA-N Asp-Arg-Asn Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(O)=O QHAJMRDEWNAIBQ-FXQIFTODSA-N 0.000 description 1
- ZLGKHJHFYSRUBH-FXQIFTODSA-N Asp-Arg-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O ZLGKHJHFYSRUBH-FXQIFTODSA-N 0.000 description 1
- SOYOSFXLXYZNRG-CIUDSAMLSA-N Asp-Arg-Gln Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(O)=O SOYOSFXLXYZNRG-CIUDSAMLSA-N 0.000 description 1
- WSOKZUVWBXVJHX-CIUDSAMLSA-N Asp-Arg-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O WSOKZUVWBXVJHX-CIUDSAMLSA-N 0.000 description 1
- AXXCUABIFZPKPM-BQBZGAKWSA-N Asp-Arg-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O AXXCUABIFZPKPM-BQBZGAKWSA-N 0.000 description 1
- HMQDRBKQMLRCCG-GMOBBJLQSA-N Asp-Arg-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O HMQDRBKQMLRCCG-GMOBBJLQSA-N 0.000 description 1
- ATYWBXGNXZYZGI-ACZMJKKPSA-N Asp-Asn-Gln Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O ATYWBXGNXZYZGI-ACZMJKKPSA-N 0.000 description 1
- GWTLRDMPMJCNMH-WHFBIAKZSA-N Asp-Asn-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O GWTLRDMPMJCNMH-WHFBIAKZSA-N 0.000 description 1
- RDRMWJBLOSRRAW-BYULHYEWSA-N Asp-Asn-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O RDRMWJBLOSRRAW-BYULHYEWSA-N 0.000 description 1
- NAPNAGZWHQHZLG-ZLUOBGJFSA-N Asp-Asp-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC(=O)O)N NAPNAGZWHQHZLG-ZLUOBGJFSA-N 0.000 description 1
- JGDBHIVECJGXJA-FXQIFTODSA-N Asp-Asp-Arg Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O JGDBHIVECJGXJA-FXQIFTODSA-N 0.000 description 1
- FRSGNOZCTWDVFZ-ACZMJKKPSA-N Asp-Asp-Gln Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O FRSGNOZCTWDVFZ-ACZMJKKPSA-N 0.000 description 1
- WCFCYFDBMNFSPA-ACZMJKKPSA-N Asp-Asp-Glu Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCC(O)=O WCFCYFDBMNFSPA-ACZMJKKPSA-N 0.000 description 1
- QQXOYLWJQUPXJU-WHFBIAKZSA-N Asp-Cys-Gly Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CS)C(=O)NCC(O)=O QQXOYLWJQUPXJU-WHFBIAKZSA-N 0.000 description 1
- WLKVEEODTPQPLI-ACZMJKKPSA-N Asp-Gln-Asn Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O WLKVEEODTPQPLI-ACZMJKKPSA-N 0.000 description 1
- YBMUFUWSMIKJQA-GUBZILKMSA-N Asp-Gln-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC(=O)O)N YBMUFUWSMIKJQA-GUBZILKMSA-N 0.000 description 1
- OEUQMKNNOWJREN-AVGNSLFASA-N Asp-Gln-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC(=O)O)N OEUQMKNNOWJREN-AVGNSLFASA-N 0.000 description 1
- ZSJFGGSPCCHMNE-LAEOZQHASA-N Asp-Gln-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC(=O)O)N ZSJFGGSPCCHMNE-LAEOZQHASA-N 0.000 description 1
- VAWNQIGQPUOPQW-ACZMJKKPSA-N Asp-Glu-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O VAWNQIGQPUOPQW-ACZMJKKPSA-N 0.000 description 1
- IJHUZMGJRGNXIW-CIUDSAMLSA-N Asp-Glu-Arg Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O IJHUZMGJRGNXIW-CIUDSAMLSA-N 0.000 description 1
- XJQRWGXKUSDEFI-ACZMJKKPSA-N Asp-Glu-Asn Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O XJQRWGXKUSDEFI-ACZMJKKPSA-N 0.000 description 1
- HSWYMWGDMPLTTH-FXQIFTODSA-N Asp-Glu-Gln Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O HSWYMWGDMPLTTH-FXQIFTODSA-N 0.000 description 1
- VFUXXFVCYZPOQG-WDSKDSINSA-N Asp-Glu-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O VFUXXFVCYZPOQG-WDSKDSINSA-N 0.000 description 1
- PDECQIHABNQRHN-GUBZILKMSA-N Asp-Glu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC(O)=O PDECQIHABNQRHN-GUBZILKMSA-N 0.000 description 1
- YDJVIBMKAMQPPP-LAEOZQHASA-N Asp-Glu-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O YDJVIBMKAMQPPP-LAEOZQHASA-N 0.000 description 1
- JUWZKMBALYLZCK-WHFBIAKZSA-N Asp-Gly-Asn Chemical compound OC(=O)C[C@H](N)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O JUWZKMBALYLZCK-WHFBIAKZSA-N 0.000 description 1
- BIVYLQMZPHDUIH-WHFBIAKZSA-N Asp-Gly-Cys Chemical compound C([C@@H](C(=O)NCC(=O)N[C@@H](CS)C(=O)O)N)C(=O)O BIVYLQMZPHDUIH-WHFBIAKZSA-N 0.000 description 1
- HAFCJCDJGIOYPW-WDSKDSINSA-N Asp-Gly-Gln Chemical compound OC(=O)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(N)=O HAFCJCDJGIOYPW-WDSKDSINSA-N 0.000 description 1
- ZSVJVIOVABDTTL-YUMQZZPRSA-N Asp-Gly-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CC(=O)O)N ZSVJVIOVABDTTL-YUMQZZPRSA-N 0.000 description 1
- POTCZYQVVNXUIG-BQBZGAKWSA-N Asp-Gly-Pro Chemical compound OC(=O)C[C@H](N)C(=O)NCC(=O)N1CCC[C@H]1C(O)=O POTCZYQVVNXUIG-BQBZGAKWSA-N 0.000 description 1
- SNDBKTFJWVEVPO-WHFBIAKZSA-N Asp-Gly-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H](CO)C(O)=O SNDBKTFJWVEVPO-WHFBIAKZSA-N 0.000 description 1
- CMCIMCAQIULNDJ-CIUDSAMLSA-N Asp-His-Cys Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC(=O)O)N CMCIMCAQIULNDJ-CIUDSAMLSA-N 0.000 description 1
- TVIZQBFURPLQDV-DJFWLOJKSA-N Asp-His-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CC(=O)O)N TVIZQBFURPLQDV-DJFWLOJKSA-N 0.000 description 1
- TZOZNVLBTAFJRW-UGYAYLCHSA-N Asp-Ile-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CC(=O)O)N TZOZNVLBTAFJRW-UGYAYLCHSA-N 0.000 description 1
- NHSDEZURHWEZPN-SXTJYALSSA-N Asp-Ile-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)NC(=O)[C@H](CC(=O)O)N NHSDEZURHWEZPN-SXTJYALSSA-N 0.000 description 1
- YFSLJHLQOALGSY-ZPFDUUQYSA-N Asp-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)O)N YFSLJHLQOALGSY-ZPFDUUQYSA-N 0.000 description 1
- LDLZOAJRXXBVGF-GMOBBJLQSA-N Asp-Ile-Met Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CC(=O)O)N LDLZOAJRXXBVGF-GMOBBJLQSA-N 0.000 description 1
- PYXXJFRXIYAESU-PCBIJLKTSA-N Asp-Ile-Phe Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O PYXXJFRXIYAESU-PCBIJLKTSA-N 0.000 description 1
- SPKCGKRUYKMDHP-GUDRVLHUSA-N Asp-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)O)N SPKCGKRUYKMDHP-GUDRVLHUSA-N 0.000 description 1
- JTRDJYIZIKCIRC-AJNGGQMLSA-N Asp-Leu-Leu-Gln Chemical compound CC(C)C[C@H](NC(=O)[C@@H](N)CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O JTRDJYIZIKCIRC-AJNGGQMLSA-N 0.000 description 1
- CJUKAWUWBZCTDQ-SRVKXCTJSA-N Asp-Leu-Lys Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(O)=O CJUKAWUWBZCTDQ-SRVKXCTJSA-N 0.000 description 1
- ORRJQLIATJDMQM-HJGDQZAQSA-N Asp-Leu-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC(O)=O ORRJQLIATJDMQM-HJGDQZAQSA-N 0.000 description 1
- MYOHQBFRJQFIDZ-KKUMJFAQSA-N Asp-Leu-Tyr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O MYOHQBFRJQFIDZ-KKUMJFAQSA-N 0.000 description 1
- QNMKWNONJGKJJC-NHCYSSNCSA-N Asp-Leu-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O QNMKWNONJGKJJC-NHCYSSNCSA-N 0.000 description 1
- FQHBAQLBIXLWAG-DCAQKATOSA-N Asp-Lys-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC(=O)O)N FQHBAQLBIXLWAG-DCAQKATOSA-N 0.000 description 1
- DPNWSMBUYCLEDG-CIUDSAMLSA-N Asp-Lys-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O DPNWSMBUYCLEDG-CIUDSAMLSA-N 0.000 description 1
- ZXRQJQCXPSMNMR-XIRDDKMYSA-N Asp-Lys-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC(=O)O)N ZXRQJQCXPSMNMR-XIRDDKMYSA-N 0.000 description 1
- MYLZFUMPZCPJCJ-NHCYSSNCSA-N Asp-Lys-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O MYLZFUMPZCPJCJ-NHCYSSNCSA-N 0.000 description 1
- SARSTIZOZFBDOM-FXQIFTODSA-N Asp-Met-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C)C(O)=O SARSTIZOZFBDOM-FXQIFTODSA-N 0.000 description 1
- JXGJJQJHXHXJQF-CIUDSAMLSA-N Asp-Met-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCC(O)=O)C(O)=O JXGJJQJHXHXJQF-CIUDSAMLSA-N 0.000 description 1
- SAKCBXNPWDRWPE-BQBZGAKWSA-N Asp-Met-Gly Chemical compound CSCC[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CC(=O)O)N SAKCBXNPWDRWPE-BQBZGAKWSA-N 0.000 description 1
- RRUWMFBLFLUZSI-LPEHRKFASA-N Asp-Met-Pro Chemical compound CSCC[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)O)N RRUWMFBLFLUZSI-LPEHRKFASA-N 0.000 description 1
- DJCAHYVLMSRBFR-QXEWZRGKSA-N Asp-Met-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CCSC)NC(=O)[C@@H](N)CC(O)=O DJCAHYVLMSRBFR-QXEWZRGKSA-N 0.000 description 1
- WZUZGDANRQPCDD-SRVKXCTJSA-N Asp-Phe-Cys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC(=O)O)N WZUZGDANRQPCDD-SRVKXCTJSA-N 0.000 description 1
- YRZIYQGXTSBRLT-AVGNSLFASA-N Asp-Phe-Gln Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O YRZIYQGXTSBRLT-AVGNSLFASA-N 0.000 description 1
- RPUYTJJZXQBWDT-SRVKXCTJSA-N Asp-Phe-Ser Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC(=O)O)N RPUYTJJZXQBWDT-SRVKXCTJSA-N 0.000 description 1
- GPPIDDWYKJPRES-YDHLFZDLSA-N Asp-Phe-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(O)=O GPPIDDWYKJPRES-YDHLFZDLSA-N 0.000 description 1
- HJZLUGQGJWXJCJ-CIUDSAMLSA-N Asp-Pro-Gln Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O HJZLUGQGJWXJCJ-CIUDSAMLSA-N 0.000 description 1
- AHWRSSLYSGLBGD-CIUDSAMLSA-N Asp-Pro-Glu Chemical compound OC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O AHWRSSLYSGLBGD-CIUDSAMLSA-N 0.000 description 1
- YFGUZQQCSDZRBN-DCAQKATOSA-N Asp-Pro-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O YFGUZQQCSDZRBN-DCAQKATOSA-N 0.000 description 1
- UAXIKORUDGGIGA-DCAQKATOSA-N Asp-Pro-Lys Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CC(=O)O)N)C(=O)N[C@@H](CCCCN)C(=O)O UAXIKORUDGGIGA-DCAQKATOSA-N 0.000 description 1
- BKOIIURTQAJHAT-GUBZILKMSA-N Asp-Pro-Pro Chemical compound OC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 BKOIIURTQAJHAT-GUBZILKMSA-N 0.000 description 1
- FAUPLTGRUBTXNU-FXQIFTODSA-N Asp-Pro-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O FAUPLTGRUBTXNU-FXQIFTODSA-N 0.000 description 1
- CUQDCPXNZPDYFQ-ZLUOBGJFSA-N Asp-Ser-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O CUQDCPXNZPDYFQ-ZLUOBGJFSA-N 0.000 description 1
- BRRPVTUFESPTCP-ACZMJKKPSA-N Asp-Ser-Glu Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O BRRPVTUFESPTCP-ACZMJKKPSA-N 0.000 description 1
- DRCOAZZDQRCGGP-GHCJXIJMSA-N Asp-Ser-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O DRCOAZZDQRCGGP-GHCJXIJMSA-N 0.000 description 1
- QSFHZPQUAAQHAQ-CIUDSAMLSA-N Asp-Ser-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O QSFHZPQUAAQHAQ-CIUDSAMLSA-N 0.000 description 1
- OFYVKOXTTDCUIL-FXQIFTODSA-N Asp-Ser-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CC(=O)O)N OFYVKOXTTDCUIL-FXQIFTODSA-N 0.000 description 1
- GWWSUMLEWKQHLR-NUMRIWBASA-N Asp-Thr-Glu Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CC(=O)O)N)O GWWSUMLEWKQHLR-NUMRIWBASA-N 0.000 description 1
- ITGFVUYOLWBPQW-KKHAAJSZSA-N Asp-Thr-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O ITGFVUYOLWBPQW-KKHAAJSZSA-N 0.000 description 1
- NJLLRXWFPQQPHV-SRVKXCTJSA-N Asp-Tyr-Asn Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(O)=O NJLLRXWFPQQPHV-SRVKXCTJSA-N 0.000 description 1
- OYSYWMMZGJSQRB-AVGNSLFASA-N Asp-Tyr-Gln Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O OYSYWMMZGJSQRB-AVGNSLFASA-N 0.000 description 1
- KNDCWFXCFKSEBM-AVGNSLFASA-N Asp-Tyr-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O KNDCWFXCFKSEBM-AVGNSLFASA-N 0.000 description 1
- AWPWHMVCSISSQK-QWRGUYRKSA-N Asp-Tyr-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(O)=O AWPWHMVCSISSQK-QWRGUYRKSA-N 0.000 description 1
- OTKUAVXGMREHRX-CFMVVWHZSA-N Asp-Tyr-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CC(O)=O)CC1=CC=C(O)C=C1 OTKUAVXGMREHRX-CFMVVWHZSA-N 0.000 description 1
- NWAHPBGBDIFUFD-KKUMJFAQSA-N Asp-Tyr-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(O)=O NWAHPBGBDIFUFD-KKUMJFAQSA-N 0.000 description 1
- VHUKCUHLFMRHOD-MELADBBJSA-N Asp-Tyr-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=C(C=C2)O)NC(=O)[C@H](CC(=O)O)N)C(=O)O VHUKCUHLFMRHOD-MELADBBJSA-N 0.000 description 1
- BPAUXFVCSYQDQX-JRQIVUDYSA-N Asp-Tyr-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CC(=O)O)N)O BPAUXFVCSYQDQX-JRQIVUDYSA-N 0.000 description 1
- ALMIMUZAWTUNIO-BZSNNMDCSA-N Asp-Tyr-Tyr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O ALMIMUZAWTUNIO-BZSNNMDCSA-N 0.000 description 1
- BYLPQJAWXJWUCJ-YDHLFZDLSA-N Asp-Tyr-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(O)=O BYLPQJAWXJWUCJ-YDHLFZDLSA-N 0.000 description 1
- OQMGSMNZVHYDTQ-ZKWXMUAHSA-N Asp-Val-Cys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC(=O)O)N OQMGSMNZVHYDTQ-ZKWXMUAHSA-N 0.000 description 1
- MFDPBZAFCRKYEY-LAEOZQHASA-N Asp-Val-Gln Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O MFDPBZAFCRKYEY-LAEOZQHASA-N 0.000 description 1
- XMKXONRMGJXCJV-LAEOZQHASA-N Asp-Val-Glu Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O XMKXONRMGJXCJV-LAEOZQHASA-N 0.000 description 1
- GGBQDSHTXKQSLP-NHCYSSNCSA-N Asp-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)O)N GGBQDSHTXKQSLP-NHCYSSNCSA-N 0.000 description 1
- SFJUYBCDQBAYAJ-YDHLFZDLSA-N Asp-Val-Phe Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 SFJUYBCDQBAYAJ-YDHLFZDLSA-N 0.000 description 1
- GYNUXDMCDILYIQ-QRTARXTBSA-N Asp-Val-Trp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CC(=O)O)N GYNUXDMCDILYIQ-QRTARXTBSA-N 0.000 description 1
- JGLWFWXGOINXEA-YDHLFZDLSA-N Asp-Val-Tyr Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 JGLWFWXGOINXEA-YDHLFZDLSA-N 0.000 description 1
- 206010003591 Ataxia Diseases 0.000 description 1
- 108091032955 Bacterial small RNA Proteins 0.000 description 1
- DWRXFEITVBNRMK-UHFFFAOYSA-N Beta-D-1-Arabinofuranosylthymine Natural products O=C1NC(=O)C(C)=CN1C1C(O)C(O)C(CO)O1 DWRXFEITVBNRMK-UHFFFAOYSA-N 0.000 description 1
- 102100021277 Beta-secretase 2 Human genes 0.000 description 1
- 208000018240 Bone Marrow Failure disease Diseases 0.000 description 1
- 206010065553 Bone marrow failure Diseases 0.000 description 1
- VOVIALXJUBGFJZ-KWVAZRHASA-N Budesonide Chemical compound C1CC2=CC(=O)C=C[C@]2(C)[C@@H]2[C@@H]1[C@@H]1C[C@H]3OC(CCC)O[C@@]3(C(=O)CO)[C@@]1(C)C[C@@H]2O VOVIALXJUBGFJZ-KWVAZRHASA-N 0.000 description 1
- 102100025222 CD63 antigen Human genes 0.000 description 1
- 101100282787 Caenorhabditis elegans gba-1 gene Proteins 0.000 description 1
- 101100505161 Caenorhabditis elegans mel-32 gene Proteins 0.000 description 1
- 241000282465 Canis Species 0.000 description 1
- YBSQGNFRWZKFMJ-UHFFFAOYSA-N Cerebroside B Natural products CCCCCCCCCCCCCCC(O)C(=O)NC(C(O)C=CCCC=C(C)CCCCCCCCC)COC1OC(CO)C(O)C(O)C1O YBSQGNFRWZKFMJ-UHFFFAOYSA-N 0.000 description 1
- 206010010904 Convulsion Diseases 0.000 description 1
- 206010010947 Coordination abnormal Diseases 0.000 description 1
- XMTDCXXLDZKAGI-ACZMJKKPSA-N Cys-Ala-Gln Chemical compound C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CS)N XMTDCXXLDZKAGI-ACZMJKKPSA-N 0.000 description 1
- XEEIQMGZRFFSRD-XVYDVKMFSA-N Cys-Ala-His Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CS)N XEEIQMGZRFFSRD-XVYDVKMFSA-N 0.000 description 1
- NOCCABSVTRONIN-CIUDSAMLSA-N Cys-Ala-Leu Chemical compound C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)O)NC(=O)[C@H](CS)N NOCCABSVTRONIN-CIUDSAMLSA-N 0.000 description 1
- GMXSSZUVDNPRMA-FXQIFTODSA-N Cys-Arg-Asp Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O GMXSSZUVDNPRMA-FXQIFTODSA-N 0.000 description 1
- BUIYOWKUSCTBRE-CIUDSAMLSA-N Cys-Arg-Gln Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(O)=O BUIYOWKUSCTBRE-CIUDSAMLSA-N 0.000 description 1
- LRZPRGJXAZFXCR-DCAQKATOSA-N Cys-Arg-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CS)N LRZPRGJXAZFXCR-DCAQKATOSA-N 0.000 description 1
- MBPKYKSYUAPLMY-DCAQKATOSA-N Cys-Arg-Leu Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O MBPKYKSYUAPLMY-DCAQKATOSA-N 0.000 description 1
- QLCPDGRAEJSYQM-LPEHRKFASA-N Cys-Arg-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CS)N)C(=O)O QLCPDGRAEJSYQM-LPEHRKFASA-N 0.000 description 1
- UPJGYXRAPJWIHD-CIUDSAMLSA-N Cys-Asn-Leu Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O UPJGYXRAPJWIHD-CIUDSAMLSA-N 0.000 description 1
- XABFFGOGKOORCG-CIUDSAMLSA-N Cys-Asp-Leu Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O XABFFGOGKOORCG-CIUDSAMLSA-N 0.000 description 1
- WKELHWMCIXSVDT-UBHSHLNASA-N Cys-Asp-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CS)N WKELHWMCIXSVDT-UBHSHLNASA-N 0.000 description 1
- WXKWQSDHEXKKNC-ZKWXMUAHSA-N Cys-Asp-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CS)N WXKWQSDHEXKKNC-ZKWXMUAHSA-N 0.000 description 1
- HYKFOHGZGLOCAY-ZLUOBGJFSA-N Cys-Cys-Ala Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CS)C(=O)N[C@@H](C)C(O)=O HYKFOHGZGLOCAY-ZLUOBGJFSA-N 0.000 description 1
- LDIKUWLAMDFHPU-FXQIFTODSA-N Cys-Cys-Arg Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O LDIKUWLAMDFHPU-FXQIFTODSA-N 0.000 description 1
- DVKQPQKQDHHFTE-ZLUOBGJFSA-N Cys-Cys-Asn Chemical compound C([C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CS)N)C(=O)N DVKQPQKQDHHFTE-ZLUOBGJFSA-N 0.000 description 1
- HIPHJNWPLMUBQQ-ACZMJKKPSA-N Cys-Cys-Gln Chemical compound SC[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@H](C(O)=O)CCC(N)=O HIPHJNWPLMUBQQ-ACZMJKKPSA-N 0.000 description 1
- ZJBWJHQDOIMVLM-WHFBIAKZSA-N Cys-Cys-Gly Chemical compound SC[C@H](N)C(=O)N[C@@H](CS)C(=O)NCC(O)=O ZJBWJHQDOIMVLM-WHFBIAKZSA-N 0.000 description 1
- QJUDRFBUWAGUSG-SRVKXCTJSA-N Cys-Cys-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CS)N QJUDRFBUWAGUSG-SRVKXCTJSA-N 0.000 description 1
- VKAWJBQTFCBHQY-GUBZILKMSA-N Cys-Gln-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CS)N VKAWJBQTFCBHQY-GUBZILKMSA-N 0.000 description 1
- SFRQEQGPRTVDPO-NRPADANISA-N Cys-Gln-Val Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O SFRQEQGPRTVDPO-NRPADANISA-N 0.000 description 1
- ZEXHDOQQYZKOIB-ACZMJKKPSA-N Cys-Glu-Ser Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O ZEXHDOQQYZKOIB-ACZMJKKPSA-N 0.000 description 1
- CVLIHKBUPSFRQP-WHFBIAKZSA-N Cys-Gly-Ala Chemical compound [H]N[C@@H](CS)C(=O)NCC(=O)N[C@@H](C)C(O)=O CVLIHKBUPSFRQP-WHFBIAKZSA-N 0.000 description 1
- HQZGVYJBRSISDT-BQBZGAKWSA-N Cys-Gly-Arg Chemical compound [H]N[C@@H](CS)C(=O)NCC(=O)N[C@@H](CCCNC(N)=N)C(O)=O HQZGVYJBRSISDT-BQBZGAKWSA-N 0.000 description 1
- OXOQBEVULIBOSH-ZDLURKLDSA-N Cys-Gly-Thr Chemical compound [H]N[C@@H](CS)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O OXOQBEVULIBOSH-ZDLURKLDSA-N 0.000 description 1
- UVZFZTWNHOQWNK-NAKRPEOUSA-N Cys-Ile-Arg Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O UVZFZTWNHOQWNK-NAKRPEOUSA-N 0.000 description 1
- PRHGYQOSEHLDRW-VGDYDELISA-N Cys-Ile-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CS)N PRHGYQOSEHLDRW-VGDYDELISA-N 0.000 description 1
- ABLJDBFJPUWQQB-DCAQKATOSA-N Cys-Leu-Arg Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](CS)N ABLJDBFJPUWQQB-DCAQKATOSA-N 0.000 description 1
- BLGNLNRBABWDST-CIUDSAMLSA-N Cys-Leu-Asp Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CS)N BLGNLNRBABWDST-CIUDSAMLSA-N 0.000 description 1
- XLLSMEFANRROJE-GUBZILKMSA-N Cys-Leu-Glu Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CS)N XLLSMEFANRROJE-GUBZILKMSA-N 0.000 description 1
- WVLZTXGTNGHPBO-SRVKXCTJSA-N Cys-Leu-Leu Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O WVLZTXGTNGHPBO-SRVKXCTJSA-N 0.000 description 1
- HBHMVBGGHDMPBF-GARJFASQSA-N Cys-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CS)N HBHMVBGGHDMPBF-GARJFASQSA-N 0.000 description 1
- SRIRHERUAMYIOQ-CIUDSAMLSA-N Cys-Leu-Ser Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O SRIRHERUAMYIOQ-CIUDSAMLSA-N 0.000 description 1
- LHMSYHSAAJOEBL-CIUDSAMLSA-N Cys-Lys-Asn Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O LHMSYHSAAJOEBL-CIUDSAMLSA-N 0.000 description 1
- GDNWBSFSHJVXKL-GUBZILKMSA-N Cys-Lys-Gln Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(O)=O GDNWBSFSHJVXKL-GUBZILKMSA-N 0.000 description 1
- BNCKELUXXUYRNY-GUBZILKMSA-N Cys-Lys-Glu Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CS)N BNCKELUXXUYRNY-GUBZILKMSA-N 0.000 description 1
- CIVXDCMSSFGWAL-YUMQZZPRSA-N Cys-Lys-Gly Chemical compound C(CCN)C[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CS)N CIVXDCMSSFGWAL-YUMQZZPRSA-N 0.000 description 1
- NIXHTNJAGGFBAW-CIUDSAMLSA-N Cys-Lys-Ser Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CS)N NIXHTNJAGGFBAW-CIUDSAMLSA-N 0.000 description 1
- RWVBNRYBHAGYSG-GUBZILKMSA-N Cys-Met-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](CS)N RWVBNRYBHAGYSG-GUBZILKMSA-N 0.000 description 1
- UDDITVWSXPEAIQ-IHRRRGAJSA-N Cys-Phe-Arg Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O UDDITVWSXPEAIQ-IHRRRGAJSA-N 0.000 description 1
- CNAMJJOZGXPDHW-IHRRRGAJSA-N Cys-Pro-Phe Chemical compound N[C@@H](CS)C(=O)N1CCC[C@H]1C(=O)N[C@@H](Cc1ccccc1)C(O)=O CNAMJJOZGXPDHW-IHRRRGAJSA-N 0.000 description 1
- SWJYSDXMTPMBHO-FXQIFTODSA-N Cys-Pro-Ser Chemical compound [H]N[C@@H](CS)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O SWJYSDXMTPMBHO-FXQIFTODSA-N 0.000 description 1
- JUNZLDGUJZIUCO-IHRRRGAJSA-N Cys-Pro-Tyr Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CS)N)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O JUNZLDGUJZIUCO-IHRRRGAJSA-N 0.000 description 1
- XBELMDARIGXDKY-GUBZILKMSA-N Cys-Pro-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CS)N XBELMDARIGXDKY-GUBZILKMSA-N 0.000 description 1
- CMYVIUWVYHOLRD-ZLUOBGJFSA-N Cys-Ser-Ala Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O CMYVIUWVYHOLRD-ZLUOBGJFSA-N 0.000 description 1
- RJPKQCFHEPPTGL-ZLUOBGJFSA-N Cys-Ser-Asp Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O RJPKQCFHEPPTGL-ZLUOBGJFSA-N 0.000 description 1
- BCWIFCLVCRAIQK-ZLUOBGJFSA-N Cys-Ser-Cys Chemical compound C([C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CS)N)O BCWIFCLVCRAIQK-ZLUOBGJFSA-N 0.000 description 1
- NXQCSPVUPLUTJH-WHFBIAKZSA-N Cys-Ser-Gly Chemical compound SC[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O NXQCSPVUPLUTJH-WHFBIAKZSA-N 0.000 description 1
- WZJLBUPPZRZNTO-CIUDSAMLSA-N Cys-Ser-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CS)N WZJLBUPPZRZNTO-CIUDSAMLSA-N 0.000 description 1
- ABLQPNMKLMFDQU-BIIVOSGPSA-N Cys-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CS)N)C(=O)O ABLQPNMKLMFDQU-BIIVOSGPSA-N 0.000 description 1
- MJOYUXLETJMQGG-IHRRRGAJSA-N Cys-Tyr-Arg Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O MJOYUXLETJMQGG-IHRRRGAJSA-N 0.000 description 1
- FCXJJTRGVAZDER-FXQIFTODSA-N Cys-Val-Ala Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O FCXJJTRGVAZDER-FXQIFTODSA-N 0.000 description 1
- MQQLYEHXSBJTRK-FXQIFTODSA-N Cys-Val-Cys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CS)N MQQLYEHXSBJTRK-FXQIFTODSA-N 0.000 description 1
- AZDQAZRURQMSQD-XPUUQOCRSA-N Cys-Val-Gly Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O AZDQAZRURQMSQD-XPUUQOCRSA-N 0.000 description 1
- NGOIQDYZMIKCOK-NAKRPEOUSA-N Cys-Val-Ile Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O NGOIQDYZMIKCOK-NAKRPEOUSA-N 0.000 description 1
- 108010005843 Cysteine Proteases Proteins 0.000 description 1
- 102000005927 Cysteine Proteases Human genes 0.000 description 1
- 102000004127 Cytokines Human genes 0.000 description 1
- 108090000695 Cytokines Proteins 0.000 description 1
- LHQIJBMDNUYRAM-AWFVSMACSA-N D-erythro-biopterin Chemical compound N1=C(N)NC(=O)C2=NC([C@H](O)[C@H](O)C)=CN=C21 LHQIJBMDNUYRAM-AWFVSMACSA-N 0.000 description 1
- 102000004163 DNA-directed RNA polymerases Human genes 0.000 description 1
- 108090000626 DNA-directed RNA polymerases Proteins 0.000 description 1
- 208000035976 Developmental Disabilities Diseases 0.000 description 1
- 229920002307 Dextran Polymers 0.000 description 1
- 241000588724 Escherichia coli Species 0.000 description 1
- 108700039887 Essential Genes Proteins 0.000 description 1
- HKVAMNSJSFKALM-GKUWKFKPSA-N Everolimus Chemical compound C1C[C@@H](OCCO)[C@H](OC)C[C@@H]1C[C@@H](C)[C@H]1OC(=O)[C@@H]2CCCCN2C(=O)C(=O)[C@](O)(O2)[C@H](C)CC[C@H]2C[C@H](OC)/C(C)=C/C=C/C=C/[C@@H](C)C[C@@H](C)C(=O)[C@H](OC)[C@H](O)/C(C)=C/[C@@H](C)C(=O)C1 HKVAMNSJSFKALM-GKUWKFKPSA-N 0.000 description 1
- 206010070246 Executive dysfunction Diseases 0.000 description 1
- 108010023555 GTP Cyclohydrolase Proteins 0.000 description 1
- 102000030782 GTP binding Human genes 0.000 description 1
- 108091000058 GTP-Binding Proteins 0.000 description 1
- 108700039691 Genetic Promoter Regions Proteins 0.000 description 1
- 206010018341 Gliosis Diseases 0.000 description 1
- YJIUYQKQBBQYHZ-ACZMJKKPSA-N Gln-Ala-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O YJIUYQKQBBQYHZ-ACZMJKKPSA-N 0.000 description 1
- HHWQMFIGMMOVFK-WDSKDSINSA-N Gln-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(N)=O HHWQMFIGMMOVFK-WDSKDSINSA-N 0.000 description 1
- SHERTACNJPYHAR-ACZMJKKPSA-N Gln-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(N)=O SHERTACNJPYHAR-ACZMJKKPSA-N 0.000 description 1
- JSYULGSPLTZDHM-NRPADANISA-N Gln-Ala-Val Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O JSYULGSPLTZDHM-NRPADANISA-N 0.000 description 1
- RGXXLQWXBFNXTG-CIUDSAMLSA-N Gln-Arg-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(O)=O RGXXLQWXBFNXTG-CIUDSAMLSA-N 0.000 description 1
- YNNXQZDEOCYJJL-CIUDSAMLSA-N Gln-Arg-Asp Chemical compound C(C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CCC(=O)N)N)CN=C(N)N YNNXQZDEOCYJJL-CIUDSAMLSA-N 0.000 description 1
- WOACHWLUOFZLGJ-GUBZILKMSA-N Gln-Arg-Gln Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(O)=O WOACHWLUOFZLGJ-GUBZILKMSA-N 0.000 description 1
- PGPJSRSLQNXBDT-YUMQZZPRSA-N Gln-Arg-Gly Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O PGPJSRSLQNXBDT-YUMQZZPRSA-N 0.000 description 1
- SSWAFVQFQWOJIJ-XIRDDKMYSA-N Gln-Arg-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CCC(=O)N)N SSWAFVQFQWOJIJ-XIRDDKMYSA-N 0.000 description 1
- ODBLJLZVLAWVMS-GUBZILKMSA-N Gln-Asn-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCC(=O)N)N ODBLJLZVLAWVMS-GUBZILKMSA-N 0.000 description 1
- LMPBBFWHCRURJD-LAEOZQHASA-N Gln-Asn-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCC(=O)N)N LMPBBFWHCRURJD-LAEOZQHASA-N 0.000 description 1
- BTSPOOHJBYJRKO-CIUDSAMLSA-N Gln-Asp-Arg Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O BTSPOOHJBYJRKO-CIUDSAMLSA-N 0.000 description 1
- RKAQZCDMSUQTSS-FXQIFTODSA-N Gln-Asp-Gln Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N RKAQZCDMSUQTSS-FXQIFTODSA-N 0.000 description 1
- KZEUVLLVULIPNX-GUBZILKMSA-N Gln-Asp-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCC(=O)N)N KZEUVLLVULIPNX-GUBZILKMSA-N 0.000 description 1
- SOIAHPSKKUYREP-CIUDSAMLSA-N Gln-Asp-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCC(=O)N)N SOIAHPSKKUYREP-CIUDSAMLSA-N 0.000 description 1
- JFSNBQJNDMXMQF-XHNCKOQMSA-N Gln-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCC(=O)N)N)C(=O)O JFSNBQJNDMXMQF-XHNCKOQMSA-N 0.000 description 1
- WLODHVXYKYHLJD-ACZMJKKPSA-N Gln-Asp-Ser Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CO)C(=O)O)N WLODHVXYKYHLJD-ACZMJKKPSA-N 0.000 description 1
- IXFVOPOHSRKJNG-LAEOZQHASA-N Gln-Asp-Val Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O IXFVOPOHSRKJNG-LAEOZQHASA-N 0.000 description 1
- UZMWDBOHAOSCCH-ACZMJKKPSA-N Gln-Cys-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CS)NC(=O)[C@@H](N)CCC(N)=O UZMWDBOHAOSCCH-ACZMJKKPSA-N 0.000 description 1
- CXFUMJQFZVCETK-FXQIFTODSA-N Gln-Cys-Gln Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCC(N)=O)C(O)=O CXFUMJQFZVCETK-FXQIFTODSA-N 0.000 description 1
- QFTRCUPCARNIPZ-XHNCKOQMSA-N Gln-Cys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CS)NC(=O)[C@H](CCC(=O)N)N)C(=O)O QFTRCUPCARNIPZ-XHNCKOQMSA-N 0.000 description 1
- COYGBRTZEVWZBW-XKBZYTNZSA-N Gln-Cys-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CS)NC(=O)[C@@H](N)CCC(N)=O COYGBRTZEVWZBW-XKBZYTNZSA-N 0.000 description 1
- LOJYQMFIIJVETK-WDSKDSINSA-N Gln-Gln Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(O)=O LOJYQMFIIJVETK-WDSKDSINSA-N 0.000 description 1
- GPISLLFQNHELLK-DCAQKATOSA-N Gln-Gln-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCC(=O)N)N GPISLLFQNHELLK-DCAQKATOSA-N 0.000 description 1
- UFNSPPFJOHNXRE-AUTRQRHGSA-N Gln-Gln-Val Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O UFNSPPFJOHNXRE-AUTRQRHGSA-N 0.000 description 1
- LWDGZZGWDMHBOF-FXQIFTODSA-N Gln-Glu-Asn Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O LWDGZZGWDMHBOF-FXQIFTODSA-N 0.000 description 1
- JHPFPROFOAJRFN-IHRRRGAJSA-N Gln-Glu-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCC(=O)N)N)O JHPFPROFOAJRFN-IHRRRGAJSA-N 0.000 description 1
- VSXBYIJUAXPAAL-WDSKDSINSA-N Gln-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)[C@@H](N)CCC(N)=O VSXBYIJUAXPAAL-WDSKDSINSA-N 0.000 description 1
- IKFZXRLDMYWNBU-YUMQZZPRSA-N Gln-Gly-Arg Chemical compound NC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCN=C(N)N IKFZXRLDMYWNBU-YUMQZZPRSA-N 0.000 description 1
- SMLDOQHTOAAFJQ-WDSKDSINSA-N Gln-Gly-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)NCC(=O)N[C@@H](CO)C(O)=O SMLDOQHTOAAFJQ-WDSKDSINSA-N 0.000 description 1
- JXFLPKSDLDEOQK-JHEQGTHGSA-N Gln-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CCC(N)=O JXFLPKSDLDEOQK-JHEQGTHGSA-N 0.000 description 1
- ORYMMTRPKVTGSJ-XVKPBYJWSA-N Gln-Gly-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CCC(N)=O ORYMMTRPKVTGSJ-XVKPBYJWSA-N 0.000 description 1
- JNEITCMDYWKPIW-GUBZILKMSA-N Gln-His-Cys Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCC(=O)N)N JNEITCMDYWKPIW-GUBZILKMSA-N 0.000 description 1
- SBHVGKBYOQKAEA-SDDRHHMPSA-N Gln-His-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CN=CN2)NC(=O)[C@H](CCC(=O)N)N)C(=O)O SBHVGKBYOQKAEA-SDDRHHMPSA-N 0.000 description 1
- KHGGWBRVRPHFMH-PEFMBERDSA-N Gln-Ile-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCC(=O)N)N KHGGWBRVRPHFMH-PEFMBERDSA-N 0.000 description 1
- JXBZEDIQFFCHPZ-PEFMBERDSA-N Gln-Ile-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CCC(=O)N)N JXBZEDIQFFCHPZ-PEFMBERDSA-N 0.000 description 1
- FTIJVMLAGRAYMJ-MNXVOIDGSA-N Gln-Ile-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCC(N)=O FTIJVMLAGRAYMJ-MNXVOIDGSA-N 0.000 description 1
- JKGHMESJHRTHIC-SIUGBPQLSA-N Gln-Ile-Tyr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)[C@H](CCC(=O)N)N JKGHMESJHRTHIC-SIUGBPQLSA-N 0.000 description 1
- HYPVLWGNBIYTNA-GUBZILKMSA-N Gln-Leu-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O HYPVLWGNBIYTNA-GUBZILKMSA-N 0.000 description 1
- LGIKBBLQVSWUGK-DCAQKATOSA-N Gln-Leu-Gln Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O LGIKBBLQVSWUGK-DCAQKATOSA-N 0.000 description 1
- QKCZZAZNMMVICF-DCAQKATOSA-N Gln-Leu-Glu Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O QKCZZAZNMMVICF-DCAQKATOSA-N 0.000 description 1
- CAXXTYYGFYTBPV-IUCAKERBSA-N Gln-Leu-Gly Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O CAXXTYYGFYTBPV-IUCAKERBSA-N 0.000 description 1
- SHAUZYVSXAMYAZ-JYJNAYRXSA-N Gln-Leu-Phe Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CCC(=O)N)N SHAUZYVSXAMYAZ-JYJNAYRXSA-N 0.000 description 1
- ZBKUIQNCRIYVGH-SDDRHHMPSA-N Gln-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)N)N ZBKUIQNCRIYVGH-SDDRHHMPSA-N 0.000 description 1
- IOFDDSNZJDIGPB-GVXVVHGQSA-N Gln-Leu-Val Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O IOFDDSNZJDIGPB-GVXVVHGQSA-N 0.000 description 1
- GURIQZQSTBBHRV-SRVKXCTJSA-N Gln-Lys-Arg Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O GURIQZQSTBBHRV-SRVKXCTJSA-N 0.000 description 1
- UWKPRVKWEKEMSY-DCAQKATOSA-N Gln-Lys-Gln Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(O)=O UWKPRVKWEKEMSY-DCAQKATOSA-N 0.000 description 1
- HPCOBEHVEHWREJ-DCAQKATOSA-N Gln-Lys-Glu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O HPCOBEHVEHWREJ-DCAQKATOSA-N 0.000 description 1
- ZXGLLNZQSBLQLT-SRVKXCTJSA-N Gln-Met-Lys Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)N)N ZXGLLNZQSBLQLT-SRVKXCTJSA-N 0.000 description 1
- JNVGVECJCOZHCN-DRZSPHRISA-N Gln-Phe-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C)C(O)=O JNVGVECJCOZHCN-DRZSPHRISA-N 0.000 description 1
- UESYBOXFJWJVSB-AVGNSLFASA-N Gln-Phe-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O UESYBOXFJWJVSB-AVGNSLFASA-N 0.000 description 1
- FNAJNWPDTIXYJN-CIUDSAMLSA-N Gln-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CCC(N)=O FNAJNWPDTIXYJN-CIUDSAMLSA-N 0.000 description 1
- DOQUICBEISTQHE-CIUDSAMLSA-N Gln-Pro-Asp Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O DOQUICBEISTQHE-CIUDSAMLSA-N 0.000 description 1
- HMIXCETWRYDVMO-GUBZILKMSA-N Gln-Pro-Glu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O HMIXCETWRYDVMO-GUBZILKMSA-N 0.000 description 1
- MQJDLNRXBOELJW-KKUMJFAQSA-N Gln-Pro-Phe Chemical compound N[C@@H](CCC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](Cc1ccccc1)C(O)=O MQJDLNRXBOELJW-KKUMJFAQSA-N 0.000 description 1
- YPFFHGRJCUBXPX-NHCYSSNCSA-N Gln-Pro-Val Chemical compound CC(C)[C@H](NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CCC(N)=O)C(O)=O YPFFHGRJCUBXPX-NHCYSSNCSA-N 0.000 description 1
- NYCVMJGIJYQWDO-CIUDSAMLSA-N Gln-Ser-Arg Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O NYCVMJGIJYQWDO-CIUDSAMLSA-N 0.000 description 1
- MFHVAWMMKZBSRQ-ACZMJKKPSA-N Gln-Ser-Cys Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)O)N MFHVAWMMKZBSRQ-ACZMJKKPSA-N 0.000 description 1
- RWQCWSGOOOEGPB-FXQIFTODSA-N Gln-Ser-Glu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(O)=O RWQCWSGOOOEGPB-FXQIFTODSA-N 0.000 description 1
- OSCLNNWLKKIQJM-WDSKDSINSA-N Gln-Ser-Gly Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)NCC(O)=O OSCLNNWLKKIQJM-WDSKDSINSA-N 0.000 description 1
- LPIKVBWNNVFHCQ-GUBZILKMSA-N Gln-Ser-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O LPIKVBWNNVFHCQ-GUBZILKMSA-N 0.000 description 1
- KPNWAJMEMRCLAL-GUBZILKMSA-N Gln-Ser-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCC(=O)N)N KPNWAJMEMRCLAL-GUBZILKMSA-N 0.000 description 1
- ZGHMRONFHDVXEF-AVGNSLFASA-N Gln-Ser-Phe Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O ZGHMRONFHDVXEF-AVGNSLFASA-N 0.000 description 1
- JILRMFFFCHUUTJ-ACZMJKKPSA-N Gln-Ser-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O JILRMFFFCHUUTJ-ACZMJKKPSA-N 0.000 description 1
- BYKZWDGMJLNFJY-XKBZYTNZSA-N Gln-Ser-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCC(=O)N)N)O BYKZWDGMJLNFJY-XKBZYTNZSA-N 0.000 description 1
- OTQSTOXRUBVWAP-NRPADANISA-N Gln-Ser-Val Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O OTQSTOXRUBVWAP-NRPADANISA-N 0.000 description 1
- PAOHIZNRJNIXQY-XQXXSGGOSA-N Gln-Thr-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O PAOHIZNRJNIXQY-XQXXSGGOSA-N 0.000 description 1
- DYVMTEWCGAVKSE-HJGDQZAQSA-N Gln-Thr-Arg Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](CCC(=O)N)N)O DYVMTEWCGAVKSE-HJGDQZAQSA-N 0.000 description 1
- RONJIBWTGKVKFY-HTUGSXCWSA-N Gln-Thr-Phe Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CCC(=O)N)N)O RONJIBWTGKVKFY-HTUGSXCWSA-N 0.000 description 1
- HLRLXVPRJJITSK-IFFSRLJSSA-N Gln-Thr-Val Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O HLRLXVPRJJITSK-IFFSRLJSSA-N 0.000 description 1
- SGVGIVDZLSHSEN-RYUDHWBXSA-N Gln-Tyr-Gly Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(O)=O SGVGIVDZLSHSEN-RYUDHWBXSA-N 0.000 description 1
- UGEZSPWLJABDAR-KKUMJFAQSA-N Gln-Tyr-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CCC(=O)N)N UGEZSPWLJABDAR-KKUMJFAQSA-N 0.000 description 1
- UQKVUFGUSVYJMQ-IRIUXVKKSA-N Gln-Tyr-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CCC(=O)N)N)O UQKVUFGUSVYJMQ-IRIUXVKKSA-N 0.000 description 1
- QZQYITIKPAUDGN-GVXVVHGQSA-N Gln-Val-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CCC(=O)N)N QZQYITIKPAUDGN-GVXVVHGQSA-N 0.000 description 1
- MKRDNSWGJWTBKZ-GVXVVHGQSA-N Gln-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)N)N MKRDNSWGJWTBKZ-GVXVVHGQSA-N 0.000 description 1
- ZMXZGYLINVNTKH-DZKIICNBSA-N Gln-Val-Phe Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 ZMXZGYLINVNTKH-DZKIICNBSA-N 0.000 description 1
- UTKICHUQEQBDGC-ACZMJKKPSA-N Glu-Ala-Cys Chemical compound C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCC(=O)O)N UTKICHUQEQBDGC-ACZMJKKPSA-N 0.000 description 1
- JJKKWYQVHRUSDG-GUBZILKMSA-N Glu-Ala-Lys Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCCN)C(O)=O JJKKWYQVHRUSDG-GUBZILKMSA-N 0.000 description 1
- IRDASPPCLZIERZ-XHNCKOQMSA-N Glu-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)O)N IRDASPPCLZIERZ-XHNCKOQMSA-N 0.000 description 1
- FYBSCGZLICNOBA-XQXXSGGOSA-N Glu-Ala-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O FYBSCGZLICNOBA-XQXXSGGOSA-N 0.000 description 1
- KBKGRMNVKPSQIF-XDTLVQLUSA-N Glu-Ala-Tyr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O KBKGRMNVKPSQIF-XDTLVQLUSA-N 0.000 description 1
- AVZHGSCDKIQZPQ-CIUDSAMLSA-N Glu-Arg-Ala Chemical compound C[C@H](NC(=O)[C@H](CCCNC(N)=N)NC(=O)[C@@H](N)CCC(O)=O)C(O)=O AVZHGSCDKIQZPQ-CIUDSAMLSA-N 0.000 description 1
- RCCDHXSRMWCOOY-GUBZILKMSA-N Glu-Arg-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(O)=O RCCDHXSRMWCOOY-GUBZILKMSA-N 0.000 description 1
- CGYDXNKRIMJMLV-GUBZILKMSA-N Glu-Arg-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O CGYDXNKRIMJMLV-GUBZILKMSA-N 0.000 description 1
- KKCUFHUTMKQQCF-SRVKXCTJSA-N Glu-Arg-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O KKCUFHUTMKQQCF-SRVKXCTJSA-N 0.000 description 1
- KEBACWCLVOXFNC-DCAQKATOSA-N Glu-Arg-Met Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCSC)C(O)=O KEBACWCLVOXFNC-DCAQKATOSA-N 0.000 description 1
- LTUVYLVIZHJCOQ-KKUMJFAQSA-N Glu-Arg-Phe Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O LTUVYLVIZHJCOQ-KKUMJFAQSA-N 0.000 description 1
- VPKBCVUDBNINAH-GARJFASQSA-N Glu-Arg-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CCC(=O)O)N)C(=O)O VPKBCVUDBNINAH-GARJFASQSA-N 0.000 description 1
- YYOBUPFZLKQUAX-FXQIFTODSA-N Glu-Asn-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O YYOBUPFZLKQUAX-FXQIFTODSA-N 0.000 description 1
- CKRUHITYRFNUKW-WDSKDSINSA-N Glu-Asn-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O CKRUHITYRFNUKW-WDSKDSINSA-N 0.000 description 1
- SVZIKUHLRKVZIF-GUBZILKMSA-N Glu-Asn-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCC(=O)O)N SVZIKUHLRKVZIF-GUBZILKMSA-N 0.000 description 1
- LXAUHIRMWXQRKI-XHNCKOQMSA-N Glu-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCC(=O)O)N)C(=O)O LXAUHIRMWXQRKI-XHNCKOQMSA-N 0.000 description 1
- RDDSZZJOKDVPAE-ACZMJKKPSA-N Glu-Asn-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O RDDSZZJOKDVPAE-ACZMJKKPSA-N 0.000 description 1
- VAZZOGXDUQSVQF-NUMRIWBASA-N Glu-Asn-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCC(=O)O)N)O VAZZOGXDUQSVQF-NUMRIWBASA-N 0.000 description 1
- RDPOETHPAQEGDP-ACZMJKKPSA-N Glu-Asp-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O RDPOETHPAQEGDP-ACZMJKKPSA-N 0.000 description 1
- JPHYJQHPILOKHC-ACZMJKKPSA-N Glu-Asp-Asp Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O JPHYJQHPILOKHC-ACZMJKKPSA-N 0.000 description 1
- XXCDTYBVGMPIOA-FXQIFTODSA-N Glu-Asp-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O XXCDTYBVGMPIOA-FXQIFTODSA-N 0.000 description 1
- RTOOAKXIJADOLL-GUBZILKMSA-N Glu-Asp-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCC(=O)O)N RTOOAKXIJADOLL-GUBZILKMSA-N 0.000 description 1
- HJIFPJUEOGZWRI-GUBZILKMSA-N Glu-Asp-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCC(=O)O)N HJIFPJUEOGZWRI-GUBZILKMSA-N 0.000 description 1
- CKOFNWCLWRYUHK-XHNCKOQMSA-N Glu-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCC(=O)O)N)C(=O)O CKOFNWCLWRYUHK-XHNCKOQMSA-N 0.000 description 1
- JRCUFCXYZLPSDZ-ACZMJKKPSA-N Glu-Asp-Ser Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O JRCUFCXYZLPSDZ-ACZMJKKPSA-N 0.000 description 1
- WATXSTJXNBOHKD-LAEOZQHASA-N Glu-Asp-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O WATXSTJXNBOHKD-LAEOZQHASA-N 0.000 description 1
- NKLRYVLERDYDBI-FXQIFTODSA-N Glu-Glu-Asp Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O NKLRYVLERDYDBI-FXQIFTODSA-N 0.000 description 1
- BUZMZDDKFCSKOT-CIUDSAMLSA-N Glu-Glu-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O BUZMZDDKFCSKOT-CIUDSAMLSA-N 0.000 description 1
- KASDBWKLWJKTLJ-GUBZILKMSA-N Glu-Glu-Met Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCSC)C(O)=O KASDBWKLWJKTLJ-GUBZILKMSA-N 0.000 description 1
- QJCKNLPMTPXXEM-AUTRQRHGSA-N Glu-Glu-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCC(O)=O QJCKNLPMTPXXEM-AUTRQRHGSA-N 0.000 description 1
- AIGROOHQXCACHL-WDSKDSINSA-N Glu-Gly-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H](C)C(O)=O AIGROOHQXCACHL-WDSKDSINSA-N 0.000 description 1
- WRNAXCVRSBBKGS-BQBZGAKWSA-N Glu-Gly-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H](CCC(N)=O)C(O)=O WRNAXCVRSBBKGS-BQBZGAKWSA-N 0.000 description 1
- OGNJZUXUTPQVBR-BQBZGAKWSA-N Glu-Gly-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O OGNJZUXUTPQVBR-BQBZGAKWSA-N 0.000 description 1
- RAUDKMVXNOWDLS-WDSKDSINSA-N Glu-Gly-Ser Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O RAUDKMVXNOWDLS-WDSKDSINSA-N 0.000 description 1
- COSBSYQVPSODFX-GUBZILKMSA-N Glu-His-Cys Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCC(=O)O)N COSBSYQVPSODFX-GUBZILKMSA-N 0.000 description 1
- XOFYVODYSNKPDK-AVGNSLFASA-N Glu-His-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)NC(=O)[C@H](CCC(=O)O)N XOFYVODYSNKPDK-AVGNSLFASA-N 0.000 description 1
- DVLZZEPUNFEUBW-AVGNSLFASA-N Glu-His-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CCC(=O)O)N DVLZZEPUNFEUBW-AVGNSLFASA-N 0.000 description 1
- ZWABFSSWTSAMQN-KBIXCLLPSA-N Glu-Ile-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O ZWABFSSWTSAMQN-KBIXCLLPSA-N 0.000 description 1
- YVYVMJNUENBOOL-KBIXCLLPSA-N Glu-Ile-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCC(=O)O)N YVYVMJNUENBOOL-KBIXCLLPSA-N 0.000 description 1
- ITBHUUMCJJQUSC-LAEOZQHASA-N Glu-Ile-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O ITBHUUMCJJQUSC-LAEOZQHASA-N 0.000 description 1
- WTMZXOPHTIVFCP-QEWYBTABSA-N Glu-Ile-Phe Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 WTMZXOPHTIVFCP-QEWYBTABSA-N 0.000 description 1
- GXMXPCXXKVWOSM-KQXIARHKSA-N Glu-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)O)N GXMXPCXXKVWOSM-KQXIARHKSA-N 0.000 description 1
- KRRFFAHEAOCBCQ-SIUGBPQLSA-N Glu-Ile-Tyr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O KRRFFAHEAOCBCQ-SIUGBPQLSA-N 0.000 description 1
- INGJLBQKTRJLFO-UKJIMTQDSA-N Glu-Ile-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCC(O)=O INGJLBQKTRJLFO-UKJIMTQDSA-N 0.000 description 1
- VMKCPNBBPGGQBJ-GUBZILKMSA-N Glu-Leu-Asn Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCC(=O)O)N VMKCPNBBPGGQBJ-GUBZILKMSA-N 0.000 description 1
- PJBVXVBTTFZPHJ-GUBZILKMSA-N Glu-Leu-Asp Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CCC(=O)O)N PJBVXVBTTFZPHJ-GUBZILKMSA-N 0.000 description 1
- NWOUBJNMZDDGDT-AVGNSLFASA-N Glu-Leu-His Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 NWOUBJNMZDDGDT-AVGNSLFASA-N 0.000 description 1
- FBEJIDRSQCGFJI-GUBZILKMSA-N Glu-Leu-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O FBEJIDRSQCGFJI-GUBZILKMSA-N 0.000 description 1
- NJCALAAIGREHDR-WDCWCFNPSA-N Glu-Leu-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O NJCALAAIGREHDR-WDCWCFNPSA-N 0.000 description 1
- IOUQWHIEQYQVFD-JYJNAYRXSA-N Glu-Leu-Tyr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O IOUQWHIEQYQVFD-JYJNAYRXSA-N 0.000 description 1
- GJBUAAAIZSRCDC-GVXVVHGQSA-N Glu-Leu-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O GJBUAAAIZSRCDC-GVXVVHGQSA-N 0.000 description 1
- OQXDUSZKISQQSS-GUBZILKMSA-N Glu-Lys-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O OQXDUSZKISQQSS-GUBZILKMSA-N 0.000 description 1
- OHWJUIXZHVIXJJ-GUBZILKMSA-N Glu-Lys-Cys Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCC(=O)O)N OHWJUIXZHVIXJJ-GUBZILKMSA-N 0.000 description 1
- OCJRHJZKGGSPRW-IUCAKERBSA-N Glu-Lys-Gly Chemical compound NCCCC[C@@H](C(=O)NCC(O)=O)NC(=O)[C@@H](N)CCC(O)=O OCJRHJZKGGSPRW-IUCAKERBSA-N 0.000 description 1
- YKBUCXNNBYZYAY-MNXVOIDGSA-N Glu-Lys-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O YKBUCXNNBYZYAY-MNXVOIDGSA-N 0.000 description 1
- ILWHFUZZCFYSKT-AVGNSLFASA-N Glu-Lys-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O ILWHFUZZCFYSKT-AVGNSLFASA-N 0.000 description 1
- QDMVXRNLOPTPIE-WDCWCFNPSA-N Glu-Lys-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O QDMVXRNLOPTPIE-WDCWCFNPSA-N 0.000 description 1
- LGWUJBCIFGVBSJ-CIUDSAMLSA-N Glu-Met-Cys Chemical compound CSCC[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCC(=O)O)N LGWUJBCIFGVBSJ-CIUDSAMLSA-N 0.000 description 1
- MCGNJCNXIMQCMN-DCAQKATOSA-N Glu-Met-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@H](CCSC)NC(=O)[C@@H](N)CCC(O)=O MCGNJCNXIMQCMN-DCAQKATOSA-N 0.000 description 1
- UMHRCVCZUPBBQW-GARJFASQSA-N Glu-Met-Pro Chemical compound CSCC[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)O)N UMHRCVCZUPBBQW-GARJFASQSA-N 0.000 description 1
- PMSMKNYRZCKVMC-DRZSPHRISA-N Glu-Phe-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](CCC(=O)O)N PMSMKNYRZCKVMC-DRZSPHRISA-N 0.000 description 1
- RXESHTOTINOODU-JYJNAYRXSA-N Glu-Phe-His Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)NC(=O)[C@H](CCC(=O)O)N RXESHTOTINOODU-JYJNAYRXSA-N 0.000 description 1
- FGSGPLRPQCZBSQ-AVGNSLFASA-N Glu-Phe-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O FGSGPLRPQCZBSQ-AVGNSLFASA-N 0.000 description 1
- TWYFJOHWGCCRIR-DCAQKATOSA-N Glu-Pro-Arg Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(O)=O TWYFJOHWGCCRIR-DCAQKATOSA-N 0.000 description 1
- CQAHWYDHKUWYIX-YUMQZZPRSA-N Glu-Pro-Gly Chemical compound OC(=O)CC[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O CQAHWYDHKUWYIX-YUMQZZPRSA-N 0.000 description 1
- SYWCGQOIIARSIX-SRVKXCTJSA-N Glu-Pro-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O SYWCGQOIIARSIX-SRVKXCTJSA-N 0.000 description 1
- LPHGXOWFAXFCPX-KKUMJFAQSA-N Glu-Pro-Phe Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CCC(=O)O)N)C(=O)N[C@@H](CC2=CC=CC=C2)C(=O)O LPHGXOWFAXFCPX-KKUMJFAQSA-N 0.000 description 1
- NNQDRRUXFJYCCJ-NHCYSSNCSA-N Glu-Pro-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O NNQDRRUXFJYCCJ-NHCYSSNCSA-N 0.000 description 1
- GMVCSRBOSIUTFC-FXQIFTODSA-N Glu-Ser-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(O)=O GMVCSRBOSIUTFC-FXQIFTODSA-N 0.000 description 1
- SYAYROHMAIHWFB-KBIXCLLPSA-N Glu-Ser-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O SYAYROHMAIHWFB-KBIXCLLPSA-N 0.000 description 1
- IDEODOAVGCMUQV-GUBZILKMSA-N Glu-Ser-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O IDEODOAVGCMUQV-GUBZILKMSA-N 0.000 description 1
- BXSZPACYCMNKLS-AVGNSLFASA-N Glu-Ser-Phe Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O BXSZPACYCMNKLS-AVGNSLFASA-N 0.000 description 1
- HZISRJBYZAODRV-XQXXSGGOSA-N Glu-Thr-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O HZISRJBYZAODRV-XQXXSGGOSA-N 0.000 description 1
- BDISFWMLMNBTGP-NUMRIWBASA-N Glu-Thr-Asp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(O)=O BDISFWMLMNBTGP-NUMRIWBASA-N 0.000 description 1
- QCMVGXDELYMZET-GLLZPBPUSA-N Glu-Thr-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O QCMVGXDELYMZET-GLLZPBPUSA-N 0.000 description 1
- DDXZHOHEABQXSE-NKIYYHGXSA-N Glu-Thr-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CCC(=O)O)N)O DDXZHOHEABQXSE-NKIYYHGXSA-N 0.000 description 1
- ZGXGVBYEJGVJMV-HJGDQZAQSA-N Glu-Thr-Met Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CCC(=O)O)N)O ZGXGVBYEJGVJMV-HJGDQZAQSA-N 0.000 description 1
- CQGBSALYGOXQPE-HTUGSXCWSA-N Glu-Thr-Phe Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CCC(=O)O)N)O CQGBSALYGOXQPE-HTUGSXCWSA-N 0.000 description 1
- VHPVBPCCWVDGJL-IRIUXVKKSA-N Glu-Thr-Tyr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O VHPVBPCCWVDGJL-IRIUXVKKSA-N 0.000 description 1
- DLISPGXMKZTWQG-IFFSRLJSSA-N Glu-Thr-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O DLISPGXMKZTWQG-IFFSRLJSSA-N 0.000 description 1
- BPCLDCNZBUYGOD-BPUTZDHNSA-N Glu-Trp-Glu Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CCC(O)=O)N)C(=O)N[C@@H](CCC(O)=O)C(O)=O)=CNC2=C1 BPCLDCNZBUYGOD-BPUTZDHNSA-N 0.000 description 1
- ZTNHPMZHAILHRB-JSGCOSHPSA-N Glu-Trp-Gly Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CCC(O)=O)N)C(=O)NCC(O)=O)=CNC2=C1 ZTNHPMZHAILHRB-JSGCOSHPSA-N 0.000 description 1
- DXMOIVCNJIJQSC-QEJZJMRPSA-N Glu-Trp-Ser Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CCC(=O)O)N DXMOIVCNJIJQSC-QEJZJMRPSA-N 0.000 description 1
- QGAJQIGFFIQJJK-IHRRRGAJSA-N Glu-Tyr-Gln Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CCC(=O)O)N)O QGAJQIGFFIQJJK-IHRRRGAJSA-N 0.000 description 1
- UUTGYDAKPISJAO-JYJNAYRXSA-N Glu-Tyr-Leu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C(O)=O)CC1=CC=C(O)C=C1 UUTGYDAKPISJAO-JYJNAYRXSA-N 0.000 description 1
- KCCNSVHJSMMGFS-NRPADANISA-N Glu-Val-Cys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCC(=O)O)N KCCNSVHJSMMGFS-NRPADANISA-N 0.000 description 1
- LZEUDRYSAZAJIO-AUTRQRHGSA-N Glu-Val-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O LZEUDRYSAZAJIO-AUTRQRHGSA-N 0.000 description 1
- ZYRXTRTUCAVNBQ-GVXVVHGQSA-N Glu-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)O)N ZYRXTRTUCAVNBQ-GVXVVHGQSA-N 0.000 description 1
- NTNUEBVGKMVANB-NHCYSSNCSA-N Glu-Val-Met Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCSC)C(O)=O NTNUEBVGKMVANB-NHCYSSNCSA-N 0.000 description 1
- PUUYVMYCMIWHFE-BQBZGAKWSA-N Gly-Ala-Arg Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N PUUYVMYCMIWHFE-BQBZGAKWSA-N 0.000 description 1
- FKJQNJCQTKUBCD-XPUUQOCRSA-N Gly-Ala-His Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)O FKJQNJCQTKUBCD-XPUUQOCRSA-N 0.000 description 1
- YMUFWNJHVPQNQD-ZKWXMUAHSA-N Gly-Ala-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)CN YMUFWNJHVPQNQD-ZKWXMUAHSA-N 0.000 description 1
- PHONXOACARQMPM-BQBZGAKWSA-N Gly-Ala-Met Chemical compound [H]NCC(=O)N[C@@H](C)C(=O)N[C@@H](CCSC)C(O)=O PHONXOACARQMPM-BQBZGAKWSA-N 0.000 description 1
- QXPRJQPCFXMCIY-NKWVEPMBSA-N Gly-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)CN QXPRJQPCFXMCIY-NKWVEPMBSA-N 0.000 description 1
- QSDKBRMVXSWAQE-BFHQHQDPSA-N Gly-Ala-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)CN QSDKBRMVXSWAQE-BFHQHQDPSA-N 0.000 description 1
- JRDYDYXZKFNNRQ-XPUUQOCRSA-N Gly-Ala-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)CN JRDYDYXZKFNNRQ-XPUUQOCRSA-N 0.000 description 1
- UPOJUWHGMDJUQZ-IUCAKERBSA-N Gly-Arg-Arg Chemical compound NC(=N)NCCC[C@H](NC(=O)CN)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O UPOJUWHGMDJUQZ-IUCAKERBSA-N 0.000 description 1
- CLODWIOAKCSBAN-BQBZGAKWSA-N Gly-Arg-Asp Chemical compound NC(N)=NCCC[C@H](NC(=O)CN)C(=O)N[C@@H](CC(O)=O)C(O)=O CLODWIOAKCSBAN-BQBZGAKWSA-N 0.000 description 1
- GWCRIHNSVMOBEQ-BQBZGAKWSA-N Gly-Arg-Ser Chemical compound [H]NCC(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O GWCRIHNSVMOBEQ-BQBZGAKWSA-N 0.000 description 1
- XUORRGAFUQIMLC-STQMWFEESA-N Gly-Arg-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)CN)O XUORRGAFUQIMLC-STQMWFEESA-N 0.000 description 1
- WKJKBELXHCTHIJ-WPRPVWTQSA-N Gly-Arg-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCN=C(N)N WKJKBELXHCTHIJ-WPRPVWTQSA-N 0.000 description 1
- FMVLWTYYODVFRG-BQBZGAKWSA-N Gly-Asn-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)CN FMVLWTYYODVFRG-BQBZGAKWSA-N 0.000 description 1
- QSTLUOIOYLYLLF-WDSKDSINSA-N Gly-Asp-Glu Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O QSTLUOIOYLYLLF-WDSKDSINSA-N 0.000 description 1
- PMNHJLASAAWELO-FOHZUACHSA-N Gly-Asp-Thr Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PMNHJLASAAWELO-FOHZUACHSA-N 0.000 description 1
- TZOVVRJYUDETQG-RCOVLWMOSA-N Gly-Asp-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)CN TZOVVRJYUDETQG-RCOVLWMOSA-N 0.000 description 1
- IXKRSKPKSLXIHN-YUMQZZPRSA-N Gly-Cys-Leu Chemical compound [H]NCC(=O)N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(O)=O IXKRSKPKSLXIHN-YUMQZZPRSA-N 0.000 description 1
- UEGIPZAXNBYCCP-NKWVEPMBSA-N Gly-Cys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CS)NC(=O)CN)C(=O)O UEGIPZAXNBYCCP-NKWVEPMBSA-N 0.000 description 1
- JMQFHZWESBGPFC-WDSKDSINSA-N Gly-Gln-Asp Chemical compound NCC(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O JMQFHZWESBGPFC-WDSKDSINSA-N 0.000 description 1
- XLFHCWHXKSFVIB-BQBZGAKWSA-N Gly-Gln-Gln Chemical compound NCC(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O XLFHCWHXKSFVIB-BQBZGAKWSA-N 0.000 description 1
- JUGQPPOVWXSPKJ-RYUDHWBXSA-N Gly-Gln-Phe Chemical compound [H]NCC(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O JUGQPPOVWXSPKJ-RYUDHWBXSA-N 0.000 description 1
- GNPVTZJUUBPZKW-WDSKDSINSA-N Gly-Gln-Ser Chemical compound [H]NCC(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O GNPVTZJUUBPZKW-WDSKDSINSA-N 0.000 description 1
- NPSWCZIRBAYNSB-JHEQGTHGSA-N Gly-Gln-Thr Chemical compound [H]NCC(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O NPSWCZIRBAYNSB-JHEQGTHGSA-N 0.000 description 1
- LJXWZPHEMJSNRC-KBPBESRZSA-N Gly-Gln-Trp Chemical compound [H]NCC(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O LJXWZPHEMJSNRC-KBPBESRZSA-N 0.000 description 1
- QPDUVFSVVAOUHE-XVKPBYJWSA-N Gly-Gln-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CCC(N)=O)NC(=O)CN)C(O)=O QPDUVFSVVAOUHE-XVKPBYJWSA-N 0.000 description 1
- DHDOADIPGZTAHT-YUMQZZPRSA-N Gly-Glu-Arg Chemical compound NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N DHDOADIPGZTAHT-YUMQZZPRSA-N 0.000 description 1
- FIQQRCFQXGLOSZ-WDSKDSINSA-N Gly-Glu-Asp Chemical compound [H]NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O FIQQRCFQXGLOSZ-WDSKDSINSA-N 0.000 description 1
- YYPFZVIXAVDHIK-IUCAKERBSA-N Gly-Glu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)CN YYPFZVIXAVDHIK-IUCAKERBSA-N 0.000 description 1
- QSVCIFZPGLOZGH-WDSKDSINSA-N Gly-Glu-Ser Chemical compound NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O QSVCIFZPGLOZGH-WDSKDSINSA-N 0.000 description 1
- CUYLIWAAAYJKJH-RYUDHWBXSA-N Gly-Glu-Tyr Chemical compound NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 CUYLIWAAAYJKJH-RYUDHWBXSA-N 0.000 description 1
- JSNNHGHYGYMVCK-XVKPBYJWSA-N Gly-Glu-Val Chemical compound [H]NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O JSNNHGHYGYMVCK-XVKPBYJWSA-N 0.000 description 1
- CCQOOWAONKGYKQ-BYPYZUCNSA-N Gly-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)CN CCQOOWAONKGYKQ-BYPYZUCNSA-N 0.000 description 1
- UFPXDFOYHVEIPI-BYPYZUCNSA-N Gly-Gly-Asp Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CC(O)=O UFPXDFOYHVEIPI-BYPYZUCNSA-N 0.000 description 1
- PDAWDNVHMUKWJR-ZETCQYMHSA-N Gly-Gly-His Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CC1=CNC=N1 PDAWDNVHMUKWJR-ZETCQYMHSA-N 0.000 description 1
- BUEFQXUHTUZXHR-LURJTMIESA-N Gly-Gly-Pro zwitterion Chemical compound NCC(=O)NCC(=O)N1CCC[C@H]1C(O)=O BUEFQXUHTUZXHR-LURJTMIESA-N 0.000 description 1
- UQJNXZSSGQIPIQ-FBCQKBJTSA-N Gly-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)CN UQJNXZSSGQIPIQ-FBCQKBJTSA-N 0.000 description 1
- HHSOPSCKAZKQHQ-PEXQALLHSA-N Gly-His-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)CN HHSOPSCKAZKQHQ-PEXQALLHSA-N 0.000 description 1
- FSPVILZGHUJOHS-QWRGUYRKSA-N Gly-His-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CNC=N1 FSPVILZGHUJOHS-QWRGUYRKSA-N 0.000 description 1
- HPAIKDPJURGQLN-KBPBESRZSA-N Gly-His-Phe Chemical compound C([C@H](NC(=O)CN)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CNC=N1 HPAIKDPJURGQLN-KBPBESRZSA-N 0.000 description 1
- YFGONBOFGGWKKY-VHSXEESVSA-N Gly-His-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CN=CN2)NC(=O)CN)C(=O)O YFGONBOFGGWKKY-VHSXEESVSA-N 0.000 description 1
- SIYTVHWNKGIGMD-HOTGVXAUSA-N Gly-His-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CC3=CN=CN3)NC(=O)CN SIYTVHWNKGIGMD-HOTGVXAUSA-N 0.000 description 1
- YNIMVVJTPWCUJH-KBPBESRZSA-N Gly-His-Tyr Chemical compound [H]NCC(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O YNIMVVJTPWCUJH-KBPBESRZSA-N 0.000 description 1
- SWQALSGKVLYKDT-ZKWXMUAHSA-N Gly-Ile-Ala Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O SWQALSGKVLYKDT-ZKWXMUAHSA-N 0.000 description 1
- UESJMAMHDLEHGM-NHCYSSNCSA-N Gly-Ile-Leu Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O UESJMAMHDLEHGM-NHCYSSNCSA-N 0.000 description 1
- BHPQOIPBLYJNAW-NGZCFLSTSA-N Gly-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)CN BHPQOIPBLYJNAW-NGZCFLSTSA-N 0.000 description 1
- PAWIVEIWWYGBAM-YUMQZZPRSA-N Gly-Leu-Ala Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O PAWIVEIWWYGBAM-YUMQZZPRSA-N 0.000 description 1
- NNCSJUBVFBDDLC-YUMQZZPRSA-N Gly-Leu-Ser Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O NNCSJUBVFBDDLC-YUMQZZPRSA-N 0.000 description 1
- AFWYPMDMDYCKMD-KBPBESRZSA-N Gly-Leu-Tyr Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 AFWYPMDMDYCKMD-KBPBESRZSA-N 0.000 description 1
- GMTXWRIDLGTVFC-IUCAKERBSA-N Gly-Lys-Glu Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O GMTXWRIDLGTVFC-IUCAKERBSA-N 0.000 description 1
- PDUHNKAFQXQNLH-ZETCQYMHSA-N Gly-Lys-Gly Chemical compound NCCCC[C@H](NC(=O)CN)C(=O)NCC(O)=O PDUHNKAFQXQNLH-ZETCQYMHSA-N 0.000 description 1
- HHRODZSXDXMUHS-LURJTMIESA-N Gly-Met-Gly Chemical compound CSCC[C@H](NC(=O)C[NH3+])C(=O)NCC([O-])=O HHRODZSXDXMUHS-LURJTMIESA-N 0.000 description 1
- OMOZPGCHVWOXHN-BQBZGAKWSA-N Gly-Met-Ser Chemical compound CSCC[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)CN OMOZPGCHVWOXHN-BQBZGAKWSA-N 0.000 description 1
- FJWSJWACLMTDMI-WPRPVWTQSA-N Gly-Met-Val Chemical compound [H]NCC(=O)N[C@@H](CCSC)C(=O)N[C@@H](C(C)C)C(O)=O FJWSJWACLMTDMI-WPRPVWTQSA-N 0.000 description 1
- QVDGHDFFYHKJPN-QWRGUYRKSA-N Gly-Phe-Cys Chemical compound NCC(=O)N[C@@H](Cc1ccccc1)C(=O)N[C@@H](CS)C(O)=O QVDGHDFFYHKJPN-QWRGUYRKSA-N 0.000 description 1
- IGOYNRWLWHWAQO-JTQLQIEISA-N Gly-Phe-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)CN)CC1=CC=CC=C1 IGOYNRWLWHWAQO-JTQLQIEISA-N 0.000 description 1
- JPVGHHQGKPQYIL-KBPBESRZSA-N Gly-Phe-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CC=CC=C1 JPVGHHQGKPQYIL-KBPBESRZSA-N 0.000 description 1
- FEUPVVCGQLNXNP-IRXDYDNUSA-N Gly-Phe-Phe Chemical compound C([C@H](NC(=O)CN)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 FEUPVVCGQLNXNP-IRXDYDNUSA-N 0.000 description 1
- CAVKXZMMDNOZJU-UHFFFAOYSA-N Gly-Pro-Ala-Gly-Pro Natural products C1CCC(C(O)=O)N1C(=O)CNC(=O)C(C)NC(=O)C1CCCN1C(=O)CN CAVKXZMMDNOZJU-UHFFFAOYSA-N 0.000 description 1
- SCJJPCQUJYPHRZ-BQBZGAKWSA-N Gly-Pro-Asn Chemical compound NCC(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O SCJJPCQUJYPHRZ-BQBZGAKWSA-N 0.000 description 1
- QSQXZZCGPXQBPP-BQBZGAKWSA-N Gly-Pro-Cys Chemical compound C1C[C@H](N(C1)C(=O)CN)C(=O)N[C@@H](CS)C(=O)O QSQXZZCGPXQBPP-BQBZGAKWSA-N 0.000 description 1
- NSVOVKWEKGEOQB-LURJTMIESA-N Gly-Pro-Gly Chemical compound NCC(=O)N1CCC[C@H]1C(=O)NCC(O)=O NSVOVKWEKGEOQB-LURJTMIESA-N 0.000 description 1
- JNGHLWWFPGIJER-STQMWFEESA-N Gly-Pro-Tyr Chemical compound NCC(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 JNGHLWWFPGIJER-STQMWFEESA-N 0.000 description 1
- YOBGUCWZPXJHTN-BQBZGAKWSA-N Gly-Ser-Arg Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCN=C(N)N YOBGUCWZPXJHTN-BQBZGAKWSA-N 0.000 description 1
- CSMYMGFCEJWALV-WDSKDSINSA-N Gly-Ser-Gln Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(N)=O CSMYMGFCEJWALV-WDSKDSINSA-N 0.000 description 1
- JSLVAHYTAJJEQH-QWRGUYRKSA-N Gly-Ser-Phe Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 JSLVAHYTAJJEQH-QWRGUYRKSA-N 0.000 description 1
- WCORRBXVISTKQL-WHFBIAKZSA-N Gly-Ser-Ser Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O WCORRBXVISTKQL-WHFBIAKZSA-N 0.000 description 1
- LCRDMSSAKLTKBU-ZDLURKLDSA-N Gly-Ser-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)CN LCRDMSSAKLTKBU-ZDLURKLDSA-N 0.000 description 1
- IMRNSEPSPFQNHF-STQMWFEESA-N Gly-Ser-Trp Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CNC2=CC=CC=C12)C(=O)O IMRNSEPSPFQNHF-STQMWFEESA-N 0.000 description 1
- FKYQEVBRZSFAMJ-QWRGUYRKSA-N Gly-Ser-Tyr Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 FKYQEVBRZSFAMJ-QWRGUYRKSA-N 0.000 description 1
- ZLCLYFGMKFCDCN-XPUUQOCRSA-N Gly-Ser-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CO)NC(=O)CN)C(O)=O ZLCLYFGMKFCDCN-XPUUQOCRSA-N 0.000 description 1
- YXTFLTJYLIAZQG-FJXKBIBVSA-N Gly-Thr-Arg Chemical compound NCC(=O)N[C@@H]([C@H](O)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N YXTFLTJYLIAZQG-FJXKBIBVSA-N 0.000 description 1
- FKESCSGWBPUTPN-FOHZUACHSA-N Gly-Thr-Asn Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(O)=O FKESCSGWBPUTPN-FOHZUACHSA-N 0.000 description 1
- ZKJZBRHRWKLVSJ-ZDLURKLDSA-N Gly-Thr-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)CN)O ZKJZBRHRWKLVSJ-ZDLURKLDSA-N 0.000 description 1
- RHRLHXQWHCNJKR-PMVVWTBXSA-N Gly-Thr-His Chemical compound NCC(=O)N[C@@H]([C@H](O)C)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 RHRLHXQWHCNJKR-PMVVWTBXSA-N 0.000 description 1
- ZZWUYQXMIFTIIY-WEDXCCLWSA-N Gly-Thr-Leu Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O ZZWUYQXMIFTIIY-WEDXCCLWSA-N 0.000 description 1
- XHVONGZZVUUORG-WEDXCCLWSA-N Gly-Thr-Lys Chemical compound NCC(=O)N[C@@H]([C@H](O)C)C(=O)N[C@H](C(O)=O)CCCCN XHVONGZZVUUORG-WEDXCCLWSA-N 0.000 description 1
- FFALDIDGPLUDKV-ZDLURKLDSA-N Gly-Thr-Ser Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O FFALDIDGPLUDKV-ZDLURKLDSA-N 0.000 description 1
- TVTZEOHWHUVYCG-KYNKHSRBSA-N Gly-Thr-Thr Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O TVTZEOHWHUVYCG-KYNKHSRBSA-N 0.000 description 1
- CUVBTVWFVIIDOC-YEPSODPASA-N Gly-Thr-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)O)NC(=O)CN CUVBTVWFVIIDOC-YEPSODPASA-N 0.000 description 1
- SFOXOSKVTLDEDM-HOTGVXAUSA-N Gly-Trp-Leu Chemical compound C1=CC=C2C(C[C@@H](C(=O)N[C@@H](CC(C)C)C(O)=O)NC(=O)CN)=CNC2=C1 SFOXOSKVTLDEDM-HOTGVXAUSA-N 0.000 description 1
- LKJCZEPXHOIAIW-HOTGVXAUSA-N Gly-Trp-Lys Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)CN LKJCZEPXHOIAIW-HOTGVXAUSA-N 0.000 description 1
- YJDALMUYJIENAG-QWRGUYRKSA-N Gly-Tyr-Asn Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)CN)O YJDALMUYJIENAG-QWRGUYRKSA-N 0.000 description 1
- OCRQUYDOYKCOQG-IRXDYDNUSA-N Gly-Tyr-Phe Chemical compound C([C@H](NC(=O)CN)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=C(O)C=C1 OCRQUYDOYKCOQG-IRXDYDNUSA-N 0.000 description 1
- LYZYGGWCBLBDMC-QWHCGFSZSA-N Gly-Tyr-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=C(C=C2)O)NC(=O)CN)C(=O)O LYZYGGWCBLBDMC-QWHCGFSZSA-N 0.000 description 1
- GBYYQVBXFVDJPJ-WLTAIBSBSA-N Gly-Tyr-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)CN)O GBYYQVBXFVDJPJ-WLTAIBSBSA-N 0.000 description 1
- JYGYNWYVKXENNE-OALUTQOASA-N Gly-Tyr-Trp Chemical compound [H]NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O JYGYNWYVKXENNE-OALUTQOASA-N 0.000 description 1
- GWCJMBNBFYBQCV-XPUUQOCRSA-N Gly-Val-Ala Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O GWCJMBNBFYBQCV-XPUUQOCRSA-N 0.000 description 1
- YDIDLLVFCYSXNY-RCOVLWMOSA-N Gly-Val-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)CN YDIDLLVFCYSXNY-RCOVLWMOSA-N 0.000 description 1
- RYAOJUMWLWUGNW-QMMMGPOBSA-N Gly-Val-Gly Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O RYAOJUMWLWUGNW-QMMMGPOBSA-N 0.000 description 1
- KSOBNUBCYHGUKH-UWVGGRQHSA-N Gly-Val-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)CN KSOBNUBCYHGUKH-UWVGGRQHSA-N 0.000 description 1
- 102000003886 Glycoproteins Human genes 0.000 description 1
- 108090000288 Glycoproteins Proteins 0.000 description 1
- 241000238631 Hexapoda Species 0.000 description 1
- DZMVESFTHXSSPZ-XVYDVKMFSA-N His-Ala-Ser Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O DZMVESFTHXSSPZ-XVYDVKMFSA-N 0.000 description 1
- HXKZJLWGSWQKEA-LSJOCFKGSA-N His-Ala-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CN=CN1 HXKZJLWGSWQKEA-LSJOCFKGSA-N 0.000 description 1
- YPLYIXGKCRQZGW-SRVKXCTJSA-N His-Arg-Glu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O YPLYIXGKCRQZGW-SRVKXCTJSA-N 0.000 description 1
- KYMUEAZVLPRVAE-GUBZILKMSA-N His-Asn-Glu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O KYMUEAZVLPRVAE-GUBZILKMSA-N 0.000 description 1
- XJQDHFMUUBRCGA-KKUMJFAQSA-N His-Asn-Phe Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O XJQDHFMUUBRCGA-KKUMJFAQSA-N 0.000 description 1
- VOEGKUNRHYKYSU-XVYDVKMFSA-N His-Asp-Ala Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O VOEGKUNRHYKYSU-XVYDVKMFSA-N 0.000 description 1
- ZJSMFRTVYSLKQU-DJFWLOJKSA-N His-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC1=CN=CN1)N ZJSMFRTVYSLKQU-DJFWLOJKSA-N 0.000 description 1
- UOAVQQRILDGZEN-SRVKXCTJSA-N His-Asp-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O UOAVQQRILDGZEN-SRVKXCTJSA-N 0.000 description 1
- LMMPTUVWHCFTOT-GARJFASQSA-N His-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC2=CN=CN2)N)C(=O)O LMMPTUVWHCFTOT-GARJFASQSA-N 0.000 description 1
- YOSQCYUFZGPIPC-PBCZWWQYSA-N His-Asp-Thr Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O YOSQCYUFZGPIPC-PBCZWWQYSA-N 0.000 description 1
- AASLOGQZZKZWKH-SRVKXCTJSA-N His-Cys-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N AASLOGQZZKZWKH-SRVKXCTJSA-N 0.000 description 1
- IIVZNQCUUMBBKF-GVXVVHGQSA-N His-Gln-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CC1=CN=CN1 IIVZNQCUUMBBKF-GVXVVHGQSA-N 0.000 description 1
- KNNSUUOHFVVJOP-GUBZILKMSA-N His-Glu-Ser Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CO)C(=O)O)N KNNSUUOHFVVJOP-GUBZILKMSA-N 0.000 description 1
- PQKCQZHAGILVIM-NKIYYHGXSA-N His-Glu-Thr Chemical compound C[C@@H](O)[C@H](NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)Cc1cnc[nH]1)C(O)=O PQKCQZHAGILVIM-NKIYYHGXSA-N 0.000 description 1
- STWGDDDFLUFCCA-GVXVVHGQSA-N His-Glu-Val Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O STWGDDDFLUFCCA-GVXVVHGQSA-N 0.000 description 1
- YADRBUZBKHHDAO-XPUUQOCRSA-N His-Gly-Ala Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)NCC(=O)N[C@@H](C)C(O)=O YADRBUZBKHHDAO-XPUUQOCRSA-N 0.000 description 1
- CHZRWFUGWRTUOD-IUCAKERBSA-N His-Gly-Gln Chemical compound C1=C(NC=N1)C[C@@H](C(=O)NCC(=O)N[C@@H](CCC(=O)N)C(=O)O)N CHZRWFUGWRTUOD-IUCAKERBSA-N 0.000 description 1
- NTXIJPDAHXSHNL-ONGXEEELSA-N His-Gly-Val Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O NTXIJPDAHXSHNL-ONGXEEELSA-N 0.000 description 1
- MLZVJIREOKTDAR-SIGLWIIPSA-N His-Ile-Ile Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O MLZVJIREOKTDAR-SIGLWIIPSA-N 0.000 description 1
- AIPUZFXMXAHZKY-QWRGUYRKSA-N His-Leu-Gly Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O AIPUZFXMXAHZKY-QWRGUYRKSA-N 0.000 description 1
- ZSKJIISDJXJQPV-BZSNNMDCSA-N His-Leu-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CN=CN1 ZSKJIISDJXJQPV-BZSNNMDCSA-N 0.000 description 1
- SKOKHBGDXGTDDP-MELADBBJSA-N His-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CN=CN2)N SKOKHBGDXGTDDP-MELADBBJSA-N 0.000 description 1
- XKIYNCLILDLGRS-QWRGUYRKSA-N His-Lys-Gly Chemical compound NCCCC[C@@H](C(=O)NCC(O)=O)NC(=O)[C@@H](N)CC1=CN=CN1 XKIYNCLILDLGRS-QWRGUYRKSA-N 0.000 description 1
- CMMBEMZGNGYJRJ-IHRRRGAJSA-N His-Met-His Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC2=CN=CN2)N CMMBEMZGNGYJRJ-IHRRRGAJSA-N 0.000 description 1
- WPUAVVXYEJAWIV-KKUMJFAQSA-N His-Phe-Cys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC2=CN=CN2)N WPUAVVXYEJAWIV-KKUMJFAQSA-N 0.000 description 1
- BSVLMPMIXPQNKC-KBPBESRZSA-N His-Phe-Gly Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)NCC(O)=O BSVLMPMIXPQNKC-KBPBESRZSA-N 0.000 description 1
- VCBWXASUBZIFLQ-IHRRRGAJSA-N His-Pro-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O VCBWXASUBZIFLQ-IHRRRGAJSA-N 0.000 description 1
- LNDVNHOSZQPJGI-AVGNSLFASA-N His-Pro-Pro Chemical compound C([C@H](N)C(=O)N1[C@@H](CCC1)C(=O)N1[C@@H](CCC1)C(O)=O)C1=CN=CN1 LNDVNHOSZQPJGI-AVGNSLFASA-N 0.000 description 1
- FHKZHRMERJUXRJ-DCAQKATOSA-N His-Ser-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CN=CN1 FHKZHRMERJUXRJ-DCAQKATOSA-N 0.000 description 1
- WKEABZIITNXXQZ-CIUDSAMLSA-N His-Ser-Cys Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)O)N WKEABZIITNXXQZ-CIUDSAMLSA-N 0.000 description 1
- UOYGZBIPZYKGSH-SRVKXCTJSA-N His-Ser-Lys Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)O)N UOYGZBIPZYKGSH-SRVKXCTJSA-N 0.000 description 1
- IAYPZSHNZQHQNO-KKUMJFAQSA-N His-Ser-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CC2=CN=CN2)N IAYPZSHNZQHQNO-KKUMJFAQSA-N 0.000 description 1
- VIJMRAIWYWRXSR-CIUDSAMLSA-N His-Ser-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CN=CN1 VIJMRAIWYWRXSR-CIUDSAMLSA-N 0.000 description 1
- ILUVWFTXAUYOBW-CUJWVEQBSA-N His-Ser-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CC1=CN=CN1)N)O ILUVWFTXAUYOBW-CUJWVEQBSA-N 0.000 description 1
- XHQYFGPIRUHQIB-PBCZWWQYSA-N His-Thr-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H]([C@H](O)C)NC(=O)[C@@H](N)CC1=CN=CN1 XHQYFGPIRUHQIB-PBCZWWQYSA-N 0.000 description 1
- IXQGOKWTQPCIQM-YJRXYDGGSA-N His-Thr-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC2=CN=CN2)N)O IXQGOKWTQPCIQM-YJRXYDGGSA-N 0.000 description 1
- NBWATNYAUVSAEQ-ZEILLAHLSA-N His-Thr-Thr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N)O NBWATNYAUVSAEQ-ZEILLAHLSA-N 0.000 description 1
- DEMIXZCKUXVEBO-BWAGICSOSA-N His-Thr-Tyr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)[C@H](CC2=CN=CN2)N)O DEMIXZCKUXVEBO-BWAGICSOSA-N 0.000 description 1
- UIRUVUUGUYCMBY-KCTSRDHCSA-N His-Trp-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@H](CC3=CN=CN3)N UIRUVUUGUYCMBY-KCTSRDHCSA-N 0.000 description 1
- FRDFAWHTPDKRHG-ULQDDVLXSA-N His-Tyr-Arg Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O)C1=CN=CN1 FRDFAWHTPDKRHG-ULQDDVLXSA-N 0.000 description 1
- ISQOVWDWRUONJH-YESZJQIVSA-N His-Tyr-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=C(C=C2)O)NC(=O)[C@H](CC3=CN=CN3)N)C(=O)O ISQOVWDWRUONJH-YESZJQIVSA-N 0.000 description 1
- GYXDQXPCPASCNR-NHCYSSNCSA-N His-Val-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N GYXDQXPCPASCNR-NHCYSSNCSA-N 0.000 description 1
- SYPULFZAGBBIOM-GVXVVHGQSA-N His-Val-Glu Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N SYPULFZAGBBIOM-GVXVVHGQSA-N 0.000 description 1
- CGAMSLMBYJHMDY-ONGXEEELSA-N His-Val-Gly Chemical compound CC(C)[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CC1=CN=CN1)N CGAMSLMBYJHMDY-ONGXEEELSA-N 0.000 description 1
- PUFNQIPSRXVLQJ-IHRRRGAJSA-N His-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N PUFNQIPSRXVLQJ-IHRRRGAJSA-N 0.000 description 1
- GGXUJBKENKVYNV-ULQDDVLXSA-N His-Val-Phe Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CC2=CN=CN2)N GGXUJBKENKVYNV-ULQDDVLXSA-N 0.000 description 1
- 101001098029 Homo sapiens 40S ribosomal protein S2 Proteins 0.000 description 1
- 101000934368 Homo sapiens CD63 antigen Proteins 0.000 description 1
- 101001066129 Homo sapiens Glyceraldehyde-3-phosphate dehydrogenase Proteins 0.000 description 1
- 101100368626 Homo sapiens TMEM106B gene Proteins 0.000 description 1
- VSZALHITQINTGC-GHCJXIJMSA-N Ile-Ala-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](CC(=O)O)C(=O)O)N VSZALHITQINTGC-GHCJXIJMSA-N 0.000 description 1
- AQCUAZTZSPQJFF-ZKWXMUAHSA-N Ile-Ala-Gly Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O AQCUAZTZSPQJFF-ZKWXMUAHSA-N 0.000 description 1
- CYHYBSGMHMHKOA-CIQUZCHMSA-N Ile-Ala-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N CYHYBSGMHMHKOA-CIQUZCHMSA-N 0.000 description 1
- WZPIKDWQVRTATP-SYWGBEHUSA-N Ile-Ala-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](C)NC(=O)[C@@H](N)[C@@H](C)CC)C(O)=O)=CNC2=C1 WZPIKDWQVRTATP-SYWGBEHUSA-N 0.000 description 1
- VZIFYHYNQDIPLI-HJWJTTGWSA-N Ile-Arg-Phe Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N VZIFYHYNQDIPLI-HJWJTTGWSA-N 0.000 description 1
- DMHGKBGOUAJRHU-UHFFFAOYSA-N Ile-Arg-Pro Natural products CCC(C)C(N)C(=O)NC(CCCN=C(N)N)C(=O)N1CCCC1C(O)=O DMHGKBGOUAJRHU-UHFFFAOYSA-N 0.000 description 1
- NULSANWBUWLTKN-NAKRPEOUSA-N Ile-Arg-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CO)C(=O)O)N NULSANWBUWLTKN-NAKRPEOUSA-N 0.000 description 1
- QTUSJASXLGLJSR-OSUNSFLBSA-N Ile-Arg-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N QTUSJASXLGLJSR-OSUNSFLBSA-N 0.000 description 1
- QYZYJFXHXYUZMZ-UGYAYLCHSA-N Ile-Asn-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)N)C(=O)O)N QYZYJFXHXYUZMZ-UGYAYLCHSA-N 0.000 description 1
- YPQDTQJBOFOTJQ-SXTJYALSSA-N Ile-Asn-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N YPQDTQJBOFOTJQ-SXTJYALSSA-N 0.000 description 1
- HDODQNPMSHDXJT-GHCJXIJMSA-N Ile-Asn-Ser Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O HDODQNPMSHDXJT-GHCJXIJMSA-N 0.000 description 1
- NCSIQAFSIPHVAN-IUKAMOBKSA-N Ile-Asn-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N NCSIQAFSIPHVAN-IUKAMOBKSA-N 0.000 description 1
- QSPLUJGYOPZINY-ZPFDUUQYSA-N Ile-Asp-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N QSPLUJGYOPZINY-ZPFDUUQYSA-N 0.000 description 1
- ZDNORQNHCJUVOV-KBIXCLLPSA-N Ile-Gln-Ala Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O ZDNORQNHCJUVOV-KBIXCLLPSA-N 0.000 description 1
- HOLOYAZCIHDQNS-YVNDNENWSA-N Ile-Gln-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N HOLOYAZCIHDQNS-YVNDNENWSA-N 0.000 description 1
- BALLIXFZYSECCF-QEWYBTABSA-N Ile-Gln-Phe Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N BALLIXFZYSECCF-QEWYBTABSA-N 0.000 description 1
- DVRDRICMWUSCBN-UKJIMTQDSA-N Ile-Gln-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](C(C)C)C(=O)O)N DVRDRICMWUSCBN-UKJIMTQDSA-N 0.000 description 1
- BEWFWZRGBDVXRP-PEFMBERDSA-N Ile-Glu-Asn Chemical compound [H]N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O BEWFWZRGBDVXRP-PEFMBERDSA-N 0.000 description 1
- UBHUJPVCJHPSEU-GRLWGSQLSA-N Ile-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N UBHUJPVCJHPSEU-GRLWGSQLSA-N 0.000 description 1
- SPQWWEZBHXHUJN-KBIXCLLPSA-N Ile-Glu-Ser Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O SPQWWEZBHXHUJN-KBIXCLLPSA-N 0.000 description 1
- OEQKGSPBDVKYOC-ZKWXMUAHSA-N Ile-Gly-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N[C@@H](CS)C(=O)O)N OEQKGSPBDVKYOC-ZKWXMUAHSA-N 0.000 description 1
- MQFGXJNSUJTXDT-QSFUFRPTSA-N Ile-Gly-Ile Chemical compound N[C@@H]([C@@H](C)CC)C(=O)NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)O MQFGXJNSUJTXDT-QSFUFRPTSA-N 0.000 description 1
- DFFTXLCCDFYRKD-MBLNEYKQSA-N Ile-Gly-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(=O)O)N DFFTXLCCDFYRKD-MBLNEYKQSA-N 0.000 description 1
- UAQSZXGJGLHMNV-XEGUGMAKSA-N Ile-Gly-Tyr Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N UAQSZXGJGLHMNV-XEGUGMAKSA-N 0.000 description 1
- KYLIZSDYWQQTFM-PEDHHIEDSA-N Ile-Ile-Arg Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CCCN=C(N)N KYLIZSDYWQQTFM-PEDHHIEDSA-N 0.000 description 1
- RIVKTKFVWXRNSJ-GRLWGSQLSA-N Ile-Ile-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N RIVKTKFVWXRNSJ-GRLWGSQLSA-N 0.000 description 1
- SJLVSMMIFYTSGY-GRLWGSQLSA-N Ile-Ile-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N SJLVSMMIFYTSGY-GRLWGSQLSA-N 0.000 description 1
- BBQABUDWDUKJMB-LZXPERKUSA-N Ile-Ile-Ile Chemical compound CC[C@H](C)[C@H]([NH3+])C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C([O-])=O BBQABUDWDUKJMB-LZXPERKUSA-N 0.000 description 1
- TWPSALMCEHCIOY-YTFOTSKYSA-N Ile-Ile-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(=O)O)N TWPSALMCEHCIOY-YTFOTSKYSA-N 0.000 description 1
- KBAPKNDWAGVGTH-IGISWZIWSA-N Ile-Ile-Tyr Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 KBAPKNDWAGVGTH-IGISWZIWSA-N 0.000 description 1
- FZWVCYCYWCLQDH-NHCYSSNCSA-N Ile-Leu-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)NCC(=O)O)N FZWVCYCYWCLQDH-NHCYSSNCSA-N 0.000 description 1
- UIEZQYNXCYHMQS-BJDJZHNGSA-N Ile-Lys-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)O)N UIEZQYNXCYHMQS-BJDJZHNGSA-N 0.000 description 1
- FFAUOCITXBMRBT-YTFOTSKYSA-N Ile-Lys-Ile Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O FFAUOCITXBMRBT-YTFOTSKYSA-N 0.000 description 1
- YSGBJIQXTIVBHZ-AJNGGQMLSA-N Ile-Lys-Leu Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O YSGBJIQXTIVBHZ-AJNGGQMLSA-N 0.000 description 1
- MASWXTFJVNRZPT-NAKRPEOUSA-N Ile-Met-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C)C(=O)O)N MASWXTFJVNRZPT-NAKRPEOUSA-N 0.000 description 1
- DNKDIDZHXZAGRY-HJWJTTGWSA-N Ile-Met-Phe Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N DNKDIDZHXZAGRY-HJWJTTGWSA-N 0.000 description 1
- SAVXZJYTTQQQDD-QEWYBTABSA-N Ile-Phe-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N SAVXZJYTTQQQDD-QEWYBTABSA-N 0.000 description 1
- UAELWXJFLZBKQS-WHOFXGATSA-N Ile-Phe-Gly Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](Cc1ccccc1)C(=O)NCC(O)=O UAELWXJFLZBKQS-WHOFXGATSA-N 0.000 description 1
- FGBRXCZYVRFNKQ-MXAVVETBSA-N Ile-Phe-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)O)N FGBRXCZYVRFNKQ-MXAVVETBSA-N 0.000 description 1
- VEPIBPGLTLPBDW-URLPEUOOSA-N Ile-Phe-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N VEPIBPGLTLPBDW-URLPEUOOSA-N 0.000 description 1
- BJECXJHLUJXPJQ-PYJNHQTQSA-N Ile-Pro-His Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N BJECXJHLUJXPJQ-PYJNHQTQSA-N 0.000 description 1
- JODPUDMBQBIWCK-GHCJXIJMSA-N Ile-Ser-Asn Chemical compound [H]N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O JODPUDMBQBIWCK-GHCJXIJMSA-N 0.000 description 1
- JNLSTRPWUXOORL-MMWGEVLESA-N Ile-Ser-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N1CCC[C@@H]1C(=O)O)N JNLSTRPWUXOORL-MMWGEVLESA-N 0.000 description 1
- PXKACEXYLPBMAD-JBDRJPRFSA-N Ile-Ser-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)O)N PXKACEXYLPBMAD-JBDRJPRFSA-N 0.000 description 1
- CNMOKANDJMLAIF-CIQUZCHMSA-N Ile-Thr-Ala Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O CNMOKANDJMLAIF-CIQUZCHMSA-N 0.000 description 1
- SAEWJTCJQVZQNZ-IUKAMOBKSA-N Ile-Thr-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N SAEWJTCJQVZQNZ-IUKAMOBKSA-N 0.000 description 1
- COWHUQXTSYTKQC-RWRJDSDZSA-N Ile-Thr-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N COWHUQXTSYTKQC-RWRJDSDZSA-N 0.000 description 1
- WXLYNEHOGRYNFU-URLPEUOOSA-N Ile-Thr-Phe Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N WXLYNEHOGRYNFU-URLPEUOOSA-N 0.000 description 1
- ANTFEOSJMAUGIB-KNZXXDILSA-N Ile-Thr-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@@H]1C(=O)O)N ANTFEOSJMAUGIB-KNZXXDILSA-N 0.000 description 1
- JTBFQNHKNRZJDS-SYWGBEHUSA-N Ile-Trp-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](C)C(=O)O)N JTBFQNHKNRZJDS-SYWGBEHUSA-N 0.000 description 1
- YWCJXQKATPNPOE-UKJIMTQDSA-N Ile-Val-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N YWCJXQKATPNPOE-UKJIMTQDSA-N 0.000 description 1
- UYODHPPSCXBNCS-XUXIUFHCSA-N Ile-Val-Leu Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC(C)C UYODHPPSCXBNCS-XUXIUFHCSA-N 0.000 description 1
- 108060003951 Immunoglobulin Proteins 0.000 description 1
- 206010061218 Inflammation Diseases 0.000 description 1
- 102100022297 Integrin alpha-X Human genes 0.000 description 1
- FADYJNXDPBKVCA-UHFFFAOYSA-N L-Phenylalanyl-L-lysin Natural products NCCCCC(C(O)=O)NC(=O)C(N)CC1=CC=CC=C1 FADYJNXDPBKVCA-UHFFFAOYSA-N 0.000 description 1
- UGTHTQWIQKEDEH-BQBZGAKWSA-N L-alanyl-L-prolylglycine zwitterion Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O UGTHTQWIQKEDEH-BQBZGAKWSA-N 0.000 description 1
- LHQIJBMDNUYRAM-UHFFFAOYSA-N L-erythro-Biopterin Natural products N1=C(N)NC(=O)C2=NC(C(O)C(O)C)=CN=C21 LHQIJBMDNUYRAM-UHFFFAOYSA-N 0.000 description 1
- RCFDOSNHHZGBOY-UHFFFAOYSA-N L-isoleucyl-L-alanine Natural products CCC(C)C(N)C(=O)NC(C)C(O)=O RCFDOSNHHZGBOY-UHFFFAOYSA-N 0.000 description 1
- LHSGPCFBGJHPCY-UHFFFAOYSA-N L-leucine-L-tyrosine Natural products CC(C)CC(N)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 LHSGPCFBGJHPCY-UHFFFAOYSA-N 0.000 description 1
- SENJXOPIZNYLHU-UHFFFAOYSA-N L-leucyl-L-arginine Natural products CC(C)CC(N)C(=O)NC(C(O)=O)CCCN=C(N)N SENJXOPIZNYLHU-UHFFFAOYSA-N 0.000 description 1
- KFKWRHQBZQICHA-STQMWFEESA-N L-leucyl-L-phenylalanine Natural products CC(C)C[C@H](N)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 KFKWRHQBZQICHA-STQMWFEESA-N 0.000 description 1
- FBOZXECLQNJBKD-ZDUSSCGKSA-N L-methotrexate Chemical compound C=1N=C2N=C(N)N=C(N)C2=NC=1CN(C)C1=CC=C(C(=O)N[C@@H](CCC(O)=O)C(O)=O)C=C1 FBOZXECLQNJBKD-ZDUSSCGKSA-N 0.000 description 1
- TYYLDKGBCJGJGW-UHFFFAOYSA-N L-tryptophan-L-tyrosine Natural products C=1NC2=CC=CC=C2C=1CC(N)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 TYYLDKGBCJGJGW-UHFFFAOYSA-N 0.000 description 1
- KVRKAGGMEWNURO-CIUDSAMLSA-N Leu-Ala-Cys Chemical compound C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC(C)C)N KVRKAGGMEWNURO-CIUDSAMLSA-N 0.000 description 1
- CQQGCWPXDHTTNF-GUBZILKMSA-N Leu-Ala-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCC(O)=O CQQGCWPXDHTTNF-GUBZILKMSA-N 0.000 description 1
- PBCHMHROGNUXMK-DLOVCJGASA-N Leu-Ala-His Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 PBCHMHROGNUXMK-DLOVCJGASA-N 0.000 description 1
- BQSLGJHIAGOZCD-CIUDSAMLSA-N Leu-Ala-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O BQSLGJHIAGOZCD-CIUDSAMLSA-N 0.000 description 1
- XBBKIIGCUMBKCO-JXUBOQSCSA-N Leu-Ala-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XBBKIIGCUMBKCO-JXUBOQSCSA-N 0.000 description 1
- HXWALXSAVBLTPK-NUTKFTJISA-N Leu-Ala-Trp Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CC(C)C)N HXWALXSAVBLTPK-NUTKFTJISA-N 0.000 description 1
- SUPVSFFZWVOEOI-UHFFFAOYSA-N Leu-Ala-Tyr Natural products CC(C)CC(N)C(=O)NC(C)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 SUPVSFFZWVOEOI-UHFFFAOYSA-N 0.000 description 1
- GRZSCTXVCDUIPO-SRVKXCTJSA-N Leu-Arg-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(O)=O GRZSCTXVCDUIPO-SRVKXCTJSA-N 0.000 description 1
- YOZCKMXHBYKOMQ-IHRRRGAJSA-N Leu-Arg-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCCN)C(=O)O)N YOZCKMXHBYKOMQ-IHRRRGAJSA-N 0.000 description 1
- IBMVEYRWAWIOTN-RWMBFGLXSA-N Leu-Arg-Pro Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N1CCC[C@@H]1C(O)=O IBMVEYRWAWIOTN-RWMBFGLXSA-N 0.000 description 1
- DBVWMYGBVFCRBE-CIUDSAMLSA-N Leu-Asn-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O DBVWMYGBVFCRBE-CIUDSAMLSA-N 0.000 description 1
- VIWUBXKCYJGNCL-SRVKXCTJSA-N Leu-Asn-His Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 VIWUBXKCYJGNCL-SRVKXCTJSA-N 0.000 description 1
- JKGHDYGZRDWHGA-SRVKXCTJSA-N Leu-Asn-Leu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O JKGHDYGZRDWHGA-SRVKXCTJSA-N 0.000 description 1
- RIMMMMYKGIBOSN-DCAQKATOSA-N Leu-Asn-Met Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCSC)C(O)=O RIMMMMYKGIBOSN-DCAQKATOSA-N 0.000 description 1
- OGCQGUIWMSBHRZ-CIUDSAMLSA-N Leu-Asn-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O OGCQGUIWMSBHRZ-CIUDSAMLSA-N 0.000 description 1
- USTCFDAQCLDPBD-XIRDDKMYSA-N Leu-Asn-Trp Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N USTCFDAQCLDPBD-XIRDDKMYSA-N 0.000 description 1
- MYGQXVYRZMKRDB-SRVKXCTJSA-N Leu-Asp-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN MYGQXVYRZMKRDB-SRVKXCTJSA-N 0.000 description 1
- JQSXWJXBASFONF-KKUMJFAQSA-N Leu-Asp-Phe Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O JQSXWJXBASFONF-KKUMJFAQSA-N 0.000 description 1
- GBDMISNMNXVTNV-XIRDDKMYSA-N Leu-Asp-Trp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O GBDMISNMNXVTNV-XIRDDKMYSA-N 0.000 description 1
- QLQHWWCSCLZUMA-KKUMJFAQSA-N Leu-Asp-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 QLQHWWCSCLZUMA-KKUMJFAQSA-N 0.000 description 1
- IIKJNQWOQIWWMR-CIUDSAMLSA-N Leu-Cys-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CC(C)C)N IIKJNQWOQIWWMR-CIUDSAMLSA-N 0.000 description 1
- NFHJQETXTSDZSI-DCAQKATOSA-N Leu-Cys-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O NFHJQETXTSDZSI-DCAQKATOSA-N 0.000 description 1
- VPKIQULSKFVCSM-SRVKXCTJSA-N Leu-Gln-Arg Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O VPKIQULSKFVCSM-SRVKXCTJSA-N 0.000 description 1
- KAFOIVJDVSZUMD-DCAQKATOSA-N Leu-Gln-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O KAFOIVJDVSZUMD-DCAQKATOSA-N 0.000 description 1
- KAFOIVJDVSZUMD-UHFFFAOYSA-N Leu-Gln-Gln Natural products CC(C)CC(N)C(=O)NC(CCC(N)=O)C(=O)NC(CCC(N)=O)C(O)=O KAFOIVJDVSZUMD-UHFFFAOYSA-N 0.000 description 1
- ZTLGVASZOIKNIX-DCAQKATOSA-N Leu-Gln-Glu Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N ZTLGVASZOIKNIX-DCAQKATOSA-N 0.000 description 1
- DPWGZWUMUUJQDT-IUCAKERBSA-N Leu-Gln-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O DPWGZWUMUUJQDT-IUCAKERBSA-N 0.000 description 1
- FQZPTCNSNPWHLJ-AVGNSLFASA-N Leu-Gln-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCCN)C(O)=O FQZPTCNSNPWHLJ-AVGNSLFASA-N 0.000 description 1
- CQGSYZCULZMEDE-SRVKXCTJSA-N Leu-Gln-Pro Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N1CCC[C@H]1C(O)=O CQGSYZCULZMEDE-SRVKXCTJSA-N 0.000 description 1
- GPICTNQYKHHHTH-GUBZILKMSA-N Leu-Gln-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O GPICTNQYKHHHTH-GUBZILKMSA-N 0.000 description 1
- QDSKNVXKLPQNOJ-GVXVVHGQSA-N Leu-Gln-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O QDSKNVXKLPQNOJ-GVXVVHGQSA-N 0.000 description 1
- KVMULWOHPPMHHE-DCAQKATOSA-N Leu-Glu-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O KVMULWOHPPMHHE-DCAQKATOSA-N 0.000 description 1
- NEEOBPIXKWSBRF-IUCAKERBSA-N Leu-Glu-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O NEEOBPIXKWSBRF-IUCAKERBSA-N 0.000 description 1
- IWTBYNQNAPECCS-AVGNSLFASA-N Leu-Glu-His Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 IWTBYNQNAPECCS-AVGNSLFASA-N 0.000 description 1
- PRZVBIAOPFGAQF-SRVKXCTJSA-N Leu-Glu-Met Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCSC)C(O)=O PRZVBIAOPFGAQF-SRVKXCTJSA-N 0.000 description 1
- WQWSMEOYXJTFRU-GUBZILKMSA-N Leu-Glu-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O WQWSMEOYXJTFRU-GUBZILKMSA-N 0.000 description 1
- ZFNLIDNJUWNIJL-WDCWCFNPSA-N Leu-Glu-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ZFNLIDNJUWNIJL-WDCWCFNPSA-N 0.000 description 1
- LLBQJYDYOLIQAI-JYJNAYRXSA-N Leu-Glu-Tyr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O LLBQJYDYOLIQAI-JYJNAYRXSA-N 0.000 description 1
- HVJVUYQWFYMGJS-GVXVVHGQSA-N Leu-Glu-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O HVJVUYQWFYMGJS-GVXVVHGQSA-N 0.000 description 1
- BABSVXFGKFLIGW-UWVGGRQHSA-N Leu-Gly-Arg Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCNC(N)=N BABSVXFGKFLIGW-UWVGGRQHSA-N 0.000 description 1
- KGCLIYGPQXUNLO-IUCAKERBSA-N Leu-Gly-Glu Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(O)=O KGCLIYGPQXUNLO-IUCAKERBSA-N 0.000 description 1
- VWHGTYCRDRBSFI-ZETCQYMHSA-N Leu-Gly-Gly Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)NCC(O)=O VWHGTYCRDRBSFI-ZETCQYMHSA-N 0.000 description 1
- YFBBUHJJUXXZOF-UWVGGRQHSA-N Leu-Gly-Pro Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N1CCC[C@H]1C(O)=O YFBBUHJJUXXZOF-UWVGGRQHSA-N 0.000 description 1
- VGPCJSXPPOQPBK-YUMQZZPRSA-N Leu-Gly-Ser Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O VGPCJSXPPOQPBK-YUMQZZPRSA-N 0.000 description 1
- HYMLKESRWLZDBR-WEDXCCLWSA-N Leu-Gly-Thr Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O HYMLKESRWLZDBR-WEDXCCLWSA-N 0.000 description 1
- POZULHZYLPGXMR-ONGXEEELSA-N Leu-Gly-Val Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O POZULHZYLPGXMR-ONGXEEELSA-N 0.000 description 1
- BTNXKBVLWJBTNR-SRVKXCTJSA-N Leu-His-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(N)=O)C(O)=O BTNXKBVLWJBTNR-SRVKXCTJSA-N 0.000 description 1
- DDEMUMVXNFPDKC-SRVKXCTJSA-N Leu-His-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CS)C(=O)O)N DDEMUMVXNFPDKC-SRVKXCTJSA-N 0.000 description 1
- BKTXKJMNTSMJDQ-AVGNSLFASA-N Leu-His-Gln Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N BKTXKJMNTSMJDQ-AVGNSLFASA-N 0.000 description 1
- KXODZBLFVFSLAI-AVGNSLFASA-N Leu-His-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CC(C)C)CC1=CN=CN1 KXODZBLFVFSLAI-AVGNSLFASA-N 0.000 description 1
- CFZZDVMBRYFFNU-QWRGUYRKSA-N Leu-His-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)NCC(O)=O CFZZDVMBRYFFNU-QWRGUYRKSA-N 0.000 description 1
- AOFYPTOHESIBFZ-KKUMJFAQSA-N Leu-His-His Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](Cc1cnc[nH]1)C(=O)N[C@@H](Cc1cnc[nH]1)C(O)=O AOFYPTOHESIBFZ-KKUMJFAQSA-N 0.000 description 1
- CSFVADKICPDRRF-KKUMJFAQSA-N Leu-His-Leu Chemical compound CC(C)C[C@H]([NH3+])C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C([O-])=O)CC1=CN=CN1 CSFVADKICPDRRF-KKUMJFAQSA-N 0.000 description 1
- OYQUOLRTJHWVSQ-SRVKXCTJSA-N Leu-His-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(O)=O OYQUOLRTJHWVSQ-SRVKXCTJSA-N 0.000 description 1
- LKXANTUNFMVCNF-IHPCNDPISA-N Leu-His-Trp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O LKXANTUNFMVCNF-IHPCNDPISA-N 0.000 description 1
- HMDDEJADNKQTBR-BZSNNMDCSA-N Leu-His-Tyr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O HMDDEJADNKQTBR-BZSNNMDCSA-N 0.000 description 1
- KOSWSHVQIVTVQF-ZPFDUUQYSA-N Leu-Ile-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O KOSWSHVQIVTVQF-ZPFDUUQYSA-N 0.000 description 1
- AUBMZAMQCOYSIC-MNXVOIDGSA-N Leu-Ile-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(N)=O)C(O)=O AUBMZAMQCOYSIC-MNXVOIDGSA-N 0.000 description 1
- JFSGIJSCJFQGSZ-MXAVVETBSA-N Leu-Ile-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC(C)C)N JFSGIJSCJFQGSZ-MXAVVETBSA-N 0.000 description 1
- KUIDCYNIEJBZBU-AJNGGQMLSA-N Leu-Ile-Leu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O KUIDCYNIEJBZBU-AJNGGQMLSA-N 0.000 description 1
- HNDWYLYAYNBWMP-AJNGGQMLSA-N Leu-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(C)C)N HNDWYLYAYNBWMP-AJNGGQMLSA-N 0.000 description 1
- OMHLATXVNQSALM-FQUUOJAGSA-N Leu-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(C)C)N OMHLATXVNQSALM-FQUUOJAGSA-N 0.000 description 1
- HRTRLSRYZZKPCO-BJDJZHNGSA-N Leu-Ile-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O HRTRLSRYZZKPCO-BJDJZHNGSA-N 0.000 description 1
- DSFYPIUSAMSERP-IHRRRGAJSA-N Leu-Leu-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N DSFYPIUSAMSERP-IHRRRGAJSA-N 0.000 description 1
- JNDYEOUZBLOVOF-AVGNSLFASA-N Leu-Leu-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O JNDYEOUZBLOVOF-AVGNSLFASA-N 0.000 description 1
- FAELBUXXFQLUAX-AJNGGQMLSA-N Leu-Leu-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC(C)C FAELBUXXFQLUAX-AJNGGQMLSA-N 0.000 description 1
- PPQRKXHCLYCBSP-IHRRRGAJSA-N Leu-Leu-Met Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(=O)O)N PPQRKXHCLYCBSP-IHRRRGAJSA-N 0.000 description 1
- UBZGNBKMIJHOHL-BZSNNMDCSA-N Leu-Leu-Phe Chemical compound CC(C)C[C@H]([NH3+])C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C([O-])=O)CC1=CC=CC=C1 UBZGNBKMIJHOHL-BZSNNMDCSA-N 0.000 description 1
- IEWBEPKLKUXQBU-VOAKCMCISA-N Leu-Leu-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O IEWBEPKLKUXQBU-VOAKCMCISA-N 0.000 description 1
- JLWZLIQRYCTYBD-IHRRRGAJSA-N Leu-Lys-Arg Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O JLWZLIQRYCTYBD-IHRRRGAJSA-N 0.000 description 1
- WXUOJXIGOPMDJM-SRVKXCTJSA-N Leu-Lys-Asn Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O WXUOJXIGOPMDJM-SRVKXCTJSA-N 0.000 description 1
- LVTJJOJKDCVZGP-QWRGUYRKSA-N Leu-Lys-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O LVTJJOJKDCVZGP-QWRGUYRKSA-N 0.000 description 1
- ARRIJPQRBWRNLT-DCAQKATOSA-N Leu-Met-Asn Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(=O)N)C(=O)O)N ARRIJPQRBWRNLT-DCAQKATOSA-N 0.000 description 1
- WXZOHBVPVKABQN-DCAQKATOSA-N Leu-Met-Asp Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(=O)O)C(=O)O)N WXZOHBVPVKABQN-DCAQKATOSA-N 0.000 description 1
- FLNPJLDPGMLWAU-UWVGGRQHSA-N Leu-Met-Gly Chemical compound OC(=O)CNC(=O)[C@H](CCSC)NC(=O)[C@@H](N)CC(C)C FLNPJLDPGMLWAU-UWVGGRQHSA-N 0.000 description 1
- POMXSEDNUXYPGK-IHRRRGAJSA-N Leu-Met-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N POMXSEDNUXYPGK-IHRRRGAJSA-N 0.000 description 1
- GNRPTBRHRRZCMA-RWMBFGLXSA-N Leu-Met-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCSC)C(=O)N1CCC[C@@H]1C(=O)O)N GNRPTBRHRRZCMA-RWMBFGLXSA-N 0.000 description 1
- HDHQQEDVWQGBEE-DCAQKATOSA-N Leu-Met-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CO)C(O)=O HDHQQEDVWQGBEE-DCAQKATOSA-N 0.000 description 1
- JVTYXRRFZCEPPK-RHYQMDGZSA-N Leu-Met-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](CC(C)C)N)O JVTYXRRFZCEPPK-RHYQMDGZSA-N 0.000 description 1
- NJMXCOOEFLMZSR-AVGNSLFASA-N Leu-Met-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C(C)C)C(O)=O NJMXCOOEFLMZSR-AVGNSLFASA-N 0.000 description 1
- BIZNDKMFQHDOIE-KKUMJFAQSA-N Leu-Phe-Asn Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(N)=O)C(O)=O)CC1=CC=CC=C1 BIZNDKMFQHDOIE-KKUMJFAQSA-N 0.000 description 1
- ZAVCJRJOQKIOJW-KKUMJFAQSA-N Leu-Phe-Asp Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(O)=O)C(O)=O)CC1=CC=CC=C1 ZAVCJRJOQKIOJW-KKUMJFAQSA-N 0.000 description 1
- PJWOOBTYQNNRBF-BZSNNMDCSA-N Leu-Phe-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCCN)C(=O)O)N PJWOOBTYQNNRBF-BZSNNMDCSA-N 0.000 description 1
- PTRKPHUGYULXPU-KKUMJFAQSA-N Leu-Phe-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O PTRKPHUGYULXPU-KKUMJFAQSA-N 0.000 description 1
- YWKNKRAKOCLOLH-OEAJRASXSA-N Leu-Phe-Thr Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H]([C@@H](C)O)C(O)=O)CC1=CC=CC=C1 YWKNKRAKOCLOLH-OEAJRASXSA-N 0.000 description 1
- MAXILRZVORNXBE-PMVMPFDFSA-N Leu-Phe-Trp Chemical compound C([C@H](NC(=O)[C@@H](N)CC(C)C)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(O)=O)C1=CC=CC=C1 MAXILRZVORNXBE-PMVMPFDFSA-N 0.000 description 1
- FYPWFNKQVVEELI-ULQDDVLXSA-N Leu-Phe-Val Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C(C)C)C(O)=O)CC1=CC=CC=C1 FYPWFNKQVVEELI-ULQDDVLXSA-N 0.000 description 1
- VTJUNIYRYIAIHF-IUCAKERBSA-N Leu-Pro Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(O)=O VTJUNIYRYIAIHF-IUCAKERBSA-N 0.000 description 1
- HGUUMQWGYCVPKG-DCAQKATOSA-N Leu-Pro-Cys Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CS)C(=O)O)N HGUUMQWGYCVPKG-DCAQKATOSA-N 0.000 description 1
- VULJUQZPSOASBZ-SRVKXCTJSA-N Leu-Pro-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O VULJUQZPSOASBZ-SRVKXCTJSA-N 0.000 description 1
- MUCIDQMDOYQYBR-IHRRRGAJSA-N Leu-Pro-His Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N MUCIDQMDOYQYBR-IHRRRGAJSA-N 0.000 description 1
- DPURXCQCHSQPAN-AVGNSLFASA-N Leu-Pro-Pro Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 DPURXCQCHSQPAN-AVGNSLFASA-N 0.000 description 1
- JLYUZRKPDKHUTC-WDSOQIARSA-N Leu-Pro-Trp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O JLYUZRKPDKHUTC-WDSOQIARSA-N 0.000 description 1
- UCXQIIIFOOGYEM-ULQDDVLXSA-N Leu-Pro-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 UCXQIIIFOOGYEM-ULQDDVLXSA-N 0.000 description 1
- IRMLZWSRWSGTOP-CIUDSAMLSA-N Leu-Ser-Ala Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O IRMLZWSRWSGTOP-CIUDSAMLSA-N 0.000 description 1
- IDGZVZJLYFTXSL-DCAQKATOSA-N Leu-Ser-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCN=C(N)N IDGZVZJLYFTXSL-DCAQKATOSA-N 0.000 description 1
- JIHDFWWRYHSAQB-GUBZILKMSA-N Leu-Ser-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O JIHDFWWRYHSAQB-GUBZILKMSA-N 0.000 description 1
- RGUXWMDNCPMQFB-YUMQZZPRSA-N Leu-Ser-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O RGUXWMDNCPMQFB-YUMQZZPRSA-N 0.000 description 1
- ADJWHHZETYAAAX-SRVKXCTJSA-N Leu-Ser-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N ADJWHHZETYAAAX-SRVKXCTJSA-N 0.000 description 1
- GOFJOGXGMPHOGL-DCAQKATOSA-N Leu-Ser-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC(C)C GOFJOGXGMPHOGL-DCAQKATOSA-N 0.000 description 1
- ZDJQVSIPFLMNOX-RHYQMDGZSA-N Leu-Thr-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N ZDJQVSIPFLMNOX-RHYQMDGZSA-N 0.000 description 1
- ICYRCNICGBJLGM-HJGDQZAQSA-N Leu-Thr-Asp Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(O)=O ICYRCNICGBJLGM-HJGDQZAQSA-N 0.000 description 1
- LCNASHSOFMRYFO-WDCWCFNPSA-N Leu-Thr-Gln Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCC(N)=O LCNASHSOFMRYFO-WDCWCFNPSA-N 0.000 description 1
- QWWPYKKLXWOITQ-VOAKCMCISA-N Leu-Thr-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(C)C QWWPYKKLXWOITQ-VOAKCMCISA-N 0.000 description 1
- IDGRADDMTTWOQC-WDSOQIARSA-N Leu-Trp-Arg Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O IDGRADDMTTWOQC-WDSOQIARSA-N 0.000 description 1
- LSLUTXRANSUGFY-XIRDDKMYSA-N Leu-Trp-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC(O)=O)C(O)=O LSLUTXRANSUGFY-XIRDDKMYSA-N 0.000 description 1
- HOMFINRJHIIZNJ-HOCLYGCPSA-N Leu-Trp-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)NCC(O)=O HOMFINRJHIIZNJ-HOCLYGCPSA-N 0.000 description 1
- FPFOYSCDUWTZBF-IHPCNDPISA-N Leu-Trp-Leu Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H]([NH3+])CC(C)C)C(=O)N[C@@H](CC(C)C)C([O-])=O)=CNC2=C1 FPFOYSCDUWTZBF-IHPCNDPISA-N 0.000 description 1
- SUYRAPCRSCCPAK-VFAJRCTISA-N Leu-Trp-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SUYRAPCRSCCPAK-VFAJRCTISA-N 0.000 description 1
- UCRJTSIIAYHOHE-ULQDDVLXSA-N Leu-Tyr-Arg Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N UCRJTSIIAYHOHE-ULQDDVLXSA-N 0.000 description 1
- VJGQRELPQWNURN-JYJNAYRXSA-N Leu-Tyr-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O VJGQRELPQWNURN-JYJNAYRXSA-N 0.000 description 1
- OZTZJMUZVAVJGY-BZSNNMDCSA-N Leu-Tyr-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N OZTZJMUZVAVJGY-BZSNNMDCSA-N 0.000 description 1
- BTEMNFBEAAOGBR-BZSNNMDCSA-N Leu-Tyr-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCCCN)C(=O)O)N BTEMNFBEAAOGBR-BZSNNMDCSA-N 0.000 description 1
- RDFIVFHPOSOXMW-ACRUOGEOSA-N Leu-Tyr-Phe Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O RDFIVFHPOSOXMW-ACRUOGEOSA-N 0.000 description 1
- BGGTYDNTOYRTTR-MEYUZBJRSA-N Leu-Tyr-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CC(C)C)N)O BGGTYDNTOYRTTR-MEYUZBJRSA-N 0.000 description 1
- YIRIDPUGZKHMHT-ACRUOGEOSA-N Leu-Tyr-Tyr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O YIRIDPUGZKHMHT-ACRUOGEOSA-N 0.000 description 1
- VQHUBNVKFFLWRP-ULQDDVLXSA-N Leu-Tyr-Val Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C(C)C)C(O)=O)CC1=CC=C(O)C=C1 VQHUBNVKFFLWRP-ULQDDVLXSA-N 0.000 description 1
- FBNPMTNBFFAMMH-UHFFFAOYSA-N Leu-Val-Arg Natural products CC(C)CC(N)C(=O)NC(C(C)C)C(=O)NC(C(O)=O)CCCN=C(N)N FBNPMTNBFFAMMH-UHFFFAOYSA-N 0.000 description 1
- XZNJZXJZBMBGGS-NHCYSSNCSA-N Leu-Val-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O XZNJZXJZBMBGGS-NHCYSSNCSA-N 0.000 description 1
- LMDVGHQPPPLYAR-IHRRRGAJSA-N Leu-Val-His Chemical compound N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)O LMDVGHQPPPLYAR-IHRRRGAJSA-N 0.000 description 1
- YQFZRHYZLARWDY-IHRRRGAJSA-N Leu-Val-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCCN YQFZRHYZLARWDY-IHRRRGAJSA-N 0.000 description 1
- XOEDPXDZJHBQIX-ULQDDVLXSA-N Leu-Val-Phe Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 XOEDPXDZJHBQIX-ULQDDVLXSA-N 0.000 description 1
- QESXLSQLQHHTIX-RHYQMDGZSA-N Leu-Val-Thr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O QESXLSQLQHHTIX-RHYQMDGZSA-N 0.000 description 1
- 208000019693 Lung disease Diseases 0.000 description 1
- FZIJIFCXUCZHOL-CIUDSAMLSA-N Lys-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCCN FZIJIFCXUCZHOL-CIUDSAMLSA-N 0.000 description 1
- KCXUCYYZNZFGLL-SRVKXCTJSA-N Lys-Ala-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O KCXUCYYZNZFGLL-SRVKXCTJSA-N 0.000 description 1
- UWKNTTJNVSYXPC-CIUDSAMLSA-N Lys-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCCN UWKNTTJNVSYXPC-CIUDSAMLSA-N 0.000 description 1
- KNKHAVVBVXKOGX-JXUBOQSCSA-N Lys-Ala-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KNKHAVVBVXKOGX-JXUBOQSCSA-N 0.000 description 1
- YRWCPXOFBKTCFY-NUTKFTJISA-N Lys-Ala-Trp Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CCCCN)N YRWCPXOFBKTCFY-NUTKFTJISA-N 0.000 description 1
- VHXMZJGOKIMETG-CQDKDKBSSA-N Lys-Ala-Tyr Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)[C@H](CCCCN)N VHXMZJGOKIMETG-CQDKDKBSSA-N 0.000 description 1
- CLBGMWIYPYAZPR-AVGNSLFASA-N Lys-Arg-Arg Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O CLBGMWIYPYAZPR-AVGNSLFASA-N 0.000 description 1
- ALSRJRIWBNENFY-DCAQKATOSA-N Lys-Arg-Asn Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(O)=O ALSRJRIWBNENFY-DCAQKATOSA-N 0.000 description 1
- BRSGXFITDXFMFF-IHRRRGAJSA-N Lys-Arg-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CCCCN)N BRSGXFITDXFMFF-IHRRRGAJSA-N 0.000 description 1
- YNNPKXBBRZVIRX-IHRRRGAJSA-N Lys-Arg-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O YNNPKXBBRZVIRX-IHRRRGAJSA-N 0.000 description 1
- SJNZALDHDUYDBU-IHRRRGAJSA-N Lys-Arg-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCCN)C(O)=O SJNZALDHDUYDBU-IHRRRGAJSA-N 0.000 description 1
- WALVCOOOKULCQM-ULQDDVLXSA-N Lys-Arg-Phe Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O WALVCOOOKULCQM-ULQDDVLXSA-N 0.000 description 1
- GGAPIOORBXHMNY-ULQDDVLXSA-N Lys-Arg-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CCCCN)N)O GGAPIOORBXHMNY-ULQDDVLXSA-N 0.000 description 1
- ZQCVMVCVPFYXHZ-SRVKXCTJSA-N Lys-Asn-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CCCCN ZQCVMVCVPFYXHZ-SRVKXCTJSA-N 0.000 description 1
- JBRWKVANRYPCAF-XIRDDKMYSA-N Lys-Asn-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCCCN)N JBRWKVANRYPCAF-XIRDDKMYSA-N 0.000 description 1
- QUYCUALODHJQLK-CIUDSAMLSA-N Lys-Asp-Asp Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O QUYCUALODHJQLK-CIUDSAMLSA-N 0.000 description 1
- AAORVPFVUIHEAB-YUMQZZPRSA-N Lys-Asp-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O AAORVPFVUIHEAB-YUMQZZPRSA-N 0.000 description 1
- IBQMEXQYZMVIFU-SRVKXCTJSA-N Lys-Asp-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCCCN)N IBQMEXQYZMVIFU-SRVKXCTJSA-N 0.000 description 1
- WGCKDDHUFPQSMZ-ZPFDUUQYSA-N Lys-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CCCCN WGCKDDHUFPQSMZ-ZPFDUUQYSA-N 0.000 description 1
- IWWMPCPLFXFBAF-SRVKXCTJSA-N Lys-Asp-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O IWWMPCPLFXFBAF-SRVKXCTJSA-N 0.000 description 1
- GKFNXYMAMKJSKD-NHCYSSNCSA-N Lys-Asp-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O GKFNXYMAMKJSKD-NHCYSSNCSA-N 0.000 description 1
- XTONYTDATVADQH-CIUDSAMLSA-N Lys-Cys-Asn Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(N)=O)C(O)=O XTONYTDATVADQH-CIUDSAMLSA-N 0.000 description 1
- RDIILCRAWOSDOQ-CIUDSAMLSA-N Lys-Cys-Asp Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(=O)O)C(=O)O)N RDIILCRAWOSDOQ-CIUDSAMLSA-N 0.000 description 1
- GGNOBVSOZPHLCE-GUBZILKMSA-N Lys-Gln-Asp Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O GGNOBVSOZPHLCE-GUBZILKMSA-N 0.000 description 1
- GUYHHBZCBQZLFW-GUBZILKMSA-N Lys-Gln-Cys Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CS)C(=O)O)N GUYHHBZCBQZLFW-GUBZILKMSA-N 0.000 description 1
- HWMZUBUEOYAQSC-DCAQKATOSA-N Lys-Gln-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O HWMZUBUEOYAQSC-DCAQKATOSA-N 0.000 description 1
- MQMIRLVJXQNTRJ-SDDRHHMPSA-N Lys-Gln-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCCCN)N)C(=O)O MQMIRLVJXQNTRJ-SDDRHHMPSA-N 0.000 description 1
- GJJQCBVRWDGLMQ-GUBZILKMSA-N Lys-Glu-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O GJJQCBVRWDGLMQ-GUBZILKMSA-N 0.000 description 1
- LLSUNJYOSCOOEB-GUBZILKMSA-N Lys-Glu-Asp Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O LLSUNJYOSCOOEB-GUBZILKMSA-N 0.000 description 1
- CRNNMTHBMRFQNG-GUBZILKMSA-N Lys-Glu-Cys Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CS)C(=O)O)N CRNNMTHBMRFQNG-GUBZILKMSA-N 0.000 description 1
- LPAJOCKCPRZEAG-MNXVOIDGSA-N Lys-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCCCN LPAJOCKCPRZEAG-MNXVOIDGSA-N 0.000 description 1
- VEGLGAOVLFODGC-GUBZILKMSA-N Lys-Glu-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O VEGLGAOVLFODGC-GUBZILKMSA-N 0.000 description 1
- ULUQBUKAPDUKOC-GVXVVHGQSA-N Lys-Glu-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O ULUQBUKAPDUKOC-GVXVVHGQSA-N 0.000 description 1
- GQZMPWBZQALKJO-UWVGGRQHSA-N Lys-Gly-Arg Chemical compound [H]N[C@@H](CCCCN)C(=O)NCC(=O)N[C@@H](CCCNC(N)=N)C(O)=O GQZMPWBZQALKJO-UWVGGRQHSA-N 0.000 description 1
- DKTNGXVSCZULPO-YUMQZZPRSA-N Lys-Gly-Cys Chemical compound NCCCC[C@H](N)C(=O)NCC(=O)N[C@@H](CS)C(O)=O DKTNGXVSCZULPO-YUMQZZPRSA-N 0.000 description 1
- XNKDCYABMBBEKN-IUCAKERBSA-N Lys-Gly-Gln Chemical compound NCCCC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(N)=O XNKDCYABMBBEKN-IUCAKERBSA-N 0.000 description 1
- UETQMSASAVBGJY-QWRGUYRKSA-N Lys-Gly-His Chemical compound NCCCC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC1=CNC=N1 UETQMSASAVBGJY-QWRGUYRKSA-N 0.000 description 1
- FHIAJWBDZVHLAH-YUMQZZPRSA-N Lys-Gly-Ser Chemical compound NCCCC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O FHIAJWBDZVHLAH-YUMQZZPRSA-N 0.000 description 1
- WOEDRPCHKPSFDT-MXAVVETBSA-N Lys-His-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CCCCN)N WOEDRPCHKPSFDT-MXAVVETBSA-N 0.000 description 1
- GNLJXWBNLAIPEP-MELADBBJSA-N Lys-His-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CN=CN2)NC(=O)[C@H](CCCCN)N)C(=O)O GNLJXWBNLAIPEP-MELADBBJSA-N 0.000 description 1
- CTBMEDOQJFGNMI-IHPCNDPISA-N Lys-His-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CC3=CN=CN3)NC(=O)[C@H](CCCCN)N CTBMEDOQJFGNMI-IHPCNDPISA-N 0.000 description 1
- OIYWBDBHEGAVST-BZSNNMDCSA-N Lys-His-Tyr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O OIYWBDBHEGAVST-BZSNNMDCSA-N 0.000 description 1
- SLQJJFAVWSZLBL-BJDJZHNGSA-N Lys-Ile-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCCCN SLQJJFAVWSZLBL-BJDJZHNGSA-N 0.000 description 1
- MXMDJEJWERYPMO-XUXIUFHCSA-N Lys-Ile-Arg Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O MXMDJEJWERYPMO-XUXIUFHCSA-N 0.000 description 1
- IUWMQCZOTYRXPL-ZPFDUUQYSA-N Lys-Ile-Asp Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O IUWMQCZOTYRXPL-ZPFDUUQYSA-N 0.000 description 1
- XDPLZVNMYQOFQZ-BJDJZHNGSA-N Lys-Ile-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCCCN)N XDPLZVNMYQOFQZ-BJDJZHNGSA-N 0.000 description 1
- ZXFRGTAIIZHNHG-AJNGGQMLSA-N Lys-Ile-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)O)NC(=O)[C@H](CCCCN)N ZXFRGTAIIZHNHG-AJNGGQMLSA-N 0.000 description 1
- MYZMQWHPDAYKIE-SRVKXCTJSA-N Lys-Leu-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O MYZMQWHPDAYKIE-SRVKXCTJSA-N 0.000 description 1
- OVAOHZIOUBEQCJ-IHRRRGAJSA-N Lys-Leu-Arg Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O OVAOHZIOUBEQCJ-IHRRRGAJSA-N 0.000 description 1
- MUXNCRWTWBMNHX-SRVKXCTJSA-N Lys-Leu-Asp Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O MUXNCRWTWBMNHX-SRVKXCTJSA-N 0.000 description 1
- ONPDTSFZAIWMDI-AVGNSLFASA-N Lys-Leu-Gln Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O ONPDTSFZAIWMDI-AVGNSLFASA-N 0.000 description 1
- VMTYLUGCXIEDMV-QWRGUYRKSA-N Lys-Leu-Gly Chemical compound OC(=O)CNC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CCCCN VMTYLUGCXIEDMV-QWRGUYRKSA-N 0.000 description 1
- WVJNGSFKBKOKRV-AJNGGQMLSA-N Lys-Leu-Ile Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WVJNGSFKBKOKRV-AJNGGQMLSA-N 0.000 description 1
- RBEATVHTWHTHTJ-KKUMJFAQSA-N Lys-Leu-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(O)=O RBEATVHTWHTHTJ-KKUMJFAQSA-N 0.000 description 1
- YPLVCBKEPJPBDQ-MELADBBJSA-N Lys-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCCN)N YPLVCBKEPJPBDQ-MELADBBJSA-N 0.000 description 1
- OIQSIMFSVLLWBX-VOAKCMCISA-N Lys-Leu-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OIQSIMFSVLLWBX-VOAKCMCISA-N 0.000 description 1
- PFZWARWVRNTPBR-IHPCNDPISA-N Lys-Leu-Trp Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CCCCN)N PFZWARWVRNTPBR-IHPCNDPISA-N 0.000 description 1
- LJADEBULDNKJNK-IHRRRGAJSA-N Lys-Leu-Val Chemical compound CC(C)C[C@H](NC(=O)[C@@H](N)CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O LJADEBULDNKJNK-IHRRRGAJSA-N 0.000 description 1
- ZJWIXBZTAAJERF-IHRRRGAJSA-N Lys-Lys-Arg Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CCCN=C(N)N ZJWIXBZTAAJERF-IHRRRGAJSA-N 0.000 description 1
- UQRZFMQQXXJTTF-AVGNSLFASA-N Lys-Lys-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O UQRZFMQQXXJTTF-AVGNSLFASA-N 0.000 description 1
- GAHJXEMYXKLZRQ-AJNGGQMLSA-N Lys-Lys-Ile Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O GAHJXEMYXKLZRQ-AJNGGQMLSA-N 0.000 description 1
- INMBONMDMGPADT-AVGNSLFASA-N Lys-Met-Met Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CCCCN)N INMBONMDMGPADT-AVGNSLFASA-N 0.000 description 1
- ALEVUGKHINJNIF-QEJZJMRPSA-N Lys-Phe-Ala Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C)C(O)=O)CC1=CC=CC=C1 ALEVUGKHINJNIF-QEJZJMRPSA-N 0.000 description 1
- TWPCWKVOZDUYAA-KKUMJFAQSA-N Lys-Phe-Asp Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O TWPCWKVOZDUYAA-KKUMJFAQSA-N 0.000 description 1
- PIXVFCBYEGPZPA-JYJNAYRXSA-N Lys-Phe-Gln Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CCCCN)N PIXVFCBYEGPZPA-JYJNAYRXSA-N 0.000 description 1
- IPTUBUUIFRZMJK-ACRUOGEOSA-N Lys-Phe-Phe Chemical compound C([C@H](NC(=O)[C@@H](N)CCCCN)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 IPTUBUUIFRZMJK-ACRUOGEOSA-N 0.000 description 1
- LUAJJLPHUXPQLH-KKUMJFAQSA-N Lys-Phe-Ser Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CCCCN)N LUAJJLPHUXPQLH-KKUMJFAQSA-N 0.000 description 1
- NQSFIPWBPXNJII-PMVMPFDFSA-N Lys-Phe-Trp Chemical compound C([C@H](NC(=O)[C@@H](N)CCCCN)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(O)=O)C1=CC=CC=C1 NQSFIPWBPXNJII-PMVMPFDFSA-N 0.000 description 1
- WGILOYIKJVQUPT-DCAQKATOSA-N Lys-Pro-Asp Chemical compound [H]N[C@@H](CCCCN)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O WGILOYIKJVQUPT-DCAQKATOSA-N 0.000 description 1
- SVSQSPICRKBMSZ-SRVKXCTJSA-N Lys-Pro-Gln Chemical compound [H]N[C@@H](CCCCN)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O SVSQSPICRKBMSZ-SRVKXCTJSA-N 0.000 description 1
- UQJOKDAYFULYIX-AVGNSLFASA-N Lys-Pro-Pro Chemical compound NCCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 UQJOKDAYFULYIX-AVGNSLFASA-N 0.000 description 1
- LOGFVTREOLYCPF-RHYQMDGZSA-N Lys-Pro-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CCCCN LOGFVTREOLYCPF-RHYQMDGZSA-N 0.000 description 1
- DYJOORGDQIGZAS-DCAQKATOSA-N Lys-Ser-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCCCN)N DYJOORGDQIGZAS-DCAQKATOSA-N 0.000 description 1
- DIBZLYZXTSVGLN-CIUDSAMLSA-N Lys-Ser-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O DIBZLYZXTSVGLN-CIUDSAMLSA-N 0.000 description 1
- CUHGAUZONORRIC-HJGDQZAQSA-N Lys-Thr-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCCCN)N)O CUHGAUZONORRIC-HJGDQZAQSA-N 0.000 description 1
- GIKFNMZSGYAPEJ-HJGDQZAQSA-N Lys-Thr-Asp Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(O)=O GIKFNMZSGYAPEJ-HJGDQZAQSA-N 0.000 description 1
- QVTDVTONTRSQMF-WDCWCFNPSA-N Lys-Thr-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H]([C@H](O)C)NC(=O)[C@@H](N)CCCCN QVTDVTONTRSQMF-WDCWCFNPSA-N 0.000 description 1
- BDFHWFUAQLIMJO-KXNHARMFSA-N Lys-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCCN)N)O BDFHWFUAQLIMJO-KXNHARMFSA-N 0.000 description 1
- NROQVSYLPRLJIP-PMVMPFDFSA-N Lys-Trp-Tyr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O NROQVSYLPRLJIP-PMVMPFDFSA-N 0.000 description 1
- PELXPRPDQRFBGQ-KKUMJFAQSA-N Lys-Tyr-Asn Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCCCN)N)O PELXPRPDQRFBGQ-KKUMJFAQSA-N 0.000 description 1
- RQILLQOQXLZTCK-KBPBESRZSA-N Lys-Tyr-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(O)=O RQILLQOQXLZTCK-KBPBESRZSA-N 0.000 description 1
- VWPJQIHBBOJWDN-DCAQKATOSA-N Lys-Val-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O VWPJQIHBBOJWDN-DCAQKATOSA-N 0.000 description 1
- QLFAPXUXEBAWEK-NHCYSSNCSA-N Lys-Val-Asp Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O QLFAPXUXEBAWEK-NHCYSSNCSA-N 0.000 description 1
- UGCIQUYEJIEHKX-GVXVVHGQSA-N Lys-Val-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O UGCIQUYEJIEHKX-GVXVVHGQSA-N 0.000 description 1
- BWECSLVQIWEMSC-IHRRRGAJSA-N Lys-Val-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CCCCN)N BWECSLVQIWEMSC-IHRRRGAJSA-N 0.000 description 1
- NYTDJEZBAAFLLG-IHRRRGAJSA-N Lys-Val-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCCN)C(O)=O NYTDJEZBAAFLLG-IHRRRGAJSA-N 0.000 description 1
- IKXQOBUBZSOWDY-AVGNSLFASA-N Lys-Val-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)O)NC(=O)[C@H](CCCCN)N IKXQOBUBZSOWDY-AVGNSLFASA-N 0.000 description 1
- 108010009491 Lysosomal-Associated Membrane Protein 2 Proteins 0.000 description 1
- 108010064171 Lysosome-Associated Membrane Glycoproteins Proteins 0.000 description 1
- 102000014944 Lysosome-Associated Membrane Glycoproteins Human genes 0.000 description 1
- 102100038225 Lysosome-associated membrane glycoprotein 2 Human genes 0.000 description 1
- VHGIWFGJIHTASW-FXQIFTODSA-N Met-Ala-Asp Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC(O)=O VHGIWFGJIHTASW-FXQIFTODSA-N 0.000 description 1
- ONGCSGVHCSAATF-CIUDSAMLSA-N Met-Ala-Glu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCC(O)=O ONGCSGVHCSAATF-CIUDSAMLSA-N 0.000 description 1
- QGQGAIBGTUJRBR-NAKRPEOUSA-N Met-Ala-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCSC QGQGAIBGTUJRBR-NAKRPEOUSA-N 0.000 description 1
- QEVRUYFHWJJUHZ-DCAQKATOSA-N Met-Ala-Leu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC(C)C QEVRUYFHWJJUHZ-DCAQKATOSA-N 0.000 description 1
- BLIPQDLSCFGUFA-GUBZILKMSA-N Met-Arg-Asn Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(O)=O BLIPQDLSCFGUFA-GUBZILKMSA-N 0.000 description 1
- DSWOTZCVCBEPOU-IUCAKERBSA-N Met-Arg-Gly Chemical compound CSCC[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CCCNC(N)=N DSWOTZCVCBEPOU-IUCAKERBSA-N 0.000 description 1
- YNOVBMBQSQTLFM-DCAQKATOSA-N Met-Asn-Leu Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O YNOVBMBQSQTLFM-DCAQKATOSA-N 0.000 description 1
- DRXODWRPPUFIAY-DCAQKATOSA-N Met-Asn-Lys Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CCCCN DRXODWRPPUFIAY-DCAQKATOSA-N 0.000 description 1
- IHITVQKJXQQGLJ-LPEHRKFASA-N Met-Asn-Pro Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N1CCC[C@@H]1C(=O)O)N IHITVQKJXQQGLJ-LPEHRKFASA-N 0.000 description 1
- CAODKDAPYGUMLK-FXQIFTODSA-N Met-Asn-Ser Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O CAODKDAPYGUMLK-FXQIFTODSA-N 0.000 description 1
- XOMXAVJBLRROMC-IHRRRGAJSA-N Met-Asp-Phe Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 XOMXAVJBLRROMC-IHRRRGAJSA-N 0.000 description 1
- DNDVVILEHVMWIS-LPEHRKFASA-N Met-Asp-Pro Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N DNDVVILEHVMWIS-LPEHRKFASA-N 0.000 description 1
- AVTWKENDGGUWDC-BQBZGAKWSA-N Met-Cys-Gly Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CS)C(=O)NCC(O)=O AVTWKENDGGUWDC-BQBZGAKWSA-N 0.000 description 1
- GPVLSVCBKUCEBI-KKUMJFAQSA-N Met-Gln-Phe Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O GPVLSVCBKUCEBI-KKUMJFAQSA-N 0.000 description 1
- CRGKLOXHKICQOL-GARJFASQSA-N Met-Gln-Pro Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N1CCC[C@@H]1C(=O)O)N CRGKLOXHKICQOL-GARJFASQSA-N 0.000 description 1
- HHCOOFPGNXKFGR-HJGDQZAQSA-N Met-Gln-Thr Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O HHCOOFPGNXKFGR-HJGDQZAQSA-N 0.000 description 1
- PHWSCIFNNLLUFJ-NHCYSSNCSA-N Met-Gln-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCSC)N PHWSCIFNNLLUFJ-NHCYSSNCSA-N 0.000 description 1
- UKUMISIRZAVYOG-CIUDSAMLSA-N Met-Glu-Cys Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CS)C(O)=O UKUMISIRZAVYOG-CIUDSAMLSA-N 0.000 description 1
- VZBXCMCHIHEPBL-SRVKXCTJSA-N Met-Glu-Lys Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN VZBXCMCHIHEPBL-SRVKXCTJSA-N 0.000 description 1
- RNAGAJXCSPDPRK-KKUMJFAQSA-N Met-Glu-Phe Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 RNAGAJXCSPDPRK-KKUMJFAQSA-N 0.000 description 1
- OGAZPKJHHZPYFK-GARJFASQSA-N Met-Glu-Pro Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N OGAZPKJHHZPYFK-GARJFASQSA-N 0.000 description 1
- IUYCGMNKIZDRQI-BQBZGAKWSA-N Met-Gly-Ala Chemical compound CSCC[C@H](N)C(=O)NCC(=O)N[C@@H](C)C(O)=O IUYCGMNKIZDRQI-BQBZGAKWSA-N 0.000 description 1
- FYRUJIJAUPHUNB-IUCAKERBSA-N Met-Gly-Arg Chemical compound CSCC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCNC(N)=N FYRUJIJAUPHUNB-IUCAKERBSA-N 0.000 description 1
- MYAPQOBHGWJZOM-UWVGGRQHSA-N Met-Gly-Leu Chemical compound CSCC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC(C)C MYAPQOBHGWJZOM-UWVGGRQHSA-N 0.000 description 1
- JACAKCWAOHKQBV-UWVGGRQHSA-N Met-Gly-Lys Chemical compound CSCC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCCN JACAKCWAOHKQBV-UWVGGRQHSA-N 0.000 description 1
- LRALLISKBZNSKN-BQBZGAKWSA-N Met-Gly-Ser Chemical compound CSCC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O LRALLISKBZNSKN-BQBZGAKWSA-N 0.000 description 1
- SXWQMBGNFXAGAT-FJXKBIBVSA-N Met-Gly-Thr Chemical compound CSCC[C@H](N)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O SXWQMBGNFXAGAT-FJXKBIBVSA-N 0.000 description 1
- BCRQJDMZQUHQSV-STQMWFEESA-N Met-Gly-Tyr Chemical compound [H]N[C@@H](CCSC)C(=O)NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O BCRQJDMZQUHQSV-STQMWFEESA-N 0.000 description 1
- WRLYTJVPSUBYST-AVGNSLFASA-N Met-His-Met Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CCSC)C(=O)O)N WRLYTJVPSUBYST-AVGNSLFASA-N 0.000 description 1
- ULLIQRYQNMAAHC-RWMBFGLXSA-N Met-His-Pro Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N2CCC[C@@H]2C(=O)O)N ULLIQRYQNMAAHC-RWMBFGLXSA-N 0.000 description 1
- DJBCKVNHEIJLQA-GMOBBJLQSA-N Met-Ile-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCSC)N DJBCKVNHEIJLQA-GMOBBJLQSA-N 0.000 description 1
- HZLSUXCMSIBCRV-RVMXOQNASA-N Met-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCSC)N HZLSUXCMSIBCRV-RVMXOQNASA-N 0.000 description 1
- HZVXPUHLTZRQEL-UWVGGRQHSA-N Met-Leu-Gly Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O HZVXPUHLTZRQEL-UWVGGRQHSA-N 0.000 description 1
- OCRSGGIJBDUXHU-WDSOQIARSA-N Met-Leu-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CCSC)C(O)=O)=CNC2=C1 OCRSGGIJBDUXHU-WDSOQIARSA-N 0.000 description 1
- RATXDYWHIYNZLE-DCAQKATOSA-N Met-Lys-Cys Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CS)C(=O)O)N RATXDYWHIYNZLE-DCAQKATOSA-N 0.000 description 1
- IILAGWCGKJSBGB-IHRRRGAJSA-N Met-Phe-Asp Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(=O)O)C(=O)O)N IILAGWCGKJSBGB-IHRRRGAJSA-N 0.000 description 1
- GRKPXCKLOOUDFG-UFYCRDLUSA-N Met-Phe-Tyr Chemical compound C([C@H](NC(=O)[C@@H](N)CCSC)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=CC=C1 GRKPXCKLOOUDFG-UFYCRDLUSA-N 0.000 description 1
- VQILILSLEFDECU-GUBZILKMSA-N Met-Pro-Ala Chemical compound [H]N[C@@H](CCSC)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O VQILILSLEFDECU-GUBZILKMSA-N 0.000 description 1
- MPCKIRSXNKACRF-GUBZILKMSA-N Met-Pro-Asn Chemical compound CSCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O MPCKIRSXNKACRF-GUBZILKMSA-N 0.000 description 1
- LUYURUYVNYGKGM-RCWTZXSCSA-N Met-Pro-Thr Chemical compound CSCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O LUYURUYVNYGKGM-RCWTZXSCSA-N 0.000 description 1
- XPVCDCMPKCERFT-GUBZILKMSA-N Met-Ser-Arg Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O XPVCDCMPKCERFT-GUBZILKMSA-N 0.000 description 1
- WRXOPYNEKGZWAZ-FXQIFTODSA-N Met-Ser-Cys Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CS)C(O)=O WRXOPYNEKGZWAZ-FXQIFTODSA-N 0.000 description 1
- LXCSZPUQKMTXNW-BQBZGAKWSA-N Met-Ser-Gly Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O LXCSZPUQKMTXNW-BQBZGAKWSA-N 0.000 description 1
- FIZZULTXMVEIAA-IHRRRGAJSA-N Met-Ser-Phe Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 FIZZULTXMVEIAA-IHRRRGAJSA-N 0.000 description 1
- SOAYQFDWEIWPPR-IHRRRGAJSA-N Met-Ser-Tyr Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O SOAYQFDWEIWPPR-IHRRRGAJSA-N 0.000 description 1
- GGXZOTSDJJTDGB-GUBZILKMSA-N Met-Ser-Val Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O GGXZOTSDJJTDGB-GUBZILKMSA-N 0.000 description 1
- GMMLGMFBYCFCCX-KZVJFYERSA-N Met-Thr-Ala Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O GMMLGMFBYCFCCX-KZVJFYERSA-N 0.000 description 1
- FXBKQTOGURNXSL-HJGDQZAQSA-N Met-Thr-Glu Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCC(O)=O FXBKQTOGURNXSL-HJGDQZAQSA-N 0.000 description 1
- GWADARYJIJDYRC-XGEHTFHBSA-N Met-Thr-Ser Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O GWADARYJIJDYRC-XGEHTFHBSA-N 0.000 description 1
- RKRFGIBULDYDPF-XIRDDKMYSA-N Met-Trp-Gln Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N RKRFGIBULDYDPF-XIRDDKMYSA-N 0.000 description 1
- QZUCCDSNETVAIS-RYQLBKOJSA-N Met-Trp-Pro Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N3CCC[C@@H]3C(=O)O)N QZUCCDSNETVAIS-RYQLBKOJSA-N 0.000 description 1
- VVWQHJUYBPJCNS-UMPQAUOISA-N Met-Trp-Thr Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](N)CCSC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O)=CNC2=C1 VVWQHJUYBPJCNS-UMPQAUOISA-N 0.000 description 1
- HOTNHEUETJELDL-BPNCWPANSA-N Met-Tyr-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CCSC)N HOTNHEUETJELDL-BPNCWPANSA-N 0.000 description 1
- KPVLLNDCBYXKNV-CYDGBPFRSA-N Met-Val-Ile Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KPVLLNDCBYXKNV-CYDGBPFRSA-N 0.000 description 1
- JACMWNXOOUYXCD-JYJNAYRXSA-N Met-Val-Phe Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 JACMWNXOOUYXCD-JYJNAYRXSA-N 0.000 description 1
- OTKQHDPECKUDSB-SZMVWBNQSA-N Met-Val-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCSC)C(O)=O)=CNC2=C1 OTKQHDPECKUDSB-SZMVWBNQSA-N 0.000 description 1
- 108091033773 MiR-155 Proteins 0.000 description 1
- 102000029749 Microtubule Human genes 0.000 description 1
- 108091022875 Microtubule Proteins 0.000 description 1
- 208000034819 Mobility Limitation Diseases 0.000 description 1
- 206010061296 Motor dysfunction Diseases 0.000 description 1
- 206010060860 Neurological symptom Diseases 0.000 description 1
- 101710083785 Non-lysosomal glucosylceramidase Proteins 0.000 description 1
- 206010033661 Pancytopenia Diseases 0.000 description 1
- 206010033799 Paralysis Diseases 0.000 description 1
- 229930182555 Penicillin Natural products 0.000 description 1
- JGSARLDLIJGVTE-MBNYWOFBSA-N Penicillin G Chemical compound N([C@H]1[C@H]2SC([C@@H](N2C1=O)C(O)=O)(C)C)C(=O)CC1=CC=CC=C1 JGSARLDLIJGVTE-MBNYWOFBSA-N 0.000 description 1
- 108010033276 Peptide Fragments Proteins 0.000 description 1
- 102000007079 Peptide Fragments Human genes 0.000 description 1
- WSXKXSBOJXEZDV-DLOVCJGASA-N Phe-Ala-Asn Chemical compound NC(=O)C[C@@H](C([O-])=O)NC(=O)[C@H](C)NC(=O)[C@@H]([NH3+])CC1=CC=CC=C1 WSXKXSBOJXEZDV-DLOVCJGASA-N 0.000 description 1
- LSXGADJXBDFXQU-DLOVCJGASA-N Phe-Ala-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=CC=C1 LSXGADJXBDFXQU-DLOVCJGASA-N 0.000 description 1
- AJOKKVTWEMXZHC-DRZSPHRISA-N Phe-Ala-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=CC=C1 AJOKKVTWEMXZHC-DRZSPHRISA-N 0.000 description 1
- CYZBFPYMSJGBRL-DRZSPHRISA-N Phe-Ala-Glu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O CYZBFPYMSJGBRL-DRZSPHRISA-N 0.000 description 1
- LBSARGIQACMGDF-WBAXXEDZSA-N Phe-Ala-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 LBSARGIQACMGDF-WBAXXEDZSA-N 0.000 description 1
- MPGJIHFJCXTVEX-KKUMJFAQSA-N Phe-Arg-Glu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O MPGJIHFJCXTVEX-KKUMJFAQSA-N 0.000 description 1
- BRDYYVQTEJVRQT-HRCADAONSA-N Phe-Arg-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC2=CC=CC=C2)N)C(=O)O BRDYYVQTEJVRQT-HRCADAONSA-N 0.000 description 1
- HXSUFWQYLPKEHF-IHRRRGAJSA-N Phe-Asn-Arg Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N HXSUFWQYLPKEHF-IHRRRGAJSA-N 0.000 description 1
- KIAWKQJTSGRCSA-AVGNSLFASA-N Phe-Asn-Glu Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N KIAWKQJTSGRCSA-AVGNSLFASA-N 0.000 description 1
- KIEPQOIQHFKQLK-PCBIJLKTSA-N Phe-Asn-Ile Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KIEPQOIQHFKQLK-PCBIJLKTSA-N 0.000 description 1
- LDSOBEJVGGVWGD-DLOVCJGASA-N Phe-Asp-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 LDSOBEJVGGVWGD-DLOVCJGASA-N 0.000 description 1
- WMGVYPPIMZPWPN-SRVKXCTJSA-N Phe-Asp-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N WMGVYPPIMZPWPN-SRVKXCTJSA-N 0.000 description 1
- DDYIRGBOZVKRFR-AVGNSLFASA-N Phe-Asp-Glu Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N DDYIRGBOZVKRFR-AVGNSLFASA-N 0.000 description 1
- CSYVXYQDIVCQNU-QWRGUYRKSA-N Phe-Asp-Gly Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O CSYVXYQDIVCQNU-QWRGUYRKSA-N 0.000 description 1
- QPQDWBAJWOGAMJ-IHPCNDPISA-N Phe-Asp-Trp Chemical compound C([C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(O)=O)C1=CC=CC=C1 QPQDWBAJWOGAMJ-IHPCNDPISA-N 0.000 description 1
- CPTJPDZTFNKFOU-MXAVVETBSA-N Phe-Cys-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CC1=CC=CC=C1)N CPTJPDZTFNKFOU-MXAVVETBSA-N 0.000 description 1
- KOUUGTKGEQZRHV-KKUMJFAQSA-N Phe-Gln-Arg Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O KOUUGTKGEQZRHV-KKUMJFAQSA-N 0.000 description 1
- ZFVWWUILVLLVFA-AVGNSLFASA-N Phe-Gln-Cys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CS)C(=O)O)N ZFVWWUILVLLVFA-AVGNSLFASA-N 0.000 description 1
- NKLDZIPTGKBDBB-HTUGSXCWSA-N Phe-Gln-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC1=CC=CC=C1)N)O NKLDZIPTGKBDBB-HTUGSXCWSA-N 0.000 description 1
- HOYQLNNGMHXZDW-KKUMJFAQSA-N Phe-Glu-Arg Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O HOYQLNNGMHXZDW-KKUMJFAQSA-N 0.000 description 1
- FMMIYCMOVGXZIP-AVGNSLFASA-N Phe-Glu-Asn Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O FMMIYCMOVGXZIP-AVGNSLFASA-N 0.000 description 1
- KYYMILWEGJYPQZ-IHRRRGAJSA-N Phe-Glu-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 KYYMILWEGJYPQZ-IHRRRGAJSA-N 0.000 description 1
- LWPMGKSZPKFKJD-DZKIICNBSA-N Phe-Glu-Val Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O LWPMGKSZPKFKJD-DZKIICNBSA-N 0.000 description 1
- XEXSSIBQYNKFBX-KBPBESRZSA-N Phe-Gly-His Chemical compound C([C@H](N)C(=O)NCC(=O)N[C@@H](CC=1N=CNC=1)C(O)=O)C1=CC=CC=C1 XEXSSIBQYNKFBX-KBPBESRZSA-N 0.000 description 1
- QPVFUAUFEBPIPT-CDMKHQONSA-N Phe-Gly-Thr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O QPVFUAUFEBPIPT-CDMKHQONSA-N 0.000 description 1
- BEEVXUYVEHXWRQ-YESZJQIVSA-N Phe-His-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CN=CN2)NC(=O)[C@H](CC3=CC=CC=C3)N)C(=O)O BEEVXUYVEHXWRQ-YESZJQIVSA-N 0.000 description 1
- ONORAGIFHNAADN-LLLHUVSDSA-N Phe-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CC=CC=C2)N ONORAGIFHNAADN-LLLHUVSDSA-N 0.000 description 1
- XMQSOOJRRVEHRO-ULQDDVLXSA-N Phe-Leu-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 XMQSOOJRRVEHRO-ULQDDVLXSA-N 0.000 description 1
- SMFGCTXUBWEPKM-KBPBESRZSA-N Phe-Leu-Gly Chemical compound OC(=O)CNC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 SMFGCTXUBWEPKM-KBPBESRZSA-N 0.000 description 1
- LRBSWBVUCLLRLU-BZSNNMDCSA-N Phe-Leu-Lys Chemical compound CC(C)C[C@H](NC(=O)[C@@H](N)Cc1ccccc1)C(=O)N[C@@H](CCCCN)C(O)=O LRBSWBVUCLLRLU-BZSNNMDCSA-N 0.000 description 1
- INHMISZWLJZQGH-ULQDDVLXSA-N Phe-Leu-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 INHMISZWLJZQGH-ULQDDVLXSA-N 0.000 description 1
- MJAYDXWQQUOURZ-JYJNAYRXSA-N Phe-Lys-Gln Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(O)=O MJAYDXWQQUOURZ-JYJNAYRXSA-N 0.000 description 1
- DOXQMJCSSYZSNM-BZSNNMDCSA-N Phe-Lys-Leu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O DOXQMJCSSYZSNM-BZSNNMDCSA-N 0.000 description 1
- BNRFQGLWLQESBG-YESZJQIVSA-N Phe-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)[C@H](CC2=CC=CC=C2)N)C(=O)O BNRFQGLWLQESBG-YESZJQIVSA-N 0.000 description 1
- GPSMLZQVIIYLDK-ULQDDVLXSA-N Phe-Lys-Val Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O GPSMLZQVIIYLDK-ULQDDVLXSA-N 0.000 description 1
- SZYBZVANEAOIPE-UBHSHLNASA-N Phe-Met-Ala Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C)C(O)=O SZYBZVANEAOIPE-UBHSHLNASA-N 0.000 description 1
- QRUOLOPKCOEZKU-HJWJTTGWSA-N Phe-Met-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](CC1=CC=CC=C1)N QRUOLOPKCOEZKU-HJWJTTGWSA-N 0.000 description 1
- FQUUYTNBMIBOHS-IHRRRGAJSA-N Phe-Met-Ser Chemical compound CSCC[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N FQUUYTNBMIBOHS-IHRRRGAJSA-N 0.000 description 1
- WKLMCMXFMQEKCX-SLFFLAALSA-N Phe-Phe-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)NC(=O)[C@H](CC3=CC=CC=C3)N)C(=O)O WKLMCMXFMQEKCX-SLFFLAALSA-N 0.000 description 1
- GPLWGAYGROGDEN-BZSNNMDCSA-N Phe-Phe-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O GPLWGAYGROGDEN-BZSNNMDCSA-N 0.000 description 1
- CZQZSMJXFGGBHM-KKUMJFAQSA-N Phe-Pro-Gln Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O CZQZSMJXFGGBHM-KKUMJFAQSA-N 0.000 description 1
- FZBGMXYQPACKNC-HJWJTTGWSA-N Phe-Pro-Ile Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(O)=O FZBGMXYQPACKNC-HJWJTTGWSA-N 0.000 description 1
- ZJPGOXWRFNKIQL-JYJNAYRXSA-N Phe-Pro-Pro Chemical compound C([C@H](N)C(=O)N1[C@@H](CCC1)C(=O)N1[C@@H](CCC1)C(O)=O)C1=CC=CC=C1 ZJPGOXWRFNKIQL-JYJNAYRXSA-N 0.000 description 1
- NJJBATPLUQHRBM-IHRRRGAJSA-N Phe-Pro-Ser Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)N)C(=O)N[C@@H](CO)C(=O)O NJJBATPLUQHRBM-IHRRRGAJSA-N 0.000 description 1
- AFNJAQVMTIQTCB-DLOVCJGASA-N Phe-Ser-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CC=CC=C1 AFNJAQVMTIQTCB-DLOVCJGASA-N 0.000 description 1
- JXQVYPWVGUOIDV-MXAVVETBSA-N Phe-Ser-Ile Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JXQVYPWVGUOIDV-MXAVVETBSA-N 0.000 description 1
- UNBFGVQVQGXXCK-KKUMJFAQSA-N Phe-Ser-Leu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O UNBFGVQVQGXXCK-KKUMJFAQSA-N 0.000 description 1
- GLJZDMZJHFXJQG-BZSNNMDCSA-N Phe-Ser-Phe Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O GLJZDMZJHFXJQG-BZSNNMDCSA-N 0.000 description 1
- QSWKNJAPHQDAAS-MELADBBJSA-N Phe-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CC2=CC=CC=C2)N)C(=O)O QSWKNJAPHQDAAS-MELADBBJSA-N 0.000 description 1
- IAOZOFPONWDXNT-IXOXFDKPSA-N Phe-Ser-Thr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O IAOZOFPONWDXNT-IXOXFDKPSA-N 0.000 description 1
- MVIJMIZJPHQGEN-IHRRRGAJSA-N Phe-Ser-Val Chemical compound CC(C)[C@@H](C([O-])=O)NC(=O)[C@H](CO)NC(=O)[C@@H]([NH3+])CC1=CC=CC=C1 MVIJMIZJPHQGEN-IHRRRGAJSA-N 0.000 description 1
- XNQMZHLAYFWSGJ-HTUGSXCWSA-N Phe-Thr-Glu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O XNQMZHLAYFWSGJ-HTUGSXCWSA-N 0.000 description 1
- BSKMOCNNLNDIMU-CDMKHQONSA-N Phe-Thr-Gly Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O BSKMOCNNLNDIMU-CDMKHQONSA-N 0.000 description 1
- MSSXKZBDKZAHCX-UNQGMJICSA-N Phe-Thr-Val Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O MSSXKZBDKZAHCX-UNQGMJICSA-N 0.000 description 1
- BPIFSOUEUYDJRM-DCPHZVHLSA-N Phe-Trp-Ala Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(=O)N[C@@H](C)C(O)=O)C1=CC=CC=C1 BPIFSOUEUYDJRM-DCPHZVHLSA-N 0.000 description 1
- YCEWAVIRWNGGSS-NQCBNZPSSA-N Phe-Trp-Ile Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O)C1=CC=CC=C1 YCEWAVIRWNGGSS-NQCBNZPSSA-N 0.000 description 1
- LKRUQZQZMXMKEQ-SFJXLCSZSA-N Phe-Trp-Thr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LKRUQZQZMXMKEQ-SFJXLCSZSA-N 0.000 description 1
- QTDBZORPVYTRJU-KKXDTOCCSA-N Phe-Tyr-Ala Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C)C(O)=O QTDBZORPVYTRJU-KKXDTOCCSA-N 0.000 description 1
- BAONJAHBAUDJKA-BZSNNMDCSA-N Phe-Tyr-Asp Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)N[C@@H](CC(O)=O)C(O)=O)C1=CC=CC=C1 BAONJAHBAUDJKA-BZSNNMDCSA-N 0.000 description 1
- CVAUVSOFHJKCHN-BZSNNMDCSA-N Phe-Tyr-Cys Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)N[C@@H](CS)C(O)=O)C1=CC=CC=C1 CVAUVSOFHJKCHN-BZSNNMDCSA-N 0.000 description 1
- JSGWNFKWZNPDAV-YDHLFZDLSA-N Phe-Val-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 JSGWNFKWZNPDAV-YDHLFZDLSA-N 0.000 description 1
- KUSYCSMTTHSZOA-DZKIICNBSA-N Phe-Val-Gln Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N KUSYCSMTTHSZOA-DZKIICNBSA-N 0.000 description 1
- XALFIVXGQUEGKV-JSGCOSHPSA-N Phe-Val-Gly Chemical compound OC(=O)CNC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 XALFIVXGQUEGKV-JSGCOSHPSA-N 0.000 description 1
- IEIFEYBAYFSRBQ-IHRRRGAJSA-N Phe-Val-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N IEIFEYBAYFSRBQ-IHRRRGAJSA-N 0.000 description 1
- APKRGYLBSCWJJP-FXQIFTODSA-N Pro-Ala-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(O)=O APKRGYLBSCWJJP-FXQIFTODSA-N 0.000 description 1
- AJLVKXCNXIJHDV-CIUDSAMLSA-N Pro-Ala-Gln Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(O)=O AJLVKXCNXIJHDV-CIUDSAMLSA-N 0.000 description 1
- FYQSMXKJYTZYRP-DCAQKATOSA-N Pro-Ala-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1 FYQSMXKJYTZYRP-DCAQKATOSA-N 0.000 description 1
- ICTZKEXYDDZZFP-SRVKXCTJSA-N Pro-Arg-Pro Chemical compound N([C@@H](CCCN=C(N)N)C(=O)N1[C@@H](CCC1)C(O)=O)C(=O)[C@@H]1CCCN1 ICTZKEXYDDZZFP-SRVKXCTJSA-N 0.000 description 1
- ZSKJPKFTPQCPIH-RCWTZXSCSA-N Pro-Arg-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ZSKJPKFTPQCPIH-RCWTZXSCSA-N 0.000 description 1
- ORPZXBQTEHINPB-SRVKXCTJSA-N Pro-Arg-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CCCNC(N)=N)NC(=O)[C@@H]1CCCN1)C(O)=O ORPZXBQTEHINPB-SRVKXCTJSA-N 0.000 description 1
- XROLYVMNVIKVEM-BQBZGAKWSA-N Pro-Asn-Gly Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O XROLYVMNVIKVEM-BQBZGAKWSA-N 0.000 description 1
- AMBLXEMWFARNNQ-DCAQKATOSA-N Pro-Asn-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@@H]1CCCN1 AMBLXEMWFARNNQ-DCAQKATOSA-N 0.000 description 1
- KQCCDMFIALWGTL-GUBZILKMSA-N Pro-Asn-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H]1CCCN1 KQCCDMFIALWGTL-GUBZILKMSA-N 0.000 description 1
- TXPUNZXZDVJUJQ-LPEHRKFASA-N Pro-Asn-Pro Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC(=O)N)C(=O)N2CCC[C@@H]2C(=O)O TXPUNZXZDVJUJQ-LPEHRKFASA-N 0.000 description 1
- SWXSLPHTJVAWDF-VEVYYDQMSA-N Pro-Asn-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SWXSLPHTJVAWDF-VEVYYDQMSA-N 0.000 description 1
- AHXPYZRZRMQOAU-QXEWZRGKSA-N Pro-Asn-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H]1CCCN1)C(O)=O AHXPYZRZRMQOAU-QXEWZRGKSA-N 0.000 description 1
- UTAUEDINXUMHLG-FXQIFTODSA-N Pro-Asp-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@@H]1CCCN1 UTAUEDINXUMHLG-FXQIFTODSA-N 0.000 description 1
- NGNNPLJHUFCOMZ-FXQIFTODSA-N Pro-Asp-Cys Chemical compound SC[C@@H](C(O)=O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@@H]1CCCN1 NGNNPLJHUFCOMZ-FXQIFTODSA-N 0.000 description 1
- WPQKSRHDTMRSJM-CIUDSAMLSA-N Pro-Asp-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H]1CCCN1 WPQKSRHDTMRSJM-CIUDSAMLSA-N 0.000 description 1
- SGCZFWSQERRKBD-BQBZGAKWSA-N Pro-Asp-Gly Chemical compound OC(=O)CNC(=O)[C@H](CC(O)=O)NC(=O)[C@@H]1CCCN1 SGCZFWSQERRKBD-BQBZGAKWSA-N 0.000 description 1
- GDXZRWYXJSGWIV-GMOBBJLQSA-N Pro-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@@H]1CCCN1 GDXZRWYXJSGWIV-GMOBBJLQSA-N 0.000 description 1
- XKHCJJPNXFBADI-DCAQKATOSA-N Pro-Asp-Lys Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O XKHCJJPNXFBADI-DCAQKATOSA-N 0.000 description 1
- AIZVVCMAFRREQS-GUBZILKMSA-N Pro-Cys-Arg Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CS)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O AIZVVCMAFRREQS-GUBZILKMSA-N 0.000 description 1
- QXNSKJLSLYCTMT-FXQIFTODSA-N Pro-Cys-Asp Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(=O)O)C(=O)O QXNSKJLSLYCTMT-FXQIFTODSA-N 0.000 description 1
- DIZLUAZLNDFDPR-CIUDSAMLSA-N Pro-Cys-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CS)NC(=O)[C@@H]1CCCN1 DIZLUAZLNDFDPR-CIUDSAMLSA-N 0.000 description 1
- GQLOZEMWEBDEAY-NAKRPEOUSA-N Pro-Cys-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CS)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O GQLOZEMWEBDEAY-NAKRPEOUSA-N 0.000 description 1
- NOXSEHJOXCWRHK-DCAQKATOSA-N Pro-Cys-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@@H]1CCCN1 NOXSEHJOXCWRHK-DCAQKATOSA-N 0.000 description 1
- DIFXZGPHVCIVSQ-CIUDSAMLSA-N Pro-Gln-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O DIFXZGPHVCIVSQ-CIUDSAMLSA-N 0.000 description 1
- KTFZQPLSPLWLKN-KKUMJFAQSA-N Pro-Gln-Tyr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O KTFZQPLSPLWLKN-KKUMJFAQSA-N 0.000 description 1
- UAYHMOIGIQZLFR-NHCYSSNCSA-N Pro-Gln-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O UAYHMOIGIQZLFR-NHCYSSNCSA-N 0.000 description 1
- LHALYDBUDCWMDY-CIUDSAMLSA-N Pro-Glu-Ala Chemical compound C[C@H](NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H]1CCCN1)C(O)=O LHALYDBUDCWMDY-CIUDSAMLSA-N 0.000 description 1
- PULPZRAHVFBVTO-DCAQKATOSA-N Pro-Glu-Arg Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O PULPZRAHVFBVTO-DCAQKATOSA-N 0.000 description 1
- VPFGPKIWSDVTOY-SRVKXCTJSA-N Pro-Glu-His Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O VPFGPKIWSDVTOY-SRVKXCTJSA-N 0.000 description 1
- NXEYSLRNNPWCRN-SRVKXCTJSA-N Pro-Glu-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O NXEYSLRNNPWCRN-SRVKXCTJSA-N 0.000 description 1
- VPEVBAUSTBWQHN-NHCYSSNCSA-N Pro-Glu-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O VPEVBAUSTBWQHN-NHCYSSNCSA-N 0.000 description 1
- HAAQQNHQZBOWFO-LURJTMIESA-N Pro-Gly-Gly Chemical compound OC(=O)CNC(=O)CNC(=O)[C@@H]1CCCN1 HAAQQNHQZBOWFO-LURJTMIESA-N 0.000 description 1
- WSRWHZRUOCACLJ-UWVGGRQHSA-N Pro-Gly-His Chemical compound C([C@@H](C(=O)O)NC(=O)CNC(=O)[C@H]1NCCC1)C1=CN=CN1 WSRWHZRUOCACLJ-UWVGGRQHSA-N 0.000 description 1
- FEVDNIBDCRKMER-IUCAKERBSA-N Pro-Gly-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)CNC(=O)[C@@H]1CCCN1 FEVDNIBDCRKMER-IUCAKERBSA-N 0.000 description 1
- FFSLAIOXRMOFIZ-GJZGRUSLSA-N Pro-Gly-Trp Chemical compound N([C@@H](CC=1C2=CC=CC=C2NC=1)C(=O)O)C(=O)CNC(=O)[C@@H]1CCCN1 FFSLAIOXRMOFIZ-GJZGRUSLSA-N 0.000 description 1
- XQHGISDMVBTGAL-ULQDDVLXSA-N Pro-His-Phe Chemical compound C([C@@H](C(=O)[O-])NC(=O)[C@H](CC=1NC=NC=1)NC(=O)[C@H]1[NH2+]CCC1)C1=CC=CC=C1 XQHGISDMVBTGAL-ULQDDVLXSA-N 0.000 description 1
- XYHMFGGWNOFUOU-QXEWZRGKSA-N Pro-Ile-Gly Chemical compound OC(=O)CNC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H]1CCCN1 XYHMFGGWNOFUOU-QXEWZRGKSA-N 0.000 description 1
- LXLFEIHKWGHJJB-XUXIUFHCSA-N Pro-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@@H]1CCCN1 LXLFEIHKWGHJJB-XUXIUFHCSA-N 0.000 description 1
- KLSOMAFWRISSNI-OSUNSFLBSA-N Pro-Ile-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H]1CCCN1 KLSOMAFWRISSNI-OSUNSFLBSA-N 0.000 description 1
- ZTMLZUNPFDGPKY-VKOGCVSHSA-N Pro-Ile-Trp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@@H]3CCCN3 ZTMLZUNPFDGPKY-VKOGCVSHSA-N 0.000 description 1
- CLJLVCYFABNTHP-DCAQKATOSA-N Pro-Leu-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O CLJLVCYFABNTHP-DCAQKATOSA-N 0.000 description 1
- HATVCTYBNCNMAA-AVGNSLFASA-N Pro-Leu-Met Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(O)=O HATVCTYBNCNMAA-AVGNSLFASA-N 0.000 description 1
- VTFXTWDFPTWNJY-RHYQMDGZSA-N Pro-Leu-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O VTFXTWDFPTWNJY-RHYQMDGZSA-N 0.000 description 1
- OFGUOWQVEGTVNU-DCAQKATOSA-N Pro-Lys-Ala Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O OFGUOWQVEGTVNU-DCAQKATOSA-N 0.000 description 1
- YAZNFQUKPUASKB-DCAQKATOSA-N Pro-Lys-Cys Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CS)C(=O)O YAZNFQUKPUASKB-DCAQKATOSA-N 0.000 description 1
- RMODQFBNDDENCP-IHRRRGAJSA-N Pro-Lys-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O RMODQFBNDDENCP-IHRRRGAJSA-N 0.000 description 1
- HBBBLSVBQGZKOZ-GUBZILKMSA-N Pro-Met-Ala Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C)C(O)=O HBBBLSVBQGZKOZ-GUBZILKMSA-N 0.000 description 1
- APIAILHCTSBGLU-JYJNAYRXSA-N Pro-Met-Phe Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@@H]2CCCN2 APIAILHCTSBGLU-JYJNAYRXSA-N 0.000 description 1
- ANESFYPBAJPYNJ-SDDRHHMPSA-N Pro-Met-Pro Chemical compound CSCC[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@@H]2CCCN2 ANESFYPBAJPYNJ-SDDRHHMPSA-N 0.000 description 1
- WLJYLAQSUSIQNH-GUBZILKMSA-N Pro-Met-Ser Chemical compound CSCC[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@@H]1CCCN1 WLJYLAQSUSIQNH-GUBZILKMSA-N 0.000 description 1
- MLKVIVZCFYRTIR-KKUMJFAQSA-N Pro-Phe-Gln Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O MLKVIVZCFYRTIR-KKUMJFAQSA-N 0.000 description 1
- GNADVDLLGVSXLS-ULQDDVLXSA-N Pro-Phe-His Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CNC=N1)C(O)=O GNADVDLLGVSXLS-ULQDDVLXSA-N 0.000 description 1
- WHNJMTHJGCEKGA-ULQDDVLXSA-N Pro-Phe-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O WHNJMTHJGCEKGA-ULQDDVLXSA-N 0.000 description 1
- ZVEQWRWMRFIVSD-HRCADAONSA-N Pro-Phe-Pro Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC2=CC=CC=C2)C(=O)N3CCC[C@@H]3C(=O)O ZVEQWRWMRFIVSD-HRCADAONSA-N 0.000 description 1
- SPLBRAKYXGOFSO-UNQGMJICSA-N Pro-Phe-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@@H]2CCCN2)O SPLBRAKYXGOFSO-UNQGMJICSA-N 0.000 description 1
- HWLKHNDRXWTFTN-GUBZILKMSA-N Pro-Pro-Cys Chemical compound C1C[C@H](NC1)C(=O)N2CCC[C@H]2C(=O)N[C@@H](CS)C(=O)O HWLKHNDRXWTFTN-GUBZILKMSA-N 0.000 description 1
- PCWLNNZTBJTZRN-AVGNSLFASA-N Pro-Pro-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 PCWLNNZTBJTZRN-AVGNSLFASA-N 0.000 description 1
- FDMKYQQYJKYCLV-GUBZILKMSA-N Pro-Pro-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 FDMKYQQYJKYCLV-GUBZILKMSA-N 0.000 description 1
- RTQKBZIRDWZLDF-BZSNNMDCSA-N Pro-Pro-Trp Chemical compound C([C@H]1C(=O)N[C@@H](CC=2C3=CC=CC=C3NC=2)C(=O)O)CCN1C(=O)[C@@H]1CCCN1 RTQKBZIRDWZLDF-BZSNNMDCSA-N 0.000 description 1
- SEZGGSHLMROBFX-CIUDSAMLSA-N Pro-Ser-Gln Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(O)=O SEZGGSHLMROBFX-CIUDSAMLSA-N 0.000 description 1
- FNGOXVQBBCMFKV-CIUDSAMLSA-N Pro-Ser-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(O)=O FNGOXVQBBCMFKV-CIUDSAMLSA-N 0.000 description 1
- BGWKULMLUIUPKY-BQBZGAKWSA-N Pro-Ser-Gly Chemical compound OC(=O)CNC(=O)[C@H](CO)NC(=O)[C@@H]1CCCN1 BGWKULMLUIUPKY-BQBZGAKWSA-N 0.000 description 1
- LNICFEXCAHIJOR-DCAQKATOSA-N Pro-Ser-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O LNICFEXCAHIJOR-DCAQKATOSA-N 0.000 description 1
- SNGZLPOXVRTNMB-LPEHRKFASA-N Pro-Ser-Pro Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CO)C(=O)N2CCC[C@@H]2C(=O)O SNGZLPOXVRTNMB-LPEHRKFASA-N 0.000 description 1
- KWMZPPWYBVZIER-XGEHTFHBSA-N Pro-Ser-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KWMZPPWYBVZIER-XGEHTFHBSA-N 0.000 description 1
- QUBVFEANYYWBTM-VEVYYDQMSA-N Pro-Thr-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(O)=O QUBVFEANYYWBTM-VEVYYDQMSA-N 0.000 description 1
- HRIXMVRZRGFKNQ-HJGDQZAQSA-N Pro-Thr-Gln Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(O)=O HRIXMVRZRGFKNQ-HJGDQZAQSA-N 0.000 description 1
- DCHQYSOGURGJST-FJXKBIBVSA-N Pro-Thr-Gly Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O DCHQYSOGURGJST-FJXKBIBVSA-N 0.000 description 1
- JDJMFMVVJHLWDP-UNQGMJICSA-N Pro-Thr-Phe Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O JDJMFMVVJHLWDP-UNQGMJICSA-N 0.000 description 1
- CNUIHOAISPKQPY-HSHDSVGOSA-N Pro-Thr-Trp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O CNUIHOAISPKQPY-HSHDSVGOSA-N 0.000 description 1
- DMNANGOFEUVBRV-GJZGRUSLSA-N Pro-Trp-Gly Chemical compound N([C@@H](CC=1C2=CC=CC=C2NC=1)C(=O)NCC(=O)O)C(=O)[C@@H]1CCCN1 DMNANGOFEUVBRV-GJZGRUSLSA-N 0.000 description 1
- OFSZYRZOUMNCCU-BZSNNMDCSA-N Pro-Trp-Met Chemical compound N([C@@H](CC=1C2=CC=CC=C2NC=1)C(=O)N[C@@H](CCSC)C(O)=O)C(=O)[C@@H]1CCCN1 OFSZYRZOUMNCCU-BZSNNMDCSA-N 0.000 description 1
- VPBQDHMASPJHGY-JYJNAYRXSA-N Pro-Trp-Ser Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC2=CNC3=CC=CC=C32)C(=O)N[C@@H](CO)C(=O)O VPBQDHMASPJHGY-JYJNAYRXSA-N 0.000 description 1
- QMABBZHZMDXHKU-FKBYEOEOSA-N Pro-Tyr-Trp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O QMABBZHZMDXHKU-FKBYEOEOSA-N 0.000 description 1
- KHRLUIPIMIQFGT-AVGNSLFASA-N Pro-Val-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O KHRLUIPIMIQFGT-AVGNSLFASA-N 0.000 description 1
- XRGIDCGRSSWCKE-SRVKXCTJSA-N Pro-Val-Met Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCSC)C(O)=O XRGIDCGRSSWCKE-SRVKXCTJSA-N 0.000 description 1
- FIODMZKLZFLYQP-GUBZILKMSA-N Pro-Val-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O FIODMZKLZFLYQP-GUBZILKMSA-N 0.000 description 1
- MTMJNKFZDQEVSY-BZSNNMDCSA-N Pro-Val-Trp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O MTMJNKFZDQEVSY-BZSNNMDCSA-N 0.000 description 1
- 229940124158 Protease/peptidase inhibitor Drugs 0.000 description 1
- 108010003201 RGH 0205 Proteins 0.000 description 1
- 108091034057 RNA (poly(A)) Proteins 0.000 description 1
- 102000009572 RNA Polymerase II Human genes 0.000 description 1
- 108010009460 RNA Polymerase II Proteins 0.000 description 1
- 108091008103 RNA aptamers Proteins 0.000 description 1
- 238000012228 RNA interference-mediated gene silencing Methods 0.000 description 1
- 108010025216 RVF peptide Proteins 0.000 description 1
- 108020004511 Recombinant DNA Proteins 0.000 description 1
- 108010000605 Ribosomal Proteins Proteins 0.000 description 1
- 102000002278 Ribosomal Proteins Human genes 0.000 description 1
- 241000283984 Rodentia Species 0.000 description 1
- 238000010818 SYBR green PCR Master Mix Methods 0.000 description 1
- 206010053694 Saccadic eye movement Diseases 0.000 description 1
- 108091081021 Sense strand Proteins 0.000 description 1
- LVVBAKCGXXUHFO-ZLUOBGJFSA-N Ser-Ala-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(O)=O LVVBAKCGXXUHFO-ZLUOBGJFSA-N 0.000 description 1
- DWUIECHTAMYEFL-XVYDVKMFSA-N Ser-Ala-His Chemical compound OC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 DWUIECHTAMYEFL-XVYDVKMFSA-N 0.000 description 1
- PZZJMBYSYAKYPK-UWJYBYFXSA-N Ser-Ala-Tyr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O PZZJMBYSYAKYPK-UWJYBYFXSA-N 0.000 description 1
- QEDMOZUJTGEIBF-FXQIFTODSA-N Ser-Arg-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O QEDMOZUJTGEIBF-FXQIFTODSA-N 0.000 description 1
- HQTKVSCNCDLXSX-BQBZGAKWSA-N Ser-Arg-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O HQTKVSCNCDLXSX-BQBZGAKWSA-N 0.000 description 1
- QFBNNYNWKYKVJO-DCAQKATOSA-N Ser-Arg-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CO)CCCN=C(N)N QFBNNYNWKYKVJO-DCAQKATOSA-N 0.000 description 1
- NRCJWSGXMAPYQX-LPEHRKFASA-N Ser-Arg-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CO)N)C(=O)O NRCJWSGXMAPYQX-LPEHRKFASA-N 0.000 description 1
- QVOGDCQNGLBNCR-FXQIFTODSA-N Ser-Arg-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O QVOGDCQNGLBNCR-FXQIFTODSA-N 0.000 description 1
- HZWAHWQZPSXNCB-BPUTZDHNSA-N Ser-Arg-Trp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O HZWAHWQZPSXNCB-BPUTZDHNSA-N 0.000 description 1
- WXUBSIDKNMFAGS-IHRRRGAJSA-N Ser-Arg-Tyr Chemical compound NC(N)=NCCC[C@H](NC(=O)[C@H](CO)N)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 WXUBSIDKNMFAGS-IHRRRGAJSA-N 0.000 description 1
- HBOABDXGTMMDSE-GUBZILKMSA-N Ser-Arg-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(O)=O HBOABDXGTMMDSE-GUBZILKMSA-N 0.000 description 1
- VGNYHOBZJKWRGI-CIUDSAMLSA-N Ser-Asn-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CO VGNYHOBZJKWRGI-CIUDSAMLSA-N 0.000 description 1
- COAHUSQNSVFYBW-FXQIFTODSA-N Ser-Asn-Met Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCSC)C(O)=O COAHUSQNSVFYBW-FXQIFTODSA-N 0.000 description 1
- KAAPNMOKUUPKOE-SRVKXCTJSA-N Ser-Asn-Phe Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 KAAPNMOKUUPKOE-SRVKXCTJSA-N 0.000 description 1
- RDFQNDHEHVSONI-ZLUOBGJFSA-N Ser-Asn-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O RDFQNDHEHVSONI-ZLUOBGJFSA-N 0.000 description 1
- UGJRQLURDVGULT-LKXGYXEUSA-N Ser-Asn-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O UGJRQLURDVGULT-LKXGYXEUSA-N 0.000 description 1
- TYYBJUYSTWJHGO-ZKWXMUAHSA-N Ser-Asn-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O TYYBJUYSTWJHGO-ZKWXMUAHSA-N 0.000 description 1
- SFZKGGOGCNQPJY-CIUDSAMLSA-N Ser-Asp-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CO)N SFZKGGOGCNQPJY-CIUDSAMLSA-N 0.000 description 1
- OLIJLNWFEQEFDM-SRVKXCTJSA-N Ser-Asp-Phe Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 OLIJLNWFEQEFDM-SRVKXCTJSA-N 0.000 description 1
- BTPAWKABYQMKKN-LKXGYXEUSA-N Ser-Asp-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O BTPAWKABYQMKKN-LKXGYXEUSA-N 0.000 description 1
- SWSRFJZZMNLMLY-ZKWXMUAHSA-N Ser-Asp-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O SWSRFJZZMNLMLY-ZKWXMUAHSA-N 0.000 description 1
- KCFKKAQKRZBWJB-ZLUOBGJFSA-N Ser-Cys-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)N[C@@H](C)C(O)=O KCFKKAQKRZBWJB-ZLUOBGJFSA-N 0.000 description 1
- KNCJWSPMTFFJII-ZLUOBGJFSA-N Ser-Cys-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(O)=O)C(O)=O KNCJWSPMTFFJII-ZLUOBGJFSA-N 0.000 description 1
- SNNSYBWPPVAXQW-ZLUOBGJFSA-N Ser-Cys-Cys Chemical compound C([C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CS)C(=O)O)N)O SNNSYBWPPVAXQW-ZLUOBGJFSA-N 0.000 description 1
- CRZRTKAVUUGKEQ-ACZMJKKPSA-N Ser-Gln-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O CRZRTKAVUUGKEQ-ACZMJKKPSA-N 0.000 description 1
- ZOHGLPQGEHSLPD-FXQIFTODSA-N Ser-Gln-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O ZOHGLPQGEHSLPD-FXQIFTODSA-N 0.000 description 1
- YMAWDPHQVABADW-CIUDSAMLSA-N Ser-Gln-Met Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCSC)C(O)=O YMAWDPHQVABADW-CIUDSAMLSA-N 0.000 description 1
- FMDHKPRACUXATF-ACZMJKKPSA-N Ser-Gln-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O FMDHKPRACUXATF-ACZMJKKPSA-N 0.000 description 1
- SQBLRDDJTUJDMV-ACZMJKKPSA-N Ser-Glu-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O SQBLRDDJTUJDMV-ACZMJKKPSA-N 0.000 description 1
- BRGQQXQKPUCUJQ-KBIXCLLPSA-N Ser-Glu-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O BRGQQXQKPUCUJQ-KBIXCLLPSA-N 0.000 description 1
- LALNXSXEYFUUDD-GUBZILKMSA-N Ser-Glu-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O LALNXSXEYFUUDD-GUBZILKMSA-N 0.000 description 1
- GRSLLFZTTLBOQX-CIUDSAMLSA-N Ser-Glu-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CO)N GRSLLFZTTLBOQX-CIUDSAMLSA-N 0.000 description 1
- UFKPDBLKLOBMRH-XHNCKOQMSA-N Ser-Glu-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CO)N)C(=O)O UFKPDBLKLOBMRH-XHNCKOQMSA-N 0.000 description 1
- OHKFXGKHSJKKAL-NRPADANISA-N Ser-Glu-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O OHKFXGKHSJKKAL-NRPADANISA-N 0.000 description 1
- UQFYNFTYDHUIMI-WHFBIAKZSA-N Ser-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)[C@@H](N)CO UQFYNFTYDHUIMI-WHFBIAKZSA-N 0.000 description 1
- SNVIOQXAHVORQM-WDSKDSINSA-N Ser-Gly-Gln Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CCC(N)=O)C(O)=O SNVIOQXAHVORQM-WDSKDSINSA-N 0.000 description 1
- CLKKNZQUQMZDGD-SRVKXCTJSA-N Ser-His-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CO)CC1=CN=CN1 CLKKNZQUQMZDGD-SRVKXCTJSA-N 0.000 description 1
- WEQAYODCJHZSJZ-KKUMJFAQSA-N Ser-His-Tyr Chemical compound C([C@H](NC(=O)[C@H](CO)N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CN=CN1 WEQAYODCJHZSJZ-KKUMJFAQSA-N 0.000 description 1
- DLPXTCTVNDTYGJ-JBDRJPRFSA-N Ser-Ile-Cys Chemical compound OC[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CS)C(O)=O DLPXTCTVNDTYGJ-JBDRJPRFSA-N 0.000 description 1
- DJACUBDEDBZKLQ-KBIXCLLPSA-N Ser-Ile-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(O)=O DJACUBDEDBZKLQ-KBIXCLLPSA-N 0.000 description 1
- HBTCFCHYALPXME-HTFCKZLJSA-N Ser-Ile-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O HBTCFCHYALPXME-HTFCKZLJSA-N 0.000 description 1
- JIPVNVNKXJLFJF-BJDJZHNGSA-N Ser-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CO)N JIPVNVNKXJLFJF-BJDJZHNGSA-N 0.000 description 1
- MOINZPRHJGTCHZ-MMWGEVLESA-N Ser-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CO)N MOINZPRHJGTCHZ-MMWGEVLESA-N 0.000 description 1
- DOSZISJPMCYEHT-NAKRPEOUSA-N Ser-Ile-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O DOSZISJPMCYEHT-NAKRPEOUSA-N 0.000 description 1
- FUMGHWDRRFCKEP-CIUDSAMLSA-N Ser-Leu-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O FUMGHWDRRFCKEP-CIUDSAMLSA-N 0.000 description 1
- QYSFWUIXDFJUDW-DCAQKATOSA-N Ser-Leu-Arg Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O QYSFWUIXDFJUDW-DCAQKATOSA-N 0.000 description 1
- IAORETPTUDBBGV-CIUDSAMLSA-N Ser-Leu-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CO)N IAORETPTUDBBGV-CIUDSAMLSA-N 0.000 description 1
- XNCUYZKGQOCOQH-YUMQZZPRSA-N Ser-Leu-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O XNCUYZKGQOCOQH-YUMQZZPRSA-N 0.000 description 1
- JWOBLHJRDADHLN-KKUMJFAQSA-N Ser-Leu-Tyr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O JWOBLHJRDADHLN-KKUMJFAQSA-N 0.000 description 1
- GZSZPKSBVAOGIE-CIUDSAMLSA-N Ser-Lys-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O GZSZPKSBVAOGIE-CIUDSAMLSA-N 0.000 description 1
- HDBOEVPDIDDEPC-CIUDSAMLSA-N Ser-Lys-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O HDBOEVPDIDDEPC-CIUDSAMLSA-N 0.000 description 1
- JLPMFVAIQHCBDC-CIUDSAMLSA-N Ser-Lys-Cys Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CO)N JLPMFVAIQHCBDC-CIUDSAMLSA-N 0.000 description 1
- GVMUJUPXFQFBBZ-GUBZILKMSA-N Ser-Lys-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O GVMUJUPXFQFBBZ-GUBZILKMSA-N 0.000 description 1
- SRKMDKACHDVPMD-SRVKXCTJSA-N Ser-Lys-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CO)N SRKMDKACHDVPMD-SRVKXCTJSA-N 0.000 description 1
- LRZLZIUXQBIWTB-KATARQTJSA-N Ser-Lys-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LRZLZIUXQBIWTB-KATARQTJSA-N 0.000 description 1
- XNXRTQZTFVMJIJ-DCAQKATOSA-N Ser-Met-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(O)=O XNXRTQZTFVMJIJ-DCAQKATOSA-N 0.000 description 1
- UPLYXVPQLJVWMM-KKUMJFAQSA-N Ser-Phe-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O UPLYXVPQLJVWMM-KKUMJFAQSA-N 0.000 description 1
- RWDVVSKYZBNDCO-MELADBBJSA-N Ser-Phe-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)NC(=O)[C@H](CO)N)C(=O)O RWDVVSKYZBNDCO-MELADBBJSA-N 0.000 description 1
- WBAXJMCUFIXCNI-WDSKDSINSA-N Ser-Pro Chemical compound OC[C@H](N)C(=O)N1CCC[C@H]1C(O)=O WBAXJMCUFIXCNI-WDSKDSINSA-N 0.000 description 1
- ADJDNJCSPNFFPI-FXQIFTODSA-N Ser-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CO ADJDNJCSPNFFPI-FXQIFTODSA-N 0.000 description 1
- BSXKBOUZDAZXHE-CIUDSAMLSA-N Ser-Pro-Glu Chemical compound [H]N[C@@H](CO)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O BSXKBOUZDAZXHE-CIUDSAMLSA-N 0.000 description 1
- GZGFSPWOMUKKCV-NAKRPEOUSA-N Ser-Pro-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CO GZGFSPWOMUKKCV-NAKRPEOUSA-N 0.000 description 1
- DINQYZRMXGWWTG-GUBZILKMSA-N Ser-Pro-Pro Chemical compound OC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 DINQYZRMXGWWTG-GUBZILKMSA-N 0.000 description 1
- HHJFMHQYEAAOBM-ZLUOBGJFSA-N Ser-Ser-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O HHJFMHQYEAAOBM-ZLUOBGJFSA-N 0.000 description 1
- VFWQQZMRKFOGLE-ZLUOBGJFSA-N Ser-Ser-Cys Chemical compound C([C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)O)N)O VFWQQZMRKFOGLE-ZLUOBGJFSA-N 0.000 description 1
- GYDFRTRSSXOZCR-ACZMJKKPSA-N Ser-Ser-Glu Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O GYDFRTRSSXOZCR-ACZMJKKPSA-N 0.000 description 1
- NVNPWELENFJOHH-CIUDSAMLSA-N Ser-Ser-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CO)N NVNPWELENFJOHH-CIUDSAMLSA-N 0.000 description 1
- ILZAUMFXKSIUEF-SRVKXCTJSA-N Ser-Ser-Phe Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 ILZAUMFXKSIUEF-SRVKXCTJSA-N 0.000 description 1
- CUXJENOFJXOSOZ-BIIVOSGPSA-N Ser-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CO)N)C(=O)O CUXJENOFJXOSOZ-BIIVOSGPSA-N 0.000 description 1
- PYTKULIABVRXSC-BWBBJGPYSA-N Ser-Ser-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PYTKULIABVRXSC-BWBBJGPYSA-N 0.000 description 1
- OLKICIBQRVSQMA-SRVKXCTJSA-N Ser-Ser-Tyr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O OLKICIBQRVSQMA-SRVKXCTJSA-N 0.000 description 1
- XJDMUQCLVSCRSJ-VZFHVOOUSA-N Ser-Thr-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O XJDMUQCLVSCRSJ-VZFHVOOUSA-N 0.000 description 1
- RXUOAOOZIWABBW-XGEHTFHBSA-N Ser-Thr-Arg Chemical compound OC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N RXUOAOOZIWABBW-XGEHTFHBSA-N 0.000 description 1
- SOACHCFYJMCMHC-BWBBJGPYSA-N Ser-Thr-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CO)N)O SOACHCFYJMCMHC-BWBBJGPYSA-N 0.000 description 1
- KKKVOZNCLALMPV-XKBZYTNZSA-N Ser-Thr-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O KKKVOZNCLALMPV-XKBZYTNZSA-N 0.000 description 1
- PURRNJBBXDDWLX-ZDLURKLDSA-N Ser-Thr-Gly Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CO)N)O PURRNJBBXDDWLX-ZDLURKLDSA-N 0.000 description 1
- QNBVFKZSSRYNFX-CUJWVEQBSA-N Ser-Thr-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CO)N)O QNBVFKZSSRYNFX-CUJWVEQBSA-N 0.000 description 1
- NADLKBTYNKUJEP-KATARQTJSA-N Ser-Thr-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O NADLKBTYNKUJEP-KATARQTJSA-N 0.000 description 1
- UYLKOSODXYSWMQ-XGEHTFHBSA-N Ser-Thr-Met Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CO)N)O UYLKOSODXYSWMQ-XGEHTFHBSA-N 0.000 description 1
- BDMWLJLPPUCLNV-XGEHTFHBSA-N Ser-Thr-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O BDMWLJLPPUCLNV-XGEHTFHBSA-N 0.000 description 1
- AXKJPUBALUNJEO-UBHSHLNASA-N Ser-Trp-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC(N)=O)C(O)=O AXKJPUBALUNJEO-UBHSHLNASA-N 0.000 description 1
- UQGAAZXSCGWMFU-UBHSHLNASA-N Ser-Trp-Asp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CO)N UQGAAZXSCGWMFU-UBHSHLNASA-N 0.000 description 1
- SDFUZKIAHWRUCS-QEJZJMRPSA-N Ser-Trp-Glu Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CO)N SDFUZKIAHWRUCS-QEJZJMRPSA-N 0.000 description 1
- ZWSZBWAFDZRBNM-UBHSHLNASA-N Ser-Trp-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CO)C(O)=O ZWSZBWAFDZRBNM-UBHSHLNASA-N 0.000 description 1
- QYBRQMLZDDJBSW-AVGNSLFASA-N Ser-Tyr-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O QYBRQMLZDDJBSW-AVGNSLFASA-N 0.000 description 1
- PLQWGQUNUPMNOD-KKUMJFAQSA-N Ser-Tyr-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(O)=O PLQWGQUNUPMNOD-KKUMJFAQSA-N 0.000 description 1
- BIWBTRRBHIEVAH-IHPCNDPISA-N Ser-Tyr-Trp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O BIWBTRRBHIEVAH-IHPCNDPISA-N 0.000 description 1
- HAYADTTXNZFUDM-IHRRRGAJSA-N Ser-Tyr-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(O)=O HAYADTTXNZFUDM-IHRRRGAJSA-N 0.000 description 1
- PMTWIUBUQRGCSB-FXQIFTODSA-N Ser-Val-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O PMTWIUBUQRGCSB-FXQIFTODSA-N 0.000 description 1
- ANOQEBQWIAYIMV-AEJSXWLSSA-N Ser-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CO)N ANOQEBQWIAYIMV-AEJSXWLSSA-N 0.000 description 1
- JGUWRQWULDWNCM-FXQIFTODSA-N Ser-Val-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O JGUWRQWULDWNCM-FXQIFTODSA-N 0.000 description 1
- ODRUTDLAONAVDV-IHRRRGAJSA-N Ser-Val-Tyr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O ODRUTDLAONAVDV-IHRRRGAJSA-N 0.000 description 1
- HSWXBJCBYSWBPT-GUBZILKMSA-N Ser-Val-Val Chemical compound CC(C)[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)CO)C(C)C)C(O)=O HSWXBJCBYSWBPT-GUBZILKMSA-N 0.000 description 1
- 102000006467 TATA-Box Binding Protein Human genes 0.000 description 1
- 108010044281 TATA-Box Binding Protein Proteins 0.000 description 1
- IGROJMCBGRFRGI-YTLHQDLWSA-N Thr-Ala-Ala Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O IGROJMCBGRFRGI-YTLHQDLWSA-N 0.000 description 1
- NJEMRSFGDNECGF-GCJQMDKQSA-N Thr-Ala-Asp Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC(O)=O NJEMRSFGDNECGF-GCJQMDKQSA-N 0.000 description 1
- BSNZTJXVDOINSR-JXUBOQSCSA-N Thr-Ala-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O BSNZTJXVDOINSR-JXUBOQSCSA-N 0.000 description 1
- ZUXQFMVPAYGPFJ-JXUBOQSCSA-N Thr-Ala-Lys Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCCN ZUXQFMVPAYGPFJ-JXUBOQSCSA-N 0.000 description 1
- GFDUZZACIWNMPE-KZVJFYERSA-N Thr-Ala-Met Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCSC)C(O)=O GFDUZZACIWNMPE-KZVJFYERSA-N 0.000 description 1
- TWLMXDWFVNEFFK-FJXKBIBVSA-N Thr-Arg-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O TWLMXDWFVNEFFK-FJXKBIBVSA-N 0.000 description 1
- UNURFMVMXLENAZ-KJEVXHAQSA-N Thr-Arg-Tyr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O UNURFMVMXLENAZ-KJEVXHAQSA-N 0.000 description 1
- SWIKDOUVROTZCW-GCJQMDKQSA-N Thr-Asn-Ala Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](C)C(=O)O)N)O SWIKDOUVROTZCW-GCJQMDKQSA-N 0.000 description 1
- IRKWVRSEQFTGGV-VEVYYDQMSA-N Thr-Asn-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O IRKWVRSEQFTGGV-VEVYYDQMSA-N 0.000 description 1
- CTONFVDJYCAMQM-IUKAMOBKSA-N Thr-Asn-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H]([C@@H](C)O)N CTONFVDJYCAMQM-IUKAMOBKSA-N 0.000 description 1
- LXWZOMSOUAMOIA-JIOCBJNQSA-N Thr-Asn-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N1CCC[C@@H]1C(=O)O)N)O LXWZOMSOUAMOIA-JIOCBJNQSA-N 0.000 description 1
- LMMDEZPNUTZJAY-GCJQMDKQSA-N Thr-Asp-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O LMMDEZPNUTZJAY-GCJQMDKQSA-N 0.000 description 1
- YBXMGKCLOPDEKA-NUMRIWBASA-N Thr-Asp-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O YBXMGKCLOPDEKA-NUMRIWBASA-N 0.000 description 1
- VUKVQVNKIIZBPO-HOUAVDHOSA-N Thr-Asp-Trp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N)O VUKVQVNKIIZBPO-HOUAVDHOSA-N 0.000 description 1
- ODSAPYVQSLDRSR-LKXGYXEUSA-N Thr-Cys-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(N)=O)C(O)=O ODSAPYVQSLDRSR-LKXGYXEUSA-N 0.000 description 1
- KZUJCMPVNXOBAF-LKXGYXEUSA-N Thr-Cys-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(O)=O)C(O)=O KZUJCMPVNXOBAF-LKXGYXEUSA-N 0.000 description 1
- KWQBJOUOSNJDRR-XAVMHZPKSA-N Thr-Cys-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CS)C(=O)N1CCC[C@@H]1C(=O)O)N)O KWQBJOUOSNJDRR-XAVMHZPKSA-N 0.000 description 1
- ZQUKYJOKQBRBCS-GLLZPBPUSA-N Thr-Gln-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N)O ZQUKYJOKQBRBCS-GLLZPBPUSA-N 0.000 description 1
- UHBPFYOQQPFKQR-JHEQGTHGSA-N Thr-Gln-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O UHBPFYOQQPFKQR-JHEQGTHGSA-N 0.000 description 1
- GUZGCDIZVGODML-NKIYYHGXSA-N Thr-Gln-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N)O GUZGCDIZVGODML-NKIYYHGXSA-N 0.000 description 1
- LAFLAXHTDVNVEL-WDCWCFNPSA-N Thr-Gln-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N)O LAFLAXHTDVNVEL-WDCWCFNPSA-N 0.000 description 1
- DIPIPFHFLPTCLK-LOKLDPHHSA-N Thr-Gln-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N1CCC[C@@H]1C(=O)O)N)O DIPIPFHFLPTCLK-LOKLDPHHSA-N 0.000 description 1
- RCEHMXVEMNXRIW-IRIUXVKKSA-N Thr-Gln-Tyr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N)O RCEHMXVEMNXRIW-IRIUXVKKSA-N 0.000 description 1
- JMGJDTNUMAZNLX-RWRJDSDZSA-N Thr-Glu-Ile Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JMGJDTNUMAZNLX-RWRJDSDZSA-N 0.000 description 1
- XOTBWOCSLMBGMF-SUSMZKCASA-N Thr-Glu-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XOTBWOCSLMBGMF-SUSMZKCASA-N 0.000 description 1
- KCRQEJSKXAIULJ-FJXKBIBVSA-N Thr-Gly-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CCCNC(N)=N)C(O)=O KCRQEJSKXAIULJ-FJXKBIBVSA-N 0.000 description 1
- AQAMPXBRJJWPNI-JHEQGTHGSA-N Thr-Gly-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O AQAMPXBRJJWPNI-JHEQGTHGSA-N 0.000 description 1
- XPNSAQMEAVSQRD-FBCQKBJTSA-N Thr-Gly-Gly Chemical compound C[C@@H](O)[C@H](N)C(=O)NCC(=O)NCC(O)=O XPNSAQMEAVSQRD-FBCQKBJTSA-N 0.000 description 1
- IMULJHHGAUZZFE-MBLNEYKQSA-N Thr-Gly-Ile Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H]([C@@H](C)CC)C(O)=O IMULJHHGAUZZFE-MBLNEYKQSA-N 0.000 description 1
- QQWNRERCGGZOKG-WEDXCCLWSA-N Thr-Gly-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O QQWNRERCGGZOKG-WEDXCCLWSA-N 0.000 description 1
- MPUMPERGHHJGRP-WEDXCCLWSA-N Thr-Gly-Lys Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)N[C@@H](CCCCN)C(=O)O)N)O MPUMPERGHHJGRP-WEDXCCLWSA-N 0.000 description 1
- MSIYNSBKKVMGFO-BHNWBGBOSA-N Thr-Gly-Pro Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)N1CCC[C@@H]1C(=O)O)N)O MSIYNSBKKVMGFO-BHNWBGBOSA-N 0.000 description 1
- KBBRNEDOYWMIJP-KYNKHSRBSA-N Thr-Gly-Thr Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(=O)O)N)O KBBRNEDOYWMIJP-KYNKHSRBSA-N 0.000 description 1
- XSTGOZBBXFKGHA-YJRXYDGGSA-N Thr-His-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N)O XSTGOZBBXFKGHA-YJRXYDGGSA-N 0.000 description 1
- KLCCPYZXGXHAGS-QTKMDUPCSA-N Thr-His-Met Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CCSC)C(=O)O)N)O KLCCPYZXGXHAGS-QTKMDUPCSA-N 0.000 description 1
- YUOCMLNTUZAGNF-KLHWPWHYSA-N Thr-His-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N2CCC[C@@H]2C(=O)O)N)O YUOCMLNTUZAGNF-KLHWPWHYSA-N 0.000 description 1
- KRGDDWVBBDLPSJ-CUJWVEQBSA-N Thr-His-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(O)=O KRGDDWVBBDLPSJ-CUJWVEQBSA-N 0.000 description 1
- SXAGUVRFGJSFKC-ZEILLAHLSA-N Thr-His-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SXAGUVRFGJSFKC-ZEILLAHLSA-N 0.000 description 1
- YUPVPKZBKCLFLT-QTKMDUPCSA-N Thr-His-Val Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](C(C)C)C(=O)O)N)O YUPVPKZBKCLFLT-QTKMDUPCSA-N 0.000 description 1
- XOWKUMFHEZLKLT-CIQUZCHMSA-N Thr-Ile-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O XOWKUMFHEZLKLT-CIQUZCHMSA-N 0.000 description 1
- GMXIJHCBTZDAPD-QPHKQPEJSA-N Thr-Ile-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)NC(=O)[C@H]([C@@H](C)O)N GMXIJHCBTZDAPD-QPHKQPEJSA-N 0.000 description 1
- AHOLTQCAVBSUDP-PPCPHDFISA-N Thr-Ile-Lys Chemical compound CC[C@H](C)[C@H](NC(=O)[C@@H](N)[C@@H](C)O)C(=O)N[C@@H](CCCCN)C(O)=O AHOLTQCAVBSUDP-PPCPHDFISA-N 0.000 description 1
- GXUWHVZYDAHFSV-FLBSBUHZSA-N Thr-Ile-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GXUWHVZYDAHFSV-FLBSBUHZSA-N 0.000 description 1
- IHAPJUHCZXBPHR-WZLNRYEVSA-N Thr-Ile-Tyr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)[C@H]([C@@H](C)O)N IHAPJUHCZXBPHR-WZLNRYEVSA-N 0.000 description 1
- RFKVQLIXNVEOMB-WEDXCCLWSA-N Thr-Leu-Gly Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)NCC(=O)O)N)O RFKVQLIXNVEOMB-WEDXCCLWSA-N 0.000 description 1
- HPQHHRLWSAMMKG-KATARQTJSA-N Thr-Lys-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CS)C(=O)O)N)O HPQHHRLWSAMMKG-KATARQTJSA-N 0.000 description 1
- SPVHQURZJCUDQC-VOAKCMCISA-N Thr-Lys-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O SPVHQURZJCUDQC-VOAKCMCISA-N 0.000 description 1
- QNCFWHZVRNXAKW-OEAJRASXSA-N Thr-Lys-Phe Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O QNCFWHZVRNXAKW-OEAJRASXSA-N 0.000 description 1
- JWQNAFHCXKVZKZ-UVOCVTCTSA-N Thr-Lys-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O JWQNAFHCXKVZKZ-UVOCVTCTSA-N 0.000 description 1
- MEBDIIKMUUNBSB-RPTUDFQQSA-N Thr-Phe-Tyr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O MEBDIIKMUUNBSB-RPTUDFQQSA-N 0.000 description 1
- NWECYMJLJGCBOD-UNQGMJICSA-N Thr-Phe-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(O)=O NWECYMJLJGCBOD-UNQGMJICSA-N 0.000 description 1
- XKWABWFMQXMUMT-HJGDQZAQSA-N Thr-Pro-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O XKWABWFMQXMUMT-HJGDQZAQSA-N 0.000 description 1
- OLFOOYQTTQSSRK-UNQGMJICSA-N Thr-Pro-Phe Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 OLFOOYQTTQSSRK-UNQGMJICSA-N 0.000 description 1
- GVMXJJAJLIEASL-ZJDVBMNYSA-N Thr-Pro-Thr Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O GVMXJJAJLIEASL-ZJDVBMNYSA-N 0.000 description 1
- IWAVRIPRTCJAQO-HSHDSVGOSA-N Thr-Pro-Trp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O IWAVRIPRTCJAQO-HSHDSVGOSA-N 0.000 description 1
- YGCDFAJJCRVQKU-RCWTZXSCSA-N Thr-Pro-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)[C@@H](C)O YGCDFAJJCRVQKU-RCWTZXSCSA-N 0.000 description 1
- IVDFVBVIVLJJHR-LKXGYXEUSA-N Thr-Ser-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O IVDFVBVIVLJJHR-LKXGYXEUSA-N 0.000 description 1
- XHWCDRUPDNSDAZ-XKBZYTNZSA-N Thr-Ser-Glu Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N)O XHWCDRUPDNSDAZ-XKBZYTNZSA-N 0.000 description 1
- SGAOHNPSEPVAFP-ZDLURKLDSA-N Thr-Ser-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)NCC(O)=O SGAOHNPSEPVAFP-ZDLURKLDSA-N 0.000 description 1
- NQQMWWVVGIXUOX-SVSWQMSJSA-N Thr-Ser-Ile Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O NQQMWWVVGIXUOX-SVSWQMSJSA-N 0.000 description 1
- IEZVHOULSUULHD-XGEHTFHBSA-N Thr-Ser-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O IEZVHOULSUULHD-XGEHTFHBSA-N 0.000 description 1
- COYHRQWNJDJCNA-NUJDXYNKSA-N Thr-Thr-Thr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O COYHRQWNJDJCNA-NUJDXYNKSA-N 0.000 description 1
- CSZFFQBUTMGHAH-UAXMHLISSA-N Thr-Thr-Trp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N)O CSZFFQBUTMGHAH-UAXMHLISSA-N 0.000 description 1
- LECUEEHKUFYOOV-ZJDVBMNYSA-N Thr-Thr-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)O)NC(=O)[C@@H](N)[C@@H](C)O LECUEEHKUFYOOV-ZJDVBMNYSA-N 0.000 description 1
- FBQHKSPOIAFUEI-OWLDWWDNSA-N Thr-Trp-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](C)C(O)=O FBQHKSPOIAFUEI-OWLDWWDNSA-N 0.000 description 1
- GJOBRAHDRIDAPT-NGTWOADLSA-N Thr-Trp-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@H]([C@@H](C)O)N GJOBRAHDRIDAPT-NGTWOADLSA-N 0.000 description 1
- ZEJBJDHSQPOVJV-UAXMHLISSA-N Thr-Trp-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ZEJBJDHSQPOVJV-UAXMHLISSA-N 0.000 description 1
- LXXCHJKHJYRMIY-FQPOAREZSA-N Thr-Tyr-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C)C(O)=O LXXCHJKHJYRMIY-FQPOAREZSA-N 0.000 description 1
- BEZTUFWTPVOROW-KJEVXHAQSA-N Thr-Tyr-Arg Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N)O BEZTUFWTPVOROW-KJEVXHAQSA-N 0.000 description 1
- CJEHCEOXPLASCK-MEYUZBJRSA-N Thr-Tyr-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)[C@H](O)C)CC1=CC=C(O)C=C1 CJEHCEOXPLASCK-MEYUZBJRSA-N 0.000 description 1
- XVHAUVJXBFGUPC-RPTUDFQQSA-N Thr-Tyr-Phe Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O XVHAUVJXBFGUPC-RPTUDFQQSA-N 0.000 description 1
- OGOYMQWIWHGTGH-KZVJFYERSA-N Thr-Val-Ala Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O OGOYMQWIWHGTGH-KZVJFYERSA-N 0.000 description 1
- QGVBFDIREUUSHX-IFFSRLJSSA-N Thr-Val-Gln Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O QGVBFDIREUUSHX-IFFSRLJSSA-N 0.000 description 1
- CURFABYITJVKEW-QTKMDUPCSA-N Thr-Val-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N)O CURFABYITJVKEW-QTKMDUPCSA-N 0.000 description 1
- BPGDJSUFQKWUBK-KJEVXHAQSA-N Thr-Val-Tyr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 BPGDJSUFQKWUBK-KJEVXHAQSA-N 0.000 description 1
- 108091023040 Transcription factor Proteins 0.000 description 1
- 102000040945 Transcription factor Human genes 0.000 description 1
- GHXXDFDIDHIEIL-WFBYXXMGSA-N Trp-Ala-Cys Chemical compound C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N GHXXDFDIDHIEIL-WFBYXXMGSA-N 0.000 description 1
- FOAJSVIXYCLTSC-PJODQICGSA-N Trp-Ala-Met Chemical compound C[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N FOAJSVIXYCLTSC-PJODQICGSA-N 0.000 description 1
- XNRJFXBORWMIPY-DCPHZVHLSA-N Trp-Ala-Phe Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O XNRJFXBORWMIPY-DCPHZVHLSA-N 0.000 description 1
- AVYVKJMBNLPWRX-WFBYXXMGSA-N Trp-Ala-Ser Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O)=CNC2=C1 AVYVKJMBNLPWRX-WFBYXXMGSA-N 0.000 description 1
- BIJDDZBDSJLWJY-PJODQICGSA-N Trp-Ala-Val Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O BIJDDZBDSJLWJY-PJODQICGSA-N 0.000 description 1
- QNMIVTOQXUSGLN-SZMVWBNQSA-N Trp-Arg-Arg Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O)=CNC2=C1 QNMIVTOQXUSGLN-SZMVWBNQSA-N 0.000 description 1
- IXEGQBJZDIRRIV-QEJZJMRPSA-N Trp-Asn-Glu Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O IXEGQBJZDIRRIV-QEJZJMRPSA-N 0.000 description 1
- MHNHRNHJMXAVHZ-AAEUAGOBSA-N Trp-Asn-Gly Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)NCC(=O)O)N MHNHRNHJMXAVHZ-AAEUAGOBSA-N 0.000 description 1
- ADBFWLXCCKIXBQ-XIRDDKMYSA-N Trp-Asn-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N ADBFWLXCCKIXBQ-XIRDDKMYSA-N 0.000 description 1
- KZTLJLFVOIMRAQ-IHPCNDPISA-N Trp-Asn-Tyr Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O KZTLJLFVOIMRAQ-IHPCNDPISA-N 0.000 description 1
- PXQPYPMSLBQHJJ-WFBYXXMGSA-N Trp-Asp-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N PXQPYPMSLBQHJJ-WFBYXXMGSA-N 0.000 description 1
- RERIQEJUYCLJQI-QRTARXTBSA-N Trp-Asp-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N RERIQEJUYCLJQI-QRTARXTBSA-N 0.000 description 1
- CZWIHKFGHICAJX-BPUTZDHNSA-N Trp-Glu-Glu Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O)=CNC2=C1 CZWIHKFGHICAJX-BPUTZDHNSA-N 0.000 description 1
- VMBBTANKMSRJSS-JSGCOSHPSA-N Trp-Glu-Gly Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O VMBBTANKMSRJSS-JSGCOSHPSA-N 0.000 description 1
- NOFFAYIYPAUNRM-HKUYNNGSSA-N Trp-Gly-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CC2=CNC3=CC=CC=C32)N NOFFAYIYPAUNRM-HKUYNNGSSA-N 0.000 description 1
- WSGPBCAGEGHKQJ-BBRMVZONSA-N Trp-Gly-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CC1=CNC2=CC=CC=C21)N WSGPBCAGEGHKQJ-BBRMVZONSA-N 0.000 description 1
- AIISTODACBDQLW-WDSOQIARSA-N Trp-Leu-Arg Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O)=CNC2=C1 AIISTODACBDQLW-WDSOQIARSA-N 0.000 description 1
- CCZXBOFIBYQLEV-IHPCNDPISA-N Trp-Leu-Leu Chemical compound CC(C)C[C@H](NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)Cc1c[nH]c2ccccc12)C(O)=O CCZXBOFIBYQLEV-IHPCNDPISA-N 0.000 description 1
- YLGQHMHKAASRGJ-WDSOQIARSA-N Trp-Leu-Met Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N YLGQHMHKAASRGJ-WDSOQIARSA-N 0.000 description 1
- WMBFONUKQXGLMU-WDSOQIARSA-N Trp-Leu-Val Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](C(C)C)C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N WMBFONUKQXGLMU-WDSOQIARSA-N 0.000 description 1
- KWTRGSQOQHZKIA-PMVMPFDFSA-N Trp-Lys-Tyr Chemical compound C([C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)CC=1C2=CC=CC=C2NC=1)CCCCN)C(O)=O)C1=CC=C(O)C=C1 KWTRGSQOQHZKIA-PMVMPFDFSA-N 0.000 description 1
- PWPJLBWYRTVYQS-PMVMPFDFSA-N Trp-Phe-Leu Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O PWPJLBWYRTVYQS-PMVMPFDFSA-N 0.000 description 1
- CSOBBJWWODOYGW-ILWGZMRPSA-N Trp-Phe-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)NC(=O)[C@H](CC3=CNC4=CC=CC=C43)N)C(=O)O CSOBBJWWODOYGW-ILWGZMRPSA-N 0.000 description 1
- UHXOYRWHIQZAKV-SZMVWBNQSA-N Trp-Pro-Arg Chemical compound O=C([C@H](CC=1C2=CC=CC=C2NC=1)N)N1CCC[C@H]1C(=O)N[C@@H](CCCN=C(N)N)C(O)=O UHXOYRWHIQZAKV-SZMVWBNQSA-N 0.000 description 1
- BIBZRFIKOLGWFQ-XIRDDKMYSA-N Trp-Pro-Gln Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CC2=CNC3=CC=CC=C32)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O BIBZRFIKOLGWFQ-XIRDDKMYSA-N 0.000 description 1
- XOLLWQIBBLBAHQ-WDSOQIARSA-N Trp-Pro-Leu Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O XOLLWQIBBLBAHQ-WDSOQIARSA-N 0.000 description 1
- XDQGKIMTRSVSBC-WDSOQIARSA-N Trp-Pro-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CC1=CNC2=CC=CC=C12 XDQGKIMTRSVSBC-WDSOQIARSA-N 0.000 description 1
- IKUMWSDCGQVGHC-UMPQAUOISA-N Trp-Pro-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CC2=CNC3=CC=CC=C32)N)O IKUMWSDCGQVGHC-UMPQAUOISA-N 0.000 description 1
- HHPSUFUXXBOFQY-AQZXSJQPSA-N Trp-Thr-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N)O HHPSUFUXXBOFQY-AQZXSJQPSA-N 0.000 description 1
- DDHFMBDACJYSKW-AQZXSJQPSA-N Trp-Thr-Asp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N)O DDHFMBDACJYSKW-AQZXSJQPSA-N 0.000 description 1
- DTPWXZXGFAHEKL-NWLDYVSISA-N Trp-Thr-Glu Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O DTPWXZXGFAHEKL-NWLDYVSISA-N 0.000 description 1
- SEXRBCGSZRCIPE-LYSGOOTNSA-N Trp-Thr-Gly Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N)O SEXRBCGSZRCIPE-LYSGOOTNSA-N 0.000 description 1
- VMXLNDRJXVAJFT-JYBASQMISA-N Trp-Thr-Ser Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N)O VMXLNDRJXVAJFT-JYBASQMISA-N 0.000 description 1
- UIDJDMVRDUANDL-BVSLBCMMSA-N Trp-Tyr-Arg Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O UIDJDMVRDUANDL-BVSLBCMMSA-N 0.000 description 1
- YTHWAWACWGWBLE-MNSWYVGCSA-N Trp-Tyr-Thr Chemical compound C([C@@H](C(=O)N[C@@H]([C@H](O)C)C(O)=O)NC(=O)[C@@H](N)CC=1C2=CC=CC=C2NC=1)C1=CC=C(O)C=C1 YTHWAWACWGWBLE-MNSWYVGCSA-N 0.000 description 1
- UGFOSENEZHEQKX-PJODQICGSA-N Trp-Val-Ala Chemical compound CC(C)[C@H](NC(=O)[C@@H](N)Cc1c[nH]c2ccccc12)C(=O)N[C@@H](C)C(O)=O UGFOSENEZHEQKX-PJODQICGSA-N 0.000 description 1
- RWTFCAMQLFNPTK-UMPQAUOISA-N Trp-Val-Thr Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O)=CNC2=C1 RWTFCAMQLFNPTK-UMPQAUOISA-N 0.000 description 1
- IELISNUVHBKYBX-XDTLVQLUSA-N Tyr-Ala-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 IELISNUVHBKYBX-XDTLVQLUSA-N 0.000 description 1
- DLZKEQQWXODGGZ-KWQFWETISA-N Tyr-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 DLZKEQQWXODGGZ-KWQFWETISA-N 0.000 description 1
- SDNVRAKIJVKAGS-LKTVYLICSA-N Tyr-Ala-His Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)N SDNVRAKIJVKAGS-LKTVYLICSA-N 0.000 description 1
- HKIUVWMZYFBIHG-KKUMJFAQSA-N Tyr-Arg-Gln Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N)O HKIUVWMZYFBIHG-KKUMJFAQSA-N 0.000 description 1
- KDGFPPHLXCEQRN-STECZYCISA-N Tyr-Arg-Ile Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KDGFPPHLXCEQRN-STECZYCISA-N 0.000 description 1
- SGFIXFAHVWJKTD-KJEVXHAQSA-N Tyr-Arg-Thr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SGFIXFAHVWJKTD-KJEVXHAQSA-N 0.000 description 1
- QYSBJAUCUKHSLU-JYJNAYRXSA-N Tyr-Arg-Val Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(O)=O QYSBJAUCUKHSLU-JYJNAYRXSA-N 0.000 description 1
- AYHSJESDFKREAR-KKUMJFAQSA-N Tyr-Asn-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 AYHSJESDFKREAR-KKUMJFAQSA-N 0.000 description 1
- JWHOIHCOHMZSAR-QWRGUYRKSA-N Tyr-Asp-Gly Chemical compound OC(=O)CNC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 JWHOIHCOHMZSAR-QWRGUYRKSA-N 0.000 description 1
- UABYBEBXFFNCIR-YDHLFZDLSA-N Tyr-Asp-Val Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O UABYBEBXFFNCIR-YDHLFZDLSA-N 0.000 description 1
- XKDOQXAXKFQWQJ-SRVKXCTJSA-N Tyr-Cys-Asp Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(=O)O)C(=O)O)N)O XKDOQXAXKFQWQJ-SRVKXCTJSA-N 0.000 description 1
- MOCXXGZHHSPNEJ-AVGNSLFASA-N Tyr-Cys-Glu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCC(O)=O)C(O)=O MOCXXGZHHSPNEJ-AVGNSLFASA-N 0.000 description 1
- YLRLHDFMMWDYTK-KKUMJFAQSA-N Tyr-Cys-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CS)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 YLRLHDFMMWDYTK-KKUMJFAQSA-N 0.000 description 1
- GHUNBABNQPIETG-MELADBBJSA-N Tyr-Cys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CS)NC(=O)[C@H](CC2=CC=C(C=C2)O)N)C(=O)O GHUNBABNQPIETG-MELADBBJSA-N 0.000 description 1
- QHEGAOPHISYNDF-XDTLVQLUSA-N Tyr-Gln-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC1=CC=C(C=C1)O)N QHEGAOPHISYNDF-XDTLVQLUSA-N 0.000 description 1
- QUILOGWWLXMSAT-IHRRRGAJSA-N Tyr-Gln-Gln Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O QUILOGWWLXMSAT-IHRRRGAJSA-N 0.000 description 1
- RYSNTWVRSLCAJZ-RYUDHWBXSA-N Tyr-Gln-Gly Chemical compound OC(=O)CNC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 RYSNTWVRSLCAJZ-RYUDHWBXSA-N 0.000 description 1
- UXUFNBVCPAWACG-SIUGBPQLSA-N Tyr-Gln-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC1=CC=C(C=C1)O)N UXUFNBVCPAWACG-SIUGBPQLSA-N 0.000 description 1
- TWAVEIJGFCBWCG-JYJNAYRXSA-N Tyr-Gln-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC1=CC=C(C=C1)O)N TWAVEIJGFCBWCG-JYJNAYRXSA-N 0.000 description 1
- WAPFQMXRSDEGOE-IHRRRGAJSA-N Tyr-Glu-Gln Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O WAPFQMXRSDEGOE-IHRRRGAJSA-N 0.000 description 1
- LHTGRUZSZOIAKM-SOUVJXGZSA-N Tyr-Glu-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)N)C(=O)O LHTGRUZSZOIAKM-SOUVJXGZSA-N 0.000 description 1
- ZRPLVTZTKPPSBT-AVGNSLFASA-N Tyr-Glu-Ser Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O ZRPLVTZTKPPSBT-AVGNSLFASA-N 0.000 description 1
- OSMTVLSRTQDWHJ-JBACZVJFSA-N Tyr-Glu-Trp Chemical compound C([C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(O)=O)C1=CC=C(O)C=C1 OSMTVLSRTQDWHJ-JBACZVJFSA-N 0.000 description 1
- CNLKDWSAORJEMW-KWQFWETISA-N Tyr-Gly-Ala Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(=O)N[C@@H](C)C(O)=O CNLKDWSAORJEMW-KWQFWETISA-N 0.000 description 1
- NMKJPMCEKQHRPD-IRXDYDNUSA-N Tyr-Gly-Tyr Chemical compound C([C@H](N)C(=O)NCC(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=C(O)C=C1 NMKJPMCEKQHRPD-IRXDYDNUSA-N 0.000 description 1
- KEANSLVUGJADPN-LKTVYLICSA-N Tyr-His-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CC2=CC=C(C=C2)O)N KEANSLVUGJADPN-LKTVYLICSA-N 0.000 description 1
- YYZPVPJCOGGQPC-JYJNAYRXSA-N Tyr-His-Gln Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(N)=O)C(O)=O YYZPVPJCOGGQPC-JYJNAYRXSA-N 0.000 description 1
- ILTXFANLDMJWPR-SIUGBPQLSA-N Tyr-Ile-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N ILTXFANLDMJWPR-SIUGBPQLSA-N 0.000 description 1
- HHFMNAVFGBYSAT-IGISWZIWSA-N Tyr-Ile-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N HHFMNAVFGBYSAT-IGISWZIWSA-N 0.000 description 1
- WSFXJLFSJSXGMQ-MGHWNKPDSA-N Tyr-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N WSFXJLFSJSXGMQ-MGHWNKPDSA-N 0.000 description 1
- QARCDOCCDOLJSF-HJPIBITLSA-N Tyr-Ile-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N QARCDOCCDOLJSF-HJPIBITLSA-N 0.000 description 1
- YMUQBRQQCPQEQN-CXTHYWKRSA-N Tyr-Ile-Tyr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)N YMUQBRQQCPQEQN-CXTHYWKRSA-N 0.000 description 1
- QHLIUFUEUDFAOT-MGHWNKPDSA-N Tyr-Leu-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC1=CC=C(C=C1)O)N QHLIUFUEUDFAOT-MGHWNKPDSA-N 0.000 description 1
- ARJASMXQBRNAGI-YESZJQIVSA-N Tyr-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)N ARJASMXQBRNAGI-YESZJQIVSA-N 0.000 description 1
- DMWNPLOERDAHSY-MEYUZBJRSA-N Tyr-Leu-Thr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O DMWNPLOERDAHSY-MEYUZBJRSA-N 0.000 description 1
- GITNQBVCEQBDQC-KKUMJFAQSA-N Tyr-Lys-Asn Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O GITNQBVCEQBDQC-KKUMJFAQSA-N 0.000 description 1
- PGEFRHBWGOJPJT-KKUMJFAQSA-N Tyr-Lys-Ser Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O PGEFRHBWGOJPJT-KKUMJFAQSA-N 0.000 description 1
- PMHLLBKTDHQMCY-ULQDDVLXSA-N Tyr-Lys-Val Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O PMHLLBKTDHQMCY-ULQDDVLXSA-N 0.000 description 1
- BBSPTGPYIPGTKH-JYJNAYRXSA-N Tyr-Met-Arg Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N BBSPTGPYIPGTKH-JYJNAYRXSA-N 0.000 description 1
- KZOZXAYPVKKDIO-UFYCRDLUSA-N Tyr-Met-Tyr Chemical compound C([C@H](N)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=C(O)C=C1 KZOZXAYPVKKDIO-UFYCRDLUSA-N 0.000 description 1
- IGXLNVIYDYONFB-UFYCRDLUSA-N Tyr-Phe-Arg Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O)C1=CC=C(O)C=C1 IGXLNVIYDYONFB-UFYCRDLUSA-N 0.000 description 1
- BGFCXQXETBDEHP-BZSNNMDCSA-N Tyr-Phe-Asn Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(N)=O)C(O)=O BGFCXQXETBDEHP-BZSNNMDCSA-N 0.000 description 1
- LMKKMCGTDANZTR-BZSNNMDCSA-N Tyr-Phe-Asp Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(=O)N[C@@H](CC(O)=O)C(O)=O)C1=CC=C(O)C=C1 LMKKMCGTDANZTR-BZSNNMDCSA-N 0.000 description 1
- OKDNSNWJEXAMSU-IRXDYDNUSA-N Tyr-Phe-Gly Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(=O)NCC(O)=O)C1=CC=C(O)C=C1 OKDNSNWJEXAMSU-IRXDYDNUSA-N 0.000 description 1
- JXGUUJMPCRXMSO-HJOGWXRNSA-N Tyr-Phe-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=C(O)C=C1 JXGUUJMPCRXMSO-HJOGWXRNSA-N 0.000 description 1
- VBFVQTPETKJCQW-RPTUDFQQSA-N Tyr-Phe-Thr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O VBFVQTPETKJCQW-RPTUDFQQSA-N 0.000 description 1
- XJPXTYLVMUZGNW-IHRRRGAJSA-N Tyr-Pro-Asp Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O XJPXTYLVMUZGNW-IHRRRGAJSA-N 0.000 description 1
- QKXAEWMHAAVVGS-KKUMJFAQSA-N Tyr-Pro-Glu Chemical compound N[C@@H](Cc1ccc(O)cc1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O QKXAEWMHAAVVGS-KKUMJFAQSA-N 0.000 description 1
- BIWVVOHTKDLRMP-ULQDDVLXSA-N Tyr-Pro-Leu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O BIWVVOHTKDLRMP-ULQDDVLXSA-N 0.000 description 1
- GQVZBMROTPEPIF-SRVKXCTJSA-N Tyr-Ser-Asp Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O GQVZBMROTPEPIF-SRVKXCTJSA-N 0.000 description 1
- KWKJGBHDYJOVCR-SRVKXCTJSA-N Tyr-Ser-Cys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)O)N)O KWKJGBHDYJOVCR-SRVKXCTJSA-N 0.000 description 1
- ZPFLBLFITJCBTP-QWRGUYRKSA-N Tyr-Ser-Gly Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(=O)NCC(O)=O ZPFLBLFITJCBTP-QWRGUYRKSA-N 0.000 description 1
- SYFHQHYTNCQCCN-MELADBBJSA-N Tyr-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CC2=CC=C(C=C2)O)N)C(=O)O SYFHQHYTNCQCCN-MELADBBJSA-N 0.000 description 1
- LUMQYLVYUIRHHU-YJRXYDGGSA-N Tyr-Ser-Thr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LUMQYLVYUIRHHU-YJRXYDGGSA-N 0.000 description 1
- XUIOBCQESNDTDE-FQPOAREZSA-N Tyr-Thr-Ala Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N)O XUIOBCQESNDTDE-FQPOAREZSA-N 0.000 description 1
- RIVVDNTUSRVTQT-IRIUXVKKSA-N Tyr-Thr-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N)O RIVVDNTUSRVTQT-IRIUXVKKSA-N 0.000 description 1
- XFEMMSGONWQACR-KJEVXHAQSA-N Tyr-Thr-Met Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N)O XFEMMSGONWQACR-KJEVXHAQSA-N 0.000 description 1
- MQUYPYFPHIPVHJ-MNSWYVGCSA-N Tyr-Trp-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@H](CC3=CC=C(C=C3)O)N)O MQUYPYFPHIPVHJ-MNSWYVGCSA-N 0.000 description 1
- GPLTZEMVOCZVAV-UFYCRDLUSA-N Tyr-Tyr-Arg Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O)C1=CC=C(O)C=C1 GPLTZEMVOCZVAV-UFYCRDLUSA-N 0.000 description 1
- AGDDLOQMXUQPDY-BZSNNMDCSA-N Tyr-Tyr-Ser Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(O)=O AGDDLOQMXUQPDY-BZSNNMDCSA-N 0.000 description 1
- UUJHRSTVQCFDPA-UFYCRDLUSA-N Tyr-Tyr-Val Chemical compound C([C@@H](C(=O)N[C@@H](C(C)C)C(O)=O)NC(=O)[C@@H](N)CC=1C=CC(O)=CC=1)C1=CC=C(O)C=C1 UUJHRSTVQCFDPA-UFYCRDLUSA-N 0.000 description 1
- AEOFMCAKYIQQFY-YDHLFZDLSA-N Tyr-Val-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 AEOFMCAKYIQQFY-YDHLFZDLSA-N 0.000 description 1
- SQUMHUZLJDUROQ-YDHLFZDLSA-N Tyr-Val-Asp Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O SQUMHUZLJDUROQ-YDHLFZDLSA-N 0.000 description 1
- NXPDPYYCIRDUHO-ULQDDVLXSA-N Tyr-Val-His Chemical compound C([C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC=1NC=NC=1)C(O)=O)C1=CC=C(O)C=C1 NXPDPYYCIRDUHO-ULQDDVLXSA-N 0.000 description 1
- GOPQNCQSXBJAII-ULQDDVLXSA-N Tyr-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N GOPQNCQSXBJAII-ULQDDVLXSA-N 0.000 description 1
- SMUWZUSWMWVOSL-JYJNAYRXSA-N Tyr-Val-Met Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N SMUWZUSWMWVOSL-JYJNAYRXSA-N 0.000 description 1
- RVGVIWNHABGIFH-IHRRRGAJSA-N Tyr-Val-Ser Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O RVGVIWNHABGIFH-IHRRRGAJSA-N 0.000 description 1
- 108091023045 Untranslated Region Proteins 0.000 description 1
- 108010064997 VPY tripeptide Proteins 0.000 description 1
- UEOOXDLMQZBPFR-ZKWXMUAHSA-N Val-Ala-Asn Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](C(C)C)N UEOOXDLMQZBPFR-ZKWXMUAHSA-N 0.000 description 1
- WOCYUGQDXPTQPY-FXQIFTODSA-N Val-Ala-Cys Chemical compound C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](C(C)C)N WOCYUGQDXPTQPY-FXQIFTODSA-N 0.000 description 1
- YFOCMOVJBQDBCE-NRPADANISA-N Val-Ala-Glu Chemical compound C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N YFOCMOVJBQDBCE-NRPADANISA-N 0.000 description 1
- ASQFIHTXXMFENG-XPUUQOCRSA-N Val-Ala-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O ASQFIHTXXMFENG-XPUUQOCRSA-N 0.000 description 1
- ZLFHAAGHGQBQQN-AEJSXWLSSA-N Val-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](C(C)C)N ZLFHAAGHGQBQQN-AEJSXWLSSA-N 0.000 description 1
- ZLFHAAGHGQBQQN-GUBZILKMSA-N Val-Ala-Pro Natural products CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N1CCC[C@H]1C(O)=O ZLFHAAGHGQBQQN-GUBZILKMSA-N 0.000 description 1
- SLLKXDSRVAOREO-KZVJFYERSA-N Val-Ala-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](C)NC(=O)[C@H](C(C)C)N)O SLLKXDSRVAOREO-KZVJFYERSA-N 0.000 description 1
- UUYCNAXCCDNULB-QXEWZRGKSA-N Val-Arg-Asn Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC(N)=O)C(O)=O UUYCNAXCCDNULB-QXEWZRGKSA-N 0.000 description 1
- PAPWZOJOLKZEFR-AVGNSLFASA-N Val-Arg-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCCN)C(=O)O)N PAPWZOJOLKZEFR-AVGNSLFASA-N 0.000 description 1
- CVUDMNSZAIZFAE-UHFFFAOYSA-N Val-Arg-Pro Natural products NC(N)=NCCCC(NC(=O)C(N)C(C)C)C(=O)N1CCCC1C(O)=O CVUDMNSZAIZFAE-UHFFFAOYSA-N 0.000 description 1
- DNOOLPROHJWCSQ-RCWTZXSCSA-N Val-Arg-Thr Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O DNOOLPROHJWCSQ-RCWTZXSCSA-N 0.000 description 1
- UDNYEPLJTRDMEJ-RCOVLWMOSA-N Val-Asn-Gly Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)NCC(=O)O)N UDNYEPLJTRDMEJ-RCOVLWMOSA-N 0.000 description 1
- QGFPYRPIUXBYGR-YDHLFZDLSA-N Val-Asn-Phe Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N QGFPYRPIUXBYGR-YDHLFZDLSA-N 0.000 description 1
- VUTHNLMCXKLLFI-LAEOZQHASA-N Val-Asp-Gln Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N VUTHNLMCXKLLFI-LAEOZQHASA-N 0.000 description 1
- BMGOFDMKDVVGJG-NHCYSSNCSA-N Val-Asp-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N BMGOFDMKDVVGJG-NHCYSSNCSA-N 0.000 description 1
- HHSILIQTHXABKM-YDHLFZDLSA-N Val-Asp-Phe Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](Cc1ccccc1)C(O)=O HHSILIQTHXABKM-YDHLFZDLSA-N 0.000 description 1
- SCBITHMBEJNRHC-LSJOCFKGSA-N Val-Asp-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](C(C)C)C(=O)O)N SCBITHMBEJNRHC-LSJOCFKGSA-N 0.000 description 1
- VXCAZHCVDBQMTP-NRPADANISA-N Val-Cys-Gln Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N VXCAZHCVDBQMTP-NRPADANISA-N 0.000 description 1
- KOPBYUSPXBQIHD-NRPADANISA-N Val-Cys-Glu Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N KOPBYUSPXBQIHD-NRPADANISA-N 0.000 description 1
- LHADRQBREKTRLR-DCAQKATOSA-N Val-Cys-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](C(C)C)N LHADRQBREKTRLR-DCAQKATOSA-N 0.000 description 1
- HIZMLPKDJAXDRG-FXQIFTODSA-N Val-Cys-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CO)C(=O)O)N HIZMLPKDJAXDRG-FXQIFTODSA-N 0.000 description 1
- HURRXSNHCCSJHA-AUTRQRHGSA-N Val-Gln-Gln Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N HURRXSNHCCSJHA-AUTRQRHGSA-N 0.000 description 1
- QHFQQRKNGCXTHL-AUTRQRHGSA-N Val-Gln-Glu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O QHFQQRKNGCXTHL-AUTRQRHGSA-N 0.000 description 1
- VFOHXOLPLACADK-GVXVVHGQSA-N Val-Gln-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](C(C)C)N VFOHXOLPLACADK-GVXVVHGQSA-N 0.000 description 1
- ZEVNVXYRZRIRCH-GVXVVHGQSA-N Val-Gln-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N ZEVNVXYRZRIRCH-GVXVVHGQSA-N 0.000 description 1
- PGBJAZDAEWPDAA-NHCYSSNCSA-N Val-Gln-Met Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCSC)C(=O)O)N PGBJAZDAEWPDAA-NHCYSSNCSA-N 0.000 description 1
- AGKDVLSDNSTLFA-UMNHJUIQSA-N Val-Gln-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N1CCC[C@@H]1C(=O)O)N AGKDVLSDNSTLFA-UMNHJUIQSA-N 0.000 description 1
- BRPKEERLGYNCNC-NHCYSSNCSA-N Val-Glu-Arg Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N BRPKEERLGYNCNC-NHCYSSNCSA-N 0.000 description 1
- AHHJARQXFFGOKF-NRPADANISA-N Val-Glu-Cys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CS)C(=O)O)N AHHJARQXFFGOKF-NRPADANISA-N 0.000 description 1
- OQWNEUXPKHIEJO-NRPADANISA-N Val-Glu-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CO)C(=O)O)N OQWNEUXPKHIEJO-NRPADANISA-N 0.000 description 1
- XWYUBUYQMOUFRQ-IFFSRLJSSA-N Val-Glu-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](C(C)C)N)O XWYUBUYQMOUFRQ-IFFSRLJSSA-N 0.000 description 1
- UEHRGZCNLSWGHK-DLOVCJGASA-N Val-Glu-Val Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O UEHRGZCNLSWGHK-DLOVCJGASA-N 0.000 description 1
- CELJCNRXKZPTCX-XPUUQOCRSA-N Val-Gly-Ala Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@@H](C)C(O)=O CELJCNRXKZPTCX-XPUUQOCRSA-N 0.000 description 1
- JTWIMNMUYLQNPI-WPRPVWTQSA-N Val-Gly-Arg Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCNC(N)=N JTWIMNMUYLQNPI-WPRPVWTQSA-N 0.000 description 1
- DJEVQCWNMQOABE-RCOVLWMOSA-N Val-Gly-Asp Chemical compound CC(C)[C@@H](C(=O)NCC(=O)N[C@@H](CC(=O)O)C(=O)O)N DJEVQCWNMQOABE-RCOVLWMOSA-N 0.000 description 1
- OXGVAUFVTOPFFA-XPUUQOCRSA-N Val-Gly-Cys Chemical compound CC(C)[C@@H](C(=O)NCC(=O)N[C@@H](CS)C(=O)O)N OXGVAUFVTOPFFA-XPUUQOCRSA-N 0.000 description 1
- BEGDZYNDCNEGJZ-XVKPBYJWSA-N Val-Gly-Gln Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(N)=O BEGDZYNDCNEGJZ-XVKPBYJWSA-N 0.000 description 1
- MDYSKHBSPXUOPV-JSGCOSHPSA-N Val-Gly-Phe Chemical compound CC(C)[C@@H](C(=O)NCC(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N MDYSKHBSPXUOPV-JSGCOSHPSA-N 0.000 description 1
- XXROXFHCMVXETG-UWVGGRQHSA-N Val-Gly-Val Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O XXROXFHCMVXETG-UWVGGRQHSA-N 0.000 description 1
- OACSGBOREVRSME-NHCYSSNCSA-N Val-His-Asn Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](Cc1cnc[nH]1)C(=O)N[C@@H](CC(N)=O)C(O)=O OACSGBOREVRSME-NHCYSSNCSA-N 0.000 description 1
- MJXNDRCLGDSBBE-FHWLQOOXSA-N Val-His-Trp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CC2=CNC3=CC=CC=C32)C(=O)O)N MJXNDRCLGDSBBE-FHWLQOOXSA-N 0.000 description 1
- XBRMBDFYOFARST-AVGNSLFASA-N Val-His-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](C(C)C)C(=O)O)N XBRMBDFYOFARST-AVGNSLFASA-N 0.000 description 1
- CPGJELLYDQEDRK-NAKRPEOUSA-N Val-Ile-Ala Chemical compound CC[C@H](C)[C@H](NC(=O)[C@@H](N)C(C)C)C(=O)N[C@@H](C)C(O)=O CPGJELLYDQEDRK-NAKRPEOUSA-N 0.000 description 1
- FTKXYXACXYOHND-XUXIUFHCSA-N Val-Ile-Leu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O FTKXYXACXYOHND-XUXIUFHCSA-N 0.000 description 1
- JZWZACGUZVCQPS-RNJOBUHISA-N Val-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](C(C)C)N JZWZACGUZVCQPS-RNJOBUHISA-N 0.000 description 1
- OTJMMKPMLUNTQT-AVGNSLFASA-N Val-Leu-Arg Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](C(C)C)N OTJMMKPMLUNTQT-AVGNSLFASA-N 0.000 description 1
- AGXGCFSECFQMKB-NHCYSSNCSA-N Val-Leu-Asp Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N AGXGCFSECFQMKB-NHCYSSNCSA-N 0.000 description 1
- BZOSBRIDWSSTFN-AVGNSLFASA-N Val-Leu-Met Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](C(C)C)N BZOSBRIDWSSTFN-AVGNSLFASA-N 0.000 description 1
- RWOGENDAOGMHLX-DCAQKATOSA-N Val-Lys-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](C(C)C)N RWOGENDAOGMHLX-DCAQKATOSA-N 0.000 description 1
- XXWBHOWRARMUOC-NHCYSSNCSA-N Val-Lys-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(=O)N)C(=O)O)N XXWBHOWRARMUOC-NHCYSSNCSA-N 0.000 description 1
- SJLVYVZBFDTRCG-DCAQKATOSA-N Val-Lys-Cys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CS)C(=O)O)N SJLVYVZBFDTRCG-DCAQKATOSA-N 0.000 description 1
- WBAJDGWKRIHOAC-GVXVVHGQSA-N Val-Lys-Gln Chemical compound [H]N[C@@H](C(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(O)=O WBAJDGWKRIHOAC-GVXVVHGQSA-N 0.000 description 1
- IJGPOONOTBNTFS-GVXVVHGQSA-N Val-Lys-Glu Chemical compound [H]N[C@@H](C(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O IJGPOONOTBNTFS-GVXVVHGQSA-N 0.000 description 1
- MLADEWAIYAPAAU-IHRRRGAJSA-N Val-Lys-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N MLADEWAIYAPAAU-IHRRRGAJSA-N 0.000 description 1
- QRVPEKJBBRYISE-XUXIUFHCSA-N Val-Lys-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](C(C)C)N QRVPEKJBBRYISE-XUXIUFHCSA-N 0.000 description 1
- HPANGHISDXDUQY-ULQDDVLXSA-N Val-Lys-Phe Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N HPANGHISDXDUQY-ULQDDVLXSA-N 0.000 description 1
- XPKCFQZDQGVJCX-RHYQMDGZSA-N Val-Lys-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](C(C)C)N)O XPKCFQZDQGVJCX-RHYQMDGZSA-N 0.000 description 1
- VPGCVZRRBYOGCD-AVGNSLFASA-N Val-Lys-Val Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O VPGCVZRRBYOGCD-AVGNSLFASA-N 0.000 description 1
- YDVDTCJGBBJGRT-GUBZILKMSA-N Val-Met-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CO)C(=O)O)N YDVDTCJGBBJGRT-GUBZILKMSA-N 0.000 description 1
- MJFSRZZJQWZHFQ-SRVKXCTJSA-N Val-Met-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C(C)C)C(=O)O)N MJFSRZZJQWZHFQ-SRVKXCTJSA-N 0.000 description 1
- YLRAFVVWZRSZQC-DZKIICNBSA-N Val-Phe-Glu Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N YLRAFVVWZRSZQC-DZKIICNBSA-N 0.000 description 1
- BCBFMJYTNKDALA-UFYCRDLUSA-N Val-Phe-Phe Chemical compound N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O BCBFMJYTNKDALA-UFYCRDLUSA-N 0.000 description 1
- KISFXYYRKKNLOP-IHRRRGAJSA-N Val-Phe-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)O)N KISFXYYRKKNLOP-IHRRRGAJSA-N 0.000 description 1
- YKNOJPJWNVHORX-UNQGMJICSA-N Val-Phe-Thr Chemical compound CC(C)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H]([C@@H](C)O)C(O)=O)CC1=CC=CC=C1 YKNOJPJWNVHORX-UNQGMJICSA-N 0.000 description 1
- XBJKAZATRJBDCU-GUBZILKMSA-N Val-Pro-Ala Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O XBJKAZATRJBDCU-GUBZILKMSA-N 0.000 description 1
- BGXVHVMJZCSOCA-AVGNSLFASA-N Val-Pro-Lys Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)O)N BGXVHVMJZCSOCA-AVGNSLFASA-N 0.000 description 1
- QIVPZSWBBHRNBA-JYJNAYRXSA-N Val-Pro-Phe Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](Cc1ccccc1)C(O)=O QIVPZSWBBHRNBA-JYJNAYRXSA-N 0.000 description 1
- DOFAQXCYFQKSHT-SRVKXCTJSA-N Val-Pro-Pro Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 DOFAQXCYFQKSHT-SRVKXCTJSA-N 0.000 description 1
- SSYBNWFXCFNRFN-GUBZILKMSA-N Val-Pro-Ser Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O SSYBNWFXCFNRFN-GUBZILKMSA-N 0.000 description 1
- MIKHIIQMRFYVOR-RCWTZXSCSA-N Val-Pro-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](C(C)C)N)O MIKHIIQMRFYVOR-RCWTZXSCSA-N 0.000 description 1
- PGQUDQYHWICSAB-NAKRPEOUSA-N Val-Ser-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](C(C)C)N PGQUDQYHWICSAB-NAKRPEOUSA-N 0.000 description 1
- GBIUHAYJGWVNLN-AEJSXWLSSA-N Val-Ser-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N1CCC[C@@H]1C(=O)O)N GBIUHAYJGWVNLN-AEJSXWLSSA-N 0.000 description 1
- GBIUHAYJGWVNLN-UHFFFAOYSA-N Val-Ser-Pro Natural products CC(C)C(N)C(=O)NC(CO)C(=O)N1CCCC1C(O)=O GBIUHAYJGWVNLN-UHFFFAOYSA-N 0.000 description 1
- NZYNRRGJJVSSTJ-GUBZILKMSA-N Val-Ser-Val Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O NZYNRRGJJVSSTJ-GUBZILKMSA-N 0.000 description 1
- MNSSBIHFEUUXNW-RCWTZXSCSA-N Val-Thr-Arg Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N MNSSBIHFEUUXNW-RCWTZXSCSA-N 0.000 description 1
- PQSNETRGCRUOGP-KKHAAJSZSA-N Val-Thr-Asn Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(N)=O PQSNETRGCRUOGP-KKHAAJSZSA-N 0.000 description 1
- UVHFONIHVHLDDQ-IFFSRLJSSA-N Val-Thr-Glu Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N)O UVHFONIHVHLDDQ-IFFSRLJSSA-N 0.000 description 1
- YQYFYUSYEDNLSD-YEPSODPASA-N Val-Thr-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O YQYFYUSYEDNLSD-YEPSODPASA-N 0.000 description 1
- TVGWMCTYUFBXAP-QTKMDUPCSA-N Val-Thr-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](C(C)C)N)O TVGWMCTYUFBXAP-QTKMDUPCSA-N 0.000 description 1
- NGXQOQNXSGOYOI-BQFCYCMXSA-N Val-Trp-Gln Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](N)C(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O)=CNC2=C1 NGXQOQNXSGOYOI-BQFCYCMXSA-N 0.000 description 1
- UFCHCOKFAGOQSF-BQFCYCMXSA-N Val-Trp-Glu Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N UFCHCOKFAGOQSF-BQFCYCMXSA-N 0.000 description 1
- VBTFUDNTMCHPII-UHFFFAOYSA-N Val-Trp-Tyr Natural products C=1NC2=CC=CC=C2C=1CC(NC(=O)C(N)C(C)C)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 VBTFUDNTMCHPII-UHFFFAOYSA-N 0.000 description 1
- KJFBXCFOPAKPTM-BZSNNMDCSA-N Val-Trp-Val Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](N)C(C)C)C(=O)N[C@@H](C(C)C)C(O)=O)=CNC2=C1 KJFBXCFOPAKPTM-BZSNNMDCSA-N 0.000 description 1
- JXCOEPXCBVCTRD-JYJNAYRXSA-N Val-Tyr-Arg Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N JXCOEPXCBVCTRD-JYJNAYRXSA-N 0.000 description 1
- PMKQKNBISAOSRI-XHSDSOJGSA-N Val-Tyr-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N2CCC[C@@H]2C(=O)O)N PMKQKNBISAOSRI-XHSDSOJGSA-N 0.000 description 1
- RTJPAGFXOWEBAI-SRVKXCTJSA-N Val-Val-Arg Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N RTJPAGFXOWEBAI-SRVKXCTJSA-N 0.000 description 1
- DFQZDQPLWBSFEJ-LSJOCFKGSA-N Val-Val-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(=O)N)C(=O)O)N DFQZDQPLWBSFEJ-LSJOCFKGSA-N 0.000 description 1
- NLNCNKIVJPEFBC-DLOVCJGASA-N Val-Val-Glu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCC(O)=O NLNCNKIVJPEFBC-DLOVCJGASA-N 0.000 description 1
- YKZVPMUGEJXEOR-JYJNAYRXSA-N Val-Val-Tyr Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N YKZVPMUGEJXEOR-JYJNAYRXSA-N 0.000 description 1
- 241001492404 Woodchuck hepatitis virus Species 0.000 description 1
- LEBBDRXHHNYZIA-LDUWYPJVSA-N [(2s,3r,4s,5r,6r)-3,4,5-trihydroxy-6-(hydroxymethyl)oxan-2-yl] n-[(z)-1,3-dihydroxyoctadec-4-en-2-yl]carbamate Chemical compound CCCCCCCCCCCCC\C=C/C(O)C(CO)NC(=O)O[C@@H]1O[C@H](CO)[C@H](O)[C@H](O)[C@H]1O LEBBDRXHHNYZIA-LDUWYPJVSA-N 0.000 description 1
- 230000001594 aberrant effect Effects 0.000 description 1
- 230000002159 abnormal effect Effects 0.000 description 1
- 230000005856 abnormality Effects 0.000 description 1
- CUJRVFIICFDLGR-UHFFFAOYSA-N acetylacetonate Chemical compound CC(=O)[CH-]C(C)=O CUJRVFIICFDLGR-UHFFFAOYSA-N 0.000 description 1
- 230000003213 activating effect Effects 0.000 description 1
- 230000004913 activation Effects 0.000 description 1
- 229960002964 adalimumab Drugs 0.000 description 1
- 150000003838 adenosines Chemical class 0.000 description 1
- 239000000443 aerosol Substances 0.000 description 1
- 108010008685 alanyl-glutamyl-aspartic acid Proteins 0.000 description 1
- 108010086434 alanyl-seryl-glycine Proteins 0.000 description 1
- 108010045350 alanyl-tyrosyl-alanine Proteins 0.000 description 1
- 108010041407 alanylaspartic acid Proteins 0.000 description 1
- 108010050025 alpha-glutamyltryptophan Proteins 0.000 description 1
- 150000001413 amino acids Chemical class 0.000 description 1
- 230000000692 anti-sense effect Effects 0.000 description 1
- 230000036506 anxiety Effects 0.000 description 1
- 238000013459 approach Methods 0.000 description 1
- 108010001271 arginyl-glutamyl-arginine Proteins 0.000 description 1
- 108010009111 arginyl-glycyl-glutamic acid Proteins 0.000 description 1
- 108010029539 arginyl-prolyl-proline Proteins 0.000 description 1
- 108010036533 arginylvaline Proteins 0.000 description 1
- 108010021908 aspartyl-aspartyl-glutamyl-aspartic acid Proteins 0.000 description 1
- 108010031045 aspartyl-glycyl-aspartyl-alanine Proteins 0.000 description 1
- 230000004900 autophagic degradation Effects 0.000 description 1
- 230000006736 behavioral deficit Effects 0.000 description 1
- 230000006741 behavioral dysfunction Effects 0.000 description 1
- 238000009227 behaviour therapy Methods 0.000 description 1
- 230000008901 benefit Effects 0.000 description 1
- WPIHMWBQRSAMDE-YCZTVTEBSA-N beta-D-galactosyl-(1->4)-beta-D-galactosyl-N-(pentacosanoyl)sphingosine Chemical compound CCCCCCCCCCCCCCCCCCCCCCCCC(=O)N[C@@H](CO[C@@H]1O[C@H](CO)[C@H](O[C@@H]2O[C@H](CO)[C@H](O)[C@H](O)[C@H]2O)[C@H](O)[C@H]1O)[C@H](O)\C=C\CCCCCCCCCCCCC WPIHMWBQRSAMDE-YCZTVTEBSA-N 0.000 description 1
- HHJTWTPUPVQKNA-JIAPQYILSA-N beta-D-glucosylsphingosine Chemical compound CCCCCCCCCCCCC\C=C\[C@@H](O)[C@@H](N)CO[C@@H]1O[C@H](CO)[C@@H](O)[C@H](O)[C@H]1O HHJTWTPUPVQKNA-JIAPQYILSA-N 0.000 description 1
- IQFYYKKMVGJFEH-UHFFFAOYSA-N beta-L-thymidine Natural products O=C1NC(=O)C(C)=CN1C1OC(CO)C(O)C1 IQFYYKKMVGJFEH-UHFFFAOYSA-N 0.000 description 1
- DRTQHJPVMGBUCF-PSQAKQOGSA-N beta-L-uridine Natural products O[C@H]1[C@@H](O)[C@H](CO)O[C@@H]1N1C(=O)NC(=O)C=C1 DRTQHJPVMGBUCF-PSQAKQOGSA-N 0.000 description 1
- 238000003236 bicinchoninic acid assay Methods 0.000 description 1
- 239000003124 biologic agent Substances 0.000 description 1
- 230000006696 biosynthetic metabolic pathway Effects 0.000 description 1
- 230000015572 biosynthetic process Effects 0.000 description 1
- 210000004369 blood Anatomy 0.000 description 1
- 239000008280 blood Substances 0.000 description 1
- 210000000988 bone and bone Anatomy 0.000 description 1
- 210000001185 bone marrow Anatomy 0.000 description 1
- 108010006025 bovine growth hormone Proteins 0.000 description 1
- 210000005013 brain tissue Anatomy 0.000 description 1
- 229960004436 budesonide Drugs 0.000 description 1
- 238000004364 calculation method Methods 0.000 description 1
- 210000000234 capsid Anatomy 0.000 description 1
- 230000015556 catabolic process Effects 0.000 description 1
- 238000000423 cell based assay Methods 0.000 description 1
- 230000004663 cell proliferation Effects 0.000 description 1
- 210000003710 cerebral cortex Anatomy 0.000 description 1
- 230000008859 change Effects 0.000 description 1
- 230000001876 chaperonelike Effects 0.000 description 1
- 239000003795 chemical substances by application Substances 0.000 description 1
- 230000002759 chromosomal effect Effects 0.000 description 1
- 238000010367 cloning Methods 0.000 description 1
- 230000006999 cognitive decline Effects 0.000 description 1
- 230000000052 comparative effect Effects 0.000 description 1
- 238000002247 constant time method Methods 0.000 description 1
- 239000003246 corticosteroid Substances 0.000 description 1
- 239000006071 cream Substances 0.000 description 1
- 210000004748 cultured cell Anatomy 0.000 description 1
- 108010060199 cysteinylproline Proteins 0.000 description 1
- 230000002939 deleterious effect Effects 0.000 description 1
- 238000013461 design Methods 0.000 description 1
- 239000002270 dispersing agent Substances 0.000 description 1
- 238000009826 distribution Methods 0.000 description 1
- 229960003638 dopamine Drugs 0.000 description 1
- 239000003814 drug Substances 0.000 description 1
- 239000000975 dye Substances 0.000 description 1
- 230000002996 emotional effect Effects 0.000 description 1
- 238000005538 encapsulation Methods 0.000 description 1
- 230000002121 endocytic effect Effects 0.000 description 1
- 210000001163 endosome Anatomy 0.000 description 1
- 238000001849 endotracheal instillation Methods 0.000 description 1
- 238000002641 enzyme replacement therapy Methods 0.000 description 1
- 210000003527 eukaryotic cell Anatomy 0.000 description 1
- 229960005167 everolimus Drugs 0.000 description 1
- 230000005713 exacerbation Effects 0.000 description 1
- 238000010195 expression analysis Methods 0.000 description 1
- 210000002950 fibroblast Anatomy 0.000 description 1
- 230000004761 fibrosis Effects 0.000 description 1
- 239000000945 filler Substances 0.000 description 1
- 238000007421 fluorometric assay Methods 0.000 description 1
- 229940014144 folate Drugs 0.000 description 1
- OVBPIULPVIDEAO-LBPRGKRZSA-N folic acid Chemical compound C=1N=C2NC(N)=NC(=O)C2=NC=1CNC1=CC=C(C(=O)N[C@@H](CCC(O)=O)C(O)=O)C=C1 OVBPIULPVIDEAO-LBPRGKRZSA-N 0.000 description 1
- 235000019152 folic acid Nutrition 0.000 description 1
- 239000011724 folic acid Substances 0.000 description 1
- 108020001507 fusion proteins Proteins 0.000 description 1
- 102000037865 fusion proteins Human genes 0.000 description 1
- 229930182830 galactose Natural products 0.000 description 1
- 108010006664 gamma-glutamyl-glycyl-glycine Proteins 0.000 description 1
- 230000002496 gastric effect Effects 0.000 description 1
- 230000009368 gene silencing by RNA Effects 0.000 description 1
- 238000009650 gentamicin protection assay Methods 0.000 description 1
- 230000007387 gliosis Effects 0.000 description 1
- 150000002305 glucosylceramides Chemical class 0.000 description 1
- 108010042598 glutamyl-aspartyl-glycine Proteins 0.000 description 1
- 108010057083 glutamyl-aspartyl-leucine Proteins 0.000 description 1
- 108010073628 glutamyl-valyl-phenylalanine Proteins 0.000 description 1
- 108010079547 glutamylmethionine Proteins 0.000 description 1
- JYPCXBJRLBHWME-UHFFFAOYSA-N glycyl-L-prolyl-L-arginine Natural products NCC(=O)N1CCCC1C(=O)NC(CCCN=C(N)N)C(O)=O JYPCXBJRLBHWME-UHFFFAOYSA-N 0.000 description 1
- 108010090037 glycyl-alanyl-isoleucine Proteins 0.000 description 1
- 108010075431 glycyl-alanyl-phenylalanine Proteins 0.000 description 1
- 108010072405 glycyl-aspartyl-glycine Proteins 0.000 description 1
- 108010081985 glycyl-cystinyl-aspartic acid Proteins 0.000 description 1
- 108010051307 glycyl-glycyl-proline Proteins 0.000 description 1
- 108010028188 glycyl-histidyl-serine Proteins 0.000 description 1
- 108010054666 glycyl-leucyl-glycyl-glycine Proteins 0.000 description 1
- 230000012010 growth Effects 0.000 description 1
- 230000003394 haemopoietic effect Effects 0.000 description 1
- 210000002216 heart Anatomy 0.000 description 1
- 206010019847 hepatosplenomegaly Diseases 0.000 description 1
- 102000057063 human MAPT Human genes 0.000 description 1
- 238000010166 immunofluorescence Methods 0.000 description 1
- 102000018358 immunoglobulin Human genes 0.000 description 1
- 230000005022 impaired gait Effects 0.000 description 1
- 238000011534 incubation Methods 0.000 description 1
- 230000001939 inductive effect Effects 0.000 description 1
- 230000002757 inflammatory effect Effects 0.000 description 1
- 230000004054 inflammatory process Effects 0.000 description 1
- 230000005764 inhibitory process Effects 0.000 description 1
- 210000000936 intestine Anatomy 0.000 description 1
- 230000003834 intracellular effect Effects 0.000 description 1
- 238000007918 intramuscular administration Methods 0.000 description 1
- 238000007912 intraperitoneal administration Methods 0.000 description 1
- 108010027338 isoleucylcysteine Proteins 0.000 description 1
- 229930027917 kanamycin Natural products 0.000 description 1
- 229960000318 kanamycin Drugs 0.000 description 1
- SBUJHOSQTJFQJX-NOAMYHISSA-N kanamycin Chemical compound O[C@@H]1[C@@H](O)[C@H](O)[C@@H](CN)O[C@@H]1O[C@H]1[C@H](O)[C@@H](O[C@@H]2[C@@H]([C@@H](N)[C@H](O)[C@@H](CO)O2)O)[C@H](N)C[C@@H]1N SBUJHOSQTJFQJX-NOAMYHISSA-N 0.000 description 1
- 229930182823 kanamycin A Natural products 0.000 description 1
- 108010053037 kyotorphin Proteins 0.000 description 1
- 208000028756 lack of coordination Diseases 0.000 description 1
- 231100000518 lethal Toxicity 0.000 description 1
- 230000001665 lethal effect Effects 0.000 description 1
- 108010083708 leucyl-aspartyl-valine Proteins 0.000 description 1
- 108010044056 leucyl-phenylalanine Proteins 0.000 description 1
- 108010000761 leucylarginine Proteins 0.000 description 1
- 239000007788 liquid Substances 0.000 description 1
- 210000005228 liver tissue Anatomy 0.000 description 1
- 230000001926 lymphatic effect Effects 0.000 description 1
- 101150059888 lysM gene Proteins 0.000 description 1
- 108010045758 lysosomal proteins Proteins 0.000 description 1
- 229940124302 mTOR inhibitor Drugs 0.000 description 1
- 229920002521 macromolecule Polymers 0.000 description 1
- 239000003628 mammalian target of rapamycin inhibitor Substances 0.000 description 1
- 210000001259 mesencephalon Anatomy 0.000 description 1
- 108700023046 methionyl-leucyl-phenylalanine Proteins 0.000 description 1
- 108010056582 methionylglutamic acid Proteins 0.000 description 1
- 108010085203 methionylmethionine Proteins 0.000 description 1
- 229960000485 methotrexate Drugs 0.000 description 1
- 210000004688 microtubule Anatomy 0.000 description 1
- 239000007758 minimum essential medium Substances 0.000 description 1
- 210000000337 motor cortex Anatomy 0.000 description 1
- 239000007922 nasal spray Substances 0.000 description 1
- 229940097496 nasal spray Drugs 0.000 description 1
- 229960005027 natalizumab Drugs 0.000 description 1
- 230000012106 negative regulation of microtubule depolymerization Effects 0.000 description 1
- 210000001577 neostriatum Anatomy 0.000 description 1
- 230000001537 neural effect Effects 0.000 description 1
- 210000002569 neuron Anatomy 0.000 description 1
- 108091027963 non-coding RNA Proteins 0.000 description 1
- 102000042567 non-coding RNA Human genes 0.000 description 1
- 231100000252 nontoxic Toxicity 0.000 description 1
- 230000003000 nontoxic effect Effects 0.000 description 1
- 239000002674 ointment Substances 0.000 description 1
- 150000002482 oligosaccharides Polymers 0.000 description 1
- 239000000668 oral spray Substances 0.000 description 1
- 229940041678 oral spray Drugs 0.000 description 1
- 230000007170 pathology Effects 0.000 description 1
- 229940049954 penicillin Drugs 0.000 description 1
- 239000000137 peptide hydrolase inhibitor Substances 0.000 description 1
- 108010074082 phenylalanyl-alanyl-lysine Proteins 0.000 description 1
- 108010064486 phenylalanyl-leucyl-valine Proteins 0.000 description 1
- 108010089198 phenylalanyl-prolyl-arginine Proteins 0.000 description 1
- 108010024607 phenylalanylalanine Proteins 0.000 description 1
- 108010018625 phenylalanylarginine Proteins 0.000 description 1
- 108010073101 phenylalanylleucine Proteins 0.000 description 1
- 108010083476 phenylalanyltryptophan Proteins 0.000 description 1
- 108010025488 pinealon Proteins 0.000 description 1
- 239000013600 plasmid vector Substances 0.000 description 1
- 238000007747 plating Methods 0.000 description 1
- 239000000843 powder Substances 0.000 description 1
- 229960004618 prednisone Drugs 0.000 description 1
- XOFYZVNMUHMLCC-ZPOLXVRWSA-N prednisone Chemical compound O=C1C=C[C@]2(C)[C@H]3C(=O)C[C@](C)([C@@](CC4)(O)C(=O)CO)[C@@H]4[C@@H]3CCC2=C1 XOFYZVNMUHMLCC-ZPOLXVRWSA-N 0.000 description 1
- 210000000063 presynaptic terminal Anatomy 0.000 description 1
- 230000002265 prevention Effects 0.000 description 1
- 102000004196 processed proteins & peptides Human genes 0.000 description 1
- 238000012545 processing Methods 0.000 description 1
- 210000001236 prokaryotic cell Anatomy 0.000 description 1
- 238000000751 protein extraction Methods 0.000 description 1
- 238000001243 protein synthesis Methods 0.000 description 1
- 230000017854 proteolysis Effects 0.000 description 1
- 230000007111 proteostasis Effects 0.000 description 1
- HHJTWTPUPVQKNA-PIIMIWFASA-N psychosine Chemical compound CCCCCCCCCCCCC\C=C\[C@@H](O)[C@@H](N)CO[C@@H]1O[C@H](CO)[C@H](O)[C@H](O)[C@H]1O HHJTWTPUPVQKNA-PIIMIWFASA-N 0.000 description 1
- 208000005069 pulmonary fibrosis Diseases 0.000 description 1
- ZAHRKKWIAAJSAO-UHFFFAOYSA-N rapamycin Natural products COCC(O)C(=C/C(C)C(=O)CC(OC(=O)C1CCCCN1C(=O)C(=O)C2(O)OC(CC(OC)C(=CC=CC=CC(C)CC(C)C(=O)C)C)CCC2C)C(C)CC3CCC(O)C(C3)OC)C ZAHRKKWIAAJSAO-UHFFFAOYSA-N 0.000 description 1
- 108700042226 ras Genes Proteins 0.000 description 1
- 230000014493 regulation of gene expression Effects 0.000 description 1
- 230000010076 replication Effects 0.000 description 1
- 230000007441 retrograde transport Effects 0.000 description 1
- 230000001177 retroviral effect Effects 0.000 description 1
- 238000010839 reverse transcription Methods 0.000 description 1
- 238000003757 reverse transcription PCR Methods 0.000 description 1
- 230000004434 saccadic eye movement Effects 0.000 description 1
- 231100000279 safety data Toxicity 0.000 description 1
- 238000000926 separation method Methods 0.000 description 1
- 108010069117 seryl-lysyl-aspartic acid Proteins 0.000 description 1
- 108010007375 seryl-seryl-seryl-arginine Proteins 0.000 description 1
- 229960002930 sirolimus Drugs 0.000 description 1
- QFJCIRLUMZQUOT-HPLJOQBZSA-N sirolimus Chemical compound C1C[C@@H](O)[C@H](OC)C[C@@H]1C[C@@H](C)[C@H]1OC(=O)[C@@H]2CCCCN2C(=O)C(=O)[C@](O)(O2)[C@H](C)CC[C@H]2C[C@H](OC)/C(C)=C/C=C/C=C/[C@@H](C)C[C@@H](C)C(=O)[C@H](OC)[C@H](O)/C(C)=C/[C@@H](C)C(=O)C1 QFJCIRLUMZQUOT-HPLJOQBZSA-N 0.000 description 1
- 208000019116 sleep disease Diseases 0.000 description 1
- 208000022925 sleep disturbance Diseases 0.000 description 1
- 229940126586 small molecule drug Drugs 0.000 description 1
- 239000007787 solid Substances 0.000 description 1
- 239000002904 solvent Substances 0.000 description 1
- 230000004137 sphingolipid metabolism Effects 0.000 description 1
- 150000003408 sphingolipids Chemical class 0.000 description 1
- 210000001324 spliceosome Anatomy 0.000 description 1
- 239000003381 stabilizer Substances 0.000 description 1
- 229960005322 streptomycin Drugs 0.000 description 1
- 238000007920 subcutaneous administration Methods 0.000 description 1
- 238000010254 subcutaneous injection Methods 0.000 description 1
- 239000007929 subcutaneous injection Substances 0.000 description 1
- 239000006228 supernatant Substances 0.000 description 1
- 239000000375 suspending agent Substances 0.000 description 1
- 238000003786 synthesis reaction Methods 0.000 description 1
- 230000009885 systemic effect Effects 0.000 description 1
- 229940124597 therapeutic agent Drugs 0.000 description 1
- 239000002562 thickening agent Substances 0.000 description 1
- 229940104230 thymidine Drugs 0.000 description 1
- 230000000699 topical effect Effects 0.000 description 1
- 231100000331 toxic Toxicity 0.000 description 1
- 230000002588 toxic effect Effects 0.000 description 1
- 231100000419 toxicity Toxicity 0.000 description 1
- 230000001988 toxicity Effects 0.000 description 1
- 210000003412 trans-golgi network Anatomy 0.000 description 1
- 230000007704 transition Effects 0.000 description 1
- 230000014616 translation Effects 0.000 description 1
- 108010029384 tryptophyl-histidine Proteins 0.000 description 1
- 108010005834 tyrosyl-alanyl-glycine Proteins 0.000 description 1
- 108010029599 tyrosyl-glutamyl-tryptophan Proteins 0.000 description 1
- 108010020532 tyrosyl-proline Proteins 0.000 description 1
- 108010071635 tyrosyl-prolyl-arginine Proteins 0.000 description 1
- DRTQHJPVMGBUCF-UHFFFAOYSA-N uracil arabinoside Natural products OC1C(O)C(CO)OC1N1C(=O)NC(=O)C=C1 DRTQHJPVMGBUCF-UHFFFAOYSA-N 0.000 description 1
- 229940045145 uridine Drugs 0.000 description 1
- 108010015385 valyl-prolyl-proline Proteins 0.000 description 1
- 108010009962 valyltyrosine Proteins 0.000 description 1
- 210000003462 vein Anatomy 0.000 description 1
- 238000005406 washing Methods 0.000 description 1
- 230000004584 weight gain Effects 0.000 description 1
- 235000019786 weight gain Nutrition 0.000 description 1
- 210000005253 yeast cell Anatomy 0.000 description 1
Classifications
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/63—Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
- C12N15/79—Vectors or expression systems specially adapted for eukaryotic hosts
- C12N15/85—Vectors or expression systems specially adapted for eukaryotic hosts for animal cells
- C12N15/86—Viral vectors
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61K—PREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
- A61K38/00—Medicinal preparations containing peptides
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61K—PREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
- A61K48/00—Medicinal preparations containing genetic material which is inserted into cells of the living body to treat genetic diseases; Gene therapy
- A61K48/005—Medicinal preparations containing genetic material which is inserted into cells of the living body to treat genetic diseases; Gene therapy characterised by an aspect of the 'active' part of the composition delivered, i.e. the nucleic acid delivered
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61K—PREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
- A61K48/00—Medicinal preparations containing genetic material which is inserted into cells of the living body to treat genetic diseases; Gene therapy
- A61K48/0075—Medicinal preparations containing genetic material which is inserted into cells of the living body to treat genetic diseases; Gene therapy characterised by an aspect of the delivery route, e.g. oral, subcutaneous
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61K—PREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
- A61K9/00—Medicinal preparations characterised by special physical form
- A61K9/0012—Galenical forms characterised by the site of application
- A61K9/0019—Injectable compositions; Intramuscular, intravenous, arterial, subcutaneous administration; Compositions to be administered through the skin in an invasive manner
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61K—PREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
- A61K9/00—Medicinal preparations characterised by special physical form
- A61K9/0012—Galenical forms characterised by the site of application
- A61K9/0085—Brain, e.g. brain implants; Spinal cord
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61P—SPECIFIC THERAPEUTIC ACTIVITY OF CHEMICAL COMPOUNDS OR MEDICINAL PREPARATIONS
- A61P25/00—Drugs for disorders of the nervous system
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61P—SPECIFIC THERAPEUTIC ACTIVITY OF CHEMICAL COMPOUNDS OR MEDICINAL PREPARATIONS
- A61P25/00—Drugs for disorders of the nervous system
- A61P25/14—Drugs for disorders of the nervous system for treating abnormal movements, e.g. chorea, dyskinesia
- A61P25/16—Anti-Parkinson drugs
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K14/00—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
- C07K14/435—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from animals; from humans
- C07K14/46—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from animals; from humans from vertebrates
- C07K14/47—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from animals; from humans from vertebrates from mammals
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K14/00—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
- C07K14/435—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from animals; from humans
- C07K14/475—Growth factors; Growth regulators
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K14/00—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
- C07K14/435—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from animals; from humans
- C07K14/705—Receptors; Cell surface antigens; Cell surface determinants
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/11—DNA or RNA fragments; Modified forms thereof; Non-coding nucleic acids having a biological activity
- C12N15/113—Non-coding nucleic acids modulating the expression of genes, e.g. antisense oligonucleotides; Antisense DNA or RNA; Triplex- forming oligonucleotides; Catalytic nucleic acids, e.g. ribozymes; Nucleic acids used in co-suppression or gene silencing
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/11—DNA or RNA fragments; Modified forms thereof; Non-coding nucleic acids having a biological activity
- C12N15/113—Non-coding nucleic acids modulating the expression of genes, e.g. antisense oligonucleotides; Antisense DNA or RNA; Triplex- forming oligonucleotides; Catalytic nucleic acids, e.g. ribozymes; Nucleic acids used in co-suppression or gene silencing
- C12N15/1137—Non-coding nucleic acids modulating the expression of genes, e.g. antisense oligonucleotides; Antisense DNA or RNA; Triplex- forming oligonucleotides; Catalytic nucleic acids, e.g. ribozymes; Nucleic acids used in co-suppression or gene silencing against enzymes
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N9/00—Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
- C12N9/14—Hydrolases (3)
- C12N9/24—Hydrolases (3) acting on glycosyl compounds (3.2)
- C12N9/2402—Hydrolases (3) acting on glycosyl compounds (3.2) hydrolysing O- and S- glycosyl compounds (3.2.1)
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N9/00—Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
- C12N9/14—Hydrolases (3)
- C12N9/24—Hydrolases (3) acting on glycosyl compounds (3.2)
- C12N9/2402—Hydrolases (3) acting on glycosyl compounds (3.2) hydrolysing O- and S- glycosyl compounds (3.2.1)
- C12N9/2405—Glucanases
- C12N9/2408—Glucanases acting on alpha -1,4-glucosidic bonds
- C12N9/2411—Amylases
- C12N9/2414—Alpha-amylase (3.2.1.1.)
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Y—ENZYMES
- C12Y302/00—Hydrolases acting on glycosyl compounds, i.e. glycosylases (3.2)
- C12Y302/01—Glycosidases, i.e. enzymes hydrolysing O- and S-glycosyl compounds (3.2.1)
- C12Y302/01045—Glucosylceramidase (3.2.1.45), i.e. beta-glucocerebrosidase
-
- A—HUMAN NECESSITIES
- A01—AGRICULTURE; FORESTRY; ANIMAL HUSBANDRY; HUNTING; TRAPPING; FISHING
- A01K—ANIMAL HUSBANDRY; AVICULTURE; APICULTURE; PISCICULTURE; FISHING; REARING OR BREEDING ANIMALS, NOT OTHERWISE PROVIDED FOR; NEW BREEDS OF ANIMALS
- A01K2207/00—Modified animals
- A01K2207/20—Animals treated with compounds which are neither proteins nor nucleic acids
-
- A—HUMAN NECESSITIES
- A01—AGRICULTURE; FORESTRY; ANIMAL HUSBANDRY; HUNTING; TRAPPING; FISHING
- A01K—ANIMAL HUSBANDRY; AVICULTURE; APICULTURE; PISCICULTURE; FISHING; REARING OR BREEDING ANIMALS, NOT OTHERWISE PROVIDED FOR; NEW BREEDS OF ANIMALS
- A01K2227/00—Animals characterised by species
- A01K2227/10—Mammal
- A01K2227/105—Murine
-
- A—HUMAN NECESSITIES
- A01—AGRICULTURE; FORESTRY; ANIMAL HUSBANDRY; HUNTING; TRAPPING; FISHING
- A01K—ANIMAL HUSBANDRY; AVICULTURE; APICULTURE; PISCICULTURE; FISHING; REARING OR BREEDING ANIMALS, NOT OTHERWISE PROVIDED FOR; NEW BREEDS OF ANIMALS
- A01K2267/00—Animals characterised by purpose
- A01K2267/03—Animal model, e.g. for test or diseases
- A01K2267/0306—Animal model for genetic diseases
- A01K2267/0318—Animal model for neurodegenerative disease, e.g. non- Alzheimer's
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N2310/00—Structure or type of the nucleic acid
- C12N2310/10—Type of nucleic acid
- C12N2310/14—Type of nucleic acid interfering N.A.
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N2320/00—Applications; Uses
- C12N2320/30—Special therapeutic applications
- C12N2320/31—Combination therapy
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N2330/00—Production
- C12N2330/50—Biochemical production, i.e. in a transformed host cell
- C12N2330/51—Specially adapted vectors
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N2750/00—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA ssDNA viruses
- C12N2750/00011—Details
- C12N2750/14011—Parvoviridae
- C12N2750/14111—Dependovirus, e.g. adenoassociated viruses
- C12N2750/14141—Use of virus, viral particle or viral elements as a vector
- C12N2750/14143—Use of virus, viral particle or viral elements as a vector viral genome or elements thereof as genetic vector
Landscapes
- Health & Medical Sciences (AREA)
- Life Sciences & Earth Sciences (AREA)
- Chemical & Material Sciences (AREA)
- Genetics & Genomics (AREA)
- Engineering & Computer Science (AREA)
- Organic Chemistry (AREA)
- Bioinformatics & Cheminformatics (AREA)
- General Health & Medical Sciences (AREA)
- Zoology (AREA)
- Biomedical Technology (AREA)
- Wood Science & Technology (AREA)
- Biotechnology (AREA)
- Molecular Biology (AREA)
- General Engineering & Computer Science (AREA)
- Medicinal Chemistry (AREA)
- Biochemistry (AREA)
- Pharmacology & Pharmacy (AREA)
- Veterinary Medicine (AREA)
- Public Health (AREA)
- Animal Behavior & Ethology (AREA)
- Biophysics (AREA)
- Microbiology (AREA)
- Epidemiology (AREA)
- Physics & Mathematics (AREA)
- Plant Pathology (AREA)
- Neurosurgery (AREA)
- Neurology (AREA)
- Virology (AREA)
- Proteomics, Peptides & Aminoacids (AREA)
- Toxicology (AREA)
- Gastroenterology & Hepatology (AREA)
- General Chemical & Material Sciences (AREA)
- Chemical Kinetics & Catalysis (AREA)
- Nuclear Medicine, Radiotherapy & Molecular Imaging (AREA)
- Psychology (AREA)
- Immunology (AREA)
- Orthopedic Medicine & Surgery (AREA)
- Dermatology (AREA)
- Cell Biology (AREA)
- Medicines That Contain Protein Lipid Enzymes And Other Medicines (AREA)
Abstract
The present disclosure relates, in some aspects, to compositions and methods for the treatment of central nervous system (CNS) diseases, such as Parkinson's disease (PD) and Gaucher disease. In some embodiments, the disclosure provides expression constructs comprising a transgene encoding one or more CNS disease-associated gene products and/or one or more inhibitory nucleic acids that target CNS disease-associated genes or gene products. In some embodiments, the disclosure provides methods of treating a CNS disease by administering such an expression construct to a subject in need thereof.
Description
Related applications
This application claims priority under 35 U.S.C. 119(e) to U.S. Provisional Application Serial No. 62/832,223, filed April 10, 2019, entitled "AAV Vectors Encoding TREM2 and Uses Thereof"; U.S. Provisional Application Serial No. 62/831,840, filed April 10, 2019, entitled "Gene Therapy for Lysosomal Disorders"; U.S. Provisional Application Serial No. 62/831,846, filed April 10, 2019, entitled "Gene Therapy for Lysosomal Disorders"; U.S. Provisional Application Serial No. 62/831,856, filed April 10, 2019, entitled "Gene Therapy for Lysosomal Disorders"; U.S. Provisional Application Serial No. 62/934,450, filed November 12, 2019, entitled "Gene Therapy for Lysosomal Disorders"; U.S. Provisional Application Serial No. 62/954,089, filed December 27, 2019, entitled "Gene Therapy for Lysosomal Disorders"; U.S. Provisional Application Serial No. 62/960,471, filed January 13, 2020, entitled "Gene Therapy for Lysosomal Disorders"; U.S. Provisional Application Serial No. 62/998,665, filed March 12, 2020, entitled "Gene Therapy for Lysosomal Disorders"; and U.S. Provisional Application Serial No. 62/990,246, filed March 16, 2020, entitled "Gene Therapy for Lysosomal Disorders". The entire contents of each of these provisional applications are incorporated herein by reference.
Gaucher disease is a rare inborn error of glycosphingolipid metabolism caused by a deficiency of lysosomal acid β-glucocerebrosidase (Gcase, "GBA"). Patients suffer from non-CNS symptoms and findings, including hepatosplenomegaly, bone marrow insufficiency leading to pancytopenia, lung disorders and fibrosis, and bone defects. In addition, a significant number of patients suffer from neurological manifestations, including defective saccadic eye movements and gaze, seizures, cognitive deficits, developmental delay, and movement disorders including Parkinson's disease.
Several therapies exist that address the peripheral disease and the principal clinical manifestations in the hematopoietic bone marrow and viscera. These include enzyme replacement therapy as described below, chaperone-like small molecule drugs that bind to defective Gcase and improve its stability, and substrate reduction therapy that blocks the production of the substrates that accumulate in Gaucher disease and lead to symptoms and findings. However, other aspects of Gaucher disease (particularly those affecting the skeleton and brain) appear to be refractory to treatment.
The present disclosure relates, in part, to compositions and methods for treating certain central nervous system (CNS) diseases, for example neurodegenerative diseases (e.g., neurodegenerative diseases listed in Table 2), synucleinopathies (e.g., synucleinopathies listed in Table 3), tauopathies (tauopathies listed in Table 4), or lysosomal storage diseases (e.g., lysosomal storage diseases listed in Table 5).
In addition to Gaucher disease patients (who carry mutations in both chromosomal alleles of the GBA1 gene), patients with a mutation in only one allele of GBA1 are at greatly increased risk of Parkinson's disease (PD). The severity of PD symptoms, which include gait difficulty, tremor at rest, rigidity, and often depression, sleep disturbances, and cognitive decline, correlates with the degree of reduction in enzyme activity. Thus, Gaucher disease patients have the most severe course, whereas patients with a single mild mutation in GBA1 typically have a more benign course. Mutation carriers are also at high risk of other PD-related disorders, including Lewy body dementia, characterized by executive dysfunction, psychosis, and a PD-like movement disorder, and multiple system atrophy, with characteristic motor and cognitive impairments. No therapies exist that alter the inexorable course of these disorders.
Deficiencies in enzymes such as Gcase (e.g., the gene product of the GBA1 gene), as well as common variants in many genes implicated in lysosome function or in the trafficking of macromolecules to the lysosome (e.g., Lysosomal Membrane Protein 1 (LIMP), also referred to as SCARB2), have been associated with increased PD risk and/or risk of Gaucher disease (e.g., neuronopathic Gaucher disease, such as Type 2 Gaucher disease or Type 3 Gaucher disease). The present disclosure is based, in part, on expression constructs (e.g., vectors) encoding one or more genes associated with central nervous system (CNS) diseases, for example Gaucher disease, PD, etc., for example Gcase, GBA2, prosaposin, progranulin, LIMP2, GALC, CTSB, SMPD1, GCH1, RAB7, VPS35, IL-34, TREM2, TMEM106B, or a combination of any of the foregoing (or portions thereof). In some embodiments, combinations of gene products described herein act together (e.g., synergistically) to reduce one or more signs and symptoms of a CNS disease when expressed in a subject.
Accordingly, in some aspects, the present disclosure provides an isolated nucleic acid comprising an expression construct encoding Gcase (e.g., the gene product of the GBA1 gene). In some embodiments, the isolated nucleic acid comprises a Gcase-coding sequence that is codon-optimized (e.g., codon-optimized for expression in mammalian cells, for example human cells). In some embodiments, the nucleic acid sequence encoding Gcase encodes a protein comprising the amino acid sequence set forth in SEQ ID NO: 14 (e.g., as set forth in NCBI Reference Sequence NP_000148.2). In some embodiments, the isolated nucleic acid comprises the sequence set forth in SEQ ID NO: 15. In some embodiments, the expression construct comprises adeno-associated virus (AAV) inverted terminal repeats (ITRs), for example AAV ITRs flanking the nucleic acid sequence encoding the Gcase protein.
In some aspects, the present disclosure provides an isolated nucleic acid comprising an expression construct encoding prosaposin (e.g., the gene product of the PSAP gene). In some embodiments, the isolated nucleic acid comprises a prosaposin-coding sequence that is codon-optimized (e.g., codon-optimized for expression in mammalian cells, for example human cells). In some embodiments, the nucleic acid sequence encoding prosaposin encodes a protein comprising the amino acid sequence set forth in SEQ ID NO: 16 (e.g., as set forth in NCBI Reference Sequence NP_002769.1). In some embodiments, the isolated nucleic acid comprises the sequence set forth in SEQ ID NO: 17. In some embodiments, the expression construct comprises adeno-associated virus (AAV) inverted terminal repeats (ITRs), for example AAV ITRs flanking the nucleic acid sequence encoding the prosaposin protein.
In some aspects, the present disclosure provides an isolated nucleic acid comprising an expression construct encoding LIMP2/SCARB2 (e.g., the gene product of the SCARB2 gene). In some embodiments, the isolated nucleic acid comprises a SCARB2-coding sequence that is codon-optimized (e.g., codon-optimized for expression in mammalian cells, for example human cells). In some embodiments, the nucleic acid sequence encoding LIMP2/SCARB2 encodes a protein comprising the amino acid sequence set forth in SEQ ID NO: 18 (e.g., as set forth in NCBI Reference Sequence NP_005497.1). In some embodiments, the isolated nucleic acid comprises the sequence set forth in SEQ ID NO: 19. In some embodiments, the expression construct comprises adeno-associated virus (AAV) inverted terminal repeats (ITRs), for example AAV ITRs flanking the nucleic acid sequence encoding the SCARB2 protein.
In some aspects, the present disclosure provides an isolated nucleic acid comprising an expression construct encoding a GBA2 protein (e.g., the gene product of the GBA2 gene). In some embodiments, the isolated nucleic acid comprises a GBA2-coding sequence that is codon-optimized (e.g., codon-optimized for expression in mammalian cells, for example human cells). In some embodiments, the nucleic acid sequence encoding GBA2 encodes a protein comprising the amino acid sequence set forth in SEQ ID NO: 30 (e.g., as set forth in NCBI Reference Sequence NP_065995.1). In some embodiments, the isolated nucleic acid comprises the sequence set forth in SEQ ID NO: 31. In some embodiments, the expression construct comprises adeno-associated virus (AAV) inverted terminal repeats (ITRs), for example AAV ITRs flanking the nucleic acid sequence encoding the GBA2 protein.
In some aspects, the present disclosure provides an isolated nucleic acid comprising an expression construct encoding a GALC protein (e.g., the gene product of the GALC gene). In some embodiments, the isolated nucleic acid comprises a GALC-coding sequence that is codon-optimized (e.g., codon-optimized for expression in mammalian cells, for example human cells). In some embodiments, the nucleic acid sequence encoding GALC encodes a protein comprising the amino acid sequence set forth in SEQ ID NO: 33 (e.g., as set forth in NCBI Reference Sequence NP_000144.2). In some embodiments, the isolated nucleic acid comprises the sequence set forth in SEQ ID NO: 34. In some embodiments, the expression construct comprises adeno-associated virus (AAV) inverted terminal repeats (ITRs), for example AAV ITRs flanking the nucleic acid sequence encoding the GALC protein.
In some aspects, the present disclosure provides an isolated nucleic acid comprising an expression construct encoding a CTSB protein (e.g., the gene product of the CTSB gene). In some embodiments, the isolated nucleic acid comprises a CTSB-coding sequence that is codon-optimized (e.g., codon-optimized for expression in mammalian cells, for example human cells). In some embodiments, the nucleic acid sequence encoding CTSB encodes a protein comprising the amino acid sequence set forth in SEQ ID NO: 35 (e.g., as set forth in NCBI Reference Sequence NP_001899.1). In some embodiments, the isolated nucleic acid comprises the sequence set forth in SEQ ID NO: 36. In some embodiments, the expression construct comprises adeno-associated virus (AAV) inverted terminal repeats (ITRs), for example AAV ITRs flanking the nucleic acid sequence encoding the CTSB protein.
In some aspects, the present disclosure provides an isolated nucleic acid comprising an expression construct encoding an SMPD1 protein (e.g., the gene product of the SMPD1 gene). In some embodiments, the isolated nucleic acid comprises an SMPD1-coding sequence that is codon-optimized (e.g., codon-optimized for expression in mammalian cells, for example human cells). In some embodiments, the nucleic acid sequence encoding SMPD1 encodes a protein comprising the amino acid sequence set forth in SEQ ID NO: 37 (e.g., as set forth in NCBI Reference Sequence NP_000534.3). In some embodiments, the isolated nucleic acid comprises the sequence set forth in SEQ ID NO: 38. In some embodiments, the expression construct comprises adeno-associated virus (AAV) inverted terminal repeats (ITRs), for example AAV ITRs flanking the nucleic acid sequence encoding the SMPD1 protein.
In some aspects, the present disclosure provides an isolated nucleic acid comprising an expression construct encoding a GCH1 protein (e.g., the gene product of the GCH1 gene). In some embodiments, the isolated nucleic acid comprises a GCH1-coding sequence that is codon-optimized (e.g., codon-optimized for expression in mammalian cells, for example human cells). In some embodiments, the nucleic acid sequence encoding GCH1 encodes a protein comprising the amino acid sequence set forth in SEQ ID NO: 45 (e.g., as set forth in NCBI Reference Sequence NP_000534.3). In some embodiments, the isolated nucleic acid comprises the sequence set forth in SEQ ID NO: 46. In some embodiments, the expression construct comprises adeno-associated virus (AAV) inverted terminal repeats (ITRs), for example AAV ITRs flanking the nucleic acid sequence encoding the GCH1 protein.
In some aspects, the present disclosure provides an isolated nucleic acid comprising an expression construct encoding a RAB7L protein (e.g., the gene product of the RAB7L gene). In some embodiments, the isolated nucleic acid comprises a RAB7L-coding sequence that is codon-optimized (e.g., codon-optimized for expression in mammalian cells, for example human cells). In some embodiments, the nucleic acid sequence encoding RAB7L encodes a protein comprising the amino acid sequence set forth in SEQ ID NO: 47 (e.g., as set forth in NCBI Reference Sequence NP_003920.1). In some embodiments, the isolated nucleic acid comprises the sequence set forth in SEQ ID NO: 48. In some embodiments, the expression construct comprises adeno-associated virus (AAV) inverted terminal repeats (ITRs), for example AAV ITRs flanking the nucleic acid sequence encoding the RAB7L protein.
In some aspects, the present disclosure provides an isolated nucleic acid comprising an expression construct encoding a VPS35 protein (e.g., the gene product of the VPS35 gene). In some embodiments, the isolated nucleic acid comprises a VPS35-coding sequence that is codon-optimized (e.g., codon-optimized for expression in mammalian cells, for example human cells). In some embodiments, the nucleic acid sequence encoding VPS35 encodes a protein comprising the amino acid sequence set forth in SEQ ID NO: 49 (e.g., as set forth in NCBI Reference Sequence NP_060676.2). In some embodiments, the isolated nucleic acid comprises the sequence set forth in SEQ ID NO: 50. In some embodiments, the expression construct comprises adeno-associated virus (AAV) inverted terminal repeats (ITRs), for example AAV ITRs flanking the nucleic acid sequence encoding the VPS35 protein.
In some aspects, the present disclosure provides an isolated nucleic acid comprising an expression construct encoding an IL-34 protein (e.g., the gene product of the IL34 gene). In some embodiments, the isolated nucleic acid comprises an IL-34-coding sequence that is codon-optimized (e.g., codon-optimized for expression in mammalian cells, for example human cells). In some embodiments, the nucleic acid sequence encoding IL-34 encodes a protein comprising the amino acid sequence set forth in SEQ ID NO: 55 (e.g., as set forth in NCBI Reference Sequence NP_689669.2). In some embodiments, the isolated nucleic acid comprises the sequence set forth in SEQ ID NO: 56. In some embodiments, the expression construct comprises adeno-associated virus (AAV) inverted terminal repeats (ITRs), for example AAV ITRs flanking the nucleic acid sequence encoding the IL-34 protein.
In some aspects, the present disclosure provides an isolated nucleic acid comprising an expression construct encoding a TREM2 protein (e.g., the gene product of the TREM2 gene). In some embodiments, the isolated nucleic acid comprises a TREM2-coding sequence that is codon-optimized (e.g., codon-optimized for expression in mammalian cells, for example human cells). In some embodiments, the nucleic acid sequence encoding TREM2 encodes a protein comprising the amino acid sequence set forth in SEQ ID NO: 57 (e.g., as set forth in NCBI Reference Sequence NP_061838.1). In some embodiments, the isolated nucleic acid comprises the sequence set forth in SEQ ID NO: 58. In some embodiments, the expression construct comprises adeno-associated virus (AAV) inverted terminal repeats (ITRs), for example AAV ITRs flanking the nucleic acid sequence encoding the TREM2 protein.
In some aspects, the present disclosure provides an isolated nucleic acid comprising an expression construct encoding a TMEM106B protein (e.g., the gene product of the TMEM106B gene). In some embodiments, the isolated nucleic acid comprises a TMEM106B-coding sequence that is codon-optimized (e.g., codon-optimized for expression in mammalian cells, for example human cells). In some embodiments, the nucleic acid sequence encoding TMEM106B encodes a protein comprising the amino acid sequence set forth in SEQ ID NO: 63 (e.g., as set forth in NCBI Reference Sequence NP_060844.2). In some embodiments, the isolated nucleic acid comprises the sequence set forth in SEQ ID NO: 64. In some embodiments, the expression construct comprises adeno-associated virus (AAV) inverted terminal repeats (ITRs), for example AAV ITRs flanking the nucleic acid sequence encoding the TMEM106B protein.
In some aspects, the present disclosure provides an isolated nucleic acid comprising an expression construct encoding progranulin (e.g., the gene product of the PGRN gene, also referred to as the GRN gene). In some embodiments, the isolated nucleic acid comprises a progranulin-coding sequence that is codon-optimized (e.g., codon-optimized for expression in mammalian cells, for example human cells). In some embodiments, the nucleic acid sequence encoding progranulin (PGRN, also referred to as GRN) encodes a protein comprising the amino acid sequence set forth in SEQ ID NO: 67 (e.g., as set forth in NCBI Reference Sequence NP_002078.1). In some embodiments, the isolated nucleic acid comprises the sequence set forth in SEQ ID NO: 68. In some embodiments, the expression construct comprises adeno-associated virus (AAV) inverted terminal repeats (ITRs), for example AAV ITRs flanking the nucleic acid sequence encoding the progranulin protein.
Aspects of the present disclosure relate to isolated nucleic acids and expression constructs (e.g., rAAV vectors) encoding one or more inhibitory nucleic acids. In some embodiments, the one or more inhibitory nucleic acids target a gene associated with certain central nervous system (CNS) diseases (e.g., SNCA, TMEM106B, RPS25, or MAPT). In some embodiments, the inhibitory nucleic acid is expressed alone or in combination with one or more gene products described herein (e.g., GBA1, PSAP, PGRN, etc.). In some embodiments, the isolated nucleic acid encodes 1) an inhibitory nucleic acid that targets SNCA, and 2) a GBA1 protein. In some embodiments, the isolated nucleic acid encodes 1) an inhibitory nucleic acid that targets SNCA, and 2) a PSAP protein. In some embodiments, the isolated nucleic acid encodes 1) an inhibitory nucleic acid that targets SNCA, and 2) a PGRN protein (e.g., a GRN protein). In some embodiments, the isolated nucleic acid encodes 1) an inhibitory nucleic acid that targets MAPT, and 2) a GBA1 protein. In some embodiments, the isolated nucleic acid encodes 1) an inhibitory nucleic acid that targets MAPT, and 2) a PSAP protein. In some embodiments, the isolated nucleic acid encodes 1) an inhibitory nucleic acid that targets MAPT, and 2) a PGRN protein (e.g., a GRN protein). In some embodiments, the isolated nucleic acid encodes 1) an inhibitory nucleic acid that targets TMEM106B, and 2) a GBA1 protein. In some embodiments, the isolated nucleic acid encodes 1) an inhibitory nucleic acid that targets TMEM106B, and 2) a PSAP protein. In some embodiments, the isolated nucleic acid encodes 1) an inhibitory nucleic acid that targets TMEM106B, and 2) a PGRN protein (e.g., a GRN protein). In some embodiments, the isolated nucleic acid encodes 1) an inhibitory nucleic acid that targets RPS25, and 2) a GBA1 protein. In some embodiments, the isolated nucleic acid encodes 1) an inhibitory nucleic acid that targets RPS25, and 2) a PSAP protein. In some embodiments, the isolated nucleic acid encodes 1) an inhibitory nucleic acid that targets RPS25, and 2) a PGRN protein (e.g., a GRN protein).
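The twelve pairings enumerated above amount to the full cross product of four inhibitory-nucleic-acid targets and three protein payloads. As an illustrative aid only (not part of the disclosure), the following minimal Python sketch reproduces that enumeration; the names `INHIBITORY_TARGETS`, `PROTEIN_PAYLOADS`, and `pairings` are hypothetical and chosen here for readability.

```python
from itertools import product

# Gene targets of the inhibitory nucleic acid, in the order listed in the text.
INHIBITORY_TARGETS = ("SNCA", "MAPT", "TMEM106B", "RPS25")

# Proteins co-encoded with the inhibitory nucleic acid, in the order listed.
PROTEIN_PAYLOADS = ("GBA1", "PSAP", "PGRN")


def pairings():
    """Return every (inhibitory target, encoded protein) pair as a list of tuples."""
    return list(product(INHIBITORY_TARGETS, PROTEIN_PAYLOADS))


if __name__ == "__main__":
    for target, protein in pairings():
        print(f"inhibitory nucleic acid targeting {target} + {protein} protein")
```

Each tuple corresponds to one "In some embodiments" sentence of the paragraph above.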
In some aspects, the present disclosure provides an isolated nucleic acid comprising an expression construct encoding an inhibitory nucleic acid that inhibits the expression or activity of α-Syn, flanked by AAV inverted terminal repeats (ITRs). In some embodiments, the inhibitory nucleic acid is complementary to at least 6 contiguous nucleotides of the sequence set forth in SEQ ID NO: 90. In some embodiments, the inhibitory nucleic acid is an inhibitory RNA comprising a nucleic acid sequence set forth in any one of SEQ ID NOs: 20-25. In some embodiments, the inhibitory nucleic acid comprises a sequence set forth in any one of SEQ ID NOs: 94-99.
In some aspects, the present disclosure provides an isolated nucleic acid comprising an expression construct encoding an inhibitory nucleic acid that inhibits the expression or activity of TMEM106B, flanked by AAV inverted terminal repeats (ITRs). In some embodiments, the inhibitory nucleic acid is complementary to at least 6 contiguous nucleotides of the sequence set forth in SEQ ID NO: 91. In some embodiments, the inhibitory nucleic acid is an inhibitory RNA comprising the nucleic acid sequence set forth in SEQ ID NO: 92 or 93. In some embodiments, the inhibitory nucleic acid comprises the sequence set forth in SEQ ID NO: 65 or 66.
In some aspects, the present disclosure provides an isolated nucleic acid comprising an expression construct encoding an inhibitory nucleic acid that inhibits the expression or activity of MAPT, flanked by AAV inverted terminal repeats (ITRs). In some embodiments, the inhibitory nucleic acid is complementary to at least 6 contiguous nucleotides of the sequence set forth in SEQ ID NO: 114. In some embodiments, the inhibitory nucleic acid is an inhibitory RNA comprising the nucleic acid sequence set forth in SEQ ID NO: 123, 124, 127, 128, 131, 132, 135, or 136. In some embodiments, the inhibitory nucleic acid comprises the sequence set forth in SEQ ID NO: 125, 126, 129, 130, 133, 134, 137, or 138.
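The criterion "complementary to at least 6 contiguous nucleotides" of a target sequence can be read as a simple string test: take the reverse complement of the inhibitory sequence and look for a shared stretch of the required length in the target. The sketch below is one possible reading under stated assumptions: perfect Watson-Crick pairing only (no G:U wobble or mismatches), the target written 5'→3' as DNA or RNA, and function names (`reverse_complement`, `longest_complementary_run`, `is_complementary`) that are hypothetical. The sequences in the usage lines are toy strings, not any SEQ ID NO of the disclosure.

```python
# Map each base to its Watson-Crick partner, reporting the result as DNA.
_COMPLEMENT = str.maketrans("ACGTU", "TGCAA")


def reverse_complement(seq: str) -> str:
    """Reverse complement of a DNA or RNA sequence, returned in the DNA alphabet."""
    return seq.upper().translate(_COMPLEMENT)[::-1]


def longest_complementary_run(inhibitor: str, target: str) -> int:
    """Length of the longest contiguous stretch of `target` that is perfectly
    complementary (antiparallel) to a contiguous stretch of `inhibitor`."""
    probe = reverse_complement(inhibitor)   # the target-sense sequence it pairs with
    tgt = target.upper().replace("U", "T")  # normalise RNA targets to DNA letters
    best = 0
    prev = [0] * (len(tgt) + 1)
    # Classic longest-common-substring dynamic programme over probe x target.
    for base in probe:
        cur = [0] * (len(tgt) + 1)
        for j, other in enumerate(tgt, start=1):
            if base == other:
                cur[j] = prev[j - 1] + 1
                best = max(best, cur[j])
        prev = cur
    return best


def is_complementary(inhibitor: str, target: str, min_contiguous: int = 6) -> bool:
    """True if the inhibitor pairs with at least `min_contiguous` contiguous target bases."""
    return longest_complementary_run(inhibitor, target) >= min_contiguous


if __name__ == "__main__":
    toy_target = "ATGGATGTATTCATGAAAGG"   # toy sequence for illustration only
    toy_inhibitor = "AUGAAUAC"            # toy RNA, antisense to GTATTCAT
    print(is_complementary(toy_inhibitor, toy_target))
```

A real design workflow would also weigh off-target matches and strand selection, which this sketch deliberately ignores.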
In some aspects, the present disclosure provides an isolated nucleic acid comprising an expression construct encoding a first gene product and a second gene product, wherein each gene product is independently selected from a gene product set forth in Table 1, or a portion thereof, and an inhibitory nucleic acid targeting a gene or gene product set forth in Table 1. In some embodiments, the first gene product is a protein and the second gene product is a protein. In some embodiments, the first gene product is an inhibitory nucleic acid and the second gene product is a protein. In some embodiments, the first gene product is an inhibitory nucleic acid and the second gene product is an inhibitory nucleic acid.
In some embodiments, the first gene product is a Gcase protein or a portion thereof. In some embodiments, the second gene product is an inhibitory nucleic acid that targets SNCA. In some embodiments, the interfering nucleic acid is an siRNA, shRNA, miRNA, or dsRNA, optionally wherein the interfering nucleic acid inhibits expression of α-Syn protein. In some embodiments, the isolated nucleic acid further comprises one or more promoters, optionally wherein each of the one or more promoters is independently a chicken-beta actin (CBA) promoter, a CAG promoter, a CD68 promoter, or a JeT promoter. In some embodiments, the isolated nucleic acid further comprises an internal ribosome entry site (IRES), optionally wherein the IRES is located between the first gene product and the second gene product. In some embodiments, the isolated nucleic acid further comprises a self-cleaving peptide coding sequence, optionally wherein the self-cleaving peptide is T2A. In some embodiments, the expression construct comprises two adeno-associated virus (AAV) inverted terminal repeat (ITR) sequences flanking the first gene product and the second gene product.
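The elements named in the preceding paragraph (two flanking ITRs, a promoter, two gene products, and an optional IRES or T2A element between them) can be pictured as an ordered cassette. The sketch below is only an illustration under simplifying assumptions: a single promoter placed upstream of both gene products, and an element order that the text does not fix beyond the ITRs flanking the gene products and the IRES sitting between them. The class name `BicistronicConstruct` and its fields are hypothetical.

```python
from dataclasses import dataclass
from typing import List, Optional

# Promoters and linker elements named in the text.
ALLOWED_PROMOTERS = {"CBA", "CAG", "CD68", "JeT"}
ALLOWED_LINKERS = {"IRES", "T2A"}


@dataclass(frozen=True)
class BicistronicConstruct:
    """Toy model of an ITR-flanked construct encoding two gene products."""
    promoter: str
    first_product: str
    second_product: str
    linker: Optional[str] = None  # IRES or T2A between the two products, if any

    def __post_init__(self) -> None:
        if self.promoter not in ALLOWED_PROMOTERS:
            raise ValueError(f"unsupported promoter: {self.promoter}")
        if self.linker is not None and self.linker not in ALLOWED_LINKERS:
            raise ValueError(f"unsupported linker: {self.linker}")

    def layout(self) -> List[str]:
        """Ordered element names, 5' to 3', under the assumptions stated above."""
        elements = [f"{self.promoter} promoter", self.first_product]
        if self.linker is not None:
            elements.append(self.linker)
        elements.append(self.second_product)
        return ["5' ITR", *elements, "3' ITR"]


if __name__ == "__main__":
    construct = BicistronicConstruct("CBA", "Gcase", "prosaposin", linker="T2A")
    print(" - ".join(construct.layout()))
```

Constructs with more than one promoter, which the text also allows, would need a list of promoters rather than the single field used here.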
In some aspects, the present disclosure provides an isolated nucleic acid comprising an expression construct encoding a first gene product and a second gene product, wherein each gene product is independently selected from a gene product set forth in Table 1, or a portion thereof, and an inhibitory nucleic acid targeting a gene or gene product set forth in Table 1.
In some embodiments, the first gene product or the second gene product is a Gcase protein or a portion thereof. In some embodiments, the first gene product is a Gcase protein and the second gene product is selected from GBA2, prosaposin, progranulin, LIMP2, GALC, CTSB, SMPD1, GCH1, RAB7, VPS35, IL-34, TREM2, and TMEM106B.
In some embodiments, the expression construct encodes (e.g., alone or in addition to another gene product) an interfering nucleic acid (e.g., shRNA, miRNA, dsRNA, etc.). In some embodiments, the interfering nucleic acid inhibits expression of α-synuclein (α-Syn). In some embodiments, the expression construct encodes an inhibitory nucleic acid that targets SNCA and further encodes one or more gene products selected from GBA1, GBA2, PSAP, PGRN, LIMP2, GALC, CTSB, SMPD1, GCH1, RAB7, VPS35, IL-34, TREM2, and TMEM106B. In some embodiments, the interfering nucleic acid targeting α-synuclein comprises a sequence set forth in any one of SEQ ID NOs: 20-25. In some embodiments, the interfering nucleic acid targeting α-synuclein binds (e.g., hybridizes) to a sequence set forth in any one of SEQ ID NOs: 20-25.
In some embodiments, the interfering nucleic acid inhibits expression of TMEM106B. In some embodiments, the expression construct encodes an inhibitory nucleic acid that targets TMEM106B and further encodes one or more gene products selected from GBA1, GBA2, PSAP, PGRN, LIMP2, GALC, CTSB, SMPD1, GCH1, RAB7, VPS35, IL-34, and TREM2. In some embodiments, the interfering nucleic acid targeting TMEM106B comprises the sequence set forth in SEQ ID NO: 65 or 66. In some embodiments, the interfering nucleic acid targeting TMEM106B binds (e.g., hybridizes) to the sequence set forth in SEQ ID NO: 65 or 66.
일부 실시양태에서, 간섭 핵산은 MAPT의 발현을 억제한다. 일부 실시양태에서, 발현 구축물은 MAPT를 표적화하는 억제성 핵산을 코딩하고, GBA1, GBA2, PSAP, PRGN, LIMP2, GALC, CTSB, SMPD1, GCH1, RAB7, VPS35, IL-34, TREM2, 및 TMEM106B로부터 선택된 하나 이상의 유전자 산물을 코딩한다. 일부 실시양태에서, MAPT를 표적화하는 간섭 핵산은 서열식별번호: 123-138 중 어느 하나에 제시된 서열을 포함한다. 일부 실시양태에서, MAPT를 표적화하는 간섭 핵산은 서열식별번호: 123-138 중 어느 하나에 제시된 서열과 결합 (예를 들어, 혼성화)한다.In some embodiments, the interfering nucleic acid inhibits expression of MAPT. In some embodiments, the expression construct is from encoding the inhibitory nucleic acids targeting the MAPT and, GBA1, GBA2, PSAP, PRGN, LIMP2, GALC, CTSB, SMPD1, GCH1, RAB7, VPS35, IL-34, TREM2, and TMEM106B encodes one or more selected gene products. In some embodiments, the interfering nucleic acid targeting MAPT comprises a sequence set forth in any one of SEQ ID NOs: 123-138. In some embodiments, the interfering nucleic acid targeting MAPT binds (eg, hybridizes) to a sequence set forth in any one of SEQ ID NOs: 123-138.
일부 실시양태에서, 간섭 핵산은 RPS25의 발현을 억제한다. 일부 실시양태에서, 발현 구축물은 RPS25를 표적화하는 억제성 핵산을 코딩하고, GBA1, GBA2, PSAP, PRGN, LIMP2, GALC, CTSB, SMPD1, GCH1, RAB7, VPS35, IL-34, TREM2, 및 TMEM106B로부터 선택된 하나 이상의 유전자 산물을 코딩한다. 일부 실시양태에서, RPS25를 표적화하는 간섭 핵산은 서열식별번호: 115-122 중 어느 하나에 제시된 서열을 포함한다. 일부 실시양태에서, RPS25를 표적화하는 간섭 핵산은 서열식별번호: 115-122 중 어느 하나에 제시된 서열과 결합 (예를 들어, 혼성화)한다. 일부 실시양태에서, 발현 구축물은 하나 이상의 프로모터를 추가로 포함한다. 일부 실시양태에서, 프로모터는 치킨-베타 액틴 (CBA) 프로모터, CAG 프로모터, CD68 프로모터, 또는 JeT 프로모터이다. 일부 실시양태에서, 프로모터는 RNA pol II 프로모터 (예를 들어, 또는 RNA pol III 프로모터 (예를 들어, U6 등)이다.In some embodiments, the interfering nucleic acid inhibits expression of RPS25. In some embodiments, the expression construct encodes an inhibitory nucleic acid that targets RPS25 and is from GBA1, GBA2, PSAP, PRGN, LIMP2, GALC, CTSB, SMPD1, GCH1, RAB7, VPS35, IL-34, TREM2, and TMEM106B. encodes one or more selected gene products. In some embodiments, the interfering nucleic acid targeting RPS25 comprises a sequence set forth in any one of SEQ ID NOs: 115-122. In some embodiments, the interfering nucleic acid targeting RPS25 binds (eg, hybridizes) to a sequence set forth in any one of SEQ ID NOs: 115-122. In some embodiments, the expression construct further comprises one or more promoters. In some embodiments, the promoter is a chicken-beta actin (CBA) promoter, a CAG promoter, a CD68 promoter, or a JeT promoter. In some embodiments, the promoter is an RNA pol II promoter (eg, or an RNA pol III promoter (eg, U6, etc.).
In some embodiments, the expression construct further comprises an internal ribosome entry site (IRES). In some embodiments, the IRES is located between the first gene product and the second gene product.
In some embodiments, the expression construct further comprises a self-cleaving peptide coding sequence. In some embodiments, the self-cleaving peptide is a T2A peptide.
In some embodiments, the expression construct comprises two adeno-associated virus (AAV) inverted terminal repeat (ITR) sequences. In some embodiments, the ITR sequences flank the first gene product and the second gene product (e.g., arranged from the 5'-end to the 3'-end as follows: ITR-first gene product-second gene product-ITR). In some embodiments, one of the ITR sequences of the isolated nucleic acid lacks a functional terminal resolution site (trs). For example, in some embodiments, one of the ITRs is a ΔITR.
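The ITR-flanked arrangement described above can be sketched as a small data structure. The following Python sketch is illustrative only: the `Element` class, the `is_itr_flanked` helper, and the element labels are hypothetical names introduced here and are not part of the disclosure. The check covers only the ordering of elements (ITR-first gene product-second gene product-ITR), not any sequence-level property.

```python
from dataclasses import dataclass
from typing import List


@dataclass(frozen=True)
class Element:
    """One ordered element of a construct (hypothetical model)."""
    kind: str  # e.g. "ITR", "promoter", "gene_product", "IRES"
    name: str


def is_itr_flanked(elements: List[Element]) -> bool:
    """Return True if the first and last elements are ITRs and no ITR lies between them."""
    if len(elements) < 3:
        return False
    inner = elements[1:-1]
    return (
        elements[0].kind == "ITR"
        and elements[-1].kind == "ITR"
        and all(e.kind != "ITR" for e in inner)
    )


# 5'-to-3' ordering taken from the example layout in the text.
construct = [
    Element("ITR", "ITR"),
    Element("gene_product", "first gene product"),
    Element("gene_product", "second gene product"),
    Element("ITR", "ITR"),
]

layout = "-".join(e.name for e in construct)
print(layout)
print(is_itr_flanked(construct))
```

Additional elements mentioned in the text (a promoter, an IRES, or a self-cleaving peptide coding sequence) could be modeled as further `Element` entries between the two ITRs.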
The present disclosure relates, in some aspects, to rAAV vectors comprising an ITR having a modified "D" region (e.g., a D sequence that is modified relative to the wild-type AAV2 ITR, SEQ ID NO: 29). In some embodiments, the ITR having the modified D region is the 5' ITR of the rAAV vector. In some embodiments, the modified "D" region comprises an "S" sequence, for example as set forth in SEQ ID NO: 26. In some embodiments, the ITR having the modified "D" region is the 3' ITR of the rAAV vector. In some embodiments, the modified "D" region comprises a 3' ITR in which the "D" region is positioned at the 3' end of the ITR (e.g., on the outside or terminal end of the ITR relative to the transgene insert of the vector). In some embodiments, the modified "D" region comprises a sequence as set forth in SEQ ID NO: 26 or 27.
In some embodiments, the isolated nucleic acid (e.g., an rAAV vector) comprises a TRY region. In some embodiments, the TRY region comprises the sequence set forth in SEQ ID NO: 28.
In some embodiments, an isolated nucleic acid described by the present disclosure comprises, consists of, or encodes a peptide having the sequence set forth in any one of SEQ ID NOs: 1-149.
In some aspects, the present disclosure provides a vector comprising an isolated nucleic acid as described by the present disclosure. In some embodiments, the vector is a plasmid or a viral vector. In some embodiments, the viral vector is a recombinant AAV (rAAV) vector or a baculovirus vector. In some embodiments, the rAAV vector is single-stranded (e.g., single-stranded DNA).
In some aspects, the present disclosure provides a host cell comprising an isolated nucleic acid as described by the present disclosure or a vector as described by the present disclosure.
In some aspects, the present disclosure provides a recombinant adeno-associated virus (rAAV) comprising a capsid protein and an isolated nucleic acid or a vector as described by the present disclosure.
In some embodiments, the capsid protein is capable of crossing the blood-brain barrier, for example an AAV9 capsid protein or an AAVrh.10 capsid protein. In some embodiments, the rAAV transduces neuronal cells and non-neuronal cells of the central nervous system (CNS).
In some aspects, the present disclosure provides a method of treating a subject having or suspected of having a central nervous system (CNS) disease, the method comprising administering to the subject a composition as described by the present disclosure (e.g., a composition comprising an isolated nucleic acid or a vector or an rAAV). In some embodiments, the CNS disease is a neurodegenerative disease, such as a neurodegenerative disease listed in Table 2. In some embodiments, the CNS disease is a synucleinopathy, such as a synucleinopathy listed in Table 3. In some embodiments, the CNS disease is a tauopathy, such as a tauopathy listed in Table 4. In some embodiments, the CNS disease is a lysosomal storage disease, such as a lysosomal storage disease listed in Table 5. In some embodiments, the lysosomal storage disease is neuronopathic Gaucher disease, such as Type 2 Gaucher disease or Type 3 Gaucher disease.
In some embodiments, the present disclosure relates to a method of treating a disease selected from Parkinson's disease (e.g., Parkinson's disease with GBA1 mutation (PD-GBA), sporadic Parkinson's disease (sPD)), Gaucher disease (e.g., neuronopathic Gaucher disease (nGD), Type I Gaucher disease (T1GD), Type II Gaucher disease (T2GD), and Type III Gaucher disease (T3GD)), dementia with Lewy bodies (DLB), amyotrophic lateral sclerosis (ALS), and Niemann-Pick Type C disease (NPC) by administering an isolated nucleic acid encoding GBA1 (e.g., an rAAV vector or rAAV comprising the isolated nucleic acid) to a subject in need thereof.
In some embodiments, the present disclosure relates to a method of treating frontotemporal dementia (e.g., frontotemporal dementia with GRN mutation (FTD-GRN), frontotemporal dementia with MAPT mutation (FTD-Tau), and frontotemporal dementia with C9ORF72 mutation (FTD-C9orf72)), Parkinson's disease (PD), Alzheimer's disease (AD), neuronal ceroid lipofuscinosis (NCL), corticobasal degeneration (CBD), motor neuron disease (MND), or Gaucher disease (GD) by administering an isolated nucleic acid encoding PGRN (e.g., GRN) (e.g., an rAAV vector or rAAV comprising the isolated nucleic acid) to a subject in need thereof.
In some embodiments, the present disclosure relates to a method of treating a synucleinopathy (e.g., multiple system atrophy (MSA), Parkinson's disease (PD), Parkinson's disease with GBA1 mutation (PD-GBA), dementia with Lewy bodies (DLB), dementia with Lewy bodies with GBA1 mutation, and Lewy body disease) by administering an isolated nucleic acid encoding a GBA1 gene product (e.g., an rAAV vector or rAAV comprising the isolated nucleic acid), and an inhibitory nucleic acid targeting SNCA, to a subject in need thereof.
In some embodiments, the present disclosure relates to a method of treating a disease selected from Parkinson's disease (PD), frontotemporal dementia (e.g., frontotemporal dementia with GRN mutation (FTD-GRN)), lysosomal storage disease (LSD), or Gaucher disease (GD) by administering an isolated nucleic acid encoding PSAP (e.g., an rAAV vector or rAAV comprising the isolated nucleic acid) to a subject in need thereof.
In some embodiments, the present disclosure relates to a method of treating Alzheimer's disease (AD), Nasu-Hakola disease (NHD), or Parkinson's disease (PD) by administering an isolated nucleic acid encoding TREM2 (e.g., an rAAV vector or rAAV comprising the isolated nucleic acid) to a subject in need thereof.
In some embodiments, the present disclosure relates to a method of treating Alzheimer's disease (AD) or frontotemporal dementia (frontotemporal dementia with MAPT mutation (FTD-Tau)), progressive supranuclear palsy (PSP), a neurodegenerative disease, Lewy body disease (LBD), or Parkinson's disease by administering an isolated nucleic acid encoding an inhibitory nucleic acid targeting MAPT (e.g., an rAAV vector or rAAV comprising the isolated nucleic acid) to a subject in need thereof.
In some aspects, the present disclosure provides a method of treating a subject having or suspected of having Parkinson's disease, the method comprising administering to the subject a composition as described by the present disclosure (e.g., a composition comprising an isolated nucleic acid or a vector or an rAAV).
In some embodiments, a composition comprises a nucleic acid (e.g., an rAAV genome, for example encapsidated by AAV capsid proteins) that encodes two or more gene products described in this application (e.g., CNS disease-associated gene products), for example 2, 3, 4, 5, or more gene products. In some embodiments, a composition comprises two or more (e.g., 2, 3, 4, 5, or more) different nucleic acids (e.g., two or more rAAV genomes, for example separately encapsidated by AAV capsid proteins), each encoding one or more different gene products. In some embodiments, two or more different compositions are administered to a subject, each composition comprising one or more nucleic acids encoding a different gene product. In some embodiments, the different gene products are operably linked to the same promoter type (e.g., the same promoter). In some embodiments, the different gene products are operably linked to different promoters.
In some embodiments, the administration comprises direct injection into the CNS of the subject. In some embodiments, the direct injection is intracerebral injection, intraparenchymal injection, intrathecal injection, intra-cisterna magna injection, or any combination thereof. In some embodiments, the direct injection into the CNS of the subject comprises convection-enhanced delivery (CED).
In some embodiments, the administration comprises peripheral injection. In some embodiments, the peripheral injection is intravenous injection.
In some aspects, the present disclosure provides a method of treating a subject having or suspected of having a central nervous system (CNS) disease, the method comprising administering to the subject an isolated nucleic acid comprising: (i) an expression construct comprising a transgene encoding one or more gene products listed in Table 1 or an inhibitory nucleic acid targeting a gene or gene product set forth in Table 1; and (ii) two adeno-associated virus (AAV) inverted terminal repeats (ITRs) flanking the expression construct. In some aspects, the present disclosure provides a method of treating a subject having or suspected of having a central nervous system (CNS) disease, the method comprising administering to the subject two or more types of isolated nucleic acids encoding different gene products, wherein each type of isolated nucleic acid comprises: (i) an expression construct comprising a transgene encoding one or more gene products listed in Table 1 or an inhibitory nucleic acid targeting a gene or gene product set forth in Table 1; and (ii) two adeno-associated virus (AAV) inverted terminal repeats (ITRs) flanking the expression construct.
In some embodiments, the transgene encodes one or more proteins selected from GBA1, GBA2, PGRN (e.g., GRN), TREM2, PSAP, SCARB2, GALC, SMPD1, CTSB, RAB7L, VPS35, GCH1, and IL34. In some embodiments, the transgene encoding the one or more gene products comprises a codon-optimized protein coding sequence. In some embodiments, the transgene encodes one or more inhibitory nucleic acids targeting SNCA, MAPT, RPS25, and/or TMEM106B.
In some embodiments, the AAV ITRs are AAV2 ITRs.
In some embodiments, the isolated nucleic acid is packaged into a recombinant adeno-associated virus (rAAV). In some embodiments, the rAAV comprises an AAV9 capsid protein.
In some embodiments, the subject is a mammal. In some embodiments, the subject is a human. In some embodiments, the CNS disease is a neurodegenerative disease, a synucleinopathy, a tauopathy, and/or a lysosomal storage disease (LSD). In some embodiments, the CNS disease is listed in Table 2, Table 3, Table 4, or Table 5.
In some embodiments, the administration comprises direct injection into the CNS of the subject, optionally wherein the direct injection is intracerebral injection, intraparenchymal injection, intrathecal injection, intra-cisterna magna injection, or any combination thereof. In some embodiments, the intra-cisterna magna injection is a suboccipital injection into the cisterna magna. In some embodiments, the direct injection into the CNS of the subject comprises convection-enhanced delivery (CED). In some embodiments, the administration comprises peripheral injection, optionally wherein the peripheral injection is intravenous injection. In some embodiments, the subject is administered about 1 x 10^10 vg to about 1 x 10^16 vg of the rAAV.
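As a minimal sketch of the dose range stated above, reading it as about 1 x 10^10 to about 1 x 10^16 vector genomes (vg), a planned total dose could be checked against the range as follows. The constant and function names are hypothetical, and because the text qualifies both bounds with "about", the hard cut-offs used here are a simplifying assumption.

```python
# Bounds of the stated range, in vector genomes (vg).
# Treating them as exact limits is an assumption of this sketch.
MIN_DOSE_VG = 1e10
MAX_DOSE_VG = 1e16


def dose_in_stated_range(dose_vg: float,
                         lower: float = MIN_DOSE_VG,
                         upper: float = MAX_DOSE_VG) -> bool:
    """Return True if dose_vg lies within [lower, upper], inclusive."""
    if dose_vg <= 0:
        raise ValueError("dose must be a positive number of vector genomes")
    return lower <= dose_vg <= upper


# Hypothetical example doses, written in the same e-notation the figure legends use.
for dose in (1e9, 3.2e10, 1e17):
    print(f"{dose:.1e} vg in range: {dose_in_stated_range(dose)}")
```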
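FIG. 1 is a schematic diagram depicting one embodiment of a vector comprising an expression construct encoding Gcase (e.g., GBA1 or a portion thereof).
FIG. 2 is a schematic diagram depicting one embodiment of a vector comprising an expression construct encoding Gcase (e.g., GBA1 or a portion thereof) and LIMP2 (SCARB2) or a portion thereof. The coding sequences of Gcase and LIMP2 are separated by an internal ribosome entry site (IRES).
FIG. 3 is a schematic diagram depicting one embodiment of a vector comprising an expression construct encoding Gcase (e.g., GBA1 or a portion thereof) and LIMP2 (SCARB2) or a portion thereof. Expression of the coding sequences of Gcase and LIMP2 is each driven by a separate promoter.
FIG. 4 is a schematic diagram depicting one embodiment of a vector comprising an expression construct encoding Gcase (e.g., GBA1 or a portion thereof), LIMP2 (SCARB2) or a portion thereof, and an interfering RNA for α-Syn.
FIG. 5 is a schematic diagram depicting one embodiment of a vector comprising an expression construct encoding Gcase (e.g., GBA1 or a portion thereof), prosaposin (e.g., PSAP or a portion thereof), and an interfering RNA for α-Syn.
FIG. 6 is a schematic diagram depicting one embodiment of a vector comprising an expression construct encoding Gcase (e.g., GBA1 or a portion thereof) and prosaposin (e.g., PSAP or a portion thereof). The coding sequences of Gcase and prosaposin are separated by an internal ribosome entry site (IRES).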
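FIG. 7 is a schematic diagram depicting one embodiment of a vector comprising an expression construct encoding Gcase (e.g., GBA1 or a portion thereof). In this embodiment, the vector comprises a CBA promoter element (CBA) consisting of four parts, namely a CMV enhancer (CMVe), a CBA promoter (CBAp), Exon 1, and an intron (int), to constitutively express the codon-optimized coding sequence of human GBA1. The 3' region also contains a WPRE regulatory element followed by a bGH polyA tail. Three transcriptional regulatory activation sites are included at the 5' end of the promoter region: TATA, RBS, and YY1. The flanking ITRs allow for correct packaging of the intervening sequences. Two variants of the 5' ITR sequence (inset box) were evaluated; they differ by several nucleotides within the 20-nucleotide "D" region of the wild-type AAV2 ITR. In some embodiments, the rAAV vector contains the "D" domain nucleotide sequence shown on the top line. In some embodiments, the rAAV vector comprises a mutant "D" domain (e.g., an "S" domain, with the nucleotide changes shown on the bottom line).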
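FIG. 8 is a schematic diagram depicting one embodiment of the vector described in FIG. 6.
FIG. 9 shows representative data for delivery of an rAAV comprising a transgene encoding Gcase (e.g., GBA1 or a portion thereof) in a CBE mouse model of Parkinson's disease. Daily IP delivery of PBS vehicle, 25 mg/kg CBE, 37.5 mg/kg CBE, or 50 mg/kg CBE (left to right) was initiated at P8. Survival (top left) was checked twice a day, and weight (top right) was checked daily. All groups started with n = 8. Behavior was assessed by total distance traveled in Open Field at P23 (bottom left) and latency to fall on Rotarod at P24 (bottom middle). Levels of the GCase substrates were analyzed in the cortex of mice in the PBS and 25 mg/kg CBE treatment groups, both with (Day 3) and without (Day 1) CBE withdrawal. Aggregate GluSph and GalSph levels (bottom right) are shown as pmol per mg wet weight of tissue. Means are presented. Error bars are SEM. *p<0.05; **p<0.01; ***p<0.001, nominal p-values for treatment groups by linear regression.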
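The figure legends report group means with error bars showing the standard error of the mean (SEM). As a minimal sketch, assuming the conventional definition SEM = sample standard deviation / sqrt(n), the summary could be computed as follows. The function name and the example values are hypothetical and are not data from the disclosure.

```python
import math
import statistics
from typing import Sequence, Tuple


def mean_and_sem(values: Sequence[float]) -> Tuple[float, float]:
    """Return (mean, SEM), where SEM = sample standard deviation / sqrt(n)."""
    n = len(values)
    if n < 2:
        raise ValueError("at least two observations are needed to estimate the SEM")
    mean = statistics.mean(values)
    sem = statistics.stdev(values) / math.sqrt(n)  # stdev uses the n - 1 denominator
    return mean, sem


# Hypothetical per-animal measurements for one treatment group.
group_values = [1.0, 2.0, 3.0, 4.0]
group_mean, group_sem = mean_and_sem(group_values)
print(f"mean = {group_mean:.3f}, SEM = {group_sem:.3f}")
```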
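FIG. 10 is a schematic diagram depicting one embodiment of a study design for the maximal rAAV dose in a CBE mouse model. Briefly, rAAV was delivered by ICV injection at P3, and daily CBE treatment was initiated at P8. Behavior was assessed in the Open Field and Rotarod assays at P24-25, and substrate levels were measured at P36 and P38.
FIG. 11 shows representative data for in-life assessment of the maximal rAAV dose in a CBE mouse model. At P3, mice were treated with either excipient or 8.8e9 vg rAAV-GBA1 via ICV delivery. Daily IP delivery of either PBS or 25 mg/kg CBE was initiated at P8. At the end of the study, half of the mice were sacrificed at P36 (Day 1), one day after their last CBE dose, and the remaining half went through 3 days of CBE withdrawal before sacrifice at P38 (Day 3). All treatment groups (excipient + PBS n = 8, rAAV-GBA1 + PBS n = 7, excipient + CBE n = 8, and variant + CBE n = 9) were weighed daily (top left), and the weight at P36 was analyzed (top right). Behavior was assessed by total distance traveled in Open Field at P23 (bottom left) and latency to fall on Rotarod at P24 (bottom right), evaluated for each animal as the median across 3 trials. Due to lethality, n = 7 for the excipient + CBE group in the behavioral assays, and n = 8 for all other groups. Means across animals are presented. Error bars are SEM. *p<0.05; ***p<0.001, nominal p-values for treatment groups by linear regression in the CBE-treated animals.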
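FIG. 12 shows representative data for biochemical assessment of the maximal rAAV dose in a CBE mouse model. The cortex of all treatment groups (excipient + PBS n = 8, variant + PBS n = 7, excipient + CBE n = 7, and variant + CBE n = 9) was used to measure GCase activity (top left), GluSph levels (top right), GluCer levels (bottom left), and vector genomes (bottom right) in the groups before (Day 1) or after (Day 3) CBE withdrawal. Biodistribution is shown as vector genomes per 1 μg of genomic DNA. Means are presented. Error bars are SEM. (*)p<0.1; **p<0.01; ***p<0.001, nominal p-values for treatment groups by linear regression in the CBE-treated animals, with collection day and gender corrected for as covariates.
FIG. 13 shows representative data for behavioral and biochemical correlations in a CBE mouse model after administration of the excipient + PBS, excipient + CBE, and variant + CBE treatments. Across treatment groups, performance on Rotarod was negatively correlated with GluCer accumulation (a, p=0.0012 by linear regression), and GluSph accumulation was negatively correlated with increased GCase activity (b, p=0.0086 by linear regression).
FIG. 14 shows representative data for biodistribution of the variant in a CBE mouse model. Presence of vector genomes was assessed in the liver, spleen, kidney, and gonads for all treatment groups (excipient + PBS n = 8, variant + PBS n = 7, excipient + CBE n = 7, and variant + CBE n = 9). Biodistribution is shown as vector genomes per 1 μg of genomic DNA. Vector genome presence was quantified by quantitative PCR using a vector reference standard curve; genomic DNA concentration was evaluated by A260 optical density measurement. Means are presented. Error bars are SEM. *p<0.05; **p<0.01; ***p<0.001, nominal p-values for treatment groups by linear regression in the CBE-treated animals, with collection day and gender corrected for as covariates.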
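FIG. 15 shows representative data for in-life assessment of rAAV dose ranging in a CBE mouse model. Mice received excipient or one of three different doses of rAAV-GBA1 by ICV delivery at P3: 3.2e9 vg, 1.0e10 vg, or 3.2e10 vg. At P8, daily IP treatment with 25 mg/kg CBE was initiated. Mice that received excipient and CBE or excipient and PBS served as controls. All treatment groups started with n = 10 (5M/5F) per group. All mice were sacrificed one day after their final CBE dose (P38-P40). All treatment groups were weighed daily, and their weight at P36 was analyzed. Motor performance was assessed by latency to fall on Rotarod at P24 and latency to traverse the Tapered Beam at P30. Due to early lethality, the number of mice participating in the behavioral assays was: excipient + PBS n = 10, excipient + CBE n = 9, 3.2e9 vg rAAV-GBA1 + CBE n = 6, 1.0e10 vg rAAV-GBA1 + CBE n = 10, and 3.2e10 vg rAAV-GBA1 + CBE n = 7. Means are presented. Error bars are SEM. *p<0.05; **p<0.01, nominal p-values by linear regression in the CBE-treated groups, with gender corrected for as a covariate.
FIG. 16 shows representative data for biochemical assessment of rAAV dose ranging in a CBE mouse model. The cortex of all treatment groups (excipient + PBS n = 10, excipient + CBE n = 9, 3.2e9 vg rAAV-GBA1 + CBE n = 6, 1.0e10 vg rAAV-GBA1 + CBE n = 10, and 3.2e10 vg rAAV-GBA1 + CBE n = 7) was used to measure GCase activity, GluSph levels, GluCer levels, and vector genomes. GCase activity is shown as ng of GCase per mg of total protein. GluSph and GluCer levels are shown as pmol per mg wet weight of tissue. Biodistribution is shown as vector genomes per 1 μg of genomic DNA. Vector genome presence was quantified by quantitative PCR using a vector reference standard curve; genomic DNA concentration was evaluated by A260 optical density measurement. Vector genome presence was also measured in the liver (e). Means are presented. Error bars are SEM. **p<0.01; ***p<0.001, nominal p-values by linear regression in the CBE-treated groups, with gender corrected for as a covariate.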
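FIG. 17 shows representative data for tapered beam analysis of maximal-dose rAAV-GBA1 in a genetic mouse model. Motor performance of the treatment groups (WT + excipient (n = 5), 4L/PS-NA + excipient (n = 6), and 4L/PS-NA + rAAV-GBA1 (n = 5)) was assayed by Beam Walk 4 weeks after rAAV-GBA1 administration. Total slips and active time are shown as totals over 5 trials on different beams. Speed and slips per speed are shown as averages over 5 trials on different beams. Means are presented. Error bars are SEM.
FIG. 18 shows representative data for in vitro expression of rAAV constructs encoding progranulin (PGRN) protein (also referred to as GRN protein). The left panel shows a standard curve of the progranulin (PGRN) ELISA assay. The bottom panel shows the dose-response of PGRN expression measured by ELISA assay in cell lysates of HEK293T cells transduced with rAAV. MOI = multiplicity of infection (vector genomes per cell).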
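The FIG. 18 legend defines MOI as vector genomes per cell, so the number of vector genomes needed for a transduction is simply the MOI multiplied by the cell count. The sketch below illustrates that arithmetic; the function name and the example MOI and cell number are hypothetical and are not values from the disclosure.

```python
def vector_genomes_needed(moi: float, cell_count: int) -> float:
    """Vector genomes required to reach a target MOI (vector genomes per cell)."""
    if moi < 0 or cell_count < 0:
        raise ValueError("MOI and cell count must be non-negative")
    return moi * cell_count


# Hypothetical example: 200,000 cells in a well at an MOI of 1e5 vg per cell.
required_vg = vector_genomes_needed(moi=1e5, cell_count=200_000)
print(f"{required_vg:.1e} vg")
```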
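FIG. 19 shows representative data for in vitro expression of rAAV constructs encoding GBA1 in combination with prosaposin (PSAP), SCARB2, and/or one or more inhibitory nucleic acids. The data indicate that transfection of HEK293 cells with each construct resulted in overexpression of the transgenes of interest relative to mock-transfected cells.
FIG. 20 is a schematic diagram depicting an rAAV vector comprising a "D" region located on the "outside" of the ITR (e.g., proximal to the terminus of the ITR relative to the transgene insert or expression construct) (top) and a wild-type rAAV vector having ITRs on the "inside" of the vector (e.g., proximal to the transgene insert of the vector).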
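FIG. 21 is a schematic diagram depicting one embodiment of a vector comprising an expression construct encoding GBA2 or a portion thereof, and an interfering RNA for α-Syn.
FIG. 22 is a schematic diagram depicting one embodiment of a vector comprising an expression construct encoding Gcase (e.g., GBA1 or a portion thereof) and galactosylceramidase (e.g., GALC or a portion thereof). The coding sequences of Gcase and galactosylceramidase are separated by a T2A self-cleaving peptide sequence.
FIG. 23 is a schematic diagram depicting one embodiment of a vector comprising an expression construct encoding Gcase (e.g., GBA1 or a portion thereof) and galactosylceramidase (e.g., GALC or a portion thereof). The coding sequences of Gcase and galactosylceramidase are separated by a T2A self-cleaving peptide sequence.
FIG. 24 is a schematic diagram depicting one embodiment of a vector comprising an expression construct encoding Gcase (e.g., GBA1 or a portion thereof), cathepsin B (e.g., CTSB or a portion thereof), and an interfering RNA for α-Syn. The coding sequences of Gcase and cathepsin B are separated by a T2A self-cleaving peptide sequence.
FIG. 25 is a schematic diagram depicting one embodiment of a vector comprising an expression construct encoding Gcase (e.g., GBA1 or a portion thereof), sphingomyelin phosphodiesterase 1 (e.g., SMPD1 or a portion thereof), and an interfering RNA for α-Syn.
FIG. 26 is a schematic diagram depicting one embodiment of a vector comprising an expression construct encoding Gcase (e.g., GBA1 or a portion thereof) and galactosylceramidase (e.g., GALC or a portion thereof). The coding sequences of Gcase and galactosylceramidase are separated by an internal ribosome entry site (IRES).
FIG. 27 is a schematic diagram depicting one embodiment of a vector comprising an expression construct encoding Gcase (e.g., GBA1 or a portion thereof) and cathepsin B (e.g., CTSB or a portion thereof). Expression of the coding sequences of Gcase and cathepsin B is each driven by a separate promoter.
FIG. 28 is a schematic diagram depicting one embodiment of a vector comprising an expression construct encoding Gcase (e.g., GBA1 or a portion thereof), GCH1 (e.g., GCH1 or a portion thereof), and an interfering RNA for α-Syn. The coding sequences of Gcase and GCH1 are separated by a T2A self-cleaving peptide sequence.
FIG. 29 is a schematic diagram depicting one embodiment of a vector comprising an expression construct encoding Gcase (e.g., GBA1 or a portion thereof), RAB7L1 (e.g., RAB7L1 or a portion thereof), and an interfering RNA for α-Syn. The coding sequences of Gcase and RAB7L1 are separated by a T2A self-cleaving peptide sequence.
FIG. 30 is a schematic diagram depicting one embodiment of a vector comprising an expression construct encoding Gcase (e.g., GBA1 or a portion thereof), GCH1 (e.g., GCH1 or a portion thereof), and an interfering RNA for α-Syn. The coding sequences of Gcase and GCH1 are separated by an internal ribosome entry site (IRES).
FIG. 31 is a schematic diagram depicting one embodiment of a vector comprising an expression construct encoding VPS35 (e.g., VPS35 or a portion thereof) and interfering RNAs for α-Syn and TMEM106B.
FIG. 32 is a schematic diagram depicting one embodiment of a vector comprising an expression construct encoding Gcase (e.g., GBA1 or a portion thereof), IL-34 (e.g., IL34 or a portion thereof), and an interfering RNA for α-Syn. The coding sequences of Gcase and IL-34 are separated by a T2A self-cleaving peptide sequence.
FIG. 33 is a schematic diagram depicting one embodiment of a vector comprising an expression construct encoding Gcase (e.g., GBA1 or a portion thereof) and IL-34 (e.g., IL34 or a portion thereof). The coding sequences of Gcase and IL-34 are separated by an internal ribosome entry site (IRES).
FIG. 34 is a schematic diagram depicting one embodiment of a vector comprising an expression construct encoding Gcase (e.g., GBA1 or a portion thereof) and TREM2 (e.g., TREM2 or a portion thereof). Expression of the coding sequences of Gcase and TREM2 is each driven by a separate promoter.
FIG. 35 is a schematic diagram depicting one embodiment of a vector comprising an expression construct encoding Gcase (e.g., GBA1 or a portion thereof) and IL-34 (e.g., IL34 or a portion thereof). Expression of the coding sequences of Gcase and IL-34 is each driven by a separate promoter.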
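FIGs. 36A-36B show representative data for overexpression of TREM2 and GBA1 in HEK293 cells relative to control transduced cells, as measured by qPCR and ELISA. FIG. 36A shows data for overexpression of TREM2. FIG. 36B shows data for overexpression of GBA1 from the same construct.
FIG. 37 shows representative data indicating successful silencing of SNCA in vitro by GFP reporter assay (top) and α-Syn assay (bottom).
FIG. 38 shows representative data indicating successful silencing of TMEM106B in vitro by GFP reporter assay (top) and α-Syn assay (bottom).
FIG. 39 is a schematic diagram depicting one embodiment of a vector comprising an expression construct encoding PGRN (also referred to as GRN).
FIG. 40 shows data for transduction of HEK293 cells using rAAVs having ITRs with wild-type (circles) or alternative (e.g., "outside"; squares) placement of the "D" sequence. The rAAVs having ITRs placed on the "outside" were able to transduce cells as efficiently as rAAVs having wild-type ITRs.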
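FIG. 41 is a schematic diagram depicting one embodiment of a vector comprising an expression construct encoding Gcase (e.g., GBA1 or a portion thereof).
FIG. 42 is a schematic diagram depicting one embodiment of a vector comprising an expression construct encoding Gcase (e.g., GBA1 or a portion thereof).
FIG. 43 is a schematic diagram depicting one embodiment of a vector comprising an expression construct encoding Gcase (e.g., GBA1 or a portion thereof) and an interfering RNA for α-Syn.
FIG. 44 is a schematic diagram depicting one embodiment of a vector comprising an expression construct encoding PGRN (also referred to as GRN).
FIG. 45 is a schematic diagram depicting one embodiment of a vector comprising an expression construct encoding PGRN (also referred to as GRN).
FIG. 46 is a schematic diagram depicting one embodiment of a vector comprising an expression construct encoding PGRN (also referred to as GRN) and an interfering RNA for microtubule-associated protein tau (MAPT). The nucleic acid sequence of this vector is set forth in SEQ ID NO: 142.
FIG. 47 is a schematic diagram depicting one embodiment of a vector comprising an expression construct encoding Gcase (e.g., GBA1 or a portion thereof) and an interfering RNA for α-Syn.
FIG. 48 is a schematic diagram depicting one embodiment of a vector comprising an expression construct encoding PSAP.
FIG. 49 is a schematic diagram depicting one embodiment of a vector comprising an expression construct encoding Gcase (e.g., GBA1 or a portion thereof).
FIG. 50 is a schematic diagram depicting one embodiment of a vector comprising an expression construct encoding Gcase (e.g., GBA1 or a portion thereof) and galactosylceramidase (e.g., GALC or a portion thereof).
FIG. 51 is a schematic diagram depicting one embodiment of a plasmid comprising an rAAV vector that comprises an expression construct encoding Gcase (e.g., GBA1 or a portion thereof), prosaposin (e.g., PSAP or a portion thereof), and an interfering RNA for α-Syn.
FIG. 52 is a schematic diagram depicting one embodiment of a vector comprising an expression construct encoding Gcase (e.g., GBA1 or a portion thereof) and an inhibitory RNA targeting SNCA.
FIG. 53 is a schematic diagram depicting one embodiment of a plasmid comprising an rAAV vector that comprises an expression construct encoding SNCA.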
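FIG. 54 is a schematic diagram depicting one embodiment of a plasmid comprising an rAAV vector that comprises an expression construct encoding an inhibitory RNA targeting SNCA. The inhibitory RNA is positioned within an intron between the promoter sequence and the Gcase coding sequence.
FIG. 55 is a schematic diagram depicting one embodiment of a plasmid comprising an rAAV vector that comprises an expression construct encoding progranulin (PGRN; also referred to as GRN) and an inhibitory RNA targeting SNCA. The inhibitory RNA is positioned within an intron between the promoter sequence and the Gcase coding sequence.
FIG. 56 is a schematic diagram depicting one embodiment of a plasmid comprising an rAAV vector that comprises an expression construct encoding Gcase (GBA1) and an inhibitory RNA targeting SNCA. The inhibitory RNA is positioned within an intron between the promoter sequence and the Gcase coding sequence.
FIG. 57 is a schematic diagram depicting one embodiment of a plasmid comprising an rAAV vector that comprises an expression construct encoding Gcase (GBA1) and an inhibitory RNA targeting SNCA. The inhibitory RNA is positioned within an intron between the promoter sequence and the Gcase coding sequence.
FIG. 58 is a schematic diagram depicting one embodiment of a plasmid comprising an rAAV vector that comprises an expression construct encoding Gcase (GBA1) and an inhibitory RNA targeting SNCA. The "D" sequence of the 3' ITR is positioned on the "outside" of the vector.
FIG. 59 is a schematic diagram depicting one embodiment of a plasmid comprising an rAAV vector that comprises an expression construct encoding Gcase (GBA1) and an inhibitory RNA targeting SNCA. The inhibitory RNA is positioned within an intron between the promoter sequence and the Gcase coding sequence.
FIG. 60 is a schematic diagram depicting one embodiment of a plasmid comprising an rAAV vector that comprises an expression construct encoding Gcase (GBA1) and an inhibitory RNA targeting SNCA. The inhibitory RNA is positioned within an intron between the promoter sequence and the Gcase coding sequence.
FIG. 61 is a schematic diagram depicting one embodiment of a plasmid comprising an rAAV vector that comprises an expression construct encoding Gcase (GBA1) and an inhibitory RNA targeting SNCA.
FIG. 62 is a schematic diagram depicting one embodiment of a plasmid comprising an rAAV vector that comprises an expression construct encoding Gcase (GBA1) and an inhibitory RNA targeting SNCA. The inhibitory RNA is positioned within an intron between the promoter sequence and the Gcase coding sequence.
FIG. 63 is a schematic diagram depicting one embodiment of a plasmid comprising an rAAV vector that comprises an expression construct encoding Gcase (GBA1) and an inhibitory RNA targeting SNCA. The inhibitory RNA is positioned within an intron between the promoter sequence and the Gcase coding sequence.
FIG. 64 is a schematic diagram depicting one embodiment of a plasmid comprising an rAAV vector that comprises an expression construct encoding Gcase (GBA1) and progranulin (PGRN; also referred to as GRN), and an inhibitory RNA targeting TMEM106B. The inhibitory RNA is positioned within an intron between the promoter sequence and the Gcase coding sequence.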
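FIG. 65 is a schematic diagram depicting one embodiment of a plasmid comprising an rAAV vector that comprises an expression construct encoding an inhibitory RNA targeting RPS25.
FIG. 66 is a schematic diagram depicting one embodiment of a plasmid comprising an rAAV vector that comprises an expression construct encoding an inhibitory RNA targeting RPS25.
FIG. 67 is a schematic diagram depicting one embodiment of a plasmid comprising an rAAV vector that comprises an expression construct encoding an inhibitory RNA targeting MAPT.
FIG. 68 is a schematic diagram depicting one embodiment of a plasmid comprising an rAAV vector that comprises an expression construct encoding an inhibitory RNA targeting MAPT.
FIG. 69 is a schematic diagram depicting one embodiment of a plasmid comprising an rAAV vector that comprises an expression construct encoding progranulin (PGRN; also referred to as GRN) and an inhibitory RNA targeting MAPT. The inhibitory RNA is positioned within an intron between the promoter sequence and the PGRN (also referred to as GRN) coding sequence.
FIG. 70 is a schematic diagram depicting one embodiment of a plasmid comprising an rAAV vector that comprises an expression construct encoding an inhibitory RNA targeting MAPT.
FIG. 71 is a schematic diagram depicting one embodiment of a plasmid comprising an rAAV vector that comprises an expression construct encoding progranulin (PGRN; also referred to as GRN) and an inhibitory RNA targeting MAPT. The inhibitory RNA is positioned within an intron between the promoter sequence and the PGRN (also referred to as GRN) coding sequence.
FIG. 72 is a schematic diagram depicting one embodiment of a vector comprising an expression construct encoding Gcase (e.g., GBA1 or a portion thereof) and an inhibitory RNA targeting SNCA. The nucleic acid sequence of this vector is set forth in SEQ ID NO: 141.
FIG. 73 is a schematic diagram depicting one embodiment of a vector comprising an expression construct encoding Gcase (e.g., GBA1 or a portion thereof) and an inhibitory RNA targeting SNCA. The nucleic acid sequence of this vector is set forth in SEQ ID NO: 143.
FIG. 74 is a schematic diagram depicting one embodiment of a plasmid comprising an rAAV vector that comprises an expression construct encoding Gcase (GBA1) and prosaposin (PSAP), and an inhibitory RNA targeting SNCA. The nucleic acid sequence of this vector is set forth in SEQ ID NO: 144.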
FIG. 1 is a schematic diagram depicting one embodiment of a vector comprising an expression construct encoding Gcase (e.g., GBA1 or a portion thereof).
FIG. 2 is a schematic diagram depicting one embodiment of a vector comprising an expression construct encoding Gcase (e.g., GBA1 or a portion thereof) and LIMP2 (SCARB2) or a portion thereof. The coding sequences for Gcase and LIMP2 are separated by an internal ribosome entry site (IRES).
FIG. 3 is a schematic diagram depicting one embodiment of a vector comprising an expression construct encoding Gcase (e.g., GBA1 or a portion thereof) and LIMP2 (SCARB2) or a portion thereof. Expression of the coding sequences of Gcase and LIMP2 is driven by separate promoters.
FIG. 4 is a schematic diagram depicting one embodiment of a vector comprising an expression construct encoding Gcase (e.g., GBA1 or a portion thereof), LIMP2 (SCARB2) or a portion thereof, and an interfering RNA for α-Syn.
FIG. 5 is a schematic diagram depicting one embodiment of a vector comprising an expression construct encoding Gcase (e.g., GBA1 or a portion thereof), prosaposin (e.g., PSAP or a portion thereof), and an interfering RNA for α-Syn.
FIG. 6 is a schematic diagram depicting one embodiment of a vector comprising an expression construct encoding Gcase (e.g., GBA1 or a portion thereof) and prosaposin (e.g., PSAP or a portion thereof). The coding sequences for Gcase and prosaposin are separated by an internal ribosome entry site (IRES).
FIG. 7 is a schematic diagram depicting one embodiment of a vector comprising an expression construct encoding Gcase (e.g., GBA1 or a portion thereof). In this embodiment, the vector comprises a CBA promoter element (CBA) consisting of four parts: a CMV enhancer (CMVe), a CBA promoter (CBAp), an
FIG. 8 is a schematic diagram illustrating one embodiment of the vector described in FIG. 6.
FIG. 9 shows representative data for delivery of rAAV comprising a transgene encoding Gcase (e.g., GBA1 or a portion thereof) in a CBE mouse model of Parkinson's disease. Daily IP delivery of PBS vehicle, 25 mg/kg CBE, 37.5 mg/kg CBE, or 50 mg/kg CBE (left to right) was initiated at P8. Survival (top left) was checked twice a day, and body weight (top right) was checked daily. All groups started with n = 8. Behavior was assessed as the total distance traveled in the open field at P23 (bottom left) and the latency to fall on the rotarod at P24 (bottom middle). Levels of GCase substrates were analyzed in the cortex of mice in the PBS and 25 mg/kg CBE treatment groups, both with (day 3) and without (day 1) CBE withdrawal. Summed GluSph and GalSph levels (bottom right) are expressed as pmol per mg wet weight of tissue. Means are presented. Error bars are SEM. *p<0.05; **p<0.01; ***p<0.001, nominal p-values for treatment groups by linear regression.
FIG. 10 is a schematic diagram depicting one embodiment of a study design for maximal rAAV dose in a CBE mouse model. Briefly, rAAV was delivered by ICV injection at P3, and daily CBE treatment was initiated at P8. Behavior was assessed in the open field and rotarod assays at P24-25, and substrate levels were measured at P36 and P38.
FIG. 11 shows representative data for the in-life assessment of maximal rAAV dose in a CBE mouse model. At P3, mice were treated with excipient or 8.8e9 vg rAAV-GBA1 via ICV delivery. Daily IP delivery of PBS or 25 mg/kg CBE was initiated at P8. At the end of the study, half of the mice were sacrificed 1 day after the last CBE administration at P36 (day 1), and the other half underwent CBE withdrawal for 3 days before being sacrificed at P38 (day 3). All treatment groups (excipient + PBS n = 8, rAAV-GBA1 + PBS n = 7, excipient + CBE n = 8, and rAAV-GBA1 + CBE n = 9) were weighed daily (top left), and body weight was analyzed at P36 (top right). Behavior was assessed as the total distance traveled in the open field at P23 (bottom left) and the latency to fall on the rotarod at P24 (bottom right), evaluated for each animal as the median across three trials. Due to mortality, n = 7 for the excipient + CBE group for the behavioral assays, and n = 8 for all other groups. Means across animals are presented. Error bars are SEM. *p<0.05; ***p<0.001, nominal p-values for treatment groups by linear regression in CBE-treated animals.
FIG. 12 shows representative data for biochemical evaluation of maximal rAAV dose in a CBE mouse model. The cortex of all treatment groups (excipient + PBS n = 8, variant + PBS n = 7, excipient + CBE n = 7, and variant + CBE n = 9) was used to measure GCase activity (top left), GluSph levels (top right), GluCer levels (bottom left), and vector genomes (bottom right) in groups before (day 1) or after (day 3) CBE withdrawal. Biodistribution is expressed as vector genomes per μg of genomic DNA. Means are presented. Error bars are SEM. (*)p<0.1; **p<0.01; ***p<0.001, nominal p-values for treatment groups by linear regression in CBE-treated animals, with collection day and sex corrected for as covariates.
FIG. 13 shows representative data for behavioral and biochemical correlations in a CBE mouse model across the excipient + PBS, excipient + CBE, and variant + CBE treatment groups. Performance on the rotarod across treatment groups was negatively correlated with GluCer accumulation (a, p=0.0012 by linear regression), and GluSph accumulation was negatively correlated with increased GCase activity (b, p=0.0086 by linear regression).
FIG. 14 shows representative data for the biodistribution of variant in a CBE mouse model. The presence of vector genomes was assessed in the liver, spleen, kidney, and gonads for all treatment groups (excipient + PBS n = 8, variant + PBS n = 7, excipient + CBE n = 7, and variant + CBE n = 9). Biodistribution is expressed as vector genomes per μg of genomic DNA. Vector genome presence was quantified by quantitative PCR using a vector reference standard curve; genomic DNA concentration was assessed by A260 optical density measurement. Means are presented. Error bars are SEM. *p<0.05; **p<0.01; ***p<0.001, nominal p-values for treatment groups by linear regression in CBE-treated animals, with collection day and sex corrected for as covariates.
FIG. 15 shows representative data for the in-life evaluation of a range of rAAV doses in a CBE mouse model. Mice received vehicle or one of three doses of rAAV-GBA1 (3.2e9 vg, 1.0e10 vg, or 3.2e10 vg) by ICV delivery at P3. At P8, daily IP treatment with 25 mg/kg CBE was initiated. Mice that received vehicle plus CBE or vehicle plus PBS served as controls. All treatment groups started with n = 10 (5M/5F) per group. All mice were sacrificed 1 day after the last CBE administration (P38-P40). All treatment groups were weighed daily, and body weight was analyzed at P36. Motor performance was evaluated as the latency to fall on the rotarod at P24 and the latency to traverse the tapered beam at P30. Due to early lethality, the number of mice participating in the behavioral assays was as follows: vehicle + PBS n = 10, vehicle + CBE n = 9, 3.2e9 vg rAAV-GBA1 + CBE n = 6, 1.0e10 vg rAAV-GBA1 + CBE n = 10, and 3.2e10 vg rAAV-GBA1 + CBE n = 7. Means are presented. Error bars are SEM. *p<0.05; **p<0.01, nominal p-values by linear regression in the CBE-treated groups, with sex corrected for as a covariate.
FIG. 16 shows representative data for biochemical evaluation of a range of rAAV doses in a CBE mouse model. The cortex of all treatment groups (excipient + PBS n = 10, excipient + CBE n = 9, 3.2e9 vg rAAV-GBA1 + CBE n = 6, 1.0e10 vg rAAV-GBA1 + CBE n = 10, and 3.2e10 vg rAAV-GBA1 + CBE n = 7) was used to measure GCase activity, GluSph levels, GluCer levels, and vector genomes. GCase activity is expressed as ng of GCase per mg of total protein. GluSph and GluCer levels are expressed as pmol per mg wet weight of tissue. Biodistribution is expressed as vector genomes per μg of genomic DNA. Vector genome presence was quantified by quantitative PCR using a vector reference standard curve; genomic DNA concentration was assessed by A260 optical density measurement. Vector genome presence was also determined in the liver (e). Means are presented. Error bars are SEM. **p<0.01; ***p<0.001, nominal p-values by linear regression in the CBE-treated groups, with sex corrected for as a covariate.
FIG. 17 shows representative data for tapered beam analysis of maximal dose rAAV-GBA1 in a genetic mouse model. The motor performance of the treatment groups (WT + excipient (n = 5), 4L/PS-NA + excipient (n = 6), and 4L/PS-NA + rAAV-GBA1 (n = 5)) was assayed by beam walk 4 weeks after rAAV-GBA1 administration. Total slips and active time are shown as totals over 5 trials on different beams. Speed and slips per speed are shown as averages over 5 trials on different beams. Means are presented. Error bars are SEM.
FIG. 18 shows representative data for in vitro expression of rAAV constructs encoding progranulin (PGRN) protein (also referred to as GRN protein). The left panel shows the standard curve of the progranulin (PGRN) ELISA assay. The lower panel shows the dose-response of PGRN expression, measured by ELISA assay in cell lysates of HEK293T cells transduced with rAAV. MOI = multiplicity of infection (vector genomes per cell).
FIG. 19 shows representative data for in vitro expression of rAAV constructs encoding prosaposin (PSAP), SCARB2, and/or GBA1 in combination with one or more inhibitory nucleic acids. The data show that transfection of HEK293 cells with each construct resulted in overexpression of the transgene of interest compared to mock-transfected cells.
FIG. 20 is a schematic diagram depicting an rAAV vector comprising a “D” region located “outside” of the ITR (e.g., proximal to the terminus of the ITR relative to the transgene insert or expression construct) (top) and a wild-type rAAV vector having an ITR on the “inside” of the vector (e.g., proximal to the transgene insert of the vector).
FIG. 21 is a schematic diagram depicting one embodiment of a vector comprising an expression construct encoding GBA2 or a portion thereof, and an interfering RNA for α-Syn.
FIG. 22 is a schematic diagram depicting one embodiment of a vector comprising an expression construct encoding Gcase (e.g., GBA1 or a portion thereof) and galactosylceramidase (e.g., GALC or a portion thereof). The coding sequences for Gcase and galactosylceramidase are separated by a T2A self-cleaving peptide sequence.
FIG. 23 is a schematic diagram depicting one embodiment of a vector comprising an expression construct encoding Gcase (e.g., GBA1 or a portion thereof) and galactosylceramidase (e.g., GALC or a portion thereof). The coding sequences for Gcase and galactosylceramidase are separated by a T2A self-cleaving peptide sequence.
FIG. 24 is a schematic diagram depicting one embodiment of a vector comprising an expression construct encoding Gcase (e.g., GBA1 or a portion thereof), cathepsin B (e.g., CTSB or a portion thereof), and an interfering RNA for α-Syn. The coding sequences for Gcase and cathepsin B are separated by a T2A self-cleaving peptide sequence.
FIG. 25 is a schematic diagram depicting one embodiment of a vector comprising an expression construct encoding Gcase (e.g., GBA1 or a portion thereof), sphingomyelin phosphodiesterase 1 (e.g., SMPD1 or a portion thereof), and an interfering RNA for α-Syn.
FIG. 26 is a schematic diagram depicting one embodiment of a vector comprising an expression construct encoding Gcase (e.g., GBA1 or a portion thereof) and galactosylceramidase (e.g., GALC or a portion thereof). The coding sequences for Gcase and galactosylceramidase are separated by an internal ribosome entry site (IRES).
FIG. 27 is a schematic diagram depicting one embodiment of a vector comprising an expression construct encoding Gcase (e.g., GBA1 or a portion thereof) and cathepsin B (e.g., CTSB or a portion thereof). Expression of the coding sequences for Gcase and cathepsin B is driven by separate promoters.
FIG. 28 is a schematic diagram depicting one embodiment of a vector comprising an expression construct encoding Gcase (e.g., GBA1 or a portion thereof), GCH1 (e.g., GCH1 or a portion thereof), and an interfering RNA for α-Syn. The coding sequences for Gcase and GCH1 are separated by a T2A self-cleaving peptide sequence.
FIG. 29 is a schematic diagram depicting one embodiment of a vector comprising an expression construct encoding Gcase (e.g., GBA1 or a portion thereof), RAB7L1 (e.g., RAB7L1 or a portion thereof), and an interfering RNA for α-Syn. The coding sequences for Gcase and RAB7L1 are separated by a T2A self-cleaving peptide sequence.
FIG. 30 is a schematic diagram depicting one embodiment of a vector comprising an expression construct encoding Gcase (e.g., GBA1 or a portion thereof), GCH1 (e.g., GCH1 or a portion thereof), and an interfering RNA for α-Syn. The coding sequences for Gcase and GCH1 are separated by an internal ribosome entry site (IRES).
FIG. 31 is a schematic diagram depicting one embodiment of a vector comprising an expression construct encoding VPS35 (e.g., VPS35 or a portion thereof) and interfering RNAs for α-Syn and TMEM106B.
FIG. 32 is a schematic diagram depicting one embodiment of a vector comprising an expression construct encoding Gcase (e.g., GBA1 or a portion thereof), IL-34 (e.g., IL34 or a portion thereof), and an interfering RNA for α-Syn. The coding sequences for Gcase and IL-34 are separated by a T2A self-cleaving peptide sequence.
FIG. 33 is a schematic diagram depicting one embodiment of a vector comprising an expression construct encoding Gcase (e.g., GBA1 or a portion thereof) and IL-34 (e.g., IL34 or a portion thereof). The coding sequences for Gcase and IL-34 are separated by an internal ribosome entry site (IRES).
FIG. 34 is a schematic diagram depicting one embodiment of a vector comprising an expression construct encoding Gcase (e.g., GBA1 or a portion thereof) and TREM2 (e.g., TREM2 or a portion thereof). Expression of the coding sequences for Gcase and TREM2 is driven by separate promoters.
FIG. 35 is a schematic diagram depicting one embodiment of a vector comprising an expression construct encoding Gcase (e.g., GBA1 or a portion thereof) and IL-34 (e.g., IL34 or a portion thereof). Expression of the coding sequences for Gcase and IL-34 is driven by separate promoters.
FIGs. 36A-36B show representative data for overexpression of TREM2 and GBA1 in HEK293 cells compared to control transduced cells, as measured by qPCR and ELISA. FIG. 36A shows data for overexpression of TREM2. FIG. 36B shows data for overexpression of GBA1 from the same construct.
FIG. 37 shows representative data demonstrating successful silencing of SNCA in vitro by GFP reporter assay (top) and α-Syn assay (bottom).
FIG. 38 shows representative data demonstrating successful silencing of TMEM106B in vitro by GFP reporter assay (top) and α-Syn assay (bottom).
FIG. 39 is a schematic diagram depicting one embodiment of a vector comprising an expression construct encoding PGRN (also referred to as GRN).
FIG. 40 shows data for transduction of HEK293 cells with rAAVs having ITRs with wild-type (circles) or alternative (e.g., “outside”; squares) placement of the “D” sequence. rAAVs having ITRs placed on the “outside” were able to transduce cells as efficiently as rAAVs having wild-type ITRs.
FIG. 41 is a schematic diagram depicting one embodiment of a vector comprising an expression construct encoding Gcase (e.g., GBA1 or a portion thereof).
FIG. 42 is a schematic diagram depicting one embodiment of a vector comprising an expression construct encoding Gcase (e.g., GBA1 or a portion thereof).
FIG. 43 is a schematic diagram depicting one embodiment of a vector comprising an expression construct encoding Gcase (e.g., GBA1 or a portion thereof) and an interfering RNA for α-Syn.
FIG. 44 is a schematic diagram depicting one embodiment of a vector comprising an expression construct encoding PGRN (also referred to as GRN).
FIG. 45 is a schematic diagram depicting one embodiment of a vector comprising an expression construct encoding PGRN (also referred to as GRN).
FIG. 46 is a schematic diagram depicting one embodiment of a vector comprising an expression construct encoding PGRN (also referred to as GRN) and an interfering RNA for microtubule-associated protein tau (MAPT). The nucleic acid sequence of this vector is set forth in SEQ ID NO: 142.
FIG. 47 is a schematic diagram depicting one embodiment of a vector comprising an expression construct encoding Gcase (e.g., GBA1 or a portion thereof) and an interfering RNA for α-Syn.
FIG. 48 is a schematic diagram depicting one embodiment of a vector comprising an expression construct encoding PSAP.
FIG. 49 is a schematic diagram depicting one embodiment of a vector comprising an expression construct encoding Gcase (e.g., GBA1 or a portion thereof).
FIG. 50 is a schematic diagram depicting one embodiment of a vector comprising an expression construct encoding Gcase (e.g., GBA1 or a portion thereof) and galactosylceramidase (e.g., GALC or a portion thereof).
FIG. 51 is a schematic diagram depicting one embodiment of a plasmid comprising an rAAV vector comprising an expression construct encoding Gcase (e.g., GBA1 or a portion thereof), prosaposin (e.g., PSAP or a portion thereof), and an interfering RNA for α-Syn.
FIG. 52 is a schematic diagram depicting one embodiment of a vector comprising an expression construct encoding Gcase (e.g., GBA1 or a portion thereof) and an inhibitory RNA targeting SNCA.
FIG. 53 is a schematic diagram depicting one embodiment of a plasmid comprising an rAAV vector comprising an expression construct encoding SNCA.
FIG. 54 is a schematic diagram depicting one embodiment of a plasmid comprising an rAAV vector comprising an expression construct encoding an inhibitory RNA targeting SNCA. The inhibitory RNA is located within an intron between the promoter sequence and the Gcase coding sequence.
FIG. 55 is a schematic diagram depicting one embodiment of a plasmid comprising an rAAV vector comprising an expression construct encoding progranulin (PGRN; also referred to as GRN) and an inhibitory RNA targeting SNCA. The inhibitory RNA is located within an intron between the promoter sequence and the Gcase coding sequence.
FIG. 56 is a schematic diagram depicting one embodiment of a plasmid comprising an rAAV vector comprising an expression construct encoding Gcase (GBA1) and an inhibitory RNA targeting SNCA. The inhibitory RNA is located within an intron between the promoter sequence and the Gcase coding sequence.
FIG. 57 is a schematic diagram depicting one embodiment of a plasmid comprising an rAAV vector comprising an expression construct encoding Gcase (GBA1) and an inhibitory RNA targeting SNCA. The inhibitory RNA is located within an intron between the promoter sequence and the Gcase coding sequence.
FIG. 58 is a schematic diagram depicting one embodiment of a plasmid comprising an rAAV vector comprising an expression construct encoding Gcase (GBA1) and an inhibitory RNA targeting SNCA. The "D" sequence of the 3' ITR is located "outside" of the vector.
FIG. 59 is a schematic diagram depicting one embodiment of a plasmid comprising an rAAV vector comprising an expression construct encoding Gcase (GBA1) and an inhibitory RNA targeting SNCA. The inhibitory RNA is located within an intron between the promoter sequence and the Gcase coding sequence.
FIG. 60 is a schematic diagram depicting one embodiment of a plasmid comprising an rAAV vector comprising an expression construct encoding Gcase (GBA1) and an inhibitory RNA targeting SNCA. The inhibitory RNA is located within an intron between the promoter sequence and the Gcase coding sequence.
FIG. 61 is a schematic diagram depicting one embodiment of a plasmid comprising an rAAV vector comprising an expression construct encoding Gcase (GBA1) and an inhibitory RNA targeting SNCA.
FIG. 62 is a schematic diagram depicting one embodiment of a plasmid comprising an rAAV vector comprising an expression construct encoding Gcase (GBA1) and an inhibitory RNA targeting SNCA. The inhibitory RNA is located within an intron between the promoter sequence and the Gcase coding sequence.
FIG. 63 is a schematic diagram depicting one embodiment of a plasmid comprising an rAAV vector comprising an expression construct encoding Gcase (GBA1) and an inhibitory RNA targeting SNCA. The inhibitory RNA is located within an intron between the promoter sequence and the Gcase coding sequence.
FIG. 64 is a schematic diagram depicting one embodiment of a plasmid comprising an rAAV vector comprising an expression construct encoding Gcase (GBA1), progranulin (PGRN; also referred to as GRN), and an inhibitory RNA targeting TMEM106B. The inhibitory RNA is located within an intron between the promoter sequence and the Gcase coding sequence.
FIG. 65 is a schematic diagram depicting one embodiment of a plasmid comprising an rAAV vector comprising an expression construct encoding an inhibitory RNA targeting RPS25.
FIG. 66 is a schematic diagram depicting one embodiment of a plasmid comprising an rAAV vector comprising an expression construct encoding an inhibitory RNA targeting RPS25.
FIG. 67 is a schematic diagram depicting one embodiment of a plasmid comprising an rAAV vector comprising an expression construct encoding an inhibitory RNA targeting MAPT.
FIG. 68 is a schematic diagram depicting one embodiment of a plasmid comprising an rAAV vector comprising an expression construct encoding an inhibitory RNA targeting MAPT.
FIG. 69 is a schematic diagram depicting one embodiment of a plasmid comprising an rAAV vector comprising an expression construct encoding progranulin (PGRN; also referred to as GRN) and an inhibitory RNA targeting MAPT. The inhibitory RNA is located within an intron between the promoter sequence and the PGRN (also referred to as GRN) coding sequence.
FIG. 70 is a schematic diagram depicting one embodiment of a plasmid comprising an rAAV vector comprising an expression construct encoding an inhibitory RNA targeting MAPT.
FIG. 71 is a schematic diagram depicting one embodiment of a plasmid comprising an rAAV vector comprising an expression construct encoding progranulin (PGRN; also referred to as GRN) and an inhibitory RNA targeting MAPT. The inhibitory RNA is located within an intron between the promoter sequence and the PGRN (also referred to as GRN) coding sequence.
FIG. 72 is a schematic diagram depicting one embodiment of a vector comprising an expression construct encoding Gcase (e.g., GBA1 or a portion thereof) and an inhibitory RNA targeting SNCA. The nucleic acid sequence of this vector is set forth in SEQ ID NO: 141.
FIG. 73 is a schematic diagram depicting one embodiment of a vector comprising an expression construct encoding Gcase (e.g., GBA1 or a portion thereof) and an inhibitory RNA targeting SNCA. The nucleic acid sequence of this vector is set forth in SEQ ID NO: 143.
FIG. 74 is a schematic diagram depicting one embodiment of a plasmid comprising an rAAV vector comprising an expression construct encoding Gcase (GBA1), prosaposin (PSAP), and an inhibitory RNA targeting SNCA. The nucleic acid sequence of this vector is set forth in SEQ ID NO: 144.
FIGs. 75A-75C are charts showing MAPT knockdown in SY5Y cells by RNA interference. FIG. 75A shows immunofluorescence localization of AAV vectors using probes directed against BGHpA. FIG. 75B shows RT-PCR results for MAPT expression at 3 and 7 days after transduction. FIG. 75C shows general information on the rAAV virus stock used for transduction.
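Several of the figure descriptions above report group means with SEM error bars and express biodistribution as vector genomes per μg of genomic DNA. The following is a minimal sketch of that arithmetic only. The function names and sample numbers are hypothetical and are not taken from the disclosure, and the SEM is assumed to use the sample standard deviation (n − 1 denominator).

```python
import math
import statistics


def mean_and_sem(values):
    """Return (mean, SEM), where SEM = sample standard deviation / sqrt(n)."""
    n = len(values)
    if n < 2:
        raise ValueError("SEM needs at least two observations")
    return statistics.mean(values), statistics.stdev(values) / math.sqrt(n)


def vg_per_ug_gdna(vector_genome_copies, gdna_ng):
    """Normalize a qPCR copy number to vector genomes per microgram of genomic DNA."""
    if gdna_ng <= 0:
        raise ValueError("genomic DNA input must be positive")
    # 1 microgram = 1000 ng, so scale the copy number by 1000 / (ng of input DNA).
    return vector_genome_copies * 1000.0 / gdna_ng


# Hypothetical illustration values only (not data from the disclosure).
group_mean, group_sem = mean_and_sem([1.0, 2.0, 3.0])
print(f"mean = {group_mean:.2f}, SEM = {group_sem:.3f}")
print(f"{vg_per_ug_gdna(5000, 50):.0f} vector genomes per ug gDNA")
```

The copy number passed to `vg_per_ug_gdna` would come from interpolating a qPCR measurement on the vector reference standard curve, which is not modeled here.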
The present disclosure is based, in part, on compositions and methods for expressing combinations of certain gene products (e.g., gene products associated with CNS disease) in a subject. A gene product can be a protein, a fragment (e.g., a portion) of a protein, an interfering nucleic acid that inhibits a CNS disease-associated gene, etc. In some embodiments, a gene product is a protein or protein fragment encoded by a CNS disease-associated gene. In some embodiments, a gene product is an interfering nucleic acid (e.g., shRNA, siRNA, miRNA, amiRNA, etc.) that inhibits a CNS disease-associated gene.
A CNS disease-associated gene refers to a gene encoding a gene product that is genetically, biochemically, or functionally associated with a CNS disease, such as PD. For example, individuals having mutations in the GBA1 gene (which encodes the protein Gcase) have been observed to have an increased risk of developing PD compared to individuals without mutations in GBA1. In another example, synucleinopathies (e.g., PD) are associated with accumulation of protein aggregates comprising α-synuclein (α-Syn) protein; accordingly, SNCA (which encodes α-Syn) is a PD-associated gene. In some embodiments, an expression cassette described herein encodes a wild-type or non-mutant form of a CNS disease-associated gene, e.g., a PD-associated gene (or its coding sequence). Examples of CNS disease-associated genes (e.g., PD-associated genes, AD-associated genes, FTD-associated genes, etc.) are listed in Table 1.
<Table 1>
Examples of CNS disease-associated genes and gene products
Isolated Nucleic Acids and Vectors
An isolated nucleic acid may be DNA or RNA. As used herein, the term “isolated” means artificially produced. As used herein, an “isolated nucleic acid” refers to a nucleic acid that is (i) amplified in vitro, for example by polymerase chain reaction (PCR); (ii) produced recombinantly by cloning; (iii) purified, for example by cleavage and gel separation; or (iv) synthesized, for example by chemical synthesis. An isolated nucleic acid is one that can be readily manipulated by recombinant DNA techniques well known in the art.
In some aspects, the present disclosure provides an isolated nucleic acid (e.g., an rAAV vector) comprising an expression construct encoding one or more CNS disease-associated genes (e.g., PD-associated genes), for example Gcase, prosaposin, LIMP2/SCARB2, GBA2, GALC protein, CTSB protein, SMPD1, GCH1 protein, RAB7L protein, VPS35 protein, IL-34 protein, TREM2 protein, or TMEM106B protein. In some aspects, the present disclosure also provides an isolated nucleic acid (e.g., an rAAV vector) encoding one or more inhibitory nucleic acids that target one or more CNS disease-associated genes, for example SNCA, TMEM106B, RPS25, and MAPT. In some embodiments, an isolated nucleic acid encoding a CNS disease-associated gene may further comprise a coding sequence for an inhibitory nucleic acid that targets one or more CNS disease-associated genes. In some embodiments, the CNS disease-associated gene and the inhibitory nucleic acid targeting a CNS disease-associated gene are encoded on different nucleic acids.
In some aspects, the present disclosure provides an isolated nucleic acid comprising an expression construct encoding Gcase (e.g., the gene product of the GBA1 gene). Gcase, also referred to as β-glucocerebrosidase or GBA, refers to a lysosomal protein that cleaves the beta-glucosidic linkage of the chemical glucocerebroside, an intermediate in glycolipid metabolism. In humans, Gcase is encoded by the GBA1 gene, located on chromosome 1. In some embodiments, GBA1 encodes a peptide represented by NCBI Reference Sequence NP_000148.2 (SEQ ID NO: 14). In some embodiments, the isolated nucleic acid comprises a Gcase-coding sequence that has been codon-optimized (e.g., codon-optimized for expression in mammalian cells, for example human cells), such as the sequence set forth in SEQ ID NO: 15.
In some aspects, the present disclosure provides an isolated nucleic acid comprising an expression construct encoding prosaposin (e.g., the gene product of the PSAP gene). Prosaposin is a precursor glycoprotein for sphingolipid activator proteins (saposins) A, B, C, and D, which facilitate the catabolism of glycosphingolipids with short oligosaccharide groups. In humans, the PSAP gene is located on chromosome 10. In some embodiments, PSAP encodes a peptide represented by NCBI Reference Sequence NP_002769.1 (e.g., SEQ ID NO: 16). In some embodiments, the isolated nucleic acid comprises a prosaposin-coding sequence that has been codon-optimized (e.g., codon-optimized for expression in mammalian cells, for example human cells), such as the sequence set forth in SEQ ID NO: 17.
Aspects of the present disclosure relate to an isolated nucleic acid comprising an expression construct encoding LIMP2/SCARB2 (e.g., the gene product of the SCARB2 gene). SCARB2 refers to a membrane protein that regulates lysosomal and endosomal transport within a cell. In humans, the SCARB2 gene is located on chromosome 4. In some embodiments, the SCARB2 gene encodes a peptide represented by NCBI Reference Sequence NP_005497.1 (SEQ ID NO: 18). In some embodiments, the isolated nucleic acid comprises the sequence set forth in SEQ ID NO: 19. In some embodiments, the isolated nucleic acid comprises a codon-optimized SCARB2-coding sequence.
Aspects of the present disclosure relate to an isolated nucleic acid comprising an expression construct encoding GBA2 protein (e.g., the gene product of the GBA2 gene). GBA2 protein refers to non-lysosomal glucosylceramidase. In humans, the GBA2 gene is located on chromosome 9. In some embodiments, the GBA2 gene encodes a peptide represented by NCBI Reference Sequence NP_065995.1 (SEQ ID NO: 30). In some embodiments, the isolated nucleic acid comprises the sequence set forth in SEQ ID NO: 31. In some embodiments, the isolated nucleic acid comprises a codon-optimized GBA2-coding sequence.
Aspects of the present disclosure relate to isolated nucleic acids comprising an expression construct encoding GALC protein (e.g., a gene product of the GALC gene). GALC protein refers to galactosylceramidase (or galactocerebrosidase), an enzyme that hydrolyzes the galactose ester bonds of galactocerebroside, galactosylsphingosine, lactosylceramide, and monogalactosyldiglyceride. In humans, the GALC gene is located on chromosome 14. In some embodiments, the GALC gene encodes a peptide represented by NCBI Reference Sequence NP_000144.2 (SEQ ID NO: 33). In some embodiments, the isolated nucleic acid comprises the sequence set forth in SEQ ID NO: 34. In some embodiments, the isolated nucleic acid comprises a codon-optimized GALC-coding sequence.
Aspects of the present disclosure relate to isolated nucleic acids comprising an expression construct encoding CTSB protein (e.g., a gene product of the CTSB gene). CTSB protein refers to cathepsin B, a lysosomal cysteine protease that plays an important role in intracellular proteolysis. In humans, the CTSB gene is located on chromosome 8. In some embodiments, the CTSB gene encodes a peptide represented by NCBI Reference Sequence NP_001899.1 (SEQ ID NO: 35). In some embodiments, the isolated nucleic acid comprises the sequence set forth in SEQ ID NO: 36. In some embodiments, the isolated nucleic acid comprises a codon-optimized CTSB-coding sequence.
Aspects of the present disclosure relate to isolated nucleic acids comprising an expression construct encoding SMPD1 protein (e.g., a gene product of the SMPD1 gene). SMPD1 protein refers to sphingomyelin phosphodiesterase 1, a hydrolase enzyme involved in sphingolipid metabolism. In humans, the SMPD1 gene is located on chromosome 11. In some embodiments, the SMPD1 gene encodes a peptide represented by NCBI Reference Sequence NP_000534.3 (SEQ ID NO: 37). In some embodiments, the isolated nucleic acid comprises the sequence set forth in SEQ ID NO: 38. In some embodiments, the isolated nucleic acid comprises a codon-optimized SMPD1-coding sequence.
Aspects of the present disclosure relate to isolated nucleic acids comprising an expression construct encoding GCH1 protein (e.g., a gene product of the GCH1 gene). GCH1 protein refers to GTP cyclohydrolase I, a hydrolase enzyme that is part of the folate and biopterin biosynthesis pathways. In humans, the GCH1 gene is located on chromosome 14. In some embodiments, the GCH1 gene encodes a peptide represented by NCBI Reference Sequence NP_000152.1 (SEQ ID NO: 45). In some embodiments, the isolated nucleic acid comprises the sequence set forth in SEQ ID NO: 46. In some embodiments, the isolated nucleic acid comprises a codon-optimized GCH1-coding sequence.
Aspects of the present disclosure relate to isolated nucleic acids comprising an expression construct encoding RAB7L protein (e.g., a gene product of the RAB7L gene). RAB7L protein refers to RAB7, member RAS oncogene family-like 1, which is a GTP-binding protein. In humans, the RAB7L gene is located on chromosome 1. In some embodiments, the RAB7L gene encodes a peptide represented by NCBI Reference Sequence NP_003920.1 (SEQ ID NO: 47). In some embodiments, the isolated nucleic acid comprises the sequence set forth in SEQ ID NO: 48. In some embodiments, the isolated nucleic acid comprises a codon-optimized RAB7L-coding sequence.
Aspects of the present disclosure relate to isolated nucleic acids comprising an expression construct encoding VPS35 protein (e.g., a gene product of the VPS35 gene). VPS35 protein refers to vacuolar protein sorting-associated protein 35, which is part of a protein complex involved in retrograde transport of proteins from endosomes to the trans-Golgi network. In humans, the VPS35 gene is located on chromosome 16. In some embodiments, the VPS35 gene encodes a peptide represented by NCBI Reference Sequence NP_060676.2 (SEQ ID NO: 49). In some embodiments, the isolated nucleic acid comprises the sequence set forth in SEQ ID NO: 50. In some embodiments, the isolated nucleic acid comprises a codon-optimized VPS35-coding sequence.
Aspects of the present disclosure relate to isolated nucleic acids comprising an expression construct encoding IL-34 protein (e.g., a gene product of the IL34 gene). IL-34 protein refers to interleukin 34, a cytokine that increases the growth and survival of monocytes. In humans, the IL34 gene is located on chromosome 16. In some embodiments, the IL34 gene encodes a peptide represented by NCBI Reference Sequence NP_689669.2 (SEQ ID NO: 55). In some embodiments, the isolated nucleic acid comprises the sequence set forth in SEQ ID NO: 56. In some embodiments, the isolated nucleic acid comprises a codon-optimized IL-34-coding sequence.
Aspects of the present disclosure relate to isolated nucleic acids comprising an expression construct encoding TREM2 protein (e.g., a gene product of the TREM2 gene). TREM2 protein refers to triggering receptor expressed on myeloid cells 2, an immunoglobulin superfamily receptor found on myeloid cells. In humans, the TREM2 gene is located on chromosome 6. In some embodiments, the TREM2 gene encodes a peptide represented by NCBI Reference Sequence NP_061838.1 (SEQ ID NO: 57). In some embodiments, the isolated nucleic acid comprises the sequence set forth in SEQ ID NO: 58. In some embodiments, the isolated nucleic acid comprises a codon-optimized TREM2-coding sequence.
Aspects of the present disclosure relate to isolated nucleic acids comprising an expression construct encoding TMEM106B protein (e.g., a gene product of the TMEM106B gene). TMEM106B protein refers to transmembrane protein 106B, a protein involved in dendrite morphogenesis and the regulation of lysosomal trafficking. In humans, the TMEM106B gene is located on chromosome 7. In some embodiments, the TMEM106B gene encodes a peptide represented by NCBI Reference Sequence NP_060844.2 (SEQ ID NO: 63). In some embodiments, the isolated nucleic acid comprises the sequence set forth in SEQ ID NO: 64. In some embodiments, the isolated nucleic acid comprises a codon-optimized TMEM106B-coding sequence.
Aspects of the present disclosure relate to isolated nucleic acids comprising an expression construct encoding progranulin protein (e.g., a gene product of the GRN gene). PGRN protein refers to progranulin, a protein involved in development, inflammation, cell proliferation, and protein homeostasis. In humans, the PGRN gene (also referred to as GRN) is located on chromosome 17. In some embodiments, the GRN gene encodes a peptide represented by NCBI Reference Sequence NP_002078.1 (SEQ ID NO: 67). In some embodiments, the isolated nucleic acid comprises the sequence set forth in SEQ ID NO: 68. In some embodiments, the isolated nucleic acid comprises a codon-optimized PGRN-coding sequence (GRN-coding sequence).
In some aspects, the present disclosure provides an isolated nucleic acid comprising an expression construct encoding a first gene product and a second gene product, wherein each gene product is independently selected from a gene product set forth in Table 1, or a portion thereof, and an inhibitory nucleic acid targeting a gene or gene product set forth in Table 1.
In some embodiments, a gene product is encoded by the coding portion (e.g., a cDNA) of a naturally occurring gene. In some embodiments, the first gene product is a protein (or a fragment thereof) encoded by the GBA1 gene. In some embodiments, a gene product is a protein (or a fragment thereof) encoded by another gene listed in Table 1, for example the SCARB2/LIMP2 gene or the PSAP gene. However, the skilled artisan recognizes that the order of expression of the first gene product (e.g., Gcase) and the second gene product (e.g., LIMP2, etc.) can generally be reversed (e.g., LIMP2 is the first gene product and Gcase is the second gene product). In some embodiments, a gene product is a fragment (e.g., a portion) of a gene listed in Table 1. A protein fragment may comprise about 50%, about 60%, about 70%, about 80%, about 90%, or about 99% of a protein encoded by a gene listed in Table 1. In some embodiments, a protein fragment comprises between 50% and 99.9% (e.g., any value between 50% and 99.9%) of a protein encoded by a gene listed in Table 1.
Pathologically, disorders such as PD and Gaucher disease are associated with the accumulation of protein aggregates composed largely of α-synuclein (α-Syn) protein. Accordingly, in some embodiments, an isolated nucleic acid described herein comprises an inhibitory nucleic acid that reduces or prevents expression of α-Syn protein.
In some aspects, the present disclosure provides an isolated nucleic acid comprising an expression construct encoding one or more interfering nucleic acids (e.g., dsRNA, siRNA, miRNA, amiRNA, etc.) that target microtubule-associated protein tau, MAPT (e.g., a gene product of the MAPT gene), which is involved in Alzheimer's disease and FTD-tau.
Generally, an isolated nucleic acid as described herein may encode 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, or more inhibitory nucleic acids (e.g., dsRNA, siRNA, shRNA, miRNA, amiRNA, etc.). In some embodiments, an isolated nucleic acid encodes more than 10 inhibitory nucleic acids. In some embodiments, each of the one or more inhibitory nucleic acids targets a different gene or a different portion of a gene (e.g., a first miRNA targets a first target sequence of a gene and a second miRNA targets a second target sequence of the gene that differs from the first target sequence). In some embodiments, each of the one or more inhibitory nucleic acids targets the same target sequence of the same gene (e.g., the isolated nucleic acid encodes multiple copies of the same miRNA).
In some aspects, the present disclosure provides an isolated nucleic acid comprising an expression construct encoding one or more interfering nucleic acids (e.g., dsRNA, siRNA, miRNA, amiRNA, etc.) that target α-synuclein protein (e.g., a gene product of the SNCA gene). α-Synuclein protein refers to a protein found in brain tissue that plays a role in maintaining the supply of synaptic vesicles at presynaptic terminals by clustering synaptic vesicles and regulating the release of dopamine. In humans, the SNCA gene is located on chromosome 4. In some embodiments, the SNCA gene encodes a peptide represented by NCBI Reference Sequence NP_001139527.1. In some embodiments, the SNCA gene comprises the sequence set forth in SEQ ID NO: 90.
An inhibitory nucleic acid targeting SNCA may comprise a region of complementarity (e.g., a region of the inhibitory nucleic acid that hybridizes to a target gene, such as SNCA) that is between 6 and 50 nucleotides in length. In some embodiments, the inhibitory nucleic acid comprises a region of complementarity with SNCA that is about 6 to 30, about 8 to 20, or about 10 to 19 nucleotides in length. In some embodiments, the inhibitory nucleic acid is complementary to at least 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, or 25 contiguous nucleotides of an SNCA sequence.
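The length criteria above can be checked computationally. The following is a minimal Python sketch, assuming perfect Watson-Crick pairing only (no G:U wobble, no mismatches) and RNA sequences written 5' to 3'. The function names and the toy sequences in the usage note are hypothetical illustrations and are not taken from the sequence listing.

```python
# Illustrative sketch only: measures the longest stretch of contiguous
# target nucleotides that is perfectly complementary to an inhibitory RNA.
# Assumes Watson-Crick pairing and 5'->3' RNA strings (A, C, G, U).

_RNA_COMPLEMENT = str.maketrans("ACGU", "UGCA")


def reverse_complement(rna: str) -> str:
    """Return the reverse complement of an RNA sequence."""
    return rna.upper().translate(_RNA_COMPLEMENT)[::-1]


def longest_complementary_run(inhibitor: str, target: str) -> int:
    """Length of the longest contiguous region of complementarity.

    The reverse complement of the inhibitor is compared with the target
    using a longest-common-substring dynamic programme.
    """
    probe = reverse_complement(inhibitor)
    target = target.upper()
    best = 0
    prev = [0] * (len(target) + 1)
    for p in probe:
        curr = [0] * (len(target) + 1)
        for j, t in enumerate(target, start=1):
            if p == t:
                # Extend the run that ended at the previous base pair.
                curr[j] = prev[j - 1] + 1
                best = max(best, curr[j])
        prev = curr
    return best


def meets_threshold(inhibitor: str, target: str, min_contiguous: int = 6) -> bool:
    """True if the region of complementarity is at least min_contiguous long."""
    return longest_complementary_run(inhibitor, target) >= min_contiguous
```

For example, with the toy target `AUGGAUGUAUUCAUGAAAGG` and the toy inhibitor `UCAUGAAUAC`, `meets_threshold(inhibitor, target, min_contiguous=6)` evaluates the 6-nucleotide lower bound recited above. The same check applies to any other target named in this section.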
In some aspects, the present disclosure provides an isolated nucleic acid comprising an expression construct encoding one or more interfering nucleic acids (e.g., dsRNA, siRNA, miRNA, amiRNA, etc.) that target TMEM106B protein (e.g., a gene product of the TMEM106B gene). TMEM106B protein refers to transmembrane protein 106B, a protein involved in dendrite morphogenesis and the regulation of lysosomal trafficking. In humans, the TMEM106B gene is located on chromosome 7. In some embodiments, the TMEM106B gene encodes a peptide represented by NCBI Reference Sequence NP_060844.2. In some embodiments, the TMEM106B gene comprises the sequence set forth in SEQ ID NO: 91.
An inhibitory nucleic acid targeting TMEM106B may comprise a region of complementarity (e.g., a region of the inhibitory nucleic acid that hybridizes to a target gene, such as TMEM106B) that is between 6 and 50 nucleotides in length. In some embodiments, the inhibitory nucleic acid comprises a region of complementarity with TMEM106B that is about 6 to 30, about 8 to 20, or about 10 to 19 nucleotides in length. In some embodiments, the inhibitory nucleic acid is complementary to at least 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, or 25 contiguous nucleotides of a TMEM106B sequence.
In some aspects, the present disclosure provides an isolated nucleic acid comprising an expression construct encoding one or more interfering nucleic acids (e.g., dsRNA, siRNA, miRNA, amiRNA, etc.) that target ribosomal protein S25 (RPS25) (e.g., a gene product of RPS25). RPS25 protein refers to a ribosomal protein that is a component of the 40S ribosomal subunit, a protein complex involved in protein synthesis. In humans, the RPS25 gene is located on chromosome 11. In some embodiments, the RPS25 gene encodes a peptide represented by NCBI Reference Sequence NP_001019.1. In some embodiments, the RPS25 gene comprises the sequence set forth in SEQ ID NO: 113.
An inhibitory nucleic acid targeting RPS25 may comprise a region of complementarity (e.g., a region of the inhibitory nucleic acid that hybridizes to a target gene, such as RPS25) that is between 6 and 50 nucleotides in length. In some embodiments, the inhibitory nucleic acid comprises a region of complementarity with RPS25 that is about 6 to 30, about 8 to 20, or about 10 to 19 nucleotides in length. In some embodiments, the inhibitory nucleic acid is complementary to at least 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, or 25 contiguous nucleotides of an RPS25 sequence.
In some aspects, the present disclosure provides an isolated nucleic acid comprising an expression construct encoding one or more interfering nucleic acids (e.g., dsRNA, siRNA, miRNA, amiRNA, etc.) that target microtubule-associated protein tau, MAPT (e.g., a gene product of the MAPT gene). MAPT protein refers to microtubule-associated protein tau, a protein involved in microtubule stabilization. In humans, the MAPT gene is located on chromosome 17. In some embodiments, the MAPT gene encodes a peptide represented by NCBI Reference Sequence NP_005901.2. In some embodiments, the MAPT gene comprises the sequence set forth in SEQ ID NO: 114.
An inhibitory nucleic acid targeting MAPT may comprise a region of complementarity (e.g., a region of the inhibitory nucleic acid that hybridizes to a target gene, such as MAPT) that is between 6 and 50 nucleotides in length. In some embodiments, the inhibitory nucleic acid comprises a region of complementarity with MAPT that is about 6 to 30, about 8 to 20, or about 10 to 19 nucleotides in length. In some embodiments, the inhibitory nucleic acid is complementary to at least 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, or 25 contiguous nucleotides of a MAPT sequence.
Aspects of the present disclosure relate to isolated nucleic acids encoding one or more gene products (e.g., 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, or more gene products). In some embodiments, the one or more gene products are two or more proteins. In some embodiments, the one or more gene products are two or more inhibitory nucleic acids. In some embodiments, the one or more gene products are one or more proteins and one or more inhibitory nucleic acids. In some aspects, the present disclosure provides an isolated nucleic acid comprising an expression construct encoding a first gene product and a second gene product, wherein each gene product is independently selected from a gene product set forth in Table 1, or a portion thereof, and an inhibitory nucleic acid targeting a gene or gene product set forth in Table 1. A sequence encoding an inhibitory nucleic acid may be located in an untranslated region (e.g., an intron, 5' UTR, 3' UTR, etc.) of the expression vector.
In some embodiments, a gene product is encoded by the coding portion (e.g., a cDNA) of a naturally occurring gene. In some embodiments, the first gene product is a protein (or a fragment thereof) encoded by the GBA1 gene. In some embodiments, a gene product is an inhibitory nucleic acid that targets (e.g., hybridizes to, or comprises a region of complementarity with) a PD-associated gene (e.g., SNCA). The skilled artisan recognizes that the order of expression of the first gene product (e.g., Gcase) and the second gene product (e.g., an inhibitory RNA targeting SNCA) can generally be reversed (e.g., the inhibitory RNA is the first gene product and Gcase is the second gene product). In some embodiments, a gene product is a fragment (e.g., a portion) of a gene listed in Table 1. A protein fragment may comprise about 50%, about 60%, about 70%, about 80%, about 90%, or about 99% of a protein encoded by a gene listed in Table 1. In some embodiments, a protein fragment comprises between 50% and 99.9% (e.g., any value between 50% and 99.9%) of a protein encoded by a gene listed in Table 1. In some embodiments, a gene product (e.g., an inhibitory RNA) hybridizes to a portion of a target gene (e.g., is complementary to 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, or more contiguous nucleotides of a target gene, for example SNCA). In some embodiments, the expression construct is monocistronic (e.g., the expression construct encodes a single fusion protein comprising the first gene product and the second gene product). In some embodiments, the expression construct is polycistronic (e.g., the expression construct encodes two distinct gene products, for example two different proteins or protein fragments).
A polycistronic expression vector may comprise one or more (e.g., 1, 2, 3, 4, 5, or more) promoters. Any suitable promoter can be used, for example a constitutive promoter, an inducible promoter, an endogenous promoter, a tissue-specific promoter (e.g., a CNS-specific promoter), etc. In some embodiments, a promoter is a chicken beta-actin promoter (CBA promoter), a CAG promoter (for example as described by Alexopoulou et al. (2008) BMC Cell Biol. 9:2; doi: 10.1186/1471-2121-9-2), a CD68 promoter, or a JeT promoter (for example as described by Tornøe et al. (2002) Gene 297(1-2):21-32). In some embodiments, a promoter is operably linked to a nucleic acid sequence encoding the first gene product, the second gene product, or both the first gene product and the second gene product. In some embodiments, an expression cassette comprises one or more additional regulatory sequences, including but not limited to transcription factor binding sequences, intron splice sites, poly(A) addition sites, enhancer sequences, repressor binding sites, or any combination of the foregoing.
In some embodiments, a nucleic acid sequence encoding the first gene product and a nucleic acid sequence encoding the second gene product are separated by a nucleic acid sequence encoding an internal ribosome entry site (IRES). Examples of IRES sites are described, for example, by Mokrejs et al. (2006) Nucleic Acids Res. 34(Database issue):D125-30. In some embodiments, a nucleic acid sequence encoding the first gene product and a nucleic acid sequence encoding the second gene product are separated by a nucleic acid sequence encoding a self-cleaving peptide. Examples of self-cleaving peptides include, but are not limited to, T2A, P2A, E2A, F2A, BmCPV 2A, and BmIFV 2A, and those described by Liu et al. (2017) Sci Rep. 7: 2193. In some embodiments, the self-cleaving peptide is a T2A peptide.
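As an illustration of the self-cleaving peptide arrangement, the Python sketch below joins two coding sequences through a 2A linker. It assumes the usual practical constraints for such a design: the upstream coding sequence has its stop codon removed, and every segment is a whole number of codons so that the downstream coding sequence stays in frame. The function names are hypothetical, and the linker in the usage note is a placeholder rather than a real T2A sequence.

```python
# Illustrative sketch only: in-frame assembly of CDS1 - 2A linker - CDS2.
# Sequences are DNA strings written 5'->3'; all names are hypothetical.

STOP_CODONS = {"TAA", "TAG", "TGA"}


def strip_stop(cds: str) -> str:
    """Remove a terminal stop codon so translation continues into the linker."""
    cds = cds.upper()
    if len(cds) % 3:
        raise ValueError("coding sequence length must be a multiple of 3")
    return cds[:-3] if cds[-3:] in STOP_CODONS else cds


def join_with_2a(first_cds: str, linker_2a: str, second_cds: str) -> str:
    """Return a single open reading frame: first CDS, 2A linker, second CDS."""
    linker = linker_2a.upper()
    if len(linker) % 3:
        raise ValueError("linker length must be a multiple of 3")
    downstream = second_cds.upper()
    if len(downstream) % 3 or downstream[-3:] not in STOP_CODONS:
        raise ValueError("second CDS must be in frame and end with a stop codon")
    return strip_stop(first_cds) + linker + downstream
```

For instance, `join_with_2a("ATGAAATAA", "GGATCC", "ATGCCCTGA")` uses toy coding sequences and a six-nucleotide placeholder linker. An IRES-based design differs in that each coding sequence keeps its own start and stop codons, so no frame constraint links the two.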
In some embodiments, an inhibitory nucleic acid is located in an intron of the expression construct, for example in an intron upstream of the sequence encoding the first gene product. An inhibitory nucleic acid can be a double-stranded RNA (dsRNA), siRNA, microRNA (miRNA), artificial miRNA (amiRNA), or an RNA aptamer. Generally, an inhibitory nucleic acid binds to (e.g., hybridizes with) between about 6 and about 30 (e.g., any integer between 6 and 30, inclusive) contiguous nucleotides of a target RNA (e.g., mRNA). In some embodiments, the inhibitory nucleic acid molecule is a miRNA or an amiRNA, for example a miRNA that targets SNCA (the gene encoding α-Syn protein) or TMEM106B (e.g., the gene encoding TMEM106B protein). In some embodiments, the miRNA does not comprise any mismatches with the region of SNCA mRNA to which it hybridizes (e.g., the miRNA is "perfected"). In some embodiments, the inhibitory nucleic acid is an shRNA (e.g., an shRNA targeting SNCA or TMEM106B). In some embodiments, the inhibitory nucleic acid is an artificial miRNA (amiRNA) that comprises a miR-155 scaffold and an SNCA or TMEM106B targeting sequence.
In some embodiments, an inhibitory nucleic acid is an artificial microRNA (amiRNA). A microRNA (miRNA) typically refers to a small, non-coding RNA found in plants and animals that functions in transcriptional and post-translational regulation of gene expression. MiRNAs are transcribed by RNA polymerase to form a hairpin-loop structure referred to as a pri-miRNA, which is subsequently processed by enzymes (e.g., Drosha, Pasha, the spliceosome, etc.) to form a pre-miRNA hairpin structure. The pre-miRNA is then processed by Dicer to form a miRNA/miRNA* duplex (where * indicates the passenger strand of the miRNA duplex), one strand of which is incorporated into an RNA-induced silencing complex (RISC). In some embodiments, an inhibitory RNA as described herein is a miRNA targeting SNCA or TMEM106B.
In some embodiments, an inhibitory nucleic acid targeting SNCA comprises a miRNA/miRNA* duplex. In some embodiments, the miRNA strand of a miRNA/miRNA* duplex comprises or consists of the sequence set forth in any one of SEQ ID NOs: 20-25. In some embodiments, the miRNA* strand of a miRNA/miRNA* duplex comprises or consists of the sequence set forth in any one of SEQ ID NOs: 20-25.
In some embodiments, an inhibitory nucleic acid targeting TMEM106B comprises a miRNA/miRNA* duplex. In some embodiments, the miRNA strand of a miRNA/miRNA* duplex comprises or consists of the sequence set forth in SEQ ID NO: 92 or 93. In some embodiments, the miRNA* strand of a miRNA/miRNA* duplex comprises or consists of the sequence set forth in SEQ ID NO: 92 or 93.
An artificial microRNA (amiRNA) is derived by modifying a native miRNA to replace natural targeting regions of pre-mRNA with a targeting region of interest. For example, a naturally occurring, expressed miRNA can be used as a scaffold or backbone (e.g., a pri-miRNA scaffold), with the stem sequence replaced by that of a miRNA targeting a gene of interest. An artificial precursor microRNA (pre-amiRNA) is normally processed such that one single stable small RNA is preferentially generated. In some embodiments, the scAAV vectors and scAAVs described herein comprise a nucleic acid encoding an amiRNA. In some embodiments, the pri-miRNA scaffold of the amiRNA is derived from a pri-miRNA selected from the group consisting of pri-MIR-21, pri-MIR-22, pri-MIR-26a, pri-MIR-30a, pri-MIR-33, pri-MIR-122, pri-MIR-375, pri-MIR-199, pri-MIR-99, pri-MIR-194, pri-MIR-155, and pri-MIR-451. In some embodiments, an amiRNA comprises a nucleic acid sequence targeting SNCA or TMEM106B and an eSIBR amiRNA scaffold, for example as described by Fowler et al. Nucleic Acids Res. 2016 Mar 18; 44(5): e48.
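The scaffold-substitution idea can be pictured with a short Python sketch. This is a deliberately simplified model: the guide is placed on the 5' arm of the hairpin, the passenger strand is its exact reverse complement, and the flank and loop sequences are supplied by the caller. Real pri-miRNA scaffolds often introduce bulges or mismatches in the passenger strand and may place the guide on either arm, so the layout, the `Scaffold` class, and `build_amirna` are all hypothetical.

```python
# Illustrative sketch only: swaps a targeting (guide) sequence into a
# simplified hairpin scaffold. Flank and loop sequences are placeholders.
from dataclasses import dataclass

_RNA_COMPLEMENT = str.maketrans("ACGU", "UGCA")


def reverse_complement(rna: str) -> str:
    """Return the reverse complement of an RNA sequence (5'->3')."""
    return rna.upper().translate(_RNA_COMPLEMENT)[::-1]


@dataclass(frozen=True)
class Scaffold:
    """Parts of a scaffold that stay constant when the stem is replaced."""
    five_prime_flank: str
    loop: str
    three_prime_flank: str


def build_amirna(scaffold: Scaffold, guide: str) -> str:
    """Assemble 5' flank + guide + loop + passenger + 3' flank."""
    guide = guide.upper()
    if not guide or set(guide) - set("ACGU"):
        raise ValueError("guide must be a non-empty RNA sequence (A, C, G, U)")
    passenger = reverse_complement(guide)  # simplification: perfect pairing
    return (
        scaffold.five_prime_flank.upper()
        + guide
        + scaffold.loop.upper()
        + passenger
        + scaffold.three_prime_flank.upper()
    )
```

Calling `build_amirna(Scaffold("GGCC", "UUCG", "GGCC"), "UCAUGAAUAC")` with these toy parts yields a hairpin whose two arms can base-pair. Changing the target only requires changing the guide, which mirrors the reuse of one scaffold across several targets.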
In some embodiments, an amiRNA targeting SNCA comprises or consists of the sequence set forth in any one of SEQ ID NOs: 94-99. In some embodiments, an amiRNA targeting TMEM106B comprises or consists of the sequence set forth in SEQ ID NOs: 65-66. In some embodiments, an amiRNA targeting RPS25 comprises or consists of the sequence set forth in SEQ ID NOs: 115-122. In some embodiments, an amiRNA targeting MAPT comprises or consists of the sequence set forth in SEQ ID NOs: 123-138.
In some embodiments, an isolated nucleic acid or vector (e.g., rAAV vector) described by the disclosure comprises or consists of a sequence set forth in any one of SEQ ID NOs: 1-13, 15, 17, 19-29, 31, 32, 34, 36, 38-44, 46, 48, 50-54, 56, 58-62, 64-66, and 68-145. In some embodiments, an isolated nucleic acid or vector (e.g., rAAV vector) described by the disclosure comprises or consists of a sequence that is complementary to (e.g., the complement of) a sequence set forth in any one of SEQ ID NOs: 1-13, 15, 17, 19-29, 31, 32, 34, 36, 38-44, 46, 48, 50-54, 56, 58-62, 64-66, and 68-145. In some embodiments, an isolated nucleic acid or vector (e.g., rAAV vector) described by the disclosure comprises or consists of a sequence that is the reverse complement of a sequence set forth in any one of SEQ ID NOs: 1-13, 15, 17, 19-29, 31, 32, 34, 36, 38-44, 46, 48, 50-54, 56, 58-62, 64-66, and 68-145. In some embodiments, an isolated nucleic acid or vector (e.g., rAAV vector) described by the disclosure comprises or consists of a portion of a sequence set forth in any one of SEQ ID NOs: 1-13, 15, 17, 19-29, 31, 32, 34, 36, 38-44, 46, 48, 50-54, 56, 58-62, 64-66, and 68-145. A portion may comprise at least 25%, 50%, 60%, 70%, 80%, 90%, 95%, or 99% of a sequence set forth in any one of SEQ ID NOs: 1-13, 15, 17, 19-29, 31, 32, 34, 36, 38-44, 46, 48, 50-54, 56, 58-62, 64-66, and 68-145. In some embodiments, a nucleic acid sequence described by the disclosure is a nucleic acid sense strand (e.g., 5' to 3' strand), or in the context of a viral sequence, a plus (+) strand. In some embodiments, a nucleic acid sequence described by the disclosure is a nucleic acid antisense strand (e.g., 3' to 5' strand), or in the context of a viral sequence, a minus (-) strand.
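The relationships among a sequence, its complement, its reverse complement, and a portion of it can be illustrated with a minimal Python sketch. The helper names and the example sequences are hypothetical placeholders, not any SEQ ID NO of the disclosure, and the sketch assumes that a "portion" is a contiguous stretch of the reference sequence, which the text does not state explicitly.

```python
# Illustrative sketch only; sequences used with these helpers are placeholders.
_DNA_COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def complement(seq: str) -> str:
    """Return the base-by-base complement, read in the same direction."""
    return seq.translate(_DNA_COMPLEMENT)


def reverse_complement(seq: str) -> str:
    """Return the complement read in the opposite direction (opposite strand, 5' to 3')."""
    return complement(seq)[::-1]


def is_portion(candidate: str, reference: str, min_fraction: float = 0.25) -> bool:
    """True if candidate is a contiguous part of reference (assumption) that
    covers at least min_fraction of the reference length."""
    if not reference or candidate not in reference:
        return False
    return len(candidate) / len(reference) >= min_fraction
```

For example, `reverse_complement("ATGC")` gives `"GCAT"`, and `is_portion("ATGC", "ATGCATGC")` holds because the candidate covers half of the reference.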
The skilled artisan recognizes that, when referring to nucleic acid sequences comprising or encoding inhibitory nucleic acids (e.g., dsRNA, siRNA, miRNA, amiRNA, etc.), any one or more thymidine (T) nucleotides or uridine (U) nucleotides in a sequence provided herein may be replaced with any other nucleotide suitable for base pairing (e.g., via Watson-Crick base pairing) with an adenosine nucleotide. For example, T may be replaced with U, and U may be replaced with T.
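The T/U interchange can be sketched in Python as follows. The function names are hypothetical and the example strings are placeholders; the sketch covers only the T-for-U and U-for-T example given above, not other adenosine-pairing nucleotides.

```python
def dna_to_rna(seq: str) -> str:
    """Write a sequence in RNA notation by replacing each T with U."""
    return seq.replace("T", "U").replace("t", "u")


def rna_to_dna(seq: str) -> str:
    """Write a sequence in DNA notation by replacing each U with T."""
    return seq.replace("U", "T").replace("u", "t")


def same_ignoring_t_u(a: str, b: str) -> bool:
    """True if two sequences differ at most by T/U notation (case-insensitive)."""
    return dna_to_rna(a.upper()) == dna_to_rna(b.upper())
```

Under this comparison, a DNA-notation sequence encoding an inhibitory RNA and the corresponding RNA-notation sequence are treated as the same sequence.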
An isolated nucleic acid as described herein may exist on its own or as part of a vector. Generally, a vector can be a plasmid, cosmid, phagemid, bacterial artificial chromosome (BAC), or viral vector (e.g., an adenoviral vector, adeno-associated virus (AAV) vector, retroviral vector, baculoviral vector, etc.). In some embodiments, the vector is a plasmid (e.g., a plasmid comprising an isolated nucleic acid as described herein). In some embodiments, the rAAV vector is single-stranded (e.g., single-stranded DNA). In some embodiments, the vector is a recombinant AAV (rAAV) vector. In some embodiments, the vector is a baculovirus vector (e.g., an Autographa californica nuclear polyhedrosis (AcNPV) vector).
Typically an rAAV vector (e.g., rAAV genome) comprises a transgene (e.g., an expression construct comprising one or more of each of the following: a promoter, an intron, an enhancer sequence, a protein coding sequence, an inhibitory RNA coding sequence, a polyA tail sequence, etc.) flanked by two AAV inverted terminal repeat (ITR) sequences. In some embodiments, the transgene of the rAAV vector comprises an isolated nucleic acid as described by the disclosure. In some embodiments, each of the two ITR sequences of the rAAV vector is a full-length ITR (e.g., approximately 145 bp in length and containing a functional Rep binding site (RBS) and terminal resolution site (trs)). In some embodiments, one of the ITRs of the rAAV vector is truncated (e.g., shortened or not full-length). In some embodiments, a truncated ITR lacks a functional terminal resolution site (trs) and is used for production of self-complementary AAV vectors (scAAV vectors). In some embodiments, a truncated ITR is a ΔITR, for example as described by McCarty et al. (2003) Gene Ther. 10(26):2112-8.
Aspects of the disclosure relate to isolated nucleic acids (e.g., rAAV vectors) comprising an ITR having one or more modifications (e.g., nucleic acid additions, deletions, substitutions, etc.) relative to a wild-type AAV ITR, for example relative to the wild-type AAV2 ITR (e.g., SEQ ID NO: 29). The structure of the wild-type AAV2 ITR is shown in FIG. 20. Generally, a wild-type ITR comprises a 125 nucleotide region that self-anneals to form a palindromic double-stranded T-shaped hairpin structure consisting of two cross arms (formed by sequences referred to as B/B' and C/C', respectively), a longer stem region (formed by sequences A/A'), and a single-stranded terminal region referred to as the "D" region (FIG. 20). Generally, the "D" region of an ITR is positioned between the stem region formed by the A/A' sequences and the insert containing the transgene of the rAAV vector (e.g., positioned on the "inside" of the ITR relative to the terminus of the ITR, or proximal to the transgene insert or expression construct of the rAAV vector). In some embodiments, the "D" region comprises the sequence set forth in SEQ ID NO: 27. The "D" region has been observed to play an important role in encapsidation of rAAV vectors by capsid proteins, for example as disclosed by Ling et al. (2015) J Mol Genet Med 9(3).
The disclosure is based, in part, on the observation that rAAV vectors comprising a "D" region located on the "outside" of the ITR (e.g., proximal to the terminus of the ITR relative to the transgene insert or expression construct) are more efficiently encapsidated by AAV capsid proteins than rAAV vectors having unmodified (e.g., wild-type) ITRs. In some embodiments, rAAV vectors having a modified "D" sequence (e.g., a "D" sequence in the "outside" position) have reduced toxicity relative to rAAV vectors having wild-type ITR sequences.
In some embodiments, a modified "D" sequence comprises at least one nucleotide substitution relative to a wild-type "D" sequence (e.g., SEQ ID NO: 27). A modified "D" sequence may have at least 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, or more than 10 nucleotide substitutions relative to a wild-type "D" sequence (e.g., SEQ ID NO: 27). In some embodiments, a modified "D" sequence comprises at least 10, 11, 12, 13, 14, 15, 16, 17, 18, or 19 nucleic acid substitutions relative to a wild-type "D" sequence (e.g., SEQ ID NO: 27). In some embodiments, a modified "D" sequence is between about 10% and about 99% (e.g., 10%, 15%, 20%, 25%, 30%, 40%, 45%, 50%, 55%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, or 99%) identical to a wild-type "D" sequence (e.g., SEQ ID NO: 27). In some embodiments, a modified "D" sequence comprises the sequence set forth in SEQ ID NO: 26, also referred to as an "S" sequence, as described in Wang et al. (1995) J Mol Biol 250(5):573-80.
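The substitution counts and percent identity values recited above are related by simple arithmetic, sketched below in Python. This is a minimal illustration that assumes the two sequences have equal length and are compared position by position without gaps; the function names are hypothetical, and any sequences passed in would be placeholders rather than SEQ ID NO: 26 or 27.

```python
def count_substitutions(wild_type: str, modified: str) -> int:
    """Count positions at which two equal-length sequences differ."""
    if len(wild_type) != len(modified):
        raise ValueError("sequences must have equal length for an ungapped comparison")
    return sum(a != b for a, b in zip(wild_type.upper(), modified.upper()))


def percent_identity(wild_type: str, modified: str) -> float:
    """Percentage of positions that are identical between the two sequences."""
    if not wild_type:
        raise ValueError("wild-type sequence must not be empty")
    substitutions = count_substitutions(wild_type, modified)
    return 100.0 * (len(wild_type) - substitutions) / len(wild_type)
```

For a 10-nucleotide placeholder with 2 substitutions, `percent_identity` evaluates to 80.0. Sequences with insertions or deletions would need a proper alignment, which this sketch does not attempt.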
An isolated nucleic acid or rAAV vector as described by the disclosure may further comprise a "TRY" sequence, for example as set forth in SEQ ID NO: 28 or as described by Francois, et al. 2005. The Cellular TATA Binding Protein Is Required for Rep-Dependent Replication of a Minimal Adeno-Associated Virus Type 2 p5 Element. J Virol. In some embodiments, a TRY sequence is positioned between an ITR (e.g., a 5' ITR) and an expression construct (e.g., a transgene-encoding insert) of an isolated nucleic acid or rAAV vector.
Aspects of the disclosure relate to constructs configured to express one or more transgenes in myeloid cells (e.g., CNS myeloid cells, such as microglia) of a subject. Thus, in some embodiments, a construct (e.g., gene expression vector) comprises a protein coding sequence operably linked to a myeloid cell-specific promoter. Examples of myeloid cell-specific promoters include the CD68 promoter, lysM promoter, csf1r promoter, CD11c promoter, c-fes promoter, and F4/80 promoter, for example as described in Lin et al. Adv Exp Med Biol. 2010;706:149-56. In some embodiments, the myeloid cell-specific promoter is a CD68 promoter or an F4/80 promoter.
In some aspects, the disclosure relates to baculovirus vectors comprising an isolated nucleic acid or rAAV vector as described by the disclosure. In some embodiments, the baculovirus vector is an Autographa californica nuclear polyhedrosis (AcNPV) vector, for example as described by Urabe et al. (2002) Hum Gene Ther 13(16):1935-43 and Smith et al. (2009) Mol Ther 17(11):1888-1896.
In some aspects, the disclosure provides a host cell comprising an isolated nucleic acid or vector as described herein. A host cell can be a prokaryotic cell or a eukaryotic cell. For example, a host cell can be a mammalian cell, bacterial cell, yeast cell, insect cell, etc. In some embodiments, the host cell is a mammalian cell, for example a HEK293T cell. In some embodiments, the host cell is a bacterial cell, for example an E. coli cell.
rAAV
In some aspects, the disclosure relates to recombinant AAVs (rAAVs) comprising a transgene that encodes one or more isolated nucleic acids as described herein (e.g., an rAAV vector encoding 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, or more gene products described herein and/or inhibitory nucleic acids targeting gene products described herein). The term "rAAV" generally refers to a viral particle comprising an rAAV vector encapsidated by one or more AAV capsid proteins. An rAAV described by the disclosure may comprise a capsid protein having a serotype selected from AAV1, AAV2, AAV3, AAV4, AAV5, AAV6, AAV7, AAV8, AAV9, and AAV10, or a variant thereof. In some embodiments, the capsid protein is an AAV9 capsid protein or a variant thereof. In some embodiments, an AAV9 capsid protein variant comprises a mutation at one or more positions corresponding to T492, Y705, and Y731 of SEQ ID NO: 147 (e.g., corresponding to these positions in AAV6). In some embodiments, the one or more mutations are selected from T492V, Y705F, Y731F, or a combination thereof. In some embodiments, the AAV9 capsid protein variant comprises the amino acid sequence set forth in SEQ ID NO: 149.
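Substitution labels such as T492V name the wild-type residue, the 1-based position, and the replacement residue. A minimal Python sketch of applying such labels to a protein sequence is given below; the function name is hypothetical, and a real capsid sequence (e.g., SEQ ID NO: 147) would have to be supplied by the user, since none is reproduced here.

```python
import re

# One-letter wild-type residue, 1-based position, one-letter replacement residue.
_SUBSTITUTION = re.compile(r"^([A-Z])(\d+)([A-Z])$")


def apply_substitutions(protein: str, substitutions: list[str]) -> str:
    """Apply point substitutions such as 'T492V' to a protein sequence.

    Raises ValueError if a label is malformed, the position is out of range,
    or the residue found at the position is not the stated wild-type residue.
    """
    residues = list(protein)
    for label in substitutions:
        match = _SUBSTITUTION.match(label)
        if match is None:
            raise ValueError(f"malformed substitution label: {label!r}")
        wild_type, position, replacement = match.group(1), int(match.group(2)), match.group(3)
        if position < 1 or position > len(residues):
            raise ValueError(f"position {position} is outside the sequence")
        if residues[position - 1] != wild_type:
            raise ValueError(f"expected {wild_type} at position {position}")
        residues[position - 1] = replacement
    return "".join(residues)
```

With a toy four-residue input, `apply_substitutions("MTKY", ["T2V", "Y4F"])` yields `"MVKF"`. Checking the wild-type residue before substituting guards against position numbering that does not match the reference sequence.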
In some embodiments, an rAAV comprises a capsid protein from a non-human host, for example a rhesus AAV capsid protein such as AAVrh.10, AAVrh.39, etc. In some embodiments, an rAAV described by the disclosure comprises a capsid protein that is a variant of a wild-type capsid protein, such as a capsid protein variant that includes at least 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, or more than 10 (e.g., 15, 20, 25, 50, 100, etc.) amino acid substitutions (e.g., mutations) relative to the wild-type AAV capsid protein from which it is derived. In some embodiments, an AAV capsid protein variant is an AAV1RX capsid protein, for example as described by Albright et al. Mol Ther. 2018 Feb 7;26(2):510-523. In some embodiments, the capsid protein is AAV1RX and comprises the amino acid sequence set forth in SEQ ID NO: 146 (or is encoded by the nucleic acid sequence set forth in SEQ ID NO: 145). In some embodiments, a capsid protein variant is an AAV TM6 capsid protein, for example as described by Rosario et al. Mol Ther Methods Clin Dev. 2016; 3: 16026. In some embodiments, the AAV6 capsid protein variant is an AAV-TM6 capsid protein and comprises the amino acid sequence set forth in SEQ ID NO: 148.
In some embodiments, rAAVs described by the disclosure readily spread through the CNS, particularly when introduced into the CSF space or directly into the brain parenchyma. Accordingly, in some embodiments, rAAVs described by the disclosure comprise a capsid protein that is capable of crossing the blood-brain barrier (BBB). For example, in some embodiments, an rAAV comprises a capsid protein having an AAV9 or AAVrh.10 serotype. Production of rAAVs is described, for example, by Samulski et al. (1989) J Virol. 63(9):3822-8 and Wright (2009) Hum Gene Ther. 20(7): 698-706. In some embodiments, an rAAV comprises a capsid protein that specifically or preferentially targets myeloid cells, for example microglia. In some embodiments, an rAAV transduces microglia.
In some embodiments, an rAAV as described by the disclosure (e.g., comprising a recombinant rAAV genome encapsidated by AAV capsid proteins to form an rAAV capsid particle) is produced in a baculovirus vector expression system (BEVS). Production of rAAVs using BEVS is described, for example, by Urabe et al. (2002) Hum Gene Ther 13(16):1935-43, Smith et al. (2009) Mol Ther 17(11):1888-1896, U.S. Patent No. 8,945,918, U.S. Patent No. 9,879,282, and International PCT Publication WO 2017/184879. However, an rAAV can be produced using any suitable method (e.g., using recombinant rep and cap genes). In some embodiments, an rAAV as disclosed herein is produced in HEK293 (human embryonic kidney) cells.
Pharmaceutical compositions
In some aspects, the disclosure provides pharmaceutical compositions comprising an isolated nucleic acid or rAAV as described herein and a pharmaceutically acceptable carrier. As used herein, the term "pharmaceutically acceptable" refers to a material, such as a carrier or diluent, which does not abrogate the biological activity or properties of the compound and is relatively non-toxic; for example, the material may be administered to an individual without causing undesirable biological effects or interacting in a deleterious manner with any of the components of the composition in which it is contained.
As used herein, the term "pharmaceutically acceptable carrier" means a pharmaceutically acceptable material, composition, or carrier, such as a liquid or solid filler, stabilizer, dispersing agent, suspending agent, diluent, excipient, thickening agent, solvent, or encapsulating material, involved in carrying or transporting a compound useful within the invention within or to the patient such that it may perform its intended function. Additional ingredients that may be included in the pharmaceutical compositions used in the practice of the invention are known in the art and described, for example, in Remington's Pharmaceutical Sciences (Genaro, Ed., Mack Publishing Co., 1985, Easton, PA), which is incorporated herein by reference.
Compositions (e.g., pharmaceutical compositions) provided herein can be administered by any route, including enteral (e.g., oral), parenteral, intravenous, intramuscular, intra-arterial, intramedullary, intrathecal, subcutaneous, intraventricular, transdermal, intradermal, rectal, intravaginal, intraperitoneal, topical (as by powders, ointments, creams, and/or drops), mucosal, nasal, buccal, sublingual; by intratracheal instillation, bronchial instillation, and/or inhalation; and/or as an oral spray, nasal spray, and/or aerosol. Specifically contemplated routes are oral administration, intravenous administration (e.g., systemic intravenous injection), regional administration via blood and/or lymph supply, and/or direct administration to an affected site. In general, the most appropriate route of administration will depend upon a variety of factors, including the nature of the agent (e.g., its stability in the environment of the gastrointestinal tract) and/or the condition of the subject (e.g., whether the subject is able to tolerate oral administration). In certain embodiments, a compound or pharmaceutical composition described herein is suitable for topical administration to the eye of a subject.
In some embodiments, a composition comprises one or more (e.g., 1, 2, 3, 4, 5, 6, 7, 8, 9, or 10) different rAAVs, each rAAV comprising an isolated nucleic acid encoding a different gene product (e.g., a different protein or inhibitory nucleic acid). The different rAAVs may comprise capsid proteins of the same serotype or of different serotypes.
Methods
Aspects of the disclosure relate to compositions for expression of one or more CNS disease-associated gene products in a subject to treat a CNS-associated disease. The one or more CNS disease-associated gene products may be encoded by one or more isolated nucleic acids or rAAV vectors. In some embodiments, a subject is administered a single vector (e.g., isolated nucleic acid, rAAV, etc.) encoding one or more (e.g., 1, 2, 3, 4, 5, or more) gene products. In some embodiments, a subject is administered a plurality (e.g., 2, 3, 4, 5, or more) of vectors (e.g., isolated nucleic acids, rAAVs, etc.), where each vector encodes a different CNS disease-associated gene product.
A CNS-associated disease may be a neurodegenerative disease, a synucleinopathy, a tauopathy, or a lysosomal storage disease. Examples of neurodegenerative diseases and their associated genes are listed in Table 2.
A "synucleinopathy" refers to a disease or disorder characterized by the accumulation, overexpression, or activity of alpha-synuclein (the gene product of SNCA) in a subject (e.g., relative to a healthy subject, for example a subject not having a synucleinopathy). Examples of synucleinopathies and their associated genes are listed in Table 3.
A "tauopathy" refers to a disease or disorder characterized by the accumulation, overexpression, or activity of Tau protein in a subject (e.g., relative to a healthy subject not having a tauopathy). Examples of tauopathies and their associated genes are listed in Table 4.
A "lysosomal storage disease" refers to a disease characterized by abnormal build-up of toxic cellular products in the lysosomes of a subject. Examples of lysosomal storage diseases and their associated genes are listed in Table 5.
In some embodiments, the disclosure relates to methods of treating a disease selected from Parkinson's disease (e.g., Parkinson's disease with a GBA1 mutation (PD-GBA), sporadic Parkinson's disease (sPD)), Gaucher disease (e.g., neuronopathic Gaucher disease (nGD), Type I Gaucher disease (T1GD), Type II Gaucher disease (T2GD), and Type III Gaucher disease (T3GD)), dementia with Lewy bodies (DLB), amyotrophic lateral sclerosis (ALS), and Niemann-Pick Type C disease (NPC) by administering an isolated nucleic acid encoding GBA1 (e.g., an rAAV vector or rAAV comprising the isolated nucleic acid) to a subject in need thereof.
In some embodiments, the disclosure relates to methods of treating frontotemporal dementia (e.g., frontotemporal dementia with a GRN mutation (FTD-GRN), frontotemporal dementia with a MAPT mutation (FTD-tau), and frontotemporal dementia with a C9ORF72 mutation (FTD-C9orf72)), Parkinson's disease (PD), Alzheimer's disease (AD), neuronal ceroid lipofuscinosis (NCL), corticobasal degeneration (CBD), motor neuron disease (MND), or Gaucher disease (GD) by administering an isolated nucleic acid encoding PGRN (also referred to as GRN) (e.g., an rAAV vector or rAAV comprising the isolated nucleic acid) to a subject in need thereof.
In some embodiments, the disclosure relates to methods of treating a synucleinopathy (e.g., multiple system atrophy (MSA), Parkinson's disease (PD), Parkinson's disease with a GBA1 mutation (PD-GBA), dementia with Lewy bodies (DLB), dementia with Lewy bodies with a GBA1 mutation, and Lewy body disease) by administering an isolated nucleic acid encoding a GBA1 gene product (e.g., an rAAV vector or rAAV comprising the isolated nucleic acid) and an inhibitory nucleic acid targeting SNCA to a subject in need thereof.
In some embodiments, the disclosure relates to methods of treating a disease selected from Parkinson's disease (PD), frontotemporal dementia (e.g., frontotemporal dementia with a GRN mutation (FTD-GRN)), lysosomal storage diseases (LSDs), or Gaucher disease (GD) by administering an isolated nucleic acid encoding PSAP (e.g., an rAAV vector or rAAV comprising the isolated nucleic acid) to a subject in need thereof.
In some embodiments, the disclosure relates to methods of treating Alzheimer's disease (AD), Nasu-Hakola disease (NHD), frontotemporal dementia with a MAPT mutation (FTD-tau), or Parkinson's disease (PD) by administering an isolated nucleic acid encoding TREM2 (e.g., an rAAV vector or rAAV comprising the isolated nucleic acid) to a subject in need thereof.
In some embodiments, the disclosure relates to methods of treating Alzheimer's disease (AD), frontotemporal dementia (e.g., frontotemporal dementia with a MAPT mutation (FTD-tau)), a tauopathy, progressive supranuclear palsy (PSP), a neurodegenerative disease, Lewy body disease (LBD), or Parkinson's disease by administering an isolated nucleic acid encoding an inhibitory nucleic acid targeting MAPT (e.g., an rAAV vector or rAAV comprising the isolated nucleic acid) to a subject in need thereof.
As used herein, "treat" or "treating" refers to (a) preventing or delaying the onset of a CNS disease; (b) reducing the severity of a CNS disease; (c) reducing or preventing the development of symptoms characteristic of a CNS disease; and/or (d) preventing the worsening of symptoms characteristic of a CNS disease. Symptoms of a CNS disease may include, for example, motor dysfunction (e.g., tremor, rigidity, slowness of movement, difficulty walking, paralysis), cognitive dysfunction (e.g., dementia, depression, anxiety, psychosis), memory difficulties, and emotional and behavioral dysfunction.
The present disclosure is based, in part, on compositions for expression, in a subject, of combinations of CNS disease-associated genes (e.g., PD-associated gene products) that act together (e.g., synergistically) to treat a disease.
Accordingly, in some aspects, the present disclosure provides a method of treating a subject having or suspected of having a CNS-associated disease (e.g., Parkinson's disease, AD, FTD, etc.), the method comprising administering to the subject a composition as described by the present disclosure (e.g., a composition comprising an isolated nucleic acid or a vector or an rAAV).
In some embodiments, the subject has one or more signs or symptoms of, or a genetic predisposition to (e.g., a mutation in a gene listed in Table 1), a neurodegenerative disease listed in Table 2. In some embodiments, the subject has one or more signs or symptoms of, or a genetic predisposition to (e.g., a mutation in a gene listed in Table 1), a synucleinopathy listed in Table 3. In some embodiments, the subject has one or more signs or symptoms of, or a genetic predisposition to (e.g., a mutation in a gene listed in Table 1), a tauopathy listed in Table 4. In some embodiments, the subject has one or more signs or symptoms of, or a genetic predisposition to (e.g., a mutation in a gene listed in Table 1), a lysosomal storage disease listed in Table 5.
The present disclosure is based, in part, on compositions for expression of one or more CNS disease-associated gene products in a subject to treat Gaucher disease. In some embodiments, the Gaucher disease is a neuronopathic Gaucher disease, for example Type 2 Gaucher disease or Type 3 Gaucher disease. In some embodiments, the subject does not have PD or PD symptoms.
Accordingly, in some aspects, the present disclosure provides a method of treating a subject having or suspected of having neuronopathic Gaucher disease, the method comprising administering to the subject a composition as described by the present disclosure (e.g., a composition comprising an isolated nucleic acid or a vector or an rAAV).
The present disclosure is based, in part, on compositions for expression of one or more CNS disease-associated gene products in a subject to treat Alzheimer's disease or frontotemporal dementia (FTD). In some embodiments, the subject does not have Alzheimer's disease.
Accordingly, in some aspects, the present disclosure provides a method of treating a subject having or suspected of having FTD, the method comprising administering to the subject a composition as described by the present disclosure (e.g., a composition comprising an isolated nucleic acid or a vector or an rAAV). In some embodiments, an rAAV encoding progranulin (PGRN; also referred to as GRN) or a portion thereof is administered to a subject having Alzheimer's disease or frontotemporal dementia (FTD).
In some aspects, the present disclosure provides a method of delivering a transgene to microglia, the method comprising administering to a subject an rAAV as described herein.
In some embodiments, an rAAV encoding a Gcase protein for treating Type 2 or Type 3 Gaucher disease, or Parkinson's disease with a GBA1 mutation, is administered to a subject as a single dose, and the rAAV is not administered to the subject thereafter.
In some embodiments, an rAAV encoding a Gcase protein is administered via a single suboccipital injection into the cisterna magna. In some embodiments, the injection into the cisterna magna is performed under radiographic guidance.
The subject is typically a mammal, preferably a human. In some embodiments, the subject is between the ages of 1 month and 10 years (e.g., 1 month, 2 months, 3 months, 4 months, 5 months, 6 months, 7 months, 8 months, 9 months, 10 months, 11 months, 12 months, 13 months, 14 months, 15 months, 16 months, 17 months, 18 months, 19 months, 20 months, 21 months, 22 months, 23 months, 24 months, 3 years, 4 years, 5 years, 6 years, 7 years, 8 years, 9 years, 10 years, or any age therebetween). In some embodiments, the subject is between 2 and 20 years of age. In some embodiments, the subject is between 30 and 100 years of age. In some embodiments, the subject is older than 55 years of age.
In some embodiments, the composition is administered directly to the CNS of the subject, for example by direct injection into the brain and/or spinal cord of the subject. Examples of CNS-direct administration modalities include, but are not limited to, intracerebral injection, intraventricular injection, intracisternal injection, intraparenchymal injection, intrathecal injection, and any combination of the foregoing. In some embodiments, the composition is administered to the subject by intra-cisterna magna (ICM) injection. In some embodiments, direct injection into the CNS of a subject results in transgene expression (e.g., expression of the first gene product, the second gene product, and, if applicable, the third gene product) in the midbrain, striatum, and/or cerebral cortex of the subject. In some embodiments, direct injection into the CNS results in transgene expression (e.g., expression of the first gene product, the second gene product, and, if applicable, the third gene product) in the spinal cord and/or CSF of the subject.
In some embodiments, direct injection into the CNS of a subject comprises convection-enhanced delivery (CED). Convection-enhanced delivery is a therapeutic strategy that involves surgical exposure of the brain and placement of a small-diameter catheter directly into a target area of the brain, followed by infusion of a therapeutic agent (e.g., a composition or rAAV as described herein) directly into the brain of the subject. CED is described, for example, by Debinski et al. (2009) Expert Rev Neurother. 9(10):1519-27.
In some embodiments, the composition is administered peripherally to the subject, for example by peripheral injection. Examples of peripheral injection include subcutaneous injection, intravenous injection, intra-arterial injection, intraperitoneal injection, or any combination of the foregoing. In some embodiments, the peripheral injection is intra-arterial injection, for example injection into the carotid artery of the subject.
In some embodiments, a composition as described by the present disclosure (e.g., a composition comprising an isolated nucleic acid or a vector or an rAAV) is administered both peripherally and directly to the CNS of a subject. For example, in some embodiments, a subject is administered a composition by intra-arterial injection (e.g., injection into the carotid artery) and by intraparenchymal injection (e.g., intraparenchymal injection by CED). In some embodiments, the direct injection to the CNS and the peripheral injection are simultaneous (e.g., happen at the same time). In some embodiments, the direct injection occurs before the peripheral injection (e.g., between 1 minute and 1 week, or earlier). In some embodiments, the direct injection occurs after the peripheral injection (e.g., between 1 minute and 1 week, or later).
In some embodiments, the subject is administered an immunosuppressant prior to (e.g., between 1 month and 1 minute prior to) or at the same time as a composition as described herein. In some embodiments, the immunosuppressant is a corticosteroid (e.g., prednisone, budesonide, etc.), an mTOR inhibitor (e.g., sirolimus, everolimus, etc.), an antibody (e.g., adalimumab, etanercept, natalizumab, etc.), or methotrexate.
The amount of a composition as described by the present disclosure (e.g., a composition comprising an isolated nucleic acid or a vector or an rAAV) administered to a subject will vary depending on the administration method. For example, in some embodiments, an rAAV as described herein is administered to a subject at a titer between about 10^9 genome copies (GC)/kg and about 10^14 GC/kg (e.g., about 10^9 GC/kg, about 10^10 GC/kg, about 10^11 GC/kg, about 10^12 GC/kg, about 10^12 GC/kg, or about 10^14 GC/kg). In some embodiments, a subject is administered a high titer (e.g., >10^12 GC/kg of an rAAV) by injection into the CSF space or by intraparenchymal injection. In some embodiments, an rAAV as described herein is administered to a subject by intravenous injection at a dose ranging from about 1 x 10^10 vector genomes (vg) to about 1 x 10^17 vg. In some embodiments, an rAAV as described herein is administered to a subject by injection into the cisterna magna at a dose ranging from about 1 x 10^10 vg to about 1 x 10^16 vg.
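By way of illustration only, a weight-based titer of the kind recited above can be converted into a total number of genome copies for a given subject. The following Python sketch is not part of the disclosure; the function names and the 10 kg body weight used in the example are hypothetical and chosen only to show the arithmetic.

```python
# Illustrative sketch only: function names and the example body weight
# are assumptions, not values taken from the disclosure.

LOW_TITER_GC_PER_KG = 1e9    # lower end of the recited range (about 10^9 GC/kg)
HIGH_TITER_GC_PER_KG = 1e14  # upper end of the recited range (about 10^14 GC/kg)


def total_genome_copies(titer_gc_per_kg: float, body_weight_kg: float) -> float:
    """Return total genome copies delivered at a weight-based titer."""
    if titer_gc_per_kg <= 0 or body_weight_kg <= 0:
        raise ValueError("titer and body weight must be positive")
    return titer_gc_per_kg * body_weight_kg


def within_recited_range(titer_gc_per_kg: float) -> bool:
    """Check whether a titer falls inside the recited GC/kg range."""
    return LOW_TITER_GC_PER_KG <= titer_gc_per_kg <= HIGH_TITER_GC_PER_KG


if __name__ == "__main__":
    # Hypothetical subject weighing 10 kg dosed at 10^12 GC/kg.
    dose = total_genome_copies(1e12, 10.0)
    print(f"total dose: {dose:.1e} GC, in range: {within_recited_range(1e12)}")
```

Because the ranges are recited as "about" values, the range check above should be read as a rough guide rather than a hard limit.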
A composition as described by the present disclosure (e.g., a composition comprising an isolated nucleic acid or a vector or an rAAV) can be administered to a subject once or multiple times (e.g., 2, 3, 4, 5, 6, 7, 8, 9, 10, 20, or more times). In some embodiments, the composition is administered to the subject continuously (e.g., chronically), for example via an infusion pump.
<표 2><Table 2>
Examples of neurodegenerative diseases
<표 3><Table 3>
Examples of synucleinopathies
<표 4><Table 4>
Examples of tauopathies
<표 5><Table 5>
Examples of lysosomal storage diseases
Examples
Example 1: rAAV vectors
AAV vectors are generated using cells, such as HEK293 cells, for triple-plasmid transfection. The ITR sequences flank an expression construct comprising a promoter/enhancer element for each transgene of interest, a 3' polyA signal, and post-transcriptional signals such as the WPRE element. Multiple gene products, such as GBA1 and LIMP2 and/or prosaposin, can be expressed simultaneously by fusion of the protein sequences; or using a 2A peptide linker, such as T2A or P2A, which yields two peptide fragments with added amino acids because formation of a peptide bond is prevented; or using an IRES element; or by expression from two separate expression cassettes. The presence of a short intronic sequence that is efficiently spliced, upstream of the expressed gene, can improve expression levels. shRNAs and other regulatory RNAs can potentially be included within these sequences. Examples of expression constructs described by the present disclosure are shown in FIGs. 1-8, 21-35, 39 and 41-51, and in Table 6 below.
<표 6><Table 6>
Example 2: Cell-based assays of viral transduction into GBA-deficient cells
Cells deficient in GBA1 are obtained, for example, as fibroblasts, monocytes, or hES cells from GD patients, or as patient-derived induced pluripotent stem cells (iPSCs). These cells accumulate substrates such as glucosylceramide and glucosylsphingosine (GlcCer and GlcSph). Treatment of wild-type or mutant cultured cell lines with Gcase inhibitors, such as CBE, is also used to obtain GBA-deficient cells.
Using such cell models, lysosomal defects are quantified in terms of accumulation of protein aggregates, such as α-synuclein, using an antibody against this protein or against phospho-αSyn, followed by imaging using fluorescence microscopy. Imaging for lysosomal abnormalities is also performed by ICC for protein markers such as LAMP1, LAMP2, LIMP1, and LIMP2, or using dyes such as Lysotracker, or by uptake of fluorescent dextran or other markers through the endocytic compartment. Imaging for accumulation of autophagy markers, such as LC3, due to defective fusion with the lysosome can also be performed. Western blotting and/or ELISA is used to quantify abnormal accumulation of these markers. In addition, the accumulation of glycolipid substrates and products of GBA1 is measured using standard approaches.
Therapeutic endpoints (e.g., reduction of PD-associated pathology) are measured in the context of expression from transduced AAV vectors, to confirm and quantify activity and function. Gcase can also be quantified using protein ELISA measures or by standard Gcase activity assays.
Example 3: In vivo assays using mutant mice
This example describes in vivo assays of AAV vectors using mutant mice. In vivo studies of AAV vectors as above in mutant mice are performed using assays described, for example, by Liou et al. (2006) J. Biol. Chem. 281(7):4242-4253, Sun et al. (2005) J. Lipid Res. 46:2102-2113, and Farfel-Becker et al. (2011) Dis. Model Mech. 4(6):746-752.
Intrathecal or intraventricular delivery of vehicle control and AAV vectors (e.g., at a dose of 2 x 10^11 vg/mouse) is performed using concentrated AAV stocks, for example at an injection volume between 5 and 10 μL. Intraparenchymal delivery by convection-enhanced delivery is performed.
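As an illustrative aside, the stated dose and injection volumes imply a minimum titer for the concentrated AAV stock. The Python sketch below works through that arithmetic; the helper name is hypothetical, and the calculation assumes the full dose is delivered in a single injection of the stated volume.

```python
# Illustrative sketch only: the helper name is an assumption, and the full
# dose is assumed to be delivered in one injection of the given volume.

def required_stock_titer_vg_per_ml(dose_vg: float, injection_volume_ul: float) -> float:
    """Return the stock titer (vg/mL) needed to deliver dose_vg in the given volume."""
    if dose_vg <= 0 or injection_volume_ul <= 0:
        raise ValueError("dose and volume must be positive")
    vg_per_ul = dose_vg / injection_volume_ul
    return vg_per_ul * 1000.0  # 1 mL = 1000 uL


if __name__ == "__main__":
    dose = 2e11  # vg per mouse, as in the example dose above
    for volume_ul in (5.0, 10.0):
        titer = required_stock_titer_vg_per_ml(dose, volume_ul)
        print(f"{volume_ul:.0f} uL injection -> stock of about {titer:.1e} vg/mL")
```

Under these assumptions, a 5 μL injection requires a stock of roughly 4 x 10^13 vg/mL and a 10 μL injection roughly 2 x 10^13 vg/mL, which is consistent with the reference to concentrated stocks.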
Treatment is initiated either before the onset of symptoms or after onset. Endpoints measured are the accumulation of substrate in the CNS and CSF, the accumulation of Gcase enzyme as determined by ELISA and by enzyme activity, motor and cognitive endpoints, lysosomal dysfunction, and the accumulation of α-synuclein monomers, protofibrils, or fibrils.
Example 4: Chemical models of disease
This example describes in vivo assays of AAV vectors using a chemically induced mouse model of Gaucher disease (e.g., the CBE mouse model). In vivo studies of these AAV vectors are performed in a chemically induced mouse model of Gaucher disease, for example as described by Vardi et al. (2016) J Pathol. 239(4):496-509.
Intrathecal or intraventricular delivery of vehicle control and AAV vectors (e.g., at a dose of 2 x 10^11 vg/mouse) is performed using concentrated AAV stocks, for example at an injection volume between 5 and 10 μL. Intraparenchymal delivery by convection-enhanced delivery is performed. Peripheral delivery is achieved by tail vein injection.
Treatment is initiated either before the onset of symptoms or after onset. Endpoints measured are the accumulation of substrate in the CNS and CSF, the accumulation of Gcase enzyme as determined by ELISA and by enzyme activity, motor and cognitive endpoints, lysosomal dysfunction, and the accumulation of α-synuclein monomers, protofibrils, or fibrils.
Example 5: Clinical trials in PD, LBD, and Gaucher disease patients
In some embodiments, patients having certain forms of Gaucher disease (e.g., GD1) have an increased risk of developing Parkinson's disease (PD) or Lewy body dementia (LBD). This example describes clinical trials to assess the safety and efficacy of rAAVs as described by the present disclosure in patients having Gaucher disease, PD, and/or LBD.
Clinical trials of such vectors for the treatment of Gaucher disease, PD, and/or LBD are performed using a study design similar to that described by Grabowski et al. (1995) Ann. Intern. Med. 122(1):33-39.
Example 6: Treatment of peripheral disease
In some embodiments, patients having certain forms of Gaucher disease exhibit symptoms of peripheral neuropathy, for example as described by Biegstraaten et al. (2010) Brain 133(10):2909-2919.
This example describes in vivo assays of AAV vectors as described herein for the treatment of peripheral neuropathy associated with Gaucher disease (e.g., Type 1 Gaucher disease). Briefly, Type 1 Gaucher disease patients identified as having signs or symptoms of peripheral neuropathy are administered an rAAV as described by the present disclosure. In some embodiments, the peripheral neuropathic signs and symptoms of the subject are monitored after administration of the rAAV, for example using methods described by Biegstraaten et al.
Levels of transduced gene products as described by the present disclosure present in the patient (e.g., in the serum of the patient or in peripheral tissue of the patient, such as liver tissue or spleen tissue) are assayed, for example, by Western blot analysis, enzymatic functional assays, or imaging studies.
Example 7: Treatment of CNS forms
This example describes in vivo assays of rAAVs as described herein for the treatment of CNS forms of Gaucher disease. Briefly, Gaucher disease patients identified as having a CNS form of Gaucher disease (e.g., Type 2 or Type 3 Gaucher disease) are administered an rAAV as described by the present disclosure. Levels of transduced gene products as described by the present disclosure present in the CNS of the patient (e.g., in the serum of the CNS of the patient, in the cerebrospinal fluid (CSF) of the patient, or in CNS tissue of the patient) are assayed, for example, by Western blot analysis, enzymatic functional assays, or imaging studies.
Example 8: Gene therapy of Parkinson's disease in subjects having mutations in GBA1
This example describes administration of a recombinant adeno-associated virus (rAAV) encoding GBA1 to a subject having Parkinson's disease characterized by a mutation in the GBA1 gene.
The rAAV-GBA1 vector insert contains the CBA promoter element (CBA), consisting of four parts: the CMV enhancer (CMVe), the CBA promoter (CBAp), Exon 1, and an intron (int), to constitutively express the codon-optimized coding sequence (CDS) of human GBA1 (maroon). The 3' region also contains a Woodchuck hepatitis virus post-transcriptional regulatory element (WPRE) followed by a bovine growth hormone polyA signal (bGH polyA) tail. The flanking ITRs allow for the correct packaging of the intervening sequences. Two variants of the 5' ITR sequence (FIG. 7, inset box, bottom sequence) were evaluated; these variants have several nucleotide differences within the 20-nucleotide "D" region of the ITR, which is believed to affect the efficiency of packaging and expression. The rAAV-GBA1 vector product contains the "D" domain nucleotide sequence shown in FIG. 7 (inset box, top sequence). A variant vector harbors a mutant "D" domain (termed an "S" domain herein, with the nucleotide changes shown by shading) and performed similarly in preclinical studies. The backbone contains the gene conferring resistance to kanamycin as well as a stuffer sequence to prevent reverse packaging. A schematic depicting the rAAV-GBA1 vector is shown in FIG. 8. The rAAV-GBA1 vector is packaged into an rAAV using AAV9 serotype capsid proteins.
rAAV-GBA1 is administered to a subject as a single dose via a fluoroscopy-guided suboccipital injection into the cisterna magna (intra-cisterna magna; ICM). One embodiment of a rAAV-GBA1 dosing regimen study is as follows:
A single dose of rAAV-GBA1 is administered to patients (N=12) at one of two dose levels (3e13 vg (low dose); 1e14 vg (high dose), etc.), which are determined based on the results of nonclinical pharmacology and toxicology studies.
Initial studies were conducted in a chemical mouse model involving daily delivery of conduritol-B-epoxide (CBE), an inhibitor of GCase, to assess the efficacy and safety of the rAAV-GBA1 vector and the rAAV-GBA1 S-variant construct (as described further below). Additionally, initial studies were performed in a genetic mouse model (4L/PS-NA), which carries a homozygous GBA1 mutation and is partially deficient in saposins. Additional dose-ranging studies in mice and non-human primates (NHPs) are conducted to further evaluate vector safety and efficacy.
Two slightly different versions of the 5' inverted terminal repeat (ITR) in the AAV backbone were tested to assess manufacturability and transgene expression (FIG. 7). The 20 bp "D" domain within the 145 bp 5' ITR is thought to be necessary for optimal viral vector production, but mutations within the "D" domain have also been reported to increase transgene expression in some cases. Thus, in addition to the viral vector rAAV-GBA1, which harbors an intact "D" domain, a second vector form with a mutant D domain (termed an "S" domain herein) was also evaluated. Both rAAV-GBA1 and the variant express the same transgene. While both vectors produced virus that was efficacious in vivo, as detailed below, rAAV-GBA1, which contains the wild-type "D" domain, was selected for further development.
To establish the CBE model of GCase deficiency, juvenile mice were dosed with CBE, a specific inhibitor of GCase. Mice were given CBE by daily IP injection starting at postnatal day 8 (P8). Three different CBE doses (25 mg/kg, 37.5 mg/kg, 50 mg/kg) and PBS were tested to establish a model that exhibits a behavioral phenotype (FIG. 9). Higher doses of CBE led to lethality in a dose-dependent manner. All mice treated with 50 mg/kg CBE died by P23, and 5 of the 8 mice treated with 37.5 mg/kg CBE died by P27. There was no lethality in mice treated with 25 mg/kg CBE. Whereas CBE-injected mice showed no general motor deficits in the open field assay (traveling the same distance and at the same velocity as mice given PBS), CBE-treated mice exhibited a deficit in motor coordination and balance as measured by the rotarod assay.
Mice surviving to the end of the study were sacrificed on the day after their last CBE dose (P27, "Day 1") or after three days of CBE withdrawal (P29, "Day 3"). Lipid analysis was performed on the cortex of mice given 25 mg/kg CBE to evaluate the accumulation of GCase substrates in both the Day 1 and Day 3 cohorts. GluSph and GalSph levels (measured in aggregate in this example) were significantly accumulated in the CBE-treated mice compared to PBS-treated controls, consistent with GCase insufficiency.
Based on the study described above, the 25 mg/kg CBE dose was selected because it produced behavioral deficits without affecting survival. To achieve widespread GBA1 distribution throughout the brain and transgene expression during CBE treatment, rAAV-GBA1 or excipient was delivered by intracerebroventricular (ICV) injection at postnatal day 3 (P3), followed by daily IP CBE or PBS treatment initiated at P8 (FIG. 10).
CBE-treated mice that received rAAV-GBA1 performed statistically significantly better on the rotarod than those that received excipient (FIG. 11). Mice in the variant treatment group did not differ from excipient-treated mice in terms of other behavioral measures, such as the total distance traveled during testing (FIG. 11).
At the completion of the in-life study, half of the mice were sacrificed the day after the last CBE dose (P36, "Day 1") or after three days of CBE withdrawal (P38, "Day 3") for biochemical analysis (FIG. 12). GCase activity was assessed in the cortex using a fluorometric enzyme assay performed in biological triplicate. GCase activity was increased in mice treated with rAAV-GBA1, whereas CBE treatment reduced GCase activity. Additionally, mice that received both CBE and rAAV-GBA1 had GCase activity levels similar to those of the PBS-treated group, indicating that delivery of rAAV-GBA1 is able to overcome the inhibition of GCase activity induced by CBE treatment. Lipid analysis was performed on the motor cortex of the mice to examine levels of the substrates GluCer and GluSph. Both lipids accumulated in the brains of mice given CBE, and rAAV-GBA1 treatment significantly reduced substrate accumulation.
Lipid levels were negatively correlated with both GCase activity and performance on the rotarod across treatment groups. The increased GCase activity after rAAV-GBA1 administration was associated with substrate reduction and enhanced motor function (FIG. 13). As shown in FIG. 14, preliminary biodistribution was assessed by vector genome presence, as measured by qPCR (with >100 vector genomes per 1 μg genomic DNA defined as positive). Mice that received rAAV-GBA1, both with and without CBE, were positive for rAAV-GBA1 vector genomes in the cortex, indicating that ICV delivery results in rAAV-GBA1 delivery to the cortex. Additionally, vector genomes were detected in the liver, few were detected in the spleen, and none were detected in the heart, kidney, or gonads. For all measures, there was no statistically significant difference between the Day 1 and Day 3 groups.
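The positivity criterion stated above (more than 100 vector genomes per 1 μg of genomic DNA) can be illustrated with a short Python sketch. The function names and the sample values are hypothetical and are not data from the study; only the threshold is taken from the text.

```python
# Illustrative sketch only: sample values are invented for the example;
# only the >100 vg per ug gDNA threshold comes from the text.

POSITIVE_THRESHOLD_VG_PER_UG = 100.0


def vector_genomes_per_ug(vector_genomes: float, gdna_ng: float) -> float:
    """Normalize a qPCR vector genome count to 1 ug of input genomic DNA."""
    if gdna_ng <= 0:
        raise ValueError("genomic DNA input must be positive")
    return vector_genomes / (gdna_ng / 1000.0)  # 1 ug = 1000 ng


def is_positive(vector_genomes: float, gdna_ng: float) -> bool:
    """Apply the strict '>100 vg per ug gDNA' positivity criterion."""
    return vector_genomes_per_ug(vector_genomes, gdna_ng) > POSITIVE_THRESHOLD_VG_PER_UG


if __name__ == "__main__":
    # Hypothetical tissue samples: (vector genomes detected, ng gDNA loaded).
    samples = {"tissue A": (5000.0, 250.0), "tissue B": (10.0, 250.0)}
    for name, (vg, ng) in samples.items():
        print(name, "positive" if is_positive(vg, ng) else "negative")
```

The comparison is strict, so a sample at exactly 100 vg per μg is scored as negative, matching the ">100" wording.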
A larger study in the CBE model further investigated effective doses of rAAV-GBA1. Using the 25 mg/kg CBE dose model, excipient or rAAV-GBA1 was delivered via ICV at P3, and daily IP PBS or CBE treatment was initiated at P8. Given the similarity between the groups with and without CBE withdrawal observed in the previous study, all mice were sacrificed one day after the final CBE dose (P38-40). The effect of three different rAAV-GBA1 doses was assessed, resulting in the following five groups, with 10 mice (5M/5F) per group:
Excipient ICV + PBS IP
Excipient ICV + 25 mg/kg CBE IP
3.2e9 vg (2.13e10 vg/g brain) rAAV-GBA1 ICV + 25 mg/kg CBE IP
1.0e10 vg (6.67e10 vg/g brain) rAAV-GBA1 ICV + 25 mg/kg CBE IP
3.2e10 vg (2.13e11 vg/g brain) rAAV-GBA1 ICV + 25 mg/kg CBE IP.
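The relationship between the total dose (vg) and the per-gram dose (vg/g brain) in the groups above can be checked with a short calculation. This is a sketch under a stated assumption: the text does not give the brain mass used, so the value of about 0.15 g below is inferred from the listed pairs and is not a figure from the source. The function names are illustrative.

```python
def implied_brain_mass_g(total_vg: float, vg_per_g_brain: float) -> float:
    """Brain mass (g) implied by a total dose and its listed per-gram equivalent."""
    return total_vg / vg_per_g_brain


def dose_per_gram(total_vg: float, brain_mass_g: float) -> float:
    """Convert a total vector genome dose to vg per gram of brain."""
    return total_vg / brain_mass_g


# (total vg, vg/g brain) pairs as listed for the three rAAV-GBA1 groups
listed_doses = [(3.2e9, 2.13e10), (1.0e10, 6.67e10), (3.2e10, 2.13e11)]
implied_masses = [implied_brain_mass_g(total, per_g) for total, per_g in listed_doses]

# Assumption for illustration: a 0.15 g brain, the value the pairs above imply
ASSUMED_BRAIN_MASS_G = 0.15
recomputed = [dose_per_gram(total, ASSUMED_BRAIN_MASS_G) for total, _ in listed_doses]
```

Under that assumption, the recomputed per-gram doses agree with the listed values to within rounding.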
The highest dose of rAAV-GBA1 rescued the CBE treatment-related failure to gain weight at P37. Additionally, this dose resulted in a statistically significant increase in performance on the rotarod and tapered beam compared to the excipient + CBE treated group (FIG. 15). Lethality was observed in several groups, including both excipient-treated and rAAV-GBA1-treated groups (excipient + PBS: 0; excipient + 25 mg/kg CBE: 1; 3.2e9 vg rAAV-GBA1 + 25 mg/kg CBE: 4; 1.0e10 vg rAAV-GBA1 + 25 mg/kg CBE: 0; 3.2e10 vg rAAV-GBA1 + 25 mg/kg CBE: 3).
At the completion of the in-life study, mice were sacrificed for biochemical analysis (FIG. 16). GCase activity in the cortex was assessed in biological triplicate by a fluorometric assay. CBE-treated mice showed reduced GCase activity, whereas mice that received a high rAAV-GBA1 dose showed a statistically significant increase in GCase activity compared to CBE treatment. CBE-treated mice also accumulated GluCer and GluSph, both of which were rescued by administering a high dose of rAAV-GBA1.
In addition to the established chemical CBE model, rAAV-GBA1 was also evaluated in the 4L/PS-NA genetic model, which is homozygous for the V394L GD mutation in Gba1 and is also partially deficient in saposins, which affect GCase localization and activity. These mice exhibit deficits in motor strength, coordination, and balance, as evidenced by their performance in the beam walk, rotarod, and wire hang assays. Typically the lifespan of these mice is less than 22 weeks. In an initial study, 3 μl of maximal titer virus was delivered by ICV at P23, for a final dose of 2.4e10 vg (6.0e10 vg/g brain). With 6 mice per group, the treatment groups were:
WT + excipient ICV
4L/PS-NA + excipient ICV
4L/PS-NA + 2.4e10 vg (6.0e10 vg/g brain) rAAV-GBA1 ICV
Motor performance by the beam walk test was assessed 4 weeks after rAAV-GBA1 delivery. The group of mutant mice that received rAAV-GBA1 showed a trend toward fewer total slips and fewer slips per speed compared to mutant mice treated with excipient, restoring motor function to near WT levels (FIG. 17). Because the motor phenotypes become more severe as these mice age, their performance on this and other behavioral tests is assessed at later time points. At the completion of the in-life study, lipid levels, GCase activity, and biodistribution are assessed in these mice.
Additional lower doses of rAAV-GBA1 are currently being tested using the CBE model, corresponding to 0.03x, 0.1x, and 1x the proposed Phase 1 high clinical dose. Each group includes 10 mice (5M/5F):
Excipient ICV
Excipient ICV + 25 mg/kg CBE IP
3.2e8 vg (2.13e9 vg/g brain) rAAV-GBA1 ICV + 25 mg/kg CBE IP
1.0e9 vg (6.67e9 vg/g brain) rAAV-GBA1 ICV + 25 mg/kg CBE IP
1.0e10 vg (6.67e10 vg/g brain) rAAV-GBA1 ICV + 25 mg/kg CBE IP.
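The stated multiples (0.03x, 0.1x, and 1x) can be related to the three listed doses with a simple ratio. This is a sketch under an assumption that the text does not state explicitly: the highest listed mouse dose, 1.0e10 vg, is taken as the 1x reference. The variable names are illustrative.

```python
# Assumption for illustration: the 1.0e10 vg group is the 1x reference dose
REFERENCE_DOSE_VG = 1.0e10

# Total doses listed for the three rAAV-GBA1 groups
listed_doses_vg = [3.2e8, 1.0e9, 1.0e10]

# Each dose expressed as a fraction of the assumed reference
fractions = [dose / REFERENCE_DOSE_VG for dose in listed_doses_vg]
```

Under this assumption, the fractions come out at approximately 0.032, 0.1, and 1, consistent with the multiples quoted in the text after rounding.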
In addition to motor phenotypes, lipid levels and GCase activity are assessed in the cortex. Time-course treatments and analyses are also performed.
A larger dose-ranging study was initiated to evaluate efficacy and safety data. Ten 4L/PS-NA mice (5M/5F per group) were injected with 10 μl of rAAV-GBA1. Using an allometric brain weight calculation, the doses correlate to 0.15x, 1.5x, 4.4x, and 14.5x the proposed Phase 1 high clinical dose. The injection groups consist of:
WT + excipient ICV
4L/PS-NA + excipient ICV
4L/PS-NA + 4.3e9 vg (1.1e10 vg/g brain) rAAV-GBA1 ICV
4L/PS-NA + 4.3e10 vg (1.1e11 vg/g brain) rAAV-GBA1 ICV
4L/PS-NA + 1.3e11 vg (3.2e11 vg/g brain) rAAV-GBA1 ICV
4L/PS-NA + 4.3e11 vg (1.1e12 vg/g brain) rAAV-GBA1 ICV.
A summary of the nonclinical studies in the CBE model is shown in Table 7 below.
<Table 7>
Summary of Results in the CBE Mouse Model
Note: Positive biodistribution is defined as >100 vg/1 μg genomic DNA. Abbreviations: BD = biodistribution; NS = not significant; T = trend; S = significant; N/A = not applicable; + = positive; - = negative.
Example 9: In vitro analysis of rAAV vectors
rAAV constructs were tested in vitro and in vivo. FIG. 18 shows representative data for in vitro expression of rAAV constructs encoding progranulin (PGRN; also referred to as GRN) protein. The left panel shows a standard curve of the progranulin (PGRN) ELISA assay. The bottom panel shows a dose-response of PGRN expression measured by ELISA assay in cell lysates of HEK293T cells transduced with rAAV. MOI = multiplicity of infection (vector genomes per cell).
A pilot study was performed to assess the in vitro activity of rAAV vectors encoding prosaposin (PSAP) and SCARB2, either alone or in combination with GBA1 and/or one or more inhibitory RNAs. One construct encoding PSAP and progranulin (PGRN; also referred to as GRN) was also tested. Vectors tested include those shown in Table 4. "Opt" refers to a nucleic acid sequence that is codon-optimized for expression in mammalian cells (e.g., human cells). FIG. 19 shows representative data indicating that transfection of HEK293 cells with each of the constructs resulted in overexpression of the corresponding gene product compared to mock-transfected cells.
A pilot study was performed to assess the in vitro activity of rAAV vectors encoding TREM2, either alone or in combination with one or more inhibitory RNAs. Vectors tested include those shown in Table 8. "Opt" refers to a nucleic acid sequence that is codon-optimized for expression in mammalian cells (e.g., human cells). FIGs. 36A-36B show representative data indicating that transfection of HEK293 cells with each of the constructs resulted in overexpression of the corresponding gene product compared to mock-transfected cells.
<Table 8>
Example 10: Testing of SNCA and TMEM106B shRNA constructs
HEK293 cells
The human embryonic kidney 293 cell line (HEK293) was used in this study (#85120602, Sigma-Aldrich). HEK293 cells were maintained in culture medium (D-MEM [#11995065, Thermo Fisher Scientific] supplemented with 10% fetal bovine serum [FBS] [#10082147, Thermo Fisher Scientific]) containing 100 units/ml penicillin and 100 μg/ml streptomycin (#15140122, Thermo Fisher Scientific).
Plasmid transfection
Plasmid transfection was performed using Lipofectamine 2000 transfection reagent (#11668019, Thermo Fisher Scientific) according to the manufacturer's instructions. Briefly, HEK293 cells (#12022001, Sigma-Aldrich) were plated at a density of 3 x 10^5 cells/ml in culture medium without antibiotics. On the following day, the plasmid and Lipofectamine 2000 reagent were combined in Opti-MEM solution (#31985062, Thermo Fisher Scientific). After 5 minutes, the mixture was added to the HEK293 culture. After 72 hours, cells were harvested for RNA or protein extraction, or subjected to imaging analysis. For imaging analysis, the plates were pre-coated with 0.01% poly-L-lysine solution (P8920, Sigma-Aldrich) before the cells were plated.
Gene expression analysis by quantitative real-time PCR (qRT-PCR)
Relative gene expression levels were determined by quantitative real-time PCR (qRT-PCR) using the Power SYBR Green Cells-to-CT Kit (#4402955, Thermo Fisher Scientific) according to the manufacturer's instructions. Candidate plasmids were transiently transfected into HEK293 cells plated on 48-well plates (7.5 x 10^4 cells/well) using Lipofectamine 2000 transfection reagent (0.5 μg plasmid and 1.5 μl reagent in 50 μl Opti-MEM solution). After 72 hours, RNA was extracted from the cells and used for reverse transcription to synthesize cDNA according to the manufacturer's instructions. For quantitative PCR analysis, 2 to 5 μl of cDNA product was amplified in duplicate using gene-specific primer pairs (250 nM final concentration) with Power SYBR Green PCR Master Mix (#4367659, Thermo Fisher Scientific). The primer sequences for the SNCA, TMEM106B, and GAPDH genes were as follows: for SNCA, 5'- AAG AGG GTG TTC TCT ATG TAG GC -3' (SEQ ID NO: 71) and 5'- GCT CCT CCA ACA TTT GTC ACT T -3' (SEQ ID NO: 72); for TMEM106B, 5'-ACA CAG TAC CTA CCG TTA TAG CA-3' (SEQ ID NO: 73) and 5'-TGT TGT CAC AGT AAC TTG CAT CA-3' (SEQ ID NO: 74); and for GAPDH, 5'- CTG GGC TAC ACT GAG CAC C -3' (SEQ ID NO: 75) and 5'- AAG TGG TCG TTG AGG GCA ATG -3' (SEQ ID NO: 76). Quantitative PCR was performed on a QuantStudio 3 Real-Time PCR System (Thermo Fisher Scientific). Expression levels were normalized to the housekeeping gene GAPDH and calculated using the comparative CT method.
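The comparative CT calculation named above is commonly written as 2^-ΔΔCT, with the target gene normalized to GAPDH and then to a control sample. The text does not spell out the formula, so the following is a minimal sketch of that standard form, assuming roughly 100% amplification efficiency for both primer pairs. The function name and all CT values are hypothetical and are not study data.

```python
from statistics import mean


def relative_expression(target_ct, reference_ct, control_target_ct, control_reference_ct):
    """Relative expression by the 2^-ddCT form of the comparative CT method.

    Each argument is a list of replicate CT values (e.g., duplicate wells).
    """
    # dCT: target gene normalized to the housekeeping gene, per sample
    delta_sample = mean(target_ct) - mean(reference_ct)
    delta_control = mean(control_target_ct) - mean(control_reference_ct)
    # ddCT: knockdown sample relative to the control sample
    delta_delta = delta_sample - delta_control
    return 2.0 ** (-delta_delta)


# Hypothetical duplicate CT values (not study data)
knockdown_snca, knockdown_gapdh = [26.0, 26.2], [18.0, 18.2]
control_snca, control_gapdh = [24.0, 24.2], [18.0, 18.2]

remaining = relative_expression(knockdown_snca, knockdown_gapdh, control_snca, control_gapdh)
```

In this hypothetical case the target CT rises by two cycles relative to GAPDH, which corresponds to about 25% of control expression, i.e., roughly 75% knockdown.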
Fluorescence imaging analysis
An EGFP reporter plasmid, which contains the 3'-UTR of the human SNCA gene downstream of the EGFP coding region, was used for validation of the SNCA and TMEM106B knockdown plasmids. The EGFP reporter plasmid and candidate knockdown plasmids were simultaneously transfected into HEK293 cells plated on poly-L-lysine-coated 96-well plates (3.0 x 10^4 cells/well) using Lipofectamine 2000 transfection reagent (0.04 μg reporter plasmid, 0.06 μg knockdown plasmid, and 0.3 μl reagent in 10 μl Opti-MEM solution). After 72 hours, the fluorescence intensity of the EGFP signal was measured at excitation 488 nm/emission 512 nm using a Varioskan LUX multimode reader (Thermo Fisher Scientific). Cells were fixed with 4% PFA at room temperature for 10 minutes and incubated with D-PBS containing 40 μg/ml 7-aminoactinomycin D (7-AAD) for 30 minutes at room temperature. After washing with D-PBS, the fluorescence intensity of the 7-AAD signal was measured at excitation 546 nm/emission 647 nm using the Varioskan reader to quantify cell number. The EGFP signal normalized to the 7-AAD signal level was compared with that of control knockdown samples.
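The normalization described above (EGFP signal per 7-AAD signal, compared with a control knockdown sample) can be sketched as follows. This is an illustration only: the function names and fluorescence readings are hypothetical, and background subtraction is omitted because the text does not describe it.

```python
def normalized_reporter(egfp_signal: float, seven_aad_signal: float) -> float:
    """EGFP fluorescence per unit of 7-AAD fluorescence (a proxy for cell number)."""
    if seven_aad_signal <= 0:
        raise ValueError("7-AAD signal must be positive")
    return egfp_signal / seven_aad_signal


def percent_of_control(sample: tuple, control: tuple) -> float:
    """Normalized reporter signal of a sample as a percentage of the control."""
    return 100.0 * normalized_reporter(*sample) / normalized_reporter(*control)


# Hypothetical (EGFP, 7-AAD) readings in arbitrary fluorescence units
control_well = (8000.0, 2000.0)
knockdown_well = (1500.0, 1500.0)

remaining_percent = percent_of_control(knockdown_well, control_well)
```

Dividing by the 7-AAD signal keeps a well with fewer cells from being mistaken for a well with stronger knockdown.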
Enzyme-linked immunosorbent assay (ELISA)
An α-synuclein reporter plasmid, which contains the 3'-UTR of the human SNCA gene or TMEM106B gene downstream of the SNCA coding region, was used for validation of knockdown plasmids at the protein level. Levels of α-synuclein protein were determined by ELISA (#KHB0061, Thermo Fisher Scientific) using lysates extracted from HEK293 cells. Candidate plasmids were transiently transfected into HEK293 cells plated on 48-well plates (7.5 x 10^4 cells/well) using Lipofectamine 2000 transfection reagent (0.1 μg reporter plasmid, 0.15 μg knockdown plasmid, and 0.75 μl reagent in 25 μl Opti-MEM solution). After 72 hours, cells were lysed in radioimmunoprecipitation assay (RIPA) buffer (#89900, Thermo Fisher Scientific) supplemented with protease inhibitor cocktail (#P8340, Sigma-Aldrich) and sonicated for a few seconds. After incubation on ice for 30 minutes, the lysates were centrifuged at 20,000 x g at 4°C for 15 minutes, and the supernatant was collected. Protein levels were quantified. Plates were read on a Varioskan plate reader at 450 nm, and concentrations were calculated using SoftMax Pro 5 software. Measured protein concentrations were normalized to the total protein concentration determined with a bicinchoninic acid assay (#23225, Thermo Fisher Scientific).
FIG. 37 and Table 9 show representative data indicating successful silencing of SNCA in vitro by the GFP reporter assay (top) and the α-Syn assay (bottom). FIG. 38 and Table 10 show representative data indicating successful silencing of TMEM106B in vitro by the GFP reporter assay (top) and the α-Syn assay (bottom).
<Table 9>
<Table 10>
Example 11: ITR "D" sequence placement and cell transduction
The effect of placement of the ITR "D" sequence on cell transduction by rAAV vectors was investigated. HEK293 cells were transduced with GCase-encoding rAAVs having 1) wild-type ITRs (e.g., "D" sequences proximal to the transgene insert and distal to the terminus of the ITR) or 2) ITRs with the "D" sequence located on the "outside" of the vector (e.g., "D" sequence located proximal to the terminus of the ITR and distal to the transgene insert), as shown in FIG. 20. The data indicate that rAAVs having the "D" sequence located in the "outside" position retain the ability to be packaged and to transduce cells efficiently (FIG. 40).
Example 12: In vitro testing of progranulin rAAVs
FIG. 39 is a schematic depicting one embodiment of a vector comprising an expression construct encoding PGRN (also referred to as GRN). Progranulin is overexpressed in the CNS of rodents deficient in GRN, either heterozygous or homozygous for GRN deletion, by injection of an rAAV vector encoding PGRN (e.g., codon-optimized PGRN; also referred to as codon-optimized GRN), either by intraparenchymal or intrathecal injection, such as injection into the cisterna magna.
Mice are injected at 2 months or 6 months of age, aged to 6 months or 12 months, and analyzed for one or more of the following: expression level of GRN at the RNA and protein levels, behavioral assays (e.g., improved movement), survival assays (e.g., improved survival), microglia and inflammatory markers, gliosis, neuronal loss, lipofuscinosis, and/or rescue of lysosomal marker accumulation, such as LAMP1. Assays on GRN-deficient mice are described, for example, by Arrant et al. (2017) Brain 140: 1477-1465; Arrant et al. (2018) J. Neuroscience 38(9):2341-2358; and Amado et al. (2018) doi:https://doi.org/10.1101/30869; the entire contents of which are incorporated herein by reference.
Example 13: In vitro testing of MAPT rAAVs
SY5Y cells were plated at 4 x 10^4 cells per well in 96-well plates. The following day, cells were transduced in triplicate at an MOI of 2 x 10^5, in medium containing 1 μM Hoechst, with two viral stocks (Intronic_eSIBR_MAPT_MiR615 conserved vectors) encoding an inhibitory RNA targeting MAPT (J00130, produced in a mammalian cell-based system, and J00122, produced in a baculovirus-based system; shown in FIG. 75C). Excipient alone was used as a negative control. Cells were harvested after 72 hours and stained with a probe to detect AAV vectors expressing the inhibitory RNA against MAPT. The probe targets BGHpA. FIG. 75A shows that both viral stocks successfully transduced SY5Y cells.
SY5Y cells were plated at 4 x 10^4 cells per well in 96-well plates. The following day, cells were transduced in triplicate at an MOI of 2 x 10^6, in medium containing 1 μM Hoechst, with two viral stocks (Intronic_eSIBR_MAPT_MiR615 conserved vectors) encoding an inhibitory RNA targeting MAPT (J00130 and J00122; shown in FIG. 75C). Excipient alone was used as a negative control. SY5Y cells were lysed for RNA extraction at 72 hours or 7 days after transduction. cDNA was made from the extracted RNA using the Invitrogen Power SYBR Green Cells-to-CT Kit. qRT-PCR was performed on the cDNA samples, run in triplicate, using primers for both human MAPT and GAPDH. FIG. 75B shows data for knockdown of MAPT expression by J00130 and J00122.
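Since MOI is defined earlier in this document as vector genomes per cell, the vector input per well for the two experiments above follows from a single multiplication. The sketch below assumes the plated cell number (4 x 10^4 per well) is used as the cell count at transduction, which the text does not state. The stock titer is a hypothetical value chosen only to illustrate the volume conversion, and the function names are illustrative.

```python
def vector_genomes_needed(cells_per_well: float, moi: float) -> float:
    """Vector genomes per well for a given MOI (vector genomes per cell)."""
    return cells_per_well * moi


def stock_volume_ul(vg_needed: float, titer_vg_per_ml: float) -> float:
    """Volume of viral stock (microliters) that contains vg_needed."""
    return vg_needed / titer_vg_per_ml * 1000.0


CELLS_PER_WELL = 4e4  # plating density from the text

vg_staining = vector_genomes_needed(CELLS_PER_WELL, 2e5)   # MOI used for probe staining
vg_knockdown = vector_genomes_needed(CELLS_PER_WELL, 2e6)  # MOI used for qRT-PCR knockdown

# Hypothetical stock titer (not from the source), for illustration only
HYPOTHETICAL_TITER_VG_PER_ML = 1e13
volume_knockdown_ul = stock_volume_ul(vg_knockdown, HYPOTHETICAL_TITER_VG_PER_ML)
```

Under these assumptions, the two conditions correspond to 8e9 and 8e10 vector genomes per well, and the higher input would take about 8 μl of a 1e13 vg/ml stock.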
Equivalents
This application incorporates by reference the entire contents of the following documents: International PCT Application PCT/US2018/054225, filed October 3, 2018; International PCT Application PCT/US2018/054223, filed October 3, 2018; U.S. Provisional Application Serial No. 62/567,296, filed October 3, 2017, entitled "Gene Therapy for Lysosomal Disorders"; U.S. Provisional Application Serial No. 62/567,311, filed October 3, 2017, entitled "Gene Therapy for Lysosomal Disorders"; U.S. Provisional Application Serial No. 62/567,319, filed October 3, 2017, entitled "Gene Therapy for Lysosomal Disorders"; U.S. Provisional Application Serial No. 62/567,301, filed October 3, 2018, entitled "Gene Therapy for Lysosomal Disorders"; U.S. Provisional Application Serial No. 62/567,310, filed October 3, 2017, entitled "Gene Therapy for Lysosomal Disorders"; U.S. Provisional Application Serial No. 62/567,303, filed October 3, 2017, entitled "Gene Therapy for Lysosomal Disorders"; and U.S. Provisional Application Serial No. 62/567,305, filed October 3, 2017, entitled "Gene Therapy for Lysosomal Disorders."
Having thus described several aspects of at least one embodiment of this invention, it is to be appreciated that various alterations, modifications, and improvements will readily occur to those skilled in the art. Such alterations, modifications, and improvements are intended to be part of this disclosure, and are intended to be within the spirit and scope of the invention. Accordingly, the foregoing description and drawings are by way of example only.
While several embodiments of the present invention have been described and illustrated herein, those of ordinary skill in the art will readily envision a variety of other means and/or structures for performing the functions and/or obtaining the results and/or one or more of the advantages described herein, and each of such variations and/or modifications is deemed to be within the scope of the present invention. More generally, those skilled in the art will readily appreciate that all parameters, dimensions, materials, and configurations described herein are meant to be exemplary and that the actual parameters, dimensions, materials, and/or configurations will depend upon the specific application or applications for which the teachings of the present invention are used. Those skilled in the art will recognize, or be able to ascertain using no more than routine experimentation, many equivalents to the specific embodiments of the invention described herein. It is, therefore, to be understood that the foregoing embodiments are presented by way of example only and that, within the scope of the appended claims and equivalents thereto, the invention may be practiced otherwise than as specifically described and claimed. The present invention is directed to each individual feature, system, article, material, and/or method described herein. In addition, any combination of two or more such features, systems, articles, materials, and/or methods, if such features, systems, articles, materials, and/or methods are not mutually inconsistent, is included within the scope of the present invention.
The indefinite articles "a" and "an," as used herein in the specification and in the claims, should be understood to mean "at least one" unless clearly indicated to the contrary.
The phrase "and/or," as used herein in the specification and in the claims, should be understood to mean "either or both" of the elements so conjoined, i.e., elements that are conjunctively present in some cases and disjunctively present in other cases. Elements other than those specifically identified by the "and/or" clause may optionally be present, whether related or unrelated to the elements specifically identified, unless clearly indicated to the contrary. Thus, as a non-limiting example, a reference to "A and/or B," when used in conjunction with open-ended language such as "comprising," can refer, in one embodiment, to A without B (optionally including elements other than B); in another embodiment, to B without A (optionally including elements other than A); in yet another embodiment, to both A and B (optionally including other elements); etc.
As used herein in the specification and in the claims, "or" should be understood to have the same meaning as "and/or" as defined above. For example, when separating items in a list, "or" or "and/or" shall be interpreted as being inclusive, i.e., the inclusion of at least one, but also including more than one, of a number or list of elements, and, optionally, additional unlisted items. Only terms clearly indicated to the contrary, such as "only one of" or "exactly one of," or, when used in the claims, "consisting of," will refer to the inclusion of exactly one element of a number or list of elements. In general, the term "or" as used herein shall only be interpreted as indicating exclusive alternatives (i.e., "one or the other but not both") when preceded by terms of exclusivity, such as "either," "one of," "only one of," or "exactly one of." "Consisting essentially of," when used in the claims, shall have its ordinary meaning as used in the field of patent law.
As used herein in the specification and in the claims, the phrase "at least one," in reference to a list of one or more elements, should be understood to mean at least one element selected from any one or more of the elements in the list of elements, but not necessarily including at least one of each and every element specifically listed within the list of elements and not excluding any combinations of elements in the list of elements. This definition also allows that elements may optionally be present other than the elements specifically identified within the list of elements to which the phrase "at least one" refers, whether related or unrelated to those elements specifically identified. Thus, as a non-limiting example, "at least one of A and B" (or, equivalently, "at least one of A or B," or, equivalently, "at least one of A and/or B") can refer, in one embodiment, to at least one, optionally including more than one, A, with no B present (and optionally including elements other than B); in another embodiment, to at least one, optionally including more than one, B, with no A present (and optionally including elements other than A); in yet another embodiment, to at least one, optionally including more than one, A, and at least one, optionally including more than one, B (and optionally including other elements); etc.
In the claims, as well as in the specification above, all transitional phrases such as "comprising," "including," "carrying," "having," "containing," "involving," "holding," and the like are to be understood to be open-ended, i.e., to mean including but not limited to. Only the transitional phrases "consisting of" and "consisting essentially of" shall be closed or semi-closed transitional phrases, respectively, as set forth in the United States Patent Office Manual of Patent Examining Procedures, Section 2111.03.
Use of ordinal terms such as "first," "second," "third," etc., in the claims to modify a claim element does not by itself connote any priority, precedence, or order of one claim element over another, or the temporal order in which acts of a method are performed; such terms are used merely as labels to distinguish one claim element having a certain name from another element having the same name (but for use of the ordinal term).
It should also be understood that, unless clearly indicated to the contrary, in any methods claimed herein that include more than one step or act, the order of the steps or acts of the method is not necessarily limited to the order in which the steps or acts of the method are recited.
Sequences
In some embodiments, an expression cassette encoding one or more gene products (e.g., a first, second, and/or third gene product) comprises or consists of (or encodes a peptide having) a sequence set forth in any one of SEQ ID NOs: 1-149. In some embodiments, a gene product is encoded by a portion (e.g., a fragment) of any one of SEQ ID NOs: 1-149.
SEQUENCE LISTING
<110> Prevail Threapeutics, Inc.
<120> GENE THERAPIES FOR LYSOSOMAL DISORDERS
<130> P1094.70012WO00
<140> Not Yet Assigned
<141> Concurrently Herewith
<150> US 62/990,246
<151> 2020-03-16
<150> US 62/998,665
<151> 2020-03-12
<150> US 62/960,471
<151> 2020-01-13
<150> US 62/954,089
<151> 2019-12-27
<150> US 62/934,450
<151> 2019-11-12
<150> US 62/832,223
<151> 2019-04-10
<150> US 62/831,840
<151> 2019-04-10
<150> US 62/831,846
<151> 2019-04-10
<150> US 62/831,856
<151> 2019-04-10
<160> 149
<170> PatentIn version 3.5
<210> 1
<211> 10697
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 1
ttggccactc cctctctgcg cgctcgctcg ctcactgagg ccgggcgacc aaaggtcgcc 60
cgacgcccgg gctttgcccg ggcggcctca gtgagcgagc gagcgcgcag agagggagtg 120
gccaactcca tcactagggg ttcctgctag ctctgggtat ttaagcccga gtgagcacgc 180
agggtctcca ttttgaagcg ggaggttacg cgttcgtcga ctactagtgg gtaccagagc 240
tccctaggtt ctagaaccgg tgacgtctcc catggtgaag cttggatctg aattcggtac 300
cctagttatt aatagtaatc aattacgggg tcattagttc atagcccata tatggagttc 360
cgcgttacat aacttacggt aaatggcccg cctggctgac cgcccaacga cccccgccca 420
ttgacgtcaa taatgacgta tgttcccata gtaacgccaa tagggacttt ccattgacgt 480
caatgggtgg actatttacg gtaaactgcc cacttggcag tacatcaagt gtatcatatg 540
ccaagtacgc cccctattga cgtcaatgac ggtaaatggc ccgcctggca ttatgcccag 600
tacatgacct tatgggactt tcctacttgg cagtacatct acgtattagt catcgctatt 660
accatggtcg aggtgagccc cacgttctgc ttcactctcc ccatctcccc cccctcccca 720
cccccaattt tgtatttatt tattttttaa ttattttgtg cagcgatggg ggcggggggg 780
gggggggggc gcgcgccagg cggggcgggg cggggcgagg ggcggggcgg ggcgaggcgg 840
agaggtgcgg cggcagccaa tcagagcggc gcgctccgaa agtttccttt tatggcgagg 900
cggcggcggc ggcggcccta taaaaagcga agcgcgcggc gggcgggagt cgctgcgacg 960
ctgccttcgc cccgtgcccc gctccgccgc cgcctcgcgc cgcccgcccc ggctctgact 1020
gaccgcgtta ctcccacagg tgagcgggcg ggacggccct tctcctccgg gctgtaatta 1080
gcgcttggtt taatgacggc ttgttttctg tggctgcgtg aaagccttga ggggctccgg 1140
gagctagagc ctctgctaac catgttcatg ccttcttctt tttcctacag ctcctgggca 1200
acgtgctggt tattgtgctg tctcatcatt ttggcaaaga attcctcgaa gatccgaagg 1260
gaaagtcttc cacgactgtg ggatccgttc gaagatatca ccggttgagc caccatggaa 1320
ttcagcagcc ccagcagaga ggaatgcccc aagcctctga gccgggtgtc aatcatggcc 1380
ggatctctga caggactgct gctgcttcag gccgtgtctt gggcttctgg cgctagacct 1440
tgcatcccca agagcttcgg ctacagcagc gtcgtgtgcg tgtgcaatgc cacctactgc 1500
gacagcttcg accctcctac ctttcctgct ctgggcacct tcagcagata cgagagcacc 1560
agatccggca gacggatgga actgagcatg ggacccatcc aggccaatca cacaggcact 1620
ggcctgctgc tgacactgca gcctgagcag aaattccaga aagtgaaagg cttcggcgga 1680
gccatgacag atgccgccgc tctgaatatc ctggctctgt ctccaccagc tcagaacctg 1740
ctgctcaaga gctacttcag cgaggaaggc atcggctaca acatcatcag agtgcccatg 1800
gccagctgcg acttcagcat caggacctac acctacgccg acacacccga cgatttccag 1860
ctgcacaact tcagcctgcc tgaagaggac accaagctga agatccctct gatccacaga 1920
gccctgcagc tggcacaaag acccgtgtca ctgctggcct ctccatggac atctcccacc 1980
tggctgaaaa caaatggcgc cgtgaatggc aagggcagcc tgaaaggcca acctggcgac 2040
atctaccacc agacctgggc cagatacttc gtgaagttcc tggacgccta tgccgagcac 2100
aagctgcagt tttgggccgt gacagccgag aacgaacctt ctgctggact gctgagcggc 2160
tacccctttc agtgcctggg ctttacaccc gagcaccagc gggactttat cgcccgtgat 2220
ctgggaccca cactggccaa tagcacccac cataatgtgc ggctgctgat gctggacgac 2280
cagagactgc ttctgcccca ctgggctaaa gtggtgctga cagatcctga ggccgccaaa 2340
tacgtgcacg gaatcgccgt gcactggtat ctggactttc tggcccctgc caaggccaca 2400
ctgggagaga cacacagact gttccccaac accatgctgt tcgccagcga agcctgtgtg 2460
ggcagcaagt tttgggaaca gagcgtgcgg ctcggcagct gggatagagg catgcagtac 2520
agccacagca tcatcaccaa cctgctgtac cacgtcgtcg gctggaccga ctggaatctg 2580
gccctgaatc ctgaaggcgg ccctaactgg gtccgaaact tcgtggacag ccccatcatc 2640
gtggacatca ccaaggacac cttctacaag cagcccatgt tctaccacct gggacacttc 2700
agcaagttca tccccgaggg ctctcagcgc gttggactgg tggcttccca gaagaacgat 2760
ctggacgccg tggctctgat gcaccctgat ggatctgctg tggtggtggt cctgaaccgc 2820
agcagcaaag atgtgcccct gaccatcaag gatcccgccg tgggattcct ggaaacaatc 2880
agccctggct actccatcca cacctacctg tggcgtagac agtgacaatt gttaattaag 2940
tttaaaccct cgaggccgca agcttatcga taatcaacct ctggattaca aaatttgtga 3000
aagattgact ggtattctta actatgttgc tccttttacg ctatgtggat acgctgcttt 3060
aatgcctttg tatcatgcta ttgcttcccg tatggctttc attttctcct ccttgtataa 3120
atcctggttg ctgtctcttt atgaggagtt gtggcccgtt gtcaggcaac gtggcgtggt 3180
gtgcactgtg tttgctgacg caacccccac tggttggggc attgccacca cctgtcagct 3240
cctttccggg actttcgctt tccccctccc tattgccacg gcggaactca tcgccgcctg 3300
ccttgcccgc tgctggacag gggctcggct gttgggcact gacaattccg tggtgttgtc 3360
ggggaaatca tcgtcctttc cttggctgct cgcctgtgtt gccacctgga ttctgcgcgg 3420
gacgtccttc tgctacgtcc cttcggccct caatccagcg gaccttcctt cccgcggcct 3480
gctgccggct ctgcggcctc ttccgcgtct tcgccttcgc cctcagacga gtcggatctc 3540
cctttgggcc gcctccccgc atcgataccg tcgactagag ctcgctgatc agcctcgact 3600
gtgccttcta gttgccagcc atctgttgtt tgcccctccc ccgtgccttc cttgaccctg 3660
gaaggtgcca ctcccactgt cctttcctaa taaaatgagg aaattgcatc gcattgtctg 3720
agtaggtgtc attctattct ggggggtggg gtggggcagg acagcaaggg ggaggattgg 3780
gaagacaata gcaggcatgc tggggagaga tccacgataa caaacagctt ttttggggtg 3840
aacatattga ctgaattccc tgcaggttgg ccactccctc tctgcgcgct cgctcgctca 3900
ctgaggccgc ccgggcaaag cccgggcgtc gggcgacctt tggtcgcccg gcctcagtga 3960
gcgagcgagc gcgcagagag ggagtggcca actccatcac taggggttcc tgcggccgct 4020
cgtacggtct cgaggaattc ctgcaggata acttgccaac ctcattctaa aatgtatata 4080
gaagcccaaa agacaataac aaaaatattc ttgtagaaca aaatgggaaa gaatgttcca 4140
ctaaatatca agatttagag caaagcatga gatgtgtggg gatagacagt gaggctgata 4200
aaatagagta gagctcagaa acagacccat tgatatatgt aagtgaccta tgaaaaaaat 4260
atggcatttt acaatgggaa aatgatggtc tttttctttt ttagaaaaac agggaaatat 4320
atttatatgt aaaaaataaa agggaaccca tatgtcatac catacacaca aaaaaattcc 4380
agtgaattat aagtctaaat ggagaaggca aaactttaaa tcttttagaa aataatatag 4440
aagcatgcag accagcctgg ccaacatgat gaaaccctct ctactaataa taaaatcagt 4500
agaactactc aggactactt tgagtgggaa gtccttttct atgaagactt ctttggccaa 4560
aattaggctc taaatgcaag gagatagtgc atcatgcctg gctgcactta ctgataaatg 4620
atgttatcac catctttaac caaatgcaca ggaacaagtt atggtactga tgtgctggat 4680
tgagaaggag ctctacttcc ttgacaggac acatttgtat caacttaaaa aagcagattt 4740
ttgccagcag aactattcat tcagaggtag gaaacttaga atagatgatg tcactgatta 4800
gcatggcttc cccatctcca cagctgcttc ccacccaggt tgcccacagt tgagtttgtc 4860
cagtgctcag ggctgcccac tctcagtaag aagccccaca ccagcccctc tccaaatatg 4920
ttggctgttc cttccattaa agtgacccca ctttagagca gcaagtggat ttctgtttct 4980
tacagttcag gaaggaggag tcagctgtga gaacctggag cctgagatgc ttctaagtcc 5040
cactgctact ggggtcaggg aagccagact ccagcatcag cagtcaggag cactaagccc 5100
ttgccaacat cctgtttctc agagaaactg cttccattat aatggttgtc cttttttaag 5160
ctatcaagcc aaacaaccag tgtctaccat tattctcatc acctgaagcc aagggttcta 5220
gcaaaagtca agctgtcttg taatggttga tgtgcctcca gcttctgtct tcagtcactc 5280
cactcttagc ctgctctgaa tcaactctga ccacagttcc ctggagcccc tgccacctgc 5340
tgcccctgcc accttctcca tctgcagtgc tgtgcagcct tctgcactct tgcagagcta 5400
ataggtggag acttgaagga agaggaggaa agtttctcat aatagccttg ctgcaagctc 5460
aaatgggagg tgggcactgt gcccaggagc cttggagcaa aggctgtgcc caacctctga 5520
ctgcatccag gtttggtctt gacagagata agaagccctg gcttttggag ccaaaatcta 5580
ggtcagactt aggcaggatt ctcaaagttt atcagcagaa catgaggcag aagacccttt 5640
ctgctccagc ttcttcaggc tcaaccttca tcagaataga tagaaagaga ggctgtgagg 5700
gttcttaaaa cagaagcaaa tctgactcag agaataaaca acctcctagt aaactacagc 5760
ttagacagag catctggtgg tgagtgtgct cagtgtccta ctcaactgtc tggtatcagc 5820
cctcatgagg acttctcttc tttccctcat agacctccat ctctgttttc cttagcctgc 5880
agaaatctgg atggctattc acagaatgcc tgtgctttca gagttgcatt ttttctctgg 5940
tattctggtt caagcatttg aaggtaggaa aggttctcca agtgcaagaa agccagccct 6000
gagcctcaac tgcctggcta gtgtggtcag taggatgcaa aggctgttga atgccacaag 6060
gccaaacttt aacctgtgta ccacaagcct agcagcagag gcagctctgc tcactggaac 6120
tctctgtctt ctttctcctg agccttttct tttcctgagt tttctagctc tcctcaacct 6180
tacctctgcc ctacccagga caaacccaag agccactgtt tctgtgatgt cctctccagc 6240
cctaattagg catcatgact tcagcctgac cttccatgct cagaagcagt gctaatccac 6300
ttcagatgag ctgctctatg caacacaggc agagcctaca aacctttgca ccagagccct 6360
ccacatatca gtgtttgttc atactcactt caacagcaaa tgtgactgct gagattaaga 6420
ttttacacaa gatggtctgt aatttcacag ttagttttat cccattaggt atgaaagaat 6480
tagcataatt ccccttaaac atgaatgaat cttagatttt ttaataaata gttttggaag 6540
taaagacaga gacatcagga gcacaaggaa tagcctgaga ggacaaacag aacaagaaag 6600
agtctggaaa tacacaggat gttcttggcc tcctcaaagc aagtgcaagc agatagtacc 6660
agcagcccca ggctatcaga gcccagtgaa gagaagtacc atgaaagcca cagctctaac 6720
caccctgttc cagagtgaca gacagtcccc aagacaagcc agcctgagcc agagagagaa 6780
ctgcaagaga aagtttctaa tttaggttct gttagattca gacaagtgca ggtcatcctc 6840
tctccacagc tactcacctc tccagcctaa caaagcctgc agtccacact ccaaccctgg 6900
tgtctcacct cctagcctct cccaacatcc tgctctctga ccatcttctg catctctcat 6960
ctcaccatct cccactgtct acagcctact cttgcaacta ccatctcatt ttctgacatc 7020
ctgtctacat cttctgccat actctgccat ctaccatacc acctcttacc atctaccaca 7080
ccatctttta tctccatccc tctcagaagc ctccaagctg aatcctgctt tatgtgttca 7140
tctcagcccc tgcatggaaa gctgacccca gaggcagaac tattcccaga gagcttggcc 7200
aagaaaaaca aaactaccag cctggccagg ctcaggagta gtaagctgca gtgtctgttg 7260
tgttctagct tcaacagctg caggagttcc actctcaaat gctccacatt tctcacatcc 7320
tcctgattct ggtcactacc catcttcaaa gaacagaata tctcacatca gcatactgtg 7380
aaggactagt catgggtgca gctgctcaga gctgcaaagt cattctggat ggtggagagc 7440
ttacaaacat ttcatgatgc tccccccgct ctgatggctg gagcccaatc cctacacaga 7500
ctcctgctgt atgtgttttc ctttcactct gagccacagc cagagggcag gcattcagtc 7560
tcctcttcag gctggggctg gggcactgag aactcaccca acaccttgct ctcactcctt 7620
ctgcaaaaca agaaagagct ttgtgctgca gtagccatga agaatgaaag gaaggcttta 7680
actaaaaaat gtcagagatt attttcaacc ccttactgtg gatcaccagc aaggaggaaa 7740
cacaacacag agacattttt tcccctcaaa ttatcaaaag aatcactgca tttgttaaag 7800
agagcaactg aatcaggaag cagagttttg aacatatcag aagttaggaa tctgcatcag 7860
agacaaatgc agtcatggtt gtttgctgca taccagccct aatcattaga agcctcatgg 7920
acttcaaaca tcattccctc tgacaagatg ctctagccta actccatgag ataaaataaa 7980
tctgcctttc agagccaaag aagagtccac cagcttcttc tcagtgtgaa caagagctcc 8040
agtcaggtta gtcagtccag tgcagtagag gagaccagtc tgcatcctct aattttcaaa 8100
ggcaagaaga tttgtttacc ctggacacca ggcacaagtg aggtcacaga gctcttagat 8160
atgcagtcct catgagtgag gagactaaag cgcatgccat caagacttca gtgtagagaa 8220
aacctccaaa aaagcctcct cactacttct ggaatagctc agaggccgag gcggcctcgg 8280
cctctgcata aataaaaaaa attagtcagc catggggcgg agaatgggcg gaactgggcg 8340
gagttagggg cgggatgggc ggagttaggg gcgggactat ggttgctgac taattgagat 8400
gcatgctttg catacttctg cctgctgggg agcctgggga ctttccacac ctggttgctg 8460
actaattgag atgcatgctt tgcatacttc tgcctgctgg ggagcctggg gactttccac 8520
accctaactg acacacattc cacagctgca ttaatgaatc ggccaacgcg cggggagagg 8580
cggtttgcgt attgggcgct cttccgcttc ctcgctcact gactcgctgc gctcggtcgt 8640
tcggctgcgg cgagcggtat cagctcactc aaaggcggta atacggttat ccacagaatc 8700
aggggataac gcaggaaaga acatgtgagc aaaaggccag caaaaggcca ggaaccgtaa 8760
aaaggccgcg ttgctggcgt ttttccatag gctccgcccc cctgacgagc atcacaaaaa 8820
tcgacgctca agtcagaggt ggcgaaaccc gacaggacta taaagatacc aggcgtttcc 8880
ccctggaagc tccctcgtgc gctctcctgt tccgaccctg ccgcttaccg gatacctgtc 8940
cgcctttctc ccttcgggaa gcgtggcgct ttctcatagc tcacgctgta ggtatctcag 9000
ttcggtgtag gtcgttcgct ccaagctggg ctgtgtgcac gaaccccccg ttcagcccga 9060
ccgctgcgcc ttatccggta actatcgtct tgagtccaac ccggtaagac acgacttatc 9120
gccactggca gcagccactg gtaacaggat tagcagagcg aggtatgtag gcggtgctac 9180
agagttcttg aagtggtggc ctaactacgg ctacactaga agaacagtat ttggtatctg 9240
cgctctgctg aagccagtta ccttcggaaa aagagttggt agctcttgat ccggcaaaca 9300
aaccaccgct ggtagcggtg gtttttttgt ttgcaagcag cagattacgc gcagaaaaaa 9360
aggatctcaa gaagatcctt tgatcttttc tacggggtct gacgctcagt ggaacgaaaa 9420
ctcacgttaa gggattttgg tcatgagatt atcaaaaagg atcttcacct agatcctttt 9480
aaattaaaaa tgaagtttta aatcaatcta aagtatatat gagtaaactt ggtctgacag 9540
ttaccaatgc ttaatcagtg aggcacctat ctcagcgatc tgtctatttc gttcatccat 9600
agttgcctga ctcctgcaaa ccacgttgtg tctcaaaatc tctgatgtta cattgcacaa 9660
gataaaaata tatcatcatg aacaataaaa ctgtctgctt acataaacag taatacaagg 9720
ggtgttatga gccatattca acgggaaacg tcttgctcga ggccgcgatt aaattccaac 9780
atggatgctg atttatatgg gtataaatgg gctcgcgata atgtcgggca atcaggtgcg 9840
acaatctatc gattgtatgg gaagcccgat gcgccagagt tgtttctgaa acatggcaaa 9900
ggtagcgttg ccaatgatgt tacagatgag atggtcagac taaactggct gacggaattt 9960
atgcctcttc cgaccatcaa gcattttatc cgtactcctg atgatgcatg gttactcacc 10020
actgcgatcc ccgggaaaac agcattccag gtattagaag aatatcctga ttcaggtgaa 10080
aatattgttg atgcgctggc agtgttcctg cgccggttgc attcgattcc tgtttgtaat 10140
tgtcctttta acagcgatcg cgtatttcgt ctcgctcagg cgcaatcacg aatgaataac 10200
ggtttggttg atgcgagtga ttttgatgac gagcgtaatg gctggcctgt tgaacaagtc 10260
tggaaagaaa tgcataagct tttgccattc tcaccggatt cagtcgtcac tcatggtgat 10320
ttctcacttg ataaccttat ttttgacgag gggaaattaa taggttgtat tgatgttgga 10380
cgagtcggaa tcgcagaccg ataccaggat cttgccatcc tatggaactg cctcggtgag 10440
ttttctcctt cattacagaa acggcttttt caaaaatatg gtattgataa tcctgatatg 10500
aataaattgc agtttcattt gatgctcgat gagtttttct aagggcggcc tgccaccata 10560
cccacgccga aacaagcgct catgagcccg aagtggcgag cccgatcttc cccatcggtg 10620
atgtcggcga tataggcgcc agcaaccgca cctgtggcgc cggtgatgag ggcgcgccaa 10680
gtcgacgtcc ggcagtc 10697
<210> 2
<211> 11355
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 2
ttggccactc cctctctgcg cgctcgctcg ctcactgagg ccgggcgacc aaaggtcgcc 60
cgacgcccgg gctttgcccg ggcggcctca gtgagcgagc gagcgcgcag agagggagtg 120
gccaactcca tcactagggg ttcctgctag ctctgggtat ttaagcccga gtgagcacgc 180
agggtctcca ttttgaagcg ggaggttacg cgttcgtcga ctactagtgg gtaccagagc 240
tccctaggtt ctagaaccgg tgacgtctcc catggtgaag cttggatctg agggcggagt 300
tagggcggag ccaatcagcg tgcgccgttc cgaaagttgc cttttatggc tgggcggaga 360
atgggcggtg aacgccgatg attatataag gacgcgccgg gtgtggcaca gctagttccg 420
tcgcagccgg gatttgggtc gcggttcttg tttgtggatc cctgtgatcg tcacttggta 480
agtcactgac tgtctatgcc tgggaaaggg tgggcaggag atggggcagt gcaggaaaag 540
tggcactatg aaccctgcag ccctaggaat gcatctagac aattgtacta accttcttct 600
ctttcctctc ctgacagtcc ggaaagccac catgggccgc tgctgcttct acaccgccgg 660
caccctgagc ctgctgctgc tggtgaccag cgtgaccctg ctggtggccc gcgtgttcca 720
gaaggccgtg gaccagagca tcgagaagaa gatcgtgctg cgcaacggca ccgaggcctt 780
cgacagctgg gagaagcccc ccctgcccgt gtacacccag ttctacttct tcaacgtgac 840
caaccccgag gagatcctgc gcggcgagac cccccgcgtg gaggaggtgg gcccctacac 900
ctaccgcgag ctgcgcaaca aggccaacat ccagttcggc gacaacggca ccaccatcag 960
cgccgtgagc aacaaggcct acgtgttcga gcgcgaccag agcgtgggcg accccaagat 1020
cgacctgatc cgcaccctga acatccccgt gctgaccgtg atcgagtgga gccaggtgca 1080
cttcctgcgc gagatcatcg aggccatgct gaaggcctac cagcagaagc tgttcgtgac 1140
ccacaccgtg gacgagctgc tgtggggcta caaggacgag atcctgagcc tgatccacgt 1200
gttccgcccc gacatcagcc cctacttcgg cctgttctac gagaagaacg gcaccaacga 1260
cggcgactac gtgttcctga ccggcgagga cagctacctg aacttcacca agatcgtgga 1320
gtggaacggc aagaccagcc tggactggtg gatcaccgac aagtgcaaca tgatcaacgg 1380
caccgacggc gacagcttcc accccctgat caccaaggac gaggtgctgt acgtgttccc 1440
cagcgacttc tgccgcagcg tgtacatcac cttcagcgac tacgagagcg tgcagggcct 1500
gcccgccttc cgctacaagg tgcccgccga gatcctggcc aacaccagcg acaacgccgg 1560
cttctgcatc cccgagggca actgcctggg cagcggcgtg ctgaacgtga gcatctgcaa 1620
gaacggcgcc cccatcatca tgagcttccc ccacttctac caggccgacg agcgcttcgt 1680
gagcgccatc gagggcatgc accccaacca ggaggaccac gagaccttcg tggacatcaa 1740
ccccctgacc ggcatcatcc tgaaggccgc caagcgcttc cagatcaaca tctacgtgaa 1800
gaagctggac gacttcgtgg agaccggcga catccgcacc atggtgttcc ccgtgatgta 1860
cctgaacgag agcgtgcaca tcgacaagga gaccgccagc cgcctgaaga gcatgatcaa 1920
caccaccctg atcatcacca acatccccta catcatcatg gccctgggcg tgttcttcgg 1980
cctggtgttc acctggctgg cctgcaaggg ccagggcagc atggacgagg gcaccgccga 2040
cgagcgcgcc cccctgatcc gcacctgatt gtggccgaac cgccgaactc agaggccggc 2100
cccagaaaac ccgagcgagt agggggcggc gcgcaggagg gaggagaact gggggcgcgg 2160
gaggctggtg ggtgtggggg gtggagatgt agaagatgtg acgccgcggc ccggcgggtg 2220
ccagattagc ggacgcggtg cccgcggttg caacgggatc ccgggcgctg cagcttggga 2280
ggcggctctc cccaggcggc gtccgcggag acacccatcc gtgaacccca ggtcccgggc 2340
cgccggctcg ccgcgcacca ggggccggcg gacagaagag cggccgagcg gctcgaggct 2400
gggggaccgc gggcgcggcc gcgcgctgcc gggcgggagg ctggggggcc ggggccgggg 2460
ccgtgccccg gagcgggtcg gaggccgggg ccggggccgg gggacggcgg ctccccgcgc 2520
ggctccagcg gctcggggat cccggccggg ccccgcaggg accatgatgg aattcagcag 2580
ccccagcaga gaggaatgcc ccaagcctct gagccgggtg tcaatcatgg ccggatctct 2640
gacaggactg ctgctgcttc aggccgtgtc ttgggcttct ggcgctagac cttgcatccc 2700
caagagcttc ggctacagca gcgtcgtgtg cgtgtgcaat gccacctact gcgacagctt 2760
cgaccctcct acctttcctg ctctgggcac cttcagcaga tacgagagca ccagatccgg 2820
cagacggatg gaactgagca tgggacccat ccaggccaat cacacaggca ctggcctgct 2880
gctgacactg cagcctgagc agaaattcca gaaagtgaaa ggcttcggcg gagccatgac 2940
agatgccgcc gctctgaata tcctggctct gtctccacca gctcagaacc tgctgctcaa 3000
gagctacttc agcgaggaag gcatcggcta caacatcatc agagtgccca tggccagctg 3060
cgacttcagc atcaggacct acacctacgc cgacacaccc gacgatttcc agctgcacaa 3120
cttcagcctg cctgaagagg acaccaagct gaagatccct ctgatccaca gagccctgca 3180
gctggcacaa agacccgtgt cactgctggc ctctccatgg acatctccca cctggctgaa 3240
aacaaatggc gccgtgaatg gcaagggcag cctgaaaggc caacctggcg acatctacca 3300
ccagacctgg gccagatact tcgtgaagtt cctggacgcc tatgccgagc acaagctgca 3360
gttttgggcc gtgacagccg agaacgaacc ttctgctgga ctgctgagcg gctacccctt 3420
tcagtgcctg ggctttacac ccgagcacca gcgggacttt atcgcccgtg atctgggacc 3480
cacactggcc aatagcaccc accataatgt gcggctgctg atgctggacg accagagact 3540
gcttctgccc cactgggcta aagtggtgct gacagatcct gaggccgcca aatacgtgca 3600
cggaatcgcc gtgcactggt atctggactt tctggcccct gccaaggcca cactgggaga 3660
gacacacaga ctgttcccca acaccatgct gttcgccagc gaagcctgtg tgggcagcaa 3720
gttttgggaa cagagcgtgc ggctcggcag ctgggataga ggcatgcagt acagccacag 3780
catcatcacc aacctgctgt accacgtcgt cggctggacc gactggaatc tggccctgaa 3840
tcctgaaggc ggccctaact gggtccgaaa cttcgtggac agccccatca tcgtggacat 3900
caccaaggac accttctaca agcagcccat gttctaccac ctgggacact tcagcaagtt 3960
catccccgag ggctctcagc gcgttggact ggtggcttcc cagaagaacg atctggacgc 4020
cgtggctctg atgcaccctg atggatctgc tgtggtggtg gtcctgaacc gcagcagcaa 4080
agatgtgccc ctgaccatca aggatcccgc cgtgggattc ctggaaacaa tcagccctgg 4140
ctactccatc cacacctacc tgtggcgtag acagtgacaa ttgttaatta agtttaaacc 4200
ctcgaggccg caagccgcat cgataccgtc gactagagct cgctgatcag cctcgactgt 4260
gccttctagt tgccagccat ctgttgtttg cccctccccc gtgccttcct tgaccctgga 4320
aggtgccact cccactgtcc tttcctaata aaatgaggaa attgcatcgc attgtctgag 4380
taggtgtcat tctattctgg ggggtggggt ggggcaggac agcaaggggg aggattggga 4440
agacaatagc aggcatgctg gggagagatc cacgataaca aacagctttt ttggggtgaa 4500
catattgact gaattccctg caggttggcc actccctctc tgcgcgctcg ctcgctcact 4560
gaggccgccc gggcaaagcc cgggcgtcgg gcgacctttg gtcgcccggc ctcagtgagc 4620
gagcgagcgc gcagagaggg agtggccaac tccatcacta ggggttcctg cggccgctcg 4680
tacggtctcg aggaattcct gcaggataac ttgccaacct cattctaaaa tgtatataga 4740
agcccaaaag acaataacaa aaatattctt gtagaacaaa atgggaaaga atgttccact 4800
aaatatcaag atttagagca aagcatgaga tgtgtgggga tagacagtga ggctgataaa 4860
atagagtaga gctcagaaac agacccattg atatatgtaa gtgacctatg aaaaaaatat 4920
ggcattttac aatgggaaaa tgatggtctt tttctttttt agaaaaacag ggaaatatat 4980
ttatatgtaa aaaataaaag ggaacccata tgtcatacca tacacacaaa aaaattccag 5040
tgaattataa gtctaaatgg agaaggcaaa actttaaatc ttttagaaaa taatatagaa 5100
gcatgcagac cagcctggcc aacatgatga aaccctctct actaataata aaatcagtag 5160
aactactcag gactactttg agtgggaagt ccttttctat gaagacttct ttggccaaaa 5220
ttaggctcta aatgcaagga gatagtgcat catgcctggc tgcacttact gataaatgat 5280
gttatcacca tctttaacca aatgcacagg aacaagttat ggtactgatg tgctggattg 5340
agaaggagct ctacttcctt gacaggacac atttgtatca acttaaaaaa gcagattttt 5400
gccagcagaa ctattcattc agaggtagga aacttagaat agatgatgtc actgattagc 5460
atggcttccc catctccaca gctgcttccc acccaggttg cccacagttg agtttgtcca 5520
gtgctcaggg ctgcccactc tcagtaagaa gccccacacc agcccctctc caaatatgtt 5580
ggctgttcct tccattaaag tgaccccact ttagagcagc aagtggattt ctgtttctta 5640
cagttcagga aggaggagtc agctgtgaga acctggagcc tgagatgctt ctaagtccca 5700
ctgctactgg ggtcagggaa gccagactcc agcatcagca gtcaggagca ctaagccctt 5760
gccaacatcc tgtttctcag agaaactgct tccattataa tggttgtcct tttttaagct 5820
atcaagccaa acaaccagtg tctaccatta ttctcatcac ctgaagccaa gggttctagc 5880
aaaagtcaag ctgtcttgta atggttgatg tgcctccagc ttctgtcttc agtcactcca 5940
ctcttagcct gctctgaatc aactctgacc acagttccct ggagcccctg ccacctgctg 6000
cccctgccac cttctccatc tgcagtgctg tgcagccttc tgcactcttg cagagctaat 6060
aggtggagac ttgaaggaag aggaggaaag tttctcataa tagccttgct gcaagctcaa 6120
atgggaggtg ggcactgtgc ccaggagcct tggagcaaag gctgtgccca acctctgact 6180
gcatccaggt ttggtcttga cagagataag aagccctggc ttttggagcc aaaatctagg 6240
tcagacttag gcaggattct caaagtttat cagcagaaca tgaggcagaa gaccctttct 6300
gctccagctt cttcaggctc aaccttcatc agaatagata gaaagagagg ctgtgagggt 6360
tcttaaaaca gaagcaaatc tgactcagag aataaacaac ctcctagtaa actacagctt 6420
agacagagca tctggtggtg agtgtgctca gtgtcctact caactgtctg gtatcagccc 6480
tcatgaggac ttctcttctt tccctcatag acctccatct ctgttttcct tagcctgcag 6540
aaatctggat ggctattcac agaatgcctg tgctttcaga gttgcatttt ttctctggta 6600
ttctggttca agcatttgaa ggtaggaaag gttctccaag tgcaagaaag ccagccctga 6660
gcctcaactg cctggctagt gtggtcagta ggatgcaaag gctgttgaat gccacaaggc 6720
caaactttaa cctgtgtacc acaagcctag cagcagaggc agctctgctc actggaactc 6780
tctgtcttct ttctcctgag ccttttcttt tcctgagttt tctagctctc ctcaacctta 6840
cctctgccct acccaggaca aacccaagag ccactgtttc tgtgatgtcc tctccagccc 6900
taattaggca tcatgacttc agcctgacct tccatgctca gaagcagtgc taatccactt 6960
cagatgagct gctctatgca acacaggcag agcctacaaa cctttgcacc agagccctcc 7020
acatatcagt gtttgttcat actcacttca acagcaaatg tgactgctga gattaagatt 7080
ttacacaaga tggtctgtaa tttcacagtt agttttatcc cattaggtat gaaagaatta 7140
gcataattcc ccttaaacat gaatgaatct tagatttttt aataaatagt tttggaagta 7200
aagacagaga catcaggagc acaaggaata gcctgagagg acaaacagaa caagaaagag 7260
tctggaaata cacaggatgt tcttggcctc ctcaaagcaa gtgcaagcag atagtaccag 7320
cagccccagg ctatcagagc ccagtgaaga gaagtaccat gaaagccaca gctctaacca 7380
ccctgttcca gagtgacaga cagtccccaa gacaagccag cctgagccag agagagaact 7440
gcaagagaaa gtttctaatt taggttctgt tagattcaga caagtgcagg tcatcctctc 7500
tccacagcta ctcacctctc cagcctaaca aagcctgcag tccacactcc aaccctggtg 7560
tctcacctcc tagcctctcc caacatcctg ctctctgacc atcttctgca tctctcatct 7620
caccatctcc cactgtctac agcctactct tgcaactacc atctcatttt ctgacatcct 7680
gtctacatct tctgccatac tctgccatct accataccac ctcttaccat ctaccacacc 7740
atcttttatc tccatccctc tcagaagcct ccaagctgaa tcctgcttta tgtgttcatc 7800
tcagcccctg catggaaagc tgaccccaga ggcagaacta ttcccagaga gcttggccaa 7860
gaaaaacaaa actaccagcc tggccaggct caggagtagt aagctgcagt gtctgttgtg 7920
ttctagcttc aacagctgca ggagttccac tctcaaatgc tccacatttc tcacatcctc 7980
ctgattctgg tcactaccca tcttcaaaga acagaatatc tcacatcagc atactgtgaa 8040
ggactagtca tgggtgcagc tgctcagagc tgcaaagtca ttctggatgg tggagagctt 8100
acaaacattt catgatgctc cccccgctct gatggctgga gcccaatccc tacacagact 8160
cctgctgtat gtgttttcct ttcactctga gccacagcca gagggcaggc attcagtctc 8220
ctcttcaggc tggggctggg gcactgagaa ctcacccaac accttgctct cactccttct 8280
gcaaaacaag aaagagcttt gtgctgcagt agccatgaag aatgaaagga aggctttaac 8340
taaaaaatgt cagagattat tttcaacccc ttactgtgga tcaccagcaa ggaggaaaca 8400
caacacagag acattttttc ccctcaaatt atcaaaagaa tcactgcatt tgttaaagag 8460
agcaactgaa tcaggaagca gagttttgaa catatcagaa gttaggaatc tgcatcagag 8520
acaaatgcag tcatggttgt ttgctgcata ccagccctaa tcattagaag cctcatggac 8580
ttcaaacatc attccctctg acaagatgct ctagcctaac tccatgagat aaaataaatc 8640
tgcctttcag agccaaagaa gagtccacca gcttcttctc agtgtgaaca agagctccag 8700
tcaggttagt cagtccagtg cagtagagga gaccagtctg catcctctaa ttttcaaagg 8760
caagaagatt tgtttaccct ggacaccagg cacaagtgag gtcacagagc tcttagatat 8820
gcagtcctca tgagtgagga gactaaagcg catgccatca agacttcagt gtagagaaaa 8880
cctccaaaaa agcctcctca ctacttctgg aatagctcag aggccgaggc ggcctcggcc 8940
tctgcataaa taaaaaaaat tagtcagcca tggggcggag aatgggcgga actgggcgga 9000
gttaggggcg ggatgggcgg agttaggggc gggactatgg ttgctgacta attgagatgc 9060
atgctttgca tacttctgcc tgctggggag cctggggact ttccacacct ggttgctgac 9120
taattgagat gcatgctttg catacttctg cctgctgggg agcctgggga ctttccacac 9180
cctaactgac acacattcca cagctgcatt aatgaatcgg ccaacgcgcg gggagaggcg 9240
gtttgcgtat tgggcgctct tccgcttcct cgctcactga ctcgctgcgc tcggtcgttc 9300
ggctgcggcg agcggtatca gctcactcaa aggcggtaat acggttatcc acagaatcag 9360
gggataacgc aggaaagaac atgtgagcaa aaggccagca aaaggccagg aaccgtaaaa 9420
aggccgcgtt gctggcgttt ttccataggc tccgcccccc tgacgagcat cacaaaaatc 9480
gacgctcaag tcagaggtgg cgaaacccga caggactata aagataccag gcgtttcccc 9540
ctggaagctc cctcgtgcgc tctcctgttc cgaccctgcc gcttaccgga tacctgtccg 9600
cctttctccc ttcgggaagc gtggcgcttt ctcatagctc acgctgtagg tatctcagtt 9660
cggtgtaggt cgttcgctcc aagctgggct gtgtgcacga accccccgtt cagcccgacc 9720
gctgcgcctt atccggtaac tatcgtcttg agtccaaccc ggtaagacac gacttatcgc 9780
cactggcagc agccactggt aacaggatta gcagagcgag gtatgtaggc ggtgctacag 9840
agttcttgaa gtggtggcct aactacggct acactagaag aacagtattt ggtatctgcg 9900
ctctgctgaa gccagttacc ttcggaaaaa gagttggtag ctcttgatcc ggcaaacaaa 9960
ccaccgctgg tagcggtggt ttttttgttt gcaagcagca gattacgcgc agaaaaaaag 10020
gatctcaaga agatcctttg atcttttcta cggggtctga cgctcagtgg aacgaaaact 10080
cacgttaagg gattttggtc atgagattat caaaaaggat cttcacctag atccttttaa 10140
attaaaaatg aagttttaaa tcaatctaaa gtatatatga gtaaacttgg tctgacagtt 10200
accaatgctt aatcagtgag gcacctatct cagcgatctg tctatttcgt tcatccatag 10260
ttgcctgact cctgcaaacc acgttgtgtc tcaaaatctc tgatgttaca ttgcacaaga 10320
taaaaatata tcatcatgaa caataaaact gtctgcttac ataaacagta atacaagggg 10380
tgttatgagc catattcaac gggaaacgtc ttgctcgagg ccgcgattaa attccaacat 10440
ggatgctgat ttatatgggt ataaatgggc tcgcgataat gtcgggcaat caggtgcgac 10500
aatctatcga ttgtatggga agcccgatgc gccagagttg tttctgaaac atggcaaagg 10560
tagcgttgcc aatgatgtta cagatgagat ggtcagacta aactggctga cggaatttat 10620
gcctcttccg accatcaagc attttatccg tactcctgat gatgcatggt tactcaccac 10680
tgcgatcccc gggaaaacag cattccaggt attagaagaa tatcctgatt caggtgaaaa 10740
tattgttgat gcgctggcag tgttcctgcg ccggttgcat tcgattcctg tttgtaattg 10800
tccttttaac agcgatcgcg tatttcgtct cgctcaggcg caatcacgaa tgaataacgg 10860
tttggttgat gcgagtgatt ttgatgacga gcgtaatggc tggcctgttg aacaagtctg 10920
gaaagaaatg cataagcttt tgccattctc accggattca gtcgtcactc atggtgattt 10980
ctcacttgat aaccttattt ttgacgaggg gaaattaata ggttgtattg atgttggacg 11040
agtcggaatc gcagaccgat accaggatct tgccatccta tggaactgcc tcggtgagtt 11100
ttctccttca ttacagaaac ggctttttca aaaatatggt attgataatc ctgatatgaa 11160
taaattgcag tttcatttga tgctcgatga gtttttctaa gggcggcctg ccaccatacc 11220
cacgccgaaa caagcgctca tgagcccgaa gtggcgagcc cgatcttccc catcggtgat 11280
gtcggcgata taggcgccag caaccgcacc tgtggcgccg gtgatgaggg cgcgccaagt 11340
cgacgtccgg cagtc 11355
<210> 3
<211> 11420
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 3
ttggccactc cctctctgcg cgctcgctcg ctcactgagg ccgggcgacc aaaggtcgcc 60
cgacgcccgg gctttgcccg ggcggcctca gtgagcgagc gagcgcgcag agagggagtg 120
gccaactcca tcactagggg ttcctgctag ctctgggtat ttaagcccga gtgagcacgc 180
agggtctcca ttttgaagcg ggaggttacg cgttcgtcga ctactagtgg gtaccagagc 240
tccctaggtt ctagaaccgg tgacgtctcc catggtgaag cttggatctg agggcggagt 300
tagggcggag ccaatcagcg tgcgccgttc cgaaagttgc cttttatggc tgggcggaga 360
atgggcggtg aacgccgatg attatataag gacgcgccgg gtgtggcaca gctagttccg 420
tcgcagccgg gatttgggtc gcggttcttg tttgtggatc cctgtgatcg tcacttggta 480
agtcactgac tgtctatgcc tgggaaaggg tgggcaggag atggggcagt gcaggaaaag 540
tggcactatg aaccctgcag ccctaggaat gcatctagac aattgtacta accttcttct 600
ctttcctctc ctgacagtcc ggaaagccac catggaattc agcagcccca gcagagagga 660
atgccccaag cctctgagcc gggtgtcaat catggccgga tctctgacag gactgctgct 720
gcttcaggcc gtgtcttggg cttctggcgc tagaccttgc atccccaaga gcttcggcta 780
cagcagcgtc gtgtgcgtgt gcaatgccac ctactgcgac agcttcgacc ctcctacctt 840
tcctgctctg ggcaccttca gcagatacga gagcaccaga tccggcagac ggatggaact 900
gagcatggga cccatccagg ccaatcacac aggcactggc ctgctgctga cactgcagcc 960
tgagcagaaa ttccagaaag tgaaaggctt cggcggagcc atgacagatg ccgccgctct 1020
gaatatcctg gctctgtctc caccagctca gaacctgctg ctcaagagct acttcagcga 1080
ggaaggcatc ggctacaaca tcatcagagt gcccatggcc agctgcgact tcagcatcag 1140
gacctacacc tacgccgaca cacccgacga tttccagctg cacaacttca gcctgcctga 1200
agaggacacc aagctgaaga tccctctgat ccacagagcc ctgcagctgg cacaaagacc 1260
cgtgtcactg ctggcctctc catggacatc tcccacctgg ctgaaaacaa atggcgccgt 1320
gaatggcaag ggcagcctga aaggccaacc tggcgacatc taccaccaga cctgggccag 1380
atacttcgtg aagttcctgg acgcctatgc cgagcacaag ctgcagtttt gggccgtgac 1440
agccgagaac gaaccttctg ctggactgct gagcggctac ccctttcagt gcctgggctt 1500
tacacccgag caccagcggg actttatcgc ccgtgatctg ggacccacac tggccaatag 1560
cacccaccat aatgtgcggc tgctgatgct ggacgaccag agactgcttc tgccccactg 1620
ggctaaagtg gtgctgacag atcctgaggc cgccaaatac gtgcacggaa tcgccgtgca 1680
ctggtatctg gactttctgg cccctgccaa ggccacactg ggagagacac acagactgtt 1740
ccccaacacc atgctgttcg ccagcgaagc ctgtgtgggc agcaagtttt gggaacagag 1800
cgtgcggctc ggcagctggg atagaggcat gcagtacagc cacagcatca tcaccaacct 1860
gctgtaccac gtcgtcggct ggaccgactg gaatctggcc ctgaatcctg aaggcggccc 1920
taactgggtc cgaaacttcg tggacagccc catcatcgtg gacatcacca aggacacctt 1980
ctacaagcag cccatgttct accacctggg acacttcagc aagttcatcc ccgagggctc 2040
tcagcgcgtt ggactggtgg cttcccagaa gaacgatctg gacgccgtgg ctctgatgca 2100
ccctgatgga tctgctgtgg tggtggtcct gaaccgcagc agcaaagatg tgcccctgac 2160
catcaaggat cccgccgtgg gattcctgga aacaatcagc cctggctact ccatccacac 2220
ctacctgtgg cgtagacagt gacaattgtt aattaagttt catcgatacc gtcgactaga 2280
gctcgctgat cagcctcgac tgtgccttct agttgccagc catctgttgt ttgcccctcc 2340
cccgtgcctt ccttgaccct ggaaggtgcc actcccactg tcctttccta ataaaatgag 2400
gaaattgcat cgcattgtct gagtaggtgt cattctattc tggggggtgg ggtggggcag 2460
gacagcaagg gggaggattg ggaagacaat agcaggcatg ctggggagag atccacgata 2520
acaaacagct tttttggggg ggcggagtta gggcggagcc aatcagcgtg cgccgttccg 2580
aaagttgcct tttatggctg ggcggagaat gggcggtgaa cgccgatgat tatataagga 2640
cgcgccgggt gtggcacagc tagttccgtc gcagccggga tttgggtcgc ggttcttgtt 2700
tgtggatccc tgtgatcgtc acttggtaag tcactgactg tctatgcctg ggaaagggtg 2760
ggcaggagat ggggcagtgc aggaaaagtg gcactatgaa ccctgcagcc ctaggaatgc 2820
atctagacaa ttgtactaac cttcttctct ttcctctcct gacagtccgg aaagccacca 2880
tgggccgctg ctgcttctac accgccggca ccctgagcct gctgctgctg gtgaccagcg 2940
tgaccctgct ggtggcccgc gtgttccaga aggccgtgga ccagagcatc gagaagaaga 3000
tcgtgctgcg caacggcacc gaggccttcg acagctggga gaagcccccc ctgcccgtgt 3060
acacccagtt ctacttcttc aacgtgacca accccgagga gatcctgcgc ggcgagaccc 3120
cccgcgtgga ggaggtgggc ccctacacct accgcgagct gcgcaacaag gccaacatcc 3180
agttcggcga caacggcacc accatcagcg ccgtgagcaa caaggcctac gtgttcgagc 3240
gcgaccagag cgtgggcgac cccaagatcg acctgatccg caccctgaac atccccgtgc 3300
tgaccgtgat cgagtggagc caggtgcact tcctgcgcga gatcatcgag gccatgctga 3360
aggcctacca gcagaagctg ttcgtgaccc acaccgtgga cgagctgctg tggggctaca 3420
aggacgagat cctgagcctg atccacgtgt tccgccccga catcagcccc tacttcggcc 3480
tgttctacga gaagaacggc accaacgacg gcgactacgt gttcctgacc ggcgaggaca 3540
gctacctgaa cttcaccaag atcgtggagt ggaacggcaa gaccagcctg gactggtgga 3600
tcaccgacaa gtgcaacatg atcaacggca ccgacggcga cagcttccac cccctgatca 3660
ccaaggacga ggtgctgtac gtgttcccca gcgacttctg ccgcagcgtg tacatcacct 3720
tcagcgacta cgagagcgtg cagggcctgc ccgccttccg ctacaaggtg cccgccgaga 3780
tcctggccaa caccagcgac aacgccggct tctgcatccc cgagggcaac tgcctgggca 3840
gcggcgtgct gaacgtgagc atctgcaaga acggcgcccc catcatcatg agcttccccc 3900
acttctacca ggccgacgag cgcttcgtga gcgccatcga gggcatgcac cccaaccagg 3960
aggaccacga gaccttcgtg gacatcaacc ccctgaccgg catcatcctg aaggccgcca 4020
agcgcttcca gatcaacatc tacgtgaaga agctggacga cttcgtggag accggcgaca 4080
tccgcaccat ggtgttcccc gtgatgtacc tgaacgagag cgtgcacatc gacaaggaga 4140
ccgccagccg cctgaagagc atgatcaaca ccaccctgat catcaccaac atcccctaca 4200
tcatcatggc cctgggcgtg ttcttcggcc tggtgttcac ctggctggcc tgcaagggcc 4260
agggcagcat ggacgagggc accgccgacg agcgcgcccc cctgatccgc acctgaccca 4320
ggggactcaa tcagcctcga agacatgata agatacattg atgagtttgg acaaaccaca 4380
acaagaatgc agtgaaaaaa atgctttatt tgtgaaattt gtgatgctat tgctttattt 4440
gtaaccatta taagctgcaa taaacaagtt aacaacaaca attgcattca ttttatgttt 4500
caggttcagg gggagatgtg ggaggttttt taaagcaagt aaaacctcta caaatgtggt 4560
atgaacatat tgactgaatt ccctgcaggt tggccactcc ctctctgcgc gctcgctcgc 4620
tcactgaggc cgcccgggca aagcccgggc gtcgggcgac ctttggtcgc ccggcctcag 4680
tgagcgagcg agcgcgcaga gagggagtgg ccaactccat cactaggggt tcctgcggcc 4740
gctcgtacgg tctcgaggaa ttcctgcagg ataacttgcc aacctcattc taaaatgtat 4800
atagaagccc aaaagacaat aacaaaaata ttcttgtaga acaaaatggg aaagaatgtt 4860
ccactaaata tcaagattta gagcaaagca tgagatgtgt ggggatagac agtgaggctg 4920
ataaaataga gtagagctca gaaacagacc cattgatata tgtaagtgac ctatgaaaaa 4980
aatatggcat tttacaatgg gaaaatgatg gtctttttct tttttagaaa aacagggaaa 5040
tatatttata tgtaaaaaat aaaagggaac ccatatgtca taccatacac acaaaaaaat 5100
tccagtgaat tataagtcta aatggagaag gcaaaacttt aaatctttta gaaaataata 5160
tagaagcatg cagaccagcc tggccaacat gatgaaaccc tctctactaa taataaaatc 5220
agtagaacta ctcaggacta ctttgagtgg gaagtccttt tctatgaaga cttctttggc 5280
caaaattagg ctctaaatgc aaggagatag tgcatcatgc ctggctgcac ttactgataa 5340
atgatgttat caccatcttt aaccaaatgc acaggaacaa gttatggtac tgatgtgctg 5400
gattgagaag gagctctact tccttgacag gacacatttg tatcaactta aaaaagcaga 5460
tttttgccag cagaactatt cattcagagg taggaaactt agaatagatg atgtcactga 5520
ttagcatggc ttccccatct ccacagctgc ttcccaccca ggttgcccac agttgagttt 5580
gtccagtgct cagggctgcc cactctcagt aagaagcccc acaccagccc ctctccaaat 5640
atgttggctg ttccttccat taaagtgacc ccactttaga gcagcaagtg gatttctgtt 5700
tcttacagtt caggaaggag gagtcagctg tgagaacctg gagcctgaga tgcttctaag 5760
tcccactgct actggggtca gggaagccag actccagcat cagcagtcag gagcactaag 5820
cccttgccaa catcctgttt ctcagagaaa ctgcttccat tataatggtt gtcctttttt 5880
aagctatcaa gccaaacaac cagtgtctac cattattctc atcacctgaa gccaagggtt 5940
ctagcaaaag tcaagctgtc ttgtaatggt tgatgtgcct ccagcttctg tcttcagtca 6000
ctccactctt agcctgctct gaatcaactc tgaccacagt tccctggagc ccctgccacc 6060
tgctgcccct gccaccttct ccatctgcag tgctgtgcag ccttctgcac tcttgcagag 6120
ctaataggtg gagacttgaa ggaagaggag gaaagtttct cataatagcc ttgctgcaag 6180
ctcaaatggg aggtgggcac tgtgcccagg agccttggag caaaggctgt gcccaacctc 6240
tgactgcatc caggtttggt cttgacagag ataagaagcc ctggcttttg gagccaaaat 6300
ctaggtcaga cttaggcagg attctcaaag tttatcagca gaacatgagg cagaagaccc 6360
tttctgctcc agcttcttca ggctcaacct tcatcagaat agatagaaag agaggctgtg 6420
agggttctta aaacagaagc aaatctgact cagagaataa acaacctcct agtaaactac 6480
agcttagaca gagcatctgg tggtgagtgt gctcagtgtc ctactcaact gtctggtatc 6540
agccctcatg aggacttctc ttctttccct catagacctc catctctgtt ttccttagcc 6600
tgcagaaatc tggatggcta ttcacagaat gcctgtgctt tcagagttgc attttttctc 6660
tggtattctg gttcaagcat ttgaaggtag gaaaggttct ccaagtgcaa gaaagccagc 6720
cctgagcctc aactgcctgg ctagtgtggt cagtaggatg caaaggctgt tgaatgccac 6780
aaggccaaac tttaacctgt gtaccacaag cctagcagca gaggcagctc tgctcactgg 6840
aactctctgt cttctttctc ctgagccttt tcttttcctg agttttctag ctctcctcaa 6900
ccttacctct gccctaccca ggacaaaccc aagagccact gtttctgtga tgtcctctcc 6960
agccctaatt aggcatcatg acttcagcct gaccttccat gctcagaagc agtgctaatc 7020
cacttcagat gagctgctct atgcaacaca ggcagagcct acaaaccttt gcaccagagc 7080
cctccacata tcagtgtttg ttcatactca cttcaacagc aaatgtgact gctgagatta 7140
agattttaca caagatggtc tgtaatttca cagttagttt tatcccatta ggtatgaaag 7200
aattagcata attcccctta aacatgaatg aatcttagat tttttaataa atagttttgg 7260
aagtaaagac agagacatca ggagcacaag gaatagcctg agaggacaaa cagaacaaga 7320
aagagtctgg aaatacacag gatgttcttg gcctcctcaa agcaagtgca agcagatagt 7380
accagcagcc ccaggctatc agagcccagt gaagagaagt accatgaaag ccacagctct 7440
aaccaccctg ttccagagtg acagacagtc cccaagacaa gccagcctga gccagagaga 7500
gaactgcaag agaaagtttc taatttaggt tctgttagat tcagacaagt gcaggtcatc 7560
ctctctccac agctactcac ctctccagcc taacaaagcc tgcagtccac actccaaccc 7620
tggtgtctca cctcctagcc tctcccaaca tcctgctctc tgaccatctt ctgcatctct 7680
catctcacca tctcccactg tctacagcct actcttgcaa ctaccatctc attttctgac 7740
atcctgtcta catcttctgc catactctgc catctaccat accacctctt accatctacc 7800
acaccatctt ttatctccat ccctctcaga agcctccaag ctgaatcctg ctttatgtgt 7860
tcatctcagc ccctgcatgg aaagctgacc ccagaggcag aactattccc agagagcttg 7920
gccaagaaaa acaaaactac cagcctggcc aggctcagga gtagtaagct gcagtgtctg 7980
ttgtgttcta gcttcaacag ctgcaggagt tccactctca aatgctccac atttctcaca 8040
tcctcctgat tctggtcact acccatcttc aaagaacaga atatctcaca tcagcatact 8100
gtgaaggact agtcatgggt gcagctgctc agagctgcaa agtcattctg gatggtggag 8160
agcttacaaa catttcatga tgctcccccc gctctgatgg ctggagccca atccctacac 8220
agactcctgc tgtatgtgtt ttcctttcac tctgagccac agccagaggg caggcattca 8280
gtctcctctt caggctgggg ctggggcact gagaactcac ccaacacctt gctctcactc 8340
cttctgcaaa acaagaaaga gctttgtgct gcagtagcca tgaagaatga aaggaaggct 8400
ttaactaaaa aatgtcagag attattttca accccttact gtggatcacc agcaaggagg 8460
aaacacaaca cagagacatt ttttcccctc aaattatcaa aagaatcact gcatttgtta 8520
aagagagcaa ctgaatcagg aagcagagtt ttgaacatat cagaagttag gaatctgcat 8580
cagagacaaa tgcagtcatg gttgtttgct gcataccagc cctaatcatt agaagcctca 8640
tggacttcaa acatcattcc ctctgacaag atgctctagc ctaactccat gagataaaat 8700
aaatctgcct ttcagagcca aagaagagtc caccagcttc ttctcagtgt gaacaagagc 8760
tccagtcagg ttagtcagtc cagtgcagta gaggagacca gtctgcatcc tctaattttc 8820
aaaggcaaga agatttgttt accctggaca ccaggcacaa gtgaggtcac agagctctta 8880
gatatgcagt cctcatgagt gaggagacta aagcgcatgc catcaagact tcagtgtaga 8940
gaaaacctcc aaaaaagcct cctcactact tctggaatag ctcagaggcc gaggcggcct 9000
cggcctctgc ataaataaaa aaaattagtc agccatgggg cggagaatgg gcggaactgg 9060
gcggagttag gggcgggatg ggcggagtta ggggcgggac tatggttgct gactaattga 9120
gatgcatgct ttgcatactt ctgcctgctg gggagcctgg ggactttcca cacctggttg 9180
ctgactaatt gagatgcatg ctttgcatac ttctgcctgc tggggagcct ggggactttc 9240
cacaccctaa ctgacacaca ttccacagct gcattaatga atcggccaac gcgcggggag 9300
aggcggtttg cgtattgggc gctcttccgc ttcctcgctc actgactcgc tgcgctcggt 9360
cgttcggctg cggcgagcgg tatcagctca ctcaaaggcg gtaatacggt tatccacaga 9420
atcaggggat aacgcaggaa agaacatgtg agcaaaaggc cagcaaaagg ccaggaaccg 9480
taaaaaggcc gcgttgctgg cgtttttcca taggctccgc ccccctgacg agcatcacaa 9540
aaatcgacgc tcaagtcaga ggtggcgaaa cccgacagga ctataaagat accaggcgtt 9600
tccccctgga agctccctcg tgcgctctcc tgttccgacc ctgccgctta ccggatacct 9660
gtccgccttt ctcccttcgg gaagcgtggc gctttctcat agctcacgct gtaggtatct 9720
cagttcggtg taggtcgttc gctccaagct gggctgtgtg cacgaacccc ccgttcagcc 9780
cgaccgctgc gccttatccg gtaactatcg tcttgagtcc aacccggtaa gacacgactt 9840
atcgccactg gcagcagcca ctggtaacag gattagcaga gcgaggtatg taggcggtgc 9900
tacagagttc ttgaagtggt ggcctaacta cggctacact agaagaacag tatttggtat 9960
ctgcgctctg ctgaagccag ttaccttcgg aaaaagagtt ggtagctctt gatccggcaa 10020
acaaaccacc gctggtagcg gtggtttttt tgtttgcaag cagcagatta cgcgcagaaa 10080
aaaaggatct caagaagatc ctttgatctt ttctacgggg tctgacgctc agtggaacga 10140
aaactcacgt taagggattt tggtcatgag attatcaaaa aggatcttca cctagatcct 10200
tttaaattaa aaatgaagtt ttaaatcaat ctaaagtata tatgagtaaa cttggtctga 10260
cagttaccaa tgcttaatca gtgaggcacc tatctcagcg atctgtctat ttcgttcatc 10320
catagttgcc tgactcctgc aaaccacgtt gtgtctcaaa atctctgatg ttacattgca 10380
caagataaaa atatatcatc atgaacaata aaactgtctg cttacataaa cagtaataca 10440
aggggtgtta tgagccatat tcaacgggaa acgtcttgct cgaggccgcg attaaattcc 10500
aacatggatg ctgatttata tgggtataaa tgggctcgcg ataatgtcgg gcaatcaggt 10560
gcgacaatct atcgattgta tgggaagccc gatgcgccag agttgtttct gaaacatggc 10620
aaaggtagcg ttgccaatga tgttacagat gagatggtca gactaaactg gctgacggaa 10680
tttatgcctc ttccgaccat caagcatttt atccgtactc ctgatgatgc atggttactc 10740
accactgcga tccccgggaa aacagcattc caggtattag aagaatatcc tgattcaggt 10800
gaaaatattg ttgatgcgct ggcagtgttc ctgcgccggt tgcattcgat tcctgtttgt 10860
aattgtcctt ttaacagcga tcgcgtattt cgtctcgctc aggcgcaatc acgaatgaat 10920
aacggtttgg ttgatgcgag tgattttgat gacgagcgta atggctggcc tgttgaacaa 10980
gtctggaaag aaatgcataa gcttttgcca ttctcaccgg attcagtcgt cactcatggt 11040
gatttctcac ttgataacct tatttttgac gaggggaaat taataggttg tattgatgtt 11100
ggacgagtcg gaatcgcaga ccgataccag gatcttgcca tcctatggaa ctgcctcggt 11160
gagttttctc cttcattaca gaaacggctt tttcaaaaat atggtattga taatcctgat 11220
atgaataaat tgcagtttca tttgatgctc gatgagtttt tctaagggcg gcctgccacc 11280
atacccacgc cgaaacaagc gctcatgagc ccgaagtggc gagcccgatc ttccccatcg 11340
gtgatgtcgg cgatataggc gccagcaacc gcacctgtgg cgccggtgat gagggcgcgc 11400
caagtcgacg tccggcagtc 11420
<210> 4
<211> 11171
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 4
ttggccactc cctctctgcg cgctcgctcg ctcactgagg ccgggcgacc aaaggtcgcc 60
cgacgcccgg gctttgcccg ggcggcctca gtgagcgagc gagcgcgcag agagggagtg 120
gccaactcca tcactagggg ttcctgctag ctctgggtat ttaagcccga gtgagcacgc 180
agggtctcca ttttgaagcg ggaggttacg cgttcgtcga ctactagtgg gtaccagagc 240
tccctaggtt ctagaaccgg tgacgtctcc catggtgaag cttggatctg agggcggagt 300
tagggcggag ccaatcagcg tgcgccgttc cgaaagttgc cttttatggc tgggcggaga 360
atgggcggtg aacgccgatg attatataag gacgcgccgg gtgtggcaca gctagttccg 420
tcgcagccgg gatttgggtc gcggttcttg tttgtggatc cctgtgatcg tcacttggta 480
agtcactgac tgtctatgcc tgggaaaggg tgggcaggag atggggcagt gcaggaaaag 540
tggcactatg aaccctcctg gtggcgaggg gaggggggtg gtcctcgaac gccttgcaga 600
actggcctgg atacagagtg gaccggctgg ccccatctgg aagacttcga gatacactgt 660
tgtcttactg cgctcaacag tgtatctcga agtcttccaa atggtgccag ccatcgcagc 720
ggggtgcagg aaatgggggc agcccccctt tttggctatc cttccacgtg ttcttttttg 780
tatcttttgt gtttcctaga aaacatctca gtcaccaccg cagccctagg aatgcatcta 840
gacaattgta ctaaccttct tctctttcct ctcctgacag tccggaaagc caccatgggc 900
cgctgctgct tctacaccgc cggcaccctg agcctgctgc tgctggtgac cagcgtgacc 960
ctgctggtgg cccgcgtgtt ccagaaggcc gtggaccaga gcatcgagaa gaagatcgtg 1020
ctgcgcaacg gcaccgaggc cttcgacagc tgggagaagc cccccctgcc cgtgtacacc 1080
cagttctact tcttcaacgt gaccaacccc gaggagatcc tgcgcggcga gaccccccgc 1140
gtggaggagg tgggccccta cacctaccgc gagctgcgca acaaggccaa catccagttc 1200
ggcgacaacg gcaccaccat cagcgccgtg agcaacaagg cctacgtgtt cgagcgcgac 1260
cagagcgtgg gcgaccccaa gatcgacctg atccgcaccc tgaacatccc cgtgctgacc 1320
gtgatcgagt ggagccaggt gcacttcctg cgcgagatca tcgaggccat gctgaaggcc 1380
taccagcaga agctgttcgt gacccacacc gtggacgagc tgctgtgggg ctacaaggac 1440
gagatcctga gcctgatcca cgtgttccgc cccgacatca gcccctactt cggcctgttc 1500
tacgagaaga acggcaccaa cgacggcgac tacgtgttcc tgaccggcga ggacagctac 1560
ctgaacttca ccaagatcgt ggagtggaac ggcaagacca gcctggactg gtggatcacc 1620
gacaagtgca acatgatcaa cggcaccgac ggcgacagct tccaccccct gatcaccaag 1680
gacgaggtgc tgtacgtgtt ccccagcgac ttctgccgca gcgtgtacat caccttcagc 1740
gactacgaga gcgtgcaggg cctgcccgcc ttccgctaca aggtgcccgc cgagatcctg 1800
gccaacacca gcgacaacgc cggcttctgc atccccgagg gcaactgcct gggcagcggc 1860
gtgctgaacg tgagcatctg caagaacggc gcccccatca tcatgagctt cccccacttc 1920
taccaggccg acgagcgctt cgtgagcgcc atcgagggca tgcaccccaa ccaggaggac 1980
cacgagacct tcgtggacat caaccccctg accggcatca tcctgaaggc cgccaagcgc 2040
ttccagatca acatctacgt gaagaagctg gacgacttcg tggagaccgg cgacatccgc 2100
accatggtgt tccccgtgat gtacctgaac gagagcgtgc acatcgacaa ggagaccgcc 2160
agccgcctga agagcatgat caacaccacc ctgatcatca ccaacatccc ctacatcatc 2220
atggccctgg gcgtgttctt cggcctggtg ttcacctggc tggcctgcaa gggccagggc 2280
agcatggacg agggcaccgc cgacgagcgc gcccccctga tccgcaccga gggcagagga 2340
agtcttctga catgcggaga cgtggaagag aatcccggcc ctatggaatt cagcagcccc 2400
agcagagagg aatgccccaa gcctctgagc cgggtgtcaa tcatggccgg atctctgaca 2460
ggactgctgc tgcttcaggc cgtgtcttgg gcttctggcg ctagaccttg catccccaag 2520
agcttcggct acagcagcgt cgtgtgcgtg tgcaatgcca cctactgcga cagcttcgac 2580
cctcctacct ttcctgctct gggcaccttc agcagatacg agagcaccag atccggcaga 2640
cggatggaac tgagcatggg acccatccag gccaatcaca caggcactgg cctgctgctg 2700
acactgcagc ctgagcagaa attccagaaa gtgaaaggct tcggcggagc catgacagat 2760
gccgccgctc tgaatatcct ggctctgtct ccaccagctc agaacctgct gctcaagagc 2820
tacttcagcg aggaaggcat cggctacaac atcatcagag tgcccatggc cagctgcgac 2880
ttcagcatca ggacctacac ctacgccgac acacccgacg atttccagct gcacaacttc 2940
agcctgcctg aagaggacac caagctgaag atccctctga tccacagagc cctgcagctg 3000
gcacaaagac ccgtgtcact gctggcctct ccatggacat ctcccacctg gctgaaaaca 3060
aatggcgccg tgaatggcaa gggcagcctg aaaggccaac ctggcgacat ctaccaccag 3120
acctgggcca gatacttcgt gaagttcctg gacgcctatg ccgagcacaa gctgcagttt 3180
tgggccgtga cagccgagaa cgaaccttct gctggactgc tgagcggcta cccctttcag 3240
tgcctgggct ttacacccga gcaccagcgg gactttatcg cccgtgatct gggacccaca 3300
ctggccaata gcacccacca taatgtgcgg ctgctgatgc tggacgacca gagactgctt 3360
ctgccccact gggctaaagt ggtgctgaca gatcctgagg ccgccaaata cgtgcacgga 3420
atcgccgtgc actggtatct ggactttctg gcccctgcca aggccacact gggagagaca 3480
cacagactgt tccccaacac catgctgttc gccagcgaag cctgtgtggg cagcaagttt 3540
tgggaacaga gcgtgcggct cggcagctgg gatagaggca tgcagtacag ccacagcatc 3600
atcaccaacc tgctgtacca cgtcgtcggc tggaccgact ggaatctggc cctgaatcct 3660
gaaggcggcc ctaactgggt ccgaaacttc gtggacagcc ccatcatcgt ggacatcacc 3720
aaggacacct tctacaagca gcccatgttc taccacctgg gacacttcag caagttcatc 3780
cccgagggct ctcagcgcgt tggactggtg gcttcccaga agaacgatct ggacgccgtg 3840
gctctgatgc accctgatgg atctgctgtg gtggtggtcc tgaaccgcag cagcaaagat 3900
gtgcccctga ccatcaagga tcccgccgtg ggattcctgg aaacaatcag ccctggctac 3960
tccatccaca cctacctgtg gcgtagacag tgacaattgt taattaagtt taaaccctcg 4020
aggccgcaag ccgcatcgat accgtcgact agagctcgct gatcagcctc gactgtgcct 4080
tctagttgcc agccatctgt tgtttgcccc tcccccgtgc cttccttgac cctggaaggt 4140
gccactccca ctgtcctttc ctaataaaat gaggaaattg catcgcattg tctgagtagg 4200
tgtcattcta ttctgggggg tggggtgggg caggacagca agggggagga ttgggaagac 4260
aatagcaggc atgctgggga gagatccacg ataacaaaca gcttttttgg ggtgaacata 4320
ttgactgaat tccctgcagg ttggccactc cctctctgcg cgctcgctcg ctcactgagg 4380
ccgcccgggc aaagcccggg cgtcgggcga cctttggtcg cccggcctca gtgagcgagc 4440
gagcgcgcag agagggagtg gccaactcca tcactagggg ttcctgcggc cgctcgtacg 4500
gtctcgagga attcctgcag gataacttgc caacctcatt ctaaaatgta tatagaagcc 4560
caaaagacaa taacaaaaat attcttgtag aacaaaatgg gaaagaatgt tccactaaat 4620
atcaagattt agagcaaagc atgagatgtg tggggataga cagtgaggct gataaaatag 4680
agtagagctc agaaacagac ccattgatat atgtaagtga cctatgaaaa aaatatggca 4740
ttttacaatg ggaaaatgat ggtctttttc ttttttagaa aaacagggaa atatatttat 4800
atgtaaaaaa taaaagggaa cccatatgtc ataccataca cacaaaaaaa ttccagtgaa 4860
ttataagtct aaatggagaa ggcaaaactt taaatctttt agaaaataat atagaagcat 4920
gcagaccagc ctggccaaca tgatgaaacc ctctctacta ataataaaat cagtagaact 4980
actcaggact actttgagtg ggaagtcctt ttctatgaag acttctttgg ccaaaattag 5040
gctctaaatg caaggagata gtgcatcatg cctggctgca cttactgata aatgatgtta 5100
tcaccatctt taaccaaatg cacaggaaca agttatggta ctgatgtgct ggattgagaa 5160
ggagctctac ttccttgaca ggacacattt gtatcaactt aaaaaagcag atttttgcca 5220
gcagaactat tcattcagag gtaggaaact tagaatagat gatgtcactg attagcatgg 5280
cttccccatc tccacagctg cttcccaccc aggttgccca cagttgagtt tgtccagtgc 5340
tcagggctgc ccactctcag taagaagccc cacaccagcc cctctccaaa tatgttggct 5400
gttccttcca ttaaagtgac cccactttag agcagcaagt ggatttctgt ttcttacagt 5460
tcaggaagga ggagtcagct gtgagaacct ggagcctgag atgcttctaa gtcccactgc 5520
tactggggtc agggaagcca gactccagca tcagcagtca ggagcactaa gcccttgcca 5580
acatcctgtt tctcagagaa actgcttcca ttataatggt tgtccttttt taagctatca 5640
agccaaacaa ccagtgtcta ccattattct catcacctga agccaagggt tctagcaaaa 5700
gtcaagctgt cttgtaatgg ttgatgtgcc tccagcttct gtcttcagtc actccactct 5760
tagcctgctc tgaatcaact ctgaccacag ttccctggag cccctgccac ctgctgcccc 5820
tgccaccttc tccatctgca gtgctgtgca gccttctgca ctcttgcaga gctaataggt 5880
ggagacttga aggaagagga ggaaagtttc tcataatagc cttgctgcaa gctcaaatgg 5940
gaggtgggca ctgtgcccag gagccttgga gcaaaggctg tgcccaacct ctgactgcat 6000
ccaggtttgg tcttgacaga gataagaagc cctggctttt ggagccaaaa tctaggtcag 6060
acttaggcag gattctcaaa gtttatcagc agaacatgag gcagaagacc ctttctgctc 6120
cagcttcttc aggctcaacc ttcatcagaa tagatagaaa gagaggctgt gagggttctt 6180
aaaacagaag caaatctgac tcagagaata aacaacctcc tagtaaacta cagcttagac 6240
agagcatctg gtggtgagtg tgctcagtgt cctactcaac tgtctggtat cagccctcat 6300
gaggacttct cttctttccc tcatagacct ccatctctgt tttccttagc ctgcagaaat 6360
ctggatggct attcacagaa tgcctgtgct ttcagagttg cattttttct ctggtattct 6420
ggttcaagca tttgaaggta ggaaaggttc tccaagtgca agaaagccag ccctgagcct 6480
caactgcctg gctagtgtgg tcagtaggat gcaaaggctg ttgaatgcca caaggccaaa 6540
ctttaacctg tgtaccacaa gcctagcagc agaggcagct ctgctcactg gaactctctg 6600
tcttctttct cctgagcctt ttcttttcct gagttttcta gctctcctca accttacctc 6660
tgccctaccc aggacaaacc caagagccac tgtttctgtg atgtcctctc cagccctaat 6720
taggcatcat gacttcagcc tgaccttcca tgctcagaag cagtgctaat ccacttcaga 6780
tgagctgctc tatgcaacac aggcagagcc tacaaacctt tgcaccagag ccctccacat 6840
atcagtgttt gttcatactc acttcaacag caaatgtgac tgctgagatt aagattttac 6900
acaagatggt ctgtaatttc acagttagtt ttatcccatt aggtatgaaa gaattagcat 6960
aattcccctt aaacatgaat gaatcttaga ttttttaata aatagttttg gaagtaaaga 7020
cagagacatc aggagcacaa ggaatagcct gagaggacaa acagaacaag aaagagtctg 7080
gaaatacaca ggatgttctt ggcctcctca aagcaagtgc aagcagatag taccagcagc 7140
cccaggctat cagagcccag tgaagagaag taccatgaaa gccacagctc taaccaccct 7200
gttccagagt gacagacagt ccccaagaca agccagcctg agccagagag agaactgcaa 7260
gagaaagttt ctaatttagg ttctgttaga ttcagacaag tgcaggtcat cctctctcca 7320
cagctactca cctctccagc ctaacaaagc ctgcagtcca cactccaacc ctggtgtctc 7380
acctcctagc ctctcccaac atcctgctct ctgaccatct tctgcatctc tcatctcacc 7440
atctcccact gtctacagcc tactcttgca actaccatct cattttctga catcctgtct 7500
acatcttctg ccatactctg ccatctacca taccacctct taccatctac cacaccatct 7560
tttatctcca tccctctcag aagcctccaa gctgaatcct gctttatgtg ttcatctcag 7620
cccctgcatg gaaagctgac cccagaggca gaactattcc cagagagctt ggccaagaaa 7680
aacaaaacta ccagcctggc caggctcagg agtagtaagc tgcagtgtct gttgtgttct 7740
agcttcaaca gctgcaggag ttccactctc aaatgctcca catttctcac atcctcctga 7800
ttctggtcac tacccatctt caaagaacag aatatctcac atcagcatac tgtgaaggac 7860
tagtcatggg tgcagctgct cagagctgca aagtcattct ggatggtgga gagcttacaa 7920
acatttcatg atgctccccc cgctctgatg gctggagccc aatccctaca cagactcctg 7980
ctgtatgtgt tttcctttca ctctgagcca cagccagagg gcaggcattc agtctcctct 8040
tcaggctggg gctggggcac tgagaactca cccaacacct tgctctcact ccttctgcaa 8100
aacaagaaag agctttgtgc tgcagtagcc atgaagaatg aaaggaaggc tttaactaaa 8160
aaatgtcaga gattattttc aaccccttac tgtggatcac cagcaaggag gaaacacaac 8220
acagagacat tttttcccct caaattatca aaagaatcac tgcatttgtt aaagagagca 8280
actgaatcag gaagcagagt tttgaacata tcagaagtta ggaatctgca tcagagacaa 8340
atgcagtcat ggttgtttgc tgcataccag ccctaatcat tagaagcctc atggacttca 8400
aacatcattc cctctgacaa gatgctctag cctaactcca tgagataaaa taaatctgcc 8460
tttcagagcc aaagaagagt ccaccagctt cttctcagtg tgaacaagag ctccagtcag 8520
gttagtcagt ccagtgcagt agaggagacc agtctgcatc ctctaatttt caaaggcaag 8580
aagatttgtt taccctggac accaggcaca agtgaggtca cagagctctt agatatgcag 8640
tcctcatgag tgaggagact aaagcgcatg ccatcaagac ttcagtgtag agaaaacctc 8700
caaaaaagcc tcctcactac ttctggaata gctcagaggc cgaggcggcc tcggcctctg 8760
cataaataaa aaaaattagt cagccatggg gcggagaatg ggcggaactg ggcggagtta 8820
ggggcgggat gggcggagtt aggggcggga ctatggttgc tgactaattg agatgcatgc 8880
tttgcatact tctgcctgct ggggagcctg gggactttcc acacctggtt gctgactaat 8940
tgagatgcat gctttgcata cttctgcctg ctggggagcc tggggacttt ccacacccta 9000
actgacacac attccacagc tgcattaatg aatcggccaa cgcgcgggga gaggcggttt 9060
gcgtattggg cgctcttccg cttcctcgct cactgactcg ctgcgctcgg tcgttcggct 9120
gcggcgagcg gtatcagctc actcaaaggc ggtaatacgg ttatccacag aatcagggga 9180
taacgcagga aagaacatgt gagcaaaagg ccagcaaaag gccaggaacc gtaaaaaggc 9240
cgcgttgctg gcgtttttcc ataggctccg cccccctgac gagcatcaca aaaatcgacg 9300
ctcaagtcag aggtggcgaa acccgacagg actataaaga taccaggcgt ttccccctgg 9360
aagctccctc gtgcgctctc ctgttccgac cctgccgctt accggatacc tgtccgcctt 9420
tctcccttcg ggaagcgtgg cgctttctca tagctcacgc tgtaggtatc tcagttcggt 9480
gtaggtcgtt cgctccaagc tgggctgtgt gcacgaaccc cccgttcagc ccgaccgctg 9540
cgccttatcc ggtaactatc gtcttgagtc caacccggta agacacgact tatcgccact 9600
ggcagcagcc actggtaaca ggattagcag agcgaggtat gtaggcggtg ctacagagtt 9660
cttgaagtgg tggcctaact acggctacac tagaagaaca gtatttggta tctgcgctct 9720
gctgaagcca gttaccttcg gaaaaagagt tggtagctct tgatccggca aacaaaccac 9780
cgctggtagc ggtggttttt ttgtttgcaa gcagcagatt acgcgcagaa aaaaaggatc 9840
tcaagaagat cctttgatct tttctacggg gtctgacgct cagtggaacg aaaactcacg 9900
ttaagggatt ttggtcatga gattatcaaa aaggatcttc acctagatcc ttttaaatta 9960
aaaatgaagt tttaaatcaa tctaaagtat atatgagtaa acttggtctg acagttacca 10020
atgcttaatc agtgaggcac ctatctcagc gatctgtcta tttcgttcat ccatagttgc 10080
ctgactcctg caaaccacgt tgtgtctcaa aatctctgat gttacattgc acaagataaa 10140
aatatatcat catgaacaat aaaactgtct gcttacataa acagtaatac aaggggtgtt 10200
atgagccata ttcaacggga aacgtcttgc tcgaggccgc gattaaattc caacatggat 10260
gctgatttat atgggtataa atgggctcgc gataatgtcg ggcaatcagg tgcgacaatc 10320
tatcgattgt atgggaagcc cgatgcgcca gagttgtttc tgaaacatgg caaaggtagc 10380
gttgccaatg atgttacaga tgagatggtc agactaaact ggctgacgga atttatgcct 10440
cttccgacca tcaagcattt tatccgtact cctgatgatg catggttact caccactgcg 10500
atccccggga aaacagcatt ccaggtatta gaagaatatc ctgattcagg tgaaaatatt 10560
gttgatgcgc tggcagtgtt cctgcgccgg ttgcattcga ttcctgtttg taattgtcct 10620
tttaacagcg atcgcgtatt tcgtctcgct caggcgcaat cacgaatgaa taacggtttg 10680
gttgatgcga gtgattttga tgacgagcgt aatggctggc ctgttgaaca agtctggaaa 10740
gaaatgcata agcttttgcc attctcaccg gattcagtcg tcactcatgg tgatttctca 10800
cttgataacc ttatttttga cgaggggaaa ttaataggtt gtattgatgt tggacgagtc 10860
ggaatcgcag accgatacca ggatcttgcc atcctatgga actgcctcgg tgagttttct 10920
ccttcattac agaaacggct ttttcaaaaa tatggtattg ataatcctga tatgaataaa 10980
ttgcagtttc atttgatgct cgatgagttt ttctaagggc ggcctgccac catacccacg 11040
ccgaaacaag cgctcatgag cccgaagtgg cgagcccgat cttccccatc ggtgatgtcg 11100
gcgatatagg cgccagcaac cgcacctgtg gcgccggtga tgagggcgcg ccaagtcgac 11160
gtccggcagt c 11171
<210> 5
<211> 11309
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 5
ttggccactc cctctctgcg cgctcgctcg ctcactgagg ccgggcgacc aaaggtcgcc 60
cgacgcccgg gctttgcccg ggcggcctca gtgagcgagc gagcgcgcag agagggagtg 120
gccaactcca tcactagggg ttcctgctag ctctgggtat ttaagcccga gtgagcacgc 180
agggtctcca ttttgaagcg ggaggttacg cgttcgtcga ctactagtgg gtaccagagc 240
tccctaggtt ctagaaccgg tgacgtctcc catggtgaag cttggatctg agggcggagt 300
tagggcggag ccaatcagcg tgcgccgttc cgaaagttgc cttttatggc tgggcggaga 360
atgggcggtg aacgccgatg attatataag gacgcgccgg gtgtggcaca gctagttccg 420
tcgcagccgg gatttgggtc gcggttcttg tttgtggatc cctgtgatcg tcacttggta 480
agtcactgac tgtctatgcc tgggaaaggg tgggcaggag atggggcagt gcaggaaaag 540
tggcactatg aaccctcctg gtggcgaggg gaggggggtg gtcctcgaac gccttgcaga 600
actggcctgg atacagagtg gaccggctgg ccccatctgg aagacttcga gatacactgt 660
tgtcttactg cgctcaacag tgtatctcga agtcttccaa atggtgccag ccatcgcagc 720
ggggtgcagg aaatgggggc agcccccctt tttggctatc cttccacgtg ttcttttttg 780
tatcttttgt gtttcctaga aaacatctca gtcaccaccg cagccctagg aatgcatcta 840
gacaattgta ctaaccttct tctctttcct ctcctgacag tccggaaagc caccatgtac 900
gccctgttcc tgctggccag cctgctgggc gccgccctgg ccggccccgt gctgggcctg 960
aaggagtgca cccgcggcag cgccgtgtgg tgccagaacg tgaagaccgc cagcgactgc 1020
ggcgccgtga agcactgcct gcagaccgtg tggaacaagc ccaccgtgaa gagcctgccc 1080
tgcgacatct gcaaggacgt ggtgaccgcc gccggcgaca tgctgaagga caacgccacc 1140
gaggaggaga tcctggtgta cctggagaag acctgcgact ggctgcccaa gcccaacatg 1200
agcgccagct gcaaggagat cgtggacagc tacctgcccg tgatcctgga catcatcaag 1260
ggcgagatga gccgccccgg cgaggtgtgc agcgccctga acctgtgcga gagcctgcag 1320
aagcacctgg ccgagctgaa ccaccagaag cagctggaga gcaacaagat ccccgagctg 1380
gacatgaccg aggtggtggc ccccttcatg gccaacatcc ccctgctgct gtacccccag 1440
gacggccccc gcagcaagcc ccagcccaag gacaacggcg acgtgtgcca ggactgcatc 1500
cagatggtga ccgacatcca gaccgccgtg cgcaccaaca gcaccttcgt gcaggccctg 1560
gtggagcacg tgaaggagga gtgcgaccgc ctgggccccg gcatggccga catctgcaag 1620
aactacatca gccagtacag cgagatcgcc atccagatga tgatgcacat gcagcccaag 1680
gagatctgcg ccctggtggg cttctgcgac gaggtgaagg agatgcccat gcagaccctg 1740
gtgcccgcca aggtggccag caagaacgtg atccccgccc tggagctggt ggagcccatc 1800
aagaagcacg aggtgcccgc caagagcgac gtgtactgcg aggtgtgcga gttcctggtg 1860
aaggaggtga ccaagctgat cgacaacaac aagaccgaga aggagatcct ggacgccttc 1920
gacaagatgt gcagcaagct gcccaagagc ctgagcgagg agtgccagga ggtggtggac 1980
acctacggca gcagcatcct gagcatcctg ctggaggagg tgagccccga gctggtgtgc 2040
agcatgctgc acctgtgcag cggcacccgc ctgcccgccc tgaccgtgca cgtgacccag 2100
cccaaggacg gcggcttctg cgaggtgtgc aagaagctgg tgggctacct ggaccgcaac 2160
ctggagaaga acagcaccaa gcaggagatc ctggccgccc tggagaaggg ctgcagcttc 2220
ctgcccgacc cctaccagaa gcagtgcgac cagttcgtgg ccgagtacga gcccgtgctg 2280
atcgagatcc tggtggaggt gatggacccc agcttcgtgt gcctgaagat cggcgcctgc 2340
cccagcgccc acaagcccct gctgggcacc gagaagtgca tctggggccc cagctactgg 2400
tgccagaaca ccgagaccgc cgcccagtgc aacgccgtgg agcactgcaa gcgccacgtg 2460
tggaacgagg gcagaggaag tcttctgaca tgcggagacg tggaagagaa tcccggccct 2520
atggaattca gcagccccag cagagaggaa tgccccaagc ctctgagccg ggtgtcaatc 2580
atggccggat ctctgacagg actgctgctg cttcaggccg tgtcttgggc ttctggcgct 2640
agaccttgca tccccaagag cttcggctac agcagcgtcg tgtgcgtgtg caatgccacc 2700
tactgcgaca gcttcgaccc tcctaccttt cctgctctgg gcaccttcag cagatacgag 2760
agcaccagat ccggcagacg gatggaactg agcatgggac ccatccaggc caatcacaca 2820
ggcactggcc tgctgctgac actgcagcct gagcagaaat tccagaaagt gaaaggcttc 2880
ggcggagcca tgacagatgc cgccgctctg aatatcctgg ctctgtctcc accagctcag 2940
aacctgctgc tcaagagcta cttcagcgag gaaggcatcg gctacaacat catcagagtg 3000
cccatggcca gctgcgactt cagcatcagg acctacacct acgccgacac acccgacgat 3060
ttccagctgc acaacttcag cctgcctgaa gaggacacca agctgaagat ccctctgatc 3120
cacagagccc tgcagctggc acaaagaccc gtgtcactgc tggcctctcc atggacatct 3180
cccacctggc tgaaaacaaa tggcgccgtg aatggcaagg gcagcctgaa aggccaacct 3240
ggcgacatct accaccagac ctgggccaga tacttcgtga agttcctgga cgcctatgcc 3300
gagcacaagc tgcagttttg ggccgtgaca gccgagaacg aaccttctgc tggactgctg 3360
agcggctacc cctttcagtg cctgggcttt acacccgagc accagcggga ctttatcgcc 3420
cgtgatctgg gacccacact ggccaatagc acccaccata atgtgcggct gctgatgctg 3480
gacgaccaga gactgcttct gccccactgg gctaaagtgg tgctgacaga tcctgaggcc 3540
gccaaatacg tgcacggaat cgccgtgcac tggtatctgg actttctggc ccctgccaag 3600
gccacactgg gagagacaca cagactgttc cccaacacca tgctgttcgc cagcgaagcc 3660
tgtgtgggca gcaagttttg ggaacagagc gtgcggctcg gcagctggga tagaggcatg 3720
cagtacagcc acagcatcat caccaacctg ctgtaccacg tcgtcggctg gaccgactgg 3780
aatctggccc tgaatcctga aggcggccct aactgggtcc gaaacttcgt ggacagcccc 3840
atcatcgtgg acatcaccaa ggacaccttc tacaagcagc ccatgttcta ccacctggga 3900
cacttcagca agttcatccc cgagggctct cagcgcgttg gactggtggc ttcccagaag 3960
aacgatctgg acgccgtggc tctgatgcac cctgatggat ctgctgtggt ggtggtcctg 4020
aaccgcagca gcaaagatgt gcccctgacc atcaaggatc ccgccgtggg attcctggaa 4080
acaatcagcc ctggctactc catccacacc tacctgtggc gtagacagtg acaattgtta 4140
attaagttta aaccctcgag gccgcaagcc gcatcgatac cgtcgactag agctcgctga 4200
tcagcctcga ctgtgccttc tagttgccag ccatctgttg tttgcccctc ccccgtgcct 4260
tccttgaccc tggaaggtgc cactcccact gtcctttcct aataaaatga ggaaattgca 4320
tcgcattgtc tgagtaggtg tcattctatt ctggggggtg gggtggggca ggacagcaag 4380
ggggaggatt gggaagacaa tagcaggcat gctggggaga gatccacgat aacaaacagc 4440
ttttttgggg tgaacatatt gactgaattc cctgcaggtt ggccactccc tctctgcgcg 4500
ctcgctcgct cactgaggcc gcccgggcaa agcccgggcg tcgggcgacc tttggtcgcc 4560
cggcctcagt gagcgagcga gcgcgcagag agggagtggc caactccatc actaggggtt 4620
cctgcggccg ctcgtacggt ctcgaggaat tcctgcagga taacttgcca acctcattct 4680
aaaatgtata tagaagccca aaagacaata acaaaaatat tcttgtagaa caaaatggga 4740
aagaatgttc cactaaatat caagatttag agcaaagcat gagatgtgtg gggatagaca 4800
gtgaggctga taaaatagag tagagctcag aaacagaccc attgatatat gtaagtgacc 4860
tatgaaaaaa atatggcatt ttacaatggg aaaatgatgg tctttttctt ttttagaaaa 4920
acagggaaat atatttatat gtaaaaaata aaagggaacc catatgtcat accatacaca 4980
caaaaaaatt ccagtgaatt ataagtctaa atggagaagg caaaacttta aatcttttag 5040
aaaataatat agaagcatgc agaccagcct ggccaacatg atgaaaccct ctctactaat 5100
aataaaatca gtagaactac tcaggactac tttgagtggg aagtcctttt ctatgaagac 5160
ttctttggcc aaaattaggc tctaaatgca aggagatagt gcatcatgcc tggctgcact 5220
tactgataaa tgatgttatc accatcttta accaaatgca caggaacaag ttatggtact 5280
gatgtgctgg attgagaagg agctctactt ccttgacagg acacatttgt atcaacttaa 5340
aaaagcagat ttttgccagc agaactattc attcagaggt aggaaactta gaatagatga 5400
tgtcactgat tagcatggct tccccatctc cacagctgct tcccacccag gttgcccaca 5460
gttgagtttg tccagtgctc agggctgccc actctcagta agaagcccca caccagcccc 5520
tctccaaata tgttggctgt tccttccatt aaagtgaccc cactttagag cagcaagtgg 5580
atttctgttt cttacagttc aggaaggagg agtcagctgt gagaacctgg agcctgagat 5640
gcttctaagt cccactgcta ctggggtcag ggaagccaga ctccagcatc agcagtcagg 5700
agcactaagc ccttgccaac atcctgtttc tcagagaaac tgcttccatt ataatggttg 5760
tcctttttta agctatcaag ccaaacaacc agtgtctacc attattctca tcacctgaag 5820
ccaagggttc tagcaaaagt caagctgtct tgtaatggtt gatgtgcctc cagcttctgt 5880
cttcagtcac tccactctta gcctgctctg aatcaactct gaccacagtt ccctggagcc 5940
cctgccacct gctgcccctg ccaccttctc catctgcagt gctgtgcagc cttctgcact 6000
cttgcagagc taataggtgg agacttgaag gaagaggagg aaagtttctc ataatagcct 6060
tgctgcaagc tcaaatggga ggtgggcact gtgcccagga gccttggagc aaaggctgtg 6120
cccaacctct gactgcatcc aggtttggtc ttgacagaga taagaagccc tggcttttgg 6180
agccaaaatc taggtcagac ttaggcagga ttctcaaagt ttatcagcag aacatgaggc 6240
agaagaccct ttctgctcca gcttcttcag gctcaacctt catcagaata gatagaaaga 6300
gaggctgtga gggttcttaa aacagaagca aatctgactc agagaataaa caacctccta 6360
gtaaactaca gcttagacag agcatctggt ggtgagtgtg ctcagtgtcc tactcaactg 6420
tctggtatca gccctcatga ggacttctct tctttccctc atagacctcc atctctgttt 6480
tccttagcct gcagaaatct ggatggctat tcacagaatg cctgtgcttt cagagttgca 6540
ttttttctct ggtattctgg ttcaagcatt tgaaggtagg aaaggttctc caagtgcaag 6600
aaagccagcc ctgagcctca actgcctggc tagtgtggtc agtaggatgc aaaggctgtt 6660
gaatgccaca aggccaaact ttaacctgtg taccacaagc ctagcagcag aggcagctct 6720
gctcactgga actctctgtc ttctttctcc tgagcctttt cttttcctga gttttctagc 6780
tctcctcaac cttacctctg ccctacccag gacaaaccca agagccactg tttctgtgat 6840
gtcctctcca gccctaatta ggcatcatga cttcagcctg accttccatg ctcagaagca 6900
gtgctaatcc acttcagatg agctgctcta tgcaacacag gcagagccta caaacctttg 6960
caccagagcc ctccacatat cagtgtttgt tcatactcac ttcaacagca aatgtgactg 7020
ctgagattaa gattttacac aagatggtct gtaatttcac agttagtttt atcccattag 7080
gtatgaaaga attagcataa ttccccttaa acatgaatga atcttagatt ttttaataaa 7140
tagttttgga agtaaagaca gagacatcag gagcacaagg aatagcctga gaggacaaac 7200
agaacaagaa agagtctgga aatacacagg atgttcttgg cctcctcaaa gcaagtgcaa 7260
gcagatagta ccagcagccc caggctatca gagcccagtg aagagaagta ccatgaaagc 7320
cacagctcta accaccctgt tccagagtga cagacagtcc ccaagacaag ccagcctgag 7380
ccagagagag aactgcaaga gaaagtttct aatttaggtt ctgttagatt cagacaagtg 7440
caggtcatcc tctctccaca gctactcacc tctccagcct aacaaagcct gcagtccaca 7500
ctccaaccct ggtgtctcac ctcctagcct ctcccaacat cctgctctct gaccatcttc 7560
tgcatctctc atctcaccat ctcccactgt ctacagccta ctcttgcaac taccatctca 7620
ttttctgaca tcctgtctac atcttctgcc atactctgcc atctaccata ccacctctta 7680
ccatctacca caccatcttt tatctccatc cctctcagaa gcctccaagc tgaatcctgc 7740
tttatgtgtt catctcagcc cctgcatgga aagctgaccc cagaggcaga actattccca 7800
gagagcttgg ccaagaaaaa caaaactacc agcctggcca ggctcaggag tagtaagctg 7860
cagtgtctgt tgtgttctag cttcaacagc tgcaggagtt ccactctcaa atgctccaca 7920
tttctcacat cctcctgatt ctggtcacta cccatcttca aagaacagaa tatctcacat 7980
cagcatactg tgaaggacta gtcatgggtg cagctgctca gagctgcaaa gtcattctgg 8040
atggtggaga gcttacaaac atttcatgat gctccccccg ctctgatggc tggagcccaa 8100
tccctacaca gactcctgct gtatgtgttt tcctttcact ctgagccaca gccagagggc 8160
aggcattcag tctcctcttc aggctggggc tggggcactg agaactcacc caacaccttg 8220
ctctcactcc ttctgcaaaa caagaaagag ctttgtgctg cagtagccat gaagaatgaa 8280
aggaaggctt taactaaaaa atgtcagaga ttattttcaa ccccttactg tggatcacca 8340
gcaaggagga aacacaacac agagacattt tttcccctca aattatcaaa agaatcactg 8400
catttgttaa agagagcaac tgaatcagga agcagagttt tgaacatatc agaagttagg 8460
aatctgcatc agagacaaat gcagtcatgg ttgtttgctg cataccagcc ctaatcatta 8520
gaagcctcat ggacttcaaa catcattccc tctgacaaga tgctctagcc taactccatg 8580
agataaaata aatctgcctt tcagagccaa agaagagtcc accagcttct tctcagtgtg 8640
aacaagagct ccagtcaggt tagtcagtcc agtgcagtag aggagaccag tctgcatcct 8700
ctaattttca aaggcaagaa gatttgttta ccctggacac caggcacaag tgaggtcaca 8760
gagctcttag atatgcagtc ctcatgagtg aggagactaa agcgcatgcc atcaagactt 8820
cagtgtagag aaaacctcca aaaaagcctc ctcactactt ctggaatagc tcagaggccg 8880
aggcggcctc ggcctctgca taaataaaaa aaattagtca gccatggggc ggagaatggg 8940
cggaactggg cggagttagg ggcgggatgg gcggagttag gggcgggact atggttgctg 9000
actaattgag atgcatgctt tgcatacttc tgcctgctgg ggagcctggg gactttccac 9060
acctggttgc tgactaattg agatgcatgc tttgcatact tctgcctgct ggggagcctg 9120
gggactttcc acaccctaac tgacacacat tccacagctg cattaatgaa tcggccaacg 9180
cgcggggaga ggcggtttgc gtattgggcg ctcttccgct tcctcgctca ctgactcgct 9240
gcgctcggtc gttcggctgc ggcgagcggt atcagctcac tcaaaggcgg taatacggtt 9300
atccacagaa tcaggggata acgcaggaaa gaacatgtga gcaaaaggcc agcaaaaggc 9360
caggaaccgt aaaaaggccg cgttgctggc gtttttccat aggctccgcc cccctgacga 9420
gcatcacaaa aatcgacgct caagtcagag gtggcgaaac ccgacaggac tataaagata 9480
ccaggcgttt ccccctggaa gctccctcgt gcgctctcct gttccgaccc tgccgcttac 9540
cggatacctg tccgcctttc tcccttcggg aagcgtggcg ctttctcata gctcacgctg 9600
taggtatctc agttcggtgt aggtcgttcg ctccaagctg ggctgtgtgc acgaaccccc 9660
cgttcagccc gaccgctgcg ccttatccgg taactatcgt cttgagtcca acccggtaag 9720
acacgactta tcgccactgg cagcagccac tggtaacagg attagcagag cgaggtatgt 9780
aggcggtgct acagagttct tgaagtggtg gcctaactac ggctacacta gaagaacagt 9840
atttggtatc tgcgctctgc tgaagccagt taccttcgga aaaagagttg gtagctcttg 9900
atccggcaaa caaaccaccg ctggtagcgg tggttttttt gtttgcaagc agcagattac 9960
gcgcagaaaa aaaggatctc aagaagatcc tttgatcttt tctacggggt ctgacgctca 10020
gtggaacgaa aactcacgtt aagggatttt ggtcatgaga ttatcaaaaa ggatcttcac 10080
ctagatcctt ttaaattaaa aatgaagttt taaatcaatc taaagtatat atgagtaaac 10140
ttggtctgac agttaccaat gcttaatcag tgaggcacct atctcagcga tctgtctatt 10200
tcgttcatcc atagttgcct gactcctgca aaccacgttg tgtctcaaaa tctctgatgt 10260
tacattgcac aagataaaaa tatatcatca tgaacaataa aactgtctgc ttacataaac 10320
agtaatacaa ggggtgttat gagccatatt caacgggaaa cgtcttgctc gaggccgcga 10380
ttaaattcca acatggatgc tgatttatat gggtataaat gggctcgcga taatgtcggg 10440
caatcaggtg cgacaatcta tcgattgtat gggaagcccg atgcgccaga gttgtttctg 10500
aaacatggca aaggtagcgt tgccaatgat gttacagatg agatggtcag actaaactgg 10560
ctgacggaat ttatgcctct tccgaccatc aagcatttta tccgtactcc tgatgatgca 10620
tggttactca ccactgcgat ccccgggaaa acagcattcc aggtattaga agaatatcct 10680
gattcaggtg aaaatattgt tgatgcgctg gcagtgttcc tgcgccggtt gcattcgatt 10740
cctgtttgta attgtccttt taacagcgat cgcgtatttc gtctcgctca ggcgcaatca 10800
cgaatgaata acggtttggt tgatgcgagt gattttgatg acgagcgtaa tggctggcct 10860
gttgaacaag tctggaaaga aatgcataag cttttgccat tctcaccgga ttcagtcgtc 10920
actcatggtg atttctcact tgataacctt atttttgacg aggggaaatt aataggttgt 10980
attgatgttg gacgagtcgg aatcgcagac cgataccagg atcttgccat cctatggaac 11040
tgcctcggtg agttttctcc ttcattacag aaacggcttt ttcaaaaata tggtattgat 11100
aatcctgata tgaataaatt gcagtttcat ttgatgctcg atgagttttt ctaagggcgg 11160
cctgccacca tacccacgcc gaaacaagcg ctcatgagcc cgaagtggcg agcccgatct 11220
tccccatcgg tgatgtcggc gatataggcg ccagcaaccg cacctgtggc gccggtgatg 11280
agggcgcgcc aagtcgacgt ccggcagtc 11309
<210> 6
<211> 11293
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 6
ttggccactc cctctctgcg cgctcgctcg ctcactgagg ccgggcgacc aaaggtcgcc 60
cgacgcccgg gctttgcccg ggcggcctca gtgagcgagc gagcgcgcag agagggagtg 120
gccaactcca tcactagggg ttcctgctag ctctgggtat ttaagcccga gtgagcacgc 180
agggtctcca ttttgaagcg ggaggttacg cgttcgtcga ctactagtgg gtaccagagc 240
tccctaggtt ctagaaccgg tgacgtctcc catggtgaag cttggatctg agggcggagt 300
tagggcggag ccaatcagcg tgcgccgttc cgaaagttgc cttttatggc tgggcggaga 360
atgggcggtg aacgccgatg attatataag gacgcgccgg gtgtggcaca gctagttccg 420
tcgcagccgg gatttgggtc gcggttcttg tttgtggatc cctgtgatcg tcacttggta 480
agtcactgac tgtctatgcc tgggaaaggg tgggcaggag atggggcagt gcaggaaaag 540
tggcactatg aaccctgcag ccctaggaat gcatctagac aattgtacta accttcttct 600
ctttcctctc ctgacagtcc ggaaagccac catgtacgcc ctgttcctgc tggccagcct 660
gctgggcgcc gccctggccg gccccgtgct gggcctgaag gagtgcaccc gcggcagcgc 720
cgtgtggtgc cagaacgtga agaccgccag cgactgcggc gccgtgaagc actgcctgca 780
gaccgtgtgg aacaagccca ccgtgaagag cctgccctgc gacatctgca aggacgtggt 840
gaccgccgcc ggcgacatgc tgaaggacaa cgccaccgag gaggagatcc tggtgtacct 900
ggagaagacc tgcgactggc tgcccaagcc caacatgagc gccagctgca aggagatcgt 960
ggacagctac ctgcccgtga tcctggacat catcaagggc gagatgagcc gccccggcga 1020
ggtgtgcagc gccctgaacc tgtgcgagag cctgcagaag cacctggccg agctgaacca 1080
ccagaagcag ctggagagca acaagatccc cgagctggac atgaccgagg tggtggcccc 1140
cttcatggcc aacatccccc tgctgctgta cccccaggac ggcccccgca gcaagcccca 1200
gcccaaggac aacggcgacg tgtgccagga ctgcatccag atggtgaccg acatccagac 1260
cgccgtgcgc accaacagca ccttcgtgca ggccctggtg gagcacgtga aggaggagtg 1320
cgaccgcctg ggccccggca tggccgacat ctgcaagaac tacatcagcc agtacagcga 1380
gatcgccatc cagatgatga tgcacatgca gcccaaggag atctgcgccc tggtgggctt 1440
ctgcgacgag gtgaaggaga tgcccatgca gaccctggtg cccgccaagg tggccagcaa 1500
gaacgtgatc cccgccctgg agctggtgga gcccatcaag aagcacgagg tgcccgccaa 1560
gagcgacgtg tactgcgagg tgtgcgagtt cctggtgaag gaggtgacca agctgatcga 1620
caacaacaag accgagaagg agatcctgga cgccttcgac aagatgtgca gcaagctgcc 1680
caagagcctg agcgaggagt gccaggaggt ggtggacacc tacggcagca gcatcctgag 1740
catcctgctg gaggaggtga gccccgagct ggtgtgcagc atgctgcacc tgtgcagcgg 1800
cacccgcctg cccgccctga ccgtgcacgt gacccagccc aaggacggcg gcttctgcga 1860
ggtgtgcaag aagctggtgg gctacctgga ccgcaacctg gagaagaaca gcaccaagca 1920
ggagatcctg gccgccctgg agaagggctg cagcttcctg cccgacccct accagaagca 1980
gtgcgaccag ttcgtggccg agtacgagcc cgtgctgatc gagatcctgg tggaggtgat 2040
ggaccccagc ttcgtgtgcc tgaagatcgg cgcctgcccc agcgcccaca agcccctgct 2100
gggcaccgag aagtgcatct ggggccccag ctactggtgc cagaacaccg agaccgccgc 2160
ccagtgcaac gccgtggagc actgcaagcg ccacgtgtgg aactgattgt ggccgaaccg 2220
ccgaactcag aggccggccc cagaaaaccc gagcgagtag ggggcggcgc gcaggaggga 2280
ggagaactgg gggcgcggga ggctggtggg tgtggggggt ggagatgtag aagatgtgac 2340
gccgcggccc ggcgggtgcc agattagcgg acgcggtgcc cgcggttgca acgggatccc 2400
gggcgctgca gcttgggagg cggctctccc caggcggcgt ccgcggagac acccatccgt 2460
gaaccccagg tcccgggccg ccggctcgcc gcgcaccagg ggccggcgga cagaagagcg 2520
gccgagcggc tcgaggctgg gggaccgcgg gcgcggccgc gcgctgccgg gcgggaggct 2580
ggggggccgg ggccggggcc gtgccccgga gcgggtcgga ggccggggcc ggggccgggg 2640
gacggcggct ccccgcgcgg ctccagcggc tcggggatcc cggccgggcc ccgcagggac 2700
catgatggaa ttcagcagcc ccagcagaga ggaatgcccc aagcctctga gccgggtgtc 2760
aatcatggcc ggatctctga caggactgct gctgcttcag gccgtgtctt gggcttctgg 2820
cgctagacct tgcatcccca agagcttcgg ctacagcagc gtcgtgtgcg tgtgcaatgc 2880
cacctactgc gacagcttcg accctcctac ctttcctgct ctgggcacct tcagcagata 2940
cgagagcacc agatccggca gacggatgga actgagcatg ggacccatcc aggccaatca 3000
cacaggcact ggcctgctgc tgacactgca gcctgagcag aaattccaga aagtgaaagg 3060
cttcggcgga gccatgacag atgccgccgc tctgaatatc ctggctctgt ctccaccagc 3120
tcagaacctg ctgctcaaga gctacttcag cgaggaaggc atcggctaca acatcatcag 3180
agtgcccatg gccagctgcg acttcagcat caggacctac acctacgccg acacacccga 3240
cgatttccag ctgcacaact tcagcctgcc tgaagaggac accaagctga agatccctct 3300
gatccacaga gccctgcagc tggcacaaag acccgtgtca ctgctggcct ctccatggac 3360
atctcccacc tggctgaaaa caaatggcgc cgtgaatggc aagggcagcc tgaaaggcca 3420
acctggcgac atctaccacc agacctgggc cagatacttc gtgaagttcc tggacgccta 3480
tgccgagcac aagctgcagt tttgggccgt gacagccgag aacgaacctt ctgctggact 3540
gctgagcggc tacccctttc agtgcctggg ctttacaccc gagcaccagc gggactttat 3600
cgcccgtgat ctgggaccca cactggccaa tagcacccac cataatgtgc ggctgctgat 3660
gctggacgac cagagactgc ttctgcccca ctgggctaaa gtggtgctga cagatcctga 3720
ggccgccaaa tacgtgcacg gaatcgccgt gcactggtat ctggactttc tggcccctgc 3780
caaggccaca ctgggagaga cacacagact gttccccaac accatgctgt tcgccagcga 3840
agcctgtgtg ggcagcaagt tttgggaaca gagcgtgcgg ctcggcagct gggatagagg 3900
catgcagtac agccacagca tcatcaccaa cctgctgtac cacgtcgtcg gctggaccga 3960
ctggaatctg gccctgaatc ctgaaggcgg ccctaactgg gtccgaaact tcgtggacag 4020
ccccatcatc gtggacatca ccaaggacac cttctacaag cagcccatgt tctaccacct 4080
gggacacttc agcaagttca tccccgaggg ctctcagcgc gttggactgg tggcttccca 4140
gaagaacgat ctggacgccg tggctctgat gcaccctgat ggatctgctg tggtggtggt 4200
cctgaaccgc agcagcaaag atgtgcccct gaccatcaag gatcccgccg tgggattcct 4260
ggaaacaatc agccctggct actccatcca cacctacctg tggcgtagac agtgacaatt 4320
gttaattaag tttaaaccct cgaggccgca agcaataaaa tatctttatt ttcattacat 4380
ctgtgtgttg gttttttgtg tggagatcca cgataacaaa cagctttttt ggggtgaaca 4440
tattgactga attccctgca ggttggccac tccctctctg cgcgctcgct cgctcactga 4500
ggccgcccgg gcaaagcccg ggcgtcgggc gacctttggt cgcccggcct cagtgagcga 4560
gcgagcgcgc agagagggag tggccaactc catcactagg ggttcctgcg gccgctcgta 4620
cggtctcgag gaattcctgc aggataactt gccaacctca ttctaaaatg tatatagaag 4680
cccaaaagac aataacaaaa atattcttgt agaacaaaat gggaaagaat gttccactaa 4740
atatcaagat ttagagcaaa gcatgagatg tgtggggata gacagtgagg ctgataaaat 4800
agagtagagc tcagaaacag acccattgat atatgtaagt gacctatgaa aaaaatatgg 4860
cattttacaa tgggaaaatg atggtctttt tcttttttag aaaaacaggg aaatatattt 4920
atatgtaaaa aataaaaggg aacccatatg tcataccata cacacaaaaa aattccagtg 4980
aattataagt ctaaatggag aaggcaaaac tttaaatctt ttagaaaata atatagaagc 5040
atgcagacca gcctggccaa catgatgaaa ccctctctac taataataaa atcagtagaa 5100
ctactcagga ctactttgag tgggaagtcc ttttctatga agacttcttt ggccaaaatt 5160
aggctctaaa tgcaaggaga tagtgcatca tgcctggctg cacttactga taaatgatgt 5220
tatcaccatc tttaaccaaa tgcacaggaa caagttatgg tactgatgtg ctggattgag 5280
aaggagctct acttccttga caggacacat ttgtatcaac ttaaaaaagc agatttttgc 5340
cagcagaact attcattcag aggtaggaaa cttagaatag atgatgtcac tgattagcat 5400
ggcttcccca tctccacagc tgcttcccac ccaggttgcc cacagttgag tttgtccagt 5460
gctcagggct gcccactctc agtaagaagc cccacaccag cccctctcca aatatgttgg 5520
ctgttccttc cattaaagtg accccacttt agagcagcaa gtggatttct gtttcttaca 5580
gttcaggaag gaggagtcag ctgtgagaac ctggagcctg agatgcttct aagtcccact 5640
gctactgggg tcagggaagc cagactccag catcagcagt caggagcact aagcccttgc 5700
caacatcctg tttctcagag aaactgcttc cattataatg gttgtccttt tttaagctat 5760
caagccaaac aaccagtgtc taccattatt ctcatcacct gaagccaagg gttctagcaa 5820
aagtcaagct gtcttgtaat ggttgatgtg cctccagctt ctgtcttcag tcactccact 5880
cttagcctgc tctgaatcaa ctctgaccac agttccctgg agcccctgcc acctgctgcc 5940
cctgccacct tctccatctg cagtgctgtg cagccttctg cactcttgca gagctaatag 6000
gtggagactt gaaggaagag gaggaaagtt tctcataata gccttgctgc aagctcaaat 6060
gggaggtggg cactgtgccc aggagccttg gagcaaaggc tgtgcccaac ctctgactgc 6120
atccaggttt ggtcttgaca gagataagaa gccctggctt ttggagccaa aatctaggtc 6180
agacttaggc aggattctca aagtttatca gcagaacatg aggcagaaga ccctttctgc 6240
tccagcttct tcaggctcaa ccttcatcag aatagataga aagagaggct gtgagggttc 6300
ttaaaacaga agcaaatctg actcagagaa taaacaacct cctagtaaac tacagcttag 6360
acagagcatc tggtggtgag tgtgctcagt gtcctactca actgtctggt atcagccctc 6420
atgaggactt ctcttctttc cctcatagac ctccatctct gttttcctta gcctgcagaa 6480
atctggatgg ctattcacag aatgcctgtg ctttcagagt tgcatttttt ctctggtatt 6540
ctggttcaag catttgaagg taggaaaggt tctccaagtg caagaaagcc agccctgagc 6600
ctcaactgcc tggctagtgt ggtcagtagg atgcaaaggc tgttgaatgc cacaaggcca 6660
aactttaacc tgtgtaccac aagcctagca gcagaggcag ctctgctcac tggaactctc 6720
tgtcttcttt ctcctgagcc ttttcttttc ctgagttttc tagctctcct caaccttacc 6780
tctgccctac ccaggacaaa cccaagagcc actgtttctg tgatgtcctc tccagcccta 6840
attaggcatc atgacttcag cctgaccttc catgctcaga agcagtgcta atccacttca 6900
gatgagctgc tctatgcaac acaggcagag cctacaaacc tttgcaccag agccctccac 6960
atatcagtgt ttgttcatac tcacttcaac agcaaatgtg actgctgaga ttaagatttt 7020
acacaagatg gtctgtaatt tcacagttag ttttatccca ttaggtatga aagaattagc 7080
ataattcccc ttaaacatga atgaatctta gattttttaa taaatagttt tggaagtaaa 7140
gacagagaca tcaggagcac aaggaatagc ctgagaggac aaacagaaca agaaagagtc 7200
tggaaataca caggatgttc ttggcctcct caaagcaagt gcaagcagat agtaccagca 7260
gccccaggct atcagagccc agtgaagaga agtaccatga aagccacagc tctaaccacc 7320
ctgttccaga gtgacagaca gtccccaaga caagccagcc tgagccagag agagaactgc 7380
aagagaaagt ttctaattta ggttctgtta gattcagaca agtgcaggtc atcctctctc 7440
cacagctact cacctctcca gcctaacaaa gcctgcagtc cacactccaa ccctggtgtc 7500
tcacctccta gcctctccca acatcctgct ctctgaccat cttctgcatc tctcatctca 7560
ccatctccca ctgtctacag cctactcttg caactaccat ctcattttct gacatcctgt 7620
ctacatcttc tgccatactc tgccatctac cataccacct cttaccatct accacaccat 7680
cttttatctc catccctctc agaagcctcc aagctgaatc ctgctttatg tgttcatctc 7740
agcccctgca tggaaagctg accccagagg cagaactatt cccagagagc ttggccaaga 7800
aaaacaaaac taccagcctg gccaggctca ggagtagtaa gctgcagtgt ctgttgtgtt 7860
ctagcttcaa cagctgcagg agttccactc tcaaatgctc cacatttctc acatcctcct 7920
gattctggtc actacccatc ttcaaagaac agaatatctc acatcagcat actgtgaagg 7980
actagtcatg ggtgcagctg ctcagagctg caaagtcatt ctggatggtg gagagcttac 8040
aaacatttca tgatgctccc cccgctctga tggctggagc ccaatcccta cacagactcc 8100
tgctgtatgt gttttccttt cactctgagc cacagccaga gggcaggcat tcagtctcct 8160
cttcaggctg gggctggggc actgagaact cacccaacac cttgctctca ctccttctgc 8220
aaaacaagaa agagctttgt gctgcagtag ccatgaagaa tgaaaggaag gctttaacta 8280
aaaaatgtca gagattattt tcaacccctt actgtggatc accagcaagg aggaaacaca 8340
acacagagac attttttccc ctcaaattat caaaagaatc actgcatttg ttaaagagag 8400
caactgaatc aggaagcaga gttttgaaca tatcagaagt taggaatctg catcagagac 8460
aaatgcagtc atggttgttt gctgcatacc agccctaatc attagaagcc tcatggactt 8520
caaacatcat tccctctgac aagatgctct agcctaactc catgagataa aataaatctg 8580
cctttcagag ccaaagaaga gtccaccagc ttcttctcag tgtgaacaag agctccagtc 8640
aggttagtca gtccagtgca gtagaggaga ccagtctgca tcctctaatt ttcaaaggca 8700
agaagatttg tttaccctgg acaccaggca caagtgaggt cacagagctc ttagatatgc 8760
agtcctcatg agtgaggaga ctaaagcgca tgccatcaag acttcagtgt agagaaaacc 8820
tccaaaaaag cctcctcact acttctggaa tagctcagag gccgaggcgg cctcggcctc 8880
tgcataaata aaaaaaatta gtcagccatg gggcggagaa tgggcggaac tgggcggagt 8940
taggggcggg atgggcggag ttaggggcgg gactatggtt gctgactaat tgagatgcat 9000
gctttgcata cttctgcctg ctggggagcc tggggacttt ccacacctgg ttgctgacta 9060
attgagatgc atgctttgca tacttctgcc tgctggggag cctggggact ttccacaccc 9120
taactgacac acattccaca gctgcattaa tgaatcggcc aacgcgcggg gagaggcggt 9180
ttgcgtattg ggcgctcttc cgcttcctcg ctcactgact cgctgcgctc ggtcgttcgg 9240
ctgcggcgag cggtatcagc tcactcaaag gcggtaatac ggttatccac agaatcaggg 9300
gataacgcag gaaagaacat gtgagcaaaa ggccagcaaa aggccaggaa ccgtaaaaag 9360
gccgcgttgc tggcgttttt ccataggctc cgcccccctg acgagcatca caaaaatcga 9420
cgctcaagtc agaggtggcg aaacccgaca ggactataaa gataccaggc gtttccccct 9480
ggaagctccc tcgtgcgctc tcctgttccg accctgccgc ttaccggata cctgtccgcc 9540
tttctccctt cgggaagcgt ggcgctttct catagctcac gctgtaggta tctcagttcg 9600
gtgtaggtcg ttcgctccaa gctgggctgt gtgcacgaac cccccgttca gcccgaccgc 9660
tgcgccttat ccggtaacta tcgtcttgag tccaacccgg taagacacga cttatcgcca 9720
ctggcagcag ccactggtaa caggattagc agagcgaggt atgtaggcgg tgctacagag 9780
ttcttgaagt ggtggcctaa ctacggctac actagaagaa cagtatttgg tatctgcgct 9840
ctgctgaagc cagttacctt cggaaaaaga gttggtagct cttgatccgg caaacaaacc 9900
accgctggta gcggtggttt ttttgtttgc aagcagcaga ttacgcgcag aaaaaaagga 9960
tctcaagaag atcctttgat cttttctacg gggtctgacg ctcagtggaa cgaaaactca 10020
cgttaaggga ttttggtcat gagattatca aaaaggatct tcacctagat ccttttaaat 10080
taaaaatgaa gttttaaatc aatctaaagt atatatgagt aaacttggtc tgacagttac 10140
caatgcttaa tcagtgaggc acctatctca gcgatctgtc tatttcgttc atccatagtt 10200
gcctgactcc tgcaaaccac gttgtgtctc aaaatctctg atgttacatt gcacaagata 10260
aaaatatatc atcatgaaca ataaaactgt ctgcttacat aaacagtaat acaaggggtg 10320
ttatgagcca tattcaacgg gaaacgtctt gctcgaggcc gcgattaaat tccaacatgg 10380
atgctgattt atatgggtat aaatgggctc gcgataatgt cgggcaatca ggtgcgacaa 10440
tctatcgatt gtatgggaag cccgatgcgc cagagttgtt tctgaaacat ggcaaaggta 10500
gcgttgccaa tgatgttaca gatgagatgg tcagactaaa ctggctgacg gaatttatgc 10560
ctcttccgac catcaagcat tttatccgta ctcctgatga tgcatggtta ctcaccactg 10620
cgatccccgg gaaaacagca ttccaggtat tagaagaata tcctgattca ggtgaaaata 10680
ttgttgatgc gctggcagtg ttcctgcgcc ggttgcattc gattcctgtt tgtaattgtc 10740
cttttaacag cgatcgcgta tttcgtctcg ctcaggcgca atcacgaatg aataacggtt 10800
tggttgatgc gagtgatttt gatgacgagc gtaatggctg gcctgttgaa caagtctgga 10860
aagaaatgca taagcttttg ccattctcac cggattcagt cgtcactcat ggtgatttct 10920
cacttgataa ccttattttt gacgagggga aattaatagg ttgtattgat gttggacgag 10980
tcggaatcgc agaccgatac caggatcttg ccatcctatg gaactgcctc ggtgagtttt 11040
ctccttcatt acagaaacgg ctttttcaaa aatatggtat tgataatcct gatatgaata 11100
aattgcagtt tcatttgatg ctcgatgagt ttttctaagg gcggcctgcc accataccca 11160
cgccgaaaca agcgctcatg agcccgaagt ggcgagcccg atcttcccca tcggtgatgt 11220
cggcgatata ggcgccagca accgcacctg tggcgccggt gatgagggcg cgccaagtcg 11280
acgtccggca gtc 11293
<210> 7
<211> 10700
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 7
ttggccactc cctctctgcg cgctcgctcg ctcactgagg ccgcccgggc aaagcccggg 60
cgtcgggcga cctttggtcg cccggcctca gtgagcgagc gagcgcgcag agagggagtg 120
gccaactcca tcactagggg ttcctgctag ctctgggtat ttaagcccga gtgagcacgc 180
agggtctcca ttttgaagcg ggaggttacg cgttcgtcga ctactagtgg gtaccagagc 240
tccctaggtt ctagaaccgg tgacgtctcc catggtgaag cttggatctg aattcggtac 300
ctagttatta atagtaatca attacggggt cattagttca tagcccatat atggagttcc 360
gcgttacata acttacggta aatggcccgc ctggctgacc gcccaacgac ccccgcccat 420
tgacgtcaat aatgacgtat gttcccatag taacgccaat agggactttc cattgacgtc 480
aatgggtgga gtatttacgg taaactgccc acttggcagt acatcaagtg tatcatatgc 540
caagtacgcc ccctattgac gtcaatgacg gtaaatggcc cgcctggcat tatgcccagt 600
acatgacctt atgggacttt cctacttggc agtacatcta cgtattagtc atcgctatta 660
ccatggtcga ggtgagcccc acgttctgct tcactctccc catctccccc ccctccccac 720
ccccaatttt gtatttattt attttttaat tattttgtgc agcgatgggg gcgggggggg 780
ggggggggcg cgcgccaggc ggggcggggc ggggcgaggg gcggggcggg gcgaggcgga 840
gaggtgcggc ggcagccaat cagagcggcg cgctccgaaa gtttcctttt atggcgaggc 900
ggcggcggcg gcggccctat aaaaagcgaa gcgcgcggcg ggcgggagtc gctgcgacgc 960
tgccttcgcc ccgtgccccg ctccgccgcc gcctcgcgcc gcccgccccg gctctgactg 1020
accgcgttac tcccacaggt gagcgggcgg gacggccctt ctcctccggg ctgtaattag 1080
cgcttggttt aatgacggct tgtttctttt ctgtggctgc gtgaaagcct tgaggggctc 1140
cgggagctag agcctctgct aaccatgttc atgccttctt ctttttccta cagctcctgg 1200
gcaacgtgct ggttattgtg ctgtctcatc attttggcaa agaattcctc gaagatccga 1260
agggaaagtc ttccacgact gtgggatccg ttcgaagata tcaccggttg agccaccatg 1320
gaattcagca gccccagcag agaggaatgc cccaagcctc tgagccgggt gtcaatcatg 1380
gccggatctc tgacaggact gctgctgctt caggccgtgt cttgggcttc tggcgctaga 1440
ccttgcatcc ccaagagctt cggctacagc agcgtcgtgt gcgtgtgcaa tgccacctac 1500
tgcgacagct tcgaccctcc tacctttcct gctctgggca ccttcagcag atacgagagc 1560
accagatccg gcagacggat ggaactgagc atgggaccca tccaggccaa tcacacaggc 1620
actggcctgc tgctgacact gcagcctgag cagaaattcc agaaagtgaa aggcttcggc 1680
ggagccatga cagatgccgc cgctctgaat atcctggctc tgtctccacc agctcagaac 1740
ctgctgctca agagctactt cagcgaggaa ggcatcggct acaacatcat cagagtgccc 1800
atggccagct gcgacttcag catcaggacc tacacctacg ccgacacacc cgacgatttc 1860
cagctgcaca acttcagcct gcctgaagag gacaccaagc tgaagatccc tctgatccac 1920
agagccctgc agctggcaca aagacccgtg tcactgctgg cctctccatg gacatctccc 1980
acctggctga aaacaaatgg cgccgtgaat ggcaagggca gcctgaaagg ccaacctggc 2040
gacatctacc accagacctg ggccagatac ttcgtgaagt tcctggacgc ctatgccgag 2100
cacaagctgc agttttgggc cgtgacagcc gagaacgaac cttctgctgg actgctgagc 2160
ggctacccct ttcagtgcct gggctttaca cccgagcacc agcgggactt tatcgcccgt 2220
gatctgggac ccacactggc caatagcacc caccataatg tgcggctgct gatgctggac 2280
gaccagagac tgcttctgcc ccactgggct aaagtggtgc tgacagatcc tgaggccgcc 2340
aaatacgtgc acggaatcgc cgtgcactgg tatctggact ttctggcccc tgccaaggcc 2400
acactgggag agacacacag actgttcccc aacaccatgc tgttcgccag cgaagcctgt 2460
gtgggcagca agttttggga acagagcgtg cggctcggca gctgggatag aggcatgcag 2520
tacagccaca gcatcatcac caacctgctg taccacgtcg tcggctggac cgactggaat 2580
ctggccctga atcctgaagg cggccctaac tgggtccgaa acttcgtgga cagccccatc 2640
atcgtggaca tcaccaagga caccttctac aagcagccca tgttctacca cctgggacac 2700
ttcagcaagt tcatccccga gggctctcag cgcgttggac tggtggcttc ccagaagaac 2760
gatctggacg ccgtggctct gatgcaccct gatggatctg ctgtggtggt ggtcctgaac 2820
cgcagcagca aagatgtgcc cctgaccatc aaggatcccg ccgtgggatt cctggaaaca 2880
atcagccctg gctactccat ccacacctac ctgtggcgta gacagtgaca attgttaatt 2940
aagtttaaac cctcgaggcc gcaagcttat cgataatcaa cctctggatt acaaaatttg 3000
tgaaagattg actggtattc ttaactatgt tgctcctttt acgctatgtg gatacgctgc 3060
tttaatgcct ttgtatcatg ctattgcttc ccgtatggct ttcattttct cctccttgta 3120
taaatcctgg ttgctgtctc tttatgagga gttgtggccc gttgtcaggc aacgtggcgt 3180
ggtgtgcact gtgtttgctg acgcaacccc cactggttgg ggcattgcca ccacctgtca 3240
gctcctttcc gggactttcg ctttccccct ccctattgcc acggcggaac tcatcgccgc 3300
ctgccttgcc cgctgctgga caggggctcg gctgttgggc actgacaatt ccgtggtgtt 3360
gtcggggaaa tcatcgtcct ttccttggct gctcgcctgt gttgccacct ggattctgcg 3420
cgggacgtcc ttctgctacg tcccttcggc cctcaatcca gcggaccttc cttcccgcgg 3480
cctgctgccg gctctgcggc ctcttccgcg tcttcgcctt cgccctcaga cgagtcggat 3540
ctccctttgg gccgcctccc cgcatcgata ccgtcgacta gagctcgctg atcagcctcg 3600
actgtgcctt ctagttgcca gccatctgtt gtttgcccct cccccgtgcc ttccttgacc 3660
ctggaaggtg ccactcccac tgtcctttcc taataaaatg aggaaattgc atcgcattgt 3720
ctgagtaggt gtcattctat tctggggggt ggggtggggc aggacagcaa gggggaggat 3780
tgggaagaca atagcaggca tgctggggag agatccacga taacaaacag cttttttggg 3840
gtgaacatat tgactgaatt ccctgcaggt tggccactcc ctctctgcgc gctcgctcgc 3900
tcactgaggc cgcccgggca aagcccgggc gtcgggcgac ctttggtcgc ccggcctcag 3960
tgagcgagcg agcgcgcaga gagggagtgg ccaactccat cactaggggt tcctgcggcc 4020
gctcgtacgg tctcgaggaa ttcctgcagg ataacttgcc aacctcattc taaaatgtat 4080
atagaagccc aaaagacaat aacaaaaata ttcttgtaga acaaaatggg aaagaatgtt 4140
ccactaaata tcaagattta gagcaaagca tgagatgtgt ggggatagac agtgaggctg 4200
ataaaataga gtagagctca gaaacagacc cattgatata tgtaagtgac ctatgaaaaa 4260
aatatggcat tttacaatgg gaaaatgatg gtctttttct tttttagaaa aacagggaaa 4320
tatatttata tgtaaaaaat aaaagggaac ccatatgtca taccatacac acaaaaaaat 4380
tccagtgaat tataagtcta aatggagaag gcaaaacttt aaatctttta gaaaataata 4440
tagaagcatg cagaccagcc tggccaacat gatgaaaccc tctctactaa taataaaatc 4500
agtagaacta ctcaggacta ctttgagtgg gaagtccttt tctatgaaga cttctttggc 4560
caaaattagg ctctaaatgc aaggagatag tgcatcatgc ctggctgcac ttactgataa 4620
atgatgttat caccatcttt aaccaaatgc acaggaacaa gttatggtac tgatgtgctg 4680
gattgagaag gagctctact tccttgacag gacacatttg tatcaactta aaaaagcaga 4740
tttttgccag cagaactatt cattcagagg taggaaactt agaatagatg atgtcactga 4800
ttagcatggc ttccccatct ccacagctgc ttcccaccca ggttgcccac agttgagttt 4860
gtccagtgct cagggctgcc cactctcagt aagaagcccc acaccagccc ctctccaaat 4920
atgttggctg ttccttccat taaagtgacc ccactttaga gcagcaagtg gatttctgtt 4980
tcttacagtt caggaaggag gagtcagctg tgagaacctg gagcctgaga tgcttctaag 5040
tcccactgct actggggtca gggaagccag actccagcat cagcagtcag gagcactaag 5100
cccttgccaa catcctgttt ctcagagaaa ctgcttccat tataatggtt gtcctttttt 5160
aagctatcaa gccaaacaac cagtgtctac cattattctc atcacctgaa gccaagggtt 5220
ctagcaaaag tcaagctgtc ttgtaatggt tgatgtgcct ccagcttctg tcttcagtca 5280
ctccactctt agcctgctct gaatcaactc tgaccacagt tccctggagc ccctgccacc 5340
tgctgcccct gccaccttct ccatctgcag tgctgtgcag ccttctgcac tcttgcagag 5400
ctaataggtg gagacttgaa ggaagaggag gaaagtttct cataatagcc ttgctgcaag 5460
ctcaaatggg aggtgggcac tgtgcccagg agccttggag caaaggctgt gcccaacctc 5520
tgactgcatc caggtttggt cttgacagag ataagaagcc ctggcttttg gagccaaaat 5580
ctaggtcaga cttaggcagg attctcaaag tttatcagca gaacatgagg cagaagaccc 5640
tttctgctcc agcttcttca ggctcaacct tcatcagaat agatagaaag agaggctgtg 5700
agggttctta aaacagaagc aaatctgact cagagaataa acaacctcct agtaaactac 5760
agcttagaca gagcatctgg tggtgagtgt gctcagtgtc ctactcaact gtctggtatc 5820
agccctcatg aggacttctc ttctttccct catagacctc catctctgtt ttccttagcc 5880
tgcagaaatc tggatggcta ttcacagaat gcctgtgctt tcagagttgc attttttctc 5940
tggtattctg gttcaagcat ttgaaggtag gaaaggttct ccaagtgcaa gaaagccagc 6000
cctgagcctc aactgcctgg ctagtgtggt cagtaggatg caaaggctgt tgaatgccac 6060
aaggccaaac tttaacctgt gtaccacaag cctagcagca gaggcagctc tgctcactgg 6120
aactctctgt cttctttctc ctgagccttt tcttttcctg agttttctag ctctcctcaa 6180
ccttacctct gccctaccca ggacaaaccc aagagccact gtttctgtga tgtcctctcc 6240
agccctaatt aggcatcatg acttcagcct gaccttccat gctcagaagc agtgctaatc 6300
cacttcagat gagctgctct atgcaacaca ggcagagcct acaaaccttt gcaccagagc 6360
cctccacata tcagtgtttg ttcatactca cttcaacagc aaatgtgact gctgagatta 6420
agattttaca caagatggtc tgtaatttca cagttagttt tatcccatta ggtatgaaag 6480
aattagcata attcccctta aacatgaatg aatcttagat tttttaataa atagttttgg 6540
aagtaaagac agagacatca ggagcacaag gaatagcctg agaggacaaa cagaacaaga 6600
aagagtctgg aaatacacag gatgttcttg gcctcctcaa agcaagtgca agcagatagt 6660
accagcagcc ccaggctatc agagcccagt gaagagaagt accatgaaag ccacagctct 6720
aaccaccctg ttccagagtg acagacagtc cccaagacaa gccagcctga gccagagaga 6780
gaactgcaag agaaagtttc taatttaggt tctgttagat tcagacaagt gcaggtcatc 6840
ctctctccac agctactcac ctctccagcc taacaaagcc tgcagtccac actccaaccc 6900
tggtgtctca cctcctagcc tctcccaaca tcctgctctc tgaccatctt ctgcatctct 6960
catctcacca tctcccactg tctacagcct actcttgcaa ctaccatctc attttctgac 7020
atcctgtcta catcttctgc catactctgc catctaccat accacctctt accatctacc 7080
acaccatctt ttatctccat ccctctcaga agcctccaag ctgaatcctg ctttatgtgt 7140
tcatctcagc ccctgcatgg aaagctgacc ccagaggcag aactattccc agagagcttg 7200
gccaagaaaa acaaaactac cagcctggcc aggctcagga gtagtaagct gcagtgtctg 7260
ttgtgttcta gcttcaacag ctgcaggagt tccactctca aatgctccac atttctcaca 7320
tcctcctgat tctggtcact acccatcttc aaagaacaga atatctcaca tcagcatact 7380
gtgaaggact agtcatgggt gcagctgctc agagctgcaa agtcattctg gatggtggag 7440
agcttacaaa catttcatga tgctcccccc gctctgatgg ctggagccca atccctacac 7500
agactcctgc tgtatgtgtt ttcctttcac tctgagccac agccagaggg caggcattca 7560
gtctcctctt caggctgggg ctggggcact gagaactcac ccaacacctt gctctcactc 7620
cttctgcaaa acaagaaaga gctttgtgct gcagtagcca tgaagaatga aaggaaggct 7680
ttaactaaaa aatgtcagag attattttca accccttact gtggatcacc agcaaggagg 7740
aaacacaaca cagagacatt ttttcccctc aaattatcaa aagaatcact gcatttgtta 7800
aagagagcaa ctgaatcagg aagcagagtt ttgaacatat cagaagttag gaatctgcat 7860
cagagacaaa tgcagtcatg gttgtttgct gcataccagc cctaatcatt agaagcctca 7920
tggacttcaa acatcattcc ctctgacaag atgctctagc ctaactccat gagataaaat 7980
aaatctgcct ttcagagcca aagaagagtc caccagcttc ttctcagtgt gaacaagagc 8040
tccagtcagg ttagtcagtc cagtgcagta gaggagacca gtctgcatcc tctaattttc 8100
aaaggcaaga agatttgttt accctggaca ccaggcacaa gtgaggtcac agagctctta 8160
gatatgcagt cctcatgagt gaggagacta aagcgcatgc catcaagact tcagtgtaga 8220
gaaaacctcc aaaaaagcct cctcactact tctggaatag ctcagaggcc gaggcggcct 8280
cggcctctgc ataaataaaa aaaattagtc agccatgggg cggagaatgg gcggaactgg 8340
gcggagttag gggcgggatg ggcggagtta ggggcgggac tatggttgct gactaattga 8400
gatgcatgct ttgcatactt ctgcctgctg gggagcctgg ggactttcca cacctggttg 8460
ctgactaatt gagatgcatg ctttgcatac ttctgcctgc tggggagcct ggggactttc 8520
cacaccctaa ctgacacaca ttccacagct gcattaatga atcggccaac gcgcggggag 8580
aggcggtttg cgtattgggc gctcttccgc ttcctcgctc actgactcgc tgcgctcggt 8640
cgttcggctg cggcgagcgg tatcagctca ctcaaaggcg gtaatacggt tatccacaga 8700
atcaggggat aacgcaggaa agaacatgtg agcaaaaggc cagcaaaagg ccaggaaccg 8760
taaaaaggcc gcgttgctgg cgtttttcca taggctccgc ccccctgacg agcatcacaa 8820
aaatcgacgc tcaagtcaga ggtggcgaaa cccgacagga ctataaagat accaggcgtt 8880
tccccctgga agctccctcg tgcgctctcc tgttccgacc ctgccgctta ccggatacct 8940
gtccgccttt ctcccttcgg gaagcgtggc gctttctcat agctcacgct gtaggtatct 9000
cagttcggtg taggtcgttc gctccaagct gggctgtgtg cacgaacccc ccgttcagcc 9060
cgaccgctgc gccttatccg gtaactatcg tcttgagtcc aacccggtaa gacacgactt 9120
atcgccactg gcagcagcca ctggtaacag gattagcaga gcgaggtatg taggcggtgc 9180
tacagagttc ttgaagtggt ggcctaacta cggctacact agaagaacag tatttggtat 9240
ctgcgctctg ctgaagccag ttaccttcgg aaaaagagtt ggtagctctt gatccggcaa 9300
acaaaccacc gctggtagcg gtggtttttt tgtttgcaag cagcagatta cgcgcagaaa 9360
aaaaggatct caagaagatc ctttgatctt ttctacgggg tctgacgctc agtggaacga 9420
aaactcacgt taagggattt tggtcatgag attatcaaaa aggatcttca cctagatcct 9480
tttaaattaa aaatgaagtt ttaaatcaat ctaaagtata tatgagtaaa cttggtctga 9540
cagttaccaa tgcttaatca gtgaggcacc tatctcagcg atctgtctat ttcgttcatc 9600
catagttgcc tgactcctgc aaaccacgtt gtgtctcaaa atctctgatg ttacattgca 9660
caagataaaa atatatcatc atgaacaata aaactgtctg cttacataaa cagtaataca 9720
aggggtgtta tgagccatat tcaacgggaa acgtcttgct cgaggccgcg attaaattcc 9780
aacatggatg ctgatttata tgggtataaa tgggctcgcg ataatgtcgg gcaatcaggt 9840
gcgacaatct atcgattgta tgggaagccc gatgcgccag agttgtttct gaaacatggc 9900
aaaggtagcg ttgccaatga tgttacagat gagatggtca gactaaactg gctgacggaa 9960
tttatgcctc ttccgaccat caagcatttt atccgtactc ctgatgatgc atggttactc 10020
accactgcga tccccgggaa aacagcattc caggtattag aagaatatcc tgattcaggt 10080
gaaaatattg ttgatgcgct ggcagtgttc ctgcgccggt tgcattcgat tcctgtttgt 10140
aattgtcctt ttaacagcga tcgcgtattt cgtctcgctc aggcgcaatc acgaatgaat 10200
aacggtttgg ttgatgcgag tgattttgat gacgagcgta atggctggcc tgttgaacaa 10260
gtctggaaag aaatgcataa gcttttgcca ttctcaccgg attcagtcgt cactcatggt 10320
gatttctcac ttgataacct tatttttgac gaggggaaat taataggttg tattgatgtt 10380
ggacgagtcg gaatcgcaga ccgataccag gatcttgcca tcctatggaa ctgcctcggt 10440
gagttttctc cttcattaca gaaacggctt tttcaaaaat atggtattga taatcctgat 10500
atgaataaat tgcagtttca tttgatgctc gatgagtttt tctaagggcg gcctgccacc 10560
atacccacgc cgaaacaagc gctcatgagc ccgaagtggc gagcccgatc ttccccatcg 10620
gtgatgtcgg cgatataggc gccagcaacc gcacctgtgg cgccggtgat gagggcgcgc 10680
caagtcgacg tccggcagtc 10700
<210> 8
<211> 10700
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 8
ttggccactc cctctctgcg cgctcgctcg ctcactgagg ccgggcgacc aaaggtcgcc 60
cgacgcccgg gctttgcccg ggcggcctca gtgagcgagc gagcgcgcag agagggagtg 120
gccaactatt agatctgatg gccgcgctag ctctgggtat ttaagcccga gtgagcacgc 180
agggtctcca ttttgaagcg ggaggttacg cgttcgtcga ctactagtgg gtaccagagc 240
tccctaggtt ctagaaccgg tgacgtctcc catggtgaag cttggatctg aattcggtac 300
ctagttatta atagtaatca attacggggt cattagttca tagcccatat atggagttcc 360
gcgttacata acttacggta aatggcccgc ctggctgacc gcccaacgac ccccgcccat 420
tgacgtcaat aatgacgtat gttcccatag taacgccaat agggactttc cattgacgtc 480
aatgggtgga gtatttacgg taaactgccc acttggcagt acatcaagtg tatcatatgc 540
caagtacgcc ccctattgac gtcaatgacg gtaaatggcc cgcctggcat tatgcccagt 600
acatgacctt atgggacttt cctacttggc agtacatcta cgtattagtc atcgctatta 660
ccatggtcga ggtgagcccc acgttctgct tcactctccc catctccccc ccctccccac 720
ccccaatttt gtatttattt attttttaat tattttgtgc agcgatgggg gcgggggggg 780
ggggggggcg cgcgccaggc ggggcggggc ggggcgaggg gcggggcggg gcgaggcgga 840
gaggtgcggc ggcagccaat cagagcggcg cgctccgaaa gtttcctttt atggcgaggc 900
ggcggcggcg gcggccctat aaaaagcgaa gcgcgcggcg ggcgggagtc gctgcgacgc 960
tgccttcgcc ccgtgccccg ctccgccgcc gcctcgcgcc gcccgccccg gctctgactg 1020
accgcgttac tcccacaggt gagcgggcgg gacggccctt ctcctccggg ctgtaattag 1080
cgcttggttt aatgacggct tgtttctttt ctgtggctgc gtgaaagcct tgaggggctc 1140
cgggagctag agcctctgct aaccatgttc atgccttctt ctttttccta cagctcctgg 1200
gcaacgtgct ggttattgtg ctgtctcatc attttggcaa agaattcctc gaagatccga 1260
agggaaagtc ttccacgact gtgggatccg ttcgaagata tcaccggttg agccaccatg 1320
gaattcagca gccccagcag agaggaatgc cccaagcctc tgagccgggt gtcaatcatg 1380
gccggatctc tgacaggact gctgctgctt caggccgtgt cttgggcttc tggcgctaga 1440
ccttgcatcc ccaagagctt cggctacagc agcgtcgtgt gcgtgtgcaa tgccacctac 1500
tgcgacagct tcgaccctcc tacctttcct gctctgggca ccttcagcag atacgagagc 1560
accagatccg gcagacggat ggaactgagc atgggaccca tccaggccaa tcacacaggc 1620
actggcctgc tgctgacact gcagcctgag cagaaattcc agaaagtgaa aggcttcggc 1680
ggagccatga cagatgccgc cgctctgaat atcctggctc tgtctccacc agctcagaac 1740
ctgctgctca agagctactt cagcgaggaa ggcatcggct acaacatcat cagagtgccc 1800
atggccagct gcgacttcag catcaggacc tacacctacg ccgacacacc cgacgatttc 1860
cagctgcaca acttcagcct gcctgaagag gacaccaagc tgaagatccc tctgatccac 1920
agagccctgc agctggcaca aagacccgtg tcactgctgg cctctccatg gacatctccc 1980
acctggctga aaacaaatgg cgccgtgaat ggcaagggca gcctgaaagg ccaacctggc 2040
gacatctacc accagacctg ggccagatac ttcgtgaagt tcctggacgc ctatgccgag 2100
cacaagctgc agttttgggc cgtgacagcc gagaacgaac cttctgctgg actgctgagc 2160
ggctacccct ttcagtgcct gggctttaca cccgagcacc agcgggactt tatcgcccgt 2220
gatctgggac ccacactggc caatagcacc caccataatg tgcggctgct gatgctggac 2280
gaccagagac tgcttctgcc ccactgggct aaagtggtgc tgacagatcc tgaggccgcc 2340
aaatacgtgc acggaatcgc cgtgcactgg tatctggact ttctggcccc tgccaaggcc 2400
acactgggag agacacacag actgttcccc aacaccatgc tgttcgccag cgaagcctgt 2460
gtgggcagca agttttggga acagagcgtg cggctcggca gctgggatag aggcatgcag 2520
tacagccaca gcatcatcac caacctgctg taccacgtcg tcggctggac cgactggaat 2580
ctggccctga atcctgaagg cggccctaac tgggtccgaa acttcgtgga cagccccatc 2640
atcgtggaca tcaccaagga caccttctac aagcagccca tgttctacca cctgggacac 2700
ttcagcaagt tcatccccga gggctctcag cgcgttggac tggtggcttc ccagaagaac 2760
gatctggacg ccgtggctct gatgcaccct gatggatctg ctgtggtggt ggtcctgaac 2820
cgcagcagca aagatgtgcc cctgaccatc aaggatcccg ccgtgggatt cctggaaaca 2880
atcagccctg gctactccat ccacacctac ctgtggcgta gacagtgaca attgttaatt 2940
aagtttaaac cctcgaggcc gcaagcttat cgataatcaa cctctggatt acaaaatttg 3000
tgaaagattg actggtattc ttaactatgt tgctcctttt acgctatgtg gatacgctgc 3060
tttaatgcct ttgtatcatg ctattgcttc ccgtatggct ttcattttct cctccttgta 3120
taaatcctgg ttgctgtctc tttatgagga gttgtggccc gttgtcaggc aacgtggcgt 3180
ggtgtgcact gtgtttgctg acgcaacccc cactggttgg ggcattgcca ccacctgtca 3240
gctcctttcc gggactttcg ctttccccct ccctattgcc acggcggaac tcatcgccgc 3300
ctgccttgcc cgctgctgga caggggctcg gctgttgggc actgacaatt ccgtggtgtt 3360
gtcggggaaa tcatcgtcct ttccttggct gctcgcctgt gttgccacct ggattctgcg 3420
cgggacgtcc ttctgctacg tcccttcggc cctcaatcca gcggaccttc cttcccgcgg 3480
cctgctgccg gctctgcggc ctcttccgcg tcttcgcctt cgccctcaga cgagtcggat 3540
ctccctttgg gccgcctccc cgcatcgata ccgtcgacta gagctcgctg atcagcctcg 3600
actgtgcctt ctagttgcca gccatctgtt gtttgcccct cccccgtgcc ttccttgacc 3660
ctggaaggtg ccactcccac tgtcctttcc taataaaatg aggaaattgc atcgcattgt 3720
ctgagtaggt gtcattctat tctggggggt ggggtggggc aggacagcaa gggggaggat 3780
tgggaagaca atagcaggca tgctggggag agatccacga taacaaacag cttttttggg 3840
gtgaacatat tgactgaatt ccctgcaggt tggccactcc ctctctgcgc gctcgctcgc 3900
tcactgaggc cgcccgggca aagcccgggc gtcgggcgac ctttggtcgc ccggcctcag 3960
tgagcgagcg agcgcgcaga gagggagtgg ccaactccat cactaggggt tcctgcggcc 4020
gctcgtacgg tctcgaggaa ttcctgcagg ataacttgcc aacctcattc taaaatgtat 4080
atagaagccc aaaagacaat aacaaaaata ttcttgtaga acaaaatggg aaagaatgtt 4140
ccactaaata tcaagattta gagcaaagca tgagatgtgt ggggatagac agtgaggctg 4200
ataaaataga gtagagctca gaaacagacc cattgatata tgtaagtgac ctatgaaaaa 4260
aatatggcat tttacaatgg gaaaatgatg gtctttttct tttttagaaa aacagggaaa 4320
tatatttata tgtaaaaaat aaaagggaac ccatatgtca taccatacac acaaaaaaat 4380
tccagtgaat tataagtcta aatggagaag gcaaaacttt aaatctttta gaaaataata 4440
tagaagcatg cagaccagcc tggccaacat gatgaaaccc tctctactaa taataaaatc 4500
agtagaacta ctcaggacta ctttgagtgg gaagtccttt tctatgaaga cttctttggc 4560
caaaattagg ctctaaatgc aaggagatag tgcatcatgc ctggctgcac ttactgataa 4620
atgatgttat caccatcttt aaccaaatgc acaggaacaa gttatggtac tgatgtgctg 4680
gattgagaag gagctctact tccttgacag gacacatttg tatcaactta aaaaagcaga 4740
tttttgccag cagaactatt cattcagagg taggaaactt agaatagatg atgtcactga 4800
ttagcatggc ttccccatct ccacagctgc ttcccaccca ggttgcccac agttgagttt 4860
gtccagtgct cagggctgcc cactctcagt aagaagcccc acaccagccc ctctccaaat 4920
atgttggctg ttccttccat taaagtgacc ccactttaga gcagcaagtg gatttctgtt 4980
tcttacagtt caggaaggag gagtcagctg tgagaacctg gagcctgaga tgcttctaag 5040
tcccactgct actggggtca gggaagccag actccagcat cagcagtcag gagcactaag 5100
cccttgccaa catcctgttt ctcagagaaa ctgcttccat tataatggtt gtcctttttt 5160
aagctatcaa gccaaacaac cagtgtctac cattattctc atcacctgaa gccaagggtt 5220
ctagcaaaag tcaagctgtc ttgtaatggt tgatgtgcct ccagcttctg tcttcagtca 5280
ctccactctt agcctgctct gaatcaactc tgaccacagt tccctggagc ccctgccacc 5340
tgctgcccct gccaccttct ccatctgcag tgctgtgcag ccttctgcac tcttgcagag 5400
ctaataggtg gagacttgaa ggaagaggag gaaagtttct cataatagcc ttgctgcaag 5460
ctcaaatggg aggtgggcac tgtgcccagg agccttggag caaaggctgt gcccaacctc 5520
tgactgcatc caggtttggt cttgacagag ataagaagcc ctggcttttg gagccaaaat 5580
ctaggtcaga cttaggcagg attctcaaag tttatcagca gaacatgagg cagaagaccc 5640
tttctgctcc agcttcttca ggctcaacct tcatcagaat agatagaaag agaggctgtg 5700
agggttctta aaacagaagc aaatctgact cagagaataa acaacctcct agtaaactac 5760
agcttagaca gagcatctgg tggtgagtgt gctcagtgtc ctactcaact gtctggtatc 5820
agccctcatg aggacttctc ttctttccct catagacctc catctctgtt ttccttagcc 5880
tgcagaaatc tggatggcta ttcacagaat gcctgtgctt tcagagttgc attttttctc 5940
tggtattctg gttcaagcat ttgaaggtag gaaaggttct ccaagtgcaa gaaagccagc 6000
cctgagcctc aactgcctgg ctagtgtggt cagtaggatg caaaggctgt tgaatgccac 6060
aaggccaaac tttaacctgt gtaccacaag cctagcagca gaggcagctc tgctcactgg 6120
aactctctgt cttctttctc ctgagccttt tcttttcctg agttttctag ctctcctcaa 6180
ccttacctct gccctaccca ggacaaaccc aagagccact gtttctgtga tgtcctctcc 6240
agccctaatt aggcatcatg acttcagcct gaccttccat gctcagaagc agtgctaatc 6300
cacttcagat gagctgctct atgcaacaca ggcagagcct acaaaccttt gcaccagagc 6360
cctccacata tcagtgtttg ttcatactca cttcaacagc aaatgtgact gctgagatta 6420
agattttaca caagatggtc tgtaatttca cagttagttt tatcccatta ggtatgaaag 6480
aattagcata attcccctta aacatgaatg aatcttagat tttttaataa atagttttgg 6540
aagtaaagac agagacatca ggagcacaag gaatagcctg agaggacaaa cagaacaaga 6600
aagagtctgg aaatacacag gatgttcttg gcctcctcaa agcaagtgca agcagatagt 6660
accagcagcc ccaggctatc agagcccagt gaagagaagt accatgaaag ccacagctct 6720
aaccaccctg ttccagagtg acagacagtc cccaagacaa gccagcctga gccagagaga 6780
gaactgcaag agaaagtttc taatttaggt tctgttagat tcagacaagt gcaggtcatc 6840
ctctctccac agctactcac ctctccagcc taacaaagcc tgcagtccac actccaaccc 6900
tggtgtctca cctcctagcc tctcccaaca tcctgctctc tgaccatctt ctgcatctct 6960
catctcacca tctcccactg tctacagcct actcttgcaa ctaccatctc attttctgac 7020
atcctgtcta catcttctgc catactctgc catctaccat accacctctt accatctacc 7080
acaccatctt ttatctccat ccctctcaga agcctccaag ctgaatcctg ctttatgtgt 7140
tcatctcagc ccctgcatgg aaagctgacc ccagaggcag aactattccc agagagcttg 7200
gccaagaaaa acaaaactac cagcctggcc aggctcagga gtagtaagct gcagtgtctg 7260
ttgtgttcta gcttcaacag ctgcaggagt tccactctca aatgctccac atttctcaca 7320
tcctcctgat tctggtcact acccatcttc aaagaacaga atatctcaca tcagcatact 7380
gtgaaggact agtcatgggt gcagctgctc agagctgcaa agtcattctg gatggtggag 7440
agcttacaaa catttcatga tgctcccccc gctctgatgg ctggagccca atccctacac 7500
agactcctgc tgtatgtgtt ttcctttcac tctgagccac agccagaggg caggcattca 7560
gtctcctctt caggctgggg ctggggcact gagaactcac ccaacacctt gctctcactc 7620
cttctgcaaa acaagaaaga gctttgtgct gcagtagcca tgaagaatga aaggaaggct 7680
ttaactaaaa aatgtcagag attattttca accccttact gtggatcacc agcaaggagg 7740
aaacacaaca cagagacatt ttttcccctc aaattatcaa aagaatcact gcatttgtta 7800
aagagagcaa ctgaatcagg aagcagagtt ttgaacatat cagaagttag gaatctgcat 7860
cagagacaaa tgcagtcatg gttgtttgct gcataccagc cctaatcatt agaagcctca 7920
tggacttcaa acatcattcc ctctgacaag atgctctagc ctaactccat gagataaaat 7980
aaatctgcct ttcagagcca aagaagagtc caccagcttc ttctcagtgt gaacaagagc 8040
tccagtcagg ttagtcagtc cagtgcagta gaggagacca gtctgcatcc tctaattttc 8100
aaaggcaaga agatttgttt accctggaca ccaggcacaa gtgaggtcac agagctctta 8160
gatatgcagt cctcatgagt gaggagacta aagcgcatgc catcaagact tcagtgtaga 8220
gaaaacctcc aaaaaagcct cctcactact tctggaatag ctcagaggcc gaggcggcct 8280
cggcctctgc ataaataaaa aaaattagtc agccatgggg cggagaatgg gcggaactgg 8340
gcggagttag gggcgggatg ggcggagtta ggggcgggac tatggttgct gactaattga 8400
gatgcatgct ttgcatactt ctgcctgctg gggagcctgg ggactttcca cacctggttg 8460
ctgactaatt gagatgcatg ctttgcatac ttctgcctgc tggggagcct ggggactttc 8520
cacaccctaa ctgacacaca ttccacagct gcattaatga atcggccaac gcgcggggag 8580
aggcggtttg cgtattgggc gctcttccgc ttcctcgctc actgactcgc tgcgctcggt 8640
cgttcggctg cggcgagcgg tatcagctca ctcaaaggcg gtaatacggt tatccacaga 8700
atcaggggat aacgcaggaa agaacatgtg agcaaaaggc cagcaaaagg ccaggaaccg 8760
taaaaaggcc gcgttgctgg cgtttttcca taggctccgc ccccctgacg agcatcacaa 8820
aaatcgacgc tcaagtcaga ggtggcgaaa cccgacagga ctataaagat accaggcgtt 8880
tccccctgga agctccctcg tgcgctctcc tgttccgacc ctgccgctta ccggatacct 8940
gtccgccttt ctcccttcgg gaagcgtggc gctttctcat agctcacgct gtaggtatct 9000
cagttcggtg taggtcgttc gctccaagct gggctgtgtg cacgaacccc ccgttcagcc 9060
cgaccgctgc gccttatccg gtaactatcg tcttgagtcc aacccggtaa gacacgactt 9120
atcgccactg gcagcagcca ctggtaacag gattagcaga gcgaggtatg taggcggtgc 9180
tacagagttc ttgaagtggt ggcctaacta cggctacact agaagaacag tatttggtat 9240
ctgcgctctg ctgaagccag ttaccttcgg aaaaagagtt ggtagctctt gatccggcaa 9300
acaaaccacc gctggtagcg gtggtttttt tgtttgcaag cagcagatta cgcgcagaaa 9360
aaaaggatct caagaagatc ctttgatctt ttctacgggg tctgacgctc agtggaacga 9420
aaactcacgt taagggattt tggtcatgag attatcaaaa aggatcttca cctagatcct 9480
tttaaattaa aaatgaagtt ttaaatcaat ctaaagtata tatgagtaaa cttggtctga 9540
cagttaccaa tgcttaatca gtgaggcacc tatctcagcg atctgtctat ttcgttcatc 9600
catagttgcc tgactcctgc aaaccacgtt gtgtctcaaa atctctgatg ttacattgca 9660
caagataaaa atatatcatc atgaacaata aaactgtctg cttacataaa cagtaataca 9720
aggggtgtta tgagccatat tcaacgggaa acgtcttgct cgaggccgcg attaaattcc 9780
aacatggatg ctgatttata tgggtataaa tgggctcgcg ataatgtcgg gcaatcaggt 9840
gcgacaatct atcgattgta tgggaagccc gatgcgccag agttgtttct gaaacatggc 9900
aaaggtagcg ttgccaatga tgttacagat gagatggtca gactaaactg gctgacggaa 9960
tttatgcctc ttccgaccat caagcatttt atccgtactc ctgatgatgc atggttactc 10020
accactgcga tccccgggaa aacagcattc caggtattag aagaatatcc tgattcaggt 10080
gaaaatattg ttgatgcgct ggcagtgttc ctgcgccggt tgcattcgat tcctgtttgt 10140
aattgtcctt ttaacagcga tcgcgtattt cgtctcgctc aggcgcaatc acgaatgaat 10200
aacggtttgg ttgatgcgag tgattttgat gacgagcgta atggctggcc tgttgaacaa 10260
gtctggaaag aaatgcataa gcttttgcca ttctcaccgg attcagtcgt cactcatggt 10320
gatttctcac ttgataacct tatttttgac gaggggaaat taataggttg tattgatgtt 10380
ggacgagtcg gaatcgcaga ccgataccag gatcttgcca tcctatggaa ctgcctcggt 10440
gagttttctc cttcattaca gaaacggctt tttcaaaaat atggtattga taatcctgat 10500
atgaataaat tgcagtttca tttgatgctc gatgagtttt tctaagggcg gcctgccacc 10560
atacccacgc cgaaacaagc gctcatgagc ccgaagtggc gagcccgatc ttccccatcg 10620
gtgatgtcgg cgatataggc gccagcaacc gcacctgtgg cgccggtgat gagggcgcgc 10680
caagtcgacg tccggcagtc 10700
<210> 9
<211> 10700
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 9
ttggccactc cctctctgcg cgctcgctcg ctcactgagg ccgggcgacc aaaggtcgcc 60
cgacgcccgg gctttgcccg ggcggcctca gtgagcgagc gagcgcgcag agagggagtg 120
gccaactcca tcactagggg ttcctgctag ctctgggtat ttaagcccga gtgagcacgc 180
agggtctcca ttttgaagcg ggaggttacg cgttcgtcga ctactagtgg gtaccagagc 240
tccctaggtt ctagaaccgg tgacgtctcc catggtgaag cttggatctg aattcggtac 300
ctagttatta atagtaatca attacggggt cattagttca tagcccatat atggagttcc 360
gcgttacata acttacggta aatggcccgc ctggctgacc gcccaacgac ccccgcccat 420
tgacgtcaat aatgacgtat gttcccatag taacgccaat agggactttc cattgacgtc 480
aatgggtgga gtatttacgg taaactgccc acttggcagt acatcaagtg tatcatatgc 540
caagtacgcc ccctattgac gtcaatgacg gtaaatggcc cgcctggcat tatgcccagt 600
acatgacctt atgggacttt cctacttggc agtacatcta cgtattagtc atcgctatta 660
ccatggtcga ggtgagcccc acgttctgct tcactctccc catctccccc ccctccccac 720
ccccaatttt gtatttattt attttttaat tattttgtgc agcgatgggg gcgggggggg 780
ggggggggcg cgcgccaggc ggggcggggc ggggcgaggg gcggggcggg gcgaggcgga 840
gaggtgcggc ggcagccaat cagagcggcg cgctccgaaa gtttcctttt atggcgaggc 900
ggcggcggcg gcggccctat aaaaagcgaa gcgcgcggcg ggcgggagtc gctgcgacgc 960
tgccttcgcc ccgtgccccg ctccgccgcc gcctcgcgcc gcccgccccg gctctgactg 1020
accgcgttac tcccacaggt gagcgggcgg gacggccctt ctcctccggg ctgtaattag 1080
cgcttggttt aatgacggct tgtttctttt ctgtggctgc gtgaaagcct tgaggggctc 1140
cgggagctag agcctctgct aaccatgttc atgccttctt ctttttccta cagctcctgg 1200
gcaacgtgct ggttattgtg ctgtctcatc attttggcaa agaattcctc gaagatccga 1260
agggaaagtc ttccacgact gtgggatccg ttcgaagata tcaccggttg agccaccatg 1320
gaattcagca gccccagcag agaggaatgc cccaagcctc tgagccgggt gtcaatcatg 1380
gccggatctc tgacaggact gctgctgctt caggccgtgt cttgggcttc tggcgctaga 1440
ccttgcatcc ccaagagctt cggctacagc agcgtcgtgt gcgtgtgcaa tgccacctac 1500
tgcgacagct tcgaccctcc tacctttcct gctctgggca ccttcagcag atacgagagc 1560
accagatccg gcagacggat ggaactgagc atgggaccca tccaggccaa tcacacaggc 1620
actggcctgc tgctgacact gcagcctgag cagaaattcc agaaagtgaa aggcttcggc 1680
ggagccatga cagatgccgc cgctctgaat atcctggctc tgtctccacc agctcagaac 1740
ctgctgctca agagctactt cagcgaggaa ggcatcggct acaacatcat cagagtgccc 1800
atggccagct gcgacttcag catcaggacc tacacctacg ccgacacacc cgacgatttc 1860
cagctgcaca acttcagcct gcctgaagag gacaccaagc tgaagatccc tctgatccac 1920
agagccctgc agctggcaca aagacccgtg tcactgctgg cctctccatg gacatctccc 1980
acctggctga aaacaaatgg cgccgtgaat ggcaagggca gcctgaaagg ccaacctggc 2040
gacatctacc accagacctg ggccagatac ttcgtgaagt tcctggacgc ctatgccgag 2100
cacaagctgc agttttgggc cgtgacagcc gagaacgaac cttctgctgg actgctgagc 2160
ggctacccct ttcagtgcct gggctttaca cccgagcacc agcgggactt tatcgcccgt 2220
gatctgggac ccacactggc caatagcacc caccataatg tgcggctgct gatgctggac 2280
gaccagagac tgcttctgcc ccactgggct aaagtggtgc tgacagatcc tgaggccgcc 2340
aaatacgtgc acggaatcgc cgtgcactgg tatctggact ttctggcccc tgccaaggcc 2400
acactgggag agacacacag actgttcccc aacaccatgc tgttcgccag cgaagcctgt 2460
gtgggcagca agttttggga acagagcgtg cggctcggca gctgggatag aggcatgcag 2520
tacagccaca gcatcatcac caacctgctg taccacgtcg tcggctggac cgactggaat 2580
ctggccctga atcctgaagg cggccctaac tgggtccgaa acttcgtgga cagccccatc 2640
atcgtggaca tcaccaagga caccttctac aagcagccca tgttctacca cctgggacac 2700
ttcagcaagt tcatccccga gggctctcag cgcgttggac tggtggcttc ccagaagaac 2760
gatctggacg ccgtggctct gatgcaccct gatggatctg ctgtggtggt ggtcctgaac 2820
cgcagcagca aagatgtgcc cctgaccatc aaggatcccg ccgtgggatt cctggaaaca 2880
atcagccctg gctactccat ccacacctac ctgtggcgta gacagtgaca attgttaatt 2940
aagtttaaac cctcgaggcc gcaagcttat cgataatcaa cctctggatt acaaaatttg 3000
tgaaagattg actggtattc ttaactatgt tgctcctttt acgctatgtg gatacgctgc 3060
tttaatgcct ttgtatcatg ctattgcttc ccgtatggct ttcattttct cctccttgta 3120
taaatcctgg ttgctgtctc tttatgagga gttgtggccc gttgtcaggc aacgtggcgt 3180
ggtgtgcact gtgtttgctg acgcaacccc cactggttgg ggcattgcca ccacctgtca 3240
gctcctttcc gggactttcg ctttccccct ccctattgcc acggcggaac tcatcgccgc 3300
ctgccttgcc cgctgctgga caggggctcg gctgttgggc actgacaatt ccgtggtgtt 3360
gtcggggaaa tcatcgtcct ttccttggct gctcgcctgt gttgccacct ggattctgcg 3420
cgggacgtcc ttctgctacg tcccttcggc cctcaatcca gcggaccttc cttcccgcgg 3480
cctgctgccg gctctgcggc ctcttccgcg tcttcgcctt cgccctcaga cgagtcggat 3540
ctccctttgg gccgcctccc cgcatcgata ccgtcgacta gagctcgctg atcagcctcg 3600
actgtgcctt ctagttgcca gccatctgtt gtttgcccct cccccgtgcc ttccttgacc 3660
ctggaaggtg ccactcccac tgtcctttcc taataaaatg aggaaattgc atcgcattgt 3720
ctgagtaggt gtcattctat tctggggggt ggggtggggc aggacagcaa gggggaggat 3780
tgggaagaca atagcaggca tgctggggag agatccacga taacaaacag cttttttggg 3840
gtgaacatat tgactgaatt ccctgcaggt tggccactcc ctctctgcgc gctcgctcgc 3900
tcactgaggc cgcccgggca aagcccgggc gtcgggcgac ctttggtcgc ccggcctcag 3960
tgagcgagcg agcgcgcaga gagggagtgg ccaactccat cactaggggt tcctgcggcc 4020
gctcgtacgg tctcgaggaa ttcctgcagg ataacttgcc aacctcattc taaaatgtat 4080
atagaagccc aaaagacaat aacaaaaata ttcttgtaga acaaaatggg aaagaatgtt 4140
ccactaaata tcaagattta gagcaaagca tgagatgtgt ggggatagac agtgaggctg 4200
ataaaataga gtagagctca gaaacagacc cattgatata tgtaagtgac ctatgaaaaa 4260
aatatggcat tttacaatgg gaaaatgatg gtctttttct tttttagaaa aacagggaaa 4320
tatatttata tgtaaaaaat aaaagggaac ccatatgtca taccatacac acaaaaaaat 4380
tccagtgaat tataagtcta aatggagaag gcaaaacttt aaatctttta gaaaataata 4440
tagaagcatg cagaccagcc tggccaacat gatgaaaccc tctctactaa taataaaatc 4500
agtagaacta ctcaggacta ctttgagtgg gaagtccttt tctatgaaga cttctttggc 4560
caaaattagg ctctaaatgc aaggagatag tgcatcatgc ctggctgcac ttactgataa 4620
atgatgttat caccatcttt aaccaaatgc acaggaacaa gttatggtac tgatgtgctg 4680
gattgagaag gagctctact tccttgacag gacacatttg tatcaactta aaaaagcaga 4740
tttttgccag cagaactatt cattcagagg taggaaactt agaatagatg atgtcactga 4800
ttagcatggc ttccccatct ccacagctgc ttcccaccca ggttgcccac agttgagttt 4860
gtccagtgct cagggctgcc cactctcagt aagaagcccc acaccagccc ctctccaaat 4920
atgttggctg ttccttccat taaagtgacc ccactttaga gcagcaagtg gatttctgtt 4980
tcttacagtt caggaaggag gagtcagctg tgagaacctg gagcctgaga tgcttctaag 5040
tcccactgct actggggtca gggaagccag actccagcat cagcagtcag gagcactaag 5100
cccttgccaa catcctgttt ctcagagaaa ctgcttccat tataatggtt gtcctttttt 5160
aagctatcaa gccaaacaac cagtgtctac cattattctc atcacctgaa gccaagggtt 5220
ctagcaaaag tcaagctgtc ttgtaatggt tgatgtgcct ccagcttctg tcttcagtca 5280
ctccactctt agcctgctct gaatcaactc tgaccacagt tccctggagc ccctgccacc 5340
tgctgcccct gccaccttct ccatctgcag tgctgtgcag ccttctgcac tcttgcagag 5400
ctaataggtg gagacttgaa ggaagaggag gaaagtttct cataatagcc ttgctgcaag 5460
ctcaaatggg aggtgggcac tgtgcccagg agccttggag caaaggctgt gcccaacctc 5520
tgactgcatc caggtttggt cttgacagag ataagaagcc ctggcttttg gagccaaaat 5580
ctaggtcaga cttaggcagg attctcaaag tttatcagca gaacatgagg cagaagaccc 5640
tttctgctcc agcttcttca ggctcaacct tcatcagaat agatagaaag agaggctgtg 5700
agggttctta aaacagaagc aaatctgact cagagaataa acaacctcct agtaaactac 5760
agcttagaca gagcatctgg tggtgagtgt gctcagtgtc ctactcaact gtctggtatc 5820
agccctcatg aggacttctc ttctttccct catagacctc catctctgtt ttccttagcc 5880
tgcagaaatc tggatggcta ttcacagaat gcctgtgctt tcagagttgc attttttctc 5940
tggtattctg gttcaagcat ttgaaggtag gaaaggttct ccaagtgcaa gaaagccagc 6000
cctgagcctc aactgcctgg ctagtgtggt cagtaggatg caaaggctgt tgaatgccac 6060
aaggccaaac tttaacctgt gtaccacaag cctagcagca gaggcagctc tgctcactgg 6120
aactctctgt cttctttctc ctgagccttt tcttttcctg agttttctag ctctcctcaa 6180
ccttacctct gccctaccca ggacaaaccc aagagccact gtttctgtga tgtcctctcc 6240
agccctaatt aggcatcatg acttcagcct gaccttccat gctcagaagc agtgctaatc 6300
cacttcagat gagctgctct atgcaacaca ggcagagcct acaaaccttt gcaccagagc 6360
cctccacata tcagtgtttg ttcatactca cttcaacagc aaatgtgact gctgagatta 6420
agattttaca caagatggtc tgtaatttca cagttagttt tatcccatta ggtatgaaag 6480
aattagcata attcccctta aacatgaatg aatcttagat tttttaataa atagttttgg 6540
aagtaaagac agagacatca ggagcacaag gaatagcctg agaggacaaa cagaacaaga 6600
aagagtctgg aaatacacag gatgttcttg gcctcctcaa agcaagtgca agcagatagt 6660
accagcagcc ccaggctatc agagcccagt gaagagaagt accatgaaag ccacagctct 6720
aaccaccctg ttccagagtg acagacagtc cccaagacaa gccagcctga gccagagaga 6780
gaactgcaag agaaagtttc taatttaggt tctgttagat tcagacaagt gcaggtcatc 6840
ctctctccac agctactcac ctctccagcc taacaaagcc tgcagtccac actccaaccc 6900
tggtgtctca cctcctagcc tctcccaaca tcctgctctc tgaccatctt ctgcatctct 6960
catctcacca tctcccactg tctacagcct actcttgcaa ctaccatctc attttctgac 7020
atcctgtcta catcttctgc catactctgc catctaccat accacctctt accatctacc 7080
acaccatctt ttatctccat ccctctcaga agcctccaag ctgaatcctg ctttatgtgt 7140
tcatctcagc ccctgcatgg aaagctgacc ccagaggcag aactattccc agagagcttg 7200
gccaagaaaa acaaaactac cagcctggcc aggctcagga gtagtaagct gcagtgtctg 7260
ttgtgttcta gcttcaacag ctgcaggagt tccactctca aatgctccac atttctcaca 7320
tcctcctgat tctggtcact acccatcttc aaagaacaga atatctcaca tcagcatact 7380
gtgaaggact agtcatgggt gcagctgctc agagctgcaa agtcattctg gatggtggag 7440
agcttacaaa catttcatga tgctcccccc gctctgatgg ctggagccca atccctacac 7500
agactcctgc tgtatgtgtt ttcctttcac tctgagccac agccagaggg caggcattca 7560
gtctcctctt caggctgggg ctggggcact gagaactcac ccaacacctt gctctcactc 7620
cttctgcaaa acaagaaaga gctttgtgct gcagtagcca tgaagaatga aaggaaggct 7680
ttaactaaaa aatgtcagag attattttca accccttact gtggatcacc agcaaggagg 7740
aaacacaaca cagagacatt ttttcccctc aaattatcaa aagaatcact gcatttgtta 7800
aagagagcaa ctgaatcagg aagcagagtt ttgaacatat cagaagttag gaatctgcat 7860
cagagacaaa tgcagtcatg gttgtttgct gcataccagc cctaatcatt agaagcctca 7920
tggacttcaa acatcattcc ctctgacaag atgctctagc ctaactccat gagataaaat 7980
aaatctgcct ttcagagcca aagaagagtc caccagcttc ttctcagtgt gaacaagagc 8040
tccagtcagg ttagtcagtc cagtgcagta gaggagacca gtctgcatcc tctaattttc 8100
aaaggcaaga agatttgttt accctggaca ccaggcacaa gtgaggtcac agagctctta 8160
gatatgcagt cctcatgagt gaggagacta aagcgcatgc catcaagact tcagtgtaga 8220
gaaaacctcc aaaaaagcct cctcactact tctggaatag ctcagaggcc gaggcggcct 8280
cggcctctgc ataaataaaa aaaattagtc agccatgggg cggagaatgg gcggaactgg 8340
gcggagttag gggcgggatg ggcggagtta ggggcgggac tatggttgct gactaattga 8400
gatgcatgct ttgcatactt ctgcctgctg gggagcctgg ggactttcca cacctggttg 8460
ctgactaatt gagatgcatg ctttgcatac ttctgcctgc tggggagcct ggggactttc 8520
cacaccctaa ctgacacaca ttccacagct gcattaatga atcggccaac gcgcggggag 8580
aggcggtttg cgtattgggc gctcttccgc ttcctcgctc actgactcgc tgcgctcggt 8640
cgttcggctg cggcgagcgg tatcagctca ctcaaaggcg gtaatacggt tatccacaga 8700
atcaggggat aacgcaggaa agaacatgtg agcaaaaggc cagcaaaagg ccaggaaccg 8760
taaaaaggcc gcgttgctgg cgtttttcca taggctccgc ccccctgacg agcatcacaa 8820
aaatcgacgc tcaagtcaga ggtggcgaaa cccgacagga ctataaagat accaggcgtt 8880
tccccctgga agctccctcg tgcgctctcc tgttccgacc ctgccgctta ccggatacct 8940
gtccgccttt ctcccttcgg gaagcgtggc gctttctcat agctcacgct gtaggtatct 9000
cagttcggtg taggtcgttc gctccaagct gggctgtgtg cacgaacccc ccgttcagcc 9060
cgaccgctgc gccttatccg gtaactatcg tcttgagtcc aacccggtaa gacacgactt 9120
atcgccactg gcagcagcca ctggtaacag gattagcaga gcgaggtatg taggcggtgc 9180
tacagagttc ttgaagtggt ggcctaacta cggctacact agaagaacag tatttggtat 9240
ctgcgctctg ctgaagccag ttaccttcgg aaaaagagtt ggtagctctt gatccggcaa 9300
acaaaccacc gctggtagcg gtggtttttt tgtttgcaag cagcagatta cgcgcagaaa 9360
aaaaggatct caagaagatc ctttgatctt ttctacgggg tctgacgctc agtggaacga 9420
aaactcacgt taagggattt tggtcatgag attatcaaaa aggatcttca cctagatcct 9480
tttaaattaa aaatgaagtt ttaaatcaat ctaaagtata tatgagtaaa cttggtctga 9540
cagttaccaa tgcttaatca gtgaggcacc tatctcagcg atctgtctat ttcgttcatc 9600
catagttgcc tgactcctgc aaaccacgtt gtgtctcaaa atctctgatg ttacattgca 9660
caagataaaa atatatcatc atgaacaata aaactgtctg cttacataaa cagtaataca 9720
aggggtgtta tgagccatat tcaacgggaa acgtcttgct cgaggccgcg attaaattcc 9780
aacatggatg ctgatttata tgggtataaa tgggctcgcg ataatgtcgg gcaatcaggt 9840
gcgacaatct atcgattgta tgggaagccc gatgcgccag agttgtttct gaaacatggc 9900
aaaggtagcg ttgccaatga tgttacagat gagatggtca gactaaactg gctgacggaa 9960
tttatgcctc ttccgaccat caagcatttt atccgtactc ctgatgatgc atggttactc 10020
accactgcga tccccgggaa aacagcattc caggtattag aagaatatcc tgattcaggt 10080
gaaaatattg ttgatgcgct ggcagtgttc ctgcgccggt tgcattcgat tcctgtttgt 10140
aattgtcctt ttaacagcga tcgcgtattt cgtctcgctc aggcgcaatc acgaatgaat 10200
aacggtttgg ttgatgcgag tgattttgat gacgagcgta atggctggcc tgttgaacaa 10260
gtctggaaag aaatgcataa gcttttgcca ttctcaccgg attcagtcgt cactcatggt 10320
gatttctcac ttgataacct tatttttgac gaggggaaat taataggttg tattgatgtt 10380
ggacgagtcg gaatcgcaga ccgataccag gatcttgcca tcctatggaa ctgcctcggt 10440
gagttttctc cttcattaca gaaacggctt tttcaaaaat atggtattga taatcctgat 10500
atgaataaat tgcagtttca tttgatgctc gatgagtttt tctaagggcg gcctgccacc 10560
atacccacgc cgaaacaagc gctcatgagc ccgaagtggc gagcccgatc ttccccatcg 10620
gtgatgtcgg cgatataggc gccagcaacc gcacctgtgg cgccggtgat gagggcgcgc 10680
caagtcgacg tccggcagtc 10700
<210> 10
<211> 10700
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 10
ttggccactc cctctctgcg cgctcgctcg ctcactgagg ccgcccgggc aaagcccggg 60
cgtcgggcga cctttggtcg cccggcctca gtgagcgagc gagcgcgcag agagggagtg 120
gccaactcca tcactagggg ttcctgctag ctctgggtat ttaagcccga gtgagcacgc 180
agggtctcca ttttgaagcg ggaggttacg cgttcgtcga ctactagtgg gtaccagagc 240
tccctaggtt ctagaaccgg tgacgtctcc catggtgaag cttggatctg aattcggtac 300
ctagttatta atagtaatca attacggggt cattagttca tagcccatat atggagttcc 360
gcgttacata acttacggta aatggcccgc ctggctgacc gcccaacgac ccccgcccat 420
tgacgtcaat aatgacgtat gttcccatag taacgccaat agggactttc cattgacgtc 480
aatgggtgga gtatttacgg taaactgccc acttggcagt acatcaagtg tatcatatgc 540
caagtacgcc ccctattgac gtcaatgacg gtaaatggcc cgcctggcat tatgcccagt 600
acatgacctt atgggacttt cctacttggc agtacatcta cgtattagtc atcgctatta 660
ccatggtcga ggtgagcccc acgttctgct tcactctccc catctccccc ccctccccac 720
ccccaatttt gtatttattt attttttaat tattttgtgc agcgatgggg gcgggggggg 780
ggggggggcg cgcgccaggc ggggcggggc ggggcgaggg gcggggcggg gcgaggcgga 840
gaggtgcggc ggcagccaat cagagcggcg cgctccgaaa gtttcctttt atggcgaggc 900
ggcggcggcg gcggccctat aaaaagcgaa gcgcgcggcg ggcgggagtc gctgcgacgc 960
tgccttcgcc ccgtgccccg ctccgccgcc gcctcgcgcc gcccgccccg gctctgactg 1020
accgcgttac tcccacaggt gagcgggcgg gacggccctt ctcctccggg ctgtaattag 1080
cgcttggttt aatgacggct tgtttctttt ctgtggctgc gtgaaagcct tgaggggctc 1140
cgggagctag agcctctgct aaccatgttc atgccttctt ctttttccta cagctcctgg 1200
gcaacgtgct ggttattgtg ctgtctcatc attttggcaa agaattcctc gaagatccga 1260
agggaaagtc ttccacgact gtgggatccg ttcgaagata tcaccggttg agccaccatg 1320
gaattcagca gccccagcag agaggaatgc cccaagcctc tgagccgggt gtcaatcatg 1380
gccggatctc tgacaggact gctgctgctt caggccgtgt cttgggcttc tggcgctaga 1440
ccttgcatcc ccaagagctt cggctacagc agcgtcgtgt gcgtgtgcaa tgccacctac 1500
tgcgacagct tcgaccctcc tacctttcct gctctgggca ccttcagcag atacgagagc 1560
accagatccg gcagacggat ggaactgagc atgggaccca tccaggccaa tcacacaggc 1620
actggcctgc tgctgacact gcagcctgag cagaaattcc agaaagtgaa aggcttcggc 1680
ggagccatga cagatgccgc cgctctgaat atcctggctc tgtctccacc agctcagaac 1740
ctgctgctca agagctactt cagcgaggaa ggcatcggct acaacatcat cagagtgccc 1800
atggccagct gcgacttcag catcaggacc tacacctacg ccgacacacc cgacgatttc 1860
cagctgcaca acttcagcct gcctgaagag gacaccaagc tgaagatccc tctgatccac 1920
agagccctgc agctggcaca aagacccgtg tcactgctgg cctctccatg gacatctccc 1980
acctggctga aaacaaatgg cgccgtgaat ggcaagggca gcctgaaagg ccaacctggc 2040
gacatctacc accagacctg ggccagatac ttcgtgaagt tcctggacgc ctatgccgag 2100
cacaagctgc agttttgggc cgtgacagcc gagaacgaac cttctgctgg actgctgagc 2160
ggctacccct ttcagtgcct gggctttaca cccgagcacc agcgggactt tatcgcccgt 2220
gatctgggac ccacactggc caatagcacc caccataatg tgcggctgct gatgctggac 2280
gaccagagac tgcttctgcc ccactgggct aaagtggtgc tgacagatcc tgaggccgcc 2340
aaatacgtgc acggaatcgc cgtgcactgg tatctggact ttctggcccc tgccaaggcc 2400
acactgggag agacacacag actgttcccc aacaccatgc tgttcgccag cgaagcctgt 2460
gtgggcagca agttttggga acagagcgtg cggctcggca gctgggatag aggcatgcag 2520
tacagccaca gcatcatcac caacctgctg taccacgtcg tcggctggac cgactggaat 2580
ctggccctga atcctgaagg cggccctaac tgggtccgaa acttcgtgga cagccccatc 2640
atcgtggaca tcaccaagga caccttctac aagcagccca tgttctacca cctgggacac 2700
ttcagcaagt tcatccccga gggctctcag cgcgttggac tggtggcttc ccagaagaac 2760
gatctggacg ccgtggctct gatgcaccct gatggatctg ctgtggtggt ggtcctgaac 2820
cgcagcagca aagatgtgcc cctgaccatc aaggatcccg ccgtgggatt cctggaaaca 2880
atcagccctg gctactccat ccacacctac ctgtggcgta gacagtgaca attgttaatt 2940
aagtttaaac cctcgaggcc gcaagcttat cgataatcaa cctctggatt acaaaatttg 3000
tgaaagattg actggtattc ttaactatgt tgctcctttt acgctatgtg gatacgctgc 3060
tttaatgcct ttgtatcatg ctattgcttc ccgtatggct ttcattttct cctccttgta 3120
taaatcctgg ttgctgtctc tttatgagga gttgtggccc gttgtcaggc aacgtggcgt 3180
ggtgtgcact gtgtttgctg acgcaacccc cactggttgg ggcattgcca ccacctgtca 3240
gctcctttcc gggactttcg ctttccccct ccctattgcc acggcggaac tcatcgccgc 3300
ctgccttgcc cgctgctgga caggggctcg gctgttgggc actgacaatt ccgtggtgtt 3360
gtcggggaaa tcatcgtcct ttccttggct gctcgcctgt gttgccacct ggattctgcg 3420
cgggacgtcc ttctgctacg tcccttcggc cctcaatcca gcggaccttc cttcccgcgg 3480
cctgctgccg gctctgcggc ctcttccgcg tcttcgcctt cgccctcaga cgagtcggat 3540
ctccctttgg gccgcctccc cgcatcgata ccgtcgacta gagctcgctg atcagcctcg 3600
actgtgcctt ctagttgcca gccatctgtt gtttgcccct cccccgtgcc ttccttgacc 3660
ctggaaggtg ccactcccac tgtcctttcc taataaaatg aggaaattgc atcgcattgt 3720
ctgagtaggt gtcattctat tctggggggt ggggtggggc aggacagcaa gggggaggat 3780
tgggaagaca atagcaggca tgctggggag agatccacga taacaaacag cttttttggg 3840
gtgaacatat tgactgaatt ccctgcagga ggaaccccta gtgatggagt tggccactcc 3900
ctctctgcgc gctcgctcgc tcactgaggc cgcccgggca aagcccgggc gtcgggcgac 3960
ctttggtcgc ccggcctcag tgagcgagcg agcgcgcaga gagggagtgg ccaagcggcc 4020
gctcgtacgg tctcgaggaa ttcctgcagg ataacttgcc aacctcattc taaaatgtat 4080
atagaagccc aaaagacaat aacaaaaata ttcttgtaga acaaaatggg aaagaatgtt 4140
ccactaaata tcaagattta gagcaaagca tgagatgtgt ggggatagac agtgaggctg 4200
ataaaataga gtagagctca gaaacagacc cattgatata tgtaagtgac ctatgaaaaa 4260
aatatggcat tttacaatgg gaaaatgatg gtctttttct tttttagaaa aacagggaaa 4320
tatatttata tgtaaaaaat aaaagggaac ccatatgtca taccatacac acaaaaaaat 4380
tccagtgaat tataagtcta aatggagaag gcaaaacttt aaatctttta gaaaataata 4440
tagaagcatg cagaccagcc tggccaacat gatgaaaccc tctctactaa taataaaatc 4500
agtagaacta ctcaggacta ctttgagtgg gaagtccttt tctatgaaga cttctttggc 4560
caaaattagg ctctaaatgc aaggagatag tgcatcatgc ctggctgcac ttactgataa 4620
atgatgttat caccatcttt aaccaaatgc acaggaacaa gttatggtac tgatgtgctg 4680
gattgagaag gagctctact tccttgacag gacacatttg tatcaactta aaaaagcaga 4740
tttttgccag cagaactatt cattcagagg taggaaactt agaatagatg atgtcactga 4800
ttagcatggc ttccccatct ccacagctgc ttcccaccca ggttgcccac agttgagttt 4860
gtccagtgct cagggctgcc cactctcagt aagaagcccc acaccagccc ctctccaaat 4920
atgttggctg ttccttccat taaagtgacc ccactttaga gcagcaagtg gatttctgtt 4980
tcttacagtt caggaaggag gagtcagctg tgagaacctg gagcctgaga tgcttctaag 5040
tcccactgct actggggtca gggaagccag actccagcat cagcagtcag gagcactaag 5100
cccttgccaa catcctgttt ctcagagaaa ctgcttccat tataatggtt gtcctttttt 5160
aagctatcaa gccaaacaac cagtgtctac cattattctc atcacctgaa gccaagggtt 5220
ctagcaaaag tcaagctgtc ttgtaatggt tgatgtgcct ccagcttctg tcttcagtca 5280
ctccactctt agcctgctct gaatcaactc tgaccacagt tccctggagc ccctgccacc 5340
tgctgcccct gccaccttct ccatctgcag tgctgtgcag ccttctgcac tcttgcagag 5400
ctaataggtg gagacttgaa ggaagaggag gaaagtttct cataatagcc ttgctgcaag 5460
ctcaaatggg aggtgggcac tgtgcccagg agccttggag caaaggctgt gcccaacctc 5520
tgactgcatc caggtttggt cttgacagag ataagaagcc ctggcttttg gagccaaaat 5580
ctaggtcaga cttaggcagg attctcaaag tttatcagca gaacatgagg cagaagaccc 5640
tttctgctcc agcttcttca ggctcaacct tcatcagaat agatagaaag agaggctgtg 5700
agggttctta aaacagaagc aaatctgact cagagaataa acaacctcct agtaaactac 5760
agcttagaca gagcatctgg tggtgagtgt gctcagtgtc ctactcaact gtctggtatc 5820
agccctcatg aggacttctc ttctttccct catagacctc catctctgtt ttccttagcc 5880
tgcagaaatc tggatggcta ttcacagaat gcctgtgctt tcagagttgc attttttctc 5940
tggtattctg gttcaagcat ttgaaggtag gaaaggttct ccaagtgcaa gaaagccagc 6000
cctgagcctc aactgcctgg ctagtgtggt cagtaggatg caaaggctgt tgaatgccac 6060
aaggccaaac tttaacctgt gtaccacaag cctagcagca gaggcagctc tgctcactgg 6120
aactctctgt cttctttctc ctgagccttt tcttttcctg agttttctag ctctcctcaa 6180
ccttacctct gccctaccca ggacaaaccc aagagccact gtttctgtga tgtcctctcc 6240
agccctaatt aggcatcatg acttcagcct gaccttccat gctcagaagc agtgctaatc 6300
cacttcagat gagctgctct atgcaacaca ggcagagcct acaaaccttt gcaccagagc 6360
cctccacata tcagtgtttg ttcatactca cttcaacagc aaatgtgact gctgagatta 6420
agattttaca caagatggtc tgtaatttca cagttagttt tatcccatta ggtatgaaag 6480
aattagcata attcccctta aacatgaatg aatcttagat tttttaataa atagttttgg 6540
aagtaaagac agagacatca ggagcacaag gaatagcctg agaggacaaa cagaacaaga 6600
aagagtctgg aaatacacag gatgttcttg gcctcctcaa agcaagtgca agcagatagt 6660
accagcagcc ccaggctatc agagcccagt gaagagaagt accatgaaag ccacagctct 6720
aaccaccctg ttccagagtg acagacagtc cccaagacaa gccagcctga gccagagaga 6780
gaactgcaag agaaagtttc taatttaggt tctgttagat tcagacaagt gcaggtcatc 6840
ctctctccac agctactcac ctctccagcc taacaaagcc tgcagtccac actccaaccc 6900
tggtgtctca cctcctagcc tctcccaaca tcctgctctc tgaccatctt ctgcatctct 6960
catctcacca tctcccactg tctacagcct actcttgcaa ctaccatctc attttctgac 7020
atcctgtcta catcttctgc catactctgc catctaccat accacctctt accatctacc 7080
acaccatctt ttatctccat ccctctcaga agcctccaag ctgaatcctg ctttatgtgt 7140
tcatctcagc ccctgcatgg aaagctgacc ccagaggcag aactattccc agagagcttg 7200
gccaagaaaa acaaaactac cagcctggcc aggctcagga gtagtaagct gcagtgtctg 7260
ttgtgttcta gcttcaacag ctgcaggagt tccactctca aatgctccac atttctcaca 7320
tcctcctgat tctggtcact acccatcttc aaagaacaga atatctcaca tcagcatact 7380
gtgaaggact agtcatgggt gcagctgctc agagctgcaa agtcattctg gatggtggag 7440
agcttacaaa catttcatga tgctcccccc gctctgatgg ctggagccca atccctacac 7500
agactcctgc tgtatgtgtt ttcctttcac tctgagccac agccagaggg caggcattca 7560
gtctcctctt caggctgggg ctggggcact gagaactcac ccaacacctt gctctcactc 7620
cttctgcaaa acaagaaaga gctttgtgct gcagtagcca tgaagaatga aaggaaggct 7680
ttaactaaaa aatgtcagag attattttca accccttact gtggatcacc agcaaggagg 7740
aaacacaaca cagagacatt ttttcccctc aaattatcaa aagaatcact gcatttgtta 7800
aagagagcaa ctgaatcagg aagcagagtt ttgaacatat cagaagttag gaatctgcat 7860
cagagacaaa tgcagtcatg gttgtttgct gcataccagc cctaatcatt agaagcctca 7920
tggacttcaa acatcattcc ctctgacaag atgctctagc ctaactccat gagataaaat 7980
aaatctgcct ttcagagcca aagaagagtc caccagcttc ttctcagtgt gaacaagagc 8040
tccagtcagg ttagtcagtc cagtgcagta gaggagacca gtctgcatcc tctaattttc 8100
aaaggcaaga agatttgttt accctggaca ccaggcacaa gtgaggtcac agagctctta 8160
gatatgcagt cctcatgagt gaggagacta aagcgcatgc catcaagact tcagtgtaga 8220
gaaaacctcc aaaaaagcct cctcactact tctggaatag ctcagaggcc gaggcggcct 8280
cggcctctgc ataaataaaa aaaattagtc agccatgggg cggagaatgg gcggaactgg 8340
gcggagttag gggcgggatg ggcggagtta ggggcgggac tatggttgct gactaattga 8400
gatgcatgct ttgcatactt ctgcctgctg gggagcctgg ggactttcca cacctggttg 8460
ctgactaatt gagatgcatg ctttgcatac ttctgcctgc tggggagcct ggggactttc 8520
cacaccctaa ctgacacaca ttccacagct gcattaatga atcggccaac gcgcggggag 8580
aggcggtttg cgtattgggc gctcttccgc ttcctcgctc actgactcgc tgcgctcggt 8640
cgttcggctg cggcgagcgg tatcagctca ctcaaaggcg gtaatacggt tatccacaga 8700
atcaggggat aacgcaggaa agaacatgtg agcaaaaggc cagcaaaagg ccaggaaccg 8760
taaaaaggcc gcgttgctgg cgtttttcca taggctccgc ccccctgacg agcatcacaa 8820
aaatcgacgc tcaagtcaga ggtggcgaaa cccgacagga ctataaagat accaggcgtt 8880
tccccctgga agctccctcg tgcgctctcc tgttccgacc ctgccgctta ccggatacct 8940
gtccgccttt ctcccttcgg gaagcgtggc gctttctcat agctcacgct gtaggtatct 9000
cagttcggtg taggtcgttc gctccaagct gggctgtgtg cacgaacccc ccgttcagcc 9060
cgaccgctgc gccttatccg gtaactatcg tcttgagtcc aacccggtaa gacacgactt 9120
atcgccactg gcagcagcca ctggtaacag gattagcaga gcgaggtatg taggcggtgc 9180
tacagagttc ttgaagtggt ggcctaacta cggctacact agaagaacag tatttggtat 9240
ctgcgctctg ctgaagccag ttaccttcgg aaaaagagtt ggtagctctt gatccggcaa 9300
acaaaccacc gctggtagcg gtggtttttt tgtttgcaag cagcagatta cgcgcagaaa 9360
aaaaggatct caagaagatc ctttgatctt ttctacgggg tctgacgctc agtggaacga 9420
aaactcacgt taagggattt tggtcatgag attatcaaaa aggatcttca cctagatcct 9480
tttaaattaa aaatgaagtt ttaaatcaat ctaaagtata tatgagtaaa cttggtctga 9540
cagttaccaa tgcttaatca gtgaggcacc tatctcagcg atctgtctat ttcgttcatc 9600
catagttgcc tgactcctgc aaaccacgtt gtgtctcaaa atctctgatg ttacattgca 9660
caagataaaa atatatcatc atgaacaata aaactgtctg cttacataaa cagtaataca 9720
aggggtgtta tgagccatat tcaacgggaa acgtcttgct cgaggccgcg attaaattcc 9780
aacatggatg ctgatttata tgggtataaa tgggctcgcg ataatgtcgg gcaatcaggt 9840
gcgacaatct atcgattgta tgggaagccc gatgcgccag agttgtttct gaaacatggc 9900
aaaggtagcg ttgccaatga tgttacagat gagatggtca gactaaactg gctgacggaa 9960
tttatgcctc ttccgaccat caagcatttt atccgtactc ctgatgatgc atggttactc 10020
accactgcga tccccgggaa aacagcattc caggtattag aagaatatcc tgattcaggt 10080
gaaaatattg ttgatgcgct ggcagtgttc ctgcgccggt tgcattcgat tcctgtttgt 10140
aattgtcctt ttaacagcga tcgcgtattt cgtctcgctc aggcgcaatc acgaatgaat 10200
aacggtttgg ttgatgcgag tgattttgat gacgagcgta atggctggcc tgttgaacaa 10260
gtctggaaag aaatgcataa gcttttgcca ttctcaccgg attcagtcgt cactcatggt 10320
gatttctcac ttgataacct tatttttgac gaggggaaat taataggttg tattgatgtt 10380
ggacgagtcg gaatcgcaga ccgataccag gatcttgcca tcctatggaa ctgcctcggt 10440
gagttttctc cttcattaca gaaacggctt tttcaaaaat atggtattga taatcctgat 10500
atgaataaat tgcagtttca tttgatgctc gatgagtttt tctaagggcg gcctgccacc 10560
atacccacgc cgaaacaagc gctcatgagc ccgaagtggc gagcccgatc ttccccatcg 10620
gtgatgtcgg cgatataggc gccagcaacc gcacctgtgg cgccggtgat gagggcgcgc 10680
caagtcgacg tccggcagtc 10700
<210> 11
<211> 11188
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 11
ttggccactc cctctctgcg cgctcgctcg ctcactgagg ccgggcgacc aaaggtcgcc 60
cgacgcccgg gctttgcccg ggcggcctca gtgagcgagc gagcgcgcag agagggagtg 120
gccaactatt agatctgatg gccgcgctag ctctgggtat ttaagcccga gtgagcacgc 180
agggtctcca ttttgaagcg ggaggttacg cgttcgtcga ctactagtgg gtaccagagc 240
gtggtgactg agatgttttc taggaaacac aaaagataca aaaaagaaca cgtggaagga 300
tagccaaaaa ggggggctgc ccccatttcc tgcaccccgc tgcgatggct ggcaccattt 360
ggaagacttc gagatacact gttgagcgca gtaagacaac agtgtatctc gaagtcttcc 420
agatggggcc agccggtcca ctctgtatcc aggccagttc tgcaaggcgt tcgaggacca 480
cccccctccc ctcgccacca gggtggtctc atacagaact tataagattc ccaaatccaa 540
agacatttca cgtttatggt gatttcccag aacacatagc gacatgcaaa tattgcaggg 600
cgccactccc ctgtccctca cagccatctt cctgccaggg cgcacgcgcg ctgggtgttc 660
ccgcctagtg acactgggcc cgcgattcct tggagcgggt tgatgacgtc agcgtttccc 720
atggtgaatc cctaggttct agaaccggtg acgtctccca tggtgaagct tggatctgaa 780
ttcggtacct agttattaat agtaatcaat tacggggtca ttagttcata gcccatatat 840
ggagttccgc gttacataac ttacggtaaa tggcccgcct ggctgaccgc ccaacgaccc 900
ccgcccattg acgtcaataa tgacgtatgt tcccatagta acgccaatag ggactttcca 960
ttgacgtcaa tgggtggagt atttacggta aactgcccac ttggcagtac atcaagtgta 1020
tcatatgcca agtacgcccc ctattgacgt caatgacggt aaatggcccg cctggcatta 1080
tgcccagtac atgaccttat gggactttcc tacttggcag tacatctacg tattagtcat 1140
cgctattacc atggtcgagg tgagccccac gttctgcttc actctcccca tctccccccc 1200
ctccccaccc ccaattttgt atttatttat tttttaatta ttttgtgcag cgatgggggc 1260
gggggggggg ggggggcgcg cgccaggcgg ggcggggcgg ggcgaggggc ggggcggggc 1320
gaggcggaga ggtgcggcgg cagccaatca gagcggcgcg ctccgaaagt ttccttttat 1380
ggcgaggcgg cggcggcggc ggccctataa aaagcgaagc gcgcggcggg cgggagtcgc 1440
tgcgacgctg ccttcgcccc gtgccccgct ccgccgccgc ctcgcgccgc ccgccccggc 1500
tctgactgac cgcgttactc ccacaggtga gcgggcggga cggcccttct cctccgggct 1560
gtaattagcg cttggtttaa tgacggcttg tttcttttct gtggctgcgt gaaagccttg 1620
aggggctccg ggagctagag cctctgctaa ccatgttcat gccttcttct ttttcctaca 1680
gctcctgggc aacgtgctgg ttattgtgct gtctcatcat tttggcaaag aattcctcga 1740
agatccgaag ggaaagtctt ccacgactgt gggatccgtt cgaagatatc accggttgag 1800
ccaccatgga attcagcagc cccagcagag aggaatgccc caagcctctg agccgggtgt 1860
caatcatggc cggatctctg acaggactgc tgctgcttca ggccgtgtct tgggcttctg 1920
gcgctagacc ttgcatcccc aagagcttcg gctacagcag cgtcgtgtgc gtgtgcaatg 1980
ccacctactg cgacagcttc gaccctccta cctttcctgc tctgggcacc ttcagcagat 2040
acgagagcac cagatccggc agacggatgg aactgagcat gggacccatc caggccaatc 2100
acacaggcac tggcctgctg ctgacactgc agcctgagca gaaattccag aaagtgaaag 2160
gcttcggcgg agccatgaca gatgccgccg ctctgaatat cctggctctg tctccaccag 2220
ctcagaacct gctgctcaag agctacttca gcgaggaagg catcggctac aacatcatca 2280
gagtgcccat ggccagctgc gacttcagca tcaggaccta cacctacgcc gacacacccg 2340
acgatttcca gctgcacaac ttcagcctgc ctgaagagga caccaagctg aagatccctc 2400
tgatccacag agccctgcag ctggcacaaa gacccgtgtc actgctggcc tctccatgga 2460
catctcccac ctggctgaaa acaaatggcg ccgtgaatgg caagggcagc ctgaaaggcc 2520
aacctggcga catctaccac cagacctggg ccagatactt cgtgaagttc ctggacgcct 2580
atgccgagca caagctgcag ttttgggccg tgacagccga gaacgaacct tctgctggac 2640
tgctgagcgg ctaccccttt cagtgcctgg gctttacacc cgagcaccag cgggacttta 2700
tcgcccgtga tctgggaccc acactggcca atagcaccca ccataatgtg cggctgctga 2760
tgctggacga ccagagactg cttctgcccc actgggctaa agtggtgctg acagatcctg 2820
aggccgccaa atacgtgcac ggaatcgccg tgcactggta tctggacttt ctggcccctg 2880
ccaaggccac actgggagag acacacagac tgttccccaa caccatgctg ttcgccagcg 2940
aagcctgtgt gggcagcaag ttttgggaac agagcgtgcg gctcggcagc tgggatagag 3000
gcatgcagta cagccacagc atcatcacca acctgctgta ccacgtcgtc ggctggaccg 3060
actggaatct ggccctgaat cctgaaggcg gccctaactg ggtccgaaac ttcgtggaca 3120
gccccatcat cgtggacatc accaaggaca ccttctacaa gcagcccatg ttctaccacc 3180
tgggacactt cagcaagttc atccccgagg gctctcagcg cgttggactg gtggcttccc 3240
agaagaacga tctggacgcc gtggctctga tgcaccctga tggatctgct gtggtggtgg 3300
tcctgaaccg cagcagcaaa gatgtgcccc tgaccatcaa ggatcccgcc gtgggattcc 3360
tggaaacaat cagccctggc tactccatcc acacctacct gtggcgtaga cagtgacaat 3420
tgttaattaa gtttaaaccc tcgaggccgc aagcttatcg ataatcaacc tctggattac 3480
aaaatttgtg aaagattgac tggtattctt aactatgttg ctccttttac gctatgtgga 3540
tacgctgctt taatgccttt gtatcatgct attgcttccc gtatggcttt cattttctcc 3600
tccttgtata aatcctggtt gctgtctctt tatgaggagt tgtggcccgt tgtcaggcaa 3660
cgtggcgtgg tgtgcactgt gtttgctgac gcaaccccca ctggttgggg cattgccacc 3720
acctgtcagc tcctttccgg gactttcgct ttccccctcc ctattgccac ggcggaactc 3780
atcgccgcct gccttgcccg ctgctggaca ggggctcggc tgttgggcac tgacaattcc 3840
gtggtgttgt cggggaaatc atcgtccttt ccttggctgc tcgcctgtgt tgccacctgg 3900
attctgcgcg ggacgtcctt ctgctacgtc ccttcggccc tcaatccagc ggaccttcct 3960
tcccgcggcc tgctgccggc tctgcggcct cttccgcgtc ttcgccttcg ccctcagacg 4020
agtcggatct ccctttgggc cgcctccccg catcgatacc gtcgactaga gctcgctgat 4080
cagcctcgac tgtgccttct agttgccagc catctgttgt ttgcccctcc cccgtgcctt 4140
ccttgaccct ggaaggtgcc actcccactg tcctttccta ataaaatgag gaaattgcat 4200
cgcattgtct gagtaggtgt cattctattc tggggggtgg ggtggggcag gacagcaagg 4260
gggaggattg ggaagacaat agcaggcatg ctggggagag atccacgata acaaacagct 4320
tttttggggt gaacatattg actgaattcc ctgcaggttg gccactccct ctctgcgcgc 4380
tcgctcgctc actgaggccg cccgggcaaa gcccgggcgt cgggcgacct ttggtcgccc 4440
ggcctcagtg agcgagcgag cgcgcagaga gggagtggcc aactccatca ctaggggttc 4500
ctgcggccgc tcgtacggtc tcgaggaatt cctgcaggat aacttgccaa cctcattcta 4560
aaatgtatat agaagcccaa aagacaataa caaaaatatt cttgtagaac aaaatgggaa 4620
agaatgttcc actaaatatc aagatttaga gcaaagcatg agatgtgtgg ggatagacag 4680
tgaggctgat aaaatagagt agagctcaga aacagaccca ttgatatatg taagtgacct 4740
atgaaaaaaa tatggcattt tacaatggga aaatgatggt ctttttcttt tttagaaaaa 4800
cagggaaata tatttatatg taaaaaataa aagggaaccc atatgtcata ccatacacac 4860
aaaaaaattc cagtgaatta taagtctaaa tggagaaggc aaaactttaa atcttttaga 4920
aaataatata gaagcatgca gaccagcctg gccaacatga tgaaaccctc tctactaata 4980
ataaaatcag tagaactact caggactact ttgagtggga agtccttttc tatgaagact 5040
tctttggcca aaattaggct ctaaatgcaa ggagatagtg catcatgcct ggctgcactt 5100
actgataaat gatgttatca ccatctttaa ccaaatgcac aggaacaagt tatggtactg 5160
atgtgctgga ttgagaagga gctctacttc cttgacagga cacatttgta tcaacttaaa 5220
aaagcagatt tttgccagca gaactattca ttcagaggta ggaaacttag aatagatgat 5280
gtcactgatt agcatggctt ccccatctcc acagctgctt cccacccagg ttgcccacag 5340
ttgagtttgt ccagtgctca gggctgccca ctctcagtaa gaagccccac accagcccct 5400
ctccaaatat gttggctgtt ccttccatta aagtgacccc actttagagc agcaagtgga 5460
tttctgtttc ttacagttca ggaaggagga gtcagctgtg agaacctgga gcctgagatg 5520
cttctaagtc ccactgctac tggggtcagg gaagccagac tccagcatca gcagtcagga 5580
gcactaagcc cttgccaaca tcctgtttct cagagaaact gcttccatta taatggttgt 5640
ccttttttaa gctatcaagc caaacaacca gtgtctacca ttattctcat cacctgaagc 5700
caagggttct agcaaaagtc aagctgtctt gtaatggttg atgtgcctcc agcttctgtc 5760
ttcagtcact ccactcttag cctgctctga atcaactctg accacagttc cctggagccc 5820
ctgccacctg ctgcccctgc caccttctcc atctgcagtg ctgtgcagcc ttctgcactc 5880
ttgcagagct aataggtgga gacttgaagg aagaggagga aagtttctca taatagcctt 5940
gctgcaagct caaatgggag gtgggcactg tgcccaggag ccttggagca aaggctgtgc 6000
ccaacctctg actgcatcca ggtttggtct tgacagagat aagaagccct ggcttttgga 6060
gccaaaatct aggtcagact taggcaggat tctcaaagtt tatcagcaga acatgaggca 6120
gaagaccctt tctgctccag cttcttcagg ctcaaccttc atcagaatag atagaaagag 6180
aggctgtgag ggttcttaaa acagaagcaa atctgactca gagaataaac aacctcctag 6240
taaactacag cttagacaga gcatctggtg gtgagtgtgc tcagtgtcct actcaactgt 6300
ctggtatcag ccctcatgag gacttctctt ctttccctca tagacctcca tctctgtttt 6360
ccttagcctg cagaaatctg gatggctatt cacagaatgc ctgtgctttc agagttgcat 6420
tttttctctg gtattctggt tcaagcattt gaaggtagga aaggttctcc aagtgcaaga 6480
aagccagccc tgagcctcaa ctgcctggct agtgtggtca gtaggatgca aaggctgttg 6540
aatgccacaa ggccaaactt taacctgtgt accacaagcc tagcagcaga ggcagctctg 6600
ctcactggaa ctctctgtct tctttctcct gagccttttc ttttcctgag ttttctagct 6660
ctcctcaacc ttacctctgc cctacccagg acaaacccaa gagccactgt ttctgtgatg 6720
tcctctccag ccctaattag gcatcatgac ttcagcctga ccttccatgc tcagaagcag 6780
tgctaatcca cttcagatga gctgctctat gcaacacagg cagagcctac aaacctttgc 6840
accagagccc tccacatatc agtgtttgtt catactcact tcaacagcaa atgtgactgc 6900
tgagattaag attttacaca agatggtctg taatttcaca gttagtttta tcccattagg 6960
tatgaaagaa ttagcataat tccccttaaa catgaatgaa tcttagattt tttaataaat 7020
agttttggaa gtaaagacag agacatcagg agcacaagga atagcctgag aggacaaaca 7080
gaacaagaaa gagtctggaa atacacagga tgttcttggc ctcctcaaag caagtgcaag 7140
cagatagtac cagcagcccc aggctatcag agcccagtga agagaagtac catgaaagcc 7200
acagctctaa ccaccctgtt ccagagtgac agacagtccc caagacaagc cagcctgagc 7260
cagagagaga actgcaagag aaagtttcta atttaggttc tgttagattc agacaagtgc 7320
aggtcatcct ctctccacag ctactcacct ctccagccta acaaagcctg cagtccacac 7380
tccaaccctg gtgtctcacc tcctagcctc tcccaacatc ctgctctctg accatcttct 7440
gcatctctca tctcaccatc tcccactgtc tacagcctac tcttgcaact accatctcat 7500
tttctgacat cctgtctaca tcttctgcca tactctgcca tctaccatac cacctcttac 7560
catctaccac accatctttt atctccatcc ctctcagaag cctccaagct gaatcctgct 7620
ttatgtgttc atctcagccc ctgcatggaa agctgacccc agaggcagaa ctattcccag 7680
agagcttggc caagaaaaac aaaactacca gcctggccag gctcaggagt agtaagctgc 7740
agtgtctgtt gtgttctagc ttcaacagct gcaggagttc cactctcaaa tgctccacat 7800
ttctcacatc ctcctgattc tggtcactac ccatcttcaa agaacagaat atctcacatc 7860
agcatactgt gaaggactag tcatgggtgc agctgctcag agctgcaaag tcattctgga 7920
tggtggagag cttacaaaca tttcatgatg ctccccccgc tctgatggct ggagcccaat 7980
ccctacacag actcctgctg tatgtgtttt cctttcactc tgagccacag ccagagggca 8040
ggcattcagt ctcctcttca ggctggggct ggggcactga gaactcaccc aacaccttgc 8100
tctcactcct tctgcaaaac aagaaagagc tttgtgctgc agtagccatg aagaatgaaa 8160
ggaaggcttt aactaaaaaa tgtcagagat tattttcaac cccttactgt ggatcaccag 8220
caaggaggaa acacaacaca gagacatttt ttcccctcaa attatcaaaa gaatcactgc 8280
atttgttaaa gagagcaact gaatcaggaa gcagagtttt gaacatatca gaagttagga 8340
atctgcatca gagacaaatg cagtcatggt tgtttgctgc ataccagccc taatcattag 8400
aagcctcatg gacttcaaac atcattccct ctgacaagat gctctagcct aactccatga 8460
gataaaataa atctgccttt cagagccaaa gaagagtcca ccagcttctt ctcagtgtga 8520
acaagagctc cagtcaggtt agtcagtcca gtgcagtaga ggagaccagt ctgcatcctc 8580
taattttcaa aggcaagaag atttgtttac cctggacacc aggcacaagt gaggtcacag 8640
agctcttaga tatgcagtcc tcatgagtga ggagactaaa gcgcatgcca tcaagacttc 8700
agtgtagaga aaacctccaa aaaagcctcc tcactacttc tggaatagct cagaggccga 8760
ggcggcctcg gcctctgcat aaataaaaaa aattagtcag ccatggggcg gagaatgggc 8820
ggaactgggc ggagttaggg gcgggatggg cggagttagg ggcgggacta tggttgctga 8880
ctaattgaga tgcatgcttt gcatacttct gcctgctggg gagcctgggg actttccaca 8940
cctggttgct gactaattga gatgcatgct ttgcatactt ctgcctgctg gggagcctgg 9000
ggactttcca caccctaact gacacacatt ccacagctgc attaatgaat cggccaacgc 9060
gcggggagag gcggtttgcg tattgggcgc tcttccgctt cctcgctcac tgactcgctg 9120
cgctcggtcg ttcggctgcg gcgagcggta tcagctcact caaaggcggt aatacggtta 9180
tccacagaat caggggataa cgcaggaaag aacatgtgag caaaaggcca gcaaaaggcc 9240
aggaaccgta aaaaggccgc gttgctggcg tttttccata ggctccgccc ccctgacgag 9300
catcacaaaa atcgacgctc aagtcagagg tggcgaaacc cgacaggact ataaagatac 9360
caggcgtttc cccctggaag ctccctcgtg cgctctcctg ttccgaccct gccgcttacc 9420
ggatacctgt ccgcctttct cccttcggga agcgtggcgc tttctcatag ctcacgctgt 9480
aggtatctca gttcggtgta ggtcgttcgc tccaagctgg gctgtgtgca cgaacccccc 9540
gttcagcccg accgctgcgc cttatccggt aactatcgtc ttgagtccaa cccggtaaga 9600
cacgacttat cgccactggc agcagccact ggtaacagga ttagcagagc gaggtatgta 9660
ggcggtgcta cagagttctt gaagtggtgg cctaactacg gctacactag aagaacagta 9720
tttggtatct gcgctctgct gaagccagtt accttcggaa aaagagttgg tagctcttga 9780
tccggcaaac aaaccaccgc tggtagcggt ggtttttttg tttgcaagca gcagattacg 9840
cgcagaaaaa aaggatctca agaagatcct ttgatctttt ctacggggtc tgacgctcag 9900
tggaacgaaa actcacgtta agggattttg gtcatgagat tatcaaaaag gatcttcacc 9960
tagatccttt taaattaaaa atgaagtttt aaatcaatct aaagtatata tgagtaaact 10020
tggtctgaca gttaccaatg cttaatcagt gaggcaccta tctcagcgat ctgtctattt 10080
cgttcatcca tagttgcctg actcctgcaa accacgttgt gtctcaaaat ctctgatgtt 10140
acattgcaca agataaaaat atatcatcat gaacaataaa actgtctgct tacataaaca 10200
gtaatacaag gggtgttatg agccatattc aacgggaaac gtcttgctcg aggccgcgat 10260
taaattccaa catggatgct gatttatatg ggtataaatg ggctcgcgat aatgtcgggc 10320
aatcaggtgc gacaatctat cgattgtatg ggaagcccga tgcgccagag ttgtttctga 10380
aacatggcaa aggtagcgtt gccaatgatg ttacagatga gatggtcaga ctaaactggc 10440
tgacggaatt tatgcctctt ccgaccatca agcattttat ccgtactcct gatgatgcat 10500
ggttactcac cactgcgatc cccgggaaaa cagcattcca ggtattagaa gaatatcctg 10560
attcaggtga aaatattgtt gatgcgctgg cagtgttcct gcgccggttg cattcgattc 10620
ctgtttgtaa ttgtcctttt aacagcgatc gcgtatttcg tctcgctcag gcgcaatcac 10680
gaatgaataa cggtttggtt gatgcgagtg attttgatga cgagcgtaat ggctggcctg 10740
ttgaacaagt ctggaaagaa atgcataagc ttttgccatt ctcaccggat tcagtcgtca 10800
ctcatggtga tttctcactt gataacctta tttttgacga ggggaaatta ataggttgta 10860
ttgatgttgg acgagtcgga atcgcagacc gataccagga tcttgccatc ctatggaact 10920
gcctcggtga gttttctcct tcattacaga aacggctttt tcaaaaatat ggtattgata 10980
atcctgatat gaataaattg cagtttcatt tgatgctcga tgagtttttc taagggcggc 11040
ctgccaccat acccacgccg aaacaagcgc tcatgagccc gaagtggcga gcccgatctt 11100
ccccatcggt gatgtcggcg atataggcgc cagcaaccgc acctgtggcg ccggtgatga 11160
gggcgcgcca agtcgacgtc cggcagtc 11188
<210> 12
<211> 11187
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 12
ctagaaccgg tgacgtctcc catggtgaag cttggatctg aattcggtac ctagttataa 60
tagtaatcaa ttacggggtc attagttcat agcccatata tggagttccg cgttacataa 120
cttacggtaa atggcccgcc tggctgaccg cccaacgacc cccgcccatt gacgtcaata 180
atgacgtatg ttcccatagt aacgccaata gggactttcc attgacgtca atgggtggag 240
tatttacggt aaactgccca cttggcagta catcaagtgt atcatatgcc aagtacgccc 300
cctattgacg tcaatgacgg taaatggccc gcctggcatt atgcccagta catgacctta 360
tgggactttc ctacttggca gtacatctac gtattagtca tcgctattac catggtcgag 420
gtgagcccca cgttctgctt cactctcccc atctcccccc cctccccacc cccaattttg 480
tatttattta ttttttaatt attttgtgca gcgatggggg cggggggggg gggggggcgc 540
gcgccaggcg gggcggggcg gggcgagggg cggggcgggg cgaggcggag aggtgcggcg 600
gcagccaatc agagcggcgc gctccgaaag tttcctttta tggcgaggcg gcggcggcgg 660
cggccctata aaaagcgaag cgcgcggcgg gcgggagtcg ctgcgacgct gccttcgccc 720
cgtgccccgc tccgccgccg cctcgcgccg cccgccccgg ctctgactga ccgcgttact 780
cccacaggtg agcgggcggg acggcccttc tcctccgggc tgtaattagc gcttggttta 840
atgacggctt gtttcttttc tgtggctgcg tgaaagcctt gaggggctcc gggagctaga 900
gcctctgcta accatgttca tgccttcttc tttttcctac agctcctggg caacgtgctg 960
gttattgtgc tgtctcatca ttttggcaaa gaattcctcg aagatccgaa gggaaagtct 1020
tccacgactg tgggatccgt tcgaagatat caccggttga gccaccatgg aattcagcag 1080
ccccagcaga gaggaatgcc ccaagcctct gagccgggtg tcaatcatgg ccggatctct 1140
gacaggactg ctgctgcttc aggccgtgtc ttgggcttct ggcgctagac cttgcatccc 1200
caagagcttc ggctacagca gcgtcgtgtg cgtgtgcaat gccacctact gcgacagctt 1260
cgaccctcct acctttcctg ctctgggcac cttcagcaga tacgagagca ccagatccgg 1320
cagacggatg gaactgagca tgggacccat ccaggccaat cacacaggca ctggcctgct 1380
gctgacactg cagcctgagc agaaattcca gaaagtgaaa ggcttcggcg gagccatgac 1440
agatgccgcc gctctgaata tcctggctct gtctccacca gctcagaacc tgctgctcaa 1500
gagctacttc agcgaggaag gcatcggcta caacatcatc agagtgccca tggccagctg 1560
cgacttcagc atcaggacct acacctacgc cgacacaccc gacgatttcc agctgcacaa 1620
cttcagcctg cctgaagagg acaccaagct gaagatccct ctgatccaca gagccctgca 1680
gctggcacaa agacccgtgt cactgctggc ctctccatgg acatctccca cctggctgaa 1740
aacaaatggc gccgtgaatg gcaagggcag cctgaaaggc caacctggcg acatctacca 1800
ccagacctgg gccagatact tcgtgaagtt cctggacgcc tatgccgagc acaagctgca 1860
gttttgggcc gtgacagccg agaacgaacc ttctgctgga ctgctgagcg gctacccctt 1920
tcagtgcctg ggctttacac ccgagcacca gcgggacttt atcgcccgtg atctgggacc 1980
cacactggcc aatagcaccc accataatgt gcggctgctg atgctggacg accagagact 2040
gcttctgccc cactgggcta aagtggtgct gacagatcct gaggccgcca aatacgtgca 2100
cggaatcgcc gtgcactggt atctggactt tctggcccct gccaaggcca cactgggaga 2160
gacacacaga ctgttcccca acaccatgct gttcgccagc gaagcctgtg tgggcagcaa 2220
gttttgggaa cagagcgtgc ggctcggcag ctgggataga ggcatgcagt acagccacag 2280
catcatcacc aacctgctgt accacgtcgt cggctggacc gactggaatc tggccctgaa 2340
tcctgaaggc ggccctaact gggtccgaaa cttcgtggac agccccatca tcgtggacat 2400
caccaaggac accttctaca agcagcccat gttctaccac ctgggacact tcagcaagtt 2460
catccccgag ggctctcagc gcgttggact ggtggcttcc cagaagaacg atctggacgc 2520
cgtggctctg atgcaccctg atggatctgc tgtggtggtg gtcctgaacc gcagcagcaa 2580
agatgtgccc ctgaccatca aggatcccgc cgtgggattc ctggaaacaa tcagccctgg 2640
ctactccatc cacacctacc tgtggcgtag acagtgacaa ttgttaatta agtttaaacc 2700
ctcgaggccg caagcttatc gataatcaac ctctggatta caaaatttgt gaaagattga 2760
ctggtattct taactatgtt gctcctttta cgctatgtgg atacgctgct ttaatgcctt 2820
tgtatcatgc tattgcttcc cgtatggctt tcattttctc ctccttgtat aaatcctggt 2880
tgctgtctct ttatgaggag ttgtggcccg ttgtcaggca acgtggcgtg gtgtgcactg 2940
tgtttgctga cgcaaccccc actggttggg gcattgccac cacctgtcag ctcctttccg 3000
ggactttcgc tttccccctc cctattgcca cggcggaact catcgccgcc tgccttgccc 3060
gctgctggac aggggctcgg ctgttgggca ctgacaattc cgtggtgttg tcggggaaat 3120
catcgtcctt tccttggctg ctcgcctgtg ttgccacctg gattctgcgc gggacgtcct 3180
tctgctacgt cccttcggcc ctcaatccag cggaccttcc ttcccgcggc ctgctgccgg 3240
ctctgcggcc tcttccgcgt cttcgccttc gccctcagac gagtcggatc tccctttggg 3300
ccgcctcccc gcatcgatac cgtcgactag agctcgctga tcagcctcga ctgtgccttc 3360
tagttgccag ccatctgttg tttgcccctc ccccgtgcct tccttgaccc tggaaggtgc 3420
cactcccact gtcctttcct aataaaatga ggaaattgca tcgcattgtc tgagtaggtg 3480
tcattctatt ctggggggtg gggtggggca ggacagcaag ggggaggatt gggaagacaa 3540
tagcaggcat gctggggaga gatccacgat aacaaacagc ttttttgggg tgaacatatt 3600
gactgaattc cctgcaggtt ggccactccc tctctgcgcg ctcgctcgct cactgaggcc 3660
gcccgggcaa agcccgggcg tcgggcgacc tttggtcgcc cggcctcagt gagcgagcga 3720
gcgcgcagag agggagtggc caactccatc actaggggtt cctgcggccg ctcgtacggt 3780
ctcgaggaat tcctgcagga taacttgcca acctcattct aaaatgtata tagaagccca 3840
aaagacaata acaaaaatat tcttgtagaa caaaatggga aagaatgttc cactaaatat 3900
caagatttag agcaaagcat gagatgtgtg gggatagaca gtgaggctga taaaatagag 3960
tagagctcag aaacagaccc attgatatat gtaagtgacc tatgaaaaaa atatggcatt 4020
ttacaatggg aaaatgatgg tctttttctt ttttagaaaa acagggaaat atatttatat 4080
gtaaaaaata aaagggaacc catatgtcat accatacaca caaaaaaatt ccagtgaatt 4140
ataagtctaa atggagaagg caaaacttta aatcttttag aaaataatat agaagcatgc 4200
agaccagcct ggccaacatg atgaaaccct ctctactaat aataaaatca gtagaactac 4260
tcaggactac tttgagtggg aagtcctttt ctatgaagac ttctttggcc aaaattaggc 4320
tctaaatgca aggagatagt gcatcatgcc tggctgcact tactgataaa tgatgttatc 4380
accatcttta accaaatgca caggaacaag ttatggtact gatgtgctgg attgagaagg 4440
agctctactt ccttgacagg acacatttgt atcaacttaa aaaagcagat ttttgccagc 4500
agaactattc attcagaggt aggaaactta gaatagatga tgtcactgat tagcatggct 4560
tccccatctc cacagctgct tcccacccag gttgcccaca gttgagtttg tccagtgctc 4620
agggctgccc actctcagta agaagcccca caccagcccc tctccaaata tgttggctgt 4680
tccttccatt aaagtgaccc cactttagag cagcaagtgg atttctgttt cttacagttc 4740
aggaaggagg agtcagctgt gagaacctgg agcctgagat gcttctaagt cccactgcta 4800
ctggggtcag ggaagccaga ctccagcatc agcagtcagg agcactaagc ccttgccaac 4860
atcctgtttc tcagagaaac tgcttccatt ataatggttg tcctttttta agctatcaag 4920
ccaaacaacc agtgtctacc attattctca tcacctgaag ccaagggttc tagcaaaagt 4980
caagctgtct tgtaatggtt gatgtgcctc cagcttctgt cttcagtcac tccactctta 5040
gcctgctctg aatcaactct gaccacagtt ccctggagcc cctgccacct gctgcccctg 5100
ccaccttctc catctgcagt gctgtgcagc cttctgcact cttgcagagc taataggtgg 5160
agacttgaag gaagaggagg aaagtttctc ataatagcct tgctgcaagc tcaaatggga 5220
ggtgggcact gtgcccagga gccttggagc aaaggctgtg cccaacctct gactgcatcc 5280
aggtttggtc ttgacagaga taagaagccc tggcttttgg agccaaaatc taggtcagac 5340
ttaggcagga ttctcaaagt ttatcagcag aacatgaggc agaagaccct ttctgctcca 5400
gcttcttcag gctcaacctt catcagaata gatagaaaga gaggctgtga gggttcttaa 5460
aacagaagca aatctgactc agagaataaa caacctccta gtaaactaca gcttagacag 5520
agcatctggt ggtgagtgtg ctcagtgtcc tactcaactg tctggtatca gccctcatga 5580
ggacttctct tctttccctc atagacctcc atctctgttt tccttagcct gcagaaatct 5640
ggatggctat tcacagaatg cctgtgcttt cagagttgca ttttttctct ggtattctgg 5700
ttcaagcatt tgaaggtagg aaaggttctc caagtgcaag aaagccagcc ctgagcctca 5760
actgcctggc tagtgtggtc agtaggatgc aaaggctgtt gaatgccaca aggccaaact 5820
ttaacctgtg taccacaagc ctagcagcag aggcagctct gctcactgga actctctgtc 5880
ttctttctcc tgagcctttt cttttcctga gttttctagc tctcctcaac cttacctctg 5940
ccctacccag gacaaaccca agagccactg tttctgtgat gtcctctcca gccctaatta 6000
ggcatcatga cttcagcctg accttccatg ctcagaagca gtgctaatcc acttcagatg 6060
agctgctcta tgcaacacag gcagagccta caaacctttg caccagagcc ctccacatat 6120
cagtgtttgt tcatactcac ttcaacagca aatgtgactg ctgagattaa gattttacac 6180
aagatggtct gtaatttcac agttagtttt atcccattag gtatgaaaga attagcataa 6240
ttccccttaa acatgaatga atcttagatt ttttaataaa tagttttgga agtaaagaca 6300
gagacatcag gagcacaagg aatagcctga gaggacaaac agaacaagaa agagtctgga 6360
aatacacagg atgttcttgg cctcctcaaa gcaagtgcaa gcagatagta ccagcagccc 6420
caggctatca gagcccagtg aagagaagta ccatgaaagc cacagctcta accaccctgt 6480
tccagagtga cagacagtcc ccaagacaag ccagcctgag ccagagagag aactgcaaga 6540
gaaagtttct aatttaggtt ctgttagatt cagacaagtg caggtcatcc tctctccaca 6600
gctactcacc tctccagcct aacaaagcct gcagtccaca ctccaaccct ggtgtctcac 6660
ctcctagcct ctcccaacat cctgctctct gaccatcttc tgcatctctc atctcaccat 6720
ctcccactgt ctacagccta ctcttgcaac taccatctca ttttctgaca tcctgtctac 6780
atcttctgcc atactctgcc atctaccata ccacctctta ccatctacca caccatcttt 6840
tatctccatc cctctcagaa gcctccaagc tgaatcctgc tttatgtgtt catctcagcc 6900
cctgcatgga aagctgaccc cagaggcaga actattccca gagagcttgg ccaagaaaaa 6960
caaaactacc agcctggcca ggctcaggag tagtaagctg cagtgtctgt tgtgttctag 7020
cttcaacagc tgcaggagtt ccactctcaa atgctccaca tttctcacat cctcctgatt 7080
ctggtcacta cccatcttca aagaacagaa tatctcacat cagcatactg tgaaggacta 7140
gtcatgggtg cagctgctca gagctgcaaa gtcattctgg atggtggaga gcttacaaac 7200
atttcatgat gctccccccg ctctgatggc tggagcccaa tccctacaca gactcctgct 7260
gtatgtgttt tcctttcact ctgagccaca gccagagggc aggcattcag tctcctcttc 7320
aggctggggc tggggcactg agaactcacc caacaccttg ctctcactcc ttctgcaaaa 7380
caagaaagag ctttgtgctg cagtagccat gaagaatgaa aggaaggctt taactaaaaa 7440
atgtcagaga ttattttcaa ccccttactg tggatcacca gcaaggagga aacacaacac 7500
agagacattt tttcccctca aattatcaaa agaatcactg catttgttaa agagagcaac 7560
tgaatcagga agcagagttt tgaacatatc agaagttagg aatctgcatc agagacaaat 7620
gcagtcatgg ttgtttgctg cataccagcc ctaatcatta gaagcctcat ggacttcaaa 7680
catcattccc tctgacaaga tgctctagcc taactccatg agataaaata aatctgcctt 7740
tcagagccaa agaagagtcc accagcttct tctcagtgtg aacaagagct ccagtcaggt 7800
tagtcagtcc agtgcagtag aggagaccag tctgcatcct ctaattttca aaggcaagaa 7860
gatttgttta ccctggacac caggcacaag tgaggtcaca gagctcttag atatgcagtc 7920
ctcatgagtg aggagactaa agcgcatgcc atcaagactt cagtgtagag aaaacctcca 7980
aaaaagcctc ctcactactt ctggaatagc tcagaggccg aggcggcctc ggcctctgca 8040
taaataaaaa aaattagtca gccatggggc ggagaatggg cggaactggg cggagttagg 8100
ggcgggatgg gcggagttag gggcgggact atggttgctg actaattgag atgcatgctt 8160
tgcatacttc tgcctgctgg ggagcctggg gactttccac acctggttgc tgactaattg 8220
agatgcatgc tttgcatact tctgcctgct ggggagcctg gggactttcc acaccctaac 8280
tgacacacat tccacagctg cattaatgaa tcggccaacg cgcggggaga ggcggtttgc 8340
gtattgggcg ctcttccgct tcctcgctca ctgactcgct gcgctcggtc gttcggctgc 8400
ggcgagcggt atcagctcac tcaaaggcgg taatacggtt atccacagaa tcaggggata 8460
acgcaggaaa gaacatgtga gcaaaaggcc agcaaaaggc caggaaccgt aaaaaggccg 8520
cgttgctggc gtttttccat aggctccgcc cccctgacga gcatcacaaa aatcgacgct 8580
caagtcagag gtggcgaaac ccgacaggac tataaagata ccaggcgttt ccccctggaa 8640
gctccctcgt gcgctctcct gttccgaccc tgccgcttac cggatacctg tccgcctttc 8700
tcccttcggg aagcgtggcg ctttctcata gctcacgctg taggtatctc agttcggtgt 8760
aggtcgttcg ctccaagctg ggctgtgtgc acgaaccccc cgttcagccc gaccgctgcg 8820
ccttatccgg taactatcgt cttgagtcca acccggtaag acacgactta tcgccactgg 8880
cagcagccac tggtaacagg attagcagag cgaggtatgt aggcggtgct acagagttct 8940
tgaagtggtg gcctaactac ggctacacta gaagaacagt atttggtatc tgcgctctgc 9000
tgaagccagt taccttcgga aaaagagttg gtagctcttg atccggcaaa caaaccaccg 9060
ctggtagcgg tggttttttt gtttgcaagc agcagattac gcgcagaaaa aaaggatctc 9120
aagaagatcc tttgatcttt tctacggggt ctgacgctca gtggaacgaa aactcacgtt 9180
aagggatttt ggtcatgaga ttatcaaaaa ggatcttcac ctagatcctt ttaaattaaa 9240
aatgaagttt taaatcaatc taaagtatat atgagtaaac ttggtctgac agttaccaat 9300
gcttaatcag tgaggcacct atctcagcga tctgtctatt tcgttcatcc atagttgcct 9360
gactcctgca aaccacgttg tgtctcaaaa tctctgatgt tacattgcac aagataaaaa 9420
tatatcatca tgaacaataa aactgtctgc ttacataaac agtaatacaa ggggtgttat 9480
gagccatatt caacgggaaa cgtcttgctc gaggccgcga ttaaattcca acatggatgc 9540
tgatttatat gggtataaat gggctcgcga taatgtcggg caatcaggtg cgacaatcta 9600
tcgattgtat gggaagcccg atgcgccaga gttgtttctg aaacatggca aaggtagcgt 9660
tgccaatgat gttacagatg agatggtcag actaaactgg ctgacggaat ttatgcctct 9720
tccgaccatc aagcatttta tccgtactcc tgatgatgca tggttactca ccactgcgat 9780
ccccgggaaa acagcattcc aggtattaga agaatatcct gattcaggtg aaaatattgt 9840
tgatgcgctg gcagtgttcc tgcgccggtt gcattcgatt cctgtttgta attgtccttt 9900
taacagcgat cgcgtatttc gtctcgctca ggcgcaatca cgaatgaata acggtttggt 9960
tgatgcgagt gattttgatg acgagcgtaa tggctggcct gttgaacaag tctggaaaga 10020
aatgcataag cttttgccat tctcaccgga ttcagtcgtc actcatggtg atttctcact 10080
tgataacctt atttttgacg aggggaaatt aataggttgt attgatgttg gacgagtcgg 10140
aatcgcagac cgataccagg atcttgccat cctatggaac tgcctcggtg agttttctcc 10200
ttcattacag aaacggcttt ttcaaaaata tggtattgat aatcctgata tgaataaatt 10260
gcagtttcat ttgatgctcg atgagttttt ctaagggcgg cctgccacca tacccacgcc 10320
gaaacaagcg ctcatgagcc cgaagtggcg agcccgatct tccccatcgg tgatgtcggc 10380
gatataggcg ccagcaaccg cacctgtggc gccggtgatg agggcgcgcc aagtcgacgt 10440
ccggcagtct tggccactcc ctctctgcgc gctcgctcgc tcactgaggc cgggcgacca 10500
aaggtcgccc gacgcccggg ctttgcccgg gcggcctcag tgagcgagcg agcgcgcaga 10560
gagggagtgg ccaactccat cactaggggt tcctgctagc tctgggtatt taagcccgag 10620
tgagcacgca gggtctccat tttgaagcgg gaggttacgc gttcgtcgac tactagtggg 10680
taccagagcg tggtgactga gatgttttct aggaaacaca aaagatacaa aaaagaacac 10740
gtggaaggat agccaaaaag gggggctgcc cccatttcct gcaccccgct gcgatggctg 10800
gcaccatttg gaagacttcg agatacactg ttgagcgcag taagacaaca gtgtatctcg 10860
aagtcttcca gatggggcca gccggtccac tctgtatcca ggccagttct gcaaggcgtt 10920
cgaggaccac ccccctcccc tcgccaccag ggtggtctca tacagaactt ataagattcc 10980
caaatccaaa gacatttcac gtttatggtg atttcccaga acacatagcg acatgcaaat 11040
attgcagggc gccactcccc tgtccctcac agccatcttc ctgccagggc gcacgcgcgc 11100
tgggtgttcc cgcctagtga cactgggccc gcgattcctt ggagcgggtt gatgacgtca 11160
gcgtttccca tggtgaatcc ctaggtt 11187
<210> 13
<211> 10960
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 13
ttggccactc cctctctgcg cgctcgctcg ctcactgagg ccgggcgacc aaaggtcgcc 60
cgacgcccgg gctttgcccg ggcggcctca gtgagcgagc gagcgcgcag agagggagtg 120
gccaactcca tcactagggg ttcctgctag ctctgggtat ttaagcccga gtgagcacgc 180
agggtctcca ttttgaagcg ggaggttacg cgttcgtcga ctactagtgg gtaccagagc 240
tccctaggtt ctagaaccgg tgacgtctcc catggtgaag cttggatctg aattcggtac 300
cctagttatt aatagtaatc aattacgggg tcattagttc atagcccata tatggagttc 360
cgcgttacat aacttacggt aaatggcccg cctggctgac cgcccaacga cccccgccca 420
ttgacgtcaa taatgacgta tgttcccata gtaacgccaa tagggacttt ccattgacgt 480
caatgggtgg actatttacg gtaaactgcc cacttggcag tacatcaagt gtatcatatg 540
ccaagtacgc cccctattga cgtcaatgac ggtaaatggc ccgcctggca ttatgcccag 600
tacatgacct tatgggactt tcctacttgg cagtacatct acgtattagt catcgctatt 660
accatggtcg aggtgagccc cacgttctgc ttcactctcc ccatctcccc cccctcccca 720
cccccaattt tgtatttatt tattttttaa ttattttgtg cagcgatggg ggcggggggg 780
gggggggggc gcgcgccagg cggggcgggg cggggcgagg ggcggggcgg ggcgaggcgg 840
agaggtgcgg cggcagccaa tcagagcggc gcgctccgaa agtttccttt tatggcgagg 900
cggcggcggc ggcggcccta taaaaagcga agcgcgcggc gggcgggagt cgctgcgacg 960
ctgccttcgc cccgtgcccc gctccgccgc cgcctcgcgc cgcccgcccc ggctctgact 1020
gaccgcgtta ctcccacagg tgagcgggcg ggacggccct tctcctccgg gctgtaatta 1080
gcgcttggtt taatgacggc ttgtcctggt ggcgagggga ggggggtggt cctcgaacgc 1140
cttgcagaac tggcctggat acagagtgga ccggctggcc ccatctggaa gacttcgaga 1200
tacactgttg tcttactgcg ctcaacagtg tatctcgaag tcttccaaat ggtgccagcc 1260
atcgcagcgg ggtgcaggaa atgggggcag cccccctttt tggctatcct tccacgtgtt 1320
cttttttgta tcttttgtgt ttcctagaaa acatctcagt caccaccttt ctgtggctgc 1380
gtgaaagcct tgaggggctc cgggagctag agcctctgct aaccatgttc atgccttctt 1440
ctttttccta cagctcctgg gcaacgtgct ggttattgtg ctgtctcatc attttggcaa 1500
agaattcctc gaagatccga agggaaagtc ttccacgact gtgggatccg ttcgaagata 1560
tcaccggttg agccaccatg gaattcagca gccccagcag agaggaatgc cccaagcctc 1620
tgagccgggt gtcaatcatg gccggatctc tgacaggact gctgctgctt caggccgtgt 1680
cttgggcttc tggcgctaga ccttgcatcc ccaagagctt cggctacagc agcgtcgtgt 1740
gcgtgtgcaa tgccacctac tgcgacagct tcgaccctcc tacctttcct gctctgggca 1800
ccttcagcag atacgagagc accagatccg gcagacggat ggaactgagc atgggaccca 1860
tccaggccaa tcacacaggc actggcctgc tgctgacact gcagcctgag cagaaattcc 1920
agaaagtgaa aggcttcggc ggagccatga cagatgccgc cgctctgaat atcctggctc 1980
tgtctccacc agctcagaac ctgctgctca agagctactt cagcgaggaa ggcatcggct 2040
acaacatcat cagagtgccc atggccagct gcgacttcag catcaggacc tacacctacg 2100
ccgacacacc cgacgatttc cagctgcaca acttcagcct gcctgaagag gacaccaagc 2160
tgaagatccc tctgatccac agagccctgc agctggcaca aagacccgtg tcactgctgg 2220
cctctccatg gacatctccc acctggctga aaacaaatgg cgccgtgaat ggcaagggca 2280
gcctgaaagg ccaacctggc gacatctacc accagacctg ggccagatac ttcgtgaagt 2340
tcctggacgc ctatgccgag cacaagctgc agttttgggc cgtgacagcc gagaacgaac 2400
cttctgctgg actgctgagc ggctacccct ttcagtgcct gggctttaca cccgagcacc 2460
agcgggactt tatcgcccgt gatctgggac ccacactggc caatagcacc caccataatg 2520
tgcggctgct gatgctggac gaccagagac tgcttctgcc ccactgggct aaagtggtgc 2580
tgacagatcc tgaggccgcc aaatacgtgc acggaatcgc cgtgcactgg tatctggact 2640
ttctggcccc tgccaaggcc acactgggag agacacacag actgttcccc aacaccatgc 2700
tgttcgccag cgaagcctgt gtgggcagca agttttggga acagagcgtg cggctcggca 2760
gctgggatag aggcatgcag tacagccaca gcatcatcac caacctgctg taccacgtcg 2820
tcggctggac cgactggaat ctggccctga atcctgaagg cggccctaac tgggtccgaa 2880
acttcgtgga cagccccatc atcgtggaca tcaccaagga caccttctac aagcagccca 2940
tgttctacca cctgggacac ttcagcaagt tcatccccga gggctctcag cgcgttggac 3000
tggtggcttc ccagaagaac gatctggacg ccgtggctct gatgcaccct gatggatctg 3060
ctgtggtggt ggtcctgaac cgcagcagca aagatgtgcc cctgaccatc aaggatcccg 3120
ccgtgggatt cctggaaaca atcagccctg gctactccat ccacacctac ctgtggcgta 3180
gacagtgaca attgttaatt aagtttaaac cctcgaggcc gcaagcttat cgataatcaa 3240
cctctggatt acaaaatttg tgaaagattg actggtattc ttaactatgt tgctcctttt 3300
acgctatgtg gatacgctgc tttaatgcct ttgtatcatg ctattgcttc ccgtatggct 3360
ttcattttct cctccttgta taaatcctgg ttgctgtctc tttatgagga gttgtggccc 3420
gttgtcaggc aacgtggcgt ggtgtgcact gtgtttgctg acgcaacccc cactggttgg 3480
ggcattgcca ccacctgtca gctcctttcc gggactttcg ctttccccct ccctattgcc 3540
acggcggaac tcatcgccgc ctgccttgcc cgctgctgga caggggctcg gctgttgggc 3600
actgacaatt ccgtggtgtt gtcggggaaa tcatcgtcct ttccttggct gctcgcctgt 3660
gttgccacct ggattctgcg cgggacgtcc ttctgctacg tcccttcggc cctcaatcca 3720
gcggaccttc cttcccgcgg cctgctgccg gctctgcggc ctcttccgcg tcttcgcctt 3780
cgccctcaga cgagtcggat ctccctttgg gccgcctccc cgcatcgata ccgtcgacta 3840
gagctcgctg atcagcctcg actgtgcctt ctagttgcca gccatctgtt gtttgcccct 3900
cccccgtgcc ttccttgacc ctggaaggtg ccactcccac tgtcctttcc taataaaatg 3960
aggaaattgc atcgcattgt ctgagtaggt gtcattctat tctggggggt ggggtggggc 4020
aggacagcaa gggggaggat tgggaagaca atagcaggca tgctggggag agatccacga 4080
taacaaacag cttttttggg gtgaacatat tgactgaatt ccctgcaggt tggccactcc 4140
ctctctgcgc gctcgctcgc tcactgaggc cgcccgggca aagcccgggc gtcgggcgac 4200
ctttggtcgc ccggcctcag tgagcgagcg agcgcgcaga gagggagtgg ccaactccat 4260
cactaggggt tcctgcggcc gctcgtacgg tctcgaggaa ttcctgcagg ataacttgcc 4320
aacctcattc taaaatgtat atagaagccc aaaagacaat aacaaaaata ttcttgtaga 4380
acaaaatggg aaagaatgtt ccactaaata tcaagattta gagcaaagca tgagatgtgt 4440
ggggatagac agtgaggctg ataaaataga gtagagctca gaaacagacc cattgatata 4500
tgtaagtgac ctatgaaaaa aatatggcat tttacaatgg gaaaatgatg gtctttttct 4560
tttttagaaa aacagggaaa tatatttata tgtaaaaaat aaaagggaac ccatatgtca 4620
taccatacac acaaaaaaat tccagtgaat tataagtcta aatggagaag gcaaaacttt 4680
aaatctttta gaaaataata tagaagcatg cagaccagcc tggccaacat gatgaaaccc 4740
tctctactaa taataaaatc agtagaacta ctcaggacta ctttgagtgg gaagtccttt 4800
tctatgaaga cttctttggc caaaattagg ctctaaatgc aaggagatag tgcatcatgc 4860
ctggctgcac ttactgataa atgatgttat caccatcttt aaccaaatgc acaggaacaa 4920
gttatggtac tgatgtgctg gattgagaag gagctctact tccttgacag gacacatttg 4980
tatcaactta aaaaagcaga tttttgccag cagaactatt cattcagagg taggaaactt 5040
agaatagatg atgtcactga ttagcatggc ttccccatct ccacagctgc ttcccaccca 5100
ggttgcccac agttgagttt gtccagtgct cagggctgcc cactctcagt aagaagcccc 5160
acaccagccc ctctccaaat atgttggctg ttccttccat taaagtgacc ccactttaga 5220
gcagcaagtg gatttctgtt tcttacagtt caggaaggag gagtcagctg tgagaacctg 5280
gagcctgaga tgcttctaag tcccactgct actggggtca gggaagccag actccagcat 5340
cagcagtcag gagcactaag cccttgccaa catcctgttt ctcagagaaa ctgcttccat 5400
tataatggtt gtcctttttt aagctatcaa gccaaacaac cagtgtctac cattattctc 5460
atcacctgaa gccaagggtt ctagcaaaag tcaagctgtc ttgtaatggt tgatgtgcct 5520
ccagcttctg tcttcagtca ctccactctt agcctgctct gaatcaactc tgaccacagt 5580
tccctggagc ccctgccacc tgctgcccct gccaccttct ccatctgcag tgctgtgcag 5640
ccttctgcac tcttgcagag ctaataggtg gagacttgaa ggaagaggag gaaagtttct 5700
cataatagcc ttgctgcaag ctcaaatggg aggtgggcac tgtgcccagg agccttggag 5760
caaaggctgt gcccaacctc tgactgcatc caggtttggt cttgacagag ataagaagcc 5820
ctggcttttg gagccaaaat ctaggtcaga cttaggcagg attctcaaag tttatcagca 5880
gaacatgagg cagaagaccc tttctgctcc agcttcttca ggctcaacct tcatcagaat 5940
agatagaaag agaggctgtg agggttctta aaacagaagc aaatctgact cagagaataa 6000
acaacctcct agtaaactac agcttagaca gagcatctgg tggtgagtgt gctcagtgtc 6060
ctactcaact gtctggtatc agccctcatg aggacttctc ttctttccct catagacctc 6120
catctctgtt ttccttagcc tgcagaaatc tggatggcta ttcacagaat gcctgtgctt 6180
tcagagttgc attttttctc tggtattctg gttcaagcat ttgaaggtag gaaaggttct 6240
ccaagtgcaa gaaagccagc cctgagcctc aactgcctgg ctagtgtggt cagtaggatg 6300
caaaggctgt tgaatgccac aaggccaaac tttaacctgt gtaccacaag cctagcagca 6360
gaggcagctc tgctcactgg aactctctgt cttctttctc ctgagccttt tcttttcctg 6420
agttttctag ctctcctcaa ccttacctct gccctaccca ggacaaaccc aagagccact 6480
gtttctgtga tgtcctctcc agccctaatt aggcatcatg acttcagcct gaccttccat 6540
gctcagaagc agtgctaatc cacttcagat gagctgctct atgcaacaca ggcagagcct 6600
acaaaccttt gcaccagagc cctccacata tcagtgtttg ttcatactca cttcaacagc 6660
aaatgtgact gctgagatta agattttaca caagatggtc tgtaatttca cagttagttt 6720
tatcccatta ggtatgaaag aattagcata attcccctta aacatgaatg aatcttagat 6780
tttttaataa atagttttgg aagtaaagac agagacatca ggagcacaag gaatagcctg 6840
agaggacaaa cagaacaaga aagagtctgg aaatacacag gatgttcttg gcctcctcaa 6900
agcaagtgca agcagatagt accagcagcc ccaggctatc agagcccagt gaagagaagt 6960
accatgaaag ccacagctct aaccaccctg ttccagagtg acagacagtc cccaagacaa 7020
gccagcctga gccagagaga gaactgcaag agaaagtttc taatttaggt tctgttagat 7080
tcagacaagt gcaggtcatc ctctctccac agctactcac ctctccagcc taacaaagcc 7140
tgcagtccac actccaaccc tggtgtctca cctcctagcc tctcccaaca tcctgctctc 7200
tgaccatctt ctgcatctct catctcacca tctcccactg tctacagcct actcttgcaa 7260
ctaccatctc attttctgac atcctgtcta catcttctgc catactctgc catctaccat 7320
accacctctt accatctacc acaccatctt ttatctccat ccctctcaga agcctccaag 7380
ctgaatcctg ctttatgtgt tcatctcagc ccctgcatgg aaagctgacc ccagaggcag 7440
aactattccc agagagcttg gccaagaaaa acaaaactac cagcctggcc aggctcagga 7500
gtagtaagct gcagtgtctg ttgtgttcta gcttcaacag ctgcaggagt tccactctca 7560
aatgctccac atttctcaca tcctcctgat tctggtcact acccatcttc aaagaacaga 7620
atatctcaca tcagcatact gtgaaggact agtcatgggt gcagctgctc agagctgcaa 7680
agtcattctg gatggtggag agcttacaaa catttcatga tgctcccccc gctctgatgg 7740
ctggagccca atccctacac agactcctgc tgtatgtgtt ttcctttcac tctgagccac 7800
agccagaggg caggcattca gtctcctctt caggctgggg ctggggcact gagaactcac 7860
ccaacacctt gctctcactc cttctgcaaa acaagaaaga gctttgtgct gcagtagcca 7920
tgaagaatga aaggaaggct ttaactaaaa aatgtcagag attattttca accccttact 7980
gtggatcacc agcaaggagg aaacacaaca cagagacatt ttttcccctc aaattatcaa 8040
aagaatcact gcatttgtta aagagagcaa ctgaatcagg aagcagagtt ttgaacatat 8100
cagaagttag gaatctgcat cagagacaaa tgcagtcatg gttgtttgct gcataccagc 8160
cctaatcatt agaagcctca tggacttcaa acatcattcc ctctgacaag atgctctagc 8220
ctaactccat gagataaaat aaatctgcct ttcagagcca aagaagagtc caccagcttc 8280
ttctcagtgt gaacaagagc tccagtcagg ttagtcagtc cagtgcagta gaggagacca 8340
gtctgcatcc tctaattttc aaaggcaaga agatttgttt accctggaca ccaggcacaa 8400
gtgaggtcac agagctctta gatatgcagt cctcatgagt gaggagacta aagcgcatgc 8460
catcaagact tcagtgtaga gaaaacctcc aaaaaagcct cctcactact tctggaatag 8520
ctcagaggcc gaggcggcct cggcctctgc ataaataaaa aaaattagtc agccatgggg 8580
cggagaatgg gcggaactgg gcggagttag gggcgggatg ggcggagtta ggggcgggac 8640
tatggttgct gactaattga gatgcatgct ttgcatactt ctgcctgctg gggagcctgg 8700
ggactttcca cacctggttg ctgactaatt gagatgcatg ctttgcatac ttctgcctgc 8760
tggggagcct ggggactttc cacaccctaa ctgacacaca ttccacagct gcattaatga 8820
atcggccaac gcgcggggag aggcggtttg cgtattgggc gctcttccgc ttcctcgctc 8880
actgactcgc tgcgctcggt cgttcggctg cggcgagcgg tatcagctca ctcaaaggcg 8940
gtaatacggt tatccacaga atcaggggat aacgcaggaa agaacatgtg agcaaaaggc 9000
cagcaaaagg ccaggaaccg taaaaaggcc gcgttgctgg cgtttttcca taggctccgc 9060
ccccctgacg agcatcacaa aaatcgacgc tcaagtcaga ggtggcgaaa cccgacagga 9120
ctataaagat accaggcgtt tccccctgga agctccctcg tgcgctctcc tgttccgacc 9180
ctgccgctta ccggatacct gtccgccttt ctcccttcgg gaagcgtggc gctttctcat 9240
agctcacgct gtaggtatct cagttcggtg taggtcgttc gctccaagct gggctgtgtg 9300
cacgaacccc ccgttcagcc cgaccgctgc gccttatccg gtaactatcg tcttgagtcc 9360
aacccggtaa gacacgactt atcgccactg gcagcagcca ctggtaacag gattagcaga 9420
gcgaggtatg taggcggtgc tacagagttc ttgaagtggt ggcctaacta cggctacact 9480
agaagaacag tatttggtat ctgcgctctg ctgaagccag ttaccttcgg aaaaagagtt 9540
ggtagctctt gatccggcaa acaaaccacc gctggtagcg gtggtttttt tgtttgcaag 9600
cagcagatta cgcgcagaaa aaaaggatct caagaagatc ctttgatctt ttctacgggg 9660
tctgacgctc agtggaacga aaactcacgt taagggattt tggtcatgag attatcaaaa 9720
aggatcttca cctagatcct tttaaattaa aaatgaagtt ttaaatcaat ctaaagtata 9780
tatgagtaaa cttggtctga cagttaccaa tgcttaatca gtgaggcacc tatctcagcg 9840
atctgtctat ttcgttcatc catagttgcc tgactcctgc aaaccacgtt gtgtctcaaa 9900
atctctgatg ttacattgca caagataaaa atatatcatc atgaacaata aaactgtctg 9960
cttacataaa cagtaataca aggggtgtta tgagccatat tcaacgggaa acgtcttgct 10020
cgaggccgcg attaaattcc aacatggatg ctgatttata tgggtataaa tgggctcgcg 10080
ataatgtcgg gcaatcaggt gcgacaatct atcgattgta tgggaagccc gatgcgccag 10140
agttgtttct gaaacatggc aaaggtagcg ttgccaatga tgttacagat gagatggtca 10200
gactaaactg gctgacggaa tttatgcctc ttccgaccat caagcatttt atccgtactc 10260
ctgatgatgc atggttactc accactgcga tccccgggaa aacagcattc caggtattag 10320
aagaatatcc tgattcaggt gaaaatattg ttgatgcgct ggcagtgttc ctgcgccggt 10380
tgcattcgat tcctgtttgt aattgtcctt ttaacagcga tcgcgtattt cgtctcgctc 10440
aggcgcaatc acgaatgaat aacggtttgg ttgatgcgag tgattttgat gacgagcgta 10500
atggctggcc tgttgaacaa gtctggaaag aaatgcataa gcttttgcca ttctcaccgg 10560
attcagtcgt cactcatggt gatttctcac ttgataacct tatttttgac gaggggaaat 10620
taataggttg tattgatgtt ggacgagtcg gaatcgcaga ccgataccag gatcttgcca 10680
tcctatggaa ctgcctcggt gagttttctc cttcattaca gaaacggctt tttcaaaaat 10740
atggtattga taatcctgat atgaataaat tgcagtttca tttgatgctc gatgagtttt 10800
tctaagggcg gcctgccacc atacccacgc cgaaacaagc gctcatgagc ccgaagtggc 10860
gagcccgatc ttccccatcg gtgatgtcgg cgatataggc gccagcaacc gcacctgtgg 10920
cgccggtgat gagggcgcgc caagtcgacg tccggcagtc 10960
<210> 14
<211> 536
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 14
Met Glu Phe Ser Ser Pro Ser Arg Glu Glu Cys Pro Lys Pro Leu Ser
1 5 10 15
Arg Val Ser Ile Met Ala Gly Ser Leu Thr Gly Leu Leu Leu Leu Gln
20 25 30
Ala Val Ser Trp Ala Ser Gly Ala Arg Pro Cys Ile Pro Lys Ser Phe
35 40 45
Gly Tyr Ser Ser Val Val Cys Val Cys Asn Ala Thr Tyr Cys Asp Ser
50 55 60
Phe Asp Pro Pro Thr Phe Pro Ala Leu Gly Thr Phe Ser Arg Tyr Glu
65 70 75 80
Ser Thr Arg Ser Gly Arg Arg Met Glu Leu Ser Met Gly Pro Ile Gln
85 90 95
Ala Asn His Thr Gly Thr Gly Leu Leu Leu Thr Leu Gln Pro Glu Gln
100 105 110
Lys Phe Gln Lys Val Lys Gly Phe Gly Gly Ala Met Thr Asp Ala Ala
115 120 125
Ala Leu Asn Ile Leu Ala Leu Ser Pro Pro Ala Gln Asn Leu Leu Leu
130 135 140
Lys Ser Tyr Phe Ser Glu Glu Gly Ile Gly Tyr Asn Ile Ile Arg Val
145 150 155 160
Pro Met Ala Ser Cys Asp Phe Ser Ile Arg Thr Tyr Thr Tyr Ala Asp
165 170 175
Thr Pro Asp Asp Phe Gln Leu His Asn Phe Ser Leu Pro Glu Glu Asp
180 185 190
Thr Lys Leu Lys Ile Pro Leu Ile His Arg Ala Leu Gln Leu Ala Gln
195 200 205
Arg Pro Val Ser Leu Leu Ala Ser Pro Trp Thr Ser Pro Thr Trp Leu
210 215 220
Lys Thr Asn Gly Ala Val Asn Gly Lys Gly Ser Leu Lys Gly Gln Pro
225 230 235 240
Gly Asp Ile Tyr His Gln Thr Trp Ala Arg Tyr Phe Val Lys Phe Leu
245 250 255
Asp Ala Tyr Ala Glu His Lys Leu Gln Phe Trp Ala Val Thr Ala Glu
260 265 270
Asn Glu Pro Ser Ala Gly Leu Leu Ser Gly Tyr Pro Phe Gln Cys Leu
275 280 285
Gly Phe Thr Pro Glu His Gln Arg Asp Phe Ile Ala Arg Asp Leu Gly
290 295 300
Pro Thr Leu Ala Asn Ser Thr His His Asn Val Arg Leu Leu Met Leu
305 310 315 320
Asp Asp Gln Arg Leu Leu Leu Pro His Trp Ala Lys Val Val Leu Thr
325 330 335
Asp Pro Glu Ala Ala Lys Tyr Val His Gly Ile Ala Val His Trp Tyr
340 345 350
Leu Asp Phe Leu Ala Pro Ala Lys Ala Thr Leu Gly Glu Thr His Arg
355 360 365
Leu Phe Pro Asn Thr Met Leu Phe Ala Ser Glu Ala Cys Val Gly Ser
370 375 380
Lys Phe Trp Glu Gln Ser Val Arg Leu Gly Ser Trp Asp Arg Gly Met
385 390 395 400
Gln Tyr Ser His Ser Ile Ile Thr Asn Leu Leu Tyr His Val Val Gly
405 410 415
Trp Thr Asp Trp Asn Leu Ala Leu Asn Pro Glu Gly Gly Pro Asn Trp
420 425 430
Val Arg Asn Phe Val Asp Ser Pro Ile Ile Val Asp Ile Thr Lys Asp
435 440 445
Thr Phe Tyr Lys Gln Pro Met Phe Tyr His Leu Gly His Phe Ser Lys
450 455 460
Phe Ile Pro Glu Gly Ser Gln Arg Val Gly Leu Val Ala Ser Gln Lys
465 470 475 480
Asn Asp Leu Asp Ala Val Ala Leu Met His Pro Asp Gly Ser Ala Val
485 490 495
Val Val Val Leu Asn Arg Ser Ser Lys Asp Val Pro Leu Thr Ile Lys
500 505 510
Asp Pro Ala Val Gly Phe Leu Glu Thr Ile Ser Pro Gly Tyr Ser Ile
515 520 525
His Thr Tyr Leu Trp Arg Arg Gln
530 535
<210> 15
<211> 1608
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 15
atggaattca gcagccccag cagagaggaa tgccccaagc ctctgagccg ggtgtcaatc 60
atggccggat ctctgacagg actgctgctg cttcaggccg tgtcttgggc ttctggcgct 120
agaccttgca tccccaagag cttcggctac agcagcgtcg tgtgcgtgtg caatgccacc 180
tactgcgaca gcttcgaccc tcctaccttt cctgctctgg gcaccttcag cagatacgag 240
agcaccagat ccggcagacg gatggaactg agcatgggac ccatccaggc caatcacaca 300
ggcactggcc tgctgctgac actgcagcct gagcagaaat tccagaaagt gaaaggcttc 360
ggcggagcca tgacagatgc cgccgctctg aatatcctgg ctctgtctcc accagctcag 420
aacctgctgc tcaagagcta cttcagcgag gaaggcatcg gctacaacat catcagagtg 480
cccatggcca gctgcgactt cagcatcagg acctacacct acgccgacac acccgacgat 540
ttccagctgc acaacttcag cctgcctgaa gaggacacca agctgaagat ccctctgatc 600
cacagagccc tgcagctggc acaaagaccc gtgtcactgc tggcctctcc atggacatct 660
cccacctggc tgaaaacaaa tggcgccgtg aatggcaagg gcagcctgaa aggccaacct 720
ggcgacatct accaccagac ctgggccaga tacttcgtga agttcctgga cgcctatgcc 780
gagcacaagc tgcagttttg ggccgtgaca gccgagaacg aaccttctgc tggactgctg 840
agcggctacc cctttcagtg cctgggcttt acacccgagc accagcggga ctttatcgcc 900
cgtgatctgg gacccacact ggccaatagc acccaccata atgtgcggct gctgatgctg 960
gacgaccaga gactgcttct gccccactgg gctaaagtgg tgctgacaga tcctgaggcc 1020
gccaaatacg tgcacggaat cgccgtgcac tggtatctgg actttctggc ccctgccaag 1080
gccacactgg gagagacaca cagactgttc cccaacacca tgctgttcgc cagcgaagcc 1140
tgtgtgggca gcaagttttg ggaacagagc gtgcggctcg gcagctggga tagaggcatg 1200
cagtacagcc acagcatcat caccaacctg ctgtaccacg tcgtcggctg gaccgactgg 1260
aatctggccc tgaatcctga aggcggccct aactgggtcc gaaacttcgt ggacagcccc 1320
atcatcgtgg acatcaccaa ggacaccttc tacaagcagc ccatgttcta ccacctggga 1380
cacttcagca agttcatccc cgagggctct cagcgcgttg gactggtggc ttcccagaag 1440
aacgatctgg acgccgtggc tctgatgcac cctgatggat ctgctgtggt ggtggtcctg 1500
aaccgcagca gcaaagatgt gcccctgacc atcaaggatc ccgccgtggg attcctggaa 1560
acaatcagcc ctggctactc catccacacc tacctgtggc gtagacag 1608
<210> 16
<211> 524
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 16
Met Tyr Ala Leu Phe Leu Leu Ala Ser Leu Leu Gly Ala Ala Leu Ala
1 5 10 15
Gly Pro Val Leu Gly Leu Lys Glu Cys Thr Arg Gly Ser Ala Val Trp
20 25 30
Cys Gln Asn Val Lys Thr Ala Ser Asp Cys Gly Ala Val Lys His Cys
35 40 45
Leu Gln Thr Val Trp Asn Lys Pro Thr Val Lys Ser Leu Pro Cys Asp
50 55 60
Ile Cys Lys Asp Val Val Thr Ala Ala Gly Asp Met Leu Lys Asp Asn
65 70 75 80
Ala Thr Glu Glu Glu Ile Leu Val Tyr Leu Glu Lys Thr Cys Asp Trp
85 90 95
Leu Pro Lys Pro Asn Met Ser Ala Ser Cys Lys Glu Ile Val Asp Ser
100 105 110
Tyr Leu Pro Val Ile Leu Asp Ile Ile Lys Gly Glu Met Ser Arg Pro
115 120 125
Gly Glu Val Cys Ser Ala Leu Asn Leu Cys Glu Ser Leu Gln Lys His
130 135 140
Leu Ala Glu Leu Asn His Gln Lys Gln Leu Glu Ser Asn Lys Ile Pro
145 150 155 160
Glu Leu Asp Met Thr Glu Val Val Ala Pro Phe Met Ala Asn Ile Pro
165 170 175
Leu Leu Leu Tyr Pro Gln Asp Gly Pro Arg Ser Lys Pro Gln Pro Lys
180 185 190
Asp Asn Gly Asp Val Cys Gln Asp Cys Ile Gln Met Val Thr Asp Ile
195 200 205
Gln Thr Ala Val Arg Thr Asn Ser Thr Phe Val Gln Ala Leu Val Glu
210 215 220
His Val Lys Glu Glu Cys Asp Arg Leu Gly Pro Gly Met Ala Asp Ile
225 230 235 240
Cys Lys Asn Tyr Ile Ser Gln Tyr Ser Glu Ile Ala Ile Gln Met Met
245 250 255
Met His Met Gln Pro Lys Glu Ile Cys Ala Leu Val Gly Phe Cys Asp
260 265 270
Glu Val Lys Glu Met Pro Met Gln Thr Leu Val Pro Ala Lys Val Ala
275 280 285
Ser Lys Asn Val Ile Pro Ala Leu Glu Leu Val Glu Pro Ile Lys Lys
290 295 300
His Glu Val Pro Ala Lys Ser Asp Val Tyr Cys Glu Val Cys Glu Phe
305 310 315 320
Leu Val Lys Glu Val Thr Lys Leu Ile Asp Asn Asn Lys Thr Glu Lys
325 330 335
Glu Ile Leu Asp Ala Phe Asp Lys Met Cys Ser Lys Leu Pro Lys Ser
340 345 350
Leu Ser Glu Glu Cys Gln Glu Val Val Asp Thr Tyr Gly Ser Ser Ile
355 360 365
Leu Ser Ile Leu Leu Glu Glu Val Ser Pro Glu Leu Val Cys Ser Met
370 375 380
Leu His Leu Cys Ser Gly Thr Arg Leu Pro Ala Leu Thr Val His Val
385 390 395 400
Thr Gln Pro Lys Asp Gly Gly Phe Cys Glu Val Cys Lys Lys Leu Val
405 410 415
Gly Tyr Leu Asp Arg Asn Leu Glu Lys Asn Ser Thr Lys Gln Glu Ile
420 425 430
Leu Ala Ala Leu Glu Lys Gly Cys Ser Phe Leu Pro Asp Pro Tyr Gln
435 440 445
Lys Gln Cys Asp Gln Phe Val Ala Glu Tyr Glu Pro Val Leu Ile Glu
450 455 460
Ile Leu Val Glu Val Met Asp Pro Ser Phe Val Cys Leu Lys Ile Gly
465 470 475 480
Ala Cys Pro Ser Ala His Lys Pro Leu Leu Gly Thr Glu Lys Cys Ile
485 490 495
Trp Gly Pro Ser Tyr Trp Cys Gln Asn Thr Glu Thr Ala Ala Gln Cys
500 505 510
Asn Ala Val Glu His Cys Lys Arg His Val Trp Asn
515 520
<210> 17
<211> 1572
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 17
atgtacgccc tgttcctgct ggccagcctg ctgggcgccg ccctggccgg ccccgtgctg 60
ggcctgaagg agtgcacccg cggcagcgcc gtgtggtgcc agaacgtgaa gaccgccagc 120
gactgcggcg ccgtgaagca ctgcctgcag accgtgtgga acaagcccac cgtgaagagc 180
ctgccctgcg acatctgcaa ggacgtggtg accgccgccg gcgacatgct gaaggacaac 240
gccaccgagg aggagatcct ggtgtacctg gagaagacct gcgactggct gcccaagccc 300
aacatgagcg ccagctgcaa ggagatcgtg gacagctacc tgcccgtgat cctggacatc 360
atcaagggcg agatgagccg ccccggcgag gtgtgcagcg ccctgaacct gtgcgagagc 420
ctgcagaagc acctggccga gctgaaccac cagaagcagc tggagagcaa caagatcccc 480
gagctggaca tgaccgaggt ggtggccccc ttcatggcca acatccccct gctgctgtac 540
ccccaggacg gcccccgcag caagccccag cccaaggaca acggcgacgt gtgccaggac 600
tgcatccaga tggtgaccga catccagacc gccgtgcgca ccaacagcac cttcgtgcag 660
gccctggtgg agcacgtgaa ggaggagtgc gaccgcctgg gccccggcat ggccgacatc 720
tgcaagaact acatcagcca gtacagcgag atcgccatcc agatgatgat gcacatgcag 780
cccaaggaga tctgcgccct ggtgggcttc tgcgacgagg tgaaggagat gcccatgcag 840
accctggtgc ccgccaaggt ggccagcaag aacgtgatcc ccgccctgga gctggtggag 900
cccatcaaga agcacgaggt gcccgccaag agcgacgtgt actgcgaggt gtgcgagttc 960
ctggtgaagg aggtgaccaa gctgatcgac aacaacaaga ccgagaagga gatcctggac 1020
gccttcgaca agatgtgcag caagctgccc aagagcctga gcgaggagtg ccaggaggtg 1080
gtggacacct acggcagcag catcctgagc atcctgctgg aggaggtgag ccccgagctg 1140
gtgtgcagca tgctgcacct gtgcagcggc acccgcctgc ccgccctgac cgtgcacgtg 1200
acccagccca aggacggcgg cttctgcgag gtgtgcaaga agctggtggg ctacctggac 1260
cgcaacctgg agaagaacag caccaagcag gagatcctgg ccgccctgga gaagggctgc 1320
agcttcctgc ccgaccccta ccagaagcag tgcgaccagt tcgtggccga gtacgagccc 1380
gtgctgatcg agatcctggt ggaggtgatg gaccccagct tcgtgtgcct gaagatcggc 1440
gcctgcccca gcgcccacaa gcccctgctg ggcaccgaga agtgcatctg gggccccagc 1500
tactggtgcc agaacaccga gaccgccgcc cagtgcaacg ccgtggagca ctgcaagcgc 1560
cacgtgtgga ac 1572
<210> 18
<211> 478
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 18
Met Gly Arg Cys Cys Phe Tyr Thr Ala Gly Thr Leu Ser Leu Leu Leu
1 5 10 15
Leu Val Thr Ser Val Thr Leu Leu Val Ala Arg Val Phe Gln Lys Ala
20 25 30
Val Asp Gln Ser Ile Glu Lys Lys Ile Val Leu Arg Asn Gly Thr Glu
35 40 45
Ala Phe Asp Ser Trp Glu Lys Pro Pro Leu Pro Val Tyr Thr Gln Phe
50 55 60
Tyr Phe Phe Asn Val Thr Asn Pro Glu Glu Ile Leu Arg Gly Glu Thr
65 70 75 80
Pro Arg Val Glu Glu Val Gly Pro Tyr Thr Tyr Arg Glu Leu Arg Asn
85 90 95
Lys Ala Asn Ile Gln Phe Gly Asp Asn Gly Thr Thr Ile Ser Ala Val
100 105 110
Ser Asn Lys Ala Tyr Val Phe Glu Arg Asp Gln Ser Val Gly Asp Pro
115 120 125
Lys Ile Asp Leu Ile Arg Thr Leu Asn Ile Pro Val Leu Thr Val Ile
130 135 140
Glu Trp Ser Gln Val His Phe Leu Arg Glu Ile Ile Glu Ala Met Leu
145 150 155 160
Lys Ala Tyr Gln Gln Lys Leu Phe Val Thr His Thr Val Asp Glu Leu
165 170 175
Leu Trp Gly Tyr Lys Asp Glu Ile Leu Ser Leu Ile His Val Phe Arg
180 185 190
Pro Asp Ile Ser Pro Tyr Phe Gly Leu Phe Tyr Glu Lys Asn Gly Thr
195 200 205
Asn Asp Gly Asp Tyr Val Phe Leu Thr Gly Glu Asp Ser Tyr Leu Asn
210 215 220
Phe Thr Lys Ile Val Glu Trp Asn Gly Lys Thr Ser Leu Asp Trp Trp
225 230 235 240
Ile Thr Asp Lys Cys Asn Met Ile Asn Gly Thr Asp Gly Asp Ser Phe
245 250 255
His Pro Leu Ile Thr Lys Asp Glu Val Leu Tyr Val Phe Pro Ser Asp
260 265 270
Phe Cys Arg Ser Val Tyr Ile Thr Phe Ser Asp Tyr Glu Ser Val Gln
275 280 285
Gly Leu Pro Ala Phe Arg Tyr Lys Val Pro Ala Glu Ile Leu Ala Asn
290 295 300
Thr Ser Asp Asn Ala Gly Phe Cys Ile Pro Glu Gly Asn Cys Leu Gly
305 310 315 320
Ser Gly Val Leu Asn Val Ser Ile Cys Lys Asn Gly Ala Pro Ile Ile
325 330 335
Met Ser Phe Pro His Phe Tyr Gln Ala Asp Glu Arg Phe Val Ser Ala
340 345 350
Ile Glu Gly Met His Pro Asn Gln Glu Asp His Glu Thr Phe Val Asp
355 360 365
Ile Asn Pro Leu Thr Gly Ile Ile Leu Lys Ala Ala Lys Arg Phe Gln
370 375 380
Ile Asn Ile Tyr Val Lys Lys Leu Asp Asp Phe Val Glu Thr Gly Asp
385 390 395 400
Ile Arg Thr Met Val Phe Pro Val Met Tyr Leu Asn Glu Ser Val His
405 410 415
Ile Asp Lys Glu Thr Ala Ser Arg Leu Lys Ser Met Ile Asn Thr Thr
420 425 430
Leu Ile Ile Thr Asn Ile Pro Tyr Ile Ile Met Ala Leu Gly Val Phe
435 440 445
Phe Gly Leu Val Phe Thr Trp Leu Ala Cys Lys Gly Gln Gly Ser Met
450 455 460
Asp Glu Gly Thr Ala Asp Glu Arg Ala Pro Leu Ile Arg Thr
465 470 475
<210> 19
<211> 1434
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 19
atgggccgct gctgcttcta caccgccggc accctgagcc tgctgctgct ggtgaccagc 60
gtgaccctgc tggtggcccg cgtgttccag aaggccgtgg accagagcat cgagaagaag 120
atcgtgctgc gcaacggcac cgaggccttc gacagctggg agaagccccc cctgcccgtg 180
tacacccagt tctacttctt caacgtgacc aaccccgagg agatcctgcg cggcgagacc 240
ccccgcgtgg aggaggtggg cccctacacc taccgcgagc tgcgcaacaa ggccaacatc 300
cagttcggcg acaacggcac caccatcagc gccgtgagca acaaggccta cgtgttcgag 360
cgcgaccaga gcgtgggcga ccccaagatc gacctgatcc gcaccctgaa catccccgtg 420
ctgaccgtga tcgagtggag ccaggtgcac ttcctgcgcg agatcatcga ggccatgctg 480
aaggcctacc agcagaagct gttcgtgacc cacaccgtgg acgagctgct gtggggctac 540
aaggacgaga tcctgagcct gatccacgtg ttccgccccg acatcagccc ctacttcggc 600
ctgttctacg agaagaacgg caccaacgac ggcgactacg tgttcctgac cggcgaggac 660
agctacctga acttcaccaa gatcgtggag tggaacggca agaccagcct ggactggtgg 720
atcaccgaca agtgcaacat gatcaacggc accgacggcg acagcttcca ccccctgatc 780
accaaggacg aggtgctgta cgtgttcccc agcgacttct gccgcagcgt gtacatcacc 840
ttcagcgact acgagagcgt gcagggcctg cccgccttcc gctacaaggt gcccgccgag 900
atcctggcca acaccagcga caacgccggc ttctgcatcc ccgagggcaa ctgcctgggc 960
agcggcgtgc tgaacgtgag catctgcaag aacggcgccc ccatcatcat gagcttcccc 1020
cacttctacc aggccgacga gcgcttcgtg agcgccatcg agggcatgca ccccaaccag 1080
gaggaccacg agaccttcgt ggacatcaac cccctgaccg gcatcatcct gaaggccgcc 1140
aagcgcttcc agatcaacat ctacgtgaag aagctggacg acttcgtgga gaccggcgac 1200
atccgcacca tggtgttccc cgtgatgtac ctgaacgaga gcgtgcacat cgacaaggag 1260
accgccagcc gcctgaagag catgatcaac accaccctga tcatcaccaa catcccctac 1320
atcatcatgg ccctgggcgt gttcttcggc ctggtgttca cctggctggc ctgcaagggc 1380
cagggcagca tggacgaggg caccgccgac gagcgcgccc ccctgatccg cacc 1434
<210> 20
<211> 23
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 20
tggaagactt cgagatacac tgt 23
<210> 21
<211> 23
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 21
acagtgtatc tcgaagtctt cca 23
<210> 22
<211> 21
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 22
tttagaaata agtggtagtc a 21
<210> 23
<211> 21
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 23
tgactaccac ttatttctaa a 21
<210> 24
<211> 19
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 24
agggtatcaa gactacgaa 19
<210> 25
<211> 19
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 25
ttcgtagtct tgataccct 19
<210> 26
<211> 19
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 26
tattagatct gatggccgc 19
<210> 27
<211> 20
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 27
ctccatcact aggggttcct 20
<210> 28
<211> 60
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 28
agctctgggt atttaagccc gagtgagcac gcagggtctc cattttgaag cgggaggtta 60
<210> 29
<211> 145
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 29
aggaacccct agtgatggag ttggccactc cctctctgcg cgctcgctcg ctcactgagg 60
ccgggcgacc aaaggtcgcc cgacgcccgg gctttgcccg ggcggcctca gtgagcgagc 120
gagcgcgcag agagggagtg gccaa 145
<210> 30
<211> 927
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 30
Met Gly Thr Gln Asp Pro Gly Asn Met Gly Thr Gly Val Pro Ala Ser
1 5 10 15
Glu Gln Ile Ser Cys Ala Lys Glu Asp Pro Gln Val Tyr Cys Pro Glu
20 25 30
Glu Thr Gly Gly Thr Lys Asp Val Gln Val Thr Asp Cys Lys Ser Pro
35 40 45
Glu Asp Ser Arg Pro Pro Lys Glu Thr Asp Cys Cys Asn Pro Glu Asp
50 55 60
Ser Gly Gln Leu Met Val Ser Tyr Glu Gly Lys Ala Met Gly Tyr Gln
65 70 75 80
Val Pro Pro Phe Gly Trp Arg Ile Cys Leu Ala His Glu Phe Thr Glu
85 90 95
Lys Arg Lys Pro Phe Gln Ala Asn Asn Val Ser Leu Ser Asn Met Ile
100 105 110
Lys His Ile Gly Met Gly Leu Arg Tyr Leu Gln Trp Trp Tyr Arg Lys
115 120 125
Thr His Val Glu Lys Lys Thr Pro Phe Ile Asp Met Ile Asn Ser Val
130 135 140
Pro Leu Arg Gln Ile Tyr Gly Cys Pro Leu Gly Gly Ile Gly Gly Gly
145 150 155 160
Thr Ile Thr Arg Gly Trp Arg Gly Gln Phe Cys Arg Trp Gln Leu Asn
165 170 175
Pro Gly Met Tyr Gln His Arg Thr Val Ile Ala Asp Gln Phe Thr Val
180 185 190
Cys Leu Arg Arg Glu Gly Gln Thr Val Tyr Gln Gln Val Leu Ser Leu
195 200 205
Glu Arg Pro Ser Val Leu Arg Ser Trp Asn Trp Gly Leu Cys Gly Tyr
210 215 220
Phe Ala Phe Tyr His Ala Leu Tyr Pro Arg Ala Trp Thr Val Tyr Gln
225 230 235 240
Leu Pro Gly Gln Asn Val Thr Leu Thr Cys Arg Gln Ile Thr Pro Ile
245 250 255
Leu Pro His Asp Tyr Gln Asp Ser Ser Leu Pro Val Gly Val Phe Val
260 265 270
Trp Asp Val Glu Asn Glu Gly Asp Glu Ala Leu Asp Val Ser Ile Met
275 280 285
Phe Ser Met Arg Asn Gly Leu Gly Gly Gly Asp Asp Ala Pro Gly Gly
290 295 300
Leu Trp Asn Glu Pro Phe Cys Leu Glu Arg Ser Gly Glu Thr Val Arg
305 310 315 320
Gly Leu Leu Leu His His Pro Thr Leu Pro Asn Pro Tyr Thr Met Ala
325 330 335
Val Ala Ala Arg Val Thr Ala Ala Thr Thr Val Thr His Ile Thr Ala
340 345 350
Phe Asp Pro Asp Ser Thr Gly Gln Gln Val Trp Gln Asp Leu Leu Gln
355 360 365
Asp Gly Gln Leu Asp Ser Pro Thr Gly Gln Ser Thr Pro Thr Gln Lys
370 375 380
Gly Val Gly Ile Ala Gly Ala Val Cys Val Ser Ser Lys Leu Arg Pro
385 390 395 400
Arg Gly Gln Cys Arg Leu Glu Phe Ser Leu Ala Trp Asp Met Pro Arg
405 410 415
Ile Met Phe Gly Ala Lys Gly Gln Val His Tyr Arg Arg Tyr Thr Arg
420 425 430
Phe Phe Gly Gln Asp Gly Asp Ala Ala Pro Ala Leu Ser His Tyr Ala
435 440 445
Leu Cys Arg Tyr Ala Glu Trp Glu Glu Arg Ile Ser Ala Trp Gln Ser
450 455 460
Pro Val Leu Asp Asp Arg Ser Leu Pro Ala Trp Tyr Lys Ser Ala Leu
465 470 475 480
Phe Asn Glu Leu Tyr Phe Leu Ala Asp Gly Gly Thr Val Trp Leu Glu
485 490 495
Val Leu Glu Asp Ser Leu Pro Glu Glu Leu Gly Arg Asn Met Cys His
500 505 510
Leu Arg Pro Thr Leu Arg Asp Tyr Gly Arg Phe Gly Tyr Leu Glu Gly
515 520 525
Gln Glu Tyr Arg Met Tyr Asn Thr Tyr Asp Val His Phe Tyr Ala Ser
530 535 540
Phe Ala Leu Ile Met Leu Trp Pro Lys Leu Glu Leu Ser Leu Gln Tyr
545 550 555 560
Asp Met Ala Leu Ala Thr Leu Arg Glu Asp Leu Thr Arg Arg Arg Tyr
565 570 575
Leu Met Ser Gly Val Met Ala Pro Val Lys Arg Arg Asn Val Ile Pro
580 585 590
His Asp Ile Gly Asp Pro Asp Asp Glu Pro Trp Leu Arg Val Asn Ala
595 600 605
Tyr Leu Ile His Asp Thr Ala Asp Trp Lys Asp Leu Asn Leu Lys Phe
610 615 620
Val Leu Gln Val Tyr Arg Asp Tyr Tyr Leu Thr Gly Asp Gln Asn Phe
625 630 635 640
Leu Lys Asp Met Trp Pro Val Cys Leu Ala Val Met Glu Ser Glu Met
645 650 655
Lys Phe Asp Lys Asp His Asp Gly Leu Ile Glu Asn Gly Gly Tyr Ala
660 665 670
Asp Gln Thr Tyr Asp Gly Trp Val Thr Thr Gly Pro Ser Ala Tyr Cys
675 680 685
Gly Gly Leu Trp Leu Ala Ala Val Ala Val Met Val Gln Met Ala Ala
690 695 700
Leu Cys Gly Ala Gln Asp Ile Gln Asp Lys Phe Ser Ser Ile Leu Ser
705 710 715 720
Arg Gly Gln Glu Ala Tyr Glu Arg Leu Leu Trp Asn Gly Arg Tyr Tyr
725 730 735
Asn Tyr Asp Ser Ser Ser Arg Pro Gln Ser Arg Ser Val Met Ser Asp
740 745 750
Gln Cys Ala Gly Gln Trp Phe Leu Lys Ala Cys Gly Leu Gly Glu Gly
755 760 765
Asp Thr Glu Val Phe Pro Thr Gln His Val Val Arg Ala Leu Gln Thr
770 775 780
Ile Phe Glu Leu Asn Val Gln Ala Phe Ala Gly Gly Ala Met Gly Ala
785 790 795 800
Val Asn Gly Met Gln Pro His Gly Val Pro Asp Lys Ser Ser Val Gln
805 810 815
Ser Asp Glu Val Trp Val Gly Val Val Tyr Gly Leu Ala Ala Thr Met
820 825 830
Ile Gln Glu Gly Leu Thr Trp Glu Gly Phe Gln Thr Ala Glu Gly Cys
835 840 845
Tyr Arg Thr Val Trp Glu Arg Leu Gly Leu Ala Phe Gln Thr Pro Glu
850 855 860
Ala Tyr Cys Gln Gln Arg Val Phe Arg Ser Leu Ala Tyr Met Arg Pro
865 870 875 880
Leu Ser Ile Trp Ala Met Gln Leu Ala Leu Gln Gln Gln Gln His Lys
885 890 895
Lys Ala Ser Trp Pro Lys Val Lys Gln Gly Thr Gly Leu Arg Thr Gly
900 905 910
Pro Met Phe Gly Pro Lys Glu Ala Met Ala Asn Leu Ser Pro Glu
915 920 925
<210> 31
<211> 2781
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 31
atgggcaccc aggaccccgg caacatgggc accggcgtgc ccgccagcga gcagatcagc 60
tgcgccaagg aggaccccca ggtgtactgc cccgaggaga ccggcggcac caaggacgtg 120
caggtgaccg actgcaagag ccccgaggac agccgccccc ccaaggagac cgactgctgc 180
aaccccgagg acagcggcca gctgatggtg agctacgagg gcaaggccat gggctaccag 240
gtgcccccct tcggctggcg catctgcctg gcccacgagt tcaccgagaa gcgcaagccc 300
ttccaggcca acaacgtgag cctgagcaac atgatcaagc acatcggcat gggcctgcgc 360
tacctgcagt ggtggtaccg caagacccac gtggagaaga agaccccctt catcgacatg 420
atcaacagcg tgcccctgcg ccagatctac ggctgccccc tgggcggcat cggcggcggc 480
accatcaccc gcggctggcg cggccagttc tgccgctggc agctgaaccc cggcatgtac 540
cagcaccgca ccgtgatcgc cgaccagttc accgtgtgcc tgcgccgcga gggccagacc 600
gtgtaccagc aggtgctgag cctggagcgc cccagcgtgc tgcgcagctg gaactggggc 660
ctgtgcggct acttcgcctt ctaccacgcc ctgtaccccc gcgcctggac cgtgtaccag 720
ctgcccggcc agaacgtgac cctgacctgc cgccagatca cccccatcct gccccacgac 780
taccaggaca gcagcctgcc cgtgggcgtg ttcgtgtggg acgtggagaa cgagggcgac 840
gaggccctgg acgtgagcat catgttcagc atgcgcaacg gcctgggcgg cggcgacgac 900
gcccccggcg gcctgtggaa cgagcccttc tgcctggagc gcagcggcga gaccgtgcgc 960
ggcctgctgc tgcaccaccc caccctgccc aacccctaca ccatggccgt ggccgcccgc 1020
gtgaccgccg ccaccaccgt gacccacatc accgccttcg accccgacag caccggccag 1080
caggtgtggc aggacctgct gcaggacggc cagctggaca gccccaccgg ccagagcacc 1140
cccacccaga agggcgtggg catcgccggc gccgtgtgcg tgagcagcaa gctgcgcccc 1200
cgcggccagt gccgcctgga gttcagcctg gcctgggaca tgccccgcat catgttcggc 1260
gccaagggcc aggtgcacta ccgccgctac acccgcttct tcggccagga cggcgacgcc 1320
gcccccgccc tgagccacta cgccctgtgc cgctacgccg agtgggagga gcgcatcagc 1380
gcctggcaga gccccgtgct ggacgaccgc agcctgcccg cctggtacaa gagcgccctg 1440
ttcaacgagc tgtacttcct ggccgacggc ggcaccgtgt ggctggaggt gctggaggac 1500
agcctgcccg aggagctggg ccgcaacatg tgccacctgc gccccaccct gcgcgactac 1560
ggccgcttcg gctacctgga gggccaggag taccgcatgt acaacaccta cgacgtgcac 1620
ttctacgcca gcttcgccct gatcatgctg tggcccaagc tggagctgag cctgcagtac 1680
gacatggccc tggccaccct gcgcgaggac ctgacccgcc gccgctacct gatgagcggc 1740
gtgatggccc ccgtgaagcg ccgcaacgtg atcccccacg acatcggcga ccccgacgac 1800
gagccctggc tgcgcgtgaa cgcctacctg atccacgaca ccgccgactg gaaggacctg 1860
aacctgaagt tcgtgctgca ggtgtaccgc gactactacc tgaccggcga ccagaacttc 1920
ctgaaggaca tgtggcccgt gtgcctggcc gtgatggaga gcgagatgaa gttcgacaag 1980
gaccacgacg gcctgatcga gaacggcggc tacgccgacc agacctacga cggctgggtg 2040
accaccggcc ccagcgccta ctgcggcggc ctgtggctgg ccgccgtggc cgtgatggtg 2100
cagatggccg ccctgtgcgg cgcccaggac atccaggaca agttcagcag catcctgagc 2160
cgcggccagg aggcctacga gcgcctgctg tggaacggcc gctactacaa ctacgacagc 2220
agcagccgcc cccagagccg cagcgtgatg agcgaccagt gcgccggcca gtggttcctg 2280
aaggcctgcg gcctgggcga gggcgacacc gaggtgttcc ccacccagca cgtggtgcgc 2340
gccctgcaga ccatcttcga gctgaacgtg caggccttcg ccggcggcgc catgggcgcc 2400
gtgaacggca tgcagcccca cggcgtgccc gacaagagca gcgtgcagag cgacgaggtg 2460
tgggtgggcg tggtgtacgg cctggccgcc accatgatcc aggagggcct gacctgggag 2520
ggcttccaga ccgccgaggg ctgctaccgc accgtgtggg agcgcctggg cctggccttc 2580
cagacccccg aggcctactg ccagcagcgc gtgttccgca gcctggccta catgcgcccc 2640
ctgagcatct gggccatgca gctggccctg cagcagcagc agcacaagaa ggccagctgg 2700
cccaaggtga agcagggcac cggcctgcgc accggcccca tgttcggccc caaggaggcc 2760
atggccaacc tgagccccga g 2781
<210> 32
<211> 11264
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 32
ttggccactc cctctctgcg cgctcgctcg ctcactgagg ccgggcgacc aaaggtcgcc 60
cgacgcccgg gctttgcccg ggcggcctca gtgagcgagc gagcgcgcag agagggagtg 120
gccaactcca tcactagggg ttcctgctag ctctgggtat ttaagcccga gtgagcacgc 180
agggtctcca ttttgaagcg ggaggttacg cgttcgtcga ctactagtgg gtaccagagc 240
tccctaggtt ctagaaccgg tgacgtctcc catggtgaag cttggatctg agtaagtcac 300
tgactgtcta tgcctgggaa agggtgggca ggagatgggg cagtgcagga aaagtggcac 360
tatgaaccct cctggtggcg aggggagggg ggtggtcctc gaacgccttg cagaactggc 420
ctggatacag agtggaccgg ctggccccat ctggaagact tcgagataca ctgttgtctt 480
actgcgctca acagtgtatc tcgaagtctt ccaaatggtg ccagccatcg cagcggggtg 540
caggaaatgg gggcagcccc cctttttggc tatccttcca cgtgttcttt tttgtatctt 600
ttgtgtttcc tagaaaacat ctcagtcacc accgcagccc taggaatgca tctagacaat 660
tgtactaacc ttcttctctt tcctctcctg acagtccgga aagccaccat gggcacccag 720
gaccccggca acatgggcac cggcgtgccc gccagcgagc agatcagctg cgccaaggag 780
gacccccagg tgtactgccc cgaggagacc ggcggcacca aggacgtgca ggtgaccgac 840
tgcaagagcc ccgaggacag ccgccccccc aaggagaccg actgctgcaa ccccgaggac 900
agcggccagc tgatggtgag ctacgagggc aaggccatgg gctaccaggt gccccccttc 960
ggctggcgca tctgcctggc ccacgagttc accgagaagc gcaagccctt ccaggccaac 1020
aacgtgagcc tgagcaacat gatcaagcac atcggcatgg gcctgcgcta cctgcagtgg 1080
tggtaccgca agacccacgt ggagaagaag acccccttca tcgacatgat caacagcgtg 1140
cccctgcgcc agatctacgg ctgccccctg ggcggcatcg gcggcggcac catcacccgc 1200
ggctggcgcg gccagttctg ccgctggcag ctgaaccccg gcatgtacca gcaccgcacc 1260
gtgatcgccg accagttcac cgtgtgcctg cgccgcgagg gccagaccgt gtaccagcag 1320
gtgctgagcc tggagcgccc cagcgtgctg cgcagctgga actggggcct gtgcggctac 1380
ttcgccttct accacgccct gtacccccgc gcctggaccg tgtaccagct gcccggccag 1440
aacgtgaccc tgacctgccg ccagatcacc cccatcctgc cccacgacta ccaggacagc 1500
agcctgcccg tgggcgtgtt cgtgtgggac gtggagaacg agggcgacga ggccctggac 1560
gtgagcatca tgttcagcat gcgcaacggc ctgggcggcg gcgacgacgc ccccggcggc 1620
ctgtggaacg agcccttctg cctggagcgc agcggcgaga ccgtgcgcgg cctgctgctg 1680
caccacccca ccctgcccaa cccctacacc atggccgtgg ccgcccgcgt gaccgccgcc 1740
accaccgtga cccacatcac cgccttcgac cccgacagca ccggccagca ggtgtggcag 1800
gacctgctgc aggacggcca gctggacagc cccaccggcc agagcacccc cacccagaag 1860
ggcgtgggca tcgccggcgc cgtgtgcgtg agcagcaagc tgcgcccccg cggccagtgc 1920
cgcctggagt tcagcctggc ctgggacatg ccccgcatca tgttcggcgc caagggccag 1980
gtgcactacc gccgctacac ccgcttcttc ggccaggacg gcgacgccgc ccccgccctg 2040
agccactacg ccctgtgccg ctacgccgag tgggaggagc gcatcagcgc ctggcagagc 2100
cccgtgctgg acgaccgcag cctgcccgcc tggtacaaga gcgccctgtt caacgagctg 2160
tacttcctgg ccgacggcgg caccgtgtgg ctggaggtgc tggaggacag cctgcccgag 2220
gagctgggcc gcaacatgtg ccacctgcgc cccaccctgc gcgactacgg ccgcttcggc 2280
tacctggagg gccaggagta ccgcatgtac aacacctacg acgtgcactt ctacgccagc 2340
ttcgccctga tcatgctgtg gcccaagctg gagctgagcc tgcagtacga catggccctg 2400
gccaccctgc gcgaggacct gacccgccgc cgctacctga tgagcggcgt gatggccccc 2460
gtgaagcgcc gcaacgtgat cccccacgac atcggcgacc ccgacgacga gccctggctg 2520
cgcgtgaacg cctacctgat ccacgacacc gccgactgga aggacctgaa cctgaagttc 2580
gtgctgcagg tgtaccgcga ctactacctg accggcgacc agaacttcct gaaggacatg 2640
tggcccgtgt gcctggccgt gatggagagc gagatgaagt tcgacaagga ccacgacggc 2700
ctgatcgaga acggcggcta cgccgaccag acctacgacg gctgggtgac caccggcccc 2760
agcgcctact gcggcggcct gtggctggcc gccgtggccg tgatggtgca gatggccgcc 2820
ctgtgcggcg cccaggacat ccaggacaag ttcagcagca tcctgagccg cggccaggag 2880
gcctacgagc gcctgctgtg gaacggccgc tactacaact acgacagcag cagccgcccc 2940
cagagccgca gcgtgatgag cgaccagtgc gccggccagt ggttcctgaa ggcctgcggc 3000
ctgggcgagg gcgacaccga ggtgttcccc acccagcacg tggtgcgcgc cctgcagacc 3060
atcttcgagc tgaacgtgca ggccttcgcc ggcggcgcca tgggcgccgt gaacggcatg 3120
cagccccacg gcgtgcccga caagagcagc gtgcagagcg acgaggtgtg ggtgggcgtg 3180
gtgtacggcc tggccgccac catgatccag gagggcctga cctgggaggg cttccagacc 3240
gccgagggct gctaccgcac cgtgtgggag cgcctgggcc tggccttcca gacccccgag 3300
gcctactgcc agcagcgcgt gttccgcagc ctggcctaca tgcgccccct gagcatctgg 3360
gccatgcagc tggccctgca gcagcagcag cacaagaagg ccagctggcc caaggtgaag 3420
cagggcaccg gcctgcgcac cggccccatg ttcggcccca aggaggccat ggccaacctg 3480
agccccgagt gacaattgtt aattaagttt aaaccctcga ggccgcaagc ttatcgataa 3540
tcaacctctg gattacaaaa tttgtgaaag attgactggt attcttaact atgttgctcc 3600
ttttacgcta tgtggatacg ctgctttaat gcctttgtat catgctattg cttcccgtat 3660
ggctttcatt ttctcctcct tgtataaatc ctggttgctg tctctttatg aggagttgtg 3720
gcccgttgtc aggcaacgtg gcgtggtgtg cactgtgttt gctgacgcaa cccccactgg 3780
ttggggcatt gccaccacct gtcagctcct ttccgggact ttcgctttcc ccctccctat 3840
tgccacggcg gaactcatcg ccgcctgcct tgcccgctgc tggacagggg ctcggctgtt 3900
gggcactgac aattccgtgg tgttgtcggg gaaatcatcg tcctttcctt ggctgctcgc 3960
ctgtgttgcc acctggattc tgcgcgggac gtccttctgc tacgtccctt cggccctcaa 4020
tccagcggac cttccttccc gcggcctgct gccggctctg cggcctcttc cgcgtcttcg 4080
ccttcgccct cagacgagtc ggatctccct ttgggccgcc tccccgcatc gataccgtcg 4140
actagagctc gctgatcagc ctcgactgtg ccttctagtt gccagccatc tgttgtttgc 4200
ccctcccccg tgccttcctt gaccctggaa ggtgccactc ccactgtcct ttcctaataa 4260
aatgaggaaa ttgcatcgca ttgtctgagt aggtgtcatt ctattctggg gggtggggtg 4320
gggcaggaca gcaaggggga ggattgggaa gacaatagca ggcatgctgg ggagagatcc 4380
acgataacaa acagcttttt tggggtgaac atattgactg aattccctgc aggttggcca 4440
ctccctctct gcgcgctcgc tcgctcactg aggccgcccg ggcaaagccc gggcgtcggg 4500
cgacctttgg tcgcccggcc tcagtgagcg agcgagcgcg cagagaggga gtggccaact 4560
ccatcactag gggttcctgc ggccgctcgt acggtctcga ggaattcctg caggataact 4620
tgccaacctc attctaaaat gtatatagaa gcccaaaaga caataacaaa aatattcttg 4680
tagaacaaaa tgggaaagaa tgttccacta aatatcaaga tttagagcaa agcatgagat 4740
gtgtggggat agacagtgag gctgataaaa tagagtagag ctcagaaaca gacccattga 4800
tatatgtaag tgacctatga aaaaaatatg gcattttaca atgggaaaat gatggtcttt 4860
ttctttttta gaaaaacagg gaaatatatt tatatgtaaa aaataaaagg gaacccatat 4920
gtcataccat acacacaaaa aaattccagt gaattataag tctaaatgga gaaggcaaaa 4980
ctttaaatct tttagaaaat aatatagaag catgcagacc agcctggcca acatgatgaa 5040
accctctcta ctaataataa aatcagtaga actactcagg actactttga gtgggaagtc 5100
cttttctatg aagacttctt tggccaaaat taggctctaa atgcaaggag atagtgcatc 5160
atgcctggct gcacttactg ataaatgatg ttatcaccat ctttaaccaa atgcacagga 5220
acaagttatg gtactgatgt gctggattga gaaggagctc tacttccttg acaggacaca 5280
tttgtatcaa cttaaaaaag cagatttttg ccagcagaac tattcattca gaggtaggaa 5340
acttagaata gatgatgtca ctgattagca tggcttcccc atctccacag ctgcttccca 5400
cccaggttgc ccacagttga gtttgtccag tgctcagggc tgcccactct cagtaagaag 5460
ccccacacca gcccctctcc aaatatgttg gctgttcctt ccattaaagt gaccccactt 5520
tagagcagca agtggatttc tgtttcttac agttcaggaa ggaggagtca gctgtgagaa 5580
cctggagcct gagatgcttc taagtcccac tgctactggg gtcagggaag ccagactcca 5640
gcatcagcag tcaggagcac taagcccttg ccaacatcct gtttctcaga gaaactgctt 5700
ccattataat ggttgtcctt ttttaagcta tcaagccaaa caaccagtgt ctaccattat 5760
tctcatcacc tgaagccaag ggttctagca aaagtcaagc tgtcttgtaa tggttgatgt 5820
gcctccagct tctgtcttca gtcactccac tcttagcctg ctctgaatca actctgacca 5880
cagttccctg gagcccctgc cacctgctgc ccctgccacc ttctccatct gcagtgctgt 5940
gcagccttct gcactcttgc agagctaata ggtggagact tgaaggaaga ggaggaaagt 6000
ttctcataat agccttgctg caagctcaaa tgggaggtgg gcactgtgcc caggagcctt 6060
ggagcaaagg ctgtgcccaa cctctgactg catccaggtt tggtcttgac agagataaga 6120
agccctggct tttggagcca aaatctaggt cagacttagg caggattctc aaagtttatc 6180
agcagaacat gaggcagaag accctttctg ctccagcttc ttcaggctca accttcatca 6240
gaatagatag aaagagaggc tgtgagggtt cttaaaacag aagcaaatct gactcagaga 6300
ataaacaacc tcctagtaaa ctacagctta gacagagcat ctggtggtga gtgtgctcag 6360
tgtcctactc aactgtctgg tatcagccct catgaggact tctcttcttt ccctcataga 6420
cctccatctc tgttttcctt agcctgcaga aatctggatg gctattcaca gaatgcctgt 6480
gctttcagag ttgcattttt tctctggtat tctggttcaa gcatttgaag gtaggaaagg 6540
ttctccaagt gcaagaaagc cagccctgag cctcaactgc ctggctagtg tggtcagtag 6600
gatgcaaagg ctgttgaatg ccacaaggcc aaactttaac ctgtgtacca caagcctagc 6660
agcagaggca gctctgctca ctggaactct ctgtcttctt tctcctgagc cttttctttt 6720
cctgagtttt ctagctctcc tcaaccttac ctctgcccta cccaggacaa acccaagagc 6780
cactgtttct gtgatgtcct ctccagccct aattaggcat catgacttca gcctgacctt 6840
ccatgctcag aagcagtgct aatccacttc agatgagctg ctctatgcaa cacaggcaga 6900
gcctacaaac ctttgcacca gagccctcca catatcagtg tttgttcata ctcacttcaa 6960
cagcaaatgt gactgctgag attaagattt tacacaagat ggtctgtaat ttcacagtta 7020
gttttatccc attaggtatg aaagaattag cataattccc cttaaacatg aatgaatctt 7080
agatttttta ataaatagtt ttggaagtaa agacagagac atcaggagca caaggaatag 7140
cctgagagga caaacagaac aagaaagagt ctggaaatac acaggatgtt cttggcctcc 7200
tcaaagcaag tgcaagcaga tagtaccagc agccccaggc tatcagagcc cagtgaagag 7260
aagtaccatg aaagccacag ctctaaccac cctgttccag agtgacagac agtccccaag 7320
acaagccagc ctgagccaga gagagaactg caagagaaag tttctaattt aggttctgtt 7380
agattcagac aagtgcaggt catcctctct ccacagctac tcacctctcc agcctaacaa 7440
agcctgcagt ccacactcca accctggtgt ctcacctcct agcctctccc aacatcctgc 7500
tctctgacca tcttctgcat ctctcatctc accatctccc actgtctaca gcctactctt 7560
gcaactacca tctcattttc tgacatcctg tctacatctt ctgccatact ctgccatcta 7620
ccataccacc tcttaccatc taccacacca tcttttatct ccatccctct cagaagcctc 7680
caagctgaat cctgctttat gtgttcatct cagcccctgc atggaaagct gaccccagag 7740
gcagaactat tcccagagag cttggccaag aaaaacaaaa ctaccagcct ggccaggctc 7800
aggagtagta agctgcagtg tctgttgtgt tctagcttca acagctgcag gagttccact 7860
ctcaaatgct ccacatttct cacatcctcc tgattctggt cactacccat cttcaaagaa 7920
cagaatatct cacatcagca tactgtgaag gactagtcat gggtgcagct gctcagagct 7980
gcaaagtcat tctggatggt ggagagctta caaacatttc atgatgctcc ccccgctctg 8040
atggctggag cccaatccct acacagactc ctgctgtatg tgttttcctt tcactctgag 8100
ccacagccag agggcaggca ttcagtctcc tcttcaggct ggggctgggg cactgagaac 8160
tcacccaaca ccttgctctc actccttctg caaaacaaga aagagctttg tgctgcagta 8220
gccatgaaga atgaaaggaa ggctttaact aaaaaatgtc agagattatt ttcaacccct 8280
tactgtggat caccagcaag gaggaaacac aacacagaga cattttttcc cctcaaatta 8340
tcaaaagaat cactgcattt gttaaagaga gcaactgaat caggaagcag agttttgaac 8400
atatcagaag ttaggaatct gcatcagaga caaatgcagt catggttgtt tgctgcatac 8460
cagccctaat cattagaagc ctcatggact tcaaacatca ttccctctga caagatgctc 8520
tagcctaact ccatgagata aaataaatct gcctttcaga gccaaagaag agtccaccag 8580
cttcttctca gtgtgaacaa gagctccagt caggttagtc agtccagtgc agtagaggag 8640
accagtctgc atcctctaat tttcaaaggc aagaagattt gtttaccctg gacaccaggc 8700
acaagtgagg tcacagagct cttagatatg cagtcctcat gagtgaggag actaaagcgc 8760
atgccatcaa gacttcagtg tagagaaaac ctccaaaaaa gcctcctcac tacttctgga 8820
atagctcaga ggccgaggcg gcctcggcct ctgcataaat aaaaaaaatt agtcagccat 8880
ggggcggaga atgggcggaa ctgggcggag ttaggggcgg gatgggcgga gttaggggcg 8940
ggactatggt tgctgactaa ttgagatgca tgctttgcat acttctgcct gctggggagc 9000
ctggggactt tccacacctg gttgctgact aattgagatg catgctttgc atacttctgc 9060
ctgctgggga gcctggggac tttccacacc ctaactgaca cacattccac agctgcatta 9120
atgaatcggc caacgcgcgg ggagaggcgg tttgcgtatt gggcgctctt ccgcttcctc 9180
gctcactgac tcgctgcgct cggtcgttcg gctgcggcga gcggtatcag ctcactcaaa 9240
ggcggtaata cggttatcca cagaatcagg ggataacgca ggaaagaaca tgtgagcaaa 9300
aggccagcaa aaggccagga accgtaaaaa ggccgcgttg ctggcgtttt tccataggct 9360
ccgcccccct gacgagcatc acaaaaatcg acgctcaagt cagaggtggc gaaacccgac 9420
aggactataa agataccagg cgtttccccc tggaagctcc ctcgtgcgct ctcctgttcc 9480
gaccctgccg cttaccggat acctgtccgc ctttctccct tcgggaagcg tggcgctttc 9540
tcatagctca cgctgtaggt atctcagttc ggtgtaggtc gttcgctcca agctgggctg 9600
tgtgcacgaa ccccccgttc agcccgaccg ctgcgcctta tccggtaact atcgtcttga 9660
gtccaacccg gtaagacacg acttatcgcc actggcagca gccactggta acaggattag 9720
cagagcgagg tatgtaggcg gtgctacaga gttcttgaag tggtggccta actacggcta 9780
cactagaaga acagtatttg gtatctgcgc tctgctgaag ccagttacct tcggaaaaag 9840
agttggtagc tcttgatccg gcaaacaaac caccgctggt agcggtggtt tttttgtttg 9900
caagcagcag attacgcgca gaaaaaaagg atctcaagaa gatcctttga tcttttctac 9960
ggggtctgac gctcagtgga acgaaaactc acgttaaggg attttggtca tgagattatc 10020
aaaaaggatc ttcacctaga tccttttaaa ttaaaaatga agttttaaat caatctaaag 10080
tatatatgag taaacttggt ctgacagtta ccaatgctta atcagtgagg cacctatctc 10140
agcgatctgt ctatttcgtt catccatagt tgcctgactc ctgcaaacca cgttgtgtct 10200
caaaatctct gatgttacat tgcacaagat aaaaatatat catcatgaac aataaaactg 10260
tctgcttaca taaacagtaa tacaaggggt gttatgagcc atattcaacg ggaaacgtct 10320
tgctcgaggc cgcgattaaa ttccaacatg gatgctgatt tatatgggta taaatgggct 10380
cgcgataatg tcgggcaatc aggtgcgaca atctatcgat tgtatgggaa gcccgatgcg 10440
ccagagttgt ttctgaaaca tggcaaaggt agcgttgcca atgatgttac agatgagatg 10500
gtcagactaa actggctgac ggaatttatg cctcttccga ccatcaagca ttttatccgt 10560
actcctgatg atgcatggtt actcaccact gcgatccccg ggaaaacagc attccaggta 10620
ttagaagaat atcctgattc aggtgaaaat attgttgatg cgctggcagt gttcctgcgc 10680
cggttgcatt cgattcctgt ttgtaattgt ccttttaaca gcgatcgcgt atttcgtctc 10740
gctcaggcgc aatcacgaat gaataacggt ttggttgatg cgagtgattt tgatgacgag 10800
cgtaatggct ggcctgttga acaagtctgg aaagaaatgc ataagctttt gccattctca 10860
ccggattcag tcgtcactca tggtgatttc tcacttgata accttatttt tgacgagggg 10920
aaattaatag gttgtattga tgttggacga gtcggaatcg cagaccgata ccaggatctt 10980
gccatcctat ggaactgcct cggtgagttt tctccttcat tacagaaacg gctttttcaa 11040
aaatatggta ttgataatcc tgatatgaat aaattgcagt ttcatttgat gctcgatgag 11100
tttttctaag ggcggcctgc caccataccc acgccgaaac aagcgctcat gagcccgaag 11160
tggcgagccc gatcttcccc atcggtgatg tcggcgatat aggcgccagc aaccgcacct 11220
gtggcgccgg tgatgagggc gcgccaagtc gacgtccggc agtc 11264
<210> 33
<211> 685
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 33
Met Ala Glu Trp Leu Leu Ser Ala Ser Trp Gln Arg Arg Ala Lys Ala
1 5 10 15
Met Thr Ala Ala Ala Gly Ser Ala Gly Arg Ala Ala Val Pro Leu Leu
20 25 30
Leu Cys Ala Leu Leu Ala Pro Gly Gly Ala Tyr Val Leu Asp Asp Ser
35 40 45
Asp Gly Leu Gly Arg Glu Phe Asp Gly Ile Gly Ala Val Ser Gly Gly
50 55 60
Gly Ala Thr Ser Arg Leu Leu Val Asn Tyr Pro Glu Pro Tyr Arg Ser
65 70 75 80
Gln Ile Leu Asp Tyr Leu Phe Lys Pro Asn Phe Gly Ala Ser Leu His
85 90 95
Ile Leu Lys Val Glu Ile Gly Gly Asp Gly Gln Thr Thr Asp Gly Thr
100 105 110
Glu Pro Ser His Met His Tyr Ala Leu Asp Glu Asn Tyr Phe Arg Gly
115 120 125
Tyr Glu Trp Trp Leu Met Lys Glu Ala Lys Lys Arg Asn Pro Asn Ile
130 135 140
Thr Leu Ile Gly Leu Pro Trp Ser Phe Pro Gly Trp Leu Gly Lys Gly
145 150 155 160
Phe Asp Trp Pro Tyr Val Asn Leu Gln Leu Thr Ala Tyr Tyr Val Val
165 170 175
Thr Trp Ile Val Gly Ala Lys Arg Tyr His Asp Leu Asp Ile Asp Tyr
180 185 190
Ile Gly Ile Trp Asn Glu Arg Ser Tyr Asn Ala Asn Tyr Ile Lys Ile
195 200 205
Leu Arg Lys Met Leu Asn Tyr Gln Gly Leu Gln Arg Val Lys Ile Ile
210 215 220
Ala Ser Asp Asn Leu Trp Glu Ser Ile Ser Ala Ser Met Leu Leu Asp
225 230 235 240
Ala Glu Leu Phe Lys Val Val Asp Val Ile Gly Ala His Tyr Pro Gly
245 250 255
Thr His Ser Ala Lys Asp Ala Lys Leu Thr Gly Lys Lys Leu Trp Ser
260 265 270
Ser Glu Asp Phe Ser Thr Leu Asn Ser Asp Met Gly Ala Gly Cys Trp
275 280 285
Gly Arg Ile Leu Asn Gln Asn Tyr Ile Asn Gly Tyr Met Thr Ser Thr
290 295 300
Ile Ala Trp Asn Leu Val Ala Ser Tyr Tyr Glu Gln Leu Pro Tyr Gly
305 310 315 320
Arg Cys Gly Leu Met Thr Ala Gln Glu Pro Trp Ser Gly His Tyr Val
325 330 335
Val Glu Ser Pro Val Trp Val Ser Ala His Thr Thr Gln Phe Thr Gln
340 345 350
Pro Gly Trp Tyr Tyr Leu Lys Thr Val Gly His Leu Glu Lys Gly Gly
355 360 365
Ser Tyr Val Ala Leu Thr Asp Gly Leu Gly Asn Leu Thr Ile Ile Ile
370 375 380
Glu Thr Met Ser His Lys His Ser Lys Cys Ile Arg Pro Phe Leu Pro
385 390 395 400
Tyr Phe Asn Val Ser Gln Gln Phe Ala Thr Phe Val Leu Lys Gly Ser
405 410 415
Phe Ser Glu Ile Pro Glu Leu Gln Val Trp Tyr Thr Lys Leu Gly Lys
420 425 430
Thr Ser Glu Arg Phe Leu Phe Lys Gln Leu Asp Ser Leu Trp Leu Leu
435 440 445
Asp Ser Asp Gly Ser Phe Thr Leu Ser Leu His Glu Asp Glu Leu Phe
450 455 460
Thr Leu Thr Thr Leu Thr Thr Gly Arg Lys Gly Ser Tyr Pro Leu Pro
465 470 475 480
Pro Lys Ser Gln Pro Phe Pro Ser Thr Tyr Lys Asp Asp Phe Asn Val
485 490 495
Asp Tyr Pro Phe Phe Ser Glu Ala Pro Asn Phe Ala Asp Gln Thr Gly
500 505 510
Val Phe Glu Tyr Phe Thr Asn Ile Glu Asp Pro Gly Glu His His Phe
515 520 525
Thr Leu Arg Gln Val Leu Asn Gln Arg Pro Ile Thr Trp Ala Ala Asp
530 535 540
Ala Ser Asn Thr Ile Ser Ile Ile Gly Asp Tyr Asn Trp Thr Asn Leu
545 550 555 560
Thr Ile Lys Cys Asp Val Tyr Ile Glu Thr Pro Asp Thr Gly Gly Val
565 570 575
Phe Ile Ala Gly Arg Val Asn Lys Gly Gly Ile Leu Ile Arg Ser Ala
580 585 590
Arg Gly Ile Phe Phe Trp Ile Phe Ala Asn Gly Ser Tyr Arg Val Thr
595 600 605
Gly Asp Leu Ala Gly Trp Ile Ile Tyr Ala Leu Gly Arg Val Glu Val
610 615 620
Thr Ala Lys Lys Trp Tyr Thr Leu Thr Leu Thr Ile Lys Gly His Phe
625 630 635 640
Thr Ser Gly Met Leu Asn Asp Lys Ser Leu Trp Thr Asp Ile Pro Val
645 650 655
Asn Phe Pro Lys Asn Gly Trp Ala Ala Ile Gly Thr His Ser Phe Glu
660 665 670
Phe Ala Gln Phe Asp Asn Phe Leu Val Glu Ala Thr Arg
675 680 685
<210> 34
<211> 2055
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 34
atggccgagt ggctgctgag cgccagctgg cagcgccgcg ccaaggccat gaccgccgcc 60
gccggcagcg ccggccgcgc cgccgtgccc ctgctgctgt gcgccctgct ggcccccggc 120
ggcgcctacg tgctggacga cagcgacggc ctgggccgcg agttcgacgg catcggcgcc 180
gtgagcggcg gcggcgccac cagccgcctg ctggtgaact accccgagcc ctaccgcagc 240
cagatcctgg actacctgtt caagcccaac ttcggcgcca gcctgcacat cctgaaggtg 300
gagatcggcg gcgacggcca gaccaccgac ggcaccgagc ccagccacat gcactacgcc 360
ctggacgaga actacttccg cggctacgag tggtggctga tgaaggaggc caagaagcgc 420
aaccccaaca tcaccctgat cggcctgccc tggagcttcc ccggctggct gggcaagggc 480
ttcgactggc cctacgtgaa cctgcagctg accgcctact acgtggtgac ctggatcgtg 540
ggcgccaagc gctaccacga cctggacatc gactacatcg gcatctggaa cgagcgcagc 600
tacaacgcca actacatcaa gatcctgcgc aagatgctga actaccaggg cctgcagcgc 660
gtgaagatca tcgccagcga caacctgtgg gagagcatca gcgccagcat gctgctggac 720
gccgagctgt tcaaggtggt ggacgtgatc ggcgcccact accccggcac ccacagcgcc 780
aaggacgcca agctgaccgg caagaagctg tggagcagcg aggacttcag caccctgaac 840
agcgacatgg gcgccggctg ctggggccgc atcctgaacc agaactacat caacggctac 900
atgaccagca ccatcgcctg gaacctggtg gccagctact acgagcagct gccctacggc 960
cgctgcggcc tgatgaccgc ccaggagccc tggagcggcc actacgtggt ggagagcccc 1020
gtgtgggtga gcgcccacac cacccagttc acccagcccg gctggtacta cctgaagacc 1080
gtgggccacc tggagaaggg cggcagctac gtggccctga ccgacggcct gggcaacctg 1140
accatcatca tcgagaccat gagccacaag cacagcaagt gcatccgccc cttcctgccc 1200
tacttcaacg tgagccagca gttcgccacc ttcgtgctga agggcagctt cagcgagatc 1260
cccgagctgc aggtgtggta caccaagctg ggcaagacca gcgagcgctt cctgttcaag 1320
cagctggaca gcctgtggct gctggacagc gacggcagct tcaccctgag cctgcacgag 1380
gacgagctgt tcaccctgac caccctgacc accggccgca agggcagcta ccccctgccc 1440
cccaagagcc agcccttccc cagcacctac aaggacgact tcaacgtgga ctaccccttc 1500
ttcagcgagg cccccaactt cgccgaccag accggcgtgt tcgagtactt caccaacatc 1560
gaggaccccg gcgagcacca cttcaccctg cgccaggtgc tgaaccagcg ccccatcacc 1620
tgggccgccg acgccagcaa caccatcagc atcatcggcg actacaactg gaccaacctg 1680
accatcaagt gcgacgtgta catcgagacc cccgacaccg gcggcgtgtt catcgccggc 1740
cgcgtgaaca agggcggcat cctgatccgc agcgcccgcg gcatcttctt ctggatcttc 1800
gccaacggca gctaccgcgt gaccggcgac ctggccggct ggatcatcta cgccctgggc 1860
cgcgtggagg tgaccgccaa gaagtggtac accctgaccc tgaccatcaa gggccacttc 1920
accagcggca tgctgaacga caagagcctg tggaccgaca tccccgtgaa cttccccaag 1980
aacggctggg ccgccatcgg cacccacagc ttcgagttcg cccagttcga caacttcctg 2040
gtggaggcca cccgc 2055
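The relationship between a nucleotide record and the protein record before it can be checked mechanically. The 2055 nucleotides declared for SEQ ID NO: 34 make up 685 codons, the same as the 685-residue length declared for SEQ ID NO: 33. The following is a minimal sketch, not part of the listing itself, that strips the spacing and position counters from printed sequence lines and translates the result. It assumes the standard genetic code and reading frame 1. The helper names `strip_counters` and `translate` are illustrative only.

```python
import re

# Standard genetic code (assumed), codons ordered with bases t, c, a, g.
BASES = "tcag"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def strip_counters(listing_lines):
    """Join printed sequence lines, keeping only a/c/g/t characters.

    Pass sequence lines only (not the <210>..<400> header fields); spaces
    and the trailing position counter on each line are discarded.
    """
    return "".join(re.sub(r"[^acgt]", "", line.lower()) for line in listing_lines)


def translate(dna):
    """Translate complete codons starting at position 0.

    Stop codons are shown as '*'; any codon not in the table becomes 'X'.
    """
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE.get(dna[i:i + 3], "X") for i in range(0, usable, 3))


# First printed line of SEQ ID NO: 34, copied as it appears in the listing.
first_line = ["atggccgagt ggctgctgag cgccagctgg cagcgccgcg ccaaggccat gaccgccgcc 60"]
dna = strip_counters(first_line)
protein = translate(dna)
print(len(dna), protein)
```

Under these assumptions, this line yields the one-letter equivalents of the first 20 residues printed for SEQ ID NO: 33 (Met Ala Glu Trp Leu ...). The same check can be run over a whole record by passing every sequence line between one `<400>` field and the next `<210>` field.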
<210> 35
<211> 339
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 35
Met Trp Gln Leu Trp Ala Ser Leu Cys Cys Leu Leu Val Leu Ala Asn
1 5 10 15
Ala Arg Ser Arg Pro Ser Phe His Pro Leu Ser Asp Glu Leu Val Asn
20 25 30
Tyr Val Asn Lys Arg Asn Thr Thr Trp Gln Ala Gly His Asn Phe Tyr
35 40 45
Asn Val Asp Met Ser Tyr Leu Lys Arg Leu Cys Gly Thr Phe Leu Gly
50 55 60
Gly Pro Lys Pro Pro Gln Arg Val Met Phe Thr Glu Asp Leu Lys Leu
65 70 75 80
Pro Ala Ser Phe Asp Ala Arg Glu Gln Trp Pro Gln Cys Pro Thr Ile
85 90 95
Lys Glu Ile Arg Asp Gln Gly Ser Cys Gly Ser Cys Trp Ala Phe Gly
100 105 110
Ala Val Glu Ala Ile Ser Asp Arg Ile Cys Ile His Thr Asn Ala His
115 120 125
Val Ser Val Glu Val Ser Ala Glu Asp Leu Leu Thr Cys Cys Gly Ser
130 135 140
Met Cys Gly Asp Gly Cys Asn Gly Gly Tyr Pro Ala Glu Ala Trp Asn
145 150 155 160
Phe Trp Thr Arg Lys Gly Leu Val Ser Gly Gly Leu Tyr Glu Ser His
165 170 175
Val Gly Cys Arg Pro Tyr Ser Ile Pro Pro Cys Glu His His Val Asn
180 185 190
Gly Ser Arg Pro Pro Cys Thr Gly Glu Gly Asp Thr Pro Lys Cys Ser
195 200 205
Lys Ile Cys Glu Pro Gly Tyr Ser Pro Thr Tyr Lys Gln Asp Lys His
210 215 220
Tyr Gly Tyr Asn Ser Tyr Ser Val Ser Asn Ser Glu Lys Asp Ile Met
225 230 235 240
Ala Glu Ile Tyr Lys Asn Gly Pro Val Glu Gly Ala Phe Ser Val Tyr
245 250 255
Ser Asp Phe Leu Leu Tyr Lys Ser Gly Val Tyr Gln His Val Thr Gly
260 265 270
Glu Met Met Gly Gly His Ala Ile Arg Ile Leu Gly Trp Gly Val Glu
275 280 285
Asn Gly Thr Pro Tyr Trp Leu Val Ala Asn Ser Trp Asn Thr Asp Trp
290 295 300
Gly Asp Asn Gly Phe Phe Lys Ile Leu Arg Gly Gln Asp His Cys Gly
305 310 315 320
Ile Glu Ser Glu Val Val Ala Gly Ile Pro Arg Thr Asp Gln Tyr Trp
325 330 335
Glu Lys Ile
<210> 36
<211> 1017
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 36
atgtggcagc tgtgggccag cctgtgctgc ctgctggtgc tggccaacgc ccgcagccgc 60
cccagcttcc accccctgag cgacgagctg gtgaactacg tgaacaagcg caacaccacc 120
tggcaggccg gccacaactt ctacaacgtg gacatgagct acctgaagcg cctgtgcggc 180
accttcctgg gcggccccaa gcccccccag cgcgtgatgt tcaccgagga cctgaagctg 240
cccgccagct tcgacgcccg cgagcagtgg ccccagtgcc ccaccatcaa ggagatccgc 300
gaccagggca gctgcggcag ctgctgggcc ttcggcgccg tggaggccat cagcgaccgc 360
atctgcatcc acaccaacgc ccacgtgagc gtggaggtga gcgccgagga cctgctgacc 420
tgctgcggca gcatgtgcgg cgacggctgc aacggcggct accccgccga ggcctggaac 480
ttctggaccc gcaagggcct ggtgagcggc ggcctgtacg agagccacgt gggctgccgc 540
ccctacagca tccccccctg cgagcaccac gtgaacggca gccgcccccc ctgcaccggc 600
gagggcgaca cccccaagtg cagcaagatc tgcgagcccg gctacagccc cacctacaag 660
caggacaagc actacggcta caacagctac agcgtgagca acagcgagaa ggacatcatg 720
gccgagatct acaagaacgg ccccgtggag ggcgccttca gcgtgtacag cgacttcctg 780
ctgtacaaga gcggcgtgta ccagcacgtg accggcgaga tgatgggcgg ccacgccatc 840
cgcatcctgg gctggggcgt ggagaacggc accccctact ggctggtggc caacagctgg 900
aacaccgact ggggcgacaa cggcttcttc aagatcctgc gcggccagga ccactgcggc 960
atcgagagcg aggtggtggc cggcatcccc cgcaccgacc agtactggga gaagatc 1017
<210> 37
<211> 631
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 37
Met Pro Arg Tyr Gly Ala Ser Leu Arg Gln Ser Cys Pro Arg Ser Gly
1 5 10 15
Arg Glu Gln Gly Gln Asp Gly Thr Ala Gly Ala Pro Gly Leu Leu Trp
20 25 30
Met Gly Leu Val Leu Ala Leu Ala Leu Ala Leu Ala Leu Ala Leu Ala
35 40 45
Leu Ser Asp Ser Arg Val Leu Trp Ala Pro Ala Glu Ala His Pro Leu
50 55 60
Ser Pro Gln Gly His Pro Ala Arg Leu His Arg Ile Val Pro Arg Leu
65 70 75 80
Arg Asp Val Phe Gly Trp Gly Asn Leu Thr Cys Pro Ile Cys Lys Gly
85 90 95
Leu Phe Thr Ala Ile Asn Leu Gly Leu Lys Lys Glu Pro Asn Val Ala
100 105 110
Arg Val Gly Ser Val Ala Ile Lys Leu Cys Asn Leu Leu Lys Ile Ala
115 120 125
Pro Pro Ala Val Cys Gln Ser Ile Val His Leu Phe Glu Asp Asp Met
130 135 140
Val Glu Val Trp Arg Arg Ser Val Leu Ser Pro Ser Glu Ala Cys Gly
145 150 155 160
Leu Leu Leu Gly Ser Thr Cys Gly His Trp Asp Ile Phe Ser Ser Trp
165 170 175
Asn Ile Ser Leu Pro Thr Val Pro Lys Pro Pro Pro Lys Pro Pro Ser
180 185 190
Pro Pro Ala Pro Gly Ala Pro Val Ser Arg Ile Leu Phe Leu Thr Asp
195 200 205
Leu His Trp Asp His Asp Tyr Leu Glu Gly Thr Asp Pro Asp Cys Ala
210 215 220
Asp Pro Leu Cys Cys Arg Arg Gly Ser Gly Leu Pro Pro Ala Ser Arg
225 230 235 240
Pro Gly Ala Gly Tyr Trp Gly Glu Tyr Ser Lys Cys Asp Leu Pro Leu
245 250 255
Arg Thr Leu Glu Ser Leu Leu Ser Gly Leu Gly Pro Ala Gly Pro Phe
260 265 270
Asp Met Val Tyr Trp Thr Gly Asp Ile Pro Ala His Asp Val Trp His
275 280 285
Gln Thr Arg Gln Asp Gln Leu Arg Ala Leu Thr Thr Val Thr Ala Leu
290 295 300
Val Arg Lys Phe Leu Gly Pro Val Pro Val Tyr Pro Ala Val Gly Asn
305 310 315 320
His Glu Ser Thr Pro Val Asn Ser Phe Pro Pro Pro Phe Ile Glu Gly
325 330 335
Asn His Ser Ser Arg Trp Leu Tyr Glu Ala Met Ala Lys Ala Trp Glu
340 345 350
Pro Trp Leu Pro Ala Glu Ala Leu Arg Thr Leu Arg Ile Gly Gly Phe
355 360 365
Tyr Ala Leu Ser Pro Tyr Pro Gly Leu Arg Leu Ile Ser Leu Asn Met
370 375 380
Asn Phe Cys Ser Arg Glu Asn Phe Trp Leu Leu Ile Asn Ser Thr Asp
385 390 395 400
Pro Ala Gly Gln Leu Gln Trp Leu Val Gly Glu Leu Gln Ala Ala Glu
405 410 415
Asp Arg Gly Asp Lys Val His Ile Ile Gly His Ile Pro Pro Gly His
420 425 430
Cys Leu Lys Ser Trp Ser Trp Asn Tyr Tyr Arg Ile Val Ala Arg Tyr
435 440 445
Glu Asn Thr Leu Ala Ala Gln Phe Phe Gly His Thr His Val Asp Glu
450 455 460
Phe Glu Val Phe Tyr Asp Glu Glu Thr Leu Ser Arg Pro Leu Ala Val
465 470 475 480
Ala Phe Leu Ala Pro Ser Ala Thr Thr Tyr Ile Gly Leu Asn Pro Gly
485 490 495
Tyr Arg Val Tyr Gln Ile Asp Gly Asn Tyr Ser Gly Ser Ser His Val
500 505 510
Val Leu Asp His Glu Thr Tyr Ile Leu Asn Leu Thr Gln Ala Asn Ile
515 520 525
Pro Gly Ala Ile Pro His Trp Gln Leu Leu Tyr Arg Ala Arg Glu Thr
530 535 540
Tyr Gly Leu Pro Asn Thr Leu Pro Thr Ala Trp His Asn Leu Val Tyr
545 550 555 560
Arg Met Arg Gly Asp Met Gln Leu Phe Gln Thr Phe Trp Phe Leu Tyr
565 570 575
His Lys Gly His Pro Pro Ser Glu Pro Cys Gly Thr Pro Cys Arg Leu
580 585 590
Ala Thr Leu Cys Ala Gln Leu Ser Ala Arg Ala Asp Ser Pro Ala Leu
595 600 605
Cys Arg His Leu Met Pro Asp Gly Ser Leu Pro Glu Ala Gln Ser Leu
610 615 620
Trp Pro Arg Pro Leu Phe Cys
625 630
<210> 38
<211> 1896
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 38
atgccccgct acggcgccag cctgcgccag agctgccccc gcagcggccg cgagcagggc 60
caggacggca ccgccggcgc ccccggcctg ctgtggatgg gcctggtgct ggccctggcc 120
ctggccctgg ccctggccct ggccctgagc gacagccgcg tgctgtgggc ccccgccgag 180
gcccaccccc tgagccccca gggccacccc gcccgcctgc accgcatcgt gccccgcctg 240
cgcgacgtgt tcggctgggg caacctgacc tgccccatct gcaagggcct gttcaccgcc 300
atcaacctgg gcctgaagaa ggagcccaac gtggcccgcg tgggcagcgt ggccatcaag 360
ctgtgcaacc tgctgaagat cgcccccccc gccgtgtgcc agagcatcgt gcacctgttc 420
gaggacgaca tggtggaggt gtggcgccgc agcgtgctga gccccagcga ggcctgcggc 480
ctgctgctgg gcagcacctg cggccactgg gacatcttca gcagctggaa catcagcctg 540
cccaccgtgc ccaagccccc ccccaagccc cccagccccc ccgcccccgg cgcccccgtg 600
agccgcatcc tgttcctgac cgacctgcac tgggaccacg actacctgga gggcaccgac 660
cccgactgcg ccgaccccct gtgctgccgc cgcggcagcg gcctgccccc cgccagccgc 720
cccggcgccg gctactgggg cgagtacagc aagtgcgacc tgcccctgcg caccctggag 780
agcctgctga gcggcctggg ccccgccggc cccttcgaca tggtgtactg gaccggcgac 840
atccccgccc acgacgtgtg gcaccagacc cgccaggacc agctgcgcgc cctgaccacc 900
gtgaccgccc tggtgcgcaa gttcctgggc cccgtgcccg tgtaccccgc cgtgggcaac 960
cacgagagca cccccgtgaa cagcttcccc ccccccttca tcgagggcaa ccacagcagc 1020
cgctggctgt acgaggccat ggccaaggcc tgggagccct ggctgcccgc cgaggccctg 1080
cgcaccctgc gcatcggcgg cttctacgcc ctgagcccct accccggcct gcgcctgatc 1140
agcctgaaca tgaacttctg cagccgcgag aacttctggc tgctgatcaa cagcaccgac 1200
cccgccggcc agctgcagtg gctggtgggc gagctgcagg ccgccgagga ccgcggcgac 1260
aaggtgcaca tcatcggcca catccccccc ggccactgcc tgaagagctg gagctggaac 1320
tactaccgca tcgtggcccg ctacgagaac accctggccg cccagttctt cggccacacc 1380
cacgtggacg agttcgaggt gttctacgac gaggagaccc tgagccgccc cctggccgtg 1440
gccttcctgg cccccagcgc caccacctac atcggcctga accccggcta ccgcgtgtac 1500
cagatcgacg gcaactacag cggcagcagc cacgtggtgc tggaccacga gacctacatc 1560
ctgaacctga cccaggccaa catccccggc gccatccccc actggcagct gctgtaccgc 1620
gcccgcgaga cctacggcct gcccaacacc ctgcccaccg cctggcacaa cctggtgtac 1680
cgcatgcgcg gcgacatgca gctgttccag accttctggt tcctgtacca caagggccac 1740
ccccccagcg agccctgcgg caccccctgc cgcctggcca ccctgtgcgc ccagctgagc 1800
gcccgcgccg acagccccgc cctgtgccgc cacctgatgc ccgacggcag cctgcccgag 1860
gcccagagcc tgtggccccg ccccctgttc tgctaa 1896
<210> 39
<211> 11329
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 39
ttggccactc cctctctgcg cgctcgctcg ctcactgagg ccgggcgacc aaaggtcgcc 60
cgacgcccgg gctttgcccg ggcggcctca gtgagcgagc gagcgcgcag agagggagtg 120
gccaactcca tcactagggg ttcctgctag ctctgggtat ttaagcccga gtgagcacgc 180
agggtctcca ttttgaagcg ggaggttacg cgttcgtcga ctactagtgg gtaccagagc 240
tccctaggtt ctagaaccgg tgacgtctcc catggtgaag cttggatctg agggcggagt 300
tagggcggag ccaatcagcg tgcgccgttc cgaaagttgc cttttatggc tgggcggaga 360
atgggcggtg aacgccgatg attatataag gacgcgccgg gtgtggcaca gctagttccg 420
tcgcagccgg gatttgggtc gcggttcttg tttgtggatc cctgtgatcg tcacttggta 480
agtcactgac tgtctatgcc tgggaaaggg tgggcaggag atggggcagt gcaggaaaag 540
tggcactatg aaccctgcag ccctaggaat gcatctagac aattgtacta accttcttct 600
ctttcctctc ctgacagtcc ggaaagccac catggaattc agcagcccca gcagagagga 660
atgccccaag cctctgagcc gggtgtcaat catggccgga tctctgacag gactgctgct 720
gcttcaggcc gtgtcttggg cttctggcgc tagaccttgc atccccaaga gcttcggcta 780
cagcagcgtc gtgtgcgtgt gcaatgccac ctactgcgac agcttcgacc ctcctacctt 840
tcctgctctg ggcaccttca gcagatacga gagcaccaga tccggcagac ggatggaact 900
gagcatggga cccatccagg ccaatcacac aggcactggc ctgctgctga cactgcagcc 960
tgagcagaaa ttccagaaag tgaaaggctt cggcggagcc atgacagatg ccgccgctct 1020
gaatatcctg gctctgtctc caccagctca gaacctgctg ctcaagagct acttcagcga 1080
ggaaggcatc ggctacaaca tcatcagagt gcccatggcc agctgcgact tcagcatcag 1140
gacctacacc tacgccgaca cacccgacga tttccagctg cacaacttca gcctgcctga 1200
agaggacacc aagctgaaga tccctctgat ccacagagcc ctgcagctgg cacaaagacc 1260
cgtgtcactg ctggcctctc catggacatc tcccacctgg ctgaaaacaa atggcgccgt 1320
gaatggcaag ggcagcctga aaggccaacc tggcgacatc taccaccaga cctgggccag 1380
atacttcgtg aagttcctgg acgcctatgc cgagcacaag ctgcagtttt gggccgtgac 1440
agccgagaac gaaccttctg ctggactgct gagcggctac ccctttcagt gcctgggctt 1500
tacacccgag caccagcggg actttatcgc ccgtgatctg ggacccacac tggccaatag 1560
cacccaccat aatgtgcggc tgctgatgct ggacgaccag agactgcttc tgccccactg 1620
ggctaaagtg gtgctgacag atcctgaggc cgccaaatac gtgcacggaa tcgccgtgca 1680
ctggtatctg gactttctgg cccctgccaa ggccacactg ggagagacac acagactgtt 1740
ccccaacacc atgctgttcg ccagcgaagc ctgtgtgggc agcaagtttt gggaacagag 1800
cgtgcggctc ggcagctggg atagaggcat gcagtacagc cacagcatca tcaccaacct 1860
gctgtaccac gtcgtcggct ggaccgactg gaatctggcc ctgaatcctg aaggcggccc 1920
taactgggtc cgaaacttcg tggacagccc catcatcgtg gacatcacca aggacacctt 1980
ctacaagcag cccatgttct accacctggg acacttcagc aagttcatcc ccgagggctc 2040
tcagcgcgtt ggactggtgg cttcccagaa gaacgatctg gacgccgtgg ctctgatgca 2100
ccctgatgga tctgctgtgg tggtggtcct gaaccgcagc agcaaagatg tgcccctgac 2160
catcaaggat cccgccgtgg gattcctgga aacaatcagc cctggctact ccatccacac 2220
ctacctgtgg cgtagacagg agggcagagg aagtcttctg acatgcggag acgtggaaga 2280
gaatcccggc cctatggccg agtggctgct gagcgccagc tggcagcgcc gcgccaaggc 2340
catgaccgcc gccgccggca gcgccggccg cgccgccgtg cccctgctgc tgtgcgccct 2400
gctggccccc ggcggcgcct acgtgctgga cgacagcgac ggcctgggcc gcgagttcga 2460
cggcatcggc gccgtgagcg gcggcggcgc caccagccgc ctgctggtga actaccccga 2520
gccctaccgc agccagatcc tggactacct gttcaagccc aacttcggcg ccagcctgca 2580
catcctgaag gtggagatcg gcggcgacgg ccagaccacc gacggcaccg agcccagcca 2640
catgcactac gccctggacg agaactactt ccgcggctac gagtggtggc tgatgaagga 2700
ggccaagaag cgcaacccca acatcaccct gatcggcctg ccctggagct tccccggctg 2760
gctgggcaag ggcttcgact ggccctacgt gaacctgcag ctgaccgcct actacgtggt 2820
gacctggatc gtgggcgcca agcgctacca cgacctggac atcgactaca tcggcatctg 2880
gaacgagcgc agctacaacg ccaactacat caagatcctg cgcaagatgc tgaactacca 2940
gggcctgcag cgcgtgaaga tcatcgccag cgacaacctg tgggagagca tcagcgccag 3000
catgctgctg gacgccgagc tgttcaaggt ggtggacgtg atcggcgccc actaccccgg 3060
cacccacagc gccaaggacg ccaagctgac cggcaagaag ctgtggagca gcgaggactt 3120
cagcaccctg aacagcgaca tgggcgccgg ctgctggggc cgcatcctga accagaacta 3180
catcaacggc tacatgacca gcaccatcgc ctggaacctg gtggccagct actacgagca 3240
gctgccctac ggccgctgcg gcctgatgac cgcccaggag ccctggagcg gccactacgt 3300
ggtggagagc cccgtgtggg tgagcgccca caccacccag ttcacccagc ccggctggta 3360
ctacctgaag accgtgggcc acctggagaa gggcggcagc tacgtggccc tgaccgacgg 3420
cctgggcaac ctgaccatca tcatcgagac catgagccac aagcacagca agtgcatccg 3480
ccccttcctg ccctacttca acgtgagcca gcagttcgcc accttcgtgc tgaagggcag 3540
cttcagcgag atccccgagc tgcaggtgtg gtacaccaag ctgggcaaga ccagcgagcg 3600
cttcctgttc aagcagctgg acagcctgtg gctgctggac agcgacggca gcttcaccct 3660
gagcctgcac gaggacgagc tgttcaccct gaccaccctg accaccggcc gcaagggcag 3720
ctaccccctg ccccccaaga gccagccctt ccccagcacc tacaaggacg acttcaacgt 3780
ggactacccc ttcttcagcg aggcccccaa cttcgccgac cagaccggcg tgttcgagta 3840
cttcaccaac atcgaggacc ccggcgagca ccacttcacc ctgcgccagg tgctgaacca 3900
gcgccccatc acctgggccg ccgacgccag caacaccatc agcatcatcg gcgactacaa 3960
ctggaccaac ctgaccatca agtgcgacgt gtacatcgag acccccgaca ccggcggcgt 4020
gttcatcgcc ggccgcgtga acaagggcgg catcctgatc cgcagcgccc gcggcatctt 4080
cttctggatc ttcgccaacg gcagctaccg cgtgaccggc gacctggccg gctggatcat 4140
ctacgccctg ggccgcgtgg aggtgaccgc caagaagtgg tacaccctga ccctgaccat 4200
caagggccac ttcaccagcg gcatgctgaa cgacaagagc ctgtggaccg acatccccgt 4260
gaacttcccc aagaacggct gggccgccat cggcacccac agcttcgagt tcgcccagtt 4320
cgacaacttc ctggtggagg ccacccgctg acaattgtta attaagttta aaccctcgag 4380
gccgcaagca ataaaatatc tttattttca ttacatctgt gtgttggttt tttgtgtgga 4440
gatccacgat aacaaacagc ttttttgggg tgaacatatt gactgaattc cctgcaggtt 4500
ggccactccc tctctgcgcg ctcgctcgct cactgaggcc gcccgggcaa agcccgggcg 4560
tcgggcgacc tttggtcgcc cggcctcagt gagcgagcga gcgcgcagag agggagtggc 4620
caactccatc actaggggtt cctgcggccg ctcgtacggt ctcgaggaat tcctgcagga 4680
taacttgcca acctcattct aaaatgtata tagaagccca aaagacaata acaaaaatat 4740
tcttgtagaa caaaatggga aagaatgttc cactaaatat caagatttag agcaaagcat 4800
gagatgtgtg gggatagaca gtgaggctga taaaatagag tagagctcag aaacagaccc 4860
attgatatat gtaagtgacc tatgaaaaaa atatggcatt ttacaatggg aaaatgatgg 4920
tctttttctt ttttagaaaa acagggaaat atatttatat gtaaaaaata aaagggaacc 4980
catatgtcat accatacaca caaaaaaatt ccagtgaatt ataagtctaa atggagaagg 5040
caaaacttta aatcttttag aaaataatat agaagcatgc agaccagcct ggccaacatg 5100
atgaaaccct ctctactaat aataaaatca gtagaactac tcaggactac tttgagtggg 5160
aagtcctttt ctatgaagac ttctttggcc aaaattaggc tctaaatgca aggagatagt 5220
gcatcatgcc tggctgcact tactgataaa tgatgttatc accatcttta accaaatgca 5280
caggaacaag ttatggtact gatgtgctgg attgagaagg agctctactt ccttgacagg 5340
acacatttgt atcaacttaa aaaagcagat ttttgccagc agaactattc attcagaggt 5400
aggaaactta gaatagatga tgtcactgat tagcatggct tccccatctc cacagctgct 5460
tcccacccag gttgcccaca gttgagtttg tccagtgctc agggctgccc actctcagta 5520
agaagcccca caccagcccc tctccaaata tgttggctgt tccttccatt aaagtgaccc 5580
cactttagag cagcaagtgg atttctgttt cttacagttc aggaaggagg agtcagctgt 5640
gagaacctgg agcctgagat gcttctaagt cccactgcta ctggggtcag ggaagccaga 5700
ctccagcatc agcagtcagg agcactaagc ccttgccaac atcctgtttc tcagagaaac 5760
tgcttccatt ataatggttg tcctttttta agctatcaag ccaaacaacc agtgtctacc 5820
attattctca tcacctgaag ccaagggttc tagcaaaagt caagctgtct tgtaatggtt 5880
gatgtgcctc cagcttctgt cttcagtcac tccactctta gcctgctctg aatcaactct 5940
gaccacagtt ccctggagcc cctgccacct gctgcccctg ccaccttctc catctgcagt 6000
gctgtgcagc cttctgcact cttgcagagc taataggtgg agacttgaag gaagaggagg 6060
aaagtttctc ataatagcct tgctgcaagc tcaaatggga ggtgggcact gtgcccagga 6120
gccttggagc aaaggctgtg cccaacctct gactgcatcc aggtttggtc ttgacagaga 6180
taagaagccc tggcttttgg agccaaaatc taggtcagac ttaggcagga ttctcaaagt 6240
ttatcagcag aacatgaggc agaagaccct ttctgctcca gcttcttcag gctcaacctt 6300
catcagaata gatagaaaga gaggctgtga gggttcttaa aacagaagca aatctgactc 6360
agagaataaa caacctccta gtaaactaca gcttagacag agcatctggt ggtgagtgtg 6420
ctcagtgtcc tactcaactg tctggtatca gccctcatga ggacttctct tctttccctc 6480
atagacctcc atctctgttt tccttagcct gcagaaatct ggatggctat tcacagaatg 6540
cctgtgcttt cagagttgca ttttttctct ggtattctgg ttcaagcatt tgaaggtagg 6600
aaaggttctc caagtgcaag aaagccagcc ctgagcctca actgcctggc tagtgtggtc 6660
agtaggatgc aaaggctgtt gaatgccaca aggccaaact ttaacctgtg taccacaagc 6720
ctagcagcag aggcagctct gctcactgga actctctgtc ttctttctcc tgagcctttt 6780
cttttcctga gttttctagc tctcctcaac cttacctctg ccctacccag gacaaaccca 6840
agagccactg tttctgtgat gtcctctcca gccctaatta ggcatcatga cttcagcctg 6900
accttccatg ctcagaagca gtgctaatcc acttcagatg agctgctcta tgcaacacag 6960
gcagagccta caaacctttg caccagagcc ctccacatat cagtgtttgt tcatactcac 7020
ttcaacagca aatgtgactg ctgagattaa gattttacac aagatggtct gtaatttcac 7080
agttagtttt atcccattag gtatgaaaga attagcataa ttccccttaa acatgaatga 7140
atcttagatt ttttaataaa tagttttgga agtaaagaca gagacatcag gagcacaagg 7200
aatagcctga gaggacaaac agaacaagaa agagtctgga aatacacagg atgttcttgg 7260
cctcctcaaa gcaagtgcaa gcagatagta ccagcagccc caggctatca gagcccagtg 7320
aagagaagta ccatgaaagc cacagctcta accaccctgt tccagagtga cagacagtcc 7380
ccaagacaag ccagcctgag ccagagagag aactgcaaga gaaagtttct aatttaggtt 7440
ctgttagatt cagacaagtg caggtcatcc tctctccaca gctactcacc tctccagcct 7500
aacaaagcct gcagtccaca ctccaaccct ggtgtctcac ctcctagcct ctcccaacat 7560
cctgctctct gaccatcttc tgcatctctc atctcaccat ctcccactgt ctacagccta 7620
ctcttgcaac taccatctca ttttctgaca tcctgtctac atcttctgcc atactctgcc 7680
atctaccata ccacctctta ccatctacca caccatcttt tatctccatc cctctcagaa 7740
gcctccaagc tgaatcctgc tttatgtgtt catctcagcc cctgcatgga aagctgaccc 7800
cagaggcaga actattccca gagagcttgg ccaagaaaaa caaaactacc agcctggcca 7860
ggctcaggag tagtaagctg cagtgtctgt tgtgttctag cttcaacagc tgcaggagtt 7920
ccactctcaa atgctccaca tttctcacat cctcctgatt ctggtcacta cccatcttca 7980
aagaacagaa tatctcacat cagcatactg tgaaggacta gtcatgggtg cagctgctca 8040
gagctgcaaa gtcattctgg atggtggaga gcttacaaac atttcatgat gctccccccg 8100
ctctgatggc tggagcccaa tccctacaca gactcctgct gtatgtgttt tcctttcact 8160
ctgagccaca gccagagggc aggcattcag tctcctcttc aggctggggc tggggcactg 8220
agaactcacc caacaccttg ctctcactcc ttctgcaaaa caagaaagag ctttgtgctg 8280
cagtagccat gaagaatgaa aggaaggctt taactaaaaa atgtcagaga ttattttcaa 8340
ccccttactg tggatcacca gcaaggagga aacacaacac agagacattt tttcccctca 8400
aattatcaaa agaatcactg catttgttaa agagagcaac tgaatcagga agcagagttt 8460
tgaacatatc agaagttagg aatctgcatc agagacaaat gcagtcatgg ttgtttgctg 8520
cataccagcc ctaatcatta gaagcctcat ggacttcaaa catcattccc tctgacaaga 8580
tgctctagcc taactccatg agataaaata aatctgcctt tcagagccaa agaagagtcc 8640
accagcttct tctcagtgtg aacaagagct ccagtcaggt tagtcagtcc agtgcagtag 8700
aggagaccag tctgcatcct ctaattttca aaggcaagaa gatttgttta ccctggacac 8760
caggcacaag tgaggtcaca gagctcttag atatgcagtc ctcatgagtg aggagactaa 8820
agcgcatgcc atcaagactt cagtgtagag aaaacctcca aaaaagcctc ctcactactt 8880
ctggaatagc tcagaggccg aggcggcctc ggcctctgca taaataaaaa aaattagtca 8940
gccatggggc ggagaatggg cggaactggg cggagttagg ggcgggatgg gcggagttag 9000
gggcgggact atggttgctg actaattgag atgcatgctt tgcatacttc tgcctgctgg 9060
ggagcctggg gactttccac acctggttgc tgactaattg agatgcatgc tttgcatact 9120
tctgcctgct ggggagcctg gggactttcc acaccctaac tgacacacat tccacagctg 9180
cattaatgaa tcggccaacg cgcggggaga ggcggtttgc gtattgggcg ctcttccgct 9240
tcctcgctca ctgactcgct gcgctcggtc gttcggctgc ggcgagcggt atcagctcac 9300
tcaaaggcgg taatacggtt atccacagaa tcaggggata acgcaggaaa gaacatgtga 9360
gcaaaaggcc agcaaaaggc caggaaccgt aaaaaggccg cgttgctggc gtttttccat 9420
aggctccgcc cccctgacga gcatcacaaa aatcgacgct caagtcagag gtggcgaaac 9480
ccgacaggac tataaagata ccaggcgttt ccccctggaa gctccctcgt gcgctctcct 9540
gttccgaccc tgccgcttac cggatacctg tccgcctttc tcccttcggg aagcgtggcg 9600
ctttctcata gctcacgctg taggtatctc agttcggtgt aggtcgttcg ctccaagctg 9660
ggctgtgtgc acgaaccccc cgttcagccc gaccgctgcg ccttatccgg taactatcgt 9720
cttgagtcca acccggtaag acacgactta tcgccactgg cagcagccac tggtaacagg 9780
attagcagag cgaggtatgt aggcggtgct acagagttct tgaagtggtg gcctaactac 9840
ggctacacta gaagaacagt atttggtatc tgcgctctgc tgaagccagt taccttcgga 9900
aaaagagttg gtagctcttg atccggcaaa caaaccaccg ctggtagcgg tggttttttt 9960
gtttgcaagc agcagattac gcgcagaaaa aaaggatctc aagaagatcc tttgatcttt 10020
tctacggggt ctgacgctca gtggaacgaa aactcacgtt aagggatttt ggtcatgaga 10080
ttatcaaaaa ggatcttcac ctagatcctt ttaaattaaa aatgaagttt taaatcaatc 10140
taaagtatat atgagtaaac ttggtctgac agttaccaat gcttaatcag tgaggcacct 10200
atctcagcga tctgtctatt tcgttcatcc atagttgcct gactcctgca aaccacgttg 10260
tgtctcaaaa tctctgatgt tacattgcac aagataaaaa tatatcatca tgaacaataa 10320
aactgtctgc ttacataaac agtaatacaa ggggtgttat gagccatatt caacgggaaa 10380
cgtcttgctc gaggccgcga ttaaattcca acatggatgc tgatttatat gggtataaat 10440
gggctcgcga taatgtcggg caatcaggtg cgacaatcta tcgattgtat gggaagcccg 10500
atgcgccaga gttgtttctg aaacatggca aaggtagcgt tgccaatgat gttacagatg 10560
agatggtcag actaaactgg ctgacggaat ttatgcctct tccgaccatc aagcatttta 10620
tccgtactcc tgatgatgca tggttactca ccactgcgat ccccgggaaa acagcattcc 10680
aggtattaga agaatatcct gattcaggtg aaaatattgt tgatgcgctg gcagtgttcc 10740
tgcgccggtt gcattcgatt cctgtttgta attgtccttt taacagcgat cgcgtatttc 10800
gtctcgctca ggcgcaatca cgaatgaata acggtttggt tgatgcgagt gattttgatg 10860
acgagcgtaa tggctggcct gttgaacaag tctggaaaga aatgcataag cttttgccat 10920
tctcaccgga ttcagtcgtc actcatggtg atttctcact tgataacctt atttttgacg 10980
aggggaaatt aataggttgt attgatgttg gacgagtcgg aatcgcagac cgataccagg 11040
atcttgccat cctatggaac tgcctcggtg agttttctcc ttcattacag aaacggcttt 11100
ttcaaaaata tggtattgat aatcctgata tgaataaatt gcagtttcat ttgatgctcg 11160
atgagttttt ctaagggcgg cctgccacca tacccacgcc gaaacaagcg ctcatgagcc 11220
cgaagtggcg agcccgatct tccccatcgg tgatgtcggc gatataggcg ccagcaaccg 11280
cacctgtggc gccggtgatg agggcgcgcc aagtcgacgt ccggcagtc 11329
<210> 40
<211> 11776
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 40
ttggccactc cctctctgcg cgctcgctcg ctcactgagg ccgggcgacc aaaggtcgcc 60
cgacgcccgg gctttgcccg ggcggcctca gtgagcgagc gagcgcgcag agagggagtg 120
gccaactcca tcactagggg ttcctgctag ctctgggtat ttaagcccga gtgagcacgc 180
agggtctcca ttttgaagcg ggaggttacg cgttcgtcga ctactagtgg gtaccagagc 240
tccctaggtt ctagaaccgg tgacgtctcc catggtgaag cttggatctg agggcggagt 300
tagggcggag ccaatcagcg tgcgccgttc cgaaagttgc cttttatggc tgggcggaga 360
atgggcggtg aacgccgatg attatataag gacgcgccgg gtgtggcaca gctagttccg 420
tcgcagccgg gatttgggtc gcggttcttg tttgtggatc cctgtgatcg tcacttggta 480
agtcactgac tgtctatgcc tgggaaaggg tgggcaggag atggggcagt gcaggaaaag 540
tggcactatg aaccctgcag ccctaggaat gcatctagac aattgtacta accttcttct 600
ctttcctctc ctgacagtcc ggaaagccac catggccgag tggctgctga gcgccagctg 660
gcagcgccgc gccaaggcca tgaccgccgc cgccggcagc gccggccgcg ccgccgtgcc 720
cctgctgctg tgcgccctgc tggcccccgg cggcgcctac gtgctggacg acagcgacgg 780
cctgggccgc gagttcgacg gcatcggcgc cgtgagcggc ggcggcgcca ccagccgcct 840
gctggtgaac taccccgagc cctaccgcag ccagatcctg gactacctgt tcaagcccaa 900
cttcggcgcc agcctgcaca tcctgaaggt ggagatcggc ggcgacggcc agaccaccga 960
cggcaccgag cccagccaca tgcactacgc cctggacgag aactacttcc gcggctacga 1020
gtggtggctg atgaaggagg ccaagaagcg caaccccaac atcaccctga tcggcctgcc 1080
ctggagcttc cccggctggc tgggcaaggg cttcgactgg ccctacgtga acctgcagct 1140
gaccgcctac tacgtggtga cctggatcgt gggcgccaag cgctaccacg acctggacat 1200
cgactacatc ggcatctgga acgagcgcag ctacaacgcc aactacatca agatcctgcg 1260
caagatgctg aactaccagg gcctgcagcg cgtgaagatc atcgccagcg acaacctgtg 1320
ggagagcatc agcgccagca tgctgctgga cgccgagctg ttcaaggtgg tggacgtgat 1380
cggcgcccac taccccggca cccacagcgc caaggacgcc aagctgaccg gcaagaagct 1440
gtggagcagc gaggacttca gcaccctgaa cagcgacatg ggcgccggct gctggggccg 1500
catcctgaac cagaactaca tcaacggcta catgaccagc accatcgcct ggaacctggt 1560
ggccagctac tacgagcagc tgccctacgg ccgctgcggc ctgatgaccg cccaggagcc 1620
ctggagcggc cactacgtgg tggagagccc cgtgtgggtg agcgcccaca ccacccagtt 1680
cacccagccc ggctggtact acctgaagac cgtgggccac ctggagaagg gcggcagcta 1740
cgtggccctg accgacggcc tgggcaacct gaccatcatc atcgagacca tgagccacaa 1800
gcacagcaag tgcatccgcc ccttcctgcc ctacttcaac gtgagccagc agttcgccac 1860
cttcgtgctg aagggcagct tcagcgagat ccccgagctg caggtgtggt acaccaagct 1920
gggcaagacc agcgagcgct tcctgttcaa gcagctggac agcctgtggc tgctggacag 1980
cgacggcagc ttcaccctga gcctgcacga ggacgagctg ttcaccctga ccaccctgac 2040
caccggccgc aagggcagct accccctgcc ccccaagagc cagcccttcc ccagcaccta 2100
caaggacgac ttcaacgtgg actacccctt cttcagcgag gcccccaact tcgccgacca 2160
gaccggcgtg ttcgagtact tcaccaacat cgaggacccc ggcgagcacc acttcaccct 2220
gcgccaggtg ctgaaccagc gccccatcac ctgggccgcc gacgccagca acaccatcag 2280
catcatcggc gactacaact ggaccaacct gaccatcaag tgcgacgtgt acatcgagac 2340
ccccgacacc ggcggcgtgt tcatcgccgg ccgcgtgaac aagggcggca tcctgatccg 2400
cagcgcccgc ggcatcttct tctggatctt cgccaacggc agctaccgcg tgaccggcga 2460
cctggccggc tggatcatct acgccctggg ccgcgtggag gtgaccgcca agaagtggta 2520
caccctgacc ctgaccatca agggccactt caccagcggc atgctgaacg acaagagcct 2580
gtggaccgac atccccgtga acttccccaa gaacggctgg gccgccatcg gcacccacag 2640
cttcgagttc gcccagttcg acaacttcct ggtggaggcc acccgctgat tgtggccgaa 2700
ccgccgaact cagaggccgg ccccagaaaa cccgagcgag tagggggcgg cgcgcaggag 2760
ggaggagaac tgggggcgcg ggaggctggt gggtgtgggg ggtggagatg tagaagatgt 2820
gacgccgcgg cccggcgggt gccagattag cggacgcggt gcccgcggtt gcaacgggat 2880
cccgggcgct gcagcttggg aggcggctct ccccaggcgg cgtccgcgga gacacccatc 2940
cgtgaacccc aggtcccggg ccgccggctc gccgcgcacc aggggccggc ggacagaaga 3000
gcggccgagc ggctcgaggc tgggggaccg cgggcgcggc cgcgcgctgc cgggcgggag 3060
gctggggggc cggggccggg gccgtgcccc ggagcgggtc ggaggccggg gccggggccg 3120
ggggacggcg gctccccgcg cggctccagc ggctcgggga tcccggccgg gccccgcagg 3180
gaccatgatg gaattcagca gccccagcag agaggaatgc cccaagcctc tgagccgggt 3240
gtcaatcatg gccggatctc tgacaggact gctgctgctt caggccgtgt cttgggcttc 3300
tggcgctaga ccttgcatcc ccaagagctt cggctacagc agcgtcgtgt gcgtgtgcaa 3360
tgccacctac tgcgacagct tcgaccctcc tacctttcct gctctgggca ccttcagcag 3420
atacgagagc accagatccg gcagacggat ggaactgagc atgggaccca tccaggccaa 3480
tcacacaggc actggcctgc tgctgacact gcagcctgag cagaaattcc agaaagtgaa 3540
aggcttcggc ggagccatga cagatgccgc cgctctgaat atcctggctc tgtctccacc 3600
agctcagaac ctgctgctca agagctactt cagcgaggaa ggcatcggct acaacatcat 3660
cagagtgccc atggccagct gcgacttcag catcaggacc tacacctacg ccgacacacc 3720
cgacgatttc cagctgcaca acttcagcct gcctgaagag gacaccaagc tgaagatccc 3780
tctgatccac agagccctgc agctggcaca aagacccgtg tcactgctgg cctctccatg 3840
gacatctccc acctggctga aaacaaatgg cgccgtgaat ggcaagggca gcctgaaagg 3900
ccaacctggc gacatctacc accagacctg ggccagatac ttcgtgaagt tcctggacgc 3960
ctatgccgag cacaagctgc agttttgggc cgtgacagcc gagaacgaac cttctgctgg 4020
actgctgagc ggctacccct ttcagtgcct gggctttaca cccgagcacc agcgggactt 4080
tatcgcccgt gatctgggac ccacactggc caatagcacc caccataatg tgcggctgct 4140
gatgctggac gaccagagac tgcttctgcc ccactgggct aaagtggtgc tgacagatcc 4200
tgaggccgcc aaatacgtgc acggaatcgc cgtgcactgg tatctggact ttctggcccc 4260
tgccaaggcc acactgggag agacacacag actgttcccc aacaccatgc tgttcgccag 4320
cgaagcctgt gtgggcagca agttttggga acagagcgtg cggctcggca gctgggatag 4380
aggcatgcag tacagccaca gcatcatcac caacctgctg taccacgtcg tcggctggac 4440
cgactggaat ctggccctga atcctgaagg cggccctaac tgggtccgaa acttcgtgga 4500
cagccccatc atcgtggaca tcaccaagga caccttctac aagcagccca tgttctacca 4560
cctgggacac ttcagcaagt tcatccccga gggctctcag cgcgttggac tggtggcttc 4620
ccagaagaac gatctggacg ccgtggctct gatgcaccct gatggatctg ctgtggtggt 4680
ggtcctgaac cgcagcagca aagatgtgcc cctgaccatc aaggatcccg ccgtgggatt 4740
cctggaaaca atcagccctg gctactccat ccacacctac ctgtggcgta gacagtgaca 4800
attgttaatt aagtttaaac cctcgaggcc gcaagcaata aaatatcttt attttcatta 4860
catctgtgtg ttggtttttt gtgtggagat ccacgataac aaacagcttt tttggggtga 4920
acatattgac tgaattccct gcaggttggc cactccctct ctgcgcgctc gctcgctcac 4980
tgaggccgcc cgggcaaagc ccgggcgtcg ggcgaccttt ggtcgcccgg cctcagtgag 5040
cgagcgagcg cgcagagagg gagtggccaa ctccatcact aggggttcct gcggccgctc 5100
gtacggtctc gaggaattcc tgcaggataa cttgccaacc tcattctaaa atgtatatag 5160
aagcccaaaa gacaataaca aaaatattct tgtagaacaa aatgggaaag aatgttccac 5220
taaatatcaa gatttagagc aaagcatgag atgtgtgggg atagacagtg aggctgataa 5280
aatagagtag agctcagaaa cagacccatt gatatatgta agtgacctat gaaaaaaata 5340
tggcatttta caatgggaaa atgatggtct ttttcttttt tagaaaaaca gggaaatata 5400
tttatatgta aaaaataaaa gggaacccat atgtcatacc atacacacaa aaaaattcca 5460
gtgaattata agtctaaatg gagaaggcaa aactttaaat cttttagaaa ataatataga 5520
agcatgcaga ccagcctggc caacatgatg aaaccctctc tactaataat aaaatcagta 5580
gaactactca ggactacttt gagtgggaag tccttttcta tgaagacttc tttggccaaa 5640
attaggctct aaatgcaagg agatagtgca tcatgcctgg ctgcacttac tgataaatga 5700
tgttatcacc atctttaacc aaatgcacag gaacaagtta tggtactgat gtgctggatt 5760
gagaaggagc tctacttcct tgacaggaca catttgtatc aacttaaaaa agcagatttt 5820
tgccagcaga actattcatt cagaggtagg aaacttagaa tagatgatgt cactgattag 5880
catggcttcc ccatctccac agctgcttcc cacccaggtt gcccacagtt gagtttgtcc 5940
agtgctcagg gctgcccact ctcagtaaga agccccacac cagcccctct ccaaatatgt 6000
tggctgttcc ttccattaaa gtgaccccac tttagagcag caagtggatt tctgtttctt 6060
acagttcagg aaggaggagt cagctgtgag aacctggagc ctgagatgct tctaagtccc 6120
actgctactg gggtcaggga agccagactc cagcatcagc agtcaggagc actaagccct 6180
tgccaacatc ctgtttctca gagaaactgc ttccattata atggttgtcc ttttttaagc 6240
tatcaagcca aacaaccagt gtctaccatt attctcatca cctgaagcca agggttctag 6300
caaaagtcaa gctgtcttgt aatggttgat gtgcctccag cttctgtctt cagtcactcc 6360
actcttagcc tgctctgaat caactctgac cacagttccc tggagcccct gccacctgct 6420
gcccctgcca ccttctccat ctgcagtgct gtgcagcctt ctgcactctt gcagagctaa 6480
taggtggaga cttgaaggaa gaggaggaaa gtttctcata atagccttgc tgcaagctca 6540
aatgggaggt gggcactgtg cccaggagcc ttggagcaaa ggctgtgccc aacctctgac 6600
tgcatccagg tttggtcttg acagagataa gaagccctgg cttttggagc caaaatctag 6660
gtcagactta ggcaggattc tcaaagttta tcagcagaac atgaggcaga agaccctttc 6720
tgctccagct tcttcaggct caaccttcat cagaatagat agaaagagag gctgtgaggg 6780
ttcttaaaac agaagcaaat ctgactcaga gaataaacaa cctcctagta aactacagct 6840
tagacagagc atctggtggt gagtgtgctc agtgtcctac tcaactgtct ggtatcagcc 6900
ctcatgagga cttctcttct ttccctcata gacctccatc tctgttttcc ttagcctgca 6960
gaaatctgga tggctattca cagaatgcct gtgctttcag agttgcattt tttctctggt 7020
attctggttc aagcatttga aggtaggaaa ggttctccaa gtgcaagaaa gccagccctg 7080
agcctcaact gcctggctag tgtggtcagt aggatgcaaa ggctgttgaa tgccacaagg 7140
ccaaacttta acctgtgtac cacaagccta gcagcagagg cagctctgct cactggaact 7200
ctctgtcttc tttctcctga gccttttctt ttcctgagtt ttctagctct cctcaacctt 7260
acctctgccc tacccaggac aaacccaaga gccactgttt ctgtgatgtc ctctccagcc 7320
ctaattaggc atcatgactt cagcctgacc ttccatgctc agaagcagtg ctaatccact 7380
tcagatgagc tgctctatgc aacacaggca gagcctacaa acctttgcac cagagccctc 7440
cacatatcag tgtttgttca tactcacttc aacagcaaat gtgactgctg agattaagat 7500
tttacacaag atggtctgta atttcacagt tagttttatc ccattaggta tgaaagaatt 7560
agcataattc cccttaaaca tgaatgaatc ttagattttt taataaatag ttttggaagt 7620
aaagacagag acatcaggag cacaaggaat agcctgagag gacaaacaga acaagaaaga 7680
gtctggaaat acacaggatg ttcttggcct cctcaaagca agtgcaagca gatagtacca 7740
gcagccccag gctatcagag cccagtgaag agaagtacca tgaaagccac agctctaacc 7800
accctgttcc agagtgacag acagtcccca agacaagcca gcctgagcca gagagagaac 7860
tgcaagagaa agtttctaat ttaggttctg ttagattcag acaagtgcag gtcatcctct 7920
ctccacagct actcacctct ccagcctaac aaagcctgca gtccacactc caaccctggt 7980
gtctcacctc ctagcctctc ccaacatcct gctctctgac catcttctgc atctctcatc 8040
tcaccatctc ccactgtcta cagcctactc ttgcaactac catctcattt tctgacatcc 8100
tgtctacatc ttctgccata ctctgccatc taccatacca cctcttacca tctaccacac 8160
catcttttat ctccatccct ctcagaagcc tccaagctga atcctgcttt atgtgttcat 8220
ctcagcccct gcatggaaag ctgaccccag aggcagaact attcccagag agcttggcca 8280
agaaaaacaa aactaccagc ctggccaggc tcaggagtag taagctgcag tgtctgttgt 8340
gttctagctt caacagctgc aggagttcca ctctcaaatg ctccacattt ctcacatcct 8400
cctgattctg gtcactaccc atcttcaaag aacagaatat ctcacatcag catactgtga 8460
aggactagtc atgggtgcag ctgctcagag ctgcaaagtc attctggatg gtggagagct 8520
tacaaacatt tcatgatgct ccccccgctc tgatggctgg agcccaatcc ctacacagac 8580
tcctgctgta tgtgttttcc tttcactctg agccacagcc agagggcagg cattcagtct 8640
cctcttcagg ctggggctgg ggcactgaga actcacccaa caccttgctc tcactccttc 8700
tgcaaaacaa gaaagagctt tgtgctgcag tagccatgaa gaatgaaagg aaggctttaa 8760
ctaaaaaatg tcagagatta ttttcaaccc cttactgtgg atcaccagca aggaggaaac 8820
acaacacaga gacatttttt cccctcaaat tatcaaaaga atcactgcat ttgttaaaga 8880
gagcaactga atcaggaagc agagttttga acatatcaga agttaggaat ctgcatcaga 8940
gacaaatgca gtcatggttg tttgctgcat accagcccta atcattagaa gcctcatgga 9000
cttcaaacat cattccctct gacaagatgc tctagcctaa ctccatgaga taaaataaat 9060
ctgcctttca gagccaaaga agagtccacc agcttcttct cagtgtgaac aagagctcca 9120
gtcaggttag tcagtccagt gcagtagagg agaccagtct gcatcctcta attttcaaag 9180
gcaagaagat ttgtttaccc tggacaccag gcacaagtga ggtcacagag ctcttagata 9240
tgcagtcctc atgagtgagg agactaaagc gcatgccatc aagacttcag tgtagagaaa 9300
acctccaaaa aagcctcctc actacttctg gaatagctca gaggccgagg cggcctcggc 9360
ctctgcataa ataaaaaaaa ttagtcagcc atggggcgga gaatgggcgg aactgggcgg 9420
agttaggggc gggatgggcg gagttagggg cgggactatg gttgctgact aattgagatg 9480
catgctttgc atacttctgc ctgctgggga gcctggggac tttccacacc tggttgctga 9540
ctaattgaga tgcatgcttt gcatacttct gcctgctggg gagcctgggg actttccaca 9600
ccctaactga cacacattcc acagctgcat taatgaatcg gccaacgcgc ggggagaggc 9660
ggtttgcgta ttgggcgctc ttccgcttcc tcgctcactg actcgctgcg ctcggtcgtt 9720
cggctgcggc gagcggtatc agctcactca aaggcggtaa tacggttatc cacagaatca 9780
ggggataacg caggaaagaa catgtgagca aaaggccagc aaaaggccag gaaccgtaaa 9840
aaggccgcgt tgctggcgtt tttccatagg ctccgccccc ctgacgagca tcacaaaaat 9900
cgacgctcaa gtcagaggtg gcgaaacccg acaggactat aaagatacca ggcgtttccc 9960
cctggaagct ccctcgtgcg ctctcctgtt ccgaccctgc cgcttaccgg atacctgtcc 10020
gcctttctcc cttcgggaag cgtggcgctt tctcatagct cacgctgtag gtatctcagt 10080
tcggtgtagg tcgttcgctc caagctgggc tgtgtgcacg aaccccccgt tcagcccgac 10140
cgctgcgcct tatccggtaa ctatcgtctt gagtccaacc cggtaagaca cgacttatcg 10200
ccactggcag cagccactgg taacaggatt agcagagcga ggtatgtagg cggtgctaca 10260
gagttcttga agtggtggcc taactacggc tacactagaa gaacagtatt tggtatctgc 10320
gctctgctga agccagttac cttcggaaaa agagttggta gctcttgatc cggcaaacaa 10380
accaccgctg gtagcggtgg tttttttgtt tgcaagcagc agattacgcg cagaaaaaaa 10440
ggatctcaag aagatccttt gatcttttct acggggtctg acgctcagtg gaacgaaaac 10500
tcacgttaag ggattttggt catgagatta tcaaaaagga tcttcaccta gatcctttta 10560
aattaaaaat gaagttttaa atcaatctaa agtatatatg agtaaacttg gtctgacagt 10620
taccaatgct taatcagtga ggcacctatc tcagcgatct gtctatttcg ttcatccata 10680
gttgcctgac tcctgcaaac cacgttgtgt ctcaaaatct ctgatgttac attgcacaag 10740
ataaaaatat atcatcatga acaataaaac tgtctgctta cataaacagt aatacaaggg 10800
gtgttatgag ccatattcaa cgggaaacgt cttgctcgag gccgcgatta aattccaaca 10860
tggatgctga tttatatggg tataaatggg ctcgcgataa tgtcgggcaa tcaggtgcga 10920
caatctatcg attgtatggg aagcccgatg cgccagagtt gtttctgaaa catggcaaag 10980
gtagcgttgc caatgatgtt acagatgaga tggtcagact aaactggctg acggaattta 11040
tgcctcttcc gaccatcaag cattttatcc gtactcctga tgatgcatgg ttactcacca 11100
ctgcgatccc cgggaaaaca gcattccagg tattagaaga atatcctgat tcaggtgaaa 11160
atattgttga tgcgctggca gtgttcctgc gccggttgca ttcgattcct gtttgtaatt 11220
gtccttttaa cagcgatcgc gtatttcgtc tcgctcaggc gcaatcacga atgaataacg 11280
gtttggttga tgcgagtgat tttgatgacg agcgtaatgg ctggcctgtt gaacaagtct 11340
ggaaagaaat gcataagctt ttgccattct caccggattc agtcgtcact catggtgatt 11400
tctcacttga taaccttatt tttgacgagg ggaaattaat aggttgtatt gatgttggac 11460
gagtcggaat cgcagaccga taccaggatc ttgccatcct atggaactgc ctcggtgagt 11520
tttctccttc attacagaaa cggctttttc aaaaatatgg tattgataat cctgatatga 11580
ataaattgca gtttcatttg atgctcgatg agtttttcta agggcggcct gccaccatac 11640
ccacgccgaa acaagcgctc atgagcccga agtggcgagc ccgatcttcc ccatcggtga 11700
tgtcggcgat ataggcgcca gcaaccgcac ctgtggcgcc ggtgatgagg gcgcgccaag 11760
tcgacgtccg gcagtc 11776
<210> 41
<211> 11348
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 41
ttggccactc cctctctgcg cgctcgctcg ctcactgagg ccgggcgacc aaaggtcgcc 60
cgacgcccgg gctttgcccg ggcggcctca gtgagcgagc gagcgcgcag agagggagtg 120
gccaactcca tcactagggg ttcctgctag ctctgggtat ttaagcccga gtgagcacgc 180
agggtctcca ttttgaagcg ggaggttacg cgttcgtcga ctactagtgg gtaccagagc 240
tccctaggtt ctagaaccgg tgacgtctcc catggtgaag cttggatctg agggcggagt 300
tagggcggag ccaatcagcg tgcgccgttc cgaaagttgc cttttatggc tgggcggaga 360
atgggcggtg aacgccgatg attatataag gacgcgccgg gtgtggcaca gctagttccg 420
tcgcagccgg gatttgggtc gcggttcttg tttgtggatc cctgtgatcg tcacttggta 480
agtcactgac tgtctatgcc tgggaaaggg tgggcaggag atggggcagt gcaggaaaag 540
tggcactatg aaccctcctg gtggcgaggg gaggggggtg gtcctcgaac gccttgcaga 600
actggcctgg atacagagtg gaccggctgg ccccatctgg aagacttcga gatacactgt 660
tgtcttactg cgctcaacag tgtatctcga agtcttccaa atggtgccag ccatcgcagc 720
ggggtgcagg aaatgggggc agcccccctt tttggctatc cttccacgtg ttcttttttg 780
tatcttttgt gtttcctaga aaacatctca gtcaccaccg cagccctagg aatgcatcta 840
gacaattgta ctaaccttct tctctttcct ctcctgacag tccggaaagc caccatgtgg 900
cagctgtggg ccagcctgtg ctgcctgctg gtgctggcca acgcccgcag ccgccccagc 960
ttccaccccc tgagcgacga gctggtgaac tacgtgaaca agcgcaacac cacctggcag 1020
gccggccaca acttctacaa cgtggacatg agctacctga agcgcctgtg cggcaccttc 1080
ctgggcggcc ccaagccccc ccagcgcgtg atgttcaccg aggacctgaa gctgcccgcc 1140
agcttcgacg cccgcgagca gtggccccag tgccccacca tcaaggagat ccgcgaccag 1200
ggcagctgcg gcagctgctg ggccttcggc gccgtggagg ccatcagcga ccgcatctgc 1260
atccacacca acgcccacgt gagcgtggag gtgagcgccg aggacctgct gacctgctgc 1320
ggcagcatgt gcggcgacgg ctgcaacggc ggctaccccg ccgaggcctg gaacttctgg 1380
acccgcaagg gcctggtgag cggcggcctg tacgagagcc acgtgggctg ccgcccctac 1440
agcatccccc cctgcgagca ccacgtgaac ggcagccgcc ccccctgcac cggcgagggc 1500
gacaccccca agtgcagcaa gatctgcgag cccggctaca gccccaccta caagcaggac 1560
aagcactacg gctacaacag ctacagcgtg agcaacagcg agaaggacat catggccgag 1620
atctacaaga acggccccgt ggagggcgcc ttcagcgtgt acagcgactt cctgctgtac 1680
aagagcggcg tgtaccagca cgtgaccggc gagatgatgg gcggccacgc catccgcatc 1740
ctgggctggg gcgtggagaa cggcaccccc tactggctgg tggccaacag ctggaacacc 1800
gactggggcg acaacggctt cttcaagatc ctgcgcggcc aggaccactg cggcatcgag 1860
agcgaggtgg tggccggcat cccccgcacc gaccagtact gggagaagat cgagggcaga 1920
ggaagtcttc tgacatgcgg agacgtggaa gagaatcccg gccctatgga attcagcagc 1980
cccagcagag aggaatgccc caagcctctg agccgggtgt caatcatggc cggatctctg 2040
acaggactgc tgctgcttca ggccgtgtct tgggcttctg gcgctagacc ttgcatcccc 2100
aagagcttcg gctacagcag cgtcgtgtgc gtgtgcaatg ccacctactg cgacagcttc 2160
gaccctccta cctttcctgc tctgggcacc ttcagcagat acgagagcac cagatccggc 2220
agacggatgg aactgagcat gggacccatc caggccaatc acacaggcac tggcctgctg 2280
ctgacactgc agcctgagca gaaattccag aaagtgaaag gcttcggcgg agccatgaca 2340
gatgccgccg ctctgaatat cctggctctg tctccaccag ctcagaacct gctgctcaag 2400
agctacttca gcgaggaagg catcggctac aacatcatca gagtgcccat ggccagctgc 2460
gacttcagca tcaggaccta cacctacgcc gacacacccg acgatttcca gctgcacaac 2520
ttcagcctgc ctgaagagga caccaagctg aagatccctc tgatccacag agccctgcag 2580
ctggcacaaa gacccgtgtc actgctggcc tctccatgga catctcccac ctggctgaaa 2640
acaaatggcg ccgtgaatgg caagggcagc ctgaaaggcc aacctggcga catctaccac 2700
cagacctggg ccagatactt cgtgaagttc ctggacgcct atgccgagca caagctgcag 2760
ttttgggccg tgacagccga gaacgaacct tctgctggac tgctgagcgg ctaccccttt 2820
cagtgcctgg gctttacacc cgagcaccag cgggacttta tcgcccgtga tctgggaccc 2880
acactggcca atagcaccca ccataatgtg cggctgctga tgctggacga ccagagactg 2940
cttctgcccc actgggctaa agtggtgctg acagatcctg aggccgccaa atacgtgcac 3000
ggaatcgccg tgcactggta tctggacttt ctggcccctg ccaaggccac actgggagag 3060
acacacagac tgttccccaa caccatgctg ttcgccagcg aagcctgtgt gggcagcaag 3120
ttttgggaac agagcgtgcg gctcggcagc tgggatagag gcatgcagta cagccacagc 3180
atcatcacca acctgctgta ccacgtcgtc ggctggaccg actggaatct ggccctgaat 3240
cctgaaggcg gccctaactg ggtccgaaac ttcgtggaca gccccatcat cgtggacatc 3300
accaaggaca ccttctacaa gcagcccatg ttctaccacc tgggacactt cagcaagttc 3360
atccccgagg gctctcagcg cgttggactg gtggcttccc agaagaacga tctggacgcc 3420
gtggctctga tgcaccctga tggatctgct gtggtggtgg tcctgaaccg cagcagcaaa 3480
gatgtgcccc tgaccatcaa ggatcccgcc gtgggattcc tggaaacaat cagccctggc 3540
tactccatcc acacctacct gtggcgtaga cagtgacaat tgttaattaa gtttaaaccc 3600
tcgaggccgc aagcttatcg ataatcaacc tctggattac aaaatttgtg aaagattgac 3660
tggtattctt aactatgttg ctccttttac gctatgtgga tacgctgctt taatgccttt 3720
gtatcatgct attgcttccc gtatggcttt cattttctcc tccttgtata aatcctggtt 3780
gctgtctctt tatgaggagt tgtggcccgt tgtcaggcaa cgtggcgtgg tgtgcactgt 3840
gtttgctgac gcaaccccca ctggttgggg cattgccacc acctgtcagc tcctttccgg 3900
gactttcgct ttccccctcc ctattgccac ggcggaactc atcgccgcct gccttgcccg 3960
ctgctggaca ggggctcggc tgttgggcac tgacaattcc gtggtgttgt cggggaaatc 4020
atcgtccttt ccttggctgc tcgcctgtgt tgccacctgg attctgcgcg ggacgtcctt 4080
ctgctacgtc ccttcggccc tcaatccagc ggaccttcct tcccgcggcc tgctgccggc 4140
tctgcggcct cttccgcgtc ttcgccttcg ccctcagacg agtcggatct ccctttgggc 4200
cgcctccccg catcgatacc gtcgactaga gctcgctgat cagcctcgac tgtgccttct 4260
agttgccagc catctgttgt ttgcccctcc cccgtgcctt ccttgaccct ggaaggtgcc 4320
actcccactg tcctttccta ataaaatgag gaaattgcat cgcattgtct gagtaggtgt 4380
cattctattc tggggggtgg ggtggggcag gacagcaagg gggaggattg ggaagacaat 4440
agcaggcatg ctggggagag atccacgata acaaacagct tttttggggt gaacatattg 4500
actgaattcc ctgcaggttg gccactccct ctctgcgcgc tcgctcgctc actgaggccg 4560
cccgggcaaa gcccgggcgt cgggcgacct ttggtcgccc ggcctcagtg agcgagcgag 4620
cgcgcagaga gggagtggcc aactccatca ctaggggttc ctgcggccgc tcgtacggtc 4680
tcgaggaatt cctgcaggat aacttgccaa cctcattcta aaatgtatat agaagcccaa 4740
aagacaataa caaaaatatt cttgtagaac aaaatgggaa agaatgttcc actaaatatc 4800
aagatttaga gcaaagcatg agatgtgtgg ggatagacag tgaggctgat aaaatagagt 4860
agagctcaga aacagaccca ttgatatatg taagtgacct atgaaaaaaa tatggcattt 4920
tacaatggga aaatgatggt ctttttcttt tttagaaaaa cagggaaata tatttatatg 4980
taaaaaataa aagggaaccc atatgtcata ccatacacac aaaaaaattc cagtgaatta 5040
taagtctaaa tggagaaggc aaaactttaa atcttttaga aaataatata gaagcatgca 5100
gaccagcctg gccaacatga tgaaaccctc tctactaata ataaaatcag tagaactact 5160
caggactact ttgagtggga agtccttttc tatgaagact tctttggcca aaattaggct 5220
ctaaatgcaa ggagatagtg catcatgcct ggctgcactt actgataaat gatgttatca 5280
ccatctttaa ccaaatgcac aggaacaagt tatggtactg atgtgctgga ttgagaagga 5340
gctctacttc cttgacagga cacatttgta tcaacttaaa aaagcagatt tttgccagca 5400
gaactattca ttcagaggta ggaaacttag aatagatgat gtcactgatt agcatggctt 5460
ccccatctcc acagctgctt cccacccagg ttgcccacag ttgagtttgt ccagtgctca 5520
gggctgccca ctctcagtaa gaagccccac accagcccct ctccaaatat gttggctgtt 5580
ccttccatta aagtgacccc actttagagc agcaagtgga tttctgtttc ttacagttca 5640
ggaaggagga gtcagctgtg agaacctgga gcctgagatg cttctaagtc ccactgctac 5700
tggggtcagg gaagccagac tccagcatca gcagtcagga gcactaagcc cttgccaaca 5760
tcctgtttct cagagaaact gcttccatta taatggttgt ccttttttaa gctatcaagc 5820
caaacaacca gtgtctacca ttattctcat cacctgaagc caagggttct agcaaaagtc 5880
aagctgtctt gtaatggttg atgtgcctcc agcttctgtc ttcagtcact ccactcttag 5940
cctgctctga atcaactctg accacagttc cctggagccc ctgccacctg ctgcccctgc 6000
caccttctcc atctgcagtg ctgtgcagcc ttctgcactc ttgcagagct aataggtgga 6060
gacttgaagg aagaggagga aagtttctca taatagcctt gctgcaagct caaatgggag 6120
gtgggcactg tgcccaggag ccttggagca aaggctgtgc ccaacctctg actgcatcca 6180
ggtttggtct tgacagagat aagaagccct ggcttttgga gccaaaatct aggtcagact 6240
taggcaggat tctcaaagtt tatcagcaga acatgaggca gaagaccctt tctgctccag 6300
cttcttcagg ctcaaccttc atcagaatag atagaaagag aggctgtgag ggttcttaaa 6360
acagaagcaa atctgactca gagaataaac aacctcctag taaactacag cttagacaga 6420
gcatctggtg gtgagtgtgc tcagtgtcct actcaactgt ctggtatcag ccctcatgag 6480
gacttctctt ctttccctca tagacctcca tctctgtttt ccttagcctg cagaaatctg 6540
gatggctatt cacagaatgc ctgtgctttc agagttgcat tttttctctg gtattctggt 6600
tcaagcattt gaaggtagga aaggttctcc aagtgcaaga aagccagccc tgagcctcaa 6660
ctgcctggct agtgtggtca gtaggatgca aaggctgttg aatgccacaa ggccaaactt 6720
taacctgtgt accacaagcc tagcagcaga ggcagctctg ctcactggaa ctctctgtct 6780
tctttctcct gagccttttc ttttcctgag ttttctagct ctcctcaacc ttacctctgc 6840
cctacccagg acaaacccaa gagccactgt ttctgtgatg tcctctccag ccctaattag 6900
gcatcatgac ttcagcctga ccttccatgc tcagaagcag tgctaatcca cttcagatga 6960
gctgctctat gcaacacagg cagagcctac aaacctttgc accagagccc tccacatatc 7020
agtgtttgtt catactcact tcaacagcaa atgtgactgc tgagattaag attttacaca 7080
agatggtctg taatttcaca gttagtttta tcccattagg tatgaaagaa ttagcataat 7140
tccccttaaa catgaatgaa tcttagattt tttaataaat agttttggaa gtaaagacag 7200
agacatcagg agcacaagga atagcctgag aggacaaaca gaacaagaaa gagtctggaa 7260
atacacagga tgttcttggc ctcctcaaag caagtgcaag cagatagtac cagcagcccc 7320
aggctatcag agcccagtga agagaagtac catgaaagcc acagctctaa ccaccctgtt 7380
ccagagtgac agacagtccc caagacaagc cagcctgagc cagagagaga actgcaagag 7440
aaagtttcta atttaggttc tgttagattc agacaagtgc aggtcatcct ctctccacag 7500
ctactcacct ctccagccta acaaagcctg cagtccacac tccaaccctg gtgtctcacc 7560
tcctagcctc tcccaacatc ctgctctctg accatcttct gcatctctca tctcaccatc 7620
tcccactgtc tacagcctac tcttgcaact accatctcat tttctgacat cctgtctaca 7680
tcttctgcca tactctgcca tctaccatac cacctcttac catctaccac accatctttt 7740
atctccatcc ctctcagaag cctccaagct gaatcctgct ttatgtgttc atctcagccc 7800
ctgcatggaa agctgacccc agaggcagaa ctattcccag agagcttggc caagaaaaac 7860
aaaactacca gcctggccag gctcaggagt agtaagctgc agtgtctgtt gtgttctagc 7920
ttcaacagct gcaggagttc cactctcaaa tgctccacat ttctcacatc ctcctgattc 7980
tggtcactac ccatcttcaa agaacagaat atctcacatc agcatactgt gaaggactag 8040
tcatgggtgc agctgctcag agctgcaaag tcattctgga tggtggagag cttacaaaca 8100
tttcatgatg ctccccccgc tctgatggct ggagcccaat ccctacacag actcctgctg 8160
tatgtgtttt cctttcactc tgagccacag ccagagggca ggcattcagt ctcctcttca 8220
ggctggggct ggggcactga gaactcaccc aacaccttgc tctcactcct tctgcaaaac 8280
aagaaagagc tttgtgctgc agtagccatg aagaatgaaa ggaaggcttt aactaaaaaa 8340
tgtcagagat tattttcaac cccttactgt ggatcaccag caaggaggaa acacaacaca 8400
gagacatttt ttcccctcaa attatcaaaa gaatcactgc atttgttaaa gagagcaact 8460
gaatcaggaa gcagagtttt gaacatatca gaagttagga atctgcatca gagacaaatg 8520
cagtcatggt tgtttgctgc ataccagccc taatcattag aagcctcatg gacttcaaac 8580
atcattccct ctgacaagat gctctagcct aactccatga gataaaataa atctgccttt 8640
cagagccaaa gaagagtcca ccagcttctt ctcagtgtga acaagagctc cagtcaggtt 8700
agtcagtcca gtgcagtaga ggagaccagt ctgcatcctc taattttcaa aggcaagaag 8760
atttgtttac cctggacacc aggcacaagt gaggtcacag agctcttaga tatgcagtcc 8820
tcatgagtga ggagactaaa gcgcatgcca tcaagacttc agtgtagaga aaacctccaa 8880
aaaagcctcc tcactacttc tggaatagct cagaggccga ggcggcctcg gcctctgcat 8940
aaataaaaaa aattagtcag ccatggggcg gagaatgggc ggaactgggc ggagttaggg 9000
gcgggatggg cggagttagg ggcgggacta tggttgctga ctaattgaga tgcatgcttt 9060
gcatacttct gcctgctggg gagcctgggg actttccaca cctggttgct gactaattga 9120
gatgcatgct ttgcatactt ctgcctgctg gggagcctgg ggactttcca caccctaact 9180
gacacacatt ccacagctgc attaatgaat cggccaacgc gcggggagag gcggtttgcg 9240
tattgggcgc tcttccgctt cctcgctcac tgactcgctg cgctcggtcg ttcggctgcg 9300
gcgagcggta tcagctcact caaaggcggt aatacggtta tccacagaat caggggataa 9360
cgcaggaaag aacatgtgag caaaaggcca gcaaaaggcc aggaaccgta aaaaggccgc 9420
gttgctggcg tttttccata ggctccgccc ccctgacgag catcacaaaa atcgacgctc 9480
aagtcagagg tggcgaaacc cgacaggact ataaagatac caggcgtttc cccctggaag 9540
ctccctcgtg cgctctcctg ttccgaccct gccgcttacc ggatacctgt ccgcctttct 9600
cccttcggga agcgtggcgc tttctcatag ctcacgctgt aggtatctca gttcggtgta 9660
ggtcgttcgc tccaagctgg gctgtgtgca cgaacccccc gttcagcccg accgctgcgc 9720
cttatccggt aactatcgtc ttgagtccaa cccggtaaga cacgacttat cgccactggc 9780
agcagccact ggtaacagga ttagcagagc gaggtatgta ggcggtgcta cagagttctt 9840
gaagtggtgg cctaactacg gctacactag aagaacagta tttggtatct gcgctctgct 9900
gaagccagtt accttcggaa aaagagttgg tagctcttga tccggcaaac aaaccaccgc 9960
tggtagcggt ggtttttttg tttgcaagca gcagattacg cgcagaaaaa aaggatctca 10020
agaagatcct ttgatctttt ctacggggtc tgacgctcag tggaacgaaa actcacgtta 10080
agggattttg gtcatgagat tatcaaaaag gatcttcacc tagatccttt taaattaaaa 10140
atgaagtttt aaatcaatct aaagtatata tgagtaaact tggtctgaca gttaccaatg 10200
cttaatcagt gaggcaccta tctcagcgat ctgtctattt cgttcatcca tagttgcctg 10260
actcctgcaa accacgttgt gtctcaaaat ctctgatgtt acattgcaca agataaaaat 10320
atatcatcat gaacaataaa actgtctgct tacataaaca gtaatacaag gggtgttatg 10380
agccatattc aacgggaaac gtcttgctcg aggccgcgat taaattccaa catggatgct 10440
gatttatatg ggtataaatg ggctcgcgat aatgtcgggc aatcaggtgc gacaatctat 10500
cgattgtatg ggaagcccga tgcgccagag ttgtttctga aacatggcaa aggtagcgtt 10560
gccaatgatg ttacagatga gatggtcaga ctaaactggc tgacggaatt tatgcctctt 10620
ccgaccatca agcattttat ccgtactcct gatgatgcat ggttactcac cactgcgatc 10680
cccgggaaaa cagcattcca ggtattagaa gaatatcctg attcaggtga aaatattgtt 10740
gatgcgctgg cagtgttcct gcgccggttg cattcgattc ctgtttgtaa ttgtcctttt 10800
aacagcgatc gcgtatttcg tctcgctcag gcgcaatcac gaatgaataa cggtttggtt 10860
gatgcgagtg attttgatga cgagcgtaat ggctggcctg ttgaacaagt ctggaaagaa 10920
atgcataagc ttttgccatt ctcaccggat tcagtcgtca ctcatggtga tttctcactt 10980
gataacctta tttttgacga ggggaaatta ataggttgta ttgatgttgg acgagtcgga 11040
atcgcagacc gataccagga tcttgccatc ctatggaact gcctcggtga gttttctcct 11100
tcattacaga aacggctttt tcaaaaatat ggtattgata atcctgatat gaataaattg 11160
cagtttcatt tgatgctcga tgagtttttc taagggcggc ctgccaccat acccacgccg 11220
aaacaagcgc tcatgagccc gaagtggcga gcccgatctt ccccatcggt gatgtcggcg 11280
atataggcgc cagcaaccgc acctgtggcg ccggtgatga gggcgcgcca agtcgacgtc 11340
cggcagtc 11348
<210> 42
<211> 11433
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 42
ttggccactc cctctctgcg cgctcgctcg ctcactgagg ccgggcgacc aaaggtcgcc 60
cgacgcccgg gctttgcccg ggcggcctca gtgagcgagc gagcgcgcag agagggagtg 120
gccaactcca tcactagggg ttcctgctag ctctgggtat ttaagcccga gtgagcacgc 180
agggtctcca ttttgaagcg ggaggttacg cgttcgtcga ctactagtgg gtaccagagc 240
tccctaggtt ctagaaccgg tgacgtctcc catggtgaag cttggatctg agggcggagt 300
tagggcggag ccaatcagcg tgcgccgttc cgaaagttgc cttttatggc tgggcggaga 360
atgggcggtg aacgccgatg attatataag gacgcgccgg gtgtggcaca gctagttccg 420
tcgcagccgg gatttgggtc gcggttcttg tttgtggatc cctgtgatcg tcacttggta 480
agtcactgac tgtctatgcc tgggaaaggg tgggcaggag atggggcagt gcaggaaaag 540
tggcactatg aaccctcctg gtggcgaggg gaggggggtg gtcctcgaac gccttgcaga 600
actggcctgg atacagagtg gaccggctgg ccccatctgg aagacttcga gatacactgt 660
tgtcttactg cgctcaacag tgtatctcga agtcttccaa atggtgccag ccatcgcagc 720
ggggtgcagg aaatgggggc agcccccctt tttggctatc cttccacgtg ttcttttttg 780
tatcttttgt gtttcctaga aaacatctca gtcaccaccg cagccctagg aatgcatcta 840
gacaattgta ctaaccttct tctctttcct ctcctgacag tccggaaagc caccatggaa 900
ttcagcagcc ccagcagaga ggaatgcccc aagcctctga gccgggtgtc aatcatggcc 960
ggatctctga caggactgct gctgcttcag gccgtgtctt gggcttctgg cgctagacct 1020
tgcatcccca agagcttcgg ctacagcagc gtcgtgtgcg tgtgcaatgc cacctactgc 1080
gacagcttcg accctcctac ctttcctgct ctgggcacct tcagcagata cgagagcacc 1140
agatccggca gacggatgga actgagcatg ggacccatcc aggccaatca cacaggcact 1200
ggcctgctgc tgacactgca gcctgagcag aaattccaga aagtgaaagg cttcggcgga 1260
gccatgacag atgccgccgc tctgaatatc ctggctctgt ctccaccagc tcagaacctg 1320
ctgctcaaga gctacttcag cgaggaaggc atcggctaca acatcatcag agtgcccatg 1380
gccagctgcg acttcagcat caggacctac acctacgccg acacacccga cgatttccag 1440
ctgcacaact tcagcctgcc tgaagaggac accaagctga agatccctct gatccacaga 1500
gccctgcagc tggcacaaag acccgtgtca ctgctggcct ctccatggac atctcccacc 1560
tggctgaaaa caaatggcgc cgtgaatggc aagggcagcc tgaaaggcca acctggcgac 1620
atctaccacc agacctgggc cagatacttc gtgaagttcc tggacgccta tgccgagcac 1680
aagctgcagt tttgggccgt gacagccgag aacgaacctt ctgctggact gctgagcggc 1740
tacccctttc agtgcctggg ctttacaccc gagcaccagc gggactttat cgcccgtgat 1800
ctgggaccca cactggccaa tagcacccac cataatgtgc ggctgctgat gctggacgac 1860
cagagactgc ttctgcccca ctgggctaaa gtggtgctga cagatcctga ggccgccaaa 1920
tacgtgcacg gaatcgccgt gcactggtat ctggactttc tggcccctgc caaggccaca 1980
ctgggagaga cacacagact gttccccaac accatgctgt tcgccagcga agcctgtgtg 2040
ggcagcaagt tttgggaaca gagcgtgcgg ctcggcagct gggatagagg catgcagtac 2100
agccacagca tcatcaccaa cctgctgtac cacgtcgtcg gctggaccga ctggaatctg 2160
gccctgaatc ctgaaggcgg ccctaactgg gtccgaaact tcgtggacag ccccatcatc 2220
gtggacatca ccaaggacac cttctacaag cagcccatgt tctaccacct gggacacttc 2280
agcaagttca tccccgaggg ctctcagcgc gttggactgg tggcttccca gaagaacgat 2340
ctggacgccg tggctctgat gcaccctgat ggatctgctg tggtggtggt cctgaaccgc 2400
agcagcaaag atgtgcccct gaccatcaag gatcccgccg tgggattcct ggaaacaatc 2460
agccctggct actccatcca cacctacctg tggcgtagac aggagggcag aggaagtctt 2520
ctgacatgcg gagacgtgga agagaatccc ggccctatgc cccgctacgg cgccagcctg 2580
cgccagagct gcccccgcag cggccgcgag cagggccagg acggcaccgc cggcgccccc 2640
ggcctgctgt ggatgggcct ggtgctggcc ctggccctgg ccctggccct ggccctggcc 2700
ctgagcgaca gccgcgtgct gtgggccccc gccgaggccc accccctgag cccccagggc 2760
caccccgccc gcctgcaccg catcgtgccc cgcctgcgcg acgtgttcgg ctggggcaac 2820
ctgacctgcc ccatctgcaa gggcctgttc accgccatca acctgggcct gaagaaggag 2880
cccaacgtgg cccgcgtggg cagcgtggcc atcaagctgt gcaacctgct gaagatcgcc 2940
ccccccgccg tgtgccagag catcgtgcac ctgttcgagg acgacatggt ggaggtgtgg 3000
cgccgcagcg tgctgagccc cagcgaggcc tgcggcctgc tgctgggcag cacctgcggc 3060
cactgggaca tcttcagcag ctggaacatc agcctgccca ccgtgcccaa gccccccccc 3120
aagcccccca gcccccccgc ccccggcgcc cccgtgagcc gcatcctgtt cctgaccgac 3180
ctgcactggg accacgacta cctggagggc accgaccccg actgcgccga ccccctgtgc 3240
tgccgccgcg gcagcggcct gccccccgcc agccgccccg gcgccggcta ctggggcgag 3300
tacagcaagt gcgacctgcc cctgcgcacc ctggagagcc tgctgagcgg cctgggcccc 3360
gccggcccct tcgacatggt gtactggacc ggcgacatcc ccgcccacga cgtgtggcac 3420
cagacccgcc aggaccagct gcgcgccctg accaccgtga ccgccctggt gcgcaagttc 3480
ctgggccccg tgcccgtgta ccccgccgtg ggcaaccacg agagcacccc cgtgaacagc 3540
ttcccccccc ccttcatcga gggcaaccac agcagccgct ggctgtacga ggccatggcc 3600
aaggcctggg agccctggct gcccgccgag gccctgcgca ccctgcgcat cggcggcttc 3660
tacgccctga gcccctaccc cggcctgcgc ctgatcagcc tgaacatgaa cttctgcagc 3720
cgcgagaact tctggctgct gatcaacagc accgaccccg ccggccagct gcagtggctg 3780
gtgggcgagc tgcaggccgc cgaggaccgc ggcgacaagg tgcacatcat cggccacatc 3840
ccccccggcc actgcctgaa gagctggagc tggaactact accgcatcgt ggcccgctac 3900
gagaacaccc tggccgccca gttcttcggc cacacccacg tggacgagtt cgaggtgttc 3960
tacgacgagg agaccctgag ccgccccctg gccgtggcct tcctggcccc cagcgccacc 4020
acctacatcg gcctgaaccc cggctaccgc gtgtaccaga tcgacggcaa ctacagcggc 4080
agcagccacg tggtgctgga ccacgagacc tacatcctga acctgaccca ggccaacatc 4140
cccggcgcca tcccccactg gcagctgctg taccgcgccc gcgagaccta cggcctgccc 4200
aacaccctgc ccaccgcctg gcacaacctg gtgtaccgca tgcgcggcga catgcagctg 4260
ttccagacct tctggttcct gtaccacaag ggccaccccc ccagcgagcc ctgcggcacc 4320
ccctgccgcc tggccaccct gtgcgcccag ctgagcgccc gcgccgacag ccccgccctg 4380
tgccgccacc tgatgcccga cggcagcctg cccgaggccc agagcctgtg gccccgcccc 4440
ctgttctgct aatgacaatt gttaattaag tttaaaccct cgaggccgca agcaataaaa 4500
tatctttatt ttcattacat ctgtgtgttg gttttttgtg tggagatcca cgataacaaa 4560
cagctttttt ggggtgaaca tattgactga attccctgca ggttggccac tccctctctg 4620
cgcgctcgct cgctcactga ggccgcccgg gcaaagcccg ggcgtcgggc gacctttggt 4680
cgcccggcct cagtgagcga gcgagcgcgc agagagggag tggccaactc catcactagg 4740
ggttcctgcg gccgctcgta cggtctcgag gaattcctgc aggataactt gccaacctca 4800
ttctaaaatg tatatagaag cccaaaagac aataacaaaa atattcttgt agaacaaaat 4860
gggaaagaat gttccactaa atatcaagat ttagagcaaa gcatgagatg tgtggggata 4920
gacagtgagg ctgataaaat agagtagagc tcagaaacag acccattgat atatgtaagt 4980
gacctatgaa aaaaatatgg cattttacaa tgggaaaatg atggtctttt tcttttttag 5040
aaaaacaggg aaatatattt atatgtaaaa aataaaaggg aacccatatg tcataccata 5100
cacacaaaaa aattccagtg aattataagt ctaaatggag aaggcaaaac tttaaatctt 5160
ttagaaaata atatagaagc atgcagacca gcctggccaa catgatgaaa ccctctctac 5220
taataataaa atcagtagaa ctactcagga ctactttgag tgggaagtcc ttttctatga 5280
agacttcttt ggccaaaatt aggctctaaa tgcaaggaga tagtgcatca tgcctggctg 5340
cacttactga taaatgatgt tatcaccatc tttaaccaaa tgcacaggaa caagttatgg 5400
tactgatgtg ctggattgag aaggagctct acttccttga caggacacat ttgtatcaac 5460
ttaaaaaagc agatttttgc cagcagaact attcattcag aggtaggaaa cttagaatag 5520
atgatgtcac tgattagcat ggcttcccca tctccacagc tgcttcccac ccaggttgcc 5580
cacagttgag tttgtccagt gctcagggct gcccactctc agtaagaagc cccacaccag 5640
cccctctcca aatatgttgg ctgttccttc cattaaagtg accccacttt agagcagcaa 5700
gtggatttct gtttcttaca gttcaggaag gaggagtcag ctgtgagaac ctggagcctg 5760
agatgcttct aagtcccact gctactgggg tcagggaagc cagactccag catcagcagt 5820
caggagcact aagcccttgc caacatcctg tttctcagag aaactgcttc cattataatg 5880
gttgtccttt tttaagctat caagccaaac aaccagtgtc taccattatt ctcatcacct 5940
gaagccaagg gttctagcaa aagtcaagct gtcttgtaat ggttgatgtg cctccagctt 6000
ctgtcttcag tcactccact cttagcctgc tctgaatcaa ctctgaccac agttccctgg 6060
agcccctgcc acctgctgcc cctgccacct tctccatctg cagtgctgtg cagccttctg 6120
cactcttgca gagctaatag gtggagactt gaaggaagag gaggaaagtt tctcataata 6180
gccttgctgc aagctcaaat gggaggtggg cactgtgccc aggagccttg gagcaaaggc 6240
tgtgcccaac ctctgactgc atccaggttt ggtcttgaca gagataagaa gccctggctt 6300
ttggagccaa aatctaggtc agacttaggc aggattctca aagtttatca gcagaacatg 6360
aggcagaaga ccctttctgc tccagcttct tcaggctcaa ccttcatcag aatagataga 6420
aagagaggct gtgagggttc ttaaaacaga agcaaatctg actcagagaa taaacaacct 6480
cctagtaaac tacagcttag acagagcatc tggtggtgag tgtgctcagt gtcctactca 6540
actgtctggt atcagccctc atgaggactt ctcttctttc cctcatagac ctccatctct 6600
gttttcctta gcctgcagaa atctggatgg ctattcacag aatgcctgtg ctttcagagt 6660
tgcatttttt ctctggtatt ctggttcaag catttgaagg taggaaaggt tctccaagtg 6720
caagaaagcc agccctgagc ctcaactgcc tggctagtgt ggtcagtagg atgcaaaggc 6780
tgttgaatgc cacaaggcca aactttaacc tgtgtaccac aagcctagca gcagaggcag 6840
ctctgctcac tggaactctc tgtcttcttt ctcctgagcc ttttcttttc ctgagttttc 6900
tagctctcct caaccttacc tctgccctac ccaggacaaa cccaagagcc actgtttctg 6960
tgatgtcctc tccagcccta attaggcatc atgacttcag cctgaccttc catgctcaga 7020
agcagtgcta atccacttca gatgagctgc tctatgcaac acaggcagag cctacaaacc 7080
tttgcaccag agccctccac atatcagtgt ttgttcatac tcacttcaac agcaaatgtg 7140
actgctgaga ttaagatttt acacaagatg gtctgtaatt tcacagttag ttttatccca 7200
ttaggtatga aagaattagc ataattcccc ttaaacatga atgaatctta gattttttaa 7260
taaatagttt tggaagtaaa gacagagaca tcaggagcac aaggaatagc ctgagaggac 7320
aaacagaaca agaaagagtc tggaaataca caggatgttc ttggcctcct caaagcaagt 7380
gcaagcagat agtaccagca gccccaggct atcagagccc agtgaagaga agtaccatga 7440
aagccacagc tctaaccacc ctgttccaga gtgacagaca gtccccaaga caagccagcc 7500
tgagccagag agagaactgc aagagaaagt ttctaattta ggttctgtta gattcagaca 7560
agtgcaggtc atcctctctc cacagctact cacctctcca gcctaacaaa gcctgcagtc 7620
cacactccaa ccctggtgtc tcacctccta gcctctccca acatcctgct ctctgaccat 7680
cttctgcatc tctcatctca ccatctccca ctgtctacag cctactcttg caactaccat 7740
ctcattttct gacatcctgt ctacatcttc tgccatactc tgccatctac cataccacct 7800
cttaccatct accacaccat cttttatctc catccctctc agaagcctcc aagctgaatc 7860
ctgctttatg tgttcatctc agcccctgca tggaaagctg accccagagg cagaactatt 7920
cccagagagc ttggccaaga aaaacaaaac taccagcctg gccaggctca ggagtagtaa 7980
gctgcagtgt ctgttgtgtt ctagcttcaa cagctgcagg agttccactc tcaaatgctc 8040
cacatttctc acatcctcct gattctggtc actacccatc ttcaaagaac agaatatctc 8100
acatcagcat actgtgaagg actagtcatg ggtgcagctg ctcagagctg caaagtcatt 8160
ctggatggtg gagagcttac aaacatttca tgatgctccc cccgctctga tggctggagc 8220
ccaatcccta cacagactcc tgctgtatgt gttttccttt cactctgagc cacagccaga 8280
gggcaggcat tcagtctcct cttcaggctg gggctggggc actgagaact cacccaacac 8340
cttgctctca ctccttctgc aaaacaagaa agagctttgt gctgcagtag ccatgaagaa 8400
tgaaaggaag gctttaacta aaaaatgtca gagattattt tcaacccctt actgtggatc 8460
accagcaagg aggaaacaca acacagagac attttttccc ctcaaattat caaaagaatc 8520
actgcatttg ttaaagagag caactgaatc aggaagcaga gttttgaaca tatcagaagt 8580
taggaatctg catcagagac aaatgcagtc atggttgttt gctgcatacc agccctaatc 8640
attagaagcc tcatggactt caaacatcat tccctctgac aagatgctct agcctaactc 8700
catgagataa aataaatctg cctttcagag ccaaagaaga gtccaccagc ttcttctcag 8760
tgtgaacaag agctccagtc aggttagtca gtccagtgca gtagaggaga ccagtctgca 8820
tcctctaatt ttcaaaggca agaagatttg tttaccctgg acaccaggca caagtgaggt 8880
cacagagctc ttagatatgc agtcctcatg agtgaggaga ctaaagcgca tgccatcaag 8940
acttcagtgt agagaaaacc tccaaaaaag cctcctcact acttctggaa tagctcagag 9000
gccgaggcgg cctcggcctc tgcataaata aaaaaaatta gtcagccatg gggcggagaa 9060
tgggcggaac tgggcggagt taggggcggg atgggcggag ttaggggcgg gactatggtt 9120
gctgactaat tgagatgcat gctttgcata cttctgcctg ctggggagcc tggggacttt 9180
ccacacctgg ttgctgacta attgagatgc atgctttgca tacttctgcc tgctggggag 9240
cctggggact ttccacaccc taactgacac acattccaca gctgcattaa tgaatcggcc 9300
aacgcgcggg gagaggcggt ttgcgtattg ggcgctcttc cgcttcctcg ctcactgact 9360
cgctgcgctc ggtcgttcgg ctgcggcgag cggtatcagc tcactcaaag gcggtaatac 9420
ggttatccac agaatcaggg gataacgcag gaaagaacat gtgagcaaaa ggccagcaaa 9480
aggccaggaa ccgtaaaaag gccgcgttgc tggcgttttt ccataggctc cgcccccctg 9540
acgagcatca caaaaatcga cgctcaagtc agaggtggcg aaacccgaca ggactataaa 9600
gataccaggc gtttccccct ggaagctccc tcgtgcgctc tcctgttccg accctgccgc 9660
ttaccggata cctgtccgcc tttctccctt cgggaagcgt ggcgctttct catagctcac 9720
gctgtaggta tctcagttcg gtgtaggtcg ttcgctccaa gctgggctgt gtgcacgaac 9780
cccccgttca gcccgaccgc tgcgccttat ccggtaacta tcgtcttgag tccaacccgg 9840
taagacacga cttatcgcca ctggcagcag ccactggtaa caggattagc agagcgaggt 9900
atgtaggcgg tgctacagag ttcttgaagt ggtggcctaa ctacggctac actagaagaa 9960
cagtatttgg tatctgcgct ctgctgaagc cagttacctt cggaaaaaga gttggtagct 10020
cttgatccgg caaacaaacc accgctggta gcggtggttt ttttgtttgc aagcagcaga 10080
ttacgcgcag aaaaaaagga tctcaagaag atcctttgat cttttctacg gggtctgacg 10140
ctcagtggaa cgaaaactca cgttaaggga ttttggtcat gagattatca aaaaggatct 10200
tcacctagat ccttttaaat taaaaatgaa gttttaaatc aatctaaagt atatatgagt 10260
aaacttggtc tgacagttac caatgcttaa tcagtgaggc acctatctca gcgatctgtc 10320
tatttcgttc atccatagtt gcctgactcc tgcaaaccac gttgtgtctc aaaatctctg 10380
atgttacatt gcacaagata aaaatatatc atcatgaaca ataaaactgt ctgcttacat 10440
aaacagtaat acaaggggtg ttatgagcca tattcaacgg gaaacgtctt gctcgaggcc 10500
gcgattaaat tccaacatgg atgctgattt atatgggtat aaatgggctc gcgataatgt 10560
cgggcaatca ggtgcgacaa tctatcgatt gtatgggaag cccgatgcgc cagagttgtt 10620
tctgaaacat ggcaaaggta gcgttgccaa tgatgttaca gatgagatgg tcagactaaa 10680
ctggctgacg gaatttatgc ctcttccgac catcaagcat tttatccgta ctcctgatga 10740
tgcatggtta ctcaccactg cgatccccgg gaaaacagca ttccaggtat tagaagaata 10800
tcctgattca ggtgaaaata ttgttgatgc gctggcagtg ttcctgcgcc ggttgcattc 10860
gattcctgtt tgtaattgtc cttttaacag cgatcgcgta tttcgtctcg ctcaggcgca 10920
atcacgaatg aataacggtt tggttgatgc gagtgatttt gatgacgagc gtaatggctg 10980
gcctgttgaa caagtctgga aagaaatgca taagcttttg ccattctcac cggattcagt 11040
cgtcactcat ggtgatttct cacttgataa ccttattttt gacgagggga aattaatagg 11100
ttgtattgat gttggacgag tcggaatcgc agaccgatac caggatcttg ccatcctatg 11160
gaactgcctc ggtgagtttt ctccttcatt acagaaacgg ctttttcaaa aatatggtat 11220
tgataatcct gatatgaata aattgcagtt tcatttgatg ctcgatgagt ttttctaagg 11280
gcggcctgcc accataccca cgccgaaaca agcgctcatg agcccgaagt ggcgagcccg 11340
atcttcccca tcggtgatgt cggcgatata ggcgccagca accgcacctg tggcgccggt 11400
gatgagggcg cgccaagtcg acgtccggca gtc 11433
<210> 43
<211> 11776
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 43
ttggccactc cctctctgcg cgctcgctcg ctcactgagg ccgggcgacc aaaggtcgcc 60
cgacgcccgg gctttgcccg ggcggcctca gtgagcgagc gagcgcgcag agagggagtg 120
gccaactcca tcactagggg ttcctgctag ctctgggtat ttaagcccga gtgagcacgc 180
agggtctcca ttttgaagcg ggaggttacg cgttcgtcga ctactagtgg gtaccagagc 240
tccctaggtt ctagaaccgg tgacgtctcc catggtgaag cttggatctg agggcggagt 300
tagggcggag ccaatcagcg tgcgccgttc cgaaagttgc cttttatggc tgggcggaga 360
atgggcggtg aacgccgatg attatataag gacgcgccgg gtgtggcaca gctagttccg 420
tcgcagccgg gatttgggtc gcggttcttg tttgtggatc cctgtgatcg tcacttggta 480
agtcactgac tgtctatgcc tgggaaaggg tgggcaggag atggggcagt gcaggaaaag 540
tggcactatg aaccctgcag ccctaggaat gcatctagac aattgtacta accttcttct 600
ctttcctctc ctgacagtcc ggaaagccac catggccgag tggctgctga gcgccagctg 660
gcagcgccgc gccaaggcca tgaccgccgc cgccggcagc gccggccgcg ccgccgtgcc 720
cctgctgctg tgcgccctgc tggcccccgg cggcgcctac gtgctggacg acagcgacgg 780
cctgggccgc gagttcgacg gcatcggcgc cgtgagcggc ggcggcgcca ccagccgcct 840
gctggtgaac taccccgagc cctaccgcag ccagatcctg gactacctgt tcaagcccaa 900
cttcggcgcc agcctgcaca tcctgaaggt ggagatcggc ggcgacggcc agaccaccga 960
cggcaccgag cccagccaca tgcactacgc cctggacgag aactacttcc gcggctacga 1020
gtggtggctg atgaaggagg ccaagaagcg caaccccaac atcaccctga tcggcctgcc 1080
ctggagcttc cccggctggc tgggcaaggg cttcgactgg ccctacgtga acctgcagct 1140
gaccgcctac tacgtggtga cctggatcgt gggcgccaag cgctaccacg acctggacat 1200
cgactacatc ggcatctgga acgagcgcag ctacaacgcc aactacatca agatcctgcg 1260
caagatgctg aactaccagg gcctgcagcg cgtgaagatc atcgccagcg acaacctgtg 1320
ggagagcatc agcgccagca tgctgctgga cgccgagctg ttcaaggtgg tggacgtgat 1380
cggcgcccac taccccggca cccacagcgc caaggacgcc aagctgaccg gcaagaagct 1440
gtggagcagc gaggacttca gcaccctgaa cagcgacatg ggcgccggct gctggggccg 1500
catcctgaac cagaactaca tcaacggcta catgaccagc accatcgcct ggaacctggt 1560
ggccagctac tacgagcagc tgccctacgg ccgctgcggc ctgatgaccg cccaggagcc 1620
ctggagcggc cactacgtgg tggagagccc cgtgtgggtg agcgcccaca ccacccagtt 1680
cacccagccc ggctggtact acctgaagac cgtgggccac ctggagaagg gcggcagcta 1740
cgtggccctg accgacggcc tgggcaacct gaccatcatc atcgagacca tgagccacaa 1800
gcacagcaag tgcatccgcc ccttcctgcc ctacttcaac gtgagccagc agttcgccac 1860
cttcgtgctg aagggcagct tcagcgagat ccccgagctg caggtgtggt acaccaagct 1920
gggcaagacc agcgagcgct tcctgttcaa gcagctggac agcctgtggc tgctggacag 1980
cgacggcagc ttcaccctga gcctgcacga ggacgagctg ttcaccctga ccaccctgac 2040
caccggccgc aagggcagct accccctgcc ccccaagagc cagcccttcc ccagcaccta 2100
caaggacgac ttcaacgtgg actacccctt cttcagcgag gcccccaact tcgccgacca 2160
gaccggcgtg ttcgagtact tcaccaacat cgaggacccc ggcgagcacc acttcaccct 2220
gcgccaggtg ctgaaccagc gccccatcac ctgggccgcc gacgccagca acaccatcag 2280
catcatcggc gactacaact ggaccaacct gaccatcaag tgcgacgtgt acatcgagac 2340
ccccgacacc ggcggcgtgt tcatcgccgg ccgcgtgaac aagggcggca tcctgatccg 2400
cagcgcccgc ggcatcttct tctggatctt cgccaacggc agctaccgcg tgaccggcga 2460
cctggccggc tggatcatct acgccctggg ccgcgtggag gtgaccgcca agaagtggta 2520
caccctgacc ctgaccatca agggccactt caccagcggc atgctgaacg acaagagcct 2580
gtggaccgac atccccgtga acttccccaa gaacggctgg gccgccatcg gcacccacag 2640
cttcgagttc gcccagttcg acaacttcct ggtggaggcc acccgctgat tgtggccgaa 2700
ccgccgaact cagaggccgg ccccagaaaa cccgagcgag tagggggcgg cgcgcaggag 2760
ggaggagaac tgggggcgcg ggaggctggt gggtgtgggg ggtggagatg tagaagatgt 2820
gacgccgcgg cccggcgggt gccagattag cggacgcggt gcccgcggtt gcaacgggat 2880
cccgggcgct gcagcttggg aggcggctct ccccaggcgg cgtccgcgga gacacccatc 2940
cgtgaacccc aggtcccggg ccgccggctc gccgcgcacc aggggccggc ggacagaaga 3000
gcggccgagc ggctcgaggc tgggggaccg cgggcgcggc cgcgcgctgc cgggcgggag 3060
gctggggggc cggggccggg gccgtgcccc ggagcgggtc ggaggccggg gccggggccg 3120
ggggacggcg gctccccgcg cggctccagc ggctcgggga tcccggccgg gccccgcagg 3180
gaccatgatg gaattcagca gccccagcag agaggaatgc cccaagcctc tgagccgggt 3240
gtcaatcatg gccggatctc tgacaggact gctgctgctt caggccgtgt cttgggcttc 3300
tggcgctaga ccttgcatcc ccaagagctt cggctacagc agcgtcgtgt gcgtgtgcaa 3360
tgccacctac tgcgacagct tcgaccctcc tacctttcct gctctgggca ccttcagcag 3420
atacgagagc accagatccg gcagacggat ggaactgagc atgggaccca tccaggccaa 3480
tcacacaggc actggcctgc tgctgacact gcagcctgag cagaaattcc agaaagtgaa 3540
aggcttcggc ggagccatga cagatgccgc cgctctgaat atcctggctc tgtctccacc 3600
agctcagaac ctgctgctca agagctactt cagcgaggaa ggcatcggct acaacatcat 3660
cagagtgccc atggccagct gcgacttcag catcaggacc tacacctacg ccgacacacc 3720
cgacgatttc cagctgcaca acttcagcct gcctgaagag gacaccaagc tgaagatccc 3780
tctgatccac agagccctgc agctggcaca aagacccgtg tcactgctgg cctctccatg 3840
gacatctccc acctggctga aaacaaatgg cgccgtgaat ggcaagggca gcctgaaagg 3900
ccaacctggc gacatctacc accagacctg ggccagatac ttcgtgaagt tcctggacgc 3960
ctatgccgag cacaagctgc agttttgggc cgtgacagcc gagaacgaac cttctgctgg 4020
actgctgagc ggctacccct ttcagtgcct gggctttaca cccgagcacc agcgggactt 4080
tatcgcccgt gatctgggac ccacactggc caatagcacc caccataatg tgcggctgct 4140
gatgctggac gaccagagac tgcttctgcc ccactgggct aaagtggtgc tgacagatcc 4200
tgaggccgcc aaatacgtgc acggaatcgc cgtgcactgg tatctggact ttctggcccc 4260
tgccaaggcc acactgggag agacacacag actgttcccc aacaccatgc tgttcgccag 4320
cgaagcctgt gtgggcagca agttttggga acagagcgtg cggctcggca gctgggatag 4380
aggcatgcag tacagccaca gcatcatcac caacctgctg taccacgtcg tcggctggac 4440
cgactggaat ctggccctga atcctgaagg cggccctaac tgggtccgaa acttcgtgga 4500
cagccccatc atcgtggaca tcaccaagga caccttctac aagcagccca tgttctacca 4560
cctgggacac ttcagcaagt tcatccccga gggctctcag cgcgttggac tggtggcttc 4620
ccagaagaac gatctggacg ccgtggctct gatgcaccct gatggatctg ctgtggtggt 4680
ggtcctgaac cgcagcagca aagatgtgcc cctgaccatc aaggatcccg ccgtgggatt 4740
cctggaaaca atcagccctg gctactccat ccacacctac ctgtggcgta gacagtgaca 4800
attgttaatt aagtttaaac cctcgaggcc gcaagcaata aaatatcttt attttcatta 4860
catctgtgtg ttggtttttt gtgtggagat ccacgataac aaacagcttt tttggggtga 4920
acatattgac tgaattccct gcaggttggc cactccctct ctgcgcgctc gctcgctcac 4980
tgaggccgcc cgggcaaagc ccgggcgtcg ggcgaccttt ggtcgcccgg cctcagtgag 5040
cgagcgagcg cgcagagagg gagtggccaa ctccatcact aggggttcct gcggccgctc 5100
gtacggtctc gaggaattcc tgcaggataa cttgccaacc tcattctaaa atgtatatag 5160
aagcccaaaa gacaataaca aaaatattct tgtagaacaa aatgggaaag aatgttccac 5220
taaatatcaa gatttagagc aaagcatgag atgtgtgggg atagacagtg aggctgataa 5280
aatagagtag agctcagaaa cagacccatt gatatatgta agtgacctat gaaaaaaata 5340
tggcatttta caatgggaaa atgatggtct ttttcttttt tagaaaaaca gggaaatata 5400
tttatatgta aaaaataaaa gggaacccat atgtcatacc atacacacaa aaaaattcca 5460
gtgaattata agtctaaatg gagaaggcaa aactttaaat cttttagaaa ataatataga 5520
agcatgcaga ccagcctggc caacatgatg aaaccctctc tactaataat aaaatcagta 5580
gaactactca ggactacttt gagtgggaag tccttttcta tgaagacttc tttggccaaa 5640
attaggctct aaatgcaagg agatagtgca tcatgcctgg ctgcacttac tgataaatga 5700
tgttatcacc atctttaacc aaatgcacag gaacaagtta tggtactgat gtgctggatt 5760
gagaaggagc tctacttcct tgacaggaca catttgtatc aacttaaaaa agcagatttt 5820
tgccagcaga actattcatt cagaggtagg aaacttagaa tagatgatgt cactgattag 5880
catggcttcc ccatctccac agctgcttcc cacccaggtt gcccacagtt gagtttgtcc 5940
agtgctcagg gctgcccact ctcagtaaga agccccacac cagcccctct ccaaatatgt 6000
tggctgttcc ttccattaaa gtgaccccac tttagagcag caagtggatt tctgtttctt 6060
acagttcagg aaggaggagt cagctgtgag aacctggagc ctgagatgct tctaagtccc 6120
actgctactg gggtcaggga agccagactc cagcatcagc agtcaggagc actaagccct 6180
tgccaacatc ctgtttctca gagaaactgc ttccattata atggttgtcc ttttttaagc 6240
tatcaagcca aacaaccagt gtctaccatt attctcatca cctgaagcca agggttctag 6300
caaaagtcaa gctgtcttgt aatggttgat gtgcctccag cttctgtctt cagtcactcc 6360
actcttagcc tgctctgaat caactctgac cacagttccc tggagcccct gccacctgct 6420
gcccctgcca ccttctccat ctgcagtgct gtgcagcctt ctgcactctt gcagagctaa 6480
taggtggaga cttgaaggaa gaggaggaaa gtttctcata atagccttgc tgcaagctca 6540
aatgggaggt gggcactgtg cccaggagcc ttggagcaaa ggctgtgccc aacctctgac 6600
tgcatccagg tttggtcttg acagagataa gaagccctgg cttttggagc caaaatctag 6660
gtcagactta ggcaggattc tcaaagttta tcagcagaac atgaggcaga agaccctttc 6720
tgctccagct tcttcaggct caaccttcat cagaatagat agaaagagag gctgtgaggg 6780
ttcttaaaac agaagcaaat ctgactcaga gaataaacaa cctcctagta aactacagct 6840
tagacagagc atctggtggt gagtgtgctc agtgtcctac tcaactgtct ggtatcagcc 6900
ctcatgagga cttctcttct ttccctcata gacctccatc tctgttttcc ttagcctgca 6960
gaaatctgga tggctattca cagaatgcct gtgctttcag agttgcattt tttctctggt 7020
attctggttc aagcatttga aggtaggaaa ggttctccaa gtgcaagaaa gccagccctg 7080
agcctcaact gcctggctag tgtggtcagt aggatgcaaa ggctgttgaa tgccacaagg 7140
ccaaacttta acctgtgtac cacaagccta gcagcagagg cagctctgct cactggaact 7200
ctctgtcttc tttctcctga gccttttctt ttcctgagtt ttctagctct cctcaacctt 7260
acctctgccc tacccaggac aaacccaaga gccactgttt ctgtgatgtc ctctccagcc 7320
ctaattaggc atcatgactt cagcctgacc ttccatgctc agaagcagtg ctaatccact 7380
tcagatgagc tgctctatgc aacacaggca gagcctacaa acctttgcac cagagccctc 7440
cacatatcag tgtttgttca tactcacttc aacagcaaat gtgactgctg agattaagat 7500
tttacacaag atggtctgta atttcacagt tagttttatc ccattaggta tgaaagaatt 7560
agcataattc cccttaaaca tgaatgaatc ttagattttt taataaatag ttttggaagt 7620
aaagacagag acatcaggag cacaaggaat agcctgagag gacaaacaga acaagaaaga 7680
gtctggaaat acacaggatg ttcttggcct cctcaaagca agtgcaagca gatagtacca 7740
gcagccccag gctatcagag cccagtgaag agaagtacca tgaaagccac agctctaacc 7800
accctgttcc agagtgacag acagtcccca agacaagcca gcctgagcca gagagagaac 7860
tgcaagagaa agtttctaat ttaggttctg ttagattcag acaagtgcag gtcatcctct 7920
ctccacagct actcacctct ccagcctaac aaagcctgca gtccacactc caaccctggt 7980
gtctcacctc ctagcctctc ccaacatcct gctctctgac catcttctgc atctctcatc 8040
tcaccatctc ccactgtcta cagcctactc ttgcaactac catctcattt tctgacatcc 8100
tgtctacatc ttctgccata ctctgccatc taccatacca cctcttacca tctaccacac 8160
catcttttat ctccatccct ctcagaagcc tccaagctga atcctgcttt atgtgttcat 8220
ctcagcccct gcatggaaag ctgaccccag aggcagaact attcccagag agcttggcca 8280
agaaaaacaa aactaccagc ctggccaggc tcaggagtag taagctgcag tgtctgttgt 8340
gttctagctt caacagctgc aggagttcca ctctcaaatg ctccacattt ctcacatcct 8400
cctgattctg gtcactaccc atcttcaaag aacagaatat ctcacatcag catactgtga 8460
aggactagtc atgggtgcag ctgctcagag ctgcaaagtc attctggatg gtggagagct 8520
tacaaacatt tcatgatgct ccccccgctc tgatggctgg agcccaatcc ctacacagac 8580
tcctgctgta tgtgttttcc tttcactctg agccacagcc agagggcagg cattcagtct 8640
cctcttcagg ctggggctgg ggcactgaga actcacccaa caccttgctc tcactccttc 8700
tgcaaaacaa gaaagagctt tgtgctgcag tagccatgaa gaatgaaagg aaggctttaa 8760
ctaaaaaatg tcagagatta ttttcaaccc cttactgtgg atcaccagca aggaggaaac 8820
acaacacaga gacatttttt cccctcaaat tatcaaaaga atcactgcat ttgttaaaga 8880
gagcaactga atcaggaagc agagttttga acatatcaga agttaggaat ctgcatcaga 8940
gacaaatgca gtcatggttg tttgctgcat accagcccta atcattagaa gcctcatgga 9000
cttcaaacat cattccctct gacaagatgc tctagcctaa ctccatgaga taaaataaat 9060
ctgcctttca gagccaaaga agagtccacc agcttcttct cagtgtgaac aagagctcca 9120
gtcaggttag tcagtccagt gcagtagagg agaccagtct gcatcctcta attttcaaag 9180
gcaagaagat ttgtttaccc tggacaccag gcacaagtga ggtcacagag ctcttagata 9240
tgcagtcctc atgagtgagg agactaaagc gcatgccatc aagacttcag tgtagagaaa 9300
acctccaaaa aagcctcctc actacttctg gaatagctca gaggccgagg cggcctcggc 9360
ctctgcataa ataaaaaaaa ttagtcagcc atggggcgga gaatgggcgg aactgggcgg 9420
agttaggggc gggatgggcg gagttagggg cgggactatg gttgctgact aattgagatg 9480
catgctttgc atacttctgc ctgctgggga gcctggggac tttccacacc tggttgctga 9540
ctaattgaga tgcatgcttt gcatacttct gcctgctggg gagcctgggg actttccaca 9600
ccctaactga cacacattcc acagctgcat taatgaatcg gccaacgcgc ggggagaggc 9660
ggtttgcgta ttgggcgctc ttccgcttcc tcgctcactg actcgctgcg ctcggtcgtt 9720
cggctgcggc gagcggtatc agctcactca aaggcggtaa tacggttatc cacagaatca 9780
ggggataacg caggaaagaa catgtgagca aaaggccagc aaaaggccag gaaccgtaaa 9840
aaggccgcgt tgctggcgtt tttccatagg ctccgccccc ctgacgagca tcacaaaaat 9900
cgacgctcaa gtcagaggtg gcgaaacccg acaggactat aaagatacca ggcgtttccc 9960
cctggaagct ccctcgtgcg ctctcctgtt ccgaccctgc cgcttaccgg atacctgtcc 10020
gcctttctcc cttcgggaag cgtggcgctt tctcatagct cacgctgtag gtatctcagt 10080
tcggtgtagg tcgttcgctc caagctgggc tgtgtgcacg aaccccccgt tcagcccgac 10140
cgctgcgcct tatccggtaa ctatcgtctt gagtccaacc cggtaagaca cgacttatcg 10200
ccactggcag cagccactgg taacaggatt agcagagcga ggtatgtagg cggtgctaca 10260
gagttcttga agtggtggcc taactacggc tacactagaa gaacagtatt tggtatctgc 10320
gctctgctga agccagttac cttcggaaaa agagttggta gctcttgatc cggcaaacaa 10380
accaccgctg gtagcggtgg tttttttgtt tgcaagcagc agattacgcg cagaaaaaaa 10440
ggatctcaag aagatccttt gatcttttct acggggtctg acgctcagtg gaacgaaaac 10500
tcacgttaag ggattttggt catgagatta tcaaaaagga tcttcaccta gatcctttta 10560
aattaaaaat gaagttttaa atcaatctaa agtatatatg agtaaacttg gtctgacagt 10620
taccaatgct taatcagtga ggcacctatc tcagcgatct gtctatttcg ttcatccata 10680
gttgcctgac tcctgcaaac cacgttgtgt ctcaaaatct ctgatgttac attgcacaag 10740
ataaaaatat atcatcatga acaataaaac tgtctgctta cataaacagt aatacaaggg 10800
gtgttatgag ccatattcaa cgggaaacgt cttgctcgag gccgcgatta aattccaaca 10860
tggatgctga tttatatggg tataaatggg ctcgcgataa tgtcgggcaa tcaggtgcga 10920
caatctatcg attgtatggg aagcccgatg cgccagagtt gtttctgaaa catggcaaag 10980
gtagcgttgc caatgatgtt acagatgaga tggtcagact aaactggctg acggaattta 11040
tgcctcttcc gaccatcaag cattttatcc gtactcctga tgatgcatgg ttactcacca 11100
ctgcgatccc cgggaaaaca gcattccagg tattagaaga atatcctgat tcaggtgaaa 11160
atattgttga tgcgctggca gtgttcctgc gccggttgca ttcgattcct gtttgtaatt 11220
gtccttttaa cagcgatcgc gtatttcgtc tcgctcaggc gcaatcacga atgaataacg 11280
gtttggttga tgcgagtgat tttgatgacg agcgtaatgg ctggcctgtt gaacaagtct 11340
ggaaagaaat gcataagctt ttgccattct caccggattc agtcgtcact catggtgatt 11400
tctcacttga taaccttatt tttgacgagg ggaaattaat aggttgtatt gatgttggac 11460
gagtcggaat cgcagaccga taccaggatc ttgccatcct atggaactgc ctcggtgagt 11520
tttctccttc attacagaaa cggctttttc aaaaatatgg tattgataat cctgatatga 11580
ataaattgca gtttcatttg atgctcgatg agtttttcta agggcggcct gccaccatac 11640
ccacgccgaa acaagcgctc atgagcccga agtggcgagc ccgatcttcc ccatcggtga 11700
tgtcggcgat ataggcgcca gcaaccgcac ctgtggcgcc ggtgatgagg gcgcgccaag 11760
tcgacgtccg gcagtc 11776
<210> 44
<211> 11064
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 44
ttggccactc cctctctgcg cgctcgctcg ctcactgagg ccgggcgacc aaaggtcgcc 60
cgacgcccgg gctttgcccg ggcggcctca gtgagcgagc gagcgcgcag agagggagtg 120
gccaactcca tcactagggg ttcctgctag ctctgggtat ttaagcccga gtgagcacgc 180
agggtctcca ttttgaagcg ggaggttacg cgttcgtcga ctactagtgg gtaccagagc 240
tccctaggtt ctagaaccgg tgacgtctcc catggtgaag cttggatctg agggcggagt 300
tagggcggag ccaatcagcg tgcgccgttc cgaaagttgc cttttatggc tgggcggaga 360
atgggcggtg aacgccgatg attatataag gacgcgccgg gtgtggcaca gctagttccg 420
tcgcagccgg gatttgggtc gcggttcttg tttgtggatc cctgtgatcg tcacttggta 480
agtcactgac tgtctatgcc tgggaaaggg tgggcaggag atggggcagt gcaggaaaag 540
tggcactatg aaccctgcag ccctaggaat gcatctagac aattgtacta accttcttct 600
ctttcctctc ctgacagtcc ggaaagccac catggaattc agcagcccca gcagagagga 660
atgccccaag cctctgagcc gggtgtcaat catggccgga tctctgacag gactgctgct 720
gcttcaggcc gtgtcttggg cttctggcgc tagaccttgc atccccaaga gcttcggcta 780
cagcagcgtc gtgtgcgtgt gcaatgccac ctactgcgac agcttcgacc ctcctacctt 840
tcctgctctg ggcaccttca gcagatacga gagcaccaga tccggcagac ggatggaact 900
gagcatggga cccatccagg ccaatcacac aggcactggc ctgctgctga cactgcagcc 960
tgagcagaaa ttccagaaag tgaaaggctt cggcggagcc atgacagatg ccgccgctct 1020
gaatatcctg gctctgtctc caccagctca gaacctgctg ctcaagagct acttcagcga 1080
ggaaggcatc ggctacaaca tcatcagagt gcccatggcc agctgcgact tcagcatcag 1140
gacctacacc tacgccgaca cacccgacga tttccagctg cacaacttca gcctgcctga 1200
agaggacacc aagctgaaga tccctctgat ccacagagcc ctgcagctgg cacaaagacc 1260
cgtgtcactg ctggcctctc catggacatc tcccacctgg ctgaaaacaa atggcgccgt 1320
gaatggcaag ggcagcctga aaggccaacc tggcgacatc taccaccaga cctgggccag 1380
atacttcgtg aagttcctgg acgcctatgc cgagcacaag ctgcagtttt gggccgtgac 1440
agccgagaac gaaccttctg ctggactgct gagcggctac ccctttcagt gcctgggctt 1500
tacacccgag caccagcggg actttatcgc ccgtgatctg ggacccacac tggccaatag 1560
cacccaccat aatgtgcggc tgctgatgct ggacgaccag agactgcttc tgccccactg 1620
ggctaaagtg gtgctgacag atcctgaggc cgccaaatac gtgcacggaa tcgccgtgca 1680
ctggtatctg gactttctgg cccctgccaa ggccacactg ggagagacac acagactgtt 1740
ccccaacacc atgctgttcg ccagcgaagc ctgtgtgggc agcaagtttt gggaacagag 1800
cgtgcggctc ggcagctggg atagaggcat gcagtacagc cacagcatca tcaccaacct 1860
gctgtaccac gtcgtcggct ggaccgactg gaatctggcc ctgaatcctg aaggcggccc 1920
taactgggtc cgaaacttcg tggacagccc catcatcgtg gacatcacca aggacacctt 1980
ctacaagcag cccatgttct accacctggg acacttcagc aagttcatcc ccgagggctc 2040
tcagcgcgtt ggactggtgg cttcccagaa gaacgatctg gacgccgtgg ctctgatgca 2100
ccctgatgga tctgctgtgg tggtggtcct gaaccgcagc agcaaagatg tgcccctgac 2160
catcaaggat cccgccgtgg gattcctgga aacaatcagc cctggctact ccatccacac 2220
ctacctgtgg cgtagacagt gacaattgtt aattaagttt aaaccctcga ggccgcaagc 2280
cgcatcgata ccgtcgacta gagctcgctg atcagcctcg actgtgcctt ctagttgcca 2340
gccatctgtt gtttgcccct cccccgtgcc ttccttgacc ctggaaggtg ccactcccac 2400
tgtcctttcc taataaaatg aggaaattgc atcgcattgt ctgagtaggt gtcattctat 2460
tctggggggt ggggtggggc aggacagcaa gggggaggat tgggaagaca atagcaggca 2520
tgctggggag agatccacga taacaaacag cttttttggg ggggcggagt tagggcggag 2580
ccaatcagcg tgcgccgttc cgaaagttgc cttttatggc tgggcggaga atgggcggtg 2640
aacgccgatg attatataag gacgcgccgg gtgtggcaca gctagttccg tcgcagccgg 2700
gatttgggtc gcggttcttg tttgtggatc cctgtgatcg tcacttggta agtcactgac 2760
tgtctatgcc tgggaaaggg tgggcaggag atggggcagt gcaggaaaag tggcactatg 2820
aaccctgcag ccctaggaat gcatctagac aattgtacta accttcttct ctttcctctc 2880
ctgacagtcc ggaaagccac catgtggcag ctgtgggcca gcctgtgctg cctgctggtg 2940
ctggccaacg cccgcagccg ccccagcttc caccccctga gcgacgagct ggtgaactac 3000
gtgaacaagc gcaacaccac ctggcaggcc ggccacaact tctacaacgt ggacatgagc 3060
tacctgaagc gcctgtgcgg caccttcctg ggcggcccca agccccccca gcgcgtgatg 3120
ttcaccgagg acctgaagct gcccgccagc ttcgacgccc gcgagcagtg gccccagtgc 3180
cccaccatca aggagatccg cgaccagggc agctgcggca gctgctgggc cttcggcgcc 3240
gtggaggcca tcagcgaccg catctgcatc cacaccaacg cccacgtgag cgtggaggtg 3300
agcgccgagg acctgctgac ctgctgcggc agcatgtgcg gcgacggctg caacggcggc 3360
taccccgccg aggcctggaa cttctggacc cgcaagggcc tggtgagcgg cggcctgtac 3420
gagagccacg tgggctgccg cccctacagc atccccccct gcgagcacca cgtgaacggc 3480
agccgccccc cctgcaccgg cgagggcgac acccccaagt gcagcaagat ctgcgagccc 3540
ggctacagcc ccacctacaa gcaggacaag cactacggct acaacagcta cagcgtgagc 3600
aacagcgaga aggacatcat ggccgagatc tacaagaacg gccccgtgga gggcgccttc 3660
agcgtgtaca gcgacttcct gctgtacaag agcggcgtgt accagcacgt gaccggcgag 3720
atgatgggcg gccacgccat ccgcatcctg ggctggggcg tggagaacgg caccccctac 3780
tggctggtgg ccaacagctg gaacaccgac tggggcgaca acggcttctt caagatcctg 3840
cgcggccagg accactgcgg catcgagagc gaggtggtgg ccggcatccc ccgcaccgac 3900
cagtactggg agaagatctg acccagggga ctcagcggcc gctcgagtct agagggcccg 3960
tttaaacccg ctgatcagcc tcgaagacat gataagatac attgatgagt ttggacaaac 4020
cacaacaaga atgcagtgaa aaaaatgctt tatttgtgaa atttgtgatg ctattgcttt 4080
atttgtaacc attataagct gcaataaaca agttaacaac aacaattgca ttcattttat 4140
gtttcaggtt cagggggaga tgtgggaggt tttttaaagc aagtaaaacc tctacaaatg 4200
tggtatgaac atattgactg aattccctgc aggttggcca ctccctctct gcgcgctcgc 4260
tcgctcactg aggccgcccg ggcaaagccc gggcgtcggg cgacctttgg tcgcccggcc 4320
tcagtgagcg agcgagcgcg cagagaggga gtggccaact ccatcactag gggttcctgc 4380
ggccgctcgt acggtctcga ggaattcctg caggataact tgccaacctc attctaaaat 4440
gtatatagaa gcccaaaaga caataacaaa aatattcttg tagaacaaaa tgggaaagaa 4500
tgttccacta aatatcaaga tttagagcaa agcatgagat gtgtggggat agacagtgag 4560
gctgataaaa tagagtagag ctcagaaaca gacccattga tatatgtaag tgacctatga 4620
aaaaaatatg gcattttaca atgggaaaat gatggtcttt ttctttttta gaaaaacagg 4680
gaaatatatt tatatgtaaa aaataaaagg gaacccatat gtcataccat acacacaaaa 4740
aaattccagt gaattataag tctaaatgga gaaggcaaaa ctttaaatct tttagaaaat 4800
aatatagaag catgcagacc agcctggcca acatgatgaa accctctcta ctaataataa 4860
aatcagtaga actactcagg actactttga gtgggaagtc cttttctatg aagacttctt 4920
tggccaaaat taggctctaa atgcaaggag atagtgcatc atgcctggct gcacttactg 4980
ataaatgatg ttatcaccat ctttaaccaa atgcacagga acaagttatg gtactgatgt 5040
gctggattga gaaggagctc tacttccttg acaggacaca tttgtatcaa cttaaaaaag 5100
cagatttttg ccagcagaac tattcattca gaggtaggaa acttagaata gatgatgtca 5160
ctgattagca tggcttcccc atctccacag ctgcttccca cccaggttgc ccacagttga 5220
gtttgtccag tgctcagggc tgcccactct cagtaagaag ccccacacca gcccctctcc 5280
aaatatgttg gctgttcctt ccattaaagt gaccccactt tagagcagca agtggatttc 5340
tgtttcttac agttcaggaa ggaggagtca gctgtgagaa cctggagcct gagatgcttc 5400
taagtcccac tgctactggg gtcagggaag ccagactcca gcatcagcag tcaggagcac 5460
taagcccttg ccaacatcct gtttctcaga gaaactgctt ccattataat ggttgtcctt 5520
ttttaagcta tcaagccaaa caaccagtgt ctaccattat tctcatcacc tgaagccaag 5580
ggttctagca aaagtcaagc tgtcttgtaa tggttgatgt gcctccagct tctgtcttca 5640
gtcactccac tcttagcctg ctctgaatca actctgacca cagttccctg gagcccctgc 5700
cacctgctgc ccctgccacc ttctccatct gcagtgctgt gcagccttct gcactcttgc 5760
agagctaata ggtggagact tgaaggaaga ggaggaaagt ttctcataat agccttgctg 5820
caagctcaaa tgggaggtgg gcactgtgcc caggagcctt ggagcaaagg ctgtgcccaa 5880
cctctgactg catccaggtt tggtcttgac agagataaga agccctggct tttggagcca 5940
aaatctaggt cagacttagg caggattctc aaagtttatc agcagaacat gaggcagaag 6000
accctttctg ctccagcttc ttcaggctca accttcatca gaatagatag aaagagaggc 6060
tgtgagggtt cttaaaacag aagcaaatct gactcagaga ataaacaacc tcctagtaaa 6120
ctacagctta gacagagcat ctggtggtga gtgtgctcag tgtcctactc aactgtctgg 6180
tatcagccct catgaggact tctcttcttt ccctcataga cctccatctc tgttttcctt 6240
agcctgcaga aatctggatg gctattcaca gaatgcctgt gctttcagag ttgcattttt 6300
tctctggtat tctggttcaa gcatttgaag gtaggaaagg ttctccaagt gcaagaaagc 6360
cagccctgag cctcaactgc ctggctagtg tggtcagtag gatgcaaagg ctgttgaatg 6420
ccacaaggcc aaactttaac ctgtgtacca caagcctagc agcagaggca gctctgctca 6480
ctggaactct ctgtcttctt tctcctgagc cttttctttt cctgagtttt ctagctctcc 6540
tcaaccttac ctctgcccta cccaggacaa acccaagagc cactgtttct gtgatgtcct 6600
ctccagccct aattaggcat catgacttca gcctgacctt ccatgctcag aagcagtgct 6660
aatccacttc agatgagctg ctctatgcaa cacaggcaga gcctacaaac ctttgcacca 6720
gagccctcca catatcagtg tttgttcata ctcacttcaa cagcaaatgt gactgctgag 6780
attaagattt tacacaagat ggtctgtaat ttcacagtta gttttatccc attaggtatg 6840
aaagaattag cataattccc cttaaacatg aatgaatctt agatttttta ataaatagtt 6900
ttggaagtaa agacagagac atcaggagca caaggaatag cctgagagga caaacagaac 6960
aagaaagagt ctggaaatac acaggatgtt cttggcctcc tcaaagcaag tgcaagcaga 7020
tagtaccagc agccccaggc tatcagagcc cagtgaagag aagtaccatg aaagccacag 7080
ctctaaccac cctgttccag agtgacagac agtccccaag acaagccagc ctgagccaga 7140
gagagaactg caagagaaag tttctaattt aggttctgtt agattcagac aagtgcaggt 7200
catcctctct ccacagctac tcacctctcc agcctaacaa agcctgcagt ccacactcca 7260
accctggtgt ctcacctcct agcctctccc aacatcctgc tctctgacca tcttctgcat 7320
ctctcatctc accatctccc actgtctaca gcctactctt gcaactacca tctcattttc 7380
tgacatcctg tctacatctt ctgccatact ctgccatcta ccataccacc tcttaccatc 7440
taccacacca tcttttatct ccatccctct cagaagcctc caagctgaat cctgctttat 7500
gtgttcatct cagcccctgc atggaaagct gaccccagag gcagaactat tcccagagag 7560
cttggccaag aaaaacaaaa ctaccagcct ggccaggctc aggagtagta agctgcagtg 7620
tctgttgtgt tctagcttca acagctgcag gagttccact ctcaaatgct ccacatttct 7680
cacatcctcc tgattctggt cactacccat cttcaaagaa cagaatatct cacatcagca 7740
tactgtgaag gactagtcat gggtgcagct gctcagagct gcaaagtcat tctggatggt 7800
ggagagctta caaacatttc atgatgctcc ccccgctctg atggctggag cccaatccct 7860
acacagactc ctgctgtatg tgttttcctt tcactctgag ccacagccag agggcaggca 7920
ttcagtctcc tcttcaggct ggggctgggg cactgagaac tcacccaaca ccttgctctc 7980
actccttctg caaaacaaga aagagctttg tgctgcagta gccatgaaga atgaaaggaa 8040
ggctttaact aaaaaatgtc agagattatt ttcaacccct tactgtggat caccagcaag 8100
gaggaaacac aacacagaga cattttttcc cctcaaatta tcaaaagaat cactgcattt 8160
gttaaagaga gcaactgaat caggaagcag agttttgaac atatcagaag ttaggaatct 8220
gcatcagaga caaatgcagt catggttgtt tgctgcatac cagccctaat cattagaagc 8280
ctcatggact tcaaacatca ttccctctga caagatgctc tagcctaact ccatgagata 8340
aaataaatct gcctttcaga gccaaagaag agtccaccag cttcttctca gtgtgaacaa 8400
gagctccagt caggttagtc agtccagtgc agtagaggag accagtctgc atcctctaat 8460
tttcaaaggc aagaagattt gtttaccctg gacaccaggc acaagtgagg tcacagagct 8520
cttagatatg cagtcctcat gagtgaggag actaaagcgc atgccatcaa gacttcagtg 8580
tagagaaaac ctccaaaaaa gcctcctcac tacttctgga atagctcaga ggccgaggcg 8640
gcctcggcct ctgcataaat aaaaaaaatt agtcagccat ggggcggaga atgggcggaa 8700
ctgggcggag ttaggggcgg gatgggcgga gttaggggcg ggactatggt tgctgactaa 8760
ttgagatgca tgctttgcat acttctgcct gctggggagc ctggggactt tccacacctg 8820
gttgctgact aattgagatg catgctttgc atacttctgc ctgctgggga gcctggggac 8880
tttccacacc ctaactgaca cacattccac agctgcatta atgaatcggc caacgcgcgg 8940
ggagaggcgg tttgcgtatt gggcgctctt ccgcttcctc gctcactgac tcgctgcgct 9000
cggtcgttcg gctgcggcga gcggtatcag ctcactcaaa ggcggtaata cggttatcca 9060
cagaatcagg ggataacgca ggaaagaaca tgtgagcaaa aggccagcaa aaggccagga 9120
accgtaaaaa ggccgcgttg ctggcgtttt tccataggct ccgcccccct gacgagcatc 9180
acaaaaatcg acgctcaagt cagaggtggc gaaacccgac aggactataa agataccagg 9240
cgtttccccc tggaagctcc ctcgtgcgct ctcctgttcc gaccctgccg cttaccggat 9300
acctgtccgc ctttctccct tcgggaagcg tggcgctttc tcatagctca cgctgtaggt 9360
atctcagttc ggtgtaggtc gttcgctcca agctgggctg tgtgcacgaa ccccccgttc 9420
agcccgaccg ctgcgcctta tccggtaact atcgtcttga gtccaacccg gtaagacacg 9480
acttatcgcc actggcagca gccactggta acaggattag cagagcgagg tatgtaggcg 9540
gtgctacaga gttcttgaag tggtggccta actacggcta cactagaaga acagtatttg 9600
gtatctgcgc tctgctgaag ccagttacct tcggaaaaag agttggtagc tcttgatccg 9660
gcaaacaaac caccgctggt agcggtggtt tttttgtttg caagcagcag attacgcgca 9720
gaaaaaaagg atctcaagaa gatcctttga tcttttctac ggggtctgac gctcagtgga 9780
acgaaaactc acgttaaggg attttggtca tgagattatc aaaaaggatc ttcacctaga 9840
tccttttaaa ttaaaaatga agttttaaat caatctaaag tatatatgag taaacttggt 9900
ctgacagtta ccaatgctta atcagtgagg cacctatctc agcgatctgt ctatttcgtt 9960
catccatagt tgcctgactc ctgcaaacca cgttgtgtct caaaatctct gatgttacat 10020
tgcacaagat aaaaatatat catcatgaac aataaaactg tctgcttaca taaacagtaa 10080
tacaaggggt gttatgagcc atattcaacg ggaaacgtct tgctcgaggc cgcgattaaa 10140
ttccaacatg gatgctgatt tatatgggta taaatgggct cgcgataatg tcgggcaatc 10200
aggtgcgaca atctatcgat tgtatgggaa gcccgatgcg ccagagttgt ttctgaaaca 10260
tggcaaaggt agcgttgcca atgatgttac agatgagatg gtcagactaa actggctgac 10320
ggaatttatg cctcttccga ccatcaagca ttttatccgt actcctgatg atgcatggtt 10380
actcaccact gcgatccccg ggaaaacagc attccaggta ttagaagaat atcctgattc 10440
aggtgaaaat attgttgatg cgctggcagt gttcctgcgc cggttgcatt cgattcctgt 10500
ttgtaattgt ccttttaaca gcgatcgcgt atttcgtctc gctcaggcgc aatcacgaat 10560
gaataacggt ttggttgatg cgagtgattt tgatgacgag cgtaatggct ggcctgttga 10620
acaagtctgg aaagaaatgc ataagctttt gccattctca ccggattcag tcgtcactca 10680
tggtgatttc tcacttgata accttatttt tgacgagggg aaattaatag gttgtattga 10740
tgttggacga gtcggaatcg cagaccgata ccaggatctt gccatcctat ggaactgcct 10800
cggtgagttt tctccttcat tacagaaacg gctttttcaa aaatatggta ttgataatcc 10860
tgatatgaat aaattgcagt ttcatttgat gctcgatgag tttttctaag ggcggcctgc 10920
caccataccc acgccgaaac aagcgctcat gagcccgaag tggcgagccc gatcttcccc 10980
atcggtgatg tcggcgatat aggcgccagc aaccgcacct gtggcgccgg tgatgagggc 11040
gcgccaagtc gacgtccggc agtc 11064
<210> 45
<211> 250
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 45
Met Glu Lys Gly Pro Val Arg Ala Pro Ala Glu Lys Pro Arg Gly Ala
1 5 10 15
Arg Cys Ser Asn Gly Phe Pro Glu Arg Asp Pro Pro Arg Pro Gly Pro
20 25 30
Ser Arg Pro Ala Glu Lys Pro Pro Arg Pro Glu Ala Lys Ser Ala Gln
35 40 45
Pro Ala Asp Gly Trp Lys Gly Glu Arg Pro Arg Ser Glu Glu Asp Asn
50 55 60
Glu Leu Asn Leu Pro Asn Leu Ala Ala Ala Tyr Ser Ser Ile Leu Ser
65 70 75 80
Ser Leu Gly Glu Asn Pro Gln Arg Gln Gly Leu Leu Lys Thr Pro Trp
85 90 95
Arg Ala Ala Ser Ala Met Gln Phe Phe Thr Lys Gly Tyr Gln Glu Thr
100 105 110
Ile Ser Asp Val Leu Asn Asp Ala Ile Phe Asp Glu Asp His Asp Glu
115 120 125
Met Val Ile Val Lys Asp Ile Asp Met Phe Ser Met Cys Glu His His
130 135 140
Leu Val Pro Phe Val Gly Lys Val His Ile Gly Tyr Leu Pro Asn Lys
145 150 155 160
Gln Val Leu Gly Leu Ser Lys Leu Ala Arg Ile Val Glu Ile Tyr Ser
165 170 175
Arg Arg Leu Gln Val Gln Glu Arg Leu Thr Lys Gln Ile Ala Val Ala
180 185 190
Ile Thr Glu Ala Leu Arg Pro Ala Gly Val Gly Val Val Val Glu Ala
195 200 205
Thr His Met Cys Met Val Met Arg Gly Val Gln Lys Met Asn Ser Lys
210 215 220
Thr Val Thr Ser Thr Met Leu Gly Val Phe Arg Glu Asp Pro Lys Thr
225 230 235 240
Arg Glu Glu Phe Leu Thr Leu Ile Arg Ser
245 250
<210> 46
<211> 750
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 46
atggagaagg gccccgtgcg cgcccccgcc gagaagcccc gcggcgcccg ctgcagcaac 60
ggcttccccg agcgcgaccc cccccgcccc ggccccagcc gccccgccga gaagcccccc 120
cgccccgagg ccaagagcgc ccagcccgcc gacggctgga agggcgagcg cccccgcagc 180
gaggaggaca acgagctgaa cctgcccaac ctggccgccg cctacagcag catcctgagc 240
agcctgggcg agaaccccca gcgccagggc ctgctgaaga ccccctggcg cgccgccagc 300
gccatgcagt tcttcaccaa gggctaccag gagaccatca gcgacgtgct gaacgacgcc 360
atcttcgacg aggaccacga cgagatggtg atcgtgaagg acatcgacat gttcagcatg 420
tgcgagcacc acctggtgcc cttcgtgggc aaggtgcaca tcggctacct gcccaacaag 480
caggtgctgg gcctgagcaa gctggcccgc atcgtggaga tctacagccg ccgcctgcag 540
gtgcaggagc gcctgaccaa gcagatcgcc gtggccatca ccgaggccct gcgccccgcc 600
ggcgtgggcg tggtggtgga ggccacccac atgtgcatgg tgatgcgcgg cgtgcagaag 660
atgaacagca agaccgtgac cagcaccatg ctgggcgtgt tccgcgagga ccccaagacc 720
cgcgaggagt tcctgaccct gatccgcagc 750
<210> 47
<211> 203
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 47
Met Gly Ser Arg Asp His Leu Phe Lys Val Leu Val Val Gly Asp Ala
1 5 10 15
Ala Val Gly Lys Thr Ser Leu Val Gln Arg Tyr Ser Gln Asp Ser Phe
20 25 30
Ser Lys His Tyr Lys Ser Thr Val Gly Val Asp Phe Ala Leu Lys Val
35 40 45
Leu Gln Trp Ser Asp Tyr Glu Ile Val Arg Leu Gln Leu Trp Asp Ile
50 55 60
Ala Gly Gln Glu Arg Phe Thr Ser Met Thr Arg Leu Tyr Tyr Arg Asp
65 70 75 80
Ala Ser Ala Cys Val Ile Met Phe Asp Val Thr Asn Ala Thr Thr Phe
85 90 95
Ser Asn Ser Gln Arg Trp Lys Gln Asp Leu Asp Ser Lys Leu Thr Leu
100 105 110
Pro Asn Gly Glu Pro Val Pro Cys Leu Leu Leu Ala Asn Lys Cys Asp
115 120 125
Leu Ser Pro Trp Ala Val Ser Arg Asp Gln Ile Asp Arg Phe Ser Lys
130 135 140
Glu Asn Gly Phe Thr Gly Trp Thr Glu Thr Ser Val Lys Glu Asn Lys
145 150 155 160
Asn Ile Asn Glu Ala Met Arg Val Leu Ile Glu Lys Met Met Arg Asn
165 170 175
Ser Thr Glu Asp Ile Met Ser Leu Ser Thr Gln Gly Asp Tyr Ile Asn
180 185 190
Leu Gln Thr Lys Ser Ser Ser Trp Ser Cys Cys
195 200
<210> 48
<211> 609
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 48
atgggcagcc gcgaccacct gttcaaggtg ctggtggtgg gcgacgccgc cgtgggcaag 60
accagcctgg tgcagcgcta cagccaggac agcttcagca agcactacaa gagcaccgtg 120
ggcgtggact tcgccctgaa ggtgctgcag tggagcgact acgagatcgt gcgcctgcag 180
ctgtgggaca tcgccggcca ggagcgcttc accagcatga cccgcctgta ctaccgcgac 240
gccagcgcct gcgtgatcat gttcgacgtg accaacgcca ccaccttcag caacagccag 300
cgctggaagc aggacctgga cagcaagctg accctgccca acggcgagcc cgtgccctgc 360
ctgctgctgg ccaacaagtg cgacctgagc ccctgggccg tgagccgcga ccagatcgac 420
cgcttcagca aggagaacgg cttcaccggc tggaccgaga ccagcgtgaa ggagaacaag 480
aacatcaacg aggccatgcg cgtgctgatc gagaagatga tgcgcaacag caccgaggac 540
atcatgagcc tgagcaccca gggcgactac atcaacctgc agaccaagag cagcagctgg 600
agctgctgc 609
<210> 49
<211> 796
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 49
Met Pro Thr Thr Gln Gln Ser Pro Gln Asp Glu Gln Glu Lys Leu Leu
1 5 10 15
Asp Glu Ala Ile Gln Ala Val Lys Val Gln Ser Phe Gln Met Lys Arg
20 25 30
Cys Leu Asp Lys Asn Lys Leu Met Asp Ala Leu Lys His Ala Ser Asn
35 40 45
Met Leu Gly Glu Leu Arg Thr Ser Met Leu Ser Pro Lys Ser Tyr Tyr
50 55 60
Glu Leu Tyr Met Ala Ile Ser Asp Glu Leu His Tyr Leu Glu Val Tyr
65 70 75 80
Leu Thr Asp Glu Phe Ala Lys Gly Arg Lys Val Ala Asp Leu Tyr Glu
85 90 95
Leu Val Gln Tyr Ala Gly Asn Ile Ile Pro Arg Leu Tyr Leu Leu Ile
100 105 110
Thr Val Gly Val Val Tyr Val Lys Ser Phe Pro Gln Ser Arg Lys Asp
115 120 125
Ile Leu Lys Asp Leu Val Glu Met Cys Arg Gly Val Gln His Pro Leu
130 135 140
Arg Gly Leu Phe Leu Arg Asn Tyr Leu Leu Gln Cys Thr Arg Asn Ile
145 150 155 160
Leu Pro Asp Glu Gly Glu Pro Thr Asp Glu Glu Thr Thr Gly Asp Ile
165 170 175
Ser Asp Ser Met Asp Phe Val Leu Leu Asn Phe Ala Glu Met Asn Lys
180 185 190
Leu Trp Val Arg Met Gln His Gln Gly His Ser Arg Asp Arg Glu Lys
195 200 205
Arg Glu Arg Glu Arg Gln Glu Leu Arg Ile Leu Val Gly Thr Asn Leu
210 215 220
Val Arg Leu Ser Gln Leu Glu Gly Val Asn Val Glu Arg Tyr Lys Gln
225 230 235 240
Ile Val Leu Thr Gly Ile Leu Glu Gln Val Val Asn Cys Arg Asp Ala
245 250 255
Leu Ala Gln Glu Tyr Leu Met Glu Cys Ile Ile Gln Val Phe Pro Asp
260 265 270
Glu Phe His Leu Gln Thr Leu Asn Pro Phe Leu Arg Ala Cys Ala Glu
275 280 285
Leu His Gln Asn Val Asn Val Lys Asn Ile Ile Ile Ala Leu Ile Asp
290 295 300
Arg Leu Ala Leu Phe Ala His Arg Glu Asp Gly Pro Gly Ile Pro Ala
305 310 315 320
Asp Ile Lys Leu Phe Asp Ile Phe Ser Gln Gln Val Ala Thr Val Ile
325 330 335
Gln Ser Arg Gln Asp Met Pro Ser Glu Asp Val Val Ser Leu Gln Val
340 345 350
Ser Leu Ile Asn Leu Ala Met Lys Cys Tyr Pro Asp Arg Val Asp Tyr
355 360 365
Val Asp Lys Val Leu Glu Thr Thr Val Glu Ile Phe Asn Lys Leu Asn
370 375 380
Leu Glu His Ile Ala Thr Ser Ser Ala Val Ser Lys Glu Leu Thr Arg
385 390 395 400
Leu Leu Lys Ile Pro Val Asp Thr Tyr Asn Asn Ile Leu Thr Val Leu
405 410 415
Lys Leu Lys His Phe His Pro Leu Phe Glu Tyr Phe Asp Tyr Glu Ser
420 425 430
Arg Lys Ser Met Ser Cys Tyr Val Leu Ser Asn Val Leu Asp Tyr Asn
435 440 445
Thr Glu Ile Val Ser Gln Asp Gln Val Asp Ser Ile Met Asn Leu Val
450 455 460
Ser Thr Leu Ile Gln Asp Gln Pro Asp Gln Pro Val Glu Asp Pro Asp
465 470 475 480
Pro Glu Asp Phe Ala Asp Glu Gln Ser Leu Val Gly Arg Phe Ile His
485 490 495
Leu Leu Arg Ser Glu Asp Pro Asp Gln Gln Tyr Leu Ile Leu Asn Thr
500 505 510
Ala Arg Lys His Phe Gly Ala Gly Gly Asn Gln Arg Ile Arg Phe Thr
515 520 525
Leu Pro Pro Leu Val Phe Ala Ala Tyr Gln Leu Ala Phe Arg Tyr Lys
530 535 540
Glu Asn Ser Lys Val Asp Asp Lys Trp Glu Lys Lys Cys Gln Lys Ile
545 550 555 560
Phe Ser Phe Ala His Gln Thr Ile Ser Ala Leu Ile Lys Ala Glu Leu
565 570 575
Ala Glu Leu Pro Leu Arg Leu Phe Leu Gln Gly Ala Leu Ala Ala Gly
580 585 590
Glu Ile Gly Phe Glu Asn His Glu Thr Val Ala Tyr Glu Phe Met Ser
595 600 605
Gln Ala Phe Ser Leu Tyr Glu Asp Glu Ile Ser Asp Ser Lys Ala Gln
610 615 620
Leu Ala Ala Ile Thr Leu Ile Ile Gly Thr Phe Glu Arg Met Lys Cys
625 630 635 640
Phe Ser Glu Glu Asn His Glu Pro Leu Arg Thr Gln Cys Ala Leu Ala
645 650 655
Ala Ser Lys Leu Leu Lys Lys Pro Asp Gln Gly Arg Ala Val Ser Thr
660 665 670
Cys Ala His Leu Phe Trp Ser Gly Arg Asn Thr Asp Lys Asn Gly Glu
675 680 685
Glu Leu His Gly Gly Lys Arg Val Met Glu Cys Leu Lys Lys Ala Leu
690 695 700
Lys Ile Ala Asn Gln Cys Met Asp Pro Ser Leu Gln Val Gln Leu Phe
705 710 715 720
Ile Glu Ile Leu Asn Arg Tyr Ile Tyr Phe Tyr Glu Lys Glu Asn Asp
725 730 735
Ala Val Thr Ile Gln Val Leu Asn Gln Leu Ile Gln Lys Ile Arg Glu
740 745 750
Asp Leu Pro Asn Leu Glu Ser Ser Glu Glu Thr Glu Gln Ile Asn Lys
755 760 765
His Phe His Asn Thr Leu Glu His Leu Arg Leu Arg Arg Glu Ser Pro
770 775 780
Glu Ser Glu Gly Pro Ile Tyr Glu Gly Leu Ile Leu
785 790 795
<210> 50
<211> 2388
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 50
atgcccacca cccagcagag cccccaggac gagcaggaga agctgctgga cgaggccatc 60
caggccgtga aggtgcagag cttccagatg aagcgctgcc tggacaagaa caagctgatg 120
gacgccctga agcacgccag caacatgctg ggcgagctgc gcaccagcat gctgagcccc 180
aagagctact acgagctgta catggccatc agcgacgagc tgcactacct ggaggtgtac 240
ctgaccgacg agttcgccaa gggccgcaag gtggccgacc tgtacgagct ggtgcagtac 300
gccggcaaca tcatcccccg cctgtacctg ctgatcaccg tgggcgtggt gtacgtgaag 360
agcttccccc agagccgcaa ggacatcctg aaggacctgg tggagatgtg ccgcggcgtg 420
cagcaccccc tgcgcggcct gttcctgcgc aactacctgc tgcagtgcac ccgcaacatc 480
ctgcccgacg agggcgagcc caccgacgag gagaccaccg gcgacatcag cgacagcatg 540
gacttcgtgc tgctgaactt cgccgagatg aacaagctgt gggtgcgcat gcagcaccag 600
ggccacagcc gcgaccgcga gaagcgcgag cgcgagcgcc aggagctgcg catcctggtg 660
ggcaccaacc tggtgcgcct gagccagctg gagggcgtga acgtggagcg ctacaagcag 720
atcgtgctga ccggcatcct ggagcaggtg gtgaactgcc gcgacgccct ggcccaggag 780
tacctgatgg agtgcatcat ccaggtgttc cccgacgagt tccacctgca gaccctgaac 840
cccttcctgc gcgcctgcgc cgagctgcac cagaacgtga acgtgaagaa catcatcatc 900
gccctgatcg accgcctggc cctgttcgcc caccgcgagg acggccccgg catccccgcc 960
gacatcaagc tgttcgacat cttcagccag caggtggcca ccgtgatcca gagccgccag 1020
gacatgccca gcgaggacgt ggtgagcctg caggtgagcc tgatcaacct ggccatgaag 1080
tgctaccccg accgcgtgga ctacgtggac aaggtgctgg agaccaccgt ggagatcttc 1140
aacaagctga acctggagca catcgccacc agcagcgccg tgagcaagga gctgacccgc 1200
ctgctgaaga tccccgtgga cacctacaac aacatcctga ccgtgctgaa gctgaagcac 1260
ttccaccccc tgttcgagta cttcgactac gagagccgca agagcatgag ctgctacgtg 1320
ctgagcaacg tgctggacta caacaccgag atcgtgagcc aggaccaggt ggacagcatc 1380
atgaacctgg tgagcaccct gatccaggac cagcccgacc agcccgtgga ggaccccgac 1440
cccgaggact tcgccgacga gcagagcctg gtgggccgct tcatccacct gctgcgcagc 1500
gaggaccccg accagcagta cctgatcctg aacaccgccc gcaagcactt cggcgccggc 1560
ggcaaccagc gcatccgctt caccctgccc cccctggtgt tcgccgccta ccagctggcc 1620
ttccgctaca aggagaacag caaggtggac gacaagtggg agaagaagtg ccagaagatc 1680
ttcagcttcg cccaccagac catcagcgcc ctgatcaagg ccgagctggc cgagctgccc 1740
ctgcgcctgt tcctgcaggg cgccctggcc gccggcgaga tcggcttcga gaaccacgag 1800
accgtggcct acgagttcat gagccaggcc ttcagcctgt acgaggacga gatcagcgac 1860
agcaaggccc agctggccgc catcaccctg atcatcggca ccttcgagcg catgaagtgc 1920
ttcagcgagg agaaccacga gcccctgcgc acccagtgcg ccctggccgc cagcaagctg 1980
ctgaagaagc ccgaccaggg ccgcgccgtg agcacctgcg cccacctgtt ctggagcggc 2040
cgcaacaccg acaagaacgg cgaggagctg cacggcggca agcgcgtgat ggagtgcctg 2100
aagaaggccc tgaagatcgc caaccagtgc atggacccca gcctgcaggt gcagctgttc 2160
atcgagatcc tgaaccgcta catctacttc tacgagaagg agaacgacgc cgtgaccatc 2220
caggtgctga accagctgat ccagaagatc cgcgaggacc tgcccaacct ggagagcagc 2280
gaggagaccg agcagatcaa caagcacttc cacaacaccc tggagcacct gcgcctgcgc 2340
cgcgagagcc ccgagagcga gggccccatc tacgagggcc tgatcctg 2388
<210> 51
<211> 11081
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 51
ttggccactc cctctctgcg cgctcgctcg ctcactgagg ccgggcgacc aaaggtcgcc 60
cgacgcccgg gctttgcccg ggcggcctca gtgagcgagc gagcgcgcag agagggagtg 120
gccaactcca tcactagggg ttcctgctag ctctgggtat ttaagcccga gtgagcacgc 180
agggtctcca ttttgaagcg ggaggttacg cgttcgtcga ctactagtgg gtaccagagc 240
tccctaggtt ctagaaccgg tgacgtctcc catggtgaag cttggatctg agggcggagt 300
tagggcggag ccaatcagcg tgcgccgttc cgaaagttgc cttttatggc tgggcggaga 360
atgggcggtg aacgccgatg attatataag gacgcgccgg gtgtggcaca gctagttccg 420
tcgcagccgg gatttgggtc gcggttcttg tttgtggatc cctgtgatcg tcacttggta 480
agtcactgac tgtctatgcc tgggaaaggg tgggcaggag atggggcagt gcaggaaaag 540
tggcactatg aaccctcctg gtggcgaggg gaggggggtg gtcctcgaac gccttgcaga 600
actggcctgg atacagagtg gaccggctgg ccccatctgg aagacttcga gatacactgt 660
tgtcttactg cgctcaacag tgtatctcga agtcttccaa atggtgccag ccatcgcagc 720
ggggtgcagg aaatgggggc agcccccctt tttggctatc cttccacgtg ttcttttttg 780
tatcttttgt gtttcctaga aaacatctca gtcaccaccg cagccctagg aatgcatcta 840
gacaattgta ctaaccttct tctctttcct ctcctgacag tccggaaagc caccatggaa 900
ttcagcagcc ccagcagaga ggaatgcccc aagcctctga gccgggtgtc aatcatggcc 960
ggatctctga caggactgct gctgcttcag gccgtgtctt gggcttctgg cgctagacct 1020
tgcatcccca agagcttcgg ctacagcagc gtcgtgtgcg tgtgcaatgc cacctactgc 1080
gacagcttcg accctcctac ctttcctgct ctgggcacct tcagcagata cgagagcacc 1140
agatccggca gacggatgga actgagcatg ggacccatcc aggccaatca cacaggcact 1200
ggcctgctgc tgacactgca gcctgagcag aaattccaga aagtgaaagg cttcggcgga 1260
gccatgacag atgccgccgc tctgaatatc ctggctctgt ctccaccagc tcagaacctg 1320
ctgctcaaga gctacttcag cgaggaaggc atcggctaca acatcatcag agtgcccatg 1380
gccagctgcg acttcagcat caggacctac acctacgccg acacacccga cgatttccag 1440
ctgcacaact tcagcctgcc tgaagaggac accaagctga agatccctct gatccacaga 1500
gccctgcagc tggcacaaag acccgtgtca ctgctggcct ctccatggac atctcccacc 1560
tggctgaaaa caaatggcgc cgtgaatggc aagggcagcc tgaaaggcca acctggcgac 1620
atctaccacc agacctgggc cagatacttc gtgaagttcc tggacgccta tgccgagcac 1680
aagctgcagt tttgggccgt gacagccgag aacgaacctt ctgctggact gctgagcggc 1740
tacccctttc agtgcctggg ctttacaccc gagcaccagc gggactttat cgcccgtgat 1800
ctgggaccca cactggccaa tagcacccac cataatgtgc ggctgctgat gctggacgac 1860
cagagactgc ttctgcccca ctgggctaaa gtggtgctga cagatcctga ggccgccaaa 1920
tacgtgcacg gaatcgccgt gcactggtat ctggactttc tggcccctgc caaggccaca 1980
ctgggagaga cacacagact gttccccaac accatgctgt tcgccagcga agcctgtgtg 2040
ggcagcaagt tttgggaaca gagcgtgcgg ctcggcagct gggatagagg catgcagtac 2100
agccacagca tcatcaccaa cctgctgtac cacgtcgtcg gctggaccga ctggaatctg 2160
gccctgaatc ctgaaggcgg ccctaactgg gtccgaaact tcgtggacag ccccatcatc 2220
gtggacatca ccaaggacac cttctacaag cagcccatgt tctaccacct gggacacttc 2280
agcaagttca tccccgaggg ctctcagcgc gttggactgg tggcttccca gaagaacgat 2340
ctggacgccg tggctctgat gcaccctgat ggatctgctg tggtggtggt cctgaaccgc 2400
agcagcaaag atgtgcccct gaccatcaag gatcccgccg tgggattcct ggaaacaatc 2460
agccctggct actccatcca cacctacctg tggcgtagac aggagggcag aggaagtctt 2520
ctgacatgcg gagacgtgga agagaatccc ggccctatgg agaagggccc cgtgcgcgcc 2580
cccgccgaga agccccgcgg cgcccgctgc agcaacggct tccccgagcg cgaccccccc 2640
cgccccggcc ccagccgccc cgccgagaag cccccccgcc ccgaggccaa gagcgcccag 2700
cccgccgacg gctggaaggg cgagcgcccc cgcagcgagg aggacaacga gctgaacctg 2760
cccaacctgg ccgccgccta cagcagcatc ctgagcagcc tgggcgagaa cccccagcgc 2820
cagggcctgc tgaagacccc ctggcgcgcc gccagcgcca tgcagttctt caccaagggc 2880
taccaggaga ccatcagcga cgtgctgaac gacgccatct tcgacgagga ccacgacgag 2940
atggtgatcg tgaaggacat cgacatgttc agcatgtgcg agcaccacct ggtgcccttc 3000
gtgggcaagg tgcacatcgg ctacctgccc aacaagcagg tgctgggcct gagcaagctg 3060
gcccgcatcg tggagatcta cagccgccgc ctgcaggtgc aggagcgcct gaccaagcag 3120
atcgccgtgg ccatcaccga ggccctgcgc cccgccggcg tgggcgtggt ggtggaggcc 3180
acccacatgt gcatggtgat gcgcggcgtg cagaagatga acagcaagac cgtgaccagc 3240
accatgctgg gcgtgttccg cgaggacccc aagacccgcg aggagttcct gaccctgatc 3300
cgcagctgac aattgttaat taagtttaaa ccctcgaggc cgcaagctta tcgataatca 3360
acctctggat tacaaaattt gtgaaagatt gactggtatt cttaactatg ttgctccttt 3420
tacgctatgt ggatacgctg ctttaatgcc tttgtatcat gctattgctt cccgtatggc 3480
tttcattttc tcctccttgt ataaatcctg gttgctgtct ctttatgagg agttgtggcc 3540
cgttgtcagg caacgtggcg tggtgtgcac tgtgtttgct gacgcaaccc ccactggttg 3600
gggcattgcc accacctgtc agctcctttc cgggactttc gctttccccc tccctattgc 3660
cacggcggaa ctcatcgccg cctgccttgc ccgctgctgg acaggggctc ggctgttggg 3720
cactgacaat tccgtggtgt tgtcggggaa atcatcgtcc tttccttggc tgctcgcctg 3780
tgttgccacc tggattctgc gcgggacgtc cttctgctac gtcccttcgg ccctcaatcc 3840
agcggacctt ccttcccgcg gcctgctgcc ggctctgcgg cctcttccgc gtcttcgcct 3900
tcgccctcag acgagtcgga tctccctttg ggccgcctcc ccgcatcgat accgtcgact 3960
agagctcgct gatcagcctc gactgtgcct tctagttgcc agccatctgt tgtttgcccc 4020
tcccccgtgc cttccttgac cctggaaggt gccactccca ctgtcctttc ctaataaaat 4080
gaggaaattg catcgcattg tctgagtagg tgtcattcta ttctgggggg tggggtgggg 4140
caggacagca agggggagga ttgggaagac aatagcaggc atgctgggga gagatccacg 4200
ataacaaaca gcttttttgg ggtgaacata ttgactgaat tccctgcagg ttggccactc 4260
cctctctgcg cgctcgctcg ctcactgagg ccgcccgggc aaagcccggg cgtcgggcga 4320
cctttggtcg cccggcctca gtgagcgagc gagcgcgcag agagggagtg gccaactcca 4380
tcactagggg ttcctgcggc cgctcgtacg gtctcgagga attcctgcag gataacttgc 4440
caacctcatt ctaaaatgta tatagaagcc caaaagacaa taacaaaaat attcttgtag 4500
aacaaaatgg gaaagaatgt tccactaaat atcaagattt agagcaaagc atgagatgtg 4560
tggggataga cagtgaggct gataaaatag agtagagctc agaaacagac ccattgatat 4620
atgtaagtga cctatgaaaa aaatatggca ttttacaatg ggaaaatgat ggtctttttc 4680
ttttttagaa aaacagggaa atatatttat atgtaaaaaa taaaagggaa cccatatgtc 4740
ataccataca cacaaaaaaa ttccagtgaa ttataagtct aaatggagaa ggcaaaactt 4800
taaatctttt agaaaataat atagaagcat gcagaccagc ctggccaaca tgatgaaacc 4860
ctctctacta ataataaaat cagtagaact actcaggact actttgagtg ggaagtcctt 4920
ttctatgaag acttctttgg ccaaaattag gctctaaatg caaggagata gtgcatcatg 4980
cctggctgca cttactgata aatgatgtta tcaccatctt taaccaaatg cacaggaaca 5040
agttatggta ctgatgtgct ggattgagaa ggagctctac ttccttgaca ggacacattt 5100
gtatcaactt aaaaaagcag atttttgcca gcagaactat tcattcagag gtaggaaact 5160
tagaatagat gatgtcactg attagcatgg cttccccatc tccacagctg cttcccaccc 5220
aggttgccca cagttgagtt tgtccagtgc tcagggctgc ccactctcag taagaagccc 5280
cacaccagcc cctctccaaa tatgttggct gttccttcca ttaaagtgac cccactttag 5340
agcagcaagt ggatttctgt ttcttacagt tcaggaagga ggagtcagct gtgagaacct 5400
ggagcctgag atgcttctaa gtcccactgc tactggggtc agggaagcca gactccagca 5460
tcagcagtca ggagcactaa gcccttgcca acatcctgtt tctcagagaa actgcttcca 5520
ttataatggt tgtccttttt taagctatca agccaaacaa ccagtgtcta ccattattct 5580
catcacctga agccaagggt tctagcaaaa gtcaagctgt cttgtaatgg ttgatgtgcc 5640
tccagcttct gtcttcagtc actccactct tagcctgctc tgaatcaact ctgaccacag 5700
ttccctggag cccctgccac ctgctgcccc tgccaccttc tccatctgca gtgctgtgca 5760
gccttctgca ctcttgcaga gctaataggt ggagacttga aggaagagga ggaaagtttc 5820
tcataatagc cttgctgcaa gctcaaatgg gaggtgggca ctgtgcccag gagccttgga 5880
gcaaaggctg tgcccaacct ctgactgcat ccaggtttgg tcttgacaga gataagaagc 5940
cctggctttt ggagccaaaa tctaggtcag acttaggcag gattctcaaa gtttatcagc 6000
agaacatgag gcagaagacc ctttctgctc cagcttcttc aggctcaacc ttcatcagaa 6060
tagatagaaa gagaggctgt gagggttctt aaaacagaag caaatctgac tcagagaata 6120
aacaacctcc tagtaaacta cagcttagac agagcatctg gtggtgagtg tgctcagtgt 6180
cctactcaac tgtctggtat cagccctcat gaggacttct cttctttccc tcatagacct 6240
ccatctctgt tttccttagc ctgcagaaat ctggatggct attcacagaa tgcctgtgct 6300
ttcagagttg cattttttct ctggtattct ggttcaagca tttgaaggta ggaaaggttc 6360
tccaagtgca agaaagccag ccctgagcct caactgcctg gctagtgtgg tcagtaggat 6420
gcaaaggctg ttgaatgcca caaggccaaa ctttaacctg tgtaccacaa gcctagcagc 6480
agaggcagct ctgctcactg gaactctctg tcttctttct cctgagcctt ttcttttcct 6540
gagttttcta gctctcctca accttacctc tgccctaccc aggacaaacc caagagccac 6600
tgtttctgtg atgtcctctc cagccctaat taggcatcat gacttcagcc tgaccttcca 6660
tgctcagaag cagtgctaat ccacttcaga tgagctgctc tatgcaacac aggcagagcc 6720
tacaaacctt tgcaccagag ccctccacat atcagtgttt gttcatactc acttcaacag 6780
caaatgtgac tgctgagatt aagattttac acaagatggt ctgtaatttc acagttagtt 6840
ttatcccatt aggtatgaaa gaattagcat aattcccctt aaacatgaat gaatcttaga 6900
ttttttaata aatagttttg gaagtaaaga cagagacatc aggagcacaa ggaatagcct 6960
gagaggacaa acagaacaag aaagagtctg gaaatacaca ggatgttctt ggcctcctca 7020
aagcaagtgc aagcagatag taccagcagc cccaggctat cagagcccag tgaagagaag 7080
taccatgaaa gccacagctc taaccaccct gttccagagt gacagacagt ccccaagaca 7140
agccagcctg agccagagag agaactgcaa gagaaagttt ctaatttagg ttctgttaga 7200
ttcagacaag tgcaggtcat cctctctcca cagctactca cctctccagc ctaacaaagc 7260
ctgcagtcca cactccaacc ctggtgtctc acctcctagc ctctcccaac atcctgctct 7320
ctgaccatct tctgcatctc tcatctcacc atctcccact gtctacagcc tactcttgca 7380
actaccatct cattttctga catcctgtct acatcttctg ccatactctg ccatctacca 7440
taccacctct taccatctac cacaccatct tttatctcca tccctctcag aagcctccaa 7500
gctgaatcct gctttatgtg ttcatctcag cccctgcatg gaaagctgac cccagaggca 7560
gaactattcc cagagagctt ggccaagaaa aacaaaacta ccagcctggc caggctcagg 7620
agtagtaagc tgcagtgtct gttgtgttct agcttcaaca gctgcaggag ttccactctc 7680
aaatgctcca catttctcac atcctcctga ttctggtcac tacccatctt caaagaacag 7740
aatatctcac atcagcatac tgtgaaggac tagtcatggg tgcagctgct cagagctgca 7800
aagtcattct ggatggtgga gagcttacaa acatttcatg atgctccccc cgctctgatg 7860
gctggagccc aatccctaca cagactcctg ctgtatgtgt tttcctttca ctctgagcca 7920
cagccagagg gcaggcattc agtctcctct tcaggctggg gctggggcac tgagaactca 7980
cccaacacct tgctctcact ccttctgcaa aacaagaaag agctttgtgc tgcagtagcc 8040
atgaagaatg aaaggaaggc tttaactaaa aaatgtcaga gattattttc aaccccttac 8100
tgtggatcac cagcaaggag gaaacacaac acagagacat tttttcccct caaattatca 8160
aaagaatcac tgcatttgtt aaagagagca actgaatcag gaagcagagt tttgaacata 8220
tcagaagtta ggaatctgca tcagagacaa atgcagtcat ggttgtttgc tgcataccag 8280
ccctaatcat tagaagcctc atggacttca aacatcattc cctctgacaa gatgctctag 8340
cctaactcca tgagataaaa taaatctgcc tttcagagcc aaagaagagt ccaccagctt 8400
cttctcagtg tgaacaagag ctccagtcag gttagtcagt ccagtgcagt agaggagacc 8460
agtctgcatc ctctaatttt caaaggcaag aagatttgtt taccctggac accaggcaca 8520
agtgaggtca cagagctctt agatatgcag tcctcatgag tgaggagact aaagcgcatg 8580
ccatcaagac ttcagtgtag agaaaacctc caaaaaagcc tcctcactac ttctggaata 8640
gctcagaggc cgaggcggcc tcggcctctg cataaataaa aaaaattagt cagccatggg 8700
gcggagaatg ggcggaactg ggcggagtta ggggcgggat gggcggagtt aggggcggga 8760
ctatggttgc tgactaattg agatgcatgc tttgcatact tctgcctgct ggggagcctg 8820
gggactttcc acacctggtt gctgactaat tgagatgcat gctttgcata cttctgcctg 8880
ctggggagcc tggggacttt ccacacccta actgacacac attccacagc tgcattaatg 8940
aatcggccaa cgcgcgggga gaggcggttt gcgtattggg cgctcttccg cttcctcgct 9000
cactgactcg ctgcgctcgg tcgttcggct gcggcgagcg gtatcagctc actcaaaggc 9060
ggtaatacgg ttatccacag aatcagggga taacgcagga aagaacatgt gagcaaaagg 9120
ccagcaaaag gccaggaacc gtaaaaaggc cgcgttgctg gcgtttttcc ataggctccg 9180
cccccctgac gagcatcaca aaaatcgacg ctcaagtcag aggtggcgaa acccgacagg 9240
actataaaga taccaggcgt ttccccctgg aagctccctc gtgcgctctc ctgttccgac 9300
cctgccgctt accggatacc tgtccgcctt tctcccttcg ggaagcgtgg cgctttctca 9360
tagctcacgc tgtaggtatc tcagttcggt gtaggtcgtt cgctccaagc tgggctgtgt 9420
gcacgaaccc cccgttcagc ccgaccgctg cgccttatcc ggtaactatc gtcttgagtc 9480
caacccggta agacacgact tatcgccact ggcagcagcc actggtaaca ggattagcag 9540
agcgaggtat gtaggcggtg ctacagagtt cttgaagtgg tggcctaact acggctacac 9600
tagaagaaca gtatttggta tctgcgctct gctgaagcca gttaccttcg gaaaaagagt 9660
tggtagctct tgatccggca aacaaaccac cgctggtagc ggtggttttt ttgtttgcaa 9720
gcagcagatt acgcgcagaa aaaaaggatc tcaagaagat cctttgatct tttctacggg 9780
gtctgacgct cagtggaacg aaaactcacg ttaagggatt ttggtcatga gattatcaaa 9840
aaggatcttc acctagatcc ttttaaatta aaaatgaagt tttaaatcaa tctaaagtat 9900
atatgagtaa acttggtctg acagttacca atgcttaatc agtgaggcac ctatctcagc 9960
gatctgtcta tttcgttcat ccatagttgc ctgactcctg caaaccacgt tgtgtctcaa 10020
aatctctgat gttacattgc acaagataaa aatatatcat catgaacaat aaaactgtct 10080
gcttacataa acagtaatac aaggggtgtt atgagccata ttcaacggga aacgtcttgc 10140
tcgaggccgc gattaaattc caacatggat gctgatttat atgggtataa atgggctcgc 10200
gataatgtcg ggcaatcagg tgcgacaatc tatcgattgt atgggaagcc cgatgcgcca 10260
gagttgtttc tgaaacatgg caaaggtagc gttgccaatg atgttacaga tgagatggtc 10320
agactaaact ggctgacgga atttatgcct cttccgacca tcaagcattt tatccgtact 10380
cctgatgatg catggttact caccactgcg atccccggga aaacagcatt ccaggtatta 10440
gaagaatatc ctgattcagg tgaaaatatt gttgatgcgc tggcagtgtt cctgcgccgg 10500
ttgcattcga ttcctgtttg taattgtcct tttaacagcg atcgcgtatt tcgtctcgct 10560
caggcgcaat cacgaatgaa taacggtttg gttgatgcga gtgattttga tgacgagcgt 10620
aatggctggc ctgttgaaca agtctggaaa gaaatgcata agcttttgcc attctcaccg 10680
gattcagtcg tcactcatgg tgatttctca cttgataacc ttatttttga cgaggggaaa 10740
ttaataggtt gtattgatgt tggacgagtc ggaatcgcag accgatacca ggatcttgcc 10800
atcctatgga actgcctcgg tgagttttct ccttcattac agaaacggct ttttcaaaaa 10860
tatggtattg ataatcctga tatgaataaa ttgcagtttc atttgatgct cgatgagttt 10920
ttctaagggc ggcctgccac catacccacg ccgaaacaag cgctcatgag cccgaagtgg 10980
cgagcccgat cttccccatc ggtgatgtcg gcgatatagg cgccagcaac cgcacctgtg 11040
gcgccggtga tgagggcgcg ccaagtcgac gtccggcagt c 11081
<210> 52
<211> 10940
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 52
ttggccactc cctctctgcg cgctcgctcg ctcactgagg ccgggcgacc aaaggtcgcc 60
cgacgcccgg gctttgcccg ggcggcctca gtgagcgagc gagcgcgcag agagggagtg 120
gccaactcca tcactagggg ttcctgctag ctctgggtat ttaagcccga gtgagcacgc 180
agggtctcca ttttgaagcg ggaggttacg cgttcgtcga ctactagtgg gtaccagagc 240
tccctaggtt ctagaaccgg tgacgtctcc catggtgaag cttggatctg agggcggagt 300
tagggcggag ccaatcagcg tgcgccgttc cgaaagttgc cttttatggc tgggcggaga 360
atgggcggtg aacgccgatg attatataag gacgcgccgg gtgtggcaca gctagttccg 420
tcgcagccgg gatttgggtc gcggttcttg tttgtggatc cctgtgatcg tcacttggta 480
agtcactgac tgtctatgcc tgggaaaggg tgggcaggag atggggcagt gcaggaaaag 540
tggcactatg aaccctcctg gtggcgaggg gaggggggtg gtcctcgaac gccttgcaga 600
actggcctgg atacagagtg gaccggctgg ccccatctgg aagacttcga gatacactgt 660
tgtcttactg cgctcaacag tgtatctcga agtcttccaa atggtgccag ccatcgcagc 720
ggggtgcagg aaatgggggc agcccccctt tttggctatc cttccacgtg ttcttttttg 780
tatcttttgt gtttcctaga aaacatctca gtcaccaccg cagccctagg aatgcatcta 840
gacaattgta ctaaccttct tctctttcct ctcctgacag tccggaaagc caccatgggc 900
agccgcgacc acctgttcaa ggtgctggtg gtgggcgacg ccgccgtggg caagaccagc 960
ctggtgcagc gctacagcca ggacagcttc agcaagcact acaagagcac cgtgggcgtg 1020
gacttcgccc tgaaggtgct gcagtggagc gactacgaga tcgtgcgcct gcagctgtgg 1080
gacatcgccg gccaggagcg cttcaccagc atgacccgcc tgtactaccg cgacgccagc 1140
gcctgcgtga tcatgttcga cgtgaccaac gccaccacct tcagcaacag ccagcgctgg 1200
aagcaggacc tggacagcaa gctgaccctg cccaacggcg agcccgtgcc ctgcctgctg 1260
ctggccaaca agtgcgacct gagcccctgg gccgtgagcc gcgaccagat cgaccgcttc 1320
agcaaggaga acggcttcac cggctggacc gagaccagcg tgaaggagaa caagaacatc 1380
aacgaggcca tgcgcgtgct gatcgagaag atgatgcgca acagcaccga ggacatcatg 1440
agcctgagca cccagggcga ctacatcaac ctgcagacca agagcagcag ctggagctgc 1500
tgcgagggca gaggaagtct tctgacatgc ggagacgtgg aagagaatcc cggccctatg 1560
gaattcagca gccccagcag agaggaatgc cccaagcctc tgagccgggt gtcaatcatg 1620
gccggatctc tgacaggact gctgctgctt caggccgtgt cttgggcttc tggcgctaga 1680
ccttgcatcc ccaagagctt cggctacagc agcgtcgtgt gcgtgtgcaa tgccacctac 1740
tgcgacagct tcgaccctcc tacctttcct gctctgggca ccttcagcag atacgagagc 1800
accagatccg gcagacggat ggaactgagc atgggaccca tccaggccaa tcacacaggc 1860
actggcctgc tgctgacact gcagcctgag cagaaattcc agaaagtgaa aggcttcggc 1920
ggagccatga cagatgccgc cgctctgaat atcctggctc tgtctccacc agctcagaac 1980
ctgctgctca agagctactt cagcgaggaa ggcatcggct acaacatcat cagagtgccc 2040
atggccagct gcgacttcag catcaggacc tacacctacg ccgacacacc cgacgatttc 2100
cagctgcaca acttcagcct gcctgaagag gacaccaagc tgaagatccc tctgatccac 2160
agagccctgc agctggcaca aagacccgtg tcactgctgg cctctccatg gacatctccc 2220
acctggctga aaacaaatgg cgccgtgaat ggcaagggca gcctgaaagg ccaacctggc 2280
gacatctacc accagacctg ggccagatac ttcgtgaagt tcctggacgc ctatgccgag 2340
cacaagctgc agttttgggc cgtgacagcc gagaacgaac cttctgctgg actgctgagc 2400
ggctacccct ttcagtgcct gggctttaca cccgagcacc agcgggactt tatcgcccgt 2460
gatctgggac ccacactggc caatagcacc caccataatg tgcggctgct gatgctggac 2520
gaccagagac tgcttctgcc ccactgggct aaagtggtgc tgacagatcc tgaggccgcc 2580
aaatacgtgc acggaatcgc cgtgcactgg tatctggact ttctggcccc tgccaaggcc 2640
acactgggag agacacacag actgttcccc aacaccatgc tgttcgccag cgaagcctgt 2700
gtgggcagca agttttggga acagagcgtg cggctcggca gctgggatag aggcatgcag 2760
tacagccaca gcatcatcac caacctgctg taccacgtcg tcggctggac cgactggaat 2820
ctggccctga atcctgaagg cggccctaac tgggtccgaa acttcgtgga cagccccatc 2880
atcgtggaca tcaccaagga caccttctac aagcagccca tgttctacca cctgggacac 2940
ttcagcaagt tcatccccga gggctctcag cgcgttggac tggtggcttc ccagaagaac 3000
gatctggacg ccgtggctct gatgcaccct gatggatctg ctgtggtggt ggtcctgaac 3060
cgcagcagca aagatgtgcc cctgaccatc aaggatcccg ccgtgggatt cctggaaaca 3120
atcagccctg gctactccat ccacacctac ctgtggcgta gacagtgaca attgttaatt 3180
aagtttaaac cctcgaggcc gcaagcttat cgataatcaa cctctggatt acaaaatttg 3240
tgaaagattg actggtattc ttaactatgt tgctcctttt acgctatgtg gatacgctgc 3300
tttaatgcct ttgtatcatg ctattgcttc ccgtatggct ttcattttct cctccttgta 3360
taaatcctgg ttgctgtctc tttatgagga gttgtggccc gttgtcaggc aacgtggcgt 3420
ggtgtgcact gtgtttgctg acgcaacccc cactggttgg ggcattgcca ccacctgtca 3480
gctcctttcc gggactttcg ctttccccct ccctattgcc acggcggaac tcatcgccgc 3540
ctgccttgcc cgctgctgga caggggctcg gctgttgggc actgacaatt ccgtggtgtt 3600
gtcggggaaa tcatcgtcct ttccttggct gctcgcctgt gttgccacct ggattctgcg 3660
cgggacgtcc ttctgctacg tcccttcggc cctcaatcca gcggaccttc cttcccgcgg 3720
cctgctgccg gctctgcggc ctcttccgcg tcttcgcctt cgccctcaga cgagtcggat 3780
ctccctttgg gccgcctccc cgcatcgata ccgtcgacta gagctcgctg atcagcctcg 3840
actgtgcctt ctagttgcca gccatctgtt gtttgcccct cccccgtgcc ttccttgacc 3900
ctggaaggtg ccactcccac tgtcctttcc taataaaatg aggaaattgc atcgcattgt 3960
ctgagtaggt gtcattctat tctggggggt ggggtggggc aggacagcaa gggggaggat 4020
tgggaagaca atagcaggca tgctggggag agatccacga taacaaacag cttttttggg 4080
gtgaacatat tgactgaatt ccctgcaggt tggccactcc ctctctgcgc gctcgctcgc 4140
tcactgaggc cgcccgggca aagcccgggc gtcgggcgac ctttggtcgc ccggcctcag 4200
tgagcgagcg agcgcgcaga gagggagtgg ccaactccat cactaggggt tcctgcggcc 4260
gctcgtacgg tctcgaggaa ttcctgcagg ataacttgcc aacctcattc taaaatgtat 4320
atagaagccc aaaagacaat aacaaaaata ttcttgtaga acaaaatggg aaagaatgtt 4380
ccactaaata tcaagattta gagcaaagca tgagatgtgt ggggatagac agtgaggctg 4440
ataaaataga gtagagctca gaaacagacc cattgatata tgtaagtgac ctatgaaaaa 4500
aatatggcat tttacaatgg gaaaatgatg gtctttttct tttttagaaa aacagggaaa 4560
tatatttata tgtaaaaaat aaaagggaac ccatatgtca taccatacac acaaaaaaat 4620
tccagtgaat tataagtcta aatggagaag gcaaaacttt aaatctttta gaaaataata 4680
tagaagcatg cagaccagcc tggccaacat gatgaaaccc tctctactaa taataaaatc 4740
agtagaacta ctcaggacta ctttgagtgg gaagtccttt tctatgaaga cttctttggc 4800
caaaattagg ctctaaatgc aaggagatag tgcatcatgc ctggctgcac ttactgataa 4860
atgatgttat caccatcttt aaccaaatgc acaggaacaa gttatggtac tgatgtgctg 4920
gattgagaag gagctctact tccttgacag gacacatttg tatcaactta aaaaagcaga 4980
tttttgccag cagaactatt cattcagagg taggaaactt agaatagatg atgtcactga 5040
ttagcatggc ttccccatct ccacagctgc ttcccaccca ggttgcccac agttgagttt 5100
gtccagtgct cagggctgcc cactctcagt aagaagcccc acaccagccc ctctccaaat 5160
atgttggctg ttccttccat taaagtgacc ccactttaga gcagcaagtg gatttctgtt 5220
tcttacagtt caggaaggag gagtcagctg tgagaacctg gagcctgaga tgcttctaag 5280
tcccactgct actggggtca gggaagccag actccagcat cagcagtcag gagcactaag 5340
cccttgccaa catcctgttt ctcagagaaa ctgcttccat tataatggtt gtcctttttt 5400
aagctatcaa gccaaacaac cagtgtctac cattattctc atcacctgaa gccaagggtt 5460
ctagcaaaag tcaagctgtc ttgtaatggt tgatgtgcct ccagcttctg tcttcagtca 5520
ctccactctt agcctgctct gaatcaactc tgaccacagt tccctggagc ccctgccacc 5580
tgctgcccct gccaccttct ccatctgcag tgctgtgcag ccttctgcac tcttgcagag 5640
ctaataggtg gagacttgaa ggaagaggag gaaagtttct cataatagcc ttgctgcaag 5700
ctcaaatggg aggtgggcac tgtgcccagg agccttggag caaaggctgt gcccaacctc 5760
tgactgcatc caggtttggt cttgacagag ataagaagcc ctggcttttg gagccaaaat 5820
ctaggtcaga cttaggcagg attctcaaag tttatcagca gaacatgagg cagaagaccc 5880
tttctgctcc agcttcttca ggctcaacct tcatcagaat agatagaaag agaggctgtg 5940
agggttctta aaacagaagc aaatctgact cagagaataa acaacctcct agtaaactac 6000
agcttagaca gagcatctgg tggtgagtgt gctcagtgtc ctactcaact gtctggtatc 6060
agccctcatg aggacttctc ttctttccct catagacctc catctctgtt ttccttagcc 6120
tgcagaaatc tggatggcta ttcacagaat gcctgtgctt tcagagttgc attttttctc 6180
tggtattctg gttcaagcat ttgaaggtag gaaaggttct ccaagtgcaa gaaagccagc 6240
cctgagcctc aactgcctgg ctagtgtggt cagtaggatg caaaggctgt tgaatgccac 6300
aaggccaaac tttaacctgt gtaccacaag cctagcagca gaggcagctc tgctcactgg 6360
aactctctgt cttctttctc ctgagccttt tcttttcctg agttttctag ctctcctcaa 6420
ccttacctct gccctaccca ggacaaaccc aagagccact gtttctgtga tgtcctctcc 6480
agccctaatt aggcatcatg acttcagcct gaccttccat gctcagaagc agtgctaatc 6540
cacttcagat gagctgctct atgcaacaca ggcagagcct acaaaccttt gcaccagagc 6600
cctccacata tcagtgtttg ttcatactca cttcaacagc aaatgtgact gctgagatta 6660
agattttaca caagatggtc tgtaatttca cagttagttt tatcccatta ggtatgaaag 6720
aattagcata attcccctta aacatgaatg aatcttagat tttttaataa atagttttgg 6780
aagtaaagac agagacatca ggagcacaag gaatagcctg agaggacaaa cagaacaaga 6840
aagagtctgg aaatacacag gatgttcttg gcctcctcaa agcaagtgca agcagatagt 6900
accagcagcc ccaggctatc agagcccagt gaagagaagt accatgaaag ccacagctct 6960
aaccaccctg ttccagagtg acagacagtc cccaagacaa gccagcctga gccagagaga 7020
gaactgcaag agaaagtttc taatttaggt tctgttagat tcagacaagt gcaggtcatc 7080
ctctctccac agctactcac ctctccagcc taacaaagcc tgcagtccac actccaaccc 7140
tggtgtctca cctcctagcc tctcccaaca tcctgctctc tgaccatctt ctgcatctct 7200
catctcacca tctcccactg tctacagcct actcttgcaa ctaccatctc attttctgac 7260
atcctgtcta catcttctgc catactctgc catctaccat accacctctt accatctacc 7320
acaccatctt ttatctccat ccctctcaga agcctccaag ctgaatcctg ctttatgtgt 7380
tcatctcagc ccctgcatgg aaagctgacc ccagaggcag aactattccc agagagcttg 7440
gccaagaaaa acaaaactac cagcctggcc aggctcagga gtagtaagct gcagtgtctg 7500
ttgtgttcta gcttcaacag ctgcaggagt tccactctca aatgctccac atttctcaca 7560
tcctcctgat tctggtcact acccatcttc aaagaacaga atatctcaca tcagcatact 7620
gtgaaggact agtcatgggt gcagctgctc agagctgcaa agtcattctg gatggtggag 7680
agcttacaaa catttcatga tgctcccccc gctctgatgg ctggagccca atccctacac 7740
agactcctgc tgtatgtgtt ttcctttcac tctgagccac agccagaggg caggcattca 7800
gtctcctctt caggctgggg ctggggcact gagaactcac ccaacacctt gctctcactc 7860
cttctgcaaa acaagaaaga gctttgtgct gcagtagcca tgaagaatga aaggaaggct 7920
ttaactaaaa aatgtcagag attattttca accccttact gtggatcacc agcaaggagg 7980
aaacacaaca cagagacatt ttttcccctc aaattatcaa aagaatcact gcatttgtta 8040
aagagagcaa ctgaatcagg aagcagagtt ttgaacatat cagaagttag gaatctgcat 8100
cagagacaaa tgcagtcatg gttgtttgct gcataccagc cctaatcatt agaagcctca 8160
tggacttcaa acatcattcc ctctgacaag atgctctagc ctaactccat gagataaaat 8220
aaatctgcct ttcagagcca aagaagagtc caccagcttc ttctcagtgt gaacaagagc 8280
tccagtcagg ttagtcagtc cagtgcagta gaggagacca gtctgcatcc tctaattttc 8340
aaaggcaaga agatttgttt accctggaca ccaggcacaa gtgaggtcac agagctctta 8400
gatatgcagt cctcatgagt gaggagacta aagcgcatgc catcaagact tcagtgtaga 8460
gaaaacctcc aaaaaagcct cctcactact tctggaatag ctcagaggcc gaggcggcct 8520
cggcctctgc ataaataaaa aaaattagtc agccatgggg cggagaatgg gcggaactgg 8580
gcggagttag gggcgggatg ggcggagtta ggggcgggac tatggttgct gactaattga 8640
gatgcatgct ttgcatactt ctgcctgctg gggagcctgg ggactttcca cacctggttg 8700
ctgactaatt gagatgcatg ctttgcatac ttctgcctgc tggggagcct ggggactttc 8760
cacaccctaa ctgacacaca ttccacagct gcattaatga atcggccaac gcgcggggag 8820
aggcggtttg cgtattgggc gctcttccgc ttcctcgctc actgactcgc tgcgctcggt 8880
cgttcggctg cggcgagcgg tatcagctca ctcaaaggcg gtaatacggt tatccacaga 8940
atcaggggat aacgcaggaa agaacatgtg agcaaaaggc cagcaaaagg ccaggaaccg 9000
taaaaaggcc gcgttgctgg cgtttttcca taggctccgc ccccctgacg agcatcacaa 9060
aaatcgacgc tcaagtcaga ggtggcgaaa cccgacagga ctataaagat accaggcgtt 9120
tccccctgga agctccctcg tgcgctctcc tgttccgacc ctgccgctta ccggatacct 9180
gtccgccttt ctcccttcgg gaagcgtggc gctttctcat agctcacgct gtaggtatct 9240
cagttcggtg taggtcgttc gctccaagct gggctgtgtg cacgaacccc ccgttcagcc 9300
cgaccgctgc gccttatccg gtaactatcg tcttgagtcc aacccggtaa gacacgactt 9360
atcgccactg gcagcagcca ctggtaacag gattagcaga gcgaggtatg taggcggtgc 9420
tacagagttc ttgaagtggt ggcctaacta cggctacact agaagaacag tatttggtat 9480
ctgcgctctg ctgaagccag ttaccttcgg aaaaagagtt ggtagctctt gatccggcaa 9540
acaaaccacc gctggtagcg gtggtttttt tgtttgcaag cagcagatta cgcgcagaaa 9600
aaaaggatct caagaagatc ctttgatctt ttctacgggg tctgacgctc agtggaacga 9660
aaactcacgt taagggattt tggtcatgag attatcaaaa aggatcttca cctagatcct 9720
tttaaattaa aaatgaagtt ttaaatcaat ctaaagtata tatgagtaaa cttggtctga 9780
cagttaccaa tgcttaatca gtgaggcacc tatctcagcg atctgtctat ttcgttcatc 9840
catagttgcc tgactcctgc aaaccacgtt gtgtctcaaa atctctgatg ttacattgca 9900
caagataaaa atatatcatc atgaacaata aaactgtctg cttacataaa cagtaataca 9960
aggggtgtta tgagccatat tcaacgggaa acgtcttgct cgaggccgcg attaaattcc 10020
aacatggatg ctgatttata tgggtataaa tgggctcgcg ataatgtcgg gcaatcaggt 10080
gcgacaatct atcgattgta tgggaagccc gatgcgccag agttgtttct gaaacatggc 10140
aaaggtagcg ttgccaatga tgttacagat gagatggtca gactaaactg gctgacggaa 10200
tttatgcctc ttccgaccat caagcatttt atccgtactc ctgatgatgc atggttactc 10260
accactgcga tccccgggaa aacagcattc caggtattag aagaatatcc tgattcaggt 10320
gaaaatattg ttgatgcgct ggcagtgttc ctgcgccggt tgcattcgat tcctgtttgt 10380
aattgtcctt ttaacagcga tcgcgtattt cgtctcgctc aggcgcaatc acgaatgaat 10440
aacggtttgg ttgatgcgag tgattttgat gacgagcgta atggctggcc tgttgaacaa 10500
gtctggaaag aaatgcataa gcttttgcca ttctcaccgg attcagtcgt cactcatggt 10560
gatttctcac ttgataacct tatttttgac gaggggaaat taataggttg tattgatgtt 10620
ggacgagtcg gaatcgcaga ccgataccag gatcttgcca tcctatggaa ctgcctcggt 10680
gagttttctc cttcattaca gaaacggctt tttcaaaaat atggtattga taatcctgat 10740
atgaataaat tgcagtttca tttgatgctc gatgagtttt tctaagggcg gcctgccacc 10800
atacccacgc cgaaacaagc gctcatgagc ccgaagtggc gagcccgatc ttccccatcg 10860
gtgatgtcgg cgatataggc gccagcaacc gcacctgtgg cgccggtgat gagggcgcgc 10920
caagtcgacg tccggcagtc 10940
<210> 53
<211> 10934
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 53
ttggccactc cctctctgcg cgctcgctcg ctcactgagg ccgggcgacc aaaggtcgcc 60
cgacgcccgg gctttgcccg ggcggcctca gtgagcgagc gagcgcgcag agagggagtg 120
gccaactcca tcactagggg ttcctgctag ctctgggtat ttaagcccga gtgagcacgc 180
agggtctcca ttttgaagcg ggaggttacg cgttcgtcga ctactagtgg gtaccagagc 240
tccctaggtt ctagaaccgg tgacgtctcc catggtgaag cttggatctg agggcggagt 300
tagggcggag ccaatcagcg tgcgccgttc cgaaagttgc cttttatggc tgggcggaga 360
atgggcggtg aacgccgatg attatataag gacgcgccgg gtgtggcaca gctagttccg 420
tcgcagccgg gatttgggtc gcggttcttg tttgtggatc cctgtgatcg tcacttggta 480
agtcactgac tgtctatgcc tgggaaaggg tgggcaggag atggggcagt gcaggaaaag 540
tggcactatg aaccctcctg gtggcgaggg gaggggggtg gtcctcgaac gccttgcaga 600
actggcctgg atacagagtg gaccggctgg ccccatctgg aagacttcga gatacactgt 660
tgtcttactg cgctcaacag tgtatctcga agtcttccaa atggtgccag ccatcgcagc 720
ggggtgcagg aaatgggggc agcccccctt tttggctatc cttccacgtg ttcttttttg 780
tatcttttgt gtttcctaga aaacatctca gtcaccaccg cagccctagg aatgcatcta 840
gacaattgta ctaaccttct tctctttcct ctcctgacag tccggaaagc caccatggaa 900
ttcagcagcc ccagcagaga ggaatgcccc aagcctctga gccgggtgtc aatcatggcc 960
ggatctctga caggactgct gctgcttcag gccgtgtctt gggcttctgg cgctagacct 1020
tgcatcccca agagcttcgg ctacagcagc gtcgtgtgcg tgtgcaatgc cacctactgc 1080
gacagcttcg accctcctac ctttcctgct ctgggcacct tcagcagata cgagagcacc 1140
agatccggca gacggatgga actgagcatg ggacccatcc aggccaatca cacaggcact 1200
ggcctgctgc tgacactgca gcctgagcag aaattccaga aagtgaaagg cttcggcgga 1260
gccatgacag atgccgccgc tctgaatatc ctggctctgt ctccaccagc tcagaacctg 1320
ctgctcaaga gctacttcag cgaggaaggc atcggctaca acatcatcag agtgcccatg 1380
gccagctgcg acttcagcat caggacctac acctacgccg acacacccga cgatttccag 1440
ctgcacaact tcagcctgcc tgaagaggac accaagctga agatccctct gatccacaga 1500
gccctgcagc tggcacaaag acccgtgtca ctgctggcct ctccatggac atctcccacc 1560
tggctgaaaa caaatggcgc cgtgaatggc aagggcagcc tgaaaggcca acctggcgac 1620
atctaccacc agacctgggc cagatacttc gtgaagttcc tggacgccta tgccgagcac 1680
aagctgcagt tttgggccgt gacagccgag aacgaacctt ctgctggact gctgagcggc 1740
tacccctttc agtgcctggg ctttacaccc gagcaccagc gggactttat cgcccgtgat 1800
ctgggaccca cactggccaa tagcacccac cataatgtgc ggctgctgat gctggacgac 1860
cagagactgc ttctgcccca ctgggctaaa gtggtgctga cagatcctga ggccgccaaa 1920
tacgtgcacg gaatcgccgt gcactggtat ctggactttc tggcccctgc caaggccaca 1980
ctgggagaga cacacagact gttccccaac accatgctgt tcgccagcga agcctgtgtg 2040
ggcagcaagt tttgggaaca gagcgtgcgg ctcggcagct gggatagagg catgcagtac 2100
agccacagca tcatcaccaa cctgctgtac cacgtcgtcg gctggaccga ctggaatctg 2160
gccctgaatc ctgaaggcgg ccctaactgg gtccgaaact tcgtggacag ccccatcatc 2220
gtggacatca ccaaggacac cttctacaag cagcccatgt tctaccacct gggacacttc 2280
agcaagttca tccccgaggg ctctcagcgc gttggactgg tggcttccca gaagaacgat 2340
ctggacgccg tggctctgat gcaccctgat ggatctgctg tggtggtggt cctgaaccgc 2400
agcagcaaag atgtgcccct gaccatcaag gatcccgccg tgggattcct ggaaacaatc 2460
agccctggct actccatcca cacctacctg tggcgtagac agtgattgtg gccgaaccgc 2520
cgaactcaga ggccggcccc agaaaacccg agcgagtagg gggcggcgcg caggagggag 2580
gagaactggg ggcgcgggag gctggtgggt gtggggggtg gagatgtaga agatgtgacg 2640
ccgcggcccg gcgggtgcca gattagcgga cgcggtgccc gcggttgcaa cgggatcccg 2700
ggcgctgcag cttgggaggc ggctctcccc aggcggcgtc cgcggagaca cccatccgtg 2760
aaccccaggt cccgggccgc cggctcgccg cgcaccaggg gccggcggac agaagagcgg 2820
ccgagcggct cgaggctggg ggaccgcggg cgcggccgcg cgctgccggg cgggaggctg 2880
gggggccggg gccggggccg tgccccggag cgggtcggag gccggggccg gggccggggg 2940
acggcggctc cccgcgcggc tccagcggct cggggatccc ggccgggccc cgcagggacc 3000
atgatggaga agggccccgt gcgcgccccc gccgagaagc cccgcggcgc ccgctgcagc 3060
aacggcttcc ccgagcgcga ccccccccgc cccggcccca gccgccccgc cgagaagccc 3120
ccccgccccg aggccaagag cgcccagccc gccgacggct ggaagggcga gcgcccccgc 3180
agcgaggagg acaacgagct gaacctgccc aacctggccg ccgcctacag cagcatcctg 3240
agcagcctgg gcgagaaccc ccagcgccag ggcctgctga agaccccctg gcgcgccgcc 3300
agcgccatgc agttcttcac caagggctac caggagacca tcagcgacgt gctgaacgac 3360
gccatcttcg acgaggacca cgacgagatg gtgatcgtga aggacatcga catgttcagc 3420
atgtgcgagc accacctggt gcccttcgtg ggcaaggtgc acatcggcta cctgcccaac 3480
aagcaggtgc tgggcctgag caagctggcc cgcatcgtgg agatctacag ccgccgcctg 3540
caggtgcagg agcgcctgac caagcagatc gccgtggcca tcaccgaggc cctgcgcccc 3600
gccggcgtgg gcgtggtggt ggaggccacc cacatgtgca tggtgatgcg cggcgtgcag 3660
aagatgaaca gcaagaccgt gaccagcacc atgctgggcg tgttccgcga ggaccccaag 3720
acccgcgagg agttcctgac cctgatccgc agctgacaat tgttaattaa gtttaaaccc 3780
tcgaggccgc aagccgcatc gataccgtcg actagagctc gctgatcagc ctcgactgtg 3840
ccttctagtt gccagccatc tgttgtttgc ccctcccccg tgccttcctt gaccctggaa 3900
ggtgccactc ccactgtcct ttcctaataa aatgaggaaa ttgcatcgca ttgtctgagt 3960
aggtgtcatt ctattctggg gggtggggtg gggcaggaca gcaaggggga ggattgggaa 4020
gacaatagca ggcatgctgg ggagagatcc acgataacaa acagcttttt tggggtgaac 4080
atattgactg aattccctgc aggttggcca ctccctctct gcgcgctcgc tcgctcactg 4140
aggccgcccg ggcaaagccc gggcgtcggg cgacctttgg tcgcccggcc tcagtgagcg 4200
agcgagcgcg cagagaggga gtggccaact ccatcactag gggttcctgc ggccgctcgt 4260
acggtctcga ggaattcctg caggataact tgccaacctc attctaaaat gtatatagaa 4320
gcccaaaaga caataacaaa aatattcttg tagaacaaaa tgggaaagaa tgttccacta 4380
aatatcaaga tttagagcaa agcatgagat gtgtggggat agacagtgag gctgataaaa 4440
tagagtagag ctcagaaaca gacccattga tatatgtaag tgacctatga aaaaaatatg 4500
gcattttaca atgggaaaat gatggtcttt ttctttttta gaaaaacagg gaaatatatt 4560
tatatgtaaa aaataaaagg gaacccatat gtcataccat acacacaaaa aaattccagt 4620
gaattataag tctaaatgga gaaggcaaaa ctttaaatct tttagaaaat aatatagaag 4680
catgcagacc agcctggcca acatgatgaa accctctcta ctaataataa aatcagtaga 4740
actactcagg actactttga gtgggaagtc cttttctatg aagacttctt tggccaaaat 4800
taggctctaa atgcaaggag atagtgcatc atgcctggct gcacttactg ataaatgatg 4860
ttatcaccat ctttaaccaa atgcacagga acaagttatg gtactgatgt gctggattga 4920
gaaggagctc tacttccttg acaggacaca tttgtatcaa cttaaaaaag cagatttttg 4980
ccagcagaac tattcattca gaggtaggaa acttagaata gatgatgtca ctgattagca 5040
tggcttcccc atctccacag ctgcttccca cccaggttgc ccacagttga gtttgtccag 5100
tgctcagggc tgcccactct cagtaagaag ccccacacca gcccctctcc aaatatgttg 5160
gctgttcctt ccattaaagt gaccccactt tagagcagca agtggatttc tgtttcttac 5220
agttcaggaa ggaggagtca gctgtgagaa cctggagcct gagatgcttc taagtcccac 5280
tgctactggg gtcagggaag ccagactcca gcatcagcag tcaggagcac taagcccttg 5340
ccaacatcct gtttctcaga gaaactgctt ccattataat ggttgtcctt ttttaagcta 5400
tcaagccaaa caaccagtgt ctaccattat tctcatcacc tgaagccaag ggttctagca 5460
aaagtcaagc tgtcttgtaa tggttgatgt gcctccagct tctgtcttca gtcactccac 5520
tcttagcctg ctctgaatca actctgacca cagttccctg gagcccctgc cacctgctgc 5580
ccctgccacc ttctccatct gcagtgctgt gcagccttct gcactcttgc agagctaata 5640
ggtggagact tgaaggaaga ggaggaaagt ttctcataat agccttgctg caagctcaaa 5700
tgggaggtgg gcactgtgcc caggagcctt ggagcaaagg ctgtgcccaa cctctgactg 5760
catccaggtt tggtcttgac agagataaga agccctggct tttggagcca aaatctaggt 5820
cagacttagg caggattctc aaagtttatc agcagaacat gaggcagaag accctttctg 5880
ctccagcttc ttcaggctca accttcatca gaatagatag aaagagaggc tgtgagggtt 5940
cttaaaacag aagcaaatct gactcagaga ataaacaacc tcctagtaaa ctacagctta 6000
gacagagcat ctggtggtga gtgtgctcag tgtcctactc aactgtctgg tatcagccct 6060
catgaggact tctcttcttt ccctcataga cctccatctc tgttttcctt agcctgcaga 6120
aatctggatg gctattcaca gaatgcctgt gctttcagag ttgcattttt tctctggtat 6180
tctggttcaa gcatttgaag gtaggaaagg ttctccaagt gcaagaaagc cagccctgag 6240
cctcaactgc ctggctagtg tggtcagtag gatgcaaagg ctgttgaatg ccacaaggcc 6300
aaactttaac ctgtgtacca caagcctagc agcagaggca gctctgctca ctggaactct 6360
ctgtcttctt tctcctgagc cttttctttt cctgagtttt ctagctctcc tcaaccttac 6420
ctctgcccta cccaggacaa acccaagagc cactgtttct gtgatgtcct ctccagccct 6480
aattaggcat catgacttca gcctgacctt ccatgctcag aagcagtgct aatccacttc 6540
agatgagctg ctctatgcaa cacaggcaga gcctacaaac ctttgcacca gagccctcca 6600
catatcagtg tttgttcata ctcacttcaa cagcaaatgt gactgctgag attaagattt 6660
tacacaagat ggtctgtaat ttcacagtta gttttatccc attaggtatg aaagaattag 6720
cataattccc cttaaacatg aatgaatctt agatttttta ataaatagtt ttggaagtaa 6780
agacagagac atcaggagca caaggaatag cctgagagga caaacagaac aagaaagagt 6840
ctggaaatac acaggatgtt cttggcctcc tcaaagcaag tgcaagcaga tagtaccagc 6900
agccccaggc tatcagagcc cagtgaagag aagtaccatg aaagccacag ctctaaccac 6960
cctgttccag agtgacagac agtccccaag acaagccagc ctgagccaga gagagaactg 7020
caagagaaag tttctaattt aggttctgtt agattcagac aagtgcaggt catcctctct 7080
ccacagctac tcacctctcc agcctaacaa agcctgcagt ccacactcca accctggtgt 7140
ctcacctcct agcctctccc aacatcctgc tctctgacca tcttctgcat ctctcatctc 7200
accatctccc actgtctaca gcctactctt gcaactacca tctcattttc tgacatcctg 7260
tctacatctt ctgccatact ctgccatcta ccataccacc tcttaccatc taccacacca 7320
tcttttatct ccatccctct cagaagcctc caagctgaat cctgctttat gtgttcatct 7380
cagcccctgc atggaaagct gaccccagag gcagaactat tcccagagag cttggccaag 7440
aaaaacaaaa ctaccagcct ggccaggctc aggagtagta agctgcagtg tctgttgtgt 7500
tctagcttca acagctgcag gagttccact ctcaaatgct ccacatttct cacatcctcc 7560
tgattctggt cactacccat cttcaaagaa cagaatatct cacatcagca tactgtgaag 7620
gactagtcat gggtgcagct gctcagagct gcaaagtcat tctggatggt ggagagctta 7680
caaacatttc atgatgctcc ccccgctctg atggctggag cccaatccct acacagactc 7740
ctgctgtatg tgttttcctt tcactctgag ccacagccag agggcaggca ttcagtctcc 7800
tcttcaggct ggggctgggg cactgagaac tcacccaaca ccttgctctc actccttctg 7860
caaaacaaga aagagctttg tgctgcagta gccatgaaga atgaaaggaa ggctttaact 7920
aaaaaatgtc agagattatt ttcaacccct tactgtggat caccagcaag gaggaaacac 7980
aacacagaga cattttttcc cctcaaatta tcaaaagaat cactgcattt gttaaagaga 8040
gcaactgaat caggaagcag agttttgaac atatcagaag ttaggaatct gcatcagaga 8100
caaatgcagt catggttgtt tgctgcatac cagccctaat cattagaagc ctcatggact 8160
tcaaacatca ttccctctga caagatgctc tagcctaact ccatgagata aaataaatct 8220
gcctttcaga gccaaagaag agtccaccag cttcttctca gtgtgaacaa gagctccagt 8280
caggttagtc agtccagtgc agtagaggag accagtctgc atcctctaat tttcaaaggc 8340
aagaagattt gtttaccctg gacaccaggc acaagtgagg tcacagagct cttagatatg 8400
cagtcctcat gagtgaggag actaaagcgc atgccatcaa gacttcagtg tagagaaaac 8460
ctccaaaaaa gcctcctcac tacttctgga atagctcaga ggccgaggcg gcctcggcct 8520
ctgcataaat aaaaaaaatt agtcagccat ggggcggaga atgggcggaa ctgggcggag 8580
ttaggggcgg gatgggcgga gttaggggcg ggactatggt tgctgactaa ttgagatgca 8640
tgctttgcat acttctgcct gctggggagc ctggggactt tccacacctg gttgctgact 8700
aattgagatg catgctttgc atacttctgc ctgctgggga gcctggggac tttccacacc 8760
ctaactgaca cacattccac agctgcatta atgaatcggc caacgcgcgg ggagaggcgg 8820
tttgcgtatt gggcgctctt ccgcttcctc gctcactgac tcgctgcgct cggtcgttcg 8880
gctgcggcga gcggtatcag ctcactcaaa ggcggtaata cggttatcca cagaatcagg 8940
ggataacgca ggaaagaaca tgtgagcaaa aggccagcaa aaggccagga accgtaaaaa 9000
ggccgcgttg ctggcgtttt tccataggct ccgcccccct gacgagcatc acaaaaatcg 9060
acgctcaagt cagaggtggc gaaacccgac aggactataa agataccagg cgtttccccc 9120
tggaagctcc ctcgtgcgct ctcctgttcc gaccctgccg cttaccggat acctgtccgc 9180
ctttctccct tcgggaagcg tggcgctttc tcatagctca cgctgtaggt atctcagttc 9240
ggtgtaggtc gttcgctcca agctgggctg tgtgcacgaa ccccccgttc agcccgaccg 9300
ctgcgcctta tccggtaact atcgtcttga gtccaacccg gtaagacacg acttatcgcc 9360
actggcagca gccactggta acaggattag cagagcgagg tatgtaggcg gtgctacaga 9420
gttcttgaag tggtggccta actacggcta cactagaaga acagtatttg gtatctgcgc 9480
tctgctgaag ccagttacct tcggaaaaag agttggtagc tcttgatccg gcaaacaaac 9540
caccgctggt agcggtggtt tttttgtttg caagcagcag attacgcgca gaaaaaaagg 9600
atctcaagaa gatcctttga tcttttctac ggggtctgac gctcagtgga acgaaaactc 9660
acgttaaggg attttggtca tgagattatc aaaaaggatc ttcacctaga tccttttaaa 9720
ttaaaaatga agttttaaat caatctaaag tatatatgag taaacttggt ctgacagtta 9780
ccaatgctta atcagtgagg cacctatctc agcgatctgt ctatttcgtt catccatagt 9840
tgcctgactc ctgcaaacca cgttgtgtct caaaatctct gatgttacat tgcacaagat 9900
aaaaatatat catcatgaac aataaaactg tctgcttaca taaacagtaa tacaaggggt 9960
gttatgagcc atattcaacg ggaaacgtct tgctcgaggc cgcgattaaa ttccaacatg 10020
gatgctgatt tatatgggta taaatgggct cgcgataatg tcgggcaatc aggtgcgaca 10080
atctatcgat tgtatgggaa gcccgatgcg ccagagttgt ttctgaaaca tggcaaaggt 10140
agcgttgcca atgatgttac agatgagatg gtcagactaa actggctgac ggaatttatg 10200
cctcttccga ccatcaagca ttttatccgt actcctgatg atgcatggtt actcaccact 10260
gcgatccccg ggaaaacagc attccaggta ttagaagaat atcctgattc aggtgaaaat 10320
attgttgatg cgctggcagt gttcctgcgc cggttgcatt cgattcctgt ttgtaattgt 10380
ccttttaaca gcgatcgcgt atttcgtctc gctcaggcgc aatcacgaat gaataacggt 10440
ttggttgatg cgagtgattt tgatgacgag cgtaatggct ggcctgttga acaagtctgg 10500
aaagaaatgc ataagctttt gccattctca ccggattcag tcgtcactca tggtgatttc 10560
tcacttgata accttatttt tgacgagggg aaattaatag gttgtattga tgttggacga 10620
gtcggaatcg cagaccgata ccaggatctt gccatcctat ggaactgcct cggtgagttt 10680
tctccttcat tacagaaacg gctttttcaa aaatatggta ttgataatcc tgatatgaat 10740
aaattgcagt ttcatttgat gctcgatgag tttttctaag ggcggcctgc caccataccc 10800
acgccgaaac aagcgctcat gagcccgaag tggcgagccc gatcttcccc atcggtgatg 10860
tcggcgatat aggcgccagc aaccgcacct gtggcgccgg tgatgagggc gcgccaagtc 10920
gacgtccggc agtc 10934
<210> 54
<211> 11138
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 54
ttggccactc cctctctgcg cgctcgctcg ctcactgagg ccgggcgacc aaaggtcgcc 60
cgacgcccgg gctttgcccg ggcggcctca gtgagcgagc gagcgcgcag agagggagtg 120
gccaactcca tcactagggg ttcctgctag ctctgggtat ttaagcccga gtgagcacgc 180
agggtctcca ttttgaagcg ggaggttacg cgttcgtcga ctactagtgg gtaccagagc 240
tccctaggtt ctagaaccgg tgacgtctcc catggtgaag cttggatctg agtaagtcac 300
tgactgtcta tgcctgggaa agggtgggca ggagatgggg cagtgcagga aaagtggcac 360
tatgaaccct cctggtggcg aggggagggg ggtggtcctc gaacgccttg cagaactggc 420
ctggatacag agtggaccgg ctggccccat ctggaagact tcgagataca ctgttgtctt 480
actgcgctca acagtgtatc tcgaagtctt ccaaatggtg ccagccatcg cagcggggtg 540
caggaaatgg gggcagcccc cctttttggc tatccttcca cgtgttcttt tttgtatctt 600
ttgtgtttcc tagaaaacat ctcagtcacc accgtgatat cacaaggtcc cagggctggg 660
gtcagaaatt ctctcccgag ggaatgaagc cacaggagcc aagagcagga ggaccaaggc 720
cctggcgaag gccgtggcct cgttcaagta aaagatccta gtacagtgca ggtcccaatg 780
tgtactagga tcttttactt gaacggggac gccggcatcc gggctcagga cccccctctc 840
tgccagaggc accaacacca gagttcacaa atcagtctcc tgccctttgc atgtagcaaa 900
gcagccctag gaatgcatct agacaattgt actaaccttc ttctctttcc tctcctgaca 960
gtccggaaag ccaccatgcc caccacccag cagagccccc aggacgagca ggagaagctg 1020
ctggacgagg ccatccaggc cgtgaaggtg cagagcttcc agatgaagcg ctgcctggac 1080
aagaacaagc tgatggacgc cctgaagcac gccagcaaca tgctgggcga gctgcgcacc 1140
agcatgctga gccccaagag ctactacgag ctgtacatgg ccatcagcga cgagctgcac 1200
tacctggagg tgtacctgac cgacgagttc gccaagggcc gcaaggtggc cgacctgtac 1260
gagctggtgc agtacgccgg caacatcatc ccccgcctgt acctgctgat caccgtgggc 1320
gtggtgtacg tgaagagctt cccccagagc cgcaaggaca tcctgaagga cctggtggag 1380
atgtgccgcg gcgtgcagca ccccctgcgc ggcctgttcc tgcgcaacta cctgctgcag 1440
tgcacccgca acatcctgcc cgacgagggc gagcccaccg acgaggagac caccggcgac 1500
atcagcgaca gcatggactt cgtgctgctg aacttcgccg agatgaacaa gctgtgggtg 1560
cgcatgcagc accagggcca cagccgcgac cgcgagaagc gcgagcgcga gcgccaggag 1620
ctgcgcatcc tggtgggcac caacctggtg cgcctgagcc agctggaggg cgtgaacgtg 1680
gagcgctaca agcagatcgt gctgaccggc atcctggagc aggtggtgaa ctgccgcgac 1740
gccctggccc aggagtacct gatggagtgc atcatccagg tgttccccga cgagttccac 1800
ctgcagaccc tgaacccctt cctgcgcgcc tgcgccgagc tgcaccagaa cgtgaacgtg 1860
aagaacatca tcatcgccct gatcgaccgc ctggccctgt tcgcccaccg cgaggacggc 1920
cccggcatcc ccgccgacat caagctgttc gacatcttca gccagcaggt ggccaccgtg 1980
atccagagcc gccaggacat gcccagcgag gacgtggtga gcctgcaggt gagcctgatc 2040
aacctggcca tgaagtgcta ccccgaccgc gtggactacg tggacaaggt gctggagacc 2100
accgtggaga tcttcaacaa gctgaacctg gagcacatcg ccaccagcag cgccgtgagc 2160
aaggagctga cccgcctgct gaagatcccc gtggacacct acaacaacat cctgaccgtg 2220
ctgaagctga agcacttcca ccccctgttc gagtacttcg actacgagag ccgcaagagc 2280
atgagctgct acgtgctgag caacgtgctg gactacaaca ccgagatcgt gagccaggac 2340
caggtggaca gcatcatgaa cctggtgagc accctgatcc aggaccagcc cgaccagccc 2400
gtggaggacc ccgaccccga ggacttcgcc gacgagcaga gcctggtggg ccgcttcatc 2460
cacctgctgc gcagcgagga ccccgaccag cagtacctga tcctgaacac cgcccgcaag 2520
cacttcggcg ccggcggcaa ccagcgcatc cgcttcaccc tgccccccct ggtgttcgcc 2580
gcctaccagc tggccttccg ctacaaggag aacagcaagg tggacgacaa gtgggagaag 2640
aagtgccaga agatcttcag cttcgcccac cagaccatca gcgccctgat caaggccgag 2700
ctggccgagc tgcccctgcg cctgttcctg cagggcgccc tggccgccgg cgagatcggc 2760
ttcgagaacc acgagaccgt ggcctacgag ttcatgagcc aggccttcag cctgtacgag 2820
gacgagatca gcgacagcaa ggcccagctg gccgccatca ccctgatcat cggcaccttc 2880
gagcgcatga agtgcttcag cgaggagaac cacgagcccc tgcgcaccca gtgcgccctg 2940
gccgccagca agctgctgaa gaagcccgac cagggccgcg ccgtgagcac ctgcgcccac 3000
ctgttctgga gcggccgcaa caccgacaag aacggcgagg agctgcacgg cggcaagcgc 3060
gtgatggagt gcctgaagaa ggccctgaag atcgccaacc agtgcatgga ccccagcctg 3120
caggtgcagc tgttcatcga gatcctgaac cgctacatct acttctacga gaaggagaac 3180
gacgccgtga ccatccaggt gctgaaccag ctgatccaga agatccgcga ggacctgccc 3240
aacctggaga gcagcgagga gaccgagcag atcaacaagc acttccacaa caccctggag 3300
cacctgcgcc tgcgccgcga gagccccgag agcgagggcc ccatctacga gggcctgatc 3360
ctgtgacaat tgttaattaa gtttaaaccc tcgaggccgc aagcttatcg ataatcaacc 3420
tctggattac aaaatttgtg aaagattgac tggtattctt aactatgttg ctccttttac 3480
gctatgtgga tacgctgctt taatgccttt gtatcatgct attgcttccc gtatggcttt 3540
cattttctcc tccttgtata aatcctggtt gctgtctctt tatgaggagt tgtggcccgt 3600
tgtcaggcaa cgtggcgtgg tgtgcactgt gtttgctgac gcaaccccca ctggttgggg 3660
cattgccacc acctgtcagc tcctttccgg gactttcgct ttccccctcc ctattgccac 3720
ggcggaactc atcgccgcct gccttgcccg ctgctggaca ggggctcggc tgttgggcac 3780
tgacaattcc gtggtgttgt cggggaaatc atcgtccttt ccttggctgc tcgcctgtgt 3840
tgccacctgg attctgcgcg ggacgtcctt ctgctacgtc ccttcggccc tcaatccagc 3900
ggaccttcct tcccgcggcc tgctgccggc tctgcggcct cttccgcgtc ttcgccttcg 3960
ccctcagacg agtcggatct ccctttgggc cgcctccccg catcgatacc gtcgactaga 4020
gctcgctgat cagcctcgac tgtgccttct agttgccagc catctgttgt ttgcccctcc 4080
cccgtgcctt ccttgaccct ggaaggtgcc actcccactg tcctttccta ataaaatgag 4140
gaaattgcat cgcattgtct gagtaggtgt cattctattc tggggggtgg ggtggggcag 4200
gacagcaagg gggaggattg ggaagacaat agcaggcatg ctggggagag atccacgata 4260
acaaacagct tttttggggt gaacatattg actgaattcc ctgcaggttg gccactccct 4320
ctctgcgcgc tcgctcgctc actgaggccg cccgggcaaa gcccgggcgt cgggcgacct 4380
ttggtcgccc ggcctcagtg agcgagcgag cgcgcagaga gggagtggcc aactccatca 4440
ctaggggttc ctgcggccgc tcgtacggtc tcgaggaatt cctgcaggat aacttgccaa 4500
cctcattcta aaatgtatat agaagcccaa aagacaataa caaaaatatt cttgtagaac 4560
aaaatgggaa agaatgttcc actaaatatc aagatttaga gcaaagcatg agatgtgtgg 4620
ggatagacag tgaggctgat aaaatagagt agagctcaga aacagaccca ttgatatatg 4680
taagtgacct atgaaaaaaa tatggcattt tacaatggga aaatgatggt ctttttcttt 4740
tttagaaaaa cagggaaata tatttatatg taaaaaataa aagggaaccc atatgtcata 4800
ccatacacac aaaaaaattc cagtgaatta taagtctaaa tggagaaggc aaaactttaa 4860
atcttttaga aaataatata gaagcatgca gaccagcctg gccaacatga tgaaaccctc 4920
tctactaata ataaaatcag tagaactact caggactact ttgagtggga agtccttttc 4980
tatgaagact tctttggcca aaattaggct ctaaatgcaa ggagatagtg catcatgcct 5040
ggctgcactt actgataaat gatgttatca ccatctttaa ccaaatgcac aggaacaagt 5100
tatggtactg atgtgctgga ttgagaagga gctctacttc cttgacagga cacatttgta 5160
tcaacttaaa aaagcagatt tttgccagca gaactattca ttcagaggta ggaaacttag 5220
aatagatgat gtcactgatt agcatggctt ccccatctcc acagctgctt cccacccagg 5280
ttgcccacag ttgagtttgt ccagtgctca gggctgccca ctctcagtaa gaagccccac 5340
accagcccct ctccaaatat gttggctgtt ccttccatta aagtgacccc actttagagc 5400
agcaagtgga tttctgtttc ttacagttca ggaaggagga gtcagctgtg agaacctgga 5460
gcctgagatg cttctaagtc ccactgctac tggggtcagg gaagccagac tccagcatca 5520
gcagtcagga gcactaagcc cttgccaaca tcctgtttct cagagaaact gcttccatta 5580
taatggttgt ccttttttaa gctatcaagc caaacaacca gtgtctacca ttattctcat 5640
cacctgaagc caagggttct agcaaaagtc aagctgtctt gtaatggttg atgtgcctcc 5700
agcttctgtc ttcagtcact ccactcttag cctgctctga atcaactctg accacagttc 5760
cctggagccc ctgccacctg ctgcccctgc caccttctcc atctgcagtg ctgtgcagcc 5820
ttctgcactc ttgcagagct aataggtgga gacttgaagg aagaggagga aagtttctca 5880
taatagcctt gctgcaagct caaatgggag gtgggcactg tgcccaggag ccttggagca 5940
aaggctgtgc ccaacctctg actgcatcca ggtttggtct tgacagagat aagaagccct 6000
ggcttttgga gccaaaatct aggtcagact taggcaggat tctcaaagtt tatcagcaga 6060
acatgaggca gaagaccctt tctgctccag cttcttcagg ctcaaccttc atcagaatag 6120
atagaaagag aggctgtgag ggttcttaaa acagaagcaa atctgactca gagaataaac 6180
aacctcctag taaactacag cttagacaga gcatctggtg gtgagtgtgc tcagtgtcct 6240
actcaactgt ctggtatcag ccctcatgag gacttctctt ctttccctca tagacctcca 6300
tctctgtttt ccttagcctg cagaaatctg gatggctatt cacagaatgc ctgtgctttc 6360
agagttgcat tttttctctg gtattctggt tcaagcattt gaaggtagga aaggttctcc 6420
aagtgcaaga aagccagccc tgagcctcaa ctgcctggct agtgtggtca gtaggatgca 6480
aaggctgttg aatgccacaa ggccaaactt taacctgtgt accacaagcc tagcagcaga 6540
ggcagctctg ctcactggaa ctctctgtct tctttctcct gagccttttc ttttcctgag 6600
ttttctagct ctcctcaacc ttacctctgc cctacccagg acaaacccaa gagccactgt 6660
ttctgtgatg tcctctccag ccctaattag gcatcatgac ttcagcctga ccttccatgc 6720
tcagaagcag tgctaatcca cttcagatga gctgctctat gcaacacagg cagagcctac 6780
aaacctttgc accagagccc tccacatatc agtgtttgtt catactcact tcaacagcaa 6840
atgtgactgc tgagattaag attttacaca agatggtctg taatttcaca gttagtttta 6900
tcccattagg tatgaaagaa ttagcataat tccccttaaa catgaatgaa tcttagattt 6960
tttaataaat agttttggaa gtaaagacag agacatcagg agcacaagga atagcctgag 7020
aggacaaaca gaacaagaaa gagtctggaa atacacagga tgttcttggc ctcctcaaag 7080
caagtgcaag cagatagtac cagcagcccc aggctatcag agcccagtga agagaagtac 7140
catgaaagcc acagctctaa ccaccctgtt ccagagtgac agacagtccc caagacaagc 7200
cagcctgagc cagagagaga actgcaagag aaagtttcta atttaggttc tgttagattc 7260
agacaagtgc aggtcatcct ctctccacag ctactcacct ctccagccta acaaagcctg 7320
cagtccacac tccaaccctg gtgtctcacc tcctagcctc tcccaacatc ctgctctctg 7380
accatcttct gcatctctca tctcaccatc tcccactgtc tacagcctac tcttgcaact 7440
accatctcat tttctgacat cctgtctaca tcttctgcca tactctgcca tctaccatac 7500
cacctcttac catctaccac accatctttt atctccatcc ctctcagaag cctccaagct 7560
gaatcctgct ttatgtgttc atctcagccc ctgcatggaa agctgacccc agaggcagaa 7620
ctattcccag agagcttggc caagaaaaac aaaactacca gcctggccag gctcaggagt 7680
agtaagctgc agtgtctgtt gtgttctagc ttcaacagct gcaggagttc cactctcaaa 7740
tgctccacat ttctcacatc ctcctgattc tggtcactac ccatcttcaa agaacagaat 7800
atctcacatc agcatactgt gaaggactag tcatgggtgc agctgctcag agctgcaaag 7860
tcattctgga tggtggagag cttacaaaca tttcatgatg ctccccccgc tctgatggct 7920
ggagcccaat ccctacacag actcctgctg tatgtgtttt cctttcactc tgagccacag 7980
ccagagggca ggcattcagt ctcctcttca ggctggggct ggggcactga gaactcaccc 8040
aacaccttgc tctcactcct tctgcaaaac aagaaagagc tttgtgctgc agtagccatg 8100
aagaatgaaa ggaaggcttt aactaaaaaa tgtcagagat tattttcaac cccttactgt 8160
ggatcaccag caaggaggaa acacaacaca gagacatttt ttcccctcaa attatcaaaa 8220
gaatcactgc atttgttaaa gagagcaact gaatcaggaa gcagagtttt gaacatatca 8280
gaagttagga atctgcatca gagacaaatg cagtcatggt tgtttgctgc ataccagccc 8340
taatcattag aagcctcatg gacttcaaac atcattccct ctgacaagat gctctagcct 8400
aactccatga gataaaataa atctgccttt cagagccaaa gaagagtcca ccagcttctt 8460
ctcagtgtga acaagagctc cagtcaggtt agtcagtcca gtgcagtaga ggagaccagt 8520
ctgcatcctc taattttcaa aggcaagaag atttgtttac cctggacacc aggcacaagt 8580
gaggtcacag agctcttaga tatgcagtcc tcatgagtga ggagactaaa gcgcatgcca 8640
tcaagacttc agtgtagaga aaacctccaa aaaagcctcc tcactacttc tggaatagct 8700
cagaggccga ggcggcctcg gcctctgcat aaataaaaaa aattagtcag ccatggggcg 8760
gagaatgggc ggaactgggc ggagttaggg gcgggatggg cggagttagg ggcgggacta 8820
tggttgctga ctaattgaga tgcatgcttt gcatacttct gcctgctggg gagcctgggg 8880
actttccaca cctggttgct gactaattga gatgcatgct ttgcatactt ctgcctgctg 8940
gggagcctgg ggactttcca caccctaact gacacacatt ccacagctgc attaatgaat 9000
cggccaacgc gcggggagag gcggtttgcg tattgggcgc tcttccgctt cctcgctcac 9060
tgactcgctg cgctcggtcg ttcggctgcg gcgagcggta tcagctcact caaaggcggt 9120
aatacggtta tccacagaat caggggataa cgcaggaaag aacatgtgag caaaaggcca 9180
gcaaaaggcc aggaaccgta aaaaggccgc gttgctggcg tttttccata ggctccgccc 9240
ccctgacgag catcacaaaa atcgacgctc aagtcagagg tggcgaaacc cgacaggact 9300
ataaagatac caggcgtttc cccctggaag ctccctcgtg cgctctcctg ttccgaccct 9360
gccgcttacc ggatacctgt ccgcctttct cccttcggga agcgtggcgc tttctcatag 9420
ctcacgctgt aggtatctca gttcggtgta ggtcgttcgc tccaagctgg gctgtgtgca 9480
cgaacccccc gttcagcccg accgctgcgc cttatccggt aactatcgtc ttgagtccaa 9540
cccggtaaga cacgacttat cgccactggc agcagccact ggtaacagga ttagcagagc 9600
gaggtatgta ggcggtgcta cagagttctt gaagtggtgg cctaactacg gctacactag 9660
aagaacagta tttggtatct gcgctctgct gaagccagtt accttcggaa aaagagttgg 9720
tagctcttga tccggcaaac aaaccaccgc tggtagcggt ggtttttttg tttgcaagca 9780
gcagattacg cgcagaaaaa aaggatctca agaagatcct ttgatctttt ctacggggtc 9840
tgacgctcag tggaacgaaa actcacgtta agggattttg gtcatgagat tatcaaaaag 9900
gatcttcacc tagatccttt taaattaaaa atgaagtttt aaatcaatct aaagtatata 9960
tgagtaaact tggtctgaca gttaccaatg cttaatcagt gaggcaccta tctcagcgat 10020
ctgtctattt cgttcatcca tagttgcctg actcctgcaa accacgttgt gtctcaaaat 10080
ctctgatgtt acattgcaca agataaaaat atatcatcat gaacaataaa actgtctgct 10140
tacataaaca gtaatacaag gggtgttatg agccatattc aacgggaaac gtcttgctcg 10200
aggccgcgat taaattccaa catggatgct gatttatatg ggtataaatg ggctcgcgat 10260
aatgtcgggc aatcaggtgc gacaatctat cgattgtatg ggaagcccga tgcgccagag 10320
ttgtttctga aacatggcaa aggtagcgtt gccaatgatg ttacagatga gatggtcaga 10380
ctaaactggc tgacggaatt tatgcctctt ccgaccatca agcattttat ccgtactcct 10440
gatgatgcat ggttactcac cactgcgatc cccgggaaaa cagcattcca ggtattagaa 10500
gaatatcctg attcaggtga aaatattgtt gatgcgctgg cagtgttcct gcgccggttg 10560
cattcgattc ctgtttgtaa ttgtcctttt aacagcgatc gcgtatttcg tctcgctcag 10620
gcgcaatcac gaatgaataa cggtttggtt gatgcgagtg attttgatga cgagcgtaat 10680
ggctggcctg ttgaacaagt ctggaaagaa atgcataagc ttttgccatt ctcaccggat 10740
tcagtcgtca ctcatggtga tttctcactt gataacctta tttttgacga ggggaaatta 10800
ataggttgta ttgatgttgg acgagtcgga atcgcagacc gataccagga tcttgccatc 10860
ctatggaact gcctcggtga gttttctcct tcattacaga aacggctttt tcaaaaatat 10920
ggtattgata atcctgatat gaataaattg cagtttcatt tgatgctcga tgagtttttc 10980
taagggcggc ctgccaccat acccacgccg aaacaagcgc tcatgagccc gaagtggcga 11040
gcccgatctt ccccatcggt gatgtcggcg atataggcgc cagcaaccgc acctgtggcg 11100
ccggtgatga gggcgcgcca agtcgacgtc cggcagtc 11138
<210> 55
<211> 242
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 55
Met Pro Arg Gly Phe Thr Trp Leu Arg Tyr Leu Gly Ile Phe Leu Gly
1 5 10 15
Val Ala Leu Gly Asn Glu Pro Leu Glu Met Trp Pro Leu Thr Gln Asn
20 25 30
Glu Glu Cys Thr Val Thr Gly Phe Leu Arg Asp Lys Leu Gln Tyr Arg
35 40 45
Ser Arg Leu Gln Tyr Met Lys His Tyr Phe Pro Ile Asn Tyr Lys Ile
50 55 60
Ser Val Pro Tyr Glu Gly Val Phe Arg Ile Ala Asn Val Thr Arg Leu
65 70 75 80
Gln Arg Ala Gln Val Ser Glu Arg Glu Leu Arg Tyr Leu Trp Val Leu
85 90 95
Val Ser Leu Ser Ala Thr Glu Ser Val Gln Asp Val Leu Leu Glu Gly
100 105 110
His Pro Ser Trp Lys Tyr Leu Gln Glu Val Glu Thr Leu Leu Leu Asn
115 120 125
Val Gln Gln Gly Leu Thr Asp Val Glu Val Ser Pro Lys Val Glu Ser
130 135 140
Val Leu Ser Leu Leu Asn Ala Pro Gly Pro Asn Leu Lys Leu Val Arg
145 150 155 160
Pro Lys Ala Leu Leu Asp Asn Cys Phe Arg Val Met Glu Leu Leu Tyr
165 170 175
Cys Ser Cys Cys Lys Gln Ser Ser Val Leu Asn Trp Gln Asp Cys Glu
180 185 190
Val Pro Ser Pro Gln Ser Cys Ser Pro Glu Pro Ser Leu Gln Tyr Ala
195 200 205
Ala Thr Gln Leu Tyr Pro Pro Pro Pro Trp Ser Pro Ser Ser Pro Pro
210 215 220
His Ser Thr Gly Ser Val Arg Pro Val Arg Ala Gln Gly Glu Gly Leu
225 230 235 240
Leu Pro
<210> 56
<211> 729
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 56
atgccccgcg gcttcacctg gctgcgctac ctgggcatct tcctgggcgt ggccctgggc 60
aacgagcccc tggagatgtg gcccctgacc cagaacgagg agtgcaccgt gaccggcttc 120
ctgcgcgaca agctgcagta ccgcagccgc ctgcagtaca tgaagcacta cttccccatc 180
aactacaaga tcagcgtgcc ctacgagggc gtgttccgca tcgccaacgt gacccgcctg 240
cagcgcgccc aggtgagcga gcgcgagctg cgctacctgt gggtgctggt gagcctgagc 300
gccaccgaga gcgtgcagga cgtgctgctg gagggccacc ccagctggaa gtacctgcag 360
gaggtggaga ccctgctgct gaacgtgcag cagggcctga ccgacgtgga ggtgagcccc 420
aaggtggaga gcgtgctgag cctgctgaac gcccccggcc ccaacctgaa gctggtgcgc 480
cccaaggccc tgctggacaa ctgcttccgc gtgatggagc tgctgtactg cagctgctgc 540
aagcagagca gcgtgctgaa ctggcaggac tgcgaggtgc ccagccccca gagctgcagc 600
cccgagccca gcctgcagta cgccgccacc cagctgtacc cccccccccc ctggagcccc 660
agcagccccc cccacagcac cggcagcgtg cgccccgtgc gcgcccaggg cgagggcctg 720
ctgccctaa 729
<210> 57
<211> 230
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 57
Met Glu Pro Leu Arg Leu Leu Ile Leu Leu Phe Val Thr Glu Leu Ser
1 5 10 15
Gly Ala His Asn Thr Thr Val Phe Gln Gly Val Ala Gly Gln Ser Leu
20 25 30
Gln Val Ser Cys Pro Tyr Asp Ser Met Lys His Trp Gly Arg Arg Lys
35 40 45
Ala Trp Cys Arg Gln Leu Gly Glu Lys Gly Pro Cys Gln Arg Val Val
50 55 60
Ser Thr His Asn Leu Trp Leu Leu Ser Phe Leu Arg Arg Trp Asn Gly
65 70 75 80
Ser Thr Ala Ile Thr Asp Asp Thr Leu Gly Gly Thr Leu Thr Ile Thr
85 90 95
Leu Arg Asn Leu Gln Pro His Asp Ala Gly Leu Tyr Gln Cys Gln Ser
100 105 110
Leu His Gly Ser Glu Ala Asp Thr Leu Arg Lys Val Leu Val Glu Val
115 120 125
Leu Ala Asp Pro Leu Asp His Arg Asp Ala Gly Asp Leu Trp Phe Pro
130 135 140
Gly Glu Ser Glu Ser Phe Glu Asp Ala His Val Glu His Ser Ile Ser
145 150 155 160
Arg Ser Leu Leu Glu Gly Glu Ile Pro Phe Pro Pro Thr Ser Ile Leu
165 170 175
Leu Leu Leu Ala Cys Ile Phe Leu Ile Lys Ile Leu Ala Ala Ser Ala
180 185 190
Leu Trp Ala Ala Ala Trp His Gly Gln Lys Pro Gly Thr His Pro Pro
195 200 205
Ser Glu Leu Asp Cys Gly His Asp Pro Gly Tyr Gln Leu Gln Thr Leu
210 215 220
Pro Gly Leu Arg Asp Thr
225 230
<210> 58
<211> 690
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 58
atggagcccc tgcgcctgct gatcctgctg ttcgtgaccg agctgagcgg cgcccacaac 60
accaccgtgt tccagggcgt ggccggccag agcctgcagg tgagctgccc ctacgacagc 120
atgaagcact ggggccgccg caaggcctgg tgccgccagc tgggcgagaa gggcccctgc 180
cagcgcgtgg tgagcaccca caacctgtgg ctgctgagct tcctgcgccg ctggaacggc 240
agcaccgcca tcaccgacga caccctgggc ggcaccctga ccatcaccct gcgcaacctg 300
cagccccacg acgccggcct gtaccagtgc cagagcctgc acggcagcga ggccgacacc 360
ctgcgcaagg tgctggtgga ggtgctggcc gaccccctgg accaccgcga cgccggcgac 420
ctgtggttcc ccggcgagag cgagagcttc gaggacgccc acgtggagca cagcatcagc 480
cgcagcctgc tggagggcga gatccccttc ccccccacca gcatcctgct gctgctggcc 540
tgcatcttcc tgatcaagat cctggccgcc agcgccctgt gggccgccgc ctggcacggc 600
cagaagcccg gcacccaccc ccccagcgag ctggactgcg gccacgaccc cggctaccag 660
ctgcagaccc tgcccggcct gcgcgacacc 690
<210> 59
<211> 11060
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 59
ttggccactc cctctctgcg cgctcgctcg ctcactgagg ccgggcgacc aaaggtcgcc 60
cgacgcccgg gctttgcccg ggcggcctca gtgagcgagc gagcgcgcag agagggagtg 120
gccaactcca tcactagggg ttcctgctag ctctgggtat ttaagcccga gtgagcacgc 180
agggtctcca ttttgaagcg ggaggttacg cgttcgtcga ctactagtgg gtaccagagc 240
tccctaggtt ctagaaccgg tgacgtctcc catggtgaag cttggatctg agggcggagt 300
tagggcggag ccaatcagcg tgcgccgttc cgaaagttgc cttttatggc tgggcggaga 360
atgggcggtg aacgccgatg attatataag gacgcgccgg gtgtggcaca gctagttccg 420
tcgcagccgg gatttgggtc gcggttcttg tttgtggatc cctgtgatcg tcacttggta 480
agtcactgac tgtctatgcc tgggaaaggg tgggcaggag atggggcagt gcaggaaaag 540
tggcactatg aaccctcctg gtggcgaggg gaggggggtg gtcctcgaac gccttgcaga 600
actggcctgg atacagagtg gaccggctgg ccccatctgg aagacttcga gatacactgt 660
tgtcttactg cgctcaacag tgtatctcga agtcttccaa atggtgccag ccatcgcagc 720
ggggtgcagg aaatgggggc agcccccctt tttggctatc cttccacgtg ttcttttttg 780
tatcttttgt gtttcctaga aaacatctca gtcaccaccg cagccctagg aatgcatcta 840
gacaattgta ctaaccttct tctctttcct ctcctgacag tccggaaagc caccatggaa 900
ttcagcagcc ccagcagaga ggaatgcccc aagcctctga gccgggtgtc aatcatggcc 960
ggatctctga caggactgct gctgcttcag gccgtgtctt gggcttctgg cgctagacct 1020
tgcatcccca agagcttcgg ctacagcagc gtcgtgtgcg tgtgcaatgc cacctactgc 1080
gacagcttcg accctcctac ctttcctgct ctgggcacct tcagcagata cgagagcacc 1140
agatccggca gacggatgga actgagcatg ggacccatcc aggccaatca cacaggcact 1200
ggcctgctgc tgacactgca gcctgagcag aaattccaga aagtgaaagg cttcggcgga 1260
gccatgacag atgccgccgc tctgaatatc ctggctctgt ctccaccagc tcagaacctg 1320
ctgctcaaga gctacttcag cgaggaaggc atcggctaca acatcatcag agtgcccatg 1380
gccagctgcg acttcagcat caggacctac acctacgccg acacacccga cgatttccag 1440
ctgcacaact tcagcctgcc tgaagaggac accaagctga agatccctct gatccacaga 1500
gccctgcagc tggcacaaag acccgtgtca ctgctggcct ctccatggac atctcccacc 1560
tggctgaaaa caaatggcgc cgtgaatggc aagggcagcc tgaaaggcca acctggcgac 1620
atctaccacc agacctgggc cagatacttc gtgaagttcc tggacgccta tgccgagcac 1680
aagctgcagt tttgggccgt gacagccgag aacgaacctt ctgctggact gctgagcggc 1740
tacccctttc agtgcctggg ctttacaccc gagcaccagc gggactttat cgcccgtgat 1800
ctgggaccca cactggccaa tagcacccac cataatgtgc ggctgctgat gctggacgac 1860
cagagactgc ttctgcccca ctgggctaaa gtggtgctga cagatcctga ggccgccaaa 1920
tacgtgcacg gaatcgccgt gcactggtat ctggactttc tggcccctgc caaggccaca 1980
ctgggagaga cacacagact gttccccaac accatgctgt tcgccagcga agcctgtgtg 2040
ggcagcaagt tttgggaaca gagcgtgcgg ctcggcagct gggatagagg catgcagtac 2100
agccacagca tcatcaccaa cctgctgtac cacgtcgtcg gctggaccga ctggaatctg 2160
gccctgaatc ctgaaggcgg ccctaactgg gtccgaaact tcgtggacag ccccatcatc 2220
gtggacatca ccaaggacac cttctacaag cagcccatgt tctaccacct gggacacttc 2280
agcaagttca tccccgaggg ctctcagcgc gttggactgg tggcttccca gaagaacgat 2340
ctggacgccg tggctctgat gcaccctgat ggatctgctg tggtggtggt cctgaaccgc 2400
agcagcaaag atgtgcccct gaccatcaag gatcccgccg tgggattcct ggaaacaatc 2460
agccctggct actccatcca cacctacctg tggcgtagac aggagggcag aggaagtctt 2520
ctgacatgcg gagacgtgga agagaatccc ggccctatgc cccgcggctt cacctggctg 2580
cgctacctgg gcatcttcct gggcgtggcc ctgggcaacg agcccctgga gatgtggccc 2640
ctgacccaga acgaggagtg caccgtgacc ggcttcctgc gcgacaagct gcagtaccgc 2700
agccgcctgc agtacatgaa gcactacttc cccatcaact acaagatcag cgtgccctac 2760
gagggcgtgt tccgcatcgc caacgtgacc cgcctgcagc gcgcccaggt gagcgagcgc 2820
gagctgcgct acctgtgggt gctggtgagc ctgagcgcca ccgagagcgt gcaggacgtg 2880
ctgctggagg gccaccccag ctggaagtac ctgcaggagg tggagaccct gctgctgaac 2940
gtgcagcagg gcctgaccga cgtggaggtg agccccaagg tggagagcgt gctgagcctg 3000
ctgaacgccc ccggccccaa cctgaagctg gtgcgcccca aggccctgct ggacaactgc 3060
ttccgcgtga tggagctgct gtactgcagc tgctgcaagc agagcagcgt gctgaactgg 3120
caggactgcg aggtgcccag cccccagagc tgcagccccg agcccagcct gcagtacgcc 3180
gccacccagc tgtacccccc ccccccctgg agccccagca gcccccccca cagcaccggc 3240
agcgtgcgcc ccgtgcgcgc ccagggcgag ggcctgctgc cctaatgaca attgttaatt 3300
aagtttaaac cctcgaggcc gcaagcttat cgataatcaa cctctggatt acaaaatttg 3360
tgaaagattg actggtattc ttaactatgt tgctcctttt acgctatgtg gatacgctgc 3420
tttaatgcct ttgtatcatg ctattgcttc ccgtatggct ttcattttct cctccttgta 3480
taaatcctgg ttgctgtctc tttatgagga gttgtggccc gttgtcaggc aacgtggcgt 3540
ggtgtgcact gtgtttgctg acgcaacccc cactggttgg ggcattgcca ccacctgtca 3600
gctcctttcc gggactttcg ctttccccct ccctattgcc acggcggaac tcatcgccgc 3660
ctgccttgcc cgctgctgga caggggctcg gctgttgggc actgacaatt ccgtggtgtt 3720
gtcggggaaa tcatcgtcct ttccttggct gctcgcctgt gttgccacct ggattctgcg 3780
cgggacgtcc ttctgctacg tcccttcggc cctcaatcca gcggaccttc cttcccgcgg 3840
cctgctgccg gctctgcggc ctcttccgcg tcttcgcctt cgccctcaga cgagtcggat 3900
ctccctttgg gccgcctccc cgcatcgata ccgtcgacta gagctcgctg atcagcctcg 3960
actgtgcctt ctagttgcca gccatctgtt gtttgcccct cccccgtgcc ttccttgacc 4020
ctggaaggtg ccactcccac tgtcctttcc taataaaatg aggaaattgc atcgcattgt 4080
ctgagtaggt gtcattctat tctggggggt ggggtggggc aggacagcaa gggggaggat 4140
tgggaagaca atagcaggca tgctggggag agatccacga taacaaacag cttttttggg 4200
gtgaacatat tgactgaatt ccctgcaggt tggccactcc ctctctgcgc gctcgctcgc 4260
tcactgaggc cgcccgggca aagcccgggc gtcgggcgac ctttggtcgc ccggcctcag 4320
tgagcgagcg agcgcgcaga gagggagtgg ccaactccat cactaggggt tcctgcggcc 4380
gctcgtacgg tctcgaggaa ttcctgcagg ataacttgcc aacctcattc taaaatgtat 4440
atagaagccc aaaagacaat aacaaaaata ttcttgtaga acaaaatggg aaagaatgtt 4500
ccactaaata tcaagattta gagcaaagca tgagatgtgt ggggatagac agtgaggctg 4560
ataaaataga gtagagctca gaaacagacc cattgatata tgtaagtgac ctatgaaaaa 4620
aatatggcat tttacaatgg gaaaatgatg gtctttttct tttttagaaa aacagggaaa 4680
tatatttata tgtaaaaaat aaaagggaac ccatatgtca taccatacac acaaaaaaat 4740
tccagtgaat tataagtcta aatggagaag gcaaaacttt aaatctttta gaaaataata 4800
tagaagcatg cagaccagcc tggccaacat gatgaaaccc tctctactaa taataaaatc 4860
agtagaacta ctcaggacta ctttgagtgg gaagtccttt tctatgaaga cttctttggc 4920
caaaattagg ctctaaatgc aaggagatag tgcatcatgc ctggctgcac ttactgataa 4980
atgatgttat caccatcttt aaccaaatgc acaggaacaa gttatggtac tgatgtgctg 5040
gattgagaag gagctctact tccttgacag gacacatttg tatcaactta aaaaagcaga 5100
tttttgccag cagaactatt cattcagagg taggaaactt agaatagatg atgtcactga 5160
ttagcatggc ttccccatct ccacagctgc ttcccaccca ggttgcccac agttgagttt 5220
gtccagtgct cagggctgcc cactctcagt aagaagcccc acaccagccc ctctccaaat 5280
atgttggctg ttccttccat taaagtgacc ccactttaga gcagcaagtg gatttctgtt 5340
tcttacagtt caggaaggag gagtcagctg tgagaacctg gagcctgaga tgcttctaag 5400
tcccactgct actggggtca gggaagccag actccagcat cagcagtcag gagcactaag 5460
cccttgccaa catcctgttt ctcagagaaa ctgcttccat tataatggtt gtcctttttt 5520
aagctatcaa gccaaacaac cagtgtctac cattattctc atcacctgaa gccaagggtt 5580
ctagcaaaag tcaagctgtc ttgtaatggt tgatgtgcct ccagcttctg tcttcagtca 5640
ctccactctt agcctgctct gaatcaactc tgaccacagt tccctggagc ccctgccacc 5700
tgctgcccct gccaccttct ccatctgcag tgctgtgcag ccttctgcac tcttgcagag 5760
ctaataggtg gagacttgaa ggaagaggag gaaagtttct cataatagcc ttgctgcaag 5820
ctcaaatggg aggtgggcac tgtgcccagg agccttggag caaaggctgt gcccaacctc 5880
tgactgcatc caggtttggt cttgacagag ataagaagcc ctggcttttg gagccaaaat 5940
ctaggtcaga cttaggcagg attctcaaag tttatcagca gaacatgagg cagaagaccc 6000
tttctgctcc agcttcttca ggctcaacct tcatcagaat agatagaaag agaggctgtg 6060
agggttctta aaacagaagc aaatctgact cagagaataa acaacctcct agtaaactac 6120
agcttagaca gagcatctgg tggtgagtgt gctcagtgtc ctactcaact gtctggtatc 6180
agccctcatg aggacttctc ttctttccct catagacctc catctctgtt ttccttagcc 6240
tgcagaaatc tggatggcta ttcacagaat gcctgtgctt tcagagttgc attttttctc 6300
tggtattctg gttcaagcat ttgaaggtag gaaaggttct ccaagtgcaa gaaagccagc 6360
cctgagcctc aactgcctgg ctagtgtggt cagtaggatg caaaggctgt tgaatgccac 6420
aaggccaaac tttaacctgt gtaccacaag cctagcagca gaggcagctc tgctcactgg 6480
aactctctgt cttctttctc ctgagccttt tcttttcctg agttttctag ctctcctcaa 6540
ccttacctct gccctaccca ggacaaaccc aagagccact gtttctgtga tgtcctctcc 6600
agccctaatt aggcatcatg acttcagcct gaccttccat gctcagaagc agtgctaatc 6660
cacttcagat gagctgctct atgcaacaca ggcagagcct acaaaccttt gcaccagagc 6720
cctccacata tcagtgtttg ttcatactca cttcaacagc aaatgtgact gctgagatta 6780
agattttaca caagatggtc tgtaatttca cagttagttt tatcccatta ggtatgaaag 6840
aattagcata attcccctta aacatgaatg aatcttagat tttttaataa atagttttgg 6900
aagtaaagac agagacatca ggagcacaag gaatagcctg agaggacaaa cagaacaaga 6960
aagagtctgg aaatacacag gatgttcttg gcctcctcaa agcaagtgca agcagatagt 7020
accagcagcc ccaggctatc agagcccagt gaagagaagt accatgaaag ccacagctct 7080
aaccaccctg ttccagagtg acagacagtc cccaagacaa gccagcctga gccagagaga 7140
gaactgcaag agaaagtttc taatttaggt tctgttagat tcagacaagt gcaggtcatc 7200
ctctctccac agctactcac ctctccagcc taacaaagcc tgcagtccac actccaaccc 7260
tggtgtctca cctcctagcc tctcccaaca tcctgctctc tgaccatctt ctgcatctct 7320
catctcacca tctcccactg tctacagcct actcttgcaa ctaccatctc attttctgac 7380
atcctgtcta catcttctgc catactctgc catctaccat accacctctt accatctacc 7440
acaccatctt ttatctccat ccctctcaga agcctccaag ctgaatcctg ctttatgtgt 7500
tcatctcagc ccctgcatgg aaagctgacc ccagaggcag aactattccc agagagcttg 7560
gccaagaaaa acaaaactac cagcctggcc aggctcagga gtagtaagct gcagtgtctg 7620
ttgtgttcta gcttcaacag ctgcaggagt tccactctca aatgctccac atttctcaca 7680
tcctcctgat tctggtcact acccatcttc aaagaacaga atatctcaca tcagcatact 7740
gtgaaggact agtcatgggt gcagctgctc agagctgcaa agtcattctg gatggtggag 7800
agcttacaaa catttcatga tgctcccccc gctctgatgg ctggagccca atccctacac 7860
agactcctgc tgtatgtgtt ttcctttcac tctgagccac agccagaggg caggcattca 7920
gtctcctctt caggctgggg ctggggcact gagaactcac ccaacacctt gctctcactc 7980
cttctgcaaa acaagaaaga gctttgtgct gcagtagcca tgaagaatga aaggaaggct 8040
ttaactaaaa aatgtcagag attattttca accccttact gtggatcacc agcaaggagg 8100
aaacacaaca cagagacatt ttttcccctc aaattatcaa aagaatcact gcatttgtta 8160
aagagagcaa ctgaatcagg aagcagagtt ttgaacatat cagaagttag gaatctgcat 8220
cagagacaaa tgcagtcatg gttgtttgct gcataccagc cctaatcatt agaagcctca 8280
tggacttcaa acatcattcc ctctgacaag atgctctagc ctaactccat gagataaaat 8340
aaatctgcct ttcagagcca aagaagagtc caccagcttc ttctcagtgt gaacaagagc 8400
tccagtcagg ttagtcagtc cagtgcagta gaggagacca gtctgcatcc tctaattttc 8460
aaaggcaaga agatttgttt accctggaca ccaggcacaa gtgaggtcac agagctctta 8520
gatatgcagt cctcatgagt gaggagacta aagcgcatgc catcaagact tcagtgtaga 8580
gaaaacctcc aaaaaagcct cctcactact tctggaatag ctcagaggcc gaggcggcct 8640
cggcctctgc ataaataaaa aaaattagtc agccatgggg cggagaatgg gcggaactgg 8700
gcggagttag gggcgggatg ggcggagtta ggggcgggac tatggttgct gactaattga 8760
gatgcatgct ttgcatactt ctgcctgctg gggagcctgg ggactttcca cacctggttg 8820
ctgactaatt gagatgcatg ctttgcatac ttctgcctgc tggggagcct ggggactttc 8880
cacaccctaa ctgacacaca ttccacagct gcattaatga atcggccaac gcgcggggag 8940
aggcggtttg cgtattgggc gctcttccgc ttcctcgctc actgactcgc tgcgctcggt 9000
cgttcggctg cggcgagcgg tatcagctca ctcaaaggcg gtaatacggt tatccacaga 9060
atcaggggat aacgcaggaa agaacatgtg agcaaaaggc cagcaaaagg ccaggaaccg 9120
taaaaaggcc gcgttgctgg cgtttttcca taggctccgc ccccctgacg agcatcacaa 9180
aaatcgacgc tcaagtcaga ggtggcgaaa cccgacagga ctataaagat accaggcgtt 9240
tccccctgga agctccctcg tgcgctctcc tgttccgacc ctgccgctta ccggatacct 9300
gtccgccttt ctcccttcgg gaagcgtggc gctttctcat agctcacgct gtaggtatct 9360
cagttcggtg taggtcgttc gctccaagct gggctgtgtg cacgaacccc ccgttcagcc 9420
cgaccgctgc gccttatccg gtaactatcg tcttgagtcc aacccggtaa gacacgactt 9480
atcgccactg gcagcagcca ctggtaacag gattagcaga gcgaggtatg taggcggtgc 9540
tacagagttc ttgaagtggt ggcctaacta cggctacact agaagaacag tatttggtat 9600
ctgcgctctg ctgaagccag ttaccttcgg aaaaagagtt ggtagctctt gatccggcaa 9660
acaaaccacc gctggtagcg gtggtttttt tgtttgcaag cagcagatta cgcgcagaaa 9720
aaaaggatct caagaagatc ctttgatctt ttctacgggg tctgacgctc agtggaacga 9780
aaactcacgt taagggattt tggtcatgag attatcaaaa aggatcttca cctagatcct 9840
tttaaattaa aaatgaagtt ttaaatcaat ctaaagtata tatgagtaaa cttggtctga 9900
cagttaccaa tgcttaatca gtgaggcacc tatctcagcg atctgtctat ttcgttcatc 9960
catagttgcc tgactcctgc aaaccacgtt gtgtctcaaa atctctgatg ttacattgca 10020
caagataaaa atatatcatc atgaacaata aaactgtctg cttacataaa cagtaataca 10080
aggggtgtta tgagccatat tcaacgggaa acgtcttgct cgaggccgcg attaaattcc 10140
aacatggatg ctgatttata tgggtataaa tgggctcgcg ataatgtcgg gcaatcaggt 10200
gcgacaatct atcgattgta tgggaagccc gatgcgccag agttgtttct gaaacatggc 10260
aaaggtagcg ttgccaatga tgttacagat gagatggtca gactaaactg gctgacggaa 10320
tttatgcctc ttccgaccat caagcatttt atccgtactc ctgatgatgc atggttactc 10380
accactgcga tccccgggaa aacagcattc caggtattag aagaatatcc tgattcaggt 10440
gaaaatattg ttgatgcgct ggcagtgttc ctgcgccggt tgcattcgat tcctgtttgt 10500
aattgtcctt ttaacagcga tcgcgtattt cgtctcgctc aggcgcaatc acgaatgaat 10560
aacggtttgg ttgatgcgag tgattttgat gacgagcgta atggctggcc tgttgaacaa 10620
gtctggaaag aaatgcataa gcttttgcca ttctcaccgg attcagtcgt cactcatggt 10680
gatttctcac ttgataacct tatttttgac gaggggaaat taataggttg tattgatgtt 10740
ggacgagtcg gaatcgcaga ccgataccag gatcttgcca tcctatggaa ctgcctcggt 10800
gagttttctc cttcattaca gaaacggctt tttcaaaaat atggtattga taatcctgat 10860
atgaataaat tgcagtttca tttgatgctc gatgagtttt tctaagggcg gcctgccacc 10920
atacccacgc cgaaacaagc gctcatgagc ccgaagtggc gagcccgatc ttccccatcg 10980
gtgatgtcgg cgatataggc gccagcaacc gcacctgtgg cgccggtgat gagggcgcgc 11040
caagtcgacg tccggcagtc 11060
<210> 60
<211> 10913
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 60
ttggccactc cctctctgcg cgctcgctcg ctcactgagg ccgggcgacc aaaggtcgcc 60
cgacgcccgg gctttgcccg ggcggcctca gtgagcgagc gagcgcgcag agagggagtg 120
gccaactcca tcactagggg ttcctgctag ctctgggtat ttaagcccga gtgagcacgc 180
agggtctcca ttttgaagcg ggaggttacg cgttcgtcga ctactagtgg gtaccagagc 240
tccctaggtt ctagaaccgg tgacgtctcc catggtgaag cttggatctg agggcggagt 300
tagggcggag ccaatcagcg tgcgccgttc cgaaagttgc cttttatggc tgggcggaga 360
atgggcggtg aacgccgatg attatataag gacgcgccgg gtgtggcaca gctagttccg 420
tcgcagccgg gatttgggtc gcggttcttg tttgtggatc cctgtgatcg tcacttggta 480
agtcactgac tgtctatgcc tgggaaaggg tgggcaggag atggggcagt gcaggaaaag 540
tggcactatg aaccctcctg gtggcgaggg gaggggggtg gtcctcgaac gccttgcaga 600
actggcctgg atacagagtg gaccggctgg ccccatctgg aagacttcga gatacactgt 660
tgtcttactg cgctcaacag tgtatctcga agtcttccaa atggtgccag ccatcgcagc 720
ggggtgcagg aaatgggggc agcccccctt tttggctatc cttccacgtg ttcttttttg 780
tatcttttgt gtttcctaga aaacatctca gtcaccaccg cagccctagg aatgcatcta 840
gacaattgta ctaaccttct tctctttcct ctcctgacag tccggaaagc caccatggaa 900
ttcagcagcc ccagcagaga ggaatgcccc aagcctctga gccgggtgtc aatcatggcc 960
ggatctctga caggactgct gctgcttcag gccgtgtctt gggcttctgg cgctagacct 1020
tgcatcccca agagcttcgg ctacagcagc gtcgtgtgcg tgtgcaatgc cacctactgc 1080
gacagcttcg accctcctac ctttcctgct ctgggcacct tcagcagata cgagagcacc 1140
agatccggca gacggatgga actgagcatg ggacccatcc aggccaatca cacaggcact 1200
ggcctgctgc tgacactgca gcctgagcag aaattccaga aagtgaaagg cttcggcgga 1260
gccatgacag atgccgccgc tctgaatatc ctggctctgt ctccaccagc tcagaacctg 1320
ctgctcaaga gctacttcag cgaggaaggc atcggctaca acatcatcag agtgcccatg 1380
gccagctgcg acttcagcat caggacctac acctacgccg acacacccga cgatttccag 1440
ctgcacaact tcagcctgcc tgaagaggac accaagctga agatccctct gatccacaga 1500
gccctgcagc tggcacaaag acccgtgtca ctgctggcct ctccatggac atctcccacc 1560
tggctgaaaa caaatggcgc cgtgaatggc aagggcagcc tgaaaggcca acctggcgac 1620
atctaccacc agacctgggc cagatacttc gtgaagttcc tggacgccta tgccgagcac 1680
aagctgcagt tttgggccgt gacagccgag aacgaacctt ctgctggact gctgagcggc 1740
tacccctttc agtgcctggg ctttacaccc gagcaccagc gggactttat cgcccgtgat 1800
ctgggaccca cactggccaa tagcacccac cataatgtgc ggctgctgat gctggacgac 1860
cagagactgc ttctgcccca ctgggctaaa gtggtgctga cagatcctga ggccgccaaa 1920
tacgtgcacg gaatcgccgt gcactggtat ctggactttc tggcccctgc caaggccaca 1980
ctgggagaga cacacagact gttccccaac accatgctgt tcgccagcga agcctgtgtg 2040
ggcagcaagt tttgggaaca gagcgtgcgg ctcggcagct gggatagagg catgcagtac 2100
agccacagca tcatcaccaa cctgctgtac cacgtcgtcg gctggaccga ctggaatctg 2160
gccctgaatc ctgaaggcgg ccctaactgg gtccgaaact tcgtggacag ccccatcatc 2220
gtggacatca ccaaggacac cttctacaag cagcccatgt tctaccacct gggacacttc 2280
agcaagttca tccccgaggg ctctcagcgc gttggactgg tggcttccca gaagaacgat 2340
ctggacgccg tggctctgat gcaccctgat ggatctgctg tggtggtggt cctgaaccgc 2400
agcagcaaag atgtgcccct gaccatcaag gatcccgccg tgggattcct ggaaacaatc 2460
agccctggct actccatcca cacctacctg tggcgtagac agtgattgtg gccgaaccgc 2520
cgaactcaga ggccggcccc agaaaacccg agcgagtagg gggcggcgcg caggagggag 2580
gagaactggg ggcgcgggag gctggtgggt gtggggggtg gagatgtaga agatgtgacg 2640
ccgcggcccg gcgggtgcca gattagcgga cgcggtgccc gcggttgcaa cgggatcccg 2700
ggcgctgcag cttgggaggc ggctctcccc aggcggcgtc cgcggagaca cccatccgtg 2760
aaccccaggt cccgggccgc cggctcgccg cgcaccaggg gccggcggac agaagagcgg 2820
ccgagcggct cgaggctggg ggaccgcggg cgcggccgcg cgctgccggg cgggaggctg 2880
gggggccggg gccggggccg tgccccggag cgggtcggag gccggggccg gggccggggg 2940
acggcggctc cccgcgcggc tccagcggct cggggatccc ggccgggccc cgcagggacc 3000
atgatgcccc gcggcttcac ctggctgcgc tacctgggca tcttcctggg cgtggccctg 3060
ggcaacgagc ccctggagat gtggcccctg acccagaacg aggagtgcac cgtgaccggc 3120
ttcctgcgcg acaagctgca gtaccgcagc cgcctgcagt acatgaagca ctacttcccc 3180
atcaactaca agatcagcgt gccctacgag ggcgtgttcc gcatcgccaa cgtgacccgc 3240
ctgcagcgcg cccaggtgag cgagcgcgag ctgcgctacc tgtgggtgct ggtgagcctg 3300
agcgccaccg agagcgtgca ggacgtgctg ctggagggcc accccagctg gaagtacctg 3360
caggaggtgg agaccctgct gctgaacgtg cagcagggcc tgaccgacgt ggaggtgagc 3420
cccaaggtgg agagcgtgct gagcctgctg aacgcccccg gccccaacct gaagctggtg 3480
cgccccaagg ccctgctgga caactgcttc cgcgtgatgg agctgctgta ctgcagctgc 3540
tgcaagcaga gcagcgtgct gaactggcag gactgcgagg tgcccagccc ccagagctgc 3600
agccccgagc ccagcctgca gtacgccgcc acccagctgt accccccccc cccctggagc 3660
cccagcagcc ccccccacag caccggcagc gtgcgccccg tgcgcgccca gggcgagggc 3720
ctgctgccct aatgacaatt gttaattaag tttaaaccct cgaggccgca agccgcatcg 3780
ataccgtcga ctagagctcg ctgatcagcc tcgactgtgc cttctagttg ccagccatct 3840
gttgtttgcc cctcccccgt gccttccttg accctggaag gtgccactcc cactgtcctt 3900
tcctaataaa atgaggaaat tgcatcgcat tgtctgagta ggtgtcattc tattctgggg 3960
ggtggggtgg ggcaggacag caagggggag gattgggaag acaatagcag gcatgctggg 4020
gagagatcca cgataacaaa cagctttttt ggggtgaaca tattgactga attccctgca 4080
ggttggccac tccctctctg cgcgctcgct cgctcactga ggccgcccgg gcaaagcccg 4140
ggcgtcgggc gacctttggt cgcccggcct cagtgagcga gcgagcgcgc agagagggag 4200
tggccaactc catcactagg ggttcctgcg gccgctcgta cggtctcgag gaattcctgc 4260
aggataactt gccaacctca ttctaaaatg tatatagaag cccaaaagac aataacaaaa 4320
atattcttgt agaacaaaat gggaaagaat gttccactaa atatcaagat ttagagcaaa 4380
gcatgagatg tgtggggata gacagtgagg ctgataaaat agagtagagc tcagaaacag 4440
acccattgat atatgtaagt gacctatgaa aaaaatatgg cattttacaa tgggaaaatg 4500
atggtctttt tcttttttag aaaaacaggg aaatatattt atatgtaaaa aataaaaggg 4560
aacccatatg tcataccata cacacaaaaa aattccagtg aattataagt ctaaatggag 4620
aaggcaaaac tttaaatctt ttagaaaata atatagaagc atgcagacca gcctggccaa 4680
catgatgaaa ccctctctac taataataaa atcagtagaa ctactcagga ctactttgag 4740
tgggaagtcc ttttctatga agacttcttt ggccaaaatt aggctctaaa tgcaaggaga 4800
tagtgcatca tgcctggctg cacttactga taaatgatgt tatcaccatc tttaaccaaa 4860
tgcacaggaa caagttatgg tactgatgtg ctggattgag aaggagctct acttccttga 4920
caggacacat ttgtatcaac ttaaaaaagc agatttttgc cagcagaact attcattcag 4980
aggtaggaaa cttagaatag atgatgtcac tgattagcat ggcttcccca tctccacagc 5040
tgcttcccac ccaggttgcc cacagttgag tttgtccagt gctcagggct gcccactctc 5100
agtaagaagc cccacaccag cccctctcca aatatgttgg ctgttccttc cattaaagtg 5160
accccacttt agagcagcaa gtggatttct gtttcttaca gttcaggaag gaggagtcag 5220
ctgtgagaac ctggagcctg agatgcttct aagtcccact gctactgggg tcagggaagc 5280
cagactccag catcagcagt caggagcact aagcccttgc caacatcctg tttctcagag 5340
aaactgcttc cattataatg gttgtccttt tttaagctat caagccaaac aaccagtgtc 5400
taccattatt ctcatcacct gaagccaagg gttctagcaa aagtcaagct gtcttgtaat 5460
ggttgatgtg cctccagctt ctgtcttcag tcactccact cttagcctgc tctgaatcaa 5520
ctctgaccac agttccctgg agcccctgcc acctgctgcc cctgccacct tctccatctg 5580
cagtgctgtg cagccttctg cactcttgca gagctaatag gtggagactt gaaggaagag 5640
gaggaaagtt tctcataata gccttgctgc aagctcaaat gggaggtggg cactgtgccc 5700
aggagccttg gagcaaaggc tgtgcccaac ctctgactgc atccaggttt ggtcttgaca 5760
gagataagaa gccctggctt ttggagccaa aatctaggtc agacttaggc aggattctca 5820
aagtttatca gcagaacatg aggcagaaga ccctttctgc tccagcttct tcaggctcaa 5880
ccttcatcag aatagataga aagagaggct gtgagggttc ttaaaacaga agcaaatctg 5940
actcagagaa taaacaacct cctagtaaac tacagcttag acagagcatc tggtggtgag 6000
tgtgctcagt gtcctactca actgtctggt atcagccctc atgaggactt ctcttctttc 6060
cctcatagac ctccatctct gttttcctta gcctgcagaa atctggatgg ctattcacag 6120
aatgcctgtg ctttcagagt tgcatttttt ctctggtatt ctggttcaag catttgaagg 6180
taggaaaggt tctccaagtg caagaaagcc agccctgagc ctcaactgcc tggctagtgt 6240
ggtcagtagg atgcaaaggc tgttgaatgc cacaaggcca aactttaacc tgtgtaccac 6300
aagcctagca gcagaggcag ctctgctcac tggaactctc tgtcttcttt ctcctgagcc 6360
ttttcttttc ctgagttttc tagctctcct caaccttacc tctgccctac ccaggacaaa 6420
cccaagagcc actgtttctg tgatgtcctc tccagcccta attaggcatc atgacttcag 6480
cctgaccttc catgctcaga agcagtgcta atccacttca gatgagctgc tctatgcaac 6540
acaggcagag cctacaaacc tttgcaccag agccctccac atatcagtgt ttgttcatac 6600
tcacttcaac agcaaatgtg actgctgaga ttaagatttt acacaagatg gtctgtaatt 6660
tcacagttag ttttatccca ttaggtatga aagaattagc ataattcccc ttaaacatga 6720
atgaatctta gattttttaa taaatagttt tggaagtaaa gacagagaca tcaggagcac 6780
aaggaatagc ctgagaggac aaacagaaca agaaagagtc tggaaataca caggatgttc 6840
ttggcctcct caaagcaagt gcaagcagat agtaccagca gccccaggct atcagagccc 6900
agtgaagaga agtaccatga aagccacagc tctaaccacc ctgttccaga gtgacagaca 6960
gtccccaaga caagccagcc tgagccagag agagaactgc aagagaaagt ttctaattta 7020
ggttctgtta gattcagaca agtgcaggtc atcctctctc cacagctact cacctctcca 7080
gcctaacaaa gcctgcagtc cacactccaa ccctggtgtc tcacctccta gcctctccca 7140
acatcctgct ctctgaccat cttctgcatc tctcatctca ccatctccca ctgtctacag 7200
cctactcttg caactaccat ctcattttct gacatcctgt ctacatcttc tgccatactc 7260
tgccatctac cataccacct cttaccatct accacaccat cttttatctc catccctctc 7320
agaagcctcc aagctgaatc ctgctttatg tgttcatctc agcccctgca tggaaagctg 7380
accccagagg cagaactatt cccagagagc ttggccaaga aaaacaaaac taccagcctg 7440
gccaggctca ggagtagtaa gctgcagtgt ctgttgtgtt ctagcttcaa cagctgcagg 7500
agttccactc tcaaatgctc cacatttctc acatcctcct gattctggtc actacccatc 7560
ttcaaagaac agaatatctc acatcagcat actgtgaagg actagtcatg ggtgcagctg 7620
ctcagagctg caaagtcatt ctggatggtg gagagcttac aaacatttca tgatgctccc 7680
cccgctctga tggctggagc ccaatcccta cacagactcc tgctgtatgt gttttccttt 7740
cactctgagc cacagccaga gggcaggcat tcagtctcct cttcaggctg gggctggggc 7800
actgagaact cacccaacac cttgctctca ctccttctgc aaaacaagaa agagctttgt 7860
gctgcagtag ccatgaagaa tgaaaggaag gctttaacta aaaaatgtca gagattattt 7920
tcaacccctt actgtggatc accagcaagg aggaaacaca acacagagac attttttccc 7980
ctcaaattat caaaagaatc actgcatttg ttaaagagag caactgaatc aggaagcaga 8040
gttttgaaca tatcagaagt taggaatctg catcagagac aaatgcagtc atggttgttt 8100
gctgcatacc agccctaatc attagaagcc tcatggactt caaacatcat tccctctgac 8160
aagatgctct agcctaactc catgagataa aataaatctg cctttcagag ccaaagaaga 8220
gtccaccagc ttcttctcag tgtgaacaag agctccagtc aggttagtca gtccagtgca 8280
gtagaggaga ccagtctgca tcctctaatt ttcaaaggca agaagatttg tttaccctgg 8340
acaccaggca caagtgaggt cacagagctc ttagatatgc agtcctcatg agtgaggaga 8400
ctaaagcgca tgccatcaag acttcagtgt agagaaaacc tccaaaaaag cctcctcact 8460
acttctggaa tagctcagag gccgaggcgg cctcggcctc tgcataaata aaaaaaatta 8520
gtcagccatg gggcggagaa tgggcggaac tgggcggagt taggggcggg atgggcggag 8580
ttaggggcgg gactatggtt gctgactaat tgagatgcat gctttgcata cttctgcctg 8640
ctggggagcc tggggacttt ccacacctgg ttgctgacta attgagatgc atgctttgca 8700
tacttctgcc tgctggggag cctggggact ttccacaccc taactgacac acattccaca 8760
gctgcattaa tgaatcggcc aacgcgcggg gagaggcggt ttgcgtattg ggcgctcttc 8820
cgcttcctcg ctcactgact cgctgcgctc ggtcgttcgg ctgcggcgag cggtatcagc 8880
tcactcaaag gcggtaatac ggttatccac agaatcaggg gataacgcag gaaagaacat 8940
gtgagcaaaa ggccagcaaa aggccaggaa ccgtaaaaag gccgcgttgc tggcgttttt 9000
ccataggctc cgcccccctg acgagcatca caaaaatcga cgctcaagtc agaggtggcg 9060
aaacccgaca ggactataaa gataccaggc gtttccccct ggaagctccc tcgtgcgctc 9120
tcctgttccg accctgccgc ttaccggata cctgtccgcc tttctccctt cgggaagcgt 9180
ggcgctttct catagctcac gctgtaggta tctcagttcg gtgtaggtcg ttcgctccaa 9240
gctgggctgt gtgcacgaac cccccgttca gcccgaccgc tgcgccttat ccggtaacta 9300
tcgtcttgag tccaacccgg taagacacga cttatcgcca ctggcagcag ccactggtaa 9360
caggattagc agagcgaggt atgtaggcgg tgctacagag ttcttgaagt ggtggcctaa 9420
ctacggctac actagaagaa cagtatttgg tatctgcgct ctgctgaagc cagttacctt 9480
cggaaaaaga gttggtagct cttgatccgg caaacaaacc accgctggta gcggtggttt 9540
ttttgtttgc aagcagcaga ttacgcgcag aaaaaaagga tctcaagaag atcctttgat 9600
cttttctacg gggtctgacg ctcagtggaa cgaaaactca cgttaaggga ttttggtcat 9660
gagattatca aaaaggatct tcacctagat ccttttaaat taaaaatgaa gttttaaatc 9720
aatctaaagt atatatgagt aaacttggtc tgacagttac caatgcttaa tcagtgaggc 9780
acctatctca gcgatctgtc tatttcgttc atccatagtt gcctgactcc tgcaaaccac 9840
gttgtgtctc aaaatctctg atgttacatt gcacaagata aaaatatatc atcatgaaca 9900
ataaaactgt ctgcttacat aaacagtaat acaaggggtg ttatgagcca tattcaacgg 9960
gaaacgtctt gctcgaggcc gcgattaaat tccaacatgg atgctgattt atatgggtat 10020
aaatgggctc gcgataatgt cgggcaatca ggtgcgacaa tctatcgatt gtatgggaag 10080
cccgatgcgc cagagttgtt tctgaaacat ggcaaaggta gcgttgccaa tgatgttaca 10140
gatgagatgg tcagactaaa ctggctgacg gaatttatgc ctcttccgac catcaagcat 10200
tttatccgta ctcctgatga tgcatggtta ctcaccactg cgatccccgg gaaaacagca 10260
ttccaggtat tagaagaata tcctgattca ggtgaaaata ttgttgatgc gctggcagtg 10320
ttcctgcgcc ggttgcattc gattcctgtt tgtaattgtc cttttaacag cgatcgcgta 10380
tttcgtctcg ctcaggcgca atcacgaatg aataacggtt tggttgatgc gagtgatttt 10440
gatgacgagc gtaatggctg gcctgttgaa caagtctgga aagaaatgca taagcttttg 10500
ccattctcac cggattcagt cgtcactcat ggtgatttct cacttgataa ccttattttt 10560
gacgagggga aattaatagg ttgtattgat gttggacgag tcggaatcgc agaccgatac 10620
caggatcttg ccatcctatg gaactgcctc ggtgagtttt ctccttcatt acagaaacgg 10680
ctttttcaaa aatatggtat tgataatcct gatatgaata aattgcagtt tcatttgatg 10740
ctcgatgagt ttttctaagg gcggcctgcc accataccca cgccgaaaca agcgctcatg 10800
agcccgaagt ggcgagcccg atcttcccca tcggtgatgt cggcgatata ggcgccagca 10860
accgcacctg tggcgccggt gatgagggcg cgccaagtcg acgtccggca gtc 10913
<210> 61
<211> 11209
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 61
ttggccactc cctctctgcg cgctcgctcg ctcactgagg ccgggcgacc aaaggtcgcc 60
cgacgcccgg gctttgcccg ggcggcctca gtgagcgagc gagcgcgcag agagggagtg 120
gccaactcca tcactagggg ttcctgctag ctctgggtat ttaagcccga gtgagcacgc 180
agggtctcca ttttgaagcg ggaggttacg cgttcgtcga ctactagtgg gtaccagagc 240
tccctaggtt ctagaaccgg tgacgtctcc catggtgaag cttggatctg agggcggagt 300
tagggcggag ccaatcagcg tgcgccgttc cgaaagttgc cttttatggc tgggcggaga 360
atgggcggtg aacgccgatg attatataag gacgcgccgg gtgtggcaca gctagttccg 420
tcgcagccgg gatttgggtc gcggttcttg tttgtggatc cctgtgatcg tcacttggta 480
agtcactgac tgtctatgcc tgggaaaggg tgggcaggag atggggcagt gcaggaaaag 540
tggcactatg aaccctgcag ccctaggaat gcatctagac aattgtacta accttcttct 600
ctttcctctc ctgacagtcc ggaaagccac catggaattc agcagcccca gcagagagga 660
atgccccaag cctctgagcc gggtgtcaat catggccgga tctctgacag gactgctgct 720
gcttcaggcc gtgtcttggg cttctggcgc tagaccttgc atccccaaga gcttcggcta 780
cagcagcgtc gtgtgcgtgt gcaatgccac ctactgcgac agcttcgacc ctcctacctt 840
tcctgctctg ggcaccttca gcagatacga gagcaccaga tccggcagac ggatggaact 900
gagcatggga cccatccagg ccaatcacac aggcactggc ctgctgctga cactgcagcc 960
tgagcagaaa ttccagaaag tgaaaggctt cggcggagcc atgacagatg ccgccgctct 1020
gaatatcctg gctctgtctc caccagctca gaacctgctg ctcaagagct acttcagcga 1080
ggaaggcatc ggctacaaca tcatcagagt gcccatggcc agctgcgact tcagcatcag 1140
gacctacacc tacgccgaca cacccgacga tttccagctg cacaacttca gcctgcctga 1200
agaggacacc aagctgaaga tccctctgat ccacagagcc ctgcagctgg cacaaagacc 1260
cgtgtcactg ctggcctctc catggacatc tcccacctgg ctgaaaacaa atggcgccgt 1320
gaatggcaag ggcagcctga aaggccaacc tggcgacatc taccaccaga cctgggccag 1380
atacttcgtg aagttcctgg acgcctatgc cgagcacaag ctgcagtttt gggccgtgac 1440
agccgagaac gaaccttctg ctggactgct gagcggctac ccctttcagt gcctgggctt 1500
tacacccgag caccagcggg actttatcgc ccgtgatctg ggacccacac tggccaatag 1560
cacccaccat aatgtgcggc tgctgatgct ggacgaccag agactgcttc tgccccactg 1620
ggctaaagtg gtgctgacag atcctgaggc cgccaaatac gtgcacggaa tcgccgtgca 1680
ctggtatctg gactttctgg cccctgccaa ggccacactg ggagagacac acagactgtt 1740
ccccaacacc atgctgttcg ccagcgaagc ctgtgtgggc agcaagtttt gggaacagag 1800
cgtgcggctc ggcagctggg atagaggcat gcagtacagc cacagcatca tcaccaacct 1860
gctgtaccac gtcgtcggct ggaccgactg gaatctggcc ctgaatcctg aaggcggccc 1920
taactgggtc cgaaacttcg tggacagccc catcatcgtg gacatcacca aggacacctt 1980
ctacaagcag cccatgttct accacctggg acacttcagc aagttcatcc ccgagggctc 2040
tcagcgcgtt ggactggtgg cttcccagaa gaacgatctg gacgccgtgg ctctgatgca 2100
ccctgatgga tctgctgtgg tggtggtcct gaaccgcagc agcaaagatg tgcccctgac 2160
catcaaggat cccgccgtgg gattcctgga aacaatcagc cctggctact ccatccacac 2220
ctacctgtgg cgtagacagt gacaattgtt aattaagttt aaaccctcga ggccgcaagc 2280
cgcatcgata ccgtcgacta gagctcgctg atcagcctcg actgtgcctt ctagttgcca 2340
gccatctgtt gtttgcccct cccccgtgcc ttccttgacc ctggaaggtg ccactcccac 2400
tgtcctttcc taataaaatg aggaaattgc atcgcattgt ctgagtaggt gtcattctat 2460
tctggggggt ggggtggggc aggacagcaa gggggaggat tgggaagaca atagcaggca 2520
tgctggggag agatccacga taacaaacag cttttttggg ggatatcaaa ctgcctgttt 2580
gggcttctca tttcttacct ccccttccct ctcccacctg ctactgggtg catctctgct 2640
ccccccttcc ccagcagatg gttacctttg ggctgttgct ttcttgtcac catctgagtt 2700
ctcagacgct ggaaagccat gttctcggct ctgtgaatga caatgctgac tggagtgctg 2760
cccctctgta aagggctggg tgtggatggt cacaagcccc tcacatgcct cagccaagag 2820
gaagtagtac aggggtcagc ccagaggtcc aggggaaagg agtggaaacc gatttcccca 2880
ccaagggagg ggcctgtacc tcagctgttc ccatagctta cttgccacaa ctgccaagca 2940
agtttcgctg agtttgacac atggatccct gtggatcaac tgccctagga ctccgtttgc 3000
acccatgtga cactgttgac tttgccctga cgaagcaggg ccaacagtcc cctaacttaa 3060
ttacaaaaac taatgactaa gagagaggtg gctagagctg aggcccctga gtcaggctgt 3120
gggtgggatc atctccagta caggaagtga gactttcatt tcctcctttc caagagaggg 3180
ctgagggagc agggttgagc aactggtgca gacagcctag ctggactttg ggtgaggcgg 3240
ttcagccata tcgaattctg ctggggctac tggcaggtaa ggaggaagga ggctgagggg 3300
agggggcccc tgggagggag cctgccctgg gttgctaacc atctcctctc tgccaaaagt 3360
ccggaaagcc accatggagc ccctgcgcct gctgatcctg ctgttcgtga ccgagctgag 3420
cggcgcccac aacaccaccg tgttccaggg cgtggccggc cagagcctgc aggtgagctg 3480
cccctacgac agcatgaagc actggggccg ccgcaaggcc tggtgccgcc agctgggcga 3540
gaagggcccc tgccagcgcg tggtgagcac ccacaacctg tggctgctga gcttcctgcg 3600
ccgctggaac ggcagcaccg ccatcaccga cgacaccctg ggcggcaccc tgaccatcac 3660
cctgcgcaac ctgcagcccc acgacgccgg cctgtaccag tgccagagcc tgcacggcag 3720
cgaggccgac accctgcgca aggtgctggt ggaggtgctg gccgaccccc tggaccaccg 3780
cgacgccggc gacctgtggt tccccggcga gagcgagagc ttcgaggacg cccacgtgga 3840
gcacagcatc agccgcagcc tgctggaggg cgagatcccc ttccccccca ccagcatcct 3900
gctgctgctg gcctgcatct tcctgatcaa gatcctggcc gccagcgccc tgtgggccgc 3960
cgcctggcac ggccagaagc ccggcaccca cccccccagc gagctggact gcggccacga 4020
ccccggctac cagctgcaga ccctgcccgg cctgcgcgac acctgaccca ggggactcag 4080
cggccgctcg agtctagagg gcccgtttaa acccgctgat cagcctcgaa gacatgataa 4140
gatacattga tgagtttgga caaaccacaa caagaatgca gtgaaaaaaa tgctttattt 4200
gtgaaatttg tgatgctatt gctttatttg taaccattat aagctgcaat aaacaagtta 4260
acaacaacaa ttgcattcat tttatgtttc aggttcaggg ggagatgtgg gaggtttttt 4320
aaagcaagta aaacctctac aaatgtggta tgaacatatt gactgaattc cctgcaggtt 4380
ggccactccc tctctgcgcg ctcgctcgct cactgaggcc gcccgggcaa agcccgggcg 4440
tcgggcgacc tttggtcgcc cggcctcagt gagcgagcga gcgcgcagag agggagtggc 4500
caactccatc actaggggtt cctgcggccg ctcgtacggt ctcgaggaat tcctgcagga 4560
taacttgcca acctcattct aaaatgtata tagaagccca aaagacaata acaaaaatat 4620
tcttgtagaa caaaatggga aagaatgttc cactaaatat caagatttag agcaaagcat 4680
gagatgtgtg gggatagaca gtgaggctga taaaatagag tagagctcag aaacagaccc 4740
attgatatat gtaagtgacc tatgaaaaaa atatggcatt ttacaatggg aaaatgatgg 4800
tctttttctt ttttagaaaa acagggaaat atatttatat gtaaaaaata aaagggaacc 4860
catatgtcat accatacaca caaaaaaatt ccagtgaatt ataagtctaa atggagaagg 4920
caaaacttta aatcttttag aaaataatat agaagcatgc agaccagcct ggccaacatg 4980
atgaaaccct ctctactaat aataaaatca gtagaactac tcaggactac tttgagtggg 5040
aagtcctttt ctatgaagac ttctttggcc aaaattaggc tctaaatgca aggagatagt 5100
gcatcatgcc tggctgcact tactgataaa tgatgttatc accatcttta accaaatgca 5160
caggaacaag ttatggtact gatgtgctgg attgagaagg agctctactt ccttgacagg 5220
acacatttgt atcaacttaa aaaagcagat ttttgccagc agaactattc attcagaggt 5280
aggaaactta gaatagatga tgtcactgat tagcatggct tccccatctc cacagctgct 5340
tcccacccag gttgcccaca gttgagtttg tccagtgctc agggctgccc actctcagta 5400
agaagcccca caccagcccc tctccaaata tgttggctgt tccttccatt aaagtgaccc 5460
cactttagag cagcaagtgg atttctgttt cttacagttc aggaaggagg agtcagctgt 5520
gagaacctgg agcctgagat gcttctaagt cccactgcta ctggggtcag ggaagccaga 5580
ctccagcatc agcagtcagg agcactaagc ccttgccaac atcctgtttc tcagagaaac 5640
tgcttccatt ataatggttg tcctttttta agctatcaag ccaaacaacc agtgtctacc 5700
attattctca tcacctgaag ccaagggttc tagcaaaagt caagctgtct tgtaatggtt 5760
gatgtgcctc cagcttctgt cttcagtcac tccactctta gcctgctctg aatcaactct 5820
gaccacagtt ccctggagcc cctgccacct gctgcccctg ccaccttctc catctgcagt 5880
gctgtgcagc cttctgcact cttgcagagc taataggtgg agacttgaag gaagaggagg 5940
aaagtttctc ataatagcct tgctgcaagc tcaaatggga ggtgggcact gtgcccagga 6000
gccttggagc aaaggctgtg cccaacctct gactgcatcc aggtttggtc ttgacagaga 6060
taagaagccc tggcttttgg agccaaaatc taggtcagac ttaggcagga ttctcaaagt 6120
ttatcagcag aacatgaggc agaagaccct ttctgctcca gcttcttcag gctcaacctt 6180
catcagaata gatagaaaga gaggctgtga gggttcttaa aacagaagca aatctgactc 6240
agagaataaa caacctccta gtaaactaca gcttagacag agcatctggt ggtgagtgtg 6300
ctcagtgtcc tactcaactg tctggtatca gccctcatga ggacttctct tctttccctc 6360
atagacctcc atctctgttt tccttagcct gcagaaatct ggatggctat tcacagaatg 6420
cctgtgcttt cagagttgca ttttttctct ggtattctgg ttcaagcatt tgaaggtagg 6480
aaaggttctc caagtgcaag aaagccagcc ctgagcctca actgcctggc tagtgtggtc 6540
agtaggatgc aaaggctgtt gaatgccaca aggccaaact ttaacctgtg taccacaagc 6600
ctagcagcag aggcagctct gctcactgga actctctgtc ttctttctcc tgagcctttt 6660
cttttcctga gttttctagc tctcctcaac cttacctctg ccctacccag gacaaaccca 6720
agagccactg tttctgtgat gtcctctcca gccctaatta ggcatcatga cttcagcctg 6780
accttccatg ctcagaagca gtgctaatcc acttcagatg agctgctcta tgcaacacag 6840
gcagagccta caaacctttg caccagagcc ctccacatat cagtgtttgt tcatactcac 6900
ttcaacagca aatgtgactg ctgagattaa gattttacac aagatggtct gtaatttcac 6960
agttagtttt atcccattag gtatgaaaga attagcataa ttccccttaa acatgaatga 7020
atcttagatt ttttaataaa tagttttgga agtaaagaca gagacatcag gagcacaagg 7080
aatagcctga gaggacaaac agaacaagaa agagtctgga aatacacagg atgttcttgg 7140
cctcctcaaa gcaagtgcaa gcagatagta ccagcagccc caggctatca gagcccagtg 7200
aagagaagta ccatgaaagc cacagctcta accaccctgt tccagagtga cagacagtcc 7260
ccaagacaag ccagcctgag ccagagagag aactgcaaga gaaagtttct aatttaggtt 7320
ctgttagatt cagacaagtg caggtcatcc tctctccaca gctactcacc tctccagcct 7380
aacaaagcct gcagtccaca ctccaaccct ggtgtctcac ctcctagcct ctcccaacat 7440
cctgctctct gaccatcttc tgcatctctc atctcaccat ctcccactgt ctacagccta 7500
ctcttgcaac taccatctca ttttctgaca tcctgtctac atcttctgcc atactctgcc 7560
atctaccata ccacctctta ccatctacca caccatcttt tatctccatc cctctcagaa 7620
gcctccaagc tgaatcctgc tttatgtgtt catctcagcc cctgcatgga aagctgaccc 7680
cagaggcaga actattccca gagagcttgg ccaagaaaaa caaaactacc agcctggcca 7740
ggctcaggag tagtaagctg cagtgtctgt tgtgttctag cttcaacagc tgcaggagtt 7800
ccactctcaa atgctccaca tttctcacat cctcctgatt ctggtcacta cccatcttca 7860
aagaacagaa tatctcacat cagcatactg tgaaggacta gtcatgggtg cagctgctca 7920
gagctgcaaa gtcattctgg atggtggaga gcttacaaac atttcatgat gctccccccg 7980
ctctgatggc tggagcccaa tccctacaca gactcctgct gtatgtgttt tcctttcact 8040
ctgagccaca gccagagggc aggcattcag tctcctcttc aggctggggc tggggcactg 8100
agaactcacc caacaccttg ctctcactcc ttctgcaaaa caagaaagag ctttgtgctg 8160
cagtagccat gaagaatgaa aggaaggctt taactaaaaa atgtcagaga ttattttcaa 8220
ccccttactg tggatcacca gcaaggagga aacacaacac agagacattt tttcccctca 8280
aattatcaaa agaatcactg catttgttaa agagagcaac tgaatcagga agcagagttt 8340
tgaacatatc agaagttagg aatctgcatc agagacaaat gcagtcatgg ttgtttgctg 8400
cataccagcc ctaatcatta gaagcctcat ggacttcaaa catcattccc tctgacaaga 8460
tgctctagcc taactccatg agataaaata aatctgcctt tcagagccaa agaagagtcc 8520
accagcttct tctcagtgtg aacaagagct ccagtcaggt tagtcagtcc agtgcagtag 8580
aggagaccag tctgcatcct ctaattttca aaggcaagaa gatttgttta ccctggacac 8640
caggcacaag tgaggtcaca gagctcttag atatgcagtc ctcatgagtg aggagactaa 8700
agcgcatgcc atcaagactt cagtgtagag aaaacctcca aaaaagcctc ctcactactt 8760
ctggaatagc tcagaggccg aggcggcctc ggcctctgca taaataaaaa aaattagtca 8820
gccatggggc ggagaatggg cggaactggg cggagttagg ggcgggatgg gcggagttag 8880
gggcgggact atggttgctg actaattgag atgcatgctt tgcatacttc tgcctgctgg 8940
ggagcctggg gactttccac acctggttgc tgactaattg agatgcatgc tttgcatact 9000
tctgcctgct ggggagcctg gggactttcc acaccctaac tgacacacat tccacagctg 9060
cattaatgaa tcggccaacg cgcggggaga ggcggtttgc gtattgggcg ctcttccgct 9120
tcctcgctca ctgactcgct gcgctcggtc gttcggctgc ggcgagcggt atcagctcac 9180
tcaaaggcgg taatacggtt atccacagaa tcaggggata acgcaggaaa gaacatgtga 9240
gcaaaaggcc agcaaaaggc caggaaccgt aaaaaggccg cgttgctggc gtttttccat 9300
aggctccgcc cccctgacga gcatcacaaa aatcgacgct caagtcagag gtggcgaaac 9360
ccgacaggac tataaagata ccaggcgttt ccccctggaa gctccctcgt gcgctctcct 9420
gttccgaccc tgccgcttac cggatacctg tccgcctttc tcccttcggg aagcgtggcg 9480
ctttctcata gctcacgctg taggtatctc agttcggtgt aggtcgttcg ctccaagctg 9540
ggctgtgtgc acgaaccccc cgttcagccc gaccgctgcg ccttatccgg taactatcgt 9600
cttgagtcca acccggtaag acacgactta tcgccactgg cagcagccac tggtaacagg 9660
attagcagag cgaggtatgt aggcggtgct acagagttct tgaagtggtg gcctaactac 9720
ggctacacta gaagaacagt atttggtatc tgcgctctgc tgaagccagt taccttcgga 9780
aaaagagttg gtagctcttg atccggcaaa caaaccaccg ctggtagcgg tggttttttt 9840
gtttgcaagc agcagattac gcgcagaaaa aaaggatctc aagaagatcc tttgatcttt 9900
tctacggggt ctgacgctca gtggaacgaa aactcacgtt aagggatttt ggtcatgaga 9960
ttatcaaaaa ggatcttcac ctagatcctt ttaaattaaa aatgaagttt taaatcaatc 10020
taaagtatat atgagtaaac ttggtctgac agttaccaat gcttaatcag tgaggcacct 10080
atctcagcga tctgtctatt tcgttcatcc atagttgcct gactcctgca aaccacgttg 10140
tgtctcaaaa tctctgatgt tacattgcac aagataaaaa tatatcatca tgaacaataa 10200
aactgtctgc ttacataaac agtaatacaa ggggtgttat gagccatatt caacgggaaa 10260
cgtcttgctc gaggccgcga ttaaattcca acatggatgc tgatttatat gggtataaat 10320
gggctcgcga taatgtcggg caatcaggtg cgacaatcta tcgattgtat gggaagcccg 10380
atgcgccaga gttgtttctg aaacatggca aaggtagcgt tgccaatgat gttacagatg 10440
agatggtcag actaaactgg ctgacggaat ttatgcctct tccgaccatc aagcatttta 10500
tccgtactcc tgatgatgca tggttactca ccactgcgat ccccgggaaa acagcattcc 10560
aggtattaga agaatatcct gattcaggtg aaaatattgt tgatgcgctg gcagtgttcc 10620
tgcgccggtt gcattcgatt cctgtttgta attgtccttt taacagcgat cgcgtatttc 10680
gtctcgctca ggcgcaatca cgaatgaata acggtttggt tgatgcgagt gattttgatg 10740
acgagcgtaa tggctggcct gttgaacaag tctggaaaga aatgcataag cttttgccat 10800
tctcaccgga ttcagtcgtc actcatggtg atttctcact tgataacctt atttttgacg 10860
aggggaaatt aataggttgt attgatgttg gacgagtcgg aatcgcagac cgataccagg 10920
atcttgccat cctatggaac tgcctcggtg agttttctcc ttcattacag aaacggcttt 10980
ttcaaaaata tggtattgat aatcctgata tgaataaatt gcagtttcat ttgatgctcg 11040
atgagttttt ctaagggcgg cctgccacca tacccacgcc gaaacaagcg ctcatgagcc 11100
cgaagtggcg agcccgatct tccccatcgg tgatgtcggc gatataggcg ccagcaaccg 11160
cacctgtggc gccggtgatg agggcgcgcc aagtcgacgt ccggcagtc 11209
<210> 62
<211> 11459
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 62
ttggccactc cctctctgcg cgctcgctcg ctcactgagg ccgggcgacc aaaggtcgcc 60
cgacgcccgg gctttgcccg ggcggcctca gtgagcgagc gagcgcgcag agagggagtg 120
gccaactcca tcactagggg ttcctgctag ctctgggtat ttaagcccga gtgagcacgc 180
agggtctcca ttttgaagcg ggaggttacg cgttcgtcga ctactagtgg gtaccagagc 240
tccctaggtt ctagaaccgg tgacgtctcc catggtgaag cttggatctg aattcggtac 300
cctagttatt aatagtaatc aattacgggg tcattagttc atagcccata tatggagttc 360
cgcgttacat aacttacggt aaatggcccg cctggctgac cgcccaacga cccccgccca 420
ttgacgtcaa taatgacgta tgttcccata gtaacgccaa tagggacttt ccattgacgt 480
caatgggtgg actatttacg gtaaactgcc cacttggcag tacatcaagt gtatcatatg 540
ccaagtacgc cccctattga cgtcaatgac ggtaaatggc ccgcctggca ttatgcccag 600
tacatgacct tatgggactt tcctacttgg cagtacatct acgtattagt catcgctatt 660
accatggtcg aggtgagccc cacgttctgc ttcactctcc ccatctcccc cccctcccca 720
cccccaattt tgtatttatt tattttttaa ttattttgtg cagcgatggg ggcggggggg 780
gggggggggc gcgcgccagg cggggcgggg cggggcgagg ggcggggcgg ggcgaggcgg 840
agaggtgcgg cggcagccaa tcagagcggc gcgctccgaa agtttccttt tatggcgagg 900
cggcggcggc ggcggcccta taaaaagcga agcgcgcggc gggcgggagt cgctgcgacg 960
ctgccttcgc cccgtgcccc gctccgccgc cgcctcgcgc cgcccgcccc ggctctgact 1020
gaccgcgtta ctcccacagg tgagcgggcg ggacggccct tctcctccgg gctgtaatta 1080
gcgcttggtt taatgacggc ttgttttctg tggctgcgtg aaagccttga ggggctccgg 1140
gagctagagc ctctgctaac catgttcatg ccttcttctt tttcctacag ctcctgggca 1200
acgtgctggt tattgtgctg tctcatcatt ttggcaaaga attcctcgaa gatccgaagg 1260
gaaagtcttc cacgactgtg ggatccgttc gaagatatca ccggttgagc caccatggaa 1320
ttcagcagcc ccagcagaga ggaatgcccc aagcctctga gccgggtgtc aatcatggcc 1380
ggatctctga caggactgct gctgcttcag gccgtgtctt gggcttctgg cgctagacct 1440
tgcatcccca agagcttcgg ctacagcagc gtcgtgtgcg tgtgcaatgc cacctactgc 1500
gacagcttcg accctcctac ctttcctgct ctgggcacct tcagcagata cgagagcacc 1560
agatccggca gacggatgga actgagcatg ggacccatcc aggccaatca cacaggcact 1620
ggcctgctgc tgacactgca gcctgagcag aaattccaga aagtgaaagg cttcggcgga 1680
gccatgacag atgccgccgc tctgaatatc ctggctctgt ctccaccagc tcagaacctg 1740
ctgctcaaga gctacttcag cgaggaaggc atcggctaca acatcatcag agtgcccatg 1800
gccagctgcg acttcagcat caggacctac acctacgccg acacacccga cgatttccag 1860
ctgcacaact tcagcctgcc tgaagaggac accaagctga agatccctct gatccacaga 1920
gccctgcagc tggcacaaag acccgtgtca ctgctggcct ctccatggac atctcccacc 1980
tggctgaaaa caaatggcgc cgtgaatggc aagggcagcc tgaaaggcca acctggcgac 2040
atctaccacc agacctgggc cagatacttc gtgaagttcc tggacgccta tgccgagcac 2100
aagctgcagt tttgggccgt gacagccgag aacgaacctt ctgctggact gctgagcggc 2160
tacccctttc agtgcctggg ctttacaccc gagcaccagc gggactttat cgcccgtgat 2220
ctgggaccca cactggccaa tagcacccac cataatgtgc ggctgctgat gctggacgac 2280
cagagactgc ttctgcccca ctgggctaaa gtggtgctga cagatcctga ggccgccaaa 2340
tacgtgcacg gaatcgccgt gcactggtat ctggactttc tggcccctgc caaggccaca 2400
ctgggagaga cacacagact gttccccaac accatgctgt tcgccagcga agcctgtgtg 2460
ggcagcaagt tttgggaaca gagcgtgcgg ctcggcagct gggatagagg catgcagtac 2520
agccacagca tcatcaccaa cctgctgtac cacgtcgtcg gctggaccga ctggaatctg 2580
gccctgaatc ctgaaggcgg ccctaactgg gtccgaaact tcgtggacag ccccatcatc 2640
gtggacatca ccaaggacac cttctacaag cagcccatgt tctaccacct gggacacttc 2700
agcaagttca tccccgaggg ctctcagcgc gttggactgg tggcttccca gaagaacgat 2760
ctggacgccg tggctctgat gcaccctgat ggatctgctg tggtggtggt cctgaaccgc 2820
agcagcaaag atgtgcccct gaccatcaag gatcccgccg tgggattcct ggaaacaatc 2880
agccctggct actccatcca cacctacctg tggcgtagac agtgacaatt gttaattaag 2940
tttaaaccct cgaggccgca agccgcatcg ataccgtcga ctagagctcg ctgatcagcc 3000
tcgactgtgc cttctagttg ccagccatct gttgtttgcc cctcccccgt gccttccttg 3060
accctggaag gtgccactcc cactgtcctt tcctaataaa atgaggaaat tgcatcgcat 3120
tgtctgagta ggtgtcattc tattctgggg ggtggggtgg ggcaggacag caagggggag 3180
gattgggaag acaatagcag gcatgctggg gagagatcca cgataacaaa cagctttttt 3240
gggggggcgg agttagggcg gagccaatca gcgtgcgccg ttccgaaagt tgccttttat 3300
ggctgggcgg agaatgggcg gtgaacgccg atgattatat aaggacgcgc cgggtgtggc 3360
acagctagtt ccgtcgcagc cgggatttgg gtcgcggttc ttgtttgtgg atccctgtga 3420
tcgtcacttg gtaagtcact gactgtctat gcctgggaaa gggtgggcag gagatggggc 3480
agtgcaggaa aagtggcact atgaaccctg cagccctagg aatgcatcta gacaattgta 3540
ctaaccttct tctctttcct ctcctgacag tccggaaagc caccatgccc cgcggcttca 3600
cctggctgcg ctacctgggc atcttcctgg gcgtggccct gggcaacgag cccctggaga 3660
tgtggcccct gacccagaac gaggagtgca ccgtgaccgg cttcctgcgc gacaagctgc 3720
agtaccgcag ccgcctgcag tacatgaagc actacttccc catcaactac aagatcagcg 3780
tgccctacga gggcgtgttc cgcatcgcca acgtgacccg cctgcagcgc gcccaggtga 3840
gcgagcgcga gctgcgctac ctgtgggtgc tggtgagcct gagcgccacc gagagcgtgc 3900
aggacgtgct gctggagggc caccccagct ggaagtacct gcaggaggtg gagaccctgc 3960
tgctgaacgt gcagcagggc ctgaccgacg tggaggtgag ccccaaggtg gagagcgtgc 4020
tgagcctgct gaacgccccc ggccccaacc tgaagctggt gcgccccaag gccctgctgg 4080
acaactgctt ccgcgtgatg gagctgctgt actgcagctg ctgcaagcag agcagcgtgc 4140
tgaactggca ggactgcgag gtgcccagcc cccagagctg cagccccgag cccagcctgc 4200
agtacgccgc cacccagctg tacccccccc ccccctggag ccccagcagc cccccccaca 4260
gcaccggcag cgtgcgcccc gtgcgcgccc agggcgaggg cctgctgccc taatgaccca 4320
ggggactcag cggccgctcg agtctagagg gcccgtttaa acccgctgat cagcctcgaa 4380
gacatgataa gatacattga tgagtttgga caaaccacaa caagaatgca gtgaaaaaaa 4440
tgctttattt gtgaaatttg tgatgctatt gctttatttg taaccattat aagctgcaat 4500
aaacaagtta acaacaacaa ttgcattcat tttatgtttc aggttcaggg ggagatgtgg 4560
gaggtttttt aaagcaagta aaacctctac aaatgtggta tgaacatatt gactgaattc 4620
cctgcaggtt ggccactccc tctctgcgcg ctcgctcgct cactgaggcc gcccgggcaa 4680
agcccgggcg tcgggcgacc tttggtcgcc cggcctcagt gagcgagcga gcgcgcagag 4740
agggagtggc caactccatc actaggggtt cctgcggccg ctcgtacggt ctcgaggaat 4800
tcctgcagga taacttgcca acctcattct aaaatgtata tagaagccca aaagacaata 4860
acaaaaatat tcttgtagaa caaaatggga aagaatgttc cactaaatat caagatttag 4920
agcaaagcat gagatgtgtg gggatagaca gtgaggctga taaaatagag tagagctcag 4980
aaacagaccc attgatatat gtaagtgacc tatgaaaaaa atatggcatt ttacaatggg 5040
aaaatgatgg tctttttctt ttttagaaaa acagggaaat atatttatat gtaaaaaata 5100
aaagggaacc catatgtcat accatacaca caaaaaaatt ccagtgaatt ataagtctaa 5160
atggagaagg caaaacttta aatcttttag aaaataatat agaagcatgc agaccagcct 5220
ggccaacatg atgaaaccct ctctactaat aataaaatca gtagaactac tcaggactac 5280
tttgagtggg aagtcctttt ctatgaagac ttctttggcc aaaattaggc tctaaatgca 5340
aggagatagt gcatcatgcc tggctgcact tactgataaa tgatgttatc accatcttta 5400
accaaatgca caggaacaag ttatggtact gatgtgctgg attgagaagg agctctactt 5460
ccttgacagg acacatttgt atcaacttaa aaaagcagat ttttgccagc agaactattc 5520
attcagaggt aggaaactta gaatagatga tgtcactgat tagcatggct tccccatctc 5580
cacagctgct tcccacccag gttgcccaca gttgagtttg tccagtgctc agggctgccc 5640
actctcagta agaagcccca caccagcccc tctccaaata tgttggctgt tccttccatt 5700
aaagtgaccc cactttagag cagcaagtgg atttctgttt cttacagttc aggaaggagg 5760
agtcagctgt gagaacctgg agcctgagat gcttctaagt cccactgcta ctggggtcag 5820
ggaagccaga ctccagcatc agcagtcagg agcactaagc ccttgccaac atcctgtttc 5880
tcagagaaac tgcttccatt ataatggttg tcctttttta agctatcaag ccaaacaacc 5940
agtgtctacc attattctca tcacctgaag ccaagggttc tagcaaaagt caagctgtct 6000
tgtaatggtt gatgtgcctc cagcttctgt cttcagtcac tccactctta gcctgctctg 6060
aatcaactct gaccacagtt ccctggagcc cctgccacct gctgcccctg ccaccttctc 6120
catctgcagt gctgtgcagc cttctgcact cttgcagagc taataggtgg agacttgaag 6180
gaagaggagg aaagtttctc ataatagcct tgctgcaagc tcaaatggga ggtgggcact 6240
gtgcccagga gccttggagc aaaggctgtg cccaacctct gactgcatcc aggtttggtc 6300
ttgacagaga taagaagccc tggcttttgg agccaaaatc taggtcagac ttaggcagga 6360
ttctcaaagt ttatcagcag aacatgaggc agaagaccct ttctgctcca gcttcttcag 6420
gctcaacctt catcagaata gatagaaaga gaggctgtga gggttcttaa aacagaagca 6480
aatctgactc agagaataaa caacctccta gtaaactaca gcttagacag agcatctggt 6540
ggtgagtgtg ctcagtgtcc tactcaactg tctggtatca gccctcatga ggacttctct 6600
tctttccctc atagacctcc atctctgttt tccttagcct gcagaaatct ggatggctat 6660
tcacagaatg cctgtgcttt cagagttgca ttttttctct ggtattctgg ttcaagcatt 6720
tgaaggtagg aaaggttctc caagtgcaag aaagccagcc ctgagcctca actgcctggc 6780
tagtgtggtc agtaggatgc aaaggctgtt gaatgccaca aggccaaact ttaacctgtg 6840
taccacaagc ctagcagcag aggcagctct gctcactgga actctctgtc ttctttctcc 6900
tgagcctttt cttttcctga gttttctagc tctcctcaac cttacctctg ccctacccag 6960
gacaaaccca agagccactg tttctgtgat gtcctctcca gccctaatta ggcatcatga 7020
cttcagcctg accttccatg ctcagaagca gtgctaatcc acttcagatg agctgctcta 7080
tgcaacacag gcagagccta caaacctttg caccagagcc ctccacatat cagtgtttgt 7140
tcatactcac ttcaacagca aatgtgactg ctgagattaa gattttacac aagatggtct 7200
gtaatttcac agttagtttt atcccattag gtatgaaaga attagcataa ttccccttaa 7260
acatgaatga atcttagatt ttttaataaa tagttttgga agtaaagaca gagacatcag 7320
gagcacaagg aatagcctga gaggacaaac agaacaagaa agagtctgga aatacacagg 7380
atgttcttgg cctcctcaaa gcaagtgcaa gcagatagta ccagcagccc caggctatca 7440
gagcccagtg aagagaagta ccatgaaagc cacagctcta accaccctgt tccagagtga 7500
cagacagtcc ccaagacaag ccagcctgag ccagagagag aactgcaaga gaaagtttct 7560
aatttaggtt ctgttagatt cagacaagtg caggtcatcc tctctccaca gctactcacc 7620
tctccagcct aacaaagcct gcagtccaca ctccaaccct ggtgtctcac ctcctagcct 7680
ctcccaacat cctgctctct gaccatcttc tgcatctctc atctcaccat ctcccactgt 7740
ctacagccta ctcttgcaac taccatctca ttttctgaca tcctgtctac atcttctgcc 7800
atactctgcc atctaccata ccacctctta ccatctacca caccatcttt tatctccatc 7860
cctctcagaa gcctccaagc tgaatcctgc tttatgtgtt catctcagcc cctgcatgga 7920
aagctgaccc cagaggcaga actattccca gagagcttgg ccaagaaaaa caaaactacc 7980
agcctggcca ggctcaggag tagtaagctg cagtgtctgt tgtgttctag cttcaacagc 8040
tgcaggagtt ccactctcaa atgctccaca tttctcacat cctcctgatt ctggtcacta 8100
cccatcttca aagaacagaa tatctcacat cagcatactg tgaaggacta gtcatgggtg 8160
cagctgctca gagctgcaaa gtcattctgg atggtggaga gcttacaaac atttcatgat 8220
gctccccccg ctctgatggc tggagcccaa tccctacaca gactcctgct gtatgtgttt 8280
tcctttcact ctgagccaca gccagagggc aggcattcag tctcctcttc aggctggggc 8340
tggggcactg agaactcacc caacaccttg ctctcactcc ttctgcaaaa caagaaagag 8400
ctttgtgctg cagtagccat gaagaatgaa aggaaggctt taactaaaaa atgtcagaga 8460
ttattttcaa ccccttactg tggatcacca gcaaggagga aacacaacac agagacattt 8520
tttcccctca aattatcaaa agaatcactg catttgttaa agagagcaac tgaatcagga 8580
agcagagttt tgaacatatc agaagttagg aatctgcatc agagacaaat gcagtcatgg 8640
ttgtttgctg cataccagcc ctaatcatta gaagcctcat ggacttcaaa catcattccc 8700
tctgacaaga tgctctagcc taactccatg agataaaata aatctgcctt tcagagccaa 8760
agaagagtcc accagcttct tctcagtgtg aacaagagct ccagtcaggt tagtcagtcc 8820
agtgcagtag aggagaccag tctgcatcct ctaattttca aaggcaagaa gatttgttta 8880
ccctggacac caggcacaag tgaggtcaca gagctcttag atatgcagtc ctcatgagtg 8940
aggagactaa agcgcatgcc atcaagactt cagtgtagag aaaacctcca aaaaagcctc 9000
ctcactactt ctggaatagc tcagaggccg aggcggcctc ggcctctgca taaataaaaa 9060
aaattagtca gccatggggc ggagaatggg cggaactggg cggagttagg ggcgggatgg 9120
gcggagttag gggcgggact atggttgctg actaattgag atgcatgctt tgcatacttc 9180
tgcctgctgg ggagcctggg gactttccac acctggttgc tgactaattg agatgcatgc 9240
tttgcatact tctgcctgct ggggagcctg gggactttcc acaccctaac tgacacacat 9300
tccacagctg cattaatgaa tcggccaacg cgcggggaga ggcggtttgc gtattgggcg 9360
ctcttccgct tcctcgctca ctgactcgct gcgctcggtc gttcggctgc ggcgagcggt 9420
atcagctcac tcaaaggcgg taatacggtt atccacagaa tcaggggata acgcaggaaa 9480
gaacatgtga gcaaaaggcc agcaaaaggc caggaaccgt aaaaaggccg cgttgctggc 9540
gtttttccat aggctccgcc cccctgacga gcatcacaaa aatcgacgct caagtcagag 9600
gtggcgaaac ccgacaggac tataaagata ccaggcgttt ccccctggaa gctccctcgt 9660
gcgctctcct gttccgaccc tgccgcttac cggatacctg tccgcctttc tcccttcggg 9720
aagcgtggcg ctttctcata gctcacgctg taggtatctc agttcggtgt aggtcgttcg 9780
ctccaagctg ggctgtgtgc acgaaccccc cgttcagccc gaccgctgcg ccttatccgg 9840
taactatcgt cttgagtcca acccggtaag acacgactta tcgccactgg cagcagccac 9900
tggtaacagg attagcagag cgaggtatgt aggcggtgct acagagttct tgaagtggtg 9960
gcctaactac ggctacacta gaagaacagt atttggtatc tgcgctctgc tgaagccagt 10020
taccttcgga aaaagagttg gtagctcttg atccggcaaa caaaccaccg ctggtagcgg 10080
tggttttttt gtttgcaagc agcagattac gcgcagaaaa aaaggatctc aagaagatcc 10140
tttgatcttt tctacggggt ctgacgctca gtggaacgaa aactcacgtt aagggatttt 10200
ggtcatgaga ttatcaaaaa ggatcttcac ctagatcctt ttaaattaaa aatgaagttt 10260
taaatcaatc taaagtatat atgagtaaac ttggtctgac agttaccaat gcttaatcag 10320
tgaggcacct atctcagcga tctgtctatt tcgttcatcc atagttgcct gactcctgca 10380
aaccacgttg tgtctcaaaa tctctgatgt tacattgcac aagataaaaa tatatcatca 10440
tgaacaataa aactgtctgc ttacataaac agtaatacaa ggggtgttat gagccatatt 10500
caacgggaaa cgtcttgctc gaggccgcga ttaaattcca acatggatgc tgatttatat 10560
gggtataaat gggctcgcga taatgtcggg caatcaggtg cgacaatcta tcgattgtat 10620
gggaagcccg atgcgccaga gttgtttctg aaacatggca aaggtagcgt tgccaatgat 10680
gttacagatg agatggtcag actaaactgg ctgacggaat ttatgcctct tccgaccatc 10740
aagcatttta tccgtactcc tgatgatgca tggttactca ccactgcgat ccccgggaaa 10800
acagcattcc aggtattaga agaatatcct gattcaggtg aaaatattgt tgatgcgctg 10860
gcagtgttcc tgcgccggtt gcattcgatt cctgtttgta attgtccttt taacagcgat 10920
cgcgtatttc gtctcgctca ggcgcaatca cgaatgaata acggtttggt tgatgcgagt 10980
gattttgatg acgagcgtaa tggctggcct gttgaacaag tctggaaaga aatgcataag 11040
cttttgccat tctcaccgga ttcagtcgtc actcatggtg atttctcact tgataacctt 11100
atttttgacg aggggaaatt aataggttgt attgatgttg gacgagtcgg aatcgcagac 11160
cgataccagg atcttgccat cctatggaac tgcctcggtg agttttctcc ttcattacag 11220
aaacggcttt ttcaaaaata tggtattgat aatcctgata tgaataaatt gcagtttcat 11280
ttgatgctcg atgagttttt ctaagggcgg cctgccacca tacccacgcc gaaacaagcg 11340
ctcatgagcc cgaagtggcg agcccgatct tccccatcgg tgatgtcggc gatataggcg 11400
ccagcaaccg cacctgtggc gccggtgatg agggcgcgcc aagtcgacgt ccggcagtc 11459
<210> 63
<211> 274
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 63
Met Gly Lys Ser Leu Ser His Leu Pro Leu His Ser Ser Lys Glu Asp
1 5 10 15
Ala Tyr Asp Gly Val Thr Ser Glu Asn Met Arg Asn Gly Leu Val Asn
20 25 30
Ser Glu Val His Asn Glu Asp Gly Arg Asn Gly Asp Val Ser Gln Phe
35 40 45
Pro Tyr Val Glu Phe Thr Gly Arg Asp Ser Val Thr Cys Pro Thr Cys
50 55 60
Gln Gly Thr Gly Arg Ile Pro Arg Gly Gln Glu Asn Gln Leu Val Ala
65 70 75 80
Leu Ile Pro Tyr Ser Asp Gln Arg Leu Arg Pro Arg Arg Thr Lys Leu
85 90 95
Tyr Val Met Ala Ser Val Phe Val Cys Leu Leu Leu Ser Gly Leu Ala
100 105 110
Val Phe Phe Leu Phe Pro Arg Ser Ile Asp Val Lys Tyr Ile Gly Val
115 120 125
Lys Ser Ala Tyr Val Ser Tyr Asp Val Gln Lys Arg Thr Ile Tyr Leu
130 135 140
Asn Ile Thr Asn Thr Leu Asn Ile Thr Asn Asn Asn Tyr Tyr Ser Val
145 150 155 160
Glu Val Glu Asn Ile Thr Ala Gln Val Gln Phe Ser Lys Thr Val Ile
165 170 175
Gly Lys Ala Arg Leu Asn Asn Ile Thr Ile Ile Gly Pro Leu Asp Met
180 185 190
Lys Gln Ile Asp Tyr Thr Val Pro Thr Val Ile Ala Glu Glu Met Ser
195 200 205
Tyr Met Tyr Asp Phe Cys Thr Leu Ile Ser Ile Lys Val His Asn Ile
210 215 220
Val Leu Met Met Gln Val Thr Val Thr Thr Thr Tyr Phe Gly His Ser
225 230 235 240
Glu Gln Ile Ser Gln Glu Arg Tyr Gln Tyr Val Asp Cys Gly Arg Asn
245 250 255
Thr Thr Tyr Gln Leu Gly Gln Ser Glu Tyr Leu Asn Val Leu Gln Pro
260 265 270
Gln Gln
<210> 64
<211> 825
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 64
atgggcaaga gcctgagcca cctgcccctg cacagcagca aggaggacgc ctacgacggc 60
gtgaccagcg agaacatgcg caacggcctg gtgaacagcg aggtgcacaa cgaggacggc 120
cgcaacggcg acgtgagcca gttcccctac gtggagttca ccggccgcga cagcgtgacc 180
tgccccacct gccagggcac cggccgcatc ccccgcggcc aggagaacca gctggtggcc 240
ctgatcccct acagcgacca gcgcctgcgc ccccgccgca ccaagctgta cgtgatggcc 300
agcgtgttcg tgtgcctgct gctgagcggc ctggccgtgt tcttcctgtt cccccgcagc 360
atcgacgtga agtacatcgg cgtgaagagc gcctacgtga gctacgacgt gcagaagcgc 420
accatctacc tgaacatcac caacaccctg aacatcacca acaacaacta ctacagcgtg 480
gaggtggaga acatcaccgc ccaggtgcag ttcagcaaga ccgtgatcgg caaggcccgc 540
ctgaacaaca tcaccatcat cggccccctg gacatgaagc agatcgacta caccgtgccc 600
accgtgatcg ccgaggagat gagctacatg tacgacttct gcaccctgat cagcatcaag 660
gtgcacaaca tcgtgctgat gatgcaggtg accgtgacca ccacctactt cggccacagc 720
gagcagatca gccaggagcg ctaccagtac gtggactgcg gccgcaacac cacctaccag 780
ctgggccaga gcgagtacct gaacgtgctg cagccccagc agtaa 825
<210> 65
<211> 267
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 65
gtgatatcac aaggtcccag ggctggggtc agaaattctc tcccgaggga atgaagccac 60
aggagccaag agcaggagga ccaaggccct ggcgaaggcc gtggcctcgt tcaagtaaaa 120
gatcctagta cagtgcaggt cccaatgtgt actaggatct tttacttgaa cggggacgcc 180
ggcatccggg ctcaggaccc ccctctctgc cagaggcacc aacaccagag ttcacaaatc 240
agtctcctgc cctttgcatg tagcaaa 267
<210> 66
<211> 267
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 66
tttgctacat gcaaagggca ggagactgat ttgtgaactc tggtgttggt gcctctggca 60
gagagggggg tcctgagccc ggatgccggc gtccccgttc aagtaaaaga tcctagtaca 120
cattgggacc tgcactgtac taggatcttt tacttgaacg aggccacggc cttcgccagg 180
gccttggtcc tcctgctctt ggctcctgtg gcttcattcc ctcgggagag aatttctgac 240
cccagccctg ggaccttgtg atatcac 267
<210> 67
<211> 593
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 67
Met Trp Thr Leu Val Ser Trp Val Ala Leu Thr Ala Gly Leu Val Ala
1 5 10 15
Gly Thr Arg Cys Pro Asp Gly Gln Phe Cys Pro Val Ala Cys Cys Leu
20 25 30
Asp Pro Gly Gly Ala Ser Tyr Ser Cys Cys Arg Pro Leu Leu Asp Lys
35 40 45
Trp Pro Thr Thr Leu Ser Arg His Leu Gly Gly Pro Cys Gln Val Asp
50 55 60
Ala His Cys Ser Ala Gly His Ser Cys Ile Phe Thr Val Ser Gly Thr
65 70 75 80
Ser Ser Cys Cys Pro Phe Pro Glu Ala Val Ala Cys Gly Asp Gly His
85 90 95
His Cys Cys Pro Arg Gly Phe His Cys Ser Ala Asp Gly Arg Ser Cys
100 105 110
Phe Gln Arg Ser Gly Asn Asn Ser Val Gly Ala Ile Gln Cys Pro Asp
115 120 125
Ser Gln Phe Glu Cys Pro Asp Phe Ser Thr Cys Cys Val Met Val Asp
130 135 140
Gly Ser Trp Gly Cys Cys Pro Met Pro Gln Ala Ser Cys Cys Glu Asp
145 150 155 160
Arg Val His Cys Cys Pro His Gly Ala Phe Cys Asp Leu Val His Thr
165 170 175
Arg Cys Ile Thr Pro Thr Gly Thr His Pro Leu Ala Lys Lys Leu Pro
180 185 190
Ala Gln Arg Thr Asn Arg Ala Val Ala Leu Ser Ser Ser Val Met Cys
195 200 205
Pro Asp Ala Arg Ser Arg Cys Pro Asp Gly Ser Thr Cys Cys Glu Leu
210 215 220
Pro Ser Gly Lys Tyr Gly Cys Cys Pro Met Pro Asn Ala Thr Cys Cys
225 230 235 240
Ser Asp His Leu His Cys Cys Pro Gln Asp Thr Val Cys Asp Leu Ile
245 250 255
Gln Ser Lys Cys Leu Ser Lys Glu Asn Ala Thr Thr Asp Leu Leu Thr
260 265 270
Lys Leu Pro Ala His Thr Val Gly Asp Val Lys Cys Asp Met Glu Val
275 280 285
Ser Cys Pro Asp Gly Tyr Thr Cys Cys Arg Leu Gln Ser Gly Ala Trp
290 295 300
Gly Cys Cys Pro Phe Thr Gln Ala Val Cys Cys Glu Asp His Ile His
305 310 315 320
Cys Cys Pro Ala Gly Phe Thr Cys Asp Thr Gln Lys Gly Thr Cys Glu
325 330 335
Gln Gly Pro His Gln Val Pro Trp Met Glu Lys Ala Pro Ala His Leu
340 345 350
Ser Leu Pro Asp Pro Gln Ala Leu Lys Arg Asp Val Pro Cys Asp Asn
355 360 365
Val Ser Ser Cys Pro Ser Ser Asp Thr Cys Cys Gln Leu Thr Ser Gly
370 375 380
Glu Trp Gly Cys Cys Pro Ile Pro Glu Ala Val Cys Cys Ser Asp His
385 390 395 400
Gln His Cys Cys Pro Gln Gly Tyr Thr Cys Val Ala Glu Gly Gln Cys
405 410 415
Gln Arg Gly Ser Glu Ile Val Ala Gly Leu Glu Lys Met Pro Ala Arg
420 425 430
Arg Ala Ser Leu Ser His Pro Arg Asp Ile Gly Cys Asp Gln His Thr
435 440 445
Ser Cys Pro Val Gly Gln Thr Cys Cys Pro Ser Leu Gly Gly Ser Trp
450 455 460
Ala Cys Cys Gln Leu Pro His Ala Val Cys Cys Glu Asp Arg Gln His
465 470 475 480
Cys Cys Pro Ala Gly Tyr Thr Cys Asn Val Lys Ala Arg Ser Cys Glu
485 490 495
Lys Glu Val Val Ser Ala Gln Pro Ala Thr Phe Leu Ala Arg Ser Pro
500 505 510
His Val Gly Val Lys Asp Val Glu Cys Gly Glu Gly His Phe Cys His
515 520 525
Asp Asn Gln Thr Cys Cys Arg Asp Asn Arg Gln Gly Trp Ala Cys Cys
530 535 540
Pro Tyr Arg Gln Gly Val Cys Cys Ala Asp Arg Arg His Cys Cys Pro
545 550 555 560
Ala Gly Phe Arg Cys Ala Ala Arg Gly Thr Lys Cys Leu Arg Arg Glu
565 570 575
Ala Pro Arg Trp Asp Ala Pro Leu Arg Asp Pro Ala Leu Arg Gln Leu
580 585 590
Leu
<210> 68
<211> 1779
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 68
atgtggaccc tggtgagctg ggtggccctg accgccggcc tggtggccgg cacccgctgc 60
cccgacggcc agttctgccc cgtggcctgc tgcctggacc ccggcggcgc cagctacagc 120
tgctgccgcc ccctgctgga caagtggccc accaccctga gccgccacct gggcggcccc 180
tgccaggtgg acgcccactg cagcgccggc cacagctgca tcttcaccgt gagcggcacc 240
agcagctgct gccccttccc cgaggccgtg gcctgcggcg acggccacca ctgctgcccc 300
cgcggcttcc actgcagcgc cgacggccgc agctgcttcc agcgcagcgg caacaacagc 360
gtgggcgcca tccagtgccc cgacagccag ttcgagtgcc ccgacttcag cacctgctgc 420
gtgatggtgg acggcagctg gggctgctgc cccatgcccc aggccagctg ctgcgaggac 480
cgcgtgcact gctgccccca cggcgccttc tgcgacctgg tgcacacccg ctgcatcacc 540
cccaccggca cccaccccct ggccaagaag ctgcccgccc agcgcaccaa ccgcgccgtg 600
gccctgagca gcagcgtgat gtgccccgac gcccgcagcc gctgccccga cggcagcacc 660
tgctgcgagc tgcccagcgg caagtacggc tgctgcccca tgcccaacgc cacctgctgc 720
agcgaccacc tgcactgctg cccccaggac accgtgtgcg acctgatcca gagcaagtgc 780
ctgagcaagg agaacgccac caccgacctg ctgaccaagc tgcccgccca caccgtgggc 840
gacgtgaagt gcgacatgga ggtgagctgc cccgacggct acacctgctg ccgcctgcag 900
agcggcgcct ggggctgctg ccccttcacc caggccgtgt gctgcgagga ccacatccac 960
tgctgccccg ccggcttcac ctgcgacacc cagaagggca cctgcgagca gggcccccac 1020
caggtgccct ggatggagaa ggcccccgcc cacctgagcc tgcccgaccc ccaggccctg 1080
aagcgcgacg tgccctgcga caacgtgagc agctgcccca gcagcgacac ctgctgccag 1140
ctgaccagcg gcgagtgggg ctgctgcccc atccccgagg ccgtgtgctg cagcgaccac 1200
cagcactgct gcccccaggg ctacacctgc gtggccgagg gccagtgcca gcgcggcagc 1260
gagatcgtgg ccggcctgga gaagatgccc gcccgccgcg ccagcctgag ccacccccgc 1320
gacatcggct gcgaccagca caccagctgc cccgtgggcc agacctgctg ccccagcctg 1380
ggcggcagct gggcctgctg ccagctgccc cacgccgtgt gctgcgagga ccgccagcac 1440
tgctgccccg ccggctacac ctgcaacgtg aaggcccgca gctgcgagaa ggaggtggtg 1500
agcgcccagc ccgccacctt cctggcccgc agcccccacg tgggcgtgaa ggacgtggag 1560
tgcggcgagg gccacttctg ccacgacaac cagacctgct gccgcgacaa ccgccagggc 1620
tgggcctgct gcccctaccg ccagggcgtg tgctgcgccg accgccgcca ctgctgcccc 1680
gccggcttcc gctgcgccgc ccgcggcacc aagtgcctgc gccgcgaggc cccccgctgg 1740
gacgcccccc tgcgcgaccc cgccctgcgc cagctgctg 1779
<210> 69
<211> 10871
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 69
ttggccactc cctctctgcg cgctcgctcg ctcactgagg ccgggcgacc aaaggtcgcc 60
cgacgcccgg gctttgcccg ggcggcctca gtgagcgagc gagcgcgcag agagggagtg 120
gccaactcca tcactagggg ttcctgctag ctctgggtat ttaagcccga gtgagcacgc 180
agggtctcca ttttgaagcg ggaggttacg cgttcgtcga ctactagtgg gtaccagagc 240
tccctaggtt ctagaaccgg tgacgtctcc catggtgaag cttggatctg aattcggtac 300
ctagttatta atagtaatca attacggggt cattagttca tagcccatat atggagttcc 360
gcgttacata acttacggta aatggcccgc ctggctgacc gcccaacgac ccccgcccat 420
tgacgtcaat aatgacgtat gttcccatag taacgccaat agggactttc cattgacgtc 480
aatgggtgga gtatttacgg taaactgccc acttggcagt acatcaagtg tatcatatgc 540
caagtacgcc ccctattgac gtcaatgacg gtaaatggcc cgcctggcat tatgcccagt 600
acatgacctt atgggacttt cctacttggc agtacatcta cgtattagtc atcgctatta 660
ccatggtcga ggtgagcccc acgttctgct tcactctccc catctccccc ccctccccac 720
ccccaatttt gtatttattt attttttaat tattttgtgc agcgatgggg gcgggggggg 780
ggggggggcg cgcgccaggc ggggcggggc ggggcgaggg gcggggcggg gcgaggcgga 840
gaggtgcggc ggcagccaat cagagcggcg cgctccgaaa gtttcctttt atggcgaggc 900
ggcggcggcg gcggccctat aaaaagcgaa gcgcgcggcg ggcgggagtc gctgcgacgc 960
tgccttcgcc ccgtgccccg ctccgccgcc gcctcgcgcc gcccgccccg gctctgactg 1020
accgcgttac tcccacaggt gagcgggcgg gacggccctt ctcctccggg ctgtaattag 1080
cgcttggttt aatgacggct tgtttctttt ctgtggctgc gtgaaagcct tgaggggctc 1140
cgggagctag agcctctgct aaccatgttc atgccttctt ctttttccta cagctcctgg 1200
gcaacgtgct ggttattgtg ctgtctcatc attttggcaa agaattcctc gaagatccga 1260
agggaaagtc ttccacgact gtgggatccg ttcgaagata tcaccggttg agccaccatg 1320
tggaccctgg tgagctgggt ggccctgacc gccggcctgg tggccggcac ccgctgcccc 1380
gacggccagt tctgccccgt ggcctgctgc ctggaccccg gcggcgccag ctacagctgc 1440
tgccgccccc tgctggacaa gtggcccacc accctgagcc gccacctggg cggcccctgc 1500
caggtggacg cccactgcag cgccggccac agctgcatct tcaccgtgag cggcaccagc 1560
agctgctgcc ccttccccga ggccgtggcc tgcggcgacg gccaccactg ctgcccccgc 1620
ggcttccact gcagcgccga cggccgcagc tgcttccagc gcagcggcaa caacagcgtg 1680
ggcgccatcc agtgccccga cagccagttc gagtgccccg acttcagcac ctgctgcgtg 1740
atggtggacg gcagctgggg ctgctgcccc atgccccagg ccagctgctg cgaggaccgc 1800
gtgcactgct gcccccacgg cgccttctgc gacctggtgc acacccgctg catcaccccc 1860
accggcaccc accccctggc caagaagctg cccgcccagc gcaccaaccg cgccgtggcc 1920
ctgagcagca gcgtgatgtg ccccgacgcc cgcagccgct gccccgacgg cagcacctgc 1980
tgcgagctgc ccagcggcaa gtacggctgc tgccccatgc ccaacgccac ctgctgcagc 2040
gaccacctgc actgctgccc ccaggacacc gtgtgcgacc tgatccagag caagtgcctg 2100
agcaaggaga acgccaccac cgacctgctg accaagctgc ccgcccacac cgtgggcgac 2160
gtgaagtgcg acatggaggt gagctgcccc gacggctaca cctgctgccg cctgcagagc 2220
ggcgcctggg gctgctgccc cttcacccag gccgtgtgct gcgaggacca catccactgc 2280
tgccccgccg gcttcacctg cgacacccag aagggcacct gcgagcaggg cccccaccag 2340
gtgccctgga tggagaaggc ccccgcccac ctgagcctgc ccgaccccca ggccctgaag 2400
cgcgacgtgc cctgcgacaa cgtgagcagc tgccccagca gcgacacctg ctgccagctg 2460
accagcggcg agtggggctg ctgccccatc cccgaggccg tgtgctgcag cgaccaccag 2520
cactgctgcc cccagggcta cacctgcgtg gccgagggcc agtgccagcg cggcagcgag 2580
atcgtggccg gcctggagaa gatgcccgcc cgccgcgcca gcctgagcca cccccgcgac 2640
atcggctgcg accagcacac cagctgcccc gtgggccaga cctgctgccc cagcctgggc 2700
ggcagctggg cctgctgcca gctgccccac gccgtgtgct gcgaggaccg ccagcactgc 2760
tgccccgccg gctacacctg caacgtgaag gcccgcagct gcgagaagga ggtggtgagc 2820
gcccagcccg ccaccttcct ggcccgcagc ccccacgtgg gcgtgaagga cgtggagtgc 2880
ggcgagggcc acttctgcca cgacaaccag acctgctgcc gcgacaaccg ccagggctgg 2940
gcctgctgcc cctaccgcca gggcgtgtgc tgcgccgacc gccgccactg ctgccccgcc 3000
ggcttccgct gcgccgcccg cggcaccaag tgcctgcgcc gcgaggcccc ccgctgggac 3060
gcccccctgc gcgaccccgc cctgcgccag ctgctgtgac aattgttaat taagtttaaa 3120
ccctcgaggc cgcaagctta tcgataatca acctctggat tacaaaattt gtgaaagatt 3180
gactggtatt cttaactatg ttgctccttt tacgctatgt ggatacgctg ctttaatgcc 3240
tttgtatcat gctattgctt cccgtatggc tttcattttc tcctccttgt ataaatcctg 3300
gttgctgtct ctttatgagg agttgtggcc cgttgtcagg caacgtggcg tggtgtgcac 3360
tgtgtttgct gacgcaaccc ccactggttg gggcattgcc accacctgtc agctcctttc 3420
cgggactttc gctttccccc tccctattgc cacggcggaa ctcatcgccg cctgccttgc 3480
ccgctgctgg acaggggctc ggctgttggg cactgacaat tccgtggtgt tgtcggggaa 3540
atcatcgtcc tttccttggc tgctcgcctg tgttgccacc tggattctgc gcgggacgtc 3600
cttctgctac gtcccttcgg ccctcaatcc agcggacctt ccttcccgcg gcctgctgcc 3660
ggctctgcgg cctcttccgc gtcttcgcct tcgccctcag acgagtcgga tctccctttg 3720
ggccgcctcc ccgcatcgat accgtcgact agagctcgct gatcagcctc gactgtgcct 3780
tctagttgcc agccatctgt tgtttgcccc tcccccgtgc cttccttgac cctggaaggt 3840
gccactccca ctgtcctttc ctaataaaat gaggaaattg catcgcattg tctgagtagg 3900
tgtcattcta ttctgggggg tggggtgggg caggacagca agggggagga ttgggaagac 3960
aatagcaggc atgctgggga gagatccacg ataacaaaca gcttttttgg ggtgaacata 4020
ttgactgaat tccctgcagg ttggccactc cctctctgcg cgctcgctcg ctcactgagg 4080
ccgcccgggc aaagcccggg cgtcgggcga cctttggtcg cccggcctca gtgagcgagc 4140
gagcgcgcag agagggagtg gccaactcca tcactagggg ttcctgcggc cgctcgtacg 4200
gtctcgagga attcctgcag gataacttgc caacctcatt ctaaaatgta tatagaagcc 4260
caaaagacaa taacaaaaat attcttgtag aacaaaatgg gaaagaatgt tccactaaat 4320
atcaagattt agagcaaagc atgagatgtg tggggataga cagtgaggct gataaaatag 4380
agtagagctc agaaacagac ccattgatat atgtaagtga cctatgaaaa aaatatggca 4440
ttttacaatg ggaaaatgat ggtctttttc ttttttagaa aaacagggaa atatatttat 4500
atgtaaaaaa taaaagggaa cccatatgtc ataccataca cacaaaaaaa ttccagtgaa 4560
ttataagtct aaatggagaa ggcaaaactt taaatctttt agaaaataat atagaagcat 4620
gcagaccagc ctggccaaca tgatgaaacc ctctctacta ataataaaat cagtagaact 4680
actcaggact actttgagtg ggaagtcctt ttctatgaag acttctttgg ccaaaattag 4740
gctctaaatg caaggagata gtgcatcatg cctggctgca cttactgata aatgatgtta 4800
tcaccatctt taaccaaatg cacaggaaca agttatggta ctgatgtgct ggattgagaa 4860
ggagctctac ttccttgaca ggacacattt gtatcaactt aaaaaagcag atttttgcca 4920
gcagaactat tcattcagag gtaggaaact tagaatagat gatgtcactg attagcatgg 4980
cttccccatc tccacagctg cttcccaccc aggttgccca cagttgagtt tgtccagtgc 5040
tcagggctgc ccactctcag taagaagccc cacaccagcc cctctccaaa tatgttggct 5100
gttccttcca ttaaagtgac cccactttag agcagcaagt ggatttctgt ttcttacagt 5160
tcaggaagga ggagtcagct gtgagaacct ggagcctgag atgcttctaa gtcccactgc 5220
tactggggtc agggaagcca gactccagca tcagcagtca ggagcactaa gcccttgcca 5280
acatcctgtt tctcagagaa actgcttcca ttataatggt tgtccttttt taagctatca 5340
agccaaacaa ccagtgtcta ccattattct catcacctga agccaagggt tctagcaaaa 5400
gtcaagctgt cttgtaatgg ttgatgtgcc tccagcttct gtcttcagtc actccactct 5460
tagcctgctc tgaatcaact ctgaccacag ttccctggag cccctgccac ctgctgcccc 5520
tgccaccttc tccatctgca gtgctgtgca gccttctgca ctcttgcaga gctaataggt 5580
ggagacttga aggaagagga ggaaagtttc tcataatagc cttgctgcaa gctcaaatgg 5640
gaggtgggca ctgtgcccag gagccttgga gcaaaggctg tgcccaacct ctgactgcat 5700
ccaggtttgg tcttgacaga gataagaagc cctggctttt ggagccaaaa tctaggtcag 5760
acttaggcag gattctcaaa gtttatcagc agaacatgag gcagaagacc ctttctgctc 5820
cagcttcttc aggctcaacc ttcatcagaa tagatagaaa gagaggctgt gagggttctt 5880
aaaacagaag caaatctgac tcagagaata aacaacctcc tagtaaacta cagcttagac 5940
agagcatctg gtggtgagtg tgctcagtgt cctactcaac tgtctggtat cagccctcat 6000
gaggacttct cttctttccc tcatagacct ccatctctgt tttccttagc ctgcagaaat 6060
ctggatggct attcacagaa tgcctgtgct ttcagagttg cattttttct ctggtattct 6120
ggttcaagca tttgaaggta ggaaaggttc tccaagtgca agaaagccag ccctgagcct 6180
caactgcctg gctagtgtgg tcagtaggat gcaaaggctg ttgaatgcca caaggccaaa 6240
ctttaacctg tgtaccacaa gcctagcagc agaggcagct ctgctcactg gaactctctg 6300
tcttctttct cctgagcctt ttcttttcct gagttttcta gctctcctca accttacctc 6360
tgccctaccc aggacaaacc caagagccac tgtttctgtg atgtcctctc cagccctaat 6420
taggcatcat gacttcagcc tgaccttcca tgctcagaag cagtgctaat ccacttcaga 6480
tgagctgctc tatgcaacac aggcagagcc tacaaacctt tgcaccagag ccctccacat 6540
atcagtgttt gttcatactc acttcaacag caaatgtgac tgctgagatt aagattttac 6600
acaagatggt ctgtaatttc acagttagtt ttatcccatt aggtatgaaa gaattagcat 6660
aattcccctt aaacatgaat gaatcttaga ttttttaata aatagttttg gaagtaaaga 6720
cagagacatc aggagcacaa ggaatagcct gagaggacaa acagaacaag aaagagtctg 6780
gaaatacaca ggatgttctt ggcctcctca aagcaagtgc aagcagatag taccagcagc 6840
cccaggctat cagagcccag tgaagagaag taccatgaaa gccacagctc taaccaccct 6900
gttccagagt gacagacagt ccccaagaca agccagcctg agccagagag agaactgcaa 6960
gagaaagttt ctaatttagg ttctgttaga ttcagacaag tgcaggtcat cctctctcca 7020
cagctactca cctctccagc ctaacaaagc ctgcagtcca cactccaacc ctggtgtctc 7080
acctcctagc ctctcccaac atcctgctct ctgaccatct tctgcatctc tcatctcacc 7140
atctcccact gtctacagcc tactcttgca actaccatct cattttctga catcctgtct 7200
acatcttctg ccatactctg ccatctacca taccacctct taccatctac cacaccatct 7260
tttatctcca tccctctcag aagcctccaa gctgaatcct gctttatgtg ttcatctcag 7320
cccctgcatg gaaagctgac cccagaggca gaactattcc cagagagctt ggccaagaaa 7380
aacaaaacta ccagcctggc caggctcagg agtagtaagc tgcagtgtct gttgtgttct 7440
agcttcaaca gctgcaggag ttccactctc aaatgctcca catttctcac atcctcctga 7500
ttctggtcac tacccatctt caaagaacag aatatctcac atcagcatac tgtgaaggac 7560
tagtcatggg tgcagctgct cagagctgca aagtcattct ggatggtgga gagcttacaa 7620
acatttcatg atgctccccc cgctctgatg gctggagccc aatccctaca cagactcctg 7680
ctgtatgtgt tttcctttca ctctgagcca cagccagagg gcaggcattc agtctcctct 7740
tcaggctggg gctggggcac tgagaactca cccaacacct tgctctcact ccttctgcaa 7800
aacaagaaag agctttgtgc tgcagtagcc atgaagaatg aaaggaaggc tttaactaaa 7860
aaatgtcaga gattattttc aaccccttac tgtggatcac cagcaaggag gaaacacaac 7920
acagagacat tttttcccct caaattatca aaagaatcac tgcatttgtt aaagagagca 7980
actgaatcag gaagcagagt tttgaacata tcagaagtta ggaatctgca tcagagacaa 8040
atgcagtcat ggttgtttgc tgcataccag ccctaatcat tagaagcctc atggacttca 8100
aacatcattc cctctgacaa gatgctctag cctaactcca tgagataaaa taaatctgcc 8160
tttcagagcc aaagaagagt ccaccagctt cttctcagtg tgaacaagag ctccagtcag 8220
gttagtcagt ccagtgcagt agaggagacc agtctgcatc ctctaatttt caaaggcaag 8280
aagatttgtt taccctggac accaggcaca agtgaggtca cagagctctt agatatgcag 8340
tcctcatgag tgaggagact aaagcgcatg ccatcaagac ttcagtgtag agaaaacctc 8400
caaaaaagcc tcctcactac ttctggaata gctcagaggc cgaggcggcc tcggcctctg 8460
cataaataaa aaaaattagt cagccatggg gcggagaatg ggcggaactg ggcggagtta 8520
ggggcgggat gggcggagtt aggggcggga ctatggttgc tgactaattg agatgcatgc 8580
tttgcatact tctgcctgct ggggagcctg gggactttcc acacctggtt gctgactaat 8640
tgagatgcat gctttgcata cttctgcctg ctggggagcc tggggacttt ccacacccta 8700
actgacacac attccacagc tgcattaatg aatcggccaa cgcgcgggga gaggcggttt 8760
gcgtattggg cgctcttccg cttcctcgct cactgactcg ctgcgctcgg tcgttcggct 8820
gcggcgagcg gtatcagctc actcaaaggc ggtaatacgg ttatccacag aatcagggga 8880
taacgcagga aagaacatgt gagcaaaagg ccagcaaaag gccaggaacc gtaaaaaggc 8940
cgcgttgctg gcgtttttcc ataggctccg cccccctgac gagcatcaca aaaatcgacg 9000
ctcaagtcag aggtggcgaa acccgacagg actataaaga taccaggcgt ttccccctgg 9060
aagctccctc gtgcgctctc ctgttccgac cctgccgctt accggatacc tgtccgcctt 9120
tctcccttcg ggaagcgtgg cgctttctca tagctcacgc tgtaggtatc tcagttcggt 9180
gtaggtcgtt cgctccaagc tgggctgtgt gcacgaaccc cccgttcagc ccgaccgctg 9240
cgccttatcc ggtaactatc gtcttgagtc caacccggta agacacgact tatcgccact 9300
ggcagcagcc actggtaaca ggattagcag agcgaggtat gtaggcggtg ctacagagtt 9360
cttgaagtgg tggcctaact acggctacac tagaagaaca gtatttggta tctgcgctct 9420
gctgaagcca gttaccttcg gaaaaagagt tggtagctct tgatccggca aacaaaccac 9480
cgctggtagc ggtggttttt ttgtttgcaa gcagcagatt acgcgcagaa aaaaaggatc 9540
tcaagaagat cctttgatct tttctacggg gtctgacgct cagtggaacg aaaactcacg 9600
ttaagggatt ttggtcatga gattatcaaa aaggatcttc acctagatcc ttttaaatta 9660
aaaatgaagt tttaaatcaa tctaaagtat atatgagtaa acttggtctg acagttacca 9720
atgcttaatc agtgaggcac ctatctcagc gatctgtcta tttcgttcat ccatagttgc 9780
ctgactcctg caaaccacgt tgtgtctcaa aatctctgat gttacattgc acaagataaa 9840
aatatatcat catgaacaat aaaactgtct gcttacataa acagtaatac aaggggtgtt 9900
atgagccata ttcaacggga aacgtcttgc tcgaggccgc gattaaattc caacatggat 9960
gctgatttat atgggtataa atgggctcgc gataatgtcg ggcaatcagg tgcgacaatc 10020
tatcgattgt atgggaagcc cgatgcgcca gagttgtttc tgaaacatgg caaaggtagc 10080
gttgccaatg atgttacaga tgagatggtc agactaaact ggctgacgga atttatgcct 10140
cttccgacca tcaagcattt tatccgtact cctgatgatg catggttact caccactgcg 10200
atccccggga aaacagcatt ccaggtatta gaagaatatc ctgattcagg tgaaaatatt 10260
gttgatgcgc tggcagtgtt cctgcgccgg ttgcattcga ttcctgtttg taattgtcct 10320
tttaacagcg atcgcgtatt tcgtctcgct caggcgcaat cacgaatgaa taacggtttg 10380
gttgatgcga gtgattttga tgacgagcgt aatggctggc ctgttgaaca agtctggaaa 10440
gaaatgcata agcttttgcc attctcaccg gattcagtcg tcactcatgg tgatttctca 10500
cttgataacc ttatttttga cgaggggaaa ttaataggtt gtattgatgt tggacgagtc 10560
ggaatcgcag accgatacca ggatcttgcc atcctatgga actgcctcgg tgagttttct 10620
ccttcattac agaaacggct ttttcaaaaa tatggtattg ataatcctga tatgaataaa 10680
ttgcagtttc atttgatgct cgatgagttt ttctaagggc ggcctgccac catacccacg 10740
ccgaaacaag cgctcatgag cccgaagtgg cgagcccgat cttccccatc ggtgatgtcg 10800
gcgatatagg cgccagcaac cgcacctgtg gcgccggtga tgagggcgcg ccaagtcgac 10860
gtccggcagt c 10871
<210> 70
<211> 4151
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 70
gggaggttac gcgttcgtcg actactagtg ggtaccagag cgggcggagt tagggcggag 60
ccaatcagcg tgcgccgttc cgaaagttgc cttttatggc tgggcggaga atgggcggtg 120
aacgccgatg attatataag gacgcgccgg gtgtggcaca gctagttccg tcgcagccgg 180
gatttgggtc gcggttcttg tttgtggatc cctgtgatcg tcacttggta agtcactgac 240
tgtctatgcc tgggaaaggg tgggcaggag atggggcagt gcaggaaaag tggcactatg 300
aaccctgcag ccctaggaat gcatctagac aattgtacta accttcttct ctttcctctc 360
ctgacagtcc ggaaagccac catgtacgcc ctgttcctgc tggccagcct gctgggcgcc 420
gccctggccg gccccgtgct gggcctgaag gagtgcaccc gcggcagcgc cgtgtggtgc 480
cagaacgtga agaccgccag cgactgcggc gccgtgaagc actgcctgca gaccgtgtgg 540
aacaagccca ccgtgaagag cctgccctgc gacatctgca aggacgtggt gaccgccgcc 600
ggcgacatgc tgaaggacaa cgccaccgag gaggagatcc tggtgtacct ggagaagacc 660
tgcgactggc tgcccaagcc caacatgagc gccagctgca aggagatcgt ggacagctac 720
ctgcccgtga tcctggacat catcaagggc gagatgagcc gccccggcga ggtgtgcagc 780
gccctgaacc tgtgcgagag cctgcagaag cacctggccg agctgaacca ccagaagcag 840
ctggagagca acaagatccc cgagctggac atgaccgagg tggtggcccc cttcatggcc 900
aacatccccc tgctgctgta cccccaggac ggcccccgca gcaagcccca gcccaaggac 960
aacggcgacg tgtgccagga ctgcatccag atggtgaccg acatccagac cgccgtgcgc 1020
accaacagca ccttcgtgca ggccctggtg gagcacgtga aggaggagtg cgaccgcctg 1080
ggccccggca tggccgacat ctgcaagaac tacatcagcc agtacagcga gatcgccatc 1140
cagatgatga tgcacatgca gcccaaggag atctgcgccc tggtgggctt ctgcgacgag 1200
gtgaaggaga tgcccatgca gaccctggtg cccgccaagg tggccagcaa gaacgtgatc 1260
cccgccctgg agctggtgga gcccatcaag aagcacgagg tgcccgccaa gagcgacgtg 1320
tactgcgagg tgtgcgagtt cctggtgaag gaggtgacca agctgatcga caacaacaag 1380
accgagaagg agatcctgga cgccttcgac aagatgtgca gcaagctgcc caagagcctg 1440
agcgaggagt gccaggaggt ggtggacacc tacggcagca gcatcctgag catcctgctg 1500
gaggaggtga gccccgagct ggtgtgcagc atgctgcacc tgtgcagcgg cacccgcctg 1560
cccgccctga ccgtgcacgt gacccagccc aaggacggcg gcttctgcga ggtgtgcaag 1620
aagctggtgg gctacctgga ccgcaacctg gagaagaaca gcaccaagca ggagatcctg 1680
gccgccctgg agaagggctg cagcttcctg cccgacccct accagaagca gtgcgaccag 1740
ttcgtggccg agtacgagcc cgtgctgatc gagatcctgg tggaggtgat ggaccccagc 1800
ttcgtgtgcc tgaagatcgg cgcctgcccc agcgcccaca agcccctgct gggcaccgag 1860
aagtgcatct ggggccccag ctactggtgc cagaacaccg agaccgccgc ccagtgcaac 1920
gccgtggagc actgcaagcg ccacgtgtgg aacagaagaa agagaggaag tggagagggc 1980
agaggaagtc ttctgacatg cggagacgtg gaagagaatc ccggccctat gtggaccctg 2040
gtgagctggg tggccctgac cgccggcctg gtggccggca cccgctgccc cgacggccag 2100
ttctgccccg tggcctgctg cctggacccc ggcggcgcca gctacagctg ctgccgcccc 2160
ctgctggaca agtggcccac caccctgagc cgccacctgg gcggcccctg ccaggtggac 2220
gcccactgca gcgccggcca cagctgcatc ttcaccgtga gcggcaccag cagctgctgc 2280
cccttccccg aggccgtggc ctgcggcgac ggccaccact gctgcccccg cggcttccac 2340
tgcagcgccg acggccgcag ctgcttccag cgcagcggca acaacagcgt gggcgccatc 2400
cagtgccccg acagccagtt cgagtgcccc gacttcagca cctgctgcgt gatggtggac 2460
ggcagctggg gctgctgccc catgccccag gccagctgct gcgaggaccg cgtgcactgc 2520
tgcccccacg gcgccttctg cgacctggtg cacacccgct gcatcacccc caccggcacc 2580
caccccctgg ccaagaagct gcccgcccag cgcaccaacc gcgccgtggc cctgagcagc 2640
agcgtgatgt gccccgacgc ccgcagccgc tgccccgacg gcagcacctg ctgcgagctg 2700
cccagcggca agtacggctg ctgccccatg cccaacgcca cctgctgcag cgaccacctg 2760
cactgctgcc cccaggacac cgtgtgcgac ctgatccaga gcaagtgcct gagcaaggag 2820
aacgccacca ccgacctgct gaccaagctg cccgcccaca ccgtgggcga cgtgaagtgc 2880
gacatggagg tgagctgccc cgacggctac acctgctgcc gcctgcagag cggcgcctgg 2940
ggctgctgcc ccttcaccca ggccgtgtgc tgcgaggacc acatccactg ctgccccgcc 3000
ggcttcacct gcgacaccca gaagggcacc tgcgagcagg gcccccacca ggtgccctgg 3060
atggagaagg cccccgccca cctgagcctg cccgaccccc aggccctgaa gcgcgacgtg 3120
ccctgcgaca acgtgagcag ctgccccagc agcgacacct gctgccagct gaccagcggc 3180
gagtggggct gctgccccat ccccgaggcc gtgtgctgca gcgaccacca gcactgctgc 3240
ccccagggct acacctgcgt ggccgagggc cagtgccagc gcggcagcga gatcgtggcc 3300
ggcctggaga agatgcccgc ccgccgcgcc agcctgagcc acccccgcga catcggctgc 3360
gaccagcaca ccagctgccc cgtgggccag acctgctgcc ccagcctggg cggcagctgg 3420
gcctgctgcc agctgcccca cgccgtgtgc tgcgaggacc gccagcactg ctgccccgcc 3480
ggctacacct gcaacgtgaa ggcccgcagc tgcgagaagg aggtggtgag cgcccagccc 3540
gccaccttcc tggcccgcag cccccacgtg ggcgtgaagg acgtggagtg cggcgagggc 3600
cacttctgcc acgacaacca gacctgctgc cgcgacaacc gccagggctg ggcctgctgc 3660
ccctaccgcc agggcgtgtg ctgcgccgac cgccgccact gctgccccgc cggcttccgc 3720
tgcgccgccc gcggcaccaa gtgcctgcgc cgcgaggccc cccgctggga cgcccccctg 3780
cgcgaccccg ccctgcgcca gctgctgtga caattgttaa ttaagtttaa accctcgagg 3840
ccgcaagcaa taaaatatct ttattttcat tacatctgtg tgttggtttt ttgtgtgaca 3900
attgttaatt aagtttaaac gttcgaggcc gcaagcgaga tccacgataa caaacagctt 3960
ttttggggtg aacatattga ctgaattccc tgcaggttgg ccactccctc tctgcgcgct 4020
cgctcgctca ctgaggccgc ccgggcaaag cccgggcgtc gggcgacctt tggtcgcccg 4080
gcctcagtga gcgagcgagc gcgcagagag ggagtggcca actccatcac taggggttcc 4140
tgcggccgct c 4151
<210> 71
<211> 23
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 71
aagagggtgt tctctatgta ggc 23
<210> 72
<211> 22
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 72
gctcctccaa catttgtcac tt 22
<210> 73
<211> 23
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 73
acacagtacc taccgttata gca 23
<210> 74
<211> 23
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 74
tgttgtcaca gtaacttgca tca 23
<210> 75
<211> 19
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 75
ctgggctaca ctgagcacc 19
<210> 76
<211> 21
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 76
aagtggtcgt tgagggcaat g 21
<210> 77
<211> 20
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 77
tattagatct gatggccgcg 20
<210> 78
<211> 20
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 78
tccatcacta ggggttcctg 20
<210> 79
<211> 4013
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 79
ttggccactc cctctctgcg cgctcgctcg ctcactgagg ccgcccgggc aaagcccggg 60
cgtcgggcga cctttggtcg cccggcctca gtgagcgagc gagcgcgcag agagggagtg 120
gccaactcca tcactagggg ttcctgctag ctctgggtat ttaagcccga gtgagcacgc 180
agggtctcca ttttgaagcg ggaggttacg cgttcgtcga ctactagtgg gtaccagagc 240
tccctaggtt ctagaaccgg tgacgtctcc catggtgaag cttggatctg aattcggtac 300
ctagttatta atagtaatca attacggggt cattagttca tagcccatat atggagttcc 360
gcgttacata acttacggta aatggcccgc ctggctgacc gcccaacgac ccccgcccat 420
tgacgtcaat aatgacgtat gttcccatag taacgccaat agggactttc cattgacgtc 480
aatgggtgga gtatttacgg taaactgccc acttggcagt acatcaagtg tatcatatgc 540
caagtacgcc ccctattgac gtcaatgacg gtaaatggcc cgcctggcat tatgcccagt 600
acatgacctt atgggacttt cctacttggc agtacatcta cgtattagtc atcgctatta 660
ccatggtcga ggtgagcccc acgttctgct tcactctccc catctccccc ccctccccac 720
ccccaatttt gtatttattt attttttaat tattttgtgc agcgatgggg gcgggggggg 780
ggggggggcg cgcgccaggc ggggcggggc ggggcgaggg gcggggcggg gcgaggcgga 840
gaggtgcggc ggcagccaat cagagcggcg cgctccgaaa gtttcctttt atggcgaggc 900
ggcggcggcg gcggccctat aaaaagcgaa gcgcgcggcg ggcgggagtc gctgcgcgct 960
gccttcgccc cgtgccccgc tccgccgccg cctcgcgccg cccgccccgg ctctgactga 1020
ccgcgttact cccacaggtg agcgggcggg acggcccttc tcctccgggc tgtaattagc 1080
gcttggttta atgacggctt gtttcttttc tgtggctgcg tgaaagcctt gaggggctcc 1140
gggagctaga gcctctgcta accatgttca tgccttcttc tttttcctac agctcctggg 1200
caacgtgctg gttattgtgc tgtctcatca ttttggcaaa gaattcctcg aagatccgaa 1260
gggaaagtct tccacgactg tgggatccgt tcgaagatat caccggttga gccaccatgg 1320
aattcagcag ccccagcaga gaggaatgcc ccaagcctct gagccgggtg tcaatcatgg 1380
ccggatctct gacaggactg ctgctgcttc aggccgtgtc ttgggcttct ggcgctagac 1440
cttgcatccc caagagcttc ggctacagca gcgtcgtgtg cgtgtgcaat gccacctact 1500
gcgacagctt cgaccctcct acctttcctg ctctgggcac cttcagcaga tacgagagca 1560
ccagatccgg cagacggatg gaactgagca tgggacccat ccaggccaat cacacaggca 1620
ctggcctgct gctgacactg cagcctgagc agaaattcca gaaagtgaaa ggcttcggcg 1680
gagccatgac agatgccgcc gctctgaata tcctggctct gtctccacca gctcagaacc 1740
tgctgctcaa gagctacttc agcgaggaag gcatcggcta caacatcatc agagtgccca 1800
tggccagctg cgacttcagc atcaggacct acacctacgc cgacacaccc gacgatttcc 1860
agctgcacaa cttcagcctg cctgaagagg acaccaagct gaagatccct ctgatccaca 1920
gagccctgca gctggcacaa agacccgtgt cactgctggc ctctccatgg acatctccca 1980
cctggctgaa aacaaatggc gccgtgaatg gcaagggcag cctgaaaggc caacctggcg 2040
acatctacca ccagacctgg gccagatact tcgtgaagtt cctggacgcc tatgccgagc 2100
acaagctgca gttttgggcc gtgacagccg agaacgaacc ttctgctgga ctgctgagcg 2160
gctacccctt tcagtgcctg ggctttacac ccgagcacca gcgggacttt atcgcccgtg 2220
atctgggacc cacactggcc aatagcaccc accataatgt gcggctgctg atgctggacg 2280
accagagact gcttctgccc cactgggcta aagtggtgct gacagatcct gaggccgcca 2340
aatacgtgca cggaatcgcc gtgcactggt atctggactt tctggcccct gccaaggcca 2400
cactgggaga gacacacaga ctgttcccca acaccatgct gttcgccagc gaagcctgtg 2460
tgggcagcaa gttttgggaa cagagcgtgc ggctcggcag ctgggataga ggcatgcagt 2520
acagccacag catcatcacc aacctgctgt accacgtcgt cggctggacc gactggaatc 2580
tggccctgaa tcctgaaggc ggccctaact gggtccgaaa cttcgtggac agccccatca 2640
tcgtggacat caccaaggac accttctaca agcagcccat gttctaccac ctgggacact 2700
tcagcaagtt catccccgag ggctctcagc gcgttggact ggtggcttcc cagaagaacg 2760
atctggacgc cgtggctctg atgcaccctg atggatctgc tgtggtggtg gtcctgaacc 2820
gcagcagcaa agatgtgccc ctgaccatca aggatcccgc cgtgggattc ctggaaacaa 2880
tcagccctgg ctactccatc cacacctacc tgtggcgtag acagtgacaa ttgttaatta 2940
agtttaaacc ctcgaggccg caagcttatc gataatcaac ctctggatta caaaatttgt 3000
gaaagattga ctggtattct taactatgtt gctcctttta cgctatgtgg atacgctgct 3060
ttaatgcctt tgtatcatgc tattgcttcc cgtatggctt tcattttctc ctccttgtat 3120
aaatcctggt tgctgtctct ttatgaggag ttgtggcccg ttgtcaggca acgtggcgtg 3180
gtgtgcactg tgtttgctga cgcaaccccc actggttggg gcattgccac cacctgtcag 3240
ctcctttccg ggactttcgc tttccccctc cctattgcca cggcggaact catcgccgcc 3300
tgccttgccc gctgctggac aggggctcgg ctgttgggca ctgacaattc cgtggtgttg 3360
tcggggaaat catcgtcctt tccttggctg ctcgcctgtg ttgccacctg gattctgcgc 3420
gggacgtcct tctgctacgt cccttcggcc ctcaatccag cggaccttcc ttcccgcggc 3480
ctgctgccgg ctctgcggcc tcttccgcgt cttcgccttc gccctcagac gagtcggatc 3540
tccctttggg ccgcctcccc gcatcgatac cgtcgactag agctcgctga tcagcctcga 3600
ctgtgccttc tagttgccag ccatctgttg tttgcccctc ccccgtgcct tccttgaccc 3660
tggaaggtgc cactcccact gtcctttcct aataaaatga ggaaattgca tcgcattgtc 3720
tgagtaggtg tcattctatt ctggggggtg gggtggggca ggacagcaag ggggaggatt 3780
gggaagacaa tagcaggcat gctggggaga gatccacgat aacaaacagc ttttttgggg 3840
tgaacatatt gactgaattc cctgcaggtt ggccactccc tctctgcgcg ctcgctcgct 3900
cactgaggcc gcccgggcaa agcccgggcg tcgggcgacc tttggtcgcc cggcctcagt 3960
gagcgagcga gcgcgcagag agggagtggc caactccatc actaggggtt cct 4013
<210> 80
<211> 4013
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 80
ttggccactc cctctctgcg cgctcgctcg ctcactgagg ccgcccgggc aaagcccggg 60
cgtcgggcga cctttggtcg cccggcctca gtgagcgagc gagcgcgcag agagggagtg 120
gccaactcca tcactagggg ttcctgctag ctctgggtat ttaagcccga gtgagcacgc 180
agggtctcca ttttgaagcg ggaggttacg cgttcgtcga ctactagtgg gtaccagagc 240
tccctaggtt ctagaaccgg tgacgtctcc catggtgaag cttggatctg aattcggtac 300
ctagttatta atagtaatca attacggggt cattagttca tagcccatat atggagttcc 360
gcgttacata acttacggta aatggcccgc ctggctgacc gcccaacgac ccccgcccat 420
tgacgtcaat aatgacgtat gttcccatag taacgccaat agggactttc cattgacgtc 480
aatgggtgga gtatttacgg taaactgccc acttggcagt acatcaagtg tatcatatgc 540
caagtacgcc ccctattgac gtcaatgacg gtaaatggcc cgcctggcat tatgcccagt 600
acatgacctt atgggacttt cctacttggc agtacatcta cgtattagtc atcgctatta 660
ccatggtcga ggtgagcccc acgttctgct tcactctccc catctccccc ccctccccac 720
ccccaatttt gtatttattt attttttaat tattttgtgc agcgatgggg gcgggggggg 780
ggggggggcg cgcgccaggc ggggcggggc ggggcgaggg gcggggcggg gcgaggcgga 840
gaggtgcggc ggcagccaat cagagcggcg cgctccgaaa gtttcctttt atggcgaggc 900
ggcggcggcg gcggccctat aaaaagcgaa gcgcgcggcg ggcgggagtc gctgcgcgct 960
gccttcgccc cgtgccccgc tccgccgccg cctcgcgccg cccgccccgg ctctgactga 1020
ccgcgttact cccacaggtg agcgggcggg acggcccttc tcctccgggc tgtaattagc 1080
gcttggttta atgacggctt gtttcttttc tgtggctgcg tgaaagcctt gaggggctcc 1140
gggagctaga gcctctgcta accatgttca tgccttcttc tttttcctac agctcctggg 1200
caacgtgctg gttattgtgc tgtctcatca ttttggcaaa gaattcctcg aagatccgaa 1260
gggaaagtct tccacgactg tgggatccgt tcgaagatat caccggttga gccaccatgg 1320
aattcagcag ccccagcaga gaggaatgcc ccaagcctct gagccgggtg tcaatcatgg 1380
ccggatctct gacaggactg ctgctgcttc aggccgtgtc ttgggcttct ggcgctagac 1440
cttgcatccc caagagcttc ggctacagca gcgtcgtgtg cgtgtgcaat gccacctact 1500
gcgacagctt cgaccctcct acctttcctg ctctgggcac cttcagcaga tacgagagca 1560
ccagatccgg cagacggatg gaactgagca tgggacccat ccaggccaat cacacaggca 1620
ctggcctgct gctgacactg cagcctgagc agaaattcca gaaagtgaaa ggcttcggcg 1680
gagccatgac agatgccgcc gctctgaata tcctggctct gtctccacca gctcagaacc 1740
tgctgctcaa gagctacttc agcgaggaag gcatcggcta caacatcatc agagtgccca 1800
tggccagctg cgacttcagc atcaggacct acacctacgc cgacacaccc gacgatttcc 1860
agctgcacaa cttcagcctg cctgaagagg acaccaagct gaagatccct ctgatccaca 1920
gagccctgca gctggcacaa agacccgtgt cactgctggc ctctccatgg acatctccca 1980
cctggctgaa aacaaatggc gccgtgaatg gcaagggcag cctgaaaggc caacctggcg 2040
acatctacca ccagacctgg gccagatact tcgtgaagtt cctggacgcc tatgccgagc 2100
acaagctgca gttttgggcc gtgacagccg agaacgaacc ttctgctgga ctgctgagcg 2160
gctacccctt tcagtgcctg ggctttacac ccgagcacca gcgggacttt atcgcccgtg 2220
atctgggacc cacactggcc aatagcaccc accataatgt gcggctgctg atgctggacg 2280
accagagact gcttctgccc cactgggcta aagtggtgct gacagatcct gaggccgcca 2340
aatacgtgca cggaatcgcc gtgcactggt atctggactt tctggcccct gccaaggcca 2400
cactgggaga gacacacaga ctgttcccca acaccatgct gttcgccagc gaagcctgtg 2460
tgggcagcaa gttttgggaa cagagcgtgc ggctcggcag ctgggataga ggcatgcagt 2520
acagccacag catcatcacc aacctgctgt accacgtcgt cggctggacc gactggaatc 2580
tggccctgaa tcctgaaggc ggccctaact gggtccgaaa cttcgtggac agccccatca 2640
tcgtggacat caccaaggac accttctaca agcagcccat gttctaccac ctgggacact 2700
tcagcaagtt catccccgag ggctctcagc gcgttggact ggtggcttcc cagaagaacg 2760
atctggacgc cgtggctctg atgcaccctg atggatctgc tgtggtggtg gtcctgaacc 2820
gcagcagcaa agatgtgccc ctgaccatca aggatcccgc cgtgggattc ctggaaacaa 2880
tcagccctgg ctactccatc cacacctacc tgtggcgtag acagtgacaa ttgttaatta 2940
agtttaaacc ctcgaggccg caagcttatc gataatcaac ctctggatta caaaatttgt 3000
gaaagattga ctggtattct taactatgtt gctcctttta cgctatgtgg atacgctgct 3060
ttaatgcctt tgtatcatgc tattgcttcc cgtatggctt tcattttctc ctccttgtat 3120
aaatcctggt tgctgtctct ttatgaggag ttgtggcccg ttgtcaggca acgtggcgtg 3180
gtgtgcactg tgtttgctga cgcaaccccc actggttggg gcattgccac cacctgtcag 3240
ctcctttccg ggactttcgc tttccccctc cctattgcca cggcggaact catcgccgcc 3300
tgccttgccc gctgctggac aggggctcgg ctgttgggca ctgacaattc cgtggtgttg 3360
tcggggaaat catcgtcctt tccttggctg ctcgcctgtg ttgccacctg gattctgcgc 3420
gggacgtcct tctgctacgt cccttcggcc ctcaatccag cggaccttcc ttcccgcggc 3480
ctgctgccgg ctctgcggcc tcttccgcgt cttcgccttc gccctcagac gagtcggatc 3540
tccctttggg ccgcctcccc gcatcgatac cgtcgactag agctcgctga tcagcctcga 3600
ctgtgccttc tagttgccag ccatctgttg tttgcccctc ccccgtgcct tccttgaccc 3660
tggaaggtgc cactcccact gtcctttcct aataaaatga ggaaattgca tcgcattgtc 3720
tgagtaggtg tcattctatt ctggggggtg gggtggggca ggacagcaag ggggaggatt 3780
gggaagacaa tagcaggcat gctggggaga gatccacgat aacaaacagc ttttttgggg 3840
tgaacatatt gactgaattc cctgcaggtt ggccactccc tctctgcgcg ctcgctcgct 3900
cactgaggcc gcccgggcaa agcccgggcg tcgggcgacc tttggtcgcc cggcctcagt 3960
gagcgagcga gcgcgcagag agggagtggc caactccatc actaggggtt cct 4013
<210> 81
<211> 4162
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 81
ttggccactc cctctctgcg cgctcgctcg ctcactgagg ccgggcgacc aaaggtcgcc 60
cgacgcccgg gctttgcccg ggcggcctca gtgagcgagc gagcgcgcag agagggagtg 120
gccaactcca tcactagggg ttcctgctag ctctgggtat ttaagcccga gtgagcacgc 180
agggtctcca ttttgaagcg ggaggttacg cgttcgtcga ctactagtgg gtaccagagc 240
tccctaggtt ctagaaccgg tgacgtctcc catggtgaag cttggatctg aattcggtac 300
ctagttatta atagtaatca attacggggt cattagttca tagcccatat atggagttcc 360
gcgttacata acttacggta aatggcccgc ctggctgacc gcccaacgac ccccgcccat 420
tgacgtcaat aatgacgtat gttcccatag taacgccaat agggactttc cattgacgtc 480
aatgggtgga gtatttacgg taaactgccc acttggcagt acatcaagtg tatcatatgc 540
caagtacgcc ccctattgac gtcaatgacg gtaaatggcc cgcctggcat tatgcccagt 600
acatgacctt atgggacttt cctacttggc agtacatcta cgtattagtc atcgctatta 660
ccatggtcga ggtgagcccc acgttctgct tcactctccc catctccccc ccctccccac 720
ccccaatttt gtatttattt attttttaat tattttgtgc agcgatgggg gcgggggggg 780
ggggggggcg cgcgccaggc ggggcggggc ggggcgaggg gcggggcggg gcgaggcgga 840
gaggtgcggc ggcagccaat cagagcggcg cgctccgaaa gtttcctttt atggcgaggc 900
ggcggcggcg gcggccctat aaaaagcgaa gcgcgcggcg ggcgggagtc gctgcgacgc 960
tgccttcgcc ccgtgccccg ctccgccgcc gcctcgcgcc gcccgccccg gctctgactg 1020
accgcgttac tcccacaggt gagcgggcgg gacggccctt ctcctcagcg ctgtaattag 1080
cgcttggttt aatgacggct tgttggaggc ttgctgaagg ctgtatgctg ttgtctttag 1140
aaataagtgg tagtcaagtg aagccacaga tgtgactacc acttatttct aaaaggacac 1200
aaggcctgtt actagcactc acatggaaca aatggccacc gtgggaggat gacaatttct 1260
gtggctgcgt gaaagccttg aggggctccg ggagctagag cctctgctaa ccatgttcat 1320
gccttcttct ttttcctaca gctcctgggc aacgtgctgg ttattgtgct gtctcatcat 1380
tttggcaaag aattcctcga agatccgaag ggaaagtctt ccacgactgt gggatccgtt 1440
cgaagatatc accggttgag ccaccatgga attcagcagc cccagcagag aggaatgccc 1500
caagcctctg agccgggtgt caatcatggc cggatctctg acaggactgc tgctgcttca 1560
ggccgtgtct tgggcttctg gcgctagacc ttgcatcccc aagagcttcg gctacagcag 1620
cgtcgtgtgc gtgtgcaatg ccacctactg cgacagcttc gaccctccta cctttcctgc 1680
tctgggcacc ttcagcagat acgagagcac cagatccggc agacggatgg aactgagcat 1740
gggacccatc caggccaatc acacaggcac tggcctgctg ctgacactgc agcctgagca 1800
gaaattccag aaagtgaaag gcttcggcgg agccatgaca gatgccgccg ctctgaatat 1860
cctggctctg tctccaccag ctcagaacct gctgctcaag agctacttca gcgaggaagg 1920
catcggctac aacatcatca gagtgcccat ggccagctgc gacttcagca tcaggaccta 1980
cacctacgcc gacacacccg acgatttcca gctgcacaac ttcagcctgc ctgaagagga 2040
caccaagctg aagatccctc tgatccacag agccctgcag ctggcacaaa gacccgtgtc 2100
actgctggcc tctccatgga catctcccac ctggctgaaa acaaatggcg ccgtgaatgg 2160
caagggcagc ctgaaaggcc aacctggcga catctaccac cagacctggg ccagatactt 2220
cgtgaagttc ctggacgcct atgccgagca caagctgcag ttttgggccg tgacagccga 2280
gaacgaacct tctgctggac tgctgagcgg ctaccccttt cagtgcctgg gctttacacc 2340
cgagcaccag cgggacttta tcgcccgtga tctgggaccc acactggcca atagcaccca 2400
ccataatgtg cggctgctga tgctggacga ccagagactg cttctgcccc actgggctaa 2460
agtggtgctg acagatcctg aggccgccaa atacgtgcac ggaatcgccg tgcactggta 2520
tctggacttt ctggcccctg ccaaggccac actgggagag acacacagac tgttccccaa 2580
caccatgctg ttcgccagcg aagcctgtgt gggcagcaag ttttgggaac agagcgtgcg 2640
gctcggcagc tgggatagag gcatgcagta cagccacagc atcatcacca acctgctgta 2700
ccacgtcgtc ggctggaccg actggaatct ggccctgaat cctgaaggcg gccctaactg 2760
ggtccgaaac ttcgtggaca gccccatcat cgtggacatc accaaggaca ccttctacaa 2820
gcagcccatg ttctaccacc tgggacactt cagcaagttc atccccgagg gctctcagcg 2880
cgttggactg gtggcttccc agaagaacga tctggacgcc gtggctctga tgcaccctga 2940
tggatctgct gtggtggtgg tcctgaaccg cagcagcaaa gatgtgcccc tgaccatcaa 3000
ggatcccgcc gtgggattcc tggaaacaat cagccctggc tactccatcc acacctacct 3060
gtggcgtaga cagtgacaat tgttaattaa gtttaaaccc tcgaggccgc aagcttatcg 3120
ataatcaacc tctggattac aaaatttgtg aaagattgac tggtattctt aactatgttg 3180
ctccttttac gctatgtgga tacgctgctt taatgccttt gtatcatgct attgcttccc 3240
gtatggcttt cattttctcc tccttgtata aatcctggtt gctgtctctt tatgaggagt 3300
tgtggcccgt tgtcaggcaa cgtggcgtgg tgtgcactgt gtttgctgac gcaaccccca 3360
ctggttgggg cattgccacc acctgtcagc tcctttccgg gactttcgct ttccccctcc 3420
ctattgccac ggcggaactc atcgccgcct gccttgcccg ctgctggaca ggggctcggc 3480
tgttgggcac tgacaattcc gtggtgttgt cggggaaatc atcgtccttt ccttggctgc 3540
tcgcctgtgt tgccacctgg attctgcgcg ggacgtcctt ctgctacgtc ccttcggccc 3600
tcaatccagc ggaccttcct tcccgcggcc tgctgccggc tctgcggcct cttccgcgtc 3660
ttcgccttcg ccctcagacg agtcggatct ccctttgggc cgcctccccg catcgatacc 3720
gtcgactaga gctcgctgat cagcctcgac tgtgccttct agttgccagc catctgttgt 3780
ttgcccctcc cccgtgcctt ccttgaccct ggaaggtgcc actcccactg tcctttccta 3840
ataaaatgag gaaattgcat cgcattgtct gagtaggtgt cattctattc tggggggtgg 3900
ggtggggcag gacagcaagg gggaggattg ggaagacaat agcaggcatg ctggggagag 3960
atccacgata acaaacagct tttttggggc ccacatgtac actgaattcc ctgcaggttg 4020
gccactccct ctctgcgcgc tcgctcgctc actgaggccg cccgggcaaa gcccgggcgt 4080
cgggcgacct ttggtcgccc ggcctcagtg agcgagcgag cgcgcagaga gggagtggcc 4140
aactccatca ctaggggttc ct 4162
<210> 82
<211> 4184
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 82
ttggccactc cctctctgcg cgctcgctcg ctcactgagg ccgggcgacc aaaggtcgcc 60
cgacgcccgg gctttgcccg ggcggcctca gtgagcgagc gagcgcgcag agagggagtg 120
gccaactcca tcactagggg ttcctgctag ctctgggtat ttaagcccga gtgagcacgc 180
agggtctcca ttttgaagcg ggaggttacg cgttcgtcga ctactagtgg gtaccagagc 240
tccctaggtt ctagaaccgg tgacgtctcc catggtgaag cttggatctg aattcggtac 300
ctagttatta atagtaatca attacggggt cattagttca tagcccatat atggagttcc 360
gcgttacata acttacggta aatggcccgc ctggctgacc gcccaacgac ccccgcccat 420
tgacgtcaat aatgacgtat gttcccatag taacgccaat agggactttc cattgacgtc 480
aatgggtgga gtatttacgg taaactgccc acttggcagt acatcaagtg tatcatatgc 540
caagtacgcc ccctattgac gtcaatgacg gtaaatggcc cgcctggcat tatgcccagt 600
acatgacctt atgggacttt cctacttggc agtacatcta cgtattagtc atcgctatta 660
ccatggtcga ggtgagcccc acgttctgct tcactctccc catctccccc ccctccccac 720
ccccaatttt gtatttattt attttttaat tattttgtgc agcgatgggg gcgggggggg 780
ggggggggcg cgcgccaggc ggggcggggc ggggcgaggg gcggggcggg gcgaggcgga 840
gaggtgcggc ggcagccaat cagagcggcg cgctccgaaa gtttcctttt atggcgaggc 900
ggcggcggcg gcggccctat aaaaagcgaa gcgcgcggcg ggcgggagtc gctgcgcgct 960
gccttcgccc cgtgccccgc tccgccgccg cctcgcgccg cccgccccgg ctctgactga 1020
ccgcgttact cccacaggtg agcgggcggg acggcccttc tcctccgggc tgtaattagc 1080
gcttggttta atgacggctt gtttcttttc tgtggctgcg tgaaagcctt gaggggctcc 1140
gggagctaga gcctctgcta accatgttca tgccttcttc tttttcctac agctcctggg 1200
caacgtgctg gttattgtgc tgtctcatca ttttggcaaa gaattcctcg aagatccgaa 1260
gggaaagtct tccacgactg tgggatccgt tcgaagatat caccggttga gccaccatgt 1320
ggaccctggt gagctgggtg gccctgaccg ccggcctggt ggccggcacc cgctgccccg 1380
acggccagtt ctgccccgtg gcctgctgcc tggaccccgg cggcgccagc tacagctgct 1440
gccgccccct gctggacaag tggcccacca ccctgagccg ccacctgggc ggcccctgcc 1500
aggtggacgc ccactgcagc gccggccaca gctgcatctt caccgtgagc ggcaccagca 1560
gctgctgccc cttccccgag gccgtggcct gcggcgacgg ccaccactgc tgcccccgcg 1620
gcttccactg cagcgccgac ggccgcagct gcttccagcg cagcggcaac aacagcgtgg 1680
gcgccatcca gtgccccgac agccagttcg agtgccccga cttcagcacc tgctgcgtga 1740
tggtggacgg cagctggggc tgctgcccca tgccccaggc cagctgctgc gaggaccgcg 1800
tgcactgctg cccccacggc gccttctgcg acctggtgca cacccgctgc atcaccccca 1860
ccggcaccca ccccctggcc aagaagctgc ccgcccagcg caccaaccgc gccgtggccc 1920
tgagcagcag cgtgatgtgc cccgacgccc gcagccgctg ccccgacggc agcacctgct 1980
gcgagctgcc cagcggcaag tacggctgct gccccatgcc caacgccacc tgctgcagcg 2040
accacctgca ctgctgcccc caggacaccg tgtgcgacct gatccagagc aagtgcctga 2100
gcaaggagaa cgccaccacc gacctgctga ccaagctgcc cgcccacacc gtgggcgacg 2160
tgaagtgcga catggaggtg agctgccccg acggctacac ctgctgccgc ctgcagagcg 2220
gcgcctgggg ctgctgcccc ttcacccagg ccgtgtgctg cgaggaccac atccactgct 2280
gccccgccgg cttcacctgc gacacccaga agggcacctg cgagcagggc ccccaccagg 2340
tgccctggat ggagaaggcc cccgcccacc tgagcctgcc cgacccccag gccctgaagc 2400
gcgacgtgcc ctgcgacaac gtgagcagct gccccagcag cgacacctgc tgccagctga 2460
ccagcggcga gtggggctgc tgccccatcc ccgaggccgt gtgctgcagc gaccaccagc 2520
actgctgccc ccagggctac acctgcgtgg ccgagggcca gtgccagcgc ggcagcgaga 2580
tcgtggccgg cctggagaag atgcccgccc gccgcgccag cctgagccac ccccgcgaca 2640
tcggctgcga ccagcacacc agctgccccg tgggccagac ctgctgcccc agcctgggcg 2700
gcagctgggc ctgctgccag ctgccccacg ccgtgtgctg cgaggaccgc cagcactgct 2760
gccccgccgg ctacacctgc aacgtgaagg cccgcagctg cgagaaggag gtggtgagcg 2820
cccagcccgc caccttcctg gcccgcagcc cccacgtggg cgtgaaggac gtggagtgcg 2880
gcgagggcca cttctgccac gacaaccaga cctgctgccg cgacaaccgc cagggctggg 2940
cctgctgccc ctaccgccag ggcgtgtgct gcgccgaccg ccgccactgc tgccccgccg 3000
gcttccgctg cgccgcccgc ggcaccaagt gcctgcgccg cgaggccccc cgctgggacg 3060
cccccctgcg cgaccccgcc ctgcgccagc tgctgtgaca attgttaatt aagtttaaac 3120
cctcgaggcc gcaagcttat cgataatcaa cctctggatt acaaaatttg tgaaagattg 3180
actggtattc ttaactatgt tgctcctttt acgctatgtg gatacgctgc tttaatgcct 3240
ttgtatcatg ctattgcttc ccgtatggct ttcattttct cctccttgta taaatcctgg 3300
ttgctgtctc tttatgagga gttgtggccc gttgtcaggc aacgtggcgt ggtgtgcact 3360
gtgtttgctg acgcaacccc cactggttgg ggcattgcca ccacctgtca gctcctttcc 3420
gggactttcg ctttccccct ccctattgcc acggcggaac tcatcgccgc ctgccttgcc 3480
cgctgctgga caggggctcg gctgttgggc actgacaatt ccgtggtgtt gtcggggaaa 3540
tcatcgtcct ttccttggct gctcgcctgt gttgccacct ggattctgcg cgggacgtcc 3600
ttctgctacg tcccttcggc cctcaatcca gcggaccttc cttcccgcgg cctgctgccg 3660
gctctgcggc ctcttccgcg tcttcgcctt cgccctcaga cgagtcggat ctccctttgg 3720
gccgcctccc cgcatcgata ccgtcgacta gagctcgctg atcagcctcg actgtgcctt 3780
ctagttgcca gccatctgtt gtttgcccct cccccgtgcc ttccttgacc ctggaaggtg 3840
ccactcccac tgtcctttcc taataaaatg aggaaattgc atcgcattgt ctgagtaggt 3900
gtcattctat tctggggggt ggggtggggc aggacagcaa gggggaggat tgggaagaca 3960
atagcaggca tgctggggag agatccacga taacaaacag cttttttggg gcccacatgt 4020
acactgaatt ccctgcaggt tggccactcc ctctctgcgc gctcgctcgc tcactgaggc 4080
cgcccgggca aagcccgggc gtcgggcgac ctttggtcgc ccggcctcag tgagcgagcg 4140
agcgcgcaga gagggagtgg ccaactccat cactaggggt tcct 4184
<210> 83
<211> 4184
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 83
ttggccactc cctctctgcg cgctcgctcg ctcactgagg ccgggcgacc aaaggtcgcc 60
cgacgcccgg gctttgcccg ggcggcctca gtgagcgagc gagcgcgcag agagggagtg 120
gccaactcca tcactagggg ttcctgctag ctctgggtat ttaagcccga gtgagcacgc 180
agggtctcca ttttgaagcg ggaggttacg cgttcgtcga ctactagtgg gtaccagagc 240
tccctaggtt ctagaaccgg tgacgtctcc catggtgaag cttggatctg aattcggtac 300
ctagttatta atagtaatca attacggggt cattagttca tagcccatat atggagttcc 360
gcgttacata acttacggta aatggcccgc ctggctgacc gcccaacgac ccccgcccat 420
tgacgtcaat aatgacgtat gttcccatag taacgccaat agggactttc cattgacgtc 480
aatgggtgga gtatttacgg taaactgccc acttggcagt acatcaagtg tatcatatgc 540
caagtacgcc ccctattgac gtcaatgacg gtaaatggcc cgcctggcat tatgcccagt 600
acatgacctt atgggacttt cctacttggc agtacatcta cgtattagtc atcgctatta 660
ccatggtcga ggtgagcccc acgttctgct tcactctccc catctccccc ccctccccac 720
ccccaatttt gtatttattt attttttaat tattttgtgc agcgatgggg gcgggggggg 780
ggggggggcg cgcgccaggc ggggcggggc ggggcgaggg gcggggcggg gcgaggcgga 840
gaggtgcggc ggcagccaat cagagcggcg cgctccgaaa gtttcctttt atggcgaggc 900
ggcggcggcg gcggccctat aaaaagcgaa gcgcgcggcg ggcgggagtc gctgcgcgct 960
gccttcgccc cgtgccccgc tccgccgccg cctcgcgccg cccgccccgg ctctgactga 1020
ccgcgttact cccacaggtg agcgggcggg acggcccttc tcctccgggc tgtaattagc 1080
gcttggttta atgacggctt gtttcttttc tgtggctgcg tgaaagcctt gaggggctcc 1140
gggagctaga gcctctgcta accatgttca tgccttcttc tttttcctac agctcctggg 1200
caacgtgctg gttattgtgc tgtctcatca ttttggcaaa gaattcctcg aagatccgaa 1260
gggaaagtct tccacgactg tgggatccgt tcgaagatat caccggttga gccaccatgt 1320
ggaccctggt gagctgggtg gccctgaccg ccggcctggt ggccggcacc cgctgccccg 1380
acggccagtt ctgccccgtg gcctgctgcc tggaccccgg cggcgccagc tacagctgct 1440
gccgccccct gctggacaag tggcccacca ccctgagccg ccacctgggc ggcccctgcc 1500
aggtggacgc ccactgcagc gccggccaca gctgcatctt caccgtgagc ggcaccagca 1560
gctgctgccc cttccccgag gccgtggcct gcggcgacgg ccaccactgc tgcccccgcg 1620
gcttccactg cagcgccgac ggccgcagct gcttccagcg cagcggcaac aacagcgtgg 1680
gcgccatcca gtgccccgac agccagttcg agtgccccga cttcagcacc tgctgcgtga 1740
tggtggacgg cagctggggc tgctgcccca tgccccaggc cagctgctgc gaggaccgcg 1800
tgcactgctg cccccacggc gccttctgcg acctggtgca cacccgctgc atcaccccca 1860
ccggcaccca ccccctggcc aagaagctgc ccgcccagcg caccaaccgc gccgtggccc 1920
tgagcagcag cgtgatgtgc cccgacgccc gcagccgctg ccccgacggc agcacctgct 1980
gcgagctgcc cagcggcaag tacggctgct gccccatgcc caacgccacc tgctgcagcg 2040
accacctgca ctgctgcccc caggacaccg tgtgcgacct gatccagagc aagtgcctga 2100
gcaaggagaa cgccaccacc gacctgctga ccaagctgcc cgcccacacc gtgggcgacg 2160
tgaagtgcga catggaggtg agctgccccg acggctacac ctgctgccgc ctgcagagcg 2220
gcgcctgggg ctgctgcccc ttcacccagg ccgtgtgctg cgaggaccac atccactgct 2280
gccccgccgg cttcacctgc gacacccaga agggcacctg cgagcagggc ccccaccagg 2340
tgccctggat ggagaaggcc cccgcccacc tgagcctgcc cgacccccag gccctgaagc 2400
gcgacgtgcc ctgcgacaac gtgagcagct gccccagcag cgacacctgc tgccagctga 2460
ccagcggcga gtggggctgc tgccccatcc ccgaggccgt gtgctgcagc gaccaccagc 2520
actgctgccc ccagggctac acctgcgtgg ccgagggcca gtgccagcgc ggcagcgaga 2580
tcgtggccgg cctggagaag atgcccgccc gccgcgccag cctgagccac ccccgcgaca 2640
tcggctgcga ccagcacacc agctgccccg tgggccagac ctgctgcccc agcctgggcg 2700
gcagctgggc ctgctgccag ctgccccacg ccgtgtgctg cgaggaccgc cagcactgct 2760
gccccgccgg ctacacctgc aacgtgaagg cccgcagctg cgagaaggag gtggtgagcg 2820
cccagcccgc caccttcctg gcccgcagcc cccacgtggg cgtgaaggac gtggagtgcg 2880
gcgagggcca cttctgccac gacaaccaga cctgctgccg cgacaaccgc cagggctggg 2940
cctgctgccc ctaccgccag ggcgtgtgct gcgccgaccg ccgccactgc tgccccgccg 3000
gcttccgctg cgccgcccgc ggcaccaagt gcctgcgccg cgaggccccc cgctgggacg 3060
cccccctgcg cgaccccgcc ctgcgccagc tgctgtgaca attgttaatt aagtttaaac 3120
cctcgaggcc gcaagcttat cgataatcaa cctctggatt acaaaatttg tgaaagattg 3180
actggtattc ttaactatgt tgctcctttt acgctatgtg gatacgctgc tttaatgcct 3240
ttgtatcatg ctattgcttc ccgtatggct ttcattttct cctccttgta taaatcctgg 3300
ttgctgtctc tttatgagga gttgtggccc gttgtcaggc aacgtggcgt ggtgtgcact 3360
gtgtttgctg acgcaacccc cactggttgg ggcattgcca ccacctgtca gctcctttcc 3420
gggactttcg ctttccccct ccctattgcc acggcggaac tcatcgccgc ctgccttgcc 3480
cgctgctgga caggggctcg gctgttgggc actgacaatt ccgtggtgtt gtcggggaaa 3540
tcatcgtcct ttccttggct gctcgcctgt gttgccacct ggattctgcg cgggacgtcc 3600
ttctgctacg tcccttcggc cctcaatcca gcggaccttc cttcccgcgg cctgctgccg 3660
gctctgcggc ctcttccgcg tcttcgcctt cgccctcaga cgagtcggat ctccctttgg 3720
gccgcctccc cgcatcgata ccgtcgacta gagctcgctg atcagcctcg actgtgcctt 3780
ctagttgcca gccatctgtt gtttgcccct cccccgtgcc ttccttgacc ctggaaggtg 3840
ccactcccac tgtcctttcc taataaaatg aggaaattgc atcgcattgt ctgagtaggt 3900
gtcattctat tctggggggt ggggtggggc aggacagcaa gggggaggat tgggaagaca 3960
atagcaggca tgctggggag agatccacga taacaaacag cttttttggg gcccacatgt 4020
acactgaatt ccctgcaggt tggccactcc ctctctgcgc gctcgctcgc tcactgaggc 4080
cgcccgggca aagcccgggc gtcgggcgac ctttggtcgc ccggcctcag tgagcgagcg 4140
agcgcgcaga gagggagtgg ccaactccat cactaggggt tcct 4184
<210> 84
<211> 4578
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 84
ttggccactc cctctctgcg cgctcgctcg ctcactgagg ccgggcgacc aaaggtcgcc 60
cgacgcccgg gctttgcccg ggcggcctca gtgagcgagc gagcgcgcag agagggagtg 120
gccaactcca tcactagggg ttcctgctag ctctgggtat ttaagcccga gtgagcacgc 180
agggtctcca ttttgaagcg ggaggttacg cgttcgtcga ctactagtgg gtaccagagc 240
aaaaaaattg tcatcctccc acggtggcca tttgttccat gtgagtgcta gtaacaggcc 300
ttgtgtcctt tgtagactat ttgcacactg catctgtggc ttcactcagt gtgcaaatag 360
tctacaagac aacagcatac agccttcagc aagcctccag tggtctcata cagaacttat 420
aagattccca aatccaaaga catttcacgt ttatggtgat ttcccagaac acatagcgac 480
atgcaaatat tgcagggcgc cactcccctg tccctcacag ccatcttcct gccagggcgc 540
acgcgcgctg ggtgttcccg cctagtgaca ctgggcccgc gattccttgg agcgggttga 600
tgacgtcagc gtttcccatg gtgaagcttg gatctgatcc ctaggttcta gaaccggtga 660
cgtctcccat ggtgaagctt ggatctgaat tcggtaccta gttattaata gtaatcaatt 720
acggggtcat tagttcatag cccatatatg gagttccgcg ttacataact tacggtaaat 780
ggcccgcctg gctgaccgcc caacgacccc cgcccattga cgtcaataat gacgtatgtt 840
cccatagtaa cgccaatagg gactttccat tgacgtcaat gggtggagta tttacggtaa 900
actgcccact tggcagtaca tcaagtgtat catatgccaa gtacgccccc tattgacgtc 960
aatgacggta aatggcccgc ctggcattat gcccagtaca tgaccttatg ggactttcct 1020
acttggcagt acatctacgt attagtcatc gctattacca tggtcgaggt gagccccacg 1080
ttctgcttca ctctccccat ctcccccccc tccccacccc caattttgta tttatttatt 1140
ttttaattat tttgtgcagc gatgggggcg gggggggggg gggggcgcgc gccaggcggg 1200
gcggggcggg gcgaggggcg gggcggggcg aggcggagag gtgcggcggc agccaatcag 1260
agcggcgcgc tccgaaagtt tccttttatg gcgaggcggc ggcggcggcg gccctataaa 1320
aagcgaagcg cgcggcgggc gggagtcgct gcgacgctgc cttcgccccg tgccccgctc 1380
cgccgccgcc tcgcgccgcc cgccccggct ctgactgacc gcgttactcc cacaggtgag 1440
cgggcgggac ggcccttctc ctccgggctg taattagcgc ttggtttaat gacggcttgt 1500
tttctgtggc tgcgtgaaag ccttgagggg ctccgggagc tagagcctct gctaaccatg 1560
ttcatgcctt cttctttttc ctacagctcc tgggcaacgt gctggttatt gtgctgtctc 1620
atcattttgg caaagaattc ctcgaagatc cgaagggaaa gtcttccacg actgtgggat 1680
ccgttcgaag atatcaccgg ttgagccacc atgtggaccc tggtgagctg ggtggccctg 1740
accgccggcc tggtggccgg cacccgctgc cccgacggcc agttctgccc cgtggcctgc 1800
tgcctggacc ccggcggcgc cagctacagc tgctgccgcc ccctgctgga caagtggccc 1860
accaccctga gccgccacct gggcggcccc tgccaggtgg acgcccactg cagcgccggc 1920
cacagctgca tcttcaccgt gagcggcacc agcagctgct gccccttccc cgaggccgtg 1980
gcctgcggcg acggccacca ctgctgcccc cgcggcttcc actgcagcgc cgacggccgc 2040
agctgcttcc agcgcagcgg caacaacagc gtgggcgcca tccagtgccc cgacagccag 2100
ttcgagtgcc ccgacttcag cacctgctgc gtgatggtgg acggcagctg gggctgctgc 2160
cccatgcccc aggccagctg ctgcgaggac cgcgtgcact gctgccccca cggcgccttc 2220
tgcgacctgg tgcacacccg ctgcatcacc cccaccggca cccaccccct ggccaagaag 2280
ctgcccgccc agcgcaccaa ccgcgccgtg gccctgagca gcagcgtgat gtgccccgac 2340
gcccgcagcc gctgccccga cggcagcacc tgctgcgagc tgcccagcgg caagtacggc 2400
tgctgcccca tgcccaacgc cacctgctgc agcgaccacc tgcactgctg cccccaggac 2460
accgtgtgcg acctgatcca gagcaagtgc ctgagcaagg agaacgccac caccgacctg 2520
ctgaccaagc tgcccgccca caccgtgggc gacgtgaagt gcgacatgga ggtgagctgc 2580
cccgacggct acacctgctg ccgcctgcag agcggcgcct ggggctgctg ccccttcacc 2640
caggccgtgt gctgcgagga ccacatccac tgctgccccg ccggcttcac ctgcgacacc 2700
cagaagggca cctgcgagca gggcccccac caggtgccct ggatggagaa ggcccccgcc 2760
cacctgagcc tgcccgaccc ccaggccctg aagcgcgacg tgccctgcga caacgtgagc 2820
agctgcccca gcagcgacac ctgctgccag ctgaccagcg gcgagtgggg ctgctgcccc 2880
atccccgagg ccgtgtgctg cagcgaccac cagcactgct gcccccaggg ctacacctgc 2940
gtggccgagg gccagtgcca gcgcggcagc gagatcgtgg ccggcctgga gaagatgccc 3000
gcccgccgcg ccagcctgag ccacccccgc gacatcggct gcgaccagca caccagctgc 3060
cccgtgggcc agacctgctg ccccagcctg ggcggcagct gggcctgctg ccagctgccc 3120
cacgccgtgt gctgcgagga ccgccagcac tgctgccccg ccggctacac ctgcaacgtg 3180
aaggcccgca gctgcgagaa ggaggtggtg agcgcccagc ccgccacctt cctggcccgc 3240
agcccccacg tgggcgtgaa ggacgtggag tgcggcgagg gccacttctg ccacgacaac 3300
cagacctgct gccgcgacaa ccgccagggc tgggcctgct gcccctaccg ccagggcgtg 3360
tgctgcgccg accgccgcca ctgctgcccc gccggcttcc gctgcgccgc ccgcggcacc 3420
aagtgcctgc gccgcgaggc cccccgctgg gacgcccccc tgcgcgaccc cgccctgcgc 3480
cagctgctgt gacaattgtt aattaagttt aaaccctcga ggccgcaagc ttatcgataa 3540
tcaacctctg gattacaaaa tttgtgaaag attgactggt attcttaact atgttgctcc 3600
ttttacgcta tgtggatacg ctgctttaat gcctttgtat catgctattg cttcccgtat 3660
ggctttcatt ttctcctcct tgtataaatc ctggttgctg tctctttatg aggagttgtg 3720
gcccgttgtc aggcaacgtg gcgtggtgtg cactgtgttt gctgacgcaa cccccactgg 3780
ttggggcatt gccaccacct gtcagctcct ttccgggact ttcgctttcc ccctccctat 3840
tgccacggcg gaactcatcg ccgcctgcct tgcccgctgc tggacagggg ctcggctgtt 3900
gggcactgac aattccgtgg tgttgtcggg gaaatcatcg tcctttcctt ggctgctcgc 3960
ctgtgttgcc acctggattc tgcgcgggac gtccttctgc tacgtccctt cggccctcaa 4020
tccagcggac cttccttccc gcggcctgct gccggctctg cggcctcttc cgcgtcttcg 4080
ccttcgccct cagacgagtc ggatctccct ttgggccgcc tccccgcatc gataccgtcg 4140
actagagctc gctgatcagc ctcgactgtg ccttctagtt gccagccatc tgttgtttgc 4200
ccctcccccg tgccttcctt gaccctggaa ggtgccactc ccactgtcct ttcctaataa 4260
aatgaggaaa ttgcatcgca ttgtctgagt aggtgtcatt ctattctggg gggtggggtg 4320
gggcaggaca gcaaggggga ggattgggaa gacaatagca ggcatgctgg ggagagatcc 4380
acgataacaa acagcttttt tggggtgaac atattgactg aattccctgc aggttggcca 4440
ctccctctct gcgcgctcgc tcgctcactg aggccgcccg ggcaaagccc gggcgtcggg 4500
cgacctttgg tcgcccggcc tcagtgagcg agcgagcgcg cagagaggga gtggccaact 4560
ccatcactag gggttcct 4578
<210> 85
<211> 4162
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 85
ttggccactc cctctctgcg cgctcgctcg ctcactgagg ccgggcgacc aaaggtcgcc 60
cgacgcccgg gctttgcccg ggcggcctca gtgagcgagc gagcgcgcag agagggagtg 120
gccaactcca tcactagggg ttcctgctag ctctgggtat ttaagcccga gtgagcacgc 180
agggtctcca ttttgaagcg ggaggttacg cgttcgtcga ctactagtgg gtaccagagc 240
tccctaggtt ctagaaccgg tgacgtctcc catggtgaag cttggatctg aattcggtac 300
ctagttatta atagtaatca attacggggt cattagttca tagcccatat atggagttcc 360
gcgttacata acttacggta aatggcccgc ctggctgacc gcccaacgac ccccgcccat 420
tgacgtcaat aatgacgtat gttcccatag taacgccaat agggactttc cattgacgtc 480
aatgggtgga gtatttacgg taaactgccc acttggcagt acatcaagtg tatcatatgc 540
caagtacgcc ccctattgac gtcaatgacg gtaaatggcc cgcctggcat tatgcccagt 600
acatgacctt atgggacttt cctacttggc agtacatcta cgtattagtc atcgctatta 660
ccatggtcga ggtgagcccc acgttctgct tcactctccc catctccccc ccctccccac 720
ccccaatttt gtatttattt attttttaat tattttgtgc agcgatgggg gcgggggggg 780
ggggggggcg cgcgccaggc ggggcggggc ggggcgaggg gcggggcggg gcgaggcgga 840
gaggtgcggc ggcagccaat cagagcggcg cgctccgaaa gtttcctttt atggcgaggc 900
ggcggcggcg gcggccctat aaaaagcgaa gcgcgcggcg ggcgggagtc gctgcgacgc 960
tgccttcgcc ccgtgccccg ctccgccgcc gcctcgcgcc gcccgccccg gctctgactg 1020
accgcgttac tcccacaggt gagcgggcgg gacggccctt ctcctcagcg ctgtaattag 1080
cgcttggttt aatgacggct tgttggaggc ttgctgaagg ctgtatgctg ttgtctttag 1140
aaataagtgg tagtcaagtg aagccacaga tgtgactacc acttatttct aaaaggacac 1200
aaggcctgtt actagcactc acatggaaca aatggccacc gtgggaggat gacaatttct 1260
gtggctgcgt gaaagccttg aggggctccg ggagctagag cctctgctaa ccatgttcat 1320
gccttcttct ttttcctaca gctcctgggc aacgtgctgg ttattgtgct gtctcatcat 1380
tttggcaaag aattcctcga agatccgaag ggaaagtctt ccacgactgt gggatccgtt 1440
cgaagatatc accggttgag ccaccatgga attcagcagc cccagcagag aggaatgccc 1500
caagcctctg agccgggtgt caatcatggc cggatctctg acaggactgc tgctgcttca 1560
ggccgtgtct tgggcttctg gcgctagacc ttgcatcccc aagagcttcg gctacagcag 1620
cgtcgtgtgc gtgtgcaatg ccacctactg cgacagcttc gaccctccta cctttcctgc 1680
tctgggcacc ttcagcagat acgagagcac cagatccggc agacggatgg aactgagcat 1740
gggacccatc caggccaatc acacaggcac tggcctgctg ctgacactgc agcctgagca 1800
gaaattccag aaagtgaaag gcttcggcgg agccatgaca gatgccgccg ctctgaatat 1860
cctggctctg tctccaccag ctcagaacct gctgctcaag agctacttca gcgaggaagg 1920
catcggctac aacatcatca gagtgcccat ggccagctgc gacttcagca tcaggaccta 1980
cacctacgcc gacacacccg acgatttcca gctgcacaac ttcagcctgc ctgaagagga 2040
caccaagctg aagatccctc tgatccacag agccctgcag ctggcacaaa gacccgtgtc 2100
actgctggcc tctccatgga catctcccac ctggctgaaa acaaatggcg ccgtgaatgg 2160
caagggcagc ctgaaaggcc aacctggcga catctaccac cagacctggg ccagatactt 2220
cgtgaagttc ctggacgcct atgccgagca caagctgcag ttttgggccg tgacagccga 2280
gaacgaacct tctgctggac tgctgagcgg ctaccccttt cagtgcctgg gctttacacc 2340
cgagcaccag cgggacttta tcgcccgtga tctgggaccc acactggcca atagcaccca 2400
ccataatgtg cggctgctga tgctggacga ccagagactg cttctgcccc actgggctaa 2460
agtggtgctg acagatcctg aggccgccaa atacgtgcac ggaatcgccg tgcactggta 2520
tctggacttt ctggcccctg ccaaggccac actgggagag acacacagac tgttccccaa 2580
caccatgctg ttcgccagcg aagcctgtgt gggcagcaag ttttgggaac agagcgtgcg 2640
gctcggcagc tgggatagag gcatgcagta cagccacagc atcatcacca acctgctgta 2700
ccacgtcgtc ggctggaccg actggaatct ggccctgaat cctgaaggcg gccctaactg 2760
ggtccgaaac ttcgtggaca gccccatcat cgtggacatc accaaggaca ccttctacaa 2820
gcagcccatg ttctaccacc tgggacactt cagcaagttc atccccgagg gctctcagcg 2880
cgttggactg gtggcttccc agaagaacga tctggacgcc gtggctctga tgcaccctga 2940
tggatctgct gtggtggtgg tcctgaaccg cagcagcaaa gatgtgcccc tgaccatcaa 3000
ggatcccgcc gtgggattcc tggaaacaat cagccctggc tactccatcc acacctacct 3060
gtggcgtaga cagtgacaat tgttaattaa gtttaaaccc tcgaggccgc aagcttatcg 3120
ataatcaacc tctggattac aaaatttgtg aaagattgac tggtattctt aactatgttg 3180
ctccttttac gctatgtgga tacgctgctt taatgccttt gtatcatgct attgcttccc 3240
gtatggcttt cattttctcc tccttgtata aatcctggtt gctgtctctt tatgaggagt 3300
tgtggcccgt tgtcaggcaa cgtggcgtgg tgtgcactgt gtttgctgac gcaaccccca 3360
ctggttgggg cattgccacc acctgtcagc tcctttccgg gactttcgct ttccccctcc 3420
ctattgccac ggcggaactc atcgccgcct gccttgcccg ctgctggaca ggggctcggc 3480
tgttgggcac tgacaattcc gtggtgttgt cggggaaatc atcgtccttt ccttggctgc 3540
tcgcctgtgt tgccacctgg attctgcgcg ggacgtcctt ctgctacgtc ccttcggccc 3600
tcaatccagc ggaccttcct tcccgcggcc tgctgccggc tctgcggcct cttccgcgtc 3660
ttcgccttcg ccctcagacg agtcggatct ccctttgggc cgcctccccg catcgatacc 3720
gtcgactaga gctcgctgat cagcctcgac tgtgccttct agttgccagc catctgttgt 3780
ttgcccctcc cccgtgcctt ccttgaccct ggaaggtgcc actcccactg tcctttccta 3840
ataaaatgag gaaattgcat cgcattgtct gagtaggtgt cattctattc tggggggtgg 3900
ggtggggcag gacagcaagg gggaggattg ggaagacaat agcaggcatg ctggggagag 3960
atccacgata acaaacagct tttttggggc ccacatgtac actgaattcc ctgcaggttg 4020
gccactccct ctctgcgcgc tcgctcgctc actgaggccg cccgggcaaa gcccgggcgt 4080
cgggcgacct ttggtcgccc ggcctcagtg agcgagcgag cgcgcagaga gggagtggcc 4140
aactccatca ctaggggttc ct 4162
<210> 86
<211> 3977
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 86
ttggccactc cctctctgcg cgctcgctcg ctcactgagg ccgcccgggc aaagcccggg 60
cgtcgggcga cctttggtcg cccggcctca gtgagcgagc gagcgcgcag agagggagtg 120
gccaactcca tcactagggg ttcctgctag ctctgggtat ttaagcccga gtgagcacgc 180
agggtctcca ttttgaagcg ggaggttacg cgttcgtcga ctactagtgg gtaccagagc 240
tccctaggtt ctagaaccgg tgacgtctcc catggtgaag cttggatctg aattcggtac 300
ctagttatta atagtaatca attacggggt cattagttca tagcccatat atggagttcc 360
gcgttacata acttacggta aatggcccgc ctggctgacc gcccaacgac ccccgcccat 420
tgacgtcaat aatgacgtat gttcccatag taacgccaat agggactttc cattgacgtc 480
aatgggtgga gtatttacgg taaactgccc acttggcagt acatcaagtg tatcatatgc 540
caagtacgcc ccctattgac gtcaatgacg gtaaatggcc cgcctggcat tatgcccagt 600
acatgacctt atgggacttt cctacttggc agtacatcta cgtattagtc atcgctatta 660
ccatggtcga ggtgagcccc acgttctgct tcactctccc catctccccc ccctccccac 720
ccccaatttt gtatttattt attttttaat tattttgtgc agcgatgggg gcgggggggg 780
ggggggggcg cgcgccaggc ggggcggggc ggggcgaggg gcggggcggg gcgaggcgga 840
gaggtgcggc ggcagccaat cagagcggcg cgctccgaaa gtttcctttt atggcgaggc 900
ggcggcggcg gcggccctat aaaaagcgaa gcgcgcggcg ggcgggagtc gctgcgcgct 960
gccttcgccc cgtgccccgc tccgccgccg cctcgcgccg cccgccccgg ctctgactga 1020
ccgcgttact cccacaggtg agcgggcggg acggcccttc tcctccgggc tgtaattagc 1080
gcttggttta atgacggctt gtttcttttc tgtggctgcg tgaaagcctt gaggggctcc 1140
gggagctaga gcctctgcta accatgttca tgccttcttc tttttcctac agctcctggg 1200
caacgtgctg gttattgtgc tgtctcatca ttttggcaaa gaattcctcg aagatccgaa 1260
gggaaagtct tccacgactg tgggatccgt tcgaagatat caccggttga gccaccatgt 1320
acgccctgtt cctgctggcc agcctgctgg gcgccgccct ggccggcccc gtgctgggcc 1380
tgaaggagtg cacccgcggc agcgccgtgt ggtgccagaa cgtgaagacc gccagcgact 1440
gcggcgccgt gaagcactgc ctgcagaccg tgtggaacaa gcccaccgtg aagagcctgc 1500
cctgcgacat ctgcaaggac gtggtgaccg ccgccggcga catgctgaag gacaacgcca 1560
ccgaggagga gatcctggtg tacctggaga agacctgcga ctggctgccc aagcccaaca 1620
tgagcgccag ctgcaaggag atcgtggaca gctacctgcc cgtgatcctg gacatcatca 1680
agggcgagat gagccgcccc ggcgaggtgt gcagcgccct gaacctgtgc gagagcctgc 1740
agaagcacct ggccgagctg aaccaccaga agcagctgga gagcaacaag atccccgagc 1800
tggacatgac cgaggtggtg gcccccttca tggccaacat ccccctgctg ctgtaccccc 1860
aggacggccc ccgcagcaag ccccagccca aggacaacgg cgacgtgtgc caggactgca 1920
tccagatggt gaccgacatc cagaccgccg tgcgcaccaa cagcaccttc gtgcaggccc 1980
tggtggagca cgtgaaggag gagtgcgacc gcctgggccc cggcatggcc gacatctgca 2040
agaactacat cagccagtac agcgagatcg ccatccagat gatgatgcac atgcagccca 2100
aggagatctg cgccctggtg ggcttctgcg acgaggtgaa ggagatgccc atgcagaccc 2160
tggtgcccgc caaggtggcc agcaagaacg tgatccccgc cctggagctg gtggagccca 2220
tcaagaagca cgaggtgccc gccaagagcg acgtgtactg cgaggtgtgc gagttcctgg 2280
tgaaggaggt gaccaagctg atcgacaaca acaagaccga gaaggagatc ctggacgcct 2340
tcgacaagat gtgcagcaag ctgcccaaga gcctgagcga ggagtgccag gaggtggtgg 2400
acacctacgg cagcagcatc ctgagcatcc tgctggagga ggtgagcccc gagctggtgt 2460
gcagcatgct gcacctgtgc agcggcaccc gcctgcccgc cctgaccgtg cacgtgaccc 2520
agcccaagga cggcggcttc tgcgaggtgt gcaagaagct ggtgggctac ctggaccgca 2580
acctggagaa gaacagcacc aagcaggaga tcctggccgc cctggagaag ggctgcagct 2640
tcctgcccga cccctaccag aagcagtgcg accagttcgt ggccgagtac gagcccgtgc 2700
tgatcgagat cctggtggag gtgatggacc ccagcttcgt gtgcctgaag atcggcgcct 2760
gccccagcgc ccacaagccc ctgctgggca ccgagaagtg catctggggc cccagctact 2820
ggtgccagaa caccgagacc gccgcccagt gcaacgccgt ggagcactgc aagcgccacg 2880
tgtggaactg acaattgtta attaagttta aaccctcgag gccgcaagct tatcgataat 2940
caacctctgg attacaaaat ttgtgaaaga ttgactggta ttcttaacta tgttgctcct 3000
tttacgctat gtggatacgc tgctttaatg cctttgtatc atgctattgc ttcccgtatg 3060
gctttcattt tctcctcctt gtataaatcc tggttgctgt ctctttatga ggagttgtgg 3120
cccgttgtca ggcaacgtgg cgtggtgtgc actgtgtttg ctgacgcaac ccccactggt 3180
tggggcattg ccaccacctg tcagctcctt tccgggactt tcgctttccc cctccctatt 3240
gccacggcgg aactcatcgc cgcctgcctt gcccgctgct ggacaggggc tcggctgttg 3300
ggcactgaca attccgtggt gttgtcgggg aaatcatcgt cctttccttg gctgctcgcc 3360
tgtgttgcca cctggattct gcgcgggacg tccttctgct acgtcccttc ggccctcaat 3420
ccagcggacc ttccttcccg cggcctgctg ccggctctgc ggcctcttcc gcgtcttcgc 3480
cttcgccctc agacgagtcg gatctccctt tgggccgcct ccccgcatcg ataccgtcga 3540
ctagagctcg ctgatcagcc tcgactgtgc cttctagttg ccagccatct gttgtttgcc 3600
cctcccccgt gccttccttg accctggaag gtgccactcc cactgtcctt tcctaataaa 3660
atgaggaaat tgcatcgcat tgtctgagta ggtgtcattc tattctgggg ggtggggtgg 3720
ggcaggacag caagggggag gattgggaag acaatagcag gcatgctggg gagagatcca 3780
cgataacaaa cagctttttt ggggcccaca tgtacactga attccctgca ggttggccac 3840
tccctctctg cgcgctcgct cgctcactga ggccgcccgg gcaaagcccg ggcgtcgggc 3900
gacctttggt cgcccggcct cagtgagcga gcgagcgcgc agagagggag tggccaactc 3960
catcactagg ggttcct 3977
<210> 87
<211> 4013
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 87
ttggccactc cctctctgcg cgctcgctcg ctcactgagg ccgcccgggc aaagcccggg 60
cgtcgggcga cctttggtcg cccggcctca gtgagcgagc gagcgcgcag agagggagtg 120
gccaactcca tcactagggg ttcctgctag ctctgggtat ttaagcccga gtgagcacgc 180
agggtctcca ttttgaagcg ggaggttacg cgttcgtcga ctactagtgg gtaccagagc 240
tccctaggtt ctagaaccgg tgacgtctcc catggtgaag cttggatctg aattcggtac 300
ctagttatta atagtaatca attacggggt cattagttca tagcccatat atggagttcc 360
gcgttacata acttacggta aatggcccgc ctggctgacc gcccaacgac ccccgcccat 420
tgacgtcaat aatgacgtat gttcccatag taacgccaat agggactttc cattgacgtc 480
aatgggtgga gtatttacgg taaactgccc acttggcagt acatcaagtg tatcatatgc 540
caagtacgcc ccctattgac gtcaatgacg gtaaatggcc cgcctggcat tatgcccagt 600
acatgacctt atgggacttt cctacttggc agtacatcta cgtattagtc atcgctatta 660
ccatggtcga ggtgagcccc acgttctgct tcactctccc catctccccc ccctccccac 720
ccccaatttt gtatttattt attttttaat tattttgtgc agcgatgggg gcgggggggg 780
ggggggggcg cgcgccaggc ggggcggggc ggggcgaggg gcggggcggg gcgaggcgga 840
gaggtgcggc ggcagccaat cagagcggcg cgctccgaaa gtttcctttt atggcgaggc 900
ggcggcggcg gcggccctat aaaaagcgaa gcgcgcggcg ggcgggagtc gctgcgcgct 960
gccttcgccc cgtgccccgc tccgccgccg cctcgcgccg cccgccccgg ctctgactga 1020
ccgcgttact cccacaggtg agcgggcggg acggcccttc tcctccgggc tgtaattagc 1080
gcttggttta atgacggctt gtttcttttc tgtggctgcg tgaaagcctt gaggggctcc 1140
gggagctaga gcctctgcta accatgttca tgccttcttc tttttcctac agctcctggg 1200
caacgtgctg gttattgtgc tgtctcatca ttttggcaaa gaattcctcg aagatccgaa 1260
gggaaagtct tccacgactg tgggatccgt tcgaagatat caccggttga gccaccatgg 1320
aattcagcag ccccagcaga gaggaatgcc ccaagcctct gagccgggtg tcaatcatgg 1380
ccggatctct gacaggactg ctgctgcttc aggccgtgtc ttgggcttct ggcgctagac 1440
cttgcatccc caagagcttc ggctacagca gcgtcgtgtg cgtgtgcaat gccacctact 1500
gcgacagctt cgaccctcct acctttcctg ctctgggcac cttcagcaga tacgagagca 1560
ccagatccgg cagacggatg gaactgagca tgggacccat ccaggccaat cacacaggca 1620
ctggcctgct gctgacactg cagcctgagc agaaattcca gaaagtgaaa ggcttcggcg 1680
gagccatgac agatgccgcc gctctgaata tcctggctct gtctccacca gctcagaacc 1740
tgctgctcaa gagctacttc agcgaggaag gcatcggcta caacatcatc agagtgccca 1800
tggccagctg cgacttcagc atcaggacct acacctacgc cgacacaccc gacgatttcc 1860
agctgcacaa cttcagcctg cctgaagagg acaccaagct gaagatccct ctgatccaca 1920
gagccctgca gctggcacaa agacccgtgt cactgctggc ctctccatgg acatctccca 1980
cctggctgaa aacaaatggc gccgtgaatg gcaagggcag cctgaaaggc caacctggcg 2040
acatctacca ccagacctgg gccagatact tcgtgaagtt cctggacgcc tatgccgagc 2100
acaagctgca gttttgggcc gtgacagccg agaacgaacc ttctgctgga ctgctgagcg 2160
gctacccctt tcagtgcctg ggctttacac ccgagcacca gcgggacttt atcgcccgtg 2220
atctgggacc cacactggcc aatagcaccc accataatgt gcggctgctg atgctggacg 2280
accagagact gcttctgccc cactgggcta aagtggtgct gacagatcct gaggccgcca 2340
aatacgtgca cggaatcgcc gtgcactggt atctggactt tctggcccct gccaaggcca 2400
cactgggaga gacacacaga ctgttcccca acaccatgct gttcgccagc gaagcctgtg 2460
tgggcagcaa gttttgggaa cagagcgtgc ggctcggcag ctgggataga ggcatgcagt 2520
acagccacag catcatcacc aacctgctgt accacgtcgt cggctggacc gactggaatc 2580
tggccctgaa tcctgaaggc ggccctaact gggtccgaaa cttcgtggac agccccatca 2640
tcgtggacat caccaaggac accttctaca agcagcccat gttctaccac ctgggacact 2700
tcagcaagtt catccccgag ggctctcagc gcgttggact ggtggcttcc cagaagaacg 2760
atctggacgc cgtggctctg atgcaccctg atggatctgc tgtggtggtg gtcctgaacc 2820
gcagcagcaa agatgtgccc ctgaccatca aggatcccgc cgtgggattc ctggaaacaa 2880
tcagccctgg ctactccatc cacacctacc tgtggcgtag acagtgacaa ttgttaatta 2940
agtttaaacc ctcgaggccg caagcttatc gataatcaac ctctggatta caaaatttgt 3000
gaaagattga ctggtattct taactatgtt gctcctttta cgctatgtgg atacgctgct 3060
ttaatgcctt tgtatcatgc tattgcttcc cgtatggctt tcattttctc ctccttgtat 3120
aaatcctggt tgctgtctct ttatgaggag ttgtggcccg ttgtcaggca acgtggcgtg 3180
gtgtgcactg tgtttgctga cgcaaccccc actggttggg gcattgccac cacctgtcag 3240
ctcctttccg ggactttcgc tttccccctc cctattgcca cggcggaact catcgccgcc 3300
tgccttgccc gctgctggac aggggctcgg ctgttgggca ctgacaattc cgtggtgttg 3360
tcggggaaat catcgtcctt tccttggctg ctcgcctgtg ttgccacctg gattctgcgc 3420
gggacgtcct tctgctacgt cccttcggcc ctcaatccag cggaccttcc ttcccgcggc 3480
ctgctgccgg ctctgcggcc tcttccgcgt cttcgccttc gccctcagac gagtcggatc 3540
tccctttggg ccgcctcccc gcatcgatac cgtcgactag agctcgctga tcagcctcga 3600
ctgtgccttc tagttgccag ccatctgttg tttgcccctc ccccgtgcct tccttgaccc 3660
tggaaggtgc cactcccact gtcctttcct aataaaatga ggaaattgca tcgcattgtc 3720
tgagtaggtg tcattctatt ctggggggtg gggtggggca ggacagcaag ggggaggatt 3780
gggaagacaa tagcaggcat gctggggaga gatccacgat aacaaacagc ttttttgggg 3840
tgaacatatt gactgaattc cctgcaggtt ggccactccc tctctgcgcg ctcgctcgct 3900
cactgaggcc gcccgggcaa agcccgggcg tcgggcgacc tttggtcgcc cggcctcagt 3960
gagcgagcga gcgcgcagag agggagtggc caactccatc actaggggtt cct 4013
<210> 88
<211> 4625
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 88
ttggccactc cctctctgcg cgctcgctcg ctcactgagg ccgggcgacc aaaggtcgcc 60
cgacgcccgg gctttgcccg ggcggcctca gtgagcgagc gagcgcgcag agagggagtg 120
gccaactcca tcactagggg ttcctgctag ctctgggtat ttaagcccga gtgagcacgc 180
agggtctcca ttttgaagcg ggaggttacg cgttcgtcga ctactagtgg gtaccagagc 240
tccctaggtt ctagaaccgg tgacgtctcc catggtgaag cttggatctg agggcggagt 300
tagggcggag ccaatcagcg tgcgccgttc cgaaagttgc cttttatggc tgggcggaga 360
atgggcggtg aacgccgatg attatataag gacgcgccgg gtgtggcaca gctagttccg 420
tcgcagccgg gatttgggtc gcggttcttg tttgtggatc cctgtgatcg tcacttggta 480
agtcactgac tgtctatgcc tgggaaaggg tgggcaggag atggggcagt gcaggaaaag 540
tggcactatg aaccctgcag ccctaggaat gcatctagac aattgtacta accttcttct 600
ctttcctctc ctgacagtcc ggaaagccac catggaattc agcagcccca gcagagagga 660
atgccccaag cctctgagcc gggtgtcaat catggccgga tctctgacag gactgctgct 720
gcttcaggcc gtgtcttggg cttctggcgc tagaccttgc atccccaaga gcttcggcta 780
cagcagcgtc gtgtgcgtgt gcaatgccac ctactgcgac agcttcgacc ctcctacctt 840
tcctgctctg ggcaccttca gcagatacga gagcaccaga tccggcagac ggatggaact 900
gagcatggga cccatccagg ccaatcacac aggcactggc ctgctgctga cactgcagcc 960
tgagcagaaa ttccagaaag tgaaaggctt cggcggagcc atgacagatg ccgccgctct 1020
gaatatcctg gctctgtctc caccagctca gaacctgctg ctcaagagct acttcagcga 1080
ggaaggcatc ggctacaaca tcatcagagt gcccatggcc agctgcgact tcagcatcag 1140
gacctacacc tacgccgaca cacccgacga tttccagctg cacaacttca gcctgcctga 1200
agaggacacc aagctgaaga tccctctgat ccacagagcc ctgcagctgg cacaaagacc 1260
cgtgtcactg ctggcctctc catggacatc tcccacctgg ctgaaaacaa atggcgccgt 1320
gaatggcaag ggcagcctga aaggccaacc tggcgacatc taccaccaga cctgggccag 1380
atacttcgtg aagttcctgg acgcctatgc cgagcacaag ctgcagtttt gggccgtgac 1440
agccgagaac gaaccttctg ctggactgct gagcggctac ccctttcagt gcctgggctt 1500
tacacccgag caccagcggg actttatcgc ccgtgatctg ggacccacac tggccaatag 1560
cacccaccat aatgtgcggc tgctgatgct ggacgaccag agactgcttc tgccccactg 1620
ggctaaagtg gtgctgacag atcctgaggc cgccaaatac gtgcacggaa tcgccgtgca 1680
ctggtatctg gactttctgg cccctgccaa ggccacactg ggagagacac acagactgtt 1740
ccccaacacc atgctgttcg ccagcgaagc ctgtgtgggc agcaagtttt gggaacagag 1800
cgtgcggctc ggcagctggg atagaggcat gcagtacagc cacagcatca tcaccaacct 1860
gctgtaccac gtcgtcggct ggaccgactg gaatctggcc ctgaatcctg aaggcggccc 1920
taactgggtc cgaaacttcg tggacagccc catcatcgtg gacatcacca aggacacctt 1980
ctacaagcag cccatgttct accacctggg acacttcagc aagttcatcc ccgagggctc 2040
tcagcgcgtt ggactggtgg cttcccagaa gaacgatctg gacgccgtgg ctctgatgca 2100
ccctgatgga tctgctgtgg tggtggtcct gaaccgcagc agcaaagatg tgcccctgac 2160
catcaaggat cccgccgtgg gattcctgga aacaatcagc cctggctact ccatccacac 2220
ctacctgtgg cgtagacaga gaagaaagag aggaagtgga gagggcagag gaagtcttct 2280
gacatgcgga gacgtggaag agaatcccgg ccctatggcc gagtggctgc tgagcgccag 2340
ctggcagcgc cgcgccaagg ccatgaccgc cgccgccggc agcgccggcc gcgccgccgt 2400
gcccctgctg ctgtgcgccc tgctggcccc cggcggcgcc tacgtgctgg acgacagcga 2460
cggcctgggc cgcgagttcg acggcatcgg cgccgtgagc ggcggcggcg ccaccagccg 2520
cctgctggtg aactaccccg agccctaccg cagccagatc ctggactacc tgttcaagcc 2580
caacttcggc gccagcctgc acatcctgaa ggtggagatc ggcggcgacg gccagaccac 2640
cgacggcacc gagcccagcc acatgcacta cgccctggac gagaactact tccgcggcta 2700
cgagtggtgg ctgatgaagg aggccaagaa gcgcaacccc aacatcaccc tgatcggcct 2760
gccctggagc ttccccggct ggctgggcaa gggcttcgac tggccctacg tgaacctgca 2820
gctgaccgcc tactacgtgg tgacctggat cgtgggcgcc aagcgctacc acgacctgga 2880
catcgactac atcggcatct ggaacgagcg cagctacaac gccaactaca tcaagatcct 2940
gcgcaagatg ctgaactacc agggcctgca gcgcgtgaag atcatcgcca gcgacaacct 3000
gtgggagagc atcagcgcca gcatgctgct ggacgccgag ctgttcaagg tggtggacgt 3060
gatcggcgcc cactaccccg gcacccacag cgccaaggac gccaagctga ccggcaagaa 3120
gctgtggagc agcgaggact tcagcaccct gaacagcgac atgggcgccg gctgctgggg 3180
ccgcatcctg aaccagaact acatcaacgg ctacatgacc agcaccatcg cctggaacct 3240
ggtggccagc tactacgagc agctgcccta cggccgctgc ggcctgatga ccgcccagga 3300
gccctggagc ggccactacg tggtggagag ccccgtgtgg gtgagcgccc acaccaccca 3360
gttcacccag cccggctggt actacctgaa gaccgtgggc cacctggaga agggcggcag 3420
ctacgtggcc ctgaccgacg gcctgggcaa cctgaccatc atcatcgaga ccatgagcca 3480
caagcacagc aagtgcatcc gccccttcct gccctacttc aacgtgagcc agcagttcgc 3540
caccttcgtg ctgaagggca gcttcagcga gatccccgag ctgcaggtgt ggtacaccaa 3600
gctgggcaag accagcgagc gcttcctgtt caagcagctg gacagcctgt ggctgctgga 3660
cagcgacggc agcttcaccc tgagcctgca cgaggacgag ctgttcaccc tgaccaccct 3720
gaccaccggc cgcaagggca gctaccccct gccccccaag agccagccct tccccagcac 3780
ctacaaggac gacttcaacg tggactaccc cttcttcagc gaggccccca acttcgccga 3840
ccagaccggc gtgttcgagt acttcaccaa catcgaggac cccggcgagc accacttcac 3900
cctgcgccag gtgctgaacc agcgccccat cacctgggcc gccgacgcca gcaacaccat 3960
cagcatcatc ggcgactaca actggaccaa cctgaccatc aagtgcgacg tgtacatcga 4020
gacccccgac accggcggcg tgttcatcgc cggccgcgtg aacaagggcg gcatcctgat 4080
ccgcagcgcc cgcggcatct tcttctggat cttcgccaac ggcagctacc gcgtgaccgg 4140
cgacctggcc ggctggatca tctacgccct gggccgcgtg gaggtgaccg ccaagaagtg 4200
gtacaccctg accctgacca tcaagggcca cttcaccagc ggcatgctga acgacaagag 4260
cctgtggacc gacatccccg tgaacttccc caagaacggc tgggccgcca tcggcaccca 4320
cagcttcgag ttcgcccagt tcgacaactt cctggtggag gccacccgct gacaattgtt 4380
aattaagttt aaaccctcga ggccgcaagc aataaaatat ctttattttc attacatctg 4440
tgtgttggtt ttttgtgttg tacactgaat tccctgcagg ttggccactc cctctctgcg 4500
cgctcgctcg ctcactgagg ccgcccgggc aaagcccggg cgtcgggcga cctttggtcg 4560
cccggcctca gtgagcgagc gagcgcgcag agagggagtg gccaactcca tcactagggg 4620
ttcct 4625
<210> 89
<211> 4606
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 89
ttggccactc cctctctgcg cgctcgctcg ctcactgagg ccgggcgacc aaaggtcgcc 60
cgacgcccgg gctttgcccg ggcggcctca gtgagcgagc gagcgcgcag agagggagtg 120
gccaactcca tcactagggg ttcctgctag ctctgggtat ttaagcccga gtgagcacgc 180
agggtctcca ttttgaagcg ggaggttacg cgttcgtcga ctactagtgg gtaccagagc 240
tccctaggtt ctagaaccgg tgacgtctcc catggtgaag cttggatctg agggcggagt 300
tagggcggag ccaatcagcg tgcgccgttc cgaaagttgc cttttatggc tgggcggaga 360
atgggcggtg aacgccgatg attatataag gacgcgccgg gtgtggcaca gctagttccg 420
tcgcagccgg gatttgggtc gcggttcttg tttgtggatc cctgtgatcg tcacttggta 480
agtcactgac tgtctatgcc tgggaaaggg tgggcaggag atggggcagt gcaggaaaag 540
tggcactatg aaccctcctg gtggcgaggg gaggggggtg gtcctcgaac gccttgcaga 600
actggcctgg atacagagtg gaccggctgg ccccatctgg aagacttcga gatacactgt 660
tgtcttactg cgctcaacag tgtatctcga agtcttccaa atggtgccag ccatcgcagc 720
ggggtgcagg aaatgggggc agcccccctt tttggctatc cttccacgtg ttcttttttg 780
tatcttttgt gtttcctaga aaacatctca gtcaccaccg cagccctagg aatgcatcta 840
gacaattgta ctaaccttct tctctttcct ctcctgacag tccggaaagc caccatgtac 900
gccctgttcc tgctggccag cctgctgggc gccgccctgg ccggccccgt gctgggcctg 960
aaggagtgca cccgcggcag cgccgtgtgg tgccagaacg tgaagaccgc cagcgactgc 1020
ggcgccgtga agcactgcct gcagaccgtg tggaacaagc ccaccgtgaa gagcctgccc 1080
tgcgacatct gcaaggacgt ggtgaccgcc gccggcgaca tgctgaagga caacgccacc 1140
gaggaggaga tcctggtgta cctggagaag acctgcgact ggctgcccaa gcccaacatg 1200
agcgccagct gcaaggagat cgtggacagc tacctgcccg tgatcctgga catcatcaag 1260
ggcgagatga gccgccccgg cgaggtgtgc agcgccctga acctgtgcga gagcctgcag 1320
aagcacctgg ccgagctgaa ccaccagaag cagctggaga gcaacaagat ccccgagctg 1380
gacatgaccg aggtggtggc ccccttcatg gccaacatcc ccctgctgct gtacccccag 1440
gacggccccc gcagcaagcc ccagcccaag gacaacggcg acgtgtgcca ggactgcatc 1500
cagatggtga ccgacatcca gaccgccgtg cgcaccaaca gcaccttcgt gcaggccctg 1560
gtggagcacg tgaaggagga gtgcgaccgc ctgggccccg gcatggccga catctgcaag 1620
aactacatca gccagtacag cgagatcgcc atccagatga tgatgcacat gcagcccaag 1680
gagatctgcg ccctggtggg cttctgcgac gaggtgaagg agatgcccat gcagaccctg 1740
gtgcccgcca aggtggccag caagaacgtg atccccgccc tggagctggt ggagcccatc 1800
aagaagcacg aggtgcccgc caagagcgac gtgtactgcg aggtgtgcga gttcctggtg 1860
aaggaggtga ccaagctgat cgacaacaac aagaccgaga aggagatcct ggacgccttc 1920
gacaagatgt gcagcaagct gcccaagagc ctgagcgagg agtgccagga ggtggtggac 1980
acctacggca gcagcatcct gagcatcctg ctggaggagg tgagccccga gctggtgtgc 2040
agcatgctgc acctgtgcag cggcacccgc ctgcccgccc tgaccgtgca cgtgacccag 2100
cccaaggacg gcggcttctg cgaggtgtgc aagaagctgg tgggctacct ggaccgcaac 2160
ctggagaaga acagcaccaa gcaggagatc ctggccgccc tggagaaggg ctgcagcttc 2220
ctgcccgacc cctaccagaa gcagtgcgac cagttcgtgg ccgagtacga gcccgtgctg 2280
atcgagatcc tggtggaggt gatggacccc agcttcgtgt gcctgaagat cggcgcctgc 2340
cccagcgccc acaagcccct gctgggcacc gagaagtgca tctggggccc cagctactgg 2400
tgccagaaca ccgagaccgc cgcccagtgc aacgccgtgg agcactgcaa gcgccacgtg 2460
tggaacagaa gaaagagagg aagtggagag ggcagaggaa gtcttctgac atgcggagac 2520
gtggaagaga atcccggccc tatggaattc agcagcccca gcagagagga atgccccaag 2580
cctctgagcc gggtgtcaat catggccgga tctctgacag gactgctgct gcttcaggcc 2640
gtgtcttggg cttctggcgc tagaccttgc atccccaaga gcttcggcta cagcagcgtc 2700
gtgtgcgtgt gcaatgccac ctactgcgac agcttcgacc ctcctacctt tcctgctctg 2760
ggcaccttca gcagatacga gagcaccaga tccggcagac ggatggaact gagcatggga 2820
cccatccagg ccaatcacac aggcactggc ctgctgctga cactgcagcc tgagcagaaa 2880
ttccagaaag tgaaaggctt cggcggagcc atgacagatg ccgccgctct gaatatcctg 2940
gctctgtctc caccagctca gaacctgctg ctcaagagct acttcagcga ggaaggcatc 3000
ggctacaaca tcatcagagt gcccatggcc agctgcgact tcagcatcag gacctacacc 3060
tacgccgaca cacccgacga tttccagctg cacaacttca gcctgcctga agaggacacc 3120
aagctgaaga tccctctgat ccacagagcc ctgcagctgg cacaaagacc cgtgtcactg 3180
ctggcctctc catggacatc tcccacctgg ctgaaaacaa atggcgccgt gaatggcaag 3240
ggcagcctga aaggccaacc tggcgacatc taccaccaga cctgggccag atacttcgtg 3300
aagttcctgg acgcctatgc cgagcacaag ctgcagtttt gggccgtgac agccgagaac 3360
gaaccttctg ctggactgct gagcggctac ccctttcagt gcctgggctt tacacccgag 3420
caccagcggg actttatcgc ccgtgatctg ggacccacac tggccaatag cacccaccat 3480
aatgtgcggc tgctgatgct ggacgaccag agactgcttc tgccccactg ggctaaagtg 3540
gtgctgacag atcctgaggc cgccaaatac gtgcacggaa tcgccgtgca ctggtatctg 3600
gactttctgg cccctgccaa ggccacactg ggagagacac acagactgtt ccccaacacc 3660
atgctgttcg ccagcgaagc ctgtgtgggc agcaagtttt gggaacagag cgtgcggctc 3720
ggcagctggg atagaggcat gcagtacagc cacagcatca tcaccaacct gctgtaccac 3780
gtcgtcggct ggaccgactg gaatctggcc ctgaatcctg aaggcggccc taactgggtc 3840
cgaaacttcg tggacagccc catcatcgtg gacatcacca aggacacctt ctacaagcag 3900
cccatgttct accacctggg acacttcagc aagttcatcc ccgagggctc tcagcgcgtt 3960
ggactggtgg cttcccagaa gaacgatctg gacgccgtgg ctctgatgca ccctgatgga 4020
tctgctgtgg tggtggtcct gaaccgcagc agcaaagatg tgcccctgac catcaaggat 4080
cccgccgtgg gattcctgga aacaatcagc cctggctact ccatccacac ctacctgtgg 4140
cgtagacagt gacaattgtt aattaagttt aaaccctcga ggccgcaagc cgcatcgata 4200
ccgtcgacta gagctcgctg atcagcctcg actgtgcctt ctagttgcca gccatctgtt 4260
gtttgcccct cccccgtgcc ttccttgacc ctggaaggtg ccactcccac tgtcctttcc 4320
taataaaatg aggaaattgc atcgcattgt ctgagtaggt gtcattctat tctggggggt 4380
ggggtggggc aggacagcaa gggggaggat tgggaagaca atagcaggca tgctggggat 4440
gtacactgaa ttccctgcag gttggccact ccctctctgc gcgctcgctc gctcactgag 4500
gccgcccggg caaagcccgg gcgtcgggcg acctttggtc gcccggcctc agtgagcgag 4560
cgagcgcgca gagagggagt ggccaactcc atcactaggg gttcct 4606
<210> 90
<211> 3022
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 90
attctggtgt gatccaggaa cagctgtctt ccagctctga aagagtgtgg tgtaaaggaa 60
ttcattagcc atggatgtat tcatgaaagg actttcaaag gccaaggagg gagttgtggc 120
tgctgctgag aaaaccaaac agggtgtggc agaagcagca ggaaagacaa aagagggtgt 180
tctctatgta ggctccaaaa ccaaggaggg agtggtgcat ggtgtggcaa cagtggctga 240
gaagaccaaa gagcaagtga caaatgttgg aggagcagtg gtgacgggtg tgacagcagt 300
agcccagaag acagtggagg gagcagggag cattgcagca gccactggct ttgtcaaaaa 360
ggaccagttg ggcaagaatg aagaaggagc cccacaggaa ggaattctgg aagatatgcc 420
tgtggatcct gacaatgagg cttatgaaat gccttctgag gaagggtatc aagactacga 480
acctgaagcc taagaaatat ctttgctccc agtttcttga gatctgctga cagatgttcc 540
atcctgtaca agtgctcagt tccaatgtgc ccagtcatga catttctcaa agtttttaca 600
gtgtatctcg aagtcttcca tcagcagtga ttgaagtatc tgtacctgcc cccactcagc 660
atttcggtgc ttccctttca ctgaagtgaa tacatggtag cagggtcttt gtgtgctgtg 720
gattttgtgg cttcaatcta cgatgttaaa acaaattaaa aacacctaag tgactaccac 780
ttatttctaa atcctcacta tttttttgtt gctgttgttc agaagttgtt agtgatttgc 840
tatcatatat tataagattt ttaggtgtct tttaatgata ctgtctaaga ataatgacgt 900
attgtgaaat ttgttaatat atataatact taaaaatatg tgagcatgaa actatgcacc 960
tataaatact aaatatgaaa ttttaccatt ttgcgatgtg ttttattcac ttgtgtttgt 1020
atataaatgg tgagaattaa aataaaacgt tatctcattg caaaaatatt ttatttttat 1080
cccatctcac tttaataata aaaatcatgc ttataagcaa catgaattaa gaactgacac 1140
aaaggacaaa aatataaagt tattaatagc catttgaaga aggaggaatt ttagaagagg 1200
tagagaaaat ggaacattaa ccctacactc ggaattccct gaagcaacac tgccagaagt 1260
gtgttttggt atgcactggt tccttaagtg gctgtgatta attattgaaa gtggggtgtt 1320
gaagacccca actactattg tagagtggtc tatttctccc ttcaatcctg tcaatgtttg 1380
ctttacgtat tttggggaac tgttgtttga tgtgtatgtg tttataattg ttatacattt 1440
ttaattgagc cttttattaa catatattgt tatttttgtc tcgaaataat tttttagtta 1500
aaatctattt tgtctgatat tggtgtgaat gctgtacctt tctgacaata aataatattc 1560
gaccatgaat aaaaaaaaaa aaaaagtggg ttcccgggaa ctaagcagtg tagaagatga 1620
ttttgactac accctcctta gagagccata agacacatta gcacatatta gcacattcaa 1680
ggctctgaga gaatgtggtt aactttgttt aactcagcat tcctcacttt ttttttttaa 1740
tcatcagaaa ttctctctct ctctctctct ttttctctcg ctctcttttt tttttttttt 1800
ttacaggaaa tgcctttaaa catcgttgga actaccagag tcaccttaaa ggagatcaat 1860
tctctagact gataaaaatt tcatggcctc ctttaaatgt tgccaaatat atgaattcta 1920
ggatttttcc ttaggaaagg tttttctctt tcagggaaga tctattaact ccccatgggt 1980
gctgaaaata aacttgatgg tgaaaaactc tgtataaatt aatttaaaaa ttatttggtt 2040
tctcttttta attattctgg ggcatagtca tttctaaaag tcactagtag aaagtataat 2100
ttcaagacag aatattctag acatgctagc agtttatatg tattcatgag taatgtgata 2160
tatattgggc gctggtgagg aaggaaggag gaatgagtga ctataaggat ggttaccata 2220
gaaacttcct tttttaccta attgaagaga gactactaca gagtgctaag ctgcatgtgt 2280
catcttacac tagagagaaa tggtaagttt cttgttttat ttaagttatg tttaagcaag 2340
gaaaggattt gttattgaac agtatatttc aggaaggtta gaaagtggcg gttaggatat 2400
attttaaatc tacctaaagc agcatatttt aaaaatttaa aagtattggt attaaattaa 2460
gaaatagagg acagaactag actgatagca gtgacctaga acaatttgag attaggaaag 2520
ttgtgaccat gaatttaagg atttatgtgg atacaaattc tcctttaaag tgtttcttcc 2580
cttaatattt atctgacggt aatttttgag cagtgaatta ctttatatat cttaatagtt 2640
tatttgggac caaacactta aacaaaaagt tctttaagtc atataagcct tttcaggaag 2700
cttgtctcat attcactccc gagacattca cctgccaagt ggcctgagga tcaatccagt 2760
cctaggttta ttttgcagac ttacattctc ccaagttatt cagcctcata tgactccacg 2820
gtcggcttta ccaaaacagt tcagagtgca ctttggcaca caattgggaa cagaacaatc 2880
taatgtgtgg tttggtattc caagtggggt ctttttcaga atctctgcac tagtgtgaga 2940
tgcaaacatg tttcctcatc tttctggctt atccagtatg tagctatttg tgacataata 3000
aatatataca tatatgaaaa ta 3022
<210> 91
<211> 6514
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 91
aggcgcggac gcaggttaca gcagcgcttg gcctctgctg atgccgtcgt tatcctaccc 60
ctcccccgtc ccagctctac ggcggccgcg cgctccaggc cggtcgctcc accccccggc 120
tcccgggact gtggactcca cgaccctgtc ctcggccctg tccgcgccga agcagcccgg 180
gactgcgcag cgccccgcgt gccgatcttt tcctaattca gcagcgattt aaccaagagc 240
ctggaatatt ttaaggagta ataagagaca tttacaaact attctctctg aagcctgcta 300
cctggaggca tcatctagat aatcagaacc ttggcttcca catcctcctc ccttgtctta 360
actacaaaca tttctttctg ctgacttcaa ctcctcagac atgggaaagt ctctttctca 420
tttgcctttg cattcaagca aagaagatgc ttatgatgga gtcacatctg aaaacatgag 480
gaatggactg gttaatagtg aagtccataa tgaagatgga agaaatggag atgtctctca 540
gtttccatat gtggaattta caggaagaga tagtgtcacc tgccctactt gtcagggaac 600
aggaagaatt cctagggggc aagaaaacca actggtggca ttgattccat atagtgatca 660
gagattaagg ccaagaagaa caaagctgta tgtgatggct tctgtgtttg tctgtctact 720
cctttctgga ttggctgtgt ttttcctttt ccctcgctct atcgacgtga aatacattgg 780
tgtaaaatca gcctatgtca gttatgatgt tcagaagcgt acaatttatt taaatatcac 840
aaacacacta aatataacaa acaataacta ttactctgtc gaagttgaaa acatcactgc 900
ccaagttcaa ttttcaaaaa cagttattgg aaaggcacgc ttaaacaaca taaccattat 960
tggtccactt gatatgaaac aaattgatta cacagtacct accgttatag cagaggaaat 1020
gagttatatg tatgatttct gtactctgat atccatcaaa gtgcataaca tagtactcat 1080
gatgcaagtt actgtgacaa caacatactt tggccactct gaacagatat cccaggagag 1140
gtatcagtat gtcgactgtg gaagaaacac aacttatcag ttggggcagt ctgaatattt 1200
aaatgtactt cagccacaac agtaaaaact ggaagagatg gatttaaaga agaaatatct 1260
attgatattt cctatactct caatgaagag gtatttccta ataggagacc ttaaattgaa 1320
caaacctaaa gtttacactt ctaagagtac agttaaaagt atgtggacct gcagttcttg 1380
taactctcca ctctgtgtta atgatatatt tgtactagga tcttttactt gaatctaaat 1440
ttactggttg atttccttct ccagcctatc ccctacaggg aaaagctgat acttccccta 1500
tagtacaata aataattatt taaaagtcat agctccagtc actactgaaa acataatttt 1560
ggtgataaac ataatttgag aaacttaatt tctgaatgtt tttatagaaa attactgaaa 1620
gtctattact catggaagac ttttaaagaa taaccttttt tcctgtttta taaattccca 1680
ttgttatatg gtagtatttc agctacacaa tattttagct tttagctaga catttatagc 1740
ttttcatttg ttgaaatggt aatcatctgc atgtttttgt cacttatttc aggttagtga 1800
ttgcctaaca cttataagcc aaaataatct ttgcaaaatt ccatacctaa aattttgaaa 1860
gcccctaatg ttttcacaca tctttctgta ttagttatag ttttgtgaaa tctttgtgtg 1920
atcttcaaac attatcattt aatgtacaat actgtaaata aactgtgcat ggcttttata 1980
cagctttagt aaatgtcaaa taaagtggta cagactcatt acaacaagtt tctcataaaa 2040
atacaataaa taggaaaatg aaattcagaa acccatagac tgggaatagg ttccagttac 2100
agcttggatc tggcataaaa taaatttgaa ataaaatatt ttgatgctcc atttttttat 2160
gttgcttttc atactaaaga atggtgtaga catgttttgc aactgttagg tacccagtta 2220
tcaattttat caatgtttta gaggaggaaa ttattttttt ggtagaaatt gttcaagaaa 2280
tccttaattg aatgtcatta aatgatggtg gccaaaataa aacctattta gaaatttaat 2340
cactttgcac atcacttgga atatgatgcc tctagtagtt acttttttat agttttctac 2400
ttttggtttt atttaaaatt gttttcaaat atagattatt gacttattca actttgctgt 2460
tttatatttt cagtatcatt tttcattttt tttttttttt gtcttttcac ttaccaagtt 2520
ctagggacat ttaaaatatg tactaagtgt aggagtggtt atgataccaa aaaatgtagc 2580
tgggttgaga ttaatttcgt tctgttttct catgacagaa atcaggtttc cctttcccca 2640
cccctaagtg cctaacttag gtctgaaaca gcctgtttat tagtctgact ctctcaacca 2700
taaaacataa gctttattta attctgcctt taaacacact caggtttccc cttaattttc 2760
atattatttt ctgcaagttt tcttgagtat cttcaattcg ttgaatgtgg tttttggttt 2820
ttttttgttt taacactagt cttcccttaa ttcattgcta actcaagcca tccttactat 2880
taaacccaaa tcagtccttt aagttcatta tggcctttct agtatttaaa aaaaaaaaaa 2940
tcattttcat ttttcttctg ctacgtttcc tgactactac tgcatacttc tctgatacag 3000
gttctgtttg tattttttat atcattctca ttttctcatt tgacatgatc tatgtctata 3060
tatgatatag gtcccctttt gtctcaaaat ttttaattat gtgacttcaa aaatcacctg 3120
tatctgtagt agggcttcca aatctgcttc tccatatgtg accagtcacc tgtctgcttt 3180
cacatttagc tagtgaacta cacatttact aaaatgtgta aattttacac atttagtgac 3240
tgtgtaaaat aaaaaaaaag ttattttatc atatcctttc tattatgttc ccatcctgtc 3300
ctcatgtccc atttacttta ttatcaccat tcatttcttc aaaattatct tttagatacg 3360
ctcatacaaa aatcaatcct tgttttcttg cttgtgtctt ttaaccttgg aaaattacat 3420
cgtgtaaatt aaacagattt ttctgatgat ctgtgcttct tatatactat tagagtgcat 3480
gatagtatct cctgaaaagg atggaaagta gaagcatttg cttttagtca cttaattttg 3540
aatctttttt cttcatcttt tgaattaatt ttttttatta tatctacttt tagtggagtt 3600
tgagtcagaa aaaaacaaga atttgaaaca agtaaaaaga tagaagagaa ataaagatgg 3660
tatgtgacta ctttcagaga gagttaagta actgtcagaa taagcctgga acaaaacagg 3720
ctgtaaatta ataaaactac aaacacacat tcaggtgaag cagaagtata gccataaaac 3780
atctagaaag agtgaatgag gcttttagct tttcttaggt caatgtccag tgtgcttttt 3840
tccatgggaa taggataggt attaatacgc ttttctaaac tgctctcaga ccttatccag 3900
aggacatggt aaagatatgt tacagaaatt tttctgatac ttcctggaat aactttaagt 3960
tacaccctag tagactggtc attctaataa aatccagtac tataacaaac ctctgtatgt 4020
tgatagcaca ttggcccttt ttagagttct ttcctatgtt tttcttacgt gatttcccac 4080
agttccatga gtccaacaaa ggagagtgat aggctcctta tcttttagaa gaggaaggaa 4140
aggcatgaag aagttgaggg actggctgaa gatcacgtac ttactaagta gtacaactgg 4200
agcaagatca agtatctctg tctcccatat ctgtgttcta tcatttaaaa tatatattgg 4260
aaatccctgc tgactcagat tggtatgatt aaaaatgaga ggaaagttca aatagttagt 4320
agtgacaaac taatactgct ggactaagat tttggtagca ttgttttcta aaatatttta 4380
aatggagaat gaacacttat aaaatgcttt ggaacataat ctttagctta attttctgtt 4440
aaaatttagt accccttcat cattccaata aagataagac tgatccattg tctaaggaaa 4500
ttatttataa ataatagaga ttaatttatt tgagatttga aataagaata gtatgaaaat 4560
attagatacc acataaattg tttgaaatta ctgaataacc atcttaagta tggaacattt 4620
aaatggctat attttatttg tgtacagttt ttctgtgcct tgttaggcca gtgaagcaat 4680
tattttctct aagaaaatga caataaaata taacacactt cagattgtct gatttacagt 4740
ttggaaagga caccgcaatg ttcaaatagg taggagacca tcaaaaacac aattaaagta 4800
acatattagg agacttgaaa cttcagccta ataaatcctt catggttctt agccttatta 4860
ttgtgatata attctagata ttttcttgga gggcatgtgc ccaactctcc cgcaccccat 4920
tttgtttgtc ttttaaagtt cttagaataa acagttcttt atataataat tatattttat 4980
ttaagaaaat agtttgttag gtacttttta aaagatgtaa atttttaaat ttacaaatac 5040
atatgggtct ttgataagca ataggaattg aattacaagt tactagggtt ataagcaaaa 5100
ggttgcttac cataatgtca ttaggtcacg atttttagct cacatctgga agcagcaact 5160
acttggctca agtacatata agagtaatta gttttattct ctctttttta taaaatcggg 5220
tttcagatga gatgtttatc ttagactatt ttagggaaaa attttacatg tttgagatgg 5280
tggagtaaaa agactgttaa acatttcttt taaaaaatta tttttacatt acaacaatat 5340
atttatgatg tgttcagatc aaaaatttaa cttctgtgtc ccagatctac tttcaaagtg 5400
agattttcac ttgtcagctt aaatttctga ctagaactaa catttgtgta tttttgtgct 5460
tagtcggaat acaaatttca cagtggattt ttgaagtttg tccttaaatt ggataaaatc 5520
aagtgattaa agttactaaa gagataaaaa tggtaatttc catttttaaa agtaatttgg 5580
ttgtgtttat agttatttgt acaagtattt atcacagact ctaaattgaa aaatgtagta 5640
tgatctatat ttgaccctaa aaatgttgga ttaatttaac aaatatggca gatttttcat 5700
aactaagtct taagtcttct aaaaggaagc tgttaccctt ctgtttttaa ttacattaat 5760
tgaaatgtgt tttaagagat acaatttcag catattttat atattaaaaa acaaaaaagg 5820
attagtattg agccagtggc caaaaggtaa tattactacc atgtagactg ttatagttca 5880
aattgtccca cttcacccag aattttagaa actagaagtc tgggaggtac tatatcagct 5940
gtagttgggt aattccaagt gctgatagta ctattcatct tttttattat tgtgtcagat 6000
gaaacaaatg ccaagttgca aaatatgcag atttttatta tataatggtt ttaggcataa 6060
attattaaca agccatgcct tatgtgtttc atcttatatt tttctttaga actaaactat 6120
aacagatttt ggaaaatgat ttgacgtgct tgctcacttg attgacttgg tcagatattt 6180
gaatgatggt attacctaga ttctaatcct tgattctagt tatataataa ataatataga 6240
atatgaaaat atgtttgggc atttactgtt tatattatgt agtagcctcc atcatgacac 6300
acttactaca tttatgaatt gagcagttct gtaattgtaa ttattattgc tgttcatgta 6360
acaaaacatg cttataatag caaacaaata gaaatgcccc caaaatgcta tttttttaat 6420
tcagttataa ctgttactct tgtagttgtg tatgacgcaa taaaatttgt aaaaaaattt 6480
cagcatgaaa aataaaattt gtatcactta tgta 6514
<210> 92
<211> 24
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 92
gtgtactagg atcttttact tgaa 24
<210> 93
<211> 24
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 93
ttcaagtaaa agatcctagt acac 24
<210> 94
<211> 152
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 94
ctggaggctt gctttgggct gtatgctgtg gaagacttcg agatacactg tttttggcct 60
ctgactgaac agtgttctga agtcttccac aggacacaag gccctttatc agcactcaca 120
tggaacaaat ggccaccgtg ggaggatgac aa 152
<210> 95
<211> 152
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 95
ttgtcatcct cccacggtgg ccatttgttc catgtgagtg ctgataaagg gccttgtgtc 60
ctgtggaaga cttcagaaca ctgttcagtc agaggccaaa aacagtgtat ctcgaagtct 120
tccacagcat acagcccaaa gcaagcctcc ag 152
<210> 96
<211> 152
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 96
ttgtcatcct cccacggtgg ccatttgttc catgtgagtg ctagtaacag gccttgtgtc 60
cttttagaaa taagtggtag tcacatctgt ggcttcactt gactaccact tatttctaaa 120
gacaacagca tacagccttc agcaagcctc ca 152
<210> 97
<211> 152
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 97
tggaggcttg ctgaaggctg tatgctgttg tctttagaaa taagtggtag tcaagtgaag 60
ccacagatgt gactaccact tatttctaaa aggacacaag gcctgttact agcactcaca 120
tggaacaaat ggccaccgtg ggaggatgac aa 152
<210> 98
<211> 179
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 98
tggaggcttg ctgaaggctg tatgctgttg tcctcgagtg agcgtagggt atcaagacta 60
cgaatactgt aaagccacag atgggtgttc gtagtcttga tacccttcgc ctactagagg 120
acacaaggcc tgttactagc actcacatgg aacaaatggc caccgtggga ggatgacaa 179
<210> 99
<211> 179
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 99
ttgtcatcct cccacggtgg ccatttgttc catgtgagtg ctagtaacag gccttgtgtc 60
ctctagtagg cgaagggtat caagactacg aacacccatc tgtggcttta cagtattcgt 120
agtcttgata ccctacgctc actcgaggac aacagcatac agccttcagc aagcctcca 179
<210> 100
<211> 10960
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 100
ttggccactc cctctctgcg cgctcgctcg ctcactgagg ccgggcgacc aaaggtcgcc 60
cgacgcccgg gctttgcccg ggcggcctca gtgagcgagc gagcgcgcag agagggagtg 120
gccaactcca tcactagggg ttcctgctag ctctgggtat ttaagcccga gtgagcacgc 180
agggtctcca ttttgaagcg ggaggttacg cgttcgtcga ctactagtgg gtaccagagc 240
tccctaggtt ctagaaccgg tgacgtctcc catggtgaag cttggatctg aattcggtac 300
cctagttatt aatagtaatc aattacgggg tcattagttc atagcccata tatggagttc 360
cgcgttacat aacttacggt aaatggcccg cctggctgac cgcccaacga cccccgccca 420
ttgacgtcaa taatgacgta tgttcccata gtaacgccaa tagggacttt ccattgacgt 480
caatgggtgg actatttacg gtaaactgcc cacttggcag tacatcaagt gtatcatatg 540
ccaagtacgc cccctattga cgtcaatgac ggtaaatggc ccgcctggca ttatgcccag 600
tacatgacct tatgggactt tcctacttgg cagtacatct acgtattagt catcgctatt 660
accatggtcg aggtgagccc cacgttctgc ttcactctcc ccatctcccc cccctcccca 720
cccccaattt tgtatttatt tattttttaa ttattttgtg cagcgatggg ggcggggggg 780
gggggggggc gcgcgccagg cggggcgggg cggggcgagg ggcggggcgg ggcgaggcgg 840
agaggtgcgg cggcagccaa tcagagcggc gcgctccgaa agtttccttt tatggcgagg 900
cggcggcggc ggcggcccta taaaaagcga agcgcgcggc gggcgggagt cgctgcgacg 960
ctgccttcgc cccgtgcccc gctccgccgc cgcctcgcgc cgcccgcccc ggctctgact 1020
gaccgcgtta ctcccacagg tgagcgggcg ggacggccct tctcctccgg gctgtaatta 1080
gcgcttggtt taatgacggc ttgtcctggt ggcgagggga ggggggtggt cctcgaacgc 1140
cttgcagaac tggcctggat acagagtgga ccggctggcc ccatctggaa gacttcgaga 1200
tacactgttg tcttactgcg ctcaacagtg tatctcgaag tcttccaaat ggtgccagcc 1260
atcgcagcgg ggtgcaggaa atgggggcag cccccctttt tggctatcct tccacgtgtt 1320
cttttttgta tcttttgtgt ttcctagaaa acatctcagt caccaccttt ctgtggctgc 1380
gtgaaagcct tgaggggctc cgggagctag agcctctgct aaccatgttc atgccttctt 1440
ctttttccta cagctcctgg gcaacgtgct ggttattgtg ctgtctcatc attttggcaa 1500
agaattcctc gaagatccga agggaaagtc ttccacgact gtgggatccg ttcgaagata 1560
tcaccggttg agccaccatg gaattcagca gccccagcag agaggaatgc cccaagcctc 1620
tgagccgggt gtcaatcatg gccggatctc tgacaggact gctgctgctt caggccgtgt 1680
cttgggcttc tggcgctaga ccttgcatcc ccaagagctt cggctacagc agcgtcgtgt 1740
gcgtgtgcaa tgccacctac tgcgacagct tcgaccctcc tacctttcct gctctgggca 1800
ccttcagcag atacgagagc accagatccg gcagacggat ggaactgagc atgggaccca 1860
tccaggccaa tcacacaggc actggcctgc tgctgacact gcagcctgag cagaaattcc 1920
agaaagtgaa aggcttcggc ggagccatga cagatgccgc cgctctgaat atcctggctc 1980
tgtctccacc agctcagaac ctgctgctca agagctactt cagcgaggaa ggcatcggct 2040
acaacatcat cagagtgccc atggccagct gcgacttcag catcaggacc tacacctacg 2100
ccgacacacc cgacgatttc cagctgcaca acttcagcct gcctgaagag gacaccaagc 2160
tgaagatccc tctgatccac agagccctgc agctggcaca aagacccgtg tcactgctgg 2220
cctctccatg gacatctccc acctggctga aaacaaatgg cgccgtgaat ggcaagggca 2280
gcctgaaagg ccaacctggc gacatctacc accagacctg ggccagatac ttcgtgaagt 2340
tcctggacgc ctatgccgag cacaagctgc agttttgggc cgtgacagcc gagaacgaac 2400
cttctgctgg actgctgagc ggctacccct ttcagtgcct gggctttaca cccgagcacc 2460
agcgggactt tatcgcccgt gatctgggac ccacactggc caatagcacc caccataatg 2520
tgcggctgct gatgctggac gaccagagac tgcttctgcc ccactgggct aaagtggtgc 2580
tgacagatcc tgaggccgcc aaatacgtgc acggaatcgc cgtgcactgg tatctggact 2640
ttctggcccc tgccaaggcc acactgggag agacacacag actgttcccc aacaccatgc 2700
tgttcgccag cgaagcctgt gtgggcagca agttttggga acagagcgtg cggctcggca 2760
gctgggatag aggcatgcag tacagccaca gcatcatcac caacctgctg taccacgtcg 2820
tcggctggac cgactggaat ctggccctga atcctgaagg cggccctaac tgggtccgaa 2880
acttcgtgga cagccccatc atcgtggaca tcaccaagga caccttctac aagcagccca 2940
tgttctacca cctgggacac ttcagcaagt tcatccccga gggctctcag cgcgttggac 3000
tggtggcttc ccagaagaac gatctggacg ccgtggctct gatgcaccct gatggatctg 3060
ctgtggtggt ggtcctgaac cgcagcagca aagatgtgcc cctgaccatc aaggatcccg 3120
ccgtgggatt cctggaaaca atcagccctg gctactccat ccacacctac ctgtggcgta 3180
gacagtgaca attgttaatt aagtttaaac cctcgaggcc gcaagcttat cgataatcaa 3240
cctctggatt acaaaatttg tgaaagattg actggtattc ttaactatgt tgctcctttt 3300
acgctatgtg gatacgctgc tttaatgcct ttgtatcatg ctattgcttc ccgtatggct 3360
ttcattttct cctccttgta taaatcctgg ttgctgtctc tttatgagga gttgtggccc 3420
gttgtcaggc aacgtggcgt ggtgtgcact gtgtttgctg acgcaacccc cactggttgg 3480
ggcattgcca ccacctgtca gctcctttcc gggactttcg ctttccccct ccctattgcc 3540
acggcggaac tcatcgccgc ctgccttgcc cgctgctgga caggggctcg gctgttgggc 3600
actgacaatt ccgtggtgtt gtcggggaaa tcatcgtcct ttccttggct gctcgcctgt 3660
gttgccacct ggattctgcg cgggacgtcc ttctgctacg tcccttcggc cctcaatcca 3720
gcggaccttc cttcccgcgg cctgctgccg gctctgcggc ctcttccgcg tcttcgcctt 3780
cgccctcaga cgagtcggat ctccctttgg gccgcctccc cgcatcgata ccgtcgacta 3840
gagctcgctg atcagcctcg actgtgcctt ctagttgcca gccatctgtt gtttgcccct 3900
cccccgtgcc ttccttgacc ctggaaggtg ccactcccac tgtcctttcc taataaaatg 3960
aggaaattgc atcgcattgt ctgagtaggt gtcattctat tctggggggt ggggtggggc 4020
aggacagcaa gggggaggat tgggaagaca atagcaggca tgctggggag agatccacga 4080
taacaaacag cttttttggg gtgaacatat tgactgaatt ccctgcaggt tggccactcc 4140
ctctctgcgc gctcgctcgc tcactgaggc cgcccgggca aagcccgggc gtcgggcgac 4200
ctttggtcgc ccggcctcag tgagcgagcg agcgcgcaga gagggagtgg ccaactccat 4260
cactaggggt tcctgcggcc gctcgtacgg tctcgaggaa ttcctgcagg ataacttgcc 4320
aacctcattc taaaatgtat atagaagccc aaaagacaat aacaaaaata ttcttgtaga 4380
acaaaatggg aaagaatgtt ccactaaata tcaagattta gagcaaagca tgagatgtgt 4440
ggggatagac agtgaggctg ataaaataga gtagagctca gaaacagacc cattgatata 4500
tgtaagtgac ctatgaaaaa aatatggcat tttacaatgg gaaaatgatg gtctttttct 4560
tttttagaaa aacagggaaa tatatttata tgtaaaaaat aaaagggaac ccatatgtca 4620
taccatacac acaaaaaaat tccagtgaat tataagtcta aatggagaag gcaaaacttt 4680
aaatctttta gaaaataata tagaagcatg cagaccagcc tggccaacat gatgaaaccc 4740
tctctactaa taataaaatc agtagaacta ctcaggacta ctttgagtgg gaagtccttt 4800
tctatgaaga cttctttggc caaaattagg ctctaaatgc aaggagatag tgcatcatgc 4860
ctggctgcac ttactgataa atgatgttat caccatcttt aaccaaatgc acaggaacaa 4920
gttatggtac tgatgtgctg gattgagaag gagctctact tccttgacag gacacatttg 4980
tatcaactta aaaaagcaga tttttgccag cagaactatt cattcagagg taggaaactt 5040
agaatagatg atgtcactga ttagcatggc ttccccatct ccacagctgc ttcccaccca 5100
ggttgcccac agttgagttt gtccagtgct cagggctgcc cactctcagt aagaagcccc 5160
acaccagccc ctctccaaat atgttggctg ttccttccat taaagtgacc ccactttaga 5220
gcagcaagtg gatttctgtt tcttacagtt caggaaggag gagtcagctg tgagaacctg 5280
gagcctgaga tgcttctaag tcccactgct actggggtca gggaagccag actccagcat 5340
cagcagtcag gagcactaag cccttgccaa catcctgttt ctcagagaaa ctgcttccat 5400
tataatggtt gtcctttttt aagctatcaa gccaaacaac cagtgtctac cattattctc 5460
atcacctgaa gccaagggtt ctagcaaaag tcaagctgtc ttgtaatggt tgatgtgcct 5520
ccagcttctg tcttcagtca ctccactctt agcctgctct gaatcaactc tgaccacagt 5580
tccctggagc ccctgccacc tgctgcccct gccaccttct ccatctgcag tgctgtgcag 5640
ccttctgcac tcttgcagag ctaataggtg gagacttgaa ggaagaggag gaaagtttct 5700
cataatagcc ttgctgcaag ctcaaatggg aggtgggcac tgtgcccagg agccttggag 5760
caaaggctgt gcccaacctc tgactgcatc caggtttggt cttgacagag ataagaagcc 5820
ctggcttttg gagccaaaat ctaggtcaga cttaggcagg attctcaaag tttatcagca 5880
gaacatgagg cagaagaccc tttctgctcc agcttcttca ggctcaacct tcatcagaat 5940
agatagaaag agaggctgtg agggttctta aaacagaagc aaatctgact cagagaataa 6000
acaacctcct agtaaactac agcttagaca gagcatctgg tggtgagtgt gctcagtgtc 6060
ctactcaact gtctggtatc agccctcatg aggacttctc ttctttccct catagacctc 6120
catctctgtt ttccttagcc tgcagaaatc tggatggcta ttcacagaat gcctgtgctt 6180
tcagagttgc attttttctc tggtattctg gttcaagcat ttgaaggtag gaaaggttct 6240
ccaagtgcaa gaaagccagc cctgagcctc aactgcctgg ctagtgtggt cagtaggatg 6300
caaaggctgt tgaatgccac aaggccaaac tttaacctgt gtaccacaag cctagcagca 6360
gaggcagctc tgctcactgg aactctctgt cttctttctc ctgagccttt tcttttcctg 6420
agttttctag ctctcctcaa ccttacctct gccctaccca ggacaaaccc aagagccact 6480
gtttctgtga tgtcctctcc agccctaatt aggcatcatg acttcagcct gaccttccat 6540
gctcagaagc agtgctaatc cacttcagat gagctgctct atgcaacaca ggcagagcct 6600
acaaaccttt gcaccagagc cctccacata tcagtgtttg ttcatactca cttcaacagc 6660
aaatgtgact gctgagatta agattttaca caagatggtc tgtaatttca cagttagttt 6720
tatcccatta ggtatgaaag aattagcata attcccctta aacatgaatg aatcttagat 6780
tttttaataa atagttttgg aagtaaagac agagacatca ggagcacaag gaatagcctg 6840
agaggacaaa cagaacaaga aagagtctgg aaatacacag gatgttcttg gcctcctcaa 6900
agcaagtgca agcagatagt accagcagcc ccaggctatc agagcccagt gaagagaagt 6960
accatgaaag ccacagctct aaccaccctg ttccagagtg acagacagtc cccaagacaa 7020
gccagcctga gccagagaga gaactgcaag agaaagtttc taatttaggt tctgttagat 7080
tcagacaagt gcaggtcatc ctctctccac agctactcac ctctccagcc taacaaagcc 7140
tgcagtccac actccaaccc tggtgtctca cctcctagcc tctcccaaca tcctgctctc 7200
tgaccatctt ctgcatctct catctcacca tctcccactg tctacagcct actcttgcaa 7260
ctaccatctc attttctgac atcctgtcta catcttctgc catactctgc catctaccat 7320
accacctctt accatctacc acaccatctt ttatctccat ccctctcaga agcctccaag 7380
ctgaatcctg ctttatgtgt tcatctcagc ccctgcatgg aaagctgacc ccagaggcag 7440
aactattccc agagagcttg gccaagaaaa acaaaactac cagcctggcc aggctcagga 7500
gtagtaagct gcagtgtctg ttgtgttcta gcttcaacag ctgcaggagt tccactctca 7560
aatgctccac atttctcaca tcctcctgat tctggtcact acccatcttc aaagaacaga 7620
atatctcaca tcagcatact gtgaaggact agtcatgggt gcagctgctc agagctgcaa 7680
agtcattctg gatggtggag agcttacaaa catttcatga tgctcccccc gctctgatgg 7740
ctggagccca atccctacac agactcctgc tgtatgtgtt ttcctttcac tctgagccac 7800
agccagaggg caggcattca gtctcctctt caggctgggg ctggggcact gagaactcac 7860
ccaacacctt gctctcactc cttctgcaaa acaagaaaga gctttgtgct gcagtagcca 7920
tgaagaatga aaggaaggct ttaactaaaa aatgtcagag attattttca accccttact 7980
gtggatcacc agcaaggagg aaacacaaca cagagacatt ttttcccctc aaattatcaa 8040
aagaatcact gcatttgtta aagagagcaa ctgaatcagg aagcagagtt ttgaacatat 8100
cagaagttag gaatctgcat cagagacaaa tgcagtcatg gttgtttgct gcataccagc 8160
cctaatcatt agaagcctca tggacttcaa acatcattcc ctctgacaag atgctctagc 8220
ctaactccat gagataaaat aaatctgcct ttcagagcca aagaagagtc caccagcttc 8280
ttctcagtgt gaacaagagc tccagtcagg ttagtcagtc cagtgcagta gaggagacca 8340
gtctgcatcc tctaattttc aaaggcaaga agatttgttt accctggaca ccaggcacaa 8400
gtgaggtcac agagctctta gatatgcagt cctcatgagt gaggagacta aagcgcatgc 8460
catcaagact tcagtgtaga gaaaacctcc aaaaaagcct cctcactact tctggaatag 8520
ctcagaggcc gaggcggcct cggcctctgc ataaataaaa aaaattagtc agccatgggg 8580
cggagaatgg gcggaactgg gcggagttag gggcgggatg ggcggagtta ggggcgggac 8640
tatggttgct gactaattga gatgcatgct ttgcatactt ctgcctgctg gggagcctgg 8700
ggactttcca cacctggttg ctgactaatt gagatgcatg ctttgcatac ttctgcctgc 8760
tggggagcct ggggactttc cacaccctaa ctgacacaca ttccacagct gcattaatga 8820
atcggccaac gcgcggggag aggcggtttg cgtattgggc gctcttccgc ttcctcgctc 8880
actgactcgc tgcgctcggt cgttcggctg cggcgagcgg tatcagctca ctcaaaggcg 8940
gtaatacggt tatccacaga atcaggggat aacgcaggaa agaacatgtg agcaaaaggc 9000
cagcaaaagg ccaggaaccg taaaaaggcc gcgttgctgg cgtttttcca taggctccgc 9060
ccccctgacg agcatcacaa aaatcgacgc tcaagtcaga ggtggcgaaa cccgacagga 9120
ctataaagat accaggcgtt tccccctgga agctccctcg tgcgctctcc tgttccgacc 9180
ctgccgctta ccggatacct gtccgccttt ctcccttcgg gaagcgtggc gctttctcat 9240
agctcacgct gtaggtatct cagttcggtg taggtcgttc gctccaagct gggctgtgtg 9300
cacgaacccc ccgttcagcc cgaccgctgc gccttatccg gtaactatcg tcttgagtcc 9360
aacccggtaa gacacgactt atcgccactg gcagcagcca ctggtaacag gattagcaga 9420
gcgaggtatg taggcggtgc tacagagttc ttgaagtggt ggcctaacta cggctacact 9480
agaagaacag tatttggtat ctgcgctctg ctgaagccag ttaccttcgg aaaaagagtt 9540
ggtagctctt gatccggcaa acaaaccacc gctggtagcg gtggtttttt tgtttgcaag 9600
cagcagatta cgcgcagaaa aaaaggatct caagaagatc ctttgatctt ttctacgggg 9660
tctgacgctc agtggaacga aaactcacgt taagggattt tggtcatgag attatcaaaa 9720
aggatcttca cctagatcct tttaaattaa aaatgaagtt ttaaatcaat ctaaagtata 9780
tatgagtaaa cttggtctga cagttaccaa tgcttaatca gtgaggcacc tatctcagcg 9840
atctgtctat ttcgttcatc catagttgcc tgactcctgc aaaccacgtt gtgtctcaaa 9900
atctctgatg ttacattgca caagataaaa atatatcatc atgaacaata aaactgtctg 9960
cttacataaa cagtaataca aggggtgtta tgagccatat tcaacgggaa acgtcttgct 10020
cgaggccgcg attaaattcc aacatggatg ctgatttata tgggtataaa tgggctcgcg 10080
ataatgtcgg gcaatcaggt gcgacaatct atcgattgta tgggaagccc gatgcgccag 10140
agttgtttct gaaacatggc aaaggtagcg ttgccaatga tgttacagat gagatggtca 10200
gactaaactg gctgacggaa tttatgcctc ttccgaccat caagcatttt atccgtactc 10260
ctgatgatgc atggttactc accactgcga tccccgggaa aacagcattc caggtattag 10320
aagaatatcc tgattcaggt gaaaatattg ttgatgcgct ggcagtgttc ctgcgccggt 10380
tgcattcgat tcctgtttgt aattgtcctt ttaacagcga tcgcgtattt cgtctcgctc 10440
aggcgcaatc acgaatgaat aacggtttgg ttgatgcgag tgattttgat gacgagcgta 10500
atggctggcc tgttgaacaa gtctggaaag aaatgcataa gcttttgcca ttctcaccgg 10560
attcagtcgt cactcatggt gatttctcac ttgataacct tatttttgac gaggggaaat 10620
taataggttg tattgatgtt ggacgagtcg gaatcgcaga ccgataccag gatcttgcca 10680
tcctatggaa ctgcctcggt gagttttctc cttcattaca gaaacggctt tttcaaaaat 10740
atggtattga taatcctgat atgaataaat tgcagtttca tttgatgctc gatgagtttt 10800
tctaagggcg gcctgccacc atacccacgc cgaaacaagc gctcatgagc ccgaagtggc 10860
gagcccgatc ttccccatcg gtgatgtcgg cgatataggc gccagcaacc gcacctgtgg 10920
cgccggtgat gagggcgcgc caagtcgacg tccggcagtc 10960
<210> 101
<211> 10013
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 101
ttggccactc cctctctgcg cgctcgctcg ctcactgagg ccgggcgacc aaaggtcgcc 60
cgacgcccgg gctttgcccg ggcggcctca gtgagcgagc gagcgcgcag agagggagtg 120
gccaactcca tcactagggg ttcctgctag ctctgggtat ttaagcccga gtgagcacgc 180
agggtctcca ttttgaagcg ggaggttacg cgttcgtcga ctactagtgg gtaccagagc 240
tccctaggtt ctagaaccgg tgacgtctcc catggtgaag cttggatctg aattcggtac 300
cctagttatt aatagtaatc aattacgggg tcattagttc atagcccata tatggagttc 360
cgcgttacat aacttacggt aaatggcccg cctggctgac cgcccaacga cccccgccca 420
ttgacgtcaa taatgacgta tgttcccata gtaacgccaa tagggacttt ccattgacgt 480
caatgggtgg actatttacg gtaaactgcc cacttggcag tacatcaagt gtatcatatg 540
ccaagtacgc cccctattga cgtcaatgac ggtaaatggc ccgcctggca ttatgcccag 600
tacatgacct tatgggactt tcctacttgg cagtacatct acgtattagt catcgctatt 660
accatggtcg aggtgagccc cacgttctgc ttcactctcc ccatctcccc cccctcccca 720
cccccaattt tgtatttatt tattttttaa ttattttgtg cagcgatggg ggcggggggg 780
gggggggggc gcgcgccagg cggggcgggg cggggcgagg ggcggggcgg ggcgaggcgg 840
agaggtgcgg cggcagccaa tcagagcggc gcgctccgaa agtttccttt tatggcgagg 900
cggcggcggc ggcggcccta taaaaagcga agcgcgcggc gggcgggagt cgctgcgacg 960
ctgccttcgc cccgtgcccc gctccgccgc cgcctcgcgc cgcccgcccc ggctctgact 1020
gaccgcgtta ctcccacagg tgagcgggcg ggacggccct tctcctccgg gctgtaatta 1080
gcgcttggtt taatgacggc ttgttttctg tggctgcgtg aaagccttga ggggctccgg 1140
gagctagagc ctctgctaac catgttcatg ccttcttctt tttcctacag ctcctgggca 1200
acgtgctggt tattgtgctg tctcatcatt ttggcaaaga attcctcgaa gatccgaagg 1260
gaaagtcttc cacgactgtg ggatccgttc gaagatatca ccggttgagc caccatggac 1320
gtgttcatga agggcctgag caaggccaag gagggcgtgg tggccgccgc cgagaagacc 1380
aagcagggcg tggccgaggc cgccggcaag accaaggagg gcgtgctgta cgtgggcagc 1440
aagaccaagg agggcgtggt gcacggcgtg gccaccgtgg ccgagaagac caaggagcag 1500
gtgaccaacg tgggcggcgc cgtggtgacc ggcgtgaccg ccgtggccca gaagaccgtg 1560
gagggcgccg gcagcatcgc cgccgccacc ggcttcgtga agaaggacca gctgggcaag 1620
aacgaggagg gcgcccccca ggagggcatc ctggaggaca tgcccgtgga ccccgacaac 1680
gaggcctacg agatgcccag cgaggagggc taccaggact acgagcccga ggcctaagaa 1740
atatctttgc tcccagtttc ttgagatctg ctgacagatg ttccatcctg tacaagtgct 1800
cagttccaat gtgcccagtc atgacatttc tcaaagtttt tacagtgtat ctcgaagtct 1860
tccatcagca gtgattgaag tatctgtacc tgcccccact cagcatttcg gtgcttccct 1920
ttcactgaag tgaatacatg gtagcagggt ctttgtgtgc tgtggatttt gtggcttcaa 1980
tctacgatgt taaaacaaat taaaaacacc taagtgacta ccacttattt ctaaatcctc 2040
actatttttt tgttgctgtt gttcagaagt tgttagtgat ttgctatcat atattataag 2100
atttttaggt gtcttttaat gatactgtct aagaataatg acgtattgtg aaatttgtta 2160
atatatataa tacttaaaaa tatgtgagca tgaaactatg cacctataaa tactaaatat 2220
gaaattttac cattttgctg acaattgtta attaagttta aaccctcgag gccgcaagct 2280
tatcgataat caacctctgg attacaaaat ttgtgaaaga ttgactggta ttcttaacta 2340
tgttgctcct tttacgctat gtggatacgc tgctttaatg cctttgtatc atgctattgc 2400
ttcccgtatg gctttcattt tctcctcctt gtataaatcc tggttgctgt ctctttatga 2460
ggagttgtgg cccgttgtca ggcaacgtgg cgtggtgtgc actgtgtttg ctgacgcaac 2520
ccccactggt tggggcattg ccaccacctg tcagctcctt tccgggactt tcgctttccc 2580
cctccctatt gccacggcgg aactcatcgc cgcctgcctt gcccgctgct ggacaggggc 2640
tcggctgttg ggcactgaca attccgtggt gttgtcgggg aaatcatcgt cctttccttg 2700
gctgctcgcc tgtgttgcca cctggattct gcgcgggacg tccttctgct acgtcccttc 2760
ggccctcaat ccagcggacc ttccttcccg cggcctgctg ccggctctgc ggcctcttcc 2820
gcgtcttcgc cttcgccctc agacgagtcg gatctccctt tgggccgcct ccccgcatcg 2880
ataccgtcga ctagagctcg ctgatcagcc tcgactgtgc cttctagttg ccagccatct 2940
gttgtttgcc cctcccccgt gccttccttg accctggaag gtgccactcc cactgtcctt 3000
tcctaataaa atgaggaaat tgcatcgcat tgtctgagta ggtgtcattc tattctgggg 3060
ggtggggtgg ggcaggacag caagggggag gattgggaag acaatagcag gcatgctggg 3120
gagagatcca cgataacaaa cagctttttt ggggtgaaca tattgactga attccctgca 3180
ggttggccac tccctctctg cgcgctcgct cgctcactga ggccgcccgg gcaaagcccg 3240
ggcgtcgggc gacctttggt cgcccggcct cagtgagcga gcgagcgcgc agagagggag 3300
tggccaactc catcactagg ggttcctgcg gccgctcgta cggtctcgag gaattcctgc 3360
aggataactt gccaacctca ttctaaaatg tatatagaag cccaaaagac aataacaaaa 3420
atattcttgt agaacaaaat gggaaagaat gttccactaa atatcaagat ttagagcaaa 3480
gcatgagatg tgtggggata gacagtgagg ctgataaaat agagtagagc tcagaaacag 3540
acccattgat atatgtaagt gacctatgaa aaaaatatgg cattttacaa tgggaaaatg 3600
atggtctttt tcttttttag aaaaacaggg aaatatattt atatgtaaaa aataaaaggg 3660
aacccatatg tcataccata cacacaaaaa aattccagtg aattataagt ctaaatggag 3720
aaggcaaaac tttaaatctt ttagaaaata atatagaagc atgcagacca gcctggccaa 3780
catgatgaaa ccctctctac taataataaa atcagtagaa ctactcagga ctactttgag 3840
tgggaagtcc ttttctatga agacttcttt ggccaaaatt aggctctaaa tgcaaggaga 3900
tagtgcatca tgcctggctg cacttactga taaatgatgt tatcaccatc tttaaccaaa 3960
tgcacaggaa caagttatgg tactgatgtg ctggattgag aaggagctct acttccttga 4020
caggacacat ttgtatcaac ttaaaaaagc agatttttgc cagcagaact attcattcag 4080
aggtaggaaa cttagaatag atgatgtcac tgattagcat ggcttcccca tctccacagc 4140
tgcttcccac ccaggttgcc cacagttgag tttgtccagt gctcagggct gcccactctc 4200
agtaagaagc cccacaccag cccctctcca aatatgttgg ctgttccttc cattaaagtg 4260
accccacttt agagcagcaa gtggatttct gtttcttaca gttcaggaag gaggagtcag 4320
ctgtgagaac ctggagcctg agatgcttct aagtcccact gctactgggg tcagggaagc 4380
cagactccag catcagcagt caggagcact aagcccttgc caacatcctg tttctcagag 4440
aaactgcttc cattataatg gttgtccttt tttaagctat caagccaaac aaccagtgtc 4500
taccattatt ctcatcacct gaagccaagg gttctagcaa aagtcaagct gtcttgtaat 4560
ggttgatgtg cctccagctt ctgtcttcag tcactccact cttagcctgc tctgaatcaa 4620
ctctgaccac agttccctgg agcccctgcc acctgctgcc cctgccacct tctccatctg 4680
cagtgctgtg cagccttctg cactcttgca gagctaatag gtggagactt gaaggaagag 4740
gaggaaagtt tctcataata gccttgctgc aagctcaaat gggaggtggg cactgtgccc 4800
aggagccttg gagcaaaggc tgtgcccaac ctctgactgc atccaggttt ggtcttgaca 4860
gagataagaa gccctggctt ttggagccaa aatctaggtc agacttaggc aggattctca 4920
aagtttatca gcagaacatg aggcagaaga ccctttctgc tccagcttct tcaggctcaa 4980
ccttcatcag aatagataga aagagaggct gtgagggttc ttaaaacaga agcaaatctg 5040
actcagagaa taaacaacct cctagtaaac tacagcttag acagagcatc tggtggtgag 5100
tgtgctcagt gtcctactca actgtctggt atcagccctc atgaggactt ctcttctttc 5160
cctcatagac ctccatctct gttttcctta gcctgcagaa atctggatgg ctattcacag 5220
aatgcctgtg ctttcagagt tgcatttttt ctctggtatt ctggttcaag catttgaagg 5280
taggaaaggt tctccaagtg caagaaagcc agccctgagc ctcaactgcc tggctagtgt 5340
ggtcagtagg atgcaaaggc tgttgaatgc cacaaggcca aactttaacc tgtgtaccac 5400
aagcctagca gcagaggcag ctctgctcac tggaactctc tgtcttcttt ctcctgagcc 5460
ttttcttttc ctgagttttc tagctctcct caaccttacc tctgccctac ccaggacaaa 5520
cccaagagcc actgtttctg tgatgtcctc tccagcccta attaggcatc atgacttcag 5580
cctgaccttc catgctcaga agcagtgcta atccacttca gatgagctgc tctatgcaac 5640
acaggcagag cctacaaacc tttgcaccag agccctccac atatcagtgt ttgttcatac 5700
tcacttcaac agcaaatgtg actgctgaga ttaagatttt acacaagatg gtctgtaatt 5760
tcacagttag ttttatccca ttaggtatga aagaattagc ataattcccc ttaaacatga 5820
atgaatctta gattttttaa taaatagttt tggaagtaaa gacagagaca tcaggagcac 5880
aaggaatagc ctgagaggac aaacagaaca agaaagagtc tggaaataca caggatgttc 5940
ttggcctcct caaagcaagt gcaagcagat agtaccagca gccccaggct atcagagccc 6000
agtgaagaga agtaccatga aagccacagc tctaaccacc ctgttccaga gtgacagaca 6060
gtccccaaga caagccagcc tgagccagag agagaactgc aagagaaagt ttctaattta 6120
ggttctgtta gattcagaca agtgcaggtc atcctctctc cacagctact cacctctcca 6180
gcctaacaaa gcctgcagtc cacactccaa ccctggtgtc tcacctccta gcctctccca 6240
acatcctgct ctctgaccat cttctgcatc tctcatctca ccatctccca ctgtctacag 6300
cctactcttg caactaccat ctcattttct gacatcctgt ctacatcttc tgccatactc 6360
tgccatctac cataccacct cttaccatct accacaccat cttttatctc catccctctc 6420
agaagcctcc aagctgaatc ctgctttatg tgttcatctc agcccctgca tggaaagctg 6480
accccagagg cagaactatt cccagagagc ttggccaaga aaaacaaaac taccagcctg 6540
gccaggctca ggagtagtaa gctgcagtgt ctgttgtgtt ctagcttcaa cagctgcagg 6600
agttccactc tcaaatgctc cacatttctc acatcctcct gattctggtc actacccatc 6660
ttcaaagaac agaatatctc acatcagcat actgtgaagg actagtcatg ggtgcagctg 6720
ctcagagctg caaagtcatt ctggatggtg gagagcttac aaacatttca tgatgctccc 6780
cccgctctga tggctggagc ccaatcccta cacagactcc tgctgtatgt gttttccttt 6840
cactctgagc cacagccaga gggcaggcat tcagtctcct cttcaggctg gggctggggc 6900
actgagaact cacccaacac cttgctctca ctccttctgc aaaacaagaa agagctttgt 6960
gctgcagtag ccatgaagaa tgaaaggaag gctttaacta aaaaatgtca gagattattt 7020
tcaacccctt actgtggatc accagcaagg aggaaacaca acacagagac attttttccc 7080
ctcaaattat caaaagaatc actgcatttg ttaaagagag caactgaatc aggaagcaga 7140
gttttgaaca tatcagaagt taggaatctg catcagagac aaatgcagtc atggttgttt 7200
gctgcatacc agccctaatc attagaagcc tcatggactt caaacatcat tccctctgac 7260
aagatgctct agcctaactc catgagataa aataaatctg cctttcagag ccaaagaaga 7320
gtccaccagc ttcttctcag tgtgaacaag agctccagtc aggttagtca gtccagtgca 7380
gtagaggaga ccagtctgca tcctctaatt ttcaaaggca agaagatttg tttaccctgg 7440
acaccaggca caagtgaggt cacagagctc ttagatatgc agtcctcatg agtgaggaga 7500
ctaaagcgca tgccatcaag acttcagtgt agagaaaacc tccaaaaaag cctcctcact 7560
acttctggaa tagctcagag gccgaggcgg cctcggcctc tgcataaata aaaaaaatta 7620
gtcagccatg gggcggagaa tgggcggaac tgggcggagt taggggcggg atgggcggag 7680
ttaggggcgg gactatggtt gctgactaat tgagatgcat gctttgcata cttctgcctg 7740
ctggggagcc tggggacttt ccacacctgg ttgctgacta attgagatgc atgctttgca 7800
tacttctgcc tgctggggag cctggggact ttccacaccc taactgacac acattccaca 7860
gctgcattaa tgaatcggcc aacgcgcggg gagaggcggt ttgcgtattg ggcgctcttc 7920
cgcttcctcg ctcactgact cgctgcgctc ggtcgttcgg ctgcggcgag cggtatcagc 7980
tcactcaaag gcggtaatac ggttatccac agaatcaggg gataacgcag gaaagaacat 8040
gtgagcaaaa ggccagcaaa aggccaggaa ccgtaaaaag gccgcgttgc tggcgttttt 8100
ccataggctc cgcccccctg acgagcatca caaaaatcga cgctcaagtc agaggtggcg 8160
aaacccgaca ggactataaa gataccaggc gtttccccct ggaagctccc tcgtgcgctc 8220
tcctgttccg accctgccgc ttaccggata cctgtccgcc tttctccctt cgggaagcgt 8280
ggcgctttct catagctcac gctgtaggta tctcagttcg gtgtaggtcg ttcgctccaa 8340
gctgggctgt gtgcacgaac cccccgttca gcccgaccgc tgcgccttat ccggtaacta 8400
tcgtcttgag tccaacccgg taagacacga cttatcgcca ctggcagcag ccactggtaa 8460
caggattagc agagcgaggt atgtaggcgg tgctacagag ttcttgaagt ggtggcctaa 8520
ctacggctac actagaagaa cagtatttgg tatctgcgct ctgctgaagc cagttacctt 8580
cggaaaaaga gttggtagct cttgatccgg caaacaaacc accgctggta gcggtggttt 8640
ttttgtttgc aagcagcaga ttacgcgcag aaaaaaagga tctcaagaag atcctttgat 8700
cttttctacg gggtctgacg ctcagtggaa cgaaaactca cgttaaggga ttttggtcat 8760
gagattatca aaaaggatct tcacctagat ccttttaaat taaaaatgaa gttttaaatc 8820
aatctaaagt atatatgagt aaacttggtc tgacagttac caatgcttaa tcagtgaggc 8880
acctatctca gcgatctgtc tatttcgttc atccatagtt gcctgactcc tgcaaaccac 8940
gttgtgtctc aaaatctctg atgttacatt gcacaagata aaaatatatc atcatgaaca 9000
ataaaactgt ctgcttacat aaacagtaat acaaggggtg ttatgagcca tattcaacgg 9060
gaaacgtctt gctcgaggcc gcgattaaat tccaacatgg atgctgattt atatgggtat 9120
aaatgggctc gcgataatgt cgggcaatca ggtgcgacaa tctatcgatt gtatgggaag 9180
cccgatgcgc cagagttgtt tctgaaacat ggcaaaggta gcgttgccaa tgatgttaca 9240
gatgagatgg tcagactaaa ctggctgacg gaatttatgc ctcttccgac catcaagcat 9300
tttatccgta ctcctgatga tgcatggtta ctcaccactg cgatccccgg gaaaacagca 9360
ttccaggtat tagaagaata tcctgattca ggtgaaaata ttgttgatgc gctggcagtg 9420
ttcctgcgcc ggttgcattc gattcctgtt tgtaattgtc cttttaacag cgatcgcgta 9480
tttcgtctcg ctcaggcgca atcacgaatg aataacggtt tggttgatgc gagtgatttt 9540
gatgacgagc gtaatggctg gcctgttgaa caagtctgga aagaaatgca taagcttttg 9600
ccattctcac cggattcagt cgtcactcat ggtgatttct cacttgataa ccttattttt 9660
gacgagggga aattaatagg ttgtattgat gttggacgag tcggaatcgc agaccgatac 9720
caggatcttg ccatcctatg gaactgcctc ggtgagtttt ctccttcatt acagaaacgg 9780
ctttttcaaa aatatggtat tgataatcct gatatgaata aattgcagtt tcatttgatg 9840
ctcgatgagt ttttctaagg gcggcctgcc accataccca cgccgaaaca agcgctcatg 9900
agcccgaagt ggcgagcccg atcttcccca tcggtgatgt cggcgatata ggcgccagca 9960
accgcacctg tggcgccggt gatgagggcg cgccaagtcg acgtccggca gtc 10013
<210> 102
<211> 10849
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 102
ttggccactc cctctctgcg cgctcgctcg ctcactgagg ccgggcgacc aaaggtcgcc 60
cgacgcccgg gctttgcccg ggcggcctca gtgagcgagc gagcgcgcag agagggagtg 120
gccaactcca tcactagggg ttcctgctag ctctgggtat ttaagcccga gtgagcacgc 180
agggtctcca ttttgaagcg ggaggttacg cgttcgtcga ctactagtgg gtaccagagc 240
tccctaggtt ctagaaccgg tgacgtctcc catggtgaag cttggatctg aattcggtac 300
cctagttatt aatagtaatc aattacgggg tcattagttc atagcccata tatggagttc 360
cgcgttacat aacttacggt aaatggcccg cctggctgac cgcccaacga cccccgccca 420
ttgacgtcaa taatgacgta tgttcccata gtaacgccaa tagggacttt ccattgacgt 480
caatgggtgg actatttacg gtaaactgcc cacttggcag tacatcaagt gtatcatatg 540
ccaagtacgc cccctattga cgtcaatgac ggtaaatggc ccgcctggca ttatgcccag 600
tacatgacct tatgggactt tcctacttgg cagtacatct acgtattagt catcgctatt 660
accatggtcg aggtgagccc cacgttctgc ttcactctcc ccatctcccc cccctcccca 720
cccccaattt tgtatttatt tattttttaa ttattttgtg cagcgatggg ggcggggggg 780
gggggggggc gcgcgccagg cggggcgggg cggggcgagg ggcggggcgg ggcgaggcgg 840
agaggtgcgg cggcagccaa tcagagcggc gcgctccgaa agtttccttt tatggcgagg 900
cggcggcggc ggcggcccta taaaaagcga agcgcgcggc gggcgggagt cgctgcgacg 960
ctgccttcgc cccgtgcccc gctccgccgc cgcctcgcgc cgcccgcccc ggctctgact 1020
gaccgcgtta ctcccacagg tgagcgggcg ggacggccct tctcctccgg gctgtaatta 1080
gcgcttggtt taatgacggc ttgtctggag gcttgctttg ggctgtatgc tgtggaagac 1140
ttcgagatac actgtttttg gcctctgact gaacagtgtt ctgaagtctt ccacaggaca 1200
caaggccctt tatcagcact cacatggaac aaatggccac cgtgggagga tgacaatttc 1260
tgtggctgcg tgaaagcctt gaggggctcc gggagctaga gcctctgcta accatgttca 1320
tgccttcttc tttttcctac agctcctggg caacgtgctg gttattgtgc tgtctcatca 1380
ttttggcaaa gaattcctcg aagatccgaa gggaaagtct tccacgactg tgggatccgt 1440
tcgaagatat caccggttga gccaccatgg aattcagcag ccccagcaga gaggaatgcc 1500
ccaagcctct gagccgggtg tcaatcatgg ccggatctct gacaggactg ctgctgcttc 1560
aggccgtgtc ttgggcttct ggcgctagac cttgcatccc caagagcttc ggctacagca 1620
gcgtcgtgtg cgtgtgcaat gccacctact gcgacagctt cgaccctcct acctttcctg 1680
ctctgggcac cttcagcaga tacgagagca ccagatccgg cagacggatg gaactgagca 1740
tgggacccat ccaggccaat cacacaggca ctggcctgct gctgacactg cagcctgagc 1800
agaaattcca gaaagtgaaa ggcttcggcg gagccatgac agatgccgcc gctctgaata 1860
tcctggctct gtctccacca gctcagaacc tgctgctcaa gagctacttc agcgaggaag 1920
gcatcggcta caacatcatc agagtgccca tggccagctg cgacttcagc atcaggacct 1980
acacctacgc cgacacaccc gacgatttcc agctgcacaa cttcagcctg cctgaagagg 2040
acaccaagct gaagatccct ctgatccaca gagccctgca gctggcacaa agacccgtgt 2100
cactgctggc ctctccatgg acatctccca cctggctgaa aacaaatggc gccgtgaatg 2160
gcaagggcag cctgaaaggc caacctggcg acatctacca ccagacctgg gccagatact 2220
tcgtgaagtt cctggacgcc tatgccgagc acaagctgca gttttgggcc gtgacagccg 2280
agaacgaacc ttctgctgga ctgctgagcg gctacccctt tcagtgcctg ggctttacac 2340
ccgagcacca gcgggacttt atcgcccgtg atctgggacc cacactggcc aatagcaccc 2400
accataatgt gcggctgctg atgctggacg accagagact gcttctgccc cactgggcta 2460
aagtggtgct gacagatcct gaggccgcca aatacgtgca cggaatcgcc gtgcactggt 2520
atctggactt tctggcccct gccaaggcca cactgggaga gacacacaga ctgttcccca 2580
acaccatgct gttcgccagc gaagcctgtg tgggcagcaa gttttgggaa cagagcgtgc 2640
ggctcggcag ctgggataga ggcatgcagt acagccacag catcatcacc aacctgctgt 2700
accacgtcgt cggctggacc gactggaatc tggccctgaa tcctgaaggc ggccctaact 2760
gggtccgaaa cttcgtggac agccccatca tcgtggacat caccaaggac accttctaca 2820
agcagcccat gttctaccac ctgggacact tcagcaagtt catccccgag ggctctcagc 2880
gcgttggact ggtggcttcc cagaagaacg atctggacgc cgtggctctg atgcaccctg 2940
atggatctgc tgtggtggtg gtcctgaacc gcagcagcaa agatgtgccc ctgaccatca 3000
aggatcccgc cgtgggattc ctggaaacaa tcagccctgg ctactccatc cacacctacc 3060
tgtggcgtag acagtgacaa ttgttaatta agtttaaacc ctcgaggccg caagcttatc 3120
gataatcaac ctctggatta caaaatttgt gaaagattga ctggtattct taactatgtt 3180
gctcctttta cgctatgtgg atacgctgct ttaatgcctt tgtatcatgc tattgcttcc 3240
cgtatggctt tcattttctc ctccttgtat aaatcctggt tgctgtctct ttatgaggag 3300
ttgtggcccg ttgtcaggca acgtggcgtg gtgtgcactg tgtttgctga cgcaaccccc 3360
actggttggg gcattgccac cacctgtcag ctcctttccg ggactttcgc tttccccctc 3420
cctattgcca cggcggaact catcgccgcc tgccttgccc gctgctggac aggggctcgg 3480
ctgttgggca ctgacaattc cgtggtgttg tcggggaaat catcgtcctt tccttggctg 3540
ctcgcctgtg ttgccacctg gattctgcgc gggacgtcct tctgctacgt cccttcggcc 3600
ctcaatccag cggaccttcc ttcccgcggc ctgctgccgg ctctgcggcc tcttccgcgt 3660
cttcgccttc gccctcagac gagtcggatc tccctttggg ccgcctcccc gcatcgatac 3720
cgtcgactag agctcgctga tcagcctcga ctgtgccttc tagttgccag ccatctgttg 3780
tttgcccctc ccccgtgcct tccttgaccc tggaaggtgc cactcccact gtcctttcct 3840
aataaaatga ggaaattgca tcgcattgtc tgagtaggtg tcattctatt ctggggggtg 3900
gggtggggca ggacagcaag ggggaggatt gggaagacaa tagcaggcat gctggggaga 3960
gatccacgat aacaaacagc ttttttgggg tgaacatatt gactgaattc cctgcaggtt 4020
ggccactccc tctctgcgcg ctcgctcgct cactgaggcc gcccgggcaa agcccgggcg 4080
tcgggcgacc tttggtcgcc cggcctcagt gagcgagcga gcgcgcagag agggagtggc 4140
caactccatc actaggggtt cctgcggccg ctcgtacggt ctcgaggaat tcctgcagga 4200
taacttgcca acctcattct aaaatgtata tagaagccca aaagacaata acaaaaatat 4260
tcttgtagaa caaaatggga aagaatgttc cactaaatat caagatttag agcaaagcat 4320
gagatgtgtg gggatagaca gtgaggctga taaaatagag tagagctcag aaacagaccc 4380
attgatatat gtaagtgacc tatgaaaaaa atatggcatt ttacaatggg aaaatgatgg 4440
tctttttctt ttttagaaaa acagggaaat atatttatat gtaaaaaata aaagggaacc 4500
catatgtcat accatacaca caaaaaaatt ccagtgaatt ataagtctaa atggagaagg 4560
caaaacttta aatcttttag aaaataatat agaagcatgc agaccagcct ggccaacatg 4620
atgaaaccct ctctactaat aataaaatca gtagaactac tcaggactac tttgagtggg 4680
aagtcctttt ctatgaagac ttctttggcc aaaattaggc tctaaatgca aggagatagt 4740
gcatcatgcc tggctgcact tactgataaa tgatgttatc accatcttta accaaatgca 4800
caggaacaag ttatggtact gatgtgctgg attgagaagg agctctactt ccttgacagg 4860
acacatttgt atcaacttaa aaaagcagat ttttgccagc agaactattc attcagaggt 4920
aggaaactta gaatagatga tgtcactgat tagcatggct tccccatctc cacagctgct 4980
tcccacccag gttgcccaca gttgagtttg tccagtgctc agggctgccc actctcagta 5040
agaagcccca caccagcccc tctccaaata tgttggctgt tccttccatt aaagtgaccc 5100
cactttagag cagcaagtgg atttctgttt cttacagttc aggaaggagg agtcagctgt 5160
gagaacctgg agcctgagat gcttctaagt cccactgcta ctggggtcag ggaagccaga 5220
ctccagcatc agcagtcagg agcactaagc ccttgccaac atcctgtttc tcagagaaac 5280
tgcttccatt ataatggttg tcctttttta agctatcaag ccaaacaacc agtgtctacc 5340
attattctca tcacctgaag ccaagggttc tagcaaaagt caagctgtct tgtaatggtt 5400
gatgtgcctc cagcttctgt cttcagtcac tccactctta gcctgctctg aatcaactct 5460
gaccacagtt ccctggagcc cctgccacct gctgcccctg ccaccttctc catctgcagt 5520
gctgtgcagc cttctgcact cttgcagagc taataggtgg agacttgaag gaagaggagg 5580
aaagtttctc ataatagcct tgctgcaagc tcaaatggga ggtgggcact gtgcccagga 5640
gccttggagc aaaggctgtg cccaacctct gactgcatcc aggtttggtc ttgacagaga 5700
taagaagccc tggcttttgg agccaaaatc taggtcagac ttaggcagga ttctcaaagt 5760
ttatcagcag aacatgaggc agaagaccct ttctgctcca gcttcttcag gctcaacctt 5820
catcagaata gatagaaaga gaggctgtga gggttcttaa aacagaagca aatctgactc 5880
agagaataaa caacctccta gtaaactaca gcttagacag agcatctggt ggtgagtgtg 5940
ctcagtgtcc tactcaactg tctggtatca gccctcatga ggacttctct tctttccctc 6000
atagacctcc atctctgttt tccttagcct gcagaaatct ggatggctat tcacagaatg 6060
cctgtgcttt cagagttgca ttttttctct ggtattctgg ttcaagcatt tgaaggtagg 6120
aaaggttctc caagtgcaag aaagccagcc ctgagcctca actgcctggc tagtgtggtc 6180
agtaggatgc aaaggctgtt gaatgccaca aggccaaact ttaacctgtg taccacaagc 6240
ctagcagcag aggcagctct gctcactgga actctctgtc ttctttctcc tgagcctttt 6300
cttttcctga gttttctagc tctcctcaac cttacctctg ccctacccag gacaaaccca 6360
agagccactg tttctgtgat gtcctctcca gccctaatta ggcatcatga cttcagcctg 6420
accttccatg ctcagaagca gtgctaatcc acttcagatg agctgctcta tgcaacacag 6480
gcagagccta caaacctttg caccagagcc ctccacatat cagtgtttgt tcatactcac 6540
ttcaacagca aatgtgactg ctgagattaa gattttacac aagatggtct gtaatttcac 6600
agttagtttt atcccattag gtatgaaaga attagcataa ttccccttaa acatgaatga 6660
atcttagatt ttttaataaa tagttttgga agtaaagaca gagacatcag gagcacaagg 6720
aatagcctga gaggacaaac agaacaagaa agagtctgga aatacacagg atgttcttgg 6780
cctcctcaaa gcaagtgcaa gcagatagta ccagcagccc caggctatca gagcccagtg 6840
aagagaagta ccatgaaagc cacagctcta accaccctgt tccagagtga cagacagtcc 6900
ccaagacaag ccagcctgag ccagagagag aactgcaaga gaaagtttct aatttaggtt 6960
ctgttagatt cagacaagtg caggtcatcc tctctccaca gctactcacc tctccagcct 7020
aacaaagcct gcagtccaca ctccaaccct ggtgtctcac ctcctagcct ctcccaacat 7080
cctgctctct gaccatcttc tgcatctctc atctcaccat ctcccactgt ctacagccta 7140
ctcttgcaac taccatctca ttttctgaca tcctgtctac atcttctgcc atactctgcc 7200
atctaccata ccacctctta ccatctacca caccatcttt tatctccatc cctctcagaa 7260
gcctccaagc tgaatcctgc tttatgtgtt catctcagcc cctgcatgga aagctgaccc 7320
cagaggcaga actattccca gagagcttgg ccaagaaaaa caaaactacc agcctggcca 7380
ggctcaggag tagtaagctg cagtgtctgt tgtgttctag cttcaacagc tgcaggagtt 7440
ccactctcaa atgctccaca tttctcacat cctcctgatt ctggtcacta cccatcttca 7500
aagaacagaa tatctcacat cagcatactg tgaaggacta gtcatgggtg cagctgctca 7560
gagctgcaaa gtcattctgg atggtggaga gcttacaaac atttcatgat gctccccccg 7620
ctctgatggc tggagcccaa tccctacaca gactcctgct gtatgtgttt tcctttcact 7680
ctgagccaca gccagagggc aggcattcag tctcctcttc aggctggggc tggggcactg 7740
agaactcacc caacaccttg ctctcactcc ttctgcaaaa caagaaagag ctttgtgctg 7800
cagtagccat gaagaatgaa aggaaggctt taactaaaaa atgtcagaga ttattttcaa 7860
ccccttactg tggatcacca gcaaggagga aacacaacac agagacattt tttcccctca 7920
aattatcaaa agaatcactg catttgttaa agagagcaac tgaatcagga agcagagttt 7980
tgaacatatc agaagttagg aatctgcatc agagacaaat gcagtcatgg ttgtttgctg 8040
cataccagcc ctaatcatta gaagcctcat ggacttcaaa catcattccc tctgacaaga 8100
tgctctagcc taactccatg agataaaata aatctgcctt tcagagccaa agaagagtcc 8160
accagcttct tctcagtgtg aacaagagct ccagtcaggt tagtcagtcc agtgcagtag 8220
aggagaccag tctgcatcct ctaattttca aaggcaagaa gatttgttta ccctggacac 8280
caggcacaag tgaggtcaca gagctcttag atatgcagtc ctcatgagtg aggagactaa 8340
agcgcatgcc atcaagactt cagtgtagag aaaacctcca aaaaagcctc ctcactactt 8400
ctggaatagc tcagaggccg aggcggcctc ggcctctgca taaataaaaa aaattagtca 8460
gccatggggc ggagaatggg cggaactggg cggagttagg ggcgggatgg gcggagttag 8520
gggcgggact atggttgctg actaattgag atgcatgctt tgcatacttc tgcctgctgg 8580
ggagcctggg gactttccac acctggttgc tgactaattg agatgcatgc tttgcatact 8640
tctgcctgct ggggagcctg gggactttcc acaccctaac tgacacacat tccacagctg 8700
cattaatgaa tcggccaacg cgcggggaga ggcggtttgc gtattgggcg ctcttccgct 8760
tcctcgctca ctgactcgct gcgctcggtc gttcggctgc ggcgagcggt atcagctcac 8820
tcaaaggcgg taatacggtt atccacagaa tcaggggata acgcaggaaa gaacatgtga 8880
gcaaaaggcc agcaaaaggc caggaaccgt aaaaaggccg cgttgctggc gtttttccat 8940
aggctccgcc cccctgacga gcatcacaaa aatcgacgct caagtcagag gtggcgaaac 9000
ccgacaggac tataaagata ccaggcgttt ccccctggaa gctccctcgt gcgctctcct 9060
gttccgaccc tgccgcttac cggatacctg tccgcctttc tcccttcggg aagcgtggcg 9120
ctttctcata gctcacgctg taggtatctc agttcggtgt aggtcgttcg ctccaagctg 9180
ggctgtgtgc acgaaccccc cgttcagccc gaccgctgcg ccttatccgg taactatcgt 9240
cttgagtcca acccggtaag acacgactta tcgccactgg cagcagccac tggtaacagg 9300
attagcagag cgaggtatgt aggcggtgct acagagttct tgaagtggtg gcctaactac 9360
ggctacacta gaagaacagt atttggtatc tgcgctctgc tgaagccagt taccttcgga 9420
aaaagagttg gtagctcttg atccggcaaa caaaccaccg ctggtagcgg tggttttttt 9480
gtttgcaagc agcagattac gcgcagaaaa aaaggatctc aagaagatcc tttgatcttt 9540
tctacggggt ctgacgctca gtggaacgaa aactcacgtt aagggatttt ggtcatgaga 9600
ttatcaaaaa ggatcttcac ctagatcctt ttaaattaaa aatgaagttt taaatcaatc 9660
taaagtatat atgagtaaac ttggtctgac agttaccaat gcttaatcag tgaggcacct 9720
atctcagcga tctgtctatt tcgttcatcc atagttgcct gactcctgca aaccacgttg 9780
tgtctcaaaa tctctgatgt tacattgcac aagataaaaa tatatcatca tgaacaataa 9840
aactgtctgc ttacataaac agtaatacaa ggggtgttat gagccatatt caacgggaaa 9900
cgtcttgctc gaggccgcga ttaaattcca acatggatgc tgatttatat gggtataaat 9960
gggctcgcga taatgtcggg caatcaggtg cgacaatcta tcgattgtat gggaagcccg 10020
atgcgccaga gttgtttctg aaacatggca aaggtagcgt tgccaatgat gttacagatg 10080
agatggtcag actaaactgg ctgacggaat ttatgcctct tccgaccatc aagcatttta 10140
tccgtactcc tgatgatgca tggttactca ccactgcgat ccccgggaaa acagcattcc 10200
aggtattaga agaatatcct gattcaggtg aaaatattgt tgatgcgctg gcagtgttcc 10260
tgcgccggtt gcattcgatt cctgtttgta attgtccttt taacagcgat cgcgtatttc 10320
gtctcgctca ggcgcaatca cgaatgaata acggtttggt tgatgcgagt gattttgatg 10380
acgagcgtaa tggctggcct gttgaacaag tctggaaaga aatgcataag cttttgccat 10440
tctcaccgga ttcagtcgtc actcatggtg atttctcact tgataacctt atttttgacg 10500
aggggaaatt aataggttgt attgatgttg gacgagtcgg aatcgcagac cgataccagg 10560
atcttgccat cctatggaac tgcctcggtg agttttctcc ttcattacag aaacggcttt 10620
ttcaaaaata tggtattgat aatcctgata tgaataaatt gcagtttcat ttgatgctcg 10680
atgagttttt ctaagggcgg cctgccacca tacccacgcc gaaacaagcg ctcatgagcc 10740
cgaagtggcg agcccgatct tccccatcgg tgatgtcggc gatataggcg ccagcaaccg 10800
cacctgtggc gccggtgatg agggcgcgcc aagtcgacgt ccggcagtc 10849
<210> 103
<211> 11231
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 103
ttggccactc cctctctgcg cgctcgctcg ctcactgagg ccgggcgacc aaaggtcgcc 60
cgacgcccgg gctttgcccg ggcggcctca gtgagcgagc gagcgcgcag agagggagtg 120
gccaactcca tcactagggg ttcctgctag ctctgggtat ttaagcccga gtgagcacgc 180
agggtctcca ttttgaagcg ggaggttacg cgttcgtcga ctactagtgg gtaccagagc 240
ttgtcatcct cccacggtgg ccatttgttc catgtgagtg ctagtaacag gccttgtgtc 300
cttttagaaa taagtggtag tcacatctgt ggcttcactt gactaccact tatttctaaa 360
gacaacagca tacagccttc agcaagcctc cagtggtctc atacagaact tataagattc 420
ccaaatccaa agacatttca cgtttatggt gatttcccag aacacatagc gacatgcaaa 480
tattgcaggg cgccactccc ctgtccctca cagccatctt cctgccaggg cgcacgcgcg 540
ctgggtgttc ccgcctagtg acactgggcc cgcgattcct tggagcgggt tgatgacgtc 600
agcgtttccc atggtgaagc ttggatctga tccctaggtt ctagaaccgg tgacattcgg 660
taccctagtt attaatagta atcaattacg gggtcattag ttcatagccc atatatggag 720
ttccgcgtta cataacttac ggtaaatggc ccgcctggct gaccgcccaa cgacccccgc 780
ccattgacgt caataatgac gtatgttccc atagtaacgc caatagggac tttccattga 840
cgtcaatggg tggactattt acggtaaact gcccacttgg cagtacatca agtgtatcat 900
atgccaagta cgccccctat tgacgtcaat gacggtaaat ggcccgcctg gcattatgcc 960
cagtacatga ccttatggga ctttcctact tggcagtaca tctacgtatt agtcatcgct 1020
attaccatgg tcgaggtgag ccccacgttc tgcttcactc tccccatctc ccccccctcc 1080
ccacccccaa ttttgtattt atttattttt taattatttt gtgcagcgat gggggcgggg 1140
gggggggggg ggcgcgcgcc aggcggggcg gggcggggcg aggggcgggg cggggcgagg 1200
cggagaggtg cggcggcagc caatcagagc ggcgcgctcc gaaagtttcc ttttatggcg 1260
aggcggcggc ggcggcggcc ctataaaaag cgaagcgcgc ggcgggcggg agtcgctgcg 1320
acgctgcctt cgccccgtgc cccgctccgc cgccgcctcg cgccgcccgc cccggctctg 1380
actgaccgcg ttactcccac aggtgagcgg gcgggacggc ccttctcctc cgggctgtaa 1440
ttagcgcttg gtttaatgac ggcttgtttt ctgtggctgc gtgaaagcct tgaggggctc 1500
cgggagctag agcctctgct aaccatgttc atgccttctt ctttttccta cagctcctgg 1560
gcaacgtgct ggttattgtg ctgtctcatc attttggcaa agaattcctc gaagatccga 1620
agggaaagtc ttccacgact gtgggatccg ttcgaagata tcaccggttg agccaccatg 1680
tggaccctgg tgagctgggt ggccctgacc gccggcctgg tggccggcac ccgctgcccc 1740
gacggccagt tctgccccgt ggcctgctgc ctggaccccg gcggcgccag ctacagctgc 1800
tgccgccccc tgctggacaa gtggcccacc accctgagcc gccacctggg cggcccctgc 1860
caggtggacg cccactgcag cgccggccac agctgcatct tcaccgtgag cggcaccagc 1920
agctgctgcc ccttccccga ggccgtggcc tgcggcgacg gccaccactg ctgcccccgc 1980
ggcttccact gcagcgccga cggccgcagc tgcttccagc gcagcggcaa caacagcgtg 2040
ggcgccatcc agtgccccga cagccagttc gagtgccccg acttcagcac ctgctgcgtg 2100
atggtggacg gcagctgggg ctgctgcccc atgccccagg ccagctgctg cgaggaccgc 2160
gtgcactgct gcccccacgg cgccttctgc gacctggtgc acacccgctg catcaccccc 2220
accggcaccc accccctggc caagaagctg cccgcccagc gcaccaaccg cgccgtggcc 2280
ctgagcagca gcgtgatgtg ccccgacgcc cgcagccgct gccccgacgg cagcacctgc 2340
tgcgagctgc ccagcggcaa gtacggctgc tgccccatgc ccaacgccac ctgctgcagc 2400
gaccacctgc actgctgccc ccaggacacc gtgtgcgacc tgatccagag caagtgcctg 2460
agcaaggaga acgccaccac cgacctgctg accaagctgc ccgcccacac cgtgggcgac 2520
gtgaagtgcg acatggaggt gagctgcccc gacggctaca cctgctgccg cctgcagagc 2580
ggcgcctggg gctgctgccc cttcacccag gccgtgtgct gcgaggacca catccactgc 2640
tgccccgccg gcttcacctg cgacacccag aagggcacct gcgagcaggg cccccaccag 2700
gtgccctgga tggagaaggc ccccgcccac ctgagcctgc ccgaccccca ggccctgaag 2760
cgcgacgtgc cctgcgacaa cgtgagcagc tgccccagca gcgacacctg ctgccagctg 2820
accagcggcg agtggggctg ctgccccatc cccgaggccg tgtgctgcag cgaccaccag 2880
cactgctgcc cccagggcta cacctgcgtg gccgagggcc agtgccagcg cggcagcgag 2940
atcgtggccg gcctggagaa gatgcccgcc cgccgcgcca gcctgagcca cccccgcgac 3000
atcggctgcg accagcacac cagctgcccc gtgggccaga cctgctgccc cagcctgggc 3060
ggcagctggg cctgctgcca gctgccccac gccgtgtgct gcgaggaccg ccagcactgc 3120
tgccccgccg gctacacctg caacgtgaag gcccgcagct gcgagaagga ggtggtgagc 3180
gcccagcccg ccaccttcct ggcccgcagc ccccacgtgg gcgtgaagga cgtggagtgc 3240
ggcgagggcc acttctgcca cgacaaccag acctgctgcc gcgacaaccg ccagggctgg 3300
gcctgctgcc cctaccgcca gggcgtgtgc tgcgccgacc gccgccactg ctgccccgcc 3360
ggcttccgct gcgccgcccg cggcaccaag tgcctgcgcc gcgaggcccc ccgctgggac 3420
gcccccctgc gcgaccccgc cctgcgccag ctgctgtgac aattgttaat taagtttaaa 3480
ccctcgaggc cgcaagctta tcgataatca acctctggat tacaaaattt gtgaaagatt 3540
gactggtatt cttaactatg ttgctccttt tacgctatgt ggatacgctg ctttaatgcc 3600
tttgtatcat gctattgctt cccgtatggc tttcattttc tcctccttgt ataaatcctg 3660
gttgctgtct ctttatgagg agttgtggcc cgttgtcagg caacgtggcg tggtgtgcac 3720
tgtgtttgct gacgcaaccc ccactggttg gggcattgcc accacctgtc agctcctttc 3780
cgggactttc gctttccccc tccctattgc cacggcggaa ctcatcgccg cctgccttgc 3840
ccgctgctgg acaggggctc ggctgttggg cactgacaat tccgtggtgt tgtcggggaa 3900
atcatcgtcc tttccttggc tgctcgcctg tgttgccacc tggattctgc gcgggacgtc 3960
cttctgctac gtcccttcgg ccctcaatcc agcggacctt ccttcccgcg gcctgctgcc 4020
ggctctgcgg cctcttccgc gtcttcgcct tcgccctcag acgagtcgga tctccctttg 4080
ggccgcctcc ccgcatcgat accgtcgact agagctcgct gatcagcctc gactgtgcct 4140
tctagttgcc agccatctgt tgtttgcccc tcccccgtgc cttccttgac cctggaaggt 4200
gccactccca ctgtcctttc ctaataaaat gaggaaattg catcgcattg tctgagtagg 4260
tgtcattcta ttctgggggg tggggtgggg caggacagca agggggagga ttgggaagac 4320
aatagcaggc atgctgggga gagatccacg ataacaaaca gcttttttgg ggtgaacata 4380
ttgactgaat tccctgcagg ttggccactc cctctctgcg cgctcgctcg ctcactgagg 4440
ccgcccgggc aaagcccggg cgtcgggcga cctttggtcg cccggcctca gtgagcgagc 4500
gagcgcgcag agagggagtg gccaactcca tcactagggg ttcctgcggc cgctcgtacg 4560
gtctcgagga attcctgcag gataacttgc caacctcatt ctaaaatgta tatagaagcc 4620
caaaagacaa taacaaaaat attcttgtag aacaaaatgg gaaagaatgt tccactaaat 4680
atcaagattt agagcaaagc atgagatgtg tggggataga cagtgaggct gataaaatag 4740
agtagagctc agaaacagac ccattgatat atgtaagtga cctatgaaaa aaatatggca 4800
ttttacaatg ggaaaatgat ggtctttttc ttttttagaa aaacagggaa atatatttat 4860
atgtaaaaaa taaaagggaa cccatatgtc ataccataca cacaaaaaaa ttccagtgaa 4920
ttataagtct aaatggagaa ggcaaaactt taaatctttt agaaaataat atagaagcat 4980
gcagaccagc ctggccaaca tgatgaaacc ctctctacta ataataaaat cagtagaact 5040
actcaggact actttgagtg ggaagtcctt ttctatgaag acttctttgg ccaaaattag 5100
gctctaaatg caaggagata gtgcatcatg cctggctgca cttactgata aatgatgtta 5160
tcaccatctt taaccaaatg cacaggaaca agttatggta ctgatgtgct ggattgagaa 5220
ggagctctac ttccttgaca ggacacattt gtatcaactt aaaaaagcag atttttgcca 5280
gcagaactat tcattcagag gtaggaaact tagaatagat gatgtcactg attagcatgg 5340
cttccccatc tccacagctg cttcccaccc aggttgccca cagttgagtt tgtccagtgc 5400
tcagggctgc ccactctcag taagaagccc cacaccagcc cctctccaaa tatgttggct 5460
gttccttcca ttaaagtgac cccactttag agcagcaagt ggatttctgt ttcttacagt 5520
tcaggaagga ggagtcagct gtgagaacct ggagcctgag atgcttctaa gtcccactgc 5580
tactggggtc agggaagcca gactccagca tcagcagtca ggagcactaa gcccttgcca 5640
acatcctgtt tctcagagaa actgcttcca ttataatggt tgtccttttt taagctatca 5700
agccaaacaa ccagtgtcta ccattattct catcacctga agccaagggt tctagcaaaa 5760
gtcaagctgt cttgtaatgg ttgatgtgcc tccagcttct gtcttcagtc actccactct 5820
tagcctgctc tgaatcaact ctgaccacag ttccctggag cccctgccac ctgctgcccc 5880
tgccaccttc tccatctgca gtgctgtgca gccttctgca ctcttgcaga gctaataggt 5940
ggagacttga aggaagagga ggaaagtttc tcataatagc cttgctgcaa gctcaaatgg 6000
gaggtgggca ctgtgcccag gagccttgga gcaaaggctg tgcccaacct ctgactgcat 6060
ccaggtttgg tcttgacaga gataagaagc cctggctttt ggagccaaaa tctaggtcag 6120
acttaggcag gattctcaaa gtttatcagc agaacatgag gcagaagacc ctttctgctc 6180
cagcttcttc aggctcaacc ttcatcagaa tagatagaaa gagaggctgt gagggttctt 6240
aaaacagaag caaatctgac tcagagaata aacaacctcc tagtaaacta cagcttagac 6300
agagcatctg gtggtgagtg tgctcagtgt cctactcaac tgtctggtat cagccctcat 6360
gaggacttct cttctttccc tcatagacct ccatctctgt tttccttagc ctgcagaaat 6420
ctggatggct attcacagaa tgcctgtgct ttcagagttg cattttttct ctggtattct 6480
ggttcaagca tttgaaggta ggaaaggttc tccaagtgca agaaagccag ccctgagcct 6540
caactgcctg gctagtgtgg tcagtaggat gcaaaggctg ttgaatgcca caaggccaaa 6600
ctttaacctg tgtaccacaa gcctagcagc agaggcagct ctgctcactg gaactctctg 6660
tcttctttct cctgagcctt ttcttttcct gagttttcta gctctcctca accttacctc 6720
tgccctaccc aggacaaacc caagagccac tgtttctgtg atgtcctctc cagccctaat 6780
taggcatcat gacttcagcc tgaccttcca tgctcagaag cagtgctaat ccacttcaga 6840
tgagctgctc tatgcaacac aggcagagcc tacaaacctt tgcaccagag ccctccacat 6900
atcagtgttt gttcatactc acttcaacag caaatgtgac tgctgagatt aagattttac 6960
acaagatggt ctgtaatttc acagttagtt ttatcccatt aggtatgaaa gaattagcat 7020
aattcccctt aaacatgaat gaatcttaga ttttttaata aatagttttg gaagtaaaga 7080
cagagacatc aggagcacaa ggaatagcct gagaggacaa acagaacaag aaagagtctg 7140
gaaatacaca ggatgttctt ggcctcctca aagcaagtgc aagcagatag taccagcagc 7200
cccaggctat cagagcccag tgaagagaag taccatgaaa gccacagctc taaccaccct 7260
gttccagagt gacagacagt ccccaagaca agccagcctg agccagagag agaactgcaa 7320
gagaaagttt ctaatttagg ttctgttaga ttcagacaag tgcaggtcat cctctctcca 7380
cagctactca cctctccagc ctaacaaagc ctgcagtcca cactccaacc ctggtgtctc 7440
acctcctagc ctctcccaac atcctgctct ctgaccatct tctgcatctc tcatctcacc 7500
atctcccact gtctacagcc tactcttgca actaccatct cattttctga catcctgtct 7560
acatcttctg ccatactctg ccatctacca taccacctct taccatctac cacaccatct 7620
tttatctcca tccctctcag aagcctccaa gctgaatcct gctttatgtg ttcatctcag 7680
cccctgcatg gaaagctgac cccagaggca gaactattcc cagagagctt ggccaagaaa 7740
aacaaaacta ccagcctggc caggctcagg agtagtaagc tgcagtgtct gttgtgttct 7800
agcttcaaca gctgcaggag ttccactctc aaatgctcca catttctcac atcctcctga 7860
ttctggtcac tacccatctt caaagaacag aatatctcac atcagcatac tgtgaaggac 7920
tagtcatggg tgcagctgct cagagctgca aagtcattct ggatggtgga gagcttacaa 7980
acatttcatg atgctccccc cgctctgatg gctggagccc aatccctaca cagactcctg 8040
ctgtatgtgt tttcctttca ctctgagcca cagccagagg gcaggcattc agtctcctct 8100
tcaggctggg gctggggcac tgagaactca cccaacacct tgctctcact ccttctgcaa 8160
aacaagaaag agctttgtgc tgcagtagcc atgaagaatg aaaggaaggc tttaactaaa 8220
aaatgtcaga gattattttc aaccccttac tgtggatcac cagcaaggag gaaacacaac 8280
acagagacat tttttcccct caaattatca aaagaatcac tgcatttgtt aaagagagca 8340
actgaatcag gaagcagagt tttgaacata tcagaagtta ggaatctgca tcagagacaa 8400
atgcagtcat ggttgtttgc tgcataccag ccctaatcat tagaagcctc atggacttca 8460
aacatcattc cctctgacaa gatgctctag cctaactcca tgagataaaa taaatctgcc 8520
tttcagagcc aaagaagagt ccaccagctt cttctcagtg tgaacaagag ctccagtcag 8580
gttagtcagt ccagtgcagt agaggagacc agtctgcatc ctctaatttt caaaggcaag 8640
aagatttgtt taccctggac accaggcaca agtgaggtca cagagctctt agatatgcag 8700
tcctcatgag tgaggagact aaagcgcatg ccatcaagac ttcagtgtag agaaaacctc 8760
caaaaaagcc tcctcactac ttctggaata gctcagaggc cgaggcggcc tcggcctctg 8820
cataaataaa aaaaattagt cagccatggg gcggagaatg ggcggaactg ggcggagtta 8880
ggggcgggat gggcggagtt aggggcggga ctatggttgc tgactaattg agatgcatgc 8940
tttgcatact tctgcctgct ggggagcctg gggactttcc acacctggtt gctgactaat 9000
tgagatgcat gctttgcata cttctgcctg ctggggagcc tggggacttt ccacacccta 9060
actgacacac attccacagc tgcattaatg aatcggccaa cgcgcgggga gaggcggttt 9120
gcgtattggg cgctcttccg cttcctcgct cactgactcg ctgcgctcgg tcgttcggct 9180
gcggcgagcg gtatcagctc actcaaaggc ggtaatacgg ttatccacag aatcagggga 9240
taacgcagga aagaacatgt gagcaaaagg ccagcaaaag gccaggaacc gtaaaaaggc 9300
cgcgttgctg gcgtttttcc ataggctccg cccccctgac gagcatcaca aaaatcgacg 9360
ctcaagtcag aggtggcgaa acccgacagg actataaaga taccaggcgt ttccccctgg 9420
aagctccctc gtgcgctctc ctgttccgac cctgccgctt accggatacc tgtccgcctt 9480
tctcccttcg ggaagcgtgg cgctttctca tagctcacgc tgtaggtatc tcagttcggt 9540
gtaggtcgtt cgctccaagc tgggctgtgt gcacgaaccc cccgttcagc ccgaccgctg 9600
cgccttatcc ggtaactatc gtcttgagtc caacccggta agacacgact tatcgccact 9660
ggcagcagcc actggtaaca ggattagcag agcgaggtat gtaggcggtg ctacagagtt 9720
cttgaagtgg tggcctaact acggctacac tagaagaaca gtatttggta tctgcgctct 9780
gctgaagcca gttaccttcg gaaaaagagt tggtagctct tgatccggca aacaaaccac 9840
cgctggtagc ggtggttttt ttgtttgcaa gcagcagatt acgcgcagaa aaaaaggatc 9900
tcaagaagat cctttgatct tttctacggg gtctgacgct cagtggaacg aaaactcacg 9960
ttaagggatt ttggtcatga gattatcaaa aaggatcttc acctagatcc ttttaaatta 10020
aaaatgaagt tttaaatcaa tctaaagtat atatgagtaa acttggtctg acagttacca 10080
atgcttaatc agtgaggcac ctatctcagc gatctgtcta tttcgttcat ccatagttgc 10140
ctgactcctg caaaccacgt tgtgtctcaa aatctctgat gttacattgc acaagataaa 10200
aatatatcat catgaacaat aaaactgtct gcttacataa acagtaatac aaggggtgtt 10260
atgagccata ttcaacggga aacgtcttgc tcgaggccgc gattaaattc caacatggat 10320
gctgatttat atgggtataa atgggctcgc gataatgtcg ggcaatcagg tgcgacaatc 10380
tatcgattgt atgggaagcc cgatgcgcca gagttgtttc tgaaacatgg caaaggtagc 10440
gttgccaatg atgttacaga tgagatggtc agactaaact ggctgacgga atttatgcct 10500
cttccgacca tcaagcattt tatccgtact cctgatgatg catggttact caccactgcg 10560
atccccggga aaacagcatt ccaggtatta gaagaatatc ctgattcagg tgaaaatatt 10620
gttgatgcgc tggcagtgtt cctgcgccgg ttgcattcga ttcctgtttg taattgtcct 10680
tttaacagcg atcgcgtatt tcgtctcgct caggcgcaat cacgaatgaa taacggtttg 10740
gttgatgcga gtgattttga tgacgagcgt aatggctggc ctgttgaaca agtctggaaa 10800
gaaatgcata agcttttgcc attctcaccg gattcagtcg tcactcatgg tgatttctca 10860
cttgataacc ttatttttga cgaggggaaa ttaataggtt gtattgatgt tggacgagtc 10920
ggaatcgcag accgatacca ggatcttgcc atcctatgga actgcctcgg tgagttttct 10980
ccttcattac agaaacggct ttttcaaaaa tatggtattg ataatcctga tatgaataaa 11040
ttgcagtttc atttgatgct cgatgagttt ttctaagggc ggcctgccac catacccacg 11100
ccgaaacaag cgctcatgag cccgaagtgg cgagcccgat cttccccatc ggtgatgtcg 11160
gcgatatagg cgccagcaac cgcacctgtg gcgccggtga tgagggcgcg ccaagtcgac 11220
gtccggcagt c 11231
<210> 104
<211> 10876
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 104
ttggccactc cctctctgcg cgctcgctcg ctcactgagg ccgggcgacc aaaggtcgcc 60
cgacgcccgg gctttgcccg ggcggcctca gtgagcgagc gagcgcgcag agagggagtg 120
gccaactcca tcactagggg ttcctgctag ctctgggtat ttaagcccga gtgagcacgc 180
agggtctcca ttttgaagcg ggaggttacg cgttcgtcga ctactagtgg gtaccagagc 240
tccctaggtt ctagaaccgg tgacgtctcc catggtgaag cttggatctg aattcggtac 300
cctagttatt aatagtaatc aattacgggg tcattagttc atagcccata tatggagttc 360
cgcgttacat aacttacggt aaatggcccg cctggctgac cgcccaacga cccccgccca 420
ttgacgtcaa taatgacgta tgttcccata gtaacgccaa tagggacttt ccattgacgt 480
caatgggtgg actatttacg gtaaactgcc cacttggcag tacatcaagt gtatcatatg 540
ccaagtacgc cccctattga cgtcaatgac ggtaaatggc ccgcctggca ttatgcccag 600
tacatgacct tatgggactt tcctacttgg cagtacatct acgtattagt catcgctatt 660
accatggtcg aggtgagccc cacgttctgc ttcactctcc ccatctcccc cccctcccca 720
cccccaattt tgtatttatt tattttttaa ttattttgtg cagcgatggg ggcggggggg 780
gggggggggc gcgcgccagg cggggcgggg cggggcgagg ggcggggcgg ggcgaggcgg 840
agaggtgcgg cggcagccaa tcagagcggc gcgctccgaa agtttccttt tatggcgagg 900
cggcggcggc ggcggcccta taaaaagcga agcgcgcggc gggcgggagt cgctgcgacg 960
ctgccttcgc cccgtgcccc gctccgccgc cgcctcgcgc cgcccgcccc ggctctgact 1020
gaccgcgtta ctcccacagg tgagcgggcg ggacggccct tctcctcagc gctgtaatta 1080
gcgcttggtt taatgacggc ttgttggagg cttgctgaag gctgtatgct gttgtcctcg 1140
agtgagcgta gggtatcaag actacgaata ctgtaaagcc acagatgggt gttcgtagtc 1200
ttgataccct tcgcctacta gaggacacaa ggcctgttac tagcactcac atggaacaaa 1260
tggccaccgt gggaggatga caatttctgt ggctgcgtga aagccttgag gggctccggg 1320
agctagagcc tctgctaacc atgttcatgc cttcttcttt ttcctacagc tcctgggcaa 1380
cgtgctggtt attgtgctgt ctcatcattt tggcaaagaa ttcctcgaag atccgaaggg 1440
aaagtcttcc acgactgtgg gatccgttcg aagatatcac cggttgagcc accatggaat 1500
tcagcagccc cagcagagag gaatgcccca agcctctgag ccgggtgtca atcatggccg 1560
gatctctgac aggactgctg ctgcttcagg ccgtgtcttg ggcttctggc gctagacctt 1620
gcatccccaa gagcttcggc tacagcagcg tcgtgtgcgt gtgcaatgcc acctactgcg 1680
acagcttcga ccctcctacc tttcctgctc tgggcacctt cagcagatac gagagcacca 1740
gatccggcag acggatggaa ctgagcatgg gacccatcca ggccaatcac acaggcactg 1800
gcctgctgct gacactgcag cctgagcaga aattccagaa agtgaaaggc ttcggcggag 1860
ccatgacaga tgccgccgct ctgaatatcc tggctctgtc tccaccagct cagaacctgc 1920
tgctcaagag ctacttcagc gaggaaggca tcggctacaa catcatcaga gtgcccatgg 1980
ccagctgcga cttcagcatc aggacctaca cctacgccga cacacccgac gatttccagc 2040
tgcacaactt cagcctgcct gaagaggaca ccaagctgaa gatccctctg atccacagag 2100
ccctgcagct ggcacaaaga cccgtgtcac tgctggcctc tccatggaca tctcccacct 2160
ggctgaaaac aaatggcgcc gtgaatggca agggcagcct gaaaggccaa cctggcgaca 2220
tctaccacca gacctgggcc agatacttcg tgaagttcct ggacgcctat gccgagcaca 2280
agctgcagtt ttgggccgtg acagccgaga acgaaccttc tgctggactg ctgagcggct 2340
acccctttca gtgcctgggc tttacacccg agcaccagcg ggactttatc gcccgtgatc 2400
tgggacccac actggccaat agcacccacc ataatgtgcg gctgctgatg ctggacgacc 2460
agagactgct tctgccccac tgggctaaag tggtgctgac agatcctgag gccgccaaat 2520
acgtgcacgg aatcgccgtg cactggtatc tggactttct ggcccctgcc aaggccacac 2580
tgggagagac acacagactg ttccccaaca ccatgctgtt cgccagcgaa gcctgtgtgg 2640
gcagcaagtt ttgggaacag agcgtgcggc tcggcagctg ggatagaggc atgcagtaca 2700
gccacagcat catcaccaac ctgctgtacc acgtcgtcgg ctggaccgac tggaatctgg 2760
ccctgaatcc tgaaggcggc cctaactggg tccgaaactt cgtggacagc cccatcatcg 2820
tggacatcac caaggacacc ttctacaagc agcccatgtt ctaccacctg ggacacttca 2880
gcaagttcat ccccgagggc tctcagcgcg ttggactggt ggcttcccag aagaacgatc 2940
tggacgccgt ggctctgatg caccctgatg gatctgctgt ggtggtggtc ctgaaccgca 3000
gcagcaaaga tgtgcccctg accatcaagg atcccgccgt gggattcctg gaaacaatca 3060
gccctggcta ctccatccac acctacctgt ggcgtagaca gtgacaattg ttaattaagt 3120
ttaaaccctc gaggccgcaa gcttatcgat aatcaacctc tggattacaa aatttgtgaa 3180
agattgactg gtattcttaa ctatgttgct ccttttacgc tatgtggata cgctgcttta 3240
atgcctttgt atcatgctat tgcttcccgt atggctttca ttttctcctc cttgtataaa 3300
tcctggttgc tgtctcttta tgaggagttg tggcccgttg tcaggcaacg tggcgtggtg 3360
tgcactgtgt ttgctgacgc aacccccact ggttggggca ttgccaccac ctgtcagctc 3420
ctttccggga ctttcgcttt ccccctccct attgccacgg cggaactcat cgccgcctgc 3480
cttgcccgct gctggacagg ggctcggctg ttgggcactg acaattccgt ggtgttgtcg 3540
gggaaatcat cgtcctttcc ttggctgctc gcctgtgttg ccacctggat tctgcgcggg 3600
acgtccttct gctacgtccc ttcggccctc aatccagcgg accttccttc ccgcggcctg 3660
ctgccggctc tgcggcctct tccgcgtctt cgccttcgcc ctcagacgag tcggatctcc 3720
ctttgggccg cctccccgca tcgataccgt cgactagagc tcgctgatca gcctcgactg 3780
tgccttctag ttgccagcca tctgttgttt gcccctcccc cgtgccttcc ttgaccctgg 3840
aaggtgccac tcccactgtc ctttcctaat aaaatgagga aattgcatcg cattgtctga 3900
gtaggtgtca ttctattctg gggggtgggg tggggcagga cagcaagggg gaggattggg 3960
aagacaatag caggcatgct ggggagagat ccacgataac aaacagcttt tttggggtga 4020
acatattgac tgaattccct gcaggttggc cactccctct ctgcgcgctc gctcgctcac 4080
tgaggccgcc cgggcaaagc ccgggcgtcg ggcgaccttt ggtcgcccgg cctcagtgag 4140
cgagcgagcg cgcagagagg gagtggccaa ctccatcact aggggttcct gcggccgctc 4200
gtacggtctc gaggaattcc tgcaggataa cttgccaacc tcattctaaa atgtatatag 4260
aagcccaaaa gacaataaca aaaatattct tgtagaacaa aatgggaaag aatgttccac 4320
taaatatcaa gatttagagc aaagcatgag atgtgtgggg atagacagtg aggctgataa 4380
aatagagtag agctcagaaa cagacccatt gatatatgta agtgacctat gaaaaaaata 4440
tggcatttta caatgggaaa atgatggtct ttttcttttt tagaaaaaca gggaaatata 4500
tttatatgta aaaaataaaa gggaacccat atgtcatacc atacacacaa aaaaattcca 4560
gtgaattata agtctaaatg gagaaggcaa aactttaaat cttttagaaa ataatataga 4620
agcatgcaga ccagcctggc caacatgatg aaaccctctc tactaataat aaaatcagta 4680
gaactactca ggactacttt gagtgggaag tccttttcta tgaagacttc tttggccaaa 4740
attaggctct aaatgcaagg agatagtgca tcatgcctgg ctgcacttac tgataaatga 4800
tgttatcacc atctttaacc aaatgcacag gaacaagtta tggtactgat gtgctggatt 4860
gagaaggagc tctacttcct tgacaggaca catttgtatc aacttaaaaa agcagatttt 4920
tgccagcaga actattcatt cagaggtagg aaacttagaa tagatgatgt cactgattag 4980
catggcttcc ccatctccac agctgcttcc cacccaggtt gcccacagtt gagtttgtcc 5040
agtgctcagg gctgcccact ctcagtaaga agccccacac cagcccctct ccaaatatgt 5100
tggctgttcc ttccattaaa gtgaccccac tttagagcag caagtggatt tctgtttctt 5160
acagttcagg aaggaggagt cagctgtgag aacctggagc ctgagatgct tctaagtccc 5220
actgctactg gggtcaggga agccagactc cagcatcagc agtcaggagc actaagccct 5280
tgccaacatc ctgtttctca gagaaactgc ttccattata atggttgtcc ttttttaagc 5340
tatcaagcca aacaaccagt gtctaccatt attctcatca cctgaagcca agggttctag 5400
caaaagtcaa gctgtcttgt aatggttgat gtgcctccag cttctgtctt cagtcactcc 5460
actcttagcc tgctctgaat caactctgac cacagttccc tggagcccct gccacctgct 5520
gcccctgcca ccttctccat ctgcagtgct gtgcagcctt ctgcactctt gcagagctaa 5580
taggtggaga cttgaaggaa gaggaggaaa gtttctcata atagccttgc tgcaagctca 5640
aatgggaggt gggcactgtg cccaggagcc ttggagcaaa ggctgtgccc aacctctgac 5700
tgcatccagg tttggtcttg acagagataa gaagccctgg cttttggagc caaaatctag 5760
gtcagactta ggcaggattc tcaaagttta tcagcagaac atgaggcaga agaccctttc 5820
tgctccagct tcttcaggct caaccttcat cagaatagat agaaagagag gctgtgaggg 5880
ttcttaaaac agaagcaaat ctgactcaga gaataaacaa cctcctagta aactacagct 5940
tagacagagc atctggtggt gagtgtgctc agtgtcctac tcaactgtct ggtatcagcc 6000
ctcatgagga cttctcttct ttccctcata gacctccatc tctgttttcc ttagcctgca 6060
gaaatctgga tggctattca cagaatgcct gtgctttcag agttgcattt tttctctggt 6120
attctggttc aagcatttga aggtaggaaa ggttctccaa gtgcaagaaa gccagccctg 6180
agcctcaact gcctggctag tgtggtcagt aggatgcaaa ggctgttgaa tgccacaagg 6240
ccaaacttta acctgtgtac cacaagccta gcagcagagg cagctctgct cactggaact 6300
ctctgtcttc tttctcctga gccttttctt ttcctgagtt ttctagctct cctcaacctt 6360
acctctgccc tacccaggac aaacccaaga gccactgttt ctgtgatgtc ctctccagcc 6420
ctaattaggc atcatgactt cagcctgacc ttccatgctc agaagcagtg ctaatccact 6480
tcagatgagc tgctctatgc aacacaggca gagcctacaa acctttgcac cagagccctc 6540
cacatatcag tgtttgttca tactcacttc aacagcaaat gtgactgctg agattaagat 6600
tttacacaag atggtctgta atttcacagt tagttttatc ccattaggta tgaaagaatt 6660
agcataattc cccttaaaca tgaatgaatc ttagattttt taataaatag ttttggaagt 6720
aaagacagag acatcaggag cacaaggaat agcctgagag gacaaacaga acaagaaaga 6780
gtctggaaat acacaggatg ttcttggcct cctcaaagca agtgcaagca gatagtacca 6840
gcagccccag gctatcagag cccagtgaag agaagtacca tgaaagccac agctctaacc 6900
accctgttcc agagtgacag acagtcccca agacaagcca gcctgagcca gagagagaac 6960
tgcaagagaa agtttctaat ttaggttctg ttagattcag acaagtgcag gtcatcctct 7020
ctccacagct actcacctct ccagcctaac aaagcctgca gtccacactc caaccctggt 7080
gtctcacctc ctagcctctc ccaacatcct gctctctgac catcttctgc atctctcatc 7140
tcaccatctc ccactgtcta cagcctactc ttgcaactac catctcattt tctgacatcc 7200
tgtctacatc ttctgccata ctctgccatc taccatacca cctcttacca tctaccacac 7260
catcttttat ctccatccct ctcagaagcc tccaagctga atcctgcttt atgtgttcat 7320
ctcagcccct gcatggaaag ctgaccccag aggcagaact attcccagag agcttggcca 7380
agaaaaacaa aactaccagc ctggccaggc tcaggagtag taagctgcag tgtctgttgt 7440
gttctagctt caacagctgc aggagttcca ctctcaaatg ctccacattt ctcacatcct 7500
cctgattctg gtcactaccc atcttcaaag aacagaatat ctcacatcag catactgtga 7560
aggactagtc atgggtgcag ctgctcagag ctgcaaagtc attctggatg gtggagagct 7620
tacaaacatt tcatgatgct ccccccgctc tgatggctgg agcccaatcc ctacacagac 7680
tcctgctgta tgtgttttcc tttcactctg agccacagcc agagggcagg cattcagtct 7740
cctcttcagg ctggggctgg ggcactgaga actcacccaa caccttgctc tcactccttc 7800
tgcaaaacaa gaaagagctt tgtgctgcag tagccatgaa gaatgaaagg aaggctttaa 7860
ctaaaaaatg tcagagatta ttttcaaccc cttactgtgg atcaccagca aggaggaaac 7920
acaacacaga gacatttttt cccctcaaat tatcaaaaga atcactgcat ttgttaaaga 7980
gagcaactga atcaggaagc agagttttga acatatcaga agttaggaat ctgcatcaga 8040
gacaaatgca gtcatggttg tttgctgcat accagcccta atcattagaa gcctcatgga 8100
cttcaaacat cattccctct gacaagatgc tctagcctaa ctccatgaga taaaataaat 8160
ctgcctttca gagccaaaga agagtccacc agcttcttct cagtgtgaac aagagctcca 8220
gtcaggttag tcagtccagt gcagtagagg agaccagtct gcatcctcta attttcaaag 8280
gcaagaagat ttgtttaccc tggacaccag gcacaagtga ggtcacagag ctcttagata 8340
tgcagtcctc atgagtgagg agactaaagc gcatgccatc aagacttcag tgtagagaaa 8400
acctccaaaa aagcctcctc actacttctg gaatagctca gaggccgagg cggcctcggc 8460
ctctgcataa ataaaaaaaa ttagtcagcc atggggcgga gaatgggcgg aactgggcgg 8520
agttaggggc gggatgggcg gagttagggg cgggactatg gttgctgact aattgagatg 8580
catgctttgc atacttctgc ctgctgggga gcctggggac tttccacacc tggttgctga 8640
ctaattgaga tgcatgcttt gcatacttct gcctgctggg gagcctgggg actttccaca 8700
ccctaactga cacacattcc acagctgcat taatgaatcg gccaacgcgc ggggagaggc 8760
ggtttgcgta ttgggcgctc ttccgcttcc tcgctcactg actcgctgcg ctcggtcgtt 8820
cggctgcggc gagcggtatc agctcactca aaggcggtaa tacggttatc cacagaatca 8880
ggggataacg caggaaagaa catgtgagca aaaggccagc aaaaggccag gaaccgtaaa 8940
aaggccgcgt tgctggcgtt tttccatagg ctccgccccc ctgacgagca tcacaaaaat 9000
cgacgctcaa gtcagaggtg gcgaaacccg acaggactat aaagatacca ggcgtttccc 9060
cctggaagct ccctcgtgcg ctctcctgtt ccgaccctgc cgcttaccgg atacctgtcc 9120
gcctttctcc cttcgggaag cgtggcgctt tctcatagct cacgctgtag gtatctcagt 9180
tcggtgtagg tcgttcgctc caagctgggc tgtgtgcacg aaccccccgt tcagcccgac 9240
cgctgcgcct tatccggtaa ctatcgtctt gagtccaacc cggtaagaca cgacttatcg 9300
ccactggcag cagccactgg taacaggatt agcagagcga ggtatgtagg cggtgctaca 9360
gagttcttga agtggtggcc taactacggc tacactagaa gaacagtatt tggtatctgc 9420
gctctgctga agccagttac cttcggaaaa agagttggta gctcttgatc cggcaaacaa 9480
accaccgctg gtagcggtgg tttttttgtt tgcaagcagc agattacgcg cagaaaaaaa 9540
ggatctcaag aagatccttt gatcttttct acggggtctg acgctcagtg gaacgaaaac 9600
tcacgttaag ggattttggt catgagatta tcaaaaagga tcttcaccta gatcctttta 9660
aattaaaaat gaagttttaa atcaatctaa agtatatatg agtaaacttg gtctgacagt 9720
taccaatgct taatcagtga ggcacctatc tcagcgatct gtctatttcg ttcatccata 9780
gttgcctgac tcctgcaaac cacgttgtgt ctcaaaatct ctgatgttac attgcacaag 9840
ataaaaatat atcatcatga acaataaaac tgtctgctta cataaacagt aatacaaggg 9900
gtgttatgag ccatattcaa cgggaaacgt cttgctcgag gccgcgatta aattccaaca 9960
tggatgctga tttatatggg tataaatggg ctcgcgataa tgtcgggcaa tcaggtgcga 10020
caatctatcg attgtatggg aagcccgatg cgccagagtt gtttctgaaa catggcaaag 10080
gtagcgttgc caatgatgtt acagatgaga tggtcagact aaactggctg acggaattta 10140
tgcctcttcc gaccatcaag cattttatcc gtactcctga tgatgcatgg ttactcacca 10200
ctgcgatccc cgggaaaaca gcattccagg tattagaaga atatcctgat tcaggtgaaa 10260
atattgttga tgcgctggca gtgttcctgc gccggttgca ttcgattcct gtttgtaatt 10320
gtccttttaa cagcgatcgc gtatttcgtc tcgctcaggc gcaatcacga atgaataacg 10380
gtttggttga tgcgagtgat tttgatgacg agcgtaatgg ctggcctgtt gaacaagtct 10440
ggaaagaaat gcataagctt ttgccattct caccggattc agtcgtcact catggtgatt 10500
tctcacttga taaccttatt tttgacgagg ggaaattaat aggttgtatt gatgttggac 10560
gagtcggaat cgcagaccga taccaggatc ttgccatcct atggaactgc ctcggtgagt 10620
tttctccttc attacagaaa cggctttttc aaaaatatgg tattgataat cctgatatga 10680
ataaattgca gtttcatttg atgctcgatg agtttttcta agggcggcct gccaccatac 10740
ccacgccgaa acaagcgctc atgagcccga agtggcgagc ccgatcttcc ccatcggtga 10800
tgtcggcgat ataggcgcca gcaaccgcac ctgtggcgcc ggtgatgagg gcgcgccaag 10860
tcgacgtccg gcagtc 10876
<210> 105
<211> 10849
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 105
ttggccactc cctctctgcg cgctcgctcg ctcactgagg ccgggcgacc aaaggtcgcc 60
cgacgcccgg gctttgcccg ggcggcctca gtgagcgagc gagcgcgcag agagggagtg 120
gccaactcca tcactagggg ttcctgctag ctctgggtat ttaagcccga gtgagcacgc 180
agggtctcca ttttgaagcg ggaggttacg cgttcgtcga ctactagtgg gtaccagagc 240
tccctaggtt ctagaaccgg tgacgtctcc catggtgaag cttggatctg aattcggtac 300
cctagttatt aatagtaatc aattacgggg tcattagttc atagcccata tatggagttc 360
cgcgttacat aacttacggt aaatggcccg cctggctgac cgcccaacga cccccgccca 420
ttgacgtcaa taatgacgta tgttcccata gtaacgccaa tagggacttt ccattgacgt 480
caatgggtgg actatttacg gtaaactgcc cacttggcag tacatcaagt gtatcatatg 540
ccaagtacgc cccctattga cgtcaatgac ggtaaatggc ccgcctggca ttatgcccag 600
tacatgacct tatgggactt tcctacttgg cagtacatct acgtattagt catcgctatt 660
accatggtcg aggtgagccc cacgttctgc ttcactctcc ccatctcccc cccctcccca 720
cccccaattt tgtatttatt tattttttaa ttattttgtg cagcgatggg ggcggggggg 780
gggggggggc gcgcgccagg cggggcgggg cggggcgagg ggcggggcgg ggcgaggcgg 840
agaggtgcgg cggcagccaa tcagagcggc gcgctccgaa agtttccttt tatggcgagg 900
cggcggcggc ggcggcccta taaaaagcga agcgcgcggc gggcgggagt cgctgcgacg 960
ctgccttcgc cccgtgcccc gctccgccgc cgcctcgcgc cgcccgcccc ggctctgact 1020
gaccgcgtta ctcccacagg tgagcgggcg ggacggccct tctcctcagc gctgtaatta 1080
gcgcttggtt taatgacggc ttgttggagg cttgctgaag gctgtatgct gttgtcttta 1140
gaaataagtg gtagtcaagt gaagccacag atgtgactac cacttatttc taaaaggaca 1200
caaggcctgt tactagcact cacatggaac aaatggccac cgtgggagga tgacaatttc 1260
tgtggctgcg tgaaagcctt gaggggctcc gggagctaga gcctctgcta accatgttca 1320
tgccttcttc tttttcctac agctcctggg caacgtgctg gttattgtgc tgtctcatca 1380
ttttggcaaa gaattcctcg aagatccgaa gggaaagtct tccacgactg tgggatccgt 1440
tcgaagatat caccggttga gccaccatgg aattcagcag ccccagcaga gaggaatgcc 1500
ccaagcctct gagccgggtg tcaatcatgg ccggatctct gacaggactg ctgctgcttc 1560
aggccgtgtc ttgggcttct ggcgctagac cttgcatccc caagagcttc ggctacagca 1620
gcgtcgtgtg cgtgtgcaat gccacctact gcgacagctt cgaccctcct acctttcctg 1680
ctctgggcac cttcagcaga tacgagagca ccagatccgg cagacggatg gaactgagca 1740
tgggacccat ccaggccaat cacacaggca ctggcctgct gctgacactg cagcctgagc 1800
agaaattcca gaaagtgaaa ggcttcggcg gagccatgac agatgccgcc gctctgaata 1860
tcctggctct gtctccacca gctcagaacc tgctgctcaa gagctacttc agcgaggaag 1920
gcatcggcta caacatcatc agagtgccca tggccagctg cgacttcagc atcaggacct 1980
acacctacgc cgacacaccc gacgatttcc agctgcacaa cttcagcctg cctgaagagg 2040
acaccaagct gaagatccct ctgatccaca gagccctgca gctggcacaa agacccgtgt 2100
cactgctggc ctctccatgg acatctccca cctggctgaa aacaaatggc gccgtgaatg 2160
gcaagggcag cctgaaaggc caacctggcg acatctacca ccagacctgg gccagatact 2220
tcgtgaagtt cctggacgcc tatgccgagc acaagctgca gttttgggcc gtgacagccg 2280
agaacgaacc ttctgctgga ctgctgagcg gctacccctt tcagtgcctg ggctttacac 2340
ccgagcacca gcgggacttt atcgcccgtg atctgggacc cacactggcc aatagcaccc 2400
accataatgt gcggctgctg atgctggacg accagagact gcttctgccc cactgggcta 2460
aagtggtgct gacagatcct gaggccgcca aatacgtgca cggaatcgcc gtgcactggt 2520
atctggactt tctggcccct gccaaggcca cactgggaga gacacacaga ctgttcccca 2580
acaccatgct gttcgccagc gaagcctgtg tgggcagcaa gttttgggaa cagagcgtgc 2640
ggctcggcag ctgggataga ggcatgcagt acagccacag catcatcacc aacctgctgt 2700
accacgtcgt cggctggacc gactggaatc tggccctgaa tcctgaaggc ggccctaact 2760
gggtccgaaa cttcgtggac agccccatca tcgtggacat caccaaggac accttctaca 2820
agcagcccat gttctaccac ctgggacact tcagcaagtt catccccgag ggctctcagc 2880
gcgttggact ggtggcttcc cagaagaacg atctggacgc cgtggctctg atgcaccctg 2940
atggatctgc tgtggtggtg gtcctgaacc gcagcagcaa agatgtgccc ctgaccatca 3000
aggatcccgc cgtgggattc ctggaaacaa tcagccctgg ctactccatc cacacctacc 3060
tgtggcgtag acagtgacaa ttgttaatta agtttaaacc ctcgaggccg caagcttatc 3120
gataatcaac ctctggatta caaaatttgt gaaagattga ctggtattct taactatgtt 3180
gctcctttta cgctatgtgg atacgctgct ttaatgcctt tgtatcatgc tattgcttcc 3240
cgtatggctt tcattttctc ctccttgtat aaatcctggt tgctgtctct ttatgaggag 3300
ttgtggcccg ttgtcaggca acgtggcgtg gtgtgcactg tgtttgctga cgcaaccccc 3360
actggttggg gcattgccac cacctgtcag ctcctttccg ggactttcgc tttccccctc 3420
cctattgcca cggcggaact catcgccgcc tgccttgccc gctgctggac aggggctcgg 3480
ctgttgggca ctgacaattc cgtggtgttg tcggggaaat catcgtcctt tccttggctg 3540
ctcgcctgtg ttgccacctg gattctgcgc gggacgtcct tctgctacgt cccttcggcc 3600
ctcaatccag cggaccttcc ttcccgcggc ctgctgccgg ctctgcggcc tcttccgcgt 3660
cttcgccttc gccctcagac gagtcggatc tccctttggg ccgcctcccc gcatcgatac 3720
cgtcgactag agctcgctga tcagcctcga ctgtgccttc tagttgccag ccatctgttg 3780
tttgcccctc ccccgtgcct tccttgaccc tggaaggtgc cactcccact gtcctttcct 3840
aataaaatga ggaaattgca tcgcattgtc tgagtaggtg tcattctatt ctggggggtg 3900
gggtggggca ggacagcaag ggggaggatt gggaagacaa tagcaggcat gctggggaga 3960
gatccacgat aacaaacagc ttttttgggg tgaacatatt gactgaattc cctgcaggtt 4020
ggccactccc tctctgcgcg ctcgctcgct cactgaggcc gcccgggcaa agcccgggcg 4080
tcgggcgacc tttggtcgcc cggcctcagt gagcgagcga gcgcgcagag agggagtggc 4140
caactccatc actaggggtt cctgcggccg ctcgtacggt ctcgaggaat tcctgcagga 4200
taacttgcca acctcattct aaaatgtata tagaagccca aaagacaata acaaaaatat 4260
tcttgtagaa caaaatggga aagaatgttc cactaaatat caagatttag agcaaagcat 4320
gagatgtgtg gggatagaca gtgaggctga taaaatagag tagagctcag aaacagaccc 4380
attgatatat gtaagtgacc tatgaaaaaa atatggcatt ttacaatggg aaaatgatgg 4440
tctttttctt ttttagaaaa acagggaaat atatttatat gtaaaaaata aaagggaacc 4500
catatgtcat accatacaca caaaaaaatt ccagtgaatt ataagtctaa atggagaagg 4560
caaaacttta aatcttttag aaaataatat agaagcatgc agaccagcct ggccaacatg 4620
atgaaaccct ctctactaat aataaaatca gtagaactac tcaggactac tttgagtggg 4680
aagtcctttt ctatgaagac ttctttggcc aaaattaggc tctaaatgca aggagatagt 4740
gcatcatgcc tggctgcact tactgataaa tgatgttatc accatcttta accaaatgca 4800
caggaacaag ttatggtact gatgtgctgg attgagaagg agctctactt ccttgacagg 4860
acacatttgt atcaacttaa aaaagcagat ttttgccagc agaactattc attcagaggt 4920
aggaaactta gaatagatga tgtcactgat tagcatggct tccccatctc cacagctgct 4980
tcccacccag gttgcccaca gttgagtttg tccagtgctc agggctgccc actctcagta 5040
agaagcccca caccagcccc tctccaaata tgttggctgt tccttccatt aaagtgaccc 5100
cactttagag cagcaagtgg atttctgttt cttacagttc aggaaggagg agtcagctgt 5160
gagaacctgg agcctgagat gcttctaagt cccactgcta ctggggtcag ggaagccaga 5220
ctccagcatc agcagtcagg agcactaagc ccttgccaac atcctgtttc tcagagaaac 5280
tgcttccatt ataatggttg tcctttttta agctatcaag ccaaacaacc agtgtctacc 5340
attattctca tcacctgaag ccaagggttc tagcaaaagt caagctgtct tgtaatggtt 5400
gatgtgcctc cagcttctgt cttcagtcac tccactctta gcctgctctg aatcaactct 5460
gaccacagtt ccctggagcc cctgccacct gctgcccctg ccaccttctc catctgcagt 5520
gctgtgcagc cttctgcact cttgcagagc taataggtgg agacttgaag gaagaggagg 5580
aaagtttctc ataatagcct tgctgcaagc tcaaatggga ggtgggcact gtgcccagga 5640
gccttggagc aaaggctgtg cccaacctct gactgcatcc aggtttggtc ttgacagaga 5700
taagaagccc tggcttttgg agccaaaatc taggtcagac ttaggcagga ttctcaaagt 5760
ttatcagcag aacatgaggc agaagaccct ttctgctcca gcttcttcag gctcaacctt 5820
catcagaata gatagaaaga gaggctgtga gggttcttaa aacagaagca aatctgactc 5880
agagaataaa caacctccta gtaaactaca gcttagacag agcatctggt ggtgagtgtg 5940
ctcagtgtcc tactcaactg tctggtatca gccctcatga ggacttctct tctttccctc 6000
atagacctcc atctctgttt tccttagcct gcagaaatct ggatggctat tcacagaatg 6060
cctgtgcttt cagagttgca ttttttctct ggtattctgg ttcaagcatt tgaaggtagg 6120
aaaggttctc caagtgcaag aaagccagcc ctgagcctca actgcctggc tagtgtggtc 6180
agtaggatgc aaaggctgtt gaatgccaca aggccaaact ttaacctgtg taccacaagc 6240
ctagcagcag aggcagctct gctcactgga actctctgtc ttctttctcc tgagcctttt 6300
cttttcctga gttttctagc tctcctcaac cttacctctg ccctacccag gacaaaccca 6360
agagccactg tttctgtgat gtcctctcca gccctaatta ggcatcatga cttcagcctg 6420
accttccatg ctcagaagca gtgctaatcc acttcagatg agctgctcta tgcaacacag 6480
gcagagccta caaacctttg caccagagcc ctccacatat cagtgtttgt tcatactcac 6540
ttcaacagca aatgtgactg ctgagattaa gattttacac aagatggtct gtaatttcac 6600
agttagtttt atcccattag gtatgaaaga attagcataa ttccccttaa acatgaatga 6660
atcttagatt ttttaataaa tagttttgga agtaaagaca gagacatcag gagcacaagg 6720
aatagcctga gaggacaaac agaacaagaa agagtctgga aatacacagg atgttcttgg 6780
cctcctcaaa gcaagtgcaa gcagatagta ccagcagccc caggctatca gagcccagtg 6840
aagagaagta ccatgaaagc cacagctcta accaccctgt tccagagtga cagacagtcc 6900
ccaagacaag ccagcctgag ccagagagag aactgcaaga gaaagtttct aatttaggtt 6960
ctgttagatt cagacaagtg caggtcatcc tctctccaca gctactcacc tctccagcct 7020
aacaaagcct gcagtccaca ctccaaccct ggtgtctcac ctcctagcct ctcccaacat 7080
cctgctctct gaccatcttc tgcatctctc atctcaccat ctcccactgt ctacagccta 7140
ctcttgcaac taccatctca ttttctgaca tcctgtctac atcttctgcc atactctgcc 7200
atctaccata ccacctctta ccatctacca caccatcttt tatctccatc cctctcagaa 7260
gcctccaagc tgaatcctgc tttatgtgtt catctcagcc cctgcatgga aagctgaccc 7320
cagaggcaga actattccca gagagcttgg ccaagaaaaa caaaactacc agcctggcca 7380
ggctcaggag tagtaagctg cagtgtctgt tgtgttctag cttcaacagc tgcaggagtt 7440
ccactctcaa atgctccaca tttctcacat cctcctgatt ctggtcacta cccatcttca 7500
aagaacagaa tatctcacat cagcatactg tgaaggacta gtcatgggtg cagctgctca 7560
gagctgcaaa gtcattctgg atggtggaga gcttacaaac atttcatgat gctccccccg 7620
ctctgatggc tggagcccaa tccctacaca gactcctgct gtatgtgttt tcctttcact 7680
ctgagccaca gccagagggc aggcattcag tctcctcttc aggctggggc tggggcactg 7740
agaactcacc caacaccttg ctctcactcc ttctgcaaaa caagaaagag ctttgtgctg 7800
cagtagccat gaagaatgaa aggaaggctt taactaaaaa atgtcagaga ttattttcaa 7860
ccccttactg tggatcacca gcaaggagga aacacaacac agagacattt tttcccctca 7920
aattatcaaa agaatcactg catttgttaa agagagcaac tgaatcagga agcagagttt 7980
tgaacatatc agaagttagg aatctgcatc agagacaaat gcagtcatgg ttgtttgctg 8040
cataccagcc ctaatcatta gaagcctcat ggacttcaaa catcattccc tctgacaaga 8100
tgctctagcc taactccatg agataaaata aatctgcctt tcagagccaa agaagagtcc 8160
accagcttct tctcagtgtg aacaagagct ccagtcaggt tagtcagtcc agtgcagtag 8220
aggagaccag tctgcatcct ctaattttca aaggcaagaa gatttgttta ccctggacac 8280
caggcacaag tgaggtcaca gagctcttag atatgcagtc ctcatgagtg aggagactaa 8340
agcgcatgcc atcaagactt cagtgtagag aaaacctcca aaaaagcctc ctcactactt 8400
ctggaatagc tcagaggccg aggcggcctc ggcctctgca taaataaaaa aaattagtca 8460
gccatggggc ggagaatggg cggaactggg cggagttagg ggcgggatgg gcggagttag 8520
gggcgggact atggttgctg actaattgag atgcatgctt tgcatacttc tgcctgctgg 8580
ggagcctggg gactttccac acctggttgc tgactaattg agatgcatgc tttgcatact 8640
tctgcctgct ggggagcctg gggactttcc acaccctaac tgacacacat tccacagctg 8700
cattaatgaa tcggccaacg cgcggggaga ggcggtttgc gtattgggcg ctcttccgct 8760
tcctcgctca ctgactcgct gcgctcggtc gttcggctgc ggcgagcggt atcagctcac 8820
tcaaaggcgg taatacggtt atccacagaa tcaggggata acgcaggaaa gaacatgtga 8880
gcaaaaggcc agcaaaaggc caggaaccgt aaaaaggccg cgttgctggc gtttttccat 8940
aggctccgcc cccctgacga gcatcacaaa aatcgacgct caagtcagag gtggcgaaac 9000
ccgacaggac tataaagata ccaggcgttt ccccctggaa gctccctcgt gcgctctcct 9060
gttccgaccc tgccgcttac cggatacctg tccgcctttc tcccttcggg aagcgtggcg 9120
ctttctcata gctcacgctg taggtatctc agttcggtgt aggtcgttcg ctccaagctg 9180
ggctgtgtgc acgaaccccc cgttcagccc gaccgctgcg ccttatccgg taactatcgt 9240
cttgagtcca acccggtaag acacgactta tcgccactgg cagcagccac tggtaacagg 9300
attagcagag cgaggtatgt aggcggtgct acagagttct tgaagtggtg gcctaactac 9360
ggctacacta gaagaacagt atttggtatc tgcgctctgc tgaagccagt taccttcgga 9420
aaaagagttg gtagctcttg atccggcaaa caaaccaccg ctggtagcgg tggttttttt 9480
gtttgcaagc agcagattac gcgcagaaaa aaaggatctc aagaagatcc tttgatcttt 9540
tctacggggt ctgacgctca gtggaacgaa aactcacgtt aagggatttt ggtcatgaga 9600
ttatcaaaaa ggatcttcac ctagatcctt ttaaattaaa aatgaagttt taaatcaatc 9660
taaagtatat atgagtaaac ttggtctgac agttaccaat gcttaatcag tgaggcacct 9720
atctcagcga tctgtctatt tcgttcatcc atagttgcct gactcctgca aaccacgttg 9780
tgtctcaaaa tctctgatgt tacattgcac aagataaaaa tatatcatca tgaacaataa 9840
aactgtctgc ttacataaac agtaatacaa ggggtgttat gagccatatt caacgggaaa 9900
cgtcttgctc gaggccgcga ttaaattcca acatggatgc tgatttatat gggtataaat 9960
gggctcgcga taatgtcggg caatcaggtg cgacaatcta tcgattgtat gggaagcccg 10020
atgcgccaga gttgtttctg aaacatggca aaggtagcgt tgccaatgat gttacagatg 10080
agatggtcag actaaactgg ctgacggaat ttatgcctct tccgaccatc aagcatttta 10140
tccgtactcc tgatgatgca tggttactca ccactgcgat ccccgggaaa acagcattcc 10200
aggtattaga agaatatcct gattcaggtg aaaatattgt tgatgcgctg gcagtgttcc 10260
tgcgccggtt gcattcgatt cctgtttgta attgtccttt taacagcgat cgcgtatttc 10320
gtctcgctca ggcgcaatca cgaatgaata acggtttggt tgatgcgagt gattttgatg 10380
acgagcgtaa tggctggcct gttgaacaag tctggaaaga aatgcataag cttttgccat 10440
tctcaccgga ttcagtcgtc actcatggtg atttctcact tgataacctt atttttgacg 10500
aggggaaatt aataggttgt attgatgttg gacgagtcgg aatcgcagac cgataccagg 10560
atcttgccat cctatggaac tgcctcggtg agttttctcc ttcattacag aaacggcttt 10620
ttcaaaaata tggtattgat aatcctgata tgaataaatt gcagtttcat ttgatgctcg 10680
atgagttttt ctaagggcgg cctgccacca tacccacgcc gaaacaagcg ctcatgagcc 10740
cgaagtggcg agcccgatct tccccatcgg tgatgtcggc gatataggcg ccagcaaccg 10800
cacctgtggc gccggtgatg agggcgcgcc aagtcgacgt ccggcagtc 10849
<210> 106
<211> 11188
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 106
ttggccactc cctctctgcg cgctcgctcg ctcactgagg ccgggcgacc aaaggtcgcc 60
cgacgcccgg gctttgcccg ggcggcctca gtgagcgagc gagcgcgcag agagggagtg 120
gccaactatt agatctgatg gccgcgctag ctctgggtat ttaagcccga gtgagcacgc 180
agggtctcca ttttgaagcg ggaggttacg cgttcgtcga ctactagtgg gtaccagagc 240
gtggtgactg agatgttttc taggaaacac aaaagataca aaaaagaaca cgtggaagga 300
tagccaaaaa ggggggctgc ccccatttcc tgcaccccgc tgcgatggct ggcaccattt 360
ggaagacttc gagatacact gttgagcgca gtaagacaac agtgtatctc gaagtcttcc 420
agatggggcc agccggtcca ctctgtatcc aggccagttc tgcaaggcgt tcgaggacca 480
cccccctccc ctcgccacca gggtggtctc atacagaact tataagattc ccaaatccaa 540
agacatttca cgtttatggt gatttcccag aacacatagc gacatgcaaa tattgcaggg 600
cgccactccc ctgtccctca cagccatctt cctgccaggg cgcacgcgcg ctgggtgttc 660
ccgcctagtg acactgggcc cgcgattcct tggagcgggt tgatgacgtc agcgtttccc 720
atggtgaatc cctaggttct agaaccggtg acgtctccca tggtgaagct tggatctgaa 780
ttcggtacct agttattaat agtaatcaat tacggggtca ttagttcata gcccatatat 840
ggagttccgc gttacataac ttacggtaaa tggcccgcct ggctgaccgc ccaacgaccc 900
ccgcccattg acgtcaataa tgacgtatgt tcccatagta acgccaatag ggactttcca 960
ttgacgtcaa tgggtggagt atttacggta aactgcccac ttggcagtac atcaagtgta 1020
tcatatgcca agtacgcccc ctattgacgt caatgacggt aaatggcccg cctggcatta 1080
tgcccagtac atgaccttat gggactttcc tacttggcag tacatctacg tattagtcat 1140
cgctattacc atggtcgagg tgagccccac gttctgcttc actctcccca tctccccccc 1200
ctccccaccc ccaattttgt atttatttat tttttaatta ttttgtgcag cgatgggggc 1260
gggggggggg ggggggcgcg cgccaggcgg ggcggggcgg ggcgaggggc ggggcggggc 1320
gaggcggaga ggtgcggcgg cagccaatca gagcggcgcg ctccgaaagt ttccttttat 1380
ggcgaggcgg cggcggcggc ggccctataa aaagcgaagc gcgcggcggg cgggagtcgc 1440
tgcgacgctg ccttcgcccc gtgccccgct ccgccgccgc ctcgcgccgc ccgccccggc 1500
tctgactgac cgcgttactc ccacaggtga gcgggcggga cggcccttct cctccgggct 1560
gtaattagcg cttggtttaa tgacggcttg tttcttttct gtggctgcgt gaaagccttg 1620
aggggctccg ggagctagag cctctgctaa ccatgttcat gccttcttct ttttcctaca 1680
gctcctgggc aacgtgctgg ttattgtgct gtctcatcat tttggcaaag aattcctcga 1740
agatccgaag ggaaagtctt ccacgactgt gggatccgtt cgaagatatc accggttgag 1800
ccaccatgga attcagcagc cccagcagag aggaatgccc caagcctctg agccgggtgt 1860
caatcatggc cggatctctg acaggactgc tgctgcttca ggccgtgtct tgggcttctg 1920
gcgctagacc ttgcatcccc aagagcttcg gctacagcag cgtcgtgtgc gtgtgcaatg 1980
ccacctactg cgacagcttc gaccctccta cctttcctgc tctgggcacc ttcagcagat 2040
acgagagcac cagatccggc agacggatgg aactgagcat gggacccatc caggccaatc 2100
acacaggcac tggcctgctg ctgacactgc agcctgagca gaaattccag aaagtgaaag 2160
gcttcggcgg agccatgaca gatgccgccg ctctgaatat cctggctctg tctccaccag 2220
ctcagaacct gctgctcaag agctacttca gcgaggaagg catcggctac aacatcatca 2280
gagtgcccat ggccagctgc gacttcagca tcaggaccta cacctacgcc gacacacccg 2340
acgatttcca gctgcacaac ttcagcctgc ctgaagagga caccaagctg aagatccctc 2400
tgatccacag agccctgcag ctggcacaaa gacccgtgtc actgctggcc tctccatgga 2460
catctcccac ctggctgaaa acaaatggcg ccgtgaatgg caagggcagc ctgaaaggcc 2520
aacctggcga catctaccac cagacctggg ccagatactt cgtgaagttc ctggacgcct 2580
atgccgagca caagctgcag ttttgggccg tgacagccga gaacgaacct tctgctggac 2640
tgctgagcgg ctaccccttt cagtgcctgg gctttacacc cgagcaccag cgggacttta 2700
tcgcccgtga tctgggaccc acactggcca atagcaccca ccataatgtg cggctgctga 2760
tgctggacga ccagagactg cttctgcccc actgggctaa agtggtgctg acagatcctg 2820
aggccgccaa atacgtgcac ggaatcgccg tgcactggta tctggacttt ctggcccctg 2880
ccaaggccac actgggagag acacacagac tgttccccaa caccatgctg ttcgccagcg 2940
aagcctgtgt gggcagcaag ttttgggaac agagcgtgcg gctcggcagc tgggatagag 3000
gcatgcagta cagccacagc atcatcacca acctgctgta ccacgtcgtc ggctggaccg 3060
actggaatct ggccctgaat cctgaaggcg gccctaactg ggtccgaaac ttcgtggaca 3120
gccccatcat cgtggacatc accaaggaca ccttctacaa gcagcccatg ttctaccacc 3180
tgggacactt cagcaagttc atccccgagg gctctcagcg cgttggactg gtggcttccc 3240
agaagaacga tctggacgcc gtggctctga tgcaccctga tggatctgct gtggtggtgg 3300
tcctgaaccg cagcagcaaa gatgtgcccc tgaccatcaa ggatcccgcc gtgggattcc 3360
tggaaacaat cagccctggc tactccatcc acacctacct gtggcgtaga cagtgacaat 3420
tgttaattaa gtttaaaccc tcgaggccgc aagcttatcg ataatcaacc tctggattac 3480
aaaatttgtg aaagattgac tggtattctt aactatgttg ctccttttac gctatgtgga 3540
tacgctgctt taatgccttt gtatcatgct attgcttccc gtatggcttt cattttctcc 3600
tccttgtata aatcctggtt gctgtctctt tatgaggagt tgtggcccgt tgtcaggcaa 3660
cgtggcgtgg tgtgcactgt gtttgctgac gcaaccccca ctggttgggg cattgccacc 3720
acctgtcagc tcctttccgg gactttcgct ttccccctcc ctattgccac ggcggaactc 3780
atcgccgcct gccttgcccg ctgctggaca ggggctcggc tgttgggcac tgacaattcc 3840
gtggtgttgt cggggaaatc atcgtccttt ccttggctgc tcgcctgtgt tgccacctgg 3900
attctgcgcg ggacgtcctt ctgctacgtc ccttcggccc tcaatccagc ggaccttcct 3960
tcccgcggcc tgctgccggc tctgcggcct cttccgcgtc ttcgccttcg ccctcagacg 4020
agtcggatct ccctttgggc cgcctccccg catcgatacc gtcgactaga gctcgctgat 4080
cagcctcgac tgtgccttct agttgccagc catctgttgt ttgcccctcc cccgtgcctt 4140
ccttgaccct ggaaggtgcc actcccactg tcctttccta ataaaatgag gaaattgcat 4200
cgcattgtct gagtaggtgt cattctattc tggggggtgg ggtggggcag gacagcaagg 4260
gggaggattg ggaagacaat agcaggcatg ctggggagag atccacgata acaaacagct 4320
tttttggggt gaacatattg actgaattcc ctgcaggttg gccactccct ctctgcgcgc 4380
tcgctcgctc actgaggccg cccgggcaaa gcccgggcgt cgggcgacct ttggtcgccc 4440
ggcctcagtg agcgagcgag cgcgcagaga gggagtggcc aactccatca ctaggggttc 4500
ctgcggccgc tcgtacggtc tcgaggaatt cctgcaggat aacttgccaa cctcattcta 4560
aaatgtatat agaagcccaa aagacaataa caaaaatatt cttgtagaac aaaatgggaa 4620
agaatgttcc actaaatatc aagatttaga gcaaagcatg agatgtgtgg ggatagacag 4680
tgaggctgat aaaatagagt agagctcaga aacagaccca ttgatatatg taagtgacct 4740
atgaaaaaaa tatggcattt tacaatggga aaatgatggt ctttttcttt tttagaaaaa 4800
cagggaaata tatttatatg taaaaaataa aagggaaccc atatgtcata ccatacacac 4860
aaaaaaattc cagtgaatta taagtctaaa tggagaaggc aaaactttaa atcttttaga 4920
aaataatata gaagcatgca gaccagcctg gccaacatga tgaaaccctc tctactaata 4980
ataaaatcag tagaactact caggactact ttgagtggga agtccttttc tatgaagact 5040
tctttggcca aaattaggct ctaaatgcaa ggagatagtg catcatgcct ggctgcactt 5100
actgataaat gatgttatca ccatctttaa ccaaatgcac aggaacaagt tatggtactg 5160
atgtgctgga ttgagaagga gctctacttc cttgacagga cacatttgta tcaacttaaa 5220
aaagcagatt tttgccagca gaactattca ttcagaggta ggaaacttag aatagatgat 5280
gtcactgatt agcatggctt ccccatctcc acagctgctt cccacccagg ttgcccacag 5340
ttgagtttgt ccagtgctca gggctgccca ctctcagtaa gaagccccac accagcccct 5400
ctccaaatat gttggctgtt ccttccatta aagtgacccc actttagagc agcaagtgga 5460
tttctgtttc ttacagttca ggaaggagga gtcagctgtg agaacctgga gcctgagatg 5520
cttctaagtc ccactgctac tggggtcagg gaagccagac tccagcatca gcagtcagga 5580
gcactaagcc cttgccaaca tcctgtttct cagagaaact gcttccatta taatggttgt 5640
ccttttttaa gctatcaagc caaacaacca gtgtctacca ttattctcat cacctgaagc 5700
caagggttct agcaaaagtc aagctgtctt gtaatggttg atgtgcctcc agcttctgtc 5760
ttcagtcact ccactcttag cctgctctga atcaactctg accacagttc cctggagccc 5820
ctgccacctg ctgcccctgc caccttctcc atctgcagtg ctgtgcagcc ttctgcactc 5880
ttgcagagct aataggtgga gacttgaagg aagaggagga aagtttctca taatagcctt 5940
gctgcaagct caaatgggag gtgggcactg tgcccaggag ccttggagca aaggctgtgc 6000
ccaacctctg actgcatcca ggtttggtct tgacagagat aagaagccct ggcttttgga 6060
gccaaaatct aggtcagact taggcaggat tctcaaagtt tatcagcaga acatgaggca 6120
gaagaccctt tctgctccag cttcttcagg ctcaaccttc atcagaatag atagaaagag 6180
aggctgtgag ggttcttaaa acagaagcaa atctgactca gagaataaac aacctcctag 6240
taaactacag cttagacaga gcatctggtg gtgagtgtgc tcagtgtcct actcaactgt 6300
ctggtatcag ccctcatgag gacttctctt ctttccctca tagacctcca tctctgtttt 6360
ccttagcctg cagaaatctg gatggctatt cacagaatgc ctgtgctttc agagttgcat 6420
tttttctctg gtattctggt tcaagcattt gaaggtagga aaggttctcc aagtgcaaga 6480
aagccagccc tgagcctcaa ctgcctggct agtgtggtca gtaggatgca aaggctgttg 6540
aatgccacaa ggccaaactt taacctgtgt accacaagcc tagcagcaga ggcagctctg 6600
ctcactggaa ctctctgtct tctttctcct gagccttttc ttttcctgag ttttctagct 6660
ctcctcaacc ttacctctgc cctacccagg acaaacccaa gagccactgt ttctgtgatg 6720
tcctctccag ccctaattag gcatcatgac ttcagcctga ccttccatgc tcagaagcag 6780
tgctaatcca cttcagatga gctgctctat gcaacacagg cagagcctac aaacctttgc 6840
accagagccc tccacatatc agtgtttgtt catactcact tcaacagcaa atgtgactgc 6900
tgagattaag attttacaca agatggtctg taatttcaca gttagtttta tcccattagg 6960
tatgaaagaa ttagcataat tccccttaaa catgaatgaa tcttagattt tttaataaat 7020
agttttggaa gtaaagacag agacatcagg agcacaagga atagcctgag aggacaaaca 7080
gaacaagaaa gagtctggaa atacacagga tgttcttggc ctcctcaaag caagtgcaag 7140
cagatagtac cagcagcccc aggctatcag agcccagtga agagaagtac catgaaagcc 7200
acagctctaa ccaccctgtt ccagagtgac agacagtccc caagacaagc cagcctgagc 7260
cagagagaga actgcaagag aaagtttcta atttaggttc tgttagattc agacaagtgc 7320
aggtcatcct ctctccacag ctactcacct ctccagccta acaaagcctg cagtccacac 7380
tccaaccctg gtgtctcacc tcctagcctc tcccaacatc ctgctctctg accatcttct 7440
gcatctctca tctcaccatc tcccactgtc tacagcctac tcttgcaact accatctcat 7500
tttctgacat cctgtctaca tcttctgcca tactctgcca tctaccatac cacctcttac 7560
catctaccac accatctttt atctccatcc ctctcagaag cctccaagct gaatcctgct 7620
ttatgtgttc atctcagccc ctgcatggaa agctgacccc agaggcagaa ctattcccag 7680
agagcttggc caagaaaaac aaaactacca gcctggccag gctcaggagt agtaagctgc 7740
agtgtctgtt gtgttctagc ttcaacagct gcaggagttc cactctcaaa tgctccacat 7800
ttctcacatc ctcctgattc tggtcactac ccatcttcaa agaacagaat atctcacatc 7860
agcatactgt gaaggactag tcatgggtgc agctgctcag agctgcaaag tcattctgga 7920
tggtggagag cttacaaaca tttcatgatg ctccccccgc tctgatggct ggagcccaat 7980
ccctacacag actcctgctg tatgtgtttt cctttcactc tgagccacag ccagagggca 8040
ggcattcagt ctcctcttca ggctggggct ggggcactga gaactcaccc aacaccttgc 8100
tctcactcct tctgcaaaac aagaaagagc tttgtgctgc agtagccatg aagaatgaaa 8160
ggaaggcttt aactaaaaaa tgtcagagat tattttcaac cccttactgt ggatcaccag 8220
caaggaggaa acacaacaca gagacatttt ttcccctcaa attatcaaaa gaatcactgc 8280
atttgttaaa gagagcaact gaatcaggaa gcagagtttt gaacatatca gaagttagga 8340
atctgcatca gagacaaatg cagtcatggt tgtttgctgc ataccagccc taatcattag 8400
aagcctcatg gacttcaaac atcattccct ctgacaagat gctctagcct aactccatga 8460
gataaaataa atctgccttt cagagccaaa gaagagtcca ccagcttctt ctcagtgtga 8520
acaagagctc cagtcaggtt agtcagtcca gtgcagtaga ggagaccagt ctgcatcctc 8580
taattttcaa aggcaagaag atttgtttac cctggacacc aggcacaagt gaggtcacag 8640
agctcttaga tatgcagtcc tcatgagtga ggagactaaa gcgcatgcca tcaagacttc 8700
agtgtagaga aaacctccaa aaaagcctcc tcactacttc tggaatagct cagaggccga 8760
ggcggcctcg gcctctgcat aaataaaaaa aattagtcag ccatggggcg gagaatgggc 8820
ggaactgggc ggagttaggg gcgggatggg cggagttagg ggcgggacta tggttgctga 8880
ctaattgaga tgcatgcttt gcatacttct gcctgctggg gagcctgggg actttccaca 8940
cctggttgct gactaattga gatgcatgct ttgcatactt ctgcctgctg gggagcctgg 9000
ggactttcca caccctaact gacacacatt ccacagctgc attaatgaat cggccaacgc 9060
gcggggagag gcggtttgcg tattgggcgc tcttccgctt cctcgctcac tgactcgctg 9120
cgctcggtcg ttcggctgcg gcgagcggta tcagctcact caaaggcggt aatacggtta 9180
tccacagaat caggggataa cgcaggaaag aacatgtgag caaaaggcca gcaaaaggcc 9240
aggaaccgta aaaaggccgc gttgctggcg tttttccata ggctccgccc ccctgacgag 9300
catcacaaaa atcgacgctc aagtcagagg tggcgaaacc cgacaggact ataaagatac 9360
caggcgtttc cccctggaag ctccctcgtg cgctctcctg ttccgaccct gccgcttacc 9420
ggatacctgt ccgcctttct cccttcggga agcgtggcgc tttctcatag ctcacgctgt 9480
aggtatctca gttcggtgta ggtcgttcgc tccaagctgg gctgtgtgca cgaacccccc 9540
gttcagcccg accgctgcgc cttatccggt aactatcgtc ttgagtccaa cccggtaaga 9600
cacgacttat cgccactggc agcagccact ggtaacagga ttagcagagc gaggtatgta 9660
ggcggtgcta cagagttctt gaagtggtgg cctaactacg gctacactag aagaacagta 9720
tttggtatct gcgctctgct gaagccagtt accttcggaa aaagagttgg tagctcttga 9780
tccggcaaac aaaccaccgc tggtagcggt ggtttttttg tttgcaagca gcagattacg 9840
cgcagaaaaa aaggatctca agaagatcct ttgatctttt ctacggggtc tgacgctcag 9900
tggaacgaaa actcacgtta agggattttg gtcatgagat tatcaaaaag gatcttcacc 9960
tagatccttt taaattaaaa atgaagtttt aaatcaatct aaagtatata tgagtaaact 10020
tggtctgaca gttaccaatg cttaatcagt gaggcaccta tctcagcgat ctgtctattt 10080
cgttcatcca tagttgcctg actcctgcaa accacgttgt gtctcaaaat ctctgatgtt 10140
acattgcaca agataaaaat atatcatcat gaacaataaa actgtctgct tacataaaca 10200
gtaatacaag gggtgttatg agccatattc aacgggaaac gtcttgctcg aggccgcgat 10260
taaattccaa catggatgct gatttatatg ggtataaatg ggctcgcgat aatgtcgggc 10320
aatcaggtgc gacaatctat cgattgtatg ggaagcccga tgcgccagag ttgtttctga 10380
aacatggcaa aggtagcgtt gccaatgatg ttacagatga gatggtcaga ctaaactggc 10440
tgacggaatt tatgcctctt ccgaccatca agcattttat ccgtactcct gatgatgcat 10500
ggttactcac cactgcgatc cccgggaaaa cagcattcca ggtattagaa gaatatcctg 10560
attcaggtga aaatattgtt gatgcgctgg cagtgttcct gcgccggttg cattcgattc 10620
ctgtttgtaa ttgtcctttt aacagcgatc gcgtatttcg tctcgctcag gcgcaatcac 10680
gaatgaataa cggtttggtt gatgcgagtg attttgatga cgagcgtaat ggctggcctg 10740
ttgaacaagt ctggaaagaa atgcataagc ttttgccatt ctcaccggat tcagtcgtca 10800
ctcatggtga tttctcactt gataacctta tttttgacga ggggaaatta ataggttgta 10860
ttgatgttgg acgagtcgga atcgcagacc gataccagga tcttgccatc ctatggaact 10920
gcctcggtga gttttctcct tcattacaga aacggctttt tcaaaaatat ggtattgata 10980
atcctgatat gaataaattg cagtttcatt tgatgctcga tgagtttttc taagggcggc 11040
ctgccaccat acccacgccg aaacaagcgc tcatgagccc gaagtggcga gcccgatctt 11100
ccccatcggt gatgtcggcg atataggcgc cagcaaccgc acctgtggcg ccggtgatga 11160
gggcgcgcca agtcgacgtc cggcagtc 11188
<210> 107
<211> 11174
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 107
ttggccactc cctctctgcg cgctcgctcg ctcactgagg ccgggcgacc aaaggtcgcc 60
cgacgcccgg gctttgcccg ggcggcctca gtgagcgagc gagcgcgcag agagggagtg 120
gccaactcca tcactagggg ttcctgctag ctctgggtat ttaagcccga gtgagcacgc 180
agggtctcca ttttgaagcg ggaggttacg cgttcgtcga ctactagtgg gtaccagagc 240
tccctaggtt ctagaaccgg tgacgtcggt ggtgactgag atgttttcta ggaaacacaa 300
aagatacaaa aaagaacacg tggaaggata gccaaaaagg ggggctgccc ccatttcctg 360
caccccgctg cgatggctgg caccatttgg aagacttcga gatacactgt tgagcgcagt 420
aagacaacag tgtatctcga agtcttccag atggggccag ccggtccact ctgtatccag 480
gccagttctg caaggcgttc gaggaccacc cccctcccct cgccaccagg gtggtctcat 540
acagaactta taagattccc aaatccaaag acatttcacg tttatggtga tttcccagaa 600
cacatagcga catgcaaata ttgcagggcg ccactcccct gtccctcaca gccatcttcc 660
tgccagggcg cacgcgcgct gggtgttccc gcctagtgac actgggcccg cgattccttg 720
gagcgggttg atgacgtcag cgtttcccat ggtgaagctt ggatctgaat tcggtaccct 780
agttattaat agtaatcaat tacggggtca ttagttcata gcccatatat ggagttccgc 840
gttacataac ttacggtaaa tggcccgcct ggctgaccgc ccaacgaccc ccgcccattg 900
acgtcaataa tgacgtatgt tcccatagta acgccaatag ggactttcca ttgacgtcaa 960
tgggtggact atttacggta aactgcccac ttggcagtac atcaagtgta tcatatgcca 1020
agtacgcccc ctattgacgt caatgacggt aaatggcccg cctggcatta tgcccagtac 1080
atgaccttat gggactttcc tacttggcag tacatctacg tattagtcat cgctattacc 1140
atggtcgagg tgagccccac gttctgcttc actctcccca tctccccccc ctccccaccc 1200
ccaattttgt atttatttat tttttaatta ttttgtgcag cgatgggggc gggggggggg 1260
ggggggcgcg cgccaggcgg ggcggggcgg ggcgaggggc ggggcggggc gaggcggaga 1320
ggtgcggcgg cagccaatca gagcggcgcg ctccgaaagt ttccttttat ggcgaggcgg 1380
cggcggcggc ggccctataa aaagcgaagc gcgcggcggg cgggagtcgc tgcgacgctg 1440
ccttcgcccc gtgccccgct ccgccgccgc ctcgcgccgc ccgccccggc tctgactgac 1500
cgcgttactc ccacaggtga gcgggcggga cggcccttct cctccgggct gtaattagcg 1560
cttggtttaa tgacggcttg ttttctgtgg ctgcgtgaaa gccttgaggg gctccgggag 1620
ctagagcctc tgctaaccat gttcatgcct tcttcttttt cctacagctc ctgggcaacg 1680
tgctggttat tgtgctgtct catcattttg gcaaagaatt cctcgaagat ccgaagggaa 1740
agtcttccac gactgtggga tccgttcgaa gatatcaccg gttgagccac catggaattc 1800
agcagcccca gcagagagga atgccccaag cctctgagcc gggtgtcaat catggccgga 1860
tctctgacag gactgctgct gcttcaggcc gtgtcttggg cttctggcgc tagaccttgc 1920
atccccaaga gcttcggcta cagcagcgtc gtgtgcgtgt gcaatgccac ctactgcgac 1980
agcttcgacc ctcctacctt tcctgctctg ggcaccttca gcagatacga gagcaccaga 2040
tccggcagac ggatggaact gagcatggga cccatccagg ccaatcacac aggcactggc 2100
ctgctgctga cactgcagcc tgagcagaaa ttccagaaag tgaaaggctt cggcggagcc 2160
atgacagatg ccgccgctct gaatatcctg gctctgtctc caccagctca gaacctgctg 2220
ctcaagagct acttcagcga ggaaggcatc ggctacaaca tcatcagagt gcccatggcc 2280
agctgcgact tcagcatcag gacctacacc tacgccgaca cacccgacga tttccagctg 2340
cacaacttca gcctgcctga agaggacacc aagctgaaga tccctctgat ccacagagcc 2400
ctgcagctgg cacaaagacc cgtgtcactg ctggcctctc catggacatc tcccacctgg 2460
ctgaaaacaa atggcgccgt gaatggcaag ggcagcctga aaggccaacc tggcgacatc 2520
taccaccaga cctgggccag atacttcgtg aagttcctgg acgcctatgc cgagcacaag 2580
ctgcagtttt gggccgtgac agccgagaac gaaccttctg ctggactgct gagcggctac 2640
ccctttcagt gcctgggctt tacacccgag caccagcggg actttatcgc ccgtgatctg 2700
ggacccacac tggccaatag cacccaccat aatgtgcggc tgctgatgct ggacgaccag 2760
agactgcttc tgccccactg ggctaaagtg gtgctgacag atcctgaggc cgccaaatac 2820
gtgcacggaa tcgccgtgca ctggtatctg gactttctgg cccctgccaa ggccacactg 2880
ggagagacac acagactgtt ccccaacacc atgctgttcg ccagcgaagc ctgtgtgggc 2940
agcaagtttt gggaacagag cgtgcggctc ggcagctggg atagaggcat gcagtacagc 3000
cacagcatca tcaccaacct gctgtaccac gtcgtcggct ggaccgactg gaatctggcc 3060
ctgaatcctg aaggcggccc taactgggtc cgaaacttcg tggacagccc catcatcgtg 3120
gacatcacca aggacacctt ctacaagcag cccatgttct accacctggg acacttcagc 3180
aagttcatcc ccgagggctc tcagcgcgtt ggactggtgg cttcccagaa gaacgatctg 3240
gacgccgtgg ctctgatgca ccctgatgga tctgctgtgg tggtggtcct gaaccgcagc 3300
agcaaagatg tgcccctgac catcaaggat cccgccgtgg gattcctgga aacaatcagc 3360
cctggctact ccatccacac ctacctgtgg cgtagacagt gacaattgtt aattaagttt 3420
aaaccctcga ggccgcaagc ttatcgataa tcaacctctg gattacaaaa tttgtgaaag 3480
attgactggt attcttaact atgttgctcc ttttacgcta tgtggatacg ctgctttaat 3540
gcctttgtat catgctattg cttcccgtat ggctttcatt ttctcctcct tgtataaatc 3600
ctggttgctg tctctttatg aggagttgtg gcccgttgtc aggcaacgtg gcgtggtgtg 3660
cactgtgttt gctgacgcaa cccccactgg ttggggcatt gccaccacct gtcagctcct 3720
ttccgggact ttcgctttcc ccctccctat tgccacggcg gaactcatcg ccgcctgcct 3780
tgcccgctgc tggacagggg ctcggctgtt gggcactgac aattccgtgg tgttgtcggg 3840
gaaatcatcg tcctttcctt ggctgctcgc ctgtgttgcc acctggattc tgcgcgggac 3900
gtccttctgc tacgtccctt cggccctcaa tccagcggac cttccttccc gcggcctgct 3960
gccggctctg cggcctcttc cgcgtcttcg ccttcgccct cagacgagtc ggatctccct 4020
ttgggccgcc tccccgcatc gataccgtcg actagagctc gctgatcagc ctcgactgtg 4080
ccttctagtt gccagccatc tgttgtttgc ccctcccccg tgccttcctt gaccctggaa 4140
ggtgccactc ccactgtcct ttcctaataa aatgaggaaa ttgcatcgca ttgtctgagt 4200
aggtgtcatt ctattctggg gggtggggtg gggcaggaca gcaaggggga ggattgggaa 4260
gacaatagca ggcatgctgg ggagagatcc acgataacaa acagcttttt tggggtgaac 4320
atattgactg aattccctgc aggttggcca ctccctctct gcgcgctcgc tcgctcactg 4380
aggccgcccg ggcaaagccc gggcgtcggg cgacctttgg tcgcccggcc tcagtgagcg 4440
agcgagcgcg cagagaggga gtggccaact ccatcactag gggttcctgc ggccgctcgt 4500
acggtctcga ggaattcctg caggataact tgccaacctc attctaaaat gtatatagaa 4560
gcccaaaaga caataacaaa aatattcttg tagaacaaaa tgggaaagaa tgttccacta 4620
aatatcaaga tttagagcaa agcatgagat gtgtggggat agacagtgag gctgataaaa 4680
tagagtagag ctcagaaaca gacccattga tatatgtaag tgacctatga aaaaaatatg 4740
gcattttaca atgggaaaat gatggtcttt ttctttttta gaaaaacagg gaaatatatt 4800
tatatgtaaa aaataaaagg gaacccatat gtcataccat acacacaaaa aaattccagt 4860
gaattataag tctaaatgga gaaggcaaaa ctttaaatct tttagaaaat aatatagaag 4920
catgcagacc agcctggcca acatgatgaa accctctcta ctaataataa aatcagtaga 4980
actactcagg actactttga gtgggaagtc cttttctatg aagacttctt tggccaaaat 5040
taggctctaa atgcaaggag atagtgcatc atgcctggct gcacttactg ataaatgatg 5100
ttatcaccat ctttaaccaa atgcacagga acaagttatg gtactgatgt gctggattga 5160
gaaggagctc tacttccttg acaggacaca tttgtatcaa cttaaaaaag cagatttttg 5220
ccagcagaac tattcattca gaggtaggaa acttagaata gatgatgtca ctgattagca 5280
tggcttcccc atctccacag ctgcttccca cccaggttgc ccacagttga gtttgtccag 5340
tgctcagggc tgcccactct cagtaagaag ccccacacca gcccctctcc aaatatgttg 5400
gctgttcctt ccattaaagt gaccccactt tagagcagca agtggatttc tgtttcttac 5460
agttcaggaa ggaggagtca gctgtgagaa cctggagcct gagatgcttc taagtcccac 5520
tgctactggg gtcagggaag ccagactcca gcatcagcag tcaggagcac taagcccttg 5580
ccaacatcct gtttctcaga gaaactgctt ccattataat ggttgtcctt ttttaagcta 5640
tcaagccaaa caaccagtgt ctaccattat tctcatcacc tgaagccaag ggttctagca 5700
aaagtcaagc tgtcttgtaa tggttgatgt gcctccagct tctgtcttca gtcactccac 5760
tcttagcctg ctctgaatca actctgacca cagttccctg gagcccctgc cacctgctgc 5820
ccctgccacc ttctccatct gcagtgctgt gcagccttct gcactcttgc agagctaata 5880
ggtggagact tgaaggaaga ggaggaaagt ttctcataat agccttgctg caagctcaaa 5940
tgggaggtgg gcactgtgcc caggagcctt ggagcaaagg ctgtgcccaa cctctgactg 6000
catccaggtt tggtcttgac agagataaga agccctggct tttggagcca aaatctaggt 6060
cagacttagg caggattctc aaagtttatc agcagaacat gaggcagaag accctttctg 6120
ctccagcttc ttcaggctca accttcatca gaatagatag aaagagaggc tgtgagggtt 6180
cttaaaacag aagcaaatct gactcagaga ataaacaacc tcctagtaaa ctacagctta 6240
gacagagcat ctggtggtga gtgtgctcag tgtcctactc aactgtctgg tatcagccct 6300
catgaggact tctcttcttt ccctcataga cctccatctc tgttttcctt agcctgcaga 6360
aatctggatg gctattcaca gaatgcctgt gctttcagag ttgcattttt tctctggtat 6420
tctggttcaa gcatttgaag gtaggaaagg ttctccaagt gcaagaaagc cagccctgag 6480
cctcaactgc ctggctagtg tggtcagtag gatgcaaagg ctgttgaatg ccacaaggcc 6540
aaactttaac ctgtgtacca caagcctagc agcagaggca gctctgctca ctggaactct 6600
ctgtcttctt tctcctgagc cttttctttt cctgagtttt ctagctctcc tcaaccttac 6660
ctctgcccta cccaggacaa acccaagagc cactgtttct gtgatgtcct ctccagccct 6720
aattaggcat catgacttca gcctgacctt ccatgctcag aagcagtgct aatccacttc 6780
agatgagctg ctctatgcaa cacaggcaga gcctacaaac ctttgcacca gagccctcca 6840
catatcagtg tttgttcata ctcacttcaa cagcaaatgt gactgctgag attaagattt 6900
tacacaagat ggtctgtaat ttcacagtta gttttatccc attaggtatg aaagaattag 6960
cataattccc cttaaacatg aatgaatctt agatttttta ataaatagtt ttggaagtaa 7020
agacagagac atcaggagca caaggaatag cctgagagga caaacagaac aagaaagagt 7080
ctggaaatac acaggatgtt cttggcctcc tcaaagcaag tgcaagcaga tagtaccagc 7140
agccccaggc tatcagagcc cagtgaagag aagtaccatg aaagccacag ctctaaccac 7200
cctgttccag agtgacagac agtccccaag acaagccagc ctgagccaga gagagaactg 7260
caagagaaag tttctaattt aggttctgtt agattcagac aagtgcaggt catcctctct 7320
ccacagctac tcacctctcc agcctaacaa agcctgcagt ccacactcca accctggtgt 7380
ctcacctcct agcctctccc aacatcctgc tctctgacca tcttctgcat ctctcatctc 7440
accatctccc actgtctaca gcctactctt gcaactacca tctcattttc tgacatcctg 7500
tctacatctt ctgccatact ctgccatcta ccataccacc tcttaccatc taccacacca 7560
tcttttatct ccatccctct cagaagcctc caagctgaat cctgctttat gtgttcatct 7620
cagcccctgc atggaaagct gaccccagag gcagaactat tcccagagag cttggccaag 7680
aaaaacaaaa ctaccagcct ggccaggctc aggagtagta agctgcagtg tctgttgtgt 7740
tctagcttca acagctgcag gagttccact ctcaaatgct ccacatttct cacatcctcc 7800
tgattctggt cactacccat cttcaaagaa cagaatatct cacatcagca tactgtgaag 7860
gactagtcat gggtgcagct gctcagagct gcaaagtcat tctggatggt ggagagctta 7920
caaacatttc atgatgctcc ccccgctctg atggctggag cccaatccct acacagactc 7980
ctgctgtatg tgttttcctt tcactctgag ccacagccag agggcaggca ttcagtctcc 8040
tcttcaggct ggggctgggg cactgagaac tcacccaaca ccttgctctc actccttctg 8100
caaaacaaga aagagctttg tgctgcagta gccatgaaga atgaaaggaa ggctttaact 8160
aaaaaatgtc agagattatt ttcaacccct tactgtggat caccagcaag gaggaaacac 8220
aacacagaga cattttttcc cctcaaatta tcaaaagaat cactgcattt gttaaagaga 8280
gcaactgaat caggaagcag agttttgaac atatcagaag ttaggaatct gcatcagaga 8340
caaatgcagt catggttgtt tgctgcatac cagccctaat cattagaagc ctcatggact 8400
tcaaacatca ttccctctga caagatgctc tagcctaact ccatgagata aaataaatct 8460
gcctttcaga gccaaagaag agtccaccag cttcttctca gtgtgaacaa gagctccagt 8520
caggttagtc agtccagtgc agtagaggag accagtctgc atcctctaat tttcaaaggc 8580
aagaagattt gtttaccctg gacaccaggc acaagtgagg tcacagagct cttagatatg 8640
cagtcctcat gagtgaggag actaaagcgc atgccatcaa gacttcagtg tagagaaaac 8700
ctccaaaaaa gcctcctcac tacttctgga atagctcaga ggccgaggcg gcctcggcct 8760
ctgcataaat aaaaaaaatt agtcagccat ggggcggaga atgggcggaa ctgggcggag 8820
ttaggggcgg gatgggcgga gttaggggcg ggactatggt tgctgactaa ttgagatgca 8880
tgctttgcat acttctgcct gctggggagc ctggggactt tccacacctg gttgctgact 8940
aattgagatg catgctttgc atacttctgc ctgctgggga gcctggggac tttccacacc 9000
ctaactgaca cacattccac agctgcatta atgaatcggc caacgcgcgg ggagaggcgg 9060
tttgcgtatt gggcgctctt ccgcttcctc gctcactgac tcgctgcgct cggtcgttcg 9120
gctgcggcga gcggtatcag ctcactcaaa ggcggtaata cggttatcca cagaatcagg 9180
ggataacgca ggaaagaaca tgtgagcaaa aggccagcaa aaggccagga accgtaaaaa 9240
ggccgcgttg ctggcgtttt tccataggct ccgcccccct gacgagcatc acaaaaatcg 9300
acgctcaagt cagaggtggc gaaacccgac aggactataa agataccagg cgtttccccc 9360
tggaagctcc ctcgtgcgct ctcctgttcc gaccctgccg cttaccggat acctgtccgc 9420
ctttctccct tcgggaagcg tggcgctttc tcatagctca cgctgtaggt atctcagttc 9480
ggtgtaggtc gttcgctcca agctgggctg tgtgcacgaa ccccccgttc agcccgaccg 9540
ctgcgcctta tccggtaact atcgtcttga gtccaacccg gtaagacacg acttatcgcc 9600
actggcagca gccactggta acaggattag cagagcgagg tatgtaggcg gtgctacaga 9660
gttcttgaag tggtggccta actacggcta cactagaaga acagtatttg gtatctgcgc 9720
tctgctgaag ccagttacct tcggaaaaag agttggtagc tcttgatccg gcaaacaaac 9780
caccgctggt agcggtggtt tttttgtttg caagcagcag attacgcgca gaaaaaaagg 9840
atctcaagaa gatcctttga tcttttctac ggggtctgac gctcagtgga acgaaaactc 9900
acgttaaggg attttggtca tgagattatc aaaaaggatc ttcacctaga tccttttaaa 9960
ttaaaaatga agttttaaat caatctaaag tatatatgag taaacttggt ctgacagtta 10020
ccaatgctta atcagtgagg cacctatctc agcgatctgt ctatttcgtt catccatagt 10080
tgcctgactc ctgcaaacca cgttgtgtct caaaatctct gatgttacat tgcacaagat 10140
aaaaatatat catcatgaac aataaaactg tctgcttaca taaacagtaa tacaaggggt 10200
gttatgagcc atattcaacg ggaaacgtct tgctcgaggc cgcgattaaa ttccaacatg 10260
gatgctgatt tatatgggta taaatgggct cgcgataatg tcgggcaatc aggtgcgaca 10320
atctatcgat tgtatgggaa gcccgatgcg ccagagttgt ttctgaaaca tggcaaaggt 10380
agcgttgcca atgatgttac agatgagatg gtcagactaa actggctgac ggaatttatg 10440
cctcttccga ccatcaagca ttttatccgt actcctgatg atgcatggtt actcaccact 10500
gcgatccccg ggaaaacagc attccaggta ttagaagaat atcctgattc aggtgaaaat 10560
attgttgatg cgctggcagt gttcctgcgc cggttgcatt cgattcctgt ttgtaattgt 10620
ccttttaaca gcgatcgcgt atttcgtctc gctcaggcgc aatcacgaat gaataacggt 10680
ttggttgatg cgagtgattt tgatgacgag cgtaatggct ggcctgttga acaagtctgg 10740
aaagaaatgc ataagctttt gccattctca ccggattcag tcgtcactca tggtgatttc 10800
tcacttgata accttatttt tgacgagggg aaattaatag gttgtattga tgttggacga 10860
gtcggaatcg cagaccgata ccaggatctt gccatcctat ggaactgcct cggtgagttt 10920
tctccttcat tacagaaacg gctttttcaa aaatatggta ttgataatcc tgatatgaat 10980
aaattgcagt ttcatttgat gctcgatgag tttttctaag ggcggcctgc caccataccc 11040
acgccgaaac aagcgctcat gagcccgaag tggcgagccc gatcttcccc atcggtgatg 11100
tcggcgatat aggcgccagc aaccgcacct gtggcgccgg tgatgagggc gcgccaagtc 11160
gacgtccggc agtc 11174
<210> 108
<211> 10841
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 108
ttggccactc cctctctgcg cgctcgctcg ctcactgagg ccgggcgacc aaaggtcgcc 60
cgacgcccgg gctttgcccg ggcggcctca gtgagcgagc gagcgcgcag agagggagtg 120
gccaactcca tcactagggg ttcctgctag ctctgggtat ttaagcccga gtgagcacgc 180
agggtctcca ttttgaagcg ggaggttacg cgttcgtcga ctactagtgg gtaccagagc 240
tccctaggtt ctagaaccgg tgacgtctcc catggtgaag cttggatctg aattcggtac 300
cctagttatt aatagtaatc aattacgggg tcattagttc atagcccata tatggagttc 360
cgcgttacat aacttacggt aaatggcccg cctggctgac cgcccaacga cccccgccca 420
ttgacgtcaa taatgacgta tgttcccata gtaacgccaa tagggacttt ccattgacgt 480
caatgggtgg actatttacg gtaaactgcc cacttggcag tacatcaagt gtatcatatg 540
ccaagtacgc cccctattga cgtcaatgac ggtaaatggc ccgcctggca ttatgcccag 600
tacatgacct tatgggactt tcctacttgg cagtacatct acgtattagt catcgctatt 660
accatggtcg aggtgagccc cacgttctgc ttcactctcc ccatctcccc cccctcccca 720
cccccaattt tgtatttatt tattttttaa ttattttgtg cagcgatggg ggcggggggg 780
gggggggggc gcgcgccagg cggggcgggg cggggcgagg ggcggggcgg ggcgaggcgg 840
agaggtgcgg cggcagccaa tcagagcggc gcgctccgaa agtttccttt tatggcgagg 900
cggcggcggc ggcggcccta taaaaagcga agcgcgcggc gggcgggagt cgctgcgacg 960
ctgccttcgc cccgtgcccc gctccgccgc cgcctcgcgc cgcccgcccc ggctctgact 1020
gaccgcgtta ctcccacagg tgagcgggcg ggacggccct tctcctccgg gctgtaatta 1080
gcgcttggtt taatgacggc ttgtctggag gcttgctttg ggctgtatgc tgagggtatc 1140
aagactacga attttggcct ctgactgatt cgtagcttat accctcagga cacaaggccc 1200
tttatcagca ctcacatgga acaaatggcc accgtgggag gatgacaatt tctgtggctg 1260
cgtgaaagcc ttgaggggct ccgggagcta gagcctctgc taaccatgtt catgccttct 1320
tctttttcct acagctcctg ggcaacgtgc tggttattgt gctgtctcat cattttggca 1380
aagaattcct cgaagatccg aagggaaagt cttccacgac tgtgggatcc gttcgaagat 1440
atcaccggtt gagccaccat ggaattcagc agccccagca gagaggaatg ccccaagcct 1500
ctgagccggg tgtcaatcat ggccggatct ctgacaggac tgctgctgct tcaggccgtg 1560
tcttgggctt ctggcgctag accttgcatc cccaagagct tcggctacag cagcgtcgtg 1620
tgcgtgtgca atgccaccta ctgcgacagc ttcgaccctc ctacctttcc tgctctgggc 1680
accttcagca gatacgagag caccagatcc ggcagacgga tggaactgag catgggaccc 1740
atccaggcca atcacacagg cactggcctg ctgctgacac tgcagcctga gcagaaattc 1800
cagaaagtga aaggcttcgg cggagccatg acagatgccg ccgctctgaa tatcctggct 1860
ctgtctccac cagctcagaa cctgctgctc aagagctact tcagcgagga aggcatcggc 1920
tacaacatca tcagagtgcc catggccagc tgcgacttca gcatcaggac ctacacctac 1980
gccgacacac ccgacgattt ccagctgcac aacttcagcc tgcctgaaga ggacaccaag 2040
ctgaagatcc ctctgatcca cagagccctg cagctggcac aaagacccgt gtcactgctg 2100
gcctctccat ggacatctcc cacctggctg aaaacaaatg gcgccgtgaa tggcaagggc 2160
agcctgaaag gccaacctgg cgacatctac caccagacct gggccagata cttcgtgaag 2220
ttcctggacg cctatgccga gcacaagctg cagttttggg ccgtgacagc cgagaacgaa 2280
ccttctgctg gactgctgag cggctacccc tttcagtgcc tgggctttac acccgagcac 2340
cagcgggact ttatcgcccg tgatctggga cccacactgg ccaatagcac ccaccataat 2400
gtgcggctgc tgatgctgga cgaccagaga ctgcttctgc cccactgggc taaagtggtg 2460
ctgacagatc ctgaggccgc caaatacgtg cacggaatcg ccgtgcactg gtatctggac 2520
tttctggccc ctgccaaggc cacactggga gagacacaca gactgttccc caacaccatg 2580
ctgttcgcca gcgaagcctg tgtgggcagc aagttttggg aacagagcgt gcggctcggc 2640
agctgggata gaggcatgca gtacagccac agcatcatca ccaacctgct gtaccacgtc 2700
gtcggctgga ccgactggaa tctggccctg aatcctgaag gcggccctaa ctgggtccga 2760
aacttcgtgg acagccccat catcgtggac atcaccaagg acaccttcta caagcagccc 2820
atgttctacc acctgggaca cttcagcaag ttcatccccg agggctctca gcgcgttgga 2880
ctggtggctt cccagaagaa cgatctggac gccgtggctc tgatgcaccc tgatggatct 2940
gctgtggtgg tggtcctgaa ccgcagcagc aaagatgtgc ccctgaccat caaggatccc 3000
gccgtgggat tcctggaaac aatcagccct ggctactcca tccacaccta cctgtggcgt 3060
agacagtgac aattgttaat taagtttaaa ccctcgaggc cgcaagctta tcgataatca 3120
acctctggat tacaaaattt gtgaaagatt gactggtatt cttaactatg ttgctccttt 3180
tacgctatgt ggatacgctg ctttaatgcc tttgtatcat gctattgctt cccgtatggc 3240
tttcattttc tcctccttgt ataaatcctg gttgctgtct ctttatgagg agttgtggcc 3300
cgttgtcagg caacgtggcg tggtgtgcac tgtgtttgct gacgcaaccc ccactggttg 3360
gggcattgcc accacctgtc agctcctttc cgggactttc gctttccccc tccctattgc 3420
cacggcggaa ctcatcgccg cctgccttgc ccgctgctgg acaggggctc ggctgttggg 3480
cactgacaat tccgtggtgt tgtcggggaa atcatcgtcc tttccttggc tgctcgcctg 3540
tgttgccacc tggattctgc gcgggacgtc cttctgctac gtcccttcgg ccctcaatcc 3600
agcggacctt ccttcccgcg gcctgctgcc ggctctgcgg cctcttccgc gtcttcgcct 3660
tcgccctcag acgagtcgga tctccctttg ggccgcctcc ccgcatcgat accgtcgact 3720
agagctcgct gatcagcctc gactgtgcct tctagttgcc agccatctgt tgtttgcccc 3780
tcccccgtgc cttccttgac cctggaaggt gccactccca ctgtcctttc ctaataaaat 3840
gaggaaattg catcgcattg tctgagtagg tgtcattcta ttctgggggg tggggtgggg 3900
caggacagca agggggagga ttgggaagac aatagcaggc atgctgggga gagatccacg 3960
ataacaaaca gcttttttgg ggtgaacata ttgactgaat tccctgcagg ttggccactc 4020
cctctctgcg cgctcgctcg ctcactgagg ccgcccgggc aaagcccggg cgtcgggcga 4080
cctttggtcg cccggcctca gtgagcgagc gagcgcgcag agagggagtg gccaactcca 4140
tcactagggg ttcctgcggc cgctcgtacg gtctcgagga attcctgcag gataacttgc 4200
caacctcatt ctaaaatgta tatagaagcc caaaagacaa taacaaaaat attcttgtag 4260
aacaaaatgg gaaagaatgt tccactaaat atcaagattt agagcaaagc atgagatgtg 4320
tggggataga cagtgaggct gataaaatag agtagagctc agaaacagac ccattgatat 4380
atgtaagtga cctatgaaaa aaatatggca ttttacaatg ggaaaatgat ggtctttttc 4440
ttttttagaa aaacagggaa atatatttat atgtaaaaaa taaaagggaa cccatatgtc 4500
ataccataca cacaaaaaaa ttccagtgaa ttataagtct aaatggagaa ggcaaaactt 4560
taaatctttt agaaaataat atagaagcat gcagaccagc ctggccaaca tgatgaaacc 4620
ctctctacta ataataaaat cagtagaact actcaggact actttgagtg ggaagtcctt 4680
ttctatgaag acttctttgg ccaaaattag gctctaaatg caaggagata gtgcatcatg 4740
cctggctgca cttactgata aatgatgtta tcaccatctt taaccaaatg cacaggaaca 4800
agttatggta ctgatgtgct ggattgagaa ggagctctac ttccttgaca ggacacattt 4860
gtatcaactt aaaaaagcag atttttgcca gcagaactat tcattcagag gtaggaaact 4920
tagaatagat gatgtcactg attagcatgg cttccccatc tccacagctg cttcccaccc 4980
aggttgccca cagttgagtt tgtccagtgc tcagggctgc ccactctcag taagaagccc 5040
cacaccagcc cctctccaaa tatgttggct gttccttcca ttaaagtgac cccactttag 5100
agcagcaagt ggatttctgt ttcttacagt tcaggaagga ggagtcagct gtgagaacct 5160
ggagcctgag atgcttctaa gtcccactgc tactggggtc agggaagcca gactccagca 5220
tcagcagtca ggagcactaa gcccttgcca acatcctgtt tctcagagaa actgcttcca 5280
ttataatggt tgtccttttt taagctatca agccaaacaa ccagtgtcta ccattattct 5340
catcacctga agccaagggt tctagcaaaa gtcaagctgt cttgtaatgg ttgatgtgcc 5400
tccagcttct gtcttcagtc actccactct tagcctgctc tgaatcaact ctgaccacag 5460
ttccctggag cccctgccac ctgctgcccc tgccaccttc tccatctgca gtgctgtgca 5520
gccttctgca ctcttgcaga gctaataggt ggagacttga aggaagagga ggaaagtttc 5580
tcataatagc cttgctgcaa gctcaaatgg gaggtgggca ctgtgcccag gagccttgga 5640
gcaaaggctg tgcccaacct ctgactgcat ccaggtttgg tcttgacaga gataagaagc 5700
cctggctttt ggagccaaaa tctaggtcag acttaggcag gattctcaaa gtttatcagc 5760
agaacatgag gcagaagacc ctttctgctc cagcttcttc aggctcaacc ttcatcagaa 5820
tagatagaaa gagaggctgt gagggttctt aaaacagaag caaatctgac tcagagaata 5880
aacaacctcc tagtaaacta cagcttagac agagcatctg gtggtgagtg tgctcagtgt 5940
cctactcaac tgtctggtat cagccctcat gaggacttct cttctttccc tcatagacct 6000
ccatctctgt tttccttagc ctgcagaaat ctggatggct attcacagaa tgcctgtgct 6060
ttcagagttg cattttttct ctggtattct ggttcaagca tttgaaggta ggaaaggttc 6120
tccaagtgca agaaagccag ccctgagcct caactgcctg gctagtgtgg tcagtaggat 6180
gcaaaggctg ttgaatgcca caaggccaaa ctttaacctg tgtaccacaa gcctagcagc 6240
agaggcagct ctgctcactg gaactctctg tcttctttct cctgagcctt ttcttttcct 6300
gagttttcta gctctcctca accttacctc tgccctaccc aggacaaacc caagagccac 6360
tgtttctgtg atgtcctctc cagccctaat taggcatcat gacttcagcc tgaccttcca 6420
tgctcagaag cagtgctaat ccacttcaga tgagctgctc tatgcaacac aggcagagcc 6480
tacaaacctt tgcaccagag ccctccacat atcagtgttt gttcatactc acttcaacag 6540
caaatgtgac tgctgagatt aagattttac acaagatggt ctgtaatttc acagttagtt 6600
ttatcccatt aggtatgaaa gaattagcat aattcccctt aaacatgaat gaatcttaga 6660
ttttttaata aatagttttg gaagtaaaga cagagacatc aggagcacaa ggaatagcct 6720
gagaggacaa acagaacaag aaagagtctg gaaatacaca ggatgttctt ggcctcctca 6780
aagcaagtgc aagcagatag taccagcagc cccaggctat cagagcccag tgaagagaag 6840
taccatgaaa gccacagctc taaccaccct gttccagagt gacagacagt ccccaagaca 6900
agccagcctg agccagagag agaactgcaa gagaaagttt ctaatttagg ttctgttaga 6960
ttcagacaag tgcaggtcat cctctctcca cagctactca cctctccagc ctaacaaagc 7020
ctgcagtcca cactccaacc ctggtgtctc acctcctagc ctctcccaac atcctgctct 7080
ctgaccatct tctgcatctc tcatctcacc atctcccact gtctacagcc tactcttgca 7140
actaccatct cattttctga catcctgtct acatcttctg ccatactctg ccatctacca 7200
taccacctct taccatctac cacaccatct tttatctcca tccctctcag aagcctccaa 7260
gctgaatcct gctttatgtg ttcatctcag cccctgcatg gaaagctgac cccagaggca 7320
gaactattcc cagagagctt ggccaagaaa aacaaaacta ccagcctggc caggctcagg 7380
agtagtaagc tgcagtgtct gttgtgttct agcttcaaca gctgcaggag ttccactctc 7440
aaatgctcca catttctcac atcctcctga ttctggtcac tacccatctt caaagaacag 7500
aatatctcac atcagcatac tgtgaaggac tagtcatggg tgcagctgct cagagctgca 7560
aagtcattct ggatggtgga gagcttacaa acatttcatg atgctccccc cgctctgatg 7620
gctggagccc aatccctaca cagactcctg ctgtatgtgt tttcctttca ctctgagcca 7680
cagccagagg gcaggcattc agtctcctct tcaggctggg gctggggcac tgagaactca 7740
cccaacacct tgctctcact ccttctgcaa aacaagaaag agctttgtgc tgcagtagcc 7800
atgaagaatg aaaggaaggc tttaactaaa aaatgtcaga gattattttc aaccccttac 7860
tgtggatcac cagcaaggag gaaacacaac acagagacat tttttcccct caaattatca 7920
aaagaatcac tgcatttgtt aaagagagca actgaatcag gaagcagagt tttgaacata 7980
tcagaagtta ggaatctgca tcagagacaa atgcagtcat ggttgtttgc tgcataccag 8040
ccctaatcat tagaagcctc atggacttca aacatcattc cctctgacaa gatgctctag 8100
cctaactcca tgagataaaa taaatctgcc tttcagagcc aaagaagagt ccaccagctt 8160
cttctcagtg tgaacaagag ctccagtcag gttagtcagt ccagtgcagt agaggagacc 8220
agtctgcatc ctctaatttt caaaggcaag aagatttgtt taccctggac accaggcaca 8280
agtgaggtca cagagctctt agatatgcag tcctcatgag tgaggagact aaagcgcatg 8340
ccatcaagac ttcagtgtag agaaaacctc caaaaaagcc tcctcactac ttctggaata 8400
gctcagaggc cgaggcggcc tcggcctctg cataaataaa aaaaattagt cagccatggg 8460
gcggagaatg ggcggaactg ggcggagtta ggggcgggat gggcggagtt aggggcggga 8520
ctatggttgc tgactaattg agatgcatgc tttgcatact tctgcctgct ggggagcctg 8580
gggactttcc acacctggtt gctgactaat tgagatgcat gctttgcata cttctgcctg 8640
ctggggagcc tggggacttt ccacacccta actgacacac attccacagc tgcattaatg 8700
aatcggccaa cgcgcgggga gaggcggttt gcgtattggg cgctcttccg cttcctcgct 8760
cactgactcg ctgcgctcgg tcgttcggct gcggcgagcg gtatcagctc actcaaaggc 8820
ggtaatacgg ttatccacag aatcagggga taacgcagga aagaacatgt gagcaaaagg 8880
ccagcaaaag gccaggaacc gtaaaaaggc cgcgttgctg gcgtttttcc ataggctccg 8940
cccccctgac gagcatcaca aaaatcgacg ctcaagtcag aggtggcgaa acccgacagg 9000
actataaaga taccaggcgt ttccccctgg aagctccctc gtgcgctctc ctgttccgac 9060
cctgccgctt accggatacc tgtccgcctt tctcccttcg ggaagcgtgg cgctttctca 9120
tagctcacgc tgtaggtatc tcagttcggt gtaggtcgtt cgctccaagc tgggctgtgt 9180
gcacgaaccc cccgttcagc ccgaccgctg cgccttatcc ggtaactatc gtcttgagtc 9240
caacccggta agacacgact tatcgccact ggcagcagcc actggtaaca ggattagcag 9300
agcgaggtat gtaggcggtg ctacagagtt cttgaagtgg tggcctaact acggctacac 9360
tagaagaaca gtatttggta tctgcgctct gctgaagcca gttaccttcg gaaaaagagt 9420
tggtagctct tgatccggca aacaaaccac cgctggtagc ggtggttttt ttgtttgcaa 9480
gcagcagatt acgcgcagaa aaaaaggatc tcaagaagat cctttgatct tttctacggg 9540
gtctgacgct cagtggaacg aaaactcacg ttaagggatt ttggtcatga gattatcaaa 9600
aaggatcttc acctagatcc ttttaaatta aaaatgaagt tttaaatcaa tctaaagtat 9660
atatgagtaa acttggtctg acagttacca atgcttaatc agtgaggcac ctatctcagc 9720
gatctgtcta tttcgttcat ccatagttgc ctgactcctg caaaccacgt tgtgtctcaa 9780
aatctctgat gttacattgc acaagataaa aatatatcat catgaacaat aaaactgtct 9840
gcttacataa acagtaatac aaggggtgtt atgagccata ttcaacggga aacgtcttgc 9900
tcgaggccgc gattaaattc caacatggat gctgatttat atgggtataa atgggctcgc 9960
gataatgtcg ggcaatcagg tgcgacaatc tatcgattgt atgggaagcc cgatgcgcca 10020
gagttgtttc tgaaacatgg caaaggtagc gttgccaatg atgttacaga tgagatggtc 10080
agactaaact ggctgacgga atttatgcct cttccgacca tcaagcattt tatccgtact 10140
cctgatgatg catggttact caccactgcg atccccggga aaacagcatt ccaggtatta 10200
gaagaatatc ctgattcagg tgaaaatatt gttgatgcgc tggcagtgtt cctgcgccgg 10260
ttgcattcga ttcctgtttg taattgtcct tttaacagcg atcgcgtatt tcgtctcgct 10320
caggcgcaat cacgaatgaa taacggtttg gttgatgcga gtgattttga tgacgagcgt 10380
aatggctggc ctgttgaaca agtctggaaa gaaatgcata agcttttgcc attctcaccg 10440
gattcagtcg tcactcatgg tgatttctca cttgataacc ttatttttga cgaggggaaa 10500
ttaataggtt gtattgatgt tggacgagtc ggaatcgcag accgatacca ggatcttgcc 10560
atcctatgga actgcctcgg tgagttttct ccttcattac agaaacggct ttttcaaaaa 10620
tatggtattg ataatcctga tatgaataaa ttgcagtttc atttgatgct cgatgagttt 10680
ttctaagggc ggcctgccac catacccacg ccgaaacaag cgctcatgag cccgaagtgg 10740
cgagcccgat cttccccatc ggtgatgtcg gcgatatagg cgccagcaac cgcacctgtg 10800
gcgccggtga tgagggcgcg ccaagtcgac gtccggcagt c 10841
<210> 109
<211> 11187
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 109
ctagaaccgg tgacgtctcc catggtgaag cttggatctg aattcggtac ctagttataa 60
tagtaatcaa ttacggggtc attagttcat agcccatata tggagttccg cgttacataa 120
cttacggtaa atggcccgcc tggctgaccg cccaacgacc cccgcccatt gacgtcaata 180
atgacgtatg ttcccatagt aacgccaata gggactttcc attgacgtca atgggtggag 240
tatttacggt aaactgccca cttggcagta catcaagtgt atcatatgcc aagtacgccc 300
cctattgacg tcaatgacgg taaatggccc gcctggcatt atgcccagta catgacctta 360
tgggactttc ctacttggca gtacatctac gtattagtca tcgctattac catggtcgag 420
gtgagcccca cgttctgctt cactctcccc atctcccccc cctccccacc cccaattttg 480
tatttattta ttttttaatt attttgtgca gcgatggggg cggggggggg gggggggcgc 540
gcgccaggcg gggcggggcg gggcgagggg cggggcgggg cgaggcggag aggtgcggcg 600
gcagccaatc agagcggcgc gctccgaaag tttcctttta tggcgaggcg gcggcggcgg 660
cggccctata aaaagcgaag cgcgcggcgg gcgggagtcg ctgcgacgct gccttcgccc 720
cgtgccccgc tccgccgccg cctcgcgccg cccgccccgg ctctgactga ccgcgttact 780
cccacaggtg agcgggcggg acggcccttc tcctccgggc tgtaattagc gcttggttta 840
atgacggctt gtttcttttc tgtggctgcg tgaaagcctt gaggggctcc gggagctaga 900
gcctctgcta accatgttca tgccttcttc tttttcctac agctcctggg caacgtgctg 960
gttattgtgc tgtctcatca ttttggcaaa gaattcctcg aagatccgaa gggaaagtct 1020
tccacgactg tgggatccgt tcgaagatat caccggttga gccaccatgg aattcagcag 1080
ccccagcaga gaggaatgcc ccaagcctct gagccgggtg tcaatcatgg ccggatctct 1140
gacaggactg ctgctgcttc aggccgtgtc ttgggcttct ggcgctagac cttgcatccc 1200
caagagcttc ggctacagca gcgtcgtgtg cgtgtgcaat gccacctact gcgacagctt 1260
cgaccctcct acctttcctg ctctgggcac cttcagcaga tacgagagca ccagatccgg 1320
cagacggatg gaactgagca tgggacccat ccaggccaat cacacaggca ctggcctgct 1380
gctgacactg cagcctgagc agaaattcca gaaagtgaaa ggcttcggcg gagccatgac 1440
agatgccgcc gctctgaata tcctggctct gtctccacca gctcagaacc tgctgctcaa 1500
gagctacttc agcgaggaag gcatcggcta caacatcatc agagtgccca tggccagctg 1560
cgacttcagc atcaggacct acacctacgc cgacacaccc gacgatttcc agctgcacaa 1620
cttcagcctg cctgaagagg acaccaagct gaagatccct ctgatccaca gagccctgca 1680
gctggcacaa agacccgtgt cactgctggc ctctccatgg acatctccca cctggctgaa 1740
aacaaatggc gccgtgaatg gcaagggcag cctgaaaggc caacctggcg acatctacca 1800
ccagacctgg gccagatact tcgtgaagtt cctggacgcc tatgccgagc acaagctgca 1860
gttttgggcc gtgacagccg agaacgaacc ttctgctgga ctgctgagcg gctacccctt 1920
tcagtgcctg ggctttacac ccgagcacca gcgggacttt atcgcccgtg atctgggacc 1980
cacactggcc aatagcaccc accataatgt gcggctgctg atgctggacg accagagact 2040
gcttctgccc cactgggcta aagtggtgct gacagatcct gaggccgcca aatacgtgca 2100
cggaatcgcc gtgcactggt atctggactt tctggcccct gccaaggcca cactgggaga 2160
gacacacaga ctgttcccca acaccatgct gttcgccagc gaagcctgtg tgggcagcaa 2220
gttttgggaa cagagcgtgc ggctcggcag ctgggataga ggcatgcagt acagccacag 2280
catcatcacc aacctgctgt accacgtcgt cggctggacc gactggaatc tggccctgaa 2340
tcctgaaggc ggccctaact gggtccgaaa cttcgtggac agccccatca tcgtggacat 2400
caccaaggac accttctaca agcagcccat gttctaccac ctgggacact tcagcaagtt 2460
catccccgag ggctctcagc gcgttggact ggtggcttcc cagaagaacg atctggacgc 2520
cgtggctctg atgcaccctg atggatctgc tgtggtggtg gtcctgaacc gcagcagcaa 2580
agatgtgccc ctgaccatca aggatcccgc cgtgggattc ctggaaacaa tcagccctgg 2640
ctactccatc cacacctacc tgtggcgtag acagtgacaa ttgttaatta agtttaaacc 2700
ctcgaggccg caagcttatc gataatcaac ctctggatta caaaatttgt gaaagattga 2760
ctggtattct taactatgtt gctcctttta cgctatgtgg atacgctgct ttaatgcctt 2820
tgtatcatgc tattgcttcc cgtatggctt tcattttctc ctccttgtat aaatcctggt 2880
tgctgtctct ttatgaggag ttgtggcccg ttgtcaggca acgtggcgtg gtgtgcactg 2940
tgtttgctga cgcaaccccc actggttggg gcattgccac cacctgtcag ctcctttccg 3000
ggactttcgc tttccccctc cctattgcca cggcggaact catcgccgcc tgccttgccc 3060
gctgctggac aggggctcgg ctgttgggca ctgacaattc cgtggtgttg tcggggaaat 3120
catcgtcctt tccttggctg ctcgcctgtg ttgccacctg gattctgcgc gggacgtcct 3180
tctgctacgt cccttcggcc ctcaatccag cggaccttcc ttcccgcggc ctgctgccgg 3240
ctctgcggcc tcttccgcgt cttcgccttc gccctcagac gagtcggatc tccctttggg 3300
ccgcctcccc gcatcgatac cgtcgactag agctcgctga tcagcctcga ctgtgccttc 3360
tagttgccag ccatctgttg tttgcccctc ccccgtgcct tccttgaccc tggaaggtgc 3420
cactcccact gtcctttcct aataaaatga ggaaattgca tcgcattgtc tgagtaggtg 3480
tcattctatt ctggggggtg gggtggggca ggacagcaag ggggaggatt gggaagacaa 3540
tagcaggcat gctggggaga gatccacgat aacaaacagc ttttttgggg tgaacatatt 3600
gactgaattc cctgcaggtt ggccactccc tctctgcgcg ctcgctcgct cactgaggcc 3660
gcccgggcaa agcccgggcg tcgggcgacc tttggtcgcc cggcctcagt gagcgagcga 3720
gcgcgcagag agggagtggc caactccatc actaggggtt cctgcggccg ctcgtacggt 3780
ctcgaggaat tcctgcagga taacttgcca acctcattct aaaatgtata tagaagccca 3840
aaagacaata acaaaaatat tcttgtagaa caaaatggga aagaatgttc cactaaatat 3900
caagatttag agcaaagcat gagatgtgtg gggatagaca gtgaggctga taaaatagag 3960
tagagctcag aaacagaccc attgatatat gtaagtgacc tatgaaaaaa atatggcatt 4020
ttacaatggg aaaatgatgg tctttttctt ttttagaaaa acagggaaat atatttatat 4080
gtaaaaaata aaagggaacc catatgtcat accatacaca caaaaaaatt ccagtgaatt 4140
ataagtctaa atggagaagg caaaacttta aatcttttag aaaataatat agaagcatgc 4200
agaccagcct ggccaacatg atgaaaccct ctctactaat aataaaatca gtagaactac 4260
tcaggactac tttgagtggg aagtcctttt ctatgaagac ttctttggcc aaaattaggc 4320
tctaaatgca aggagatagt gcatcatgcc tggctgcact tactgataaa tgatgttatc 4380
accatcttta accaaatgca caggaacaag ttatggtact gatgtgctgg attgagaagg 4440
agctctactt ccttgacagg acacatttgt atcaacttaa aaaagcagat ttttgccagc 4500
agaactattc attcagaggt aggaaactta gaatagatga tgtcactgat tagcatggct 4560
tccccatctc cacagctgct tcccacccag gttgcccaca gttgagtttg tccagtgctc 4620
agggctgccc actctcagta agaagcccca caccagcccc tctccaaata tgttggctgt 4680
tccttccatt aaagtgaccc cactttagag cagcaagtgg atttctgttt cttacagttc 4740
aggaaggagg agtcagctgt gagaacctgg agcctgagat gcttctaagt cccactgcta 4800
ctggggtcag ggaagccaga ctccagcatc agcagtcagg agcactaagc ccttgccaac 4860
atcctgtttc tcagagaaac tgcttccatt ataatggttg tcctttttta agctatcaag 4920
ccaaacaacc agtgtctacc attattctca tcacctgaag ccaagggttc tagcaaaagt 4980
caagctgtct tgtaatggtt gatgtgcctc cagcttctgt cttcagtcac tccactctta 5040
gcctgctctg aatcaactct gaccacagtt ccctggagcc cctgccacct gctgcccctg 5100
ccaccttctc catctgcagt gctgtgcagc cttctgcact cttgcagagc taataggtgg 5160
agacttgaag gaagaggagg aaagtttctc ataatagcct tgctgcaagc tcaaatggga 5220
ggtgggcact gtgcccagga gccttggagc aaaggctgtg cccaacctct gactgcatcc 5280
aggtttggtc ttgacagaga taagaagccc tggcttttgg agccaaaatc taggtcagac 5340
ttaggcagga ttctcaaagt ttatcagcag aacatgaggc agaagaccct ttctgctcca 5400
gcttcttcag gctcaacctt catcagaata gatagaaaga gaggctgtga gggttcttaa 5460
aacagaagca aatctgactc agagaataaa caacctccta gtaaactaca gcttagacag 5520
agcatctggt ggtgagtgtg ctcagtgtcc tactcaactg tctggtatca gccctcatga 5580
ggacttctct tctttccctc atagacctcc atctctgttt tccttagcct gcagaaatct 5640
ggatggctat tcacagaatg cctgtgcttt cagagttgca ttttttctct ggtattctgg 5700
ttcaagcatt tgaaggtagg aaaggttctc caagtgcaag aaagccagcc ctgagcctca 5760
actgcctggc tagtgtggtc agtaggatgc aaaggctgtt gaatgccaca aggccaaact 5820
ttaacctgtg taccacaagc ctagcagcag aggcagctct gctcactgga actctctgtc 5880
ttctttctcc tgagcctttt cttttcctga gttttctagc tctcctcaac cttacctctg 5940
ccctacccag gacaaaccca agagccactg tttctgtgat gtcctctcca gccctaatta 6000
ggcatcatga cttcagcctg accttccatg ctcagaagca gtgctaatcc acttcagatg 6060
agctgctcta tgcaacacag gcagagccta caaacctttg caccagagcc ctccacatat 6120
cagtgtttgt tcatactcac ttcaacagca aatgtgactg ctgagattaa gattttacac 6180
aagatggtct gtaatttcac agttagtttt atcccattag gtatgaaaga attagcataa 6240
ttccccttaa acatgaatga atcttagatt ttttaataaa tagttttgga agtaaagaca 6300
gagacatcag gagcacaagg aatagcctga gaggacaaac agaacaagaa agagtctgga 6360
aatacacagg atgttcttgg cctcctcaaa gcaagtgcaa gcagatagta ccagcagccc 6420
caggctatca gagcccagtg aagagaagta ccatgaaagc cacagctcta accaccctgt 6480
tccagagtga cagacagtcc ccaagacaag ccagcctgag ccagagagag aactgcaaga 6540
gaaagtttct aatttaggtt ctgttagatt cagacaagtg caggtcatcc tctctccaca 6600
gctactcacc tctccagcct aacaaagcct gcagtccaca ctccaaccct ggtgtctcac 6660
ctcctagcct ctcccaacat cctgctctct gaccatcttc tgcatctctc atctcaccat 6720
ctcccactgt ctacagccta ctcttgcaac taccatctca ttttctgaca tcctgtctac 6780
atcttctgcc atactctgcc atctaccata ccacctctta ccatctacca caccatcttt 6840
tatctccatc cctctcagaa gcctccaagc tgaatcctgc tttatgtgtt catctcagcc 6900
cctgcatgga aagctgaccc cagaggcaga actattccca gagagcttgg ccaagaaaaa 6960
caaaactacc agcctggcca ggctcaggag tagtaagctg cagtgtctgt tgtgttctag 7020
cttcaacagc tgcaggagtt ccactctcaa atgctccaca tttctcacat cctcctgatt 7080
ctggtcacta cccatcttca aagaacagaa tatctcacat cagcatactg tgaaggacta 7140
gtcatgggtg cagctgctca gagctgcaaa gtcattctgg atggtggaga gcttacaaac 7200
atttcatgat gctccccccg ctctgatggc tggagcccaa tccctacaca gactcctgct 7260
gtatgtgttt tcctttcact ctgagccaca gccagagggc aggcattcag tctcctcttc 7320
aggctggggc tggggcactg agaactcacc caacaccttg ctctcactcc ttctgcaaaa 7380
caagaaagag ctttgtgctg cagtagccat gaagaatgaa aggaaggctt taactaaaaa 7440
atgtcagaga ttattttcaa ccccttactg tggatcacca gcaaggagga aacacaacac 7500
agagacattt tttcccctca aattatcaaa agaatcactg catttgttaa agagagcaac 7560
tgaatcagga agcagagttt tgaacatatc agaagttagg aatctgcatc agagacaaat 7620
gcagtcatgg ttgtttgctg cataccagcc ctaatcatta gaagcctcat ggacttcaaa 7680
catcattccc tctgacaaga tgctctagcc taactccatg agataaaata aatctgcctt 7740
tcagagccaa agaagagtcc accagcttct tctcagtgtg aacaagagct ccagtcaggt 7800
tagtcagtcc agtgcagtag aggagaccag tctgcatcct ctaattttca aaggcaagaa 7860
gatttgttta ccctggacac caggcacaag tgaggtcaca gagctcttag atatgcagtc 7920
ctcatgagtg aggagactaa agcgcatgcc atcaagactt cagtgtagag aaaacctcca 7980
aaaaagcctc ctcactactt ctggaatagc tcagaggccg aggcggcctc ggcctctgca 8040
taaataaaaa aaattagtca gccatggggc ggagaatggg cggaactggg cggagttagg 8100
ggcgggatgg gcggagttag gggcgggact atggttgctg actaattgag atgcatgctt 8160
tgcatacttc tgcctgctgg ggagcctggg gactttccac acctggttgc tgactaattg 8220
agatgcatgc tttgcatact tctgcctgct ggggagcctg gggactttcc acaccctaac 8280
tgacacacat tccacagctg cattaatgaa tcggccaacg cgcggggaga ggcggtttgc 8340
gtattgggcg ctcttccgct tcctcgctca ctgactcgct gcgctcggtc gttcggctgc 8400
ggcgagcggt atcagctcac tcaaaggcgg taatacggtt atccacagaa tcaggggata 8460
acgcaggaaa gaacatgtga gcaaaaggcc agcaaaaggc caggaaccgt aaaaaggccg 8520
cgttgctggc gtttttccat aggctccgcc cccctgacga gcatcacaaa aatcgacgct 8580
caagtcagag gtggcgaaac ccgacaggac tataaagata ccaggcgttt ccccctggaa 8640
gctccctcgt gcgctctcct gttccgaccc tgccgcttac cggatacctg tccgcctttc 8700
tcccttcggg aagcgtggcg ctttctcata gctcacgctg taggtatctc agttcggtgt 8760
aggtcgttcg ctccaagctg ggctgtgtgc acgaaccccc cgttcagccc gaccgctgcg 8820
ccttatccgg taactatcgt cttgagtcca acccggtaag acacgactta tcgccactgg 8880
cagcagccac tggtaacagg attagcagag cgaggtatgt aggcggtgct acagagttct 8940
tgaagtggtg gcctaactac ggctacacta gaagaacagt atttggtatc tgcgctctgc 9000
tgaagccagt taccttcgga aaaagagttg gtagctcttg atccggcaaa caaaccaccg 9060
ctggtagcgg tggttttttt gtttgcaagc agcagattac gcgcagaaaa aaaggatctc 9120
aagaagatcc tttgatcttt tctacggggt ctgacgctca gtggaacgaa aactcacgtt 9180
aagggatttt ggtcatgaga ttatcaaaaa ggatcttcac ctagatcctt ttaaattaaa 9240
aatgaagttt taaatcaatc taaagtatat atgagtaaac ttggtctgac agttaccaat 9300
gcttaatcag tgaggcacct atctcagcga tctgtctatt tcgttcatcc atagttgcct 9360
gactcctgca aaccacgttg tgtctcaaaa tctctgatgt tacattgcac aagataaaaa 9420
tatatcatca tgaacaataa aactgtctgc ttacataaac agtaatacaa ggggtgttat 9480
gagccatatt caacgggaaa cgtcttgctc gaggccgcga ttaaattcca acatggatgc 9540
tgatttatat gggtataaat gggctcgcga taatgtcggg caatcaggtg cgacaatcta 9600
tcgattgtat gggaagcccg atgcgccaga gttgtttctg aaacatggca aaggtagcgt 9660
tgccaatgat gttacagatg agatggtcag actaaactgg ctgacggaat ttatgcctct 9720
tccgaccatc aagcatttta tccgtactcc tgatgatgca tggttactca ccactgcgat 9780
ccccgggaaa acagcattcc aggtattaga agaatatcct gattcaggtg aaaatattgt 9840
tgatgcgctg gcagtgttcc tgcgccggtt gcattcgatt cctgtttgta attgtccttt 9900
taacagcgat cgcgtatttc gtctcgctca ggcgcaatca cgaatgaata acggtttggt 9960
tgatgcgagt gattttgatg acgagcgtaa tggctggcct gttgaacaag tctggaaaga 10020
aatgcataag cttttgccat tctcaccgga ttcagtcgtc actcatggtg atttctcact 10080
tgataacctt atttttgacg aggggaaatt aataggttgt attgatgttg gacgagtcgg 10140
aatcgcagac cgataccagg atcttgccat cctatggaac tgcctcggtg agttttctcc 10200
ttcattacag aaacggcttt ttcaaaaata tggtattgat aatcctgata tgaataaatt 10260
gcagtttcat ttgatgctcg atgagttttt ctaagggcgg cctgccacca tacccacgcc 10320
gaaacaagcg ctcatgagcc cgaagtggcg agcccgatct tccccatcgg tgatgtcggc 10380
gatataggcg ccagcaaccg cacctgtggc gccggtgatg agggcgcgcc aagtcgacgt 10440
ccggcagtct tggccactcc ctctctgcgc gctcgctcgc tcactgaggc cgggcgacca 10500
aaggtcgccc gacgcccggg ctttgcccgg gcggcctcag tgagcgagcg agcgcgcaga 10560
gagggagtgg ccaactccat cactaggggt tcctgctagc tctgggtatt taagcccgag 10620
tgagcacgca gggtctccat tttgaagcgg gaggttacgc gttcgtcgac tactagtggg 10680
taccagagcg tggtgactga gatgttttct aggaaacaca aaagatacaa aaaagaacac 10740
gtggaaggat agccaaaaag gggggctgcc cccatttcct gcaccccgct gcgatggctg 10800
gcaccatttg gaagacttcg agatacactg ttgagcgcag taagacaaca gtgtatctcg 10860
aagtcttcca gatggggcca gccggtccac tctgtatcca ggccagttct gcaaggcgtt 10920
cgaggaccac ccccctcccc tcgccaccag ggtggtctca tacagaactt ataagattcc 10980
caaatccaaa gacatttcac gtttatggtg atttcccaga acacatagcg acatgcaaat 11040
attgcagggc gccactcccc tgtccctcac agccatcttc ctgccagggc gcacgcgcgc 11100
tgggtgttcc cgcctagtga cactgggccc gcgattcctt ggagcgggtt gatgacgtca 11160
gcgtttccca tggtgaatcc ctaggtt 11187
<210> 110
<211> 10996
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 110
ttggccactc cctctctgcg cgctcgctcg ctcactgagg ccgggcgacc aaaggtcgcc 60
cgacgcccgg gctttgcccg ggcggcctca gtgagcgagc gagcgcgcag agagggagtg 120
gccaactcca tcactagggg ttcctgctag ctctgggtat ttaagcccga gtgagcacgc 180
agggtctcca ttttgaagcg ggaggttacg cgttcgtcga ctactagtgg gtaccagagc 240
tccctaggtt ctagaaccgg tgacgtccta gtaggcgaag ggtatcaaga ctacgaacac 300
ccatctgtgg ctttacagta ttcgtagtct tgatacccta cgctcactcg aggtggtctc 360
atacagaact tataagattc ccaaatccaa agacatttca cgtttatggt gatttcccag 420
aacacatagc gacatgcaaa tattgcaggg cgccactccc ctgtccctca cagccatctt 480
cctgccaggg cgcacgcgcg ctgggtgttc ccgcctagtg acactgggcc cgcgattcct 540
tggagcgggt tgatgacgtc agcgtttccc atggtgaagc ttggatctga attcggtacc 600
ctagttatta atagtaatca attacggggt cattagttca tagcccatat atggagttcc 660
gcgttacata acttacggta aatggcccgc ctggctgacc gcccaacgac ccccgcccat 720
tgacgtcaat aatgacgtat gttcccatag taacgccaat agggactttc cattgacgtc 780
aatgggtgga ctatttacgg taaactgccc acttggcagt acatcaagtg tatcatatgc 840
caagtacgcc ccctattgac gtcaatgacg gtaaatggcc cgcctggcat tatgcccagt 900
acatgacctt atgggacttt cctacttggc agtacatcta cgtattagtc atcgctatta 960
ccatggtcga ggtgagcccc acgttctgct tcactctccc catctccccc ccctccccac 1020
ccccaatttt gtatttattt attttttaat tattttgtgc agcgatgggg gcgggggggg 1080
ggggggggcg cgcgccaggc ggggcggggc ggggcgaggg gcggggcggg gcgaggcgga 1140
gaggtgcggc ggcagccaat cagagcggcg cgctccgaaa gtttcctttt atggcgaggc 1200
ggcggcggcg gcggccctat aaaaagcgaa gcgcgcggcg ggcgggagtc gctgcgacgc 1260
tgccttcgcc ccgtgccccg ctccgccgcc gcctcgcgcc gcccgccccg gctctgactg 1320
accgcgttac tcccacaggt gagcgggcgg gacggccctt ctcctccggg ctgtaattag 1380
cgcttggttt aatgacggct tgttttctgt ggctgcgtga aagccttgag gggctccggg 1440
agctagagcc tctgctaacc atgttcatgc cttcttcttt ttcctacagc tcctgggcaa 1500
cgtgctggtt attgtgctgt ctcatcattt tggcaaagaa ttcctcgaag atccgaaggg 1560
aaagtcttcc acgactgtgg gatccgttcg aagatatcac cggttgagcc accatggaat 1620
tcagcagccc cagcagagag gaatgcccca agcctctgag ccgggtgtca atcatggccg 1680
gatctctgac aggactgctg ctgcttcagg ccgtgtcttg ggcttctggc gctagacctt 1740
gcatccccaa gagcttcggc tacagcagcg tcgtgtgcgt gtgcaatgcc acctactgcg 1800
acagcttcga ccctcctacc tttcctgctc tgggcacctt cagcagatac gagagcacca 1860
gatccggcag acggatggaa ctgagcatgg gacccatcca ggccaatcac acaggcactg 1920
gcctgctgct gacactgcag cctgagcaga aattccagaa agtgaaaggc ttcggcggag 1980
ccatgacaga tgccgccgct ctgaatatcc tggctctgtc tccaccagct cagaacctgc 2040
tgctcaagag ctacttcagc gaggaaggca tcggctacaa catcatcaga gtgcccatgg 2100
ccagctgcga cttcagcatc aggacctaca cctacgccga cacacccgac gatttccagc 2160
tgcacaactt cagcctgcct gaagaggaca ccaagctgaa gatccctctg atccacagag 2220
ccctgcagct ggcacaaaga cccgtgtcac tgctggcctc tccatggaca tctcccacct 2280
ggctgaaaac aaatggcgcc gtgaatggca agggcagcct gaaaggccaa cctggcgaca 2340
tctaccacca gacctgggcc agatacttcg tgaagttcct ggacgcctat gccgagcaca 2400
agctgcagtt ttgggccgtg acagccgaga acgaaccttc tgctggactg ctgagcggct 2460
acccctttca gtgcctgggc tttacacccg agcaccagcg ggactttatc gcccgtgatc 2520
tgggacccac actggccaat agcacccacc ataatgtgcg gctgctgatg ctggacgacc 2580
agagactgct tctgccccac tgggctaaag tggtgctgac agatcctgag gccgccaaat 2640
acgtgcacgg aatcgccgtg cactggtatc tggactttct ggcccctgcc aaggccacac 2700
tgggagagac acacagactg ttccccaaca ccatgctgtt cgccagcgaa gcctgtgtgg 2760
gcagcaagtt ttgggaacag agcgtgcggc tcggcagctg ggatagaggc atgcagtaca 2820
gccacagcat catcaccaac ctgctgtacc acgtcgtcgg ctggaccgac tggaatctgg 2880
ccctgaatcc tgaaggcggc cctaactggg tccgaaactt cgtggacagc cccatcatcg 2940
tggacatcac caaggacacc ttctacaagc agcccatgtt ctaccacctg ggacacttca 3000
gcaagttcat ccccgagggc tctcagcgcg ttggactggt ggcttcccag aagaacgatc 3060
tggacgccgt ggctctgatg caccctgatg gatctgctgt ggtggtggtc ctgaaccgca 3120
gcagcaaaga tgtgcccctg accatcaagg atcccgccgt gggattcctg gaaacaatca 3180
gccctggcta ctccatccac acctacctgt ggcgtagaca gtgacaattg ttaattaagt 3240
ttaaaccctc gaggccgcaa gcttatcgat aatcaacctc tggattacaa aatttgtgaa 3300
agattgactg gtattcttaa ctatgttgct ccttttacgc tatgtggata cgctgcttta 3360
atgcctttgt atcatgctat tgcttcccgt atggctttca ttttctcctc cttgtataaa 3420
tcctggttgc tgtctcttta tgaggagttg tggcccgttg tcaggcaacg tggcgtggtg 3480
tgcactgtgt ttgctgacgc aacccccact ggttggggca ttgccaccac ctgtcagctc 3540
ctttccggga ctttcgcttt ccccctccct attgccacgg cggaactcat cgccgcctgc 3600
cttgcccgct gctggacagg ggctcggctg ttgggcactg acaattccgt ggtgttgtcg 3660
gggaaatcat cgtcctttcc ttggctgctc gcctgtgttg ccacctggat tctgcgcggg 3720
acgtccttct gctacgtccc ttcggccctc aatccagcgg accttccttc ccgcggcctg 3780
ctgccggctc tgcggcctct tccgcgtctt cgccttcgcc ctcagacgag tcggatctcc 3840
ctttgggccg cctccccgca tcgataccgt cgactagagc tcgctgatca gcctcgactg 3900
tgccttctag ttgccagcca tctgttgttt gcccctcccc cgtgccttcc ttgaccctgg 3960
aaggtgccac tcccactgtc ctttcctaat aaaatgagga aattgcatcg cattgtctga 4020
gtaggtgtca ttctattctg gggggtgggg tggggcagga cagcaagggg gaggattggg 4080
aagacaatag caggcatgct ggggagagat ccacgataac aaacagcttt tttggggtga 4140
acatattgac tgaattccct gcaggttggc cactccctct ctgcgcgctc gctcgctcac 4200
tgaggccgcc cgggcaaagc ccgggcgtcg ggcgaccttt ggtcgcccgg cctcagtgag 4260
cgagcgagcg cgcagagagg gagtggccaa ctccatcact aggggttcct gcggccgctc 4320
gtacggtctc gaggaattcc tgcaggataa cttgccaacc tcattctaaa atgtatatag 4380
aagcccaaaa gacaataaca aaaatattct tgtagaacaa aatgggaaag aatgttccac 4440
taaatatcaa gatttagagc aaagcatgag atgtgtgggg atagacagtg aggctgataa 4500
aatagagtag agctcagaaa cagacccatt gatatatgta agtgacctat gaaaaaaata 4560
tggcatttta caatgggaaa atgatggtct ttttcttttt tagaaaaaca gggaaatata 4620
tttatatgta aaaaataaaa gggaacccat atgtcatacc atacacacaa aaaaattcca 4680
gtgaattata agtctaaatg gagaaggcaa aactttaaat cttttagaaa ataatataga 4740
agcatgcaga ccagcctggc caacatgatg aaaccctctc tactaataat aaaatcagta 4800
gaactactca ggactacttt gagtgggaag tccttttcta tgaagacttc tttggccaaa 4860
attaggctct aaatgcaagg agatagtgca tcatgcctgg ctgcacttac tgataaatga 4920
tgttatcacc atctttaacc aaatgcacag gaacaagtta tggtactgat gtgctggatt 4980
gagaaggagc tctacttcct tgacaggaca catttgtatc aacttaaaaa agcagatttt 5040
tgccagcaga actattcatt cagaggtagg aaacttagaa tagatgatgt cactgattag 5100
catggcttcc ccatctccac agctgcttcc cacccaggtt gcccacagtt gagtttgtcc 5160
agtgctcagg gctgcccact ctcagtaaga agccccacac cagcccctct ccaaatatgt 5220
tggctgttcc ttccattaaa gtgaccccac tttagagcag caagtggatt tctgtttctt 5280
acagttcagg aaggaggagt cagctgtgag aacctggagc ctgagatgct tctaagtccc 5340
actgctactg gggtcaggga agccagactc cagcatcagc agtcaggagc actaagccct 5400
tgccaacatc ctgtttctca gagaaactgc ttccattata atggttgtcc ttttttaagc 5460
tatcaagcca aacaaccagt gtctaccatt attctcatca cctgaagcca agggttctag 5520
caaaagtcaa gctgtcttgt aatggttgat gtgcctccag cttctgtctt cagtcactcc 5580
actcttagcc tgctctgaat caactctgac cacagttccc tggagcccct gccacctgct 5640
gcccctgcca ccttctccat ctgcagtgct gtgcagcctt ctgcactctt gcagagctaa 5700
taggtggaga cttgaaggaa gaggaggaaa gtttctcata atagccttgc tgcaagctca 5760
aatgggaggt gggcactgtg cccaggagcc ttggagcaaa ggctgtgccc aacctctgac 5820
tgcatccagg tttggtcttg acagagataa gaagccctgg cttttggagc caaaatctag 5880
gtcagactta ggcaggattc tcaaagttta tcagcagaac atgaggcaga agaccctttc 5940
tgctccagct tcttcaggct caaccttcat cagaatagat agaaagagag gctgtgaggg 6000
ttcttaaaac agaagcaaat ctgactcaga gaataaacaa cctcctagta aactacagct 6060
tagacagagc atctggtggt gagtgtgctc agtgtcctac tcaactgtct ggtatcagcc 6120
ctcatgagga cttctcttct ttccctcata gacctccatc tctgttttcc ttagcctgca 6180
gaaatctgga tggctattca cagaatgcct gtgctttcag agttgcattt tttctctggt 6240
attctggttc aagcatttga aggtaggaaa ggttctccaa gtgcaagaaa gccagccctg 6300
agcctcaact gcctggctag tgtggtcagt aggatgcaaa ggctgttgaa tgccacaagg 6360
ccaaacttta acctgtgtac cacaagccta gcagcagagg cagctctgct cactggaact 6420
ctctgtcttc tttctcctga gccttttctt ttcctgagtt ttctagctct cctcaacctt 6480
acctctgccc tacccaggac aaacccaaga gccactgttt ctgtgatgtc ctctccagcc 6540
ctaattaggc atcatgactt cagcctgacc ttccatgctc agaagcagtg ctaatccact 6600
tcagatgagc tgctctatgc aacacaggca gagcctacaa acctttgcac cagagccctc 6660
cacatatcag tgtttgttca tactcacttc aacagcaaat gtgactgctg agattaagat 6720
tttacacaag atggtctgta atttcacagt tagttttatc ccattaggta tgaaagaatt 6780
agcataattc cccttaaaca tgaatgaatc ttagattttt taataaatag ttttggaagt 6840
aaagacagag acatcaggag cacaaggaat agcctgagag gacaaacaga acaagaaaga 6900
gtctggaaat acacaggatg ttcttggcct cctcaaagca agtgcaagca gatagtacca 6960
gcagccccag gctatcagag cccagtgaag agaagtacca tgaaagccac agctctaacc 7020
accctgttcc agagtgacag acagtcccca agacaagcca gcctgagcca gagagagaac 7080
tgcaagagaa agtttctaat ttaggttctg ttagattcag acaagtgcag gtcatcctct 7140
ctccacagct actcacctct ccagcctaac aaagcctgca gtccacactc caaccctggt 7200
gtctcacctc ctagcctctc ccaacatcct gctctctgac catcttctgc atctctcatc 7260
tcaccatctc ccactgtcta cagcctactc ttgcaactac catctcattt tctgacatcc 7320
tgtctacatc ttctgccata ctctgccatc taccatacca cctcttacca tctaccacac 7380
catcttttat ctccatccct ctcagaagcc tccaagctga atcctgcttt atgtgttcat 7440
ctcagcccct gcatggaaag ctgaccccag aggcagaact attcccagag agcttggcca 7500
agaaaaacaa aactaccagc ctggccaggc tcaggagtag taagctgcag tgtctgttgt 7560
gttctagctt caacagctgc aggagttcca ctctcaaatg ctccacattt ctcacatcct 7620
cctgattctg gtcactaccc atcttcaaag aacagaatat ctcacatcag catactgtga 7680
aggactagtc atgggtgcag ctgctcagag ctgcaaagtc attctggatg gtggagagct 7740
tacaaacatt tcatgatgct ccccccgctc tgatggctgg agcccaatcc ctacacagac 7800
tcctgctgta tgtgttttcc tttcactctg agccacagcc agagggcagg cattcagtct 7860
cctcttcagg ctggggctgg ggcactgaga actcacccaa caccttgctc tcactccttc 7920
tgcaaaacaa gaaagagctt tgtgctgcag tagccatgaa gaatgaaagg aaggctttaa 7980
ctaaaaaatg tcagagatta ttttcaaccc cttactgtgg atcaccagca aggaggaaac 8040
acaacacaga gacatttttt cccctcaaat tatcaaaaga atcactgcat ttgttaaaga 8100
gagcaactga atcaggaagc agagttttga acatatcaga agttaggaat ctgcatcaga 8160
gacaaatgca gtcatggttg tttgctgcat accagcccta atcattagaa gcctcatgga 8220
cttcaaacat cattccctct gacaagatgc tctagcctaa ctccatgaga taaaataaat 8280
ctgcctttca gagccaaaga agagtccacc agcttcttct cagtgtgaac aagagctcca 8340
gtcaggttag tcagtccagt gcagtagagg agaccagtct gcatcctcta attttcaaag 8400
gcaagaagat ttgtttaccc tggacaccag gcacaagtga ggtcacagag ctcttagata 8460
tgcagtcctc atgagtgagg agactaaagc gcatgccatc aagacttcag tgtagagaaa 8520
acctccaaaa aagcctcctc actacttctg gaatagctca gaggccgagg cggcctcggc 8580
ctctgcataa ataaaaaaaa ttagtcagcc atggggcgga gaatgggcgg aactgggcgg 8640
agttaggggc gggatgggcg gagttagggg cgggactatg gttgctgact aattgagatg 8700
catgctttgc atacttctgc ctgctgggga gcctggggac tttccacacc tggttgctga 8760
ctaattgaga tgcatgcttt gcatacttct gcctgctggg gagcctgggg actttccaca 8820
ccctaactga cacacattcc acagctgcat taatgaatcg gccaacgcgc ggggagaggc 8880
ggtttgcgta ttgggcgctc ttccgcttcc tcgctcactg actcgctgcg ctcggtcgtt 8940
cggctgcggc gagcggtatc agctcactca aaggcggtaa tacggttatc cacagaatca 9000
ggggataacg caggaaagaa catgtgagca aaaggccagc aaaaggccag gaaccgtaaa 9060
aaggccgcgt tgctggcgtt tttccatagg ctccgccccc ctgacgagca tcacaaaaat 9120
cgacgctcaa gtcagaggtg gcgaaacccg acaggactat aaagatacca ggcgtttccc 9180
cctggaagct ccctcgtgcg ctctcctgtt ccgaccctgc cgcttaccgg atacctgtcc 9240
gcctttctcc cttcgggaag cgtggcgctt tctcatagct cacgctgtag gtatctcagt 9300
tcggtgtagg tcgttcgctc caagctgggc tgtgtgcacg aaccccccgt tcagcccgac 9360
cgctgcgcct tatccggtaa ctatcgtctt gagtccaacc cggtaagaca cgacttatcg 9420
ccactggcag cagccactgg taacaggatt agcagagcga ggtatgtagg cggtgctaca 9480
gagttcttga agtggtggcc taactacggc tacactagaa gaacagtatt tggtatctgc 9540
gctctgctga agccagttac cttcggaaaa agagttggta gctcttgatc cggcaaacaa 9600
accaccgctg gtagcggtgg tttttttgtt tgcaagcagc agattacgcg cagaaaaaaa 9660
ggatctcaag aagatccttt gatcttttct acggggtctg acgctcagtg gaacgaaaac 9720
tcacgttaag ggattttggt catgagatta tcaaaaagga tcttcaccta gatcctttta 9780
aattaaaaat gaagttttaa atcaatctaa agtatatatg agtaaacttg gtctgacagt 9840
taccaatgct taatcagtga ggcacctatc tcagcgatct gtctatttcg ttcatccata 9900
gttgcctgac tcctgcaaac cacgttgtgt ctcaaaatct ctgatgttac attgcacaag 9960
ataaaaatat atcatcatga acaataaaac tgtctgctta cataaacagt aatacaaggg 10020
gtgttatgag ccatattcaa cgggaaacgt cttgctcgag gccgcgatta aattccaaca 10080
tggatgctga tttatatggg tataaatggg ctcgcgataa tgtcgggcaa tcaggtgcga 10140
caatctatcg attgtatggg aagcccgatg cgccagagtt gtttctgaaa catggcaaag 10200
gtagcgttgc caatgatgtt acagatgaga tggtcagact aaactggctg acggaattta 10260
tgcctcttcc gaccatcaag cattttatcc gtactcctga tgatgcatgg ttactcacca 10320
ctgcgatccc cgggaaaaca gcattccagg tattagaaga atatcctgat tcaggtgaaa 10380
atattgttga tgcgctggca gtgttcctgc gccggttgca ttcgattcct gtttgtaatt 10440
gtccttttaa cagcgatcgc gtatttcgtc tcgctcaggc gcaatcacga atgaataacg 10500
gtttggttga tgcgagtgat tttgatgacg agcgtaatgg ctggcctgtt gaacaagtct 10560
ggaaagaaat gcataagctt ttgccattct caccggattc agtcgtcact catggtgatt 10620
tctcacttga taaccttatt tttgacgagg ggaaattaat aggttgtatt gatgttggac 10680
gagtcggaat cgcagaccga taccaggatc ttgccatcct atggaactgc ctcggtgagt 10740
tttctccttc attacagaaa cggctttttc aaaaatatgg tattgataat cctgatatga 10800
ataaattgca gtttcatttg atgctcgatg agtttttcta agggcggcct gccaccatac 10860
ccacgccgaa acaagcgctc atgagcccga agtggcgagc ccgatcttcc ccatcggtga 10920
tgtcggcgat ataggcgcca gcaaccgcac ctgtggcgcc ggtgatgagg gcgcgccaag 10980
tcgacgtccg gcagtc 10996
<210> 111
<211> 10845
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 111
ttggccactc cctctctgcg cgctcgctcg ctcactgagg ccgggcgacc aaaggtcgcc 60
cgacgcccgg gctttgcccg ggcggcctca gtgagcgagc gagcgcgcag agagggagtg 120
gccaactcca tcactagggg ttcctgctag ctctgggtat ttaagcccga gtgagcacgc 180
agggtctcca ttttgaagcg ggaggttacg cgttcgtcga ctactagtgg gtaccagagc 240
tccctaggtt ctagaaccgg tgacgtctcc catggtgaag cttggatctg aattcggtac 300
cctagttatt aatagtaatc aattacgggg tcattagttc atagcccata tatggagttc 360
cgcgttacat aacttacggt aaatggcccg cctggctgac cgcccaacga cccccgccca 420
ttgacgtcaa taatgacgta tgttcccata gtaacgccaa tagggacttt ccattgacgt 480
caatgggtgg actatttacg gtaaactgcc cacttggcag tacatcaagt gtatcatatg 540
ccaagtacgc cccctattga cgtcaatgac ggtaaatggc ccgcctggca ttatgcccag 600
tacatgacct tatgggactt tcctacttgg cagtacatct acgtattagt catcgctatt 660
accatggtcg aggtgagccc cacgttctgc ttcactctcc ccatctcccc cccctcccca 720
cccccaattt tgtatttatt tattttttaa ttattttgtg cagcgatggg ggcggggggg 780
gggggggggc gcgcgccagg cggggcgggg cggggcgagg ggcggggcgg ggcgaggcgg 840
agaggtgcgg cggcagccaa tcagagcggc gcgctccgaa agtttccttt tatggcgagg 900
cggcggcggc ggcggcccta taaaaagcga agcgcgcggc gggcgggagt cgctgcgacg 960
ctgccttcgc cccgtgcccc gctccgccgc cgcctcgcgc cgcccgcccc ggctctgact 1020
gaccgcgtta ctcccacagg tgagcgggcg ggacggccct tctcctccgg gctgtaatta 1080
gcgcttggtt taatgacggc ttgtctggag gcttgctttg ggctgtatgc tgtttagaaa 1140
taagtggtag tcattttggc ctctgactga tgactacact atttctaaac aggacacaag 1200
gccctttatc agcactcaca tggaacaaat ggccaccgtg ggaggatgac aatttctgtg 1260
gctgcgtgaa agccttgagg ggctccggga gctagagcct ctgctaacca tgttcatgcc 1320
ttcttctttt tcctacagct cctgggcaac gtgctggtta ttgtgctgtc tcatcatttt 1380
ggcaaagaat tcctcgaaga tccgaaggga aagtcttcca cgactgtggg atccgttcga 1440
agatatcacc ggttgagcca ccatggaatt cagcagcccc agcagagagg aatgccccaa 1500
gcctctgagc cgggtgtcaa tcatggccgg atctctgaca ggactgctgc tgcttcaggc 1560
cgtgtcttgg gcttctggcg ctagaccttg catccccaag agcttcggct acagcagcgt 1620
cgtgtgcgtg tgcaatgcca cctactgcga cagcttcgac cctcctacct ttcctgctct 1680
gggcaccttc agcagatacg agagcaccag atccggcaga cggatggaac tgagcatggg 1740
acccatccag gccaatcaca caggcactgg cctgctgctg acactgcagc ctgagcagaa 1800
attccagaaa gtgaaaggct tcggcggagc catgacagat gccgccgctc tgaatatcct 1860
ggctctgtct ccaccagctc agaacctgct gctcaagagc tacttcagcg aggaaggcat 1920
cggctacaac atcatcagag tgcccatggc cagctgcgac ttcagcatca ggacctacac 1980
ctacgccgac acacccgacg atttccagct gcacaacttc agcctgcctg aagaggacac 2040
caagctgaag atccctctga tccacagagc cctgcagctg gcacaaagac ccgtgtcact 2100
gctggcctct ccatggacat ctcccacctg gctgaaaaca aatggcgccg tgaatggcaa 2160
gggcagcctg aaaggccaac ctggcgacat ctaccaccag acctgggcca gatacttcgt 2220
gaagttcctg gacgcctatg ccgagcacaa gctgcagttt tgggccgtga cagccgagaa 2280
cgaaccttct gctggactgc tgagcggcta cccctttcag tgcctgggct ttacacccga 2340
gcaccagcgg gactttatcg cccgtgatct gggacccaca ctggccaata gcacccacca 2400
taatgtgcgg ctgctgatgc tggacgacca gagactgctt ctgccccact gggctaaagt 2460
ggtgctgaca gatcctgagg ccgccaaata cgtgcacgga atcgccgtgc actggtatct 2520
ggactttctg gcccctgcca aggccacact gggagagaca cacagactgt tccccaacac 2580
catgctgttc gccagcgaag cctgtgtggg cagcaagttt tgggaacaga gcgtgcggct 2640
cggcagctgg gatagaggca tgcagtacag ccacagcatc atcaccaacc tgctgtacca 2700
cgtcgtcggc tggaccgact ggaatctggc cctgaatcct gaaggcggcc ctaactgggt 2760
ccgaaacttc gtggacagcc ccatcatcgt ggacatcacc aaggacacct tctacaagca 2820
gcccatgttc taccacctgg gacacttcag caagttcatc cccgagggct ctcagcgcgt 2880
tggactggtg gcttcccaga agaacgatct ggacgccgtg gctctgatgc accctgatgg 2940
atctgctgtg gtggtggtcc tgaaccgcag cagcaaagat gtgcccctga ccatcaagga 3000
tcccgccgtg ggattcctgg aaacaatcag ccctggctac tccatccaca cctacctgtg 3060
gcgtagacag tgacaattgt taattaagtt taaaccctcg aggccgcaag cttatcgata 3120
atcaacctct ggattacaaa atttgtgaaa gattgactgg tattcttaac tatgttgctc 3180
cttttacgct atgtggatac gctgctttaa tgcctttgta tcatgctatt gcttcccgta 3240
tggctttcat tttctcctcc ttgtataaat cctggttgct gtctctttat gaggagttgt 3300
ggcccgttgt caggcaacgt ggcgtggtgt gcactgtgtt tgctgacgca acccccactg 3360
gttggggcat tgccaccacc tgtcagctcc tttccgggac tttcgctttc cccctcccta 3420
ttgccacggc ggaactcatc gccgcctgcc ttgcccgctg ctggacaggg gctcggctgt 3480
tgggcactga caattccgtg gtgttgtcgg ggaaatcatc gtcctttcct tggctgctcg 3540
cctgtgttgc cacctggatt ctgcgcggga cgtccttctg ctacgtccct tcggccctca 3600
atccagcgga ccttccttcc cgcggcctgc tgccggctct gcggcctctt ccgcgtcttc 3660
gccttcgccc tcagacgagt cggatctccc tttgggccgc ctccccgcat cgataccgtc 3720
gactagagct cgctgatcag cctcgactgt gccttctagt tgccagccat ctgttgtttg 3780
cccctccccc gtgccttcct tgaccctgga aggtgccact cccactgtcc tttcctaata 3840
aaatgaggaa attgcatcgc attgtctgag taggtgtcat tctattctgg ggggtggggt 3900
ggggcaggac agcaaggggg aggattggga agacaatagc aggcatgctg gggagagatc 3960
cacgataaca aacagctttt ttggggtgaa catattgact gaattccctg caggttggcc 4020
actccctctc tgcgcgctcg ctcgctcact gaggccgccc gggcaaagcc cgggcgtcgg 4080
gcgacctttg gtcgcccggc ctcagtgagc gagcgagcgc gcagagaggg agtggccaac 4140
tccatcacta ggggttcctg cggccgctcg tacggtctcg aggaattcct gcaggataac 4200
ttgccaacct cattctaaaa tgtatataga agcccaaaag acaataacaa aaatattctt 4260
gtagaacaaa atgggaaaga atgttccact aaatatcaag atttagagca aagcatgaga 4320
tgtgtgggga tagacagtga ggctgataaa atagagtaga gctcagaaac agacccattg 4380
atatatgtaa gtgacctatg aaaaaaatat ggcattttac aatgggaaaa tgatggtctt 4440
tttctttttt agaaaaacag ggaaatatat ttatatgtaa aaaataaaag ggaacccata 4500
tgtcatacca tacacacaaa aaaattccag tgaattataa gtctaaatgg agaaggcaaa 4560
actttaaatc ttttagaaaa taatatagaa gcatgcagac cagcctggcc aacatgatga 4620
aaccctctct actaataata aaatcagtag aactactcag gactactttg agtgggaagt 4680
ccttttctat gaagacttct ttggccaaaa ttaggctcta aatgcaagga gatagtgcat 4740
catgcctggc tgcacttact gataaatgat gttatcacca tctttaacca aatgcacagg 4800
aacaagttat ggtactgatg tgctggattg agaaggagct ctacttcctt gacaggacac 4860
atttgtatca acttaaaaaa gcagattttt gccagcagaa ctattcattc agaggtagga 4920
aacttagaat agatgatgtc actgattagc atggcttccc catctccaca gctgcttccc 4980
acccaggttg cccacagttg agtttgtcca gtgctcaggg ctgcccactc tcagtaagaa 5040
gccccacacc agcccctctc caaatatgtt ggctgttcct tccattaaag tgaccccact 5100
ttagagcagc aagtggattt ctgtttctta cagttcagga aggaggagtc agctgtgaga 5160
acctggagcc tgagatgctt ctaagtccca ctgctactgg ggtcagggaa gccagactcc 5220
agcatcagca gtcaggagca ctaagccctt gccaacatcc tgtttctcag agaaactgct 5280
tccattataa tggttgtcct tttttaagct atcaagccaa acaaccagtg tctaccatta 5340
ttctcatcac ctgaagccaa gggttctagc aaaagtcaag ctgtcttgta atggttgatg 5400
tgcctccagc ttctgtcttc agtcactcca ctcttagcct gctctgaatc aactctgacc 5460
acagttccct ggagcccctg ccacctgctg cccctgccac cttctccatc tgcagtgctg 5520
tgcagccttc tgcactcttg cagagctaat aggtggagac ttgaaggaag aggaggaaag 5580
tttctcataa tagccttgct gcaagctcaa atgggaggtg ggcactgtgc ccaggagcct 5640
tggagcaaag gctgtgccca acctctgact gcatccaggt ttggtcttga cagagataag 5700
aagccctggc ttttggagcc aaaatctagg tcagacttag gcaggattct caaagtttat 5760
cagcagaaca tgaggcagaa gaccctttct gctccagctt cttcaggctc aaccttcatc 5820
agaatagata gaaagagagg ctgtgagggt tcttaaaaca gaagcaaatc tgactcagag 5880
aataaacaac ctcctagtaa actacagctt agacagagca tctggtggtg agtgtgctca 5940
gtgtcctact caactgtctg gtatcagccc tcatgaggac ttctcttctt tccctcatag 6000
acctccatct ctgttttcct tagcctgcag aaatctggat ggctattcac agaatgcctg 6060
tgctttcaga gttgcatttt ttctctggta ttctggttca agcatttgaa ggtaggaaag 6120
gttctccaag tgcaagaaag ccagccctga gcctcaactg cctggctagt gtggtcagta 6180
ggatgcaaag gctgttgaat gccacaaggc caaactttaa cctgtgtacc acaagcctag 6240
cagcagaggc agctctgctc actggaactc tctgtcttct ttctcctgag ccttttcttt 6300
tcctgagttt tctagctctc ctcaacctta cctctgccct acccaggaca aacccaagag 6360
ccactgtttc tgtgatgtcc tctccagccc taattaggca tcatgacttc agcctgacct 6420
tccatgctca gaagcagtgc taatccactt cagatgagct gctctatgca acacaggcag 6480
agcctacaaa cctttgcacc agagccctcc acatatcagt gtttgttcat actcacttca 6540
acagcaaatg tgactgctga gattaagatt ttacacaaga tggtctgtaa tttcacagtt 6600
agttttatcc cattaggtat gaaagaatta gcataattcc ccttaaacat gaatgaatct 6660
tagatttttt aataaatagt tttggaagta aagacagaga catcaggagc acaaggaata 6720
gcctgagagg acaaacagaa caagaaagag tctggaaata cacaggatgt tcttggcctc 6780
ctcaaagcaa gtgcaagcag atagtaccag cagccccagg ctatcagagc ccagtgaaga 6840
gaagtaccat gaaagccaca gctctaacca ccctgttcca gagtgacaga cagtccccaa 6900
gacaagccag cctgagccag agagagaact gcaagagaaa gtttctaatt taggttctgt 6960
tagattcaga caagtgcagg tcatcctctc tccacagcta ctcacctctc cagcctaaca 7020
aagcctgcag tccacactcc aaccctggtg tctcacctcc tagcctctcc caacatcctg 7080
ctctctgacc atcttctgca tctctcatct caccatctcc cactgtctac agcctactct 7140
tgcaactacc atctcatttt ctgacatcct gtctacatct tctgccatac tctgccatct 7200
accataccac ctcttaccat ctaccacacc atcttttatc tccatccctc tcagaagcct 7260
ccaagctgaa tcctgcttta tgtgttcatc tcagcccctg catggaaagc tgaccccaga 7320
ggcagaacta ttcccagaga gcttggccaa gaaaaacaaa actaccagcc tggccaggct 7380
caggagtagt aagctgcagt gtctgttgtg ttctagcttc aacagctgca ggagttccac 7440
tctcaaatgc tccacatttc tcacatcctc ctgattctgg tcactaccca tcttcaaaga 7500
acagaatatc tcacatcagc atactgtgaa ggactagtca tgggtgcagc tgctcagagc 7560
tgcaaagtca ttctggatgg tggagagctt acaaacattt catgatgctc cccccgctct 7620
gatggctgga gcccaatccc tacacagact cctgctgtat gtgttttcct ttcactctga 7680
gccacagcca gagggcaggc attcagtctc ctcttcaggc tggggctggg gcactgagaa 7740
ctcacccaac accttgctct cactccttct gcaaaacaag aaagagcttt gtgctgcagt 7800
agccatgaag aatgaaagga aggctttaac taaaaaatgt cagagattat tttcaacccc 7860
ttactgtgga tcaccagcaa ggaggaaaca caacacagag acattttttc ccctcaaatt 7920
atcaaaagaa tcactgcatt tgttaaagag agcaactgaa tcaggaagca gagttttgaa 7980
catatcagaa gttaggaatc tgcatcagag acaaatgcag tcatggttgt ttgctgcata 8040
ccagccctaa tcattagaag cctcatggac ttcaaacatc attccctctg acaagatgct 8100
ctagcctaac tccatgagat aaaataaatc tgcctttcag agccaaagaa gagtccacca 8160
gcttcttctc agtgtgaaca agagctccag tcaggttagt cagtccagtg cagtagagga 8220
gaccagtctg catcctctaa ttttcaaagg caagaagatt tgtttaccct ggacaccagg 8280
cacaagtgag gtcacagagc tcttagatat gcagtcctca tgagtgagga gactaaagcg 8340
catgccatca agacttcagt gtagagaaaa cctccaaaaa agcctcctca ctacttctgg 8400
aatagctcag aggccgaggc ggcctcggcc tctgcataaa taaaaaaaat tagtcagcca 8460
tggggcggag aatgggcgga actgggcgga gttaggggcg ggatgggcgg agttaggggc 8520
gggactatgg ttgctgacta attgagatgc atgctttgca tacttctgcc tgctggggag 8580
cctggggact ttccacacct ggttgctgac taattgagat gcatgctttg catacttctg 8640
cctgctgggg agcctgggga ctttccacac cctaactgac acacattcca cagctgcatt 8700
aatgaatcgg ccaacgcgcg gggagaggcg gtttgcgtat tgggcgctct tccgcttcct 8760
cgctcactga ctcgctgcgc tcggtcgttc ggctgcggcg agcggtatca gctcactcaa 8820
aggcggtaat acggttatcc acagaatcag gggataacgc aggaaagaac atgtgagcaa 8880
aaggccagca aaaggccagg aaccgtaaaa aggccgcgtt gctggcgttt ttccataggc 8940
tccgcccccc tgacgagcat cacaaaaatc gacgctcaag tcagaggtgg cgaaacccga 9000
caggactata aagataccag gcgtttcccc ctggaagctc cctcgtgcgc tctcctgttc 9060
cgaccctgcc gcttaccgga tacctgtccg cctttctccc ttcgggaagc gtggcgcttt 9120
ctcatagctc acgctgtagg tatctcagtt cggtgtaggt cgttcgctcc aagctgggct 9180
gtgtgcacga accccccgtt cagcccgacc gctgcgcctt atccggtaac tatcgtcttg 9240
agtccaaccc ggtaagacac gacttatcgc cactggcagc agccactggt aacaggatta 9300
gcagagcgag gtatgtaggc ggtgctacag agttcttgaa gtggtggcct aactacggct 9360
acactagaag aacagtattt ggtatctgcg ctctgctgaa gccagttacc ttcggaaaaa 9420
gagttggtag ctcttgatcc ggcaaacaaa ccaccgctgg tagcggtggt ttttttgttt 9480
gcaagcagca gattacgcgc agaaaaaaag gatctcaaga agatcctttg atcttttcta 9540
cggggtctga cgctcagtgg aacgaaaact cacgttaagg gattttggtc atgagattat 9600
caaaaaggat cttcacctag atccttttaa attaaaaatg aagttttaaa tcaatctaaa 9660
gtatatatga gtaaacttgg tctgacagtt accaatgctt aatcagtgag gcacctatct 9720
cagcgatctg tctatttcgt tcatccatag ttgcctgact cctgcaaacc acgttgtgtc 9780
tcaaaatctc tgatgttaca ttgcacaaga taaaaatata tcatcatgaa caataaaact 9840
gtctgcttac ataaacagta atacaagggg tgttatgagc catattcaac gggaaacgtc 9900
ttgctcgagg ccgcgattaa attccaacat ggatgctgat ttatatgggt ataaatgggc 9960
tcgcgataat gtcgggcaat caggtgcgac aatctatcga ttgtatggga agcccgatgc 10020
gccagagttg tttctgaaac atggcaaagg tagcgttgcc aatgatgtta cagatgagat 10080
ggtcagacta aactggctga cggaatttat gcctcttccg accatcaagc attttatccg 10140
tactcctgat gatgcatggt tactcaccac tgcgatcccc gggaaaacag cattccaggt 10200
attagaagaa tatcctgatt caggtgaaaa tattgttgat gcgctggcag tgttcctgcg 10260
ccggttgcat tcgattcctg tttgtaattg tccttttaac agcgatcgcg tatttcgtct 10320
cgctcaggcg caatcacgaa tgaataacgg tttggttgat gcgagtgatt ttgatgacga 10380
gcgtaatggc tggcctgttg aacaagtctg gaaagaaatg cataagcttt tgccattctc 10440
accggattca gtcgtcactc atggtgattt ctcacttgat aaccttattt ttgacgaggg 10500
gaaattaata ggttgtattg atgttggacg agtcggaatc gcagaccgat accaggatct 10560
tgccatccta tggaactgcc tcggtgagtt ttctccttca ttacagaaac ggctttttca 10620
aaaatatggt attgataatc ctgatatgaa taaattgcag tttcatttga tgctcgatga 10680
gtttttctaa gggcggcctg ccaccatacc cacgccgaaa caagcgctca tgagcccgaa 10740
gtggcgagcc cgatcttccc catcggtgat gtcggcgata taggcgccag caaccgcacc 10800
tgtggcgccg gtgatgaggg cgcgccaagt cgacgtccgg cagtc 10845
<210> 112
<211> 11320
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 112
ttggccactc cctctctgcg cgctcgctcg ctcactgagg ccgggcgacc aaaggtcgcc 60
cgacgcccgg gctttgcccg ggcggcctca gtgagcgagc gagcgcgcag agagggagtg 120
gccaactcca tcactagggg ttcctgctag ctctgggtat ttaagcccga gtgagcacgc 180
agggtctcca ttttgaagcg ggaggttacg cgttcgtcga ctactagtgg gtaccagagc 240
tccctaggtt ctagaaccgg tgacgtctcc catggtgaag cttggatctg agggcggagt 300
tagggcggag ccaatcagcg tgcgccgttc cgaaagttgc cttttatggc tgggcggaga 360
atgggcggtg aacgccgatg attatataag gacgcgccgg gtgtggcaca gctagttccg 420
tcgcagccgg gatttgggtc gcggttcttg tttgtggatc cctgtgatcg tcacttggta 480
agtcactgac tgtctatgcc tgggaaaggg tgggcaggag atggggcagt gcaggaaaag 540
tggcactatg aaccctgtga tatcacaagg tcccagggct ggggtcagaa attctctccc 600
gagggaatga agccacagga gccaagagca ggaggaccaa ggccctggcg aaggccgtgg 660
cctcgttcaa gtaaaagatc ctagtacagt gcaggtccca atgtgtacta ggatctttta 720
cttgaacggg gacgccggca tccgggctca ggacccccct ctctgccaga ggcaccaaca 780
ccagagttca caaatcagtc tcctgccctt tgcatgtagc aaagcagccc taggaatgca 840
tctagacaat tgtactaacc ttcttctctt tcctctcctg acagtccgga aagccaccat 900
ggaattcagc agccccagca gagaggaatg ccccaagcct ctgagccggg tgtcaatcat 960
ggccggatct ctgacaggac tgctgctgct tcaggccgtg tcttgggctt ctggcgctag 1020
accttgcatc cccaagagct tcggctacag cagcgtcgtg tgcgtgtgca atgccaccta 1080
ctgcgacagc ttcgaccctc ctacctttcc tgctctgggc accttcagca gatacgagag 1140
caccagatcc ggcagacgga tggaactgag catgggaccc atccaggcca atcacacagg 1200
cactggcctg ctgctgacac tgcagcctga gcagaaattc cagaaagtga aaggcttcgg 1260
cggagccatg acagatgccg ccgctctgaa tatcctggct ctgtctccac cagctcagaa 1320
cctgctgctc aagagctact tcagcgagga aggcatcggc tacaacatca tcagagtgcc 1380
catggccagc tgcgacttca gcatcaggac ctacacctac gccgacacac ccgacgattt 1440
ccagctgcac aacttcagcc tgcctgaaga ggacaccaag ctgaagatcc ctctgatcca 1500
cagagccctg cagctggcac aaagacccgt gtcactgctg gcctctccat ggacatctcc 1560
cacctggctg aaaacaaatg gcgccgtgaa tggcaagggc agcctgaaag gccaacctgg 1620
cgacatctac caccagacct gggccagata cttcgtgaag ttcctggacg cctatgccga 1680
gcacaagctg cagttttggg ccgtgacagc cgagaacgaa ccttctgctg gactgctgag 1740
cggctacccc tttcagtgcc tgggctttac acccgagcac cagcgggact ttatcgcccg 1800
tgatctggga cccacactgg ccaatagcac ccaccataat gtgcggctgc tgatgctgga 1860
cgaccagaga ctgcttctgc cccactgggc taaagtggtg ctgacagatc ctgaggccgc 1920
caaatacgtg cacggaatcg ccgtgcactg gtatctggac tttctggccc ctgccaaggc 1980
cacactggga gagacacaca gactgttccc caacaccatg ctgttcgcca gcgaagcctg 2040
tgtgggcagc aagttttggg aacagagcgt gcggctcggc agctgggata gaggcatgca 2100
gtacagccac agcatcatca ccaacctgct gtaccacgtc gtcggctgga ccgactggaa 2160
tctggccctg aatcctgaag gcggccctaa ctgggtccga aacttcgtgg acagccccat 2220
catcgtggac atcaccaagg acaccttcta caagcagccc atgttctacc acctgggaca 2280
cttcagcaag ttcatccccg agggctctca gcgcgttgga ctggtggctt cccagaagaa 2340
cgatctggac gccgtggctc tgatgcaccc tgatggatct gctgtggtgg tggtcctgaa 2400
ccgcagcagc aaagatgtgc ccctgaccat caaggatccc gccgtgggat tcctggaaac 2460
aatcagccct ggctactcca tccacaccta cctgtggcgt agacaggagg gcagaggaag 2520
tcttctgaca tgcggagacg tggaagagaa tcccggccct atgtggaccc tggtgagctg 2580
ggtggccctg accgccggcc tggtggccgg cacccgctgc cccgacggcc agttctgccc 2640
cgtggcctgc tgcctggacc ccggcggcgc cagctacagc tgctgccgcc ccctgctgga 2700
caagtggccc accaccctga gccgccacct gggcggcccc tgccaggtgg acgcccactg 2760
cagcgccggc cacagctgca tcttcaccgt gagcggcacc agcagctgct gccccttccc 2820
cgaggccgtg gcctgcggcg acggccacca ctgctgcccc cgcggcttcc actgcagcgc 2880
cgacggccgc agctgcttcc agcgcagcgg caacaacagc gtgggcgcca tccagtgccc 2940
cgacagccag ttcgagtgcc ccgacttcag cacctgctgc gtgatggtgg acggcagctg 3000
gggctgctgc cccatgcccc aggccagctg ctgcgaggac cgcgtgcact gctgccccca 3060
cggcgccttc tgcgacctgg tgcacacccg ctgcatcacc cccaccggca cccaccccct 3120
ggccaagaag ctgcccgccc agcgcaccaa ccgcgccgtg gccctgagca gcagcgtgat 3180
gtgccccgac gcccgcagcc gctgccccga cggcagcacc tgctgcgagc tgcccagcgg 3240
caagtacggc tgctgcccca tgcccaacgc cacctgctgc agcgaccacc tgcactgctg 3300
cccccaggac accgtgtgcg acctgatcca gagcaagtgc ctgagcaagg agaacgccac 3360
caccgacctg ctgaccaagc tgcccgccca caccgtgggc gacgtgaagt gcgacatgga 3420
ggtgagctgc cccgacggct acacctgctg ccgcctgcag agcggcgcct ggggctgctg 3480
ccccttcacc caggccgtgt gctgcgagga ccacatccac tgctgccccg ccggcttcac 3540
ctgcgacacc cagaagggca cctgcgagca gggcccccac caggtgccct ggatggagaa 3600
ggcccccgcc cacctgagcc tgcccgaccc ccaggccctg aagcgcgacg tgccctgcga 3660
caacgtgagc agctgcccca gcagcgacac ctgctgccag ctgaccagcg gcgagtgggg 3720
ctgctgcccc atccccgagg ccgtgtgctg cagcgaccac cagcactgct gcccccaggg 3780
ctacacctgc gtggccgagg gccagtgcca gcgcggcagc gagatcgtgg ccggcctgga 3840
gaagatgccc gcccgccgcg ccagcctgag ccacccccgc gacatcggct gcgaccagca 3900
caccagctgc cccgtgggcc agacctgctg ccccagcctg ggcggcagct gggcctgctg 3960
ccagctgccc cacgccgtgt gctgcgagga ccgccagcac tgctgccccg ccggctacac 4020
ctgcaacgtg aaggcccgca gctgcgagaa ggaggtggtg agcgcccagc ccgccacctt 4080
cctggcccgc agcccccacg tgggcgtgaa ggacgtggag tgcggcgagg gccacttctg 4140
ccacgacaac cagacctgct gccgcgacaa ccgccagggc tgggcctgct gcccctaccg 4200
ccagggcgtg tgctgcgccg accgccgcca ctgctgcccc gccggcttcc gctgcgccgc 4260
ccgcggcacc aagtgcctgc gccgcgaggc cccccgctgg gacgcccccc tgcgcgaccc 4320
cgccctgcgc cagctgctgt gacaattgtt aattaagttt aaaccctcga ggccgcaagc 4380
aataaaatat ctttattttc attacatctg tgtgttggtt ttttgtgtgg agatccacga 4440
taacaaacag cttttttggg gtgaacatat tgactgaatt ccctgcaggt tggccactcc 4500
ctctctgcgc gctcgctcgc tcactgaggc cgcccgggca aagcccgggc gtcgggcgac 4560
ctttggtcgc ccggcctcag tgagcgagcg agcgcgcaga gagggagtgg ccaactccat 4620
cactaggggt tcctgcggcc gctcgtacgg tctcgaggaa ttcctgcagg ataacttgcc 4680
aacctcattc taaaatgtat atagaagccc aaaagacaat aacaaaaata ttcttgtaga 4740
acaaaatggg aaagaatgtt ccactaaata tcaagattta gagcaaagca tgagatgtgt 4800
ggggatagac agtgaggctg ataaaataga gtagagctca gaaacagacc cattgatata 4860
tgtaagtgac ctatgaaaaa aatatggcat tttacaatgg gaaaatgatg gtctttttct 4920
tttttagaaa aacagggaaa tatatttata tgtaaaaaat aaaagggaac ccatatgtca 4980
taccatacac acaaaaaaat tccagtgaat tataagtcta aatggagaag gcaaaacttt 5040
aaatctttta gaaaataata tagaagcatg cagaccagcc tggccaacat gatgaaaccc 5100
tctctactaa taataaaatc agtagaacta ctcaggacta ctttgagtgg gaagtccttt 5160
tctatgaaga cttctttggc caaaattagg ctctaaatgc aaggagatag tgcatcatgc 5220
ctggctgcac ttactgataa atgatgttat caccatcttt aaccaaatgc acaggaacaa 5280
gttatggtac tgatgtgctg gattgagaag gagctctact tccttgacag gacacatttg 5340
tatcaactta aaaaagcaga tttttgccag cagaactatt cattcagagg taggaaactt 5400
agaatagatg atgtcactga ttagcatggc ttccccatct ccacagctgc ttcccaccca 5460
ggttgcccac agttgagttt gtccagtgct cagggctgcc cactctcagt aagaagcccc 5520
acaccagccc ctctccaaat atgttggctg ttccttccat taaagtgacc ccactttaga 5580
gcagcaagtg gatttctgtt tcttacagtt caggaaggag gagtcagctg tgagaacctg 5640
gagcctgaga tgcttctaag tcccactgct actggggtca gggaagccag actccagcat 5700
cagcagtcag gagcactaag cccttgccaa catcctgttt ctcagagaaa ctgcttccat 5760
tataatggtt gtcctttttt aagctatcaa gccaaacaac cagtgtctac cattattctc 5820
atcacctgaa gccaagggtt ctagcaaaag tcaagctgtc ttgtaatggt tgatgtgcct 5880
ccagcttctg tcttcagtca ctccactctt agcctgctct gaatcaactc tgaccacagt 5940
tccctggagc ccctgccacc tgctgcccct gccaccttct ccatctgcag tgctgtgcag 6000
ccttctgcac tcttgcagag ctaataggtg gagacttgaa ggaagaggag gaaagtttct 6060
cataatagcc ttgctgcaag ctcaaatggg aggtgggcac tgtgcccagg agccttggag 6120
caaaggctgt gcccaacctc tgactgcatc caggtttggt cttgacagag ataagaagcc 6180
ctggcttttg gagccaaaat ctaggtcaga cttaggcagg attctcaaag tttatcagca 6240
gaacatgagg cagaagaccc tttctgctcc agcttcttca ggctcaacct tcatcagaat 6300
agatagaaag agaggctgtg agggttctta aaacagaagc aaatctgact cagagaataa 6360
acaacctcct agtaaactac agcttagaca gagcatctgg tggtgagtgt gctcagtgtc 6420
ctactcaact gtctggtatc agccctcatg aggacttctc ttctttccct catagacctc 6480
catctctgtt ttccttagcc tgcagaaatc tggatggcta ttcacagaat gcctgtgctt 6540
tcagagttgc attttttctc tggtattctg gttcaagcat ttgaaggtag gaaaggttct 6600
ccaagtgcaa gaaagccagc cctgagcctc aactgcctgg ctagtgtggt cagtaggatg 6660
caaaggctgt tgaatgccac aaggccaaac tttaacctgt gtaccacaag cctagcagca 6720
gaggcagctc tgctcactgg aactctctgt cttctttctc ctgagccttt tcttttcctg 6780
agttttctag ctctcctcaa ccttacctct gccctaccca ggacaaaccc aagagccact 6840
gtttctgtga tgtcctctcc agccctaatt aggcatcatg acttcagcct gaccttccat 6900
gctcagaagc agtgctaatc cacttcagat gagctgctct atgcaacaca ggcagagcct 6960
acaaaccttt gcaccagagc cctccacata tcagtgtttg ttcatactca cttcaacagc 7020
aaatgtgact gctgagatta agattttaca caagatggtc tgtaatttca cagttagttt 7080
tatcccatta ggtatgaaag aattagcata attcccctta aacatgaatg aatcttagat 7140
tttttaataa atagttttgg aagtaaagac agagacatca ggagcacaag gaatagcctg 7200
agaggacaaa cagaacaaga aagagtctgg aaatacacag gatgttcttg gcctcctcaa 7260
agcaagtgca agcagatagt accagcagcc ccaggctatc agagcccagt gaagagaagt 7320
accatgaaag ccacagctct aaccaccctg ttccagagtg acagacagtc cccaagacaa 7380
gccagcctga gccagagaga gaactgcaag agaaagtttc taatttaggt tctgttagat 7440
tcagacaagt gcaggtcatc ctctctccac agctactcac ctctccagcc taacaaagcc 7500
tgcagtccac actccaaccc tggtgtctca cctcctagcc tctcccaaca tcctgctctc 7560
tgaccatctt ctgcatctct catctcacca tctcccactg tctacagcct actcttgcaa 7620
ctaccatctc attttctgac atcctgtcta catcttctgc catactctgc catctaccat 7680
accacctctt accatctacc acaccatctt ttatctccat ccctctcaga agcctccaag 7740
ctgaatcctg ctttatgtgt tcatctcagc ccctgcatgg aaagctgacc ccagaggcag 7800
aactattccc agagagcttg gccaagaaaa acaaaactac cagcctggcc aggctcagga 7860
gtagtaagct gcagtgtctg ttgtgttcta gcttcaacag ctgcaggagt tccactctca 7920
aatgctccac atttctcaca tcctcctgat tctggtcact acccatcttc aaagaacaga 7980
atatctcaca tcagcatact gtgaaggact agtcatgggt gcagctgctc agagctgcaa 8040
agtcattctg gatggtggag agcttacaaa catttcatga tgctcccccc gctctgatgg 8100
ctggagccca atccctacac agactcctgc tgtatgtgtt ttcctttcac tctgagccac 8160
agccagaggg caggcattca gtctcctctt caggctgggg ctggggcact gagaactcac 8220
ccaacacctt gctctcactc cttctgcaaa acaagaaaga gctttgtgct gcagtagcca 8280
tgaagaatga aaggaaggct ttaactaaaa aatgtcagag attattttca accccttact 8340
gtggatcacc agcaaggagg aaacacaaca cagagacatt ttttcccctc aaattatcaa 8400
aagaatcact gcatttgtta aagagagcaa ctgaatcagg aagcagagtt ttgaacatat 8460
cagaagttag gaatctgcat cagagacaaa tgcagtcatg gttgtttgct gcataccagc 8520
cctaatcatt agaagcctca tggacttcaa acatcattcc ctctgacaag atgctctagc 8580
ctaactccat gagataaaat aaatctgcct ttcagagcca aagaagagtc caccagcttc 8640
ttctcagtgt gaacaagagc tccagtcagg ttagtcagtc cagtgcagta gaggagacca 8700
gtctgcatcc tctaattttc aaaggcaaga agatttgttt accctggaca ccaggcacaa 8760
gtgaggtcac agagctctta gatatgcagt cctcatgagt gaggagacta aagcgcatgc 8820
catcaagact tcagtgtaga gaaaacctcc aaaaaagcct cctcactact tctggaatag 8880
ctcagaggcc gaggcggcct cggcctctgc ataaataaaa aaaattagtc agccatgggg 8940
cggagaatgg gcggaactgg gcggagttag gggcgggatg ggcggagtta ggggcgggac 9000
tatggttgct gactaattga gatgcatgct ttgcatactt ctgcctgctg gggagcctgg 9060
ggactttcca cacctggttg ctgactaatt gagatgcatg ctttgcatac ttctgcctgc 9120
tggggagcct ggggactttc cacaccctaa ctgacacaca ttccacagct gcattaatga 9180
atcggccaac gcgcggggag aggcggtttg cgtattgggc gctcttccgc ttcctcgctc 9240
actgactcgc tgcgctcggt cgttcggctg cggcgagcgg tatcagctca ctcaaaggcg 9300
gtaatacggt tatccacaga atcaggggat aacgcaggaa agaacatgtg agcaaaaggc 9360
cagcaaaagg ccaggaaccg taaaaaggcc gcgttgctgg cgtttttcca taggctccgc 9420
ccccctgacg agcatcacaa aaatcgacgc tcaagtcaga ggtggcgaaa cccgacagga 9480
ctataaagat accaggcgtt tccccctgga agctccctcg tgcgctctcc tgttccgacc 9540
ctgccgctta ccggatacct gtccgccttt ctcccttcgg gaagcgtggc gctttctcat 9600
agctcacgct gtaggtatct cagttcggtg taggtcgttc gctccaagct gggctgtgtg 9660
cacgaacccc ccgttcagcc cgaccgctgc gccttatccg gtaactatcg tcttgagtcc 9720
aacccggtaa gacacgactt atcgccactg gcagcagcca ctggtaacag gattagcaga 9780
gcgaggtatg taggcggtgc tacagagttc ttgaagtggt ggcctaacta cggctacact 9840
agaagaacag tatttggtat ctgcgctctg ctgaagccag ttaccttcgg aaaaagagtt 9900
ggtagctctt gatccggcaa acaaaccacc gctggtagcg gtggtttttt tgtttgcaag 9960
cagcagatta cgcgcagaaa aaaaggatct caagaagatc ctttgatctt ttctacgggg 10020
tctgacgctc agtggaacga aaactcacgt taagggattt tggtcatgag attatcaaaa 10080
aggatcttca cctagatcct tttaaattaa aaatgaagtt ttaaatcaat ctaaagtata 10140
tatgagtaaa cttggtctga cagttaccaa tgcttaatca gtgaggcacc tatctcagcg 10200
atctgtctat ttcgttcatc catagttgcc tgactcctgc aaaccacgtt gtgtctcaaa 10260
atctctgatg ttacattgca caagataaaa atatatcatc atgaacaata aaactgtctg 10320
cttacataaa cagtaataca aggggtgtta tgagccatat tcaacgggaa acgtcttgct 10380
cgaggccgcg attaaattcc aacatggatg ctgatttata tgggtataaa tgggctcgcg 10440
ataatgtcgg gcaatcaggt gcgacaatct atcgattgta tgggaagccc gatgcgccag 10500
agttgtttct gaaacatggc aaaggtagcg ttgccaatga tgttacagat gagatggtca 10560
gactaaactg gctgacggaa tttatgcctc ttccgaccat caagcatttt atccgtactc 10620
ctgatgatgc atggttactc accactgcga tccccgggaa aacagcattc caggtattag 10680
aagaatatcc tgattcaggt gaaaatattg ttgatgcgct ggcagtgttc ctgcgccggt 10740
tgcattcgat tcctgtttgt aattgtcctt ttaacagcga tcgcgtattt cgtctcgctc 10800
aggcgcaatc acgaatgaat aacggtttgg ttgatgcgag tgattttgat gacgagcgta 10860
atggctggcc tgttgaacaa gtctggaaag aaatgcataa gcttttgcca ttctcaccgg 10920
attcagtcgt cactcatggt gatttctcac ttgataacct tatttttgac gaggggaaat 10980
taataggttg tattgatgtt ggacgagtcg gaatcgcaga ccgataccag gatcttgcca 11040
tcctatggaa ctgcctcggt gagttttctc cttcattaca gaaacggctt tttcaaaaat 11100
atggtattga taatcctgat atgaataaat tgcagtttca tttgatgctc gatgagtttt 11160
tctaagggcg gcctgccacc atacccacgc cgaaacaagc gctcatgagc ccgaagtggc 11220
gagcccgatc ttccccatcg gtgatgtcgg cgatataggc gccagcaacc gcacctgtgg 11280
cgccggtgat gagggcgcgc caagtcgacg tccggcagtc 11320
<210> 113
<211> 3793
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 113
tctcattaga agtgaggcgg ggccggccaa atcgaatgga caccgggtaa ttagcagggt 60
tacccagata ctccagcacc tctttcccgt cggccgtgta cctgccattc acgtccatgc 120
cattgatggc cagcactgca tgacccactg cagaggtgaa gctaacggtc agcgaaggtg 180
cagcccgggg attccgccga ggggacaagg gacccgacac aacccctttt cccccaaccc 240
cgcacctaca accagcccac ttctacagca ctggggccct cccacccccg cacccgccac 300
gggcccgagc ctagcccacc tcggatgccg tcccgctggc cgaaagcaac caacacacgc 360
tcatcgtgta gcttgagcag cagatccagc ggataactga aagttttctc agcctcagcc 420
cgtggcgcgt agctgtccaa ctggtaaatc aagccgccag ctttgttcac cacatacaca 480
ctaaaaatcg ccatcgctgc cttgccgctc ggaaactggt attcagcctc tacccgacgg 540
cccctccccg gaaccgcatc acagcacttg ccgccggccc caccccagcc tcctcctcct 600
cctcctcctc ctcccgcgcc ccccgtgcag ccacctgctg cacttgcgca ctgggagcga 660
cacgctcggg cataagtagt gccgaaaagt tagctgccga gacctggtgg attgcttttc 720
gtttatcagt gcaggaaaac agcgctatag tactgcgtca caactagcgc agactccggc 780
agtatttagg cggtgcggct tgggaactag aatccacttc ctgtcttccg cctcaggcta 840
gagggcgagc gcttcgccgt gggacttctt ctgcctggct ccgcctcttg ccccggaagt 900
actcacagcg gacggtggtt tttgggcccg tttctgagca gcgcttcctt tttgtccgac 960
atcttgacga ggctgcggtg tctgctgcta ttctccgagc ttcgcaatgg taagcttcag 1020
gggtgtgaag tcgccggcgt tcttgggttt gaggactcag tggggagagc cttcggcggg 1080
agcgctcctt ggcctgccgg cctcggttgc agggcgggcg cggttattgc ttggcccatg 1140
tgctctggtg gtggagtttg cgggggctga gggcgcagta ttaggggact ttggcgctat 1200
ttgaggacct ggttgcattc ccgctgccct cctacagccg cctaaggacg acaagaagaa 1260
gaaggacgct ggaaagtcgg ccaagaaaga caaagaccca gtgaacaaat ccgggggcaa 1320
ggccaaaaag aaggtagaaa taagacctct ctgaaagaga ctaggggtaa ctctctcgta 1380
atcctctagt aataggtaac ttgtatagta agtggttttt caggtgtaga tttctagagt 1440
caaaatgtga gagtttatct tcccgtcacc actcgttctt tttcccatta ggatcatgaa 1500
aatgggtctg ttgtgcgaag tgtctgccgc tgtgcctgct gtgttatttt taactgatct 1560
agtggggctc ggcccctgtt tgaaggccaa aaacgtgtcg gtgttttttt tttgtttttg 1620
ttttagtaat gtgtaattta tccttgataa cggtggaaca gatttctctg acgcagatta 1680
ctcgagaggg aaagggtgct tctgccagaa atactaactt gtttctgttt tgttttggtg 1740
agcagaagtg gtccaaaggc aaagttcggg acaagctcaa taacttagtc ttgtttgaca 1800
aagctaccta tgataaactc tgtaaggaag ttcccaacta taaacttata accccagctg 1860
tggtctctga gagactgaag attcgaggct ccctggccag ggcagccctt caggagctcc 1920
ttagtaaagg tgaggggtgt atcctacatg tgtgtttttg taggttaaat tgtcttgacc 1980
atgttaagca tcttcagtgg ttttgctgga aaagcagaat taaaaaaaaa aagcgtggct 2040
tgaccattgg ctgttagtaa tgtaattctg acgtcttact cctgatcctg agatgaattc 2100
tcagggttct tagccacttt tgtgccgtgg accctgtggc agtttagtga agcccaagga 2160
tcttttatgt ttcgagtaaa tggatgcata gaattacagg gacaaccgtt tttgaaataa 2220
ttagattact attttgaaac aactttgaaa atgtttaaaa cctttatggt aaatattttg 2280
ttgatgtatt aaattttaaa accagaaatt tagtacggtc tactcagtag tatggtctga 2340
ttaccataat tccacaataa taaggctcag ctaactatag tgactgaacg tctataattc 2400
tagcactttg ggaggccaag gcgggtgaat caacggaggt caggagttaa agaccagcct 2460
ggccaatatg gtgaaaacct gctctactga aagttagctg gacgtggggg cacacgtctg 2520
taatcccagc tactcaggat gctgaggcat gaggatccct tgaacccagg agatggaggt 2580
ggcagtgagc cgagatgaca ccactgcact ccagccttag tgacagcaaa agactgtctc 2640
agaaaggggg ggggggtgga agataatgga gccctaattt aaaggaaaag taaggataga 2700
tgatccgtta aaaacttgga ttctcggtta ccgaacgtca gattaagcaa ttctggagcc 2760
aggtgcagtg gtacccttgt atttctagct acttgggagg ccaaagcagg aggatcattt 2820
gagccaagga gttttaagac cattctgggc acctctgaga gaactctgtc tttttgtttt 2880
ccttttcttt aaatagagat gcggttttgc catgttgccc aggctggtct cctgggctca 2940
agagatccac ctgtccaaag tgctgggatt acaggcatga gcctctgcac ccggccaaaa 3000
caaaccttac tagagtctca ttctgttgcc caggttggag tgcggagggg cagtcttggc 3060
tcaatgcaac caccaattcc tgggttcagg tggtcctcac ctcagcttcc caagtagctg 3120
gaattacaag catgtgccac catgcccagc taatttttgt atttttggta gagatggggt 3180
ttcaccttgt tggccaggct ggtgtgcaac tccttacctc aagctatctg cccgtctcca 3240
cctcccaaag cagtgggatt ataagcatga gccaccgcgc ccagccaaaa accttactag 3300
tttctattgt agcatctgtt aagcatctca tcgtgctatt ctctccccct aggacttatc 3360
aaactggttt caaagcacag agctcaagta atttacacca gaaataccaa gggtggagat 3420
gctccagctg ctggtgaaga tgcatgaata ggtgagtagg aatgtgtggg ctcatggtgt 3480
aggaggtaga tacaaagctt tatggttctg attcttttaa ttttttttta caggtccaac 3540
cagctgtaca tttggaaaaa taaaacttta ttaaatcaaa tgaatgagta tgtctgtttc 3600
ctaagaaaga caatgataaa gaatttggtg gaaggtataa taggggtttg ttgactttgc 3660
ttttagcctc atggtagttg gtagagagca tgattagctt ttttctgtat gtgactgctt 3720
cttcattgct gcagcttcag ttttgaattg atgtctgaaa ggaaataaag ggttaacacg 3780
atgatgaagg gtg 3793
<210> 114
<211> 6762
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 114
ggacggccga gcggcagggc gctcgcgcgc gcccactagt ggccggagga gaaggctccc 60
gcggaggccg cgctgcccgc cccctcccct ggggaggctc gcgttcccgc tgctcgcgcc 120
tgcgccgccc gccggcctca ggaacgcgcc ctcttcgccg gcgcgcgccc tcgcagtcac 180
cgccacccac cagctccggc accaacagca gcgccgctgc caccgcccac cttctgccgc 240
cgccaccaca gccaccttct cctcctccgc tgtcctctcc cgtcctcgcc tctgtcgact 300
atcaggtgaa ctttgaacca ggatggctga gccccgccag gagttcgaag tgatggaaga 360
tcacgctggg acgtacgggt tgggggacag gaaagatcag gggggctaca ccatgcacca 420
agaccaagag ggtgacacgg acgctggcct gaaagaatct cccctgcaga cccccactga 480
ggacggatct gaggaaccgg gctctgaaac ctctgatgct aagagcactc caacagcgga 540
agatgtgaca gcacccttag tggatgaggg agctcccggc aagcaggctg ccgcgcagcc 600
ccacacggag atcccagaag gaaccacagc tgaagaagca ggcattggag acacccccag 660
cctggaagac gaagctgctg gtcacgtgac ccaagagcct gaaagtggta aggtggtcca 720
ggaaggcttc ctccgagagc caggcccccc aggtctgagc caccagctca tgtccggcat 780
gcctggggct cccctcctgc ctgagggccc cagagaggcc acacgccaac cttcggggac 840
aggacctgag gacacagagg gcggccgcca cgcccctgag ctgctcaagc accagcttct 900
aggagacctg caccaggagg ggccgccgct gaagggggca gggggcaaag agaggccggg 960
gagcaaggag gaggtggatg aagaccgcga cgtcgatgag tcctcccccc aagactcccc 1020
tccctccaag gcctccccag cccaagatgg gcggcctccc cagacagccg ccagagaagc 1080
caccagcatc ccaggcttcc cagcggaggg tgccatcccc ctccctgtgg atttcctctc 1140
caaagtttcc acagagatcc cagcctcaga gcccgacggg cccagtgtag ggcgggccaa 1200
agggcaggat gcccccctgg agttcacgtt tcacgtggaa atcacaccca acgtgcagaa 1260
ggagcaggcg cactcggagg agcatttggg aagggctgca tttccagggg cccctggaga 1320
ggggccagag gcccggggcc cctctttggg agaggacaca aaagaggctg accttccaga 1380
gccctctgaa aagcagcctg ctgctgctcc gcgggggaag cccgtcagcc gggtccctca 1440
actcaaagct cgcatggtca gtaaaagcaa agacgggact ggaagcgatg acaaaaaagc 1500
caagacatcc acacgttcct ctgctaaaac cttgaaaaat aggccttgcc ttagccccaa 1560
acaccccact cctggtagct cagaccctct gatccaaccc tccagccctg ctgtgtgccc 1620
agagccacct tcctctccta aatacgtctc ttctgtcact tcccgaactg gcagttctgg 1680
agcaaaggag atgaaactca agggggctga tggtaaaacg aagatcgcca caccgcgggg 1740
agcagcccct ccaggccaga agggccaggc caacgccacc aggattccag caaaaacccc 1800
gcccgctcca aagacaccac ccagctctgg tgaacctcca aaatcagggg atcgcagcgg 1860
ctacagcagc cccggctccc caggcactcc cggcagccgc tcccgcaccc cgtcccttcc 1920
aaccccaccc acccgggagc ccaagaaggt ggcagtggtc cgtactccac ccaagtcgcc 1980
gtcttccgcc aagagccgcc tgcagacagc ccccgtgccc atgccagacc tgaagaatgt 2040
caagtccaag atcggctcca ctgagaacct gaagcaccag ccgggaggcg ggaaggtgca 2100
gataattaat aagaagctgg atcttagcaa cgtccagtcc aagtgtggct caaaggataa 2160
tatcaaacac gtcccgggag gcggcagtgt gcaaatagtc tacaaaccag ttgacctgag 2220
caaggtgacc tccaagtgtg gctcattagg caacatccat cataaaccag gaggtggcca 2280
ggtggaagta aaatctgaga agcttgactt caaggacaga gtccagtcga agattgggtc 2340
cctggacaat atcacccacg tccctggcgg aggaaataaa aagattgaaa cccacaagct 2400
gaccttccgc gagaacgcca aagccaagac agaccacggg gcggagatcg tgtacaagtc 2460
gccagtggtg tctggggaca cgtctccacg gcatctcagc aatgtctcct ccaccggcag 2520
catcgacatg gtagactcgc cccagctcgc cacgctagct gacgaggtgt ctgcctccct 2580
ggccaagcag ggtttgtgat caggcccctg gggcggtcaa taattgtgga gaggagagaa 2640
tgagagagtg tggaaaaaaa aagaataatg acccggcccc cgccctctgc ccccagctgc 2700
tcctcgcagt tcggttaatt ggttaatcac ttaacctgct tttgtcactc ggctttggct 2760
cgggacttca aaatcagtga tgggagtaag agcaaatttc atctttccaa attgatgggt 2820
gggctagtaa taaaatattt aaaaaaaaac attcaaaaac atggccacat ccaacatttc 2880
ctcaggcaat tccttttgat tcttttttct tccccctcca tgtagaagag ggagaaggag 2940
aggctctgaa agctgcttct gggggatttc aagggactgg gggtgccaac cacctctggc 3000
cctgttgtgg gggtgtcaca gaggcagtgg cagcaacaaa ggatttgaaa cttggtgtgt 3060
tcgtggagcc acaggcagac gatgtcaacc ttgtgtgagt gtgacggggg ttggggtggg 3120
gcgggaggcc acgggggagg ccgaggcagg ggctgggcag aggggagagg aagcacaaga 3180
agtgggagtg ggagaggaag ccacgtgctg gagagtagac atccccctcc ttgccgctgg 3240
gagagccaag gcctatgcca cctgcagcgt ctgagcggcc gcctgtcctt ggtggccggg 3300
ggtgggggcc tgctgtgggt cagtgtgcca ccctctgcag ggcagcctgt gggagaaggg 3360
acagcgggta aaaagagaag gcaagctggc aggagggtgg cacttcgtgg atgacctcct 3420
tagaaaagac tgaccttgat gtcttgagag cgctggcctc ttcctccctc cctgcagggt 3480
agggggcctg agttgagggg cttccctctg ctccacagaa accctgtttt attgagttct 3540
gaaggttgga actgctgcca tgattttggc cactttgcag acctgggact ttagggctaa 3600
ccagttctct ttgtaaggac ttgtgcctct tgggagacgt ccacccgttt ccaagcctgg 3660
gccactggca tctctggagt gtgtgggggt ctgggaggca ggtcccgagc cccctgtcct 3720
tcccacggcc actgcagtca ccccgtctgc gccgctgtgc tgttgtctgc cgtgagagcc 3780
caatcactgc ctatacccct catcacacgt cacaatgtcc cgaattccca gcctcaccac 3840
cccttctcag taatgaccct ggttggttgc aggaggtacc tactccatac tgagggtgaa 3900
attaagggaa ggcaaagtcc aggcacaaga gtgggacccc agcctctcac tctcagttcc 3960
actcatccaa ctgggaccct caccacgaat ctcatgatct gattcggttc cctgtctcct 4020
cctcccgtca cagatgtgag ccagggcact gctcagctgt gaccctaggt gtttctgcct 4080
tgttgacatg gagagagccc tttcccctga gaaggcctgg ccccttcctg tgctgagccc 4140
acagcagcag gctgggtgtc ttggttgtca gtggtggcac caggatggaa gggcaaggca 4200
cccagggcag gcccacagtc ccgctgtccc ccacttgcac cctagcttgt agctgccaac 4260
ctcccagaca gcccagcccg ctgctcagct ccacatgcat agtatcagcc ctccacaccc 4320
gacaaagggg aacacacccc cttggaaatg gttcttttcc cccagtccca gctggaagcc 4380
atgctgtctg ttctgctgga gcagctgaac atatacatag atgttgccct gccctcccca 4440
tctgcaccct gttgagttgt agttggattt gtctgtttat gcttggattc accagagtga 4500
ctatgatagt gaaaagaaaa aaaaaaaaaa aaaaggacgc atgtatcttg aaatgcttgt 4560
aaagaggttt ctaacccacc ctcacgaggt gtctctcacc cccacactgg gactcgtgtg 4620
gcctgtgtgg tgccaccctg ctggggcctc ccaagttttg aaaggctttc ctcagcacct 4680
gggacccaac agagaccagc ttctagcagc taaggaggcc gttcagctgt gacgaaggcc 4740
tgaagcacag gattaggact gaagcgatga tgtccccttc cctacttccc cttggggctc 4800
cctgtgtcag ggcacagact aggtcttgtg gctggtctgg cttgcggcgc gaggatggtt 4860
ctctctggtc atagcccgaa gtctcatggc agtcccaaag gaggcttaca actcctgcat 4920
cacaagaaaa aggaagccac tgccagctgg ggggatctgc agctcccaga agctccgtga 4980
gcctcagcca cccctcagac tgggttcctc tccaagctcg ccctctggag gggcagcgca 5040
gcctcccacc aagggccctg cgaccacagc agggattggg atgaattgcc tgtcctggat 5100
ctgctctaga ggcccaagct gcctgcctga ggaaggatga cttgacaagt caggagacac 5160
tgttcccaaa gccttgacca gagcacctca gcccgctgac cttgcacaaa ctccatctgc 5220
tgccatgaga aaagggaagc cgcctttgca aaacattgct gcctaaagaa actcagcagc 5280
ctcaggccca attctgccac ttctggtttg ggtacagtta aaggcaaccc tgagggactt 5340
ggcagtagaa atccagggcc tcccctgggg ctggcagctt cgtgtgcagc tagagcttta 5400
cctgaaagga agtctctggg cccagaactc tccaccaaga gcctccctgc cgttcgctga 5460
gtcccagcaa ttctcctaag ttgaagggat ctgagaagga gaaggaaatg tggggtagat 5520
ttggtggtgg ttagagatat gcccccctca ttactgccaa cagtttcggc tgcatttctt 5580
cacgcacctc ggttcctctt cctgaagttc ttgtgccctg ctcttcagca ccatgggcct 5640
tcttatacgg aaggctctgg gatctccccc ttgtggggca ggctcttggg gccagcctaa 5700
gatcatggtt tagggtgatc agtgctggca gataaattga aaaggcacgc tggcttgtga 5760
tcttaaatga ggacaatccc cccagggctg ggcactcctc ccctcccctc acttctccca 5820
cctgcagagc cagtgtcctt gggtgggcta gataggatat actgtatgcc ggctccttca 5880
agctgctgac tcactttatc aatagttcca tttaaattga cttcagtggt gagactgtat 5940
cctgtttgct attgcttgtt gtgctatggg gggagggggg aggaatgtgt aagatagtta 6000
acatgggcaa agggagatct tggggtgcag cacttaaact gcctcgtaac ccttttcatg 6060
atttcaacca catttgctag agggagggag cagccacgga gttagaggcc cttggggttt 6120
ctcttttcca ctgacaggct ttcccaggca gctggctagt tcattccctc cccagccagg 6180
tgcaggcgta ggaatatgga catctggttg ctttggcctg ctgccctctt tcaggggtcc 6240
taagcccaca atcatgcctc cctaagacct tggcatcctt ccctctaagc cgttggcacc 6300
tctgtgccac ctctcacact ggctccagac acacagcctg tgcttttgga gctgagatca 6360
ctcgcttcac cctcctcatc tttgttctcc aagtaaagcc acgaggtcgg ggcgagggca 6420
gaggtgatca cctgcgtgtc ccatctacag acctgcagct tcataaaact tctgatttct 6480
cttcagcttt gaaaagggtt accctgggca ctggcctaga gcctcacctc ctaatagact 6540
tagccccatg agtttgccat gttgagcagg actatttctg gcacttgcaa gtcccatgat 6600
ttcttcggta attctgaggg tggggggagg gacatgaaat catcttagct tagctttctg 6660
tctgtgaatg tctatatagt gtattgtgtg ttttaacaaa tgatttacac tgactgttgc 6720
tgtaaaagtg aatttggaaa taaagttatt actctgatta aa 6762
<210> 115
<211> 22
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 115
ataccttcca ccaaattctt ta 22
<210> 116
<211> 22
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 116
taaagaattt ggtggaaggt at 22
<210> 117
<211> 150
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 117
ctggaggctt gctttgggct gtatgctgat accttccacc aaattcttta ttttggcctc 60
tgactgataa agaattgtgg aaggtatcag gacacaaggc cctttatcag cactcacatg 120
gaacaaatgg ccaccgtggg aggatgacaa 150
<210> 118
<211> 150
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 118
ttgtcatcct cccacggtgg ccatttgttc catgtgagtg ctgataaagg gccttgtgtc 60
ctgatacctt ccacaattct ttatcagtca gaggccaaaa taaagaattt ggtggaaggt 120
atcagcatac agcccaaagc aagcctccag 150
<210> 119
<211> 21
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 119
ataagtcctt tactaaggag c 21
<210> 120
<211> 21
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 120
gctccttagt aaaggactta t 21
<210> 121
<211> 148
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 121
ctggaggctt gctttgggct gtatgctgat aagtccttta ctaaggagct tttggcctct 60
gactgagctc cttgtaagga cttatcagga cacaaggccc tttatcagca ctcacatgga 120
acaaatggcc accgtgggag gatgacaa 148
<210> 122
<211> 148
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 122
ttgtcatcct cccacggtgg ccatttgttc catgtgagtg ctgataaagg gccttgtgtc 60
ctgataagtc cttacaagga gctcagtcag aggccaaaag ctccttagta aaggacttat 120
cagcatacag cccaaagcaa gcctccag 148
<210> 123
<211> 21
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 123
gattttgaag tcccgagcca a 21
<210> 124
<211> 21
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 124
ttggctcggg acttcaaaat c 21
<210> 125
<211> 148
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 125
ctggaggctt gctttgggct gtatgctgga ttttgaagtc ccgagccaat tttggcctct 60
gactgattgg ctcggattca aaatccagga cacaaggccc tttatcagca ctcacatgga 120
acaaatggcc accgtgggag gatgacaa 148
<210> 126
<211> 147
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 126
tgtcatcctc ccacggtggc catttgttcc atgtgagtgc tgataaaggg ccttgtgtcc 60
tggattttga atccgagcca atcagtcaga ggccaaaatt ggctcgggac ttcaaaatcc 120
agcatacagc ccaaagcaag cctccag 147
<210> 127
<211> 23
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 127
ggaaatgttg gatgtggcca tgt 23
<210> 128
<211> 23
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 128
acatggccac atccaacatt tcc 23
<210> 129
<211> 152
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 129
ctggaggctt gctttgggct gtatgctggg aaatgttgga tgtggccatg tttttggcct 60
ctgactgaac atggcacacc aacatttccc aggacacaag gccctttatc agcactcaca 120
tggaacaaat ggccaccgtg ggaggatgac aa 152
<210> 130
<211> 152
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 130
ttgtcatcct cccacggtgg ccatttgttc catgtgagtg ctgataaagg gccttgtgtc 60
ctgggaaatg ttggtgtgcc atgttcagtc agaggccaaa aacatggcca catccaacat 120
ttcccagcat acagcccaaa gcaagcctcc ag 152
<210> 131
<211> 45
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 131
ggtgcagata attaataagt tcgcttatta attatctgca ccttc 45
<210> 132
<211> 45
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 132
gaaggtgcag ataattaata agcgaactta ttaattatct gcacc 45
<210> 133
<211> 139
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 133
tggaggcttg ctgaaggctg tatgctgttg tcggtgcaga taattaataa gttcgcttat 60
taattatctg caccttcagg acacaaggcc tgttactagc actcacatgg aacaaatggc 120
caccgtggga ggatgacaa 139
<210> 134
<211> 139
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 134
ttgtcatcct cccacggtgg ccatttgttc catgtgagtg ctagtaacag gccttgtgtc 60
ctgaaggtgc agataattaa taagcgaact tattaattat ctgcaccgac aacagcatac 120
agccttcagc aagcctcca 139
<210> 135
<211> 21
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 135
ttgtagacta tttgcacact g 21
<210> 136
<211> 21
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 136
cagtgtgcaa atagtctaca a 21
<210> 137
<211> 152
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 137
ttgtcatcct cccacggtgg ccatttgttc catgtgagtg ctagtaacag gccttgtgtc 60
ctttgtagac tatttgcaca ctgcatctgt ggcttcactc agtgtgcaaa tagtctacaa 120
gacaacagca tacagccttc agcaagcctc ca 152
<210> 138
<211> 152
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 138
tggaggcttg ctgaaggctg tatgctgttg tcttgtagac tatttgcaca ctgagtgaag 60
ccacagatgc agtgtgcaaa tagtctacaa aggacacaag gcctgttact agcactcaca 120
tggaacaaat ggccaccgtg ggaggatgac aa 152
<210> 139
<211> 4321
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 139
ttggccactc cctctctgcg cgctcgctcg ctcactgagg ccgggcgacc aaaggtcgcc 60
cgacgcccgg gctttgcccg ggcggcctca gtgagcgagc gagcgcgcag agagggagtg 120
gccaactcca tcactagggg ttcctgctag ctctgggtat ttaagcccga gtgagcacgc 180
agggtctcca ttttgaagcg ggaggttacg cgttcgtcga ctactagtgg gtaccagagc 240
tccctaggtt ctagaaccgg tgacgtctcc catggtgaag cttggatctg aattcggtac 300
cctagttatt aatagtaatc aattacgggg tcattagttc atagcccata tatggagttc 360
cgcgttacat aacttacggt aaatggcccg cctggctgac cgcccaacga cccccgccca 420
ttgacgtcaa taatgacgta tgttcccata gtaacgccaa tagggacttt ccattgacgt 480
caatgggtgg actatttacg gtaaactgcc cacttggcag tacatcaagt gtatcatatg 540
ccaagtacgc cccctattga cgtcaatgac ggtaaatggc ccgcctggca ttatgcccag 600
tacatgacct tatgggactt tcctacttgg cagtacatct acgtattagt catcgctatt 660
accatggtcg aggtgagccc cacgttctgc ttcactctcc ccatctcccc cccctcccca 720
cccccaattt tgtatttatt tattttttaa ttattttgtg cagcgatggg ggcggggggg 780
gggggggggc gcgcgccagg cggggcgggg cggggcgagg ggcggggcgg ggcgaggcgg 840
agaggtgcgg cggcagccaa tcagagcggc gcgctccgaa agtttccttt tatggcgagg 900
cggcggcggc ggcggcccta taaaaagcga agcgcgcggc gggcgggagt cgctgcgacg 960
ctgccttcgc cccgtgcccc gctccgccgc cgcctcgcgc cgcccgcccc ggctctgact 1020
gaccgcgtta ctcccacagg tgagcgggcg ggacggccct tctcctcagc gctgtaatta 1080
gcgcttggtt taatgacggc ttgttggagg cttgctgaag gctgtatgct gttgtcggtg 1140
cagataatta ataagttcgc ttattaatta tctgcacctt caggacacaa ggcctgttac 1200
tagcactcac atggaacaaa tggccaccgt gggaggatga caatttctgt ggctgcgtga 1260
aagccttgag gggctccggg agctagagcc tctgctaacc atgttcatgc cttcttcttt 1320
ttcctacagc tcctgggcaa cgtgctggtt attgtgctgt ctcatcattt tggcaaagaa 1380
ttcctcgaag atccgaaggg aaagtcttcc acgactgtgg gatccgttcg aagatatcac 1440
cggttgagcc accatgtgga ccctggtgag ctgggtggcc ctgaccgccg gcctggtggc 1500
cggcacccgc tgccccgacg gccagttctg ccccgtggcc tgctgcctgg accccggcgg 1560
cgccagctac agctgctgcc gccccctgct ggacaagtgg cccaccaccc tgagccgcca 1620
cctgggcggc ccctgccagg tggacgccca ctgcagcgcc ggccacagct gcatcttcac 1680
cgtgagcggc accagcagct gctgcccctt ccccgaggcc gtggcctgcg gcgacggcca 1740
ccactgctgc ccccgcggct tccactgcag cgccgacggc cgcagctgct tccagcgcag 1800
cggcaacaac agcgtgggcg ccatccagtg ccccgacagc cagttcgagt gccccgactt 1860
cagcacctgc tgcgtgatgg tggacggcag ctggggctgc tgccccatgc cccaggccag 1920
ctgctgcgag gaccgcgtgc actgctgccc ccacggcgcc ttctgcgacc tggtgcacac 1980
ccgctgcatc acccccaccg gcacccaccc cctggccaag aagctgcccg cccagcgcac 2040
caaccgcgcc gtggccctga gcagcagcgt gatgtgcccc gacgcccgca gccgctgccc 2100
cgacggcagc acctgctgcg agctgcccag cggcaagtac ggctgctgcc ccatgcccaa 2160
cgccacctgc tgcagcgacc acctgcactg ctgcccccag gacaccgtgt gcgacctgat 2220
ccagagcaag tgcctgagca aggagaacgc caccaccgac ctgctgacca agctgcccgc 2280
ccacaccgtg ggcgacgtga agtgcgacat ggaggtgagc tgccccgacg gctacacctg 2340
ctgccgcctg cagagcggcg cctggggctg ctgccccttc acccaggccg tgtgctgcga 2400
ggaccacatc cactgctgcc ccgccggctt cacctgcgac acccagaagg gcacctgcga 2460
gcagggcccc caccaggtgc cctggatgga gaaggccccc gcccacctga gcctgcccga 2520
cccccaggcc ctgaagcgcg acgtgccctg cgacaacgtg agcagctgcc ccagcagcga 2580
cacctgctgc cagctgacca gcggcgagtg gggctgctgc cccatccccg aggccgtgtg 2640
ctgcagcgac caccagcact gctgccccca gggctacacc tgcgtggccg agggccagtg 2700
ccagcgcggc agcgagatcg tggccggcct ggagaagatg cccgcccgcc gcgccagcct 2760
gagccacccc cgcgacatcg gctgcgacca gcacaccagc tgccccgtgg gccagacctg 2820
ctgccccagc ctgggcggca gctgggcctg ctgccagctg ccccacgccg tgtgctgcga 2880
ggaccgccag cactgctgcc ccgccggcta cacctgcaac gtgaaggccc gcagctgcga 2940
gaaggaggtg gtgagcgccc agcccgccac cttcctggcc cgcagccccc acgtgggcgt 3000
gaaggacgtg gagtgcggcg agggccactt ctgccacgac aaccagacct gctgccgcga 3060
caaccgccag ggctgggcct gctgccccta ccgccagggc gtgtgctgcg ccgaccgccg 3120
ccactgctgc cccgccggct tccgctgcgc cgcccgcggc accaagtgcc tgcgccgcga 3180
ggccccccgc tgggacgccc ccctgcgcga ccccgccctg cgccagctgc tgtgacaatt 3240
gttaattaag tttaaaccct cgaggccgca agcttatcga taatcaacct ctggattaca 3300
aaatttgtga aagattgact ggtattctta actatgttgc tccttttacg ctatgtggat 3360
acgctgcttt aatgcctttg tatcatgcta ttgcttcccg tatggctttc attttctcct 3420
ccttgtataa atcctggttg ctgtctcttt atgaggagtt gtggcccgtt gtcaggcaac 3480
gtggcgtggt gtgcactgtg tttgctgacg caacccccac tggttggggc attgccacca 3540
cctgtcagct cctttccggg actttcgctt tccccctccc tattgccacg gcggaactca 3600
tcgccgcctg ccttgcccgc tgctggacag gggctcggct gttgggcact gacaattccg 3660
tggtgttgtc ggggaaatca tcgtcctttc cttggctgct cgcctgtgtt gccacctgga 3720
ttctgcgcgg gacgtccttc tgctacgtcc cttcggccct caatccagcg gaccttcctt 3780
cccgcggcct gctgccggct ctgcggcctc ttccgcgtct tcgccttcgc cctcagacga 3840
gtcggatctc cctttgggcc gcctccccgc atcgataccg tcgactagag ctcgctgatc 3900
agcctcgact gtgccttcta gttgccagcc atctgttgtt tgcccctccc ccgtgccttc 3960
cttgaccctg gaaggtgcca ctcccactgt cctttcctaa taaaatgagg aaattgcatc 4020
gcattgtctg agtaggtgtc attctattct ggggggtggg gtggggcagg acagcaaggg 4080
ggaggattgg gaagacaata gcaggcatgc tggggagaga tccacgataa caaacagctt 4140
ttttggggtg aacatattga ctgaattccc tgcaggttgg ccactccctc tctgcgcgct 4200
cgctcgctca ctgaggccgc ccgggcaaag cccgggcgtc gggcgacctt tggtcgcccg 4260
gcctcagtga gcgagcgagc gcgcagagag ggagtggcca actccatcac taggggttcc 4320
t 4321
<210> 140
<211> 4552
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 140
ttggccactc cctctctgcg cgctcgctcg ctcactgagg ccgggcgacc aaaggtcgcc 60
cgacgcccgg gctttgcccg ggcggcctca gtgagcgagc gagcgcgcag agagggagtg 120
gccaactcca tcactagggg ttcctgctag ctctgggtat ttaagcccga gtgagcacgc 180
agggtctcca ttttgaagcg ggaggttacg cgttcgtcga ctactagtgg gtaccagagc 240
aaaaaaattg tcatcctccc acggtggcca tttgttccat gtgagtgcta gtaacaggcc 300
ttgtgtcctt tgtagactat ttgcacactg catctgtggc ttcactcagt gtgcaaatag 360
tctacaagac aacagcatac agccttcagc aagcctccag tggtctcata cagaacttat 420
aagattccca aatccaaaga catttcacgt ttatggtgat ttcccagaac acatagcgac 480
atgcaaatat tgcagggcgc cactcccctg tccctcacag ccatcttcct gccagggcgc 540
acgcgcgctg ggtgttcccg cctagtgaca ctgggcccgc gattccttgg agcgggttga 600
tgacgtcagc gtttcccatg gtgaagcttg gatctgatcc ctaggttcta gaaccggtga 660
cattcggtac cctagttatt aatagtaatc aattacgggg tcattagttc atagcccata 720
tatggagttc cgcgttacat aacttacggt aaatggcccg cctggctgac cgcccaacga 780
cccccgccca ttgacgtcaa taatgacgta tgttcccata gtaacgccaa tagggacttt 840
ccattgacgt caatgggtgg actatttacg gtaaactgcc cacttggcag tacatcaagt 900
gtatcatatg ccaagtacgc cccctattga cgtcaatgac ggtaaatggc ccgcctggca 960
ttatgcccag tacatgacct tatgggactt tcctacttgg cagtacatct acgtattagt 1020
catcgctatt accatggtcg aggtgagccc cacgttctgc ttcactctcc ccatctcccc 1080
cccctcccca cccccaattt tgtatttatt tattttttaa ttattttgtg cagcgatggg 1140
ggcggggggg gggggggggc gcgcgccagg cggggcgggg cggggcgagg ggcggggcgg 1200
ggcgaggcgg agaggtgcgg cggcagccaa tcagagcggc gcgctccgaa agtttccttt 1260
tatggcgagg cggcggcggc ggcggcccta taaaaagcga agcgcgcggc gggcgggagt 1320
cgctgcgacg ctgccttcgc cccgtgcccc gctccgccgc cgcctcgcgc cgcccgcccc 1380
ggctctgact gaccgcgtta ctcccacagg tgagcgggcg ggacggccct tctcctccgg 1440
gctgtaatta gcgcttggtt taatgacggc ttgttttctg tggctgcgtg aaagccttga 1500
ggggctccgg gagctagagc ctctgctaac catgttcatg ccttcttctt tttcctacag 1560
ctcctgggca acgtgctggt tattgtgctg tctcatcatt ttggcaaaga attcctcgaa 1620
gatccgaagg gaaagtcttc cacgactgtg ggatccgttc gaagatatca ccggttgagc 1680
caccatgtgg accctggtga gctgggtggc cctgaccgcc ggcctggtgg ccggcacccg 1740
ctgccccgac ggccagttct gccccgtggc ctgctgcctg gaccccggcg gcgccagcta 1800
cagctgctgc cgccccctgc tggacaagtg gcccaccacc ctgagccgcc acctgggcgg 1860
cccctgccag gtggacgccc actgcagcgc cggccacagc tgcatcttca ccgtgagcgg 1920
caccagcagc tgctgcccct tccccgaggc cgtggcctgc ggcgacggcc accactgctg 1980
cccccgcggc ttccactgca gcgccgacgg ccgcagctgc ttccagcgca gcggcaacaa 2040
cagcgtgggc gccatccagt gccccgacag ccagttcgag tgccccgact tcagcacctg 2100
ctgcgtgatg gtggacggca gctggggctg ctgccccatg ccccaggcca gctgctgcga 2160
ggaccgcgtg cactgctgcc cccacggcgc cttctgcgac ctggtgcaca cccgctgcat 2220
cacccccacc ggcacccacc ccctggccaa gaagctgccc gcccagcgca ccaaccgcgc 2280
cgtggccctg agcagcagcg tgatgtgccc cgacgcccgc agccgctgcc ccgacggcag 2340
cacctgctgc gagctgccca gcggcaagta cggctgctgc cccatgccca acgccacctg 2400
ctgcagcgac cacctgcact gctgccccca ggacaccgtg tgcgacctga tccagagcaa 2460
gtgcctgagc aaggagaacg ccaccaccga cctgctgacc aagctgcccg cccacaccgt 2520
gggcgacgtg aagtgcgaca tggaggtgag ctgccccgac ggctacacct gctgccgcct 2580
gcagagcggc gcctggggct gctgcccctt cacccaggcc gtgtgctgcg aggaccacat 2640
ccactgctgc cccgccggct tcacctgcga cacccagaag ggcacctgcg agcagggccc 2700
ccaccaggtg ccctggatgg agaaggcccc cgcccacctg agcctgcccg acccccaggc 2760
cctgaagcgc gacgtgccct gcgacaacgt gagcagctgc cccagcagcg acacctgctg 2820
ccagctgacc agcggcgagt ggggctgctg ccccatcccc gaggccgtgt gctgcagcga 2880
ccaccagcac tgctgccccc agggctacac ctgcgtggcc gagggccagt gccagcgcgg 2940
cagcgagatc gtggccggcc tggagaagat gcccgcccgc cgcgccagcc tgagccaccc 3000
ccgcgacatc ggctgcgacc agcacaccag ctgccccgtg ggccagacct gctgccccag 3060
cctgggcggc agctgggcct gctgccagct gccccacgcc gtgtgctgcg aggaccgcca 3120
gcactgctgc cccgccggct acacctgcaa cgtgaaggcc cgcagctgcg agaaggaggt 3180
ggtgagcgcc cagcccgcca ccttcctggc ccgcagcccc cacgtgggcg tgaaggacgt 3240
ggagtgcggc gagggccact tctgccacga caaccagacc tgctgccgcg acaaccgcca 3300
gggctgggcc tgctgcccct accgccaggg cgtgtgctgc gccgaccgcc gccactgctg 3360
ccccgccggc ttccgctgcg ccgcccgcgg caccaagtgc ctgcgccgcg aggccccccg 3420
ctgggacgcc cccctgcgcg accccgccct gcgccagctg ctgtgacaat tgttaattaa 3480
gtttaaaccc tcgaggccgc aagcttatcg ataatcaacc tctggattac aaaatttgtg 3540
aaagattgac tggtattctt aactatgttg ctccttttac gctatgtgga tacgctgctt 3600
taatgccttt gtatcatgct attgcttccc gtatggcttt cattttctcc tccttgtata 3660
aatcctggtt gctgtctctt tatgaggagt tgtggcccgt tgtcaggcaa cgtggcgtgg 3720
tgtgcactgt gtttgctgac gcaaccccca ctggttgggg cattgccacc acctgtcagc 3780
tcctttccgg gactttcgct ttccccctcc ctattgccac ggcggaactc atcgccgcct 3840
gccttgcccg ctgctggaca ggggctcggc tgttgggcac tgacaattcc gtggtgttgt 3900
cggggaaatc atcgtccttt ccttggctgc tcgcctgtgt tgccacctgg attctgcgcg 3960
ggacgtcctt ctgctacgtc ccttcggccc tcaatccagc ggaccttcct tcccgcggcc 4020
tgctgccggc tctgcggcct cttccgcgtc ttcgccttcg ccctcagacg agtcggatct 4080
ccctttgggc cgcctccccg catcgatacc gtcgactaga gctcgctgat cagcctcgac 4140
tgtgccttct agttgccagc catctgttgt ttgcccctcc cccgtgcctt ccttgaccct 4200
ggaaggtgcc actcccactg tcctttccta ataaaatgag gaaattgcat cgcattgtct 4260
gagtaggtgt cattctattc tggggggtgg ggtggggcag gacagcaagg gggaggattg 4320
ggaagacaat agcaggcatg ctggggagag atccacgata acaaacagct tttttggggt 4380
gaacatattg actgaattcc ctgcaggttg gccactccct ctctgcgcgc tcgctcgctc 4440
actgaggccg cccgggcaaa gcccgggcgt cgggcgacct ttggtcgccc ggcctcagtg 4500
agcgagcgag cgcgcagaga gggagtggcc aactccatca ctaggggttc ct 4552
<210> 141
<211> 4162
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 141
ttggccactc cctctctgcg cgctcgctcg ctcactgagg ccgggcgacc aaaggtcgcc 60
cgacgcccgg gctttgcccg ggcggcctca gtgagcgagc gagcgcgcag agagggagtg 120
gccaactcca tcactagggg ttcctgctag ctctgggtat ttaagcccga gtgagcacgc 180
agggtctcca ttttgaagcg ggaggttacg cgttcgtcga ctactagtgg gtaccagagc 240
tccctaggtt ctagaaccgg tgacgtctcc catggtgaag cttggatctg aattcggtac 300
ctagttatta atagtaatca attacggggt cattagttca tagcccatat atggagttcc 360
gcgttacata acttacggta aatggcccgc ctggctgacc gcccaacgac ccccgcccat 420
tgacgtcaat aatgacgtat gttcccatag taacgccaat agggactttc cattgacgtc 480
aatgggtgga gtatttacgg taaactgccc acttggcagt acatcaagtg tatcatatgc 540
caagtacgcc ccctattgac gtcaatgacg gtaaatggcc cgcctggcat tatgcccagt 600
acatgacctt atgggacttt cctacttggc agtacatcta cgtattagtc atcgctatta 660
ccatggtcga ggtgagcccc acgttctgct tcactctccc catctccccc ccctccccac 720
ccccaatttt gtatttattt attttttaat tattttgtgc agcgatgggg gcgggggggg 780
ggggggggcg cgcgccaggc ggggcggggc ggggcgaggg gcggggcggg gcgaggcgga 840
gaggtgcggc ggcagccaat cagagcggcg cgctccgaaa gtttcctttt atggcgaggc 900
ggcggcggcg gcggccctat aaaaagcgaa gcgcgcggcg ggcgggagtc gctgcgacgc 960
tgccttcgcc ccgtgccccg ctccgccgcc gcctcgcgcc gcccgccccg gctctgactg 1020
accgcgttac tcccacaggt gagcgggcgg gacggccctt ctcctcagcg ctgtaattag 1080
cgcttggttt aatgacggct tgttggaggc ttgctgaagg ctgtatgctg ttgtctttag 1140
aaataagtgg tagtcaagtg aagccacaga tgtgactacc acttatttct aaaaggacac 1200
aaggcctgtt actagcactc acatggaaca aatggccacc gtgggaggat gacaatttct 1260
gtggctgcgt gaaagccttg aggggctccg ggagctagag cctctgctaa ccatgttcat 1320
gccttcttct ttttcctaca gctcctgggc aacgtgctgg ttattgtgct gtctcatcat 1380
tttggcaaag aattcctcga agatccgaag ggaaagtctt ccacgactgt gggatccgtt 1440
cgaagatatc accggttgag ccaccatgga attcagcagc cccagcagag aggaatgccc 1500
caagcctctg agccgggtgt caatcatggc cggatctctg acaggactgc tgctgcttca 1560
ggccgtgtct tgggcttctg gcgctagacc ttgcatcccc aagagcttcg gctacagcag 1620
cgtcgtgtgc gtgtgcaatg ccacctactg cgacagcttc gaccctccta cctttcctgc 1680
tctgggcacc ttcagcagat acgagagcac cagatccggc agacggatgg aactgagcat 1740
gggacccatc caggccaatc acacaggcac tggcctgctg ctgacactgc agcctgagca 1800
gaaattccag aaagtgaaag gcttcggcgg agccatgaca gatgccgccg ctctgaatat 1860
cctggctctg tctccaccag ctcagaacct gctgctcaag agctacttca gcgaggaagg 1920
catcggctac aacatcatca gagtgcccat ggccagctgc gacttcagca tcaggaccta 1980
cacctacgcc gacacacccg acgatttcca gctgcacaac ttcagcctgc ctgaagagga 2040
caccaagctg aagatccctc tgatccacag agccctgcag ctggcacaaa gacccgtgtc 2100
actgctggcc tctccatgga catctcccac ctggctgaaa acaaatggcg ccgtgaatgg 2160
caagggcagc ctgaaaggcc aacctggcga catctaccac cagacctggg ccagatactt 2220
cgtgaagttc ctggacgcct atgccgagca caagctgcag ttttgggccg tgacagccga 2280
gaacgaacct tctgctggac tgctgagcgg ctaccccttt cagtgcctgg gctttacacc 2340
cgagcaccag cgggacttta tcgcccgtga tctgggaccc acactggcca atagcaccca 2400
ccataatgtg cggctgctga tgctggacga ccagagactg cttctgcccc actgggctaa 2460
agtggtgctg acagatcctg aggccgccaa atacgtgcac ggaatcgccg tgcactggta 2520
tctggacttt ctggcccctg ccaaggccac actgggagag acacacagac tgttccccaa 2580
caccatgctg ttcgccagcg aagcctgtgt gggcagcaag ttttgggaac agagcgtgcg 2640
gctcggcagc tgggatagag gcatgcagta cagccacagc atcatcacca acctgctgta 2700
ccacgtcgtc ggctggaccg actggaatct ggccctgaat cctgaaggcg gccctaactg 2760
ggtccgaaac ttcgtggaca gccccatcat cgtggacatc accaaggaca ccttctacaa 2820
gcagcccatg ttctaccacc tgggacactt cagcaagttc atccccgagg gctctcagcg 2880
cgttggactg gtggcttccc agaagaacga tctggacgcc gtggctctga tgcaccctga 2940
tggatctgct gtggtggtgg tcctgaaccg cagcagcaaa gatgtgcccc tgaccatcaa 3000
ggatcccgcc gtgggattcc tggaaacaat cagccctggc tactccatcc acacctacct 3060
gtggcgtaga cagtgacaat tgttaattaa gtttaaaccc tcgaggccgc aagcttatcg 3120
ataatcaacc tctggattac aaaatttgtg aaagattgac tggtattctt aactatgttg 3180
ctccttttac gctatgtgga tacgctgctt taatgccttt gtatcatgct attgcttccc 3240
gtatggcttt cattttctcc tccttgtata aatcctggtt gctgtctctt tatgaggagt 3300
tgtggcccgt tgtcaggcaa cgtggcgtgg tgtgcactgt gtttgctgac gcaaccccca 3360
ctggttgggg cattgccacc acctgtcagc tcctttccgg gactttcgct ttccccctcc 3420
ctattgccac ggcggaactc atcgccgcct gccttgcccg ctgctggaca ggggctcggc 3480
tgttgggcac tgacaattcc gtggtgttgt cggggaaatc atcgtccttt ccttggctgc 3540
tcgcctgtgt tgccacctgg attctgcgcg ggacgtcctt ctgctacgtc ccttcggccc 3600
tcaatccagc ggaccttcct tcccgcggcc tgctgccggc tctgcggcct cttccgcgtc 3660
ttcgccttcg ccctcagacg agtcggatct ccctttgggc cgcctccccg catcgatacc 3720
gtcgactaga gctcgctgat cagcctcgac tgtgccttct agttgccagc catctgttgt 3780
ttgcccctcc cccgtgcctt ccttgaccct ggaaggtgcc actcccactg tcctttccta 3840
ataaaatgag gaaattgcat cgcattgtct gagtaggtgt cattctattc tggggggtgg 3900
ggtggggcag gacagcaagg gggaggattg ggaagacaat agcaggcatg ctggggagag 3960
atccacgata acaaacagct tttttggggc ccacatgtac actgaattcc ctgcaggttg 4020
gccactccct ctctgcgcgc tcgctcgctc actgaggccg cccgggcaaa gcccgggcgt 4080
cgggcgacct ttggtcgccc ggcctcagtg agcgagcgag cgcgcagaga gggagtggcc 4140
aactccatca ctaggggttc ct 4162
<210> 142
<211> 4578
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 142
ttggccactc cctctctgcg cgctcgctcg ctcactgagg ccgggcgacc aaaggtcgcc 60
cgacgcccgg gctttgcccg ggcggcctca gtgagcgagc gagcgcgcag agagggagtg 120
gccaactcca tcactagggg ttcctgctag ctctgggtat ttaagcccga gtgagcacgc 180
agggtctcca ttttgaagcg ggaggttacg cgttcgtcga ctactagtgg gtaccagagc 240
aaaaaaattg tcatcctccc acggtggcca tttgttccat gtgagtgcta gtaacaggcc 300
ttgtgtcctt tgtagactat ttgcacactg catctgtggc ttcactcagt gtgcaaatag 360
tctacaagac aacagcatac agccttcagc aagcctccag tggtctcata cagaacttat 420
aagattccca aatccaaaga catttcacgt ttatggtgat ttcccagaac acatagcgac 480
atgcaaatat tgcagggcgc cactcccctg tccctcacag ccatcttcct gccagggcgc 540
acgcgcgctg ggtgttcccg cctagtgaca ctgggcccgc gattccttgg agcgggttga 600
tgacgtcagc gtttcccatg gtgaagcttg gatctgatcc ctaggttcta gaaccggtga 660
cgtctcccat ggtgaagctt ggatctgaat tcggtaccta gttattaata gtaatcaatt 720
acggggtcat tagttcatag cccatatatg gagttccgcg ttacataact tacggtaaat 780
ggcccgcctg gctgaccgcc caacgacccc cgcccattga cgtcaataat gacgtatgtt 840
cccatagtaa cgccaatagg gactttccat tgacgtcaat gggtggagta tttacggtaa 900
actgcccact tggcagtaca tcaagtgtat catatgccaa gtacgccccc tattgacgtc 960
aatgacggta aatggcccgc ctggcattat gcccagtaca tgaccttatg ggactttcct 1020
acttggcagt acatctacgt attagtcatc gctattacca tggtcgaggt gagccccacg 1080
ttctgcttca ctctccccat ctcccccccc tccccacccc caattttgta tttatttatt 1140
ttttaattat tttgtgcagc gatgggggcg gggggggggg gggggcgcgc gccaggcggg 1200
gcggggcggg gcgaggggcg gggcggggcg aggcggagag gtgcggcggc agccaatcag 1260
agcggcgcgc tccgaaagtt tccttttatg gcgaggcggc ggcggcggcg gccctataaa 1320
aagcgaagcg cgcggcgggc gggagtcgct gcgacgctgc cttcgccccg tgccccgctc 1380
cgccgccgcc tcgcgccgcc cgccccggct ctgactgacc gcgttactcc cacaggtgag 1440
cgggcgggac ggcccttctc ctccgggctg taattagcgc ttggtttaat gacggcttgt 1500
tttctgtggc tgcgtgaaag ccttgagggg ctccgggagc tagagcctct gctaaccatg 1560
ttcatgcctt cttctttttc ctacagctcc tgggcaacgt gctggttatt gtgctgtctc 1620
atcattttgg caaagaattc ctcgaagatc cgaagggaaa gtcttccacg actgtgggat 1680
ccgttcgaag atatcaccgg ttgagccacc atgtggaccc tggtgagctg ggtggccctg 1740
accgccggcc tggtggccgg cacccgctgc cccgacggcc agttctgccc cgtggcctgc 1800
tgcctggacc ccggcggcgc cagctacagc tgctgccgcc ccctgctgga caagtggccc 1860
accaccctga gccgccacct gggcggcccc tgccaggtgg acgcccactg cagcgccggc 1920
cacagctgca tcttcaccgt gagcggcacc agcagctgct gccccttccc cgaggccgtg 1980
gcctgcggcg acggccacca ctgctgcccc cgcggcttcc actgcagcgc cgacggccgc 2040
agctgcttcc agcgcagcgg caacaacagc gtgggcgcca tccagtgccc cgacagccag 2100
ttcgagtgcc ccgacttcag cacctgctgc gtgatggtgg acggcagctg gggctgctgc 2160
cccatgcccc aggccagctg ctgcgaggac cgcgtgcact gctgccccca cggcgccttc 2220
tgcgacctgg tgcacacccg ctgcatcacc cccaccggca cccaccccct ggccaagaag 2280
ctgcccgccc agcgcaccaa ccgcgccgtg gccctgagca gcagcgtgat gtgccccgac 2340
gcccgcagcc gctgccccga cggcagcacc tgctgcgagc tgcccagcgg caagtacggc 2400
tgctgcccca tgcccaacgc cacctgctgc agcgaccacc tgcactgctg cccccaggac 2460
accgtgtgcg acctgatcca gagcaagtgc ctgagcaagg agaacgccac caccgacctg 2520
ctgaccaagc tgcccgccca caccgtgggc gacgtgaagt gcgacatgga ggtgagctgc 2580
cccgacggct acacctgctg ccgcctgcag agcggcgcct ggggctgctg ccccttcacc 2640
caggccgtgt gctgcgagga ccacatccac tgctgccccg ccggcttcac ctgcgacacc 2700
cagaagggca cctgcgagca gggcccccac caggtgccct ggatggagaa ggcccccgcc 2760
cacctgagcc tgcccgaccc ccaggccctg aagcgcgacg tgccctgcga caacgtgagc 2820
agctgcccca gcagcgacac ctgctgccag ctgaccagcg gcgagtgggg ctgctgcccc 2880
atccccgagg ccgtgtgctg cagcgaccac cagcactgct gcccccaggg ctacacctgc 2940
gtggccgagg gccagtgcca gcgcggcagc gagatcgtgg ccggcctgga gaagatgccc 3000
gcccgccgcg ccagcctgag ccacccccgc gacatcggct gcgaccagca caccagctgc 3060
cccgtgggcc agacctgctg ccccagcctg ggcggcagct gggcctgctg ccagctgccc 3120
cacgccgtgt gctgcgagga ccgccagcac tgctgccccg ccggctacac ctgcaacgtg 3180
aaggcccgca gctgcgagaa ggaggtggtg agcgcccagc ccgccacctt cctggcccgc 3240
agcccccacg tgggcgtgaa ggacgtggag tgcggcgagg gccacttctg ccacgacaac 3300
cagacctgct gccgcgacaa ccgccagggc tgggcctgct gcccctaccg ccagggcgtg 3360
tgctgcgccg accgccgcca ctgctgcccc gccggcttcc gctgcgccgc ccgcggcacc 3420
aagtgcctgc gccgcgaggc cccccgctgg gacgcccccc tgcgcgaccc cgccctgcgc 3480
cagctgctgt gacaattgtt aattaagttt aaaccctcga ggccgcaagc ttatcgataa 3540
tcaacctctg gattacaaaa tttgtgaaag attgactggt attcttaact atgttgctcc 3600
ttttacgcta tgtggatacg ctgctttaat gcctttgtat catgctattg cttcccgtat 3660
ggctttcatt ttctcctcct tgtataaatc ctggttgctg tctctttatg aggagttgtg 3720
gcccgttgtc aggcaacgtg gcgtggtgtg cactgtgttt gctgacgcaa cccccactgg 3780
ttggggcatt gccaccacct gtcagctcct ttccgggact ttcgctttcc ccctccctat 3840
tgccacggcg gaactcatcg ccgcctgcct tgcccgctgc tggacagggg ctcggctgtt 3900
gggcactgac aattccgtgg tgttgtcggg gaaatcatcg tcctttcctt ggctgctcgc 3960
ctgtgttgcc acctggattc tgcgcgggac gtccttctgc tacgtccctt cggccctcaa 4020
tccagcggac cttccttccc gcggcctgct gccggctctg cggcctcttc cgcgtcttcg 4080
ccttcgccct cagacgagtc ggatctccct ttgggccgcc tccccgcatc gataccgtcg 4140
actagagctc gctgatcagc ctcgactgtg ccttctagtt gccagccatc tgttgtttgc 4200
ccctcccccg tgccttcctt gaccctggaa ggtgccactc ccactgtcct ttcctaataa 4260
aatgaggaaa ttgcatcgca ttgtctgagt aggtgtcatt ctattctggg gggtggggtg 4320
gggcaggaca gcaaggggga ggattgggaa gacaatagca ggcatgctgg ggagagatcc 4380
acgataacaa acagcttttt tggggtgaac atattgactg aattccctgc aggttggcca 4440
ctccctctct gcgcgctcgc tcgctcactg aggccgcccg ggcaaagccc gggcgtcggg 4500
cgacctttgg tcgcccggcc tcagtgagcg agcgagcgcg cagagaggga gtggccaact 4560
ccatcactag gggttcct 4578
<210> 143
<211> 4162
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 143
ttggccactc cctctctgcg cgctcgctcg ctcactgagg ccgggcgacc aaaggtcgcc 60
cgacgcccgg gctttgcccg ggcggcctca gtgagcgagc gagcgcgcag agagggagtg 120
gccaactcca tcactagggg ttcctgctag ctctgggtat ttaagcccga gtgagcacgc 180
agggtctcca ttttgaagcg ggaggttacg cgttcgtcga ctactagtgg gtaccagagc 240
tccctaggtt ctagaaccgg tgacgtctcc catggtgaag cttggatctg aattcggtac 300
ctagttatta atagtaatca attacggggt cattagttca tagcccatat atggagttcc 360
gcgttacata acttacggta aatggcccgc ctggctgacc gcccaacgac ccccgcccat 420
tgacgtcaat aatgacgtat gttcccatag taacgccaat agggactttc cattgacgtc 480
aatgggtgga gtatttacgg taaactgccc acttggcagt acatcaagtg tatcatatgc 540
caagtacgcc ccctattgac gtcaatgacg gtaaatggcc cgcctggcat tatgcccagt 600
acatgacctt atgggacttt cctacttggc agtacatcta cgtattagtc atcgctatta 660
ccatggtcga ggtgagcccc acgttctgct tcactctccc catctccccc ccctccccac 720
ccccaatttt gtatttattt attttttaat tattttgtgc agcgatgggg gcgggggggg 780
ggggggggcg cgcgccaggc ggggcggggc ggggcgaggg gcggggcggg gcgaggcgga 840
gaggtgcggc ggcagccaat cagagcggcg cgctccgaaa gtttcctttt atggcgaggc 900
ggcggcggcg gcggccctat aaaaagcgaa gcgcgcggcg ggcgggagtc gctgcgacgc 960
tgccttcgcc ccgtgccccg ctccgccgcc gcctcgcgcc gcccgccccg gctctgactg 1020
accgcgttac tcccacaggt gagcgggcgg gacggccctt ctcctcagcg ctgtaattag 1080
cgcttggttt aatgacggct tgttggaggc ttgctgaagg ctgtatgctg ttgtctttag 1140
aaataagtgg tagtcaagtg aagccacaga tgtgactacc acttatttct aaaaggacac 1200
aaggcctgtt actagcactc acatggaaca aatggccacc gtgggaggat gacaatttct 1260
gtggctgcgt gaaagccttg aggggctccg ggagctagag cctctgctaa ccatgttcat 1320
gccttcttct ttttcctaca gctcctgggc aacgtgctgg ttattgtgct gtctcatcat 1380
tttggcaaag aattcctcga agatccgaag ggaaagtctt ccacgactgt gggatccgtt 1440
cgaagatatc accggttgag ccaccatgga attcagcagc cccagcagag aggaatgccc 1500
caagcctctg agccgggtgt caatcatggc cggatctctg acaggactgc tgctgcttca 1560
ggccgtgtct tgggcttctg gcgctagacc ttgcatcccc aagagcttcg gctacagcag 1620
cgtcgtgtgc gtgtgcaatg ccacctactg cgacagcttc gaccctccta cctttcctgc 1680
tctgggcacc ttcagcagat acgagagcac cagatccggc agacggatgg aactgagcat 1740
gggacccatc caggccaatc acacaggcac tggcctgctg ctgacactgc agcctgagca 1800
gaaattccag aaagtgaaag gcttcggcgg agccatgaca gatgccgccg ctctgaatat 1860
cctggctctg tctccaccag ctcagaacct gctgctcaag agctacttca gcgaggaagg 1920
catcggctac aacatcatca gagtgcccat ggccagctgc gacttcagca tcaggaccta 1980
cacctacgcc gacacacccg acgatttcca gctgcacaac ttcagcctgc ctgaagagga 2040
caccaagctg aagatccctc tgatccacag agccctgcag ctggcacaaa gacccgtgtc 2100
actgctggcc tctccatgga catctcccac ctggctgaaa acaaatggcg ccgtgaatgg 2160
caagggcagc ctgaaaggcc aacctggcga catctaccac cagacctggg ccagatactt 2220
cgtgaagttc ctggacgcct atgccgagca caagctgcag ttttgggccg tgacagccga 2280
gaacgaacct tctgctggac tgctgagcgg ctaccccttt cagtgcctgg gctttacacc 2340
cgagcaccag cgggacttta tcgcccgtga tctgggaccc acactggcca atagcaccca 2400
ccataatgtg cggctgctga tgctggacga ccagagactg cttctgcccc actgggctaa 2460
agtggtgctg acagatcctg aggccgccaa atacgtgcac ggaatcgccg tgcactggta 2520
tctggacttt ctggcccctg ccaaggccac actgggagag acacacagac tgttccccaa 2580
caccatgctg ttcgccagcg aagcctgtgt gggcagcaag ttttgggaac agagcgtgcg 2640
gctcggcagc tgggatagag gcatgcagta cagccacagc atcatcacca acctgctgta 2700
ccacgtcgtc ggctggaccg actggaatct ggccctgaat cctgaaggcg gccctaactg 2760
ggtccgaaac ttcgtggaca gccccatcat cgtggacatc accaaggaca ccttctacaa 2820
gcagcccatg ttctaccacc tgggacactt cagcaagttc atccccgagg gctctcagcg 2880
cgttggactg gtggcttccc agaagaacga tctggacgcc gtggctctga tgcaccctga 2940
tggatctgct gtggtggtgg tcctgaaccg cagcagcaaa gatgtgcccc tgaccatcaa 3000
ggatcccgcc gtgggattcc tggaaacaat cagccctggc tactccatcc acacctacct 3060
gtggcgtaga cagtgacaat tgttaattaa gtttaaaccc tcgaggccgc aagcttatcg 3120
ataatcaacc tctggattac aaaatttgtg aaagattgac tggtattctt aactatgttg 3180
ctccttttac gctatgtgga tacgctgctt taatgccttt gtatcatgct attgcttccc 3240
gtatggcttt cattttctcc tccttgtata aatcctggtt gctgtctctt tatgaggagt 3300
tgtggcccgt tgtcaggcaa cgtggcgtgg tgtgcactgt gtttgctgac gcaaccccca 3360
ctggttgggg cattgccacc acctgtcagc tcctttccgg gactttcgct ttccccctcc 3420
ctattgccac ggcggaactc atcgccgcct gccttgcccg ctgctggaca ggggctcggc 3480
tgttgggcac tgacaattcc gtggtgttgt cggggaaatc atcgtccttt ccttggctgc 3540
tcgcctgtgt tgccacctgg attctgcgcg ggacgtcctt ctgctacgtc ccttcggccc 3600
tcaatccagc ggaccttcct tcccgcggcc tgctgccggc tctgcggcct cttccgcgtc 3660
ttcgccttcg ccctcagacg agtcggatct ccctttgggc cgcctccccg catcgatacc 3720
gtcgactaga gctcgctgat cagcctcgac tgtgccttct agttgccagc catctgttgt 3780
ttgcccctcc cccgtgcctt ccttgaccct ggaaggtgcc actcccactg tcctttccta 3840
ataaaatgag gaaattgcat cgcattgtct gagtaggtgt cattctattc tggggggtgg 3900
ggtggggcag gacagcaagg gggaggattg ggaagacaat agcaggcatg ctggggagag 3960
atccacgata acaaacagct tttttggggc ccacatgtac actgaattcc ctgcaggttg 4020
gccactccct ctctgcgcgc tcgctcgctc actgaggccg cccgggcaaa gcccgggcgt 4080
cgggcgacct ttggtcgccc ggcctcagtg agcgagcgag cgcgcagaga gggagtggcc 4140
aactccatca ctaggggttc ct 4162
<210> 144
<211> 4606
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 144
ttggccactc cctctctgcg cgctcgctcg ctcactgagg ccgggcgacc aaaggtcgcc 60
cgacgcccgg gctttgcccg ggcggcctca gtgagcgagc gagcgcgcag agagggagtg 120
gccaactcca tcactagggg ttcctgctag ctctgggtat ttaagcccga gtgagcacgc 180
agggtctcca ttttgaagcg ggaggttacg cgttcgtcga ctactagtgg gtaccagagc 240
tccctaggtt ctagaaccgg tgacgtctcc catggtgaag cttggatctg agggcggagt 300
tagggcggag ccaatcagcg tgcgccgttc cgaaagttgc cttttatggc tgggcggaga 360
atgggcggtg aacgccgatg attatataag gacgcgccgg gtgtggcaca gctagttccg 420
tcgcagccgg gatttgggtc gcggttcttg tttgtggatc cctgtgatcg tcacttggta 480
agtcactgac tgtctatgcc tgggaaaggg tgggcaggag atggggcagt gcaggaaaag 540
tggcactatg aaccctcctg gtggcgaggg gaggggggtg gtcctcgaac gccttgcaga 600
actggcctgg atacagagtg gaccggctgg ccccatctgg aagacttcga gatacactgt 660
tgtcttactg cgctcaacag tgtatctcga agtcttccaa atggtgccag ccatcgcagc 720
ggggtgcagg aaatgggggc agcccccctt tttggctatc cttccacgtg ttcttttttg 780
tatcttttgt gtttcctaga aaacatctca gtcaccaccg cagccctagg aatgcatcta 840
gacaattgta ctaaccttct tctctttcct ctcctgacag tccggaaagc caccatgtac 900
gccctgttcc tgctggccag cctgctgggc gccgccctgg ccggccccgt gctgggcctg 960
aaggagtgca cccgcggcag cgccgtgtgg tgccagaacg tgaagaccgc cagcgactgc 1020
ggcgccgtga agcactgcct gcagaccgtg tggaacaagc ccaccgtgaa gagcctgccc 1080
tgcgacatct gcaaggacgt ggtgaccgcc gccggcgaca tgctgaagga caacgccacc 1140
gaggaggaga tcctggtgta cctggagaag acctgcgact ggctgcccaa gcccaacatg 1200
agcgccagct gcaaggagat cgtggacagc tacctgcccg tgatcctgga catcatcaag 1260
ggcgagatga gccgccccgg cgaggtgtgc agcgccctga acctgtgcga gagcctgcag 1320
aagcacctgg ccgagctgaa ccaccagaag cagctggaga gcaacaagat ccccgagctg 1380
gacatgaccg aggtggtggc ccccttcatg gccaacatcc ccctgctgct gtacccccag 1440
gacggccccc gcagcaagcc ccagcccaag gacaacggcg acgtgtgcca ggactgcatc 1500
cagatggtga ccgacatcca gaccgccgtg cgcaccaaca gcaccttcgt gcaggccctg 1560
gtggagcacg tgaaggagga gtgcgaccgc ctgggccccg gcatggccga catctgcaag 1620
aactacatca gccagtacag cgagatcgcc atccagatga tgatgcacat gcagcccaag 1680
gagatctgcg ccctggtggg cttctgcgac gaggtgaagg agatgcccat gcagaccctg 1740
gtgcccgcca aggtggccag caagaacgtg atccccgccc tggagctggt ggagcccatc 1800
aagaagcacg aggtgcccgc caagagcgac gtgtactgcg aggtgtgcga gttcctggtg 1860
aaggaggtga ccaagctgat cgacaacaac aagaccgaga aggagatcct ggacgccttc 1920
gacaagatgt gcagcaagct gcccaagagc ctgagcgagg agtgccagga ggtggtggac 1980
acctacggca gcagcatcct gagcatcctg ctggaggagg tgagccccga gctggtgtgc 2040
agcatgctgc acctgtgcag cggcacccgc ctgcccgccc tgaccgtgca cgtgacccag 2100
cccaaggacg gcggcttctg cgaggtgtgc aagaagctgg tgggctacct ggaccgcaac 2160
ctggagaaga acagcaccaa gcaggagatc ctggccgccc tggagaaggg ctgcagcttc 2220
ctgcccgacc cctaccagaa gcagtgcgac cagttcgtgg ccgagtacga gcccgtgctg 2280
atcgagatcc tggtggaggt gatggacccc agcttcgtgt gcctgaagat cggcgcctgc 2340
cccagcgccc acaagcccct gctgggcacc gagaagtgca tctggggccc cagctactgg 2400
tgccagaaca ccgagaccgc cgcccagtgc aacgccgtgg agcactgcaa gcgccacgtg 2460
tggaacagaa gaaagagagg aagtggagag ggcagaggaa gtcttctgac atgcggagac 2520
gtggaagaga atcccggccc tatggaattc agcagcccca gcagagagga atgccccaag 2580
cctctgagcc gggtgtcaat catggccgga tctctgacag gactgctgct gcttcaggcc 2640
gtgtcttggg cttctggcgc tagaccttgc atccccaaga gcttcggcta cagcagcgtc 2700
gtgtgcgtgt gcaatgccac ctactgcgac agcttcgacc ctcctacctt tcctgctctg 2760
ggcaccttca gcagatacga gagcaccaga tccggcagac ggatggaact gagcatggga 2820
cccatccagg ccaatcacac aggcactggc ctgctgctga cactgcagcc tgagcagaaa 2880
ttccagaaag tgaaaggctt cggcggagcc atgacagatg ccgccgctct gaatatcctg 2940
gctctgtctc caccagctca gaacctgctg ctcaagagct acttcagcga ggaaggcatc 3000
ggctacaaca tcatcagagt gcccatggcc agctgcgact tcagcatcag gacctacacc 3060
tacgccgaca cacccgacga tttccagctg cacaacttca gcctgcctga agaggacacc 3120
aagctgaaga tccctctgat ccacagagcc ctgcagctgg cacaaagacc cgtgtcactg 3180
ctggcctctc catggacatc tcccacctgg ctgaaaacaa atggcgccgt gaatggcaag 3240
ggcagcctga aaggccaacc tggcgacatc taccaccaga cctgggccag atacttcgtg 3300
aagttcctgg acgcctatgc cgagcacaag ctgcagtttt gggccgtgac agccgagaac 3360
gaaccttctg ctggactgct gagcggctac ccctttcagt gcctgggctt tacacccgag 3420
caccagcggg actttatcgc ccgtgatctg ggacccacac tggccaatag cacccaccat 3480
aatgtgcggc tgctgatgct ggacgaccag agactgcttc tgccccactg ggctaaagtg 3540
gtgctgacag atcctgaggc cgccaaatac gtgcacggaa tcgccgtgca ctggtatctg 3600
gactttctgg cccctgccaa ggccacactg ggagagacac acagactgtt ccccaacacc 3660
atgctgttcg ccagcgaagc ctgtgtgggc agcaagtttt gggaacagag cgtgcggctc 3720
ggcagctggg atagaggcat gcagtacagc cacagcatca tcaccaacct gctgtaccac 3780
gtcgtcggct ggaccgactg gaatctggcc ctgaatcctg aaggcggccc taactgggtc 3840
cgaaacttcg tggacagccc catcatcgtg gacatcacca aggacacctt ctacaagcag 3900
cccatgttct accacctggg acacttcagc aagttcatcc ccgagggctc tcagcgcgtt 3960
ggactggtgg cttcccagaa gaacgatctg gacgccgtgg ctctgatgca ccctgatgga 4020
tctgctgtgg tggtggtcct gaaccgcagc agcaaagatg tgcccctgac catcaaggat 4080
cccgccgtgg gattcctgga aacaatcagc cctggctact ccatccacac ctacctgtgg 4140
cgtagacagt gacaattgtt aattaagttt aaaccctcga ggccgcaagc cgcatcgata 4200
ccgtcgacta gagctcgctg atcagcctcg actgtgcctt ctagttgcca gccatctgtt 4260
gtttgcccct cccccgtgcc ttccttgacc ctggaaggtg ccactcccac tgtcctttcc 4320
taataaaatg aggaaattgc atcgcattgt ctgagtaggt gtcattctat tctggggggt 4380
ggggtggggc aggacagcaa gggggaggat tgggaagaca atagcaggca tgctggggat 4440
gtacactgaa ttccctgcag gttggccact ccctctctgc gcgctcgctc gctcactgag 4500
gccgcccggg caaagcccgg gcgtcgggcg acctttggtc gcccggcctc agtgagcgag 4560
cgagcgcgca gagagggagt ggccaactcc atcactaggg gttcct 4606
<210> 145
<211> 2216
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 145
atggctgccg atggttatct tccagattgg ctcgaggaca acctctctga gggcattcgc 60
gagtggtggg acttgaaacc tggagccccg aagcccaaag ccaaccagca aaagcaggac 120
gacggccggg gtctggtgct tcctggctac aagtacctcg gacccttcaa cggactcgac 180
aagggggagc ccgtcaacgc ggcggacgca gcggccctcg agcacgacaa ggcctacgac 240
cagcagctca aagcgggtga caatccgtac ctgcggtata accacgccga cgccgagttt 300
caggagcgtc tgcaagaaga tacgtctttt gggggcaacc tcgggcgagc agtcttccag 360
gccaagaagc gggttctcga acctctcggt ctggttgagg aaggcgctaa gacggctcct 420
ggaaagaaac gtccggtaga gcagtcgcca caagagccag actcctcctc gggcatcggc 480
aagacaggcc agcagcccgc taaaaagaga ctcaattttg gtcagactgg cgactcagag 540
tcagtccccg atccacaacc tctcggagaa cctccagcaa cccccgctgc tgtgggacct 600
actacaatgg cttcaggcgg tggcgcacca atggcagaca ataacgaagg cgccgacgga 660
gtgggtaatg cctcaggaaa ttggcattgc gattccacat ggctgggcga cagagtcatc 720
accaccagca cccgcacctg ggccttgccc acctacaata accacctcta caagcaaatc 780
tccaacggga cttcgggagg aagcaccaac gacaacacct acttcggcta cagcaccccc 840
tgggggtatt ttgactttaa cagattccac tgccacttct caccacgtga ctggcagcga 900
ctcatcaaca acaattgggg attccggccc aagagactca acttcaaact cttcaacatc 960
caagtcaagg aggtcacgac gaatgatggc gtcacaacca tcgctaataa ccttaccagc 1020
acggttcaag tcttctcgga ctcggagtac cagcttccgt acgtcctcgg ctctgcgcac 1080
cagggctgcc tccctccgtt cccggcggac gtgttcatga ttccgcaata cggctacctg 1140
acgctcaaca atggcagcca agccgtggga cgttcatcct tttactgcct ggaatatttc 1200
ccttctcaga tgctgagaac gggcaacaac tttaccttca gctacacctt tgaggaagtg 1260
cctttccaca gcagctacgc gcacagccag agcctggacc ggctgatgaa tcctctcatc 1320
gaccaatacc tgtattacct gaacagaact caaaatcagt ccggaagtgc ccaaaacaag 1380
gacttgctgt ttagccgtgg gtctccagct ggcatgtctg ttcagcccaa aaactggcta 1440
cctggaccct gttatcggca gcagcgcgtt tctaaaacaa aaacagacaa caacaacagc 1500
aattttacct ggactggtgc ttcaaaatat aacctcaatg ggcgtgaatc catcatcaac 1560
cctggcactg ctatggcctc acacaaagac gacgaagaca agttctttcc catgagcggt 1620
gtcatgattt ttggaaaaga gagcgccgga gcttcaaaca ctgcattgga caatgtcatg 1680
attacagacg aagaggaaat taaagccact aaccctgtgg ccaccgaaag atttgggacc 1740
gtggcagtca atttccagag cagcagcaca gaccctgcga ccggagatgt gcatgctatg 1800
ggagcattac ctggcatggt gtggcaagat agagacgtgt acctgcaggg tcccatttgg 1860
gccaaaattc ctcacacaga tggacacttt cacccgtctc ctcttatggg cggctttgga 1920
ctcaagaacc cgcctcctca gatcctcatc aaaaacacgc ctgttcctgc gaatcctccg 1980
gcggagtttt cagctacaaa gtttgcttca ttcatcaccc aatactccac aggacaagtg 2040
agtgtggaaa ttgaatggga gctgcagaaa gaaaacagca agcgctggaa tcccgaagtg 2100
cagtacacat ccaattatgc aaaatctgcc aacgttgatt ttactgtgga caacaatgga 2160
ctttatactg agcctcgccc cattggcacc cgttacctta cccgtcccct gtaatt 2216
<210> 146
<211> 737
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 146
Met Ala Ala Asp Gly Tyr Leu Pro Asp Trp Leu Glu Asp Asn Leu Ser
1 5 10 15
Glu Gly Ile Arg Glu Trp Trp Asp Leu Lys Pro Gly Ala Pro Lys Pro
20 25 30
Lys Ala Asn Gln Gln Lys Gln Asp Asp Gly Arg Gly Leu Val Leu Pro
35 40 45
Gly Tyr Lys Tyr Leu Gly Pro Phe Asn Gly Leu Asp Lys Gly Glu Pro
50 55 60
Val Asn Ala Ala Asp Ala Ala Ala Leu Glu His Asp Lys Ala Tyr Asp
65 70 75 80
Gln Gln Leu Lys Ala Gly Asp Asn Pro Tyr Leu Arg Tyr Asn His Ala
85 90 95
Asp Ala Glu Phe Gln Glu Arg Leu Gln Glu Asp Thr Ser Phe Gly Gly
100 105 110
Asn Leu Gly Arg Ala Val Phe Gln Ala Lys Lys Arg Val Leu Glu Pro
115 120 125
Leu Gly Leu Val Glu Glu Gly Ala Lys Thr Ala Pro Gly Lys Lys Arg
130 135 140
Pro Val Glu Gln Ser Pro Gln Glu Pro Asp Ser Ser Ser Gly Ile Gly
145 150 155 160
Lys Thr Gly Gln Gln Pro Ala Lys Lys Arg Leu Asn Phe Gly Gln Thr
165 170 175
Gly Asp Ser Glu Ser Val Pro Asp Pro Gln Pro Leu Gly Glu Pro Pro
180 185 190
Ala Thr Pro Ala Ala Val Gly Pro Thr Thr Met Ala Ser Gly Gly Gly
195 200 205
Ala Pro Met Ala Asp Asn Asn Glu Gly Ala Asp Gly Val Gly Asn Ala
210 215 220
Ser Gly Asn Trp His Cys Asp Ser Thr Trp Leu Gly Asp Arg Val Ile
225 230 235 240
Thr Thr Ser Thr Arg Thr Trp Ala Leu Pro Thr Tyr Asn Asn His Leu
245 250 255
Tyr Lys Gln Ile Ser Asn Gly Thr Ser Gly Gly Ser Thr Asn Asp Asn
260 265 270
Thr Tyr Phe Gly Tyr Ser Thr Pro Trp Gly Tyr Phe Asp Phe Asn Arg
275 280 285
Phe His Cys His Phe Ser Pro Arg Asp Trp Gln Arg Leu Ile Asn Asn
290 295 300
Asn Trp Gly Phe Arg Pro Lys Arg Leu Asn Phe Lys Leu Phe Asn Ile
305 310 315 320
Gln Val Lys Glu Val Thr Thr Asn Asp Gly Val Thr Thr Ile Ala Asn
325 330 335
Asn Leu Thr Ser Thr Val Gln Val Phe Ser Asp Ser Glu Tyr Gln Leu
340 345 350
Pro Tyr Val Leu Gly Ser Ala His Gln Gly Cys Leu Pro Pro Phe Pro
355 360 365
Ala Asp Val Phe Met Ile Pro Gln Tyr Gly Tyr Leu Thr Leu Asn Asn
370 375 380
Gly Ser Gln Ala Val Gly Arg Ser Ser Phe Tyr Cys Leu Glu Tyr Phe
385 390 395 400
Pro Ser Gln Met Leu Arg Thr Gly Asn Asn Phe Thr Phe Ser Tyr Thr
405 410 415
Phe Glu Glu Val Pro Phe His Ser Ser Tyr Ala His Ser Gln Ser Leu
420 425 430
Asp Arg Leu Met Asn Pro Leu Ile Asp Gln Tyr Leu Tyr Tyr Leu Asn
435 440 445
Arg Thr Gln Asn Gln Ser Gly Ser Ala Gln Asn Lys Asp Leu Leu Phe
450 455 460
Ser Arg Gly Ser Pro Ala Gly Met Ser Val Gln Pro Lys Asn Trp Leu
465 470 475 480
Pro Gly Pro Cys Tyr Arg Gln Gln Arg Val Ser Lys Thr Lys Thr Asp
485 490 495
Asn Asn Asn Ser Asn Phe Thr Trp Thr Gly Ala Ser Lys Tyr Asn Leu
500 505 510
Asn Gly Arg Glu Ser Ile Ile Asn Pro Gly Thr Ala Met Ala Ser His
515 520 525
Lys Asp Asp Glu Asp Lys Phe Phe Pro Met Ser Gly Val Met Ile Phe
530 535 540
Gly Lys Glu Ser Ala Gly Ala Ser Asn Thr Ala Leu Asp Asn Val Met
545 550 555 560
Ile Thr Asp Glu Glu Glu Ile Lys Ala Thr Asn Pro Val Ala Thr Glu
565 570 575
Arg Phe Gly Thr Val Ala Val Asn Phe Gln Ser Ser Ser Thr Asp Pro
580 585 590
Ala Thr Gly Asp Val His Ala Met Gly Ala Leu Pro Gly Met Val Trp
595 600 605
Gln Asp Arg Asp Val Tyr Leu Gln Gly Pro Ile Trp Ala Lys Ile Pro
610 615 620
His Thr Asp Gly His Phe His Pro Ser Pro Leu Met Gly Gly Phe Gly
625 630 635 640
Leu Lys Asn Pro Pro Pro Gln Ile Leu Ile Lys Asn Thr Pro Val Pro
645 650 655
Ala Asn Pro Pro Ala Glu Phe Ser Ala Thr Lys Phe Ala Ser Phe Ile
660 665 670
Thr Gln Tyr Ser Thr Gly Gln Val Ser Val Glu Ile Glu Trp Glu Leu
675 680 685
Gln Lys Glu Asn Ser Lys Arg Trp Asn Pro Glu Val Gln Tyr Thr Ser
690 695 700
Asn Tyr Ala Lys Ser Ala Asn Val Asp Phe Thr Val Asp Asn Asn Gly
705 710 715 720
Leu Tyr Thr Glu Pro Arg Pro Ile Gly Thr Arg Tyr Leu Thr Arg Pro
725 730 735
Leu
<210> 147
<211> 736
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 147
Met Ala Ala Asp Gly Tyr Leu Pro Asp Trp Leu Glu Asp Asn Leu Ser
1 5 10 15
Glu Gly Ile Arg Glu Trp Trp Asp Leu Lys Pro Gly Ala Pro Lys Pro
20 25 30
Lys Ala Asn Gln Gln Lys Gln Asp Asp Gly Arg Gly Leu Val Leu Pro
35 40 45
Gly Tyr Lys Tyr Leu Gly Pro Phe Asn Gly Leu Asp Lys Gly Glu Pro
50 55 60
Val Asn Ala Ala Asp Ala Ala Ala Leu Glu His Asp Lys Ala Tyr Asp
65 70 75 80
Gln Gln Leu Lys Ala Gly Asp Asn Pro Tyr Leu Arg Tyr Asn His Ala
85 90 95
Asp Ala Glu Phe Gln Glu Arg Leu Gln Glu Asp Thr Ser Phe Gly Gly
100 105 110
Asn Leu Gly Arg Ala Val Phe Gln Ala Lys Lys Arg Val Leu Glu Pro
115 120 125
Phe Gly Leu Val Glu Glu Gly Ala Lys Thr Ala Pro Gly Lys Lys Arg
130 135 140
Pro Val Glu Gln Ser Pro Gln Glu Pro Asp Ser Ser Ser Gly Ile Gly
145 150 155 160
Lys Thr Gly Gln Gln Pro Ala Lys Lys Arg Leu Asn Phe Gly Gln Thr
165 170 175
Gly Asp Ser Glu Ser Val Pro Asp Pro Gln Pro Leu Gly Glu Pro Pro
180 185 190
Ala Thr Pro Ala Ala Val Gly Pro Thr Thr Met Ala Ser Gly Gly Gly
195 200 205
Ala Pro Met Ala Asp Asn Asn Glu Gly Ala Asp Gly Val Gly Asn Ala
210 215 220
Ser Gly Asn Trp His Cys Asp Ser Thr Trp Leu Gly Asp Arg Val Ile
225 230 235 240
Thr Thr Ser Thr Arg Thr Trp Ala Leu Pro Thr Tyr Asn Asn His Leu
245 250 255
Tyr Lys Gln Ile Ser Ser Ala Ser Thr Gly Ala Ser Asn Asp Asn His
260 265 270
Tyr Phe Gly Tyr Ser Thr Pro Trp Gly Tyr Phe Asp Phe Asn Arg Phe
275 280 285
His Cys His Phe Ser Pro Arg Asp Trp Gln Arg Leu Ile Asn Asn Asn
290 295 300
Trp Gly Phe Arg Pro Lys Arg Leu Asn Phe Lys Leu Phe Asn Ile Gln
305 310 315 320
Val Lys Glu Val Thr Thr Asn Asp Gly Val Thr Thr Ile Ala Asn Asn
325 330 335
Leu Thr Ser Thr Val Gln Val Phe Ser Asp Ser Glu Tyr Gln Leu Pro
340 345 350
Tyr Val Leu Gly Ser Ala His Gln Gly Cys Leu Pro Pro Phe Pro Ala
355 360 365
Asp Val Phe Met Ile Pro Gln Tyr Gly Tyr Leu Thr Leu Asn Asn Gly
370 375 380
Ser Gln Ala Val Gly Arg Ser Ser Phe Tyr Cys Leu Glu Tyr Phe Pro
385 390 395 400
Ser Gln Met Leu Arg Thr Gly Asn Asn Phe Thr Phe Ser Tyr Thr Phe
405 410 415
Glu Asp Val Pro Phe His Ser Ser Tyr Ala His Ser Gln Ser Leu Asp
420 425 430
Arg Leu Met Asn Pro Leu Ile Asp Gln Tyr Leu Tyr Tyr Leu Asn Arg
435 440 445
Thr Gln Asn Gln Ser Gly Ser Ala Gln Asn Lys Asp Leu Leu Phe Ser
450 455 460
Arg Gly Ser Pro Ala Gly Met Ser Val Gln Pro Lys Asn Trp Leu Pro
465 470 475 480
Gly Pro Cys Tyr Arg Gln Gln Arg Val Ser Lys Thr Lys Thr Asp Asn
485 490 495
Asn Asn Ser Asn Phe Thr Trp Thr Gly Ala Ser Lys Tyr Asn Leu Asn
500 505 510
Gly Arg Glu Ser Ile Ile Asn Pro Gly Thr Ala Met Ala Ser His Lys
515 520 525
Asp Asp Glu Asp Lys Phe Phe Pro Met Ser Gly Val Met Ile Phe Gly
530 535 540
Lys Glu Ser Ala Gly Ala Ser Asn Thr Ala Leu Asp Asn Val Met Ile
545 550 555 560
Thr Asp Glu Glu Glu Ile Lys Ala Thr Asn Pro Val Ala Thr Glu Arg
565 570 575
Phe Gly Thr Val Ala Val Asn Leu Gln Ser Ser Ser Thr Asp Pro Ala
580 585 590
Thr Gly Asp Val His Val Met Gly Ala Leu Pro Gly Met Val Trp Gln
595 600 605
Asp Arg Asp Val Tyr Leu Gln Gly Pro Ile Trp Ala Lys Ile Pro His
610 615 620
Thr Asp Gly His Phe His Pro Ser Pro Leu Met Gly Gly Phe Gly Leu
625 630 635 640
Lys His Pro Pro Pro Gln Ile Leu Ile Lys Asn Thr Pro Val Pro Ala
645 650 655
Asn Pro Pro Ala Glu Phe Ser Ala Thr Lys Phe Ala Ser Phe Ile Thr
660 665 670
Gln Tyr Ser Thr Gly Gln Val Ser Val Glu Ile Glu Trp Glu Leu Gln
675 680 685
Lys Glu Asn Ser Lys Arg Trp Asn Pro Glu Val Gln Tyr Thr Ser Asn
690 695 700
Tyr Ala Lys Ser Ala Asn Val Asp Phe Thr Val Asp Asn Asn Gly Leu
705 710 715 720
Tyr Thr Glu Pro Arg Pro Ile Gly Thr Arg Tyr Leu Thr Arg Pro Leu
725 730 735
<210> 148
<211> 736
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 148
Met Ala Ala Asp Gly Tyr Leu Pro Asp Trp Leu Glu Asp Asn Leu Ser
1 5 10 15
Glu Gly Ile Arg Glu Trp Trp Asp Leu Lys Pro Gly Ala Pro Lys Pro
20 25 30
Lys Ala Asn Gln Gln Lys Gln Asp Asp Gly Arg Gly Leu Val Leu Pro
35 40 45
Gly Tyr Lys Tyr Leu Gly Pro Phe Asn Gly Leu Asp Lys Gly Glu Pro
50 55 60
Val Asn Ala Ala Asp Ala Ala Ala Leu Glu His Asp Lys Ala Tyr Asp
65 70 75 80
Gln Gln Leu Lys Ala Gly Asp Asn Pro Tyr Leu Arg Tyr Asn His Ala
85 90 95
Asp Ala Glu Phe Gln Glu Arg Leu Gln Glu Asp Thr Ser Phe Gly Gly
100 105 110
Asn Leu Gly Arg Ala Val Phe Gln Ala Lys Lys Arg Val Leu Glu Pro
115 120 125
Phe Gly Leu Val Glu Glu Gly Ala Lys Thr Ala Pro Gly Lys Lys Arg
130 135 140
Pro Val Glu Gln Ser Pro Gln Glu Pro Asp Ser Ser Ser Gly Ile Gly
145 150 155 160
Lys Thr Gly Gln Gln Pro Ala Lys Lys Arg Leu Asn Phe Gly Gln Thr
165 170 175
Gly Asp Ser Glu Ser Val Pro Asp Pro Gln Pro Leu Gly Glu Pro Pro
180 185 190
Ala Thr Pro Ala Ala Val Gly Pro Thr Thr Met Ala Ser Gly Gly Gly
195 200 205
Ala Pro Met Ala Asp Asn Asn Glu Gly Ala Asp Gly Val Gly Asn Ala
210 215 220
Ser Gly Asn Trp His Cys Asp Ser Thr Trp Leu Gly Asp Arg Val Ile
225 230 235 240
Thr Thr Ser Thr Arg Thr Trp Ala Leu Pro Thr Tyr Asn Asn His Leu
245 250 255
Tyr Lys Gln Ile Ser Ser Ala Ser Thr Gly Ala Ser Asn Asp Asn His
260 265 270
Tyr Phe Gly Tyr Ser Thr Pro Trp Gly Tyr Phe Asp Phe Asn Arg Phe
275 280 285
His Cys His Phe Ser Pro Arg Asp Trp Gln Arg Leu Ile Asn Asn Asn
290 295 300
Trp Gly Phe Arg Pro Lys Arg Leu Asn Phe Lys Leu Phe Asn Ile Gln
305 310 315 320
Val Lys Glu Val Thr Thr Asn Asp Gly Val Thr Thr Ile Ala Asn Asn
325 330 335
Leu Thr Ser Thr Val Gln Val Phe Ser Asp Ser Glu Tyr Gln Leu Pro
340 345 350
Tyr Val Leu Gly Ser Ala His Gln Gly Cys Leu Pro Pro Phe Pro Ala
355 360 365
Asp Val Phe Met Ile Pro Gln Tyr Gly Tyr Leu Thr Leu Asn Asn Gly
370 375 380
Ser Gln Ala Val Gly Arg Ser Ser Phe Tyr Cys Leu Glu Tyr Phe Pro
385 390 395 400
Ser Gln Met Leu Arg Thr Gly Asn Asn Phe Thr Phe Ser Tyr Thr Phe
405 410 415
Glu Asp Val Pro Phe His Ser Ser Tyr Ala His Ser Gln Ser Leu Asp
420 425 430
Arg Leu Met Asn Pro Leu Ile Asp Gln Tyr Leu Tyr Tyr Leu Asn Arg
435 440 445
Thr Gln Asn Gln Ser Gly Ser Ala Gln Asn Lys Asp Leu Leu Phe Ser
450 455 460
Arg Gly Ser Pro Ala Gly Met Ser Val Gln Pro Lys Asn Trp Leu Pro
465 470 475 480
Gly Pro Cys Tyr Arg Gln Gln Arg Val Ser Lys Val Lys Thr Asp Asn
485 490 495
Asn Asn Ser Asn Phe Thr Trp Thr Gly Ala Ser Lys Tyr Asn Leu Asn
500 505 510
Gly Arg Glu Ser Ile Ile Asn Pro Gly Thr Ala Met Ala Ser His Lys
515 520 525
Asp Asp Glu Asp Lys Phe Phe Pro Met Ser Gly Val Met Ile Phe Gly
530 535 540
Lys Glu Ser Ala Gly Ala Ser Asn Thr Ala Leu Asp Asn Val Met Ile
545 550 555 560
Thr Asp Glu Glu Glu Ile Lys Ala Thr Asn Pro Val Ala Thr Glu Arg
565 570 575
Phe Gly Thr Val Ala Val Asn Leu Gln Ser Ser Ser Thr Asp Pro Ala
580 585 590
Thr Gly Asp Val His Val Met Gly Ala Leu Pro Gly Met Val Trp Gln
595 600 605
Asp Arg Asp Val Tyr Leu Gln Gly Pro Ile Trp Ala Lys Ile Pro His
610 615 620
Thr Asp Gly His Phe His Pro Ser Pro Leu Met Gly Gly Phe Gly Leu
625 630 635 640
Lys His Pro Pro Pro Gln Ile Leu Ile Lys Asn Thr Pro Val Pro Ala
645 650 655
Asn Pro Pro Ala Glu Phe Ser Ala Thr Lys Phe Ala Ser Phe Ile Thr
660 665 670
Gln Tyr Ser Thr Gly Gln Val Ser Val Glu Ile Glu Trp Glu Leu Gln
675 680 685
Lys Glu Asn Ser Lys Arg Trp Asn Pro Glu Val Gln Tyr Thr Ser Asn
690 695 700
Phe Ala Lys Ser Ala Asn Val Asp Phe Thr Val Asp Asn Asn Gly Leu
705 710 715 720
Tyr Thr Glu Pro Arg Pro Ile Gly Thr Arg Phe Leu Thr Arg Pro Leu
725 730 735
<210> 149
<211> 736
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 149
Met Ala Ala Asp Gly Tyr Leu Pro Asp Trp Leu Glu Asp Asn Leu Ser
1 5 10 15
Glu Gly Ile Arg Glu Trp Trp Asp Leu Lys Pro Gly Ala Pro Lys Pro
20 25 30
Lys Ala Asn Gln Gln Lys Gln Asp Asp Gly Arg Gly Leu Val Leu Pro
35 40 45
Gly Tyr Lys Tyr Leu Gly Pro Phe Asn Gly Leu Asp Lys Gly Glu Pro
50 55 60
Val Asn Ala Ala Asp Ala Ala Ala Leu Glu His Asp Lys Ala Tyr Asp
65 70 75 80
Gln Gln Leu Lys Ala Gly Asp Asn Pro Tyr Leu Arg Tyr Asn His Ala
85 90 95
Asp Ala Glu Phe Gln Glu Arg Leu Gln Glu Asp Thr Ser Phe Gly Gly
100 105 110
Asn Leu Gly Arg Ala Val Phe Gln Ala Lys Lys Arg Val Leu Glu Pro
115 120 125
Leu Gly Leu Val Glu Glu Gly Ala Lys Thr Ala Pro Gly Lys Lys Arg
130 135 140
Pro Val Glu Gln Ser Pro Gln Glu Pro Asp Ser Ser Ser Gly Ile Gly
145 150 155 160
Lys Thr Gly Gln Gln Pro Ala Lys Lys Arg Leu Asn Phe Gly Gln Thr
165 170 175
Gly Asp Ser Glu Ser Val Pro Asp Pro Gln Pro Leu Gly Glu Pro Pro
180 185 190
Ala Ala Pro Ser Gly Leu Gly Pro Asn Thr Met Ala Ser Gly Gly Gly
195 200 205
Ala Pro Met Ala Asp Asn Asn Glu Gly Ala Asp Gly Val Gly Asn Ser
210 215 220
Ser Gly Asn Trp His Cys Asp Ser Thr Trp Leu Gly Asp Arg Val Ile
225 230 235 240
Thr Thr Ser Thr Arg Thr Trp Ala Leu Pro Thr Tyr Asn Asn His Leu
245 250 255
Tyr Lys Gln Ile Ser Asn Gly Thr Ser Gly Gly Ser Thr Asn Asp Asn
260 265 270
Thr Tyr Phe Gly Tyr Ser Thr Pro Trp Gly Tyr Phe Asp Phe Asn Arg
275 280 285
Phe His Cys His Phe Ser Pro Arg Asp Trp Gln Arg Leu Ile Asn Asn
290 295 300
Asn Trp Gly Phe Arg Pro Lys Arg Leu Asn Phe Lys Leu Phe Asn Ile
305 310 315 320
Gln Val Lys Glu Val Thr Thr Asn Glu Gly Thr Lys Thr Ile Ala Asn
325 330 335
Asn Leu Thr Ser Thr Val Gln Val Phe Thr Asp Ser Glu Tyr Gln Leu
340 345 350
Pro Tyr Val Leu Gly Ser Ala His Gln Gly Cys Leu Pro Pro Phe Pro
355 360 365
Ala Asp Val Phe Met Val Pro Gln Tyr Gly Tyr Leu Thr Leu Asn Asn
370 375 380
Gly Ser Gln Ala Leu Gly Arg Ser Ser Phe Tyr Cys Leu Glu Tyr Phe
385 390 395 400
Pro Ser Gln Met Leu Arg Thr Gly Asn Asn Phe Gln Phe Ser Tyr Thr
405 410 415
Phe Glu Asp Val Pro Phe His Ser Ser Tyr Ala His Ser Gln Ser Leu
420 425 430
Asp Arg Leu Met Asn Pro Leu Ile Asp Gln Tyr Leu Tyr Tyr Leu Val
435 440 445
Arg Thr Gln Thr Thr Gly Thr Gly Gly Thr Gln Thr Leu Ala Phe Ser
450 455 460
Gln Ala Gly Pro Ser Ser Met Ala Asn Gln Ala Arg Asn Trp Val Pro
465 470 475 480
Gly Pro Cys Tyr Arg Gln Gln Arg Val Ser Thr Val Thr Asn Gln Asn
485 490 495
Asn Asn Ser Asn Phe Ala Trp Thr Gly Ala Ala Lys Phe Lys Leu Asn
500 505 510
Gly Arg Asp Ser Leu Met Asn Pro Gly Val Ala Met Ala Ser His Lys
515 520 525
Asp Asp Asp Asp Arg Phe Phe Pro Ser Ser Gly Val Leu Ile Phe Gly
530 535 540
Lys Gln Gly Ala Gly Asn Asp Gly Val Asp Tyr Ser Gln Val Leu Ile
545 550 555 560
Thr Asp Glu Glu Glu Ile Lys Ala Thr Asn Pro Val Ala Thr Glu Glu
565 570 575
Tyr Gly Ala Val Ala Ile Asn Asn Gln Ala Ala Asn Thr Gln Ala Gln
580 585 590
Thr Gly Leu Val His Asn Gln Gly Val Ile Pro Gly Met Val Trp Gln
595 600 605
Asn Arg Asp Val Tyr Leu Gln Gly Pro Ile Trp Ala Lys Ile Pro His
610 615 620
Thr Asp Gly Asn Phe His Pro Ser Pro Leu Met Gly Gly Phe Gly Leu
625 630 635 640
Lys His Pro Pro Pro Gln Ile Leu Ile Lys Asn Thr Pro Val Pro Ala
645 650 655
Asp Pro Pro Leu Thr Phe Asn Gln Ala Lys Leu Asn Ser Phe Ile Thr
660 665 670
Gln Tyr Ser Thr Gly Gln Val Ser Val Glu Ile Glu Trp Glu Leu Gln
675 680 685
Lys Glu Asn Ser Lys Arg Trp Asn Pro Glu Ile Gln Tyr Thr Ser Asn
690 695 700
Phe Tyr Lys Ser Thr Asn Val Asp Phe Ala Val Asn Thr Glu Gly Val
705 710 715 720
Tyr Ser Glu Pro Arg Pro Ile Gly Thr Arg Phe Leu Thr Arg Asn Leu
725 730 735
SEQUENCE LISTING
<110> Prevail Therapeutics, Inc.
<120> GENE THERAPIES FOR LYSOSOMAL DISORDERS
<130> P1094.70012WO00
<140> Not Yet Assigned
<141> Concurrently Herewith
<150> US 62/990,246
<151> 2020-03-16
<150> US 62/988,665
<151> 2020-03-12
<150> US 62/960,471
<151> 2020-01-13
<150> US 62/954,089
<151> 2019-12-27
<150> US 62/934,450
<151> 2019-11-12
<150> US 62/832,223
<151> 2019-04-10
<150> US 62/831,840
<151> 2019-04-10
<150> US 62/831,846
<151> 2019-04-10
<150> US 62/831,856
<151> 2019-04-10
<160> 149
<170> PatentIn version 3.5
<210> 1
<211> 10697
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 1
ttggccactc cctctctgcg cgctcgctcg ctcactgagg ccgggcgacc aaaggtcgcc 60
cgacgcccgg gctttgcccg ggcggcctca gtgagcgagc gagcgcgcag agagggagtg 120
gccaactcca tcactagggg ttcctgctag ctctgggtat ttaagcccga gtgagcacgc 180
agggtctcca ttttgaagcg ggaggttacg cgttcgtcga ctactagtgg gtaccagagc 240
tccctaggtt ctagaaccgg tgacgtctcc catggtgaag cttggatctg aattcggtac 300
cctagttatt aatagtaatc aattacgggg tcattagttc atagcccata tatggagttc 360
cgcgttacat aacttacggt aaatggcccg cctggctgac cgcccaacga cccccgccca 420
ttgacgtcaa taatgacgta tgttcccata gtaacgccaa tagggacttt ccattgacgt 480
caatgggtgg actatttacg gtaaactgcc cacttggcag tacatcaagt gtatcatatg 540
ccaagtacgc cccctattga cgtcaatgac ggtaaatggc ccgcctggca ttatgcccag 600
tacatgacct tatgggactt tcctacttgg cagtacatct acgtattagt catcgctatt 660
accatggtcg aggtgagccc cacgttctgc ttcactctcc ccatctcccc cccctcccca 720
cccccaattt tgtatttatt tattttttaa ttattttgtg cagcgatggg ggcggggggg 780
gggggggggc gcgcgccagg cggggcgggg cggggcgagg ggcggggcgg ggcgaggcgg 840
agaggtgcgg cggcagccaa tcagagcggc gcgctccgaa agtttccttt tatggcgagg 900
cggcggcggc ggcggcccta taaaaagcga agcgcgcggc gggcgggagt cgctgcgacg 960
ctgccttcgc cccgtgcccc gctccgccgc cgcctcgcgc cgcccgcccc ggctctgact 1020
gaccgcgtta ctcccacagg tgagcgggcg ggacggccct tctcctccgg gctgtaatta 1080
gcgcttggtt taatgacggc ttgttttctg tggctgcgtg aaagccttga ggggctccgg 1140
gagctagagc ctctgctaac catgttcatg ccttcttctt tttcctacag ctcctgggca 1200
acgtgctggt tattgtgctg tctcatcatt ttggcaaaga attcctcgaa gatccgaagg 1260
gaaagtcttc cacgactgtg ggatccgttc gaagatatca ccggttgagc caccatggaa 1320
ttcagcagcc ccagcagaga ggaatgcccc aagcctctga gccgggtgtc aatcatggcc 1380
ggatctctga caggactgct gctgcttcag gccgtgtctt gggcttctgg cgctagacct 1440
tgcatcccca agagcttcgg ctacagcagc gtcgtgtgcg tgtgcaatgc cacctactgc 1500
gacagcttcg accctcctac ctttcctgct ctgggcacct tcagcagata cgagagcacc 1560
agatccggca gacggatgga actgagcatg ggacccatcc aggccaatca cacaggcact 1620
ggcctgctgc tgacactgca gcctgagcag aaattccaga aagtgaaagg cttcggcgga 1680
gccatgacag atgccgccgc tctgaatatc ctggctctgt ctccaccagc tcagaacctg 1740
ctgctcaaga gctacttcag cgaggaaggc atcggctaca acatcatcag agtgcccatg 1800
gccagctgcg acttcagcat caggacctac acctacgccg acacacccga cgatttccag 1860
ctgcacaact tcagcctgcc tgaagaggac accaagctga agatccctct gatccacaga 1920
gccctgcagc tggcacaaag acccgtgtca ctgctggcct ctccatggac atctcccacc 1980
tggctgaaaa caaatggcgc cgtgaatggc aagggcagcc tgaaaggcca acctggcgac 2040
atctaccacc agacctgggc cagatacttc gtgaagttcc tggacgccta tgccgagcac 2100
aagctgcagt tttgggccgt gacagccgag aacgaacctt ctgctggact gctgagcggc 2160
tacccctttc agtgcctggg ctttacaccc gagcaccagc gggactttat cgcccgtgat 2220
ctgggaccca cactggccaa tagcacccac cataatgtgc ggctgctgat gctggacgac 2280
cagagactgc ttctgcccca ctgggctaaa gtggtgctga cagatcctga ggccgccaaa 2340
tacgtgcacg gaatcgccgt gcactggtat ctggactttc tggcccctgc caaggccaca 2400
ctgggagaga cacacagact gttccccaac accatgctgt tcgccagcga agcctgtgtg 2460
ggcagcaagt tttgggaaca gagcgtgcgg ctcggcagct gggatagagg catgcagtac 2520
agccacagca tcatcaccaa cctgctgtac cacgtcgtcg gctggaccga ctggaatctg 2580
gccctgaatc ctgaaggcgg ccctaactgg gtccgaaact tcgtggacag ccccatcatc 2640
gtggacatca ccaaggacac cttctacaag cagcccatgt tctaccacct gggacacttc 2700
agcaagttca tccccgaggg ctctcagcgc gttggactgg tggcttccca gaagaacgat 2760
ctggacgccg tggctctgat gcaccctgat ggatctgctg tggtggtggt cctgaaccgc 2820
agcagcaaag atgtgcccct gaccatcaag gatcccgccg tgggattcct ggaaacaatc 2880
agccctggct actccatcca cacctacctg tggcgtagac agtgacaatt gttaattaag 2940
tttaaaccct cgaggccgca agcttatcga taatcaacct ctggattaca aaatttgtga 3000
aagattgact ggtattctta actatgttgc tccttttacg ctatgtggat acgctgcttt 3060
aatgcctttg tatcatgcta ttgcttcccg tatggctttc attttctcct ccttgtataa 3120
atcctggttg ctgtctcttt atgaggagtt gtggcccgtt gtcaggcaac gtggcgtggt 3180
gtgcactgtg tttgctgacg caacccccac tggttggggc attgccacca cctgtcagct 3240
cctttccggg actttcgctt tccccctccc tattgccacg gcggaactca tcgccgcctg 3300
ccttgcccgc tgctggacag gggctcggct gttgggcact gacaattccg tggtgttgtc 3360
ggggaaatca tcgtcctttc cttggctgct cgcctgtgtt gccacctgga ttctgcgcgg 3420
gacgtccttc tgctacgtcc cttcggccct caatccagcg gaccttcctt cccgcggcct 3480
gctgccggct ctgcggcctc ttccgcgtct tcgccttcgc cctcagacga gtcggatctc 3540
cctttgggcc gcctccccgc atcgataccg tcgactagag ctcgctgatc agcctcgact 3600
gtgccttcta gttgccagcc atctgttgtt tgcccctccc ccgtgccttc cttgaccctg 3660
gaaggtgcca ctcccactgt cctttcctaa taaaatgagg aaattgcatc gcattgtctg 3720
agtaggtgtc attctattct ggggggtggg gtggggcagg acagcaaggg ggaggattgg 3780
gaagacaata gcaggcatgc tggggagaga tccacgataa caaacagctt ttttggggtg 3840
aacatattga ctgaattccc tgcaggttgg ccactccctc tctgcgcgct cgctcgctca 3900
ctgaggccgc ccgggcaaag cccgggcgtc gggcgacctt tggtcgcccg gcctcagtga 3960
gcgagcgagc gcgcagagag ggagtggcca actccatcac taggggttcc tgcggccgct 4020
cgtacggtct cgaggaattc ctgcaggata acttgccaac ctcattctaa aatgtatata 4080
gaagcccaaa agacaataac aaaaatattc ttgtagaaca aaatgggaaa gaatgttcca 4140
ctaaatatca agatttagag caaagcatga gatgtgtggg gatagacagt gaggctgata 4200
aaatagagta gagctcagaa acagacccat tgatatatgt aagtgaccta tgaaaaaaat 4260
atggcatttt acaatgggaa aatgatggtc tttttctttt ttagaaaaac agggaaatat 4320
atttatatgt aaaaaataaa agggaaccca tatgtcatac catacacaca aaaaaattcc 4380
agtgaattat aagtctaaat ggagaaggca aaactttaaa tcttttagaa aataatatag 4440
aagcatgcag accagcctgg ccaacatgat gaaaccctct ctactaataa taaaatcagt 4500
agaactactc aggactactt tgagtgggaa gtccttttct atgaagactt ctttggccaa 4560
aattaggctc taaatgcaag gagatagtgc atcatgcctg gctgcactta ctgataaatg 4620
atgttatcac catctttaac caaatgcaca ggaacaagtt atggtactga tgtgctggat 4680
tgagaaggag ctctacttcc ttgacaggac acatttgtat caacttaaaa aagcagattt 4740
ttgccagcag aactattcat tcagaggtag gaaacttaga atagatgatg tcactgatta 4800
gcatggcttc cccatctcca cagctgcttc ccacccaggt tgcccacagt tgagtttgtc 4860
cagtgctcag ggctgcccac tctcagtaag aagccccaca ccagcccctc tccaaatatg 4920
ttggctgttc cttccattaa agtgacccca ctttagagca gcaagtggat ttctgtttct 4980
tacagttcag gaaggaggag tcagctgtga gaacctggag cctgagatgc ttctaagtcc 5040
cactgctact ggggtcaggg aagccagact ccagcatcag cagtcaggag cactaagccc 5100
ttgccaacat cctgtttctc agagaaactg cttccattat aatggttgtc cttttttaag 5160
ctatcaagcc aaacaaccag tgtctaccat tattctcatc acctgaagcc aagggttcta 5220
gcaaaagtca agctgtcttg taatggttga tgtgcctcca gcttctgtct tcagtcactc 5280
cactcttagc ctgctctgaa tcaactctga ccacagttcc ctggagcccc tgccacctgc 5340
tgcccctgcc accttctcca tctgcagtgc tgtgcagcct tctgcactct tgcagagcta 5400
ataggtggag acttgaagga agaggaggaa agtttctcat aatagccttg ctgcaagctc 5460
aaatgggagg tgggcactgt gcccaggagc cttggagcaa aggctgtgcc caacctctga 5520
ctgcatccag gtttggtctt gacagagata agaagccctg gcttttggag ccaaaatcta 5580
ggtcagactt aggcaggatt ctcaaagttt atcagcagaa catgaggcag aagacccttt 5640
ctgctccagc ttcttcaggc tcaaccttca tcagaataga tagaaagaga ggctgtgagg 5700
gttcttaaaa cagaagcaaa tctgactcag agaataaaca acctcctagt aaactacagc 5760
ttagacagag catctggtgg tgagtgtgct cagtgtccta ctcaactgtc tggtatcagc 5820
cctcatgagg acttctcttc tttccctcat agacctccat ctctgttttc cttagcctgc 5880
agaaatctgg atggctattc acagaatgcc tgtgctttca gagttgcatt ttttctctgg 5940
tattctggtt caagcatttg aaggtaggaa aggttctcca agtgcaagaa agccagccct 6000
gagcctcaac tgcctggcta gtgtggtcag taggatgcaa aggctgttga atgccacaag 6060
gccaaacttt aacctgtgta ccacaagcct agcagcagag gcagctctgc tcactggaac 6120
tctctgtctt ctttctcctg agccttttct tttcctgagt tttctagctc tcctcaacct 6180
tacctctgcc ctacccagga caaacccaag agccactgtt tctgtgatgt cctctccagc 6240
cctaattag catcatgact tcagcctgac cttccatgct cagaagcagt gctaatccac 6300
ttcagatgag ctgctctatg caacacaggc agagcctaca aacctttgca ccagagccct 6360
ccacatatca gtgtttgttc atactcactt caacagcaaa tgtgactgct gagattaaga 6420
ttttacacaa gatggtctgt aatttcacag ttagttttat cccattaggt atgaaagaat 6480
tagcataatt ccccttaaac atgaatgaat cttagatttt ttaataaata gttttggaag 6540
taaagacaga gacatcagga gcacaaggaa tagcctgaga ggacaaacag aacaagaaag 6600
agtctggaaa tacacaggat gttcttggcc tcctcaaagc aagtgcaagc agatagtacc 6660
agcagcccca ggctatcaga gcccagtgaa gagaagtacc atgaaagcca cagctctaac 6720
caccctgttc cagagtgaca gacagtcccc aagacaagcc agcctgagcc agagagagaa 6780
ctgcaagaga aagtttctaa tttaggttct gttagattca gacaagtgca ggtcatcctc 6840
tctccacagc tactcacctc tccagcctaa caaagcctgc agtccacact ccaaccctgg 6900
tgtctcacct cctagcctct cccaacatcc tgctctctga ccatcttctg catctctcat 6960
ctcaccatct cccactgtct acagcctact cttgcaacta ccatctcatt ttctgacatc 7020
ctgtctacat cttctgccat actctgccat ctaccatacc acctcttacc atctaccaca 7080
ccatctttta tctccatccc tctcagaagc ctccaagctg aatcctgctt tatgtgttca 7140
tctcagcccc tgcatggaaa gctgacccca gaggcagaac tattcccaga gagcttggcc 7200
aagaaaaaca aaactaccag cctggccagg ctcaggagta gtaagctgca gtgtctgttg 7260
tgttctagct tcaacagctg caggagttcc actctcaaat gctccacatt tctcacatcc 7320
tcctgattct ggtcactacc catcttcaaa gaacagaata tctcacatca gcatactgtg 7380
aaggactagt catgggtgca gctgctcaga gctgcaaagt cattctggat ggtggagagc 7440
ttacaaacat ttcatgatgc tccccccgct ctgatggctg gagcccaatc cctacacaga 7500
ctcctgctgt atgtgttttc ctttcactct gagccacagc cagagggcag gcattcagtc 7560
tcctcttcag gctggggctg gggcactgag aactcaccca acaccttgct ctcactcctt 7620
ctgcaaaaca agaaagagct ttgtgctgca gtagccatga agaatgaaag gaaggcttta 7680
actaaaaaat gtcagagatt attttcaacc ccttactgtg gatcaccagc aaggaggaaa 7740
cacaacacag agacattttt tcccctcaaa ttatcaaaag aatcactgca tttgttaaag 7800
agagcaactg aatcaggaag cagagttttg aacatatcag aagttaggaa tctgcatcag 7860
agacaaatgc agtcatggtt gtttgctgca taccagccct aatcattaga agcctcatgg 7920
acttcaaaca tcattccctc tgacaagatg ctctagccta actccatgag ataaaataaa 7980
tctgcctttc agagccaaag aagagtccac cagcttcttc tcagtgtgaa caagagctcc 8040
agtcaggtta gtcagtccag tgcagtagag gagaccagtc tgcatcctct aattttcaaa 8100
ggcaagaaga tttgtttacc ctggacacca ggcacaagtg aggtcacaga gctcttagat 8160
atgcagtcct catgagtgag gagactaaag cgcatgccat caagacttca gtgtagagaa 8220
aacctccaaa aaagcctcct cactacttct ggaatagctc agaggccgag gcggcctcgg 8280
cctctgcata aataaaaaaa attagtcagc catggggcgg agaatgggcg gaactgggcg 8340
gagttagggg cgggatgggc ggagttaggg gcgggactat ggttgctgac taattgagat 8400
gcatgctttg catacttctg cctgctgggg agcctgggga ctttccacac ctggttgctg 8460
actaattgag atgcatgctt tgcatacttc tgcctgctgg ggagcctggg gactttccac 8520
accctaactg acacacattc cacagctgca ttaatgaatc ggccaacgcg cggggagagg 8580
cggtttgcgt attgggcgct cttccgcttc ctcgctcact gactcgctgc gctcggtcgt 8640
tcggctgcgg cgagcggtat cagctcactc aaaggcggta atacggttat ccacagaatc 8700
aggggataac gcaggaaaga acatgtgagc aaaaggccag caaaaggcca ggaaccgtaa 8760
aaaggccgcg ttgctggcgt ttttccatag gctccgcccc cctgacgagc atcacaaaaa 8820
tcgacgctca agtcagaggt ggcgaaaccc gacaggacta taaagatacc aggcgtttcc 8880
ccctggaagc tccctcgtgc gctctcctgt tccgaccctg ccgcttaccg gatacctgtc 8940
cgcctttctc ccttcgggaa gcgtggcgct ttctcatagc tcacgctgta ggtatctcag 9000
ttcggtgtag gtcgttcgct ccaagctggg ctgtgtgcac gaaccccccg ttcagcccga 9060
ccgctgcgcc ttatccggta actatcgtct tgagtccaac ccggtaagac acgacttatc 9120
gccactggca gcagccactg gtaacaggat tagcagagcg aggtatgtag gcggtgctac 9180
agagttcttg aagtggtggc ctaactacgg ctacactaga agaacagtat ttggtatctg 9240
cgctctgctg aagccagtta ccttcggaaa aagagttggt agctcttgat ccggcaaaca 9300
aaccaccgct ggtagcggtg gtttttttgt ttgcaagcag cagattacgc gcagaaaaaa 9360
aggatctcaa gaagatcctt tgatcttttc tacggggtct gacgctcagt ggaacgaaaa 9420
ctcacgttaa gggattttgg tcatgagatt atcaaaaagg atcttcacct agatcctttt 9480
aaattaaaaa tgaagtttta aatcaatcta aagtatatat gagtaaactt ggtctgacag 9540
ttaccaatgc ttaatcagtg aggcacctat ctcagcgatc tgtctatttc gttcatccat 9600
agttgcctga ctcctgcaaa ccacgttgtg tctcaaaatc tctgatgtta cattgcacaa 9660
gataaaaata tatcatcatg aacaataaaa ctgtctgctt acataaacag taatacaagg 9720
ggtgttatga gccatattca acgggaaacg tcttgctcga ggccgcgatt aaattccaac 9780
atggatgctg atttatatgg gtataaatgg gctcgcgata atgtcgggca atcaggtgcg 9840
acaatctatc gattgtatgg gaagcccgat gcgccagagt tgtttctgaa acatggcaaa 9900
ggtagcgttg ccaatgatgt tacagatgag atggtcagac taaactggct gacggaattt 9960
atgcctcttc cgaccatcaa gcattttatc cgtactcctg atgatgcatg gttactcacc 10020
actgcgatcc ccgggaaaac agcattccag gtattagaag aatatcctga ttcaggtgaa 10080
aatattgttg atgcgctggc agtgttcctg cgccggttgc attcgattcc tgtttgtaat 10140
tgtcctttta acagcgatcg cgtatttcgt ctcgctcagg cgcaatcacg aatgaataac 10200
ggtttggttg atgcgagtga ttttgatgac gagcgtaatg gctggcctgt tgaacaagtc 10260
tggaaagaaa tgcataagct tttgccattc tcaccggatt cagtcgtcac tcatggtgat 10320
ttctcacttg ataaccttat ttttgacgag gggaaattaa taggttgtat tgatgttgga 10380
cgagtcggaa tcgcagaccg ataccaggat cttgccatcc tatggaactg cctcggtgag 10440
ttttctcctt cattacagaa acggcttttt caaaaatatg gtattgataa tcctgatatg 10500
aataaattgc agtttcattt gatgctcgat gagtttttct aagggcggcc tgccaccata 10560
cccacgccga aacaagcgct catgagcccg aagtggcgag cccgatcttc cccatcggtg 10620
atgtcggcga tataggcgcc agcaaccgca cctgtggcgc cggtgatgag ggcgcgccaa 10680
gtcgacgtcc ggcagtc 10697
<210> 2
<211> 11355
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 2
ttggccactc cctctctgcg cgctcgctcg ctcactgagg ccgggcgacc aaaggtcgcc 60
cgacgcccgg gctttgcccg ggcggcctca gtgagcgagc gagcgcgcag agagggagtg 120
gccaactcca tcactagggg ttcctgctag ctctgggtat ttaagcccga gtgagcacgc 180
agggtctcca ttttgaagcg ggaggttacg cgttcgtcga ctactagtgg gtaccagagc 240
tccctaggtt ctagaaccgg tgacgtctcc catggtgaag cttggatctg agggcggagt 300
tagggcggag ccaatcagcg tgcgccgttc cgaaagttgc cttttatggc tgggcggaga 360
atgggcggtg aacgccgatg attatataag gacgcgccgg gtgtggcaca gctagttccg 420
tcgcagccgg gatttgggtc gcggttcttg tttgtggatc cctgtgatcg tcacttggta 480
agtcactgac tgtctatgcc tgggaaaggg tgggcaggag atggggcagt gcaggaaaag 540
tggcactatg aaccctgcag ccctaggaat gcatctagac aattgtacta accttcttct 600
ctttcctctc ctgacagtcc ggaaagccac catgggccgc tgctgcttct acaccgccgg 660
caccctgagc ctgctgctgc tggtgaccag cgtgaccctg ctggtggccc gcgtgttcca 720
gaaggccgtg gaccagagca tcgagaagaa gatcgtgctg cgcaacggca ccgaggcctt 780
cgacagctgg gagaagcccc ccctgcccgt gtacacccag ttctacttct tcaacgtgac 840
caaccccgag gagatcctgc gcggcgagac cccccgcgtg gaggaggtgg gcccctacac 900
ctaccgcgag ctgcgcaaca aggccaacat ccagttcggc gacaacggca ccaccatcag 960
cgccgtgagc aacaaggcct acgtgttcga gcgcgaccag agcgtgggcg accccaagat 1020
cgacctgatc cgcaccctga acatccccgt gctgaccgtg atcgagtgga gccaggtgca 1080
cttcctgcgc gagatcatcg aggccatgct gaaggcctac cagcagaagc tgttcgtgac 1140
ccacaccgtg gacgagctgc tgtggggcta caaggacgag atcctgagcc tgatccacgt 1200
gttccgcccc gacatcagcc cctacttcgg cctgttctac gagaagaacg gcaccaacga 1260
cggcgactac gtgttcctga ccggcgagga cagctacctg aacttcacca agatcgtgga 1320
gtggaacggc aagaccagcc tggactggtg gatcaccgac aagtgcaaca tgatcaacgg 1380
caccgacggc gacagcttcc accccctgat caccaaggac gaggtgctgt acgtgttccc 1440
cagcgacttc tgccgcagcg tgtacatcac cttcagcgac tacgagagcg tgcagggcct 1500
gcccgccttc cgctacaagg tgcccgccga gatcctggcc aacaccagcg acaacgccgg 1560
cttctgcatc cccgagggca actgcctggg cagcggcgtg ctgaacgtga gcatctgcaa 1620
gaacggcgcc cccatcatca tgagcttccc ccacttctac caggccgacg agcgcttcgt 1680
gagcgccatc gagggcatgc accccaacca ggaggaccac gagaccttcg tggacatcaa 1740
ccccctgacc ggcatcatcc tgaaggccgc caagcgcttc cagatcaaca tctacgtgaa 1800
gaagctggac gacttcgtgg agaccggcga catccgcacc atggtgttcc ccgtgatgta 1860
cctgaacgag agcgtgcaca tcgacaagga gaccgccagc cgcctgaaga gcatgatcaa 1920
caccaccctg atcatcacca acatccccta catcatcatg gccctgggcg tgttcttcgg 1980
cctggtgttc acctggctgg cctgcaaggg ccagggcagc atggacgagg gcaccgccga 2040
cgagcgcgcc cccctgatcc gcacctgatt gtggccgaac cgccgaactc agaggccggc 2100
cccagaaaac ccgagcgagt agggggcggc gcgcaggagg gaggagaact gggggcgcgg 2160
gaggctggtg ggtgtggggg gtggagatgt agaagatgtg acgccgcggc ccggcgggtg 2220
ccagattagc ggacgcggtg cccgcggttg caacgggatc ccgggcgctg cagcttggga 2280
ggcggctctc cccaggcggc gtccgcggag acacccatcc gtgaacccca ggtcccgggc 2340
cgccggctcg ccgcgcacca ggggccggcg gacagaagag cggccgagcg gctcgaggct 2400
gggggaccgc gggcgcggcc gcgcgctgcc gggcgggagg ctggggggcc ggggccgggg 2460
ccgtgccccg gagcgggtcg gaggccgggg ccggggccgg gggacggcgg ctccccgcgc 2520
ggctccagcg gctcggggat cccggccggg ccccgcaggg accatgatgg aattcagcag 2580
ccccagcaga gaggaatgcc ccaagcctct gagccgggtg tcaatcatgg ccggatctct 2640
gacaggactg ctgctgcttc aggccgtgtc ttgggcttct ggcgctagac cttgcatccc 2700
caagagcttc ggctacagca gcgtcgtgtg cgtgtgcaat gccacctact gcgacagctt 2760
cgaccctcct acctttcctg ctctgggcac cttcagcaga tacgagagca ccagatccgg 2820
cagacggatg gaactgagca tgggacccat ccaggccaat cacacaggca ctggcctgct 2880
gctgacactg cagcctgagc agaaattcca gaaagtgaaa ggcttcggcg gagccatgac 2940
agatgccgcc gctctgaata tcctggctct gtctccacca gctcagaacc tgctgctcaa 3000
gagctacttc agcgaggaag gcatcggcta caacatcatc agagtgccca tggccagctg 3060
cgacttcagc atcaggacct acacctacgc cgacacaccc gacgatttcc agctgcacaa 3120
cttcagcctg cctgaagagg acaccaagct gaagatccct ctgatccaca gagccctgca 3180
gctggcacaa agacccgtgt cactgctggc ctctccatgg acatctccca cctggctgaa 3240
aacaaatggc gccgtgaatg gcaagggcag cctgaaaggc caacctggcg acatctacca 3300
ccagacctgg gccagatact tcgtgaagtt cctggacgcc tatgccgagc acaagctgca 3360
gttttgggcc gtgacagccg agaacgaacc ttctgctgga ctgctgagcg gctacccctt 3420
tcagtgcctg ggctttacac ccgagcacca gcgggacttt atcgcccgtg atctgggacc 3480
cacactggcc aatagcaccc accataatgt gcggctgctg atgctggacg accagagact 3540
gcttctgccc cactgggcta aagtggtgct gacagatcct gaggccgcca aatacgtgca 3600
cggaatcgcc gtgcactggt atctggactt tctggcccct gccaaggcca cactgggaga 3660
gacacacaga ctgttcccca acaccatgct gttcgccagc gaagcctgtg tgggcagcaa 3720
gttttgggaa cagagcgtgc ggctcggcag ctgggataga ggcatgcagt acagccacag 3780
catcatcacc aacctgctgt accacgtcgt cggctggacc gactggaatc tggccctgaa 3840
tcctgaaggc ggccctaact gggtccgaaa cttcgtggac agccccatca tcgtggacat 3900
caccaaggac accttctaca agcagcccat gttctaccac ctgggacact tcagcaagtt 3960
catccccgag ggctctcagc gcgttggact ggtggcttcc cagaagaacg atctggacgc 4020
cgtggctctg atgcaccctg atggatctgc tgtggtggtg gtcctgaacc gcagcagcaa 4080
agatgtgccc ctgaccatca aggatcccgc cgtgggattc ctggaaacaa tcagccctgg 4140
ctactccatc cacacctacc tgtggcgtag acagtgacaa ttgttaatta agtttaaacc 4200
ctcgaggccg caagccgcat cgataccgtc gactagagct cgctgatcag cctcgactgt 4260
gccttctagt tgccagccat ctgttgtttg cccctccccc gtgccttcct tgaccctgga 4320
aggtgccact cccactgtcc tttcctaata aaatgaggaa attgcatcgc attgtctgag 4380
taggtgtcat tctattctgg ggggtggggt ggggcaggac agcaaggggg aggattggga 4440
agacaatagc aggcatgctg gggagagatc cacgataaca aacagctttt ttggggtgaa 4500
catattgact gaattccctg caggttggcc actccctctc tgcgcgctcg ctcgctcact 4560
gaggccgccc gggcaaagcc cgggcgtcgg gcgacctttg gtcgcccggc ctcagtgagc 4620
gagcgagcgc gcagagaggg agtggccaac tccatcacta ggggttcctg cggccgctcg 4680
tacggtctcg aggaattcct gcaggataac ttgccaacct cattctaaaa tgtatataga 4740
agcccaaaag acaataacaa aaatattctt gtagaacaaa atgggaaaga atgttccact 4800
aaatatcaag atttagagca aagcatgaga tgtgtgggga tagacagtga ggctgataaa 4860
atagagtaga gctcagaaac agacccattg atatatgtaa gtgacctatg aaaaaaatat 4920
ggcattttac aatgggaaaa tgatggtctt tttctttttt agaaaaacag ggaaatatat 4980
ttatatgtaa aaaataaaag ggaacccata tgtcatacca tacacacaaa aaaattccag 5040
tgaattataa gtctaaatgg agaaggcaaa actttaaatc ttttagaaaa taatatagaa 5100
gcatgcagac cagcctggcc aacatgatga aaccctctct actaataata aaatcagtag 5160
aactactcag gactactttg agtgggaagt ccttttctat gaagacttct ttggccaaaa 5220
ttaggctcta aatgcaagga gatagtgcat catgcctggc tgcacttact gataaatgat 5280
gttatcacca tctttaacca aatgcacagg aacaagttat ggtactgatg tgctggattg 5340
agaaggagct ctacttcctt gacaggacac atttgtatca acttaaaaaa gcagattttt 5400
gccagcagaa ctattcattc agaggtagga aacttagaat agatgatgtc actgattagc 5460
atggcttccc catctccaca gctgcttccc acccaggttg cccacagttg agtttgtcca 5520
gtgctcaggg ctgcccactc tcagtaagaa gccccacacc agcccctctc caaatatgtt 5580
ggctgttcct tccattaaag tgaccccact ttagagcagc aagtggattt ctgtttctta 5640
cagttcagga aggaggagtc agctgtgaga acctggagcc tgagatgctt ctaagtccca 5700
ctgctactgg ggtcagggaa gccagactcc agcatcagca gtcaggagca ctaagccctt 5760
gccaacatcc tgtttctcag agaaactgct tccattataa tggttgtcct tttttaagct 5820
atcaagccaa acaaccagtg tctaccatta ttctcatcac ctgaagccaa gggttctagc 5880
aaaagtcaag ctgtcttgta atggttgatg tgcctccagc ttctgtcttc agtcactcca 5940
ctcttagcct gctctgaatc aactctgacc acagttccct ggagcccctg ccacctgctg 6000
cccctgccac cttctccatc tgcagtgctg tgcagccttc tgcactcttg cagagctaat 6060
aggtggagac ttgaaggaag aggaggaaag tttctcataa tagccttgct gcaagctcaa 6120
atgggaggtg ggcactgtgc ccaggagcct tggagcaaag gctgtgccca acctctgact 6180
gcatccaggt ttggtcttga cagagataag aagccctggc ttttggagcc aaaatctagg 6240
tcagacttag gcaggattct caaagtttat cagcagaaca tgaggcagaa gaccctttct 6300
gctccagctt cttcaggctc aaccttcatc agaatagata gaaagagagg ctgtgagggt 6360
tcttaaaaca gaagcaaatc tgactcagag aataaacaac ctcctagtaa actacagctt 6420
agacagagca tctggtggtg agtgtgctca gtgtcctact caactgtctg gtatcagccc 6480
tcatgaggac ttctcttctt tccctcatag acctccatct ctgttttcct tagcctgcag 6540
aaatctggat ggctattcac agaatgcctg tgctttcaga gttgcatttt ttctctggta 6600
ttctggttca agcatttgaa ggtaggaaag gttctccaag tgcaagaaag ccagccctga 6660
gcctcaactg cctggctagt gtggtcagta ggatgcaaag gctgttgaat gccacaaggc 6720
caaactttaa cctgtgtacc acaagcctag cagcagaggc agctctgctc actggaactc 6780
tctgtcttct ttctcctgag ccttttcttt tcctgagttt tctagctctc ctcaacctta 6840
cctctgccct acccaggaca aacccaagag ccactgtttc tgtgatgtcc tctccagccc 6900
taattaggca tcatgacttc agcctgacct tccatgctca gaagcagtgc taatccactt 6960
cagatgagct gctctatgca acacaggcag agcctacaaa cctttgcacc agagccctcc 7020
acatatcagt gtttgttcat actcacttca acagcaaatg tgactgctga gattaagatt 7080
ttacacaaga tggtctgtaa tttcacagtt agttttatcc cattaggtat gaaagaatta 7140
gcataattcc ccttaaacat gaatgaatct tagatttttt aataaatagt tttggaagta 7200
aagacagaga catcaggagc acaaggaata gcctgagagg acaaacagaa caagaaagag 7260
tctggaaata cacaggatgt tcttggcctc ctcaaagcaa gtgcaagcag atagtaccag 7320
cagccccagg ctatcagagc ccagtgaaga gaagtaccat gaaagccaca gctctaacca 7380
ccctgttcca gagtgacaga cagtccccaa gacaagccag cctgagccag agagagaact 7440
gcaagagaaa gtttctaatt taggttctgt tagattcaga caagtgcagg tcatcctctc 7500
tccacagcta ctcacctctc cagcctaaca aagcctgcag tccacactcc aaccctggtg 7560
tctcacctcc tagcctctcc caacatcctg ctctctgacc atcttctgca tctctcatct 7620
caccatctcc cactgtctac agcctactct tgcaactacc atctcatttt ctgacatcct 7680
gtctacatct tctgccatac tctgccatct accataccac ctcttaccat ctaccacacc 7740
atcttttatc tccatccctc tcagaagcct ccaagctgaa tcctgcttta tgtgttcatc 7800
tcagcccctg catggaaagc tgaccccaga ggcagaacta ttcccagaga gcttggccaa 7860
gaaaaacaaa actaccagcc tggccaggct caggagtagt aagctgcagt gtctgttgtg 7920
ttctagcttc aacagctgca ggagttccac tctcaaatgc tccacatttc tcacatcctc 7980
ctgattctgg tcactaccca tcttcaaaga acagaatatc tcacatcagc atactgtgaa 8040
ggactagtca tgggtgcagc tgctcagagc tgcaaagtca ttctggatgg tggagagctt 8100
acaaacattt catgatgctc cccccgctct gatggctgga gcccaatccc tacacagact 8160
cctgctgtat gtgttttcct ttcactctga gccacagcca gagggcaggc attcagtctc 8220
ctcttcaggc tggggctggg gcactgagaa ctcacccaac accttgctct cactccttct 8280
gcaaaacaag aaagagcttt gtgctgcagt agccatgaag aatgaaagga aggctttaac 8340
taaaaaatgt cagagattat tttcaacccc ttactgtgga tcaccagcaa ggaggaaaca 8400
caacacagag acattttttc ccctcaaatt atcaaaagaa tcactgcatt tgttaaagag 8460
agcaactgaa tcaggaagca gagttttgaa catatcagaa gttaggaatc tgcatcagag 8520
acaaatgcag tcatggttgt ttgctgcata ccagccctaa tcattagaag cctcatggac 8580
ttcaaacatc attccctctg acaagatgct ctagcctaac tccatgagat aaaataaatc 8640
tgcctttcag agccaaagaa gagtccacca gcttcttctc agtgtgaaca agagctccag 8700
tcaggttagt cagtccagtg cagtagagga gaccagtctg catcctctaa ttttcaaagg 8760
caagaagatt tgtttaccct ggacaccagg cacaagtgag gtcacagagc tcttagatat 8820
gcagtcctca tgagtgagga gactaaagcg catgccatca agacttcagt gtagagaaaa 8880
cctccaaaaa agcctcctca ctacttctgg aatagctcag aggccgaggc ggcctcggcc 8940
tctgcataaa taaaaaaaat tagtcagcca tggggcggag aatgggcgga actgggcgga 9000
gttaggggcg ggatgggcgg agttaggggc gggactatgg ttgctgacta attgagatgc 9060
atgctttgca tacttctgcc tgctggggag cctggggact ttccacacct ggttgctgac 9120
taattgagat gcatgctttg catacttctg cctgctgggg agcctgggga ctttccacac 9180
cctaactgac acacattcca cagctgcatt aatgaatcgg ccaacgcgcg gggagaggcg 9240
gtttgcgtat tgggcgctct tccgcttcct cgctcactga ctcgctgcgc tcggtcgttc 9300
ggctgcggcg agcggtatca gctcactcaa aggcggtaat acggttatcc acagaatcag 9360
gggataacgc aggaaagaac atgtgagcaa aaggccagca aaaggccagg aaccgtaaaa 9420
aggccgcgtt gctggcgttt ttccataggc tccgcccccc tgacgagcat cacaaaaatc 9480
gacgctcaag tcagaggtgg cgaaacccga caggactata aagataccag gcgtttcccc 9540
ctggaagctc cctcgtgcgc tctcctgttc cgaccctgcc gcttaccgga tacctgtccg 9600
cctttctccc ttcgggaagc gtggcgcttt ctcatagctc acgctgtagg tatctcagtt 9660
cggtgtaggt cgttcgctcc aagctgggct gtgtgcacga accccccgtt cagcccgacc 9720
gctgcgcctt atccggtaac tatcgtcttg agtccaaccc ggtaagacac gacttatcgc 9780
cactggcagc agccactggt aacaggatta gcagagcgag gtatgtaggc ggtgctacag 9840
agttcttgaa gtggtggcct aactacggct acactagaag aacagtattt ggtatctgcg 9900
ctctgctgaa gccagttacc ttcggaaaaa gagttggtag ctcttgatcc ggcaaacaaa 9960
ccaccgctgg tagcggtggt ttttttgttt gcaagcagca gattacgcgc agaaaaaaag 10020
gatctcaaga agatcctttg atcttttcta cggggtctga cgctcagtgg aacgaaaact 10080
cacgttaagg gattttggtc atgagattat caaaaaggat cttcacctag atccttttaa 10140
attaaaaatg aagttttaaa tcaatctaaa gtatatatga gtaaacttgg tctgacagtt 10200
accaatgctt aatcagtgag gcacctatct cagcgatctg tctatttcgt tcatccatag 10260
ttgcctgact cctgcaaacc acgttgtgtc tcaaaatctc tgatgttaca ttgcacaaga 10320
taaaaatata tcatcatgaa caataaaact gtctgcttac ataaacagta atacaagggg 10380
tgttatgagc catattcaac gggaaacgtc ttgctcgagg ccgcgattaa attccaacat 10440
ggatgctgat ttatatgggt ataaatgggc tcgcgataat gtcgggcaat caggtgcgac 10500
aatctatcga ttgtatggga agcccgatgc gccagagttg tttctgaaac atggcaaagg 10560
tagcgttgcc aatgatgtta cagatgagat ggtcagacta aactggctga cggaatttat 10620
gcctcttccg accatcaagc attttatccg tactcctgat gatgcatggt tactcaccac 10680
tgcgatcccc gggaaaacag cattccaggt attagaagaa tatcctgatt caggtgaaaa 10740
tattgttgat gcgctggcag tgttcctgcg ccggttgcat tcgattcctg tttgtaattg 10800
tccttttaac agcgatcgcg tatttcgtct cgctcaggcg caatcacgaa tgaataacgg 10860
tttggttgat gcgagtgatt ttgatgacga gcgtaatggc tggcctgttg aacaagtctg 10920
gaaagaaatg cataagcttt tgccattctc accggattca gtcgtcactc atggtgattt 10980
ctcacttgat aaccttattt ttgacgaggg gaaattaata ggttgtattg atgttggacg 11040
agtcggaatc gcagaccgat accaggatct tgccatccta tggaactgcc tcggtgagtt 11100
ttctccttca ttacagaaac ggctttttca aaaatatggt attgataatc ctgatatgaa 11160
taaattgcag tttcatttga tgctcgatga gtttttctaa gggcggcctg ccaccatacc 11220
cacgccgaaa caagcgctca tgagcccgaa gtggcgagcc cgatcttccc catcggtgat 11280
gtcggcgata taggcgccag caaccgcacc tgtggcgccg gtgatgaggg cgcgccaagt 11340
cgacgtccgg cagtc 11355
<210> 3
<211> 11420
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 3
ttggccactc cctctctgcg cgctcgctcg ctcactgagg ccgggcgacc aaaggtcgcc 60
cgacgcccgg gctttgcccg ggcggcctca gtgagcgagc gagcgcgcag agagggagtg 120
gccaactcca tcactagggg ttcctgctag ctctgggtat ttaagcccga gtgagcacgc 180
agggtctcca ttttgaagcg ggaggttacg cgttcgtcga ctactagtgg gtaccagagc 240
tccctaggtt ctagaaccgg tgacgtctcc catggtgaag cttggatctg agggcggagt 300
tagggcggag ccaatcagcg tgcgccgttc cgaaagttgc cttttatggc tgggcggaga 360
atgggcggtg aacgccgatg attatataag gacgcgccgg gtgtggcaca gctagttccg 420
tcgcagccgg gatttgggtc gcggttcttg tttgtggatc cctgtgatcg tcacttggta 480
agtcactgac tgtctatgcc tgggaaaggg tgggcaggag atggggcagt gcaggaaaag 540
tggcactatg aaccctgcag ccctaggaat gcatctagac aattgtacta accttcttct 600
ctttcctctc ctgacagtcc ggaaagccac catggaattc agcagcccca gcagagagga 660
atgccccaag cctctgagcc gggtgtcaat catggccgga tctctgacag gactgctgct 720
gcttcaggcc gtgtcttggg cttctggcgc tagaccttgc atccccaaga gcttcggcta 780
cagcagcgtc gtgtgcgtgt gcaatgccac ctactgcgac agcttcgacc ctcctacctt 840
tcctgctctg ggcaccttca gcagatacga gagcaccaga tccggcagac ggatggaact 900
gagcatggga cccatccagg ccaatcacac aggcactggc ctgctgctga cactgcagcc 960
tgagcagaaa ttccagaaag tgaaaggctt cggcggagcc atgacagatg ccgccgctct 1020
gaatatcctg gctctgtctc caccagctca gaacctgctg ctcaagagct acttcagcga 1080
ggaaggcatc ggctacaaca tcatcagagt gcccatggcc agctgcgact tcagcatcag 1140
gacctacacc tacgccgaca cacccgacga tttccagctg cacaacttca gcctgcctga 1200
agaggacacc aagctgaaga tccctctgat ccacagagcc ctgcagctgg cacaaagacc 1260
cgtgtcactg ctggcctctc catggacatc tcccacctgg ctgaaaacaa atggcgccgt 1320
gaatggcaag ggcagcctga aaggccaacc tggcgacatc taccaccaga cctgggccag 1380
atacttcgtg aagttcctgg acgcctatgc cgagcacaag ctgcagtttt gggccgtgac 1440
agccgagaac gaaccttctg ctggactgct gagcggctac ccctttcagt gcctgggctt 1500
tacacccgag caccagcggg actttatcgc ccgtgatctg ggacccacac tggccaatag 1560
cacccaccat aatgtgcggc tgctgatgct ggacgaccag agactgcttc tgccccactg 1620
ggctaaagtg gtgctgacag atcctgaggc cgccaaatac gtgcacggaa tcgccgtgca 1680
ctggtatctg gactttctgg cccctgccaa ggccacactg ggagagacac acagactgtt 1740
ccccaacacc atgctgttcg ccagcgaagc ctgtgtgggc agcaagtttt gggaacagag 1800
cgtgcggctc ggcagctggg atagaggcat gcagtacagc cacagcatca tcaccaacct 1860
gctgtaccac gtcgtcggct ggaccgactg gaatctggcc ctgaatcctg aaggcggccc 1920
taactgggtc cgaaacttcg tggacagccc catcatcgtg gacatcacca aggacacctt 1980
ctacaagcag cccatgttct accacctggg acacttcagc aagttcatcc ccgagggctc 2040
tcagcgcgtt ggactggtgg cttcccagaa gaacgatctg gacgccgtgg ctctgatgca 2100
ccctgatgga tctgctgtgg tggtggtcct gaaccgcagc agcaaagatg tgcccctgac 2160
catcaaggat cccgccgtgg gattcctgga aacaatcagc cctggctact ccatccacac 2220
ctacctgtgg cgtagacagt gacaattgtt aattaagttt catcgatacc gtcgactaga 2280
gctcgctgat cagcctcgac tgtgccttct agttgccagc catctgttgt ttgcccctcc 2340
cccgtgcctt ccttgaccct ggaaggtgcc actcccactg tcctttccta ataaaatgag 2400
gaaattgcat cgcattgtct gagtaggtgt cattctattc tggggggtgg ggtggggcag 2460
gacagcaagg gggaggattg ggaagacaat agcaggcatg ctggggagag atccacgata 2520
acaaacagct tttttggggg ggcggagtta gggcggagcc aatcagcgtg cgccgttccg 2580
aaagttgcct tttatggctg ggcggagaat gggcggtgaa cgccgatgat tatataagga 2640
cgcgccgggt gtggcacagc tagttccgtc gcagccggga tttgggtcgc ggttcttgtt 2700
tgtggatccc tgtgatcgtc acttggtaag tcactgactg tctatgcctg ggaaagggtg 2760
ggcaggagat ggggcagtgc aggaaaagtg gcactatgaa ccctgcagcc ctaggaatgc 2820
atctagacaa ttgtactaac cttcttctct ttcctctcct gacagtccgg aaagccacca 2880
tgggccgctg ctgcttctac accgccggca ccctgagcct gctgctgctg gtgaccagcg 2940
tgaccctgct ggtggcccgc gtgttccaga aggccgtgga ccagagcatc gagaagaaga 3000
tcgtgctgcg caacggcacc gaggccttcg acagctggga gaagcccccc ctgcccgtgt 3060
acacccagtt ctacttcttc aacgtgacca accccgagga gatcctgcgc ggcgagaccc 3120
cccgcgtgga ggaggtgggc ccctacacct accgcgagct gcgcaacaag gccaacatcc 3180
agttcggcga caacggcacc accatcagcg ccgtgagcaa caaggcctac gtgttcgagc 3240
gcgaccagag cgtgggcgac cccaagatcg acctgatccg caccctgaac atccccgtgc 3300
tgaccgtgat cgagtggagc caggtgcact tcctgcgcga gatcatcgag gccatgctga 3360
aggcctacca gcagaagctg ttcgtgaccc acaccgtgga cgagctgctg tggggctaca 3420
aggacgagat cctgagcctg atccacgtgt tccgccccga catcagcccc tacttcggcc 3480
tgttctacga gaagaacggc accaacgacg gcgactacgt gttcctgacc ggcgaggaca 3540
gctacctgaa cttcaccaag atcgtggagt ggaacggcaa gaccagcctg gactggtgga 3600
tcaccgacaa gtgcaacatg atcaacggca ccgacggcga cagcttccac cccctgatca 3660
ccaaggacga ggtgctgtac gtgttcccca gcgacttctg ccgcagcgtg tacatcacct 3720
tcagcgacta cgagagcgtg cagggcctgc ccgccttccg ctacaaggtg cccgccgaga 3780
tcctggccaa caccagcgac aacgccggct tctgcatccc cgagggcaac tgcctgggca 3840
gcggcgtgct gaacgtgagc atctgcaaga acggcgcccc catcatcatg agcttccccc 3900
acttctacca ggccgacgag cgcttcgtga gcgccatcga gggcatgcac cccaaccagg 3960
aggaccacga gaccttcgtg gacatcaacc ccctgaccgg catcatcctg aaggccgcca 4020
agcgcttcca gatcaacatc tacgtgaaga agctggacga cttcgtggag accggcgaca 4080
tccgcaccat ggtgttcccc gtgatgtacc tgaacgagag cgtgcacatc gacaaggaga 4140
ccgccagccg cctgaagagc atgatcaaca ccaccctgat catcaccaac atcccctaca 4200
tcatcatggc cctgggcgtg ttcttcggcc tggtgttcac ctggctggcc tgcaagggcc 4260
agggcagcat ggacgagggc accgccgacg agcgcgcccc cctgatccgc acctgaccca 4320
ggggactcaa tcagcctcga agacatgata agatacattg atgagtttgg acaaaccaca 4380
acaagaatgc agtgaaaaaa atgctttatt tgtgaaattt gtgatgctat tgctttattt 4440
gtaaccatta taagctgcaa taaacaagtt aacaacaaca attgcattca ttttatgttt 4500
caggttcagg gggagatgtg ggaggttttt taaagcaagt aaaacctcta caaatgtggt 4560
atgaacatat tgactgaatt ccctgcaggt tggccactcc ctctctgcgc gctcgctcgc 4620
tcactgaggc cgcccgggca aagcccgggc gtcgggcgac ctttggtcgc ccggcctcag 4680
tgagcgagcg agcgcgcaga gagggagtgg ccaactccat cactaggggt tcctgcggcc 4740
gctcgtacgg tctcgaggaa ttcctgcagg ataacttgcc aacctcattc taaaatgtat 4800
atagaagccc aaaagacaat aacaaaaata ttcttgtaga acaaaatggg aaagaatgtt 4860
ccactaaata tcaagattta gagcaaagca tgagatgtgt ggggatagac agtgaggctg 4920
ataaaataga gtagagctca gaaacagacc cattgatata tgtaagtgac ctatgaaaaa 4980
aatatggcat tttacaatgg gaaaatgatg gtctttttct tttttagaaa aacagggaaa 5040
tatatttata tgtaaaaaat aaaagggaac ccatatgtca taccatacac acaaaaaaat 5100
tccagtgaat tataagtcta aatggagaag gcaaaacttt aaatctttta gaaaataata 5160
tagaagcatg cagaccagcc tggccaacat gatgaaaccc tctctactaa taataaaatc 5220
agtagaacta ctcaggacta ctttgagtgg gaagtccttt tctatgaaga cttctttggc 5280
caaaattagg ctctaaatgc aaggagatag tgcatcatgc ctggctgcac ttactgataa 5340
atgatgttat caccatcttt aaccaaatgc acaggaacaa gttatggtac tgatgtgctg 5400
gattgagaag gagctctact tccttgacag gacacatttg tatcaactta aaaaagcaga 5460
tttttgccag cagaactatt cattcagagg taggaaactt agaatagatg atgtcactga 5520
ttagcatggc ttccccatct ccacagctgc ttcccaccca ggttgcccac agttgagttt 5580
gtccagtgct cagggctgcc cactctcagt aagaagcccc acaccagccc ctctccaaat 5640
atgttggctg ttccttccat taaagtgacc ccactttaga gcagcaagtg gatttctgtt 5700
tcttacagtt caggaaggag gagtcagctg tgagaacctg gagcctgaga tgcttctaag 5760
tcccactgct actggggtca gggaagccag actccagcat cagcagtcag gagcactaag 5820
cccttgccaa catcctgttt ctcagagaaa ctgcttccat tataatggtt gtcctttttt 5880
aagctatcaa gccaaacaac cagtgtctac cattattctc atcacctgaa gccaagggtt 5940
ctagcaaaag tcaagctgtc ttgtaatggt tgatgtgcct ccagcttctg tcttcagtca 6000
ctccactctt agcctgctct gaatcaactc tgaccacagt tccctggagc ccctgccacc 6060
tgctgcccct gccaccttct ccatctgcag tgctgtgcag ccttctgcac tcttgcagag 6120
ctaataggtg gagacttgaa ggaagaggag gaaagtttct cataatagcc ttgctgcaag 6180
ctcaaatggg aggtgggcac tgtgcccagg agccttggag caaaggctgt gcccaacctc 6240
tgactgcatc caggtttggt cttgacagag ataagaagcc ctggcttttg gagccaaaat 6300
ctaggtcaga cttaggcagg attctcaaag tttatcagca gaacatgagg cagaagaccc 6360
tttctgctcc agcttcttca ggctcaacct tcatcagaat agatagaaag agaggctgtg 6420
agggttctta aaacagaagc aaatctgact cagagaataa acaacctcct agtaaactac 6480
agcttagaca gagcatctgg tggtgagtgt gctcagtgtc ctactcaact gtctggtatc 6540
agccctcatg aggacttctc ttctttccct catagacctc catctctgtt ttccttagcc 6600
tgcagaaatc tggatggcta ttcacagaat gcctgtgctt tcagagttgc attttttctc 6660
tggtattctg gttcaagcat ttgaaggtag gaaaggttct ccaagtgcaa gaaagccagc 6720
cctgagcctc aactgcctgg ctagtgtggt cagtaggatg caaaggctgt tgaatgccac 6780
aaggccaaac tttaacctgt gtaccacaag cctagcagca gaggcagctc tgctcactgg 6840
aactctctgt cttctttctc ctgagccttt tcttttcctg agttttctag ctctcctcaa 6900
ccttacctct gccctaccca ggacaaaccc aagagccact gtttctgtga tgtcctctcc 6960
agccctaatt aggcatcatg acttcagcct gaccttccat gctcagaagc agtgctaatc 7020
cacttcagat gagctgctct atgcaacaca ggcagagcct acaaaccttt gcaccagagc 7080
cctccacata tcagtgtttg ttcatactca cttcaacagc aaatgtgact gctgagatta 7140
agattttaca caagatggtc tgtaatttca cagttagttt tatcccatta ggtatgaaag 7200
aattagcata attcccctta aacatgaatg aatcttagat tttttaataa atagttttgg 7260
aagtaaagac agagacatca ggagcacaag gaatagcctg agaggacaaa cagaacaaga 7320
aagagtctgg aaatacacag gatgttcttg gcctcctcaa agcaagtgca agcagatagt 7380
accagcagcc ccaggctatc agagcccagt gaagagaagt accatgaaag ccacagctct 7440
aaccaccctg ttccagagtg acagacagtc cccaagacaa gccagcctga gccagagaga 7500
gaactgcaag agaaagtttc taatttaggt tctgttagat tcagacaagt gcaggtcatc 7560
ctctctccac agctactcac ctctccagcc taacaaagcc tgcagtccac actccaaccc 7620
tggtgtctca cctcctagcc tctcccaaca tcctgctctc tgaccatctt ctgcatctct 7680
catctcacca tctcccactg tctacagcct actcttgcaa ctaccatctc attttctgac 7740
atcctgtcta catcttctgc catactctgc catctaccat accacctctt accatctacc 7800
acaccatctt ttatctccat ccctctcaga agcctccaag ctgaatcctg ctttatgtgt 7860
tcatctcagc ccctgcatgg aaagctgacc ccagaggcag aactattccc agagagcttg 7920
gccaagaaaa acaaaactac cagcctggcc aggctcagga gtagtaagct gcagtgtctg 7980
ttgtgttcta gcttcaacag ctgcaggagt tccactctca aatgctccac atttctcaca 8040
tcctcctgat tctggtcact acccatcttc aaagaacaga atatctcaca tcagcatact 8100
gtgaaggact agtcatgggt gcagctgctc agagctgcaa agtcattctg gatggtggag 8160
agcttacaaa catttcatga tgctcccccc gctctgatgg ctggagccca atccctacac 8220
agactcctgc tgtatgtgtt ttcctttcac tctgagccac agccagaggg caggcattca 8280
gtctcctctt caggctgggg ctggggcact gagaactcac ccaacacctt gctctcactc 8340
cttctgcaaa acaagaaaga gctttgtgct gcagtagcca tgaagaatga aaggaaggct 8400
ttaactaaaa aatgtcagag attattttca accccttact gtggatcacc agcaaggagg 8460
aaacacaaca cagagacatt ttttcccctc aaattatcaa aagaatcact gcatttgtta 8520
aagagagcaa ctgaatcagg aagcagagtt ttgaacatat cagaagttag gaatctgcat 8580
cagagacaaa tgcagtcatg gttgtttgct gcataccagc cctaatcatt agaagcctca 8640
tggacttcaa acatcattcc ctctgacaag atgctctagc ctaactccat gagataaaat 8700
aaatctgcct ttcagagcca aagaagagtc caccagcttc ttctcagtgt gaacaagagc 8760
tccagtcagg ttagtcagtc cagtgcagta gaggagacca gtctgcatcc tctaattttc 8820
aaaggcaaga agatttgttt accctggaca ccaggcacaa gtgaggtcac agagctctta 8880
gatatgcagt cctcatgagt gaggagacta aagcgcatgc catcaagact tcagtgtaga 8940
gaaaacctcc aaaaaagcct cctcactact tctggaatag ctcagaggcc gaggcggcct 9000
cggcctctgc ataaataaaa aaaattagtc agccatgggg cggagaatgg gcggaactgg 9060
gcggagttag gggcgggatg ggcggagtta ggggcgggac tatggttgct gactaattga 9120
gatgcatgct ttgcatactt ctgcctgctg gggagcctgg ggactttcca cacctggttg 9180
ctgactaatt gagatgcatg ctttgcatac ttctgcctgc tggggagcct ggggactttc 9240
cacaccctaa ctgacacaca ttccacagct gcattaatga atcggccaac gcgcggggag 9300
aggcggtttg cgtattgggc gctcttccgc ttcctcgctc actgactcgc tgcgctcggt 9360
cgttcggctg cggcgagcgg tatcagctca ctcaaaggcg gtaatacggt tatccacaga 9420
atcaggggat aacgcaggaa agaacatgtg agcaaaaggc cagcaaaagg ccaggaaccg 9480
taaaaaggcc gcgttgctgg cgtttttcca taggctccgc ccccctgacg agcatcacaa 9540
aaatcgacgc tcaagtcaga ggtggcgaaa cccgacagga ctataaagat accaggcgtt 9600
tccccctgga agctccctcg tgcgctctcc tgttccgacc ctgccgctta ccggatacct 9660
gtccgccttt ctcccttcgg gaagcgtggc gctttctcat agctcacgct gtaggtatct 9720
cagttcggtg taggtcgttc gctccaagct gggctgtgtg cacgaacccc ccgttcagcc 9780
cgaccgctgc gccttatccg gtaactatcg tcttgagtcc aacccggtaa gacacgactt 9840
atcgccactg gcagcagcca ctggtaacag gattagcaga gcgaggtatg taggcggtgc 9900
tacagagttc ttgaagtggt ggcctaacta cggctacact agaagaacag tatttggtat 9960
ctgcgctctg ctgaagccag ttaccttcgg aaaaagagtt ggtagctctt gatccggcaa 10020
acaaaccacc gctggtagcg gtggtttttt tgtttgcaag cagcagatta cgcgcagaaa 10080
aaaaggatct caagaagatc ctttgatctt ttctacgggg tctgacgctc agtggaacga 10140
aaactcacgt taagggattt tggtcatgag attatcaaaa aggatcttca cctagatcct 10200
tttaaattaa aaatgaagtt ttaaatcaat ctaaagtata tatgagtaaa cttggtctga 10260
cagttaccaa tgcttaatca gtgaggcacc tatctcagcg atctgtctat ttcgttcatc 10320
catagttgcc tgactcctgc aaaccacgtt gtgtctcaaa atctctgatg ttacattgca 10380
caagataaaa atatatcatc atgaacaata aaactgtctg cttacataaa cagtaataca 10440
aggggtgtta tgagccatat tcaacgggaa acgtcttgct cgaggccgcg attaaattcc 10500
aacatggatg ctgatttata tgggtataaa tgggctcgcg ataatgtcgg gcaatcaggt 10560
gcgacaatct atcgattgta tgggaagccc gatgcgccag agttgtttct gaaacatggc 10620
aaaggtagcg ttgccaatga tgttacagat gagatggtca gactaaactg gctgacggaa 10680
tttatgcctc ttccgaccat caagcatttt atccgtactc ctgatgatgc atggttactc 10740
accactgcga tccccgggaa aacagcattc caggtattag aagaatatcc tgattcaggt 10800
gaaaatattg ttgatgcgct ggcagtgttc ctgcgccggt tgcattcgat tcctgtttgt 10860
aattgtcctt ttaacagcga tcgcgtattt cgtctcgctc aggcgcaatc acgaatgaat 10920
aacggtttgg ttgatgcgag tgattttgat gacgagcgta atggctggcc tgttgaacaa 10980
gtctggaaag aaatgcataa gcttttgcca ttctcaccgg attcagtcgt cactcatggt 11040
gatttctcac ttgataacct tatttttgac gaggggaaat taataggttg tattgatgtt 11100
ggacgagtcg gaatcgcaga ccgataccag gatcttgcca tcctatggaa ctgcctcggt 11160
gagttttctc cttcattaca gaaacggctt tttcaaaaat atggtattga taatcctgat 11220
atgaataaat tgcagtttca tttgatgctc gatgagtttt tctaagggcg gcctgccacc 11280
atacccacgc cgaaacaagc gctcatgagc ccgaagtggc gagcccgatc ttccccatcg 11340
gtgatgtcgg cgatataggc gccagcaacc gcacctgtgg cgccggtgat gagggcgcgc 11400
caagtcgacg tccggcagtc 11420
<210> 4
<211> 11171
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 4
ttggccactc cctctctgcg cgctcgctcg ctcactgagg ccgggcgacc aaaggtcgcc 60
cgacgcccgg gctttgcccg ggcggcctca gtgagcgagc gagcgcgcag agagggagtg 120
gccaactcca tcactagggg ttcctgctag ctctgggtat ttaagcccga gtgagcacgc 180
agggtctcca ttttgaagcg ggaggttacg cgttcgtcga ctactagtgg gtaccagagc 240
tccctaggtt ctagaaccgg tgacgtctcc catggtgaag cttggatctg agggcggagt 300
tagggcggag ccaatcagcg tgcgccgttc cgaaagttgc cttttatggc tgggcggaga 360
atgggcggtg aacgccgatg attatataag gacgcgccgg gtgtggcaca gctagttccg 420
tcgcagccgg gatttgggtc gcggttcttg tttgtggatc cctgtgatcg tcacttggta 480
agtcactgac tgtctatgcc tgggaaaggg tgggcaggag atggggcagt gcaggaaaag 540
tggcactatg aaccctcctg gtggcgaggg gaggggggtg gtcctcgaac gccttgcaga 600
actggcctgg atacagagtg gaccggctgg ccccatctgg aagacttcga gatacactgt 660
tgtcttactg cgctcaacag tgtatctcga agtcttccaa atggtgccag ccatcgcagc 720
ggggtgcagg aaatgggggc agcccccctt tttggctatc cttccacgtg ttcttttttg 780
tatcttttgt gtttcctaga aaacatctca gtcaccaccg cagccctagg aatgcatcta 840
gacaattgta ctaaccttct tctctttcct ctcctgacag tccggaaagc caccatgggc 900
cgctgctgct tctacaccgc cggcaccctg agcctgctgc tgctggtgac cagcgtgacc 960
ctgctggtgg cccgcgtgtt ccagaaggcc gtggaccaga gcatcgagaa gaagatcgtg 1020
ctgcgcaacg gcaccgaggc cttcgacagc tgggagaagc cccccctgcc cgtgtacacc 1080
cagttctact tcttcaacgt gaccaacccc gaggagatcc tgcgcggcga gaccccccgc 1140
gtggaggagg tgggccccta cacctaccgc gagctgcgca acaaggccaa catccagttc 1200
ggcgacaacg gcaccaccat cagcgccgtg agcaacaagg cctacgtgtt cgagcgcgac 1260
cagagcgtgg gcgaccccaa gatcgacctg atccgcaccc tgaacatccc cgtgctgacc 1320
gtgatcgagt ggagccaggt gcacttcctg cgcgagatca tcgaggccat gctgaaggcc 1380
taccagcaga agctgttcgt gacccacacc gtggacgagc tgctgtgggg ctacaaggac 1440
gagatcctga gcctgatcca cgtgttccgc cccgacatca gcccctactt cggcctgttc 1500
tacgagaaga acggcaccaa cgacggcgac tacgtgttcc tgaccggcga ggacagctac 1560
ctgaacttca ccaagatcgt ggagtggaac ggcaagacca gcctggactg gtggatcacc 1620
gacaagtgca acatgatcaa cggcaccgac ggcgacagct tccaccccct gatcaccaag 1680
gacgaggtgc tgtacgtgtt ccccagcgac ttctgccgca gcgtgtacat caccttcagc 1740
gactacgaga gcgtgcaggg cctgcccgcc ttccgctaca aggtgcccgc cgagatcctg 1800
gccaacacca gcgacaacgc cggcttctgc atccccgagg gcaactgcct gggcagcggc 1860
gtgctgaacg tgagcatctg caagaacggc gcccccatca tcatgagctt cccccacttc 1920
taccaggccg acgagcgctt cgtgagcgcc atcgagggca tgcaccccaa ccaggaggac 1980
cacgagacct tcgtggacat caaccccctg accggcatca tcctgaaggc cgccaagcgc 2040
ttccagatca acatctacgt gaagaagctg gacgacttcg tggagaccgg cgacatccgc 2100
accatggtgt tccccgtgat gtacctgaac gagagcgtgc acatcgacaa ggagaccgcc 2160
agccgcctga agagcatgat caacaccacc ctgatcatca ccaacatccc ctacatcatc 2220
atggccctgg gcgtgttctt cggcctggtg ttcacctggc tggcctgcaa gggccagggc 2280
agcatggacg agggcaccgc cgacgagcgc gcccccctga tccgcaccga gggcagagga 2340
agtcttctga catgcggaga cgtggaagag aatcccggcc ctatggaatt cagcagcccc 2400
agcagagagg aatgccccaa gcctctgagc cgggtgtcaa tcatggccgg atctctgaca 2460
ggactgctgc tgcttcaggc cgtgtcttgg gcttctggcg ctagaccttg catccccaag 2520
agcttcggct acagcagcgt cgtgtgcgtg tgcaatgcca cctactgcga cagcttcgac 2580
cctcctacct ttcctgctct gggcaccttc agcagatacg agagcaccag atccggcaga 2640
cggatggaac tgagcatggg acccatccag gccaatcaca caggcactgg cctgctgctg 2700
acactgcagc ctgagcagaa attccagaaa gtgaaaggct tcggcggagc catgacagat 2760
gccgccgctc tgaatatcct ggctctgtct ccaccagctc agaacctgct gctcaagagc 2820
tacttcagcg aggaaggcat cggctacaac atcatcagag tgcccatggc cagctgcgac 2880
ttcagcatca ggacctacac ctacgccgac acacccgacg atttccagct gcacaacttc 2940
agcctgcctg aagaggacac caagctgaag atccctctga tccacagagc cctgcagctg 3000
gcacaaagac ccgtgtcact gctggcctct ccatggacat ctcccacctg gctgaaaaca 3060
aatggcgccg tgaatggcaa gggcagcctg aaaggccaac ctggcgacat ctaccaccag 3120
acctgggcca gatacttcgt gaagttcctg gacgcctatg ccgagcacaa gctgcagttt 3180
tgggccgtga cagccgagaa cgaaccttct gctggactgc tgagcggcta cccctttcag 3240
tgcctgggct ttacacccga gcaccagcgg gactttatcg cccgtgatct gggacccaca 3300
ctggccaata gcacccacca taatgtgcgg ctgctgatgc tggacgacca gagactgctt 3360
ctgccccact gggctaaagt ggtgctgaca gatcctgagg ccgccaaata cgtgcacgga 3420
atcgccgtgc actggtatct ggactttctg gcccctgcca aggccacact gggagagaca 3480
cacagactgt tccccaacac catgctgttc gccagcgaag cctgtgtggg cagcaagttt 3540
tgggaacaga gcgtgcggct cggcagctgg gatagaggca tgcagtacag ccacagcatc 3600
atcaccaacc tgctgtacca cgtcgtcggc tggaccgact ggaatctggc cctgaatcct 3660
gaaggcggcc ctaactgggt ccgaaacttc gtggacagcc ccatcatcgt ggacatcacc 3720
aaggacacct tctacaagca gcccatgttc taccacctgg gacacttcag caagttcatc 3780
cccgagggct ctcagcgcgt tggactggtg gcttcccaga agaacgatct ggacgccgtg 3840
gctctgatgc accctgatgg atctgctgtg gtggtggtcc tgaaccgcag cagcaaagat 3900
gtgcccctga ccatcaagga tcccgccgtg ggattcctgg aaacaatcag ccctggctac 3960
tccatccaca cctacctgtg gcgtagacag tgacaattgt taattaagtt taaaccctcg 4020
aggccgcaag ccgcatcgat accgtcgact agagctcgct gatcagcctc gactgtgcct 4080
tctagttgcc agccatctgt tgtttgcccc tcccccgtgc cttccttgac cctggaaggt 4140
gccactccca ctgtcctttc ctaataaaat gaggaaattg catcgcattg tctgagtagg 4200
tgtcattcta ttctgggggg tggggtgggg caggacagca agggggagga ttgggaagac 4260
aatagcaggc atgctgggga gagatccacg ataacaaaca gcttttttgg ggtgaacata 4320
ttgactgaat tccctgcagg ttggccactc cctctctgcg cgctcgctcg ctcactgagg 4380
ccgcccgggc aaagcccggg cgtcgggcga cctttggtcg cccggcctca gtgagcgagc 4440
gagcgcgcag agagggagtg gccaactcca tcactagggg ttcctgcggc cgctcgtacg 4500
gtctcgagga attcctgcag gataacttgc caacctcatt ctaaaatgta tatagaagcc 4560
caaaagacaa taacaaaaat attcttgtag aacaaaatgg gaaagaatgt tccactaaat 4620
atcaagattt agagcaaagc atgagatgtg tggggataga cagtgaggct gataaaatag 4680
agtagagctc agaaacagac ccattgatat atgtaagtga cctatgaaaa aaatatggca 4740
ttttacaatg ggaaaatgat ggtctttttc ttttttagaa aaacagggaa atatatttat 4800
atgtaaaaaa taaaagggaa cccatatgtc ataccataca cacaaaaaaa ttccagtgaa 4860
ttataagtct aaatggagaa ggcaaaactt taaatctttt agaaaataat atagaagcat 4920
gcagaccagc ctggccaaca tgatgaaacc ctctctacta ataataaaat cagtagaact 4980
actcaggact actttgagtg ggaagtcctt ttctatgaag acttctttgg ccaaaattag 5040
gctctaaatg caaggagata gtgcatcatg cctggctgca cttactgata aatgatgtta 5100
tcaccatctt taaccaaatg cacaggaaca agttatggta ctgatgtgct ggattgagaa 5160
ggagctctac ttccttgaca ggacacattt gtatcaactt aaaaaagcag atttttgcca 5220
gcagaactat tcattcagag gtaggaaact tagaatagat gatgtcactg attagcatgg 5280
cttccccatc tccacagctg cttcccaccc aggttgccca cagttgagtt tgtccagtgc 5340
tcagggctgc ccactctcag taagaagccc cacaccagcc cctctccaaa tatgttggct 5400
gttccttcca ttaaagtgac cccactttag agcagcaagt ggatttctgt ttcttacagt 5460
tcaggaagga ggagtcagct gtgagaacct ggagcctgag atgcttctaa gtcccactgc 5520
tactggggtc agggaagcca gactccagca tcagcagtca ggagcactaa gcccttgcca 5580
acatcctgtt tctcagagaa actgcttcca ttataatggt tgtccttttt taagctatca 5640
agccaaacaa ccagtgtcta ccattattct catcacctga agccaagggt tctagcaaaa 5700
gtcaagctgt cttgtaatgg ttgatgtgcc tccagcttct gtcttcagtc actccactct 5760
tagcctgctc tgaatcaact ctgaccacag ttccctggag cccctgccac ctgctgcccc 5820
tgccaccttc tccatctgca gtgctgtgca gccttctgca ctcttgcaga gctaataggt 5880
ggagacttga aggaagagga ggaaagtttc tcataatagc cttgctgcaa gctcaaatgg 5940
gaggtgggca ctgtgcccag gagccttgga gcaaaggctg tgcccaacct ctgactgcat 6000
ccaggtttgg tcttgacaga gataagaagc cctggctttt ggagccaaaa tctaggtcag 6060
acttaggcag gattctcaaa gtttatcagc agaacatgag gcagaagacc ctttctgctc 6120
cagcttcttc aggctcaacc ttcatcagaa tagatagaaa gagaggctgt gagggttctt 6180
aaaacagaag caaatctgac tcagagaata aacaacctcc tagtaaacta cagcttagac 6240
agagcatctg gtggtgagtg tgctcagtgt cctactcaac tgtctggtat cagccctcat 6300
gaggacttct cttctttccc tcatagacct ccatctctgt tttccttagc ctgcagaaat 6360
ctggatggct attcacagaa tgcctgtgct ttcagagttg cattttttct ctggtattct 6420
ggttcaagca tttgaaggta ggaaaggttc tccaagtgca agaaagccag ccctgagcct 6480
caactgcctg gctagtgtgg tcagtaggat gcaaaggctg ttgaatgcca caaggccaaa 6540
ctttaacctg tgtaccacaa gcctagcagc agaggcagct ctgctcactg gaactctctg 6600
tcttctttct cctgagcctt ttcttttcct gagttttcta gctctcctca accttacctc 6660
tgccctaccc aggacaaacc caagagccac tgtttctgtg atgtcctctc cagccctaat 6720
taggcatcat gacttcagcc tgaccttcca tgctcagaag cagtgctaat ccacttcaga 6780
tgagctgctc tatgcaacac aggcagagcc tacaaacctt tgcaccagag ccctccacat 6840
atcagtgttt gttcatactc acttcaacag caaatgtgac tgctgagatt aagattttac 6900
acaagatggt ctgtaatttc acagttagtt ttatcccatt aggtatgaaa gaattagcat 6960
aattcccctt aaacatgaat gaatcttaga ttttttaata aatagttttg gaagtaaaga 7020
cagagacatc aggagcacaa ggaatagcct gagaggacaa acagaacaag aaagagtctg 7080
gaaatacaca ggatgttctt ggcctcctca aagcaagtgc aagcagatag taccagcagc 7140
cccaggctat cagagcccag tgaagagaag taccatgaaa gccacagctc taaccaccct 7200
gttccagagt gacagacagt ccccaagaca agccagcctg agccagagag agaactgcaa 7260
gagaaagttt ctaatttagg ttctgttaga ttcagacaag tgcaggtcat cctctctcca 7320
cagctactca cctctccagc ctaacaaagc ctgcagtcca cactccaacc ctggtgtctc 7380
acctcctagc ctctcccaac atcctgctct ctgaccatct tctgcatctc tcatctcacc 7440
atctcccact gtctacagcc tactcttgca actaccatct cattttctga catcctgtct 7500
acatcttctg ccatactctg ccatctacca taccacctct taccatctac cacaccatct 7560
tttatctcca tccctctcag aagcctccaa gctgaatcct gctttatgtg ttcatctcag 7620
cccctgcatg gaaagctgac cccagaggca gaactattcc cagagagctt ggccaagaaa 7680
aacaaaacta ccagcctggc caggctcagg agtagtaagc tgcagtgtct gttgtgttct 7740
agcttcaaca gctgcaggag ttccactctc aaatgctcca catttctcac atcctcctga 7800
ttctggtcac tacccatctt caaagaacag aatatctcac atcagcatac tgtgaaggac 7860
tagtcatggg tgcagctgct cagagctgca aagtcattct ggatggtgga gagcttacaa 7920
acatttcatg atgctccccc cgctctgatg gctggagccc aatccctaca cagactcctg 7980
ctgtatgtgt tttcctttca ctctgagcca cagccagagg gcaggcattc agtctcctct 8040
tcaggctggg gctggggcac tgagaactca cccaacacct tgctctcact ccttctgcaa 8100
aacaagaaag agctttgtgc tgcagtagcc atgaagaatg aaaggaaggc tttaactaaa 8160
aaatgtcaga gattattttc aaccccttac tgtggatcac cagcaaggag gaaacacaac 8220
acagagacat tttttcccct caaattatca aaagaatcac tgcatttgtt aaagagagca 8280
actgaatcag gaagcagagt tttgaacata tcagaagtta ggaatctgca tcagagacaa 8340
atgcagtcat ggttgtttgc tgcataccag ccctaatcat tagaagcctc atggacttca 8400
aacatcattc cctctgacaa gatgctctag cctaactcca tgagataaaa taaatctgcc 8460
tttcagagcc aaagaagagt ccaccagctt cttctcagtg tgaacaagag ctccagtcag 8520
gttagtcagt ccagtgcagt agaggagacc agtctgcatc ctctaatttt caaaggcaag 8580
aagatttgtt taccctggac accaggcaca agtgaggtca cagagctctt agatatgcag 8640
tcctcatgag tgaggagact aaagcgcatg ccatcaagac ttcagtgtag agaaaacctc 8700
caaaaaagcc tcctcactac ttctggaata gctcagaggc cgaggcggcc tcggcctctg 8760
cataaataaa aaaaattagt cagccatggg gcggagaatg ggcggaactg ggcggagtta 8820
ggggcgggat gggcggagtt aggggcggga ctatggttgc tgactaattg agatgcatgc 8880
tttgcatact tctgcctgct ggggagcctg gggactttcc acacctggtt gctgactaat 8940
tgagatgcat gctttgcata cttctgcctg ctggggagcc tggggacttt ccacacccta 9000
actgacacac attccacagc tgcattaatg aatcggccaa cgcgcgggga gaggcggttt 9060
gcgtattggg cgctcttccg cttcctcgct cactgactcg ctgcgctcgg tcgttcggct 9120
gcggcgagcg gtatcagctc actcaaaggc ggtaatacgg ttatccacag aatcagggga 9180
taacgcagga aagaacatgt gagcaaaagg ccagcaaaag gccaggaacc gtaaaaaggc 9240
cgcgttgctg gcgtttttcc ataggctccg cccccctgac gagcatcaca aaaatcgacg 9300
ctcaagtcag aggtggcgaa acccgacagg actataaaga taccaggcgt ttccccctgg 9360
aagctccctc gtgcgctctc ctgttccgac cctgccgctt accggatacc tgtccgcctt 9420
tctcccttcg ggaagcgtgg cgctttctca tagctcacgc tgtaggtatc tcagttcggt 9480
gtaggtcgtt cgctccaagc tgggctgtgt gcacgaaccc cccgttcagc ccgaccgctg 9540
cgccttatcc ggtaactatc gtcttgagtc caacccggta agacacgact tatcgccact 9600
ggcagcagcc actggtaaca ggattagcag agcgaggtat gtaggcggtg ctacagagtt 9660
cttgaagtgg tggcctaact acggctacac tagaagaaca gtatttggta tctgcgctct 9720
gctgaagcca gttaccttcg gaaaaagagt tggtagctct tgatccggca aacaaaccac 9780
cgctggtagc ggtggttttt ttgtttgcaa gcagcagatt acgcgcagaa aaaaaggatc 9840
tcaagaagat cctttgatct tttctacggg gtctgacgct cagtggaacg aaaactcacg 9900
ttaagggatt ttggtcatga gattatcaaa aaggatcttc acctagatcc ttttaaatta 9960
aaaatgaagt tttaaatcaa tctaaagtat atatgagtaa acttggtctg acagttacca 10020
atgcttaatc agtgaggcac ctatctcagc gatctgtcta tttcgttcat ccatagttgc 10080
ctgactcctg caaaccacgt tgtgtctcaa aatctctgat gttacattgc acaagataaa 10140
aatatatcat catgaacaat aaaactgtct gcttacataa acagtaatac aaggggtgtt 10200
atgagccata ttcaacggga aacgtcttgc tcgaggccgc gattaaattc caacatggat 10260
gctgatttat atgggtataa atgggctcgc gataatgtcg ggcaatcagg tgcgacaatc 10320
tatcgattgt atgggaagcc cgatgcgcca gagttgtttc tgaaacatgg caaaggtagc 10380
gttgccaatg atgttacaga tgagatggtc agactaaact ggctgacgga atttatgcct 10440
cttccgacca tcaagcattt tatccgtact cctgatgatg catggttact caccactgcg 10500
atccccggga aaacagcatt ccaggtatta gaagaatatc ctgattcagg tgaaaatatt 10560
gttgatgcgc tggcagtgtt cctgcgccgg ttgcattcga ttcctgtttg taattgtcct 10620
tttaacagcg atcgcgtatt tcgtctcgct caggcgcaat cacgaatgaa taacggtttg 10680
gttgatgcga gtgattttga tgacgagcgt aatggctggc ctgttgaaca agtctggaaa 10740
gaaatgcata agcttttgcc attctcaccg gattcagtcg tcactcatgg tgatttctca 10800
cttgataacc ttatttttga cgaggggaaa ttaataggtt gtattgatgt tggacgagtc 10860
ggaatcgcag accgatacca ggatcttgcc atcctatgga actgcctcgg tgagttttct 10920
ccttcattac agaaacggct ttttcaaaaa tatggtattg ataatcctga tatgaataaa 10980
ttgcagtttc atttgatgct cgatgagttt ttctaagggc ggcctgccac catacccacg 11040
ccgaaacaag cgctcatgag cccgaagtgg cgagcccgat cttccccatc ggtgatgtcg 11100
gcgatatagg cgccagcaac cgcacctgtg gcgccggtga tgagggcgcg ccaagtcgac 11160
gtccggcagt c 11171
<210> 5
<211> 11309
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 5
ttggccactc cctctctgcg cgctcgctcg ctcactgagg ccgggcgacc aaaggtcgcc 60
cgacgcccgg gctttgcccg ggcggcctca gtgagcgagc gagcgcgcag agagggagtg 120
gccaactcca tcactagggg ttcctgctag ctctgggtat ttaagcccga gtgagcacgc 180
agggtctcca ttttgaagcg ggaggttacg cgttcgtcga ctactagtgg gtaccagagc 240
tccctaggtt ctagaaccgg tgacgtctcc catggtgaag cttggatctg agggcggagt 300
tagggcggag ccaatcagcg tgcgccgttc cgaaagttgc cttttatggc tgggcggaga 360
atgggcggtg aacgccgatg attatataag gacgcgccgg gtgtggcaca gctagttccg 420
tcgcagccgg gatttgggtc gcggttcttg tttgtggatc cctgtgatcg tcacttggta 480
agtcactgac tgtctatgcc tgggaaaggg tgggcaggag atggggcagt gcaggaaaag 540
tggcactatg aaccctcctg gtggcgaggg gaggggggtg gtcctcgaac gccttgcaga 600
actggcctgg atacagagtg gaccggctgg ccccatctgg aagacttcga gatacactgt 660
tgtcttactg cgctcaacag tgtatctcga agtcttccaa atggtgccag ccatcgcagc 720
ggggtgcagg aaatgggggc agcccccctt tttggctatc cttccacgtg ttcttttttg 780
tatcttttgt gtttcctaga aaacatctca gtcaccaccg cagccctagg aatgcatcta 840
gacaattgta ctaaccttct tctctttcct ctcctgacag tccggaaagc caccatgtac 900
gccctgttcc tgctggccag cctgctgggc gccgccctgg ccggccccgt gctgggcctg 960
aaggagtgca cccgcggcag cgccgtgtgg tgccagaacg tgaagaccgc cagcgactgc 1020
ggcgccgtga agcactgcct gcagaccgtg tggaacaagc ccaccgtgaa gagcctgccc 1080
tgcgacatct gcaaggacgt ggtgaccgcc gccggcgaca tgctgaagga caacgccacc 1140
gaggaggaga tcctggtgta cctggagaag acctgcgact ggctgcccaa gcccaacatg 1200
agcgccagct gcaaggagat cgtggacagc tacctgcccg tgatcctgga catcatcaag 1260
ggcgagatga gccgccccgg cgaggtgtgc agcgccctga acctgtgcga gagcctgcag 1320
aagcacctgg ccgagctgaa ccaccagaag cagctggaga gcaacaagat ccccgagctg 1380
gacatgaccg aggtggtggc ccccttcatg gccaacatcc ccctgctgct gtacccccag 1440
gacggccccc gcagcaagcc ccagcccaag gacaacggcg acgtgtgcca ggactgcatc 1500
cagatggtga ccgacatcca gaccgccgtg cgcaccaaca gcaccttcgt gcaggccctg 1560
gtggagcacg tgaaggagga gtgcgaccgc ctgggccccg gcatggccga catctgcaag 1620
aactacatca gccagtacag cgagatcgcc atccagatga tgatgcacat gcagcccaag 1680
gagatctgcg ccctggtggg cttctgcgac gaggtgaagg agatgcccat gcagaccctg 1740
gtgcccgcca aggtggccag caagaacgtg atccccgccc tggagctggt ggagcccatc 1800
aagaagcacg aggtgcccgc caagagcgac gtgtactgcg aggtgtgcga gttcctggtg 1860
aaggaggtga ccaagctgat cgacaacaac aagaccgaga aggagatcct ggacgccttc 1920
gacaagatgt gcagcaagct gcccaagagc ctgagcgagg agtgccagga ggtggtggac 1980
acctacggca gcagcatcct gagcatcctg ctggaggagg tgagccccga gctggtgtgc 2040
agcatgctgc acctgtgcag cggcacccgc ctgcccgccc tgaccgtgca cgtgacccag 2100
cccaaggacg gcggcttctg cgaggtgtgc aagaagctgg tgggctacct ggaccgcaac 2160
ctggagaaga acagcaccaa gcaggagatc ctggccgccc tggagaaggg ctgcagcttc 2220
ctgcccgacc cctaccagaa gcagtgcgac cagttcgtgg ccgagtacga gcccgtgctg 2280
atcgagatcc tggtggaggt gatggacccc agcttcgtgt gcctgaagat cggcgcctgc 2340
cccagcgccc acaagcccct gctgggcacc gagaagtgca tctggggccc cagctactgg 2400
tgccagaaca ccgagaccgc cgcccagtgc aacgccgtgg agcactgcaa gcgccacgtg 2460
tggaacgagg gcagaggaag tcttctgaca tgcggagacg tggaagagaa tcccggccct 2520
atggaattca gcagccccag cagagaggaa tgccccaagc ctctgagccg ggtgtcaatc 2580
atggccggat ctctgacagg actgctgctg cttcaggccg tgtcttgggc ttctggcgct 2640
agaccttgca tccccaagag cttcggctac agcagcgtcg tgtgcgtgtg caatgccacc 2700
tactgcgaca gcttcgaccc tcctaccttt cctgctctgg gcaccttcag cagatacgag 2760
agcaccagat ccggcagacg gatggaactg agcatgggac ccatccaggc caatcacaca 2820
ggcactggcc tgctgctgac actgcagcct gagcagaaat tccagaaagt gaaaggcttc 2880
ggcggagcca tgacagatgc cgccgctctg aatatcctgg ctctgtctcc accagctcag 2940
aacctgctgc tcaagagcta cttcagcgag gaaggcatcg gctacaacat catcagagtg 3000
cccatggcca gctgcgactt cagcatcagg acctacacct acgccgacac acccgacgat 3060
ttccagctgc acaacttcag cctgcctgaa gaggacacca agctgaagat ccctctgatc 3120
cacagagccc tgcagctggc acaaagaccc gtgtcactgc tggcctctcc atggacatct 3180
cccacctggc tgaaaacaaa tggcgccgtg aatggcaagg gcagcctgaa aggccaacct 3240
ggcgacatct accaccagac ctgggccaga tacttcgtga agttcctgga cgcctatgcc 3300
gagcacaagc tgcagttttg ggccgtgaca gccgagaacg aaccttctgc tggactgctg 3360
agcggctacc cctttcagtg cctgggcttt acacccgagc accagcggga ctttatcgcc 3420
cgtgatctgg gacccacact ggccaatagc acccaccata atgtgcggct gctgatgctg 3480
gacgaccaga gactgcttct gccccactgg gctaaagtgg tgctgacaga tcctgaggcc 3540
gccaaatacg tgcacggaat cgccgtgcac tggtatctgg actttctggc ccctgccaag 3600
gccacactgg gagagacaca cagactgttc cccaacacca tgctgttcgc cagcgaagcc 3660
tgtgtgggca gcaagttttg ggaacagagc gtgcggctcg gcagctggga tagaggcatg 3720
cagtacagcc acagcatcat caccaacctg ctgtaccacg tcgtcggctg gaccgactgg 3780
aatctggccc tgaatcctga aggcggccct aactgggtcc gaaacttcgt ggacagcccc 3840
atcatcgtgg acatcaccaa ggacaccttc tacaagcagc ccatgttcta ccacctggga 3900
cacttcagca agttcatccc cgagggctct cagcgcgttg gactggtggc ttcccagaag 3960
aacgatctgg acgccgtggc tctgatgcac cctgatggat ctgctgtggt ggtggtcctg 4020
aaccgcagca gcaaagatgt gcccctgacc atcaaggatc ccgccgtggg attcctggaa 4080
acaatcagcc ctggctactc catccacacc tacctgtggc gtagacagtg acaattgtta 4140
attaagttta aaccctcgag gccgcaagcc gcatcgatac cgtcgactag agctcgctga 4200
tcagcctcga ctgtgccttc tagttgccag ccatctgttg tttgcccctc ccccgtgcct 4260
tccttgaccc tggaaggtgc cactcccact gtcctttcct aataaaatga ggaaattgca 4320
tcgcattgtc tgagtaggtg tcattctatt ctggggggtg gggtggggca ggacagcaag 4380
ggggaggatt gggaagacaa tagcaggcat gctggggaga gatccacgat aacaaacagc 4440
ttttttgggg tgaacatatt gactgaattc cctgcaggtt ggccactccc tctctgcgcg 4500
ctcgctcgct cactgaggcc gcccgggcaa agcccgggcg tcgggcgacc tttggtcgcc 4560
cggcctcagt gagcgagcga gcgcgcagag agggagtggc caactccatc actaggggtt 4620
cctgcggccg ctcgtacggt ctcgaggaat tcctgcagga taacttgcca acctcattct 4680
aaaatgtata tagaagccca aaagacaata acaaaaatat tcttgtagaa caaaatggga 4740
aagaatgttc cactaaatat caagatttag agcaaagcat gagatgtgtg gggatagaca 4800
gtgaggctga taaaatagag tagagctcag aaacagaccc attgatatat gtaagtgacc 4860
tatgaaaaaa atatggcatt ttacaatggg aaaatgatgg tctttttctt ttttagaaaa 4920
acagggaaat atatttatat gtaaaaaata aaagggaacc catatgtcat accatacaca 4980
caaaaaaatt ccagtgaatt ataagtctaa atggagaagg caaaacttta aatcttttag 5040
aaaataatat agaagcatgc agaccagcct ggccaacatg atgaaaccct ctctactaat 5100
aataaaatca gtagaactac tcaggactac tttgagtggg aagtcctttt ctatgaagac 5160
ttctttggcc aaaattaggc tctaaatgca aggagatagt gcatcatgcc tggctgcact 5220
tactgataaa tgatgttatc accatcttta accaaatgca caggaacaag ttatggtact 5280
gatgtgctgg attgagaagg agctctactt ccttgacagg acacatttgt atcaacttaa 5340
aaaagcagat ttttgccagc agaactattc attcagaggt aggaaactta gaatagatga 5400
tgtcactgat tagcatggct tccccatctc cacagctgct tcccacccag gttgcccaca 5460
gttgagtttg tccagtgctc agggctgccc actctcagta agaagcccca caccagcccc 5520
tctccaaata tgttggctgt tccttccatt aaagtgaccc cactttagag cagcaagtgg 5580
atttctgttt cttacagttc aggaaggagg agtcagctgt gagaacctgg agcctgagat 5640
gcttctaagt cccactgcta ctggggtcag ggaagccaga ctccagcatc agcagtcagg 5700
agcactaagc ccttgccaac atcctgtttc tcagagaaac tgcttccatt ataatggttg 5760
tcctttttta agctatcaag ccaaacaacc agtgtctacc attattctca tcacctgaag 5820
ccaagggttc tagcaaaagt caagctgtct tgtaatggtt gatgtgcctc cagcttctgt 5880
cttcagtcac tccactctta gcctgctctg aatcaactct gaccacagtt ccctggagcc 5940
cctgccacct gctgcccctg ccaccttctc catctgcagt gctgtgcagc cttctgcact 6000
cttgcagagc taataggtgg agacttgaag gaagaggagg aaagtttctc ataatagcct 6060
tgctgcaagc tcaaatggga ggtgggcact gtgcccagga gccttggagc aaaggctgtg 6120
cccaacctct gactgcatcc aggtttggtc ttgacagaga taagaagccc tggcttttgg 6180
agccaaaatc taggtcagac ttaggcagga ttctcaaagt ttatcagcag aacatgaggc 6240
agaagaccct ttctgctcca gcttcttcag gctcaacctt catcagaata gatagaaaga 6300
gaggctgtga gggttcttaa aacagaagca aatctgactc agagaataaa caacctccta 6360
gtaaactaca gcttagacag agcatctggt ggtgagtgtg ctcagtgtcc tactcaactg 6420
tctggtatca gccctcatga ggacttctct tctttccctc atagacctcc atctctgttt 6480
tccttagcct gcagaaatct ggatggctat tcacagaatg cctgtgcttt cagagttgca 6540
ttttttctct ggtattctgg ttcaagcatt tgaaggtagg aaaggttctc caagtgcaag 6600
aaagccagcc ctgagcctca actgcctggc tagtgtggtc agtaggatgc aaaggctgtt 6660
gaatgccaca aggccaaact ttaacctgtg taccacaagc ctagcagcag aggcagctct 6720
gctcactgga actctctgtc ttctttctcc tgagcctttt cttttcctga gttttctagc 6780
tctcctcaac cttacctctg ccctacccag gacaaaccca agagccactg tttctgtgat 6840
gtcctctcca gccctaatta ggcatcatga cttcagcctg accttccatg ctcagaagca 6900
gtgctaatcc acttcagatg agctgctcta tgcaacacag gcagagccta caaacctttg 6960
caccagagcc ctccacatat cagtgtttgt tcatactcac ttcaacagca aatgtgactg 7020
ctgagattaa gattttacac aagatggtct gtaatttcac agttagtttt atcccattag 7080
gtatgaaaga attagcataa ttccccttaa acatgaatga atcttagatt ttttaataaa 7140
tagttttgga agtaaagaca gagacatcag gagcacaagg aatagcctga gaggacaaac 7200
agaacaagaa agagtctgga aatacacagg atgttcttgg cctcctcaaa gcaagtgcaa 7260
gcagatagta ccagcagccc caggctatca gagcccagtg aagagaagta ccatgaaagc 7320
cacagctcta accaccctgt tccagagtga cagacagtcc ccaagacaag ccagcctgag 7380
ccagagagag aactgcaaga gaaagtttct aatttaggtt ctgttagatt cagacaagtg 7440
caggtcatcc tctctccaca gctactcacc tctccagcct aacaaagcct gcagtccaca 7500
ctccaaccct ggtgtctcac ctcctagcct ctcccaacat cctgctctct gaccatcttc 7560
tgcatctctc atctcaccat ctcccactgt ctacagccta ctcttgcaac taccatctca 7620
ttttctgaca tcctgtctac atcttctgcc atactctgcc atctaccata ccacctctta 7680
ccatctacca caccatcttt tatctccatc cctctcagaa gcctccaagc tgaatcctgc 7740
tttatgtgtt catctcagcc cctgcatgga aagctgaccc cagaggcaga actattccca 7800
gagagcttgg ccaagaaaaa caaaactacc agcctggcca ggctcaggag tagtaagctg 7860
cagtgtctgt tgtgttctag cttcaacagc tgcaggagtt ccactctcaa atgctccaca 7920
tttctcacat cctcctgatt ctggtcacta cccatcttca aagaacagaa tatctcacat 7980
cagcatactg tgaaggacta gtcatgggtg cagctgctca gagctgcaaa gtcattctgg 8040
atggtggaga gcttacaaac atttcatgat gctccccccg ctctgatggc tggagcccaa 8100
tccctacaca gactcctgct gtatgtgttt tcctttcact ctgagccaca gccagagggc 8160
aggcattcag tctcctcttc aggctggggc tggggcactg agaactcacc caacaccttg 8220
ctctcactcc ttctgcaaaa caagaaagag ctttgtgctg cagtagccat gaagaatgaa 8280
aggaaggctt taactaaaaa atgtcagaga ttattttcaa ccccttactg tggatcacca 8340
gcaaggagga aacacaacac agagacattt tttcccctca aattatcaaa agaatcactg 8400
catttgttaa agagagcaac tgaatcagga agcagagttt tgaacatatc agaagttagg 8460
aatctgcatc agagacaaat gcagtcatgg ttgtttgctg cataccagcc ctaatcatta 8520
gaagcctcat ggacttcaaa catcattccc tctgacaaga tgctctagcc taactccatg 8580
agataaaata aatctgcctt tcagagccaa agaagagtcc accagcttct tctcagtgtg 8640
aacaagagct ccagtcaggt tagtcagtcc agtgcagtag aggagaccag tctgcatcct 8700
ctaattttca aaggcaagaa gatttgttta ccctggacac caggcacaag tgaggtcaca 8760
gagctcttag atatgcagtc ctcatgagtg aggagactaa agcgcatgcc atcaagactt 8820
cagtgtagag aaaacctcca aaaaagcctc ctcactactt ctggaatagc tcagaggccg 8880
aggcggcctc ggcctctgca taaataaaaa aaattagtca gccatggggc ggagaatggg 8940
cggaactggg cggagttagg ggcgggatgg gcggagttag gggcgggact atggttgctg 9000
actaattgag atgcatgctt tgcatacttc tgcctgctgg ggagcctggg gactttccac 9060
acctggttgc tgactaattg agatgcatgc tttgcatact tctgcctgct ggggagcctg 9120
gggactttcc acaccctaac tgacacacat tccacagctg cattaatgaa tcggccaacg 9180
cgcggggaga ggcggtttgc gtattgggcg ctcttccgct tcctcgctca ctgactcgct 9240
gcgctcggtc gttcggctgc ggcgagcggt atcagctcac tcaaaggcgg taatacggtt 9300
atccacagaa tcaggggata acgcaggaaa gaacatgtga gcaaaaggcc agcaaaaggc 9360
caggaaccgt aaaaaggccg cgttgctggc gtttttccat aggctccgcc cccctgacga 9420
gcatcacaaa aatcgacgct caagtcagag gtggcgaaac ccgacaggac tataaagata 9480
ccaggcgttt ccccctggaa gctccctcgt gcgctctcct gttccgaccc tgccgcttac 9540
cggatacctg tccgcctttc tcccttcggg aagcgtggcg ctttctcata gctcacgctg 9600
taggtatctc agttcggtgt aggtcgttcg ctccaagctg ggctgtgtgc acgaaccccc 9660
cgttcagccc gaccgctgcg ccttatccgg taactatcgt cttgagtcca acccggtaag 9720
acacgactta tcgccactgg cagcagccac tggtaacagg attagcagag cgaggtatgt 9780
aggcggtgct acagagttct tgaagtggtg gcctaactac ggctacacta gaagaacagt 9840
atttggtatc tgcgctctgc tgaagccagt taccttcgga aaaagagttg gtagctcttg 9900
atccggcaaa caaaccaccg ctggtagcgg tggttttttt gtttgcaagc agcagattac 9960
gcgcagaaaa aaaggatctc aagaagatcc tttgatcttt tctacggggt ctgacgctca 10020
gtggaacgaa aactcacgtt aagggatttt ggtcatgaga ttatcaaaaa ggatcttcac 10080
ctagatcctt ttaaattaaa aatgaagttt taaatcaatc taaagtatat atgagtaaac 10140
ttggtctgac agttaccaat gcttaatcag tgaggcacct atctcagcga tctgtctatt 10200
tcgttcatcc atagttgcct gactcctgca aaccacgttg tgtctcaaaa tctctgatgt 10260
tacattgcac aagataaaaa tatatcatca tgaacaataa aactgtctgc ttacataaac 10320
agtaatacaa ggggtgttat gagccatatt caacgggaaa cgtcttgctc gaggccgcga 10380
ttaaattcca acatggatgc tgatttatat gggtataaat gggctcgcga taatgtcggg 10440
caatcaggtg cgacaatcta tcgattgtat gggaagcccg atgcgccaga gttgtttctg 10500
aaacatggca aaggtagcgt tgccaatgat gttacagatg agatggtcag actaaactgg 10560
ctgacggaat ttatgcctct tccgaccatc aagcatttta tccgtactcc tgatgatgca 10620
tggttactca ccactgcgat ccccgggaaa acagcattcc aggtattaga agaatatcct 10680
gattcaggtg aaaatattgt tgatgcgctg gcagtgttcc tgcgccggtt gcattcgatt 10740
cctgtttgta attgtccttt taacagcgat cgcgtatttc gtctcgctca ggcgcaatca 10800
cgaatgaata acggtttggt tgatgcgagt gattttgatg acgagcgtaa tggctggcct 10860
gttgaacaag tctggaaaga aatgcataag cttttgccat tctcaccgga ttcagtcgtc 10920
actcatggtg atttctcact tgataacctt atttttgacg aggggaaatt aataggttgt 10980
attgatgttg gacgagtcgg aatcgcagac cgataccagg atcttgccat cctatggaac 11040
tgcctcggtg agttttctcc ttcattacag aaacggcttt ttcaaaaata tggtattgat 11100
aatcctgata tgaataaatt gcagtttcat ttgatgctcg atgagttttt ctaagggcgg 11160
cctgccacca tacccacgcc gaaacaagcg ctcatgagcc cgaagtggcg agcccgatct 11220
tccccatcgg tgatgtcggc gatataggcg ccagcaaccg cacctgtggc gccggtgatg 11280
agggcgcgcc aagtcgacgt ccggcagtc 11309
<210> 6
<211> 11293
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 6
ttggccactc cctctctgcg cgctcgctcg ctcactgagg ccgggcgacc aaaggtcgcc 60
cgacgcccgg gctttgcccg ggcggcctca gtgagcgagc gagcgcgcag agagggagtg 120
gccaactcca tcactagggg ttcctgctag ctctgggtat ttaagcccga gtgagcacgc 180
agggtctcca ttttgaagcg ggaggttacg cgttcgtcga ctactagtgg gtaccagagc 240
tccctaggtt ctagaaccgg tgacgtctcc catggtgaag cttggatctg agggcggagt 300
tagggcggag ccaatcagcg tgcgccgttc cgaaagttgc cttttatggc tgggcggaga 360
atgggcggtg aacgccgatg attatataag gacgcgccgg gtgtggcaca gctagttccg 420
tcgcagccgg gatttgggtc gcggttcttg tttgtggatc cctgtgatcg tcacttggta 480
agtcactgac tgtctatgcc tgggaaaggg tgggcaggag atggggcagt gcaggaaaag 540
tggcactatg aaccctgcag ccctaggaat gcatctagac aattgtacta accttcttct 600
ctttcctctc ctgacagtcc ggaaagccac catgtacgcc ctgttcctgc tggccagcct 660
gctgggcgcc gccctggccg gccccgtgct gggcctgaag gagtgcaccc gcggcagcgc 720
cgtgtggtgc cagaacgtga agaccgccag cgactgcggc gccgtgaagc actgcctgca 780
gaccgtgtgg aacaagccca ccgtgaagag cctgccctgc gacatctgca aggacgtggt 840
gaccgccgcc ggcgacatgc tgaaggacaa cgccaccgag gaggagatcc tggtgtacct 900
ggagaagacc tgcgactggc tgcccaagcc caacatgagc gccagctgca aggagatcgt 960
ggacagctac ctgcccgtga tcctggacat catcaagggc gagatgagcc gccccggcga 1020
ggtgtgcagc gccctgaacc tgtgcgagag cctgcagaag cacctggccg agctgaacca 1080
ccagaagcag ctggagagca acaagatccc cgagctggac atgaccgagg tggtggcccc 1140
cttcatggcc aacatccccc tgctgctgta cccccaggac ggcccccgca gcaagcccca 1200
gcccaaggac aacggcgacg tgtgccagga ctgcatccag atggtgaccg acatccagac 1260
cgccgtgcgc accaacagca ccttcgtgca ggccctggtg gagcacgtga aggaggagtg 1320
cgaccgcctg ggccccggca tggccgacat ctgcaagaac tacatcagcc agtacagcga 1380
gatcgccatc cagatgatga tgcacatgca gcccaaggag atctgcgccc tggtgggctt 1440
ctgcgacgag gtgaaggaga tgcccatgca gaccctggtg cccgccaagg tggccagcaa 1500
gaacgtgatc cccgccctgg agctggtgga gcccatcaag aagcacgagg tgcccgccaa 1560
gagcgacgtg tactgcgagg tgtgcgagtt cctggtgaag gaggtgacca agctgatcga 1620
caacaacaag accgagaagg agatcctgga cgccttcgac aagatgtgca gcaagctgcc 1680
caagagcctg agcgaggagt gccaggaggt ggtggacacc tacggcagca gcatcctgag 1740
catcctgctg gaggaggtga gccccgagct ggtgtgcagc atgctgcacc tgtgcagcgg 1800
cacccgcctg cccgccctga ccgtgcacgt gacccagccc aaggacggcg gcttctgcga 1860
ggtgtgcaag aagctggtgg gctacctgga ccgcaacctg gagaagaaca gcaccaagca 1920
ggagatcctg gccgccctgg agaagggctg cagcttcctg cccgacccct accagaagca 1980
gtgcgaccag ttcgtggccg agtacgagcc cgtgctgatc gagatcctgg tggaggtgat 2040
ggaccccagc ttcgtgtgcc tgaagatcgg cgcctgcccc agcgcccaca agcccctgct 2100
gggcaccgag aagtgcatct ggggccccag ctactggtgc cagaacaccg agaccgccgc 2160
ccagtgcaac gccgtggagc actgcaagcg ccacgtgtgg aactgattgt ggccgaaccg 2220
ccgaactcag aggccggccc cagaaaaccc gagcgagtag ggggcggcgc gcaggaggga 2280
ggagaactgg gggcgcggga ggctggtggg tgtggggggt ggagatgtag aagatgtgac 2340
gccgcggccc ggcgggtgcc agattagcgg acgcggtgcc cgcggttgca acgggatccc 2400
gggcgctgca gcttgggagg cggctctccc caggcggcgt ccgcggagac acccatccgt 2460
gaaccccagg tcccgggccg ccggctcgcc gcgcaccagg ggccggcgga cagaagagcg 2520
gccgagcggc tcgaggctgg gggaccgcgg gcgcggccgc gcgctgccgg gcgggaggct 2580
ggggggccgg ggccggggcc gtgccccgga gcgggtcgga ggccggggcc ggggccgggg 2640
gacggcggct ccccgcgcgg ctccagcggc tcggggatcc cggccgggcc ccgcagggac 2700
catgatggaa ttcagcagcc ccagcagaga ggaatgcccc aagcctctga gccgggtgtc 2760
aatcatggcc ggatctctga caggactgct gctgcttcag gccgtgtctt gggcttctgg 2820
cgctagacct tgcatcccca agagcttcgg ctacagcagc gtcgtgtgcg tgtgcaatgc 2880
cacctactgc gacagcttcg accctcctac ctttcctgct ctgggcacct tcagcagata 2940
cgagagcacc agatccggca gacggatgga actgagcatg ggacccatcc aggccaatca 3000
cacaggcact ggcctgctgc tgacactgca gcctgagcag aaattccaga aagtgaaagg 3060
cttcggcgga gccatgacag atgccgccgc tctgaatatc ctggctctgt ctccaccagc 3120
tcagaacctg ctgctcaaga gctacttcag cgaggaaggc atcggctaca acatcatcag 3180
agtgcccatg gccagctgcg acttcagcat caggacctac acctacgccg acacacccga 3240
cgatttccag ctgcacaact tcagcctgcc tgaagaggac accaagctga agatccctct 3300
gatccacaga gccctgcagc tggcacaaag acccgtgtca ctgctggcct ctccatggac 3360
atctcccacc tggctgaaaa caaatggcgc cgtgaatggc aagggcagcc tgaaaggcca 3420
acctggcgac atctaccacc agacctgggc cagatacttc gtgaagttcc tggacgccta 3480
tgccgagcac aagctgcagt tttgggccgt gacagccgag aacgaacctt ctgctggact 3540
gctgagcggc tacccctttc agtgcctggg ctttacaccc gagcaccagc gggactttat 3600
cgcccgtgat ctgggaccca cactggccaa tagcacccac cataatgtgc ggctgctgat 3660
gctggacgac cagagactgc ttctgcccca ctgggctaaa gtggtgctga cagatcctga 3720
ggccgccaaa tacgtgcacg gaatcgccgt gcactggtat ctggactttc tggcccctgc 3780
caaggccaca ctgggagaga cacacagact gttccccaac accatgctgt tcgccagcga 3840
agcctgtgtg ggcagcaagt tttgggaaca gagcgtgcgg ctcggcagct gggatagagg 3900
catgcagtac agccacagca tcatcaccaa cctgctgtac cacgtcgtcg gctggaccga 3960
ctggaatctg gccctgaatc ctgaaggcgg ccctaactgg gtccgaaact tcgtggacag 4020
ccccatcatc gtggacatca ccaaggacac cttctacaag cagcccatgt tctaccacct 4080
gggacacttc agcaagttca tccccgaggg ctctcagcgc gttggactgg tggcttccca 4140
gaagaacgat ctggacgccg tggctctgat gcaccctgat ggatctgctg tggtggtggt 4200
cctgaaccgc agcagcaaag atgtgcccct gaccatcaag gatcccgccg tgggattcct 4260
ggaaacaatc agccctggct actccatcca cacctacctg tggcgtagac agtgacaatt 4320
gttaattaag tttaaaccct cgaggccgca agcaataaaa tatctttatt ttcattacat 4380
ctgtgtgttg gttttttgtg tggagatcca cgataacaaa cagctttttt ggggtgaaca 4440
tattgactga attccctgca ggttggccac tccctctctg cgcgctcgct cgctcactga 4500
ggccgcccgg gcaaagcccg ggcgtcgggc gacctttggt cgcccggcct cagtgagcga 4560
gcgagcgcgc agagagggag tggccaactc catcactagg ggttcctgcg gccgctcgta 4620
cggtctcgag gaattcctgc aggataactt gccaacctca ttctaaaatg tatatagaag 4680
cccaaaagac aataacaaaa atattcttgt agaacaaaat gggaaagaat gttccactaa 4740
atatcaagat ttagagcaaa gcatgagatg tgtggggata gacagtgagg ctgataaaat 4800
agagtagagc tcagaaacag acccattgat atatgtaagt gacctatgaa aaaaatatgg 4860
cattttacaa tgggaaaatg atggtctttt tcttttttag aaaaacaggg aaatatattt 4920
atatgtaaaa aataaaaggg aacccatatg tcataccata cacacaaaaa aattccagtg 4980
aattataagt ctaaatggag aaggcaaaac tttaaatctt ttagaaaata atatagaagc 5040
atgcagacca gcctggccaa catgatgaaa ccctctctac taataataaa atcagtagaa 5100
ctactcagga ctactttgag tgggaagtcc ttttctatga agacttcttt ggccaaaatt 5160
aggctctaaa tgcaaggaga tagtgcatca tgcctggctg cacttactga taaatgatgt 5220
tatcaccatc tttaaccaaa tgcacaggaa caagttatgg tactgatgtg ctggattgag 5280
aaggagctct acttccttga caggacacat ttgtatcaac ttaaaaaagc agatttttgc 5340
cagcagaact attcattcag aggtaggaaa cttagaatag atgatgtcac tgattagcat 5400
ggcttcccca tctccacagc tgcttcccac ccaggttgcc cacagttgag tttgtccagt 5460
gctcagggct gcccactctc agtaagaagc cccacaccag cccctctcca aatatgttgg 5520
ctgttccttc cattaaagtg accccacttt agagcagcaa gtggatttct gtttcttaca 5580
gttcaggaag gaggagtcag ctgtgagaac ctggagcctg agatgcttct aagtcccact 5640
gctactgggg tcagggaagc cagactccag catcagcagt caggagcact aagcccttgc 5700
caacatcctg tttctcagag aaactgcttc cattataatg gttgtccttt tttaagctat 5760
caagccaaac aaccagtgtc taccattatt ctcatcacct gaagccaagg gttctagcaa 5820
aagtcaagct gtcttgtaat ggttgatgtg cctccagctt ctgtcttcag tcactccact 5880
cttagcctgc tctgaatcaa ctctgaccac agttccctgg agcccctgcc acctgctgcc 5940
cctgccacct tctccatctg cagtgctgtg cagccttctg cactcttgca gagctaatag 6000
gtggagactt gaaggaagag gaggaaagtt tctcataata gccttgctgc aagctcaaat 6060
gggaggtggg cactgtgccc aggagccttg gagcaaaggc tgtgcccaac ctctgactgc 6120
atccaggttt ggtcttgaca gagataagaa gccctggctt ttggagccaa aatctaggtc 6180
agacttaggc aggattctca aagtttatca gcagaacatg aggcagaaga ccctttctgc 6240
tccagcttct tcaggctcaa ccttcatcag aatagataga aagagaggct gtgagggttc 6300
ttaaaacaga agcaaatctg actcagagaa taaacaacct cctagtaaac tacagcttag 6360
acagagcatc tggtggtgag tgtgctcagt gtcctactca actgtctggt atcagccctc 6420
atgaggactt ctcttctttc cctcatagac ctccatctct gttttcctta gcctgcagaa 6480
atctggatgg ctattcacag aatgcctgtg ctttcagagt tgcatttttt ctctggtatt 6540
ctggttcaag catttgaagg taggaaaggt tctccaagtg caagaaagcc agccctgagc 6600
ctcaactgcc tggctagtgt ggtcagtagg atgcaaaggc tgttgaatgc cacaaggcca 6660
aactttaacc tgtgtaccac aagcctagca gcagaggcag ctctgctcac tggaactctc 6720
tgtcttcttt ctcctgagcc ttttcttttc ctgagttttc tagctctcct caaccttacc 6780
tctgccctac ccaggacaaa cccaagagcc actgtttctg tgatgtcctc tccagcccta 6840
attaggcatc atgacttcag cctgaccttc catgctcaga agcagtgcta atccacttca 6900
gatgagctgc tctatgcaac acaggcagag cctacaaacc tttgcaccag agccctccac 6960
atatcagtgt ttgttcatac tcacttcaac agcaaatgtg actgctgaga ttaagatttt 7020
acacaagatg gtctgtaatt tcacagttag ttttatccca ttaggtatga aagaattagc 7080
ataattcccc ttaaacatga atgaatctta gattttttaa taaatagttt tggaagtaaa 7140
gacagagaca tcaggagcac aaggaatagc ctgagaggac aaacagaaca agaaagagtc 7200
tggaaataca caggatgttc ttggcctcct caaagcaagt gcaagcagat agtaccagca 7260
gccccaggct atcagagccc agtgaagaga agtaccatga aagccacagc tctaaccacc 7320
ctgttccaga gtgacagaca gtccccaaga caagccagcc tgagccagag agagaactgc 7380
aagagaaagt ttctaattta ggttctgtta gattcagaca agtgcaggtc atcctctctc 7440
cacagctact cacctctcca gcctaacaaa gcctgcagtc cacactccaa ccctggtgtc 7500
tcacctccta gcctctccca acatcctgct ctctgaccat cttctgcatc tctcatctca 7560
ccatctccca ctgtctacag cctactcttg caactaccat ctcattttct gacatcctgt 7620
ctacatcttc tgccatactc tgccatctac cataccacct cttaccatct accacaccat 7680
cttttatctc catccctctc agaagcctcc aagctgaatc ctgctttatg tgttcatctc 7740
agcccctgca tggaaagctg accccagagg cagaactatt cccagagagc ttggccaaga 7800
aaaacaaaac taccagcctg gccaggctca ggagtagtaa gctgcagtgt ctgttgtgtt 7860
ctagcttcaa cagctgcagg agttccactc tcaaatgctc cacatttctc acatcctcct 7920
gattctggtc actacccatc ttcaaagaac agaatatctc acatcagcat actgtgaagg 7980
actagtcatg ggtgcagctg ctcagagctg caaagtcatt ctggatggtg gagagcttac 8040
aaacatttca tgatgctccc cccgctctga tggctggagc ccaatcccta cacagactcc 8100
tgctgtatgt gttttccttt cactctgagc cacagccaga gggcaggcat tcagtctcct 8160
cttcaggctg gggctggggc actgagaact cacccaacac cttgctctca ctccttctgc 8220
aaaacaagaa agagctttgt gctgcagtag ccatgaagaa tgaaaggaag gctttaacta 8280
aaaaatgtca gagattattt tcaacccctt actgtggatc accagcaagg aggaaacaca 8340
acacagagac attttttccc ctcaaattat caaaagaatc actgcatttg ttaaagagag 8400
caactgaatc aggaagcaga gttttgaaca tatcagaagt taggaatctg catcagagac 8460
aaatgcagtc atggttgttt gctgcatacc agccctaatc attagaagcc tcatggactt 8520
caaacatcat tccctctgac aagatgctct agcctaactc catgagataa aataaatctg 8580
cctttcagag ccaaagaaga gtccaccagc ttcttctcag tgtgaacaag agctccagtc 8640
aggttagtca gtccagtgca gtagaggaga ccagtctgca tcctctaatt ttcaaaggca 8700
agaagatttg tttaccctgg acaccaggca caagtgaggt cacagagctc ttagatatgc 8760
agtcctcatg agtgaggaga ctaaagcgca tgccatcaag acttcagtgt agagaaaacc 8820
tccaaaaaag cctcctcact acttctggaa tagctcagag gccgaggcgg cctcggcctc 8880
tgcataaata aaaaaaatta gtcagccatg gggcggagaa tgggcggaac tgggcggagt 8940
taggggcggg atgggcggag ttaggggcgg gactatggtt gctgactaat tgagatgcat 9000
gctttgcata cttctgcctg ctggggagcc tggggacttt ccacacctgg ttgctgacta 9060
attgagatgc atgctttgca tacttctgcc tgctggggag cctggggact ttccacaccc 9120
taactgacac acattccaca gctgcattaa tgaatcggcc aacgcgcggg gagaggcggt 9180
ttgcgtattg ggcgctcttc cgcttcctcg ctcactgact cgctgcgctc ggtcgttcgg 9240
ctgcggcgag cggtatcagc tcactcaaag gcggtaatac ggttatccac agaatcaggg 9300
gataacgcag gaaagaacat gtgagcaaaa ggccagcaaa aggccaggaa ccgtaaaaag 9360
gccgcgttgc tggcgttttt ccataggctc cgcccccctg acgagcatca caaaaatcga 9420
cgctcaagtc agaggtggcg aaacccgaca ggactataaa gataccaggc gtttccccct 9480
ggaagctccc tcgtgcgctc tcctgttccg accctgccgc ttaccggata cctgtccgcc 9540
tttctccctt cgggaagcgt ggcgctttct catagctcac gctgtaggta tctcagttcg 9600
gtgtaggtcg ttcgctccaa gctgggctgt gtgcacgaac cccccgttca gcccgaccgc 9660
tgcgccttat ccggtaacta tcgtcttgag tccaacccgg taagacacga cttatcgcca 9720
ctggcagcag ccactggtaa caggattagc agagcgaggt atgtaggcgg tgctacagag 9780
ttcttgaagt ggtggcctaa ctacggctac actagaagaa cagtatttgg tatctgcgct 9840
ctgctgaagc cagttacctt cggaaaaaga gttggtagct cttgatccgg caaacaaacc 9900
accgctggta gcggtggttt ttttgtttgc aagcagcaga ttacgcgcag aaaaaaagga 9960
tctcaagaag atcctttgat cttttctacg gggtctgacg ctcagtggaa cgaaaactca 10020
cgttaaggga ttttggtcat gagattatca aaaaggatct tcacctagat ccttttaaat 10080
taaaaatgaa gttttaaatc aatctaaagt atatatgagt aaacttggtc tgacagttac 10140
caatgcttaa tcagtgaggc acctatctca gcgatctgtc tatttcgttc atccatagtt 10200
gcctgactcc tgcaaaccac gttgtgtctc aaaatctctg atgttacatt gcacaagata 10260
aaaatatatc atcatgaaca ataaaactgt ctgcttacat aaacagtaat acaaggggtg 10320
ttatgagcca tattcaacgg gaaacgtctt gctcgaggcc gcgattaaat tccaacatgg 10380
atgctgattt atatgggtat aaatgggctc gcgataatgt cgggcaatca ggtgcgacaa 10440
tctatcgatt gtatgggaag cccgatgcgc cagagttgtt tctgaaacat ggcaaaggta 10500
gcgttgccaa tgatgttaca gatgagatgg tcagactaaa ctggctgacg gaatttatgc 10560
ctcttccgac catcaagcat tttatccgta ctcctgatga tgcatggtta ctcaccactg 10620
cgatccccgg gaaaacagca ttccaggtat tagaagaata tcctgattca ggtgaaaata 10680
ttgttgatgc gctggcagtg ttcctgcgcc ggttgcattc gattcctgtt tgtaattgtc 10740
cttttaacag cgatcgcgta tttcgtctcg ctcaggcgca atcacgaatg aataacggtt 10800
tggttgatgc gagtgatttt gatgacgagc gtaatggctg gcctgttgaa caagtctgga 10860
aagaaatgca taagcttttg ccattctcac cggattcagt cgtcactcat ggtgatttct 10920
cacttgataa ccttattttt gacgagggga aattaatagg ttgtattgat gttggacgag 10980
tcggaatcgc agaccgatac caggatcttg ccatcctatg gaactgcctc ggtgagtttt 11040
ctccttcatt acagaaacgg ctttttcaaa aatatggtat tgataatcct gatatgaata 11100
aattgcagtt tcatttgatg ctcgatgagt ttttctaagg gcggcctgcc accataccca 11160
cgccgaaaca agcgctcatg agcccgaagt ggcgagcccg atcttcccca tcggtgatgt 11220
cggcgatata ggcgccagca accgcacctg tggcgccggt gatgagggcg cgccaagtcg 11280
acgtccggca gtc 11293
<210> 7
<211> 10700
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 7
ttggccactc cctctctgcg cgctcgctcg ctcactgagg ccgcccgggc aaagcccggg 60
cgtcgggcga cctttggtcg cccggcctca gtgagcgagc gagcgcgcag agagggagtg 120
gccaactcca tcactagggg ttcctgctag ctctgggtat ttaagcccga gtgagcacgc 180
agggtctcca ttttgaagcg ggaggttacg cgttcgtcga ctactagtgg gtaccagagc 240
tccctaggtt ctagaaccgg tgacgtctcc catggtgaag cttggatctg aattcggtac 300
ctagttatta atagtaatca attacggggt cattagttca tagcccatat atggagttcc 360
gcgttacata acttacggta aatggcccgc ctggctgacc gcccaacgac ccccgcccat 420
tgacgtcaat aatgacgtat gttcccatag taacgccaat agggactttc cattgacgtc 480
aatgggtgga gtatttacgg taaactgccc acttggcagt acatcaagtg tatcatatgc 540
caagtacgcc ccctattgac gtcaatgacg gtaaatggcc cgcctggcat tatgcccagt 600
acatgacctt atgggacttt cctacttggc agtacatcta cgtattagtc atcgctatta 660
ccatggtcga ggtgagcccc acgttctgct tcactctccc catctccccc ccctccccac 720
ccccaatttt gtatttattt attttttaat tattttgtgc agcgatgggg gcgggggggg 780
ggggggggcg cgcgccaggc ggggcggggc ggggcgaggg gcggggcggg gcgaggcgga 840
gaggtgcggc ggcagccaat cagagcggcg cgctccgaaa gtttcctttt atggcgaggc 900
ggcggcggcg gcggccctat aaaaagcgaa gcgcgcggcg ggcgggagtc gctgcgacgc 960
tgccttcgcc ccgtgccccg ctccgccgcc gcctcgcgcc gcccgccccg gctctgactg 1020
accgcgttac tcccacaggt gagcgggcgg gacggccctt ctcctccggg ctgtaattag 1080
cgcttggttt aatgacggct tgtttctttt ctgtggctgc gtgaaagcct tgaggggctc 1140
cgggagctag agcctctgct aaccatgttc atgccttctt ctttttccta cagctcctgg 1200
gcaacgtgct ggttattgtg ctgtctcatc attttggcaa agaattcctc gaagatccga 1260
agggaaagtc ttccacgact gtgggatccg ttcgaagata tcaccggttg agccaccatg 1320
gaattcagca gccccagcag agaggaatgc cccaagcctc tgagccgggt gtcaatcatg 1380
gccggatctc tgacaggact gctgctgctt caggccgtgt cttgggcttc tggcgctaga 1440
ccttgcatcc ccaagagctt cggctacagc agcgtcgtgt gcgtgtgcaa tgccacctac 1500
tgcgacagct tcgaccctcc tacctttcct gctctgggca ccttcagcag atacgagagc 1560
accagatccg gcagacggat ggaactgagc atgggaccca tccaggccaa tcacacaggc 1620
actggcctgc tgctgacact gcagcctgag cagaaattcc agaaagtgaa aggcttcggc 1680
ggagccatga cagatgccgc cgctctgaat atcctggctc tgtctccacc agctcagaac 1740
ctgctgctca agagctactt cagcgaggaa ggcatcggct acaacatcat cagagtgccc 1800
atggccagct gcgacttcag catcaggacc tacacctacg ccgacacacc cgacgatttc 1860
cagctgcaca acttcagcct gcctgaagag gacaccaagc tgaagatccc tctgatccac 1920
agagccctgc agctggcaca aagacccgtg tcactgctgg cctctccatg gacatctccc 1980
acctggctga aaacaaatgg cgccgtgaat ggcaagggca gcctgaaagg ccaacctggc 2040
gacatctacc accagacctg ggccagatac ttcgtgaagt tcctggacgc ctatgccgag 2100
cacaagctgc agttttgggc cgtgacagcc gagaacgaac cttctgctgg actgctgagc 2160
ggctacccct ttcagtgcct gggctttaca cccgagcacc agcgggactt tatcgcccgt 2220
gatctgggac ccacactggc caatagcacc caccataatg tgcggctgct gatgctggac 2280
gaccagagac tgcttctgcc ccactgggct aaagtggtgc tgacagatcc tgaggccgcc 2340
aaatacgtgc acggaatcgc cgtgcactgg tatctggact ttctggcccc tgccaaggcc 2400
acactgggag agacacacag actgttcccc aacaccatgc tgttcgccag cgaagcctgt 2460
gtgggcagca agttttggga acagagcgtg cggctcggca gctgggatag aggcatgcag 2520
tacagccaca gcatcatcac caacctgctg taccacgtcg tcggctggac cgactggaat 2580
ctggccctga atcctgaagg cggccctaac tgggtccgaa acttcgtgga cagcccccatc 2640
atcgtggaca tcaccaagga caccttctac aagcagccca tgttctacca cctgggacac 2700
ttcagcaagt tcatccccga gggctctcag cgcgttggac tggtggcttc ccagaagaac 2760
gatctggacg ccgtggctct gatgcaccct gatggatctg ctgtggtggt ggtcctgaac 2820
cgcagcagca aagatgtgcc cctgaccatc aaggatcccg ccgtgggatt cctggaaaca 2880
atcagccctg gctactccat ccacacctac ctgtggcgta gacagtgaca attgttaatt 2940
aagtttaaac cctcgaggcc gcaagcttat cgataatcaa cctctggatt acaaaatttg 3000
tgaaagattg actggtattc ttaactatgt tgctcctttt acgctatgtg gatacgctgc 3060
tttaatgcct ttgtatcatg ctattgcttc ccgtatggct ttcattttct cctccttgta 3120
taaatcctgg ttgctgtctc tttatgagga gttgtggccc gttgtcaggc aacgtggcgt 3180
ggtgtgcact gtgtttgctg acgcaacccc cactggttgg ggcattgcca ccacctgtca 3240
gctcctttcc gggactttcg ctttccccct ccctattgcc acggcggaac tcatcgccgc 3300
ctgccttgcc cgctgctgga caggggctcg gctgttgggc actgacaatt ccgtggtgtt 3360
gtcggggaaa tcatcgtcct ttccttggct gctcgcctgt gttgccacct ggattctgcg 3420
cgggacgtcc ttctgctacg tcccttcggc cctcaatcca gcggaccttc cttcccgcgg 3480
cctgctgccg gctctgcggc ctcttccgcg tcttcgcctt cgccctcaga cgagtcggat 3540
ctccctttgg gccgcctccc cgcatcgata ccgtcgacta gagctcgctg atcagcctcg 3600
actgtgcctt ctagttgcca gccatctgtt gtttgcccct cccccgtgcc ttccttgacc 3660
ctggaaggtg ccactcccac tgtcctttcc taataaaatg aggaaattgc atcgcattgt 3720
ctgagtaggt gtcattctat tctggggggt ggggtggggc aggacagcaa gggggaggat 3780
tgggaagaca atagcaggca tgctggggag agatccacga taacaaacag cttttttggg 3840
gtgaacatat tgactgaatt ccctgcaggt tggccactcc ctctctgcgc gctcgctcgc 3900
tcactgaggc cgcccgggca aagcccgggc gtcgggcgac ctttggtcgc ccggcctcag 3960
tgagcgagcg agcgcgcaga gagggagtgg ccaactccat cactaggggt tcctgcggcc 4020
gctcgtacgg tctcgaggaa ttcctgcagg ataacttgcc aacctcattc taaaatgtat 4080
atagaagccc aaaagacaat aacaaaaata ttcttgtaga acaaaatggg aaagaatgtt 4140
ccactaaata tcaagattta gagcaaagca tgagatgtgt ggggatagac agtgaggctg 4200
ataaaataga gtagagctca gaaacagacc cattgatata tgtaagtgac ctatgaaaaa 4260
aatatggcat tttacaatgg gaaaatgatg gtctttttct tttttagaaa aacagggaaa 4320
tatatttata tgtaaaaaat aaaagggaac ccatatgtca taccatacac acaaaaaaat 4380
tccagtgaat tataagtcta aatggagaag gcaaaacttt aaatctttta gaaaataata 4440
tagaagcatg cagaccagcc tggccaacat gatgaaaccc tctctactaa taataaaatc 4500
agtagaacta ctcaggacta ctttgagtgg gaagtccttt tctatgaaga cttctttggc 4560
caaaattagg ctctaaatgc aaggagatag tgcatcatgc ctggctgcac ttactgataa 4620
atgatgttat caccatcttt aaccaaatgc acaggaacaa gttatggtac tgatgtgctg 4680
gattgagaag gagctctact tccttgacag gacacatttg tatcaactta aaaaagcaga 4740
tttttgccag cagaactatt cattcagagg taggaaactt agaatagatg atgtcactga 4800
ttagcatggc ttccccatct ccacagctgc ttcccaccca ggttgcccac agttgagttt 4860
gtccagtgct cagggctgcc cactctcagt aagaagcccc acaccagccc ctctccaaat 4920
atgttggctg ttccttccat taaagtgacc ccactttaga gcagcaagtg gatttctgtt 4980
tcttacagtt caggaaggag gagtcagctg tgagaacctg gagcctgaga tgcttctaag 5040
tcccactgct actggggtca gggaagccag actccagcat cagcagtcag gagcactaag 5100
cccttgccaa catcctgttt ctcagagaaa ctgcttccat tataatggtt gtcctttttt 5160
aagctatcaa gccaaacaac cagtgtctac cattattctc atcacctgaa gccaagggtt 5220
ctagcaaaag tcaagctgtc ttgtaatggt tgatgtgcct ccagcttctg tcttcagtca 5280
ctccactctt agcctgctct gaatcaactc tgaccacagt tccctggagc ccctgccacc 5340
tgctgcccct gccaccttct ccatctgcag tgctgtgcag ccttctgcac tcttgcagag 5400
ctaataggtg gagacttgaa ggaagaggag gaaagtttct cataatagcc ttgctgcaag 5460
ctcaaatggg aggtgggcac tgtgcccagg agccttggag caaaggctgt gcccaacctc 5520
tgactgcatc caggtttggt cttgacagag ataagaagcc ctggcttttg gagccaaaat 5580
ctaggtcaga cttaggcagg attctcaaag tttatcagca gaacatgagg cagaagaccc 5640
tttctgctcc agcttcttca ggctcaacct tcatcagaat agatagaaag agaggctgtg 5700
agggttctta aaacagaagc aaatctgact cagagaataa acaacctcct agtaaactac 5760
agcttagaca gagcatctgg tggtgagtgt gctcagtgtc ctactcaact gtctggtatc 5820
agccctcatg aggacttctc ttctttccct catagacctc catctctgtt ttccttagcc 5880
tgcagaaatc tggatggcta ttcacagaat gcctgtgctt tcagagttgc attttttctc 5940
tggtattctg gttcaagcat ttgaaggtag gaaaggttct ccaagtgcaa gaaagccagc 6000
cctgagcctc aactgcctgg ctagtgtggt cagtaggatg caaaggctgt tgaatgccac 6060
aaggccaaac tttaacctgt gtaccacaag cctagcagca gaggcagctc tgctcactgg 6120
aactctctgt cttctttctc ctgagccttt tcttttcctg agttttctag ctctcctcaa 6180
ccttacctct gccctaccca ggacaaaccc aagagccact gtttctgtga tgtcctctcc 6240
agccctaatt aggcatcatg acttcagcct gaccttccat gctcagaagc agtgctaatc 6300
cacttcagat gagctgctct atgcaacaca ggcagagcct acaaaccttt gcaccagagc 6360
cctccacata tcagtgtttg ttcatactca cttcaacagc aaatgtgact gctgagatta 6420
agattttaca caagatggtc tgtaatttca cagttagttt tatcccatta ggtatgaaag 6480
aattagcata attcccctta aacatgaatg aatcttagat tttttaataa atagttttgg 6540
aagtaaagac agagacatca ggagcacaag gaatagcctg agaggacaaa cagaacaaga 6600
aagagtctgg aaatacacag gatgttcttg gcctcctcaa agcaagtgca agcagatagt 6660
accagcagcc ccaggctatc agagcccagt gaagagaagt accatgaaag ccacagctct 6720
aaccaccctg ttccagagtg acagacagtc cccaagacaa gccagcctga gccagagaga 6780
gaactgcaag agaaagtttc taatttaggt tctgttagat tcagacaagt gcaggtcatc 6840
ctctctccac agctactcac ctctccagcc taacaaagcc tgcagtccac actccaaccc 6900
tggtgtctca cctcctagcc tctcccaaca tcctgctctc tgaccatctt ctgcatctct 6960
catctcacca tctcccactg tctacagcct actcttgcaa ctaccatctc attttctgac 7020
atcctgtcta catcttctgc catactctgc catctaccat accacctctt accatctacc 7080
acaccatctt ttatctccat ccctctcaga agcctccaag ctgaatcctg ctttatgtgt 7140
tcatctcagc ccctgcatgg aaagctgacc ccagaggcag aactattccc agagagcttg 7200
gccaagaaaa acaaaactac cagcctggcc aggctcagga gtagtaagct gcagtgtctg 7260
ttgtgttcta gcttcaacag ctgcaggagt tccactctca aatgctccac atttctcaca 7320
tcctcctgat tctggtcact acccatcttc aaagaacaga atatctcaca tcagcatact 7380
gtgaaggact agtcatgggt gcagctgctc agagctgcaa agtcattctg gatggtggag 7440
agcttacaaa catttcatga tgctcccccc gctctgatgg ctggagccca atccctacac 7500
agactcctgc tgtatgtgtt ttcctttcac tctgagccac agccagaggg caggcattca 7560
gtctcctctt caggctgggg ctggggcact gagaactcac ccaacacctt gctctcactc 7620
cttctgcaaa acaagaaaga gctttgtgct gcagtagcca tgaagaatga aaggaaggct 7680
ttaactaaaa aatgtcagag attattttca accccttact gtggatcacc agcaaggagg 7740
aaacacaaca cagagacatt ttttcccctc aaattatcaa aagaatcact gcatttgtta 7800
aagagagcaa ctgaatcagg aagcagagtt ttgaacatat cagaagttag gaatctgcat 7860
cagagacaaa tgcagtcatg gttgtttgct gcataccagc cctaatcatt agaagcctca 7920
tggacttcaa acatcattcc ctctgacaag atgctctagc ctaactccat gagataaaat 7980
aaatctgcct ttcagagcca aagaagagtc caccagcttc ttctcagtgt gaacaagagc 8040
tccagtcagg ttagtcagtc cagtgcagta gaggagacca gtctgcatcc tctaattttc 8100
aaaggcaaga agatttgttt accctggaca ccaggcacaa gtgaggtcac agagctctta 8160
gatatgcagt cctcatgagt gaggagacta aagcgcatgc catcaagact tcagtgtaga 8220
gaaaacctcc aaaaaagcct cctcactact tctggaatag ctcagaggcc gaggcggcct 8280
cggcctctgc ataaataaaa aaaattagtc agccatgggg cggagaatgg gcggaactgg 8340
gcggagttag gggcgggatg ggcggagtta ggggcgggac tatggttgct gactaattga 8400
gatgcatgct ttgcatactt ctgcctgctg gggagcctgg ggactttcca cacctggttg 8460
ctgactaatt gagatgcatg ctttgcatac ttctgcctgc tggggagcct ggggactttc 8520
cacaccctaa ctgacacaca ttccacagct gcattaatga atcggccaac gcgcggggag 8580
aggcggtttg cgtattgggc gctcttccgc ttcctcgctc actgactcgc tgcgctcggt 8640
cgttcggctg cggcgagcgg tatcagctca ctcaaaggcg gtaatacggt tatccacaga 8700
atcaggggat aacgcaggaa agaacatgtg agcaaaaggc cagcaaaagg ccaggaaccg 8760
taaaaaggcc gcgttgctgg cgtttttcca taggctccgc ccccctgacg agcatcacaa 8820
aaatcgacgc tcaagtcaga ggtggcgaaa cccgacagga ctataaagat accaggcgtt 8880
tccccctgga agctccctcg tgcgctctcc tgttccgacc ctgccgctta ccggatacct 8940
gtccgccttt ctcccttcgg gaagcgtggc gctttctcat agctcacgct gtaggtatct 9000
cagttcggtg taggtcgttc gctccaagct gggctgtgtg cacgaacccc ccgttcagcc 9060
cgaccgctgc gccttatccg gtaactatcg tcttgagtcc aacccggtaa gacacgactt 9120
atcgccactg gcagcagcca ctggtaacag gattagcaga gcgaggtatg taggcggtgc 9180
tacagagttc ttgaagtggt ggcctaacta cggctacact agaagaacag tatttggtat 9240
ctgcgctctg ctgaagccag ttaccttcgg aaaaagagtt ggtagctctt gatccggcaa 9300
acaaaccacc gctggtagcg gtggtttttt tgtttgcaag cagcagatta cgcgcagaaa 9360
aaaaggatct caagaagatc ctttgatctt ttctacgggg tctgacgctc agtggaacga 9420
aaactcacgt taagggattt tggtcatgag attatcaaaa aggatcttca cctagatcct 9480
tttaaattaa aaatgaagtt ttaaatcaat ctaaagtata tatgagtaaa cttggtctga 9540
cagttaccaa tgcttaatca gtgaggcacc tatctcagcg atctgtctat ttcgttcatc 9600
catagttgcc tgactcctgc aaaccacgtt gtgtctcaaa atctctgatg ttacattgca 9660
caagataaaa atatatcatc atgaacaata aaactgtctg cttacataaa cagtaataca 9720
aggggtgtta tgagccatat tcaacgggaa acgtcttgct cgaggccgcg attaaattcc 9780
aacatggatg ctgatttata tgggtataaa tgggctcgcg ataatgtcgg gcaatcaggt 9840
gcgacaatct atcgattgta tgggaagccc gatgcgccag agttgtttct gaaacatggc 9900
aaaggtagcg ttgccaatga tgttacagat gagatggtca gactaaactg gctgacggaa 9960
tttatgcctc ttccgaccat caagcatttt atccgtactc ctgatgatgc atggttactc 10020
accactgcga tccccgggaa aacagcattc caggtattag aagaatatcc tgattcaggt 10080
gaaaatattg ttgatgcgct ggcagtgttc ctgcgccggt tgcattcgat tcctgtttgt 10140
aattgtcctt ttaacagcga tcgcgtattt cgtctcgctc aggcgcaatc acgaatgaat 10200
aacggtttgg ttgatgcgag tgattttgat gacgagcgta atggctggcc tgttgaacaa 10260
gtctggaaag aaatgcataa gcttttgcca ttctcaccgg attcagtcgt cactcatggt 10320
gatttctcac ttgataacct tatttttgac gaggggaaat taataggttg tattgatgtt 10380
ggacgagtcg gaatcgcaga ccgataccag gatcttgcca tcctatggaa ctgcctcggt 10440
gagttttctc cttcattaca gaaacggctt tttcaaaaat atggtattga taatcctgat 10500
atgaataaat tgcagtttca tttgatgctc gatgagtttt tctaagggcg gcctgccacc 10560
atacccacgc cgaaacaagc gctcatgagc ccgaagtggc gagcccgatc ttccccatcg 10620
gtgatgtcgg cgatataggc gccagcaacc gcacctgtgg cgccggtgat gagggcgcgc 10680
caagtcgacg tccggcagtc 10700
<210> 8
<211> 10700
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 8
ttggccactc cctctctgcg cgctcgctcg ctcactgagg ccgggcgacc aaaggtcgcc 60
cgacgcccgg gctttgcccg ggcggcctca gtgagcgagc gagcgcgcag agagggagtg 120
gccaactatt agatctgatg gccgcgctag ctctgggtat ttaagcccga gtgagcacgc 180
agggtctcca ttttgaagcg ggaggttacg cgttcgtcga ctactagtgg gtaccagagc 240
tccctaggtt ctagaaccgg tgacgtctcc catggtgaag cttggatctg aattcggtac 300
ctagttatta atagtaatca attacggggt cattagttca tagcccatat atggagttcc 360
gcgttacata acttacggta aatggcccgc ctggctgacc gcccaacgac ccccgcccat 420
tgacgtcaat aatgacgtat gttcccatag taacgccaat agggactttc cattgacgtc 480
aatgggtgga gtatttacgg taaactgccc acttggcagt acatcaagtg tatcatatgc 540
caagtacgcc ccctattgac gtcaatgacg gtaaatggcc cgcctggcat tatgcccagt 600
acatgacctt atgggacttt cctacttggc agtacatcta cgtattagtc atcgctatta 660
ccatggtcga ggtgagcccc acgttctgct tcactctccc catctccccc ccctccccac 720
ccccaatttt gtatttattt attttttaat tattttgtgc agcgatgggg gcgggggggg 780
ggggggggcg cgcgccaggc ggggcggggc ggggcgaggg gcggggcggg gcgaggcgga 840
gaggtgcggc ggcagccaat cagagcggcg cgctccgaaa gtttcctttt atggcgaggc 900
ggcggcggcg gcggccctat aaaaagcgaa gcgcgcggcg ggcgggagtc gctgcgacgc 960
tgccttcgcc ccgtgccccg ctccgccgcc gcctcgcgcc gcccgccccg gctctgactg 1020
accgcgttac tcccacaggt gagcgggcgg gacggccctt ctcctccggg ctgtaattag 1080
cgcttggttt aatgacggct tgtttctttt ctgtggctgc gtgaaagcct tgaggggctc 1140
cgggagctag agcctctgct aaccatgttc atgccttctt ctttttccta cagctcctgg 1200
gcaacgtgct ggttattgtg ctgtctcatc attttggcaa agaattcctc gaagatccga 1260
agggaaagtc ttccacgact gtgggatccg ttcgaagata tcaccggttg agccaccatg 1320
gaattcagca gccccagcag agaggaatgc cccaagcctc tgagccgggt gtcaatcatg 1380
gccggatctc tgacaggact gctgctgctt caggccgtgt cttgggcttc tggcgctaga 1440
ccttgcatcc ccaagagctt cggctacagc agcgtcgtgt gcgtgtgcaa tgccacctac 1500
tgcgacagct tcgaccctcc tacctttcct gctctgggca ccttcagcag atacgagagc 1560
accagatccg gcagacggat ggaactgagc atgggaccca tccaggccaa tcacacaggc 1620
actggcctgc tgctgacact gcagcctgag cagaaattcc agaaagtgaa aggcttcggc 1680
ggagccatga cagatgccgc cgctctgaat atcctggctc tgtctccacc agctcagaac 1740
ctgctgctca agagctactt cagcgaggaa ggcatcggct acaacatcat cagagtgccc 1800
atggccagct gcgacttcag catcaggacc tacacctacg ccgacacacc cgacgatttc 1860
cagctgcaca acttcagcct gcctgaagag gacaccaagc tgaagatccc tctgatccac 1920
agagccctgc agctggcaca aagacccgtg tcactgctgg cctctccatg gacatctccc 1980
acctggctga aaacaaatgg cgccgtgaat ggcaagggca gcctgaaagg ccaacctggc 2040
gacatctacc accagacctg ggccagatac ttcgtgaagt tcctggacgc ctatgccgag 2100
cacaagctgc agttttgggc cgtgacagcc gagaacgaac cttctgctgg actgctgagc 2160
ggctacccct ttcagtgcct gggctttaca cccgagcacc agcgggactt tatcgcccgt 2220
gatctgggac ccacactggc caatagcacc caccataatg tgcggctgct gatgctggac 2280
gaccagagac tgcttctgcc ccactgggct aaagtggtgc tgacagatcc tgaggccgcc 2340
aaatacgtgc acggaatcgc cgtgcactgg tatctggact ttctggcccc tgccaaggcc 2400
acactgggag agacacacag actgttcccc aacaccatgc tgttcgccag cgaagcctgt 2460
gtgggcagca agttttggga acagagcgtg cggctcggca gctgggatag aggcatgcag 2520
tacagccaca gcatcatcac caacctgctg taccacgtcg tcggctggac cgactggaat 2580
ctggccctga atcctgaagg cggccctaac tgggtccgaa acttcgtgga cagcccccatc 2640
atcgtggaca tcaccaagga caccttctac aagcagccca tgttctacca cctgggacac 2700
ttcagcaagt tcatccccga gggctctcag cgcgttggac tggtggcttc ccagaagaac 2760
gatctggacg ccgtggctct gatgcaccct gatggatctg ctgtggtggt ggtcctgaac 2820
cgcagcagca aagatgtgcc cctgaccatc aaggatcccg ccgtgggatt cctggaaaca 2880
atcagccctg gctactccat ccacacctac ctgtggcgta gacagtgaca attgttaatt 2940
aagtttaaac cctcgaggcc gcaagcttat cgataatcaa cctctggatt acaaaatttg 3000
tgaaagattg actggtattc ttaactatgt tgctcctttt acgctatgtg gatacgctgc 3060
tttaatgcct ttgtatcatg ctattgcttc ccgtatggct ttcattttct cctccttgta 3120
taaatcctgg ttgctgtctc tttatgagga gttgtggccc gttgtcaggc aacgtggcgt 3180
ggtgtgcact gtgtttgctg acgcaacccc cactggttgg ggcattgcca ccacctgtca 3240
gctcctttcc gggactttcg ctttccccct ccctattgcc acggcggaac tcatcgccgc 3300
ctgccttgcc cgctgctgga caggggctcg gctgttgggc actgacaatt ccgtggtgtt 3360
gtcggggaaa tcatcgtcct ttccttggct gctcgcctgt gttgccacct ggattctgcg 3420
cgggacgtcc ttctgctacg tcccttcggc cctcaatcca gcggaccttc cttcccgcgg 3480
cctgctgccg gctctgcggc ctcttccgcg tcttcgcctt cgccctcaga cgagtcggat 3540
ctccctttgg gccgcctccc cgcatcgata ccgtcgacta gagctcgctg atcagcctcg 3600
actgtgcctt ctagttgcca gccatctgtt gtttgcccct cccccgtgcc ttccttgacc 3660
ctggaaggtg ccactcccac tgtcctttcc taataaaatg aggaaattgc atcgcattgt 3720
ctgagtaggt gtcattctat tctggggggt ggggtggggc aggacagcaa gggggaggat 3780
tgggaagaca atagcaggca tgctggggag agatccacga taacaaacag cttttttggg 3840
gtgaacatat tgactgaatt ccctgcaggt tggccactcc ctctctgcgc gctcgctcgc 3900
tcactgaggc cgcccgggca aagcccgggc gtcgggcgac ctttggtcgc ccggcctcag 3960
tgagcgagcg agcgcgcaga gagggagtgg ccaactccat cactaggggt tcctgcggcc 4020
gctcgtacgg tctcgaggaa ttcctgcagg ataacttgcc aacctcattc taaaatgtat 4080
atagaagccc aaaagacaat aacaaaaata ttcttgtaga acaaaatggg aaagaatgtt 4140
ccactaaata tcaagattta gagcaaagca tgagatgtgt ggggatagac agtgaggctg 4200
ataaaataga gtagagctca gaaacagacc cattgatata tgtaagtgac ctatgaaaaa 4260
aatatggcat tttacaatgg gaaaatgatg gtctttttct tttttagaaa aacagggaaa 4320
tatatttata tgtaaaaaat aaaagggaac ccatatgtca taccatacac acaaaaaaat 4380
tccagtgaat tataagtcta aatggagaag gcaaaacttt aaatctttta gaaaataata 4440
tagaagcatg cagaccagcc tggccaacat gatgaaaccc tctctactaa taataaaatc 4500
agtagaacta ctcaggacta ctttgagtgg gaagtccttt tctatgaaga cttctttggc 4560
caaaattagg ctctaaatgc aaggagatag tgcatcatgc ctggctgcac ttactgataa 4620
atgatgttat caccatcttt aaccaaatgc acaggaacaa gttatggtac tgatgtgctg 4680
gattgagaag gagctctact tccttgacag gacacatttg tatcaactta aaaaagcaga 4740
tttttgccag cagaactatt cattcagagg taggaaactt agaatagatg atgtcactga 4800
ttagcatggc ttccccatct ccacagctgc ttcccaccca ggttgcccac agttgagttt 4860
gtccagtgct cagggctgcc cactctcagt aagaagcccc acaccagccc ctctccaaat 4920
atgttggctg ttccttccat taaagtgacc ccactttaga gcagcaagtg gatttctgtt 4980
tcttacagtt caggaaggag gagtcagctg tgagaacctg gagcctgaga tgcttctaag 5040
tcccactgct actggggtca gggaagccag actccagcat cagcagtcag gagcactaag 5100
cccttgccaa catcctgttt ctcagagaaa ctgcttccat tataatggtt gtcctttttt 5160
aagctatcaa gccaaacaac cagtgtctac cattattctc atcacctgaa gccaagggtt 5220
ctagcaaaag tcaagctgtc ttgtaatggt tgatgtgcct ccagcttctg tcttcagtca 5280
ctccactctt agcctgctct gaatcaactc tgaccacagt tccctggagc ccctgccacc 5340
tgctgcccct gccaccttct ccatctgcag tgctgtgcag ccttctgcac tcttgcagag 5400
ctaataggtg gagacttgaa ggaagaggag gaaagtttct cataatagcc ttgctgcaag 5460
ctcaaatggg aggtgggcac tgtgcccagg agccttggag caaaggctgt gcccaacctc 5520
tgactgcatc caggtttggt cttgacagag ataagaagcc ctggcttttg gagccaaaat 5580
ctaggtcaga cttaggcagg attctcaaag tttatcagca gaacatgagg cagaagaccc 5640
tttctgctcc agcttcttca ggctcaacct tcatcagaat agatagaaag agaggctgtg 5700
agggttctta aaacagaagc aaatctgact cagagaataa acaacctcct agtaaactac 5760
agcttagaca gagcatctgg tggtgagtgt gctcagtgtc ctactcaact gtctggtatc 5820
agccctcatg aggacttctc ttctttccct catagacctc catctctgtt ttccttagcc 5880
tgcagaaatc tggatggcta ttcacagaat gcctgtgctt tcagagttgc attttttctc 5940
tggtattctg gttcaagcat ttgaaggtag gaaaggttct ccaagtgcaa gaaagccagc 6000
cctgagcctc aactgcctgg ctagtgtggt cagtaggatg caaaggctgt tgaatgccac 6060
aaggccaaac tttaacctgt gtaccacaag cctagcagca gaggcagctc tgctcactgg 6120
aactctctgt cttctttctc ctgagccttt tcttttcctg agttttctag ctctcctcaa 6180
ccttacctct gccctaccca ggacaaaccc aagagccact gtttctgtga tgtcctctcc 6240
agccctaatt aggcatcatg acttcagcct gaccttccat gctcagaagc agtgctaatc 6300
cacttcagat gagctgctct atgcaacaca ggcagagcct acaaaccttt gcaccagagc 6360
cctccacata tcagtgtttg ttcatactca cttcaacagc aaatgtgact gctgagatta 6420
agattttaca caagatggtc tgtaatttca cagttagttt tatcccatta ggtatgaaag 6480
aattagcata attcccctta aacatgaatg aatcttagat tttttaataa atagttttgg 6540
aagtaaagac agagacatca ggagcacaag gaatagcctg agaggacaaa cagaacaaga 6600
aagagtctgg aaatacacag gatgttcttg gcctcctcaa agcaagtgca agcagatagt 6660
accagcagcc ccaggctatc agagcccagt gaagagaagt accatgaaag ccacagctct 6720
aaccaccctg ttccagagtg acagacagtc cccaagacaa gccagcctga gccagagaga 6780
gaactgcaag agaaagtttc taatttaggt tctgttagat tcagacaagt gcaggtcatc 6840
ctctctccac agctactcac ctctccagcc taacaaagcc tgcagtccac actccaaccc 6900
tggtgtctca cctcctagcc tctcccaaca tcctgctctc tgaccatctt ctgcatctct 6960
catctcacca tctcccactg tctacagcct actcttgcaa ctaccatctc attttctgac 7020
atcctgtcta catcttctgc catactctgc catctaccat accacctctt accatctacc 7080
acaccatctt ttatctccat ccctctcaga agcctccaag ctgaatcctg ctttatgtgt 7140
tcatctcagc ccctgcatgg aaagctgacc ccagaggcag aactattccc agagagcttg 7200
gccaagaaaa acaaaactac cagcctggcc aggctcagga gtagtaagct gcagtgtctg 7260
ttgtgttcta gcttcaacag ctgcaggagt tccactctca aatgctccac atttctcaca 7320
tcctcctgat tctggtcact acccatcttc aaagaacaga atatctcaca tcagcatact 7380
gtgaaggact agtcatgggt gcagctgctc agagctgcaa agtcattctg gatggtggag 7440
agcttacaaa catttcatga tgctcccccc gctctgatgg ctggagccca atccctacac 7500
agactcctgc tgtatgtgtt ttcctttcac tctgagccac agccagaggg caggcattca 7560
gtctcctctt caggctgggg ctggggcact gagaactcac ccaacacctt gctctcactc 7620
cttctgcaaa acaagaaaga gctttgtgct gcagtagcca tgaagaatga aaggaaggct 7680
ttaactaaaa aatgtcagag attattttca accccttact gtggatcacc agcaaggagg 7740
aaacacaaca cagagacatt ttttcccctc aaattatcaa aagaatcact gcatttgtta 7800
aagagagcaa ctgaatcagg aagcagagtt ttgaacatat cagaagttag gaatctgcat 7860
cagagacaaa tgcagtcatg gttgtttgct gcataccagc cctaatcatt agaagcctca 7920
tggacttcaa acatcattcc ctctgacaag atgctctagc ctaactccat gagataaaat 7980
aaatctgcct ttcagagcca aagaagagtc caccagcttc ttctcagtgt gaacaagagc 8040
tccagtcagg ttagtcagtc cagtgcagta gaggagacca gtctgcatcc tctaattttc 8100
aaaggcaaga agatttgttt accctggaca ccaggcacaa gtgaggtcac agagctctta 8160
gatatgcagt cctcatgagt gaggagacta aagcgcatgc catcaagact tcagtgtaga 8220
gaaaacctcc aaaaaagcct cctcactact tctggaatag ctcagaggcc gaggcggcct 8280
cggcctctgc ataaataaaa aaaattagtc agccatgggg cggagaatgg gcggaactgg 8340
gcggagttag gggcgggatg ggcggagtta ggggcgggac tatggttgct gactaattga 8400
gatgcatgct ttgcatactt ctgcctgctg gggagcctgg ggactttcca cacctggttg 8460
ctgactaatt gagatgcatg ctttgcatac ttctgcctgc tggggagcct ggggactttc 8520
cacaccctaa ctgacacaca ttccacagct gcattaatga atcggccaac gcgcggggag 8580
aggcggtttg cgtattgggc gctcttccgc ttcctcgctc actgactcgc tgcgctcggt 8640
cgttcggctg cggcgagcgg tatcagctca ctcaaaggcg gtaatacggt tatccacaga 8700
atcaggggat aacgcaggaa agaacatgtg agcaaaaggc cagcaaaagg ccaggaaccg 8760
taaaaaggcc gcgttgctgg cgtttttcca taggctccgc ccccctgacg agcatcacaa 8820
aaatcgacgc tcaagtcaga ggtggcgaaa cccgacagga ctataaagat accaggcgtt 8880
tccccctgga agctccctcg tgcgctctcc tgttccgacc ctgccgctta ccggatacct 8940
gtccgccttt ctcccttcgg gaagcgtggc gctttctcat agctcacgct gtaggtatct 9000
cagttcggtg taggtcgttc gctccaagct gggctgtgtg cacgaacccc ccgttcagcc 9060
cgaccgctgc gccttatccg gtaactatcg tcttgagtcc aacccggtaa gacacgactt 9120
atcgccactg gcagcagcca ctggtaacag gattagcaga gcgaggtatg taggcggtgc 9180
tacagagttc ttgaagtggt ggcctaacta cggctacact agaagaacag tatttggtat 9240
ctgcgctctg ctgaagccag ttaccttcgg aaaaagagtt ggtagctctt gatccggcaa 9300
acaaaccacc gctggtagcg gtggtttttt tgtttgcaag cagcagatta cgcgcagaaa 9360
aaaaggatct caagaagatc ctttgatctt ttctacgggg tctgacgctc agtggaacga 9420
aaactcacgt taagggattt tggtcatgag attatcaaaa aggatcttca cctagatcct 9480
tttaaattaa aaatgaagtt ttaaatcaat ctaaagtata tatgagtaaa cttggtctga 9540
cagttaccaa tgcttaatca gtgaggcacc tatctcagcg atctgtctat ttcgttcatc 9600
catagttgcc tgactcctgc aaaccacgtt gtgtctcaaa atctctgatg ttacattgca 9660
caagataaaa atatatcatc atgaacaata aaactgtctg cttacataaa cagtaataca 9720
aggggtgtta tgagccatat tcaacgggaa acgtcttgct cgaggccgcg attaaattcc 9780
aacatggatg ctgatttata tgggtataaa tgggctcgcg ataatgtcgg gcaatcaggt 9840
gcgacaatct atcgattgta tgggaagccc gatgcgccag agttgtttct gaaacatggc 9900
aaaggtagcg ttgccaatga tgttacagat gagatggtca gactaaactg gctgacggaa 9960
tttatgcctc ttccgaccat caagcatttt atccgtactc ctgatgatgc atggttactc 10020
accactgcga tccccgggaa aacagcattc caggtattag aagaatatcc tgattcaggt 10080
gaaaatattg ttgatgcgct ggcagtgttc ctgcgccggt tgcattcgat tcctgtttgt 10140
aattgtcctt ttaacagcga tcgcgtattt cgtctcgctc aggcgcaatc acgaatgaat 10200
aacggtttgg ttgatgcgag tgattttgat gacgagcgta atggctggcc tgttgaacaa 10260
gtctggaaag aaatgcataa gcttttgcca ttctcaccgg attcagtcgt cactcatggt 10320
gatttctcac ttgataacct tatttttgac gaggggaaat taataggttg tattgatgtt 10380
ggacgagtcg gaatcgcaga ccgataccag gatcttgcca tcctatggaa ctgcctcggt 10440
gagttttctc cttcattaca gaaacggctt tttcaaaaat atggtattga taatcctgat 10500
atgaataaat tgcagtttca tttgatgctc gatgagtttt tctaagggcg gcctgccacc 10560
atacccacgc cgaaacaagc gctcatgagc ccgaagtggc gagcccgatc ttccccatcg 10620
gtgatgtcgg cgatataggc gccagcaacc gcacctgtgg cgccggtgat gagggcgcgc 10680
caagtcgacg tccggcagtc 10700
<210> 9
<211> 10700
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 9
ttggccactc cctctctgcg cgctcgctcg ctcactgagg ccgggcgacc aaaggtcgcc 60
cgacgcccgg gctttgcccg ggcggcctca gtgagcgagc gagcgcgcag agagggagtg 120
gccaactcca tcactagggg ttcctgctag ctctgggtat ttaagcccga gtgagcacgc 180
agggtctcca ttttgaagcg ggaggttacg cgttcgtcga ctactagtgg gtaccagagc 240
tccctaggtt ctagaaccgg tgacgtctcc catggtgaag cttggatctg aattcggtac 300
ctagttatta atagtaatca attacggggt cattagttca tagcccatat atggagttcc 360
gcgttacata acttacggta aatggcccgc ctggctgacc gcccaacgac ccccgcccat 420
tgacgtcaat aatgacgtat gttcccatag taacgccaat agggactttc cattgacgtc 480
aatgggtgga gtatttacgg taaactgccc acttggcagt acatcaagtg tatcatatgc 540
caagtacgcc ccctattgac gtcaatgacg gtaaatggcc cgcctggcat tatgcccagt 600
acatgacctt atgggacttt cctacttggc agtacatcta cgtattagtc atcgctatta 660
ccatggtcga ggtgagcccc acgttctgct tcactctccc catctccccc ccctccccac 720
ccccaatttt gtatttattt attttttaat tattttgtgc agcgatgggg gcgggggggg 780
ggggggggcg cgcgccaggc ggggcggggc ggggcgaggg gcggggcggg gcgaggcgga 840
gaggtgcggc ggcagccaat cagagcggcg cgctccgaaa gtttcctttt atggcgaggc 900
ggcggcggcg gcggccctat aaaaagcgaa gcgcgcggcg ggcgggagtc gctgcgacgc 960
tgccttcgcc ccgtgccccg ctccgccgcc gcctcgcgcc gcccgccccg gctctgactg 1020
accgcgttac tcccacaggt gagcgggcgg gacggccctt ctcctccggg ctgtaattag 1080
cgcttggttt aatgacggct tgtttctttt ctgtggctgc gtgaaagcct tgaggggctc 1140
cgggagctag agcctctgct aaccatgttc atgccttctt ctttttccta cagctcctgg 1200
gcaacgtgct ggttattgtg ctgtctcatc attttggcaa agaattcctc gaagatccga 1260
agggaaagtc ttccacgact gtgggatccg ttcgaagata tcaccggttg agccaccatg 1320
gaattcagca gccccagcag agaggaatgc cccaagcctc tgagccgggt gtcaatcatg 1380
gccggatctc tgacaggact gctgctgctt caggccgtgt cttgggcttc tggcgctaga 1440
ccttgcatcc ccaagagctt cggctacagc agcgtcgtgt gcgtgtgcaa tgccacctac 1500
tgcgacagct tcgaccctcc tacctttcct gctctgggca ccttcagcag atacgagagc 1560
accagatccg gcagacggat ggaactgagc atgggaccca tccaggccaa tcacacaggc 1620
actggcctgc tgctgacact gcagcctgag cagaaattcc agaaagtgaa aggcttcggc 1680
ggagccatga cagatgccgc cgctctgaat atcctggctc tgtctccacc agctcagaac 1740
ctgctgctca agagctactt cagcgaggaa ggcatcggct acaacatcat cagagtgccc 1800
atggccagct gcgacttcag catcaggacc tacacctacg ccgacacacc cgacgatttc 1860
cagctgcaca acttcagcct gcctgaagag gacaccaagc tgaagatccc tctgatccac 1920
agagccctgc agctggcaca aagacccgtg tcactgctgg cctctccatg gacatctccc 1980
acctggctga aaacaaatgg cgccgtgaat ggcaagggca gcctgaaagg ccaacctggc 2040
gacatctacc accagacctg ggccagatac ttcgtgaagt tcctggacgc ctatgccgag 2100
cacaagctgc agttttgggc cgtgacagcc gagaacgaac cttctgctgg actgctgagc 2160
ggctacccct ttcagtgcct gggctttaca cccgagcacc agcgggactt tatcgcccgt 2220
gatctgggac ccacactggc caatagcacc caccataatg tgcggctgct gatgctggac 2280
gaccagagac tgcttctgcc ccactgggct aaagtggtgc tgacagatcc tgaggccgcc 2340
aaatacgtgc acggaatcgc cgtgcactgg tatctggact ttctggcccc tgccaaggcc 2400
acactgggag agacacacag actgttcccc aacaccatgc tgttcgccag cgaagcctgt 2460
gtgggcagca agttttggga acagagcgtg cggctcggca gctgggatag aggcatgcag 2520
tacagccaca gcatcatcac caacctgctg taccacgtcg tcggctggac cgactggaat 2580
ctggccctga atcctgaagg cggccctaac tgggtccgaa acttcgtgga cagcccccatc 2640
atcgtggaca tcaccaagga caccttctac aagcagccca tgttctacca cctgggacac 2700
ttcagcaagt tcatccccga gggctctcag cgcgttggac tggtggcttc ccagaagaac 2760
gatctggacg ccgtggctct gatgcaccct gatggatctg ctgtggtggt ggtcctgaac 2820
cgcagcagca aagatgtgcc cctgaccatc aaggatcccg ccgtgggatt cctggaaaca 2880
atcagccctg gctactccat ccacacctac ctgtggcgta gacagtgaca attgttaatt 2940
aagtttaaac cctcgaggcc gcaagcttat cgataatcaa cctctggatt acaaaatttg 3000
tgaaagattg actggtattc ttaactatgt tgctcctttt acgctatgtg gatacgctgc 3060
tttaatgcct ttgtatcatg ctattgcttc ccgtatggct ttcattttct cctccttgta 3120
taaatcctgg ttgctgtctc tttatgagga gttgtggccc gttgtcaggc aacgtggcgt 3180
ggtgtgcact gtgtttgctg acgcaacccc cactggttgg ggcattgcca ccacctgtca 3240
gctcctttcc gggactttcg ctttccccct ccctattgcc acggcggaac tcatcgccgc 3300
ctgccttgcc cgctgctgga caggggctcg gctgttgggc actgacaatt ccgtggtgtt 3360
gtcggggaaa tcatcgtcct ttccttggct gctcgcctgt gttgccacct ggattctgcg 3420
cgggacgtcc ttctgctacg tcccttcggc cctcaatcca gcggaccttc cttcccgcgg 3480
cctgctgccg gctctgcggc ctcttccgcg tcttcgcctt cgccctcaga cgagtcggat 3540
ctccctttgg gccgcctccc cgcatcgata ccgtcgacta gagctcgctg atcagcctcg 3600
actgtgcctt ctagttgcca gccatctgtt gtttgcccct cccccgtgcc ttccttgacc 3660
ctggaaggtg ccactcccac tgtcctttcc taataaaatg aggaaattgc atcgcattgt 3720
ctgagtaggt gtcattctat tctggggggt ggggtggggc aggacagcaa gggggaggat 3780
tgggaagaca atagcaggca tgctggggag agatccacga taacaaacag cttttttggg 3840
gtgaacatat tgactgaatt ccctgcaggt tggccactcc ctctctgcgc gctcgctcgc 3900
tcactgaggc cgcccgggca aagcccgggc gtcgggcgac ctttggtcgc ccggcctcag 3960
tgagcgagcg agcgcgcaga gagggagtgg ccaactccat cactaggggt tcctgcggcc 4020
gctcgtacgg tctcgaggaa ttcctgcagg ataacttgcc aacctcattc taaaatgtat 4080
atagaagccc aaaagacaat aacaaaaata ttcttgtaga acaaaatggg aaagaatgtt 4140
ccactaaata tcaagattta gagcaaagca tgagatgtgt ggggatagac agtgaggctg 4200
ataaaataga gtagagctca gaaacagacc cattgatata tgtaagtgac ctatgaaaaa 4260
aatatggcat tttacaatgg gaaaatgatg gtctttttct tttttagaaa aacagggaaa 4320
tatatttata tgtaaaaaat aaaagggaac ccatatgtca taccatacac acaaaaaaat 4380
tccagtgaat tataagtcta aatggagaag gcaaaacttt aaatctttta gaaaataata 4440
tagaagcatg cagaccagcc tggccaacat gatgaaaccc tctctactaa taataaaatc 4500
agtagaacta ctcaggacta ctttgagtgg gaagtccttt tctatgaaga cttctttggc 4560
caaaattagg ctctaaatgc aaggagatag tgcatcatgc ctggctgcac ttactgataa 4620
atgatgttat caccatcttt aaccaaatgc acaggaacaa gttatggtac tgatgtgctg 4680
gattgagaag gagctctact tccttgacag gacacatttg tatcaactta aaaaagcaga 4740
tttttgccag cagaactatt cattcagagg taggaaactt agaatagatg atgtcactga 4800
ttagcatggc ttccccatct ccacagctgc ttcccaccca ggttgcccac agttgagttt 4860
gtccagtgct cagggctgcc cactctcagt aagaagcccc acaccagccc ctctccaaat 4920
atgttggctg ttccttccat taaagtgacc ccactttaga gcagcaagtg gatttctgtt 4980
tcttacagtt caggaaggag gagtcagctg tgagaacctg gagcctgaga tgcttctaag 5040
tcccactgct actggggtca gggaagccag actccagcat cagcagtcag gagcactaag 5100
cccttgccaa catcctgttt ctcagagaaa ctgcttccat tataatggtt gtcctttttt 5160
aagctatcaa gccaaacaac cagtgtctac cattattctc atcacctgaa gccaagggtt 5220
ctagcaaaag tcaagctgtc ttgtaatggt tgatgtgcct ccagcttctg tcttcagtca 5280
ctccactctt agcctgctct gaatcaactc tgaccacagt tccctggagc ccctgccacc 5340
tgctgcccct gccaccttct ccatctgcag tgctgtgcag ccttctgcac tcttgcagag 5400
ctaataggtg gagacttgaa ggaagaggag gaaagtttct cataatagcc ttgctgcaag 5460
ctcaaatggg aggtgggcac tgtgcccagg agccttggag caaaggctgt gcccaacctc 5520
tgactgcatc caggtttggt cttgacagag ataagaagcc ctggcttttg gagccaaaat 5580
ctaggtcaga cttaggcagg attctcaaag tttatcagca gaacatgagg cagaagaccc 5640
tttctgctcc agcttcttca ggctcaacct tcatcagaat agatagaaag agaggctgtg 5700
agggttctta aaacagaagc aaatctgact cagagaataa acaacctcct agtaaactac 5760
agcttagaca gagcatctgg tggtgagtgt gctcagtgtc ctactcaact gtctggtatc 5820
agccctcatg aggacttctc ttctttccct catagacctc catctctgtt ttccttagcc 5880
tgcagaaatc tggatggcta ttcacagaat gcctgtgctt tcagagttgc attttttctc 5940
tggtattctg gttcaagcat ttgaaggtag gaaaggttct ccaagtgcaa gaaagccagc 6000
cctgagcctc aactgcctgg ctagtgtggt cagtaggatg caaaggctgt tgaatgccac 6060
aaggccaaac tttaacctgt gtaccacaag cctagcagca gaggcagctc tgctcactgg 6120
aactctctgt cttctttctc ctgagccttt tcttttcctg agttttctag ctctcctcaa 6180
ccttacctct gccctaccca ggacaaaccc aagagccact gtttctgtga tgtcctctcc 6240
agccctaatt aggcatcatg acttcagcct gaccttccat gctcagaagc agtgctaatc 6300
cacttcagat gagctgctct atgcaacaca ggcagagcct acaaaccttt gcaccagagc 6360
cctccacata tcagtgtttg ttcatactca cttcaacagc aaatgtgact gctgagatta 6420
agattttaca caagatggtc tgtaatttca cagttagttt tatcccatta ggtatgaaag 6480
aattagcata attcccctta aacatgaatg aatcttagat tttttaataa atagttttgg 6540
aagtaaagac agagacatca ggagcacaag gaatagcctg agaggacaaa cagaacaaga 6600
aagagtctgg aaatacacag gatgttcttg gcctcctcaa agcaagtgca agcagatagt 6660
accagcagcc ccaggctatc agagcccagt gaagagaagt accatgaaag ccacagctct 6720
aaccaccctg ttccagagtg acagacagtc cccaagacaa gccagcctga gccagagaga 6780
gaactgcaag agaaagtttc taatttaggt tctgttagat tcagacaagt gcaggtcatc 6840
ctctctccac agctactcac ctctccagcc taacaaagcc tgcagtccac actccaaccc 6900
tggtgtctca cctcctagcc tctcccaaca tcctgctctc tgaccatctt ctgcatctct 6960
catctcacca tctcccactg tctacagcct actcttgcaa ctaccatctc attttctgac 7020
atcctgtcta catcttctgc catactctgc catctaccat accacctctt accatctacc 7080
acaccatctt ttatctccat ccctctcaga agcctccaag ctgaatcctg ctttatgtgt 7140
tcatctcagc ccctgcatgg aaagctgacc ccagaggcag aactattccc agagagcttg 7200
gccaagaaaa acaaaactac cagcctggcc aggctcagga gtagtaagct gcagtgtctg 7260
ttgtgttcta gcttcaacag ctgcaggagt tccactctca aatgctccac atttctcaca 7320
tcctcctgat tctggtcact acccatcttc aaagaacaga atatctcaca tcagcatact 7380
gtgaaggact agtcatgggt gcagctgctc agagctgcaa agtcattctg gatggtggag 7440
agcttacaaa catttcatga tgctcccccc gctctgatgg ctggagccca atccctacac 7500
agactcctgc tgtatgtgtt ttcctttcac tctgagccac agccagaggg caggcattca 7560
gtctcctctt caggctgggg ctggggcact gagaactcac ccaacacctt gctctcactc 7620
cttctgcaaa acaagaaaga gctttgtgct gcagtagcca tgaagaatga aaggaaggct 7680
ttaactaaaa aatgtcagag attattttca accccttact gtggatcacc agcaaggagg 7740
aaacacaaca cagagacatt ttttcccctc aaattatcaa aagaatcact gcatttgtta 7800
aagagagcaa ctgaatcagg aagcagagtt ttgaacatat cagaagttag gaatctgcat 7860
cagagacaaa tgcagtcatg gttgtttgct gcataccagc cctaatcatt agaagcctca 7920
tggacttcaa acatcattcc ctctgacaag atgctctagc ctaactccat gagataaaat 7980
aaatctgcct ttcagagcca aagaagagtc caccagcttc ttctcagtgt gaacaagagc 8040
tccagtcagg ttagtcagtc cagtgcagta gaggagacca gtctgcatcc tctaattttc 8100
aaaggcaaga agatttgttt accctggaca ccaggcacaa gtgaggtcac agagctctta 8160
gatatgcagt cctcatgagt gaggagacta aagcgcatgc catcaagact tcagtgtaga 8220
gaaaacctcc aaaaaagcct cctcactact tctggaatag ctcagaggcc gaggcggcct 8280
cggcctctgc ataaataaaa aaaattagtc agccatgggg cggagaatgg gcggaactgg 8340
gcggagttag gggcgggatg ggcggagtta ggggcgggac tatggttgct gactaattga 8400
gatgcatgct ttgcatactt ctgcctgctg gggagcctgg ggactttcca cacctggttg 8460
ctgactaatt gagatgcatg ctttgcatac ttctgcctgc tggggagcct ggggactttc 8520
cacaccctaa ctgacacaca ttccacagct gcattaatga atcggccaac gcgcggggag 8580
aggcggtttg cgtattgggc gctcttccgc ttcctcgctc actgactcgc tgcgctcggt 8640
cgttcggctg cggcgagcgg tatcagctca ctcaaaggcg gtaatacggt tatccacaga 8700
atcaggggat aacgcaggaa agaacatgtg agcaaaaggc cagcaaaagg ccaggaaccg 8760
taaaaaggcc gcgttgctgg cgtttttcca taggctccgc ccccctgacg agcatcacaa 8820
aaatcgacgc tcaagtcaga ggtggcgaaa cccgacagga ctataaagat accaggcgtt 8880
tccccctgga agctccctcg tgcgctctcc tgttccgacc ctgccgctta ccggatacct 8940
gtccgccttt ctcccttcgg gaagcgtggc gctttctcat agctcacgct gtaggtatct 9000
cagttcggtg taggtcgttc gctccaagct gggctgtgtg cacgaacccc ccgttcagcc 9060
cgaccgctgc gccttatccg gtaactatcg tcttgagtcc aacccggtaa gacacgactt 9120
atcgccactg gcagcagcca ctggtaacag gattagcaga gcgaggtatg taggcggtgc 9180
tacagagttc ttgaagtggt ggcctaacta cggctacact agaagaacag tatttggtat 9240
ctgcgctctg ctgaagccag ttaccttcgg aaaaagagtt ggtagctctt gatccggcaa 9300
acaaaccacc gctggtagcg gtggtttttt tgtttgcaag cagcagatta cgcgcagaaa 9360
aaaaggatct caagaagatc ctttgatctt ttctacgggg tctgacgctc agtggaacga 9420
aaactcacgt taagggattt tggtcatgag attatcaaaa aggatcttca cctagatcct 9480
tttaaattaa aaatgaagtt ttaaatcaat ctaaagtata tatgagtaaa cttggtctga 9540
cagttaccaa tgcttaatca gtgaggcacc tatctcagcg atctgtctat ttcgttcatc 9600
catagttgcc tgactcctgc aaaccacgtt gtgtctcaaa atctctgatg ttacattgca 9660
caagataaaa atatatcatc atgaacaata aaactgtctg cttacataaa cagtaataca 9720
aggggtgtta tgagccatat tcaacgggaa acgtcttgct cgaggccgcg attaaattcc 9780
aacatggatg ctgatttata tgggtataaa tgggctcgcg ataatgtcgg gcaatcaggt 9840
gcgacaatct atcgattgta tgggaagccc gatgcgccag agttgtttct gaaacatggc 9900
aaaggtagcg ttgccaatga tgttacagat gagatggtca gactaaactg gctgacggaa 9960
tttatgcctc ttccgaccat caagcatttt atccgtactc ctgatgatgc atggttactc 10020
accactgcga tccccgggaa aacagcattc caggtattag aagaatatcc tgattcaggt 10080
gaaaatattg ttgatgcgct ggcagtgttc ctgcgccggt tgcattcgat tcctgtttgt 10140
aattgtcctt ttaacagcga tcgcgtattt cgtctcgctc aggcgcaatc acgaatgaat 10200
aacggtttgg ttgatgcgag tgattttgat gacgagcgta atggctggcc tgttgaacaa 10260
gtctggaaag aaatgcataa gcttttgcca ttctcaccgg attcagtcgt cactcatggt 10320
gatttctcac ttgataacct tatttttgac gaggggaaat taataggttg tattgatgtt 10380
ggacgagtcg gaatcgcaga ccgataccag gatcttgcca tcctatggaa ctgcctcggt 10440
gagttttctc cttcattaca gaaacggctt tttcaaaaat atggtattga taatcctgat 10500
atgaataaat tgcagtttca tttgatgctc gatgagtttt tctaagggcg gcctgccacc 10560
atacccacgc cgaaacaagc gctcatgagc ccgaagtggc gagcccgatc ttccccatcg 10620
gtgatgtcgg cgatataggc gccagcaacc gcacctgtgg cgccggtgat gagggcgcgc 10680
caagtcgacg tccggcagtc 10700
<210> 10
<211> 10700
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 10
ttggccactc cctctctgcg cgctcgctcg ctcactgagg ccgcccgggc aaagcccggg 60
cgtcgggcga cctttggtcg cccggcctca gtgagcgagc gagcgcgcag agagggagtg 120
gccaactcca tcactagggg ttcctgctag ctctgggtat ttaagcccga gtgagcacgc 180
agggtctcca ttttgaagcg ggaggttacg cgttcgtcga ctactagtgg gtaccagagc 240
tccctaggtt ctagaaccgg tgacgtctcc catggtgaag cttggatctg aattcggtac 300
ctagttatta atagtaatca attacggggt cattagttca tagcccatat atggagttcc 360
gcgttacata acttacggta aatggcccgc ctggctgacc gcccaacgac ccccgcccat 420
tgacgtcaat aatgacgtat gttcccatag taacgccaat agggactttc cattgacgtc 480
aatgggtgga gtatttacgg taaactgccc acttggcagt acatcaagtg tatcatatgc 540
caagtacgcc ccctattgac gtcaatgacg gtaaatggcc cgcctggcat tatgcccagt 600
acatgacctt atgggacttt cctacttggc agtacatcta cgtattagtc atcgctatta 660
ccatggtcga ggtgagcccc acgttctgct tcactctccc catctccccc ccctccccac 720
ccccaatttt gtatttattt attttttaat tattttgtgc agcgatgggg gcgggggggg 780
ggggggggcg cgcgccaggc ggggcggggc ggggcgaggg gcggggcggg gcgaggcgga 840
gaggtgcggc ggcagccaat cagagcggcg cgctccgaaa gtttcctttt atggcgaggc 900
ggcggcggcg gcggccctat aaaaagcgaa gcgcgcggcg ggcgggagtc gctgcgacgc 960
tgccttcgcc ccgtgccccg ctccgccgcc gcctcgcgcc gcccgccccg gctctgactg 1020
accgcgttac tcccacaggt gagcgggcgg gacggccctt ctcctccggg ctgtaattag 1080
cgcttggttt aatgacggct tgtttctttt ctgtggctgc gtgaaagcct tgaggggctc 1140
cgggagctag agcctctgct aaccatgttc atgccttctt ctttttccta cagctcctgg 1200
gcaacgtgct ggttattgtg ctgtctcatc attttggcaa agaattcctc gaagatccga 1260
agggaaagtc ttccacgact gtgggatccg ttcgaagata tcaccggttg agccaccatg 1320
gaattcagca gccccagcag agaggaatgc cccaagcctc tgagccgggt gtcaatcatg 1380
gccggatctc tgacaggact gctgctgctt caggccgtgt cttgggcttc tggcgctaga 1440
ccttgcatcc ccaagagctt cggctacagc agcgtcgtgt gcgtgtgcaa tgccacctac 1500
tgcgacagct tcgaccctcc tacctttcct gctctgggca ccttcagcag atacgagagc 1560
accagatccg gcagacggat ggaactgagc atgggaccca tccaggccaa tcacacaggc 1620
actggcctgc tgctgacact gcagcctgag cagaaattcc agaaagtgaa aggcttcggc 1680
ggagccatga cagatgccgc cgctctgaat atcctggctc tgtctccacc agctcagaac 1740
ctgctgctca agagctactt cagcgaggaa ggcatcggct acaacatcat cagagtgccc 1800
atggccagct gcgacttcag catcaggacc tacacctacg ccgacacacc cgacgatttc 1860
cagctgcaca acttcagcct gcctgaagag gacaccaagc tgaagatccc tctgatccac 1920
agagccctgc agctggcaca aagacccgtg tcactgctgg cctctccatg gacatctccc 1980
acctggctga aaacaaatgg cgccgtgaat ggcaagggca gcctgaaagg ccaacctggc 2040
gacatctacc accagacctg ggccagatac ttcgtgaagt tcctggacgc ctatgccgag 2100
cacaagctgc agttttgggc cgtgacagcc gagaacgaac cttctgctgg actgctgagc 2160
ggctacccct ttcagtgcct gggctttaca cccgagcacc agcgggactt tatcgcccgt 2220
gatctgggac ccacactggc caatagcacc caccataatg tgcggctgct gatgctggac 2280
gaccagagac tgcttctgcc ccactgggct aaagtggtgc tgacagatcc tgaggccgcc 2340
aaatacgtgc acggaatcgc cgtgcactgg tatctggact ttctggcccc tgccaaggcc 2400
acactgggag agacacacag actgttcccc aacaccatgc tgttcgccag cgaagcctgt 2460
gtgggcagca agttttggga acagagcgtg cggctcggca gctgggatag aggcatgcag 2520
tacagccaca gcatcatcac caacctgctg taccacgtcg tcggctggac cgactggaat 2580
ctggccctga atcctgaagg cggccctaac tgggtccgaa acttcgtgga cagccccatc 2640
atcgtggaca tcaccaagga caccttctac aagcagccca tgttctacca cctgggacac 2700
ttcagcaagt tcatccccga gggctctcag cgcgttggac tggtggcttc ccagaagaac 2760
gatctggacg ccgtggctct gatgcaccct gatggatctg ctgtggtggt ggtcctgaac 2820
cgcagcagca aagatgtgcc cctgaccatc aaggatcccg ccgtgggatt cctggaaaca 2880
atcagccctg gctactccat ccacacctac ctgtggcgta gacagtgaca attgttaatt 2940
aagtttaaac cctcgaggcc gcaagcttat cgataatcaa cctctggatt acaaaatttg 3000
tgaaagattg actggtattc ttaactatgt tgctcctttt acgctatgtg gatacgctgc 3060
tttaatgcct ttgtatcatg ctattgcttc ccgtatggct ttcattttct cctccttgta 3120
taaatcctgg ttgctgtctc tttatgagga gttgtggccc gttgtcaggc aacgtggcgt 3180
ggtgtgcact gtgtttgctg acgcaacccc cactggttgg ggcattgcca ccacctgtca 3240
gctcctttcc gggactttcg ctttccccct ccctattgcc acggcggaac tcatcgccgc 3300
ctgccttgcc cgctgctgga caggggctcg gctgttgggc actgacaatt ccgtggtgtt 3360
gtcggggaaa tcatcgtcct ttccttggct gctcgcctgt gttgccacct ggattctgcg 3420
cgggacgtcc ttctgctacg tcccttcggc cctcaatcca gcggaccttc cttcccgcgg 3480
cctgctgccg gctctgcggc ctcttccgcg tcttcgcctt cgccctcaga cgagtcggat 3540
ctccctttgg gccgcctccc cgcatcgata ccgtcgacta gagctcgctg atcagcctcg 3600
actgtgcctt ctagttgcca gccatctgtt gtttgcccct cccccgtgcc ttccttgacc 3660
ctggaaggtg ccactcccac tgtcctttcc taataaaatg aggaaattgc atcgcattgt 3720
ctgagtaggt gtcattctat tctggggggt ggggtggggc aggacagcaa gggggaggat 3780
tgggaagaca atagcaggca tgctggggag agatccacga taacaaacag cttttttggg 3840
gtgaacatat tgactgaatt ccctgcagga ggaaccccta gtgatggagt tggccactcc 3900
ctctctgcgc gctcgctcgc tcactgaggc cgcccgggca aagcccgggc gtcgggcgac 3960
ctttggtcgc ccggcctcag tgagcgagcg agcgcgcaga gagggagtgg ccaagcggcc 4020
gctcgtacgg tctcgaggaa ttcctgcagg ataacttgcc aacctcattc taaaatgtat 4080
atagaagccc aaaagacaat aacaaaaata ttcttgtaga acaaaatggg aaagaatgtt 4140
ccactaaata tcaagattta gagcaaagca tgagatgtgt ggggatagac agtgaggctg 4200
ataaaataga gtagagctca gaaacagacc cattgatata tgtaagtgac ctatgaaaaa 4260
aatatggcat tttacaatgg gaaaatgatg gtctttttct tttttagaaa aacagggaaa 4320
tatatttata tgtaaaaaat aaaagggaac ccatatgtca taccatacac acaaaaaaat 4380
tccagtgaat tataagtcta aatggagaag gcaaaacttt aaatctttta gaaaataata 4440
tagaagcatg cagaccagcc tggccaacat gatgaaaccc tctctactaa taataaaatc 4500
agtagaacta ctcaggacta ctttgagtgg gaagtccttt tctatgaaga cttctttggc 4560
caaaattagg ctctaaatgc aaggagatag tgcatcatgc ctggctgcac ttactgataa 4620
atgatgttat caccatcttt aaccaaatgc acaggaacaa gttatggtac tgatgtgctg 4680
gattgagaag gagctctact tccttgacag gacacatttg tatcaactta aaaaagcaga 4740
tttttgccag cagaactatt cattcagagg taggaaactt agaatagatg atgtcactga 4800
ttagcatggc ttccccatct ccacagctgc ttcccaccca ggttgcccac agttgagttt 4860
gtccagtgct cagggctgcc cactctcagt aagaagcccc acaccagccc ctctccaaat 4920
atgttggctg ttccttccat taaagtgacc ccactttaga gcagcaagtg gatttctgtt 4980
tcttacagtt caggaaggag gagtcagctg tgagaacctg gagcctgaga tgcttctaag 5040
tcccactgct actggggtca gggaagccag actccagcat cagcagtcag gagcactaag 5100
cccttgccaa catcctgttt ctcagagaaa ctgcttccat tataatggtt gtcctttttt 5160
aagctatcaa gccaaacaac cagtgtctac cattattctc atcacctgaa gccaagggtt 5220
ctagcaaaag tcaagctgtc ttgtaatggt tgatgtgcct ccagcttctg tcttcagtca 5280
ctccactctt agcctgctct gaatcaactc tgaccacagt tccctggagc ccctgccacc 5340
tgctgcccct gccaccttct ccatctgcag tgctgtgcag ccttctgcac tcttgcagag 5400
ctaataggtg gagacttgaa ggaagaggag gaaagtttct cataatagcc ttgctgcaag 5460
ctcaaatggg aggtgggcac tgtgcccagg agccttggag caaaggctgt gcccaacctc 5520
tgactgcatc caggtttggt cttgacagag ataagaagcc ctggcttttg gagccaaaat 5580
ctaggtcaga cttaggcagg attctcaaag tttatcagca gaacatgagg cagaagaccc 5640
tttctgctcc agcttcttca ggctcaacct tcatcagaat agatagaaag agaggctgtg 5700
agggttctta aaacagaagc aaatctgact cagagaataa acaacctcct agtaaactac 5760
agcttagaca gagcatctgg tggtgagtgt gctcagtgtc ctactcaact gtctggtatc 5820
agccctcatg aggacttctc ttctttccct catagacctc catctctgtt ttccttagcc 5880
tgcagaaatc tggatggcta ttcacagaat gcctgtgctt tcagagttgc attttttctc 5940
tggtattctg gttcaagcat ttgaaggtag gaaaggttct ccaagtgcaa gaaagccagc 6000
cctgagcctc aactgcctgg ctagtgtggt cagtaggatg caaaggctgt tgaatgccac 6060
aaggccaaac tttaacctgt gtaccacaag cctagcagca gaggcagctc tgctcactgg 6120
aactctctgt cttctttctc ctgagccttt tcttttcctg agttttctag ctctcctcaa 6180
ccttacctct gccctaccca ggacaaaccc aagagccact gtttctgtga tgtcctctcc 6240
agccctaatt aggcatcatg acttcagcct gaccttccat gctcagaagc agtgctaatc 6300
cacttcagat gagctgctct atgcaacaca ggcagagcct acaaaccttt gcaccagagc 6360
cctccacata tcagtgtttg ttcatactca cttcaacagc aaatgtgact gctgagatta 6420
agattttaca caagatggtc tgtaatttca cagttagttt tatcccatta ggtatgaaag 6480
aattagcata attcccctta aacatgaatg aatcttagat tttttaataa atagttttgg 6540
aagtaaagac agagacatca ggagcacaag gaatagcctg agaggacaaa cagaacaaga 6600
aagagtctgg aaatacacag gatgttcttg gcctcctcaa agcaagtgca agcagatagt 6660
accagcagcc ccaggctatc agagcccagt gaagagaagt accatgaaag ccacagctct 6720
aaccaccctg ttccagagtg acagacagtc cccaagacaa gccagcctga gccagagaga 6780
gaactgcaag agaaagtttc taatttaggt tctgttagat tcagacaagt gcaggtcatc 6840
ctctctccac agctactcac ctctccagcc taacaaagcc tgcagtccac actccaaccc 6900
tggtgtctca cctcctagcc tctcccaaca tcctgctctc tgaccatctt ctgcatctct 6960
catctcacca tctcccactg tctacagcct actcttgcaa ctaccatctc attttctgac 7020
atcctgtcta catcttctgc catactctgc catctaccat accacctctt accatctacc 7080
acaccatctt ttatctccat ccctctcaga agcctccaag ctgaatcctg ctttatgtgt 7140
tcatctcagc ccctgcatgg aaagctgacc ccagaggcag aactattccc agagagcttg 7200
gccaagaaaa acaaaactac cagcctggcc aggctcagga gtagtaagct gcagtgtctg 7260
ttgtgttcta gcttcaacag ctgcaggagt tccactctca aatgctccac atttctcaca 7320
tcctcctgat tctggtcact acccatcttc aaagaacaga atatctcaca tcagcatact 7380
gtgaaggact agtcatgggt gcagctgctc agagctgcaa agtcattctg gatggtggag 7440
agcttacaaa catttcatga tgctcccccc gctctgatgg ctggagccca atccctacac 7500
agactcctgc tgtatgtgtt ttcctttcac tctgagccac agccagaggg caggcattca 7560
gtctcctctt caggctgggg ctggggcact gagaactcac ccaacacctt gctctcactc 7620
cttctgcaaa acaagaaaga gctttgtgct gcagtagcca tgaagaatga aaggaaggct 7680
ttaactaaaa aatgtcagag attattttca accccttact gtggatcacc agcaaggagg 7740
aaacacaaca cagagacatt ttttcccctc aaattatcaa aagaatcact gcatttgtta 7800
aagagagcaa ctgaatcagg aagcagagtt ttgaacatat cagaagttag gaatctgcat 7860
cagagacaaa tgcagtcatg gttgtttgct gcataccagc cctaatcatt agaagcctca 7920
tggacttcaa acatcattcc ctctgacaag atgctctagc ctaactccat gagataaaat 7980
aaatctgcct ttcagagcca aagaagagtc caccagcttc ttctcagtgt gaacaagagc 8040
tccagtcagg ttagtcagtc cagtgcagta gaggagacca gtctgcatcc tctaattttc 8100
aaaggcaaga agatttgttt accctggaca ccaggcacaa gtgaggtcac agagctctta 8160
gatatgcagt cctcatgagt gaggagacta aagcgcatgc catcaagact tcagtgtaga 8220
gaaaacctcc aaaaaagcct cctcactact tctggaatag ctcagaggcc gaggcggcct 8280
cggcctctgc ataaataaaa aaaattagtc agccatgggg cggagaatgg gcggaactgg 8340
gcggagttag gggcgggatg ggcggagtta ggggcgggac tatggttgct gactaattga 8400
gatgcatgct ttgcatactt ctgcctgctg gggagcctgg ggactttcca cacctggttg 8460
ctgactaatt gagatgcatg ctttgcatac ttctgcctgc tggggagcct ggggactttc 8520
cacaccctaa ctgacacaca ttccacagct gcattaatga atcggccaac gcgcggggag 8580
aggcggtttg cgtattgggc gctcttccgc ttcctcgctc actgactcgc tgcgctcggt 8640
cgttcggctg cggcgagcgg tatcagctca ctcaaaggcg gtaatacggt tatccacaga 8700
atcaggggat aacgcaggaa agaacatgtg agcaaaaggc cagcaaaagg ccaggaaccg 8760
taaaaaggcc gcgttgctgg cgtttttcca taggctccgc ccccctgacg agcatcacaa 8820
aaatcgacgc tcaagtcaga ggtggcgaaa cccgacagga ctataaagat accaggcgtt 8880
tccccctgga agctccctcg tgcgctctcc tgttccgacc ctgccgctta ccggatacct 8940
gtccgccttt ctcccttcgg gaagcgtggc gctttctcat agctcacgct gtaggtatct 9000
cagttcggtg taggtcgttc gctccaagct gggctgtgtg cacgaacccc ccgttcagcc 9060
cgaccgctgc gccttatccg gtaactatcg tcttgagtcc aacccggtaa gacacgactt 9120
atcgccactg gcagcagcca ctggtaacag gattagcaga gcgaggtatg taggcggtgc 9180
tacagagttc ttgaagtggt ggcctaacta cggctacact agaagaacag tatttggtat 9240
ctgcgctctg ctgaagccag ttaccttcgg aaaaagagtt ggtagctctt gatccggcaa 9300
acaaaccacc gctggtagcg gtggtttttt tgtttgcaag cagcagatta cgcgcagaaa 9360
aaaaggatct caagaagatc ctttgatctt ttctacgggg tctgacgctc agtggaacga 9420
aaactcacgt taagggattt tggtcatgag attatcaaaa aggatcttca cctagatcct 9480
tttaaattaa aaatgaagtt ttaaatcaat ctaaagtata tatgagtaaa cttggtctga 9540
cagttaccaa tgcttaatca gtgaggcacc tatctcagcg atctgtctat ttcgttcatc 9600
catagttgcc tgactcctgc aaaccacgtt gtgtctcaaa atctctgatg ttacattgca 9660
caagataaaa atatatcatc atgaacaata aaactgtctg cttacataaa cagtaataca 9720
aggggtgtta tgagccatat tcaacgggaa acgtcttgct cgaggccgcg attaaattcc 9780
aacatggatg ctgatttata tgggtataaa tgggctcgcg ataatgtcgg gcaatcaggt 9840
gcgacaatct atcgattgta tgggaagccc gatgcgccag agttgtttct gaaacatggc 9900
aaaggtagcg ttgccaatga tgttacagat gagatggtca gactaaactg gctgacggaa 9960
tttatgcctc ttccgaccat caagcatttt atccgtactc ctgatgatgc atggttactc 10020
accactgcga tccccgggaa aacagcattc caggtattag aagaatatcc tgattcaggt 10080
gaaaatattg ttgatgcgct ggcagtgttc ctgcgccggt tgcattcgat tcctgtttgt 10140
aattgtcctt ttaacagcga tcgcgtattt cgtctcgctc aggcgcaatc acgaatgaat 10200
aacggtttgg ttgatgcgag tgattttgat gacgagcgta atggctggcc tgttgaacaa 10260
gtctggaaag aaatgcataa gcttttgcca ttctcaccgg attcagtcgt cactcatggt 10320
gatttctcac ttgataacct tatttttgac gaggggaaat taataggttg tattgatgtt 10380
ggacgagtcg gaatcgcaga ccgataccag gatcttgcca tcctatggaa ctgcctcggt 10440
gagttttctc cttcattaca gaaacggctt tttcaaaaat atggtattga taatcctgat 10500
atgaataaat tgcagtttca tttgatgctc gatgagtttt tctaagggcg gcctgccacc 10560
atacccacgc cgaaacaagc gctcatgagc ccgaagtggc gagcccgatc ttccccatcg 10620
gtgatgtcgg cgatataggc gccagcaacc gcacctgtgg cgccggtgat gagggcgcgc 10680
caagtcgacg tccggcagtc 10700
<210> 11
<211> 11188
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 11
ttggccactc cctctctgcg cgctcgctcg ctcactgagg ccgggcgacc aaaggtcgcc 60
cgacgcccgg gctttgcccg ggcggcctca gtgagcgagc gagcgcgcag agagggagtg 120
gccaactatt agatctgatg gccgcgctag ctctgggtat ttaagcccga gtgagcacgc 180
agggtctcca ttttgaagcg ggaggttacg cgttcgtcga ctactagtgg gtaccagagc 240
gtggtgactg agatgttttc taggaaacac aaaagataca aaaaagaaca cgtggaagga 300
tagccaaaaa ggggggctgc ccccatttcc tgcaccccgc tgcgatggct ggcaccattt 360
ggaagacttc gagatacact gttgagcgca gtaagacaac agtgtatctc gaagtcttcc 420
agatggggcc agccggtcca ctctgtatcc aggccagttc tgcaaggcgt tcgaggacca 480
cccccctccc ctcgccacca gggtggtctc atacagaact tataagattc ccaaatccaa 540
agacatttca cgtttatggt gatttcccag aacacatagc gacatgcaaa tattgcaggg 600
cgccactccc ctgtccctca cagccatctt cctgccaggg cgcacgcgcg ctgggtgttc 660
ccgcctagtg acactgggcc cgcgattcct tggagcgggt tgatgacgtc agcgtttccc 720
atggtgaatc cctaggttct agaaccggtg acgtctccca tggtgaagct tggatctgaa 780
ttcggtacct agttattaat agtaatcaat tacggggtca ttagttcata gcccatatat 840
ggagttccgc gttacataac ttacggtaaa tggcccgcct ggctgaccgc ccaacgaccc 900
ccgcccattg acgtcaataa tgacgtatgt tcccatagta acgccaatag ggactttcca 960
ttgacgtcaa tgggtggagt atttacggta aactgcccac ttggcagtac atcaagtgta 1020
tcatatgcca agtacgcccc ctattgacgt caatgacggt aaatggcccg cctggcatta 1080
tgcccagtac atgaccttat gggactttcc tacttggcag tacatctacg tattagtcat 1140
cgctattacc atggtcgagg tgagccccac gttctgcttc actctcccca tctccccccc 1200
ctccccaccc ccaattttgt atttatttat tttttaatta ttttgtgcag cgatgggggc 1260
gggggggggg ggggggcgcg cgccaggcgg ggcggggcgg ggcgaggggc ggggcggggc 1320
gaggcggaga ggtgcggcgg cagccaatca gagcggcgcg ctccgaaagt ttccttttat 1380
ggcgaggcgg cggcggcggc ggccctataa aaagcgaagc gcgcggcggg cgggagtcgc 1440
tgcgacgctg ccttcgcccc gtgccccgct ccgccgccgc ctcgcgccgc ccgccccggc 1500
tctgactgac cgcgttactc ccacaggtga gcgggcggga cggcccttct cctccgggct 1560
gtaattagcg cttggtttaa tgacggcttg tttcttttct gtggctgcgt gaaagccttg 1620
aggggctccg ggagctagag cctctgctaa ccatgttcat gccttcttct ttttcctaca 1680
gctcctgggc aacgtgctgg ttattgtgct gtctcatcat tttggcaaag aattcctcga 1740
agatccgaag ggaaagtctt ccacgactgt gggatccgtt cgaagatatc accggttgag 1800
ccaccatgga attcagcagc cccagcagag aggaatgccc caagcctctg agccgggtgt 1860
caatcatggc cggatctctg acaggactgc tgctgcttca ggccgtgtct tgggcttctg 1920
gcgctagacc ttgcatcccc aagagcttcg gctacagcag cgtcgtgtgc gtgtgcaatg 1980
ccacctactg cgacagcttc gaccctccta cctttcctgc tctgggcacc ttcagcagat 2040
acgagagcac cagatccggc agacggatgg aactgagcat gggacccatc caggccaatc 2100
acacaggcac tggcctgctg ctgacactgc agcctgagca gaaattccag aaagtgaaag 2160
gcttcggcgg agccatgaca gatgccgccg ctctgaatat cctggctctg tctccaccag 2220
ctcagaacct gctgctcaag agctacttca gcgaggaagg catcggctac aacatcatca 2280
gagtgcccat ggccagctgc gacttcagca tcaggaccta cacctacgcc gacacacccg 2340
acgatttcca gctgcacaac ttcagcctgc ctgaagagga caccaagctg aagatccctc 2400
tgatccacag agccctgcag ctggcacaaa gacccgtgtc actgctggcc tctccatgga 2460
catctcccac ctggctgaaa acaaatggcg ccgtgaatgg caagggcagc ctgaaaggcc 2520
aacctggcga catctaccac cagacctggg ccagatactt cgtgaagttc ctggacgcct 2580
atgccgagca caagctgcag ttttgggccg tgacagccga gaacgaacct tctgctggac 2640
tgctgagcgg ctaccccttt cagtgcctgg gctttacacc cgagcaccag cgggacttta 2700
tcgcccgtga tctgggaccc acactggcca atagcaccca ccataatgtg cggctgctga 2760
tgctggacga ccagagactg cttctgcccc actgggctaa agtggtgctg acagatcctg 2820
aggccgccaa atacgtgcac ggaatcgccg tgcactggta tctggacttt ctggcccctg 2880
ccaaggccac actgggagag acacacagac tgttccccaa caccatgctg ttcgccagcg 2940
aagcctgtgt gggcagcaag ttttgggaac agagcgtgcg gctcggcagc tgggatagag 3000
gcatgcagta cagccacagc atcatcacca acctgctgta ccacgtcgtc ggctggaccg 3060
actggaatct ggccctgaat cctgaaggcg gccctaactg ggtccgaaac ttcgtggaca 3120
gccccatcat cgtggacatc accaaggaca ccttctacaa gcagcccatg ttctaccacc 3180
tgggacactt cagcaagttc atccccgagg gctctcagcg cgttggactg gtggcttccc 3240
agaagaacga tctggacgcc gtggctctga tgcaccctga tggatctgct gtggtggtgg 3300
tcctgaaccg cagcagcaaa gatgtgcccc tgaccatcaa ggatcccgcc gtgggattcc 3360
tggaaacaat cagccctggc tactccatcc acacctacct gtggcgtaga cagtgacaat 3420
tgttaattaa gtttaaaccc tcgaggccgc aagcttatcg ataatcaacc tctggattac 3480
aaaatttgtg aaagattgac tggtattctt aactatgttg ctccttttac gctatgtgga 3540
tacgctgctt taatgccttt gtatcatgct attgcttccc gtatggcttt cattttctcc 3600
tccttgtata aatcctggtt gctgtctctt tatgaggagt tgtggcccgt tgtcaggcaa 3660
cgtggcgtgg tgtgcactgt gtttgctgac gcaaccccca ctggttgggg cattgccacc 3720
acctgtcagc tcctttccgg gactttcgct ttccccctcc ctattgccac ggcggaactc 3780
atcgccgcct gccttgcccg ctgctggaca ggggctcggc tgttgggcac tgacaattcc 3840
gtggtgttgt cggggaaatc atcgtccttt ccttggctgc tcgcctgtgt tgccacctgg 3900
attctgcgcg ggacgtcctt ctgctacgtc ccttcggccc tcaatccagc ggaccttcct 3960
tcccgcggcc tgctgccggc tctgcggcct cttccgcgtc ttcgccttcg ccctcagacg 4020
agtcggatct ccctttgggc cgcctccccg catcgatacc gtcgactaga gctcgctgat 4080
cagcctcgac tgtgccttct agttgccagc catctgttgt ttgcccctcc cccgtgcctt 4140
ccttgaccct ggaaggtgcc actcccactg tcctttccta ataaaatgag gaaattgcat 4200
cgcattgtct gagtaggtgt cattctattc tggggggtgg ggtggggcag gacagcaagg 4260
gggaggattg ggaagacaat agcaggcatg ctggggagag atccacgata acaaacagct 4320
tttttggggt gaacatattg actgaattcc ctgcaggttg gccactccct ctctgcgcgc 4380
tcgctcgctc actgaggccg cccgggcaaa gcccgggcgt cgggcgacct ttggtcgccc 4440
ggcctcagtg agcgagcgag cgcgcagaga gggagtggcc aactccatca ctaggggttc 4500
ctgcggccgc tcgtacggtc tcgaggaatt cctgcaggat aacttgccaa cctcattcta 4560
aaatgtatat agaagcccaa aagacaataa caaaaatatt cttgtagaac aaaatgggaa 4620
agaatgttcc actaaatatc aagatttaga gcaaagcatg agatgtgtgg ggatagacag 4680
tgaggctgat aaaatagagt agagctcaga aacagaccca ttgatatatg taagtgacct 4740
atgaaaaaaa tatggcattt tacaatggga aaatgatggt ctttttcttt tttagaaaaa 4800
cagggaaata tatttatatg taaaaaataa aagggaaccc atatgtcata ccatacacac 4860
aaaaaaattc cagtgaatta taagtctaaa tggagaaggc aaaactttaa atcttttaga 4920
aaataatata gaagcatgca gaccagcctg gccaacatga tgaaaccctc tctactaata 4980
ataaaatcag tagaactact caggactact ttgagtggga agtccttttc tatgaagact 5040
tctttggcca aaattaggct ctaaatgcaa ggagatagtg catcatgcct ggctgcactt 5100
actgataaat gatgttatca ccatctttaa ccaaatgcac aggaacaagt tatggtactg 5160
atgtgctgga ttgagaagga gctctacttc cttgacagga cacatttgta tcaacttaaa 5220
aaagcagatt tttgccagca gaactattca ttcagaggta ggaaacttag aatagatgat 5280
gtcactgatt agcatggctt ccccatctcc acagctgctt cccacccagg ttgcccacag 5340
ttgagtttgt ccagtgctca gggctgccca ctctcagtaa gaagccccac accagcccct 5400
ctccaaatat gttggctgtt ccttccatta aagtgacccc actttagagc agcaagtgga 5460
tttctgtttc ttacagttca ggaaggagga gtcagctgtg agaacctgga gcctgagatg 5520
cttctaagtc ccactgctac tggggtcagg gaagccagac tccagcatca gcagtcagga 5580
gcactaagcc cttgccaaca tcctgtttct cagagaaact gcttccatta taatggttgt 5640
ccttttttaa gctatcaagc caaacaacca gtgtctacca ttattctcat cacctgaagc 5700
caagggttct agcaaaagtc aagctgtctt gtaatggttg atgtgcctcc agcttctgtc 5760
ttcagtcact ccactcttag cctgctctga atcaactctg accacagttc cctggagccc 5820
ctgccacctg ctgcccctgc caccttctcc atctgcagtg ctgtgcagcc ttctgcactc 5880
ttgcagagct aataggtgga gacttgaagg aagaggagga aagtttctca taatagcctt 5940
gctgcaagct caaatgggag gtgggcactg tgcccaggag ccttggagca aaggctgtgc 6000
ccaacctctg actgcatcca ggtttggtct tgacagagat aagaagccct ggcttttgga 6060
gccaaaatct aggtcagact taggcaggat tctcaaagtt tatcagcaga acatgaggca 6120
gaagaccctt tctgctccag cttcttcagg ctcaaccttc atcagaatag atagaaagag 6180
aggctgtgag ggttcttaaa acagaagcaa atctgactca gagaataaac aacctcctag 6240
taaactacag cttagacaga gcatctggtg gtgagtgtgc tcagtgtcct actcaactgt 6300
ctggtatcag ccctcatgag gacttctctt ctttccctca tagacctcca tctctgtttt 6360
ccttagcctg cagaaatctg gatggctatt cacagaatgc ctgtgctttc agagttgcat 6420
tttttctctg gtattctggt tcaagcattt gaaggtagga aaggttctcc aagtgcaaga 6480
aagccagccc tgagcctcaa ctgcctggct agtgtggtca gtaggatgca aaggctgttg 6540
aatgccacaa ggccaaactt taacctgtgt accacaagcc tagcagcaga ggcagctctg 6600
ctcactggaa ctctctgtct tctttctcct gagccttttc ttttcctgag ttttctagct 6660
ctcctcaacc ttacctctgc cctacccagg acaaacccaa gagccactgt ttctgtgatg 6720
tcctctccag ccctaattag gcatcatgac ttcagcctga ccttccatgc tcagaagcag 6780
tgctaatcca cttcagatga gctgctctat gcaacacagg cagagcctac aaacctttgc 6840
accagagccc tccacatatc agtgtttgtt catactcact tcaacagcaa atgtgactgc 6900
tgagattaag attttacaca agatggtctg taatttcaca gttagtttta tcccattagg 6960
tatgaaagaa ttagcataat tccccttaaa catgaatgaa tcttagattt tttaataaat 7020
agttttggaa gtaaagacag agacatcagg agcacaagga atagcctgag aggacaaaca 7080
gaacaagaaa gagtctggaa atacacagga tgttcttggc ctcctcaaag caagtgcaag 7140
cagatagtac cagcagcccc aggctatcag agcccagtga agagaagtac catgaaagcc 7200
acagctctaa ccaccctgtt ccagagtgac agacagtccc caagacaagc cagcctgagc 7260
cagagagaga actgcaagag aaagtttcta atttaggttc tgttagattc agacaagtgc 7320
aggtcatcct ctctccacag ctactcacct ctccagccta acaaagcctg cagtccacac 7380
tccaaccctg gtgtctcacc tcctagcctc tcccaacatc ctgctctctg accatcttct 7440
gcatctctca tctcaccatc tcccactgtc tacagcctac tcttgcaact accatctcat 7500
tttctgacat cctgtctaca tcttctgcca tactctgcca tctaccatac cacctcttac 7560
catctaccac accatctttt atctccatcc ctctcagaag cctccaagct gaatcctgct 7620
ttatgtgttc atctcagccc ctgcatggaa agctgacccc agaggcagaa ctattcccag 7680
agagcttggc caagaaaaac aaaactacca gcctggccag gctcaggagt agtaagctgc 7740
agtgtctgtt gtgttctagc ttcaacagct gcaggagttc cactctcaaa tgctccacat 7800
ttctcacatc ctcctgattc tggtcactac ccatcttcaa agaacagaat atctcacatc 7860
agcatactgt gaaggactag tcatgggtgc agctgctcag agctgcaaag tcattctgga 7920
tggtggagag cttacaaaca tttcatgatg ctccccccgc tctgatggct ggagcccaat 7980
ccctacacag actcctgctg tatgtgtttt cctttcactc tgagccacag ccagagggca 8040
ggcattcagt ctcctcttca ggctggggct ggggcactga gaactcaccc aacaccttgc 8100
tctcactcct tctgcaaaac aagaaagagc tttgtgctgc agtagccatg aagaatgaaa 8160
ggaaggcttt aactaaaaaa tgtcagagat tattttcaac cccttactgt ggatcaccag 8220
caaggaggaa acacaacaca gagacatttt ttcccctcaa attatcaaaa gaatcactgc 8280
atttgttaaa gagagcaact gaatcaggaa gcagagtttt gaacatatca gaagttagga 8340
atctgcatca gagacaaatg cagtcatggt tgtttgctgc ataccagccc taatcattag 8400
aagcctcatg gacttcaaac atcattccct ctgacaagat gctctagcct aactccatga 8460
gataaaataa atctgccttt cagagccaaa gaagagtcca ccagcttctt ctcagtgtga 8520
acaagagctc cagtcaggtt agtcagtcca gtgcagtaga ggagaccagt ctgcatcctc 8580
taattttcaa aggcaagaag atttgtttac cctggacacc aggcacaagt gaggtcacag 8640
agctcttaga tatgcagtcc tcatgagtga ggagactaaa gcgcatgcca tcaagacttc 8700
agtgtagaga aaacctccaa aaaagcctcc tcactacttc tggaatagct cagaggccga 8760
ggcggcctcg gcctctgcat aaataaaaaa aattagtcag ccatggggcg gagaatgggc 8820
ggaactgggc ggagttaggg gcgggatggg cggagttagg ggcgggacta tggttgctga 8880
ctaattgaga tgcatgcttt gcatacttct gcctgctggg gagcctgggg actttccaca 8940
cctggttgct gactaattga gatgcatgct ttgcatactt ctgcctgctg gggagcctgg 9000
ggactttcca caccctaact gacacacatt ccacagctgc attaatgaat cggccaacgc 9060
gcggggagag gcggtttgcg tattgggcgc tcttccgctt cctcgctcac tgactcgctg 9120
cgctcggtcg ttcggctgcg gcgagcggta tcagctcact caaaggcggt aatacggtta 9180
tccacagaat caggggataa cgcaggaaag aacatgtgag caaaaggcca gcaaaaggcc 9240
aggaaccgta aaaaggccgc gttgctggcg tttttccata ggctccgccc ccctgacgag 9300
catcacaaaa atcgacgctc aagtcagagg tggcgaaacc cgacaggact ataaagatac 9360
caggcgtttc cccctggaag ctccctcgtg cgctctcctg ttccgaccct gccgcttacc 9420
ggatacctgt ccgcctttct cccttcggga agcgtggcgc tttctcatag ctcacgctgt 9480
aggtatctca gttcggtgta ggtcgttcgc tccaagctgg gctgtgtgca cgaacccccc 9540
gttcagcccg accgctgcgc cttatccggt aactatcgtc ttgagtccaa cccggtaaga 9600
cacgacttat cgccactggc agcagccact ggtaacagga ttagcagagc gaggtatgta 9660
ggcggtgcta cagagttctt gaagtggtgg cctaactacg gctacactag aagaacagta 9720
tttggtatct gcgctctgct gaagccagtt accttcggaa aaagagttgg tagctcttga 9780
tccggcaaac aaaccaccgc tggtagcggt ggtttttttg tttgcaagca gcagattacg 9840
cgcagaaaaa aaggatctca agaagatcct ttgatctttt ctacggggtc tgacgctcag 9900
tggaacgaaa actcacgtta agggattttg gtcatgagat tatcaaaaag gatcttcacc 9960
tagatccttt taaattaaaa atgaagtttt aaatcaatct aaagtatata tgagtaaact 10020
tggtctgaca gttaccaatg cttaatcagt gaggcaccta tctcagcgat ctgtctattt 10080
cgttcatcca tagttgcctg actcctgcaa accacgttgt gtctcaaaat ctctgatgtt 10140
acattgcaca agataaaaat atatcatcat gaacaataaa actgtctgct tacataaaca 10200
gtaatacaag gggtgttatg agccatattc aacgggaaac gtcttgctcg aggccgcgat 10260
taaattccaa catggatgct gatttatatg ggtataaatg ggctcgcgat aatgtcgggc 10320
aatcaggtgc gacaatctat cgattgtatg ggaagcccga tgcgccagag ttgtttctga 10380
aacatggcaa aggtagcgtt gccaatgatg ttacagatga gatggtcaga ctaaactggc 10440
tgacggaatt tatgcctctt ccgaccatca agcattttat ccgtactcct gatgatgcat 10500
ggttactcac cactgcgatc cccgggaaaa cagcattcca ggtattagaa gaatatcctg 10560
attcaggtga aaatattgtt gatgcgctgg cagtgttcct gcgccggttg cattcgattc 10620
ctgtttgtaa ttgtcctttt aacagcgatc gcgtatttcg tctcgctcag gcgcaatcac 10680
gaatgaataa cggtttggtt gatgcgagtg attttgatga cgagcgtaat ggctggcctg 10740
ttgaacaagt ctggaaagaa atgcataagc ttttgccatt ctcaccggat tcagtcgtca 10800
ctcatggtga tttctcactt gataacctta tttttgacga ggggaaatta ataggttgta 10860
ttgatgttgg acgagtcgga atcgcagacc gataccagga tcttgccatc ctatggaact 10920
gcctcggtga gttttctcct tcattacaga aacggctttt tcaaaaatat ggtattgata 10980
atcctgatat gaataaattg cagtttcatt tgatgctcga tgagtttttc taagggcggc 11040
ctgccaccat acccacgccg aaacaagcgc tcatgagccc gaagtggcga gcccgatctt 11100
ccccatcggt gatgtcggcg atataggcgc cagcaaccgc acctgtggcg ccggtgatga 11160
gggcgcgcca agtcgacgtc cggcagtc 11188
<210> 12
<211> 11187
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 12
ctagaaccgg tgacgtctcc catggtgaag cttggatctg aattcggtac ctagttataa 60
tagtaatcaa ttacggggtc attagttcat agcccatata tggagttccg cgttacataa 120
cttacggtaa atggcccgcc tggctgaccg cccaacgacc cccgcccatt gacgtcaata 180
atgacgtatg ttcccatagt aacgccaata gggactttcc attgacgtca atgggtggag 240
tatttacggt aaactgccca cttggcagta catcaagtgt atcatatgcc aagtacgccc 300
cctattgacg tcaatgacgg taaatggccc gcctggcatt atgcccagta catgacctta 360
tgggactttc ctacttggca gtacatctac gtattagtca tcgctattac catggtcgag 420
gtgagcccca cgttctgctt cactctcccc atctcccccc cctccccacc cccaattttg 480
tatttattta ttttttaatt attttgtgca gcgatggggg cgggggggg gggggggcgc 540
gcgccaggcg gggcggggcg gggcgagggg cggggcgggg cgaggcggag aggtgcggcg 600
gcagccaatc agagcggcgc gctccgaaag tttcctttta tggcgaggcg gcggcggcgg 660
cggccctata aaaagcgaag cgcgcggcgg gcgggagtcg ctgcgacgct gccttcgccc 720
cgtgccccgc tccgccgccg cctcgcgccg cccgccccgg ctctgactga ccgcgttact 780
cccacaggtg agcgggcggg acggcccttc tcctccgggc tgtaattagc gcttggttta 840
atgacggctt gtttcttttc tgtggctgcg tgaaagcctt gaggggctcc gggagctaga 900
gcctctgcta accatgttca tgccttcttc tttttcctac agctcctggg caacgtgctg 960
gttattgtgc tgtctcatca ttttggcaaa gaattcctcg aagatccgaa gggaaagtct 1020
tccacgactg tgggatccgt tcgaagatat caccggttga gccaccatgg aattcagcag 1080
ccccagcaga gaggaatgcc ccaagcctct gagccgggtg tcaatcatgg ccggatctct 1140
gacaggactg ctgctgcttc aggccgtgtc ttgggcttct ggcgctagac cttgcatccc 1200
caagagcttc ggctacagca gcgtcgtgtg cgtgtgcaat gccacctact gcgacagctt 1260
cgaccctcct acctttcctg ctctgggcac cttcagcaga tacgagagca ccagatccgg 1320
cagacggatg gaactgagca tgggacccat ccaggccaat cacacaggca ctggcctgct 1380
gctgacactg cagcctgagc agaaattcca gaaagtgaaa ggcttcggcg gagccatgac 1440
agatgccgcc gctctgaata tcctggctct gtctccacca gctcagaacc tgctgctcaa 1500
gagctacttc agcgaggaag gcatcggcta caacatcatc agagtgccca tggccagctg 1560
cgacttcagc atcaggacct acacctacgc cgacacaccc gacgatttcc agctgcacaa 1620
cttcagcctg cctgaagagg acaccaagct gaagatccct ctgatccaca gagccctgca 1680
gctggcacaa agacccgtgt cactgctggc ctctccatgg acatctccca cctggctgaa 1740
aacaaatggc gccgtgaatg gcaagggcag cctgaaaggc caacctggcg acatctacca 1800
ccagacctgg gccagatact tcgtgaagtt cctggacgcc tatgccgagc acaagctgca 1860
gttttgggcc gtgacagccg agaacgaacc ttctgctgga ctgctgagcg gctacccctt 1920
tcagtgcctg ggctttacac ccgagcacca gcgggacttt atcgcccgtg atctgggacc 1980
cacactggcc aatagcaccc accataatgt gcggctgctg atgctggacg accagagact 2040
gcttctgccc cactgggcta aagtggtgct gacagatcct gaggccgcca aatacgtgca 2100
cggaatcgcc gtgcactggt atctggactt tctggcccct gccaaggcca cactgggaga 2160
gacacacaga ctgttcccca acaccatgct gttcgccagc gaagcctgtg tgggcagcaa 2220
gttttgggaa cagagcgtgc ggctcggcag ctgggataga ggcatgcagt acagccacag 2280
catcatcacc aacctgctgt accacgtcgt cggctggacc gactggaatc tggccctgaa 2340
tcctgaaggc ggccctaact gggtccgaaa cttcgtggac agccccatca tcgtggacat 2400
caccaaggac accttctaca agcagcccat gttctaccac ctgggacact tcagcaagtt 2460
catccccgag ggctctcagc gcgttggact ggtggcttcc cagaagaacg atctggacgc 2520
cgtggctctg atgcaccctg atggatctgc tgtggtggtg gtcctgaacc gcagcagcaa 2580
agatgtgccc ctgaccatca aggatcccgc cgtgggattc ctggaaacaa tcagccctgg 2640
ctactccatc cacacctacc tgtggcgtag acagtgacaa ttgttaatta agtttaaacc 2700
ctcgaggccg caagcttatc gataatcaac ctctggatta caaaatttgt gaaagattga 2760
ctggtattct taactatgtt gctcctttta cgctatgtgg atacgctgct ttaatgcctt 2820
tgtatcatgc tattgcttcc cgtatggctt tcattttctc ctccttgtat aaatcctggt 2880
tgctgtctct ttatgaggag ttgtggcccg ttgtcaggca acgtggcgtg gtgtgcactg 2940
tgtttgctga cgcaaccccc actggttggg gcattgccac cacctgtcag ctcctttccg 3000
ggactttcgc tttccccctc cctattgcca cggcggaact catcgccgcc tgccttgccc 3060
gctgctggac aggggctcgg ctgttgggca ctgacaattc cgtggtgttg tcggggaaat 3120
catcgtcctt tccttggctg ctcgcctgtg ttgccacctg gattctgcgc gggacgtcct 3180
tctgctacgt cccttcggcc ctcaatccag cggaccttcc ttcccgcggc ctgctgccgg 3240
ctctgcggcc tcttccgcgt cttcgccttc gccctcagac gagtcggatc tccctttggg 3300
ccgcctcccc gcatcgatac cgtcgactag agctcgctga tcagcctcga ctgtgccttc 3360
tagttgccag ccatctgttg tttgcccctc ccccgtgcct tccttgaccc tggaaggtgc 3420
cactcccact gtcctttcct aataaaatga ggaaattgca tcgcattgtc tgagtaggtg 3480
tcattctatt ctggggggtg gggtggggca ggacagcaag ggggaggatt gggaagacaa 3540
tagcaggcat gctggggaga gatccacgat aacaaacagc ttttttgggg tgaacatatt 3600
gactgaattc cctgcaggtt ggccactccc tctctgcgcg ctcgctcgct cactgaggcc 3660
gcccgggcaa agcccgggcg tcgggcgacc tttggtcgcc cggcctcagt gagcgagcga 3720
gcgcgcagag agggagtggc caactccatc actaggggtt cctgcggccg ctcgtacggt 3780
ctcgaggaat tcctgcagga taacttgcca acctcattct aaaatgtata tagaagccca 3840
aaagacaata acaaaaatat tcttgtagaa caaaatggga aagaatgttc cactaaatat 3900
caagatttag agcaaagcat gagatgtgtg gggatagaca gtgaggctga taaaatagag 3960
tagagctcag aaacagaccc attgatatat gtaagtgacc tatgaaaaaa atatggcatt 4020
ttacaatggg aaaatgatgg tctttttctt ttttagaaaa acagggaaat atatttatat 4080
gtaaaaaata aaagggaacc catatgtcat accatacaca caaaaaaatt ccagtgaatt 4140
ataagtctaa atggagaagg caaaacttta aatcttttag aaaataatat agaagcatgc 4200
agaccagcct ggccaacatg atgaaaccct ctctactaat aataaaatca gtagaactac 4260
tcaggactac tttgagtggg aagtcctttt ctatgaagac ttctttggcc aaaattaggc 4320
tctaaatgca aggagatagt gcatcatgcc tggctgcact tactgataaa tgatgttatc 4380
accatcttta accaaatgca caggaacaag ttatggtact gatgtgctgg attgagaagg 4440
agctctactt ccttgacagg acacatttgt atcaacttaa aaaagcagat ttttgccagc 4500
agaactattc attcagaggt aggaaactta gaatagatga tgtcactgat tagcatggct 4560
tccccatctc cacagctgct tccccacccag gttgcccaca gttgagtttg tccagtgctc 4620
agggctgccc actctcagta agaagcccca caccagcccc tctccaaata tgttggctgt 4680
tccttccatt aaagtgaccc cactttagag cagcaagtgg atttctgttt cttacagttc 4740
aggaaggagg agtcagctgt gagaacctgg agcctgagat gcttctaagt cccactgcta 4800
ctggggtcag ggaagccaga ctccagcatc agcagtcagg agcactaagc ccttgccaac 4860
atcctgtttc tcagagaaac tgcttccatt ataatggttg tcctttttta agctatcaag 4920
ccaaacaacc agtgtctacc attattctca tcacctgaag ccaagggttc tagcaaaagt 4980
caagctgtct tgtaatggtt gatgtgcctc cagcttctgt cttcagtcac tccactctta 5040
gcctgctctg aatcaactct gaccacagtt ccctggagcc cctgccacct gctgcccctg 5100
ccaccttctc catctgcagt gctgtgcagc cttctgcact cttgcagagc taataggtgg 5160
agacttgaag gaagaggagg aaagtttctc ataatagcct tgctgcaagc tcaaatggga 5220
ggtgggcact gtgcccagga gccttggagc aaaggctgtg cccaacctct gactgcatcc 5280
aggtttggtc ttgacagaga taagaagccc tggcttttgg agccaaaatc taggtcagac 5340
ttaggcagga ttctcaaagt ttatcagcag aacatgaggc agaagaccct ttctgctcca 5400
gcttcttcag gctcaacctt catcagaata gatagaaaga gaggctgtga gggttcttaa 5460
aacagaagca aatctgactc agagaataaa caacctccta gtaaactaca gcttagacag 5520
agcatctggt ggtgagtgtg ctcagtgtcc tactcaactg tctggtatca gccctcatga 5580
ggacttctct tctttccctc atagacctcc atctctgttt tccttagcct gcagaaatct 5640
ggatggctat tcacagaatg cctgtgcttt cagagttgca ttttttctct ggtattctgg 5700
ttcaagcatt tgaaggtagg aaaggttctc caagtgcaag aaagccagcc ctgagcctca 5760
actgcctggc tagtgtggtc agtaggatgc aaaggctgtt gaatgccaca aggccaaact 5820
ttaacctgtg taccacaagc ctagcagcag aggcagctct gctcactgga actctctgtc 5880
ttctttctcc tgagcctttt cttttcctga gttttctagc tctcctcaac cttacctctg 5940
ccctacccag gacaaaccca agagccactg tttctgtgat gtcctctcca gccctaatta 6000
ggcatcatga cttcagcctg accttccatg ctcagaagca gtgctaatcc acttcagatg 6060
agctgctcta tgcaacacag gcagagccta caaacctttg caccagagcc ctccacatat 6120
cagtgtttgt tcatactcac ttcaacagca aatgtgactg ctgagattaa gattttacac 6180
aagatggtct gtaatttcac agttagtttt atcccattag gtatgaaaga attagcataa 6240
ttccccttaa acatgaatga atcttagatt ttttaataaa tagttttgga agtaaagaca 6300
gagacatcag gagcacaagg aatagcctga gaggacaaac agaacaagaa agagtctgga 6360
aatacacagg atgttcttgg cctcctcaaa gcaagtgcaa gcagatagta ccagcagccc 6420
caggctatca gagcccagtg aagagaagta ccatgaaagc cacagctcta accaccctgt 6480
tccagagtga cagacagtcc ccaagacaag ccagcctgag ccagagagag aactgcaaga 6540
gaaagtttct aatttaggtt ctgttagatt cagacaagtg caggtcatcc tctctccaca 6600
gctactcacc tctccagcct aacaaagcct gcagtccaca ctccaaccct ggtgtctcac 6660
ctcctagcct ctcccaacat cctgctctct gaccatcttc tgcatctctc atctcaccat 6720
ctcccactgt ctacagccta ctcttgcaac taccatctca ttttctgaca tcctgtctac 6780
atcttctgcc atactctgcc atctaccata ccacctctta ccatctacca caccatcttt 6840
tatctccatc cctctcagaa gcctccaagc tgaatcctgc tttatgtgtt catctcagcc 6900
cctgcatgga aagctgaccc cagaggcaga actattccca gagagcttgg ccaagaaaaa 6960
caaaactacc agcctggcca ggctcaggag tagtaagctg cagtgtctgt tgtgttctag 7020
cttcaacagc tgcaggagtt ccactctcaa atgctccaca tttctcacat cctcctgatt 7080
ctggtcacta cccatcttca aagaacagaa tatctcacat cagcatactg tgaaggacta 7140
gtcatgggtg cagctgctca gagctgcaaa gtcattctgg atggtggaga gcttacaaac 7200
atttcatgat gctccccccg ctctgatggc tggagcccaa tccctacaca gactcctgct 7260
gtatgtgttt tcctttcact ctgagccaca gccagagggc aggcattcag tctcctcttc 7320
aggctggggc tggggcactg agaactcacc caacaccttg ctctcactcc ttctgcaaaa 7380
caagaaagag ctttgtgctg cagtagccat gaagaatgaa aggaaggctt taactaaaaa 7440
atgtcagaga ttattttcaa ccccttactg tggatcacca gcaaggagga aacacaacac 7500
agagacattt tttcccctca aattatcaaa agaatcactg catttgttaa agagagcaac 7560
tgaatcagga agcagagttt tgaacatatc agaagttagg aatctgcatc agagacaaat 7620
gcagtcatgg ttgtttgctg cataccagcc ctaatcatta gaagcctcat ggacttcaaa 7680
catcattccc tctgacaaga tgctctagcc taactccatg agataaaata aatctgcctt 7740
tcagagccaa agaagagtcc accagcttct tctcagtgtg aacaagagct ccagtcaggt 7800
tagtcagtcc agtgcagtag aggagaccag tctgcatcct ctaattttca aaggcaagaa 7860
gatttgttta ccctggacac caggcacaag tgaggtcaca gagctcttag atatgcagtc 7920
ctcatgagtg aggagactaa agcgcatgcc atcaagactt cagtgtagag aaaacctcca 7980
aaaaagcctc ctcactactt ctggaatagc tcagaggccg aggcggcctc ggcctctgca 8040
taaataaaaa aaattagtca gccatggggc ggagaatggg cggaactggg cggagttagg 8100
ggcgggatgg gcggagttag gggcgggact atggttgctg actaattgag atgcatgctt 8160
tgcatacttc tgcctgctgg ggagcctggg gactttccac acctggttgc tgactaattg 8220
agatgcatgc tttgcatact tctgcctgct ggggagcctg gggactttcc acaccctaac 8280
tgacacacat tccacagctg cattaatgaa tcggccaacg cgcggggaga ggcggtttgc 8340
gtattgggcg ctcttccgct tcctcgctca ctgactcgct gcgctcggtc gttcggctgc 8400
ggcgagcggt atcagctcac tcaaaggcgg taatacggtt atccacagaa tcaggggata 8460
acgcaggaaa gaacatgtga gcaaaaggcc agcaaaaggc caggaaccgt aaaaaggccg 8520
cgttgctggc gtttttccat aggctccgcc cccctgacga gcatcacaaa aatcgacgct 8580
caagtcagag gtggcgaaac ccgacaggac tataaagata ccaggcgttt ccccctggaa 8640
gctccctcgt gcgctctcct gttccgaccc tgccgcttac cggatacctg tccgcctttc 8700
tcccttcggg aagcgtggcg ctttctcata gctcacgctg taggtatctc agttcggtgt 8760
aggtcgttcg ctccaagctg ggctgtgtgc acgaaccccc cgttcagccc gaccgctgcg 8820
ccttatccgg taactatcgt cttgagtcca acccggtaag acacgactta tcgccactgg 8880
cagcagccac tggtaacagg attagcagag cgaggtatgt aggcggtgct acagagttct 8940
tgaagtggtg gcctaactac ggctacacta gaagaacagt atttggtatc tgcgctctgc 9000
tgaagccagt taccttcgga aaaagagttg gtagctcttg atccggcaaa caaaccaccg 9060
ctggtagcgg tggttttttt gtttgcaagc agcagattac gcgcagaaaa aaaggatctc 9120
aagaagatcc tttgatcttt tctacggggt ctgacgctca gtggaacgaa aactcacgtt 9180
aagggatttt ggtcatgaga ttatcaaaaa ggatcttcac ctagatcctt ttaaattaaa 9240
aatgaagttt taaatcaatc taaagtatat atgagtaaac ttggtctgac agttaccaat 9300
gcttaatcag tgaggcacct atctcagcga tctgtctatt tcgttcatcc atagttgcct 9360
gactcctgca aaccacgttg tgtctcaaaa tctctgatgt tacattgcac aagataaaaa 9420
tatatcatca tgaacaataa aactgtctgc ttacataaac agtaatacaa ggggtgttat 9480
gagccatatt caacgggaaa cgtcttgctc gaggccgcga ttaaattcca acatggatgc 9540
tgatttatat gggtataaat gggctcgcga taatgtcggg caatcaggtg cgacaatcta 9600
tcgattgtat gggaagcccg atgcgccaga gttgtttctg aaacatggca aaggtagcgt 9660
tgccaatgat gttacagatg agatggtcag actaaactgg ctgacggaat ttatgcctct 9720
tccgaccatc aagcatttta tccgtactcc tgatgatgca tggttactca ccactgcgat 9780
ccccgggaaa acagcattcc aggtattaga agaatatcct gattcaggtg aaaatattgt 9840
tgatgcgctg gcagtgttcc tgcgccggtt gcattcgatt cctgtttgta attgtccttt 9900
taacagcgat cgcgtatttc gtctcgctca ggcgcaatca cgaatgaata acggtttggt 9960
tgatgcgagt gattttgatg acgagcgtaa tggctggcct gttgaacaag tctggaaaga 10020
aatgcataag cttttgccat tctcaccgga ttcagtcgtc actcatggtg atttctcact 10080
tgataacctt atttttgacg aggggaaatt aataggttgt attgatgttg gacgagtcgg 10140
aatcgcagac cgataccagg atcttgccat cctatggaac tgcctcggtg agttttctcc 10200
ttcattacag aaacggcttt ttcaaaaata tggtattgat aatcctgata tgaataaatt 10260
gcagtttcat ttgatgctcg atgagttttt ctaagggcgg cctgccacca tacccacgcc 10320
gaaacaagcg ctcatgagcc cgaagtggcg agcccgatct tccccatcgg tgatgtcggc 10380
gatataggcg ccagcaaccg cacctgtggc gccggtgatg agggcgcgcc aagtcgacgt 10440
ccggcagtct tggccactcc ctctctgcgc gctcgctcgc tcactgaggc cgggcgacca 10500
aaggtcgccc gacgcccggg ctttgcccgg gcggcctcag tgagcgagcg agcgcgcaga 10560
gagggagtgg ccaactccat cactaggggt tcctgctagc tctgggtatt taagcccgag 10620
tgagcacgca gggtctccat tttgaagcgg gaggttacgc gttcgtcgac tactagtggg 10680
taccagagcg tggtgactga gatgttttct aggaaacaca aaagatacaa aaaagaacac 10740
gtggaaggat agccaaaaag gggggctgcc cccatttcct gcaccccgct gcgatggctg 10800
gcaccatttg gaagacttcg agatacactg ttgagcgcag taagacaaca gtgtatctcg 10860
aagtcttcca gatggggcca gccggtccac tctgtatcca ggccagttct gcaaggcgtt 10920
cgaggaccac ccccctcccc tcgccaccag ggtggtctca tacagaactt ataagattcc 10980
caaatccaaa gacatttcac gtttatggtg atttcccaga acacatagcg acatgcaaat 11040
attgcagggc gccactcccc tgtccctcac agccatcttc ctgccagggc gcacgcgcgc 11100
tgggtgttcc cgcctagtga cactgggccc gcgattcctt ggagcgggtt gatgacgtca 11160
gcgtttccca tggtgaatcc ctaggtt 11187
<210> 13
<211> 10960
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 13
ttggccactc cctctctgcg cgctcgctcg ctcactgagg ccgggcgacc aaaggtcgcc 60
cgacgcccgg gctttgcccg ggcggcctca gtgagcgagc gagcgcgcag agagggagtg 120
gccaactcca tcactagggg ttcctgctag ctctgggtat ttaagcccga gtgagcacgc 180
agggtctcca ttttgaagcg ggaggttacg cgttcgtcga ctactagtgg gtaccagagc 240
tccctaggtt ctagaaccgg tgacgtctcc catggtgaag cttggatctg aattcggtac 300
cctagttatt aatagtaatc aattacgggg tcattagttc atagcccata tatggagttc 360
cgcgttacat aacttacggt aaatggcccg cctggctgac cgcccaacga cccccgccca 420
ttgacgtcaa taatgacgta tgttcccata gtaacgccaa tagggacttt ccattgacgt 480
caatgggtgg actatttacg gtaaactgcc cacttggcag tacatcaagt gtatcatatg 540
ccaagtacgc cccctattga cgtcaatgac ggtaaatggc ccgcctggca ttatgcccag 600
tacatgacct tatgggactt tcctacttgg cagtacatct acgtattagt catcgctatt 660
accatggtcg aggtgagccc cacgttctgc ttcactctcc ccatctcccc cccctcccca 720
cccccaattt tgtatttatt tattttttaa ttattttgtg cagcgatggg ggcgggggg 780
gggggggggc gcgcgccagg cggggcgggg cggggcgagg ggcggggcgg ggcgaggcgg 840
agaggtgcgg cggcagccaa tcagagcggc gcgctccgaa agtttccttt tatggcgagg 900
cggcggcggc ggcggcccta taaaaagcga agcgcgcggc gggcgggagt cgctgcgacg 960
ctgccttcgc cccgtgcccc gctccgccgc cgcctcgcgc cgcccgcccc ggctctgact 1020
gaccgcgtta ctcccacagg tgagcgggcg ggacggccct tctcctccgg gctgtaatta 1080
gcgcttggtt taatgacggc ttgtcctggt ggcgagggga ggggggtggt cctcgaacgc 1140
cttgcagaac tggcctggat acagagtgga ccggctggcc ccatctggaa gacttcgaga 1200
tacactgttg tcttactgcg ctcaacagtg tatctcgaag tcttccaaat ggtgccagcc 1260
atcgcagcgg ggtgcaggaa atgggggcag cccccctttt tggctatcct tccacgtgtt 1320
cttttttgta tcttttgtgt ttcctagaaa acatctcagt caccaccttt ctgtggctgc 1380
gtgaaagcct tgaggggctc cgggagctag agcctctgct aaccatgttc atgccttctt 1440
ctttttccta cagctcctgg gcaacgtgct ggttattgtg ctgtctcatc attttggcaa 1500
agaattcctc gaagatccga agggaaagtc ttccacgact gtgggatccg ttcgaagata 1560
tcaccggttg agccaccatg gaattcagca gccccagcag agaggaatgc cccaagcctc 1620
tgagccgggt gtcaatcatg gccggatctc tgacaggact gctgctgctt caggccgtgt 1680
cttgggcttc tggcgctaga ccttgcatcc ccaagagctt cggctacagc agcgtcgtgt 1740
gcgtgtgcaa tgccacctac tgcgacagct tcgaccctcc tacctttcct gctctgggca 1800
ccttcagcag atacgagagc accagatccg gcagacggat ggaactgagc atgggaccca 1860
tccaggccaa tcacacaggc actggcctgc tgctgacact gcagcctgag cagaaattcc 1920
agaaagtgaa aggcttcggc ggagccatga cagatgccgc cgctctgaat atcctggctc 1980
tgtctccacc agctcagaac ctgctgctca agagctactt cagcgaggaa ggcatcggct 2040
acaacatcat cagagtgccc atggccagct gcgacttcag catcaggacc tacacctacg 2100
ccgacacacc cgacgatttc cagctgcaca acttcagcct gcctgaagag gacaccaagc 2160
tgaagatccc tctgatccac agagccctgc agctggcaca aagacccgtg tcactgctgg 2220
cctctccatg gacatctccc acctggctga aaacaaatgg cgccgtgaat ggcaagggca 2280
gcctgaaagg ccaacctggc gacatctacc accagacctg ggccagatac ttcgtgaagt 2340
tcctggacgc ctatgccgag cacaagctgc agttttgggc cgtgacagcc gagaacgaac 2400
cttctgctgg actgctgagc ggctacccct ttcagtgcct gggctttaca cccgagcacc 2460
agcgggactt tatcgcccgt gatctgggac ccacactggc caatagcacc caccataatg 2520
tgcggctgct gatgctggac gaccagagac tgcttctgcc ccactgggct aaagtggtgc 2580
tgacagatcc tgaggccgcc aaatacgtgc acggaatcgc cgtgcactgg tatctggact 2640
ttctggcccc tgccaaggcc acactgggag agacacacag actgttcccc aacaccatgc 2700
tgttcgccag cgaagcctgt gtgggcagca agttttggga acagagcgtg cggctcggca 2760
gctgggatag aggcatgcag tacagccaca gcatcatcac caacctgctg taccacgtcg 2820
tcggctggac cgactggaat ctggccctga atcctgaagg cggccctaac tgggtccgaa 2880
acttcgtgga cagccccatc atcgtggaca tcaccaagga caccttctac aagcagccca 2940
tgttctacca cctgggacac ttcagcaagt tcatccccga gggctctcag cgcgttggac 3000
tggtggcttc ccagaagaac gatctggacg ccgtggctct gatgcaccct gatggatctg 3060
ctgtggtggt ggtcctgaac cgcagcagca aagatgtgcc cctgaccatc aaggatcccg 3120
ccgtgggatt cctggaaaca atcagccctg gctactccat ccacacctac ctgtggcgta 3180
gacagtgaca attgttaatt aagtttaaac cctcgaggcc gcaagcttat cgataatcaa 3240
cctctggatt acaaaatttg tgaaagattg actggtattc ttaactatgt tgctcctttt 3300
acgctatgtg gatacgctgc tttaatgcct ttgtatcatg ctattgcttc ccgtatggct 3360
ttcattttct cctccttgta taaatcctgg ttgctgtctc tttatgagga gttgtggccc 3420
gttgtcaggc aacgtggcgt ggtgtgcact gtgtttgctg acgcaacccc cactggttgg 3480
ggcattgcca ccacctgtca gctcctttcc gggactttcg ctttccccct ccctattgcc 3540
acggcggaac tcatcgccgc ctgccttgcc cgctgctgga caggggctcg gctgttgggc 3600
actgacaatt ccgtggtgtt gtcggggaaa tcatcgtcct ttccttggct gctcgcctgt 3660
gttgccacct ggattctgcg cgggacgtcc ttctgctacg tcccttcggc cctcaatcca 3720
gcggaccttc cttcccgcgg cctgctgccg gctctgcggc ctcttccgcg tcttcgcctt 3780
cgccctcaga cgagtcggat ctccctttgg gccgcctccc cgcatcgata ccgtcgacta 3840
gagctcgctg atcagcctcg actgtgcctt ctagttgcca gccatctgtt gtttgcccct 3900
cccccgtgcc ttccttgacc ctggaaggtg ccactcccac tgtcctttcc taataaaatg 3960
aggaaattgc atcgcattgt ctgagtaggt gtcattctat tctggggggt ggggtggggc 4020
aggacagcaa gggggaggat tgggaagaca atagcaggca tgctggggag agatccacga 4080
taacaaacag cttttttggg gtgaacatat tgactgaatt ccctgcaggt tggccactcc 4140
ctctctgcgc gctcgctcgc tcactgaggc cgcccgggca aagcccgggc gtcgggcgac 4200
ctttggtcgc ccggcctcag tgagcgagcg agcgcgcaga gagggagtgg ccaactccat 4260
cactaggggt tcctgcggcc gctcgtacgg tctcgaggaa ttcctgcagg ataacttgcc 4320
aacctcattc taaaatgtat atagaagccc aaaagacaat aacaaaaata ttcttgtaga 4380
acaaaatggg aaagaatgtt ccactaaata tcaagattta gagcaaagca tgagatgtgt 4440
ggggatagac agtgaggctg ataaaataga gtagagctca gaaacagacc cattgatata 4500
tgtaagtgac ctatgaaaaa aatatggcat tttacaatgg gaaaatgatg gtctttttct 4560
tttttagaaa aacagggaaa tatatttata tgtaaaaaat aaaagggaac ccatatgtca 4620
taccatacac acaaaaaaat tccagtgaat tataagtcta aatggagaag gcaaaacttt 4680
aaatctttta gaaaataata tagaagcatg cagaccagcc tggccaacat gatgaaaccc 4740
tctctactaa taataaaatc agtagaacta ctcaggacta ctttgagtgg gaagtccttt 4800
tctatgaaga cttctttggc caaaattagg ctctaaatgc aaggagatag tgcatcatgc 4860
ctggctgcac ttactgataa atgatgttat caccatcttt aaccaaatgc acaggaacaa 4920
gttatggtac tgatgtgctg gattgagaag gagctctact tccttgacag gacacatttg 4980
tatcaactta aaaaagcaga tttttgccag cagaactatt cattcagagg taggaaactt 5040
agaatagatg atgtcactga ttagcatggc ttccccatct ccacagctgc ttccccaccca 5100
ggttgcccac agttgagttt gtccagtgct cagggctgcc cactctcagt aagaagcccc 5160
acaccagccc ctctccaaat atgttggctg ttccttccat taaagtgacc ccactttaga 5220
gcagcaagtg gatttctgtt tcttacagtt caggaaggag gagtcagctg tgagaacctg 5280
gagcctgaga tgcttctaag tcccactgct actggggtca gggaagccag actccagcat 5340
cagcagtcag gagcactaag cccttgccaa catcctgttt ctcagagaaa ctgcttccat 5400
tataatggtt gtcctttttt aagctatcaa gccaaacaac cagtgtctac cattattctc 5460
atcacctgaa gccaagggtt ctagcaaaag tcaagctgtc ttgtaatggt tgatgtgcct 5520
ccagcttctg tcttcagtca ctccactctt agcctgctct gaatcaactc tgaccacagt 5580
tccctggagc ccctgccacc tgctgcccct gccaccttct ccatctgcag tgctgtgcag 5640
ccttctgcac tcttgcagag ctaataggtg gagacttgaa ggaagaggag gaaagtttct 5700
cataatagcc ttgctgcaag ctcaaatggg aggtgggcac tgtgcccagg agccttggag 5760
caaaggctgt gcccaacctc tgactgcatc caggtttggt cttgacagag ataagaagcc 5820
ctggcttttg gagccaaaat ctaggtcaga cttaggcagg attctcaaag tttatcagca 5880
gaacatgagg cagaagaccc tttctgctcc agcttcttca ggctcaacct tcatcagaat 5940
agatagaaag agaggctgtg agggttctta aaacagaagc aaatctgact cagagaataa 6000
acaacctcct agtaaactac agcttagaca gagcatctgg tggtgagtgt gctcagtgtc 6060
ctactcaact gtctggtatc agccctcatg aggacttctc ttctttccct catagacctc 6120
catctctgtt ttccttagcc tgcagaaatc tggatggcta ttcacagaat gcctgtgctt 6180
tcagagttgc attttttctc tggtattctg gttcaagcat ttgaaggtag gaaaggttct 6240
ccaagtgcaa gaaagccagc cctgagcctc aactgcctgg ctagtgtggt cagtaggatg 6300
caaaggctgt tgaatgccac aaggccaaac tttaacctgt gtaccacaag cctagcagca 6360
gaggcagctc tgctcactgg aactctctgt cttctttctc ctgagccttt tcttttcctg 6420
agttttctag ctctcctcaa ccttacctct gccctaccca ggacaaaccc aagagccact 6480
gtttctgtga tgtcctctcc agccctaatt aggcatcatg acttcagcct gaccttccat 6540
gctcagaagc agtgctaatc cacttcagat gagctgctct atgcaacaca ggcagagcct 6600
acaaaccttt gcaccagagc cctccacata tcagtgtttg ttcatactca cttcaacagc 6660
aaatgtgact gctgagatta agattttaca caagatggtc tgtaatttca cagttagttt 6720
tatcccatta ggtatgaaag aattagcata attcccctta aacatgaatg aatcttagat 6780
tttttaataa atagttttgg aagtaaagac agagacatca ggagcacaag gaatagcctg 6840
agaggacaaa cagaacaaga aagagtctgg aaatacacag gatgttcttg gcctcctcaa 6900
agcaagtgca agcagatagt accagcagcc ccaggctatc agagcccagt gaagagaagt 6960
accatgaaag ccacagctct aaccaccctg ttccagagtg acagacagtc cccaagacaa 7020
gccagcctga gccagagaga gaactgcaag agaaagtttc taatttaggt tctgttagat 7080
tcagacaagt gcaggtcatc ctctctccac agctactcac ctctccagcc taacaaagcc 7140
tgcagtccac actccaaccc tggtgtctca cctcctagcc tctcccaaca tcctgctctc 7200
tgaccatctt ctgcatctct catctcacca tctcccactg tctacagcct actcttgcaa 7260
ctaccatctc attttctgac atcctgtcta catcttctgc catactctgc catctaccat 7320
accacctctt accatctacc acaccatctt ttatctccat ccctctcaga agcctccaag 7380
ctgaatcctg ctttatgtgt tcatctcagc ccctgcatgg aaagctgacc ccagaggcag 7440
aactattccc agagagcttg gccaagaaaa acaaaactac cagcctggcc aggctcagga 7500
gtagtaagct gcagtgtctg ttgtgttcta gcttcaacag ctgcaggagt tccactctca 7560
aatgctccac atttctcaca tcctcctgat tctggtcact acccatcttc aaagaacaga 7620
atatctcaca tcagcatact gtgaaggact agtcatgggt gcagctgctc agagctgcaa 7680
agtcattctg gatggtggag agcttacaaa catttcatga tgctcccccc gctctgatgg 7740
ctggagccca atccctacac agactcctgc tgtatgtgtt ttcctttcac tctgagccac 7800
agccagaggg caggcattca gtctcctctt caggctgggg ctggggcact gagaactcac 7860
ccaacacctt gctctcactc cttctgcaaa acaagaaaga gctttgtgct gcagtagcca 7920
tgaagaatga aaggaaggct ttaactaaaa aatgtcagag attattttca accccttact 7980
gtggatcacc agcaaggagg aaacacaaca cagagacatt ttttcccctc aaattatcaa 8040
aagaatcact gcatttgtta aagagagcaa ctgaatcagg aagcagagtt ttgaacatat 8100
cagaagttag gaatctgcat cagagacaaa tgcagtcatg gttgtttgct gcataccagc 8160
cctaatcatt agaagcctca tggacttcaa acatcattcc ctctgacaag atgctctagc 8220
ctaactccat gagataaaat aaatctgcct ttcagagcca aagaagagtc caccagcttc 8280
ttctcagtgt gaacaagagc tccagtcagg ttagtcagtc cagtgcagta gaggagacca 8340
gtctgcatcc tctaattttc aaaggcaaga agatttgttt accctggaca ccaggcacaa 8400
gtgaggtcac agagctctta gatatgcagt cctcatgagt gaggagacta aagcgcatgc 8460
catcaagact tcagtgtaga gaaaacctcc aaaaaagcct cctcactact tctggaatag 8520
ctcagaggcc gaggcggcct cggcctctgc ataaataaaa aaaattagtc agccatgggg 8580
cggagaatgg gcggaactgg gcggagttag gggcgggatg ggcggagtta ggggcgggac 8640
tatggttgct gactaattga gatgcatgct ttgcatactt ctgcctgctg gggagcctgg 8700
ggactttcca cacctggttg ctgactaatt gagatgcatg ctttgcatac ttctgcctgc 8760
tggggagcct ggggactttc cacaccctaa ctgacacaca ttccacagct gcattaatga 8820
atcggccaac gcgcggggag aggcggtttg cgtattgggc gctcttccgc ttcctcgctc 8880
actgactcgc tgcgctcggt cgttcggctg cggcgagcgg tatcagctca ctcaaaggcg 8940
gtaatacggt tatccacaga atcaggggat aacgcaggaa agaacatgtg agcaaaaggc 9000
cagcaaaagg ccaggaaccg taaaaaggcc gcgttgctgg cgtttttcca taggctccgc 9060
ccccctgacg agcatcacaa aaatcgacgc tcaagtcaga ggtggcgaaa cccgacagga 9120
ctataaagat accaggcgtt tccccctgga agctccctcg tgcgctctcc tgttccgacc 9180
ctgccgctta ccggatacct gtccgccttt ctcccttcgg gaagcgtggc gctttctcat 9240
agctcacgct gtaggtatct cagttcggtg taggtcgttc gctccaagct gggctgtgtg 9300
cacgaacccc ccgttcagcc cgaccgctgc gccttatccg gtaactatcg tcttgagtcc 9360
aacccggtaa gacacgactt atcgccactg gcagcagcca ctggtaacag gattagcaga 9420
gcgaggtatg taggcggtgc tacagagttc ttgaagtggt ggcctaacta cggctacact 9480
agaagaacag tatttggtat ctgcgctctg ctgaagccag ttaccttcgg aaaaagagtt 9540
ggtagctctt gatccggcaa acaaaccacc gctggtagcg gtggtttttt tgtttgcaag 9600
cagcagatta cgcgcagaaa aaaaggatct caagaagatc ctttgatctt ttctacgggg 9660
tctgacgctc agtggaacga aaactcacgt taagggattt tggtcatgag attatcaaaa 9720
aggatcttca cctagatcct tttaaattaa aaatgaagtt ttaaatcaat ctaaagtata 9780
tatgagtaaa cttggtctga cagttaccaa tgcttaatca gtgaggcacc tatctcagcg 9840
atctgtctat ttcgttcatc catagttgcc tgactcctgc aaaccacgtt gtgtctcaaa 9900
atctctgatg ttacattgca caagataaaa atatatcatc atgaacaata aaactgtctg 9960
cttacataaa cagtaataca aggggtgtta tgagccatat tcaacgggaa acgtcttgct 10020
cgaggccgcg attaaattcc aacatggatg ctgatttata tgggtataaa tgggctcgcg 10080
ataatgtcgg gcaatcaggt gcgacaatct atcgattgta tgggaagccc gatgcgccag 10140
agttgtttct gaaacatggc aaaggtagcg ttgccaatga tgttacagat gagatggtca 10200
gactaaactg gctgacggaa tttatgcctc ttccgaccat caagcatttt atccgtactc 10260
ctgatgatgc atggttactc accactgcga tccccgggaa aacagcattc caggtattag 10320
aagaatatcc tgattcaggt gaaaatattg ttgatgcgct ggcagtgttc ctgcgccggt 10380
tgcattcgat tcctgtttgt aattgtcctt ttaacagcga tcgcgtattt cgtctcgctc 10440
aggcgcaatc acgaatgaat aacggtttgg ttgatgcgag tgattttgat gacgagcgta 10500
atggctggcc tgttgaacaa gtctggaaag aaatgcataa gcttttgcca ttctcaccgg 10560
attcagtcgt cactcatggt gatttctcac ttgataacct tatttttgac gaggggaaat 10620
taataggttg tattgatgtt ggacgagtcg gaatcgcaga ccgataccag gatcttgcca 10680
tcctatggaa ctgcctcggt gagttttctc cttcattaca gaaacggctt tttcaaaaat 10740
atggtattga taatcctgat atgaataaat tgcagtttca tttgatgctc gatgagtttt 10800
tctaagggcg gcctgccacc atacccacgc cgaaacaagc gctcatgagc ccgaagtggc 10860
gagcccgatc ttccccatcg gtgatgtcgg cgatataggc gccagcaacc gcacctgtgg 10920
cgccggtgat gagggcgcgc caagtcgacg tccggcagtc 10960
<210> 14
<211> 536
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 14
Met Glu Phe Ser Ser Pro Ser Arg Glu Glu Cys Pro Lys Pro Leu Ser
1 5 10 15
Arg Val Ser Ile Met Ala Gly Ser Leu Thr Gly Leu Leu Leu Leu Gln
20 25 30
Ala Val Ser Trp Ala Ser Gly Ala Arg Pro Cys Ile Pro Lys Ser Phe
35 40 45
Gly Tyr Ser Ser Val Val Cys Val Cys Asn Ala Thr Tyr Cys Asp Ser
50 55 60
Phe Asp Pro Pro Thr Phe Pro Ala Leu Gly Thr Phe Ser Arg Tyr Glu
65 70 75 80
Ser Thr Arg Ser Gly Arg Arg Met Glu Leu Ser Met Gly Pro Ile Gln
85 90 95
Ala Asn His Thr Gly Thr Gly Leu Leu Leu Thr Leu Gln Pro Glu Gln
100 105 110
Lys Phe Gln Lys Val Lys Gly Phe Gly Gly Ala Met Thr Asp Ala Ala
115 120 125
Ala Leu Asn Ile Leu Ala Leu Ser Pro Pro Ala Gln Asn Leu Leu Leu
130 135 140
Lys Ser Tyr Phe Ser Glu Glu Gly Ile Gly Tyr Asn Ile Ile Arg Val
145 150 155 160
Pro Met Ala Ser Cys Asp Phe Ser Ile Arg Thr Tyr Thr Tyr Ala Asp
165 170 175
Thr Pro Asp Asp Phe Gln Leu His Asn Phe Ser Leu Pro Glu Glu Asp
180 185 190
Thr Lys Leu Lys Ile Pro Leu Ile His Arg Ala Leu Gln Leu Ala Gln
195 200 205
Arg Pro Val Ser Leu Leu Ala Ser Pro Trp Thr Ser Pro Thr Trp Leu
210 215 220
Lys Thr Asn Gly Ala Val Asn Gly Lys Gly Ser Leu Lys Gly Gln Pro
225 230 235 240
Gly Asp Ile Tyr His Gln Thr Trp Ala Arg Tyr Phe Val Lys Phe Leu
245 250 255
Asp Ala Tyr Ala Glu His Lys Leu Gln Phe Trp Ala Val Thr Ala Glu
260 265 270
Asn Glu Pro Ser Ala Gly Leu Leu Ser Gly Tyr Pro Phe Gln Cys Leu
275 280 285
Gly Phe Thr Pro Glu His Gln Arg Asp Phe Ile Ala Arg Asp Leu Gly
290 295 300
Pro Thr Leu Ala Asn Ser Thr His His Asn Val Arg Leu Leu Met Leu
305 310 315 320
Asp Asp Gln Arg Leu Leu Leu Pro His Trp Ala Lys Val Val Leu Thr
325 330 335
Asp Pro Glu Ala Ala Lys Tyr Val His Gly Ile Ala Val His Trp Tyr
340 345 350
Leu Asp Phe Leu Ala Pro Ala Lys Ala Thr Leu Gly Glu Thr His Arg
355 360 365
Leu Phe Pro Asn Thr Met Leu Phe Ala Ser Glu Ala Cys Val Gly Ser
370 375 380
Lys Phe Trp Glu Gln Ser Val Arg Leu Gly Ser Trp Asp Arg Gly Met
385 390 395 400
Gln Tyr Ser His Ser Ile Ile Thr Asn Leu Leu Tyr His Val Val Gly
405 410 415
Trp Thr Asp Trp Asn Leu Ala Leu Asn Pro Glu Gly Gly Pro Asn Trp
420 425 430
Val Arg Asn Phe Val Asp Ser Pro Ile Ile Val Asp Ile Thr Lys Asp
435 440 445
Thr Phe Tyr Lys Gln Pro Met Phe Tyr His Leu Gly His Phe Ser Lys
450 455 460
Phe Ile Pro Glu Gly Ser Gln Arg Val Gly Leu Val Ala Ser Gln Lys
465 470 475 480
Asn Asp Leu Asp Ala Val Ala Leu Met His Pro Asp Gly Ser Ala Val
485 490 495
Val Val Val Leu Asn Arg Ser Ser Lys Asp Val Pro Leu Thr Ile Lys
500 505 510
Asp Pro Ala Val Gly Phe Leu Glu Thr Ile Ser Pro Gly Tyr Ser Ile
515 520 525
His Thr Tyr Leu Trp Arg Arg Gln
530 535
<210> 15
<211> 1608
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 15
atggaattca gcagccccag cagagaggaa tgccccaagc ctctgagccg ggtgtcaatc 60
atggccggat ctctgacagg actgctgctg cttcaggccg tgtcttgggc ttctggcgct 120
agaccttgca tccccaagag cttcggctac agcagcgtcg tgtgcgtgtg caatgccacc 180
tactgcgaca gcttcgaccc tcctaccttt cctgctctgg gcaccttcag cagatacgag 240
agcaccagat ccggcagacg gatggaactg agcatgggac ccatccaggc caatcacaca 300
ggcactggcc tgctgctgac actgcagcct gagcagaaat tccagaaagt gaaaggcttc 360
ggcggagcca tgacagatgc cgccgctctg aatatcctgg ctctgtctcc accagctcag 420
aacctgctgc tcaagagcta cttcagcgag gaaggcatcg gctacaacat catcagagtg 480
cccatggcca gctgcgactt cagcatcagg acctacacct acgccgacac acccgacgat 540
ttccagctgc acaacttcag cctgcctgaa gaggacacca agctgaagat ccctctgatc 600
cacagagccc tgcagctggc acaaagaccc gtgtcactgc tggcctctcc atggacatct 660
cccacctggc tgaaaacaaa tggcgccgtg aatggcaagg gcagcctgaa aggccaacct 720
ggcgacatct accaccagac ctgggccaga tacttcgtga agttcctgga cgcctatgcc 780
gagcacaagc tgcagttttg ggccgtgaca gccgagaacg aaccttctgc tggactgctg 840
agcggctacc cctttcagtg cctgggcttt acacccgagc accagcggga ctttatcgcc 900
cgtgatctgg gacccacact ggccaatagc acccaccata atgtgcggct gctgatgctg 960
gacgaccaga gactgcttct gccccactgg gctaaagtgg tgctgacaga tcctgaggcc 1020
gccaaatacg tgcacggaat cgccgtgcac tggtatctgg actttctggc ccctgccaag 1080
gccacactgg gagagacaca cagactgttc cccaacacca tgctgttcgc cagcgaagcc 1140
tgtgtgggca gcaagttttg ggaacagagc gtgcggctcg gcagctggga tagaggcatg 1200
cagtacagcc acagcatcat caccaacctg ctgtaccacg tcgtcggctg gaccgactgg 1260
aatctggccc tgaatcctga aggcggccct aactgggtcc gaaacttcgt ggacagcccc 1320
atcatcgtgg acatcaccaa ggacaccttc tacaagcagc ccatgttcta ccacctggga 1380
cacttcagca agttcatccc cgagggctct cagcgcgttg gactggtggc ttcccagaag 1440
aacgatctgg acgccgtggc tctgatgcac cctgatggat ctgctgtggt ggtggtcctg 1500
aaccgcagca gcaaagatgt gcccctgacc atcaaggatc ccgccgtggg attcctggaa 1560
acaatcagcc ctggctactc catccacacc tacctgtggc gtagacag 1608
<210> 16
<211> 524
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 16
Met Tyr Ala Leu Phe Leu Leu Ala Ser Leu Leu Gly Ala Ala Leu Ala
1 5 10 15
Gly Pro Val Leu Gly Leu Lys Glu Cys Thr Arg Gly Ser Ala Val Trp
20 25 30
Cys Gln Asn Val Lys Thr Ala Ser Asp Cys Gly Ala Val Lys His Cys
35 40 45
Leu Gln Thr Val Trp Asn Lys Pro Thr Val Lys Ser Leu Pro Cys Asp
50 55 60
Ile Cys Lys Asp Val Val Thr Ala Ala Gly Asp Met Leu Lys Asp Asn
65 70 75 80
Ala Thr Glu Glu Glu Ile Leu Val Tyr Leu Glu Lys Thr Cys Asp Trp
85 90 95
Leu Pro Lys Pro Asn Met Ser Ala Ser Cys Lys Glu Ile Val Asp Ser
100 105 110
Tyr Leu Pro Val Ile Leu Asp Ile Ile Lys Gly Glu Met Ser Arg Pro
115 120 125
Gly Glu Val Cys Ser Ala Leu Asn Leu Cys Glu Ser Leu Gln Lys His
130 135 140
Leu Ala Glu Leu Asn His Gln Lys Gln Leu Glu Ser Asn Lys Ile Pro
145 150 155 160
Glu Leu Asp Met Thr Glu Val Val Ala Pro Phe Met Ala Asn Ile Pro
165 170 175
Leu Leu Leu Tyr Pro Gln Asp Gly Pro Arg Ser Lys Pro Gln Pro Lys
180 185 190
Asp Asn Gly Asp Val Cys Gln Asp Cys Ile Gln Met Val Thr Asp Ile
195 200 205
Gln Thr Ala Val Arg Thr Asn Ser Thr Phe Val Gln Ala Leu Val Glu
210 215 220
His Val Lys Glu Glu Cys Asp Arg Leu Gly Pro Gly Met Ala Asp Ile
225 230 235 240
Cys Lys Asn Tyr Ile Ser Gln Tyr Ser Glu Ile Ala Ile Gln Met Met
245 250 255
Met His Met Gln Pro Lys Glu Ile Cys Ala Leu Val Gly Phe Cys Asp
260 265 270
Glu Val Lys Glu Met Pro Met Gln Thr Leu Val Pro Ala Lys Val Ala
275 280 285
Ser Lys Asn Val Ile Pro Ala Leu Glu Leu Val Glu Pro Ile Lys Lys
290 295 300
His Glu Val Pro Ala Lys Ser Asp Val Tyr Cys Glu Val Cys Glu Phe
305 310 315 320
Leu Val Lys Glu Val Thr Lys Leu Ile Asp Asn Asn Lys Thr Glu Lys
325 330 335
Glu Ile Leu Asp Ala Phe Asp Lys Met Cys Ser Lys Leu Pro Lys Ser
340 345 350
Leu Ser Glu Glu Cys Gln Glu Val Val Asp Thr Tyr Gly Ser Ser Ile
355 360 365
Leu Ser Ile Leu Leu Glu Glu Val Ser Pro Glu Leu Val Cys Ser Met
370 375 380
Leu His Leu Cys Ser Gly Thr Arg Leu Pro Ala Leu Thr Val His Val
385 390 395 400
Thr Gln Pro Lys Asp Gly Gly Phe Cys Glu Val Cys Lys Lys Leu Val
405 410 415
Gly Tyr Leu Asp Arg Asn Leu Glu Lys Asn Ser Thr Lys Gln Glu Ile
420 425 430
Leu Ala Ala Leu Glu Lys Gly Cys Ser Phe Leu Pro Asp Pro Tyr Gln
435 440 445
Lys Gln Cys Asp Gln Phe Val Ala Glu Tyr Glu Pro Val Leu Ile Glu
450 455 460
Ile Leu Val Glu Val Met Asp Pro Ser Phe Val Cys Leu Lys Ile Gly
465 470 475 480
Ala Cys Pro Ser Ala His Lys Pro Leu Leu Gly Thr Glu Lys Cys Ile
485 490 495
Trp Gly Pro Ser Tyr Trp Cys Gln Asn Thr Glu Thr Ala Ala Gln Cys
500 505 510
Asn Ala Val Glu His Cys Lys Arg His Val Trp Asn
515 520
<210> 17
<211> 1572
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 17
atgtacgccc tgttcctgct ggccagcctg ctgggcgccg ccctggccgg ccccgtgctg 60
ggcctgaagg agtgcacccg cggcagcgcc gtgtggtgcc agaacgtgaa gaccgccagc 120
gactgcggcg ccgtgaagca ctgcctgcag accgtgtgga acaagcccac cgtgaagagc 180
ctgccctgcg acatctgcaa ggacgtggtg accgccgccg gcgacatgct gaaggacaac 240
gccaccgagg aggagatcct ggtgtacctg gagaagacct gcgactggct gcccaagccc 300
aacatgagcg ccagctgcaa ggagatcgtg gacagctacc tgcccgtgat cctggacatc 360
atcaagggcg agatgagccg ccccggcgag gtgtgcagcg ccctgaacct gtgcgagagc 420
ctgcagaagc acctggccga gctgaaccac cagaagcagc tggagagcaa caagatcccc 480
gagctggaca tgaccgaggt ggtggccccc ttcatggcca acatccccct gctgctgtac 540
ccccaggacg gcccccgcag caagccccag cccaaggaca acggcgacgt gtgccaggac 600
tgcatccaga tggtgaccga catccagacc gccgtgcgca ccaacagcac cttcgtgcag 660
gccctggtgg agcacgtgaa ggaggagtgc gaccgcctgg gccccggcat ggccgacatc 720
tgcaagaact acatcagcca gtacagcgag atcgccatcc agatgatgat gcacatgcag 780
cccaaggaga tctgcgccct ggtgggcttc tgcgacgagg tgaaggagat gcccatgcag 840
accctggtgc ccgccaaggt ggccagcaag aacgtgatcc ccgccctgga gctggtggag 900
cccatcaaga agcacgaggt gcccgccaag agcgacgtgt actgcgaggt gtgcgagttc 960
ctggtgaagg aggtgaccaa gctgatcgac aacaacaaga ccgagaagga gatcctggac 1020
gccttcgaca agatgtgcag caagctgccc aagagcctga gcgaggagtg ccaggaggtg 1080
gtggacacct acggcagcag catcctgagc atcctgctgg aggaggtgag ccccgagctg 1140
gtgtgcagca tgctgcacct gtgcagcggc acccgcctgc ccgccctgac cgtgcacgtg 1200
acccagccca aggacggcgg cttctgcgag gtgtgcaaga agctggtggg ctacctggac 1260
cgcaacctgg agaagaacag caccaagcag gagatcctgg ccgccctgga gaagggctgc 1320
agcttcctgc ccgaccccta ccagaagcag tgcgaccagt tcgtggccga gtacgagccc 1380
gtgctgatcg agatcctggt ggaggtgatg gaccccagct tcgtgtgcct gaagatcggc 1440
gcctgcccca gcgcccacaa gcccctgctg ggcaccgaga agtgcatctg gggccccagc 1500
tactggtgcc agaacaccga gaccgccgcc cagtgcaacg ccgtggagca ctgcaagcgc 1560
cacgtgtgga ac 1572
<210> 18
<211> 478
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 18
Met Gly Arg Cys Cys Phe Tyr Thr Ala Gly Thr Leu Ser Leu Leu Leu
1 5 10 15
Leu Val Thr Ser Val Thr Leu Leu Val Ala Arg Val Phe Gln Lys Ala
20 25 30
Val Asp Gln Ser Ile Glu Lys Lys Ile Val Leu Arg Asn Gly Thr Glu
35 40 45
Ala Phe Asp Ser Trp Glu Lys Pro Pro Leu Pro Val Tyr Thr Gln Phe
50 55 60
Tyr Phe Phe Asn Val Thr Asn Pro Glu Glu Ile Leu Arg Gly Glu Thr
65 70 75 80
Pro Arg Val Glu Glu Val Gly Pro Tyr Thr Tyr Arg Glu Leu Arg Asn
85 90 95
Lys Ala Asn Ile Gln Phe Gly Asp Asn Gly Thr Thr Ile Ser Ala Val
100 105 110
Ser Asn Lys Ala Tyr Val Phe Glu Arg Asp Gln Ser Val Gly Asp Pro
115 120 125
Lys Ile Asp Leu Ile Arg Thr Leu Asn Ile Pro Val Leu Thr Val Ile
130 135 140
Glu Trp Ser Gln Val His Phe Leu Arg Glu Ile Ile Glu Ala Met Leu
145 150 155 160
Lys Ala Tyr Gln Gln Lys Leu Phe Val Thr His Thr Val Asp Glu Leu
165 170 175
Leu Trp Gly Tyr Lys Asp Glu Ile Leu Ser Leu Ile His Val Phe Arg
180 185 190
Pro Asp Ile Ser Pro Tyr Phe Gly Leu Phe Tyr Glu Lys Asn Gly Thr
195 200 205
Asn Asp Gly Asp Tyr Val Phe Leu Thr Gly Glu Asp Ser Tyr Leu Asn
210 215 220
Phe Thr Lys Ile Val Glu Trp Asn Gly Lys Thr Ser Leu Asp Trp Trp
225 230 235 240
Ile Thr Asp Lys Cys Asn Met Ile Asn Gly Thr Asp Gly Asp Ser Phe
245 250 255
His Pro Leu Ile Thr Lys Asp Glu Val Leu Tyr Val Phe Pro Ser Asp
260 265 270
Phe Cys Arg Ser Val Tyr Ile Thr Phe Ser Asp Tyr Glu Ser Val Gln
275 280 285
Gly Leu Pro Ala Phe Arg Tyr Lys Val Pro Ala Glu Ile Leu Ala Asn
290 295 300
Thr Ser Asp Asn Ala Gly Phe Cys Ile Pro Glu Gly Asn Cys Leu Gly
305 310 315 320
Ser Gly Val Leu Asn Val Ser Ile Cys Lys Asn Gly Ala Pro Ile Ile
325 330 335
Met Ser Phe Pro His Phe Tyr Gln Ala Asp Glu Arg Phe Val Ser Ala
340 345 350
Ile Glu Gly Met His Pro Asn Gln Glu Asp His Glu Thr Phe Val Asp
355 360 365
Ile Asn Pro Leu Thr Gly Ile Ile Leu Lys Ala Ala Lys Arg Phe Gln
370 375 380
Ile Asn Ile Tyr Val Lys Lys Leu Asp Asp Phe Val Glu Thr Gly Asp
385 390 395 400
Ile Arg Thr Met Val Phe Pro Val Met Tyr Leu Asn Glu Ser Val His
405 410 415
Ile Asp Lys Glu Thr Ala Ser Arg Leu Lys Ser Met Ile Asn Thr Thr
420 425 430
Leu Ile Ile Thr Asn Ile Pro Tyr Ile Ile Met Ala Leu Gly Val Phe
435 440 445
Phe Gly Leu Val Phe Thr Trp Leu Ala Cys Lys Gly Gln Gly Ser Met
450 455 460
Asp Glu Gly Thr Ala Asp Glu Arg Ala Pro Leu Ile Arg Thr
465 470 475
<210> 19
<211> 1434
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 19
atgggccgct gctgcttcta caccgccggc accctgagcc tgctgctgct ggtgaccagc 60
gtgaccctgc tggtggcccg cgtgttccag aaggccgtgg accagagcat cgagaagaag 120
atcgtgctgc gcaacggcac cgaggccttc gacagctggg agaagccccc cctgcccgtg 180
tacacccagt tctacttctt caacgtgacc aaccccgagg agatcctgcg cggcgagacc 240
ccccgcgtgg aggaggtggg cccctacacc taccgcgagc tgcgcaacaa ggccaacatc 300
cagttcggcg acaacggcac caccatcagc gccgtgagca acaaggccta cgtgttcgag 360
cgcgaccaga gcgtgggcga ccccaagatc gacctgatcc gcaccctgaa catccccgtg 420
ctgaccgtga tcgagtggag ccaggtgcac ttcctgcgcg agatcatcga ggccatgctg 480
aaggcctacc agcagaagct gttcgtgacc cacaccgtgg acgagctgct gtggggctac 540
aaggacgaga tcctgagcct gatccacgtg ttccgccccg acatcagccc ctacttcggc 600
ctgttctacg agaagaacgg caccaacgac ggcgactacg tgttcctgac cggcgaggac 660
agctacctga acttcaccaa gatcgtggag tggaacggca agaccagcct ggactggtgg 720
atcaccgaca agtgcaacat gatcaacggc accgacggcg acagcttcca ccccctgatc 780
accaaggacg aggtgctgta cgtgttcccc agcgacttct gccgcagcgt gtacatcacc 840
ttcagcgact acgagagcgt gcagggcctg cccgccttcc gctacaaggt gcccgccgag 900
atcctggcca acaccagcga caacgccggc ttctgcatcc ccgagggcaa ctgcctgggc 960
agcggcgtgc tgaacgtgag catctgcaag aacggcgccc ccatcatcat gagcttcccc 1020
cacttctacc aggccgacga gcgcttcgtg agcgccatcg agggcatgca ccccaaccag 1080
gaggaccacg agaccttcgt ggacatcaac cccctgaccg gcatcatcct gaaggccgcc 1140
aagcgcttcc agatcaacat ctacgtgaag aagctggacg acttcgtgga gaccggcgac 1200
atccgcacca tggtgttccc cgtgatgtac ctgaacgaga gcgtgcacat cgacaaggag 1260
accgccagcc gcctgaagag catgatcaac accaccctga tcatcaccaa catcccctac 1320
atcatcatgg ccctgggcgt gttcttcggc ctggtgttca cctggctggc ctgcaagggc 1380
cagggcagca tggacgaggg caccgccgac gagcgcgccc ccctgatccg cacc 1434
<210> 20
<211> 23
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 20
tggaagactt cgagatacac tgt 23
<210> 21
<211> 23
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 21
acagtgtatc tcgaagtctt cca 23
<210> 22
<211> 21
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 22
tttagaaata agtggtagtc a 21
<210> 23
<211> 21
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 23
tgactaccac ttatttctaa a 21
<210> 24
<211> 19
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 24
agggtatcaa gactacgaa 19
<210> 25
<211> 19
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 25
ttcgtagtct tgataccct 19
<210> 26
<211> 19
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 26
tattagatct gatggccgc 19
<210> 27
<211> 20
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 27
ctccatcact aggggttcct 20
<210> 28
<211> 60
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 28
agctctgggt atttaagccc gagtgagcac gcagggtctc cattttgaag cgggaggtta 60
<210> 29
<211> 145
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 29
aggaacccct agtgatggag ttggccactc cctctctgcg cgctcgctcg ctcactgagg 60
ccgggcgacc aaaggtcgcc cgacgcccgg gctttgcccg ggcggcctca gtgagcgagc 120
gagcgcgcag agagggagtg gccaa 145
<210> 30
<211> 927
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 30
Met Gly Thr Gln Asp Pro Gly Asn Met Gly Thr Gly Val Pro Ala Ser
1 5 10 15
Glu Gln Ile Ser Cys Ala Lys Glu Asp Pro Gln Val Tyr Cys Pro Glu
20 25 30
Glu Thr Gly Gly Thr Lys Asp Val Gln Val Thr Asp Cys Lys Ser Pro
35 40 45
Glu Asp Ser Arg Pro Pro Lys Glu Thr Asp Cys Cys Asn Pro Glu Asp
50 55 60
Ser Gly Gln Leu Met Val Ser Tyr Glu Gly Lys Ala Met Gly Tyr Gln
65 70 75 80
Val Pro Pro Phe Gly Trp Arg Ile Cys Leu Ala His Glu Phe Thr Glu
85 90 95
Lys Arg Lys Pro Phe Gln Ala Asn Asn Val Ser Leu Ser Asn Met Ile
100 105 110
Lys His Ile Gly Met Gly Leu Arg Tyr Leu Gln Trp Trp Tyr Arg Lys
115 120 125
Thr His Val Glu Lys Lys Thr Pro Phe Ile Asp Met Ile Asn Ser Val
130 135 140
Pro Leu Arg Gln Ile Tyr Gly Cys Pro Leu Gly Gly Ile Gly Gly Gly
145 150 155 160
Thr Ile Thr Arg Gly Trp Arg Gly Gln Phe Cys Arg Trp Gln Leu Asn
165 170 175
Pro Gly Met Tyr Gln His Arg Thr Val Ile Ala Asp Gln Phe Thr Val
180 185 190
Cys Leu Arg Arg Glu Gly Gln Thr Val Tyr Gln Gln Val Leu Ser Leu
195 200 205
Glu Arg Pro Ser Val Leu Arg Ser Trp Asn Trp Gly Leu Cys Gly Tyr
210 215 220
Phe Ala Phe Tyr His Ala Leu Tyr Pro Arg Ala Trp Thr Val Tyr Gln
225 230 235 240
Leu Pro Gly Gln Asn Val Thr Leu Thr Cys Arg Gln Ile Thr Pro Ile
245 250 255
Leu Pro His Asp Tyr Gln Asp Ser Ser Leu Pro Val Gly Val Phe Val
260 265 270
Trp Asp Val Glu Asn Glu Gly Asp Glu Ala Leu Asp Val Ser Ile Met
275 280 285
Phe Ser Met Arg Asn Gly Leu Gly Gly Gly Asp Asp Ala Pro Gly Gly
290 295 300
Leu Trp Asn Glu Pro Phe Cys Leu Glu Arg Ser Gly Glu Thr Val Arg
305 310 315 320
Gly Leu Leu Leu His His Pro Thr Leu Pro Asn Pro Tyr Thr Met Ala
325 330 335
Val Ala Ala Arg Val Thr Ala Ala Thr Thr Val Thr His Ile Thr Ala
340 345 350
Phe Asp Pro Asp Ser Thr Gly Gln Gln Val Trp Gln Asp Leu Leu Gln
355 360 365
Asp Gly Gln Leu Asp Ser Pro Thr Gly Gln Ser Thr Pro Thr Gln Lys
370 375 380
Gly Val Gly Ile Ala Gly Ala Val Cys Val Ser Ser Lys Leu Arg Pro
385 390 395 400
Arg Gly Gln Cys Arg Leu Glu Phe Ser Leu Ala Trp Asp Met Pro Arg
405 410 415
Ile Met Phe Gly Ala Lys Gly Gln Val His Tyr Arg Arg Tyr Thr Arg
420 425 430
Phe Phe Gly Gln Asp Gly Asp Ala Ala Pro Ala Leu Ser His Tyr Ala
435 440 445
Leu Cys Arg Tyr Ala Glu Trp Glu Glu Arg Ile Ser Ala Trp Gln Ser
450 455 460
Pro Val Leu Asp Asp Arg Ser Leu Pro Ala Trp Tyr Lys Ser Ala Leu
465 470 475 480
Phe Asn Glu Leu Tyr Phe Leu Ala Asp Gly Gly Thr Val Trp Leu Glu
485 490 495
Val Leu Glu Asp Ser Leu Pro Glu Glu Leu Gly Arg Asn Met Cys His
500 505 510
Leu Arg Pro Thr Leu Arg Asp Tyr Gly Arg Phe Gly Tyr Leu Glu Gly
515 520 525
Gln Glu Tyr Arg Met Tyr Asn Thr Tyr Asp Val His Phe Tyr Ala Ser
530 535 540
Phe Ala Leu Ile Met Leu Trp Pro Lys Leu Glu Leu Ser Leu Gln Tyr
545 550 555 560
Asp Met Ala Leu Ala Thr Leu Arg Glu Asp Leu Thr Arg Arg Arg Tyr
565 570 575
Leu Met Ser Gly Val Met Ala Pro Val Lys Arg Arg Asn Val Ile Pro
580 585 590
His Asp Ile Gly Asp Pro Asp Asp Glu Pro Trp Leu Arg Val Asn Ala
595 600 605
Tyr Leu Ile His Asp Thr Ala Asp Trp Lys Asp Leu Asn Leu Lys Phe
610 615 620
Val Leu Gln Val Tyr Arg Asp Tyr Tyr Leu Thr Gly Asp Gln Asn Phe
625 630 635 640
Leu Lys Asp Met Trp Pro Val Cys Leu Ala Val Met Glu Ser Glu Met
645 650 655
Lys Phe Asp Lys Asp His Asp Gly Leu Ile Glu Asn Gly Gly Tyr Ala
660 665 670
Asp Gln Thr Tyr Asp Gly Trp Val Thr Thr Gly Pro Ser Ala Tyr Cys
675 680 685
Gly Gly Leu Trp Leu Ala Ala Val Ala Val Met Val Gln Met Ala Ala
690 695 700
Leu Cys Gly Ala Gln Asp Ile Gln Asp Lys Phe Ser Ser Ile Leu Ser
705 710 715 720
Arg Gly Gln Glu Ala Tyr Glu Arg Leu Leu Trp Asn Gly Arg Tyr Tyr
725 730 735
Asn Tyr Asp Ser Ser Ser Arg Pro Gln Ser Arg Ser Val Met Ser Asp
740 745 750
Gln Cys Ala Gly Gln Trp Phe Leu Lys Ala Cys Gly Leu Gly Glu Gly
755 760 765
Asp Thr Glu Val Phe Pro Thr Gln His Val Val Arg Ala Leu Gln Thr
770 775 780
Ile Phe Glu Leu Asn Val Gln Ala Phe Ala Gly Gly Ala Met Gly Ala
785 790 795 800
Val Asn Gly Met Gln Pro His Gly Val Pro Asp Lys Ser Ser Val Gln
805 810 815
Ser Asp Glu Val Trp Val Gly Val Val Tyr Gly Leu Ala Ala Thr Met
820 825 830
Ile Gln Glu Gly Leu Thr Trp Glu Gly Phe Gln Thr Ala Glu Gly Cys
835 840 845
Tyr Arg Thr Val Trp Glu Arg Leu Gly Leu Ala Phe Gln Thr Pro Glu
850 855 860
Ala Tyr Cys Gln Gln Arg Val Phe Arg Ser Leu Ala Tyr Met Arg Pro
865 870 875 880
Leu Ser Ile Trp Ala Met Gln Leu Ala Leu Gln Gln Gln Gln His Lys
885 890 895
Lys Ala Ser Trp Pro Lys Val Lys Gln Gly Thr Gly Leu Arg Thr Gly
900 905 910
Pro Met Phe Gly Pro Lys Glu Ala Met Ala Asn Leu Ser Pro Glu
915 920 925
<210> 31
<211> 2781
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 31
atgggcaccc aggaccccgg caacatgggc accggcgtgc ccgccagcga gcagatcagc 60
tgcgccaagg aggaccccca ggtgtactgc cccgaggaga ccggcggcac caaggacgtg 120
caggtgaccg actgcaagag ccccgaggac agccgccccc ccaaggagac cgactgctgc 180
aaccccgagg acagcggcca gctgatggtg agctacgagg gcaaggccat gggctaccag 240
gtgcccccct tcggctggcg catctgcctg gcccacgagt tcaccgagaa gcgcaagccc 300
ttccaggcca acaacgtgag cctgagcaac atgatcaagc acatcggcat gggcctgcgc 360
tacctgcagt ggtggtaccg caagacccac gtggagaaga agaccccctt catcgacatg 420
atcaacagcg tgcccctgcg ccagatctac ggctgccccc tgggcggcat cggcggcggc 480
accatcaccc gcggctggcg cggccagttc tgccgctggc agctgaaccc cggcatgtac 540
cagcaccgca ccgtgatcgc cgaccagttc accgtgtgcc tgcgccgcga gggccagacc 600
gtgtaccagc aggtgctgag cctggagcgc cccagcgtgc tgcgcagctg gaactggggc 660
ctgtgcggct acttcgcctt ctaccacgcc ctgtaccccc gcgcctggac cgtgtaccag 720
ctgcccggcc agaacgtgac cctgacctgc cgccagatca cccccatcct gccccacgac 780
taccaggaca gcagcctgcc cgtgggcgtg ttcgtgtggg acgtggagaa cgagggcgac 840
gaggccctgg acgtgagcat catgttcagc atgcgcaacg gcctgggcgg cggcgacgac 900
gcccccggcg gcctgtggaa cgagcccttc tgcctggagc gcagcggcga gaccgtgcgc 960
ggcctgctgc tgcaccaccc caccctgccc aacccctaca ccatggccgt ggccgcccgc 1020
gtgaccgccg ccaccaccgt gacccacatc accgccttcg accccgacag caccggccag 1080
caggtgtggc aggacctgct gcaggacggc cagctggaca gccccaccgg ccagagcacc 1140
cccacccaga agggcgtggg catcgccggc gccgtgtgcg tgagcagcaa gctgcgcccc 1200
cgcggccagt gccgcctgga gttcagcctg gcctgggaca tgccccgcat catgttcggc 1260
gccaagggcc aggtgcacta ccgccgctac acccgcttct tcggccagga cggcgacgcc 1320
gcccccgccc tgagccacta cgccctgtgc cgctacgccg agtgggagga gcgcatcagc 1380
gcctggcaga gccccgtgct ggacgaccgc agcctgcccg cctggtacaa gagcgccctg 1440
ttcaacgagc tgtacttcct ggccgacggc ggcaccgtgt ggctggaggt gctggaggac 1500
agcctgcccg aggagctggg ccgcaacatg tgccacctgc gccccaccct gcgcgactac 1560
ggccgcttcg gctacctgga gggccaggag taccgcatgt acaacaccta cgacgtgcac 1620
ttctacgcca gcttcgccct gatcatgctg tggcccaagc tggagctgag cctgcagtac 1680
gacatggccc tggccaccct gcgcgaggac ctgacccgcc gccgctacct gatgagcggc 1740
gtgatggccc ccgtgaagcg ccgcaacgtg atcccccacg acatcggcga ccccgacgac 1800
gagccctggc tgcgcgtgaa cgcctacctg atccacgaca ccgccgactg gaaggacctg 1860
aacctgaagt tcgtgctgca ggtgtaccgc gactactacc tgaccggcga ccagaacttc 1920
ctgaaggaca tgtggcccgt gtgcctggcc gtgatggaga gcgagatgaa gttcgacaag 1980
gaccacgacg gcctgatcga gaacggcggc tacgccgacc agacctacga cggctgggtg 2040
accaccggcc ccagcgccta ctgcggcggc ctgtggctgg ccgccgtggc cgtgatggtg 2100
cagatggccg ccctgtgcgg cgcccaggac atccaggaca agttcagcag catcctgagc 2160
cgcggccagg aggcctacga gcgcctgctg tggaacggcc gctactacaa ctacgacagc 2220
agcagccgcc cccagagccg cagcgtgatg agcgaccagt gcgccggcca gtggttcctg 2280
aaggcctgcg gcctgggcga gggcgacacc gaggtgttcc ccacccagca cgtggtgcgc 2340
gccctgcaga ccatcttcga gctgaacgtg caggccttcg ccggcggcgc catgggcgcc 2400
gtgaacggca tgcagcccca cggcgtgccc gacaagagca gcgtgcagag cgacgaggtg 2460
tgggtgggcg tggtgtacgg cctggccgcc accatgatcc aggagggcct gacctgggag 2520
ggcttccaga ccgccgaggg ctgctaccgc accgtgtggg agcgcctggg cctggccttc 2580
cagacccccg aggcctactg ccagcagcgc gtgttccgca gcctggccta catgcgcccc 2640
ctgagcatct gggccatgca gctggccctg cagcagcagc agcacaagaa ggccagctgg 2700
cccaaggtga agcagggcac cggcctgcgc accggcccca tgttcggccc caaggaggcc 2760
atggccaacc tgagccccga g 2781
<210> 32
<211> 11264
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 32
ttggccactc cctctctgcg cgctcgctcg ctcactgagg ccgggcgacc aaaggtcgcc 60
cgacgcccgg gctttgcccg ggcggcctca gtgagcgagc gagcgcgcag agagggagtg 120
gccaactcca tcactagggg ttcctgctag ctctgggtat ttaagcccga gtgagcacgc 180
agggtctcca ttttgaagcg ggaggttacg cgttcgtcga ctactagtgg gtaccagagc 240
tccctaggtt ctagaaccgg tgacgtctcc catggtgaag cttggatctg agtaagtcac 300
tgactgtcta tgcctgggaa agggtgggca ggagatgggg cagtgcagga aaagtggcac 360
tatgaaccct cctggtggcg aggggagggg ggtggtcctc gaacgccttg cagaactggc 420
ctggatacag agtggaccgg ctggccccat ctggaagact tcgagataca ctgttgtctt 480
actgcgctca acagtgtatc tcgaagtctt ccaaatggtg ccagccatcg cagcggggtg 540
caggaaatgg gggcagcccc cctttttggc tatccttcca cgtgttcttt tttgtatctt 600
ttgtgtttcc tagaaaacat ctcagtcacc accgcagccc taggaatgca tctagacaat 660
tgtactaacc ttcttctctt tcctctcctg acagtccgga aagccaccat gggcacccag 720
gaccccggca acatgggcac cggcgtgccc gccagcgagc agatcagctg cgccaaggag 780
gacccccagg tgtactgccc cgaggagacc ggcggcacca aggacgtgca ggtgaccgac 840
tgcaagagcc ccgaggacag ccgccccccc aaggagaccg actgctgcaa ccccgaggac 900
agcggccagc tgatggtgag ctacgagggc aaggccatgg gctaccaggt gccccccttc 960
ggctggcgca tctgcctggc ccacgagttc accgagaagc gcaagccctt ccaggccaac 1020
aacgtgagcc tgagcaacat gatcaagcac atcggcatgg gcctgcgcta cctgcagtgg 1080
tggtaccgca agacccacgt ggagaagaag acccccttca tcgacatgat caacagcgtg 1140
cccctgcgcc agatctacgg ctgccccctg ggcggcatcg gcggcggcac catcacccgc 1200
ggctggcgcg gccagttctg ccgctggcag ctgaaccccg gcatgtacca gcaccgcacc 1260
gtgatcgccg accagttcac cgtgtgcctg cgccgcgagg gccagaccgt gtaccagcag 1320
gtgctgagcc tggagcgccc cagcgtgctg cgcagctgga actggggcct gtgcggctac 1380
ttcgccttct accacgccct gtacccccgc gcctggaccg tgtaccagct gcccggccag 1440
aacgtgaccc tgacctgccg ccagatcacc cccatcctgc cccacgacta ccaggacagc 1500
agcctgcccg tgggcgtgtt cgtgtgggac gtggagaacg agggcgacga ggccctggac 1560
gtgagcatca tgttcagcat gcgcaacggc ctgggcggcg gcgacgacgc ccccggcggc 1620
ctgtggaacg agcccttctg cctggagcgc agcggcgaga ccgtgcgcgg cctgctgctg 1680
caccacccca ccctgcccaa cccctacacc atggccgtgg ccgcccgcgt gaccgccgcc 1740
accaccgtga cccacatcac cgccttcgac cccgacagca ccggccagca ggtgtggcag 1800
gacctgctgc aggacggcca gctggacagc cccaccggcc agagcacccc cacccagaag 1860
ggcgtgggca tcgccggcgc cgtgtgcgtg agcagcaagc tgcgcccccg cggccagtgc 1920
cgcctggagt tcagcctggc ctgggacatg ccccgcatca tgttcggcgc caagggccag 1980
gtgcactacc gccgctacac ccgcttcttc ggccaggacg gcgacgccgc ccccgccctg 2040
agccactacg ccctgtgccg ctacgccgag tgggaggagc gcatcagcgc ctggcagagc 2100
cccgtgctgg acgaccgcag cctgcccgcc tggtacaaga gcgccctgtt caacgagctg 2160
tacttcctgg ccgacggcgg caccgtgtgg ctggaggtgc tggaggacag cctgcccgag 2220
gagctgggcc gcaacatgtg ccacctgcgc cccaccctgc gcgactacgg ccgcttcggc 2280
tacctggagg gccaggagta ccgcatgtac aacacctacg acgtgcactt ctacgccagc 2340
ttcgccctga tcatgctgtg gcccaagctg gagctgagcc tgcagtacga catggccctg 2400
gccaccctgc gcgaggacct gacccgccgc cgctacctga tgagcggcgt gatggccccc 2460
gtgaagcgcc gcaacgtgat cccccacgac atcggcgacc ccgacgacga gccctggctg 2520
cgcgtgaacg cctacctgat ccacgacacc gccgactgga aggacctgaa cctgaagttc 2580
gtgctgcagg tgtaccgcga ctactacctg accggcgacc agaacttcct gaaggacatg 2640
tggcccgtgt gcctggccgt gatggagagc gagatgaagt tcgacaagga ccacgacggc 2700
ctgatcgaga acggcggcta cgccgaccag acctacgacg gctgggtgac caccggcccc 2760
agcgcctact gcggcggcct gtggctggcc gccgtggccg tgatggtgca gatggccgcc 2820
ctgtgcggcg cccaggacat ccaggacaag ttcagcagca tcctgagccg cggccaggag 2880
gcctacgagc gcctgctgtg gaacggccgc tactacaact acgacagcag cagccgcccc 2940
cagagccgca gcgtgatgag cgaccagtgc gccggccagt ggttcctgaa ggcctgcggc 3000
ctgggcgagg gcgacaccga ggtgttcccc acccagcacg tggtgcgcgc cctgcagacc 3060
atcttcgagc tgaacgtgca ggccttcgcc ggcggcgcca tgggcgccgt gaacggcatg 3120
cagccccacg gcgtgcccga caagagcagc gtgcagagcg acgaggtgtg ggtgggcgtg 3180
gtgtacggcc tggccgccac catgatccag gagggcctga cctgggaggg cttccagacc 3240
gccgagggct gctaccgcac cgtgtgggag cgcctgggcc tggccttcca gacccccgag 3300
gcctactgcc agcagcgcgt gttccgcagc ctggcctaca tgcgccccct gagcatctgg 3360
gccatgcagc tggccctgca gcagcagcag cacaagaagg ccagctggcc caaggtgaag 3420
cagggcaccg gcctgcgcac cggccccatg ttcggcccca aggaggccat ggccaacctg 3480
agccccgagt gacaattgtt aattaagttt aaaccctcga ggccgcaagc ttatcgataa 3540
tcaacctctg gattacaaaa tttgtgaaag attgactggt attcttaact atgttgctcc 3600
ttttacgcta tgtggatacg ctgctttaat gcctttgtat catgctattg cttcccgtat 3660
ggctttcatt ttctcctcct tgtataaatc ctggttgctg tctctttatg aggagttgtg 3720
gcccgttgtc aggcaacgtg gcgtggtgtg cactgtgttt gctgacgcaa cccccactgg 3780
ttggggcatt gccaccacct gtcagctcct ttccgggact ttcgctttcc ccctccctat 3840
tgccacggcg gaactcatcg ccgcctgcct tgcccgctgc tggacagggg ctcggctgtt 3900
gggcactgac aattccgtgg tgttgtcggg gaaatcatcg tcctttcctt ggctgctcgc 3960
ctgtgttgcc acctggattc tgcgcgggac gtccttctgc tacgtccctt cggccctcaa 4020
tccagcggac cttccttccc gcggcctgct gccggctctg cggcctcttc cgcgtcttcg 4080
ccttcgccct cagacgagtc ggatctccct ttgggccgcc tccccgcatc gataccgtcg 4140
actagagctc gctgatcagc ctcgactgtg ccttctagtt gccagccatc tgttgtttgc 4200
ccctcccccg tgccttcctt gaccctggaa ggtgccactc ccactgtcct ttcctaataa 4260
aatgaggaaa ttgcatcgca ttgtctgagt aggtgtcatt ctattctggg gggtggggtg 4320
gggcaggaca gcaaggggga ggattgggaa gacaatagca ggcatgctgg ggagagatcc 4380
acgataacaa acagcttttt tggggtgaac atattgactg aattccctgc aggttggcca 4440
ctccctctct gcgcgctcgc tcgctcactg aggccgcccg ggcaaagccc gggcgtcggg 4500
cgacctttgg tcgcccggcc tcagtgagcg agcgagcgcg cagagaggga gtggccaact 4560
ccatcactag gggttcctgc ggccgctcgt acggtctcga ggaattcctg caggataact 4620
tgccaacctc attctaaaat gtatatagaa gcccaaaaga caataacaaa aatattcttg 4680
tagaacaaaa tgggaaagaa tgttccacta aatatcaaga tttagagcaa agcatgagat 4740
gtgtggggat agacagtgag gctgataaaa tagagtagag ctcagaaaca gacccattga 4800
tatatgtaag tgacctatga aaaaaatatg gcattttaca atgggaaaat gatggtcttt 4860
ttctttttta gaaaaacagg gaaatatatt tatatgtaaa aaataaaagg gaacccatat 4920
gtcataccat acacacaaaa aaattccagt gaattataag tctaaatgga gaaggcaaaa 4980
ctttaaatct tttagaaaat aatatagaag catgcagacc agcctggcca acatgatgaa 5040
accctctcta ctaataataa aatcagtaga actactcagg actactttga gtgggaagtc 5100
cttttctatg aagacttctt tggccaaaat taggctctaa atgcaaggag atagtgcatc 5160
atgcctggct gcacttactg ataaatgatg ttatcaccat ctttaaccaa atgcacagga 5220
acaagttatg gtactgatgt gctggattga gaaggagctc tacttccttg acaggacaca 5280
tttgtatcaa cttaaaaaag cagatttttg ccagcagaac tattcattca gaggtaggaa 5340
acttagaata gatgatgtca ctgattagca tggcttcccc atctccacag ctgcttccca 5400
cccaggttgc ccacagttga gtttgtccag tgctcagggc tgcccactct cagtaagaag 5460
ccccacacca gcccctctcc aaatatgttg gctgttcctt ccattaaagt gaccccactt 5520
tagagcagca agtggatttc tgtttcttac agttcaggaa ggaggagtca gctgtgagaa 5580
cctggagcct gagatgcttc taagtcccac tgctactggg gtcagggaag ccagactcca 5640
gcatcagcag tcaggagcac taagcccttg ccaacatcct gtttctcaga gaaactgctt 5700
ccattataat ggttgtcctt ttttaagcta tcaagccaaa caaccagtgt ctaccattat 5760
tctcatcacc tgaagccaag ggttctagca aaagtcaagc tgtcttgtaa tggttgatgt 5820
gcctccagct tctgtcttca gtcactccac tcttagcctg ctctgaatca actctgacca 5880
cagttccctg gagcccctgc cacctgctgc ccctgccacc ttctccatct gcagtgctgt 5940
gcagccttct gcactcttgc agagctaata ggtggagact tgaaggaaga ggaggaaagt 6000
ttctcataat agccttgctg caagctcaaa tgggaggtgg gcactgtgcc caggagcctt 6060
ggagcaaagg ctgtgcccaa cctctgactg catccaggtt tggtcttgac agagataaga 6120
agccctggct tttggagcca aaatctaggt cagacttagg caggattctc aaagtttatc 6180
agcagaacat gaggcagaag accctttctg ctccagcttc ttcaggctca accttcatca 6240
gaatagatag aaagagaggc tgtgagggtt cttaaaacag aagcaaatct gactcagaga 6300
ataaacaacc tcctagtaaa ctacagctta gacagagcat ctggtggtga gtgtgctcag 6360
tgtcctactc aactgtctgg tatcagccct catgaggact tctcttcttt ccctcataga 6420
cctccatctc tgttttcctt agcctgcaga aatctggatg gctattcaca gaatgcctgt 6480
gctttcagag ttgcattttt tctctggtat tctggttcaa gcatttgaag gtaggaaagg 6540
ttctccaagt gcaagaaagc cagccctgag cctcaactgc ctggctagtg tggtcagtag 6600
gatgcaaagg ctgttgaatg ccacaaggcc aaactttaac ctgtgtacca caagcctagc 6660
agcagaggca gctctgctca ctggaactct ctgtcttctt tctcctgagc cttttctttt 6720
cctgagtttt ctagctctcc tcaaccttac ctctgcccta cccaggacaa acccaagagc 6780
cactgtttct gtgatgtcct ctccagccct aattaggcat catgacttca gcctgacctt 6840
ccatgctcag aagcagtgct aatccacttc agatgagctg ctctatgcaa cacaggcaga 6900
gcctacaaac ctttgcacca gagccctcca catatcagtg tttgttcata ctcacttcaa 6960
cagcaaatgt gactgctgag attaagattt tacacaagat ggtctgtaat ttcacagtta 7020
gttttatccc attaggtatg aaagaattag cataattccc cttaaacatg aatgaatctt 7080
agatttttta ataaatagtt ttggaagtaa agacagagac atcaggagca caaggaatag 7140
cctgagagga caaacagaac aagaaagagt ctggaaatac acaggatgtt cttggcctcc 7200
tcaaagcaag tgcaagcaga tagtaccagc agccccaggc tatcagagcc cagtgaagag 7260
aagtaccatg aaagccacag ctctaaccac cctgttccag agtgacagac agtccccaag 7320
acaagccagc ctgagccaga gagagaactg caagagaaag tttctaattt aggttctgtt 7380
agattcagac aagtgcaggt catcctctct ccacagctac tcacctctcc agcctaacaa 7440
agcctgcagt ccacactcca accctggtgt ctcacctcct agcctctccc aacatcctgc 7500
tctctgacca tcttctgcat ctctcatctc accatctccc actgtctaca gcctactctt 7560
gcaactacca tctcattttc tgacatcctg tctacatctt ctgccatact ctgccatcta 7620
ccataccacc tcttaccatc taccacacca tcttttatct ccatccctct cagaagcctc 7680
caagctgaat cctgctttat gtgttcatct cagcccctgc atggaaagct gaccccagag 7740
gcagaactat tcccagagag cttggccaag aaaaacaaaa ctaccagcct ggccaggctc 7800
aggagtagta agctgcagtg tctgttgtgt tctagcttca acagctgcag gagttccact 7860
ctcaaatgct ccacatttct cacatcctcc tgattctggt cactacccat cttcaaagaa 7920
cagaatatct cacatcagca tactgtgaag gactagtcat gggtgcagct gctcagagct 7980
gcaaagtcat tctggatggt ggagagctta caaacatttc atgatgctcc ccccgctctg 8040
atggctggag cccaatccct acacagactc ctgctgtatg tgttttcctt tcactctgag 8100
ccacagccag agggcaggca ttcagtctcc tcttcaggct ggggctgggg cactgagaac 8160
tcacccaaca ccttgctctc actccttctg caaaacaaga aagagctttg tgctgcagta 8220
gccatgaaga atgaaaggaa ggctttaact aaaaaatgtc agagattatt ttcaacccct 8280
tactgtggat caccagcaag gaggaaacac aacacagaga cattttttcc cctcaaatta 8340
tcaaaagaat cactgcattt gttaaagaga gcaactgaat caggaagcag agttttgaac 8400
atatcagaag ttaggaatct gcatcagaga caaatgcagt catggttgtt tgctgcatac 8460
cagccctaat cattagaagc ctcatggact tcaaacatca ttccctctga caagatgctc 8520
tagcctaact ccatgagata aaataaatct gcctttcaga gccaaagaag agtccaccag 8580
cttcttctca gtgtgaacaa gagctccagt caggttagtc agtccagtgc agtagaggag 8640
accagtctgc atcctctaat tttcaaaggc aagaagattt gtttaccctg gacaccaggc 8700
acaagtgagg tcacagagct cttagatatg cagtcctcat gagtgaggag actaaagcgc 8760
atgccatcaa gacttcagtg tagagaaaac ctccaaaaaa gcctcctcac tacttctgga 8820
atagctcaga ggccgaggcg gcctcggcct ctgcataaat aaaaaaaatt agtcagccat 8880
ggggcggaga atgggcggaa ctgggcggag ttaggggcgg gatgggcgga gttaggggcg 8940
ggactatggt tgctgactaa ttgagatgca tgctttgcat acttctgcct gctggggagc 9000
ctggggactt tccacacctg gttgctgact aattgagatg catgctttgc atacttctgc 9060
ctgctgggga gcctggggac tttccacacc ctaactgaca cacattccac agctgcatta 9120
atgaatcggc caacgcgcgg ggagaggcgg tttgcgtatt gggcgctctt ccgcttcctc 9180
gctcactgac tcgctgcgct cggtcgttcg gctgcggcga gcggtatcag ctcactcaaa 9240
ggcggtaata cggttatcca cagaatcagg ggataacgca ggaaagaaca tgtgagcaaa 9300
aggccagcaa aaggccagga accgtaaaaa ggccgcgttg ctggcgtttt tccataggct 9360
ccgcccccct gacgagcatc acaaaaatcg acgctcaagt cagaggtggc gaaacccgac 9420
aggactataa agataccagg cgtttccccc tggaagctcc ctcgtgcgct ctcctgttcc 9480
gaccctgccg cttaccggat acctgtccgc ctttctccct tcgggaagcg tggcgctttc 9540
tcatagctca cgctgtaggt atctcagttc ggtgtaggtc gttcgctcca agctgggctg 9600
tgtgcacgaa ccccccgttc agcccgaccg ctgcgcctta tccggtaact atcgtcttga 9660
gtccaacccg gtaagacacg acttatcgcc actggcagca gccactggta acaggattag 9720
cagagcgagg tatgtaggcg gtgctacaga gttcttgaag tggtggccta actacggcta 9780
cactagaaga acagtatttg gtatctgcgc tctgctgaag ccagttacct tcggaaaaag 9840
agttggtagc tcttgatccg gcaaacaaac caccgctggt agcggtggtt tttttgtttg 9900
caagcagcag attacgcgca gaaaaaaagg atctcaagaa gatcctttga tcttttctac 9960
ggggtctgac gctcagtgga acgaaaactc acgttaaggg attttggtca tgagattatc 10020
aaaaaggatc ttcacctaga tccttttaaa ttaaaaatga agttttaaat caatctaaag 10080
tatatatgag taaacttggt ctgacagtta ccaatgctta atcagtgagg cacctatctc 10140
agcgatctgt ctatttcgtt catccatagt tgcctgactc ctgcaaacca cgttgtgtct 10200
caaaatctct gatgttacat tgcacaagat aaaaatatat catcatgaac aataaaactg 10260
tctgcttaca taaacagtaa tacaaggggt gttatgagcc atattcaacg ggaaacgtct 10320
tgctcgaggc cgcgattaaa ttccaacatg gatgctgatt tatatgggta taaatgggct 10380
cgcgataatg tcgggcaatc aggtgcgaca atctatcgat tgtatgggaa gcccgatgcg 10440
ccagagttgt ttctgaaaca tggcaaaggt agcgttgcca atgatgttac agatgagatg 10500
gtcagactaa actggctgac ggaatttatg cctcttccga ccatcaagca ttttatccgt 10560
actcctgatg atgcatggtt actcaccact gcgatccccg ggaaaacagc attccaggta 10620
ttagaagaat atcctgattc aggtgaaaat attgttgatg cgctggcagt gttcctgcgc 10680
cggttgcatt cgattcctgt ttgtaattgt ccttttaaca gcgatcgcgt atttcgtctc 10740
gctcaggcgc aatcacgaat gaataacggt ttggttgatg cgagtgattt tgatgacgag 10800
cgtaatggct ggcctgttga acaagtctgg aaagaaatgc ataagctttt gccattctca 10860
ccggattcag tcgtcactca tggtgatttc tcacttgata accttatttt tgacgagggg 10920
aaattaatag gttgtattga tgttggacga gtcggaatcg cagaccgata ccaggatctt 10980
gccatcctat ggaactgcct cggtgagttt tctccttcat tacagaaacg gctttttcaa 11040
aaatatggta ttgataatcc tgatatgaat aaattgcagt ttcatttgat gctcgatgag 11100
tttttctaag ggcggcctgc caccataccc acgccgaaac aagcgctcat gagcccgaag 11160
tggcgagccc gatcttcccc atcggtgatg tcggcgatat aggcgccagc aaccgcacct 11220
gtggcgccgg tgatgagggc gcgccaagtc gacgtccggc agtc 11264
<210> 33
<211> 685
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 33
Met Ala Glu Trp Leu Leu Ser Ala Ser Trp Gln Arg Arg Ala Lys Ala
1 5 10 15
Met Thr Ala Ala Ala Gly Ser Ala Gly Arg Ala Ala Val Pro Leu Leu
20 25 30
Leu Cys Ala Leu Leu Ala Pro Gly Gly Ala Tyr Val Leu Asp Asp Ser
35 40 45
Asp Gly Leu Gly Arg Glu Phe Asp Gly Ile Gly Ala Val Ser Gly Gly
50 55 60
Gly Ala Thr Ser Arg Leu Leu Val Asn Tyr Pro Glu Pro Tyr Arg Ser
65 70 75 80
Gln Ile Leu Asp Tyr Leu Phe Lys Pro Asn Phe Gly Ala Ser Leu His
85 90 95
Ile Leu Lys Val Glu Ile Gly Gly Asp Gly Gln Thr Thr Asp Gly Thr
100 105 110
Glu Pro Ser His Met His Tyr Ala Leu Asp Glu Asn Tyr Phe Arg Gly
115 120 125
Tyr Glu Trp Trp Leu Met Lys Glu Ala Lys Lys Arg Asn Pro Asn Ile
130 135 140
Thr Leu Ile Gly Leu Pro Trp Ser Phe Pro Gly Trp Leu Gly Lys Gly
145 150 155 160
Phe Asp Trp Pro Tyr Val Asn Leu Gln Leu Thr Ala Tyr Tyr Val Val
165 170 175
Thr Trp Ile Val Gly Ala Lys Arg Tyr His Asp Leu Asp Ile Asp Tyr
180 185 190
Ile Gly Ile Trp Asn Glu Arg Ser Tyr Asn Ala Asn Tyr Ile Lys Ile
195 200 205
Leu Arg Lys Met Leu Asn Tyr Gln Gly Leu Gln Arg Val Lys Ile Ile
210 215 220
Ala Ser Asp Asn Leu Trp Glu Ser Ile Ser Ala Ser Met Leu Leu Asp
225 230 235 240
Ala Glu Leu Phe Lys Val Val Asp Val Ile Gly Ala His Tyr Pro Gly
245 250 255
Thr His Ser Ala Lys Asp Ala Lys Leu Thr Gly Lys Lys Leu Trp Ser
260 265 270
Ser Glu Asp Phe Ser Thr Leu Asn Ser Asp Met Gly Ala Gly Cys Trp
275 280 285
Gly Arg Ile Leu Asn Gln Asn Tyr Ile Asn Gly Tyr Met Thr Ser Thr
290 295 300
Ile Ala Trp Asn Leu Val Ala Ser Tyr Tyr Glu Gln Leu Pro Tyr Gly
305 310 315 320
Arg Cys Gly Leu Met Thr Ala Gln Glu Pro Trp Ser Gly His Tyr Val
325 330 335
Val Glu Ser Pro Val Trp Val Ser Ala His Thr Thr Gln Phe Thr Gln
340 345 350
Pro Gly Trp Tyr Tyr Leu Lys Thr Val Gly His Leu Glu Lys Gly Gly
355 360 365
Ser Tyr Val Ala Leu Thr Asp Gly Leu Gly Asn Leu Thr Ile Ile Ile
370 375 380
Glu Thr Met Ser His Lys His Ser Lys Cys Ile Arg Pro Phe Leu Pro
385 390 395 400
Tyr Phe Asn Val Ser Gln Gln Phe Ala Thr Phe Val Leu Lys Gly Ser
405 410 415
Phe Ser Glu Ile Pro Glu Leu Gln Val Trp Tyr Thr Lys Leu Gly Lys
420 425 430
Thr Ser Glu Arg Phe Leu Phe Lys Gln Leu Asp Ser Leu Trp Leu Leu
435 440 445
Asp Ser Asp Gly Ser Phe Thr Leu Ser Leu His Glu Asp Glu Leu Phe
450 455 460
Thr Leu Thr Thr Leu Thr Thr Gly Arg Lys Gly Ser Tyr Pro Leu Pro
465 470 475 480
Pro Lys Ser Gln Pro Phe Pro Ser Thr Tyr Lys Asp Asp Phe Asn Val
485 490 495
Asp Tyr Pro Phe Phe Ser Glu Ala Pro Asn Phe Ala Asp Gln Thr Gly
500 505 510
Val Phe Glu Tyr Phe Thr Asn Ile Glu Asp Pro Gly Glu His His Phe
515 520 525
Thr Leu Arg Gln Val Leu Asn Gln Arg Pro Ile Thr Trp Ala Ala Asp
530 535 540
Ala Ser Asn Thr Ile Ser Ile Ile Gly Asp Tyr Asn Trp Thr Asn Leu
545 550 555 560
Thr Ile Lys Cys Asp Val Tyr Ile Glu Thr Pro Asp Thr Gly Gly Val
565 570 575
Phe Ile Ala Gly Arg Val Asn Lys Gly Gly Ile Leu Ile Arg Ser Ala
580 585 590
Arg Gly Ile Phe Phe Trp Ile Phe Ala Asn Gly Ser Tyr Arg Val Thr
595 600 605
Gly Asp Leu Ala Gly Trp Ile Ile Tyr Ala Leu Gly Arg Val Glu Val
610 615 620
Thr Ala Lys Lys Trp Tyr Thr Leu Thr Leu Thr Ile Lys Gly His Phe
625 630 635 640
Thr Ser Gly Met Leu Asn Asp Lys Ser Leu Trp Thr Asp Ile Pro Val
645 650 655
Asn Phe Pro Lys Asn Gly Trp Ala Ala Ile Gly Thr His Ser Phe Glu
660 665 670
Phe Ala Gln Phe Asp Asn Phe Leu Val Glu Ala Thr Arg
675 680 685
<210> 34
<211> 2055
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 34
atggccgagt ggctgctgag cgccagctgg cagcgccgcg ccaaggccat gaccgccgcc 60
gccggcagcg ccggccgcgc cgccgtgccc ctgctgctgt gcgccctgct ggcccccggc 120
ggcgcctacg tgctggacga cagcgacggc ctgggccgcg agttcgacgg catcggcgcc 180
gtgagcggcg gcggcgccac cagccgcctg ctggtgaact accccgagcc ctaccgcagc 240
cagatcctgg actacctgtt caagcccaac ttcggcgcca gcctgcacat cctgaaggtg 300
gagatcggcg gcgacggcca gaccaccgac ggcaccgagc ccagccacat gcactacgcc 360
ctggacgaga actacttccg cggctacgag tggtggctga tgaaggaggc caagaagcgc 420
aaccccaaca tcaccctgat cggcctgccc tggagcttcc ccggctggct gggcaagggc 480
ttcgactggc cctacgtgaa cctgcagctg accgcctact acgtggtgac ctggatcgtg 540
ggcgccaagc gctaccacga cctggacatc gactacatcg gcatctggaa cgagcgcagc 600
tacaacgcca actacatcaa gatcctgcgc aagatgctga actaccaggg cctgcagcgc 660
gtgaagatca tcgccagcga caacctgtgg gagagcatca gcgccagcat gctgctggac 720
gccgagctgt tcaaggtggt ggacgtgatc ggcgcccact accccggcac ccacagcgcc 780
aaggacgcca agctgaccgg caagaagctg tggagcagcg aggacttcag caccctgaac 840
agcgacatgg gcgccggctg ctggggccgc atcctgaacc agaactacat caacggctac 900
atgaccagca ccatcgcctg gaacctggtg gccagctact acgagcagct gccctacggc 960
cgctgcggcc tgatgaccgc ccaggagccc tggagcggcc actacgtggt ggagagcccc 1020
gtgtgggtga gcgcccacac cacccagttc acccagcccg gctggtacta cctgaagacc 1080
gtgggccacc tggagaaggg cggcagctac gtggccctga ccgacggcct gggcaacctg 1140
accatcatca tcgagaccat gagccacaag cacagcaagt gcatccgccc cttcctgccc 1200
tacttcaacg tgagccagca gttcgccacc ttcgtgctga agggcagctt cagcgagatc 1260
cccgagctgc aggtgtggta caccaagctg ggcaagacca gcgagcgctt cctgttcaag 1320
cagctggaca gcctgtggct gctggacagc gacggcagct tcaccctgag cctgcacgag 1380
gacgagctgt tcaccctgac caccctgacc accggccgca agggcagcta ccccctgccc 1440
cccaagagcc agcccttccc cagcacctac aaggacgact tcaacgtgga ctaccccttc 1500
ttcagcgagg cccccaactt cgccgaccag accggcgtgt tcgagtactt caccaacatc 1560
gaggaccccg gcgagcacca cttcaccctg cgccaggtgc tgaaccagcg ccccatcacc 1620
tgggccgccg acgccagcaa caccatcagc atcatcggcg actacaactg gaccaacctg 1680
accatcaagt gcgacgtgta catcgagacc cccgacaccg gcggcgtgtt catcgccggc 1740
cgcgtgaaca agggcggcat cctgatccgc agcgcccgcg gcatcttctt ctggatcttc 1800
gccaacggca gctaccgcgt gaccggcgac ctggccggct ggatcatcta cgccctgggc 1860
cgcgtggagg tgaccgccaa gaagtggtac accctgaccc tgaccatcaa gggccacttc 1920
accagcggca tgctgaacga caagagcctg tggaccgaca tccccgtgaa cttccccaag 1980
aacggctggg ccgccatcgg cacccacagc ttcgagttcg cccagttcga caacttcctg 2040
gtggaggcca cccgc 2055
<210> 35
<211> 339
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 35
Met Trp Gln Leu Trp Ala Ser Leu Cys Cys Leu Leu Val Leu Ala Asn
1 5 10 15
Ala Arg Ser Arg Pro Ser Phe His Pro Leu Ser Asp Glu Leu Val Asn
20 25 30
Tyr Val Asn Lys Arg Asn Thr Thr Trp Gln Ala Gly His Asn Phe Tyr
35 40 45
Asn Val Asp Met Ser Tyr Leu Lys Arg Leu Cys Gly Thr Phe Leu Gly
50 55 60
Gly Pro Lys Pro Pro Gln Arg Val Met Phe Thr Glu Asp Leu Lys Leu
65 70 75 80
Pro Ala Ser Phe Asp Ala Arg Glu Gln Trp Pro Gln Cys Pro Thr Ile
85 90 95
Lys Glu Ile Arg Asp Gln Gly Ser Cys Gly Ser Cys Trp Ala Phe Gly
100 105 110
Ala Val Glu Ala Ile Ser Asp Arg Ile Cys Ile His Thr Asn Ala His
115 120 125
Val Ser Val Glu Val Ser Ala Glu Asp Leu Leu Thr Cys Cys Gly Ser
130 135 140
Met Cys Gly Asp Gly Cys Asn Gly Gly Tyr Pro Ala Glu Ala Trp Asn
145 150 155 160
Phe Trp Thr Arg Lys Gly Leu Val Ser Gly Gly Leu Tyr Glu Ser His
165 170 175
Val Gly Cys Arg Pro Tyr Ser Ile Pro Pro Cys Glu His His Val Asn
180 185 190
Gly Ser Arg Pro Pro Cys Thr Gly Glu Gly Asp Thr Pro Lys Cys Ser
195 200 205
Lys Ile Cys Glu Pro Gly Tyr Ser Pro Thr Tyr Lys Gln Asp Lys His
210 215 220
Tyr Gly Tyr Asn Ser Tyr Ser Val Ser Asn Ser Glu Lys Asp Ile Met
225 230 235 240
Ala Glu Ile Tyr Lys Asn Gly Pro Val Glu Gly Ala Phe Ser Val Tyr
245 250 255
Ser Asp Phe Leu Leu Tyr Lys Ser Gly Val Tyr Gln His Val Thr Gly
260 265 270
Glu Met Met Gly Gly His Ala Ile Arg Ile Leu Gly Trp Gly Val Glu
275 280 285
Asn Gly Thr Pro Tyr Trp Leu Val Ala Asn Ser Trp Asn Thr Asp Trp
290 295 300
Gly Asp Asn Gly Phe Phe Lys Ile Leu Arg Gly Gln Asp His Cys Gly
305 310 315 320
Ile Glu Ser Glu Val Val Ala Gly Ile Pro Arg Thr Asp Gln Tyr Trp
325 330 335
Glu Lys Ile
<210> 36
<211> 1017
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 36
atgtggcagc tgtgggccag cctgtgctgc ctgctggtgc tggccaacgc ccgcagccgc 60
cccagcttcc accccctgag cgacgagctg gtgaactacg tgaacaagcg caacaccacc 120
tggcaggccg gccacaactt ctacaacgtg gacatgagct acctgaagcg cctgtgcggc 180
accttcctgg gcggccccaa gcccccccag cgcgtgatgt tcaccgagga cctgaagctg 240
cccgccagct tcgacgcccg cgagcagtgg ccccagtgcc ccaccatcaa ggagatccgc 300
gaccagggca gctgcggcag ctgctgggcc ttcggcgccg tggaggccat cagcgaccgc 360
atctgcatcc acaccaacgc ccacgtgagc gtggaggtga gcgccgagga cctgctgacc 420
tgctgcggca gcatgtgcgg cgacggctgc aacggcggct accccgccga ggcctggaac 480
ttctggaccc gcaagggcct ggtgagcggc ggcctgtacg agagccacgt gggctgccgc 540
ccctacagca tccccccctg cgagcaccac gtgaacggca gccgcccccc ctgcaccggc 600
gagggcgaca cccccaagtg cagcaagatc tgcgagcccg gctacagccc cacctacaag 660
caggacaagc actacggcta caacagctac agcgtgagca acagcgagaa ggacatcatg 720
gccgagatct acaagaacgg ccccgtggag ggcgccttca gcgtgtacag cgacttcctg 780
ctgtacaaga gcggcgtgta ccagcacgtg accggcgaga tgatgggcgg ccacgccatc 840
cgcatcctgg gctggggcgt ggagaacggc accccctact ggctggtggc caacagctgg 900
aacaccgact ggggcgacaa cggcttcttc aagatcctgc gcggccagga ccactgcggc 960
atcgagagcg aggtggtggc cggcatcccc cgcaccgacc agtactggga gaagatc 1017
<210> 37
<211> 631
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 37
Met Pro Arg Tyr Gly Ala Ser Leu Arg Gln Ser Cys Pro Arg Ser Gly
1 5 10 15
Arg Glu Gln Gly Gln Asp Gly Thr Ala Gly Ala Pro Gly Leu Leu Trp
20 25 30
Met Gly Leu Val Leu Ala Leu Ala Leu Ala Leu Ala Leu Ala Leu Ala
35 40 45
Leu Ser Asp Ser Arg Val Leu Trp Ala Pro Ala Glu Ala His Pro Leu
50 55 60
Ser Pro Gln Gly His Pro Ala Arg Leu His Arg Ile Val Pro Arg Leu
65 70 75 80
Arg Asp Val Phe Gly Trp Gly Asn Leu Thr Cys Pro Ile Cys Lys Gly
85 90 95
Leu Phe Thr Ala Ile Asn Leu Gly Leu Lys Lys Glu Pro Asn Val Ala
100 105 110
Arg Val Gly Ser Val Ala Ile Lys Leu Cys Asn Leu Leu Lys Ile Ala
115 120 125
Pro Pro Ala Val Cys Gln Ser Ile Val His Leu Phe Glu Asp Asp Met
130 135 140
Val Glu Val Trp Arg Arg Ser Val Leu Ser Pro Ser Glu Ala Cys Gly
145 150 155 160
Leu Leu Leu Gly Ser Thr Cys Gly His Trp Asp Ile Phe Ser Ser Trp
165 170 175
Asn Ile Ser Leu Pro Thr Val Pro Lys Pro Pro Pro Lys Pro Pro Ser
180 185 190
Pro Pro Ala Pro Gly Ala Pro Val Ser Arg Ile Leu Phe Leu Thr Asp
195 200 205
Leu His Trp Asp His Asp Tyr Leu Glu Gly Thr Asp Pro Asp Cys Ala
210 215 220
Asp Pro Leu Cys Cys Arg Arg Gly Ser Gly Leu Pro Pro Ala Ser Arg
225 230 235 240
Pro Gly Ala Gly Tyr Trp Gly Glu Tyr Ser Lys Cys Asp Leu Pro Leu
245 250 255
Arg Thr Leu Glu Ser Leu Leu Ser Gly Leu Gly Pro Ala Gly Pro Phe
260 265 270
Asp Met Val Tyr Trp Thr Gly Asp Ile Pro Ala His Asp Val Trp His
275 280 285
Gln Thr Arg Gln Asp Gln Leu Arg Ala Leu Thr Thr Val Thr Ala Leu
290 295 300
Val Arg Lys Phe Leu Gly Pro Val Pro Val Tyr Pro Ala Val Gly Asn
305 310 315 320
His Glu Ser Thr Pro Val Asn Ser Phe Pro Pro Pro Phe Ile Glu Gly
325 330 335
Asn His Ser Ser Arg Trp Leu Tyr Glu Ala Met Ala Lys Ala Trp Glu
340 345 350
Pro Trp Leu Pro Ala Glu Ala Leu Arg Thr Leu Arg Ile Gly Gly Phe
355 360 365
Tyr Ala Leu Ser Pro Tyr Pro Gly Leu Arg Leu Ile Ser Leu Asn Met
370 375 380
Asn Phe Cys Ser Arg Glu Asn Phe Trp Leu Leu Ile Asn Ser Thr Asp
385 390 395 400
Pro Ala Gly Gln Leu Gln Trp Leu Val Gly Glu Leu Gln Ala Ala Glu
405 410 415
Asp Arg Gly Asp Lys Val His Ile Ile Gly His Ile Pro Pro Gly His
420 425 430
Cys Leu Lys Ser Trp Ser Trp Asn Tyr Tyr Arg Ile Val Ala Arg Tyr
435 440 445
Glu Asn Thr Leu Ala Ala Gln Phe Phe Gly His Thr His Val Asp Glu
450 455 460
Phe Glu Val Phe Tyr Asp Glu Glu Thr Leu Ser Arg Pro Leu Ala Val
465 470 475 480
Ala Phe Leu Ala Pro Ser Ala Thr Thr Tyr Ile Gly Leu Asn Pro Gly
485 490 495
Tyr Arg Val Tyr Gln Ile Asp Gly Asn Tyr Ser Gly Ser Ser His Val
500 505 510
Val Leu Asp His Glu Thr Tyr Ile Leu Asn Leu Thr Gln Ala Asn Ile
515 520 525
Pro Gly Ala Ile Pro His Trp Gln Leu Leu Tyr Arg Ala Arg Glu Thr
530 535 540
Tyr Gly Leu Pro Asn Thr Leu Pro Thr Ala Trp His Asn Leu Val Tyr
545 550 555 560
Arg Met Arg Gly Asp Met Gln Leu Phe Gln Thr Phe Trp Phe Leu Tyr
565 570 575
His Lys Gly His Pro Pro Ser Glu Pro Cys Gly Thr Pro Cys Arg Leu
580 585 590
Ala Thr Leu Cys Ala Gln Leu Ser Ala Arg Ala Asp Ser Pro Ala Leu
595 600 605
Cys Arg His Leu Met Pro Asp Gly Ser Leu Pro Glu Ala Gln Ser Leu
610 615 620
Trp Pro Arg Pro Leu Phe Cys
625 630
<210> 38
<211> 1896
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 38
atgccccgct acggcgccag cctgcgccag agctgccccc gcagcggccg cgagcagggc 60
caggacggca ccgccggcgc ccccggcctg ctgtggatgg gcctggtgct ggccctggcc 120
ctggccctgg ccctggccct ggccctgagc gacagccgcg tgctgtgggc ccccgccgag 180
gcccaccccc tgagccccca gggccacccc gcccgcctgc accgcatcgt gccccgcctg 240
cgcgacgtgt tcggctgggg caacctgacc tgccccatct gcaagggcct gttcaccgcc 300
atcaacctgg gcctgaagaa ggagcccaac gtggcccgcg tgggcagcgt ggccatcaag 360
ctgtgcaacc tgctgaagat cgcccccccc gccgtgtgcc agagcatcgt gcacctgttc 420
gaggacgaca tggtggaggt gtggcgccgc agcgtgctga gccccagcga ggcctgcggc 480
ctgctgctgg gcagcacctg cggccactgg gacatcttca gcagctggaa catcagcctg 540
cccaccgtgc ccaagccccc ccccaagccc cccagccccc ccgcccccgg cgcccccgtg 600
agccgcatcc tgttcctgac cgacctgcac tgggaccacg actacctgga gggcaccgac 660
cccgactgcg ccgaccccct gtgctgccgc cgcggcagcg gcctgccccc cgccagccgc 720
cccggcgccg gctactgggg cgagtacagc aagtgcgacc tgcccctgcg caccctggag 780
agcctgctga gcggcctggg ccccgccggc cccttcgaca tggtgtactg gaccggcgac 840
atccccgccc acgacgtgtg gcaccagacc cgccaggacc agctgcgcgc cctgaccacc 900
gtgaccgccc tggtgcgcaa gttcctgggc cccgtgcccg tgtaccccgc cgtgggcaac 960
cacgagagca cccccgtgaa cagcttcccc ccccccttca tcgagggcaa ccacagcagc 1020
cgctggctgt acgaggccat ggccaaggcc tgggagccct ggctgcccgc cgaggccctg 1080
cgcaccctgc gcatcggcgg cttctacgcc ctgagcccct accccggcct gcgcctgatc 1140
agcctgaaca tgaacttctg cagccgcgag aacttctggc tgctgatcaa cagcaccgac 1200
cccgccggcc agctgcagtg gctggtgggc gagctgcagg ccgccgagga ccgcggcgac 1260
aaggtgcaca tcatcggcca catccccccc ggccactgcc tgaagagctg gagctggaac 1320
tactaccgca tcgtggcccg ctacgagaac accctggccg cccagttctt cggccacacc 1380
cacgtggacg agttcgaggt gttctacgac gaggagaccc tgagccgccc cctggccgtg 1440
gccttcctgg cccccagcgc caccacctac atcggcctga accccggcta ccgcgtgtac 1500
cagatcgacg gcaactacag cggcagcagc cacgtggtgc tggaccacga gacctacatc 1560
ctgaacctga cccaggccaa catccccggc gccatccccc actggcagct gctgtaccgc 1620
gcccgcgaga cctacggcct gcccaacacc ctgcccaccg cctggcacaa cctggtgtac 1680
cgcatgcgcg gcgacatgca gctgttccag accttctggt tcctgtacca caagggccac 1740
ccccccagcg agccctgcgg caccccctgc cgcctggcca ccctgtgcgc ccagctgagc 1800
gcccgcgccg acagccccgc cctgtgccgc cacctgatgc ccgacggcag cctgcccgag 1860
gcccagagcc tgtggccccg ccccctgttc tgctaa 1896
<210> 39
<211> 11329
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 39
ttggccactc cctctctgcg cgctcgctcg ctcactgagg ccgggcgacc aaaggtcgcc 60
cgacgcccgg gctttgcccg ggcggcctca gtgagcgagc gagcgcgcag agagggagtg 120
gccaactcca tcactagggg ttcctgctag ctctgggtat ttaagcccga gtgagcacgc 180
agggtctcca ttttgaagcg ggaggttacg cgttcgtcga ctactagtgg gtaccagagc 240
tccctaggtt ctagaaccgg tgacgtctcc catggtgaag cttggatctg agggcggagt 300
tagggcggag ccaatcagcg tgcgccgttc cgaaagttgc cttttatggc tgggcggaga 360
atgggcggtg aacgccgatg attatataag gacgcgccgg gtgtggcaca gctagttccg 420
tcgcagccgg gatttgggtc gcggttcttg tttgtggatc cctgtgatcg tcacttggta 480
agtcactgac tgtctatgcc tgggaaaggg tgggcaggag atggggcagt gcaggaaaag 540
tggcactatg aaccctgcag ccctaggaat gcatctagac aattgtacta accttcttct 600
ctttcctctc ctgacagtcc ggaaagccac catggaattc agcagcccca gcagagagga 660
atgccccaag cctctgagcc gggtgtcaat catggccgga tctctgacag gactgctgct 720
gcttcaggcc gtgtcttggg cttctggcgc tagaccttgc atccccaaga gcttcggcta 780
cagcagcgtc gtgtgcgtgt gcaatgccac ctactgcgac agcttcgacc ctcctacctt 840
tcctgctctg ggcaccttca gcagatacga gagcaccaga tccggcagac ggatggaact 900
gagcatggga cccatccagg ccaatcacac aggcactggc ctgctgctga cactgcagcc 960
tgagcagaaa ttccagaaag tgaaaggctt cggcggagcc atgacagatg ccgccgctct 1020
gaatatcctg gctctgtctc caccagctca gaacctgctg ctcaagagct acttcagcga 1080
ggaaggcatc ggctacaaca tcatcagagt gcccatggcc agctgcgact tcagcatcag 1140
gacctacacc tacgccgaca cacccgacga tttccagctg cacaacttca gcctgcctga 1200
agaggacacc aagctgaaga tccctctgat ccacagagcc ctgcagctgg cacaaagacc 1260
cgtgtcactg ctggcctctc catggacatc tcccacctgg ctgaaaacaa atggcgccgt 1320
gaatggcaag ggcagcctga aaggccaacc tggcgacatc taccaccaga cctgggccag 1380
atacttcgtg aagttcctgg acgcctatgc cgagcacaag ctgcagtttt gggccgtgac 1440
agccgagaac gaaccttctg ctggactgct gagcggctac ccctttcagt gcctgggctt 1500
tacacccgag caccagcggg actttatcgc ccgtgatctg ggacccacac tggccaatag 1560
cacccaccat aatgtgcggc tgctgatgct ggacgaccag agactgcttc tgccccactg 1620
ggctaaagtg gtgctgacag atcctgaggc cgccaaatac gtgcacggaa tcgccgtgca 1680
ctggtatctg gactttctgg cccctgccaa ggccacactg ggagagacac acagactgtt 1740
ccccaacacc atgctgttcg ccagcgaagc ctgtgtgggc agcaagtttt gggaacagag 1800
cgtgcggctc ggcagctggg atagaggcat gcagtacagc cacagcatca tcaccaacct 1860
gctgtaccac gtcgtcggct ggaccgactg gaatctggcc ctgaatcctg aaggcggccc 1920
taactgggtc cgaaacttcg tggacagccc catcatcgtg gacatcacca aggacacctt 1980
ctacaagcag cccatgttct accacctggg acacttcagc aagttcatcc ccgagggctc 2040
tcagcgcgtt ggactggtgg cttcccagaa gaacgatctg gacgccgtgg ctctgatgca 2100
ccctgatgga tctgctgtgg tggtggtcct gaaccgcagc agcaaagatg tgcccctgac 2160
catcaaggat cccgccgtgg gattcctgga aacaatcagc cctggctact ccatccacac 2220
ctacctgtgg cgtagacagg agggcagagg aagtcttctg acatgcggag acgtggaaga 2280
gaatcccggc cctatggccg agtggctgct gagcgccagc tggcagcgcc gcgccaaggc 2340
catgaccgcc gccgccggca gcgccggccg cgccgccgtg cccctgctgc tgtgcgccct 2400
gctggccccc ggcggcgcct acgtgctgga cgacagcgac ggcctgggcc gcgagttcga 2460
cggcatcggc gccgtgagcg gcggcggcgc caccagccgc ctgctggtga actaccccga 2520
gccctaccgc agccagatcc tggactacct gttcaagccc aacttcggcg ccagcctgca 2580
catcctgaag gtggagatcg gcggcgacgg ccagaccacc gacggcaccg agcccagcca 2640
catgcactac gccctggacg agaactactt ccgcggctac gagtggtggc tgatgaagga 2700
ggccaagaag cgcaacccca acatcaccct gatcggcctg ccctggagct tccccggctg 2760
gctgggcaag ggcttcgact ggccctacgt gaacctgcag ctgaccgcct actacgtggt 2820
gacctggatc gtgggcgcca agcgctacca cgacctggac atcgactaca tcggcatctg 2880
gaacgagcgc agctacaacg ccaactacat caagatcctg cgcaagatgc tgaactacca 2940
gggcctgcag cgcgtgaaga tcatcgccag cgacaacctg tgggagagca tcagcgccag 3000
catgctgctg gacgccgagc tgttcaaggt ggtggacgtg atcggcgccc actaccccgg 3060
cacccacagc gccaaggacg ccaagctgac cggcaagaag ctgtggagca gcgaggactt 3120
cagcaccctg aacagcgaca tgggcgccgg ctgctggggc cgcatcctga accagaacta 3180
catcaacggc tacatgacca gcaccatcgc ctggaacctg gtggccagct actacgagca 3240
gctgccctac ggccgctgcg gcctgatgac cgcccaggag ccctggagcg gccactacgt 3300
ggtggagagc cccgtgtggg tgagcgccca caccacccag ttcacccagc ccggctggta 3360
ctacctgaag accgtgggcc acctggagaa gggcggcagc tacgtggccc tgaccgacgg 3420
cctgggcaac ctgaccatca tcatcgagac catgagccac aagcacagca agtgcatccg 3480
ccccttcctg ccctacttca acgtgagcca gcagttcgcc accttcgtgc tgaagggcag 3540
cttcagcgag atccccgagc tgcaggtgtg gtacaccaag ctgggcaaga ccagcgagcg 3600
cttcctgttc aagcagctgg acagcctgtg gctgctggac agcgacggca gcttcaccct 3660
gagcctgcac gaggacgagc tgttcaccct gaccaccctg accaccggcc gcaagggcag 3720
ctaccccctg ccccccaaga gccagccctt ccccagcacc tacaaggacg acttcaacgt 3780
ggactacccc ttcttcagcg aggcccccaa cttcgccgac cagaccggcg tgttcgagta 3840
cttcaccaac atcgaggacc ccggcgagca ccacttcacc ctgcgccagg tgctgaacca 3900
gcgccccatc acctgggccg ccgacgccag caacaccatc agcatcatcg gcgactacaa 3960
ctggaccaac ctgaccatca agtgcgacgt gtacatcgag acccccgaca ccggcggcgt 4020
gttcatcgcc ggccgcgtga acaagggcgg catcctgatc cgcagcgccc gcggcatctt 4080
cttctggatc ttcgccaacg gcagctaccg cgtgaccggc gacctggccg gctggatcat 4140
ctacgccctg ggccgcgtgg aggtgaccgc caagaagtgg tacaccctga ccctgaccat 4200
caagggccac ttcaccagcg gcatgctgaa cgacaagagc ctgtggaccg acatccccgt 4260
gaacttcccc aagaacggct gggccgccat cggcacccac agcttcgagt tcgcccagtt 4320
cgacaacttc ctggtggagg ccacccgctg acaattgtta attaagttta aaccctcgag 4380
gccgcaagca ataaaatatc tttattttca ttacatctgt gtgttggttt tttgtgtgga 4440
gatccacgat aacaaacagc ttttttgggg tgaacatatt gactgaattc cctgcaggtt 4500
ggccactccc tctctgcgcg ctcgctcgct cactgaggcc gcccgggcaa agcccgggcg 4560
tcgggcgacc tttggtcgcc cggcctcagt gagcgagcga gcgcgcagag agggagtggc 4620
caactccatc actaggggtt cctgcggccg ctcgtacggt ctcgaggaat tcctgcagga 4680
taacttgcca acctcattct aaaatgtata tagaagccca aaagacaata acaaaaatat 4740
tcttgtagaa caaaatggga aagaatgttc cactaaatat caagatttag agcaaagcat 4800
gagatgtgtg gggatagaca gtgaggctga taaaatagag tagagctcag aaacagaccc 4860
attgatatat gtaagtgacc tatgaaaaaa atatggcatt ttacaatggg aaaatgatgg 4920
tctttttctt ttttagaaaa acagggaaat atatttatat gtaaaaaata aaagggaacc 4980
catatgtcat accatacaca caaaaaaatt ccagtgaatt ataagtctaa atggagaagg 5040
caaaacttta aatcttttag aaaataatat agaagcatgc agaccagcct ggccaacatg 5100
atgaaaccct ctctactaat aataaaatca gtagaactac tcaggactac tttgagtggg 5160
aagtcctttt ctatgaagac ttctttggcc aaaattaggc tctaaatgca aggagatagt 5220
gcatcatgcc tggctgcact tactgataaa tgatgttatc accatcttta accaaatgca 5280
caggaacaag ttatggtact gatgtgctgg attgagaagg agctctactt ccttgacagg 5340
acacatttgt atcaacttaa aaaagcagat ttttgccagc agaactattc attcagaggt 5400
aggaaactta gaatagatga tgtcactgat tagcatggct tccccatctc cacagctgct 5460
tccccacccag gttgcccaca gttgagtttg tccagtgctc agggctgccc actctcagta 5520
agaagcccca caccagcccc tctccaaata tgttggctgt tccttccatt aaagtgaccc 5580
cactttagag cagcaagtgg atttctgttt cttacagttc aggaaggagg agtcagctgt 5640
gagaacctgg agcctgagat gcttctaagt cccactgcta ctggggtcag ggaagccaga 5700
ctccagcatc agcagtcagg agcactaagc ccttgccaac atcctgtttc tcagagaaac 5760
tgcttccatt ataatggttg tcctttttta agctatcaag ccaaacaacc agtgtctacc 5820
attattctca tcacctgaag ccaagggttc tagcaaaagt caagctgtct tgtaatggtt 5880
gatgtgcctc cagcttctgt cttcagtcac tccactctta gcctgctctg aatcaactct 5940
gaccacagtt ccctggagcc cctgccacct gctgcccctg ccaccttctc catctgcagt 6000
gctgtgcagc cttctgcact cttgcagagc taataggtgg agacttgaag gaagaggagg 6060
aaagtttctc ataatagcct tgctgcaagc tcaaatggga ggtgggcact gtgcccagga 6120
gccttggagc aaaggctgtg cccaacctct gactgcatcc aggtttggtc ttgacagaga 6180
taagaagccc tggcttttgg agccaaaatc taggtcagac ttaggcagga ttctcaaagt 6240
ttatcagcag aacatgaggc agaagaccct ttctgctcca gcttcttcag gctcaacctt 6300
catcagaata gatagaaaga gaggctgtga gggttcttaa aacagaagca aatctgactc 6360
agagaataaa caacctccta gtaaactaca gcttagacag agcatctggt ggtgagtgtg 6420
ctcagtgtcc tactcaactg tctggtatca gccctcatga ggacttctct tctttccctc 6480
atagacctcc atctctgttt tccttagcct gcagaaatct ggatggctat tcacagaatg 6540
cctgtgcttt cagagttgca ttttttctct ggtattctgg ttcaagcatt tgaaggtagg 6600
aaaggttctc caagtgcaag aaagccagcc ctgagcctca actgcctggc tagtgtggtc 6660
agtaggatgc aaaggctgtt gaatgccaca aggccaaact ttaacctgtg taccacaagc 6720
ctagcagcag aggcagctct gctcactgga actctctgtc ttctttctcc tgagcctttt 6780
cttttcctga gttttctagc tctcctcaac cttacctctg ccctacccag gacaaaccca 6840
agagccactg tttctgtgat gtcctctcca gccctaatta ggcatcatga cttcagcctg 6900
accttccatg ctcagaagca gtgctaatcc acttcagatg agctgctcta tgcaacacag 6960
gcagagccta caaacctttg caccagagcc ctccacatat cagtgtttgt tcatactcac 7020
ttcaacagca aatgtgactg ctgagattaa gattttacac aagatggtct gtaatttcac 7080
agttagtttt atcccattag gtatgaaaga attagcataa ttccccttaa acatgaatga 7140
atcttagatt ttttaataaa tagttttgga agtaaagaca gagacatcag gagcacaagg 7200
aatagcctga gaggacaaac agaacaagaa agagtctgga aatacacagg atgttcttgg 7260
cctcctcaaa gcaagtgcaa gcagatagta ccagcagccc caggctatca gagcccagtg 7320
aagagaagta ccatgaaagc cacagctcta accaccctgt tccagagtga cagacagtcc 7380
ccaagacaag ccagcctgag ccagagagag aactgcaaga gaaagtttct aatttaggtt 7440
ctgttagatt cagacaagtg caggtcatcc tctctccaca gctactcacc tctccagcct 7500
aacaaagcct gcagtccaca ctccaaccct ggtgtctcac ctcctagcct ctcccaacat 7560
cctgctctct gaccatcttc tgcatctctc atctcaccat ctcccactgt ctacagccta 7620
ctcttgcaac taccatctca ttttctgaca tcctgtctac atcttctgcc atactctgcc 7680
atctaccata ccacctctta ccatctacca caccatcttt tatctccatc cctctcagaa 7740
gcctccaagc tgaatcctgc tttatgtgtt catctcagcc cctgcatgga aagctgaccc 7800
cagaggcaga actattccca gagagcttgg ccaagaaaaa caaaactacc agcctggcca 7860
ggctcaggag tagtaagctg cagtgtctgt tgtgttctag cttcaacagc tgcaggagtt 7920
ccactctcaa atgctccaca tttctcacat cctcctgatt ctggtcacta cccatcttca 7980
aagaacagaa tatctcacat cagcatactg tgaaggacta gtcatgggtg cagctgctca 8040
gagctgcaaa gtcattctgg atggtggaga gcttacaaac atttcatgat gctccccccg 8100
ctctgatggc tggagcccaa tccctacaca gactcctgct gtatgtgttt tcctttcact 8160
ctgagccaca gccagagggc aggcattcag tctcctcttc aggctggggc tggggcactg 8220
agaactcacc caacaccttg ctctcactcc ttctgcaaaa caagaaagag ctttgtgctg 8280
cagtagccat gaagaatgaa aggaaggctt taactaaaaa atgtcagaga ttattttcaa 8340
ccccttactg tggatcacca gcaaggagga aacacaacac agagacattt tttcccctca 8400
aattatcaaa agaatcactg catttgttaa agagagcaac tgaatcagga agcagagttt 8460
tgaacatatc agaagttagg aatctgcatc agagacaaat gcagtcatgg ttgtttgctg 8520
cataccagcc ctaatcatta gaagcctcat ggacttcaaa catcattccc tctgacaaga 8580
tgctctagcc taactccatg agataaaata aatctgcctt tcagagccaa agaagagtcc 8640
accagcttct tctcagtgtg aacaagagct ccagtcaggt tagtcagtcc agtgcagtag 8700
aggagaccag tctgcatcct ctaattttca aaggcaagaa gatttgttta ccctggacac 8760
caggcacaag tgaggtcaca gagctcttag atatgcagtc ctcatgagtg aggagactaa 8820
agcgcatgcc atcaagactt cagtgtagag aaaacctcca aaaaagcctc ctcactactt 8880
ctggaatagc tcagaggccg aggcggcctc ggcctctgca taaataaaaa aaattagtca 8940
gccatggggc ggagaatggg cggaactggg cggagttagg ggcgggatgg gcggagttag 9000
gggcgggact atggttgctg actaattgag atgcatgctt tgcatacttc tgcctgctgg 9060
ggagcctggg gactttccac acctggttgc tgactaattg agatgcatgc tttgcatact 9120
tctgcctgct ggggagcctg gggactttcc acaccctaac tgacacacat tccacagctg 9180
cattaatgaa tcggccaacg cgcggggaga ggcggtttgc gtattgggcg ctcttccgct 9240
tcctcgctca ctgactcgct gcgctcggtc gttcggctgc ggcgagcggt atcagctcac 9300
tcaaaggcgg taatacggtt atccacagaa tcaggggata acgcaggaaa gaacatgtga 9360
gcaaaaggcc agcaaaaggc caggaaccgt aaaaaggccg cgttgctggc gtttttccat 9420
aggctccgcc cccctgacga gcatcacaaa aatcgacgct caagtcagag gtggcgaaac 9480
ccgacaggac tataaagata ccaggcgttt ccccctggaa gctccctcgt gcgctctcct 9540
gttccgaccc tgccgcttac cggatacctg tccgcctttc tcccttcggg aagcgtggcg 9600
ctttctcata gctcacgctg taggtatctc agttcggtgt aggtcgttcg ctccaagctg 9660
ggctgtgtgc acgaaccccc cgttcagccc gaccgctgcg ccttatccgg taactatcgt 9720
cttgagtcca acccggtaag acacgactta tcgccactgg cagcagccac tggtaacagg 9780
attagcagag cgaggtatgt aggcggtgct acagagttct tgaagtggtg gcctaactac 9840
ggctacacta gaagaacagt atttggtatc tgcgctctgc tgaagccagt taccttcgga 9900
aaaagagttg gtagctcttg atccggcaaa caaaccaccg ctggtagcgg tggttttttt 9960
gtttgcaagc agcagattac gcgcagaaaa aaaggatctc aagaagatcc tttgatcttt 10020
tctacggggt ctgacgctca gtggaacgaa aactcacgtt aagggatttt ggtcatgaga 10080
ttatcaaaaa ggatcttcac ctagatcctt ttaaattaaa aatgaagttt taaatcaatc 10140
taaagtatat atgagtaaac ttggtctgac agttaccaat gcttaatcag tgaggcacct 10200
atctcagcga tctgtctatt tcgttcatcc atagttgcct gactcctgca aaccacgttg 10260
tgtctcaaaa tctctgatgt tacattgcac aagataaaaa tatatcatca tgaacaataa 10320
aactgtctgc ttacataaac agtaatacaa ggggtgttat gagccatatt caacgggaaa 10380
cgtcttgctc gaggccgcga ttaaattcca acatggatgc tgatttatat gggtataaat 10440
gggctcgcga taatgtcggg caatcaggtg cgacaatcta tcgattgtat gggaagcccg 10500
atgcgccaga gttgtttctg aaacatggca aaggtagcgt tgccaatgat gttacagatg 10560
agatggtcag actaaactgg ctgacggaat ttatgcctct tccgaccatc aagcatttta 10620
tccgtactcc tgatgatgca tggttactca ccactgcgat ccccgggaaa acagcattcc 10680
aggtattaga agaatatcct gattcaggtg aaaatattgt tgatgcgctg gcagtgttcc 10740
tgcgccggtt gcattcgatt cctgtttgta attgtccttt taacagcgat cgcgtatttc 10800
gtctcgctca ggcgcaatca cgaatgaata acggtttggt tgatgcgagt gattttgatg 10860
acgagcgtaa tggctggcct gttgaacaag tctggaaaga aatgcataag cttttgccat 10920
tctcaccgga ttcagtcgtc actcatggtg atttctcact tgataacctt atttttgacg 10980
aggggaaatt aataggttgt attgatgttg gacgagtcgg aatcgcagac cgataccagg 11040
atcttgccat cctatggaac tgcctcggtg agttttctcc ttcattacag aaacggcttt 11100
ttcaaaaata tggtattgat aatcctgata tgaataaatt gcagtttcat ttgatgctcg 11160
atgagttttt ctaagggcgg cctgccacca tacccacgcc gaaacaagcg ctcatgagcc 11220
cgaagtggcg agcccgatct tccccatcgg tgatgtcggc gatataggcg ccagcaaccg 11280
cacctgtggc gccggtgatg agggcgcgcc aagtcgacgt ccggcagtc 11329
<210> 40
<211> 11776
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 40
ttggccactc cctctctgcg cgctcgctcg ctcactgagg ccgggcgacc aaaggtcgcc 60
cgacgcccgg gctttgcccg ggcggcctca gtgagcgagc gagcgcgcag agagggagtg 120
gccaactcca tcactagggg ttcctgctag ctctgggtat ttaagcccga gtgagcacgc 180
agggtctcca ttttgaagcg ggaggttacg cgttcgtcga ctactagtgg gtaccagagc 240
tccctaggtt ctagaaccgg tgacgtctcc catggtgaag cttggatctg agggcggagt 300
tagggcggag ccaatcagcg tgcgccgttc cgaaagttgc cttttatggc tgggcggaga 360
atgggcggtg aacgccgatg attatataag gacgcgccgg gtgtggcaca gctagttccg 420
tcgcagccgg gatttgggtc gcggttcttg tttgtggatc cctgtgatcg tcacttggta 480
agtcactgac tgtctatgcc tgggaaaggg tgggcaggag atggggcagt gcaggaaaag 540
tggcactatg aaccctgcag ccctaggaat gcatctagac aattgtacta accttcttct 600
ctttcctctc ctgacagtcc ggaaagccac catggccgag tggctgctga gcgccagctg 660
gcagcgccgc gccaaggcca tgaccgccgc cgccggcagc gccggccgcg ccgccgtgcc 720
cctgctgctg tgcgccctgc tggcccccgg cggcgcctac gtgctggacg acagcgacgg 780
cctgggccgc gagttcgacg gcatcggcgc cgtgagcggc ggcggcgcca ccagccgcct 840
gctggtgaac taccccgagc cctaccgcag ccagatcctg gactacctgt tcaagcccaa 900
cttcggcgcc agcctgcaca tcctgaaggt ggagatcggc ggcgacggcc agaccaccga 960
cggcaccgag cccagccaca tgcactacgc cctggacgag aactacttcc gcggctacga 1020
gtggtggctg atgaaggagg ccaagaagcg caaccccaac atcaccctga tcggcctgcc 1080
ctggagcttc cccggctggc tgggcaaggg cttcgactgg ccctacgtga acctgcagct 1140
gaccgcctac tacgtggtga cctggatcgt gggcgccaag cgctaccacg acctggacat 1200
cgactacatc ggcatctgga acgagcgcag ctacaacgcc aactacatca agatcctgcg 1260
caagatgctg aactaccagg gcctgcagcg cgtgaagatc atcgccagcg acaacctgtg 1320
ggagagcatc agcgccagca tgctgctgga cgccgagctg ttcaaggtgg tggacgtgat 1380
cggcgcccac taccccggca cccacagcgc caaggacgcc aagctgaccg gcaagaagct 1440
gtggagcagc gaggacttca gcaccctgaa cagcgacatg ggcgccggct gctggggccg 1500
catcctgaac cagaactaca tcaacggcta catgaccagc accatcgcct ggaacctggt 1560
ggccagctac tacgagcagc tgccctacgg ccgctgcggc ctgatgaccg cccaggagcc 1620
ctggagcggc cactacgtgg tggagagccc cgtgtgggtg agcgcccaca ccacccagtt 1680
cacccagccc ggctggtact acctgaagac cgtgggccac ctggagaagg gcggcagcta 1740
cgtggccctg accgacggcc tgggcaacct gaccatcatc atcgagacca tgagccacaa 1800
gcacagcaag tgcatccgcc ccttcctgcc ctacttcaac gtgagccagc agttcgccac 1860
cttcgtgctg aagggcagct tcagcgagat ccccgagctg caggtgtggt acaccaagct 1920
gggcaagacc agcgagcgct tcctgttcaa gcagctggac agcctgtggc tgctggacag 1980
cgacggcagc ttcaccctga gcctgcacga ggacgagctg ttcaccctga ccaccctgac 2040
caccggccgc aagggcagct accccctgcc ccccaagagc cagcccttcc ccagcaccta 2100
caaggacgac ttcaacgtgg actacccctt cttcagcgag gcccccaact tcgccgacca 2160
gaccggcgtg ttcgagtact tcaccaacat cgaggacccc ggcgagcacc acttcaccct 2220
gcgccaggtg ctgaaccagc gccccatcac ctgggccgcc gacgccagca acaccatcag 2280
catcatcggc gactacaact ggaccaacct gaccatcaag tgcgacgtgt acatcgagac 2340
ccccgacacc ggcggcgtgt tcatcgccgg ccgcgtgaac aagggcggca tcctgatccg 2400
cagcgcccgc ggcatcttct tctggatctt cgccaacggc agctaccgcg tgaccggcga 2460
cctggccggc tggatcatct acgccctggg ccgcgtggag gtgaccgcca agaagtggta 2520
caccctgacc ctgaccatca agggccactt caccagcggc atgctgaacg acaagagcct 2580
gtggaccgac atccccgtga acttccccaa gaacggctgg gccgccatcg gcacccacag 2640
cttcgagttc gcccagttcg acaacttcct ggtggaggcc acccgctgat tgtggccgaa 2700
ccgccgaact cagaggccgg ccccagaaaa cccgagcgag tagggggcgg cgcgcaggag 2760
ggaggagaac tgggggcgcg ggaggctggt gggtgtgggg ggtggagatg tagaagatgt 2820
gacgccgcgg cccggcgggt gccagattag cggacgcggt gcccgcggtt gcaacgggat 2880
cccgggcgct gcagcttggg aggcggctct ccccaggcgg cgtccgcgga gacacccatc 2940
cgtgaacccc aggtcccggg ccgccggctc gccgcgcacc aggggccggc ggacagaaga 3000
gcggccgagc ggctcgaggc tgggggaccg cgggcgcggc cgcgcgctgc cgggcgggag 3060
gctggggggc cggggccggg gccgtgcccc ggagcgggtc ggaggccggg gccggggccg 3120
ggggacggcg gctccccgcg cggctccagc ggctcgggga tcccggccgg gccccgcagg 3180
gaccatgatg gaattcagca gccccagcag agaggaatgc cccaagcctc tgagccgggt 3240
gtcaatcatg gccggatctc tgacaggact gctgctgctt caggccgtgt cttgggcttc 3300
tggcgctaga ccttgcatcc ccaagagctt cggctacagc agcgtcgtgt gcgtgtgcaa 3360
tgccacctac tgcgacagct tcgaccctcc tacctttcct gctctgggca ccttcagcag 3420
atacgagagc accagatccg gcagacggat ggaactgagc atgggaccca tccaggccaa 3480
tcacacaggc actggcctgc tgctgacact gcagcctgag cagaaattcc agaaagtgaa 3540
aggcttcggc ggagccatga cagatgccgc cgctctgaat atcctggctc tgtctccacc 3600
agctcagaac ctgctgctca agagctactt cagcgaggaa ggcatcggct acaacatcat 3660
cagagtgccc atggccagct gcgacttcag catcaggacc tacacctacg ccgacacacc 3720
cgacgatttc cagctgcaca acttcagcct gcctgaagag gacaccaagc tgaagatccc 3780
tctgatccac agagccctgc agctggcaca aagacccgtg tcactgctgg cctctccatg 3840
gacatctccc acctggctga aaacaaatgg cgccgtgaat ggcaagggca gcctgaaagg 3900
ccaacctggc gacatctacc accagacctg ggccagatac ttcgtgaagt tcctggacgc 3960
ctatgccgag cacaagctgc agttttgggc cgtgacagcc gagaacgaac cttctgctgg 4020
actgctgagc ggctacccct ttcagtgcct gggctttaca cccgagcacc agcgggactt 4080
tatcgcccgt gatctgggac ccacactggc caatagcacc caccataatg tgcggctgct 4140
gatgctggac gaccagagac tgcttctgcc ccactgggct aaagtggtgc tgacagatcc 4200
tgaggccgcc aaatacgtgc acggaatcgc cgtgcactgg tatctggact ttctggcccc 4260
tgccaaggcc acactgggag agacacacag actgttcccc aacaccatgc tgttcgccag 4320
cgaagcctgt gtgggcagca agttttggga acagagcgtg cggctcggca gctgggatag 4380
aggcatgcag tacagccaca gcatcatcac caacctgctg taccacgtcg tcggctggac 4440
cgactggaat ctggccctga atcctgaagg cggccctaac tgggtccgaa acttcgtgga 4500
cagccccatc atcgtggaca tcaccaagga caccttctac aagcagccca tgttctacca 4560
cctgggacac ttcagcaagt tcatccccga gggctctcag cgcgttggac tggtggcttc 4620
ccagaagaac gatctggacg ccgtggctct gatgcaccct gatggatctg ctgtggtggt 4680
ggtcctgaac cgcagcagca aagatgtgcc cctgaccatc aaggatcccg ccgtgggatt 4740
cctggaaaca atcagccctg gctactccat ccacacctac ctgtggcgta gacagtgaca 4800
attgttaatt aagtttaaac cctcgaggcc gcaagcaata aaatatcttt attttcatta 4860
catctgtgtg ttggtttttt gtgtggagat ccacgataac aaacagcttt tttggggtga 4920
acatattgac tgaattccct gcaggttggc cactccctct ctgcgcgctc gctcgctcac 4980
tgaggccgcc cgggcaaagc ccgggcgtcg ggcgaccttt ggtcgcccgg cctcagtgag 5040
cgagcgagcg cgcagagagg gagtggccaa ctccatcact aggggttcct gcggccgctc 5100
gtacggtctc gaggaattcc tgcaggataa cttgccaacc tcattctaaa atgtatatag 5160
aagcccaaaa gacaataaca aaaatattct tgtagaacaa aatgggaaag aatgttccac 5220
taaatatcaa gatttagagc aaagcatgag atgtgtgggg atagacagtg aggctgataa 5280
aatagagtag agctcagaaa cagacccatt gatatatgta agtgacctat gaaaaaaata 5340
tggcatttta caatgggaaa atgatggtct ttttcttttt tagaaaaaca gggaaatata 5400
tttatatgta aaaaataaaa gggaacccat atgtcatacc atacacacaa aaaaattcca 5460
gtgaattata agtctaaatg gagaaggcaa aactttaaat cttttagaaa ataatataga 5520
agcatgcaga ccagcctggc caacatgatg aaaccctctc tactaataat aaaatcagta 5580
gaactactca ggactacttt gagtgggaag tccttttcta tgaagacttc tttggccaaa 5640
attaggctct aaatgcaagg agatagtgca tcatgcctgg ctgcacttac tgataaatga 5700
tgttatcacc atctttaacc aaatgcacag gaacaagtta tggtactgat gtgctggatt 5760
gagaaggagc tctacttcct tgacaggaca catttgtatc aacttaaaaa agcagatttt 5820
tgccagcaga actattcatt cagaggtagg aaacttagaa tagatgatgt cactgattag 5880
catggcttcc ccatctccac agctgcttcc cacccaggtt gcccacagtt gagtttgtcc 5940
agtgctcagg gctgcccact ctcagtaaga agccccacac cagcccctct ccaaatatgt 6000
tggctgttcc ttccattaaa gtgaccccac tttagagcag caagtggatt tctgtttctt 6060
acagttcagg aaggaggagt cagctgtgag aacctggagc ctgagatgct tctaagtccc 6120
actgctactg gggtcaggga agccagactc cagcatcagc agtcaggagc actaagccct 6180
tgccaacatc ctgtttctca gagaaactgc ttccattata atggttgtcc ttttttaagc 6240
tatcaagcca aacaaccagt gtctaccatt attctcatca cctgaagcca agggttctag 6300
caaaagtcaa gctgtcttgt aatggttgat gtgcctccag cttctgtctt cagtcactcc 6360
actcttagcc tgctctgaat caactctgac cacagttccc tggagcccct gccacctgct 6420
gcccctgcca ccttctccat ctgcagtgct gtgcagcctt ctgcactctt gcagagctaa 6480
taggtggaga cttgaaggaa gaggaggaaa gtttctcata atagccttgc tgcaagctca 6540
aatgggaggt gggcactgtg cccaggagcc ttggagcaaa ggctgtgccc aacctctgac 6600
tgcatccagg tttggtcttg acagagataa gaagccctgg cttttggagc caaaatctag 6660
gtcagactta ggcaggattc tcaaagttta tcagcagaac atgaggcaga agaccctttc 6720
tgctccagct tcttcaggct caaccttcat cagaatagat agaaagagag gctgtgaggg 6780
ttcttaaaac agaagcaaat ctgactcaga gaataaacaa cctcctagta aactacagct 6840
tagacagagc atctggtggt gagtgtgctc agtgtcctac tcaactgtct ggtatcagcc 6900
ctcatgagga cttctcttct ttccctcata gacctccatc tctgttttcc ttagcctgca 6960
gaaatctgga tggctattca cagaatgcct gtgctttcag agttgcattt tttctctggt 7020
attctggttc aagcatttga aggtaggaaa ggttctccaa gtgcaagaaa gccagccctg 7080
agcctcaact gcctggctag tgtggtcagt aggatgcaaa ggctgttgaa tgccacaagg 7140
ccaaacttta acctgtgtac cacaagccta gcagcagagg cagctctgct cactggaact 7200
ctctgtcttc tttctcctga gccttttctt ttcctgagtt ttctagctct cctcaacctt 7260
acctctgccc tacccaggac aaacccaaga gccactgttt ctgtgatgtc ctctccagcc 7320
ctaattaggc atcatgactt cagcctgacc ttccatgctc agaagcagtg ctaatccact 7380
tcagatgagc tgctctatgc aacacaggca gagcctacaa acctttgcac cagagccctc 7440
cacatatcag tgtttgttca tactcacttc aacagcaaat gtgactgctg agattaagat 7500
tttacacaag atggtctgta atttcacagt tagttttatc ccattaggta tgaaagaatt 7560
agcataattc cccttaaaca tgaatgaatc ttagattttt taataaatag ttttggaagt 7620
aaagacagag acatcaggag cacaaggaat agcctgagag gacaaacaga acaagaaaga 7680
gtctggaaat acacaggatg ttcttggcct cctcaaagca agtgcaagca gatagtacca 7740
gcagccccag gctatcagag cccagtgaag agaagtacca tgaaagccac agctctaacc 7800
accctgttcc agagtgacag acagtcccca agacaagcca gcctgagcca gagagagaac 7860
tgcaagagaa agtttctaat ttaggttctg ttagattcag acaagtgcag gtcatcctct 7920
ctccacagct actcacctct ccagcctaac aaagcctgca gtccacactc caaccctggt 7980
gtctcacctc ctagcctctc ccaacatcct gctctctgac catcttctgc atctctcatc 8040
tcaccatctc ccactgtcta cagcctactc ttgcaactac catctcattt tctgacatcc 8100
tgtctacatc ttctgccata ctctgccatc taccatacca cctcttacca tctaccacac 8160
catcttttat ctccatccct ctcagaagcc tccaagctga atcctgcttt atgtgttcat 8220
ctcagcccct gcatggaaag ctgaccccag aggcagaact attcccagag agcttggcca 8280
agaaaaacaa aactaccagc ctggccaggc tcaggagtag taagctgcag tgtctgttgt 8340
gttctagctt caacagctgc aggagttcca ctctcaaatg ctccacattt ctcacatcct 8400
cctgattctg gtcactaccc atcttcaaag aacagaatat ctcacatcag catactgtga 8460
aggactagtc atgggtgcag ctgctcagag ctgcaaagtc attctggatg gtggagagct 8520
tacaaacatt tcatgatgct ccccccgctc tgatggctgg agcccaatcc ctacacagac 8580
tcctgctgta tgtgttttcc tttcactctg agccacagcc agagggcagg cattcagtct 8640
cctcttcagg ctggggctgg ggcactgaga actcacccaa caccttgctc tcactccttc 8700
tgcaaaacaa gaaagagctt tgtgctgcag tagccatgaa gaatgaaagg aaggctttaa 8760
ctaaaaaatg tcagagatta ttttcaaccc cttactgtgg atcaccagca aggaggaaac 8820
acaacacaga gacatttttt cccctcaaat tatcaaaaga atcactgcat ttgttaaaga 8880
gagcaactga atcaggaagc agagttttga acatatcaga agttaggaat ctgcatcaga 8940
gacaaatgca gtcatggttg tttgctgcat accagcccta atcattagaa gcctcatgga 9000
cttcaaacat cattccctct gacaagatgc tctagcctaa ctccatgaga taaaataaat 9060
ctgcctttca gagccaaaga agagtccacc agcttcttct cagtgtgaac aagagctcca 9120
gtcaggttag tcagtccagt gcagtagagg agaccagtct gcatcctcta attttcaaag 9180
gcaagaagat ttgtttaccc tggacaccag gcacaagtga ggtcacagag ctcttagata 9240
tgcagtcctc atgagtgagg agactaaagc gcatgccatc aagacttcag tgtagagaaa 9300
acctccaaaa aagcctcctc actacttctg gaatagctca gaggccgagg cggcctcggc 9360
ctctgcataa ataaaaaaaa ttagtcagcc atggggcgga gaatgggcgg aactgggcgg 9420
agttaggggc gggatgggcg gagttagggg cgggactatg gttgctgact aattgagatg 9480
catgctttgc atacttctgc ctgctgggga gcctggggac tttccacacc tggttgctga 9540
ctaattgaga tgcatgcttt gcatacttct gcctgctggg gagcctgggg actttccaca 9600
ccctaactga cacacattcc acagctgcat taatgaatcg gccaacgcgc ggggagaggc 9660
ggtttgcgta ttgggcgctc ttccgcttcc tcgctcactg actcgctgcg ctcggtcgtt 9720
cggctgcggc gagcggtatc agctcactca aaggcggtaa tacggttatc cacagaatca 9780
ggggataacg caggaaagaa catgtgagca aaaggccagc aaaaggccag gaaccgtaaa 9840
aaggccgcgt tgctggcgtt tttccatagg ctccgccccc ctgacgagca tcacaaaaat 9900
cgacgctcaa gtcagaggtg gcgaaacccg acaggactat aaagatacca ggcgtttccc 9960
cctggaagct ccctcgtgcg ctctcctgtt ccgaccctgc cgcttaccgg atacctgtcc 10020
gcctttctcc cttcgggaag cgtggcgctt tctcatagct cacgctgtag gtatctcagt 10080
tcggtgtagg tcgttcgctc caagctgggc tgtgtgcacg aaccccccgt tcagcccgac 10140
cgctgcgcct tatccggtaa ctatcgtctt gagtccaacc cggtaagaca cgacttatcg 10200
ccactggcag cagccactgg taacaggatt agcagagcga ggtatgtagg cggtgctaca 10260
gagttcttga agtggtggcc taactacggc tacactagaa gaacagtatt tggtatctgc 10320
gctctgctga agccagttac cttcggaaaa agagttggta gctcttgatc cggcaaacaa 10380
accaccgctg gtagcggtgg tttttttgtt tgcaagcagc agattacgcg cagaaaaaaa 10440
ggatctcaag aagatccttt gatcttttct acggggtctg acgctcagtg gaacgaaaac 10500
tcacgttaag ggattttggt catgagatta tcaaaaagga tcttcaccta gatcctttta 10560
aattaaaaat gaagttttaa atcaatctaa agtatatatg agtaaacttg gtctgacagt 10620
taccaatgct taatcagtga ggcacctatc tcagcgatct gtctatttcg ttcatccata 10680
gttgcctgac tcctgcaaac cacgttgtgt ctcaaaatct ctgatgttac attgcacaag 10740
ataaaaatat atcatcatga acaataaaac tgtctgctta cataaacagt aatacaaggg 10800
gtgttatgag ccatattcaa cgggaaacgt cttgctcgag gccgcgatta aattccaaca 10860
tggatgctga tttatatggg tataaatggg ctcgcgataa tgtcgggcaa tcaggtgcga 10920
caatctatcg attgtatggg aagcccgatg cgccagagtt gtttctgaaa catggcaaag 10980
gtagcgttgc caatgatgtt acagatgaga tggtcagact aaactggctg acggaattta 11040
tgcctcttcc gaccatcaag cattttatcc gtactcctga tgatgcatgg ttactcacca 11100
ctgcgatccc cgggaaaaca gcattccagg tattagaaga atatcctgat tcaggtgaaa 11160
atattgttga tgcgctggca gtgttcctgc gccggttgca ttcgattcct gtttgtaatt 11220
gtccttttaa cagcgatcgc gtatttcgtc tcgctcaggc gcaatcacga atgaataacg 11280
gtttggttga tgcgagtgat tttgatgacg agcgtaatgg ctggcctgtt gaacaagtct 11340
ggaaagaaat gcataagctt ttgccattct caccggattc agtcgtcact catggtgatt 11400
tctcacttga taaccttatt tttgacgagg ggaaattaat aggttgtatt gatgttggac 11460
gagtcggaat cgcagaccga taccaggatc ttgccatcct atggaactgc ctcggtgagt 11520
tttctccttc attacagaaa cggctttttc aaaaatatgg tattgataat cctgatatga 11580
ataaattgca gtttcatttg atgctcgatg agtttttcta agggcggcct gccaccatac 11640
ccacgccgaa acaagcgctc atgagcccga agtggcgagc ccgatcttcc ccatcggtga 11700
tgtcggcgat ataggcgcca gcaaccgcac ctgtggcgcc ggtgatgagg gcgcgccaag 11760
tcgacgtccg gcagtc 11776
<210> 41
<211> 11348
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 41
ttggccactc cctctctgcg cgctcgctcg ctcactgagg ccgggcgacc aaaggtcgcc 60
cgacgcccgg gctttgcccg ggcggcctca gtgagcgagc gagcgcgcag agagggagtg 120
gccaactcca tcactagggg ttcctgctag ctctgggtat ttaagcccga gtgagcacgc 180
agggtctcca ttttgaagcg ggaggttacg cgttcgtcga ctactagtgg gtaccagagc 240
tccctaggtt ctagaaccgg tgacgtctcc catggtgaag cttggatctg agggcggagt 300
tagggcggag ccaatcagcg tgcgccgttc cgaaagttgc cttttatggc tgggcggaga 360
atgggcggtg aacgccgatg attatataag gacgcgccgg gtgtggcaca gctagttccg 420
tcgcagccgg gatttgggtc gcggttcttg tttgtggatc cctgtgatcg tcacttggta 480
agtcactgac tgtctatgcc tgggaaaggg tgggcaggag atggggcagt gcaggaaaag 540
tggcactatg aaccctcctg gtggcgaggg gaggggggtg gtcctcgaac gccttgcaga 600
actggcctgg atacagagtg gaccggctgg ccccatctgg aagacttcga gatacactgt 660
tgtcttactg cgctcaacag tgtatctcga agtcttccaa atggtgccag ccatcgcagc 720
ggggtgcagg aaatgggggc agcccccctt tttggctatc cttccacgtg ttcttttttg 780
tatcttttgt gtttcctaga aaacatctca gtcaccaccg cagccctagg aatgcatcta 840
gacaattgta ctaaccttct tctctttcct ctcctgacag tccggaaagc caccatgtgg 900
cagctgtggg ccagcctgtg ctgcctgctg gtgctggcca acgcccgcag ccgccccagc 960
ttccaccccc tgagcgacga gctggtgaac tacgtgaaca agcgcaacac cacctggcag 1020
gccggccaca acttctacaa cgtggacatg agctacctga agcgcctgtg cggcaccttc 1080
ctgggcggcc ccaagccccc ccagcgcgtg atgttcaccg aggacctgaa gctgcccgcc 1140
agcttcgacg cccgcgagca gtggccccag tgccccacca tcaaggagat ccgcgaccag 1200
ggcagctgcg gcagctgctg ggccttcggc gccgtggagg ccatcagcga ccgcatctgc 1260
atccacacca acgcccacgt gagcgtggag gtgagcgccg aggacctgct gacctgctgc 1320
ggcagcatgt gcggcgacgg ctgcaacggc ggctaccccg ccgaggcctg gaacttctgg 1380
acccgcaagg gcctggtgag cggcggcctg tacgagagcc acgtgggctg ccgcccctac 1440
agcatccccc cctgcgagca ccacgtgaac ggcagccgcc ccccctgcac cggcgagggc 1500
gacaccccca agtgcagcaa gatctgcgag cccggctaca gccccaccta caagcaggac 1560
aagcactacg gctacaacag ctacagcgtg agcaacagcg agaaggacat catggccgag 1620
atctacaaga acggccccgt ggagggcgcc ttcagcgtgt acagcgactt cctgctgtac 1680
aagagcggcg tgtaccagca cgtgaccggc gagatgatgg gcggccacgc catccgcatc 1740
ctgggctggg gcgtggagaa cggcaccccc tactggctgg tggccaacag ctggaacacc 1800
gactggggcg acaacggctt cttcaagatc ctgcgcggcc aggaccactg cggcatcgag 1860
agcgaggtgg tggccggcat cccccgcacc gaccagtact gggagaagat cgagggcaga 1920
ggaagtcttc tgacatgcgg agacgtggaa gagaatcccg gccctatgga attcagcagc 1980
cccagcagag aggaatgccc caagcctctg agccgggtgt caatcatggc cggatctctg 2040
acaggactgc tgctgcttca ggccgtgtct tgggcttctg gcgctagacc ttgcatcccc 2100
aagagcttcg gctacagcag cgtcgtgtgc gtgtgcaatg ccacctactg cgacagcttc 2160
gaccctccta cctttcctgc tctgggcacc ttcagcagat acgagagcac cagatccggc 2220
agacggatgg aactgagcat gggacccatc caggccaatc acacaggcac tggcctgctg 2280
ctgacactgc agcctgagca gaaattccag aaagtgaaag gcttcggcgg agccatgaca 2340
gatgccgccg ctctgaatat cctggctctg tctccaccag ctcagaacct gctgctcaag 2400
agctacttca gcgaggaagg catcggctac aacatcatca gagtgcccat ggccagctgc 2460
gacttcagca tcaggaccta cacctacgcc gacacacccg acgatttcca gctgcacaac 2520
ttcagcctgc ctgaagagga caccaagctg aagatccctc tgatccacag agccctgcag 2580
ctggcacaaa gacccgtgtc actgctggcc tctccatgga catctcccac ctggctgaaa 2640
acaaatggcg ccgtgaatgg caagggcagc ctgaaaggcc aacctggcga catctaccac 2700
cagacctggg ccagatactt cgtgaagttc ctggacgcct atgccgagca caagctgcag 2760
ttttgggccg tgacagccga gaacgaacct tctgctggac tgctgagcgg ctaccccttt 2820
cagtgcctgg gctttacacc cgagcaccag cgggacttta tcgcccgtga tctgggaccc 2880
acactggcca atagcaccca ccataatgtg cggctgctga tgctggacga ccagagactg 2940
cttctgcccc actgggctaa agtggtgctg acagatcctg aggccgccaa atacgtgcac 3000
ggaatcgccg tgcactggta tctggacttt ctggcccctg ccaaggccac actgggagag 3060
acacacagac tgttccccaa caccatgctg ttcgccagcg aagcctgtgt gggcagcaag 3120
ttttgggaac agagcgtgcg gctcggcagc tgggatagag gcatgcagta cagccacagc 3180
atcatcacca acctgctgta ccacgtcgtc ggctggaccg actggaatct ggccctgaat 3240
cctgaaggcg gccctaactg ggtccgaaac ttcgtggaca gccccatcat cgtggacatc 3300
accaaggaca ccttctacaa gcagcccatg ttctaccacc tgggacactt cagcaagttc 3360
atccccgagg gctctcagcg cgttggactg gtggcttccc agaagaacga tctggacgcc 3420
gtggctctga tgcaccctga tggatctgct gtggtggtgg tcctgaaccg cagcagcaaa 3480
gatgtgcccc tgaccatcaa ggatcccgcc gtgggattcc tggaaacaat cagccctggc 3540
tactccatcc acacctacct gtggcgtaga cagtgacaat tgttaattaa gtttaaaccc 3600
tcgaggccgc aagcttatcg ataatcaacc tctggattac aaaatttgtg aaagattgac 3660
tggtattctt aactatgttg ctccttttac gctatgtgga tacgctgctt taatgccttt 3720
gtatcatgct attgcttccc gtatggcttt cattttctcc tccttgtata aatcctggtt 3780
gctgtctctt tatgaggagt tgtggcccgt tgtcaggcaa cgtggcgtgg tgtgcactgt 3840
gtttgctgac gcaaccccca ctggttgggg cattgccacc acctgtcagc tcctttccgg 3900
gactttcgct ttccccctcc ctattgccac ggcggaactc atcgccgcct gccttgcccg 3960
ctgctggaca ggggctcggc tgttgggcac tgacaattcc gtggtgttgt cggggaaatc 4020
atcgtccttt ccttggctgc tcgcctgtgt tgccacctgg attctgcgcg ggacgtcctt 4080
ctgctacgtc ccttcggccc tcaatccagc ggaccttcct tcccgcggcc tgctgccggc 4140
tctgcggcct cttccgcgtc ttcgccttcg ccctcagacg agtcggatct ccctttgggc 4200
cgcctccccg catcgatacc gtcgactaga gctcgctgat cagcctcgac tgtgccttct 4260
agttgccagc catctgttgt ttgcccctcc cccgtgcctt ccttgaccct ggaaggtgcc 4320
actcccactg tcctttccta ataaaatgag gaaattgcat cgcattgtct gagtaggtgt 4380
cattctattc tggggggtgg ggtggggcag gacagcaagg gggaggattg ggaagacaat 4440
agcaggcatg ctggggagag atccacgata acaaacagct tttttggggt gaacatattg 4500
actgaattcc ctgcaggttg gccactccct ctctgcgcgc tcgctcgctc actgaggccg 4560
cccgggcaaa gcccgggcgt cgggcgacct ttggtcgccc ggcctcagtg agcgagcgag 4620
cgcgcagaga gggagtggcc aactccatca ctaggggttc ctgcggccgc tcgtacggtc 4680
tcgaggaatt cctgcaggat aacttgccaa cctcattcta aaatgtatat agaagcccaa 4740
aagacaataa caaaaatatt cttgtagaac aaaatgggaa agaatgttcc actaaatatc 4800
aagatttaga gcaaagcatg agatgtgtgg ggatagacag tgaggctgat aaaatagagt 4860
agagctcaga aacagaccca ttgatatatg taagtgacct atgaaaaaaa tatggcattt 4920
tacaatggga aaatgatggt ctttttcttt tttagaaaaa cagggaaata tatttatatg 4980
taaaaaataa aagggaaccc atatgtcata ccatacacac aaaaaaattc cagtgaatta 5040
taagtctaaa tggagaaggc aaaactttaa atcttttaga aaataatata gaagcatgca 5100
gaccagcctg gccaacatga tgaaaccctc tctactaata ataaaatcag tagaactact 5160
caggactact ttgagtggga agtccttttc tatgaagact tctttggcca aaattaggct 5220
ctaaatgcaa ggagatagtg catcatgcct ggctgcactt actgataaat gatgttatca 5280
ccatctttaa ccaaatgcac aggaacaagt tatggtactg atgtgctgga ttgagaagga 5340
gctctacttc cttgacagga cacatttgta tcaacttaaa aaagcagatt tttgccagca 5400
gaactattca ttcagaggta ggaaacttag aatagatgat gtcactgatt agcatggctt 5460
ccccatctcc acagctgctt cccacccagg ttgcccacag ttgagtttgt ccagtgctca 5520
gggctgccca ctctcagtaa gaagccccac accagcccct ctccaaatat gttggctgtt 5580
ccttccatta aagtgacccc actttagagc agcaagtgga tttctgtttc ttacagttca 5640
ggaaggagga gtcagctgtg agaacctgga gcctgagatg cttctaagtc ccactgctac 5700
tggggtcagg gaagccagac tccagcatca gcagtcagga gcactaagcc cttgccaaca 5760
tcctgtttct cagagaaact gcttccatta taatggttgt ccttttttaa gctatcaagc 5820
caaacaacca gtgtctacca ttattctcat cacctgaagc caagggttct agcaaaagtc 5880
aagctgtctt gtaatggttg atgtgcctcc agcttctgtc ttcagtcact ccactcttag 5940
cctgctctga atcaactctg accacagttc cctggagccc ctgccacctg ctgcccctgc 6000
caccttctcc atctgcagtg ctgtgcagcc ttctgcactc ttgcagagct aataggtgga 6060
gacttgaagg aagaggagga aagtttctca taatagcctt gctgcaagct caaatgggag 6120
gtgggcactg tgcccaggag ccttggagca aaggctgtgc ccaacctctg actgcatcca 6180
ggtttggtct tgacagagat aagaagccct ggcttttgga gccaaaatct aggtcagact 6240
taggcaggat tctcaaagtt tatcagcaga acatgaggca gaagaccctt tctgctccag 6300
cttcttcagg ctcaaccttc atcagaatag atagaaagag aggctgtgag ggttcttaaa 6360
acagaagcaa atctgactca gagaataaac aacctcctag taaactacag cttagacaga 6420
gcatctggtg gtgagtgtgc tcagtgtcct actcaactgt ctggtatcag ccctcatgag 6480
gacttctctt ctttccctca tagacctcca tctctgtttt ccttagcctg cagaaatctg 6540
gatggctatt cacagaatgc ctgtgctttc agagttgcat tttttctctg gtattctggt 6600
tcaagcattt gaaggtagga aaggttctcc aagtgcaaga aagccagccc tgagcctcaa 6660
ctgcctggct agtgtggtca gtaggatgca aaggctgttg aatgccacaa ggccaaactt 6720
taacctgtgt accacaagcc tagcagcaga ggcagctctg ctcactggaa ctctctgtct 6780
tctttctcct gagccttttc ttttcctgag ttttctagct ctcctcaacc ttacctctgc 6840
cctacccagg acaaacccaa gagccactgt ttctgtgatg tcctctccag ccctaattag 6900
gcatcatgac ttcagcctga ccttccatgc tcagaagcag tgctaatcca cttcagatga 6960
gctgctctat gcaacacagg cagagcctac aaacctttgc accagagccc tccacatatc 7020
agtgtttgtt catactcact tcaacagcaa atgtgactgc tgagattaag attttacaca 7080
agatggtctg taatttcaca gttagtttta tcccattagg tatgaaagaa ttagcataat 7140
tccccttaaa catgaatgaa tcttagattt tttaataaat agttttggaa gtaaagacag 7200
agacatcagg agcacaagga atagcctgag aggacaaaca gaacaagaaa gagtctggaa 7260
atacacagga tgttcttggc ctcctcaaag caagtgcaag cagatagtac cagcagcccc 7320
aggctatcag agcccagtga agagaagtac catgaaagcc acagctctaa ccaccctgtt 7380
ccagagtgac agacagtccc caagacaagc cagcctgagc cagagagaga actgcaagag 7440
aaagtttcta atttaggttc tgttagattc agacaagtgc aggtcatcct ctctccacag 7500
ctactcacct ctccagccta acaaagcctg cagtccacac tccaaccctg gtgtctcacc 7560
tcctagcctc tcccaacatc ctgctctctg accatcttct gcatctctca tctcaccatc 7620
tcccactgtc tacagcctac tcttgcaact accatctcat tttctgacat cctgtctaca 7680
tcttctgcca tactctgcca tctaccatac cacctcttac catctaccac accatctttt 7740
atctccatcc ctctcagaag cctccaagct gaatcctgct ttatgtgttc atctcagccc 7800
ctgcatggaa agctgacccc agaggcagaa ctattcccag agagcttggc caagaaaaac 7860
aaaactacca gcctggccag gctcaggagt agtaagctgc agtgtctgtt gtgttctagc 7920
ttcaacagct gcaggagttc cactctcaaa tgctccacat ttctcacatc ctcctgattc 7980
tggtcactac ccatcttcaa agaacagaat atctcacatc agcatactgt gaaggactag 8040
tcatgggtgc agctgctcag agctgcaaag tcattctgga tggtggagag cttacaaaca 8100
tttcatgatg ctccccccgc tctgatggct ggagcccaat ccctacacag actcctgctg 8160
tatgtgtttt cctttcactc tgagccacag ccagagggca ggcattcagt ctcctcttca 8220
ggctggggct ggggcactga gaactcaccc aacaccttgc tctcactcct tctgcaaaac 8280
aagaaagagc tttgtgctgc agtagccatg aagaatgaaa ggaaggcttt aactaaaaaa 8340
tgtcagagat tattttcaac cccttactgt ggatcaccag caaggaggaa acacaacaca 8400
gagacatttt ttcccctcaa attatcaaaa gaatcactgc atttgttaaa gagagcaact 8460
gaatcaggaa gcagagtttt gaacatatca gaagttagga atctgcatca gagacaaatg 8520
cagtcatggt tgtttgctgc ataccagccc taatcattag aagcctcatg gacttcaaac 8580
atcattccct ctgacaagat gctctagcct aactccatga gataaaataa atctgccttt 8640
cagagccaaa gaagagtcca ccagcttctt ctcagtgtga acaagagctc cagtcaggtt 8700
agtcagtcca gtgcagtaga ggagaccagt ctgcatcctc taattttcaa aggcaagaag 8760
atttgtttac cctggacacc aggcacaagt gaggtcacag agctcttaga tatgcagtcc 8820
tcatgagtga ggagactaaa gcgcatgcca tcaagacttc agtgtagaga aaacctccaa 8880
aaaagcctcc tcactacttc tggaatagct cagaggccga ggcggcctcg gcctctgcat 8940
aaataaaaaa aattagtcag ccatggggcg gagaatgggc ggaactgggc ggagttaggg 9000
gcgggatggg cggagttagg ggcgggacta tggttgctga ctaattgaga tgcatgcttt 9060
gcatacttct gcctgctggg gagcctgggg actttccaca cctggttgct gactaattga 9120
gatgcatgct ttgcatactt ctgcctgctg gggagcctgg ggactttcca caccctaact 9180
gacacacatt ccacagctgc attaatgaat cggccaacgc gcggggagag gcggtttgcg 9240
tattgggcgc tcttccgctt cctcgctcac tgactcgctg cgctcggtcg ttcggctgcg 9300
gcgagcggta tcagctcact caaaggcggt aatacggtta tccacagaat caggggataa 9360
cgcaggaaag aacatgtgag caaaaggcca gcaaaaggcc aggaaccgta aaaaggccgc 9420
gttgctggcg tttttccata ggctccgccc ccctgacgag catcacaaaa atcgacgctc 9480
aagtcagagg tggcgaaacc cgacaggact ataaagatac caggcgtttc cccctggaag 9540
ctccctcgtg cgctctcctg ttccgaccct gccgcttacc ggatacctgt ccgcctttct 9600
cccttcggga agcgtggcgc tttctcatag ctcacgctgt aggtatctca gttcggtgta 9660
ggtcgttcgc tccaagctgg gctgtgtgca cgaacccccc gttcagcccg accgctgcgc 9720
cttatccggt aactatcgtc ttgagtccaa cccggtaaga cacgacttat cgccactggc 9780
agcagccact ggtaacagga ttagcagagc gaggtatgta ggcggtgcta cagagttctt 9840
gaagtggtgg cctaactacg gctacactag aagaacagta tttggtatct gcgctctgct 9900
gaagccagtt accttcggaa aaagagttgg tagctcttga tccggcaaac aaaccaccgc 9960
tggtagcggt ggtttttttg tttgcaagca gcagattacg cgcagaaaaa aaggatctca 10020
agaagatcct ttgatctttt ctacggggtc tgacgctcag tggaacgaaa actcacgtta 10080
agggattttg gtcatgagat tatcaaaaag gatcttcacc tagatccttt taaattaaaa 10140
atgaagtttt aaatcaatct aaagtatata tgagtaaact tggtctgaca gttaccaatg 10200
cttaatcagt gaggcaccta tctcagcgat ctgtctattt cgttcatcca tagttgcctg 10260
actcctgcaa accacgttgt gtctcaaaat ctctgatgtt acattgcaca agataaaaat 10320
atatcatcat gaacaataaa actgtctgct tacataaaca gtaatacaag gggtgttatg 10380
agccatattc aacgggaaac gtcttgctcg aggccgcgat taaattccaa catggatgct 10440
gatttatatg ggtataaatg ggctcgcgat aatgtcgggc aatcaggtgc gacaatctat 10500
cgattgtatg ggaagcccga tgcgccagag ttgtttctga aacatggcaa aggtagcgtt 10560
gccaatgatg ttacagatga gatggtcaga ctaaactggc tgacggaatt tatgcctctt 10620
ccgaccatca agcattttat ccgtactcct gatgatgcat ggttactcac cactgcgatc 10680
cccgggaaaa cagcattcca ggtattagaa gaatatcctg attcaggtga aaatattgtt 10740
gatgcgctgg cagtgttcct gcgccggttg cattcgattc ctgtttgtaa ttgtcctttt 10800
aacagcgatc gcgtatttcg tctcgctcag gcgcaatcac gaatgaataa cggtttggtt 10860
gatgcgagtg attttgatga cgagcgtaat ggctggcctg ttgaacaagt ctggaaagaa 10920
atgcataagc ttttgccatt ctcaccggat tcagtcgtca ctcatggtga tttctcactt 10980
gataacctta tttttgacga ggggaaatta ataggttgta ttgatgttgg acgagtcgga 11040
atcgcagacc gataccagga tcttgccatc ctatggaact gcctcggtga gttttctcct 11100
tcattacaga aacggctttt tcaaaaatat ggtattgata atcctgatat gaataaattg 11160
cagtttcatt tgatgctcga tgagtttttc taagggcggc ctgccaccat acccacgccg 11220
aaacaagcgc tcatgagccc gaagtggcga gcccgatctt ccccatcggt gatgtcggcg 11280
atataggcgc cagcaaccgc acctgtggcg ccggtgatga gggcgcgcca agtcgacgtc 11340
cggcagtc 11348
<210> 42
<211> 11433
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 42
ttggccactc cctctctgcg cgctcgctcg ctcactgagg ccgggcgacc aaaggtcgcc 60
cgacgcccgg gctttgcccg ggcggcctca gtgagcgagc gagcgcgcag agagggagtg 120
gccaactcca tcactagggg ttcctgctag ctctgggtat ttaagcccga gtgagcacgc 180
agggtctcca ttttgaagcg ggaggttacg cgttcgtcga ctactagtgg gtaccagagc 240
tccctaggtt ctagaaccgg tgacgtctcc catggtgaag cttggatctg agggcggagt 300
tagggcggag ccaatcagcg tgcgccgttc cgaaagttgc cttttatggc tgggcggaga 360
atgggcggtg aacgccgatg attatataag gacgcgccgg gtgtggcaca gctagttccg 420
tcgcagccgg gatttgggtc gcggttcttg tttgtggatc cctgtgatcg tcacttggta 480
agtcactgac tgtctatgcc tgggaaaggg tgggcaggag atggggcagt gcaggaaaag 540
tggcactatg aaccctcctg gtggcgaggg gaggggggtg gtcctcgaac gccttgcaga 600
actggcctgg atacagagtg gaccggctgg ccccatctgg aagacttcga gatacactgt 660
tgtcttactg cgctcaacag tgtatctcga agtcttccaa atggtgccag ccatcgcagc 720
ggggtgcagg aaatgggggc agcccccctt tttggctatc cttccacgtg ttcttttttg 780
tatcttttgt gtttcctaga aaacatctca gtcaccaccg cagccctagg aatgcatcta 840
gacaattgta ctaaccttct tctctttcct ctcctgacag tccggaaagc caccatggaa 900
ttcagcagcc ccagcagaga ggaatgcccc aagcctctga gccgggtgtc aatcatggcc 960
ggatctctga caggactgct gctgcttcag gccgtgtctt gggcttctgg cgctagacct 1020
tgcatcccca agagcttcgg ctacagcagc gtcgtgtgcg tgtgcaatgc cacctactgc 1080
gacagcttcg accctcctac ctttcctgct ctgggcacct tcagcagata cgagagcacc 1140
agatccggca gacggatgga actgagcatg ggacccatcc aggccaatca cacaggcact 1200
ggcctgctgc tgacactgca gcctgagcag aaattccaga aagtgaaagg cttcggcgga 1260
gccatgacag atgccgccgc tctgaatatc ctggctctgt ctccaccagc tcagaacctg 1320
ctgctcaaga gctacttcag cgaggaaggc atcggctaca acatcatcag agtgcccatg 1380
gccagctgcg acttcagcat caggacctac acctacgccg acacacccga cgatttccag 1440
ctgcacaact tcagcctgcc tgaagaggac accaagctga agatccctct gatccacaga 1500
gccctgcagc tggcacaaag acccgtgtca ctgctggcct ctccatggac atctcccacc 1560
tggctgaaaa caaatggcgc cgtgaatggc aagggcagcc tgaaaggcca acctggcgac 1620
atctaccacc agacctgggc cagatacttc gtgaagttcc tggacgccta tgccgagcac 1680
aagctgcagt tttgggccgt gacagccgag aacgaacctt ctgctggact gctgagcggc 1740
tacccctttc agtgcctggg ctttacaccc gagcaccagc gggactttat cgcccgtgat 1800
ctgggaccca cactggccaa tagcacccac cataatgtgc ggctgctgat gctggacgac 1860
cagagactgc ttctgcccca ctgggctaaa gtggtgctga cagatcctga ggccgccaaa 1920
tacgtgcacg gaatcgccgt gcactggtat ctggactttc tggcccctgc caaggccaca 1980
ctgggagaga cacacagact gttccccaac accatgctgt tcgccagcga agcctgtgtg 2040
ggcagcaagt tttgggaaca gagcgtgcgg ctcggcagct gggatagagg catgcagtac 2100
agccacagca tcatcaccaa cctgctgtac cacgtcgtcg gctggaccga ctggaatctg 2160
gccctgaatc ctgaaggcgg ccctaactgg gtccgaaact tcgtggacag ccccatcatc 2220
gtggacatca ccaaggacac cttctacaag cagcccatgt tctaccacct gggacacttc 2280
agcaagttca tccccgaggg ctctcagcgc gttggactgg tggcttccca gaagaacgat 2340
ctggacgccg tggctctgat gcaccctgat ggatctgctg tggtggtggt cctgaaccgc 2400
agcagcaaag atgtgcccct gaccatcaag gatcccgccg tgggattcct ggaaacaatc 2460
agccctggct actccatcca cacctacctg tggcgtagac aggagggcag aggaagtctt 2520
ctgacatgcg gagacgtgga agagaatccc ggccctatgc cccgctacgg cgccagcctg 2580
cgccagagct gcccccgcag cggccgcgag cagggccagg acggcaccgc cggcgccccc 2640
ggcctgctgt ggatgggcct ggtgctggcc ctggccctgg ccctggccct ggccctggcc 2700
ctgagcgaca gccgcgtgct gtgggccccc gccgaggccc accccctgag cccccagggc 2760
caccccgccc gcctgcaccg catcgtgccc cgcctgcgcg acgtgttcgg ctggggcaac 2820
ctgacctgcc ccatctgcaa gggcctgttc accgccatca acctgggcct gaagaaggag 2880
cccaacgtgg cccgcgtggg cagcgtggcc atcaagctgt gcaacctgct gaagatcgcc 2940
ccccccgccg tgtgccagag catcgtgcac ctgttcgagg acgacatggt ggaggtgtgg 3000
cgccgcagcg tgctgagccc cagcgaggcc tgcggcctgc tgctgggcag cacctgcggc 3060
cactgggaca tcttcagcag ctggaacatc agcctgccca ccgtgcccaa gccccccccc 3120
aagcccccca gcccccccgc ccccggcgcc cccgtgagcc gcatcctgtt cctgaccgac 3180
ctgcactggg accacgacta cctggagggc accgaccccg actgcgccga ccccctgtgc 3240
tgccgccgcg gcagcggcct gccccccgcc agccgccccg gcgccggcta ctggggcgag 3300
tacagcaagt gcgacctgcc cctgcgcacc ctggagagcc tgctgagcgg cctgggcccc 3360
gccggcccct tcgacatggt gtactggacc ggcgacatcc ccgcccacga cgtgtggcac 3420
cagacccgcc aggaccagct gcgcgccctg accaccgtga ccgccctggt gcgcaagttc 3480
ctgggccccg tgcccgtgta ccccgccgtg ggcaaccacg agagcacccc cgtgaacagc 3540
ttcccccccc ccttcatcga gggcaaccac agcagccgct ggctgtacga ggccatggcc 3600
aaggcctggg agccctggct gcccgccgag gccctgcgca ccctgcgcat cggcggcttc 3660
tacgccctga gcccctaccc cggcctgcgc ctgatcagcc tgaacatgaa cttctgcagc 3720
cgcgagaact tctggctgct gatcaacagc accgaccccg ccggccagct gcagtggctg 3780
gtgggcgagc tgcaggccgc cgaggaccgc ggcgacaagg tgcacatcat cggccacatc 3840
ccccccggcc actgcctgaa gagctggagc tggaactact accgcatcgt ggcccgctac 3900
gagaacaccc tggccgccca gttcttcggc cacacccacg tggacgagtt cgaggtgttc 3960
tacgacgagg agaccctgag ccgccccctg gccgtggcct tcctggcccc cagcgccacc 4020
acctacatcg gcctgaaccc cggctaccgc gtgtaccaga tcgacggcaa ctacagcggc 4080
agcagccacg tggtgctgga ccacgagacc tacatcctga acctgaccca ggccaacatc 4140
cccggcgcca tcccccactg gcagctgctg taccgcgccc gcgagaccta cggcctgccc 4200
aacaccctgc ccaccgcctg gcacaacctg gtgtaccgca tgcgcggcga catgcagctg 4260
ttccagacct tctggttcct gtaccacaag ggccaccccc ccagcgagcc ctgcggcacc 4320
ccctgccgcc tggccaccct gtgcgcccag ctgagcgccc gcgccgacag ccccgccctg 4380
tgccgccacc tgatgcccga cggcagcctg cccgaggccc agagcctgtg gccccgcccc 4440
ctgttctgct aatgacaatt gttaattaag tttaaaccct cgaggccgca agcaataaaa 4500
tatctttatt ttcattacat ctgtgtgttg gttttttgtg tggagatcca cgataacaaa 4560
cagctttttt ggggtgaaca tattgactga attccctgca ggttggccac tccctctctg 4620
cgcgctcgct cgctcactga ggccgcccgg gcaaagcccg ggcgtcgggc gacctttggt 4680
cgcccggcct cagtgagcga gcgagcgcgc agagagggag tggccaactc catcactagg 4740
ggttcctgcg gccgctcgta cggtctcgag gaattcctgc aggataactt gccaacctca 4800
ttctaaaatg tatatagaag cccaaaagac aataacaaaa atattcttgt agaacaaaat 4860
gggaaagaat gttccactaa atatcaagat ttagagcaaa gcatgagatg tgtggggata 4920
gacagtgagg ctgataaaat agagtagagc tcagaaacag acccattgat atatgtaagt 4980
gacctatgaa aaaaatatgg cattttacaa tgggaaaatg atggtctttt tcttttttag 5040
aaaaacaggg aaatatattt atatgtaaaa aataaaaggg aacccatatg tcataccata 5100
cacacaaaaa aattccagtg aattataagt ctaaatggag aaggcaaaac tttaaatctt 5160
ttagaaaata atatagaagc atgcagacca gcctggccaa catgatgaaa ccctctctac 5220
taataataaa atcagtagaa ctactcagga ctactttgag tgggaagtcc ttttctatga 5280
agacttcttt ggccaaaatt aggctctaaa tgcaaggaga tagtgcatca tgcctggctg 5340
cacttactga taaatgatgt tatcaccatc tttaaccaaa tgcacaggaa caagttatgg 5400
tactgatgtg ctggattgag aaggagctct acttccttga caggacacat ttgtatcaac 5460
ttaaaaaagc agatttttgc cagcagaact attcattcag aggtaggaaa cttagaatag 5520
atgatgtcac tgattagcat ggcttcccca tctccacagc tgcttcccac ccaggttgcc 5580
cacagttgag tttgtccagt gctcagggct gcccactctc agtaagaagc cccacaccag 5640
cccctctcca aatatgttgg ctgttccttc cattaaagtg accccacttt agagcagcaa 5700
gtggatttct gtttcttaca gttcaggaag gaggagtcag ctgtgagaac ctggagcctg 5760
agatgcttct aagtcccact gctactgggg tcagggaagc cagactccag catcagcagt 5820
caggagcact aagcccttgc caacatcctg tttctcagag aaactgcttc cattataatg 5880
gttgtccttt tttaagctat caagccaaac aaccagtgtc taccattatt ctcatcacct 5940
gaagccaagg gttctagcaa aagtcaagct gtcttgtaat ggttgatgtg cctccagctt 6000
ctgtcttcag tcactccact cttagcctgc tctgaatcaa ctctgaccac agttccctgg 6060
agcccctgcc acctgctgcc cctgccacct tctccatctg cagtgctgtg cagccttctg 6120
cactcttgca gagctaatag gtggagactt gaaggaagag gaggaaagtt tctcataata 6180
gccttgctgc aagctcaaat gggaggtggg cactgtgccc aggagccttg gagcaaaggc 6240
tgtgcccaac ctctgactgc atccaggttt ggtcttgaca gagataagaa gccctggctt 6300
ttggagccaa aatctaggtc agacttaggc aggattctca aagtttatca gcagaacatg 6360
aggcagaaga ccctttctgc tccagcttct tcaggctcaa ccttcatcag aatagataga 6420
aagagaggct gtgagggttc ttaaaacaga agcaaatctg actcagagaa taaacaacct 6480
cctagtaaac tacagcttag acagagcatc tggtggtgag tgtgctcagt gtcctactca 6540
actgtctggt atcagccctc atgaggactt ctcttctttc cctcatagac ctccatctct 6600
gttttcctta gcctgcagaa atctggatgg ctattcacag aatgcctgtg ctttcagagt 6660
tgcatttttt ctctggtatt ctggttcaag catttgaagg taggaaaggt tctccaagtg 6720
caagaaagcc agccctgagc ctcaactgcc tggctagtgt ggtcagtagg atgcaaaggc 6780
tgttgaatgc cacaaggcca aactttaacc tgtgtaccac aagcctagca gcagaggcag 6840
ctctgctcac tggaactctc tgtcttcttt ctcctgagcc ttttcttttc ctgagttttc 6900
tagctctcct caaccttacc tctgccctac ccaggacaaa cccaagagcc actgtttctg 6960
tgatgtcctc tccagcccta attaggcatc atgacttcag cctgaccttc catgctcaga 7020
agcagtgcta atccacttca gatgagctgc tctatgcaac acaggcagag cctacaaacc 7080
tttgcaccag agccctccac atatcagtgt ttgttcatac tcacttcaac agcaaatgtg 7140
actgctgaga ttaagatttt acacaagatg gtctgtaatt tcacagttag ttttatccca 7200
ttaggtatga aagaattagc ataattcccc ttaaacatga atgaatctta gattttttaa 7260
taaatagttt tggaagtaaa gacagagaca tcaggagcac aaggaatagc ctgagaggac 7320
aaacagaaca agaaagagtc tggaaataca caggatgttc ttggcctcct caaagcaagt 7380
gcaagcagat agtaccagca gccccaggct atcagagccc agtgaagaga agtaccatga 7440
aagccacagc tctaaccacc ctgttccaga gtgacagaca gtccccaaga caagccagcc 7500
tgagccagag agagaactgc aagagaaagt ttctaattta ggttctgtta gattcagaca 7560
agtgcaggtc atcctctctc cacagctact cacctctcca gcctaacaaa gcctgcagtc 7620
cacactccaa ccctggtgtc tcacctccta gcctctccca acatcctgct ctctgaccat 7680
cttctgcatc tctcatctca ccatctccca ctgtctacag cctactcttg caactaccat 7740
ctcattttct gacatcctgt ctacatcttc tgccatactc tgccatctac cataccacct 7800
cttaccatct accacaccat cttttatctc catccctctc agaagcctcc aagctgaatc 7860
ctgctttatg tgttcatctc agcccctgca tggaaagctg accccagagg cagaactatt 7920
cccagagagc ttggccaaga aaaacaaaac taccagcctg gccaggctca ggagtagtaa 7980
gctgcagtgt ctgttgtgtt ctagcttcaa cagctgcagg agttccactc tcaaatgctc 8040
cacatttctc acatcctcct gattctggtc actacccatc ttcaaagaac agaatatctc 8100
acatcagcat actgtgaagg actagtcatg ggtgcagctg ctcagagctg caaagtcatt 8160
ctggatggtg gagagcttac aaacatttca tgatgctccc cccgctctga tggctggagc 8220
ccaatcccta cacagactcc tgctgtatgt gttttccttt cactctgagc cacagccaga 8280
gggcaggcat tcagtctcct cttcaggctg gggctggggc actgagaact cacccaacac 8340
cttgctctca ctccttctgc aaaacaagaa agagctttgt gctgcagtag ccatgaagaa 8400
tgaaaggaag gctttaacta aaaaatgtca gagattattt tcaacccctt actgtggatc 8460
accagcaagg aggaaacaca acacagagac attttttccc ctcaaattat caaaagaatc 8520
actgcatttg ttaaagagag caactgaatc aggaagcaga gttttgaaca tatcagaagt 8580
taggaatctg catcagagac aaatgcagtc atggttgttt gctgcatacc agccctaatc 8640
attagaagcc tcatggactt caaacatcat tccctctgac aagatgctct agcctaactc 8700
catgagataa aataaatctg cctttcagag ccaaagaaga gtccaccagc ttcttctcag 8760
tgtgaacaag agctccagtc aggttagtca gtccagtgca gtagaggaga ccagtctgca 8820
tcctctaatt ttcaaaggca agaagatttg tttaccctgg acaccaggca caagtgaggt 8880
cacagagctc ttagatatgc agtcctcatg agtgaggaga ctaaagcgca tgccatcaag 8940
acttcagtgt agagaaaacc tccaaaaaag cctcctcact acttctggaa tagctcagag 9000
gccgaggcgg cctcggcctc tgcataaata aaaaaaatta gtcagccatg gggcggagaa 9060
tgggcggaac tgggcggagt taggggcggg atgggcggag ttaggggcgg gactatggtt 9120
gctgactaat tgagatgcat gctttgcata cttctgcctg ctggggagcc tggggacttt 9180
ccacacctgg ttgctgacta attgagatgc atgctttgca tacttctgcc tgctggggag 9240
cctggggact ttccacaccc taactgacac acattccaca gctgcattaa tgaatcggcc 9300
aacgcgcggg gagaggcggt ttgcgtattg ggcgctcttc cgcttcctcg ctcactgact 9360
cgctgcgctc ggtcgttcgg ctgcggcgag cggtatcagc tcactcaaag gcggtaatac 9420
ggttatccac agaatcaggg gataacgcag gaaagaacat gtgagcaaaa ggccagcaaa 9480
aggccaggaa ccgtaaaaag gccgcgttgc tggcgttttt ccataggctc cgcccccctg 9540
acgagcatca caaaaatcga cgctcaagtc agaggtggcg aaacccgaca ggactataaa 9600
gataccaggc gtttccccct ggaagctccc tcgtgcgctc tcctgttccg accctgccgc 9660
ttaccggata cctgtccgcc tttctccctt cgggaagcgt ggcgctttct catagctcac 9720
gctgtaggta tctcagttcg gtgtaggtcg ttcgctccaa gctgggctgt gtgcacgaac 9780
cccccgttca gcccgaccgc tgcgccttat ccggtaacta tcgtcttgag tccaacccgg 9840
taagacacga cttatcgcca ctggcagcag ccactggtaa caggattagc agagcgaggt 9900
atgtaggcgg tgctacagag ttcttgaagt ggtggcctaa ctacggctac actagaagaa 9960
cagtatttgg tatctgcgct ctgctgaagc cagttacctt cggaaaaaga gttggtagct 10020
cttgatccgg caaacaaacc accgctggta gcggtggttt ttttgtttgc aagcagcaga 10080
ttacgcgcag aaaaaaagga tctcaagaag atcctttgat cttttctacg gggtctgacg 10140
ctcagtggaa cgaaaactca cgttaaggga ttttggtcat gagattatca aaaaggatct 10200
tcacctagat ccttttaaat taaaaatgaa gttttaaatc aatctaaagt atatatgagt 10260
aaacttggtc tgacagttac caatgcttaa tcagtgaggc acctatctca gcgatctgtc 10320
tatttcgttc atccatagtt gcctgactcc tgcaaaccac gttgtgtctc aaaatctctg 10380
atgttacatt gcacaagata aaaatatatc atcatgaaca ataaaactgt ctgcttacat 10440
aaacagtaat acaaggggtg ttatgagcca tattcaacgg gaaacgtctt gctcgaggcc 10500
gcgattaaat tccaacatgg atgctgattt atatgggtat aaatgggctc gcgataatgt 10560
cgggcaatca ggtgcgacaa tctatcgatt gtatgggaag cccgatgcgc cagagttgtt 10620
tctgaaacat ggcaaaggta gcgttgccaa tgatgttaca gatgagatgg tcagactaaa 10680
ctggctgacg gaatttatgc ctcttccgac catcaagcat tttatccgta ctcctgatga 10740
tgcatggtta ctcaccactg cgatccccgg gaaaacagca ttccaggtat tagaagaata 10800
tcctgattca ggtgaaaata ttgttgatgc gctggcagtg ttcctgcgcc ggttgcattc 10860
gattcctgtt tgtaattgtc cttttaacag cgatcgcgta tttcgtctcg ctcaggcgca 10920
atcacgaatg aataacggtt tggttgatgc gagtgatttt gatgacgagc gtaatggctg 10980
gcctgttgaa caagtctgga aagaaatgca taagcttttg ccattctcac cggattcagt 11040
cgtcactcat ggtgatttct cacttgataa ccttattttt gacgagggga aattaatagg 11100
ttgtattgat gttggacgag tcggaatcgc agaccgatac caggatcttg ccatcctatg 11160
gaactgcctc ggtgagtttt ctccttcatt acagaaacgg ctttttcaaa aatatggtat 11220
tgataatcct gatatgaata aattgcagtt tcatttgatg ctcgatgagt ttttctaagg 11280
gcggcctgcc accataccca cgccgaaaca agcgctcatg agcccgaagt ggcgagcccg 11340
atcttcccca tcggtgatgt cggcgatata ggcgccagca accgcacctg tggcgccggt 11400
gatgagggcg cgccaagtcg acgtccggca gtc 11433
<210> 43
<211> 11776
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 43
ttggccactc cctctctgcg cgctcgctcg ctcactgagg ccgggcgacc aaaggtcgcc 60
cgacgcccgg gctttgcccg ggcggcctca gtgagcgagc gagcgcgcag agagggagtg 120
gccaactcca tcactagggg ttcctgctag ctctgggtat ttaagcccga gtgagcacgc 180
agggtctcca ttttgaagcg ggaggttacg cgttcgtcga ctactagtgg gtaccagagc 240
tccctaggtt ctagaaccgg tgacgtctcc catggtgaag cttggatctg agggcggagt 300
tagggcggag ccaatcagcg tgcgccgttc cgaaagttgc cttttatggc tgggcggaga 360
atgggcggtg aacgccgatg attatataag gacgcgccgg gtgtggcaca gctagttccg 420
tcgcagccgg gatttgggtc gcggttcttg tttgtggatc cctgtgatcg tcacttggta 480
agtcactgac tgtctatgcc tgggaaaggg tgggcaggag atggggcagt gcaggaaaag 540
tggcactatg aaccctgcag ccctaggaat gcatctagac aattgtacta accttcttct 600
ctttcctctc ctgacagtcc ggaaagccac catggccgag tggctgctga gcgccagctg 660
gcagcgccgc gccaaggcca tgaccgccgc cgccggcagc gccggccgcg ccgccgtgcc 720
cctgctgctg tgcgccctgc tggcccccgg cggcgcctac gtgctggacg acagcgacgg 780
cctgggccgc gagttcgacg gcatcggcgc cgtgagcggc ggcggcgcca ccagccgcct 840
gctggtgaac taccccgagc cctaccgcag ccagatcctg gactacctgt tcaagcccaa 900
cttcggcgcc agcctgcaca tcctgaaggt ggagatcggc ggcgacggcc agaccaccga 960
cggcaccgag cccagccaca tgcactacgc cctggacgag aactacttcc gcggctacga 1020
gtggtggctg atgaaggagg ccaagaagcg caaccccaac atcaccctga tcggcctgcc 1080
ctggagcttc cccggctggc tgggcaaggg cttcgactgg ccctacgtga acctgcagct 1140
gaccgcctac tacgtggtga cctggatcgt gggcgccaag cgctaccacg acctggacat 1200
cgactacatc ggcatctgga acgagcgcag ctacaacgcc aactacatca agatcctgcg 1260
caagatgctg aactaccagg gcctgcagcg cgtgaagatc atcgccagcg acaacctgtg 1320
ggagagcatc agcgccagca tgctgctgga cgccgagctg ttcaaggtgg tggacgtgat 1380
cggcgcccac taccccggca cccacagcgc caaggacgcc aagctgaccg gcaagaagct 1440
gtggagcagc gaggacttca gcaccctgaa cagcgacatg ggcgccggct gctggggccg 1500
catcctgaac cagaactaca tcaacggcta catgaccagc accatcgcct ggaacctggt 1560
ggccagctac tacgagcagc tgccctacgg ccgctgcggc ctgatgaccg cccaggagcc 1620
ctggagcggc cactacgtgg tggagagccc cgtgtgggtg agcgcccaca ccacccagtt 1680
cacccagccc ggctggtact acctgaagac cgtgggccac ctggagaagg gcggcagcta 1740
cgtggccctg accgacggcc tgggcaacct gaccatcatc atcgagacca tgagccacaa 1800
gcacagcaag tgcatccgcc ccttcctgcc ctacttcaac gtgagccagc agttcgccac 1860
cttcgtgctg aagggcagct tcagcgagat ccccgagctg caggtgtggt acaccaagct 1920
gggcaagacc agcgagcgct tcctgttcaa gcagctggac agcctgtggc tgctggacag 1980
cgacggcagc ttcaccctga gcctgcacga ggacgagctg ttcaccctga ccaccctgac 2040
caccggccgc aagggcagct accccctgcc ccccaagagc cagcccttcc ccagcaccta 2100
caaggacgac ttcaacgtgg actacccctt cttcagcgag gcccccaact tcgccgacca 2160
gaccggcgtg ttcgagtact tcaccaacat cgaggacccc ggcgagcacc acttcaccct 2220
gcgccaggtg ctgaaccagc gccccatcac ctgggccgcc gacgccagca acaccatcag 2280
catcatcggc gactacaact ggaccaacct gaccatcaag tgcgacgtgt acatcgagac 2340
ccccgacacc ggcggcgtgt tcatcgccgg ccgcgtgaac aagggcggca tcctgatccg 2400
cagcgcccgc ggcatcttct tctggatctt cgccaacggc agctaccgcg tgaccggcga 2460
cctggccggc tggatcatct acgccctggg ccgcgtggag gtgaccgcca agaagtggta 2520
caccctgacc ctgaccatca agggccactt caccagcggc atgctgaacg acaagagcct 2580
gtggaccgac atccccgtga acttccccaa gaacggctgg gccgccatcg gcacccacag 2640
cttcgagttc gcccagttcg acaacttcct ggtggaggcc acccgctgat tgtggccgaa 2700
ccgccgaact cagaggccgg ccccagaaaa cccgagcgag tagggggcgg cgcgcaggag 2760
ggaggagaac tgggggcgcg ggaggctggt gggtgtgggg ggtggagatg tagaagatgt 2820
gacgccgcgg cccggcgggt gccagattag cggacgcggt gcccgcggtt gcaacgggat 2880
cccgggcgct gcagcttggg aggcggctct ccccaggcgg cgtccgcgga gacacccatc 2940
cgtgaacccc aggtcccggg ccgccggctc gccgcgcacc aggggccggc ggacagaaga 3000
gcggccgagc ggctcgaggc tgggggaccg cgggcgcggc cgcgcgctgc cgggcgggag 3060
gctggggggc cggggccggg gccgtgcccc ggagcgggtc ggaggccggg gccggggccg 3120
ggggacggcg gctccccgcg cggctccagc ggctcgggga tcccggccgg gccccgcagg 3180
gaccatgatg gaattcagca gccccagcag agaggaatgc cccaagcctc tgagccgggt 3240
gtcaatcatg gccggatctc tgacaggact gctgctgctt caggccgtgt cttgggcttc 3300
tggcgctaga ccttgcatcc ccaagagctt cggctacagc agcgtcgtgt gcgtgtgcaa 3360
tgccacctac tgcgacagct tcgaccctcc tacctttcct gctctgggca ccttcagcag 3420
atacgagagc accagatccg gcagacggat ggaactgagc atgggaccca tccaggccaa 3480
tcacacaggc actggcctgc tgctgacact gcagcctgag cagaaattcc agaaagtgaa 3540
aggcttcggc ggagccatga cagatgccgc cgctctgaat atcctggctc tgtctccacc 3600
agctcagaac ctgctgctca agagctactt cagcgaggaa ggcatcggct acaacatcat 3660
cagagtgccc atggccagct gcgacttcag catcaggacc tacacctacg ccgacacacc 3720
cgacgatttc cagctgcaca acttcagcct gcctgaagag gacaccaagc tgaagatccc 3780
tctgatccac agagccctgc agctggcaca aagacccgtg tcactgctgg cctctccatg 3840
gacatctccc acctggctga aaacaaatgg cgccgtgaat ggcaagggca gcctgaaagg 3900
ccaacctggc gacatctacc accagacctg ggccagatac ttcgtgaagt tcctggacgc 3960
ctatgccgag cacaagctgc agttttgggc cgtgacagcc gagaacgaac cttctgctgg 4020
actgctgagc ggctacccct ttcagtgcct gggctttaca cccgagcacc agcgggactt 4080
tatcgcccgt gatctgggac ccacactggc caatagcacc caccataatg tgcggctgct 4140
gatgctggac gaccagagac tgcttctgcc ccactgggct aaagtggtgc tgacagatcc 4200
tgaggccgcc aaatacgtgc acggaatcgc cgtgcactgg tatctggact ttctggcccc 4260
tgccaaggcc acactgggag agacacacag actgttcccc aacaccatgc tgttcgccag 4320
cgaagcctgt gtgggcagca agttttggga acagagcgtg cggctcggca gctgggatag 4380
aggcatgcag tacagccaca gcatcatcac caacctgctg taccacgtcg tcggctggac 4440
cgactggaat ctggccctga atcctgaagg cggccctaac tgggtccgaa acttcgtgga 4500
cagccccatc atcgtggaca tcaccaagga caccttctac aagcagccca tgttctacca 4560
cctgggacac ttcagcaagt tcatccccga gggctctcag cgcgttggac tggtggcttc 4620
ccagaagaac gatctggacg ccgtggctct gatgcaccct gatggatctg ctgtggtggt 4680
ggtcctgaac cgcagcagca aagatgtgcc cctgaccatc aaggatcccg ccgtgggatt 4740
cctggaaaca atcagccctg gctactccat ccacacctac ctgtggcgta gacagtgaca 4800
attgttaatt aagtttaaac cctcgaggcc gcaagcaata aaatatcttt attttcatta 4860
catctgtgtg ttggtttttt gtgtggagat ccacgataac aaacagcttt tttggggtga 4920
acatattgac tgaattccct gcaggttggc cactccctct ctgcgcgctc gctcgctcac 4980
tgaggccgcc cgggcaaagc ccgggcgtcg ggcgaccttt ggtcgcccgg cctcagtgag 5040
cgagcgagcg cgcagagagg gagtggccaa ctccatcact aggggttcct gcggccgctc 5100
gtacggtctc gaggaattcc tgcaggataa cttgccaacc tcattctaaa atgtatatag 5160
aagcccaaaa gacaataaca aaaatattct tgtagaacaa aatgggaaag aatgttccac 5220
taaatatcaa gatttagagc aaagcatgag atgtgtgggg atagacagtg aggctgataa 5280
aatagagtag agctcagaaa cagacccatt gatatatgta agtgacctat gaaaaaaata 5340
tggcatttta caatgggaaa atgatggtct ttttcttttt tagaaaaaca gggaaatata 5400
tttatatgta aaaaataaaa gggaacccat atgtcatacc atacacacaa aaaaattcca 5460
gtgaattata agtctaaatg gagaaggcaa aactttaaat cttttagaaa ataatataga 5520
agcatgcaga ccagcctggc caacatgatg aaaccctctc tactaataat aaaatcagta 5580
gaactactca ggactacttt gagtgggaag tccttttcta tgaagacttc tttggccaaa 5640
attaggctct aaatgcaagg agatagtgca tcatgcctgg ctgcacttac tgataaatga 5700
tgttatcacc atctttaacc aaatgcacag gaacaagtta tggtactgat gtgctggatt 5760
gagaaggagc tctacttcct tgacaggaca catttgtatc aacttaaaaa agcagatttt 5820
tgccagcaga actattcatt cagaggtagg aaacttagaa tagatgatgt cactgattag 5880
catggcttcc ccatctccac agctgcttcc cacccaggtt gcccacagtt gagtttgtcc 5940
agtgctcagg gctgcccact ctcagtaaga agccccacac cagcccctct ccaaatatgt 6000
tggctgttcc ttccattaaa gtgaccccac tttagagcag caagtggatt tctgtttctt 6060
acagttcagg aaggaggagt cagctgtgag aacctggagc ctgagatgct tctaagtccc 6120
actgctactg gggtcaggga agccagactc cagcatcagc agtcaggagc actaagccct 6180
tgccaacatc ctgtttctca gagaaactgc ttccattata atggttgtcc ttttttaagc 6240
tatcaagcca aacaaccagt gtctaccatt attctcatca cctgaagcca agggttctag 6300
caaaagtcaa gctgtcttgt aatggttgat gtgcctccag cttctgtctt cagtcactcc 6360
actcttagcc tgctctgaat caactctgac cacagttccc tggagcccct gccacctgct 6420
gcccctgcca ccttctccat ctgcagtgct gtgcagcctt ctgcactctt gcagagctaa 6480
taggtggaga cttgaaggaa gaggaggaaa gtttctcata atagccttgc tgcaagctca 6540
aatgggaggt gggcactgtg cccaggagcc ttggagcaaa ggctgtgccc aacctctgac 6600
tgcatccagg tttggtcttg acagagataa gaagccctgg cttttggagc caaaatctag 6660
gtcagactta ggcaggattc tcaaagttta tcagcagaac atgaggcaga agaccctttc 6720
tgctccagct tcttcaggct caaccttcat cagaatagat agaaagagag gctgtgaggg 6780
ttcttaaaac agaagcaaat ctgactcaga gaataaacaa cctcctagta aactacagct 6840
tagacagagc atctggtggt gagtgtgctc agtgtcctac tcaactgtct ggtatcagcc 6900
ctcatgagga cttctcttct ttccctcata gacctccatc tctgttttcc ttagcctgca 6960
gaaatctgga tggctattca cagaatgcct gtgctttcag agttgcattt tttctctggt 7020
attctggttc aagcatttga aggtaggaaa ggttctccaa gtgcaagaaa gccagccctg 7080
agcctcaact gcctggctag tgtggtcagt aggatgcaaa ggctgttgaa tgccacaagg 7140
ccaaacttta acctgtgtac cacaagccta gcagcagagg cagctctgct cactggaact 7200
ctctgtcttc tttctcctga gccttttctt ttcctgagtt ttctagctct cctcaacctt 7260
acctctgccc tacccaggac aaacccaaga gccactgttt ctgtgatgtc ctctccagcc 7320
ctaattaggc atcatgactt cagcctgacc ttccatgctc agaagcagtg ctaatccact 7380
tcagatgagc tgctctatgc aacacaggca gagcctacaa acctttgcac cagagccctc 7440
cacatatcag tgtttgttca tactcacttc aacagcaaat gtgactgctg agattaagat 7500
tttacacaag atggtctgta atttcacagt tagttttatc ccattaggta tgaaagaatt 7560
agcataattc cccttaaaca tgaatgaatc ttagattttt taataaatag ttttggaagt 7620
aaagacagag acatcaggag cacaaggaat agcctgagag gacaaacaga acaagaaaga 7680
gtctggaaat acacaggatg ttcttggcct cctcaaagca agtgcaagca gatagtacca 7740
gcagccccag gctatcagag cccagtgaag agaagtacca tgaaagccac agctctaacc 7800
accctgttcc agagtgacag acagtcccca agacaagcca gcctgagcca gagagagaac 7860
tgcaagagaa agtttctaat ttaggttctg ttagattcag acaagtgcag gtcatcctct 7920
ctccacagct actcacctct ccagcctaac aaagcctgca gtccacactc caaccctggt 7980
gtctcacctc ctagcctctc ccaacatcct gctctctgac catcttctgc atctctcatc 8040
tcaccatctc ccactgtcta cagcctactc ttgcaactac catctcattt tctgacatcc 8100
tgtctacatc ttctgccata ctctgccatc taccatacca cctcttacca tctaccacac 8160
catcttttat ctccatccct ctcagaagcc tccaagctga atcctgcttt atgtgttcat 8220
ctcagcccct gcatggaaag ctgaccccag aggcagaact attcccagag agcttggcca 8280
agaaaaacaa aactaccagc ctggccaggc tcaggagtag taagctgcag tgtctgttgt 8340
gttctagctt caacagctgc aggagttcca ctctcaaatg ctccacattt ctcacatcct 8400
cctgattctg gtcactaccc atcttcaaag aacagaatat ctcacatcag catactgtga 8460
aggactagtc atgggtgcag ctgctcagag ctgcaaagtc attctggatg gtggagagct 8520
tacaaacatt tcatgatgct ccccccgctc tgatggctgg agcccaatcc ctacacagac 8580
tcctgctgta tgtgttttcc tttcactctg agccacagcc agagggcagg cattcagtct 8640
cctcttcagg ctggggctgg ggcactgaga actcacccaa caccttgctc tcactccttc 8700
tgcaaaacaa gaaagagctt tgtgctgcag tagccatgaa gaatgaaagg aaggctttaa 8760
ctaaaaaatg tcagagatta ttttcaaccc cttactgtgg atcaccagca aggaggaaac 8820
acaacacaga gacatttttt cccctcaaat tatcaaaaga atcactgcat ttgttaaaga 8880
gagcaactga atcaggaagc agagttttga acatatcaga agttaggaat ctgcatcaga 8940
gacaaatgca gtcatggttg tttgctgcat accagcccta atcattagaa gcctcatgga 9000
cttcaaacat cattccctct gacaagatgc tctagcctaa ctccatgaga taaaataaat 9060
ctgcctttca gagccaaaga agagtccacc agcttcttct cagtgtgaac aagagctcca 9120
gtcaggttag tcagtccagt gcagtagagg agaccagtct gcatcctcta attttcaaag 9180
gcaagaagat ttgtttaccc tggacaccag gcacaagtga ggtcacagag ctcttagata 9240
tgcagtcctc atgagtgagg agactaaagc gcatgccatc aagacttcag tgtagagaaa 9300
acctccaaaa aagcctcctc actacttctg gaatagctca gaggccgagg cggcctcggc 9360
ctctgcataa ataaaaaaaa ttagtcagcc atggggcgga gaatgggcgg aactgggcgg 9420
agttaggggc gggatgggcg gagttagggg cgggactatg gttgctgact aattgagatg 9480
catgctttgc atacttctgc ctgctgggga gcctggggac tttccacacc tggttgctga 9540
ctaattgaga tgcatgcttt gcatacttct gcctgctggg gagcctgggg actttccaca 9600
ccctaactga cacacattcc acagctgcat taatgaatcg gccaacgcgc ggggagaggc 9660
ggtttgcgta ttgggcgctc ttccgcttcc tcgctcactg actcgctgcg ctcggtcgtt 9720
cggctgcggc gagcggtatc agctcactca aaggcggtaa tacggttatc cacagaatca 9780
ggggataacg caggaaagaa catgtgagca aaaggccagc aaaaggccag gaaccgtaaa 9840
aaggccgcgt tgctggcgtt tttccatagg ctccgccccc ctgacgagca tcacaaaaat 9900
cgacgctcaa gtcagaggtg gcgaaacccg acaggactat aaagatacca ggcgtttccc 9960
cctggaagct ccctcgtgcg ctctcctgtt ccgaccctgc cgcttaccgg atacctgtcc 10020
gcctttctcc cttcgggaag cgtggcgctt tctcatagct cacgctgtag gtatctcagt 10080
tcggtgtagg tcgttcgctc caagctgggc tgtgtgcacg aaccccccgt tcagcccgac 10140
cgctgcgcct tatccggtaa ctatcgtctt gagtccaacc cggtaagaca cgacttatcg 10200
ccactggcag cagccactgg taacaggatt agcagagcga ggtatgtagg cggtgctaca 10260
gagttcttga agtggtggcc taactacggc tacactagaa gaacagtatt tggtatctgc 10320
gctctgctga agccagttac cttcggaaaa agagttggta gctcttgatc cggcaaacaa 10380
accaccgctg gtagcggtgg tttttttgtt tgcaagcagc agattacgcg cagaaaaaaa 10440
ggatctcaag aagatccttt gatcttttct acggggtctg acgctcagtg gaacgaaaac 10500
tcacgttaag ggattttggt catgagatta tcaaaaagga tcttcaccta gatcctttta 10560
aattaaaaat gaagttttaa atcaatctaa agtatatatg agtaaacttg gtctgacagt 10620
taccaatgct taatcagtga ggcacctatc tcagcgatct gtctatttcg ttcatccata 10680
gttgcctgac tcctgcaaac cacgttgtgt ctcaaaatct ctgatgttac attgcacaag 10740
ataaaaatat atcatcatga acaataaaac tgtctgctta cataaacagt aatacaaggg 10800
gtgttatgag ccatattcaa cgggaaacgt cttgctcgag gccgcgatta aattccaaca 10860
tggatgctga tttatatggg tataaatggg ctcgcgataa tgtcgggcaa tcaggtgcga 10920
caatctatcg attgtatggg aagcccgatg cgccagagtt gtttctgaaa catggcaaag 10980
gtagcgttgc caatgatgtt acagatgaga tggtcagact aaactggctg acggaattta 11040
tgcctcttcc gaccatcaag cattttatcc gtactcctga tgatgcatgg ttactcacca 11100
ctgcgatccc cgggaaaaca gcattccagg tattagaaga atatcctgat tcaggtgaaa 11160
atattgttga tgcgctggca gtgttcctgc gccggttgca ttcgattcct gtttgtaatt 11220
gtccttttaa cagcgatcgc gtatttcgtc tcgctcaggc gcaatcacga atgaataacg 11280
gtttggttga tgcgagtgat tttgatgacg agcgtaatgg ctggcctgtt gaacaagtct 11340
ggaaagaaat gcataagctt ttgccattct caccggattc agtcgtcact catggtgatt 11400
tctcacttga taaccttatt tttgacgagg ggaaattaat aggttgtatt gatgttggac 11460
gagtcggaat cgcagaccga taccaggatc ttgccatcct atggaactgc ctcggtgagt 11520
tttctccttc attacagaaa cggctttttc aaaaatatgg tattgataat cctgatatga 11580
ataaattgca gtttcatttg atgctcgatg agtttttcta agggcggcct gccaccatac 11640
ccacgccgaa acaagcgctc atgagcccga agtggcgagc ccgatcttcc ccatcggtga 11700
tgtcggcgat ataggcgcca gcaaccgcac ctgtggcgcc ggtgatgagg gcgcgccaag 11760
tcgacgtccg gcagtc 11776
<210> 44
<211> 11064
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 44
ttggccactc cctctctgcg cgctcgctcg ctcactgagg ccgggcgacc aaaggtcgcc 60
cgacgcccgg gctttgcccg ggcggcctca gtgagcgagc gagcgcgcag agagggagtg 120
gccaactcca tcactagggg ttcctgctag ctctgggtat ttaagcccga gtgagcacgc 180
agggtctcca ttttgaagcg ggaggttacg cgttcgtcga ctactagtgg gtaccagagc 240
tccctaggtt ctagaaccgg tgacgtctcc catggtgaag cttggatctg agggcggagt 300
tagggcggag ccaatcagcg tgcgccgttc cgaaagttgc cttttatggc tgggcggaga 360
atgggcggtg aacgccgatg attatataag gacgcgccgg gtgtggcaca gctagttccg 420
tcgcagccgg gatttgggtc gcggttcttg tttgtggatc cctgtgatcg tcacttggta 480
agtcactgac tgtctatgcc tgggaaaggg tgggcaggag atggggcagt gcaggaaaag 540
tggcactatg aaccctgcag ccctaggaat gcatctagac aattgtacta accttcttct 600
ctttcctctc ctgacagtcc ggaaagccac catggaattc agcagcccca gcagagagga 660
atgccccaag cctctgagcc gggtgtcaat catggccgga tctctgacag gactgctgct 720
gcttcaggcc gtgtcttggg cttctggcgc tagaccttgc atccccaaga gcttcggcta 780
cagcagcgtc gtgtgcgtgt gcaatgccac ctactgcgac agcttcgacc ctcctacctt 840
tcctgctctg ggcaccttca gcagatacga gagcaccaga tccggcagac ggatggaact 900
gagcatggga cccatccagg ccaatcacac aggcactggc ctgctgctga cactgcagcc 960
tgagcagaaa ttccagaaag tgaaaggctt cggcggagcc atgacagatg ccgccgctct 1020
gaatatcctg gctctgtctc caccagctca gaacctgctg ctcaagagct acttcagcga 1080
ggaaggcatc ggctacaaca tcatcagagt gcccatggcc agctgcgact tcagcatcag 1140
gacctacacc tacgccgaca cacccgacga tttccagctg cacaacttca gcctgcctga 1200
agaggacacc aagctgaaga tccctctgat ccacagagcc ctgcagctgg cacaaagacc 1260
cgtgtcactg ctggcctctc catggacatc tcccacctgg ctgaaaacaa atggcgccgt 1320
gaatggcaag ggcagcctga aaggccaacc tggcgacatc taccaccaga cctgggccag 1380
atacttcgtg aagttcctgg acgcctatgc cgagcacaag ctgcagtttt gggccgtgac 1440
agccgagaac gaaccttctg ctggactgct gagcggctac ccctttcagt gcctgggctt 1500
tacacccgag caccagcggg actttatcgc ccgtgatctg ggacccacac tggccaatag 1560
cacccaccat aatgtgcggc tgctgatgct ggacgaccag agactgcttc tgccccactg 1620
ggctaaagtg gtgctgacag atcctgaggc cgccaaatac gtgcacggaa tcgccgtgca 1680
ctggtatctg gactttctgg cccctgccaa ggccacactg ggagagacac acagactgtt 1740
ccccaacacc atgctgttcg ccagcgaagc ctgtgtgggc agcaagtttt gggaacagag 1800
cgtgcggctc ggcagctggg atagaggcat gcagtacagc cacagcatca tcaccaacct 1860
gctgtaccac gtcgtcggct ggaccgactg gaatctggcc ctgaatcctg aaggcggccc 1920
taactgggtc cgaaacttcg tggacagccc catcatcgtg gacatcacca aggacacctt 1980
ctacaagcag cccatgttct accacctggg acacttcagc aagttcatcc ccgagggctc 2040
tcagcgcgtt ggactggtgg cttcccagaa gaacgatctg gacgccgtgg ctctgatgca 2100
ccctgatgga tctgctgtgg tggtggtcct gaaccgcagc agcaaagatg tgcccctgac 2160
catcaaggat cccgccgtgg gattcctgga aacaatcagc cctggctact ccatccacac 2220
ctacctgtgg cgtagacagt gacaattgtt aattaagttt aaaccctcga ggccgcaagc 2280
cgcatcgata ccgtcgacta gagctcgctg atcagcctcg actgtgcctt ctagttgcca 2340
gccatctgtt gtttgcccct cccccgtgcc ttccttgacc ctggaaggtg ccactcccac 2400
tgtcctttcc taataaaatg aggaaattgc atcgcattgt ctgagtaggt gtcattctat 2460
tctggggggt ggggtggggc aggacagcaa gggggaggat tgggaagaca atagcaggca 2520
tgctggggag agatccacga taacaaacag cttttttggg ggggcggagt tagggcggag 2580
ccaatcagcg tgcgccgttc cgaaagttgc cttttatggc tgggcggaga atgggcggtg 2640
aacgccgatg attatataag gacgcgccgg gtgtggcaca gctagttccg tcgcagccgg 2700
gatttgggtc gcggttcttg tttgtggatc cctgtgatcg tcacttggta agtcactgac 2760
tgtctatgcc tgggaaaggg tgggcaggag atggggcagt gcaggaaaag tggcactatg 2820
aaccctgcag ccctaggaat gcatctagac aattgtacta accttcttct ctttcctctc 2880
ctgacagtcc ggaaagccac catgtggcag ctgtgggcca gcctgtgctg cctgctggtg 2940
ctggccaacg cccgcagccg ccccagcttc caccccctga gcgacgagct ggtgaactac 3000
gtgaacaagc gcaacaccac ctggcaggcc ggccacaact tctacaacgt ggacatgagc 3060
tacctgaagc gcctgtgcgg caccttcctg ggcggcccca agccccccca gcgcgtgatg 3120
ttcaccgagg acctgaagct gcccgccagc ttcgacgccc gcgagcagtg gccccagtgc 3180
cccaccatca aggagatccg cgaccagggc agctgcggca gctgctgggc cttcggcgcc 3240
gtggaggcca tcagcgaccg catctgcatc cacaccaacg cccacgtgag cgtggaggtg 3300
agcgccgagg acctgctgac ctgctgcggc agcatgtgcg gcgacggctg caacggcggc 3360
taccccgccg aggcctggaa cttctggacc cgcaagggcc tggtgagcgg cggcctgtac 3420
gagagccacg tgggctgccg cccctacagc atccccccct gcgagcacca cgtgaacggc 3480
agccgccccc cctgcaccgg cgagggcgac acccccaagt gcagcaagat ctgcgagccc 3540
ggctacagcc ccacctacaa gcaggacaag cactacggct acaacagcta cagcgtgagc 3600
aacagcgaga aggacatcat ggccgagatc tacaagaacg gccccgtgga gggcgccttc 3660
agcgtgtaca gcgacttcct gctgtacaag agcggcgtgt accagcacgt gaccggcgag 3720
atgatgggcg gccacgccat ccgcatcctg ggctggggcg tggagaacgg caccccctac 3780
tggctggtgg ccaacagctg gaacaccgac tggggcgaca acggcttctt caagatcctg 3840
cgcggccagg accactgcgg catcgagagc gaggtggtgg ccggcatccc ccgcaccgac 3900
cagtactggg agaagatctg acccagggga ctcagcggcc gctcgagtct agagggcccg 3960
tttaaacccg ctgatcagcc tcgaagacat gataagatac attgatgagt ttggacaaac 4020
cacaacaaga atgcagtgaa aaaaatgctt tatttgtgaa atttgtgatg ctattgcttt 4080
atttgtaacc attataagct gcaataaaca agttaacaac aacaattgca ttcattttat 4140
gtttcaggtt cagggggaga tgtgggaggt tttttaaagc aagtaaaacc tctacaaatg 4200
tggtatgaac atattgactg aattccctgc aggttggcca ctccctctct gcgcgctcgc 4260
tcgctcactg aggccgcccg ggcaaagccc gggcgtcggg cgacctttgg tcgcccggcc 4320
tcagtgagcg agcgagcgcg cagagaggga gtggccaact ccatcactag gggttcctgc 4380
ggccgctcgt acggtctcga ggaattcctg caggataact tgccaacctc attctaaaat 4440
gtatatagaa gcccaaaaga caataacaaa aatattcttg tagaacaaaa tgggaaagaa 4500
tgttccacta aatatcaaga tttagagcaa agcatgagat gtgtggggat agacagtgag 4560
gctgataaaa tagagtagag ctcagaaaca gacccattga tatatgtaag tgacctatga 4620
aaaaaatatg gcattttaca atgggaaaat gatggtcttt ttctttttta gaaaaacagg 4680
gaaatatatt tatatgtaaa aaataaaagg gaacccatat gtcataccat acacacaaaa 4740
aaattccagt gaattataag tctaaatgga gaaggcaaaa ctttaaatct tttagaaaat 4800
aatatagaag catgcagacc agcctggcca acatgatgaa accctctcta ctaataataa 4860
aatcagtaga actactcagg actactttga gtgggaagtc cttttctatg aagacttctt 4920
tggccaaaat taggctctaa atgcaaggag atagtgcatc atgcctggct gcacttactg 4980
ataaatgatg ttatcaccat ctttaaccaa atgcacagga acaagttatg gtactgatgt 5040
gctggattga gaaggagctc tacttccttg acaggacaca tttgtatcaa cttaaaaaag 5100
cagatttttg ccagcagaac tattcattca gaggtaggaa acttagaata gatgatgtca 5160
ctgattagca tggcttcccc atctccacag ctgcttccca cccaggttgc ccacagttga 5220
gtttgtccag tgctcagggc tgcccactct cagtaagaag ccccacacca gcccctctcc 5280
aaatatgttg gctgttcctt ccattaaagt gaccccactt tagagcagca agtggatttc 5340
tgtttcttac agttcaggaa ggaggagtca gctgtgagaa cctggagcct gagatgcttc 5400
taagtcccac tgctactggg gtcagggaag ccagactcca gcatcagcag tcaggagcac 5460
taagcccttg ccaacatcct gtttctcaga gaaactgctt ccattataat ggttgtcctt 5520
ttttaagcta tcaagccaaa caaccagtgt ctaccattat tctcatcacc tgaagccaag 5580
ggttctagca aaagtcaagc tgtcttgtaa tggttgatgt gcctccagct tctgtcttca 5640
gtcactccac tcttagcctg ctctgaatca actctgacca cagttccctg gagcccctgc 5700
cacctgctgc ccctgccacc ttctccatct gcagtgctgt gcagccttct gcactcttgc 5760
agagctaata ggtggagact tgaaggaaga ggaggaaagt ttctcataat agccttgctg 5820
caagctcaaa tgggaggtgg gcactgtgcc caggagcctt ggagcaaagg ctgtgcccaa 5880
cctctgactg catccaggtt tggtcttgac agagataaga agccctggct tttggagcca 5940
aaatctaggt cagacttagg caggattctc aaagtttatc agcagaacat gaggcagaag 6000
accctttctg ctccagcttc ttcaggctca accttcatca gaatagatag aaagagaggc 6060
tgtgagggtt cttaaaacag aagcaaatct gactcagaga ataaacaacc tcctagtaaa 6120
ctacagctta gacagagcat ctggtggtga gtgtgctcag tgtcctactc aactgtctgg 6180
tatcagccct catgaggact tctcttcttt ccctcataga cctccatctc tgttttcctt 6240
agcctgcaga aatctggatg gctattcaca gaatgcctgt gctttcagag ttgcattttt 6300
tctctggtat tctggttcaa gcatttgaag gtaggaaagg ttctccaagt gcaagaaagc 6360
cagccctgag cctcaactgc ctggctagtg tggtcagtag gatgcaaagg ctgttgaatg 6420
ccacaaggcc aaactttaac ctgtgtacca caagcctagc agcagaggca gctctgctca 6480
ctggaactct ctgtcttctt tctcctgagc cttttctttt cctgagtttt ctagctctcc 6540
tcaaccttac ctctgcccta cccaggacaa acccaagagc cactgtttct gtgatgtcct 6600
ctccagccct aattaggcat catgacttca gcctgacctt ccatgctcag aagcagtgct 6660
aatccacttc agatgagctg ctctatgcaa cacaggcaga gcctacaaac ctttgcacca 6720
gagccctcca catatcagtg tttgttcata ctcacttcaa cagcaaatgt gactgctgag 6780
attaagattt tacacaagat ggtctgtaat ttcacagtta gttttatccc attaggtatg 6840
aaagaattag cataattccc cttaaacatg aatgaatctt agatttttta ataaatagtt 6900
ttggaagtaa agacagagac atcaggagca caaggaatag cctgagagga caaacagaac 6960
aagaaagagt ctggaaatac acaggatgtt cttggcctcc tcaaagcaag tgcaagcaga 7020
tagtaccagc agccccaggc tatcagagcc cagtgaagag aagtaccatg aaagccacag 7080
ctctaaccac cctgttccag agtgacagac agtccccaag acaagccagc ctgagccaga 7140
gagagaactg caagagaaag tttctaattt aggttctgtt agattcagac aagtgcaggt 7200
catcctctct ccacagctac tcacctctcc agcctaacaa agcctgcagt ccacactcca 7260
accctggtgt ctcacctcct agcctctccc aacatcctgc tctctgacca tcttctgcat 7320
ctctcatctc accatctccc actgtctaca gcctactctt gcaactacca tctcattttc 7380
tgacatcctg tctacatctt ctgccatact ctgccatcta ccataccacc tcttaccatc 7440
taccacacca tcttttatct ccatccctct cagaagcctc caagctgaat cctgctttat 7500
gtgttcatct cagcccctgc atggaaagct gaccccagag gcagaactat tcccagagag 7560
cttggccaag aaaaacaaaa ctaccagcct ggccaggctc aggagtagta agctgcagtg 7620
tctgttgtgt tctagcttca acagctgcag gagttccact ctcaaatgct ccacatttct 7680
cacatcctcc tgattctggt cactacccat cttcaaagaa cagaatatct cacatcagca 7740
tactgtgaag gactagtcat gggtgcagct gctcagagct gcaaagtcat tctggatggt 7800
ggagagctta caaacatttc atgatgctcc ccccgctctg atggctggag cccaatccct 7860
acacagactc ctgctgtatg tgttttcctt tcactctgag ccacagccag agggcaggca 7920
ttcagtctcc tcttcaggct ggggctgggg cactgagaac tcacccaaca ccttgctctc 7980
actccttctg caaaacaaga aagagctttg tgctgcagta gccatgaaga atgaaaggaa 8040
ggctttaact aaaaaatgtc agagattatt ttcaacccct tactgtggat caccagcaag 8100
gaggaaacac aacacagaga cattttttcc cctcaaatta tcaaaagaat cactgcattt 8160
gttaaagaga gcaactgaat caggaagcag agttttgaac atatcagaag ttaggaatct 8220
gcatcagaga caaatgcagt catggttgtt tgctgcatac cagccctaat cattagaagc 8280
ctcatggact tcaaacatca ttccctctga caagatgctc tagcctaact ccatgagata 8340
aaataaatct gcctttcaga gccaaagaag agtccaccag cttcttctca gtgtgaacaa 8400
gagctccagt caggttagtc agtccagtgc agtagaggag accagtctgc atcctctaat 8460
tttcaaaggc aagaagattt gtttaccctg gacaccaggc acaagtgagg tcacagagct 8520
cttagatatg cagtcctcat gagtgaggag actaaagcgc atgccatcaa gacttcagtg 8580
tagagaaaac ctccaaaaaa gcctcctcac tacttctgga atagctcaga ggccgaggcg 8640
gcctcggcct ctgcataaat aaaaaaaatt agtcagccat ggggcggaga atgggcggaa 8700
ctgggcggag ttaggggcgg gatgggcgga gttaggggcg ggactatggt tgctgactaa 8760
ttgagatgca tgctttgcat acttctgcct gctggggagc ctggggactt tccacacctg 8820
gttgctgact aattgagatg catgctttgc atacttctgc ctgctgggga gcctggggac 8880
tttccacacc ctaactgaca cacattccac agctgcatta atgaatcggc caacgcgcgg 8940
ggagaggcgg tttgcgtatt gggcgctctt ccgcttcctc gctcactgac tcgctgcgct 9000
cggtcgttcg gctgcggcga gcggtatcag ctcactcaaa ggcggtaata cggttatcca 9060
cagaatcagg ggataacgca ggaaagaaca tgtgagcaaa aggccagcaa aaggccagga 9120
accgtaaaaa ggccgcgttg ctggcgtttt tccataggct ccgcccccct gacgagcatc 9180
acaaaaatcg acgctcaagt cagaggtggc gaaacccgac aggactataa agataccagg 9240
cgtttccccc tggaagctcc ctcgtgcgct ctcctgttcc gaccctgccg cttaccggat 9300
acctgtccgc ctttctccct tcgggaagcg tggcgctttc tcatagctca cgctgtaggt 9360
atctcagttc ggtgtaggtc gttcgctcca agctgggctg tgtgcacgaa ccccccgttc 9420
agcccgaccg ctgcgcctta tccggtaact atcgtcttga gtccaacccg gtaagacacg 9480
acttatcgcc actggcagca gccactggta acaggattag cagagcgagg tatgtaggcg 9540
gtgctacaga gttcttgaag tggtggccta actacggcta cactagaaga acagtatttg 9600
gtatctgcgc tctgctgaag ccagttacct tcggaaaaag agttggtagc tcttgatccg 9660
gcaaacaaac caccgctggt agcggtggtt tttttgtttg caagcagcag attacgcgca 9720
gaaaaaaagg atctcaagaa gatcctttga tcttttctac ggggtctgac gctcagtgga 9780
acgaaaactc acgttaaggg attttggtca tgagattatc aaaaaggatc ttcacctaga 9840
tccttttaaa ttaaaaatga agttttaaat caatctaaag tatatatgag taaacttggt 9900
ctgacagtta ccaatgctta atcagtgagg cacctatctc agcgatctgt ctatttcgtt 9960
catccatagt tgcctgactc ctgcaaacca cgttgtgtct caaaatctct gatgttacat 10020
tgcacaagat aaaaatatat catcatgaac aataaaactg tctgcttaca taaacagtaa 10080
tacaaggggt gttatgagcc atattcaacg ggaaacgtct tgctcgaggc cgcgattaaa 10140
ttccaacatg gatgctgatt tatatgggta taaatgggct cgcgataatg tcgggcaatc 10200
aggtgcgaca atctatcgat tgtatgggaa gcccgatgcg ccagagttgt ttctgaaaca 10260
tggcaaaggt agcgttgcca atgatgttac agatgagatg gtcagactaa actggctgac 10320
ggaatttatg cctcttccga ccatcaagca ttttatccgt actcctgatg atgcatggtt 10380
actcaccact gcgatccccg ggaaaacagc attccaggta ttagaagaat atcctgattc 10440
aggtgaaaat attgttgatg cgctggcagt gttcctgcgc cggttgcatt cgattcctgt 10500
ttgtaattgt ccttttaaca gcgatcgcgt atttcgtctc gctcaggcgc aatcacgaat 10560
gaataacggt ttggttgatg cgagtgattt tgatgacgag cgtaatggct ggcctgttga 10620
acaagtctgg aaagaaatgc ataagctttt gccattctca ccggattcag tcgtcactca 10680
tggtgatttc tcacttgata accttatttt tgacgagggg aaattaatag gttgtattga 10740
tgttggacga gtcggaatcg cagaccgata ccaggatctt gccatcctat ggaactgcct 10800
cggtgagttt tctccttcat tacagaaacg gctttttcaa aaatatggta ttgataatcc 10860
tgatatgaat aaattgcagt ttcatttgat gctcgatgag tttttctaag ggcggcctgc 10920
caccataccc acgccgaaac aagcgctcat gagcccgaag tggcgagccc gatcttcccc 10980
atcggtgatg tcggcgatat aggcgccagc aaccgcacct gtggcgccgg tgatgagggc 11040
gcgccaagtc gacgtccggc agtc 11064
<210> 45
<211> 250
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 45
Met Glu Lys Gly Pro Val Arg Ala Pro Ala Glu Lys Pro Arg Gly Ala
1 5 10 15
Arg Cys Ser Asn Gly Phe Pro Glu Arg Asp Pro Pro Arg Pro Gly Pro
20 25 30
Ser Arg Pro Ala Glu Lys Pro Pro Arg Pro Glu Ala Lys Ser Ala Gln
35 40 45
Pro Ala Asp Gly Trp Lys Gly Glu Arg Pro Arg Ser Glu Glu Asp Asn
50 55 60
Glu Leu Asn Leu Pro Asn Leu Ala Ala Ala Tyr Ser Ser Ile Leu Ser
65 70 75 80
Ser Leu Gly Glu Asn Pro Gln Arg Gln Gly Leu Leu Lys Thr Pro Trp
85 90 95
Arg Ala Ala Ser Ala Met Gln Phe Phe Thr Lys Gly Tyr Gln Glu Thr
100 105 110
Ile Ser Asp Val Leu Asn Asp Ala Ile Phe Asp Glu Asp His Asp Glu
115 120 125
Met Val Ile Val Lys Asp Ile Asp Met Phe Ser Met Cys Glu His His
130 135 140
Leu Val Pro Phe Val Gly Lys Val His Ile Gly Tyr Leu Pro Asn Lys
145 150 155 160
Gln Val Leu Gly Leu Ser Lys Leu Ala Arg Ile Val Glu Ile Tyr Ser
165 170 175
Arg Arg Leu Gln Val Gln Glu Arg Leu Thr Lys Gln Ile Ala Val Ala
180 185 190
Ile Thr Glu Ala Leu Arg Pro Ala Gly Val Gly Val Val Val Glu Ala
195 200 205
Thr His Met Cys Met Val Met Arg Gly Val Gln Lys Met Asn Ser Lys
210 215 220
Thr Val Thr Ser Thr Met Leu Gly Val Phe Arg Glu Asp Pro Lys Thr
225 230 235 240
Arg Glu Glu Phe Leu Thr Leu Ile Arg Ser
245 250
<210> 46
<211> 750
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 46
atggagaagg gccccgtgcg cgcccccgcc gagaagcccc gcggcgcccg ctgcagcaac 60
ggcttccccg agcgcgaccc cccccgcccc ggccccagcc gccccgccga gaagcccccc 120
cgccccgagg ccaagagcgc ccagcccgcc gacggctgga agggcgagcg cccccgcagc 180
gaggaggaca acgagctgaa cctgcccaac ctggccgccg cctacagcag catcctgagc 240
agcctgggcg agaaccccca gcgccagggc ctgctgaaga ccccctggcg cgccgccagc 300
gccatgcagt tcttcaccaa gggctaccag gagaccatca gcgacgtgct gaacgacgcc 360
atcttcgacg aggaccacga cgagatggtg atcgtgaagg acatcgacat gttcagcatg 420
tgcgagcacc acctggtgcc cttcgtgggc aaggtgcaca tcggctacct gcccaacaag 480
caggtgctgg gcctgagcaa gctggcccgc atcgtggaga tctacagccg ccgcctgcag 540
gtgcaggagc gcctgaccaa gcagatcgcc gtggccatca ccgaggccct gcgccccgcc 600
ggcgtgggcg tggtggtgga ggccacccac atgtgcatgg tgatgcgcgg cgtgcagaag 660
atgaacagca agaccgtgac cagcaccatg ctgggcgtgt tccgcgagga ccccaagacc 720
cgcgaggagt tcctgaccct gatccgcagc 750
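The lengths of SEQ ID NO: 45 (250 residues) and SEQ ID NO: 46 (750 nucleotides) fit a reading in which the DNA is a coding sequence for the protein, since 750 = 3 x 250. This is an inference from the listing, not something it states. The sketch below is a minimal check of that reading on the first 60 nucleotides only, using the standard genetic code. The names `translate`, `three_letter_to_one`, `dna_prefix` and `protein_prefix` are hypothetical and are not part of the listing.

```python
# Standard genetic code, built in the conventional T, C, A, G codon order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

# Three-letter to one-letter amino acid codes.
THREE_TO_ONE = {
    "Ala": "A", "Arg": "R", "Asn": "N", "Asp": "D", "Cys": "C",
    "Gln": "Q", "Glu": "E", "Gly": "G", "His": "H", "Ile": "I",
    "Leu": "L", "Lys": "K", "Met": "M", "Phe": "F", "Pro": "P",
    "Ser": "S", "Thr": "T", "Trp": "W", "Tyr": "Y", "Val": "V",
}


def translate(dna: str) -> str:
    """Translate a DNA string (whitespace ignored) in reading frame 1."""
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))


def three_letter_to_one(residues: str) -> str:
    """Convert space-separated three-letter residue codes to one-letter codes."""
    return "".join(THREE_TO_ONE[r] for r in residues.split())


# First 60 nucleotides of SEQ ID NO: 46, copied from the listing.
dna_prefix = "atggagaagg gccccgtgcg cgcccccgcc gagaagcccc gcggcgcccg ctgcagcaac"

# First 20 residues of SEQ ID NO: 45, copied from the listing.
protein_prefix = (
    "Met Glu Lys Gly Pro Val Arg Ala Pro Ala Glu Lys Pro Arg Gly Ala "
    "Arg Cys Ser Asn"
)

prefix_matches = translate(dna_prefix) == three_letter_to_one(protein_prefix)
print(prefix_matches)
```

Running the same comparison over the full 750-nucleotide and 250-residue sequences would extend the check to the whole pair; only the prefix is compared here.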
<210> 47
<211> 203
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 47
Met Gly Ser Arg Asp His Leu Phe Lys Val Leu Val Val Gly Asp Ala
1 5 10 15
Ala Val Gly Lys Thr Ser Leu Val Gln Arg Tyr Ser Gln Asp Ser Phe
20 25 30
Ser Lys His Tyr Lys Ser Thr Val Gly Val Asp Phe Ala Leu Lys Val
35 40 45
Leu Gln Trp Ser Asp Tyr Glu Ile Val Arg Leu Gln Leu Trp Asp Ile
50 55 60
Ala Gly Gln Glu Arg Phe Thr Ser Met Thr Arg Leu Tyr Tyr Arg Asp
65 70 75 80
Ala Ser Ala Cys Val Ile Met Phe Asp Val Thr Asn Ala Thr Thr Phe
85 90 95
Ser Asn Ser Gln Arg Trp Lys Gln Asp Leu Asp Ser Lys Leu Thr Leu
100 105 110
Pro Asn Gly Glu Pro Val Pro Cys Leu Leu Leu Ala Asn Lys Cys Asp
115 120 125
Leu Ser Pro Trp Ala Val Ser Arg Asp Gln Ile Asp Arg Phe Ser Lys
130 135 140
Glu Asn Gly Phe Thr Gly Trp Thr Glu Thr Ser Val Lys Glu Asn Lys
145 150 155 160
Asn Ile Asn Glu Ala Met Arg Val Leu Ile Glu Lys Met Met Arg Asn
165 170 175
Ser Thr Glu Asp Ile Met Ser Leu Ser Thr Gln Gly Asp Tyr Ile Asn
180 185 190
Leu Gln Thr Lys Ser Ser Ser Ser Trp Ser Cys Cys
195 200
<210> 48
<211> 609
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 48
atgggcagcc gcgaccacct gttcaaggtg ctggtggtgg
Claims (15)
A method of treating a subject having or suspected of having a central nervous system (CNS) disease, comprising administering to the subject an isolated nucleic acid comprising:
(i) an expression construct comprising a transgene encoding one or more gene products listed in Table 1 and/or one or more inhibitory nucleic acids targeting one or more gene products listed in Table 1; and
(ii) two adeno-associated virus (AAV) inverted terminal repeats (ITRs) flanking the expression construct.
Applications Claiming Priority (19)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US201962831840P | 2019-04-10 | 2019-04-10 | |
US201962831846P | 2019-04-10 | 2019-04-10 | |
US201962831856P | 2019-04-10 | 2019-04-10 | |
US201962832223P | 2019-04-10 | 2019-04-10 | |
US62/831,856 | 2019-04-10 | ||
US62/831,846 | 2019-04-10 | ||
US62/831,840 | 2019-04-10 | ||
US62/832,223 | 2019-04-10 | ||
US201962934450P | 2019-11-12 | 2019-11-12 | |
US62/934,450 | 2019-11-12 | ||
US201962954089P | 2019-12-27 | 2019-12-27 | |
US62/954,089 | 2019-12-27 | ||
US202062960471P | 2020-01-13 | 2020-01-13 | |
US62/960,471 | 2020-01-13 | ||
US202062988665P | 2020-03-12 | 2020-03-12 | |
US62/988,665 | 2020-03-12 | ||
US202062990246P | 2020-03-16 | 2020-03-16 | |
US62/990,246 | 2020-03-16 | ||
PCT/US2020/027658 WO2020210615A1 (en) | 2019-04-10 | 2020-04-10 | Gene therapies for lysosomal disorders |
Publications (1)
Publication Number | Publication Date |
---|---|
KR20210150487A (en) | 2021-12-10 |
Family
ID=79032950
Family Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
KR1020217036238A KR20210150487A (en) | 2019-04-10 | 2020-04-10 | Gene Therapy for Lysosomal Disorders |
Country Status (6)
Country | Link |
---|---|
EP (1) | EP3952923A4 (en) |
JP (1) | JP2022527015A (en) |
KR (1) | KR20210150487A (en) |
CN (1) | CN114025806A (en) |
IL (1) | IL286582A (en) |
MX (1) | MX2021012467A (en) |
Family Cites Families (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US10648001B2 (en) * | 2012-07-11 | 2020-05-12 | Sangamo Therapeutics, Inc. | Method of treating mucopolysaccharidosis type I or II |
SG11201509419QA (en) * | 2013-05-15 | 2015-12-30 | Univ Minnesota | Adeno-associated virus mediated gene transfer to the central nervous system |
CA2985235A1 (en) * | 2015-05-07 | 2016-11-10 | Shire Human Genetic Therapies, Inc. | Glucocerebrosidase gene therapy for parkinson's disease |
EP3662060A2 (en) * | 2017-08-03 | 2020-06-10 | Voyager Therapeutics, Inc. | Compositions and methods for delivery of aav |
2020
- 2020-04-10 EP EP20787456.1A patent/EP3952923A4/en not_active Withdrawn
- 2020-04-10 MX MX2021012467A patent/MX2021012467A/en unknown
- 2020-04-10 CN CN202080027713.0A patent/CN114025806A/en not_active Withdrawn
- 2020-04-10 KR KR1020217036238A patent/KR20210150487A/en unknown
- 2020-04-10 JP JP2021559976A patent/JP2022527015A/en active Pending
2021
- 2021-09-22 IL IL286582A patent/IL286582A/en unknown
Also Published As
Publication number | Publication date |
---|---|
EP3952923A4 (en) | 2023-10-18 |
IL286582A (en) | 2021-12-01 |
CN114025806A (en) | 2022-02-08 |
MX2021012467A (en) | 2021-12-10 |
JP2022527015A (en) | 2022-05-27 |
EP3952923A1 (en) | 2022-02-16 |
Similar Documents
Publication | Title | Publication Date |
---|---|---|
AU2020260485B2 (en) | Gene therapies for lysosomal disorders | |
AU2020260476B2 (en) | Gene therapies for lysosomal disorders | |
AU2020205228B2 (en) | Gene therapies for lysosomal disorders | |
RU2650860C2 (en) | Vectors for expression of prostate-associated antigens | |
RU2758489C2 (en) | Compositions and methods for expressing several biologically active polypeptides from one vector for the treatment of heart diseases and other pathologies | |
KR20210150486A (en) | Gene therapy for lysosomal disorders | |
KR20210086645A (en) | AAV triple-plasmid system | |
KR20220006527A (en) | Gene therapy for lysosomal disorders | |
KR20230066360A (en) | Gene Therapy for Neurodegenerative Disorders | |
KR20150014505A (en) | Subfamily e simian adenoviruses a1302, a1320, a1331 and a1337 and uses thereof | |
CN113322281B (en) | Recombinant adeno-associated virus for high-efficiency tissue-specific expression of RS1 protein and application thereof | |
AU2020344628A1 (en) | Compositions and methods for TCR reprogramming using fusion proteins | |
KR20230051529A (en) | Gene Therapy for Lysosomal Disorders | |
KR20210150487A (en) | Gene Therapy for Lysosomal Disorders | |
KR20150021839A (en) | Recombinant adenovirus comprising regulated derivatives of tumor-targeting trans-splicing ribozyme and uses thereof | |
TW202233830A (en) | Compositions and methods for the treatment of cancer using next generation engineered t cell therapy |