KR20240036508A - Adenoviral helper plasmid - Google Patents
Adenoviral helper plasmid
- Publication number
- KR20240036508A KR1020237042905A KR20237042905A
- Authority
- KR
- South Korea
- Prior art keywords
- seq
- ala
- helper plasmid
- leu
- adenoviral helper
- Prior art date
- 239000013612 plasmid Substances 0.000 title claims abstract description 374
- 238000004519 manufacturing process Methods 0.000 claims abstract description 29
- 108090000623 proteins and genes Proteins 0.000 claims description 266
- 239000002773 nucleotide Substances 0.000 claims description 162
- 125000003729 nucleotide group Chemical group 0.000 claims description 162
- 108020004414 DNA Proteins 0.000 claims description 121
- NTIZESTWPVYFNL-UHFFFAOYSA-N Methyl isobutyl ketone Chemical compound CC(C)CC(C)=O NTIZESTWPVYFNL-UHFFFAOYSA-N 0.000 claims description 103
- 102000004169 proteins and genes Human genes 0.000 claims description 89
- 230000008488 polyadenylation Effects 0.000 claims description 74
- 101150068034 UL30 gene Proteins 0.000 claims description 63
- 101150099321 UL42 gene Proteins 0.000 claims description 58
- 108091034131 VA RNA Proteins 0.000 claims description 52
- 238000011144 upstream manufacturing Methods 0.000 claims description 51
- 108091028043 Nucleic acid sequence Proteins 0.000 claims description 46
- 241000700588 Human alphaherpesvirus 1 Species 0.000 claims description 40
- 230000014509 gene expression Effects 0.000 claims description 38
- 101000834253 Gallus gallus Actin, cytoplasmic 1 Proteins 0.000 claims description 36
- 101710118538 Protease Proteins 0.000 claims description 36
- 239000000835 fiber Substances 0.000 claims description 35
- 239000002243 precursor Substances 0.000 claims description 35
- 108050000932 Packaging protein 3 Proteins 0.000 claims description 34
- 101710193132 Pre-hexon-linking protein VIII Proteins 0.000 claims description 32
- 125000003275 alpha amino acid group Chemical group 0.000 claims description 31
- 238000000034 method Methods 0.000 claims description 20
- 101150008036 UL29 gene Proteins 0.000 claims description 16
- 241000701161 unidentified adenovirus Species 0.000 claims description 16
- 101150026402 DBP gene Proteins 0.000 claims description 15
- 102000010292 Peptide Elongation Factor 1 Human genes 0.000 claims description 11
- 108010077524 Peptide Elongation Factor 1 Proteins 0.000 claims description 11
- 239000013607 AAV vector Substances 0.000 claims description 10
- 101710187001 DNA terminal protein Proteins 0.000 claims description 10
- 108700019146 Transgenes Proteins 0.000 claims description 10
- 238000003776 cleavage reaction Methods 0.000 claims description 10
- 230000007017 scission Effects 0.000 claims description 10
- FWMNVWWHGCHHJJ-SKKKGAJSSA-N 4-amino-1-[(2r)-6-amino-2-[[(2r)-2-[[(2r)-2-[[(2r)-2-amino-3-phenylpropanoyl]amino]-3-phenylpropanoyl]amino]-4-methylpentanoyl]amino]hexanoyl]piperidine-4-carboxylic acid Chemical compound C([C@H](C(=O)N[C@H](CC(C)C)C(=O)N[C@H](CCCCN)C(=O)N1CCC(N)(CC1)C(O)=O)NC(=O)[C@H](N)CC=1C=CC=CC=1)C1=CC=CC=C1 FWMNVWWHGCHHJJ-SKKKGAJSSA-N 0.000 claims description 9
- 108091030071 RNAI Proteins 0.000 claims description 9
- 102100021519 Hemoglobin subunit beta Human genes 0.000 claims description 8
- 108091005904 Hemoglobin subunit beta Proteins 0.000 claims description 8
- 101710183861 Hexon-associated protein Proteins 0.000 claims description 6
- 229930027917 kanamycin Natural products 0.000 claims description 5
- 229960000318 kanamycin Drugs 0.000 claims description 5
- SBUJHOSQTJFQJX-NOAMYHISSA-N kanamycin Chemical compound O[C@@H]1[C@@H](O)[C@H](O)[C@@H](CN)O[C@@H]1O[C@H]1[C@H](O)[C@@H](O[C@@H]2[C@@H]([C@@H](N)[C@H](O)[C@@H](CO)O2)O)[C@H](N)C[C@@H]1N SBUJHOSQTJFQJX-NOAMYHISSA-N 0.000 claims description 5
- 229930182823 kanamycin A Natural products 0.000 claims description 5
- 108700039691 Genetic Promoter Regions Proteins 0.000 claims description 4
- 108010090851 Simplexvirus DNA polymerase Proteins 0.000 claims description 3
- 108700022465 Simplexvirus ICP8 Proteins 0.000 claims description 3
- 239000013603 viral vector Substances 0.000 claims description 3
- 102000034240 fibrous proteins Human genes 0.000 claims description 2
- 108091005899 fibrous proteins Proteins 0.000 claims description 2
- 230000009452 underexpression Effects 0.000 claims 1
- 241000702421 Dependoparvovirus Species 0.000 abstract description 5
- 108091033319 polynucleotide Proteins 0.000 description 76
- 102000040430 polynucleotide Human genes 0.000 description 76
- 239000002157 polynucleotide Substances 0.000 description 76
- 210000004027 cell Anatomy 0.000 description 47
- 239000012634 fragment Substances 0.000 description 40
- 150000007523 nucleic acids Chemical class 0.000 description 40
- 102000039446 nucleic acids Human genes 0.000 description 31
- 108020004707 nucleic acids Proteins 0.000 description 31
- 239000013598 vector Substances 0.000 description 28
- 108010006025 bovine growth hormone Proteins 0.000 description 25
- 108090000765 processed proteins & peptides Proteins 0.000 description 23
- 229920001184 polypeptide Polymers 0.000 description 21
- 102000004196 processed proteins & peptides Human genes 0.000 description 21
- 238000005538 encapsulation Methods 0.000 description 20
- 101100029566 Rattus norvegicus Rabggta gene Proteins 0.000 description 15
- 108010050848 glycylleucine Proteins 0.000 description 14
- 238000013461 design Methods 0.000 description 13
- KOSRFJWDECSPRO-UHFFFAOYSA-N alpha-L-glutamyl-L-glutamic acid Natural products OC(=O)CCC(N)C(=O)NC(CCC(O)=O)C(O)=O KOSRFJWDECSPRO-UHFFFAOYSA-N 0.000 description 12
- 239000003623 enhancer Substances 0.000 description 12
- 108091034117 Oligonucleotide Proteins 0.000 description 11
- VPZXBVLAVMBEQI-UHFFFAOYSA-N glycyl-DL-alpha-alanine Natural products OC(=O)C(C)NC(=O)CN VPZXBVLAVMBEQI-UHFFFAOYSA-N 0.000 description 11
- JBCLFWXMTIKCCB-UHFFFAOYSA-N H-Gly-Phe-OH Natural products NCC(=O)NC(C(O)=O)CC1=CC=CC=C1 JBCLFWXMTIKCCB-UHFFFAOYSA-N 0.000 description 10
- 241000880493 Leptailurus serval Species 0.000 description 10
- 108010047495 alanylglycine Proteins 0.000 description 10
- 108010013835 arginine glutamate Proteins 0.000 description 10
- 108091032973 (ribonucleotides)n+m Proteins 0.000 description 9
- 239000003795 chemical substances by application Substances 0.000 description 9
- XKUKSGPZAADMRA-UHFFFAOYSA-N glycyl-glycyl-glycine Natural products NCC(=O)NCC(=O)NCC(O)=O XKUKSGPZAADMRA-UHFFFAOYSA-N 0.000 description 9
- 108010057821 leucylproline Proteins 0.000 description 9
- 108010061238 threonyl-glycine Proteins 0.000 description 9
- YBAFDPFAUTYYRW-UHFFFAOYSA-N N-L-alpha-glutamyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CCC(O)=O YBAFDPFAUTYYRW-UHFFFAOYSA-N 0.000 description 8
- SITLTJHOQZFJGG-UHFFFAOYSA-N N-L-alpha-glutamyl-L-valine Natural products CC(C)C(C(O)=O)NC(=O)C(N)CCC(O)=O SITLTJHOQZFJGG-UHFFFAOYSA-N 0.000 description 8
- 238000011529 RT qPCR Methods 0.000 description 8
- 150000001875 compounds Chemical class 0.000 description 8
- 230000010076 replication Effects 0.000 description 8
- 108010079364 N-glycylalanine Proteins 0.000 description 7
- 108010038633 aspartylglutamate Proteins 0.000 description 7
- 108010055341 glutamyl-glutamic acid Proteins 0.000 description 7
- 108010034529 leucyl-lysine Proteins 0.000 description 7
- 102200157658 rs1555229948 Human genes 0.000 description 7
- 108010026333 seryl-proline Proteins 0.000 description 7
- 238000001890 transfection Methods 0.000 description 7
- SENJXOPIZNYLHU-UHFFFAOYSA-N L-leucyl-L-arginine Natural products CC(C)CC(N)C(=O)NC(C(O)=O)CCCN=C(N)N SENJXOPIZNYLHU-UHFFFAOYSA-N 0.000 description 6
- 108010087924 alanylproline Proteins 0.000 description 6
- 108010081551 glycylphenylalanine Proteins 0.000 description 6
- 108010040030 histidinoalanine Proteins 0.000 description 6
- 108010073472 leucyl-prolyl-proline Proteins 0.000 description 6
- 108010000761 leucylarginine Proteins 0.000 description 6
- 210000004962 mammalian cell Anatomy 0.000 description 6
- 108010051242 phenylalanylserine Proteins 0.000 description 6
- 108010077112 prolyl-proline Proteins 0.000 description 6
- 108010070643 prolylglutamic acid Proteins 0.000 description 6
- 230000002103 transcriptional effect Effects 0.000 description 6
- 230000009466 transformation Effects 0.000 description 6
- 230000003612 virological effect Effects 0.000 description 6
- -1 2-thiothymidine Chemical compound 0.000 description 5
- JBVSSSZFNTXJDX-YTLHQDLWSA-N Ala-Ala-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@H](C)N JBVSSSZFNTXJDX-YTLHQDLWSA-N 0.000 description 5
- 208000002267 Anti-neutrophil cytoplasmic antibody-associated vasculitis Diseases 0.000 description 5
- 101710145505 Fiber protein Proteins 0.000 description 5
- BQSLGJHIAGOZCD-CIUDSAMLSA-N Leu-Ala-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O BQSLGJHIAGOZCD-CIUDSAMLSA-N 0.000 description 5
- 108010002311 N-glycylglutamic acid Proteins 0.000 description 5
- SBVPYBFMIGDIDX-SRVKXCTJSA-N Pro-Pro-Pro Chemical compound OC(=O)[C@@H]1CCCN1C(=O)[C@H]1N(C(=O)[C@H]2NCCC2)CCC1 SBVPYBFMIGDIDX-SRVKXCTJSA-N 0.000 description 5
- 108010005233 alanylglutamic acid Proteins 0.000 description 5
- 238000010367 cloning Methods 0.000 description 5
- 230000000694 effects Effects 0.000 description 5
- 108010049041 glutamylalanine Proteins 0.000 description 5
- 108010025306 histidylleucine Proteins 0.000 description 5
- 108010012581 phenylalanylglutamate Proteins 0.000 description 5
- 229920000642 polymer Polymers 0.000 description 5
- 108010053725 prolylvaline Proteins 0.000 description 5
- 108010071207 serylmethionine Proteins 0.000 description 5
- 238000013518 transcription Methods 0.000 description 5
- 230000035897 transcription Effects 0.000 description 5
- 102000007469 Actins Human genes 0.000 description 4
- 108010085238 Actins Proteins 0.000 description 4
- JBGSZRYCXBPWGX-BQBZGAKWSA-N Ala-Arg-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)[C@@H](N)C)CCCN=C(N)N JBGSZRYCXBPWGX-BQBZGAKWSA-N 0.000 description 4
- IPZQNYYAYVRKKK-FXQIFTODSA-N Ala-Pro-Ala Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O IPZQNYYAYVRKKK-FXQIFTODSA-N 0.000 description 4
- UISQLSIBJKEJSS-GUBZILKMSA-N Arg-Arg-Ser Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CO)C(O)=O UISQLSIBJKEJSS-GUBZILKMSA-N 0.000 description 4
- GNYUVVJYGJFKHN-RVMXOQNASA-N Arg-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N GNYUVVJYGJFKHN-RVMXOQNASA-N 0.000 description 4
- XWKBWZXGNXTDKY-ZKWXMUAHSA-N Asp-Val-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC(O)=O XWKBWZXGNXTDKY-ZKWXMUAHSA-N 0.000 description 4
- HVYWQYLBVXMXSV-GUBZILKMSA-N Glu-Leu-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O HVYWQYLBVXMXSV-GUBZILKMSA-N 0.000 description 4
- CCQOOWAONKGYKQ-BYPYZUCNSA-N Gly-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)CN CCQOOWAONKGYKQ-BYPYZUCNSA-N 0.000 description 4
- IBMVEYRWAWIOTN-UHFFFAOYSA-N L-Leucyl-L-Arginyl-L-Proline Natural products CC(C)CC(N)C(=O)NC(CCCN=C(N)N)C(=O)N1CCCC1C(O)=O IBMVEYRWAWIOTN-UHFFFAOYSA-N 0.000 description 4
- SITWEMZOJNKJCH-UHFFFAOYSA-N L-alanine-L-arginine Natural products CC(N)C(=O)NC(C(O)=O)CCCNC(N)=N SITWEMZOJNKJCH-UHFFFAOYSA-N 0.000 description 4
- HFBCHNRFRYLZNV-GUBZILKMSA-N Leu-Glu-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O HFBCHNRFRYLZNV-GUBZILKMSA-N 0.000 description 4
- WMIOEVKKYIMVKI-DCAQKATOSA-N Leu-Pro-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O WMIOEVKKYIMVKI-DCAQKATOSA-N 0.000 description 4
- XMBSYZWANAQXEV-UHFFFAOYSA-N N-alpha-L-glutamyl-L-phenylalanine Natural products OC(=O)CCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 XMBSYZWANAQXEV-UHFFFAOYSA-N 0.000 description 4
- KZNQNBZMBZJQJO-UHFFFAOYSA-N N-glycyl-L-proline Natural products NCC(=O)N1CCCC1C(O)=O KZNQNBZMBZJQJO-UHFFFAOYSA-N 0.000 description 4
- XXXAXOWMBOKTRN-XPUUQOCRSA-N Ser-Gly-Val Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O XXXAXOWMBOKTRN-XPUUQOCRSA-N 0.000 description 4
- VMLONWHIORGALA-SRVKXCTJSA-N Ser-Leu-Leu Chemical compound CC(C)C[C@@H](C([O-])=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H]([NH3+])CO VMLONWHIORGALA-SRVKXCTJSA-N 0.000 description 4
- IGROJMCBGRFRGI-YTLHQDLWSA-N Thr-Ala-Ala Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O IGROJMCBGRFRGI-YTLHQDLWSA-N 0.000 description 4
- IQFYYKKMVGJFEH-XLPZGREQSA-N Thymidine Chemical compound O=C1NC(=O)C(C)=CN1[C@@H]1O[C@H](CO)[C@@H](O)C1 IQFYYKKMVGJFEH-XLPZGREQSA-N 0.000 description 4
- 241000700605 Viruses Species 0.000 description 4
- OIRDTQYFTABQOQ-KQYNXXCUSA-N adenosine Chemical compound C1=NC=2C(N)=NC=NC=2N1[C@@H]1O[C@H](CO)[C@@H](O)[C@H]1O OIRDTQYFTABQOQ-KQYNXXCUSA-N 0.000 description 4
- 108010076324 alanyl-glycyl-glycine Proteins 0.000 description 4
- 108010069020 alanyl-prolyl-glycine Proteins 0.000 description 4
- 229960000723 ampicillin Drugs 0.000 description 4
- AVKUERGKIZMTKX-NJBDSQKTSA-N ampicillin Chemical compound C1([C@@H](N)C(=O)N[C@H]2[C@H]3SC([C@@H](N3C2=O)C(O)=O)(C)C)=CC=CC=C1 AVKUERGKIZMTKX-NJBDSQKTSA-N 0.000 description 4
- 108010068380 arginylarginine Proteins 0.000 description 4
- 108010068265 aspartyltyrosine Proteins 0.000 description 4
- 108010006664 gamma-glutamyl-glycyl-glycine Proteins 0.000 description 4
- 108010078144 glutaminyl-glycine Proteins 0.000 description 4
- XBGGUPMXALFZOT-UHFFFAOYSA-N glycyl-L-tyrosine hemihydrate Natural products NCC(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 XBGGUPMXALFZOT-UHFFFAOYSA-N 0.000 description 4
- 108010010147 glycylglutamine Proteins 0.000 description 4
- 108010015792 glycyllysine Proteins 0.000 description 4
- 108010037850 glycylvaline Proteins 0.000 description 4
- 108010036413 histidylglycine Proteins 0.000 description 4
- 108010092114 histidylphenylalanine Proteins 0.000 description 4
- 239000000203 mixture Substances 0.000 description 4
- 150000004713 phosphodiesters Chemical class 0.000 description 4
- 108010031719 prolyl-serine Proteins 0.000 description 4
- 108010004914 prolylarginine Proteins 0.000 description 4
- 230000000576 supplementary effect Effects 0.000 description 4
- 108010080629 tryptophan-leucine Proteins 0.000 description 4
- ZDTFMPXQUSBYRL-UUOKFMHZSA-N 2-Aminoadenosine Chemical compound C12=NC(N)=NC(N)=C2N=CN1[C@@H]1O[C@H](CO)[C@@H](O)[C@H]1O ZDTFMPXQUSBYRL-UUOKFMHZSA-N 0.000 description 3
- VIGKUFXFTPWYER-BIIVOSGPSA-N Ala-Cys-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CS)C(=O)N1CCC[C@@H]1C(=O)O)N VIGKUFXFTPWYER-BIIVOSGPSA-N 0.000 description 3
- OMMDTNGURYRDAC-NRPADANISA-N Ala-Glu-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O OMMDTNGURYRDAC-NRPADANISA-N 0.000 description 3
- PCIFXPRIFWKWLK-YUMQZZPRSA-N Ala-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@H](C)N PCIFXPRIFWKWLK-YUMQZZPRSA-N 0.000 description 3
- YHKANGMVQWRMAP-DCAQKATOSA-N Ala-Leu-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N YHKANGMVQWRMAP-DCAQKATOSA-N 0.000 description 3
- CCDFBRZVTDDJNM-GUBZILKMSA-N Ala-Leu-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O CCDFBRZVTDDJNM-GUBZILKMSA-N 0.000 description 3
- AWZKCUCQJNTBAD-SRVKXCTJSA-N Ala-Leu-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCCN AWZKCUCQJNTBAD-SRVKXCTJSA-N 0.000 description 3
- MEFILNJXAVSUTO-JXUBOQSCSA-N Ala-Leu-Thr Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MEFILNJXAVSUTO-JXUBOQSCSA-N 0.000 description 3
- 108010011667 Ala-Phe-Ala Proteins 0.000 description 3
- VQAVBBCZFQAAED-FXQIFTODSA-N Ala-Pro-Asn Chemical compound C[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(=O)N)C(=O)O)N VQAVBBCZFQAAED-FXQIFTODSA-N 0.000 description 3
- KWKQGHSSNHPGOW-BQBZGAKWSA-N Arg-Ala-Gly Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(=O)NCC(O)=O KWKQGHSSNHPGOW-BQBZGAKWSA-N 0.000 description 3
- RFXXUWGNVRJTNQ-QXEWZRGKSA-N Arg-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CCCN=C(N)N)N RFXXUWGNVRJTNQ-QXEWZRGKSA-N 0.000 description 3
- LVMUGODRNHFGRA-AVGNSLFASA-N Arg-Leu-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O LVMUGODRNHFGRA-AVGNSLFASA-N 0.000 description 3
- NMRHDSAOIURTNT-RWMBFGLXSA-N Arg-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N NMRHDSAOIURTNT-RWMBFGLXSA-N 0.000 description 3
- BSYKSCBTTQKOJG-GUBZILKMSA-N Arg-Pro-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O BSYKSCBTTQKOJG-GUBZILKMSA-N 0.000 description 3
- UHGUKCOQUNPSKK-CIUDSAMLSA-N Asn-Leu-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC(=O)N)N UHGUKCOQUNPSKK-CIUDSAMLSA-N 0.000 description 3
- JBDLMLZNDRLDIX-HJGDQZAQSA-N Asn-Thr-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O JBDLMLZNDRLDIX-HJGDQZAQSA-N 0.000 description 3
- PBVLJOIPOGUQQP-CIUDSAMLSA-N Asp-Ala-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O PBVLJOIPOGUQQP-CIUDSAMLSA-N 0.000 description 3
- FIADUEYFRSCCIK-CIUDSAMLSA-N Cys-Glu-Arg Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O FIADUEYFRSCCIK-CIUDSAMLSA-N 0.000 description 3
- 108010014303 DNA-directed DNA polymerase Proteins 0.000 description 3
- ZQPOVSJFBBETHQ-CIUDSAMLSA-N Gln-Glu-Gln Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O ZQPOVSJFBBETHQ-CIUDSAMLSA-N 0.000 description 3
- RUFHOVYUYSNDNY-ACZMJKKPSA-N Glu-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(O)=O RUFHOVYUYSNDNY-ACZMJKKPSA-N 0.000 description 3
- BUZMZDDKFCSKOT-CIUDSAMLSA-N Glu-Glu-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O BUZMZDDKFCSKOT-CIUDSAMLSA-N 0.000 description 3
- SJPMNHCEWPTRBR-BQBZGAKWSA-N Glu-Glu-Gly Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O SJPMNHCEWPTRBR-BQBZGAKWSA-N 0.000 description 3
- AIGROOHQXCACHL-WDSKDSINSA-N Glu-Gly-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H](C)C(O)=O AIGROOHQXCACHL-WDSKDSINSA-N 0.000 description 3
- TWYSSILQABLLME-HJGDQZAQSA-N Glu-Thr-Arg Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O TWYSSILQABLLME-HJGDQZAQSA-N 0.000 description 3
- VIPDPMHGICREIS-GVXVVHGQSA-N Glu-Val-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O VIPDPMHGICREIS-GVXVVHGQSA-N 0.000 description 3
- UPOJUWHGMDJUQZ-IUCAKERBSA-N Gly-Arg-Arg Chemical compound NC(=N)NCCC[C@H](NC(=O)CN)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O UPOJUWHGMDJUQZ-IUCAKERBSA-N 0.000 description 3
- RJIVPOXLQFJRTG-LURJTMIESA-N Gly-Arg-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)CN)CCCN=C(N)N RJIVPOXLQFJRTG-LURJTMIESA-N 0.000 description 3
- QSTLUOIOYLYLLF-WDSKDSINSA-N Gly-Asp-Glu Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O QSTLUOIOYLYLLF-WDSKDSINSA-N 0.000 description 3
- GDOZQTNZPCUARW-YFKPBYRVSA-N Gly-Gly-Glu Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CCC(O)=O GDOZQTNZPCUARW-YFKPBYRVSA-N 0.000 description 3
- UHPAZODVFFYEEL-QWRGUYRKSA-N Gly-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)CN UHPAZODVFFYEEL-QWRGUYRKSA-N 0.000 description 3
- WCORRBXVISTKQL-WHFBIAKZSA-N Gly-Ser-Ser Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O WCORRBXVISTKQL-WHFBIAKZSA-N 0.000 description 3
- 108010065920 Insulin Lispro Proteins 0.000 description 3
- UGTHTQWIQKEDEH-BQBZGAKWSA-N L-alanyl-L-prolylglycine zwitterion Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O UGTHTQWIQKEDEH-BQBZGAKWSA-N 0.000 description 3
- MMEDVBWCMGRKKC-GARJFASQSA-N Leu-Asp-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N MMEDVBWCMGRKKC-GARJFASQSA-N 0.000 description 3
- LOLUPZNNADDTAA-AVGNSLFASA-N Leu-Gln-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O LOLUPZNNADDTAA-AVGNSLFASA-N 0.000 description 3
- LXKNSJLSGPNHSK-KKUMJFAQSA-N Leu-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)O)N LXKNSJLSGPNHSK-KKUMJFAQSA-N 0.000 description 3
- ZJZNLRVCZWUONM-JXUBOQSCSA-N Leu-Thr-Ala Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O ZJZNLRVCZWUONM-JXUBOQSCSA-N 0.000 description 3
- WXJKFRMKJORORD-DCAQKATOSA-N Lys-Arg-Ala Chemical compound NC(=N)NCCC[C@@H](C(=O)N[C@@H](C)C(O)=O)NC(=O)[C@@H](N)CCCCN WXJKFRMKJORORD-DCAQKATOSA-N 0.000 description 3
- YRAWWKUTNBILNT-FXQIFTODSA-N Met-Ala-Ala Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O YRAWWKUTNBILNT-FXQIFTODSA-N 0.000 description 3
- BQVUABVGYYSDCJ-UHFFFAOYSA-N Nalpha-L-Leucyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)CC(C)C)C(O)=O)=CNC2=C1 BQVUABVGYYSDCJ-UHFFFAOYSA-N 0.000 description 3
- 101710163270 Nuclease Proteins 0.000 description 3
- IFMDQWDAJUMMJC-DCAQKATOSA-N Pro-Ala-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O IFMDQWDAJUMMJC-DCAQKATOSA-N 0.000 description 3
- MCWHYUWXVNRXFV-RWMBFGLXSA-N Pro-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@@H]2CCCN2 MCWHYUWXVNRXFV-RWMBFGLXSA-N 0.000 description 3
- AJBQTGZIZQXBLT-STQMWFEESA-N Pro-Phe-Gly Chemical compound C([C@@H](C(=O)NCC(=O)O)NC(=O)[C@H]1NCCC1)C1=CC=CC=C1 AJBQTGZIZQXBLT-STQMWFEESA-N 0.000 description 3
- KNZQGAUEYZJUSQ-ZLUOBGJFSA-N Ser-Asp-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CO)N KNZQGAUEYZJUSQ-ZLUOBGJFSA-N 0.000 description 3
- HJEBZBMOTCQYDN-ACZMJKKPSA-N Ser-Glu-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O HJEBZBMOTCQYDN-ACZMJKKPSA-N 0.000 description 3
- XNCUYZKGQOCOQH-YUMQZZPRSA-N Ser-Leu-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O XNCUYZKGQOCOQH-YUMQZZPRSA-N 0.000 description 3
- ADJDNJCSPNFFPI-FXQIFTODSA-N Ser-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CO ADJDNJCSPNFFPI-FXQIFTODSA-N 0.000 description 3
- AZWNCEBQZXELEZ-FXQIFTODSA-N Ser-Pro-Ser Chemical compound OC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O AZWNCEBQZXELEZ-FXQIFTODSA-N 0.000 description 3
- HSWXBJCBYSWBPT-GUBZILKMSA-N Ser-Val-Val Chemical compound CC(C)[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)CO)C(C)C)C(O)=O HSWXBJCBYSWBPT-GUBZILKMSA-N 0.000 description 3
- TYVAWPFQYFPSBR-BFHQHQDPSA-N Thr-Ala-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)NCC(O)=O TYVAWPFQYFPSBR-BFHQHQDPSA-N 0.000 description 3
- LVHHEVGYAZGXDE-KDXUFGMBSA-N Thr-Ala-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C)C(=O)N1CCC[C@@H]1C(=O)O)N)O LVHHEVGYAZGXDE-KDXUFGMBSA-N 0.000 description 3
- OHAJHDJOCKKJLV-LKXGYXEUSA-N Thr-Asp-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O OHAJHDJOCKKJLV-LKXGYXEUSA-N 0.000 description 3
- YGZWVPBHYABGLT-KJEVXHAQSA-N Thr-Pro-Tyr Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 YGZWVPBHYABGLT-KJEVXHAQSA-N 0.000 description 3
- PDDJTOSAVNRJRH-UNQGMJICSA-N Val-Thr-Phe Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](C(C)C)N)O PDDJTOSAVNRJRH-UNQGMJICSA-N 0.000 description 3
- AEFJNECXZCODJM-UWVGGRQHSA-N Val-Val-Gly Chemical compound CC(C)[C@H]([NH3+])C(=O)N[C@@H](C(C)C)C(=O)NCC([O-])=O AEFJNECXZCODJM-UWVGGRQHSA-N 0.000 description 3
- 108010041407 alanylaspartic acid Proteins 0.000 description 3
- 108010044940 alanylglutamine Proteins 0.000 description 3
- 108010070944 alanylhistidine Proteins 0.000 description 3
- 150000001413 amino acids Chemical class 0.000 description 3
- 108010008355 arginyl-glutamine Proteins 0.000 description 3
- 108010052670 arginyl-glutamyl-glutamic acid Proteins 0.000 description 3
- 108010060035 arginylproline Proteins 0.000 description 3
- 108010040443 aspartyl-aspartic acid Proteins 0.000 description 3
- 108010047857 aspartylglycine Proteins 0.000 description 3
- 108010092854 aspartyllysine Proteins 0.000 description 3
- 230000000295 complement effect Effects 0.000 description 3
- 108010060199 cysteinylproline Proteins 0.000 description 3
- FSXRLASFHBWESK-UHFFFAOYSA-N dipeptide phenylalanyl-tyrosine Natural products C=1C=C(O)C=CC=1CC(C(O)=O)NC(=O)C(N)CC1=CC=CC=C1 FSXRLASFHBWESK-UHFFFAOYSA-N 0.000 description 3
- 108010054813 diprotin B Proteins 0.000 description 3
- 108010063718 gamma-glutamylaspartic acid Proteins 0.000 description 3
- 108010080575 glutamyl-aspartyl-alanine Proteins 0.000 description 3
- 108010000434 glycyl-alanyl-leucine Proteins 0.000 description 3
- 108010062266 glycyl-glycyl-argininal Proteins 0.000 description 3
- 108010067216 glycyl-glycyl-glycine Proteins 0.000 description 3
- 108010001064 glycyl-glycyl-glycyl-glycine Proteins 0.000 description 3
- 108010077435 glycyl-phenylalanyl-glycine Proteins 0.000 description 3
- 108010020688 glycylhistidine Proteins 0.000 description 3
- 108010031424 isoleucyl-prolyl-proline Proteins 0.000 description 3
- 239000006166 lysate Substances 0.000 description 3
- 108010009298 lysylglutamic acid Proteins 0.000 description 3
- 108010017391 lysylvaline Proteins 0.000 description 3
- 108010056582 methionylglutamic acid Proteins 0.000 description 3
- 108010085203 methionylmethionine Proteins 0.000 description 3
- 239000002777 nucleoside Substances 0.000 description 3
- 108010018625 phenylalanylarginine Proteins 0.000 description 3
- 108010073025 phenylalanylphenylalanine Proteins 0.000 description 3
- 108010090894 prolylleucine Proteins 0.000 description 3
- 230000001105 regulatory effect Effects 0.000 description 3
- 230000003362 replicative effect Effects 0.000 description 3
- 108700004896 tripeptide FEG Proteins 0.000 description 3
- 108010073969 valyllysine Proteins 0.000 description 3
- 102000040650 (ribonucleotides)n+m Human genes 0.000 description 2
- QMOQBVOBWVNSNO-UHFFFAOYSA-N 2-[[2-[[2-[(2-azaniumylacetyl)amino]acetyl]amino]acetyl]amino]acetate Chemical compound NCC(=O)NCC(=O)NCC(=O)NCC(O)=O QMOQBVOBWVNSNO-UHFFFAOYSA-N 0.000 description 2
- ZAYHVCMSTBRABG-JXOAFFINSA-N 5-methylcytidine Chemical compound O=C1N=C(N)C(C)=CN1[C@H]1[C@H](O)[C@H](O)[C@@H](CO)O1 ZAYHVCMSTBRABG-JXOAFFINSA-N 0.000 description 2
- UWQJHXKARZWDIJ-ZLUOBGJFSA-N Ala-Ala-Cys Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CS)C(O)=O UWQJHXKARZWDIJ-ZLUOBGJFSA-N 0.000 description 2
- WQVFQXXBNHHPLX-ZKWXMUAHSA-N Ala-Ala-His Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](Cc1cnc[nH]1)C(O)=O WQVFQXXBNHHPLX-ZKWXMUAHSA-N 0.000 description 2
- CXRCVCURMBFFOL-FXQIFTODSA-N Ala-Ala-Pro Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N1CCC[C@H]1C(O)=O CXRCVCURMBFFOL-FXQIFTODSA-N 0.000 description 2
- YYSWCHMLFJLLBJ-ZLUOBGJFSA-N Ala-Ala-Ser Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O YYSWCHMLFJLLBJ-ZLUOBGJFSA-N 0.000 description 2
- SDMAQFGBPOJFOM-GUBZILKMSA-N Ala-Arg-Arg Chemical compound NC(=N)NCCC[C@H](NC(=O)[C@@H](N)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O SDMAQFGBPOJFOM-GUBZILKMSA-N 0.000 description 2
- QDRGPQWIVZNJQD-CIUDSAMLSA-N Ala-Arg-Gln Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(O)=O QDRGPQWIVZNJQD-CIUDSAMLSA-N 0.000 description 2
- YAXNATKKPOWVCP-ZLUOBGJFSA-N Ala-Asn-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(O)=O YAXNATKKPOWVCP-ZLUOBGJFSA-N 0.000 description 2
- ZEXDYVGDZJBRMO-ACZMJKKPSA-N Ala-Asn-Gln Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N ZEXDYVGDZJBRMO-ACZMJKKPSA-N 0.000 description 2
- KIUYPHAMDKDICO-WHFBIAKZSA-N Ala-Asp-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O KIUYPHAMDKDICO-WHFBIAKZSA-N 0.000 description 2
- GWFSQQNGMPGBEF-GHCJXIJMSA-N Ala-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](C)N GWFSQQNGMPGBEF-GHCJXIJMSA-N 0.000 description 2
- YSMPVONNIWLJML-FXQIFTODSA-N Ala-Asp-Pro Chemical compound C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(O)=O YSMPVONNIWLJML-FXQIFTODSA-N 0.000 description 2
- BUDNAJYVCUHLSV-ZLUOBGJFSA-N Ala-Asp-Ser Chemical compound C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O BUDNAJYVCUHLSV-ZLUOBGJFSA-N 0.000 description 2
- MIPWEZAIMPYQST-FXQIFTODSA-N Ala-Cys-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CS)C(=O)N[C@@H](C(C)C)C(O)=O MIPWEZAIMPYQST-FXQIFTODSA-N 0.000 description 2
- VWEWCZSUWOEEFM-WDSKDSINSA-N Ala-Gly-Ala-Gly Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@@H](C)C(=O)NCC(O)=O VWEWCZSUWOEEFM-WDSKDSINSA-N 0.000 description 2
- MQIGTEQXYCRLGK-BQBZGAKWSA-N Ala-Gly-Pro Chemical compound C[C@H](N)C(=O)NCC(=O)N1CCC[C@H]1C(O)=O MQIGTEQXYCRLGK-BQBZGAKWSA-N 0.000 description 2
- NBTGEURICRTMGL-WHFBIAKZSA-N Ala-Gly-Ser Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O NBTGEURICRTMGL-WHFBIAKZSA-N 0.000 description 2
- CFPQUJZTLUQUTJ-HTFCKZLJSA-N Ala-Ile-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@H](C)N CFPQUJZTLUQUTJ-HTFCKZLJSA-N 0.000 description 2
- TZDNWXDLYFIFPT-BJDJZHNGSA-N Ala-Ile-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O TZDNWXDLYFIFPT-BJDJZHNGSA-N 0.000 description 2
- RZZMZYZXNJRPOJ-BJDJZHNGSA-N Ala-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](C)N RZZMZYZXNJRPOJ-BJDJZHNGSA-N 0.000 description 2
- LNNSWWRRYJLGNI-NAKRPEOUSA-N Ala-Ile-Val Chemical compound C[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O LNNSWWRRYJLGNI-NAKRPEOUSA-N 0.000 description 2
- DPNZTBKGAUAZQU-DLOVCJGASA-N Ala-Leu-His Chemical compound C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N DPNZTBKGAUAZQU-DLOVCJGASA-N 0.000 description 2
- VHVVPYOJIIQCKS-QEJZJMRPSA-N Ala-Leu-Phe Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 VHVVPYOJIIQCKS-QEJZJMRPSA-N 0.000 description 2
- OMDNCNKNEGFOMM-BQBZGAKWSA-N Ala-Met-Gly Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCSC)C(=O)NCC(O)=O OMDNCNKNEGFOMM-BQBZGAKWSA-N 0.000 description 2
- XRUJOVRWNMBAAA-NHCYSSNCSA-N Ala-Phe-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=CC=C1 XRUJOVRWNMBAAA-NHCYSSNCSA-N 0.000 description 2
- CNQAFFMNJIQYGX-DRZSPHRISA-N Ala-Phe-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=CC=C1 CNQAFFMNJIQYGX-DRZSPHRISA-N 0.000 description 2
- VNFSAYFQLXPHPY-CIQUZCHMSA-N Ala-Thr-Ile Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O VNFSAYFQLXPHPY-CIQUZCHMSA-N 0.000 description 2
- BOKLLPVAQDSLHC-FXQIFTODSA-N Ala-Val-Cys Chemical compound C[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CS)C(=O)O)N BOKLLPVAQDSLHC-FXQIFTODSA-N 0.000 description 2
- YJHKTAMKPGFJCT-NRPADANISA-N Ala-Val-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O YJHKTAMKPGFJCT-NRPADANISA-N 0.000 description 2
- GXCSUJQOECMKPV-CIUDSAMLSA-N Arg-Ala-Gln Chemical compound C[C@H](NC(=O)[C@@H](N)CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(O)=O GXCSUJQOECMKPV-CIUDSAMLSA-N 0.000 description 2
- XEPSCVXTCUUHDT-AVGNSLFASA-N Arg-Arg-Leu Natural products CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@@H](N)CCCN=C(N)N XEPSCVXTCUUHDT-AVGNSLFASA-N 0.000 description 2
- NABSCJGZKWSNHX-RCWTZXSCSA-N Arg-Arg-Thr Chemical compound NC(N)=NCCC[C@@H](C(=O)N[C@@H]([C@H](O)C)C(O)=O)NC(=O)[C@@H](N)CCCN=C(N)N NABSCJGZKWSNHX-RCWTZXSCSA-N 0.000 description 2
- KWTVWJPNHAOREN-IHRRRGAJSA-N Arg-Asn-Phe Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O KWTVWJPNHAOREN-IHRRRGAJSA-N 0.000 description 2
- PNQWAUXQDBIJDY-GUBZILKMSA-N Arg-Glu-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O PNQWAUXQDBIJDY-GUBZILKMSA-N 0.000 description 2
- PBSOQGZLPFVXPU-YUMQZZPRSA-N Arg-Glu-Gly Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O PBSOQGZLPFVXPU-YUMQZZPRSA-N 0.000 description 2
- AQPVUEJJARLJHB-BQBZGAKWSA-N Arg-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)[C@@H](N)CCCN=C(N)N AQPVUEJJARLJHB-BQBZGAKWSA-N 0.000 description 2
- NOZYDJOPOGKUSR-AVGNSLFASA-N Arg-Leu-Met Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(O)=O NOZYDJOPOGKUSR-AVGNSLFASA-N 0.000 description 2
- CLICCYPMVFGUOF-IHRRRGAJSA-N Arg-Lys-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O CLICCYPMVFGUOF-IHRRRGAJSA-N 0.000 description 2
- WKPXXXUSUHAXDE-SRVKXCTJSA-N Arg-Pro-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCN=C(N)N)C(O)=O WKPXXXUSUHAXDE-SRVKXCTJSA-N 0.000 description 2
- HNJNAMGZQZPSRE-GUBZILKMSA-N Arg-Pro-Asn Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O HNJNAMGZQZPSRE-GUBZILKMSA-N 0.000 description 2
- AWMAZIIEFPFHCP-RCWTZXSCSA-N Arg-Pro-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O AWMAZIIEFPFHCP-RCWTZXSCSA-N 0.000 description 2
- DNLQVHBBMPZUGJ-BQBZGAKWSA-N Arg-Ser-Gly Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O DNLQVHBBMPZUGJ-BQBZGAKWSA-N 0.000 description 2
- ISVACHFCVRKIDG-SRVKXCTJSA-N Arg-Val-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O ISVACHFCVRKIDG-SRVKXCTJSA-N 0.000 description 2
- GLWFAWNYGWBMOC-SRVKXCTJSA-N Asn-Leu-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O GLWFAWNYGWBMOC-SRVKXCTJSA-N 0.000 description 2
- QXOPPIDJKPEKCW-GUBZILKMSA-N Asn-Pro-Arg Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CC(=O)N)N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O QXOPPIDJKPEKCW-GUBZILKMSA-N 0.000 description 2
- VHQSGALUSWIYOD-QXEWZRGKSA-N Asn-Pro-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O VHQSGALUSWIYOD-QXEWZRGKSA-N 0.000 description 2
- PQKSVQSMTHPRIB-ZKWXMUAHSA-N Asn-Val-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O PQKSVQSMTHPRIB-ZKWXMUAHSA-N 0.000 description 2
- GHODABZPVZMWCE-FXQIFTODSA-N Asp-Glu-Glu Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O GHODABZPVZMWCE-FXQIFTODSA-N 0.000 description 2
- GPPIDDWYKJPRES-YDHLFZDLSA-N Asp-Phe-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(O)=O GPPIDDWYKJPRES-YDHLFZDLSA-N 0.000 description 2
- DINOVZWPTMGSRF-QXEWZRGKSA-N Asp-Pro-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O DINOVZWPTMGSRF-QXEWZRGKSA-N 0.000 description 2
- QSFHZPQUAAQHAQ-CIUDSAMLSA-N Asp-Ser-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O QSFHZPQUAAQHAQ-CIUDSAMLSA-N 0.000 description 2
- BYLPQJAWXJWUCJ-YDHLFZDLSA-N Asp-Tyr-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(O)=O BYLPQJAWXJWUCJ-YDHLFZDLSA-N 0.000 description 2
- XMKXONRMGJXCJV-LAEOZQHASA-N Asp-Val-Glu Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O XMKXONRMGJXCJV-LAEOZQHASA-N 0.000 description 2
- GIKOVDMXBAFXDF-NHCYSSNCSA-N Asp-Val-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O GIKOVDMXBAFXDF-NHCYSSNCSA-N 0.000 description 2
- 101100505161 Caenorhabditis elegans mel-32 gene Proteins 0.000 description 2
- GSNRZJNHMVMOFV-ACZMJKKPSA-N Cys-Asp-Glu Chemical compound C(CC(=O)O)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CS)N GSNRZJNHMVMOFV-ACZMJKKPSA-N 0.000 description 2
- HYKFOHGZGLOCAY-ZLUOBGJFSA-N Cys-Cys-Ala Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CS)C(=O)N[C@@H](C)C(O)=O HYKFOHGZGLOCAY-ZLUOBGJFSA-N 0.000 description 2
- SKSJPIBFNFPTJB-NKWVEPMBSA-N Cys-Gly-Pro Chemical compound C1C[C@@H](N(C1)C(=O)CNC(=O)[C@H](CS)N)C(=O)O SKSJPIBFNFPTJB-NKWVEPMBSA-N 0.000 description 2
- ABLJDBFJPUWQQB-DCAQKATOSA-N Cys-Leu-Arg Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](CS)N ABLJDBFJPUWQQB-DCAQKATOSA-N 0.000 description 2
- 241000282326 Felis catus Species 0.000 description 2
- OYTPNWYZORARHL-XHNCKOQMSA-N Gln-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)N)N OYTPNWYZORARHL-XHNCKOQMSA-N 0.000 description 2
- PONUFVLSGMQFAI-AVGNSLFASA-N Gln-Asn-Phe Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O PONUFVLSGMQFAI-AVGNSLFASA-N 0.000 description 2
- WLODHVXYKYHLJD-ACZMJKKPSA-N Gln-Asp-Ser Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CO)C(=O)O)N WLODHVXYKYHLJD-ACZMJKKPSA-N 0.000 description 2
- IVCOYUURLWQDJQ-LPEHRKFASA-N Gln-Gln-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCC(=O)N)N)C(=O)O IVCOYUURLWQDJQ-LPEHRKFASA-N 0.000 description 2
- UESYBOXFJWJVSB-AVGNSLFASA-N Gln-Phe-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O UESYBOXFJWJVSB-AVGNSLFASA-N 0.000 description 2
- QGWXAMDECCKGRU-XVKPBYJWSA-N Gln-Val-Gly Chemical compound CC(C)[C@H](NC(=O)[C@@H](N)CCC(N)=O)C(=O)NCC(O)=O QGWXAMDECCKGRU-XVKPBYJWSA-N 0.000 description 2
- ITYRYNUZHPNCIK-GUBZILKMSA-N Glu-Ala-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O ITYRYNUZHPNCIK-GUBZILKMSA-N 0.000 description 2
- MXOODARRORARSU-ACZMJKKPSA-N Glu-Ala-Ser Chemical compound C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CCC(=O)O)N MXOODARRORARSU-ACZMJKKPSA-N 0.000 description 2
- NCWOMXABNYEPLY-NRPADANISA-N Glu-Ala-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O NCWOMXABNYEPLY-NRPADANISA-N 0.000 description 2
- KKCUFHUTMKQQCF-SRVKXCTJSA-N Glu-Arg-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O KKCUFHUTMKQQCF-SRVKXCTJSA-N 0.000 description 2
- VAIWPXWHWAPYDF-FXQIFTODSA-N Glu-Asp-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O VAIWPXWHWAPYDF-FXQIFTODSA-N 0.000 description 2
- JVSBYEDSSRZQGV-GUBZILKMSA-N Glu-Asp-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CCC(O)=O JVSBYEDSSRZQGV-GUBZILKMSA-N 0.000 description 2
- GFLQTABMFBXRIY-GUBZILKMSA-N Glu-Gln-Arg Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O GFLQTABMFBXRIY-GUBZILKMSA-N 0.000 description 2
- PVBBEKPHARMPHX-DCAQKATOSA-N Glu-Gln-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CCC(O)=O PVBBEKPHARMPHX-DCAQKATOSA-N 0.000 description 2
- PHONAZGUEGIOEM-GLLZPBPUSA-N Glu-Glu-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PHONAZGUEGIOEM-GLLZPBPUSA-N 0.000 description 2
- OAGVHWYIBZMWLA-YFKPBYRVSA-N Glu-Gly-Gly Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)NCC(O)=O OAGVHWYIBZMWLA-YFKPBYRVSA-N 0.000 description 2
- LRPXYSGPOBVBEH-IUCAKERBSA-N Glu-Gly-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O LRPXYSGPOBVBEH-IUCAKERBSA-N 0.000 description 2
- HILMIYALTUQTRC-XVKPBYJWSA-N Glu-Gly-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O HILMIYALTUQTRC-XVKPBYJWSA-N 0.000 description 2
- IVGJYOOGJLFKQE-AVGNSLFASA-N Glu-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)O)N IVGJYOOGJLFKQE-AVGNSLFASA-N 0.000 description 2
- NJCALAAIGREHDR-WDCWCFNPSA-N Glu-Leu-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O NJCALAAIGREHDR-WDCWCFNPSA-N 0.000 description 2
- PMSMKNYRZCKVMC-DRZSPHRISA-N Glu-Phe-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](CCC(=O)O)N PMSMKNYRZCKVMC-DRZSPHRISA-N 0.000 description 2
- LPHGXOWFAXFCPX-KKUMJFAQSA-N Glu-Pro-Phe Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CCC(=O)O)N)C(=O)N[C@@H](CC2=CC=CC=C2)C(=O)O LPHGXOWFAXFCPX-KKUMJFAQSA-N 0.000 description 2
- UMZHHILWZBFPGL-LOKLDPHHSA-N Glu-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)O)N)O UMZHHILWZBFPGL-LOKLDPHHSA-N 0.000 description 2
- CAQXJMUDOLSBPF-SUSMZKCASA-N Glu-Thr-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O CAQXJMUDOLSBPF-SUSMZKCASA-N 0.000 description 2
- WGYHAAXZWPEBDQ-IFFSRLJSSA-N Glu-Val-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O WGYHAAXZWPEBDQ-IFFSRLJSSA-N 0.000 description 2
- PUUYVMYCMIWHFE-BQBZGAKWSA-N Gly-Ala-Arg Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N PUUYVMYCMIWHFE-BQBZGAKWSA-N 0.000 description 2
- LJPIRKICOISLKN-WHFBIAKZSA-N Gly-Ala-Ser Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O LJPIRKICOISLKN-WHFBIAKZSA-N 0.000 description 2
- CLODWIOAKCSBAN-BQBZGAKWSA-N Gly-Arg-Asp Chemical compound NC(N)=NCCC[C@H](NC(=O)CN)C(=O)N[C@@H](CC(O)=O)C(O)=O CLODWIOAKCSBAN-BQBZGAKWSA-N 0.000 description 2
- JPXNYFOHTHSREU-UWVGGRQHSA-N Gly-Arg-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)CN JPXNYFOHTHSREU-UWVGGRQHSA-N 0.000 description 2
- OLPPXYMMIARYAL-QMMMGPOBSA-N Gly-Gly-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)CNC(=O)CN OLPPXYMMIARYAL-QMMMGPOBSA-N 0.000 description 2
- FSPVILZGHUJOHS-QWRGUYRKSA-N Gly-His-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CNC=N1 FSPVILZGHUJOHS-QWRGUYRKSA-N 0.000 description 2
- DBJYVKDPGIFXFO-BQBZGAKWSA-N Gly-Met-Ala Chemical compound [H]NCC(=O)N[C@@H](CCSC)C(=O)N[C@@H](C)C(O)=O DBJYVKDPGIFXFO-BQBZGAKWSA-N 0.000 description 2
- FJWSJWACLMTDMI-WPRPVWTQSA-N Gly-Met-Val Chemical compound [H]NCC(=O)N[C@@H](CCSC)C(=O)N[C@@H](C(C)C)C(O)=O FJWSJWACLMTDMI-WPRPVWTQSA-N 0.000 description 2
- GGAPHLIUUTVYMX-QWRGUYRKSA-N Gly-Phe-Ser Chemical compound OC[C@@H](C([O-])=O)NC(=O)[C@@H](NC(=O)C[NH3+])CC1=CC=CC=C1 GGAPHLIUUTVYMX-QWRGUYRKSA-N 0.000 description 2
- GGLIDLCEPDHEJO-BQBZGAKWSA-N Gly-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)CN GGLIDLCEPDHEJO-BQBZGAKWSA-N 0.000 description 2
- ZZWUYQXMIFTIIY-WEDXCCLWSA-N Gly-Thr-Leu Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O ZZWUYQXMIFTIIY-WEDXCCLWSA-N 0.000 description 2
- NWOSHVVPKDQKKT-RYUDHWBXSA-N Gly-Tyr-Gln Chemical compound [H]NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O NWOSHVVPKDQKKT-RYUDHWBXSA-N 0.000 description 2
- GWCJMBNBFYBQCV-XPUUQOCRSA-N Gly-Val-Ala Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O GWCJMBNBFYBQCV-XPUUQOCRSA-N 0.000 description 2
- DNVDEMWIYLVIQU-RCOVLWMOSA-N Gly-Val-Asp Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O DNVDEMWIYLVIQU-RCOVLWMOSA-N 0.000 description 2
- NYHBQMYGNKIUIF-UUOKFMHZSA-N Guanosine Chemical compound C1=NC=2C(=O)NC(N)=NC=2N1[C@@H]1O[C@H](CO)[C@@H](O)[C@H]1O NYHBQMYGNKIUIF-UUOKFMHZSA-N 0.000 description 2
- HTZKFIYQMHJWSQ-INTQDDNPSA-N His-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CN=CN2)N HTZKFIYQMHJWSQ-INTQDDNPSA-N 0.000 description 2
- ZPVJJPAIUZLSNE-DCAQKATOSA-N His-Arg-Ser Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O ZPVJJPAIUZLSNE-DCAQKATOSA-N 0.000 description 2
- UROVZOUMHNXPLZ-AVGNSLFASA-N His-Leu-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CN=CN1 UROVZOUMHNXPLZ-AVGNSLFASA-N 0.000 description 2
- VCBWXASUBZIFLQ-IHRRRGAJSA-N His-Pro-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O VCBWXASUBZIFLQ-IHRRRGAJSA-N 0.000 description 2
- TZCGZYWNIDZZMR-UHFFFAOYSA-N Ile-Arg-Ala Natural products CCC(C)C(N)C(=O)NC(C(=O)NC(C)C(O)=O)CCCN=C(N)N TZCGZYWNIDZZMR-UHFFFAOYSA-N 0.000 description 2
- DBXXASNNDTXOLU-MXAVVETBSA-N Ile-Leu-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N DBXXASNNDTXOLU-MXAVVETBSA-N 0.000 description 2
- FHPZJWJWTWZKNA-LLLHUVSDSA-N Ile-Phe-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N2CCC[C@@H]2C(=O)O)N FHPZJWJWTWZKNA-LLLHUVSDSA-N 0.000 description 2
- YCKPUHHMCFSUMD-IUKAMOBKSA-N Ile-Thr-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N YCKPUHHMCFSUMD-IUKAMOBKSA-N 0.000 description 2
- PMGDADKJMCOXHX-UHFFFAOYSA-N L-Arginyl-L-glutamin-acetat Natural products NC(=N)NCCCC(N)C(=O)NC(CCC(N)=O)C(O)=O PMGDADKJMCOXHX-UHFFFAOYSA-N 0.000 description 2
- LJHGALIOHLRRQN-DCAQKATOSA-N Leu-Ala-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N LJHGALIOHLRRQN-DCAQKATOSA-N 0.000 description 2
- WSGXUIQTEZDVHJ-GARJFASQSA-N Leu-Ala-Pro Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N1CCC[C@@H]1C(O)=O WSGXUIQTEZDVHJ-GARJFASQSA-N 0.000 description 2
- NTRAGDHVSGKUSF-AVGNSLFASA-N Leu-Arg-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O NTRAGDHVSGKUSF-AVGNSLFASA-N 0.000 description 2
- FIJMQLGQLBLBOL-HJGDQZAQSA-N Leu-Asn-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O FIJMQLGQLBLBOL-HJGDQZAQSA-N 0.000 description 2
- BPANDPNDMJHFEV-CIUDSAMLSA-N Leu-Asp-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O BPANDPNDMJHFEV-CIUDSAMLSA-N 0.000 description 2
- ILJREDZFPHTUIE-GUBZILKMSA-N Leu-Asp-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O ILJREDZFPHTUIE-GUBZILKMSA-N 0.000 description 2
- KAFOIVJDVSZUMD-UHFFFAOYSA-N Leu-Gln-Gln Natural products CC(C)CC(N)C(=O)NC(CCC(N)=O)C(=O)NC(CCC(N)=O)C(O)=O KAFOIVJDVSZUMD-UHFFFAOYSA-N 0.000 description 2
- DZQMXBALGUHGJT-GUBZILKMSA-N Leu-Glu-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O DZQMXBALGUHGJT-GUBZILKMSA-N 0.000 description 2
- DSFYPIUSAMSERP-IHRRRGAJSA-N Leu-Leu-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N DSFYPIUSAMSERP-IHRRRGAJSA-N 0.000 description 2
- JNDYEOUZBLOVOF-AVGNSLFASA-N Leu-Leu-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O JNDYEOUZBLOVOF-AVGNSLFASA-N 0.000 description 2
- QNBVTHNJGCOVFA-AVGNSLFASA-N Leu-Leu-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCC(O)=O QNBVTHNJGCOVFA-AVGNSLFASA-N 0.000 description 2
- FAELBUXXFQLUAX-AJNGGQMLSA-N Leu-Leu-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC(C)C FAELBUXXFQLUAX-AJNGGQMLSA-N 0.000 description 2
- RXGLHDWAZQECBI-SRVKXCTJSA-N Leu-Leu-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O RXGLHDWAZQECBI-SRVKXCTJSA-N 0.000 description 2
- UCNNZELZXFXXJQ-BZSNNMDCSA-N Leu-Leu-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 UCNNZELZXFXXJQ-BZSNNMDCSA-N 0.000 description 2
- RZXLZBIUTDQHJQ-SRVKXCTJSA-N Leu-Lys-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O RZXLZBIUTDQHJQ-SRVKXCTJSA-N 0.000 description 2
- PTRKPHUGYULXPU-KKUMJFAQSA-N Leu-Phe-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O PTRKPHUGYULXPU-KKUMJFAQSA-N 0.000 description 2
- CHJKEDSZNSONPS-DCAQKATOSA-N Leu-Pro-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O CHJKEDSZNSONPS-DCAQKATOSA-N 0.000 description 2
- AKVBOOKXVAMKSS-GUBZILKMSA-N Leu-Ser-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(O)=O AKVBOOKXVAMKSS-GUBZILKMSA-N 0.000 description 2
- DAYQSYGBCUKVKT-VOAKCMCISA-N Leu-Thr-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(O)=O DAYQSYGBCUKVKT-VOAKCMCISA-N 0.000 description 2
- OZTZJMUZVAVJGY-BZSNNMDCSA-N Leu-Tyr-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N OZTZJMUZVAVJGY-BZSNNMDCSA-N 0.000 description 2
- AXVIGSRGTMNSJU-YESZJQIVSA-N Leu-Tyr-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N2CCC[C@@H]2C(=O)O)N AXVIGSRGTMNSJU-YESZJQIVSA-N 0.000 description 2
- FBNPMTNBFFAMMH-AVGNSLFASA-N Leu-Val-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N FBNPMTNBFFAMMH-AVGNSLFASA-N 0.000 description 2
- FBNPMTNBFFAMMH-UHFFFAOYSA-N Leu-Val-Arg Natural products CC(C)CC(N)C(=O)NC(C(C)C)C(=O)NC(C(O)=O)CCCN=C(N)N FBNPMTNBFFAMMH-UHFFFAOYSA-N 0.000 description 2
- CGHXMODRYJISSK-NHCYSSNCSA-N Leu-Val-Asp Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC(O)=O CGHXMODRYJISSK-NHCYSSNCSA-N 0.000 description 2
- AAKRWBIIGKPOKQ-ONGXEEELSA-N Leu-Val-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O AAKRWBIIGKPOKQ-ONGXEEELSA-N 0.000 description 2
- VKVDRTGWLVZJOM-DCAQKATOSA-N Leu-Val-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O VKVDRTGWLVZJOM-DCAQKATOSA-N 0.000 description 2
- XFIHDSBIPWEYJJ-YUMQZZPRSA-N Lys-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H](N)CCCCN XFIHDSBIPWEYJJ-YUMQZZPRSA-N 0.000 description 2
- DNEJSAIMVANNPA-DCAQKATOSA-N Lys-Asn-Arg Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O DNEJSAIMVANNPA-DCAQKATOSA-N 0.000 description 2
- IWWMPCPLFXFBAF-SRVKXCTJSA-N Lys-Asp-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O IWWMPCPLFXFBAF-SRVKXCTJSA-N 0.000 description 2
- MQMIRLVJXQNTRJ-SDDRHHMPSA-N Lys-Gln-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCCCN)N)C(=O)O MQMIRLVJXQNTRJ-SDDRHHMPSA-N 0.000 description 2
- ORVFEGYUJITPGI-IHRRRGAJSA-N Lys-Leu-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CCCCN ORVFEGYUJITPGI-IHRRRGAJSA-N 0.000 description 2
- ZJWIXBZTAAJERF-IHRRRGAJSA-N Lys-Lys-Arg Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CCCN=C(N)N ZJWIXBZTAAJERF-IHRRRGAJSA-N 0.000 description 2
- WBSCNDJQPKSPII-KKUMJFAQSA-N Lys-Lys-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(O)=O WBSCNDJQPKSPII-KKUMJFAQSA-N 0.000 description 2
- LUTDBHBIHHREDC-IHRRRGAJSA-N Lys-Pro-Lys Chemical compound NCCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(O)=O LUTDBHBIHHREDC-IHRRRGAJSA-N 0.000 description 2
- HKXSZKJMDBHOTG-CIUDSAMLSA-N Lys-Ser-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CCCCN HKXSZKJMDBHOTG-CIUDSAMLSA-N 0.000 description 2
- WXHHTBVYQOSYSL-FXQIFTODSA-N Met-Ala-Ser Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O WXHHTBVYQOSYSL-FXQIFTODSA-N 0.000 description 2
- MTBVQFFQMXHCPC-CIUDSAMLSA-N Met-Glu-Asp Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O MTBVQFFQMXHCPC-CIUDSAMLSA-N 0.000 description 2
- QQPMHUCGDRJFQK-RHYQMDGZSA-N Met-Thr-Leu Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(C)C QQPMHUCGDRJFQK-RHYQMDGZSA-N 0.000 description 2
- QYIGOFGUOVTAHK-ZJDVBMNYSA-N Met-Thr-Thr Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O QYIGOFGUOVTAHK-ZJDVBMNYSA-N 0.000 description 2
- AUEJLPRZGVVDNU-UHFFFAOYSA-N N-L-tyrosyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CC1=CC=C(O)C=C1 AUEJLPRZGVVDNU-UHFFFAOYSA-N 0.000 description 2
- 101100068676 Neurospora crassa (strain ATCC 24698 / 74-OR23-1A / CBS 708.71 / DSM 1257 / FGSC 987) gln-1 gene Proteins 0.000 description 2
- OJUMUUXGSXUZJZ-SRVKXCTJSA-N Phe-Asp-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O OJUMUUXGSXUZJZ-SRVKXCTJSA-N 0.000 description 2
- MGBRZXXGQBAULP-DRZSPHRISA-N Phe-Glu-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 MGBRZXXGQBAULP-DRZSPHRISA-N 0.000 description 2
- BFYHIHGIHGROAT-HTUGSXCWSA-N Phe-Glu-Thr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O BFYHIHGIHGROAT-HTUGSXCWSA-N 0.000 description 2
- SFKOEHXABNPLRT-KBPBESRZSA-N Phe-His-Gly Chemical compound N[C@@H](Cc1ccccc1)C(=O)N[C@@H](Cc1cnc[nH]1)C(=O)NCC(O)=O SFKOEHXABNPLRT-KBPBESRZSA-N 0.000 description 2
- SMFGCTXUBWEPKM-KBPBESRZSA-N Phe-Leu-Gly Chemical compound OC(=O)CNC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 SMFGCTXUBWEPKM-KBPBESRZSA-N 0.000 description 2
- YTILBRIUASDGBL-BZSNNMDCSA-N Phe-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 YTILBRIUASDGBL-BZSNNMDCSA-N 0.000 description 2
- OSBADCBXAMSPQD-YESZJQIVSA-N Phe-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CC=CC=C2)N OSBADCBXAMSPQD-YESZJQIVSA-N 0.000 description 2
- YCCUXNNKXDGMAM-KKUMJFAQSA-N Phe-Leu-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O YCCUXNNKXDGMAM-KKUMJFAQSA-N 0.000 description 2
- KLXQWABNAWDRAY-ACRUOGEOSA-N Phe-Lys-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 KLXQWABNAWDRAY-ACRUOGEOSA-N 0.000 description 2
- AXIOGMQCDYVTNY-ACRUOGEOSA-N Phe-Phe-Leu Chemical compound C([C@@H](C(=O)N[C@@H](CC(C)C)C(O)=O)NC(=O)[C@@H](N)CC=1C=CC=CC=1)C1=CC=CC=C1 AXIOGMQCDYVTNY-ACRUOGEOSA-N 0.000 description 2
- ZLAKUZDMKVKFAI-JYJNAYRXSA-N Phe-Pro-Val Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O ZLAKUZDMKVKFAI-JYJNAYRXSA-N 0.000 description 2
- UNBFGVQVQGXXCK-KKUMJFAQSA-N Phe-Ser-Leu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O UNBFGVQVQGXXCK-KKUMJFAQSA-N 0.000 description 2
- DZZCICYRSZASNF-FXQIFTODSA-N Pro-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1 DZZCICYRSZASNF-FXQIFTODSA-N 0.000 description 2
- CGBYDGAJHSOGFQ-LPEHRKFASA-N Pro-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@@H]2CCCN2 CGBYDGAJHSOGFQ-LPEHRKFASA-N 0.000 description 2
- UVKNEILZSJMKSR-FXQIFTODSA-N Pro-Asn-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H]1CCCN1 UVKNEILZSJMKSR-FXQIFTODSA-N 0.000 description 2
- PULPZRAHVFBVTO-DCAQKATOSA-N Pro-Glu-Arg Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O PULPZRAHVFBVTO-DCAQKATOSA-N 0.000 description 2
- QCARZLHECSFOGG-CIUDSAMLSA-N Pro-Glu-Cys Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CS)C(=O)O QCARZLHECSFOGG-CIUDSAMLSA-N 0.000 description 2
- HAAQQNHQZBOWFO-LURJTMIESA-N Pro-Gly-Gly Chemical compound OC(=O)CNC(=O)CNC(=O)[C@@H]1CCCN1 HAAQQNHQZBOWFO-LURJTMIESA-N 0.000 description 2
- XQSREVQDGCPFRJ-STQMWFEESA-N Pro-Gly-Phe Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O XQSREVQDGCPFRJ-STQMWFEESA-N 0.000 description 2
- DXTOOBDIIAJZBJ-BQBZGAKWSA-N Pro-Gly-Ser Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](CO)C(O)=O DXTOOBDIIAJZBJ-BQBZGAKWSA-N 0.000 description 2
- FMLRRBDLBJLJIK-DCAQKATOSA-N Pro-Leu-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H]1CCCN1 FMLRRBDLBJLJIK-DCAQKATOSA-N 0.000 description 2
- HFNPOYOKIPGAEI-SRVKXCTJSA-N Pro-Leu-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H]1CCCN1 HFNPOYOKIPGAEI-SRVKXCTJSA-N 0.000 description 2
- DWGFLKQSGRUQTI-IHRRRGAJSA-N Pro-Lys-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H]1CCCN1 DWGFLKQSGRUQTI-IHRRRGAJSA-N 0.000 description 2
- WOIFYRZPIORBRY-AVGNSLFASA-N Pro-Lys-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O WOIFYRZPIORBRY-AVGNSLFASA-N 0.000 description 2
- DCHQYSOGURGJST-FJXKBIBVSA-N Pro-Thr-Gly Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O DCHQYSOGURGJST-FJXKBIBVSA-N 0.000 description 2
- 230000018199 S phase Effects 0.000 description 2
- ZUGXSSFMTXKHJS-ZLUOBGJFSA-N Ser-Ala-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O ZUGXSSFMTXKHJS-ZLUOBGJFSA-N 0.000 description 2
- MMGJPDWSIOAGTH-ACZMJKKPSA-N Ser-Ala-Gln Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(O)=O MMGJPDWSIOAGTH-ACZMJKKPSA-N 0.000 description 2
- HRNQLKCLPVKZNE-CIUDSAMLSA-N Ser-Ala-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O HRNQLKCLPVKZNE-CIUDSAMLSA-N 0.000 description 2
- GXXTUIUYTWGPMV-FXQIFTODSA-N Ser-Arg-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(O)=O GXXTUIUYTWGPMV-FXQIFTODSA-N 0.000 description 2
- NLQUOHDCLSFABG-GUBZILKMSA-N Ser-Arg-Arg Chemical compound NC(N)=NCCC[C@H](NC(=O)[C@H](CO)N)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O NLQUOHDCLSFABG-GUBZILKMSA-N 0.000 description 2
- QEDMOZUJTGEIBF-FXQIFTODSA-N Ser-Arg-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O QEDMOZUJTGEIBF-FXQIFTODSA-N 0.000 description 2
- HQTKVSCNCDLXSX-BQBZGAKWSA-N Ser-Arg-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O HQTKVSCNCDLXSX-BQBZGAKWSA-N 0.000 description 2
- WDXYVIIVDIDOSX-DCAQKATOSA-N Ser-Arg-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CO)CCCN=C(N)N WDXYVIIVDIDOSX-DCAQKATOSA-N 0.000 description 2
- YMTLKLXDFCSCNX-BYPYZUCNSA-N Ser-Gly-Gly Chemical compound OC[C@H](N)C(=O)NCC(=O)NCC(O)=O YMTLKLXDFCSCNX-BYPYZUCNSA-N 0.000 description 2
- FUMGHWDRRFCKEP-CIUDSAMLSA-N Ser-Leu-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O FUMGHWDRRFCKEP-CIUDSAMLSA-N 0.000 description 2
- QYSFWUIXDFJUDW-DCAQKATOSA-N Ser-Leu-Arg Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O QYSFWUIXDFJUDW-DCAQKATOSA-N 0.000 description 2
- YUJLIIRMIAGMCQ-CIUDSAMLSA-N Ser-Leu-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O YUJLIIRMIAGMCQ-CIUDSAMLSA-N 0.000 description 2
- QJKPECIAWNNKIT-KKUMJFAQSA-N Ser-Lys-Tyr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O QJKPECIAWNNKIT-KKUMJFAQSA-N 0.000 description 2
- PJIQEIFXZPCWOJ-FXQIFTODSA-N Ser-Pro-Asp Chemical compound [H]N[C@@H](CO)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O PJIQEIFXZPCWOJ-FXQIFTODSA-N 0.000 description 2
- WUXCHQZLUHBSDJ-LKXGYXEUSA-N Ser-Thr-Asp Chemical compound OC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CC(O)=O)C(O)=O WUXCHQZLUHBSDJ-LKXGYXEUSA-N 0.000 description 2
- 101710172711 Structural protein Proteins 0.000 description 2
- BSNZTJXVDOINSR-JXUBOQSCSA-N Thr-Ala-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O BSNZTJXVDOINSR-JXUBOQSCSA-N 0.000 description 2
- CAJFZCICSVBOJK-SHGPDSBTSA-N Thr-Ala-Thr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O CAJFZCICSVBOJK-SHGPDSBTSA-N 0.000 description 2
- LGNBRHZANHMZHK-NUMRIWBASA-N Thr-Glu-Asp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N)O LGNBRHZANHMZHK-NUMRIWBASA-N 0.000 description 2
- RRRRCRYTLZVCEN-HJGDQZAQSA-N Thr-Leu-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O RRRRCRYTLZVCEN-HJGDQZAQSA-N 0.000 description 2
- KZSYAEWQMJEGRZ-RHYQMDGZSA-N Thr-Leu-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O KZSYAEWQMJEGRZ-RHYQMDGZSA-N 0.000 description 2
- LHNNQVXITHUCAB-QTKMDUPCSA-N Thr-Met-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N)O LHNNQVXITHUCAB-QTKMDUPCSA-N 0.000 description 2
- MXNAOGFNFNKUPD-JHYOHUSXSA-N Thr-Phe-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MXNAOGFNFNKUPD-JHYOHUSXSA-N 0.000 description 2
- NDZYTIMDOZMECO-SHGPDSBTSA-N Thr-Thr-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O NDZYTIMDOZMECO-SHGPDSBTSA-N 0.000 description 2
- ILUOMMDDGREELW-OSUNSFLBSA-N Thr-Val-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)[C@@H](C)O ILUOMMDDGREELW-OSUNSFLBSA-N 0.000 description 2
- HTHCZRWCFXMENJ-KKUMJFAQSA-N Tyr-Arg-Glu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O HTHCZRWCFXMENJ-KKUMJFAQSA-N 0.000 description 2
- WDGDKHLSDIOXQC-ACRUOGEOSA-N Tyr-Leu-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=C(O)C=C1 WDGDKHLSDIOXQC-ACRUOGEOSA-N 0.000 description 2
- DRTQHJPVMGBUCF-XVFCMESISA-N Uridine Chemical compound O[C@@H]1[C@H](O)[C@@H](CO)O[C@H]1N1C(=O)NC(=O)C=C1 DRTQHJPVMGBUCF-XVFCMESISA-N 0.000 description 2
- DDRBQONWVBDQOY-GUBZILKMSA-N Val-Ala-Arg Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O DDRBQONWVBDQOY-GUBZILKMSA-N 0.000 description 2
- REJBPZVUHYNMEN-LSJOCFKGSA-N Val-Ala-His Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](C(C)C)N REJBPZVUHYNMEN-LSJOCFKGSA-N 0.000 description 2
- AZSHAZJLOZQYAY-FXQIFTODSA-N Val-Ala-Ser Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O AZSHAZJLOZQYAY-FXQIFTODSA-N 0.000 description 2
- COYSIHFOCOMGCF-UHFFFAOYSA-N Val-Arg-Gly Natural products CC(C)C(N)C(=O)NC(C(=O)NCC(O)=O)CCCN=C(N)N COYSIHFOCOMGCF-UHFFFAOYSA-N 0.000 description 2
- CELJCNRXKZPTCX-XPUUQOCRSA-N Val-Gly-Ala Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@@H](C)C(O)=O CELJCNRXKZPTCX-XPUUQOCRSA-N 0.000 description 2
- PIFJAFRUVWZRKR-QMMMGPOBSA-N Val-Gly-Gly Chemical compound CC(C)[C@H]([NH3+])C(=O)NCC(=O)NCC([O-])=O PIFJAFRUVWZRKR-QMMMGPOBSA-N 0.000 description 2
- WJVLTYSHNXRCLT-NHCYSSNCSA-N Val-His-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CC(=O)O)C(=O)O)N WJVLTYSHNXRCLT-NHCYSSNCSA-N 0.000 description 2
- DJQIUOKSNRBTSV-CYDGBPFRSA-N Val-Ile-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)O)NC(=O)[C@H](C(C)C)N DJQIUOKSNRBTSV-CYDGBPFRSA-N 0.000 description 2
- LYERIXUFCYVFFX-GVXVVHGQSA-N Val-Leu-Glu Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N LYERIXUFCYVFFX-GVXVVHGQSA-N 0.000 description 2
- UMPVMAYCLYMYGA-ONGXEEELSA-N Val-Leu-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O UMPVMAYCLYMYGA-ONGXEEELSA-N 0.000 description 2
- MBGFDZDWMDLXHQ-GUBZILKMSA-N Val-Met-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](C(C)C)N MBGFDZDWMDLXHQ-GUBZILKMSA-N 0.000 description 2
- YKNOJPJWNVHORX-UNQGMJICSA-N Val-Phe-Thr Chemical compound CC(C)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H]([C@@H](C)O)C(O)=O)CC1=CC=CC=C1 YKNOJPJWNVHORX-UNQGMJICSA-N 0.000 description 2
- DOFAQXCYFQKSHT-SRVKXCTJSA-N Val-Pro-Pro Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 DOFAQXCYFQKSHT-SRVKXCTJSA-N 0.000 description 2
- SSYBNWFXCFNRFN-GUBZILKMSA-N Val-Pro-Ser Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O SSYBNWFXCFNRFN-GUBZILKMSA-N 0.000 description 2
- VHIZXDZMTDVFGX-DCAQKATOSA-N Val-Ser-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](C(C)C)N VHIZXDZMTDVFGX-DCAQKATOSA-N 0.000 description 2
- GUIYPEKUEMQBIK-JSGCOSHPSA-N Val-Tyr-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](Cc1ccc(O)cc1)C(=O)NCC(O)=O GUIYPEKUEMQBIK-JSGCOSHPSA-N 0.000 description 2
- RTJPAGFXOWEBAI-SRVKXCTJSA-N Val-Val-Arg Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N RTJPAGFXOWEBAI-SRVKXCTJSA-N 0.000 description 2
- ZLNYBMWGPOKSLW-LSJOCFKGSA-N Val-Val-Asp Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O ZLNYBMWGPOKSLW-LSJOCFKGSA-N 0.000 description 2
- 108010067390 Viral Proteins Proteins 0.000 description 2
- 108010024078 alanyl-glycyl-serine Proteins 0.000 description 2
- 108010086434 alanyl-seryl-glycine Proteins 0.000 description 2
- 108010011559 alanylphenylalanine Proteins 0.000 description 2
- 108010091092 arginyl-glycyl-proline Proteins 0.000 description 2
- 108010029539 arginyl-prolyl-proline Proteins 0.000 description 2
- 108010018691 arginyl-threonyl-arginine Proteins 0.000 description 2
- 108010062796 arginyllysine Proteins 0.000 description 2
- 230000001580 bacterial effect Effects 0.000 description 2
- 210000000234 capsid Anatomy 0.000 description 2
- 108010004073 cysteinylcysteine Proteins 0.000 description 2
- 238000012217 deletion Methods 0.000 description 2
- 230000037430 deletion Effects 0.000 description 2
- 238000004520 electroporation Methods 0.000 description 2
- 238000005516 engineering process Methods 0.000 description 2
- 230000002255 enzymatic effect Effects 0.000 description 2
- 238000001415 gene therapy Methods 0.000 description 2
- 108010057083 glutamyl-aspartyl-leucine Proteins 0.000 description 2
- JYPCXBJRLBHWME-UHFFFAOYSA-N glycyl-L-prolyl-L-arginine Natural products NCC(=O)N1CCCC1C(=O)NC(CCCN=C(N)N)C(O)=O JYPCXBJRLBHWME-UHFFFAOYSA-N 0.000 description 2
- 108010090037 glycyl-alanyl-isoleucine Proteins 0.000 description 2
- 108010027668 glycyl-alanyl-valine Proteins 0.000 description 2
- 108010072405 glycyl-aspartyl-glycine Proteins 0.000 description 2
- 108010089804 glycyl-threonine Proteins 0.000 description 2
- 108010087823 glycyltyrosine Proteins 0.000 description 2
- 108010028295 histidylhistidine Proteins 0.000 description 2
- 108010018006 histidylserine Proteins 0.000 description 2
- 238000000338 in vitro Methods 0.000 description 2
- 238000001727 in vivo Methods 0.000 description 2
- 108010027338 isoleucylcysteine Proteins 0.000 description 2
- 238000011031 large-scale manufacturing process Methods 0.000 description 2
- 108010083708 leucyl-aspartyl-valine Proteins 0.000 description 2
- 108010090333 leucyl-lysyl-proline Proteins 0.000 description 2
- 238000001638 lipofection Methods 0.000 description 2
- 108010016686 methionyl-alanyl-serine Proteins 0.000 description 2
- 125000003835 nucleoside group Chemical group 0.000 description 2
- 238000004806 packaging method and process Methods 0.000 description 2
- 108010024607 phenylalanylalanine Proteins 0.000 description 2
- 108010073101 phenylalanylleucine Proteins 0.000 description 2
- 108010014614 prolyl-glycyl-proline Proteins 0.000 description 2
- 108700042769 prolyl-leucyl-glycine Proteins 0.000 description 2
- 239000000126 substance Substances 0.000 description 2
- 238000003786 synthesis reaction Methods 0.000 description 2
- 108010071097 threonyl-lysyl-proline Proteins 0.000 description 2
- 210000001519 tissue Anatomy 0.000 description 2
- 230000014616 translation Effects 0.000 description 2
- 108010079202 tyrosyl-alanyl-cysteine Proteins 0.000 description 2
- 108010035534 tyrosyl-leucyl-alanine Proteins 0.000 description 2
- 108010051110 tyrosyl-lysine Proteins 0.000 description 2
- 108010003137 tyrosyltyrosine Proteins 0.000 description 2
- GJLXVWOMRRWCIB-MERZOTPQSA-N (2S)-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-acetamido-5-(diaminomethylideneamino)pentanoyl]amino]-3-(4-hydroxyphenyl)propanoyl]amino]-3-(4-hydroxyphenyl)propanoyl]amino]-5-(diaminomethylideneamino)pentanoyl]amino]-3-(1H-indol-3-yl)propanoyl]amino]-6-aminohexanoyl]amino]-6-aminohexanoyl]amino]-6-aminohexanoyl]amino]-6-aminohexanoyl]amino]-6-aminohexanoyl]amino]-6-aminohexanoyl]amino]-6-aminohexanamide Chemical compound C([C@H](NC(=O)[C@H](CCCN=C(N)N)NC(=O)C)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(N)=O)C1=CC=C(O)C=C1 GJLXVWOMRRWCIB-MERZOTPQSA-N 0.000 description 1
- RIFDKYBNWNPCQK-IOSLPCCCSA-N (2r,3s,4r,5r)-2-(hydroxymethyl)-5-(6-imino-3-methylpurin-9-yl)oxolane-3,4-diol Chemical compound C1=2N(C)C=NC(=N)C=2N=CN1[C@@H]1O[C@H](CO)[C@@H](O)[C@H]1O RIFDKYBNWNPCQK-IOSLPCCCSA-N 0.000 description 1
- CWFMWBHMIMNZLN-NAKRPEOUSA-N (2s)-1-[(2s)-2-[[(2s,3s)-2-amino-3-methylpentanoyl]amino]propanoyl]pyrrolidine-2-carboxylic acid Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C)C(=O)N1CCC[C@H]1C(O)=O CWFMWBHMIMNZLN-NAKRPEOUSA-N 0.000 description 1
- BRPMXFSTKXXNHF-IUCAKERBSA-N (2s)-1-[2-[[(2s)-pyrrolidine-2-carbonyl]amino]acetyl]pyrrolidine-2-carboxylic acid Chemical compound OC(=O)[C@@H]1CCCN1C(=O)CNC(=O)[C@H]1NCCC1 BRPMXFSTKXXNHF-IUCAKERBSA-N 0.000 description 1
- NTUPOKHATNSWCY-PMPSAXMXSA-N (2s)-2-[[(2s)-1-[(2r)-2-amino-3-phenylpropanoyl]pyrrolidine-2-carbonyl]amino]-5-(diaminomethylideneamino)pentanoic acid Chemical compound C([C@@H](N)C(=O)N1[C@@H](CCC1)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O)C1=CC=CC=C1 NTUPOKHATNSWCY-PMPSAXMXSA-N 0.000 description 1
- AXFMEGAFCUULFV-BLFANLJRSA-N (2s)-2-[[(2s)-1-[(2s,3r)-2-amino-3-methylpentanoyl]pyrrolidine-2-carbonyl]amino]pentanedioic acid Chemical compound CC[C@@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O AXFMEGAFCUULFV-BLFANLJRSA-N 0.000 description 1
- RKSLVDIXBGWPIS-UAKXSSHOSA-N 1-[(2r,3r,4s,5r)-3,4-dihydroxy-5-(hydroxymethyl)oxolan-2-yl]-5-iodopyrimidine-2,4-dione Chemical compound O[C@@H]1[C@H](O)[C@@H](CO)O[C@H]1N1C(=O)NC(=O)C(I)=C1 RKSLVDIXBGWPIS-UAKXSSHOSA-N 0.000 description 1
- QLOCVMVCRJOTTM-TURQNECASA-N 1-[(2r,3r,4s,5r)-3,4-dihydroxy-5-(hydroxymethyl)oxolan-2-yl]-5-prop-1-ynylpyrimidine-2,4-dione Chemical compound O=C1NC(=O)C(C#CC)=CN1[C@H]1[C@H](O)[C@H](O)[C@@H](CO)O1 QLOCVMVCRJOTTM-TURQNECASA-N 0.000 description 1
- UHDGCWIWMRVCDJ-UHFFFAOYSA-N 1-beta-D-Xylofuranosyl-NH-Cytosine Natural products O=C1N=C(N)C=CN1C1C(O)C(O)C(CO)O1 UHDGCWIWMRVCDJ-UHFFFAOYSA-N 0.000 description 1
- YKBGVTZYEHREMT-KVQBGUIXSA-N 2'-deoxyguanosine Chemical compound C1=NC=2C(=O)NC(N)=NC=2N1[C@H]1C[C@H](O)[C@@H](CO)O1 YKBGVTZYEHREMT-KVQBGUIXSA-N 0.000 description 1
- CKTSBUTUHBMZGZ-SHYZEUOFSA-N 2'-deoxycytidine Chemical compound O=C1N=C(N)C=CN1[C@@H]1O[C@H](CO)[C@@H](O)C1 CKTSBUTUHBMZGZ-SHYZEUOFSA-N 0.000 description 1
- WOJJIRYPFAZEPF-YFKPBYRVSA-N 2-[[(2s)-2-[[2-[(2-azaniumylacetyl)amino]acetyl]amino]propanoyl]amino]acetate Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)CNC(=O)CN WOJJIRYPFAZEPF-YFKPBYRVSA-N 0.000 description 1
- OTEWWRBKGONZBW-UHFFFAOYSA-N 2-[[2-[[2-[(2-azaniumylacetyl)amino]-4-methylpentanoyl]amino]acetyl]amino]acetate Chemical compound NCC(=O)NC(CC(C)C)C(=O)NCC(=O)NCC(O)=O OTEWWRBKGONZBW-UHFFFAOYSA-N 0.000 description 1
- JRYMOPZHXMVHTA-DAGMQNCNSA-N 2-amino-7-[(2r,3r,4s,5r)-3,4-dihydroxy-5-(hydroxymethyl)oxolan-2-yl]-1h-pyrrolo[2,3-d]pyrimidin-4-one Chemical compound C1=CC=2C(=O)NC(N)=NC=2N1[C@@H]1O[C@H](CO)[C@@H](O)[C@H]1O JRYMOPZHXMVHTA-DAGMQNCNSA-N 0.000 description 1
- LMMLLWZHCKCFQA-UGKPPGOTSA-N 4-amino-1-[(2r,3r,4s,5r)-3,4-dihydroxy-5-(hydroxymethyl)-2-prop-1-ynyloxolan-2-yl]pyrimidin-2-one Chemical compound C1=CC(N)=NC(=O)N1[C@]1(C#CC)O[C@H](CO)[C@@H](O)[C@H]1O LMMLLWZHCKCFQA-UGKPPGOTSA-N 0.000 description 1
- XXSIICQLPUAUDF-TURQNECASA-N 4-amino-1-[(2r,3r,4s,5r)-3,4-dihydroxy-5-(hydroxymethyl)oxolan-2-yl]-5-prop-1-ynylpyrimidin-2-one Chemical compound O=C1N=C(N)C(C#CC)=CN1[C@H]1[C@H](O)[C@H](O)[C@@H](CO)O1 XXSIICQLPUAUDF-TURQNECASA-N 0.000 description 1
- ZAYHVCMSTBRABG-UHFFFAOYSA-N 5-Methylcytidine Natural products O=C1N=C(N)C(C)=CN1C1C(O)C(O)C(CO)O1 ZAYHVCMSTBRABG-UHFFFAOYSA-N 0.000 description 1
- FHIDNBAQOFJWCA-UAKXSSHOSA-N 5-fluorouridine Chemical compound O[C@@H]1[C@H](O)[C@@H](CO)O[C@H]1N1C(=O)NC(=O)C(F)=C1 FHIDNBAQOFJWCA-UAKXSSHOSA-N 0.000 description 1
- KDOPAZIWBAHVJB-UHFFFAOYSA-N 5h-pyrrolo[3,2-d]pyrimidine Chemical compound C1=NC=C2NC=CC2=N1 KDOPAZIWBAHVJB-UHFFFAOYSA-N 0.000 description 1
- UEHOMUNTZPIBIL-UUOKFMHZSA-N 6-amino-9-[(2r,3r,4s,5r)-3,4-dihydroxy-5-(hydroxymethyl)oxolan-2-yl]-7h-purin-8-one Chemical compound O=C1NC=2C(N)=NC=NC=2N1[C@@H]1O[C@H](CO)[C@@H](O)[C@H]1O UEHOMUNTZPIBIL-UUOKFMHZSA-N 0.000 description 1
- HCAJQHYUCKICQH-VPENINKCSA-N 8-Oxo-7,8-dihydro-2'-deoxyguanosine Chemical compound C1=2NC(N)=NC(=O)C=2NC(=O)N1[C@H]1C[C@H](O)[C@@H](CO)O1 HCAJQHYUCKICQH-VPENINKCSA-N 0.000 description 1
- HDZZVAMISRMYHH-UHFFFAOYSA-N 9beta-Ribofuranosyl-7-deazaadenin Natural products C1=CC=2C(N)=NC=NC=2N1C1OC(CO)C(O)C1O HDZZVAMISRMYHH-UHFFFAOYSA-N 0.000 description 1
- 241000023308 Acca Species 0.000 description 1
- 108010057856 Adenovirus E2 Proteins Proteins 0.000 description 1
- SBGXWWCLHIOABR-UHFFFAOYSA-N Ala Ala Gly Ala Chemical compound CC(N)C(=O)NC(C)C(=O)NCC(=O)NC(C)C(O)=O SBGXWWCLHIOABR-UHFFFAOYSA-N 0.000 description 1
- HHGYNJRJIINWAK-FXQIFTODSA-N Ala-Ala-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N HHGYNJRJIINWAK-FXQIFTODSA-N 0.000 description 1
- AAQGRPOPTAUUBM-ZLUOBGJFSA-N Ala-Ala-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(O)=O AAQGRPOPTAUUBM-ZLUOBGJFSA-N 0.000 description 1
- DKJPOZOEBONHFS-ZLUOBGJFSA-N Ala-Ala-Asp Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC(O)=O DKJPOZOEBONHFS-ZLUOBGJFSA-N 0.000 description 1
- BUANFPRKJKJSRR-ACZMJKKPSA-N Ala-Ala-Gln Chemical compound C[C@H]([NH3+])C(=O)N[C@@H](C)C(=O)N[C@H](C([O-])=O)CCC(N)=O BUANFPRKJKJSRR-ACZMJKKPSA-N 0.000 description 1
- RLMISHABBKUNFO-WHFBIAKZSA-N Ala-Ala-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O RLMISHABBKUNFO-WHFBIAKZSA-N 0.000 description 1
- LGQPPBQRUBVTIF-JBDRJPRFSA-N Ala-Ala-Ile Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O LGQPPBQRUBVTIF-JBDRJPRFSA-N 0.000 description 1
- KQFRUSHJPKXBMB-BHDSKKPTSA-N Ala-Ala-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](C)NC(=O)[C@@H](N)C)C(O)=O)=CNC2=C1 KQFRUSHJPKXBMB-BHDSKKPTSA-N 0.000 description 1
- ODWSTKXGQGYHSH-FXQIFTODSA-N Ala-Arg-Ala Chemical compound C[C@H](N)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(O)=O ODWSTKXGQGYHSH-FXQIFTODSA-N 0.000 description 1
- SSSROGPPPVTHLX-FXQIFTODSA-N Ala-Arg-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O SSSROGPPPVTHLX-FXQIFTODSA-N 0.000 description 1
- SVBXIUDNTRTKHE-CIUDSAMLSA-N Ala-Arg-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O SVBXIUDNTRTKHE-CIUDSAMLSA-N 0.000 description 1
- IMMKUCQIKKXKNP-DCAQKATOSA-N Ala-Arg-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](C)N)CCCN=C(N)N IMMKUCQIKKXKNP-DCAQKATOSA-N 0.000 description 1
- TTXMOJWKNRJWQJ-FXQIFTODSA-N Ala-Arg-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CCCN=C(N)N TTXMOJWKNRJWQJ-FXQIFTODSA-N 0.000 description 1
- JAMAWBXXKFGFGX-KZVJFYERSA-N Ala-Arg-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O JAMAWBXXKFGFGX-KZVJFYERSA-N 0.000 description 1
- DWINFPQUSSHSFS-UVBJJODRSA-N Ala-Arg-Trp Chemical compound N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CNC2=CC=CC=C12)C(=O)O DWINFPQUSSHSFS-UVBJJODRSA-N 0.000 description 1
- PXKLCFFSVLKOJM-ACZMJKKPSA-N Ala-Asn-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O PXKLCFFSVLKOJM-ACZMJKKPSA-N 0.000 description 1
- HGRBNYQIMKTUNT-XVYDVKMFSA-N Ala-Asn-His Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N HGRBNYQIMKTUNT-XVYDVKMFSA-N 0.000 description 1
- XCVRVWZTXPCYJT-BIIVOSGPSA-N Ala-Asn-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N1CCC[C@@H]1C(=O)O)N XCVRVWZTXPCYJT-BIIVOSGPSA-N 0.000 description 1
- NHCPCLJZRSIDHS-ZLUOBGJFSA-N Ala-Asp-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O NHCPCLJZRSIDHS-ZLUOBGJFSA-N 0.000 description 1
- ZIWWTZWAKYBUOB-CIUDSAMLSA-N Ala-Asp-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O ZIWWTZWAKYBUOB-CIUDSAMLSA-N 0.000 description 1
- BTYTYHBSJKQBQA-GCJQMDKQSA-N Ala-Asp-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](C)N)O BTYTYHBSJKQBQA-GCJQMDKQSA-N 0.000 description 1
- DECCMEWNXSNSDO-ZLUOBGJFSA-N Ala-Cys-Ala Chemical compound C[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@@H](C)C(O)=O DECCMEWNXSNSDO-ZLUOBGJFSA-N 0.000 description 1
- YEELWQSXYBJVSV-UWJYBYFXSA-N Ala-Cys-Tyr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O YEELWQSXYBJVSV-UWJYBYFXSA-N 0.000 description 1
- NKJBKNVQHBZUIX-ACZMJKKPSA-N Ala-Gln-Asp Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O NKJBKNVQHBZUIX-ACZMJKKPSA-N 0.000 description 1
- CZPAHAKGPDUIPJ-CIUDSAMLSA-N Ala-Gln-Pro Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N1CCC[C@H]1C(O)=O CZPAHAKGPDUIPJ-CIUDSAMLSA-N 0.000 description 1
- SFNFGFDRYJKZKN-XQXXSGGOSA-N Ala-Gln-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](C)N)O SFNFGFDRYJKZKN-XQXXSGGOSA-N 0.000 description 1
- YIGLXQRFQVWFEY-NRPADANISA-N Ala-Gln-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O YIGLXQRFQVWFEY-NRPADANISA-N 0.000 description 1
- FUSPCLTUKXQREV-ACZMJKKPSA-N Ala-Glu-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O FUSPCLTUKXQREV-ACZMJKKPSA-N 0.000 description 1
- NWVVKQZOVSTDBQ-CIUDSAMLSA-N Ala-Glu-Arg Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O NWVVKQZOVSTDBQ-CIUDSAMLSA-N 0.000 description 1
- KXEVYGKATAMXJJ-ACZMJKKPSA-N Ala-Glu-Asp Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O KXEVYGKATAMXJJ-ACZMJKKPSA-N 0.000 description 1
- GGNHBHYDMUDXQB-KBIXCLLPSA-N Ala-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@H](C)N GGNHBHYDMUDXQB-KBIXCLLPSA-N 0.000 description 1
- WGDNWOMKBUXFHR-BQBZGAKWSA-N Ala-Gly-Arg Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCN=C(N)N WGDNWOMKBUXFHR-BQBZGAKWSA-N 0.000 description 1
- NHLAEBFGWPXFGI-WHFBIAKZSA-N Ala-Gly-Asn Chemical compound C[C@@H](C(=O)NCC(=O)N[C@@H](CC(=O)N)C(=O)O)N NHLAEBFGWPXFGI-WHFBIAKZSA-N 0.000 description 1
- WMYJZJRILUVVRG-WDSKDSINSA-N Ala-Gly-Gln Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(N)=O WMYJZJRILUVVRG-WDSKDSINSA-N 0.000 description 1
- BEMGNWZECGIJOI-WDSKDSINSA-N Ala-Gly-Glu Chemical compound [H]N[C@@H](C)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O BEMGNWZECGIJOI-WDSKDSINSA-N 0.000 description 1
- LMFXXZPPZDCPTA-ZKWXMUAHSA-N Ala-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@H](C)N LMFXXZPPZDCPTA-ZKWXMUAHSA-N 0.000 description 1
- QHASENCZLDHBGX-ONGXEEELSA-N Ala-Gly-Phe Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 QHASENCZLDHBGX-ONGXEEELSA-N 0.000 description 1
- ZPXCNXMJEZKRLU-LSJOCFKGSA-N Ala-His-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CN=CN1 ZPXCNXMJEZKRLU-LSJOCFKGSA-N 0.000 description 1
- GRPHQEMIFDPKOE-HGNGGELXSA-N Ala-His-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(O)=O)C(O)=O GRPHQEMIFDPKOE-HGNGGELXSA-N 0.000 description 1
- KMGOBAQSCKTBGD-DLOVCJGASA-N Ala-His-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](C)N)CC1=CN=CN1 KMGOBAQSCKTBGD-DLOVCJGASA-N 0.000 description 1
- HUUOZYZWNCXTFK-INTQDDNPSA-N Ala-His-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N2CCC[C@@H]2C(=O)O)N HUUOZYZWNCXTFK-INTQDDNPSA-N 0.000 description 1
- NJWJSLCQEDMGNC-MBLNEYKQSA-N Ala-His-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](C)N)O NJWJSLCQEDMGNC-MBLNEYKQSA-N 0.000 description 1
- PNALXAODQKTNLV-JBDRJPRFSA-N Ala-Ile-Ala Chemical compound C[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O PNALXAODQKTNLV-JBDRJPRFSA-N 0.000 description 1
- IFKQPMZRDQZSHI-GHCJXIJMSA-N Ala-Ile-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O IFKQPMZRDQZSHI-GHCJXIJMSA-N 0.000 description 1
- FOHXUHGZZKETFI-JBDRJPRFSA-N Ala-Ile-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](C)N FOHXUHGZZKETFI-JBDRJPRFSA-N 0.000 description 1
- DVJSJDDYCYSMFR-ZKWXMUAHSA-N Ala-Ile-Gly Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O DVJSJDDYCYSMFR-ZKWXMUAHSA-N 0.000 description 1
- VNYMOTCMNHJGTG-JBDRJPRFSA-N Ala-Ile-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O VNYMOTCMNHJGTG-JBDRJPRFSA-N 0.000 description 1
- LXAARTARZJJCMB-CIQUZCHMSA-N Ala-Ile-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LXAARTARZJJCMB-CIQUZCHMSA-N 0.000 description 1
- ZKEHTYWGPMMGBC-XUXIUFHCSA-N Ala-Leu-Leu-Ser Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O ZKEHTYWGPMMGBC-XUXIUFHCSA-N 0.000 description 1
- OPZJWMJPCNNZNT-DCAQKATOSA-N Ala-Leu-Met Chemical compound C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(=O)O)N OPZJWMJPCNNZNT-DCAQKATOSA-N 0.000 description 1
- OYJCVIGKMXUVKB-GARJFASQSA-N Ala-Leu-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N OYJCVIGKMXUVKB-GARJFASQSA-N 0.000 description 1
- UWIQWPWWZUHBAO-ZLIFDBKOSA-N Ala-Leu-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](NC(=O)[C@H](C)N)CC(C)C)C(O)=O)=CNC2=C1 UWIQWPWWZUHBAO-ZLIFDBKOSA-N 0.000 description 1
- RGQCNKIDEQJEBT-CQDKDKBSSA-N Ala-Leu-Tyr Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 RGQCNKIDEQJEBT-CQDKDKBSSA-N 0.000 description 1
- PMQXMXAASGFUDX-SRVKXCTJSA-N Ala-Lys-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](C)N)CCCCN PMQXMXAASGFUDX-SRVKXCTJSA-N 0.000 description 1
- VCSABYLVNWQYQE-SRVKXCTJSA-N Ala-Lys-Lys Chemical compound NCCCC[C@H](NC(=O)[C@@H](N)C)C(=O)N[C@@H](CCCCN)C(O)=O VCSABYLVNWQYQE-SRVKXCTJSA-N 0.000 description 1
- VCSABYLVNWQYQE-UHFFFAOYSA-N Ala-Lys-Lys Natural products NCCCCC(NC(=O)C(N)C)C(=O)NC(CCCCN)C(O)=O VCSABYLVNWQYQE-UHFFFAOYSA-N 0.000 description 1
- XUCHENWTTBFODJ-FXQIFTODSA-N Ala-Met-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C)C(O)=O XUCHENWTTBFODJ-FXQIFTODSA-N 0.000 description 1
- XSTZMVAYYCJTNR-DCAQKATOSA-N Ala-Met-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(O)=O XSTZMVAYYCJTNR-DCAQKATOSA-N 0.000 description 1
- DGLQWAFPIXDKRL-UBHSHLNASA-N Ala-Met-Phe Chemical compound C[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N DGLQWAFPIXDKRL-UBHSHLNASA-N 0.000 description 1
- AWNAEZICPNGAJK-FXQIFTODSA-N Ala-Met-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CO)C(O)=O AWNAEZICPNGAJK-FXQIFTODSA-N 0.000 description 1
- DHBKYZYFEXXUAK-ONGXEEELSA-N Ala-Phe-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=CC=C1 DHBKYZYFEXXUAK-ONGXEEELSA-N 0.000 description 1
- RUXQNKVQSKOOBS-JURCDPSOSA-N Ala-Phe-Ile Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O RUXQNKVQSKOOBS-JURCDPSOSA-N 0.000 description 1
- RNHKOQHGYMTHFR-UBHSHLNASA-N Ala-Phe-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](C)N)CC1=CC=CC=C1 RNHKOQHGYMTHFR-UBHSHLNASA-N 0.000 description 1
- OSRZOHXQCUFIQG-FPMFFAJLSA-N Ala-Phe-Pro Chemical compound C([C@H](NC(=O)[C@@H]([NH3+])C)C(=O)N1[C@H](CCC1)C([O-])=O)C1=CC=CC=C1 OSRZOHXQCUFIQG-FPMFFAJLSA-N 0.000 description 1
- YCRAFFCYWOUEOF-DLOVCJGASA-N Ala-Phe-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=CC=C1 YCRAFFCYWOUEOF-DLOVCJGASA-N 0.000 description 1
- FQNILRVJOJBFFC-FXQIFTODSA-N Ala-Pro-Asp Chemical compound C[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(=O)O)C(=O)O)N FQNILRVJOJBFFC-FXQIFTODSA-N 0.000 description 1
- IORKCNUBHNIMKY-CIUDSAMLSA-N Ala-Pro-Glu Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O IORKCNUBHNIMKY-CIUDSAMLSA-N 0.000 description 1
- ADSGHMXEAZJJNF-DCAQKATOSA-N Ala-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](C)N ADSGHMXEAZJJNF-DCAQKATOSA-N 0.000 description 1
- BTRULDJUUVGRNE-DCAQKATOSA-N Ala-Pro-Lys Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(O)=O BTRULDJUUVGRNE-DCAQKATOSA-N 0.000 description 1
- OLVCTPPSXNRGKV-GUBZILKMSA-N Ala-Pro-Pro Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 OLVCTPPSXNRGKV-GUBZILKMSA-N 0.000 description 1
- XWFWAXPOLRTDFZ-FXQIFTODSA-N Ala-Pro-Ser Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O XWFWAXPOLRTDFZ-FXQIFTODSA-N 0.000 description 1
- FFZJHQODAYHGPO-KZVJFYERSA-N Ala-Pro-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](C)N FFZJHQODAYHGPO-KZVJFYERSA-N 0.000 description 1
- VJVQKGYHIZPSNS-FXQIFTODSA-N Ala-Ser-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCN=C(N)N VJVQKGYHIZPSNS-FXQIFTODSA-N 0.000 description 1
- AUFACLFHBAGZEN-ZLUOBGJFSA-N Ala-Ser-Cys Chemical compound N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)O AUFACLFHBAGZEN-ZLUOBGJFSA-N 0.000 description 1
- YYAVDNKUWLAFCV-ACZMJKKPSA-N Ala-Ser-Gln Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(O)=O YYAVDNKUWLAFCV-ACZMJKKPSA-N 0.000 description 1
- RTZCUEHYUQZIDE-WHFBIAKZSA-N Ala-Ser-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O RTZCUEHYUQZIDE-WHFBIAKZSA-N 0.000 description 1
- HOVPGJUNRLMIOZ-CIUDSAMLSA-N Ala-Ser-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@H](C)N HOVPGJUNRLMIOZ-CIUDSAMLSA-N 0.000 description 1
- NZGRHTKZFSVPAN-BIIVOSGPSA-N Ala-Ser-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CO)C(=O)N1CCC[C@@H]1C(=O)O)N NZGRHTKZFSVPAN-BIIVOSGPSA-N 0.000 description 1
- WQKAQKZRDIZYNV-VZFHVOOUSA-N Ala-Ser-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O WQKAQKZRDIZYNV-VZFHVOOUSA-N 0.000 description 1
- XQNRANMFRPCFFW-GCJQMDKQSA-N Ala-Thr-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](C)N)O XQNRANMFRPCFFW-GCJQMDKQSA-N 0.000 description 1
- WNHNMKOFKCHKKD-BFHQHQDPSA-N Ala-Thr-Gly Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O WNHNMKOFKCHKKD-BFHQHQDPSA-N 0.000 description 1
- QOIGKCBMXUCDQU-KDXUFGMBSA-N Ala-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](C)N)O QOIGKCBMXUCDQU-KDXUFGMBSA-N 0.000 description 1
- KTXKIYXZQFWJKB-VZFHVOOUSA-N Ala-Thr-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O KTXKIYXZQFWJKB-VZFHVOOUSA-N 0.000 description 1
- CREYEAPXISDKSB-FQPOAREZSA-N Ala-Thr-Tyr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O CREYEAPXISDKSB-FQPOAREZSA-N 0.000 description 1
- PXAFZDXYEIIUTF-LKTVYLICSA-N Ala-Trp-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCC(O)=O)C(O)=O PXAFZDXYEIIUTF-LKTVYLICSA-N 0.000 description 1
- AENHOIXXHKNIQL-AUTRQRHGSA-N Ala-Tyr-Ala Chemical compound [O-]C(=O)[C@H](C)NC(=O)[C@@H](NC(=O)[C@@H]([NH3+])C)CC1=CC=C(O)C=C1 AENHOIXXHKNIQL-AUTRQRHGSA-N 0.000 description 1
- GCTANJIJJROSLH-GVARAGBVSA-N Ala-Tyr-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](C)N GCTANJIJJROSLH-GVARAGBVSA-N 0.000 description 1
- ZJLORAAXDAJLDC-CQDKDKBSSA-N Ala-Tyr-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(O)=O ZJLORAAXDAJLDC-CQDKDKBSSA-N 0.000 description 1
- QRIYOHQJRDHFKF-UWJYBYFXSA-N Ala-Tyr-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=C(O)C=C1 QRIYOHQJRDHFKF-UWJYBYFXSA-N 0.000 description 1
- JPOQZCHGOTWRTM-FQPOAREZSA-N Ala-Tyr-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O JPOQZCHGOTWRTM-FQPOAREZSA-N 0.000 description 1
- IYKVSFNGSWTTNZ-GUBZILKMSA-N Ala-Val-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N IYKVSFNGSWTTNZ-GUBZILKMSA-N 0.000 description 1
- BVLPIIBTWIYOML-ZKWXMUAHSA-N Ala-Val-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O BVLPIIBTWIYOML-ZKWXMUAHSA-N 0.000 description 1
- CLOMBHBBUKAUBP-LSJOCFKGSA-N Ala-Val-His Chemical compound C[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N CLOMBHBBUKAUBP-LSJOCFKGSA-N 0.000 description 1
- NLYYHIKRBRMAJV-AEJSXWLSSA-N Ala-Val-Pro Chemical compound C[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N NLYYHIKRBRMAJV-AEJSXWLSSA-N 0.000 description 1
- SGYSTDWPNPKJPP-GUBZILKMSA-N Arg-Ala-Arg Chemical compound NC(=N)NCCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O SGYSTDWPNPKJPP-GUBZILKMSA-N 0.000 description 1
- DFCIPNHFKOQAME-FXQIFTODSA-N Arg-Ala-Asn Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(O)=O DFCIPNHFKOQAME-FXQIFTODSA-N 0.000 description 1
- HULHGJZIZXCPLD-FXQIFTODSA-N Arg-Ala-Cys Chemical compound C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N HULHGJZIZXCPLD-FXQIFTODSA-N 0.000 description 1
- YYOVLDPHIJAOSY-DCAQKATOSA-N Arg-Ala-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCN=C(N)N YYOVLDPHIJAOSY-DCAQKATOSA-N 0.000 description 1
- DBKNLHKEVPZVQC-LPEHRKFASA-N Arg-Ala-Pro Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](C)C(=O)N1CCC[C@@H]1C(O)=O DBKNLHKEVPZVQC-LPEHRKFASA-N 0.000 description 1
- OTOXOKCIIQLMFH-KZVJFYERSA-N Arg-Ala-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCN=C(N)N OTOXOKCIIQLMFH-KZVJFYERSA-N 0.000 description 1
- VWVPYNGMOCSSGK-GUBZILKMSA-N Arg-Arg-Asn Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(O)=O VWVPYNGMOCSSGK-GUBZILKMSA-N 0.000 description 1
- MUXONAMCEUBVGA-DCAQKATOSA-N Arg-Arg-Gln Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCC(N)=O)C(O)=O MUXONAMCEUBVGA-DCAQKATOSA-N 0.000 description 1
- IASNWHAGGYTEKX-IUCAKERBSA-N Arg-Arg-Gly Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)NCC(O)=O IASNWHAGGYTEKX-IUCAKERBSA-N 0.000 description 1
- PVSNBTCXCQIXSE-JYJNAYRXSA-N Arg-Arg-Phe Chemical compound NC(=N)NCCC[C@H](N)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 PVSNBTCXCQIXSE-JYJNAYRXSA-N 0.000 description 1
- OVVUNXXROOFSIM-SDDRHHMPSA-N Arg-Arg-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CCCN=C(N)N)N)C(=O)O OVVUNXXROOFSIM-SDDRHHMPSA-N 0.000 description 1
- JTKLCCFLSLCCST-SZMVWBNQSA-N Arg-Arg-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CCCN=C(N)N)N)C(O)=O)=CNC2=C1 JTKLCCFLSLCCST-SZMVWBNQSA-N 0.000 description 1
- CPSHGRGUPZBMOK-CIUDSAMLSA-N Arg-Asn-Gln Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O CPSHGRGUPZBMOK-CIUDSAMLSA-N 0.000 description 1
- RWCLSUOSKWTXLA-FXQIFTODSA-N Arg-Asp-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O RWCLSUOSKWTXLA-FXQIFTODSA-N 0.000 description 1
- OZNSCVPYWZRQPY-CIUDSAMLSA-N Arg-Asp-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O OZNSCVPYWZRQPY-CIUDSAMLSA-N 0.000 description 1
- KMSHNDWHPWXPEC-BQBZGAKWSA-N Arg-Asp-Gly Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O KMSHNDWHPWXPEC-BQBZGAKWSA-N 0.000 description 1
- JSHVMZANPXCDTL-GMOBBJLQSA-N Arg-Asp-Ile Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JSHVMZANPXCDTL-GMOBBJLQSA-N 0.000 description 1
- JTWOBPNAVBESFW-FXQIFTODSA-N Arg-Cys-Asp Chemical compound C(C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(=O)O)C(=O)O)N)CN=C(N)N JTWOBPNAVBESFW-FXQIFTODSA-N 0.000 description 1
- AHPWQERCDZTTNB-FXQIFTODSA-N Arg-Cys-Cys Chemical compound C(C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CS)C(=O)O)N)CN=C(N)N AHPWQERCDZTTNB-FXQIFTODSA-N 0.000 description 1
- XTGGTAWGUFXJSV-NAKRPEOUSA-N Arg-Cys-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CCCN=C(N)N)N XTGGTAWGUFXJSV-NAKRPEOUSA-N 0.000 description 1
- JVMKBJNSRZWDBO-FXQIFTODSA-N Arg-Cys-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CS)C(=O)N[C@@H](CO)C(O)=O JVMKBJNSRZWDBO-FXQIFTODSA-N 0.000 description 1
- FEZJJKXNPSEYEV-CIUDSAMLSA-N Arg-Gln-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O FEZJJKXNPSEYEV-CIUDSAMLSA-N 0.000 description 1
- BGDILZXXDJCKPF-CIUDSAMLSA-N Arg-Gln-Cys Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CS)C(O)=O BGDILZXXDJCKPF-CIUDSAMLSA-N 0.000 description 1
- VDBKFYYIBLXEIF-GUBZILKMSA-N Arg-Gln-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O VDBKFYYIBLXEIF-GUBZILKMSA-N 0.000 description 1
- VNFWDYWTSHFRRG-SRVKXCTJSA-N Arg-Gln-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O VNFWDYWTSHFRRG-SRVKXCTJSA-N 0.000 description 1
- OBFTYSPXDRROQO-SRVKXCTJSA-N Arg-Gln-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CCCN=C(N)N OBFTYSPXDRROQO-SRVKXCTJSA-N 0.000 description 1
- LMPKCSXZJSXBBL-NHCYSSNCSA-N Arg-Gln-Val Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O LMPKCSXZJSXBBL-NHCYSSNCSA-N 0.000 description 1
- HPKSHFSEXICTLI-CIUDSAMLSA-N Arg-Glu-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O HPKSHFSEXICTLI-CIUDSAMLSA-N 0.000 description 1
- XLWSGICNBZGYTA-CIUDSAMLSA-N Arg-Glu-Asp Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O XLWSGICNBZGYTA-CIUDSAMLSA-N 0.000 description 1
- JQFJNGVSGOUQDH-XIRDDKMYSA-N Arg-Glu-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CCC(O)=O)NC(=O)[C@H](CCCN=C(N)N)N)C(O)=O)=CNC2=C1 JQFJNGVSGOUQDH-XIRDDKMYSA-N 0.000 description 1
- JAYIQMNQDMOBFY-KKUMJFAQSA-N Arg-Glu-Tyr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O JAYIQMNQDMOBFY-KKUMJFAQSA-N 0.000 description 1
- GOWZVQXTHUCNSQ-NHCYSSNCSA-N Arg-Glu-Val Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O GOWZVQXTHUCNSQ-NHCYSSNCSA-N 0.000 description 1
- HQIZDMIGUJOSNI-IUCAKERBSA-N Arg-Gly-Arg Chemical compound N[C@@H](CCCNC(N)=N)C(=O)NCC(=O)N[C@@H](CCCNC(N)=N)C(O)=O HQIZDMIGUJOSNI-IUCAKERBSA-N 0.000 description 1
- PNIGSVZJNVUVJA-BQBZGAKWSA-N Arg-Gly-Asn Chemical compound NC(N)=NCCC[C@H](N)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O PNIGSVZJNVUVJA-BQBZGAKWSA-N 0.000 description 1
- AUFHLLPVPSMEOG-YUMQZZPRSA-N Arg-Gly-Glu Chemical compound NC(N)=NCCC[C@H](N)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O AUFHLLPVPSMEOG-YUMQZZPRSA-N 0.000 description 1
- OQCWXQJLCDPRHV-UWVGGRQHSA-N Arg-Gly-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O OQCWXQJLCDPRHV-UWVGGRQHSA-N 0.000 description 1
- KRQSPVKUISQQFS-FJXKBIBVSA-N Arg-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CCCN=C(N)N KRQSPVKUISQQFS-FJXKBIBVSA-N 0.000 description 1
- MSILNNHVVMMTHZ-UWVGGRQHSA-N Arg-His-Gly Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CN=CN1 MSILNNHVVMMTHZ-UWVGGRQHSA-N 0.000 description 1
- NVCIXQYNWYTLDO-IHRRRGAJSA-N Arg-His-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CCCN=C(N)N)N NVCIXQYNWYTLDO-IHRRRGAJSA-N 0.000 description 1
- UPKMBGAAEZGHOC-RWMBFGLXSA-N Arg-His-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CN=CN2)NC(=O)[C@H](CCCN=C(N)N)N)C(=O)O UPKMBGAAEZGHOC-RWMBFGLXSA-N 0.000 description 1
- CRCCTGPNZUCAHE-DCAQKATOSA-N Arg-His-Ser Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CO)C(O)=O)CC1=CN=CN1 CRCCTGPNZUCAHE-DCAQKATOSA-N 0.000 description 1
- DGFXIWKPTDKBLF-AVGNSLFASA-N Arg-His-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CCCN=C(N)N)N DGFXIWKPTDKBLF-AVGNSLFASA-N 0.000 description 1
- FRMQITGHXMUNDF-GMOBBJLQSA-N Arg-Ile-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N FRMQITGHXMUNDF-GMOBBJLQSA-N 0.000 description 1
- YQGZIRIYGHNSQO-ZPFDUUQYSA-N Arg-Ile-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N YQGZIRIYGHNSQO-ZPFDUUQYSA-N 0.000 description 1
- AGVNTAUPLWIQEN-ZPFDUUQYSA-N Arg-Ile-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N AGVNTAUPLWIQEN-ZPFDUUQYSA-N 0.000 description 1
- FFEUXEAKYRCACT-PEDHHIEDSA-N Arg-Ile-Ile Chemical compound CC[C@H](C)[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)CCCNC(N)=N)[C@@H](C)CC)C(O)=O FFEUXEAKYRCACT-PEDHHIEDSA-N 0.000 description 1
- HJDNZFIYILEIKR-OSUNSFLBSA-N Arg-Ile-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O HJDNZFIYILEIKR-OSUNSFLBSA-N 0.000 description 1
- CFGHCPUPFHWMCM-FDARSICLSA-N Arg-Ile-Trp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N CFGHCPUPFHWMCM-FDARSICLSA-N 0.000 description 1
- UHFUZWSZQKMDSX-DCAQKATOSA-N Arg-Leu-Asn Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N UHFUZWSZQKMDSX-DCAQKATOSA-N 0.000 description 1
- GMFAGHNRXPSSJS-SRVKXCTJSA-N Arg-Leu-Gln Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O GMFAGHNRXPSSJS-SRVKXCTJSA-N 0.000 description 1
- OTZMRMHZCMZOJZ-SRVKXCTJSA-N Arg-Leu-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O OTZMRMHZCMZOJZ-SRVKXCTJSA-N 0.000 description 1
- UZGFHWIJWPUPOH-IHRRRGAJSA-N Arg-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N UZGFHWIJWPUPOH-IHRRRGAJSA-N 0.000 description 1
- JEOCWTUOMKEEMF-RHYQMDGZSA-N Arg-Leu-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O JEOCWTUOMKEEMF-RHYQMDGZSA-N 0.000 description 1
- OGSQONVYSTZIJB-WDSOQIARSA-N Arg-Leu-Trp Chemical compound CC(C)C[C@H](NC(=O)[C@@H](N)CCCN=C(N)N)C(=O)N[C@@H](Cc1c[nH]c2ccccc12)C(O)=O OGSQONVYSTZIJB-WDSOQIARSA-N 0.000 description 1
- RIIVUOJDDQXHRV-SRVKXCTJSA-N Arg-Lys-Gln Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(O)=O RIIVUOJDDQXHRV-SRVKXCTJSA-N 0.000 description 1
- MTYLORHAQXVQOW-AVGNSLFASA-N Arg-Lys-Met Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCSC)C(O)=O MTYLORHAQXVQOW-AVGNSLFASA-N 0.000 description 1
- XUGATJVGQUGQKY-ULQDDVLXSA-N Arg-Lys-Phe Chemical compound NC(=N)NCCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 XUGATJVGQUGQKY-ULQDDVLXSA-N 0.000 description 1
- QBQVKUNBCAFXSV-ULQDDVLXSA-N Arg-Lys-Tyr Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 QBQVKUNBCAFXSV-ULQDDVLXSA-N 0.000 description 1
- VVJTWSRNMJNDPN-IUCAKERBSA-N Arg-Met-Gly Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCSC)C(=O)NCC(O)=O VVJTWSRNMJNDPN-IUCAKERBSA-N 0.000 description 1
- OISWSORSLQOGFV-AVGNSLFASA-N Arg-Met-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCSC)NC(=O)[C@@H](N)CCCN=C(N)N OISWSORSLQOGFV-AVGNSLFASA-N 0.000 description 1
- UGZUVYDKAYNCII-ULQDDVLXSA-N Arg-Phe-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O UGZUVYDKAYNCII-ULQDDVLXSA-N 0.000 description 1
- RATVAFHGEFAWDH-JYJNAYRXSA-N Arg-Phe-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](CCCN=C(N)N)N RATVAFHGEFAWDH-JYJNAYRXSA-N 0.000 description 1
- OVQJAKFLFTZDNC-GUBZILKMSA-N Arg-Pro-Asp Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O OVQJAKFLFTZDNC-GUBZILKMSA-N 0.000 description 1
- DNBMCNQKNOKOSD-DCAQKATOSA-N Arg-Pro-Gln Chemical compound NC(N)=NCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O DNBMCNQKNOKOSD-DCAQKATOSA-N 0.000 description 1
- XSPKAHFVDKRGRL-DCAQKATOSA-N Arg-Pro-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O XSPKAHFVDKRGRL-DCAQKATOSA-N 0.000 description 1
- NGYHSXDNNOFHNE-AVGNSLFASA-N Arg-Pro-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O NGYHSXDNNOFHNE-AVGNSLFASA-N 0.000 description 1
- YCYXHLZRUSJITQ-SRVKXCTJSA-N Arg-Pro-Pro Chemical compound NC(=N)NCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 YCYXHLZRUSJITQ-SRVKXCTJSA-N 0.000 description 1
- VUGWHBXPMAHEGZ-SRVKXCTJSA-N Arg-Pro-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CCCN=C(N)N VUGWHBXPMAHEGZ-SRVKXCTJSA-N 0.000 description 1
- KXOPYFNQLVUOAQ-FXQIFTODSA-N Arg-Ser-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O KXOPYFNQLVUOAQ-FXQIFTODSA-N 0.000 description 1
- VENMDXUVHSKEIN-GUBZILKMSA-N Arg-Ser-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O VENMDXUVHSKEIN-GUBZILKMSA-N 0.000 description 1
- AUIJUTGLPVHIRT-FXQIFTODSA-N Arg-Ser-Cys Chemical compound C(C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)O)N)CN=C(N)N AUIJUTGLPVHIRT-FXQIFTODSA-N 0.000 description 1
- LFAUVOXPCGJKTB-DCAQKATOSA-N Arg-Ser-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCCN=C(N)N)N LFAUVOXPCGJKTB-DCAQKATOSA-N 0.000 description 1
- URAUIUGLHBRPMF-NAKRPEOUSA-N Arg-Ser-Ile Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O URAUIUGLHBRPMF-NAKRPEOUSA-N 0.000 description 1
- KMFPQTITXUKJOV-DCAQKATOSA-N Arg-Ser-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O KMFPQTITXUKJOV-DCAQKATOSA-N 0.000 description 1
- ICRHGPYYXMWHIE-LPEHRKFASA-N Arg-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CCCN=C(N)N)N)C(=O)O ICRHGPYYXMWHIE-LPEHRKFASA-N 0.000 description 1
- FBXMCPLCVYUWBO-BPUTZDHNSA-N Arg-Ser-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCCN=C(N)N)N FBXMCPLCVYUWBO-BPUTZDHNSA-N 0.000 description 1
- AIFHRTPABBBHKU-RCWTZXSCSA-N Arg-Thr-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O AIFHRTPABBBHKU-RCWTZXSCSA-N 0.000 description 1
- HRCIIMCTUIAKQB-XGEHTFHBSA-N Arg-Thr-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N)O HRCIIMCTUIAKQB-XGEHTFHBSA-N 0.000 description 1
- RYQSYXFGFOTJDJ-RHYQMDGZSA-N Arg-Thr-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O RYQSYXFGFOTJDJ-RHYQMDGZSA-N 0.000 description 1
- WTFIFQWLQXZLIZ-UMPQAUOISA-N Arg-Thr-Trp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N)O WTFIFQWLQXZLIZ-UMPQAUOISA-N 0.000 description 1
- XRNXPIGJPQHCPC-RCWTZXSCSA-N Arg-Thr-Val Chemical compound CC(C)[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)CCCNC(N)=N)[C@@H](C)O)C(O)=O XRNXPIGJPQHCPC-RCWTZXSCSA-N 0.000 description 1
- DRDWXKWUSIKKOB-PJODQICGSA-N Arg-Trp-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](C)C(O)=O DRDWXKWUSIKKOB-PJODQICGSA-N 0.000 description 1
- ZUVDFJXRAICIAJ-BPUTZDHNSA-N Arg-Trp-Asp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CCCN=C(N)N)N)C(=O)N[C@@H](CC(O)=O)C(O)=O)=CNC2=C1 ZUVDFJXRAICIAJ-BPUTZDHNSA-N 0.000 description 1
- QMQZYILAWUOLPV-JYJNAYRXSA-N Arg-Tyr-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CCCN=C(N)N)C(O)=O)CC1=CC=C(O)C=C1 QMQZYILAWUOLPV-JYJNAYRXSA-N 0.000 description 1
- XMZZGVGKGXRIGJ-JYJNAYRXSA-N Arg-Tyr-Val Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(O)=O XMZZGVGKGXRIGJ-JYJNAYRXSA-N 0.000 description 1
- QTAIIXQCOPUNBQ-QXEWZRGKSA-N Arg-Val-Asp Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O QTAIIXQCOPUNBQ-QXEWZRGKSA-N 0.000 description 1
- ULBHWNVWSCJLCO-NHCYSSNCSA-N Arg-Val-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCN=C(N)N ULBHWNVWSCJLCO-NHCYSSNCSA-N 0.000 description 1
- VYZBPPBKFCHCIS-WPRPVWTQSA-N Arg-Val-Gly Chemical compound OC(=O)CNC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCN=C(N)N VYZBPPBKFCHCIS-WPRPVWTQSA-N 0.000 description 1
- XEOXPCNONWHHSW-AVGNSLFASA-N Arg-Val-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N XEOXPCNONWHHSW-AVGNSLFASA-N 0.000 description 1
- FMYQECOAIFGQGU-CYDGBPFRSA-N Arg-Val-Ile Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O FMYQECOAIFGQGU-CYDGBPFRSA-N 0.000 description 1
- WOZDCBHUGJVJPL-AVGNSLFASA-N Arg-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N WOZDCBHUGJVJPL-AVGNSLFASA-N 0.000 description 1
- WTUZDHWWGUQEKN-SRVKXCTJSA-N Arg-Val-Met Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCSC)C(O)=O WTUZDHWWGUQEKN-SRVKXCTJSA-N 0.000 description 1
- QLSRIZIDQXDQHK-RCWTZXSCSA-N Arg-Val-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O QLSRIZIDQXDQHK-RCWTZXSCSA-N 0.000 description 1
- UTSMXMABBPFVJP-SZMVWBNQSA-N Arg-Val-Trp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N UTSMXMABBPFVJP-SZMVWBNQSA-N 0.000 description 1
- SWLOHUMCUDRTCL-ZLUOBGJFSA-N Asn-Ala-Asn Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N SWLOHUMCUDRTCL-ZLUOBGJFSA-N 0.000 description 1
- PDQBXRSOSCTGKY-ACZMJKKPSA-N Asn-Ala-Gln Chemical compound C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N PDQBXRSOSCTGKY-ACZMJKKPSA-N 0.000 description 1
- LEFKSBYHUGUWLP-ACZMJKKPSA-N Asn-Ala-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O LEFKSBYHUGUWLP-ACZMJKKPSA-N 0.000 description 1
- CMLGVVWQQHUXOZ-GHCJXIJMSA-N Asn-Ala-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O CMLGVVWQQHUXOZ-GHCJXIJMSA-N 0.000 description 1
- SLKLLQWZQHXYSV-CIUDSAMLSA-N Asn-Ala-Lys Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCCN)C(O)=O SLKLLQWZQHXYSV-CIUDSAMLSA-N 0.000 description 1
- XWGJDUSDTRPQRK-ZLUOBGJFSA-N Asn-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC(N)=O XWGJDUSDTRPQRK-ZLUOBGJFSA-N 0.000 description 1
- IARGXWMWRFOQPG-GCJQMDKQSA-N Asn-Ala-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O IARGXWMWRFOQPG-GCJQMDKQSA-N 0.000 description 1
- NTXNUXPCNRDMAF-WFBYXXMGSA-N Asn-Ala-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)CC(N)=O)C)C(O)=O)=CNC2=C1 NTXNUXPCNRDMAF-WFBYXXMGSA-N 0.000 description 1
- QEYJFBMTSMLPKZ-ZKWXMUAHSA-N Asn-Ala-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O QEYJFBMTSMLPKZ-ZKWXMUAHSA-N 0.000 description 1
- BDMIFVIWCNLDCT-CIUDSAMLSA-N Asn-Arg-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O BDMIFVIWCNLDCT-CIUDSAMLSA-N 0.000 description 1
- MEFGKQUUYZOLHM-GMOBBJLQSA-N Asn-Arg-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O MEFGKQUUYZOLHM-GMOBBJLQSA-N 0.000 description 1
- MFFOYNGMOYFPBD-DCAQKATOSA-N Asn-Arg-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O MFFOYNGMOYFPBD-DCAQKATOSA-N 0.000 description 1
- GOVUDFOGXOONFT-VEVYYDQMSA-N Asn-Arg-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GOVUDFOGXOONFT-VEVYYDQMSA-N 0.000 description 1
- YNSCBOUZTAGIGO-ZLUOBGJFSA-N Asn-Asn-Cys Chemical compound C([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CS)C(=O)O)N)C(=O)N YNSCBOUZTAGIGO-ZLUOBGJFSA-N 0.000 description 1
- QHBMKQWOIYJYMI-BYULHYEWSA-N Asn-Asn-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O QHBMKQWOIYJYMI-BYULHYEWSA-N 0.000 description 1
- UGXVKHRDGLYFKR-CIUDSAMLSA-N Asn-Asp-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC(N)=O UGXVKHRDGLYFKR-CIUDSAMLSA-N 0.000 description 1
- PAXHINASXXXILC-SRVKXCTJSA-N Asn-Asp-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC(=O)N)N)O PAXHINASXXXILC-SRVKXCTJSA-N 0.000 description 1
- QRHYAUYXBVVDSB-LKXGYXEUSA-N Asn-Cys-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CS)C(=O)N[C@@H]([C@@H](C)O)C(O)=O QRHYAUYXBVVDSB-LKXGYXEUSA-N 0.000 description 1
- XWFPGQVLOVGSLU-CIUDSAMLSA-N Asn-Gln-Arg Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N XWFPGQVLOVGSLU-CIUDSAMLSA-N 0.000 description 1
- QPTAGIPWARILES-AVGNSLFASA-N Asn-Gln-Phe Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O QPTAGIPWARILES-AVGNSLFASA-N 0.000 description 1
- FUHFYEKSGWOWGZ-XHNCKOQMSA-N Asn-Gln-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC(=O)N)N)C(=O)O FUHFYEKSGWOWGZ-XHNCKOQMSA-N 0.000 description 1
- OKZOABJQOMAYEC-NUMRIWBASA-N Asn-Gln-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OKZOABJQOMAYEC-NUMRIWBASA-N 0.000 description 1
- PPMTUXJSQDNUDE-CIUDSAMLSA-N Asn-Glu-Arg Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N PPMTUXJSQDNUDE-CIUDSAMLSA-N 0.000 description 1
- WONGRTVAMHFGBE-WDSKDSINSA-N Asn-Gly-Gln Chemical compound C(CC(=O)N)[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CC(=O)N)N WONGRTVAMHFGBE-WDSKDSINSA-N 0.000 description 1
- OLVIPTLKNSAYRJ-YUMQZZPRSA-N Asn-Gly-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CC(=O)N)N OLVIPTLKNSAYRJ-YUMQZZPRSA-N 0.000 description 1
- JQSWHKKUZMTOIH-QWRGUYRKSA-N Asn-Gly-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CC(=O)N)N JQSWHKKUZMTOIH-QWRGUYRKSA-N 0.000 description 1
- RAQMSGVCGSJKCL-FOHZUACHSA-N Asn-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC(N)=O RAQMSGVCGSJKCL-FOHZUACHSA-N 0.000 description 1
- OLISTMZJGQUOGS-GMOBBJLQSA-N Asn-Ile-Arg Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N OLISTMZJGQUOGS-GMOBBJLQSA-N 0.000 description 1
- ANPFQTJEPONRPL-UGYAYLCHSA-N Asn-Ile-Asp Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O ANPFQTJEPONRPL-UGYAYLCHSA-N 0.000 description 1
- SPCONPVIDFMDJI-QSFUFRPTSA-N Asn-Ile-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O SPCONPVIDFMDJI-QSFUFRPTSA-N 0.000 description 1
- PNHQRQTVBRDIEF-CIUDSAMLSA-N Asn-Leu-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC(=O)N)N PNHQRQTVBRDIEF-CIUDSAMLSA-N 0.000 description 1
- HFPXZWPUVFVNLL-GUBZILKMSA-N Asn-Leu-Gln Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O HFPXZWPUVFVNLL-GUBZILKMSA-N 0.000 description 1
- BZWRLDPIWKOVKB-ZPFDUUQYSA-N Asn-Leu-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O BZWRLDPIWKOVKB-ZPFDUUQYSA-N 0.000 description 1
- YVXRYLVELQYAEQ-SRVKXCTJSA-N Asn-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)N)N YVXRYLVELQYAEQ-SRVKXCTJSA-N 0.000 description 1
- JLNFZLNDHONLND-GARJFASQSA-N Asn-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)N)N JLNFZLNDHONLND-GARJFASQSA-N 0.000 description 1
- NCFJQJRLQJEECD-NHCYSSNCSA-N Asn-Leu-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O NCFJQJRLQJEECD-NHCYSSNCSA-N 0.000 description 1
- RCFGLXMZDYNRSC-CIUDSAMLSA-N Asn-Lys-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O RCFGLXMZDYNRSC-CIUDSAMLSA-N 0.000 description 1
- ALHMNHZJBYBYHS-DCAQKATOSA-N Asn-Lys-Arg Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O ALHMNHZJBYBYHS-DCAQKATOSA-N 0.000 description 1
- FBODFHMLALOPHP-GUBZILKMSA-N Asn-Lys-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O FBODFHMLALOPHP-GUBZILKMSA-N 0.000 description 1
- AYOAHKWVQLNPDM-HJGDQZAQSA-N Asn-Lys-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O AYOAHKWVQLNPDM-HJGDQZAQSA-N 0.000 description 1
- BSBNNPICFPXDNH-SRVKXCTJSA-N Asn-Phe-Cys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC(=O)N)N BSBNNPICFPXDNH-SRVKXCTJSA-N 0.000 description 1
- BKFXFUPYETWGGA-XVSYOHENSA-N Asn-Phe-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O BKFXFUPYETWGGA-XVSYOHENSA-N 0.000 description 1
- UYCPJVYQYARFGB-YDHLFZDLSA-N Asn-Phe-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(O)=O UYCPJVYQYARFGB-YDHLFZDLSA-N 0.000 description 1
- XMHFCUKJRCQXGI-CIUDSAMLSA-N Asn-Pro-Gln Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CC(=O)N)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O XMHFCUKJRCQXGI-CIUDSAMLSA-N 0.000 description 1
- SZNGQSBRHFMZLT-IHRRRGAJSA-N Asn-Pro-Phe Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O SZNGQSBRHFMZLT-IHRRRGAJSA-N 0.000 description 1
- IDUUACUJKUXKKD-VEVYYDQMSA-N Asn-Pro-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O IDUUACUJKUXKKD-VEVYYDQMSA-N 0.000 description 1
- JWQWPRCDYWNVNM-ACZMJKKPSA-N Asn-Ser-Gln Chemical compound C(CC(=O)N)[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CC(=O)N)N JWQWPRCDYWNVNM-ACZMJKKPSA-N 0.000 description 1
- SNYCNNPOFYBCEK-ZLUOBGJFSA-N Asn-Ser-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O SNYCNNPOFYBCEK-ZLUOBGJFSA-N 0.000 description 1
- HPASIOLTWSNMFB-OLHMAJIHSA-N Asn-Thr-Asp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(O)=O HPASIOLTWSNMFB-OLHMAJIHSA-N 0.000 description 1
- HCZQKHSRYHCPSD-IUKAMOBKSA-N Asn-Thr-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O HCZQKHSRYHCPSD-IUKAMOBKSA-N 0.000 description 1
- PIABYSIYPGLLDQ-XVSYOHENSA-N Asn-Thr-Phe Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O PIABYSIYPGLLDQ-XVSYOHENSA-N 0.000 description 1
- BCADFFUQHIMQAA-KKHAAJSZSA-N Asn-Thr-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O BCADFFUQHIMQAA-KKHAAJSZSA-N 0.000 description 1
- ANRZCQXIXGDXLR-CWRNSKLLSA-N Asn-Trp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CNC3=CC=CC=C32)NC(=O)[C@H](CC(=O)N)N)C(=O)O ANRZCQXIXGDXLR-CWRNSKLLSA-N 0.000 description 1
- MLJZMGIXXMTEPO-UBHSHLNASA-N Asn-Trp-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CO)C(O)=O MLJZMGIXXMTEPO-UBHSHLNASA-N 0.000 description 1
- JPPLRQVZMZFOSX-UWJYBYFXSA-N Asn-Tyr-Ala Chemical compound NC(=O)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C)C(O)=O)CC1=CC=C(O)C=C1 JPPLRQVZMZFOSX-UWJYBYFXSA-N 0.000 description 1
- DATSKXOXPUAOLK-KKUMJFAQSA-N Asn-Tyr-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(O)=O DATSKXOXPUAOLK-KKUMJFAQSA-N 0.000 description 1
- LRCIOEVFVGXZKB-BZSNNMDCSA-N Asn-Tyr-Tyr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O LRCIOEVFVGXZKB-BZSNNMDCSA-N 0.000 description 1
- WQAOZCVOOYUWKG-LSJOCFKGSA-N Asn-Val-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)O)NC(=O)[C@H](CC(=O)N)N WQAOZCVOOYUWKG-LSJOCFKGSA-N 0.000 description 1
- KRXIWXCXOARFNT-ZLUOBGJFSA-N Asp-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC(O)=O KRXIWXCXOARFNT-ZLUOBGJFSA-N 0.000 description 1
- UWMIZBCTVWVMFI-FXQIFTODSA-N Asp-Ala-Arg Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O UWMIZBCTVWVMFI-FXQIFTODSA-N 0.000 description 1
- XBQSLMACWDXWLJ-GHCJXIJMSA-N Asp-Ala-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O XBQSLMACWDXWLJ-GHCJXIJMSA-N 0.000 description 1
- XPGVTUBABLRGHY-BIIVOSGPSA-N Asp-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)O)N XPGVTUBABLRGHY-BIIVOSGPSA-N 0.000 description 1
- KVMPVNGOKHTUHZ-GCJQMDKQSA-N Asp-Ala-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KVMPVNGOKHTUHZ-GCJQMDKQSA-N 0.000 description 1
- GVPSCJQLUGIKAM-GUBZILKMSA-N Asp-Arg-Arg Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O GVPSCJQLUGIKAM-GUBZILKMSA-N 0.000 description 1
- WSOKZUVWBXVJHX-CIUDSAMLSA-N Asp-Arg-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O WSOKZUVWBXVJHX-CIUDSAMLSA-N 0.000 description 1
- AXXCUABIFZPKPM-BQBZGAKWSA-N Asp-Arg-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O AXXCUABIFZPKPM-BQBZGAKWSA-N 0.000 description 1
- IXIWEFWRKIUMQX-DCAQKATOSA-N Asp-Arg-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@@H](N)CC(O)=O IXIWEFWRKIUMQX-DCAQKATOSA-N 0.000 description 1
- YNQIDCRRTWGHJD-ZLUOBGJFSA-N Asp-Asn-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CC(O)=O YNQIDCRRTWGHJD-ZLUOBGJFSA-N 0.000 description 1
- LKIYSIYBKYLKPU-BIIVOSGPSA-N Asp-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC(=O)O)N)C(=O)O LKIYSIYBKYLKPU-BIIVOSGPSA-N 0.000 description 1
- BFOYULZBKYOKAN-OLHMAJIHSA-N Asp-Asp-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O BFOYULZBKYOKAN-OLHMAJIHSA-N 0.000 description 1
- PXLNPFOJZQMXAT-BYULHYEWSA-N Asp-Asp-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC(O)=O PXLNPFOJZQMXAT-BYULHYEWSA-N 0.000 description 1
- MJKBOVWWADWLHV-ZLUOBGJFSA-N Asp-Cys-Asp Chemical compound C([C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(=O)O)C(=O)O)N)C(=O)O MJKBOVWWADWLHV-ZLUOBGJFSA-N 0.000 description 1
- NURJSGZGBVJFAD-ZLUOBGJFSA-N Asp-Cys-Ser Chemical compound C([C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CO)C(=O)O)N)C(=O)O NURJSGZGBVJFAD-ZLUOBGJFSA-N 0.000 description 1
- PJERDVUTUDZPGX-ZKWXMUAHSA-N Asp-Cys-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CS)NC(=O)[C@@H](N)CC(O)=O PJERDVUTUDZPGX-ZKWXMUAHSA-N 0.000 description 1
- NYQHSUGFEWDWPD-ACZMJKKPSA-N Asp-Gln-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC(=O)O)N NYQHSUGFEWDWPD-ACZMJKKPSA-N 0.000 description 1
- RSMIHCFQDCVVBR-CIUDSAMLSA-N Asp-Gln-Arg Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(O)=O)CCCNC(N)=N RSMIHCFQDCVVBR-CIUDSAMLSA-N 0.000 description 1
- BKXPJCBEHWFSTF-ACZMJKKPSA-N Asp-Gln-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O BKXPJCBEHWFSTF-ACZMJKKPSA-N 0.000 description 1
- XJQRWGXKUSDEFI-ACZMJKKPSA-N Asp-Glu-Asn Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O XJQRWGXKUSDEFI-ACZMJKKPSA-N 0.000 description 1
- RRKCPMGSRIDLNC-AVGNSLFASA-N Asp-Glu-Tyr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O RRKCPMGSRIDLNC-AVGNSLFASA-N 0.000 description 1
- DTNUIAJCPRMNBT-WHFBIAKZSA-N Asp-Gly-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H](C)C(O)=O DTNUIAJCPRMNBT-WHFBIAKZSA-N 0.000 description 1
- WBDWQKRLTVCDSY-WHFBIAKZSA-N Asp-Gly-Asp Chemical compound OC(=O)C[C@H](N)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O WBDWQKRLTVCDSY-WHFBIAKZSA-N 0.000 description 1
- HAFCJCDJGIOYPW-WDSKDSINSA-N Asp-Gly-Gln Chemical compound OC(=O)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(N)=O HAFCJCDJGIOYPW-WDSKDSINSA-N 0.000 description 1
- LDGUZSIPGSPBJP-XVYDVKMFSA-N Asp-His-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CC(=O)O)N LDGUZSIPGSPBJP-XVYDVKMFSA-N 0.000 description 1
- KTTCQQNRRLCIBC-GHCJXIJMSA-N Asp-Ile-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O KTTCQQNRRLCIBC-GHCJXIJMSA-N 0.000 description 1
- QNFRBNZGVVKBNJ-PEFMBERDSA-N Asp-Ile-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)O)N QNFRBNZGVVKBNJ-PEFMBERDSA-N 0.000 description 1
- SEMWSADZTMJELF-BYULHYEWSA-N Asp-Ile-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O SEMWSADZTMJELF-BYULHYEWSA-N 0.000 description 1
- HOBNTSHITVVNBN-ZPFDUUQYSA-N Asp-Ile-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)O)NC(=O)[C@H](CC(=O)O)N HOBNTSHITVVNBN-ZPFDUUQYSA-N 0.000 description 1
- LDLZOAJRXXBVGF-GMOBBJLQSA-N Asp-Ile-Met Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CC(=O)O)N LDLZOAJRXXBVGF-GMOBBJLQSA-N 0.000 description 1
- RTXQQDVBACBSCW-CFMVVWHZSA-N Asp-Ile-Tyr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O RTXQQDVBACBSCW-CFMVVWHZSA-N 0.000 description 1
- JNNVNVRBYUJYGS-CIUDSAMLSA-N Asp-Leu-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O JNNVNVRBYUJYGS-CIUDSAMLSA-N 0.000 description 1
- SCQIQCWLOMOEFP-DCAQKATOSA-N Asp-Leu-Arg Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O SCQIQCWLOMOEFP-DCAQKATOSA-N 0.000 description 1
- DWOGMPWRQQWPPF-GUBZILKMSA-N Asp-Leu-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O DWOGMPWRQQWPPF-GUBZILKMSA-N 0.000 description 1
- OEDJQRXNDRUGEU-SRVKXCTJSA-N Asp-Leu-His Chemical compound N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)O OEDJQRXNDRUGEU-SRVKXCTJSA-N 0.000 description 1
- RQHLMGCXCZUOGT-ZPFDUUQYSA-N Asp-Leu-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O RQHLMGCXCZUOGT-ZPFDUUQYSA-N 0.000 description 1
- UJGRZQYSNYTCAX-SRVKXCTJSA-N Asp-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC(O)=O UJGRZQYSNYTCAX-SRVKXCTJSA-N 0.000 description 1
- UMHUHHJMEXNSIV-CIUDSAMLSA-N Asp-Leu-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC(O)=O UMHUHHJMEXNSIV-CIUDSAMLSA-N 0.000 description 1
- UZFHNLYQWMGUHU-DCAQKATOSA-N Asp-Lys-Arg Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O UZFHNLYQWMGUHU-DCAQKATOSA-N 0.000 description 1
- QNIACYURSSCLRP-GUBZILKMSA-N Asp-Lys-Gln Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(O)=O QNIACYURSSCLRP-GUBZILKMSA-N 0.000 description 1
- NVFSJIXJZCDICF-SRVKXCTJSA-N Asp-Lys-Lys Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)O)N NVFSJIXJZCDICF-SRVKXCTJSA-N 0.000 description 1
- DPNWSMBUYCLEDG-CIUDSAMLSA-N Asp-Lys-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O DPNWSMBUYCLEDG-CIUDSAMLSA-N 0.000 description 1
- RXBGWGRSWXOBGK-KKUMJFAQSA-N Asp-Lys-Tyr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O RXBGWGRSWXOBGK-KKUMJFAQSA-N 0.000 description 1
- YTXCCDCOHIYQFC-GUBZILKMSA-N Asp-Met-Arg Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O YTXCCDCOHIYQFC-GUBZILKMSA-N 0.000 description 1
- VMVUDJUXJKDGNR-FXQIFTODSA-N Asp-Met-Asn Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)O)N VMVUDJUXJKDGNR-FXQIFTODSA-N 0.000 description 1
- SJLDOGLMVPHPLZ-IHRRRGAJSA-N Asp-Met-Phe Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CCSC)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 SJLDOGLMVPHPLZ-IHRRRGAJSA-N 0.000 description 1
- LKVKODXGSAFOFY-VEVYYDQMSA-N Asp-Met-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LKVKODXGSAFOFY-VEVYYDQMSA-N 0.000 description 1
- GYWQGGUCMDCUJE-DLOVCJGASA-N Asp-Phe-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C)C(O)=O GYWQGGUCMDCUJE-DLOVCJGASA-N 0.000 description 1
- JUWISGAGWSDGDH-KKUMJFAQSA-N Asp-Phe-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CC(O)=O)CC1=CC=CC=C1 JUWISGAGWSDGDH-KKUMJFAQSA-N 0.000 description 1
- PWAIZUBWHRHYKS-MELADBBJSA-N Asp-Phe-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)NC(=O)[C@H](CC(=O)O)N)C(=O)O PWAIZUBWHRHYKS-MELADBBJSA-N 0.000 description 1
- RPUYTJJZXQBWDT-SRVKXCTJSA-N Asp-Phe-Ser Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC(=O)O)N RPUYTJJZXQBWDT-SRVKXCTJSA-N 0.000 description 1
- KESWRFKUZRUTAH-FXQIFTODSA-N Asp-Pro-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O KESWRFKUZRUTAH-FXQIFTODSA-N 0.000 description 1
- BKOIIURTQAJHAT-GUBZILKMSA-N Asp-Pro-Pro Chemical compound OC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 BKOIIURTQAJHAT-GUBZILKMSA-N 0.000 description 1
- ZVGRHIRJLWBWGJ-ACZMJKKPSA-N Asp-Ser-Gln Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(O)=O ZVGRHIRJLWBWGJ-ACZMJKKPSA-N 0.000 description 1
- BRRPVTUFESPTCP-ACZMJKKPSA-N Asp-Ser-Glu Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O BRRPVTUFESPTCP-ACZMJKKPSA-N 0.000 description 1
- DRCOAZZDQRCGGP-GHCJXIJMSA-N Asp-Ser-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O DRCOAZZDQRCGGP-GHCJXIJMSA-N 0.000 description 1
- KBJVTFWQWXCYCQ-IUKAMOBKSA-N Asp-Thr-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KBJVTFWQWXCYCQ-IUKAMOBKSA-N 0.000 description 1
- GCACQYDBDHRVGE-LKXGYXEUSA-N Asp-Thr-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H]([C@H](O)C)NC(=O)[C@@H](N)CC(O)=O GCACQYDBDHRVGE-LKXGYXEUSA-N 0.000 description 1
- YUELDQUPTAYEGM-XIRDDKMYSA-N Asp-Trp-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@H](CC(=O)O)N YUELDQUPTAYEGM-XIRDDKMYSA-N 0.000 description 1
- ZVYYMCXVPZEAPU-CWRNSKLLSA-N Asp-Trp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CNC3=CC=CC=C32)NC(=O)[C@H](CC(=O)O)N)C(=O)O ZVYYMCXVPZEAPU-CWRNSKLLSA-N 0.000 description 1
- AWPWHMVCSISSQK-QWRGUYRKSA-N Asp-Tyr-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(O)=O AWPWHMVCSISSQK-QWRGUYRKSA-N 0.000 description 1
- ZQFZEBRNAMXXJV-KKUMJFAQSA-N Asp-Tyr-His Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)NC(=O)[C@H](CC(=O)O)N)O ZQFZEBRNAMXXJV-KKUMJFAQSA-N 0.000 description 1
- NWAHPBGBDIFUFD-KKUMJFAQSA-N Asp-Tyr-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(O)=O NWAHPBGBDIFUFD-KKUMJFAQSA-N 0.000 description 1
- ALMIMUZAWTUNIO-BZSNNMDCSA-N Asp-Tyr-Tyr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O ALMIMUZAWTUNIO-BZSNNMDCSA-N 0.000 description 1
- PLOKOIJSGCISHE-BYULHYEWSA-N Asp-Val-Asn Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O PLOKOIJSGCISHE-BYULHYEWSA-N 0.000 description 1
- XWKPSMRPIKKDDU-RCOVLWMOSA-N Asp-Val-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O XWKPSMRPIKKDDU-RCOVLWMOSA-N 0.000 description 1
- SFJUYBCDQBAYAJ-YDHLFZDLSA-N Asp-Val-Phe Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 SFJUYBCDQBAYAJ-YDHLFZDLSA-N 0.000 description 1
- JGLWFWXGOINXEA-YDHLFZDLSA-N Asp-Val-Tyr Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 JGLWFWXGOINXEA-YDHLFZDLSA-N 0.000 description 1
- DWRXFEITVBNRMK-UHFFFAOYSA-N Beta-D-1-Arabinofuranosylthymine Natural products O=C1NC(=O)C(C)=CN1C1C(O)C(O)C(CO)O1 DWRXFEITVBNRMK-UHFFFAOYSA-N 0.000 description 1
- 102100021277 Beta-secretase 2 Human genes 0.000 description 1
- 101710150190 Beta-secretase 2 Proteins 0.000 description 1
- 239000002126 C01EB10 - Adenosine Substances 0.000 description 1
- 108020004705 Codon Proteins 0.000 description 1
- MIKUYHXYGGJMLM-GIMIYPNGSA-N Crotonoside Natural products C1=NC2=C(N)NC(=O)N=C2N1[C@H]1O[C@@H](CO)[C@H](O)[C@@H]1O MIKUYHXYGGJMLM-GIMIYPNGSA-N 0.000 description 1
- PLBJMUUEGBBHRH-ZLUOBGJFSA-N Cys-Ala-Asn Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(O)=O PLBJMUUEGBBHRH-ZLUOBGJFSA-N 0.000 description 1
- NOCCABSVTRONIN-CIUDSAMLSA-N Cys-Ala-Leu Chemical compound C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)O)NC(=O)[C@H](CS)N NOCCABSVTRONIN-CIUDSAMLSA-N 0.000 description 1
- SZQCDCKIGWQAQN-FXQIFTODSA-N Cys-Arg-Ala Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(O)=O SZQCDCKIGWQAQN-FXQIFTODSA-N 0.000 description 1
- PRVVCRZLTJNPCS-FXQIFTODSA-N Cys-Arg-Asn Chemical compound C(C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CS)N)CN=C(N)N PRVVCRZLTJNPCS-FXQIFTODSA-N 0.000 description 1
- OCEHKDFAWQIBHH-FXQIFTODSA-N Cys-Arg-Cys Chemical compound C(C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CS)N)CN=C(N)N OCEHKDFAWQIBHH-FXQIFTODSA-N 0.000 description 1
- XABFFGOGKOORCG-CIUDSAMLSA-N Cys-Asp-Leu Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O XABFFGOGKOORCG-CIUDSAMLSA-N 0.000 description 1
- QADHATDBZXHRCA-ACZMJKKPSA-N Cys-Gln-Asn Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CS)N QADHATDBZXHRCA-ACZMJKKPSA-N 0.000 description 1
- SBORMUFGKSCGEN-XHNCKOQMSA-N Cys-Glu-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CS)N)C(=O)O SBORMUFGKSCGEN-XHNCKOQMSA-N 0.000 description 1
- DZLQXIFVQFTFJY-BYPYZUCNSA-N Cys-Gly-Gly Chemical compound SC[C@H](N)C(=O)NCC(=O)NCC(O)=O DZLQXIFVQFTFJY-BYPYZUCNSA-N 0.000 description 1
- LBOLGUYQEPZSKM-YUMQZZPRSA-N Cys-Gly-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CS)N LBOLGUYQEPZSKM-YUMQZZPRSA-N 0.000 description 1
- PQHYZJPCYRDYNE-QWRGUYRKSA-N Cys-Gly-Phe Chemical compound [H]N[C@@H](CS)C(=O)NCC(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O PQHYZJPCYRDYNE-QWRGUYRKSA-N 0.000 description 1
- SBDVXRYCOIEYNV-YUMQZZPRSA-N Cys-His-Gly Chemical compound C1=C(NC=N1)C[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CS)N SBDVXRYCOIEYNV-YUMQZZPRSA-N 0.000 description 1
- RRJOQIBQVZDVCW-SRVKXCTJSA-N Cys-His-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CS)N RRJOQIBQVZDVCW-SRVKXCTJSA-N 0.000 description 1
- LKUCSUGWHYVYLP-GHCJXIJMSA-N Cys-Ile-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CS)N LKUCSUGWHYVYLP-GHCJXIJMSA-N 0.000 description 1
- DYBIDOHFRRUMLW-CIUDSAMLSA-N Cys-Leu-Cys Chemical compound CC(C)C[C@H](NC(=O)[C@@H](N)CS)C(=O)N[C@@H](CS)C(O)=O DYBIDOHFRRUMLW-CIUDSAMLSA-N 0.000 description 1
- VPQZSNQICFCCSO-BJDJZHNGSA-N Cys-Leu-Ile Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O VPQZSNQICFCCSO-BJDJZHNGSA-N 0.000 description 1
- JXVFJOMFOLFPMP-KKUMJFAQSA-N Cys-Leu-Tyr Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O JXVFJOMFOLFPMP-KKUMJFAQSA-N 0.000 description 1
- YXPNKXFOBHRUBL-BJDJZHNGSA-N Cys-Lys-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CS)N YXPNKXFOBHRUBL-BJDJZHNGSA-N 0.000 description 1
- DQUWSUWXPWGTQT-DCAQKATOSA-N Cys-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CS DQUWSUWXPWGTQT-DCAQKATOSA-N 0.000 description 1
- KSMSFCBQBQPFAD-GUBZILKMSA-N Cys-Pro-Pro Chemical compound SC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 KSMSFCBQBQPFAD-GUBZILKMSA-N 0.000 description 1
- WKKKNGNJDGATNS-QEJZJMRPSA-N Cys-Trp-Glu Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCC(O)=O)C(O)=O WKKKNGNJDGATNS-QEJZJMRPSA-N 0.000 description 1
- UGPCUUWZXRMCIJ-KKUMJFAQSA-N Cys-Tyr-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CS)N UGPCUUWZXRMCIJ-KKUMJFAQSA-N 0.000 description 1
- UHDGCWIWMRVCDJ-PSQAKQOGSA-N Cytidine Natural products O=C1N=C(N)C=CN1[C@@H]1[C@@H](O)[C@@H](O)[C@H](CO)O1 UHDGCWIWMRVCDJ-PSQAKQOGSA-N 0.000 description 1
- NYHBQMYGNKIUIF-UHFFFAOYSA-N D-guanosine Natural products C1=2NC(N)=NC(=O)C=2N=CN1C1OC(CO)C(O)C1O NYHBQMYGNKIUIF-UHFFFAOYSA-N 0.000 description 1
- HMFHBZSHGGEWLO-SOOFDHNKSA-N D-ribofuranose Chemical class OC[C@H]1OC(O)[C@H](O)[C@@H]1O HMFHBZSHGGEWLO-SOOFDHNKSA-N 0.000 description 1
- 108010090461 DFG peptide Proteins 0.000 description 1
- 102000053602 DNA Human genes 0.000 description 1
- 230000004543 DNA replication Effects 0.000 description 1
- CKTSBUTUHBMZGZ-UHFFFAOYSA-N Deoxycytidine Natural products O=C1N=C(N)C=CN1C1OC(CO)C(O)C1 CKTSBUTUHBMZGZ-UHFFFAOYSA-N 0.000 description 1
- 101150029662 E1 gene Proteins 0.000 description 1
- 101150066038 E4 gene Proteins 0.000 description 1
- KCXVZYZYPLLWCC-UHFFFAOYSA-N EDTA Chemical compound OC(=O)CN(CC(O)=O)CCN(CC(O)=O)CC(O)=O KCXVZYZYPLLWCC-UHFFFAOYSA-N 0.000 description 1
- 101710199711 Early E1A protein Proteins 0.000 description 1
- 108091029865 Exogenous DNA Proteins 0.000 description 1
- 241001123946 Gaga Species 0.000 description 1
- 241000287828 Gallus gallus Species 0.000 description 1
- 108700028146 Genetic Enhancer Elements Proteins 0.000 description 1
- YJIUYQKQBBQYHZ-ACZMJKKPSA-N Gln-Ala-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O YJIUYQKQBBQYHZ-ACZMJKKPSA-N 0.000 description 1
- RZSLYUUFFVHFRQ-FXQIFTODSA-N Gln-Ala-Glu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O RZSLYUUFFVHFRQ-FXQIFTODSA-N 0.000 description 1
- UWZLBXOBVKRUFE-HGNGGELXSA-N Gln-Ala-His Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CCC(=O)N)N UWZLBXOBVKRUFE-HGNGGELXSA-N 0.000 description 1
- LKUWAWGNJYJODH-KBIXCLLPSA-N Gln-Ala-Ile Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O LKUWAWGNJYJODH-KBIXCLLPSA-N 0.000 description 1
- SHERTACNJPYHAR-ACZMJKKPSA-N Gln-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(N)=O SHERTACNJPYHAR-ACZMJKKPSA-N 0.000 description 1
- PGPJSRSLQNXBDT-YUMQZZPRSA-N Gln-Arg-Gly Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O PGPJSRSLQNXBDT-YUMQZZPRSA-N 0.000 description 1
- ZFADFBPRMSBPOT-KKUMJFAQSA-N Gln-Arg-Phe Chemical compound N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](Cc1ccccc1)C(O)=O ZFADFBPRMSBPOT-KKUMJFAQSA-N 0.000 description 1
- MINZLORERLNSPP-ACZMJKKPSA-N Gln-Asn-Cys Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CS)C(=O)O)N MINZLORERLNSPP-ACZMJKKPSA-N 0.000 description 1
- AAOBFSKXAVIORT-GUBZILKMSA-N Gln-Asn-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O AAOBFSKXAVIORT-GUBZILKMSA-N 0.000 description 1
- WQWMZOIPXWSZNE-WDSKDSINSA-N Gln-Asp-Gly Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O WQWMZOIPXWSZNE-WDSKDSINSA-N 0.000 description 1
- JFSNBQJNDMXMQF-XHNCKOQMSA-N Gln-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCC(=O)N)N)C(=O)O JFSNBQJNDMXMQF-XHNCKOQMSA-N 0.000 description 1
- GNDJOCGXGLNCKY-ACZMJKKPSA-N Gln-Cys-Cys Chemical compound N[C@@H](CCC(N)=O)C(=O)N[C@@H](CS)C(=O)N[C@@H](CS)C(O)=O GNDJOCGXGLNCKY-ACZMJKKPSA-N 0.000 description 1
- UVAOVENCIONMJP-GUBZILKMSA-N Gln-Cys-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(O)=O UVAOVENCIONMJP-GUBZILKMSA-N 0.000 description 1
- LPYPANUXJGFMGV-FXQIFTODSA-N Gln-Gln-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCC(=O)N)N LPYPANUXJGFMGV-FXQIFTODSA-N 0.000 description 1
- NKCZYEDZTKOFBG-GUBZILKMSA-N Gln-Gln-Arg Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O NKCZYEDZTKOFBG-GUBZILKMSA-N 0.000 description 1
- KVXVVDFOZNYYKZ-DCAQKATOSA-N Gln-Gln-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O KVXVVDFOZNYYKZ-DCAQKATOSA-N 0.000 description 1
- MADFVRSKEIEZHZ-DCAQKATOSA-N Gln-Gln-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCC(=O)N)N MADFVRSKEIEZHZ-DCAQKATOSA-N 0.000 description 1
- BLOXULLYFRGYKZ-GUBZILKMSA-N Gln-Glu-Arg Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O BLOXULLYFRGYKZ-GUBZILKMSA-N 0.000 description 1
- SNLOOPZHAQDMJG-CIUDSAMLSA-N Gln-Glu-Glu Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O SNLOOPZHAQDMJG-CIUDSAMLSA-N 0.000 description 1
- PXAFHUATEHLECW-GUBZILKMSA-N Gln-Glu-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCC(=O)N)N PXAFHUATEHLECW-GUBZILKMSA-N 0.000 description 1
- LFIVHGMKWFGUGK-IHRRRGAJSA-N Gln-Glu-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCC(=O)N)N LFIVHGMKWFGUGK-IHRRRGAJSA-N 0.000 description 1
- VSXBYIJUAXPAAL-WDSKDSINSA-N Gln-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)[C@@H](N)CCC(N)=O VSXBYIJUAXPAAL-WDSKDSINSA-N 0.000 description 1
- FGYPOQPQTUNESW-IUCAKERBSA-N Gln-Gly-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CCC(=O)N)N FGYPOQPQTUNESW-IUCAKERBSA-N 0.000 description 1
- NSORZJXKUQFEKL-JGVFFNPUSA-N Gln-Gly-Pro Chemical compound C1C[C@@H](N(C1)C(=O)CNC(=O)[C@H](CCC(=O)N)N)C(=O)O NSORZJXKUQFEKL-JGVFFNPUSA-N 0.000 description 1
- JXFLPKSDLDEOQK-JHEQGTHGSA-N Gln-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CCC(N)=O JXFLPKSDLDEOQK-JHEQGTHGSA-N 0.000 description 1
- YXQCLIVLWCKCRS-RYUDHWBXSA-N Gln-Gly-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CCC(=O)N)N)O YXQCLIVLWCKCRS-RYUDHWBXSA-N 0.000 description 1
- IWUFOVSLWADEJC-AVGNSLFASA-N Gln-His-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(O)=O IWUFOVSLWADEJC-AVGNSLFASA-N 0.000 description 1
- TWTWUBHEWQPMQW-ZPFDUUQYSA-N Gln-Ile-Arg Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O TWTWUBHEWQPMQW-ZPFDUUQYSA-N 0.000 description 1
- HXOLDXKNWKLDMM-YVNDNENWSA-N Gln-Ile-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CCC(=O)N)N HXOLDXKNWKLDMM-YVNDNENWSA-N 0.000 description 1
- ITZWDGBYBPUZRG-KBIXCLLPSA-N Gln-Ile-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O ITZWDGBYBPUZRG-KBIXCLLPSA-N 0.000 description 1
- HYPVLWGNBIYTNA-GUBZILKMSA-N Gln-Leu-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O HYPVLWGNBIYTNA-GUBZILKMSA-N 0.000 description 1
- HWEINOMSWQSJDC-SRVKXCTJSA-N Gln-Leu-Arg Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O HWEINOMSWQSJDC-SRVKXCTJSA-N 0.000 description 1
- XFAUJGNLHIGXET-AVGNSLFASA-N Gln-Leu-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O XFAUJGNLHIGXET-AVGNSLFASA-N 0.000 description 1
- SHAUZYVSXAMYAZ-JYJNAYRXSA-N Gln-Leu-Phe Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CCC(=O)N)N SHAUZYVSXAMYAZ-JYJNAYRXSA-N 0.000 description 1
- MLSKFHLRFVGNLL-WDCWCFNPSA-N Gln-Leu-Thr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MLSKFHLRFVGNLL-WDCWCFNPSA-N 0.000 description 1
- SXGMGNZEHFORAV-IUCAKERBSA-N Gln-Lys-Gly Chemical compound C(CCN)C[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CCC(=O)N)N SXGMGNZEHFORAV-IUCAKERBSA-N 0.000 description 1
- LURQDGKYBFWWJA-MNXVOIDGSA-N Gln-Lys-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CCC(=O)N)N LURQDGKYBFWWJA-MNXVOIDGSA-N 0.000 description 1
- DOQUICBEISTQHE-CIUDSAMLSA-N Gln-Pro-Asp Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O DOQUICBEISTQHE-CIUDSAMLSA-N 0.000 description 1
- HMIXCETWRYDVMO-GUBZILKMSA-N Gln-Pro-Glu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O HMIXCETWRYDVMO-GUBZILKMSA-N 0.000 description 1
- UWMDGPFFTKDUIY-HJGDQZAQSA-N Gln-Pro-Thr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O UWMDGPFFTKDUIY-HJGDQZAQSA-N 0.000 description 1
- DCWNCMRZIZSZBL-KKUMJFAQSA-N Gln-Pro-Tyr Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CCC(=O)N)N)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O DCWNCMRZIZSZBL-KKUMJFAQSA-N 0.000 description 1
- YPFFHGRJCUBXPX-NHCYSSNCSA-N Gln-Pro-Val Chemical compound CC(C)[C@H](NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CCC(N)=O)C(O)=O YPFFHGRJCUBXPX-NHCYSSNCSA-N 0.000 description 1
- KVQOVQVGVKDZNW-GUBZILKMSA-N Gln-Ser-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCC(=O)N)N KVQOVQVGVKDZNW-GUBZILKMSA-N 0.000 description 1
- ZGHMRONFHDVXEF-AVGNSLFASA-N Gln-Ser-Phe Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O ZGHMRONFHDVXEF-AVGNSLFASA-N 0.000 description 1
- JILRMFFFCHUUTJ-ACZMJKKPSA-N Gln-Ser-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O JILRMFFFCHUUTJ-ACZMJKKPSA-N 0.000 description 1
- BYKZWDGMJLNFJY-XKBZYTNZSA-N Gln-Ser-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCC(=O)N)N)O BYKZWDGMJLNFJY-XKBZYTNZSA-N 0.000 description 1
- DYVMTEWCGAVKSE-HJGDQZAQSA-N Gln-Thr-Arg Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](CCC(=O)N)N)O DYVMTEWCGAVKSE-HJGDQZAQSA-N 0.000 description 1
- VLOLPWWCNKWRNB-LOKLDPHHSA-N Gln-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)N)N)O VLOLPWWCNKWRNB-LOKLDPHHSA-N 0.000 description 1
- HLRLXVPRJJITSK-IFFSRLJSSA-N Gln-Thr-Val Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O HLRLXVPRJJITSK-IFFSRLJSSA-N 0.000 description 1
- GTBXHETZPUURJE-KKUMJFAQSA-N Gln-Tyr-Arg Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O GTBXHETZPUURJE-KKUMJFAQSA-N 0.000 description 1
- VDMABHYXBULDGN-LAEOZQHASA-N Gln-Val-Asp Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O VDMABHYXBULDGN-LAEOZQHASA-N 0.000 description 1
- ZFBBMCKQSNJZSN-AUTRQRHGSA-N Gln-Val-Gln Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O ZFBBMCKQSNJZSN-AUTRQRHGSA-N 0.000 description 1
- UTKUTMJSWKKHEM-WDSKDSINSA-N Glu-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(O)=O UTKUTMJSWKKHEM-WDSKDSINSA-N 0.000 description 1
- JJKKWYQVHRUSDG-GUBZILKMSA-N Glu-Ala-Lys Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCCN)C(O)=O JJKKWYQVHRUSDG-GUBZILKMSA-N 0.000 description 1
- IRDASPPCLZIERZ-XHNCKOQMSA-N Glu-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)O)N IRDASPPCLZIERZ-XHNCKOQMSA-N 0.000 description 1
- RSUVOPBMWMTVDI-XEGUGMAKSA-N Glu-Ala-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)CCC(O)=O)C)C(O)=O)=CNC2=C1 RSUVOPBMWMTVDI-XEGUGMAKSA-N 0.000 description 1
- WOMUDRVDJMHTCV-DCAQKATOSA-N Glu-Arg-Arg Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O WOMUDRVDJMHTCV-DCAQKATOSA-N 0.000 description 1
- CGYDXNKRIMJMLV-GUBZILKMSA-N Glu-Arg-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O CGYDXNKRIMJMLV-GUBZILKMSA-N 0.000 description 1
- WOSRKEJQESVHGA-CIUDSAMLSA-N Glu-Arg-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O WOSRKEJQESVHGA-CIUDSAMLSA-N 0.000 description 1
- DYFJZDDQPNIPAB-NHCYSSNCSA-N Glu-Arg-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(O)=O DYFJZDDQPNIPAB-NHCYSSNCSA-N 0.000 description 1
- FLLRAEJOLZPSMN-CIUDSAMLSA-N Glu-Asn-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N FLLRAEJOLZPSMN-CIUDSAMLSA-N 0.000 description 1
- YYOBUPFZLKQUAX-FXQIFTODSA-N Glu-Asn-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O YYOBUPFZLKQUAX-FXQIFTODSA-N 0.000 description 1
- RDDSZZJOKDVPAE-ACZMJKKPSA-N Glu-Asn-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O RDDSZZJOKDVPAE-ACZMJKKPSA-N 0.000 description 1
- PCBBLFVHTYNQGG-LAEOZQHASA-N Glu-Asn-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCC(=O)O)N PCBBLFVHTYNQGG-LAEOZQHASA-N 0.000 description 1
- RDPOETHPAQEGDP-ACZMJKKPSA-N Glu-Asp-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O RDPOETHPAQEGDP-ACZMJKKPSA-N 0.000 description 1
- QPRZKNOOOBWXSU-CIUDSAMLSA-N Glu-Asp-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N QPRZKNOOOBWXSU-CIUDSAMLSA-N 0.000 description 1
- JPHYJQHPILOKHC-ACZMJKKPSA-N Glu-Asp-Asp Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O JPHYJQHPILOKHC-ACZMJKKPSA-N 0.000 description 1
- XXCDTYBVGMPIOA-FXQIFTODSA-N Glu-Asp-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O XXCDTYBVGMPIOA-FXQIFTODSA-N 0.000 description 1
- GZWOBWMOMPFPCD-CIUDSAMLSA-N Glu-Asp-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCC(=O)O)N GZWOBWMOMPFPCD-CIUDSAMLSA-N 0.000 description 1
- CKOFNWCLWRYUHK-XHNCKOQMSA-N Glu-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCC(=O)O)N)C(=O)O CKOFNWCLWRYUHK-XHNCKOQMSA-N 0.000 description 1
- WATXSTJXNBOHKD-LAEOZQHASA-N Glu-Asp-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O WATXSTJXNBOHKD-LAEOZQHASA-N 0.000 description 1
- KLJMRPIBBLTDGE-ACZMJKKPSA-N Glu-Cys-Asn Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(N)=O)C(O)=O KLJMRPIBBLTDGE-ACZMJKKPSA-N 0.000 description 1
- PKYAVRMYTBBRLS-FXQIFTODSA-N Glu-Cys-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCC(O)=O)C(O)=O PKYAVRMYTBBRLS-FXQIFTODSA-N 0.000 description 1
- PNAOVYHADQRJQU-GUBZILKMSA-N Glu-Cys-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CCC(=O)O)N PNAOVYHADQRJQU-GUBZILKMSA-N 0.000 description 1
- ZXLZWUQBRYGDNS-CIUDSAMLSA-N Glu-Cys-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CCC(=O)O)N ZXLZWUQBRYGDNS-CIUDSAMLSA-N 0.000 description 1
- FKGNJUCQKXQNRA-NRPADANISA-N Glu-Cys-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CS)NC(=O)[C@@H](N)CCC(O)=O FKGNJUCQKXQNRA-NRPADANISA-N 0.000 description 1
- WPLGNDORMXTMQS-FXQIFTODSA-N Glu-Gln-Ser Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O WPLGNDORMXTMQS-FXQIFTODSA-N 0.000 description 1
- VFZIDQZAEBORGY-GLLZPBPUSA-N Glu-Gln-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O VFZIDQZAEBORGY-GLLZPBPUSA-N 0.000 description 1
- HTTSBEBKVNEDFE-AUTRQRHGSA-N Glu-Gln-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCC(=O)O)N HTTSBEBKVNEDFE-AUTRQRHGSA-N 0.000 description 1
- ILGFBUGLBSAQQB-GUBZILKMSA-N Glu-Glu-Arg Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O ILGFBUGLBSAQQB-GUBZILKMSA-N 0.000 description 1
- NKLRYVLERDYDBI-FXQIFTODSA-N Glu-Glu-Asp Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O NKLRYVLERDYDBI-FXQIFTODSA-N 0.000 description 1
- NUSWUSKZRCGFEX-FXQIFTODSA-N Glu-Glu-Cys Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CS)C(O)=O NUSWUSKZRCGFEX-FXQIFTODSA-N 0.000 description 1
- APHGWLWMOXGZRL-DCAQKATOSA-N Glu-Glu-His Chemical compound N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](Cc1cnc[nH]1)C(O)=O APHGWLWMOXGZRL-DCAQKATOSA-N 0.000 description 1
- AUTNXSQEVVHSJK-YVNDNENWSA-N Glu-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCC(O)=O AUTNXSQEVVHSJK-YVNDNENWSA-N 0.000 description 1
- MUSGDMDGNGXULI-DCAQKATOSA-N Glu-Glu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCC(O)=O MUSGDMDGNGXULI-DCAQKATOSA-N 0.000 description 1
- KASDBWKLWJKTLJ-GUBZILKMSA-N Glu-Glu-Met Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCSC)C(O)=O KASDBWKLWJKTLJ-GUBZILKMSA-N 0.000 description 1
- KUTPGXNAAOQSPD-LPEHRKFASA-N Glu-Glu-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCC(=O)O)N)C(=O)O KUTPGXNAAOQSPD-LPEHRKFASA-N 0.000 description 1
- QYPKJXSMLMREKF-BPUTZDHNSA-N Glu-Glu-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCC(=O)O)N QYPKJXSMLMREKF-BPUTZDHNSA-N 0.000 description 1
- BUAKRRKDHSSIKK-IHRRRGAJSA-N Glu-Glu-Tyr Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 BUAKRRKDHSSIKK-IHRRRGAJSA-N 0.000 description 1
- PXXGVUVQWQGGIG-YUMQZZPRSA-N Glu-Gly-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCN=C(N)N PXXGVUVQWQGGIG-YUMQZZPRSA-N 0.000 description 1
- WRNAXCVRSBBKGS-BQBZGAKWSA-N Glu-Gly-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H](CCC(N)=O)C(O)=O WRNAXCVRSBBKGS-BQBZGAKWSA-N 0.000 description 1
- OGNJZUXUTPQVBR-BQBZGAKWSA-N Glu-Gly-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O OGNJZUXUTPQVBR-BQBZGAKWSA-N 0.000 description 1
- ZWQVYZXPYSYPJD-RYUDHWBXSA-N Glu-Gly-Phe Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 ZWQVYZXPYSYPJD-RYUDHWBXSA-N 0.000 description 1
- VXQOONWNIWFOCS-HGNGGELXSA-N Glu-His-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CCC(=O)O)N VXQOONWNIWFOCS-HGNGGELXSA-N 0.000 description 1
- DVLZZEPUNFEUBW-AVGNSLFASA-N Glu-His-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CCC(=O)O)N DVLZZEPUNFEUBW-AVGNSLFASA-N 0.000 description 1
- WVTIBGWZUMJBFY-GUBZILKMSA-N Glu-His-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(O)=O WVTIBGWZUMJBFY-GUBZILKMSA-N 0.000 description 1
- ZPASCJBSSCRWMC-GVXVVHGQSA-N Glu-His-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CCC(=O)O)N ZPASCJBSSCRWMC-GVXVVHGQSA-N 0.000 description 1
- GXMXPCXXKVWOSM-KQXIARHKSA-N Glu-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)O)N GXMXPCXXKVWOSM-KQXIARHKSA-N 0.000 description 1
- VSRCAOIHMGCIJK-SRVKXCTJSA-N Glu-Leu-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O VSRCAOIHMGCIJK-SRVKXCTJSA-N 0.000 description 1
- VMKCPNBBPGGQBJ-GUBZILKMSA-N Glu-Leu-Asn Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCC(=O)O)N VMKCPNBBPGGQBJ-GUBZILKMSA-N 0.000 description 1
- ATVYZJGOZLVXDK-IUCAKERBSA-N Glu-Leu-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O ATVYZJGOZLVXDK-IUCAKERBSA-N 0.000 description 1
- VGBSZQSKQRMLHD-MNXVOIDGSA-N Glu-Leu-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O VGBSZQSKQRMLHD-MNXVOIDGSA-N 0.000 description 1
- UGSVSNXPJJDJKL-SDDRHHMPSA-N Glu-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)O)N UGSVSNXPJJDJKL-SDDRHHMPSA-N 0.000 description 1
- FBEJIDRSQCGFJI-GUBZILKMSA-N Glu-Leu-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O FBEJIDRSQCGFJI-GUBZILKMSA-N 0.000 description 1
- HRBYTAIBKPNZKQ-AVGNSLFASA-N Glu-Lys-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCC(O)=O HRBYTAIBKPNZKQ-AVGNSLFASA-N 0.000 description 1
- RBXSZQRSEGYDFG-GUBZILKMSA-N Glu-Lys-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O RBXSZQRSEGYDFG-GUBZILKMSA-N 0.000 description 1
- ZWMYUDZLXAQHCK-CIUDSAMLSA-N Glu-Met-Asp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(O)=O)C(O)=O ZWMYUDZLXAQHCK-CIUDSAMLSA-N 0.000 description 1
- XNOWYPDMSLSRKP-GUBZILKMSA-N Glu-Met-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCC(N)=O)C(O)=O XNOWYPDMSLSRKP-GUBZILKMSA-N 0.000 description 1
- CBEUFCJRFNZMCU-SRVKXCTJSA-N Glu-Met-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(O)=O CBEUFCJRFNZMCU-SRVKXCTJSA-N 0.000 description 1
- JDUKCSSHWNIQQZ-IHRRRGAJSA-N Glu-Phe-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O JDUKCSSHWNIQQZ-IHRRRGAJSA-N 0.000 description 1
- JZJGEKDPWVJOLD-QEWYBTABSA-N Glu-Phe-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JZJGEKDPWVJOLD-QEWYBTABSA-N 0.000 description 1
- QNJNPKSWAHPYGI-JYJNAYRXSA-N Glu-Phe-Leu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C(O)=O)CC1=CC=CC=C1 QNJNPKSWAHPYGI-JYJNAYRXSA-N 0.000 description 1
- QJVZSVUYZFYLFQ-CIUDSAMLSA-N Glu-Pro-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O QJVZSVUYZFYLFQ-CIUDSAMLSA-N 0.000 description 1
- DXVOKNVIKORTHQ-GUBZILKMSA-N Glu-Pro-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O DXVOKNVIKORTHQ-GUBZILKMSA-N 0.000 description 1
- DCBSZJJHOTXMHY-DCAQKATOSA-N Glu-Pro-Pro Chemical compound OC(=O)CC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 DCBSZJJHOTXMHY-DCAQKATOSA-N 0.000 description 1
- NNQDRRUXFJYCCJ-NHCYSSNCSA-N Glu-Pro-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O NNQDRRUXFJYCCJ-NHCYSSNCSA-N 0.000 description 1
- DAHLWSFUXOHMIA-FXQIFTODSA-N Glu-Ser-Gln Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(O)=O DAHLWSFUXOHMIA-FXQIFTODSA-N 0.000 description 1
- GMVCSRBOSIUTFC-FXQIFTODSA-N Glu-Ser-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(O)=O GMVCSRBOSIUTFC-FXQIFTODSA-N 0.000 description 1
- RFTVTKBHDXCEEX-WDSKDSINSA-N Glu-Ser-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)NCC(O)=O RFTVTKBHDXCEEX-WDSKDSINSA-N 0.000 description 1
- HMJULNMJWOZNFI-XHNCKOQMSA-N Glu-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CCC(=O)O)N)C(=O)O HMJULNMJWOZNFI-XHNCKOQMSA-N 0.000 description 1
- VNCNWQPIQYAMAK-ACZMJKKPSA-N Glu-Ser-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O VNCNWQPIQYAMAK-ACZMJKKPSA-N 0.000 description 1
- WXONSNSSBYQGNN-AVGNSLFASA-N Glu-Ser-Tyr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O WXONSNSSBYQGNN-AVGNSLFASA-N 0.000 description 1
- DMYACXMQUABZIQ-NRPADANISA-N Glu-Ser-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O DMYACXMQUABZIQ-NRPADANISA-N 0.000 description 1
- GPSHCSTUYOQPAI-JHEQGTHGSA-N Glu-Thr-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O GPSHCSTUYOQPAI-JHEQGTHGSA-N 0.000 description 1
- YQAQQKPWFOBSMU-WDCWCFNPSA-N Glu-Thr-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O YQAQQKPWFOBSMU-WDCWCFNPSA-N 0.000 description 1
- VHPVBPCCWVDGJL-IRIUXVKKSA-N Glu-Thr-Tyr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O VHPVBPCCWVDGJL-IRIUXVKKSA-N 0.000 description 1
- DLISPGXMKZTWQG-IFFSRLJSSA-N Glu-Thr-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O DLISPGXMKZTWQG-IFFSRLJSSA-N 0.000 description 1
- RXJFSLQVMGYQEL-IHRRRGAJSA-N Glu-Tyr-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CCC(O)=O)C(O)=O)CC1=CC=C(O)C=C1 RXJFSLQVMGYQEL-IHRRRGAJSA-N 0.000 description 1
- HQTDNEZTGZUWSY-XVKPBYJWSA-N Glu-Val-Gly Chemical compound CC(C)[C@H](NC(=O)[C@@H](N)CCC(O)=O)C(=O)NCC(O)=O HQTDNEZTGZUWSY-XVKPBYJWSA-N 0.000 description 1
- RMWAOBGCZZSJHE-UMNHJUIQSA-N Glu-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)O)N RMWAOBGCZZSJHE-UMNHJUIQSA-N 0.000 description 1
- MFVQGXGQRIXBPK-WDSKDSINSA-N Gly-Ala-Glu Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O MFVQGXGQRIXBPK-WDSKDSINSA-N 0.000 description 1
- JBRBACJPBZNFMF-YUMQZZPRSA-N Gly-Ala-Lys Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCCN JBRBACJPBZNFMF-YUMQZZPRSA-N 0.000 description 1
- LERGJIVJIIODPZ-ZANVPECISA-N Gly-Ala-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](NC(=O)CN)C)C(O)=O)=CNC2=C1 LERGJIVJIIODPZ-ZANVPECISA-N 0.000 description 1
- QIZJOTQTCAGKPU-KWQFWETISA-N Gly-Ala-Tyr Chemical compound [NH3+]CC(=O)N[C@@H](C)C(=O)N[C@H](C([O-])=O)CC1=CC=C(O)C=C1 QIZJOTQTCAGKPU-KWQFWETISA-N 0.000 description 1
- XUDLUKYPXQDCRX-BQBZGAKWSA-N Gly-Arg-Asn Chemical compound [H]NCC(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(O)=O XUDLUKYPXQDCRX-BQBZGAKWSA-N 0.000 description 1
- OVSKVOOUFAKODB-UWVGGRQHSA-N Gly-Arg-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCN=C(N)N OVSKVOOUFAKODB-UWVGGRQHSA-N 0.000 description 1
- KRRMJKMGWWXWDW-STQMWFEESA-N Gly-Arg-Phe Chemical compound NC(=N)NCCC[C@H](NC(=O)CN)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 KRRMJKMGWWXWDW-STQMWFEESA-N 0.000 description 1
- KKBWDNZXYLGJEY-UHFFFAOYSA-N Gly-Arg-Pro Natural products NCC(=O)NC(CCNC(=N)N)C(=O)N1CCCC1C(=O)O KKBWDNZXYLGJEY-UHFFFAOYSA-N 0.000 description 1
- VXKCPBPQEKKERH-IUCAKERBSA-N Gly-Arg-Pro Chemical compound NC(N)=NCCC[C@H](NC(=O)CN)C(=O)N1CCC[C@H]1C(O)=O VXKCPBPQEKKERH-IUCAKERBSA-N 0.000 description 1
- WKJKBELXHCTHIJ-WPRPVWTQSA-N Gly-Arg-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCN=C(N)N WKJKBELXHCTHIJ-WPRPVWTQSA-N 0.000 description 1
- DWUKOTKSTDWGAE-BQBZGAKWSA-N Gly-Asn-Arg Chemical compound NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N DWUKOTKSTDWGAE-BQBZGAKWSA-N 0.000 description 1
- AIJAPFVDBFYNKN-WHFBIAKZSA-N Gly-Asn-Asp Chemical compound C([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)CN)C(=O)N AIJAPFVDBFYNKN-WHFBIAKZSA-N 0.000 description 1
- XEJTYSCIXKYSHR-WDSKDSINSA-N Gly-Asp-Gln Chemical compound C(CC(=O)N)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)CN XEJTYSCIXKYSHR-WDSKDSINSA-N 0.000 description 1
- FZQLXNIMCPJVJE-YUMQZZPRSA-N Gly-Asp-Leu Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O FZQLXNIMCPJVJE-YUMQZZPRSA-N 0.000 description 1
- MHHUEAIBJZWDBH-YUMQZZPRSA-N Gly-Asp-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)CN MHHUEAIBJZWDBH-YUMQZZPRSA-N 0.000 description 1
- LXXLEUBUOMCAMR-NKWVEPMBSA-N Gly-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)CN)C(=O)O LXXLEUBUOMCAMR-NKWVEPMBSA-N 0.000 description 1
- LCNXZQROPKFGQK-WHFBIAKZSA-N Gly-Asp-Ser Chemical compound NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O LCNXZQROPKFGQK-WHFBIAKZSA-N 0.000 description 1
- BULIVUZUDBHKKZ-WDSKDSINSA-N Gly-Gln-Asn Chemical compound NCC(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O BULIVUZUDBHKKZ-WDSKDSINSA-N 0.000 description 1
- LXXANCRPFBSSKS-IUCAKERBSA-N Gly-Gln-Leu Chemical compound [H]NCC(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O LXXANCRPFBSSKS-IUCAKERBSA-N 0.000 description 1
- GNPVTZJUUBPZKW-WDSKDSINSA-N Gly-Gln-Ser Chemical compound [H]NCC(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O GNPVTZJUUBPZKW-WDSKDSINSA-N 0.000 description 1
- NPSWCZIRBAYNSB-JHEQGTHGSA-N Gly-Gln-Thr Chemical compound [H]NCC(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O NPSWCZIRBAYNSB-JHEQGTHGSA-N 0.000 description 1
- QPDUVFSVVAOUHE-XVKPBYJWSA-N Gly-Gln-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CCC(N)=O)NC(=O)CN)C(O)=O QPDUVFSVVAOUHE-XVKPBYJWSA-N 0.000 description 1
- MOJKRXIRAZPZLW-WDSKDSINSA-N Gly-Glu-Ala Chemical compound [H]NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O MOJKRXIRAZPZLW-WDSKDSINSA-N 0.000 description 1
- DHDOADIPGZTAHT-YUMQZZPRSA-N Gly-Glu-Arg Chemical compound NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N DHDOADIPGZTAHT-YUMQZZPRSA-N 0.000 description 1
- HDNXXTBKOJKWNN-WDSKDSINSA-N Gly-Glu-Asn Chemical compound NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O HDNXXTBKOJKWNN-WDSKDSINSA-N 0.000 description 1
- FIQQRCFQXGLOSZ-WDSKDSINSA-N Gly-Glu-Asp Chemical compound [H]NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O FIQQRCFQXGLOSZ-WDSKDSINSA-N 0.000 description 1
- XTQFHTHIAKKCTM-YFKPBYRVSA-N Gly-Glu-Gly Chemical compound NCC(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O XTQFHTHIAKKCTM-YFKPBYRVSA-N 0.000 description 1
- BEQGFMIBZFNROK-JGVFFNPUSA-N Gly-Glu-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)O)NC(=O)CN)C(=O)O BEQGFMIBZFNROK-JGVFFNPUSA-N 0.000 description 1
- JNGJGFMFXREJNF-KBPBESRZSA-N Gly-Glu-Trp Chemical compound [H]NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O JNGJGFMFXREJNF-KBPBESRZSA-N 0.000 description 1
- HQRHFUYMGCHHJS-LURJTMIESA-N Gly-Gly-Arg Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CCCN=C(N)N HQRHFUYMGCHHJS-LURJTMIESA-N 0.000 description 1
- UFPXDFOYHVEIPI-BYPYZUCNSA-N Gly-Gly-Asp Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CC(O)=O UFPXDFOYHVEIPI-BYPYZUCNSA-N 0.000 description 1
- XMPXVJIDADUOQB-RCOVLWMOSA-N Gly-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C([O-])=O)NC(=O)CNC(=O)C[NH3+] XMPXVJIDADUOQB-RCOVLWMOSA-N 0.000 description 1
- KAJAOGBVWCYGHZ-JTQLQIEISA-N Gly-Gly-Phe Chemical compound [NH3+]CC(=O)NCC(=O)N[C@H](C([O-])=O)CC1=CC=CC=C1 KAJAOGBVWCYGHZ-JTQLQIEISA-N 0.000 description 1
- FQKKPCWTZZEDIC-XPUUQOCRSA-N Gly-His-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H](NC(=O)CN)CC1=CN=CN1 FQKKPCWTZZEDIC-XPUUQOCRSA-N 0.000 description 1
- MVORZMQFXBLMHM-QWRGUYRKSA-N Gly-His-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CN=CN1 MVORZMQFXBLMHM-QWRGUYRKSA-N 0.000 description 1
- YFGONBOFGGWKKY-VHSXEESVSA-N Gly-His-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CN=CN2)NC(=O)CN)C(=O)O YFGONBOFGGWKKY-VHSXEESVSA-N 0.000 description 1
- QSVMIMFAAZPCAQ-PMVVWTBXSA-N Gly-His-Thr Chemical compound [H]NCC(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O QSVMIMFAAZPCAQ-PMVVWTBXSA-N 0.000 description 1
- HKSNHPVETYYJBK-LAEOZQHASA-N Gly-Ile-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)CN HKSNHPVETYYJBK-LAEOZQHASA-N 0.000 description 1
- SCWYHUQOOFRVHP-MBLNEYKQSA-N Gly-Ile-Thr Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SCWYHUQOOFRVHP-MBLNEYKQSA-N 0.000 description 1
- LHYJCVCQPWRMKZ-WEDXCCLWSA-N Gly-Leu-Thr Chemical compound [H]NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LHYJCVCQPWRMKZ-WEDXCCLWSA-N 0.000 description 1
- BXICSAQLIHFDDL-YUMQZZPRSA-N Gly-Lys-Asn Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O BXICSAQLIHFDDL-YUMQZZPRSA-N 0.000 description 1
- LOEANKRDMMVOGZ-YUMQZZPRSA-N Gly-Lys-Asp Chemical compound NCCCC[C@H](NC(=O)CN)C(=O)N[C@@H](CC(O)=O)C(O)=O LOEANKRDMMVOGZ-YUMQZZPRSA-N 0.000 description 1
- PCPOYRCAHPJXII-UWVGGRQHSA-N Gly-Lys-Met Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCSC)C(O)=O PCPOYRCAHPJXII-UWVGGRQHSA-N 0.000 description 1
- NTBOEZICHOSJEE-YUMQZZPRSA-N Gly-Lys-Ser Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O NTBOEZICHOSJEE-YUMQZZPRSA-N 0.000 description 1
- FXGRXIATVXUAHO-WEDXCCLWSA-N Gly-Lys-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCCN FXGRXIATVXUAHO-WEDXCCLWSA-N 0.000 description 1
- BBTCXWTXOXUNFX-IUCAKERBSA-N Gly-Met-Arg Chemical compound CSCC[C@H](NC(=O)CN)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O BBTCXWTXOXUNFX-IUCAKERBSA-N 0.000 description 1
- QGDOOCIPHSSADO-STQMWFEESA-N Gly-Met-Phe Chemical compound [H]NCC(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O QGDOOCIPHSSADO-STQMWFEESA-N 0.000 description 1
- IGOYNRWLWHWAQO-JTQLQIEISA-N Gly-Phe-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)CN)CC1=CC=CC=C1 IGOYNRWLWHWAQO-JTQLQIEISA-N 0.000 description 1
- YLEIWGJJBFBFHC-KBPBESRZSA-N Gly-Phe-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CC=CC=C1 YLEIWGJJBFBFHC-KBPBESRZSA-N 0.000 description 1
- FEUPVVCGQLNXNP-IRXDYDNUSA-N Gly-Phe-Phe Chemical compound C([C@H](NC(=O)CN)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 FEUPVVCGQLNXNP-IRXDYDNUSA-N 0.000 description 1
- VDCRBJACQKOSMS-JSGCOSHPSA-N Gly-Phe-Val Chemical compound [H]NCC(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(O)=O VDCRBJACQKOSMS-JSGCOSHPSA-N 0.000 description 1
- JYPCXBJRLBHWME-IUCAKERBSA-N Gly-Pro-Arg Chemical compound NCC(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(O)=O JYPCXBJRLBHWME-IUCAKERBSA-N 0.000 description 1
- SCJJPCQUJYPHRZ-BQBZGAKWSA-N Gly-Pro-Asn Chemical compound NCC(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O SCJJPCQUJYPHRZ-BQBZGAKWSA-N 0.000 description 1
- HJARVELKOSZUEW-YUMQZZPRSA-N Gly-Pro-Gln Chemical compound [H]NCC(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O HJARVELKOSZUEW-YUMQZZPRSA-N 0.000 description 1
- JJGBXTYGTKWGAT-YUMQZZPRSA-N Gly-Pro-Glu Chemical compound NCC(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O JJGBXTYGTKWGAT-YUMQZZPRSA-N 0.000 description 1
- NSVOVKWEKGEOQB-LURJTMIESA-N Gly-Pro-Gly Chemical compound NCC(=O)N1CCC[C@H]1C(=O)NCC(O)=O NSVOVKWEKGEOQB-LURJTMIESA-N 0.000 description 1
- HFPVRZWORNJRRC-UWVGGRQHSA-N Gly-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)CN HFPVRZWORNJRRC-UWVGGRQHSA-N 0.000 description 1
- OOCFXNOVSLSHAB-IUCAKERBSA-N Gly-Pro-Pro Chemical compound NCC(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 OOCFXNOVSLSHAB-IUCAKERBSA-N 0.000 description 1
- OHUKZZYSJBKFRR-WHFBIAKZSA-N Gly-Ser-Asp Chemical compound [H]NCC(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O OHUKZZYSJBKFRR-WHFBIAKZSA-N 0.000 description 1
- FFJQHWKSGAWSTJ-BFHQHQDPSA-N Gly-Thr-Ala Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O FFJQHWKSGAWSTJ-BFHQHQDPSA-N 0.000 description 1
- CQMFNTVQVLQRLT-JHEQGTHGSA-N Gly-Thr-Gln Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(O)=O CQMFNTVQVLQRLT-JHEQGTHGSA-N 0.000 description 1
- FOKISINOENBSDM-WLTAIBSBSA-N Gly-Thr-Tyr Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O FOKISINOENBSDM-WLTAIBSBSA-N 0.000 description 1
- SFOXOSKVTLDEDM-HOTGVXAUSA-N Gly-Trp-Leu Chemical compound C1=CC=C2C(C[C@@H](C(=O)N[C@@H](CC(C)C)C(O)=O)NC(=O)CN)=CNC2=C1 SFOXOSKVTLDEDM-HOTGVXAUSA-N 0.000 description 1
- IROABALAWGJQGM-OALUTQOASA-N Gly-Trp-Tyr Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC3=CC=C(C=C3)O)C(=O)O)NC(=O)CN IROABALAWGJQGM-OALUTQOASA-N 0.000 description 1
- YJDALMUYJIENAG-QWRGUYRKSA-N Gly-Tyr-Asn Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)CN)O YJDALMUYJIENAG-QWRGUYRKSA-N 0.000 description 1
- UVTSZKIATYSKIR-RYUDHWBXSA-N Gly-Tyr-Glu Chemical compound [H]NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O UVTSZKIATYSKIR-RYUDHWBXSA-N 0.000 description 1
- KBBFOULZCHWGJX-KBPBESRZSA-N Gly-Tyr-His Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)NC(=O)CN)O KBBFOULZCHWGJX-KBPBESRZSA-N 0.000 description 1
- DUAWRXXTOQOECJ-JSGCOSHPSA-N Gly-Tyr-Val Chemical compound [H]NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(O)=O DUAWRXXTOQOECJ-JSGCOSHPSA-N 0.000 description 1
- GJHWILMUOANXTG-WPRPVWTQSA-N Gly-Val-Arg Chemical compound [H]NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O GJHWILMUOANXTG-WPRPVWTQSA-N 0.000 description 1
- FULZDMOZUZKGQU-ONGXEEELSA-N Gly-Val-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)CN FULZDMOZUZKGQU-ONGXEEELSA-N 0.000 description 1
- BNMRSWQOHIQTFL-JSGCOSHPSA-N Gly-Val-Phe Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 BNMRSWQOHIQTFL-JSGCOSHPSA-N 0.000 description 1
- IZVICCORZOSGPT-JSGCOSHPSA-N Gly-Val-Tyr Chemical compound [H]NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O IZVICCORZOSGPT-JSGCOSHPSA-N 0.000 description 1
- 241000238631 Hexapoda Species 0.000 description 1
- BIAKMWKJMQLZOJ-ZKWXMUAHSA-N His-Ala-Ala Chemical compound C[C@H](NC(=O)[C@H](C)NC(=O)[C@@H](N)Cc1cnc[nH]1)C(O)=O BIAKMWKJMQLZOJ-ZKWXMUAHSA-N 0.000 description 1
- TVQGUFGDVODUIF-LSJOCFKGSA-N His-Arg-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC1=CN=CN1)N TVQGUFGDVODUIF-LSJOCFKGSA-N 0.000 description 1
- JHVCZQFWRLHUQR-DCAQKATOSA-N His-Arg-Cys Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CS)C(=O)O)N JHVCZQFWRLHUQR-DCAQKATOSA-N 0.000 description 1
- DFHVLUKTTVTCKY-PBCZWWQYSA-N His-Asn-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC1=CN=CN1)N)O DFHVLUKTTVTCKY-PBCZWWQYSA-N 0.000 description 1
- LSQHWKPPOFDHHZ-YUMQZZPRSA-N His-Asp-Gly Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)NCC(=O)O)N LSQHWKPPOFDHHZ-YUMQZZPRSA-N 0.000 description 1
- LBHOVGUGOBINDL-KKUMJFAQSA-N His-Asp-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC2=CN=CN2)N)O LBHOVGUGOBINDL-KKUMJFAQSA-N 0.000 description 1
- OHOXVDFVRDGFND-YUMQZZPRSA-N His-Cys-Gly Chemical compound N[C@@H](Cc1cnc[nH]1)C(=O)N[C@@H](CS)C(=O)NCC(O)=O OHOXVDFVRDGFND-YUMQZZPRSA-N 0.000 description 1
- LBCAQRFTWMMWRR-CIUDSAMLSA-N His-Cys-Ser Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CS)C(=O)N[C@@H](CO)C(O)=O LBCAQRFTWMMWRR-CIUDSAMLSA-N 0.000 description 1
- AKEDPWJFQULLPE-IUCAKERBSA-N His-Glu-Gly Chemical compound N[C@@H](Cc1cnc[nH]1)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O AKEDPWJFQULLPE-IUCAKERBSA-N 0.000 description 1
- OSZUPUINVNPCOE-SDDRHHMPSA-N His-Glu-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC2=CN=CN2)N)C(=O)O OSZUPUINVNPCOE-SDDRHHMPSA-N 0.000 description 1
- KNNSUUOHFVVJOP-GUBZILKMSA-N His-Glu-Ser Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CO)C(=O)O)N KNNSUUOHFVVJOP-GUBZILKMSA-N 0.000 description 1
- PYNUBZSXKQKAHL-UWVGGRQHSA-N His-Gly-Arg Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)NCC(=O)N[C@@H](CCCNC(N)=N)C(O)=O PYNUBZSXKQKAHL-UWVGGRQHSA-N 0.000 description 1
- NTXIJPDAHXSHNL-ONGXEEELSA-N His-Gly-Val Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O NTXIJPDAHXSHNL-ONGXEEELSA-N 0.000 description 1
- CSTNMMIHMYJGFR-IHRRRGAJSA-N His-His-Arg Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1NC=NC=1)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O)C1=CN=CN1 CSTNMMIHMYJGFR-IHRRRGAJSA-N 0.000 description 1
- MPXGJGBXCRQQJE-MXAVVETBSA-N His-Ile-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O MPXGJGBXCRQQJE-MXAVVETBSA-N 0.000 description 1
- JENKOCSDMSVWPY-SRVKXCTJSA-N His-Leu-Asn Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O JENKOCSDMSVWPY-SRVKXCTJSA-N 0.000 description 1
- AIPUZFXMXAHZKY-QWRGUYRKSA-N His-Leu-Gly Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O AIPUZFXMXAHZKY-QWRGUYRKSA-N 0.000 description 1
- ZSKJIISDJXJQPV-BZSNNMDCSA-N His-Leu-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CN=CN1 ZSKJIISDJXJQPV-BZSNNMDCSA-N 0.000 description 1
- KHUFDBQXGLEIHC-BZSNNMDCSA-N His-Leu-Tyr Chemical compound C([C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CN=CN1 KHUFDBQXGLEIHC-BZSNNMDCSA-N 0.000 description 1
- UXSATKFPUVZVDK-KKUMJFAQSA-N His-Lys-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC1=CN=CN1)N UXSATKFPUVZVDK-KKUMJFAQSA-N 0.000 description 1
- CKRJBQJIGOEKMC-SRVKXCTJSA-N His-Lys-Ser Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O CKRJBQJIGOEKMC-SRVKXCTJSA-N 0.000 description 1
- SAPLASXFNUYUFE-CQDKDKBSSA-N His-Phe-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](CC2=CN=CN2)N SAPLASXFNUYUFE-CQDKDKBSSA-N 0.000 description 1
- YAEKRYQASVCDLK-JYJNAYRXSA-N His-Phe-Glu Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CC2=CN=CN2)N YAEKRYQASVCDLK-JYJNAYRXSA-N 0.000 description 1
- BSVLMPMIXPQNKC-KBPBESRZSA-N His-Phe-Gly Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)NCC(O)=O BSVLMPMIXPQNKC-KBPBESRZSA-N 0.000 description 1
- PYNPBMCLAKTHJL-SRVKXCTJSA-N His-Pro-Glu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O PYNPBMCLAKTHJL-SRVKXCTJSA-N 0.000 description 1
- PGXZHYYGOPKYKM-IHRRRGAJSA-N His-Pro-Lys Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CC2=CN=CN2)N)C(=O)N[C@@H](CCCCN)C(=O)O PGXZHYYGOPKYKM-IHRRRGAJSA-N 0.000 description 1
- STGQSBKUYSPPIG-CIUDSAMLSA-N His-Ser-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CN=CN1 STGQSBKUYSPPIG-CIUDSAMLSA-N 0.000 description 1
- ZHHLTWUOWXHVQJ-YUMQZZPRSA-N His-Ser-Gly Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CO)C(=O)NCC(=O)O)N ZHHLTWUOWXHVQJ-YUMQZZPRSA-N 0.000 description 1
- ILUVWFTXAUYOBW-CUJWVEQBSA-N His-Ser-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CC1=CN=CN1)N)O ILUVWFTXAUYOBW-CUJWVEQBSA-N 0.000 description 1
- DEMIXZCKUXVEBO-BWAGICSOSA-N His-Thr-Tyr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)[C@H](CC2=CN=CN2)N)O DEMIXZCKUXVEBO-BWAGICSOSA-N 0.000 description 1
- KDDKJKKQODQQBR-NHCYSSNCSA-N His-Val-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N KDDKJKKQODQQBR-NHCYSSNCSA-N 0.000 description 1
- CGAMSLMBYJHMDY-ONGXEEELSA-N His-Val-Gly Chemical compound CC(C)[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CC1=CN=CN1)N CGAMSLMBYJHMDY-ONGXEEELSA-N 0.000 description 1
- QLBXWYXMLHAREM-PYJNHQTQSA-N His-Val-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CC1=CN=CN1)N QLBXWYXMLHAREM-PYJNHQTQSA-N 0.000 description 1
- DMAPKBANYNZHNR-ULQDDVLXSA-N His-Val-Tyr Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)[C@H](CC2=CN=CN2)N DMAPKBANYNZHNR-ULQDDVLXSA-N 0.000 description 1
- DRKZDEFADVYTLU-AVGNSLFASA-N His-Val-Val Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C(C)C)C(O)=O DRKZDEFADVYTLU-AVGNSLFASA-N 0.000 description 1
- 241000282412 Homo Species 0.000 description 1
- VSZALHITQINTGC-GHCJXIJMSA-N Ile-Ala-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](CC(=O)O)C(=O)O)N VSZALHITQINTGC-GHCJXIJMSA-N 0.000 description 1
- AQCUAZTZSPQJFF-ZKWXMUAHSA-N Ile-Ala-Gly Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O AQCUAZTZSPQJFF-ZKWXMUAHSA-N 0.000 description 1
- YPWHUFAAMNHMGS-QSFUFRPTSA-N Ile-Ala-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N YPWHUFAAMNHMGS-QSFUFRPTSA-N 0.000 description 1
- DPTBVFUDCPINIP-JURCDPSOSA-N Ile-Ala-Phe Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 DPTBVFUDCPINIP-JURCDPSOSA-N 0.000 description 1
- HDOYNXLPTRQLAD-JBDRJPRFSA-N Ile-Ala-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(=O)O)N HDOYNXLPTRQLAD-JBDRJPRFSA-N 0.000 description 1
- TZCGZYWNIDZZMR-NAKRPEOUSA-N Ile-Arg-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](C)C(=O)O)N TZCGZYWNIDZZMR-NAKRPEOUSA-N 0.000 description 1
- VZIFYHYNQDIPLI-HJWJTTGWSA-N Ile-Arg-Phe Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N VZIFYHYNQDIPLI-HJWJTTGWSA-N 0.000 description 1
- NULSANWBUWLTKN-NAKRPEOUSA-N Ile-Arg-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CO)C(=O)O)N NULSANWBUWLTKN-NAKRPEOUSA-N 0.000 description 1
- XENGULNPUDGALZ-ZPFDUUQYSA-N Ile-Asn-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(C)C)C(=O)O)N XENGULNPUDGALZ-ZPFDUUQYSA-N 0.000 description 1
- ZZHGKECPZXPXJF-PCBIJLKTSA-N Ile-Asn-Phe Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 ZZHGKECPZXPXJF-PCBIJLKTSA-N 0.000 description 1
- BGZIJZJBXRVBGJ-SXTJYALSSA-N Ile-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N BGZIJZJBXRVBGJ-SXTJYALSSA-N 0.000 description 1
- DURWCDDDAWVPOP-JBDRJPRFSA-N Ile-Cys-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CO)C(=O)O)N DURWCDDDAWVPOP-JBDRJPRFSA-N 0.000 description 1
- HOLOYAZCIHDQNS-YVNDNENWSA-N Ile-Gln-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N HOLOYAZCIHDQNS-YVNDNENWSA-N 0.000 description 1
- DVRDRICMWUSCBN-UKJIMTQDSA-N Ile-Gln-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](C(C)C)C(=O)O)N DVRDRICMWUSCBN-UKJIMTQDSA-N 0.000 description 1
- QRTVJGKXFSYJGW-KBIXCLLPSA-N Ile-Glu-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CS)C(=O)O)N QRTVJGKXFSYJGW-KBIXCLLPSA-N 0.000 description 1
- PNDMHTTXXPUQJH-RWRJDSDZSA-N Ile-Glu-Thr Chemical compound N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H]([C@H](O)C)C(=O)O PNDMHTTXXPUQJH-RWRJDSDZSA-N 0.000 description 1
- NHJKZMDIMMTVCK-QXEWZRGKSA-N Ile-Gly-Arg Chemical compound CC[C@H](C)[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCN=C(N)N NHJKZMDIMMTVCK-QXEWZRGKSA-N 0.000 description 1
- LPFBXFILACZHIB-LAEOZQHASA-N Ile-Gly-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N[C@@H](CCC(=O)O)C(=O)O)N LPFBXFILACZHIB-LAEOZQHASA-N 0.000 description 1
- CDGLBYSAZFIIJO-RCOVLWMOSA-N Ile-Gly-Gly Chemical compound CC[C@H](C)[C@H]([NH3+])C(=O)NCC(=O)NCC([O-])=O CDGLBYSAZFIIJO-RCOVLWMOSA-N 0.000 description 1
- DFFTXLCCDFYRKD-MBLNEYKQSA-N Ile-Gly-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(=O)O)N DFFTXLCCDFYRKD-MBLNEYKQSA-N 0.000 description 1
- WIZPFZKOFZXDQG-HTFCKZLJSA-N Ile-Ile-Ala Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O WIZPFZKOFZXDQG-HTFCKZLJSA-N 0.000 description 1
- AFERFBZLVUFWRA-HTFCKZLJSA-N Ile-Ile-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CS)C(=O)O)N AFERFBZLVUFWRA-HTFCKZLJSA-N 0.000 description 1
- PKGGWLOLRLOPGK-XUXIUFHCSA-N Ile-Leu-Arg Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N PKGGWLOLRLOPGK-XUXIUFHCSA-N 0.000 description 1
- TWYOYAKMLHWMOJ-ZPFDUUQYSA-N Ile-Leu-Asn Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O TWYOYAKMLHWMOJ-ZPFDUUQYSA-N 0.000 description 1
- YGDWPQCLFJNMOL-MNXVOIDGSA-N Ile-Leu-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N YGDWPQCLFJNMOL-MNXVOIDGSA-N 0.000 description 1
- HUORUFRRJHELPD-MNXVOIDGSA-N Ile-Leu-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N HUORUFRRJHELPD-MNXVOIDGSA-N 0.000 description 1
- GAZGFPOZOLEYAJ-YTFOTSKYSA-N Ile-Leu-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N GAZGFPOZOLEYAJ-YTFOTSKYSA-N 0.000 description 1
- HPCFRQWLTRDGHT-AJNGGQMLSA-N Ile-Leu-Leu Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O HPCFRQWLTRDGHT-AJNGGQMLSA-N 0.000 description 1
- GVKKVHNRTUFCCE-BJDJZHNGSA-N Ile-Leu-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)O)N GVKKVHNRTUFCCE-BJDJZHNGSA-N 0.000 description 1
- UIEZQYNXCYHMQS-BJDJZHNGSA-N Ile-Lys-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)O)N UIEZQYNXCYHMQS-BJDJZHNGSA-N 0.000 description 1
- YSGBJIQXTIVBHZ-AJNGGQMLSA-N Ile-Lys-Leu Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O YSGBJIQXTIVBHZ-AJNGGQMLSA-N 0.000 description 1
- GVNNAHIRSDRIII-AJNGGQMLSA-N Ile-Lys-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)O)N GVNNAHIRSDRIII-AJNGGQMLSA-N 0.000 description 1
- IMRKCLXPYOIHIF-ZPFDUUQYSA-N Ile-Met-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N IMRKCLXPYOIHIF-ZPFDUUQYSA-N 0.000 description 1
- MSASLZGZQAXVFP-PEDHHIEDSA-N Ile-Met-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N MSASLZGZQAXVFP-PEDHHIEDSA-N 0.000 description 1
- UAELWXJFLZBKQS-WHOFXGATSA-N Ile-Phe-Gly Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](Cc1ccccc1)C(=O)NCC(O)=O UAELWXJFLZBKQS-WHOFXGATSA-N 0.000 description 1
- XQLGNKLSPYCRMZ-HJWJTTGWSA-N Ile-Phe-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(=O)O)N XQLGNKLSPYCRMZ-HJWJTTGWSA-N 0.000 description 1
- FQYQMFCIJNWDQZ-CYDGBPFRSA-N Ile-Pro-Pro Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 FQYQMFCIJNWDQZ-CYDGBPFRSA-N 0.000 description 1
- XOZOSAUOGRPCES-STECZYCISA-N Ile-Pro-Tyr Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 XOZOSAUOGRPCES-STECZYCISA-N 0.000 description 1
- YKZAMJXNJUWFIK-JBDRJPRFSA-N Ile-Ser-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(=O)O)N YKZAMJXNJUWFIK-JBDRJPRFSA-N 0.000 description 1
- JHNJNTMTZHEDLJ-NAKRPEOUSA-N Ile-Ser-Arg Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O JHNJNTMTZHEDLJ-NAKRPEOUSA-N 0.000 description 1
- FBGXMKUWQFPHFB-JBDRJPRFSA-N Ile-Ser-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)O)N FBGXMKUWQFPHFB-JBDRJPRFSA-N 0.000 description 1
- ZNOBVZFCHNHKHA-KBIXCLLPSA-N Ile-Ser-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N ZNOBVZFCHNHKHA-KBIXCLLPSA-N 0.000 description 1
- ZDNNDIJTUHQCAM-MXAVVETBSA-N Ile-Ser-Phe Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N ZDNNDIJTUHQCAM-MXAVVETBSA-N 0.000 description 1
- HXIDVIFHRYRXLZ-NAKRPEOUSA-N Ile-Ser-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)O)N HXIDVIFHRYRXLZ-NAKRPEOUSA-N 0.000 description 1
- PZWBBXHHUSIGKH-OSUNSFLBSA-N Ile-Thr-Arg Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N PZWBBXHHUSIGKH-OSUNSFLBSA-N 0.000 description 1
- COWHUQXTSYTKQC-RWRJDSDZSA-N Ile-Thr-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N COWHUQXTSYTKQC-RWRJDSDZSA-N 0.000 description 1
- QHUREMVLLMNUAX-OSUNSFLBSA-N Ile-Thr-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)O)N QHUREMVLLMNUAX-OSUNSFLBSA-N 0.000 description 1
- BZUOLKFQVVBTJY-SLBDDTMCSA-N Ile-Trp-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CC(=O)N)C(=O)O)N BZUOLKFQVVBTJY-SLBDDTMCSA-N 0.000 description 1
- PRTZQMBYUZFSFA-XEGUGMAKSA-N Ile-Tyr-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)NCC(=O)O)N PRTZQMBYUZFSFA-XEGUGMAKSA-N 0.000 description 1
- NXRNRBOKDBIVKQ-CXTHYWKRSA-N Ile-Tyr-Tyr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O)N NXRNRBOKDBIVKQ-CXTHYWKRSA-N 0.000 description 1
- ZYVTXBXHIKGZMD-QSFUFRPTSA-N Ile-Val-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(=O)N)C(=O)O)N ZYVTXBXHIKGZMD-QSFUFRPTSA-N 0.000 description 1
- KXUKTDGKLAOCQK-LSJOCFKGSA-N Ile-Val-Gly Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O KXUKTDGKLAOCQK-LSJOCFKGSA-N 0.000 description 1
- WIYDLTIBHZSPKY-HJWJTTGWSA-N Ile-Val-Phe Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 WIYDLTIBHZSPKY-HJWJTTGWSA-N 0.000 description 1
- JZBVBOKASHNXAD-NAKRPEOUSA-N Ile-Val-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(=O)O)N JZBVBOKASHNXAD-NAKRPEOUSA-N 0.000 description 1
- 208000026350 Inborn Genetic disease Diseases 0.000 description 1
- 229930010555 Inosine Natural products 0.000 description 1
- UGQMRVRMYYASKQ-KQYNXXCUSA-N Inosine Chemical compound O[C@@H]1[C@H](O)[C@@H](CO)O[C@H]1N1C2=NC=NC(O)=C2N=C1 UGQMRVRMYYASKQ-KQYNXXCUSA-N 0.000 description 1
- 108091092195 Intron Proteins 0.000 description 1
- KFKWRHQBZQICHA-STQMWFEESA-N L-leucyl-L-phenylalanine Natural products CC(C)C[C@H](N)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 KFKWRHQBZQICHA-STQMWFEESA-N 0.000 description 1
- LZDNBBYBDGBADK-UHFFFAOYSA-N L-valyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)C(C)C)C(O)=O)=CNC2=C1 LZDNBBYBDGBADK-UHFFFAOYSA-N 0.000 description 1
- 101710192606 Latent membrane protein 2 Proteins 0.000 description 1
- CZCSUZMIRKFFFA-CIUDSAMLSA-N Leu-Ala-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(O)=O CZCSUZMIRKFFFA-CIUDSAMLSA-N 0.000 description 1
- ZRLUISBDKUWAIZ-CIUDSAMLSA-N Leu-Ala-Asp Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC(O)=O ZRLUISBDKUWAIZ-CIUDSAMLSA-N 0.000 description 1
- CQQGCWPXDHTTNF-GUBZILKMSA-N Leu-Ala-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCC(O)=O CQQGCWPXDHTTNF-GUBZILKMSA-N 0.000 description 1
- WNGVUZWBXZKQES-YUMQZZPRSA-N Leu-Ala-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O WNGVUZWBXZKQES-YUMQZZPRSA-N 0.000 description 1
- PBCHMHROGNUXMK-DLOVCJGASA-N Leu-Ala-His Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 PBCHMHROGNUXMK-DLOVCJGASA-N 0.000 description 1
- KWTVLKBOQATPHJ-SRVKXCTJSA-N Leu-Ala-Lys Chemical compound C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(C)C)N KWTVLKBOQATPHJ-SRVKXCTJSA-N 0.000 description 1
- XIRYQRLFHWWWTC-QEJZJMRPSA-N Leu-Ala-Phe Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 XIRYQRLFHWWWTC-QEJZJMRPSA-N 0.000 description 1
- XBBKIIGCUMBKCO-JXUBOQSCSA-N Leu-Ala-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XBBKIIGCUMBKCO-JXUBOQSCSA-N 0.000 description 1
- HBJZFCIVFIBNSV-DCAQKATOSA-N Leu-Arg-Asn Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC(N)=O)C(O)=O HBJZFCIVFIBNSV-DCAQKATOSA-N 0.000 description 1
- REPPKAMYTOJTFC-DCAQKATOSA-N Leu-Arg-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O REPPKAMYTOJTFC-DCAQKATOSA-N 0.000 description 1
- CNNQBZRGQATKNY-DCAQKATOSA-N Leu-Arg-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CS)C(=O)O)N CNNQBZRGQATKNY-DCAQKATOSA-N 0.000 description 1
- HASRFYOMVPJRPU-SRVKXCTJSA-N Leu-Arg-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCC(O)=O)C(O)=O HASRFYOMVPJRPU-SRVKXCTJSA-N 0.000 description 1
- QUAAUWNLWMLERT-IHRRRGAJSA-N Leu-Arg-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC(C)C)C(O)=O QUAAUWNLWMLERT-IHRRRGAJSA-N 0.000 description 1
- YOZCKMXHBYKOMQ-IHRRRGAJSA-N Leu-Arg-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCCN)C(=O)O)N YOZCKMXHBYKOMQ-IHRRRGAJSA-N 0.000 description 1
- UCOCBWDBHCUPQP-DCAQKATOSA-N Leu-Arg-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O UCOCBWDBHCUPQP-DCAQKATOSA-N 0.000 description 1
- STAVRDQLZOTNKJ-RHYQMDGZSA-N Leu-Arg-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O STAVRDQLZOTNKJ-RHYQMDGZSA-N 0.000 description 1
- WUFYAPWIHCUMLL-CIUDSAMLSA-N Leu-Asn-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(O)=O WUFYAPWIHCUMLL-CIUDSAMLSA-N 0.000 description 1
- OIARJGNVARWKFP-YUMQZZPRSA-N Leu-Asn-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O OIARJGNVARWKFP-YUMQZZPRSA-N 0.000 description 1
- JKGHDYGZRDWHGA-SRVKXCTJSA-N Leu-Asn-Leu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O JKGHDYGZRDWHGA-SRVKXCTJSA-N 0.000 description 1
- MDVZJYGNAGLPGJ-KKUMJFAQSA-N Leu-Asn-Phe Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 MDVZJYGNAGLPGJ-KKUMJFAQSA-N 0.000 description 1
- WGNOPSQMIQERPK-UHFFFAOYSA-N Leu-Asn-Pro Natural products CC(C)CC(N)C(=O)NC(CC(=O)N)C(=O)N1CCCC1C(=O)O WGNOPSQMIQERPK-UHFFFAOYSA-N 0.000 description 1
- ZURHXHNAEJJRNU-CIUDSAMLSA-N Leu-Asp-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O ZURHXHNAEJJRNU-CIUDSAMLSA-N 0.000 description 1
- ULXYQAJWJGLCNR-YUMQZZPRSA-N Leu-Asp-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O ULXYQAJWJGLCNR-YUMQZZPRSA-N 0.000 description 1
- MYGQXVYRZMKRDB-SRVKXCTJSA-N Leu-Asp-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN MYGQXVYRZMKRDB-SRVKXCTJSA-N 0.000 description 1
- PVMPDMIKUVNOBD-CIUDSAMLSA-N Leu-Asp-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O PVMPDMIKUVNOBD-CIUDSAMLSA-N 0.000 description 1
- GBDMISNMNXVTNV-XIRDDKMYSA-N Leu-Asp-Trp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O GBDMISNMNXVTNV-XIRDDKMYSA-N 0.000 description 1
- QCSFMCFHVGTLFF-NHCYSSNCSA-N Leu-Asp-Val Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O QCSFMCFHVGTLFF-NHCYSSNCSA-N 0.000 description 1
- NFHJQETXTSDZSI-DCAQKATOSA-N Leu-Cys-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O NFHJQETXTSDZSI-DCAQKATOSA-N 0.000 description 1
- DKEZVKFLETVJFY-CIUDSAMLSA-N Leu-Cys-Asn Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(=O)N)C(=O)O)N DKEZVKFLETVJFY-CIUDSAMLSA-N 0.000 description 1
- RRSLQOLASISYTB-CIUDSAMLSA-N Leu-Cys-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(O)=O)C(O)=O RRSLQOLASISYTB-CIUDSAMLSA-N 0.000 description 1
- PIHFVNPEAHFNLN-KKUMJFAQSA-N Leu-Cys-Tyr Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N PIHFVNPEAHFNLN-KKUMJFAQSA-N 0.000 description 1
- WCTCIIAGNMFYAO-DCAQKATOSA-N Leu-Cys-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CS)C(=O)N[C@@H](C(C)C)C(O)=O WCTCIIAGNMFYAO-DCAQKATOSA-N 0.000 description 1
- ZYLJULGXQDNXDK-GUBZILKMSA-N Leu-Gln-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O ZYLJULGXQDNXDK-GUBZILKMSA-N 0.000 description 1
- DXYBNWJZJVSZAE-GUBZILKMSA-N Leu-Gln-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CS)C(=O)O)N DXYBNWJZJVSZAE-GUBZILKMSA-N 0.000 description 1
- KAFOIVJDVSZUMD-DCAQKATOSA-N Leu-Gln-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O KAFOIVJDVSZUMD-DCAQKATOSA-N 0.000 description 1
- FQZPTCNSNPWHLJ-AVGNSLFASA-N Leu-Gln-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCCN)C(O)=O FQZPTCNSNPWHLJ-AVGNSLFASA-N 0.000 description 1
- AXZGZMGRBDQTEY-SRVKXCTJSA-N Leu-Gln-Met Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCSC)C(O)=O AXZGZMGRBDQTEY-SRVKXCTJSA-N 0.000 description 1
- CQGSYZCULZMEDE-UHFFFAOYSA-N Leu-Gln-Pro Natural products CC(C)CC(N)C(=O)NC(CCC(N)=O)C(=O)N1CCCC1C(O)=O CQGSYZCULZMEDE-UHFFFAOYSA-N 0.000 description 1
- GPICTNQYKHHHTH-GUBZILKMSA-N Leu-Gln-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O GPICTNQYKHHHTH-GUBZILKMSA-N 0.000 description 1
- KUEVMUXNILMJTK-JYJNAYRXSA-N Leu-Gln-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 KUEVMUXNILMJTK-JYJNAYRXSA-N 0.000 description 1
- KVMULWOHPPMHHE-DCAQKATOSA-N Leu-Glu-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O KVMULWOHPPMHHE-DCAQKATOSA-N 0.000 description 1
- NEEOBPIXKWSBRF-IUCAKERBSA-N Leu-Glu-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O NEEOBPIXKWSBRF-IUCAKERBSA-N 0.000 description 1
- HPBCTWSUJOGJSH-MNXVOIDGSA-N Leu-Glu-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O HPBCTWSUJOGJSH-MNXVOIDGSA-N 0.000 description 1
- QVFGXCVIXXBFHO-AVGNSLFASA-N Leu-Glu-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O QVFGXCVIXXBFHO-AVGNSLFASA-N 0.000 description 1
- ZFNLIDNJUWNIJL-WDCWCFNPSA-N Leu-Glu-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ZFNLIDNJUWNIJL-WDCWCFNPSA-N 0.000 description 1
- FMEICTQWUKNAGC-YUMQZZPRSA-N Leu-Gly-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O FMEICTQWUKNAGC-YUMQZZPRSA-N 0.000 description 1
- FIYMBBHGYNQFOP-IUCAKERBSA-N Leu-Gly-Gln Chemical compound CC(C)C[C@@H](C(=O)NCC(=O)N[C@@H](CCC(=O)N)C(=O)O)N FIYMBBHGYNQFOP-IUCAKERBSA-N 0.000 description 1
- KGCLIYGPQXUNLO-IUCAKERBSA-N Leu-Gly-Glu Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(O)=O KGCLIYGPQXUNLO-IUCAKERBSA-N 0.000 description 1
- VBZOAGIPCULURB-QWRGUYRKSA-N Leu-Gly-His Chemical compound CC(C)C[C@@H](C(=O)NCC(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N VBZOAGIPCULURB-QWRGUYRKSA-N 0.000 description 1
- HYIFFZAQXPUEAU-QWRGUYRKSA-N Leu-Gly-Leu Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC(C)C HYIFFZAQXPUEAU-QWRGUYRKSA-N 0.000 description 1
- KEVYYIMVELOXCT-KBPBESRZSA-N Leu-Gly-Phe Chemical compound CC(C)C[C@H]([NH3+])C(=O)NCC(=O)N[C@H](C([O-])=O)CC1=CC=CC=C1 KEVYYIMVELOXCT-KBPBESRZSA-N 0.000 description 1
- VGPCJSXPPOQPBK-YUMQZZPRSA-N Leu-Gly-Ser Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O VGPCJSXPPOQPBK-YUMQZZPRSA-N 0.000 description 1
- POZULHZYLPGXMR-ONGXEEELSA-N Leu-Gly-Val Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O POZULHZYLPGXMR-ONGXEEELSA-N 0.000 description 1
- DDEMUMVXNFPDKC-SRVKXCTJSA-N Leu-His-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CS)C(=O)O)N DDEMUMVXNFPDKC-SRVKXCTJSA-N 0.000 description 1
- KXODZBLFVFSLAI-AVGNSLFASA-N Leu-His-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CC(C)C)CC1=CN=CN1 KXODZBLFVFSLAI-AVGNSLFASA-N 0.000 description 1
- CFZZDVMBRYFFNU-QWRGUYRKSA-N Leu-His-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)NCC(O)=O CFZZDVMBRYFFNU-QWRGUYRKSA-N 0.000 description 1
- XBCWOTOCBXXJDG-BZSNNMDCSA-N Leu-His-Phe Chemical compound C([C@H](NC(=O)[C@@H](N)CC(C)C)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CN=CN1 XBCWOTOCBXXJDG-BZSNNMDCSA-N 0.000 description 1
- HMDDEJADNKQTBR-BZSNNMDCSA-N Leu-His-Tyr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O HMDDEJADNKQTBR-BZSNNMDCSA-N 0.000 description 1
- AVEGDIAXTDVBJS-XUXIUFHCSA-N Leu-Ile-Arg Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O AVEGDIAXTDVBJS-XUXIUFHCSA-N 0.000 description 1
- KOSWSHVQIVTVQF-ZPFDUUQYSA-N Leu-Ile-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O KOSWSHVQIVTVQF-ZPFDUUQYSA-N 0.000 description 1
- HNDWYLYAYNBWMP-AJNGGQMLSA-N Leu-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(C)C)N HNDWYLYAYNBWMP-AJNGGQMLSA-N 0.000 description 1
- NRFGTHFONZYFNY-MGHWNKPDSA-N Leu-Ile-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 NRFGTHFONZYFNY-MGHWNKPDSA-N 0.000 description 1
- IAJFFZORSWOZPQ-SRVKXCTJSA-N Leu-Leu-Asn Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O IAJFFZORSWOZPQ-SRVKXCTJSA-N 0.000 description 1
- YOKVEHGYYQEQOP-QWRGUYRKSA-N Leu-Leu-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O YOKVEHGYYQEQOP-QWRGUYRKSA-N 0.000 description 1
- XVZCXCTYGHPNEM-UHFFFAOYSA-N Leu-Leu-Pro Natural products CC(C)CC(N)C(=O)NC(CC(C)C)C(=O)N1CCCC1C(O)=O XVZCXCTYGHPNEM-UHFFFAOYSA-N 0.000 description 1
- DCGXHWINSHEPIR-SRVKXCTJSA-N Leu-Lys-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CS)C(=O)O)N DCGXHWINSHEPIR-SRVKXCTJSA-N 0.000 description 1
- ZGUMORRUBUCXEH-AVGNSLFASA-N Leu-Lys-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(O)=O ZGUMORRUBUCXEH-AVGNSLFASA-N 0.000 description 1
- REPBGZHJKYWFMJ-KKUMJFAQSA-N Leu-Lys-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N REPBGZHJKYWFMJ-KKUMJFAQSA-N 0.000 description 1
- BGZCJDGBBUUBHA-KKUMJFAQSA-N Leu-Lys-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O BGZCJDGBBUUBHA-KKUMJFAQSA-N 0.000 description 1
- RTIRBWJPYJYTLO-MELADBBJSA-N Leu-Lys-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N1CCC[C@@H]1C(=O)O)N RTIRBWJPYJYTLO-MELADBBJSA-N 0.000 description 1
- LZHJZLHSRGWBBE-IHRRRGAJSA-N Leu-Lys-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O LZHJZLHSRGWBBE-IHRRRGAJSA-N 0.000 description 1
- PKKMDPNFGULLNQ-AVGNSLFASA-N Leu-Met-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O PKKMDPNFGULLNQ-AVGNSLFASA-N 0.000 description 1
- FLNPJLDPGMLWAU-UWVGGRQHSA-N Leu-Met-Gly Chemical compound OC(=O)CNC(=O)[C@H](CCSC)NC(=O)[C@@H](N)CC(C)C FLNPJLDPGMLWAU-UWVGGRQHSA-N 0.000 description 1
- GNRPTBRHRRZCMA-RWMBFGLXSA-N Leu-Met-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCSC)C(=O)N1CCC[C@@H]1C(=O)O)N GNRPTBRHRRZCMA-RWMBFGLXSA-N 0.000 description 1
- ZDBMWELMUCLUPL-QEJZJMRPSA-N Leu-Phe-Ala Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C)C(O)=O)CC1=CC=CC=C1 ZDBMWELMUCLUPL-QEJZJMRPSA-N 0.000 description 1
- BIZNDKMFQHDOIE-KKUMJFAQSA-N Leu-Phe-Asn Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(N)=O)C(O)=O)CC1=CC=CC=C1 BIZNDKMFQHDOIE-KKUMJFAQSA-N 0.000 description 1
- MJWVXZABPOKJJF-ACRUOGEOSA-N Leu-Phe-Phe Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O MJWVXZABPOKJJF-ACRUOGEOSA-N 0.000 description 1
- MVVSHHJKJRZVNY-ACRUOGEOSA-N Leu-Phe-Tyr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O MVVSHHJKJRZVNY-ACRUOGEOSA-N 0.000 description 1
- BMVFXOQHDQZAQU-DCAQKATOSA-N Leu-Pro-Asp Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(=O)O)C(=O)O)N BMVFXOQHDQZAQU-DCAQKATOSA-N 0.000 description 1
- HGUUMQWGYCVPKG-DCAQKATOSA-N Leu-Pro-Cys Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CS)C(=O)O)N HGUUMQWGYCVPKG-DCAQKATOSA-N 0.000 description 1
- UCBPDSYUVAAHCD-UWVGGRQHSA-N Leu-Pro-Gly Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O UCBPDSYUVAAHCD-UWVGGRQHSA-N 0.000 description 1
- MUCIDQMDOYQYBR-IHRRRGAJSA-N Leu-Pro-His Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N MUCIDQMDOYQYBR-IHRRRGAJSA-N 0.000 description 1
- DPURXCQCHSQPAN-AVGNSLFASA-N Leu-Pro-Pro Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 DPURXCQCHSQPAN-AVGNSLFASA-N 0.000 description 1
- PWPBLZXWFXJFHE-RHYQMDGZSA-N Leu-Pro-Thr Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O PWPBLZXWFXJFHE-RHYQMDGZSA-N 0.000 description 1
- IRMLZWSRWSGTOP-CIUDSAMLSA-N Leu-Ser-Ala Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O IRMLZWSRWSGTOP-CIUDSAMLSA-N 0.000 description 1
- KZZCOWMDDXDKSS-CIUDSAMLSA-N Leu-Ser-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O KZZCOWMDDXDKSS-CIUDSAMLSA-N 0.000 description 1
- KIZIOFNVSOSKJI-CIUDSAMLSA-N Leu-Ser-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)O)N KIZIOFNVSOSKJI-CIUDSAMLSA-N 0.000 description 1
- JIHDFWWRYHSAQB-GUBZILKMSA-N Leu-Ser-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O JIHDFWWRYHSAQB-GUBZILKMSA-N 0.000 description 1
- MVHXGBZUJLWZOH-BJDJZHNGSA-N Leu-Ser-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O MVHXGBZUJLWZOH-BJDJZHNGSA-N 0.000 description 1
- SBANPBVRHYIMRR-UHFFFAOYSA-N Leu-Ser-Pro Natural products CC(C)CC(N)C(=O)NC(CO)C(=O)N1CCCC1C(O)=O SBANPBVRHYIMRR-UHFFFAOYSA-N 0.000 description 1
- SIGZKCWZEBFNAK-QAETUUGQSA-N Leu-Ser-Ser-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 SIGZKCWZEBFNAK-QAETUUGQSA-N 0.000 description 1
- PPGBXYKMUMHFBF-KATARQTJSA-N Leu-Ser-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PPGBXYKMUMHFBF-KATARQTJSA-N 0.000 description 1
- SQUFDMCWMFOEBA-KKUMJFAQSA-N Leu-Ser-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 SQUFDMCWMFOEBA-KKUMJFAQSA-N 0.000 description 1
- SVBJIZVVYJYGLA-DCAQKATOSA-N Leu-Ser-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O SVBJIZVVYJYGLA-DCAQKATOSA-N 0.000 description 1
- LCNASHSOFMRYFO-WDCWCFNPSA-N Leu-Thr-Gln Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCC(N)=O LCNASHSOFMRYFO-WDCWCFNPSA-N 0.000 description 1
- LFSQWRSVPNKJGP-WDCWCFNPSA-N Leu-Thr-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCC(O)=O LFSQWRSVPNKJGP-WDCWCFNPSA-N 0.000 description 1
- ILDSIMPXNFWKLH-KATARQTJSA-N Leu-Thr-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O ILDSIMPXNFWKLH-KATARQTJSA-N 0.000 description 1
- AIQWYVFNBNNOLU-RHYQMDGZSA-N Leu-Thr-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O AIQWYVFNBNNOLU-RHYQMDGZSA-N 0.000 description 1
- HOMFINRJHIIZNJ-HOCLYGCPSA-N Leu-Trp-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)NCC(O)=O HOMFINRJHIIZNJ-HOCLYGCPSA-N 0.000 description 1
- SUYRAPCRSCCPAK-VFAJRCTISA-N Leu-Trp-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SUYRAPCRSCCPAK-VFAJRCTISA-N 0.000 description 1
- VUBIPAHVHMZHCM-KKUMJFAQSA-N Leu-Tyr-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CO)C(O)=O)CC1=CC=C(O)C=C1 VUBIPAHVHMZHCM-KKUMJFAQSA-N 0.000 description 1
- VQHUBNVKFFLWRP-ULQDDVLXSA-N Leu-Tyr-Val Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C(C)C)C(O)=O)CC1=CC=C(O)C=C1 VQHUBNVKFFLWRP-ULQDDVLXSA-N 0.000 description 1
- TUIOUEWKFFVNLH-DCAQKATOSA-N Leu-Val-Cys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CS)C(O)=O TUIOUEWKFFVNLH-DCAQKATOSA-N 0.000 description 1
- MVJRBCJCRYGCKV-GVXVVHGQSA-N Leu-Val-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O MVJRBCJCRYGCKV-GVXVVHGQSA-N 0.000 description 1
- AIMGJYMCTAABEN-GVXVVHGQSA-N Leu-Val-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O AIMGJYMCTAABEN-GVXVVHGQSA-N 0.000 description 1
- FMFNIDICDKEMOE-XUXIUFHCSA-N Leu-Val-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O FMFNIDICDKEMOE-XUXIUFHCSA-N 0.000 description 1
- FDBTVENULFNTAL-XQQFMLRXSA-N Leu-Val-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N FDBTVENULFNTAL-XQQFMLRXSA-N 0.000 description 1
- QESXLSQLQHHTIX-RHYQMDGZSA-N Leu-Val-Thr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O QESXLSQLQHHTIX-RHYQMDGZSA-N 0.000 description 1
- RVOMPSJXSRPFJT-DCAQKATOSA-N Lys-Ala-Arg Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O RVOMPSJXSRPFJT-DCAQKATOSA-N 0.000 description 1
- MPOHDJKRBLVGCT-CIUDSAMLSA-N Lys-Ala-Asn Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCCCN)N MPOHDJKRBLVGCT-CIUDSAMLSA-N 0.000 description 1
- KCXUCYYZNZFGLL-SRVKXCTJSA-N Lys-Ala-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O KCXUCYYZNZFGLL-SRVKXCTJSA-N 0.000 description 1
- CLBGMWIYPYAZPR-AVGNSLFASA-N Lys-Arg-Arg Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O CLBGMWIYPYAZPR-AVGNSLFASA-N 0.000 description 1
- WALVCOOOKULCQM-ULQDDVLXSA-N Lys-Arg-Phe Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O WALVCOOOKULCQM-ULQDDVLXSA-N 0.000 description 1
- NQCJGQHHYZNUDK-DCAQKATOSA-N Lys-Arg-Ser Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CO)C(O)=O)CCCN=C(N)N NQCJGQHHYZNUDK-DCAQKATOSA-N 0.000 description 1
- SWWCDAGDQHTKIE-RHYQMDGZSA-N Lys-Arg-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SWWCDAGDQHTKIE-RHYQMDGZSA-N 0.000 description 1
- HGZHSNBZDOLMLH-DCAQKATOSA-N Lys-Asn-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCCCN)N HGZHSNBZDOLMLH-DCAQKATOSA-N 0.000 description 1
- FLCMXEFCTLXBTL-DCAQKATOSA-N Lys-Asp-Arg Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N FLCMXEFCTLXBTL-DCAQKATOSA-N 0.000 description 1
- IBQMEXQYZMVIFU-SRVKXCTJSA-N Lys-Asp-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCCCN)N IBQMEXQYZMVIFU-SRVKXCTJSA-N 0.000 description 1
- SSJBMGCZZXCGJJ-DCAQKATOSA-N Lys-Asp-Met Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCSC)C(O)=O SSJBMGCZZXCGJJ-DCAQKATOSA-N 0.000 description 1
- WTZUSCUIVPVCRH-SRVKXCTJSA-N Lys-Gln-Arg Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N WTZUSCUIVPVCRH-SRVKXCTJSA-N 0.000 description 1
- PGBPWPTUOSCNLE-JYJNAYRXSA-N Lys-Gln-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCCCN)N PGBPWPTUOSCNLE-JYJNAYRXSA-N 0.000 description 1
- IRRZDAIFYHNIIN-JYJNAYRXSA-N Lys-Gln-Tyr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O IRRZDAIFYHNIIN-JYJNAYRXSA-N 0.000 description 1
- NDORZBUHCOJQDO-GVXVVHGQSA-N Lys-Gln-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O NDORZBUHCOJQDO-GVXVVHGQSA-N 0.000 description 1
- PBIPLDMFHAICIP-DCAQKATOSA-N Lys-Glu-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O PBIPLDMFHAICIP-DCAQKATOSA-N 0.000 description 1
- DCRWPTBMWMGADO-AVGNSLFASA-N Lys-Glu-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O DCRWPTBMWMGADO-AVGNSLFASA-N 0.000 description 1
- GQZMPWBZQALKJO-UWVGGRQHSA-N Lys-Gly-Arg Chemical compound [H]N[C@@H](CCCCN)C(=O)NCC(=O)N[C@@H](CCCNC(N)=N)C(O)=O GQZMPWBZQALKJO-UWVGGRQHSA-N 0.000 description 1
- LCMWVZLBCUVDAZ-IUCAKERBSA-N Lys-Gly-Glu Chemical compound [NH3+]CCCC[C@H]([NH3+])C(=O)NCC(=O)N[C@H](C([O-])=O)CCC([O-])=O LCMWVZLBCUVDAZ-IUCAKERBSA-N 0.000 description 1
- ISHNZELVUVPCHY-ZETCQYMHSA-N Lys-Gly-Gly Chemical compound NCCCC[C@H](N)C(=O)NCC(=O)NCC(O)=O ISHNZELVUVPCHY-ZETCQYMHSA-N 0.000 description 1
- JZMGVXLDOQOKAH-UWVGGRQHSA-N Lys-Gly-Met Chemical compound [H]N[C@@H](CCCCN)C(=O)NCC(=O)N[C@@H](CCSC)C(O)=O JZMGVXLDOQOKAH-UWVGGRQHSA-N 0.000 description 1
- VLMNBMFYRMGEMB-QWRGUYRKSA-N Lys-His-Gly Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CNC=N1 VLMNBMFYRMGEMB-QWRGUYRKSA-N 0.000 description 1
- QOJDBRUCOXQSSK-AJNGGQMLSA-N Lys-Ile-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCCN)C(O)=O QOJDBRUCOXQSSK-AJNGGQMLSA-N 0.000 description 1
- CBNMHRCLYBJIIZ-XUXIUFHCSA-N Lys-Ile-Met Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CCCCN)N CBNMHRCLYBJIIZ-XUXIUFHCSA-N 0.000 description 1
- KEPWSUPUFAPBRF-DKIMLUQUSA-N Lys-Ile-Phe Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O KEPWSUPUFAPBRF-DKIMLUQUSA-N 0.000 description 1
- IZJGPPIGYTVXLB-FQUUOJAGSA-N Lys-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCCN)N IZJGPPIGYTVXLB-FQUUOJAGSA-N 0.000 description 1
- WAIHHELKYSFIQN-XUXIUFHCSA-N Lys-Ile-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O WAIHHELKYSFIQN-XUXIUFHCSA-N 0.000 description 1
- NJNRBRKHOWSGMN-SRVKXCTJSA-N Lys-Leu-Asn Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O NJNRBRKHOWSGMN-SRVKXCTJSA-N 0.000 description 1
- OIQSIMFSVLLWBX-VOAKCMCISA-N Lys-Leu-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OIQSIMFSVLLWBX-VOAKCMCISA-N 0.000 description 1
- PYFNONMJYNJENN-AVGNSLFASA-N Lys-Lys-Gln Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N PYFNONMJYNJENN-AVGNSLFASA-N 0.000 description 1
- CNGOEHJCLVCJHN-SRVKXCTJSA-N Lys-Pro-Glu Chemical compound NCCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O CNGOEHJCLVCJHN-SRVKXCTJSA-N 0.000 description 1
- PDIDTSZKKFEDMB-UWVGGRQHSA-N Lys-Pro-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O PDIDTSZKKFEDMB-UWVGGRQHSA-N 0.000 description 1
- YTJFXEDRUOQGSP-DCAQKATOSA-N Lys-Pro-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O YTJFXEDRUOQGSP-DCAQKATOSA-N 0.000 description 1
- LOGFVTREOLYCPF-RHYQMDGZSA-N Lys-Pro-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CCCCN LOGFVTREOLYCPF-RHYQMDGZSA-N 0.000 description 1
- CTJUSALVKAWFFU-CIUDSAMLSA-N Lys-Ser-Cys Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)O)N CTJUSALVKAWFFU-CIUDSAMLSA-N 0.000 description 1
- MIFFFXHMAHFACR-KATARQTJSA-N Lys-Ser-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CCCCN MIFFFXHMAHFACR-KATARQTJSA-N 0.000 description 1
- TVOOGUNBIWAURO-KATARQTJSA-N Lys-Thr-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCCCN)N)O TVOOGUNBIWAURO-KATARQTJSA-N 0.000 description 1
- RPWTZTBIFGENIA-VOAKCMCISA-N Lys-Thr-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O RPWTZTBIFGENIA-VOAKCMCISA-N 0.000 description 1
- SEZADXQOJJTXPG-VFAJRCTISA-N Lys-Thr-Trp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CCCCN)N)O SEZADXQOJJTXPG-VFAJRCTISA-N 0.000 description 1
- VHTOGMKQXXJOHG-RHYQMDGZSA-N Lys-Thr-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O VHTOGMKQXXJOHG-RHYQMDGZSA-N 0.000 description 1
- GVKINWYYLOLEFQ-XIRDDKMYSA-N Lys-Trp-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CO)C(O)=O GVKINWYYLOLEFQ-XIRDDKMYSA-N 0.000 description 1
- HONVOXINDBETTI-KKUMJFAQSA-N Lys-Tyr-Cys Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CS)C(O)=O)CC1=CC=C(O)C=C1 HONVOXINDBETTI-KKUMJFAQSA-N 0.000 description 1
- XYLSGAWRCZECIQ-JYJNAYRXSA-N Lys-Tyr-Glu Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CCC(O)=O)C(O)=O)CC1=CC=C(O)C=C1 XYLSGAWRCZECIQ-JYJNAYRXSA-N 0.000 description 1
- IMDJSVBFQKDDEQ-MGHWNKPDSA-N Lys-Tyr-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CCCCN)N IMDJSVBFQKDDEQ-MGHWNKPDSA-N 0.000 description 1
- MDDUIRLQCYVRDO-NHCYSSNCSA-N Lys-Val-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCCN MDDUIRLQCYVRDO-NHCYSSNCSA-N 0.000 description 1
- OZVXDDFYCQOPFD-XQQFMLRXSA-N Lys-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCCN)N OZVXDDFYCQOPFD-XQQFMLRXSA-N 0.000 description 1
- RIPJMCFGQHGHNP-RHYQMDGZSA-N Lys-Val-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CCCCN)N)O RIPJMCFGQHGHNP-RHYQMDGZSA-N 0.000 description 1
- ONGCSGVHCSAATF-CIUDSAMLSA-N Met-Ala-Glu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCC(O)=O ONGCSGVHCSAATF-CIUDSAMLSA-N 0.000 description 1
- WYEXWKAWMNJKPN-UBHSHLNASA-N Met-Ala-Phe Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CCSC)N WYEXWKAWMNJKPN-UBHSHLNASA-N 0.000 description 1
- ULNXMMYXQKGNPG-LPEHRKFASA-N Met-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCSC)N ULNXMMYXQKGNPG-LPEHRKFASA-N 0.000 description 1
- HUKLXYYPZWPXCC-KZVJFYERSA-N Met-Ala-Thr Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O HUKLXYYPZWPXCC-KZVJFYERSA-N 0.000 description 1
- DLAFCQWUMFMZSN-GUBZILKMSA-N Met-Arg-Ala Chemical compound CSCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C)C(O)=O)CCCN=C(N)N DLAFCQWUMFMZSN-GUBZILKMSA-N 0.000 description 1
- WDTLNWHPIPCMMP-AVGNSLFASA-N Met-Arg-Leu Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O WDTLNWHPIPCMMP-AVGNSLFASA-N 0.000 description 1
- NKDSBBBPGIVWEI-RCWTZXSCSA-N Met-Arg-Thr Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O NKDSBBBPGIVWEI-RCWTZXSCSA-N 0.000 description 1
- JMEWFDUAFKVAAT-WDSKDSINSA-N Met-Asn Chemical compound CSCC[C@H]([NH3+])C(=O)N[C@H](C([O-])=O)CC(N)=O JMEWFDUAFKVAAT-WDSKDSINSA-N 0.000 description 1
- ACYHZNZHIZWLQF-BQBZGAKWSA-N Met-Asn-Gly Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O ACYHZNZHIZWLQF-BQBZGAKWSA-N 0.000 description 1
- IHITVQKJXQQGLJ-LPEHRKFASA-N Met-Asn-Pro Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N1CCC[C@@H]1C(=O)O)N IHITVQKJXQQGLJ-LPEHRKFASA-N 0.000 description 1
- TUSOIZOVPJCMFC-FXQIFTODSA-N Met-Asp-Asp Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O TUSOIZOVPJCMFC-FXQIFTODSA-N 0.000 description 1
- DRINJBAHUGXNFC-DCAQKATOSA-N Met-Asp-His Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CNC=N1)C(O)=O DRINJBAHUGXNFC-DCAQKATOSA-N 0.000 description 1
- UJDMTKHGWSBHBX-IHRRRGAJSA-N Met-Cys-Phe Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 UJDMTKHGWSBHBX-IHRRRGAJSA-N 0.000 description 1
- HLYIDXAXQIJYIG-CIUDSAMLSA-N Met-Gln-Asp Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O HLYIDXAXQIJYIG-CIUDSAMLSA-N 0.000 description 1
- JYCQGAGDJQYEDB-GUBZILKMSA-N Met-Gln-Gln Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O JYCQGAGDJQYEDB-GUBZILKMSA-N 0.000 description 1
- NCVJJAJVWILAGI-SRVKXCTJSA-N Met-Gln-Lys Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N NCVJJAJVWILAGI-SRVKXCTJSA-N 0.000 description 1
- PQPMMGQTRQFSDA-SRVKXCTJSA-N Met-Glu-His Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CNC=N1)C(O)=O PQPMMGQTRQFSDA-SRVKXCTJSA-N 0.000 description 1
- SJDQOYTYNGZZJX-SRVKXCTJSA-N Met-Glu-Leu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O SJDQOYTYNGZZJX-SRVKXCTJSA-N 0.000 description 1
- JPCHYAUKOUGOIB-HJGDQZAQSA-N Met-Glu-Thr Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O JPCHYAUKOUGOIB-HJGDQZAQSA-N 0.000 description 1
- IUYCGMNKIZDRQI-BQBZGAKWSA-N Met-Gly-Ala Chemical compound CSCC[C@H](N)C(=O)NCC(=O)N[C@@H](C)C(O)=O IUYCGMNKIZDRQI-BQBZGAKWSA-N 0.000 description 1
- FYRUJIJAUPHUNB-IUCAKERBSA-N Met-Gly-Arg Chemical compound CSCC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCNC(N)=N FYRUJIJAUPHUNB-IUCAKERBSA-N 0.000 description 1
- MYAPQOBHGWJZOM-UWVGGRQHSA-N Met-Gly-Leu Chemical compound CSCC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC(C)C MYAPQOBHGWJZOM-UWVGGRQHSA-N 0.000 description 1
- LRALLISKBZNSKN-BQBZGAKWSA-N Met-Gly-Ser Chemical compound CSCC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O LRALLISKBZNSKN-BQBZGAKWSA-N 0.000 description 1
- OBCRZLRPJFNLAN-DCAQKATOSA-N Met-His-Asp Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(O)=O)C(O)=O OBCRZLRPJFNLAN-DCAQKATOSA-N 0.000 description 1
- XMQZLGBUJMMODC-AVGNSLFASA-N Met-His-Val Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C(C)C)C(O)=O XMQZLGBUJMMODC-AVGNSLFASA-N 0.000 description 1
- AEQVPPGEJJBFEE-CYDGBPFRSA-N Met-Ile-Arg Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O AEQVPPGEJJBFEE-CYDGBPFRSA-N 0.000 description 1
- NLHSFJQUHGCWSD-PYJNHQTQSA-N Met-Ile-His Chemical compound N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CNC=N1)C(O)=O NLHSFJQUHGCWSD-PYJNHQTQSA-N 0.000 description 1
- QZPXMHVKPHJNTR-DCAQKATOSA-N Met-Leu-Asn Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O QZPXMHVKPHJNTR-DCAQKATOSA-N 0.000 description 1
- OSZTUONKUMCWEP-XUXIUFHCSA-N Met-Leu-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CCSC OSZTUONKUMCWEP-XUXIUFHCSA-N 0.000 description 1
- SODXFJOPSCXOHE-IHRRRGAJSA-N Met-Leu-Leu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O SODXFJOPSCXOHE-IHRRRGAJSA-N 0.000 description 1
- CHDYFPCQVUOJEB-ULQDDVLXSA-N Met-Leu-Phe Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 CHDYFPCQVUOJEB-ULQDDVLXSA-N 0.000 description 1
- YLBUMXYVQCHBPR-ULQDDVLXSA-N Met-Leu-Tyr Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 YLBUMXYVQCHBPR-ULQDDVLXSA-N 0.000 description 1
- BEZJTLKUMFMITF-AVGNSLFASA-N Met-Lys-Arg Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CCCNC(N)=N BEZJTLKUMFMITF-AVGNSLFASA-N 0.000 description 1
- WUYLWZRHRLLEGB-AVGNSLFASA-N Met-Met-Leu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(O)=O WUYLWZRHRLLEGB-AVGNSLFASA-N 0.000 description 1
- FBLBCGLSRXBANI-KKUMJFAQSA-N Met-Phe-Glu Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N FBLBCGLSRXBANI-KKUMJFAQSA-N 0.000 description 1
- RSOMVHWMIAZNLE-HJWJTTGWSA-N Met-Phe-Ile Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O RSOMVHWMIAZNLE-HJWJTTGWSA-N 0.000 description 1
- JQHYVIKEFYETEW-IHRRRGAJSA-N Met-Phe-Ser Chemical compound CSCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CO)C(O)=O)CC1=CC=CC=C1 JQHYVIKEFYETEW-IHRRRGAJSA-N 0.000 description 1
- MQASRXPTQJJNFM-JYJNAYRXSA-N Met-Pro-Phe Chemical compound CSCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 MQASRXPTQJJNFM-JYJNAYRXSA-N 0.000 description 1
- NHXXGBXJTLRGJI-GUBZILKMSA-N Met-Pro-Ser Chemical compound [H]N[C@@H](CCSC)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O NHXXGBXJTLRGJI-GUBZILKMSA-N 0.000 description 1
- RDLSEGZJMYGFNS-FXQIFTODSA-N Met-Ser-Asp Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O RDLSEGZJMYGFNS-FXQIFTODSA-N 0.000 description 1
- DSZFTPCSFVWMKP-DCAQKATOSA-N Met-Ser-Lys Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCCN DSZFTPCSFVWMKP-DCAQKATOSA-N 0.000 description 1
- DBMLDOWSVHMQQN-XGEHTFHBSA-N Met-Ser-Thr Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O DBMLDOWSVHMQQN-XGEHTFHBSA-N 0.000 description 1
- GGXZOTSDJJTDGB-GUBZILKMSA-N Met-Ser-Val Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O GGXZOTSDJJTDGB-GUBZILKMSA-N 0.000 description 1
- NDJSSFWDYDUQID-YTWAJWBKSA-N Met-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCSC)N)O NDJSSFWDYDUQID-YTWAJWBKSA-N 0.000 description 1
- GWADARYJIJDYRC-XGEHTFHBSA-N Met-Thr-Ser Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O GWADARYJIJDYRC-XGEHTFHBSA-N 0.000 description 1
- SQPZCTBSLIIMBL-BPUTZDHNSA-N Met-Trp-Ser Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CO)C(=O)O)N SQPZCTBSLIIMBL-BPUTZDHNSA-N 0.000 description 1
- CULGJGUDIJATIP-STQMWFEESA-N Met-Tyr-Gly Chemical compound CSCC[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CC=C(O)C=C1 CULGJGUDIJATIP-STQMWFEESA-N 0.000 description 1
- GHQFLTYXGUETFD-UFYCRDLUSA-N Met-Tyr-Tyr Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O)N GHQFLTYXGUETFD-UFYCRDLUSA-N 0.000 description 1
- QAVZUKIPOMBLMC-AVGNSLFASA-N Met-Val-Leu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC(C)C QAVZUKIPOMBLMC-AVGNSLFASA-N 0.000 description 1
- JACMWNXOOUYXCD-JYJNAYRXSA-N Met-Val-Phe Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 JACMWNXOOUYXCD-JYJNAYRXSA-N 0.000 description 1
- VYDLZDRMOFYOGV-TUAOUCFPSA-N Met-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCSC)N VYDLZDRMOFYOGV-TUAOUCFPSA-N 0.000 description 1
- IQJMEDDVOGMTKT-SRVKXCTJSA-N Met-Val-Val Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C(C)C)C(O)=O IQJMEDDVOGMTKT-SRVKXCTJSA-N 0.000 description 1
- 101100476480 Mus musculus S100a8 gene Proteins 0.000 description 1
- XZFYRXDAULDNFX-UHFFFAOYSA-N N-L-cysteinyl-L-phenylalanine Natural products SCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 XZFYRXDAULDNFX-UHFFFAOYSA-N 0.000 description 1
- 108010066427 N-valyltryptophan Proteins 0.000 description 1
- 108010047562 NGR peptide Proteins 0.000 description 1
- 108010065395 Neuropep-1 Proteins 0.000 description 1
- 101710087110 ORF6 protein Proteins 0.000 description 1
- 108700026244 Open Reading Frames Proteins 0.000 description 1
- 241000283973 Oryctolagus cuniculus Species 0.000 description 1
- 108091093037 Peptide nucleic acid Proteins 0.000 description 1
- UHRNIXJAGGLKHP-DLOVCJGASA-N Phe-Ala-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O UHRNIXJAGGLKHP-DLOVCJGASA-N 0.000 description 1
- NEHSHYOUIWBYSA-DCPHZVHLSA-N Phe-Ala-Trp Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CC3=CC=CC=C3)N NEHSHYOUIWBYSA-DCPHZVHLSA-N 0.000 description 1
- VHWOBXIWBDWZHK-IHRRRGAJSA-N Phe-Arg-Asp Chemical compound NC(N)=NCCC[C@@H](C(=O)N[C@@H](CC(O)=O)C(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 VHWOBXIWBDWZHK-IHRRRGAJSA-N 0.000 description 1
- XWBJLKDCHJVKAK-KKUMJFAQSA-N Phe-Arg-Gln Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N XWBJLKDCHJVKAK-KKUMJFAQSA-N 0.000 description 1
- CGOMLCQJEMWMCE-STQMWFEESA-N Phe-Arg-Gly Chemical compound NC(N)=NCCC[C@@H](C(=O)NCC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 CGOMLCQJEMWMCE-STQMWFEESA-N 0.000 description 1
- PLNHHOXNVSYKOB-JYJNAYRXSA-N Phe-Arg-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC1=CC=CC=C1)N PLNHHOXNVSYKOB-JYJNAYRXSA-N 0.000 description 1
- GNUCSNWOCQFMMC-UFYCRDLUSA-N Phe-Arg-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 GNUCSNWOCQFMMC-UFYCRDLUSA-N 0.000 description 1
- ZWJKVFAYPLPCQB-UNQGMJICSA-N Phe-Arg-Thr Chemical compound C[C@@H](O)[C@H](NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@@H](N)Cc1ccccc1)C(O)=O ZWJKVFAYPLPCQB-UNQGMJICSA-N 0.000 description 1
- UUWCIPUVJJIEEP-SRVKXCTJSA-N Phe-Asn-Cys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CS)C(=O)O)N UUWCIPUVJJIEEP-SRVKXCTJSA-N 0.000 description 1
- XMPUYNHKEPFERE-IHRRRGAJSA-N Phe-Asp-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 XMPUYNHKEPFERE-IHRRRGAJSA-N 0.000 description 1
- SWZKMTDPQXLQRD-XVSYOHENSA-N Phe-Asp-Thr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SWZKMTDPQXLQRD-XVSYOHENSA-N 0.000 description 1
- AEEQKUDWJGOFQI-SRVKXCTJSA-N Phe-Cys-Cys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CS)C(=O)O)N AEEQKUDWJGOFQI-SRVKXCTJSA-N 0.000 description 1
- HPECNYCQLSVCHH-BZSNNMDCSA-N Phe-Cys-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC2=CC=CC=C2)C(=O)O)N HPECNYCQLSVCHH-BZSNNMDCSA-N 0.000 description 1
- IILUKIJNFMUBNF-IHRRRGAJSA-N Phe-Gln-Gln Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O IILUKIJNFMUBNF-IHRRRGAJSA-N 0.000 description 1
- CTNODEMQIKCZGQ-JYJNAYRXSA-N Phe-Gln-His Chemical compound C([C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC=1NC=NC=1)C(O)=O)C1=CC=CC=C1 CTNODEMQIKCZGQ-JYJNAYRXSA-N 0.000 description 1
- WYPVCIACUMJRIB-JYJNAYRXSA-N Phe-Gln-Lys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N WYPVCIACUMJRIB-JYJNAYRXSA-N 0.000 description 1
- FIRWJEJVFFGXSH-RYUDHWBXSA-N Phe-Glu-Gly Chemical compound OC(=O)CNC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 FIRWJEJVFFGXSH-RYUDHWBXSA-N 0.000 description 1
- KJJROSNFBRWPHS-JYJNAYRXSA-N Phe-Glu-Leu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O KJJROSNFBRWPHS-JYJNAYRXSA-N 0.000 description 1
- JJHVFCUWLSKADD-ONGXEEELSA-N Phe-Gly-Ala Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)NCC(=O)N[C@@H](C)C(O)=O JJHVFCUWLSKADD-ONGXEEELSA-N 0.000 description 1
- RFEXGCASCQGGHZ-STQMWFEESA-N Phe-Gly-Arg Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)NCC(=O)N[C@@H](CCCNC(N)=N)C(O)=O RFEXGCASCQGGHZ-STQMWFEESA-N 0.000 description 1
- WPTYDQPGBMDUBI-QWRGUYRKSA-N Phe-Gly-Asn Chemical compound N[C@@H](Cc1ccccc1)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O WPTYDQPGBMDUBI-QWRGUYRKSA-N 0.000 description 1
- JEBWZLWTRPZQRX-QWRGUYRKSA-N Phe-Gly-Asp Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O JEBWZLWTRPZQRX-QWRGUYRKSA-N 0.000 description 1
- MMYUOSCXBJFUNV-QWRGUYRKSA-N Phe-Gly-Cys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)NCC(=O)N[C@@H](CS)C(=O)O)N MMYUOSCXBJFUNV-QWRGUYRKSA-N 0.000 description 1
- ZLGQEBCCANLYRA-RYUDHWBXSA-N Phe-Gly-Glu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O ZLGQEBCCANLYRA-RYUDHWBXSA-N 0.000 description 1
- NHCKESBLOMHIIE-IRXDYDNUSA-N Phe-Gly-Phe Chemical compound C([C@H](N)C(=O)NCC(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 NHCKESBLOMHIIE-IRXDYDNUSA-N 0.000 description 1
- PMKIMKUGCSVFSV-CQDKDKBSSA-N Phe-His-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CC2=CC=CC=C2)N PMKIMKUGCSVFSV-CQDKDKBSSA-N 0.000 description 1
- SPXWRYVHOZVYBU-ULQDDVLXSA-N Phe-His-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CC2=CC=CC=C2)N SPXWRYVHOZVYBU-ULQDDVLXSA-N 0.000 description 1
- FXPZZKBHNOMLGA-HJWJTTGWSA-N Phe-Ile-Arg Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N FXPZZKBHNOMLGA-HJWJTTGWSA-N 0.000 description 1
- DVOCGBNHAUHKHJ-DKIMLUQUSA-N Phe-Ile-Leu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O DVOCGBNHAUHKHJ-DKIMLUQUSA-N 0.000 description 1
- CWFGECHCRMGPPT-MXAVVETBSA-N Phe-Ile-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O CWFGECHCRMGPPT-MXAVVETBSA-N 0.000 description 1
- NRKNYPRRWXVELC-NQCBNZPSSA-N Phe-Ile-Trp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CC3=CC=CC=C3)N NRKNYPRRWXVELC-NQCBNZPSSA-N 0.000 description 1
- RSPUIENXSJYZQO-JYJNAYRXSA-N Phe-Leu-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 RSPUIENXSJYZQO-JYJNAYRXSA-N 0.000 description 1
- KDYPMIZMXDECSU-JYJNAYRXSA-N Phe-Leu-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 KDYPMIZMXDECSU-JYJNAYRXSA-N 0.000 description 1
- METZZBCMDXHFMK-BZSNNMDCSA-N Phe-Leu-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC2=CC=CC=C2)N METZZBCMDXHFMK-BZSNNMDCSA-N 0.000 description 1
- INHMISZWLJZQGH-ULQDDVLXSA-N Phe-Leu-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 INHMISZWLJZQGH-ULQDDVLXSA-N 0.000 description 1
- DNAXXTQSTKOHFO-QEJZJMRPSA-N Phe-Lys-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC1=CC=CC=C1 DNAXXTQSTKOHFO-QEJZJMRPSA-N 0.000 description 1
- MJAYDXWQQUOURZ-JYJNAYRXSA-N Phe-Lys-Gln Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(O)=O MJAYDXWQQUOURZ-JYJNAYRXSA-N 0.000 description 1
- DOXQMJCSSYZSNM-BZSNNMDCSA-N Phe-Lys-Leu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O DOXQMJCSSYZSNM-BZSNNMDCSA-N 0.000 description 1
- IEOHQGFKHXUALJ-JYJNAYRXSA-N Phe-Met-Arg Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O IEOHQGFKHXUALJ-JYJNAYRXSA-N 0.000 description 1
- PTLMYJOMJLTMCB-KKUMJFAQSA-N Phe-Met-Gln Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N PTLMYJOMJLTMCB-KKUMJFAQSA-N 0.000 description 1
- SRILZRSXIKRGBF-HRCADAONSA-N Phe-Met-Pro Chemical compound CSCC[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CC=CC=C2)N SRILZRSXIKRGBF-HRCADAONSA-N 0.000 description 1
- GKZIWHRNKRBEOH-HOTGVXAUSA-N Phe-Phe Chemical compound C([C@H]([NH3+])C(=O)N[C@@H](CC=1C=CC=CC=1)C([O-])=O)C1=CC=CC=C1 GKZIWHRNKRBEOH-HOTGVXAUSA-N 0.000 description 1
- OWSLLRKCHLTUND-BZSNNMDCSA-N Phe-Phe-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CC=CC=C2)C(=O)N[C@@H](CC(=O)N)C(=O)O)N OWSLLRKCHLTUND-BZSNNMDCSA-N 0.000 description 1
- IWZRODDWOSIXPZ-IRXDYDNUSA-N Phe-Phe-Gly Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(=O)NCC(O)=O)C1=CC=CC=C1 IWZRODDWOSIXPZ-IRXDYDNUSA-N 0.000 description 1
- CBENHWCORLVGEQ-HJOGWXRNSA-N Phe-Phe-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 CBENHWCORLVGEQ-HJOGWXRNSA-N 0.000 description 1
- DSXPMZMSJHOKKK-HJOGWXRNSA-N Phe-Phe-Tyr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O DSXPMZMSJHOKKK-HJOGWXRNSA-N 0.000 description 1
- MMJJFXWMCMJMQA-STQMWFEESA-N Phe-Pro-Gly Chemical compound C([C@H](N)C(=O)N1[C@@H](CCC1)C(=O)NCC(O)=O)C1=CC=CC=C1 MMJJFXWMCMJMQA-STQMWFEESA-N 0.000 description 1
- XOHJOMKCRLHGCY-UNQGMJICSA-N Phe-Pro-Thr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O XOHJOMKCRLHGCY-UNQGMJICSA-N 0.000 description 1
- AFNJAQVMTIQTCB-DLOVCJGASA-N Phe-Ser-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CC=CC=C1 AFNJAQVMTIQTCB-DLOVCJGASA-N 0.000 description 1
- XDMMOISUAHXXFD-SRVKXCTJSA-N Phe-Ser-Asp Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O XDMMOISUAHXXFD-SRVKXCTJSA-N 0.000 description 1
- BONHGTUEEPIMPM-AVGNSLFASA-N Phe-Ser-Glu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(O)=O BONHGTUEEPIMPM-AVGNSLFASA-N 0.000 description 1
- ILGCZYGFYQLSDZ-KKUMJFAQSA-N Phe-Ser-His Chemical compound N[C@@H](Cc1ccccc1)C(=O)N[C@@H](CO)C(=O)N[C@@H](Cc1cnc[nH]1)C(O)=O ILGCZYGFYQLSDZ-KKUMJFAQSA-N 0.000 description 1
- IAOZOFPONWDXNT-IXOXFDKPSA-N Phe-Ser-Thr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O IAOZOFPONWDXNT-IXOXFDKPSA-N 0.000 description 1
- XNMYNGDKJNOKHH-BZSNNMDCSA-N Phe-Ser-Tyr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O XNMYNGDKJNOKHH-BZSNNMDCSA-N 0.000 description 1
- MVIJMIZJPHQGEN-IHRRRGAJSA-N Phe-Ser-Val Chemical compound CC(C)[C@@H](C([O-])=O)NC(=O)[C@H](CO)NC(=O)[C@@H]([NH3+])CC1=CC=CC=C1 MVIJMIZJPHQGEN-IHRRRGAJSA-N 0.000 description 1
- BSKMOCNNLNDIMU-CDMKHQONSA-N Phe-Thr-Gly Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O BSKMOCNNLNDIMU-CDMKHQONSA-N 0.000 description 1
- KLYYKKGCPOGDPE-OEAJRASXSA-N Phe-Thr-Leu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O KLYYKKGCPOGDPE-OEAJRASXSA-N 0.000 description 1
- OLZVAVSJEUAOHI-UNQGMJICSA-N Phe-Thr-Met Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N)O OLZVAVSJEUAOHI-UNQGMJICSA-N 0.000 description 1
- GNRMAQSIROFNMI-IXOXFDKPSA-N Phe-Thr-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O GNRMAQSIROFNMI-IXOXFDKPSA-N 0.000 description 1
- YFXXRYFWJFQAFW-JHYOHUSXSA-N Phe-Thr-Thr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N)O YFXXRYFWJFQAFW-JHYOHUSXSA-N 0.000 description 1
- VGTJSEYTVMAASM-RPTUDFQQSA-N Phe-Thr-Tyr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O VGTJSEYTVMAASM-RPTUDFQQSA-N 0.000 description 1
- APXXVISUHOLGEE-ILWGZMRPSA-N Phe-Trp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CNC3=CC=CC=C32)NC(=O)[C@H](CC4=CC=CC=C4)N)C(=O)O APXXVISUHOLGEE-ILWGZMRPSA-N 0.000 description 1
- VFDRDMOMHBJGKD-UFYCRDLUSA-N Phe-Tyr-Arg Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N VFDRDMOMHBJGKD-UFYCRDLUSA-N 0.000 description 1
- CDHURCQGUDNBMA-UBHSHLNASA-N Phe-Val-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 CDHURCQGUDNBMA-UBHSHLNASA-N 0.000 description 1
- GOUWCZRDTWTODO-YDHLFZDLSA-N Phe-Val-Asn Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O GOUWCZRDTWTODO-YDHLFZDLSA-N 0.000 description 1
- JSGWNFKWZNPDAV-YDHLFZDLSA-N Phe-Val-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 JSGWNFKWZNPDAV-YDHLFZDLSA-N 0.000 description 1
- DXWNFNOPBYAFRM-IHRRRGAJSA-N Phe-Val-Cys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N DXWNFNOPBYAFRM-IHRRRGAJSA-N 0.000 description 1
- JTKGCYOOJLUETJ-ULQDDVLXSA-N Phe-Val-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 JTKGCYOOJLUETJ-ULQDDVLXSA-N 0.000 description 1
- MWQXFDIQXIXPMS-UNQGMJICSA-N Phe-Val-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CC1=CC=CC=C1)N)O MWQXFDIQXIXPMS-UNQGMJICSA-N 0.000 description 1
- APZNYJFGVAGFCF-JYJNAYRXSA-N Phe-Val-Val Chemical compound CC(C)[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)Cc1ccccc1)C(C)C)C(O)=O APZNYJFGVAGFCF-JYJNAYRXSA-N 0.000 description 1
- 108091036407 Polyadenylation Proteins 0.000 description 1
- DBALDZKOTNSBFM-FXQIFTODSA-N Pro-Ala-Asn Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(O)=O DBALDZKOTNSBFM-FXQIFTODSA-N 0.000 description 1
- APKRGYLBSCWJJP-FXQIFTODSA-N Pro-Ala-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(O)=O APKRGYLBSCWJJP-FXQIFTODSA-N 0.000 description 1
- FYQSMXKJYTZYRP-DCAQKATOSA-N Pro-Ala-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1 FYQSMXKJYTZYRP-DCAQKATOSA-N 0.000 description 1
- DRVIASBABBMZTF-GUBZILKMSA-N Pro-Ala-Met Chemical compound C[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@@H]1CCCN1 DRVIASBABBMZTF-GUBZILKMSA-N 0.000 description 1
- CQZNGNCAIXMAIQ-UBHSHLNASA-N Pro-Ala-Phe Chemical compound C[C@H](NC(=O)[C@@H]1CCCN1)C(=O)N[C@@H](Cc1ccccc1)C(O)=O CQZNGNCAIXMAIQ-UBHSHLNASA-N 0.000 description 1
- LCRSGSIRKLXZMZ-BPNCWPANSA-N Pro-Ala-Tyr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O LCRSGSIRKLXZMZ-BPNCWPANSA-N 0.000 description 1
- OCSACVPBMIYNJE-GUBZILKMSA-N Pro-Arg-Asn Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(O)=O OCSACVPBMIYNJE-GUBZILKMSA-N 0.000 description 1
- SSSFPISOZOLQNP-GUBZILKMSA-N Pro-Arg-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O SSSFPISOZOLQNP-GUBZILKMSA-N 0.000 description 1
- HPXVFFIIGOAQRV-DCAQKATOSA-N Pro-Arg-Gln Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(O)=O HPXVFFIIGOAQRV-DCAQKATOSA-N 0.000 description 1
- BNBBNGZZKQUWCD-IUCAKERBSA-N Pro-Arg-Gly Chemical compound NC(N)=NCCC[C@@H](C(=O)NCC(O)=O)NC(=O)[C@@H]1CCCN1 BNBBNGZZKQUWCD-IUCAKERBSA-N 0.000 description 1
- ICTZKEXYDDZZFP-SRVKXCTJSA-N Pro-Arg-Pro Chemical compound N([C@@H](CCCN=C(N)N)C(=O)N1[C@@H](CCC1)C(O)=O)C(=O)[C@@H]1CCCN1 ICTZKEXYDDZZFP-SRVKXCTJSA-N 0.000 description 1
- ZSKJPKFTPQCPIH-RCWTZXSCSA-N Pro-Arg-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ZSKJPKFTPQCPIH-RCWTZXSCSA-N 0.000 description 1
- WECYCNFPGZLOOU-FXQIFTODSA-N Pro-Asn-Cys Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CS)C(=O)O WECYCNFPGZLOOU-FXQIFTODSA-N 0.000 description 1
- FUVBEZJCRMHWEM-FXQIFTODSA-N Pro-Asn-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O FUVBEZJCRMHWEM-FXQIFTODSA-N 0.000 description 1
- WPQKSRHDTMRSJM-CIUDSAMLSA-N Pro-Asp-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H]1CCCN1 WPQKSRHDTMRSJM-CIUDSAMLSA-N 0.000 description 1
- KPDRZQUWJKTMBP-DCAQKATOSA-N Pro-Asp-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@@H]1CCCN1 KPDRZQUWJKTMBP-DCAQKATOSA-N 0.000 description 1
- XKHCJJPNXFBADI-DCAQKATOSA-N Pro-Asp-Lys Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O XKHCJJPNXFBADI-DCAQKATOSA-N 0.000 description 1
- HXOLCSYHGRNXJJ-IHRRRGAJSA-N Pro-Asp-Phe Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O HXOLCSYHGRNXJJ-IHRRRGAJSA-N 0.000 description 1
- SFECXGVELZFBFJ-VEVYYDQMSA-N Pro-Asp-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SFECXGVELZFBFJ-VEVYYDQMSA-N 0.000 description 1
- XUSDDSLCRPUKLP-QXEWZRGKSA-N Pro-Asp-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H]1CCCN1 XUSDDSLCRPUKLP-QXEWZRGKSA-N 0.000 description 1
- DIZLUAZLNDFDPR-CIUDSAMLSA-N Pro-Cys-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CS)NC(=O)[C@@H]1CCCN1 DIZLUAZLNDFDPR-CIUDSAMLSA-N 0.000 description 1
- NOXSEHJOXCWRHK-DCAQKATOSA-N Pro-Cys-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@@H]1CCCN1 NOXSEHJOXCWRHK-DCAQKATOSA-N 0.000 description 1
- LANQLYHLMYDWJP-SRVKXCTJSA-N Pro-Gln-Lys Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O LANQLYHLMYDWJP-SRVKXCTJSA-N 0.000 description 1
- UAYHMOIGIQZLFR-NHCYSSNCSA-N Pro-Gln-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O UAYHMOIGIQZLFR-NHCYSSNCSA-N 0.000 description 1
- LHALYDBUDCWMDY-CIUDSAMLSA-N Pro-Glu-Ala Chemical compound C[C@H](NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H]1CCCN1)C(O)=O LHALYDBUDCWMDY-CIUDSAMLSA-N 0.000 description 1
- FRKBNXCFJBPJOL-GUBZILKMSA-N Pro-Glu-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O FRKBNXCFJBPJOL-GUBZILKMSA-N 0.000 description 1
- NMELOOXSGDRBRU-YUMQZZPRSA-N Pro-Glu-Gly Chemical compound OC(=O)CNC(=O)[C@H](CCC(=O)O)NC(=O)[C@@H]1CCCN1 NMELOOXSGDRBRU-YUMQZZPRSA-N 0.000 description 1
- WVOXLKUUVCCCSU-ZPFDUUQYSA-N Pro-Glu-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WVOXLKUUVCCCSU-ZPFDUUQYSA-N 0.000 description 1
- VOZIBWWZSBIXQN-SRVKXCTJSA-N Pro-Glu-Lys Chemical compound NCCCC[C@H](NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H]1CCCN1)C(O)=O VOZIBWWZSBIXQN-SRVKXCTJSA-N 0.000 description 1
- WFHYFCWBLSKEMS-KKUMJFAQSA-N Pro-Glu-Phe Chemical compound N([C@@H](CCC(=O)O)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C(=O)[C@@H]1CCCN1 WFHYFCWBLSKEMS-KKUMJFAQSA-N 0.000 description 1
- LXVLKXPFIDDHJG-CIUDSAMLSA-N Pro-Glu-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O LXVLKXPFIDDHJG-CIUDSAMLSA-N 0.000 description 1
- CLNJSLSHKJECME-BQBZGAKWSA-N Pro-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)[C@@H]1CCCN1 CLNJSLSHKJECME-BQBZGAKWSA-N 0.000 description 1
- ULIWFCCJIOEHMU-BQBZGAKWSA-N Pro-Gly-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H]1CCCN1 ULIWFCCJIOEHMU-BQBZGAKWSA-N 0.000 description 1
- WSRWHZRUOCACLJ-UWVGGRQHSA-N Pro-Gly-His Chemical compound C([C@@H](C(=O)O)NC(=O)CNC(=O)[C@H]1NCCC1)C1=CN=CN1 WSRWHZRUOCACLJ-UWVGGRQHSA-N 0.000 description 1
- VZKBJNBZMZHKRC-XUXIUFHCSA-N Pro-Ile-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O VZKBJNBZMZHKRC-XUXIUFHCSA-N 0.000 description 1
- LXLFEIHKWGHJJB-XUXIUFHCSA-N Pro-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@@H]1CCCN1 LXLFEIHKWGHJJB-XUXIUFHCSA-N 0.000 description 1
- FKVNLUZHSFCNGY-RVMXOQNASA-N Pro-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@@H]2CCCN2 FKVNLUZHSFCNGY-RVMXOQNASA-N 0.000 description 1
- CLJLVCYFABNTHP-DCAQKATOSA-N Pro-Leu-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O CLJLVCYFABNTHP-DCAQKATOSA-N 0.000 description 1
- FXGIMYRVJJEIIM-UWVGGRQHSA-N Pro-Leu-Gly Chemical compound OC(=O)CNC(=O)[C@H](CC(C)C)NC(=O)[C@@H]1CCCN1 FXGIMYRVJJEIIM-UWVGGRQHSA-N 0.000 description 1
- DRKAXLDECUGLFE-ULQDDVLXSA-N Pro-Leu-Phe Chemical compound CC(C)C[C@H](NC(=O)[C@@H]1CCCN1)C(=O)N[C@@H](Cc1ccccc1)C(O)=O DRKAXLDECUGLFE-ULQDDVLXSA-N 0.000 description 1
- FKYKZHOKDOPHSA-DCAQKATOSA-N Pro-Leu-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O FKYKZHOKDOPHSA-DCAQKATOSA-N 0.000 description 1
- SXMSEHDMNIUTSP-DCAQKATOSA-N Pro-Lys-Asn Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O SXMSEHDMNIUTSP-DCAQKATOSA-N 0.000 description 1
- XQPHBAKJJJZOBX-SRVKXCTJSA-N Pro-Lys-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O XQPHBAKJJJZOBX-SRVKXCTJSA-N 0.000 description 1
- RMODQFBNDDENCP-IHRRRGAJSA-N Pro-Lys-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O RMODQFBNDDENCP-IHRRRGAJSA-N 0.000 description 1
- BARPGRUZBKFJMA-SRVKXCTJSA-N Pro-Met-Arg Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@@H]1CCCN1 BARPGRUZBKFJMA-SRVKXCTJSA-N 0.000 description 1
- AUYKOPJPKUCYHE-SRVKXCTJSA-N Pro-Met-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@@H]1CCCN1 AUYKOPJPKUCYHE-SRVKXCTJSA-N 0.000 description 1
- ZUZINZIJHJFJRN-UBHSHLNASA-N Pro-Phe-Ala Chemical compound C([C@@H](C(=O)N[C@@H](C)C(O)=O)NC(=O)[C@H]1NCCC1)C1=CC=CC=C1 ZUZINZIJHJFJRN-UBHSHLNASA-N 0.000 description 1
- KDBHVPXBQADZKY-GUBZILKMSA-N Pro-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 KDBHVPXBQADZKY-GUBZILKMSA-N 0.000 description 1
- JLMZKEQFMVORMA-SRVKXCTJSA-N Pro-Pro-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 JLMZKEQFMVORMA-SRVKXCTJSA-N 0.000 description 1
- FHZJRBVMLGOHBX-GUBZILKMSA-N Pro-Pro-Asp Chemical compound OC(=O)C[C@H](NC(=O)[C@@H]1CCCN1C(=O)[C@@H]1CCCN1)C(O)=O FHZJRBVMLGOHBX-GUBZILKMSA-N 0.000 description 1
- LEIKGVHQTKHOLM-IUCAKERBSA-N Pro-Pro-Gly Chemical compound OC(=O)CNC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 LEIKGVHQTKHOLM-IUCAKERBSA-N 0.000 description 1
- CGSOWZUPLOKYOR-AVGNSLFASA-N Pro-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 CGSOWZUPLOKYOR-AVGNSLFASA-N 0.000 description 1
- RCYUBVHMVUHEBM-RCWTZXSCSA-N Pro-Pro-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O RCYUBVHMVUHEBM-RCWTZXSCSA-N 0.000 description 1
- KBUAPZAZPWNYSW-SRVKXCTJSA-N Pro-Pro-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 KBUAPZAZPWNYSW-SRVKXCTJSA-N 0.000 description 1
- FNGOXVQBBCMFKV-CIUDSAMLSA-N Pro-Ser-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(O)=O FNGOXVQBBCMFKV-CIUDSAMLSA-N 0.000 description 1
- RNEFESSBTOQSAC-DCAQKATOSA-N Pro-Ser-His Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O RNEFESSBTOQSAC-DCAQKATOSA-N 0.000 description 1
- BJCXXMGGPHRSHV-GUBZILKMSA-N Pro-Ser-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@@H]1CCCN1 BJCXXMGGPHRSHV-GUBZILKMSA-N 0.000 description 1
- SNGZLPOXVRTNMB-LPEHRKFASA-N Pro-Ser-Pro Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CO)C(=O)N2CCC[C@@H]2C(=O)O SNGZLPOXVRTNMB-LPEHRKFASA-N 0.000 description 1
- PRKWBYCXBBSLSK-GUBZILKMSA-N Pro-Ser-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O PRKWBYCXBBSLSK-GUBZILKMSA-N 0.000 description 1
- KIDXAAQVMNLJFQ-KZVJFYERSA-N Pro-Thr-Ala Chemical compound C[C@@H](O)[C@H](NC(=O)[C@@H]1CCCN1)C(=O)N[C@@H](C)C(O)=O KIDXAAQVMNLJFQ-KZVJFYERSA-N 0.000 description 1
- QUBVFEANYYWBTM-VEVYYDQMSA-N Pro-Thr-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(O)=O QUBVFEANYYWBTM-VEVYYDQMSA-N 0.000 description 1
- HRIXMVRZRGFKNQ-HJGDQZAQSA-N Pro-Thr-Gln Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(O)=O HRIXMVRZRGFKNQ-HJGDQZAQSA-N 0.000 description 1
- GXWRTSIVLSQACD-RCWTZXSCSA-N Pro-Thr-Met Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@@H]1CCCN1)O GXWRTSIVLSQACD-RCWTZXSCSA-N 0.000 description 1
- AIOWVDNPESPXRB-YTWAJWBKSA-N Pro-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@@H]2CCCN2)O AIOWVDNPESPXRB-YTWAJWBKSA-N 0.000 description 1
- DLZBBDSPTJBOOD-BPNCWPANSA-N Pro-Tyr-Ala Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C)C(O)=O DLZBBDSPTJBOOD-BPNCWPANSA-N 0.000 description 1
- LZHHZYDPMZEMRX-STQMWFEESA-N Pro-Tyr-Gly Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(O)=O LZHHZYDPMZEMRX-STQMWFEESA-N 0.000 description 1
- DYJTXTCEXMCPBF-UFYCRDLUSA-N Pro-Tyr-Phe Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)N[C@@H](CC3=CC=CC=C3)C(=O)O DYJTXTCEXMCPBF-UFYCRDLUSA-N 0.000 description 1
- VEUACYMXJKXALX-IHRRRGAJSA-N Pro-Tyr-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(O)=O VEUACYMXJKXALX-IHRRRGAJSA-N 0.000 description 1
- XDKKMRPRRCOELJ-GUBZILKMSA-N Pro-Val-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C(C)C)NC(=O)[C@@H]1CCCN1 XDKKMRPRRCOELJ-GUBZILKMSA-N 0.000 description 1
- WWXNZNWZNZPDIF-SRVKXCTJSA-N Pro-Val-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H]1CCCN1 WWXNZNWZNZPDIF-SRVKXCTJSA-N 0.000 description 1
- STGVYUTZKGPRCI-GUBZILKMSA-N Pro-Val-Cys Chemical compound SC[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H]1CCCN1 STGVYUTZKGPRCI-GUBZILKMSA-N 0.000 description 1
- KHRLUIPIMIQFGT-AVGNSLFASA-N Pro-Val-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O KHRLUIPIMIQFGT-AVGNSLFASA-N 0.000 description 1
- VDHGTOHMHHQSKG-JYJNAYRXSA-N Pro-Val-Phe Chemical compound CC(C)[C@H](NC(=O)[C@@H]1CCCN1)C(=O)N[C@@H](Cc1ccccc1)C(O)=O VDHGTOHMHHQSKG-JYJNAYRXSA-N 0.000 description 1
- 102000001253 Protein Kinase Human genes 0.000 description 1
- 241000125945 Protoparvovirus Species 0.000 description 1
- 101150030723 RIR2 gene Proteins 0.000 description 1
- 108020004511 Recombinant DNA Proteins 0.000 description 1
- PYMYPHUHKUWMLA-LMVFSUKVSA-N Ribose Natural products OC[C@@H](O)[C@@H](O)[C@@H](O)C=O PYMYPHUHKUWMLA-LMVFSUKVSA-N 0.000 description 1
- LVVBAKCGXXUHFO-ZLUOBGJFSA-N Ser-Ala-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(O)=O LVVBAKCGXXUHFO-ZLUOBGJFSA-N 0.000 description 1
- DWUIECHTAMYEFL-XVYDVKMFSA-N Ser-Ala-His Chemical compound OC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 DWUIECHTAMYEFL-XVYDVKMFSA-N 0.000 description 1
- BRKHVZNDAOMAHX-BIIVOSGPSA-N Ser-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CO)N BRKHVZNDAOMAHX-BIIVOSGPSA-N 0.000 description 1
- YQHZVYJAGWMHES-ZLUOBGJFSA-N Ser-Ala-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O YQHZVYJAGWMHES-ZLUOBGJFSA-N 0.000 description 1
- JPIDMRXXNMIVKY-VZFHVOOUSA-N Ser-Ala-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O JPIDMRXXNMIVKY-VZFHVOOUSA-N 0.000 description 1
- IDCKUIWEIZYVSO-WFBYXXMGSA-N Ser-Ala-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)CO)C)C(O)=O)=CNC2=C1 IDCKUIWEIZYVSO-WFBYXXMGSA-N 0.000 description 1
- PZZJMBYSYAKYPK-UWJYBYFXSA-N Ser-Ala-Tyr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O PZZJMBYSYAKYPK-UWJYBYFXSA-N 0.000 description 1
- HBZBPFLJNDXRAY-FXQIFTODSA-N Ser-Ala-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O HBZBPFLJNDXRAY-FXQIFTODSA-N 0.000 description 1
- NRCJWSGXMAPYQX-LPEHRKFASA-N Ser-Arg-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CO)N)C(=O)O NRCJWSGXMAPYQX-LPEHRKFASA-N 0.000 description 1
- OYEDZGNMSBZCIM-XGEHTFHBSA-N Ser-Arg-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OYEDZGNMSBZCIM-XGEHTFHBSA-N 0.000 description 1
- WXUBSIDKNMFAGS-IHRRRGAJSA-N Ser-Arg-Tyr Chemical compound NC(N)=NCCC[C@H](NC(=O)[C@H](CO)N)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 WXUBSIDKNMFAGS-IHRRRGAJSA-N 0.000 description 1
- HBOABDXGTMMDSE-GUBZILKMSA-N Ser-Arg-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(O)=O HBOABDXGTMMDSE-GUBZILKMSA-N 0.000 description 1
- FIDMVVBUOCMMJG-CIUDSAMLSA-N Ser-Asn-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CO FIDMVVBUOCMMJG-CIUDSAMLSA-N 0.000 description 1
- DKKGAAJTDKHWOD-BIIVOSGPSA-N Ser-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)[C@H](CO)N)C(=O)O DKKGAAJTDKHWOD-BIIVOSGPSA-N 0.000 description 1
- TYYBJUYSTWJHGO-ZKWXMUAHSA-N Ser-Asn-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O TYYBJUYSTWJHGO-ZKWXMUAHSA-N 0.000 description 1
- SWSRFJZZMNLMLY-ZKWXMUAHSA-N Ser-Asp-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O SWSRFJZZMNLMLY-ZKWXMUAHSA-N 0.000 description 1
- BLPYXIXXCFVIIF-FXQIFTODSA-N Ser-Cys-Arg Chemical compound C(C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CO)N)CN=C(N)N BLPYXIXXCFVIIF-FXQIFTODSA-N 0.000 description 1
- XWCYBVBLJRWOFR-WDSKDSINSA-N Ser-Gln-Gly Chemical compound OC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O XWCYBVBLJRWOFR-WDSKDSINSA-N 0.000 description 1
- SMIDBHKWSYUBRZ-ACZMJKKPSA-N Ser-Glu-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O SMIDBHKWSYUBRZ-ACZMJKKPSA-N 0.000 description 1
- YQQKYAZABFEYAF-FXQIFTODSA-N Ser-Glu-Gln Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O YQQKYAZABFEYAF-FXQIFTODSA-N 0.000 description 1
- BRGQQXQKPUCUJQ-KBIXCLLPSA-N Ser-Glu-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O BRGQQXQKPUCUJQ-KBIXCLLPSA-N 0.000 description 1
- UFKPDBLKLOBMRH-XHNCKOQMSA-N Ser-Glu-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CO)N)C(=O)O UFKPDBLKLOBMRH-XHNCKOQMSA-N 0.000 description 1
- GZBKRJVCRMZAST-XKBZYTNZSA-N Ser-Glu-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GZBKRJVCRMZAST-XKBZYTNZSA-N 0.000 description 1
- UQFYNFTYDHUIMI-WHFBIAKZSA-N Ser-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)[C@@H](N)CO UQFYNFTYDHUIMI-WHFBIAKZSA-N 0.000 description 1
- MUARUIBTKQJKFY-WHFBIAKZSA-N Ser-Gly-Asp Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O MUARUIBTKQJKFY-WHFBIAKZSA-N 0.000 description 1
- MIJWOJAXARLEHA-WDSKDSINSA-N Ser-Gly-Glu Chemical compound OC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(O)=O MIJWOJAXARLEHA-WDSKDSINSA-N 0.000 description 1
- GZFAWAQTEYDKII-YUMQZZPRSA-N Ser-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CO GZFAWAQTEYDKII-YUMQZZPRSA-N 0.000 description 1
- JFWDJFULOLKQFY-QWRGUYRKSA-N Ser-Gly-Phe Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O JFWDJFULOLKQFY-QWRGUYRKSA-N 0.000 description 1
- UIGMAMGZOJVTDN-WHFBIAKZSA-N Ser-Gly-Ser Chemical compound OC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O UIGMAMGZOJVTDN-WHFBIAKZSA-N 0.000 description 1
- FYUIFUJFNCLUIX-XVYDVKMFSA-N Ser-His-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(O)=O FYUIFUJFNCLUIX-XVYDVKMFSA-N 0.000 description 1
- SFTZTYBXIXLRGQ-JBDRJPRFSA-N Ser-Ile-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O SFTZTYBXIXLRGQ-JBDRJPRFSA-N 0.000 description 1
- HBTCFCHYALPXME-HTFCKZLJSA-N Ser-Ile-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O HBTCFCHYALPXME-HTFCKZLJSA-N 0.000 description 1
- RIAKPZVSNBBNRE-BJDJZHNGSA-N Ser-Ile-Leu Chemical compound OC[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O RIAKPZVSNBBNRE-BJDJZHNGSA-N 0.000 description 1
- DOSZISJPMCYEHT-NAKRPEOUSA-N Ser-Ile-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O DOSZISJPMCYEHT-NAKRPEOUSA-N 0.000 description 1
- KCNSGAMPBPYUAI-CIUDSAMLSA-N Ser-Leu-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O KCNSGAMPBPYUAI-CIUDSAMLSA-N 0.000 description 1
- IAORETPTUDBBGV-CIUDSAMLSA-N Ser-Leu-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CO)N IAORETPTUDBBGV-CIUDSAMLSA-N 0.000 description 1
- MUJQWSAWLLRJCE-KATARQTJSA-N Ser-Leu-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MUJQWSAWLLRJCE-KATARQTJSA-N 0.000 description 1
- GZSZPKSBVAOGIE-CIUDSAMLSA-N Ser-Lys-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O GZSZPKSBVAOGIE-CIUDSAMLSA-N 0.000 description 1
- LRWBCWGEUCKDTN-BJDJZHNGSA-N Ser-Lys-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O LRWBCWGEUCKDTN-BJDJZHNGSA-N 0.000 description 1
- PTWIYDNFWPXQSD-GARJFASQSA-N Ser-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)[C@H](CO)N)C(=O)O PTWIYDNFWPXQSD-GARJFASQSA-N 0.000 description 1
- LRZLZIUXQBIWTB-KATARQTJSA-N Ser-Lys-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LRZLZIUXQBIWTB-KATARQTJSA-N 0.000 description 1
- UGGWCAFQPKANMW-FXQIFTODSA-N Ser-Met-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C)C(O)=O UGGWCAFQPKANMW-FXQIFTODSA-N 0.000 description 1
- XNXRTQZTFVMJIJ-DCAQKATOSA-N Ser-Met-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(O)=O XNXRTQZTFVMJIJ-DCAQKATOSA-N 0.000 description 1
- HEYZPTCCEIWHRO-IHRRRGAJSA-N Ser-Met-Phe Chemical compound OC[C@H](N)C(=O)N[C@@H](CCSC)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 HEYZPTCCEIWHRO-IHRRRGAJSA-N 0.000 description 1
- AXOHAHIUJHCLQR-IHRRRGAJSA-N Ser-Met-Tyr Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)[C@H](CO)N AXOHAHIUJHCLQR-IHRRRGAJSA-N 0.000 description 1
- JAWGSPUJAXYXJA-IHRRRGAJSA-N Ser-Phe-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](CO)N)CC1=CC=CC=C1 JAWGSPUJAXYXJA-IHRRRGAJSA-N 0.000 description 1
- XKFJENWJGHMDLI-QWRGUYRKSA-N Ser-Phe-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)NCC(O)=O XKFJENWJGHMDLI-QWRGUYRKSA-N 0.000 description 1
- UPLYXVPQLJVWMM-KKUMJFAQSA-N Ser-Phe-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O UPLYXVPQLJVWMM-KKUMJFAQSA-N 0.000 description 1
- MQUZANJDFOQOBX-SRVKXCTJSA-N Ser-Phe-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O MQUZANJDFOQOBX-SRVKXCTJSA-N 0.000 description 1
- QMCDMHWAKMUGJE-IHRRRGAJSA-N Ser-Phe-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(O)=O QMCDMHWAKMUGJE-IHRRRGAJSA-N 0.000 description 1
- NUEHQDHDLDXCRU-GUBZILKMSA-N Ser-Pro-Arg Chemical compound OC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCN=C(N)N)C(O)=O NUEHQDHDLDXCRU-GUBZILKMSA-N 0.000 description 1
- BSXKBOUZDAZXHE-CIUDSAMLSA-N Ser-Pro-Glu Chemical compound [H]N[C@@H](CO)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O BSXKBOUZDAZXHE-CIUDSAMLSA-N 0.000 description 1
- GZGFSPWOMUKKCV-NAKRPEOUSA-N Ser-Pro-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CO GZGFSPWOMUKKCV-NAKRPEOUSA-N 0.000 description 1
- XGQKSRGHEZNWIS-IHRRRGAJSA-N Ser-Pro-Tyr Chemical compound N[C@@H](CO)C(=O)N1CCC[C@H]1C(=O)N[C@@H](Cc1ccc(O)cc1)C(O)=O XGQKSRGHEZNWIS-IHRRRGAJSA-N 0.000 description 1
- VFWQQZMRKFOGLE-ZLUOBGJFSA-N Ser-Ser-Cys Chemical compound C([C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)O)N)O VFWQQZMRKFOGLE-ZLUOBGJFSA-N 0.000 description 1
- FZXOPYUEQGDGMS-ACZMJKKPSA-N Ser-Ser-Gln Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(O)=O FZXOPYUEQGDGMS-ACZMJKKPSA-N 0.000 description 1
- BMKNXTJLHFIAAH-CIUDSAMLSA-N Ser-Ser-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O BMKNXTJLHFIAAH-CIUDSAMLSA-N 0.000 description 1
- XQJCEKXQUJQNNK-ZLUOBGJFSA-N Ser-Ser-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O XQJCEKXQUJQNNK-ZLUOBGJFSA-N 0.000 description 1
- JURQXQBJKUHGJS-UHFFFAOYSA-N Ser-Ser-Ser-Ser Chemical compound OCC(N)C(=O)NC(CO)C(=O)NC(CO)C(=O)NC(CO)C(O)=O JURQXQBJKUHGJS-UHFFFAOYSA-N 0.000 description 1
- VGQVAVQWKJLIRM-FXQIFTODSA-N Ser-Ser-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O VGQVAVQWKJLIRM-FXQIFTODSA-N 0.000 description 1
- RXUOAOOZIWABBW-XGEHTFHBSA-N Ser-Thr-Arg Chemical compound OC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N RXUOAOOZIWABBW-XGEHTFHBSA-N 0.000 description 1
- SQHKXWODKJDZRC-LKXGYXEUSA-N Ser-Thr-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(O)=O SQHKXWODKJDZRC-LKXGYXEUSA-N 0.000 description 1
- SOACHCFYJMCMHC-BWBBJGPYSA-N Ser-Thr-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CO)N)O SOACHCFYJMCMHC-BWBBJGPYSA-N 0.000 description 1
- SZRNDHWMVSFPSP-XKBZYTNZSA-N Ser-Thr-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CO)N)O SZRNDHWMVSFPSP-XKBZYTNZSA-N 0.000 description 1
- PURRNJBBXDDWLX-ZDLURKLDSA-N Ser-Thr-Gly Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CO)N)O PURRNJBBXDDWLX-ZDLURKLDSA-N 0.000 description 1
- NADLKBTYNKUJEP-KATARQTJSA-N Ser-Thr-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O NADLKBTYNKUJEP-KATARQTJSA-N 0.000 description 1
- SNXUIBACCONSOH-BWBBJGPYSA-N Ser-Thr-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CO)C(O)=O SNXUIBACCONSOH-BWBBJGPYSA-N 0.000 description 1
- STIAINRLUUKYKM-WFBYXXMGSA-N Ser-Trp-Ala Chemical compound C1=CC=C2C(C[C@@H](C(=O)N[C@@H](C)C(O)=O)NC(=O)[C@@H](N)CO)=CNC2=C1 STIAINRLUUKYKM-WFBYXXMGSA-N 0.000 description 1
- PZHJLTWGMYERRJ-SRVKXCTJSA-N Ser-Tyr-Cys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CO)N)O PZHJLTWGMYERRJ-SRVKXCTJSA-N 0.000 description 1
- PMTWIUBUQRGCSB-FXQIFTODSA-N Ser-Val-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O PMTWIUBUQRGCSB-FXQIFTODSA-N 0.000 description 1
- LLSLRQOEAFCZLW-NRPADANISA-N Ser-Val-Gln Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O LLSLRQOEAFCZLW-NRPADANISA-N 0.000 description 1
- JZRYFUGREMECBH-XPUUQOCRSA-N Ser-Val-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O JZRYFUGREMECBH-XPUUQOCRSA-N 0.000 description 1
- ANOQEBQWIAYIMV-AEJSXWLSSA-N Ser-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CO)N ANOQEBQWIAYIMV-AEJSXWLSSA-N 0.000 description 1
- ODRUTDLAONAVDV-IHRRRGAJSA-N Ser-Val-Tyr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O ODRUTDLAONAVDV-IHRRRGAJSA-N 0.000 description 1
- 101710109576 Terminal protein Proteins 0.000 description 1
- MQCPGOZXFSYJPS-KZVJFYERSA-N Thr-Ala-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O MQCPGOZXFSYJPS-KZVJFYERSA-N 0.000 description 1
- NJEMRSFGDNECGF-GCJQMDKQSA-N Thr-Ala-Asp Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC(O)=O NJEMRSFGDNECGF-GCJQMDKQSA-N 0.000 description 1
- FQPQPTHMHZKGFM-XQXXSGGOSA-N Thr-Ala-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O FQPQPTHMHZKGFM-XQXXSGGOSA-N 0.000 description 1
- XSLXHSYIVPGEER-KZVJFYERSA-N Thr-Ala-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O XSLXHSYIVPGEER-KZVJFYERSA-N 0.000 description 1
- CAGTXGDOIFXLPC-KZVJFYERSA-N Thr-Arg-Ala Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C)C(O)=O)CCCN=C(N)N CAGTXGDOIFXLPC-KZVJFYERSA-N 0.000 description 1
- LHUBVKCLOVALIA-HJGDQZAQSA-N Thr-Arg-Gln Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(O)=O LHUBVKCLOVALIA-HJGDQZAQSA-N 0.000 description 1
- PKXHGEXFMIZSER-QTKMDUPCSA-N Thr-Arg-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N)O PKXHGEXFMIZSER-QTKMDUPCSA-N 0.000 description 1
- VFEHSAJCWWHDBH-RHYQMDGZSA-N Thr-Arg-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O VFEHSAJCWWHDBH-RHYQMDGZSA-N 0.000 description 1
- NAXBBCLCEOTAIG-RHYQMDGZSA-N Thr-Arg-Lys Chemical compound NC(N)=NCCC[C@H](NC(=O)[C@@H](N)[C@H](O)C)C(=O)N[C@@H](CCCCN)C(O)=O NAXBBCLCEOTAIG-RHYQMDGZSA-N 0.000 description 1
- UTSWGQNAQRIHAI-UNQGMJICSA-N Thr-Arg-Phe Chemical compound NC(N)=NCCC[C@H](NC(=O)[C@@H](N)[C@H](O)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 UTSWGQNAQRIHAI-UNQGMJICSA-N 0.000 description 1
- GZYNMZQXFRWDFH-YTWAJWBKSA-N Thr-Arg-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N1CCC[C@@H]1C(=O)O)N)O GZYNMZQXFRWDFH-YTWAJWBKSA-N 0.000 description 1
- SWIKDOUVROTZCW-GCJQMDKQSA-N Thr-Asn-Ala Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](C)C(=O)O)N)O SWIKDOUVROTZCW-GCJQMDKQSA-N 0.000 description 1
- GKMYGVQDGVYCPC-IUKAMOBKSA-N Thr-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H]([C@@H](C)O)N GKMYGVQDGVYCPC-IUKAMOBKSA-N 0.000 description 1
- UHBPFYOQQPFKQR-JHEQGTHGSA-N Thr-Gln-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O UHBPFYOQQPFKQR-JHEQGTHGSA-N 0.000 description 1
- LAFLAXHTDVNVEL-WDCWCFNPSA-N Thr-Gln-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N)O LAFLAXHTDVNVEL-WDCWCFNPSA-N 0.000 description 1
- DIPIPFHFLPTCLK-LOKLDPHHSA-N Thr-Gln-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N1CCC[C@@H]1C(=O)O)N)O DIPIPFHFLPTCLK-LOKLDPHHSA-N 0.000 description 1
- FHDLKMFZKRUQCE-HJGDQZAQSA-N Thr-Glu-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O FHDLKMFZKRUQCE-HJGDQZAQSA-N 0.000 description 1
- BIYXEUAFGLTAEM-WUJLRWPWSA-N Thr-Gly Chemical compound C[C@@H](O)[C@H](N)C(=O)NCC(O)=O BIYXEUAFGLTAEM-WUJLRWPWSA-N 0.000 description 1
- SLUWOCTZVGMURC-BFHQHQDPSA-N Thr-Gly-Ala Chemical compound C[C@@H](O)[C@H](N)C(=O)NCC(=O)N[C@@H](C)C(O)=O SLUWOCTZVGMURC-BFHQHQDPSA-N 0.000 description 1
- WYKJENSCCRJLRC-ZDLURKLDSA-N Thr-Gly-Cys Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)N[C@@H](CS)C(=O)O)N)O WYKJENSCCRJLRC-ZDLURKLDSA-N 0.000 description 1
- XPNSAQMEAVSQRD-FBCQKBJTSA-N Thr-Gly-Gly Chemical compound C[C@@H](O)[C@H](N)C(=O)NCC(=O)NCC(O)=O XPNSAQMEAVSQRD-FBCQKBJTSA-N 0.000 description 1
- YZUWGFXVVZQJEI-PMVVWTBXSA-N Thr-Gly-His Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N)O YZUWGFXVVZQJEI-PMVVWTBXSA-N 0.000 description 1
- DJDSEDOKJTZBAR-ZDLURKLDSA-N Thr-Gly-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O DJDSEDOKJTZBAR-ZDLURKLDSA-N 0.000 description 1
- JKGGPMOUIAAJAA-YEPSODPASA-N Thr-Gly-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O JKGGPMOUIAAJAA-YEPSODPASA-N 0.000 description 1
- NQVDGKYAUHTCME-QTKMDUPCSA-N Thr-His-Arg Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N)O NQVDGKYAUHTCME-QTKMDUPCSA-N 0.000 description 1
- IGGFFPOIFHZYKC-PBCZWWQYSA-N Thr-His-Asp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CC(=O)O)C(=O)O)N)O IGGFFPOIFHZYKC-PBCZWWQYSA-N 0.000 description 1
- YUPVPKZBKCLFLT-QTKMDUPCSA-N Thr-His-Val Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](C(C)C)C(=O)O)N)O YUPVPKZBKCLFLT-QTKMDUPCSA-N 0.000 description 1
- PAXANSWUSVPFNK-IUKAMOBKSA-N Thr-Ile-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H]([C@@H](C)O)N PAXANSWUSVPFNK-IUKAMOBKSA-N 0.000 description 1
- GMXIJHCBTZDAPD-QPHKQPEJSA-N Thr-Ile-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)NC(=O)[C@H]([C@@H](C)O)N GMXIJHCBTZDAPD-QPHKQPEJSA-N 0.000 description 1
- AHOLTQCAVBSUDP-PPCPHDFISA-N Thr-Ile-Lys Chemical compound CC[C@H](C)[C@H](NC(=O)[C@@H](N)[C@@H](C)O)C(=O)N[C@@H](CCCCN)C(O)=O AHOLTQCAVBSUDP-PPCPHDFISA-N 0.000 description 1
- GXUWHVZYDAHFSV-FLBSBUHZSA-N Thr-Ile-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GXUWHVZYDAHFSV-FLBSBUHZSA-N 0.000 description 1
- IHAPJUHCZXBPHR-WZLNRYEVSA-N Thr-Ile-Tyr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)[C@H]([C@@H](C)O)N IHAPJUHCZXBPHR-WZLNRYEVSA-N 0.000 description 1
- AMXMBCAXAZUCFA-RHYQMDGZSA-N Thr-Leu-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O AMXMBCAXAZUCFA-RHYQMDGZSA-N 0.000 description 1
- HOVLHEKTGVIKAP-WDCWCFNPSA-N Thr-Leu-Gln Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O HOVLHEKTGVIKAP-WDCWCFNPSA-N 0.000 description 1
- VTVVYQOXJCZVEB-WDCWCFNPSA-N Thr-Leu-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O VTVVYQOXJCZVEB-WDCWCFNPSA-N 0.000 description 1
- RFKVQLIXNVEOMB-WEDXCCLWSA-N Thr-Leu-Gly Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)NCC(=O)O)N)O RFKVQLIXNVEOMB-WEDXCCLWSA-N 0.000 description 1
- XIULAFZYEKSGAJ-IXOXFDKPSA-N Thr-Leu-His Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CNC=N1 XIULAFZYEKSGAJ-IXOXFDKPSA-N 0.000 description 1
- MEJHFIOYJHTWMK-VOAKCMCISA-N Thr-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)[C@@H](C)O MEJHFIOYJHTWMK-VOAKCMCISA-N 0.000 description 1
- MECLEFZMPPOEAC-VOAKCMCISA-N Thr-Leu-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)O)N)O MECLEFZMPPOEAC-VOAKCMCISA-N 0.000 description 1
- VRUFCJZQDACGLH-UVOCVTCTSA-N Thr-Leu-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O VRUFCJZQDACGLH-UVOCVTCTSA-N 0.000 description 1
- ZXIHABSKUITPTN-IXOXFDKPSA-N Thr-Lys-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N)O ZXIHABSKUITPTN-IXOXFDKPSA-N 0.000 description 1
- PCMDGXKXVMBIFP-VEVYYDQMSA-N Thr-Met-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(N)=O)C(O)=O PCMDGXKXVMBIFP-VEVYYDQMSA-N 0.000 description 1
- WYLAVUAWOUVUCA-XVSYOHENSA-N Thr-Phe-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O WYLAVUAWOUVUCA-XVSYOHENSA-N 0.000 description 1
- ABWNZPOIUJMNKT-IXOXFDKPSA-N Thr-Phe-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O ABWNZPOIUJMNKT-IXOXFDKPSA-N 0.000 description 1
- MUAFDCVOHYAFNG-RCWTZXSCSA-N Thr-Pro-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(O)=O MUAFDCVOHYAFNG-RCWTZXSCSA-N 0.000 description 1
- MXDOAJQRJBMGMO-FJXKBIBVSA-N Thr-Pro-Gly Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O MXDOAJQRJBMGMO-FJXKBIBVSA-N 0.000 description 1
- BDENGIGFTNYZSJ-RCWTZXSCSA-N Thr-Pro-Met Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCSC)C(O)=O BDENGIGFTNYZSJ-RCWTZXSCSA-N 0.000 description 1
- KERCOYANYUPLHJ-XGEHTFHBSA-N Thr-Pro-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O KERCOYANYUPLHJ-XGEHTFHBSA-N 0.000 description 1
- DOBIBIXIHJKVJF-XKBZYTNZSA-N Thr-Ser-Gln Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(N)=O DOBIBIXIHJKVJF-XKBZYTNZSA-N 0.000 description 1
- SGAOHNPSEPVAFP-ZDLURKLDSA-N Thr-Ser-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)NCC(O)=O SGAOHNPSEPVAFP-ZDLURKLDSA-N 0.000 description 1
- WKGAAMOJPMBBMC-IXOXFDKPSA-N Thr-Ser-Phe Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O WKGAAMOJPMBBMC-IXOXFDKPSA-N 0.000 description 1
- VUXIQSUQQYNLJP-XAVMHZPKSA-N Thr-Ser-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CO)C(=O)N1CCC[C@@H]1C(=O)O)N)O VUXIQSUQQYNLJP-XAVMHZPKSA-N 0.000 description 1
- WPSKTVVMQCXPRO-BWBBJGPYSA-N Thr-Ser-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O WPSKTVVMQCXPRO-BWBBJGPYSA-N 0.000 description 1
- RVMNUBQWPVOUKH-HEIBUPTGSA-N Thr-Ser-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O RVMNUBQWPVOUKH-HEIBUPTGSA-N 0.000 description 1
- QYDKSNXSBXZPFK-ZJDVBMNYSA-N Thr-Thr-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O QYDKSNXSBXZPFK-ZJDVBMNYSA-N 0.000 description 1
- UQCNIMDPYICBTR-KYNKHSRBSA-N Thr-Thr-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O UQCNIMDPYICBTR-KYNKHSRBSA-N 0.000 description 1
- BBPCSGKKPJUYRB-UVOCVTCTSA-N Thr-Thr-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O BBPCSGKKPJUYRB-UVOCVTCTSA-N 0.000 description 1
- CSNBWOJOEOPYIJ-UVOCVTCTSA-N Thr-Thr-Lys Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(O)=O CSNBWOJOEOPYIJ-UVOCVTCTSA-N 0.000 description 1
- QJIODPFLAASXJC-JHYOHUSXSA-N Thr-Thr-Phe Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N)O QJIODPFLAASXJC-JHYOHUSXSA-N 0.000 description 1
- ZESGVALRVJIVLZ-VFCFLDTKSA-N Thr-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@@H]1C(=O)O)N)O ZESGVALRVJIVLZ-VFCFLDTKSA-N 0.000 description 1
- ZMYCLHFLHRVOEA-HEIBUPTGSA-N Thr-Thr-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O ZMYCLHFLHRVOEA-HEIBUPTGSA-N 0.000 description 1
- COYHRQWNJDJCNA-NUJDXYNKSA-N Thr-Thr-Thr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O COYHRQWNJDJCNA-NUJDXYNKSA-N 0.000 description 1
- SOUPNXUJAJENFU-SWRJLBSHSA-N Thr-Trp-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N)O SOUPNXUJAJENFU-SWRJLBSHSA-N 0.000 description 1
- OGOYMQWIWHGTGH-KZVJFYERSA-N Thr-Val-Ala Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O OGOYMQWIWHGTGH-KZVJFYERSA-N 0.000 description 1
- PWONLXBUSVIZPH-RHYQMDGZSA-N Thr-Val-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCCN)C(=O)O)N)O PWONLXBUSVIZPH-RHYQMDGZSA-N 0.000 description 1
- SPIFGZFZMVLPHN-UNQGMJICSA-N Thr-Val-Phe Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O SPIFGZFZMVLPHN-UNQGMJICSA-N 0.000 description 1
- VYVBSMCZNHOZGD-RCWTZXSCSA-N Thr-Val-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C(C)C)C(O)=O VYVBSMCZNHOZGD-RCWTZXSCSA-N 0.000 description 1
- 229920004890 Triton X-100 Polymers 0.000 description 1
- 239000013504 Triton X-100 Substances 0.000 description 1
- AVYVKJMBNLPWRX-WFBYXXMGSA-N Trp-Ala-Ser Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O)=CNC2=C1 AVYVKJMBNLPWRX-WFBYXXMGSA-N 0.000 description 1
- SCQBNMKLZVCXNX-ZFWWWQNUSA-N Trp-Arg-Gly Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)NCC(=O)O)N SCQBNMKLZVCXNX-ZFWWWQNUSA-N 0.000 description 1
- LAIUAVGWZYTBKN-VHWLVUOQSA-N Trp-Asn-Ile Chemical compound CC[C@H](C)[C@H](NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)Cc1c[nH]c2ccccc12)C(O)=O LAIUAVGWZYTBKN-VHWLVUOQSA-N 0.000 description 1
- PMIJXCLOQFMOKZ-BPUTZDHNSA-N Trp-Asp-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N PMIJXCLOQFMOKZ-BPUTZDHNSA-N 0.000 description 1
- LTLBNCDNXQCOLB-UBHSHLNASA-N Trp-Asp-Ser Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O)=CNC2=C1 LTLBNCDNXQCOLB-UBHSHLNASA-N 0.000 description 1
- UDCHKDYNMRJYMI-QEJZJMRPSA-N Trp-Glu-Ser Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O UDCHKDYNMRJYMI-QEJZJMRPSA-N 0.000 description 1
- HQJOVVWAPQPYDS-ZFWWWQNUSA-N Trp-Gly-Arg Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)NCC(=O)N[C@@H](CCCNC(N)=N)C(O)=O HQJOVVWAPQPYDS-ZFWWWQNUSA-N 0.000 description 1
- OZUJUVFWMHTWCZ-HOCLYGCPSA-N Trp-Gly-His Chemical compound N[C@@H](Cc1c[nH]c2ccccc12)C(=O)NCC(=O)N[C@@H](Cc1cnc[nH]1)C(O)=O OZUJUVFWMHTWCZ-HOCLYGCPSA-N 0.000 description 1
- OTWIOROMZLNAQC-XIRDDKMYSA-N Trp-His-Asp Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(O)=O)C(O)=O OTWIOROMZLNAQC-XIRDDKMYSA-N 0.000 description 1
- CXPJPTFWKXNDKV-NUTKFTJISA-N Trp-Leu-Ala Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O)=CNC2=C1 CXPJPTFWKXNDKV-NUTKFTJISA-N 0.000 description 1
- OFTGYORHQMSPAI-PJODQICGSA-N Trp-Met-Ala Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C)C(O)=O OFTGYORHQMSPAI-PJODQICGSA-N 0.000 description 1
- HJWLQSFTGDQSRX-BPUTZDHNSA-N Trp-Met-Ser Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CO)C(O)=O HJWLQSFTGDQSRX-BPUTZDHNSA-N 0.000 description 1
- OTJDEIZGUFRGLL-WIRXVTQYSA-N Trp-Phe-Trp Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CNC3=CC=CC=C32)C(=O)O)NC(=O)[C@H](CC4=CNC5=CC=CC=C54)N OTJDEIZGUFRGLL-WIRXVTQYSA-N 0.000 description 1
- XOLLWQIBBLBAHQ-WDSOQIARSA-N Trp-Pro-Leu Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O XOLLWQIBBLBAHQ-WDSOQIARSA-N 0.000 description 1
- UIRPULWLRODAEQ-QEJZJMRPSA-N Trp-Ser-Glu Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(O)=O)=CNC2=C1 UIRPULWLRODAEQ-QEJZJMRPSA-N 0.000 description 1
- YCQXZDHDSUHUSG-FJHTZYQYSA-N Trp-Thr-Ala Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](C)C(O)=O)=CNC2=C1 YCQXZDHDSUHUSG-FJHTZYQYSA-N 0.000 description 1
- MXKUGFHWYYKVDV-SZMVWBNQSA-N Trp-Val-Val Chemical compound CC(C)[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)Cc1c[nH]c2ccccc12)C(C)C)C(O)=O MXKUGFHWYYKVDV-SZMVWBNQSA-N 0.000 description 1
- VCXWRWYFJLXITF-AUTRQRHGSA-N Tyr-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 VCXWRWYFJLXITF-AUTRQRHGSA-N 0.000 description 1
- NSOMQRHZMJMZIE-GVARAGBVSA-N Tyr-Ala-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 NSOMQRHZMJMZIE-GVARAGBVSA-N 0.000 description 1
- AKFLVKKWVZMFOT-IHRRRGAJSA-N Tyr-Arg-Asn Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(O)=O AKFLVKKWVZMFOT-IHRRRGAJSA-N 0.000 description 1
- MICSYKFECRFCTJ-IHRRRGAJSA-N Tyr-Arg-Asp Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N)O MICSYKFECRFCTJ-IHRRRGAJSA-N 0.000 description 1
- HKIUVWMZYFBIHG-KKUMJFAQSA-N Tyr-Arg-Gln Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N)O HKIUVWMZYFBIHG-KKUMJFAQSA-N 0.000 description 1
- QYSBJAUCUKHSLU-JYJNAYRXSA-N Tyr-Arg-Val Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(O)=O QYSBJAUCUKHSLU-JYJNAYRXSA-N 0.000 description 1
- MTEQZJFSEMXXRK-CFMVVWHZSA-N Tyr-Asn-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC1=CC=C(C=C1)O)N MTEQZJFSEMXXRK-CFMVVWHZSA-N 0.000 description 1
- BVWADTBVGZHSLW-IHRRRGAJSA-N Tyr-Asn-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC1=CC=C(C=C1)O)N BVWADTBVGZHSLW-IHRRRGAJSA-N 0.000 description 1
- VTFWAGGJDRSQFG-MELADBBJSA-N Tyr-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC2=CC=C(C=C2)O)N)C(=O)O VTFWAGGJDRSQFG-MELADBBJSA-N 0.000 description 1
- JWHOIHCOHMZSAR-QWRGUYRKSA-N Tyr-Asp-Gly Chemical compound OC(=O)CNC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 JWHOIHCOHMZSAR-QWRGUYRKSA-N 0.000 description 1
- VFJIWSJKZJTQII-SRVKXCTJSA-N Tyr-Asp-Ser Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O VFJIWSJKZJTQII-SRVKXCTJSA-N 0.000 description 1
- XKDOQXAXKFQWQJ-SRVKXCTJSA-N Tyr-Cys-Asp Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(=O)O)C(=O)O)N)O XKDOQXAXKFQWQJ-SRVKXCTJSA-N 0.000 description 1
- FQNUWOHNGJWNLM-QWRGUYRKSA-N Tyr-Cys-Gly Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CS)C(=O)NCC(O)=O FQNUWOHNGJWNLM-QWRGUYRKSA-N 0.000 description 1
- WEFIPBYPXZYPHD-HJPIBITLSA-N Tyr-Cys-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CC1=CC=C(C=C1)O)N WEFIPBYPXZYPHD-HJPIBITLSA-N 0.000 description 1
- WZQZUVWEPMGIMM-JYJNAYRXSA-N Tyr-Gln-Lys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N)O WZQZUVWEPMGIMM-JYJNAYRXSA-N 0.000 description 1
- FXYOYUMPUJONGW-FHWLQOOXSA-N Tyr-Gln-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=C(O)C=C1 FXYOYUMPUJONGW-FHWLQOOXSA-N 0.000 description 1
- CKHQKYHIZCRTAP-SOUVJXGZSA-N Tyr-Gln-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC2=CC=C(C=C2)O)N)C(=O)O CKHQKYHIZCRTAP-SOUVJXGZSA-N 0.000 description 1
- XQYHLZNPOTXRMQ-KKUMJFAQSA-N Tyr-Glu-Arg Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O XQYHLZNPOTXRMQ-KKUMJFAQSA-N 0.000 description 1
- PMDWYLVWHRTJIW-STQMWFEESA-N Tyr-Gly-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC1=CC=C(O)C=C1 PMDWYLVWHRTJIW-STQMWFEESA-N 0.000 description 1
- AKLNEFNQWLHIGY-QWRGUYRKSA-N Tyr-Gly-Asp Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)NCC(=O)N[C@@H](CC(=O)O)C(=O)O)N)O AKLNEFNQWLHIGY-QWRGUYRKSA-N 0.000 description 1
- OLWFDNLLBWQWCP-STQMWFEESA-N Tyr-Gly-Met Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(=O)N[C@@H](CCSC)C(O)=O OLWFDNLLBWQWCP-STQMWFEESA-N 0.000 description 1
- ADECJAKCRKPSOR-ULQDDVLXSA-N Tyr-His-Arg Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N)O ADECJAKCRKPSOR-ULQDDVLXSA-N 0.000 description 1
- JHORGUYURUBVOM-KKUMJFAQSA-N Tyr-His-Ser Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(O)=O JHORGUYURUBVOM-KKUMJFAQSA-N 0.000 description 1
- ILTXFANLDMJWPR-SIUGBPQLSA-N Tyr-Ile-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N ILTXFANLDMJWPR-SIUGBPQLSA-N 0.000 description 1
- YMUQBRQQCPQEQN-CXTHYWKRSA-N Tyr-Ile-Tyr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)N YMUQBRQQCPQEQN-CXTHYWKRSA-N 0.000 description 1
- GULIUBBXCYPDJU-CQDKDKBSSA-N Tyr-Leu-Ala Chemical compound [O-]C(=O)[C@H](C)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H]([NH3+])CC1=CC=C(O)C=C1 GULIUBBXCYPDJU-CQDKDKBSSA-N 0.000 description 1
- KSCVLGXNQXKUAR-JYJNAYRXSA-N Tyr-Leu-Glu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O KSCVLGXNQXKUAR-JYJNAYRXSA-N 0.000 description 1
- DWAMXBFJNZIHMC-KBPBESRZSA-N Tyr-Leu-Gly Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O DWAMXBFJNZIHMC-KBPBESRZSA-N 0.000 description 1
- CDKZJGMPZHPAJC-ULQDDVLXSA-N Tyr-Leu-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 CDKZJGMPZHPAJC-ULQDDVLXSA-N 0.000 description 1
- MXFPBNFKVBHIRW-BZSNNMDCSA-N Tyr-Lys-His Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N)O MXFPBNFKVBHIRW-BZSNNMDCSA-N 0.000 description 1
- FMXFHNSFABRVFZ-BZSNNMDCSA-N Tyr-Lys-Leu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O FMXFHNSFABRVFZ-BZSNNMDCSA-N 0.000 description 1
- OGPKMBOPMDTEDM-IHRRRGAJSA-N Tyr-Met-Ser Chemical compound CSCC[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N OGPKMBOPMDTEDM-IHRRRGAJSA-N 0.000 description 1
- OKDNSNWJEXAMSU-IRXDYDNUSA-N Tyr-Phe-Gly Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(=O)NCC(O)=O)C1=CC=C(O)C=C1 OKDNSNWJEXAMSU-IRXDYDNUSA-N 0.000 description 1
- FASACHWGQBNSRO-ZEWNOJEFSA-N Tyr-Phe-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](CC2=CC=C(C=C2)O)N FASACHWGQBNSRO-ZEWNOJEFSA-N 0.000 description 1
- SCZJKZLFSSPJDP-ACRUOGEOSA-N Tyr-Phe-Leu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O SCZJKZLFSSPJDP-ACRUOGEOSA-N 0.000 description 1
- PHKQVWWHRYUCJL-HJOGWXRNSA-N Tyr-Phe-Tyr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O PHKQVWWHRYUCJL-HJOGWXRNSA-N 0.000 description 1
- CDBXVDXSLPLFMD-BPNCWPANSA-N Tyr-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CC1=CC=C(O)C=C1 CDBXVDXSLPLFMD-BPNCWPANSA-N 0.000 description 1
- XJPXTYLVMUZGNW-IHRRRGAJSA-N Tyr-Pro-Asp Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O XJPXTYLVMUZGNW-IHRRRGAJSA-N 0.000 description 1
- VYQQQIRHIFALGE-UWJYBYFXSA-N Tyr-Ser-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 VYQQQIRHIFALGE-UWJYBYFXSA-N 0.000 description 1
- QFXVAFIHVWXXBJ-AVGNSLFASA-N Tyr-Ser-Glu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(O)=O QFXVAFIHVWXXBJ-AVGNSLFASA-N 0.000 description 1
- UMSZZGTXGKHTFJ-SRVKXCTJSA-N Tyr-Ser-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 UMSZZGTXGKHTFJ-SRVKXCTJSA-N 0.000 description 1
- LUMQYLVYUIRHHU-YJRXYDGGSA-N Tyr-Ser-Thr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LUMQYLVYUIRHHU-YJRXYDGGSA-N 0.000 description 1
- TYFLVOUZHQUBGM-IHRRRGAJSA-N Tyr-Ser-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 TYFLVOUZHQUBGM-IHRRRGAJSA-N 0.000 description 1
- LVFZXRQQQDTBQH-IRIUXVKKSA-N Tyr-Thr-Glu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O LVFZXRQQQDTBQH-IRIUXVKKSA-N 0.000 description 1
- GPLTZEMVOCZVAV-UFYCRDLUSA-N Tyr-Tyr-Arg Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O)C1=CC=C(O)C=C1 GPLTZEMVOCZVAV-UFYCRDLUSA-N 0.000 description 1
- JQOMHZMWQHXALX-FHWLQOOXSA-N Tyr-Tyr-Glu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O JQOMHZMWQHXALX-FHWLQOOXSA-N 0.000 description 1
- QVYFTFIBKCDHIE-ACRUOGEOSA-N Tyr-Tyr-Lys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)N[C@@H](CCCCN)C(=O)O)N)O QVYFTFIBKCDHIE-ACRUOGEOSA-N 0.000 description 1
- MJUTYRIMFIICKL-JYJNAYRXSA-N Tyr-Val-Arg Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O MJUTYRIMFIICKL-JYJNAYRXSA-N 0.000 description 1
- NVJCMGGZHOJNBU-UFYCRDLUSA-N Tyr-Val-Phe Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)N NVJCMGGZHOJNBU-UFYCRDLUSA-N 0.000 description 1
- YKBUNNNRNZZUID-UFYCRDLUSA-N Tyr-Val-Tyr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O YKBUNNNRNZZUID-UFYCRDLUSA-N 0.000 description 1
- 101150100826 UL40 gene Proteins 0.000 description 1
- 101710095001 Uncharacterized protein in nifU 5'region Proteins 0.000 description 1
- FZSPNKUFROZBSG-ZKWXMUAHSA-N Val-Ala-Asp Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC(O)=O FZSPNKUFROZBSG-ZKWXMUAHSA-N 0.000 description 1
- WOCYUGQDXPTQPY-FXQIFTODSA-N Val-Ala-Cys Chemical compound C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](C(C)C)N WOCYUGQDXPTQPY-FXQIFTODSA-N 0.000 description 1
- IZFVRRYRMQFVGX-NRPADANISA-N Val-Ala-Gln Chemical compound C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](C(C)C)N IZFVRRYRMQFVGX-NRPADANISA-N 0.000 description 1
- ZLFHAAGHGQBQQN-GUBZILKMSA-N Val-Ala-Pro Natural products CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N1CCC[C@H]1C(O)=O ZLFHAAGHGQBQQN-GUBZILKMSA-N 0.000 description 1
- ZLFHAAGHGQBQQN-AEJSXWLSSA-N Val-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](C(C)C)N ZLFHAAGHGQBQQN-AEJSXWLSSA-N 0.000 description 1
- KKHRWGYHBZORMQ-NHCYSSNCSA-N Val-Arg-Glu Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N KKHRWGYHBZORMQ-NHCYSSNCSA-N 0.000 description 1
- COYSIHFOCOMGCF-WPRPVWTQSA-N Val-Arg-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CCCN=C(N)N COYSIHFOCOMGCF-WPRPVWTQSA-N 0.000 description 1
- VMRFIKXKOFNMHW-GUBZILKMSA-N Val-Arg-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CO)C(=O)O)N VMRFIKXKOFNMHW-GUBZILKMSA-N 0.000 description 1
- UDNYEPLJTRDMEJ-RCOVLWMOSA-N Val-Asn-Gly Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)NCC(=O)O)N UDNYEPLJTRDMEJ-RCOVLWMOSA-N 0.000 description 1
- ISERLACIZUGCDX-ZKWXMUAHSA-N Val-Asp-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](C(C)C)N ISERLACIZUGCDX-ZKWXMUAHSA-N 0.000 description 1
- XQVRMLRMTAGSFJ-QXEWZRGKSA-N Val-Asp-Arg Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N XQVRMLRMTAGSFJ-QXEWZRGKSA-N 0.000 description 1
- HZYOWMGWKKRMBZ-BYULHYEWSA-N Val-Asp-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N HZYOWMGWKKRMBZ-BYULHYEWSA-N 0.000 description 1
- VUTHNLMCXKLLFI-LAEOZQHASA-N Val-Asp-Gln Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N VUTHNLMCXKLLFI-LAEOZQHASA-N 0.000 description 1
- QHDXUYOYTPWCSK-RCOVLWMOSA-N Val-Asp-Gly Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)NCC(=O)O)N QHDXUYOYTPWCSK-RCOVLWMOSA-N 0.000 description 1
- YLHLNFUXDBOAGX-DCAQKATOSA-N Val-Cys-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N YLHLNFUXDBOAGX-DCAQKATOSA-N 0.000 description 1
- FBVUOEYVGNMRMD-NAKRPEOUSA-N Val-Cys-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](C(C)C)N FBVUOEYVGNMRMD-NAKRPEOUSA-N 0.000 description 1
- XXDVDTMEVBYRPK-XPUUQOCRSA-N Val-Gln Chemical compound CC(C)[C@H](N)C(=O)N[C@H](C(O)=O)CCC(N)=O XXDVDTMEVBYRPK-XPUUQOCRSA-N 0.000 description 1
- HURRXSNHCCSJHA-AUTRQRHGSA-N Val-Gln-Gln Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N HURRXSNHCCSJHA-AUTRQRHGSA-N 0.000 description 1
- IWZYXFRGWKEKBJ-GVXVVHGQSA-N Val-Gln-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N IWZYXFRGWKEKBJ-GVXVVHGQSA-N 0.000 description 1
- XGJLNBNZNMVJRS-NRPADANISA-N Val-Glu-Ala Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O XGJLNBNZNMVJRS-NRPADANISA-N 0.000 description 1
- BRPKEERLGYNCNC-NHCYSSNCSA-N Val-Glu-Arg Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N BRPKEERLGYNCNC-NHCYSSNCSA-N 0.000 description 1
- GBESYURLQOYWLU-LAEOZQHASA-N Val-Glu-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N GBESYURLQOYWLU-LAEOZQHASA-N 0.000 description 1
- VLDMQVZZWDOKQF-AUTRQRHGSA-N Val-Glu-Gln Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N VLDMQVZZWDOKQF-AUTRQRHGSA-N 0.000 description 1
- SZTTYWIUCGSURQ-AUTRQRHGSA-N Val-Glu-Glu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O SZTTYWIUCGSURQ-AUTRQRHGSA-N 0.000 description 1
- VVZDBPBZHLQPPB-XVKPBYJWSA-N Val-Glu-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O VVZDBPBZHLQPPB-XVKPBYJWSA-N 0.000 description 1
- ROLGIBMFNMZANA-GVXVVHGQSA-N Val-Glu-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](C(C)C)N ROLGIBMFNMZANA-GVXVVHGQSA-N 0.000 description 1
- FOADDSDHGRFUOC-DZKIICNBSA-N Val-Glu-Phe Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N FOADDSDHGRFUOC-DZKIICNBSA-N 0.000 description 1
- DJEVQCWNMQOABE-RCOVLWMOSA-N Val-Gly-Asp Chemical compound CC(C)[C@@H](C(=O)NCC(=O)N[C@@H](CC(=O)O)C(=O)O)N DJEVQCWNMQOABE-RCOVLWMOSA-N 0.000 description 1
- MDYSKHBSPXUOPV-JSGCOSHPSA-N Val-Gly-Phe Chemical compound CC(C)[C@@H](C(=O)NCC(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N MDYSKHBSPXUOPV-JSGCOSHPSA-N 0.000 description 1
- KZKMBGXCNLPYKD-YEPSODPASA-N Val-Gly-Thr Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O KZKMBGXCNLPYKD-YEPSODPASA-N 0.000 description 1
- FEFZWCSXEMVSPO-LSJOCFKGSA-N Val-His-Ala Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](Cc1cnc[nH]1)C(=O)N[C@@H](C)C(O)=O FEFZWCSXEMVSPO-LSJOCFKGSA-N 0.000 description 1
- DHINLYMWMXQGMQ-IHRRRGAJSA-N Val-His-His Chemical compound C([C@H](NC(=O)[C@@H](N)C(C)C)C(=O)N[C@@H](CC=1NC=NC=1)C(O)=O)C1=CN=CN1 DHINLYMWMXQGMQ-IHRRRGAJSA-N 0.000 description 1
- CPGJELLYDQEDRK-NAKRPEOUSA-N Val-Ile-Ala Chemical compound CC[C@H](C)[C@H](NC(=O)[C@@H](N)C(C)C)C(=O)N[C@@H](C)C(O)=O CPGJELLYDQEDRK-NAKRPEOUSA-N 0.000 description 1
- WNZSAUMKZQXHNC-UKJIMTQDSA-N Val-Ile-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](C(C)C)N WNZSAUMKZQXHNC-UKJIMTQDSA-N 0.000 description 1
- UKEVLVBHRKWECS-LSJOCFKGSA-N Val-Ile-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](C(C)C)N UKEVLVBHRKWECS-LSJOCFKGSA-N 0.000 description 1
- VHRLUTIMTDOVCG-PEDHHIEDSA-N Val-Ile-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)NC(=O)[C@H](C(C)C)N VHRLUTIMTDOVCG-PEDHHIEDSA-N 0.000 description 1
- JZWZACGUZVCQPS-RNJOBUHISA-N Val-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](C(C)C)N JZWZACGUZVCQPS-RNJOBUHISA-N 0.000 description 1
- OTJMMKPMLUNTQT-AVGNSLFASA-N Val-Leu-Arg Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](C(C)C)N OTJMMKPMLUNTQT-AVGNSLFASA-N 0.000 description 1
- FEXILLGKGGTLRI-NHCYSSNCSA-N Val-Leu-Asn Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](C(C)C)N FEXILLGKGGTLRI-NHCYSSNCSA-N 0.000 description 1
- BMOFUVHDBROBSE-DCAQKATOSA-N Val-Leu-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](C(C)C)N BMOFUVHDBROBSE-DCAQKATOSA-N 0.000 description 1
- HGJRMXOWUWVUOA-GVXVVHGQSA-N Val-Leu-Gln Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](C(C)C)N HGJRMXOWUWVUOA-GVXVVHGQSA-N 0.000 description 1
- XTDDIVQWDXMRJL-IHRRRGAJSA-N Val-Leu-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](C(C)C)N XTDDIVQWDXMRJL-IHRRRGAJSA-N 0.000 description 1
- AEMPCGRFEZTWIF-IHRRRGAJSA-N Val-Leu-Lys Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(O)=O AEMPCGRFEZTWIF-IHRRRGAJSA-N 0.000 description 1
- SYSWVVCYSXBVJG-RHYQMDGZSA-N Val-Leu-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](C(C)C)N)O SYSWVVCYSXBVJG-RHYQMDGZSA-N 0.000 description 1
- RWOGENDAOGMHLX-DCAQKATOSA-N Val-Lys-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](C(C)C)N RWOGENDAOGMHLX-DCAQKATOSA-N 0.000 description 1
- JVGHIFMSFBZDHH-WPRPVWTQSA-N Val-Met-Gly Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)NCC(=O)O)N JVGHIFMSFBZDHH-WPRPVWTQSA-N 0.000 description 1
- RQOMPQGUGBILAG-AVGNSLFASA-N Val-Met-Leu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(O)=O RQOMPQGUGBILAG-AVGNSLFASA-N 0.000 description 1
- RSGHLMMKXJGCMK-JYJNAYRXSA-N Val-Met-Phe Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N RSGHLMMKXJGCMK-JYJNAYRXSA-N 0.000 description 1
- YDVDTCJGBBJGRT-GUBZILKMSA-N Val-Met-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CO)C(=O)O)N YDVDTCJGBBJGRT-GUBZILKMSA-N 0.000 description 1
- MJFSRZZJQWZHFQ-SRVKXCTJSA-N Val-Met-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C(C)C)C(=O)O)N MJFSRZZJQWZHFQ-SRVKXCTJSA-N 0.000 description 1
- LJSZPMSUYKKKCP-UBHSHLNASA-N Val-Phe-Ala Chemical compound CC(C)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C)C(O)=O)CC1=CC=CC=C1 LJSZPMSUYKKKCP-UBHSHLNASA-N 0.000 description 1
- VNGKMNPAENRGDC-JYJNAYRXSA-N Val-Phe-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C(C)C)CC1=CC=CC=C1 VNGKMNPAENRGDC-JYJNAYRXSA-N 0.000 description 1
- NZGOVKLVQNOEKP-YDHLFZDLSA-N Val-Phe-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(=O)N)C(=O)O)N NZGOVKLVQNOEKP-YDHLFZDLSA-N 0.000 description 1
- WMRWZYSRQUORHJ-YDHLFZDLSA-N Val-Phe-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(=O)O)C(=O)O)N WMRWZYSRQUORHJ-YDHLFZDLSA-N 0.000 description 1
- UZFNHAXYMICTBU-DZKIICNBSA-N Val-Phe-Gln Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N UZFNHAXYMICTBU-DZKIICNBSA-N 0.000 description 1
- CKTMJBPRVQWPHU-JSGCOSHPSA-N Val-Phe-Gly Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)NCC(=O)O)N CKTMJBPRVQWPHU-JSGCOSHPSA-N 0.000 description 1
- HJSLDXZAZGFPDK-ULQDDVLXSA-N Val-Phe-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](C(C)C)N HJSLDXZAZGFPDK-ULQDDVLXSA-N 0.000 description 1
- VCIYTVOBLZHFSC-XHSDSOJGSA-N Val-Phe-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N2CCC[C@@H]2C(=O)O)N VCIYTVOBLZHFSC-XHSDSOJGSA-N 0.000 description 1
- YTNGABPUXFEOGU-SRVKXCTJSA-N Val-Pro-Arg Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCN=C(N)N)C(O)=O YTNGABPUXFEOGU-SRVKXCTJSA-N 0.000 description 1
- HPOSMQWRPMRMFO-GUBZILKMSA-N Val-Pro-Cys Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CS)C(=O)O)N HPOSMQWRPMRMFO-GUBZILKMSA-N 0.000 description 1
- RYQUMYBMOJYYDK-NHCYSSNCSA-N Val-Pro-Glu Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(=O)O)C(=O)O)N RYQUMYBMOJYYDK-NHCYSSNCSA-N 0.000 description 1
- SJRUJQFQVLMZFW-WPRPVWTQSA-N Val-Pro-Gly Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O SJRUJQFQVLMZFW-WPRPVWTQSA-N 0.000 description 1
- NHXZRXLFOBFMDM-AVGNSLFASA-N Val-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)C(C)C NHXZRXLFOBFMDM-AVGNSLFASA-N 0.000 description 1
- MIKHIIQMRFYVOR-RCWTZXSCSA-N Val-Pro-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](C(C)C)N)O MIKHIIQMRFYVOR-RCWTZXSCSA-N 0.000 description 1
- QSPOLEBZTMESFY-SRVKXCTJSA-N Val-Pro-Val Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O QSPOLEBZTMESFY-SRVKXCTJSA-N 0.000 description 1
- DEGUERSKQBRZMZ-FXQIFTODSA-N Val-Ser-Ala Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O DEGUERSKQBRZMZ-FXQIFTODSA-N 0.000 description 1
- LTTQCQRTSHJPPL-ZKWXMUAHSA-N Val-Ser-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(=O)O)C(=O)O)N LTTQCQRTSHJPPL-ZKWXMUAHSA-N 0.000 description 1
- RYHUIHUOYRNNIE-NRPADANISA-N Val-Ser-Gln Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N RYHUIHUOYRNNIE-NRPADANISA-N 0.000 description 1
- UGFMVXRXULGLNO-XPUUQOCRSA-N Val-Ser-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O UGFMVXRXULGLNO-XPUUQOCRSA-N 0.000 description 1
- KRAHMIJVUPUOTQ-DCAQKATOSA-N Val-Ser-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N KRAHMIJVUPUOTQ-DCAQKATOSA-N 0.000 description 1
- QZKVWWIUSQGWMY-IHRRRGAJSA-N Val-Ser-Phe Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 QZKVWWIUSQGWMY-IHRRRGAJSA-N 0.000 description 1
- GBIUHAYJGWVNLN-UHFFFAOYSA-N Val-Ser-Pro Natural products CC(C)C(N)C(=O)NC(CO)C(=O)N1CCCC1C(O)=O GBIUHAYJGWVNLN-UHFFFAOYSA-N 0.000 description 1
- GBIUHAYJGWVNLN-AEJSXWLSSA-N Val-Ser-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N1CCC[C@@H]1C(=O)O)N GBIUHAYJGWVNLN-AEJSXWLSSA-N 0.000 description 1
- PZTZYZUTCPZWJH-FXQIFTODSA-N Val-Ser-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)O)N PZTZYZUTCPZWJH-FXQIFTODSA-N 0.000 description 1
- UVHFONIHVHLDDQ-IFFSRLJSSA-N Val-Thr-Glu Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N)O UVHFONIHVHLDDQ-IFFSRLJSSA-N 0.000 description 1
- USXYVSTVPHELAF-RCWTZXSCSA-N Val-Thr-Met Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](C(C)C)N)O USXYVSTVPHELAF-RCWTZXSCSA-N 0.000 description 1
- DVLWZWNAQUBZBC-ZNSHCXBVSA-N Val-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](C(C)C)N)O DVLWZWNAQUBZBC-ZNSHCXBVSA-N 0.000 description 1
- OFTXTCGQJXTNQS-XGEHTFHBSA-N Val-Thr-Ser Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](C(C)C)N)O OFTXTCGQJXTNQS-XGEHTFHBSA-N 0.000 description 1
- JAIZPWVHPQRYOU-ZJDVBMNYSA-N Val-Thr-Thr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](C(C)C)N)O JAIZPWVHPQRYOU-ZJDVBMNYSA-N 0.000 description 1
- NGXQOQNXSGOYOI-BQFCYCMXSA-N Val-Trp-Gln Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](N)C(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O)=CNC2=C1 NGXQOQNXSGOYOI-BQFCYCMXSA-N 0.000 description 1
- POFQRHFHYPSCOI-FHWLQOOXSA-N Val-Trp-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CC3=CN=CN3)C(=O)O)N POFQRHFHYPSCOI-FHWLQOOXSA-N 0.000 description 1
- DOBHJKVVACOQTN-DZKIICNBSA-N Val-Tyr-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C(C)C)CC1=CC=C(O)C=C1 DOBHJKVVACOQTN-DZKIICNBSA-N 0.000 description 1
- PMKQKNBISAOSRI-XHSDSOJGSA-N Val-Tyr-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N2CCC[C@@H]2C(=O)O)N PMKQKNBISAOSRI-XHSDSOJGSA-N 0.000 description 1
- OWFGFHQMSBTKLX-UFYCRDLUSA-N Val-Tyr-Tyr Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O)N OWFGFHQMSBTKLX-UFYCRDLUSA-N 0.000 description 1
- DFQZDQPLWBSFEJ-LSJOCFKGSA-N Val-Val-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(=O)N)C(=O)O)N DFQZDQPLWBSFEJ-LSJOCFKGSA-N 0.000 description 1
- ZHWZDZFWBXWPDW-GUBZILKMSA-N Val-Val-Cys Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CS)C(O)=O ZHWZDZFWBXWPDW-GUBZILKMSA-N 0.000 description 1
- XNLUVJPMPAZHCY-JYJNAYRXSA-N Val-Val-Phe Chemical compound CC(C)[C@H]([NH3+])C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C([O-])=O)CC1=CC=CC=C1 XNLUVJPMPAZHCY-JYJNAYRXSA-N 0.000 description 1
- JSOXWWFKRJKTMT-WOPDTQHZSA-N Val-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N JSOXWWFKRJKTMT-WOPDTQHZSA-N 0.000 description 1
- 108700005077 Viral Genes Proteins 0.000 description 1
- 208000036142 Viral infection Diseases 0.000 description 1
- CUJRVFIICFDLGR-UHFFFAOYSA-N acetylacetonate Chemical compound CC(=O)[CH-]C(C)=O CUJRVFIICFDLGR-UHFFFAOYSA-N 0.000 description 1
- 230000003213 activating effect Effects 0.000 description 1
- 239000013543 active substance Substances 0.000 description 1
- 229960005305 adenosine Drugs 0.000 description 1
- 108010008685 alanyl-glutamyl-aspartic acid Proteins 0.000 description 1
- 108010039538 alanyl-glycyl-aspartyl-valine Proteins 0.000 description 1
- 108010045350 alanyl-tyrosyl-alanine Proteins 0.000 description 1
- HMFHBZSHGGEWLO-UHFFFAOYSA-N alpha-D-Furanose-Ribose Natural products OCC1OC(O)C(O)C1O HMFHBZSHGGEWLO-UHFFFAOYSA-N 0.000 description 1
- 108010050025 alpha-glutamyltryptophan Proteins 0.000 description 1
- 125000000539 amino acid group Chemical group 0.000 description 1
- 238000004458 analytical method Methods 0.000 description 1
- 230000007416 antiviral immune response Effects 0.000 description 1
- PYMYPHUHKUWMLA-WDCZJNDASA-N arabinose Chemical class OC[C@@H](O)[C@@H](O)[C@H](O)C=O PYMYPHUHKUWMLA-WDCZJNDASA-N 0.000 description 1
- PYMYPHUHKUWMLA-UHFFFAOYSA-N arabinose Natural products OCC(O)C(O)C(O)C=O PYMYPHUHKUWMLA-UHFFFAOYSA-N 0.000 description 1
- 108010080488 arginyl-arginyl-leucine Proteins 0.000 description 1
- 108010009111 arginyl-glycyl-glutamic acid Proteins 0.000 description 1
- 108010069926 arginyl-glycyl-serine Proteins 0.000 description 1
- 108010089442 arginyl-leucyl-alanyl-arginine Proteins 0.000 description 1
- 108010043240 arginyl-leucyl-glycine Proteins 0.000 description 1
- 108010059459 arginyl-threonyl-phenylalanine Proteins 0.000 description 1
- 108010077245 asparaginyl-proline Proteins 0.000 description 1
- 108010027234 aspartyl-glycyl-glutamyl-alanine Proteins 0.000 description 1
- 108010069205 aspartyl-phenylalanine Proteins 0.000 description 1
- 108010093581 aspartyl-proline Proteins 0.000 description 1
- SRBFZHDQGSBBOR-UHFFFAOYSA-N beta-D-Pyranose-Lyxose Natural products OC1COC(O)C(O)C1O SRBFZHDQGSBBOR-UHFFFAOYSA-N 0.000 description 1
- IQFYYKKMVGJFEH-UHFFFAOYSA-N beta-L-thymidine Natural products O=C1NC(=O)C(C)=CN1C1OC(CO)C(O)C1 IQFYYKKMVGJFEH-UHFFFAOYSA-N 0.000 description 1
- DRTQHJPVMGBUCF-PSQAKQOGSA-N beta-L-uridine Natural products O[C@H]1[C@@H](O)[C@H](CO)O[C@@H]1N1C(=O)NC(=O)C=C1 DRTQHJPVMGBUCF-PSQAKQOGSA-N 0.000 description 1
- 229960000074 biopharmaceutical Drugs 0.000 description 1
- 230000015572 biosynthetic process Effects 0.000 description 1
- 230000025084 cell cycle arrest Effects 0.000 description 1
- 239000013592 cell lysate Substances 0.000 description 1
- 230000001413 cellular effect Effects 0.000 description 1
- 238000006243 chemical reaction Methods 0.000 description 1
- 210000000349 chromosome Anatomy 0.000 description 1
- 238000012761 co-transfection Methods 0.000 description 1
- 238000007796 conventional method Methods 0.000 description 1
- 108010016616 cysteinylglycine Proteins 0.000 description 1
- 108010069495 cysteinyltyrosine Proteins 0.000 description 1
- UHDGCWIWMRVCDJ-ZAKLUEHWSA-N cytidine Chemical compound O=C1N=C(N)C=CN1[C@H]1[C@H](O)[C@@H](O)[C@H](CO)O1 UHDGCWIWMRVCDJ-ZAKLUEHWSA-N 0.000 description 1
- 230000009089 cytolysis Effects 0.000 description 1
- 231100000433 cytotoxic Toxicity 0.000 description 1
- 230000001472 cytotoxic effect Effects 0.000 description 1
- 238000011161 development Methods 0.000 description 1
- 229910003460 diamond Inorganic materials 0.000 description 1
- 239000010432 diamond Substances 0.000 description 1
- 239000012470 diluted sample Substances 0.000 description 1
- 238000011143 downstream manufacturing Methods 0.000 description 1
- 230000002708 enhancing effect Effects 0.000 description 1
- 238000006911 enzymatic reaction Methods 0.000 description 1
- 239000013604 expression vector Substances 0.000 description 1
- 239000012467 final product Substances 0.000 description 1
- 238000001502 gel electrophoresis Methods 0.000 description 1
- 208000016361 genetic disease Diseases 0.000 description 1
- 108010085059 glutamyl-arginyl-proline Proteins 0.000 description 1
- 108010013768 glutamyl-aspartyl-proline Proteins 0.000 description 1
- 108010037389 glutamyl-cysteinyl-lysine Proteins 0.000 description 1
- 108010008237 glutamyl-valyl-glycine Proteins 0.000 description 1
- 108010073628 glutamyl-valyl-phenylalanine Proteins 0.000 description 1
- 150000004676 glycans Chemical class 0.000 description 1
- 108010075431 glycyl-alanyl-phenylalanine Proteins 0.000 description 1
- 108010019407 glycyl-arginyl-glycyl-aspartic acid Proteins 0.000 description 1
- 108010051307 glycyl-glycyl-proline Proteins 0.000 description 1
- 108010078326 glycyl-glycyl-valine Proteins 0.000 description 1
- 108010038983 glycyl-histidyl-lysine Proteins 0.000 description 1
- 108010028188 glycyl-histidyl-serine Proteins 0.000 description 1
- 108010079413 glycyl-prolyl-glutamic acid Proteins 0.000 description 1
- 108010077515 glycylproline Proteins 0.000 description 1
- 229940029575 guanosine Drugs 0.000 description 1
- 150000002402 hexoses Chemical class 0.000 description 1
- 108010085325 histidylproline Proteins 0.000 description 1
- 238000009396 hybridization Methods 0.000 description 1
- 230000002163 immunogen Effects 0.000 description 1
- 230000002757 inflammatory effect Effects 0.000 description 1
- 229960003786 inosine Drugs 0.000 description 1
- 238000003780 insertion Methods 0.000 description 1
- 230000037431 insertion Effects 0.000 description 1
- 238000002955 isolation Methods 0.000 description 1
- 108010044374 isoleucyl-tyrosine Proteins 0.000 description 1
- 108010078274 isoleucylvaline Proteins 0.000 description 1
- 210000003292 kidney cell Anatomy 0.000 description 1
- 108010053037 kyotorphin Proteins 0.000 description 1
- 108010077158 leucinyl-arginyl-tryptophan Proteins 0.000 description 1
- 108010076756 leucyl-alanyl-phenylalanine Proteins 0.000 description 1
- 108010073093 leucyl-glycyl-glycyl-glycine Proteins 0.000 description 1
- 108010051673 leucyl-glycyl-phenylalanine Proteins 0.000 description 1
- 108010044056 leucyl-phenylalanine Proteins 0.000 description 1
- 108010087810 leucyl-seryl-glutamyl-leucine Proteins 0.000 description 1
- 108010091871 leucylmethionine Proteins 0.000 description 1
- 150000002632 lipids Chemical class 0.000 description 1
- 108010064235 lysylglycine Proteins 0.000 description 1
- 108010054155 lysyllysine Proteins 0.000 description 1
- 108010038320 lysylphenylalanine Proteins 0.000 description 1
- 239000002184 metal Substances 0.000 description 1
- 108700023046 methionyl-leucyl-phenylalanine Proteins 0.000 description 1
- 238000010369 molecular cloning Methods 0.000 description 1
- 229930014626 natural product Natural products 0.000 description 1
- 150000003833 nucleoside derivatives Chemical class 0.000 description 1
- 238000002515 oligonucleotide synthesis Methods 0.000 description 1
- 238000005457 optimization Methods 0.000 description 1
- 108010091617 pentalysine Proteins 0.000 description 1
- 108010072637 phenylalanyl-arginyl-phenylalanine Proteins 0.000 description 1
- 108010064486 phenylalanyl-leucyl-valine Proteins 0.000 description 1
- 108010084525 phenylalanyl-phenylalanyl-glycine Proteins 0.000 description 1
- 108010065135 phenylalanyl-phenylalanyl-phenylalanine Proteins 0.000 description 1
- 108010089198 phenylalanyl-prolyl-arginine Proteins 0.000 description 1
- 238000006116 polymerization reaction Methods 0.000 description 1
- 229920001282 polysaccharide Polymers 0.000 description 1
- 239000005017 polysaccharide Substances 0.000 description 1
- 239000000047 product Substances 0.000 description 1
- 230000002062 proliferating effect Effects 0.000 description 1
- 108010025826 prolyl-leucyl-arginine Proteins 0.000 description 1
- 108010093296 prolyl-prolyl-alanine Proteins 0.000 description 1
- 108010087846 prolyl-prolyl-glycine Proteins 0.000 description 1
- 108010079317 prolyl-tyrosine Proteins 0.000 description 1
- 108010029020 prolylglycine Proteins 0.000 description 1
- 230000001737 promoting effect Effects 0.000 description 1
- 108060006633 protein kinase Proteins 0.000 description 1
- 238000000746 purification Methods 0.000 description 1
- 239000013608 rAAV vector Substances 0.000 description 1
- 238000009790 rate-determining step (RDS) Methods 0.000 description 1
- 230000000717 retained effect Effects 0.000 description 1
- 239000000523 sample Substances 0.000 description 1
- 238000002864 sequence alignment Methods 0.000 description 1
- 108010048818 seryl-histidine Proteins 0.000 description 1
- 108010007375 seryl-seryl-seryl-arginine Proteins 0.000 description 1
- 150000003384 small molecules Chemical class 0.000 description 1
- 238000002415 sodium dodecyl sulfate polyacrylamide gel electrophoresis Methods 0.000 description 1
- 238000012358 sourcing Methods 0.000 description 1
- 238000010561 standard procedure Methods 0.000 description 1
- 235000000346 sugar Nutrition 0.000 description 1
- 150000008163 sugars Chemical class 0.000 description 1
- 230000000153 supplemental effect Effects 0.000 description 1
- 238000012360 testing method Methods 0.000 description 1
- 230000001225 therapeutic effect Effects 0.000 description 1
- RYYWUUFWQRZTIU-UHFFFAOYSA-K thiophosphate Chemical compound [O-]P([O-])([O-])=S RYYWUUFWQRZTIU-UHFFFAOYSA-K 0.000 description 1
- 229940104230 thymidine Drugs 0.000 description 1
- 238000011426 transformation method Methods 0.000 description 1
- 238000013519 translation Methods 0.000 description 1
- 108010058119 tryptophyl-glycyl-glycine Proteins 0.000 description 1
- 108010084932 tryptophyl-proline Proteins 0.000 description 1
- 108010038745 tryptophylglycine Proteins 0.000 description 1
- HDZZVAMISRMYHH-KCGFPETGSA-N tubercidin Chemical compound C1=CC=2C(N)=NC=NC=2N1[C@@H]1O[C@H](CO)[C@@H](O)[C@H]1O HDZZVAMISRMYHH-KCGFPETGSA-N 0.000 description 1
- 108010020532 tyrosyl-proline Proteins 0.000 description 1
- 108010078580 tyrosylleucine Proteins 0.000 description 1
- DRTQHJPVMGBUCF-UHFFFAOYSA-N uracil arabinoside Natural products OC1C(O)C(CO)OC1N1C(=O)NC(=O)C=C1 DRTQHJPVMGBUCF-UHFFFAOYSA-N 0.000 description 1
- 229940045145 uridine Drugs 0.000 description 1
- IBIDRSSEHFLGSD-UHFFFAOYSA-N valinyl-arginine Natural products CC(C)C(N)C(=O)NC(C(O)=O)CCCN=C(N)N IBIDRSSEHFLGSD-UHFFFAOYSA-N 0.000 description 1
- 108010003885 valyl-prolyl-glycyl-glycine Proteins 0.000 description 1
- 108010015385 valyl-prolyl-proline Proteins 0.000 description 1
- 108010009962 valyltyrosine Proteins 0.000 description 1
- 230000029812 viral genome replication Effects 0.000 description 1
- 230000009385 viral infection Effects 0.000 description 1
Classifications
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/63—Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
- C12N15/79—Vectors or expression systems specially adapted for eukaryotic hosts
- C12N15/85—Vectors or expression systems specially adapted for eukaryotic hosts for animal cells
- C12N15/86—Viral vectors
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N2710/00—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA dsDNA viruses
- C12N2710/00011—Details
- C12N2710/10011—Adenoviridae
- C12N2710/10311—Mastadenovirus, e.g. human or simian adenoviruses
- C12N2710/10322—New viral proteins or individual genes, new structural or functional aspects of known viral proteins or genes
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N2710/00—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA dsDNA viruses
- C12N2710/00011—Details
- C12N2710/10011—Adenoviridae
- C12N2710/10311—Mastadenovirus, e.g. human or simian adenoviruses
- C12N2710/10341—Use of virus, viral particle or viral elements as a vector
- C12N2710/10344—Chimeric viral vector comprising heterologous viral elements for production of another viral vector
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N2750/00—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA ssDNA viruses
- C12N2750/00011—Details
- C12N2750/14011—Parvoviridae
- C12N2750/14111—Dependovirus, e.g. adenoassociated viruses
- C12N2750/14141—Use of virus, viral particle or viral elements as a vector
- C12N2750/14143—Use of virus, viral particle or viral elements as a vector viral genome or elements thereof as genetic vector
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N2750/00—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA ssDNA viruses
- C12N2750/00011—Details
- C12N2750/14011—Parvoviridae
- C12N2750/14111—Dependovirus, e.g. adenoassociated viruses
- C12N2750/14151—Methods of production or purification of viral material
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N2830/00—Vector systems having a special element relevant for transcription
- C12N2830/50—Vector systems having a special element relevant for transcription regulating RNA stability, not being an intron, e.g. poly A signal
Landscapes
- Life Sciences & Earth Sciences (AREA)
- Health & Medical Sciences (AREA)
- Genetics & Genomics (AREA)
- Engineering & Computer Science (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Wood Science & Technology (AREA)
- Biomedical Technology (AREA)
- Organic Chemistry (AREA)
- Biotechnology (AREA)
- General Engineering & Computer Science (AREA)
- Chemical & Material Sciences (AREA)
- Zoology (AREA)
- Microbiology (AREA)
- Physics & Mathematics (AREA)
- Virology (AREA)
- Plant Pathology (AREA)
- Molecular Biology (AREA)
- Biochemistry (AREA)
- General Health & Medical Sciences (AREA)
- Biophysics (AREA)
- Micro-Organisms Or Cultivation Processes Thereof (AREA)
- Medicines That Contain Protein Lipid Enzymes And Other Medicines (AREA)
- Medicines Containing Material From Animals Or Micro-Organisms (AREA)
- Saccharide Compounds (AREA)
Abstract
The present disclosure provides improved adenoviral helper plasmids for the production of recombinant adeno-associated viruses.
Description
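Cross-Reference to Related Applications
This application claims priority to U.S. Provisional Application No. 63/188,294, filed May 13, 2021, the entire contents of which are incorporated herein by reference.
Background
Adeno-associated virus (AAV) technology is rapidly becoming the dominant form of gene therapy for genetic diseases. AAVs can be produced at scale in a variety of host cell systems, including mammalian cells such as HEK293 cells. Traditionally, AAV production in mammalian cells involves introducing multiple plasmids into the host cell, for example, a plasmid encoding a human gene or gene of interest, together with plasmids carrying various viral genes important for viral replication and packaging. Because of the number of genes required for proper replication, these genes are traditionally delivered on two to three separate plasmids.
One of these plasmids, termed the "adenoviral helper" plasmid, contains genes that are key to AAV production from the host cell. Adenoviral helper plasmids containing the E2a, VA RNA, and E4 genes have been shown to be important for promoting AAV production in mammalian host cell systems.
Despite many advances over the past two decades, concerns about the cost and safety of AAV production continue to limit the therapeutic potential of AAV technology. These concerns are due in part to the large size of many helper plasmids, which provide a large number of genes on a single helper plasmid to support AAV production. The safety concerns are due in part to the production, albeit at low levels, of potentially cytotoxic and/or inflammatory viral proteins that are not required for AAV replication.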
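Summary
In some embodiments, the present disclosure provides, among other things, adenoviral helper plasmids. In some embodiments, the present disclosure provides adenoviral helper plasmids that are reduced in size compared to those known in the art. In some embodiments, the present disclosure provides an adenoviral helper plasmid comprising nucleotide sequences encoding E2a, VA RNA, and E4, and an L4 region. In some embodiments, an adenoviral helper plasmid as described herein comprises nucleotide sequences encoding proteins derived from other viruses. In some embodiments, an adenoviral helper plasmid as described herein comprises nucleotide sequences encoding proteins derived from other viruses, including HSV-1 UL30, HSV-1 UL42, and/or HSV-1 UL29.
In some embodiments, the present disclosure provides an adenoviral helper plasmid that does not comprise one or more nucleotide sequences encoding one or more fiber proteins; L1-52/55K (packaging protein 3); a peripentonal hexon-associated protein; and an L4 region. In some embodiments, the present disclosure provides an adenoviral helper plasmid comprising a fragment, portion, or partial form of the E2a protein, VA RNA, E4, L1-52/55K (packaging protein 3), a peripentonal hexon-associated protein, and an L4 region. In some embodiments, the present disclosure provides an adenoviral helper plasmid that does not comprise one or more nucleotide sequences encoding one or more hexon-associated precursor (L4 pVIII) proteins, a DNA terminal protein, and a 23 kDa endoprotease. In some embodiments, the present disclosure provides an adenoviral helper plasmid that does not comprise one or more nucleotide sequences encoding one or more of E4orf1 and E4orf2. In some embodiments, an adenoviral helper plasmid provided herein provides a kanamycin resistance gene.
In some embodiments, the present disclosure provides an adenoviral helper plasmid in which expression of the E2a protein is under the control of one or more of an E2a promoter, a chicken β-actin promoter, and an SV40 promoter. In some embodiments, the present disclosure provides an adenoviral helper plasmid in which expression of an E4 open reading frame (orf) is under the control of one or more of a chicken β-actin promoter and an SV40 promoter.
In some embodiments, the present disclosure provides an adenoviral helper plasmid comprising a nucleotide sequence at least 80% identical to SEQ ID NO: 1-3, 5, 7, 9, 11-12, 14-20, 22, 24, 26-29, 31, 33, 35-37, 39-70, 72, 74, 76, 78, or 80. In some embodiments, the present disclosure provides an adenoviral helper plasmid comprising a nucleotide sequence encoding an amino acid sequence at least 80% identical to SEQ ID NO: 4, 6, 8, 10, 13, 21, 23, 25, 30, 32, 34, 38, 71, 73, 75, 77, 79, or 81. In some embodiments, the present disclosure provides an adenoviral helper plasmid comprising a nucleotide sequence at least 80% identical to any one of SEQ ID NOs: 41-66.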
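As a rough illustration of the "at least 80% identical" threshold recited above, the following sketch computes percent identity between two sequences that have already been aligned to equal length. The function names, the gap character, and the choice to count identity over all alignment columns (treating gapped columns as mismatches) are assumptions made for illustration only; the passage above does not specify how identity is calculated.

```python
def percent_identity(aligned_a: str, aligned_b: str, gap: str = "-") -> float:
    """Percent of alignment columns in which both sequences carry the same residue.

    Assumption: the inputs are already aligned (equal length, gaps written as `gap`).
    Columns that contain a gap in either sequence are counted as mismatches.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    if not aligned_a:
        raise ValueError("aligned sequences must be non-empty")
    matches = sum(
        1
        for x, y in zip(aligned_a.upper(), aligned_b.upper())
        if x == y and x != gap
    )
    return 100.0 * matches / len(aligned_a)


def meets_identity_threshold(aligned_a: str, aligned_b: str, threshold: float = 80.0) -> bool:
    """True if the two aligned sequences are at least `threshold` percent identical."""
    return percent_identity(aligned_a, aligned_b) >= threshold


if __name__ == "__main__":
    # Hypothetical 10-nucleotide sequences differing at the last two positions.
    print(percent_identity("ACGTACGTAC", "ACGTACGTTT"))
    print(meets_identity_threshold("ACGTACGTAC", "ACGTACGTTT"))
```

Other conventions, such as dividing by the length of the shorter sequence or ignoring gapped columns, give different percentages, so the denominator should be stated whenever an identity figure is reported.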
Brief Description of the Drawings
FIG. 1 shows a plasmid map depicting the adenoviral helper plasmid pEMBR-1.2.
FIG. 2 shows vector yields obtained using pEMBR-1.2 and the commercially available pX80 as adenoviral helper plasmids.
FIG. 3 shows vector transgene purity and vector capsid purity obtained using pEMBR-1.2 or the commercially available pX80 as the adenoviral helper plasmid.
FIG. 4 shows a comparison of GFP expression levels obtained after transformation of HEK293 cells with recombinant AAV RH.10, an ssCMV-GFP transgene, and either the pX80 or the pEMBR helper plasmid.
FIG. 5 shows a plasmid map depicting the adenoviral helper plasmids pEMBR-1.3 and pEMBR-1.3B.
FIG. 6 shows a plasmid map depicting the adenoviral helper plasmids pEMBR-1.4 and pEMBR-1.4B.
FIG. 7 shows a plasmid map depicting the adenoviral helper plasmid pEMBR-1.5.
FIG. 8 shows a plasmid map depicting the adenoviral helper plasmid pEMBR-1.2B2C.
FIG. 9 shows a plasmid map depicting the adenoviral helper plasmid pEMBR-1.2B2D.
FIG. 10 shows a plasmid map depicting the adenoviral helper plasmid pEMBR-1.5A.
FIG. 11 shows a plasmid map depicting the adenoviral helper plasmid pEMBR-1.55B2.
FIG. 12 shows a plasmid map depicting the adenoviral helper plasmid pEMBR-1.55B2 OO.
FIG. 13 shows a plasmid map depicting the adenoviral helper plasmid pEMBR-1.55B2C.
FIG. 14 shows a plasmid map depicting the adenoviral helper plasmid pEMBR-1.55B2C OO.
FIG. 15 shows a plasmid map depicting the adenoviral helper plasmid pEMBR-1.55B2D.
FIG. 16 shows a plasmid map depicting the adenoviral helper plasmid pEMBR-1.55B2D OO.
FIG. 17 shows vector yields (VG/mL), as measured by qPCR, obtained using various pEMBR plasmids as adenoviral helper plasmids.
FIG. 18 shows vector yields (VG/mL), as measured by qPCR, obtained using various pEMBR plasmids and pHelper as adenoviral helper plasmids.
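Definitions
Agent: In general, the term "agent," as used herein, refers to an entity (e.g., a lipid, metal, nucleic acid, polypeptide, polysaccharide, small molecule, etc., or a complex, combination, mixture, or system [e.g., cell, tissue, organism] thereof) or a phenomenon (e.g., heat, electric current or field, magnetic force or field, etc.). In appropriate circumstances, as will be clear from context to those skilled in the art, the term may be used to refer to an entity that is or comprises a cell or organism, or a fraction, extract, or component thereof. Alternatively or additionally, as context will make clear, the term may be used to refer to a natural product that is found in and/or obtained from nature. In some instances, again as will be clear from context, the term may be used to refer to one or more entities that are man-made in that they are designed, engineered, and/or produced through action of the hand of man and/or are not found in nature. In some embodiments, an agent may be utilized in isolated or pure form; in some embodiments, an agent may be utilized in crude form. In some embodiments, potential agents may be provided as collections or libraries that may, for example, be screened to identify or characterize active agents within them. In some cases, the term "agent" may refer to a compound or entity that is or comprises a polymer; in some cases, the term may refer to a compound or entity that comprises one or more polymeric moieties. In some embodiments, the term "agent" may refer to a compound or entity that is not a polymer and/or is substantially free of any polymer and/or of one or more particular polymeric moieties. In some embodiments, the term may refer to a compound or entity that lacks or is substantially free of any polymeric moiety.
Approximately/about: As used herein, the term "approximately" or "about," as applied to one or more values of interest, refers to a value that is similar to a stated reference value. In certain embodiments, the term "approximately" or "about" refers to a range of values that fall within 25%, 20%, 19%, 18%, 17%, 16%, 15%, 14%, 13%, 12%, 11%, 10%, 9%, 8%, 7%, 6%, 5%, 4%, 3%, 2%, 1%, or less in either direction (greater than or less than) of the stated reference value, unless otherwise stated or otherwise evident from the context (except where such a number would exceed 100% of a possible value).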
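The tolerance reading of "about" given above can be sketched as a simple range check. The function names, the default tolerance of 10%, and the optional `max_possible` cap (standing in for the "would exceed 100% of a possible value" exception) are illustrative assumptions, not terms used in the definition.

```python
def about_range(reference, tolerance_pct=10.0, max_possible=None):
    """Return (low, high) bounds lying within tolerance_pct of reference in either direction.

    Assumption: `max_possible`, if given, caps the upper bound so the range
    never exceeds the largest value the quantity can take.
    """
    delta = abs(reference) * tolerance_pct / 100.0
    low, high = reference - delta, reference + delta
    if max_possible is not None:
        high = min(high, max_possible)
    return low, high


def is_about(value, reference, tolerance_pct=10.0, max_possible=None):
    """True if value falls inside the 'about' range around reference."""
    low, high = about_range(reference, tolerance_pct, max_possible)
    return low <= value <= high


if __name__ == "__main__":
    print(about_range(100, 25))                   # bounds for "about 100" at 25%
    print(is_about(110, 100, tolerance_pct=10))   # boundary value is included
    print(about_range(90, 25, max_possible=100))  # upper bound capped at 100
```

Which tolerance applies in a given passage depends on context, so the percentage is left as a parameter rather than fixed.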
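Comparable: As used herein, the term "comparable" refers to two or more agents, entities, situations, sets of conditions, etc., that may not be identical to one another but that are sufficiently similar to permit comparison between them, so that one skilled in the art will appreciate that conclusions may reasonably be drawn based on the differences or similarities observed. In some embodiments, comparable sets of conditions, circumstances, individuals, or populations are characterized by a plurality of substantially identical features and one or a small number of varied features. Those of ordinary skill in the art will understand, in context, what degree of identity is required in any given circumstance for two or more such agents, entities, situations, sets of conditions, etc., to be considered comparable. For example, sets of circumstances, individuals, or populations are comparable to one another when they are characterized by a sufficient number and type of substantially identical features to warrant a reasonable conclusion that differences in results obtained or phenomena observed under or with the different sets of circumstances, individuals, or populations are caused by or indicative of the variation in those features that are varied.
Corresponding to: As used herein, the term "corresponding to" may be used to designate the position/identity of a structural element in a compound or composition through comparison with an appropriate reference compound or composition. For example, in some embodiments, a monomeric residue in a polymer (e.g., an amino acid residue in a polypeptide or a nucleic acid residue in a polynucleotide) may be identified as "corresponding to" a residue in an appropriate reference polymer. For example, as those of ordinary skill will appreciate, for purposes of simplicity, residues in a polypeptide are often designated using a canonical numbering system based on a related reference polypeptide, so that an amino acid "corresponding to" the residue at position 190, for example, need not be the 190th amino acid in a particular amino acid chain but rather corresponds to the residue found at position 190 in the reference polypeptide; those of ordinary skill in the art readily appreciate how to identify "corresponding" amino acids. For example, those skilled in the art will be aware of various sequence alignment strategies that can be used to identify "corresponding" residues in polypeptides and/or nucleic acids in accordance with the present disclosure, including software programs such as, for example, BLAST, CS-BLAST, CUDASW++, DIAMOND, FASTA, GGSEARCH/GLSEARCH, Genoogle, HMMER, HHpred/HHsearch, IDF, Infernal, KLAST, USEARCH, parasail, PSI-BLAST, PSI-Search, ScalaBLAST, Sequilab, SAM, SSEARCH, SWAPHI, SWAPHI-LS, SWIMM, or SWIPE.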
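To make the idea of a "corresponding" residue concrete, the sketch below maps a 1-based position in a reference sequence to the matching 1-based position in a query sequence, given a pairwise alignment that has already been produced by one of the tools listed above. The function name, the gap character, and the toy sequences are hypothetical and serve only as an illustration.

```python
def corresponding_position(aligned_ref, aligned_query, ref_pos, gap="-"):
    """Map a 1-based position in the ungapped reference to the 1-based position
    of the aligned residue in the ungapped query.

    Returns None when the reference residue is aligned to a gap in the query.
    Assumption: the two strings are rows of one pairwise alignment (equal length).
    """
    if len(aligned_ref) != len(aligned_query):
        raise ValueError("alignment rows must have equal length")
    if ref_pos < 1:
        raise ValueError("ref_pos is 1-based and must be >= 1")
    ref_count = 0    # residues of the reference seen so far
    query_count = 0  # residues of the query seen so far
    for r, q in zip(aligned_ref, aligned_query):
        if r != gap:
            ref_count += 1
        if q != gap:
            query_count += 1
        if r != gap and ref_count == ref_pos:
            return query_count if q != gap else None
    raise ValueError("ref_pos lies beyond the end of the reference")


if __name__ == "__main__":
    # The query carries a hypothetical two-residue N-terminal extension ("GS"),
    # so reference residue 3 corresponds to query residue 5.
    print(corresponding_position("--MKTAY", "GSMKTAY", 3))
```

The example shows why the corresponding residue need not sit at the same ordinal position in both chains: an insertion or extension in either sequence shifts the numbering.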
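Downstream: As used herein, the term "downstream" refers to the location or position of a nucleic acid sequence relative to a reference nucleic acid sequence, specifically a position that, during RNA transcription, is closer to the 3' end of the transcribed RNA molecule encoded by the reference sequence. For example, for two sequences A and B, sequence A is downstream of sequence B when transcription of sequence B proceeds toward sequence A.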
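Nucleic acid: As used herein, in its broadest sense, "nucleic acid" refers to any compound and/or substance that is or can be incorporated into an oligonucleotide chain. In some embodiments, a nucleic acid is a compound and/or substance that is or can be incorporated into an oligonucleotide chain via a phosphodiester linkage. As will be clear from context, in some embodiments, "nucleic acid" refers to an individual nucleic acid residue (e.g., a nucleotide and/or nucleoside); in some embodiments, "nucleic acid" refers to an oligonucleotide chain comprising individual nucleic acid residues. In some embodiments, a "nucleic acid" is or comprises RNA; in some embodiments, a "nucleic acid" is or comprises DNA. In some embodiments, a nucleic acid is, comprises, or consists of one or more natural nucleic acid residues. In some embodiments, a nucleic acid is, comprises, or consists of one or more nucleic acid analogs. In some embodiments, a nucleic acid analog differs from a nucleic acid in that it does not utilize a phosphodiester backbone. For example, in some embodiments, a nucleic acid is, comprises, or consists of one or more "peptide nucleic acids," which are known in the art, have peptide bonds instead of phosphodiester bonds in the backbone, and are considered within the scope of the present invention. Alternatively or additionally, in some embodiments, a nucleic acid has one or more phosphorothioate and/or 5'-N-phosphoramidite linkages rather than phosphodiester bonds. In some embodiments, a nucleic acid is, comprises, or consists of one or more natural nucleosides (e.g., adenosine, thymidine, guanosine, cytidine, uridine, deoxyadenosine, deoxythymidine, deoxyguanosine, and deoxycytidine). In some embodiments, a nucleic acid is, comprises, or consists of one or more nucleoside analogs (e.g., 2-aminoadenosine, 2-thiothymidine, inosine, pyrrolo-pyrimidine, 3-methyl adenosine, 5-methylcytidine, C-5 propynyl-cytidine, C-5 propynyl-uridine, 2-aminoadenosine, C5-bromouridine, C5-fluorouridine, C5-iodouridine, C5-propynyl-uridine, C5-propynyl-cytidine, C5-methylcytidine, 2-aminoadenosine, 7-deazaadenosine, 7-deazaguanosine, 8-oxoadenosine, 8-oxoguanosine, O(6)-methylguanine, 2-thio methylated bases, intercalated bases, and combinations thereof). In some embodiments, a nucleic acid comprises one or more modified sugars (e.g., 2'-fluororibose, ribose, 2'-deoxyribose, arabinose, and hexose) as compared with those in natural nucleic acids. In some embodiments, a nucleic acid has a nucleotide sequence that encodes a functional gene product such as an RNA or protein. In some embodiments, a nucleic acid includes one or more introns. In some embodiments, nucleic acids are prepared by one or more of isolation from a natural source, enzymatic synthesis by polymerization based on a complementary template (in vivo or in vitro), reproduction in a recombinant cell or system, and chemical synthesis. In some embodiments, a nucleic acid is at least 3, 4, 5, 6, 7, 8, 9, 10, 15, 20, 25, 30, 35, 40, 45, 50, 55, 60, 65, 70, 75, 80, 85, 90, 95, 100, 110, 120, 130, 140, 150, 160, 170, 180, 190, 200, 225, 250, 275, 300, 325, 350, 375, 400, 425, 450, 475, 500, 600, 700, 800, 900, 1000, 1500, 2000, 2500, 3000, 3500, 4000, 4500, 5000 or more residues long. In some embodiments, a nucleic acid is partly or wholly single stranded; in some embodiments, a nucleic acid is partly or wholly double stranded.
In some embodiments, a nucleic acid has a nucleotide sequence comprising at least one element that encodes a polypeptide, or is the complement of a sequence that encodes a polypeptide. In some embodiments, a nucleic acid has enzymatic activity.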
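Operably linked: As used herein, the term "operably linked" refers to a juxtaposition in which the components described are in a relationship permitting them to function in their intended manner. A control element "operably linked" to a functional element is associated in such a way that expression and/or activity of the functional element is achieved under conditions compatible with the control element. In some embodiments, "operably linked" control elements are contiguous (e.g., covalently linked) with the coding elements of interest; in some embodiments, control elements act in trans to, or otherwise at a distance from, the functional element of interest.
Producer cell: As used herein, the term "producer cell" refers to any cell used to produce recombinant AAV (rAAV). In some embodiments, the producer cell is a mammalian cell. In some embodiments, the producer cell is a transformed mammalian cell. In some embodiments, the producer cell is a Vero, HeLa, HEK293, or HEK293T cell, or a derivative thereof.
Transformation: As used herein, the term "transformation" refers to any process by which exogenous DNA is introduced into a host cell. Transformation may occur under natural or artificial conditions using various methods well known in the art. Transformation may rely on any known method for inserting foreign nucleic acid sequences into a prokaryotic or eukaryotic host cell. In some embodiments, a particular transformation method is selected based on the host cell being transformed, and such methods may include, but are not limited to, viral infection, electroporation, mating, and lipofection. In some embodiments, a "transformed" cell is stably transformed in that the inserted DNA is capable of replication either as an autonomously replicating plasmid or as part of the host chromosome. In some embodiments, a transformed cell transiently expresses the introduced nucleic acid for a limited period of time.
Upstream: As used herein, the term "upstream" refers to the location or position of a nucleic acid sequence relative to a reference nucleic acid sequence, specifically a position that, during RNA transcription, is closer to the 5' end of the transcribed RNA molecule encoded by the reference sequence. For example, for two sequences A and B, sequence A is upstream of sequence B when transcription of sequence B proceeds away from sequence A.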
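The "upstream" and "downstream" definitions can be illustrated with a small coordinate sketch. It assumes a linear coordinate system in which a reference sequence occupies an interval and is transcribed either on the "+" strand (toward increasing coordinates) or on the "-" strand (toward decreasing coordinates). The function name and the coordinate convention are assumptions for illustration, and the wrap-around of a circular plasmid is deliberately ignored.

```python
def relative_position(pos, ref_start, ref_end, ref_strand):
    """Classify `pos` as 'upstream', 'downstream', or 'within' relative to a
    reference sequence spanning [ref_start, ref_end] (inclusive coordinates).

    Assumption: '+' means transcription runs toward increasing coordinates,
    '-' means it runs toward decreasing coordinates.
    """
    if ref_start > ref_end:
        raise ValueError("ref_start must not exceed ref_end")
    if ref_strand not in ("+", "-"):
        raise ValueError("ref_strand must be '+' or '-'")
    if ref_start <= pos <= ref_end:
        return "within"
    before = pos < ref_start  # lies at lower coordinates than the reference
    if ref_strand == "+":
        # Transcription proceeds away from lower coordinates.
        return "upstream" if before else "downstream"
    # On the '-' strand the direction of transcription is reversed.
    return "downstream" if before else "upstream"


if __name__ == "__main__":
    print(relative_position(50, 100, 200, "+"))
    print(relative_position(50, 100, 200, "-"))
```

The same coordinate is upstream of a "+" strand reference and downstream of a "-" strand one, which is why the definitions are phrased in terms of the direction of transcription rather than absolute position.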
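Vector: As used herein, the term "vector" refers to a nucleic acid molecule capable of transporting another nucleic acid to which it has been linked. One type of vector is a "plasmid," which refers to a circular double-stranded DNA loop into which additional DNA segments may be ligated. Another type of vector is a viral vector, in which additional DNA segments may be ligated into the viral genome. Certain vectors are capable of autonomous replication in a host cell into which they are introduced (e.g., bacterial vectors having a bacterial origin of replication and episomal mammalian vectors). Other vectors (e.g., non-episomal mammalian vectors) can be integrated into the genome of a host cell upon introduction into the host cell and are thereby replicated along with the host genome. Moreover, certain vectors are capable of directing the expression of genes to which they are operably linked. Such vectors are referred to herein as "expression vectors." Standard techniques may be used for recombinant DNA, oligonucleotide synthesis, and tissue culture and transformation (e.g., electroporation, lipofection). Enzymatic reactions and purification techniques may be performed according to the manufacturer's specifications, as commonly accomplished in the art, or as described herein. The foregoing techniques and procedures may generally be performed according to conventional methods well known in the art and as described in various general and more specific references that are cited and discussed throughout the present specification. See, e.g., Sambrook et al., Molecular Cloning: A Laboratory Manual (2d ed., Cold Spring Harbor Laboratory Press, Cold Spring Harbor, N.Y. (1989)), which is incorporated herein by reference for any purpose.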
Detailed Description of Certain Embodiments
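The helper functions that adenovirus provides for AAV replication have been described previously. Without wishing to be bound by any particular hypothesis, the adenoviral E1A protein has been described as activating AAV gene expression by binding to and activating the AAV P5 rep promoter. Similarly, another adenoviral protein, E2A, has been described as activating transcription from the AAV P5 promoter. E2A has also been described as cooperating with virus-associated RNA I (VA RNAI) to enhance translation of AAV RNAs. Adenoviral E4orf4 has been shown to induce cell-cycle arrest at the G2/M boundary and to aid AAV production. Adenoviral E4orf6 has been described as enhancing the conversion of single-stranded recombinant AAV genomes into double-stranded genomes, which is the rate-limiting step of viral DNA replication in vitro and in vivo. VA RNAI has also been described as supporting AAV replication. VA RNAI has been described as physically interacting with the double-stranded RNA-activated protein kinase (PKR), which would otherwise induce an antiviral immune response that blocks viral protein production.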
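Previous studies proposed that the minimal set of genes for efficient recombinant AAV production in HEK293 cells, which supply the E1 genes, consists of the E2a, E4orf6, and VA RNAI genes. A helper plasmid containing this set of genes, designated pXX6, is used to produce adenovirus-free recombinant AAV.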
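One of the major ongoing challenges in developing and optimizing AAV vectors for clinical applications is increasing the amount of virus produced. Because the vectors are non-replicative, production depends solely on the efficiency with which the parvoviral genome components are transfected into a packaging cell line (e.g., human embryonic kidney cells, HEK293 or HEK293T, or insect cells such as Sf9). Accordingly, developing means to increase recombinant AAV (rAAV) production remains of great importance.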
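Other major challenges in producing rAAVs for clinical applications relate to the cost of producing such rAAVs at scale and to the safety of the final product itself. For example, commercially available helper plasmids such as pXX6-80 appear to transcribe the Ad fiber protein at low levels. Importantly, the fiber protein is not required for AAV production and may be immunogenic in humans. In addition, pXX6-80 is quite large, at more than 18 kb. Plasmids of this size are difficult to manufacture and increase manufacturing costs, which can have a significant impact when sourcing GMP plasmid for the manufacture of clinical-grade AAV.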
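Various versions of adenoviral helper plasmids have been derived, including, for example, pFAdDeltaF6 (from the University of Pennsylvania) and pHelper (Agilent). The pFAdDeltaF6 plasmid is about 3 kb smaller than pXX6-80 but retains the fiber gene sequence. The pHelper plasmid, available from Agilent, is smaller than pXX6-80, at approximately 11.6 kb. However, it contains an ampicillin resistance gene, which is generally not recommended for plasmids used in AAV production.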
The present disclosure addresses the technical challenges described above by providing the compositions and methods described herein.
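In some embodiments, the present disclosure relates to adenovirus-derived helper plasmids (adenoviral helper plasmids) comprising adenoviral DNA sequences encoding viral helper proteins. In some embodiments, the adenoviral helper plasmids of the present disclosure are used in methods of producing recombinant adeno-associated viruses (rAAVs). In some embodiments, the adenoviral helper plasmids of the present disclosure increase the production of rAAVs.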
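In some embodiments, the present disclosure provides adenoviral helper plasmids comprising nucleotide sequences encoding proteins derived from a non-adenoviral source. In some embodiments, the present disclosure provides adenoviral helper plasmids comprising nucleotide sequences encoding proteins derived from a virus other than adenovirus. In some embodiments, the adenoviral helper plasmid comprises all or part of the adenoviral nucleotide sequences encoding the adenoviral proteins E2a and E4, as well as the non-coding RNA VA RNA. In some embodiments, the present disclosure describes improved adenoviral helper plasmids that are smaller than the leading commercially available adenoviral helper plasmids, which allows rAAVs to be produced more safely and at lower cost in producer cell expression systems.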
In some embodiments, the present disclosure provides adenoviral helper plasmids of reduced overall size compared with currently available adenoviral helper plasmids (e.g., pXX6-80 is 18.932 kbp; pALD-X80 is 18.876 kbp; pHelper is 11.635 kbp; and pFAdDeltaF6 is 15.420 kbp).
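In some embodiments, the present disclosure provides adenoviral helper plasmids of smaller size. In some embodiments, the adenoviral helper plasmid of the present disclosure is approximately 6.5 kb to 15.5 kb. In some embodiments, the size of the adenoviral helper plasmid of the present disclosure is approximately 6 kb, 7 kb, 8 kb, 9 kb, 10 kb, 11 kb, 12 kb, 13 kb, 14 kb, 15 kb, or 16 kb. In some embodiments, the size of the adenoviral helper plasmid of the present disclosure is approximately 6-7 kb; 6.5-7.5 kb; 7-8 kb; 7.5-8.5 kb; 8-9 kb; 8.5-9.5 kb; 9-10 kb; 9.5-10.5 kb; 10-11 kb; 10.5-11.5 kb; 11-12 kb; 11.5-12.5 kb; 12-13 kb; 12.5-13.5 kb; 13-14 kb; 13.5-14.5 kb; 14-15 kb; 14.5-15.5 kb; or 15-16 kb. The smaller adenoviral helper plasmids of the present disclosure allow AAV to be produced more simply and cheaply in the quantities required for large-scale AAV manufacturing. In some embodiments, removing genes and/or portions of genes makes the adenoviral helper plasmids of the present disclosure safer, because the producer cells do not produce adenoviral structural proteins (e.g., fiber) that could co-purify with AAV during downstream processing, which lowers the risk of inadvertently introducing adenoviral structural proteins into a patient.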
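As a rough illustration of the size comparison above, the following sketch checks a candidate plasmid size against the reference plasmid sizes quoted in the text and against the approximate 6.5 kb to 15.5 kb range. The function names and the example candidate size are hypothetical; only the four reference sizes and the range bounds are taken from the text.

```python
# Reference helper plasmid sizes in kbp, as quoted in the text above.
REFERENCE_SIZES_KBP = {
    "pXX6-80": 18.932,
    "pALD-X80": 18.876,
    "pFAdDeltaF6": 15.420,
    "pHelper": 11.635,
}


def in_disclosed_range(size_kbp: float, low: float = 6.5, high: float = 15.5) -> bool:
    """True if the size falls within the approximate 6.5-15.5 kb range."""
    return low <= size_kbp <= high


def references_larger_than(size_kbp: float) -> list:
    """Names of the quoted reference plasmids that are larger than the candidate."""
    return sorted(name for name, ref in REFERENCE_SIZES_KBP.items() if ref > size_kbp)


candidate_kbp = 10.0  # hypothetical size, for illustration only
print(in_disclosed_range(candidate_kbp))
print(references_larger_than(candidate_kbp))
```

A candidate of 12 kb, for instance, would still fall inside the range but would be larger than pHelper, which is why the text compares against each reference plasmid individually.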
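In some embodiments, removing adenoviral helper genes to create smaller adenoviral helper plasmids allows supplemental genes to be added to further improve AAV quality and yield. These supplemental genes increase the size of the plasmid relative to the smallest versions, but are worth the additional production cost because they enable similar or higher AAV productivity. Importantly, these plasmids are still smaller than commercially available helper plasmids such as, for example, pALD-X80.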
Adenoviral Helper Plasmids
Helper Genes and Resistance Genes
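In some embodiments, the adenoviral helper plasmid of the present disclosure comprises one or more nucleotide sequence(s) encoding a protein selected from the group consisting of E2b, E2a, E4orf4, E1B55K, E1b19K, E1a, E4orf6, VA RNA, and combinations thereof.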
In some embodiments, the adenoviral helper plasmid comprises nucleotide sequences encoding an E2a protein, an E4 region, and a VA RNA region. In some embodiments, the E4 region comprises one or more of E4orf1, E4orf2, E4orf3, E4orf4, E4orf5, E4orf6, and E4orf7. In some embodiments, E4orf1 has a nucleotide sequence at least 80%, 85%, 90%, 95%, 99%, or 100% identical to SEQ ID NO: 70. In some embodiments, E4orf1 has an amino acid sequence at least 80%, 85%, 90%, 95%, 99%, or 100% identical to SEQ ID NO: 71. In some embodiments, E4orf2 has a nucleotide sequence at least 80%, 85%, 90%, 95%, 99%, or 100% identical to SEQ ID NO: 72. In some embodiments, E4orf2 has an amino acid sequence at least 80%, 85%, 90%, 95%, 99%, or 100% identical to SEQ ID NO: 73. In some embodiments, E4orf3 has a nucleotide sequence at least 80%, 85%, 90%, 95%, 99%, or 100% identical to SEQ ID NO: 74. In some embodiments, E4orf3 has an amino acid sequence at least 80%, 85%, 90%, 95%, 99%, or 100% identical to SEQ ID NO: 75. In some embodiments, E4orf4 has a nucleotide sequence at least 80%, 85%, 90%, 95%, 99%, or 100% identical to SEQ ID NO: 76. In some embodiments, E4orf4 has an amino acid sequence at least 80%, 85%, 90%, 95%, 99%, or 100% identical to SEQ ID NO: 77. In some embodiments, E4orf6 has a nucleotide sequence at least 80%, 85%, 90%, 95%, 99%, or 100% identical to SEQ ID NO: 78. In some embodiments, E4orf6 has an amino acid sequence at least 80%, 85%, 90%, 95%, 99%, or 100% identical to SEQ ID NO: 79. In some embodiments, E4orf7 has a nucleotide sequence at least 80%, 85%, 90%, 95%, 99%, or 100% identical to SEQ ID NO: 80. In some embodiments, E4orf7 has an amino acid sequence at least 80%, 85%, 90%, 95%, 99%, or 100% identical to SEQ ID NO: 81. In some embodiments, the adenoviral helper plasmid does not comprise a nucleotide sequence comprising E4orf1. In some embodiments, the adenoviral helper plasmid does not comprise a nucleotide sequence comprising E4orf2. In some embodiments, the adenoviral helper plasmid does not comprise a nucleotide sequence comprising E4orf1 and does not comprise a nucleotide sequence comprising E4orf2. In some embodiments, expression of the E4 region is under the control of an E4 mini promoter. In some embodiments, the E4 region is operably linked to an E4 mini promoter. In some embodiments, the E4 mini promoter has a nucleotide sequence at least 80%, 85%, 90%, 95%, 99%, or 100% identical to SEQ ID NO: 1. In some embodiments, the E4 region is operably linked to an SV40 promoter. In some embodiments, expression of the E4 region is under the control of an SV40 promoter. In some embodiments, the SV40 promoter has a nucleotide sequence at least 80%, 85%, 90%, 95%, 99%, or 100% identical to SEQ ID NO: 2.
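The recurring phrase "at least 80%, 85%, 90%, 95%, 99%, or 100% identical" can be made concrete with a minimal sketch. The passage above does not state how identity is computed, so the approach here is an assumption for illustration: the two sequences are already aligned to equal length (with `-` marking gaps), and identity is the number of matching columns divided by the number of compared columns. The function names and the toy sequences are hypothetical and unrelated to any SEQ ID NO.

```python
def percent_identity(aligned_a: str, aligned_b: str) -> float:
    """Percent identity over a pre-computed pairwise alignment.

    Assumption: both strings have equal length and use '-' for gaps.
    Columns where both sequences have a gap are ignored; a gap opposite
    a residue counts as a mismatch.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    compared = 0
    matches = 0
    for x, y in zip(aligned_a.upper(), aligned_b.upper()):
        if x == "-" and y == "-":
            continue
        compared += 1
        if x == y:
            matches += 1
    if compared == 0:
        raise ValueError("no comparable columns")
    return 100.0 * matches / compared


def meets_threshold(aligned_a: str, aligned_b: str, threshold: float = 80.0) -> bool:
    """True if identity is at least `threshold` percent."""
    return percent_identity(aligned_a, aligned_b) >= threshold


# Toy example: one mismatch in ten aligned positions.
print(percent_identity("ACGTACGTAC", "ACGTACGTAA"))
```

In practice the alignment itself would come from an established alignment tool, and the choice of denominator (alignment length, shorter sequence, or reference length) changes the reported value, so any threshold check should state which convention it uses.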
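In some embodiments, the adenoviral helper plasmid of the present disclosure comprises a resistance gene. In some embodiments, the adenoviral helper plasmid of the present disclosure comprises an ampicillin resistance gene (e.g., a nucleotide sequence encoding a protein that confers resistance to ampicillin). In some embodiments, the adenoviral helper plasmid of the present disclosure does not comprise an ampicillin resistance gene. In some embodiments, the adenoviral helper plasmid of the present disclosure comprises a kanamycin resistance gene (e.g., a nucleotide sequence encoding a protein that confers resistance to kanamycin). In some embodiments, the adenoviral helper plasmid of the present disclosure does not comprise a kanamycin resistance gene.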
Fiber Gene
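In some embodiments, the adenoviral helper plasmid of the present disclosure does not comprise a nucleotide sequence encoding an adenoviral fiber protein. In some embodiments, the adenoviral helper plasmid does not comprise a nucleotide sequence encoding a full-length adenoviral fiber protein. In some embodiments, the adenoviral helper plasmid comprises a nucleotide sequence encoding a portion or fragment of an adenoviral fiber protein. In some embodiments, the adenoviral helper plasmid comprises a nucleotide sequence at least 80%, 85%, 90%, 95%, 99%, or 100% identical to the sequence of pXX6-80, excluding the nucleotide sequence encoding the adenoviral fiber protein.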
L1-52/55K (Packaging Protein 3) Gene
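In some embodiments, the adenoviral helper plasmid of the present disclosure does not comprise a nucleotide sequence encoding the L1-52/55K (packaging protein 3) protein. In some embodiments, the adenoviral helper plasmid of the present disclosure does not comprise a nucleotide sequence encoding the peripentonal hexon-associated gene.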
L4 Region
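In some embodiments, the adenoviral helper plasmid of the present disclosure comprises a complete L4 (hexon assembly) gene. In some embodiments, the adenoviral helper plasmid of the present disclosure comprises a nucleotide sequence encoding complete L4 (hexon assembly). In some embodiments, the adenoviral helper plasmid of the present disclosure has a nucleotide sequence at least 80%, 85%, 90%, 95%, 99%, or 100% identical to SEQ ID NO: 3. In some embodiments, the adenoviral helper plasmid of the present disclosure encodes an amino acid sequence at least 80%, 85%, 90%, 95%, 99%, or 100% identical to SEQ ID NO: 4. In some embodiments, the adenoviral helper plasmid of the present disclosure comprises a complete L4 (33kDa Ex2) gene. In some embodiments, the adenoviral helper plasmid of the present disclosure comprises a nucleotide sequence encoding complete L4 (33kDa Ex2). In some embodiments, the adenoviral helper plasmid of the present disclosure has a nucleotide sequence at least 80%, 85%, 90%, 95%, 99%, or 100% identical to SEQ ID NO: 5. In some embodiments, the adenoviral helper plasmid of the present disclosure encodes an amino acid sequence at least 80%, 85%, 90%, 95%, 99%, or 100% identical to SEQ ID NO: 6.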
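In some embodiments, the adenoviral helper plasmid of the present disclosure comprises a complete L4 encapsidation protein gene. In some embodiments, the adenoviral helper plasmid of the present disclosure comprises a nucleotide sequence encoding a complete L4 encapsidation protein. In some embodiments, the adenoviral helper plasmid of the present disclosure has a nucleotide sequence at least 80%, 85%, 90%, 95%, 99%, or 100% identical to SEQ ID NO: 7. In some embodiments, the adenoviral helper plasmid of the present disclosure encodes an amino acid sequence at least 80%, 85%, 90%, 95%, 99%, or 100% identical to SEQ ID NO: 8.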
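In some embodiments, the adenoviral helper plasmid of the present disclosure does not comprise an L4 (hexon assembly) gene. In some embodiments, the adenoviral helper plasmid does not comprise an L4 encapsidation protein gene. In some embodiments, the adenoviral helper plasmid does not comprise an L4 (hexon assembly) gene and does not comprise an L4 encapsidation protein gene. In some embodiments, the adenoviral helper plasmid of the present disclosure does not comprise a nucleotide sequence encoding L4 (hexon assembly). In some embodiments, the adenoviral helper plasmid does not comprise a nucleotide sequence encoding an L4 encapsidation protein. In some embodiments, the adenoviral helper plasmid does not comprise a nucleotide sequence encoding L4 (hexon assembly) and does not comprise a nucleotide sequence encoding an L4 encapsidation protein. In some embodiments, the adenoviral helper plasmid of the present disclosure comprises a nucleotide sequence encoding a fragment of L4 33kDa Ex2. In some embodiments, the adenoviral helper plasmid of the present disclosure has a nucleotide sequence at least 80%, 85%, 90%, 95%, 99%, or 100% identical to SEQ ID NO: 9. In some embodiments, the adenoviral helper plasmid of the present disclosure encodes an amino acid sequence at least 80%, 85%, 90%, 95%, 99%, or 100% identical to SEQ ID NO: 10. In some embodiments, the nucleotide sequence encoding the fragment of L4 33kDa Ex2 comprises an E2a promoter region (see, e.g., Casper et al., "Identification of an adeno-associated virus Rep protein binding site in the adenovirus E2a promoter." Journal of virology 79.1 (2005)). In some embodiments, the E2a promoter region has a nucleotide sequence at least 80%, 85%, 90%, 95%, 99%, or 100% identical to SEQ ID NO: 11. In some embodiments, the adenoviral helper plasmid does not comprise a nucleotide sequence encoding a fragment of L4 33kDa Ex2. In some embodiments, the adenoviral helper plasmid does not comprise an E2a promoter region.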
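In some embodiments, the adenoviral helper plasmid of the present disclosure comprises a nucleotide sequence encoding a fragment of the hexon-associated precursor (L4 pVIII). In some embodiments, the adenoviral helper plasmid has a nucleotide sequence at least 80%, 85%, 90%, 95%, 99%, or 100% identical to SEQ ID NO: 12. In some embodiments, the adenoviral helper plasmid encodes an amino acid sequence at least 80%, 85%, 90%, 95%, 99%, or 100% identical to SEQ ID NO: 13. In some embodiments, the adenoviral helper plasmid does not comprise a nucleotide sequence encoding the hexon-associated precursor (L4 pVIII). In some embodiments, the adenoviral helper plasmid does not comprise a nucleotide sequence encoding a fragment of the partial hexon-associated precursor (L4 pVIII).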
VA RNA Region
In some embodiments, the adenoviral helper plasmid of the present disclosure comprises a VA RNA region having a nucleotide sequence at least 80%, 85%, 90%, 95%, 99%, or 100% identical to SEQ ID NO: 14. In some embodiments, the adenoviral helper plasmid comprises a VA RNA region having a nucleotide sequence at least 80%, 85%, 90%, 95%, 99%, or 100% identical to SEQ ID NO: 15. In some embodiments, the VA RNA region comprises a VA RNAI gene having a sequence at least 80%, 85%, 90%, 95%, 99%, or 100% identical to SEQ ID NO: 16. In some embodiments, the VA RNA region comprises a VA RNAI gene having a sequence at least 80%, 85%, 90%, 95%, 99%, or 100% identical to SEQ ID NO: 17. In some embodiments, the VA RNA region comprises a VA RNAII gene having a sequence at least 80%, 85%, 90%, 95%, 99%, or 100% identical to SEQ ID NO: 18. In some embodiments, the VA RNA region comprises a VA RNAII gene having a sequence at least 80%, 85%, 90%, 95%, 99%, or 100% identical to SEQ ID NO: 19.
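In some embodiments, the adenoviral helper plasmid of the present disclosure comprises a nucleotide sequence encoding a fragment of the DNA terminal protein. In some embodiments, the nucleotide sequence encoding the DNA terminal protein fragment is at least 80%, 85%, 90%, 95%, 99%, or 100% identical to SEQ ID NO: 20. In some embodiments, the fragment of the DNA terminal protein has an amino acid sequence at least 80%, 85%, 90%, 95%, 99%, or 100% identical to SEQ ID NO: 21. In some embodiments, the adenoviral helper plasmid does not comprise a nucleotide sequence encoding the DNA terminal protein. In some embodiments, the adenoviral helper plasmid comprises a nucleotide sequence encoding a fragment of the 23kDa endoprotease. In some embodiments, the adenoviral helper plasmid comprises a nucleotide sequence at least 80%, 85%, 90%, 95%, 99%, or 100% identical to SEQ ID NO: 22. In some embodiments, the 23kDa endoprotease fragment region has an amino acid sequence at least 80%, 85%, 90%, 95%, 99%, or 100% identical to SEQ ID NO: 23. In some embodiments, the adenoviral helper plasmid does not comprise a nucleotide sequence encoding the 23kDa endoprotease region.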
Introduction of Genes Encoding Supplemental Attributes
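In some embodiments, the adenoviral helper plasmid of the present disclosure comprises an E2a gene. In some embodiments, the adenoviral helper plasmid of the present disclosure comprises a nucleotide sequence encoding E2a. In some embodiments, the adenoviral helper plasmid of the present disclosure has a nucleotide sequence at least 80%, 85%, 90%, 95%, 99%, or 100% identical to SEQ ID NO: 24. In some embodiments, the adenoviral helper plasmid of the present disclosure encodes an amino acid sequence at least 80%, 85%, 90%, 95%, 99%, or 100% identical to SEQ ID NO: 25. In some embodiments, expression of E2a is under the control of a promoter. In some embodiments, the nucleotide sequence encoding E2a is operably linked to a promoter. In some embodiments, the promoter is, for example, a CMV promoter, a PGK promoter, an SV40 promoter, an EF-1α promoter, a Ubc promoter, a CAG promoter, or a β-actin promoter. In some embodiments, the nucleotide sequence encoding E2a is operably linked to a transcriptional enhancer. In some embodiments, the transcriptional enhancer is, for example, a CMV enhancer. In some embodiments, the nucleotide sequence encoding E2a is operably linked to a regulatory intron. In some embodiments, expression of E2a is under the control of a chicken β-actin promoter. In some embodiments, the nucleotide sequence encoding E2a is operably linked to a chicken β-actin promoter. In some embodiments, the chicken β-actin promoter has a nucleotide sequence at least 80%, 85%, 90%, 95%, 99%, or 100% identical to SEQ ID NO: 26. In some embodiments, the chicken β-actin promoter is located upstream of the nucleotide sequence encoding E2a. In some embodiments, expression of E2a is under the control of an E2a promoter and a chicken β-actin promoter. In some embodiments, the nucleotide sequence encoding E2a is operably linked to an E2a promoter and a chicken β-actin promoter. In some embodiments, the chicken β-actin promoter is located upstream of the E2a promoter. In some embodiments, expression of E2a is under the control of a chicken β-actin promoter and a CMV enhancer. In some embodiments, the nucleotide sequence encoding E2a is operably linked to a chicken β-actin promoter and a CMV enhancer. In some embodiments, the chicken β-actin promoter and the CMV enhancer are located upstream of the E2a promoter. In some embodiments, the adenoviral helper plasmid comprises an E2a polyadenylation signal. In some embodiments, the E2a polyadenylation signal is located downstream of the nucleotide sequence encoding E2a. In some embodiments, the E2a polyadenylation signal has a nucleotide sequence at least 80%, 85%, 90%, 95%, 99%, or 100% identical to SEQ ID NO: 27. In some embodiments, the adenoviral helper plasmid comprises an SV40 polyadenylation signal. In some embodiments, the SV40 polyadenylation signal is located downstream of the nucleotide sequence encoding E2a. In some embodiments, the SV40 polyadenylation signal is located downstream of the E2a polyadenylation signal. In some embodiments, the SV40 polyadenylation signal has a sequence at least 80%, 85%, 90%, 95%, 99%, or 100% identical to SEQ ID NO: 28.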
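In some embodiments, the adenoviral helper plasmid comprises a nucleotide sequence encoding UL30 derived from HSV-1. In some embodiments, the nucleotide sequence encoding UL30 has a nucleotide sequence at least 80%, 85%, 90%, 95%, 99%, or 100% identical to SEQ ID NO: 29. In some embodiments, the amino acid sequence of UL30 is at least 80%, 85%, 90%, 95%, 99%, or 100% identical to SEQ ID NO: 30. In some embodiments, the adenoviral helper plasmid comprises a nucleotide sequence encoding UL42 derived from HSV-1. In some embodiments, the nucleotide sequence encoding UL42 has a nucleotide sequence at least 80%, 85%, 90%, 95%, 99%, or 100% identical to SEQ ID NO: 31. In some embodiments, the amino acid sequence of UL42 is at least 80%, 85%, 90%, 95%, 99%, or 100% identical to SEQ ID NO: 32. In some embodiments, the adenoviral helper plasmid comprises a nucleotide sequence encoding UL30 derived from HSV-1 and a nucleotide sequence encoding UL42 derived from HSV-1. In some embodiments, the nucleotide sequence encoding UL30 and the nucleotide sequence encoding UL42 are separated by a P2a cleavage site. In some embodiments, the P2a cleavage site has a nucleotide sequence at least 80%, 85%, 90%, 95%, 99%, or 100% identical to SEQ ID NO: 33. In some embodiments, the P2a cleavage site has an amino acid sequence at least 80%, 85%, 90%, 95%, 99%, or 100% identical to SEQ ID NO: 34. In some embodiments, expression of the UL30 and/or UL42 gene(s) is under the control of an EF-1α promoter. In some embodiments, the nucleotide sequence encoding UL30 is operably linked to a promoter. In some embodiments, the nucleotide sequence encoding UL30 is operably linked to a CMV promoter, a PGK promoter, an SV40 promoter, an EF-1α promoter, a Ubc promoter, a CAG promoter, or a β-actin promoter. In some embodiments, the nucleotide sequence encoding UL30 is operably linked to a transcriptional enhancer. In some embodiments, the transcriptional enhancer is, for example, a CMV enhancer. In some embodiments, the nucleotide sequence encoding UL30 is operably linked to a regulatory intron. In some embodiments, the nucleotide sequence encoding UL42 and/or the nucleotide sequence encoding UL30 is operably linked to an EF-1α promoter. In some embodiments, the EF-1α promoter has a nucleotide sequence at least 80%, 85%, 90%, 95%, 99%, or 100% identical to SEQ ID NO: 35. In some embodiments, expression of UL30 and/or UL42 is under the control of an SV40 promoter. In some embodiments, the nucleotide sequence encoding UL42 and/or the nucleotide sequence encoding UL30 is operably linked to an SV40 promoter. In some embodiments, the SV40 promoter has a nucleotide sequence at least 80%, 85%, 90%, 95%, 99%, or 100% identical to SEQ ID NO: 68.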
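In some embodiments, the adenoviral helper plasmid comprises a polyadenylation signal. In some embodiments, the polyadenylation signal is a β-globin polyadenylation signal, an SV40 polyadenylation signal, or a bovine growth hormone (bGH) polyadenylation signal. In some embodiments, the adenoviral helper plasmid comprises a polyadenylation signal downstream of the nucleotide sequence encoding UL42. In some embodiments, the adenoviral helper plasmid comprises a β-globin polyadenylation signal downstream of the nucleotide sequence encoding UL42. In some embodiments, the β-globin polyadenylation signal has a sequence at least 80%, 85%, 90%, 95%, 99%, or 100% identical to SEQ ID NO: 36. In some embodiments, the adenoviral helper plasmid comprises a bovine growth hormone (bGH) polyadenylation signal downstream of the nucleotide sequence encoding UL42. In some embodiments, the bGH polyadenylation signal has a sequence at least 80%, 85%, 90%, 95%, 99%, or 100% identical to SEQ ID NO: 69.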
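In some embodiments, the adenoviral helper plasmid comprises a nucleotide sequence encoding UL29 derived from HSV-1. In some embodiments, the nucleotide sequence encoding UL29 is at least 80%, 85%, 90%, 95%, 99%, or 100% identical to SEQ ID NO: 37. In some embodiments, the amino acid sequence of UL29 is at least 80%, 85%, 90%, 95%, 99%, or 100% identical to SEQ ID NO: 38. In some embodiments, the nucleotide sequence encoding UL29 is operably linked to a promoter. In some embodiments, the nucleotide sequence encoding UL30 is operably linked to a CMV promoter, a PGK promoter, an SV40 promoter, an EF-1α promoter, a Ubc promoter, a CAG promoter, or a β-actin promoter. In some embodiments, the nucleotide sequence encoding UL29 is operably linked to a transcriptional enhancer. In some embodiments, the transcriptional enhancer is, for example, a CMV enhancer. In some embodiments, the nucleotide sequence encoding UL29 is operably linked to a regulatory intron. In some embodiments, expression of UL29 is under the control of an HSV TK promoter. In some embodiments, the nucleotide sequence encoding UL29 is operably linked to an HSV TK promoter. In some embodiments, the HSV TK promoter has a nucleotide sequence at least 80%, 85%, 90%, 95%, 99%, or 100% identical to SEQ ID NO: 39.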
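In some embodiments, the adenoviral helper plasmid comprises a polyadenylation signal downstream of the nucleotide sequence encoding UL29. In some embodiments, the polyadenylation signal is a β-globin polyadenylation signal, an SV40 polyadenylation signal, or a bovine growth hormone (bGH) polyadenylation signal. In some embodiments, the adenoviral helper plasmid comprises an HSV TK polyadenylation signal downstream of the nucleotide sequence encoding UL29. In some embodiments, the HSV TK polyadenylation signal has a sequence at least 80%, 85%, 90%, 95%, 99%, or 100% identical to SEQ ID NO: 40.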
Exemplary Adenoviral Helper Plasmids
In some embodiments, the adenoviral helper plasmid of the present disclosure has a nucleotide sequence at least 80%, 85%, 90%, 95%, 99%, or 100% identical to SEQ ID NO: 41. In some embodiments, the adenoviral helper plasmid of the present disclosure comprises the following components, having sequences at least 80%, 85%, 90%, 95%, 99%, or 100% identical to the specified sequences: an E4 mini promoter (SEQ ID NO: 1), L4 (hexon assembly) (SEQ ID NO: 3; SEQ ID NO: 4), L4 (33kDa Ex2) (SEQ ID NO: 5; SEQ ID NO: 6), L4 encapsidation protein (22 kDa) (SEQ ID NO: 7; SEQ ID NO: 8), L4 pVIII hexon-associated precursor (SEQ ID NO: 12; SEQ ID NO: 13), VA RNA region A (SEQ ID NO: 14), VA RNAI-A (SEQ ID NO: 16), VA RNAII-A (SEQ ID NO: 18), a partial DNA terminal protein (SEQ ID NO: 20; SEQ ID NO: 21), a 23kDa endoprotease fragment region (SEQ ID NO: 22; SEQ ID NO: 23), and E2a (SEQ ID NO: 24; SEQ ID NO: 25); and does not comprise the following components: a fiber gene, an L1-52/55K (packaging protein 3) gene, and a peripentonal hexon-associated gene.
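In some embodiments, the adenoviral helper plasmid of the present disclosure has a nucleotide sequence at least 80%, 85%, 90%, 95%, 99%, or 100% identical to SEQ ID NO: 42. In some embodiments, the adenoviral helper plasmid of the present disclosure comprises the following components, having sequences at least 80%, 85%, 90%, 95%, 99%, or 100% identical to the specified sequences: an E4 mini promoter (SEQ ID NO: 1), L4 (33kDa Ex2) (SEQ ID NO: 9; SEQ ID NO: 10), VA RNA region A (SEQ ID NO: 14), VA RNAI-A (SEQ ID NO: 16), VA RNAII-A (SEQ ID NO: 18), a partial DNA terminal protein (SEQ ID NO: 20; SEQ ID NO: 21), a 23kDa endoprotease fragment region (SEQ ID NO: 22; SEQ ID NO: 23), and E2a (SEQ ID NO: 24; SEQ ID NO: 25); and does not comprise the following components: a fiber gene, an L1-52/55K (packaging protein 3) gene, a peripentonal hexon-associated gene, a full-length L4 (hexon assembly) gene, an L4 encapsidation protein, and an L4 pVIII hexon-associated precursor.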
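Each exemplary plasmid is described by a list of components it comprises and a list of components it does not comprise. A minimal sketch of how such a description could be checked against a set of feature annotations is shown below. The component names follow the SEQ ID NO: 42 embodiment described above, but the data structure, the function name, and the idea of comparing against a set of annotation labels are assumptions for illustration; real annotation would rely on sequence comparison rather than label matching.

```python
from typing import Set, Tuple

# Components the SEQ ID NO: 42 embodiment is described as comprising.
REQUIRED: Set[str] = {
    "E4 mini promoter",
    "L4 (33kDa Ex2)",
    "VA RNA region A",
    "VA RNAI-A",
    "VA RNAII-A",
    "partial DNA terminal protein",
    "23kDa endoprotease fragment region",
    "E2a",
}

# Components the same embodiment is described as not comprising.
EXCLUDED: Set[str] = {
    "fiber gene",
    "L1-52/55K (packaging protein 3) gene",
    "peripentonal hexon-associated gene",
    "full-length L4 (hexon assembly) gene",
    "L4 encapsidation protein",
    "L4 pVIII hexon-associated precursor",
}


def check_composition(annotated: Set[str]) -> Tuple[Set[str], Set[str]]:
    """Return (missing required components, excluded components found)."""
    missing = REQUIRED - annotated
    forbidden = EXCLUDED & annotated
    return missing, forbidden


# Hypothetical annotation set that still carries a fiber gene.
missing, forbidden = check_composition(REQUIRED | {"fiber gene"})
print(missing, forbidden)
```

The same two-set pattern applies to the other exemplary plasmids by swapping in their respective component lists.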
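In some embodiments, the adenoviral helper plasmid of the present disclosure has a nucleotide sequence at least 80%, 85%, 90%, 95%, 99%, or 100% identical to SEQ ID NO: 43. In some embodiments, the adenoviral helper plasmid of the present disclosure comprises the following components, having sequences at least 80%, 85%, 90%, 95%, 99%, or 100% identical to the specified sequences: an E4 mini promoter (SEQ ID NO: 1), L4 (33kDa Ex2) (SEQ ID NO: 9; SEQ ID NO: 10), VA RNA region B (SEQ ID NO: 15), VA RNAI-B (SEQ ID NO: 17), VA RNAII-B (SEQ ID NO: 19), and E2a (SEQ ID NO: 24; SEQ ID NO: 25); and does not comprise the following components: a fiber gene, an L1-52/55K (packaging protein 3) gene, a peripentonal hexon-associated gene, a full-length L4 (hexon assembly) gene, an L4 encapsidation protein, an L4 pVIII hexon-associated precursor, a DNA terminal protein, and a 23kDa endoprotease fragment region.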
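In some embodiments, the adenoviral helper plasmid of the present disclosure has a nucleotide sequence at least 80%, 85%, 90%, 95%, 99%, or 100% identical to SEQ ID NO: 44. In some embodiments, the adenoviral helper plasmid of the present disclosure comprises the following components, having sequences at least 80%, 85%, 90%, 95%, 99%, or 100% identical to the specified sequences: an E4 mini promoter (SEQ ID NO: 1), L4 (33kDa Ex2) (SEQ ID NO: 9; SEQ ID NO: 10), VA RNA region B (SEQ ID NO: 15), VA RNAI-B (SEQ ID NO: 17), VA RNAII-B (SEQ ID NO: 19), E2a (SEQ ID NO: 24; SEQ ID NO: 25), and an SV40 polyadenylation signal downstream of E2a (SEQ ID NO: 28); and does not comprise the following components: a fiber gene, an L1-52/55K (packaging protein 3) gene, a peripentonal hexon-associated gene, a full-length L4 (hexon assembly) gene, an L4 encapsidation protein, an L4 pVIII hexon-associated precursor, a DNA terminal protein, and a 23kDa endoprotease fragment region.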
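In some embodiments, the adenoviral helper plasmid of the present disclosure has a nucleotide sequence at least 80%, 85%, 90%, 95%, 99%, or 100% identical to SEQ ID NO: 45. In some embodiments, the adenoviral helper plasmid of the present disclosure comprises the following components, having sequences at least 80%, 85%, 90%, 95%, 99%, or 100% identical to the specified sequences: an E4 mini promoter (SEQ ID NO: 1), L4 (33kDa Ex2) (SEQ ID NO: 9; SEQ ID NO: 10), VA RNA region A (SEQ ID NO: 14), VA RNAI-A (SEQ ID NO: 16), VA RNAII-A (SEQ ID NO: 18), a partial DNA terminal protein (SEQ ID NO: 20; SEQ ID NO: 21), a 23kDa endoprotease fragment region (SEQ ID NO: 22; SEQ ID NO: 23), E2a (SEQ ID NO: 24; SEQ ID NO: 25), and an SV40 polyadenylation signal downstream of E4orf6 (SEQ ID NO: 67); and does not comprise the following components: a fiber gene, an L1-52/55K (packaging protein 3) gene, a peripentonal hexon-associated gene, a full-length L4 (hexon assembly) gene, an L4 encapsidation protein, an L4 pVIII hexon-associated precursor, and an SV40 polyadenylation signal downstream of E2a.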
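In some embodiments, the adenoviral helper plasmid of the present disclosure has a nucleotide sequence at least 80%, 85%, 90%, 95%, 99%, or 100% identical to SEQ ID NO: 46. In some embodiments, the adenoviral helper plasmid of the present disclosure comprises the following components, having sequences at least 80%, 85%, 90%, 95%, 99%, or 100% identical to the specified sequences: an E4 mini promoter (SEQ ID NO: 1), L4 (33kDa Ex2) (SEQ ID NO: 9; SEQ ID NO: 10), VA RNA region B (SEQ ID NO: 15), VA RNAI-B (SEQ ID NO: 17), VA RNAII-B (SEQ ID NO: 19), E2a (SEQ ID NO: 24; SEQ ID NO: 25), and an SV40 polyadenylation signal downstream of E4orf6 (SEQ ID NO: 67); and does not comprise the following components: a fiber gene, an L1-52/55K (packaging protein 3) gene, a peripentonal hexon-associated gene, a full-length L4 (hexon assembly) gene, an L4 encapsidation protein, an L4 pVIII hexon-associated precursor, a DNA terminal protein, a 23kDa endoprotease fragment region, and an SV40 polyadenylation signal downstream of E2a.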
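In some embodiments, the adenoviral helper plasmid of the present disclosure has a nucleotide sequence at least 80%, 85%, 90%, 95%, 99%, or 100% identical to SEQ ID NO: 47. In some embodiments, the adenoviral helper plasmid of the present disclosure comprises the following components, having sequences at least 80%, 85%, 90%, 95%, 99%, or 100% identical to the specified sequences: an SV40 promoter upstream of the E4 region (SEQ ID NO: 2), L4 (33kDa Ex2) (SEQ ID NO: 9; SEQ ID NO: 10), VA RNA region A (SEQ ID NO: 14), VA RNAI-A (SEQ ID NO: 16), VA RNAII-A (SEQ ID NO: 18), a partial DNA terminal protein (SEQ ID NO: 20; SEQ ID NO: 21), a 23kDa endoprotease fragment region (SEQ ID NO: 22; SEQ ID NO: 23), E2a (SEQ ID NO: 24; SEQ ID NO: 25), and an SV40 polyadenylation signal downstream of E4orf6 (SEQ ID NO: 67); and does not comprise the following components: a fiber gene, an L1-52/55K (packaging protein 3) gene, a peripentonal hexon-associated gene, a full-length L4 (hexon assembly) gene, an L4 encapsidation protein, an L4 pVIII hexon-associated precursor, an SV40 polyadenylation signal downstream of E2a, and an E4 mini promoter upstream of the E4 region.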
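In some embodiments, the adenoviral helper plasmid of the present disclosure has a nucleotide sequence at least 80%, 85%, 90%, 95%, 99%, or 100% identical to SEQ ID NO: 48. In some embodiments, the adenoviral helper plasmid of the present disclosure comprises the following components, having sequences at least 80%, 85%, 90%, 95%, 99%, or 100% identical to the specified sequences: an SV40 promoter upstream of the E4 region (SEQ ID NO: 2), L4 (33kDa Ex2) (SEQ ID NO: 9; SEQ ID NO: 10), VA RNA region B (SEQ ID NO: 15), VA RNAI-B (SEQ ID NO: 17), VA RNAII-B (SEQ ID NO: 19), E2a (SEQ ID NO: 24; SEQ ID NO: 25), and an SV40 polyadenylation signal downstream of E4orf6 (SEQ ID NO: 67); and does not comprise the following components: a fiber gene, an L1-52/55K (packaging protein 3) gene, a peripentonal hexon-associated gene, a full-length L4 (hexon assembly) gene, an L4 encapsidation protein, an L4 pVIII hexon-associated precursor, a DNA terminal protein, a 23kDa endoprotease fragment region, an SV40 polyadenylation signal downstream of E2a, and an E4 mini promoter upstream of the E4 region.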
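In some embodiments, the adenoviral helper plasmid of the present disclosure has a nucleotide sequence at least 80%, 85%, 90%, 95%, 99%, or 100% identical to SEQ ID NO: 49. In some embodiments, the adenoviral helper plasmid of the present disclosure comprises the following components, having sequences at least 80%, 85%, 90%, 95%, 99%, or 100% identical to the specified sequences: an E4 mini promoter (SEQ ID NO: 1), L4 (33kDa Ex2) (SEQ ID NO: 9; SEQ ID NO: 10), VA RNA region A (SEQ ID NO: 14), VA RNAI-A (SEQ ID NO: 16), VA RNAII-A (SEQ ID NO: 18), a partial DNA terminal protein (SEQ ID NO: 20; SEQ ID NO: 21), a 23kDa endoprotease fragment region (SEQ ID NO: 22; SEQ ID NO: 23), E2a (SEQ ID NO: 24; SEQ ID NO: 25), and a chicken β-actin promoter upstream of E2a; and does not comprise the following components: a fiber gene, an L1-52/55K (packaging protein 3) gene, a peripentonal hexon-associated gene, a full-length L4 (hexon assembly) gene, an L4 encapsidation protein, and an L4 pVIII hexon-associated precursor.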
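In some embodiments, the adenoviral helper plasmid of the present disclosure has a nucleotide sequence at least 80%, 85%, 90%, 95%, 99%, or 100% identical to SEQ ID NO: 50. In some embodiments, the adenoviral helper plasmid of the present disclosure comprises the following components, having sequences at least 80%, 85%, 90%, 95%, 99%, or 100% identical to the specified sequences: an E4 mini promoter (SEQ ID NO: 1), L4 (33kDa Ex2) (SEQ ID NO: 9; SEQ ID NO: 10), VA RNA region B (SEQ ID NO: 15), VA RNAI-B (SEQ ID NO: 17), VA RNAII-B (SEQ ID NO: 19), E2a (SEQ ID NO: 24; SEQ ID NO: 25), and a chicken β-actin promoter upstream of E2a; and does not comprise the following components: a fiber gene, an L1-52/55K (packaging protein 3) gene, a peripentonal hexon-associated gene, a full-length L4 (hexon assembly) gene, an L4 encapsidation protein, an L4 pVIII hexon-associated precursor, a DNA terminal protein, and a 23kDa endoprotease fragment region.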
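In some embodiments, the adenoviral helper plasmid of the present disclosure has a nucleotide sequence at least 80%, 85%, 90%, 95%, 99%, or 100% identical to SEQ ID NO: 51. In some embodiments, the adenoviral helper plasmid of the present disclosure comprises the following components, having sequences at least 80%, 85%, 90%, 95%, 99%, or 100% identical to the specified sequences: an E4 mini promoter (SEQ ID NO: 1), L4 (33kDa Ex2) (SEQ ID NO: 9; SEQ ID NO: 10), VA RNA region B (SEQ ID NO: 15), VA RNAI-B (SEQ ID NO: 17), VA RNAII-B (SEQ ID NO: 19), E2a (SEQ ID NO: 24; SEQ ID NO: 25), an SV40 polyadenylation signal downstream of E2a (SEQ ID NO: 28), and a chicken β-actin promoter upstream of E2a; and does not comprise the following components: a fiber gene, an L1-52/55K (packaging protein 3) gene, a peripentonal hexon-associated gene, a full-length L4 (hexon assembly) gene, an L4 encapsidation protein, an L4 pVIII hexon-associated precursor, a DNA terminal protein, and a 23kDa endoprotease fragment region.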
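In some embodiments, the adenoviral helper plasmid of the present disclosure has a nucleotide sequence at least 80%, 85%, 90%, 95%, 99%, or 100% identical to SEQ ID NO: 52. In some embodiments, the adenoviral helper plasmid of the present disclosure comprises the following components, having sequences at least 80%, 85%, 90%, 95%, 99%, or 100% identical to the specified sequences: an E4 mini promoter (SEQ ID NO: 1), L4 (33kDa Ex2) (SEQ ID NO: 9; SEQ ID NO: 10), VA RNA region A (SEQ ID NO: 14), VA RNAI-A (SEQ ID NO: 16), VA RNAII-A (SEQ ID NO: 18), a partial DNA terminal protein (SEQ ID NO: 20; SEQ ID NO: 21), a 23kDa endoprotease fragment region (SEQ ID NO: 22; SEQ ID NO: 23), E2a (SEQ ID NO: 24; SEQ ID NO: 25), an SV40 polyadenylation signal downstream of E4orf6 (SEQ ID NO: 67), and a chicken β-actin promoter upstream of E2a; and does not comprise the following components: a fiber gene, an L1-52/55K (packaging protein 3) gene, a peripentonal hexon-associated gene, a full-length L4 (hexon assembly) gene, an L4 encapsidation protein, an L4 pVIII hexon-associated precursor, and an SV40 polyadenylation signal downstream of E2a.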
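In some embodiments, the adenoviral helper plasmid of the present disclosure has a nucleotide sequence at least 80%, 85%, 90%, 95%, 99%, or 100% identical to SEQ ID NO: 53. In some embodiments, the adenoviral helper plasmid of the present disclosure comprises the following components, having sequences at least 80%, 85%, 90%, 95%, 99%, or 100% identical to the specified sequences: an E4 mini promoter (SEQ ID NO: 1), L4 (33kDa Ex2) (SEQ ID NO: 9; SEQ ID NO: 10), VA RNA region B (SEQ ID NO: 15), VA RNAI-B (SEQ ID NO: 17), VA RNAII-B (SEQ ID NO: 19), E2a (SEQ ID NO: 24; SEQ ID NO: 25), an SV40 polyadenylation signal downstream of E4orf6 (SEQ ID NO: 67), and a chicken β-actin promoter upstream of E2a; and does not comprise the following components: a fiber gene, an L1-52/55K (packaging protein 3) gene, a peripentonal hexon-associated gene, a full-length L4 (hexon assembly) gene, an L4 encapsidation protein, an L4 pVIII hexon-associated precursor, a DNA terminal protein, a 23kDa endoprotease fragment region, and an SV40 polyadenylation signal downstream of E2a.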
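In some embodiments, the adenoviral helper plasmid of the present disclosure has a nucleotide sequence at least 80%, 85%, 90%, 95%, 99%, or 100% identical to SEQ ID NO: 54. In some embodiments, the adenoviral helper plasmid of the present disclosure comprises the following components, having sequences at least 80%, 85%, 90%, 95%, 99%, or 100% identical to the specified sequences: an SV40 promoter upstream of the E4 region (SEQ ID NO: 2), L4 (33kDa Ex2) (SEQ ID NO: 9; SEQ ID NO: 10), VA RNA region A (SEQ ID NO: 14), VA RNAI-A (SEQ ID NO: 16), VA RNAII-A (SEQ ID NO: 18), a partial DNA terminal protein (SEQ ID NO: 20; SEQ ID NO: 21), a 23kDa endoprotease fragment region (SEQ ID NO: 22; SEQ ID NO: 23), E2a (SEQ ID NO: 24; SEQ ID NO: 25), an SV40 polyadenylation signal downstream of E4orf6 (SEQ ID NO: 67), and a chicken β-actin promoter upstream of E2a; and does not comprise the following components: a fiber gene, an L1-52/55K (packaging protein 3) gene, a peripentonal hexon-associated gene, a full-length L4 (hexon assembly) gene, an L4 encapsidation protein, an L4 pVIII hexon-associated precursor, an SV40 polyadenylation signal downstream of E2a, and an E4 mini promoter upstream of the E4 region.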
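In some embodiments, the adenoviral helper plasmid of the present disclosure has a nucleotide sequence at least 80%, 85%, 90%, 95%, 99%, or 100% identical to SEQ ID NO: 55. In some embodiments, the adenoviral helper plasmid of the present disclosure comprises the following components, having sequences at least 80%, 85%, 90%, 95%, 99%, or 100% identical to the specified sequences: an SV40 promoter upstream of the E4 region (SEQ ID NO: 2), L4 (33kDa Ex2) (SEQ ID NO: 9; SEQ ID NO: 10), VA RNA region B (SEQ ID NO: 15), VA RNAI-B (SEQ ID NO: 17), VA RNAII-B (SEQ ID NO: 19), E2a (SEQ ID NO: 24; SEQ ID NO: 25), an SV40 polyadenylation signal downstream of E2a (SEQ ID NO: 28), an SV40 polyadenylation signal downstream of E4orf6 (SEQ ID NO: 67), and a chicken β-actin promoter upstream of E2a; and does not comprise the following components: a fiber gene, an L1-52/55K (packaging protein 3) gene, a peripentonal hexon-associated gene, a full-length L4 (hexon assembly) gene, an L4 encapsidation protein, an L4 pVIII hexon-associated precursor, a DNA terminal protein, a 23kDa endoprotease fragment region, and an E4 mini promoter upstream of the E4 region.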
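In some embodiments, the adenoviral helper plasmid of the present disclosure has a nucleotide sequence at least 80%, 85%, 90%, 95%, 99%, or 100% identical to SEQ ID NO: 56. In some embodiments, the adenoviral helper plasmid of the present disclosure comprises the following components, having sequences at least 80%, 85%, 90%, 95%, 99%, or 100% identical to the specified sequences: an SV40 promoter upstream of the E4 region (SEQ ID NO: 2), L4 (33kDa Ex2) (SEQ ID NO: 9; SEQ ID NO: 10), VA RNA region B (SEQ ID NO: 15), VA RNAI-B (SEQ ID NO: 17), VA RNAII-B (SEQ ID NO: 19), E2a (SEQ ID NO: 24; SEQ ID NO: 25), an SV40 polyadenylation signal downstream of E2a (SEQ ID NO: 28), an SV40 polyadenylation signal downstream of E4orf6 (SEQ ID NO: 67), and a chicken β-actin promoter upstream of E2a; and does not comprise the following components: a fiber gene, an L1-52/55K (packaging protein 3) gene, a peripentonal hexon-associated gene, a full-length L4 (hexon assembly) gene, an L4 encapsidation protein, an L4 pVIII hexon-associated precursor, a DNA terminal protein, a 23kDa endoprotease fragment region, an E4 mini promoter upstream of the E4 region, a gene encoding E4orf1, a gene encoding E4orf2, and a gene encoding E4orf3.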
In some embodiments, the adenoviral helper plasmid of the present disclosure has a nucleotide sequence at least 80%, 85%, 90%, 95%, 99%, or 100% identical to SEQ ID NO: 57. In some embodiments, the adenoviral helper plasmid of the present disclosure comprises the following components, having sequences at least 80%, 85%, 90%, 95%, 99%, or 100% identical to the specified sequences: an E4 mini promoter (SEQ ID NO: 1), L4 (33kDa Ex2) (SEQ ID NO: 9; SEQ ID NO: 10), VA RNA region A (SEQ ID NO: 14), VA RNAI-A (SEQ ID NO: 16), VA RNAII-A (SEQ ID NO: 18), a partial DNA terminal protein (SEQ ID NO: 20; SEQ ID NO: 21), a 23kDa endoprotease fragment region (SEQ ID NO: 22; SEQ ID NO: 23), E2a (SEQ ID NO: 24; SEQ ID NO: 25), a chicken β-actin promoter upstream of E2a, an HSV-1-derived UL30 gene (SEQ ID NO: 29; SEQ ID NO: 30), an HSV-1-derived UL42 gene (SEQ ID NO: 31; SEQ ID NO: 32), an EF-1α promoter upstream of UL30 (SEQ ID NO: 35), and a β-globin polyadenylation signal downstream of UL42 (SEQ ID NO: 36); and does not comprise the following components: a fiber gene, an L1-52/55K (packaging protein 3) gene, a peripentonal hexon-associated gene, a full-length L4 (hexon assembly) gene, an L4 encapsidation protein, and an L4 pVIII hexon-associated precursor.
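In some embodiments, the adenoviral helper plasmid of the present disclosure has a nucleotide sequence at least 80%, 85%, 90%, 95%, 99%, or 100% identical to SEQ ID NO: 58. In some embodiments, the adenoviral helper plasmid of the present disclosure comprises the following components, having sequences at least 80%, 85%, 90%, 95%, 99%, or 100% identical to the specified sequences: an E4 mini promoter (SEQ ID NO: 1), L4 (33kDa Ex2) (SEQ ID NO: 9; SEQ ID NO: 10), VA RNA region A (SEQ ID NO: 14), VA RNAI-A (SEQ ID NO: 16), VA RNAII-A (SEQ ID NO: 18), a partial DNA terminal protein (SEQ ID NO: 20; SEQ ID NO: 21), a 23kDa endoprotease fragment region (SEQ ID NO: 22; SEQ ID NO: 23), E2a (SEQ ID NO: 24; SEQ ID NO: 25), a chicken β-actin promoter upstream of E2a, an HSV-1-derived UL30 gene (SEQ ID NO: 29; SEQ ID NO: 30), an HSV-1-derived UL42 gene (SEQ ID NO: 31; SEQ ID NO: 32), an SV40 promoter upstream of UL30 (SEQ ID NO: 68), and a bovine growth hormone (bGH) polyadenylation signal downstream of UL42 (SEQ ID NO: 69); and does not comprise the following components: a fiber gene, an L1-52/55K (packaging protein 3) gene, a peripentonal hexon-associated gene, a full-length L4 (hexon assembly) gene, an L4 encapsidation protein, and an L4 pVIII hexon-associated precursor.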
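In some embodiments, the adenoviral helper plasmid of the present disclosure has a nucleotide sequence at least 80%, 85%, 90%, 95%, 99%, or 100% identical to SEQ ID NO: 59. In some embodiments, the adenoviral helper plasmid of the present disclosure comprises the following components, having sequences at least 80%, 85%, 90%, 95%, 99%, or 100% identical to the specified sequences: an E4 mini promoter (SEQ ID NO: 1), L4 (33kDa Ex2) (SEQ ID NO: 9; SEQ ID NO: 10), VA RNA region B (SEQ ID NO: 15), VA RNAI-B (SEQ ID NO: 17), VA RNAII-B (SEQ ID NO: 19), E2a (SEQ ID NO: 24; SEQ ID NO: 25), a chicken β-actin promoter upstream of E2a, an HSV-1-derived UL30 gene (SEQ ID NO: 29; SEQ ID NO: 30), an HSV-1-derived UL42 gene (SEQ ID NO: 31; SEQ ID NO: 32), an SV40 promoter upstream of UL30 (SEQ ID NO: 68), and a bovine growth hormone (bGH) polyadenylation signal downstream of UL42 (SEQ ID NO: 69); and does not comprise the following components: a fiber gene, an L1-52/55K (packaging protein 3) gene, a peripentonal hexon-associated gene, a full-length L4 (hexon assembly) gene, an L4 encapsidation protein, an L4 pVIII hexon-associated precursor, a DNA terminal protein, and a 23kDa endoprotease fragment region.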
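In some embodiments, the adenoviral helper plasmid of the present disclosure has a nucleotide sequence at least 80%, 85%, 90%, 95%, 99%, or 100% identical to SEQ ID NO: 60. In some embodiments, the adenoviral helper plasmid of the present disclosure comprises the following components, having sequences at least 80%, 85%, 90%, 95%, 99%, or 100% identical to the specified sequences: an E4 mini promoter (SEQ ID NO: 1), L4 (33kDa Ex2) (SEQ ID NO: 9; SEQ ID NO: 10), VA RNA region B (SEQ ID NO: 15), VA RNAI-B (SEQ ID NO: 17), VA RNAII-B (SEQ ID NO: 19), E2a (SEQ ID NO: 24; SEQ ID NO: 25), an SV40 polyadenylation signal downstream of E2a (SEQ ID NO: 28), a chicken β-actin promoter upstream of E2a, an HSV-1-derived UL30 gene (SEQ ID NO: 29; SEQ ID NO: 30), an HSV-1-derived UL42 gene (SEQ ID NO: 31; SEQ ID NO: 32), an SV40 promoter upstream of UL30 (SEQ ID NO: 68), and a bovine growth hormone (bGH) polyadenylation signal downstream of UL42 (SEQ ID NO: 69); and does not comprise the following components: a fiber gene, an L1-52/55K (packaging protein 3) gene, a peripentonal hexon-associated gene, a full-length L4 (hexon assembly) gene, an L4 encapsidation protein, an L4 pVIII hexon-associated precursor, a DNA terminal protein, and a 23kDa endoprotease fragment region.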
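In some embodiments, the adenoviral helper plasmid of the present disclosure has a nucleotide sequence at least 80%, 85%, 90%, 95%, 99%, or 100% identical to SEQ ID NO: 61. In some embodiments, the adenoviral helper plasmid of the present disclosure comprises the following components, having sequences at least 80%, 85%, 90%, 95%, 99%, or 100% identical to the specified sequences: an E4 mini promoter (SEQ ID NO: 1), L4 (33kDa Ex2) (SEQ ID NO: 9; SEQ ID NO: 10), VA RNA region A (SEQ ID NO: 14), VA RNAI-A (SEQ ID NO: 16), VA RNAII-A (SEQ ID NO: 18), a partial DNA terminal protein (SEQ ID NO: 20; SEQ ID NO: 21), a 23kDa endoprotease fragment region (SEQ ID NO: 22; SEQ ID NO: 23), E2a (SEQ ID NO: 24; SEQ ID NO: 25), an SV40 polyadenylation signal downstream of E4orf6 (SEQ ID NO: 67), a chicken β-actin promoter upstream of E2a, an HSV-1-derived UL30 gene (SEQ ID NO: 29; SEQ ID NO: 30), an HSV-1-derived UL42 gene (SEQ ID NO: 31; SEQ ID NO: 32), an SV40 promoter upstream of UL30 (SEQ ID NO: 68), and a bovine growth hormone (bGH) polyadenylation signal downstream of UL42 (SEQ ID NO: 69); and does not comprise the following components: a fiber gene, an L1-52/55K (packaging protein 3) gene, a peripentonal hexon-associated gene, a full-length L4 (hexon assembly) gene, an L4 encapsidation protein, an L4 pVIII hexon-associated precursor, and an SV40 polyadenylation signal downstream of E2a.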
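In some embodiments, the adenoviral helper plasmid of the present disclosure has a nucleotide sequence at least 80%, 85%, 90%, 95%, 99%, or 100% identical to SEQ ID NO: 62. In some embodiments, the adenoviral helper plasmid of the present disclosure comprises the following components, having sequences at least 80%, 85%, 90%, 95%, 99%, or 100% identical to the specified sequences: an E4 mini promoter (SEQ ID NO: 1), L4 (33kDa Ex2) (SEQ ID NO: 9; SEQ ID NO: 10), VA RNA region B (SEQ ID NO: 15), VA RNAI-B (SEQ ID NO: 17), VA RNAII-B (SEQ ID NO: 19), E2a (SEQ ID NO: 24; SEQ ID NO: 25), an SV40 polyadenylation signal downstream of E4orf6 (SEQ ID NO: 67), a chicken β-actin promoter upstream of E2a, an HSV-1-derived UL30 gene (SEQ ID NO: 29; SEQ ID NO: 30), an HSV-1-derived UL42 gene (SEQ ID NO: 31; SEQ ID NO: 32), an SV40 promoter upstream of UL30 (SEQ ID NO: 68), and a bovine growth hormone (bGH) polyadenylation signal downstream of UL42 (SEQ ID NO: 69); and does not comprise the following components: a fiber gene, an L1-52/55K (packaging protein 3) gene, a peripentonal hexon-associated gene, a full-length L4 (hexon assembly) gene, an L4 encapsidation protein, an L4 pVIII hexon-associated precursor, a DNA terminal protein, a 23kDa endoprotease fragment region, and an SV40 polyadenylation signal downstream of E2a.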
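In some embodiments, an adenoviral helper plasmid of the present disclosure has a nucleotide sequence at least 80%, 85%, 90%, 95%, 99%, or 100% identical to SEQ ID NO: 63. In some embodiments, an adenoviral helper plasmid of the present disclosure comprises the following components, each having a nucleotide sequence at least 80%, 85%, 90%, 95%, 99%, or 100% identical to the specified sequence: an SV40 promoter upstream of the E4 region (SEQ ID NO: 2), L4 (33kDa Ex2) (SEQ ID NO: 9; SEQ ID NO: 10), VA RNA region A (SEQ ID NO: 14), VA RNAI-A (SEQ ID NO: 16), VA RNAII-A (SEQ ID NO: 18), a partial DNA terminal protein (SEQ ID NO: 20; SEQ ID NO: 21), a 23kDa endoprotease fragment region (SEQ ID NO: 22; SEQ ID NO: 23), E2a (SEQ ID NO: 24; SEQ ID NO: 25), an SV40 polyadenylation signal downstream of E4orf6 (SEQ ID NO: 67), a chicken β-actin promoter upstream of E2a, an HSV-1-derived UL30 gene (SEQ ID NO: 29; SEQ ID NO: 30), an HSV-1-derived UL42 gene (SEQ ID NO: 31; SEQ ID NO: 32), an SV40 promoter upstream of UL30 (SEQ ID NO: 68), and a bovine growth hormone (bGH) polyadenylation signal downstream of UL42 (SEQ ID NO: 69); and does not comprise the following components: a fiber gene, an L1-52/55K (packaging protein 3) gene, a peripentonal hexon-associated gene, a full-length L4 (hexon assembly) gene, an L4 encapsidation protein, an L4 pVIII hexon-associated precursor, an SV40 polyadenylation signal downstream of E2a, and an E4 mini promoter upstream of the E4 region.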
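In some embodiments, an adenoviral helper plasmid of the present disclosure has a nucleotide sequence at least 80%, 85%, 90%, 95%, 99%, or 100% identical to SEQ ID NO: 64. In some embodiments, an adenoviral helper plasmid of the present disclosure comprises the following components, each having a nucleotide sequence at least 80%, 85%, 90%, 95%, 99%, or 100% identical to the specified sequence: an SV40 promoter upstream of the E4 region (SEQ ID NO: 2), L4 (33kDa Ex2) (SEQ ID NO: 9; SEQ ID NO: 10), VA RNA region B (SEQ ID NO: 15), VA RNAI-B (SEQ ID NO: 17), VA RNAII-B (SEQ ID NO: 19), E2a (SEQ ID NO: 24; SEQ ID NO: 25), an SV40 polyadenylation signal downstream of E2a (SEQ ID NO: 28), an SV40 polyadenylation signal downstream of E4orf6 (SEQ ID NO: 67), a chicken β-actin promoter upstream of E2a, an HSV-1-derived UL30 gene (SEQ ID NO: 29; SEQ ID NO: 30), an HSV-1-derived UL42 gene (SEQ ID NO: 31; SEQ ID NO: 32), an SV40 promoter upstream of UL30 (SEQ ID NO: 68), and a bovine growth hormone (bGH) polyadenylation signal downstream of UL42 (SEQ ID NO: 69); and does not comprise the following components: a fiber gene, an L1-52/55K (packaging protein 3) gene, a peripentonal hexon-associated gene, a full-length L4 (hexon assembly) gene, an L4 encapsidation protein, an L4 pVIII hexon-associated precursor, a DNA terminal protein, a 23kDa endoprotease fragment region, and an E4 mini promoter upstream of the E4 region.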
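In some embodiments, an adenoviral helper plasmid of the present disclosure has a nucleotide sequence at least 80%, 85%, 90%, 95%, 99%, or 100% identical to SEQ ID NO: 65. In some embodiments, an adenoviral helper plasmid of the present disclosure comprises the following components, each having a nucleotide sequence at least 80%, 85%, 90%, 95%, 99%, or 100% identical to the specified sequence: an E4 mini promoter (SEQ ID NO: 1), L4 (33kDa Ex2) (SEQ ID NO: 9; SEQ ID NO: 10), VA RNA region A (SEQ ID NO: 14), VA RNAI-A (SEQ ID NO: 16), VA RNAII-A (SEQ ID NO: 18), a partial DNA terminal protein (SEQ ID NO: 20; SEQ ID NO: 21), a 23kDa endoprotease fragment region (SEQ ID NO: 22; SEQ ID NO: 23), E2a (SEQ ID NO: 24; SEQ ID NO: 25), a chicken β-actin promoter upstream of E2a, an HSV-1-derived UL29 gene (SEQ ID NO: 37; SEQ ID NO: 38), an HSV TK promoter upstream of UL29 (SEQ ID NO: 39), and an HSV TK polyadenylation signal downstream of UL29 (SEQ ID NO: 40); and does not comprise the following components: a fiber gene, an L1-52/55K (packaging protein 3) gene, a peripentonal hexon-associated gene, a full-length L4 (hexon assembly) gene, an L4 encapsidation protein, and an L4 pVIII hexon-associated precursor.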
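In some embodiments, an adenoviral helper plasmid of the present disclosure has a nucleotide sequence at least 80%, 85%, 90%, 95%, 99%, or 100% identical to SEQ ID NO: 66. In some embodiments, an adenoviral helper plasmid of the present disclosure comprises the following components, each having a nucleotide sequence at least 80%, 85%, 90%, 95%, 99%, or 100% identical to the specified sequence: an SV40 promoter upstream of the E4 region (SEQ ID NO: 2), VA RNA region B (SEQ ID NO: 15), VA RNAI-B (SEQ ID NO: 17), VA RNAII-B (SEQ ID NO: 19), E2a (SEQ ID NO: 24; SEQ ID NO: 25), an SV40 polyadenylation signal downstream of E2a (SEQ ID NO: 28), an SV40 polyadenylation signal downstream of E4orf6 (SEQ ID NO: 67), and a chicken β-actin promoter upstream of E2a; and does not comprise the following components: a fiber gene, an L1-52/55K (packaging protein 3) gene, a peripentonal hexon-associated gene, a full-length L4 (hexon assembly) gene, an L4 encapsidation protein, an L4 pVIII hexon-associated precursor, L4 (33kDa Ex2), a DNA terminal protein, a 23kDa endoprotease fragment region, an E4 mini promoter upstream of the E4 region, a gene encoding E4orf1, a gene encoding E4orf2, and a gene encoding E4orf3.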
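The identity thresholds recited above (80%, 85%, 90%, 95%, 99%, 100%) can be illustrated with a minimal Python sketch. It assumes the two sequences have already been aligned by an external tool, and it uses one common convention (matching columns divided by alignment columns); this passage does not state which identity definition or alignment program applies, so the function names and the convention are illustrative assumptions only.

```python
# Hypothetical helper names; the identity convention used here
# (matches / alignment columns) is an assumption, not the patent's definition.
THRESHOLDS = (80, 85, 90, 95, 99, 100)


def percent_identity(aligned_a: str, aligned_b: str) -> float:
    """Percent identity over an existing pairwise alignment ('-' marks a gap)."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    # Ignore columns in which both rows carry a gap.
    columns = [
        (a, b)
        for a, b in zip(aligned_a.upper(), aligned_b.upper())
        if not (a == "-" and b == "-")
    ]
    if not columns:
        raise ValueError("empty alignment")
    # A column counts as a match only if both residues are equal and not a gap.
    matches = sum(1 for a, b in columns if a == b and a != "-")
    return 100.0 * matches / len(columns)


def thresholds_met(identity: float) -> list:
    """Return every recited threshold that the identity value satisfies."""
    return [t for t in THRESHOLDS if identity >= t]


if __name__ == "__main__":
    identity = percent_identity("ACGTACGTAC", "ACGTACGTAA")
    print(identity, thresholds_met(identity))
```

A candidate plasmid sequence aligned against, for example, SEQ ID NO: 60 would be passed as the two aligned strings; the toy strings in the usage line are placeholders, not sequences from the listing.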
Methods of Production
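In some embodiments, the adenoviral helper plasmids of the present disclosure are useful in methods of producing rAAV. In some embodiments, rAAV is made by transfection of producer cells. In some embodiments, the producer cell is a mammalian cell. In some embodiments, the producer cell is a transformed mammalian cell. In some embodiments, the producer cell is a Vero, HeLa, HEK293, or HEK293T cell, or a derivative thereof.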
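In some embodiments, a method of producing rAAV comprises transfecting producer cells with an AAV vector plasmid, a plasmid expressing AAV Rep-Cap, and an adenoviral helper plasmid. In some embodiments, the AAV vector plasmid comprises AAV inverted terminal repeats (ITRs) and a transgene of interest. In some embodiments, the adenoviral helper plasmid is any adenoviral helper plasmid described herein.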
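In some embodiments, a method of producing rAAV comprises transfection of producer cells that stably express Rep-Cap. In some embodiments, a method of producing rAAV comprises transfecting producer cells that stably express Rep-Cap with an AAV vector plasmid and an adenoviral helper plasmid. In some embodiments, the AAV vector plasmid comprises AAV inverted terminal repeats (ITRs) and a transgene of interest. In some embodiments, the adenoviral helper plasmid is any adenoviral helper plasmid described herein.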
Examples
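The primary objective of the work described herein is to develop novel adenoviral helper plasmids for rAAV production that are smaller in size, contain fewer unnecessary adenoviral genes, and function as well as or better than the most commonly used adenoviral helper plasmids.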
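The plasmids provided herein were synthesized de novo, sequence-verified, and scaled up for use in large-scale rAAV manufacturing. rAAV production studies were performed to compare vector yields obtained with the provided plasmids against those obtained with other commercially available adenoviral helper plasmids. Vector quality and activity were also assessed for rAAV produced with the various adenoviral helper plasmids, to confirm that rAAV produced with the provided plasmids is at least equivalent in quality, if not superior. Taken together, the following examples show that the provided adenoviral helper plasmids produce high yields of high-quality rAAV with a potentially safer and more cost-effective design.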
Example 1: Exemplary methods for producing rAAVs using the adenoviral helper plasmids described herein
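HEK293 cells were transfected with a control adenoviral helper plasmid (e.g., a commercially available plasmid such as pALD-X80) or with an adenoviral helper plasmid described herein. The adenoviral helper plasmid was co-transfected with the pAAVrep2cap9 and pAAV-CMV-GFP plasmids using PEI transfection to make AAV9/ssCMV-GFP. On day 4 post-transfection, the HEK293 cells were harvested by lysis with 0.5% Triton X-100 and addition of nuclease (to remove RNA, cellular genomic DNA, and residual plasmid DNA). Three hours after the lysis/nuclease treatment, the cell lysate was sampled and submitted for qPCR titer analysis. Samples were treated with another nuclease, followed by EDTA and heat treatment, and qPCR was then performed on diluted samples to determine the number of vector genome copies per sample. As a measure of transfection efficiency, GFP-positive cells were quantified using fluorescence microscopy.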
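The last qPCR step above (converting a measurement on a diluted sample into vector genome copies) can be sketched as follows. The sketch assumes absolute quantification against a linear standard curve of the form Ct = slope × log10(copies) + intercept; the function names, the slope, the intercept, the dilution factor, and the template volume are all hypothetical and are not taken from this document.

```python
# Hypothetical sketch of absolute qPCR quantification. None of the numeric
# values below come from the patent; they only illustrate the arithmetic.


def copies_from_ct(ct: float, slope: float, intercept: float) -> float:
    """Invert a linear standard curve Ct = slope * log10(copies) + intercept."""
    return 10 ** ((ct - intercept) / slope)


def titer_vg_per_ml(
    ct: float,
    slope: float,
    intercept: float,
    dilution_factor: float,
    template_volume_ul: float,
) -> float:
    """Back-calculate vector genomes per mL of the undiluted sample."""
    copies_in_well = copies_from_ct(ct, slope, intercept)
    # Copies per microliter of undiluted lysate, then scaled to one milliliter.
    copies_per_ul = copies_in_well * dilution_factor / template_volume_ul
    return copies_per_ul * 1000.0


if __name__ == "__main__":
    # Assumed curve: slope -3.5, intercept 37.5; sample Ct 20.0,
    # 1:1000 dilution, 5 uL of template per reaction.
    print(titer_vg_per_ml(20.0, -3.5, 37.5, 1000.0, 5.0))
```

The sketch does not correct for single-stranded genomes measured against a double-stranded standard, nor for losses during the nuclease, EDTA, and heat steps; any such correction would depend on the assay actually used.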
Example 2: Adenoviral helper plasmid lacking the fiber, L1-52/55K, and peripentonal hexon-associated genes and having a partial L4 hexon-associated precursor
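To reduce the size of the adenoviral helper plasmid, an adenoviral helper plasmid (pEMBR-1.2: SEQ ID NO: 41) was designed that lacks the fiber gene, the L1-52/55K (packaging protein 3) gene, and most of the hexon-associated precursor, as well as the peripentonal hexon-associated protein. These deletions were made relative to a commercially available helper plasmid such as pXX6-80. The adenoviral helper genes were synthesized and assembled into a kanamycin-resistance plasmid backbone. The resulting plasmid is approximately 6.7 kb smaller than pXX6-80.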
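The adenoviral helper plasmid described above was capable of producing AAV in HEK293 cells. As measured by qPCR, no major difference in AAV vector yield was observed between cells transfected with pALD-X80 and cells transfected with pEMBR-1.2 (see FIG. 2). rAAV vector produced with pEMBR-1.2 was normal vector with the correct ratio of VP proteins when vector capsid purity was assessed by SDS-PAGE (see FIG. 3), and the packaged transgene was of the correct size when vector transgene purity was assessed by alkaline gel electrophoresis (see FIG. 3). Moreover, pEMBR-1.2 was capable of producing fully functional vector able to transduce cells. No difference was observed in HEK293 cells between AAVRH.10/ssCMV-GFP produced with pALD-X80 and that produced with pEMBR-1.2 (see FIG. 4).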
Example 3: Adenoviral helper plasmid with the fiber gene and most of the L4 (hexon assembly) genes deleted
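To further reduce the size of the adenoviral helper plasmid, an adenoviral helper plasmid was designed (pEMBR-1.3: SEQ ID NO: 42; see FIG. 5) with deletion of the fiber gene, the L1-52/55K (packaging protein 3) gene, and most of the hexon-associated precursor, as well as the peripentonal hexon-associated protein (as in pEMBR-1.2; see Example 2), and with additional deletion of the complete L4 (hexon assembly) region. A smaller fragment of the L4 region, containing the E2A promoter or the partial L4 (33kDa Ex2; SEQ ID NO: 9), is retained.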
To further optimize pEMBR-1.3, the VA RNA region of pEMBR-1.3 was replaced with a VA RNA region derived from AAV-2 (VA RNA-B: SEQ ID NO: 15). This version is designated pEMBR-1.3B (SEQ ID NO: 43; see FIG. 5). In this version, AAV-2 VA RNA I (SEQ ID NO: 17) and VA RNA II (SEQ ID NO: 19) sequences with flanking StuI and BsrGI sites were synthesized (without flanking DNA terminal protein or endoprotease gene sequences), and this insert was cloned into pEMBR-1.3.
Example 4: Adenoviral helper plasmid with the fiber gene and L4 (hexon assembly) genes deleted and containing a chicken β-actin promoter to drive E2a expression
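To enhance the viral productivity of the pEMBR-1.3 plasmid, an adenoviral helper plasmid was designed (pEMBR-1.4: SEQ ID NO: 49; see FIG. 6) that contains the features of pEMBR-1.3 and further includes a chicken β-actin promoter (SEQ ID NO: 26) upstream of the E2a gene to enhance expression of the E2a protein. The chicken β-actin promoter was added to compensate for enhancer elements in other parts of the L4 region that may have been lost when most of the L4 region was removed. Moreover, it has previously been shown that E2A can be driven by an exogenous promoter (Gene Therapy. 1998. 5, 938-945; Journal of Virology. 2007. Vol. 81, No. 21, 11908-11916).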
As with pEMBR-1.3B, another version of pEMBR-1.4 was constructed that includes the AAV-2-derived VA RNA region. This version is designated pEMBR-1.4B (SEQ ID NO: 50; see FIG. 6).
To further enhance expression of E2A, another version of pEMBR-1.4 was constructed that includes an SV40 polyadenylation signal. This version is designated pEMBR-1.4B2 (SEQ ID NO: 51).
Example 5: Introduction of supplemental helper genes into the modified adenoviral helper plasmids
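To further promote AAV production using the disclosed adenoviral helper plasmids, several supplemental helper genes were added to the minimized plasmids, while ensuring that the plasmid size did not exceed that of currently available commercial adenoviral helper plasmids (such as pALD-X80).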
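The pEMBR-1.5 (SEQ ID NO: 57; see FIG. 7) adenoviral helper plasmid was designed to include the features described for pEMBR-1.4 together with the HSV-1 DNA polymerase genes (UL30 and UL42), which were added to enhance replication of the AAV transgene even when cells are not in S phase. The UL30 and UL42 genes were designed as a single transcript (driven by the EF-1α core promoter and terminated by a rabbit β-globin polyadenylation signal), with a P2A cleavage site separating the two HSV-1 polymerase proteins. Any number of promoters may be used, including CBA, CMV, PGK, and the like, and any number of polyA sites may be used. Additional versions of pEMBR-1.5 (e.g., pEMBR-1.5A: SEQ ID NO: 58) were designed in which the UL30 and UL42 genes are driven by an SV40 promoter instead of the EF-1α core promoter.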
Similar to the other "B" designs, an additional version, pEMBR-1.5B, was constructed to include the smaller AAV-2-derived VA RNA I and II without flanking DNA terminal protein or endoprotease gene sequences (pEMBR-1.5B: SEQ ID NO: 59).
Similar to the other "B2" designs, an additional version, pEMBR-1.5B2, was constructed to include an SV40 polyadenylation signal for higher E2A expression (pEMBR-1.5B2: SEQ ID NO: 60).
Example 6: Further introduction of additional helper genes into the modified adenoviral helper plasmids
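This example further confirmed that, once adenoviral helper genes have been removed to generate smaller adenoviral helper plasmids, supplemental genes can be added to further improve AAV quality and yield. Specifically, various pEMBR plasmids of different sizes and containing different supplemental genes (e.g., UL30, UL42, etc.) were designed from the pEMBR-1.2 and pEMBR-1.5A backbone plasmids and tested for AAV production.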
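The pEMBR-1.2B2 (SEQ ID NO: 94) adenoviral helper plasmid was designed to include the "B2" design, which comprises an SV40 polyA site to potentially increase expression of E2A, and a synthesized sequence of a smaller VA region (containing Ad2 VA RNA I and VA RNA II) that does not contain the flanking Ad terminal protein and endoprotease gene sequences. This region was synthesized with flanking StuI and BsrGI sites, and the insert was cloned into pEMBR-1.2 to make pEMBR-1.2B2.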
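The pEMBR-1.2B2C (SEQ ID NO: 95) adenoviral helper plasmid (see FIG. 8) was designed to include the "B2" design as described above, together with a "C" design comprising an SV40 poly(A) tail added after E4 ORF6 in the E4 region to increase expression of the E4 genes. Compared with the pEMBR-1.2 vector, this region was synthesized with a reduced amount of backbone sequence, to further reduce the size of the plasmid. This E4 region was synthesized with flanking PacI and NotI sites for cloning into pEMBR-1.2B2.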
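The pEMBR-1.2B2D (SEQ ID NO: 96) adenoviral helper plasmid (see FIG. 9) was designed to include the "B2" design as described above, together with a "D" design comprising an SV40 polyA tail added after E4 ORF6 in the E4 region and an SV40 promoter, to increase expression of the E4 genes. Compared with the pEMBR-1.2 vector, this region was synthesized with a reduced amount of backbone sequence, to further reduce the size of the plasmid. This E4 region was synthesized with flanking PacI and NotI sites for cloning into pEMBR-1.2B2.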
Vector yields for AAV (e.g., AAV9) in clarified lysate, as measured by qPCR, are provided in FIG. 17B and FIG. 18 for the various pEMBR plasmids designed from the pEMBR-1.2 backbone. The pEMBR-1.2B2, pEMBR-1.2B2C, and pEMBR-1.2B2D adenoviral helper plasmids produced amounts of AAV comparable to the pEMBR-1.2 plasmid. The pEMBR-1.2B2, pEMBR-1.2B2C, and pEMBR-1.2B2D adenoviral helper plasmids produced comparable or greater amounts of AAV compared with a commercially available plasmid (e.g., pHelper).
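The pEMBR-1.2C (SEQ ID NO: 97) adenoviral helper plasmid was designed to include the "C" design, as described above and similar to the other "C" designs. Likewise, the pEMBR-1.2D (SEQ ID NO: 98) adenoviral helper plasmid was designed to include the "D" design, as described above and similar to the other "D" designs.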
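Vector yields for AAV (e.g., AAV9) in clarified lysate, as measured by qPCR, are provided in FIG. 17A and FIG. 18 for the various pEMBR plasmids designed from the pEMBR-1.2 backbone. The pEMBR-1.2C and pEMBR-1.2D adenoviral helper plasmids produced amounts of AAV comparable to the pEMBR-1.2 plasmid. The pEMBR-1.2C and pEMBR-1.2D adenoviral helper plasmids produced comparable or greater amounts of AAV compared with a commercially available plasmid (e.g., pHelper).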
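The pEMBR-1.5A (SEQ ID NO: 58) adenoviral helper plasmid (see FIG. 10) was designed as described in Example 5. pEMBR-1.5A comprises the HSV-1 DNA polymerase genes (UL30 and UL42) added to the pEMBR-1.4 plasmid (no hexon assembly; an exogenous promoter for E2a plus a nucleotide sequence encoding a fragment of L4 33 kDa Ex2 that includes the E2a promoter region). The HSV-1 DNA polymerase genes (UL30 and UL42) were added to the pEMBR-1.5A plasmid to support replication of the AAV transgene even when cells are not in S phase. The UL30 and UL42 genes were designed to be made as a single transcript (driven by the SV40 promoter and terminated by the bovine growth hormone polyA), with a P2A cleavage site separating the two HSV-1 polymerase proteins. Any number of promoters may be used, including CBA, CMV, PGK, and the like, and any number of polyA sites may be used.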
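Both pEMBR-1.5A and pEMBR-1.4 appeared to produce AAV at considerably lower titers than pEMBR-1.2 (see FIGS. 17A and 17B). pEMBR-1.5A (essentially pEMBR-1.4 with the UL30 and UL42 expression cassette added) was inferred to produce AAV at substantially lower titers because its plasmid backbone was derived from pEMBR-1.4. Therefore, the UL30 and UL42 construct was cloned into other plasmid versions that produce AAV at relatively higher titers, to test how the addition of UL30 and UL42 might affect AAV titer.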
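The pEMBR-1.55B2 (SEQ ID NO: 99) adenoviral helper plasmid (see FIG. 11) was generated by cloning the UL30 and UL42 expression cassette of the pEMBR-1.5A plasmid into the pEMBR-1.2B2 backbone. The UL30 and UL42 region was excised from pEMBR-1.5A with the blunt cutters XmnI and PmeI and cloned into the blunted NdeI restriction site of pEMBR-1.2B2. The UL30 and UL42 genes were designed to be made as a single transcript (driven by the SV40 promoter and terminated by the bovine growth hormone polyA), with a P2A cleavage site separating the two HSV-1 polymerase proteins. The orientation in which the construct is cloned into the plasmid should theoretically not affect expression, because this region contains both a promoter and a polyA signal that drive expression of UL30 and UL42 independently of the rest of the plasmid; nevertheless, an opposite-orientation version was also designed. Like the other B2-version plasmids, the pEMBR-1.2B2 backbone includes the "B2" design described above.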
The pEMBR-1.55B2 OO (SEQ ID NO: 100) adenoviral helper plasmid (see FIG. 12) is essentially the same as the 1.55B2 plasmid, except that the UL30 and UL42 construct was cloned into pEMBR-1.55B2-OO in the opposite orientation (OO).
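The pEMBR-1.55B2C (SEQ ID NO: 101) adenoviral helper plasmid (see FIG. 13) was generated by cloning the UL30 and UL42 expression cassette of the pEMBR-1.5A plasmid into the pEMBR-1.2B2 backbone. The UL30 and UL42 region was excised from pEMBR-1.5A with the blunt cutters XmnI and PmeI and cloned into the blunted NdeI restriction site of pEMBR-1.2B2C. The UL30 and UL42 genes were designed to be made as a single transcript (driven by the SV40 promoter and terminated by the bovine growth hormone polyA), with a P2A cleavage site separating the two HSV-1 polymerase proteins. The orientation in which the construct is cloned into the plasmid should theoretically not affect expression, because this region contains both a promoter and a polyA signal that drive expression of UL30 and UL42 independently of the rest of the plasmid; nevertheless, an opposite-orientation version was also designed. Like the other B2C-version plasmids, the pEMBR-1.2B2C backbone includes the "B2" and "C" designs described above.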
The pEMBR-1.55B2C OO (SEQ ID NO: 102) adenoviral helper plasmid (see FIG. 14) is essentially the same as the 1.55B2C plasmid, except that the UL30 and UL42 construct was cloned into pEMBR-1.55B2C-OO in the opposite orientation (OO).
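The pEMBR-1.55B2D (SEQ ID NO: 103) adenoviral helper plasmid (see FIG. 15) was generated by cloning the UL30 and UL42 expression cassette of the pEMBR-1.5A plasmid into the pEMBR-1.2B2 backbone. The UL30 and UL42 region was excised from pEMBR-1.5A with the blunt cutters XmnI and PmeI and cloned into the blunted NdeI restriction site of pEMBR-1.2B2. The UL30 and UL42 genes were designed to be made as a single transcript (driven by the SV40 promoter and terminated by the bovine growth hormone polyA), with a P2A cleavage site separating the two HSV-1 polymerase proteins. The orientation in which the construct is cloned into the plasmid should theoretically not affect expression, because this region contains both a promoter and a polyA signal that drive expression of UL30 and UL42 independently of the rest of the plasmid; nevertheless, an opposite-orientation version was also designed. Like the other B2D-version plasmids, the pEMBR-1.2B2D backbone includes the "B2" and "D" designs described above.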
The pEMBR-1.55B2D OO (SEQ ID NO: 104) adenoviral helper plasmid (see FIG. 16) is essentially the same as the 1.55B2D plasmid, except that the UL30 and UL42 construct was cloned into pEMBR-1.55B2D-OO in the opposite orientation (OO).
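The relationship between a forward construct and its opposite-orientation (OO) counterpart can be illustrated in Python. After a blunt-end insertion in the opposite orientation, the cassette reads on the plus strand as the reverse complement of the forward cassette. The function names and the toy sequences below are hypothetical; the actual pEMBR sequences are those given by the SEQ ID NOs.

```python
# Hypothetical illustration of forward versus opposite-orientation (OO)
# insertion of a blunt-ended cassette. Toy sequences only.
_COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence."""
    return seq.translate(_COMPLEMENT)[::-1]


def insert_cassette(
    backbone: str, cassette: str, position: int, opposite: bool = False
) -> str:
    """Insert a cassette into a linearized backbone at a blunt cut position.

    With opposite=True the cassette is written as its reverse complement,
    which is how it reads on the backbone plus strand in an OO construct.
    """
    insert = reverse_complement(cassette) if opposite else cassette
    return backbone[:position] + insert + backbone[position:]


if __name__ == "__main__":
    backbone = "AAAATTTT"  # toy backbone, cut between positions 4 and 5
    cassette = "ATGC"      # toy stand-in for promoter-UL30-P2A-UL42-polyA
    print(insert_cassette(backbone, cassette, 4))
    print(insert_cassette(backbone, cassette, 4, opposite=True))
```

Both orientations give a plasmid of the same length and differ only in the strand on which the cassette is read, which is consistent with the expectation stated above that a self-contained promoter and polyA signal make expression largely independent of orientation.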
Vector yields for AAV (e.g., AAV9) in clarified lysate, as measured by qPCR, are provided in FIG. 17C for the various pEMBR plasmids designed to carry the UL30 and UL42 expression cassette. The pEMBR-1.55B2, pEMBR-1.55B2C, and pEMBR-1.55B2D adenoviral helper plasmids produced more AAV than the pEMBR-1.5A plasmid. The pEMBR-1.55B2, pEMBR-1.55B2C, and pEMBR-1.55B2D adenoviral helper plasmids produced comparable or greater amounts of AAV compared with the pEMBR-1.2 plasmid.
Example 7: Table of sequences
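The table of sequences below lists and describes the various sequences discussed herein. Unless otherwise specified, all sequences are listed in the 5' to 3' direction of the plasmid plus strand. This directionality is preserved regardless of the orientation of the gene or element with which the sequence is described as being associated. As used herein, an asterisk denotes a stop codon.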
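As an illustration of these conventions, the following Python sketch strips the spacing and position counters from a sequence-listing block and translates it with the standard genetic code, writing a stop codon as an asterisk. The input is the SEQ ID NO: 9 entry (L4 33kDa Ex2) as it appears in the sequence listing; the helper names are hypothetical, and the use of the standard code is an assumption consistent with the paired SEQ ID NO: 10 peptide.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), built in TCAG order.
_BASES = "TCAG"
_AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(_BASES, repeat=3), _AMINO)
}


def clean_listing(block: str) -> str:
    """Keep only nucleotide letters, dropping spaces, newlines, and counters."""
    return "".join(ch for ch in block.upper() if ch in "ACGT")


def translate(dna: str) -> str:
    """Translate reading frame 1, 5' to 3'; a stop codon is written as '*'."""
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# SEQ ID NO: 9 exactly as laid out in the sequence listing.
SEQ_ID_NO_9 = """
cacaaaagcg aagatcagct tcggcgcacg ctggaagacg cggaggctct cttcagtaaa 60
tactgcgcgc tgactcttaa ggactag 87
"""

protein = translate(clean_listing(SEQ_ID_NO_9))
print(protein)  # prints HKSEDQLRRTLEDAEALFSKYCALTLKD*
```

The 28 residues before the asterisk correspond to the peptide given as SEQ ID NO: 10, and the trailing asterisk marks the terminal TAG codon.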
EQUIVALENTS
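Those skilled in the art will recognize, or be able to ascertain using no more than routine experimentation, many equivalents to the specific embodiments of the invention described herein. The scope of the present invention is not intended to be limited to the above description, but rather is as set forth in the appended claims.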
SEQUENCE LISTING
<110> FORGE BIOLOGICS, INC.
<120> ADENOVIRAL HELPER PLASMID
<130> 2013906-0023
<140> PCT/US2022/029193
<141> 2022-05-13
<150> 63/188,294
<151> 2021-05-13
<160> 106
<170> PatentIn version 3.5
<210> 1
<211> 239
<212> DNA
<213> Artificial Sequence
<220>
<223> Description of Artificial Sequence: Synthetic
polynucleotide
<400> 1
cccttttgcc ttcactgcta aactccttca acacccaaaa aaccgaaagc aaagacccgc 60
atccaagcgc acgccaaaag acccacaaaa aacacctgaa attggcaatg cagtaaaaaa 120
tcaggatata tatgagcgag acgtgaaccg ggaaaaaatg tgacactgac taactcgacc 180
acggcacagc tcaccacaaa aaaattatcc aaaagaaaaa atgaccattc cgactgaca 239
<210> 2
<211> 330
<212> DNA
<213> Artificial Sequence
<220>
<223> Description of Artificial Sequence: Synthetic
polynucleotide
<400> 2
cacacagtca atcccacacc tttcaggggt ccgaggggtc gtccgtcttc atacgtttcg 60
tacgtagagt taatcagtcg ttggtccaca cctttcaggg gtccgagggg tcgtccgtct 120
tcatacgttt cgtacgtaga gttaatcagt cgttggtatc agggcgggga ttgaggcggg 180
tagggcgggg attgaggcgg gtcaaggcgg gtaagaggcg gggtaccgac tgattaaaaa 240
aaataaatac gtctccggct ccggcggagc cggagactcg ataaggtctt catcactcct 300
ccgaaaaaac ctccggatcc gaaaacgttt 330
<210> 3
<211> 3006
<212> DNA
<213> Artificial Sequence
<220>
<223> Description of Artificial Sequence: Synthetic
polynucleotide
<400> 3
atgcccttct cccacgcaga cacgatcggc acactcagcg ggttcatcac cgtaatttca 60
ctttccgctt cgctgggctc ttcctcttcc tcttgcgtcc gcataccacg cgccactggg 120
tcgtcttcat tcagccgccg cactgtgcgc ttacctcctt tgccatgctt gattagcacc 180
ggtgggttgc tgaaacccac catttgtagc gccacatctt ctctttcttc ctcgctgtcc 240
acgattacct ctggtgatgg cgggcgctcg ggcttgggag aagggcgctt ctttttcttc 300
ttgggcgcaa tggccaaatc cgccgccgag gtcgatggcc gcgggctggg tgtgcgcggc 360
accagcgcgt cttgtgatga gtcttcctcg tcctcggact cgatacgccg cctcatccgc 420
ttttttgggg gcgcccgggg aggcggcggc gacggggacg gggacgacac gtcctccatg 480
gttgggggac gtcgcgccgc accgcgtccg cgctcggggg tggtttcgcg ctgctcctct 540
tcccgactgg ccatttcctt ctcctatagg cagaaaaaga tcatggagtc agtcgagaag 600
aaggacagcc taaccgcccc ctctgagttc gccaccaccg cctccaccga tgccgccaac 660
gcgcctacca ccttccccgt cgaggcaccc ccgcttgagg aggaggaagt gattatcgag 720
caggacccag gttttgtaag cgaagacgac gaggaccgct cagtaccaac agaggataaa 780
aagcaagacc aggacaacgc agaggcaaac gaggaacaag tcgggcgggg ggacgaaagg 840
catggcgact acctagatgt gggagacgac gtgctgttga agcatctgca gcgccagtgc 900
gccattatct gcgacgcgtt gcaagagcgc agcgatgtgc ccctcgccat agcggatgtc 960
agccttgcct acgaacgcca cctattctca ccgcgcgtac cccccaaacg ccaagaaaac 1020
ggcacatgcg agcccaaccc gcgcctcaac ttctaccccg tatttgccgt gccagaggtg 1080
cttgccacct atcacatctt tttccaaaac tgcaagatac ccctatcctg ccgtgccaac 1140
cgcagccgag cggacaagca gctggccttg cggcagggcg ctgtcatacc tgatatcgcc 1200
tcgctcaacg aagtgccaaa aatctttgag ggtcttggac gcgacgagaa gcgcgcggca 1260
aacgctctgc aacaggaaaa cagcgaaaat gaaagtcact ctggagtgtt ggtggaactc 1320
gagggtgaca acgcgcgcct agccgtacta aaacgcagca tcgaggtcac ccactttgcc 1380
tacccggcac ttaacctacc ccccaaggtc atgagcacag tcatgagtga gctgatcgtg 1440
cgccgtgcgc agcccctgga gagggatgca aatttgcaag aacaaacaga ggagggccta 1500
cccgcagttg gcgacgagca gctagcgcgc tggcttcaaa cgcgcgagcc tgccgacttg 1560
gaggagcgac gcaaactaat gatggccgca gtgctcgtta ccgtggagct tgagtgcatg 1620
cagcggttct ttgctgaccc ggagatgcag cgcaagctag aggaaacatt gcactacacc 1680
tttcgacagg gctacgtacg ccaggcctgc aagatctcca acgtggagct ctgcaacctg 1740
gtctcctacc ttggaatttt gcacgaaaac cgccttgggc aaaacgtgct tcattccacg 1800
ctcaagggcg aggcgcgccg cgactacgtc cgcgactgcg tttacttatt tctatgctac 1860
acctggcaga cggccatggg cgtttggcag cagtgcttgg aggagtgcaa cctcaaggag 1920
ctgcagaaac tgctaaagca aaacttgaag gacctatgga cggccttcaa cgagcgctcc 1980
gtggccgcgc acctggcgga catcattttc cccgaacgcc tgcttaaaac cctgcaacag 2040
ggtctgccag acttcaccag tcaaagcatg ttgcagaact ttaggaactt tatcctagag 2100
cgctcaggaa tcttgcccgc cacctgctgt gcacttccta gcgactttgt gcccattaag 2160
taccgcgaat gccctccgcc gctttggggc cactgctacc ttctgcagct agccaactac 2220
cttgcctacc actctgacat aatggaagac gtgagcggtg acggtctact ggagtgtcac 2280
tgtcgctgca acctatgcac cccgcaccgc tccctggttt gcaattcgca gctgcttaac 2340
gaaagtcaaa ttatcggtac ctttgagctg cagggtccct cgcctgacga aaagtccgcg 2400
gctccggggt tgaaactcac tccggggctg tggacgtcgg cttaccttcg caaatttgta 2460
cctgaggact accacgccca cgagattagg ttctacgaag accaatcccg cccgcctaat 2520
gcggagctta ccgcctgcgt cattacccag ggccacattc ttggccaatt gcaagccatc 2580
aacaaagccc gccaagagtt tctgctacga aagggacggg gggtttactt ggacccccag 2640
tccggcgagg agctcaaccc aatccccccg ccgccgcagc cctatcagca gcagccgcgg 2700
gcccttgctt cccaggatgg cacccaaaaa gaagctgcag ctgccgccgc cacccacgga 2760
cgaggaggaa tactgggaca gtcaggcaga ggaggttttg gacgaggagg aggaggacat 2820
gatggaagac tgggagagcc tagacgagga agcttccgag gtcgaagagg tgtcagacga 2880
aacaccgtca ccctcggtcg cattcccctc gccggcgccc cagaaatcgg caaccggttc 2940
cagcatggct acaacctccg ctcctcaggc gccgccggca ctgcccgttc gccgacccaa 3000
ccgtag 3006
<210> 4
<211> 1001
<212> PRT
<213> Artificial Sequence
<220>
<223> Description of Artificial Sequence: Synthetic
polypeptide
<400> 4
Met Pro Phe Ser His Ala Asp Thr Ile Gly Thr Leu Ser Gly Phe Ile
1 5 10 15
Thr Val Ile Ser Leu Ser Ala Ser Leu Gly Ser Ser Ser Ser Ser Cys
20 25 30
Val Arg Ile Pro Arg Ala Thr Gly Ser Ser Ser Phe Ser Arg Arg Thr
35 40 45
Val Arg Leu Pro Pro Leu Pro Cys Leu Ile Ser Thr Gly Gly Leu Leu
50 55 60
Lys Pro Thr Ile Cys Ser Ala Thr Ser Ser Leu Ser Ser Ser Leu Ser
65 70 75 80
Thr Ile Thr Ser Gly Asp Gly Gly Arg Ser Gly Leu Gly Glu Gly Arg
85 90 95
Phe Phe Phe Phe Leu Gly Ala Met Ala Lys Ser Ala Ala Glu Val Asp
100 105 110
Gly Arg Gly Leu Gly Val Arg Gly Thr Ser Ala Ser Cys Asp Glu Ser
115 120 125
Ser Ser Ser Ser Asp Ser Ile Arg Arg Leu Ile Arg Phe Phe Gly Gly
130 135 140
Ala Arg Gly Gly Gly Gly Asp Gly Asp Gly Asp Asp Thr Ser Ser Met
145 150 155 160
Val Gly Gly Arg Arg Ala Ala Pro Arg Pro Arg Ser Gly Val Val Ser
165 170 175
Arg Cys Ser Ser Ser Arg Leu Ala Ile Ser Phe Ser Tyr Arg Gln Lys
180 185 190
Lys Ile Met Glu Ser Val Glu Lys Lys Asp Ser Leu Thr Ala Pro Ser
195 200 205
Glu Phe Ala Thr Thr Ala Ser Thr Asp Ala Ala Asn Ala Pro Thr Thr
210 215 220
Phe Pro Val Glu Ala Pro Pro Leu Glu Glu Glu Glu Val Ile Ile Glu
225 230 235 240
Gln Asp Pro Gly Phe Val Ser Glu Asp Asp Glu Asp Arg Ser Val Pro
245 250 255
Thr Glu Asp Lys Lys Gln Asp Gln Asp Asn Ala Glu Ala Asn Glu Glu
260 265 270
Gln Val Gly Arg Gly Asp Glu Arg His Gly Asp Tyr Leu Asp Val Gly
275 280 285
Asp Asp Val Leu Leu Lys His Leu Gln Arg Gln Cys Ala Ile Ile Cys
290 295 300
Asp Ala Leu Gln Glu Arg Ser Asp Val Pro Leu Ala Ile Ala Asp Val
305 310 315 320
Ser Leu Ala Tyr Glu Arg His Leu Phe Ser Pro Arg Val Pro Pro Lys
325 330 335
Arg Gln Glu Asn Gly Thr Cys Glu Pro Asn Pro Arg Leu Asn Phe Tyr
340 345 350
Pro Val Phe Ala Val Pro Glu Val Leu Ala Thr Tyr His Ile Phe Phe
355 360 365
Gln Asn Cys Lys Ile Pro Leu Ser Cys Arg Ala Asn Arg Ser Arg Ala
370 375 380
Asp Lys Gln Leu Ala Leu Arg Gln Gly Ala Val Ile Pro Asp Ile Ala
385 390 395 400
Ser Leu Asn Glu Val Pro Lys Ile Phe Glu Gly Leu Gly Arg Asp Glu
405 410 415
Lys Arg Ala Ala Asn Ala Leu Gln Gln Glu Asn Ser Glu Asn Glu Ser
420 425 430
His Ser Gly Val Leu Val Glu Leu Glu Gly Asp Asn Ala Arg Leu Ala
435 440 445
Val Leu Lys Arg Ser Ile Glu Val Thr His Phe Ala Tyr Pro Ala Leu
450 455 460
Asn Leu Pro Pro Lys Val Met Ser Thr Val Met Ser Glu Leu Ile Val
465 470 475 480
Arg Arg Ala Gln Pro Leu Glu Arg Asp Ala Asn Leu Gln Glu Gln Thr
485 490 495
Glu Glu Gly Leu Pro Ala Val Gly Asp Glu Gln Leu Ala Arg Trp Leu
500 505 510
Gln Thr Arg Glu Pro Ala Asp Leu Glu Glu Arg Arg Lys Leu Met Met
515 520 525
Ala Ala Val Leu Val Thr Val Glu Leu Glu Cys Met Gln Arg Phe Phe
530 535 540
Ala Asp Pro Glu Met Gln Arg Lys Leu Glu Glu Thr Leu His Tyr Thr
545 550 555 560
Phe Arg Gln Gly Tyr Val Arg Gln Ala Cys Lys Ile Ser Asn Val Glu
565 570 575
Leu Cys Asn Leu Val Ser Tyr Leu Gly Ile Leu His Glu Asn Arg Leu
580 585 590
Gly Gln Asn Val Leu His Ser Thr Leu Lys Gly Glu Ala Arg Arg Asp
595 600 605
Tyr Val Arg Asp Cys Val Tyr Leu Phe Leu Cys Tyr Thr Trp Gln Thr
610 615 620
Ala Met Gly Val Trp Gln Gln Cys Leu Glu Glu Cys Asn Leu Lys Glu
625 630 635 640
Leu Gln Lys Leu Leu Lys Gln Asn Leu Lys Asp Leu Trp Thr Ala Phe
645 650 655
Asn Glu Arg Ser Val Ala Ala His Leu Ala Asp Ile Ile Phe Pro Glu
660 665 670
Arg Leu Leu Lys Thr Leu Gln Gln Gly Leu Pro Asp Phe Thr Ser Gln
675 680 685
Ser Met Leu Gln Asn Phe Arg Asn Phe Ile Leu Glu Arg Ser Gly Ile
690 695 700
Leu Pro Ala Thr Cys Cys Ala Leu Pro Ser Asp Phe Val Pro Ile Lys
705 710 715 720
Tyr Arg Glu Cys Pro Pro Pro Leu Trp Gly His Cys Tyr Leu Leu Gln
725 730 735
Leu Ala Asn Tyr Leu Ala Tyr His Ser Asp Ile Met Glu Asp Val Ser
740 745 750
Gly Asp Gly Leu Leu Glu Cys His Cys Arg Cys Asn Leu Cys Thr Pro
755 760 765
His Arg Ser Leu Val Cys Asn Ser Gln Leu Leu Asn Glu Ser Gln Ile
770 775 780
Ile Gly Thr Phe Glu Leu Gln Gly Pro Ser Pro Asp Glu Lys Ser Ala
785 790 795 800
Ala Pro Gly Leu Lys Leu Thr Pro Gly Leu Trp Thr Ser Ala Tyr Leu
805 810 815
Arg Lys Phe Val Pro Glu Asp Tyr His Ala His Glu Ile Arg Phe Tyr
820 825 830
Glu Asp Gln Ser Arg Pro Pro Asn Ala Glu Leu Thr Ala Cys Val Ile
835 840 845
Thr Gln Gly His Ile Leu Gly Gln Leu Gln Ala Ile Asn Lys Ala Arg
850 855 860
Gln Glu Phe Leu Leu Arg Lys Gly Arg Gly Val Tyr Leu Asp Pro Gln
865 870 875 880
Ser Gly Glu Glu Leu Asn Pro Ile Pro Pro Pro Pro Gln Pro Tyr Gln
885 890 895
Gln Gln Pro Arg Ala Leu Ala Ser Gln Asp Gly Thr Gln Lys Glu Ala
900 905 910
Ala Ala Ala Ala Ala Thr His Gly Arg Gly Gly Ile Leu Gly Gln Ser
915 920 925
Gly Arg Gly Gly Phe Gly Arg Gly Gly Gly Gly His Asp Gly Arg Leu
930 935 940
Gly Glu Pro Arg Arg Gly Ser Phe Arg Gly Arg Arg Gly Val Arg Arg
945 950 955 960
Asn Thr Val Thr Leu Gly Arg Ile Pro Leu Ala Gly Ala Pro Glu Ile
965 970 975
Gly Asn Arg Phe Gln His Gly Tyr Asn Leu Arg Ser Ser Gly Ala Ala
980 985 990
Gly Thr Ala Arg Ser Pro Thr Gln Pro
995 1000
<210> 5
<211> 369
<212> DNA
<213> Artificial Sequence
<220>
<223> Description of Artificial Sequence: Synthetic
polynucleotide
<400> 5
gcccatactg caccggcggc agcggcagca acagcagcgg ccacacagaa gcaaaggcga 60
ccggatagca agactctgac aaagcccaag aaatccacag cggcggcagc agcaggagga 120
ggagcgctgc gtctggcgcc caacgaaccc gtatcgaccc gcgagcttag aaacaggatt 180
tttcccactc tgtatgctat atttcaacag agcaggggcc aagaacaaga gctgaaaata 240
aaaaacaggt ctctgcgatc cctcacccgc agctgcctgt atcacaaaag cgaagatcag 300
cttcggcgca cgctggaaga cgcggaggct ctcttcagta aatactgcgc gctgactctt 360
aaggactag 369
<210> 6
<211> 122
<212> PRT
<213> Artificial Sequence
<220>
<223> Description of Artificial Sequence: Synthetic
polypeptide
<400> 6
Ala His Thr Ala Pro Ala Ala Ala Ala Ala Thr Ala Ala Ala Thr Gln
1 5 10 15
Lys Gln Arg Arg Pro Asp Ser Lys Thr Leu Thr Lys Pro Lys Lys Ser
20 25 30
Thr Ala Ala Ala Ala Ala Gly Gly Gly Ala Leu Arg Leu Ala Pro Asn
35 40 45
Glu Pro Val Ser Thr Arg Glu Leu Arg Asn Arg Ile Phe Pro Thr Leu
50 55 60
Tyr Ala Ile Phe Gln Gln Ser Arg Gly Gln Glu Gln Glu Leu Lys Ile
65 70 75 80
Lys Asn Arg Ser Leu Arg Ser Leu Thr Arg Ser Cys Leu Tyr His Lys
85 90 95
Ser Glu Asp Gln Leu Arg Arg Thr Leu Glu Asp Ala Glu Ala Leu Phe
100 105 110
Ser Lys Tyr Cys Ala Leu Thr Leu Lys Asp
115 120
<210> 7
<211> 585
<212> DNA
<213> Artificial Sequence
<220>
<223> Description of Artificial Sequence: Synthetic
polynucleotide
<400> 7
atggcaccca aaaagaagct gcagctgccg ccgccaccca cggacgagga ggaatactgg 60
gacagtcagg cagaggaggt tttggacgag gaggaggagg acatgatgga agactgggag 120
agcctagacg aggaagcttc cgaggtcgaa gaggtgtcag acgaaacacc gtcaccctcg 180
gtcgcattcc cctcgccggc gccccagaaa tcggcaaccg gttccagcat ggctacaacc 240
tccgctcctc aggcgccgcc ggcactgccc gttcgccgac ccaaccgtag atgggacacc 300
actggaacca gggccggtaa gtccaagcag ccgccgccgt tagcccaaga gcaacaacag 360
cgccaaggct accgctcatg gcgcgggcac aagaacgcca tagttgcttg cttgcaagac 420
tgtgggggca acatctcctt cgcccgccgc tttcttctct accatcacgg cgtggccttc 480
ccccgtaaca tcctgcatta ctaccgtcat ctctacagcc catactgcac cggcggcagc 540
ggcagcaaca gcagcggcca cacagaagca aaggcgaccg gatag 585
<210> 8
<211> 194
<212> PRT
<213> Artificial Sequence
<220>
<223> Description of Artificial Sequence: Synthetic
polypeptide
<400> 8
Met Ala Pro Lys Lys Lys Leu Gln Leu Pro Pro Pro Pro Thr Asp Glu
1 5 10 15
Glu Glu Tyr Trp Asp Ser Gln Ala Glu Glu Val Leu Asp Glu Glu Glu
20 25 30
Glu Asp Met Met Glu Asp Trp Glu Ser Leu Asp Glu Glu Ala Ser Glu
35 40 45
Val Glu Glu Val Ser Asp Glu Thr Pro Ser Pro Ser Val Ala Phe Pro
50 55 60
Ser Pro Ala Pro Gln Lys Ser Ala Thr Gly Ser Ser Met Ala Thr Thr
65 70 75 80
Ser Ala Pro Gln Ala Pro Pro Ala Leu Pro Val Arg Arg Pro Asn Arg
85 90 95
Arg Trp Asp Thr Thr Gly Thr Arg Ala Gly Lys Ser Lys Gln Pro Pro
100 105 110
Pro Leu Ala Gln Glu Gln Gln Gln Arg Gln Gly Tyr Arg Ser Trp Arg
115 120 125
Gly His Lys Asn Ala Ile Val Ala Cys Leu Gln Asp Cys Gly Gly Asn
130 135 140
Ile Ser Phe Ala Arg Arg Phe Leu Leu Tyr His His Gly Val Ala Phe
145 150 155 160
Pro Arg Asn Ile Leu His Tyr Tyr Arg His Leu Tyr Ser Pro Tyr Cys
165 170 175
Thr Gly Gly Ser Gly Ser Asn Ser Ser Gly His Thr Glu Ala Lys Ala
180 185 190
Thr Gly
<210> 9
<211> 87
<212> DNA
<213> Artificial Sequence
<220>
<223> Description of Artificial Sequence: Synthetic
oligonucleotide
<400> 9
cacaaaagcg aagatcagct tcggcgcacg ctggaagacg cggaggctct cttcagtaaa 60
tactgcgcgc tgactcttaa ggactag 87
<210> 10
<211> 28
<212> PRT
<213> Artificial Sequence
<220>
<223> Description of Artificial Sequence: Synthetic
peptide
<400> 10
His Lys Ser Glu Asp Gln Leu Arg Arg Thr Leu Glu Asp Ala Glu Ala
1 5 10 15
Leu Phe Ser Lys Tyr Cys Ala Leu Thr Leu Lys Asp
20 25
<210> 11
<211> 36
<212> DNA
<213> Artificial Sequence
<220>
<223> Description of Artificial Sequence: Synthetic
oligonucleotide
<400> 11
attctcagtc gcgcgtcata aatgacttct ctcgga 36
<210> 12
<211> 110
<212> DNA
<213> Artificial Sequence
<220>
<223> Description of Artificial Sequence: Synthetic
polynucleotide
<400> 12
atgagcaagg aaattcccac gccctacatg tggagttacc agccacaaat gggacttgcg 60
gctggagctg cccaagacta ctcaacccga ataaactaca tgagcgcggg 110
<210> 13
<211> 36
<212> PRT
<213> Artificial Sequence
<220>
<223> Description of Artificial Sequence: Synthetic
polypeptide
<400> 13
Met Ser Lys Glu Ile Pro Thr Pro Tyr Met Trp Ser Tyr Gln Pro Gln
1 5 10 15
Met Gly Leu Ala Ala Gly Ala Ala Gln Asp Tyr Ser Thr Arg Ile Asn
20 25 30
Tyr Met Ser Ala
35
<210> 14
<211> 513
<212> DNA
<213> Artificial Sequence
<220>
<223> Description of Artificial Sequence: Synthetic
polynucleotide
<400> 14
tatccgtaga tgtacctgga catccaggtg atgccggcgg cggtggtgga ggcgcgcgga 60
aagtcgcgga cgcggttcca gatgttgcgc agcggcaaaa agtgctccat ggtcgggacg 120
ctctggccgg tcaggcgcgc gcaatcgttg acgctctagc gtgcaaaagg agagcctgta 180
agcgggcact cttccgtggt ctggtggata aattcgcaag ggtatcatgg cggacgaccg 240
gggttcgagc cccgtatccg gccgtccgcc gtgatccatg cggttaccgc ccgcgtgtcg 300
aacccaggtg tgcgacgtca gacaacgggg gagtgctcct tttggcttcc ttccaggcgc 360
ggcggctgct gcgctagctt ttttggccac tggccgcgcg cagcgtaagc ggttaggctg 420
gaaagcgaaa gcattaagtg gctcgctccc tgtagccgga gggttatttt ccaagggttg 480
agtcgcggga cccccggttc gagtctcgga ccg 513
<210> 15
<211> 730
<212> DNA
<213> Artificial Sequence
<220>
<223> Description of Artificial Sequence: Synthetic
polynucleotide
<400> 15
atccgtagat gtacctggac atccaggtga tgccggcggc ggtggtggag gcgcgcggaa 60
agtcgcggac gcggttccag atgttgcgca gcggcaaaaa gtgctccatg gtcgggacgc 120
tctggccggt gaggcgtgcg cagtcgttga cgctctagac cgtgcaaaag gagagcctgt 180
aagcgggcac tcttccgtgg tctggtggat aaattcgcaa gggtatcatg gcggacgacc 240
ggggttcgaa ccccggatcc ggccgtccgc cgtgatccat gcggttaccg cccgcgtgtc 300
gaacccaggt gtgcgacgtc agacaacggg ggagcgctcc ttttggcttc cttccaggcg 360
cggcggctgc tgcgctagct tttttggcca ctggccgcgc gcggcgtaag cggttaggct 420
ggaaagcgaa agcattaagt ggctcgctcc ctgtagccgg agggttattt tccaagggtt 480
gagtcgcagg acccccggtt cgagtctcgg gccggccgga ctgcggcgaa cgggggtttg 540
cctccccgtc atgcaagacc ccgcttgcaa attcctccgg aaacagggac gagccccttt 600
tttgcttttc ccagatgcat ccggtgctgc ggcagatgcg cccccctcct cagcagcggc 660
aagagcaaga gcagcggcag acatgcaggg caccctcccc ttctcctacc gcgtcaggag 720
gggcaacatc 730
<210> 16
<211> 163
<212> DNA
<213> Artificial Sequence
<220>
<223> Description of Artificial Sequence: Synthetic
polynucleotide
<400> 16
agcgggcact cttccgtggt ctggtggata aattcgcaag ggtatcatgg cggacgaccg 60
gggttcgagc cccgtatccg gccgtccgcc gtgatccatg cggttaccgc ccgcgtgtcg 120
aacccaggtg tgcgacgtca gacaacgggg gagtgctcct ttt 163
<210> 17
<211> 163
<212> DNA
<213> Artificial Sequence
<220>
<223> Description of Artificial Sequence: Synthetic
polynucleotide
<400> 17
agcgggcact cttccgtggt ctggtggata aattcgcaag ggtatcatgg cggacgaccg 60
gggttcgaac cccggatccg gccgtccgcc gtgatccatg cggttaccgc ccgcgtgtcg 120
aacccaggtg tgcgacgtca gacaacgggg gagcgctcct ttt 163
<210> 18
<211> 74
<212> DNA
<213> Artificial Sequence
<220>
<223> Description of Artificial Sequence: Synthetic
oligonucleotide
<400> 18
ggctcgctcc ctgtagccgg agggttattt tccaagggtt gagtcgcggg acccccggtt 60
cgagtctcgg accg 74
<210> 19
<211> 161
<212> DNA
<213> Artificial Sequence
<220>
<223> Description of Artificial Sequence: Synthetic
polynucleotide
<400> 19
ggctcgctcc ctgtagccgg agggttattt tccaagggtt gagtcgcagg acccccggtt 60
cgagtctcgg gccggccgga ctgcggcgaa cgggggtttg cctccccgtc atgcaagacc 120
ccgcttgcaa attcctccgg aaacagggac gagccccttt t 161
<210> 20
<211> 699
<212> DNA
<213> Artificial Sequence
<220>
<223> Description of Artificial Sequence: Synthetic
polynucleotide
<400> 20
tacctcgtga aaaacggcga cgcgttgtag accttggcgc aggcgctgaa aggcgcgcgg 60
aggtggtggc ggcggccgta gtggacctac aggtccatgt agatgcctat agtagcggaa 120
tacaaccttc tagagcgggg gcctcggggc cggtgggatg cgaccgggga gatggcggtc 180
ggcggcggcg tgaaaaacca ccctatggtc atggaccacg cctgaacgtt gctgatgcat 240
aaactgagct cccgaatgag cgcagagtcc atgtggctcg agagcgtcgg cccagtggtc 300
tggcaattga ccaggcaata ccggttgacg tgaatgtggt agttgtgccc gcgtatggtg 360
gcgaaacacc tgtacctact gaaggtcaga tgggagtgcg tccacgtcgt ccggtataat 420
cggctcgcgc aacagcggct ggatcgggac gaagtcggct actccccgaa gccccagtgt 480
gcgtaccctc cttctcccgc ggtggatgcc ggtttgaggc ggcggcggcg tcgctatcta 540
cgttctctac gtcctgttct ccttcctctt cttcttcacg gccatctttc cgagtacgtt 600
ctgatgatgt ttctggacgc ggctacagtt ttgcttcgga ccccgtaccg gctggcggac 660
gcgtaagtcg tccggcctgg gttcctgtac cacgaagac 699
<210> 21
<211> 233
<212> PRT
<213> Artificial Sequence
<220>
<223> Description of Artificial Sequence: Synthetic
polypeptide
<400> 21
Met Glu His Phe Leu Pro Leu Arg Asn Ile Trp Asn Arg Val Arg Asp
1 5 10 15
Phe Pro Arg Ala Ser Thr Thr Ala Ala Gly Ile Thr Trp Met Ser Arg
20 25 30
Tyr Ile Tyr Gly Tyr His Arg Leu Met Leu Glu Asp Leu Ala Pro Gly
35 40 45
Ala Pro Ala Thr Leu Arg Trp Pro Leu Tyr Arg Gln Pro Pro Pro His
50 55 60
Phe Leu Val Gly Tyr Gln Tyr Leu Val Arg Thr Cys Asn Asp Tyr Val
65 70 75 80
Phe Asp Ser Arg Ala Tyr Ser Arg Leu Arg Tyr Thr Glu Leu Ser Gln
85 90 95
Pro Gly His Gln Thr Val Asn Trp Ser Val Met Ala Asn Cys Thr Tyr
100 105 110
Thr Ile Asn Thr Gly Ala Tyr His Arg Phe Val Asp Met Asp Asp Phe
115 120 125
Gln Ser Thr Leu Thr Gln Val Gln Gln Ala Ile Leu Ala Glu Arg Val
130 135 140
Val Ala Asp Leu Ala Leu Leu Gln Pro Met Arg Gly Phe Gly Val Thr
145 150 155 160
Arg Met Gly Gly Arg Gly Arg His Leu Arg Pro Asn Ser Ala Ala Ala
165 170 175
Ala Ala Ile Asp Ala Arg Asp Ala Gly Gln Glu Glu Gly Glu Glu Glu
180 185 190
Val Pro Val Glu Arg Leu Met Gln Asp Tyr Tyr Lys Asp Leu Arg Arg
195 200 205
Cys Gln Asn Glu Ala Trp Gly Met Ala Asp Arg Leu Arg Ile Gln Gln
210 215 220
Ala Gly Pro Lys Asp Met Val Leu Leu
225 230
<210> 22
<211> 473
<212> DNA
<213> Artificial Sequence
<220>
<223> Description of Artificial Sequence: Synthetic
polynucleotide
<400> 22
ccgagactgg gggcgtacac tggatggcct ttgcctggaa cccgcactca aaaacatgct 60
acctctttga gccctttggc ttttctgacc agcgactcaa gcaggtttac cagtttgagt 120
acgagtcact cctgcgccgt agcgccattg cttcttcccc cgaccgctgt ataacgctgg 180
aaaagtccac ccaaagcgta caggggccca actcggccgc ctgtggacta ttctgctgca 240
tgtttctcca cgcctttgcc aactggcccc aaactcccat ggatcacaac cccaccatga 300
accttattac cggggtaccc aactccatgc tcaacagtcc ccaggtacag cccaccctgc 360
gtcgcaacca ggaacagctc tacagcttcc tggagcgcca ctcgccctac ttccgcagcc 420
acagtgcgca gattaggagc gccacttctt tttgtcactt gaaaaacatg taa 473
<210> 23
<211> 204
<212> PRT
<213> Artificial Sequence
<220>
<223> Description of Artificial Sequence: Synthetic
polypeptide
<400> 23
Met Gly Ser Ser Glu Gln Glu Leu Lys Ala Ile Val Lys Asp Leu Gly
1 5 10 15
Cys Gly Pro Tyr Phe Leu Gly Thr Tyr Asp Lys Arg Phe Pro Gly Phe
20 25 30
Val Ser Pro His Lys Leu Ala Cys Ala Ile Val Asn Thr Ala Gly Arg
35 40 45
Glu Thr Gly Gly Val His Trp Met Ala Phe Ala Trp Asn Pro Arg Ser
50 55 60
Lys Thr Cys Tyr Leu Phe Glu Pro Phe Gly Phe Ser Asp Gln Arg Leu
65 70 75 80
Lys Gln Val Tyr Gln Phe Glu Tyr Glu Ser Leu Leu Arg Arg Ser Ala
85 90 95
Ile Ala Ser Ser Pro Asp Arg Cys Ile Thr Leu Glu Lys Ser Thr Gln
100 105 110
Ser Val Gln Gly Pro Asn Ser Ala Ala Cys Gly Leu Phe Cys Cys Met
115 120 125
Phe Leu His Ala Phe Ala Asn Trp Pro Gln Thr Pro Met Asp His Asn
130 135 140
Pro Thr Met Asn Leu Ile Thr Gly Val Pro Asn Ser Met Leu Asn Ser
145 150 155 160
Pro Gln Val Gln Pro Thr Leu Arg Arg Asn Gln Glu Gln Leu Tyr Ser
165 170 175
Phe Leu Glu Arg His Ser Pro Tyr Phe Arg Ser His Ser Ala Gln Ile
180 185 190
Arg Ser Ala Thr Ser Phe Cys His Leu Lys Asn Met
195 200
<210> 24
<211> 1590
<212> DNA
<213> Artificial Sequence
<220>
<223> Description of Artificial Sequence: Synthetic
polynucleotide
<400> 24
taccggtcag cccttctcct cgtcgcgctt tggtgggggc tcgcgcctgc gccacgccgc 60
gctgcagggg gttggtacct cctgcacagc aggggcaggg gcagcggcgg cggaggggcc 120
cgcgggggtt ttttcgccta ctccgccgca tagctcaggc tcctgctcct tctgagtagt 180
gttctgcgcg accacggcgc gtgtgggtcg ggcgccggta gctggagccg ccgcctaaac 240
cggtaacgcg ggttcttctt tttcttcgcg ggaagagggt tcgggctcgc gggcggtagt 300
ggtctccatt agcacctgtc gctccttctt tctcttctac accgcgatgt ttaccaccca 360
aagtcgttgg gtggccacga ttagttcgta ccgtttcctc cattcgcgtg tcacgccgcc 420
gacttacttc tgctgggtca ccgcgcacca tacgcctgcg ttctccttct ccttctcggg 480
tcgcttcgcc tttcacttta atgccactac ttgggcgact cacacggcta gcacagacgc 540
accctcttcc cgtacctccg acgcgcgcgc gactacctgt tcatggtgca cctattgcta 600
gatttccgct tgaagtttga tgacggactg gttcaccttc gagaccgccg gcatacgttc 660
tggaccgact tgctcctcgt ggcgcccaac gtcgactgga agtggtcgtt gttctggaaa 720
cactgctact accccgctaa ggacgtccgc atggacgtca gcaaacgtct ccactggatg 780
ttcgtagtgc tcgggtgccc gacgcgcaac accgacgtgg cgacgcgact ctagcttccg 840
ctcgaattca cagatgtgcc ttcgtaatac tatttattcc tcgtgcacta actttaccta 900
cactgctcgc ttttgcccgt cgcgcgcgac ttcctcgtca gatcgttccg gttctagcac 960
ttcttggcca ccccggcttt acaccacgtc tagaggttgt ggctgcgttc cacgacgcac 1020
gtgctgcgcc ggacaggccg gttagtcaaa aggccgttca gaacgccgta caagaagaga 1080
cttccgcgtt tccgagtcca ccgaaaattc gtctagttcc gaaaatacgt ccgcgacata 1140
ggattgcggg tctggcccgt gccagtggaa aactacggtg atgccacgct cacgttgagt 1200
ttcggacccg tgcgcgggaa aaacccttcc gtcgatggtt tcaactgagg caagcgggac 1260
tcgttgcgcc tcctggacct gcgcctagac tagaggctgt tctcgcacga ccggtcgcac 1320
gtggtgggcc gcgactatca caaggtcacg acgttgggac acatagcgtt gagcgcgcgc 1380
gtcccgcctc cggggttgac gctgaagttc tatagccgcg ggctggacga tttgcgcaac 1440
cactaccacg cgtcggacac ctcacttttg aagtggctcg acggcgccta ccaacacgga 1500
ctcaaattca cctcgtgatt tgtggtcata gcgttgcaca gggacggtca ccgcgtatcg 1560
ctacgcgccg tcttggggaa actaaaaatt 1590
<210> 25
<211> 529
<212> PRT
<213> Artificial Sequence
<220>
<223> Description of Artificial Sequence: Synthetic
polypeptide
<400> 25
Met Ala Ser Arg Glu Glu Glu Gln Arg Glu Thr Thr Pro Glu Arg Gly
1 5 10 15
Arg Gly Ala Ala Arg Arg Pro Pro Thr Met Glu Asp Val Ser Ser Pro
20 25 30
Ser Pro Ser Pro Pro Pro Pro Arg Ala Pro Pro Lys Lys Arg Met Arg
35 40 45
Arg Arg Ile Glu Ser Glu Asp Glu Glu Asp Ser Ser Gln Asp Ala Leu
50 55 60
Val Pro Arg Thr Pro Ser Pro Arg Pro Ser Thr Ser Ala Ala Asp Leu
65 70 75 80
Ala Ile Ala Pro Lys Lys Lys Lys Lys Arg Pro Ser Pro Lys Pro Glu
85 90 95
Arg Pro Pro Ser Pro Glu Val Ile Val Asp Ser Glu Glu Glu Arg Glu
100 105 110
Asp Val Ala Leu Gln Met Val Gly Phe Ser Asn Pro Pro Val Leu Ile
115 120 125
Lys His Gly Lys Gly Gly Lys Arg Thr Val Arg Arg Leu Asn Glu Asp
130 135 140
Asp Pro Val Ala Arg Gly Met Arg Thr Gln Glu Glu Glu Glu Glu Pro
145 150 155 160
Ser Glu Ala Glu Ser Glu Ile Thr Val Met Asn Pro Leu Ser Val Pro
165 170 175
Ile Val Ser Ala Trp Glu Lys Gly Met Glu Ala Ala Arg Ala Leu Met
180 185 190
Asp Lys Tyr His Val Asp Asn Asp Leu Lys Ala Asn Phe Lys Leu Leu
195 200 205
Pro Asp Gln Val Glu Ala Leu Ala Ala Val Cys Lys Thr Trp Leu Asn
210 215 220
Glu Glu His Arg Gly Leu Gln Leu Thr Phe Thr Ser Asn Lys Thr Phe
225 230 235 240
Val Thr Met Met Gly Arg Phe Leu Gln Ala Tyr Leu Gln Ser Phe Ala
245 250 255
Glu Val Thr Tyr Lys His His Glu Pro Thr Gly Cys Ala Leu Trp Leu
260 265 270
His Arg Cys Ala Glu Ile Glu Gly Glu Leu Lys Cys Leu His Gly Ser
275 280 285
Ile Met Ile Asn Lys Glu His Val Ile Glu Met Asp Val Thr Ser Glu
290 295 300
Asn Gly Gln Arg Ala Leu Lys Glu Gln Ser Ser Lys Ala Lys Ile Val
305 310 315 320
Lys Asn Arg Trp Gly Arg Asn Val Val Gln Ile Ser Asn Thr Asp Ala
325 330 335
Arg Cys Cys Val His Asp Ala Ala Cys Pro Ala Asn Gln Phe Ser Gly
340 345 350
Lys Ser Cys Gly Met Phe Phe Ser Glu Gly Ala Lys Ala Gln Val Ala
355 360 365
Phe Lys Gln Ile Lys Ala Phe Met Gln Ala Leu Tyr Pro Asn Ala Gln
370 375 380
Thr Gly His Gly His Leu Leu Met Pro Leu Arg Cys Glu Cys Asn Ser
385 390 395 400
Lys Pro Gly His Ala Pro Phe Leu Gly Arg Gln Leu Pro Lys Leu Thr
405 410 415
Pro Phe Ala Leu Ser Asn Ala Glu Asp Leu Asp Ala Asp Leu Ile Ser
420 425 430
Asp Lys Ser Val Leu Ala Ser Val His His Pro Ala Leu Ile Val Phe
435 440 445
Gln Cys Cys Asn Pro Val Tyr Arg Asn Ser Arg Ala Gln Gly Gly Gly
450 455 460
Pro Asn Cys Asp Phe Lys Ile Ser Ala Pro Asp Leu Leu Asn Ala Leu
465 470 475 480
Val Met Val Arg Ser Leu Trp Ser Glu Asn Phe Thr Glu Leu Pro Arg
485 490 495
Met Val Val Pro Glu Phe Lys Trp Ser Thr Lys His Gln Tyr Arg Asn
500 505 510
Val Ser Leu Pro Val Ala His Ser Asp Ala Arg Gln Asn Pro Phe Asp
515 520 525
Phe
<210> 26
<211> 278
<212> DNA
<213> Artificial Sequence
<220>
<223> Description of Artificial Sequence: Synthetic
polynucleotide
<400> 26
agctccactc ggggtgcaag acgaagtgag aggggtagag gggggggagg ggtgggggtt 60
aaaacataaa taaataaaaa attaataaaa cacgtcgcta cccccgcccc cccccccccc 120
ccgcgcgcgg tccgccccgc cccgccccgc tccccgcccc gccccgctcc gcctctccac 180
gccgccgtcg gttagtctcg ccgcgcgagg ctttcaaagg aaaataccgc tccgccgccg 240
ccgccgccgg gatatttttc gcttcgcgcg ccgcccgc 278
<210> 27
<211> 35
<212> DNA
<213> Artificial Sequence
<220>
<223> Description of Artificial Sequence: Synthetic
oligonucleotide
<400> 27
tgccgcgtct gccgttccca cccccattta ttagt 35
<210> 28
<211> 135
<212> DNA
<213> Artificial Sequence
<220>
<223> Description of Artificial Sequence: Synthetic
polynucleotide
<400> 28
gatccagaca tgataagata cattgatgag tttggacaaa ccacaactag aatgcagtga 60
aaaaaatgct ttatttgtga aatttgtgat gctattgctt tatttgtaac cattataagc 120
tgcaataaac aagtt 135
<210> 29
<211> 3705
<212> DNA
<213> Human alphaherpesvirus 1
<400> 29
atgttttccg gtggcggcgg cccgctgtcc cccggaggaa agtcggcggc cagggcggcg 60
tccgggtttt ttgcgcccgc cggccctcgc ggagccagcc ggggaccccc gccttgtttg 120
aggcaaaact tttacaaccc ctacctcgcc ccagtcggga cgcaacagaa gccgaccggg 180
ccaacccagc gccatacgta ctatagcgaa tgcgatgaat ttcgattcat cgccccgcgg 240
gtgctggacg aggatgcccc cccggagaag cgcgccgggg tgcacgacgg tcacctcaag 300
cgcgccccca aggtgtactg cgggggggac gagcgcgacg tcctccgcgt cgggtcgggc 360
ggcttctggc cgcggcgctc gcgcctgtgg ggcggcgtgg accacgcccc ggcggggttc 420
aaccccaccg tcaccgtctt tcacgtgtac gacatcctgg agaacgtgga gcacgcgtac 480
ggcatgcgcg cggcccagtt ccacgcgcgg tttatggacg ccatcacacc gacggggacc 540
gtcatcacgc tcctgggcct gactccggaa ggccaccggg tggccgttca cgtttacggc 600
acgcggcagt acttttacat gaacaaggag gaggtcgaca ggcacctaca atgccgcgcc 660
ccacgagatc tctgcgagcg catggccgcg gccctgcgcg agtccccggg cgcgtcgttc 720
cgcggcatct ccgcggacca cttcgaggcg gaggtggtgg agcgcaccga cgtgtactac 780
tacgagacgc gccccgctct gttttaccgc gtctacgtcc gaagcgggcg cgtgctgtcg 840
tacctgtgcg acaacttctg cccggccatc aagaagtacg agggtggggt cgacgccacc 900
acccggttca tcctggacaa ccccgggttc gtcaccttcg gctggtaccg tctcaaaccg 960
ggccggaaca acacgctagc ccagccgcgg gccccgatgg ccttcgggac atccagcgac 1020
gtcgagttta actgtacggc ggacaacctg gccatcgagg ggggcatgag cgacctaccg 1080
gcatacaagc tcatgtgctt cgatatcgaa tgcaaggcgg ggggggagga cgagctggcc 1140
tttccggtgg ccgggcaccc ggaggacctg gtcatccaga tatcctgtct gctctacgac 1200
ctgtccacca ccgccctgga gcacgtcctc ctgttttcgc tcggttcctg cgacctcccc 1260
gaatcccacc tgaacgagct ggcggccagg ggcctgccca cgcccgtggt tctggaattc 1320
gacagcgaat tcgagatgct gttggccttc atgacccttg tgaaacagta cggccccgag 1380
ttcgtgaccg ggtacaacat catcaacttc gactggccct tcttgctggc caagctgacg 1440
gacatttaca aggtccccct ggacgggtac ggccgcatga acggccgggg cgtgtttcgc 1500
gtgtgggaca taggccagag ccacttccag aagcgcagca agataaaggt gaacggcatg 1560
gtgaacatcg acatgtacgg gattataacc gacaagatca agctctcgag ctacaagctc 1620
aacgccgtgg ccgaagccgt cctgaaggac aagaagaagg acctgagcta tcgcgacatc 1680
cccgcctact acgccgccgg gcccgcgcaa cgcggggtga tcggcgagta ctgcatacag 1740
gattccctgc tggtgggcca gctgtttttt aagtttttgc cccatctgga gctctcggcc 1800
gtcgcgcgct tggcgggtat taacatcacc cgcaccatct acgacggcca gcagatccgc 1860
gtctttacgt gcctgctgcg cctggccgac cagaagggct ttattctgcc ggacacccag 1920
gggcgattta ggggcgccgg gggggaggcg cccaagcgtc cggccgcagc ccgggaggac 1980
gaggagcggc cagaggagga gggggaggac gaggacgaac gcgaggaggg cgggggcgag 2040
cgggagccgg agggcgcgcg ggagaccgcc ggcaggcacg tggggtacca gggggccagg 2100
gtccttgacc ccacttccgg gtttcacgtg aaccccgtgg tggtgttcga ctttgccagc 2160
ctgtacccca gcatcatcca ggcccacaac ctgtgcttca gcacgctctc cctgagggcc 2220
gacgcagtgg cgcacctgga ggcgggcaag gactacctgg agatcgaggt gggggggcga 2280
cggctgttct tcgtcaaggc tcacgtgcga gagagcctcc tcagcatcct cctgcgggac 2340
tggctcgcca tgcgaaagca gatccgctcg cggattcccc agagcagccc cgaggaggcc 2400
gtgctcctgg acaagcagca ggccgccatc aaggtcgtgt gtaactcggt gtacgggttc 2460
acgggagtgc agcacggact cctgccgtgc ctgcacgttg ccgcgacggt gacgaccatc 2520
ggccgcgaga tgctgctcgc gacccgcgag tacgtccacg cgcgctgggc ggccttcgaa 2580
cagctcctgg ccgatttccc ggaggcggcc gacatgcgcg cccccgggcc ctattccatg 2640
cgcatcatct acggggacac ggactccatc tttgtgctgt gccgcggcct cacggccgcc 2700
gggctgacgg ccgtgggcga caagatggcg agccacatct cgcgcgcgct gtttctgccc 2760
cccatcaaac tcgagtgcga aaagacgttc accaagctgc tgctgatcgc caagaaaaag 2820
tacatcggcg tcatctacgg gggtaagatg ctcatcaagg gcgtggatct ggtgcgcaaa 2880
aacaactgcg cgtttatcaa ccgcacctcc agggccctgg tcgacctgct gttttacgac 2940
gataccgtct ccggagccgc cgcggcgtta gccgagcgcc ccgcggagga gtggctggcg 3000
cgacccctgc ccgagggact gcaggcgttc ggggccgtcc tcgtagacgc ccatcggcgc 3060
atcaccgacc cggagaggga catccaggac tttgtcctca ccgccgaact gagcagacac 3120
ccgcgcgcgt acaccaacaa gcgcctggcc cacctgacgg tgtattacaa gctcatggcc 3180
cgccgcgcgc aggtcccgtc catcaaggac cggatcccgt acgtgatcgt ggcccagacc 3240
cgcgaggtag aggagacggt cgcgcggctg gccgccctcc gcgagctaga cgccgccgcc 3300
ccaggggacg agcccgcccc ccccgcggcc ctgccctccc cggccaagcg cccccgggag 3360
acgccgtcgc ctgccgaccc cccgggaggc gcgtccaagc cccgcaagct gctggtgtcc 3420
gagctggccg aggatcccgc atacgccatt gcccacggcg tcgccctgaa cacggactat 3480
tacttctccc acctgttggg ggcggcgtgc gtgacattca aggccctgtt tgggaataac 3540
gccaagatca ccgagagtct gttaaaaagg tttattcccg aagtgtggca ccccccggac 3600
gacgtggccg cgcggctccg gaccgcaggg ttcggggcgg tgggtgccgg cgctacggcg 3660
gaggaaactc gtcgaatgtt gcatagagcc tttgatactc tagca 3705
<210> 30
<211> 1235
<212> PRT
<213> Human alphaherpesvirus 1
<400> 30
Met Phe Ser Gly Gly Gly Gly Pro Leu Ser Pro Gly Gly Lys Ser Ala
1 5 10 15
Ala Arg Ala Ala Ser Gly Phe Phe Ala Pro Ala Gly Pro Arg Gly Ala
20 25 30
Ser Arg Gly Pro Pro Pro Cys Leu Arg Gln Asn Phe Tyr Asn Pro Tyr
35 40 45
Leu Ala Pro Val Gly Thr Gln Gln Lys Pro Thr Gly Pro Thr Gln Arg
50 55 60
His Thr Tyr Tyr Ser Glu Cys Asp Glu Phe Arg Phe Ile Ala Pro Arg
65 70 75 80
Val Leu Asp Glu Asp Ala Pro Pro Glu Lys Arg Ala Gly Val His Asp
85 90 95
Gly His Leu Lys Arg Ala Pro Lys Val Tyr Cys Gly Gly Asp Glu Arg
100 105 110
Asp Val Leu Arg Val Gly Ser Gly Gly Phe Trp Pro Arg Arg Ser Arg
115 120 125
Leu Trp Gly Gly Val Asp His Ala Pro Ala Gly Phe Asn Pro Thr Val
130 135 140
Thr Val Phe His Val Tyr Asp Ile Leu Glu Asn Val Glu His Ala Tyr
145 150 155 160
Gly Met Arg Ala Ala Gln Phe His Ala Arg Phe Met Asp Ala Ile Thr
165 170 175
Pro Thr Gly Thr Val Ile Thr Leu Leu Gly Leu Thr Pro Glu Gly His
180 185 190
Arg Val Ala Val His Val Tyr Gly Thr Arg Gln Tyr Phe Tyr Met Asn
195 200 205
Lys Glu Glu Val Asp Arg His Leu Gln Cys Arg Ala Pro Arg Asp Leu
210 215 220
Cys Glu Arg Met Ala Ala Ala Leu Arg Glu Ser Pro Gly Ala Ser Phe
225 230 235 240
Arg Gly Ile Ser Ala Asp His Phe Glu Ala Glu Val Val Glu Arg Thr
245 250 255
Asp Val Tyr Tyr Tyr Glu Thr Arg Pro Ala Leu Phe Tyr Arg Val Tyr
260 265 270
Val Arg Ser Gly Arg Val Leu Ser Tyr Leu Cys Asp Asn Phe Cys Pro
275 280 285
Ala Ile Lys Lys Tyr Glu Gly Gly Val Asp Ala Thr Thr Arg Phe Ile
290 295 300
Leu Asp Asn Pro Gly Phe Val Thr Phe Gly Trp Tyr Arg Leu Lys Pro
305 310 315 320
Gly Arg Asn Asn Thr Leu Ala Gln Pro Arg Ala Pro Met Ala Phe Gly
325 330 335
Thr Ser Ser Asp Val Glu Phe Asn Cys Thr Ala Asp Asn Leu Ala Ile
340 345 350
Glu Gly Gly Met Ser Asp Leu Pro Ala Tyr Lys Leu Met Cys Phe Asp
355 360 365
Ile Glu Cys Lys Ala Gly Gly Glu Asp Glu Leu Ala Phe Pro Val Ala
370 375 380
Gly His Pro Glu Asp Leu Val Ile Gln Ile Ser Cys Leu Leu Tyr Asp
385 390 395 400
Leu Ser Thr Thr Ala Leu Glu His Val Leu Leu Phe Ser Leu Gly Ser
405 410 415
Cys Asp Leu Pro Glu Ser His Leu Asn Glu Leu Ala Ala Arg Gly Leu
420 425 430
Pro Thr Pro Val Val Leu Glu Phe Asp Ser Glu Phe Glu Met Leu Leu
435 440 445
Ala Phe Met Thr Leu Val Lys Gln Tyr Gly Pro Glu Phe Val Thr Gly
450 455 460
Tyr Asn Ile Ile Asn Phe Asp Trp Pro Phe Leu Leu Ala Lys Leu Thr
465 470 475 480
Asp Ile Tyr Lys Val Pro Leu Asp Gly Tyr Gly Arg Met Asn Gly Arg
485 490 495
Gly Val Phe Arg Val Trp Asp Ile Gly Gln Ser His Phe Gln Lys Arg
500 505 510
Ser Lys Ile Lys Val Asn Gly Met Val Asn Ile Asp Met Tyr Gly Ile
515 520 525
Ile Thr Asp Lys Ile Lys Leu Ser Ser Tyr Lys Leu Asn Ala Val Ala
530 535 540
Glu Ala Val Leu Lys Asp Lys Lys Lys Asp Leu Ser Tyr Arg Asp Ile
545 550 555 560
Pro Ala Tyr Tyr Ala Ala Gly Pro Ala Gln Arg Gly Val Ile Gly Glu
565 570 575
Tyr Cys Ile Gln Asp Ser Leu Leu Val Gly Gln Leu Phe Phe Lys Phe
580 585 590
Leu Pro His Leu Glu Leu Ser Ala Val Ala Arg Leu Ala Gly Ile Asn
595 600 605
Ile Thr Arg Thr Ile Tyr Asp Gly Gln Gln Ile Arg Val Phe Thr Cys
610 615 620
Leu Leu Arg Leu Ala Asp Gln Lys Gly Phe Ile Leu Pro Asp Thr Gln
625 630 635 640
Gly Arg Phe Arg Gly Ala Gly Gly Glu Ala Pro Lys Arg Pro Ala Ala
645 650 655
Ala Arg Glu Asp Glu Glu Arg Pro Glu Glu Glu Gly Glu Asp Glu Asp
660 665 670
Glu Arg Glu Glu Gly Gly Gly Glu Arg Glu Pro Glu Gly Ala Arg Glu
675 680 685
Thr Ala Gly Arg His Val Gly Tyr Gln Gly Ala Arg Val Leu Asp Pro
690 695 700
Thr Ser Gly Phe His Val Asn Pro Val Val Val Phe Asp Phe Ala Ser
705 710 715 720
Leu Tyr Pro Ser Ile Ile Gln Ala His Asn Leu Cys Phe Ser Thr Leu
725 730 735
Ser Leu Arg Ala Asp Ala Val Ala His Leu Glu Ala Gly Lys Asp Tyr
740 745 750
Leu Glu Ile Glu Val Gly Gly Arg Arg Leu Phe Phe Val Lys Ala His
755 760 765
Val Arg Glu Ser Leu Leu Ser Ile Leu Leu Arg Asp Trp Leu Ala Met
770 775 780
Arg Lys Gln Ile Arg Ser Arg Ile Pro Gln Ser Ser Pro Glu Glu Ala
785 790 795 800
Val Leu Leu Asp Lys Gln Gln Ala Ala Ile Lys Val Val Cys Asn Ser
805 810 815
Val Tyr Gly Phe Thr Gly Val Gln His Gly Leu Leu Pro Cys Leu His
820 825 830
Val Ala Ala Thr Val Thr Thr Ile Gly Arg Glu Met Leu Leu Ala Thr
835 840 845
Arg Glu Tyr Val His Ala Arg Trp Ala Ala Phe Glu Gln Leu Leu Ala
850 855 860
Asp Phe Pro Glu Ala Ala Asp Met Arg Ala Pro Gly Pro Tyr Ser Met
865 870 875 880
Arg Ile Ile Tyr Gly Asp Thr Asp Ser Ile Phe Val Leu Cys Arg Gly
885 890 895
Leu Thr Ala Ala Gly Leu Thr Ala Val Gly Asp Lys Met Ala Ser His
900 905 910
Ile Ser Arg Ala Leu Phe Leu Pro Pro Ile Lys Leu Glu Cys Glu Lys
915 920 925
Thr Phe Thr Lys Leu Leu Leu Ile Ala Lys Lys Lys Tyr Ile Gly Val
930 935 940
Ile Tyr Gly Gly Lys Met Leu Ile Lys Gly Val Asp Leu Val Arg Lys
945 950 955 960
Asn Asn Cys Ala Phe Ile Asn Arg Thr Ser Arg Ala Leu Val Asp Leu
965 970 975
Leu Phe Tyr Asp Asp Thr Val Ser Gly Ala Ala Ala Ala Leu Ala Glu
980 985 990
Arg Pro Ala Glu Glu Trp Leu Ala Arg Pro Leu Pro Glu Gly Leu Gln
995 1000 1005
Ala Phe Gly Ala Val Leu Val Asp Ala His Arg Arg Ile Thr Asp
1010 1015 1020
Pro Glu Arg Asp Ile Gln Asp Phe Val Leu Thr Ala Glu Leu Ser
1025 1030 1035
Arg His Pro Arg Ala Tyr Thr Asn Lys Arg Leu Ala His Leu Thr
1040 1045 1050
Val Tyr Tyr Lys Leu Met Ala Arg Arg Ala Gln Val Pro Ser Ile
1055 1060 1065
Lys Asp Arg Ile Pro Tyr Val Ile Val Ala Gln Thr Arg Glu Val
1070 1075 1080
Glu Glu Thr Val Ala Arg Leu Ala Ala Leu Arg Glu Leu Asp Ala
1085 1090 1095
Ala Ala Pro Gly Asp Glu Pro Ala Pro Pro Ala Ala Leu Pro Ser
1100 1105 1110
Pro Ala Lys Arg Pro Arg Glu Thr Pro Ser Pro Ala Asp Pro Pro
1115 1120 1125
Gly Gly Ala Ser Lys Pro Arg Lys Leu Leu Val Ser Glu Leu Ala
1130 1135 1140
Glu Asp Pro Ala Tyr Ala Ile Ala His Gly Val Ala Leu Asn Thr
1145 1150 1155
Asp Tyr Tyr Phe Ser His Leu Leu Gly Ala Ala Cys Val Thr Phe
1160 1165 1170
Lys Ala Leu Phe Gly Asn Asn Ala Lys Ile Thr Glu Ser Leu Leu
1175 1180 1185
Lys Arg Phe Ile Pro Glu Val Trp His Pro Pro Asp Asp Val Ala
1190 1195 1200
Ala Arg Leu Arg Thr Ala Gly Phe Gly Ala Val Gly Ala Gly Ala
1205 1210 1215
Thr Ala Glu Glu Thr Arg Arg Met Leu His Arg Ala Phe Asp Thr
1220 1225 1230
Leu Ala
1235
<210> 31
<211> 1461
<212> DNA
<213> Human alphaherpesvirus 1
<400> 31
acggattccc ctggcggtgt ggcccccgcc tcccccgtgg aggacgcgtc ggacgcgtcc 60
ctcgggcagc cggaggaggg ggcgccctgc caggtggtcc tgcagggcgc cgaacttaat 120
ggaatcctac aggcgtttgc cccgctgcgc acgagccttc tggactcgct tctggttatg 180
ggcgaccggg gcatccttat ccataacacg atctttgggg agcaggtgtt cctgcccctg 240
gaacactcgc aattcagtcg gtatcgctgg cgcggaccca cggcggcgtt cctgtctctc 300
gtggaccaga agcgctccct cctgagcgtg tttcgcgcca accagtaccc ggacctacgt 360
cgggtggagt tggcgatcac gggccaggcc ccgtttcgca cgctggttca gcgcatatgg 420
acgacgacgt ccgacggcga ggccgttgag ctagccagcg agacgctgat gaagcgcgaa 480
ctgacgagct ttgtggtgct ggttccccag ggaacccccg acgttcagtt gcgcctgacg 540
aggccgcagc tcaccaaggt ccttaacgcg accggggccg atagtgccac gcccaccacg 600
ttcgagctcg gggttaacgg caaattttcc gtgttcacca cgagtacctg cgtcaccttt 660
gctgcccgcg aggagggcgt gtcgtccagc accagcaccc aggtccagat cctgtccaac 720
gcgctcacca aggcgggcca ggccgccgcg aacgccaaga cggtgtacgg ggaaaatacc 780
catcgcacct tctctgtggt cgtcgacgat tgcagcatgc gggcggtgct ccggcgactg 840
caggtcggcg ggggcaccct caagttcttc ctcacgaccc ccgtccccag tctgtgcgtc 900
accgccaccg gtcccaacgc ggtatcggcg gtatttctcc tgaaacccca gaagatttgc 960
ctggactggc tgggtcatag ccaggggtct ccttcagccg ggagctcggc ctcccgggcc 1020
tctgggagcg agccaacaga cagccaggac tccgcgtcgg acgcggtcag ccacggcgat 1080
ccggaagacc tcgatggcgc tgcccgggcg ggagaggcgg gggccttgca tgcctgtccg 1140
atgccgtcgt cgaccacgcg ggtcactccc acgaccaagc gggggcgctc ggggggcgag 1200
gatgcgcgcg cggacacggc cctaaagaaa cctaagacgg ggtcgcccac cgcacccccg 1260
cccgcagatc cagtccccct ggacacggag gacgactccg atgcggcgga cgggacggcg 1320
gcccgtcccg ccgctccaga cgcccggagc ggaagccgtt acgcgtgtta ctttcgcgac 1380
ctcccgaccg gagaagcaag ccccggcgcc ttctccgcct tccggggggg cccccaaacc 1440
ccgtatggtt ttggattccc c 1461
<210> 32
<211> 487
<212> PRT
<213> Human alphaherpesvirus 1
<400> 32
Thr Asp Ser Pro Gly Gly Val Ala Pro Ala Ser Pro Val Glu Asp Ala
1 5 10 15
Ser Asp Ala Ser Leu Gly Gln Pro Glu Glu Gly Ala Pro Cys Gln Val
20 25 30
Val Leu Gln Gly Ala Glu Leu Asn Gly Ile Leu Gln Ala Phe Ala Pro
35 40 45
Leu Arg Thr Ser Leu Leu Asp Ser Leu Leu Val Met Gly Asp Arg Gly
50 55 60
Ile Leu Ile His Asn Thr Ile Phe Gly Glu Gln Val Phe Leu Pro Leu
65 70 75 80
Glu His Ser Gln Phe Ser Arg Tyr Arg Trp Arg Gly Pro Thr Ala Ala
85 90 95
Phe Leu Ser Leu Val Asp Gln Lys Arg Ser Leu Leu Ser Val Phe Arg
100 105 110
Ala Asn Gln Tyr Pro Asp Leu Arg Arg Val Glu Leu Ala Ile Thr Gly
115 120 125
Gln Ala Pro Phe Arg Thr Leu Val Gln Arg Ile Trp Thr Thr Thr Ser
130 135 140
Asp Gly Glu Ala Val Glu Leu Ala Ser Glu Thr Leu Met Lys Arg Glu
145 150 155 160
Leu Thr Ser Phe Val Val Leu Val Pro Gln Gly Thr Pro Asp Val Gln
165 170 175
Leu Arg Leu Thr Arg Pro Gln Leu Thr Lys Val Leu Asn Ala Thr Gly
180 185 190
Ala Asp Ser Ala Thr Pro Thr Thr Phe Glu Leu Gly Val Asn Gly Lys
195 200 205
Phe Ser Val Phe Thr Thr Ser Thr Cys Val Thr Phe Ala Ala Arg Glu
210 215 220
Glu Gly Val Ser Ser Ser Thr Ser Thr Gln Val Gln Ile Leu Ser Asn
225 230 235 240
Ala Leu Thr Lys Ala Gly Gln Ala Ala Ala Asn Ala Lys Thr Val Tyr
245 250 255
Gly Glu Asn Thr His Arg Thr Phe Ser Val Val Val Asp Asp Cys Ser
260 265 270
Met Arg Ala Val Leu Arg Arg Leu Gln Val Gly Gly Gly Thr Leu Lys
275 280 285
Phe Phe Leu Thr Thr Pro Val Pro Ser Leu Cys Val Thr Ala Thr Gly
290 295 300
Pro Asn Ala Val Ser Ala Val Phe Leu Leu Lys Pro Gln Lys Ile Cys
305 310 315 320
Leu Asp Trp Leu Gly His Ser Gln Gly Ser Pro Ser Ala Gly Ser Ser
325 330 335
Ala Ser Arg Ala Ser Gly Ser Glu Pro Thr Asp Ser Gln Asp Ser Ala
340 345 350
Ser Asp Ala Val Ser His Gly Asp Pro Glu Asp Leu Asp Gly Ala Ala
355 360 365
Arg Ala Gly Glu Ala Gly Ala Leu His Ala Cys Pro Met Pro Ser Ser
370 375 380
Thr Thr Arg Val Thr Pro Thr Thr Lys Arg Gly Arg Ser Gly Gly Glu
385 390 395 400
Asp Ala Arg Ala Asp Thr Ala Leu Lys Lys Pro Lys Thr Gly Ser Pro
405 410 415
Thr Ala Pro Pro Pro Ala Asp Pro Val Pro Leu Asp Thr Glu Asp Asp
420 425 430
Ser Asp Ala Ala Asp Gly Thr Ala Ala Arg Pro Ala Ala Pro Asp Ala
435 440 445
Arg Ser Gly Ser Arg Tyr Ala Cys Tyr Phe Arg Asp Leu Pro Thr Gly
450 455 460
Glu Ala Ser Pro Gly Ala Phe Ser Ala Phe Arg Gly Gly Pro Gln Thr
465 470 475 480
Pro Tyr Gly Phe Gly Phe Pro
485
<210> 33
<211> 57
<212> DNA
<213> Artificial Sequence
<220>
<223> Description of Artificial Sequence: Synthetic
oligonucleotide
<400> 33
gcaacaaact tctctctgct gaaacaagcc ggagatgtcg aagagaatcc tggaccg 57
<210> 34
<211> 19
<212> PRT
<213> Artificial Sequence
<220>
<223> Description of Artificial Sequence: Synthetic
peptide
<400> 34
Ala Thr Asn Phe Ser Leu Leu Lys Gln Ala Gly Asp Val Glu Glu Asn
1 5 10 15
Pro Gly Pro
<210> 35
<211> 212
<212> DNA
<213> Artificial Sequence
<220>
<223> Description of Artificial Sequence: Synthetic
polynucleotide
<400> 35
gggcagagcg cacatcgccc acagtccccg agaagttggg gggaggggtc ggcaattgaa 60
ccggtgccta gagaaggtgg cgcggggtaa actgggaaag tgatgtcgtg tactggctcc 120
gcctttttcc cgagggtggg ggagaaccgt atataagtgc agtagtcgcc gtgaacgttc 180
tttttcgcaa cgggtttgcc gccagaacac ag 212
<210> 36
<211> 56
<212> DNA
<213> Artificial Sequence
<220>
<223> Description of Artificial Sequence: Synthetic
oligonucleotide
<400> 36
aataaaggaa atttattttc attgcaatag tgtgttggaa ttttttgtgt ctctca 56
<210> 37
<211> 3588
<212> DNA
<213> Human alphaherpesvirus 1
<400> 37
atggagacaa agcccaagac ggcaaccacc atcaaggtcc cccccgggcc cctgggatac 60
gtgtacgctc gcgcgtgtcc gtccgaaggc atcgagcttc tggcgttact gtcggcacgc 120
agcggcgatt ccgacgtcgc cgtggcgccc ctggtcgtgg gcctgaccgt ggagagcggc 180
tttgaggcca acgtggccgt ggtcgtgggt tctcgcacga cggggctcgg gggtaccgcg 240
gtgtccctga aactgacgcc ctcgcactac agctcgtccg tgtacgtctt tcacggcggc 300
cggcacctgg accccagcac ccaggccccg aacctgacgc gactttgcga gcgggcacgc 360
cgccattttg gcttttcgga ctacaccccc cggcccggcg acctcaaaca cgagacgacg 420
ggggaggcgc tgtgtgagcg cctcggcctg gacccggacc gcgccctcct gtatctggtc 480
gttaccgagg gcttcaagga ggccgtgtgc atcaacaaca cctttctgca cctgggaggc 540
tcggacaagg taaccatagg cggggcggag gtgcaccgca tacccgtgta cccgttgcag 600
ctgttcatgc cggattttag ccgtgtcatc gcagagccgt tcaacgccaa ccaccgatcg 660
atcggggaga attttaccta cccgcttccg ttttttaacc gccccctcaa ccgcctcctg 720
ttcgaggcgg tcgtgggacc cgccgccgtg gcactgcgat gccgaaacgt ggacgccgtg 780
gcccgcgccg ccgcccacct ggcgtttgac gaaaaccacg agggcgccgc cctccccgcc 840
gacattacgt tcacggcctt cgaagccagc cagggtaaga ccccgcgggg cgggcgcgac 900
ggcggcggca agggcccggc gggcgggttc gaacagcgcc tggcctccgt catggccgga 960
gacgccgccc tggccctcga gtctatcgtg tcgatggccg tctttgacga gccgcccacc 1020
gacatctccg cgtggccgct gttcgagggc caggacacgg ccgcggcccg cgccaacgcc 1080
gtcggggcgt acctggcgcg cgccgcggga ctcgtggggg ccatggtatt tagcaccaac 1140
tcggccctcc atctcaccga ggtggacgac gccggcccgg cggacccaaa ggaccacagc 1200
aaaccctcct tttaccgctt cttcctcgtg cccgggaccc acgtggcggc caacccacag 1260
gtggaccgcg agggacacgt ggtgcccggg ttcgagggtc ggcccaccgc gcccctcgtc 1320
ggcggaaccc aggaatttgc cggcgagcac ctggccatgc tgtgtgggtt ttccccggcg 1380
ctgctggcca agatgctgtt ttacctggag cgctgcgacg gcggcgtgat cgtcgggcgc 1440
caggagatgg acgtgtttcg atacgtcgcg gactccaacc agaccgacgt gccctgtaac 1500
ctatgcacct tcgacacgcg ccacgcctgc gtacacacga cgctcatgcg cctccgggcg 1560
cgccatccaa agttcgccag cgccgcccgc ggagccatcg gcgtcttcgg gaccatgaac 1620
agcatgtata gcgactgcga cgtgctggga aactacgccg ccttctcggc cctgaagcgc 1680
gcggacggat ccgagaccgc ccggaccatc atgcaggaga cgtaccgcgc ggcgaccgag 1740
cgcgtcatgg ccgaactcga gaccctgcag tacgtggacc aggcggtccc cacggccatg 1800
gggcggctgg agaccatcat caccaaccgc gaggccctgc atacggtggt gaacaacgtc 1860
aggcaggtcg tggaccgcga ggtggagcag ctgatgcgca acctggtgga ggggaggaac 1920
ttcaagtttc gcgacggtct gggcgaggcc aaccacgcca tgtccctgac gctggacccg 1980
tacgcgtgcg ggccgtgccc cctgcttcag cttctcgggc ggcgatccaa cctcgccgtg 2040
taccaggacc tggccctgag tcagtgccac ggggtgttcg ccgggcagtc ggtcgagggg 2100
cgcaactttc gcaatcaatt ccaaccggtg ctgcggcggc gcgtgatgga catgtttaac 2160
aacgggtttc tgtcggccaa aacgctgacg gtcgcgctct cggagggggc ggctatctgc 2220
gcccccagcc taacggcggg ccagacggcc cccgccgaga gcagcttcga gggcgacgtt 2280
gcccgcgtga ccctggggtt tcccaaggag ctgcgcgtca agagccgcgt gttgttcgcg 2340
ggcgcgagcg ccaacgcgtc cgaggccgcc aaggcgcggg tcgccagcct ccagagcgcc 2400
taccagaagc ccgacaagcg cgtggacatc ctcctcggac cgctgggctt tctgctcaag 2460
cagttccacg cggccatctt ccccaacggc aagcccccgg ggtccaacca gccgaacccg 2520
cagtggttct ggacggccct ccaacgcaac cagcttcccg cccggctcct gtcgcgcgag 2580
gacatcgaga ccatcgcgtt cattaaaaag ttttccctgg actacggcgc gataaacttt 2640
attaacctgg cccccaacaa cgtgagcgag ctggcgatgt actacatggc aaaccagatt 2700
ctgcggtact gcgatcactc gacatacttc atcaacaccc ttacggccat catcgcgggg 2760
tcccgccgtc cccccagcgt gcaggctgcc gccgcgtggt ccgcgcaggg cggggcgggc 2820
ctggaggccg gggcccgcgc gctgatggac gccgtggacg cgcatccggg cgcgtggacg 2880
tccatgttcg ccagctgcaa cctgctgcgg cccgtcatgg cggcgcgccc catggtcgtg 2940
ttggggttga gcatcagcaa gtactacggc atggccggca acgaccgtgt gtttcaggcc 3000
gggaactggg ccagcctgat gggcggcaaa aacgcgtgcc cgctccttat ttttgaccgc 3060
acccgcaagt tcgtcctggc ctgtccccgg gccgggtttg tgtgcgcggc ctcaagcctc 3120
ggcggcggag cgcacgaaag ctcgctgtgc gagcagctcc ggggcattat ctccgagggc 3180
ggggcggccg tcgccagtag cgtgttcgtg gcgaccgtga aaagcctggg gccccgcacc 3240
cagcagctgc agatcgagga ctggctggcg ctcctggagg acgagtacct aagcgaggag 3300
atgatggagc tgaccgcgcg tgccctggag cgcggcaacg gcgagtggtc gacggacgcg 3360
gccctggagg tggcgcacga ggccgaggcc ctagtcagcc aactcggcaa cgccggggag 3420
gtgtttaact ttggggattt tggctgcgag gacgacaacg cgacgccgtt cggcggcccg 3480
ggggccccgg gaccggcatt tgccggccgc aaacgggcgt tccacgggga tgacccgttt 3540
ggggaggggc cccccgacaa aaagggagac ctgacgttgg atatgctg 3588
<210> 38
<211> 1196
<212> PRT
<213> Human alphaherpesvirus 1
<400> 38
Met Glu Thr Lys Pro Lys Thr Ala Thr Thr Ile Lys Val Pro Pro Gly
1 5 10 15
Pro Leu Gly Tyr Val Tyr Ala Arg Ala Cys Pro Ser Glu Gly Ile Glu
20 25 30
Leu Leu Ala Leu Leu Ser Ala Arg Ser Gly Asp Ser Asp Val Ala Val
35 40 45
Ala Pro Leu Val Val Gly Leu Thr Val Glu Ser Gly Phe Glu Ala Asn
50 55 60
Val Ala Val Val Val Gly Ser Arg Thr Thr Gly Leu Gly Gly Thr Ala
65 70 75 80
Val Ser Leu Lys Leu Thr Pro Ser His Tyr Ser Ser Ser Val Tyr Val
85 90 95
Phe His Gly Gly Arg His Leu Asp Pro Ser Thr Gln Ala Pro Asn Leu
100 105 110
Thr Arg Leu Cys Glu Arg Ala Arg Arg His Phe Gly Phe Ser Asp Tyr
115 120 125
Thr Pro Arg Pro Gly Asp Leu Lys His Glu Thr Thr Gly Glu Ala Leu
130 135 140
Cys Glu Arg Leu Gly Leu Asp Pro Asp Arg Ala Leu Leu Tyr Leu Val
145 150 155 160
Val Thr Glu Gly Phe Lys Glu Ala Val Cys Ile Asn Asn Thr Phe Leu
165 170 175
His Leu Gly Gly Ser Asp Lys Val Thr Ile Gly Gly Ala Glu Val His
180 185 190
Arg Ile Pro Val Tyr Pro Leu Gln Leu Phe Met Pro Asp Phe Ser Arg
195 200 205
Val Ile Ala Glu Pro Phe Asn Ala Asn His Arg Ser Ile Gly Glu Asn
210 215 220
Phe Thr Tyr Pro Leu Pro Phe Phe Asn Arg Pro Leu Asn Arg Leu Leu
225 230 235 240
Phe Glu Ala Val Val Gly Pro Ala Ala Val Ala Leu Arg Cys Arg Asn
245 250 255
Val Asp Ala Val Ala Arg Ala Ala Ala His Leu Ala Phe Asp Glu Asn
260 265 270
His Glu Gly Ala Ala Leu Pro Ala Asp Ile Thr Phe Thr Ala Phe Glu
275 280 285
Ala Ser Gln Gly Lys Thr Pro Arg Gly Gly Arg Asp Gly Gly Gly Lys
290 295 300
Gly Pro Ala Gly Gly Phe Glu Gln Arg Leu Ala Ser Val Met Ala Gly
305 310 315 320
Asp Ala Ala Leu Ala Leu Glu Ser Ile Val Ser Met Ala Val Phe Asp
325 330 335
Glu Pro Pro Thr Asp Ile Ser Ala Trp Pro Leu Phe Glu Gly Gln Asp
340 345 350
Thr Ala Ala Ala Arg Ala Asn Ala Val Gly Ala Tyr Leu Ala Arg Ala
355 360 365
Ala Gly Leu Val Gly Ala Met Val Phe Ser Thr Asn Ser Ala Leu His
370 375 380
Leu Thr Glu Val Asp Asp Ala Gly Pro Ala Asp Pro Lys Asp His Ser
385 390 395 400
Lys Pro Ser Phe Tyr Arg Phe Phe Leu Val Pro Gly Thr His Val Ala
405 410 415
Ala Asn Pro Gln Val Asp Arg Glu Gly His Val Val Pro Gly Phe Glu
420 425 430
Gly Arg Pro Thr Ala Pro Leu Val Gly Gly Thr Gln Glu Phe Ala Gly
435 440 445
Glu His Leu Ala Met Leu Cys Gly Phe Ser Pro Ala Leu Leu Ala Lys
450 455 460
Met Leu Phe Tyr Leu Glu Arg Cys Asp Gly Gly Val Ile Val Gly Arg
465 470 475 480
Gln Glu Met Asp Val Phe Arg Tyr Val Ala Asp Ser Asn Gln Thr Asp
485 490 495
Val Pro Cys Asn Leu Cys Thr Phe Asp Thr Arg His Ala Cys Val His
500 505 510
Thr Thr Leu Met Arg Leu Arg Ala Arg His Pro Lys Phe Ala Ser Ala
515 520 525
Ala Arg Gly Ala Ile Gly Val Phe Gly Thr Met Asn Ser Met Tyr Ser
530 535 540
Asp Cys Asp Val Leu Gly Asn Tyr Ala Ala Phe Ser Ala Leu Lys Arg
545 550 555 560
Ala Asp Gly Ser Glu Thr Ala Arg Thr Ile Met Gln Glu Thr Tyr Arg
565 570 575
Ala Ala Thr Glu Arg Val Met Ala Glu Leu Glu Thr Leu Gln Tyr Val
580 585 590
Asp Gln Ala Val Pro Thr Ala Met Gly Arg Leu Glu Thr Ile Ile Thr
595 600 605
Asn Arg Glu Ala Leu His Thr Val Val Asn Asn Val Arg Gln Val Val
610 615 620
Asp Arg Glu Val Glu Gln Leu Met Arg Asn Leu Val Glu Gly Arg Asn
625 630 635 640
Phe Lys Phe Arg Asp Gly Leu Gly Glu Ala Asn His Ala Met Ser Leu
645 650 655
Thr Leu Asp Pro Tyr Ala Cys Gly Pro Cys Pro Leu Leu Gln Leu Leu
660 665 670
Gly Arg Arg Ser Asn Leu Ala Val Tyr Gln Asp Leu Ala Leu Ser Gln
675 680 685
Cys His Gly Val Phe Ala Gly Gln Ser Val Glu Gly Arg Asn Phe Arg
690 695 700
Asn Gln Phe Gln Pro Val Leu Arg Arg Arg Val Met Asp Met Phe Asn
705 710 715 720
Asn Gly Phe Leu Ser Ala Lys Thr Leu Thr Val Ala Leu Ser Glu Gly
725 730 735
Ala Ala Ile Cys Ala Pro Ser Leu Thr Ala Gly Gln Thr Ala Pro Ala
740 745 750
Glu Ser Ser Phe Glu Gly Asp Val Ala Arg Val Thr Leu Gly Phe Pro
755 760 765
Lys Glu Leu Arg Val Lys Ser Arg Val Leu Phe Ala Gly Ala Ser Ala
770 775 780
Asn Ala Ser Glu Ala Ala Lys Ala Arg Val Ala Ser Leu Gln Ser Ala
785 790 795 800
Tyr Gln Lys Pro Asp Lys Arg Val Asp Ile Leu Leu Gly Pro Leu Gly
805 810 815
Phe Leu Leu Lys Gln Phe His Ala Ala Ile Phe Pro Asn Gly Lys Pro
820 825 830
Pro Gly Ser Asn Gln Pro Asn Pro Gln Trp Phe Trp Thr Ala Leu Gln
835 840 845
Arg Asn Gln Leu Pro Ala Arg Leu Leu Ser Arg Glu Asp Ile Glu Thr
850 855 860
Ile Ala Phe Ile Lys Lys Phe Ser Leu Asp Tyr Gly Ala Ile Asn Phe
865 870 875 880
Ile Asn Leu Ala Pro Asn Asn Val Ser Glu Leu Ala Met Tyr Tyr Met
885 890 895
Ala Asn Gln Ile Leu Arg Tyr Cys Asp His Ser Thr Tyr Phe Ile Asn
900 905 910
Thr Leu Thr Ala Ile Ile Ala Gly Ser Arg Arg Pro Pro Ser Val Gln
915 920 925
Ala Ala Ala Ala Trp Ser Ala Gln Gly Gly Ala Gly Leu Glu Ala Gly
930 935 940
Ala Arg Ala Leu Met Asp Ala Val Asp Ala His Pro Gly Ala Trp Thr
945 950 955 960
Ser Met Phe Ala Ser Cys Asn Leu Leu Arg Pro Val Met Ala Ala Arg
965 970 975
Pro Met Val Val Leu Gly Leu Ser Ile Ser Lys Tyr Tyr Gly Met Ala
980 985 990
Gly Asn Asp Arg Val Phe Gln Ala Gly Asn Trp Ala Ser Leu Met Gly
995 1000 1005
Gly Lys Asn Ala Cys Pro Leu Leu Ile Phe Asp Arg Thr Arg Lys
1010 1015 1020
Phe Val Leu Ala Cys Pro Arg Ala Gly Phe Val Cys Ala Ala Ser
1025 1030 1035
Ser Leu Gly Gly Gly Ala His Glu Ser Ser Leu Cys Glu Gln Leu
1040 1045 1050
Arg Gly Ile Ile Ser Glu Gly Gly Ala Ala Val Ala Ser Ser Val
1055 1060 1065
Phe Val Ala Thr Val Lys Ser Leu Gly Pro Arg Thr Gln Gln Leu
1070 1075 1080
Gln Ile Glu Asp Trp Leu Ala Leu Leu Glu Asp Glu Tyr Leu Ser
1085 1090 1095
Glu Glu Met Met Glu Leu Thr Ala Arg Ala Leu Glu Arg Gly Asn
1100 1105 1110
Gly Glu Trp Ser Thr Asp Ala Ala Leu Glu Val Ala His Glu Ala
1115 1120 1125
Glu Ala Leu Val Ser Gln Leu Gly Asn Ala Gly Glu Val Phe Asn
1130 1135 1140
Phe Gly Asp Phe Gly Cys Glu Asp Asp Asn Ala Thr Pro Phe Gly
1145 1150 1155
Gly Pro Gly Ala Pro Gly Pro Ala Phe Ala Gly Arg Lys Arg Ala
1160 1165 1170
Phe His Gly Asp Asp Pro Phe Gly Glu Gly Pro Pro Asp Lys Lys
1175 1180 1185
Gly Asp Leu Thr Leu Asp Met Leu
1190 1195
<210> 39
<211> 753
<212> DNA
<213> Human alphaherpesvirus 1
<400> 39
aaatgagtct tcggacctcg cgggggccgc ttaagcggtg gttagggttt gtctgacgcg 60
gggggagggg gaaggaacga aacactctca ttcggaggcg gctcggggtt tggtcttggt 120
ggccacgggc acgcagaaga gcgccgcgat cctcttaagc acccccccgc cctccgtgga 180
ggcgggggtt tggtcggcgg gtggtaactg gcgggccgct gactcgggcg ggtcgcgcgc 240
cccagagtgt gaccttttcg gtctgctcgc agacccccgg gcggcgccgc cgcggcggcg 300
acgggctcgc tgggtcctag gctccatggg gaccgtatac gtggacaggc tctggagcat 360
ccgcacgact gcggtgatat taccggagac cttctgcggg acgagccggg tcacgcggct 420
gacgcggagc gtccgttggg cgacaaacac caggacgggg cacaggtaca ctatcttgtc 480
acccggaggc gcgagggact gcaggagctt cagggagtgg cgcagctgct tcatccccgt 540
ggcccgttgc tcgcgtttgc tggcggtgtc cccggaagaa atatatttgc atgtctttag 600
ttctatgatg acacaaaccc cgcccagcgt cttgtcattg gcgaattcga acacgcagat 660
gcagtcgggg cggcgcggtc ccaggtccac ttcgcatatt aaggtgacgc gtgtggcctc 720
gaacaccgag cgaccctgca gcgacccgct taa 753
<210> 40
<211> 48
<212> DNA
<213> Human alphaherpesvirus 1
<400> 40
cggcaataaa aagacagaat aaaacgcacg gtgttgggtc gtttgttc 48
<210> 41
<211> 12130
<212> DNA
<213> Artificial Sequence
<220>
<223> Description of Artificial Sequence: Synthetic polynucleotide
<400> 41
tgcagctctg gcccgtgtct caaaatctct gatgttacat tgcacaagat aaaaatatat 60
catcatgaac aataaaactg tctgcttaca taaacagtaa tacaaggggt gttatgagcc 120
atattcaacg ggaaacgtcg aggccgcgat taaattccaa catggatgct gatttatatg 180
ggtataaatg ggctcgcgat aatgtcgggc aatcaggtgc gacaatctat cgcttgtatg 240
ggaagcccga tgcgccagag ttgtttctga aacatggcaa aggtagcgtt gccaatgatg 300
ttacagatga gatggtcaga ctaaactggc tgacggaatt tatgcctctt ccgaccatca 360
agcattttat ccgtactcct gatgatgcat ggttactcac cactgcgatc cccggaaaaa 420
cagcattcca ggtattagaa gaatatcctg attcaggtga aaatattgtt gatgcgctgg 480
cagtgttcct gcgccggttg cattcgattc ctgtttgtaa ttgtcctttt aacagcgatc 540
gcgtatttcg tctcgctcag gcgcaatcac gaatgaataa cggtttggtt gatgcgagtg 600
attttgatga cgagcgtaat ggctggcctg ttgaacaagt ctggaaagaa atgcataaac 660
ttttgccatt ctcaccggat tcagtcgtca ctcatggtga tttctcactt gataacctta 720
tttttgacga ggggaaatta ataggttgta ttgatgttgg acgagtcgga atcgcagacc 780
gataccagga tcttgccatc ctatggaact gcctcggtga gttttctcct tcattacaga 840
aacggctttt tcaaaaatat ggtattgata atcctgatat gaataaattg cagtttcatt 900
tgatgctcga tgagtttttc taatcagaat tggttaattg gttgtaacat tattcagatt 960
gggcttgatt taaaacttca tttttaattt aaaaggatct aggtgaagat cctttttgat 1020
aatctcatga ccaaaatccc ttaacgtgag ttttcgttcc actgagcgtc agaccccgta 1080
gaaaagatca aaggatcttc ttgagatcct ttttttctgc gcgtaatctg ctgcttgcaa 1140
acaaaaaaac caccgctacc agcggtggtt tgtttgccgg atcaagagct accaactctt 1200
tttccgaagg taactggctt cagcagagcg cagataccaa atactgttct tctagtgtag 1260
ccgtagttag gccaccactt caagaactct gtagcaccgc ctacatacct cgctctgcta 1320
atcctgttac cagtggctgc tgccagtggc gataagtcgt gtcttaccgg gttggactca 1380
agacgatagt taccggataa ggcgcagcgg tcgggctgaa cggggggttc gtgcacacag 1440
cccagcttgg agcgaacgac ctacaccgaa ctgagatacc tacagcgtga gctatgagaa 1500
agcgccacgc ttcccgaagg gagaaaggcg gacaggtatc cggtaagcgg cagggtcgga 1560
acaggagagc gcacgaggga gcttccaggg ggaaacgcct ggtatcttta tagtcctgtc 1620
gggtttcgcc acctctgact tgagcgtcga tttttgtgat gctcgtcagg ggggcggagc 1680
ctatggaaaa acgccagcaa cgcggccttt ttacggttcc tggccttttg ctggcctttt 1740
gctcacatgt tctttcctgc gttatcccct gattctgtgg ataaccgtat taccgccttt 1800
gagtgagctg ataccgctcg ccgcagccga acgaccgagc gcagcgagtc agtgagcgag 1860
gaagcggaag agcgcccaat acgcaaaccg cctctccccg cgcgttggcc gattcattaa 1920
tgcagctggc acgacaggtt tcccgactgg aaagcgggca gtgagcgcaa cgcaattaat 1980
gtgagttagc tcactcatta ggcaccccag gctttacact ttatgcttcc ggctcgtatg 2040
ttgtgtggaa ttgtgagcgg ataacaattt cacacaggaa acagctatga ccatgattac 2100
accaagcttg catgcaggcc tctgcagtcg accagaagca ccatgtcctt gggtccggcc 2160
tgctgaatgc gcaggcggtc ggccatgccc caggcttcgt tttgacatcg gcgcaggtct 2220
ttgtagtagt cttgcatgag cctttctacc ggcacttctt cttctccttc ctcttgtcct 2280
gcatctcttg catctatcgc tgcggcggcg gcggagtttg gccgtaggtg gcgccctctt 2340
cctcccatgc gtgtgacccc gaagcccctc atcggctgaa gcagggctag gtcggcgaca 2400
acgcgctcgg ctaatatggc ctgctgcacc tgcgtgaggg tagactggaa gtcatccatg 2460
tccacaaagc ggtggtatgc gcccgtgttg atggtgtaag tgcagttggc cataacggac 2520
cagttaacgg tctggtgacc cggctgcgag agctcggtgt acctgagacg cgagtaagcc 2580
ctcgagtcaa atacgtagtc gttgcaagtc cgcaccaggt actggtatcc caccaaaaag 2640
tgcggcggcg gctggcggta gaggggccag cgtagggtgg ccggggctcc gggggcgaga 2700
tcttccaaca taaggcgatg atatccgtag atgtacctgg acatccaggt gatgccggcg 2760
gcggtggtgg aggcgcgcgg aaagtcgcgg acgcggttcc agatgttgcg cagcggcaaa 2820
aagtgctcca tggtcgggac gctctggccg gtcaggcgcg cgcaatcgtt gacgctctag 2880
cgtgcaaaag gagagcctgt aagcgggcac tcttccgtgg tctggtggat aaattcgcaa 2940
gggtatcatg gcggacgacc ggggttcgag ccccgtatcc ggccgtccgc cgtgatccat 3000
gcggttaccg cccgcgtgtc gaacccaggt gtgcgacgtc agacaacggg ggagtgctcc 3060
ttttggcttc cttccaggcg cggcggctgc tgcgctagct tttttggcca ctggccgcgc 3120
gcagcgtaag cggttaggct ggaaagcgaa agcattaagt ggctcgctcc ctgtagccgg 3180
agggttattt tccaagggtt gagtcgcggg acccccggtt cgagtctcgg accgagactg 3240
ggggcgtaca ctggatggcc tttgcctgga acccgcactc aaaaacatgc tacctctttg 3300
agccctttgg cttttctgac cagcgactca agcaggttta ccagtttgag tacgagtcac 3360
tcctgcgccg tagcgccatt gcttcttccc ccgaccgctg tataacgctg gaaaagtcca 3420
cccaaagcgt acaggggccc aactcggccg cctgtggact attctgctgc atgtttctcc 3480
acgcctttgc caactggccc caaactccca tggatcacaa ccccaccatg aaccttatta 3540
ccggggtacc caactccatg ctcaacagtc cccaggtaca gcccaccctg cgtcgcaacc 3600
aggaacagct ctacagcttc ctggagcgcc actcgcccta cttccgcagc cacagtgcgc 3660
agattaggag cgccacttct ttttgtcact tgaaaaacat gtaaaaataa tgtactagag 3720
acactttcaa taaaggcaaa tgcttttatt tgtacactct cgggtgatta tttaccccca 3780
cccttgccgt ctgcgccgtt taaaaatcaa aggggttctg ccgcgcatcg ctatgcgcca 3840
ctggcaggga cacgttgcga tactggtgtt tagtgctcca cttaaactca ggcacaacca 3900
tccgcggcag ctcggtgaag ttttcactcc acaggctgcg caccatcacc aacgcgttta 3960
gcaggtcggg cgccgatatc ttgaagtcgc agttggggcc tccgccctgc gcgcgcgagt 4020
tgcgatacac agggttgcag cactggaaca ctatcagcgc cgggtggtgc acgctggcca 4080
gcacgctctt gtcggagatc agatccgcgt ccaggtcctc cgcgttgctc agggcgaacg 4140
gagtcaactt tggtagctgc cttcccaaaa agggcgcgtg cccaggcttt gagttgcact 4200
cgcaccgtag tggcatcaaa aggtgaccgt gcccggtctg ggcgttagga tacagcgcct 4260
gcataaaagc cttgatctgc ttaaaagcca cctgagcctt tgcgccttca gagaagaaca 4320
tgccgcaaga cttgccggaa aactgattgg ccggacaggc cgcgtcgtgc acgcagcacc 4380
ttgcgtcggt gttggagatc tgcaccacat ttcggcccca ccggttcttc acgatcttgg 4440
ccttgctaga ctgctccttc agcgcgcgct gcccgttttc gctcgtcaca tccatttcaa 4500
tcacgtgctc cttatttatc ataatgcttc cgtgtagaca cttaagctcg ccttcgatct 4560
cagcgcagcg gtgcagccac aacgcgcagc ccgtgggctc gtgatgcttg taggtcacct 4620
ctgcaaacga ctgcaggtac gcctgcagga atcgccccat catcgtcaca aaggtcttgt 4680
tgctggtgaa ggtcagctgc aacccgcggt gctcctcgtt cagccaggtc ttgcatacgg 4740
ccgccagagc ttccacttgg tcaggcagta gtttgaagtt cgcctttaga tcgttatcca 4800
cgtggtactt gtccatcagc gcgcgcgcag cctccatgcc cttctcccac gcagacacga 4860
tcggcacact cagcgggttc atcaccgtaa tttcactttc cgcttcgctg ggctcttcct 4920
cttcctcttg cgtccgcata ccacgcgcca ctgggtcgtc ttcattcagc cgccgcactg 4980
tgcgcttacc tcctttgcca tgcttgatta gcaccggtgg gttgctgaaa cccaccattt 5040
gtagcgccac atcttctctt tcttcctcgc tgtccacgat tacctctggt gatggcgggc 5100
gctcgggctt gggagaaggg cgcttctttt tcttcttggg cgcaatggcc aaatccgccg 5160
ccgaggtcga tggccgcggg ctgggtgtgc gcggcaccag cgcgtcttgt gatgagtctt 5220
cctcgtcctc ggactcgata cgccgcctca tccgcttttt tgggggcgcc cggggaggcg 5280
gcggcgacgg ggacggggac gacacgtcct ccatggttgg gggacgtcgc gccgcaccgc 5340
gtccgcgctc gggggtggtt tcgcgctgct cctcttcccg actggccatt tccttctcct 5400
ataggcagaa aaagatcatg gagtcagtcg agaagaagga cagcctaacc gccccctctg 5460
agttcgccac caccgcctcc accgatgccg ccaacgcgcc taccaccttc cccgtcgagg 5520
cacccccgct tgaggaggag gaagtgatta tcgagcagga cccaggtttt gtaagcgaag 5580
acgacgagga ccgctcagta ccaacagagg ataaaaagca agaccaggac aacgcagagg 5640
caaacgagga acaagtcggg cggggggacg aaaggcatgg cgactaccta gatgtgggag 5700
acgacgtgct gttgaagcat ctgcagcgcc agtgcgccat tatctgcgac gcgttgcaag 5760
agcgcagcga tgtgcccctc gccatagcgg atgtcagcct tgcctacgaa cgccacctat 5820
tctcaccgcg cgtacccccc aaacgccaag aaaacggcac atgcgagccc aacccgcgcc 5880
tcaacttcta ccccgtattt gccgtgccag aggtgcttgc cacctatcac atctttttcc 5940
aaaactgcaa gataccccta tcctgccgtg ccaaccgcag ccgagcggac aagcagctgg 6000
ccttgcggca gggcgctgtc atacctgata tcgcctcgct caacgaagtg ccaaaaatct 6060
ttgagggtct tggacgcgac gagaagcgcg cggcaaacgc tctgcaacag gaaaacagcg 6120
aaaatgaaag tcactctgga gtgttggtgg aactcgaggg tgacaacgcg cgcctagccg 6180
tactaaaacg cagcatcgag gtcacccact ttgcctaccc ggcacttaac ctacccccca 6240
aggtcatgag cacagtcatg agtgagctga tcgtgcgccg tgcgcagccc ctggagaggg 6300
atgcaaattt gcaagaacaa acagaggagg gcctacccgc agttggcgac gagcagctag 6360
cgcgctggct tcaaacgcgc gagcctgccg acttggagga gcgacgcaaa ctaatgatgg 6420
ccgcagtgct cgttaccgtg gagcttgagt gcatgcagcg gttctttgct gacccggaga 6480
tgcagcgcaa gctagaggaa acattgcact acacctttcg acagggctac gtacgccagg 6540
cctgcaagat ctccaacgtg gagctctgca acctggtctc ctaccttgga attttgcacg 6600
aaaaccgcct tgggcaaaac gtgcttcatt ccacgctcaa gggcgaggcg cgccgcgact 6660
acgtccgcga ctgcgtttac ttatttctat gctacacctg gcagacggcc atgggcgttt 6720
ggcagcagtg cttggaggag tgcaacctca aggagctgca gaaactgcta aagcaaaact 6780
tgaaggacct atggacggcc ttcaacgagc gctccgtggc cgcgcacctg gcggacatca 6840
ttttccccga acgcctgctt aaaaccctgc aacagggtct gccagacttc accagtcaaa 6900
gcatgttgca gaactttagg aactttatcc tagagcgctc aggaatcttg cccgccacct 6960
gctgtgcact tcctagcgac tttgtgccca ttaagtaccg cgaatgccct ccgccgcttt 7020
ggggccactg ctaccttctg cagctagcca actaccttgc ctaccactct gacataatgg 7080
aagacgtgag cggtgacggt ctactggagt gtcactgtcg ctgcaaccta tgcaccccgc 7140
accgctccct ggtttgcaat tcgcagctgc ttaacgaaag tcaaattatc ggtacctttg 7200
agctgcaggg tccctcgcct gacgaaaagt ccgcggctcc ggggttgaaa ctcactccgg 7260
ggctgtggac gtcggcttac cttcgcaaat ttgtacctga ggactaccac gcccacgaga 7320
ttaggttcta cgaagaccaa tcccgcccgc ctaatgcgga gcttaccgcc tgcgtcatta 7380
cccagggcca cattcttggc caattgcaag ccatcaacaa agcccgccaa gagtttctgc 7440
tacgaaaggg acggggggtt tacttggacc cccagtccgg cgaggagctc aacccaatcc 7500
ccccgccgcc gcagccctat cagcagcagc cgcgggccct tgcttcccag gatggcaccc 7560
aaaaagaagc tgcagctgcc gccgccaccc acggacgagg aggaatactg ggacagtcag 7620
gcagaggagg ttttggacga ggaggaggag gacatgatgg aagactggga gagcctagac 7680
gaggaagctt ccgaggtcga agaggtgtca gacgaaacac cgtcaccctc ggtcgcattc 7740
ccctcgccgg cgccccagaa atcggcaacc ggttccagca tggctacaac ctccgctcct 7800
caggcgccgc cggcactgcc cgttcgccga cccaaccgta gatgggacac cactggaacc 7860
agggccggta agtccaagca gccgccgccg ttagcccaag agcaacaaca gcgccaaggc 7920
taccgctcat ggcgcgggca caagaacgcc atagttgctt gcttgcaaga ctgtgggggc 7980
aacatctcct tcgcccgccg ctttcttctc taccatcacg gcgtggcctt cccccgtaac 8040
atcctgcatt actaccgtca tctctacagc ccatactgca ccggcggcag cggcagcaac 8100
agcagcggcc acacagaagc aaaggcgacc ggatagcaag actctgacaa agcccaagaa 8160
atccacagcg gcggcagcag caggaggagg agcgctgcgt ctggcgccca acgaacccgt 8220
atcgacccgc gagcttagaa acaggatttt tcccactctg tatgctatat ttcaacagag 8280
caggggccaa gaacaagagc tgaaaataaa aaacaggtct ctgcgatccc tcacccgcag 8340
ctgcctgtat cacaaaagcg aagatcagct tcggcgcacg ctggaagacg cggaggctct 8400
cttcagtaaa tactgcgcgc tgactcttaa ggactagttt cgcgcccttt ctcaaattta 8460
agcgcgaaaa ctacgtcatc tccagcggcc acacccggcg ccagcacctg ttgtcagcgc 8520
cattatgagc aaggaaattc ccacgcccta catgtggagt taccagccac aaatgggact 8580
tgcggctgga gctgcccaag actactcaac ccgaataaac tacatgagcg cggggcggcc 8640
gccgtttgtg ttatgtttca acgtgtttat ttttcaattg cagaaaattt caagtcattt 8700
ttcattcagt agtatagccc caccaccaca tagcttatac agatcaccgt accttaatca 8760
aactcacaga accctagtat tcaacctgcc acctccctcc caacacacag agtacacagt 8820
cctttctccc cggctggcct taaaaagcat catatcatgg gtaacagaca tattcttagg 8880
tgttatattc cacacggttt cctgtcgagc caaacgctca tcagtgatat taataaactc 8940
cccgggcagc tcacttaagt tcatgtcgct gtccagctgc tgagccacag gctgctgtcc 9000
aacttgcggt tgcttaacgg gcggcgaagg agaagtccac gcctacatgg gggtagagtc 9060
ataatcgtgc atcaggatag ggcggtggtg ctgcagcagc gcgcgaataa actgctgccg 9120
ccgccgctcc gtcctgcagg aatacaacat ggcagtggtc tcctcagcga tgattcgcac 9180
cgcccgcagc ataaggcgcc ttgtcctccg ggcacagcag cgcaccctga tctcacttaa 9240
atcagcacag taactgcagc acagcaccac aatattgttc aaaatcccac agtgcaaggc 9300
gctgtatcca aagctcatgg cggggaccac agaacccacg tggccatcat accacaagcg 9360
caggtagatt aagtggcgac ccctcataaa cacgctggac ataaacatta cctcttttgg 9420
catgttgtaa ttcaccacct cccggtacca tataaacctc tgattaaaca tggcgccatc 9480
caccaccatc ctaaaccagc tggccaaaac ctgcccgccg gctatacact gcagggaacc 9540
gggactggaa caatgacagt ggagagccca ggactcgtaa ccatggatca tcatgctcgt 9600
catgatatca atgttggcac aacacaggca cacgtgcata cacttcctca ggattacaag 9660
ctcctcccgc gttagaacca tatcccaggg aacaacccat tcctgaatca gcgtaaatcc 9720
cacactgcag ggaagacctc gcacgtaact cacgttgtgc attgtcaaag tgttacattc 9780
gggcagcagc ggatgatcct ccagtatggt agcgcgggtt tctgtctcaa aaggaggtag 9840
acgatcccta ctgtacggag tgcgccgaga caaccgagat cgtgttggtc gtagtgtcat 9900
gccaaatgga acgccggacg tagtcatatt tcctgaagca aaaccaggtg cgggcgtgac 9960
aaacagatct gcgtctccgg tctcgccgct tagatcgctc tgtgtagtag ttgtagtata 10020
tccactctct caaagcatcc aggcgccccc tggcttcggg ttctatgtaa actccttcat 10080
gcgccgctgc cctgataaca tccaccaccg cagaataagc cacacccagc caacctacac 10140
attcgttctg cgagtcacac acgggaggag cgggaagagc tggaagaacc atgttttttt 10200
ttttattcca aaagattatc caaaacctca aaatgaagat ctattaagtg aacgcgctcc 10260
cctccggtgg cgtggtcaaa ctctacagcc aaagaacaga taatggcatt tgtaagatgt 10320
tgcacaatgg cttccaaaag gcaaacggcc ctcacgtcca agtggacgta aaggctaaac 10380
ccttcagggt gaatctcctc tataaacatt ccagcacctt caaccatgcc caaataattc 10440
tcatctcgcc accttctcaa tatatctcta agcaaatccc gaatattaag tccggccatt 10500
gtaaaaatct gctccagagc gccctccacc ttcagcctca agcagcgaat catgattgca 10560
aaaattcagg ttcctcacag acctgtataa gattcaaaag cggaacatta acaaaaatac 10620
cgcgatcccg taggtccctt cgcagggcca gctgaacata atcgtgcagg tctgcacgga 10680
ccagcgcggc cacttccccg ccaggaacca tgacaaaaga acccacactg attatgacac 10740
gcatactcgg agctatgcta accagcgtag ccccgatgta agcttgttgc atgggcggcg 10800
atataaaatg caaggtgctg ctcaaaaaat caggcaaagc ctcgcgcaaa aaagaaagca 10860
catcgtagtc atgctcatgc agataaaggc aggtaagctc cggaaccacc acagaaaaag 10920
acaccatttt tctctcaaac atgtctgcgg gtttctgcat aaacacaaaa taaaataaca 10980
aaaaaacatt taaacattag aagcctgtct tacaacagga aaaacaaccc ttataagcat 11040
aagacggact acggccatgc cggcgtgacc gtaaaaaaac tggtcaccgt gattaaaaag 11100
caccaccgac agctcctcgg tcatgtccgg agtcataatg taagactcgg taaacacatc 11160
aggttgattc acatcggtca gtgctaaaaa gcgaccgaaa tagcccgggg gaatacatac 11220
ccgcaggcgt agagacaaca ttacagcccc cataggaggt ataacaaaat taataggaga 11280
gaaaaacaca taaacacctg aaaaaccctc ctgcctaggc aaaatagcac cctcccgctc 11340
cagaacaaca tacagcgctt ccacagcggc agccataaca gtcagcctta ccagtaaaaa 11400
agaaaaccta ttaaaaaaac accactcgac acggcaccag ctcaatcagt cacagtgtaa 11460
aaaagggcca agtgcagagc gagtatatat aggactaaaa aatgacgtaa cggttaaagt 11520
ccacaaaaaa cacccagaaa accgcacgcg aacctacgcc cagaaacgaa agccaaaaaa 11580
cccacaactt cctcaaatcg tcacttccgt tttcccacgt tacgtcactt cccattttaa 11640
gaaaactaca attcccaaca catacaagtt actccgccct taattaaatc ggatccgata 11700
tctagatgta ttcgcgaggt accgagctcg aattctctgg ccgtcgtttt acaacgtcgt 11760
gactgggaaa accctggcgt tacccaactt aatcgccttg cagcacatcc ccctttcgcc 11820
agctggcgta atagcgaaga ggcccgcacc gatcgccctt cccaacagtt gcgcagcctg 11880
aatggcgaat ggcgcctgat gcggtatttt ctccttacgc atctgtgcgg tatttcacac 11940
cgcatatggt gcactctcag tacaatctgc tctgatgccg catagttaag ccagccccga 12000
cacccgccaa cacccgctga cgcgccctga cgggcttgtc tgctcccggc atccgcttac 12060
agacaagctg tgaccgtctc cgggagctgc atgtgtcaga ggttttcacc gtcatcaccg 12120
aaacgcgcga 12130
<210> 42
<211> 9135
<212> DNA
<213> Artificial Sequence
<220>
<223> Description of Artificial Sequence: Synthetic polynucleotide
<400> 42
tgcagctctg gcccgtgtct caaaatctct gatgttacat tgcacaagat aaaaatatat 60
catcatgaac aataaaactg tctgcttaca taaacagtaa tacaaggggt gttatgagcc 120
atattcaacg ggaaacgtcg aggccgcgat taaattccaa catggatgct gatttatatg 180
ggtataaatg ggctcgcgat aatgtcgggc aatcaggtgc gacaatctat cgcttgtatg 240
ggaagcccga tgcgccagag ttgtttctga aacatggcaa aggtagcgtt gccaatgatg 300
ttacagatga gatggtcaga ctaaactggc tgacggaatt tatgcctctt ccgaccatca 360
agcattttat ccgtactcct gatgatgcat ggttactcac cactgcgatc cccggaaaaa 420
cagcattcca ggtattagaa gaatatcctg attcaggtga aaatattgtt gatgcgctgg 480
cagtgttcct gcgccggttg cattcgattc ctgtttgtaa ttgtcctttt aacagcgatc 540
gcgtatttcg tctcgctcag gcgcaatcac gaatgaataa cggtttggtt gatgcgagtg 600
attttgatga cgagcgtaat ggctggcctg ttgaacaagt ctggaaagaa atgcataaac 660
ttttgccatt ctcaccggat tcagtcgtca ctcatggtga tttctcactt gataacctta 720
tttttgacga ggggaaatta ataggttgta ttgatgttgg acgagtcgga atcgcagacc 780
gataccagga tcttgccatc ctatggaact gcctcggtga gttttctcct tcattacaga 840
aacggctttt tcaaaaatat ggtattgata atcctgatat gaataaattg cagtttcatt 900
tgatgctcga tgagtttttc taatcagaat tggttaattg gttgtaacat tattcagatt 960
gggcttgatt taaaacttca tttttaattt aaaaggatct aggtgaagat cctttttgat 1020
aatctcatga ccaaaatccc ttaacgtgag ttttcgttcc actgagcgtc agaccccgta 1080
gaaaagatca aaggatcttc ttgagatcct ttttttctgc gcgtaatctg ctgcttgcaa 1140
acaaaaaaac caccgctacc agcggtggtt tgtttgccgg atcaagagct accaactctt 1200
tttccgaagg taactggctt cagcagagcg cagataccaa atactgttct tctagtgtag 1260
ccgtagttag gccaccactt caagaactct gtagcaccgc ctacatacct cgctctgcta 1320
atcctgttac cagtggctgc tgccagtggc gataagtcgt gtcttaccgg gttggactca 1380
agacgatagt taccggataa ggcgcagcgg tcgggctgaa cggggggttc gtgcacacag 1440
cccagcttgg agcgaacgac ctacaccgaa ctgagatacc tacagcgtga gctatgagaa 1500
agcgccacgc ttcccgaagg gagaaaggcg gacaggtatc cggtaagcgg cagggtcgga 1560
acaggagagc gcacgaggga gcttccaggg ggaaacgcct ggtatcttta tagtcctgtc 1620
gggtttcgcc acctctgact tgagcgtcga tttttgtgat gctcgtcagg ggggcggagc 1680
ctatggaaaa acgccagcaa cgcggccttt ttacggttcc tggccttttg ctggcctttt 1740
gctcacatgt tctttcctgc gttatcccct gattctgtgg ataaccgtat taccgccttt 1800
gagtgagctg ataccgctcg ccgcagccga acgaccgagc gcagcgagtc agtgagcgag 1860
gaagcggaag agcgcccaat acgcaaaccg cctctccccg cgcgttggcc gattcattaa 1920
tgcagctggc acgacaggtt tcccgactgg aaagcgggca gtgagcgcaa cgcaattaat 1980
gtgagttagc tcactcatta ggcaccccag gctttacact ttatgcttcc ggctcgtatg 2040
ttgtgtggaa ttgtgagcgg ataacaattt cacacaggaa acagctatga ccatgattac 2100
accaagcttg catgcaggcc tctgcagtcg accagaagca ccatgtcctt gggtccggcc 2160
tgctgaatgc gcaggcggtc ggccatgccc caggcttcgt tttgacatcg gcgcaggtct 2220
ttgtagtagt cttgcatgag cctttctacc ggcacttctt cttctccttc ctcttgtcct 2280
gcatctcttg catctatcgc tgcggcggcg gcggagtttg gccgtaggtg gcgccctctt 2340
cctcccatgc gtgtgacccc gaagcccctc atcggctgaa gcagggctag gtcggcgaca 2400
acgcgctcgg ctaatatggc ctgctgcacc tgcgtgaggg tagactggaa gtcatccatg 2460
tccacaaagc ggtggtatgc gcccgtgttg atggtgtaag tgcagttggc cataacggac 2520
cagttaacgg tctggtgacc cggctgcgag agctcggtgt acctgagacg cgagtaagcc 2580
ctcgagtcaa atacgtagtc gttgcaagtc cgcaccaggt actggtatcc caccaaaaag 2640
tgcggcggcg gctggcggta gaggggccag cgtagggtgg ccggggctcc gggggcgaga 2700
tcttccaaca taaggcgatg atatccgtag atgtacctgg acatccaggt gatgccggcg 2760
gcggtggtgg aggcgcgcgg aaagtcgcgg acgcggttcc agatgttgcg cagcggcaaa 2820
aagtgctcca tggtcgggac gctctggccg gtcaggcgcg cgcaatcgtt gacgctctag 2880
cgtgcaaaag gagagcctgt aagcgggcac tcttccgtgg tctggtggat aaattcgcaa 2940
gggtatcatg gcggacgacc ggggttcgag ccccgtatcc ggccgtccgc cgtgatccat 3000
gcggttaccg cccgcgtgtc gaacccaggt gtgcgacgtc agacaacggg ggagtgctcc 3060
ttttggcttc cttccaggcg cggcggctgc tgcgctagct tttttggcca ctggccgcgc 3120
gcagcgtaag cggttaggct ggaaagcgaa agcattaagt ggctcgctcc ctgtagccgg 3180
agggttattt tccaagggtt gagtcgcggg acccccggtt cgagtctcgg accgagactg 3240
ggggcgtaca ctggatggcc tttgcctgga acccgcactc aaaaacatgc tacctctttg 3300
agccctttgg cttttctgac cagcgactca agcaggttta ccagtttgag tacgagtcac 3360
tcctgcgccg tagcgccatt gcttcttccc ccgaccgctg tataacgctg gaaaagtcca 3420
cccaaagcgt acaggggccc aactcggccg cctgtggact attctgctgc atgtttctcc 3480
acgcctttgc caactggccc caaactccca tggatcacaa ccccaccatg aaccttatta 3540
ccggggtacc caactccatg ctcaacagtc cccaggtaca gcccaccctg cgtcgcaacc 3600
aggaacagct ctacagcttc ctggagcgcc actcgcccta cttccgcagc cacagtgcgc 3660
agattaggag cgccacttct ttttgtcact tgaaaaacat gtaaaaataa tgtactagag 3720
acactttcaa taaaggcaaa tgcttttatt tgtacactct cgggtgatta tttaccccca 3780
cccttgccgt ctgcgccgtt taaaaatcaa aggggttctg ccgcgcatcg ctatgcgcca 3840
ctggcaggga cacgttgcga tactggtgtt tagtgctcca cttaaactca ggcacaacca 3900
tccgcggcag ctcggtgaag ttttcactcc acaggctgcg caccatcacc aacgcgttta 3960
gcaggtcggg cgccgatatc ttgaagtcgc agttggggcc tccgccctgc gcgcgcgagt 4020
tgcgatacac agggttgcag cactggaaca ctatcagcgc cgggtggtgc acgctggcca 4080
gcacgctctt gtcggagatc agatccgcgt ccaggtcctc cgcgttgctc agggcgaacg 4140
gagtcaactt tggtagctgc cttcccaaaa agggcgcgtg cccaggcttt gagttgcact 4200
cgcaccgtag tggcatcaaa aggtgaccgt gcccggtctg ggcgttagga tacagcgcct 4260
gcataaaagc cttgatctgc ttaaaagcca cctgagcctt tgcgccttca gagaagaaca 4320
tgccgcaaga cttgccggaa aactgattgg ccggacaggc cgcgtcgtgc acgcagcacc 4380
ttgcgtcggt gttggagatc tgcaccacat ttcggcccca ccggttcttc acgatcttgg 4440
ccttgctaga ctgctccttc agcgcgcgct gcccgttttc gctcgtcaca tccatttcaa 4500
tcacgtgctc cttatttatc ataatgcttc cgtgtagaca cttaagctcg ccttcgatct 4560
cagcgcagcg gtgcagccac aacgcgcagc ccgtgggctc gtgatgcttg taggtcacct 4620
ctgcaaacga ctgcaggtac gcctgcagga atcgccccat catcgtcaca aaggtcttgt 4680
tgctggtgaa ggtcagctgc aacccgcggt gctcctcgtt cagccaggtc ttgcatacgg 4740
ccgccagagc ttccacttgg tcaggcagta gtttgaagtt cgcctttaga tcgttatcca 4800
cgtggtactt gtccatcagc gcgcgcgcag cctccatgcc cttctcccac gcagacacga 4860
tcggcacact cagcgggttc atcaccgtaa tttcactttc cgcttcgctg ggctcttcct 4920
cttcctcttg cgtccgcata ccacgcgcca ctgggtcgtc ttcattcagc cgccgcactg 4980
tgcgcttacc tcctttgcca tgcttgatta gcaccggtgg gttgctgaaa cccaccattt 5040
gtagcgccac atcttctctt tcttcctcgc tgtccacgat tacctctggt gatggcgggc 5100
gctcgggctt gggagaaggg cgcttctttt tcttcttggg cgcaatggcc aaatccgccg 5160
ccgaggtcga tggccgcggg ctgggtgtgc gcggcaccag cgcgtcttgt gatgagtctt 5220
cctcgtcctc ggactcgata cgccgcctca tccgcttttt tgggggcgcc cggggaggcg 5280
gcggcgacgg ggacggggac gacacgtcct ccatggttgg gggacgtcgc gccgcaccgc 5340
gtccgcgctc gggggtggtt tcgcgctgct cctcttcccg actggccatt tccttctcct 5400
ataggcagaa aaagatccac aaaagcgaag atcagcttcg gcgcacgctg gaagacgcgg 5460
aggctctctt cagtaaatac tgcgcgctga ctcttaagga ctagtttcgc gccctttctc 5520
aaatttaagc gcgaaaacta cgtcatctcc agcggccaca cccggcgcca gcacctgttg 5580
tcagcgccat tggcgcgccg gccggccgaa tatcttcatt taaatgttta aacatcgatg 5640
cggccgccgt ttgtgttatg tttcaacgtg tttatttttc aattgcagaa aatttcaagt 5700
catttttcat tcagtagtat agccccacca ccacatagct tatacagatc accgtacctt 5760
aatcaaactc acagaaccct agtattcaac ctgccacctc cctcccaaca cacagagtac 5820
acagtccttt ctccccggct ggccttaaaa agcatcatat catgggtaac agacatattc 5880
ttaggtgtta tattccacac ggtttcctgt cgagccaaac gctcatcagt gatattaata 5940
aactccccgg gcagctcact taagttcatg tcgctgtcca gctgctgagc cacaggctgc 6000
tgtccaactt gcggttgctt aacgggcggc gaaggagaag tccacgccta catgggggta 6060
gagtcataat cgtgcatcag gatagggcgg tggtgctgca gcagcgcgcg aataaactgc 6120
tgccgccgcc gctccgtcct gcaggaatac aacatggcag tggtctcctc agcgatgatt 6180
cgcaccgccc gcagcataag gcgccttgtc ctccgggcac agcagcgcac cctgatctca 6240
cttaaatcag cacagtaact gcagcacagc accacaatat tgttcaaaat cccacagtgc 6300
aaggcgctgt atccaaagct catggcgggg accacagaac ccacgtggcc atcataccac 6360
aagcgcaggt agattaagtg gcgacccctc ataaacacgc tggacataaa cattacctct 6420
tttggcatgt tgtaattcac cacctcccgg taccatataa acctctgatt aaacatggcg 6480
ccatccacca ccatcctaaa ccagctggcc aaaacctgcc cgccggctat acactgcagg 6540
gaaccgggac tggaacaatg acagtggaga gcccaggact cgtaaccatg gatcatcatg 6600
ctcgtcatga tatcaatgtt ggcacaacac aggcacacgt gcatacactt cctcaggatt 6660
acaagctcct cccgcgttag aaccatatcc cagggaacaa cccattcctg aatcagcgta 6720
aatcccacac tgcagggaag acctcgcacg taactcacgt tgtgcattgt caaagtgtta 6780
cattcgggca gcagcggatg atcctccagt atggtagcgc gggtttctgt ctcaaaagga 6840
ggtagacgat ccctactgta cggagtgcgc cgagacaacc gagatcgtgt tggtcgtagt 6900
gtcatgccaa atggaacgcc ggacgtagtc atatttcctg aagcaaaacc aggtgcgggc 6960
gtgacaaaca gatctgcgtc tccggtctcg ccgcttagat cgctctgtgt agtagttgta 7020
gtatatccac tctctcaaag catccaggcg ccccctggct tcgggttcta tgtaaactcc 7080
ttcatgcgcc gctgccctga taacatccac caccgcagaa taagccacac ccagccaacc 7140
tacacattcg ttctgcgagt cacacacggg aggagcggga agagctggaa gaaccatgtt 7200
ttttttttta ttccaaaaga ttatccaaaa cctcaaaatg aagatctatt aagtgaacgc 7260
gctcccctcc ggtggcgtgg tcaaactcta cagccaaaga acagataatg gcatttgtaa 7320
gatgttgcac aatggcttcc aaaaggcaaa cggccctcac gtccaagtgg acgtaaaggc 7380
taaacccttc agggtgaatc tcctctataa acattccagc accttcaacc atgcccaaat 7440
aattctcatc tcgccacctt ctcaatatat ctctaagcaa atcccgaata ttaagtccgg 7500
ccattgtaaa aatctgctcc agagcgccct ccaccttcag cctcaagcag cgaatcatga 7560
ttgcaaaaat tcaggttcct cacagacctg tataagattc aaaagcggaa cattaacaaa 7620
aataccgcga tcccgtaggt cccttcgcag ggccagctga acataatcgt gcaggtctgc 7680
acggaccagc gcggccactt ccccgccagg aaccatgaca aaagaaccca cactgattat 7740
gacacgcata ctcggagcta tgctaaccag cgtagccccg atgtaagctt gttgcatggg 7800
cggcgatata aaatgcaagg tgctgctcaa aaaatcaggc aaagcctcgc gcaaaaaaga 7860
aagcacatcg tagtcatgct catgcagata aaggcaggta agctccggaa ccaccacaga 7920
aaaagacacc atttttctct caaacatgtc tgcgggtttc tgcataaaca caaaataaaa 7980
taacaaaaaa acatttaaac attagaagcc tgtcttacaa caggaaaaac aacccttata 8040
agcataagac ggactacggc catgccggcg tgaccgtaaa aaaactggtc accgtgatta 8100
aaaagcacca ccgacagctc ctcggtcatg tccggagtca taatgtaaga ctcggtaaac 8160
acatcaggtt gattcacatc ggtcagtgct aaaaagcgac cgaaatagcc cgggggaata 8220
catacccgca ggcgtagaga caacattaca gcccccatag gaggtataac aaaattaata 8280
ggagagaaaa acacataaac acctgaaaaa ccctcctgcc taggcaaaat agcaccctcc 8340
cgctccagaa caacatacag cgcttccaca gcggcagcca taacagtcag ccttaccagt 8400
aaaaaagaaa acctattaaa aaaacaccac tcgacacggc accagctcaa tcagtcacag 8460
tgtaaaaaag ggccaagtgc agagcgagta tatataggac taaaaaatga cgtaacggtt 8520
aaagtccaca aaaaacaccc agaaaaccgc acgcgaacct acgcccagaa acgaaagcca 8580
aaaaacccac aacttcctca aatcgtcact tccgttttcc cacgttacgt cacttcccat 8640
tttaagaaaa ctacaattcc caacacatac aagttactcc gcccttaatt aaatcggatc 8700
cgatatctag atgtattcgc gaggtaccga gctcgaattc tctggccgtc gttttacaac 8760
gtcgtgactg ggaaaaccct ggcgttaccc aacttaatcg ccttgcagca catccccctt 8820
tcgccagctg gcgtaatagc gaagaggccc gcaccgatcg cccttcccaa cagttgcgca 8880
gcctgaatgg cgaatggcgc ctgatgcggt attttctcct tacgcatctg tgcggtattt 8940
cacaccgcat atggtgcact ctcagtacaa tctgctctga tgccgcatag ttaagccagc 9000
cccgacaccc gccaacaccc gctgacgcgc cctgacgggc ttgtctgctc ccggcatccg 9060
cttacagaca agctgtgacc gtctccggga gctgcatgtg tcagaggttt tcaccgtcat 9120
caccgaaacg cgcga 9135
<210> 43
<211> 8236
<212> DNA
<213> Artificial Sequence
<220>
<223> Description of Artificial Sequence: Synthetic polynucleotide
<400> 43
tgcagctctg gcccgtgtct caaaatctct gatgttacat tgcacaagat aaaaatatat 60
catcatgaac aataaaactg tctgcttaca taaacagtaa tacaaggggt gttatgagcc 120
atattcaacg ggaaacgtcg aggccgcgat taaattccaa catggatgct gatttatatg 180
ggtataaatg ggctcgcgat aatgtcgggc aatcaggtgc gacaatctat cgcttgtatg 240
ggaagcccga tgcgccagag ttgtttctga aacatggcaa aggtagcgtt gccaatgatg 300
ttacagatga gatggtcaga ctaaactggc tgacggaatt tatgcctctt ccgaccatca 360
agcattttat ccgtactcct gatgatgcat ggttactcac cactgcgatc cccggaaaaa 420
cagcattcca ggtattagaa gaatatcctg attcaggtga aaatattgtt gatgcgctgg 480
cagtgttcct gcgccggttg cattcgattc ctgtttgtaa ttgtcctttt aacagcgatc 540
gcgtatttcg tctcgctcag gcgcaatcac gaatgaataa cggtttggtt gatgcgagtg 600
attttgatga cgagcgtaat ggctggcctg ttgaacaagt ctggaaagaa atgcataaac 660
ttttgccatt ctcaccggat tcagtcgtca ctcatggtga tttctcactt gataacctta 720
tttttgacga ggggaaatta ataggttgta ttgatgttgg acgagtcgga atcgcagacc 780
gataccagga tcttgccatc ctatggaact gcctcggtga gttttctcct tcattacaga 840
aacggctttt tcaaaaatat ggtattgata atcctgatat gaataaattg cagtttcatt 900
tgatgctcga tgagtttttc taatcagaat tggttaattg gttgtaacat tattcagatt 960
gggcttgatt taaaacttca tttttaattt aaaaggatct aggtgaagat cctttttgat 1020
aatctcatga ccaaaatccc ttaacgtgag ttttcgttcc actgagcgtc agaccccgta 1080
gaaaagatca aaggatcttc ttgagatcct ttttttctgc gcgtaatctg ctgcttgcaa 1140
acaaaaaaac caccgctacc agcggtggtt tgtttgccgg atcaagagct accaactctt 1200
tttccgaagg taactggctt cagcagagcg cagataccaa atactgttct tctagtgtag 1260
ccgtagttag gccaccactt caagaactct gtagcaccgc ctacatacct cgctctgcta 1320
atcctgttac cagtggctgc tgccagtggc gataagtcgt gtcttaccgg gttggactca 1380
agacgatagt taccggataa ggcgcagcgg tcgggctgaa cggggggttc gtgcacacag 1440
cccagcttgg agcgaacgac ctacaccgaa ctgagatacc tacagcgtga gctatgagaa 1500
agcgccacgc ttcccgaagg gagaaaggcg gacaggtatc cggtaagcgg cagggtcgga 1560
acaggagagc gcacgaggga gcttccaggg ggaaacgcct ggtatcttta tagtcctgtc 1620
gggtttcgcc acctctgact tgagcgtcga tttttgtgat gctcgtcagg ggggcggagc 1680
ctatggaaaa acgccagcaa cgcggccttt ttacggttcc tggccttttg ctggcctttt 1740
gctcacatgt tctttcctgc gttatcccct gattctgtgg ataaccgtat taccgccttt 1800
gagtgagctg ataccgctcg ccgcagccga acgaccgagc gcagcgagtc agtgagcgag 1860
gaagcggaag agcgcccaat acgcaaaccg cctctccccg cgcgttggcc gattcattaa 1920
tgcagctggc acgacaggtt tcccgactgg aaagcgggca gtgagcgcaa cgcaattaat 1980
gtgagttagc tcactcatta ggcaccccag gctttacact ttatgcttcc ggctcgtatg 2040
ttgtgtggaa ttgtgagcgg ataacaattt cacacaggaa acagctatga ccatgattac 2100
accaagcttg catgcaggcc tatccgtaga tgtacctgga catccaggtg atgccggcgg 2160
cggtggtgga ggcgcgcgga aagtcgcgga cgcggttcca gatgttgcgc agcggcaaaa 2220
agtgctccat ggtcgggacg ctctggccgg tgaggcgtgc gcagtcgttg acgctctaga 2280
ccgtgcaaaa ggagagcctg taagcgggca ctcttccgtg gtctggtgga taaattcgca 2340
agggtatcat ggcggacgac cggggttcga accccggatc cggccgtccg ccgtgatcca 2400
tgcggttacc gcccgcgtgt cgaacccagg tgtgcgacgt cagacaacgg gggagcgctc 2460
cttttggctt ccttccaggc gcggcggctg ctgcgctagc ttttttggcc actggccgcg 2520
cgcggcgtaa gcggttaggc tggaaagcga aagcattaag tggctcgctc cctgtagccg 2580
gagggttatt ttccaagggt tgagtcgcag gacccccggt tcgagtctcg ggccggccgg 2640
actgcggcga acgggggttt gcctccccgt catgcaagac cccgcttgca aattcctccg 2700
gaaacaggga cgagcccctt ttttgctttt cccagatgca tccggtgctg cggcagatgc 2760
gcccccctcc tcagcagcgg caagagcaag agcagcggca gacatgcagg gcaccctccc 2820
cttctcctac cgcgtcagga ggggcaacat ctgtacactc tcgggtgatt atttaccccc 2880
acccttgccg tctgcgccgt ttaaaaatca aaggggttct gccgcgcatc gctatgcgcc 2940
actggcaggg acacgttgcg atactggtgt ttagtgctcc acttaaactc aggcacaacc 3000
atccgcggca gctcggtgaa gttttcactc cacaggctgc gcaccatcac caacgcgttt 3060
agcaggtcgg gcgccgatat cttgaagtcg cagttggggc ctccgccctg cgcgcgcgag 3120
ttgcgataca cagggttgca gcactggaac actatcagcg ccgggtggtg cacgctggcc 3180
agcacgctct tgtcggagat cagatccgcg tccaggtcct ccgcgttgct cagggcgaac 3240
ggagtcaact ttggtagctg ccttcccaaa aagggcgcgt gcccaggctt tgagttgcac 3300
tcgcaccgta gtggcatcaa aaggtgaccg tgcccggtct gggcgttagg atacagcgcc 3360
tgcataaaag ccttgatctg cttaaaagcc acctgagcct ttgcgccttc agagaagaac 3420
atgccgcaag acttgccgga aaactgattg gccggacagg ccgcgtcgtg cacgcagcac 3480
cttgcgtcgg tgttggagat ctgcaccaca tttcggcccc accggttctt cacgatcttg 3540
gccttgctag actgctcctt cagcgcgcgc tgcccgtttt cgctcgtcac atccatttca 3600
atcacgtgct ccttatttat cataatgctt ccgtgtagac acttaagctc gccttcgatc 3660
tcagcgcagc ggtgcagcca caacgcgcag cccgtgggct cgtgatgctt gtaggtcacc 3720
tctgcaaacg actgcaggta cgcctgcagg aatcgcccca tcatcgtcac aaaggtcttg 3780
ttgctggtga aggtcagctg caacccgcgg tgctcctcgt tcagccaggt cttgcatacg 3840
gccgccagag cttccacttg gtcaggcagt agtttgaagt tcgcctttag atcgttatcc 3900
acgtggtact tgtccatcag cgcgcgcgca gcctccatgc ccttctccca cgcagacacg 3960
atcggcacac tcagcgggtt catcaccgta atttcacttt ccgcttcgct gggctcttcc 4020
tcttcctctt gcgtccgcat accacgcgcc actgggtcgt cttcattcag ccgccgcact 4080
gtgcgcttac ctcctttgcc atgcttgatt agcaccggtg ggttgctgaa acccaccatt 4140
tgtagcgcca catcttctct ttcttcctcg ctgtccacga ttacctctgg tgatggcggg 4200
cgctcgggct tgggagaagg gcgcttcttt ttcttcttgg gcgcaatggc caaatccgcc 4260
gccgaggtcg atggccgcgg gctgggtgtg cgcggcacca gcgcgtcttg tgatgagtct 4320
tcctcgtcct cggactcgat acgccgcctc atccgctttt ttgggggcgc ccggggaggc 4380
ggcggcgacg gggacgggga cgacacgtcc tccatggttg ggggacgtcg cgccgcaccg 4440
cgtccgcgct cgggggtggt ttcgcgctgc tcctcttccc gactggccat ttccttctcc 4500
tataggcaga aaaagatcca caaaagcgaa gatcagcttc ggcgcacgct ggaagacgcg 4560
gaggctctct tcagtaaata ctgcgcgctg actcttaagg actagtttcg cgccctttct 4620
caaatttaag cgcgaaaact acgtcatctc cagcggccac acccggcgcc agcacctgtt 4680
gtcagcgcca ttggcgcgcc ggccggccga atatcttcat ttaaatgttt aaacatcgat 4740
gcggccgccg tttgtgttat gtttcaacgt gtttattttt caattgcaga aaatttcaag 4800
tcatttttca ttcagtagta tagccccacc accacatagc ttatacagat caccgtacct 4860
taatcaaact cacagaaccc tagtattcaa cctgccacct ccctcccaac acacagagta 4920
cacagtcctt tctccccggc tggccttaaa aagcatcata tcatgggtaa cagacatatt 4980
cttaggtgtt atattccaca cggtttcctg tcgagccaaa cgctcatcag tgatattaat 5040
aaactccccg ggcagctcac ttaagttcat gtcgctgtcc agctgctgag ccacaggctg 5100
ctgtccaact tgcggttgct taacgggcgg cgaaggagaa gtccacgcct acatgggggt 5160
agagtcataa tcgtgcatca ggatagggcg gtggtgctgc agcagcgcgc gaataaactg 5220
ctgccgccgc cgctccgtcc tgcaggaata caacatggca gtggtctcct cagcgatgat 5280
tcgcaccgcc cgcagcataa ggcgccttgt cctccgggca cagcagcgca ccctgatctc 5340
acttaaatca gcacagtaac tgcagcacag caccacaata ttgttcaaaa tcccacagtg 5400
caaggcgctg tatccaaagc tcatggcggg gaccacagaa cccacgtggc catcatacca 5460
caagcgcagg tagattaagt ggcgacccct cataaacacg ctggacataa acattacctc 5520
ttttggcatg ttgtaattca ccacctcccg gtaccatata aacctctgat taaacatggc 5580
gccatccacc accatcctaa accagctggc caaaacctgc ccgccggcta tacactgcag 5640
ggaaccggga ctggaacaat gacagtggag agcccaggac tcgtaaccat ggatcatcat 5700
gctcgtcatg atatcaatgt tggcacaaca caggcacacg tgcatacact tcctcaggat 5760
tacaagctcc tcccgcgtta gaaccatatc ccagggaaca acccattcct gaatcagcgt 5820
aaatcccaca ctgcagggaa gacctcgcac gtaactcacg ttgtgcattg tcaaagtgtt 5880
acattcgggc agcagcggat gatcctccag tatggtagcg cgggtttctg tctcaaaagg 5940
aggtagacga tccctactgt acggagtgcg ccgagacaac cgagatcgtg ttggtcgtag 6000
tgtcatgcca aatggaacgc cggacgtagt catatttcct gaagcaaaac caggtgcggg 6060
cgtgacaaac agatctgcgt ctccggtctc gccgcttaga tcgctctgtg tagtagttgt 6120
agtatatcca ctctctcaaa gcatccaggc gccccctggc ttcgggttct atgtaaactc 6180
cttcatgcgc cgctgccctg ataacatcca ccaccgcaga ataagccaca cccagccaac 6240
ctacacattc gttctgcgag tcacacacgg gaggagcggg aagagctgga agaaccatgt 6300
tttttttttt attccaaaag attatccaaa acctcaaaat gaagatctat taagtgaacg 6360
cgctcccctc cggtggcgtg gtcaaactct acagccaaag aacagataat ggcatttgta 6420
agatgttgca caatggcttc caaaaggcaa acggccctca cgtccaagtg gacgtaaagg 6480
ctaaaccctt cagggtgaat ctcctctata aacattccag caccttcaac catgcccaaa 6540
taattctcat ctcgccacct tctcaatata tctctaagca aatcccgaat attaagtccg 6600
gccattgtaa aaatctgctc cagagcgccc tccaccttca gcctcaagca gcgaatcatg 6660
attgcaaaaa ttcaggttcc tcacagacct gtataagatt caaaagcgga acattaacaa 6720
aaataccgcg atcccgtagg tcccttcgca gggccagctg aacataatcg tgcaggtctg 6780
cacggaccag cgcggccact tccccgccag gaaccatgac aaaagaaccc acactgatta 6840
tgacacgcat actcggagct atgctaacca gcgtagcccc gatgtaagct tgttgcatgg 6900
gcggcgatat aaaatgcaag gtgctgctca aaaaatcagg caaagcctcg cgcaaaaaag 6960
aaagcacatc gtagtcatgc tcatgcagat aaaggcaggt aagctccgga accaccacag 7020
aaaaagacac catttttctc tcaaacatgt ctgcgggttt ctgcataaac acaaaataaa 7080
ataacaaaaa aacatttaaa cattagaagc ctgtcttaca acaggaaaaa caacccttat 7140
aagcataaga cggactacgg ccatgccggc gtgaccgtaa aaaaactggt caccgtgatt 7200
aaaaagcacc accgacagct cctcggtcat gtccggagtc ataatgtaag actcggtaaa 7260
cacatcaggt tgattcacat cggtcagtgc taaaaagcga ccgaaatagc ccgggggaat 7320
acatacccgc aggcgtagag acaacattac agcccccata ggaggtataa caaaattaat 7380
aggagagaaa aacacataaa cacctgaaaa accctcctgc ctaggcaaaa tagcaccctc 7440
ccgctccaga acaacataca gcgcttccac agcggcagcc ataacagtca gccttaccag 7500
taaaaaagaa aacctattaa aaaaacacca ctcgacacgg caccagctca atcagtcaca 7560
gtgtaaaaaa gggccaagtg cagagcgagt atatatagga ctaaaaaatg acgtaacggt 7620
taaagtccac aaaaaacacc cagaaaaccg cacgcgaacc tacgcccaga aacgaaagcc 7680
aaaaaaccca caacttcctc aaatcgtcac ttccgttttc ccacgttacg tcacttccca 7740
ttttaagaaa actacaattc ccaacacata caagttactc cgcccttaat taaatcggat 7800
ccgatatcta gatgtattcg cgaggtaccg agctcgaatt ctctggccgt cgttttacaa 7860
cgtcgtgact gggaaaaccc tggcgttacc caacttaatc gccttgcagc acatccccct 7920
ttcgccagct ggcgtaatag cgaagaggcc cgcaccgatc gcccttccca acagttgcgc 7980
agcctgaatg gcgaatggcg cctgatgcgg tattttctcc ttacgcatct gtgcggtatt 8040
tcacaccgca tatggtgcac tctcagtaca atctgctctg atgccgcata gttaagccag 8100
ccccgacacc cgccaacacc cgctgacgcg ccctgacggg cttgtctgct cccggcatcc 8160
gcttacagac aagctgtgac cgtctccggg agctgcatgt gtcagaggtt ttcaccgtca 8220
tcaccgaaac gcgcga 8236
<210> 44
<211> 8371
<212> DNA
<213> Artificial Sequence
<220>
<223> Description of Artificial Sequence: Synthetic polynucleotide
<400> 44
tgcagctctg gcccgtgtct caaaatctct gatgttacat tgcacaagat aaaaatatat 60
catcatgaac aataaaactg tctgcttaca taaacagtaa tacaaggggt gttatgagcc 120
atattcaacg ggaaacgtcg aggccgcgat taaattccaa catggatgct gatttatatg 180
ggtataaatg ggctcgcgat aatgtcgggc aatcaggtgc gacaatctat cgcttgtatg 240
ggaagcccga tgcgccagag ttgtttctga aacatggcaa aggtagcgtt gccaatgatg 300
ttacagatga gatggtcaga ctaaactggc tgacggaatt tatgcctctt ccgaccatca 360
agcattttat ccgtactcct gatgatgcat ggttactcac cactgcgatc cccggaaaaa 420
cagcattcca ggtattagaa gaatatcctg attcaggtga aaatattgtt gatgcgctgg 480
cagtgttcct gcgccggttg cattcgattc ctgtttgtaa ttgtcctttt aacagcgatc 540
gcgtatttcg tctcgctcag gcgcaatcac gaatgaataa cggtttggtt gatgcgagtg 600
attttgatga cgagcgtaat ggctggcctg ttgaacaagt ctggaaagaa atgcataaac 660
ttttgccatt ctcaccggat tcagtcgtca ctcatggtga tttctcactt gataacctta 720
tttttgacga ggggaaatta ataggttgta ttgatgttgg acgagtcgga atcgcagacc 780
gataccagga tcttgccatc ctatggaact gcctcggtga gttttctcct tcattacaga 840
aacggctttt tcaaaaatat ggtattgata atcctgatat gaataaattg cagtttcatt 900
tgatgctcga tgagtttttc taatcagaat tggttaattg gttgtaacat tattcagatt 960
gggcttgatt taaaacttca tttttaattt aaaaggatct aggtgaagat cctttttgat 1020
aatctcatga ccaaaatccc ttaacgtgag ttttcgttcc actgagcgtc agaccccgta 1080
gaaaagatca aaggatcttc ttgagatcct ttttttctgc gcgtaatctg ctgcttgcaa 1140
acaaaaaaac caccgctacc agcggtggtt tgtttgccgg atcaagagct accaactctt 1200
tttccgaagg taactggctt cagcagagcg cagataccaa atactgttct tctagtgtag 1260
ccgtagttag gccaccactt caagaactct gtagcaccgc ctacatacct cgctctgcta 1320
atcctgttac cagtggctgc tgccagtggc gataagtcgt gtcttaccgg gttggactca 1380
agacgatagt taccggataa ggcgcagcgg tcgggctgaa cggggggttc gtgcacacag 1440
cccagcttgg agcgaacgac ctacaccgaa ctgagatacc tacagcgtga gctatgagaa 1500
agcgccacgc ttcccgaagg gagaaaggcg gacaggtatc cggtaagcgg cagggtcgga 1560
acaggagagc gcacgaggga gcttccaggg ggaaacgcct ggtatcttta tagtcctgtc 1620
gggtttcgcc acctctgact tgagcgtcga tttttgtgat gctcgtcagg ggggcggagc 1680
ctatggaaaa acgccagcaa cgcggccttt ttacggttcc tggccttttg ctggcctttt 1740
gctcacatgt tctttcctgc gttatcccct gattctgtgg ataaccgtat taccgccttt 1800
gagtgagctg ataccgctcg ccgcagccga acgaccgagc gcagcgagtc agtgagcgag 1860
gaagcggaag agcgcccaat acgcaaaccg cctctccccg cgcgttggcc gattcattaa 1920
tgcagctggc acgacaggtt tcccgactgg aaagcgggca gtgagcgcaa cgcaattaat 1980
gtgagttagc tcactcatta ggcaccccag gctttacact ttatgcttcc ggctcgtatg 2040
ttgtgtggaa ttgtgagcgg ataacaattt cacacaggaa acagctatga ccatgattac 2100
accaagcttg catgcaggcc tatccgtaga tgtacctgga catccaggtg atgccggcgg 2160
cggtggtgga ggcgcgcgga aagtcgcgga cgcggttcca gatgttgcgc agcggcaaaa 2220
agtgctccat ggtcgggacg ctctggccgg tgaggcgtgc gcagtcgttg acgctctaga 2280
ccgtgcaaaa ggagagcctg taagcgggca ctcttccgtg gtctggtgga taaattcgca 2340
agggtatcat ggcggacgac cggggttcga accccggatc cggccgtccg ccgtgatcca 2400
tgcggttacc gcccgcgtgt cgaacccagg tgtgcgacgt cagacaacgg gggagcgctc 2460
cttttggctt ccttccaggc gcggcggctg ctgcgctagc ttttttggcc actggccgcg 2520
cgcggcgtaa gcggttaggc tggaaagcga aagcattaag tggctcgctc cctgtagccg 2580
gagggttatt ttccaagggt tgagtcgcag gacccccggt tcgagtctcg ggccggccgg 2640
actgcggcga acgggggttt gcctccccgt catgcaagac cccgcttgca aattcctccg 2700
gaaacaggga cgagcccctt ttttgctttt cccagatgca tccggtgctg cggcagatgc 2760
gcccccctcc tcagcagcgg caagagcaag agcagcggca gacatgcagg gcaccctccc 2820
cttctcctac cgcgtcagga ggggcaacat cgatccagac atgataagat acattgatga 2880
gtttggacaa accacaacta gaatgcagtg aaaaaaatgc tttatttgtg aaatttgtga 2940
tgctattgct ttatttgtaa ccattataag ctgcaataaa caagtttgta cactctcggg 3000
tgattattta cccccaccct tgccgtctgc gccgtttaaa aatcaaaggg gttctgccgc 3060
gcatcgctat gcgccactgg cagggacacg ttgcgatact ggtgtttagt gctccactta 3120
aactcaggca caaccatccg cggcagctcg gtgaagtttt cactccacag gctgcgcacc 3180
atcaccaacg cgtttagcag gtcgggcgcc gatatcttga agtcgcagtt ggggcctccg 3240
ccctgcgcgc gcgagttgcg atacacaggg ttgcagcact ggaacactat cagcgccggg 3300
tggtgcacgc tggccagcac gctcttgtcg gagatcagat ccgcgtccag gtcctccgcg 3360
ttgctcaggg cgaacggagt caactttggt agctgccttc ccaaaaaggg cgcgtgccca 3420
ggctttgagt tgcactcgca ccgtagtggc atcaaaaggt gaccgtgccc ggtctgggcg 3480
ttaggataca gcgcctgcat aaaagccttg atctgcttaa aagccacctg agcctttgcg 3540
ccttcagaga agaacatgcc gcaagacttg ccggaaaact gattggccgg acaggccgcg 3600
tcgtgcacgc agcaccttgc gtcggtgttg gagatctgca ccacatttcg gccccaccgg 3660
ttcttcacga tcttggcctt gctagactgc tccttcagcg cgcgctgccc gttttcgctc 3720
gtcacatcca tttcaatcac gtgctcctta tttatcataa tgcttccgtg tagacactta 3780
agctcgcctt cgatctcagc gcagcggtgc agccacaacg cgcagcccgt gggctcgtga 3840
tgcttgtagg tcacctctgc aaacgactgc aggtacgcct gcaggaatcg ccccatcatc 3900
gtcacaaagg tcttgttgct ggtgaaggtc agctgcaacc cgcggtgctc ctcgttcagc 3960
caggtcttgc atacggccgc cagagcttcc acttggtcag gcagtagttt gaagttcgcc 4020
tttagatcgt tatccacgtg gtacttgtcc atcagcgcgc gcgcagcctc catgcccttc 4080
tcccacgcag acacgatcgg cacactcagc gggttcatca ccgtaatttc actttccgct 4140
tcgctgggct cttcctcttc ctcttgcgtc cgcataccac gcgccactgg gtcgtcttca 4200
ttcagccgcc gcactgtgcg cttacctcct ttgccatgct tgattagcac cggtgggttg 4260
ctgaaaccca ccatttgtag cgccacatct tctctttctt cctcgctgtc cacgattacc 4320
tctggtgatg gcgggcgctc gggcttggga gaagggcgct tctttttctt cttgggcgca 4380
atggccaaat ccgccgccga ggtcgatggc cgcgggctgg gtgtgcgcgg caccagcgcg 4440
tcttgtgatg agtcttcctc gtcctcggac tcgatacgcc gcctcatccg cttttttggg 4500
ggcgcccggg gaggcggcgg cgacggggac ggggacgaca cgtcctccat ggttggggga 4560
cgtcgcgccg caccgcgtcc gcgctcgggg gtggtttcgc gctgctcctc ttcccgactg 4620
gccatttcct tctcctatag gcagaaaaag atccacaaaa gcgaagatca gcttcggcgc 4680
acgctggaag acgcggaggc tctcttcagt aaatactgcg cgctgactct taaggactag 4740
tttcgcgccc tttctcaaat ttaagcgcga aaactacgtc atctccagcg gccacacccg 4800
gcgccagcac ctgttgtcag cgccattggc gcgccggccg gccgaatatc ttcatttaaa 4860
tgtttaaaca tcgatgcggc cgccgtttgt gttatgtttc aacgtgttta tttttcaatt 4920
gcagaaaatt tcaagtcatt tttcattcag tagtatagcc ccaccaccac atagcttata 4980
cagatcaccg taccttaatc aaactcacag aaccctagta ttcaacctgc cacctccctc 5040
ccaacacaca gagtacacag tcctttctcc ccggctggcc ttaaaaagca tcatatcatg 5100
ggtaacagac atattcttag gtgttatatt ccacacggtt tcctgtcgag ccaaacgctc 5160
atcagtgata ttaataaact ccccgggcag ctcacttaag ttcatgtcgc tgtccagctg 5220
ctgagccaca ggctgctgtc caacttgcgg ttgcttaacg ggcggcgaag gagaagtcca 5280
cgcctacatg ggggtagagt cataatcgtg catcaggata gggcggtggt gctgcagcag 5340
cgcgcgaata aactgctgcc gccgccgctc cgtcctgcag gaatacaaca tggcagtggt 5400
ctcctcagcg atgattcgca ccgcccgcag cataaggcgc cttgtcctcc gggcacagca 5460
gcgcaccctg atctcactta aatcagcaca gtaactgcag cacagcacca caatattgtt 5520
caaaatccca cagtgcaagg cgctgtatcc aaagctcatg gcggggacca cagaacccac 5580
gtggccatca taccacaagc gcaggtagat taagtggcga cccctcataa acacgctgga 5640
cataaacatt acctcttttg gcatgttgta attcaccacc tcccggtacc atataaacct 5700
ctgattaaac atggcgccat ccaccaccat cctaaaccag ctggccaaaa cctgcccgcc 5760
ggctatacac tgcagggaac cgggactgga acaatgacag tggagagccc aggactcgta 5820
accatggatc atcatgctcg tcatgatatc aatgttggca caacacaggc acacgtgcat 5880
acacttcctc aggattacaa gctcctcccg cgttagaacc atatcccagg gaacaaccca 5940
ttcctgaatc agcgtaaatc ccacactgca gggaagacct cgcacgtaac tcacgttgtg 6000
cattgtcaaa gtgttacatt cgggcagcag cggatgatcc tccagtatgg tagcgcgggt 6060
ttctgtctca aaaggaggta gacgatccct actgtacgga gtgcgccgag acaaccgaga 6120
tcgtgttggt cgtagtgtca tgccaaatgg aacgccggac gtagtcatat ttcctgaagc 6180
aaaaccaggt gcgggcgtga caaacagatc tgcgtctccg gtctcgccgc ttagatcgct 6240
ctgtgtagta gttgtagtat atccactctc tcaaagcatc caggcgcccc ctggcttcgg 6300
gttctatgta aactccttca tgcgccgctg ccctgataac atccaccacc gcagaataag 6360
ccacacccag ccaacctaca cattcgttct gcgagtcaca cacgggagga gcgggaagag 6420
ctggaagaac catgtttttt tttttattcc aaaagattat ccaaaacctc aaaatgaaga 6480
tctattaagt gaacgcgctc ccctccggtg gcgtggtcaa actctacagc caaagaacag 6540
ataatggcat ttgtaagatg ttgcacaatg gcttccaaaa ggcaaacggc cctcacgtcc 6600
aagtggacgt aaaggctaaa cccttcaggg tgaatctcct ctataaacat tccagcacct 6660
tcaaccatgc ccaaataatt ctcatctcgc caccttctca atatatctct aagcaaatcc 6720
cgaatattaa gtccggccat tgtaaaaatc tgctccagag cgccctccac cttcagcctc 6780
aagcagcgaa tcatgattgc aaaaattcag gttcctcaca gacctgtata agattcaaaa 6840
gcggaacatt aacaaaaata ccgcgatccc gtaggtccct tcgcagggcc agctgaacat 6900
aatcgtgcag gtctgcacgg accagcgcgg ccacttcccc gccaggaacc atgacaaaag 6960
aacccacact gattatgaca cgcatactcg gagctatgct aaccagcgta gccccgatgt 7020
aagcttgttg catgggcggc gatataaaat gcaaggtgct gctcaaaaaa tcaggcaaag 7080
cctcgcgcaa aaaagaaagc acatcgtagt catgctcatg cagataaagg caggtaagct 7140
ccggaaccac cacagaaaaa gacaccattt ttctctcaaa catgtctgcg ggtttctgca 7200
taaacacaaa ataaaataac aaaaaaacat ttaaacatta gaagcctgtc ttacaacagg 7260
aaaaacaacc cttataagca taagacggac tacggccatg ccggcgtgac cgtaaaaaaa 7320
ctggtcaccg tgattaaaaa gcaccaccga cagctcctcg gtcatgtccg gagtcataat 7380
gtaagactcg gtaaacacat caggttgatt cacatcggtc agtgctaaaa agcgaccgaa 7440
atagcccggg ggaatacata cccgcaggcg tagagacaac attacagccc ccataggagg 7500
tataacaaaa ttaataggag agaaaaacac ataaacacct gaaaaaccct cctgcctagg 7560
caaaatagca ccctcccgct ccagaacaac atacagcgct tccacagcgg cagccataac 7620
agtcagcctt accagtaaaa aagaaaacct attaaaaaaa caccactcga cacggcacca 7680
gctcaatcag tcacagtgta aaaaagggcc aagtgcagag cgagtatata taggactaaa 7740
aaatgacgta acggttaaag tccacaaaaa acacccagaa aaccgcacgc gaacctacgc 7800
ccagaaacga aagccaaaaa acccacaact tcctcaaatc gtcacttccg ttttcccacg 7860
ttacgtcact tcccatttta agaaaactac aattcccaac acatacaagt tactccgccc 7920
ttaattaaat cggatccgat atctagatgt attcgcgagg taccgagctc gaattctctg 7980
gccgtcgttt tacaacgtcg tgactgggaa aaccctggcg ttacccaact taatcgcctt 8040
gcagcacatc cccctttcgc cagctggcgt aatagcgaag aggcccgcac cgatcgccct 8100
tcccaacagt tgcgcagcct gaatggcgaa tggcgcctga tgcggtattt tctccttacg 8160
catctgtgcg gtatttcaca ccgcatatgg tgcactctca gtacaatctg ctctgatgcc 8220
gcatagttaa gccagccccg acacccgcca acacccgctg acgcgccctg acgggcttgt 8280
ctgctcccgg catccgctta cagacaagct gtgaccgtct ccgggagctg catgtgtcag 8340
aggttttcac cgtcatcacc gaaacgcgcg a 8371
<210> 45
<211> 8888
<212> DNA
<213> Artificial Sequence
<220>
<223> Description of Artificial Sequence: Synthetic polynucleotide
<400> 45
tgcagctctg gcccgtgtct caaaatctct gatgttacat tgcacaagat aaaaatatat 60
catcatgaac aataaaactg tctgcttaca taaacagtaa tacaaggggt gttatgagcc 120
atattcaacg ggaaacgtcg aggccgcgat taaattccaa catggatgct gatttatatg 180
ggtataaatg ggctcgcgat aatgtcgggc aatcaggtgc gacaatctat cgcttgtatg 240
ggaagcccga tgcgccagag ttgtttctga aacatggcaa aggtagcgtt gccaatgatg 300
ttacagatga gatggtcaga ctaaactggc tgacggaatt tatgcctctt ccgaccatca 360
agcattttat ccgtactcct gatgatgcat ggttactcac cactgcgatc cccggaaaaa 420
cagcattcca ggtattagaa gaatatcctg attcaggtga aaatattgtt gatgcgctgg 480
cagtgttcct gcgccggttg cattcgattc ctgtttgtaa ttgtcctttt aacagcgatc 540
gcgtatttcg tctcgctcag gcgcaatcac gaatgaataa cggtttggtt gatgcgagtg 600
attttgatga cgagcgtaat ggctggcctg ttgaacaagt ctggaaagaa atgcataaac 660
ttttgccatt ctcaccggat tcagtcgtca ctcatggtga tttctcactt gataacctta 720
tttttgacga ggggaaatta ataggttgta ttgatgttgg acgagtcgga atcgcagacc 780
gataccagga tcttgccatc ctatggaact gcctcggtga gttttctcct tcattacaga 840
aacggctttt tcaaaaatat ggtattgata atcctgatat gaataaattg cagtttcatt 900
tgatgctcga tgagtttttc taatcagaat tggttaattg gttgtaacat tattcagatt 960
gggcttgatt taaaacttca tttttaattt aaaaggatct aggtgaagat cctttttgat 1020
aatctcatga ccaaaatccc ttaacgtgag ttttcgttcc actgagcgtc agaccccgta 1080
gaaaagatca aaggatcttc ttgagatcct ttttttctgc gcgtaatctg ctgcttgcaa 1140
acaaaaaaac caccgctacc agcggtggtt tgtttgccgg atcaagagct accaactctt 1200
tttccgaagg taactggctt cagcagagcg cagataccaa atactgttct tctagtgtag 1260
ccgtagttag gccaccactt caagaactct gtagcaccgc ctacatacct cgctctgcta 1320
atcctgttac cagtggctgc tgccagtggc gataagtcgt gtcttaccgg gttggactca 1380
agacgatagt taccggataa ggcgcagcgg tcgggctgaa cggggggttc gtgcacacag 1440
cccagcttgg agcgaacgac ctacaccgaa ctgagatacc tacagcgtga gctatgagaa 1500
agcgccacgc ttcccgaagg gagaaaggcg gacaggtatc cggtaagcgg cagggtcgga 1560
acaggagagc gcacgaggga gcttccaggg ggaaacgcct ggtatcttta tagtcctgtc 1620
gggtttcgcc acctctgact tgagcgtcga tttttgtgat gctcgtcagg ggggcggagc 1680
ctatggaaaa acgccagcaa cgcggccttt ttacggttcc tggccttttg ctggcctttt 1740
gctcacatgt tctttcctgc gttatcccct gattctgtgg ataaccgtat taccgccttt 1800
gagtgagctg ataccgctcg ccgcagccga acgaccgagc gcagcgagtc agtgagcgag 1860
gaagcggaag agcgcccaat acgcaaaccg cctctccccg cgcgttggcc gattcattaa 1920
tgcagctggc acgacaggtt tcccgactgg aaagcgggca gtgagcgcaa cgcaattaat 1980
gtgagttagc tcactcatta ggcaccccag gctttacact ttatgcttcc ggctcgtatg 2040
ttgtgtggaa ttgtgagcgg ataacaattt cacacaggaa acagctatga ccatgattac 2100
accaagcttg catgcaggcc tctgcagtcg accagaagca ccatgtcctt gggtccggcc 2160
tgctgaatgc gcaggcggtc ggccatgccc caggcttcgt tttgacatcg gcgcaggtct 2220
ttgtagtagt cttgcatgag cctttctacc ggcacttctt cttctccttc ctcttgtcct 2280
gcatctcttg catctatcgc tgcggcggcg gcggagtttg gccgtaggtg gcgccctctt 2340
cctcccatgc gtgtgacccc gaagcccctc atcggctgaa gcagggctag gtcggcgaca 2400
acgcgctcgg ctaatatggc ctgctgcacc tgcgtgaggg tagactggaa gtcatccatg 2460
tccacaaagc ggtggtatgc gcccgtgttg atggtgtaag tgcagttggc cataacggac 2520
cagttaacgg tctggtgacc cggctgcgag agctcggtgt acctgagacg cgagtaagcc 2580
ctcgagtcaa atacgtagtc gttgcaagtc cgcaccaggt actggtatcc caccaaaaag 2640
tgcggcggcg gctggcggta gaggggccag cgtagggtgg ccggggctcc gggggcgaga 2700
tcttccaaca taaggcgatg atatccgtag atgtacctgg acatccaggt gatgccggcg 2760
gcggtggtgg aggcgcgcgg aaagtcgcgg acgcggttcc agatgttgcg cagcggcaaa 2820
aagtgctcca tggtcgggac gctctggccg gtcaggcgcg cgcaatcgtt gacgctctag 2880
cgtgcaaaag gagagcctgt aagcgggcac tcttccgtgg tctggtggat aaattcgcaa 2940
gggtatcatg gcggacgacc ggggttcgag ccccgtatcc ggccgtccgc cgtgatccat 3000
gcggttaccg cccgcgtgtc gaacccaggt gtgcgacgtc agacaacggg ggagtgctcc 3060
ttttggcttc cttccaggcg cggcggctgc tgcgctagct tttttggcca ctggccgcgc 3120
gcagcgtaag cggttaggct ggaaagcgaa agcattaagt ggctcgctcc ctgtagccgg 3180
agggttattt tccaagggtt gagtcgcggg acccccggtt cgagtctcgg accgagactg 3240
ggggcgtaca ctggatggcc tttgcctgga acccgcactc aaaaacatgc tacctctttg 3300
agccctttgg cttttctgac cagcgactca agcaggttta ccagtttgag tacgagtcac 3360
tcctgcgccg tagcgccatt gcttcttccc ccgaccgctg tataacgctg gaaaagtcca 3420
cccaaagcgt acaggggccc aactcggccg cctgtggact attctgctgc atgtttctcc 3480
acgcctttgc caactggccc caaactccca tggatcacaa ccccaccatg aaccttatta 3540
ccggggtacc caactccatg ctcaacagtc cccaggtaca gcccaccctg cgtcgcaacc 3600
aggaacagct ctacagcttc ctggagcgcc actcgcccta cttccgcagc cacagtgcgc 3660
agattaggag cgccacttct ttttgtcact tgaaaaacat gtaaaaataa tgtactagag 3720
acactttcaa taaaggcaaa tgcttttatt tgtacactct cgggtgatta tttaccccca 3780
cccttgccgt ctgcgccgtt taaaaatcaa aggggttctg ccgcgcatcg ctatgcgcca 3840
ctggcaggga cacgttgcga tactggtgtt tagtgctcca cttaaactca ggcacaacca 3900
tccgcggcag ctcggtgaag ttttcactcc acaggctgcg caccatcacc aacgcgttta 3960
gcaggtcggg cgccgatatc ttgaagtcgc agttggggcc tccgccctgc gcgcgcgagt 4020
tgcgatacac agggttgcag cactggaaca ctatcagcgc cgggtggtgc acgctggcca 4080
gcacgctctt gtcggagatc agatccgcgt ccaggtcctc cgcgttgctc agggcgaacg 4140
gagtcaactt tggtagctgc cttcccaaaa agggcgcgtg cccaggcttt gagttgcact 4200
cgcaccgtag tggcatcaaa aggtgaccgt gcccggtctg ggcgttagga tacagcgcct 4260
gcataaaagc cttgatctgc ttaaaagcca cctgagcctt tgcgccttca gagaagaaca 4320
tgccgcaaga cttgccggaa aactgattgg ccggacaggc cgcgtcgtgc acgcagcacc 4380
ttgcgtcggt gttggagatc tgcaccacat ttcggcccca ccggttcttc acgatcttgg 4440
ccttgctaga ctgctccttc agcgcgcgct gcccgttttc gctcgtcaca tccatttcaa 4500
tcacgtgctc cttatttatc ataatgcttc cgtgtagaca cttaagctcg ccttcgatct 4560
cagcgcagcg gtgcagccac aacgcgcagc ccgtgggctc gtgatgcttg taggtcacct 4620
ctgcaaacga ctgcaggtac gcctgcagga atcgccccat catcgtcaca aaggtcttgt 4680
tgctggtgaa ggtcagctgc aacccgcggt gctcctcgtt cagccaggtc ttgcatacgg 4740
ccgccagagc ttccacttgg tcaggcagta gtttgaagtt cgcctttaga tcgttatcca 4800
cgtggtactt gtccatcagc gcgcgcgcag cctccatgcc cttctcccac gcagacacga 4860
tcggcacact cagcgggttc atcaccgtaa tttcactttc cgcttcgctg ggctcttcct 4920
cttcctcttg cgtccgcata ccacgcgcca ctgggtcgtc ttcattcagc cgccgcactg 4980
tgcgcttacc tcctttgcca tgcttgatta gcaccggtgg gttgctgaaa cccaccattt 5040
gtagcgccac atcttctctt tcttcctcgc tgtccacgat tacctctggt gatggcgggc 5100
gctcgggctt gggagaaggg cgcttctttt tcttcttggg cgcaatggcc aaatccgccg 5160
ccgaggtcga tggccgcggg ctgggtgtgc gcggcaccag cgcgtcttgt gatgagtctt 5220
cctcgtcctc ggactcgata cgccgcctca tccgcttttt tgggggcgcc cggggaggcg 5280
gcggcgacgg ggacggggac gacacgtcct ccatggttgg gggacgtcgc gccgcaccgc 5340
gtccgcgctc gggggtggtt tcgcgctgct cctcttcccg actggccatt tccttctcct 5400
ataggcagaa aaagatccac aaaagcgaag atcagcttcg gcgcacgctg gaagacgcgg 5460
aggctctctt cagtaaatac tgcgcgctga ctcttaagga ctagtttcgc gccctttctc 5520
aaatttaagc gcgaaaacta cgtcatctcc agcggccaca cccggcgcca gcacctgttg 5580
tcagcgccat tggcgcgccg gccggccgaa tatcttcatt taaatgttta aacatcgatg 5640
cggccgcaac ttgtttattg cagcttataa tggttacaaa taaagcaata gcatcacaaa 5700
tttcacaaat aaagcatttt tttcactgca ttctagttgt ggtttgtcca aactcatcaa 5760
tgtatcttag cttaacgggc ggcgaaggag aagtccacgc ctacatgggg gtagagtcat 5820
aatcgtgcat caggataggg cggtggtgct gcagcagcgc gcgaataaac tgctgccgcc 5880
gccgctccgt cctgcaggaa tacaacatgg cagtggtctc ctcagcgatg attcgcaccg 5940
cccgcagcat aaggcgcctt gtcctccggg cacagcagcg caccctgatc tcacttaaat 6000
cagcacagta actgcagcac agcaccacaa tattgttcaa aatcccacag tgcaaggcgc 6060
tgtatccaaa gctcatggcg gggaccacag aacccacgtg gccatcatac cacaagcgca 6120
ggtagattaa gtggcgaccc ctcataaaca cgctggacat aaacattacc tcttttggca 6180
tgttgtaatt caccacctcc cggtaccata taaacctctg attaaacatg gcgccatcca 6240
ccaccatcct aaaccagctg gccaaaacct gcccgccggc tatacactgc agggaaccgg 6300
gactggaaca atgacagtgg agagcccagg actcgtaacc atggatcatc atgctcgtca 6360
tgatatcaat gttggcacaa cacaggcaca cgtgcataca cttcctcagg attacaagct 6420
cctcccgcgt tagaaccata tcccagggaa caacccattc ctgaatcagc gtaaatccca 6480
cactgcaggg aagacctcgc acgtaactca cgttgtgcat tgtcaaagtg ttacattcgg 6540
gcagcagcgg atgatcctcc agtatggtag cgcgggtttc tgtctcaaaa ggaggtagac 6600
gatccctact gtacggagtg cgccgagaca accgagatcg tgttggtcgt agtgtcatgc 6660
caaatggaac gccggacgta gtcatatttc ctgaagcaaa accaggtgcg ggcgtgacaa 6720
acagatctgc gtctccggtc tcgccgctta gatcgctctg tgtagtagtt gtagtatatc 6780
cactctctca aagcatccag gcgccccctg gcttcgggtt ctatgtaaac tccttcatgc 6840
gccgctgccc tgataacatc caccaccgca gaataagcca cacccagcca acctacacat 6900
tcgttctgcg agtcacacac gggaggagcg ggaagagctg gaagaaccat gttttttttt 6960
ttattccaaa agattatcca aaacctcaaa atgaagatct attaagtgaa cgcgctcccc 7020
tccggtggcg tggtcaaact ctacagccaa agaacagata atggcatttg taagatgttg 7080
cacaatggct tccaaaaggc aaacggccct cacgtccaag tggacgtaaa ggctaaaccc 7140
ttcagggtga atctcctcta taaacattcc agcaccttca accatgccca aataattctc 7200
atctcgccac cttctcaata tatctctaag caaatcccga atattaagtc cggccattgt 7260
aaaaatctgc tccagagcgc cctccacctt cagcctcaag cagcgaatca tgattgcaaa 7320
aattcaggtt cctcacagac ctgtataaga ttcaaaagcg gaacattaac aaaaataccg 7380
cgatcccgta ggtcccttcg cagggccagc tgaacataat cgtgcaggtc tgcacggacc 7440
agcgcggcca cttccccgcc aggaaccatg acaaaagaac ccacactgat tatgacacgc 7500
atactcggag ctatgctaac cagcgtagcc ccgatgtaag cttgttgcat gggcggcgat 7560
ataaaatgca aggtgctgct caaaaaatca ggcaaagcct cgcgcaaaaa agaaagcaca 7620
tcgtagtcat gctcatgcag ataaaggcag gtaagctccg gaaccaccac agaaaaagac 7680
accatttttc tctcaaacat gtctgcgggt ttctgcataa acacaaaata aaataacaaa 7740
aaaacattta aacattagaa gcctgtctta caacaggaaa aacaaccctt ataagcataa 7800
gacggactac ggccatgccg gcgtgaccgt aaaaaaactg gtcaccgtga ttaaaaagca 7860
ccaccgacag ctcctcggtc atgtccggag tcataatgta agactcggta aacacatcag 7920
gttgattcac atcggtcagt gctaaaaagc gaccgaaata gcccggggga atacataccc 7980
gcaggcgtag agacaacatt acagccccca taggaggtat aacaaaatta ataggagaga 8040
aaaacacata aacacctgaa aaaccctcct gcctaggcaa aatagcaccc tcccgctcca 8100
gaacaacata cagcgcttcc acagcggcag ccataacagt cagccttacc agtaaaaaag 8160
aaaacctatt aaaaaaacac cactcgacac ggcaccagct caatcagtca cagtgtaaaa 8220
aagggccaag tgcagagcga gtatatatag gactaaaaaa tgacgtaacg gttaaagtcc 8280
acaaaaaaca cccagaaaac cgcacgcgaa cctacgccca gaaacgaaag ccaaaaaacc 8340
cacaacttcc tcaaatcgtc acttccgttt tcccacgtta cgtcacttcc cattttaaga 8400
aaactacaat tcccaacaca tacaagttac tccgccctta attaaatcgg atccgatatc 8460
tagatgtatt cgcgaggtac cgagctcgaa ttctctggcc gtcgttttac aacgtcgtga 8520
ctgggaaaac cctggcgtta cccaacttaa tcgccttgca gcacatcccc ctttcgccag 8580
ctggcgtaat agcgaagagg cccgcaccga tcgcccttcc caacagttgc gcagcctgaa 8640
tggcgaatgg cgcctgatgc ggtattttct ccttacgcat ctgtgcggta tttcacaccg 8700
catatggtgc actctcagta caatctgctc tgatgccgca tagttaagcc agccccgaca 8760
cccgccaaca cccgctgacg cgccctgacg ggcttgtctg ctcccggcat ccgcttacag 8820
acaagctgtg accgtctccg ggagctgcat gtgtcagagg ttttcaccgt catcaccgaa 8880
acgcgcga 8888
<210> 46
<211> 7989
<212> DNA
<213> Artificial Sequence
<220>
<223> Description of Artificial Sequence: Synthetic
polynucleotide
<400> 46
tgcagctctg gcccgtgtct caaaatctct gatgttacat tgcacaagat aaaaatatat 60
catcatgaac aataaaactg tctgcttaca taaacagtaa tacaaggggt gttatgagcc 120
atattcaacg ggaaacgtcg aggccgcgat taaattccaa catggatgct gatttatatg 180
ggtataaatg ggctcgcgat aatgtcgggc aatcaggtgc gacaatctat cgcttgtatg 240
ggaagcccga tgcgccagag ttgtttctga aacatggcaa aggtagcgtt gccaatgatg 300
ttacagatga gatggtcaga ctaaactggc tgacggaatt tatgcctctt ccgaccatca 360
agcattttat ccgtactcct gatgatgcat ggttactcac cactgcgatc cccggaaaaa 420
cagcattcca ggtattagaa gaatatcctg attcaggtga aaatattgtt gatgcgctgg 480
cagtgttcct gcgccggttg cattcgattc ctgtttgtaa ttgtcctttt aacagcgatc 540
gcgtatttcg tctcgctcag gcgcaatcac gaatgaataa cggtttggtt gatgcgagtg 600
attttgatga cgagcgtaat ggctggcctg ttgaacaagt ctggaaagaa atgcataaac 660
ttttgccatt ctcaccggat tcagtcgtca ctcatggtga tttctcactt gataacctta 720
tttttgacga ggggaaatta ataggttgta ttgatgttgg acgagtcgga atcgcagacc 780
gataccagga tcttgccatc ctatggaact gcctcggtga gttttctcct tcattacaga 840
aacggctttt tcaaaaatat ggtattgata atcctgatat gaataaattg cagtttcatt 900
tgatgctcga tgagtttttc taatcagaat tggttaattg gttgtaacat tattcagatt 960
gggcttgatt taaaacttca tttttaattt aaaaggatct aggtgaagat cctttttgat 1020
aatctcatga ccaaaatccc ttaacgtgag ttttcgttcc actgagcgtc agaccccgta 1080
gaaaagatca aaggatcttc ttgagatcct ttttttctgc gcgtaatctg ctgcttgcaa 1140
acaaaaaaac caccgctacc agcggtggtt tgtttgccgg atcaagagct accaactctt 1200
tttccgaagg taactggctt cagcagagcg cagataccaa atactgttct tctagtgtag 1260
ccgtagttag gccaccactt caagaactct gtagcaccgc ctacatacct cgctctgcta 1320
atcctgttac cagtggctgc tgccagtggc gataagtcgt gtcttaccgg gttggactca 1380
agacgatagt taccggataa ggcgcagcgg tcgggctgaa cggggggttc gtgcacacag 1440
cccagcttgg agcgaacgac ctacaccgaa ctgagatacc tacagcgtga gctatgagaa 1500
agcgccacgc ttcccgaagg gagaaaggcg gacaggtatc cggtaagcgg cagggtcgga 1560
acaggagagc gcacgaggga gcttccaggg ggaaacgcct ggtatcttta tagtcctgtc 1620
gggtttcgcc acctctgact tgagcgtcga tttttgtgat gctcgtcagg ggggcggagc 1680
ctatggaaaa acgccagcaa cgcggccttt ttacggttcc tggccttttg ctggcctttt 1740
gctcacatgt tctttcctgc gttatcccct gattctgtgg ataaccgtat taccgccttt 1800
gagtgagctg ataccgctcg ccgcagccga acgaccgagc gcagcgagtc agtgagcgag 1860
gaagcggaag agcgcccaat acgcaaaccg cctctccccg cgcgttggcc gattcattaa 1920
tgcagctggc acgacaggtt tcccgactgg aaagcgggca gtgagcgcaa cgcaattaat 1980
gtgagttagc tcactcatta ggcaccccag gctttacact ttatgcttcc ggctcgtatg 2040
ttgtgtggaa ttgtgagcgg ataacaattt cacacaggaa acagctatga ccatgattac 2100
accaagcttg catgcaggcc tatccgtaga tgtacctgga catccaggtg atgccggcgg 2160
cggtggtgga ggcgcgcgga aagtcgcgga cgcggttcca gatgttgcgc agcggcaaaa 2220
agtgctccat ggtcgggacg ctctggccgg tgaggcgtgc gcagtcgttg acgctctaga 2280
ccgtgcaaaa ggagagcctg taagcgggca ctcttccgtg gtctggtgga taaattcgca 2340
agggtatcat ggcggacgac cggggttcga accccggatc cggccgtccg ccgtgatcca 2400
tgcggttacc gcccgcgtgt cgaacccagg tgtgcgacgt cagacaacgg gggagcgctc 2460
cttttggctt ccttccaggc gcggcggctg ctgcgctagc ttttttggcc actggccgcg 2520
cgcggcgtaa gcggttaggc tggaaagcga aagcattaag tggctcgctc cctgtagccg 2580
gagggttatt ttccaagggt tgagtcgcag gacccccggt tcgagtctcg ggccggccgg 2640
actgcggcga acgggggttt gcctccccgt catgcaagac cccgcttgca aattcctccg 2700
gaaacaggga cgagcccctt ttttgctttt cccagatgca tccggtgctg cggcagatgc 2760
gcccccctcc tcagcagcgg caagagcaag agcagcggca gacatgcagg gcaccctccc 2820
cttctcctac cgcgtcagga ggggcaacat ctgtacactc tcgggtgatt atttaccccc 2880
acccttgccg tctgcgccgt ttaaaaatca aaggggttct gccgcgcatc gctatgcgcc 2940
actggcaggg acacgttgcg atactggtgt ttagtgctcc acttaaactc aggcacaacc 3000
atccgcggca gctcggtgaa gttttcactc cacaggctgc gcaccatcac caacgcgttt 3060
agcaggtcgg gcgccgatat cttgaagtcg cagttggggc ctccgccctg cgcgcgcgag 3120
ttgcgataca cagggttgca gcactggaac actatcagcg ccgggtggtg cacgctggcc 3180
agcacgctct tgtcggagat cagatccgcg tccaggtcct ccgcgttgct cagggcgaac 3240
ggagtcaact ttggtagctg ccttcccaaa aagggcgcgt gcccaggctt tgagttgcac 3300
tcgcaccgta gtggcatcaa aaggtgaccg tgcccggtct gggcgttagg atacagcgcc 3360
tgcataaaag ccttgatctg cttaaaagcc acctgagcct ttgcgccttc agagaagaac 3420
atgccgcaag acttgccgga aaactgattg gccggacagg ccgcgtcgtg cacgcagcac 3480
cttgcgtcgg tgttggagat ctgcaccaca tttcggcccc accggttctt cacgatcttg 3540
gccttgctag actgctcctt cagcgcgcgc tgcccgtttt cgctcgtcac atccatttca 3600
atcacgtgct ccttatttat cataatgctt ccgtgtagac acttaagctc gccttcgatc 3660
tcagcgcagc ggtgcagcca caacgcgcag cccgtgggct cgtgatgctt gtaggtcacc 3720
tctgcaaacg actgcaggta cgcctgcagg aatcgcccca tcatcgtcac aaaggtcttg 3780
ttgctggtga aggtcagctg caacccgcgg tgctcctcgt tcagccaggt cttgcatacg 3840
gccgccagag cttccacttg gtcaggcagt agtttgaagt tcgcctttag atcgttatcc 3900
acgtggtact tgtccatcag cgcgcgcgca gcctccatgc ccttctccca cgcagacacg 3960
atcggcacac tcagcgggtt catcaccgta atttcacttt ccgcttcgct gggctcttcc 4020
tcttcctctt gcgtccgcat accacgcgcc actgggtcgt cttcattcag ccgccgcact 4080
gtgcgcttac ctcctttgcc atgcttgatt agcaccggtg ggttgctgaa acccaccatt 4140
tgtagcgcca catcttctct ttcttcctcg ctgtccacga ttacctctgg tgatggcggg 4200
cgctcgggct tgggagaagg gcgcttcttt ttcttcttgg gcgcaatggc caaatccgcc 4260
gccgaggtcg atggccgcgg gctgggtgtg cgcggcacca gcgcgtcttg tgatgagtct 4320
tcctcgtcct cggactcgat acgccgcctc atccgctttt ttgggggcgc ccggggaggc 4380
ggcggcgacg gggacgggga cgacacgtcc tccatggttg ggggacgtcg cgccgcaccg 4440
cgtccgcgct cgggggtggt ttcgcgctgc tcctcttccc gactggccat ttccttctcc 4500
tataggcaga aaaagatcca caaaagcgaa gatcagcttc ggcgcacgct ggaagacgcg 4560
gaggctctct tcagtaaata ctgcgcgctg actcttaagg actagtttcg cgccctttct 4620
caaatttaag cgcgaaaact acgtcatctc cagcggccac acccggcgcc agcacctgtt 4680
gtcagcgcca ttggcgcgcc ggccggccga atatcttcat ttaaatgttt aaacatcgat 4740
gcggccgcaa cttgtttatt gcagcttata atggttacaa ataaagcaat agcatcacaa 4800
atttcacaaa taaagcattt ttttcactgc attctagttg tggtttgtcc aaactcatca 4860
atgtatctta gcttaacggg cggcgaagga gaagtccacg cctacatggg ggtagagtca 4920
taatcgtgca tcaggatagg gcggtggtgc tgcagcagcg cgcgaataaa ctgctgccgc 4980
cgccgctccg tcctgcagga atacaacatg gcagtggtct cctcagcgat gattcgcacc 5040
gcccgcagca taaggcgcct tgtcctccgg gcacagcagc gcaccctgat ctcacttaaa 5100
tcagcacagt aactgcagca cagcaccaca atattgttca aaatcccaca gtgcaaggcg 5160
ctgtatccaa agctcatggc ggggaccaca gaacccacgt ggccatcata ccacaagcgc 5220
aggtagatta agtggcgacc cctcataaac acgctggaca taaacattac ctcttttggc 5280
atgttgtaat tcaccacctc ccggtaccat ataaacctct gattaaacat ggcgccatcc 5340
accaccatcc taaaccagct ggccaaaacc tgcccgccgg ctatacactg cagggaaccg 5400
ggactggaac aatgacagtg gagagcccag gactcgtaac catggatcat catgctcgtc 5460
atgatatcaa tgttggcaca acacaggcac acgtgcatac acttcctcag gattacaagc 5520
tcctcccgcg ttagaaccat atcccaggga acaacccatt cctgaatcag cgtaaatccc 5580
acactgcagg gaagacctcg cacgtaactc acgttgtgca ttgtcaaagt gttacattcg 5640
ggcagcagcg gatgatcctc cagtatggta gcgcgggttt ctgtctcaaa aggaggtaga 5700
cgatccctac tgtacggagt gcgccgagac aaccgagatc gtgttggtcg tagtgtcatg 5760
ccaaatggaa cgccggacgt agtcatattt cctgaagcaa aaccaggtgc gggcgtgaca 5820
aacagatctg cgtctccggt ctcgccgctt agatcgctct gtgtagtagt tgtagtatat 5880
ccactctctc aaagcatcca ggcgccccct ggcttcgggt tctatgtaaa ctccttcatg 5940
cgccgctgcc ctgataacat ccaccaccgc agaataagcc acacccagcc aacctacaca 6000
ttcgttctgc gagtcacaca cgggaggagc gggaagagct ggaagaacca tgtttttttt 6060
tttattccaa aagattatcc aaaacctcaa aatgaagatc tattaagtga acgcgctccc 6120
ctccggtggc gtggtcaaac tctacagcca aagaacagat aatggcattt gtaagatgtt 6180
gcacaatggc ttccaaaagg caaacggccc tcacgtccaa gtggacgtaa aggctaaacc 6240
cttcagggtg aatctcctct ataaacattc cagcaccttc aaccatgccc aaataattct 6300
catctcgcca ccttctcaat atatctctaa gcaaatcccg aatattaagt ccggccattg 6360
taaaaatctg ctccagagcg ccctccacct tcagcctcaa gcagcgaatc atgattgcaa 6420
aaattcaggt tcctcacaga cctgtataag attcaaaagc ggaacattaa caaaaatacc 6480
gcgatcccgt aggtcccttc gcagggccag ctgaacataa tcgtgcaggt ctgcacggac 6540
cagcgcggcc acttccccgc caggaaccat gacaaaagaa cccacactga ttatgacacg 6600
catactcgga gctatgctaa ccagcgtagc cccgatgtaa gcttgttgca tgggcggcga 6660
tataaaatgc aaggtgctgc tcaaaaaatc aggcaaagcc tcgcgcaaaa aagaaagcac 6720
atcgtagtca tgctcatgca gataaaggca ggtaagctcc ggaaccacca cagaaaaaga 6780
caccattttt ctctcaaaca tgtctgcggg tttctgcata aacacaaaat aaaataacaa 6840
aaaaacattt aaacattaga agcctgtctt acaacaggaa aaacaaccct tataagcata 6900
agacggacta cggccatgcc ggcgtgaccg taaaaaaact ggtcaccgtg attaaaaagc 6960
accaccgaca gctcctcggt catgtccgga gtcataatgt aagactcggt aaacacatca 7020
ggttgattca catcggtcag tgctaaaaag cgaccgaaat agcccggggg aatacatacc 7080
cgcaggcgta gagacaacat tacagccccc ataggaggta taacaaaatt aataggagag 7140
aaaaacacat aaacacctga aaaaccctcc tgcctaggca aaatagcacc ctcccgctcc 7200
agaacaacat acagcgcttc cacagcggca gccataacag tcagccttac cagtaaaaaa 7260
gaaaacctat taaaaaaaca ccactcgaca cggcaccagc tcaatcagtc acagtgtaaa 7320
aaagggccaa gtgcagagcg agtatatata ggactaaaaa atgacgtaac ggttaaagtc 7380
cacaaaaaac acccagaaaa ccgcacgcga acctacgccc agaaacgaaa gccaaaaaac 7440
ccacaacttc ctcaaatcgt cacttccgtt ttcccacgtt acgtcacttc ccattttaag 7500
aaaactacaa ttcccaacac atacaagtta ctccgccctt aattaaatcg gatccgatat 7560
ctagatgtat tcgcgaggta ccgagctcga attctctggc cgtcgtttta caacgtcgtg 7620
actgggaaaa ccctggcgtt acccaactta atcgccttgc agcacatccc cctttcgcca 7680
gctggcgtaa tagcgaagag gcccgcaccg atcgcccttc ccaacagttg cgcagcctga 7740
atggcgaatg gcgcctgatg cggtattttc tccttacgca tctgtgcggt atttcacacc 7800
gcatatggtg cactctcagt acaatctgct ctgatgccgc atagttaagc cagccccgac 7860
acccgccaac acccgctgac gcgccctgac gggcttgtct gctcccggca tccgcttaca 7920
gacaagctgt gaccgtctcc gggagctgca tgtgtcagag gttttcaccg tcatcaccga 7980
aacgcgcga 7989
<210> 47
<211> 8985
<212> DNA
<213> Artificial Sequence
<220>
<223> Description of Artificial Sequence: Synthetic
polynucleotide
<400> 47
tgcagctctg gcccgtgtct caaaatctct gatgttacat tgcacaagat aaaaatatat 60
catcatgaac aataaaactg tctgcttaca taaacagtaa tacaaggggt gttatgagcc 120
atattcaacg ggaaacgtcg aggccgcgat taaattccaa catggatgct gatttatatg 180
ggtataaatg ggctcgcgat aatgtcgggc aatcaggtgc gacaatctat cgcttgtatg 240
ggaagcccga tgcgccagag ttgtttctga aacatggcaa aggtagcgtt gccaatgatg 300
ttacagatga gatggtcaga ctaaactggc tgacggaatt tatgcctctt ccgaccatca 360
agcattttat ccgtactcct gatgatgcat ggttactcac cactgcgatc cccggaaaaa 420
cagcattcca ggtattagaa gaatatcctg attcaggtga aaatattgtt gatgcgctgg 480
cagtgttcct gcgccggttg cattcgattc ctgtttgtaa ttgtcctttt aacagcgatc 540
gcgtatttcg tctcgctcag gcgcaatcac gaatgaataa cggtttggtt gatgcgagtg 600
attttgatga cgagcgtaat ggctggcctg ttgaacaagt ctggaaagaa atgcataaac 660
ttttgccatt ctcaccggat tcagtcgtca ctcatggtga tttctcactt gataacctta 720
tttttgacga ggggaaatta ataggttgta ttgatgttgg acgagtcgga atcgcagacc 780
gataccagga tcttgccatc ctatggaact gcctcggtga gttttctcct tcattacaga 840
aacggctttt tcaaaaatat ggtattgata atcctgatat gaataaattg cagtttcatt 900
tgatgctcga tgagtttttc taatcagaat tggttaattg gttgtaacat tattcagatt 960
gggcttgatt taaaacttca tttttaattt aaaaggatct aggtgaagat cctttttgat 1020
aatctcatga ccaaaatccc ttaacgtgag ttttcgttcc actgagcgtc agaccccgta 1080
gaaaagatca aaggatcttc ttgagatcct ttttttctgc gcgtaatctg ctgcttgcaa 1140
acaaaaaaac caccgctacc agcggtggtt tgtttgccgg atcaagagct accaactctt 1200
tttccgaagg taactggctt cagcagagcg cagataccaa atactgttct tctagtgtag 1260
ccgtagttag gccaccactt caagaactct gtagcaccgc ctacatacct cgctctgcta 1320
atcctgttac cagtggctgc tgccagtggc gataagtcgt gtcttaccgg gttggactca 1380
agacgatagt taccggataa ggcgcagcgg tcgggctgaa cggggggttc gtgcacacag 1440
cccagcttgg agcgaacgac ctacaccgaa ctgagatacc tacagcgtga gctatgagaa 1500
agcgccacgc ttcccgaagg gagaaaggcg gacaggtatc cggtaagcgg cagggtcgga 1560
acaggagagc gcacgaggga gcttccaggg ggaaacgcct ggtatcttta tagtcctgtc 1620
gggtttcgcc acctctgact tgagcgtcga tttttgtgat gctcgtcagg ggggcggagc 1680
ctatggaaaa acgccagcaa cgcggccttt ttacggttcc tggccttttg ctggcctttt 1740
gctcacatgt tctttcctgc gttatcccct gattctgtgg ataaccgtat taccgccttt 1800
gagtgagctg ataccgctcg ccgcagccga acgaccgagc gcagcgagtc agtgagcgag 1860
gaagcggaag agcgcccaat acgcaaaccg cctctccccg cgcgttggcc gattcattaa 1920
tgcagctggc acgacaggtt tcccgactgg aaagcgggca gtgagcgcaa cgcaattaat 1980
gtgagttagc tcactcatta ggcaccccag gctttacact ttatgcttcc ggctcgtatg 2040
ttgtgtggaa ttgtgagcgg ataacaattt cacacaggaa acagctatga ccatgattac 2100
accaagcttg catgcaggcc tctgcagtcg accagaagca ccatgtcctt gggtccggcc 2160
tgctgaatgc gcaggcggtc ggccatgccc caggcttcgt tttgacatcg gcgcaggtct 2220
ttgtagtagt cttgcatgag cctttctacc ggcacttctt cttctccttc ctcttgtcct 2280
gcatctcttg catctatcgc tgcggcggcg gcggagtttg gccgtaggtg gcgccctctt 2340
cctcccatgc gtgtgacccc gaagcccctc atcggctgaa gcagggctag gtcggcgaca 2400
acgcgctcgg ctaatatggc ctgctgcacc tgcgtgaggg tagactggaa gtcatccatg 2460
tccacaaagc ggtggtatgc gcccgtgttg atggtgtaag tgcagttggc cataacggac 2520
cagttaacgg tctggtgacc cggctgcgag agctcggtgt acctgagacg cgagtaagcc 2580
ctcgagtcaa atacgtagtc gttgcaagtc cgcaccaggt actggtatcc caccaaaaag 2640
tgcggcggcg gctggcggta gaggggccag cgtagggtgg ccggggctcc gggggcgaga 2700
tcttccaaca taaggcgatg atatccgtag atgtacctgg acatccaggt gatgccggcg 2760
gcggtggtgg aggcgcgcgg aaagtcgcgg acgcggttcc agatgttgcg cagcggcaaa 2820
aagtgctcca tggtcgggac gctctggccg gtcaggcgcg cgcaatcgtt gacgctctag 2880
cgtgcaaaag gagagcctgt aagcgggcac tcttccgtgg tctggtggat aaattcgcaa 2940
gggtatcatg gcggacgacc ggggttcgag ccccgtatcc ggccgtccgc cgtgatccat 3000
gcggttaccg cccgcgtgtc gaacccaggt gtgcgacgtc agacaacggg ggagtgctcc 3060
ttttggcttc cttccaggcg cggcggctgc tgcgctagct tttttggcca ctggccgcgc 3120
gcagcgtaag cggttaggct ggaaagcgaa agcattaagt ggctcgctcc ctgtagccgg 3180
agggttattt tccaagggtt gagtcgcggg acccccggtt cgagtctcgg accgagactg 3240
ggggcgtaca ctggatggcc tttgcctgga acccgcactc aaaaacatgc tacctctttg 3300
agccctttgg cttttctgac cagcgactca agcaggttta ccagtttgag tacgagtcac 3360
tcctgcgccg tagcgccatt gcttcttccc ccgaccgctg tataacgctg gaaaagtcca 3420
cccaaagcgt acaggggccc aactcggccg cctgtggact attctgctgc atgtttctcc 3480
acgcctttgc caactggccc caaactccca tggatcacaa ccccaccatg aaccttatta 3540
ccggggtacc caactccatg ctcaacagtc cccaggtaca gcccaccctg cgtcgcaacc 3600
aggaacagct ctacagcttc ctggagcgcc actcgcccta cttccgcagc cacagtgcgc 3660
agattaggag cgccacttct ttttgtcact tgaaaaacat gtaaaaataa tgtactagag 3720
acactttcaa taaaggcaaa tgcttttatt tgtacactct cgggtgatta tttaccccca 3780
cccttgccgt ctgcgccgtt taaaaatcaa aggggttctg ccgcgcatcg ctatgcgcca 3840
ctggcaggga cacgttgcga tactggtgtt tagtgctcca cttaaactca ggcacaacca 3900
tccgcggcag ctcggtgaag ttttcactcc acaggctgcg caccatcacc aacgcgttta 3960
gcaggtcggg cgccgatatc ttgaagtcgc agttggggcc tccgccctgc gcgcgcgagt 4020
tgcgatacac agggttgcag cactggaaca ctatcagcgc cgggtggtgc acgctggcca 4080
gcacgctctt gtcggagatc agatccgcgt ccaggtcctc cgcgttgctc agggcgaacg 4140
gagtcaactt tggtagctgc cttcccaaaa agggcgcgtg cccaggcttt gagttgcact 4200
cgcaccgtag tggcatcaaa aggtgaccgt gcccggtctg ggcgttagga tacagcgcct 4260
gcataaaagc cttgatctgc ttaaaagcca cctgagcctt tgcgccttca gagaagaaca 4320
tgccgcaaga cttgccggaa aactgattgg ccggacaggc cgcgtcgtgc acgcagcacc 4380
ttgcgtcggt gttggagatc tgcaccacat ttcggcccca ccggttcttc acgatcttgg 4440
ccttgctaga ctgctccttc agcgcgcgct gcccgttttc gctcgtcaca tccatttcaa 4500
tcacgtgctc cttatttatc ataatgcttc cgtgtagaca cttaagctcg ccttcgatct 4560
cagcgcagcg gtgcagccac aacgcgcagc ccgtgggctc gtgatgcttg taggtcacct 4620
ctgcaaacga ctgcaggtac gcctgcagga atcgccccat catcgtcaca aaggtcttgt 4680
tgctggtgaa ggtcagctgc aacccgcggt gctcctcgtt cagccaggtc ttgcatacgg 4740
ccgccagagc ttccacttgg tcaggcagta gtttgaagtt cgcctttaga tcgttatcca 4800
cgtggtactt gtccatcagc gcgcgcgcag cctccatgcc cttctcccac gcagacacga 4860
tcggcacact cagcgggttc atcaccgtaa tttcactttc cgcttcgctg ggctcttcct 4920
cttcctcttg cgtccgcata ccacgcgcca ctgggtcgtc ttcattcagc cgccgcactg 4980
tgcgcttacc tcctttgcca tgcttgatta gcaccggtgg gttgctgaaa cccaccattt 5040
gtagcgccac atcttctctt tcttcctcgc tgtccacgat tacctctggt gatggcgggc 5100
gctcgggctt gggagaaggg cgcttctttt tcttcttggg cgcaatggcc aaatccgccg 5160
ccgaggtcga tggccgcggg ctgggtgtgc gcggcaccag cgcgtcttgt gatgagtctt 5220
cctcgtcctc ggactcgata cgccgcctca tccgcttttt tgggggcgcc cggggaggcg 5280
gcggcgacgg ggacggggac gacacgtcct ccatggttgg gggacgtcgc gccgcaccgc 5340
gtccgcgctc gggggtggtt tcgcgctgct cctcttcccg actggccatt tccttctcct 5400
ataggcagaa aaagatccac aaaagcgaag atcagcttcg gcgcacgctg gaagacgcgg 5460
aggctctctt cagtaaatac tgcgcgctga ctcttaagga ctagtttcgc gccctttctc 5520
aaatttaagc gcgaaaacta cgtcatctcc agcggccaca cccggcgcca gcacctgttg 5580
tcagcgccat tggcgcgccg gccggccgaa tatcttcatt taaatgttta aacatcgatg 5640
cggccgcaac ttgtttattg cagcttataa tggttacaaa taaagcaata gcatcacaaa 5700
tttcacaaat aaagcatttt tttcactgca ttctagttgt ggtttgtcca aactcatcaa 5760
tgtatcttag cttaacgggc ggcgaaggag aagtccacgc ctacatgggg gtagagtcat 5820
aatcgtgcat caggataggg cggtggtgct gcagcagcgc gcgaataaac tgctgccgcc 5880
gccgctccgt cctgcaggaa tacaacatgg cagtggtctc ctcagcgatg attcgcaccg 5940
cccgcagcat aaggcgcctt gtcctccggg cacagcagcg caccctgatc tcacttaaat 6000
cagcacagta actgcagcac agcaccacaa tattgttcaa aatcccacag tgcaaggcgc 6060
tgtatccaaa gctcatggcg gggaccacag aacccacgtg gccatcatac cacaagcgca 6120
ggtagattaa gtggcgaccc ctcataaaca cgctggacat aaacattacc tcttttggca 6180
tgttgtaatt caccacctcc cggtaccata taaacctctg attaaacatg gcgccatcca 6240
ccaccatcct aaaccagctg gccaaaacct gcccgccggc tatacactgc agggaaccgg 6300
gactggaaca atgacagtgg agagcccagg actcgtaacc atggatcatc atgctcgtca 6360
tgatatcaat gttggcacaa cacaggcaca cgtgcataca cttcctcagg attacaagct 6420
cctcccgcgt tagaaccata tcccagggaa caacccattc ctgaatcagc gtaaatccca 6480
cactgcaggg aagacctcgc acgtaactca cgttgtgcat tgtcaaagtg ttacattcgg 6540
gcagcagcgg atgatcctcc agtatggtag cgcgggtttc tgtctcaaaa ggaggtagac 6600
gatccctact gtacggagtg cgccgagaca accgagatcg tgttggtcgt agtgtcatgc 6660
caaatggaac gccggacgta gtcatatttc ctgaagcaaa accaggtgcg ggcgtgacaa 6720
acagatctgc gtctccggtc tcgccgctta gatcgctctg tgtagtagtt gtagtatatc 6780
cactctctca aagcatccag gcgccccctg gcttcgggtt ctatgtaaac tccttcatgc 6840
gccgctgccc tgataacatc caccaccgca gaataagcca cacccagcca acctacacat 6900
tcgttctgcg agtcacacac gggaggagcg ggaagagctg gaagaaccat gttttttttt 6960
ttattccaaa agattatcca aaacctcaaa atgaagatct attaagtgaa cgcgctcccc 7020
tccggtggcg tggtcaaact ctacagccaa agaacagata atggcatttg taagatgttg 7080
cacaatggct tccaaaaggc aaacggccct cacgtccaag tggacgtaaa ggctaaaccc 7140
ttcagggtga atctcctcta taaacattcc agcaccttca accatgccca aataattctc 7200
atctcgccac cttctcaata tatctctaag caaatcccga atattaagtc cggccattgt 7260
aaaaatctgc tccagagcgc cctccacctt cagcctcaag cagcgaatca tgattgcaaa 7320
aattcaggtt cctcacagac ctgtataaga ttcaaaagcg gaacattaac aaaaataccg 7380
cgatcccgta ggtcccttcg cagggccagc tgaacataat cgtgcaggtc tgcacggacc 7440
agcgcggcca cttccccgcc aggaaccatg acaaaagaac ccacactgat tatgacacgc 7500
atactcggag ctatgctaac cagcgtagcc ccgatgtaag cttgttgcat gggcggcgat 7560
ataaaatgca aggtgctgct caaaaaatca ggcaaagcct cgcgcaaaaa agaaagcaca 7620
tcgtagtcat gctcatgcag ataaaggcag gtaagctccg gaaccaccac agaaaaagac 7680
accatttttc tctcaaacat gtctgcgggt ttctgcataa acacaaaata aaataacaaa 7740
aaaacattta aacattagaa gcctgtctta caacaggaaa aacaaccctt ataagcataa 7800
gacggactac ggccatgccg gcgtgaccgt aaaaaaactg gtcaccgtga ttaaaaagca 7860
ccaccgacag ctcctcggtc atgtccggag tcataatgta agactcggta aacacatcag 7920
gttgattcac atcggtcagt gctaaaaagc gaccgaaata gcccggggga atacataccc 7980
gcaggcgtag agacaacatt acagccccca taggaggtat aacaaaatta ataggagaga 8040
aaaacacata aacacctgaa aaaccctcct gcctaggcaa aatagcaccc tcccgctcca 8100
gaacaacata cagcgcttcc acagcggcag ccatggtggc atttgcaaaa gcctaggcct 8160
ccaaaaaagc ctcctcacta cttctggaat agctcagagg ccgaggcggc ctcggcctct 8220
gcataaataa aaaaaattag tcagccatgg ggcggagaat gggcggaact gggcggagtt 8280
aggggcggga tgggcggagt taggggcggg actatggttg ctgactaatt gagatgcatg 8340
ctttgcatac ttctgcctgc tggggagcct ggggactttc cacacctggt tgctgactaa 8400
ttgagatgca tgctttgcat acttctgcct gctggggagc ctggggactt tccacaccct 8460
aactgacaca cacgttacgt cacttcccat tttaagaaaa ctacaattcc caacacatac 8520
aagttactcc gcccttaatt aaatcggatc cgatatctag atgtattcgc gaggtaccga 8580
gctcgaattc tctggccgtc gttttacaac gtcgtgactg ggaaaaccct ggcgttaccc 8640
aacttaatcg ccttgcagca catccccctt tcgccagctg gcgtaatagc gaagaggccc 8700
gcaccgatcg cccttcccaa cagttgcgca gcctgaatgg cgaatggcgc ctgatgcggt 8760
attttctcct tacgcatctg tgcggtattt cacaccgcat atggtgcact ctcagtacaa 8820
tctgctctga tgccgcatag ttaagccagc cccgacaccc gccaacaccc gctgacgcgc 8880
cctgacgggc ttgtctgctc ccggcatccg cttacagaca agctgtgacc gtctccggga 8940
gctgcatgtg tcagaggttt tcaccgtcat caccgaaacg cgcga 8985
<210> 48
<211> 8086
<212> DNA
<213> Artificial Sequence
<220>
<223> Description of Artificial Sequence: Synthetic
polynucleotide
<400> 48
tgcagctctg gcccgtgtct caaaatctct gatgttacat tgcacaagat aaaaatatat 60
catcatgaac aataaaactg tctgcttaca taaacagtaa tacaaggggt gttatgagcc 120
atattcaacg ggaaacgtcg aggccgcgat taaattccaa catggatgct gatttatatg 180
ggtataaatg ggctcgcgat aatgtcgggc aatcaggtgc gacaatctat cgcttgtatg 240
ggaagcccga tgcgccagag ttgtttctga aacatggcaa aggtagcgtt gccaatgatg 300
ttacagatga gatggtcaga ctaaactggc tgacggaatt tatgcctctt ccgaccatca 360
agcattttat ccgtactcct gatgatgcat ggttactcac cactgcgatc cccggaaaaa 420
cagcattcca ggtattagaa gaatatcctg attcaggtga aaatattgtt gatgcgctgg 480
cagtgttcct gcgccggttg cattcgattc ctgtttgtaa ttgtcctttt aacagcgatc 540
gcgtatttcg tctcgctcag gcgcaatcac gaatgaataa cggtttggtt gatgcgagtg 600
attttgatga cgagcgtaat ggctggcctg ttgaacaagt ctggaaagaa atgcataaac 660
ttttgccatt ctcaccggat tcagtcgtca ctcatggtga tttctcactt gataacctta 720
tttttgacga ggggaaatta ataggttgta ttgatgttgg acgagtcgga atcgcagacc 780
gataccagga tcttgccatc ctatggaact gcctcggtga gttttctcct tcattacaga 840
aacggctttt tcaaaaatat ggtattgata atcctgatat gaataaattg cagtttcatt 900
tgatgctcga tgagtttttc taatcagaat tggttaattg gttgtaacat tattcagatt 960
gggcttgatt taaaacttca tttttaattt aaaaggatct aggtgaagat cctttttgat 1020
aatctcatga ccaaaatccc ttaacgtgag ttttcgttcc actgagcgtc agaccccgta 1080
gaaaagatca aaggatcttc ttgagatcct ttttttctgc gcgtaatctg ctgcttgcaa 1140
acaaaaaaac caccgctacc agcggtggtt tgtttgccgg atcaagagct accaactctt 1200
tttccgaagg taactggctt cagcagagcg cagataccaa atactgttct tctagtgtag 1260
ccgtagttag gccaccactt caagaactct gtagcaccgc ctacatacct cgctctgcta 1320
atcctgttac cagtggctgc tgccagtggc gataagtcgt gtcttaccgg gttggactca 1380
agacgatagt taccggataa ggcgcagcgg tcgggctgaa cggggggttc gtgcacacag 1440
cccagcttgg agcgaacgac ctacaccgaa ctgagatacc tacagcgtga gctatgagaa 1500
agcgccacgc ttcccgaagg gagaaaggcg gacaggtatc cggtaagcgg cagggtcgga 1560
acaggagagc gcacgaggga gcttccaggg ggaaacgcct ggtatcttta tagtcctgtc 1620
gggtttcgcc acctctgact tgagcgtcga tttttgtgat gctcgtcagg ggggcggagc 1680
ctatggaaaa acgccagcaa cgcggccttt ttacggttcc tggccttttg ctggcctttt 1740
gctcacatgt tctttcctgc gttatcccct gattctgtgg ataaccgtat taccgccttt 1800
gagtgagctg ataccgctcg ccgcagccga acgaccgagc gcagcgagtc agtgagcgag 1860
gaagcggaag agcgcccaat acgcaaaccg cctctccccg cgcgttggcc gattcattaa 1920
tgcagctggc acgacaggtt tcccgactgg aaagcgggca gtgagcgcaa cgcaattaat 1980
gtgagttagc tcactcatta ggcaccccag gctttacact ttatgcttcc ggctcgtatg 2040
ttgtgtggaa ttgtgagcgg ataacaattt cacacaggaa acagctatga ccatgattac 2100
accaagcttg catgcaggcc tatccgtaga tgtacctgga catccaggtg atgccggcgg 2160
cggtggtgga ggcgcgcgga aagtcgcgga cgcggttcca gatgttgcgc agcggcaaaa 2220
agtgctccat ggtcgggacg ctctggccgg tgaggcgtgc gcagtcgttg acgctctaga 2280
ccgtgcaaaa ggagagcctg taagcgggca ctcttccgtg gtctggtgga taaattcgca 2340
agggtatcat ggcggacgac cggggttcga accccggatc cggccgtccg ccgtgatcca 2400
tgcggttacc gcccgcgtgt cgaacccagg tgtgcgacgt cagacaacgg gggagcgctc 2460
cttttggctt ccttccaggc gcggcggctg ctgcgctagc ttttttggcc actggccgcg 2520
cgcggcgtaa gcggttaggc tggaaagcga aagcattaag tggctcgctc cctgtagccg 2580
gagggttatt ttccaagggt tgagtcgcag gacccccggt tcgagtctcg ggccggccgg 2640
actgcggcga acgggggttt gcctccccgt catgcaagac cccgcttgca aattcctccg 2700
gaaacaggga cgagcccctt ttttgctttt cccagatgca tccggtgctg cggcagatgc 2760
gcccccctcc tcagcagcgg caagagcaag agcagcggca gacatgcagg gcaccctccc 2820
cttctcctac cgcgtcagga ggggcaacat ctgtacactc tcgggtgatt atttaccccc 2880
acccttgccg tctgcgccgt ttaaaaatca aaggggttct gccgcgcatc gctatgcgcc 2940
actggcaggg acacgttgcg atactggtgt ttagtgctcc acttaaactc aggcacaacc 3000
atccgcggca gctcggtgaa gttttcactc cacaggctgc gcaccatcac caacgcgttt 3060
agcaggtcgg gcgccgatat cttgaagtcg cagttggggc ctccgccctg cgcgcgcgag 3120
ttgcgataca cagggttgca gcactggaac actatcagcg ccgggtggtg cacgctggcc 3180
agcacgctct tgtcggagat cagatccgcg tccaggtcct ccgcgttgct cagggcgaac 3240
ggagtcaact ttggtagctg ccttcccaaa aagggcgcgt gcccaggctt tgagttgcac 3300
tcgcaccgta gtggcatcaa aaggtgaccg tgcccggtct gggcgttagg atacagcgcc 3360
tgcataaaag ccttgatctg cttaaaagcc acctgagcct ttgcgccttc agagaagaac 3420
atgccgcaag acttgccgga aaactgattg gccggacagg ccgcgtcgtg cacgcagcac 3480
cttgcgtcgg tgttggagat ctgcaccaca tttcggcccc accggttctt cacgatcttg 3540
gccttgctag actgctcctt cagcgcgcgc tgcccgtttt cgctcgtcac atccatttca 3600
atcacgtgct ccttatttat cataatgctt ccgtgtagac acttaagctc gccttcgatc 3660
tcagcgcagc ggtgcagcca caacgcgcag cccgtgggct cgtgatgctt gtaggtcacc 3720
tctgcaaacg actgcaggta cgcctgcagg aatcgcccca tcatcgtcac aaaggtcttg 3780
ttgctggtga aggtcagctg caacccgcgg tgctcctcgt tcagccaggt cttgcatacg 3840
gccgccagag cttccacttg gtcaggcagt agtttgaagt tcgcctttag atcgttatcc 3900
acgtggtact tgtccatcag cgcgcgcgca gcctccatgc ccttctccca cgcagacacg 3960
atcggcacac tcagcgggtt catcaccgta atttcacttt ccgcttcgct gggctcttcc 4020
tcttcctctt gcgtccgcat accacgcgcc actgggtcgt cttcattcag ccgccgcact 4080
gtgcgcttac ctcctttgcc atgcttgatt agcaccggtg ggttgctgaa acccaccatt 4140
tgtagcgcca catcttctct ttcttcctcg ctgtccacga ttacctctgg tgatggcggg 4200
cgctcgggct tgggagaagg gcgcttcttt ttcttcttgg gcgcaatggc caaatccgcc 4260
gccgaggtcg atggccgcgg gctgggtgtg cgcggcacca gcgcgtcttg tgatgagtct 4320
tcctcgtcct cggactcgat acgccgcctc atccgctttt ttgggggcgc ccggggaggc 4380
ggcggcgacg gggacgggga cgacacgtcc tccatggttg ggggacgtcg cgccgcaccg 4440
cgtccgcgct cgggggtggt ttcgcgctgc tcctcttccc gactggccat ttccttctcc 4500
tataggcaga aaaagatcca caaaagcgaa gatcagcttc ggcgcacgct ggaagacgcg 4560
gaggctctct tcagtaaata ctgcgcgctg actcttaagg actagtttcg cgccctttct 4620
caaatttaag cgcgaaaact acgtcatctc cagcggccac acccggcgcc agcacctgtt 4680
gtcagcgcca ttggcgcgcc ggccggccga atatcttcat ttaaatgttt aaacatcgat 4740
gcggccgcaa cttgtttatt gcagcttata atggttacaa ataaagcaat agcatcacaa 4800
atttcacaaa taaagcattt ttttcactgc attctagttg tggtttgtcc aaactcatca 4860
atgtatctta gcttaacggg cggcgaagga gaagtccacg cctacatggg ggtagagtca 4920
taatcgtgca tcaggatagg gcggtggtgc tgcagcagcg cgcgaataaa ctgctgccgc 4980
cgccgctccg tcctgcagga atacaacatg gcagtggtct cctcagcgat gattcgcacc 5040
gcccgcagca taaggcgcct tgtcctccgg gcacagcagc gcaccctgat ctcacttaaa 5100
tcagcacagt aactgcagca cagcaccaca atattgttca aaatcccaca gtgcaaggcg 5160
ctgtatccaa agctcatggc ggggaccaca gaacccacgt ggccatcata ccacaagcgc 5220
aggtagatta agtggcgacc cctcataaac acgctggaca taaacattac ctcttttggc 5280
atgttgtaat tcaccacctc ccggtaccat ataaacctct gattaaacat ggcgccatcc 5340
accaccatcc taaaccagct ggccaaaacc tgcccgccgg ctatacactg cagggaaccg 5400
ggactggaac aatgacagtg gagagcccag gactcgtaac catggatcat catgctcgtc 5460
atgatatcaa tgttggcaca acacaggcac acgtgcatac acttcctcag gattacaagc 5520
tcctcccgcg ttagaaccat atcccaggga acaacccatt cctgaatcag cgtaaatccc 5580
acactgcagg gaagacctcg cacgtaactc acgttgtgca ttgtcaaagt gttacattcg 5640
ggcagcagcg gatgatcctc cagtatggta gcgcgggttt ctgtctcaaa aggaggtaga 5700
cgatccctac tgtacggagt gcgccgagac aaccgagatc gtgttggtcg tagtgtcatg 5760
ccaaatggaa cgccggacgt agtcatattt cctgaagcaa aaccaggtgc gggcgtgaca 5820
aacagatctg cgtctccggt ctcgccgctt agatcgctct gtgtagtagt tgtagtatat 5880
ccactctctc aaagcatcca ggcgccccct ggcttcgggt tctatgtaaa ctccttcatg 5940
cgccgctgcc ctgataacat ccaccaccgc agaataagcc acacccagcc aacctacaca 6000
ttcgttctgc gagtcacaca cgggaggagc gggaagagct ggaagaacca tgtttttttt 6060
tttattccaa aagattatcc aaaacctcaa aatgaagatc tattaagtga acgcgctccc 6120
ctccggtggc gtggtcaaac tctacagcca aagaacagat aatggcattt gtaagatgtt 6180
gcacaatggc ttccaaaagg caaacggccc tcacgtccaa gtggacgtaa aggctaaacc 6240
cttcagggtg aatctcctct ataaacattc cagcaccttc aaccatgccc aaataattct 6300
catctcgcca ccttctcaat atatctctaa gcaaatcccg aatattaagt ccggccattg 6360
taaaaatctg ctccagagcg ccctccacct tcagcctcaa gcagcgaatc atgattgcaa 6420
aaattcaggt tcctcacaga cctgtataag attcaaaagc ggaacattaa caaaaatacc 6480
gcgatcccgt aggtcccttc gcagggccag ctgaacataa tcgtgcaggt ctgcacggac 6540
cagcgcggcc acttccccgc caggaaccat gacaaaagaa cccacactga ttatgacacg 6600
catactcgga gctatgctaa ccagcgtagc cccgatgtaa gcttgttgca tgggcggcga 6660
tataaaatgc aaggtgctgc tcaaaaaatc aggcaaagcc tcgcgcaaaa aagaaagcac 6720
atcgtagtca tgctcatgca gataaaggca ggtaagctcc ggaaccacca cagaaaaaga 6780
caccattttt ctctcaaaca tgtctgcggg tttctgcata aacacaaaat aaaataacaa 6840
aaaaacattt aaacattaga agcctgtctt acaacaggaa aaacaaccct tataagcata 6900
agacggacta cggccatgcc ggcgtgaccg taaaaaaact ggtcaccgtg attaaaaagc 6960
accaccgaca gctcctcggt catgtccgga gtcataatgt aagactcggt aaacacatca 7020
ggttgattca catcggtcag tgctaaaaag cgaccgaaat agcccggggg aatacatacc 7080
cgcaggcgta gagacaacat tacagccccc ataggaggta taacaaaatt aataggagag 7140
aaaaacacat aaacacctga aaaaccctcc tgcctaggca aaatagcacc ctcccgctcc 7200
agaacaacat acagcgcttc cacagcggca gccatggtgg catttgcaaa agcctaggcc 7260
tccaaaaaag cctcctcact acttctggaa tagctcagag gccgaggcgg cctcggcctc 7320
tgcataaata aaaaaaatta gtcagccatg gggcggagaa tgggcggaac tgggcggagt 7380
taggggcggg atgggcggag ttaggggcgg gactatggtt gctgactaat tgagatgcat 7440
gctttgcata cttctgcctg ctggggagcc tggggacttt ccacacctgg ttgctgacta 7500
attgagatgc atgctttgca tacttctgcc tgctggggag cctggggact ttccacaccc 7560
taactgacac acacgttacg tcacttccca ttttaagaaa actacaattc ccaacacata 7620
caagttactc cgcccttaat taaatcggat ccgatatcta gatgtattcg cgaggtaccg 7680
agctcgaatt ctctggccgt cgttttacaa cgtcgtgact gggaaaaccc tggcgttacc 7740
caacttaatc gccttgcagc acatccccct ttcgccagct ggcgtaatag cgaagaggcc 7800
cgcaccgatc gcccttccca acagttgcgc agcctgaatg gcgaatggcg cctgatgcgg 7860
tattttctcc ttacgcatct gtgcggtatt tcacaccgca tatggtgcac tctcagtaca 7920
atctgctctg atgccgcata gttaagccag ccccgacacc cgccaacacc cgctgacgcg 7980
ccctgacggg cttgtctgct cccggcatcc gcttacagac aagctgtgac cgtctccggg 8040
agctgcatgt gtcagaggtt ttcaccgtca tcaccgaaac gcgcga 8086
<210> 49
<211> 9413
<212> DNA
<213> Artificial Sequence
<220>
<223> Description of Artificial Sequence: Synthetic
polynucleotide
<400> 49
tgcagctctg gcccgtgtct caaaatctct gatgttacat tgcacaagat aaaaatatat 60
catcatgaac aataaaactg tctgcttaca taaacagtaa tacaaggggt gttatgagcc 120
atattcaacg ggaaacgtcg aggccgcgat taaattccaa catggatgct gatttatatg 180
ggtataaatg ggctcgcgat aatgtcgggc aatcaggtgc gacaatctat cgcttgtatg 240
ggaagcccga tgcgccagag ttgtttctga aacatggcaa aggtagcgtt gccaatgatg 300
ttacagatga gatggtcaga ctaaactggc tgacggaatt tatgcctctt ccgaccatca 360
agcattttat ccgtactcct gatgatgcat ggttactcac cactgcgatc cccggaaaaa 420
cagcattcca ggtattagaa gaatatcctg attcaggtga aaatattgtt gatgcgctgg 480
cagtgttcct gcgccggttg cattcgattc ctgtttgtaa ttgtcctttt aacagcgatc 540
gcgtatttcg tctcgctcag gcgcaatcac gaatgaataa cggtttggtt gatgcgagtg 600
attttgatga cgagcgtaat ggctggcctg ttgaacaagt ctggaaagaa atgcataaac 660
ttttgccatt ctcaccggat tcagtcgtca ctcatggtga tttctcactt gataacctta 720
tttttgacga ggggaaatta ataggttgta ttgatgttgg acgagtcgga atcgcagacc 780
gataccagga tcttgccatc ctatggaact gcctcggtga gttttctcct tcattacaga 840
aacggctttt tcaaaaatat ggtattgata atcctgatat gaataaattg cagtttcatt 900
tgatgctcga tgagtttttc taatcagaat tggttaattg gttgtaacat tattcagatt 960
gggcttgatt taaaacttca tttttaattt aaaaggatct aggtgaagat cctttttgat 1020
aatctcatga ccaaaatccc ttaacgtgag ttttcgttcc actgagcgtc agaccccgta 1080
gaaaagatca aaggatcttc ttgagatcct ttttttctgc gcgtaatctg ctgcttgcaa 1140
acaaaaaaac caccgctacc agcggtggtt tgtttgccgg atcaagagct accaactctt 1200
tttccgaagg taactggctt cagcagagcg cagataccaa atactgttct tctagtgtag 1260
ccgtagttag gccaccactt caagaactct gtagcaccgc ctacatacct cgctctgcta 1320
atcctgttac cagtggctgc tgccagtggc gataagtcgt gtcttaccgg gttggactca 1380
agacgatagt taccggataa ggcgcagcgg tcgggctgaa cggggggttc gtgcacacag 1440
cccagcttgg agcgaacgac ctacaccgaa ctgagatacc tacagcgtga gctatgagaa 1500
agcgccacgc ttcccgaagg gagaaaggcg gacaggtatc cggtaagcgg cagggtcgga 1560
acaggagagc gcacgaggga gcttccaggg ggaaacgcct ggtatcttta tagtcctgtc 1620
gggtttcgcc acctctgact tgagcgtcga tttttgtgat gctcgtcagg ggggcggagc 1680
ctatggaaaa acgccagcaa cgcggccttt ttacggttcc tggccttttg ctggcctttt 1740
gctcacatgt tctttcctgc gttatcccct gattctgtgg ataaccgtat taccgccttt 1800
gagtgagctg ataccgctcg ccgcagccga acgaccgagc gcagcgagtc agtgagcgag 1860
gaagcggaag agcgcccaat acgcaaaccg cctctccccg cgcgttggcc gattcattaa 1920
tgcagctggc acgacaggtt tcccgactgg aaagcgggca gtgagcgcaa cgcaattaat 1980
gtgagttagc tcactcatta ggcaccccag gctttacact ttatgcttcc ggctcgtatg 2040
ttgtgtggaa ttgtgagcgg ataacaattt cacacaggaa acagctatga ccatgattac 2100
accaagcttg catgcaggcc tctgcagtcg accagaagca ccatgtcctt gggtccggcc 2160
tgctgaatgc gcaggcggtc ggccatgccc caggcttcgt tttgacatcg gcgcaggtct 2220
ttgtagtagt cttgcatgag cctttctacc ggcacttctt cttctccttc ctcttgtcct 2280
gcatctcttg catctatcgc tgcggcggcg gcggagtttg gccgtaggtg gcgccctctt 2340
cctcccatgc gtgtgacccc gaagcccctc atcggctgaa gcagggctag gtcggcgaca 2400
acgcgctcgg ctaatatggc ctgctgcacc tgcgtgaggg tagactggaa gtcatccatg 2460
tccacaaagc ggtggtatgc gcccgtgttg atggtgtaag tgcagttggc cataacggac 2520
cagttaacgg tctggtgacc cggctgcgag agctcggtgt acctgagacg cgagtaagcc 2580
ctcgagtcaa atacgtagtc gttgcaagtc cgcaccaggt actggtatcc caccaaaaag 2640
tgcggcggcg gctggcggta gaggggccag cgtagggtgg ccggggctcc gggggcgaga 2700
tcttccaaca taaggcgatg atatccgtag atgtacctgg acatccaggt gatgccggcg 2760
gcggtggtgg aggcgcgcgg aaagtcgcgg acgcggttcc agatgttgcg cagcggcaaa 2820
aagtgctcca tggtcgggac gctctggccg gtcaggcgcg cgcaatcgtt gacgctctag 2880
cgtgcaaaag gagagcctgt aagcgggcac tcttccgtgg tctggtggat aaattcgcaa 2940
gggtatcatg gcggacgacc ggggttcgag ccccgtatcc ggccgtccgc cgtgatccat 3000
gcggttaccg cccgcgtgtc gaacccaggt gtgcgacgtc agacaacggg ggagtgctcc 3060
ttttggcttc cttccaggcg cggcggctgc tgcgctagct tttttggcca ctggccgcgc 3120
gcagcgtaag cggttaggct ggaaagcgaa agcattaagt ggctcgctcc ctgtagccgg 3180
agggttattt tccaagggtt gagtcgcggg acccccggtt cgagtctcgg accgagactg 3240
ggggcgtaca ctggatggcc tttgcctgga acccgcactc aaaaacatgc tacctctttg 3300
agccctttgg cttttctgac cagcgactca agcaggttta ccagtttgag tacgagtcac 3360
tcctgcgccg tagcgccatt gcttcttccc ccgaccgctg tataacgctg gaaaagtcca 3420
cccaaagcgt acaggggccc aactcggccg cctgtggact attctgctgc atgtttctcc 3480
acgcctttgc caactggccc caaactccca tggatcacaa ccccaccatg aaccttatta 3540
ccggggtacc caactccatg ctcaacagtc cccaggtaca gcccaccctg cgtcgcaacc 3600
aggaacagct ctacagcttc ctggagcgcc actcgcccta cttccgcagc cacagtgcgc 3660
agattaggag cgccacttct ttttgtcact tgaaaaacat gtaaaaataa tgtactagag 3720
acactttcaa taaaggcaaa tgcttttatt tgtacactct cgggtgatta tttaccccca 3780
cccttgccgt ctgcgccgtt taaaaatcaa aggggttctg ccgcgcatcg ctatgcgcca 3840
ctggcaggga cacgttgcga tactggtgtt tagtgctcca cttaaactca ggcacaacca 3900
tccgcggcag ctcggtgaag ttttcactcc acaggctgcg caccatcacc aacgcgttta 3960
gcaggtcggg cgccgatatc ttgaagtcgc agttggggcc tccgccctgc gcgcgcgagt 4020
tgcgatacac agggttgcag cactggaaca ctatcagcgc cgggtggtgc acgctggcca 4080
gcacgctctt gtcggagatc agatccgcgt ccaggtcctc cgcgttgctc agggcgaacg 4140
gagtcaactt tggtagctgc cttcccaaaa agggcgcgtg cccaggcttt gagttgcact 4200
cgcaccgtag tggcatcaaa aggtgaccgt gcccggtctg ggcgttagga tacagcgcct 4260
gcataaaagc cttgatctgc ttaaaagcca cctgagcctt tgcgccttca gagaagaaca 4320
tgccgcaaga cttgccggaa aactgattgg ccggacaggc cgcgtcgtgc acgcagcacc 4380
ttgcgtcggt gttggagatc tgcaccacat ttcggcccca ccggttcttc acgatcttgg 4440
ccttgctaga ctgctccttc agcgcgcgct gcccgttttc gctcgtcaca tccatttcaa 4500
tcacgtgctc cttatttatc ataatgcttc cgtgtagaca cttaagctcg ccttcgatct 4560
cagcgcagcg gtgcagccac aacgcgcagc ccgtgggctc gtgatgcttg taggtcacct 4620
ctgcaaacga ctgcaggtac gcctgcagga atcgccccat catcgtcaca aaggtcttgt 4680
tgctggtgaa ggtcagctgc aacccgcggt gctcctcgtt cagccaggtc ttgcatacgg 4740
ccgccagagc ttccacttgg tcaggcagta gtttgaagtt cgcctttaga tcgttatcca 4800
cgtggtactt gtccatcagc gcgcgcgcag cctccatgcc cttctcccac gcagacacga 4860
tcggcacact cagcgggttc atcaccgtaa tttcactttc cgcttcgctg ggctcttcct 4920
cttcctcttg cgtccgcata ccacgcgcca ctgggtcgtc ttcattcagc cgccgcactg 4980
tgcgcttacc tcctttgcca tgcttgatta gcaccggtgg gttgctgaaa cccaccattt 5040
gtagcgccac atcttctctt tcttcctcgc tgtccacgat tacctctggt gatggcgggc 5100
gctcgggctt gggagaaggg cgcttctttt tcttcttggg cgcaatggcc aaatccgccg 5160
ccgaggtcga tggccgcggg ctgggtgtgc gcggcaccag cgcgtcttgt gatgagtctt 5220
cctcgtcctc ggactcgata cgccgcctca tccgcttttt tgggggcgcc cggggaggcg 5280
gcggcgacgg ggacggggac gacacgtcct ccatggttgg gggacgtcgc gccgcaccgc 5340
gtccgcgctc gggggtggtt tcgcgctgct cctcttcccg actggccatt tccttctcct 5400
ataggcagaa aaagatccac aaaagcgaag atcagcttcg gcgcacgctg gaagacgcgg 5460
aggctctctt cagtaaatac tgcgcgctga ctcttaagga ctagtttcgc gccctttctc 5520
aaatttaagc gcgaaaacta cgtcatctcc agcggccaca cccggcgcca gcacctgttg 5580
tcagcgccat tggcgcgccc gcccgccgcg cgcttcgctt tttatagggc cgccgccgcc 5640
gccgcctcgc cataaaagga aactttcgga gcgcgccgct ctgattggct gccgccgcac 5700
ctctccgcct cgccccgccc cgcccctcgc cccgccccgc cccgcctggc gcgcgccccc 5760
cccccccccc cgcccccatc gctgcacaaa ataattaaaa aataaataaa tacaaaattg 5820
ggggtgggga ggggggggag atggggagag tgaagcagaa cgtggggctc acctcgaggc 5880
cggccgaata tcttcattta aatgtttaaa catcgatgcg gccgccgttt gtgttatgtt 5940
tcaacgtgtt tatttttcaa ttgcagaaaa tttcaagtca tttttcattc agtagtatag 6000
ccccaccacc acatagctta tacagatcac cgtaccttaa tcaaactcac agaaccctag 6060
tattcaacct gccacctccc tcccaacaca cagagtacac agtcctttct ccccggctgg 6120
ccttaaaaag catcatatca tgggtaacag acatattctt aggtgttata ttccacacgg 6180
tttcctgtcg agccaaacgc tcatcagtga tattaataaa ctccccgggc agctcactta 6240
agttcatgtc gctgtccagc tgctgagcca caggctgctg tccaacttgc ggttgcttaa 6300
cgggcggcga aggagaagtc cacgcctaca tgggggtaga gtcataatcg tgcatcagga 6360
tagggcggtg gtgctgcagc agcgcgcgaa taaactgctg ccgccgccgc tccgtcctgc 6420
aggaatacaa catggcagtg gtctcctcag cgatgattcg caccgcccgc agcataaggc 6480
gccttgtcct ccgggcacag cagcgcaccc tgatctcact taaatcagca cagtaactgc 6540
agcacagcac cacaatattg ttcaaaatcc cacagtgcaa ggcgctgtat ccaaagctca 6600
tggcggggac cacagaaccc acgtggccat cataccacaa gcgcaggtag attaagtggc 6660
gacccctcat aaacacgctg gacataaaca ttacctcttt tggcatgttg taattcacca 6720
cctcccggta ccatataaac ctctgattaa acatggcgcc atccaccacc atcctaaacc 6780
agctggccaa aacctgcccg ccggctatac actgcaggga accgggactg gaacaatgac 6840
agtggagagc ccaggactcg taaccatgga tcatcatgct cgtcatgata tcaatgttgg 6900
cacaacacag gcacacgtgc atacacttcc tcaggattac aagctcctcc cgcgttagaa 6960
ccatatccca gggaacaacc cattcctgaa tcagcgtaaa tcccacactg cagggaagac 7020
ctcgcacgta actcacgttg tgcattgtca aagtgttaca ttcgggcagc agcggatgat 7080
cctccagtat ggtagcgcgg gtttctgtct caaaaggagg tagacgatcc ctactgtacg 7140
gagtgcgccg agacaaccga gatcgtgttg gtcgtagtgt catgccaaat ggaacgccgg 7200
acgtagtcat atttcctgaa gcaaaaccag gtgcgggcgt gacaaacaga tctgcgtctc 7260
cggtctcgcc gcttagatcg ctctgtgtag tagttgtagt atatccactc tctcaaagca 7320
tccaggcgcc ccctggcttc gggttctatg taaactcctt catgcgccgc tgccctgata 7380
acatccacca ccgcagaata agccacaccc agccaaccta cacattcgtt ctgcgagtca 7440
cacacgggag gagcgggaag agctggaaga accatgtttt tttttttatt ccaaaagatt 7500
atccaaaacc tcaaaatgaa gatctattaa gtgaacgcgc tcccctccgg tggcgtggtc 7560
aaactctaca gccaaagaac agataatggc atttgtaaga tgttgcacaa tggcttccaa 7620
aaggcaaacg gccctcacgt ccaagtggac gtaaaggcta aacccttcag ggtgaatctc 7680
ctctataaac attccagcac cttcaaccat gcccaaataa ttctcatctc gccaccttct 7740
caatatatct ctaagcaaat cccgaatatt aagtccggcc attgtaaaaa tctgctccag 7800
agcgccctcc accttcagcc tcaagcagcg aatcatgatt gcaaaaattc aggttcctca 7860
cagacctgta taagattcaa aagcggaaca ttaacaaaaa taccgcgatc ccgtaggtcc 7920
cttcgcaggg ccagctgaac ataatcgtgc aggtctgcac ggaccagcgc ggccacttcc 7980
ccgccaggaa ccatgacaaa agaacccaca ctgattatga cacgcatact cggagctatg 8040
ctaaccagcg tagccccgat gtaagcttgt tgcatgggcg gcgatataaa atgcaaggtg 8100
ctgctcaaaa aatcaggcaa agcctcgcgc aaaaaagaaa gcacatcgta gtcatgctca 8160
tgcagataaa ggcaggtaag ctccggaacc accacagaaa aagacaccat ttttctctca 8220
aacatgtctg cgggtttctg cataaacaca aaataaaata acaaaaaaac atttaaacat 8280
tagaagcctg tcttacaaca ggaaaaacaa cccttataag cataagacgg actacggcca 8340
tgccggcgtg accgtaaaaa aactggtcac cgtgattaaa aagcaccacc gacagctcct 8400
cggtcatgtc cggagtcata atgtaagact cggtaaacac atcaggttga ttcacatcgg 8460
tcagtgctaa aaagcgaccg aaatagcccg ggggaataca tacccgcagg cgtagagaca 8520
acattacagc ccccatagga ggtataacaa aattaatagg agagaaaaac acataaacac 8580
ctgaaaaacc ctcctgccta ggcaaaatag caccctcccg ctccagaaca acatacagcg 8640
cttccacagc ggcagccata acagtcagcc ttaccagtaa aaaagaaaac ctattaaaaa 8700
aacaccactc gacacggcac cagctcaatc agtcacagtg taaaaaaggg ccaagtgcag 8760
agcgagtata tataggacta aaaaatgacg taacggttaa agtccacaaa aaacacccag 8820
aaaaccgcac gcgaacctac gcccagaaac gaaagccaaa aaacccacaa cttcctcaaa 8880
tcgtcacttc cgttttccca cgttacgtca cttcccattt taagaaaact acaattccca 8940
acacatacaa gttactccgc ccttaattaa atcggatccg atatctagat gtattcgcga 9000
ggtaccgagc tcgaattctc tggccgtcgt tttacaacgt cgtgactggg aaaaccctgg 9060
cgttacccaa cttaatcgcc ttgcagcaca tccccctttc gccagctggc gtaatagcga 9120
agaggcccgc accgatcgcc cttcccaaca gttgcgcagc ctgaatggcg aatggcgcct 9180
gatgcggtat tttctcctta cgcatctgtg cggtatttca caccgcatat ggtgcactct 9240
cagtacaatc tgctctgatg ccgcatagtt aagccagccc cgacacccgc caacacccgc 9300
tgacgcgccc tgacgggctt gtctgctccc ggcatccgct tacagacaag ctgtgaccgt 9360
ctccgggagc tgcatgtgtc agaggttttc accgtcatca ccgaaacgcg cga 9413
<210> 50
<211> 8514
<212> DNA
<213> Artificial Sequence
<220>
<223> Description of Artificial Sequence: Synthetic polynucleotide
<400> 50
tgcagctctg gcccgtgtct caaaatctct gatgttacat tgcacaagat aaaaatatat 60
catcatgaac aataaaactg tctgcttaca taaacagtaa tacaaggggt gttatgagcc 120
atattcaacg ggaaacgtcg aggccgcgat taaattccaa catggatgct gatttatatg 180
ggtataaatg ggctcgcgat aatgtcgggc aatcaggtgc gacaatctat cgcttgtatg 240
ggaagcccga tgcgccagag ttgtttctga aacatggcaa aggtagcgtt gccaatgatg 300
ttacagatga gatggtcaga ctaaactggc tgacggaatt tatgcctctt ccgaccatca 360
agcattttat ccgtactcct gatgatgcat ggttactcac cactgcgatc cccggaaaaa 420
cagcattcca ggtattagaa gaatatcctg attcaggtga aaatattgtt gatgcgctgg 480
cagtgttcct gcgccggttg cattcgattc ctgtttgtaa ttgtcctttt aacagcgatc 540
gcgtatttcg tctcgctcag gcgcaatcac gaatgaataa cggtttggtt gatgcgagtg 600
attttgatga cgagcgtaat ggctggcctg ttgaacaagt ctggaaagaa atgcataaac 660
ttttgccatt ctcaccggat tcagtcgtca ctcatggtga tttctcactt gataacctta 720
tttttgacga ggggaaatta ataggttgta ttgatgttgg acgagtcgga atcgcagacc 780
gataccagga tcttgccatc ctatggaact gcctcggtga gttttctcct tcattacaga 840
aacggctttt tcaaaaatat ggtattgata atcctgatat gaataaattg cagtttcatt 900
tgatgctcga tgagtttttc taatcagaat tggttaattg gttgtaacat tattcagatt 960
gggcttgatt taaaacttca tttttaattt aaaaggatct aggtgaagat cctttttgat 1020
aatctcatga ccaaaatccc ttaacgtgag ttttcgttcc actgagcgtc agaccccgta 1080
gaaaagatca aaggatcttc ttgagatcct ttttttctgc gcgtaatctg ctgcttgcaa 1140
acaaaaaaac caccgctacc agcggtggtt tgtttgccgg atcaagagct accaactctt 1200
tttccgaagg taactggctt cagcagagcg cagataccaa atactgttct tctagtgtag 1260
ccgtagttag gccaccactt caagaactct gtagcaccgc ctacatacct cgctctgcta 1320
atcctgttac cagtggctgc tgccagtggc gataagtcgt gtcttaccgg gttggactca 1380
agacgatagt taccggataa ggcgcagcgg tcgggctgaa cggggggttc gtgcacacag 1440
cccagcttgg agcgaacgac ctacaccgaa ctgagatacc tacagcgtga gctatgagaa 1500
agcgccacgc ttcccgaagg gagaaaggcg gacaggtatc cggtaagcgg cagggtcgga 1560
acaggagagc gcacgaggga gcttccaggg ggaaacgcct ggtatcttta tagtcctgtc 1620
gggtttcgcc acctctgact tgagcgtcga tttttgtgat gctcgtcagg ggggcggagc 1680
ctatggaaaa acgccagcaa cgcggccttt ttacggttcc tggccttttg ctggcctttt 1740
gctcacatgt tctttcctgc gttatcccct gattctgtgg ataaccgtat taccgccttt 1800
gagtgagctg ataccgctcg ccgcagccga acgaccgagc gcagcgagtc agtgagcgag 1860
gaagcggaag agcgcccaat acgcaaaccg cctctccccg cgcgttggcc gattcattaa 1920
tgcagctggc acgacaggtt tcccgactgg aaagcgggca gtgagcgcaa cgcaattaat 1980
gtgagttagc tcactcatta ggcaccccag gctttacact ttatgcttcc ggctcgtatg 2040
ttgtgtggaa ttgtgagcgg ataacaattt cacacaggaa acagctatga ccatgattac 2100
accaagcttg catgcaggcc tatccgtaga tgtacctgga catccaggtg atgccggcgg 2160
cggtggtgga ggcgcgcgga aagtcgcgga cgcggttcca gatgttgcgc agcggcaaaa 2220
agtgctccat ggtcgggacg ctctggccgg tgaggcgtgc gcagtcgttg acgctctaga 2280
ccgtgcaaaa ggagagcctg taagcgggca ctcttccgtg gtctggtgga taaattcgca 2340
agggtatcat ggcggacgac cggggttcga accccggatc cggccgtccg ccgtgatcca 2400
tgcggttacc gcccgcgtgt cgaacccagg tgtgcgacgt cagacaacgg gggagcgctc 2460
cttttggctt ccttccaggc gcggcggctg ctgcgctagc ttttttggcc actggccgcg 2520
cgcggcgtaa gcggttaggc tggaaagcga aagcattaag tggctcgctc cctgtagccg 2580
gagggttatt ttccaagggt tgagtcgcag gacccccggt tcgagtctcg ggccggccgg 2640
actgcggcga acgggggttt gcctccccgt catgcaagac cccgcttgca aattcctccg 2700
gaaacaggga cgagcccctt ttttgctttt cccagatgca tccggtgctg cggcagatgc 2760
gcccccctcc tcagcagcgg caagagcaag agcagcggca gacatgcagg gcaccctccc 2820
cttctcctac cgcgtcagga ggggcaacat ctgtacactc tcgggtgatt atttaccccc 2880
acccttgccg tctgcgccgt ttaaaaatca aaggggttct gccgcgcatc gctatgcgcc 2940
actggcaggg acacgttgcg atactggtgt ttagtgctcc acttaaactc aggcacaacc 3000
atccgcggca gctcggtgaa gttttcactc cacaggctgc gcaccatcac caacgcgttt 3060
agcaggtcgg gcgccgatat cttgaagtcg cagttggggc ctccgccctg cgcgcgcgag 3120
ttgcgataca cagggttgca gcactggaac actatcagcg ccgggtggtg cacgctggcc 3180
agcacgctct tgtcggagat cagatccgcg tccaggtcct ccgcgttgct cagggcgaac 3240
ggagtcaact ttggtagctg ccttcccaaa aagggcgcgt gcccaggctt tgagttgcac 3300
tcgcaccgta gtggcatcaa aaggtgaccg tgcccggtct gggcgttagg atacagcgcc 3360
tgcataaaag ccttgatctg cttaaaagcc acctgagcct ttgcgccttc agagaagaac 3420
atgccgcaag acttgccgga aaactgattg gccggacagg ccgcgtcgtg cacgcagcac 3480
cttgcgtcgg tgttggagat ctgcaccaca tttcggcccc accggttctt cacgatcttg 3540
gccttgctag actgctcctt cagcgcgcgc tgcccgtttt cgctcgtcac atccatttca 3600
atcacgtgct ccttatttat cataatgctt ccgtgtagac acttaagctc gccttcgatc 3660
tcagcgcagc ggtgcagcca caacgcgcag cccgtgggct cgtgatgctt gtaggtcacc 3720
tctgcaaacg actgcaggta cgcctgcagg aatcgcccca tcatcgtcac aaaggtcttg 3780
ttgctggtga aggtcagctg caacccgcgg tgctcctcgt tcagccaggt cttgcatacg 3840
gccgccagag cttccacttg gtcaggcagt agtttgaagt tcgcctttag atcgttatcc 3900
acgtggtact tgtccatcag cgcgcgcgca gcctccatgc ccttctccca cgcagacacg 3960
atcggcacac tcagcgggtt catcaccgta atttcacttt ccgcttcgct gggctcttcc 4020
tcttcctctt gcgtccgcat accacgcgcc actgggtcgt cttcattcag ccgccgcact 4080
gtgcgcttac ctcctttgcc atgcttgatt agcaccggtg ggttgctgaa acccaccatt 4140
tgtagcgcca catcttctct ttcttcctcg ctgtccacga ttacctctgg tgatggcggg 4200
cgctcgggct tgggagaagg gcgcttcttt ttcttcttgg gcgcaatggc caaatccgcc 4260
gccgaggtcg atggccgcgg gctgggtgtg cgcggcacca gcgcgtcttg tgatgagtct 4320
tcctcgtcct cggactcgat acgccgcctc atccgctttt ttgggggcgc ccggggaggc 4380
ggcggcgacg gggacgggga cgacacgtcc tccatggttg ggggacgtcg cgccgcaccg 4440
cgtccgcgct cgggggtggt ttcgcgctgc tcctcttccc gactggccat ttccttctcc 4500
tataggcaga aaaagatcca caaaagcgaa gatcagcttc ggcgcacgct ggaagacgcg 4560
gaggctctct tcagtaaata ctgcgcgctg actcttaagg actagtttcg cgccctttct 4620
caaatttaag cgcgaaaact acgtcatctc cagcggccac acccggcgcc agcacctgtt 4680
gtcagcgcca ttggcgcgcc cgcccgccgc gcgcttcgct ttttataggg ccgccgccgc 4740
cgccgcctcg ccataaaagg aaactttcgg agcgcgccgc tctgattggc tgccgccgca 4800
cctctccgcc tcgccccgcc ccgcccctcg ccccgccccg ccccgcctgg cgcgcgcccc 4860
cccccccccc ccgcccccat cgctgcacaa aataattaaa aaataaataa atacaaaatt 4920
gggggtgggg agggggggga gatggggaga gtgaagcaga acgtggggct cacctcgagg 4980
ccggccgaat atcttcattt aaatgtttaa acatcgatgc ggccgccgtt tgtgttatgt 5040
ttcaacgtgt ttatttttca attgcagaaa atttcaagtc atttttcatt cagtagtata 5100
gccccaccac cacatagctt atacagatca ccgtacctta atcaaactca cagaacccta 5160
gtattcaacc tgccacctcc ctcccaacac acagagtaca cagtcctttc tccccggctg 5220
gccttaaaaa gcatcatatc atgggtaaca gacatattct taggtgttat attccacacg 5280
gtttcctgtc gagccaaacg ctcatcagtg atattaataa actccccggg cagctcactt 5340
aagttcatgt cgctgtccag ctgctgagcc acaggctgct gtccaacttg cggttgctta 5400
acgggcggcg aaggagaagt ccacgcctac atgggggtag agtcataatc gtgcatcagg 5460
atagggcggt ggtgctgcag cagcgcgcga ataaactgct gccgccgccg ctccgtcctg 5520
caggaataca acatggcagt ggtctcctca gcgatgattc gcaccgcccg cagcataagg 5580
cgccttgtcc tccgggcaca gcagcgcacc ctgatctcac ttaaatcagc acagtaactg 5640
cagcacagca ccacaatatt gttcaaaatc ccacagtgca aggcgctgta tccaaagctc 5700
atggcgggga ccacagaacc cacgtggcca tcataccaca agcgcaggta gattaagtgg 5760
cgacccctca taaacacgct ggacataaac attacctctt ttggcatgtt gtaattcacc 5820
acctcccggt accatataaa cctctgatta aacatggcgc catccaccac catcctaaac 5880
cagctggcca aaacctgccc gccggctata cactgcaggg aaccgggact ggaacaatga 5940
cagtggagag cccaggactc gtaaccatgg atcatcatgc tcgtcatgat atcaatgttg 6000
gcacaacaca ggcacacgtg catacacttc ctcaggatta caagctcctc ccgcgttaga 6060
accatatccc agggaacaac ccattcctga atcagcgtaa atcccacact gcagggaaga 6120
cctcgcacgt aactcacgtt gtgcattgtc aaagtgttac attcgggcag cagcggatga 6180
tcctccagta tggtagcgcg ggtttctgtc tcaaaaggag gtagacgatc cctactgtac 6240
ggagtgcgcc gagacaaccg agatcgtgtt ggtcgtagtg tcatgccaaa tggaacgccg 6300
gacgtagtca tatttcctga agcaaaacca ggtgcgggcg tgacaaacag atctgcgtct 6360
ccggtctcgc cgcttagatc gctctgtgta gtagttgtag tatatccact ctctcaaagc 6420
atccaggcgc cccctggctt cgggttctat gtaaactcct tcatgcgccg ctgccctgat 6480
aacatccacc accgcagaat aagccacacc cagccaacct acacattcgt tctgcgagtc 6540
acacacggga ggagcgggaa gagctggaag aaccatgttt ttttttttat tccaaaagat 6600
tatccaaaac ctcaaaatga agatctatta agtgaacgcg ctcccctccg gtggcgtggt 6660
caaactctac agccaaagaa cagataatgg catttgtaag atgttgcaca atggcttcca 6720
aaaggcaaac ggccctcacg tccaagtgga cgtaaaggct aaacccttca gggtgaatct 6780
cctctataaa cattccagca ccttcaacca tgcccaaata attctcatct cgccaccttc 6840
tcaatatatc tctaagcaaa tcccgaatat taagtccggc cattgtaaaa atctgctcca 6900
gagcgccctc caccttcagc ctcaagcagc gaatcatgat tgcaaaaatt caggttcctc 6960
acagacctgt ataagattca aaagcggaac attaacaaaa ataccgcgat cccgtaggtc 7020
ccttcgcagg gccagctgaa cataatcgtg caggtctgca cggaccagcg cggccacttc 7080
cccgccagga accatgacaa aagaacccac actgattatg acacgcatac tcggagctat 7140
gctaaccagc gtagccccga tgtaagcttg ttgcatgggc ggcgatataa aatgcaaggt 7200
gctgctcaaa aaatcaggca aagcctcgcg caaaaaagaa agcacatcgt agtcatgctc 7260
atgcagataa aggcaggtaa gctccggaac caccacagaa aaagacacca tttttctctc 7320
aaacatgtct gcgggtttct gcataaacac aaaataaaat aacaaaaaaa catttaaaca 7380
ttagaagcct gtcttacaac aggaaaaaca acccttataa gcataagacg gactacggcc 7440
atgccggcgt gaccgtaaaa aaactggtca ccgtgattaa aaagcaccac cgacagctcc 7500
tcggtcatgt ccggagtcat aatgtaagac tcggtaaaca catcaggttg attcacatcg 7560
gtcagtgcta aaaagcgacc gaaatagccc gggggaatac atacccgcag gcgtagagac 7620
aacattacag cccccatagg aggtataaca aaattaatag gagagaaaaa cacataaaca 7680
cctgaaaaac cctcctgcct aggcaaaata gcaccctccc gctccagaac aacatacagc 7740
gcttccacag cggcagccat aacagtcagc cttaccagta aaaaagaaaa cctattaaaa 7800
aaacaccact cgacacggca ccagctcaat cagtcacagt gtaaaaaagg gccaagtgca 7860
gagcgagtat atataggact aaaaaatgac gtaacggtta aagtccacaa aaaacaccca 7920
gaaaaccgca cgcgaaccta cgcccagaaa cgaaagccaa aaaacccaca acttcctcaa 7980
atcgtcactt ccgttttccc acgttacgtc acttcccatt ttaagaaaac tacaattccc 8040
aacacataca agttactccg cccttaatta aatcggatcc gatatctaga tgtattcgcg 8100
aggtaccgag ctcgaattct ctggccgtcg ttttacaacg tcgtgactgg gaaaaccctg 8160
gcgttaccca acttaatcgc cttgcagcac atcccccttt cgccagctgg cgtaatagcg 8220
aagaggcccg caccgatcgc ccttcccaac agttgcgcag cctgaatggc gaatggcgcc 8280
tgatgcggta ttttctcctt acgcatctgt gcggtatttc acaccgcata tggtgcactc 8340
tcagtacaat ctgctctgat gccgcatagt taagccagcc ccgacacccg ccaacacccg 8400
ctgacgcgcc ctgacgggct tgtctgctcc cggcatccgc ttacagacaa gctgtgaccg 8460
tctccgggag ctgcatgtgt cagaggtttt caccgtcatc accgaaacgc gcga 8514
<210> 51
<211> 8649
<212> DNA
<213> Artificial Sequence
<220>
<223> Description of Artificial Sequence: Synthetic polynucleotide
<400> 51
tgcagctctg gcccgtgtct caaaatctct gatgttacat tgcacaagat aaaaatatat 60
catcatgaac aataaaactg tctgcttaca taaacagtaa tacaaggggt gttatgagcc 120
atattcaacg ggaaacgtcg aggccgcgat taaattccaa catggatgct gatttatatg 180
ggtataaatg ggctcgcgat aatgtcgggc aatcaggtgc gacaatctat cgcttgtatg 240
ggaagcccga tgcgccagag ttgtttctga aacatggcaa aggtagcgtt gccaatgatg 300
ttacagatga gatggtcaga ctaaactggc tgacggaatt tatgcctctt ccgaccatca 360
agcattttat ccgtactcct gatgatgcat ggttactcac cactgcgatc cccggaaaaa 420
cagcattcca ggtattagaa gaatatcctg attcaggtga aaatattgtt gatgcgctgg 480
cagtgttcct gcgccggttg cattcgattc ctgtttgtaa ttgtcctttt aacagcgatc 540
gcgtatttcg tctcgctcag gcgcaatcac gaatgaataa cggtttggtt gatgcgagtg 600
attttgatga cgagcgtaat ggctggcctg ttgaacaagt ctggaaagaa atgcataaac 660
ttttgccatt ctcaccggat tcagtcgtca ctcatggtga tttctcactt gataacctta 720
tttttgacga ggggaaatta ataggttgta ttgatgttgg acgagtcgga atcgcagacc 780
gataccagga tcttgccatc ctatggaact gcctcggtga gttttctcct tcattacaga 840
aacggctttt tcaaaaatat ggtattgata atcctgatat gaataaattg cagtttcatt 900
tgatgctcga tgagtttttc taatcagaat tggttaattg gttgtaacat tattcagatt 960
gggcttgatt taaaacttca tttttaattt aaaaggatct aggtgaagat cctttttgat 1020
aatctcatga ccaaaatccc ttaacgtgag ttttcgttcc actgagcgtc agaccccgta 1080
gaaaagatca aaggatcttc ttgagatcct ttttttctgc gcgtaatctg ctgcttgcaa 1140
acaaaaaaac caccgctacc agcggtggtt tgtttgccgg atcaagagct accaactctt 1200
tttccgaagg taactggctt cagcagagcg cagataccaa atactgttct tctagtgtag 1260
ccgtagttag gccaccactt caagaactct gtagcaccgc ctacatacct cgctctgcta 1320
atcctgttac cagtggctgc tgccagtggc gataagtcgt gtcttaccgg gttggactca 1380
agacgatagt taccggataa ggcgcagcgg tcgggctgaa cggggggttc gtgcacacag 1440
cccagcttgg agcgaacgac ctacaccgaa ctgagatacc tacagcgtga gctatgagaa 1500
agcgccacgc ttcccgaagg gagaaaggcg gacaggtatc cggtaagcgg cagggtcgga 1560
acaggagagc gcacgaggga gcttccaggg ggaaacgcct ggtatcttta tagtcctgtc 1620
gggtttcgcc acctctgact tgagcgtcga tttttgtgat gctcgtcagg ggggcggagc 1680
ctatggaaaa acgccagcaa cgcggccttt ttacggttcc tggccttttg ctggcctttt 1740
gctcacatgt tctttcctgc gttatcccct gattctgtgg ataaccgtat taccgccttt 1800
gagtgagctg ataccgctcg ccgcagccga acgaccgagc gcagcgagtc agtgagcgag 1860
gaagcggaag agcgcccaat acgcaaaccg cctctccccg cgcgttggcc gattcattaa 1920
tgcagctggc acgacaggtt tcccgactgg aaagcgggca gtgagcgcaa cgcaattaat 1980
gtgagttagc tcactcatta ggcaccccag gctttacact ttatgcttcc ggctcgtatg 2040
ttgtgtggaa ttgtgagcgg ataacaattt cacacaggaa acagctatga ccatgattac 2100
accaagcttg catgcaggcc tatccgtaga tgtacctgga catccaggtg atgccggcgg 2160
cggtggtgga ggcgcgcgga aagtcgcgga cgcggttcca gatgttgcgc agcggcaaaa 2220
agtgctccat ggtcgggacg ctctggccgg tgaggcgtgc gcagtcgttg acgctctaga 2280
ccgtgcaaaa ggagagcctg taagcgggca ctcttccgtg gtctggtgga taaattcgca 2340
agggtatcat ggcggacgac cggggttcga accccggatc cggccgtccg ccgtgatcca 2400
tgcggttacc gcccgcgtgt cgaacccagg tgtgcgacgt cagacaacgg gggagcgctc 2460
cttttggctt ccttccaggc gcggcggctg ctgcgctagc ttttttggcc actggccgcg 2520
cgcggcgtaa gcggttaggc tggaaagcga aagcattaag tggctcgctc cctgtagccg 2580
gagggttatt ttccaagggt tgagtcgcag gacccccggt tcgagtctcg ggccggccgg 2640
actgcggcga acgggggttt gcctccccgt catgcaagac cccgcttgca aattcctccg 2700
gaaacaggga cgagcccctt ttttgctttt cccagatgca tccggtgctg cggcagatgc 2760
gcccccctcc tcagcagcgg caagagcaag agcagcggca gacatgcagg gcaccctccc 2820
cttctcctac cgcgtcagga ggggcaacat cgatccagac atgataagat acattgatga 2880
gtttggacaa accacaacta gaatgcagtg aaaaaaatgc tttatttgtg aaatttgtga 2940
tgctattgct ttatttgtaa ccattataag ctgcaataaa caagtttgta cactctcggg 3000
tgattattta cccccaccct tgccgtctgc gccgtttaaa aatcaaaggg gttctgccgc 3060
gcatcgctat gcgccactgg cagggacacg ttgcgatact ggtgtttagt gctccactta 3120
aactcaggca caaccatccg cggcagctcg gtgaagtttt cactccacag gctgcgcacc 3180
atcaccaacg cgtttagcag gtcgggcgcc gatatcttga agtcgcagtt ggggcctccg 3240
ccctgcgcgc gcgagttgcg atacacaggg ttgcagcact ggaacactat cagcgccggg 3300
tggtgcacgc tggccagcac gctcttgtcg gagatcagat ccgcgtccag gtcctccgcg 3360
ttgctcaggg cgaacggagt caactttggt agctgccttc ccaaaaaggg cgcgtgccca 3420
ggctttgagt tgcactcgca ccgtagtggc atcaaaaggt gaccgtgccc ggtctgggcg 3480
ttaggataca gcgcctgcat aaaagccttg atctgcttaa aagccacctg agcctttgcg 3540
ccttcagaga agaacatgcc gcaagacttg ccggaaaact gattggccgg acaggccgcg 3600
tcgtgcacgc agcaccttgc gtcggtgttg gagatctgca ccacatttcg gccccaccgg 3660
ttcttcacga tcttggcctt gctagactgc tccttcagcg cgcgctgccc gttttcgctc 3720
gtcacatcca tttcaatcac gtgctcctta tttatcataa tgcttccgtg tagacactta 3780
agctcgcctt cgatctcagc gcagcggtgc agccacaacg cgcagcccgt gggctcgtga 3840
tgcttgtagg tcacctctgc aaacgactgc aggtacgcct gcaggaatcg ccccatcatc 3900
gtcacaaagg tcttgttgct ggtgaaggtc agctgcaacc cgcggtgctc ctcgttcagc 3960
caggtcttgc atacggccgc cagagcttcc acttggtcag gcagtagttt gaagttcgcc 4020
tttagatcgt tatccacgtg gtacttgtcc atcagcgcgc gcgcagcctc catgcccttc 4080
tcccacgcag acacgatcgg cacactcagc gggttcatca ccgtaatttc actttccgct 4140
tcgctgggct cttcctcttc ctcttgcgtc cgcataccac gcgccactgg gtcgtcttca 4200
ttcagccgcc gcactgtgcg cttacctcct ttgccatgct tgattagcac cggtgggttg 4260
ctgaaaccca ccatttgtag cgccacatct tctctttctt cctcgctgtc cacgattacc 4320
tctggtgatg gcgggcgctc gggcttggga gaagggcgct tctttttctt cttgggcgca 4380
atggccaaat ccgccgccga ggtcgatggc cgcgggctgg gtgtgcgcgg caccagcgcg 4440
tcttgtgatg agtcttcctc gtcctcggac tcgatacgcc gcctcatccg cttttttggg 4500
ggcgcccggg gaggcggcgg cgacggggac ggggacgaca cgtcctccat ggttggggga 4560
cgtcgcgccg caccgcgtcc gcgctcgggg gtggtttcgc gctgctcctc ttcccgactg 4620
gccatttcct tctcctatag gcagaaaaag atccacaaaa gcgaagatca gcttcggcgc 4680
acgctggaag acgcggaggc tctcttcagt aaatactgcg cgctgactct taaggactag 4740
tttcgcgccc tttctcaaat ttaagcgcga aaactacgtc atctccagcg gccacacccg 4800
gcgccagcac ctgttgtcag cgccattggc gcgcccgccc gccgcgcgct tcgcttttta 4860
tagggccgcc gccgccgccg cctcgccata aaaggaaact ttcggagcgc gccgctctga 4920
ttggctgccg ccgcacctct ccgcctcgcc ccgccccgcc cctcgccccg ccccgccccg 4980
cctggcgcgc gccccccccc cccccccgcc cccatcgctg cacaaaataa ttaaaaaata 5040
aataaataca aaattggggg tggggagggg ggggagatgg ggagagtgaa gcagaacgtg 5100
gggctcacct cgaggccggc cgaatatctt catttaaatg tttaaacatc gatgcggccg 5160
ccgtttgtgt tatgtttcaa cgtgtttatt tttcaattgc agaaaatttc aagtcatttt 5220
tcattcagta gtatagcccc accaccacat agcttataca gatcaccgta ccttaatcaa 5280
actcacagaa ccctagtatt caacctgcca cctccctccc aacacacaga gtacacagtc 5340
ctttctcccc ggctggcctt aaaaagcatc atatcatggg taacagacat attcttaggt 5400
gttatattcc acacggtttc ctgtcgagcc aaacgctcat cagtgatatt aataaactcc 5460
ccgggcagct cacttaagtt catgtcgctg tccagctgct gagccacagg ctgctgtcca 5520
acttgcggtt gcttaacggg cggcgaagga gaagtccacg cctacatggg ggtagagtca 5580
taatcgtgca tcaggatagg gcggtggtgc tgcagcagcg cgcgaataaa ctgctgccgc 5640
cgccgctccg tcctgcagga atacaacatg gcagtggtct cctcagcgat gattcgcacc 5700
gcccgcagca taaggcgcct tgtcctccgg gcacagcagc gcaccctgat ctcacttaaa 5760
tcagcacagt aactgcagca cagcaccaca atattgttca aaatcccaca gtgcaaggcg 5820
ctgtatccaa agctcatggc ggggaccaca gaacccacgt ggccatcata ccacaagcgc 5880
aggtagatta agtggcgacc cctcataaac acgctggaca taaacattac ctcttttggc 5940
atgttgtaat tcaccacctc ccggtaccat ataaacctct gattaaacat ggcgccatcc 6000
accaccatcc taaaccagct ggccaaaacc tgcccgccgg ctatacactg cagggaaccg 6060
ggactggaac aatgacagtg gagagcccag gactcgtaac catggatcat catgctcgtc 6120
atgatatcaa tgttggcaca acacaggcac acgtgcatac acttcctcag gattacaagc 6180
tcctcccgcg ttagaaccat atcccaggga acaacccatt cctgaatcag cgtaaatccc 6240
acactgcagg gaagacctcg cacgtaactc acgttgtgca ttgtcaaagt gttacattcg 6300
ggcagcagcg gatgatcctc cagtatggta gcgcgggttt ctgtctcaaa aggaggtaga 6360
cgatccctac tgtacggagt gcgccgagac aaccgagatc gtgttggtcg tagtgtcatg 6420
ccaaatggaa cgccggacgt agtcatattt cctgaagcaa aaccaggtgc gggcgtgaca 6480
aacagatctg cgtctccggt ctcgccgctt agatcgctct gtgtagtagt tgtagtatat 6540
ccactctctc aaagcatcca ggcgccccct ggcttcgggt tctatgtaaa ctccttcatg 6600
cgccgctgcc ctgataacat ccaccaccgc agaataagcc acacccagcc aacctacaca 6660
ttcgttctgc gagtcacaca cgggaggagc gggaagagct ggaagaacca tgtttttttt 6720
tttattccaa aagattatcc aaaacctcaa aatgaagatc tattaagtga acgcgctccc 6780
ctccggtggc gtggtcaaac tctacagcca aagaacagat aatggcattt gtaagatgtt 6840
gcacaatggc ttccaaaagg caaacggccc tcacgtccaa gtggacgtaa aggctaaacc 6900
cttcagggtg aatctcctct ataaacattc cagcaccttc aaccatgccc aaataattct 6960
catctcgcca ccttctcaat atatctctaa gcaaatcccg aatattaagt ccggccattg 7020
taaaaatctg ctccagagcg ccctccacct tcagcctcaa gcagcgaatc atgattgcaa 7080
aaattcaggt tcctcacaga cctgtataag attcaaaagc ggaacattaa caaaaatacc 7140
gcgatcccgt aggtcccttc gcagggccag ctgaacataa tcgtgcaggt ctgcacggac 7200
cagcgcggcc acttccccgc caggaaccat gacaaaagaa cccacactga ttatgacacg 7260
catactcgga gctatgctaa ccagcgtagc cccgatgtaa gcttgttgca tgggcggcga 7320
tataaaatgc aaggtgctgc tcaaaaaatc aggcaaagcc tcgcgcaaaa aagaaagcac 7380
atcgtagtca tgctcatgca gataaaggca ggtaagctcc ggaaccacca cagaaaaaga 7440
caccattttt ctctcaaaca tgtctgcggg tttctgcata aacacaaaat aaaataacaa 7500
aaaaacattt aaacattaga agcctgtctt acaacaggaa aaacaaccct tataagcata 7560
agacggacta cggccatgcc ggcgtgaccg taaaaaaact ggtcaccgtg attaaaaagc 7620
accaccgaca gctcctcggt catgtccgga gtcataatgt aagactcggt aaacacatca 7680
ggttgattca catcggtcag tgctaaaaag cgaccgaaat agcccggggg aatacatacc 7740
cgcaggcgta gagacaacat tacagccccc ataggaggta taacaaaatt aataggagag 7800
aaaaacacat aaacacctga aaaaccctcc tgcctaggca aaatagcacc ctcccgctcc 7860
agaacaacat acagcgcttc cacagcggca gccataacag tcagccttac cagtaaaaaa 7920
gaaaacctat taaaaaaaca ccactcgaca cggcaccagc tcaatcagtc acagtgtaaa 7980
aaagggccaa gtgcagagcg agtatatata ggactaaaaa atgacgtaac ggttaaagtc 8040
cacaaaaaac acccagaaaa ccgcacgcga acctacgccc agaaacgaaa gccaaaaaac 8100
ccacaacttc ctcaaatcgt cacttccgtt ttcccacgtt acgtcacttc ccattttaag 8160
aaaactacaa ttcccaacac atacaagtta ctccgccctt aattaaatcg gatccgatat 8220
ctagatgtat tcgcgaggta ccgagctcga attctctggc cgtcgtttta caacgtcgtg 8280
actgggaaaa ccctggcgtt acccaactta atcgccttgc agcacatccc cctttcgcca 8340
gctggcgtaa tagcgaagag gcccgcaccg atcgcccttc ccaacagttg cgcagcctga 8400
atggcgaatg gcgcctgatg cggtattttc tccttacgca tctgtgcggt atttcacacc 8460
gcatatggtg cactctcagt acaatctgct ctgatgccgc atagttaagc cagccccgac 8520
acccgccaac acccgctgac gcgccctgac gggcttgtct gctcccggca tccgcttaca 8580
gacaagctgt gaccgtctcc gggagctgca tgtgtcagag gttttcaccg tcatcaccga 8640
aacgcgcga 8649
<210> 52
<211> 9166
<212> DNA
<213> Artificial Sequence
<220>
<223> Description of Artificial Sequence: Synthetic polynucleotide
<400> 52
tgcagctctg gcccgtgtct caaaatctct gatgttacat tgcacaagat aaaaatatat 60
catcatgaac aataaaactg tctgcttaca taaacagtaa tacaaggggt gttatgagcc 120
atattcaacg ggaaacgtcg aggccgcgat taaattccaa catggatgct gatttatatg 180
ggtataaatg ggctcgcgat aatgtcgggc aatcaggtgc gacaatctat cgcttgtatg 240
ggaagcccga tgcgccagag ttgtttctga aacatggcaa aggtagcgtt gccaatgatg 300
ttacagatga gatggtcaga ctaaactggc tgacggaatt tatgcctctt ccgaccatca 360
agcattttat ccgtactcct gatgatgcat ggttactcac cactgcgatc cccggaaaaa 420
cagcattcca ggtattagaa gaatatcctg attcaggtga aaatattgtt gatgcgctgg 480
cagtgttcct gcgccggttg cattcgattc ctgtttgtaa ttgtcctttt aacagcgatc 540
gcgtatttcg tctcgctcag gcgcaatcac gaatgaataa cggtttggtt gatgcgagtg 600
attttgatga cgagcgtaat ggctggcctg ttgaacaagt ctggaaagaa atgcataaac 660
ttttgccatt ctcaccggat tcagtcgtca ctcatggtga tttctcactt gataacctta 720
tttttgacga ggggaaatta ataggttgta ttgatgttgg acgagtcgga atcgcagacc 780
gataccagga tcttgccatc ctatggaact gcctcggtga gttttctcct tcattacaga 840
aacggctttt tcaaaaatat ggtattgata atcctgatat gaataaattg cagtttcatt 900
tgatgctcga tgagtttttc taatcagaat tggttaattg gttgtaacat tattcagatt 960
gggcttgatt taaaacttca tttttaattt aaaaggatct aggtgaagat cctttttgat 1020
aatctcatga ccaaaatccc ttaacgtgag ttttcgttcc actgagcgtc agaccccgta 1080
gaaaagatca aaggatcttc ttgagatcct ttttttctgc gcgtaatctg ctgcttgcaa 1140
acaaaaaaac caccgctacc agcggtggtt tgtttgccgg atcaagagct accaactctt 1200
tttccgaagg taactggctt cagcagagcg cagataccaa atactgttct tctagtgtag 1260
ccgtagttag gccaccactt caagaactct gtagcaccgc ctacatacct cgctctgcta 1320
atcctgttac cagtggctgc tgccagtggc gataagtcgt gtcttaccgg gttggactca 1380
agacgatagt taccggataa ggcgcagcgg tcgggctgaa cggggggttc gtgcacacag 1440
cccagcttgg agcgaacgac ctacaccgaa ctgagatacc tacagcgtga gctatgagaa 1500
agcgccacgc ttcccgaagg gagaaaggcg gacaggtatc cggtaagcgg cagggtcgga 1560
acaggagagc gcacgaggga gcttccaggg ggaaacgcct ggtatcttta tagtcctgtc 1620
gggtttcgcc acctctgact tgagcgtcga tttttgtgat gctcgtcagg ggggcggagc 1680
ctatggaaaa acgccagcaa cgcggccttt ttacggttcc tggccttttg ctggcctttt 1740
gctcacatgt tctttcctgc gttatcccct gattctgtgg ataaccgtat taccgccttt 1800
gagtgagctg ataccgctcg ccgcagccga acgaccgagc gcagcgagtc agtgagcgag 1860
gaagcggaag agcgcccaat acgcaaaccg cctctccccg cgcgttggcc gattcattaa 1920
tgcagctggc acgacaggtt tcccgactgg aaagcgggca gtgagcgcaa cgcaattaat 1980
gtgagttagc tcactcatta ggcaccccag gctttacact ttatgcttcc ggctcgtatg 2040
ttgtgtggaa ttgtgagcgg ataacaattt cacacaggaa acagctatga ccatgattac 2100
accaagcttg catgcaggcc tctgcagtcg accagaagca ccatgtcctt gggtccggcc 2160
tgctgaatgc gcaggcggtc ggccatgccc caggcttcgt tttgacatcg gcgcaggtct 2220
ttgtagtagt cttgcatgag cctttctacc ggcacttctt cttctccttc ctcttgtcct 2280
gcatctcttg catctatcgc tgcggcggcg gcggagtttg gccgtaggtg gcgccctctt 2340
cctcccatgc gtgtgacccc gaagcccctc atcggctgaa gcagggctag gtcggcgaca 2400
acgcgctcgg ctaatatggc ctgctgcacc tgcgtgaggg tagactggaa gtcatccatg 2460
tccacaaagc ggtggtatgc gcccgtgttg atggtgtaag tgcagttggc cataacggac 2520
cagttaacgg tctggtgacc cggctgcgag agctcggtgt acctgagacg cgagtaagcc 2580
ctcgagtcaa atacgtagtc gttgcaagtc cgcaccaggt actggtatcc caccaaaaag 2640
tgcggcggcg gctggcggta gaggggccag cgtagggtgg ccggggctcc gggggcgaga 2700
tcttccaaca taaggcgatg atatccgtag atgtacctgg acatccaggt gatgccggcg 2760
gcggtggtgg aggcgcgcgg aaagtcgcgg acgcggttcc agatgttgcg cagcggcaaa 2820
aagtgctcca tggtcgggac gctctggccg gtcaggcgcg cgcaatcgtt gacgctctag 2880
cgtgcaaaag gagagcctgt aagcgggcac tcttccgtgg tctggtggat aaattcgcaa 2940
gggtatcatg gcggacgacc ggggttcgag ccccgtatcc ggccgtccgc cgtgatccat 3000
gcggttaccg cccgcgtgtc gaacccaggt gtgcgacgtc agacaacggg ggagtgctcc 3060
ttttggcttc cttccaggcg cggcggctgc tgcgctagct tttttggcca ctggccgcgc 3120
gcagcgtaag cggttaggct ggaaagcgaa agcattaagt ggctcgctcc ctgtagccgg 3180
agggttattt tccaagggtt gagtcgcggg acccccggtt cgagtctcgg accgagactg 3240
ggggcgtaca ctggatggcc tttgcctgga acccgcactc aaaaacatgc tacctctttg 3300
agccctttgg cttttctgac cagcgactca agcaggttta ccagtttgag tacgagtcac 3360
tcctgcgccg tagcgccatt gcttcttccc ccgaccgctg tataacgctg gaaaagtcca 3420
cccaaagcgt acaggggccc aactcggccg cctgtggact attctgctgc atgtttctcc 3480
acgcctttgc caactggccc caaactccca tggatcacaa ccccaccatg aaccttatta 3540
ccggggtacc caactccatg ctcaacagtc cccaggtaca gcccaccctg cgtcgcaacc 3600
aggaacagct ctacagcttc ctggagcgcc actcgcccta cttccgcagc cacagtgcgc 3660
agattaggag cgccacttct ttttgtcact tgaaaaacat gtaaaaataa tgtactagag 3720
acactttcaa taaaggcaaa tgcttttatt tgtacactct cgggtgatta tttaccccca 3780
cccttgccgt ctgcgccgtt taaaaatcaa aggggttctg ccgcgcatcg ctatgcgcca 3840
ctggcaggga cacgttgcga tactggtgtt tagtgctcca cttaaactca ggcacaacca 3900
tccgcggcag ctcggtgaag ttttcactcc acaggctgcg caccatcacc aacgcgttta 3960
gcaggtcggg cgccgatatc ttgaagtcgc agttggggcc tccgccctgc gcgcgcgagt 4020
tgcgatacac agggttgcag cactggaaca ctatcagcgc cgggtggtgc acgctggcca 4080
gcacgctctt gtcggagatc agatccgcgt ccaggtcctc cgcgttgctc agggcgaacg 4140
gagtcaactt tggtagctgc cttcccaaaa agggcgcgtg cccaggcttt gagttgcact 4200
cgcaccgtag tggcatcaaa aggtgaccgt gcccggtctg ggcgttagga tacagcgcct 4260
gcataaaagc cttgatctgc ttaaaagcca cctgagcctt tgcgccttca gagaagaaca 4320
tgccgcaaga cttgccggaa aactgattgg ccggacaggc cgcgtcgtgc acgcagcacc 4380
ttgcgtcggt gttggagatc tgcaccacat ttcggcccca ccggttcttc acgatcttgg 4440
ccttgctaga ctgctccttc agcgcgcgct gcccgttttc gctcgtcaca tccatttcaa 4500
tcacgtgctc cttatttatc ataatgcttc cgtgtagaca cttaagctcg ccttcgatct 4560
cagcgcagcg gtgcagccac aacgcgcagc ccgtgggctc gtgatgcttg taggtcacct 4620
ctgcaaacga ctgcaggtac gcctgcagga atcgccccat catcgtcaca aaggtcttgt 4680
tgctggtgaa ggtcagctgc aacccgcggt gctcctcgtt cagccaggtc ttgcatacgg 4740
ccgccagagc ttccacttgg tcaggcagta gtttgaagtt cgcctttaga tcgttatcca 4800
cgtggtactt gtccatcagc gcgcgcgcag cctccatgcc cttctcccac gcagacacga 4860
tcggcacact cagcgggttc atcaccgtaa tttcactttc cgcttcgctg ggctcttcct 4920
cttcctcttg cgtccgcata ccacgcgcca ctgggtcgtc ttcattcagc cgccgcactg 4980
tgcgcttacc tcctttgcca tgcttgatta gcaccggtgg gttgctgaaa cccaccattt 5040
gtagcgccac atcttctctt tcttcctcgc tgtccacgat tacctctggt gatggcgggc 5100
gctcgggctt gggagaaggg cgcttctttt tcttcttggg cgcaatggcc aaatccgccg 5160
ccgaggtcga tggccgcggg ctgggtgtgc gcggcaccag cgcgtcttgt gatgagtctt 5220
cctcgtcctc ggactcgata cgccgcctca tccgcttttt tgggggcgcc cggggaggcg 5280
gcggcgacgg ggacggggac gacacgtcct ccatggttgg gggacgtcgc gccgcaccgc 5340
gtccgcgctc gggggtggtt tcgcgctgct cctcttcccg actggccatt tccttctcct 5400
ataggcagaa aaagatccac aaaagcgaag atcagcttcg gcgcacgctg gaagacgcgg 5460
aggctctctt cagtaaatac tgcgcgctga ctcttaagga ctagtttcgc gccctttctc 5520
aaatttaagc gcgaaaacta cgtcatctcc agcggccaca cccggcgcca gcacctgttg 5580
tcagcgccat tggcgcgccc gcccgccgcg cgcttcgctt tttatagggc cgccgccgcc 5640
gccgcctcgc cataaaagga aactttcgga gcgcgccgct ctgattggct gccgccgcac 5700
ctctccgcct cgccccgccc cgcccctcgc cccgccccgc cccgcctggc gcgcgccccc 5760
cccccccccc cgcccccatc gctgcacaaa ataattaaaa aataaataaa tacaaaattg 5820
ggggtgggga ggggggggag atggggagag tgaagcagaa cgtggggctc acctcgaggc 5880
cggccgaata tcttcattta aatgtttaaa catcgatgcg gccgcaactt gtttattgca 5940
gcttataatg gttacaaata aagcaatagc atcacaaatt tcacaaataa agcatttttt 6000
tcactgcatt ctagttgtgg tttgtccaaa ctcatcaatg tatcttagct taacgggcgg 6060
cgaaggagaa gtccacgcct acatgggggt agagtcataa tcgtgcatca ggatagggcg 6120
gtggtgctgc agcagcgcgc gaataaactg ctgccgccgc cgctccgtcc tgcaggaata 6180
caacatggca gtggtctcct cagcgatgat tcgcaccgcc cgcagcataa ggcgccttgt 6240
cctccgggca cagcagcgca ccctgatctc acttaaatca gcacagtaac tgcagcacag 6300
caccacaata ttgttcaaaa tcccacagtg caaggcgctg tatccaaagc tcatggcggg 6360
gaccacagaa cccacgtggc catcatacca caagcgcagg tagattaagt ggcgacccct 6420
cataaacacg ctggacataa acattacctc ttttggcatg ttgtaattca ccacctcccg 6480
gtaccatata aacctctgat taaacatggc gccatccacc accatcctaa accagctggc 6540
caaaacctgc ccgccggcta tacactgcag ggaaccggga ctggaacaat gacagtggag 6600
agcccaggac tcgtaaccat ggatcatcat gctcgtcatg atatcaatgt tggcacaaca 6660
caggcacacg tgcatacact tcctcaggat tacaagctcc tcccgcgtta gaaccatatc 6720
ccagggaaca acccattcct gaatcagcgt aaatcccaca ctgcagggaa gacctcgcac 6780
gtaactcacg ttgtgcattg tcaaagtgtt acattcgggc agcagcggat gatcctccag 6840
tatggtagcg cgggtttctg tctcaaaagg aggtagacga tccctactgt acggagtgcg 6900
ccgagacaac cgagatcgtg ttggtcgtag tgtcatgcca aatggaacgc cggacgtagt 6960
catatttcct gaagcaaaac caggtgcggg cgtgacaaac agatctgcgt ctccggtctc 7020
gccgcttaga tcgctctgtg tagtagttgt agtatatcca ctctctcaaa gcatccaggc 7080
gccccctggc ttcgggttct atgtaaactc cttcatgcgc cgctgccctg ataacatcca 7140
ccaccgcaga ataagccaca cccagccaac ctacacattc gttctgcgag tcacacacgg 7200
gaggagcggg aagagctgga agaaccatgt tttttttttt attccaaaag attatccaaa 7260
acctcaaaat gaagatctat taagtgaacg cgctcccctc cggtggcgtg gtcaaactct 7320
acagccaaag aacagataat ggcatttgta agatgttgca caatggcttc caaaaggcaa 7380
acggccctca cgtccaagtg gacgtaaagg ctaaaccctt cagggtgaat ctcctctata 7440
aacattccag caccttcaac catgcccaaa taattctcat ctcgccacct tctcaatata 7500
tctctaagca aatcccgaat attaagtccg gccattgtaa aaatctgctc cagagcgccc 7560
tccaccttca gcctcaagca gcgaatcatg attgcaaaaa ttcaggttcc tcacagacct 7620
gtataagatt caaaagcgga acattaacaa aaataccgcg atcccgtagg tcccttcgca 7680
gggccagctg aacataatcg tgcaggtctg cacggaccag cgcggccact tccccgccag 7740
gaaccatgac aaaagaaccc acactgatta tgacacgcat actcggagct atgctaacca 7800
gcgtagcccc gatgtaagct tgttgcatgg gcggcgatat aaaatgcaag gtgctgctca 7860
aaaaatcagg caaagcctcg cgcaaaaaag aaagcacatc gtagtcatgc tcatgcagat 7920
aaaggcaggt aagctccgga accaccacag aaaaagacac catttttctc tcaaacatgt 7980
ctgcgggttt ctgcataaac acaaaataaa ataacaaaaa aacatttaaa cattagaagc 8040
ctgtcttaca acaggaaaaa caacccttat aagcataaga cggactacgg ccatgccggc 8100
gtgaccgtaa aaaaactggt caccgtgatt aaaaagcacc accgacagct cctcggtcat 8160
gtccggagtc ataatgtaag actcggtaaa cacatcaggt tgattcacat cggtcagtgc 8220
taaaaagcga ccgaaatagc ccgggggaat acatacccgc aggcgtagag acaacattac 8280
agcccccata ggaggtataa caaaattaat aggagagaaa aacacataaa cacctgaaaa 8340
accctcctgc ctaggcaaaa tagcaccctc ccgctccaga acaacataca gcgcttccac 8400
agcggcagcc ataacagtca gccttaccag taaaaaagaa aacctattaa aaaaacacca 8460
ctcgacacgg caccagctca atcagtcaca gtgtaaaaaa gggccaagtg cagagcgagt 8520
atatatagga ctaaaaaatg acgtaacggt taaagtccac aaaaaacacc cagaaaaccg 8580
cacgcgaacc tacgcccaga aacgaaagcc aaaaaaccca caacttcctc aaatcgtcac 8640
ttccgttttc ccacgttacg tcacttccca ttttaagaaa actacaattc ccaacacata 8700
caagttactc cgcccttaat taaatcggat ccgatatcta gatgtattcg cgaggtaccg 8760
agctcgaatt ctctggccgt cgttttacaa cgtcgtgact gggaaaaccc tggcgttacc 8820
caacttaatc gccttgcagc acatccccct ttcgccagct ggcgtaatag cgaagaggcc 8880
cgcaccgatc gcccttccca acagttgcgc agcctgaatg gcgaatggcg cctgatgcgg 8940
tattttctcc ttacgcatct gtgcggtatt tcacaccgca tatggtgcac tctcagtaca 9000
atctgctctg atgccgcata gttaagccag ccccgacacc cgccaacacc cgctgacgcg 9060
ccctgacggg cttgtctgct cccggcatcc gcttacagac aagctgtgac cgtctccggg 9120
agctgcatgt gtcagaggtt ttcaccgtca tcaccgaaac gcgcga 9166
<210> 53
<211> 8267
<212> DNA
<213> Artificial Sequence
<220>
<223> Description of Artificial Sequence: Synthetic polynucleotide
<400> 53
tgcagctctg gcccgtgtct caaaatctct gatgttacat tgcacaagat aaaaatatat 60
catcatgaac aataaaactg tctgcttaca taaacagtaa tacaaggggt gttatgagcc 120
atattcaacg ggaaacgtcg aggccgcgat taaattccaa catggatgct gatttatatg 180
ggtataaatg ggctcgcgat aatgtcgggc aatcaggtgc gacaatctat cgcttgtatg 240
ggaagcccga tgcgccagag ttgtttctga aacatggcaa aggtagcgtt gccaatgatg 300
ttacagatga gatggtcaga ctaaactggc tgacggaatt tatgcctctt ccgaccatca 360
agcattttat ccgtactcct gatgatgcat ggttactcac cactgcgatc cccggaaaaa 420
cagcattcca ggtattagaa gaatatcctg attcaggtga aaatattgtt gatgcgctgg 480
cagtgttcct gcgccggttg cattcgattc ctgtttgtaa ttgtcctttt aacagcgatc 540
gcgtatttcg tctcgctcag gcgcaatcac gaatgaataa cggtttggtt gatgcgagtg 600
attttgatga cgagcgtaat ggctggcctg ttgaacaagt ctggaaagaa atgcataaac 660
ttttgccatt ctcaccggat tcagtcgtca ctcatggtga tttctcactt gataacctta 720
tttttgacga ggggaaatta ataggttgta ttgatgttgg acgagtcgga atcgcagacc 780
gataccagga tcttgccatc ctatggaact gcctcggtga gttttctcct tcattacaga 840
aacggctttt tcaaaaatat ggtattgata atcctgatat gaataaattg cagtttcatt 900
tgatgctcga tgagtttttc taatcagaat tggttaattg gttgtaacat tattcagatt 960
gggcttgatt taaaacttca tttttaattt aaaaggatct aggtgaagat cctttttgat 1020
aatctcatga ccaaaatccc ttaacgtgag ttttcgttcc actgagcgtc agaccccgta 1080
gaaaagatca aaggatcttc ttgagatcct ttttttctgc gcgtaatctg ctgcttgcaa 1140
acaaaaaaac caccgctacc agcggtggtt tgtttgccgg atcaagagct accaactctt 1200
tttccgaagg taactggctt cagcagagcg cagataccaa atactgttct tctagtgtag 1260
ccgtagttag gccaccactt caagaactct gtagcaccgc ctacatacct cgctctgcta 1320
atcctgttac cagtggctgc tgccagtggc gataagtcgt gtcttaccgg gttggactca 1380
agacgatagt taccggataa ggcgcagcgg tcgggctgaa cggggggttc gtgcacacag 1440
cccagcttgg agcgaacgac ctacaccgaa ctgagatacc tacagcgtga gctatgagaa 1500
agcgccacgc ttcccgaagg gagaaaggcg gacaggtatc cggtaagcgg cagggtcgga 1560
acaggagagc gcacgaggga gcttccaggg ggaaacgcct ggtatcttta tagtcctgtc 1620
gggtttcgcc acctctgact tgagcgtcga tttttgtgat gctcgtcagg ggggcggagc 1680
ctatggaaaa acgccagcaa cgcggccttt ttacggttcc tggccttttg ctggcctttt 1740
gctcacatgt tctttcctgc gttatcccct gattctgtgg ataaccgtat taccgccttt 1800
gagtgagctg ataccgctcg ccgcagccga acgaccgagc gcagcgagtc agtgagcgag 1860
gaagcggaag agcgcccaat acgcaaaccg cctctccccg cgcgttggcc gattcattaa 1920
tgcagctggc acgacaggtt tcccgactgg aaagcgggca gtgagcgcaa cgcaattaat 1980
gtgagttagc tcactcatta ggcaccccag gctttacact ttatgcttcc ggctcgtatg 2040
ttgtgtggaa ttgtgagcgg ataacaattt cacacaggaa acagctatga ccatgattac 2100
accaagcttg catgcaggcc tatccgtaga tgtacctgga catccaggtg atgccggcgg 2160
cggtggtgga ggcgcgcgga aagtcgcgga cgcggttcca gatgttgcgc agcggcaaaa 2220
agtgctccat ggtcgggacg ctctggccgg tgaggcgtgc gcagtcgttg acgctctaga 2280
ccgtgcaaaa ggagagcctg taagcgggca ctcttccgtg gtctggtgga taaattcgca 2340
agggtatcat ggcggacgac cggggttcga accccggatc cggccgtccg ccgtgatcca 2400
tgcggttacc gcccgcgtgt cgaacccagg tgtgcgacgt cagacaacgg gggagcgctc 2460
cttttggctt ccttccaggc gcggcggctg ctgcgctagc ttttttggcc actggccgcg 2520
cgcggcgtaa gcggttaggc tggaaagcga aagcattaag tggctcgctc cctgtagccg 2580
gagggttatt ttccaagggt tgagtcgcag gacccccggt tcgagtctcg ggccggccgg 2640
actgcggcga acgggggttt gcctccccgt catgcaagac cccgcttgca aattcctccg 2700
gaaacaggga cgagcccctt ttttgctttt cccagatgca tccggtgctg cggcagatgc 2760
gcccccctcc tcagcagcgg caagagcaag agcagcggca gacatgcagg gcaccctccc 2820
cttctcctac cgcgtcagga ggggcaacat ctgtacactc tcgggtgatt atttaccccc 2880
acccttgccg tctgcgccgt ttaaaaatca aaggggttct gccgcgcatc gctatgcgcc 2940
actggcaggg acacgttgcg atactggtgt ttagtgctcc acttaaactc aggcacaacc 3000
atccgcggca gctcggtgaa gttttcactc cacaggctgc gcaccatcac caacgcgttt 3060
agcaggtcgg gcgccgatat cttgaagtcg cagttggggc ctccgccctg cgcgcgcgag 3120
ttgcgataca cagggttgca gcactggaac actatcagcg ccgggtggtg cacgctggcc 3180
agcacgctct tgtcggagat cagatccgcg tccaggtcct ccgcgttgct cagggcgaac 3240
ggagtcaact ttggtagctg ccttcccaaa aagggcgcgt gcccaggctt tgagttgcac 3300
tcgcaccgta gtggcatcaa aaggtgaccg tgcccggtct gggcgttagg atacagcgcc 3360
tgcataaaag ccttgatctg cttaaaagcc acctgagcct ttgcgccttc agagaagaac 3420
atgccgcaag acttgccgga aaactgattg gccggacagg ccgcgtcgtg cacgcagcac 3480
cttgcgtcgg tgttggagat ctgcaccaca tttcggcccc accggttctt cacgatcttg 3540
gccttgctag actgctcctt cagcgcgcgc tgcccgtttt cgctcgtcac atccatttca 3600
atcacgtgct ccttatttat cataatgctt ccgtgtagac acttaagctc gccttcgatc 3660
tcagcgcagc ggtgcagcca caacgcgcag cccgtgggct cgtgatgctt gtaggtcacc 3720
tctgcaaacg actgcaggta cgcctgcagg aatcgcccca tcatcgtcac aaaggtcttg 3780
ttgctggtga aggtcagctg caacccgcgg tgctcctcgt tcagccaggt cttgcatacg 3840
gccgccagag cttccacttg gtcaggcagt agtttgaagt tcgcctttag atcgttatcc 3900
acgtggtact tgtccatcag cgcgcgcgca gcctccatgc ccttctccca cgcagacacg 3960
atcggcacac tcagcgggtt catcaccgta atttcacttt ccgcttcgct gggctcttcc 4020
tcttcctctt gcgtccgcat accacgcgcc actgggtcgt cttcattcag ccgccgcact 4080
gtgcgcttac ctcctttgcc atgcttgatt agcaccggtg ggttgctgaa acccaccatt 4140
tgtagcgcca catcttctct ttcttcctcg ctgtccacga ttacctctgg tgatggcggg 4200
cgctcgggct tgggagaagg gcgcttcttt ttcttcttgg gcgcaatggc caaatccgcc 4260
gccgaggtcg atggccgcgg gctgggtgtg cgcggcacca gcgcgtcttg tgatgagtct 4320
tcctcgtcct cggactcgat acgccgcctc atccgctttt ttgggggcgc ccggggaggc 4380
ggcggcgacg gggacgggga cgacacgtcc tccatggttg ggggacgtcg cgccgcaccg 4440
cgtccgcgct cgggggtggt ttcgcgctgc tcctcttccc gactggccat ttccttctcc 4500
tataggcaga aaaagatcca caaaagcgaa gatcagcttc ggcgcacgct ggaagacgcg 4560
gaggctctct tcagtaaata ctgcgcgctg actcttaagg actagtttcg cgccctttct 4620
caaatttaag cgcgaaaact acgtcatctc cagcggccac acccggcgcc agcacctgtt 4680
gtcagcgcca ttggcgcgcc cgcccgccgc gcgcttcgct ttttataggg ccgccgccgc 4740
cgccgcctcg ccataaaagg aaactttcgg agcgcgccgc tctgattggc tgccgccgca 4800
cctctccgcc tcgccccgcc ccgcccctcg ccccgccccg ccccgcctgg cgcgcgcccc 4860
cccccccccc ccgcccccat cgctgcacaa aataattaaa aaataaataa atacaaaatt 4920
gggggtgggg agggggggga gatggggaga gtgaagcaga acgtggggct cacctcgagg 4980
ccggccgaat atcttcattt aaatgtttaa acatcgatgc ggccgcaact tgtttattgc 5040
agcttataat ggttacaaat aaagcaatag catcacaaat ttcacaaata aagcattttt 5100
ttcactgcat tctagttgtg gtttgtccaa actcatcaat gtatcttagc ttaacgggcg 5160
gcgaaggaga agtccacgcc tacatggggg tagagtcata atcgtgcatc aggatagggc 5220
ggtggtgctg cagcagcgcg cgaataaact gctgccgccg ccgctccgtc ctgcaggaat 5280
acaacatggc agtggtctcc tcagcgatga ttcgcaccgc ccgcagcata aggcgccttg 5340
tcctccgggc acagcagcgc accctgatct cacttaaatc agcacagtaa ctgcagcaca 5400
gcaccacaat attgttcaaa atcccacagt gcaaggcgct gtatccaaag ctcatggcgg 5460
ggaccacaga acccacgtgg ccatcatacc acaagcgcag gtagattaag tggcgacccc 5520
tcataaacac gctggacata aacattacct cttttggcat gttgtaattc accacctccc 5580
ggtaccatat aaacctctga ttaaacatgg cgccatccac caccatccta aaccagctgg 5640
ccaaaacctg cccgccggct atacactgca gggaaccggg actggaacaa tgacagtgga 5700
gagcccagga ctcgtaacca tggatcatca tgctcgtcat gatatcaatg ttggcacaac 5760
acaggcacac gtgcatacac ttcctcagga ttacaagctc ctcccgcgtt agaaccatat 5820
cccagggaac aacccattcc tgaatcagcg taaatcccac actgcaggga agacctcgca 5880
cgtaactcac gttgtgcatt gtcaaagtgt tacattcggg cagcagcgga tgatcctcca 5940
gtatggtagc gcgggtttct gtctcaaaag gaggtagacg atccctactg tacggagtgc 6000
gccgagacaa ccgagatcgt gttggtcgta gtgtcatgcc aaatggaacg ccggacgtag 6060
tcatatttcc tgaagcaaaa ccaggtgcgg gcgtgacaaa cagatctgcg tctccggtct 6120
cgccgcttag atcgctctgt gtagtagttg tagtatatcc actctctcaa agcatccagg 6180
cgccccctgg cttcgggttc tatgtaaact ccttcatgcg ccgctgccct gataacatcc 6240
accaccgcag aataagccac acccagccaa cctacacatt cgttctgcga gtcacacacg 6300
ggaggagcgg gaagagctgg aagaaccatg tttttttttt tattccaaaa gattatccaa 6360
aacctcaaaa tgaagatcta ttaagtgaac gcgctcccct ccggtggcgt ggtcaaactc 6420
tacagccaaa gaacagataa tggcatttgt aagatgttgc acaatggctt ccaaaaggca 6480
aacggccctc acgtccaagt ggacgtaaag gctaaaccct tcagggtgaa tctcctctat 6540
aaacattcca gcaccttcaa ccatgcccaa ataattctca tctcgccacc ttctcaatat 6600
atctctaagc aaatcccgaa tattaagtcc ggccattgta aaaatctgct ccagagcgcc 6660
ctccaccttc agcctcaagc agcgaatcat gattgcaaaa attcaggttc ctcacagacc 6720
tgtataagat tcaaaagcgg aacattaaca aaaataccgc gatcccgtag gtcccttcgc 6780
agggccagct gaacataatc gtgcaggtct gcacggacca gcgcggccac ttccccgcca 6840
ggaaccatga caaaagaacc cacactgatt atgacacgca tactcggagc tatgctaacc 6900
agcgtagccc cgatgtaagc ttgttgcatg ggcggcgata taaaatgcaa ggtgctgctc 6960
aaaaaatcag gcaaagcctc gcgcaaaaaa gaaagcacat cgtagtcatg ctcatgcaga 7020
taaaggcagg taagctccgg aaccaccaca gaaaaagaca ccatttttct ctcaaacatg 7080
tctgcgggtt tctgcataaa cacaaaataa aataacaaaa aaacatttaa acattagaag 7140
cctgtcttac aacaggaaaa acaaccctta taagcataag acggactacg gccatgccgg 7200
cgtgaccgta aaaaaactgg tcaccgtgat taaaaagcac caccgacagc tcctcggtca 7260
tgtccggagt cataatgtaa gactcggtaa acacatcagg ttgattcaca tcggtcagtg 7320
ctaaaaagcg accgaaatag cccgggggaa tacatacccg caggcgtaga gacaacatta 7380
cagcccccat aggaggtata acaaaattaa taggagagaa aaacacataa acacctgaaa 7440
aaccctcctg cctaggcaaa atagcaccct cccgctccag aacaacatac agcgcttcca 7500
cagcggcagc cataacagtc agccttacca gtaaaaaaga aaacctatta aaaaaacacc 7560
actcgacacg gcaccagctc aatcagtcac agtgtaaaaa agggccaagt gcagagcgag 7620
tatatatagg actaaaaaat gacgtaacgg ttaaagtcca caaaaaacac ccagaaaacc 7680
gcacgcgaac ctacgcccag aaacgaaagc caaaaaaccc acaacttcct caaatcgtca 7740
cttccgtttt cccacgttac gtcacttccc attttaagaa aactacaatt cccaacacat 7800
acaagttact ccgcccttaa ttaaatcgga tccgatatct agatgtattc gcgaggtacc 7860
gagctcgaat tctctggccg tcgttttaca acgtcgtgac tgggaaaacc ctggcgttac 7920
ccaacttaat cgccttgcag cacatccccc tttcgccagc tggcgtaata gcgaagaggc 7980
ccgcaccgat cgcccttccc aacagttgcg cagcctgaat ggcgaatggc gcctgatgcg 8040
gtattttctc cttacgcatc tgtgcggtat ttcacaccgc atatggtgca ctctcagtac 8100
aatctgctct gatgccgcat agttaagcca gccccgacac ccgccaacac ccgctgacgc 8160
gccctgacgg gcttgtctgc tcccggcatc cgcttacaga caagctgtga ccgtctccgg 8220
gagctgcatg tgtcagaggt tttcaccgtc atcaccgaaa cgcgcga 8267
<210> 54
<211> 9263
<212> DNA
<213> Artificial Sequence
<220>
<223> Description of Artificial Sequence: Synthetic polynucleotide
<400> 54
tgcagctctg gcccgtgtct caaaatctct gatgttacat tgcacaagat aaaaatatat 60
catcatgaac aataaaactg tctgcttaca taaacagtaa tacaaggggt gttatgagcc 120
atattcaacg ggaaacgtcg aggccgcgat taaattccaa catggatgct gatttatatg 180
ggtataaatg ggctcgcgat aatgtcgggc aatcaggtgc gacaatctat cgcttgtatg 240
ggaagcccga tgcgccagag ttgtttctga aacatggcaa aggtagcgtt gccaatgatg 300
ttacagatga gatggtcaga ctaaactggc tgacggaatt tatgcctctt ccgaccatca 360
agcattttat ccgtactcct gatgatgcat ggttactcac cactgcgatc cccggaaaaa 420
cagcattcca ggtattagaa gaatatcctg attcaggtga aaatattgtt gatgcgctgg 480
cagtgttcct gcgccggttg cattcgattc ctgtttgtaa ttgtcctttt aacagcgatc 540
gcgtatttcg tctcgctcag gcgcaatcac gaatgaataa cggtttggtt gatgcgagtg 600
attttgatga cgagcgtaat ggctggcctg ttgaacaagt ctggaaagaa atgcataaac 660
ttttgccatt ctcaccggat tcagtcgtca ctcatggtga tttctcactt gataacctta 720
tttttgacga ggggaaatta ataggttgta ttgatgttgg acgagtcgga atcgcagacc 780
gataccagga tcttgccatc ctatggaact gcctcggtga gttttctcct tcattacaga 840
aacggctttt tcaaaaatat ggtattgata atcctgatat gaataaattg cagtttcatt 900
tgatgctcga tgagtttttc taatcagaat tggttaattg gttgtaacat tattcagatt 960
gggcttgatt taaaacttca tttttaattt aaaaggatct aggtgaagat cctttttgat 1020
aatctcatga ccaaaatccc ttaacgtgag ttttcgttcc actgagcgtc agaccccgta 1080
gaaaagatca aaggatcttc ttgagatcct ttttttctgc gcgtaatctg ctgcttgcaa 1140
acaaaaaaac caccgctacc agcggtggtt tgtttgccgg atcaagagct accaactctt 1200
tttccgaagg taactggctt cagcagagcg cagataccaa atactgttct tctagtgtag 1260
ccgtagttag gccaccactt caagaactct gtagcaccgc ctacatacct cgctctgcta 1320
atcctgttac cagtggctgc tgccagtggc gataagtcgt gtcttaccgg gttggactca 1380
agacgatagt taccggataa ggcgcagcgg tcgggctgaa cggggggttc gtgcacacag 1440
cccagcttgg agcgaacgac ctacaccgaa ctgagatacc tacagcgtga gctatgagaa 1500
agcgccacgc ttcccgaagg gagaaaggcg gacaggtatc cggtaagcgg cagggtcgga 1560
acaggagagc gcacgaggga gcttccaggg ggaaacgcct ggtatcttta tagtcctgtc 1620
gggtttcgcc acctctgact tgagcgtcga tttttgtgat gctcgtcagg ggggcggagc 1680
ctatggaaaa acgccagcaa cgcggccttt ttacggttcc tggccttttg ctggcctttt 1740
gctcacatgt tctttcctgc gttatcccct gattctgtgg ataaccgtat taccgccttt 1800
gagtgagctg ataccgctcg ccgcagccga acgaccgagc gcagcgagtc agtgagcgag 1860
gaagcggaag agcgcccaat acgcaaaccg cctctccccg cgcgttggcc gattcattaa 1920
tgcagctggc acgacaggtt tcccgactgg aaagcgggca gtgagcgcaa cgcaattaat 1980
gtgagttagc tcactcatta ggcaccccag gctttacact ttatgcttcc ggctcgtatg 2040
ttgtgtggaa ttgtgagcgg ataacaattt cacacaggaa acagctatga ccatgattac 2100
accaagcttg catgcaggcc tctgcagtcg accagaagca ccatgtcctt gggtccggcc 2160
tgctgaatgc gcaggcggtc ggccatgccc caggcttcgt tttgacatcg gcgcaggtct 2220
ttgtagtagt cttgcatgag cctttctacc ggcacttctt cttctccttc ctcttgtcct 2280
gcatctcttg catctatcgc tgcggcggcg gcggagtttg gccgtaggtg gcgccctctt 2340
cctcccatgc gtgtgacccc gaagcccctc atcggctgaa gcagggctag gtcggcgaca 2400
acgcgctcgg ctaatatggc ctgctgcacc tgcgtgaggg tagactggaa gtcatccatg 2460
tccacaaagc ggtggtatgc gcccgtgttg atggtgtaag tgcagttggc cataacggac 2520
cagttaacgg tctggtgacc cggctgcgag agctcggtgt acctgagacg cgagtaagcc 2580
ctcgagtcaa atacgtagtc gttgcaagtc cgcaccaggt actggtatcc caccaaaaag 2640
tgcggcggcg gctggcggta gaggggccag cgtagggtgg ccggggctcc gggggcgaga 2700
tcttccaaca taaggcgatg atatccgtag atgtacctgg acatccaggt gatgccggcg 2760
gcggtggtgg aggcgcgcgg aaagtcgcgg acgcggttcc agatgttgcg cagcggcaaa 2820
aagtgctcca tggtcgggac gctctggccg gtcaggcgcg cgcaatcgtt gacgctctag 2880
cgtgcaaaag gagagcctgt aagcgggcac tcttccgtgg tctggtggat aaattcgcaa 2940
gggtatcatg gcggacgacc ggggttcgag ccccgtatcc ggccgtccgc cgtgatccat 3000
gcggttaccg cccgcgtgtc gaacccaggt gtgcgacgtc agacaacggg ggagtgctcc 3060
ttttggcttc cttccaggcg cggcggctgc tgcgctagct tttttggcca ctggccgcgc 3120
gcagcgtaag cggttaggct ggaaagcgaa agcattaagt ggctcgctcc ctgtagccgg 3180
agggttattt tccaagggtt gagtcgcggg acccccggtt cgagtctcgg accgagactg 3240
ggggcgtaca ctggatggcc tttgcctgga acccgcactc aaaaacatgc tacctctttg 3300
agccctttgg cttttctgac cagcgactca agcaggttta ccagtttgag tacgagtcac 3360
tcctgcgccg tagcgccatt gcttcttccc ccgaccgctg tataacgctg gaaaagtcca 3420
cccaaagcgt acaggggccc aactcggccg cctgtggact attctgctgc atgtttctcc 3480
acgcctttgc caactggccc caaactccca tggatcacaa ccccaccatg aaccttatta 3540
ccggggtacc caactccatg ctcaacagtc cccaggtaca gcccaccctg cgtcgcaacc 3600
aggaacagct ctacagcttc ctggagcgcc actcgcccta cttccgcagc cacagtgcgc 3660
agattaggag cgccacttct ttttgtcact tgaaaaacat gtaaaaataa tgtactagag 3720
acactttcaa taaaggcaaa tgcttttatt tgtacactct cgggtgatta tttaccccca 3780
cccttgccgt ctgcgccgtt taaaaatcaa aggggttctg ccgcgcatcg ctatgcgcca 3840
ctggcaggga cacgttgcga tactggtgtt tagtgctcca cttaaactca ggcacaacca 3900
tccgcggcag ctcggtgaag ttttcactcc acaggctgcg caccatcacc aacgcgttta 3960
gcaggtcggg cgccgatatc ttgaagtcgc agttggggcc tccgccctgc gcgcgcgagt 4020
tgcgatacac agggttgcag cactggaaca ctatcagcgc cgggtggtgc acgctggcca 4080
gcacgctctt gtcggagatc agatccgcgt ccaggtcctc cgcgttgctc agggcgaacg 4140
gagtcaactt tggtagctgc cttcccaaaa agggcgcgtg cccaggcttt gagttgcact 4200
cgcaccgtag tggcatcaaa aggtgaccgt gcccggtctg ggcgttagga tacagcgcct 4260
gcataaaagc cttgatctgc ttaaaagcca cctgagcctt tgcgccttca gagaagaaca 4320
tgccgcaaga cttgccggaa aactgattgg ccggacaggc cgcgtcgtgc acgcagcacc 4380
ttgcgtcggt gttggagatc tgcaccacat ttcggcccca ccggttcttc acgatcttgg 4440
ccttgctaga ctgctccttc agcgcgcgct gcccgttttc gctcgtcaca tccatttcaa 4500
tcacgtgctc cttatttatc ataatgcttc cgtgtagaca cttaagctcg ccttcgatct 4560
cagcgcagcg gtgcagccac aacgcgcagc ccgtgggctc gtgatgcttg taggtcacct 4620
ctgcaaacga ctgcaggtac gcctgcagga atcgccccat catcgtcaca aaggtcttgt 4680
tgctggtgaa ggtcagctgc aacccgcggt gctcctcgtt cagccaggtc ttgcatacgg 4740
ccgccagagc ttccacttgg tcaggcagta gtttgaagtt cgcctttaga tcgttatcca 4800
cgtggtactt gtccatcagc gcgcgcgcag cctccatgcc cttctcccac gcagacacga 4860
tcggcacact cagcgggttc atcaccgtaa tttcactttc cgcttcgctg ggctcttcct 4920
cttcctcttg cgtccgcata ccacgcgcca ctgggtcgtc ttcattcagc cgccgcactg 4980
tgcgcttacc tcctttgcca tgcttgatta gcaccggtgg gttgctgaaa cccaccattt 5040
gtagcgccac atcttctctt tcttcctcgc tgtccacgat tacctctggt gatggcgggc 5100
gctcgggctt gggagaaggg cgcttctttt tcttcttggg cgcaatggcc aaatccgccg 5160
ccgaggtcga tggccgcggg ctgggtgtgc gcggcaccag cgcgtcttgt gatgagtctt 5220
cctcgtcctc ggactcgata cgccgcctca tccgcttttt tgggggcgcc cggggaggcg 5280
gcggcgacgg ggacggggac gacacgtcct ccatggttgg gggacgtcgc gccgcaccgc 5340
gtccgcgctc gggggtggtt tcgcgctgct cctcttcccg actggccatt tccttctcct 5400
ataggcagaa aaagatccac aaaagcgaag atcagcttcg gcgcacgctg gaagacgcgg 5460
aggctctctt cagtaaatac tgcgcgctga ctcttaagga ctagtttcgc gccctttctc 5520
aaatttaagc gcgaaaacta cgtcatctcc agcggccaca cccggcgcca gcacctgttg 5580
tcagcgccat tggcgcgccc gcccgccgcg cgcttcgctt tttatagggc cgccgccgcc 5640
gccgcctcgc cataaaagga aactttcgga gcgcgccgct ctgattggct gccgccgcac 5700
ctctccgcct cgccccgccc cgcccctcgc cccgccccgc cccgcctggc gcgcgccccc 5760
cccccccccc cgcccccatc gctgcacaaa ataattaaaa aataaataaa tacaaaattg 5820
ggggtgggga ggggggggag atggggagag tgaagcagaa cgtggggctc acctcgaggc 5880
cggccgaata tcttcattta aatgtttaaa catcgatgcg gccgcaactt gtttattgca 5940
gcttataatg gttacaaata aagcaatagc atcacaaatt tcacaaataa agcatttttt 6000
tcactgcatt ctagttgtgg tttgtccaaa ctcatcaatg tatcttagct taacgggcgg 6060
cgaaggagaa gtccacgcct acatgggggt agagtcataa tcgtgcatca ggatagggcg 6120
gtggtgctgc agcagcgcgc gaataaactg ctgccgccgc cgctccgtcc tgcaggaata 6180
caacatggca gtggtctcct cagcgatgat tcgcaccgcc cgcagcataa ggcgccttgt 6240
cctccgggca cagcagcgca ccctgatctc acttaaatca gcacagtaac tgcagcacag 6300
caccacaata ttgttcaaaa tcccacagtg caaggcgctg tatccaaagc tcatggcggg 6360
gaccacagaa cccacgtggc catcatacca caagcgcagg tagattaagt ggcgacccct 6420
cataaacacg ctggacataa acattacctc ttttggcatg ttgtaattca ccacctcccg 6480
gtaccatata aacctctgat taaacatggc gccatccacc accatcctaa accagctggc 6540
caaaacctgc ccgccggcta tacactgcag ggaaccggga ctggaacaat gacagtggag 6600
agcccaggac tcgtaaccat ggatcatcat gctcgtcatg atatcaatgt tggcacaaca 6660
caggcacacg tgcatacact tcctcaggat tacaagctcc tcccgcgtta gaaccatatc 6720
ccagggaaca acccattcct gaatcagcgt aaatcccaca ctgcagggaa gacctcgcac 6780
gtaactcacg ttgtgcattg tcaaagtgtt acattcgggc agcagcggat gatcctccag 6840
tatggtagcg cgggtttctg tctcaaaagg aggtagacga tccctactgt acggagtgcg 6900
ccgagacaac cgagatcgtg ttggtcgtag tgtcatgcca aatggaacgc cggacgtagt 6960
catatttcct gaagcaaaac caggtgcggg cgtgacaaac agatctgcgt ctccggtctc 7020
gccgcttaga tcgctctgtg tagtagttgt agtatatcca ctctctcaaa gcatccaggc 7080
gccccctggc ttcgggttct atgtaaactc cttcatgcgc cgctgccctg ataacatcca 7140
ccaccgcaga ataagccaca cccagccaac ctacacattc gttctgcgag tcacacacgg 7200
gaggagcggg aagagctgga agaaccatgt tttttttttt attccaaaag attatccaaa 7260
acctcaaaat gaagatctat taagtgaacg cgctcccctc cggtggcgtg gtcaaactct 7320
acagccaaag aacagataat ggcatttgta agatgttgca caatggcttc caaaaggcaa 7380
acggccctca cgtccaagtg gacgtaaagg ctaaaccctt cagggtgaat ctcctctata 7440
aacattccag caccttcaac catgcccaaa taattctcat ctcgccacct tctcaatata 7500
tctctaagca aatcccgaat attaagtccg gccattgtaa aaatctgctc cagagcgccc 7560
tccaccttca gcctcaagca gcgaatcatg attgcaaaaa ttcaggttcc tcacagacct 7620
gtataagatt caaaagcgga acattaacaa aaataccgcg atcccgtagg tcccttcgca 7680
gggccagctg aacataatcg tgcaggtctg cacggaccag cgcggccact tccccgccag 7740
gaaccatgac aaaagaaccc acactgatta tgacacgcat actcggagct atgctaacca 7800
gcgtagcccc gatgtaagct tgttgcatgg gcggcgatat aaaatgcaag gtgctgctca 7860
aaaaatcagg caaagcctcg cgcaaaaaag aaagcacatc gtagtcatgc tcatgcagat 7920
aaaggcaggt aagctccgga accaccacag aaaaagacac catttttctc tcaaacatgt 7980
ctgcgggttt ctgcataaac acaaaataaa ataacaaaaa aacatttaaa cattagaagc 8040
ctgtcttaca acaggaaaaa caacccttat aagcataaga cggactacgg ccatgccggc 8100
gtgaccgtaa aaaaactggt caccgtgatt aaaaagcacc accgacagct cctcggtcat 8160
gtccggagtc ataatgtaag actcggtaaa cacatcaggt tgattcacat cggtcagtgc 8220
taaaaagcga ccgaaatagc ccgggggaat acatacccgc aggcgtagag acaacattac 8280
agcccccata ggaggtataa caaaattaat aggagagaaa aacacataaa cacctgaaaa 8340
accctcctgc ctaggcaaaa tagcaccctc ccgctccaga acaacataca gcgcttccac 8400
agcggcagcc atggtggcat ttgcaaaagc ctaggcctcc aaaaaagcct cctcactact 8460
tctggaatag ctcagaggcc gaggcggcct cggcctctgc ataaataaaa aaaattagtc 8520
agccatgggg cggagaatgg gcggaactgg gcggagttag gggcgggatg ggcggagtta 8580
ggggcgggac tatggttgct gactaattga gatgcatgct ttgcatactt ctgcctgctg 8640
gggagcctgg ggactttcca cacctggttg ctgactaatt gagatgcatg ctttgcatac 8700
ttctgcctgc tggggagcct ggggactttc cacaccctaa ctgacacaca cgttacgtca 8760
cttcccattt taagaaaact acaattccca acacatacaa gttactccgc ccttaattaa 8820
atcggatccg atatctagat gtattcgcga ggtaccgagc tcgaattctc tggccgtcgt 8880
tttacaacgt cgtgactggg aaaaccctgg cgttacccaa cttaatcgcc ttgcagcaca 8940
tccccctttc gccagctggc gtaatagcga agaggcccgc accgatcgcc cttcccaaca 9000
gttgcgcagc ctgaatggcg aatggcgcct gatgcggtat tttctcctta cgcatctgtg 9060
cggtatttca caccgcatat ggtgcactct cagtacaatc tgctctgatg ccgcatagtt 9120
aagccagccc cgacacccgc caacacccgc tgacgcgccc tgacgggctt gtctgctccc 9180
ggcatccgct tacagacaag ctgtgaccgt ctccgggagc tgcatgtgtc agaggttttc 9240
accgtcatca ccgaaacgcg cga 9263
<210> 55
<211> 8499
<212> DNA
<213> Artificial Sequence
<220>
<223> Description of Artificial Sequence: Synthetic polynucleotide
<400> 55
tgcagctctg gcccgtgtct caaaatctct gatgttacat tgcacaagat aaaaatatat 60
catcatgaac aataaaactg tctgcttaca taaacagtaa tacaaggggt gttatgagcc 120
atattcaacg ggaaacgtcg aggccgcgat taaattccaa catggatgct gatttatatg 180
ggtataaatg ggctcgcgat aatgtcgggc aatcaggtgc gacaatctat cgcttgtatg 240
ggaagcccga tgcgccagag ttgtttctga aacatggcaa aggtagcgtt gccaatgatg 300
ttacagatga gatggtcaga ctaaactggc tgacggaatt tatgcctctt ccgaccatca 360
agcattttat ccgtactcct gatgatgcat ggttactcac cactgcgatc cccggaaaaa 420
cagcattcca ggtattagaa gaatatcctg attcaggtga aaatattgtt gatgcgctgg 480
cagtgttcct gcgccggttg cattcgattc ctgtttgtaa ttgtcctttt aacagcgatc 540
gcgtatttcg tctcgctcag gcgcaatcac gaatgaataa cggtttggtt gatgcgagtg 600
attttgatga cgagcgtaat ggctggcctg ttgaacaagt ctggaaagaa atgcataaac 660
ttttgccatt ctcaccggat tcagtcgtca ctcatggtga tttctcactt gataacctta 720
tttttgacga ggggaaatta ataggttgta ttgatgttgg acgagtcgga atcgcagacc 780
gataccagga tcttgccatc ctatggaact gcctcggtga gttttctcct tcattacaga 840
aacggctttt tcaaaaatat ggtattgata atcctgatat gaataaattg cagtttcatt 900
tgatgctcga tgagtttttc taatcagaat tggttaattg gttgtaacat tattcagatt 960
gggcttgatt taaaacttca tttttaattt aaaaggatct aggtgaagat cctttttgat 1020
aatctcatga ccaaaatccc ttaacgtgag ttttcgttcc actgagcgtc agaccccgta 1080
gaaaagatca aaggatcttc ttgagatcct ttttttctgc gcgtaatctg ctgcttgcaa 1140
acaaaaaaac caccgctacc agcggtggtt tgtttgccgg atcaagagct accaactctt 1200
tttccgaagg taactggctt cagcagagcg cagataccaa atactgttct tctagtgtag 1260
ccgtagttag gccaccactt caagaactct gtagcaccgc ctacatacct cgctctgcta 1320
atcctgttac cagtggctgc tgccagtggc gataagtcgt gtcttaccgg gttggactca 1380
agacgatagt taccggataa ggcgcagcgg tcgggctgaa cggggggttc gtgcacacag 1440
cccagcttgg agcgaacgac ctacaccgaa ctgagatacc tacagcgtga gctatgagaa 1500
agcgccacgc ttcccgaagg gagaaaggcg gacaggtatc cggtaagcgg cagggtcgga 1560
acaggagagc gcacgaggga gcttccaggg ggaaacgcct ggtatcttta tagtcctgtc 1620
gggtttcgcc acctctgact tgagcgtcga tttttgtgat gctcgtcagg ggggcggagc 1680
ctatggaaaa acgccagcaa cgcggccttt ttacggttcc tggccttttg ctggcctttt 1740
gctcacatgt tctttcctgc gttatcccct gattctgtgg ataaccgtat taccgccttt 1800
gagtgagctg ataccgctcg ccgcagccga acgaccgagc gcagcgagtc agtgagcgag 1860
gaagcggaag agcgcccaat acgcaaaccg cctctccccg cgcgttggcc gattcattaa 1920
tgcagctggc acgacaggtt tcccgactgg aaagcgggca gtgagcgcaa cgcaattaat 1980
gtgagttagc tcactcatta ggcaccccag gctttacact ttatgcttcc ggctcgtatg 2040
ttgtgtggaa ttgtgagcgg ataacaattt cacacaggaa acagctatga ccatgattac 2100
accaagcttg catgcaggcc tatccgtaga tgtacctgga catccaggtg atgccggcgg 2160
cggtggtgga ggcgcgcgga aagtcgcgga cgcggttcca gatgttgcgc agcggcaaaa 2220
agtgctccat ggtcgggacg ctctggccgg tgaggcgtgc gcagtcgttg acgctctaga 2280
ccgtgcaaaa ggagagcctg taagcgggca ctcttccgtg gtctggtgga taaattcgca 2340
agggtatcat ggcggacgac cggggttcga accccggatc cggccgtccg ccgtgatcca 2400
tgcggttacc gcccgcgtgt cgaacccagg tgtgcgacgt cagacaacgg gggagcgctc 2460
cttttggctt ccttccaggc gcggcggctg ctgcgctagc ttttttggcc actggccgcg 2520
cgcggcgtaa gcggttaggc tggaaagcga aagcattaag tggctcgctc cctgtagccg 2580
gagggttatt ttccaagggt tgagtcgcag gacccccggt tcgagtctcg ggccggccgg 2640
actgcggcga acgggggttt gcctccccgt catgcaagac cccgcttgca aattcctccg 2700
gaaacaggga cgagcccctt ttttgctttt cccagatgca tccggtgctg cggcagatgc 2760
gcccccctcc tcagcagcgg caagagcaag agcagcggca gacatgcagg gcaccctccc 2820
cttctcctac cgcgtcagga ggggcaacat cgatccagac atgataagat acattgatga 2880
gtttggacaa accacaacta gaatgcagtg aaaaaaatgc tttatttgtg aaatttgtga 2940
tgctattgct ttatttgtaa ccattataag ctgcaataaa caagtttgta cactctcggg 3000
tgattattta cccccaccct tgccgtctgc gccgtttaaa aatcaaaggg gttctgccgc 3060
gcatcgctat gcgccactgg cagggacacg ttgcgatact ggtgtttagt gctccactta 3120
aactcaggca caaccatccg cggcagctcg gtgaagtttt cactccacag gctgcgcacc 3180
atcaccaacg cgtttagcag gtcgggcgcc gatatcttga agtcgcagtt ggggcctccg 3240
ccctgcgcgc gcgagttgcg atacacaggg ttgcagcact ggaacactat cagcgccggg 3300
tggtgcacgc tggccagcac gctcttgtcg gagatcagat ccgcgtccag gtcctccgcg 3360
ttgctcaggg cgaacggagt caactttggt agctgccttc ccaaaaaggg cgcgtgccca 3420
ggctttgagt tgcactcgca ccgtagtggc atcaaaaggt gaccgtgccc ggtctgggcg 3480
ttaggataca gcgcctgcat aaaagccttg atctgcttaa aagccacctg agcctttgcg 3540
ccttcagaga agaacatgcc gcaagacttg ccggaaaact gattggccgg acaggccgcg 3600
tcgtgcacgc agcaccttgc gtcggtgttg gagatctgca ccacatttcg gccccaccgg 3660
ttcttcacga tcttggcctt gctagactgc tccttcagcg cgcgctgccc gttttcgctc 3720
gtcacatcca tttcaatcac gtgctcctta tttatcataa tgcttccgtg tagacactta 3780
agctcgcctt cgatctcagc gcagcggtgc agccacaacg cgcagcccgt gggctcgtga 3840
tgcttgtagg tcacctctgc aaacgactgc aggtacgcct gcaggaatcg ccccatcatc 3900
gtcacaaagg tcttgttgct ggtgaaggtc agctgcaacc cgcggtgctc ctcgttcagc 3960
caggtcttgc atacggccgc cagagcttcc acttggtcag gcagtagttt gaagttcgcc 4020
tttagatcgt tatccacgtg gtacttgtcc atcagcgcgc gcgcagcctc catgcccttc 4080
tcccacgcag acacgatcgg cacactcagc gggttcatca ccgtaatttc actttccgct 4140
tcgctgggct cttcctcttc ctcttgcgtc cgcataccac gcgccactgg gtcgtcttca 4200
ttcagccgcc gcactgtgcg cttacctcct ttgccatgct tgattagcac cggtgggttg 4260
ctgaaaccca ccatttgtag cgccacatct tctctttctt cctcgctgtc cacgattacc 4320
tctggtgatg gcgggcgctc gggcttggga gaagggcgct tctttttctt cttgggcgca 4380
atggccaaat ccgccgccga ggtcgatggc cgcgggctgg gtgtgcgcgg caccagcgcg 4440
tcttgtgatg agtcttcctc gtcctcggac tcgatacgcc gcctcatccg cttttttggg 4500
ggcgcccggg gaggcggcgg cgacggggac ggggacgaca cgtcctccat ggttggggga 4560
cgtcgcgccg caccgcgtcc gcgctcgggg gtggtttcgc gctgctcctc ttcccgactg 4620
gccatttcct tctcctatag gcagaaaaag atccacaaaa gcgaagatca gcttcggcgc 4680
acgctggaag acgcggaggc tctcttcagt aaatactgcg cgctgactct taaggactag 4740
tttcgcgccc tttctcaaat ttaagcgcga aaactacgtc atctccagcg gccacacccg 4800
gcgccagcac ctgttgtcag cgccattggc gcgcccgccc gccgcgcgct tcgcttttta 4860
tagggccgcc gccgccgccg cctcgccata aaaggaaact ttcggagcgc gccgctctga 4920
ttggctgccg ccgcacctct ccgcctcgcc ccgccccgcc cctcgccccg ccccgccccg 4980
cctggcgcgc gccccccccc cccccccgcc cccatcgctg cacaaaataa ttaaaaaata 5040
aataaataca aaattggggg tggggagggg ggggagatgg ggagagtgaa gcagaacgtg 5100
gggctcacct cgaggccggc cgaatatctt catttaaatg tttaaacatc gatgcggccg 5160
caacttgttt attgcagctt ataatggtta caaataaagc aatagcatca caaatttcac 5220
aaataaagca tttttttcac tgcattctag ttgtggtttg tccaaactca tcaatgtatc 5280
ttagcttaac gggcggcgaa ggagaagtcc acgcctacat gggggtagag tcataatcgt 5340
gcatcaggat agggcggtgg tgctgcagca gcgcgcgaat aaactgctgc cgccgccgct 5400
ccgtcctgca ggaatacaac atggcagtgg tctcctcagc gatgattcgc accgcccgca 5460
gcataaggcg ccttgtcctc cgggcacagc agcgcaccct gatctcactt aaatcagcac 5520
agtaactgca gcacagcacc acaatattgt tcaaaatccc acagtgcaag gcgctgtatc 5580
caaagctcat ggcggggacc acagaaccca cgtggccatc ataccacaag cgcaggtaga 5640
ttaagtggcg acccctcata aacacgctgg acataaacat tacctctttt ggcatgttgt 5700
aattcaccac ctcccggtac catataaacc tctgattaaa catggcgcca tccaccacca 5760
tcctaaacca gctggccaaa acctgcccgc cggctataca ctgcagggaa ccgggactgg 5820
aacaatgaca gtggagagcc caggactcgt aaccatggat catcatgctc gtcatgatat 5880
caatgttggc acaacacagg cacacgtgca tacacttcct caggattaca agctcctccc 5940
gcgttagaac catatcccag ggaacaaccc attcctgaat cagcgtaaat cccacactgc 6000
agggaagacc tcgcacgtaa ctcacgttgt gcattgtcaa agtgttacat tcgggcagca 6060
gcggatgatc ctccagtatg gtagcgcggg tttctgtctc aaaaggaggt agacgatccc 6120
tactgtacgg agtgcgccga gacaaccgag atcgtgttgg tcgtagtgtc atgccaaatg 6180
gaacgccgga cgtagtcata tttcctgaag caaaaccagg tgcgggcgtg acaaacagat 6240
ctgcgtctcc ggtctcgccg cttagatcgc tctgtgtagt agttgtagta tatccactct 6300
ctcaaagcat ccaggcgccc cctggcttcg ggttctatgt aaactccttc atgcgccgct 6360
gccctgataa catccaccac cgcagaataa gccacaccca gccaacctac acattcgttc 6420
tgcgagtcac acacgggagg agcgggaaga gctggaagaa ccatgttttt ttttttattc 6480
caaaagatta tccaaaacct caaaatgaag atctattaag tgaacgcgct cccctccggt 6540
ggcgtggtca aactctacag ccaaagaaca gataatggca tttgtaagat gttgcacaat 6600
ggcttccaaa aggcaaacgg ccctcacgtc caagtggacg taaaggctaa acccttcagg 6660
gtgaatctcc tctataaaca ttccagcacc ttcaaccatg cccaaataat tctcatctcg 6720
ccaccttctc aatatatctc taagcaaatc ccgaatatta agtccggcca ttgtaaaaat 6780
ctgctccaga gcgccctcca ccttcagcct caagcagcga atcatgattg caaaaattca 6840
ggttcctcac agacctgtat aagattcaaa agcggaacat taacaaaaat accgcgatcc 6900
cgtaggtccc ttcgcagggc cagctgaaca taatcgtgca ggtctgcacg gaccagcgcg 6960
gccacttccc cgccaggaac catgacaaaa gaacccacac tgattatgac acgcatactc 7020
ggagctatgc taaccagcgt agccccgatg taagcttgtt gcatgggcgg cgatataaaa 7080
tgcaaggtgc tgctcaaaaa atcaggcaaa gcctcgcgca aaaaagaaag cacatcgtag 7140
tcatgctcat gcagataaag gcaggtaagc tccggaacca ccacagaaaa agacaccatt 7200
tttctctcaa acatgtctgc gggtttctgc ataaacacaa aataaaataa caaaaaaaca 7260
tttaaacatt agaagcctgt cttacaacag gaaaaacaac ccttataagc ataagacgga 7320
ctacggccat gccggcgtga ccgtaaaaaa actggtcacc gtgattaaaa agcaccaccg 7380
acagctcctc ggtcatgtcc ggagtcataa tgtaagactc ggtaaacaca tcaggttgat 7440
tcacatcggt cagtgctaaa aagcgaccga aatagcccgg gggaatacat acccgcaggc 7500
gtagagacaa cattacagcc cccataggag gtataacaaa attaatagga gagaaaaaca 7560
cataaacacc tgaaaaaccc tcctgcctag gcaaaatagc accctcccgc tccagaacaa 7620
catacagcgc ttccacagcg gcagccatgg tggcatttgc aaaagcctag gcctccaaaa 7680
aagcctcctc actacttctg gaatagctca gaggccgagg cggcctcggc ctctgcataa 7740
ataaaaaaaa ttagtcagcc atggggcgga gaatgggcgg aactgggcgg agttaggggc 7800
gggatgggcg gagttagggg cgggactatg gttgctgact aattgagatg catgctttgc 7860
atacttctgc ctgctgggga gcctggggac tttccacacc tggttgctga ctaattgaga 7920
tgcatgcttt gcatacttct gcctgctggg gagcctgggg actttccaca ccctaactga 7980
cacacacgtt acgtcacttc ccattttaag aaaactacaa ttcccaacac atacaagtta 8040
ctccgccctt aattaaatcg gatccgatat ctagatgtat tcgcgaggta ccgagctcga 8100
attctctggc cgtcgtttta caacgtcgtg actgggaaaa ccctggcgtt acccaactta 8160
atcgccttgc agcacatccc cctttcgcca gctggcgtaa tagcgaagag gcccgcaccg 8220
atcgcccttc ccaacagttg cgcagcctga atggcgaatg gcgcctgatg cggtattttc 8280
tccttacgca tctgtgcggt atttcacacc gcatatggtg cactctcagt acaatctgct 8340
ctgatgccgc atagttaagc cagccccgac acccgccaac acccgctgac gcgccctgac 8400
gggcttgtct gctcccggca tccgcttaca gacaagctgt gaccgtctcc gggagctgca 8460
tgtgtcagag gttttcaccg tcatcaccga aacgcgcga 8499
<210> 56
<211> 7315
<212> DNA
<213> Artificial Sequence
<220>
<223> Description of Artificial Sequence: Synthetic polynucleotide
<400> 56
tgcagctctg gcccgtgtct caaaatctct gatgttacat tgcacaagat aaaaatatat 60
catcatgaac aataaaactg tctgcttaca taaacagtaa tacaaggggt gttatgagcc 120
atattcaacg ggaaacgtcg aggccgcgat taaattccaa catggatgct gatttatatg 180
ggtataaatg ggctcgcgat aatgtcgggc aatcaggtgc gacaatctat cgcttgtatg 240
ggaagcccga tgcgccagag ttgtttctga aacatggcaa aggtagcgtt gccaatgatg 300
ttacagatga gatggtcaga ctaaactggc tgacggaatt tatgcctctt ccgaccatca 360
agcattttat ccgtactcct gatgatgcat ggttactcac cactgcgatc cccggaaaaa 420
cagcattcca ggtattagaa gaatatcctg attcaggtga aaatattgtt gatgcgctgg 480
cagtgttcct gcgccggttg cattcgattc ctgtttgtaa ttgtcctttt aacagcgatc 540
gcgtatttcg tctcgctcag gcgcaatcac gaatgaataa cggtttggtt gatgcgagtg 600
attttgatga cgagcgtaat ggctggcctg ttgaacaagt ctggaaagaa atgcataaac 660
ttttgccatt ctcaccggat tcagtcgtca ctcatggtga tttctcactt gataacctta 720
tttttgacga ggggaaatta ataggttgta ttgatgttgg acgagtcgga atcgcagacc 780
gataccagga tcttgccatc ctatggaact gcctcggtga gttttctcct tcattacaga 840
aacggctttt tcaaaaatat ggtattgata atcctgatat gaataaattg cagtttcatt 900
tgatgctcga tgagtttttc taatcagaat tggttaattg gttgtaacat tattcagatt 960
gggcttgatt taaaacttca tttttaattt aaaaggatct aggtgaagat cctttttgat 1020
aatctcatga ccaaaatccc ttaacgtgag ttttcgttcc actgagcgtc agaccccgta 1080
gaaaagatca aaggatcttc ttgagatcct ttttttctgc gcgtaatctg ctgcttgcaa 1140
acaaaaaaac caccgctacc agcggtggtt tgtttgccgg atcaagagct accaactctt 1200
tttccgaagg taactggctt cagcagagcg cagataccaa atactgttct tctagtgtag 1260
ccgtagttag gccaccactt caagaactct gtagcaccgc ctacatacct cgctctgcta 1320
atcctgttac cagtggctgc tgccagtggc gataagtcgt gtcttaccgg gttggactca 1380
agacgatagt taccggataa ggcgcagcgg tcgggctgaa cggggggttc gtgcacacag 1440
cccagcttgg agcgaacgac ctacaccgaa ctgagatacc tacagcgtga gctatgagaa 1500
agcgccacgc ttcccgaagg gagaaaggcg gacaggtatc cggtaagcgg cagggtcgga 1560
acaggagagc gcacgaggga gcttccaggg ggaaacgcct ggtatcttta tagtcctgtc 1620
gggtttcgcc acctctgact tgagcgtcga tttttgtgat gctcgtcagg ggggcggagc 1680
ctatggaaaa acgccagcaa cgcggccttt ttacggttcc tggccttttg ctggcctttt 1740
gctcacatgt tctttcctgc gttatcccct gattctgtgg ataaccgtat taccgccttt 1800
gagtgagctg ataccgctcg ccgcagccga acgaccgagc gcagcgagtc agtgagcgag 1860
gaagcggaag agcgcccaat acgcaaaccg cctctccccg cgcgttggcc gattcattaa 1920
tgcagctggc acgacaggtt tcccgactgg aaagcgggca gtgagcgcaa cgcaattaat 1980
gtgagttagc tcactcatta ggcaccccag gctttacact ttatgcttcc ggctcgtatg 2040
ttgtgtggaa ttgtgagcgg ataacaattt cacacaggaa acagctatga ccatgattac 2100
accaagcttg catgcaggcc tatccgtaga tgtacctgga catccaggtg atgccggcgg 2160
cggtggtgga ggcgcgcgga aagtcgcgga cgcggttcca gatgttgcgc agcggcaaaa 2220
agtgctccat ggtcgggacg ctctggccgg tgaggcgtgc gcagtcgttg acgctctaga 2280
ccgtgcaaaa ggagagcctg taagcgggca ctcttccgtg gtctggtgga taaattcgca 2340
agggtatcat ggcggacgac cggggttcga accccggatc cggccgtccg ccgtgatcca 2400
tgcggttacc gcccgcgtgt cgaacccagg tgtgcgacgt cagacaacgg gggagcgctc 2460
cttttggctt ccttccaggc gcggcggctg ctgcgctagc ttttttggcc actggccgcg 2520
cgcggcgtaa gcggttaggc tggaaagcga aagcattaag tggctcgctc cctgtagccg 2580
gagggttatt ttccaagggt tgagtcgcag gacccccggt tcgagtctcg ggccggccgg 2640
actgcggcga acgggggttt gcctccccgt catgcaagac cccgcttgca aattcctccg 2700
gaaacaggga cgagcccctt ttttgctttt cccagatgca tccggtgctg cggcagatgc 2760
gcccccctcc tcagcagcgg caagagcaag agcagcggca gacatgcagg gcaccctccc 2820
cttctcctac cgcgtcagga ggggcaacat cgatccagac atgataagat acattgatga 2880
gtttggacaa accacaacta gaatgcagtg aaaaaaatgc tttatttgtg aaatttgtga 2940
tgctattgct ttatttgtaa ccattataag ctgcaataaa caagtttgta cactctcggg 3000
tgattattta cccccaccct tgccgtctgc gccgtttaaa aatcaaaggg gttctgccgc 3060
gcatcgctat gcgccactgg cagggacacg ttgcgatact ggtgtttagt gctccactta 3120
aactcaggca caaccatccg cggcagctcg gtgaagtttt cactccacag gctgcgcacc 3180
atcaccaacg cgtttagcag gtcgggcgcc gatatcttga agtcgcagtt ggggcctccg 3240
ccctgcgcgc gcgagttgcg atacacaggg ttgcagcact ggaacactat cagcgccggg 3300
tggtgcacgc tggccagcac gctcttgtcg gagatcagat ccgcgtccag gtcctccgcg 3360
ttgctcaggg cgaacggagt caactttggt agctgccttc ccaaaaaggg cgcgtgccca 3420
ggctttgagt tgcactcgca ccgtagtggc atcaaaaggt gaccgtgccc ggtctgggcg 3480
ttaggataca gcgcctgcat aaaagccttg atctgcttaa aagccacctg agcctttgcg 3540
ccttcagaga agaacatgcc gcaagacttg ccggaaaact gattggccgg acaggccgcg 3600
tcgtgcacgc agcaccttgc gtcggtgttg gagatctgca ccacatttcg gccccaccgg 3660
ttcttcacga tcttggcctt gctagactgc tccttcagcg cgcgctgccc gttttcgctc 3720
gtcacatcca tttcaatcac gtgctcctta tttatcataa tgcttccgtg tagacactta 3780
agctcgcctt cgatctcagc gcagcggtgc agccacaacg cgcagcccgt gggctcgtga 3840
tgcttgtagg tcacctctgc aaacgactgc aggtacgcct gcaggaatcg ccccatcatc 3900
gtcacaaagg tcttgttgct ggtgaaggtc agctgcaacc cgcggtgctc ctcgttcagc 3960
caggtcttgc atacggccgc cagagcttcc acttggtcag gcagtagttt gaagttcgcc 4020
tttagatcgt tatccacgtg gtacttgtcc atcagcgcgc gcgcagcctc catgcccttc 4080
tcccacgcag acacgatcgg cacactcagc gggttcatca ccgtaatttc actttccgct 4140
tcgctgggct cttcctcttc ctcttgcgtc cgcataccac gcgccactgg gtcgtcttca 4200
ttcagccgcc gcactgtgcg cttacctcct ttgccatgct tgattagcac cggtgggttg 4260
ctgaaaccca ccatttgtag cgccacatct tctctttctt cctcgctgtc cacgattacc 4320
tctggtgatg gcgggcgctc gggcttggga gaagggcgct tctttttctt cttgggcgca 4380
atggccaaat ccgccgccga ggtcgatggc cgcgggctgg gtgtgcgcgg caccagcgcg 4440
tcttgtgatg agtcttcctc gtcctcggac tcgatacgcc gcctcatccg cttttttggg 4500
ggcgcccggg gaggcggcgg cgacggggac ggggacgaca cgtcctccat ggttggggga 4560
cgtcgcgccg caccgcgtcc gcgctcgggg gtggtttcgc gctgctcctc ttcccgactg 4620
gccatttcct tctcctatag gcagaaaaag atccacaaaa gcgaagatca gcttcggcgc 4680
acgctggaag acgcggaggc tctcttcagt aaatactgcg cgctgactct taaggactag 4740
tttcgcgccc tttctcaaat ttaagcgcga aaactacgtc atctccagcg gccacacccg 4800
gcgccagcac ctgttgtcag cgccattggc gcgcccgccc gccgcgcgct tcgcttttta 4860
tagggccgcc gccgccgccg cctcgccata aaaggaaact ttcggagcgc gccgctctga 4920
ttggctgccg ccgcacctct ccgcctcgcc ccgccccgcc cctcgccccg ccccgccccg 4980
cctggcgcgc gccccccccc cccccccgcc cccatcgctg cacaaaataa ttaaaaaata 5040
aataaataca aaattggggg tggggagggg ggggagatgg ggagagtgaa gcagaacgtg 5100
gggctcacct cgaggccggc cgaatatctt catttaaatg tttaaacatc gatgcggccg 5160
caacttgttt attgcagctt ataatggtta caaataaagc aatagcatca caaatttcac 5220
aaataaagca tttttttcac tgcattctag ttgtggtttg tccaaactca tcaatgtatc 5280
ttagcttaac gggcggcgaa ggagaagtcc acgcctacat gggggtagag tcataatcgt 5340
gcatcaggat agggcggtgg tgctgcagca gcgcgcgaat aaactgctgc cgccgccgct 5400
ccgtcctgca ggaatacaac atggcagtgg tctcctcagc gatgattcgc accgcccgca 5460
gcataaggcg ccttgtcctc cgggcacagc agcgcaccct gatctcactt aaatcagcac 5520
agtaactgca gcacagcacc acaatattgt tcaaaatccc acagtgcaag gcgctgtatc 5580
caaagctcat ggcggggacc acagaaccca cgtggccatc ataccacaag cgcaggtaga 5640
ttaagtggcg acccctcata aacacgctgg acataaacat tacctctttt ggcatgttgt 5700
aattcaccac ctcccggtac catataaacc tctgattaaa catggcgcca tccaccacca 5760
tcctaaacca gctggccaaa acctgcccgc cggctataca ctgcagggaa ccgggactgg 5820
aacaatgaca gtggagagcc caggactcgt aaccatggat catcatgctc gtcatgatat 5880
caatgttggc acaacacagg cacacgtgca tacacttcct caggattaca agctcctccc 5940
gcgttagaac catatcccag ggaacaaccc attcctgaat cagcgtaaat cccacactgc 6000
agggaagacc tcgcacgtaa ctcacgttgt gcattgtcaa agtgttacat tcgggcagca 6060
gcggatgatc ctccagtatg gtagcgcggg tttctgtctc aaaaggaggt agacgatccc 6120
tactgtacgg agtgcgccga gacaaccgag atcgtgttgg tcgtagtgtc atgccaaatg 6180
gaacgccgga cgtagtcata tttcctgaag caaaaccagg tgcgggcgtg acaaacagat 6240
ctgcgtctcc ggtctcgccg cttagatcgc tctgtgtagt agttgtagta tatccactct 6300
ctcaaagcat ccaggcgccc cctggcttcg ggttctatgt aaactccttc atgcgccgct 6360
gccctgataa catccaccac cgcagaataa gccacaccca gccaacctac acattcgttc 6420
tgcgagtcac acacgggagg agcgggaaga gctggaagaa ccatggtggc atttgcaaaa 6480
gcctaggcct ccaaaaaagc ctcctcacta cttctggaat agctcagagg ccgaggcggc 6540
ctcggcctct gcataaataa aaaaaattag tcagccatgg ggcggagaat gggcggaact 6600
gggcggagtt aggggcggga tgggcggagt taggggcggg actatggttg ctgactaatt 6660
gagatgcatg ctttgcatac ttctgcctgc tggggagcct ggggactttc cacacctggt 6720
tgctgactaa ttgagatgca tgctttgcat acttctgcct gctggggagc ctggggactt 6780
tccacaccct aactgacaca cacgttacgt cacttcccat tttaagaaaa ctacaattcc 6840
caacacatac aagttactcc gcccttaatt aaatcggatc cgatatctag atgtattcgc 6900
gaggtaccga gctcgaattc tctggccgtc gttttacaac gtcgtgactg ggaaaaccct 6960
ggcgttaccc aacttaatcg ccttgcagca catccccctt tcgccagctg gcgtaatagc 7020
gaagaggccc gcaccgatcg cccttcccaa cagttgcgca gcctgaatgg cgaatggcgc 7080
ctgatgcggt attttctcct tacgcatctg tgcggtattt cacaccgcat atggtgcact 7140
ctcagtacaa tctgctctga tgccgcatag ttaagccagc cccgacaccc gccaacaccc 7200
gctgacgcgc cctgacgggc ttgtctgctc ccggcatccg cttacagaca agctgtgacc 7260
gtctccggga gctgcatgtg tcagaggttt tcaccgtcat caccgaaacg cgcga 7315
<210> 57
<211> 14977
<212> DNA
<213> Artificial Sequence
<220>
<223> Description of Artificial Sequence: Synthetic polynucleotide
<400> 57
tgcagctctg gcccgtgtct caaaatctct gatgttacat tgcacaagat aaaaatatat 60
catcatgaac aataaaactg tctgcttaca taaacagtaa tacaaggggt gttatgagcc 120
atattcaacg ggaaacgtcg aggccgcgat taaattccaa catggatgct gatttatatg 180
ggtataaatg ggctcgcgat aatgtcgggc aatcaggtgc gacaatctat cgcttgtatg 240
ggaagcccga tgcgccagag ttgtttctga aacatggcaa aggtagcgtt gccaatgatg 300
ttacagatga gatggtcaga ctaaactggc tgacggaatt tatgcctctt ccgaccatca 360
agcattttat ccgtactcct gatgatgcat ggttactcac cactgcgatc cccggaaaaa 420
cagcattcca ggtattagaa gaatatcctg attcaggtga aaatattgtt gatgcgctgg 480
cagtgttcct gcgccggttg cattcgattc ctgtttgtaa ttgtcctttt aacagcgatc 540
gcgtatttcg tctcgctcag gcgcaatcac gaatgaataa cggtttggtt gatgcgagtg 600
attttgatga cgagcgtaat ggctggcctg ttgaacaagt ctggaaagaa atgcataaac 660
ttttgccatt ctcaccggat tcagtcgtca ctcatggtga tttctcactt gataacctta 720
tttttgacga ggggaaatta ataggttgta ttgatgttgg acgagtcgga atcgcagacc 780
gataccagga tcttgccatc ctatggaact gcctcggtga gttttctcct tcattacaga 840
aacggctttt tcaaaaatat ggtattgata atcctgatat gaataaattg cagtttcatt 900
tgatgctcga tgagtttttc taatcagaat tggttaattg gttgtaacat tattcagatt 960
gggcttgatt taaaacttca tttttaattt aaaaggatct aggtgaagat cctttttgat 1020
aatctcatga ccaaaatccc ttaacgtgag ttttcgttcc actgagcgtc agaccccgta 1080
gaaaagatca aaggatcttc ttgagatcct ttttttctgc gcgtaatctg ctgcttgcaa 1140
acaaaaaaac caccgctacc agcggtggtt tgtttgccgg atcaagagct accaactctt 1200
tttccgaagg taactggctt cagcagagcg cagataccaa atactgttct tctagtgtag 1260
ccgtagttag gccaccactt caagaactct gtagcaccgc ctacatacct cgctctgcta 1320
atcctgttac cagtggctgc tgccagtggc gataagtcgt gtcttaccgg gttggactca 1380
agacgatagt taccggataa ggcgcagcgg tcgggctgaa cggggggttc gtgcacacag 1440
cccagcttgg agcgaacgac ctacaccgaa ctgagatacc tacagcgtga gctatgagaa 1500
agcgccacgc ttcccgaagg gagaaaggcg gacaggtatc cggtaagcgg cagggtcgga 1560
acaggagagc gcacgaggga gcttccaggg ggaaacgcct ggtatcttta tagtcctgtc 1620
gggtttcgcc acctctgact tgagcgtcga tttttgtgat gctcgtcagg ggggcggagc 1680
ctatggaaaa acgccagcaa cgcggccttt ttacggttcc tggccttttg ctggcctttt 1740
gctcacatgt tctttcctgc gttatcccct gattctgtgg ataaccgtat taccgccttt 1800
gagtgagctg ataccgctcg ccgcagccga acgaccgagc gcagcgagtc agtgagcgag 1860
gaagcggaag agcgcccaat acgcaaaccg cctctccccg cgcgttggcc gattcattaa 1920
tgcagctggc acgacaggtt tcccgactgg aaagcgggca gtgagcgcaa cgcaattaat 1980
gtgagttagc tcactcatta ggcaccccag gctttacact ttatgcttcc ggctcgtatg 2040
ttgtgtggaa ttgtgagcgg ataacaattt cacacaggaa acagctatga ccatgattac 2100
accaagcttg catgcaggcc tctgcagtcg accagaagca ccatgtcctt gggtccggcc 2160
tgctgaatgc gcaggcggtc ggccatgccc caggcttcgt tttgacatcg gcgcaggtct 2220
ttgtagtagt cttgcatgag cctttctacc ggcacttctt cttctccttc ctcttgtcct 2280
gcatctcttg catctatcgc tgcggcggcg gcggagtttg gccgtaggtg gcgccctctt 2340
cctcccatgc gtgtgacccc gaagcccctc atcggctgaa gcagggctag gtcggcgaca 2400
acgcgctcgg ctaatatggc ctgctgcacc tgcgtgaggg tagactggaa gtcatccatg 2460
tccacaaagc ggtggtatgc gcccgtgttg atggtgtaag tgcagttggc cataacggac 2520
cagttaacgg tctggtgacc cggctgcgag agctcggtgt acctgagacg cgagtaagcc 2580
ctcgagtcaa atacgtagtc gttgcaagtc cgcaccaggt actggtatcc caccaaaaag 2640
tgcggcggcg gctggcggta gaggggccag cgtagggtgg ccggggctcc gggggcgaga 2700
tcttccaaca taaggcgatg atatccgtag atgtacctgg acatccaggt gatgccggcg 2760
gcggtggtgg aggcgcgcgg aaagtcgcgg acgcggttcc agatgttgcg cagcggcaaa 2820
aagtgctcca tggtcgggac gctctggccg gtcaggcgcg cgcaatcgtt gacgctctag 2880
cgtgcaaaag gagagcctgt aagcgggcac tcttccgtgg tctggtggat aaattcgcaa 2940
gggtatcatg gcggacgacc ggggttcgag ccccgtatcc ggccgtccgc cgtgatccat 3000
gcggttaccg cccgcgtgtc gaacccaggt gtgcgacgtc agacaacggg ggagtgctcc 3060
ttttggcttc cttccaggcg cggcggctgc tgcgctagct tttttggcca ctggccgcgc 3120
gcagcgtaag cggttaggct ggaaagcgaa agcattaagt ggctcgctcc ctgtagccgg 3180
agggttattt tccaagggtt gagtcgcggg acccccggtt cgagtctcgg accgagactg 3240
ggggcgtaca ctggatggcc tttgcctgga acccgcactc aaaaacatgc tacctctttg 3300
agccctttgg cttttctgac cagcgactca agcaggttta ccagtttgag tacgagtcac 3360
tcctgcgccg tagcgccatt gcttcttccc ccgaccgctg tataacgctg gaaaagtcca 3420
cccaaagcgt acaggggccc aactcggccg cctgtggact attctgctgc atgtttctcc 3480
acgcctttgc caactggccc caaactccca tggatcacaa ccccaccatg aaccttatta 3540
ccggggtacc caactccatg ctcaacagtc cccaggtaca gcccaccctg cgtcgcaacc 3600
aggaacagct ctacagcttc ctggagcgcc actcgcccta cttccgcagc cacagtgcgc 3660
agattaggag cgccacttct ttttgtcact tgaaaaacat gtaaaaataa tgtactagag 3720
acactttcaa taaaggcaaa tgcttttatt tgtacactct cgggtgatta tttaccccca 3780
cccttgccgt ctgcgccgtt taaaaatcaa aggggttctg ccgcgcatcg ctatgcgcca 3840
ctggcaggga cacgttgcga tactggtgtt tagtgctcca cttaaactca ggcacaacca 3900
tccgcggcag ctcggtgaag ttttcactcc acaggctgcg caccatcacc aacgcgttta 3960
gcaggtcggg cgccgatatc ttgaagtcgc agttggggcc tccgccctgc gcgcgcgagt 4020
tgcgatacac agggttgcag cactggaaca ctatcagcgc cgggtggtgc acgctggcca 4080
gcacgctctt gtcggagatc agatccgcgt ccaggtcctc cgcgttgctc agggcgaacg 4140
gagtcaactt tggtagctgc cttcccaaaa agggcgcgtg cccaggcttt gagttgcact 4200
cgcaccgtag tggcatcaaa aggtgaccgt gcccggtctg ggcgttagga tacagcgcct 4260
gcataaaagc cttgatctgc ttaaaagcca cctgagcctt tgcgccttca gagaagaaca 4320
tgccgcaaga cttgccggaa aactgattgg ccggacaggc cgcgtcgtgc acgcagcacc 4380
ttgcgtcggt gttggagatc tgcaccacat ttcggcccca ccggttcttc acgatcttgg 4440
ccttgctaga ctgctccttc agcgcgcgct gcccgttttc gctcgtcaca tccatttcaa 4500
tcacgtgctc cttatttatc ataatgcttc cgtgtagaca cttaagctcg ccttcgatct 4560
cagcgcagcg gtgcagccac aacgcgcagc ccgtgggctc gtgatgcttg taggtcacct 4620
ctgcaaacga ctgcaggtac gcctgcagga atcgccccat catcgtcaca aaggtcttgt 4680
tgctggtgaa ggtcagctgc aacccgcggt gctcctcgtt cagccaggtc ttgcatacgg 4740
ccgccagagc ttccacttgg tcaggcagta gtttgaagtt cgcctttaga tcgttatcca 4800
cgtggtactt gtccatcagc gcgcgcgcag cctccatgcc cttctcccac gcagacacga 4860
tcggcacact cagcgggttc atcaccgtaa tttcactttc cgcttcgctg ggctcttcct 4920
cttcctcttg cgtccgcata ccacgcgcca ctgggtcgtc ttcattcagc cgccgcactg 4980
tgcgcttacc tcctttgcca tgcttgatta gcaccggtgg gttgctgaaa cccaccattt 5040
gtagcgccac atcttctctt tcttcctcgc tgtccacgat tacctctggt gatggcgggc 5100
gctcgggctt gggagaaggg cgcttctttt tcttcttggg cgcaatggcc aaatccgccg 5160
ccgaggtcga tggccgcggg ctgggtgtgc gcggcaccag cgcgtcttgt gatgagtctt 5220
cctcgtcctc ggactcgata cgccgcctca tccgcttttt tgggggcgcc cggggaggcg 5280
gcggcgacgg ggacggggac gacacgtcct ccatggttgg gggacgtcgc gccgcaccgc 5340
gtccgcgctc gggggtggtt tcgcgctgct cctcttcccg actggccatt tccttctcct 5400
ataggcagaa aaagatccac aaaagcgaag atcagcttcg gcgcacgctg gaagacgcgg 5460
aggctctctt cagtaaatac tgcgcgctga ctcttaagga ctagtttcgc gccctttctc 5520
aaatttaagc gcgaaaacta cgtcatctcc agcggccaca cccggcgcca gcacctgttg 5580
tcagcgccat tggcgcgccc gcccgccgcg cgcttcgctt tttatagggc cgccgccgcc 5640
gccgcctcgc cataaaagga aactttcgga gcgcgccgct ctgattggct gccgccgcac 5700
ctctccgcct cgccccgccc cgcccctcgc cccgccccgc cccgcctggc gcgcgccccc 5760
cccccccccc cgcccccatc gctgcacaaa ataattaaaa aataaataaa tacaaaattg 5820
ggggtgggga ggggggggag atggggagag tgaagcagaa cgtggggctc acctcgaggc 5880
cggccgaata tcttcattta aatgggcaga gcgcacatcg cccacagtcc ccgagaagtt 5940
ggggggaggg gtcggcaatt gaaccggtgc ctagagaagg tggcgcgggg taaactggga 6000
aagtgatgtc gtgtactggc tccgcctttt tcccgagggt gggggagaac cgtatataag 6060
tgcagtagtc gccgtgaacg ttctttttcg caacgggttt gccgccagaa cacagcaccg 6120
cgggcccgat ccaccggtac tgttggtaaa gccaccatgt tttccggtgg cggcggcccg 6180
ctgtcccccg gaggaaagtc ggcggccagg gcggcgtccg ggttttttgc gcccgccggc 6240
cctcgcggag ccagccgggg acccccgcct tgtttgaggc aaaactttta caacccctac 6300
ctcgccccag tcgggacgca acagaagccg accgggccaa cccagcgcca tacgtactat 6360
agcgaatgcg atgaatttcg attcatcgcc ccgcgggtgc tggacgagga tgcccccccg 6420
gagaagcgcg ccggggtgca cgacggtcac ctcaagcgcg cccccaaggt gtactgcggg 6480
ggggacgagc gcgacgtcct ccgcgtcggg tcgggcggct tctggccgcg gcgctcgcgc 6540
ctgtggggcg gcgtggacca cgccccggcg gggttcaacc ccaccgtcac cgtctttcac 6600
gtgtacgaca tcctggagaa cgtggagcac gcgtacggca tgcgcgcggc ccagttccac 6660
gcgcggttta tggacgccat cacaccgacg gggaccgtca tcacgctcct gggcctgact 6720
ccggaaggcc accgggtggc cgttcacgtt tacggcacgc ggcagtactt ttacatgaac 6780
aaggaggagg tcgacaggca cctacaatgc cgcgccccac gagatctctg cgagcgcatg 6840
gccgcggccc tgcgcgagtc cccgggcgcg tcgttccgcg gcatctccgc ggaccacttc 6900
gaggcggagg tggtggagcg caccgacgtg tactactacg agacgcgccc cgctctgttt 6960
taccgcgtct acgtccgaag cgggcgcgtg ctgtcgtacc tgtgcgacaa cttctgcccg 7020
gccatcaaga agtacgaggg tggggtcgac gccaccaccc ggttcatcct ggacaacccc 7080
gggttcgtca ccttcggctg gtaccgtctc aaaccgggcc ggaacaacac gctagcccag 7140
ccgcgggccc cgatggcctt cgggacatcc agcgacgtcg agtttaactg tacggcggac 7200
aacctggcca tcgagggggg catgagcgac ctaccggcat acaagctcat gtgcttcgat 7260
atcgaatgca aggcgggggg ggaggacgag ctggcctttc cggtggccgg gcacccggag 7320
gacctggtca tccagatatc ctgtctgctc tacgacctgt ccaccaccgc cctggagcac 7380
gtcctcctgt tttcgctcgg ttcctgcgac ctccccgaat cccacctgaa cgagctggcg 7440
gccaggggcc tgcccacgcc cgtggttctg gaattcgaca gcgaattcga gatgctgttg 7500
gccttcatga cccttgtgaa acagtacggc cccgagttcg tgaccgggta caacatcatc 7560
aacttcgact ggcccttctt gctggccaag ctgacggaca tttacaaggt ccccctggac 7620
gggtacggcc gcatgaacgg ccggggcgtg tttcgcgtgt gggacatagg ccagagccac 7680
ttccagaagc gcagcaagat aaaggtgaac ggcatggtga acatcgacat gtacgggatt 7740
ataaccgaca agatcaagct ctcgagctac aagctcaacg ccgtggccga agccgtcctg 7800
aaggacaaga agaaggacct gagctatcgc gacatccccg cctactacgc cgccgggccc 7860
gcgcaacgcg gggtgatcgg cgagtactgc atacaggatt ccctgctggt gggccagctg 7920
ttttttaagt ttttgcccca tctggagctc tcggccgtcg cgcgcttggc gggtattaac 7980
atcacccgca ccatctacga cggccagcag atccgcgtct ttacgtgcct gctgcgcctg 8040
gccgaccaga agggctttat tctgccggac acccaggggc gatttagggg cgccgggggg 8100
gaggcgccca agcgtccggc cgcagcccgg gaggacgagg agcggccaga ggaggagggg 8160
gaggacgagg acgaacgcga ggagggcggg ggcgagcggg agccggaggg cgcgcgggag 8220
accgccggca ggcacgtggg gtaccagggg gccagggtcc ttgaccccac ttccgggttt 8280
cacgtgaacc ccgtggtggt gttcgacttt gccagcctgt accccagcat catccaggcc 8340
cacaacctgt gcttcagcac gctctccctg agggccgacg cagtggcgca cctggaggcg 8400
ggcaaggact acctggagat cgaggtgggg gggcgacggc tgttcttcgt caaggctcac 8460
gtgcgagaga gcctcctcag catcctcctg cgggactggc tcgccatgcg aaagcagatc 8520
cgctcgcgga ttccccagag cagccccgag gaggccgtgc tcctggacaa gcagcaggcc 8580
gccatcaagg tcgtgtgtaa ctcggtgtac gggttcacgg gagtgcagca cggactcctg 8640
ccgtgcctgc acgttgccgc gacggtgacg accatcggcc gcgagatgct gctcgcgacc 8700
cgcgagtacg tccacgcgcg ctgggcggcc ttcgaacagc tcctggccga tttcccggag 8760
gcggccgaca tgcgcgcccc cgggccctat tccatgcgca tcatctacgg ggacacggac 8820
tccatctttg tgctgtgccg cggcctcacg gccgccgggc tgacggccgt gggcgacaag 8880
atggcgagcc acatctcgcg cgcgctgttt ctgcccccca tcaaactcga gtgcgaaaag 8940
acgttcacca agctgctgct gatcgccaag aaaaagtaca tcggcgtcat ctacgggggt 9000
aagatgctca tcaagggcgt ggatctggtg cgcaaaaaca actgcgcgtt tatcaaccgc 9060
acctccaggg ccctggtcga cctgctgttt tacgacgata ccgtctccgg agcggccgcc 9120
gcgttagccg agcgccccgc ggaggagtgg ctggcgcgac ccctgcccga gggactgcag 9180
gcgttcgggg ccgtcctcgt agacgcccat cggcgcatca ccgacccgga gagggacatc 9240
caggactttg tcctcaccgc cgaactgagc agacacccgc gcgcgtacac caacaagcgc 9300
ctggcccacc tgacggtgta ttacaagctc atggcccgcc gcgcgcaggt cccgtccatc 9360
aaggaccgga tcccgtacgt gatcgtggcc cagacccgcg aggtagagga gacggtcgcg 9420
cggctggccg ccctccgcga gctagacgcc gccgccccag gggacgagcc cgcccccccc 9480
gcggccctgc cctccccggc caagcgcccc cgggagacgc cgtcgcctgc cgaccccccg 9540
ggaggcgcgt ccaagccccg caagctgctg gtgtccgagc tggccgagga tcccgcatac 9600
gccattgccc acggcgtcgc cctgaacacg gactattact tctcccacct gttgggggcg 9660
gcgtgcgtga cattcaaggc cctgtttggg aataacgcca agatcaccga gagtctgtta 9720
aaaaggttta ttcccgaagt gtggcacccc ccggacgacg tggccgcgcg gctccggacc 9780
gcagggttcg gggcggtggg tgccggcgct acggcggagg aaactcgtcg aatgttgcat 9840
agagcctttg atactctagc agaattcggc agtggagcaa caaacttctc tctgctgaaa 9900
caagccggag atgtcgaaga gaatcctgga ccgacggatt cccctggcgg tgtggccccc 9960
gcctcccccg tggaggacgc gtcggacgcg tccctcgggc agccggagga gggggcgccc 10020
tgccaggtgg tcctgcaggg cgccgaactt aatggaatcc tacaggcgtt tgccccgctg 10080
cgcacgagcc ttctggactc gcttctggtt atgggcgacc ggggcatcct tatccataac 10140
acgatctttg gggagcaggt gttcctgccc ctggaacact cgcaattcag tcggtatcgc 10200
tggcgcggac ccacggcggc gttcctgtct ctcgtggacc agaagcgctc cctcctgagc 10260
gtgtttcgcg ccaaccagta cccggaccta cgtcgggtgg agttggcgat cacgggccag 10320
gccccgtttc gcacgctggt tcagcgcata tggacgacga cgtccgacgg cgaggccgtt 10380
gagctagcca gcgagacgct gatgaagcgc gaactgacga gctttgtggt gctggttccc 10440
cagggaaccc ccgacgttca gttgcgcctg acgaggccgc agctcaccaa ggtccttaac 10500
gcgaccgggg ccgatagtgc cacgcccacc acgttcgagc tcggggttaa cggcaaattt 10560
tccgtgttca ccacgagtac ctgcgtcacc tttgctgccc gcgaggaggg cgtgtcgtcc 10620
agcaccagca cccaggtcca gatcctgtcc aacgcgctca ccaaggcggg ccaggcggcc 10680
gccaacgcca agacggtgta cggggaaaat acccatcgca ccttctctgt ggtcgtcgac 10740
gattgcagca tgcgggcggt gctccggcga ctgcaggtcg gcgggggcac cctcaagttc 10800
ttcctcacga cccccgtccc cagtctgtgc gtcaccgcca ccggtcccaa cgcggtatcg 10860
gcggtatttc tcctgaaacc ccagaagatt tgcctggact ggctgggtca tagccagggg 10920
tctccttcag ccgggagctc ggcctcccgg gcctctggga gcgagccaac agacagccag 10980
gactccgcgt cggacgcggt cagccacggc gatccggaag acctcgatgg cgctgcccgg 11040
gcgggagagg cgggggcctt gcatgcctgt ccgatgccgt cgtcgaccac gcgggtcact 11100
cccacgacca agcgggggcg ctcggggggc gaggatgcgc gcgcggacac ggccctaaag 11160
aaacctaaga cggggtcgcc caccgcaccc ccgcccgcag atccagtccc cctggacacg 11220
gaggacgact ccgatgcggc ggacgggacg gcggcccgtc ccgccgctcc agacgcccgg 11280
agcggaagcc gttacgcgtg ttactttcgc gacctcccga ccggagaagc aagccccggc 11340
gccttctccg ccttccgggg gggcccccaa accccgtatg gttttggatt cccctgataa 11400
gatccgactg caggtagaat aaaggaaatt tattttcatt gcaatagtgt gttggaattt 11460
tttgtgtctc tcagtttaaa cgcggccgcc gtttgtgtta tgtttcaacg tgtttatttt 11520
tcaattgcag aaaatttcaa gtcatttttc attcagtagt atagccccac caccacatag 11580
cttatacaga tcaccgtacc ttaatcaaac tcacagaacc ctagtattca acctgccacc 11640
tccctcccaa cacacagagt acacagtcct ttctccccgg ctggccttaa aaagcatcat 11700
atcatgggta acagacatat tcttaggtgt tatattccac acggtttcct gtcgagccaa 11760
acgctcatca gtgatattaa taaactcccc gggcagctca cttaagttca tgtcgctgtc 11820
cagctgctga gccacaggct gctgtccaac ttgcggttgc ttaacgggcg gcgaaggaga 11880
agtccacgcc tacatggggg tagagtcata atcgtgcatc aggatagggc ggtggtgctg 11940
cagcagcgcg cgaataaact gctgccgccg ccgctccgtc ctgcaggaat acaacatggc 12000
agtggtctcc tcagcgatga ttcgcaccgc ccgcagcata aggcgccttg tcctccgggc 12060
acagcagcgc accctgatct cacttaaatc agcacagtaa ctgcagcaca gcaccacaat 12120
attgttcaaa atcccacagt gcaaggcgct gtatccaaag ctcatggcgg ggaccacaga 12180
acccacgtgg ccatcatacc acaagcgcag gtagattaag tggcgacccc tcataaacac 12240
gctggacata aacattacct cttttggcat gttgtaattc accacctccc ggtaccatat 12300
aaacctctga ttaaacatgg cgccatccac caccatccta aaccagctgg ccaaaacctg 12360
cccgccggct atacactgca gggaaccggg actggaacaa tgacagtgga gagcccagga 12420
ctcgtaacca tggatcatca tgctcgtcat gatatcaatg ttggcacaac acaggcacac 12480
gtgcatacac ttcctcagga ttacaagctc ctcccgcgtt agaaccatat cccagggaac 12540
aacccattcc tgaatcagcg taaatcccac actgcaggga agacctcgca cgtaactcac 12600
gttgtgcatt gtcaaagtgt tacattcggg cagcagcgga tgatcctcca gtatggtagc 12660
gcgggtttct gtctcaaaag gaggtagacg atccctactg tacggagtgc gccgagacaa 12720
ccgagatcgt gttggtcgta gtgtcatgcc aaatggaacg ccggacgtag tcatatttcc 12780
tgaagcaaaa ccaggtgcgg gcgtgacaaa cagatctgcg tctccggtct cgccgcttag 12840
atcgctctgt gtagtagttg tagtatatcc actctctcaa agcatccagg cgccccctgg 12900
cttcgggttc tatgtaaact ccttcatgcg ccgctgccct gataacatcc accaccgcag 12960
aataagccac acccagccaa cctacacatt cgttctgcga gtcacacacg ggaggagcgg 13020
gaagagctgg aagaaccatg tttttttttt tattccaaaa gattatccaa aacctcaaaa 13080
tgaagatcta ttaagtgaac gcgctcccct ccggtggcgt ggtcaaactc tacagccaaa 13140
gaacagataa tggcatttgt aagatgttgc acaatggctt ccaaaaggca aacggccctc 13200
acgtccaagt ggacgtaaag gctaaaccct tcagggtgaa tctcctctat aaacattcca 13260
gcaccttcaa ccatgcccaa ataattctca tctcgccacc ttctcaatat atctctaagc 13320
aaatcccgaa tattaagtcc ggccattgta aaaatctgct ccagagcgcc ctccaccttc 13380
agcctcaagc agcgaatcat gattgcaaaa attcaggttc ctcacagacc tgtataagat 13440
tcaaaagcgg aacattaaca aaaataccgc gatcccgtag gtcccttcgc agggccagct 13500
gaacataatc gtgcaggtct gcacggacca gcgcggccac ttccccgcca ggaaccatga 13560
caaaagaacc cacactgatt atgacacgca tactcggagc tatgctaacc agcgtagccc 13620
cgatgtaagc ttgttgcatg ggcggcgata taaaatgcaa ggtgctgctc aaaaaatcag 13680
gcaaagcctc gcgcaaaaaa gaaagcacat cgtagtcatg ctcatgcaga taaaggcagg 13740
taagctccgg aaccaccaca gaaaaagaca ccatttttct ctcaaacatg tctgcgggtt 13800
tctgcataaa cacaaaataa aataacaaaa aaacatttaa acattagaag cctgtcttac 13860
aacaggaaaa acaaccctta taagcataag acggactacg gccatgccgg cgtgaccgta 13920
aaaaaactgg tcaccgtgat taaaaagcac caccgacagc tcctcggtca tgtccggagt 13980
cataatgtaa gactcggtaa acacatcagg ttgattcaca tcggtcagtg ctaaaaagcg 14040
accgaaatag cccgggggaa tacatacccg caggcgtaga gacaacatta cagcccccat 14100
aggaggtata acaaaattaa taggagagaa aaacacataa acacctgaaa aaccctcctg 14160
cctaggcaaa atagcaccct cccgctccag aacaacatac agcgcttcca cagcggcagc 14220
cataacagtc agccttacca gtaaaaaaga aaacctatta aaaaaacacc actcgacacg 14280
gcaccagctc aatcagtcac agtgtaaaaa agggccaagt gcagagcgag tatatatagg 14340
actaaaaaat gacgtaacgg ttaaagtcca caaaaaacac ccagaaaacc gcacgcgaac 14400
ctacgcccag aaacgaaagc caaaaaaccc acaacttcct caaatcgtca cttccgtttt 14460
cccacgttac gtcacttccc attttaagaa aactacaatt cccaacacat acaagttact 14520
ccgcccttaa ttaaatcgga tccgatatct agatgtattc gcgaggtacc gagctcgaat 14580
tctctggccg tcgttttaca acgtcgtgac tgggaaaacc ctggcgttac ccaacttaat 14640
cgccttgcag cacatccccc tttcgccagc tggcgtaata gcgaagaggc ccgcaccgat 14700
cgcccttccc aacagttgcg cagcctgaat ggcgaatggc gcctgatgcg gtattttctc 14760
cttacgcatc tgtgcggtat ttcacaccgc atatggtgca ctctcagtac aatctgctct 14820
gatgccgcat agttaagcca gccccgacac ccgccaacac ccgctgacgc gccctgacgg 14880
gcttgtctgc tcccggcatc cgcttacaga caagctgtga ccgtctccgg gagctgcatg 14940
tgtcagaggt tttcaccgtc atcaccgaaa cgcgcga 14977
<210> 58
<211> 15278
<212> DNA
<213> Artificial Sequence
<220>
<223> Description of Artificial Sequence: Synthetic polynucleotide
<400> 58
tgcagctctg gcccgtgtct caaaatctct gatgttacat tgcacaagat aaaaatatat 60
catcatgaac aataaaactg tctgcttaca taaacagtaa tacaaggggt gttatgagcc 120
atattcaacg ggaaacgtcg aggccgcgat taaattccaa catggatgct gatttatatg 180
ggtataaatg ggctcgcgat aatgtcgggc aatcaggtgc gacaatctat cgcttgtatg 240
ggaagcccga tgcgccagag ttgtttctga aacatggcaa aggtagcgtt gccaatgatg 300
ttacagatga gatggtcaga ctaaactggc tgacggaatt tatgcctctt ccgaccatca 360
agcattttat ccgtactcct gatgatgcat ggttactcac cactgcgatc cccggaaaaa 420
cagcattcca ggtattagaa gaatatcctg attcaggtga aaatattgtt gatgcgctgg 480
cagtgttcct gcgccggttg cattcgattc ctgtttgtaa ttgtcctttt aacagcgatc 540
gcgtatttcg tctcgctcag gcgcaatcac gaatgaataa cggtttggtt gatgcgagtg 600
attttgatga cgagcgtaat ggctggcctg ttgaacaagt ctggaaagaa atgcataaac 660
ttttgccatt ctcaccggat tcagtcgtca ctcatggtga tttctcactt gataacctta 720
tttttgacga ggggaaatta ataggttgta ttgatgttgg acgagtcgga atcgcagacc 780
gataccagga tcttgccatc ctatggaact gcctcggtga gttttctcct tcattacaga 840
aacggctttt tcaaaaatat ggtattgata atcctgatat gaataaattg cagtttcatt 900
tgatgctcga tgagtttttc taatcagaat tggttaattg gttgtaacat tattcagatt 960
gggcttgatt taaaacttca tttttaattt aaaaggatct aggtgaagat cctttttgat 1020
aatctcatga ccaaaatccc ttaacgtgag ttttcgttcc actgagcgtc agaccccgta 1080
gaaaagatca aaggatcttc ttgagatcct ttttttctgc gcgtaatctg ctgcttgcaa 1140
acaaaaaaac caccgctacc agcggtggtt tgtttgccgg atcaagagct accaactctt 1200
tttccgaagg taactggctt cagcagagcg cagataccaa atactgttct tctagtgtag 1260
ccgtagttag gccaccactt caagaactct gtagcaccgc ctacatacct cgctctgcta 1320
atcctgttac cagtggctgc tgccagtggc gataagtcgt gtcttaccgg gttggactca 1380
agacgatagt taccggataa ggcgcagcgg tcgggctgaa cggggggttc gtgcacacag 1440
cccagcttgg agcgaacgac ctacaccgaa ctgagatacc tacagcgtga gctatgagaa 1500
agcgccacgc ttcccgaagg gagaaaggcg gacaggtatc cggtaagcgg cagggtcgga 1560
acaggagagc gcacgaggga gcttccaggg ggaaacgcct ggtatcttta tagtcctgtc 1620
gggtttcgcc acctctgact tgagcgtcga tttttgtgat gctcgtcagg ggggcggagc 1680
ctatggaaaa acgccagcaa cgcggccttt ttacggttcc tggccttttg ctggcctttt 1740
gctcacatgt tctttcctgc gttatcccct gattctgtgg ataaccgtat taccgccttt 1800
gagtgagctg ataccgctcg ccgcagccga acgaccgagc gcagcgagtc agtgagcgag 1860
gaagcggaag agcgcccaat acgcaaaccg cctctccccg cgcgttggcc gattcattaa 1920
tgcagctggc acgacaggtt tcccgactgg aaagcgggca gtgagcgcaa cgcaattaat 1980
gtgagttagc tcactcatta ggcaccccag gctttacact ttatgcttcc ggctcgtatg 2040
ttgtgtggaa ttgtgagcgg ataacaattt cacacaggaa acagctatga ccatgattac 2100
accaagcttg catgcaggcc tctgcagtcg accagaagca ccatgtcctt gggtccggcc 2160
tgctgaatgc gcaggcggtc ggccatgccc caggcttcgt tttgacatcg gcgcaggtct 2220
ttgtagtagt cttgcatgag cctttctacc ggcacttctt cttctccttc ctcttgtcct 2280
gcatctcttg catctatcgc tgcggcggcg gcggagtttg gccgtaggtg gcgccctctt 2340
cctcccatgc gtgtgacccc gaagcccctc atcggctgaa gcagggctag gtcggcgaca 2400
acgcgctcgg ctaatatggc ctgctgcacc tgcgtgaggg tagactggaa gtcatccatg 2460
tccacaaagc ggtggtatgc gcccgtgttg atggtgtaag tgcagttggc cataacggac 2520
cagttaacgg tctggtgacc cggctgcgag agctcggtgt acctgagacg cgagtaagcc 2580
ctcgagtcaa atacgtagtc gttgcaagtc cgcaccaggt actggtatcc caccaaaaag 2640
tgcggcggcg gctggcggta gaggggccag cgtagggtgg ccggggctcc gggggcgaga 2700
tcttccaaca taaggcgatg atatccgtag atgtacctgg acatccaggt gatgccggcg 2760
gcggtggtgg aggcgcgcgg aaagtcgcgg acgcggttcc agatgttgcg cagcggcaaa 2820
aagtgctcca tggtcgggac gctctggccg gtcaggcgcg cgcaatcgtt gacgctctag 2880
cgtgcaaaag gagagcctgt aagcgggcac tcttccgtgg tctggtggat aaattcgcaa 2940
gggtatcatg gcggacgacc ggggttcgag ccccgtatcc ggccgtccgc cgtgatccat 3000
gcggttaccg cccgcgtgtc gaacccaggt gtgcgacgtc agacaacggg ggagtgctcc 3060
ttttggcttc cttccaggcg cggcggctgc tgcgctagct tttttggcca ctggccgcgc 3120
gcagcgtaag cggttaggct ggaaagcgaa agcattaagt ggctcgctcc ctgtagccgg 3180
agggttattt tccaagggtt gagtcgcggg acccccggtt cgagtctcgg accgagactg 3240
ggggcgtaca ctggatggcc tttgcctgga acccgcactc aaaaacatgc tacctctttg 3300
agccctttgg cttttctgac cagcgactca agcaggttta ccagtttgag tacgagtcac 3360
tcctgcgccg tagcgccatt gcttcttccc ccgaccgctg tataacgctg gaaaagtcca 3420
cccaaagcgt acaggggccc aactcggccg cctgtggact attctgctgc atgtttctcc 3480
acgcctttgc caactggccc caaactccca tggatcacaa ccccaccatg aaccttatta 3540
ccggggtacc caactccatg ctcaacagtc cccaggtaca gcccaccctg cgtcgcaacc 3600
aggaacagct ctacagcttc ctggagcgcc actcgcccta cttccgcagc cacagtgcgc 3660
agattaggag cgccacttct ttttgtcact tgaaaaacat gtaaaaataa tgtactagag 3720
acactttcaa taaaggcaaa tgcttttatt tgtacactct cgggtgatta tttaccccca 3780
cccttgccgt ctgcgccgtt taaaaatcaa aggggttctg ccgcgcatcg ctatgcgcca 3840
ctggcaggga cacgttgcga tactggtgtt tagtgctcca cttaaactca ggcacaacca 3900
tccgcggcag ctcggtgaag ttttcactcc acaggctgcg caccatcacc aacgcgttta 3960
gcaggtcggg cgccgatatc ttgaagtcgc agttggggcc tccgccctgc gcgcgcgagt 4020
tgcgatacac agggttgcag cactggaaca ctatcagcgc cgggtggtgc acgctggcca 4080
gcacgctctt gtcggagatc agatccgcgt ccaggtcctc cgcgttgctc agggcgaacg 4140
gagtcaactt tggtagctgc cttcccaaaa agggcgcgtg cccaggcttt gagttgcact 4200
cgcaccgtag tggcatcaaa aggtgaccgt gcccggtctg ggcgttagga tacagcgcct 4260
gcataaaagc cttgatctgc ttaaaagcca cctgagcctt tgcgccttca gagaagaaca 4320
tgccgcaaga cttgccggaa aactgattgg ccggacaggc cgcgtcgtgc acgcagcacc 4380
ttgcgtcggt gttggagatc tgcaccacat ttcggcccca ccggttcttc acgatcttgg 4440
ccttgctaga ctgctccttc agcgcgcgct gcccgttttc gctcgtcaca tccatttcaa 4500
tcacgtgctc cttatttatc ataatgcttc cgtgtagaca cttaagctcg ccttcgatct 4560
cagcgcagcg gtgcagccac aacgcgcagc ccgtgggctc gtgatgcttg taggtcacct 4620
ctgcaaacga ctgcaggtac gcctgcagga atcgccccat catcgtcaca aaggtcttgt 4680
tgctggtgaa ggtcagctgc aacccgcggt gctcctcgtt cagccaggtc ttgcatacgg 4740
ccgccagagc ttccacttgg tcaggcagta gtttgaagtt cgcctttaga tcgttatcca 4800
cgtggtactt gtccatcagc gcgcgcgcag cctccatgcc cttctcccac gcagacacga 4860
tcggcacact cagcgggttc atcaccgtaa tttcactttc cgcttcgctg ggctcttcct 4920
cttcctcttg cgtccgcata ccacgcgcca ctgggtcgtc ttcattcagc cgccgcactg 4980
tgcgcttacc tcctttgcca tgcttgatta gcaccggtgg gttgctgaaa cccaccattt 5040
gtagcgccac atcttctctt tcttcctcgc tgtccacgat tacctctggt gatggcgggc 5100
gctcgggctt gggagaaggg cgcttctttt tcttcttggg cgcaatggcc aaatccgccg 5160
ccgaggtcga tggccgcggg ctgggtgtgc gcggcaccag cgcgtcttgt gatgagtctt 5220
cctcgtcctc ggactcgata cgccgcctca tccgcttttt tgggggcgcc cggggaggcg 5280
gcggcgacgg ggacggggac gacacgtcct ccatggttgg gggacgtcgc gccgcaccgc 5340
gtccgcgctc gggggtggtt tcgcgctgct cctcttcccg actggccatt tccttctcct 5400
ataggcagaa aaagatccac aaaagcgaag atcagcttcg gcgcacgctg gaagacgcgg 5460
aggctctctt cagtaaatac tgcgcgctga ctcttaagga ctagtttcgc gccctttctc 5520
aaatttaagc gcgaaaacta cgtcatctcc agcggccaca cccggcgcca gcacctgttg 5580
tcagcgccat tggcgcgccc gcccgccgcg cgcttcgctt tttatagggc cgccgccgcc 5640
gccgcctcgc cataaaagga aactttcgga gcgcgccgct ctgattggct gccgccgcac 5700
ctctccgcct cgccccgccc cgcccctcgc cccgccccgc cccgcctggc gcgcgccccc 5760
cccccccccc cgcccccatc gctgcacaaa ataattaaaa aataaataaa tacaaaattg 5820
ggggtgggga ggggggggag atggggagag tgaagcagaa cgtggggctc acctcgaggc 5880
cggccgaata tcttcattta aatgtgtgtc agttagggtg tggaaagtcc ccaggctccc 5940
cagcaggcag aagtatgcaa agcatgcatc tcaattagtc agcaaccagg tgtggaaagt 6000
ccccaggctc cccagcaggc agaagtatgc aaagcatgca tctcaattag tcagcaacca 6060
tagtcccgcc cctaactccg cccatcccgc ccctaactcc gcccagttcc gcccattctc 6120
cgccccatgg ctgactaatt ttttttattt atgcagaggc cgaggccgcc tcggcctctg 6180
agctattcca gaagtagtga ggaggctttt ttggaggcct aggcttttgc aaacgccggc 6240
gcaccgcggg cccgatccac cggtactgtt ggtaaagcca ccatgttttc cggtggcggc 6300
ggcccgctgt cccccggagg aaagtcggcg gccagggcgg cgtccgggtt ttttgcgccc 6360
gccggccctc gcggagccag ccggggaccc ccgccttgtt tgaggcaaaa cttttacaac 6420
ccctacctcg ccccagtcgg gacgcaacag aagccgaccg ggccaaccca gcgccatacg 6480
tactatagcg aatgcgatga atttcgattc atcgccccgc gggtgctgga cgaggatgcc 6540
cccccggaga agcgcgccgg ggtgcacgac ggtcacctca agcgcgcccc caaggtgtac 6600
tgcggggggg acgagcgcga cgtcctccgc gtcgggtcgg gcggcttctg gccgcggcgc 6660
tcgcgcctgt ggggcggcgt ggaccacgcc ccggcggggt tcaaccccac cgtcaccgtc 6720
tttcacgtgt acgacatcct ggagaacgtg gagcacgcgt acggcatgcg cgcggcccag 6780
ttccacgcgc ggtttatgga cgccatcaca ccgacgggga ccgtcatcac gctcctgggc 6840
ctgactccgg aaggccaccg ggtggccgtt cacgtttacg gcacgcggca gtacttttac 6900
atgaacaagg aggaggtcga caggcaccta caatgccgcg ccccacgaga tctctgcgag 6960
cgcatggccg cggccctgcg cgagtccccg ggcgcgtcgt tccgcggcat ctccgcggac 7020
cacttcgagg cggaggtggt ggagcgcacc gacgtgtact actacgagac gcgccccgct 7080
ctgttttacc gcgtctacgt ccgaagcggg cgcgtgctgt cgtacctgtg cgacaacttc 7140
tgcccggcca tcaagaagta cgagggtggg gtcgacgcca ccacccggtt catcctggac 7200
aaccccgggt tcgtcacctt cggctggtac cgtctcaaac cgggccggaa caacacgcta 7260
gcccagccgc gggccccgat ggccttcggg acatccagcg acgtcgagtt taactgtacg 7320
gcggacaacc tggccatcga ggggggcatg agcgacctac cggcatacaa gctcatgtgc 7380
ttcgatatcg aatgcaaggc ggggggggag gacgagctgg cctttccggt ggccgggcac 7440
ccggaggacc tggtcatcca gatatcctgt ctgctctacg acctgtccac caccgccctg 7500
gagcacgtcc tcctgttttc gctcggttcc tgcgacctcc ccgaatccca cctgaacgag 7560
ctggcggcca ggggcctgcc cacgcccgtg gttctggaat tcgacagcga attcgagatg 7620
ctgttggcct tcatgaccct tgtgaaacag tacggccccg agttcgtgac cgggtacaac 7680
atcatcaact tcgactggcc cttcttgctg gccaagctga cggacattta caaggtcccc 7740
ctggacgggt acggccgcat gaacggccgg ggcgtgtttc gcgtgtggga cataggccag 7800
agccacttcc agaagcgcag caagataaag gtgaacggca tggtgaacat cgacatgtac 7860
gggattataa ccgacaagat caagctctcg agctacaagc tcaacgccgt ggccgaagcc 7920
gtcctgaagg acaagaagaa ggacctgagc tatcgcgaca tccccgccta ctacgccgcc 7980
gggcccgcgc aacgcggggt gatcggcgag tactgcatac aggattccct gctggtgggc 8040
cagctgtttt ttaagttttt gccccatctg gagctctcgg ccgtcgcgcg cttggcgggt 8100
attaacatca cccgcaccat ctacgacggc cagcagatcc gcgtctttac gtgcctgctg 8160
cgcctggccg accagaaggg ctttattctg ccggacaccc aggggcgatt taggggcgcc 8220
gggggggagg cgcccaagcg tccggccgca gcccgggagg acgaggagcg gccagaggag 8280
gagggggagg acgaggacga acgcgaggag ggcgggggcg agcgggagcc ggagggcgcg 8340
cgggagaccg ccggcaggca cgtggggtac cagggggcca gggtccttga ccccacttcc 8400
gggtttcacg tgaaccccgt ggtggtgttc gactttgcca gcctgtaccc cagcatcatc 8460
caggcccaca acctgtgctt cagcacgctc tccctgaggg ccgacgcagt ggcgcacctg 8520
gaggcgggca aggactacct ggagatcgag gtgggggggc gacggctgtt cttcgtcaag 8580
gctcacgtgc gagagagcct cctcagcatc ctcctgcggg actggctcgc catgcgaaag 8640
cagatccgct cgcggattcc ccagagcagc cccgaggagg ccgtgctcct ggacaagcag 8700
caggccgcca tcaaggtcgt gtgtaactcg gtgtacgggt tcacgggagt gcagcacgga 8760
ctcctgccgt gcctgcacgt tgccgcgacg gtgacgacca tcggccgcga gatgctgctc 8820
gcgacccgcg agtacgtcca cgcgcgctgg gcggccttcg aacagctcct ggccgatttc 8880
ccggaggcgg ccgacatgcg cgcccccggg ccctattcca tgcgcatcat ctacggggac 8940
acggactcca tctttgtgct gtgccgcggc ctcacggccg ccgggctgac ggccgtgggc 9000
gacaagatgg cgagccacat ctcgcgcgcg ctgtttctgc cccccatcaa actcgagtgc 9060
gaaaagacgt tcaccaagct gctgctgatc gccaagaaaa agtacatcgg cgtcatctac 9120
gggggtaaga tgctcatcaa gggcgtggat ctggtgcgca aaaacaactg cgcgtttatc 9180
aaccgcacct ccagggccct ggtcgacctg ctgttttacg acgataccgt ctccggagcc 9240
gccgcggcgt tagccgagcg ccccgcggag gagtggctgg cgcgacccct gcccgaggga 9300
ctgcaggcgt tcggggccgt cctcgtagac gcccatcggc gcatcaccga cccggagagg 9360
gacatccagg actttgtcct caccgccgaa ctgagcagac acccgcgcgc gtacaccaac 9420
aagcgcctgg cccacctgac ggtgtattac aagctcatgg cccgccgcgc gcaggtcccg 9480
tccatcaagg accggatccc gtacgtgatc gtggcccaga cccgcgaggt agaggagacg 9540
gtcgcgcggc tggccgccct ccgcgagcta gacgccgccg ccccagggga cgagcccgcc 9600
ccccccgcgg ccctgccctc cccggccaag cgcccccggg agacgccgtc gcctgccgac 9660
cccccgggag gcgcgtccaa gccccgcaag ctgctggtgt ccgagctggc cgaggatccc 9720
gcatacgcca ttgcccacgg cgtcgccctg aacacggact attacttctc ccacctgttg 9780
ggggcggcgt gcgtgacatt caaggccctg tttgggaata acgccaagat caccgagagt 9840
ctgttaaaaa ggtttattcc cgaagtgtgg caccccccgg acgacgtggc cgcgcggctc 9900
cggaccgcag ggttcggggc ggtgggtgcc ggcgctacgg cggaggaaac tcgtcgaatg 9960
ttgcatagag cctttgatac tctagcagaa ttcggcagtg gagcaacaaa cttctctctg 10020
ctgaaacaag ccggagatgt cgaagagaat cctggaccga cggattcccc tggcggtgtg 10080
gcccccgcct cccccgtgga ggacgcgtcg gacgcgtccc tcgggcagcc ggaggagggg 10140
gcgccctgcc aggtggtcct gcagggcgcc gaacttaatg gaatcctaca ggcgtttgcc 10200
ccgctgcgca cgagccttct ggactcgctt ctggttatgg gcgaccgggg catccttatc 10260
cataacacga tctttgggga gcaggtgttc ctgcccctgg aacactcgca attcagtcgg 10320
tatcgctggc gcggacccac ggcggcgttc ctgtctctcg tggaccagaa gcgctccctc 10380
ctgagcgtgt ttcgcgccaa ccagtacccg gacctacgtc gggtggagtt ggcgatcacg 10440
ggccaggccc cgtttcgcac gctggttcag cgcatatgga cgacgacgtc cgacggcgag 10500
gccgttgagc tagccagcga gacgctgatg aagcgcgaac tgacgagctt tgtggtgctg 10560
gttccccagg gaacccccga cgttcagttg cgcctgacga ggccgcagct caccaaggtc 10620
cttaacgcga ccggggccga tagtgccacg cccaccacgt tcgagctcgg ggttaacggc 10680
aaattttccg tgttcaccac gagtacctgc gtcacctttg ctgcccgcga ggagggcgtg 10740
tcgtccagca ccagcaccca ggtccagatc ctgtccaacg cgctcaccaa ggcgggccag 10800
gccgccgcga acgccaagac ggtgtacggg gaaaataccc atcgcacctt ctctgtggtc 10860
gtcgacgatt gcagcatgcg ggcggtgctc cggcgactgc aggtcggcgg gggcaccctc 10920
aagttcttcc tcacgacccc cgtccccagt ctgtgcgtca ccgccaccgg tcccaacgcg 10980
gtatcggcgg tatttctcct gaaaccccag aagatttgcc tggactggct gggtcatagc 11040
caggggtctc cttcagccgg gagctcggcc tcccgggcct ctgggagcga gccaacagac 11100
agccaggact ccgcgtcgga cgcggtcagc cacggcgatc cggaagacct cgatggcgct 11160
gcccgggcgg gagaggcggg ggccttgcat gcctgtccga tgccgtcgtc gaccacgcgg 11220
gtcactccca cgaccaagcg ggggcgctcg gggggcgagg atgcgcgcgc ggacacggcc 11280
ctaaagaaac ctaagacggg gtcgcccacc gcacccccgc ccgcagatcc agtccccctg 11340
gacacggagg acgactccga tgcggcggac gggacggcgg cccgtcccgc cgctccagac 11400
gcccggagcg gaagccgtta cgcgtgttac tttcgcgacc tcccgaccgg agaagcaagc 11460
cccggcgcct tctccgcctt ccgggggggc ccccaaaccc cgtatggttt tggattcccc 11520
tgataggatc cgactgcagg tagctgtgcc ttctagttgc cagccatctg ttgtttgccc 11580
ctcccccgtg ccttccttga ccctggaagg tgccactccc actgtccttt cctaataaaa 11640
tgaggaaatt gcatcgcatt gtctgagtag gtgtcattct attctggggg gtggggtggg 11700
gcaggacagc aagggggagg attgggaaga caatagcagg catgctgggg atgcggtggg 11760
ctctatgggt ttaaacatcg atgcggccgc cgtttgtgtt atgtttcaac gtgtttattt 11820
ttcaattgca gaaaatttca agtcattttt cattcagtag tatagcccca ccaccacata 11880
gcttatacag atcaccgtac cttaatcaaa ctcacagaac cctagtattc aacctgccac 11940
ctccctccca acacacagag tacacagtcc tttctccccg gctggcctta aaaagcatca 12000
tatcatgggt aacagacata ttcttaggtg ttatattcca cacggtttcc tgtcgagcca 12060
aacgctcatc agtgatatta ataaactccc cgggcagctc acttaagttc atgtcgctgt 12120
ccagctgctg agccacaggc tgctgtccaa cttgcggttg cttaacgggc ggcgaaggag 12180
aagtccacgc ctacatgggg gtagagtcat aatcgtgcat caggataggg cggtggtgct 12240
gcagcagcgc gcgaataaac tgctgccgcc gccgctccgt cctgcaggaa tacaacatgg 12300
cagtggtctc ctcagcgatg attcgcaccg cccgcagcat aaggcgcctt gtcctccggg 12360
cacagcagcg caccctgatc tcacttaaat cagcacagta actgcagcac agcaccacaa 12420
tattgttcaa aatcccacag tgcaaggcgc tgtatccaaa gctcatggcg gggaccacag 12480
aacccacgtg gccatcatac cacaagcgca ggtagattaa gtggcgaccc ctcataaaca 12540
cgctggacat aaacattacc tcttttggca tgttgtaatt caccacctcc cggtaccata 12600
taaacctctg attaaacatg gcgccatcca ccaccatcct aaaccagctg gccaaaacct 12660
gcccgccggc tatacactgc agggaaccgg gactggaaca atgacagtgg agagcccagg 12720
actcgtaacc atggatcatc atgctcgtca tgatatcaat gttggcacaa cacaggcaca 12780
cgtgcataca cttcctcagg attacaagct cctcccgcgt tagaaccata tcccagggaa 12840
caacccattc ctgaatcagc gtaaatccca cactgcaggg aagacctcgc acgtaactca 12900
cgttgtgcat tgtcaaagtg ttacattcgg gcagcagcgg atgatcctcc agtatggtag 12960
cgcgggtttc tgtctcaaaa ggaggtagac gatccctact gtacggagtg cgccgagaca 13020
accgagatcg tgttggtcgt agtgtcatgc caaatggaac gccggacgta gtcatatttc 13080
ctgaagcaaa accaggtgcg ggcgtgacaa acagatctgc gtctccggtc tcgccgctta 13140
gatcgctctg tgtagtagtt gtagtatatc cactctctca aagcatccag gcgccccctg 13200
gcttcgggtt ctatgtaaac tccttcatgc gccgctgccc tgataacatc caccaccgca 13260
gaataagcca cacccagcca acctacacat tcgttctgcg agtcacacac gggaggagcg 13320
ggaagagctg gaagaaccat gttttttttt ttattccaaa agattatcca aaacctcaaa 13380
atgaagatct attaagtgaa cgcgctcccc tccggtggcg tggtcaaact ctacagccaa 13440
agaacagata atggcatttg taagatgttg cacaatggct tccaaaaggc aaacggccct 13500
cacgtccaag tggacgtaaa ggctaaaccc ttcagggtga atctcctcta taaacattcc 13560
agcaccttca accatgccca aataattctc atctcgccac cttctcaata tatctctaag 13620
caaatcccga atattaagtc cggccattgt aaaaatctgc tccagagcgc cctccacctt 13680
cagcctcaag cagcgaatca tgattgcaaa aattcaggtt cctcacagac ctgtataaga 13740
ttcaaaagcg gaacattaac aaaaataccg cgatcccgta ggtcccttcg cagggccagc 13800
tgaacataat cgtgcaggtc tgcacggacc agcgcggcca cttccccgcc aggaaccatg 13860
acaaaagaac ccacactgat tatgacacgc atactcggag ctatgctaac cagcgtagcc 13920
ccgatgtaag cttgttgcat gggcggcgat ataaaatgca aggtgctgct caaaaaatca 13980
ggcaaagcct cgcgcaaaaa agaaagcaca tcgtagtcat gctcatgcag ataaaggcag 14040
gtaagctccg gaaccaccac agaaaaagac accatttttc tctcaaacat gtctgcgggt 14100
ttctgcataa acacaaaata aaataacaaa aaaacattta aacattagaa gcctgtctta 14160
caacaggaaa aacaaccctt ataagcataa gacggactac ggccatgccg gcgtgaccgt 14220
aaaaaaactg gtcaccgtga ttaaaaagca ccaccgacag ctcctcggtc atgtccggag 14280
tcataatgta agactcggta aacacatcag gttgattcac atcggtcagt gctaaaaagc 14340
gaccgaaata gcccggggga atacataccc gcaggcgtag agacaacatt acagccccca 14400
taggaggtat aacaaaatta ataggagaga aaaacacata aacacctgaa aaaccctcct 14460
gcctaggcaa aatagcaccc tcccgctcca gaacaacata cagcgcttcc acagcggcag 14520
ccataacagt cagccttacc agtaaaaaag aaaacctatt aaaaaaacac cactcgacac 14580
ggcaccagct caatcagtca cagtgtaaaa aagggccaag tgcagagcga gtatatatag 14640
gactaaaaaa tgacgtaacg gttaaagtcc acaaaaaaca cccagaaaac cgcacgcgaa 14700
cctacgccca gaaacgaaag ccaaaaaacc cacaacttcc tcaaatcgtc acttccgttt 14760
tcccacgtta cgtcacttcc cattttaaga aaactacaat tcccaacaca tacaagttac 14820
tccgccctta attaaatcgg atccgatatc tagatgtatt cgcgaggtac cgagctcgaa 14880
ttctctggcc gtcgttttac aacgtcgtga ctgggaaaac cctggcgtta cccaacttaa 14940
tcgccttgca gcacatcccc ctttcgccag ctggcgtaat agcgaagagg cccgcaccga 15000
tcgcccttcc caacagttgc gcagcctgaa tggcgaatgg cgcctgatgc ggtattttct 15060
ccttacgcat ctgtgcggta tttcacaccg catatggtgc actctcagta caatctgctc 15120
tgatgccgca tagttaagcc agccccgaca cccgccaaca cccgctgacg cgccctgacg 15180
ggcttgtctg ctcccggcat ccgcttacag acaagctgtg accgtctccg ggagctgcat 15240
gtgtcagagg ttttcaccgt catcaccgaa acgcgcga 15278
<210> 59
<211> 14379
<212> DNA
<213> Artificial Sequence
<220>
<223> Description of Artificial Sequence: Synthetic polynucleotide
<400> 59
tgcagctctg gcccgtgtct caaaatctct gatgttacat tgcacaagat aaaaatatat 60
catcatgaac aataaaactg tctgcttaca taaacagtaa tacaaggggt gttatgagcc 120
atattcaacg ggaaacgtcg aggccgcgat taaattccaa catggatgct gatttatatg 180
ggtataaatg ggctcgcgat aatgtcgggc aatcaggtgc gacaatctat cgcttgtatg 240
ggaagcccga tgcgccagag ttgtttctga aacatggcaa aggtagcgtt gccaatgatg 300
ttacagatga gatggtcaga ctaaactggc tgacggaatt tatgcctctt ccgaccatca 360
agcattttat ccgtactcct gatgatgcat ggttactcac cactgcgatc cccggaaaaa 420
cagcattcca ggtattagaa gaatatcctg attcaggtga aaatattgtt gatgcgctgg 480
cagtgttcct gcgccggttg cattcgattc ctgtttgtaa ttgtcctttt aacagcgatc 540
gcgtatttcg tctcgctcag gcgcaatcac gaatgaataa cggtttggtt gatgcgagtg 600
attttgatga cgagcgtaat ggctggcctg ttgaacaagt ctggaaagaa atgcataaac 660
ttttgccatt ctcaccggat tcagtcgtca ctcatggtga tttctcactt gataacctta 720
tttttgacga ggggaaatta ataggttgta ttgatgttgg acgagtcgga atcgcagacc 780
gataccagga tcttgccatc ctatggaact gcctcggtga gttttctcct tcattacaga 840
aacggctttt tcaaaaatat ggtattgata atcctgatat gaataaattg cagtttcatt 900
tgatgctcga tgagtttttc taatcagaat tggttaattg gttgtaacat tattcagatt 960
gggcttgatt taaaacttca tttttaattt aaaaggatct aggtgaagat cctttttgat 1020
aatctcatga ccaaaatccc ttaacgtgag ttttcgttcc actgagcgtc agaccccgta 1080
gaaaagatca aaggatcttc ttgagatcct ttttttctgc gcgtaatctg ctgcttgcaa 1140
acaaaaaaac caccgctacc agcggtggtt tgtttgccgg atcaagagct accaactctt 1200
tttccgaagg taactggctt cagcagagcg cagataccaa atactgttct tctagtgtag 1260
ccgtagttag gccaccactt caagaactct gtagcaccgc ctacatacct cgctctgcta 1320
atcctgttac cagtggctgc tgccagtggc gataagtcgt gtcttaccgg gttggactca 1380
agacgatagt taccggataa ggcgcagcgg tcgggctgaa cggggggttc gtgcacacag 1440
cccagcttgg agcgaacgac ctacaccgaa ctgagatacc tacagcgtga gctatgagaa 1500
agcgccacgc ttcccgaagg gagaaaggcg gacaggtatc cggtaagcgg cagggtcgga 1560
acaggagagc gcacgaggga gcttccaggg ggaaacgcct ggtatcttta tagtcctgtc 1620
gggtttcgcc acctctgact tgagcgtcga tttttgtgat gctcgtcagg ggggcggagc 1680
ctatggaaaa acgccagcaa cgcggccttt ttacggttcc tggccttttg ctggcctttt 1740
gctcacatgt tctttcctgc gttatcccct gattctgtgg ataaccgtat taccgccttt 1800
gagtgagctg ataccgctcg ccgcagccga acgaccgagc gcagcgagtc agtgagcgag 1860
gaagcggaag agcgcccaat acgcaaaccg cctctccccg cgcgttggcc gattcattaa 1920
tgcagctggc acgacaggtt tcccgactgg aaagcgggca gtgagcgcaa cgcaattaat 1980
gtgagttagc tcactcatta ggcaccccag gctttacact ttatgcttcc ggctcgtatg 2040
ttgtgtggaa ttgtgagcgg ataacaattt cacacaggaa acagctatga ccatgattac 2100
accaagcttg catgcaggcc tatccgtaga tgtacctgga catccaggtg atgccggcgg 2160
cggtggtgga ggcgcgcgga aagtcgcgga cgcggttcca gatgttgcgc agcggcaaaa 2220
agtgctccat ggtcgggacg ctctggccgg tgaggcgtgc gcagtcgttg acgctctaga 2280
ccgtgcaaaa ggagagcctg taagcgggca ctcttccgtg gtctggtgga taaattcgca 2340
agggtatcat ggcggacgac cggggttcga accccggatc cggccgtccg ccgtgatcca 2400
tgcggttacc gcccgcgtgt cgaacccagg tgtgcgacgt cagacaacgg gggagcgctc 2460
cttttggctt ccttccaggc gcggcggctg ctgcgctagc ttttttggcc actggccgcg 2520
cgcggcgtaa gcggttaggc tggaaagcga aagcattaag tggctcgctc cctgtagccg 2580
gagggttatt ttccaagggt tgagtcgcag gacccccggt tcgagtctcg ggccggccgg 2640
actgcggcga acgggggttt gcctccccgt catgcaagac cccgcttgca aattcctccg 2700
gaaacaggga cgagcccctt ttttgctttt cccagatgca tccggtgctg cggcagatgc 2760
gcccccctcc tcagcagcgg caagagcaag agcagcggca gacatgcagg gcaccctccc 2820
cttctcctac cgcgtcagga ggggcaacat ctgtacactc tcgggtgatt atttaccccc 2880
acccttgccg tctgcgccgt ttaaaaatca aaggggttct gccgcgcatc gctatgcgcc 2940
actggcaggg acacgttgcg atactggtgt ttagtgctcc acttaaactc aggcacaacc 3000
atccgcggca gctcggtgaa gttttcactc cacaggctgc gcaccatcac caacgcgttt 3060
agcaggtcgg gcgccgatat cttgaagtcg cagttggggc ctccgccctg cgcgcgcgag 3120
ttgcgataca cagggttgca gcactggaac actatcagcg ccgggtggtg cacgctggcc 3180
agcacgctct tgtcggagat cagatccgcg tccaggtcct ccgcgttgct cagggcgaac 3240
ggagtcaact ttggtagctg ccttcccaaa aagggcgcgt gcccaggctt tgagttgcac 3300
tcgcaccgta gtggcatcaa aaggtgaccg tgcccggtct gggcgttagg atacagcgcc 3360
tgcataaaag ccttgatctg cttaaaagcc acctgagcct ttgcgccttc agagaagaac 3420
atgccgcaag acttgccgga aaactgattg gccggacagg ccgcgtcgtg cacgcagcac 3480
cttgcgtcgg tgttggagat ctgcaccaca tttcggcccc accggttctt cacgatcttg 3540
gccttgctag actgctcctt cagcgcgcgc tgcccgtttt cgctcgtcac atccatttca 3600
atcacgtgct ccttatttat cataatgctt ccgtgtagac acttaagctc gccttcgatc 3660
tcagcgcagc ggtgcagcca caacgcgcag cccgtgggct cgtgatgctt gtaggtcacc 3720
tctgcaaacg actgcaggta cgcctgcagg aatcgcccca tcatcgtcac aaaggtcttg 3780
ttgctggtga aggtcagctg caacccgcgg tgctcctcgt tcagccaggt cttgcatacg 3840
gccgccagag cttccacttg gtcaggcagt agtttgaagt tcgcctttag atcgttatcc 3900
acgtggtact tgtccatcag cgcgcgcgca gcctccatgc ccttctccca cgcagacacg 3960
atcggcacac tcagcgggtt catcaccgta atttcacttt ccgcttcgct gggctcttcc 4020
tcttcctctt gcgtccgcat accacgcgcc actgggtcgt cttcattcag ccgccgcact 4080
gtgcgcttac ctcctttgcc atgcttgatt agcaccggtg ggttgctgaa acccaccatt 4140
tgtagcgcca catcttctct ttcttcctcg ctgtccacga ttacctctgg tgatggcggg 4200
cgctcgggct tgggagaagg gcgcttcttt ttcttcttgg gcgcaatggc caaatccgcc 4260
gccgaggtcg atggccgcgg gctgggtgtg cgcggcacca gcgcgtcttg tgatgagtct 4320
tcctcgtcct cggactcgat acgccgcctc atccgctttt ttgggggcgc ccggggaggc 4380
ggcggcgacg gggacgggga cgacacgtcc tccatggttg ggggacgtcg cgccgcaccg 4440
cgtccgcgct cgggggtggt ttcgcgctgc tcctcttccc gactggccat ttccttctcc 4500
tataggcaga aaaagatcca caaaagcgaa gatcagcttc ggcgcacgct ggaagacgcg 4560
gaggctctct tcagtaaata ctgcgcgctg actcttaagg actagtttcg cgccctttct 4620
caaatttaag cgcgaaaact acgtcatctc cagcggccac acccggcgcc agcacctgtt 4680
gtcagcgcca ttggcgcgcc cgcccgccgc gcgcttcgct ttttataggg ccgccgccgc 4740
cgccgcctcg ccataaaagg aaactttcgg agcgcgccgc tctgattggc tgccgccgca 4800
cctctccgcc tcgccccgcc ccgcccctcg ccccgccccg ccccgcctgg cgcgcgcccc 4860
cccccccccc ccgcccccat cgctgcacaa aataattaaa aaataaataa atacaaaatt 4920
gggggtgggg agggggggga gatggggaga gtgaagcaga acgtggggct cacctcgagg 4980
ccggccgaat atcttcattt aaatgtgtgt cagttagggt gtggaaagtc cccaggctcc 5040
ccagcaggca gaagtatgca aagcatgcat ctcaattagt cagcaaccag gtgtggaaag 5100
tccccaggct ccccagcagg cagaagtatg caaagcatgc atctcaatta gtcagcaacc 5160
atagtcccgc ccctaactcc gcccatcccg cccctaactc cgcccagttc cgcccattct 5220
ccgccccatg gctgactaat tttttttatt tatgcagagg ccgaggccgc ctcggcctct 5280
gagctattcc agaagtagtg aggaggcttt tttggaggcc taggcttttg caaacgccgg 5340
cgcaccgcgg gcccgatcca ccggtactgt tggtaaagcc accatgtttt ccggtggcgg 5400
cggcccgctg tcccccggag gaaagtcggc ggccagggcg gcgtccgggt tttttgcgcc 5460
cgccggccct cgcggagcca gccggggacc cccgccttgt ttgaggcaaa acttttacaa 5520
cccctacctc gccccagtcg ggacgcaaca gaagccgacc gggccaaccc agcgccatac 5580
gtactatagc gaatgcgatg aatttcgatt catcgccccg cgggtgctgg acgaggatgc 5640
ccccccggag aagcgcgccg gggtgcacga cggtcacctc aagcgcgccc ccaaggtgta 5700
ctgcgggggg gacgagcgcg acgtcctccg cgtcgggtcg ggcggcttct ggccgcggcg 5760
ctcgcgcctg tggggcggcg tggaccacgc cccggcgggg ttcaacccca ccgtcaccgt 5820
ctttcacgtg tacgacatcc tggagaacgt ggagcacgcg tacggcatgc gcgcggccca 5880
gttccacgcg cggtttatgg acgccatcac accgacgggg accgtcatca cgctcctggg 5940
cctgactccg gaaggccacc gggtggccgt tcacgtttac ggcacgcggc agtactttta 6000
catgaacaag gaggaggtcg acaggcacct acaatgccgc gccccacgag atctctgcga 6060
gcgcatggcc gcggccctgc gcgagtcccc gggcgcgtcg ttccgcggca tctccgcgga 6120
ccacttcgag gcggaggtgg tggagcgcac cgacgtgtac tactacgaga cgcgccccgc 6180
tctgttttac cgcgtctacg tccgaagcgg gcgcgtgctg tcgtacctgt gcgacaactt 6240
ctgcccggcc atcaagaagt acgagggtgg ggtcgacgcc accacccggt tcatcctgga 6300
caaccccggg ttcgtcacct tcggctggta ccgtctcaaa ccgggccgga acaacacgct 6360
agcccagccg cgggccccga tggccttcgg gacatccagc gacgtcgagt ttaactgtac 6420
ggcggacaac ctggccatcg aggggggcat gagcgaccta ccggcataca agctcatgtg 6480
cttcgatatc gaatgcaagg cgggggggga ggacgagctg gcctttccgg tggccgggca 6540
cccggaggac ctggtcatcc agatatcctg tctgctctac gacctgtcca ccaccgccct 6600
ggagcacgtc ctcctgtttt cgctcggttc ctgcgacctc cccgaatccc acctgaacga 6660
gctggcggcc aggggcctgc ccacgcccgt ggttctggaa ttcgacagcg aattcgagat 6720
gctgttggcc ttcatgaccc ttgtgaaaca gtacggcccc gagttcgtga ccgggtacaa 6780
catcatcaac ttcgactggc ccttcttgct ggccaagctg acggacattt acaaggtccc 6840
cctggacggg tacggccgca tgaacggccg gggcgtgttt cgcgtgtggg acataggcca 6900
gagccacttc cagaagcgca gcaagataaa ggtgaacggc atggtgaaca tcgacatgta 6960
cgggattata accgacaaga tcaagctctc gagctacaag ctcaacgccg tggccgaagc 7020
cgtcctgaag gacaagaaga aggacctgag ctatcgcgac atccccgcct actacgccgc 7080
cgggcccgcg caacgcgggg tgatcggcga gtactgcata caggattccc tgctggtggg 7140
ccagctgttt tttaagtttt tgccccatct ggagctctcg gccgtcgcgc gcttggcggg 7200
tattaacatc acccgcacca tctacgacgg ccagcagatc cgcgtcttta cgtgcctgct 7260
gcgcctggcc gaccagaagg gctttattct gccggacacc caggggcgat ttaggggcgc 7320
cgggggggag gcgcccaagc gtccggccgc agcccgggag gacgaggagc ggccagagga 7380
ggagggggag gacgaggacg aacgcgagga gggcgggggc gagcgggagc cggagggcgc 7440
gcgggagacc gccggcaggc acgtggggta ccagggggcc agggtccttg accccacttc 7500
cgggtttcac gtgaaccccg tggtggtgtt cgactttgcc agcctgtacc ccagcatcat 7560
ccaggcccac aacctgtgct tcagcacgct ctccctgagg gccgacgcag tggcgcacct 7620
ggaggcgggc aaggactacc tggagatcga ggtggggggg cgacggctgt tcttcgtcaa 7680
ggctcacgtg cgagagagcc tcctcagcat cctcctgcgg gactggctcg ccatgcgaaa 7740
gcagatccgc tcgcggattc cccagagcag ccccgaggag gccgtgctcc tggacaagca 7800
gcaggccgcc atcaaggtcg tgtgtaactc ggtgtacggg ttcacgggag tgcagcacgg 7860
actcctgccg tgcctgcacg ttgccgcgac ggtgacgacc atcggccgcg agatgctgct 7920
cgcgacccgc gagtacgtcc acgcgcgctg ggcggccttc gaacagctcc tggccgattt 7980
cccggaggcg gccgacatgc gcgcccccgg gccctattcc atgcgcatca tctacgggga 8040
cacggactcc atctttgtgc tgtgccgcgg cctcacggcc gccgggctga cggccgtggg 8100
cgacaagatg gcgagccaca tctcgcgcgc gctgtttctg ccccccatca aactcgagtg 8160
cgaaaagacg ttcaccaagc tgctgctgat cgccaagaaa aagtacatcg gcgtcatcta 8220
cgggggtaag atgctcatca agggcgtgga tctggtgcgc aaaaacaact gcgcgtttat 8280
caaccgcacc tccagggccc tggtcgacct gctgttttac gacgataccg tctccggagc 8340
cgccgcggcg ttagccgagc gccccgcgga ggagtggctg gcgcgacccc tgcccgaggg 8400
actgcaggcg ttcggggccg tcctcgtaga cgcccatcgg cgcatcaccg acccggagag 8460
ggacatccag gactttgtcc tcaccgccga actgagcaga cacccgcgcg cgtacaccaa 8520
caagcgcctg gcccacctga cggtgtatta caagctcatg gcccgccgcg cgcaggtccc 8580
gtccatcaag gaccggatcc cgtacgtgat cgtggcccag acccgcgagg tagaggagac 8640
ggtcgcgcgg ctggccgccc tccgcgagct agacgccgcc gccccagggg acgagcccgc 8700
cccccccgcg gccctgccct ccccggccaa gcgcccccgg gagacgccgt cgcctgccga 8760
ccccccggga ggcgcgtcca agccccgcaa gctgctggtg tccgagctgg ccgaggatcc 8820
cgcatacgcc attgcccacg gcgtcgccct gaacacggac tattacttct cccacctgtt 8880
gggggcggcg tgcgtgacat tcaaggccct gtttgggaat aacgccaaga tcaccgagag 8940
tctgttaaaa aggtttattc ccgaagtgtg gcaccccccg gacgacgtgg ccgcgcggct 9000
ccggaccgca gggttcgggg cggtgggtgc cggcgctacg gcggaggaaa ctcgtcgaat 9060
gttgcataga gcctttgata ctctagcaga attcggcagt ggagcaacaa acttctctct 9120
gctgaaacaa gccggagatg tcgaagagaa tcctggaccg acggattccc ctggcggtgt 9180
ggcccccgcc tcccccgtgg aggacgcgtc ggacgcgtcc ctcgggcagc cggaggaggg 9240
ggcgccctgc caggtggtcc tgcagggcgc cgaacttaat ggaatcctac aggcgtttgc 9300
cccgctgcgc acgagccttc tggactcgct tctggttatg ggcgaccggg gcatccttat 9360
ccataacacg atctttgggg agcaggtgtt cctgcccctg gaacactcgc aattcagtcg 9420
gtatcgctgg cgcggaccca cggcggcgtt cctgtctctc gtggaccaga agcgctccct 9480
cctgagcgtg tttcgcgcca accagtaccc ggacctacgt cgggtggagt tggcgatcac 9540
gggccaggcc ccgtttcgca cgctggttca gcgcatatgg acgacgacgt ccgacggcga 9600
ggccgttgag ctagccagcg agacgctgat gaagcgcgaa ctgacgagct ttgtggtgct 9660
ggttccccag ggaacccccg acgttcagtt gcgcctgacg aggccgcagc tcaccaaggt 9720
ccttaacgcg accggggccg atagtgccac gcccaccacg ttcgagctcg gggttaacgg 9780
caaattttcc gtgttcacca cgagtacctg cgtcaccttt gctgcccgcg aggagggcgt 9840
gtcgtccagc accagcaccc aggtccagat cctgtccaac gcgctcacca aggcgggcca 9900
ggccgccgcg aacgccaaga cggtgtacgg ggaaaatacc catcgcacct tctctgtggt 9960
cgtcgacgat tgcagcatgc gggcggtgct ccggcgactg caggtcggcg ggggcaccct 10020
caagttcttc ctcacgaccc ccgtccccag tctgtgcgtc accgccaccg gtcccaacgc 10080
ggtatcggcg gtatttctcc tgaaacccca gaagatttgc ctggactggc tgggtcatag 10140
ccaggggtct ccttcagccg ggagctcggc ctcccgggcc tctgggagcg agccaacaga 10200
cagccaggac tccgcgtcgg acgcggtcag ccacggcgat ccggaagacc tcgatggcgc 10260
tgcccgggcg ggagaggcgg gggccttgca tgcctgtccg atgccgtcgt cgaccacgcg 10320
ggtcactccc acgaccaagc gggggcgctc ggggggcgag gatgcgcgcg cggacacggc 10380
cctaaagaaa cctaagacgg ggtcgcccac cgcacccccg cccgcagatc cagtccccct 10440
ggacacggag gacgactccg atgcggcgga cgggacggcg gcccgtcccg ccgctccaga 10500
cgcccggagc ggaagccgtt acgcgtgtta ctttcgcgac ctcccgaccg gagaagcaag 10560
ccccggcgcc ttctccgcct tccggggggg cccccaaacc ccgtatggtt ttggattccc 10620
ctgataggat ccgactgcag gtagctgtgc cttctagttg ccagccatct gttgtttgcc 10680
cctcccccgt gccttccttg accctggaag gtgccactcc cactgtcctt tcctaataaa 10740
atgaggaaat tgcatcgcat tgtctgagta ggtgtcattc tattctgggg ggtggggtgg 10800
ggcaggacag caagggggag gattgggaag acaatagcag gcatgctggg gatgcggtgg 10860
gctctatggg tttaaacatc gatgcggccg ccgtttgtgt tatgtttcaa cgtgtttatt 10920
tttcaattgc agaaaatttc aagtcatttt tcattcagta gtatagcccc accaccacat 10980
agcttataca gatcaccgta ccttaatcaa actcacagaa ccctagtatt caacctgcca 11040
cctccctccc aacacacaga gtacacagtc ctttctcccc ggctggcctt aaaaagcatc 11100
atatcatggg taacagacat attcttaggt gttatattcc acacggtttc ctgtcgagcc 11160
aaacgctcat cagtgatatt aataaactcc ccgggcagct cacttaagtt catgtcgctg 11220
tccagctgct gagccacagg ctgctgtcca acttgcggtt gcttaacggg cggcgaagga 11280
gaagtccacg cctacatggg ggtagagtca taatcgtgca tcaggatagg gcggtggtgc 11340
tgcagcagcg cgcgaataaa ctgctgccgc cgccgctccg tcctgcagga atacaacatg 11400
gcagtggtct cctcagcgat gattcgcacc gcccgcagca taaggcgcct tgtcctccgg 11460
gcacagcagc gcaccctgat ctcacttaaa tcagcacagt aactgcagca cagcaccaca 11520
atattgttca aaatcccaca gtgcaaggcg ctgtatccaa agctcatggc ggggaccaca 11580
gaacccacgt ggccatcata ccacaagcgc aggtagatta agtggcgacc cctcataaac 11640
acgctggaca taaacattac ctcttttggc atgttgtaat tcaccacctc ccggtaccat 11700
ataaacctct gattaaacat ggcgccatcc accaccatcc taaaccagct ggccaaaacc 11760
tgcccgccgg ctatacactg cagggaaccg ggactggaac aatgacagtg gagagcccag 11820
gactcgtaac catggatcat catgctcgtc atgatatcaa tgttggcaca acacaggcac 11880
acgtgcatac acttcctcag gattacaagc tcctcccgcg ttagaaccat atcccaggga 11940
acaacccatt cctgaatcag cgtaaatccc acactgcagg gaagacctcg cacgtaactc 12000
acgttgtgca ttgtcaaagt gttacattcg ggcagcagcg gatgatcctc cagtatggta 12060
gcgcgggttt ctgtctcaaa aggaggtaga cgatccctac tgtacggagt gcgccgagac 12120
aaccgagatc gtgttggtcg tagtgtcatg ccaaatggaa cgccggacgt agtcatattt 12180
cctgaagcaa aaccaggtgc gggcgtgaca aacagatctg cgtctccggt ctcgccgctt 12240
agatcgctct gtgtagtagt tgtagtatat ccactctctc aaagcatcca ggcgccccct 12300
ggcttcgggt tctatgtaaa ctccttcatg cgccgctgcc ctgataacat ccaccaccgc 12360
agaataagcc acacccagcc aacctacaca ttcgttctgc gagtcacaca cgggaggagc 12420
gggaagagct ggaagaacca tgtttttttt tttattccaa aagattatcc aaaacctcaa 12480
aatgaagatc tattaagtga acgcgctccc ctccggtggc gtggtcaaac tctacagcca 12540
aagaacagat aatggcattt gtaagatgtt gcacaatggc ttccaaaagg caaacggccc 12600
tcacgtccaa gtggacgtaa aggctaaacc cttcagggtg aatctcctct ataaacattc 12660
cagcaccttc aaccatgccc aaataattct catctcgcca ccttctcaat atatctctaa 12720
gcaaatcccg aatattaagt ccggccattg taaaaatctg ctccagagcg ccctccacct 12780
tcagcctcaa gcagcgaatc atgattgcaa aaattcaggt tcctcacaga cctgtataag 12840
attcaaaagc ggaacattaa caaaaatacc gcgatcccgt aggtcccttc gcagggccag 12900
ctgaacataa tcgtgcaggt ctgcacggac cagcgcggcc acttccccgc caggaaccat 12960
gacaaaagaa cccacactga ttatgacacg catactcgga gctatgctaa ccagcgtagc 13020
cccgatgtaa gcttgttgca tgggcggcga tataaaatgc aaggtgctgc tcaaaaaatc 13080
aggcaaagcc tcgcgcaaaa aagaaagcac atcgtagtca tgctcatgca gataaaggca 13140
ggtaagctcc ggaaccacca cagaaaaaga caccattttt ctctcaaaca tgtctgcggg 13200
tttctgcata aacacaaaat aaaataacaa aaaaacattt aaacattaga agcctgtctt 13260
acaacaggaa aaacaaccct tataagcata agacggacta cggccatgcc ggcgtgaccg 13320
taaaaaaact ggtcaccgtg attaaaaagc accaccgaca gctcctcggt catgtccgga 13380
gtcataatgt aagactcggt aaacacatca ggttgattca catcggtcag tgctaaaaag 13440
cgaccgaaat agcccggggg aatacatacc cgcaggcgta gagacaacat tacagccccc 13500
ataggaggta taacaaaatt aataggagag aaaaacacat aaacacctga aaaaccctcc 13560
tgcctaggca aaatagcacc ctcccgctcc agaacaacat acagcgcttc cacagcggca 13620
gccataacag tcagccttac cagtaaaaaa gaaaacctat taaaaaaaca ccactcgaca 13680
cggcaccagc tcaatcagtc acagtgtaaa aaagggccaa gtgcagagcg agtatatata 13740
ggactaaaaa atgacgtaac ggttaaagtc cacaaaaaac acccagaaaa ccgcacgcga 13800
acctacgccc agaaacgaaa gccaaaaaac ccacaacttc ctcaaatcgt cacttccgtt 13860
ttcccacgtt acgtcacttc ccattttaag aaaactacaa ttcccaacac atacaagtta 13920
ctccgccctt aattaaatcg gatccgatat ctagatgtat tcgcgaggta ccgagctcga 13980
attctctggc cgtcgtttta caacgtcgtg actgggaaaa ccctggcgtt acccaactta 14040
atcgccttgc agcacatccc cctttcgcca gctggcgtaa tagcgaagag gcccgcaccg 14100
atcgcccttc ccaacagttg cgcagcctga atggcgaatg gcgcctgatg cggtattttc 14160
tccttacgca tctgtgcggt atttcacacc gcatatggtg cactctcagt acaatctgct 14220
ctgatgccgc atagttaagc cagccccgac acccgccaac acccgctgac gcgccctgac 14280
gggcttgtct gctcccggca tccgcttaca gacaagctgt gaccgtctcc gggagctgca 14340
tgtgtcagag gttttcaccg tcatcaccga aacgcgcga 14379
<210> 60
<211> 14514
<212> DNA
<213> Artificial Sequence
<220>
<223> Description of Artificial Sequence: Synthetic
polynucleotide
<400> 60
tgcagctctg gcccgtgtct caaaatctct gatgttacat tgcacaagat aaaaatatat 60
catcatgaac aataaaactg tctgcttaca taaacagtaa tacaaggggt gttatgagcc 120
atattcaacg ggaaacgtcg aggccgcgat taaattccaa catggatgct gatttatatg 180
ggtataaatg ggctcgcgat aatgtcgggc aatcaggtgc gacaatctat cgcttgtatg 240
ggaagcccga tgcgccagag ttgtttctga aacatggcaa aggtagcgtt gccaatgatg 300
ttacagatga gatggtcaga ctaaactggc tgacggaatt tatgcctctt ccgaccatca 360
agcattttat ccgtactcct gatgatgcat ggttactcac cactgcgatc cccggaaaaa 420
cagcattcca ggtattagaa gaatatcctg attcaggtga aaatattgtt gatgcgctgg 480
cagtgttcct gcgccggttg cattcgattc ctgtttgtaa ttgtcctttt aacagcgatc 540
gcgtatttcg tctcgctcag gcgcaatcac gaatgaataa cggtttggtt gatgcgagtg 600
attttgatga cgagcgtaat ggctggcctg ttgaacaagt ctggaaagaa atgcataaac 660
ttttgccatt ctcaccggat tcagtcgtca ctcatggtga tttctcactt gataacctta 720
tttttgacga ggggaaatta ataggttgta ttgatgttgg acgagtcgga atcgcagacc 780
gataccagga tcttgccatc ctatggaact gcctcggtga gttttctcct tcattacaga 840
aacggctttt tcaaaaatat ggtattgata atcctgatat gaataaattg cagtttcatt 900
tgatgctcga tgagtttttc taatcagaat tggttaattg gttgtaacat tattcagatt 960
gggcttgatt taaaacttca tttttaattt aaaaggatct aggtgaagat cctttttgat 1020
aatctcatga ccaaaatccc ttaacgtgag ttttcgttcc actgagcgtc agaccccgta 1080
gaaaagatca aaggatcttc ttgagatcct ttttttctgc gcgtaatctg ctgcttgcaa 1140
acaaaaaaac caccgctacc agcggtggtt tgtttgccgg atcaagagct accaactctt 1200
tttccgaagg taactggctt cagcagagcg cagataccaa atactgttct tctagtgtag 1260
ccgtagttag gccaccactt caagaactct gtagcaccgc ctacatacct cgctctgcta 1320
atcctgttac cagtggctgc tgccagtggc gataagtcgt gtcttaccgg gttggactca 1380
agacgatagt taccggataa ggcgcagcgg tcgggctgaa cggggggttc gtgcacacag 1440
cccagcttgg agcgaacgac ctacaccgaa ctgagatacc tacagcgtga gctatgagaa 1500
agcgccacgc ttcccgaagg gagaaaggcg gacaggtatc cggtaagcgg cagggtcgga 1560
acaggagagc gcacgaggga gcttccaggg ggaaacgcct ggtatcttta tagtcctgtc 1620
gggtttcgcc acctctgact tgagcgtcga tttttgtgat gctcgtcagg ggggcggagc 1680
ctatggaaaa acgccagcaa cgcggccttt ttacggttcc tggccttttg ctggcctttt 1740
gctcacatgt tctttcctgc gttatcccct gattctgtgg ataaccgtat taccgccttt 1800
gagtgagctg ataccgctcg ccgcagccga acgaccgagc gcagcgagtc agtgagcgag 1860
gaagcggaag agcgcccaat acgcaaaccg cctctccccg cgcgttggcc gattcattaa 1920
tgcagctggc acgacaggtt tcccgactgg aaagcgggca gtgagcgcaa cgcaattaat 1980
gtgagttagc tcactcatta ggcaccccag gctttacact ttatgcttcc ggctcgtatg 2040
ttgtgtggaa ttgtgagcgg ataacaattt cacacaggaa acagctatga ccatgattac 2100
accaagcttg catgcaggcc tatccgtaga tgtacctgga catccaggtg atgccggcgg 2160
cggtggtgga ggcgcgcgga aagtcgcgga cgcggttcca gatgttgcgc agcggcaaaa 2220
agtgctccat ggtcgggacg ctctggccgg tgaggcgtgc gcagtcgttg acgctctaga 2280
ccgtgcaaaa ggagagcctg taagcgggca ctcttccgtg gtctggtgga taaattcgca 2340
agggtatcat ggcggacgac cggggttcga accccggatc cggccgtccg ccgtgatcca 2400
tgcggttacc gcccgcgtgt cgaacccagg tgtgcgacgt cagacaacgg gggagcgctc 2460
cttttggctt ccttccaggc gcggcggctg ctgcgctagc ttttttggcc actggccgcg 2520
cgcggcgtaa gcggttaggc tggaaagcga aagcattaag tggctcgctc cctgtagccg 2580
gagggttatt ttccaagggt tgagtcgcag gacccccggt tcgagtctcg ggccggccgg 2640
actgcggcga acgggggttt gcctccccgt catgcaagac cccgcttgca aattcctccg 2700
gaaacaggga cgagcccctt ttttgctttt cccagatgca tccggtgctg cggcagatgc 2760
gcccccctcc tcagcagcgg caagagcaag agcagcggca gacatgcagg gcaccctccc 2820
cttctcctac cgcgtcagga ggggcaacat cgatccagac atgataagat acattgatga 2880
gtttggacaa accacaacta gaatgcagtg aaaaaaatgc tttatttgtg aaatttgtga 2940
tgctattgct ttatttgtaa ccattataag ctgcaataaa caagtttgta cactctcggg 3000
tgattattta cccccaccct tgccgtctgc gccgtttaaa aatcaaaggg gttctgccgc 3060
gcatcgctat gcgccactgg cagggacacg ttgcgatact ggtgtttagt gctccactta 3120
aactcaggca caaccatccg cggcagctcg gtgaagtttt cactccacag gctgcgcacc 3180
atcaccaacg cgtttagcag gtcgggcgcc gatatcttga agtcgcagtt ggggcctccg 3240
ccctgcgcgc gcgagttgcg atacacaggg ttgcagcact ggaacactat cagcgccggg 3300
tggtgcacgc tggccagcac gctcttgtcg gagatcagat ccgcgtccag gtcctccgcg 3360
ttgctcaggg cgaacggagt caactttggt agctgccttc ccaaaaaggg cgcgtgccca 3420
ggctttgagt tgcactcgca ccgtagtggc atcaaaaggt gaccgtgccc ggtctgggcg 3480
ttaggataca gcgcctgcat aaaagccttg atctgcttaa aagccacctg agcctttgcg 3540
ccttcagaga agaacatgcc gcaagacttg ccggaaaact gattggccgg acaggccgcg 3600
tcgtgcacgc agcaccttgc gtcggtgttg gagatctgca ccacatttcg gccccaccgg 3660
ttcttcacga tcttggcctt gctagactgc tccttcagcg cgcgctgccc gttttcgctc 3720
gtcacatcca tttcaatcac gtgctcctta tttatcataa tgcttccgtg tagacactta 3780
agctcgcctt cgatctcagc gcagcggtgc agccacaacg cgcagcccgt gggctcgtga 3840
tgcttgtagg tcacctctgc aaacgactgc aggtacgcct gcaggaatcg ccccatcatc 3900
gtcacaaagg tcttgttgct ggtgaaggtc agctgcaacc cgcggtgctc ctcgttcagc 3960
caggtcttgc atacggccgc cagagcttcc acttggtcag gcagtagttt gaagttcgcc 4020
tttagatcgt tatccacgtg gtacttgtcc atcagcgcgc gcgcagcctc catgcccttc 4080
tcccacgcag acacgatcgg cacactcagc gggttcatca ccgtaatttc actttccgct 4140
tcgctgggct cttcctcttc ctcttgcgtc cgcataccac gcgccactgg gtcgtcttca 4200
ttcagccgcc gcactgtgcg cttacctcct ttgccatgct tgattagcac cggtgggttg 4260
ctgaaaccca ccatttgtag cgccacatct tctctttctt cctcgctgtc cacgattacc 4320
tctggtgatg gcgggcgctc gggcttggga gaagggcgct tctttttctt cttgggcgca 4380
atggccaaat ccgccgccga ggtcgatggc cgcgggctgg gtgtgcgcgg caccagcgcg 4440
tcttgtgatg agtcttcctc gtcctcggac tcgatacgcc gcctcatccg cttttttggg 4500
ggcgcccggg gaggcggcgg cgacggggac ggggacgaca cgtcctccat ggttggggga 4560
cgtcgcgccg caccgcgtcc gcgctcgggg gtggtttcgc gctgctcctc ttcccgactg 4620
gccatttcct tctcctatag gcagaaaaag atccacaaaa gcgaagatca gcttcggcgc 4680
acgctggaag acgcggaggc tctcttcagt aaatactgcg cgctgactct taaggactag 4740
tttcgcgccc tttctcaaat ttaagcgcga aaactacgtc atctccagcg gccacacccg 4800
gcgccagcac ctgttgtcag cgccattggc gcgcccgccc gccgcgcgct tcgcttttta 4860
tagggccgcc gccgccgccg cctcgccata aaaggaaact ttcggagcgc gccgctctga 4920
ttggctgccg ccgcacctct ccgcctcgcc ccgccccgcc cctcgccccg ccccgccccg 4980
cctggcgcgc gccccccccc cccccccgcc cccatcgctg cacaaaataa ttaaaaaata 5040
aataaataca aaattggggg tggggagggg ggggagatgg ggagagtgaa gcagaacgtg 5100
gggctcacct cgaggccggc cgaatatctt catttaaatg tgtgtcagtt agggtgtgga 5160
aagtccccag gctccccagc aggcagaagt atgcaaagca tgcatctcaa ttagtcagca 5220
accaggtgtg gaaagtcccc aggctcccca gcaggcagaa gtatgcaaag catgcatctc 5280
aattagtcag caaccatagt cccgccccta actccgccca tcccgcccct aactccgccc 5340
agttccgccc attctccgcc ccatggctga ctaatttttt ttatttatgc agaggccgag 5400
gccgcctcgg cctctgagct attccagaag tagtgaggag gcttttttgg aggcctaggc 5460
ttttgcaaac gccggcgcac cgcgggcccg atccaccggt actgttggta aagccaccat 5520
gttttccggt ggcggcggcc cgctgtcccc cggaggaaag tcggcggcca gggcggcgtc 5580
cgggtttttt gcgcccgccg gccctcgcgg agccagccgg ggacccccgc cttgtttgag 5640
gcaaaacttt tacaacccct acctcgcccc agtcgggacg caacagaagc cgaccgggcc 5700
aacccagcgc catacgtact atagcgaatg cgatgaattt cgattcatcg ccccgcgggt 5760
gctggacgag gatgcccccc cggagaagcg cgccggggtg cacgacggtc acctcaagcg 5820
cgcccccaag gtgtactgcg ggggggacga gcgcgacgtc ctccgcgtcg ggtcgggcgg 5880
cttctggccg cggcgctcgc gcctgtgggg cggcgtggac cacgccccgg cggggttcaa 5940
ccccaccgtc accgtctttc acgtgtacga catcctggag aacgtggagc acgcgtacgg 6000
catgcgcgcg gcccagttcc acgcgcggtt tatggacgcc atcacaccga cggggaccgt 6060
catcacgctc ctgggcctga ctccggaagg ccaccgggtg gccgttcacg tttacggcac 6120
gcggcagtac ttttacatga acaaggagga ggtcgacagg cacctacaat gccgcgcccc 6180
acgagatctc tgcgagcgca tggccgcggc cctgcgcgag tccccgggcg cgtcgttccg 6240
cggcatctcc gcggaccact tcgaggcgga ggtggtggag cgcaccgacg tgtactacta 6300
cgagacgcgc cccgctctgt tttaccgcgt ctacgtccga agcgggcgcg tgctgtcgta 6360
cctgtgcgac aacttctgcc cggccatcaa gaagtacgag ggtggggtcg acgccaccac 6420
ccggttcatc ctggacaacc ccgggttcgt caccttcggc tggtaccgtc tcaaaccggg 6480
ccggaacaac acgctagccc agccgcgggc cccgatggcc ttcgggacat ccagcgacgt 6540
cgagtttaac tgtacggcgg acaacctggc catcgagggg ggcatgagcg acctaccggc 6600
atacaagctc atgtgcttcg atatcgaatg caaggcgggg ggggaggacg agctggcctt 6660
tccggtggcc gggcacccgg aggacctggt catccagata tcctgtctgc tctacgacct 6720
gtccaccacc gccctggagc acgtcctcct gttttcgctc ggttcctgcg acctccccga 6780
atcccacctg aacgagctgg cggccagggg cctgcccacg cccgtggttc tggaattcga 6840
cagcgaattc gagatgctgt tggccttcat gacccttgtg aaacagtacg gccccgagtt 6900
cgtgaccggg tacaacatca tcaacttcga ctggcccttc ttgctggcca agctgacgga 6960
catttacaag gtccccctgg acgggtacgg ccgcatgaac ggccggggcg tgtttcgcgt 7020
gtgggacata ggccagagcc acttccagaa gcgcagcaag ataaaggtga acggcatggt 7080
gaacatcgac atgtacggga ttataaccga caagatcaag ctctcgagct acaagctcaa 7140
cgccgtggcc gaagccgtcc tgaaggacaa gaagaaggac ctgagctatc gcgacatccc 7200
cgcctactac gccgccgggc ccgcgcaacg cggggtgatc ggcgagtact gcatacagga 7260
ttccctgctg gtgggccagc tgttttttaa gtttttgccc catctggagc tctcggccgt 7320
cgcgcgcttg gcgggtatta acatcacccg caccatctac gacggccagc agatccgcgt 7380
ctttacgtgc ctgctgcgcc tggccgacca gaagggcttt attctgccgg acacccaggg 7440
gcgatttagg ggcgccgggg gggaggcgcc caagcgtccg gccgcagccc gggaggacga 7500
ggagcggcca gaggaggagg gggaggacga ggacgaacgc gaggagggcg ggggcgagcg 7560
ggagccggag ggcgcgcggg agaccgccgg caggcacgtg gggtaccagg gggccagggt 7620
ccttgacccc acttccgggt ttcacgtgaa ccccgtggtg gtgttcgact ttgccagcct 7680
gtaccccagc atcatccagg cccacaacct gtgcttcagc acgctctccc tgagggccga 7740
cgcagtggcg cacctggagg cgggcaagga ctacctggag atcgaggtgg gggggcgacg 7800
gctgttcttc gtcaaggctc acgtgcgaga gagcctcctc agcatcctcc tgcgggactg 7860
gctcgccatg cgaaagcaga tccgctcgcg gattccccag agcagccccg aggaggccgt 7920
gctcctggac aagcagcagg ccgccatcaa ggtcgtgtgt aactcggtgt acgggttcac 7980
gggagtgcag cacggactcc tgccgtgcct gcacgttgcc gcgacggtga cgaccatcgg 8040
ccgcgagatg ctgctcgcga cccgcgagta cgtccacgcg cgctgggcgg ccttcgaaca 8100
gctcctggcc gatttcccgg aggcggccga catgcgcgcc cccgggccct attccatgcg 8160
catcatctac ggggacacgg actccatctt tgtgctgtgc cgcggcctca cggccgccgg 8220
gctgacggcc gtgggcgaca agatggcgag ccacatctcg cgcgcgctgt ttctgccccc 8280
catcaaactc gagtgcgaaa agacgttcac caagctgctg ctgatcgcca agaaaaagta 8340
catcggcgtc atctacgggg gtaagatgct catcaagggc gtggatctgg tgcgcaaaaa 8400
caactgcgcg tttatcaacc gcacctccag ggccctggtc gacctgctgt tttacgacga 8460
taccgtctcc ggagccgccg cggcgttagc cgagcgcccc gcggaggagt ggctggcgcg 8520
acccctgccc gagggactgc aggcgttcgg ggccgtcctc gtagacgccc atcggcgcat 8580
caccgacccg gagagggaca tccaggactt tgtcctcacc gccgaactga gcagacaccc 8640
gcgcgcgtac accaacaagc gcctggccca cctgacggtg tattacaagc tcatggcccg 8700
ccgcgcgcag gtcccgtcca tcaaggaccg gatcccgtac gtgatcgtgg cccagacccg 8760
cgaggtagag gagacggtcg cgcggctggc cgccctccgc gagctagacg ccgccgcccc 8820
aggggacgag cccgcccccc ccgcggccct gccctccccg gccaagcgcc cccgggagac 8880
gccgtcgcct gccgaccccc cgggaggcgc gtccaagccc cgcaagctgc tggtgtccga 8940
gctggccgag gatcccgcat acgccattgc ccacggcgtc gccctgaaca cggactatta 9000
cttctcccac ctgttggggg cggcgtgcgt gacattcaag gccctgtttg ggaataacgc 9060
caagatcacc gagagtctgt taaaaaggtt tattcccgaa gtgtggcacc ccccggacga 9120
cgtggccgcg cggctccgga ccgcagggtt cggggcggtg ggtgccggcg ctacggcgga 9180
ggaaactcgt cgaatgttgc atagagcctt tgatactcta gcagaattcg gcagtggagc 9240
aacaaacttc tctctgctga aacaagccgg agatgtcgaa gagaatcctg gaccgacgga 9300
ttcccctggc ggtgtggccc ccgcctcccc cgtggaggac gcgtcggacg cgtccctcgg 9360
gcagccggag gagggggcgc cctgccaggt ggtcctgcag ggcgccgaac ttaatggaat 9420
cctacaggcg tttgccccgc tgcgcacgag ccttctggac tcgcttctgg ttatgggcga 9480
ccggggcatc cttatccata acacgatctt tggggagcag gtgttcctgc ccctggaaca 9540
ctcgcaattc agtcggtatc gctggcgcgg acccacggcg gcgttcctgt ctctcgtgga 9600
ccagaagcgc tccctcctga gcgtgtttcg cgccaaccag tacccggacc tacgtcgggt 9660
ggagttggcg atcacgggcc aggccccgtt tcgcacgctg gttcagcgca tatggacgac 9720
gacgtccgac ggcgaggccg ttgagctagc cagcgagacg ctgatgaagc gcgaactgac 9780
gagctttgtg gtgctggttc cccagggaac ccccgacgtt cagttgcgcc tgacgaggcc 9840
gcagctcacc aaggtcctta acgcgaccgg ggccgatagt gccacgccca ccacgttcga 9900
gctcggggtt aacggcaaat tttccgtgtt caccacgagt acctgcgtca cctttgctgc 9960
ccgcgaggag ggcgtgtcgt ccagcaccag cacccaggtc cagatcctgt ccaacgcgct 10020
caccaaggcg ggccaggccg ccgcgaacgc caagacggtg tacggggaaa atacccatcg 10080
caccttctct gtggtcgtcg acgattgcag catgcgggcg gtgctccggc gactgcaggt 10140
cggcgggggc accctcaagt tcttcctcac gacccccgtc cccagtctgt gcgtcaccgc 10200
caccggtccc aacgcggtat cggcggtatt tctcctgaaa ccccagaaga tttgcctgga 10260
ctggctgggt catagccagg ggtctccttc agccgggagc tcggcctccc gggcctctgg 10320
gagcgagcca acagacagcc aggactccgc gtcggacgcg gtcagccacg gcgatccgga 10380
agacctcgat ggcgctgccc gggcgggaga ggcgggggcc ttgcatgcct gtccgatgcc 10440
gtcgtcgacc acgcgggtca ctcccacgac caagcggggg cgctcggggg gcgaggatgc 10500
gcgcgcggac acggccctaa agaaacctaa gacggggtcg cccaccgcac ccccgcccgc 10560
agatccagtc cccctggaca cggaggacga ctccgatgcg gcggacggga cggcggcccg 10620
tcccgccgct ccagacgccc ggagcggaag ccgttacgcg tgttactttc gcgacctccc 10680
gaccggagaa gcaagccccg gcgccttctc cgccttccgg gggggccccc aaaccccgta 10740
tggttttgga ttcccctgat aggatccgac tgcaggtagc tgtgccttct agttgccagc 10800
catctgttgt ttgcccctcc cccgtgcctt ccttgaccct ggaaggtgcc actcccactg 10860
tcctttccta ataaaatgag gaaattgcat cgcattgtct gagtaggtgt cattctattc 10920
tggggggtgg ggtggggcag gacagcaagg gggaggattg ggaagacaat agcaggcatg 10980
ctggggatgc ggtgggctct atgggtttaa acatcgatgc ggccgccgtt tgtgttatgt 11040
ttcaacgtgt ttatttttca attgcagaaa atttcaagtc atttttcatt cagtagtata 11100
gccccaccac cacatagctt atacagatca ccgtacctta atcaaactca cagaacccta 11160
gtattcaacc tgccacctcc ctcccaacac acagagtaca cagtcctttc tccccggctg 11220
gccttaaaaa gcatcatatc atgggtaaca gacatattct taggtgttat attccacacg 11280
gtttcctgtc gagccaaacg ctcatcagtg atattaataa actccccggg cagctcactt 11340
aagttcatgt cgctgtccag ctgctgagcc acaggctgct gtccaacttg cggttgctta 11400
acgggcggcg aaggagaagt ccacgcctac atgggggtag agtcataatc gtgcatcagg 11460
atagggcggt ggtgctgcag cagcgcgcga ataaactgct gccgccgccg ctccgtcctg 11520
caggaataca acatggcagt ggtctcctca gcgatgattc gcaccgcccg cagcataagg 11580
cgccttgtcc tccgggcaca gcagcgcacc ctgatctcac ttaaatcagc acagtaactg 11640
cagcacagca ccacaatatt gttcaaaatc ccacagtgca aggcgctgta tccaaagctc 11700
atggcgggga ccacagaacc cacgtggcca tcataccaca agcgcaggta gattaagtgg 11760
cgacccctca taaacacgct ggacataaac attacctctt ttggcatgtt gtaattcacc 11820
acctcccggt accatataaa cctctgatta aacatggcgc catccaccac catcctaaac 11880
cagctggcca aaacctgccc gccggctata cactgcaggg aaccgggact ggaacaatga 11940
cagtggagag cccaggactc gtaaccatgg atcatcatgc tcgtcatgat atcaatgttg 12000
gcacaacaca ggcacacgtg catacacttc ctcaggatta caagctcctc ccgcgttaga 12060
accatatccc agggaacaac ccattcctga atcagcgtaa atcccacact gcagggaaga 12120
cctcgcacgt aactcacgtt gtgcattgtc aaagtgttac attcgggcag cagcggatga 12180
tcctccagta tggtagcgcg ggtttctgtc tcaaaaggag gtagacgatc cctactgtac 12240
ggagtgcgcc gagacaaccg agatcgtgtt ggtcgtagtg tcatgccaaa tggaacgccg 12300
gacgtagtca tatttcctga agcaaaacca ggtgcgggcg tgacaaacag atctgcgtct 12360
ccggtctcgc cgcttagatc gctctgtgta gtagttgtag tatatccact ctctcaaagc 12420
atccaggcgc cccctggctt cgggttctat gtaaactcct tcatgcgccg ctgccctgat 12480
aacatccacc accgcagaat aagccacacc cagccaacct acacattcgt tctgcgagtc 12540
acacacggga ggagcgggaa gagctggaag aaccatgttt ttttttttat tccaaaagat 12600
tatccaaaac ctcaaaatga agatctatta agtgaacgcg ctcccctccg gtggcgtggt 12660
caaactctac agccaaagaa cagataatgg catttgtaag atgttgcaca atggcttcca 12720
aaaggcaaac ggccctcacg tccaagtgga cgtaaaggct aaacccttca gggtgaatct 12780
cctctataaa cattccagca ccttcaacca tgcccaaata attctcatct cgccaccttc 12840
tcaatatatc tctaagcaaa tcccgaatat taagtccggc cattgtaaaa atctgctcca 12900
gagcgccctc caccttcagc ctcaagcagc gaatcatgat tgcaaaaatt caggttcctc 12960
acagacctgt ataagattca aaagcggaac attaacaaaa ataccgcgat cccgtaggtc 13020
ccttcgcagg gccagctgaa cataatcgtg caggtctgca cggaccagcg cggccacttc 13080
cccgccagga accatgacaa aagaacccac actgattatg acacgcatac tcggagctat 13140
gctaaccagc gtagccccga tgtaagcttg ttgcatgggc ggcgatataa aatgcaaggt 13200
gctgctcaaa aaatcaggca aagcctcgcg caaaaaagaa agcacatcgt agtcatgctc 13260
atgcagataa aggcaggtaa gctccggaac caccacagaa aaagacacca tttttctctc 13320
aaacatgtct gcgggtttct gcataaacac aaaataaaat aacaaaaaaa catttaaaca 13380
ttagaagcct gtcttacaac aggaaaaaca acccttataa gcataagacg gactacggcc 13440
atgccggcgt gaccgtaaaa aaactggtca ccgtgattaa aaagcaccac cgacagctcc 13500
tcggtcatgt ccggagtcat aatgtaagac tcggtaaaca catcaggttg attcacatcg 13560
gtcagtgcta aaaagcgacc gaaatagccc gggggaatac atacccgcag gcgtagagac 13620
aacattacag cccccatagg aggtataaca aaattaatag gagagaaaaa cacataaaca 13680
cctgaaaaac cctcctgcct aggcaaaata gcaccctccc gctccagaac aacatacagc 13740
gcttccacag cggcagccat aacagtcagc cttaccagta aaaaagaaaa cctattaaaa 13800
aaacaccact cgacacggca ccagctcaat cagtcacagt gtaaaaaagg gccaagtgca 13860
gagcgagtat atataggact aaaaaatgac gtaacggtta aagtccacaa aaaacaccca 13920
gaaaaccgca cgcgaaccta cgcccagaaa cgaaagccaa aaaacccaca acttcctcaa 13980
atcgtcactt ccgttttccc acgttacgtc acttcccatt ttaagaaaac tacaattccc 14040
aacacataca agttactccg cccttaatta aatcggatcc gatatctaga tgtattcgcg 14100
aggtaccgag ctcgaattct ctggccgtcg ttttacaacg tcgtgactgg gaaaaccctg 14160
gcgttaccca acttaatcgc cttgcagcac atcccccttt cgccagctgg cgtaatagcg 14220
aagaggcccg caccgatcgc ccttcccaac agttgcgcag cctgaatggc gaatggcgcc 14280
tgatgcggta ttttctcctt acgcatctgt gcggtatttc acaccgcata tggtgcactc 14340
tcagtacaat ctgctctgat gccgcatagt taagccagcc ccgacacccg ccaacacccg 14400
ctgacgcgcc ctgacgggct tgtctgctcc cggcatccgc ttacagacaa gctgtgaccg 14460
tctccgggag ctgcatgtgt cagaggtttt caccgtcatc accgaaacgc gcga 14514
<210> 61
<211> 15031
<212> DNA
<213> Artificial Sequence
<220>
<223> Description of Artificial Sequence: Synthetic
polynucleotide
<400> 61
tgcagctctg gcccgtgtct caaaatctct gatgttacat tgcacaagat aaaaatatat 60
catcatgaac aataaaactg tctgcttaca taaacagtaa tacaaggggt gttatgagcc 120
atattcaacg ggaaacgtcg aggccgcgat taaattccaa catggatgct gatttatatg 180
ggtataaatg ggctcgcgat aatgtcgggc aatcaggtgc gacaatctat cgcttgtatg 240
ggaagcccga tgcgccagag ttgtttctga aacatggcaa aggtagcgtt gccaatgatg 300
ttacagatga gatggtcaga ctaaactggc tgacggaatt tatgcctctt ccgaccatca 360
agcattttat ccgtactcct gatgatgcat ggttactcac cactgcgatc cccggaaaaa 420
cagcattcca ggtattagaa gaatatcctg attcaggtga aaatattgtt gatgcgctgg 480
cagtgttcct gcgccggttg cattcgattc ctgtttgtaa ttgtcctttt aacagcgatc 540
gcgtatttcg tctcgctcag gcgcaatcac gaatgaataa cggtttggtt gatgcgagtg 600
attttgatga cgagcgtaat ggctggcctg ttgaacaagt ctggaaagaa atgcataaac 660
ttttgccatt ctcaccggat tcagtcgtca ctcatggtga tttctcactt gataacctta 720
tttttgacga ggggaaatta ataggttgta ttgatgttgg acgagtcgga atcgcagacc 780
gataccagga tcttgccatc ctatggaact gcctcggtga gttttctcct tcattacaga 840
aacggctttt tcaaaaatat ggtattgata atcctgatat gaataaattg cagtttcatt 900
tgatgctcga tgagtttttc taatcagaat tggttaattg gttgtaacat tattcagatt 960
gggcttgatt taaaacttca tttttaattt aaaaggatct aggtgaagat cctttttgat 1020
aatctcatga ccaaaatccc ttaacgtgag ttttcgttcc actgagcgtc agaccccgta 1080
gaaaagatca aaggatcttc ttgagatcct ttttttctgc gcgtaatctg ctgcttgcaa 1140
acaaaaaaac caccgctacc agcggtggtt tgtttgccgg atcaagagct accaactctt 1200
tttccgaagg taactggctt cagcagagcg cagataccaa atactgttct tctagtgtag 1260
ccgtagttag gccaccactt caagaactct gtagcaccgc ctacatacct cgctctgcta 1320
atcctgttac cagtggctgc tgccagtggc gataagtcgt gtcttaccgg gttggactca 1380
agacgatagt taccggataa ggcgcagcgg tcgggctgaa cggggggttc gtgcacacag 1440
cccagcttgg agcgaacgac ctacaccgaa ctgagatacc tacagcgtga gctatgagaa 1500
agcgccacgc ttcccgaagg gagaaaggcg gacaggtatc cggtaagcgg cagggtcgga 1560
acaggagagc gcacgaggga gcttccaggg ggaaacgcct ggtatcttta tagtcctgtc 1620
gggtttcgcc acctctgact tgagcgtcga tttttgtgat gctcgtcagg ggggcggagc 1680
ctatggaaaa acgccagcaa cgcggccttt ttacggttcc tggccttttg ctggcctttt 1740
gctcacatgt tctttcctgc gttatcccct gattctgtgg ataaccgtat taccgccttt 1800
gagtgagctg ataccgctcg ccgcagccga acgaccgagc gcagcgagtc agtgagcgag 1860
gaagcggaag agcgcccaat acgcaaaccg cctctccccg cgcgttggcc gattcattaa 1920
tgcagctggc acgacaggtt tcccgactgg aaagcgggca gtgagcgcaa cgcaattaat 1980
gtgagttagc tcactcatta ggcaccccag gctttacact ttatgcttcc ggctcgtatg 2040
ttgtgtggaa ttgtgagcgg ataacaattt cacacaggaa acagctatga ccatgattac 2100
accaagcttg catgcaggcc tctgcagtcg accagaagca ccatgtcctt gggtccggcc 2160
tgctgaatgc gcaggcggtc ggccatgccc caggcttcgt tttgacatcg gcgcaggtct 2220
ttgtagtagt cttgcatgag cctttctacc ggcacttctt cttctccttc ctcttgtcct 2280
gcatctcttg catctatcgc tgcggcggcg gcggagtttg gccgtaggtg gcgccctctt 2340
cctcccatgc gtgtgacccc gaagcccctc atcggctgaa gcagggctag gtcggcgaca 2400
acgcgctcgg ctaatatggc ctgctgcacc tgcgtgaggg tagactggaa gtcatccatg 2460
tccacaaagc ggtggtatgc gcccgtgttg atggtgtaag tgcagttggc cataacggac 2520
cagttaacgg tctggtgacc cggctgcgag agctcggtgt acctgagacg cgagtaagcc 2580
ctcgagtcaa atacgtagtc gttgcaagtc cgcaccaggt actggtatcc caccaaaaag 2640
tgcggcggcg gctggcggta gaggggccag cgtagggtgg ccggggctcc gggggcgaga 2700
tcttccaaca taaggcgatg atatccgtag atgtacctgg acatccaggt gatgccggcg 2760
gcggtggtgg aggcgcgcgg aaagtcgcgg acgcggttcc agatgttgcg cagcggcaaa 2820
aagtgctcca tggtcgggac gctctggccg gtcaggcgcg cgcaatcgtt gacgctctag 2880
cgtgcaaaag gagagcctgt aagcgggcac tcttccgtgg tctggtggat aaattcgcaa 2940
gggtatcatg gcggacgacc ggggttcgag ccccgtatcc ggccgtccgc cgtgatccat 3000
gcggttaccg cccgcgtgtc gaacccaggt gtgcgacgtc agacaacggg ggagtgctcc 3060
ttttggcttc cttccaggcg cggcggctgc tgcgctagct tttttggcca ctggccgcgc 3120
gcagcgtaag cggttaggct ggaaagcgaa agcattaagt ggctcgctcc ctgtagccgg 3180
agggttattt tccaagggtt gagtcgcggg acccccggtt cgagtctcgg accgagactg 3240
ggggcgtaca ctggatggcc tttgcctgga acccgcactc aaaaacatgc tacctctttg 3300
agccctttgg cttttctgac cagcgactca agcaggttta ccagtttgag tacgagtcac 3360
tcctgcgccg tagcgccatt gcttcttccc ccgaccgctg tataacgctg gaaaagtcca 3420
cccaaagcgt acaggggccc aactcggccg cctgtggact attctgctgc atgtttctcc 3480
acgcctttgc caactggccc caaactccca tggatcacaa ccccaccatg aaccttatta 3540
ccggggtacc caactccatg ctcaacagtc cccaggtaca gcccaccctg cgtcgcaacc 3600
aggaacagct ctacagcttc ctggagcgcc actcgcccta cttccgcagc cacagtgcgc 3660
agattaggag cgccacttct ttttgtcact tgaaaaacat gtaaaaataa tgtactagag 3720
acactttcaa taaaggcaaa tgcttttatt tgtacactct cgggtgatta tttaccccca 3780
cccttgccgt ctgcgccgtt taaaaatcaa aggggttctg ccgcgcatcg ctatgcgcca 3840
ctggcaggga cacgttgcga tactggtgtt tagtgctcca cttaaactca ggcacaacca 3900
tccgcggcag ctcggtgaag ttttcactcc acaggctgcg caccatcacc aacgcgttta 3960
gcaggtcggg cgccgatatc ttgaagtcgc agttggggcc tccgccctgc gcgcgcgagt 4020
tgcgatacac agggttgcag cactggaaca ctatcagcgc cgggtggtgc acgctggcca 4080
gcacgctctt gtcggagatc agatccgcgt ccaggtcctc cgcgttgctc agggcgaacg 4140
gagtcaactt tggtagctgc cttcccaaaa agggcgcgtg cccaggcttt gagttgcact 4200
cgcaccgtag tggcatcaaa aggtgaccgt gcccggtctg ggcgttagga tacagcgcct 4260
gcataaaagc cttgatctgc ttaaaagcca cctgagcctt tgcgccttca gagaagaaca 4320
tgccgcaaga cttgccggaa aactgattgg ccggacaggc cgcgtcgtgc acgcagcacc 4380
ttgcgtcggt gttggagatc tgcaccacat ttcggcccca ccggttcttc acgatcttgg 4440
ccttgctaga ctgctccttc agcgcgcgct gcccgttttc gctcgtcaca tccatttcaa 4500
tcacgtgctc cttatttatc ataatgcttc cgtgtagaca cttaagctcg ccttcgatct 4560
cagcgcagcg gtgcagccac aacgcgcagc ccgtgggctc gtgatgcttg taggtcacct 4620
ctgcaaacga ctgcaggtac gcctgcagga atcgccccat catcgtcaca aaggtcttgt 4680
tgctggtgaa ggtcagctgc aacccgcggt gctcctcgtt cagccaggtc ttgcatacgg 4740
ccgccagagc ttccacttgg tcaggcagta gtttgaagtt cgcctttaga tcgttatcca 4800
cgtggtactt gtccatcagc gcgcgcgcag cctccatgcc cttctcccac gcagacacga 4860
tcggcacact cagcgggttc atcaccgtaa tttcactttc cgcttcgctg ggctcttcct 4920
cttcctcttg cgtccgcata ccacgcgcca ctgggtcgtc ttcattcagc cgccgcactg 4980
tgcgcttacc tcctttgcca tgcttgatta gcaccggtgg gttgctgaaa cccaccattt 5040
gtagcgccac atcttctctt tcttcctcgc tgtccacgat tacctctggt gatggcgggc 5100
gctcgggctt gggagaaggg cgcttctttt tcttcttggg cgcaatggcc aaatccgccg 5160
ccgaggtcga tggccgcggg ctgggtgtgc gcggcaccag cgcgtcttgt gatgagtctt 5220
cctcgtcctc ggactcgata cgccgcctca tccgcttttt tgggggcgcc cggggaggcg 5280
gcggcgacgg ggacggggac gacacgtcct ccatggttgg gggacgtcgc gccgcaccgc 5340
gtccgcgctc gggggtggtt tcgcgctgct cctcttcccg actggccatt tccttctcct 5400
ataggcagaa aaagatccac aaaagcgaag atcagcttcg gcgcacgctg gaagacgcgg 5460
aggctctctt cagtaaatac tgcgcgctga ctcttaagga ctagtttcgc gccctttctc 5520
aaatttaagc gcgaaaacta cgtcatctcc agcggccaca cccggcgcca gcacctgttg 5580
tcagcgccat tggcgcgccc gcccgccgcg cgcttcgctt tttatagggc cgccgccgcc 5640
gccgcctcgc cataaaagga aactttcgga gcgcgccgct ctgattggct gccgccgcac 5700
ctctccgcct cgccccgccc cgcccctcgc cccgccccgc cccgcctggc gcgcgccccc 5760
cccccccccc cgcccccatc gctgcacaaa ataattaaaa aataaataaa tacaaaattg 5820
ggggtgggga ggggggggag atggggagag tgaagcagaa cgtggggctc acctcgaggc 5880
cggccgaata tcttcattta aatgtgtgtc agttagggtg tggaaagtcc ccaggctccc 5940
cagcaggcag aagtatgcaa agcatgcatc tcaattagtc agcaaccagg tgtggaaagt 6000
ccccaggctc cccagcaggc agaagtatgc aaagcatgca tctcaattag tcagcaacca 6060
tagtcccgcc cctaactccg cccatcccgc ccctaactcc gcccagttcc gcccattctc 6120
cgccccatgg ctgactaatt ttttttattt atgcagaggc cgaggccgcc tcggcctctg 6180
agctattcca gaagtagtga ggaggctttt ttggaggcct aggcttttgc aaacgccggc 6240
gcaccgcggg cccgatccac cggtactgtt ggtaaagcca ccatgttttc cggtggcggc 6300
ggcccgctgt cccccggagg aaagtcggcg gccagggcgg cgtccgggtt ttttgcgccc 6360
gccggccctc gcggagccag ccggggaccc ccgccttgtt tgaggcaaaa cttttacaac 6420
ccctacctcg ccccagtcgg gacgcaacag aagccgaccg ggccaaccca gcgccatacg 6480
tactatagcg aatgcgatga atttcgattc atcgccccgc gggtgctgga cgaggatgcc 6540
cccccggaga agcgcgccgg ggtgcacgac ggtcacctca agcgcgcccc caaggtgtac 6600
tgcggggggg acgagcgcga cgtcctccgc gtcgggtcgg gcggcttctg gccgcggcgc 6660
tcgcgcctgt ggggcggcgt ggaccacgcc ccggcggggt tcaaccccac cgtcaccgtc 6720
tttcacgtgt acgacatcct ggagaacgtg gagcacgcgt acggcatgcg cgcggcccag 6780
ttccacgcgc ggtttatgga cgccatcaca ccgacgggga ccgtcatcac gctcctgggc 6840
ctgactccgg aaggccaccg ggtggccgtt cacgtttacg gcacgcggca gtacttttac 6900
atgaacaagg aggaggtcga caggcaccta caatgccgcg ccccacgaga tctctgcgag 6960
cgcatggccg cggccctgcg cgagtccccg ggcgcgtcgt tccgcggcat ctccgcggac 7020
cacttcgagg cggaggtggt ggagcgcacc gacgtgtact actacgagac gcgccccgct 7080
ctgttttacc gcgtctacgt ccgaagcggg cgcgtgctgt cgtacctgtg cgacaacttc 7140
tgcccggcca tcaagaagta cgagggtggg gtcgacgcca ccacccggtt catcctggac 7200
aaccccgggt tcgtcacctt cggctggtac cgtctcaaac cgggccggaa caacacgcta 7260
gcccagccgc gggccccgat ggccttcggg acatccagcg acgtcgagtt taactgtacg 7320
gcggacaacc tggccatcga ggggggcatg agcgacctac cggcatacaa gctcatgtgc 7380
ttcgatatcg aatgcaaggc ggggggggag gacgagctgg cctttccggt ggccgggcac 7440
ccggaggacc tggtcatcca gatatcctgt ctgctctacg acctgtccac caccgccctg 7500
gagcacgtcc tcctgttttc gctcggttcc tgcgacctcc ccgaatccca cctgaacgag 7560
ctggcggcca ggggcctgcc cacgcccgtg gttctggaat tcgacagcga attcgagatg 7620
ctgttggcct tcatgaccct tgtgaaacag tacggccccg agttcgtgac cgggtacaac 7680
atcatcaact tcgactggcc cttcttgctg gccaagctga cggacattta caaggtcccc 7740
ctggacgggt acggccgcat gaacggccgg ggcgtgtttc gcgtgtggga cataggccag 7800
agccacttcc agaagcgcag caagataaag gtgaacggca tggtgaacat cgacatgtac 7860
gggattataa ccgacaagat caagctctcg agctacaagc tcaacgccgt ggccgaagcc 7920
gtcctgaagg acaagaagaa ggacctgagc tatcgcgaca tccccgccta ctacgccgcc 7980
gggcccgcgc aacgcggggt gatcggcgag tactgcatac aggattccct gctggtgggc 8040
cagctgtttt ttaagttttt gccccatctg gagctctcgg ccgtcgcgcg cttggcgggt 8100
attaacatca cccgcaccat ctacgacggc cagcagatcc gcgtctttac gtgcctgctg 8160
cgcctggccg accagaaggg ctttattctg ccggacaccc aggggcgatt taggggcgcc 8220
gggggggagg cgcccaagcg tccggccgca gcccgggagg acgaggagcg gccagaggag 8280
gagggggagg acgaggacga acgcgaggag ggcgggggcg agcgggagcc ggagggcgcg 8340
cgggagaccg ccggcaggca cgtggggtac cagggggcca gggtccttga ccccacttcc 8400
gggtttcacg tgaaccccgt ggtggtgttc gactttgcca gcctgtaccc cagcatcatc 8460
caggcccaca acctgtgctt cagcacgctc tccctgaggg ccgacgcagt ggcgcacctg 8520
gaggcgggca aggactacct ggagatcgag gtgggggggc gacggctgtt cttcgtcaag 8580
gctcacgtgc gagagagcct cctcagcatc ctcctgcggg actggctcgc catgcgaaag 8640
cagatccgct cgcggattcc ccagagcagc cccgaggagg ccgtgctcct ggacaagcag 8700
caggccgcca tcaaggtcgt gtgtaactcg gtgtacgggt tcacgggagt gcagcacgga 8760
ctcctgccgt gcctgcacgt tgccgcgacg gtgacgacca tcggccgcga gatgctgctc 8820
gcgacccgcg agtacgtcca cgcgcgctgg gcggccttcg aacagctcct ggccgatttc 8880
ccggaggcgg ccgacatgcg cgcccccggg ccctattcca tgcgcatcat ctacggggac 8940
acggactcca tctttgtgct gtgccgcggc ctcacggccg ccgggctgac ggccgtgggc 9000
gacaagatgg cgagccacat ctcgcgcgcg ctgtttctgc cccccatcaa actcgagtgc 9060
gaaaagacgt tcaccaagct gctgctgatc gccaagaaaa agtacatcgg cgtcatctac 9120
gggggtaaga tgctcatcaa gggcgtggat ctggtgcgca aaaacaactg cgcgtttatc 9180
aaccgcacct ccagggccct ggtcgacctg ctgttttacg acgataccgt ctccggagcc 9240
gccgcggcgt tagccgagcg ccccgcggag gagtggctgg cgcgacccct gcccgaggga 9300
ctgcaggcgt tcggggccgt cctcgtagac gcccatcggc gcatcaccga cccggagagg 9360
gacatccagg actttgtcct caccgccgaa ctgagcagac acccgcgcgc gtacaccaac 9420
aagcgcctgg cccacctgac ggtgtattac aagctcatgg cccgccgcgc gcaggtcccg 9480
tccatcaagg accggatccc gtacgtgatc gtggcccaga cccgcgaggt agaggagacg 9540
gtcgcgcggc tggccgccct ccgcgagcta gacgccgccg ccccagggga cgagcccgcc 9600
ccccccgcgg ccctgccctc cccggccaag cgcccccggg agacgccgtc gcctgccgac 9660
cccccgggag gcgcgtccaa gccccgcaag ctgctggtgt ccgagctggc cgaggatccc 9720
gcatacgcca ttgcccacgg cgtcgccctg aacacggact attacttctc ccacctgttg 9780
ggggcggcgt gcgtgacatt caaggccctg tttgggaata acgccaagat caccgagagt 9840
ctgttaaaaa ggtttattcc cgaagtgtgg caccccccgg acgacgtggc cgcgcggctc 9900
cggaccgcag ggttcggggc ggtgggtgcc ggcgctacgg cggaggaaac tcgtcgaatg 9960
ttgcatagag cctttgatac tctagcagaa ttcggcagtg gagcaacaaa cttctctctg 10020
ctgaaacaag ccggagatgt cgaagagaat cctggaccga cggattcccc tggcggtgtg 10080
gcccccgcct cccccgtgga ggacgcgtcg gacgcgtccc tcgggcagcc ggaggagggg 10140
gcgccctgcc aggtggtcct gcagggcgcc gaacttaatg gaatcctaca ggcgtttgcc 10200
ccgctgcgca cgagccttct ggactcgctt ctggttatgg gcgaccgggg catccttatc 10260
cataacacga tctttgggga gcaggtgttc ctgcccctgg aacactcgca attcagtcgg 10320
tatcgctggc gcggacccac ggcggcgttc ctgtctctcg tggaccagaa gcgctccctc 10380
ctgagcgtgt ttcgcgccaa ccagtacccg gacctacgtc gggtggagtt ggcgatcacg 10440
ggccaggccc cgtttcgcac gctggttcag cgcatatgga cgacgacgtc cgacggcgag 10500
gccgttgagc tagccagcga gacgctgatg aagcgcgaac tgacgagctt tgtggtgctg 10560
gttccccagg gaacccccga cgttcagttg cgcctgacga ggccgcagct caccaaggtc 10620
cttaacgcga ccggggccga tagtgccacg cccaccacgt tcgagctcgg ggttaacggc 10680
aaattttccg tgttcaccac gagtacctgc gtcacctttg ctgcccgcga ggagggcgtg 10740
tcgtccagca ccagcaccca ggtccagatc ctgtccaacg cgctcaccaa ggcgggccag 10800
gccgccgcga acgccaagac ggtgtacggg gaaaataccc atcgcacctt ctctgtggtc 10860
gtcgacgatt gcagcatgcg ggcggtgctc cggcgactgc aggtcggcgg gggcaccctc 10920
aagttcttcc tcacgacccc cgtccccagt ctgtgcgtca ccgccaccgg tcccaacgcg 10980
gtatcggcgg tatttctcct gaaaccccag aagatttgcc tggactggct gggtcatagc 11040
caggggtctc cttcagccgg gagctcggcc tcccgggcct ctgggagcga gccaacagac 11100
agccaggact ccgcgtcgga cgcggtcagc cacggcgatc cggaagacct cgatggcgct 11160
gcccgggcgg gagaggcggg ggccttgcat gcctgtccga tgccgtcgtc gaccacgcgg 11220
gtcactccca cgaccaagcg ggggcgctcg gggggcgagg atgcgcgcgc ggacacggcc 11280
ctaaagaaac ctaagacggg gtcgcccacc gcacccccgc ccgcagatcc agtccccctg 11340
gacacggagg acgactccga tgcggcggac gggacggcgg cccgtcccgc cgctccagac 11400
gcccggagcg gaagccgtta cgcgtgttac tttcgcgacc tcccgaccgg agaagcaagc 11460
cccggcgcct tctccgcctt ccgggggggc ccccaaaccc cgtatggttt tggattcccc 11520
tgataggatc cgactgcagg tagctgtgcc ttctagttgc cagccatctg ttgtttgccc 11580
ctcccccgtg ccttccttga ccctggaagg tgccactccc actgtccttt cctaataaaa 11640
tgaggaaatt gcatcgcatt gtctgagtag gtgtcattct attctggggg gtggggtggg 11700
gcaggacagc aagggggagg attgggaaga caatagcagg catgctgggg atgcggtggg 11760
ctctatgggt ttaaacatcg atgcggccgc aacttgttta ttgcagctta taatggttac 11820
aaataaagca atagcatcac aaatttcaca aataaagcat ttttttcact gcattctagt 11880
tgtggtttgt ccaaactcat caatgtatct tagcttaacg ggcggcgaag gagaagtcca 11940
cgcctacatg ggggtagagt cataatcgtg catcaggata gggcggtggt gctgcagcag 12000
cgcgcgaata aactgctgcc gccgccgctc cgtcctgcag gaatacaaca tggcagtggt 12060
ctcctcagcg atgattcgca ccgcccgcag cataaggcgc cttgtcctcc gggcacagca 12120
gcgcaccctg atctcactta aatcagcaca gtaactgcag cacagcacca caatattgtt 12180
caaaatccca cagtgcaagg cgctgtatcc aaagctcatg gcggggacca cagaacccac 12240
gtggccatca taccacaagc gcaggtagat taagtggcga cccctcataa acacgctgga 12300
cataaacatt acctcttttg gcatgttgta attcaccacc tcccggtacc atataaacct 12360
ctgattaaac atggcgccat ccaccaccat cctaaaccag ctggccaaaa cctgcccgcc 12420
ggctatacac tgcagggaac cgggactgga acaatgacag tggagagccc aggactcgta 12480
accatggatc atcatgctcg tcatgatatc aatgttggca caacacaggc acacgtgcat 12540
acacttcctc aggattacaa gctcctcccg cgttagaacc atatcccagg gaacaaccca 12600
ttcctgaatc agcgtaaatc ccacactgca gggaagacct cgcacgtaac tcacgttgtg 12660
cattgtcaaa gtgttacatt cgggcagcag cggatgatcc tccagtatgg tagcgcgggt 12720
ttctgtctca aaaggaggta gacgatccct actgtacgga gtgcgccgag acaaccgaga 12780
tcgtgttggt cgtagtgtca tgccaaatgg aacgccggac gtagtcatat ttcctgaagc 12840
aaaaccaggt gcgggcgtga caaacagatc tgcgtctccg gtctcgccgc ttagatcgct 12900
ctgtgtagta gttgtagtat atccactctc tcaaagcatc caggcgcccc ctggcttcgg 12960
gttctatgta aactccttca tgcgccgctg ccctgataac atccaccacc gcagaataag 13020
ccacacccag ccaacctaca cattcgttct gcgagtcaca cacgggagga gcgggaagag 13080
ctggaagaac catgtttttt tttttattcc aaaagattat ccaaaacctc aaaatgaaga 13140
tctattaagt gaacgcgctc ccctccggtg gcgtggtcaa actctacagc caaagaacag 13200
ataatggcat ttgtaagatg ttgcacaatg gcttccaaaa ggcaaacggc cctcacgtcc 13260
aagtggacgt aaaggctaaa cccttcaggg tgaatctcct ctataaacat tccagcacct 13320
tcaaccatgc ccaaataatt ctcatctcgc caccttctca atatatctct aagcaaatcc 13380
cgaatattaa gtccggccat tgtaaaaatc tgctccagag cgccctccac cttcagcctc 13440
aagcagcgaa tcatgattgc aaaaattcag gttcctcaca gacctgtata agattcaaaa 13500
gcggaacatt aacaaaaata ccgcgatccc gtaggtccct tcgcagggcc agctgaacat 13560
aatcgtgcag gtctgcacgg accagcgcgg ccacttcccc gccaggaacc atgacaaaag 13620
aacccacact gattatgaca cgcatactcg gagctatgct aaccagcgta gccccgatgt 13680
aagcttgttg catgggcggc gatataaaat gcaaggtgct gctcaaaaaa tcaggcaaag 13740
cctcgcgcaa aaaagaaagc acatcgtagt catgctcatg cagataaagg caggtaagct 13800
ccggaaccac cacagaaaaa gacaccattt ttctctcaaa catgtctgcg ggtttctgca 13860
taaacacaaa ataaaataac aaaaaaacat ttaaacatta gaagcctgtc ttacaacagg 13920
aaaaacaacc cttataagca taagacggac tacggccatg ccggcgtgac cgtaaaaaaa 13980
ctggtcaccg tgattaaaaa gcaccaccga cagctcctcg gtcatgtccg gagtcataat 14040
gtaagactcg gtaaacacat caggttgatt cacatcggtc agtgctaaaa agcgaccgaa 14100
atagcccggg ggaatacata cccgcaggcg tagagacaac attacagccc ccataggagg 14160
tataacaaaa ttaataggag agaaaaacac ataaacacct gaaaaaccct cctgcctagg 14220
caaaatagca ccctcccgct ccagaacaac atacagcgct tccacagcgg cagccataac 14280
agtcagcctt accagtaaaa aagaaaacct attaaaaaaa caccactcga cacggcacca 14340
gctcaatcag tcacagtgta aaaaagggcc aagtgcagag cgagtatata taggactaaa 14400
aaatgacgta acggttaaag tccacaaaaa acacccagaa aaccgcacgc gaacctacgc 14460
ccagaaacga aagccaaaaa acccacaact tcctcaaatc gtcacttccg ttttcccacg 14520
ttacgtcact tcccatttta agaaaactac aattcccaac acatacaagt tactccgccc 14580
ttaattaaat cggatccgat atctagatgt attcgcgagg taccgagctc gaattctctg 14640
gccgtcgttt tacaacgtcg tgactgggaa aaccctggcg ttacccaact taatcgcctt 14700
gcagcacatc cccctttcgc cagctggcgt aatagcgaag aggcccgcac cgatcgccct 14760
tcccaacagt tgcgcagcct gaatggcgaa tggcgcctga tgcggtattt tctccttacg 14820
catctgtgcg gtatttcaca ccgcatatgg tgcactctca gtacaatctg ctctgatgcc 14880
gcatagttaa gccagccccg acacccgcca acacccgctg acgcgccctg acgggcttgt 14940
ctgctcccgg catccgctta cagacaagct gtgaccgtct ccgggagctg catgtgtcag 15000
aggttttcac cgtcatcacc gaaacgcgcg a 15031
<210> 62
<211> 14267
<212> DNA
<213> Artificial Sequence
<220>
<223> Description of Artificial Sequence: Synthetic polynucleotide
<400> 62
tgcagctctg gcccgtgtct caaaatctct gatgttacat tgcacaagat aaaaatatat 60
catcatgaac aataaaactg tctgcttaca taaacagtaa tacaaggggt gttatgagcc 120
atattcaacg ggaaacgtcg aggccgcgat taaattccaa catggatgct gatttatatg 180
ggtataaatg ggctcgcgat aatgtcgggc aatcaggtgc gacaatctat cgcttgtatg 240
ggaagcccga tgcgccagag ttgtttctga aacatggcaa aggtagcgtt gccaatgatg 300
ttacagatga gatggtcaga ctaaactggc tgacggaatt tatgcctctt ccgaccatca 360
agcattttat ccgtactcct gatgatgcat ggttactcac cactgcgatc cccggaaaaa 420
cagcattcca ggtattagaa gaatatcctg attcaggtga aaatattgtt gatgcgctgg 480
cagtgttcct gcgccggttg cattcgattc ctgtttgtaa ttgtcctttt aacagcgatc 540
gcgtatttcg tctcgctcag gcgcaatcac gaatgaataa cggtttggtt gatgcgagtg 600
attttgatga cgagcgtaat ggctggcctg ttgaacaagt ctggaaagaa atgcataaac 660
ttttgccatt ctcaccggat tcagtcgtca ctcatggtga tttctcactt gataacctta 720
tttttgacga ggggaaatta ataggttgta ttgatgttgg acgagtcgga atcgcagacc 780
gataccagga tcttgccatc ctatggaact gcctcggtga gttttctcct tcattacaga 840
aacggctttt tcaaaaatat ggtattgata atcctgatat gaataaattg cagtttcatt 900
tgatgctcga tgagtttttc taatcagaat tggttaattg gttgtaacat tattcagatt 960
gggcttgatt taaaacttca tttttaattt aaaaggatct aggtgaagat cctttttgat 1020
aatctcatga ccaaaatccc ttaacgtgag ttttcgttcc actgagcgtc agaccccgta 1080
gaaaagatca aaggatcttc ttgagatcct ttttttctgc gcgtaatctg ctgcttgcaa 1140
acaaaaaaac caccgctacc agcggtggtt tgtttgccgg atcaagagct accaactctt 1200
tttccgaagg taactggctt cagcagagcg cagataccaa atactgttct tctagtgtag 1260
ccgtagttag gccaccactt caagaactct gtagcaccgc ctacatacct cgctctgcta 1320
atcctgttac cagtggctgc tgccagtggc gataagtcgt gtcttaccgg gttggactca 1380
agacgatagt taccggataa ggcgcagcgg tcgggctgaa cggggggttc gtgcacacag 1440
cccagcttgg agcgaacgac ctacaccgaa ctgagatacc tacagcgtga gctatgagaa 1500
agcgccacgc ttcccgaagg gagaaaggcg gacaggtatc cggtaagcgg cagggtcgga 1560
acaggagagc gcacgaggga gcttccaggg ggaaacgcct ggtatcttta tagtcctgtc 1620
gggtttcgcc acctctgact tgagcgtcga tttttgtgat gctcgtcagg ggggcggagc 1680
ctatggaaaa acgccagcaa cgcggccttt ttacggttcc tggccttttg ctggcctttt 1740
gctcacatgt tctttcctgc gttatcccct gattctgtgg ataaccgtat taccgccttt 1800
gagtgagctg ataccgctcg ccgcagccga acgaccgagc gcagcgagtc agtgagcgag 1860
gaagcggaag agcgcccaat acgcaaaccg cctctccccg cgcgttggcc gattcattaa 1920
tgcagctggc acgacaggtt tcccgactgg aaagcgggca gtgagcgcaa cgcaattaat 1980
gtgagttagc tcactcatta ggcaccccag gctttacact ttatgcttcc ggctcgtatg 2040
ttgtgtggaa ttgtgagcgg ataacaattt cacacaggaa acagctatga ccatgattac 2100
accaagcttg catgcaggcc tatccgtaga tgtacctgga catccaggtg atgccggcgg 2160
cggtggtgga ggcgcgcgga aagtcgcgga cgcggttcca gatgttgcgc agcggcaaaa 2220
agtgctccat ggtcgggacg ctctggccgg tgaggcgtgc gcagtcgttg acgctctaga 2280
ccgtgcaaaa ggagagcctg taagcgggca ctcttccgtg gtctggtgga taaattcgca 2340
agggtatcat ggcggacgac cggggttcga accccggatc cggccgtccg ccgtgatcca 2400
tgcggttacc gcccgcgtgt cgaacccagg tgtgcgacgt cagacaacgg gggagcgctc 2460
cttttggctt ccttccaggc gcggcggctg ctgcgctagc ttttttggcc actggccgcg 2520
cgcggcgtaa gcggttaggc tggaaagcga aagcattaag tggctcgctc cctgtagccg 2580
gagggttatt ttccaagggt tgagtcgcag gacccccggt tcgagtctcg ggccggccgg 2640
actgcggcga acgggggttt gcctccccgt catgcaagac cccgcttgca aattcctccg 2700
gaaacaggga cgagcccctt ttttgctttt cccagatgca tccggtgctg cggcagatgc 2760
gcccccctcc tcagcagcgg caagagcaag agcagcggca gacatgcagg gcaccctccc 2820
cttctcctac cgcgtcagga ggggcaacat cgatccagac atgataagat acattgatga 2880
gtttggacaa accacaacta gaatgcagtg aaaaaaatgc tttatttgtg aaatttgtga 2940
tgctattgct ttatttgtaa ccattataag ctgcaataaa caagtttgta cactctcggg 3000
tgattattta cccccaccct tgccgtctgc gccgtttaaa aatcaaaggg gttctgccgc 3060
gcatcgctat gcgccactgg cagggacacg ttgcgatact ggtgtttagt gctccactta 3120
aactcaggca caaccatccg cggcagctcg gtgaagtttt cactccacag gctgcgcacc 3180
atcaccaacg cgtttagcag gtcgggcgcc gatatcttga agtcgcagtt ggggcctccg 3240
ccctgcgcgc gcgagttgcg atacacaggg ttgcagcact ggaacactat cagcgccggg 3300
tggtgcacgc tggccagcac gctcttgtcg gagatcagat ccgcgtccag gtcctccgcg 3360
ttgctcaggg cgaacggagt caactttggt agctgccttc ccaaaaaggg cgcgtgccca 3420
ggctttgagt tgcactcgca ccgtagtggc atcaaaaggt gaccgtgccc ggtctgggcg 3480
ttaggataca gcgcctgcat aaaagccttg atctgcttaa aagccacctg agcctttgcg 3540
ccttcagaga agaacatgcc gcaagacttg ccggaaaact gattggccgg acaggccgcg 3600
tcgtgcacgc agcaccttgc gtcggtgttg gagatctgca ccacatttcg gccccaccgg 3660
ttcttcacga tcttggcctt gctagactgc tccttcagcg cgcgctgccc gttttcgctc 3720
gtcacatcca tttcaatcac gtgctcctta tttatcataa tgcttccgtg tagacactta 3780
agctcgcctt cgatctcagc gcagcggtgc agccacaacg cgcagcccgt gggctcgtga 3840
tgcttgtagg tcacctctgc aaacgactgc aggtacgcct gcaggaatcg ccccatcatc 3900
gtcacaaagg tcttgttgct ggtgaaggtc agctgcaacc cgcggtgctc ctcgttcagc 3960
caggtcttgc atacggccgc cagagcttcc acttggtcag gcagtagttt gaagttcgcc 4020
tttagatcgt tatccacgtg gtacttgtcc atcagcgcgc gcgcagcctc catgcccttc 4080
tcccacgcag acacgatcgg cacactcagc gggttcatca ccgtaatttc actttccgct 4140
tcgctgggct cttcctcttc ctcttgcgtc cgcataccac gcgccactgg gtcgtcttca 4200
ttcagccgcc gcactgtgcg cttacctcct ttgccatgct tgattagcac cggtgggttg 4260
ctgaaaccca ccatttgtag cgccacatct tctctttctt cctcgctgtc cacgattacc 4320
tctggtgatg gcgggcgctc gggcttggga gaagggcgct tctttttctt cttgggcgca 4380
atggccaaat ccgccgccga ggtcgatggc cgcgggctgg gtgtgcgcgg caccagcgcg 4440
tcttgtgatg agtcttcctc gtcctcggac tcgatacgcc gcctcatccg cttttttggg 4500
ggcgcccggg gaggcggcgg cgacggggac ggggacgaca cgtcctccat ggttggggga 4560
cgtcgcgccg caccgcgtcc gcgctcgggg gtggtttcgc gctgctcctc ttcccgactg 4620
gccatttcct tctcctatag gcagaaaaag atccacaaaa gcgaagatca gcttcggcgc 4680
acgctggaag acgcggaggc tctcttcagt aaatactgcg cgctgactct taaggactag 4740
tttcgcgccc tttctcaaat ttaagcgcga aaactacgtc atctccagcg gccacacccg 4800
gcgccagcac ctgttgtcag cgccattggc gcgcccgccc gccgcgcgct tcgcttttta 4860
tagggccgcc gccgccgccg cctcgccata aaaggaaact ttcggagcgc gccgctctga 4920
ttggctgccg ccgcacctct ccgcctcgcc ccgccccgcc cctcgccccg ccccgccccg 4980
cctggcgcgc gccccccccc cccccccgcc cccatcgctg cacaaaataa ttaaaaaata 5040
aataaataca aaattggggg tggggagggg ggggagatgg ggagagtgaa gcagaacgtg 5100
gggctcacct cgaggccggc cgaatatctt catttaaatg tgtgtcagtt agggtgtgga 5160
aagtccccag gctccccagc aggcagaagt atgcaaagca tgcatctcaa ttagtcagca 5220
accaggtgtg gaaagtcccc aggctcccca gcaggcagaa gtatgcaaag catgcatctc 5280
aattagtcag caaccatagt cccgccccta actccgccca tcccgcccct aactccgccc 5340
agttccgccc attctccgcc ccatggctga ctaatttttt ttatttatgc agaggccgag 5400
gccgcctcgg cctctgagct attccagaag tagtgaggag gcttttttgg aggcctaggc 5460
ttttgcaaac gccggcgcac cgcgggcccg atccaccggt actgttggta aagccaccat 5520
gttttccggt ggcggcggcc cgctgtcccc cggaggaaag tcggcggcca gggcggcgtc 5580
cgggtttttt gcgcccgccg gccctcgcgg agccagccgg ggacccccgc cttgtttgag 5640
gcaaaacttt tacaacccct acctcgcccc agtcgggacg caacagaagc cgaccgggcc 5700
aacccagcgc catacgtact atagcgaatg cgatgaattt cgattcatcg ccccgcgggt 5760
gctggacgag gatgcccccc cggagaagcg cgccggggtg cacgacggtc acctcaagcg 5820
cgcccccaag gtgtactgcg ggggggacga gcgcgacgtc ctccgcgtcg ggtcgggcgg 5880
cttctggccg cggcgctcgc gcctgtgggg cggcgtggac cacgccccgg cggggttcaa 5940
ccccaccgtc accgtctttc acgtgtacga catcctggag aacgtggagc acgcgtacgg 6000
catgcgcgcg gcccagttcc acgcgcggtt tatggacgcc atcacaccga cggggaccgt 6060
catcacgctc ctgggcctga ctccggaagg ccaccgggtg gccgttcacg tttacggcac 6120
gcggcagtac ttttacatga acaaggagga ggtcgacagg cacctacaat gccgcgcccc 6180
acgagatctc tgcgagcgca tggccgcggc cctgcgcgag tccccgggcg cgtcgttccg 6240
cggcatctcc gcggaccact tcgaggcgga ggtggtggag cgcaccgacg tgtactacta 6300
cgagacgcgc cccgctctgt tttaccgcgt ctacgtccga agcgggcgcg tgctgtcgta 6360
cctgtgcgac aacttctgcc cggccatcaa gaagtacgag ggtggggtcg acgccaccac 6420
ccggttcatc ctggacaacc ccgggttcgt caccttcggc tggtaccgtc tcaaaccggg 6480
ccggaacaac acgctagccc agccgcgggc cccgatggcc ttcgggacat ccagcgacgt 6540
cgagtttaac tgtacggcgg acaacctggc catcgagggg ggcatgagcg acctaccggc 6600
atacaagctc atgtgcttcg atatcgaatg caaggcgggg ggggaggacg agctggcctt 6660
tccggtggcc gggcacccgg aggacctggt catccagata tcctgtctgc tctacgacct 6720
gtccaccacc gccctggagc acgtcctcct gttttcgctc ggttcctgcg acctccccga 6780
atcccacctg aacgagctgg cggccagggg cctgcccacg cccgtggttc tggaattcga 6840
cagcgaattc gagatgctgt tggccttcat gacccttgtg aaacagtacg gccccgagtt 6900
cgtgaccggg tacaacatca tcaacttcga ctggcccttc ttgctggcca agctgacgga 6960
catttacaag gtccccctgg acgggtacgg ccgcatgaac ggccggggcg tgtttcgcgt 7020
gtgggacata ggccagagcc acttccagaa gcgcagcaag ataaaggtga acggcatggt 7080
gaacatcgac atgtacggga ttataaccga caagatcaag ctctcgagct acaagctcaa 7140
cgccgtggcc gaagccgtcc tgaaggacaa gaagaaggac ctgagctatc gcgacatccc 7200
cgcctactac gccgccgggc ccgcgcaacg cggggtgatc ggcgagtact gcatacagga 7260
ttccctgctg gtgggccagc tgttttttaa gtttttgccc catctggagc tctcggccgt 7320
cgcgcgcttg gcgggtatta acatcacccg caccatctac gacggccagc agatccgcgt 7380
ctttacgtgc ctgctgcgcc tggccgacca gaagggcttt attctgccgg acacccaggg 7440
gcgatttagg ggcgccgggg gggaggcgcc caagcgtccg gccgcagccc gggaggacga 7500
ggagcggcca gaggaggagg gggaggacga ggacgaacgc gaggagggcg ggggcgagcg 7560
ggagccggag ggcgcgcggg agaccgccgg caggcacgtg gggtaccagg gggccagggt 7620
ccttgacccc acttccgggt ttcacgtgaa ccccgtggtg gtgttcgact ttgccagcct 7680
gtaccccagc atcatccagg cccacaacct gtgcttcagc acgctctccc tgagggccga 7740
cgcagtggcg cacctggagg cgggcaagga ctacctggag atcgaggtgg gggggcgacg 7800
gctgttcttc gtcaaggctc acgtgcgaga gagcctcctc agcatcctcc tgcgggactg 7860
gctcgccatg cgaaagcaga tccgctcgcg gattccccag agcagccccg aggaggccgt 7920
gctcctggac aagcagcagg ccgccatcaa ggtcgtgtgt aactcggtgt acgggttcac 7980
gggagtgcag cacggactcc tgccgtgcct gcacgttgcc gcgacggtga cgaccatcgg 8040
ccgcgagatg ctgctcgcga cccgcgagta cgtccacgcg cgctgggcgg ccttcgaaca 8100
gctcctggcc gatttcccgg aggcggccga catgcgcgcc cccgggccct attccatgcg 8160
catcatctac ggggacacgg actccatctt tgtgctgtgc cgcggcctca cggccgccgg 8220
gctgacggcc gtgggcgaca agatggcgag ccacatctcg cgcgcgctgt ttctgccccc 8280
catcaaactc gagtgcgaaa agacgttcac caagctgctg ctgatcgcca agaaaaagta 8340
catcggcgtc atctacgggg gtaagatgct catcaagggc gtggatctgg tgcgcaaaaa 8400
caactgcgcg tttatcaacc gcacctccag ggccctggtc gacctgctgt tttacgacga 8460
taccgtctcc ggagccgccg cggcgttagc cgagcgcccc gcggaggagt ggctggcgcg 8520
acccctgccc gagggactgc aggcgttcgg ggccgtcctc gtagacgccc atcggcgcat 8580
caccgacccg gagagggaca tccaggactt tgtcctcacc gccgaactga gcagacaccc 8640
gcgcgcgtac accaacaagc gcctggccca cctgacggtg tattacaagc tcatggcccg 8700
ccgcgcgcag gtcccgtcca tcaaggaccg gatcccgtac gtgatcgtgg cccagacccg 8760
cgaggtagag gagacggtcg cgcggctggc cgccctccgc gagctagacg ccgccgcccc 8820
aggggacgag cccgcccccc ccgcggccct gccctccccg gccaagcgcc cccgggagac 8880
gccgtcgcct gccgaccccc cgggaggcgc gtccaagccc cgcaagctgc tggtgtccga 8940
gctggccgag gatcccgcat acgccattgc ccacggcgtc gccctgaaca cggactatta 9000
cttctcccac ctgttggggg cggcgtgcgt gacattcaag gccctgtttg ggaataacgc 9060
caagatcacc gagagtctgt taaaaaggtt tattcccgaa gtgtggcacc ccccggacga 9120
cgtggccgcg cggctccgga ccgcagggtt cggggcggtg ggtgccggcg ctacggcgga 9180
ggaaactcgt cgaatgttgc atagagcctt tgatactcta gcagaattcg gcagtggagc 9240
aacaaacttc tctctgctga aacaagccgg agatgtcgaa gagaatcctg gaccgacgga 9300
ttcccctggc ggtgtggccc ccgcctcccc cgtggaggac gcgtcggacg cgtccctcgg 9360
gcagccggag gagggggcgc cctgccaggt ggtcctgcag ggcgccgaac ttaatggaat 9420
cctacaggcg tttgccccgc tgcgcacgag ccttctggac tcgcttctgg ttatgggcga 9480
ccggggcatc cttatccata acacgatctt tggggagcag gtgttcctgc ccctggaaca 9540
ctcgcaattc agtcggtatc gctggcgcgg acccacggcg gcgttcctgt ctctcgtgga 9600
ccagaagcgc tccctcctga gcgtgtttcg cgccaaccag tacccggacc tacgtcgggt 9660
ggagttggcg atcacgggcc aggccccgtt tcgcacgctg gttcagcgca tatggacgac 9720
gacgtccgac ggcgaggccg ttgagctagc cagcgagacg ctgatgaagc gcgaactgac 9780
gagctttgtg gtgctggttc cccagggaac ccccgacgtt cagttgcgcc tgacgaggcc 9840
gcagctcacc aaggtcctta acgcgaccgg ggccgatagt gccacgccca ccacgttcga 9900
gctcggggtt aacggcaaat tttccgtgtt caccacgagt acctgcgtca cctttgctgc 9960
ccgcgaggag ggcgtgtcgt ccagcaccag cacccaggtc cagatcctgt ccaacgcgct 10020
caccaaggcg ggccaggccg ccgcgaacgc caagacggtg tacggggaaa atacccatcg 10080
caccttctct gtggtcgtcg acgattgcag catgcgggcg gtgctccggc gactgcaggt 10140
cggcgggggc accctcaagt tcttcctcac gacccccgtc cccagtctgt gcgtcaccgc 10200
caccggtccc aacgcggtat cggcggtatt tctcctgaaa ccccagaaga tttgcctgga 10260
ctggctgggt catagccagg ggtctccttc agccgggagc tcggcctccc gggcctctgg 10320
gagcgagcca acagacagcc aggactccgc gtcggacgcg gtcagccacg gcgatccgga 10380
agacctcgat ggcgctgccc gggcgggaga ggcgggggcc ttgcatgcct gtccgatgcc 10440
gtcgtcgacc acgcgggtca ctcccacgac caagcggggg cgctcggggg gcgaggatgc 10500
gcgcgcggac acggccctaa agaaacctaa gacggggtcg cccaccgcac ccccgcccgc 10560
agatccagtc cccctggaca cggaggacga ctccgatgcg gcggacggga cggcggcccg 10620
tcccgccgct ccagacgccc ggagcggaag ccgttacgcg tgttactttc gcgacctccc 10680
gaccggagaa gcaagccccg gcgccttctc cgccttccgg gggggccccc aaaccccgta 10740
tggttttgga ttcccctgat aggatccgac tgcaggtagc tgtgccttct agttgccagc 10800
catctgttgt ttgcccctcc cccgtgcctt ccttgaccct ggaaggtgcc actcccactg 10860
tcctttccta ataaaatgag gaaattgcat cgcattgtct gagtaggtgt cattctattc 10920
tggggggtgg ggtggggcag gacagcaagg gggaggattg ggaagacaat agcaggcatg 10980
ctggggatgc ggtgggctct atgggtttaa acatcgatgc ggccgcaact tgtttattgc 11040
agcttataat ggttacaaat aaagcaatag catcacaaat ttcacaaata aagcattttt 11100
ttcactgcat tctagttgtg gtttgtccaa actcatcaat gtatcttagc ttaacgggcg 11160
gcgaaggaga agtccacgcc tacatggggg tagagtcata atcgtgcatc aggatagggc 11220
ggtggtgctg cagcagcgcg cgaataaact gctgccgccg ccgctccgtc ctgcaggaat 11280
acaacatggc agtggtctcc tcagcgatga ttcgcaccgc ccgcagcata aggcgccttg 11340
tcctccgggc acagcagcgc accctgatct cacttaaatc agcacagtaa ctgcagcaca 11400
gcaccacaat attgttcaaa atcccacagt gcaaggcgct gtatccaaag ctcatggcgg 11460
ggaccacaga acccacgtgg ccatcatacc acaagcgcag gtagattaag tggcgacccc 11520
tcataaacac gctggacata aacattacct cttttggcat gttgtaattc accacctccc 11580
ggtaccatat aaacctctga ttaaacatgg cgccatccac caccatccta aaccagctgg 11640
ccaaaacctg cccgccggct atacactgca gggaaccggg actggaacaa tgacagtgga 11700
gagcccagga ctcgtaacca tggatcatca tgctcgtcat gatatcaatg ttggcacaac 11760
acaggcacac gtgcatacac ttcctcagga ttacaagctc ctcccgcgtt agaaccatat 11820
cccagggaac aacccattcc tgaatcagcg taaatcccac actgcaggga agacctcgca 11880
cgtaactcac gttgtgcatt gtcaaagtgt tacattcggg cagcagcgga tgatcctcca 11940
gtatggtagc gcgggtttct gtctcaaaag gaggtagacg atccctactg tacggagtgc 12000
gccgagacaa ccgagatcgt gttggtcgta gtgtcatgcc aaatggaacg ccggacgtag 12060
tcatatttcc tgaagcaaaa ccaggtgcgg gcgtgacaaa cagatctgcg tctccggtct 12120
cgccgcttag atcgctctgt gtagtagttg tagtatatcc actctctcaa agcatccagg 12180
cgccccctgg cttcgggttc tatgtaaact ccttcatgcg ccgctgccct gataacatcc 12240
accaccgcag aataagccac acccagccaa cctacacatt cgttctgcga gtcacacacg 12300
ggaggagcgg gaagagctgg aagaaccatg tttttttttt tattccaaaa gattatccaa 12360
aacctcaaaa tgaagatcta ttaagtgaac gcgctcccct ccggtggcgt ggtcaaactc 12420
tacagccaaa gaacagataa tggcatttgt aagatgttgc acaatggctt ccaaaaggca 12480
aacggccctc acgtccaagt ggacgtaaag gctaaaccct tcagggtgaa tctcctctat 12540
aaacattcca gcaccttcaa ccatgcccaa ataattctca tctcgccacc ttctcaatat 12600
atctctaagc aaatcccgaa tattaagtcc ggccattgta aaaatctgct ccagagcgcc 12660
ctccaccttc agcctcaagc agcgaatcat gattgcaaaa attcaggttc ctcacagacc 12720
tgtataagat tcaaaagcgg aacattaaca aaaataccgc gatcccgtag gtcccttcgc 12780
agggccagct gaacataatc gtgcaggtct gcacggacca gcgcggccac ttccccgcca 12840
ggaaccatga caaaagaacc cacactgatt atgacacgca tactcggagc tatgctaacc 12900
agcgtagccc cgatgtaagc ttgttgcatg ggcggcgata taaaatgcaa ggtgctgctc 12960
aaaaaatcag gcaaagcctc gcgcaaaaaa gaaagcacat cgtagtcatg ctcatgcaga 13020
taaaggcagg taagctccgg aaccaccaca gaaaaagaca ccatttttct ctcaaacatg 13080
tctgcgggtt tctgcataaa cacaaaataa aataacaaaa aaacatttaa acattagaag 13140
cctgtcttac aacaggaaaa acaaccctta taagcataag acggactacg gccatgccgg 13200
cgtgaccgta aaaaaactgg tcaccgtgat taaaaagcac caccgacagc tcctcggtca 13260
tgtccggagt cataatgtaa gactcggtaa acacatcagg ttgattcaca tcggtcagtg 13320
ctaaaaagcg accgaaatag cccgggggaa tacatacccg caggcgtaga gacaacatta 13380
cagcccccat aggaggtata acaaaattaa taggagagaa aaacacataa acacctgaaa 13440
aaccctcctg cctaggcaaa atagcaccct cccgctccag aacaacatac agcgcttcca 13500
cagcggcagc cataacagtc agccttacca gtaaaaaaga aaacctatta aaaaaacacc 13560
actcgacacg gcaccagctc aatcagtcac agtgtaaaaa agggccaagt gcagagcgag 13620
tatatatagg actaaaaaat gacgtaacgg ttaaagtcca caaaaaacac ccagaaaacc 13680
gcacgcgaac ctacgcccag aaacgaaagc caaaaaaccc acaacttcct caaatcgtca 13740
cttccgtttt cccacgttac gtcacttccc attttaagaa aactacaatt cccaacacat 13800
acaagttact ccgcccttaa ttaaatcgga tccgatatct agatgtattc gcgaggtacc 13860
gagctcgaat tctctggccg tcgttttaca acgtcgtgac tgggaaaacc ctggcgttac 13920
ccaacttaat cgccttgcag cacatccccc tttcgccagc tggcgtaata gcgaagaggc 13980
ccgcaccgat cgcccttccc aacagttgcg cagcctgaat ggcgaatggc gcctgatgcg 14040
gtattttctc cttacgcatc tgtgcggtat ttcacaccgc atatggtgca ctctcagtac 14100
aatctgctct gatgccgcat agttaagcca gccccgacac ccgccaacac ccgctgacgc 14160
gccctgacgg gcttgtctgc tcccggcatc cgcttacaga caagctgtga ccgtctccgg 14220
gagctgcatg tgtcagaggt tttcaccgtc atcaccgaaa cgcgcga 14267
<210> 63
<211> 15128
<212> DNA
<213> Artificial Sequence
<220>
<223> Description of Artificial Sequence: Synthetic polynucleotide
<400> 63
tgcagctctg gcccgtgtct caaaatctct gatgttacat tgcacaagat aaaaatatat 60
catcatgaac aataaaactg tctgcttaca taaacagtaa tacaaggggt gttatgagcc 120
atattcaacg ggaaacgtcg aggccgcgat taaattccaa catggatgct gatttatatg 180
ggtataaatg ggctcgcgat aatgtcgggc aatcaggtgc gacaatctat cgcttgtatg 240
ggaagcccga tgcgccagag ttgtttctga aacatggcaa aggtagcgtt gccaatgatg 300
ttacagatga gatggtcaga ctaaactggc tgacggaatt tatgcctctt ccgaccatca 360
agcattttat ccgtactcct gatgatgcat ggttactcac cactgcgatc cccggaaaaa 420
cagcattcca ggtattagaa gaatatcctg attcaggtga aaatattgtt gatgcgctgg 480
cagtgttcct gcgccggttg cattcgattc ctgtttgtaa ttgtcctttt aacagcgatc 540
gcgtatttcg tctcgctcag gcgcaatcac gaatgaataa cggtttggtt gatgcgagtg 600
attttgatga cgagcgtaat ggctggcctg ttgaacaagt ctggaaagaa atgcataaac 660
ttttgccatt ctcaccggat tcagtcgtca ctcatggtga tttctcactt gataacctta 720
tttttgacga ggggaaatta ataggttgta ttgatgttgg acgagtcgga atcgcagacc 780
gataccagga tcttgccatc ctatggaact gcctcggtga gttttctcct tcattacaga 840
aacggctttt tcaaaaatat ggtattgata atcctgatat gaataaattg cagtttcatt 900
tgatgctcga tgagtttttc taatcagaat tggttaattg gttgtaacat tattcagatt 960
gggcttgatt taaaacttca tttttaattt aaaaggatct aggtgaagat cctttttgat 1020
aatctcatga ccaaaatccc ttaacgtgag ttttcgttcc actgagcgtc agaccccgta 1080
gaaaagatca aaggatcttc ttgagatcct ttttttctgc gcgtaatctg ctgcttgcaa 1140
acaaaaaaac caccgctacc agcggtggtt tgtttgccgg atcaagagct accaactctt 1200
tttccgaagg taactggctt cagcagagcg cagataccaa atactgttct tctagtgtag 1260
ccgtagttag gccaccactt caagaactct gtagcaccgc ctacatacct cgctctgcta 1320
atcctgttac cagtggctgc tgccagtggc gataagtcgt gtcttaccgg gttggactca 1380
agacgatagt taccggataa ggcgcagcgg tcgggctgaa cggggggttc gtgcacacag 1440
cccagcttgg agcgaacgac ctacaccgaa ctgagatacc tacagcgtga gctatgagaa 1500
agcgccacgc ttcccgaagg gagaaaggcg gacaggtatc cggtaagcgg cagggtcgga 1560
acaggagagc gcacgaggga gcttccaggg ggaaacgcct ggtatcttta tagtcctgtc 1620
gggtttcgcc acctctgact tgagcgtcga tttttgtgat gctcgtcagg ggggcggagc 1680
ctatggaaaa acgccagcaa cgcggccttt ttacggttcc tggccttttg ctggcctttt 1740
gctcacatgt tctttcctgc gttatcccct gattctgtgg ataaccgtat taccgccttt 1800
gagtgagctg ataccgctcg ccgcagccga acgaccgagc gcagcgagtc agtgagcgag 1860
gaagcggaag agcgcccaat acgcaaaccg cctctccccg cgcgttggcc gattcattaa 1920
tgcagctggc acgacaggtt tcccgactgg aaagcgggca gtgagcgcaa cgcaattaat 1980
gtgagttagc tcactcatta ggcaccccag gctttacact ttatgcttcc ggctcgtatg 2040
ttgtgtggaa ttgtgagcgg ataacaattt cacacaggaa acagctatga ccatgattac 2100
accaagcttg catgcaggcc tctgcagtcg accagaagca ccatgtcctt gggtccggcc 2160
tgctgaatgc gcaggcggtc ggccatgccc caggcttcgt tttgacatcg gcgcaggtct 2220
ttgtagtagt cttgcatgag cctttctacc ggcacttctt cttctccttc ctcttgtcct 2280
gcatctcttg catctatcgc tgcggcggcg gcggagtttg gccgtaggtg gcgccctctt 2340
cctcccatgc gtgtgacccc gaagcccctc atcggctgaa gcagggctag gtcggcgaca 2400
acgcgctcgg ctaatatggc ctgctgcacc tgcgtgaggg tagactggaa gtcatccatg 2460
tccacaaagc ggtggtatgc gcccgtgttg atggtgtaag tgcagttggc cataacggac 2520
cagttaacgg tctggtgacc cggctgcgag agctcggtgt acctgagacg cgagtaagcc 2580
ctcgagtcaa atacgtagtc gttgcaagtc cgcaccaggt actggtatcc caccaaaaag 2640
tgcggcggcg gctggcggta gaggggccag cgtagggtgg ccggggctcc gggggcgaga 2700
tcttccaaca taaggcgatg atatccgtag atgtacctgg acatccaggt gatgccggcg 2760
gcggtggtgg aggcgcgcgg aaagtcgcgg acgcggttcc agatgttgcg cagcggcaaa 2820
aagtgctcca tggtcgggac gctctggccg gtcaggcgcg cgcaatcgtt gacgctctag 2880
cgtgcaaaag gagagcctgt aagcgggcac tcttccgtgg tctggtggat aaattcgcaa 2940
gggtatcatg gcggacgacc ggggttcgag ccccgtatcc ggccgtccgc cgtgatccat 3000
gcggttaccg cccgcgtgtc gaacccaggt gtgcgacgtc agacaacggg ggagtgctcc 3060
ttttggcttc cttccaggcg cggcggctgc tgcgctagct tttttggcca ctggccgcgc 3120
gcagcgtaag cggttaggct ggaaagcgaa agcattaagt ggctcgctcc ctgtagccgg 3180
agggttattt tccaagggtt gagtcgcggg acccccggtt cgagtctcgg accgagactg 3240
ggggcgtaca ctggatggcc tttgcctgga acccgcactc aaaaacatgc tacctctttg 3300
agccctttgg cttttctgac cagcgactca agcaggttta ccagtttgag tacgagtcac 3360
tcctgcgccg tagcgccatt gcttcttccc ccgaccgctg tataacgctg gaaaagtcca 3420
cccaaagcgt acaggggccc aactcggccg cctgtggact attctgctgc atgtttctcc 3480
acgcctttgc caactggccc caaactccca tggatcacaa ccccaccatg aaccttatta 3540
ccggggtacc caactccatg ctcaacagtc cccaggtaca gcccaccctg cgtcgcaacc 3600
aggaacagct ctacagcttc ctggagcgcc actcgcccta cttccgcagc cacagtgcgc 3660
agattaggag cgccacttct ttttgtcact tgaaaaacat gtaaaaataa tgtactagag 3720
acactttcaa taaaggcaaa tgcttttatt tgtacactct cgggtgatta tttaccccca 3780
cccttgccgt ctgcgccgtt taaaaatcaa aggggttctg ccgcgcatcg ctatgcgcca 3840
ctggcaggga cacgttgcga tactggtgtt tagtgctcca cttaaactca ggcacaacca 3900
tccgcggcag ctcggtgaag ttttcactcc acaggctgcg caccatcacc aacgcgttta 3960
gcaggtcggg cgccgatatc ttgaagtcgc agttggggcc tccgccctgc gcgcgcgagt 4020
tgcgatacac agggttgcag cactggaaca ctatcagcgc cgggtggtgc acgctggcca 4080
gcacgctctt gtcggagatc agatccgcgt ccaggtcctc cgcgttgctc agggcgaacg 4140
gagtcaactt tggtagctgc cttcccaaaa agggcgcgtg cccaggcttt gagttgcact 4200
cgcaccgtag tggcatcaaa aggtgaccgt gcccggtctg ggcgttagga tacagcgcct 4260
gcataaaagc cttgatctgc ttaaaagcca cctgagcctt tgcgccttca gagaagaaca 4320
tgccgcaaga cttgccggaa aactgattgg ccggacaggc cgcgtcgtgc acgcagcacc 4380
ttgcgtcggt gttggagatc tgcaccacat ttcggcccca ccggttcttc acgatcttgg 4440
ccttgctaga ctgctccttc agcgcgcgct gcccgttttc gctcgtcaca tccatttcaa 4500
tcacgtgctc cttatttatc ataatgcttc cgtgtagaca cttaagctcg ccttcgatct 4560
cagcgcagcg gtgcagccac aacgcgcagc ccgtgggctc gtgatgcttg taggtcacct 4620
ctgcaaacga ctgcaggtac gcctgcagga atcgccccat catcgtcaca aaggtcttgt 4680
tgctggtgaa ggtcagctgc aacccgcggt gctcctcgtt cagccaggtc ttgcatacgg 4740
ccgccagagc ttccacttgg tcaggcagta gtttgaagtt cgcctttaga tcgttatcca 4800
cgtggtactt gtccatcagc gcgcgcgcag cctccatgcc cttctcccac gcagacacga 4860
tcggcacact cagcgggttc atcaccgtaa tttcactttc cgcttcgctg ggctcttcct 4920
cttcctcttg cgtccgcata ccacgcgcca ctgggtcgtc ttcattcagc cgccgcactg 4980
tgcgcttacc tcctttgcca tgcttgatta gcaccggtgg gttgctgaaa cccaccattt 5040
gtagcgccac atcttctctt tcttcctcgc tgtccacgat tacctctggt gatggcgggc 5100
gctcgggctt gggagaaggg cgcttctttt tcttcttggg cgcaatggcc aaatccgccg 5160
ccgaggtcga tggccgcggg ctgggtgtgc gcggcaccag cgcgtcttgt gatgagtctt 5220
cctcgtcctc ggactcgata cgccgcctca tccgcttttt tgggggcgcc cggggaggcg 5280
gcggcgacgg ggacggggac gacacgtcct ccatggttgg gggacgtcgc gccgcaccgc 5340
gtccgcgctc gggggtggtt tcgcgctgct cctcttcccg actggccatt tccttctcct 5400
ataggcagaa aaagatccac aaaagcgaag atcagcttcg gcgcacgctg gaagacgcgg 5460
aggctctctt cagtaaatac tgcgcgctga ctcttaagga ctagtttcgc gccctttctc 5520
aaatttaagc gcgaaaacta cgtcatctcc agcggccaca cccggcgcca gcacctgttg 5580
tcagcgccat tggcgcgccc gcccgccgcg cgcttcgctt tttatagggc cgccgccgcc 5640
gccgcctcgc cataaaagga aactttcgga gcgcgccgct ctgattggct gccgccgcac 5700
ctctccgcct cgccccgccc cgcccctcgc cccgccccgc cccgcctggc gcgcgccccc 5760
cccccccccc cgcccccatc gctgcacaaa ataattaaaa aataaataaa tacaaaattg 5820
ggggtgggga ggggggggag atggggagag tgaagcagaa cgtggggctc acctcgaggc 5880
cggccgaata tcttcattta aatgtgtgtc agttagggtg tggaaagtcc ccaggctccc 5940
cagcaggcag aagtatgcaa agcatgcatc tcaattagtc agcaaccagg tgtggaaagt 6000
ccccaggctc cccagcaggc agaagtatgc aaagcatgca tctcaattag tcagcaacca 6060
tagtcccgcc cctaactccg cccatcccgc ccctaactcc gcccagttcc gcccattctc 6120
cgccccatgg ctgactaatt ttttttattt atgcagaggc cgaggccgcc tcggcctctg 6180
agctattcca gaagtagtga ggaggctttt ttggaggcct aggcttttgc aaacgccggc 6240
gcaccgcggg cccgatccac cggtactgtt ggtaaagcca ccatgttttc cggtggcggc 6300
ggcccgctgt cccccggagg aaagtcggcg gccagggcgg cgtccgggtt ttttgcgccc 6360
gccggccctc gcggagccag ccggggaccc ccgccttgtt tgaggcaaaa cttttacaac 6420
ccctacctcg ccccagtcgg gacgcaacag aagccgaccg ggccaaccca gcgccatacg 6480
tactatagcg aatgcgatga atttcgattc atcgccccgc gggtgctgga cgaggatgcc 6540
cccccggaga agcgcgccgg ggtgcacgac ggtcacctca agcgcgcccc caaggtgtac 6600
tgcggggggg acgagcgcga cgtcctccgc gtcgggtcgg gcggcttctg gccgcggcgc 6660
tcgcgcctgt ggggcggcgt ggaccacgcc ccggcggggt tcaaccccac cgtcaccgtc 6720
tttcacgtgt acgacatcct ggagaacgtg gagcacgcgt acggcatgcg cgcggcccag 6780
ttccacgcgc ggtttatgga cgccatcaca ccgacgggga ccgtcatcac gctcctgggc 6840
ctgactccgg aaggccaccg ggtggccgtt cacgtttacg gcacgcggca gtacttttac 6900
atgaacaagg aggaggtcga caggcaccta caatgccgcg ccccacgaga tctctgcgag 6960
cgcatggccg cggccctgcg cgagtccccg ggcgcgtcgt tccgcggcat ctccgcggac 7020
cacttcgagg cggaggtggt ggagcgcacc gacgtgtact actacgagac gcgccccgct 7080
ctgttttacc gcgtctacgt ccgaagcggg cgcgtgctgt cgtacctgtg cgacaacttc 7140
tgcccggcca tcaagaagta cgagggtggg gtcgacgcca ccacccggtt catcctggac 7200
aaccccgggt tcgtcacctt cggctggtac cgtctcaaac cgggccggaa caacacgcta 7260
gcccagccgc gggccccgat ggccttcggg acatccagcg acgtcgagtt taactgtacg 7320
gcggacaacc tggccatcga ggggggcatg agcgacctac cggcatacaa gctcatgtgc 7380
ttcgatatcg aatgcaaggc ggggggggag gacgagctgg cctttccggt ggccgggcac 7440
ccggaggacc tggtcatcca gatatcctgt ctgctctacg acctgtccac caccgccctg 7500
gagcacgtcc tcctgttttc gctcggttcc tgcgacctcc ccgaatccca cctgaacgag 7560
ctggcggcca ggggcctgcc cacgcccgtg gttctggaat tcgacagcga attcgagatg 7620
ctgttggcct tcatgaccct tgtgaaacag tacggccccg agttcgtgac cgggtacaac 7680
atcatcaact tcgactggcc cttcttgctg gccaagctga cggacattta caaggtcccc 7740
ctggacgggt acggccgcat gaacggccgg ggcgtgtttc gcgtgtggga cataggccag 7800
agccacttcc agaagcgcag caagataaag gtgaacggca tggtgaacat cgacatgtac 7860
gggattataa ccgacaagat caagctctcg agctacaagc tcaacgccgt ggccgaagcc 7920
gtcctgaagg acaagaagaa ggacctgagc tatcgcgaca tccccgccta ctacgccgcc 7980
gggcccgcgc aacgcggggt gatcggcgag tactgcatac aggattccct gctggtgggc 8040
cagctgtttt ttaagttttt gccccatctg gagctctcgg ccgtcgcgcg cttggcgggt 8100
attaacatca cccgcaccat ctacgacggc cagcagatcc gcgtctttac gtgcctgctg 8160
cgcctggccg accagaaggg ctttattctg ccggacaccc aggggcgatt taggggcgcc 8220
gggggggagg cgcccaagcg tccggccgca gcccgggagg acgaggagcg gccagaggag 8280
gagggggagg acgaggacga acgcgaggag ggcgggggcg agcgggagcc ggagggcgcg 8340
cgggagaccg ccggcaggca cgtggggtac cagggggcca gggtccttga ccccacttcc 8400
gggtttcacg tgaaccccgt ggtggtgttc gactttgcca gcctgtaccc cagcatcatc 8460
caggcccaca acctgtgctt cagcacgctc tccctgaggg ccgacgcagt ggcgcacctg 8520
gaggcgggca aggactacct ggagatcgag gtgggggggc gacggctgtt cttcgtcaag 8580
gctcacgtgc gagagagcct cctcagcatc ctcctgcggg actggctcgc catgcgaaag 8640
cagatccgct cgcggattcc ccagagcagc cccgaggagg ccgtgctcct ggacaagcag 8700
caggccgcca tcaaggtcgt gtgtaactcg gtgtacgggt tcacgggagt gcagcacgga 8760
ctcctgccgt gcctgcacgt tgccgcgacg gtgacgacca tcggccgcga gatgctgctc 8820
gcgacccgcg agtacgtcca cgcgcgctgg gcggccttcg aacagctcct ggccgatttc 8880
ccggaggcgg ccgacatgcg cgcccccggg ccctattcca tgcgcatcat ctacggggac 8940
acggactcca tctttgtgct gtgccgcggc ctcacggccg ccgggctgac ggccgtgggc 9000
gacaagatgg cgagccacat ctcgcgcgcg ctgtttctgc cccccatcaa actcgagtgc 9060
gaaaagacgt tcaccaagct gctgctgatc gccaagaaaa agtacatcgg cgtcatctac 9120
gggggtaaga tgctcatcaa gggcgtggat ctggtgcgca aaaacaactg cgcgtttatc 9180
aaccgcacct ccagggccct ggtcgacctg ctgttttacg acgataccgt ctccggagcc 9240
gccgcggcgt tagccgagcg ccccgcggag gagtggctgg cgcgacccct gcccgaggga 9300
ctgcaggcgt tcggggccgt cctcgtagac gcccatcggc gcatcaccga cccggagagg 9360
gacatccagg actttgtcct caccgccgaa ctgagcagac acccgcgcgc gtacaccaac 9420
aagcgcctgg cccacctgac ggtgtattac aagctcatgg cccgccgcgc gcaggtcccg 9480
tccatcaagg accggatccc gtacgtgatc gtggcccaga cccgcgaggt agaggagacg 9540
gtcgcgcggc tggccgccct ccgcgagcta gacgccgccg ccccagggga cgagcccgcc 9600
ccccccgcgg ccctgccctc cccggccaag cgcccccggg agacgccgtc gcctgccgac 9660
cccccgggag gcgcgtccaa gccccgcaag ctgctggtgt ccgagctggc cgaggatccc 9720
gcatacgcca ttgcccacgg cgtcgccctg aacacggact attacttctc ccacctgttg 9780
ggggcggcgt gcgtgacatt caaggccctg tttgggaata acgccaagat caccgagagt 9840
ctgttaaaaa ggtttattcc cgaagtgtgg caccccccgg acgacgtggc cgcgcggctc 9900
cggaccgcag ggttcggggc ggtgggtgcc ggcgctacgg cggaggaaac tcgtcgaatg 9960
ttgcatagag cctttgatac tctagcagaa ttcggcagtg gagcaacaaa cttctctctg 10020
ctgaaacaag ccggagatgt cgaagagaat cctggaccga cggattcccc tggcggtgtg 10080
gcccccgcct cccccgtgga ggacgcgtcg gacgcgtccc tcgggcagcc ggaggagggg 10140
gcgccctgcc aggtggtcct gcagggcgcc gaacttaatg gaatcctaca ggcgtttgcc 10200
ccgctgcgca cgagccttct ggactcgctt ctggttatgg gcgaccgggg catccttatc 10260
cataacacga tctttgggga gcaggtgttc ctgcccctgg aacactcgca attcagtcgg 10320
tatcgctggc gcggacccac ggcggcgttc ctgtctctcg tggaccagaa gcgctccctc 10380
ctgagcgtgt ttcgcgccaa ccagtacccg gacctacgtc gggtggagtt ggcgatcacg 10440
ggccaggccc cgtttcgcac gctggttcag cgcatatgga cgacgacgtc cgacggcgag 10500
gccgttgagc tagccagcga gacgctgatg aagcgcgaac tgacgagctt tgtggtgctg 10560
gttccccagg gaacccccga cgttcagttg cgcctgacga ggccgcagct caccaaggtc 10620
cttaacgcga ccggggccga tagtgccacg cccaccacgt tcgagctcgg ggttaacggc 10680
aaattttccg tgttcaccac gagtacctgc gtcacctttg ctgcccgcga ggagggcgtg 10740
tcgtccagca ccagcaccca ggtccagatc ctgtccaacg cgctcaccaa ggcgggccag 10800
gccgccgcga acgccaagac ggtgtacggg gaaaataccc atcgcacctt ctctgtggtc 10860
gtcgacgatt gcagcatgcg ggcggtgctc cggcgactgc aggtcggcgg gggcaccctc 10920
aagttcttcc tcacgacccc cgtccccagt ctgtgcgtca ccgccaccgg tcccaacgcg 10980
gtatcggcgg tatttctcct gaaaccccag aagatttgcc tggactggct gggtcatagc 11040
caggggtctc cttcagccgg gagctcggcc tcccgggcct ctgggagcga gccaacagac 11100
agccaggact ccgcgtcgga cgcggtcagc cacggcgatc cggaagacct cgatggcgct 11160
gcccgggcgg gagaggcggg ggccttgcat gcctgtccga tgccgtcgtc gaccacgcgg 11220
gtcactccca cgaccaagcg ggggcgctcg gggggcgagg atgcgcgcgc ggacacggcc 11280
ctaaagaaac ctaagacggg gtcgcccacc gcacccccgc ccgcagatcc agtccccctg 11340
gacacggagg acgactccga tgcggcggac gggacggcgg cccgtcccgc cgctccagac 11400
gcccggagcg gaagccgtta cgcgtgttac tttcgcgacc tcccgaccgg agaagcaagc 11460
cccggcgcct tctccgcctt ccgggggggc ccccaaaccc cgtatggttt tggattcccc 11520
tgataggatc cgactgcagg tagctgtgcc ttctagttgc cagccatctg ttgtttgccc 11580
ctcccccgtg ccttccttga ccctggaagg tgccactccc actgtccttt cctaataaaa 11640
tgaggaaatt gcatcgcatt gtctgagtag gtgtcattct attctggggg gtggggtggg 11700
gcaggacagc aagggggagg attgggaaga caatagcagg catgctgggg atgcggtggg 11760
ctctatgggt ttaaacatcg atgcggccgc aacttgttta ttgcagctta taatggttac 11820
aaataaagca atagcatcac aaatttcaca aataaagcat ttttttcact gcattctagt 11880
tgtggtttgt ccaaactcat caatgtatct tagcttaacg ggcggcgaag gagaagtcca 11940
cgcctacatg ggggtagagt cataatcgtg catcaggata gggcggtggt gctgcagcag 12000
cgcgcgaata aactgctgcc gccgccgctc cgtcctgcag gaatacaaca tggcagtggt 12060
ctcctcagcg atgattcgca ccgcccgcag cataaggcgc cttgtcctcc gggcacagca 12120
gcgcaccctg atctcactta aatcagcaca gtaactgcag cacagcacca caatattgtt 12180
caaaatccca cagtgcaagg cgctgtatcc aaagctcatg gcggggacca cagaacccac 12240
gtggccatca taccacaagc gcaggtagat taagtggcga cccctcataa acacgctgga 12300
cataaacatt acctcttttg gcatgttgta attcaccacc tcccggtacc atataaacct 12360
ctgattaaac atggcgccat ccaccaccat cctaaaccag ctggccaaaa cctgcccgcc 12420
ggctatacac tgcagggaac cgggactgga acaatgacag tggagagccc aggactcgta 12480
accatggatc atcatgctcg tcatgatatc aatgttggca caacacaggc acacgtgcat 12540
acacttcctc aggattacaa gctcctcccg cgttagaacc atatcccagg gaacaaccca 12600
ttcctgaatc agcgtaaatc ccacactgca gggaagacct cgcacgtaac tcacgttgtg 12660
cattgtcaaa gtgttacatt cgggcagcag cggatgatcc tccagtatgg tagcgcgggt 12720
ttctgtctca aaaggaggta gacgatccct actgtacgga gtgcgccgag acaaccgaga 12780
tcgtgttggt cgtagtgtca tgccaaatgg aacgccggac gtagtcatat ttcctgaagc 12840
aaaaccaggt gcgggcgtga caaacagatc tgcgtctccg gtctcgccgc ttagatcgct 12900
ctgtgtagta gttgtagtat atccactctc tcaaagcatc caggcgcccc ctggcttcgg 12960
gttctatgta aactccttca tgcgccgctg ccctgataac atccaccacc gcagaataag 13020
ccacacccag ccaacctaca cattcgttct gcgagtcaca cacgggagga gcgggaagag 13080
ctggaagaac catgtttttt tttttattcc aaaagattat ccaaaacctc aaaatgaaga 13140
tctattaagt gaacgcgctc ccctccggtg gcgtggtcaa actctacagc caaagaacag 13200
ataatggcat ttgtaagatg ttgcacaatg gcttccaaaa ggcaaacggc cctcacgtcc 13260
aagtggacgt aaaggctaaa cccttcaggg tgaatctcct ctataaacat tccagcacct 13320
tcaaccatgc ccaaataatt ctcatctcgc caccttctca atatatctct aagcaaatcc 13380
cgaatattaa gtccggccat tgtaaaaatc tgctccagag cgccctccac cttcagcctc 13440
aagcagcgaa tcatgattgc aaaaattcag gttcctcaca gacctgtata agattcaaaa 13500
gcggaacatt aacaaaaata ccgcgatccc gtaggtccct tcgcagggcc agctgaacat 13560
aatcgtgcag gtctgcacgg accagcgcgg ccacttcccc gccaggaacc atgacaaaag 13620
aacccacact gattatgaca cgcatactcg gagctatgct aaccagcgta gccccgatgt 13680
aagcttgttg catgggcggc gatataaaat gcaaggtgct gctcaaaaaa tcaggcaaag 13740
cctcgcgcaa aaaagaaagc acatcgtagt catgctcatg cagataaagg caggtaagct 13800
ccggaaccac cacagaaaaa gacaccattt ttctctcaaa catgtctgcg ggtttctgca 13860
taaacacaaa ataaaataac aaaaaaacat ttaaacatta gaagcctgtc ttacaacagg 13920
aaaaacaacc cttataagca taagacggac tacggccatg ccggcgtgac cgtaaaaaaa 13980
ctggtcaccg tgattaaaaa gcaccaccga cagctcctcg gtcatgtccg gagtcataat 14040
gtaagactcg gtaaacacat caggttgatt cacatcggtc agtgctaaaa agcgaccgaa 14100
atagcccggg ggaatacata cccgcaggcg tagagacaac attacagccc ccataggagg 14160
tataacaaaa ttaataggag agaaaaacac ataaacacct gaaaaaccct cctgcctagg 14220
caaaatagca ccctcccgct ccagaacaac atacagcgct tccacagcgg cagccatggt 14280
ggcatttgca aaagcctagg cctccaaaaa agcctcctca ctacttctgg aatagctcag 14340
aggccgaggc ggcctcggcc tctgcataaa taaaaaaaat tagtcagcca tggggcggag 14400
aatgggcgga actgggcgga gttaggggcg ggatgggcgg agttaggggc gggactatgg 14460
ttgctgacta attgagatgc atgctttgca tacttctgcc tgctggggag cctggggact 14520
ttccacacct ggttgctgac taattgagat gcatgctttg catacttctg cctgctgggg 14580
agcctgggga ctttccacac cctaactgac acacacgtta cgtcacttcc cattttaaga 14640
aaactacaat tcccaacaca tacaagttac tccgccctta attaaatcgg atccgatatc 14700
tagatgtatt cgcgaggtac cgagctcgaa ttctctggcc gtcgttttac aacgtcgtga 14760
ctgggaaaac cctggcgtta cccaacttaa tcgccttgca gcacatcccc ctttcgccag 14820
ctggcgtaat agcgaagagg cccgcaccga tcgcccttcc caacagttgc gcagcctgaa 14880
tggcgaatgg cgcctgatgc ggtattttct ccttacgcat ctgtgcggta tttcacaccg 14940
catatggtgc actctcagta caatctgctc tgatgccgca tagttaagcc agccccgaca 15000
cccgccaaca cccgctgacg cgccctgacg ggcttgtctg ctcccggcat ccgcttacag 15060
acaagctgtg accgtctccg ggagctgcat gtgtcagagg ttttcaccgt catcaccgaa 15120
acgcgcga 15128
<210> 64
<211> 14364
<212> DNA
<213> Artificial Sequence
<220>
<223> Description of Artificial Sequence: Synthetic polynucleotide
<400> 64
tgcagctctg gcccgtgtct caaaatctct gatgttacat tgcacaagat aaaaatatat 60
catcatgaac aataaaactg tctgcttaca taaacagtaa tacaaggggt gttatgagcc 120
atattcaacg ggaaacgtcg aggccgcgat taaattccaa catggatgct gatttatatg 180
ggtataaatg ggctcgcgat aatgtcgggc aatcaggtgc gacaatctat cgcttgtatg 240
ggaagcccga tgcgccagag ttgtttctga aacatggcaa aggtagcgtt gccaatgatg 300
ttacagatga gatggtcaga ctaaactggc tgacggaatt tatgcctctt ccgaccatca 360
agcattttat ccgtactcct gatgatgcat ggttactcac cactgcgatc cccggaaaaa 420
cagcattcca ggtattagaa gaatatcctg attcaggtga aaatattgtt gatgcgctgg 480
cagtgttcct gcgccggttg cattcgattc ctgtttgtaa ttgtcctttt aacagcgatc 540
gcgtatttcg tctcgctcag gcgcaatcac gaatgaataa cggtttggtt gatgcgagtg 600
attttgatga cgagcgtaat ggctggcctg ttgaacaagt ctggaaagaa atgcataaac 660
ttttgccatt ctcaccggat tcagtcgtca ctcatggtga tttctcactt gataacctta 720
tttttgacga ggggaaatta ataggttgta ttgatgttgg acgagtcgga atcgcagacc 780
gataccagga tcttgccatc ctatggaact gcctcggtga gttttctcct tcattacaga 840
aacggctttt tcaaaaatat ggtattgata atcctgatat gaataaattg cagtttcatt 900
tgatgctcga tgagtttttc taatcagaat tggttaattg gttgtaacat tattcagatt 960
gggcttgatt taaaacttca tttttaattt aaaaggatct aggtgaagat cctttttgat 1020
aatctcatga ccaaaatccc ttaacgtgag ttttcgttcc actgagcgtc agaccccgta 1080
gaaaagatca aaggatcttc ttgagatcct ttttttctgc gcgtaatctg ctgcttgcaa 1140
acaaaaaaac caccgctacc agcggtggtt tgtttgccgg atcaagagct accaactctt 1200
tttccgaagg taactggctt cagcagagcg cagataccaa atactgttct tctagtgtag 1260
ccgtagttag gccaccactt caagaactct gtagcaccgc ctacatacct cgctctgcta 1320
atcctgttac cagtggctgc tgccagtggc gataagtcgt gtcttaccgg gttggactca 1380
agacgatagt taccggataa ggcgcagcgg tcgggctgaa cggggggttc gtgcacacag 1440
cccagcttgg agcgaacgac ctacaccgaa ctgagatacc tacagcgtga gctatgagaa 1500
agcgccacgc ttcccgaagg gagaaaggcg gacaggtatc cggtaagcgg cagggtcgga 1560
acaggagagc gcacgaggga gcttccaggg ggaaacgcct ggtatcttta tagtcctgtc 1620
gggtttcgcc acctctgact tgagcgtcga tttttgtgat gctcgtcagg ggggcggagc 1680
ctatggaaaa acgccagcaa cgcggccttt ttacggttcc tggccttttg ctggcctttt 1740
gctcacatgt tctttcctgc gttatcccct gattctgtgg ataaccgtat taccgccttt 1800
gagtgagctg ataccgctcg ccgcagccga acgaccgagc gcagcgagtc agtgagcgag 1860
gaagcggaag agcgcccaat acgcaaaccg cctctccccg cgcgttggcc gattcattaa 1920
tgcagctggc acgacaggtt tcccgactgg aaagcgggca gtgagcgcaa cgcaattaat 1980
gtgagttagc tcactcatta ggcaccccag gctttacact ttatgcttcc ggctcgtatg 2040
ttgtgtggaa ttgtgagcgg ataacaattt cacacaggaa acagctatga ccatgattac 2100
accaagcttg catgcaggcc tatccgtaga tgtacctgga catccaggtg atgccggcgg 2160
cggtggtgga ggcgcgcgga aagtcgcgga cgcggttcca gatgttgcgc agcggcaaaa 2220
agtgctccat ggtcgggacg ctctggccgg tgaggcgtgc gcagtcgttg acgctctaga 2280
ccgtgcaaaa ggagagcctg taagcgggca ctcttccgtg gtctggtgga taaattcgca 2340
agggtatcat ggcggacgac cggggttcga accccggatc cggccgtccg ccgtgatcca 2400
tgcggttacc gcccgcgtgt cgaacccagg tgtgcgacgt cagacaacgg gggagcgctc 2460
cttttggctt ccttccaggc gcggcggctg ctgcgctagc ttttttggcc actggccgcg 2520
cgcggcgtaa gcggttaggc tggaaagcga aagcattaag tggctcgctc cctgtagccg 2580
gagggttatt ttccaagggt tgagtcgcag gacccccggt tcgagtctcg ggccggccgg 2640
actgcggcga acgggggttt gcctccccgt catgcaagac cccgcttgca aattcctccg 2700
gaaacaggga cgagcccctt ttttgctttt cccagatgca tccggtgctg cggcagatgc 2760
gcccccctcc tcagcagcgg caagagcaag agcagcggca gacatgcagg gcaccctccc 2820
cttctcctac cgcgtcagga ggggcaacat cgatccagac atgataagat acattgatga 2880
gtttggacaa accacaacta gaatgcagtg aaaaaaatgc tttatttgtg aaatttgtga 2940
tgctattgct ttatttgtaa ccattataag ctgcaataaa caagtttgta cactctcggg 3000
tgattattta cccccaccct tgccgtctgc gccgtttaaa aatcaaaggg gttctgccgc 3060
gcatcgctat gcgccactgg cagggacacg ttgcgatact ggtgtttagt gctccactta 3120
aactcaggca caaccatccg cggcagctcg gtgaagtttt cactccacag gctgcgcacc 3180
atcaccaacg cgtttagcag gtcgggcgcc gatatcttga agtcgcagtt ggggcctccg 3240
ccctgcgcgc gcgagttgcg atacacaggg ttgcagcact ggaacactat cagcgccggg 3300
tggtgcacgc tggccagcac gctcttgtcg gagatcagat ccgcgtccag gtcctccgcg 3360
ttgctcaggg cgaacggagt caactttggt agctgccttc ccaaaaaggg cgcgtgccca 3420
ggctttgagt tgcactcgca ccgtagtggc atcaaaaggt gaccgtgccc ggtctgggcg 3480
ttaggataca gcgcctgcat aaaagccttg atctgcttaa aagccacctg agcctttgcg 3540
ccttcagaga agaacatgcc gcaagacttg ccggaaaact gattggccgg acaggccgcg 3600
tcgtgcacgc agcaccttgc gtcggtgttg gagatctgca ccacatttcg gccccaccgg 3660
ttcttcacga tcttggcctt gctagactgc tccttcagcg cgcgctgccc gttttcgctc 3720
gtcacatcca tttcaatcac gtgctcctta tttatcataa tgcttccgtg tagacactta 3780
agctcgcctt cgatctcagc gcagcggtgc agccacaacg cgcagcccgt gggctcgtga 3840
tgcttgtagg tcacctctgc aaacgactgc aggtacgcct gcaggaatcg ccccatcatc 3900
gtcacaaagg tcttgttgct ggtgaaggtc agctgcaacc cgcggtgctc ctcgttcagc 3960
caggtcttgc atacggccgc cagagcttcc acttggtcag gcagtagttt gaagttcgcc 4020
tttagatcgt tatccacgtg gtacttgtcc atcagcgcgc gcgcagcctc catgcccttc 4080
tcccacgcag acacgatcgg cacactcagc gggttcatca ccgtaatttc actttccgct 4140
tcgctgggct cttcctcttc ctcttgcgtc cgcataccac gcgccactgg gtcgtcttca 4200
ttcagccgcc gcactgtgcg cttacctcct ttgccatgct tgattagcac cggtgggttg 4260
ctgaaaccca ccatttgtag cgccacatct tctctttctt cctcgctgtc cacgattacc 4320
tctggtgatg gcgggcgctc gggcttggga gaagggcgct tctttttctt cttgggcgca 4380
atggccaaat ccgccgccga ggtcgatggc cgcgggctgg gtgtgcgcgg caccagcgcg 4440
tcttgtgatg agtcttcctc gtcctcggac tcgatacgcc gcctcatccg cttttttggg 4500
ggcgcccggg gaggcggcgg cgacggggac ggggacgaca cgtcctccat ggttggggga 4560
cgtcgcgccg caccgcgtcc gcgctcgggg gtggtttcgc gctgctcctc ttcccgactg 4620
gccatttcct tctcctatag gcagaaaaag atccacaaaa gcgaagatca gcttcggcgc 4680
acgctggaag acgcggaggc tctcttcagt aaatactgcg cgctgactct taaggactag 4740
tttcgcgccc tttctcaaat ttaagcgcga aaactacgtc atctccagcg gccacacccg 4800
gcgccagcac ctgttgtcag cgccattggc gcgcccgccc gccgcgcgct tcgcttttta 4860
tagggccgcc gccgccgccg cctcgccata aaaggaaact ttcggagcgc gccgctctga 4920
ttggctgccg ccgcacctct ccgcctcgcc ccgccccgcc cctcgccccg ccccgccccg 4980
cctggcgcgc gccccccccc cccccccgcc cccatcgctg cacaaaataa ttaaaaaata 5040
aataaataca aaattggggg tggggagggg ggggagatgg ggagagtgaa gcagaacgtg 5100
gggctcacct cgaggccggc cgaatatctt catttaaatg tgtgtcagtt agggtgtgga 5160
aagtccccag gctccccagc aggcagaagt atgcaaagca tgcatctcaa ttagtcagca 5220
accaggtgtg gaaagtcccc aggctcccca gcaggcagaa gtatgcaaag catgcatctc 5280
aattagtcag caaccatagt cccgccccta actccgccca tcccgcccct aactccgccc 5340
agttccgccc attctccgcc ccatggctga ctaatttttt ttatttatgc agaggccgag 5400
gccgcctcgg cctctgagct attccagaag tagtgaggag gcttttttgg aggcctaggc 5460
ttttgcaaac gccggcgcac cgcgggcccg atccaccggt actgttggta aagccaccat 5520
gttttccggt ggcggcggcc cgctgtcccc cggaggaaag tcggcggcca gggcggcgtc 5580
cgggtttttt gcgcccgccg gccctcgcgg agccagccgg ggacccccgc cttgtttgag 5640
gcaaaacttt tacaacccct acctcgcccc agtcgggacg caacagaagc cgaccgggcc 5700
aacccagcgc catacgtact atagcgaatg cgatgaattt cgattcatcg ccccgcgggt 5760
gctggacgag gatgcccccc cggagaagcg cgccggggtg cacgacggtc acctcaagcg 5820
cgcccccaag gtgtactgcg ggggggacga gcgcgacgtc ctccgcgtcg ggtcgggcgg 5880
cttctggccg cggcgctcgc gcctgtgggg cggcgtggac cacgccccgg cggggttcaa 5940
ccccaccgtc accgtctttc acgtgtacga catcctggag aacgtggagc acgcgtacgg 6000
catgcgcgcg gcccagttcc acgcgcggtt tatggacgcc atcacaccga cggggaccgt 6060
catcacgctc ctgggcctga ctccggaagg ccaccgggtg gccgttcacg tttacggcac 6120
gcggcagtac ttttacatga acaaggagga ggtcgacagg cacctacaat gccgcgcccc 6180
acgagatctc tgcgagcgca tggccgcggc cctgcgcgag tccccgggcg cgtcgttccg 6240
cggcatctcc gcggaccact tcgaggcgga ggtggtggag cgcaccgacg tgtactacta 6300
cgagacgcgc cccgctctgt tttaccgcgt ctacgtccga agcgggcgcg tgctgtcgta 6360
cctgtgcgac aacttctgcc cggccatcaa gaagtacgag ggtggggtcg acgccaccac 6420
ccggttcatc ctggacaacc ccgggttcgt caccttcggc tggtaccgtc tcaaaccggg 6480
ccggaacaac acgctagccc agccgcgggc cccgatggcc ttcgggacat ccagcgacgt 6540
cgagtttaac tgtacggcgg acaacctggc catcgagggg ggcatgagcg acctaccggc 6600
atacaagctc atgtgcttcg atatcgaatg caaggcgggg ggggaggacg agctggcctt 6660
tccggtggcc gggcacccgg aggacctggt catccagata tcctgtctgc tctacgacct 6720
gtccaccacc gccctggagc acgtcctcct gttttcgctc ggttcctgcg acctccccga 6780
atcccacctg aacgagctgg cggccagggg cctgcccacg cccgtggttc tggaattcga 6840
cagcgaattc gagatgctgt tggccttcat gacccttgtg aaacagtacg gccccgagtt 6900
cgtgaccggg tacaacatca tcaacttcga ctggcccttc ttgctggcca agctgacgga 6960
catttacaag gtccccctgg acgggtacgg ccgcatgaac ggccggggcg tgtttcgcgt 7020
gtgggacata ggccagagcc acttccagaa gcgcagcaag ataaaggtga acggcatggt 7080
gaacatcgac atgtacggga ttataaccga caagatcaag ctctcgagct acaagctcaa 7140
cgccgtggcc gaagccgtcc tgaaggacaa gaagaaggac ctgagctatc gcgacatccc 7200
cgcctactac gccgccgggc ccgcgcaacg cggggtgatc ggcgagtact gcatacagga 7260
ttccctgctg gtgggccagc tgttttttaa gtttttgccc catctggagc tctcggccgt 7320
cgcgcgcttg gcgggtatta acatcacccg caccatctac gacggccagc agatccgcgt 7380
ctttacgtgc ctgctgcgcc tggccgacca gaagggcttt attctgccgg acacccaggg 7440
gcgatttagg ggcgccgggg gggaggcgcc caagcgtccg gccgcagccc gggaggacga 7500
ggagcggcca gaggaggagg gggaggacga ggacgaacgc gaggagggcg ggggcgagcg 7560
ggagccggag ggcgcgcggg agaccgccgg caggcacgtg gggtaccagg gggccagggt 7620
ccttgacccc acttccgggt ttcacgtgaa ccccgtggtg gtgttcgact ttgccagcct 7680
gtaccccagc atcatccagg cccacaacct gtgcttcagc acgctctccc tgagggccga 7740
cgcagtggcg cacctggagg cgggcaagga ctacctggag atcgaggtgg gggggcgacg 7800
gctgttcttc gtcaaggctc acgtgcgaga gagcctcctc agcatcctcc tgcgggactg 7860
gctcgccatg cgaaagcaga tccgctcgcg gattccccag agcagccccg aggaggccgt 7920
gctcctggac aagcagcagg ccgccatcaa ggtcgtgtgt aactcggtgt acgggttcac 7980
gggagtgcag cacggactcc tgccgtgcct gcacgttgcc gcgacggtga cgaccatcgg 8040
ccgcgagatg ctgctcgcga cccgcgagta cgtccacgcg cgctgggcgg ccttcgaaca 8100
gctcctggcc gatttcccgg aggcggccga catgcgcgcc cccgggccct attccatgcg 8160
catcatctac ggggacacgg actccatctt tgtgctgtgc cgcggcctca cggccgccgg 8220
gctgacggcc gtgggcgaca agatggcgag ccacatctcg cgcgcgctgt ttctgccccc 8280
catcaaactc gagtgcgaaa agacgttcac caagctgctg ctgatcgcca agaaaaagta 8340
catcggcgtc atctacgggg gtaagatgct catcaagggc gtggatctgg tgcgcaaaaa 8400
caactgcgcg tttatcaacc gcacctccag ggccctggtc gacctgctgt tttacgacga 8460
taccgtctcc ggagccgccg cggcgttagc cgagcgcccc gcggaggagt ggctggcgcg 8520
acccctgccc gagggactgc aggcgttcgg ggccgtcctc gtagacgccc atcggcgcat 8580
caccgacccg gagagggaca tccaggactt tgtcctcacc gccgaactga gcagacaccc 8640
gcgcgcgtac accaacaagc gcctggccca cctgacggtg tattacaagc tcatggcccg 8700
ccgcgcgcag gtcccgtcca tcaaggaccg gatcccgtac gtgatcgtgg cccagacccg 8760
cgaggtagag gagacggtcg cgcggctggc cgccctccgc gagctagacg ccgccgcccc 8820
aggggacgag cccgcccccc ccgcggccct gccctccccg gccaagcgcc cccgggagac 8880
gccgtcgcct gccgaccccc cgggaggcgc gtccaagccc cgcaagctgc tggtgtccga 8940
gctggccgag gatcccgcat acgccattgc ccacggcgtc gccctgaaca cggactatta 9000
cttctcccac ctgttggggg cggcgtgcgt gacattcaag gccctgtttg ggaataacgc 9060
caagatcacc gagagtctgt taaaaaggtt tattcccgaa gtgtggcacc ccccggacga 9120
cgtggccgcg cggctccgga ccgcagggtt cggggcggtg ggtgccggcg ctacggcgga 9180
ggaaactcgt cgaatgttgc atagagcctt tgatactcta gcagaattcg gcagtggagc 9240
aacaaacttc tctctgctga aacaagccgg agatgtcgaa gagaatcctg gaccgacgga 9300
ttcccctggc ggtgtggccc ccgcctcccc cgtggaggac gcgtcggacg cgtccctcgg 9360
gcagccggag gagggggcgc cctgccaggt ggtcctgcag ggcgccgaac ttaatggaat 9420
cctacaggcg tttgccccgc tgcgcacgag ccttctggac tcgcttctgg ttatgggcga 9480
ccggggcatc cttatccata acacgatctt tggggagcag gtgttcctgc ccctggaaca 9540
ctcgcaattc agtcggtatc gctggcgcgg acccacggcg gcgttcctgt ctctcgtgga 9600
ccagaagcgc tccctcctga gcgtgtttcg cgccaaccag tacccggacc tacgtcgggt 9660
ggagttggcg atcacgggcc aggccccgtt tcgcacgctg gttcagcgca tatggacgac 9720
gacgtccgac ggcgaggccg ttgagctagc cagcgagacg ctgatgaagc gcgaactgac 9780
gagctttgtg gtgctggttc cccagggaac ccccgacgtt cagttgcgcc tgacgaggcc 9840
gcagctcacc aaggtcctta acgcgaccgg ggccgatagt gccacgccca ccacgttcga 9900
gctcggggtt aacggcaaat tttccgtgtt caccacgagt acctgcgtca cctttgctgc 9960
ccgcgaggag ggcgtgtcgt ccagcaccag cacccaggtc cagatcctgt ccaacgcgct 10020
caccaaggcg ggccaggccg ccgcgaacgc caagacggtg tacggggaaa atacccatcg 10080
caccttctct gtggtcgtcg acgattgcag catgcgggcg gtgctccggc gactgcaggt 10140
cggcgggggc accctcaagt tcttcctcac gacccccgtc cccagtctgt gcgtcaccgc 10200
caccggtccc aacgcggtat cggcggtatt tctcctgaaa ccccagaaga tttgcctgga 10260
ctggctgggt catagccagg ggtctccttc agccgggagc tcggcctccc gggcctctgg 10320
gagcgagcca acagacagcc aggactccgc gtcggacgcg gtcagccacg gcgatccgga 10380
agacctcgat ggcgctgccc gggcgggaga ggcgggggcc ttgcatgcct gtccgatgcc 10440
gtcgtcgacc acgcgggtca ctcccacgac caagcggggg cgctcggggg gcgaggatgc 10500
gcgcgcggac acggccctaa agaaacctaa gacggggtcg cccaccgcac ccccgcccgc 10560
agatccagtc cccctggaca cggaggacga ctccgatgcg gcggacggga cggcggcccg 10620
tcccgccgct ccagacgccc ggagcggaag ccgttacgcg tgttactttc gcgacctccc 10680
gaccggagaa gcaagccccg gcgccttctc cgccttccgg gggggccccc aaaccccgta 10740
tggttttgga ttcccctgat aggatccgac tgcaggtagc tgtgccttct agttgccagc 10800
catctgttgt ttgcccctcc cccgtgcctt ccttgaccct ggaaggtgcc actcccactg 10860
tcctttccta ataaaatgag gaaattgcat cgcattgtct gagtaggtgt cattctattc 10920
tggggggtgg ggtggggcag gacagcaagg gggaggattg ggaagacaat agcaggcatg 10980
ctggggatgc ggtgggctct atgggtttaa acatcgatgc ggccgcaact tgtttattgc 11040
agcttataat ggttacaaat aaagcaatag catcacaaat ttcacaaata aagcattttt 11100
ttcactgcat tctagttgtg gtttgtccaa actcatcaat gtatcttagc ttaacgggcg 11160
gcgaaggaga agtccacgcc tacatggggg tagagtcata atcgtgcatc aggatagggc 11220
ggtggtgctg cagcagcgcg cgaataaact gctgccgccg ccgctccgtc ctgcaggaat 11280
acaacatggc agtggtctcc tcagcgatga ttcgcaccgc ccgcagcata aggcgccttg 11340
tcctccgggc acagcagcgc accctgatct cacttaaatc agcacagtaa ctgcagcaca 11400
gcaccacaat attgttcaaa atcccacagt gcaaggcgct gtatccaaag ctcatggcgg 11460
ggaccacaga acccacgtgg ccatcatacc acaagcgcag gtagattaag tggcgacccc 11520
tcataaacac gctggacata aacattacct cttttggcat gttgtaattc accacctccc 11580
ggtaccatat aaacctctga ttaaacatgg cgccatccac caccatccta aaccagctgg 11640
ccaaaacctg cccgccggct atacactgca gggaaccggg actggaacaa tgacagtgga 11700
gagcccagga ctcgtaacca tggatcatca tgctcgtcat gatatcaatg ttggcacaac 11760
acaggcacac gtgcatacac ttcctcagga ttacaagctc ctcccgcgtt agaaccatat 11820
cccagggaac aacccattcc tgaatcagcg taaatcccac actgcaggga agacctcgca 11880
cgtaactcac gttgtgcatt gtcaaagtgt tacattcggg cagcagcgga tgatcctcca 11940
gtatggtagc gcgggtttct gtctcaaaag gaggtagacg atccctactg tacggagtgc 12000
gccgagacaa ccgagatcgt gttggtcgta gtgtcatgcc aaatggaacg ccggacgtag 12060
tcatatttcc tgaagcaaaa ccaggtgcgg gcgtgacaaa cagatctgcg tctccggtct 12120
cgccgcttag atcgctctgt gtagtagttg tagtatatcc actctctcaa agcatccagg 12180
cgccccctgg cttcgggttc tatgtaaact ccttcatgcg ccgctgccct gataacatcc 12240
accaccgcag aataagccac acccagccaa cctacacatt cgttctgcga gtcacacacg 12300
ggaggagcgg gaagagctgg aagaaccatg tttttttttt tattccaaaa gattatccaa 12360
aacctcaaaa tgaagatcta ttaagtgaac gcgctcccct ccggtggcgt ggtcaaactc 12420
tacagccaaa gaacagataa tggcatttgt aagatgttgc acaatggctt ccaaaaggca 12480
aacggccctc acgtccaagt ggacgtaaag gctaaaccct tcagggtgaa tctcctctat 12540
aaacattcca gcaccttcaa ccatgcccaa ataattctca tctcgccacc ttctcaatat 12600
atctctaagc aaatcccgaa tattaagtcc ggccattgta aaaatctgct ccagagcgcc 12660
ctccaccttc agcctcaagc agcgaatcat gattgcaaaa attcaggttc ctcacagacc 12720
tgtataagat tcaaaagcgg aacattaaca aaaataccgc gatcccgtag gtcccttcgc 12780
agggccagct gaacataatc gtgcaggtct gcacggacca gcgcggccac ttccccgcca 12840
ggaaccatga caaaagaacc cacactgatt atgacacgca tactcggagc tatgctaacc 12900
agcgtagccc cgatgtaagc ttgttgcatg ggcggcgata taaaatgcaa ggtgctgctc 12960
aaaaaatcag gcaaagcctc gcgcaaaaaa gaaagcacat cgtagtcatg ctcatgcaga 13020
taaaggcagg taagctccgg aaccaccaca gaaaaagaca ccatttttct ctcaaacatg 13080
tctgcgggtt tctgcataaa cacaaaataa aataacaaaa aaacatttaa acattagaag 13140
cctgtcttac aacaggaaaa acaaccctta taagcataag acggactacg gccatgccgg 13200
cgtgaccgta aaaaaactgg tcaccgtgat taaaaagcac caccgacagc tcctcggtca 13260
tgtccggagt cataatgtaa gactcggtaa acacatcagg ttgattcaca tcggtcagtg 13320
ctaaaaagcg accgaaatag cccgggggaa tacatacccg caggcgtaga gacaacatta 13380
cagcccccat aggaggtata acaaaattaa taggagagaa aaacacataa acacctgaaa 13440
aaccctcctg cctaggcaaa atagcaccct cccgctccag aacaacatac agcgcttcca 13500
cagcggcagc catggtggca tttgcaaaag cctaggcctc caaaaaagcc tcctcactac 13560
ttctggaata gctcagaggc cgaggcggcc tcggcctctg cataaataaa aaaaattagt 13620
cagccatggg gcggagaatg ggcggaactg ggcggagtta ggggcgggat gggcggagtt 13680
aggggcggga ctatggttgc tgactaattg agatgcatgc tttgcatact tctgcctgct 13740
ggggagcctg gggactttcc acacctggtt gctgactaat tgagatgcat gctttgcata 13800
cttctgcctg ctggggagcc tggggacttt ccacacccta actgacacac acgttacgtc 13860
acttcccatt ttaagaaaac tacaattccc aacacataca agttactccg cccttaatta 13920
aatcggatcc gatatctaga tgtattcgcg aggtaccgag ctcgaattct ctggccgtcg 13980
ttttacaacg tcgtgactgg gaaaaccctg gcgttaccca acttaatcgc cttgcagcac 14040
atcccccttt cgccagctgg cgtaatagcg aagaggcccg caccgatcgc ccttcccaac 14100
agttgcgcag cctgaatggc gaatggcgcc tgatgcggta ttttctcctt acgcatctgt 14160
gcggtatttc acaccgcata tggtgcactc tcagtacaat ctgctctgat gccgcatagt 14220
taagccagcc ccgacacccg ccaacacccg ctgacgcgcc ctgacgggct tgtctgctcc 14280
cggcatccgc ttacagacaa gctgtgaccg tctccgggag ctgcatgtgt cagaggtttt 14340
caccgtcatc accgaaacgc gcga 14364
<210> 65
<211> 13814
<212> DNA
<213> Artificial Sequence
<220>
<223> Description of Artificial Sequence: Synthetic polynucleotide
<400> 65
tgcagctctg gcccgtgtct caaaatctct gatgttacat tgcacaagat aaaaatatat 60
catcatgaac aataaaactg tctgcttaca taaacagtaa tacaaggggt gttatgagcc 120
atattcaacg ggaaacgtcg aggccgcgat taaattccaa catggatgct gatttatatg 180
ggtataaatg ggctcgcgat aatgtcgggc aatcaggtgc gacaatctat cgcttgtatg 240
ggaagcccga tgcgccagag ttgtttctga aacatggcaa aggtagcgtt gccaatgatg 300
ttacagatga gatggtcaga ctaaactggc tgacggaatt tatgcctctt ccgaccatca 360
agcattttat ccgtactcct gatgatgcat ggttactcac cactgcgatc cccggaaaaa 420
cagcattcca ggtattagaa gaatatcctg attcaggtga aaatattgtt gatgcgctgg 480
cagtgttcct gcgccggttg cattcgattc ctgtttgtaa ttgtcctttt aacagcgatc 540
gcgtatttcg tctcgctcag gcgcaatcac gaatgaataa cggtttggtt gatgcgagtg 600
attttgatga cgagcgtaat ggctggcctg ttgaacaagt ctggaaagaa atgcataaac 660
ttttgccatt ctcaccggat tcagtcgtca ctcatggtga tttctcactt gataacctta 720
tttttgacga ggggaaatta ataggttgta ttgatgttgg acgagtcgga atcgcagacc 780
gataccagga tcttgccatc ctatggaact gcctcggtga gttttctcct tcattacaga 840
aacggctttt tcaaaaatat ggtattgata atcctgatat gaataaattg cagtttcatt 900
tgatgctcga tgagtttttc taatcagaat tggttaattg gttgtaacat tattcagatt 960
gggcttgatt taaaacttca tttttaattt aaaaggatct aggtgaagat cctttttgat 1020
aatctcatga ccaaaatccc ttaacgtgag ttttcgttcc actgagcgtc agaccccgta 1080
gaaaagatca aaggatcttc ttgagatcct ttttttctgc gcgtaatctg ctgcttgcaa 1140
acaaaaaaac caccgctacc agcggtggtt tgtttgccgg atcaagagct accaactctt 1200
tttccgaagg taactggctt cagcagagcg cagataccaa atactgttct tctagtgtag 1260
ccgtagttag gccaccactt caagaactct gtagcaccgc ctacatacct cgctctgcta 1320
atcctgttac cagtggctgc tgccagtggc gataagtcgt gtcttaccgg gttggactca 1380
agacgatagt taccggataa ggcgcagcgg tcgggctgaa cggggggttc gtgcacacag 1440
cccagcttgg agcgaacgac ctacaccgaa ctgagatacc tacagcgtga gctatgagaa 1500
agcgccacgc ttcccgaagg gagaaaggcg gacaggtatc cggtaagcgg cagggtcgga 1560
acaggagagc gcacgaggga gcttccaggg ggaaacgcct ggtatcttta tagtcctgtc 1620
gggtttcgcc acctctgact tgagcgtcga tttttgtgat gctcgtcagg ggggcggagc 1680
ctatggaaaa acgccagcaa cgcggccttt ttacggttcc tggccttttg ctggcctttt 1740
gctcacatgt tctttcctgc gttatcccct gattctgtgg ataaccgtat taccgccttt 1800
gagtgagctg ataccgctcg ccgcagccga acgaccgagc gcagcgagtc agtgagcgag 1860
gaagcggaag agcgcccaat acgcaaaccg cctctccccg cgcgttggcc gattcattaa 1920
tgcagctggc acgacaggtt tcccgactgg aaagcgggca gtgagcgcaa cgcaattaat 1980
gtgagttagc tcactcatta ggcaccccag gctttacact ttatgcttcc ggctcgtatg 2040
ttgtgtggaa ttgtgagcgg ataacaattt cacacaggaa acagctatga ccatgattac 2100
accaagcttg catgcaggcc tctgcagtcg accagaagca ccatgtcctt gggtccggcc 2160
tgctgaatgc gcaggcggtc ggccatgccc caggcttcgt tttgacatcg gcgcaggtct 2220
ttgtagtagt cttgcatgag cctttctacc ggcacttctt cttctccttc ctcttgtcct 2280
gcatctcttg catctatcgc tgcggcggcg gcggagtttg gccgtaggtg gcgccctctt 2340
cctcccatgc gtgtgacccc gaagcccctc atcggctgaa gcagggctag gtcggcgaca 2400
acgcgctcgg ctaatatggc ctgctgcacc tgcgtgaggg tagactggaa gtcatccatg 2460
tccacaaagc ggtggtatgc gcccgtgttg atggtgtaag tgcagttggc cataacggac 2520
cagttaacgg tctggtgacc cggctgcgag agctcggtgt acctgagacg cgagtaagcc 2580
ctcgagtcaa atacgtagtc gttgcaagtc cgcaccaggt actggtatcc caccaaaaag 2640
tgcggcggcg gctggcggta gaggggccag cgtagggtgg ccggggctcc gggggcgaga 2700
tcttccaaca taaggcgatg atatccgtag atgtacctgg acatccaggt gatgccggcg 2760
gcggtggtgg aggcgcgcgg aaagtcgcgg acgcggttcc agatgttgcg cagcggcaaa 2820
aagtgctcca tggtcgggac gctctggccg gtcaggcgcg cgcaatcgtt gacgctctag 2880
cgtgcaaaag gagagcctgt aagcgggcac tcttccgtgg tctggtggat aaattcgcaa 2940
gggtatcatg gcggacgacc ggggttcgag ccccgtatcc ggccgtccgc cgtgatccat 3000
gcggttaccg cccgcgtgtc gaacccaggt gtgcgacgtc agacaacggg ggagtgctcc 3060
ttttggcttc cttccaggcg cggcggctgc tgcgctagct tttttggcca ctggccgcgc 3120
gcagcgtaag cggttaggct ggaaagcgaa agcattaagt ggctcgctcc ctgtagccgg 3180
agggttattt tccaagggtt gagtcgcggg acccccggtt cgagtctcgg accgagactg 3240
ggggcgtaca ctggatggcc tttgcctgga acccgcactc aaaaacatgc tacctctttg 3300
agccctttgg cttttctgac cagcgactca agcaggttta ccagtttgag tacgagtcac 3360
tcctgcgccg tagcgccatt gcttcttccc ccgaccgctg tataacgctg gaaaagtcca 3420
cccaaagcgt acaggggccc aactcggccg cctgtggact attctgctgc atgtttctcc 3480
acgcctttgc caactggccc caaactccca tggatcacaa ccccaccatg aaccttatta 3540
ccggggtacc caactccatg ctcaacagtc cccaggtaca gcccaccctg cgtcgcaacc 3600
aggaacagct ctacagcttc ctggagcgcc actcgcccta cttccgcagc cacagtgcgc 3660
agattaggag cgccacttct ttttgtcact tgaaaaacat gtaaaaataa tgtactagag 3720
acactttcaa taaaggcaaa tgcttttatt tgtacactct cgggtgatta tttaccccca 3780
cccttgccgt ctgcgccgtt taaaaatcaa aggggttctg ccgcgcatcg ctatgcgcca 3840
ctggcaggga cacgttgcga tactggtgtt tagtgctcca cttaaactca ggcacaacca 3900
tccgcggcag ctcggtgaag ttttcactcc acaggctgcg caccatcacc aacgcgttta 3960
gcaggtcggg cgccgatatc ttgaagtcgc agttggggcc tccgccctgc gcgcgcgagt 4020
tgcgatacac agggttgcag cactggaaca ctatcagcgc cgggtggtgc acgctggcca 4080
gcacgctctt gtcggagatc agatccgcgt ccaggtcctc cgcgttgctc agggcgaacg 4140
gagtcaactt tggtagctgc cttcccaaaa agggcgcgtg cccaggcttt gagttgcact 4200
cgcaccgtag tggcatcaaa aggtgaccgt gcccggtctg ggcgttagga tacagcgcct 4260
gcataaaagc cttgatctgc ttaaaagcca cctgagcctt tgcgccttca gagaagaaca 4320
tgccgcaaga cttgccggaa aactgattgg ccggacaggc cgcgtcgtgc acgcagcacc 4380
ttgcgtcggt gttggagatc tgcaccacat ttcggcccca ccggttcttc acgatcttgg 4440
ccttgctaga ctgctccttc agcgcgcgct gcccgttttc gctcgtcaca tccatttcaa 4500
tcacgtgctc cttatttatc ataatgcttc cgtgtagaca cttaagctcg ccttcgatct 4560
cagcgcagcg gtgcagccac aacgcgcagc ccgtgggctc gtgatgcttg taggtcacct 4620
ctgcaaacga ctgcaggtac gcctgcagga atcgccccat catcgtcaca aaggtcttgt 4680
tgctggtgaa ggtcagctgc aacccgcggt gctcctcgtt cagccaggtc ttgcatacgg 4740
ccgccagagc ttccacttgg tcaggcagta gtttgaagtt cgcctttaga tcgttatcca 4800
cgtggtactt gtccatcagc gcgcgcgcag cctccatgcc cttctcccac gcagacacga 4860
tcggcacact cagcgggttc atcaccgtaa tttcactttc cgcttcgctg ggctcttcct 4920
cttcctcttg cgtccgcata ccacgcgcca ctgggtcgtc ttcattcagc cgccgcactg 4980
tgcgcttacc tcctttgcca tgcttgatta gcaccggtgg gttgctgaaa cccaccattt 5040
gtagcgccac atcttctctt tcttcctcgc tgtccacgat tacctctggt gatggcgggc 5100
gctcgggctt gggagaaggg cgcttctttt tcttcttggg cgcaatggcc aaatccgccg 5160
ccgaggtcga tggccgcggg ctgggtgtgc gcggcaccag cgcgtcttgt gatgagtctt 5220
cctcgtcctc ggactcgata cgccgcctca tccgcttttt tgggggcgcc cggggaggcg 5280
gcggcgacgg ggacggggac gacacgtcct ccatggttgg gggacgtcgc gccgcaccgc 5340
gtccgcgctc gggggtggtt tcgcgctgct cctcttcccg actggccatt tccttctcct 5400
ataggcagaa aaagatccac aaaagcgaag atcagcttcg gcgcacgctg gaagacgcgg 5460
aggctctctt cagtaaatac tgcgcgctga ctcttaagga ctagtttcgc gccctttctc 5520
aaatttaagc gcgaaaacta cgtcatctcc agcggccaca cccggcgcca gcacctgttg 5580
tcagcgccat tggcgcgccc gcccgccgcg cgcttcgctt tttatagggc cgccgccgcc 5640
gccgcctcgc cataaaagga aactttcgga gcgcgccgct ctgattggct gccgccgcac 5700
ctctccgcct cgccccgccc cgcccctcgc cccgccccgc cccgcctggc gcgcgccccc 5760
cccccccccc cgcccccatc gctgcacaaa ataattaaaa aataaataaa tacaaaattg 5820
ggggtgggga ggggggggag atggggagag tgaagcagaa cgtggggctc acctcgaggc 5880
cggccgaata tcttcattta aataaatgag tcttcggacc tcgcgggggc cgcttaagcg 5940
gtggttaggg tttgtctgac gcggggggag ggggaaggaa cgaaacactc tcattcggag 6000
gcggctcggg gtttggtctt ggtggccacg ggcacgcaga agagcgccgc gatcctctta 6060
agcacccccc cgccctccgt ggaggcgggg gtttggtcgg cgggtggtaa ctggcgggcc 6120
gctgactcgg gcgggtcgcg cgccccagag tgtgaccttt tcggtctgct cgcagacccc 6180
cgggcggcgc cgccgcggcg gcgacgggct cgctgggtcc taggctccat ggggaccgta 6240
tacgtggaca ggctctggag catccgcacg actgcggtga tattaccgga gaccttctgc 6300
gggacgagcc gggtcacgcg gctgacgcgg agcgtccgtt gggcgacaaa caccaggacg 6360
gggcacaggt acactatctt gtcacccgga ggcgcgaggg actgcaggag cttcagggag 6420
tggcgcagct gcttcatccc cgtggcccgt tgctcgcgtt tgctggcggt gtccccggaa 6480
gaaatatatt tgcatgtctt tagttctatg atgacacaaa ccccgcccag cgtcttgtca 6540
ttggcgaatt cgaacacgca gatgcagtcg gggcggcgcg gtcccaggtc cacttcgcat 6600
attaaggtga cgcgtgtggc ctcgaacacc gagcgaccct gcagcgaccc gcttaagcca 6660
ccatggagac aaagcccaag acggcaacca ccatcaaggt cccccccggg cccctgggat 6720
acgtgtacgc tcgcgcgtgt ccgtccgaag gcatcgagct tctggcgtta ctgtcggcac 6780
gcagcggcga ttccgacgtc gccgtggcgc ccctggtcgt gggcctgacc gtggagagcg 6840
gctttgaggc caacgtggcc gtggtcgtgg gttctcgcac gacggggctc gggggtaccg 6900
cggtgtccct gaaactgacg ccctcgcact acagctcgtc cgtgtacgtc tttcacggcg 6960
gccggcacct ggaccccagc acccaggccc cgaacctgac gcgactttgc gagcgggcac 7020
gccgccattt tggcttttcg gactacaccc cccggcccgg cgacctcaaa cacgagacga 7080
cgggggaggc gctgtgtgag cgcctcggcc tggacccgga ccgcgccctc ctgtatctgg 7140
tcgttaccga gggcttcaag gaggccgtgt gcatcaacaa cacctttctg cacctgggag 7200
gctcggacaa ggtaaccata ggcggggcgg aggtgcaccg catacccgtg tacccgttgc 7260
agctgttcat gccggatttt agccgtgtca tcgcagagcc gttcaacgcc aaccaccgat 7320
cgatcgggga gaattttacc tacccgcttc cgttttttaa ccgccccctc aaccgcctcc 7380
tgttcgaggc ggtcgtggga cccgccgccg tggcactgcg atgccgaaac gtggacgccg 7440
tggcccgcgc cgccgcccac ctggcgtttg acgaaaacca cgagggcgcc gccctccccg 7500
ccgacattac gttcacggcc ttcgaagcca gccagggtaa gaccccgcgg ggcgggcgcg 7560
acggcggcgg caagggcccg gcgggcgggt tcgaacagcg cctggcctcc gtcatggccg 7620
gagacgccgc cctggccctc gagtctatcg tgtcgatggc cgtctttgac gagccgccca 7680
ccgacatctc cgcgtggccg ctgttcgagg gccaggacac ggccgcggcc cgcgccaacg 7740
ccgtcggggc gtacctggcg cgcgccgcgg gactcgtggg ggccatggta tttagcacca 7800
actcggccct ccatctcacc gaggtggacg acgccggccc ggcggaccca aaggaccaca 7860
gcaaaccctc cttttaccgc ttcttcctcg tgcccgggac ccacgtggcg gccaacccac 7920
aggtggaccg cgagggacac gtggtgcccg ggttcgaggg tcggcccacc gcgcccctcg 7980
tcggcggaac ccaggaattt gccggcgagc acctggccat gctgtgtggg ttttccccgg 8040
cgctgctggc caagatgctg ttttacctgg agcgctgcga cggcggcgtg atcgtcgggc 8100
gccaggagat ggacgtgttt cgatacgtcg cggactccaa ccagaccgac gtgccctgta 8160
acctatgcac cttcgacacg cgccacgcct gcgtacacac gacgctcatg cgcctccggg 8220
cgcgccatcc aaagttcgcc agcgccgccc gcggagccat cggcgtcttc gggaccatga 8280
acagcatgta tagcgactgc gacgtgctgg gaaactacgc cgccttctcg gccctgaagc 8340
gcgcggacgg atccgagacc gcccggacca tcatgcagga gacgtaccgc gcggcgaccg 8400
agcgcgtcat ggccgaactc gagaccctgc agtacgtgga ccaggcggtc cccacggcca 8460
tggggcggct ggagaccatc atcaccaacc gcgaggccct gcatacggtg gtgaacaacg 8520
tcaggcaggt cgtggaccgc gaggtggagc agctgatgcg caacctggtg gaggggagga 8580
acttcaagtt tcgcgacggt ctgggcgagg ccaaccacgc catgtccctg acgctggacc 8640
cgtacgcgtg cgggccgtgc cccctgcttc agcttctcgg gcggcgatcc aacctcgccg 8700
tgtaccagga cctggccctg agtcagtgcc acggggtgtt cgccgggcag tcggtcgagg 8760
ggcgcaactt tcgcaatcaa ttccaaccgg tgctgcggcg gcgcgtgatg gacatgttta 8820
acaacgggtt tctgtcggcc aaaacgctga cggtcgcgct ctcggagggg gcggctatct 8880
gcgcccccag cctaacggcg ggccagacgg cccccgccga gagcagcttc gagggcgacg 8940
ttgcccgcgt gaccctgggg tttcccaagg agctgcgcgt caagagccgc gtgttgttcg 9000
cgggcgcgag cgccaacgcg tccgaggccg ccaaggcgcg ggtcgccagc ctccagagcg 9060
cctaccagaa gcccgacaag cgcgtggaca tcctcctcgg accgctgggc tttctgctca 9120
agcagttcca cgcggccatc ttccccaacg gcaagccccc ggggtccaac cagccgaacc 9180
cgcagtggtt ctggacggcc ctccaacgca accagcttcc cgcccggctc ctgtcgcgcg 9240
aggacatcga gaccatcgcg ttcattaaaa agttttccct ggactacggc gcgataaact 9300
ttattaacct ggcccccaac aacgtgagcg agctggcgat gtactacatg gcaaaccaga 9360
ttctgcggta ctgcgatcac tcgacatact tcatcaacac ccttacggcc atcatcgcgg 9420
ggtcccgccg tccccccagc gtgcaggctg ccgccgcgtg gtccgcgcag ggcggggcgg 9480
gcctggaggc cggggcccgc gcgctgatgg acgccgtgga cgcgcatccg ggcgcgtgga 9540
cgtccatgtt cgccagctgc aacctgctgc ggcccgtcat ggcggcgcgc cccatggtcg 9600
tgttggggtt gagcatcagc aagtactacg gcatggccgg caacgaccgt gtgtttcagg 9660
ccgggaactg ggccagcctg atgggcggca aaaacgcgtg cccgctcctt atttttgacc 9720
gcacccgcaa gttcgtcctg gcctgtcccc gggccgggtt tgtgtgcgcg gcctcaagcc 9780
tcggcggcgg agcgcacgaa agctcgctgt gcgagcagct ccggggcatt atctccgagg 9840
gcggggcggc cgtcgccagt agcgtgttcg tggcgaccgt gaaaagcctg gggccccgca 9900
cccagcagct gcagatcgag gactggctgg cgctcctgga ggacgagtac ctaagcgagg 9960
agatgatgga gctgaccgcg cgtgccctgg agcgcggcaa cggcgagtgg tcgacggacg 10020
cggccctgga ggtggcgcac gaggccgagg ccctagtcag ccaactcggc aacgccgggg 10080
aggtgtttaa ctttggggat tttggctgcg aggacgacaa cgcgacgccg ttcggcggcc 10140
cgggggcccc gggaccggca tttgccggcc gcaaacgggc gttccacggg gatgacccgt 10200
ttggggaggg gccccccgac aaaaagggag acctgacgtt ggatatgctg tagtaacggc 10260
aataaaaaga cagaataaaa cgcacggtgt tgggtcgttt gttcgtttaa acatcgatgc 10320
ggccgccgtt tgtgttatgt ttcaacgtgt ttatttttca attgcagaaa atttcaagtc 10380
atttttcatt cagtagtata gccccaccac cacatagctt atacagatca ccgtacctta 10440
atcaaactca cagaacccta gtattcaacc tgccacctcc ctcccaacac acagagtaca 10500
cagtcctttc tccccggctg gccttaaaaa gcatcatatc atgggtaaca gacatattct 10560
taggtgttat attccacacg gtttcctgtc gagccaaacg ctcatcagtg atattaataa 10620
actccccggg cagctcactt aagttcatgt cgctgtccag ctgctgagcc acaggctgct 10680
gtccaacttg cggttgctta acgggcggcg aaggagaagt ccacgcctac atgggggtag 10740
agtcataatc gtgcatcagg atagggcggt ggtgctgcag cagcgcgcga ataaactgct 10800
gccgccgccg ctccgtcctg caggaataca acatggcagt ggtctcctca gcgatgattc 10860
gcaccgcccg cagcataagg cgccttgtcc tccgggcaca gcagcgcacc ctgatctcac 10920
ttaaatcagc acagtaactg cagcacagca ccacaatatt gttcaaaatc ccacagtgca 10980
aggcgctgta tccaaagctc atggcgggga ccacagaacc cacgtggcca tcataccaca 11040
agcgcaggta gattaagtgg cgacccctca taaacacgct ggacataaac attacctctt 11100
ttggcatgtt gtaattcacc acctcccggt accatataaa cctctgatta aacatggcgc 11160
catccaccac catcctaaac cagctggcca aaacctgccc gccggctata cactgcaggg 11220
aaccgggact ggaacaatga cagtggagag cccaggactc gtaaccatgg atcatcatgc 11280
tcgtcatgat atcaatgttg gcacaacaca ggcacacgtg catacacttc ctcaggatta 11340
caagctcctc ccgcgttaga accatatccc agggaacaac ccattcctga atcagcgtaa 11400
atcccacact gcagggaaga cctcgcacgt aactcacgtt gtgcattgtc aaagtgttac 11460
attcgggcag cagcggatga tcctccagta tggtagcgcg ggtttctgtc tcaaaaggag 11520
gtagacgatc cctactgtac ggagtgcgcc gagacaaccg agatcgtgtt ggtcgtagtg 11580
tcatgccaaa tggaacgccg gacgtagtca tatttcctga agcaaaacca ggtgcgggcg 11640
tgacaaacag atctgcgtct ccggtctcgc cgcttagatc gctctgtgta gtagttgtag 11700
tatatccact ctctcaaagc atccaggcgc cccctggctt cgggttctat gtaaactcct 11760
tcatgcgccg ctgccctgat aacatccacc accgcagaat aagccacacc cagccaacct 11820
acacattcgt tctgcgagtc acacacggga ggagcgggaa gagctggaag aaccatgttt 11880
ttttttttat tccaaaagat tatccaaaac ctcaaaatga agatctatta agtgaacgcg 11940
ctcccctccg gtggcgtggt caaactctac agccaaagaa cagataatgg catttgtaag 12000
atgttgcaca atggcttcca aaaggcaaac ggccctcacg tccaagtgga cgtaaaggct 12060
aaacccttca gggtgaatct cctctataaa cattccagca ccttcaacca tgcccaaata 12120
attctcatct cgccaccttc tcaatatatc tctaagcaaa tcccgaatat taagtccggc 12180
cattgtaaaa atctgctcca gagcgccctc caccttcagc ctcaagcagc gaatcatgat 12240
tgcaaaaatt caggttcctc acagacctgt ataagattca aaagcggaac attaacaaaa 12300
ataccgcgat cccgtaggtc ccttcgcagg gccagctgaa cataatcgtg caggtctgca 12360
cggaccagcg cggccacttc cccgccagga accatgacaa aagaacccac actgattatg 12420
acacgcatac tcggagctat gctaaccagc gtagccccga tgtaagcttg ttgcatgggc 12480
ggcgatataa aatgcaaggt gctgctcaaa aaatcaggca aagcctcgcg caaaaaagaa 12540
agcacatcgt agtcatgctc atgcagataa aggcaggtaa gctccggaac caccacagaa 12600
aaagacacca tttttctctc aaacatgtct gcgggtttct gcataaacac aaaataaaat 12660
aacaaaaaaa catttaaaca ttagaagcct gtcttacaac aggaaaaaca acccttataa 12720
gcataagacg gactacggcc atgccggcgt gaccgtaaaa aaactggtca ccgtgattaa 12780
aaagcaccac cgacagctcc tcggtcatgt ccggagtcat aatgtaagac tcggtaaaca 12840
catcaggttg attcacatcg gtcagtgcta aaaagcgacc gaaatagccc gggggaatac 12900
atacccgcag gcgtagagac aacattacag cccccatagg aggtataaca aaattaatag 12960
gagagaaaaa cacataaaca cctgaaaaac cctcctgcct aggcaaaata gcaccctccc 13020
gctccagaac aacatacagc gcttccacag cggcagccat aacagtcagc cttaccagta 13080
aaaaagaaaa cctattaaaa aaacaccact cgacacggca ccagctcaat cagtcacagt 13140
gtaaaaaagg gccaagtgca gagcgagtat atataggact aaaaaatgac gtaacggtta 13200
aagtccacaa aaaacaccca gaaaaccgca cgcgaaccta cgcccagaaa cgaaagccaa 13260
aaaacccaca acttcctcaa atcgtcactt ccgttttccc acgttacgtc acttcccatt 13320
ttaagaaaac tacaattccc aacacataca agttactccg cccttaatta aatcggatcc 13380
gatatctaga tgtattcgcg aggtaccgag ctcgaattct ctggccgtcg ttttacaacg 13440
tcgtgactgg gaaaaccctg gcgttaccca acttaatcgc cttgcagcac atcccccttt 13500
cgccagctgg cgtaatagcg aagaggcccg caccgatcgc ccttcccaac agttgcgcag 13560
cctgaatggc gaatggcgcc tgatgcggta ttttctcctt acgcatctgt gcggtatttc 13620
acaccgcata tggtgcactc tcagtacaat ctgctctgat gccgcatagt taagccagcc 13680
ccgacacccg ccaacacccg ctgacgcgcc ctgacgggct tgtctgctcc cggcatccgc 13740
ttacagacaa gctgtgaccg tctccgggag ctgcatgtgt cagaggtttt caccgtcatc 13800
accgaaacgc gcga 13814
<210> 66
<211> 6883
<212> DNA
<213> Artificial Sequence
<220>
<223> Description of Artificial Sequence: Synthetic
polynucleotide
<400> 66
tgcagctctg gcccgtgtct caaaatctct gatgttacat tgcacaagat aaaaatatat 60
catcatgaac aataaaactg tctgcttaca taaacagtaa tacaaggggt gttatgagcc 120
atattcaacg ggaaacgtcg aggccgcgat taaattccaa catggatgct gatttatatg 180
ggtataaatg ggctcgcgat aatgtcgggc aatcaggtgc gacaatctat cgcttgtatg 240
ggaagcccga tgcgccagag ttgtttctga aacatggcaa aggtagcgtt gccaatgatg 300
ttacagatga gatggtcaga ctaaactggc tgacggaatt tatgcctctt ccgaccatca 360
agcattttat ccgtactcct gatgatgcat ggttactcac cactgcgatc cccggaaaaa 420
cagcattcca ggtattagaa gaatatcctg attcaggtga aaatattgtt gatgcgctgg 480
cagtgttcct gcgccggttg cattcgattc ctgtttgtaa ttgtcctttt aacagcgatc 540
gcgtatttcg tctcgctcag gcgcaatcac gaatgaataa cggtttggtt gatgcgagtg 600
attttgatga cgagcgtaat ggctggcctg ttgaacaagt ctggaaagaa atgcataaac 660
ttttgccatt ctcaccggat tcagtcgtca ctcatggtga tttctcactt gataacctta 720
tttttgacga ggggaaatta ataggttgta ttgatgttgg acgagtcgga atcgcagacc 780
gataccagga tcttgccatc ctatggaact gcctcggtga gttttctcct tcattacaga 840
aacggctttt tcaaaaatat ggtattgata atcctgatat gaataaattg cagtttcatt 900
tgatgctcga tgagtttttc taatcagaat tggttaattg gttgtaacat tattcagatt 960
gggcttgatt taaaacttca tttttaattt aaaaggatct aggtgaagat cctttttgat 1020
aatctcatga ccaaaatccc ttaacgtgag ttttcgttcc actgagcgtc agaccccgta 1080
gaaaagatca aaggatcttc ttgagatcct ttttttctgc gcgtaatctg ctgcttgcaa 1140
acaaaaaaac caccgctacc agcggtggtt tgtttgccgg atcaagagct accaactctt 1200
tttccgaagg taactggctt cagcagagcg cagataccaa atactgttct tctagtgtag 1260
ccgtagttag gccaccactt caagaactct gtagcaccgc ctacatacct cgctctgcta 1320
atcctgttac cagtggctgc tgccagtggc gataagtcgt gtcttaccgg gttggactca 1380
agacgatagt taccggataa ggcgcagcgg tcgggctgaa cggggggttc gtgcacacag 1440
cccagcttgg agcgaacgac ctacaccgaa ctgagatacc tacagcgtga gctatgagaa 1500
agcgccacgc ttcccgaagg gagaaaggcg gacaggtatc cggtaagcgg cagggtcgga 1560
acaggagagc gcacgaggga gcttccaggg ggaaacgcct ggtatcttta tagtcctgtc 1620
gggtttcgcc acctctgact tgagcgtcga tttttgtgat gctcgtcagg ggggcggagc 1680
ctatggaaaa acgccagcaa cgcggccttt ttacggttcc tggccttttg ctggcctttt 1740
gctcacatgt tctttcctgc gttatcccct gattctgtgg ataaccgtat taccgccttt 1800
gagtgagctg ataccgctcg ccgcagccga acgaccgagc gcagcgagtc agtgagcgag 1860
gaagcggaag agcgcccaat acgcaaaccg cctctccccg cgcgttggcc gattcattaa 1920
tgcagctggc acgacaggtt tcccgactgg aaagcgggca gtgagcgcaa cgcaattaat 1980
gtgagttagc tcactcatta ggcaccccag gctttacact ttatgcttcc ggctcgtatg 2040
ttgtgtggaa ttgtgagcgg ataacaattt cacacaggaa acagctatga ccatgattac 2100
accaagcttg catgctacgt aatccgtaga tgtacctgga catccaggtg atgccggcgg 2160
cggtggtgga ggcgcgcgga aagtcgcgga cgcggttcca gatgttgcgc agcggcaaaa 2220
agtgctccat ggtcgggacg ctctggccgg tgaggcgtgc gcagtcgttg acgctctaga 2280
ccgtgcaaaa ggagagcctg taagcgggca ctcttccgtg gtctggtgga taaattcgca 2340
agggtatcat ggcggacgac cggggttcga accccggatc cggccgtccg ccgtgatcca 2400
tgcggttacc gcccgcgtgt cgaacccagg tgtgcgacgt cagacaacgg gggagcgctc 2460
cttttggctt ccttccaggc gcggcggctg ctgcgctagc ttttttggcc actggccgcg 2520
cgcggcgtaa gcggttaggc tggaaagcga aagcattaag tggctcgctc cctgtagccg 2580
gagggttatt ttccaagggt tgagtcgcag gacccccggt tcgagtctcg ggccggccgg 2640
actgcggcga acgggggttt gcctccccgt catgcaagac cccgcttgca aattcctccg 2700
gaaacaggga cgagcccctt ttttgctttt cccagatgca tccggtgctg cggcagatgc 2760
gcccccctcc tcagcagcgg caagagcaag agcagcggca gacatgcagg gcaccctccc 2820
cttctcctac cgcgtcagga ggggcaacat cgatccagac atgataagat acattgatga 2880
gtttggacaa accacaacta gaatgcagtg aaaaaaatgc tttatttgtg aaatttgtga 2940
tgctattgct ttatttgtaa ccattataag ctgcaataaa caagtttgta cactctcggg 3000
tgattattta cccccaccct tgccgtctgc gccgtttaaa aatcaaaggg gttctgccgc 3060
gcatcgctat gcgccactgg cagggacacg ttgcgatact ggtgtttagt gctccactta 3120
aactcaggca caaccatccg cggcagctcg gtgaagtttt cactccacag gctgcgcacc 3180
atcaccaacg cgtttagcag gtcgggcgcc gatatcttga agtcgcagtt ggggcctccg 3240
ccctgcgcgc gcgagttgcg atacacaggg ttgcagcact ggaacactat cagcgccggg 3300
tggtgcacgc tggccagcac gctcttgtcg gagatcagat ccgcgtccag gtcctccgcg 3360
ttgctcaggg cgaacggagt caactttggt agctgccttc ccaaaaaggg cgcgtgccca 3420
ggctttgagt tgcactcgca ccgtagtggc atcaaaaggt gaccgtgccc ggtctgggcg 3480
ttaggataca gcgcctgcat aaaagccttg atctgcttaa aagccacctg agcctttgcg 3540
ccttcagaga agaacatgcc gcaagacttg ccggaaaact gattggccgg acaggccgcg 3600
tcgtgcacgc agcaccttgc gtcggtgttg gagatctgca ccacatttcg gccccaccgg 3660
ttcttcacga tcttggcctt gctagactgc tccttcagcg cgcgctgccc gttttcgctc 3720
gtcacatcca tttcaatcac gtgctcctta tttatcataa tgcttccgtg tagacactta 3780
agctcgcctt cgatctcagc gcagcggtgc agccacaacg cgcagcccgt gggctcgtga 3840
tgcttgtagg tcacctctgc aaacgactgc aggtacgcct gcaggaatcg ccccatcatc 3900
gtcacaaagg tcttgttgct ggtgaaggtc agctgcaacc cgcggtgctc ctcgttcagc 3960
caggtcttgc atacggccgc cagagcttcc acttggtcag gcagtagttt gaagttcgcc 4020
tttagatcgt tatccacgtg gtacttgtcc atcagcgcgc gcgcagcctc catgcccttc 4080
tcccacgcag acacgatcgg cacactcagc gggttcatca ccgtaatttc actttccgct 4140
tcgctgggct cttcctcttc ctcttgcgtc cgcataccac gcgccactgg gtcgtcttca 4200
ttcagccgcc gcactgtgcg cttacctcct ttgccatgct tgattagcac cggtgggttg 4260
ctgaaaccca ccatttgtag cgccacatct tctctttctt cctcgctgtc cacgattacc 4320
tctggtgatg gcgggcgctc gggcttggga gaagggcgct tctttttctt cttgggcgca 4380
atggccaaat ccgccgccga ggtcgatggc cgcgggctgg gtgtgcgcgg caccagcgcg 4440
tcttgtgatg agtcttcctc gtcctcggac tcgatacgcc gcctcatccg cttttttggg 4500
ggcgcccggg gaggcggcgg cgacggggac ggggacgaca cgtcctccat ggttggggga 4560
cgtcgcgccg caccgcgtcc gcgctcgggg gtggtttcgc gctgctcctc ttcccgactg 4620
gccatggtgg cttccttctc ctataggcag ggcgcgcccg cccgccgcgc gcttcgcttt 4680
ttatagggcc gccgccgccg ccgcctcgcc ataaaaggaa actttcggag cgcgccgctc 4740
tgattggctg ccgccgcacc tctccgcctc gccccgcccc gcccctcgcc ccgccccgcc 4800
ccgcctggcg cgcgcccccc cccccccccc gcccccatcg ctgcacaaaa taattaaaaa 4860
ataaataaat acaaaattgg gggtggggag gggggggaga tggggagagt gaagcagaac 4920
gtggggctca cctcgaggcc ggccgaatat cttcatttaa atgtttaaac atcgatgcgg 4980
ccgcaacttg tttattgcag cttataatgg ttacaaataa agcaatagca tcacaaattt 5040
cacaaataaa gcattttttt cactgcattc tagttgtggt ttgtccaaac tcatcaatgt 5100
atcttagctt aacgggcggc gaaggagaag tccacgccta catgggggta gagtcataat 5160
cgtgcatcag gatagggcgg tggtgctgca gcagcgcgcg aataaactgc tgccgccgcc 5220
gctccgtcct gcaggaatac aacatggcag tggtctcctc agcgatgatt cgcaccgccc 5280
gcagcataag gcgccttgtc ctccgggcac agcagcgcac cctgatctca cttaaatcag 5340
cacagtaact gcagcacagc accacaatat tgttcaaaat cccacagtgc aaggcgctgt 5400
atccaaagct catggcgggg accacagaac ccacgtggcc atcataccac aagcgcaggt 5460
agattaagtg gcgacccctc ataaacacgc tggacataaa cattacctct tttggcatgt 5520
tgtaattcac cacctcccgg taccatataa acctctgatt aaacatggcg ccatccacca 5580
ccatcctaaa ccagctggcc aaaacctgcc cgccggctat acactgcagg gaaccgggac 5640
tggaacaatg acagtggaga gcccaggact cgtaaccatg gatcatcatg ctcgtcatga 5700
tatcaatgtt ggcacaacac aggcacacgt gcatacactt cctcaggatt acaagctcct 5760
cccgcgttag aaccatatcc cagggaacaa cccattcctg aatcagcgta aatcccacac 5820
tgcagggaag acctcgcacg taactcacgt tgtgcattgt caaagtgtta cattcgggca 5880
gcagcggatg atcctccagt atggtagcgc gggtttctgt ctcaaaagga ggtagacgat 5940
ccctactgta cggagtgcgc cgagacaacc gagatcgtgt tggtcgtagt gtcatgccaa 6000
atggaacgcc ggacgtagtc atatttcctg aagcaaaacc aggtgcgggc gtgacaaaca 6060
gatctgcgtc tccggtctcg ccgcttagat cgctctgtgt agtagttgta gtatatccac 6120
tctctcaaag catccaggcg ccccctggct tcgggttcta tgtaaactcc ttcatgcgcc 6180
gctgccctga taacatccac caccgcagaa taagccacac ccagccaacc tacacattcg 6240
ttctgcgagt cacacacggg aggagcggga agagctggaa gaaccatggt ggcatttgca 6300
aaagcctagg cctccaaaaa agcctcctca ctacttctgg aatagctcag aggccgaggc 6360
ggcctcggcc tctgcataaa taaaaaaaat tagtcagcca tggggcggag aatgggcgga 6420
actgggcgga gttaggggcg ggatgggcgg agttaggggc gggactatgg ttgctgacta 6480
attgagatgc atgctttgca tacttctgcc tgctggggag cctggggact ttccacacct 6540
ggttgctgac taattgagat gcatgctttg catacttctg cctgctgggg agcctgggga 6600
ctttccacac cctaactgac acacacgtta cgtcacttcc cattttaaga aaactacaat 6660
tcccaacaca tacaagttac tccgccctta attaacatat ggtgcactct cagtacaatc 6720
tgctctgatg ccgcatagtt aagccagccc cgacacccgc caacacccgc tgacgcgccc 6780
tgacgggctt gtctgctccc ggcatccgct tacagacaag ctgtgaccgt ctccgggagc 6840
tgcatgtgtc agaggttttc accgtcatca ccgaaacgcg cga 6883
<210> 67
<211> 122
<212> DNA
<213> Artificial Sequence
<220>
<223> Description of Artificial Sequence: Synthetic
polynucleotide
<400> 67
aacttgttta ttgcagctta taatggttac aaataaagca atagcatcac aaatttcaca 60
aataaagcat ttttttcact gcattctagt tgtggtttgt ccaaactcat caatgtatct 120
ta 122
<210> 68
<211> 330
<212> DNA
<213> Artificial Sequence
<220>
<223> Description of Artificial Sequence: Synthetic
polynucleotide
<400> 68
gtgtgtcagt tagggtgtgg aaagtcccca ggctccccag caggcagaag tatgcaaagc 60
atgcatctca attagtcagc aaccaggtgt ggaaagtccc caggctcccc agcaggcaga 120
agtatgcaaa gcatgcatct caattagtca gcaaccatag tcccgcccct aactccgccc 180
atcccgcccc taactccgcc cagttccgcc cattctccgc cccatggctg actaattttt 240
tttatttatg cagaggccga ggccgcctcg gcctctgagc tattccagaa gtagtgagga 300
ggcttttttg gaggcctagg cttttgcaaa 330
<210> 69
<211> 225
<212> DNA
<213> Artificial Sequence
<220>
<223> Description of Artificial Sequence: Synthetic
polynucleotide
<400> 69
ctgtgccttc tagttgccag ccatctgttg tttgcccctc ccccgtgcct tccttgaccc 60
tggaaggtgc cactcccact gtcctttcct aataaaatga ggaaattgca tcgcattgtc 120
tgagtaggtg tcattctatt ctggggggtg gggtggggca ggacagcaag ggggaggatt 180
gggaagacaa tagcaggcat gctggggatg cggtgggctc tatgg 225
<210> 70
<211> 387
<212> DNA
<213> Artificial Sequence
<220>
<223> Description of Artificial Sequence: Synthetic
polynucleotide
<400> 70
taccgacggc gacaccttcg cgacatacaa caagacctcg ccctcccacg ataaaacgga 60
tccgtcctcc caaaaagtcc acaaatacac aaaaagagag gataattaaa acaatatgga 120
ggataccccc gacattacaa cagagatgcg gacgcccata cataaggggg cccgataaag 180
ccagcgaaaa atcgtgactg gctacactta gttggactac acaaatggct cagaatgtaa 240
tactgaggcc tgtactggct cctcgacagc caccacgaaa aattagtgcc actggtcaaa 300
aaaatgccag tgcggccgta ccggcatcag gcagaatacg aatattccca acaaaaagga 360
caacattctg tccgaagatt acaaatt 387
<210> 71
<211> 128
<212> PRT
<213> Artificial Sequence
<220>
<223> Description of Artificial Sequence: Synthetic
polypeptide
<400> 71
Met Ala Ala Ala Val Glu Ala Leu Tyr Val Val Leu Glu Arg Glu Gly
1 5 10 15
Ala Ile Leu Pro Arg Gln Glu Gly Phe Ser Gly Val Tyr Val Phe Phe
20 25 30
Ser Pro Ile Asn Phe Val Ile Pro Pro Met Gly Ala Val Met Leu Ser
35 40 45
Leu Arg Leu Arg Val Cys Ile Pro Pro Gly Tyr Phe Gly Arg Phe Leu
50 55 60
Ala Leu Thr Asp Val Asn Gln Pro Asp Val Phe Thr Glu Ser Tyr Ile
65 70 75 80
Met Thr Pro Asp Met Thr Glu Glu Leu Ser Val Val Leu Phe Asn His
85 90 95
Gly Asp Gln Phe Phe Tyr Gly His Ala Gly Met Ala Val Val Arg Leu
100 105 110
Met Leu Ile Arg Val Val Phe Pro Val Val Arg Gln Ala Ser Asn Val
115 120 125
<210> 72
<211> 411
<212> DNA
<213> Artificial Sequence
<220>
<223> Description of Artificial Sequence: Synthetic
polynucleotide
<400> 72
tacgtctttg ggcgtctgta caaactctct ttttaccaca gaaaaagaca ccaccaaggc 60
ctcgaatgga cggaaataga cgtactcgta ctgatgctac acgaaagaaa aaacgcgctc 120
cgaaacggac taaaaaactc gtcgtggaac gtaaaatata gcggcgggta cgttgttcga 180
atgtagcccc gatgcgacca atcgtatcga ggctcatacg cacagtatta gtcacaccca 240
agaaaacagt accaaggacc gccccttcac cggcgcgacc aggcacgtct ggacgtgcta 300
atacaagtcg accgggacgc ttccctggat gccctagcgc cataaaaaca attacaaggc 360
gaaaacttag aatatgtcca gacactcctt ggacttaaaa acgttagtac t 411
<210> 73
<211> 136
<212> PRT
<213> Artificial Sequence
<220>
<223> Description of Artificial Sequence: Synthetic
polypeptide
<400> 73
Met Gln Lys Pro Ala Asp Met Phe Glu Arg Lys Met Val Ser Phe Ser
1 5 10 15
Val Val Val Pro Glu Leu Thr Cys Leu Tyr Leu His Glu His Asp Tyr
20 25 30
Asp Val Leu Ser Phe Leu Arg Glu Ala Leu Pro Asp Phe Leu Ser Ser
35 40 45
Thr Leu His Phe Ile Ser Pro Pro Met Gln Gln Ala Tyr Ile Gly Ala
50 55 60
Thr Leu Val Ser Ile Ala Pro Ser Met Arg Val Ile Ile Ser Val Gly
65 70 75 80
Ser Phe Val Met Val Pro Gly Gly Glu Val Ala Ala Leu Val Arg Ala
85 90 95
Asp Leu His Asp Tyr Val Gln Leu Ala Leu Arg Arg Asp Leu Arg Asp
100 105 110
Arg Gly Ile Phe Val Asn Val Pro Leu Leu Asn Leu Ile Gln Val Cys
115 120 125
Glu Glu Pro Glu Phe Leu Gln Ser
130 135
<210> 74
<211> 351
<212> DNA
<213> Artificial Sequence
<220>
<223> Description of Artificial Sequence: Synthetic
polynucleotide
<400> 74
tactaagcga cgaactccga cttccacctc ccgcgagacc tcgtctaaaa atgttaccgg 60
cctgaattat aagccctaaa cgaatctcta tataactctt ccaccgctct actcttaata 120
aacccgtacc aacttccacg accttacaaa tatctcctct aagtgggact tcccaaatcg 180
gaaatgcagg tgaacctgca ctcccggcaa acggaaaacc ttcggtaaca cgttgtagaa 240
tgtttacggt aatagacaag aaaccgacat ctcaaactgg tgcggtggcc tcccctcgcg 300
caagtgaatt atctagaagt aaaactccaa aacctattag aaaaccttat t 351
<210> 75
<211> 116
<212> PRT
<213> Artificial Sequence
<220>
<223> Description of Artificial Sequence: Synthetic
polypeptide
<400> 75
Met Ile Arg Cys Leu Arg Leu Lys Val Glu Gly Ala Leu Glu Gln Ile
1 5 10 15
Phe Thr Met Ala Gly Leu Asn Ile Arg Asp Leu Leu Arg Asp Ile Leu
20 25 30
Arg Arg Trp Arg Asp Glu Asn Tyr Leu Gly Met Val Glu Gly Ala Gly
35 40 45
Met Phe Ile Glu Glu Ile His Pro Glu Gly Phe Ser Leu Tyr Val His
50 55 60
Leu Asp Val Arg Ala Val Cys Leu Leu Glu Ala Ile Val Gln His Leu
65 70 75 80
Thr Asn Ala Ile Ile Cys Ser Leu Ala Val Glu Phe Asp His Ala Thr
85 90 95
Gly Gly Glu Arg Val His Leu Ile Asp Leu His Phe Glu Val Leu Asp
100 105 110
Asn Leu Leu Glu
115
<210> 76
<211> 345
<212> DNA
<213> Artificial Sequence
<220>
<223> Description of Artificial Sequence: Synthetic
polynucleotide
<400> 76
taccaagaag gtcgagaagg gcgaggaggg cacacactga gcgtcttgct tacacatcca 60
accgacccac accgaataag acgccaccac ctacaatagt cccgtcgccg cgtacttcct 120
caaatgtatc ttgggcttcg gtcccccgcg gacctacgaa actctctcac ctatatgatg 180
ttgatgatgt gtctcgctag attcgccgct ctggcctctg cgtctagaca aacagtgcgg 240
gcgtggacca aaacgaagtc ctttatactg atgcaggccg caaggtaaac cgtactgtga 300
tgctggttgt gctagagcca acagagccgc gtgaggcatg tcatc 345
<210> 77
<211> 114
<212> PRT
<213> Artificial Sequence
<220>
<223> Description of Artificial Sequence: Synthetic
polypeptide
<400> 77
Met Val Leu Pro Ala Leu Pro Ala Pro Pro Val Cys Asp Ser Gln Asn
1 5 10 15
Glu Cys Val Gly Trp Leu Gly Val Ala Tyr Ser Ala Val Val Asp Val
20 25 30
Ile Arg Ala Ala Ala His Glu Gly Val Tyr Ile Glu Pro Glu Ala Arg
35 40 45
Gly Arg Leu Asp Ala Leu Arg Glu Trp Ile Tyr Tyr Asn Tyr Tyr Thr
50 55 60
Glu Arg Ser Lys Arg Arg Asp Arg Arg Arg Arg Ser Val Cys His Ala
65 70 75 80
Arg Thr Trp Phe Cys Phe Arg Lys Tyr Asp Tyr Val Arg Arg Ser Ile
85 90 95
Trp His Asp Thr Thr Thr Asn Thr Ile Ser Val Val Ser Ala His Ser
100 105 110
Val Gln
<210> 78
<211> 885
<212> DNA
<213> Artificial Sequence
<220>
<223> Description of Artificial Sequence: Synthetic
polynucleotide
<400> 78
tactgatgca ggccgcaagg taaaccgtac tgtgatgctg gttgtgctag agccaacaga 60
gccgcgtgag gcatgtcatc cctagcagat ggaggaaaac tctgtctttg ggcgcgatgg 120
tatgacctcc tagtaggcga cgacgggctt acattgtgaa actgttacgt gttgcactca 180
atgcacgctc cagaagggac gtcacaccct aaatgcgact aagtccttac ccaacaaggg 240
accctatacc aagattgcgc cctcctcgaa cattaggact ccttcacata cgtgcacacg 300
gacacaacac ggttgtaact atagtactgc tcgtactact aggtaccaat gctcaggacc 360
cgagaggtga cagtaacaag gtcagggcca agggacgtca catatcggcc gcccgtccaa 420
aaccggtcga ccaaatccta ccaccaccta ccgcggtaca aattagtctc caaatatacc 480
atggccctcc accacttaat gttgtacggt tttctccatt acaaatacag gtcgcacaaa 540
tactccccag cggtgaatta gatggacgcg aacaccatac taccggtgca cccaagacac 600
caggggcggt actcgaaacc tatgtcgcgg aacgtgacac cctaaaactt gttataacac 660
cacgacacga cgtcaatgac acgactaaat tcactctagt cccacgcgac gacacgggcc 720
tcctgttccg cggaatacga cgcccgccac gcttagtagc gactcctctg gtgacggtac 780
aacataagga cgtcctgcct cgccgccgcc gtcgtcaaat aagcgcgcga cgacgtcgtg 840
gtggcgggat aggactacgt gctaatactg agatgggggt acatc 885
<210> 79
<211> 294
<212> PRT
<213> Artificial Sequence
<220>
<223> Description of Artificial Sequence: Synthetic
polypeptide
<400> 79
Met Thr Thr Ser Gly Val Pro Phe Gly Met Thr Leu Arg Pro Thr Arg
1 5 10 15
Ser Arg Leu Ser Arg Arg Thr Pro Tyr Ser Arg Asp Arg Leu Pro Pro
20 25 30
Phe Glu Thr Glu Thr Arg Ala Thr Ile Leu Glu Asp His Pro Leu Leu
35 40 45
Pro Glu Cys Asn Thr Leu Thr Met His Asn Val Ser Tyr Val Arg Gly
50 55 60
Leu Pro Cys Ser Val Gly Phe Thr Leu Ile Gln Glu Trp Val Val Pro
65 70 75 80
Trp Asp Met Val Leu Thr Arg Glu Glu Leu Val Ile Leu Arg Lys Cys
85 90 95
Met His Val Cys Leu Cys Cys Ala Asn Ile Asp Ile Met Thr Ser Met
100 105 110
Met Ile His Gly Tyr Glu Ser Trp Ala Leu His Cys His Cys Ser Ser
115 120 125
Pro Gly Ser Leu Gln Cys Ile Ala Gly Gly Gln Val Leu Ala Ser Trp
130 135 140
Phe Arg Met Val Val Asp Gly Ala Met Phe Asn Gln Arg Phe Ile Trp
145 150 155 160
Tyr Arg Glu Val Val Asn Tyr Asn Met Pro Lys Glu Val Met Phe Met
165 170 175
Ser Ser Val Phe Met Arg Gly Arg His Leu Ile Tyr Leu Arg Leu Trp
180 185 190
Tyr Asp Gly His Val Gly Ser Val Val Pro Ala Met Ser Phe Gly Tyr
195 200 205
Ser Ala Leu His Cys Gly Ile Leu Asn Asn Ile Val Val Leu Cys Cys
210 215 220
Ser Tyr Cys Ala Asp Leu Ser Glu Ile Arg Val Arg Cys Cys Ala Arg
225 230 235 240
Arg Thr Arg Arg Leu Met Leu Arg Ala Val Arg Ile Ile Ala Glu Glu
245 250 255
Thr Thr Ala Met Leu Tyr Ser Cys Arg Thr Glu Arg Arg Arg Gln Gln
260 265 270
Phe Ile Arg Ala Leu Leu Gln His His Arg Pro Ile Leu Met His Asp
275 280 285
Tyr Asp Ser Thr Pro Met
290
<210> 80
<211> 1164
<212> DNA
<213> Artificial Sequence
<220>
<223> Description of Artificial Sequence: Synthetic
polynucleotide
<400> 80
atgactacgt ccggcgttcc atttggcatg acactacgac caacacgatc tcggttgtct 60
cggcgcactc cgtacagtag ggatcgtcta cctccttttg agacagaaac ccgcgctacc 120
atactggagg atcatccgct gctgcccgaa tgtaacactt tgacaatgca caacgtgagt 180
tacgtgcgag gtcttccctg cagtgtggga tttacgctga ttcaggaatg ggttgttccc 240
tgggatatgg ttctaacgcg ggaggagctt gtaatcctga ggaagtgtat gcacgtgtgc 300
ctgtgttgtg ccaacattga tatcatgacg agcatgatga tccatggtta cgagtcctgg 360
gctctccact gtcattgttc cagtcccggt tccctgcagt gtatagccgg cgggcaggtt 420
ttggccagct ggtttaggat ggtggtggat ggcgccatgt ttaatcagag gtttatatgg 480
taccgggagg tggtgaatta caacatgcca aaagaggtaa tgtttatgtc cagcgtgttt 540
atgaggggtc gccacttaat ctacctgcgc ttgtggtatg atggccacgt gggttctgtg 600
gtccccgcca tgagctttgg atacagcgcc ttgcactgtg ggattttgaa caatattgtg 660
gtgctgtgct gcagttactg tgctgattta agtgagatca gggtgcgctg ctgtgcccgg 720
aggacaaggc gccttatgct gcgggcggtg cgaatcatcg ctgaggagac cactgccatg 780
ttgtattcct gcaggacgga gcggcggcgg cagcagttta ttcgcgcgct gctgcagcac 840
caccgcccta tcctgatgca cgattatgac tctaccccca tgtaggcgtg gacttctcct 900
tcgccgcccg ttaagcaacc gcaagttgga cagcagcctg tggctcagca gctggacagc 960
gacatgaact taagtgagct gcccggggag tttattaata tcactgatga gcgtttggct 1020
cgacaggaaa ccgtgtggaa tataacacct aagaatatgt ctgttaccca tgatatgatg 1080
ctttttaagg ccagccgggg agaaaggact gtgtactctg tgtgttggga gggaggtggc 1140
aggttgaata ctagggttct gtga 1164
<210> 81
<211> 150
<212> PRT
<213> Artificial Sequence
<220>
<223> Description of Artificial Sequence: Synthetic
polypeptide
<400> 81
Met Thr Thr Ser Gly Val Pro Phe Gly Met Thr Leu Arg Pro Thr Arg
1 5 10 15
Ser Arg Leu Ser Arg Arg Thr Pro Tyr Ser Arg Asp Arg Leu Pro Pro
20 25 30
Phe Glu Thr Glu Thr Arg Ala Thr Ile Leu Glu Asp His Pro Leu Leu
35 40 45
Pro Glu Cys Asn Thr Leu Thr Met His Asn Ala Trp Thr Ser Pro Ser
50 55 60
Pro Pro Val Lys Gln Pro Gln Val Gly Gln Gln Pro Val Ala Gln Gln
65 70 75 80
Leu Asp Ser Asp Met Asn Leu Ser Glu Leu Pro Gly Glu Phe Ile Asn
85 90 95
Ile Thr Asp Glu Arg Leu Ala Arg Gln Glu Thr Val Trp Asn Ile Thr
100 105 110
Pro Lys Asn Met Ser Val Thr His Asp Met Met Leu Phe Lys Ala Ser
115 120 125
Arg Gly Glu Arg Thr Val Tyr Ser Val Cys Trp Glu Gly Gly Gly Arg
130 135 140
Leu Asn Thr Arg Val Leu
145 150
<210> 82
<211> 239
<212> DNA
<213> Artificial Sequence
<220>
<223> Description of Artificial Sequence: Synthetic
polynucleotide
<400> 82
gggaaaacgg aagtgacgat ttgaggaagt tgtgggtttt ttggctttcg tttctgggcg 60
taggttcgcg tgcggttttc tgggtgtttt ttgtggactt taaccgttac gtcatttttt 120
agtcctatat atactcgctc tgcacttggc ccttttttac actgtgactg attgagctgg 180
tgccgtgtcg agtggtgttt ttttaatagg ttttcttttt tactggtaag gctgactgt 239
<210> 83
<211> 330
<212> DNA
<213> Artificial Sequence
<220>
<223> Description of Artificial Sequence: Synthetic
polynucleotide
<400> 83
gtgtgtcagt tagggtgtgg aaagtcccca ggctccccag caggcagaag tatgcaaagc 60
atgcatctca attagtcagc aaccaggtgt ggaaagtccc caggctcccc agcaggcaga 120
agtatgcaaa gcatgcatct caattagtca gcaaccatag tcccgcccct aactccgccc 180
atcccgcccc taactccgcc cagttccgcc cattctccgc cccatggctg actaattttt 240
tttatttatg cagaggccga ggccgcctcg gcctctgagc tattccagaa gtagtgagga 300
ggcttttttg gaggcctagg cttttgcaaa 330
<210> 84
<211> 36
<212> DNA
<213> Artificial Sequence
<220>
<223> Description of Artificial Sequence: Synthetic
oligonucleotide
<400> 84
taagagtcag cgcgcagtat ttactgaaga gagcct 36
<210> 85
<211> 699
<212> DNA
<213> Artificial Sequence
<220>
<223> Description of Artificial Sequence: Synthetic
polynucleotide
<400> 85
atggagcact ttttgccgct gcgcaacatc tggaaccgcg tccgcgactt tccgcgcgcc 60
tccaccaccg ccgccggcat cacctggatg tccaggtaca tctacggata tcatcgcctt 120
atgttggaag atctcgcccc cggagccccg gccaccctac gctggcccct ctaccgccag 180
ccgccgccgc actttttggt gggataccag tacctggtgc ggacttgcaa cgactacgta 240
tttgactcga gggcttactc gcgtctcagg tacaccgagc tctcgcagcc gggtcaccag 300
accgttaact ggtccgttat ggccaactgc acttacacca tcaacacggg cgcataccac 360
cgctttgtgg acatggatga cttccagtct accctcacgc aggtgcagca ggccatatta 420
gccgagcgcg ttgtcgccga cctagccctg cttcagccga tgaggggctt cggggtcaca 480
cgcatgggag gaagagggcg ccacctacgg ccaaactccg ccgccgccgc agcgatagat 540
gcaagagatg caggacaaga ggaaggagaa gaagaagtgc cggtagaaag gctcatgcaa 600
gactactaca aagacctgcg ccgatgtcaa aacgaagcct ggggcatggc cgaccgcctg 660
cgcattcagc aggccggacc caaggacatg gtgcttctg 699
<210> 86
<211> 1590
<212> DNA
<213> Artificial Sequence
<220>
<223> Description of Artificial Sequence: Synthetic
polynucleotide
<400> 86
atggccagtc gggaagagga gcagcgcgaa accacccccg agcgcggacg cggtgcggcg 60
cgacgtcccc caaccatgga ggacgtgtcg tccccgtccc cgtcgccgcc gcctccccgg 120
gcgcccccaa aaaagcggat gaggcggcgt atcgagtccg aggacgagga agactcatca 180
caagacgcgc tggtgccgcg cacacccagc ccgcggccat cgacctcggc ggcggatttg 240
gccattgcgc ccaagaagaa aaagaagcgc ccttctccca agcccgagcg cccgccatca 300
ccagaggtaa tcgtggacag cgaggaagaa agagaagatg tggcgctaca aatggtgggt 360
ttcagcaacc caccggtgct aatcaagcat ggcaaaggag gtaagcgcac agtgcggcgg 420
ctgaatgaag acgacccagt ggcgcgtggt atgcggacgc aagaggaaga ggaagagccc 480
agcgaagcgg aaagtgaaat tacggtgatg aacccgctga gtgtgccgat cgtgtctgcg 540
tgggagaagg gcatggaggc tgcgcgcgcg ctgatggaca agtaccacgt ggataacgat 600
ctaaaggcga acttcaaact actgcctgac caagtggaag ctctggcggc cgtatgcaag 660
acctggctga acgaggagca ccgcgggttg cagctgacct tcaccagcaa caagaccttt 720
gtgacgatga tggggcgatt cctgcaggcg tacctgcagt cgtttgcaga ggtgacctac 780
aagcatcacg agcccacggg ctgcgcgttg tggctgcacc gctgcgctga gatcgaaggc 840
gagcttaagt gtctacacgg aagcattatg ataaataagg agcacgtgat tgaaatggat 900
gtgacgagcg aaaacgggca gcgcgcgctg aaggagcagt ctagcaaggc caagatcgtg 960
aagaaccggt ggggccgaaa tgtggtgcag atctccaaca ccgacgcaag gtgctgcgtg 1020
cacgacgcgg cctgtccggc caatcagttt tccggcaagt cttgcggcat gttcttctct 1080
gaaggcgcaa aggctcaggt ggcttttaag cagatcaagg cttttatgca ggcgctgtat 1140
cctaacgccc agaccgggca cggtcacctt ttgatgccac tacggtgcga gtgcaactca 1200
aagcctgggc acgcgccctt tttgggaagg cagctaccaa agttgactcc gttcgccctg 1260
agcaacgcgg aggacctgga cgcggatctg atctccgaca agagcgtgct ggccagcgtg 1320
caccacccgg cgctgatagt gttccagtgc tgcaaccctg tgtatcgcaa ctcgcgcgcg 1380
cagggcggag gccccaactg cgacttcaag atatcggcgc ccgacctgct aaacgcgttg 1440
gtgatggtgc gcagcctgtg gagtgaaaac ttcaccgagc tgccgcggat ggttgtgcct 1500
gagtttaagt ggagcactaa acaccagtat cgcaacgtgt ccctgccagt ggcgcatagc 1560
gatgcgcggc agaacccctt tgatttttaa 1590
<210> 87
<211> 278
<212> DNA
<213> Artificial Sequence
<220>
<223> Description of Artificial Sequence: Synthetic
polynucleotide
<400> 87
tcgaggtgag ccccacgttc tgcttcactc tccccatctc ccccccctcc ccacccccaa 60
ttttgtattt atttattttt taattatttt gtgcagcgat gggggcgggg gggggggggg 120
ggcgcgcgcc aggcggggcg gggcggggcg aggggcgggg cggggcgagg cggagaggtg 180
cggcggcagc caatcagagc ggcgcgctcc gaaagtttcc ttttatggcg aggcggcggc 240
ggcggcggcc ctataaaaag cgaagcgcgc ggcgggcg 278
<210> 88
<211> 35
<212> DNA
<213> Artificial Sequence
<220>
<223> Description of Artificial Sequence: Synthetic
oligonucleotide
<400> 88
acggcgcaga cggcaagggt gggggtaaat aatca 35
<210> 89
<211> 387
<212> DNA
<213> Artificial Sequence
<220>
<223> Description of Artificial Sequence: Synthetic
polynucleotide
<400> 89
atggctgccg ctgtggaagc gctgtatgtt gttctggagc gggagggtgc tattttgcct 60
aggcaggagg gtttttcagg tgtttatgtg tttttctctc ctattaattt tgttatacct 120
cctatggggg ctgtaatgtt gtctctacgc ctgcgggtat gtattccccc gggctatttc 180
ggtcgctttt tagcactgac cgatgtgaat caacctgatg tgtttaccga gtcttacatt 240
atgactccgg acatgaccga ggagctgtcg gtggtgcttt ttaatcacgg tgaccagttt 300
ttttacggtc acgccggcat ggccgtagtc cgtcttatgc ttataagggt tgtttttcct 360
gttgtaagac aggcttctaa tgtttaa 387
<210> 90
<211> 411
<212> DNA
<213> Artificial Sequence
<220>
<223> Description of Artificial Sequence: Synthetic
polynucleotide
<400> 90
atgcagaaac ccgcagacat gtttgagaga aaaatggtgt ctttttctgt ggtggttccg 60
gagcttacct gcctttatct gcatgagcat gactacgatg tgctttcttt tttgcgcgag 120
gctttgcctg attttttgag cagcaccttg cattttatat cgccgcccat gcaacaagct 180
tacatcgggg ctacgctggt tagcatagct ccgagtatgc gtgtcataat cagtgtgggt 240
tcttttgtca tggttcctgg cggggaagtg gccgcgctgg tccgtgcaga cctgcacgat 300
tatgttcagc tggccctgcg aagggaccta cgggatcgcg gtatttttgt taatgttccg 360
cttttgaatc ttatacaggt ctgtgaggaa cctgaatttt tgcaatcatg a 411
<210> 91
<211> 351
<212> DNA
<213> Artificial Sequence
<220>
<223> Description of Artificial Sequence: Synthetic
polynucleotide
<400> 91
atgattcgct gcttgaggct gaaggtggag ggcgctctgg agcagatttt tacaatggcc 60
ggacttaata ttcgggattt gcttagagat atattgagaa ggtggcgaga tgagaattat 120
ttgggcatgg ttgaaggtgc tggaatgttt atagaggaga ttcaccctga agggtttagc 180
ctttacgtcc acttggacgt gagggccgtt tgccttttgg aagccattgt gcaacatctt 240
acaaatgcca ttatctgttc tttggctgta gagtttgacc acgccaccgg aggggagcgc 300
gttcacttaa tagatcttca ttttgaggtt ttggataatc ttttggaata a 351
<210> 92
<211> 345
<212> DNA
<213> Artificial Sequence
<220>
<223> Description of Artificial Sequence: Synthetic
polynucleotide
<400> 92
atggttcttc cagctcttcc cgctcctccc gtgtgtgact cgcagaacga atgtgtaggt 60
tggctgggtg tggcttattc tgcggtggtg gatgttatca gggcagcggc gcatgaagga 120
gtttacatag aacccgaagc cagggggcgc ctggatgctt tgagagagtg gatatactac 180
aactactaca cagagcgatc taagcggcga gaccggagac gcagatctgt ttgtcacgcc 240
cgcacctggt tttgcttcag gaaatatgac tacgtccggc gttccatttg gcatgacact 300
acgaccaaca cgatctcggt tgtctcggcg cactccgtac agtag 345
<210> 93
<211> 885
<212> DNA
<213> Artificial Sequence
<220>
<223> Description of Artificial Sequence: Synthetic
polynucleotide
<400> 93
atgactacgt ccggcgttcc atttggcatg acactacgac caacacgatc tcggttgtct 60
cggcgcactc cgtacagtag ggatcgtcta cctccttttg agacagaaac ccgcgctacc 120
atactggagg atcatccgct gctgcccgaa tgtaacactt tgacaatgca caacgtgagt 180
tacgtgcgag gtcttccctg cagtgtggga tttacgctga ttcaggaatg ggttgttccc 240
tgggatatgg ttctaacgcg ggaggagctt gtaatcctga ggaagtgtat gcacgtgtgc 300
ctgtgttgtg ccaacattga tatcatgacg agcatgatga tccatggtta cgagtcctgg 360
gctctccact gtcattgttc cagtcccggt tccctgcagt gtatagccgg cgggcaggtt 420
ttggccagct ggtttaggat ggtggtggat ggcgccatgt ttaatcagag gtttatatgg 480
taccgggagg tggtgaatta caacatgcca aaagaggtaa tgtttatgtc cagcgtgttt 540
atgaggggtc gccacttaat ctacctgcgc ttgtggtatg atggccacgt gggttctgtg 600
gtccccgcca tgagctttgg atacagcgcc ttgcactgtg ggattttgaa caatattgtg 660
gtgctgtgct gcagttactg tgctgattta agtgagatca gggtgcgctg ctgtgcccgg 720
aggacaaggc gccttatgct gcgggcggtg cgaatcatcg ctgaggagac cactgccatg 780
ttgtattcct gcaggacgga gcggcggcgg cagcagttta ttcgcgcgct gctgcagcac 840
caccgcccta tcctgatgca cgattatgac tctaccccca tgtag 885
<210> 94
<211> 11366
<212> DNA
<213> Artificial Sequence
<220>
<223> Description of Artificial Sequence: Synthetic
polynucleotide
<400> 94
tgcagctctg gcccgtgtct caaaatctct gatgttacat tgcacaagat aaaaatatat 60
catcatgaac aataaaactg tctgcttaca taaacagtaa tacaaggggt gttatgagcc 120
atattcaacg ggaaacgtcg aggccgcgat taaattccaa catggatgct gatttatatg 180
ggtataaatg ggctcgcgat aatgtcgggc aatcaggtgc gacaatctat cgcttgtatg 240
ggaagcccga tgcgccagag ttgtttctga aacatggcaa aggtagcgtt gccaatgatg 300
ttacagatga gatggtcaga ctaaactggc tgacggaatt tatgcctctt ccgaccatca 360
agcattttat ccgtactcct gatgatgcat ggttactcac cactgcgatc cccggaaaaa 420
cagcattcca ggtattagaa gaatatcctg attcaggtga aaatattgtt gatgcgctgg 480
cagtgttcct gcgccggttg cattcgattc ctgtttgtaa ttgtcctttt aacagcgatc 540
gcgtatttcg tctcgctcag gcgcaatcac gaatgaataa cggtttggtt gatgcgagtg 600
attttgatga cgagcgtaat ggctggcctg ttgaacaagt ctggaaagaa atgcataaac 660
ttttgccatt ctcaccggat tcagtcgtca ctcatggtga tttctcactt gataacctta 720
tttttgacga ggggaaatta ataggttgta ttgatgttgg acgagtcgga atcgcagacc 780
gataccagga tcttgccatc ctatggaact gcctcggtga gttttctcct tcattacaga 840
aacggctttt tcaaaaatat ggtattgata atcctgatat gaataaattg cagtttcatt 900
tgatgctcga tgagtttttc taatcagaat tggttaattg gttgtaacat tattcagatt 960
gggcttgatt taaaacttca tttttaattt aaaaggatct aggtgaagat cctttttgat 1020
aatctcatga ccaaaatccc ttaacgtgag ttttcgttcc actgagcgtc agaccccgta 1080
gaaaagatca aaggatcttc ttgagatcct ttttttctgc gcgtaatctg ctgcttgcaa 1140
acaaaaaaac caccgctacc agcggtggtt tgtttgccgg atcaagagct accaactctt 1200
tttccgaagg taactggctt cagcagagcg cagataccaa atactgttct tctagtgtag 1260
ccgtagttag gccaccactt caagaactct gtagcaccgc ctacatacct cgctctgcta 1320
atcctgttac cagtggctgc tgccagtggc gataagtcgt gtcttaccgg gttggactca 1380
agacgatagt taccggataa ggcgcagcgg tcgggctgaa cggggggttc gtgcacacag 1440
cccagcttgg agcgaacgac ctacaccgaa ctgagatacc tacagcgtga gctatgagaa 1500
agcgccacgc ttcccgaagg gagaaaggcg gacaggtatc cggtaagcgg cagggtcgga 1560
acaggagagc gcacgaggga gcttccaggg ggaaacgcct ggtatcttta tagtcctgtc 1620
gggtttcgcc acctctgact tgagcgtcga tttttgtgat gctcgtcagg ggggcggagc 1680
ctatggaaaa acgccagcaa cgcggccttt ttacggttcc tggccttttg ctggcctttt 1740
gctcacatgt tctttcctgc gttatcccct gattctgtgg ataaccgtat taccgccttt 1800
gagtgagctg ataccgctcg ccgcagccga acgaccgagc gcagcgagtc agtgagcgag 1860
gaagcggaag agcgcccaat acgcaaaccg cctctccccg cgcgttggcc gattcattaa 1920
tgcagctggc acgacaggtt tcccgactgg aaagcgggca gtgagcgcaa cgcaattaat 1980
gtgagttagc tcactcatta ggcaccccag gctttacact ttatgcttcc ggctcgtatg 2040
ttgtgtggaa ttgtgagcgg ataacaattt cacacaggaa acagctatga ccatgattac 2100
accaagcttg catgcaggcc tatccgtaga tgtacctgga catccaggtg atgccggcgg 2160
cggtggtgga ggcgcgcgga aagtcgcgga cgcggttcca gatgttgcgc agcggcaaaa 2220
agtgctccat ggtcgggacg ctctggccgg tgaggcgtgc gcagtcgttg acgctctaga 2280
ccgtgcaaaa ggagagcctg taagcgggca ctcttccgtg gtctggtgga taaattcgca 2340
agggtatcat ggcggacgac cggggttcga accccggatc cggccgtccg ccgtgatcca 2400
tgcggttacc gcccgcgtgt cgaacccagg tgtgcgacgt cagacaacgg gggagcgctc 2460
cttttggctt ccttccaggc gcggcggctg ctgcgctagc ttttttggcc actggccgcg 2520
cgcggcgtaa gcggttaggc tggaaagcga aagcattaag tggctcgctc cctgtagccg 2580
gagggttatt ttccaagggt tgagtcgcag gacccccggt tcgagtctcg ggccggccgg 2640
actgcggcga acgggggttt gcctccccgt catgcaagac cccgcttgca aattcctccg 2700
gaaacaggga cgagcccctt ttttgctttt cccagatgca tccggtgctg cggcagatgc 2760
gcccccctcc tcagcagcgg caagagcaag agcagcggca gacatgcagg gcaccctccc 2820
cttctcctac cgcgtcagga ggggcaacat cgatccagac atgataagat acattgatga 2880
gtttggacaa accacaacta gaatgcagtg aaaaaaatgc tttatttgtg aaatttgtga 2940
tgctattgct ttatttgtaa ccattataag ctgcaataaa caagtttgta cactctcggg 3000
tgattattta cccccaccct tgccgtctgc gccgtttaaa aatcaaaggg gttctgccgc 3060
gcatcgctat gcgccactgg cagggacacg ttgcgatact ggtgtttagt gctccactta 3120
aactcaggca caaccatccg cggcagctcg gtgaagtttt cactccacag gctgcgcacc 3180
atcaccaacg cgtttagcag gtcgggcgcc gatatcttga agtcgcagtt ggggcctccg 3240
ccctgcgcgc gcgagttgcg atacacaggg ttgcagcact ggaacactat cagcgccggg 3300
tggtgcacgc tggccagcac gctcttgtcg gagatcagat ccgcgtccag gtcctccgcg 3360
ttgctcaggg cgaacggagt caactttggt agctgccttc ccaaaaaggg cgcgtgccca 3420
ggctttgagt tgcactcgca ccgtagtggc atcaaaaggt gaccgtgccc ggtctgggcg 3480
ttaggataca gcgcctgcat aaaagccttg atctgcttaa aagccacctg agcctttgcg 3540
ccttcagaga agaacatgcc gcaagacttg ccggaaaact gattggccgg acaggccgcg 3600
tcgtgcacgc agcaccttgc gtcggtgttg gagatctgca ccacatttcg gccccaccgg 3660
ttcttcacga tcttggcctt gctagactgc tccttcagcg cgcgctgccc gttttcgctc 3720
gtcacatcca tttcaatcac gtgctcctta tttatcataa tgcttccgtg tagacactta 3780
agctcgcctt cgatctcagc gcagcggtgc agccacaacg cgcagcccgt gggctcgtga 3840
tgcttgtagg tcacctctgc aaacgactgc aggtacgcct gcaggaatcg ccccatcatc 3900
gtcacaaagg tcttgttgct ggtgaaggtc agctgcaacc cgcggtgctc ctcgttcagc 3960
caggtcttgc atacggccgc cagagcttcc acttggtcag gcagtagttt gaagttcgcc 4020
tttagatcgt tatccacgtg gtacttgtcc atcagcgcgc gcgcagcctc catgcccttc 4080
tcccacgcag acacgatcgg cacactcagc gggttcatca ccgtaatttc actttccgct 4140
tcgctgggct cttcctcttc ctcttgcgtc cgcataccac gcgccactgg gtcgtcttca 4200
ttcagccgcc gcactgtgcg cttacctcct ttgccatgct tgattagcac cggtgggttg 4260
ctgaaaccca ccatttgtag cgccacatct tctctttctt cctcgctgtc cacgattacc 4320
tctggtgatg gcgggcgctc gggcttggga gaagggcgct tctttttctt cttgggcgca 4380
atggccaaat ccgccgccga ggtcgatggc cgcgggctgg gtgtgcgcgg caccagcgcg 4440
tcttgtgatg agtcttcctc gtcctcggac tcgatacgcc gcctcatccg cttttttggg 4500
ggcgcccggg gaggcggcgg cgacggggac ggggacgaca cgtcctccat ggttggggga 4560
cgtcgcgccg caccgcgtcc gcgctcgggg gtggtttcgc gctgctcctc ttcccgactg 4620
gccatttcct tctcctatag gcagaaaaag atcatggagt cagtcgagaa gaaggacagc 4680
ctaaccgccc cctctgagtt cgccaccacc gcctccaccg atgccgccaa cgcgcctacc 4740
accttccccg tcgaggcacc cccgcttgag gaggaggaag tgattatcga gcaggaccca 4800
ggttttgtaa gcgaagacga cgaggaccgc tcagtaccaa cagaggataa aaagcaagac 4860
caggacaacg cagaggcaaa cgaggaacaa gtcgggcggg gggacgaaag gcatggcgac 4920
tacctagatg tgggagacga cgtgctgttg aagcatctgc agcgccagtg cgccattatc 4980
tgcgacgcgt tgcaagagcg cagcgatgtg cccctcgcca tagcggatgt cagccttgcc 5040
tacgaacgcc acctattctc accgcgcgta ccccccaaac gccaagaaaa cggcacatgc 5100
gagcccaacc cgcgcctcaa cttctacccc gtatttgccg tgccagaggt gcttgccacc 5160
tatcacatct ttttccaaaa ctgcaagata cccctatcct gccgtgccaa ccgcagccga 5220
gcggacaagc agctggcctt gcggcagggc gctgtcatac ctgatatcgc ctcgctcaac 5280
gaagtgccaa aaatctttga gggtcttgga cgcgacgaga agcgcgcggc aaacgctctg 5340
caacaggaaa acagcgaaaa tgaaagtcac tctggagtgt tggtggaact cgagggtgac 5400
aacgcgcgcc tagccgtact aaaacgcagc atcgaggtca cccactttgc ctacccggca 5460
cttaacctac cccccaaggt catgagcaca gtcatgagtg agctgatcgt gcgccgtgcg 5520
cagcccctgg agagggatgc aaatttgcaa gaacaaacag aggagggcct acccgcagtt 5580
ggcgacgagc agctagcgcg ctggcttcaa acgcgcgagc ctgccgactt ggaggagcga 5640
cgcaaactaa tgatggccgc agtgctcgtt accgtggagc ttgagtgcat gcagcggttc 5700
tttgctgacc cggagatgca gcgcaagcta gaggaaacat tgcactacac ctttcgacag 5760
ggctacgtac gccaggcctg caagatctcc aacgtggagc tctgcaacct ggtctcctac 5820
cttggaattt tgcacgaaaa ccgccttggg caaaacgtgc ttcattccac gctcaagggc 5880
gaggcgcgcc gcgactacgt ccgcgactgc gtttacttat ttctatgcta cacctggcag 5940
acggccatgg gcgtttggca gcagtgcttg gaggagtgca acctcaagga gctgcagaaa 6000
ctgctaaagc aaaacttgaa ggacctatgg acggccttca acgagcgctc cgtggccgcg 6060
cacctggcgg acatcatttt ccccgaacgc ctgcttaaaa ccctgcaaca gggtctgcca 6120
gacttcacca gtcaaagcat gttgcagaac tttaggaact ttatcctaga gcgctcagga 6180
atcttgcccg ccacctgctg tgcacttcct agcgactttg tgcccattaa gtaccgcgaa 6240
tgccctccgc cgctttgggg ccactgctac cttctgcagc tagccaacta ccttgcctac 6300
cactctgaca taatggaaga cgtgagcggt gacggtctac tggagtgtca ctgtcgctgc 6360
aacctatgca ccccgcaccg ctccctggtt tgcaattcgc agctgcttaa cgaaagtcaa 6420
attatcggta cctttgagct gcagggtccc tcgcctgacg aaaagtccgc ggctccgggg 6480
ttgaaactca ctccggggct gtggacgtcg gcttaccttc gcaaatttgt acctgaggac 6540
taccacgccc acgagattag gttctacgaa gaccaatccc gcccgcctaa tgcggagctt 6600
accgcctgcg tcattaccca gggccacatt cttggccaat tgcaagccat caacaaagcc 6660
cgccaagagt ttctgctacg aaagggacgg ggggtttact tggaccccca gtccggcgag 6720
gagctcaacc caatcccccc gccgccgcag ccctatcagc agcagccgcg ggcccttgct 6780
tcccaggatg gcacccaaaa agaagctgca gctgccgccg ccacccacgg acgaggagga 6840
atactgggac agtcaggcag aggaggtttt ggacgaggag gaggaggaca tgatggaaga 6900
ctgggagagc ctagacgagg aagcttccga ggtcgaagag gtgtcagacg aaacaccgtc 6960
accctcggtc gcattcccct cgccggcgcc ccagaaatcg gcaaccggtt ccagcatggc 7020
tacaacctcc gctcctcagg cgccgccggc actgcccgtt cgccgaccca accgtagatg 7080
ggacaccact ggaaccaggg ccggtaagtc caagcagccg ccgccgttag cccaagagca 7140
acaacagcgc caaggctacc gctcatggcg cgggcacaag aacgccatag ttgcttgctt 7200
gcaagactgt gggggcaaca tctccttcgc ccgccgcttt cttctctacc atcacggcgt 7260
ggccttcccc cgtaacatcc tgcattacta ccgtcatctc tacagcccat actgcaccgg 7320
cggcagcggc agcaacagca gcggccacac agaagcaaag gcgaccggat agcaagactc 7380
tgacaaagcc caagaaatcc acagcggcgg cagcagcagg aggaggagcg ctgcgtctgg 7440
cgcccaacga acccgtatcg acccgcgagc ttagaaacag gatttttccc actctgtatg 7500
ctatatttca acagagcagg ggccaagaac aagagctgaa aataaaaaac aggtctctgc 7560
gatccctcac ccgcagctgc ctgtatcaca aaagcgaaga tcagcttcgg cgcacgctgg 7620
aagacgcgga ggctctcttc agtaaatact gcgcgctgac tcttaaggac tagtttcgcg 7680
ccctttctca aatttaagcg cgaaaactac gtcatctcca gcggccacac ccggcgccag 7740
cacctgttgt cagcgccatt atgagcaagg aaattcccac gccctacatg tggagttacc 7800
agccacaaat gggacttgcg gctggagctg cccaagacta ctcaacccga ataaactaca 7860
tgagcgcggg gcggccgccg tttgtgttat gtttcaacgt gtttattttt caattgcaga 7920
aaatttcaag tcatttttca ttcagtagta tagccccacc accacatagc ttatacagat 7980
caccgtacct taatcaaact cacagaaccc tagtattcaa cctgccacct ccctcccaac 8040
acacagagta cacagtcctt tctccccggc tggccttaaa aagcatcata tcatgggtaa 8100
cagacatatt cttaggtgtt atattccaca cggtttcctg tcgagccaaa cgctcatcag 8160
tgatattaat aaactccccg ggcagctcac ttaagttcat gtcgctgtcc agctgctgag 8220
ccacaggctg ctgtccaact tgcggttgct taacgggcgg cgaaggagaa gtccacgcct 8280
acatgggggt agagtcataa tcgtgcatca ggatagggcg gtggtgctgc agcagcgcgc 8340
gaataaactg ctgccgccgc cgctccgtcc tgcaggaata caacatggca gtggtctcct 8400
cagcgatgat tcgcaccgcc cgcagcataa ggcgccttgt cctccgggca cagcagcgca 8460
ccctgatctc acttaaatca gcacagtaac tgcagcacag caccacaata ttgttcaaaa 8520
tcccacagtg caaggcgctg tatccaaagc tcatggcggg gaccacagaa cccacgtggc 8580
catcatacca caagcgcagg tagattaagt ggcgacccct cataaacacg ctggacataa 8640
acattacctc ttttggcatg ttgtaattca ccacctcccg gtaccatata aacctctgat 8700
taaacatggc gccatccacc accatcctaa accagctggc caaaacctgc ccgccggcta 8760
tacactgcag ggaaccggga ctggaacaat gacagtggag agcccaggac tcgtaaccat 8820
ggatcatcat gctcgtcatg atatcaatgt tggcacaaca caggcacacg tgcatacact 8880
tcctcaggat tacaagctcc tcccgcgtta gaaccatatc ccagggaaca acccattcct 8940
gaatcagcgt aaatcccaca ctgcagggaa gacctcgcac gtaactcacg ttgtgcattg 9000
tcaaagtgtt acattcgggc agcagcggat gatcctccag tatggtagcg cgggtttctg 9060
tctcaaaagg aggtagacga tccctactgt acggagtgcg ccgagacaac cgagatcgtg 9120
ttggtcgtag tgtcatgcca aatggaacgc cggacgtagt catatttcct gaagcaaaac 9180
caggtgcggg cgtgacaaac agatctgcgt ctccggtctc gccgcttaga tcgctctgtg 9240
tagtagttgt agtatatcca ctctctcaaa gcatccaggc gccccctggc ttcgggttct 9300
atgtaaactc cttcatgcgc cgctgccctg ataacatcca ccaccgcaga ataagccaca 9360
cccagccaac ctacacattc gttctgcgag tcacacacgg gaggagcggg aagagctgga 9420
agaaccatgt tttttttttt attccaaaag attatccaaa acctcaaaat gaagatctat 9480
taagtgaacg cgctcccctc cggtggcgtg gtcaaactct acagccaaag aacagataat 9540
ggcatttgta agatgttgca caatggcttc caaaaggcaa acggccctca cgtccaagtg 9600
gacgtaaagg ctaaaccctt cagggtgaat ctcctctata aacattccag caccttcaac 9660
catgcccaaa taattctcat ctcgccacct tctcaatata tctctaagca aatcccgaat 9720
attaagtccg gccattgtaa aaatctgctc cagagcgccc tccaccttca gcctcaagca 9780
gcgaatcatg attgcaaaaa ttcaggttcc tcacagacct gtataagatt caaaagcgga 9840
acattaacaa aaataccgcg atcccgtagg tcccttcgca gggccagctg aacataatcg 9900
tgcaggtctg cacggaccag cgcggccact tccccgccag gaaccatgac aaaagaaccc 9960
acactgatta tgacacgcat actcggagct atgctaacca gcgtagcccc gatgtaagct 10020
tgttgcatgg gcggcgatat aaaatgcaag gtgctgctca aaaaatcagg caaagcctcg 10080
cgcaaaaaag aaagcacatc gtagtcatgc tcatgcagat aaaggcaggt aagctccgga 10140
accaccacag aaaaagacac catttttctc tcaaacatgt ctgcgggttt ctgcataaac 10200
acaaaataaa ataacaaaaa aacatttaaa cattagaagc ctgtcttaca acaggaaaaa 10260
caacccttat aagcataaga cggactacgg ccatgccggc gtgaccgtaa aaaaactggt 10320
caccgtgatt aaaaagcacc accgacagct cctcggtcat gtccggagtc ataatgtaag 10380
actcggtaaa cacatcaggt tgattcacat cggtcagtgc taaaaagcga ccgaaatagc 10440
ccgggggaat acatacccgc aggcgtagag acaacattac agcccccata ggaggtataa 10500
caaaattaat aggagagaaa aacacataaa cacctgaaaa accctcctgc ctaggcaaaa 10560
tagcaccctc ccgctccaga acaacataca gcgcttccac agcggcagcc ataacagtca 10620
gccttaccag taaaaaagaa aacctattaa aaaaacacca ctcgacacgg caccagctca 10680
atcagtcaca gtgtaaaaaa gggccaagtg cagagcgagt atatatagga ctaaaaaatg 10740
acgtaacggt taaagtccac aaaaaacacc cagaaaaccg cacgcgaacc tacgcccaga 10800
aacgaaagcc aaaaaaccca caacttcctc aaatcgtcac ttccgttttc ccacgttacg 10860
tcacttccca ttttaagaaa actacaattc ccaacacata caagttactc cgcccttaat 10920
taaatcggat ccgatatcta gatgtattcg cgaggtaccg agctcgaatt ctctggccgt 10980
cgttttacaa cgtcgtgact gggaaaaccc tggcgttacc caacttaatc gccttgcagc 11040
acatccccct ttcgccagct ggcgtaatag cgaagaggcc cgcaccgatc gcccttccca 11100
acagttgcgc agcctgaatg gcgaatggcg cctgatgcgg tattttctcc ttacgcatct 11160
gtgcggtatt tcacaccgca tatggtgcac tctcagtaca atctgctctg atgccgcata 11220
gttaagccag ccccgacacc cgccaacacc cgctgacgcg ccctgacggg cttgtctgct 11280
cccggcatcc gcttacagac aagctgtgac cgtctccggg agctgcatgt gtcagaggtt 11340
ttcaccgtca tcaccgaaac gcgcga 11366
<210> 95
<211> 11119
<212> DNA
<213> Artificial Sequence
<220>
<223> Description of Artificial Sequence: Synthetic
polynucleotide
<400> 95
tgcagctctg gcccgtgtct caaaatctct gatgttacat tgcacaagat aaaaatatat 60
catcatgaac aataaaactg tctgcttaca taaacagtaa tacaaggggt gttatgagcc 120
atattcaacg ggaaacgtcg aggccgcgat taaattccaa catggatgct gatttatatg 180
ggtataaatg ggctcgcgat aatgtcgggc aatcaggtgc gacaatctat cgcttgtatg 240
ggaagcccga tgcgccagag ttgtttctga aacatggcaa aggtagcgtt gccaatgatg 300
ttacagatga gatggtcaga ctaaactggc tgacggaatt tatgcctctt ccgaccatca 360
agcattttat ccgtactcct gatgatgcat ggttactcac cactgcgatc cccggaaaaa 420
cagcattcca ggtattagaa gaatatcctg attcaggtga aaatattgtt gatgcgctgg 480
cagtgttcct gcgccggttg cattcgattc ctgtttgtaa ttgtcctttt aacagcgatc 540
gcgtatttcg tctcgctcag gcgcaatcac gaatgaataa cggtttggtt gatgcgagtg 600
attttgatga cgagcgtaat ggctggcctg ttgaacaagt ctggaaagaa atgcataaac 660
ttttgccatt ctcaccggat tcagtcgtca ctcatggtga tttctcactt gataacctta 720
tttttgacga ggggaaatta ataggttgta ttgatgttgg acgagtcgga atcgcagacc 780
gataccagga tcttgccatc ctatggaact gcctcggtga gttttctcct tcattacaga 840
aacggctttt tcaaaaatat ggtattgata atcctgatat gaataaattg cagtttcatt 900
tgatgctcga tgagtttttc taatcagaat tggttaattg gttgtaacat tattcagatt 960
gggcttgatt taaaacttca tttttaattt aaaaggatct aggtgaagat cctttttgat 1020
aatctcatga ccaaaatccc ttaacgtgag ttttcgttcc actgagcgtc agaccccgta 1080
gaaaagatca aaggatcttc ttgagatcct ttttttctgc gcgtaatctg ctgcttgcaa 1140
acaaaaaaac caccgctacc agcggtggtt tgtttgccgg atcaagagct accaactctt 1200
tttccgaagg taactggctt cagcagagcg cagataccaa atactgttct tctagtgtag 1260
ccgtagttag gccaccactt caagaactct gtagcaccgc ctacatacct cgctctgcta 1320
atcctgttac cagtggctgc tgccagtggc gataagtcgt gtcttaccgg gttggactca 1380
agacgatagt taccggataa ggcgcagcgg tcgggctgaa cggggggttc gtgcacacag 1440
cccagcttgg agcgaacgac ctacaccgaa ctgagatacc tacagcgtga gctatgagaa 1500
agcgccacgc ttcccgaagg gagaaaggcg gacaggtatc cggtaagcgg cagggtcgga 1560
acaggagagc gcacgaggga gcttccaggg ggaaacgcct ggtatcttta tagtcctgtc 1620
gggtttcgcc acctctgact tgagcgtcga tttttgtgat gctcgtcagg ggggcggagc 1680
ctatggaaaa acgccagcaa cgcggccttt ttacggttcc tggccttttg ctggcctttt 1740
gctcacatgt tctttcctgc gttatcccct gattctgtgg ataaccgtat taccgccttt 1800
gagtgagctg ataccgctcg ccgcagccga acgaccgagc gcagcgagtc agtgagcgag 1860
gaagcggaag agcgcccaat acgcaaaccg cctctccccg cgcgttggcc gattcattaa 1920
tgcagctggc acgacaggtt tcccgactgg aaagcgggca gtgagcgcaa cgcaattaat 1980
gtgagttagc tcactcatta ggcaccccag gctttacact ttatgcttcc ggctcgtatg 2040
ttgtgtggaa ttgtgagcgg ataacaattt cacacaggaa acagctatga ccatgattac 2100
accaagcttg catgcaggcc tatccgtaga tgtacctgga catccaggtg atgccggcgg 2160
cggtggtgga ggcgcgcgga aagtcgcgga cgcggttcca gatgttgcgc agcggcaaaa 2220
agtgctccat ggtcgggacg ctctggccgg tgaggcgtgc gcagtcgttg acgctctaga 2280
ccgtgcaaaa ggagagcctg taagcgggca ctcttccgtg gtctggtgga taaattcgca 2340
agggtatcat ggcggacgac cggggttcga accccggatc cggccgtccg ccgtgatcca 2400
tgcggttacc gcccgcgtgt cgaacccagg tgtgcgacgt cagacaacgg gggagcgctc 2460
cttttggctt ccttccaggc gcggcggctg ctgcgctagc ttttttggcc actggccgcg 2520
cgcggcgtaa gcggttaggc tggaaagcga aagcattaag tggctcgctc cctgtagccg 2580
gagggttatt ttccaagggt tgagtcgcag gacccccggt tcgagtctcg ggccggccgg 2640
actgcggcga acgggggttt gcctccccgt catgcaagac cccgcttgca aattcctccg 2700
gaaacaggga cgagcccctt ttttgctttt cccagatgca tccggtgctg cggcagatgc 2760
gcccccctcc tcagcagcgg caagagcaag agcagcggca gacatgcagg gcaccctccc 2820
cttctcctac cgcgtcagga ggggcaacat cgatccagac atgataagat acattgatga 2880
gtttggacaa accacaacta gaatgcagtg aaaaaaatgc tttatttgtg aaatttgtga 2940
tgctattgct ttatttgtaa ccattataag ctgcaataaa caagtttgta cactctcggg 3000
tgattattta cccccaccct tgccgtctgc gccgtttaaa aatcaaaggg gttctgccgc 3060
gcatcgctat gcgccactgg cagggacacg ttgcgatact ggtgtttagt gctccactta 3120
aactcaggca caaccatccg cggcagctcg gtgaagtttt cactccacag gctgcgcacc 3180
atcaccaacg cgtttagcag gtcgggcgcc gatatcttga agtcgcagtt ggggcctccg 3240
ccctgcgcgc gcgagttgcg atacacaggg ttgcagcact ggaacactat cagcgccggg 3300
tggtgcacgc tggccagcac gctcttgtcg gagatcagat ccgcgtccag gtcctccgcg 3360
ttgctcaggg cgaacggagt caactttggt agctgccttc ccaaaaaggg cgcgtgccca 3420
ggctttgagt tgcactcgca ccgtagtggc atcaaaaggt gaccgtgccc ggtctgggcg 3480
ttaggataca gcgcctgcat aaaagccttg atctgcttaa aagccacctg agcctttgcg 3540
ccttcagaga agaacatgcc gcaagacttg ccggaaaact gattggccgg acaggccgcg 3600
tcgtgcacgc agcaccttgc gtcggtgttg gagatctgca ccacatttcg gccccaccgg 3660
ttcttcacga tcttggcctt gctagactgc tccttcagcg cgcgctgccc gttttcgctc 3720
gtcacatcca tttcaatcac gtgctcctta tttatcataa tgcttccgtg tagacactta 3780
agctcgcctt cgatctcagc gcagcggtgc agccacaacg cgcagcccgt gggctcgtga 3840
tgcttgtagg tcacctctgc aaacgactgc aggtacgcct gcaggaatcg ccccatcatc 3900
gtcacaaagg tcttgttgct ggtgaaggtc agctgcaacc cgcggtgctc ctcgttcagc 3960
caggtcttgc atacggccgc cagagcttcc acttggtcag gcagtagttt gaagttcgcc 4020
tttagatcgt tatccacgtg gtacttgtcc atcagcgcgc gcgcagcctc catgcccttc 4080
tcccacgcag acacgatcgg cacactcagc gggttcatca ccgtaatttc actttccgct 4140
tcgctgggct cttcctcttc ctcttgcgtc cgcataccac gcgccactgg gtcgtcttca 4200
ttcagccgcc gcactgtgcg cttacctcct ttgccatgct tgattagcac cggtgggttg 4260
ctgaaaccca ccatttgtag cgccacatct tctctttctt cctcgctgtc cacgattacc 4320
tctggtgatg gcgggcgctc gggcttggga gaagggcgct tctttttctt cttgggcgca 4380
atggccaaat ccgccgccga ggtcgatggc cgcgggctgg gtgtgcgcgg caccagcgcg 4440
tcttgtgatg agtcttcctc gtcctcggac tcgatacgcc gcctcatccg cttttttggg 4500
ggcgcccggg gaggcggcgg cgacggggac ggggacgaca cgtcctccat ggttggggga 4560
cgtcgcgccg caccgcgtcc gcgctcgggg gtggtttcgc gctgctcctc ttcccgactg 4620
gccatttcct tctcctatag gcagaaaaag atcatggagt cagtcgagaa gaaggacagc 4680
ctaaccgccc cctctgagtt cgccaccacc gcctccaccg atgccgccaa cgcgcctacc 4740
accttccccg tcgaggcacc cccgcttgag gaggaggaag tgattatcga gcaggaccca 4800
ggttttgtaa gcgaagacga cgaggaccgc tcagtaccaa cagaggataa aaagcaagac 4860
caggacaacg cagaggcaaa cgaggaacaa gtcgggcggg gggacgaaag gcatggcgac 4920
tacctagatg tgggagacga cgtgctgttg aagcatctgc agcgccagtg cgccattatc 4980
tgcgacgcgt tgcaagagcg cagcgatgtg cccctcgcca tagcggatgt cagccttgcc 5040
tacgaacgcc acctattctc accgcgcgta ccccccaaac gccaagaaaa cggcacatgc 5100
gagcccaacc cgcgcctcaa cttctacccc gtatttgccg tgccagaggt gcttgccacc 5160
tatcacatct ttttccaaaa ctgcaagata cccctatcct gccgtgccaa ccgcagccga 5220
gcggacaagc agctggcctt gcggcagggc gctgtcatac ctgatatcgc ctcgctcaac 5280
gaagtgccaa aaatctttga gggtcttgga cgcgacgaga agcgcgcggc aaacgctctg 5340
caacaggaaa acagcgaaaa tgaaagtcac tctggagtgt tggtggaact cgagggtgac 5400
aacgcgcgcc tagccgtact aaaacgcagc atcgaggtca cccactttgc ctacccggca 5460
cttaacctac cccccaaggt catgagcaca gtcatgagtg agctgatcgt gcgccgtgcg 5520
cagcccctgg agagggatgc aaatttgcaa gaacaaacag aggagggcct acccgcagtt 5580
ggcgacgagc agctagcgcg ctggcttcaa acgcgcgagc ctgccgactt ggaggagcga 5640
cgcaaactaa tgatggccgc agtgctcgtt accgtggagc ttgagtgcat gcagcggttc 5700
tttgctgacc cggagatgca gcgcaagcta gaggaaacat tgcactacac ctttcgacag 5760
ggctacgtac gccaggcctg caagatctcc aacgtggagc tctgcaacct ggtctcctac 5820
cttggaattt tgcacgaaaa ccgccttggg caaaacgtgc ttcattccac gctcaagggc 5880
gaggcgcgcc gcgactacgt ccgcgactgc gtttacttat ttctatgcta cacctggcag 5940
acggccatgg gcgtttggca gcagtgcttg gaggagtgca acctcaagga gctgcagaaa 6000
ctgctaaagc aaaacttgaa ggacctatgg acggccttca acgagcgctc cgtggccgcg 6060
cacctggcgg acatcatttt ccccgaacgc ctgcttaaaa ccctgcaaca gggtctgcca 6120
gacttcacca gtcaaagcat gttgcagaac tttaggaact ttatcctaga gcgctcagga 6180
atcttgcccg ccacctgctg tgcacttcct agcgactttg tgcccattaa gtaccgcgaa 6240
tgccctccgc cgctttgggg ccactgctac cttctgcagc tagccaacta ccttgcctac 6300
cactctgaca taatggaaga cgtgagcggt gacggtctac tggagtgtca ctgtcgctgc 6360
aacctatgca ccccgcaccg ctccctggtt tgcaattcgc agctgcttaa cgaaagtcaa 6420
attatcggta cctttgagct gcagggtccc tcgcctgacg aaaagtccgc ggctccgggg 6480
ttgaaactca ctccggggct gtggacgtcg gcttaccttc gcaaatttgt acctgaggac 6540
taccacgccc acgagattag gttctacgaa gaccaatccc gcccgcctaa tgcggagctt 6600
accgcctgcg tcattaccca gggccacatt cttggccaat tgcaagccat caacaaagcc 6660
cgccaagagt ttctgctacg aaagggacgg ggggtttact tggaccccca gtccggcgag 6720
gagctcaacc caatcccccc gccgccgcag ccctatcagc agcagccgcg ggcccttgct 6780
tcccaggatg gcacccaaaa agaagctgca gctgccgccg ccacccacgg acgaggagga 6840
atactgggac agtcaggcag aggaggtttt ggacgaggag gaggaggaca tgatggaaga 6900
ctgggagagc ctagacgagg aagcttccga ggtcgaagag gtgtcagacg aaacaccgtc 6960
accctcggtc gcattcccct cgccggcgcc ccagaaatcg gcaaccggtt ccagcatggc 7020
tacaacctcc gctcctcagg cgccgccggc actgcccgtt cgccgaccca accgtagatg 7080
ggacaccact ggaaccaggg ccggtaagtc caagcagccg ccgccgttag cccaagagca 7140
acaacagcgc caaggctacc gctcatggcg cgggcacaag aacgccatag ttgcttgctt 7200
gcaagactgt gggggcaaca tctccttcgc ccgccgcttt cttctctacc atcacggcgt 7260
ggccttcccc cgtaacatcc tgcattacta ccgtcatctc tacagcccat actgcaccgg 7320
cggcagcggc agcaacagca gcggccacac agaagcaaag gcgaccggat agcaagactc 7380
tgacaaagcc caagaaatcc acagcggcgg cagcagcagg aggaggagcg ctgcgtctgg 7440
cgcccaacga acccgtatcg acccgcgagc ttagaaacag gatttttccc actctgtatg 7500
ctatatttca acagagcagg ggccaagaac aagagctgaa aataaaaaac aggtctctgc 7560
gatccctcac ccgcagctgc ctgtatcaca aaagcgaaga tcagcttcgg cgcacgctgg 7620
aagacgcgga ggctctcttc agtaaatact gcgcgctgac tcttaaggac tagtttcgcg 7680
ccctttctca aatttaagcg cgaaaactac gtcatctcca gcggccacac ccggcgccag 7740
cacctgttgt cagcgccatt atgagcaagg aaattcccac gccctacatg tggagttacc 7800
agccacaaat gggacttgcg gctggagctg cccaagacta ctcaacccga ataaactaca 7860
tgagcgcggg gcggccgcaa cttgtttatt gcagcttata atggttacaa ataaagcaat 7920
agcatcacaa atttcacaaa taaagcattt ttttcactgc attctagttg tggtttgtcc 7980
aaactcatca atgtatctta gcttaacggg cggcgaagga gaagtccacg cctacatggg 8040
ggtagagtca taatcgtgca tcaggatagg gcggtggtgc tgcagcagcg cgcgaataaa 8100
ctgctgccgc cgccgctccg tcctgcagga atacaacatg gcagtggtct cctcagcgat 8160
gattcgcacc gcccgcagca taaggcgcct tgtcctccgg gcacagcagc gcaccctgat 8220
ctcacttaaa tcagcacagt aactgcagca cagcaccaca atattgttca aaatcccaca 8280
gtgcaaggcg ctgtatccaa agctcatggc ggggaccaca gaacccacgt ggccatcata 8340
ccacaagcgc aggtagatta agtggcgacc cctcataaac acgctggaca taaacattac 8400
ctcttttggc atgttgtaat tcaccacctc ccggtaccat ataaacctct gattaaacat 8460
ggcgccatcc accaccatcc taaaccagct ggccaaaacc tgcccgccgg ctatacactg 8520
cagggaaccg ggactggaac aatgacagtg gagagcccag gactcgtaac catggatcat 8580
catgctcgtc atgatatcaa tgttggcaca acacaggcac acgtgcatac acttcctcag 8640
gattacaagc tcctcccgcg ttagaaccat atcccaggga acaacccatt cctgaatcag 8700
cgtaaatccc acactgcagg gaagacctcg cacgtaactc acgttgtgca ttgtcaaagt 8760
gttacattcg ggcagcagcg gatgatcctc cagtatggta gcgcgggttt ctgtctcaaa 8820
aggaggtaga cgatccctac tgtacggagt gcgccgagac aaccgagatc gtgttggtcg 8880
tagtgtcatg ccaaatggaa cgccggacgt agtcatattt cctgaagcaa aaccaggtgc 8940
gggcgtgaca aacagatctg cgtctccggt ctcgccgctt agatcgctct gtgtagtagt 9000
tgtagtatat ccactctctc aaagcatcca ggcgccccct ggcttcgggt tctatgtaaa 9060
ctccttcatg cgccgctgcc ctgataacat ccaccaccgc agaataagcc acacccagcc 9120
aacctacaca ttcgttctgc gagtcacaca cgggaggagc gggaagagct ggaagaacca 9180
tgtttttttt tttattccaa aagattatcc aaaacctcaa aatgaagatc tattaagtga 9240
acgcgctccc ctccggtggc gtggtcaaac tctacagcca aagaacagat aatggcattt 9300
gtaagatgtt gcacaatggc ttccaaaagg caaacggccc tcacgtccaa gtggacgtaa 9360
aggctaaacc cttcagggtg aatctcctct ataaacattc cagcaccttc aaccatgccc 9420
aaataattct catctcgcca ccttctcaat atatctctaa gcaaatcccg aatattaagt 9480
ccggccattg taaaaatctg ctccagagcg ccctccacct tcagcctcaa gcagcgaatc 9540
atgattgcaa aaattcaggt tcctcacaga cctgtataag attcaaaagc ggaacattaa 9600
caaaaatacc gcgatcccgt aggtcccttc gcagggccag ctgaacataa tcgtgcaggt 9660
ctgcacggac cagcgcggcc acttccccgc caggaaccat gacaaaagaa cccacactga 9720
ttatgacacg catactcgga gctatgctaa ccagcgtagc cccgatgtaa gcttgttgca 9780
tgggcggcga tataaaatgc aaggtgctgc tcaaaaaatc aggcaaagcc tcgcgcaaaa 9840
aagaaagcac atcgtagtca tgctcatgca gataaaggca ggtaagctcc ggaaccacca 9900
cagaaaaaga caccattttt ctctcaaaca tgtctgcggg tttctgcata aacacaaaat 9960
aaaataacaa aaaaacattt aaacattaga agcctgtctt acaacaggaa aaacaaccct 10020
tataagcata agacggacta cggccatgcc ggcgtgaccg taaaaaaact ggtcaccgtg 10080
attaaaaagc accaccgaca gctcctcggt catgtccgga gtcataatgt aagactcggt 10140
aaacacatca ggttgattca catcggtcag tgctaaaaag cgaccgaaat agcccggggg 10200
aatacatacc cgcaggcgta gagacaacat tacagccccc ataggaggta taacaaaatt 10260
aataggagag aaaaacacat aaacacctga aaaaccctcc tgcctaggca aaatagcacc 10320
ctcccgctcc agaacaacat acagcgcttc cacagcggca gccataacag tcagccttac 10380
cagtaaaaaa gaaaacctat taaaaaaaca ccactcgaca cggcaccagc tcaatcagtc 10440
acagtgtaaa aaagggccaa gtgcagagcg agtatatata ggactaaaaa atgacgtaac 10500
ggttaaagtc cacaaaaaac acccagaaaa ccgcacgcga acctacgccc agaaacgaaa 10560
gccaaaaaac ccacaacttc ctcaaatcgt cacttccgtt ttcccacgtt acgtcacttc 10620
ccattttaag aaaactacaa ttcccaacac atacaagtta ctccgccctt aattaaatcg 10680
gatccgatat ctagatgtat tcgcgaggta ccgagctcga attctctggc cgtcgtttta 10740
caacgtcgtg actgggaaaa ccctggcgtt acccaactta atcgccttgc agcacatccc 10800
cctttcgcca gctggcgtaa tagcgaagag gcccgcaccg atcgcccttc ccaacagttg 10860
cgcagcctga atggcgaatg gcgcctgatg cggtattttc tccttacgca tctgtgcggt 10920
atttcacacc gcatatggtg cactctcagt acaatctgct ctgatgccgc atagttaagc 10980
cagccccgac acccgccaac acccgctgac gcgccctgac gggcttgtct gctcccggca 11040
tccgcttaca gacaagctgt gaccgtctcc gggagctgca tgtgtcagag gttttcaccg 11100
tcatcaccga aacgcgcga 11119
<210> 96
<211> 11216
<212> DNA
<213> Artificial Sequence
<220>
<223> Description of Artificial Sequence: Synthetic polynucleotide
<400> 96
tgcagctctg gcccgtgtct caaaatctct gatgttacat tgcacaagat aaaaatatat 60
catcatgaac aataaaactg tctgcttaca taaacagtaa tacaaggggt gttatgagcc 120
atattcaacg ggaaacgtcg aggccgcgat taaattccaa catggatgct gatttatatg 180
ggtataaatg ggctcgcgat aatgtcgggc aatcaggtgc gacaatctat cgcttgtatg 240
ggaagcccga tgcgccagag ttgtttctga aacatggcaa aggtagcgtt gccaatgatg 300
ttacagatga gatggtcaga ctaaactggc tgacggaatt tatgcctctt ccgaccatca 360
agcattttat ccgtactcct gatgatgcat ggttactcac cactgcgatc cccggaaaaa 420
cagcattcca ggtattagaa gaatatcctg attcaggtga aaatattgtt gatgcgctgg 480
cagtgttcct gcgccggttg cattcgattc ctgtttgtaa ttgtcctttt aacagcgatc 540
gcgtatttcg tctcgctcag gcgcaatcac gaatgaataa cggtttggtt gatgcgagtg 600
attttgatga cgagcgtaat ggctggcctg ttgaacaagt ctggaaagaa atgcataaac 660
ttttgccatt ctcaccggat tcagtcgtca ctcatggtga tttctcactt gataacctta 720
tttttgacga ggggaaatta ataggttgta ttgatgttgg acgagtcgga atcgcagacc 780
gataccagga tcttgccatc ctatggaact gcctcggtga gttttctcct tcattacaga 840
aacggctttt tcaaaaatat ggtattgata atcctgatat gaataaattg cagtttcatt 900
tgatgctcga tgagtttttc taatcagaat tggttaattg gttgtaacat tattcagatt 960
gggcttgatt taaaacttca tttttaattt aaaaggatct aggtgaagat cctttttgat 1020
aatctcatga ccaaaatccc ttaacgtgag ttttcgttcc actgagcgtc agaccccgta 1080
gaaaagatca aaggatcttc ttgagatcct ttttttctgc gcgtaatctg ctgcttgcaa 1140
acaaaaaaac caccgctacc agcggtggtt tgtttgccgg atcaagagct accaactctt 1200
tttccgaagg taactggctt cagcagagcg cagataccaa atactgttct tctagtgtag 1260
ccgtagttag gccaccactt caagaactct gtagcaccgc ctacatacct cgctctgcta 1320
atcctgttac cagtggctgc tgccagtggc gataagtcgt gtcttaccgg gttggactca 1380
agacgatagt taccggataa ggcgcagcgg tcgggctgaa cggggggttc gtgcacacag 1440
cccagcttgg agcgaacgac ctacaccgaa ctgagatacc tacagcgtga gctatgagaa 1500
agcgccacgc ttcccgaagg gagaaaggcg gacaggtatc cggtaagcgg cagggtcgga 1560
acaggagagc gcacgaggga gcttccaggg ggaaacgcct ggtatcttta tagtcctgtc 1620
gggtttcgcc acctctgact tgagcgtcga tttttgtgat gctcgtcagg ggggcggagc 1680
ctatggaaaa acgccagcaa cgcggccttt ttacggttcc tggccttttg ctggcctttt 1740
gctcacatgt tctttcctgc gttatcccct gattctgtgg ataaccgtat taccgccttt 1800
gagtgagctg ataccgctcg ccgcagccga acgaccgagc gcagcgagtc agtgagcgag 1860
gaagcggaag agcgcccaat acgcaaaccg cctctccccg cgcgttggcc gattcattaa 1920
tgcagctggc acgacaggtt tcccgactgg aaagcgggca gtgagcgcaa cgcaattaat 1980
gtgagttagc tcactcatta ggcaccccag gctttacact ttatgcttcc ggctcgtatg 2040
ttgtgtggaa ttgtgagcgg ataacaattt cacacaggaa acagctatga ccatgattac 2100
accaagcttg catgcaggcc tatccgtaga tgtacctgga catccaggtg atgccggcgg 2160
cggtggtgga ggcgcgcgga aagtcgcgga cgcggttcca gatgttgcgc agcggcaaaa 2220
agtgctccat ggtcgggacg ctctggccgg tgaggcgtgc gcagtcgttg acgctctaga 2280
ccgtgcaaaa ggagagcctg taagcgggca ctcttccgtg gtctggtgga taaattcgca 2340
agggtatcat ggcggacgac cggggttcga accccggatc cggccgtccg ccgtgatcca 2400
tgcggttacc gcccgcgtgt cgaacccagg tgtgcgacgt cagacaacgg gggagcgctc 2460
cttttggctt ccttccaggc gcggcggctg ctgcgctagc ttttttggcc actggccgcg 2520
cgcggcgtaa gcggttaggc tggaaagcga aagcattaag tggctcgctc cctgtagccg 2580
gagggttatt ttccaagggt tgagtcgcag gacccccggt tcgagtctcg ggccggccgg 2640
actgcggcga acgggggttt gcctccccgt catgcaagac cccgcttgca aattcctccg 2700
gaaacaggga cgagcccctt ttttgctttt cccagatgca tccggtgctg cggcagatgc 2760
gcccccctcc tcagcagcgg caagagcaag agcagcggca gacatgcagg gcaccctccc 2820
cttctcctac cgcgtcagga ggggcaacat cgatccagac atgataagat acattgatga 2880
gtttggacaa accacaacta gaatgcagtg aaaaaaatgc tttatttgtg aaatttgtga 2940
tgctattgct ttatttgtaa ccattataag ctgcaataaa caagtttgta cactctcggg 3000
tgattattta cccccaccct tgccgtctgc gccgtttaaa aatcaaaggg gttctgccgc 3060
gcatcgctat gcgccactgg cagggacacg ttgcgatact ggtgtttagt gctccactta 3120
aactcaggca caaccatccg cggcagctcg gtgaagtttt cactccacag gctgcgcacc 3180
atcaccaacg cgtttagcag gtcgggcgcc gatatcttga agtcgcagtt ggggcctccg 3240
ccctgcgcgc gcgagttgcg atacacaggg ttgcagcact ggaacactat cagcgccggg 3300
tggtgcacgc tggccagcac gctcttgtcg gagatcagat ccgcgtccag gtcctccgcg 3360
ttgctcaggg cgaacggagt caactttggt agctgccttc ccaaaaaggg cgcgtgccca 3420
ggctttgagt tgcactcgca ccgtagtggc atcaaaaggt gaccgtgccc ggtctgggcg 3480
ttaggataca gcgcctgcat aaaagccttg atctgcttaa aagccacctg agcctttgcg 3540
ccttcagaga agaacatgcc gcaagacttg ccggaaaact gattggccgg acaggccgcg 3600
tcgtgcacgc agcaccttgc gtcggtgttg gagatctgca ccacatttcg gccccaccgg 3660
ttcttcacga tcttggcctt gctagactgc tccttcagcg cgcgctgccc gttttcgctc 3720
gtcacatcca tttcaatcac gtgctcctta tttatcataa tgcttccgtg tagacactta 3780
agctcgcctt cgatctcagc gcagcggtgc agccacaacg cgcagcccgt gggctcgtga 3840
tgcttgtagg tcacctctgc aaacgactgc aggtacgcct gcaggaatcg ccccatcatc 3900
gtcacaaagg tcttgttgct ggtgaaggtc agctgcaacc cgcggtgctc ctcgttcagc 3960
caggtcttgc atacggccgc cagagcttcc acttggtcag gcagtagttt gaagttcgcc 4020
tttagatcgt tatccacgtg gtacttgtcc atcagcgcgc gcgcagcctc catgcccttc 4080
tcccacgcag acacgatcgg cacactcagc gggttcatca ccgtaatttc actttccgct 4140
tcgctgggct cttcctcttc ctcttgcgtc cgcataccac gcgccactgg gtcgtcttca 4200
ttcagccgcc gcactgtgcg cttacctcct ttgccatgct tgattagcac cggtgggttg 4260
ctgaaaccca ccatttgtag cgccacatct tctctttctt cctcgctgtc cacgattacc 4320
tctggtgatg gcgggcgctc gggcttggga gaagggcgct tctttttctt cttgggcgca 4380
atggccaaat ccgccgccga ggtcgatggc cgcgggctgg gtgtgcgcgg caccagcgcg 4440
tcttgtgatg agtcttcctc gtcctcggac tcgatacgcc gcctcatccg cttttttggg 4500
ggcgcccggg gaggcggcgg cgacggggac ggggacgaca cgtcctccat ggttggggga 4560
cgtcgcgccg caccgcgtcc gcgctcgggg gtggtttcgc gctgctcctc ttcccgactg 4620
gccatttcct tctcctatag gcagaaaaag atcatggagt cagtcgagaa gaaggacagc 4680
ctaaccgccc cctctgagtt cgccaccacc gcctccaccg atgccgccaa cgcgcctacc 4740
accttccccg tcgaggcacc cccgcttgag gaggaggaag tgattatcga gcaggaccca 4800
ggttttgtaa gcgaagacga cgaggaccgc tcagtaccaa cagaggataa aaagcaagac 4860
caggacaacg cagaggcaaa cgaggaacaa gtcgggcggg gggacgaaag gcatggcgac 4920
tacctagatg tgggagacga cgtgctgttg aagcatctgc agcgccagtg cgccattatc 4980
tgcgacgcgt tgcaagagcg cagcgatgtg cccctcgcca tagcggatgt cagccttgcc 5040
tacgaacgcc acctattctc accgcgcgta ccccccaaac gccaagaaaa cggcacatgc 5100
gagcccaacc cgcgcctcaa cttctacccc gtatttgccg tgccagaggt gcttgccacc 5160
tatcacatct ttttccaaaa ctgcaagata cccctatcct gccgtgccaa ccgcagccga 5220
gcggacaagc agctggcctt gcggcagggc gctgtcatac ctgatatcgc ctcgctcaac 5280
gaagtgccaa aaatctttga gggtcttgga cgcgacgaga agcgcgcggc aaacgctctg 5340
caacaggaaa acagcgaaaa tgaaagtcac tctggagtgt tggtggaact cgagggtgac 5400
aacgcgcgcc tagccgtact aaaacgcagc atcgaggtca cccactttgc ctacccggca 5460
cttaacctac cccccaaggt catgagcaca gtcatgagtg agctgatcgt gcgccgtgcg 5520
cagcccctgg agagggatgc aaatttgcaa gaacaaacag aggagggcct acccgcagtt 5580
ggcgacgagc agctagcgcg ctggcttcaa acgcgcgagc ctgccgactt ggaggagcga 5640
cgcaaactaa tgatggccgc agtgctcgtt accgtggagc ttgagtgcat gcagcggttc 5700
tttgctgacc cggagatgca gcgcaagcta gaggaaacat tgcactacac ctttcgacag 5760
ggctacgtac gccaggcctg caagatctcc aacgtggagc tctgcaacct ggtctcctac 5820
cttggaattt tgcacgaaaa ccgccttggg caaaacgtgc ttcattccac gctcaagggc 5880
gaggcgcgcc gcgactacgt ccgcgactgc gtttacttat ttctatgcta cacctggcag 5940
acggccatgg gcgtttggca gcagtgcttg gaggagtgca acctcaagga gctgcagaaa 6000
ctgctaaagc aaaacttgaa ggacctatgg acggccttca acgagcgctc cgtggccgcg 6060
cacctggcgg acatcatttt ccccgaacgc ctgcttaaaa ccctgcaaca gggtctgcca 6120
gacttcacca gtcaaagcat gttgcagaac tttaggaact ttatcctaga gcgctcagga 6180
atcttgcccg ccacctgctg tgcacttcct agcgactttg tgcccattaa gtaccgcgaa 6240
tgccctccgc cgctttgggg ccactgctac cttctgcagc tagccaacta ccttgcctac 6300
cactctgaca taatggaaga cgtgagcggt gacggtctac tggagtgtca ctgtcgctgc 6360
aacctatgca ccccgcaccg ctccctggtt tgcaattcgc agctgcttaa cgaaagtcaa 6420
attatcggta cctttgagct gcagggtccc tcgcctgacg aaaagtccgc ggctccgggg 6480
ttgaaactca ctccggggct gtggacgtcg gcttaccttc gcaaatttgt acctgaggac 6540
taccacgccc acgagattag gttctacgaa gaccaatccc gcccgcctaa tgcggagctt 6600
accgcctgcg tcattaccca gggccacatt cttggccaat tgcaagccat caacaaagcc 6660
cgccaagagt ttctgctacg aaagggacgg ggggtttact tggaccccca gtccggcgag 6720
gagctcaacc caatcccccc gccgccgcag ccctatcagc agcagccgcg ggcccttgct 6780
tcccaggatg gcacccaaaa agaagctgca gctgccgccg ccacccacgg acgaggagga 6840
atactgggac agtcaggcag aggaggtttt ggacgaggag gaggaggaca tgatggaaga 6900
ctgggagagc ctagacgagg aagcttccga ggtcgaagag gtgtcagacg aaacaccgtc 6960
accctcggtc gcattcccct cgccggcgcc ccagaaatcg gcaaccggtt ccagcatggc 7020
tacaacctcc gctcctcagg cgccgccggc actgcccgtt cgccgaccca accgtagatg 7080
ggacaccact ggaaccaggg ccggtaagtc caagcagccg ccgccgttag cccaagagca 7140
acaacagcgc caaggctacc gctcatggcg cgggcacaag aacgccatag ttgcttgctt 7200
gcaagactgt gggggcaaca tctccttcgc ccgccgcttt cttctctacc atcacggcgt 7260
ggccttcccc cgtaacatcc tgcattacta ccgtcatctc tacagcccat actgcaccgg 7320
cggcagcggc agcaacagca gcggccacac agaagcaaag gcgaccggat agcaagactc 7380
tgacaaagcc caagaaatcc acagcggcgg cagcagcagg aggaggagcg ctgcgtctgg 7440
cgcccaacga acccgtatcg acccgcgagc ttagaaacag gatttttccc actctgtatg 7500
ctatatttca acagagcagg ggccaagaac aagagctgaa aataaaaaac aggtctctgc 7560
gatccctcac ccgcagctgc ctgtatcaca aaagcgaaga tcagcttcgg cgcacgctgg 7620
aagacgcgga ggctctcttc agtaaatact gcgcgctgac tcttaaggac tagtttcgcg 7680
ccctttctca aatttaagcg cgaaaactac gtcatctcca gcggccacac ccggcgccag 7740
cacctgttgt cagcgccatt atgagcaagg aaattcccac gccctacatg tggagttacc 7800
agccacaaat gggacttgcg gctggagctg cccaagacta ctcaacccga ataaactaca 7860
tgagcgcggg gcggccgcaa cttgtttatt gcagcttata atggttacaa ataaagcaat 7920
agcatcacaa atttcacaaa taaagcattt ttttcactgc attctagttg tggtttgtcc 7980
aaactcatca atgtatctta gcttaacggg cggcgaagga gaagtccacg cctacatggg 8040
ggtagagtca taatcgtgca tcaggatagg gcggtggtgc tgcagcagcg cgcgaataaa 8100
ctgctgccgc cgccgctccg tcctgcagga atacaacatg gcagtggtct cctcagcgat 8160
gattcgcacc gcccgcagca taaggcgcct tgtcctccgg gcacagcagc gcaccctgat 8220
ctcacttaaa tcagcacagt aactgcagca cagcaccaca atattgttca aaatcccaca 8280
gtgcaaggcg ctgtatccaa agctcatggc ggggaccaca gaacccacgt ggccatcata 8340
ccacaagcgc aggtagatta agtggcgacc cctcataaac acgctggaca taaacattac 8400
ctcttttggc atgttgtaat tcaccacctc ccggtaccat ataaacctct gattaaacat 8460
ggcgccatcc accaccatcc taaaccagct ggccaaaacc tgcccgccgg ctatacactg 8520
cagggaaccg ggactggaac aatgacagtg gagagcccag gactcgtaac catggatcat 8580
catgctcgtc atgatatcaa tgttggcaca acacaggcac acgtgcatac acttcctcag 8640
gattacaagc tcctcccgcg ttagaaccat atcccaggga acaacccatt cctgaatcag 8700
cgtaaatccc acactgcagg gaagacctcg cacgtaactc acgttgtgca ttgtcaaagt 8760
gttacattcg ggcagcagcg gatgatcctc cagtatggta gcgcgggttt ctgtctcaaa 8820
aggaggtaga cgatccctac tgtacggagt gcgccgagac aaccgagatc gtgttggtcg 8880
tagtgtcatg ccaaatggaa cgccggacgt agtcatattt cctgaagcaa aaccaggtgc 8940
gggcgtgaca aacagatctg cgtctccggt ctcgccgctt agatcgctct gtgtagtagt 9000
tgtagtatat ccactctctc aaagcatcca ggcgccccct ggcttcgggt tctatgtaaa 9060
ctccttcatg cgccgctgcc ctgataacat ccaccaccgc agaataagcc acacccagcc 9120
aacctacaca ttcgttctgc gagtcacaca cgggaggagc gggaagagct ggaagaacca 9180
tgtttttttt tttattccaa aagattatcc aaaacctcaa aatgaagatc tattaagtga 9240
acgcgctccc ctccggtggc gtggtcaaac tctacagcca aagaacagat aatggcattt 9300
gtaagatgtt gcacaatggc ttccaaaagg caaacggccc tcacgtccaa gtggacgtaa 9360
aggctaaacc cttcagggtg aatctcctct ataaacattc cagcaccttc aaccatgccc 9420
aaataattct catctcgcca ccttctcaat atatctctaa gcaaatcccg aatattaagt 9480
ccggccattg taaaaatctg ctccagagcg ccctccacct tcagcctcaa gcagcgaatc 9540
atgattgcaa aaattcaggt tcctcacaga cctgtataag attcaaaagc ggaacattaa 9600
caaaaatacc gcgatcccgt aggtcccttc gcagggccag ctgaacataa tcgtgcaggt 9660
ctgcacggac cagcgcggcc acttccccgc caggaaccat gacaaaagaa cccacactga 9720
ttatgacacg catactcgga gctatgctaa ccagcgtagc cccgatgtaa gcttgttgca 9780
tgggcggcga tataaaatgc aaggtgctgc tcaaaaaatc aggcaaagcc tcgcgcaaaa 9840
aagaaagcac atcgtagtca tgctcatgca gataaaggca ggtaagctcc ggaaccacca 9900
cagaaaaaga caccattttt ctctcaaaca tgtctgcggg tttctgcata aacacaaaat 9960
aaaataacaa aaaaacattt aaacattaga agcctgtctt acaacaggaa aaacaaccct 10020
tataagcata agacggacta cggccatgcc ggcgtgaccg taaaaaaact ggtcaccgtg 10080
attaaaaagc accaccgaca gctcctcggt catgtccgga gtcataatgt aagactcggt 10140
aaacacatca ggttgattca catcggtcag tgctaaaaag cgaccgaaat agcccggggg 10200
aatacatacc cgcaggcgta gagacaacat tacagccccc ataggaggta taacaaaatt 10260
aataggagag aaaaacacat aaacacctga aaaaccctcc tgcctaggca aaatagcacc 10320
ctcccgctcc agaacaacat acagcgcttc cacagcggca gccatggtgg catttgcaaa 10380
agcctaggcc tccaaaaaag cctcctcact acttctggaa tagctcagag gccgaggcgg 10440
cctcggcctc tgcataaata aaaaaaatta gtcagccatg gggcggagaa tgggcggaac 10500
tgggcggagt taggggcggg atgggcggag ttaggggcgg gactatggtt gctgactaat 10560
tgagatgcat gctttgcata cttctgcctg ctggggagcc tggggacttt ccacacctgg 10620
ttgctgacta attgagatgc atgctttgca tacttctgcc tgctggggag cctggggact 10680
ttccacaccc taactgacac acacgttacg tcacttccca ttttaagaaa actacaattc 10740
ccaacacata caagttactc cgcccttaat taaatcggat ccgatatcta gatgtattcg 10800
cgaggtaccg agctcgaatt ctctggccgt cgttttacaa cgtcgtgact gggaaaaccc 10860
tggcgttacc caacttaatc gccttgcagc acatccccct ttcgccagct ggcgtaatag 10920
cgaagaggcc cgcaccgatc gcccttccca acagttgcgc agcctgaatg gcgaatggcg 10980
cctgatgcgg tattttctcc ttacgcatct gtgcggtatt tcacaccgca tatggtgcac 11040
tctcagtaca atctgctctg atgccgcata gttaagccag ccccgacacc cgccaacacc 11100
cgctgacgcg ccctgacggg cttgtctgct cccggcatcc gcttacagac aagctgtgac 11160
cgtctccggg agctgcatgt gtcagaggtt ttcaccgtca tcaccgaaac gcgcga 11216
<210> 97
<211> 11883
<212> DNA
<213> Artificial Sequence
<220>
<223> Description of Artificial Sequence: Synthetic polynucleotide
<400> 97
tgcagctctg gcccgtgtct caaaatctct gatgttacat tgcacaagat aaaaatatat 60
catcatgaac aataaaactg tctgcttaca taaacagtaa tacaaggggt gttatgagcc 120
atattcaacg ggaaacgtcg aggccgcgat taaattccaa catggatgct gatttatatg 180
ggtataaatg ggctcgcgat aatgtcgggc aatcaggtgc gacaatctat cgcttgtatg 240
ggaagcccga tgcgccagag ttgtttctga aacatggcaa aggtagcgtt gccaatgatg 300
ttacagatga gatggtcaga ctaaactggc tgacggaatt tatgcctctt ccgaccatca 360
agcattttat ccgtactcct gatgatgcat ggttactcac cactgcgatc cccggaaaaa 420
cagcattcca ggtattagaa gaatatcctg attcaggtga aaatattgtt gatgcgctgg 480
cagtgttcct gcgccggttg cattcgattc ctgtttgtaa ttgtcctttt aacagcgatc 540
gcgtatttcg tctcgctcag gcgcaatcac gaatgaataa cggtttggtt gatgcgagtg 600
attttgatga cgagcgtaat ggctggcctg ttgaacaagt ctggaaagaa atgcataaac 660
ttttgccatt ctcaccggat tcagtcgtca ctcatggtga tttctcactt gataacctta 720
tttttgacga ggggaaatta ataggttgta ttgatgttgg acgagtcgga atcgcagacc 780
gataccagga tcttgccatc ctatggaact gcctcggtga gttttctcct tcattacaga 840
aacggctttt tcaaaaatat ggtattgata atcctgatat gaataaattg cagtttcatt 900
tgatgctcga tgagtttttc taatcagaat tggttaattg gttgtaacat tattcagatt 960
gggcttgatt taaaacttca tttttaattt aaaaggatct aggtgaagat cctttttgat 1020
aatctcatga ccaaaatccc ttaacgtgag ttttcgttcc actgagcgtc agaccccgta 1080
gaaaagatca aaggatcttc ttgagatcct ttttttctgc gcgtaatctg ctgcttgcaa 1140
acaaaaaaac caccgctacc agcggtggtt tgtttgccgg atcaagagct accaactctt 1200
tttccgaagg taactggctt cagcagagcg cagataccaa atactgttct tctagtgtag 1260
ccgtagttag gccaccactt caagaactct gtagcaccgc ctacatacct cgctctgcta 1320
atcctgttac cagtggctgc tgccagtggc gataagtcgt gtcttaccgg gttggactca 1380
agacgatagt taccggataa ggcgcagcgg tcgggctgaa cggggggttc gtgcacacag 1440
cccagcttgg agcgaacgac ctacaccgaa ctgagatacc tacagcgtga gctatgagaa 1500
agcgccacgc ttcccgaagg gagaaaggcg gacaggtatc cggtaagcgg cagggtcgga 1560
acaggagagc gcacgaggga gcttccaggg ggaaacgcct ggtatcttta tagtcctgtc 1620
gggtttcgcc acctctgact tgagcgtcga tttttgtgat gctcgtcagg ggggcggagc 1680
ctatggaaaa acgccagcaa cgcggccttt ttacggttcc tggccttttg ctggcctttt 1740
gctcacatgt tctttcctgc gttatcccct gattctgtgg ataaccgtat taccgccttt 1800
gagtgagctg ataccgctcg ccgcagccga acgaccgagc gcagcgagtc agtgagcgag 1860
gaagcggaag agcgcccaat acgcaaaccg cctctccccg cgcgttggcc gattcattaa 1920
tgcagctggc acgacaggtt tcccgactgg aaagcgggca gtgagcgcaa cgcaattaat 1980
gtgagttagc tcactcatta ggcaccccag gctttacact ttatgcttcc ggctcgtatg 2040
ttgtgtggaa ttgtgagcgg ataacaattt cacacaggaa acagctatga ccatgattac 2100
accaagcttg catgcaggcc tctgcagtcg accagaagca ccatgtcctt gggtccggcc 2160
tgctgaatgc gcaggcggtc ggccatgccc caggcttcgt tttgacatcg gcgcaggtct 2220
ttgtagtagt cttgcatgag cctttctacc ggcacttctt cttctccttc ctcttgtcct 2280
gcatctcttg catctatcgc tgcggcggcg gcggagtttg gccgtaggtg gcgccctctt 2340
cctcccatgc gtgtgacccc gaagcccctc atcggctgaa gcagggctag gtcggcgaca 2400
acgcgctcgg ctaatatggc ctgctgcacc tgcgtgaggg tagactggaa gtcatccatg 2460
tccacaaagc ggtggtatgc gcccgtgttg atggtgtaag tgcagttggc cataacggac 2520
cagttaacgg tctggtgacc cggctgcgag agctcggtgt acctgagacg cgagtaagcc 2580
ctcgagtcaa atacgtagtc gttgcaagtc cgcaccaggt actggtatcc caccaaaaag 2640
tgcggcggcg gctggcggta gaggggccag cgtagggtgg ccggggctcc gggggcgaga 2700
tcttccaaca taaggcgatg atatccgtag atgtacctgg acatccaggt gatgccggcg 2760
gcggtggtgg aggcgcgcgg aaagtcgcgg acgcggttcc agatgttgcg cagcggcaaa 2820
aagtgctcca tggtcgggac gctctggccg gtcaggcgcg cgcaatcgtt gacgctctag 2880
cgtgcaaaag gagagcctgt aagcgggcac tcttccgtgg tctggtggat aaattcgcaa 2940
gggtatcatg gcggacgacc ggggttcgag ccccgtatcc ggccgtccgc cgtgatccat 3000
gcggttaccg cccgcgtgtc gaacccaggt gtgcgacgtc agacaacggg ggagtgctcc 3060
ttttggcttc cttccaggcg cggcggctgc tgcgctagct tttttggcca ctggccgcgc 3120
gcagcgtaag cggttaggct ggaaagcgaa agcattaagt ggctcgctcc ctgtagccgg 3180
agggttattt tccaagggtt gagtcgcggg acccccggtt cgagtctcgg accgagactg 3240
ggggcgtaca ctggatggcc tttgcctgga acccgcactc aaaaacatgc tacctctttg 3300
agccctttgg cttttctgac cagcgactca agcaggttta ccagtttgag tacgagtcac 3360
tcctgcgccg tagcgccatt gcttcttccc ccgaccgctg tataacgctg gaaaagtcca 3420
cccaaagcgt acaggggccc aactcggccg cctgtggact attctgctgc atgtttctcc 3480
acgcctttgc caactggccc caaactccca tggatcacaa ccccaccatg aaccttatta 3540
ccggggtacc caactccatg ctcaacagtc cccaggtaca gcccaccctg cgtcgcaacc 3600
aggaacagct ctacagcttc ctggagcgcc actcgcccta cttccgcagc cacagtgcgc 3660
agattaggag cgccacttct ttttgtcact tgaaaaacat gtaaaaataa tgtactagag 3720
acactttcaa taaaggcaaa tgcttttatt tgtacactct cgggtgatta tttaccccca 3780
cccttgccgt ctgcgccgtt taaaaatcaa aggggttctg ccgcgcatcg ctatgcgcca 3840
ctggcaggga cacgttgcga tactggtgtt tagtgctcca cttaaactca ggcacaacca 3900
tccgcggcag ctcggtgaag ttttcactcc acaggctgcg caccatcacc aacgcgttta 3960
gcaggtcggg cgccgatatc ttgaagtcgc agttggggcc tccgccctgc gcgcgcgagt 4020
tgcgatacac agggttgcag cactggaaca ctatcagcgc cgggtggtgc acgctggcca 4080
gcacgctctt gtcggagatc agatccgcgt ccaggtcctc cgcgttgctc agggcgaacg 4140
gagtcaactt tggtagctgc cttcccaaaa agggcgcgtg cccaggcttt gagttgcact 4200
cgcaccgtag tggcatcaaa aggtgaccgt gcccggtctg ggcgttagga tacagcgcct 4260
gcataaaagc cttgatctgc ttaaaagcca cctgagcctt tgcgccttca gagaagaaca 4320
tgccgcaaga cttgccggaa aactgattgg ccggacaggc cgcgtcgtgc acgcagcacc 4380
ttgcgtcggt gttggagatc tgcaccacat ttcggcccca ccggttcttc acgatcttgg 4440
ccttgctaga ctgctccttc agcgcgcgct gcccgttttc gctcgtcaca tccatttcaa 4500
tcacgtgctc cttatttatc ataatgcttc cgtgtagaca cttaagctcg ccttcgatct 4560
cagcgcagcg gtgcagccac aacgcgcagc ccgtgggctc gtgatgcttg taggtcacct 4620
ctgcaaacga ctgcaggtac gcctgcagga atcgccccat catcgtcaca aaggtcttgt 4680
tgctggtgaa ggtcagctgc aacccgcggt gctcctcgtt cagccaggtc ttgcatacgg 4740
ccgccagagc ttccacttgg tcaggcagta gtttgaagtt cgcctttaga tcgttatcca 4800
cgtggtactt gtccatcagc gcgcgcgcag cctccatgcc cttctcccac gcagacacga 4860
tcggcacact cagcgggttc atcaccgtaa tttcactttc cgcttcgctg ggctcttcct 4920
cttcctcttg cgtccgcata ccacgcgcca ctgggtcgtc ttcattcagc cgccgcactg 4980
tgcgcttacc tcctttgcca tgcttgatta gcaccggtgg gttgctgaaa cccaccattt 5040
gtagcgccac atcttctctt tcttcctcgc tgtccacgat tacctctggt gatggcgggc 5100
gctcgggctt gggagaaggg cgcttctttt tcttcttggg cgcaatggcc aaatccgccg 5160
ccgaggtcga tggccgcggg ctgggtgtgc gcggcaccag cgcgtcttgt gatgagtctt 5220
cctcgtcctc ggactcgata cgccgcctca tccgcttttt tgggggcgcc cggggaggcg 5280
gcggcgacgg ggacggggac gacacgtcct ccatggttgg gggacgtcgc gccgcaccgc 5340
gtccgcgctc gggggtggtt tcgcgctgct cctcttcccg actggccatt tccttctcct 5400
ataggcagaa aaagatcatg gagtcagtcg agaagaagga cagcctaacc gccccctctg 5460
agttcgccac caccgcctcc accgatgccg ccaacgcgcc taccaccttc cccgtcgagg 5520
cacccccgct tgaggaggag gaagtgatta tcgagcagga cccaggtttt gtaagcgaag 5580
acgacgagga ccgctcagta ccaacagagg ataaaaagca agaccaggac aacgcagagg 5640
caaacgagga acaagtcggg cggggggacg aaaggcatgg cgactaccta gatgtgggag 5700
acgacgtgct gttgaagcat ctgcagcgcc agtgcgccat tatctgcgac gcgttgcaag 5760
agcgcagcga tgtgcccctc gccatagcgg atgtcagcct tgcctacgaa cgccacctat 5820
tctcaccgcg cgtacccccc aaacgccaag aaaacggcac atgcgagccc aacccgcgcc 5880
tcaacttcta ccccgtattt gccgtgccag aggtgcttgc cacctatcac atctttttcc 5940
aaaactgcaa gataccccta tcctgccgtg ccaaccgcag ccgagcggac aagcagctgg 6000
ccttgcggca gggcgctgtc atacctgata tcgcctcgct caacgaagtg ccaaaaatct 6060
ttgagggtct tggacgcgac gagaagcgcg cggcaaacgc tctgcaacag gaaaacagcg 6120
aaaatgaaag tcactctgga gtgttggtgg aactcgaggg tgacaacgcg cgcctagccg 6180
tactaaaacg cagcatcgag gtcacccact ttgcctaccc ggcacttaac ctacccccca 6240
aggtcatgag cacagtcatg agtgagctga tcgtgcgccg tgcgcagccc ctggagaggg 6300
atgcaaattt gcaagaacaa acagaggagg gcctacccgc agttggcgac gagcagctag 6360
cgcgctggct tcaaacgcgc gagcctgccg acttggagga gcgacgcaaa ctaatgatgg 6420
ccgcagtgct cgttaccgtg gagcttgagt gcatgcagcg gttctttgct gacccggaga 6480
tgcagcgcaa gctagaggaa acattgcact acacctttcg acagggctac gtacgccagg 6540
cctgcaagat ctccaacgtg gagctctgca acctggtctc ctaccttgga attttgcacg 6600
aaaaccgcct tgggcaaaac gtgcttcatt ccacgctcaa gggcgaggcg cgccgcgact 6660
acgtccgcga ctgcgtttac ttatttctat gctacacctg gcagacggcc atgggcgttt 6720
ggcagcagtg cttggaggag tgcaacctca aggagctgca gaaactgcta aagcaaaact 6780
tgaaggacct atggacggcc ttcaacgagc gctccgtggc cgcgcacctg gcggacatca 6840
ttttccccga acgcctgctt aaaaccctgc aacagggtct gccagacttc accagtcaaa 6900
gcatgttgca gaactttagg aactttatcc tagagcgctc aggaatcttg cccgccacct 6960
gctgtgcact tcctagcgac tttgtgccca ttaagtaccg cgaatgccct ccgccgcttt 7020
ggggccactg ctaccttctg cagctagcca actaccttgc ctaccactct gacataatgg 7080
aagacgtgag cggtgacggt ctactggagt gtcactgtcg ctgcaaccta tgcaccccgc 7140
accgctccct ggtttgcaat tcgcagctgc ttaacgaaag tcaaattatc ggtacctttg 7200
agctgcaggg tccctcgcct gacgaaaagt ccgcggctcc ggggttgaaa ctcactccgg 7260
ggctgtggac gtcggcttac cttcgcaaat ttgtacctga ggactaccac gcccacgaga 7320
ttaggttcta cgaagaccaa tcccgcccgc ctaatgcgga gcttaccgcc tgcgtcatta 7380
cccagggcca cattcttggc caattgcaag ccatcaacaa agcccgccaa gagtttctgc 7440
tacgaaaggg acggggggtt tacttggacc cccagtccgg cgaggagctc aacccaatcc 7500
ccccgccgcc gcagccctat cagcagcagc cgcgggccct tgcttcccag gatggcaccc 7560
aaaaagaagc tgcagctgcc gccgccaccc acggacgagg aggaatactg ggacagtcag 7620
gcagaggagg ttttggacga ggaggaggag gacatgatgg aagactggga gagcctagac 7680
gaggaagctt ccgaggtcga agaggtgtca gacgaaacac cgtcaccctc ggtcgcattc 7740
ccctcgccgg cgccccagaa atcggcaacc ggttccagca tggctacaac ctccgctcct 7800
caggcgccgc cggcactgcc cgttcgccga cccaaccgta gatgggacac cactggaacc 7860
agggccggta agtccaagca gccgccgccg ttagcccaag agcaacaaca gcgccaaggc 7920
taccgctcat ggcgcgggca caagaacgcc atagttgctt gcttgcaaga ctgtgggggc 7980
aacatctcct tcgcccgccg ctttcttctc taccatcacg gcgtggcctt cccccgtaac 8040
atcctgcatt actaccgtca tctctacagc ccatactgca ccggcggcag cggcagcaac 8100
agcagcggcc acacagaagc aaaggcgacc ggatagcaag actctgacaa agcccaagaa 8160
atccacagcg gcggcagcag caggaggagg agcgctgcgt ctggcgccca acgaacccgt 8220
atcgacccgc gagcttagaa acaggatttt tcccactctg tatgctatat ttcaacagag 8280
caggggccaa gaacaagagc tgaaaataaa aaacaggtct ctgcgatccc tcacccgcag 8340
ctgcctgtat cacaaaagcg aagatcagct tcggcgcacg ctggaagacg cggaggctct 8400
cttcagtaaa tactgcgcgc tgactcttaa ggactagttt cgcgcccttt ctcaaattta 8460
agcgcgaaaa ctacgtcatc tccagcggcc acacccggcg ccagcacctg ttgtcagcgc 8520
cattatgagc aaggaaattc ccacgcccta catgtggagt taccagccac aaatgggact 8580
tgcggctgga gctgcccaag actactcaac ccgaataaac tacatgagcg cggggcggcc 8640
gcaacttgtt tattgcagct tataatggtt acaaataaag caatagcatc acaaatttca 8700
caaataaagc atttttttca ctgcattcta gttgtggttt gtccaaactc atcaatgtat 8760
cttagcttaa cgggcggcga aggagaagtc cacgcctaca tgggggtaga gtcataatcg 8820
tgcatcagga tagggcggtg gtgctgcagc agcgcgcgaa taaactgctg ccgccgccgc 8880
tccgtcctgc aggaatacaa catggcagtg gtctcctcag cgatgattcg caccgcccgc 8940
agcataaggc gccttgtcct ccgggcacag cagcgcaccc tgatctcact taaatcagca 9000
cagtaactgc agcacagcac cacaatattg ttcaaaatcc cacagtgcaa ggcgctgtat 9060
ccaaagctca tggcggggac cacagaaccc acgtggccat cataccacaa gcgcaggtag 9120
attaagtggc gacccctcat aaacacgctg gacataaaca ttacctcttt tggcatgttg 9180
taattcacca cctcccggta ccatataaac ctctgattaa acatggcgcc atccaccacc 9240
atcctaaacc agctggccaa aacctgcccg ccggctatac actgcaggga accgggactg 9300
gaacaatgac agtggagagc ccaggactcg taaccatgga tcatcatgct cgtcatgata 9360
tcaatgttgg cacaacacag gcacacgtgc atacacttcc tcaggattac aagctcctcc 9420
cgcgttagaa ccatatccca gggaacaacc cattcctgaa tcagcgtaaa tcccacactg 9480
cagggaagac ctcgcacgta actcacgttg tgcattgtca aagtgttaca ttcgggcagc 9540
agcggatgat cctccagtat ggtagcgcgg gtttctgtct caaaaggagg tagacgatcc 9600
ctactgtacg gagtgcgccg agacaaccga gatcgtgttg gtcgtagtgt catgccaaat 9660
ggaacgccgg acgtagtcat atttcctgaa gcaaaaccag gtgcgggcgt gacaaacaga 9720
tctgcgtctc cggtctcgcc gcttagatcg ctctgtgtag tagttgtagt atatccactc 9780
tctcaaagca tccaggcgcc ccctggcttc gggttctatg taaactcctt catgcgccgc 9840
tgccctgata acatccacca ccgcagaata agccacaccc agccaaccta cacattcgtt 9900
ctgcgagtca cacacgggag gagcgggaag agctggaaga accatgtttt tttttttatt 9960
ccaaaagatt atccaaaacc tcaaaatgaa gatctattaa gtgaacgcgc tcccctccgg 10020
tggcgtggtc aaactctaca gccaaagaac agataatggc atttgtaaga tgttgcacaa 10080
tggcttccaa aaggcaaacg gccctcacgt ccaagtggac gtaaaggcta aacccttcag 10140
ggtgaatctc ctctataaac attccagcac cttcaaccat gcccaaataa ttctcatctc 10200
gccaccttct caatatatct ctaagcaaat cccgaatatt aagtccggcc attgtaaaaa 10260
tctgctccag agcgccctcc accttcagcc tcaagcagcg aatcatgatt gcaaaaattc 10320
aggttcctca cagacctgta taagattcaa aagcggaaca ttaacaaaaa taccgcgatc 10380
ccgtaggtcc cttcgcaggg ccagctgaac ataatcgtgc aggtctgcac ggaccagcgc 10440
ggccacttcc ccgccaggaa ccatgacaaa agaacccaca ctgattatga cacgcatact 10500
cggagctatg ctaaccagcg tagccccgat gtaagcttgt tgcatgggcg gcgatataaa 10560
atgcaaggtg ctgctcaaaa aatcaggcaa agcctcgcgc aaaaaagaaa gcacatcgta 10620
gtcatgctca tgcagataaa ggcaggtaag ctccggaacc accacagaaa aagacaccat 10680
ttttctctca aacatgtctg cgggtttctg cataaacaca aaataaaata acaaaaaaac 10740
atttaaacat tagaagcctg tcttacaaca ggaaaaacaa cccttataag cataagacgg 10800
actacggcca tgccggcgtg accgtaaaaa aactggtcac cgtgattaaa aagcaccacc 10860
gacagctcct cggtcatgtc cggagtcata atgtaagact cggtaaacac atcaggttga 10920
ttcacatcgg tcagtgctaa aaagcgaccg aaatagcccg ggggaataca tacccgcagg 10980
cgtagagaca acattacagc ccccatagga ggtataacaa aattaatagg agagaaaaac 11040
acataaacac ctgaaaaacc ctcctgccta ggcaaaatag caccctcccg ctccagaaca 11100
acatacagcg cttccacagc ggcagccata acagtcagcc ttaccagtaa aaaagaaaac 11160
ctattaaaaa aacaccactc gacacggcac cagctcaatc agtcacagtg taaaaaaggg 11220
ccaagtgcag agcgagtata tataggacta aaaaatgacg taacggttaa agtccacaaa 11280
aaacacccag aaaaccgcac gcgaacctac gcccagaaac gaaagccaaa aaacccacaa 11340
cttcctcaaa tcgtcacttc cgttttccca cgttacgtca cttcccattt taagaaaact 11400
acaattccca acacatacaa gttactccgc ccttaattaa atcggatccg atatctagat 11460
gtattcgcga ggtaccgagc tcgaattctc tggccgtcgt tttacaacgt cgtgactggg 11520
aaaaccctgg cgttacccaa cttaatcgcc ttgcagcaca tccccctttc gccagctggc 11580
gtaatagcga agaggcccgc accgatcgcc cttcccaaca gttgcgcagc ctgaatggcg 11640
aatggcgcct gatgcggtat tttctcctta cgcatctgtg cggtatttca caccgcatat 11700
ggtgcactct cagtacaatc tgctctgatg ccgcatagtt aagccagccc cgacacccgc 11760
caacacccgc tgacgcgccc tgacgggctt gtctgctccc ggcatccgct tacagacaag 11820
ctgtgaccgt ctccgggagc tgcatgtgtc agaggttttc accgtcatca ccgaaacgcg 11880
cga 11883
<210> 98
<211> 11980
<212> DNA
<213> Artificial Sequence
<220>
<223> Description of Artificial Sequence: Synthetic polynucleotide
<400> 98
tgcagctctg gcccgtgtct caaaatctct gatgttacat tgcacaagat aaaaatatat 60
catcatgaac aataaaactg tctgcttaca taaacagtaa tacaaggggt gttatgagcc 120
atattcaacg ggaaacgtcg aggccgcgat taaattccaa catggatgct gatttatatg 180
ggtataaatg ggctcgcgat aatgtcgggc aatcaggtgc gacaatctat cgcttgtatg 240
ggaagcccga tgcgccagag ttgtttctga aacatggcaa aggtagcgtt gccaatgatg 300
ttacagatga gatggtcaga ctaaactggc tgacggaatt tatgcctctt ccgaccatca 360
agcattttat ccgtactcct gatgatgcat ggttactcac cactgcgatc cccggaaaaa 420
cagcattcca ggtattagaa gaatatcctg attcaggtga aaatattgtt gatgcgctgg 480
cagtgttcct gcgccggttg cattcgattc ctgtttgtaa ttgtcctttt aacagcgatc 540
gcgtatttcg tctcgctcag gcgcaatcac gaatgaataa cggtttggtt gatgcgagtg 600
attttgatga cgagcgtaat ggctggcctg ttgaacaagt ctggaaagaa atgcataaac 660
ttttgccatt ctcaccggat tcagtcgtca ctcatggtga tttctcactt gataacctta 720
tttttgacga ggggaaatta ataggttgta ttgatgttgg acgagtcgga atcgcagacc 780
gataccagga tcttgccatc ctatggaact gcctcggtga gttttctcct tcattacaga 840
aacggctttt tcaaaaatat ggtattgata atcctgatat gaataaattg cagtttcatt 900
tgatgctcga tgagtttttc taatcagaat tggttaattg gttgtaacat tattcagatt 960
gggcttgatt taaaacttca tttttaattt aaaaggatct aggtgaagat cctttttgat 1020
aatctcatga ccaaaatccc ttaacgtgag ttttcgttcc actgagcgtc agaccccgta 1080
gaaaagatca aaggatcttc ttgagatcct ttttttctgc gcgtaatctg ctgcttgcaa 1140
acaaaaaaac caccgctacc agcggtggtt tgtttgccgg atcaagagct accaactctt 1200
tttccgaagg taactggctt cagcagagcg cagataccaa atactgttct tctagtgtag 1260
ccgtagttag gccaccactt caagaactct gtagcaccgc ctacatacct cgctctgcta 1320
atcctgttac cagtggctgc tgccagtggc gataagtcgt gtcttaccgg gttggactca 1380
agacgatagt taccggataa ggcgcagcgg tcgggctgaa cggggggttc gtgcacacag 1440
cccagcttgg agcgaacgac ctacaccgaa ctgagatacc tacagcgtga gctatgagaa 1500
agcgccacgc ttcccgaagg gagaaaggcg gacaggtatc cggtaagcgg cagggtcgga 1560
acaggagagc gcacgaggga gcttccaggg ggaaacgcct ggtatcttta tagtcctgtc 1620
gggtttcgcc acctctgact tgagcgtcga tttttgtgat gctcgtcagg ggggcggagc 1680
ctatggaaaa acgccagcaa cgcggccttt ttacggttcc tggccttttg ctggcctttt 1740
gctcacatgt tctttcctgc gttatcccct gattctgtgg ataaccgtat taccgccttt 1800
gagtgagctg ataccgctcg ccgcagccga acgaccgagc gcagcgagtc agtgagcgag 1860
gaagcggaag agcgcccaat acgcaaaccg cctctccccg cgcgttggcc gattcattaa 1920
tgcagctggc acgacaggtt tcccgactgg aaagcgggca gtgagcgcaa cgcaattaat 1980
gtgagttagc tcactcatta ggcaccccag gctttacact ttatgcttcc ggctcgtatg 2040
ttgtgtggaa ttgtgagcgg ataacaattt cacacaggaa acagctatga ccatgattac 2100
accaagcttg catgcaggcc tctgcagtcg accagaagca ccatgtcctt gggtccggcc 2160
tgctgaatgc gcaggcggtc ggccatgccc caggcttcgt tttgacatcg gcgcaggtct 2220
ttgtagtagt cttgcatgag cctttctacc ggcacttctt cttctccttc ctcttgtcct 2280
gcatctcttg catctatcgc tgcggcggcg gcggagtttg gccgtaggtg gcgccctctt 2340
cctcccatgc gtgtgacccc gaagcccctc atcggctgaa gcagggctag gtcggcgaca 2400
acgcgctcgg ctaatatggc ctgctgcacc tgcgtgaggg tagactggaa gtcatccatg 2460
tccacaaagc ggtggtatgc gcccgtgttg atggtgtaag tgcagttggc cataacggac 2520
cagttaacgg tctggtgacc cggctgcgag agctcggtgt acctgagacg cgagtaagcc 2580
ctcgagtcaa atacgtagtc gttgcaagtc cgcaccaggt actggtatcc caccaaaaag 2640
tgcggcggcg gctggcggta gaggggccag cgtagggtgg ccggggctcc gggggcgaga 2700
tcttccaaca taaggcgatg atatccgtag atgtacctgg acatccaggt gatgccggcg 2760
gcggtggtgg aggcgcgcgg aaagtcgcgg acgcggttcc agatgttgcg cagcggcaaa 2820
aagtgctcca tggtcgggac gctctggccg gtcaggcgcg cgcaatcgtt gacgctctag 2880
cgtgcaaaag gagagcctgt aagcgggcac tcttccgtgg tctggtggat aaattcgcaa 2940
gggtatcatg gcggacgacc ggggttcgag ccccgtatcc ggccgtccgc cgtgatccat 3000
gcggttaccg cccgcgtgtc gaacccaggt gtgcgacgtc agacaacggg ggagtgctcc 3060
ttttggcttc cttccaggcg cggcggctgc tgcgctagct tttttggcca ctggccgcgc 3120
gcagcgtaag cggttaggct ggaaagcgaa agcattaagt ggctcgctcc ctgtagccgg 3180
agggttattt tccaagggtt gagtcgcggg acccccggtt cgagtctcgg accgagactg 3240
ggggcgtaca ctggatggcc tttgcctgga acccgcactc aaaaacatgc tacctctttg 3300
agccctttgg cttttctgac cagcgactca agcaggttta ccagtttgag tacgagtcac 3360
tcctgcgccg tagcgccatt gcttcttccc ccgaccgctg tataacgctg gaaaagtcca 3420
cccaaagcgt acaggggccc aactcggccg cctgtggact attctgctgc atgtttctcc 3480
acgcctttgc caactggccc caaactccca tggatcacaa ccccaccatg aaccttatta 3540
ccggggtacc caactccatg ctcaacagtc cccaggtaca gcccaccctg cgtcgcaacc 3600
aggaacagct ctacagcttc ctggagcgcc actcgcccta cttccgcagc cacagtgcgc 3660
agattaggag cgccacttct ttttgtcact tgaaaaacat gtaaaaataa tgtactagag 3720
acactttcaa taaaggcaaa tgcttttatt tgtacactct cgggtgatta tttaccccca 3780
cccttgccgt ctgcgccgtt taaaaatcaa aggggttctg ccgcgcatcg ctatgcgcca 3840
ctggcaggga cacgttgcga tactggtgtt tagtgctcca cttaaactca ggcacaacca 3900
tccgcggcag ctcggtgaag ttttcactcc acaggctgcg caccatcacc aacgcgttta 3960
gcaggtcggg cgccgatatc ttgaagtcgc agttggggcc tccgccctgc gcgcgcgagt 4020
tgcgatacac agggttgcag cactggaaca ctatcagcgc cgggtggtgc acgctggcca 4080
gcacgctctt gtcggagatc agatccgcgt ccaggtcctc cgcgttgctc agggcgaacg 4140
gagtcaactt tggtagctgc cttcccaaaa agggcgcgtg cccaggcttt gagttgcact 4200
cgcaccgtag tggcatcaaa aggtgaccgt gcccggtctg ggcgttagga tacagcgcct 4260
gcataaaagc cttgatctgc ttaaaagcca cctgagcctt tgcgccttca gagaagaaca 4320
tgccgcaaga cttgccggaa aactgattgg ccggacaggc cgcgtcgtgc acgcagcacc 4380
ttgcgtcggt gttggagatc tgcaccacat ttcggcccca ccggttcttc acgatcttgg 4440
ccttgctaga ctgctccttc agcgcgcgct gcccgttttc gctcgtcaca tccatttcaa 4500
tcacgtgctc cttatttatc ataatgcttc cgtgtagaca cttaagctcg ccttcgatct 4560
cagcgcagcg gtgcagccac aacgcgcagc ccgtgggctc gtgatgcttg taggtcacct 4620
ctgcaaacga ctgcaggtac gcctgcagga atcgccccat catcgtcaca aaggtcttgt 4680
tgctggtgaa ggtcagctgc aacccgcggt gctcctcgtt cagccaggtc ttgcatacgg 4740
ccgccagagc ttccacttgg tcaggcagta gtttgaagtt cgcctttaga tcgttatcca 4800
cgtggtactt gtccatcagc gcgcgcgcag cctccatgcc cttctcccac gcagacacga 4860
tcggcacact cagcgggttc atcaccgtaa tttcactttc cgcttcgctg ggctcttcct 4920
cttcctcttg cgtccgcata ccacgcgcca ctgggtcgtc ttcattcagc cgccgcactg 4980
tgcgcttacc tcctttgcca tgcttgatta gcaccggtgg gttgctgaaa cccaccattt 5040
gtagcgccac atcttctctt tcttcctcgc tgtccacgat tacctctggt gatggcgggc 5100
gctcgggctt gggagaaggg cgcttctttt tcttcttggg cgcaatggcc aaatccgccg 5160
ccgaggtcga tggccgcggg ctgggtgtgc gcggcaccag cgcgtcttgt gatgagtctt 5220
cctcgtcctc ggactcgata cgccgcctca tccgcttttt tgggggcgcc cggggaggcg 5280
gcggcgacgg ggacggggac gacacgtcct ccatggttgg gggacgtcgc gccgcaccgc 5340
gtccgcgctc gggggtggtt tcgcgctgct cctcttcccg actggccatt tccttctcct 5400
ataggcagaa aaagatcatg gagtcagtcg agaagaagga cagcctaacc gccccctctg 5460
agttcgccac caccgcctcc accgatgccg ccaacgcgcc taccaccttc cccgtcgagg 5520
cacccccgct tgaggaggag gaagtgatta tcgagcagga cccaggtttt gtaagcgaag 5580
acgacgagga ccgctcagta ccaacagagg ataaaaagca agaccaggac aacgcagagg 5640
caaacgagga acaagtcggg cggggggacg aaaggcatgg cgactaccta gatgtgggag 5700
acgacgtgct gttgaagcat ctgcagcgcc agtgcgccat tatctgcgac gcgttgcaag 5760
agcgcagcga tgtgcccctc gccatagcgg atgtcagcct tgcctacgaa cgccacctat 5820
tctcaccgcg cgtacccccc aaacgccaag aaaacggcac atgcgagccc aacccgcgcc 5880
tcaacttcta ccccgtattt gccgtgccag aggtgcttgc cacctatcac atctttttcc 5940
aaaactgcaa gataccccta tcctgccgtg ccaaccgcag ccgagcggac aagcagctgg 6000
ccttgcggca gggcgctgtc atacctgata tcgcctcgct caacgaagtg ccaaaaatct 6060
ttgagggtct tggacgcgac gagaagcgcg cggcaaacgc tctgcaacag gaaaacagcg 6120
aaaatgaaag tcactctgga gtgttggtgg aactcgaggg tgacaacgcg cgcctagccg 6180
tactaaaacg cagcatcgag gtcacccact ttgcctaccc ggcacttaac ctacccccca 6240
aggtcatgag cacagtcatg agtgagctga tcgtgcgccg tgcgcagccc ctggagaggg 6300
atgcaaattt gcaagaacaa acagaggagg gcctacccgc agttggcgac gagcagctag 6360
cgcgctggct tcaaacgcgc gagcctgccg acttggagga gcgacgcaaa ctaatgatgg 6420
ccgcagtgct cgttaccgtg gagcttgagt gcatgcagcg gttctttgct gacccggaga 6480
tgcagcgcaa gctagaggaa acattgcact acacctttcg acagggctac gtacgccagg 6540
cctgcaagat ctccaacgtg gagctctgca acctggtctc ctaccttgga attttgcacg 6600
aaaaccgcct tgggcaaaac gtgcttcatt ccacgctcaa gggcgaggcg cgccgcgact 6660
acgtccgcga ctgcgtttac ttatttctat gctacacctg gcagacggcc atgggcgttt 6720
ggcagcagtg cttggaggag tgcaacctca aggagctgca gaaactgcta aagcaaaact 6780
tgaaggacct atggacggcc ttcaacgagc gctccgtggc cgcgcacctg gcggacatca 6840
ttttccccga acgcctgctt aaaaccctgc aacagggtct gccagacttc accagtcaaa 6900
gcatgttgca gaactttagg aactttatcc tagagcgctc aggaatcttg cccgccacct 6960
gctgtgcact tcctagcgac tttgtgccca ttaagtaccg cgaatgccct ccgccgcttt 7020
ggggccactg ctaccttctg cagctagcca actaccttgc ctaccactct gacataatgg 7080
aagacgtgag cggtgacggt ctactggagt gtcactgtcg ctgcaaccta tgcaccccgc 7140
accgctccct ggtttgcaat tcgcagctgc ttaacgaaag tcaaattatc ggtacctttg 7200
agctgcaggg tccctcgcct gacgaaaagt ccgcggctcc ggggttgaaa ctcactccgg 7260
ggctgtggac gtcggcttac cttcgcaaat ttgtacctga ggactaccac gcccacgaga 7320
ttaggttcta cgaagaccaa tcccgcccgc ctaatgcgga gcttaccgcc tgcgtcatta 7380
cccagggcca cattcttggc caattgcaag ccatcaacaa agcccgccaa gagtttctgc 7440
tacgaaaggg acggggggtt tacttggacc cccagtccgg cgaggagctc aacccaatcc 7500
ccccgccgcc gcagccctat cagcagcagc cgcgggccct tgcttcccag gatggcaccc 7560
aaaaagaagc tgcagctgcc gccgccaccc acggacgagg aggaatactg ggacagtcag 7620
gcagaggagg ttttggacga ggaggaggag gacatgatgg aagactggga gagcctagac 7680
gaggaagctt ccgaggtcga agaggtgtca gacgaaacac cgtcaccctc ggtcgcattc 7740
ccctcgccgg cgccccagaa atcggcaacc ggttccagca tggctacaac ctccgctcct 7800
caggcgccgc cggcactgcc cgttcgccga cccaaccgta gatgggacac cactggaacc 7860
agggccggta agtccaagca gccgccgccg ttagcccaag agcaacaaca gcgccaaggc 7920
taccgctcat ggcgcgggca caagaacgcc atagttgctt gcttgcaaga ctgtgggggc 7980
aacatctcct tcgcccgccg ctttcttctc taccatcacg gcgtggcctt cccccgtaac 8040
atcctgcatt actaccgtca tctctacagc ccatactgca ccggcggcag cggcagcaac 8100
agcagcggcc acacagaagc aaaggcgacc ggatagcaag actctgacaa agcccaagaa 8160
atccacagcg gcggcagcag caggaggagg agcgctgcgt ctggcgccca acgaacccgt 8220
atcgacccgc gagcttagaa acaggatttt tcccactctg tatgctatat ttcaacagag 8280
caggggccaa gaacaagagc tgaaaataaa aaacaggtct ctgcgatccc tcacccgcag 8340
ctgcctgtat cacaaaagcg aagatcagct tcggcgcacg ctggaagacg cggaggctct 8400
cttcagtaaa tactgcgcgc tgactcttaa ggactagttt cgcgcccttt ctcaaattta 8460
agcgcgaaaa ctacgtcatc tccagcggcc acacccggcg ccagcacctg ttgtcagcgc 8520
cattatgagc aaggaaattc ccacgcccta catgtggagt taccagccac aaatgggact 8580
tgcggctgga gctgcccaag actactcaac ccgaataaac tacatgagcg cggggcggcc 8640
gcaacttgtt tattgcagct tataatggtt acaaataaag caatagcatc acaaatttca 8700
caaataaagc atttttttca ctgcattcta gttgtggttt gtccaaactc atcaatgtat 8760
cttagcttaa cgggcggcga aggagaagtc cacgcctaca tgggggtaga gtcataatcg 8820
tgcatcagga tagggcggtg gtgctgcagc agcgcgcgaa taaactgctg ccgccgccgc 8880
tccgtcctgc aggaatacaa catggcagtg gtctcctcag cgatgattcg caccgcccgc 8940
agcataaggc gccttgtcct ccgggcacag cagcgcaccc tgatctcact taaatcagca 9000
cagtaactgc agcacagcac cacaatattg ttcaaaatcc cacagtgcaa ggcgctgtat 9060
ccaaagctca tggcggggac cacagaaccc acgtggccat cataccacaa gcgcaggtag 9120
attaagtggc gacccctcat aaacacgctg gacataaaca ttacctcttt tggcatgttg 9180
taattcacca cctcccggta ccatataaac ctctgattaa acatggcgcc atccaccacc 9240
atcctaaacc agctggccaa aacctgcccg ccggctatac actgcaggga accgggactg 9300
gaacaatgac agtggagagc ccaggactcg taaccatgga tcatcatgct cgtcatgata 9360
tcaatgttgg cacaacacag gcacacgtgc atacacttcc tcaggattac aagctcctcc 9420
cgcgttagaa ccatatccca gggaacaacc cattcctgaa tcagcgtaaa tcccacactg 9480
cagggaagac ctcgcacgta actcacgttg tgcattgtca aagtgttaca ttcgggcagc 9540
agcggatgat cctccagtat ggtagcgcgg gtttctgtct caaaaggagg tagacgatcc 9600
ctactgtacg gagtgcgccg agacaaccga gatcgtgttg gtcgtagtgt catgccaaat 9660
ggaacgccgg acgtagtcat atttcctgaa gcaaaaccag gtgcgggcgt gacaaacaga 9720
tctgcgtctc cggtctcgcc gcttagatcg ctctgtgtag tagttgtagt atatccactc 9780
tctcaaagca tccaggcgcc ccctggcttc gggttctatg taaactcctt catgcgccgc 9840
tgccctgata acatccacca ccgcagaata agccacaccc agccaaccta cacattcgtt 9900
ctgcgagtca cacacgggag gagcgggaag agctggaaga accatgtttt tttttttatt 9960
ccaaaagatt atccaaaacc tcaaaatgaa gatctattaa gtgaacgcgc tcccctccgg 10020
tggcgtggtc aaactctaca gccaaagaac agataatggc atttgtaaga tgttgcacaa 10080
tggcttccaa aaggcaaacg gccctcacgt ccaagtggac gtaaaggcta aacccttcag 10140
ggtgaatctc ctctataaac attccagcac cttcaaccat gcccaaataa ttctcatctc 10200
gccaccttct caatatatct ctaagcaaat cccgaatatt aagtccggcc attgtaaaaa 10260
tctgctccag agcgccctcc accttcagcc tcaagcagcg aatcatgatt gcaaaaattc 10320
aggttcctca cagacctgta taagattcaa aagcggaaca ttaacaaaaa taccgcgatc 10380
ccgtaggtcc cttcgcaggg ccagctgaac ataatcgtgc aggtctgcac ggaccagcgc 10440
ggccacttcc ccgccaggaa ccatgacaaa agaacccaca ctgattatga cacgcatact 10500
cggagctatg ctaaccagcg tagccccgat gtaagcttgt tgcatgggcg gcgatataaa 10560
atgcaaggtg ctgctcaaaa aatcaggcaa agcctcgcgc aaaaaagaaa gcacatcgta 10620
gtcatgctca tgcagataaa ggcaggtaag ctccggaacc accacagaaa aagacaccat 10680
ttttctctca aacatgtctg cgggtttctg cataaacaca aaataaaata acaaaaaaac 10740
atttaaacat tagaagcctg tcttacaaca ggaaaaacaa cccttataag cataagacgg 10800
actacggcca tgccggcgtg accgtaaaaa aactggtcac cgtgattaaa aagcaccacc 10860
gacagctcct cggtcatgtc cggagtcata atgtaagact cggtaaacac atcaggttga 10920
ttcacatcgg tcagtgctaa aaagcgaccg aaatagcccg ggggaataca tacccgcagg 10980
cgtagagaca acattacagc ccccatagga ggtataacaa aattaatagg agagaaaaac 11040
acataaacac ctgaaaaacc ctcctgccta ggcaaaatag caccctcccg ctccagaaca 11100
acatacagcg cttccacagc ggcagccatg gtggcatttg caaaagccta ggcctccaaa 11160
aaagcctcct cactacttct ggaatagctc agaggccgag gcggcctcgg cctctgcata 11220
aataaaaaaa attagtcagc catggggcgg agaatgggcg gaactgggcg gagttagggg 11280
cgggatgggc ggagttaggg gcgggactat ggttgctgac taattgagat gcatgctttg 11340
catacttctg cctgctgggg agcctgggga ctttccacac ctggttgctg actaattgag 11400
atgcatgctt tgcatacttc tgcctgctgg ggagcctggg gactttccac accctaactg 11460
acacacacgt tacgtcactt cccattttaa gaaaactaca attcccaaca catacaagtt 11520
actccgccct taattaaatc ggatccgata tctagatgta ttcgcgaggt accgagctcg 11580
aattctctgg ccgtcgtttt acaacgtcgt gactgggaaa accctggcgt tacccaactt 11640
aatcgccttg cagcacatcc ccctttcgcc agctggcgta atagcgaaga ggcccgcacc 11700
gatcgccctt cccaacagtt gcgcagcctg aatggcgaat ggcgcctgat gcggtatttt 11760
ctccttacgc atctgtgcgg tatttcacac cgcatatggt gcactctcag tacaatctgc 11820
tctgatgccg catagttaag ccagccccga cacccgccaa cacccgctga cgcgccctga 11880
cgggcttgtc tgctcccggc atccgcttac agacaagctg tgaccgtctc cgggagctgc 11940
atgtgtcaga ggttttcacc gtcatcaccg aaacgcgcga 11980
<210> 99
<211> 17250
<212> DNA
<213> Artificial Sequence
<220>
<223> Description of Artificial Sequence: Synthetic polynucleotide
<400> 99
tgcagctctg gcccgtgtct caaaatctct gatgttacat tgcacaagat aaaaatatat 60
catcatgaac aataaaactg tctgcttaca taaacagtaa tacaaggggt gttatgagcc 120
atattcaacg ggaaacgtcg aggccgcgat taaattccaa catggatgct gatttatatg 180
ggtataaatg ggctcgcgat aatgtcgggc aatcaggtgc gacaatctat cgcttgtatg 240
ggaagcccga tgcgccagag ttgtttctga aacatggcaa aggtagcgtt gccaatgatg 300
ttacagatga gatggtcaga ctaaactggc tgacggaatt tatgcctctt ccgaccatca 360
agcattttat ccgtactcct gatgatgcat ggttactcac cactgcgatc cccggaaaaa 420
cagcattcca ggtattagaa gaatatcctg attcaggtga aaatattgtt gatgcgctgg 480
cagtgttcct gcgccggttg cattcgattc ctgtttgtaa ttgtcctttt aacagcgatc 540
gcgtatttcg tctcgctcag gcgcaatcac gaatgaataa cggtttggtt gatgcgagtg 600
attttgatga cgagcgtaat ggctggcctg ttgaacaagt ctggaaagaa atgcataaac 660
ttttgccatt ctcaccggat tcagtcgtca ctcatggtga tttctcactt gataacctta 720
tttttgacga ggggaaatta ataggttgta ttgatgttgg acgagtcgga atcgcagacc 780
gataccagga tcttgccatc ctatggaact gcctcggtga gttttctcct tcattacaga 840
aacggctttt tcaaaaatat ggtattgata atcctgatat gaataaattg cagtttcatt 900
tgatgctcga tgagtttttc taatcagaat tggttaattg gttgtaacat tattcagatt 960
gggcttgatt taaaacttca tttttaattt aaaaggatct aggtgaagat cctttttgat 1020
aatctcatga ccaaaatccc ttaacgtgag ttttcgttcc actgagcgtc agaccccgta 1080
gaaaagatca aaggatcttc ttgagatcct ttttttctgc gcgtaatctg ctgcttgcaa 1140
acaaaaaaac caccgctacc agcggtggtt tgtttgccgg atcaagagct accaactctt 1200
tttccgaagg taactggctt cagcagagcg cagataccaa atactgttct tctagtgtag 1260
ccgtagttag gccaccactt caagaactct gtagcaccgc ctacatacct cgctctgcta 1320
atcctgttac cagtggctgc tgccagtggc gataagtcgt gtcttaccgg gttggactca 1380
agacgatagt taccggataa ggcgcagcgg tcgggctgaa cggggggttc gtgcacacag 1440
cccagcttgg agcgaacgac ctacaccgaa ctgagatacc tacagcgtga gctatgagaa 1500
agcgccacgc ttcccgaagg gagaaaggcg gacaggtatc cggtaagcgg cagggtcgga 1560
acaggagagc gcacgaggga gcttccaggg ggaaacgcct ggtatcttta tagtcctgtc 1620
gggtttcgcc acctctgact tgagcgtcga tttttgtgat gctcgtcagg ggggcggagc 1680
ctatggaaaa acgccagcaa cgcggccttt ttacggttcc tggccttttg ctggcctttt 1740
gctcacatgt tctttcctgc gttatcccct gattctgtgg ataaccgtat taccgccttt 1800
gagtgagctg ataccgctcg ccgcagccga acgaccgagc gcagcgagtc agtgagcgag 1860
gaagcggaag agcgcccaat acgcaaaccg cctctccccg cgcgttggcc gattcattaa 1920
tgcagctggc acgacaggtt tcccgactgg aaagcgggca gtgagcgcaa cgcaattaat 1980
gtgagttagc tcactcatta ggcaccccag gctttacact ttatgcttcc ggctcgtatg 2040
ttgtgtggaa ttgtgagcgg ataacaattt cacacaggaa acagctatga ccatgattac 2100
accaagcttg catgcaggcc tatccgtaga tgtacctgga catccaggtg atgccggcgg 2160
cggtggtgga ggcgcgcgga aagtcgcgga cgcggttcca gatgttgcgc agcggcaaaa 2220
agtgctccat ggtcgggacg ctctggccgg tgaggcgtgc gcagtcgttg acgctctaga 2280
ccgtgcaaaa ggagagcctg taagcgggca ctcttccgtg gtctggtgga taaattcgca 2340
agggtatcat ggcggacgac cggggttcga accccggatc cggccgtccg ccgtgatcca 2400
tgcggttacc gcccgcgtgt cgaacccagg tgtgcgacgt cagacaacgg gggagcgctc 2460
cttttggctt ccttccaggc gcggcggctg ctgcgctagc ttttttggcc actggccgcg 2520
cgcggcgtaa gcggttaggc tggaaagcga aagcattaag tggctcgctc cctgtagccg 2580
gagggttatt ttccaagggt tgagtcgcag gacccccggt tcgagtctcg ggccggccgg 2640
actgcggcga acgggggttt gcctccccgt catgcaagac cccgcttgca aattcctccg 2700
gaaacaggga cgagcccctt ttttgctttt cccagatgca tccggtgctg cggcagatgc 2760
gcccccctcc tcagcagcgg caagagcaag agcagcggca gacatgcagg gcaccctccc 2820
cttctcctac cgcgtcagga ggggcaacat cgatccagac atgataagat acattgatga 2880
gtttggacaa accacaacta gaatgcagtg aaaaaaatgc tttatttgtg aaatttgtga 2940
tgctattgct ttatttgtaa ccattataag ctgcaataaa caagtttgta cactctcggg 3000
tgattattta cccccaccct tgccgtctgc gccgtttaaa aatcaaaggg gttctgccgc 3060
gcatcgctat gcgccactgg cagggacacg ttgcgatact ggtgtttagt gctccactta 3120
aactcaggca caaccatccg cggcagctcg gtgaagtttt cactccacag gctgcgcacc 3180
atcaccaacg cgtttagcag gtcgggcgcc gatatcttga agtcgcagtt ggggcctccg 3240
ccctgcgcgc gcgagttgcg atacacaggg ttgcagcact ggaacactat cagcgccggg 3300
tggtgcacgc tggccagcac gctcttgtcg gagatcagat ccgcgtccag gtcctccgcg 3360
ttgctcaggg cgaacggagt caactttggt agctgccttc ccaaaaaggg cgcgtgccca 3420
ggctttgagt tgcactcgca ccgtagtggc atcaaaaggt gaccgtgccc ggtctgggcg 3480
ttaggataca gcgcctgcat aaaagccttg atctgcttaa aagccacctg agcctttgcg 3540
ccttcagaga agaacatgcc gcaagacttg ccggaaaact gattggccgg acaggccgcg 3600
tcgtgcacgc agcaccttgc gtcggtgttg gagatctgca ccacatttcg gccccaccgg 3660
ttcttcacga tcttggcctt gctagactgc tccttcagcg cgcgctgccc gttttcgctc 3720
gtcacatcca tttcaatcac gtgctcctta tttatcataa tgcttccgtg tagacactta 3780
agctcgcctt cgatctcagc gcagcggtgc agccacaacg cgcagcccgt gggctcgtga 3840
tgcttgtagg tcacctctgc aaacgactgc aggtacgcct gcaggaatcg ccccatcatc 3900
gtcacaaagg tcttgttgct ggtgaaggtc agctgcaacc cgcggtgctc ctcgttcagc 3960
caggtcttgc atacggccgc cagagcttcc acttggtcag gcagtagttt gaagttcgcc 4020
tttagatcgt tatccacgtg gtacttgtcc atcagcgcgc gcgcagcctc catgcccttc 4080
tcccacgcag acacgatcgg cacactcagc gggttcatca ccgtaatttc actttccgct 4140
tcgctgggct cttcctcttc ctcttgcgtc cgcataccac gcgccactgg gtcgtcttca 4200
ttcagccgcc gcactgtgcg cttacctcct ttgccatgct tgattagcac cggtgggttg 4260
ctgaaaccca ccatttgtag cgccacatct tctctttctt cctcgctgtc cacgattacc 4320
tctggtgatg gcgggcgctc gggcttggga gaagggcgct tctttttctt cttgggcgca 4380
atggccaaat ccgccgccga ggtcgatggc cgcgggctgg gtgtgcgcgg caccagcgcg 4440
tcttgtgatg agtcttcctc gtcctcggac tcgatacgcc gcctcatccg cttttttggg 4500
ggcgcccggg gaggcggcgg cgacggggac ggggacgaca cgtcctccat ggttggggga 4560
cgtcgcgccg caccgcgtcc gcgctcgggg gtggtttcgc gctgctcctc ttcccgactg 4620
gccatttcct tctcctatag gcagaaaaag atcatggagt cagtcgagaa gaaggacagc 4680
ctaaccgccc cctctgagtt cgccaccacc gcctccaccg atgccgccaa cgcgcctacc 4740
accttccccg tcgaggcacc cccgcttgag gaggaggaag tgattatcga gcaggaccca 4800
ggttttgtaa gcgaagacga cgaggaccgc tcagtaccaa cagaggataa aaagcaagac 4860
caggacaacg cagaggcaaa cgaggaacaa gtcgggcggg gggacgaaag gcatggcgac 4920
tacctagatg tgggagacga cgtgctgttg aagcatctgc agcgccagtg cgccattatc 4980
tgcgacgcgt tgcaagagcg cagcgatgtg cccctcgcca tagcggatgt cagccttgcc 5040
tacgaacgcc acctattctc accgcgcgta ccccccaaac gccaagaaaa cggcacatgc 5100
gagcccaacc cgcgcctcaa cttctacccc gtatttgccg tgccagaggt gcttgccacc 5160
tatcacatct ttttccaaaa ctgcaagata cccctatcct gccgtgccaa ccgcagccga 5220
gcggacaagc agctggcctt gcggcagggc gctgtcatac ctgatatcgc ctcgctcaac 5280
gaagtgccaa aaatctttga gggtcttgga cgcgacgaga agcgcgcggc aaacgctctg 5340
caacaggaaa acagcgaaaa tgaaagtcac tctggagtgt tggtggaact cgagggtgac 5400
aacgcgcgcc tagccgtact aaaacgcagc atcgaggtca cccactttgc ctacccggca 5460
cttaacctac cccccaaggt catgagcaca gtcatgagtg agctgatcgt gcgccgtgcg 5520
cagcccctgg agagggatgc aaatttgcaa gaacaaacag aggagggcct acccgcagtt 5580
ggcgacgagc agctagcgcg ctggcttcaa acgcgcgagc ctgccgactt ggaggagcga 5640
cgcaaactaa tgatggccgc agtgctcgtt accgtggagc ttgagtgcat gcagcggttc 5700
tttgctgacc cggagatgca gcgcaagcta gaggaaacat tgcactacac ctttcgacag 5760
ggctacgtac gccaggcctg caagatctcc aacgtggagc tctgcaacct ggtctcctac 5820
cttggaattt tgcacgaaaa ccgccttggg caaaacgtgc ttcattccac gctcaagggc 5880
gaggcgcgcc gcgactacgt ccgcgactgc gtttacttat ttctatgcta cacctggcag 5940
acggccatgg gcgtttggca gcagtgcttg gaggagtgca acctcaagga gctgcagaaa 6000
ctgctaaagc aaaacttgaa ggacctatgg acggccttca acgagcgctc cgtggccgcg 6060
cacctggcgg acatcatttt ccccgaacgc ctgcttaaaa ccctgcaaca gggtctgcca 6120
gacttcacca gtcaaagcat gttgcagaac tttaggaact ttatcctaga gcgctcagga 6180
atcttgcccg ccacctgctg tgcacttcct agcgactttg tgcccattaa gtaccgcgaa 6240
tgccctccgc cgctttgggg ccactgctac cttctgcagc tagccaacta ccttgcctac 6300
cactctgaca taatggaaga cgtgagcggt gacggtctac tggagtgtca ctgtcgctgc 6360
aacctatgca ccccgcaccg ctccctggtt tgcaattcgc agctgcttaa cgaaagtcaa 6420
attatcggta cctttgagct gcagggtccc tcgcctgacg aaaagtccgc ggctccgggg 6480
ttgaaactca ctccggggct gtggacgtcg gcttaccttc gcaaatttgt acctgaggac 6540
taccacgccc acgagattag gttctacgaa gaccaatccc gcccgcctaa tgcggagctt 6600
accgcctgcg tcattaccca gggccacatt cttggccaat tgcaagccat caacaaagcc 6660
cgccaagagt ttctgctacg aaagggacgg ggggtttact tggaccccca gtccggcgag 6720
gagctcaacc caatcccccc gccgccgcag ccctatcagc agcagccgcg ggcccttgct 6780
tcccaggatg gcacccaaaa agaagctgca gctgccgccg ccacccacgg acgaggagga 6840
atactgggac agtcaggcag aggaggtttt ggacgaggag gaggaggaca tgatggaaga 6900
ctgggagagc ctagacgagg aagcttccga ggtcgaagag gtgtcagacg aaacaccgtc 6960
accctcggtc gcattcccct cgccggcgcc ccagaaatcg gcaaccggtt ccagcatggc 7020
tacaacctcc gctcctcagg cgccgccggc actgcccgtt cgccgaccca accgtagatg 7080
ggacaccact ggaaccaggg ccggtaagtc caagcagccg ccgccgttag cccaagagca 7140
acaacagcgc caaggctacc gctcatggcg cgggcacaag aacgccatag ttgcttgctt 7200
gcaagactgt gggggcaaca tctccttcgc ccgccgcttt cttctctacc atcacggcgt 7260
ggccttcccc cgtaacatcc tgcattacta ccgtcatctc tacagcccat actgcaccgg 7320
cggcagcggc agcaacagca gcggccacac agaagcaaag gcgaccggat agcaagactc 7380
tgacaaagcc caagaaatcc acagcggcgg cagcagcagg aggaggagcg ctgcgtctgg 7440
cgcccaacga acccgtatcg acccgcgagc ttagaaacag gatttttccc actctgtatg 7500
ctatatttca acagagcagg ggccaagaac aagagctgaa aataaaaaac aggtctctgc 7560
gatccctcac ccgcagctgc ctgtatcaca aaagcgaaga tcagcttcgg cgcacgctgg 7620
aagacgcgga ggctctcttc agtaaatact gcgcgctgac tcttaaggac tagtttcgcg 7680
ccctttctca aatttaagcg cgaaaactac gtcatctcca gcggccacac ccggcgccag 7740
cacctgttgt cagcgccatt atgagcaagg aaattcccac gccctacatg tggagttacc 7800
agccacaaat gggacttgcg gctggagctg cccaagacta ctcaacccga ataaactaca 7860
tgagcgcggg gcggccgccg tttgtgttat gtttcaacgt gtttattttt caattgcaga 7920
aaatttcaag tcatttttca ttcagtagta tagccccacc accacatagc ttatacagat 7980
caccgtacct taatcaaact cacagaaccc tagtattcaa cctgccacct ccctcccaac 8040
acacagagta cacagtcctt tctccccggc tggccttaaa aagcatcata tcatgggtaa 8100
cagacatatt cttaggtgtt atattccaca cggtttcctg tcgagccaaa cgctcatcag 8160
tgatattaat aaactccccg ggcagctcac ttaagttcat gtcgctgtcc agctgctgag 8220
ccacaggctg ctgtccaact tgcggttgct taacgggcgg cgaaggagaa gtccacgcct 8280
acatgggggt agagtcataa tcgtgcatca ggatagggcg gtggtgctgc agcagcgcgc 8340
gaataaactg ctgccgccgc cgctccgtcc tgcaggaata caacatggca gtggtctcct 8400
cagcgatgat tcgcaccgcc cgcagcataa ggcgccttgt cctccgggca cagcagcgca 8460
ccctgatctc acttaaatca gcacagtaac tgcagcacag caccacaata ttgttcaaaa 8520
tcccacagtg caaggcgctg tatccaaagc tcatggcggg gaccacagaa cccacgtggc 8580
catcatacca caagcgcagg tagattaagt ggcgacccct cataaacacg ctggacataa 8640
acattacctc ttttggcatg ttgtaattca ccacctcccg gtaccatata aacctctgat 8700
taaacatggc gccatccacc accatcctaa accagctggc caaaacctgc ccgccggcta 8760
tacactgcag ggaaccggga ctggaacaat gacagtggag agcccaggac tcgtaaccat 8820
ggatcatcat gctcgtcatg atatcaatgt tggcacaaca caggcacacg tgcatacact 8880
tcctcaggat tacaagctcc tcccgcgtta gaaccatatc ccagggaaca acccattcct 8940
gaatcagcgt aaatcccaca ctgcagggaa gacctcgcac gtaactcacg ttgtgcattg 9000
tcaaagtgtt acattcgggc agcagcggat gatcctccag tatggtagcg cgggtttctg 9060
tctcaaaagg aggtagacga tccctactgt acggagtgcg ccgagacaac cgagatcgtg 9120
ttggtcgtag tgtcatgcca aatggaacgc cggacgtagt catatttcct gaagcaaaac 9180
caggtgcggg cgtgacaaac agatctgcgt ctccggtctc gccgcttaga tcgctctgtg 9240
tagtagttgt agtatatcca ctctctcaaa gcatccaggc gccccctggc ttcgggttct 9300
atgtaaactc cttcatgcgc cgctgccctg ataacatcca ccaccgcaga ataagccaca 9360
cccagccaac ctacacattc gttctgcgag tcacacacgg gaggagcggg aagagctgga 9420
agaaccatgt tttttttttt attccaaaag attatccaaa acctcaaaat gaagatctat 9480
taagtgaacg cgctcccctc cggtggcgtg gtcaaactct acagccaaag aacagataat 9540
ggcatttgta agatgttgca caatggcttc caaaaggcaa acggccctca cgtccaagtg 9600
gacgtaaagg ctaaaccctt cagggtgaat ctcctctata aacattccag caccttcaac 9660
catgcccaaa taattctcat ctcgccacct tctcaatata tctctaagca aatcccgaat 9720
attaagtccg gccattgtaa aaatctgctc cagagcgccc tccaccttca gcctcaagca 9780
gcgaatcatg attgcaaaaa ttcaggttcc tcacagacct gtataagatt caaaagcgga 9840
acattaacaa aaataccgcg atcccgtagg tcccttcgca gggccagctg aacataatcg 9900
tgcaggtctg cacggaccag cgcggccact tccccgccag gaaccatgac aaaagaaccc 9960
acactgatta tgacacgcat actcggagct atgctaacca gcgtagcccc gatgtaagct 10020
tgttgcatgg gcggcgatat aaaatgcaag gtgctgctca aaaaatcagg caaagcctcg 10080
cgcaaaaaag aaagcacatc gtagtcatgc tcatgcagat aaaggcaggt aagctccgga 10140
accaccacag aaaaagacac catttttctc tcaaacatgt ctgcgggttt ctgcataaac 10200
acaaaataaa ataacaaaaa aacatttaaa cattagaagc ctgtcttaca acaggaaaaa 10260
caacccttat aagcataaga cggactacgg ccatgccggc gtgaccgtaa aaaaactggt 10320
caccgtgatt aaaaagcacc accgacagct cctcggtcat gtccggagtc ataatgtaag 10380
actcggtaaa cacatcaggt tgattcacat cggtcagtgc taaaaagcga ccgaaatagc 10440
ccgggggaat acatacccgc aggcgtagag acaacattac agcccccata ggaggtataa 10500
caaaattaat aggagagaaa aacacataaa cacctgaaaa accctcctgc ctaggcaaaa 10560
tagcaccctc ccgctccaga acaacataca gcgcttccac agcggcagcc ataacagtca 10620
gccttaccag taaaaaagaa aacctattaa aaaaacacca ctcgacacgg caccagctca 10680
atcagtcaca gtgtaaaaaa gggccaagtg cagagcgagt atatatagga ctaaaaaatg 10740
acgtaacggt taaagtccac aaaaaacacc cagaaaaccg cacgcgaacc tacgcccaga 10800
aacgaaagcc aaaaaaccca caacttcctc aaatcgtcac ttccgttttc ccacgttacg 10860
tcacttccca ttttaagaaa actacaattc ccaacacata caagttactc cgcccttaat 10920
taaatcggat ccgatatcta gatgtattcg cgaggtaccg agctcgaatt ctctggccgt 10980
cgttttacaa cgtcgtgact gggaaaaccc tggcgttacc caacttaatc gccttgcagc 11040
acatccccct ttcgccagct ggcgtaatag cgaagaggcc cgcaccgatc gcccttccca 11100
acagttgcgc agcctgaatg gcgaatggcg cctgatgcgg tattttctcc ttacgcatct 11160
gtgcggtatt tcacaccgca tatcttcatt taaatgtgtg tcagttaggg tgtggaaagt 11220
ccccaggctc cccagcaggc agaagtatgc aaagcatgca tctcaattag tcagcaacca 11280
ggtgtggaaa gtccccaggc tccccagcag gcagaagtat gcaaagcatg catctcaatt 11340
agtcagcaac catagtcccg cccctaactc cgcccatccc gcccctaact ccgcccagtt 11400
ccgcccattc tccgccccat ggctgactaa ttttttttat ttatgcagag gccgaggccg 11460
cctcggcctc tgagctattc cagaagtagt gaggaggctt ttttggaggc ctaggctttt 11520
gcaaacgccg gcgcaccgcg ggcccgatcc accggtactg ttggtaaagc caccatgttt 11580
tccggtggcg gcggcccgct gtcccccgga ggaaagtcgg cggccagggc ggcgtccggg 11640
ttttttgcgc ccgccggccc tcgcggagcc agccggggac ccccgccttg tttgaggcaa 11700
aacttttaca acccctacct cgccccagtc gggacgcaac agaagccgac cgggccaacc 11760
cagcgccata cgtactatag cgaatgcgat gaatttcgat tcatcgcccc gcgggtgctg 11820
gacgaggatg cccccccgga gaagcgcgcc ggggtgcacg acggtcacct caagcgcgcc 11880
cccaaggtgt actgcggggg ggacgagcgc gacgtcctcc gcgtcgggtc gggcggcttc 11940
tggccgcggc gctcgcgcct gtggggcggc gtggaccacg ccccggcggg gttcaacccc 12000
accgtcaccg tctttcacgt gtacgacatc ctggagaacg tggagcacgc gtacggcatg 12060
cgcgcggccc agttccacgc gcggtttatg gacgccatca caccgacggg gaccgtcatc 12120
acgctcctgg gcctgactcc ggaaggccac cgggtggccg ttcacgttta cggcacgcgg 12180
cagtactttt acatgaacaa ggaggaggtc gacaggcacc tacaatgccg cgccccacga 12240
gatctctgcg agcgcatggc cgcggccctg cgcgagtccc cgggcgcgtc gttccgcggc 12300
atctccgcgg accacttcga ggcggaggtg gtggagcgca ccgacgtgta ctactacgag 12360
acgcgccccg ctctgtttta ccgcgtctac gtccgaagcg ggcgcgtgct gtcgtacctg 12420
tgcgacaact tctgcccggc catcaagaag tacgagggtg gggtcgacgc caccacccgg 12480
ttcatcctgg acaaccccgg gttcgtcacc ttcggctggt accgtctcaa accgggccgg 12540
aacaacacgc tagcccagcc gcgggccccg atggccttcg ggacatccag cgacgtcgag 12600
tttaactgta cggcggacaa cctggccatc gaggggggca tgagcgacct accggcatac 12660
aagctcatgt gcttcgatat cgaatgcaag gcgggggggg aggacgagct ggcctttccg 12720
gtggccgggc acccggagga cctggtcatc cagatatcct gtctgctcta cgacctgtcc 12780
accaccgccc tggagcacgt cctcctgttt tcgctcggtt cctgcgacct ccccgaatcc 12840
cacctgaacg agctggcggc caggggcctg cccacgcccg tggttctgga attcgacagc 12900
gaattcgaga tgctgttggc cttcatgacc cttgtgaaac agtacggccc cgagttcgtg 12960
accgggtaca acatcatcaa cttcgactgg cccttcttgc tggccaagct gacggacatt 13020
tacaaggtcc ccctggacgg gtacggccgc atgaacggcc ggggcgtgtt tcgcgtgtgg 13080
gacataggcc agagccactt ccagaagcgc agcaagataa aggtgaacgg catggtgaac 13140
atcgacatgt acgggattat aaccgacaag atcaagctct cgagctacaa gctcaacgcc 13200
gtggccgaag ccgtcctgaa ggacaagaag aaggacctga gctatcgcga catccccgcc 13260
tactacgccg ccgggcccgc gcaacgcggg gtgatcggcg agtactgcat acaggattcc 13320
ctgctggtgg gccagctgtt ttttaagttt ttgccccatc tggagctctc ggccgtcgcg 13380
cgcttggcgg gtattaacat cacccgcacc atctacgacg gccagcagat ccgcgtcttt 13440
acgtgcctgc tgcgcctggc cgaccagaag ggctttattc tgccggacac ccaggggcga 13500
tttaggggcg ccggggggga ggcgcccaag cgtccggccg cagcccggga ggacgaggag 13560
cggccagagg aggaggggga ggacgaggac gaacgcgagg agggcggggg cgagcgggag 13620
ccggagggcg cgcgggagac cgccggcagg cacgtggggt accagggggc cagggtcctt 13680
gaccccactt ccgggtttca cgtgaacccc gtggtggtgt tcgactttgc cagcctgtac 13740
cccagcatca tccaggccca caacctgtgc ttcagcacgc tctccctgag ggccgacgca 13800
gtggcgcacc tggaggcggg caaggactac ctggagatcg aggtgggggg gcgacggctg 13860
ttcttcgtca aggctcacgt gcgagagagc ctcctcagca tcctcctgcg ggactggctc 13920
gccatgcgaa agcagatccg ctcgcggatt ccccagagca gccccgagga ggccgtgctc 13980
ctggacaagc agcaggccgc catcaaggtc gtgtgtaact cggtgtacgg gttcacggga 14040
gtgcagcacg gactcctgcc gtgcctgcac gttgccgcga cggtgacgac catcggccgc 14100
gagatgctgc tcgcgacccg cgagtacgtc cacgcgcgct gggcggcctt cgaacagctc 14160
ctggccgatt tcccggaggc ggccgacatg cgcgcccccg ggccctattc catgcgcatc 14220
atctacgggg acacggactc catctttgtg ctgtgccgcg gcctcacggc cgccgggctg 14280
acggccgtgg gcgacaagat ggcgagccac atctcgcgcg cgctgtttct gccccccatc 14340
aaactcgagt gcgaaaagac gttcaccaag ctgctgctga tcgccaagaa aaagtacatc 14400
ggcgtcatct acgggggtaa gatgctcatc aagggcgtgg atctggtgcg caaaaacaac 14460
tgcgcgttta tcaaccgcac ctccagggcc ctggtcgacc tgctgtttta cgacgatacc 14520
gtctccggag ccgccgcggc gttagccgag cgccccgcgg aggagtggct ggcgcgaccc 14580
ctgcccgagg gactgcaggc gttcggggcc gtcctcgtag acgcccatcg gcgcatcacc 14640
gacccggaga gggacatcca ggactttgtc ctcaccgccg aactgagcag acacccgcgc 14700
gcgtacacca acaagcgcct ggcccacctg acggtgtatt acaagctcat ggcccgccgc 14760
gcgcaggtcc cgtccatcaa ggaccggatc ccgtacgtga tcgtggccca gacccgcgag 14820
gtagaggaga cggtcgcgcg gctggccgcc ctccgcgagc tagacgccgc cgccccaggg 14880
gacgagcccg ccccccccgc ggccctgccc tccccggcca agcgcccccg ggagacgccg 14940
tcgcctgccg accccccggg aggcgcgtcc aagccccgca agctgctggt gtccgagctg 15000
gccgaggatc ccgcatacgc cattgcccac ggcgtcgccc tgaacacgga ctattacttc 15060
tcccacctgt tgggggcggc gtgcgtgaca ttcaaggccc tgtttgggaa taacgccaag 15120
atcaccgaga gtctgttaaa aaggtttatt cccgaagtgt ggcacccccc ggacgacgtg 15180
gccgcgcggc tccggaccgc agggttcggg gcggtgggtg ccggcgctac ggcggaggaa 15240
actcgtcgaa tgttgcatag agcctttgat actctagcag aattcggcag tggagcaaca 15300
aacttctctc tgctgaaaca agccggagat gtcgaagaga atcctggacc gacggattcc 15360
cctggcggtg tggcccccgc ctcccccgtg gaggacgcgt cggacgcgtc cctcgggcag 15420
ccggaggagg gggcgccctg ccaggtggtc ctgcagggcg ccgaacttaa tggaatccta 15480
caggcgtttg ccccgctgcg cacgagcctt ctggactcgc ttctggttat gggcgaccgg 15540
ggcatcctta tccataacac gatctttggg gagcaggtgt tcctgcccct ggaacactcg 15600
caattcagtc ggtatcgctg gcgcggaccc acggcggcgt tcctgtctct cgtggaccag 15660
aagcgctccc tcctgagcgt gtttcgcgcc aaccagtacc cggacctacg tcgggtggag 15720
ttggcgatca cgggccaggc cccgtttcgc acgctggttc agcgcatatg gacgacgacg 15780
tccgacggcg aggccgttga gctagccagc gagacgctga tgaagcgcga actgacgagc 15840
tttgtggtgc tggttcccca gggaaccccc gacgttcagt tgcgcctgac gaggccgcag 15900
ctcaccaagg tccttaacgc gaccggggcc gatagtgcca cgcccaccac gttcgagctc 15960
ggggttaacg gcaaattttc cgtgttcacc acgagtacct gcgtcacctt tgctgcccgc 16020
gaggagggcg tgtcgtccag caccagcacc caggtccaga tcctgtccaa cgcgctcacc 16080
aaggcgggcc aggccgccgc gaacgccaag acggtgtacg gggaaaatac ccatcgcacc 16140
ttctctgtgg tcgtcgacga ttgcagcatg cgggcggtgc tccggcgact gcaggtcggc 16200
gggggcaccc tcaagttctt cctcacgacc cccgtcccca gtctgtgcgt caccgccacc 16260
ggtcccaacg cggtatcggc ggtatttctc ctgaaacccc agaagatttg cctggactgg 16320
ctgggtcata gccaggggtc tccttcagcc gggagctcgg cctcccgggc ctctgggagc 16380
gagccaacag acagccagga ctccgcgtcg gacgcggtca gccacggcga tccggaagac 16440
ctcgatggcg ctgcccgggc gggagaggcg ggggccttgc atgcctgtcc gatgccgtcg 16500
tcgaccacgc gggtcactcc cacgaccaag cgggggcgct cggggggcga ggatgcgcgc 16560
gcggacacgg ccctaaagaa acctaagacg gggtcgccca ccgcaccccc gcccgcagat 16620
ccagtccccc tggacacgga ggacgactcc gatgcggcgg acgggacggc ggcccgtccc 16680
gccgctccag acgcccggag cggaagccgt tacgcgtgtt actttcgcga cctcccgacc 16740
ggagaagcaa gccccggcgc cttctccgcc ttccgggggg gcccccaaac cccgtatggt 16800
tttggattcc cctgatagga tccgactgca ggtagctgtg ccttctagtt gccagccatc 16860
tgttgtttgc ccctcccccg tgccttcctt gaccctggaa ggtgccactc ccactgtcct 16920
ttcctaataa aatgaggaaa ttgcatcgca ttgtctgagt aggtgtcatt ctattctggg 16980
gggtggggtg gggcaggaca gcaaggggga ggattgggaa gacaatagca ggcatgctgg 17040
ggatgcggtg ggctctatgg gttttatggt gcactctcag tacaatctgc tctgatgccg 17100
catagttaag ccagccccga cacccgccaa cacccgctga cgcgccctga cgggcttgtc 17160
tgctcccggc atccgcttac agacaagctg tgaccgtctc cgggagctgc atgtgtcaga 17220
ggttttcacc gtcatcaccg aaacgcgcga 17250
<210> 100
<211> 17250
<212> DNA
<213> Artificial Sequence
<220>
<223> Description of Artificial Sequence: Synthetic polynucleotide
<400> 100
tgcagctctg gcccgtgtct caaaatctct gatgttacat tgcacaagat aaaaatatat 60
catcatgaac aataaaactg tctgcttaca taaacagtaa tacaaggggt gttatgagcc 120
atattcaacg ggaaacgtcg aggccgcgat taaattccaa catggatgct gatttatatg 180
ggtataaatg ggctcgcgat aatgtcgggc aatcaggtgc gacaatctat cgcttgtatg 240
ggaagcccga tgcgccagag ttgtttctga aacatggcaa aggtagcgtt gccaatgatg 300
ttacagatga gatggtcaga ctaaactggc tgacggaatt tatgcctctt ccgaccatca 360
agcattttat ccgtactcct gatgatgcat ggttactcac cactgcgatc cccggaaaaa 420
cagcattcca ggtattagaa gaatatcctg attcaggtga aaatattgtt gatgcgctgg 480
cagtgttcct gcgccggttg cattcgattc ctgtttgtaa ttgtcctttt aacagcgatc 540
gcgtatttcg tctcgctcag gcgcaatcac gaatgaataa cggtttggtt gatgcgagtg 600
attttgatga cgagcgtaat ggctggcctg ttgaacaagt ctggaaagaa atgcataaac 660
ttttgccatt ctcaccggat tcagtcgtca ctcatggtga tttctcactt gataacctta 720
tttttgacga ggggaaatta ataggttgta ttgatgttgg acgagtcgga atcgcagacc 780
gataccagga tcttgccatc ctatggaact gcctcggtga gttttctcct tcattacaga 840
aacggctttt tcaaaaatat ggtattgata atcctgatat gaataaattg cagtttcatt 900
tgatgctcga tgagtttttc taatcagaat tggttaattg gttgtaacat tattcagatt 960
gggcttgatt taaaacttca tttttaattt aaaaggatct aggtgaagat cctttttgat 1020
aatctcatga ccaaaatccc ttaacgtgag ttttcgttcc actgagcgtc agaccccgta 1080
gaaaagatca aaggatcttc ttgagatcct ttttttctgc gcgtaatctg ctgcttgcaa 1140
acaaaaaaac caccgctacc agcggtggtt tgtttgccgg atcaagagct accaactctt 1200
tttccgaagg taactggctt cagcagagcg cagataccaa atactgttct tctagtgtag 1260
ccgtagttag gccaccactt caagaactct gtagcaccgc ctacatacct cgctctgcta 1320
atcctgttac cagtggctgc tgccagtggc gataagtcgt gtcttaccgg gttggactca 1380
agacgatagt taccggataa ggcgcagcgg tcgggctgaa cggggggttc gtgcacacag 1440
cccagcttgg agcgaacgac ctacaccgaa ctgagatacc tacagcgtga gctatgagaa 1500
agcgccacgc ttcccgaagg gagaaaggcg gacaggtatc cggtaagcgg cagggtcgga 1560
acaggagagc gcacgaggga gcttccaggg ggaaacgcct ggtatcttta tagtcctgtc 1620
gggtttcgcc acctctgact tgagcgtcga tttttgtgat gctcgtcagg ggggcggagc 1680
ctatggaaaa acgccagcaa cgcggccttt ttacggttcc tggccttttg ctggcctttt 1740
gctcacatgt tctttcctgc gttatcccct gattctgtgg ataaccgtat taccgccttt 1800
gagtgagctg ataccgctcg ccgcagccga acgaccgagc gcagcgagtc agtgagcgag 1860
gaagcggaag agcgcccaat acgcaaaccg cctctccccg cgcgttggcc gattcattaa 1920
tgcagctggc acgacaggtt tcccgactgg aaagcgggca gtgagcgcaa cgcaattaat 1980
gtgagttagc tcactcatta ggcaccccag gctttacact ttatgcttcc ggctcgtatg 2040
ttgtgtggaa ttgtgagcgg ataacaattt cacacaggaa acagctatga ccatgattac 2100
accaagcttg catgcaggcc tatccgtaga tgtacctgga catccaggtg atgccggcgg 2160
cggtggtgga ggcgcgcgga aagtcgcgga cgcggttcca gatgttgcgc agcggcaaaa 2220
agtgctccat ggtcgggacg ctctggccgg tgaggcgtgc gcagtcgttg acgctctaga 2280
ccgtgcaaaa ggagagcctg taagcgggca ctcttccgtg gtctggtgga taaattcgca 2340
agggtatcat ggcggacgac cggggttcga accccggatc cggccgtccg ccgtgatcca 2400
tgcggttacc gcccgcgtgt cgaacccagg tgtgcgacgt cagacaacgg gggagcgctc 2460
cttttggctt ccttccaggc gcggcggctg ctgcgctagc ttttttggcc actggccgcg 2520
cgcggcgtaa gcggttaggc tggaaagcga aagcattaag tggctcgctc cctgtagccg 2580
gagggttatt ttccaagggt tgagtcgcag gacccccggt tcgagtctcg ggccggccgg 2640
actgcggcga acgggggttt gcctccccgt catgcaagac cccgcttgca aattcctccg 2700
gaaacaggga cgagcccctt ttttgctttt cccagatgca tccggtgctg cggcagatgc 2760
gcccccctcc tcagcagcgg caagagcaag agcagcggca gacatgcagg gcaccctccc 2820
cttctcctac cgcgtcagga ggggcaacat cgatccagac atgataagat acattgatga 2880
gtttggacaa accacaacta gaatgcagtg aaaaaaatgc tttatttgtg aaatttgtga 2940
tgctattgct ttatttgtaa ccattataag ctgcaataaa caagtttgta cactctcggg 3000
tgattattta cccccaccct tgccgtctgc gccgtttaaa aatcaaaggg gttctgccgc 3060
gcatcgctat gcgccactgg cagggacacg ttgcgatact ggtgtttagt gctccactta 3120
aactcaggca caaccatccg cggcagctcg gtgaagtttt cactccacag gctgcgcacc 3180
atcaccaacg cgtttagcag gtcgggcgcc gatatcttga agtcgcagtt ggggcctccg 3240
ccctgcgcgc gcgagttgcg atacacaggg ttgcagcact ggaacactat cagcgccggg 3300
tggtgcacgc tggccagcac gctcttgtcg gagatcagat ccgcgtccag gtcctccgcg 3360
ttgctcaggg cgaacggagt caactttggt agctgccttc ccaaaaaggg cgcgtgccca 3420
ggctttgagt tgcactcgca ccgtagtggc atcaaaaggt gaccgtgccc ggtctgggcg 3480
ttaggataca gcgcctgcat aaaagccttg atctgcttaa aagccacctg agcctttgcg 3540
ccttcagaga agaacatgcc gcaagacttg ccggaaaact gattggccgg acaggccgcg 3600
tcgtgcacgc agcaccttgc gtcggtgttg gagatctgca ccacatttcg gccccaccgg 3660
ttcttcacga tcttggcctt gctagactgc tccttcagcg cgcgctgccc gttttcgctc 3720
gtcacatcca tttcaatcac gtgctcctta tttatcataa tgcttccgtg tagacactta 3780
agctcgcctt cgatctcagc gcagcggtgc agccacaacg cgcagcccgt gggctcgtga 3840
tgcttgtagg tcacctctgc aaacgactgc aggtacgcct gcaggaatcg ccccatcatc 3900
gtcacaaagg tcttgttgct ggtgaaggtc agctgcaacc cgcggtgctc ctcgttcagc 3960
caggtcttgc atacggccgc cagagcttcc acttggtcag gcagtagttt gaagttcgcc 4020
tttagatcgt tatccacgtg gtacttgtcc atcagcgcgc gcgcagcctc catgcccttc 4080
tcccacgcag acacgatcgg cacactcagc gggttcatca ccgtaatttc actttccgct 4140
tcgctgggct cttcctcttc ctcttgcgtc cgcataccac gcgccactgg gtcgtcttca 4200
ttcagccgcc gcactgtgcg cttacctcct ttgccatgct tgattagcac cggtgggttg 4260
ctgaaaccca ccatttgtag cgccacatct tctctttctt cctcgctgtc cacgattacc 4320
tctggtgatg gcgggcgctc gggcttggga gaagggcgct tctttttctt cttgggcgca 4380
atggccaaat ccgccgccga ggtcgatggc cgcgggctgg gtgtgcgcgg caccagcgcg 4440
tcttgtgatg agtcttcctc gtcctcggac tcgatacgcc gcctcatccg cttttttggg 4500
ggcgcccggg gaggcggcgg cgacggggac ggggacgaca cgtcctccat ggttggggga 4560
cgtcgcgccg caccgcgtcc gcgctcgggg gtggtttcgc gctgctcctc ttcccgactg 4620
gccatttcct tctcctatag gcagaaaaag atcatggagt cagtcgagaa gaaggacagc 4680
ctaaccgccc cctctgagtt cgccaccacc gcctccaccg atgccgccaa cgcgcctacc 4740
accttccccg tcgaggcacc cccgcttgag gaggaggaag tgattatcga gcaggaccca 4800
ggttttgtaa gcgaagacga cgaggaccgc tcagtaccaa cagaggataa aaagcaagac 4860
caggacaacg cagaggcaaa cgaggaacaa gtcgggcggg gggacgaaag gcatggcgac 4920
tacctagatg tgggagacga cgtgctgttg aagcatctgc agcgccagtg cgccattatc 4980
tgcgacgcgt tgcaagagcg cagcgatgtg cccctcgcca tagcggatgt cagccttgcc 5040
tacgaacgcc acctattctc accgcgcgta ccccccaaac gccaagaaaa cggcacatgc 5100
gagcccaacc cgcgcctcaa cttctacccc gtatttgccg tgccagaggt gcttgccacc 5160
tatcacatct ttttccaaaa ctgcaagata cccctatcct gccgtgccaa ccgcagccga 5220
gcggacaagc agctggcctt gcggcagggc gctgtcatac ctgatatcgc ctcgctcaac 5280
gaagtgccaa aaatctttga gggtcttgga cgcgacgaga agcgcgcggc aaacgctctg 5340
caacaggaaa acagcgaaaa tgaaagtcac tctggagtgt tggtggaact cgagggtgac 5400
aacgcgcgcc tagccgtact aaaacgcagc atcgaggtca cccactttgc ctacccggca 5460
cttaacctac cccccaaggt catgagcaca gtcatgagtg agctgatcgt gcgccgtgcg 5520
cagcccctgg agagggatgc aaatttgcaa gaacaaacag aggagggcct acccgcagtt 5580
ggcgacgagc agctagcgcg ctggcttcaa acgcgcgagc ctgccgactt ggaggagcga 5640
cgcaaactaa tgatggccgc agtgctcgtt accgtggagc ttgagtgcat gcagcggttc 5700
tttgctgacc cggagatgca gcgcaagcta gaggaaacat tgcactacac ctttcgacag 5760
ggctacgtac gccaggcctg caagatctcc aacgtggagc tctgcaacct ggtctcctac 5820
cttggaattt tgcacgaaaa ccgccttggg caaaacgtgc ttcattccac gctcaagggc 5880
gaggcgcgcc gcgactacgt ccgcgactgc gtttacttat ttctatgcta cacctggcag 5940
acggccatgg gcgtttggca gcagtgcttg gaggagtgca acctcaagga gctgcagaaa 6000
ctgctaaagc aaaacttgaa ggacctatgg acggccttca acgagcgctc cgtggccgcg 6060
cacctggcgg acatcatttt ccccgaacgc ctgcttaaaa ccctgcaaca gggtctgcca 6120
gacttcacca gtcaaagcat gttgcagaac tttaggaact ttatcctaga gcgctcagga 6180
atcttgcccg ccacctgctg tgcacttcct agcgactttg tgcccattaa gtaccgcgaa 6240
tgccctccgc cgctttgggg ccactgctac cttctgcagc tagccaacta ccttgcctac 6300
cactctgaca taatggaaga cgtgagcggt gacggtctac tggagtgtca ctgtcgctgc 6360
aacctatgca ccccgcaccg ctccctggtt tgcaattcgc agctgcttaa cgaaagtcaa 6420
attatcggta cctttgagct gcagggtccc tcgcctgacg aaaagtccgc ggctccgggg 6480
ttgaaactca ctccggggct gtggacgtcg gcttaccttc gcaaatttgt acctgaggac 6540
taccacgccc acgagattag gttctacgaa gaccaatccc gcccgcctaa tgcggagctt 6600
accgcctgcg tcattaccca gggccacatt cttggccaat tgcaagccat caacaaagcc 6660
cgccaagagt ttctgctacg aaagggacgg ggggtttact tggaccccca gtccggcgag 6720
gagctcaacc caatcccccc gccgccgcag ccctatcagc agcagccgcg ggcccttgct 6780
tcccaggatg gcacccaaaa agaagctgca gctgccgccg ccacccacgg acgaggagga 6840
atactgggac agtcaggcag aggaggtttt ggacgaggag gaggaggaca tgatggaaga 6900
ctgggagagc ctagacgagg aagcttccga ggtcgaagag gtgtcagacg aaacaccgtc 6960
accctcggtc gcattcccct cgccggcgcc ccagaaatcg gcaaccggtt ccagcatggc 7020
tacaacctcc gctcctcagg cgccgccggc actgcccgtt cgccgaccca accgtagatg 7080
ggacaccact ggaaccaggg ccggtaagtc caagcagccg ccgccgttag cccaagagca 7140
acaacagcgc caaggctacc gctcatggcg cgggcacaag aacgccatag ttgcttgctt 7200
gcaagactgt gggggcaaca tctccttcgc ccgccgcttt cttctctacc atcacggcgt 7260
ggccttcccc cgtaacatcc tgcattacta ccgtcatctc tacagcccat actgcaccgg 7320
cggcagcggc agcaacagca gcggccacac agaagcaaag gcgaccggat agcaagactc 7380
tgacaaagcc caagaaatcc acagcggcgg cagcagcagg aggaggagcg ctgcgtctgg 7440
cgcccaacga acccgtatcg acccgcgagc ttagaaacag gatttttccc actctgtatg 7500
ctatatttca acagagcagg ggccaagaac aagagctgaa aataaaaaac aggtctctgc 7560
gatccctcac ccgcagctgc ctgtatcaca aaagcgaaga tcagcttcgg cgcacgctgg 7620
aagacgcgga ggctctcttc agtaaatact gcgcgctgac tcttaaggac tagtttcgcg 7680
ccctttctca aatttaagcg cgaaaactac gtcatctcca gcggccacac ccggcgccag 7740
cacctgttgt cagcgccatt atgagcaagg aaattcccac gccctacatg tggagttacc 7800
agccacaaat gggacttgcg gctggagctg cccaagacta ctcaacccga ataaactaca 7860
tgagcgcggg gcggccgccg tttgtgttat gtttcaacgt gtttattttt caattgcaga 7920
aaatttcaag tcatttttca ttcagtagta tagccccacc accacatagc ttatacagat 7980
caccgtacct taatcaaact cacagaaccc tagtattcaa cctgccacct ccctcccaac 8040
acacagagta cacagtcctt tctccccggc tggccttaaa aagcatcata tcatgggtaa 8100
cagacatatt cttaggtgtt atattccaca cggtttcctg tcgagccaaa cgctcatcag 8160
tgatattaat aaactccccg ggcagctcac ttaagttcat gtcgctgtcc agctgctgag 8220
ccacaggctg ctgtccaact tgcggttgct taacgggcgg cgaaggagaa gtccacgcct 8280
acatgggggt agagtcataa tcgtgcatca ggatagggcg gtggtgctgc agcagcgcgc 8340
gaataaactg ctgccgccgc cgctccgtcc tgcaggaata caacatggca gtggtctcct 8400
cagcgatgat tcgcaccgcc cgcagcataa ggcgccttgt cctccgggca cagcagcgca 8460
ccctgatctc acttaaatca gcacagtaac tgcagcacag caccacaata ttgttcaaaa 8520
tcccacagtg caaggcgctg tatccaaagc tcatggcggg gaccacagaa cccacgtggc 8580
catcatacca caagcgcagg tagattaagt ggcgacccct cataaacacg ctggacataa 8640
acattacctc ttttggcatg ttgtaattca ccacctcccg gtaccatata aacctctgat 8700
taaacatggc gccatccacc accatcctaa accagctggc caaaacctgc ccgccggcta 8760
tacactgcag ggaaccggga ctggaacaat gacagtggag agcccaggac tcgtaaccat 8820
ggatcatcat gctcgtcatg atatcaatgt tggcacaaca caggcacacg tgcatacact 8880
tcctcaggat tacaagctcc tcccgcgtta gaaccatatc ccagggaaca acccattcct 8940
gaatcagcgt aaatcccaca ctgcagggaa gacctcgcac gtaactcacg ttgtgcattg 9000
tcaaagtgtt acattcgggc agcagcggat gatcctccag tatggtagcg cgggtttctg 9060
tctcaaaagg aggtagacga tccctactgt acggagtgcg ccgagacaac cgagatcgtg 9120
ttggtcgtag tgtcatgcca aatggaacgc cggacgtagt catatttcct gaagcaaaac 9180
caggtgcggg cgtgacaaac agatctgcgt ctccggtctc gccgcttaga tcgctctgtg 9240
tagtagttgt agtatatcca ctctctcaaa gcatccaggc gccccctggc ttcgggttct 9300
atgtaaactc cttcatgcgc cgctgccctg ataacatcca ccaccgcaga ataagccaca 9360
cccagccaac ctacacattc gttctgcgag tcacacacgg gaggagcggg aagagctgga 9420
agaaccatgt tttttttttt attccaaaag attatccaaa acctcaaaat gaagatctat 9480
taagtgaacg cgctcccctc cggtggcgtg gtcaaactct acagccaaag aacagataat 9540
ggcatttgta agatgttgca caatggcttc caaaaggcaa acggccctca cgtccaagtg 9600
gacgtaaagg ctaaaccctt cagggtgaat ctcctctata aacattccag caccttcaac 9660
catgcccaaa taattctcat ctcgccacct tctcaatata tctctaagca aatcccgaat 9720
attaagtccg gccattgtaa aaatctgctc cagagcgccc tccaccttca gcctcaagca 9780
gcgaatcatg attgcaaaaa ttcaggttcc tcacagacct gtataagatt caaaagcgga 9840
acattaacaa aaataccgcg atcccgtagg tcccttcgca gggccagctg aacataatcg 9900
tgcaggtctg cacggaccag cgcggccact tccccgccag gaaccatgac aaaagaaccc 9960
acactgatta tgacacgcat actcggagct atgctaacca gcgtagcccc gatgtaagct 10020
tgttgcatgg gcggcgatat aaaatgcaag gtgctgctca aaaaatcagg caaagcctcg 10080
cgcaaaaaag aaagcacatc gtagtcatgc tcatgcagat aaaggcaggt aagctccgga 10140
accaccacag aaaaagacac catttttctc tcaaacatgt ctgcgggttt ctgcataaac 10200
acaaaataaa ataacaaaaa aacatttaaa cattagaagc ctgtcttaca acaggaaaaa 10260
caacccttat aagcataaga cggactacgg ccatgccggc gtgaccgtaa aaaaactggt 10320
caccgtgatt aaaaagcacc accgacagct cctcggtcat gtccggagtc ataatgtaag 10380
actcggtaaa cacatcaggt tgattcacat cggtcagtgc taaaaagcga ccgaaatagc 10440
ccgggggaat acatacccgc aggcgtagag acaacattac agcccccata ggaggtataa 10500
caaaattaat aggagagaaa aacacataaa cacctgaaaa accctcctgc ctaggcaaaa 10560
tagcaccctc ccgctccaga acaacataca gcgcttccac agcggcagcc ataacagtca 10620
gccttaccag taaaaaagaa aacctattaa aaaaacacca ctcgacacgg caccagctca 10680
atcagtcaca gtgtaaaaaa gggccaagtg cagagcgagt atatatagga ctaaaaaatg 10740
acgtaacggt taaagtccac aaaaaacacc cagaaaaccg cacgcgaacc tacgcccaga 10800
aacgaaagcc aaaaaaccca caacttcctc aaatcgtcac ttccgttttc ccacgttacg 10860
tcacttccca ttttaagaaa actacaattc ccaacacata caagttactc cgcccttaat 10920
taaatcggat ccgatatcta gatgtattcg cgaggtaccg agctcgaatt ctctggccgt 10980
cgttttacaa cgtcgtgact gggaaaaccc tggcgttacc caacttaatc gccttgcagc 11040
acatccccct ttcgccagct ggcgtaatag cgaagaggcc cgcaccgatc gcccttccca 11100
acagttgcgc agcctgaatg gcgaatggcg cctgatgcgg tattttctcc ttacgcatct 11160
gtgcggtatt tcacaccgca taaaacccat agagcccacc gcatccccag catgcctgct 11220
attgtcttcc caatcctccc ccttgctgtc ctgccccacc ccacccccca gaatagaatg 11280
acacctactc agacaatgcg atgcaatttc ctcattttat taggaaagga cagtgggagt 11340
ggcaccttcc agggtcaagg aaggcacggg ggaggggcaa acaacagatg gctggcaact 11400
agaaggcaca gctacctgca gtcggatcct atcaggggaa tccaaaacca tacggggttt 11460
gggggccccc ccggaaggcg gagaaggcgc cggggcttgc ttctccggtc gggaggtcgc 11520
gaaagtaaca cgcgtaacgg cttccgctcc gggcgtctgg agcggcggga cgggccgccg 11580
tcccgtccgc cgcatcggag tcgtcctccg tgtccagggg gactggatct gcgggcgggg 11640
gtgcggtggg cgaccccgtc ttaggtttct ttagggccgt gtccgcgcgc gcatcctcgc 11700
cccccgagcg cccccgcttg gtcgtgggag tgacccgcgt ggtcgacgac ggcatcggac 11760
aggcatgcaa ggcccccgcc tctcccgccc gggcagcgcc atcgaggtct tccggatcgc 11820
cgtggctgac cgcgtccgac gcggagtcct ggctgtctgt tggctcgctc ccagaggccc 11880
gggaggccga gctcccggct gaaggagacc cctggctatg acccagccag tccaggcaaa 11940
tcttctgggg tttcaggaga aataccgccg ataccgcgtt gggaccggtg gcggtgacgc 12000
acagactggg gacgggggtc gtgaggaaga acttgagggt gcccccgccg acctgcagtc 12060
gccggagcac cgcccgcatg ctgcaatcgt cgacgaccac agagaaggtg cgatgggtat 12120
tttccccgta caccgtcttg gcgttcgcgg cggcctggcc cgccttggtg agcgcgttgg 12180
acaggatctg gacctgggtg ctggtgctgg acgacacgcc ctcctcgcgg gcagcaaagg 12240
tgacgcaggt actcgtggtg aacacggaaa atttgccgtt aaccccgagc tcgaacgtgg 12300
tgggcgtggc actatcggcc ccggtcgcgt taaggacctt ggtgagctgc ggcctcgtca 12360
ggcgcaactg aacgtcgggg gttccctggg gaaccagcac cacaaagctc gtcagttcgc 12420
gcttcatcag cgtctcgctg gctagctcaa cggcctcgcc gtcggacgtc gtcgtccata 12480
tgcgctgaac cagcgtgcga aacggggcct ggcccgtgat cgccaactcc acccgacgta 12540
ggtccgggta ctggttggcg cgaaacacgc tcaggaggga gcgcttctgg tccacgagag 12600
acaggaacgc cgccgtgggt ccgcgccagc gataccgact gaattgcgag tgttccaggg 12660
gcaggaacac ctgctcccca aagatcgtgt tatggataag gatgccccgg tcgcccataa 12720
ccagaagcga gtccagaagg ctcgtgcgca gcggggcaaa cgcctgtagg attccattaa 12780
gttcggcgcc ctgcaggacc acctggcagg gcgccccctc ctccggctgc ccgagggacg 12840
cgtccgacgc gtcctccacg ggggaggcgg gggccacacc gccaggggaa tccgtcggtc 12900
caggattctc ttcgacatct ccggcttgtt tcagcagaga gaagtttgtt gctccactgc 12960
cgaattctgc tagagtatca aaggctctat gcaacattcg acgagtttcc tccgccgtag 13020
cgccggcacc caccgccccg aaccctgcgg tccggagccg cgcggccacg tcgtccgggg 13080
ggtgccacac ttcgggaata aaccttttta acagactctc ggtgatcttg gcgttattcc 13140
caaacagggc cttgaatgtc acgcacgccg cccccaacag gtgggagaag taatagtccg 13200
tgttcagggc gacgccgtgg gcaatggcgt atgcgggatc ctcggccagc tcggacacca 13260
gcagcttgcg gggcttggac gcgcctcccg gggggtcggc aggcgacggc gtctcccggg 13320
ggcgcttggc cggggagggc agggccgcgg ggggggcggg ctcgtcccct ggggcggcgg 13380
cgtctagctc gcggagggcg gccagccgcg cgaccgtctc ctctacctcg cgggtctggg 13440
ccacgatcac gtacgggatc cggtccttga tggacgggac ctgcgcgcgg cgggccatga 13500
gcttgtaata caccgtcagg tgggccaggc gcttgttggt gtacgcgcgc gggtgtctgc 13560
tcagttcggc ggtgaggaca aagtcctgga tgtccctctc cgggtcggtg atgcgccgat 13620
gggcgtctac gaggacggcc ccgaacgcct gcagtccctc gggcaggggt cgcgccagcc 13680
actcctccgc ggggcgctcg gctaacgccg cggcggctcc ggagacggta tcgtcgtaaa 13740
acagcaggtc gaccagggcc ctggaggtgc ggttgataaa cgcgcagttg tttttgcgca 13800
ccagatccac gcccttgatg agcatcttac ccccgtagat gacgccgatg tactttttct 13860
tggcgatcag cagcagcttg gtgaacgtct tttcgcactc gagtttgatg gggggcagaa 13920
acagcgcgcg cgagatgtgg ctcgccatct tgtcgcccac ggccgtcagc ccggcggccg 13980
tgaggccgcg gcacagcaca aagatggagt ccgtgtcccc gtagatgatg cgcatggaat 14040
agggcccggg ggcgcgcatg tcggccgcct ccgggaaatc ggccaggagc tgttcgaagg 14100
ccgcccagcg cgcgtggacg tactcgcggg tcgcgagcag catctcgcgg ccgatggtcg 14160
tcaccgtcgc ggcaacgtgc aggcacggca ggagtccgtg ctgcactccc gtgaacccgt 14220
acaccgagtt acacacgacc ttgatggcgg cctgctgctt gtccaggagc acggcctcct 14280
cggggctgct ctggggaatc cgcgagcgga tctgctttcg catggcgagc cagtcccgca 14340
ggaggatgct gaggaggctc tctcgcacgt gagccttgac gaagaacagc cgtcgccccc 14400
ccacctcgat ctccaggtag tccttgcccg cctccaggtg cgccactgcg tcggccctca 14460
gggagagcgt gctgaagcac aggttgtggg cctggatgat gctggggtac aggctggcaa 14520
agtcgaacac caccacgggg ttcacgtgaa acccggaagt ggggtcaagg accctggccc 14580
cctggtaccc cacgtgcctg ccggcggtct cccgcgcgcc ctccggctcc cgctcgcccc 14640
cgccctcctc gcgttcgtcc tcgtcctccc cctcctcctc tggccgctcc tcgtcctccc 14700
gggctgcggc cggacgcttg ggcgcctccc ccccggcgcc cctaaatcgc ccctgggtgt 14760
ccggcagaat aaagcccttc tggtcggcca ggcgcagcag gcacgtaaag acgcggatct 14820
gctggccgtc gtagatggtg cgggtgatgt taatacccgc caagcgcgcg acggccgaga 14880
gctccagatg gggcaaaaac ttaaaaaaca gctggcccac cagcagggaa tcctgtatgc 14940
agtactcgcc gatcaccccg cgttgcgcgg gcccggcggc gtagtaggcg gggatgtcgc 15000
gatagctcag gtccttcttc ttgtccttca ggacggcttc ggccacggcg ttgagcttgt 15060
agctcgagag cttgatcttg tcggttataa tcccgtacat gtcgatgttc accatgccgt 15120
tcacctttat cttgctgcgc ttctggaagt ggctctggcc tatgtcccac acgcgaaaca 15180
cgccccggcc gttcatgcgg ccgtacccgt ccagggggac cttgtaaatg tccgtcagct 15240
tggccagcaa gaagggccag tcgaagttga tgatgttgta cccggtcacg aactcggggc 15300
cgtactgttt cacaagggtc atgaaggcca acagcatctc gaattcgctg tcgaattcca 15360
gaaccacggg cgtgggcagg cccctggccg ccagctcgtt caggtgggat tcggggaggt 15420
cgcaggaacc gagcgaaaac aggaggacgt gctccagggc ggtggtggac aggtcgtaga 15480
gcagacagga tatctggatg accaggtcct ccgggtgccc ggccaccgga aaggccagct 15540
cgtcctcccc ccccgccttg cattcgatat cgaagcacat gagcttgtat gccggtaggt 15600
cgctcatgcc cccctcgatg gccaggttgt ccgccgtaca gttaaactcg acgtcgctgg 15660
atgtcccgaa ggccatcggg gcccgcggct gggctagcgt gttgttccgg cccggtttga 15720
gacggtacca gccgaaggtg acgaacccgg ggttgtccag gatgaaccgg gtggtggcgt 15780
cgaccccacc ctcgtacttc ttgatggccg ggcagaagtt gtcgcacagg tacgacagca 15840
cgcgcccgct tcggacgtag acgcggtaaa acagagcggg gcgcgtctcg tagtagtaca 15900
cgtcggtgcg ctccaccacc tccgcctcga agtggtccgc ggagatgccg cggaacgacg 15960
cgcccgggga ctcgcgcagg gccgcggcca tgcgctcgca gagatctcgt ggggcgcggc 16020
attgtaggtg cctgtcgacc tcctccttgt tcatgtaaaa gtactgccgc gtgccgtaaa 16080
cgtgaacggc cacccggtgg ccttccggag tcaggcccag gagcgtgatg acggtccccg 16140
tcggtgtgat ggcgtccata aaccgcgcgt ggaactgggc cgcgcgcatg ccgtacgcgt 16200
gctccacgtt ctccaggatg tcgtacacgt gaaagacggt gacggtgggg ttgaaccccg 16260
ccggggcgtg gtccacgccg ccccacaggc gcgagcgccg cggccagaag ccgcccgacc 16320
cgacgcggag gacgtcgcgc tcgtcccccc cgcagtacac cttgggggcg cgcttgaggt 16380
gaccgtcgtg caccccggcg cgcttctccg ggggggcatc ctcgtccagc acccgcgggg 16440
cgatgaatcg aaattcatcg cattcgctat agtacgtatg gcgctgggtt ggcccggtcg 16500
gcttctgttg cgtcccgact ggggcgaggt aggggttgta aaagttttgc ctcaaacaag 16560
gcgggggtcc ccggctggct ccgcgagggc cggcgggcgc aaaaaacccg gacgccgccc 16620
tggccgccga ctttcctccg ggggacagcg ggccgccgcc accggaaaac atggtggctt 16680
taccaacagt accggtggat cgggcccgcg gtgcgccggc gtttgcaaaa gcctaggcct 16740
ccaaaaaagc ctcctcacta cttctggaat agctcagagg ccgaggcggc ctcggcctct 16800
gcataaataa aaaaaattag tcagccatgg ggcggagaat gggcggaact gggcggagtt 16860
aggggcggga tgggcggagt taggggcggg actatggttg ctgactaatt gagatgcatg 16920
ctttgcatac ttctgcctgc tggggagcct ggggactttc cacacctggt tgctgactaa 16980
ttgagatgca tgctttgcat acttctgcct gctggggagc ctggggactt tccacaccct 17040
aactgacaca catttaaatg aagatatggt gcactctcag tacaatctgc tctgatgccg 17100
catagttaag ccagccccga cacccgccaa cacccgctga cgcgccctga cgggcttgtc 17160
tgctcccggc atccgcttac agacaagctg tgaccgtctc cgggagctgc atgtgtcaga 17220
ggttttcacc gtcatcaccg aaacgcgcga 17250
<210> 101
<211> 17003
<212> DNA
<213> Artificial Sequence
<220>
<223> Description of Artificial Sequence: Synthetic
polynucleotide
<400> 101
tgcagctctg gcccgtgtct caaaatctct gatgttacat tgcacaagat aaaaatatat 60
catcatgaac aataaaactg tctgcttaca taaacagtaa tacaaggggt gttatgagcc 120
atattcaacg ggaaacgtcg aggccgcgat taaattccaa catggatgct gatttatatg 180
ggtataaatg ggctcgcgat aatgtcgggc aatcaggtgc gacaatctat cgcttgtatg 240
ggaagcccga tgcgccagag ttgtttctga aacatggcaa aggtagcgtt gccaatgatg 300
ttacagatga gatggtcaga ctaaactggc tgacggaatt tatgcctctt ccgaccatca 360
agcattttat ccgtactcct gatgatgcat ggttactcac cactgcgatc cccggaaaaa 420
cagcattcca ggtattagaa gaatatcctg attcaggtga aaatattgtt gatgcgctgg 480
cagtgttcct gcgccggttg cattcgattc ctgtttgtaa ttgtcctttt aacagcgatc 540
gcgtatttcg tctcgctcag gcgcaatcac gaatgaataa cggtttggtt gatgcgagtg 600
attttgatga cgagcgtaat ggctggcctg ttgaacaagt ctggaaagaa atgcataaac 660
ttttgccatt ctcaccggat tcagtcgtca ctcatggtga tttctcactt gataacctta 720
tttttgacga ggggaaatta ataggttgta ttgatgttgg acgagtcgga atcgcagacc 780
gataccagga tcttgccatc ctatggaact gcctcggtga gttttctcct tcattacaga 840
aacggctttt tcaaaaatat ggtattgata atcctgatat gaataaattg cagtttcatt 900
tgatgctcga tgagtttttc taatcagaat tggttaattg gttgtaacat tattcagatt 960
gggcttgatt taaaacttca tttttaattt aaaaggatct aggtgaagat cctttttgat 1020
aatctcatga ccaaaatccc ttaacgtgag ttttcgttcc actgagcgtc agaccccgta 1080
gaaaagatca aaggatcttc ttgagatcct ttttttctgc gcgtaatctg ctgcttgcaa 1140
acaaaaaaac caccgctacc agcggtggtt tgtttgccgg atcaagagct accaactctt 1200
tttccgaagg taactggctt cagcagagcg cagataccaa atactgttct tctagtgtag 1260
ccgtagttag gccaccactt caagaactct gtagcaccgc ctacatacct cgctctgcta 1320
atcctgttac cagtggctgc tgccagtggc gataagtcgt gtcttaccgg gttggactca 1380
agacgatagt taccggataa ggcgcagcgg tcgggctgaa cggggggttc gtgcacacag 1440
cccagcttgg agcgaacgac ctacaccgaa ctgagatacc tacagcgtga gctatgagaa 1500
agcgccacgc ttcccgaagg gagaaaggcg gacaggtatc cggtaagcgg cagggtcgga 1560
acaggagagc gcacgaggga gcttccaggg ggaaacgcct ggtatcttta tagtcctgtc 1620
gggtttcgcc acctctgact tgagcgtcga tttttgtgat gctcgtcagg ggggcggagc 1680
ctatggaaaa acgccagcaa cgcggccttt ttacggttcc tggccttttg ctggcctttt 1740
gctcacatgt tctttcctgc gttatcccct gattctgtgg ataaccgtat taccgccttt 1800
gagtgagctg ataccgctcg ccgcagccga acgaccgagc gcagcgagtc agtgagcgag 1860
gaagcggaag agcgcccaat acgcaaaccg cctctccccg cgcgttggcc gattcattaa 1920
tgcagctggc acgacaggtt tcccgactgg aaagcgggca gtgagcgcaa cgcaattaat 1980
gtgagttagc tcactcatta ggcaccccag gctttacact ttatgcttcc ggctcgtatg 2040
ttgtgtggaa ttgtgagcgg ataacaattt cacacaggaa acagctatga ccatgattac 2100
accaagcttg catgcaggcc tatccgtaga tgtacctgga catccaggtg atgccggcgg 2160
cggtggtgga ggcgcgcgga aagtcgcgga cgcggttcca gatgttgcgc agcggcaaaa 2220
agtgctccat ggtcgggacg ctctggccgg tgaggcgtgc gcagtcgttg acgctctaga 2280
ccgtgcaaaa ggagagcctg taagcgggca ctcttccgtg gtctggtgga taaattcgca 2340
agggtatcat ggcggacgac cggggttcga accccggatc cggccgtccg ccgtgatcca 2400
tgcggttacc gcccgcgtgt cgaacccagg tgtgcgacgt cagacaacgg gggagcgctc 2460
cttttggctt ccttccaggc gcggcggctg ctgcgctagc ttttttggcc actggccgcg 2520
cgcggcgtaa gcggttaggc tggaaagcga aagcattaag tggctcgctc cctgtagccg 2580
gagggttatt ttccaagggt tgagtcgcag gacccccggt tcgagtctcg ggccggccgg 2640
actgcggcga acgggggttt gcctccccgt catgcaagac cccgcttgca aattcctccg 2700
gaaacaggga cgagcccctt ttttgctttt cccagatgca tccggtgctg cggcagatgc 2760
gcccccctcc tcagcagcgg caagagcaag agcagcggca gacatgcagg gcaccctccc 2820
cttctcctac cgcgtcagga ggggcaacat cgatccagac atgataagat acattgatga 2880
gtttggacaa accacaacta gaatgcagtg aaaaaaatgc tttatttgtg aaatttgtga 2940
tgctattgct ttatttgtaa ccattataag ctgcaataaa caagtttgta cactctcggg 3000
tgattattta cccccaccct tgccgtctgc gccgtttaaa aatcaaaggg gttctgccgc 3060
gcatcgctat gcgccactgg cagggacacg ttgcgatact ggtgtttagt gctccactta 3120
aactcaggca caaccatccg cggcagctcg gtgaagtttt cactccacag gctgcgcacc 3180
atcaccaacg cgtttagcag gtcgggcgcc gatatcttga agtcgcagtt ggggcctccg 3240
ccctgcgcgc gcgagttgcg atacacaggg ttgcagcact ggaacactat cagcgccggg 3300
tggtgcacgc tggccagcac gctcttgtcg gagatcagat ccgcgtccag gtcctccgcg 3360
ttgctcaggg cgaacggagt caactttggt agctgccttc ccaaaaaggg cgcgtgccca 3420
ggctttgagt tgcactcgca ccgtagtggc atcaaaaggt gaccgtgccc ggtctgggcg 3480
ttaggataca gcgcctgcat aaaagccttg atctgcttaa aagccacctg agcctttgcg 3540
ccttcagaga agaacatgcc gcaagacttg ccggaaaact gattggccgg acaggccgcg 3600
tcgtgcacgc agcaccttgc gtcggtgttg gagatctgca ccacatttcg gccccaccgg 3660
ttcttcacga tcttggcctt gctagactgc tccttcagcg cgcgctgccc gttttcgctc 3720
gtcacatcca tttcaatcac gtgctcctta tttatcataa tgcttccgtg tagacactta 3780
agctcgcctt cgatctcagc gcagcggtgc agccacaacg cgcagcccgt gggctcgtga 3840
tgcttgtagg tcacctctgc aaacgactgc aggtacgcct gcaggaatcg ccccatcatc 3900
gtcacaaagg tcttgttgct ggtgaaggtc agctgcaacc cgcggtgctc ctcgttcagc 3960
caggtcttgc atacggccgc cagagcttcc acttggtcag gcagtagttt gaagttcgcc 4020
tttagatcgt tatccacgtg gtacttgtcc atcagcgcgc gcgcagcctc catgcccttc 4080
tcccacgcag acacgatcgg cacactcagc gggttcatca ccgtaatttc actttccgct 4140
tcgctgggct cttcctcttc ctcttgcgtc cgcataccac gcgccactgg gtcgtcttca 4200
ttcagccgcc gcactgtgcg cttacctcct ttgccatgct tgattagcac cggtgggttg 4260
ctgaaaccca ccatttgtag cgccacatct tctctttctt cctcgctgtc cacgattacc 4320
tctggtgatg gcgggcgctc gggcttggga gaagggcgct tctttttctt cttgggcgca 4380
atggccaaat ccgccgccga ggtcgatggc cgcgggctgg gtgtgcgcgg caccagcgcg 4440
tcttgtgatg agtcttcctc gtcctcggac tcgatacgcc gcctcatccg cttttttggg 4500
ggcgcccggg gaggcggcgg cgacggggac ggggacgaca cgtcctccat ggttggggga 4560
cgtcgcgccg caccgcgtcc gcgctcgggg gtggtttcgc gctgctcctc ttcccgactg 4620
gccatttcct tctcctatag gcagaaaaag atcatggagt cagtcgagaa gaaggacagc 4680
ctaaccgccc cctctgagtt cgccaccacc gcctccaccg atgccgccaa cgcgcctacc 4740
accttccccg tcgaggcacc cccgcttgag gaggaggaag tgattatcga gcaggaccca 4800
ggttttgtaa gcgaagacga cgaggaccgc tcagtaccaa cagaggataa aaagcaagac 4860
caggacaacg cagaggcaaa cgaggaacaa gtcgggcggg gggacgaaag gcatggcgac 4920
tacctagatg tgggagacga cgtgctgttg aagcatctgc agcgccagtg cgccattatc 4980
tgcgacgcgt tgcaagagcg cagcgatgtg cccctcgcca tagcggatgt cagccttgcc 5040
tacgaacgcc acctattctc accgcgcgta ccccccaaac gccaagaaaa cggcacatgc 5100
gagcccaacc cgcgcctcaa cttctacccc gtatttgccg tgccagaggt gcttgccacc 5160
tatcacatct ttttccaaaa ctgcaagata cccctatcct gccgtgccaa ccgcagccga 5220
gcggacaagc agctggcctt gcggcagggc gctgtcatac ctgatatcgc ctcgctcaac 5280
gaagtgccaa aaatctttga gggtcttgga cgcgacgaga agcgcgcggc aaacgctctg 5340
caacaggaaa acagcgaaaa tgaaagtcac tctggagtgt tggtggaact cgagggtgac 5400
aacgcgcgcc tagccgtact aaaacgcagc atcgaggtca cccactttgc ctacccggca 5460
cttaacctac cccccaaggt catgagcaca gtcatgagtg agctgatcgt gcgccgtgcg 5520
cagcccctgg agagggatgc aaatttgcaa gaacaaacag aggagggcct acccgcagtt 5580
ggcgacgagc agctagcgcg ctggcttcaa acgcgcgagc ctgccgactt ggaggagcga 5640
cgcaaactaa tgatggccgc agtgctcgtt accgtggagc ttgagtgcat gcagcggttc 5700
tttgctgacc cggagatgca gcgcaagcta gaggaaacat tgcactacac ctttcgacag 5760
ggctacgtac gccaggcctg caagatctcc aacgtggagc tctgcaacct ggtctcctac 5820
cttggaattt tgcacgaaaa ccgccttggg caaaacgtgc ttcattccac gctcaagggc 5880
gaggcgcgcc gcgactacgt ccgcgactgc gtttacttat ttctatgcta cacctggcag 5940
acggccatgg gcgtttggca gcagtgcttg gaggagtgca acctcaagga gctgcagaaa 6000
ctgctaaagc aaaacttgaa ggacctatgg acggccttca acgagcgctc cgtggccgcg 6060
cacctggcgg acatcatttt ccccgaacgc ctgcttaaaa ccctgcaaca gggtctgcca 6120
gacttcacca gtcaaagcat gttgcagaac tttaggaact ttatcctaga gcgctcagga 6180
atcttgcccg ccacctgctg tgcacttcct agcgactttg tgcccattaa gtaccgcgaa 6240
tgccctccgc cgctttgggg ccactgctac cttctgcagc tagccaacta ccttgcctac 6300
cactctgaca taatggaaga cgtgagcggt gacggtctac tggagtgtca ctgtcgctgc 6360
aacctatgca ccccgcaccg ctccctggtt tgcaattcgc agctgcttaa cgaaagtcaa 6420
attatcggta cctttgagct gcagggtccc tcgcctgacg aaaagtccgc ggctccgggg 6480
ttgaaactca ctccggggct gtggacgtcg gcttaccttc gcaaatttgt acctgaggac 6540
taccacgccc acgagattag gttctacgaa gaccaatccc gcccgcctaa tgcggagctt 6600
accgcctgcg tcattaccca gggccacatt cttggccaat tgcaagccat caacaaagcc 6660
cgccaagagt ttctgctacg aaagggacgg ggggtttact tggaccccca gtccggcgag 6720
gagctcaacc caatcccccc gccgccgcag ccctatcagc agcagccgcg ggcccttgct 6780
tcccaggatg gcacccaaaa agaagctgca gctgccgccg ccacccacgg acgaggagga 6840
atactgggac agtcaggcag aggaggtttt ggacgaggag gaggaggaca tgatggaaga 6900
ctgggagagc ctagacgagg aagcttccga ggtcgaagag gtgtcagacg aaacaccgtc 6960
accctcggtc gcattcccct cgccggcgcc ccagaaatcg gcaaccggtt ccagcatggc 7020
tacaacctcc gctcctcagg cgccgccggc actgcccgtt cgccgaccca accgtagatg 7080
ggacaccact ggaaccaggg ccggtaagtc caagcagccg ccgccgttag cccaagagca 7140
acaacagcgc caaggctacc gctcatggcg cgggcacaag aacgccatag ttgcttgctt 7200
gcaagactgt gggggcaaca tctccttcgc ccgccgcttt cttctctacc atcacggcgt 7260
ggccttcccc cgtaacatcc tgcattacta ccgtcatctc tacagcccat actgcaccgg 7320
cggcagcggc agcaacagca gcggccacac agaagcaaag gcgaccggat agcaagactc 7380
tgacaaagcc caagaaatcc acagcggcgg cagcagcagg aggaggagcg ctgcgtctgg 7440
cgcccaacga acccgtatcg acccgcgagc ttagaaacag gatttttccc actctgtatg 7500
ctatatttca acagagcagg ggccaagaac aagagctgaa aataaaaaac aggtctctgc 7560
gatccctcac ccgcagctgc ctgtatcaca aaagcgaaga tcagcttcgg cgcacgctgg 7620
aagacgcgga ggctctcttc agtaaatact gcgcgctgac tcttaaggac tagtttcgcg 7680
ccctttctca aatttaagcg cgaaaactac gtcatctcca gcggccacac ccggcgccag 7740
cacctgttgt cagcgccatt atgagcaagg aaattcccac gccctacatg tggagttacc 7800
agccacaaat gggacttgcg gctggagctg cccaagacta ctcaacccga ataaactaca 7860
tgagcgcggg gcggccgcaa cttgtttatt gcagcttata atggttacaa ataaagcaat 7920
agcatcacaa atttcacaaa taaagcattt ttttcactgc attctagttg tggtttgtcc 7980
aaactcatca atgtatctta gcttaacggg cggcgaagga gaagtccacg cctacatggg 8040
ggtagagtca taatcgtgca tcaggatagg gcggtggtgc tgcagcagcg cgcgaataaa 8100
ctgctgccgc cgccgctccg tcctgcagga atacaacatg gcagtggtct cctcagcgat 8160
gattcgcacc gcccgcagca taaggcgcct tgtcctccgg gcacagcagc gcaccctgat 8220
ctcacttaaa tcagcacagt aactgcagca cagcaccaca atattgttca aaatcccaca 8280
gtgcaaggcg ctgtatccaa agctcatggc ggggaccaca gaacccacgt ggccatcata 8340
ccacaagcgc aggtagatta agtggcgacc cctcataaac acgctggaca taaacattac 8400
ctcttttggc atgttgtaat tcaccacctc ccggtaccat ataaacctct gattaaacat 8460
ggcgccatcc accaccatcc taaaccagct ggccaaaacc tgcccgccgg ctatacactg 8520
cagggaaccg ggactggaac aatgacagtg gagagcccag gactcgtaac catggatcat 8580
catgctcgtc atgatatcaa tgttggcaca acacaggcac acgtgcatac acttcctcag 8640
gattacaagc tcctcccgcg ttagaaccat atcccaggga acaacccatt cctgaatcag 8700
cgtaaatccc acactgcagg gaagacctcg cacgtaactc acgttgtgca ttgtcaaagt 8760
gttacattcg ggcagcagcg gatgatcctc cagtatggta gcgcgggttt ctgtctcaaa 8820
aggaggtaga cgatccctac tgtacggagt gcgccgagac aaccgagatc gtgttggtcg 8880
tagtgtcatg ccaaatggaa cgccggacgt agtcatattt cctgaagcaa aaccaggtgc 8940
gggcgtgaca aacagatctg cgtctccggt ctcgccgctt agatcgctct gtgtagtagt 9000
tgtagtatat ccactctctc aaagcatcca ggcgccccct ggcttcgggt tctatgtaaa 9060
ctccttcatg cgccgctgcc ctgataacat ccaccaccgc agaataagcc acacccagcc 9120
aacctacaca ttcgttctgc gagtcacaca cgggaggagc gggaagagct ggaagaacca 9180
tgtttttttt tttattccaa aagattatcc aaaacctcaa aatgaagatc tattaagtga 9240
acgcgctccc ctccggtggc gtggtcaaac tctacagcca aagaacagat aatggcattt 9300
gtaagatgtt gcacaatggc ttccaaaagg caaacggccc tcacgtccaa gtggacgtaa 9360
aggctaaacc cttcagggtg aatctcctct ataaacattc cagcaccttc aaccatgccc 9420
aaataattct catctcgcca ccttctcaat atatctctaa gcaaatcccg aatattaagt 9480
ccggccattg taaaaatctg ctccagagcg ccctccacct tcagcctcaa gcagcgaatc 9540
atgattgcaa aaattcaggt tcctcacaga cctgtataag attcaaaagc ggaacattaa 9600
caaaaatacc gcgatcccgt aggtcccttc gcagggccag ctgaacataa tcgtgcaggt 9660
ctgcacggac cagcgcggcc acttccccgc caggaaccat gacaaaagaa cccacactga 9720
ttatgacacg catactcgga gctatgctaa ccagcgtagc cccgatgtaa gcttgttgca 9780
tgggcggcga tataaaatgc aaggtgctgc tcaaaaaatc aggcaaagcc tcgcgcaaaa 9840
aagaaagcac atcgtagtca tgctcatgca gataaaggca ggtaagctcc ggaaccacca 9900
cagaaaaaga caccattttt ctctcaaaca tgtctgcggg tttctgcata aacacaaaat 9960
aaaataacaa aaaaacattt aaacattaga agcctgtctt acaacaggaa aaacaaccct 10020
tataagcata agacggacta cggccatgcc ggcgtgaccg taaaaaaact ggtcaccgtg 10080
attaaaaagc accaccgaca gctcctcggt catgtccgga gtcataatgt aagactcggt 10140
aaacacatca ggttgattca catcggtcag tgctaaaaag cgaccgaaat agcccggggg 10200
aatacatacc cgcaggcgta gagacaacat tacagccccc ataggaggta taacaaaatt 10260
aataggagag aaaaacacat aaacacctga aaaaccctcc tgcctaggca aaatagcacc 10320
ctcccgctcc agaacaacat acagcgcttc cacagcggca gccataacag tcagccttac 10380
cagtaaaaaa gaaaacctat taaaaaaaca ccactcgaca cggcaccagc tcaatcagtc 10440
acagtgtaaa aaagggccaa gtgcagagcg agtatatata ggactaaaaa atgacgtaac 10500
ggttaaagtc cacaaaaaac acccagaaaa ccgcacgcga acctacgccc agaaacgaaa 10560
gccaaaaaac ccacaacttc ctcaaatcgt cacttccgtt ttcccacgtt acgtcacttc 10620
ccattttaag aaaactacaa ttcccaacac atacaagtta ctccgccctt aattaaatcg 10680
gatccgatat ctagatgtat tcgcgaggta ccgagctcga attctctggc cgtcgtttta 10740
caacgtcgtg actgggaaaa ccctggcgtt acccaactta atcgccttgc agcacatccc 10800
cctttcgcca gctggcgtaa tagcgaagag gcccgcaccg atcgcccttc ccaacagttg 10860
cgcagcctga atggcgaatg gcgcctgatg cggtattttc tccttacgca tctgtgcggt 10920
atttcacacc gcatatcttc atttaaatgt gtgtcagtta gggtgtggaa agtccccagg 10980
ctccccagca ggcagaagta tgcaaagcat gcatctcaat tagtcagcaa ccaggtgtgg 11040
aaagtcccca ggctccccag caggcagaag tatgcaaagc atgcatctca attagtcagc 11100
aaccatagtc ccgcccctaa ctccgcccat cccgccccta actccgccca gttccgccca 11160
ttctccgccc catggctgac taattttttt tatttatgca gaggccgagg ccgcctcggc 11220
ctctgagcta ttccagaagt agtgaggagg cttttttgga ggcctaggct tttgcaaacg 11280
ccggcgcacc gcgggcccga tccaccggta ctgttggtaa agccaccatg ttttccggtg 11340
gcggcggccc gctgtccccc ggaggaaagt cggcggccag ggcggcgtcc gggttttttg 11400
cgcccgccgg ccctcgcgga gccagccggg gacccccgcc ttgtttgagg caaaactttt 11460
acaaccccta cctcgcccca gtcgggacgc aacagaagcc gaccgggcca acccagcgcc 11520
atacgtacta tagcgaatgc gatgaatttc gattcatcgc cccgcgggtg ctggacgagg 11580
atgccccccc ggagaagcgc gccggggtgc acgacggtca cctcaagcgc gcccccaagg 11640
tgtactgcgg gggggacgag cgcgacgtcc tccgcgtcgg gtcgggcggc ttctggccgc 11700
ggcgctcgcg cctgtggggc ggcgtggacc acgccccggc ggggttcaac cccaccgtca 11760
ccgtctttca cgtgtacgac atcctggaga acgtggagca cgcgtacggc atgcgcgcgg 11820
cccagttcca cgcgcggttt atggacgcca tcacaccgac ggggaccgtc atcacgctcc 11880
tgggcctgac tccggaaggc caccgggtgg ccgttcacgt ttacggcacg cggcagtact 11940
tttacatgaa caaggaggag gtcgacaggc acctacaatg ccgcgcccca cgagatctct 12000
gcgagcgcat ggccgcggcc ctgcgcgagt ccccgggcgc gtcgttccgc ggcatctccg 12060
cggaccactt cgaggcggag gtggtggagc gcaccgacgt gtactactac gagacgcgcc 12120
ccgctctgtt ttaccgcgtc tacgtccgaa gcgggcgcgt gctgtcgtac ctgtgcgaca 12180
acttctgccc ggccatcaag aagtacgagg gtggggtcga cgccaccacc cggttcatcc 12240
tggacaaccc cgggttcgtc accttcggct ggtaccgtct caaaccgggc cggaacaaca 12300
cgctagccca gccgcgggcc ccgatggcct tcgggacatc cagcgacgtc gagtttaact 12360
gtacggcgga caacctggcc atcgaggggg gcatgagcga cctaccggca tacaagctca 12420
tgtgcttcga tatcgaatgc aaggcggggg gggaggacga gctggccttt ccggtggccg 12480
ggcacccgga ggacctggtc atccagatat cctgtctgct ctacgacctg tccaccaccg 12540
ccctggagca cgtcctcctg ttttcgctcg gttcctgcga cctccccgaa tcccacctga 12600
acgagctggc ggccaggggc ctgcccacgc ccgtggttct ggaattcgac agcgaattcg 12660
agatgctgtt ggccttcatg acccttgtga aacagtacgg ccccgagttc gtgaccgggt 12720
acaacatcat caacttcgac tggcccttct tgctggccaa gctgacggac atttacaagg 12780
tccccctgga cgggtacggc cgcatgaacg gccggggcgt gtttcgcgtg tgggacatag 12840
gccagagcca cttccagaag cgcagcaaga taaaggtgaa cggcatggtg aacatcgaca 12900
tgtacgggat tataaccgac aagatcaagc tctcgagcta caagctcaac gccgtggccg 12960
aagccgtcct gaaggacaag aagaaggacc tgagctatcg cgacatcccc gcctactacg 13020
ccgccgggcc cgcgcaacgc ggggtgatcg gcgagtactg catacaggat tccctgctgg 13080
tgggccagct gttttttaag tttttgcccc atctggagct ctcggccgtc gcgcgcttgg 13140
cgggtattaa catcacccgc accatctacg acggccagca gatccgcgtc tttacgtgcc 13200
tgctgcgcct ggccgaccag aagggcttta ttctgccgga cacccagggg cgatttaggg 13260
gcgccggggg ggaggcgccc aagcgtccgg ccgcagcccg ggaggacgag gagcggccag 13320
aggaggaggg ggaggacgag gacgaacgcg aggagggcgg gggcgagcgg gagccggagg 13380
gcgcgcggga gaccgccggc aggcacgtgg ggtaccaggg ggccagggtc cttgacccca 13440
cttccgggtt tcacgtgaac cccgtggtgg tgttcgactt tgccagcctg taccccagca 13500
tcatccaggc ccacaacctg tgcttcagca cgctctccct gagggccgac gcagtggcgc 13560
acctggaggc gggcaaggac tacctggaga tcgaggtggg ggggcgacgg ctgttcttcg 13620
tcaaggctca cgtgcgagag agcctcctca gcatcctcct gcgggactgg ctcgccatgc 13680
gaaagcagat ccgctcgcgg attccccaga gcagccccga ggaggccgtg ctcctggaca 13740
agcagcaggc cgccatcaag gtcgtgtgta actcggtgta cgggttcacg ggagtgcagc 13800
acggactcct gccgtgcctg cacgttgccg cgacggtgac gaccatcggc cgcgagatgc 13860
tgctcgcgac ccgcgagtac gtccacgcgc gctgggcggc cttcgaacag ctcctggccg 13920
atttcccgga ggcggccgac atgcgcgccc ccgggcccta ttccatgcgc atcatctacg 13980
gggacacgga ctccatcttt gtgctgtgcc gcggcctcac ggccgccggg ctgacggccg 14040
tgggcgacaa gatggcgagc cacatctcgc gcgcgctgtt tctgcccccc atcaaactcg 14100
agtgcgaaaa gacgttcacc aagctgctgc tgatcgccaa gaaaaagtac atcggcgtca 14160
tctacggggg taagatgctc atcaagggcg tggatctggt gcgcaaaaac aactgcgcgt 14220
ttatcaaccg cacctccagg gccctggtcg acctgctgtt ttacgacgat accgtctccg 14280
gagccgccgc ggcgttagcc gagcgccccg cggaggagtg gctggcgcga cccctgcccg 14340
agggactgca ggcgttcggg gccgtcctcg tagacgccca tcggcgcatc accgacccgg 14400
agagggacat ccaggacttt gtcctcaccg ccgaactgag cagacacccg cgcgcgtaca 14460
ccaacaagcg cctggcccac ctgacggtgt attacaagct catggcccgc cgcgcgcagg 14520
tcccgtccat caaggaccgg atcccgtacg tgatcgtggc ccagacccgc gaggtagagg 14580
agacggtcgc gcggctggcc gccctccgcg agctagacgc cgccgcccca ggggacgagc 14640
ccgccccccc cgcggccctg ccctccccgg ccaagcgccc ccgggagacg ccgtcgcctg 14700
ccgacccccc gggaggcgcg tccaagcccc gcaagctgct ggtgtccgag ctggccgagg 14760
atcccgcata cgccattgcc cacggcgtcg ccctgaacac ggactattac ttctcccacc 14820
tgttgggggc ggcgtgcgtg acattcaagg ccctgtttgg gaataacgcc aagatcaccg 14880
agagtctgtt aaaaaggttt attcccgaag tgtggcaccc cccggacgac gtggccgcgc 14940
ggctccggac cgcagggttc ggggcggtgg gtgccggcgc tacggcggag gaaactcgtc 15000
gaatgttgca tagagccttt gatactctag cagaattcgg cagtggagca acaaacttct 15060
ctctgctgaa acaagccgga gatgtcgaag agaatcctgg accgacggat tcccctggcg 15120
gtgtggcccc cgcctccccc gtggaggacg cgtcggacgc gtccctcggg cagccggagg 15180
agggggcgcc ctgccaggtg gtcctgcagg gcgccgaact taatggaatc ctacaggcgt 15240
ttgccccgct gcgcacgagc cttctggact cgcttctggt tatgggcgac cggggcatcc 15300
ttatccataa cacgatcttt ggggagcagg tgttcctgcc cctggaacac tcgcaattca 15360
gtcggtatcg ctggcgcgga cccacggcgg cgttcctgtc tctcgtggac cagaagcgct 15420
ccctcctgag cgtgtttcgc gccaaccagt acccggacct acgtcgggtg gagttggcga 15480
tcacgggcca ggccccgttt cgcacgctgg ttcagcgcat atggacgacg acgtccgacg 15540
gcgaggccgt tgagctagcc agcgagacgc tgatgaagcg cgaactgacg agctttgtgg 15600
tgctggttcc ccagggaacc cccgacgttc agttgcgcct gacgaggccg cagctcacca 15660
aggtccttaa cgcgaccggg gccgatagtg ccacgcccac cacgttcgag ctcggggtta 15720
acggcaaatt ttccgtgttc accacgagta cctgcgtcac ctttgctgcc cgcgaggagg 15780
gcgtgtcgtc cagcaccagc acccaggtcc agatcctgtc caacgcgctc accaaggcgg 15840
gccaggccgc cgcgaacgcc aagacggtgt acggggaaaa tacccatcgc accttctctg 15900
tggtcgtcga cgattgcagc atgcgggcgg tgctccggcg actgcaggtc ggcgggggca 15960
ccctcaagtt cttcctcacg acccccgtcc ccagtctgtg cgtcaccgcc accggtccca 16020
acgcggtatc ggcggtattt ctcctgaaac cccagaagat ttgcctggac tggctgggtc 16080
atagccaggg gtctccttca gccgggagct cggcctcccg ggcctctggg agcgagccaa 16140
cagacagcca ggactccgcg tcggacgcgg tcagccacgg cgatccggaa gacctcgatg 16200
gcgctgcccg ggcgggagag gcgggggcct tgcatgcctg tccgatgccg tcgtcgacca 16260
cgcgggtcac tcccacgacc aagcgggggc gctcgggggg cgaggatgcg cgcgcggaca 16320
cggccctaaa gaaacctaag acggggtcgc ccaccgcacc cccgcccgca gatccagtcc 16380
ccctggacac ggaggacgac tccgatgcgg cggacgggac ggcggcccgt cccgccgctc 16440
cagacgcccg gagcggaagc cgttacgcgt gttactttcg cgacctcccg accggagaag 16500
caagccccgg cgccttctcc gccttccggg ggggccccca aaccccgtat ggttttggat 16560
tcccctgata ggatccgact gcaggtagct gtgccttcta gttgccagcc atctgttgtt 16620
tgcccctccc ccgtgccttc cttgaccctg gaaggtgcca ctcccactgt cctttcctaa 16680
taaaatgagg aaattgcatc gcattgtctg agtaggtgtc attctattct ggggggtggg 16740
gtggggcagg acagcaaggg ggaggattgg gaagacaata gcaggcatgc tggggatgcg 16800
gtgggctcta tgggttttat ggtgcactct cagtacaatc tgctctgatg ccgcatagtt 16860
aagccagccc cgacacccgc caacacccgc tgacgcgccc tgacgggctt gtctgctccc 16920
ggcatccgct tacagacaag ctgtgaccgt ctccgggagc tgcatgtgtc agaggttttc 16980
accgtcatca ccgaaacgcg cga 17003
<210> 102
<211> 17003
<212> DNA
<213> Artificial Sequence
<220>
<223> Description of Artificial Sequence: Synthetic polynucleotide
<400> 102
tgcagctctg gcccgtgtct caaaatctct gatgttacat tgcacaagat aaaaatatat 60
catcatgaac aataaaactg tctgcttaca taaacagtaa tacaaggggt gttatgagcc 120
atattcaacg ggaaacgtcg aggccgcgat taaattccaa catggatgct gatttatatg 180
ggtataaatg ggctcgcgat aatgtcgggc aatcaggtgc gacaatctat cgcttgtatg 240
ggaagcccga tgcgccagag ttgtttctga aacatggcaa aggtagcgtt gccaatgatg 300
ttacagatga gatggtcaga ctaaactggc tgacggaatt tatgcctctt ccgaccatca 360
agcattttat ccgtactcct gatgatgcat ggttactcac cactgcgatc cccggaaaaa 420
cagcattcca ggtattagaa gaatatcctg attcaggtga aaatattgtt gatgcgctgg 480
cagtgttcct gcgccggttg cattcgattc ctgtttgtaa ttgtcctttt aacagcgatc 540
gcgtatttcg tctcgctcag gcgcaatcac gaatgaataa cggtttggtt gatgcgagtg 600
attttgatga cgagcgtaat ggctggcctg ttgaacaagt ctggaaagaa atgcataaac 660
ttttgccatt ctcaccggat tcagtcgtca ctcatggtga tttctcactt gataacctta 720
tttttgacga ggggaaatta ataggttgta ttgatgttgg acgagtcgga atcgcagacc 780
gataccagga tcttgccatc ctatggaact gcctcggtga gttttctcct tcattacaga 840
aacggctttt tcaaaaatat ggtattgata atcctgatat gaataaattg cagtttcatt 900
tgatgctcga tgagtttttc taatcagaat tggttaattg gttgtaacat tattcagatt 960
gggcttgatt taaaacttca tttttaattt aaaaggatct aggtgaagat cctttttgat 1020
aatctcatga ccaaaatccc ttaacgtgag ttttcgttcc actgagcgtc agaccccgta 1080
gaaaagatca aaggatcttc ttgagatcct ttttttctgc gcgtaatctg ctgcttgcaa 1140
acaaaaaaac caccgctacc agcggtggtt tgtttgccgg atcaagagct accaactctt 1200
tttccgaagg taactggctt cagcagagcg cagataccaa atactgttct tctagtgtag 1260
ccgtagttag gccaccactt caagaactct gtagcaccgc ctacatacct cgctctgcta 1320
atcctgttac cagtggctgc tgccagtggc gataagtcgt gtcttaccgg gttggactca 1380
agacgatagt taccggataa ggcgcagcgg tcgggctgaa cggggggttc gtgcacacag 1440
cccagcttgg agcgaacgac ctacaccgaa ctgagatacc tacagcgtga gctatgagaa 1500
agcgccacgc ttcccgaagg gagaaaggcg gacaggtatc cggtaagcgg cagggtcgga 1560
acaggagagc gcacgaggga gcttccaggg ggaaacgcct ggtatcttta tagtcctgtc 1620
gggtttcgcc acctctgact tgagcgtcga tttttgtgat gctcgtcagg ggggcggagc 1680
ctatggaaaa acgccagcaa cgcggccttt ttacggttcc tggccttttg ctggcctttt 1740
gctcacatgt tctttcctgc gttatcccct gattctgtgg ataaccgtat taccgccttt 1800
gagtgagctg ataccgctcg ccgcagccga acgaccgagc gcagcgagtc agtgagcgag 1860
gaagcggaag agcgcccaat acgcaaaccg cctctccccg cgcgttggcc gattcattaa 1920
tgcagctggc acgacaggtt tcccgactgg aaagcgggca gtgagcgcaa cgcaattaat 1980
gtgagttagc tcactcatta ggcaccccag gctttacact ttatgcttcc ggctcgtatg 2040
ttgtgtggaa ttgtgagcgg ataacaattt cacacaggaa acagctatga ccatgattac 2100
accaagcttg catgcaggcc tatccgtaga tgtacctgga catccaggtg atgccggcgg 2160
cggtggtgga ggcgcgcgga aagtcgcgga cgcggttcca gatgttgcgc agcggcaaaa 2220
agtgctccat ggtcgggacg ctctggccgg tgaggcgtgc gcagtcgttg acgctctaga 2280
ccgtgcaaaa ggagagcctg taagcgggca ctcttccgtg gtctggtgga taaattcgca 2340
agggtatcat ggcggacgac cggggttcga accccggatc cggccgtccg ccgtgatcca 2400
tgcggttacc gcccgcgtgt cgaacccagg tgtgcgacgt cagacaacgg gggagcgctc 2460
cttttggctt ccttccaggc gcggcggctg ctgcgctagc ttttttggcc actggccgcg 2520
cgcggcgtaa gcggttaggc tggaaagcga aagcattaag tggctcgctc cctgtagccg 2580
gagggttatt ttccaagggt tgagtcgcag gacccccggt tcgagtctcg ggccggccgg 2640
actgcggcga acgggggttt gcctccccgt catgcaagac cccgcttgca aattcctccg 2700
gaaacaggga cgagcccctt ttttgctttt cccagatgca tccggtgctg cggcagatgc 2760
gcccccctcc tcagcagcgg caagagcaag agcagcggca gacatgcagg gcaccctccc 2820
cttctcctac cgcgtcagga ggggcaacat cgatccagac atgataagat acattgatga 2880
gtttggacaa accacaacta gaatgcagtg aaaaaaatgc tttatttgtg aaatttgtga 2940
tgctattgct ttatttgtaa ccattataag ctgcaataaa caagtttgta cactctcggg 3000
tgattattta cccccaccct tgccgtctgc gccgtttaaa aatcaaaggg gttctgccgc 3060
gcatcgctat gcgccactgg cagggacacg ttgcgatact ggtgtttagt gctccactta 3120
aactcaggca caaccatccg cggcagctcg gtgaagtttt cactccacag gctgcgcacc 3180
atcaccaacg cgtttagcag gtcgggcgcc gatatcttga agtcgcagtt ggggcctccg 3240
ccctgcgcgc gcgagttgcg atacacaggg ttgcagcact ggaacactat cagcgccggg 3300
tggtgcacgc tggccagcac gctcttgtcg gagatcagat ccgcgtccag gtcctccgcg 3360
ttgctcaggg cgaacggagt caactttggt agctgccttc ccaaaaaggg cgcgtgccca 3420
ggctttgagt tgcactcgca ccgtagtggc atcaaaaggt gaccgtgccc ggtctgggcg 3480
ttaggataca gcgcctgcat aaaagccttg atctgcttaa aagccacctg agcctttgcg 3540
ccttcagaga agaacatgcc gcaagacttg ccggaaaact gattggccgg acaggccgcg 3600
tcgtgcacgc agcaccttgc gtcggtgttg gagatctgca ccacatttcg gccccaccgg 3660
ttcttcacga tcttggcctt gctagactgc tccttcagcg cgcgctgccc gttttcgctc 3720
gtcacatcca tttcaatcac gtgctcctta tttatcataa tgcttccgtg tagacactta 3780
agctcgcctt cgatctcagc gcagcggtgc agccacaacg cgcagcccgt gggctcgtga 3840
tgcttgtagg tcacctctgc aaacgactgc aggtacgcct gcaggaatcg ccccatcatc 3900
gtcacaaagg tcttgttgct ggtgaaggtc agctgcaacc cgcggtgctc ctcgttcagc 3960
caggtcttgc atacggccgc cagagcttcc acttggtcag gcagtagttt gaagttcgcc 4020
tttagatcgt tatccacgtg gtacttgtcc atcagcgcgc gcgcagcctc catgcccttc 4080
tcccacgcag acacgatcgg cacactcagc gggttcatca ccgtaatttc actttccgct 4140
tcgctgggct cttcctcttc ctcttgcgtc cgcataccac gcgccactgg gtcgtcttca 4200
ttcagccgcc gcactgtgcg cttacctcct ttgccatgct tgattagcac cggtgggttg 4260
ctgaaaccca ccatttgtag cgccacatct tctctttctt cctcgctgtc cacgattacc 4320
tctggtgatg gcgggcgctc gggcttggga gaagggcgct tctttttctt cttgggcgca 4380
atggccaaat ccgccgccga ggtcgatggc cgcgggctgg gtgtgcgcgg caccagcgcg 4440
tcttgtgatg agtcttcctc gtcctcggac tcgatacgcc gcctcatccg cttttttggg 4500
ggcgcccggg gaggcggcgg cgacggggac ggggacgaca cgtcctccat ggttggggga 4560
cgtcgcgccg caccgcgtcc gcgctcgggg gtggtttcgc gctgctcctc ttcccgactg 4620
gccatttcct tctcctatag gcagaaaaag atcatggagt cagtcgagaa gaaggacagc 4680
ctaaccgccc cctctgagtt cgccaccacc gcctccaccg atgccgccaa cgcgcctacc 4740
accttccccg tcgaggcacc cccgcttgag gaggaggaag tgattatcga gcaggaccca 4800
ggttttgtaa gcgaagacga cgaggaccgc tcagtaccaa cagaggataa aaagcaagac 4860
caggacaacg cagaggcaaa cgaggaacaa gtcgggcggg gggacgaaag gcatggcgac 4920
tacctagatg tgggagacga cgtgctgttg aagcatctgc agcgccagtg cgccattatc 4980
tgcgacgcgt tgcaagagcg cagcgatgtg cccctcgcca tagcggatgt cagccttgcc 5040
tacgaacgcc acctattctc accgcgcgta ccccccaaac gccaagaaaa cggcacatgc 5100
gagcccaacc cgcgcctcaa cttctacccc gtatttgccg tgccagaggt gcttgccacc 5160
tatcacatct ttttccaaaa ctgcaagata cccctatcct gccgtgccaa ccgcagccga 5220
gcggacaagc agctggcctt gcggcagggc gctgtcatac ctgatatcgc ctcgctcaac 5280
gaagtgccaa aaatctttga gggtcttgga cgcgacgaga agcgcgcggc aaacgctctg 5340
caacaggaaa acagcgaaaa tgaaagtcac tctggagtgt tggtggaact cgagggtgac 5400
aacgcgcgcc tagccgtact aaaacgcagc atcgaggtca cccactttgc ctacccggca 5460
cttaacctac cccccaaggt catgagcaca gtcatgagtg agctgatcgt gcgccgtgcg 5520
cagcccctgg agagggatgc aaatttgcaa gaacaaacag aggagggcct acccgcagtt 5580
ggcgacgagc agctagcgcg ctggcttcaa acgcgcgagc ctgccgactt ggaggagcga 5640
cgcaaactaa tgatggccgc agtgctcgtt accgtggagc ttgagtgcat gcagcggttc 5700
tttgctgacc cggagatgca gcgcaagcta gaggaaacat tgcactacac ctttcgacag 5760
ggctacgtac gccaggcctg caagatctcc aacgtggagc tctgcaacct ggtctcctac 5820
cttggaattt tgcacgaaaa ccgccttggg caaaacgtgc ttcattccac gctcaagggc 5880
gaggcgcgcc gcgactacgt ccgcgactgc gtttacttat ttctatgcta cacctggcag 5940
acggccatgg gcgtttggca gcagtgcttg gaggagtgca acctcaagga gctgcagaaa 6000
ctgctaaagc aaaacttgaa ggacctatgg acggccttca acgagcgctc cgtggccgcg 6060
cacctggcgg acatcatttt ccccgaacgc ctgcttaaaa ccctgcaaca gggtctgcca 6120
gacttcacca gtcaaagcat gttgcagaac tttaggaact ttatcctaga gcgctcagga 6180
atcttgcccg ccacctgctg tgcacttcct agcgactttg tgcccattaa gtaccgcgaa 6240
tgccctccgc cgctttgggg ccactgctac cttctgcagc tagccaacta ccttgcctac 6300
cactctgaca taatggaaga cgtgagcggt gacggtctac tggagtgtca ctgtcgctgc 6360
aacctatgca ccccgcaccg ctccctggtt tgcaattcgc agctgcttaa cgaaagtcaa 6420
attatcggta cctttgagct gcagggtccc tcgcctgacg aaaagtccgc ggctccgggg 6480
ttgaaactca ctccggggct gtggacgtcg gcttaccttc gcaaatttgt acctgaggac 6540
taccacgccc acgagattag gttctacgaa gaccaatccc gcccgcctaa tgcggagctt 6600
accgcctgcg tcattaccca gggccacatt cttggccaat tgcaagccat caacaaagcc 6660
cgccaagagt ttctgctacg aaagggacgg ggggtttact tggaccccca gtccggcgag 6720
gagctcaacc caatcccccc gccgccgcag ccctatcagc agcagccgcg ggcccttgct 6780
tcccaggatg gcacccaaaa agaagctgca gctgccgccg ccacccacgg acgaggagga 6840
atactgggac agtcaggcag aggaggtttt ggacgaggag gaggaggaca tgatggaaga 6900
ctgggagagc ctagacgagg aagcttccga ggtcgaagag gtgtcagacg aaacaccgtc 6960
accctcggtc gcattcccct cgccggcgcc ccagaaatcg gcaaccggtt ccagcatggc 7020
tacaacctcc gctcctcagg cgccgccggc actgcccgtt cgccgaccca accgtagatg 7080
ggacaccact ggaaccaggg ccggtaagtc caagcagccg ccgccgttag cccaagagca 7140
acaacagcgc caaggctacc gctcatggcg cgggcacaag aacgccatag ttgcttgctt 7200
gcaagactgt gggggcaaca tctccttcgc ccgccgcttt cttctctacc atcacggcgt 7260
ggccttcccc cgtaacatcc tgcattacta ccgtcatctc tacagcccat actgcaccgg 7320
cggcagcggc agcaacagca gcggccacac agaagcaaag gcgaccggat agcaagactc 7380
tgacaaagcc caagaaatcc acagcggcgg cagcagcagg aggaggagcg ctgcgtctgg 7440
cgcccaacga acccgtatcg acccgcgagc ttagaaacag gatttttccc actctgtatg 7500
ctatatttca acagagcagg ggccaagaac aagagctgaa aataaaaaac aggtctctgc 7560
gatccctcac ccgcagctgc ctgtatcaca aaagcgaaga tcagcttcgg cgcacgctgg 7620
aagacgcgga ggctctcttc agtaaatact gcgcgctgac tcttaaggac tagtttcgcg 7680
ccctttctca aatttaagcg cgaaaactac gtcatctcca gcggccacac ccggcgccag 7740
cacctgttgt cagcgccatt atgagcaagg aaattcccac gccctacatg tggagttacc 7800
agccacaaat gggacttgcg gctggagctg cccaagacta ctcaacccga ataaactaca 7860
tgagcgcggg gcggccgcaa cttgtttatt gcagcttata atggttacaa ataaagcaat 7920
agcatcacaa atttcacaaa taaagcattt ttttcactgc attctagttg tggtttgtcc 7980
aaactcatca atgtatctta gcttaacggg cggcgaagga gaagtccacg cctacatggg 8040
ggtagagtca taatcgtgca tcaggatagg gcggtggtgc tgcagcagcg cgcgaataaa 8100
ctgctgccgc cgccgctccg tcctgcagga atacaacatg gcagtggtct cctcagcgat 8160
gattcgcacc gcccgcagca taaggcgcct tgtcctccgg gcacagcagc gcaccctgat 8220
ctcacttaaa tcagcacagt aactgcagca cagcaccaca atattgttca aaatcccaca 8280
gtgcaaggcg ctgtatccaa agctcatggc ggggaccaca gaacccacgt ggccatcata 8340
ccacaagcgc aggtagatta agtggcgacc cctcataaac acgctggaca taaacattac 8400
ctcttttggc atgttgtaat tcaccacctc ccggtaccat ataaacctct gattaaacat 8460
ggcgccatcc accaccatcc taaaccagct ggccaaaacc tgcccgccgg ctatacactg 8520
cagggaaccg ggactggaac aatgacagtg gagagcccag gactcgtaac catggatcat 8580
catgctcgtc atgatatcaa tgttggcaca acacaggcac acgtgcatac acttcctcag 8640
gattacaagc tcctcccgcg ttagaaccat atcccaggga acaacccatt cctgaatcag 8700
cgtaaatccc acactgcagg gaagacctcg cacgtaactc acgttgtgca ttgtcaaagt 8760
gttacattcg ggcagcagcg gatgatcctc cagtatggta gcgcgggttt ctgtctcaaa 8820
aggaggtaga cgatccctac tgtacggagt gcgccgagac aaccgagatc gtgttggtcg 8880
tagtgtcatg ccaaatggaa cgccggacgt agtcatattt cctgaagcaa aaccaggtgc 8940
gggcgtgaca aacagatctg cgtctccggt ctcgccgctt agatcgctct gtgtagtagt 9000
tgtagtatat ccactctctc aaagcatcca ggcgccccct ggcttcgggt tctatgtaaa 9060
ctccttcatg cgccgctgcc ctgataacat ccaccaccgc agaataagcc acacccagcc 9120
aacctacaca ttcgttctgc gagtcacaca cgggaggagc gggaagagct ggaagaacca 9180
tgtttttttt tttattccaa aagattatcc aaaacctcaa aatgaagatc tattaagtga 9240
acgcgctccc ctccggtggc gtggtcaaac tctacagcca aagaacagat aatggcattt 9300
gtaagatgtt gcacaatggc ttccaaaagg caaacggccc tcacgtccaa gtggacgtaa 9360
aggctaaacc cttcagggtg aatctcctct ataaacattc cagcaccttc aaccatgccc 9420
aaataattct catctcgcca ccttctcaat atatctctaa gcaaatcccg aatattaagt 9480
ccggccattg taaaaatctg ctccagagcg ccctccacct tcagcctcaa gcagcgaatc 9540
atgattgcaa aaattcaggt tcctcacaga cctgtataag attcaaaagc ggaacattaa 9600
caaaaatacc gcgatcccgt aggtcccttc gcagggccag ctgaacataa tcgtgcaggt 9660
ctgcacggac cagcgcggcc acttccccgc caggaaccat gacaaaagaa cccacactga 9720
ttatgacacg catactcgga gctatgctaa ccagcgtagc cccgatgtaa gcttgttgca 9780
tgggcggcga tataaaatgc aaggtgctgc tcaaaaaatc aggcaaagcc tcgcgcaaaa 9840
aagaaagcac atcgtagtca tgctcatgca gataaaggca ggtaagctcc ggaaccacca 9900
cagaaaaaga caccattttt ctctcaaaca tgtctgcggg tttctgcata aacacaaaat 9960
aaaataacaa aaaaacattt aaacattaga agcctgtctt acaacaggaa aaacaaccct 10020
tataagcata agacggacta cggccatgcc ggcgtgaccg taaaaaaact ggtcaccgtg 10080
attaaaaagc accaccgaca gctcctcggt catgtccgga gtcataatgt aagactcggt 10140
aaacacatca ggttgattca catcggtcag tgctaaaaag cgaccgaaat agcccggggg 10200
aatacatacc cgcaggcgta gagacaacat tacagccccc ataggaggta taacaaaatt 10260
aataggagag aaaaacacat aaacacctga aaaaccctcc tgcctaggca aaatagcacc 10320
ctcccgctcc agaacaacat acagcgcttc cacagcggca gccataacag tcagccttac 10380
cagtaaaaaa gaaaacctat taaaaaaaca ccactcgaca cggcaccagc tcaatcagtc 10440
acagtgtaaa aaagggccaa gtgcagagcg agtatatata ggactaaaaa atgacgtaac 10500
ggttaaagtc cacaaaaaac acccagaaaa ccgcacgcga acctacgccc agaaacgaaa 10560
gccaaaaaac ccacaacttc ctcaaatcgt cacttccgtt ttcccacgtt acgtcacttc 10620
ccattttaag aaaactacaa ttcccaacac atacaagtta ctccgccctt aattaaatcg 10680
gatccgatat ctagatgtat tcgcgaggta ccgagctcga attctctggc cgtcgtttta 10740
caacgtcgtg actgggaaaa ccctggcgtt acccaactta atcgccttgc agcacatccc 10800
cctttcgcca gctggcgtaa tagcgaagag gcccgcaccg atcgcccttc ccaacagttg 10860
cgcagcctga atggcgaatg gcgcctgatg cggtattttc tccttacgca tctgtgcggt 10920
atttcacacc gcataaaacc catagagccc accgcatccc cagcatgcct gctattgtct 10980
tcccaatcct cccccttgct gtcctgcccc accccacccc ccagaataga atgacaccta 11040
ctcagacaat gcgatgcaat ttcctcattt tattaggaaa ggacagtggg agtggcacct 11100
tccagggtca aggaaggcac gggggagggg caaacaacag atggctggca actagaaggc 11160
acagctacct gcagtcggat cctatcaggg gaatccaaaa ccatacgggg tttgggggcc 11220
cccccggaag gcggagaagg cgccggggct tgcttctccg gtcgggaggt cgcgaaagta 11280
acacgcgtaa cggcttccgc tccgggcgtc tggagcggcg ggacgggccg ccgtcccgtc 11340
cgccgcatcg gagtcgtcct ccgtgtccag ggggactgga tctgcgggcg ggggtgcggt 11400
gggcgacccc gtcttaggtt tctttagggc cgtgtccgcg cgcgcatcct cgccccccga 11460
gcgcccccgc ttggtcgtgg gagtgacccg cgtggtcgac gacggcatcg gacaggcatg 11520
caaggccccc gcctctcccg cccgggcagc gccatcgagg tcttccggat cgccgtggct 11580
gaccgcgtcc gacgcggagt cctggctgtc tgttggctcg ctcccagagg cccgggaggc 11640
cgagctcccg gctgaaggag acccctggct atgacccagc cagtccaggc aaatcttctg 11700
gggtttcagg agaaataccg ccgataccgc gttgggaccg gtggcggtga cgcacagact 11760
ggggacgggg gtcgtgagga agaacttgag ggtgcccccg ccgacctgca gtcgccggag 11820
caccgcccgc atgctgcaat cgtcgacgac cacagagaag gtgcgatggg tattttcccc 11880
gtacaccgtc ttggcgttcg cggcggcctg gcccgccttg gtgagcgcgt tggacaggat 11940
ctggacctgg gtgctggtgc tggacgacac gccctcctcg cgggcagcaa aggtgacgca 12000
ggtactcgtg gtgaacacgg aaaatttgcc gttaaccccg agctcgaacg tggtgggcgt 12060
ggcactatcg gccccggtcg cgttaaggac cttggtgagc tgcggcctcg tcaggcgcaa 12120
ctgaacgtcg ggggttccct ggggaaccag caccacaaag ctcgtcagtt cgcgcttcat 12180
cagcgtctcg ctggctagct caacggcctc gccgtcggac gtcgtcgtcc atatgcgctg 12240
aaccagcgtg cgaaacgggg cctggcccgt gatcgccaac tccacccgac gtaggtccgg 12300
gtactggttg gcgcgaaaca cgctcaggag ggagcgcttc tggtccacga gagacaggaa 12360
cgccgccgtg ggtccgcgcc agcgataccg actgaattgc gagtgttcca ggggcaggaa 12420
cacctgctcc ccaaagatcg tgttatggat aaggatgccc cggtcgccca taaccagaag 12480
cgagtccaga aggctcgtgc gcagcggggc aaacgcctgt aggattccat taagttcggc 12540
gccctgcagg accacctggc agggcgcccc ctcctccggc tgcccgaggg acgcgtccga 12600
cgcgtcctcc acgggggagg cgggggccac accgccaggg gaatccgtcg gtccaggatt 12660
ctcttcgaca tctccggctt gtttcagcag agagaagttt gttgctccac tgccgaattc 12720
tgctagagta tcaaaggctc tatgcaacat tcgacgagtt tcctccgccg tagcgccggc 12780
acccaccgcc ccgaaccctg cggtccggag ccgcgcggcc acgtcgtccg gggggtgcca 12840
cacttcggga ataaaccttt ttaacagact ctcggtgatc ttggcgttat tcccaaacag 12900
ggccttgaat gtcacgcacg ccgcccccaa caggtgggag aagtaatagt ccgtgttcag 12960
ggcgacgccg tgggcaatgg cgtatgcggg atcctcggcc agctcggaca ccagcagctt 13020
gcggggcttg gacgcgcctc ccggggggtc ggcaggcgac ggcgtctccc gggggcgctt 13080
ggccggggag ggcagggccg cggggggggc gggctcgtcc cctggggcgg cggcgtctag 13140
ctcgcggagg gcggccagcc gcgcgaccgt ctcctctacc tcgcgggtct gggccacgat 13200
cacgtacggg atccggtcct tgatggacgg gacctgcgcg cggcgggcca tgagcttgta 13260
atacaccgtc aggtgggcca ggcgcttgtt ggtgtacgcg cgcgggtgtc tgctcagttc 13320
ggcggtgagg acaaagtcct ggatgtccct ctccgggtcg gtgatgcgcc gatgggcgtc 13380
tacgaggacg gccccgaacg cctgcagtcc ctcgggcagg ggtcgcgcca gccactcctc 13440
cgcggggcgc tcggctaacg ccgcggcggc tccggagacg gtatcgtcgt aaaacagcag 13500
gtcgaccagg gccctggagg tgcggttgat aaacgcgcag ttgtttttgc gcaccagatc 13560
cacgcccttg atgagcatct tacccccgta gatgacgccg atgtactttt tcttggcgat 13620
cagcagcagc ttggtgaacg tcttttcgca ctcgagtttg atggggggca gaaacagcgc 13680
gcgcgagatg tggctcgcca tcttgtcgcc cacggccgtc agcccggcgg ccgtgaggcc 13740
gcggcacagc acaaagatgg agtccgtgtc cccgtagatg atgcgcatgg aatagggccc 13800
gggggcgcgc atgtcggccg cctccgggaa atcggccagg agctgttcga aggccgccca 13860
gcgcgcgtgg acgtactcgc gggtcgcgag cagcatctcg cggccgatgg tcgtcaccgt 13920
cgcggcaacg tgcaggcacg gcaggagtcc gtgctgcact cccgtgaacc cgtacaccga 13980
gttacacacg accttgatgg cggcctgctg cttgtccagg agcacggcct cctcggggct 14040
gctctgggga atccgcgagc ggatctgctt tcgcatggcg agccagtccc gcaggaggat 14100
gctgaggagg ctctctcgca cgtgagcctt gacgaagaac agccgtcgcc cccccacctc 14160
gatctccagg tagtccttgc ccgcctccag gtgcgccact gcgtcggccc tcagggagag 14220
cgtgctgaag cacaggttgt gggcctggat gatgctgggg tacaggctgg caaagtcgaa 14280
caccaccacg gggttcacgt gaaacccgga agtggggtca aggaccctgg ccccctggta 14340
ccccacgtgc ctgccggcgg tctcccgcgc gccctccggc tcccgctcgc ccccgccctc 14400
ctcgcgttcg tcctcgtcct ccccctcctc ctctggccgc tcctcgtcct cccgggctgc 14460
ggccggacgc ttgggcgcct cccccccggc gcccctaaat cgcccctggg tgtccggcag 14520
aataaagccc ttctggtcgg ccaggcgcag caggcacgta aagacgcgga tctgctggcc 14580
gtcgtagatg gtgcgggtga tgttaatacc cgccaagcgc gcgacggccg agagctccag 14640
atggggcaaa aacttaaaaa acagctggcc caccagcagg gaatcctgta tgcagtactc 14700
gccgatcacc ccgcgttgcg cgggcccggc ggcgtagtag gcggggatgt cgcgatagct 14760
caggtccttc ttcttgtcct tcaggacggc ttcggccacg gcgttgagct tgtagctcga 14820
gagcttgatc ttgtcggtta taatcccgta catgtcgatg ttcaccatgc cgttcacctt 14880
tatcttgctg cgcttctgga agtggctctg gcctatgtcc cacacgcgaa acacgccccg 14940
gccgttcatg cggccgtacc cgtccagggg gaccttgtaa atgtccgtca gcttggccag 15000
caagaagggc cagtcgaagt tgatgatgtt gtacccggtc acgaactcgg ggccgtactg 15060
tttcacaagg gtcatgaagg ccaacagcat ctcgaattcg ctgtcgaatt ccagaaccac 15120
gggcgtgggc aggcccctgg ccgccagctc gttcaggtgg gattcgggga ggtcgcagga 15180
accgagcgaa aacaggagga cgtgctccag ggcggtggtg gacaggtcgt agagcagaca 15240
ggatatctgg atgaccaggt cctccgggtg cccggccacc ggaaaggcca gctcgtcctc 15300
cccccccgcc ttgcattcga tatcgaagca catgagcttg tatgccggta ggtcgctcat 15360
gcccccctcg atggccaggt tgtccgccgt acagttaaac tcgacgtcgc tggatgtccc 15420
gaaggccatc ggggcccgcg gctgggctag cgtgttgttc cggcccggtt tgagacggta 15480
ccagccgaag gtgacgaacc cggggttgtc caggatgaac cgggtggtgg cgtcgacccc 15540
accctcgtac ttcttgatgg ccgggcagaa gttgtcgcac aggtacgaca gcacgcgccc 15600
gcttcggacg tagacgcggt aaaacagagc ggggcgcgtc tcgtagtagt acacgtcggt 15660
gcgctccacc acctccgcct cgaagtggtc cgcggagatg ccgcggaacg acgcgcccgg 15720
ggactcgcgc agggccgcgg ccatgcgctc gcagagatct cgtggggcgc ggcattgtag 15780
gtgcctgtcg acctcctcct tgttcatgta aaagtactgc cgcgtgccgt aaacgtgaac 15840
ggccacccgg tggccttccg gagtcaggcc caggagcgtg atgacggtcc ccgtcggtgt 15900
gatggcgtcc ataaaccgcg cgtggaactg ggccgcgcgc atgccgtacg cgtgctccac 15960
gttctccagg atgtcgtaca cgtgaaagac ggtgacggtg gggttgaacc ccgccggggc 16020
gtggtccacg ccgccccaca ggcgcgagcg ccgcggccag aagccgcccg acccgacgcg 16080
gaggacgtcg cgctcgtccc ccccgcagta caccttgggg gcgcgcttga ggtgaccgtc 16140
gtgcaccccg gcgcgcttct ccgggggggc atcctcgtcc agcacccgcg gggcgatgaa 16200
tcgaaattca tcgcattcgc tatagtacgt atggcgctgg gttggcccgg tcggcttctg 16260
ttgcgtcccg actggggcga ggtaggggtt gtaaaagttt tgcctcaaac aaggcggggg 16320
tccccggctg gctccgcgag ggccggcggg cgcaaaaaac ccggacgccg ccctggccgc 16380
cgactttcct ccgggggaca gcgggccgcc gccaccggaa aacatggtgg ctttaccaac 16440
agtaccggtg gatcgggccc gcggtgcgcc ggcgtttgca aaagcctagg cctccaaaaa 16500
agcctcctca ctacttctgg aatagctcag aggccgaggc ggcctcggcc tctgcataaa 16560
taaaaaaaat tagtcagcca tggggcggag aatgggcgga actgggcgga gttaggggcg 16620
ggatgggcgg agttaggggc gggactatgg ttgctgacta attgagatgc atgctttgca 16680
tacttctgcc tgctggggag cctggggact ttccacacct ggttgctgac taattgagat 16740
gcatgctttg catacttctg cctgctgggg agcctgggga ctttccacac cctaactgac 16800
acacatttaa atgaagatat ggtgcactct cagtacaatc tgctctgatg ccgcatagtt 16860
aagccagccc cgacacccgc caacacccgc tgacgcgccc tgacgggctt gtctgctccc 16920
ggcatccgct tacagacaag ctgtgaccgt ctccgggagc tgcatgtgtc agaggttttc 16980
accgtcatca ccgaaacgcg cga 17003
<210> 103
<211> 17100
<212> DNA
<213> Artificial Sequence
<220>
<223> Description of Artificial Sequence: Synthetic polynucleotide
<400> 103
tgcagctctg gcccgtgtct caaaatctct gatgttacat tgcacaagat aaaaatatat 60
catcatgaac aataaaactg tctgcttaca taaacagtaa tacaaggggt gttatgagcc 120
atattcaacg ggaaacgtcg aggccgcgat taaattccaa catggatgct gatttatatg 180
ggtataaatg ggctcgcgat aatgtcgggc aatcaggtgc gacaatctat cgcttgtatg 240
ggaagcccga tgcgccagag ttgtttctga aacatggcaa aggtagcgtt gccaatgatg 300
ttacagatga gatggtcaga ctaaactggc tgacggaatt tatgcctctt ccgaccatca 360
agcattttat ccgtactcct gatgatgcat ggttactcac cactgcgatc cccggaaaaa 420
cagcattcca ggtattagaa gaatatcctg attcaggtga aaatattgtt gatgcgctgg 480
cagtgttcct gcgccggttg cattcgattc ctgtttgtaa ttgtcctttt aacagcgatc 540
gcgtatttcg tctcgctcag gcgcaatcac gaatgaataa cggtttggtt gatgcgagtg 600
attttgatga cgagcgtaat ggctggcctg ttgaacaagt ctggaaagaa atgcataaac 660
ttttgccatt ctcaccggat tcagtcgtca ctcatggtga tttctcactt gataacctta 720
tttttgacga ggggaaatta ataggttgta ttgatgttgg acgagtcgga atcgcagacc 780
gataccagga tcttgccatc ctatggaact gcctcggtga gttttctcct tcattacaga 840
aacggctttt tcaaaaatat ggtattgata atcctgatat gaataaattg cagtttcatt 900
tgatgctcga tgagtttttc taatcagaat tggttaattg gttgtaacat tattcagatt 960
gggcttgatt taaaacttca tttttaattt aaaaggatct aggtgaagat cctttttgat 1020
aatctcatga ccaaaatccc ttaacgtgag ttttcgttcc actgagcgtc agaccccgta 1080
gaaaagatca aaggatcttc ttgagatcct ttttttctgc gcgtaatctg ctgcttgcaa 1140
acaaaaaaac caccgctacc agcggtggtt tgtttgccgg atcaagagct accaactctt 1200
tttccgaagg taactggctt cagcagagcg cagataccaa atactgttct tctagtgtag 1260
ccgtagttag gccaccactt caagaactct gtagcaccgc ctacatacct cgctctgcta 1320
atcctgttac cagtggctgc tgccagtggc gataagtcgt gtcttaccgg gttggactca 1380
agacgatagt taccggataa ggcgcagcgg tcgggctgaa cggggggttc gtgcacacag 1440
cccagcttgg agcgaacgac ctacaccgaa ctgagatacc tacagcgtga gctatgagaa 1500
agcgccacgc ttcccgaagg gagaaaggcg gacaggtatc cggtaagcgg cagggtcgga 1560
acaggagagc gcacgaggga gcttccaggg ggaaacgcct ggtatcttta tagtcctgtc 1620
gggtttcgcc acctctgact tgagcgtcga tttttgtgat gctcgtcagg ggggcggagc 1680
ctatggaaaa acgccagcaa cgcggccttt ttacggttcc tggccttttg ctggcctttt 1740
gctcacatgt tctttcctgc gttatcccct gattctgtgg ataaccgtat taccgccttt 1800
gagtgagctg ataccgctcg ccgcagccga acgaccgagc gcagcgagtc agtgagcgag 1860
gaagcggaag agcgcccaat acgcaaaccg cctctccccg cgcgttggcc gattcattaa 1920
tgcagctggc acgacaggtt tcccgactgg aaagcgggca gtgagcgcaa cgcaattaat 1980
gtgagttagc tcactcatta ggcaccccag gctttacact ttatgcttcc ggctcgtatg 2040
ttgtgtggaa ttgtgagcgg ataacaattt cacacaggaa acagctatga ccatgattac 2100
accaagcttg catgcaggcc tatccgtaga tgtacctgga catccaggtg atgccggcgg 2160
cggtggtgga ggcgcgcgga aagtcgcgga cgcggttcca gatgttgcgc agcggcaaaa 2220
agtgctccat ggtcgggacg ctctggccgg tgaggcgtgc gcagtcgttg acgctctaga 2280
ccgtgcaaaa ggagagcctg taagcgggca ctcttccgtg gtctggtgga taaattcgca 2340
agggtatcat ggcggacgac cggggttcga accccggatc cggccgtccg ccgtgatcca 2400
tgcggttacc gcccgcgtgt cgaacccagg tgtgcgacgt cagacaacgg gggagcgctc 2460
cttttggctt ccttccaggc gcggcggctg ctgcgctagc ttttttggcc actggccgcg 2520
cgcggcgtaa gcggttaggc tggaaagcga aagcattaag tggctcgctc cctgtagccg 2580
gagggttatt ttccaagggt tgagtcgcag gacccccggt tcgagtctcg ggccggccgg 2640
actgcggcga acgggggttt gcctccccgt catgcaagac cccgcttgca aattcctccg 2700
gaaacaggga cgagcccctt ttttgctttt cccagatgca tccggtgctg cggcagatgc 2760
gcccccctcc tcagcagcgg caagagcaag agcagcggca gacatgcagg gcaccctccc 2820
cttctcctac cgcgtcagga ggggcaacat cgatccagac atgataagat acattgatga 2880
gtttggacaa accacaacta gaatgcagtg aaaaaaatgc tttatttgtg aaatttgtga 2940
tgctattgct ttatttgtaa ccattataag ctgcaataaa caagtttgta cactctcggg 3000
tgattattta cccccaccct tgccgtctgc gccgtttaaa aatcaaaggg gttctgccgc 3060
gcatcgctat gcgccactgg cagggacacg ttgcgatact ggtgtttagt gctccactta 3120
aactcaggca caaccatccg cggcagctcg gtgaagtttt cactccacag gctgcgcacc 3180
atcaccaacg cgtttagcag gtcgggcgcc gatatcttga agtcgcagtt ggggcctccg 3240
ccctgcgcgc gcgagttgcg atacacaggg ttgcagcact ggaacactat cagcgccggg 3300
tggtgcacgc tggccagcac gctcttgtcg gagatcagat ccgcgtccag gtcctccgcg 3360
ttgctcaggg cgaacggagt caactttggt agctgccttc ccaaaaaggg cgcgtgccca 3420
ggctttgagt tgcactcgca ccgtagtggc atcaaaaggt gaccgtgccc ggtctgggcg 3480
ttaggataca gcgcctgcat aaaagccttg atctgcttaa aagccacctg agcctttgcg 3540
ccttcagaga agaacatgcc gcaagacttg ccggaaaact gattggccgg acaggccgcg 3600
tcgtgcacgc agcaccttgc gtcggtgttg gagatctgca ccacatttcg gccccaccgg 3660
ttcttcacga tcttggcctt gctagactgc tccttcagcg cgcgctgccc gttttcgctc 3720
gtcacatcca tttcaatcac gtgctcctta tttatcataa tgcttccgtg tagacactta 3780
agctcgcctt cgatctcagc gcagcggtgc agccacaacg cgcagcccgt gggctcgtga 3840
tgcttgtagg tcacctctgc aaacgactgc aggtacgcct gcaggaatcg ccccatcatc 3900
gtcacaaagg tcttgttgct ggtgaaggtc agctgcaacc cgcggtgctc ctcgttcagc 3960
caggtcttgc atacggccgc cagagcttcc acttggtcag gcagtagttt gaagttcgcc 4020
tttagatcgt tatccacgtg gtacttgtcc atcagcgcgc gcgcagcctc catgcccttc 4080
tcccacgcag acacgatcgg cacactcagc gggttcatca ccgtaatttc actttccgct 4140
tcgctgggct cttcctcttc ctcttgcgtc cgcataccac gcgccactgg gtcgtcttca 4200
ttcagccgcc gcactgtgcg cttacctcct ttgccatgct tgattagcac cggtgggttg 4260
ctgaaaccca ccatttgtag cgccacatct tctctttctt cctcgctgtc cacgattacc 4320
tctggtgatg gcgggcgctc gggcttggga gaagggcgct tctttttctt cttgggcgca 4380
atggccaaat ccgccgccga ggtcgatggc cgcgggctgg gtgtgcgcgg caccagcgcg 4440
tcttgtgatg agtcttcctc gtcctcggac tcgatacgcc gcctcatccg cttttttggg 4500
ggcgcccggg gaggcggcgg cgacggggac ggggacgaca cgtcctccat ggttggggga 4560
cgtcgcgccg caccgcgtcc gcgctcgggg gtggtttcgc gctgctcctc ttcccgactg 4620
gccatttcct tctcctatag gcagaaaaag atcatggagt cagtcgagaa gaaggacagc 4680
ctaaccgccc cctctgagtt cgccaccacc gcctccaccg atgccgccaa cgcgcctacc 4740
accttccccg tcgaggcacc cccgcttgag gaggaggaag tgattatcga gcaggaccca 4800
ggttttgtaa gcgaagacga cgaggaccgc tcagtaccaa cagaggataa aaagcaagac 4860
caggacaacg cagaggcaaa cgaggaacaa gtcgggcggg gggacgaaag gcatggcgac 4920
tacctagatg tgggagacga cgtgctgttg aagcatctgc agcgccagtg cgccattatc 4980
tgcgacgcgt tgcaagagcg cagcgatgtg cccctcgcca tagcggatgt cagccttgcc 5040
tacgaacgcc acctattctc accgcgcgta ccccccaaac gccaagaaaa cggcacatgc 5100
gagcccaacc cgcgcctcaa cttctacccc gtatttgccg tgccagaggt gcttgccacc 5160
tatcacatct ttttccaaaa ctgcaagata cccctatcct gccgtgccaa ccgcagccga 5220
gcggacaagc agctggcctt gcggcagggc gctgtcatac ctgatatcgc ctcgctcaac 5280
gaagtgccaa aaatctttga gggtcttgga cgcgacgaga agcgcgcggc aaacgctctg 5340
caacaggaaa acagcgaaaa tgaaagtcac tctggagtgt tggtggaact cgagggtgac 5400
aacgcgcgcc tagccgtact aaaacgcagc atcgaggtca cccactttgc ctacccggca 5460
cttaacctac cccccaaggt catgagcaca gtcatgagtg agctgatcgt gcgccgtgcg 5520
cagcccctgg agagggatgc aaatttgcaa gaacaaacag aggagggcct acccgcagtt 5580
ggcgacgagc agctagcgcg ctggcttcaa acgcgcgagc ctgccgactt ggaggagcga 5640
cgcaaactaa tgatggccgc agtgctcgtt accgtggagc ttgagtgcat gcagcggttc 5700
tttgctgacc cggagatgca gcgcaagcta gaggaaacat tgcactacac ctttcgacag 5760
ggctacgtac gccaggcctg caagatctcc aacgtggagc tctgcaacct ggtctcctac 5820
cttggaattt tgcacgaaaa ccgccttggg caaaacgtgc ttcattccac gctcaagggc 5880
gaggcgcgcc gcgactacgt ccgcgactgc gtttacttat ttctatgcta cacctggcag 5940
acggccatgg gcgtttggca gcagtgcttg gaggagtgca acctcaagga gctgcagaaa 6000
ctgctaaagc aaaacttgaa ggacctatgg acggccttca acgagcgctc cgtggccgcg 6060
cacctggcgg acatcatttt ccccgaacgc ctgcttaaaa ccctgcaaca gggtctgcca 6120
gacttcacca gtcaaagcat gttgcagaac tttaggaact ttatcctaga gcgctcagga 6180
atcttgcccg ccacctgctg tgcacttcct agcgactttg tgcccattaa gtaccgcgaa 6240
tgccctccgc cgctttgggg ccactgctac cttctgcagc tagccaacta ccttgcctac 6300
cactctgaca taatggaaga cgtgagcggt gacggtctac tggagtgtca ctgtcgctgc 6360
aacctatgca ccccgcaccg ctccctggtt tgcaattcgc agctgcttaa cgaaagtcaa 6420
attatcggta cctttgagct gcagggtccc tcgcctgacg aaaagtccgc ggctccgggg 6480
ttgaaactca ctccggggct gtggacgtcg gcttaccttc gcaaatttgt acctgaggac 6540
taccacgccc acgagattag gttctacgaa gaccaatccc gcccgcctaa tgcggagctt 6600
accgcctgcg tcattaccca gggccacatt cttggccaat tgcaagccat caacaaagcc 6660
cgccaagagt ttctgctacg aaagggacgg ggggtttact tggaccccca gtccggcgag 6720
gagctcaacc caatcccccc gccgccgcag ccctatcagc agcagccgcg ggcccttgct 6780
tcccaggatg gcacccaaaa agaagctgca gctgccgccg ccacccacgg acgaggagga 6840
atactgggac agtcaggcag aggaggtttt ggacgaggag gaggaggaca tgatggaaga 6900
ctgggagagc ctagacgagg aagcttccga ggtcgaagag gtgtcagacg aaacaccgtc 6960
accctcggtc gcattcccct cgccggcgcc ccagaaatcg gcaaccggtt ccagcatggc 7020
tacaacctcc gctcctcagg cgccgccggc actgcccgtt cgccgaccca accgtagatg 7080
ggacaccact ggaaccaggg ccggtaagtc caagcagccg ccgccgttag cccaagagca 7140
acaacagcgc caaggctacc gctcatggcg cgggcacaag aacgccatag ttgcttgctt 7200
gcaagactgt gggggcaaca tctccttcgc ccgccgcttt cttctctacc atcacggcgt 7260
ggccttcccc cgtaacatcc tgcattacta ccgtcatctc tacagcccat actgcaccgg 7320
cggcagcggc agcaacagca gcggccacac agaagcaaag gcgaccggat agcaagactc 7380
tgacaaagcc caagaaatcc acagcggcgg cagcagcagg aggaggagcg ctgcgtctgg 7440
cgcccaacga acccgtatcg acccgcgagc ttagaaacag gatttttccc actctgtatg 7500
ctatatttca acagagcagg ggccaagaac aagagctgaa aataaaaaac aggtctctgc 7560
gatccctcac ccgcagctgc ctgtatcaca aaagcgaaga tcagcttcgg cgcacgctgg 7620
aagacgcgga ggctctcttc agtaaatact gcgcgctgac tcttaaggac tagtttcgcg 7680
ccctttctca aatttaagcg cgaaaactac gtcatctcca gcggccacac ccggcgccag 7740
cacctgttgt cagcgccatt atgagcaagg aaattcccac gccctacatg tggagttacc 7800
agccacaaat gggacttgcg gctggagctg cccaagacta ctcaacccga ataaactaca 7860
tgagcgcggg gcggccgcaa cttgtttatt gcagcttata atggttacaa ataaagcaat 7920
agcatcacaa atttcacaaa taaagcattt ttttcactgc attctagttg tggtttgtcc 7980
aaactcatca atgtatctta gcttaacggg cggcgaagga gaagtccacg cctacatggg 8040
ggtagagtca taatcgtgca tcaggatagg gcggtggtgc tgcagcagcg cgcgaataaa 8100
ctgctgccgc cgccgctccg tcctgcagga atacaacatg gcagtggtct cctcagcgat 8160
gattcgcacc gcccgcagca taaggcgcct tgtcctccgg gcacagcagc gcaccctgat 8220
ctcacttaaa tcagcacagt aactgcagca cagcaccaca atattgttca aaatcccaca 8280
gtgcaaggcg ctgtatccaa agctcatggc ggggaccaca gaacccacgt ggccatcata 8340
ccacaagcgc aggtagatta agtggcgacc cctcataaac acgctggaca taaacattac 8400
ctcttttggc atgttgtaat tcaccacctc ccggtaccat ataaacctct gattaaacat 8460
ggcgccatcc accaccatcc taaaccagct ggccaaaacc tgcccgccgg ctatacactg 8520
cagggaaccg ggactggaac aatgacagtg gagagcccag gactcgtaac catggatcat 8580
catgctcgtc atgatatcaa tgttggcaca acacaggcac acgtgcatac acttcctcag 8640
gattacaagc tcctcccgcg ttagaaccat atcccaggga acaacccatt cctgaatcag 8700
cgtaaatccc acactgcagg gaagacctcg cacgtaactc acgttgtgca ttgtcaaagt 8760
gttacattcg ggcagcagcg gatgatcctc cagtatggta gcgcgggttt ctgtctcaaa 8820
aggaggtaga cgatccctac tgtacggagt gcgccgagac aaccgagatc gtgttggtcg 8880
tagtgtcatg ccaaatggaa cgccggacgt agtcatattt cctgaagcaa aaccaggtgc 8940
gggcgtgaca aacagatctg cgtctccggt ctcgccgctt agatcgctct gtgtagtagt 9000
tgtagtatat ccactctctc aaagcatcca ggcgccccct ggcttcgggt tctatgtaaa 9060
ctccttcatg cgccgctgcc ctgataacat ccaccaccgc agaataagcc acacccagcc 9120
aacctacaca ttcgttctgc gagtcacaca cgggaggagc gggaagagct ggaagaacca 9180
tgtttttttt tttattccaa aagattatcc aaaacctcaa aatgaagatc tattaagtga 9240
acgcgctccc ctccggtggc gtggtcaaac tctacagcca aagaacagat aatggcattt 9300
gtaagatgtt gcacaatggc ttccaaaagg caaacggccc tcacgtccaa gtggacgtaa 9360
aggctaaacc cttcagggtg aatctcctct ataaacattc cagcaccttc aaccatgccc 9420
aaataattct catctcgcca ccttctcaat atatctctaa gcaaatcccg aatattaagt 9480
ccggccattg taaaaatctg ctccagagcg ccctccacct tcagcctcaa gcagcgaatc 9540
atgattgcaa aaattcaggt tcctcacaga cctgtataag attcaaaagc ggaacattaa 9600
caaaaatacc gcgatcccgt aggtcccttc gcagggccag ctgaacataa tcgtgcaggt 9660
ctgcacggac cagcgcggcc acttccccgc caggaaccat gacaaaagaa cccacactga 9720
ttatgacacg catactcgga gctatgctaa ccagcgtagc cccgatgtaa gcttgttgca 9780
tgggcggcga tataaaatgc aaggtgctgc tcaaaaaatc aggcaaagcc tcgcgcaaaa 9840
aagaaagcac atcgtagtca tgctcatgca gataaaggca ggtaagctcc ggaaccacca 9900
cagaaaaaga caccattttt ctctcaaaca tgtctgcggg tttctgcata aacacaaaat 9960
aaaataacaa aaaaacattt aaacattaga agcctgtctt acaacaggaa aaacaaccct 10020
tataagcata agacggacta cggccatgcc ggcgtgaccg taaaaaaact ggtcaccgtg 10080
attaaaaagc accaccgaca gctcctcggt catgtccgga gtcataatgt aagactcggt 10140
aaacacatca ggttgattca catcggtcag tgctaaaaag cgaccgaaat agcccggggg 10200
aatacatacc cgcaggcgta gagacaacat tacagccccc ataggaggta taacaaaatt 10260
aataggagag aaaaacacat aaacacctga aaaaccctcc tgcctaggca aaatagcacc 10320
ctcccgctcc agaacaacat acagcgcttc cacagcggca gccatggtgg catttgcaaa 10380
agcctaggcc tccaaaaaag cctcctcact acttctggaa tagctcagag gccgaggcgg 10440
cctcggcctc tgcataaata aaaaaaatta gtcagccatg gggcggagaa tgggcggaac 10500
tgggcggagt taggggcggg atgggcggag ttaggggcgg gactatggtt gctgactaat 10560
tgagatgcat gctttgcata cttctgcctg ctggggagcc tggggacttt ccacacctgg 10620
ttgctgacta attgagatgc atgctttgca tacttctgcc tgctggggag cctggggact 10680
ttccacaccc taactgacac acacgttacg tcacttccca ttttaagaaa actacaattc 10740
ccaacacata caagttactc cgcccttaat taaatcggat ccgatatcta gatgtattcg 10800
cgaggtaccg agctcgaatt ctctggccgt cgttttacaa cgtcgtgact gggaaaaccc 10860
tggcgttacc caacttaatc gccttgcagc acatccccct ttcgccagct ggcgtaatag 10920
cgaagaggcc cgcaccgatc gcccttccca acagttgcgc agcctgaatg gcgaatggcg 10980
cctgatgcgg tattttctcc ttacgcatct gtgcggtatt tcacaccgca tatcttcatt 11040
taaatgtgtg tcagttaggg tgtggaaagt ccccaggctc cccagcaggc agaagtatgc 11100
aaagcatgca tctcaattag tcagcaacca ggtgtggaaa gtccccaggc tccccagcag 11160
gcagaagtat gcaaagcatg catctcaatt agtcagcaac catagtcccg cccctaactc 11220
cgcccatccc gcccctaact ccgcccagtt ccgcccattc tccgccccat ggctgactaa 11280
ttttttttat ttatgcagag gccgaggccg cctcggcctc tgagctattc cagaagtagt 11340
gaggaggctt ttttggaggc ctaggctttt gcaaacgccg gcgcaccgcg ggcccgatcc 11400
accggtactg ttggtaaagc caccatgttt tccggtggcg gcggcccgct gtcccccgga 11460
ggaaagtcgg cggccagggc ggcgtccggg ttttttgcgc ccgccggccc tcgcggagcc 11520
agccggggac ccccgccttg tttgaggcaa aacttttaca acccctacct cgccccagtc 11580
gggacgcaac agaagccgac cgggccaacc cagcgccata cgtactatag cgaatgcgat 11640
gaatttcgat tcatcgcccc gcgggtgctg gacgaggatg cccccccgga gaagcgcgcc 11700
ggggtgcacg acggtcacct caagcgcgcc cccaaggtgt actgcggggg ggacgagcgc 11760
gacgtcctcc gcgtcgggtc gggcggcttc tggccgcggc gctcgcgcct gtggggcggc 11820
gtggaccacg ccccggcggg gttcaacccc accgtcaccg tctttcacgt gtacgacatc 11880
ctggagaacg tggagcacgc gtacggcatg cgcgcggccc agttccacgc gcggtttatg 11940
gacgccatca caccgacggg gaccgtcatc acgctcctgg gcctgactcc ggaaggccac 12000
cgggtggccg ttcacgttta cggcacgcgg cagtactttt acatgaacaa ggaggaggtc 12060
gacaggcacc tacaatgccg cgccccacga gatctctgcg agcgcatggc cgcggccctg 12120
cgcgagtccc cgggcgcgtc gttccgcggc atctccgcgg accacttcga ggcggaggtg 12180
gtggagcgca ccgacgtgta ctactacgag acgcgccccg ctctgtttta ccgcgtctac 12240
gtccgaagcg ggcgcgtgct gtcgtacctg tgcgacaact tctgcccggc catcaagaag 12300
tacgagggtg gggtcgacgc caccacccgg ttcatcctgg acaaccccgg gttcgtcacc 12360
ttcggctggt accgtctcaa accgggccgg aacaacacgc tagcccagcc gcgggccccg 12420
atggccttcg ggacatccag cgacgtcgag tttaactgta cggcggacaa cctggccatc 12480
gaggggggca tgagcgacct accggcatac aagctcatgt gcttcgatat cgaatgcaag 12540
gcgggggggg aggacgagct ggcctttccg gtggccgggc acccggagga cctggtcatc 12600
cagatatcct gtctgctcta cgacctgtcc accaccgccc tggagcacgt cctcctgttt 12660
tcgctcggtt cctgcgacct ccccgaatcc cacctgaacg agctggcggc caggggcctg 12720
cccacgcccg tggttctgga attcgacagc gaattcgaga tgctgttggc cttcatgacc 12780
cttgtgaaac agtacggccc cgagttcgtg accgggtaca acatcatcaa cttcgactgg 12840
cccttcttgc tggccaagct gacggacatt tacaaggtcc ccctggacgg gtacggccgc 12900
atgaacggcc ggggcgtgtt tcgcgtgtgg gacataggcc agagccactt ccagaagcgc 12960
agcaagataa aggtgaacgg catggtgaac atcgacatgt acgggattat aaccgacaag 13020
atcaagctct cgagctacaa gctcaacgcc gtggccgaag ccgtcctgaa ggacaagaag 13080
aaggacctga gctatcgcga catccccgcc tactacgccg ccgggcccgc gcaacgcggg 13140
gtgatcggcg agtactgcat acaggattcc ctgctggtgg gccagctgtt ttttaagttt 13200
ttgccccatc tggagctctc ggccgtcgcg cgcttggcgg gtattaacat cacccgcacc 13260
atctacgacg gccagcagat ccgcgtcttt acgtgcctgc tgcgcctggc cgaccagaag 13320
ggctttattc tgccggacac ccaggggcga tttaggggcg ccggggggga ggcgcccaag 13380
cgtccggccg cagcccggga ggacgaggag cggccagagg aggaggggga ggacgaggac 13440
gaacgcgagg agggcggggg cgagcgggag ccggagggcg cgcgggagac cgccggcagg 13500
cacgtggggt accagggggc cagggtcctt gaccccactt ccgggtttca cgtgaacccc 13560
gtggtggtgt tcgactttgc cagcctgtac cccagcatca tccaggccca caacctgtgc 13620
ttcagcacgc tctccctgag ggccgacgca gtggcgcacc tggaggcggg caaggactac 13680
ctggagatcg aggtgggggg gcgacggctg ttcttcgtca aggctcacgt gcgagagagc 13740
ctcctcagca tcctcctgcg ggactggctc gccatgcgaa agcagatccg ctcgcggatt 13800
ccccagagca gccccgagga ggccgtgctc ctggacaagc agcaggccgc catcaaggtc 13860
gtgtgtaact cggtgtacgg gttcacggga gtgcagcacg gactcctgcc gtgcctgcac 13920
gttgccgcga cggtgacgac catcggccgc gagatgctgc tcgcgacccg cgagtacgtc 13980
cacgcgcgct gggcggcctt cgaacagctc ctggccgatt tcccggaggc ggccgacatg 14040
cgcgcccccg ggccctattc catgcgcatc atctacgggg acacggactc catctttgtg 14100
ctgtgccgcg gcctcacggc cgccgggctg acggccgtgg gcgacaagat ggcgagccac 14160
atctcgcgcg cgctgtttct gccccccatc aaactcgagt gcgaaaagac gttcaccaag 14220
ctgctgctga tcgccaagaa aaagtacatc ggcgtcatct acgggggtaa gatgctcatc 14280
aagggcgtgg atctggtgcg caaaaacaac tgcgcgttta tcaaccgcac ctccagggcc 14340
ctggtcgacc tgctgtttta cgacgatacc gtctccggag ccgccgcggc gttagccgag 14400
cgccccgcgg aggagtggct ggcgcgaccc ctgcccgagg gactgcaggc gttcggggcc 14460
gtcctcgtag acgcccatcg gcgcatcacc gacccggaga gggacatcca ggactttgtc 14520
ctcaccgccg aactgagcag acacccgcgc gcgtacacca acaagcgcct ggcccacctg 14580
acggtgtatt acaagctcat ggcccgccgc gcgcaggtcc cgtccatcaa ggaccggatc 14640
ccgtacgtga tcgtggccca gacccgcgag gtagaggaga cggtcgcgcg gctggccgcc 14700
ctccgcgagc tagacgccgc cgccccaggg gacgagcccg ccccccccgc ggccctgccc 14760
tccccggcca agcgcccccg ggagacgccg tcgcctgccg accccccggg aggcgcgtcc 14820
aagccccgca agctgctggt gtccgagctg gccgaggatc ccgcatacgc cattgcccac 14880
ggcgtcgccc tgaacacgga ctattacttc tcccacctgt tgggggcggc gtgcgtgaca 14940
ttcaaggccc tgtttgggaa taacgccaag atcaccgaga gtctgttaaa aaggtttatt 15000
cccgaagtgt ggcacccccc ggacgacgtg gccgcgcggc tccggaccgc agggttcggg 15060
gcggtgggtg ccggcgctac ggcggaggaa actcgtcgaa tgttgcatag agcctttgat 15120
actctagcag aattcggcag tggagcaaca aacttctctc tgctgaaaca agccggagat 15180
gtcgaagaga atcctggacc gacggattcc cctggcggtg tggcccccgc ctcccccgtg 15240
gaggacgcgt cggacgcgtc cctcgggcag ccggaggagg gggcgccctg ccaggtggtc 15300
ctgcagggcg ccgaacttaa tggaatccta caggcgtttg ccccgctgcg cacgagcctt 15360
ctggactcgc ttctggttat gggcgaccgg ggcatcctta tccataacac gatctttggg 15420
gagcaggtgt tcctgcccct ggaacactcg caattcagtc ggtatcgctg gcgcggaccc 15480
acggcggcgt tcctgtctct cgtggaccag aagcgctccc tcctgagcgt gtttcgcgcc 15540
aaccagtacc cggacctacg tcgggtggag ttggcgatca cgggccaggc cccgtttcgc 15600
acgctggttc agcgcatatg gacgacgacg tccgacggcg aggccgttga gctagccagc 15660
gagacgctga tgaagcgcga actgacgagc tttgtggtgc tggttcccca gggaaccccc 15720
gacgttcagt tgcgcctgac gaggccgcag ctcaccaagg tccttaacgc gaccggggcc 15780
gatagtgcca cgcccaccac gttcgagctc ggggttaacg gcaaattttc cgtgttcacc 15840
acgagtacct gcgtcacctt tgctgcccgc gaggagggcg tgtcgtccag caccagcacc 15900
caggtccaga tcctgtccaa cgcgctcacc aaggcgggcc aggccgccgc gaacgccaag 15960
acggtgtacg gggaaaatac ccatcgcacc ttctctgtgg tcgtcgacga ttgcagcatg 16020
cgggcggtgc tccggcgact gcaggtcggc gggggcaccc tcaagttctt cctcacgacc 16080
cccgtcccca gtctgtgcgt caccgccacc ggtcccaacg cggtatcggc ggtatttctc 16140
ctgaaacccc agaagatttg cctggactgg ctgggtcata gccaggggtc tccttcagcc 16200
gggagctcgg cctcccgggc ctctgggagc gagccaacag acagccagga ctccgcgtcg 16260
gacgcggtca gccacggcga tccggaagac ctcgatggcg ctgcccgggc gggagaggcg 16320
ggggccttgc atgcctgtcc gatgccgtcg tcgaccacgc gggtcactcc cacgaccaag 16380
cgggggcgct cggggggcga ggatgcgcgc gcggacacgg ccctaaagaa acctaagacg 16440
gggtcgccca ccgcaccccc gcccgcagat ccagtccccc tggacacgga ggacgactcc 16500
gatgcggcgg acgggacggc ggcccgtccc gccgctccag acgcccggag cggaagccgt 16560
tacgcgtgtt actttcgcga cctcccgacc ggagaagcaa gccccggcgc cttctccgcc 16620
ttccgggggg gcccccaaac cccgtatggt tttggattcc cctgatagga tccgactgca 16680
ggtagctgtg ccttctagtt gccagccatc tgttgtttgc ccctcccccg tgccttcctt 16740
gaccctggaa ggtgccactc ccactgtcct ttcctaataa aatgaggaaa ttgcatcgca 16800
ttgtctgagt aggtgtcatt ctattctggg gggtggggtg gggcaggaca gcaaggggga 16860
ggattgggaa gacaatagca ggcatgctgg ggatgcggtg ggctctatgg gttttatggt 16920
gcactctcag tacaatctgc tctgatgccg catagttaag ccagccccga cacccgccaa 16980
cacccgctga cgcgccctga cgggcttgtc tgctcccggc atccgcttac agacaagctg 17040
tgaccgtctc cgggagctgc atgtgtcaga ggttttcacc gtcatcaccg aaacgcgcga 17100
<210> 104
<211> 17100
<212> DNA
<213> Artificial Sequence
<220>
<223> Description of Artificial Sequence: Synthetic polynucleotide
<400> 104
tgcagctctg gcccgtgtct caaaatctct gatgttacat tgcacaagat aaaaatatat 60
catcatgaac aataaaactg tctgcttaca taaacagtaa tacaaggggt gttatgagcc 120
atattcaacg ggaaacgtcg aggccgcgat taaattccaa catggatgct gatttatatg 180
ggtataaatg ggctcgcgat aatgtcgggc aatcaggtgc gacaatctat cgcttgtatg 240
ggaagcccga tgcgccagag ttgtttctga aacatggcaa aggtagcgtt gccaatgatg 300
ttacagatga gatggtcaga ctaaactggc tgacggaatt tatgcctctt ccgaccatca 360
agcattttat ccgtactcct gatgatgcat ggttactcac cactgcgatc cccggaaaaa 420
cagcattcca ggtattagaa gaatatcctg attcaggtga aaatattgtt gatgcgctgg 480
cagtgttcct gcgccggttg cattcgattc ctgtttgtaa ttgtcctttt aacagcgatc 540
gcgtatttcg tctcgctcag gcgcaatcac gaatgaataa cggtttggtt gatgcgagtg 600
attttgatga cgagcgtaat ggctggcctg ttgaacaagt ctggaaagaa atgcataaac 660
ttttgccatt ctcaccggat tcagtcgtca ctcatggtga tttctcactt gataacctta 720
tttttgacga ggggaaatta ataggttgta ttgatgttgg acgagtcgga atcgcagacc 780
gataccagga tcttgccatc ctatggaact gcctcggtga gttttctcct tcattacaga 840
aacggctttt tcaaaaatat ggtattgata atcctgatat gaataaattg cagtttcatt 900
tgatgctcga tgagtttttc taatcagaat tggttaattg gttgtaacat tattcagatt 960
gggcttgatt taaaacttca tttttaattt aaaaggatct aggtgaagat cctttttgat 1020
aatctcatga ccaaaatccc ttaacgtgag ttttcgttcc actgagcgtc agaccccgta 1080
gaaaagatca aaggatcttc ttgagatcct ttttttctgc gcgtaatctg ctgcttgcaa 1140
acaaaaaaac caccgctacc agcggtggtt tgtttgccgg atcaagagct accaactctt 1200
tttccgaagg taactggctt cagcagagcg cagataccaa atactgttct tctagtgtag 1260
ccgtagttag gccaccactt caagaactct gtagcaccgc ctacatacct cgctctgcta 1320
atcctgttac cagtggctgc tgccagtggc gataagtcgt gtcttaccgg gttggactca 1380
agacgatagt taccggataa ggcgcagcgg tcgggctgaa cggggggttc gtgcacacag 1440
cccagcttgg agcgaacgac ctacaccgaa ctgagatacc tacagcgtga gctatgagaa 1500
agcgccacgc ttcccgaagg gagaaaggcg gacaggtatc cggtaagcgg cagggtcgga 1560
acaggagagc gcacgaggga gcttccaggg ggaaacgcct ggtatcttta tagtcctgtc 1620
gggtttcgcc acctctgact tgagcgtcga tttttgtgat gctcgtcagg ggggcggagc 1680
ctatggaaaa acgccagcaa cgcggccttt ttacggttcc tggccttttg ctggcctttt 1740
gctcacatgt tctttcctgc gttatcccct gattctgtgg ataaccgtat taccgccttt 1800
gagtgagctg ataccgctcg ccgcagccga acgaccgagc gcagcgagtc agtgagcgag 1860
gaagcggaag agcgcccaat acgcaaaccg cctctccccg cgcgttggcc gattcattaa 1920
tgcagctggc acgacaggtt tcccgactgg aaagcgggca gtgagcgcaa cgcaattaat 1980
gtgagttagc tcactcatta ggcaccccag gctttacact ttatgcttcc ggctcgtatg 2040
ttgtgtggaa ttgtgagcgg ataacaattt cacacaggaa acagctatga ccatgattac 2100
accaagcttg catgcaggcc tatccgtaga tgtacctgga catccaggtg atgccggcgg 2160
cggtggtgga ggcgcgcgga aagtcgcgga cgcggttcca gatgttgcgc agcggcaaaa 2220
agtgctccat ggtcgggacg ctctggccgg tgaggcgtgc gcagtcgttg acgctctaga 2280
ccgtgcaaaa ggagagcctg taagcgggca ctcttccgtg gtctggtgga taaattcgca 2340
agggtatcat ggcggacgac cggggttcga accccggatc cggccgtccg ccgtgatcca 2400
tgcggttacc gcccgcgtgt cgaacccagg tgtgcgacgt cagacaacgg gggagcgctc 2460
cttttggctt ccttccaggc gcggcggctg ctgcgctagc ttttttggcc actggccgcg 2520
cgcggcgtaa gcggttaggc tggaaagcga aagcattaag tggctcgctc cctgtagccg 2580
gagggttatt ttccaagggt tgagtcgcag gacccccggt tcgagtctcg ggccggccgg 2640
actgcggcga acgggggttt gcctccccgt catgcaagac cccgcttgca aattcctccg 2700
gaaacaggga cgagcccctt ttttgctttt cccagatgca tccggtgctg cggcagatgc 2760
gcccccctcc tcagcagcgg caagagcaag agcagcggca gacatgcagg gcaccctccc 2820
cttctcctac cgcgtcagga ggggcaacat cgatccagac atgataagat acattgatga 2880
gtttggacaa accacaacta gaatgcagtg aaaaaaatgc tttatttgtg aaatttgtga 2940
tgctattgct ttatttgtaa ccattataag ctgcaataaa caagtttgta cactctcggg 3000
tgattattta cccccaccct tgccgtctgc gccgtttaaa aatcaaaggg gttctgccgc 3060
gcatcgctat gcgccactgg cagggacacg ttgcgatact ggtgtttagt gctccactta 3120
aactcaggca caaccatccg cggcagctcg gtgaagtttt cactccacag gctgcgcacc 3180
atcaccaacg cgtttagcag gtcgggcgcc gatatcttga agtcgcagtt ggggcctccg 3240
ccctgcgcgc gcgagttgcg atacacaggg ttgcagcact ggaacactat cagcgccggg 3300
tggtgcacgc tggccagcac gctcttgtcg gagatcagat ccgcgtccag gtcctccgcg 3360
ttgctcaggg cgaacggagt caactttggt agctgccttc ccaaaaaggg cgcgtgccca 3420
ggctttgagt tgcactcgca ccgtagtggc atcaaaaggt gaccgtgccc ggtctgggcg 3480
ttaggataca gcgcctgcat aaaagccttg atctgcttaa aagccacctg agcctttgcg 3540
ccttcagaga agaacatgcc gcaagacttg ccggaaaact gattggccgg acaggccgcg 3600
tcgtgcacgc agcaccttgc gtcggtgttg gagatctgca ccacatttcg gccccaccgg 3660
ttcttcacga tcttggcctt gctagactgc tccttcagcg cgcgctgccc gttttcgctc 3720
gtcacatcca tttcaatcac gtgctcctta tttatcataa tgcttccgtg tagacactta 3780
agctcgcctt cgatctcagc gcagcggtgc agccacaacg cgcagcccgt gggctcgtga 3840
tgcttgtagg tcacctctgc aaacgactgc aggtacgcct gcaggaatcg ccccatcatc 3900
gtcacaaagg tcttgttgct ggtgaaggtc agctgcaacc cgcggtgctc ctcgttcagc 3960
caggtcttgc atacggccgc cagagcttcc acttggtcag gcagtagttt gaagttcgcc 4020
tttagatcgt tatccacgtg gtacttgtcc atcagcgcgc gcgcagcctc catgcccttc 4080
tcccacgcag acacgatcgg cacactcagc gggttcatca ccgtaatttc actttccgct 4140
tcgctgggct cttcctcttc ctcttgcgtc cgcataccac gcgccactgg gtcgtcttca 4200
ttcagccgcc gcactgtgcg cttacctcct ttgccatgct tgattagcac cggtgggttg 4260
ctgaaaccca ccatttgtag cgccacatct tctctttctt cctcgctgtc cacgattacc 4320
tctggtgatg gcgggcgctc gggcttggga gaagggcgct tctttttctt cttgggcgca 4380
atggccaaat ccgccgccga ggtcgatggc cgcgggctgg gtgtgcgcgg caccagcgcg 4440
tcttgtgatg agtcttcctc gtcctcggac tcgatacgcc gcctcatccg cttttttggg 4500
ggcgcccggg gaggcggcgg cgacggggac ggggacgaca cgtcctccat ggttggggga 4560
cgtcgcgccg caccgcgtcc gcgctcgggg gtggtttcgc gctgctcctc ttcccgactg 4620
gccatttcct tctcctatag gcagaaaaag atcatggagt cagtcgagaa gaaggacagc 4680
ctaaccgccc cctctgagtt cgccaccacc gcctccaccg atgccgccaa cgcgcctacc 4740
accttccccg tcgaggcacc cccgcttgag gaggaggaag tgattatcga gcaggaccca 4800
ggttttgtaa gcgaagacga cgaggaccgc tcagtaccaa cagaggataa aaagcaagac 4860
caggacaacg cagaggcaaa cgaggaacaa gtcgggcggg gggacgaaag gcatggcgac 4920
tacctagatg tgggagacga cgtgctgttg aagcatctgc agcgccagtg cgccattatc 4980
tgcgacgcgt tgcaagagcg cagcgatgtg cccctcgcca tagcggatgt cagccttgcc 5040
tacgaacgcc acctattctc accgcgcgta ccccccaaac gccaagaaaa cggcacatgc 5100
gagcccaacc cgcgcctcaa cttctacccc gtatttgccg tgccagaggt gcttgccacc 5160
tatcacatct ttttccaaaa ctgcaagata cccctatcct gccgtgccaa ccgcagccga 5220
gcggacaagc agctggcctt gcggcagggc gctgtcatac ctgatatcgc ctcgctcaac 5280
gaagtgccaa aaatctttga gggtcttgga cgcgacgaga agcgcgcggc aaacgctctg 5340
caacaggaaa acagcgaaaa tgaaagtcac tctggagtgt tggtggaact cgagggtgac 5400
aacgcgcgcc tagccgtact aaaacgcagc atcgaggtca cccactttgc ctacccggca 5460
cttaacctac cccccaaggt catgagcaca gtcatgagtg agctgatcgt gcgccgtgcg 5520
cagcccctgg agagggatgc aaatttgcaa gaacaaacag aggagggcct acccgcagtt 5580
ggcgacgagc agctagcgcg ctggcttcaa acgcgcgagc ctgccgactt ggaggagcga 5640
cgcaaactaa tgatggccgc agtgctcgtt accgtggagc ttgagtgcat gcagcggttc 5700
tttgctgacc cggagatgca gcgcaagcta gaggaaacat tgcactacac ctttcgacag 5760
ggctacgtac gccaggcctg caagatctcc aacgtggagc tctgcaacct ggtctcctac 5820
cttggaattt tgcacgaaaa ccgccttggg caaaacgtgc ttcattccac gctcaagggc 5880
gaggcgcgcc gcgactacgt ccgcgactgc gtttacttat ttctatgcta cacctggcag 5940
acggccatgg gcgtttggca gcagtgcttg gaggagtgca acctcaagga gctgcagaaa 6000
ctgctaaagc aaaacttgaa ggacctatgg acggccttca acgagcgctc cgtggccgcg 6060
cacctggcgg acatcatttt ccccgaacgc ctgcttaaaa ccctgcaaca gggtctgcca 6120
gacttcacca gtcaaagcat gttgcagaac tttaggaact ttatcctaga gcgctcagga 6180
atcttgcccg ccacctgctg tgcacttcct agcgactttg tgcccattaa gtaccgcgaa 6240
tgccctccgc cgctttgggg ccactgctac cttctgcagc tagccaacta ccttgcctac 6300
cactctgaca taatggaaga cgtgagcggt gacggtctac tggagtgtca ctgtcgctgc 6360
aacctatgca ccccgcaccg ctccctggtt tgcaattcgc agctgcttaa cgaaagtcaa 6420
attatcggta cctttgagct gcagggtccc tcgcctgacg aaaagtccgc ggctccgggg 6480
ttgaaactca ctccggggct gtggacgtcg gcttaccttc gcaaatttgt acctgaggac 6540
taccacgccc acgagattag gttctacgaa gaccaatccc gcccgcctaa tgcggagctt 6600
accgcctgcg tcattaccca gggccacatt cttggccaat tgcaagccat caacaaagcc 6660
cgccaagagt ttctgctacg aaagggacgg ggggtttact tggaccccca gtccggcgag 6720
gagctcaacc caatcccccc gccgccgcag ccctatcagc agcagccgcg ggcccttgct 6780
tcccaggatg gcacccaaaa agaagctgca gctgccgccg ccacccacgg acgaggagga 6840
atactgggac agtcaggcag aggaggtttt ggacgaggag gaggaggaca tgatggaaga 6900
ctgggagagc ctagacgagg aagcttccga ggtcgaagag gtgtcagacg aaacaccgtc 6960
accctcggtc gcattcccct cgccggcgcc ccagaaatcg gcaaccggtt ccagcatggc 7020
tacaacctcc gctcctcagg cgccgccggc actgcccgtt cgccgaccca accgtagatg 7080
ggacaccact ggaaccaggg ccggtaagtc caagcagccg ccgccgttag cccaagagca 7140
acaacagcgc caaggctacc gctcatggcg cgggcacaag aacgccatag ttgcttgctt 7200
gcaagactgt gggggcaaca tctccttcgc ccgccgcttt cttctctacc atcacggcgt 7260
ggccttcccc cgtaacatcc tgcattacta ccgtcatctc tacagcccat actgcaccgg 7320
cggcagcggc agcaacagca gcggccacac agaagcaaag gcgaccggat agcaagactc 7380
tgacaaagcc caagaaatcc acagcggcgg cagcagcagg aggaggagcg ctgcgtctgg 7440
cgcccaacga acccgtatcg acccgcgagc ttagaaacag gatttttccc actctgtatg 7500
ctatatttca acagagcagg ggccaagaac aagagctgaa aataaaaaac aggtctctgc 7560
gatccctcac ccgcagctgc ctgtatcaca aaagcgaaga tcagcttcgg cgcacgctgg 7620
aagacgcgga ggctctcttc agtaaatact gcgcgctgac tcttaaggac tagtttcgcg 7680
ccctttctca aatttaagcg cgaaaactac gtcatctcca gcggccacac ccggcgccag 7740
cacctgttgt cagcgccatt atgagcaagg aaattcccac gccctacatg tggagttacc 7800
agccacaaat gggacttgcg gctggagctg cccaagacta ctcaacccga ataaactaca 7860
tgagcgcggg gcggccgcaa cttgtttatt gcagcttata atggttacaa ataaagcaat 7920
agcatcacaa atttcacaaa taaagcattt ttttcactgc attctagttg tggtttgtcc 7980
aaactcatca atgtatctta gcttaacggg cggcgaagga gaagtccacg cctacatggg 8040
ggtagagtca taatcgtgca tcaggatagg gcggtggtgc tgcagcagcg cgcgaataaa 8100
ctgctgccgc cgccgctccg tcctgcagga atacaacatg gcagtggtct cctcagcgat 8160
gattcgcacc gcccgcagca taaggcgcct tgtcctccgg gcacagcagc gcaccctgat 8220
ctcacttaaa tcagcacagt aactgcagca cagcaccaca atattgttca aaatcccaca 8280
gtgcaaggcg ctgtatccaa agctcatggc ggggaccaca gaacccacgt ggccatcata 8340
ccacaagcgc aggtagatta agtggcgacc cctcataaac acgctggaca taaacattac 8400
ctcttttggc atgttgtaat tcaccacctc ccggtaccat ataaacctct gattaaacat 8460
ggcgccatcc accaccatcc taaaccagct ggccaaaacc tgcccgccgg ctatacactg 8520
cagggaaccg ggactggaac aatgacagtg gagagcccag gactcgtaac catggatcat 8580
catgctcgtc atgatatcaa tgttggcaca acacaggcac acgtgcatac acttcctcag 8640
gattacaagc tcctcccgcg ttagaaccat atcccaggga acaacccatt cctgaatcag 8700
cgtaaatccc acactgcagg gaagacctcg cacgtaactc acgttgtgca ttgtcaaagt 8760
gttacattcg ggcagcagcg gatgatcctc cagtatggta gcgcgggttt ctgtctcaaa 8820
aggaggtaga cgatccctac tgtacggagt gcgccgagac aaccgagatc gtgttggtcg 8880
tagtgtcatg ccaaatggaa cgccggacgt agtcatattt cctgaagcaa aaccaggtgc 8940
gggcgtgaca aacagatctg cgtctccggt ctcgccgctt agatcgctct gtgtagtagt 9000
tgtagtatat ccactctctc aaagcatcca ggcgccccct ggcttcgggt tctatgtaaa 9060
ctccttcatg cgccgctgcc ctgataacat ccaccaccgc agaataagcc acacccagcc 9120
aacctacaca ttcgttctgc gagtcacaca cgggaggagc gggaagagct ggaagaacca 9180
tgtttttttt tttattccaa aagattatcc aaaacctcaa aatgaagatc tattaagtga 9240
acgcgctccc ctccggtggc gtggtcaaac tctacagcca aagaacagat aatggcattt 9300
gtaagatgtt gcacaatggc ttccaaaagg caaacggccc tcacgtccaa gtggacgtaa 9360
aggctaaacc cttcagggtg aatctcctct ataaacattc cagcaccttc aaccatgccc 9420
aaataattct catctcgcca ccttctcaat atatctctaa gcaaatcccg aatattaagt 9480
ccggccattg taaaaatctg ctccagagcg ccctccacct tcagcctcaa gcagcgaatc 9540
atgattgcaa aaattcaggt tcctcacaga cctgtataag attcaaaagc ggaacattaa 9600
caaaaatacc gcgatcccgt aggtcccttc gcagggccag ctgaacataa tcgtgcaggt 9660
ctgcacggac cagcgcggcc acttccccgc caggaaccat gacaaaagaa cccacactga 9720
ttatgacacg catactcgga gctatgctaa ccagcgtagc cccgatgtaa gcttgttgca 9780
tgggcggcga tataaaatgc aaggtgctgc tcaaaaaatc aggcaaagcc tcgcgcaaaa 9840
aagaaagcac atcgtagtca tgctcatgca gataaaggca ggtaagctcc ggaaccacca 9900
cagaaaaaga caccattttt ctctcaaaca tgtctgcggg tttctgcata aacacaaaat 9960
aaaataacaa aaaaacattt aaacattaga agcctgtctt acaacaggaa aaacaaccct 10020
tataagcata agacggacta cggccatgcc ggcgtgaccg taaaaaaact ggtcaccgtg 10080
attaaaaagc accaccgaca gctcctcggt catgtccgga gtcataatgt aagactcggt 10140
aaacacatca ggttgattca catcggtcag tgctaaaaag cgaccgaaat agcccggggg 10200
aatacatacc cgcaggcgta gagacaacat tacagccccc ataggaggta taacaaaatt 10260
aataggagag aaaaacacat aaacacctga aaaaccctcc tgcctaggca aaatagcacc 10320
ctcccgctcc agaacaacat acagcgcttc cacagcggca gccatggtgg catttgcaaa 10380
agcctaggcc tccaaaaaag cctcctcact acttctggaa tagctcagag gccgaggcgg 10440
cctcggcctc tgcataaata aaaaaaatta gtcagccatg gggcggagaa tgggcggaac 10500
tgggcggagt taggggcggg atgggcggag ttaggggcgg gactatggtt gctgactaat 10560
tgagatgcat gctttgcata cttctgcctg ctggggagcc tggggacttt ccacacctgg 10620
ttgctgacta attgagatgc atgctttgca tacttctgcc tgctggggag cctggggact 10680
ttccacaccc taactgacac acacgttacg tcacttccca ttttaagaaa actacaattc 10740
ccaacacata caagttactc cgcccttaat taaatcggat ccgatatcta gatgtattcg 10800
cgaggtaccg agctcgaatt ctctggccgt cgttttacaa cgtcgtgact gggaaaaccc 10860
tggcgttacc caacttaatc gccttgcagc acatccccct ttcgccagct ggcgtaatag 10920
cgaagaggcc cgcaccgatc gcccttccca acagttgcgc agcctgaatg gcgaatggcg 10980
cctgatgcgg tattttctcc ttacgcatct gtgcggtatt tcacaccgca taaaacccat 11040
agagcccacc gcatccccag catgcctgct attgtcttcc caatcctccc ccttgctgtc 11100
ctgccccacc ccacccccca gaatagaatg acacctactc agacaatgcg atgcaatttc 11160
ctcattttat taggaaagga cagtgggagt ggcaccttcc agggtcaagg aaggcacggg 11220
ggaggggcaa acaacagatg gctggcaact agaaggcaca gctacctgca gtcggatcct 11280
atcaggggaa tccaaaacca tacggggttt gggggccccc ccggaaggcg gagaaggcgc 11340
cggggcttgc ttctccggtc gggaggtcgc gaaagtaaca cgcgtaacgg cttccgctcc 11400
gggcgtctgg agcggcggga cgggccgccg tcccgtccgc cgcatcggag tcgtcctccg 11460
tgtccagggg gactggatct gcgggcgggg gtgcggtggg cgaccccgtc ttaggtttct 11520
ttagggccgt gtccgcgcgc gcatcctcgc cccccgagcg cccccgcttg gtcgtgggag 11580
tgacccgcgt ggtcgacgac ggcatcggac aggcatgcaa ggcccccgcc tctcccgccc 11640
gggcagcgcc atcgaggtct tccggatcgc cgtggctgac cgcgtccgac gcggagtcct 11700
ggctgtctgt tggctcgctc ccagaggccc gggaggccga gctcccggct gaaggagacc 11760
cctggctatg acccagccag tccaggcaaa tcttctgggg tttcaggaga aataccgccg 11820
ataccgcgtt gggaccggtg gcggtgacgc acagactggg gacgggggtc gtgaggaaga 11880
acttgagggt gcccccgccg acctgcagtc gccggagcac cgcccgcatg ctgcaatcgt 11940
cgacgaccac agagaaggtg cgatgggtat tttccccgta caccgtcttg gcgttcgcgg 12000
cggcctggcc cgccttggtg agcgcgttgg acaggatctg gacctgggtg ctggtgctgg 12060
acgacacgcc ctcctcgcgg gcagcaaagg tgacgcaggt actcgtggtg aacacggaaa 12120
atttgccgtt aaccccgagc tcgaacgtgg tgggcgtggc actatcggcc ccggtcgcgt 12180
taaggacctt ggtgagctgc ggcctcgtca ggcgcaactg aacgtcgggg gttccctggg 12240
gaaccagcac cacaaagctc gtcagttcgc gcttcatcag cgtctcgctg gctagctcaa 12300
cggcctcgcc gtcggacgtc gtcgtccata tgcgctgaac cagcgtgcga aacggggcct 12360
ggcccgtgat cgccaactcc acccgacgta ggtccgggta ctggttggcg cgaaacacgc 12420
tcaggaggga gcgcttctgg tccacgagag acaggaacgc cgccgtgggt ccgcgccagc 12480
gataccgact gaattgcgag tgttccaggg gcaggaacac ctgctcccca aagatcgtgt 12540
tatggataag gatgccccgg tcgcccataa ccagaagcga gtccagaagg ctcgtgcgca 12600
gcggggcaaa cgcctgtagg attccattaa gttcggcgcc ctgcaggacc acctggcagg 12660
gcgccccctc ctccggctgc ccgagggacg cgtccgacgc gtcctccacg ggggaggcgg 12720
gggccacacc gccaggggaa tccgtcggtc caggattctc ttcgacatct ccggcttgtt 12780
tcagcagaga gaagtttgtt gctccactgc cgaattctgc tagagtatca aaggctctat 12840
gcaacattcg acgagtttcc tccgccgtag cgccggcacc caccgccccg aaccctgcgg 12900
tccggagccg cgcggccacg tcgtccgggg ggtgccacac ttcgggaata aaccttttta 12960
acagactctc ggtgatcttg gcgttattcc caaacagggc cttgaatgtc acgcacgccg 13020
cccccaacag gtgggagaag taatagtccg tgttcagggc gacgccgtgg gcaatggcgt 13080
atgcgggatc ctcggccagc tcggacacca gcagcttgcg gggcttggac gcgcctcccg 13140
gggggtcggc aggcgacggc gtctcccggg ggcgcttggc cggggagggc agggccgcgg 13200
ggggggcggg ctcgtcccct ggggcggcgg cgtctagctc gcggagggcg gccagccgcg 13260
cgaccgtctc ctctacctcg cgggtctggg ccacgatcac gtacgggatc cggtccttga 13320
tggacgggac ctgcgcgcgg cgggccatga gcttgtaata caccgtcagg tgggccaggc 13380
gcttgttggt gtacgcgcgc gggtgtctgc tcagttcggc ggtgaggaca aagtcctgga 13440
tgtccctctc cgggtcggtg atgcgccgat gggcgtctac gaggacggcc ccgaacgcct 13500
gcagtccctc gggcaggggt cgcgccagcc actcctccgc ggggcgctcg gctaacgccg 13560
cggcggctcc ggagacggta tcgtcgtaaa acagcaggtc gaccagggcc ctggaggtgc 13620
ggttgataaa cgcgcagttg tttttgcgca ccagatccac gcccttgatg agcatcttac 13680
ccccgtagat gacgccgatg tactttttct tggcgatcag cagcagcttg gtgaacgtct 13740
tttcgcactc gagtttgatg gggggcagaa acagcgcgcg cgagatgtgg ctcgccatct 13800
tgtcgcccac ggccgtcagc ccggcggccg tgaggccgcg gcacagcaca aagatggagt 13860
ccgtgtcccc gtagatgatg cgcatggaat agggcccggg ggcgcgcatg tcggccgcct 13920
ccgggaaatc ggccaggagc tgttcgaagg ccgcccagcg cgcgtggacg tactcgcggg 13980
tcgcgagcag catctcgcgg ccgatggtcg tcaccgtcgc ggcaacgtgc aggcacggca 14040
ggagtccgtg ctgcactccc gtgaacccgt acaccgagtt acacacgacc ttgatggcgg 14100
cctgctgctt gtccaggagc acggcctcct cggggctgct ctggggaatc cgcgagcgga 14160
tctgctttcg catggcgagc cagtcccgca ggaggatgct gaggaggctc tctcgcacgt 14220
gagccttgac gaagaacagc cgtcgccccc ccacctcgat ctccaggtag tccttgcccg 14280
cctccaggtg cgccactgcg tcggccctca gggagagcgt gctgaagcac aggttgtggg 14340
cctggatgat gctggggtac aggctggcaa agtcgaacac caccacgggg ttcacgtgaa 14400
acccggaagt ggggtcaagg accctggccc cctggtaccc cacgtgcctg ccggcggtct 14460
cccgcgcgcc ctccggctcc cgctcgcccc cgccctcctc gcgttcgtcc tcgtcctccc 14520
cctcctcctc tggccgctcc tcgtcctccc gggctgcggc cggacgcttg ggcgcctccc 14580
ccccggcgcc cctaaatcgc ccctgggtgt ccggcagaat aaagcccttc tggtcggcca 14640
ggcgcagcag gcacgtaaag acgcggatct gctggccgtc gtagatggtg cgggtgatgt 14700
taatacccgc caagcgcgcg acggccgaga gctccagatg gggcaaaaac ttaaaaaaca 14760
gctggcccac cagcagggaa tcctgtatgc agtactcgcc gatcaccccg cgttgcgcgg 14820
gcccggcggc gtagtaggcg gggatgtcgc gatagctcag gtccttcttc ttgtccttca 14880
ggacggcttc ggccacggcg ttgagcttgt agctcgagag cttgatcttg tcggttataa 14940
tcccgtacat gtcgatgttc accatgccgt tcacctttat cttgctgcgc ttctggaagt 15000
ggctctggcc tatgtcccac acgcgaaaca cgccccggcc gttcatgcgg ccgtacccgt 15060
ccagggggac cttgtaaatg tccgtcagct tggccagcaa gaagggccag tcgaagttga 15120
tgatgttgta cccggtcacg aactcggggc cgtactgttt cacaagggtc atgaaggcca 15180
acagcatctc gaattcgctg tcgaattcca gaaccacggg cgtgggcagg cccctggccg 15240
ccagctcgtt caggtgggat tcggggaggt cgcaggaacc gagcgaaaac aggaggacgt 15300
gctccagggc ggtggtggac aggtcgtaga gcagacagga tatctggatg accaggtcct 15360
ccgggtgccc ggccaccgga aaggccagct cgtcctcccc ccccgccttg cattcgatat 15420
cgaagcacat gagcttgtat gccggtaggt cgctcatgcc cccctcgatg gccaggttgt 15480
ccgccgtaca gttaaactcg acgtcgctgg atgtcccgaa ggccatcggg gcccgcggct 15540
gggctagcgt gttgttccgg cccggtttga gacggtacca gccgaaggtg acgaacccgg 15600
ggttgtccag gatgaaccgg gtggtggcgt cgaccccacc ctcgtacttc ttgatggccg 15660
ggcagaagtt gtcgcacagg tacgacagca cgcgcccgct tcggacgtag acgcggtaaa 15720
acagagcggg gcgcgtctcg tagtagtaca cgtcggtgcg ctccaccacc tccgcctcga 15780
agtggtccgc ggagatgccg cggaacgacg cgcccgggga ctcgcgcagg gccgcggcca 15840
tgcgctcgca gagatctcgt ggggcgcggc attgtaggtg cctgtcgacc tcctccttgt 15900
tcatgtaaaa gtactgccgc gtgccgtaaa cgtgaacggc cacccggtgg ccttccggag 15960
tcaggcccag gagcgtgatg acggtccccg tcggtgtgat ggcgtccata aaccgcgcgt 16020
ggaactgggc cgcgcgcatg ccgtacgcgt gctccacgtt ctccaggatg tcgtacacgt 16080
gaaagacggt gacggtgggg ttgaaccccg ccggggcgtg gtccacgccg ccccacaggc 16140
gcgagcgccg cggccagaag ccgcccgacc cgacgcggag gacgtcgcgc tcgtcccccc 16200
cgcagtacac cttgggggcg cgcttgaggt gaccgtcgtg caccccggcg cgcttctccg 16260
ggggggcatc ctcgtccagc acccgcgggg cgatgaatcg aaattcatcg cattcgctat 16320
agtacgtatg gcgctgggtt ggcccggtcg gcttctgttg cgtcccgact ggggcgaggt 16380
aggggttgta aaagttttgc ctcaaacaag gcgggggtcc ccggctggct ccgcgagggc 16440
cggcgggcgc aaaaaacccg gacgccgccc tggccgccga ctttcctccg ggggacagcg 16500
ggccgccgcc accggaaaac atggtggctt taccaacagt accggtggat cgggcccgcg 16560
gtgcgccggc gtttgcaaaa gcctaggcct ccaaaaaagc ctcctcacta cttctggaat 16620
agctcagagg ccgaggcggc ctcggcctct gcataaataa aaaaaattag tcagccatgg 16680
ggcggagaat gggcggaact gggcggagtt aggggcggga tgggcggagt taggggcggg 16740
actatggttg ctgactaatt gagatgcatg ctttgcatac ttctgcctgc tggggagcct 16800
ggggactttc cacacctggt tgctgactaa ttgagatgca tgctttgcat acttctgcct 16860
gctggggagc ctggggactt tccacaccct aactgacaca catttaaatg aagatatggt 16920
gcactctcag tacaatctgc tctgatgccg catagttaag ccagccccga cacccgccaa 16980
cacccgctga cgcgccctga cgggcttgtc tgctcccggc atccgcttac agacaagctg 17040
tgaccgtctc cgggagctgc atgtgtcaga ggttttcacc gtcatcaccg aaacgcgcga 17100
<210> 105
<211> 11635
<212> DNA
<213> Artificial Sequence
<220>
<223> Description of Artificial Sequence: Synthetic polynucleotide
<400> 105
ggtacccaac tccatgctta acagtcccca ggtacagccc accctgcgtc gcaaccagga 60
acagctctac agcttcctgg agcgccactc gccctacttc cgcagccaca gtgcgcagat 120
taggagcgcc acttcttttt gtcacttgaa aaacatgtaa aaataatgta ctaggagaca 180
ctttcaataa aggcaaatgt ttttatttgt acactctcgg gtgattattt accccccacc 240
cttgccgtct gcgccgttta aaaatcaaag gggttctgcc gcgcatcgct atgcgccact 300
ggcagggaca cgttgcgata ctggtgttta gtgctccact taaactcagg cacaaccatc 360
cgcggcagct cggtgaagtt ttcactccac aggctgcgca ccatcaccaa cgcgtttagc 420
aggtcgggcg ccgatatctt gaagtcgcag ttggggcctc cgccctgcgc gcgcgagttg 480
cgatacacag ggttgcagca ctggaacact atcagcgccg ggtggtgcac gctggccagc 540
acgctcttgt cggagatcag atccgcgtcc aggtcctccg cgttgctcag ggcgaacgga 600
gtcaactttg gtagctgcct tcccaaaaag ggtgcatgcc caggctttga gttgcactcg 660
caccgtagtg gcatcagaag gtgaccgtgc ccggtctggg cgttaggata cagcgcctgc 720
atgaaagcct tgatctgctt aaaagccacc tgagcctttg cgccttcaga gaagaacatg 780
ccgcaagact tgccggaaaa ctgattggcc ggacaggccg cgtcatgcac gcagcacctt 840
gcgtcggtgt tggagatctg caccacattt cggccccacc ggttcttcac gatcttggcc 900
ttgctagact gctccttcag cgcgcgctgc ccgttttcgc tcgtcacatc catttcaatc 960
acgtgctcct tatttatcat aatgctcccg tgtagacact taagctcgcc ttcgatctca 1020
gcgcagcggt gcagccacaa cgcgcagccc gtgggctcgt ggtgcttgta ggttacctct 1080
gcaaacgact gcaggtacgc ctgcaggaat cgccccatca tcgtcacaaa ggtcttgttg 1140
ctggtgaagg tcagctgcaa cccgcggtgc tcctcgttta gccaggtctt gcatacggcc 1200
gccagagctt ccacttggtc aggcagtagc ttgaagtttg cctttagatc gttatccacg 1260
tggtacttgt ccatcaacgc gcgcgcagcc tccatgccct tctcccacgc agacacgatc 1320
ggcaggctca gcgggtttat caccgtgctt tcactttccg cttcactgga ctcttccttt 1380
tcctcttgcg tccgcatacc ccgcgccact gggtcgtctt cattcagccg ccgcaccgtg 1440
cgcttacctc ccttgccgtg cttgattagc accggtgggt tgctgaaacc caccatttgt 1500
agcgccacat cttctctttc ttcctcgctg tccacgatca cctctgggga tggcgggcgc 1560
tcgggcttgg gagaggggcg cttctttttc tttttggacg caatggccaa atccgccgtc 1620
gaggtcgatg gccgcgggct gggtgtgcgc ggcaccagcg catcttgtga cgagtcttct 1680
tcgtcctcgg actcgagacg ccgcctcagc cgcttttttg ggggcgcgcg gggaggcggc 1740
ggcgacggcg acggggacga cacgtcctcc atggttggtg gacgtcgcgc cgcaccgcgt 1800
ccgcgctcgg gggtggtttc gcgctgctcc tcttcccgac tggccatttc cttctcctat 1860
aggcagaaaa agatcatgga gtcagtcgag aaggaggaca gcctaaccgc cccctttgag 1920
ttcgccacca ccgcctccac cgatgccgcc aacgcgccta ccaccttccc cgtcgaggca 1980
cccccgcttg aggaggagga agtgattatc gagcaggacc caggttttgt aagcgaagac 2040
gacgaggatc gctcagtacc aacagaggat aaaaagcaag accaggacga cgcagaggca 2100
aacgaggaac aagtcgggcg gggggaccaa aggcatggcg actacctaga tgtgggagac 2160
gacgtgctgt tgaagcatct gcagcgccag tgcgccatta tctgcgacgc gttgcaagag 2220
cgcagcgatg tgcccctcgc catagcggat gtcagccttg cctacgaacg ccacctgttc 2280
tcaccgcgcg taccccccaa acgccaagaa aacggcacat gcgagcccaa cccgcgcctc 2340
aacttctacc ccgtatttgc cgtgccagag gtgcttgcca cctatcacat ctttttccaa 2400
aactgcaaga tacccctatc ctgccgtgcc aaccgcagcc gagcggacaa gcagctggcc 2460
ttgcggcagg gcgctgtcat acctgatatc gcctcgctcg acgaagtgcc aaaaatcttt 2520
gagggtcttg gacgcgacga gaaacgcgcg gcaaacgctc tgcaacaaga aaacagcgaa 2580
aatgaaagtc actgtggagt gctggtggaa cttgagggtg acaacgcgcg cctagccgtg 2640
ctgaaacgca gcatcgaggt cacccacttt gcctacccgg cacttaacct accccccaag 2700
gttatgagca cagtcatgag cgagctgatc gtgcgccgtg cacgacccct ggagagggat 2760
gcaaacttgc aagaacaaac cgaggagggc ctacccgcag ttggcgatga gcagctggcg 2820
cgctggcttg agacgcgcga gcctgccgac ttggaggagc gacgcaagct aatgatggcc 2880
gcagtgcttg ttaccgtgga gcttgagtgc atgcagcggt tctttgctga cccggagatg 2940
cagcgcaagc tagaggaaac gttgcactac acctttcgcc agggctacgt gcgccaggcc 3000
tgcaaaattt ccaacgtgga gctctgcaac ctggtctcct accttggaat tttgcacgaa 3060
aaccgcctcg ggcaaaacgt gcttcattcc acgctcaagg gcgaggcgcg ccgcgactac 3120
gtccgcgact gcgtttactt atttctgtgc tacacctggc aaacggccat gggcgtgtgg 3180
cagcaatgcc tggaggagcg caacctaaag gagctgcaga agctgctaaa gcaaaacttg 3240
aaggacctat ggacggcctt caacgagcgc tccgtggccg cgcacctggc ggacattatc 3300
ttccccgaac gcctgcttaa aaccctgcaa cagggtctgc cagacttcac cagtcaaagc 3360
atgttgcaaa actttaggaa ctttatccta gagcgttcag gaattctgcc cgccacctgc 3420
tgtgcgcttc ctagcgactt tgtgcccatt aagtaccgtg aatgccctcc gccgctttgg 3480
ggtcactgct accttctgca gctagccaac taccttgcct accactccga catcatggaa 3540
gacgtgagcg gtgacggcct actggagtgt cactgtcgct gcaacctatg caccccgcac 3600
cgctccctgg tctgcaattc gcaactgctt agcgaaagtc aaattatcgg tacctttgag 3660
ctgcagggtc cctcgcctga cgaaaagtcc gcggctccgg ggttgaaact cactccgggg 3720
ctgtggacgt cggcttacct tcgcaaattt gtacctgagg actaccacgc ccacgagatt 3780
aggttctacg aagaccaatc ccgcccgcca aatgcggagc ttaccgcctg cgtcattacc 3840
cagggccaca tccttggcca attgcaagcc atcaacaaag cccgccaaga gtttctgcta 3900
cgaaagggac ggggggttta cctggacccc cagtccggcg aggagctcaa cccaatcccc 3960
ccgccgccgc agccctatca gcagccgcgg gcccttgctt cccaggatgg cacccaaaaa 4020
gaagctgcag ctgccgccgc cgccacccac ggacgaggag gaatactggg acagtcaggc 4080
agaggaggtt ttggacgagg aggaggagat gatggaagac tgggacagcc tagacgaagc 4140
ttccgaggcc gaagaggtgt cagacgaaac accgtcaccc tcggtcgcat tcccctcgcc 4200
ggcgccccag aaattggcaa ccgttcccag catcgctaca acctccgctc ctcaggcgcc 4260
gccggcactg cctgttcgcc gacccaaccg tagatgggac accactggaa ccagggccgg 4320
taagtctaag cagccgccgc cgttagccca agagcaacaa cagcgccaag gctaccgctc 4380
gtggcgcggg cacaagaacg ccatagttgc ttgcttgcaa gactgtgggg gcaacatctc 4440
cttcgcccgc cgctttcttc tctaccatca cggcgtggcc ttcccccgta acatcctgca 4500
ttactaccgt catctctaca gcccctactg caccggcggc agcggcagcg gcagcaacag 4560
cagcggtcac acagaagcaa aggcgaccgg atagcaagac tctgacaaag cccaagaaat 4620
ccacagcggc ggcagcagca ggaggaggag cgctgcgtct ggcgcccaac gaacccgtat 4680
cgacccgcga gcttagaaat aggatttttc ccactctgta tgctatattt caacaaagca 4740
ggggccaaga acaagagctg aaaataaaaa acaggtctct gcgctccctc acccgcagct 4800
gcctgtatca caaaagcgaa gatcagcttc ggcgcacgct ggaagacgcg gaggctctct 4860
tcagcaaata ctgcgcgctg actcttaagg actagtttcg cgccctttct caaatttaag 4920
cgcgaaaact acgtcatctc cagcggccac acccggcgcc agcacctgtc gtcagcgcca 4980
ttatgagcaa ggaaattccc acgccctaca tgtggagtta ccagccacaa atgggacttg 5040
cggctggagc tgcccaagac tactcaaccc gaataaacta catgagcgcg ggaccccaca 5100
tgatatcccg ggtcaacgga atccgcgccc accgaaaccg aattctcctc gaacaggcgg 5160
ctattaccac cacacctcgt aataacctta atccccgtag ttggcccgct gccctggtgt 5220
accaggaaag tcccgctccc accactgtgg tacttcccag agacgcccag gccgaagttc 5280
agatgactaa ctcaggggcg cagcttgcgg gcggctttcg tcacagggtg cggtcgcccg 5340
ggcgttttag ggcggagtaa cttgcatgta ttgggaattg tagttttttt aaaatgggaa 5400
gtgacgtatc gtgggaaaac ggaagtgaag atttgaggaa gttgtgggtt ttttggcttt 5460
cgtttctggg cgtaggttcg cgtgcggttt tctgggtgtt ttttgtggac tttaaccgtt 5520
acgtcatttt ttagtcctat atatactcgc tctgtacttg gcccttttta cactgtgact 5580
gattgagctg gtgccgtgtc gagtggtgtt ttttaatagg tttttttact ggtaaggctg 5640
actgttatgg ctgccgctgt ggaagcgctg tatgttgttc tggagcggga gggtgctatt 5700
ttgcctaggc aggagggttt ttcaggtgtt tatgtgtttt tctctcctat taattttgtt 5760
atacctccta tgggggctgt aatgttgtct ctacgcctgc gggtatgtat tcccccgggc 5820
tatttcggtc gctttttagc actgaccgat gttaaccaac ctgatgtgtt taccgagtct 5880
tacattatga ctccggacat gaccgaggaa ctgtcggtgg tgctttttaa tcacggtgac 5940
cagttttttt acggtcacgc cggcatggcc gtagtccgtc ttatgcttat aagggttgtt 6000
tttcctgttg taagacaggc ttctaatgtt taaatgtttt tttttttgtt attttatttt 6060
gtgtttaatg caggaacccg cagacatgtt tgagagaaaa atggtgtctt tttctgtggt 6120
ggttccggaa cttacctgcc tttatctgca tgagcatgac tacgatgtgc ttgctttttt 6180
gcgcgaggct ttgcctgatt ttttgagcag caccttgcat tttatatcgc cgcccatgca 6240
acaagcttac ataggggcta cgctggttag catagctccg agtatgcgtg tcataatcag 6300
tgtgggttct tttgtcatgg ttcctggcgg ggaagtggcc gcgctggtcc gtgcagacct 6360
gcacgattat gttcagctgg ccctgcgaag ggacctacgg gatcgcggta tttttgttaa 6420
tgttccgctt ttgaatctta tacaggtctg tgaggaacct gaatttttgc aatcatgatt 6480
cgctgcttga ggctgaaggt ggagggcgct ctggagcaga tttttacaat ggccggactt 6540
aatattcggg atttgcttag agacatattg ataaggtggc gagatgaaaa ttatttgggc 6600
atggttgaag gtgctggaat gtttatagag gagattcacc ctgaagggtt tagcctttac 6660
gtccacttgg acgtgagggc agtttgcctt ttggaagcca ttgtgcaaca tcttacaaat 6720
gccattatct gttctttggc tgtagagttt gaccacgcca ccggagggga gcgcgttcac 6780
ttaatagatc ttcattttga ggttttggat aatcttttgg aataaaaaaa aaaaaacatg 6840
gttcttccag ctcttcccgc tcctcccgtg tgtgactcgc agaacgaatg tgtaggttgg 6900
ctgggtgtgg cttattctgc ggtggtggat gttatcaggg cagcggcgca tgaaggagtt 6960
tacatagaac ccgaagccag ggggcgcctg gatgctttga gagagtggat atactacaac 7020
tactacacag agcgagctaa gcgacgagac cggagacgca gatctgtttg tcacgcccgc 7080
acctggtttt gcttcaggaa atatgactac gtccggcgtt ccatttggca tgacactacg 7140
accaacacga tctcggttgt ctcggcgcac tccgtacagt agggatcgcc tacctccttt 7200
tgagacagag acccgcgcta ccatactgga ggatcatccg ctgctgcccg aatgtaacac 7260
tttgacaatg cacaacgtga gttacgtgcg aggtcttccc tgcagtgtgg gatttacgct 7320
gattcaggaa tgggttgttc cctgggatat ggttctgacg cgggaggagc ttgtaatcct 7380
gaggaagtgt atgcacgtgt gcctgtgttg tgccaacatt gatatcatga cgagcatgat 7440
gatccatggt tacgagtcct gggctctcca ctgtcattgt tccagtcccg gttccctgca 7500
gtgcatagcc ggcgggcagg ttttggccag ctggtttagg atggtggtgg atggcgccat 7560
gtttaatcag aggtttatat ggtaccggga ggtggtgaat tacaacatgc caaaagaggt 7620
aatgtttatg tccagcgtgt ttatgagggg tcgccactta atctacctgc gcttgtggta 7680
tgatggccac gtgggttctg tggtccccgc catgagcttt ggatacagcg ccttgcactg 7740
tgggattttg aacaatattg tggtgctgtg ctgcagttac tgtgctgatt taagtgagat 7800
cagggtgcgc tgctgtgccc ggaggacaag gcgtctcatg ctgcgggcgg tgcgaatcat 7860
cgctgaggag accactgcca tgttgtattc ctgcaggacg gagcggcggc ggcagcagtt 7920
tattcgcgcg ctgctgcagc accaccgccc tatcctgatg cacgattatg actctacccc 7980
catgtaggcg tggacttccc cttcgccgcc cgttgagcaa ccgcaagttg gacagcagcc 8040
tgtggctcag cagctggaca gcgacatgaa cttaagcgag ctgcccgggg agtttattaa 8100
tatcactgat gagcgtttgg ctcgacagga aaccgtgtgg aatataacac ctaagaatat 8160
gtctgttacc catgatatga tgctttttaa ggccagccgg ggagaaagga ctgtgtactc 8220
tgtgtgttgg gagggaggtg gcaggttgaa tactagggtt ctgtgagttt gattaaggta 8280
cggtgatcaa tataagctat gtggtggtgg ggctatacta ctgaatgaaa aatgacttga 8340
aattttctgc aattgaaaaa taaacacgtt gaaacataac atgcaacagg ttcacgattc 8400
tctagtgaat ccacagaaac tagcgaggta agcacttact ctatgtcttt tacatggtcc 8460
tgggaaagtg gaaaatacac cactgaaact tttgctacca actcttacac cttctcctac 8520
attgcccagg aataaaatcg atgtaggatg ttgcccctcc tgacgcggta ggagaagggg 8580
agggtgccct gcatgtctgc cgctgctctt gctcttgccg ctgctgagga ggggggcgca 8640
tctgccgcag caccggatgc atctgggaaa agcaaaaaag gggctcgtcc ctgtttccgg 8700
aggaatttgc aagcggggtc ttgcatgacg gggaggcaaa cccccgttcg ccgcagtccg 8760
gccggcccga gactcgaacc gggggtcctg cgactcaacc cttggaaaat aaccctccgg 8820
ctacagggag cgagccactt aatgctttcg ctttccagcc taaccgctta cgccgcgcgc 8880
ggccagtggc caaaaaagct agcgcagcag ccgccgcgcc tggaaggaag ccaaaaggag 8940
cgctcccccg ttgtctgacg tcgcacacct gggttcgaca cgcgggcggt aaccgcatgg 9000
atcacggcgg acggccggat ccggggttcg aaccccggtc gtccgccatg atacccttgc 9060
gaatttatcc accagaccac ggaagagtgc ccgcttacag gctctccttt tgcacggtct 9120
agagcgtcaa cgactgcgca cgcctcaccg gccagagcgt cccgaccatg gagcactttt 9180
tgccgctgcg caacatctgg aaccgcgtcc gcgactttcc gcgcgcctcc accaccgccg 9240
ccggcatcac ctggatgtcc aggtacatct acggattacg tcgacgttta aaccatatga 9300
tcagctcact caaaggcggt aatacggtta tccacagaat caggggataa cgcaggaaag 9360
aacatgtgag caaaaggcca gcaaaaggcc aggaaccgta aaaaggccgc gttgctggcg 9420
tttttccata ggctccgccc ccctgacgag catcacaaaa atcgacgctc aagtcagagg 9480
tggcgaaacc cgacaggact ataaagatac caggcgtttc cccctggaag ctccctcgtg 9540
cgctctcctg ttccgaccct gccgcttacc ggatacctgt ccgcctttct cccttcggga 9600
agcgtggcgc tttctcatag ctcacgctgt aggtatctca gttcggtgta ggtcgttcgc 9660
tccaagctgg gctgtgtgca cgaacccccc gttcagcccg accgctgcgc cttatccggt 9720
aactatcgtc ttgagtccaa cccggtaaga cacgacttat cgccactggc agcagccact 9780
ggtaacagga ttagcagagc gaggtatgta ggcggtgcta cagagttctt gaagtggtgg 9840
cctaactacg gctacactag aagaacagta tttggtatct gcgctctgct gaagccagtt 9900
accttcggaa aaagagttgg tagctcttga tccggcaaac aaaccaccgc tggtagcggt 9960
ggtttttttg tttgcaagca gcagattacg cgcagaaaaa aaggatctca agaagatcct 10020
ttgatctttt ctacggggtc tgacgctcag tggaacgaaa actcacgtta agggattttg 10080
gtcatgagat tatcaaaaag gatcttcacc tagatccttt taaattaaaa atgaagtttt 10140
aaatcaatct aaagtatata tgagtaaact tggtctgaca gttaccaatg cttaatcagt 10200
gaggcaccta tctcagcgat ctgtctattt cgttcatcca tagttgcctg actccccgtc 10260
gtgtagataa ctacgatacg ggagggctta ccatctggcc ccagtgctgc aatgataccg 10320
cgagacccac gctcaccggc tccagattta tcagcaataa accagccagc cggaagggcc 10380
gagcgcagaa gtggtcctgc aactttatcc gcctccatcc agtctattaa ttgttgccgg 10440
gaagctagag taagtagttc gccagttaat agtttgcgca acgttgttgc cattgctaca 10500
ggcatcgtgg tgtcacgctc gtcgtttggt atggcttcat tcagctccgg ttcccaacga 10560
tcaaggcgag ttacatgatc ccccatgttg tgcaaaaaag cggttagctc cttcggtcct 10620
ccgatcgttg tcagaagtaa gttggccgca gtgttatcac tcatggttat ggcagcactg 10680
cataattctc ttactgtcat gccatccgta agatgctttt ctgtgactgg tgagtactca 10740
accaagtcat tctgagaata gtgtatgcgg cgaccgagtt gctcttgccc ggcgtcaata 10800
cgggataata ccgcgccaca tagcagaact ttaaaagtgc tcatcattgg aaaacgttct 10860
tcggggcgaa aactctcaag gatcttaccg ctgttgagat ccagttcgat gtaacccact 10920
cgtgcaccca actgatcttc agcatctttt actttcacca gcgtttctgg gtgagcaaaa 10980
acaggaaggc aaaatgccgc aaaaaaggga ataagggcga cacggaaatg ttgaatactc 11040
atactcttcc tttttcaata ttattgaagc atttatcagg gttattgtct catgagcgga 11100
tacatatttg aatgtattta gaaaaataaa caaatagggg ttccgcgcac atttccccga 11160
aaagtgccac ctaaattgta agcgttaata ttttgttaaa attcgcgtta aatttttgtt 11220
aaatcagctc attttttaac caataggccg aaatcggcaa aatcccttat aaatcaaaag 11280
aatagaccga gatagggttg agtgttgttc cagtttggaa caagagtcca ctattaaaga 11340
acgtggactc caacgtcaaa gggcgaaaaa ccgtctatca gggcgatggc ccactacgtg 11400
aaccatcacc ctaatcaagt tttttggggt cgaggtgccg taaagcacta aatcggaacc 11460
ctaaagggag cccccgattt agagcttgac ggggaaagcc ggcgaacgtg gcgagaaagg 11520
aagggaagaa agcgaaagga gcgggcgcta gggcgctggc aagtgtagcg gtcacgctgc 11580
gcgtaaccac cacacccgcc gcgcttaatg cgccgctaca gggcgcgatg gatcc 11635
<210> 106
<211> 18932
<212> DNA
<213> Artificial Sequence
<220>
<223> Description of Artificial Sequence: Synthetic polynucleotide
<400> 106
tcttccgctt cctcgctcac tgactcgctg cgctcggtcg ttcggctgcg gcgagcggta 60
tcagctcact caaaggcggt aatacggtta tccacagaat caggggataa cgcaggaaag 120
aacatgtgag caaaaggcca gcaaaaggcc aggaaccgta aaaaggccgc gttgctggcg 180
tttttccata ggctccgccc ccctgacgag catcacaaaa atcgacgctc aagtcagagg 240
tggcgaaacc cgacaggact ataaagatac caggcgtttc cccctggaag ctccctcgtg 300
cgctctcctg ttccgaccct gccgcttacc ggatacctgt ccgcctttct cccttcggga 360
agcgtggcgc tttctcatag ctcacgctgt aggtatctca gttcggtgta ggtcgttcgc 420
tccaagctgg gctgtgtgca cgaacccccc gttcagcccg accgctgcgc cttatccggt 480
aactatcgtc ttgagtccaa cccggtaaga cacgacttat cgccactggc agcagccact 540
ggtaacagga ttagcagagc gaggtatgta ggcggtgcta cagagttctt gaagtggtgg 600
cctaactacg gctacactag aagaacagta tttggtatct gcgctctgct gaagccagtt 660
accttcggaa aaagagttgg tagctcttga tccggcaaac aaaccaccgc tggtagcggt 720
ggtttttttg tttgcaagca gcagattacg cgcagaaaaa aaggatctca agaagatcct 780
ttgatctttt ctacggggtc tgacgctcag tggaacgaaa actcacgtta agggattttg 840
gtcatgagat tatcaaaaag gatcttcacc tagatccttt taaattaaaa atgaagtttt 900
aaatcaatct aaagtatata tgagtaaact tggtctgaca gttaccaatg cttaatcagt 960
gaggcaccta tctcagcgat ctgtctattt cgttcatcca tagttgcctg actccccgtc 1020
gtgtagataa ctacgatacg ggagggctta ccatctggcc ccagtgctgc aatgataccg 1080
cgagacccac gctcaccggc tccagattta tcagcaataa accagccagc cggaagggcc 1140
gagcgcagaa gtggtcctgc aactttatcc gcctccatcc agtctattaa ttgttgccgg 1200
gaagctagag taagtagttc gccagttaat agtttgcgca acgttgttgc cattgctaca 1260
ggcatcgtgg tgtcacgctc gtcgtttggt atggcttcat tcagctccgg ttcccaacga 1320
tcaaggcgag ttacatgatc ccccatgttg tgcaaaaaag cggttagctc cttcggtcct 1380
ccgatcgttg tcagaagtaa gttggccgca gtgttatcac tcatggttat ggcagcactg 1440
cataattctc ttactgtcat gccatccgta agatgctttt ctgtgactgg tgagtactca 1500
accaagtcat tctgagaata gtgtatgcgg cgaccgagtt gctcttgccc ggcgtcaata 1560
cgggataata ccgcgccaca tagcagaact ttaaaagtgc tcatcattgg aaaacgttct 1620
tcggggcgaa aactctcaag gatcttaccg ctgttgagat ccagttcgat gtaacccact 1680
cgtgcaccca actgatcttc agcatctttt actttcacca gcgtttctgg gtgagcaaaa 1740
acaggaaggc aaaatgccgc aaaaaaggga ataagggcga cacggaaatg ttgaatactc 1800
atactcttcc tttttcaata ttattgaagc atttatcagg gttattgtct catgagcgga 1860
tacatatttg aatgtattta gaaaaataaa caaatagggg ttccgcgcac atttccccga 1920
aaagtgccac ctgacgtcta agaaaccatt attatcatga cattaaccta taaaaatagg 1980
cgtatcacga ggccctttcg tctcgcgcgt ttcggtgatg acggtgaaaa cctctgacac 2040
atgcagctcc cggagacggt cacagcttgt ctgtaagcgg atgccgggag cagacaagcc 2100
cgtcagggcg cgtcagcggg tgttggcggg tgtcggggct ggcttaacta tgcggcatca 2160
gagcagattg tactgagagt gcaccataaa attgtaaacg ttaatatttt gttaaaattc 2220
gcgttaaatt tttgttaaat cagctcattt tttaaccaat aggccgaaat cggcaaaatc 2280
ccttataaat caaaagaata gcccgagata gggttgagtg ttgttccagt ttggaacaag 2340
agtccactat taaagaacgt ggactccaac gtcaaagggc gaaaaaccgt ctatcagggc 2400
gatggcccac tacgtgaacc atcacccaaa tcaagttttt tggggtcgag gtgccgtaaa 2460
gcactaaatc ggaaccctaa agggagcccc cgatttagag cttgacgggg aaagccggcg 2520
aacgtggcga gaaaggaagg gaagaaagcg aaaggagcgg gcgctagggc gctggcaagt 2580
gtagcggtca cgctgcgcgt aaccaccaca cccgccgcgc ttaatgcgcc gctacagggc 2640
gcgtactatg gttgctttga cgtatgcggt gtgaaatacc gcacagatgc gtaaggagaa 2700
aataccgcat caggcgccat tcgccattca ggctgcgcaa ctgttgggaa gggcgatcgg 2760
tgcgggcctc ttcgctatta cgccagctgg cgaaaggggg atgtgctgca aggcgattaa 2820
gttgggtaac gccagggttt tcccagtcac gacgttgtaa aacgacggcc agtgccaagc 2880
ttaaggtgca cggcccacgt ggccactagt acttctcgac agaagcacca tgtccttggg 2940
tccggcctgc tgaatgcgca ggcggtcggc catgccccag gcttcgtttt gacatcggcg 3000
caggtctttg tagtagtctt gcatgagcct ttctaccggc acttcttctt ctccttcctc 3060
ttgtcctgca tctcttgcat ctatcgctgc ggcggcggcg gagtttggcc gtaggtggcg 3120
ccctcttcct cccatgcgtg tgaccccgaa gcccctcatc ggctgaagca gggctaggtc 3180
ggcgacaacg cgctcggcta atatggcctg ctgcacctgc gtgagggtag actggaagtc 3240
atccatgtcc acaaagcggt ggtatgcgcc cgtgttgatg gtgtaagtgc agttggccat 3300
aacggaccag ttaacggtct ggtgacccgg ctgcgagagc tcggtgtacc tgagacgcga 3360
gtaagccctc gagtcaaata cgtagtcgtt gcaagtccgc accaggtact ggtatcccac 3420
caaaaagtgc ggcggcggct ggcggtagag gggccagcgt agggtggccg gggctccggg 3480
ggcgagatct tccaacataa ggcgatgata tccgtagatg tacctggaca tccaggtgat 3540
gccggcggcg gtggtggagg cgcgcggaaa gtcgcggacg cggttccaga tgttgcgcag 3600
cggcaaaaag tgctccatgg tcgggacgct ctggccggtc aggcgcgcgc aatcgttgac 3660
gctctaccgt gcaaaaggag agcctgtaag cgggcactct tccgtggtct ggtggataaa 3720
ttcgcaaggg tatcatggcg gacgaccggg gttcgagccc cgtatccggc cgtccgccgt 3780
gatccatgcg gttaccgccc gcgtgtcgaa cccaggtgtg cgacgtcaga caacggggga 3840
gtgctccttt tggcttcctt ccaggcgcgg cggctgctgc gctagctttt ttggccactg 3900
gccgcgcgca gcgtaagcgg ttaggctgga aagcgaaagc attaagtggc tcgctccctg 3960
tagccggagg gttattttcc aagggttgag tcgcgggacc cccggttcga gtctcggacc 4020
ggccggactg cggcgaacgg gggtttgcct ccccgtcatg caagaccccg cttgcaaatt 4080
cctccggaaa cagggacgag cccctttttt gcttttccca gatgcatccg gtgctgcggc 4140
agatgcgccc ccctcctcag cagcggcaag agcaagagca gcggcagaca tgcagggcac 4200
cctcccctcc tcctaccgcg tcaggagggg cgacatccgc ggttgacgcg gcagcagatg 4260
gtgattacga acccccgcgg cgccgggccc ggcactacct ggacttggag gagggcgagg 4320
gcctggcgcg gctaggagcg ccctctcctg agcggtaccc aagggtgcag ctgaagcgtg 4380
atacgcgtga ggcgtacgtg ccgcggcaga acctgtttcg cgaccgcgag ggagaggagc 4440
ccgaggagat gcgggatcga aagttccacg cagggcgcga gctgcggcat ggcctgaatc 4500
gcgagcggtt gctgcgcgag gaggactttg agcccgacgc gcgaaccggg attagtcccg 4560
cgcgcgcaca cgtggcggcc gccgacctgg taaccgcata cgagcagacg gtgaaccagg 4620
agattaactt tcaaaaaagc tttaacaacc acgtgcgtac gcttgtggcg cgcgaggagg 4680
tggctatagg actgatgcat ctgtgggact ttgtaagcgc gctggagcaa aacccaaata 4740
gcaagccgct catggcgcag ctgttcctta tagtgcagca cagcagggac aacgaggcat 4800
tcagggatgc gctgctaaac atagtagagc ccgagggccg ctggctgctc gatttgataa 4860
acatcctgca gagcatagtg gtgcaggagc gcagcttgag cctggctgac aaggtggccg 4920
ccatcaacta ttccatgctt agcctgggca agttttacgc ccgcaagata taccataccc 4980
cttacgttcc catagacaag gaggtaaaga tcgaggggtt ctacatgcgc atggcgctga 5040
aggtgcttac cttgagcgac gacctgggcg tttatcgcaa cgagcgcatc cacaaggccg 5100
tgagcgtgag ccggcggcgc gagctcagcg accgcgagct gatgcacagc ctgcaaaggg 5160
ccctggctgg cacgggcagc ggcgatagag aggccgagtc ctactttgac gcgggcgctg 5220
acctgcgctg ggccccaagc cgacgcgccc tggaggcagc tggggccgga cctgggctgg 5280
cggtggcacc cgcgcgcgct ggcaacgtcg gcggcgtgga ggaatatgac gaggacgatg 5340
agtacgagcc agaggacggc gagtactaag cggtgatgtt tctgatcaga tgatgcaaga 5400
cgcaacggac ccggcggtgc gggcggcgct gcagagccag ccgtccggcc ttaactccac 5460
ggacgactgg cgccaggtca tggaccgcat catgtcgctg actgcgcgca atcctgacgc 5520
gttccggcag cagccgcagg ccaaccggct ctccgcaatt ctggaagcgg tggtcccggc 5580
gcgcgcaaac cccacgcacg agaaggtgct ggcgatcgta aacgcgctgg ccgaaaacag 5640
ggccatccgg cccgacgagg ccggcctggt ctacgacgcg ctgcttcagc gcgtggctcg 5700
ttacaacagc ggcaacgtgc agaccaacct ggaccggctg gtgggggatg tgcgcgaggc 5760
cgtggcgcag cgtgagcgcg cgcagcagca gggcaacctg ggctccatgg ttgcactaaa 5820
cgccttcctg agtacacagc ccgccaacgt gccgcgggga caggaggact acaccaactt 5880
tgtgagcgca ctgcggctaa tggtgactga gacaccgcaa agtgaggtgt accagtctgg 5940
gccagactat tttttccaga ccagtagaca aggcctgcag accgtaaacc tgagccaggc 6000
tttcaaaaac ttgcaggggc tgtggggggt gcgggctccc acaggcgacc gcgcgaccgt 6060
gtctagcttg ctgacgccca actcgcgcct gttgctgctg ctaatagcgc ccttcacgga 6120
cagtggcagc gtgtcccggg acacatacct aggtcacttg ctgacactgt accgcgaggc 6180
cataggtcag gcgcatgtgg acgagcatac tttccaggag attacaagtg tcagccgcgc 6240
gctggggcag gaggacacgg gcagcctgga ggcaacccta aactacctgc tgaccaaccg 6300
gcggcagaag atcccctcgt tgcacagttt cgcacccttt ggcgcatccc attctccagt 6360
aactttatgt ccatgggcgc actcacagac ctgggccaaa accttctcta cgccaactcc 6420
gcccacgcgc tagacatgac ttttgaggtg gatcccatgg acgagcccac ccttctttat 6480
gttttgtttg aagtctttga cgtggtccgt gtgcaccggc cgcaccgcgg cgtcatcgaa 6540
accgtgtacc tgcgcacgcc cttctcggcc ggcaacgcca caacataaag aagcaagcaa 6600
catcaacaac agctgccgcc atgggctcca gtgagcagga actgaaagcc attgtcaaag 6660
atcttggttg tgggccatat tttttgggca cctatgacaa gcgctttcca ggctttgttt 6720
ctccacacaa gctcgcctgc gccatagtca atacggccgg tcgcgagact gggggcgtac 6780
actggatggc ctttgcctgg aacccgcact caaaaacatg ctacctcttt gagccctttg 6840
gcttttctga ccagcgactc aagcaggttt accagtttga gtacgagtca ctcctgcgcc 6900
gtagcgccat tgcttcttcc cccgaccgct gtataacgct ggaaaagtcc acccaaagcg 6960
tacaggggcc caactcggcc gcctgtggac tattctgctg catgtttctc cacgcctttg 7020
ccaactggcc ccaaactccc atggatcaca accccaccat gaaccttatt accggggtac 7080
ccaactccat gctcaacagt ccccaggtac agcccaccct gcgtcgcaac caggaacagc 7140
tctacagctt cctggagcgc cactcgccct acttccgcag ccacagtgcg cagattagga 7200
gcgccacttc tttttgtcac ttgaaaaaca tgtaaaaata atgtactaga gacactttca 7260
ataaaggcaa atgcttttat ttgtacactc tcgggtgatt atttaccccc acccttgccg 7320
tctgcgccgt ttaaaaatca aaggggttct gccgcgcatc gctatgcgcc actggcaggg 7380
acacgttgcg atactggtgt ttagtgctcc acttaaactc aggcacaacc atccgcggca 7440
gctcggtgaa gttttcactc cacaggctgc gcaccatcac caacgcgttt agcaggtcgg 7500
gcgccgatat cttgaagtcg cagttggggc ctccgccctg cgcgcgcgag ttgcgataca 7560
cagggttgca gcactggaac actatcagcg ccgggtggtg cacgctggcc agcacgctct 7620
tgtcggagat cagatccgcg tccaggtcct ccgcgttgct cagggcgaac ggagtcaact 7680
ttggtagctg ccttcccaaa aagggcgcgt gcccaggctt tgagttgcac tcgcaccgta 7740
gtggcatcaa aaggtgaccg tgcccggtct gggcgttagg atacagcgcc tgcataaaag 7800
ccttgatctg cttaaaagcc acctgagcct ttgcgccttc agagaagaac atgccgcaag 7860
acttgccgga aaactgattg gccggacagg ccgcgtcgtg cacgcagcac cttgcgtcgg 7920
tgttggagat ctgcaccaca tttcggcccc accggttctt cacgatcttg gccttgctag 7980
actgctcctt cagcgcgcgc tgcccgtttt cgctcgtcac atccatttca atcacgtgct 8040
ccttatttat cataatgctt ccgtgtagac acttaagctc gccttcgatc tcagcgcagc 8100
ggtgcagcca caacgcgcag cccgtgggct cgtgatgctt gtaggtcacc tctgcaaacg 8160
actgcaggta cgcctgcagg aatcgcccca tcatcgtcac aaaggtcttg ttgctggtga 8220
aggtcagctg caacccgcgg tgctcctcgt tcagccaggt cttgcatacg gccgccagag 8280
cttccacttg gtcaggcagt agtttgaagt tcgcctttag atcgttatcc acgtggtact 8340
tgtccatcag cgcgcgcgca gcctccatgc ccttctccca cgcagacacg atcggcacac 8400
tcagcgggtt catcaccgta atttcacttt ccgcttcgct gggctcttcc tcttcctctt 8460
gcgtccgcat accacgcgcc actgggtcgt cttcattcag ccgccgcact gtgcgcttac 8520
ctcctttgcc atgcttgatt agcaccggtg ggttgctgaa acccaccatt tgtagcgcca 8580
catcttctct ttcttcctcg ctgtccacga ttacctctgg tgatggcggg cgctcgggct 8640
tgggagaagg gcgcttcttt ttcttcttgg gcgcaatggc caaatccgcc gccgaggtcg 8700
atggccgcgg gctgggtgtg cgcggcacca gcgcgtcttg tgatgagtct tcctcgtcct 8760
cggactcgat acgccgcctc atccgctttt ttgggggcgc ccggggaggc ggcggcgacg 8820
gggacgggga cgacacgtcc tccatggttg ggggacgtcg cgccgcaccg cgtccgcgct 8880
cgggggtggt ttcgcgctgc tcctcttccc gactggccat ttccttctcc tataggcaga 8940
aaaagatcat ggagtcagtc gagaagaagg acagcctaac cgccccctct gagttcgcca 9000
ccaccgcctc caccgatgcc gccaacgcgc ctaccacctt ccccgtcgag gcacccccgc 9060
ttgaggagga ggaagtgatt atcgagcagg acccaggttt tgtaagcgaa gacgacgagg 9120
accgctcagt accaacagag gataaaaagc aagaccagga caacgcagag gcaaacgagg 9180
aacaagtcgg gcggggggac gaaaggcatg gcgactacct agatgtggga gacgacgtgc 9240
tgttgaagca tctgcagcgc cagtgcgcca ttatctgcga cgcgttgcaa gagcgcagcg 9300
atgtgcccct cgccatagcg gatgtcagcc ttgcctacga acgccaccta ttctcaccgc 9360
gcgtaccccc caaacgccaa gaaaacggca catgcgagcc caacccgcgc ctcaacttct 9420
accccgtatt tgccgtgcca gaggtgcttg ccacctatca catctttttc caaaactgca 9480
agatacccct atcctgccgt gccaaccgca gccgagcgga caagcagctg gccttgcggc 9540
agggcgctgt catacctgat atcgcctcgc tcaacgaagt gccaaaaatc tttgagggtc 9600
ttggacgcga cgagaagcgc gcggcaaacg ctctgcaaca ggaaaacagc gaaaatgaaa 9660
gtcactctgg agtgttggtg gaactcgagg gtgacaacgc gcgcctagcc gtactaaaac 9720
gcagcatcga ggtcacccac tttgcctacc cggcacttaa cctacccccc aaggtcatga 9780
gcacagtcat gagtgagctg atcgtgcgcc gtgcgcagcc cctggagagg gatgcaaatt 9840
tgcaagaaca aacagaggag ggcctacccg cagttggcga cgagcagcta gcgcgctggc 9900
ttcaaacgcg cgagcctgcc gacttggagg agcgacgcaa actaatgatg gccgcagtgc 9960
tcgttaccgt ggagcttgag tgcatgcagc ggttctttgc tgacccggag atgcagcgca 10020
agctagagga aacattgcac tacacctttc gacagggcta cgtacgccag gcctgcaaga 10080
tctccaacgt ggagctctgc aacctggtct cctaccttgg aattttgcac gaaaaccgcc 10140
ttgggcaaaa cgtgcttcat tccacgctca agggcgaggc gcgccgcgac tacgtccgcg 10200
actgcgttta cttatttcta tgctacacct ggcagacggc catgggcgtt tggcagcagt 10260
gcttggagga gtgcaacctc aaggagctgc agaaactgct aaagcaaaac ttgaaggacc 10320
tatggacggc cttcaacgag cgctccgtgg ccgcgcacct ggcggacatc attttccccg 10380
aacgcctgct taaaaccctg caacagggtc tgccagactt caccagtcaa agcatgttgc 10440
agaactttag gaactttatc ctagagcgct caggaatctt gcccgccacc tgctgtgcac 10500
ttcctagcga ctttgtgccc attaagtacc gcgaatgccc tccgccgctt tggggccact 10560
gctaccttct gcagctagcc aactaccttg cctaccactc tgacataatg gaagacgtga 10620
gcggtgacgg tctactggag tgtcactgtc gctgcaacct atgcaccccg caccgctccc 10680
tggtttgcaa ttcgcagctg cttaacgaaa gtcaaattat cggtaccttt gagctgcagg 10740
gtccctcgcc tgacgaaaag tccgcggctc cggggttgaa actcactccg gggctgtgga 10800
cgtcggctta ccttcgcaaa tttgtacctg aggactacca cgcccacgag attaggttct 10860
acgaagacca atcccgcccg ccaaatgcgg agcttaccgc ctgcgtcatt acccagggcc 10920
acattcttgg ccaattgcaa gccatcaaca aagcccgcca agagtttctg ctacgaaagg 10980
gacggggggt ttacttggac ccccagtccg gcgaggagct caacccaatc cccccgccgc 11040
cgcagcccta tcagcagcag ccgcgggccc ttgcttccca ggatggcacc caaaaagaag 11100
ctgcagctgc cgccgccacc cacggacgag gaggaatact gggacagtca ggcagaggag 11160
gttttggacg aggaggagga ggacatgatg gaagactggg agagcctaga cgaggaagct 11220
tccgaggtcg aagaggtgtc agacgaaaca ccgtcaccct cggtcgcatt cccctcgccg 11280
gcgccccaga aatcggcaac cggttccagc atggctacaa cctccgctcc tcaggcgccg 11340
ccggcactgc ccgttcgccg acccaaccgt agatgggaca ccactggaac cagggccggt 11400
aagtccaagc agccgccgcc gttagcccaa gagcaacaac agcgccaagg ctaccgctca 11460
tggcgcgggc acaagaacgc catagttgct tgcttgcaag actgtggggg caacatctcc 11520
ttcgcccgcc gctttcttct ctaccatcac ggcgtggcct tcccccgtaa catcctgcat 11580
tactaccgtc atctctacag cccatactgc accggcggca gcggcagcgg cagcaacagc 11640
agcggccaca cagaagcaaa ggcgaccgga tagcaagact ctgacaaagc ccaagaaatc 11700
cacagcggcg gcagcagcag gaggaggagc gctgcgtctg gcgcccaacg aacccgtatc 11760
gacccgcgag cttagaaaca ggatttttcc cactctgtat gctatatttc aacagagcag 11820
gggccaagaa caagagctga aaataaaaaa caggtctctg cgatccctca cccgcagctg 11880
cctgtatcac aaaagcgaag atcagcttcg gcgcacgctg gaagacgcgg aggctctctt 11940
cagtaaatac tgcgcgctga ctcttaagga ctagtttcgc gccctttctc aaatttaagc 12000
gcgaaaacta cgtcatctcc agcggccaca cccggcgcca gcacctgtcg tcagcgccat 12060
tatgagcaag gaaattccca cgccctacat gtggagttac cagccacaaa tgggacttgc 12120
ggctggagct gcccaagact actcaacccg aataaactac atgagcgcgg gaccccacat 12180
gatatcccgg gtcaacggaa tccgcgccca ccgaaaccga attctcttgg aacaggcggc 12240
tattaccacc acacctcgta ataaccttaa tccccgtagt tggcccgctg ccctggtgta 12300
ccaggaaagt cccgctccca ccactgtggt acttcccaga gacgcccagg ccgaagttca 12360
gatgactaac tcaggggcgc agcttgcggg cggctttcgt cacagggtgc ggtcgcccgg 12420
gcagggtata actcacctga caatcagagg gcgaggtatt cagctcaacg acgagtcggt 12480
gagctcctcg cttggtctcc gtccggacgg gacatttcag atcggcggcg ccggccgtcc 12540
ttcattcacg cctcgtcagg caatcctaac tctgcagacc tcgtcctctg agccgcgctc 12600
tggaggcatt ggaactctgc aatttattga ggagtttgtg ccatcggtct actttaaccc 12660
cttctcggga cctcccggcc actatccgga tcaatttatt cctaactttg acgcggtaaa 12720
ggactcggcg gacggctacg actgaatgtt aagtggagag gcagagcaac tgcgcctgaa 12780
acacctggtc cactgtcgcc gccacaagtg ctttgcccgc gactccggtg agttttgcta 12840
ctttgaattg cccgaggatc atatcgaggg cccggcgcac ggcgtccggc ttaccgccca 12900
gggagagctt gcccgtagcc tgattcggga gtttacccag cgccccctgc tagttgagcg 12960
ggacagggga ccctgtgttc tcactgtgat ttgcaactgt cctaaccttg gattacatca 13020
agatcctcta gttaattaac tagagtaccc ggggatctta ttccctttaa ctaataaaaa 13080
aaaataataa agcatcactt acttaaaatc agttagcaaa tttctgtcca gtttattcag 13140
cagcacctcc ttgccctcct cccagctctg gtattgcagc ttcctcctgg ctgcaaactt 13200
tctccacaat ctaaatggaa tgtcagtttc ctcctgttcc tgtccatccg cacccactat 13260
cttcatgttg ttgcagatga agcgcgcaag accgtctgaa gataccttca accccgtgta 13320
tccatatgac acggaaaccg gtcctccaac tgtgcctttt cttactcctc cctttgtatc 13380
ccccaatggg tttcaagaga gtccccctgg ggtactctct ttgcgcctat ccgaacctct 13440
agttacctcc aatggcatgc ttgcgctcaa aatgggcaac ggcctctctc tggacgaggc 13500
cggcaacctt acctcccaaa atgtaaccac tgtgagccca cctctcaaaa aaaccaagtc 13560
aaacataaac ctggaaatat ctgcacccct cacagttacc tcagaagccc taactgtggc 13620
tgccgccgca cctctaatgg tcgcgggcaa cacactcacc atgcaatcac aggccccgct 13680
aaccgtgcac gactccaaac ttagcattgc cacccaagga cccctcacag tgtcagaagg 13740
aaagctagcc ctgcaaacat caggccccct caccaccacc gatagcagta cccttactat 13800
cactgcctca ccccctctaa ctactgccac tggtagcttg ggcattgact tgaaagagcc 13860
catttataca caaaatggaa aactaggact aaagtacggg gctcctttgc atgtaacaga 13920
cgacctaaac actttgaccg tagcaactgg tccaggtgtg actattaata atacttcctt 13980
gcaaactaaa gttactggag ccttgggttt tgattcacaa ggcaatatgc aacttaatgt 14040
agcaggagga ctaaggattg attctcaaaa cagacgcctt atacttgatg ttagttatcc 14100
gtttgatgct caaaaccaac taaatctaag actaggacag ggccctcttt ttataaactc 14160
agcccacaac ttggatatta actacaacaa aggcctttac ttgtttacag cttcaaacaa 14220
ttccaaaaag cttgaggtta acctaagcac tgccaagggg ttgatgtttg acgctacagc 14280
catagccatt aatgcaggag atgggcttga atttggttca cctaatgcac caaacacaaa 14340
tcccctcaaa acaaaaattg gccatggcct agaatttgat tcaaacaagg ctatggttcc 14400
taaactagga actggcctta gttttgacag cacaggtgcc attacagtag gaaacaaaaa 14460
taatgataag ctaactttgt ggaccacacc agctccatct cctaactgta gactaaatgc 14520
agagaaagat gctaaactca ctttggtctt aacaaaatgt ggcagtcaaa tacttgctac 14580
agtttcagtt ttggctgtta aaggcagttt ggctccaata tctggaacag ttcaaagtgc 14640
tcatcttatt ataagatttg acgaaaatgg agtgctacta aacaattcct tcctggaccc 14700
agaatattgg aactttagaa atggagatct tactgaaggc acagcctata caaacgctgt 14760
tggatttatg cctaacctat cagcttatcc aaaatctcac ggtaaaactg ccaaaagtaa 14820
cattgtcagt caagtttact taaacggaga caaaactaaa cctgtaacac taaccattac 14880
actaaacggt acacaggaaa caggagacac aactccaagt gcatactcta tgtcattttc 14940
atgggactgg tctggccaca actacattaa tgaaatattt gccacatcct cttacacttt 15000
ttcatacatt gcccaagaat aaagaatcgt ttgtgttatg tttcaacgtg tttatttttc 15060
aattgcagaa aatttcaagt catttttcat tcagtagtat agccccacca ccacatagct 15120
tatacagatc accgtacctt aatcaaactc acagaaccct agtattcaac ctgccacctc 15180
cctcccaaca cacagagtac acagtccttt ctccccggct ggccttaaaa agcatcatat 15240
catgggtaac agacatattc ttaggtgtta tattccacac ggtttcctgt cgagccaaac 15300
gctcatcagt gatattaata aactccccgg gcagctcact taagttcatg tcgctgtcca 15360
gctgctgagc cacaggctgc tgtccaactt gcggttgctt aacgggcggc gaaggagaag 15420
tccacgccta catgggggta gagtcataat cgtgcatcag gatagggcgg tggtgctgca 15480
gcagcgcgcg aataaactgc tgccgccgcc gctccgtcct gcaggaatac aacatggcag 15540
tggtctcctc agcgatgatt cgcaccgccc gcagcataag gcgccttgtc ctccgggcac 15600
agcagcgcac cctgatctca cttaaatcag cacagtaact gcagcacagc accacaatat 15660
tgttcaaaat cccacagtgc aaggcgctgt atccaaagct catggcgggg accacagaac 15720
ccacgtggcc atcataccac aagcgcaggt agattaagtg gcgacccctc ataaacacgc 15780
tggacataaa cattacctct tttggcatgt tgtaattcac cacctcccgg taccatataa 15840
acctctgatt aaacatggcg ccatccacca ccatcctaaa ccagctggcc aaaacctgcc 15900
cgccggctat acactgcagg gaaccgggac tggaacaatg acagtggaga gcccaggact 15960
cgtaaccatg gatcatcatg ctcgtcatga tatcaatgtt ggcacaacac aggcacacgt 16020
gcatacactt cctcaggatt acaagctcct cccgcgttag aaccatatcc cagggaacaa 16080
cccattcctg aatcagcgta aatcccacac tgcagggaag acctcgcacg taactcacgt 16140
tgtgcattgt caaagtgtta cattcgggca gcagcggatg atcctccagt atggtagcgc 16200
gggtttctgt ctcaaaagga ggtagacgat ccctactgta cggagtgcgc cgagacaacc 16260
gagatcgtgt tggtcgtagt gtcatgccaa atggaacgcc ggacgtagtc atatttcctg 16320
aagcaaaacc aggtgcgggc gtgacaaaca gatctgcgtc tccggtctcg ccgcttagat 16380
cgctctgtgt agtagttgta gtatatccac tctctcaaag catccaggcg ccccctggct 16440
tcgggttcta tgtaaactcc ttcatgcgcc gctgccctga taacatccac caccgcagaa 16500
taagccacac ccagccaacc tacacattcg ttctgcgagt cacacacggg aggagcggga 16560
agagctggaa gaaccatgtt ttttttttta ttccaaaaga ttatccaaaa cctcaaaatg 16620
aagatctatt aagtgaacgc gctcccctcc ggtggcgtgg tcaaactcta cagccaaaga 16680
acagataatg gcatttgtaa gatgttgcac aatggcttcc aaaaggcaaa cggccctcac 16740
gtccaagtgg acgtaaaggc taaacccttc agggtgaatc tcctctataa acattccagc 16800
accttcaacc atgcccaaat aattctcatc tcgccacctt ctcaatatat ctctaagcaa 16860
atcccgaata ttaagtccgg ccattgtaaa aatctgctcc agagcgccct ccaccttcag 16920
cctcaagcag cgaatcatga ttgcaaaaat tcaggttcct cacagacctg tataagattc 16980
aaaagcggaa cattaacaaa aataccgcga tcccgtaggt cccttcgcag ggccagctga 17040
acataatcgt gcaggtctgc acggaccagc gcggccactt ccccgccagg aaccttgaca 17100
aaagaaccca cactgattat gacacgcata ctcggagcta tgctaaccag cgtagccccg 17160
atgtaagctt tgttgcatgg gcggcgatat aaaatgcaag gtgctgctca aaaaatcagg 17220
caaagcctcg cgcaaaaaag aaagcacatc gtagtcatgc tcatgcagat aaaggcaggt 17280
aagctccgga accaccacag aaaaagacac catttttctc tcaaacatgt ctgcgggttt 17340
ctgcataaac acaaaataaa ataacaaaaa aacatttaaa cattagaagc ctgtcttaca 17400
acaggaaaaa caacccttat aagcataaga cggactacgg ccatgccggc gtgaccgtaa 17460
aaaaactggt caccgtgatt aaaaagcacc accgacagct cctcggtcat gtccggagtc 17520
ataatgtaag actcggtaaa cacatcaggt tgattcatcg gtcagtgcta aaaagcgacc 17580
gaaatagccc gggggaatac atacccgcag gcgtagagac aacattacag cccccatagg 17640
aggtataaca aaattaatag gagagaaaaa cacataaaca cctgaaaaac cctcctgcct 17700
aggcaaaata gcaccctccc gctccagaac aacatacagc gcttccacag cggcagccat 17760
aacagtcagc cttaccagta aaaaagaaaa cctattaaaa aaacaccact cgacacggca 17820
ccagctcaat cagtcacagt gtaaaaaagg gccaagtgca gagcgagtat atataggact 17880
aaaaaatgac gtaacggtta aagtccacaa aaaacaccca gaaaaccgca cgcgaaccta 17940
cgcccagaaa cgaaagccaa aaaacccaca acttcctcaa atcgtcactt ccgttttccc 18000
acgttacgta acttcccatt ttaagaaaac tacaattccc aacacataca agttactccg 18060
ccctaaaacc tacgtcaccc gccccgttcc cacgccccgc gccacgtcac aaactccacc 18120
ccctcattat catattggct tcaatccaaa ataaggtata ttattgatga tttattttgg 18180
attgaagcca atatgataat gagggggtgg agtttgtgac gtggcgcggg gcgtgggaac 18240
ggggcgggtg acgtagtagt gtggcggaag tgtgatgttg caagtgtggc ggaacacatg 18300
taagcgacgg atgtggcaaa agtgacgttt ttggtgtgcg ccggatccac aggacgggtg 18360
tggtcgccat gatcgcgtag tcgatagtgg ctccaagtag cgaagcgagc aggactgggc 18420
ggcggccaaa gcggtcggac agtgctccga gaacgggtgc gcatagaaat tgcatcaacg 18480
catatagcgc tagcagcacg ccatagtgac tggcgatgct gtcggaatgg acgatatccc 18540
gcaagaggcc cggcagtacc ggcataacca agcctatgcc tacagcatcc agggtgacgg 18600
tgccgaggat gacgatgagc gcattgttag atttcataca cggtgcctga ctgcgttagc 18660
aatttaactg tgataaacta ccgcattaaa gcttatcgaa ttcgtaatca tggtcatagc 18720
tgtttcctgt gtgaaattgt tatccgctca caattccaca caacatacga gccggaagca 18780
taaagtgtaa agcctggggt gcctaatgag tgagctaact cacattaatt gcgttgcgct 18840
cactgcccgc tttccagtcg ggaaacctgt cgtgccagct gcattaatga atcggccaac 18900
gcgcggggag aggcggtttg cgtattgggc gc 18932
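The listing above is printed in rows of up to six 10-base groups, each row ending with the cumulative base count. The following is a minimal sketch, not part of the patent, of how such rows could be rejoined and checked against those counts. The function names `parse_listing_line` and `check_listing` are hypothetical, and the two sample rows are the final rows of the listing above.

```python
import re


def parse_listing_line(line):
    """Split one listing row into (bases, cumulative_count).

    Assumes lowercase a/c/g/t/n groups separated by whitespace,
    followed by a trailing integer, as in the rows above.
    """
    match = re.fullmatch(r"\s*((?:[acgtn]+\s+)+)(\d+)\s*", line.lower())
    if match is None:
        raise ValueError(f"not a sequence-listing row: {line!r}")
    bases = "".join(match.group(1).split())
    return bases, int(match.group(2))


def check_listing(lines, start=0):
    """Join rows into one string, verifying every trailing count.

    `start` is the number of bases that precede the first row given.
    """
    pieces = []
    total = start
    for line in lines:
        bases, end = parse_listing_line(line)
        total += len(bases)
        if total != end:
            raise ValueError(f"row claims {end} bases, counted {total}")
        pieces.append(bases)
    return "".join(pieces)


# The last two rows of the listing above; 18840 bases precede them.
sample = [
    "cactgcccgc tttccagtcg ggaaacctgt cgtgccagct gcattaatga atcggccaac 18900",
    "gcgcggggag aggcggtttg cgtattgggc gc 18932",
]
tail = check_listing(sample, start=18840)
print(len(tail))  # 92
```

A check of this kind catches rows that were dropped or truncated during extraction, because any missing base makes the running total disagree with the printed count.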
Claims (47)
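- An adenoviral helper plasmid comprising a nucleotide sequence encoding:
(a) an E2a protein;
(b) an E4 region;
(c) a VA RNA region; and
(d) an L4 region;
wherein the adenoviral helper plasmid does not comprise a nucleotide sequence encoding one or more of:
a fiber protein or a portion thereof;
L1-52/55K (packaging protein 3); and
a peripentonal hexon-associated protein.
- The adenoviral helper plasmid of claim 1, wherein the VA RNA region comprises a nucleotide sequence at least 80% identical to SEQ ID NO: 14.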
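- The adenoviral helper plasmid of claim 2, wherein the VA RNA region comprises:
(a) a VA RNAI nucleotide sequence at least 80% identical to SEQ ID NO: 16; and
(b) a VA RNAII nucleotide sequence at least 80% identical to SEQ ID NO: 18.
- The adenoviral helper plasmid of claim 1, wherein the VA RNA region comprises a nucleotide sequence at least 80% identical to SEQ ID NO: 15.
- The adenoviral helper plasmid of claim 4, wherein the VA RNA region comprises:
(a) a VA RNAI nucleotide sequence at least 80% identical to SEQ ID NO: 17; and
(b) a VA RNAII nucleotide sequence at least 80% identical to SEQ ID NO: 19.
- The adenoviral helper plasmid of claim 1, wherein the L4 region comprises a nucleotide sequence encoding an L4 (hexon assembly) protein having an amino acid sequence at least 80% identical to SEQ ID NO: 4.
- The adenoviral helper plasmid of claim 1, wherein the L4 region comprises a nucleotide sequence encoding a partial L4 (hexon assembly) protein having an amino acid sequence at least 80% identical to SEQ ID NO: 6.
- The adenoviral helper plasmid of claim 1, wherein the L4 region comprises a nucleotide sequence encoding a partial hexon-associated precursor (L4 pVIII) protein having an amino acid sequence at least 80% identical to SEQ ID NO: 13.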
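- The adenoviral helper plasmid of claim 7, wherein the nucleotide sequence encoding the partial L4 (hexon assembly) protein comprises an E2a promoter region.
- The adenoviral helper plasmid of claim 1, wherein the adenoviral helper plasmid comprises a nucleotide sequence encoding a partial DNA terminal protein having an amino acid sequence at least 80% identical to SEQ ID NO: 21.
- The adenoviral helper plasmid of claim 1, wherein the adenoviral helper plasmid does not comprise a nucleotide sequence encoding a DNA terminal protein.
- The adenoviral helper plasmid of claim 1, wherein the adenoviral helper plasmid comprises a nucleotide sequence encoding a partial 23 kDa endoprotease having an amino acid sequence at least 80% identical to SEQ ID NO: 23.
- The adenoviral helper plasmid of claim 1, wherein the adenoviral helper plasmid does not comprise a nucleotide sequence encoding a 23 kDa endoprotease.
- The adenoviral helper plasmid of claim 1, wherein expression of the E2a protein is under the control of an E2a promoter.
- The adenoviral helper plasmid of claim 1, wherein expression of the E2a protein is under the control of an E2a promoter and a chicken β-actin promoter, and wherein the chicken β-actin promoter is upstream of the E2a promoter.
- The adenoviral helper plasmid of claim 1, wherein expression of the E2a protein is under the control of a chicken β-actin promoter.
- The adenoviral helper plasmid of claim 15 or 16, wherein the chicken β-actin promoter comprises a nucleotide sequence at least 80% identical to SEQ ID NO: 26.
- The adenoviral helper plasmid of claim 1, wherein the adenoviral helper plasmid comprises an E2a polyadenylation signal downstream of E2a.
- The adenoviral helper plasmid of claim 1, wherein the adenoviral helper plasmid comprises an SV40 polyadenylation signal downstream of E2a.
- The adenoviral helper plasmid of claim 18, wherein the SV40 polyadenylation signal is downstream of the E2a polyadenylation signal.
- The adenoviral helper plasmid of claim 19 or 20, wherein the SV40 polyadenylation signal has a sequence at least 80% identical to SEQ ID NO: 28.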
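- The adenoviral helper plasmid of claim 1, further comprising nucleotides encoding HSV-1 UL30 and HSV-1 UL42,
wherein UL30 has an amino acid sequence at least 80% identical to SEQ ID NO: 30;
wherein UL42 has an amino acid sequence at least 80% identical to SEQ ID NO: 32; and
wherein UL30 and UL42 are separated by a P2A cleavage site having an amino acid sequence at least 80% identical to SEQ ID NO: 34.
- The adenoviral helper plasmid of claim 22, wherein expression of UL30 and UL42 is under the control of an EF-1α promoter of the plasmid.
- The adenoviral helper plasmid of claim 23, wherein the EF-1α promoter comprises a nucleotide sequence at least 80% identical to SEQ ID NO: 35.
- The adenoviral helper plasmid of claim 22, further comprising a β-globin polyadenylation signal downstream of UL42, wherein the β-globin polyadenylation signal comprises a nucleotide sequence at least 80% identical to SEQ ID NO: 36.
- The adenoviral helper plasmid of claim 1, further comprising a nucleotide sequence encoding HSV-1 UL29,
wherein UL29 has an amino acid sequence at least 80% identical to SEQ ID NO: 38.
- The adenoviral helper plasmid of claim 26, wherein expression of UL29 is under the control of an HSV TK promoter of the plasmid.
- The adenoviral helper plasmid of claim 27, wherein the HSV TK promoter comprises a nucleotide sequence at least 80% identical to SEQ ID NO: 39.
- The adenoviral helper plasmid of claim 26, further comprising an HSV TK polyadenylation signal downstream of UL29, wherein the HSV TK polyadenylation signal comprises a nucleotide sequence at least 80% identical to SEQ ID NO: 40.
- The adenoviral helper plasmid of claim 1, wherein the E4 region does not comprise E4orf1, and wherein the E4 region does not comprise E4orf2.
- The adenoviral helper plasmid of claim 1, wherein the E4 region is operably linked to an E4 mini promoter, and wherein the E4 mini promoter comprises a nucleotide sequence at least 80% identical to SEQ ID NO: 1.
- The adenoviral helper plasmid of claim 1, wherein the E4 region is operably linked to an SV40 promoter, and wherein the SV40 promoter comprises a nucleotide sequence at least 80% identical to SEQ ID NO: 2.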
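- An adenoviral helper plasmid comprising the following adenoviral DNA sequences or regions:
(a) E2a;
(b) an E4 region; and
(c) a VA RNA region;
wherein the adenoviral helper plasmid does not comprise one or more of the following components:
a fiber or a portion thereof;
L1-52/55K (packaging protein 3);
a peripentonal hexon-associated protein; and
an L4 region.
- An adenoviral helper plasmid having 80% sequence identity to any one of SEQ ID NOs: 41-66.
- A method of making a recombinant adeno-associated viral vector, comprising:
transfecting a producer cell with an AAV vector plasmid, a plasmid expressing AAV Rep-Cap, and the adenoviral helper plasmid of any one of claims 1-34.
- The method of claim 35, wherein the AAV vector plasmid comprises AAV inverted terminal repeats (ITRs) and a transgene of interest.
- A method of making a recombinant adeno-associated viral vector, comprising:
transfecting a producer cell with an AAV vector plasmid and the adenoviral helper plasmid of any one of claims 1-34,
wherein the producer cell stably expresses Rep-Cap.
- The method of claim 37, wherein the AAV vector plasmid comprises AAV inverted terminal repeats (ITRs) and a transgene of interest.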
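- The adenoviral helper plasmid of claim 1, wherein the L4 region comprises a nucleotide sequence encoding an L4 (hexon assembly) protein, the nucleotide sequence being at least 80% identical to SEQ ID NO: 3.
- The adenoviral helper plasmid of claim 1, wherein the L4 region comprises a nucleotide sequence encoding a partial L4 (hexon assembly) protein, the nucleotide sequence being at least 80% identical to SEQ ID NO: 5.
- The adenoviral helper plasmid of claim 1, wherein the L4 region comprises a nucleotide sequence encoding a partial hexon-associated precursor (L4 pVIII) protein, the nucleotide sequence being at least 80% identical to SEQ ID NO: 12.
- The adenoviral helper plasmid of claim 1, wherein the adenoviral helper plasmid comprises a nucleotide sequence encoding a partial DNA terminal protein having an amino acid sequence at least 80% identical to SEQ ID NO: 20.
- The adenoviral helper plasmid of claim 1, wherein the adenoviral helper plasmid comprises a nucleotide sequence encoding a partial 23 kDa endoprotease having an amino acid sequence at least 80% identical to SEQ ID NO: 22.
- The adenoviral helper plasmid of claim 1, wherein the adenoviral helper plasmid further comprises nucleotide sequences encoding HSV-1 UL30 and HSV-1 UL42,
wherein at least one of the nucleotide sequences is at least 80% identical to SEQ ID NO: 29;
wherein at least one of the nucleotide sequences is at least 80% identical to SEQ ID NO: 31; and
wherein UL30 and UL42 are separated by a P2A cleavage site encoded by a nucleotide sequence at least 80% identical to SEQ ID NO: 33.
- The adenoviral helper plasmid of claim 1, wherein the adenoviral helper plasmid further comprises a nucleotide sequence encoding HSV-1 UL29,
wherein the nucleotide sequence is at least 80% identical to SEQ ID NO: 37.
- The adenoviral helper plasmid of any one of the preceding claims, wherein the adenoviral helper plasmid comprises a resistance gene.
- The adenoviral helper plasmid of claim 46, wherein the resistance cassette is a kanamycin resistance gene.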
Applications Claiming Priority (3)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US202163188294P | 2021-05-13 | 2021-05-13 | |
US63/188,294 | 2021-05-13 | ||
PCT/US2022/029193 WO2022241215A2 (en) | 2021-05-13 | 2022-05-13 | Adenoviral helper plasmid |
Publications (1)
Publication Number | Publication Date |
---|---|
KR20240036508A (ko) | 2024-03-20 |
Family
ID=84029842
Family Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
KR1020237042905A KR20240036508A (ko) | 2021-05-13 | 2022-05-13 | Adenoviral helper plasmid |
Country Status (8)
Country | Link |
---|---|
EP (1) | EP4337236A2 (ko) |
JP (1) | JP2024518553A (ko) |
KR (1) | KR20240036508A (ko) |
CN (1) | CN117897167A (ko) |
AU (1) | AU2022272316A1 (ko) |
CA (1) | CA3218342A1 (ko) |
IL (1) | IL308472A (ko) |
WO (1) | WO2022241215A2 (ko) |
Family Cites Families (24)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5223391A (en) * | 1990-02-21 | 1993-06-29 | President And Fellows Of Harvard College | Inhibitors of herpes simplex virus replication |
US5543264A (en) * | 1990-06-29 | 1996-08-06 | Associated Universities, Inc. | Co-factor activated recombinant adenovirus proteinases |
US6670188B1 (en) * | 1998-04-24 | 2003-12-30 | Crucell Holland B.V. | Packaging systems for human recombinant adenovirus to be used in gene therapy |
AU2001257611A1 (en) * | 2000-04-28 | 2001-11-12 | Avigen, Inc. | Polynucleotides for use in recombinant adeno-associated virus virion production |
US7754201B2 (en) * | 2000-06-02 | 2010-07-13 | GenPhar, Inc | Method of vaccination through serotype rotation |
AU2001291162A1 (en) * | 2000-09-25 | 2002-04-08 | Regents Of The University Of Michigan | Production of viral vectors |
WO2004026265A2 (en) * | 2002-09-23 | 2004-04-01 | Macrogenics, Inc. | Compositions and methods for treatment of herpesvirus infections |
EP1606397A1 (en) * | 2003-03-17 | 2005-12-21 | Merck & Co., Inc. | Adenovirus serotype 24 vectors, nucleic acids and virus produced thereby |
JP2007530009A (ja) * | 2003-06-11 | 2007-11-01 | ワイス | ポリペプチドを産生する方法 |
AU2005274059A1 (en) * | 2004-08-09 | 2006-02-23 | Merck Sharp & Dohme Corp. | Adenoviral vector compositions |
CN103436507B (zh) * | 2006-05-05 | 2016-04-20 | 冈戈根股份有限公司 | 噬菌体衍生的抗微生物活性剂 |
EP2220217A2 (en) * | 2007-11-28 | 2010-08-25 | The Trustees of the University of Pennsylvania | Simian subfamily c adenoviruses sadv-40, -31, and-34 and uses thereof |
WO2010115172A2 (en) * | 2009-04-03 | 2010-10-07 | University Of Washington | Antigenic peptide of hsv-2 and methods for using same |
WO2010136981A2 (en) * | 2009-05-26 | 2010-12-02 | Cellectis | Meganuclease variants cleaving the genome of a pathogenic non-integrating virus and uses thereof |
US20110293511A1 (en) * | 2009-09-29 | 2011-12-01 | Terrance Grant Johns | Specific binding proteins and uses thereof |
WO2018017925A1 (en) * | 2016-07-22 | 2018-01-25 | President And Fellows Of Harvard College | Targeting lytic and latent herpes simplex virus 1 infection with crispr/cas9 |
JP7335224B2 (ja) * | 2017-07-18 | 2023-08-29 | ジェノヴィー エービー | 全長t細胞受容体オープンリーディングフレームの迅速な組立ておよび多様化のための二成分ベクターライブラリシステム |
US20210046193A1 (en) * | 2018-03-02 | 2021-02-18 | University Of Florida Research Foundation, Incorporated | Drug stabilized therapeutic transgenes delivered by adeno-associated virus expression |
AU2019261361A1 (en) * | 2018-04-23 | 2020-11-19 | Duke University | Downregulation of SNCA expression by targeted editing of DNA-methylation |
JP7384457B2 (ja) * | 2018-10-09 | 2023-11-21 | ナイキジェン,リミテッド | ウイルスベクターを調製するための組成物および方法 |
GB201816919D0 (en) * | 2018-10-17 | 2018-11-28 | Glaxosmithkline Ip Dev Ltd | Adeno-associated viral vector producer cell lines |
SG10201906637UA (en) * | 2019-07-17 | 2021-02-25 | Agency Science Tech & Res | Treatment/prevention of disease by linc complex inhibition |
AU2020374942A1 (en) * | 2019-11-01 | 2022-05-26 | University Of Houston System | Oncolytic virotherapy with induced anti-tumor immunity |
US11130787B2 (en) * | 2020-06-11 | 2021-09-28 | MBF Therapeutics, Inc. | Alphaherpesvirus glycoprotein d-encoding nucleic acid constructs and methods |
- 2022
- 2022-05-13 CN CN202280042134.2A patent/CN117897167A/zh active Pending
- 2022-05-13 AU AU2022272316A patent/AU2022272316A1/en active Pending
- 2022-05-13 CA CA3218342A patent/CA3218342A1/en active Pending
- 2022-05-13 KR KR1020237042905A patent/KR20240036508A/ko unknown
- 2022-05-13 IL IL308472A patent/IL308472A/en unknown
- 2022-05-13 EP EP22808405.9A patent/EP4337236A2/en active Pending
- 2022-05-13 WO PCT/US2022/029193 patent/WO2022241215A2/en active Application Filing
- 2022-05-13 JP JP2023570161A patent/JP2024518553A/ja active Pending
Also Published As
Publication number | Publication date |
---|---|
EP4337236A2 (en) | 2024-03-20 |
CA3218342A1 (en) | 2022-11-17 |
WO2022241215A3 (en) | 2023-02-02 |
IL308472A (en) | 2024-01-01 |
WO2022241215A2 (en) | 2022-11-17 |
JP2024518553A (ja) | 2024-05-01 |
AU2022272316A1 (en) | 2023-11-30 |
CN117897167A (zh) | 2024-04-16 |