CN111212659B - Hpv疫苗 - Google Patents
Hpv疫苗 Download PDFInfo
- Publication number
- CN111212659B CN111212659B CN201880053063.XA CN201880053063A CN111212659B CN 111212659 B CN111212659 B CN 111212659B CN 201880053063 A CN201880053063 A CN 201880053063A CN 111212659 B CN111212659 B CN 111212659B
- Authority
- CN
- China
- Prior art keywords
- leu
- asp
- sequence
- conserved
- ala
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Links
- 229960002566 papillomavirus vaccine Drugs 0.000 title description 6
- 108090000765 processed proteins & peptides Proteins 0.000 claims abstract description 332
- 102000004196 processed proteins & peptides Human genes 0.000 claims abstract description 137
- 229920001184 polypeptide Polymers 0.000 claims abstract description 127
- 108020004707 nucleic acids Proteins 0.000 claims abstract description 78
- 102000039446 nucleic acids Human genes 0.000 claims abstract description 78
- 150000007523 nucleic acids Chemical class 0.000 claims abstract description 78
- 229960005486 vaccine Drugs 0.000 claims abstract description 46
- 239000013603 viral vector Substances 0.000 claims abstract description 36
- 238000011282 treatment Methods 0.000 claims abstract description 7
- 238000011321 prophylaxis Methods 0.000 claims abstract 2
- 239000002671 adjuvant Substances 0.000 claims description 29
- 239000000203 mixture Substances 0.000 claims description 25
- 230000002163 immunogen Effects 0.000 claims description 19
- 208000009608 Papillomavirus Infections Diseases 0.000 claims description 13
- 238000002255 vaccination Methods 0.000 claims description 12
- 239000003937 drug carrier Substances 0.000 claims description 6
- 241000701161 unidentified adenovirus Species 0.000 claims description 4
- 241000700605 Viruses Species 0.000 claims description 3
- 239000003795 chemical substances by application Substances 0.000 claims description 3
- 230000002265 prevention Effects 0.000 claims description 3
- 241000124008 Mammalia Species 0.000 claims description 2
- 239000004480 active ingredient Substances 0.000 claims description 2
- 239000013600 plasmid vector Substances 0.000 claims description 2
- 108090000623 proteins and genes Proteins 0.000 abstract description 37
- 102000004169 proteins and genes Human genes 0.000 abstract description 32
- 108091036078 conserved sequence Proteins 0.000 abstract description 9
- 108091035707 Consensus sequence Proteins 0.000 description 90
- 241000701806 Human papillomavirus Species 0.000 description 75
- 150000001413 amino acids Chemical class 0.000 description 38
- 235000001014 amino acid Nutrition 0.000 description 35
- 235000018102 proteins Nutrition 0.000 description 31
- 108010038633 aspartylglutamate Proteins 0.000 description 27
- FMNBYVSGRCXWEK-FOHZUACHSA-N Asn-Thr-Gly Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O FMNBYVSGRCXWEK-FOHZUACHSA-N 0.000 description 20
- 102000003978 Tissue Plasminogen Activator Human genes 0.000 description 20
- 108090000373 Tissue Plasminogen Activator Proteins 0.000 description 20
- 210000003819 peripheral blood mononuclear cell Anatomy 0.000 description 20
- 108010087924 alanylproline Proteins 0.000 description 19
- 108010069205 aspartyl-phenylalanine Proteins 0.000 description 19
- 108010040856 glutamyl-cysteinyl-alanine Proteins 0.000 description 19
- OMOZPGCHVWOXHN-BQBZGAKWSA-N Gly-Met-Ser Chemical compound CSCC[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)CN OMOZPGCHVWOXHN-BQBZGAKWSA-N 0.000 description 18
- 108010016616 cysteinylglycine Proteins 0.000 description 18
- 108010061238 threonyl-glycine Proteins 0.000 description 18
- XAJRHVUUVUPFQL-ACZMJKKPSA-N Asp-Glu-Asp Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O XAJRHVUUVUPFQL-ACZMJKKPSA-N 0.000 description 17
- 108010060199 cysteinylproline Proteins 0.000 description 17
- 108020004414 DNA Proteins 0.000 description 14
- 241000880493 Leptailurus serval Species 0.000 description 14
- 108010047857 aspartylglycine Proteins 0.000 description 14
- ZPMNECSEJXXNBE-CIUDSAMLSA-N Asn-Cys-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(O)=O ZPMNECSEJXXNBE-CIUDSAMLSA-N 0.000 description 13
- TVIZQBFURPLQDV-DJFWLOJKSA-N Asp-His-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CC(=O)O)N TVIZQBFURPLQDV-DJFWLOJKSA-N 0.000 description 13
- HTSSXFASOUSJQG-IHPCNDPISA-N Asp-Tyr-Trp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O HTSSXFASOUSJQG-IHPCNDPISA-N 0.000 description 13
- SITLTJHOQZFJGG-UHFFFAOYSA-N N-L-alpha-glutamyl-L-valine Natural products CC(C)C(C(O)=O)NC(=O)C(N)CCC(O)=O SITLTJHOQZFJGG-UHFFFAOYSA-N 0.000 description 13
- NBDHWLZEMKSVHH-UVBJJODRSA-N Pro-Trp-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@@H]3CCCN3 NBDHWLZEMKSVHH-UVBJJODRSA-N 0.000 description 13
- YKLNMGJYMNPBCP-ACZMJKKPSA-N Glu-Asn-Asp Chemical compound C(CC(=O)O)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N YKLNMGJYMNPBCP-ACZMJKKPSA-N 0.000 description 12
- 241000341655 Human papillomavirus type 16 Species 0.000 description 12
- YRNRVKTYDSLKMD-KKUMJFAQSA-N Lys-Ser-Tyr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O YRNRVKTYDSLKMD-KKUMJFAQSA-N 0.000 description 12
- 201000010099 disease Diseases 0.000 description 12
- 208000037265 diseases, disorders, signs and symptoms Diseases 0.000 description 12
- 108010049041 glutamylalanine Proteins 0.000 description 12
- 108010090114 methionyl-tyrosyl-lysine Proteins 0.000 description 12
- 239000013612 plasmid Substances 0.000 description 12
- 230000037452 priming Effects 0.000 description 12
- 108010077112 prolyl-proline Proteins 0.000 description 12
- 241000388189 Alphapapillomavirus 6 Species 0.000 description 11
- 241000388169 Alphapapillomavirus 7 Species 0.000 description 11
- 241000701830 Human papillomavirus type 31 Species 0.000 description 11
- 241000701603 Human papillomavirus type 52 Species 0.000 description 11
- NBJAAWYRLGCJOF-UGYAYLCHSA-N Ile-Asp-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N NBJAAWYRLGCJOF-UGYAYLCHSA-N 0.000 description 11
- 101710132593 Protein E2 Proteins 0.000 description 11
- 108010069117 seryl-lysyl-aspartic acid Proteins 0.000 description 11
- BLGHHPHXVJWCNK-GUBZILKMSA-N Ala-Gln-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O BLGHHPHXVJWCNK-GUBZILKMSA-N 0.000 description 10
- XJQRWGXKUSDEFI-ACZMJKKPSA-N Asp-Glu-Asn Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O XJQRWGXKUSDEFI-ACZMJKKPSA-N 0.000 description 10
- GGLIDLCEPDHEJO-BQBZGAKWSA-N Gly-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)CN GGLIDLCEPDHEJO-BQBZGAKWSA-N 0.000 description 10
- 241000701784 Human papillomavirus type 58 Species 0.000 description 10
- HVWXAQVMRBKKFE-UGYAYLCHSA-N Ile-Asp-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N HVWXAQVMRBKKFE-UGYAYLCHSA-N 0.000 description 10
- IFMPDNRWZZEZSL-SRVKXCTJSA-N Leu-Leu-Cys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CS)C(O)=O IFMPDNRWZZEZSL-SRVKXCTJSA-N 0.000 description 10
- WFCKERTZVCQXKH-KBPBESRZSA-N Leu-Tyr-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(O)=O WFCKERTZVCQXKH-KBPBESRZSA-N 0.000 description 10
- 241000699670 Mus sp. Species 0.000 description 10
- JAGGEZACYAAMIL-CQDKDKBSSA-N Tyr-Lys-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC1=CC=C(C=C1)O)N JAGGEZACYAAMIL-CQDKDKBSSA-N 0.000 description 10
- 239000012634 fragment Substances 0.000 description 10
- 108010050848 glycylleucine Proteins 0.000 description 10
- XXAMCEGRCZQGEM-ZLUOBGJFSA-N Asp-Ser-Asn Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O XXAMCEGRCZQGEM-ZLUOBGJFSA-N 0.000 description 9
- AVEGDIAXTDVBJS-XUXIUFHCSA-N Leu-Ile-Arg Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O AVEGDIAXTDVBJS-XUXIUFHCSA-N 0.000 description 9
- OVTOTTGZBWXLFU-QXEWZRGKSA-N Met-Val-Asp Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC(O)=O OVTOTTGZBWXLFU-QXEWZRGKSA-N 0.000 description 9
- XZFYRXDAULDNFX-UHFFFAOYSA-N N-L-cysteinyl-L-phenylalanine Natural products SCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 XZFYRXDAULDNFX-UHFFFAOYSA-N 0.000 description 9
- XMBSYZWANAQXEV-UHFFFAOYSA-N N-alpha-L-glutamyl-L-phenylalanine Natural products OC(=O)CCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 XMBSYZWANAQXEV-UHFFFAOYSA-N 0.000 description 9
- LRBSWBVUCLLRLU-BZSNNMDCSA-N Phe-Leu-Lys Chemical compound CC(C)C[C@H](NC(=O)[C@@H](N)Cc1ccccc1)C(=O)N[C@@H](CCCCN)C(O)=O LRBSWBVUCLLRLU-BZSNNMDCSA-N 0.000 description 9
- BTPAWKABYQMKKN-LKXGYXEUSA-N Ser-Asp-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O BTPAWKABYQMKKN-LKXGYXEUSA-N 0.000 description 9
- 210000001744 T-lymphocyte Anatomy 0.000 description 9
- MPUMPERGHHJGRP-WEDXCCLWSA-N Thr-Gly-Lys Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)N[C@@H](CCCCN)C(=O)O)N)O MPUMPERGHHJGRP-WEDXCCLWSA-N 0.000 description 9
- 108010011559 alanylphenylalanine Proteins 0.000 description 9
- 108010029539 arginyl-prolyl-proline Proteins 0.000 description 9
- 108010092854 aspartyllysine Proteins 0.000 description 9
- 108010027338 isoleucylcysteine Proteins 0.000 description 9
- 238000000034 method Methods 0.000 description 9
- XDGBFDYXZCMYEX-NUMRIWBASA-N Asp-Glu-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC(=O)O)N)O XDGBFDYXZCMYEX-NUMRIWBASA-N 0.000 description 8
- DTFJUSWYECELTM-BPUTZDHNSA-N Cys-Pro-Trp Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CS)N)C(=O)N[C@@H](CC2=CNC3=CC=CC=C32)C(=O)O DTFJUSWYECELTM-BPUTZDHNSA-N 0.000 description 8
- AKJRHDMTEJXTPV-ACZMJKKPSA-N Glu-Asn-Ala Chemical compound C[C@H](NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CCC(O)=O)C(O)=O AKJRHDMTEJXTPV-ACZMJKKPSA-N 0.000 description 8
- SAEBUDRWKUXLOM-ACZMJKKPSA-N Glu-Cys-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CS)NC(=O)[C@@H](N)CCC(O)=O SAEBUDRWKUXLOM-ACZMJKKPSA-N 0.000 description 8
- NVTPVQLIZCOJFK-FOHZUACHSA-N Gly-Thr-Asp Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(O)=O NVTPVQLIZCOJFK-FOHZUACHSA-N 0.000 description 8
- WMTOVWLLDGQGCV-GUBZILKMSA-N Leu-Glu-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CS)C(=O)O)N WMTOVWLLDGQGCV-GUBZILKMSA-N 0.000 description 8
- 108010079364 N-glycylalanine Proteins 0.000 description 8
- 101710132596 Protein E4 Proteins 0.000 description 8
- 108010060035 arginylproline Proteins 0.000 description 8
- 108010004073 cysteinylcysteine Proteins 0.000 description 8
- VPZXBVLAVMBEQI-UHFFFAOYSA-N glycyl-DL-alpha-alanine Natural products OC(=O)C(C)NC(=O)CN VPZXBVLAVMBEQI-UHFFFAOYSA-N 0.000 description 8
- AAQGRPOPTAUUBM-ZLUOBGJFSA-N Ala-Ala-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(O)=O AAQGRPOPTAUUBM-ZLUOBGJFSA-N 0.000 description 7
- WDIYWDJLXOCGRW-ACZMJKKPSA-N Ala-Asp-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O WDIYWDJLXOCGRW-ACZMJKKPSA-N 0.000 description 7
- BUDNAJYVCUHLSV-ZLUOBGJFSA-N Ala-Asp-Ser Chemical compound C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O BUDNAJYVCUHLSV-ZLUOBGJFSA-N 0.000 description 7
- OTZMRMHZCMZOJZ-SRVKXCTJSA-N Arg-Leu-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O OTZMRMHZCMZOJZ-SRVKXCTJSA-N 0.000 description 7
- VYLVOMUVLMGCRF-ZLUOBGJFSA-N Asn-Asp-Ser Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O VYLVOMUVLMGCRF-ZLUOBGJFSA-N 0.000 description 7
- FMDCYTBSPZMPQE-JBDRJPRFSA-N Cys-Ala-Ile Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O FMDCYTBSPZMPQE-JBDRJPRFSA-N 0.000 description 7
- KZZYVYWSXMFYEC-DCAQKATOSA-N Cys-Val-Leu Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O KZZYVYWSXMFYEC-DCAQKATOSA-N 0.000 description 7
- FIQQRCFQXGLOSZ-WDSKDSINSA-N Gly-Glu-Asp Chemical compound [H]NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O FIQQRCFQXGLOSZ-WDSKDSINSA-N 0.000 description 7
- RVOMPSJXSRPFJT-DCAQKATOSA-N Lys-Ala-Arg Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O RVOMPSJXSRPFJT-DCAQKATOSA-N 0.000 description 7
- WVJNGSFKBKOKRV-AJNGGQMLSA-N Lys-Leu-Ile Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WVJNGSFKBKOKRV-AJNGGQMLSA-N 0.000 description 7
- KZNQNBZMBZJQJO-UHFFFAOYSA-N N-glycyl-L-proline Natural products NCC(=O)N1CCCC1C(O)=O KZNQNBZMBZJQJO-UHFFFAOYSA-N 0.000 description 7
- LSIWVWRUTKPXDS-DCAQKATOSA-N Pro-Gln-Arg Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O LSIWVWRUTKPXDS-DCAQKATOSA-N 0.000 description 7
- FKVNLUZHSFCNGY-RVMXOQNASA-N Pro-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@@H]2CCCN2 FKVNLUZHSFCNGY-RVMXOQNASA-N 0.000 description 7
- SBVPYBFMIGDIDX-SRVKXCTJSA-N Pro-Pro-Pro Chemical compound OC(=O)[C@@H]1CCCN1C(=O)[C@H]1N(C(=O)[C@H]2NCCC2)CCC1 SBVPYBFMIGDIDX-SRVKXCTJSA-N 0.000 description 7
- 101710132594 Protein E6 Proteins 0.000 description 7
- VZBWRZGNEPBRDE-HZUKXOBISA-N Trp-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)N VZBWRZGNEPBRDE-HZUKXOBISA-N 0.000 description 7
- KOSRFJWDECSPRO-UHFFFAOYSA-N alpha-L-glutamyl-L-glutamic acid Natural products OC(=O)CCC(N)C(=O)NC(CCC(O)=O)C(O)=O KOSRFJWDECSPRO-UHFFFAOYSA-N 0.000 description 7
- 108010013835 arginine glutamate Proteins 0.000 description 7
- 108010089804 glycyl-threonine Proteins 0.000 description 7
- 108010044374 isoleucyl-tyrosine Proteins 0.000 description 7
- 230000003612 virological effect Effects 0.000 description 7
- VKKYFICVTYKFIO-CIUDSAMLSA-N Arg-Ala-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCN=C(N)N VKKYFICVTYKFIO-CIUDSAMLSA-N 0.000 description 6
- ITHMWNNUDPJJER-ULQDDVLXSA-N Arg-His-Tyr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O ITHMWNNUDPJJER-ULQDDVLXSA-N 0.000 description 6
- IGFJVXOATGZTHD-UHFFFAOYSA-N Arg-Phe-His Natural products NC(CCNC(=N)N)C(=O)NC(Cc1ccccc1)C(=O)NC(Cc2c[nH]cn2)C(=O)O IGFJVXOATGZTHD-UHFFFAOYSA-N 0.000 description 6
- CMLGVVWQQHUXOZ-GHCJXIJMSA-N Asn-Ala-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O CMLGVVWQQHUXOZ-GHCJXIJMSA-N 0.000 description 6
- ZMWDUIIACVLIHK-GHCJXIJMSA-N Asn-Cys-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CC(=O)N)N ZMWDUIIACVLIHK-GHCJXIJMSA-N 0.000 description 6
- QXHVOUSPVAWEMX-ZLUOBGJFSA-N Asp-Asp-Ser Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O QXHVOUSPVAWEMX-ZLUOBGJFSA-N 0.000 description 6
- GHODABZPVZMWCE-FXQIFTODSA-N Asp-Glu-Glu Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O GHODABZPVZMWCE-FXQIFTODSA-N 0.000 description 6
- JJQGZGOEDSSHTE-FOHZUACHSA-N Asp-Thr-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O JJQGZGOEDSSHTE-FOHZUACHSA-N 0.000 description 6
- ZGKXAUIVGIBISK-SZMVWBNQSA-N Glu-His-Trp Chemical compound N[C@@H](CCC(O)=O)C(=O)N[C@@H](Cc1c[nH]cn1)C(=O)N[C@@H](Cc1c[nH]c2ccccc12)C(O)=O ZGKXAUIVGIBISK-SZMVWBNQSA-N 0.000 description 6
- JBCLFWXMTIKCCB-UHFFFAOYSA-N H-Gly-Phe-OH Natural products NCC(=O)NC(C(O)=O)CC1=CC=CC=C1 JBCLFWXMTIKCCB-UHFFFAOYSA-N 0.000 description 6
- UOPBQSJRBONRON-STECZYCISA-N Ile-Met-Tyr Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCSC)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 UOPBQSJRBONRON-STECZYCISA-N 0.000 description 6
- YTJFXEDRUOQGSP-DCAQKATOSA-N Lys-Pro-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O YTJFXEDRUOQGSP-DCAQKATOSA-N 0.000 description 6
- YBAFDPFAUTYYRW-UHFFFAOYSA-N N-L-alpha-glutamyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CCC(O)=O YBAFDPFAUTYYRW-UHFFFAOYSA-N 0.000 description 6
- VJLLEKDQJSMHRU-STQMWFEESA-N Phe-Gly-Met Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)NCC(=O)N[C@@H](CCSC)C(O)=O VJLLEKDQJSMHRU-STQMWFEESA-N 0.000 description 6
- FTVRVZNYIYWJGB-ACZMJKKPSA-N Ser-Asp-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O FTVRVZNYIYWJGB-ACZMJKKPSA-N 0.000 description 6
- VZQRNAYURWAEFE-KKUMJFAQSA-N Ser-Leu-Phe Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 VZQRNAYURWAEFE-KKUMJFAQSA-N 0.000 description 6
- NLSNVZAREYQMGR-HJGDQZAQSA-N Thr-Asp-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O NLSNVZAREYQMGR-HJGDQZAQSA-N 0.000 description 6
- HOVLHEKTGVIKAP-WDCWCFNPSA-N Thr-Leu-Gln Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O HOVLHEKTGVIKAP-WDCWCFNPSA-N 0.000 description 6
- MROIJTGJGIDEEJ-RCWTZXSCSA-N Thr-Pro-Pro Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 MROIJTGJGIDEEJ-RCWTZXSCSA-N 0.000 description 6
- LORJKYIPJIRIRT-BVSLBCMMSA-N Trp-Pro-Tyr Chemical compound C([C@H](NC(=O)[C@@H]1CCCN1C(=O)[C@H](CC=1C2=CC=CC=C2NC=1)N)C(O)=O)C1=CC=C(O)C=C1 LORJKYIPJIRIRT-BVSLBCMMSA-N 0.000 description 6
- BMOFUVHDBROBSE-DCAQKATOSA-N Val-Leu-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](C(C)C)N BMOFUVHDBROBSE-DCAQKATOSA-N 0.000 description 6
- GUIYPEKUEMQBIK-JSGCOSHPSA-N Val-Tyr-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](Cc1ccc(O)cc1)C(=O)NCC(O)=O GUIYPEKUEMQBIK-JSGCOSHPSA-N 0.000 description 6
- 108010062635 beta-lactotensin Proteins 0.000 description 6
- 108010069495 cysteinyltyrosine Proteins 0.000 description 6
- FSXRLASFHBWESK-UHFFFAOYSA-N dipeptide phenylalanyl-tyrosine Natural products C=1C=C(O)C=CC=1CC(C(O)=O)NC(=O)C(N)CC1=CC=CC=C1 FSXRLASFHBWESK-UHFFFAOYSA-N 0.000 description 6
- 230000006870 function Effects 0.000 description 6
- 108010077515 glycylproline Proteins 0.000 description 6
- 108010025306 histidylleucine Proteins 0.000 description 6
- 108010026333 seryl-proline Proteins 0.000 description 6
- DECCMEWNXSNSDO-ZLUOBGJFSA-N Ala-Cys-Ala Chemical compound C[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@@H](C)C(O)=O DECCMEWNXSNSDO-ZLUOBGJFSA-N 0.000 description 5
- HWPXGQCMZITGFN-XVYDVKMFSA-N Ala-Cys-His Chemical compound C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N HWPXGQCMZITGFN-XVYDVKMFSA-N 0.000 description 5
- CVHJIWVKTFNGHT-ACZMJKKPSA-N Ala-Gln-Cys Chemical compound C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CS)C(=O)O)N CVHJIWVKTFNGHT-ACZMJKKPSA-N 0.000 description 5
- FVSOUJZKYWEFOB-KBIXCLLPSA-N Ala-Gln-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@H](C)N FVSOUJZKYWEFOB-KBIXCLLPSA-N 0.000 description 5
- XCZXVTHYGSMQGH-NAKRPEOUSA-N Ala-Ile-Met Chemical compound C[C@H]([NH3+])C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCSC)C([O-])=O XCZXVTHYGSMQGH-NAKRPEOUSA-N 0.000 description 5
- QCTFKEJEIMPOLW-JURCDPSOSA-N Ala-Ile-Phe Chemical compound C[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 QCTFKEJEIMPOLW-JURCDPSOSA-N 0.000 description 5
- QUIGLPSHIFPEOV-CIUDSAMLSA-N Ala-Lys-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O QUIGLPSHIFPEOV-CIUDSAMLSA-N 0.000 description 5
- DHONNEYAZPNGSG-UBHSHLNASA-N Ala-Val-Phe Chemical compound C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 DHONNEYAZPNGSG-UBHSHLNASA-N 0.000 description 5
- OZNSCVPYWZRQPY-CIUDSAMLSA-N Arg-Asp-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O OZNSCVPYWZRQPY-CIUDSAMLSA-N 0.000 description 5
- VDBKFYYIBLXEIF-GUBZILKMSA-N Arg-Gln-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O VDBKFYYIBLXEIF-GUBZILKMSA-N 0.000 description 5
- NVUIWHJLPSZZQC-CYDGBPFRSA-N Arg-Ile-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O NVUIWHJLPSZZQC-CYDGBPFRSA-N 0.000 description 5
- COXMUHNBYCVVRG-DCAQKATOSA-N Arg-Leu-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O COXMUHNBYCVVRG-DCAQKATOSA-N 0.000 description 5
- INXWADWANGLMPJ-JYJNAYRXSA-N Arg-Phe-Arg Chemical compound NC(=N)NCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CCCNC(N)=N)C(O)=O)CC1=CC=CC=C1 INXWADWANGLMPJ-JYJNAYRXSA-N 0.000 description 5
- KNMRXHIAVXHCLW-ZLUOBGJFSA-N Asp-Asn-Ser Chemical compound C([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CO)C(=O)O)N)C(=O)O KNMRXHIAVXHCLW-ZLUOBGJFSA-N 0.000 description 5
- DXQOQMCLWWADMU-ACZMJKKPSA-N Asp-Gln-Ser Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O DXQOQMCLWWADMU-ACZMJKKPSA-N 0.000 description 5
- KHGPWGKPYHPOIK-QWRGUYRKSA-N Asp-Gly-Phe Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O KHGPWGKPYHPOIK-QWRGUYRKSA-N 0.000 description 5
- JSHWXQIZOCVWIA-ZKWXMUAHSA-N Asp-Ser-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O JSHWXQIZOCVWIA-ZKWXMUAHSA-N 0.000 description 5
- OZHXXYOHPLLLMI-CIUDSAMLSA-N Cys-Lys-Ala Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O OZHXXYOHPLLLMI-CIUDSAMLSA-N 0.000 description 5
- KGIHMGPYGXBYJJ-SRVKXCTJSA-N Cys-Lys-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CS KGIHMGPYGXBYJJ-SRVKXCTJSA-N 0.000 description 5
- HMIXCETWRYDVMO-GUBZILKMSA-N Gln-Pro-Glu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O HMIXCETWRYDVMO-GUBZILKMSA-N 0.000 description 5
- SOEXCCGNHQBFPV-DLOVCJGASA-N Gln-Val-Val Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C(C)C)C(O)=O SOEXCCGNHQBFPV-DLOVCJGASA-N 0.000 description 5
- XXCDTYBVGMPIOA-FXQIFTODSA-N Glu-Asp-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O XXCDTYBVGMPIOA-FXQIFTODSA-N 0.000 description 5
- QMOSCLNJVKSHHU-YUMQZZPRSA-N Glu-Met-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCSC)C(=O)NCC(O)=O QMOSCLNJVKSHHU-YUMQZZPRSA-N 0.000 description 5
- HZISRJBYZAODRV-XQXXSGGOSA-N Glu-Thr-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O HZISRJBYZAODRV-XQXXSGGOSA-N 0.000 description 5
- QPDUVFSVVAOUHE-XVKPBYJWSA-N Gly-Gln-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CCC(N)=O)NC(=O)CN)C(O)=O QPDUVFSVVAOUHE-XVKPBYJWSA-N 0.000 description 5
- YIFUFYZELCMPJP-YUMQZZPRSA-N Gly-Leu-Cys Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CS)C(O)=O YIFUFYZELCMPJP-YUMQZZPRSA-N 0.000 description 5
- IUKIDFVOUHZRAK-QWRGUYRKSA-N Gly-Lys-His Chemical compound NCCCC[C@H](NC(=O)CN)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 IUKIDFVOUHZRAK-QWRGUYRKSA-N 0.000 description 5
- 208000032843 Hemorrhage Diseases 0.000 description 5
- LSQHWKPPOFDHHZ-YUMQZZPRSA-N His-Asp-Gly Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)NCC(=O)O)N LSQHWKPPOFDHHZ-YUMQZZPRSA-N 0.000 description 5
- IIVZNQCUUMBBKF-GVXVVHGQSA-N His-Gln-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CC1=CN=CN1 IIVZNQCUUMBBKF-GVXVVHGQSA-N 0.000 description 5
- AHEBIAHEZWQVHB-QTKMDUPCSA-N His-Thr-Met Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N)O AHEBIAHEZWQVHB-QTKMDUPCSA-N 0.000 description 5
- AQTWDZDISVGCAC-CFMVVWHZSA-N Ile-Asp-Tyr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N AQTWDZDISVGCAC-CFMVVWHZSA-N 0.000 description 5
- JLWLMGADIQFKRD-QSFUFRPTSA-N Ile-His-Ala Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C)C(O)=O)CC1=CN=CN1 JLWLMGADIQFKRD-QSFUFRPTSA-N 0.000 description 5
- IITVUURPOYGCTD-NAKRPEOUSA-N Ile-Pro-Ala Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O IITVUURPOYGCTD-NAKRPEOUSA-N 0.000 description 5
- WUFYAPWIHCUMLL-CIUDSAMLSA-N Leu-Asn-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(O)=O WUFYAPWIHCUMLL-CIUDSAMLSA-N 0.000 description 5
- RRSLQOLASISYTB-CIUDSAMLSA-N Leu-Cys-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(O)=O)C(O)=O RRSLQOLASISYTB-CIUDSAMLSA-N 0.000 description 5
- KAFOIVJDVSZUMD-UHFFFAOYSA-N Leu-Gln-Gln Natural products CC(C)CC(N)C(=O)NC(CCC(N)=O)C(=O)NC(CCC(N)=O)C(O)=O KAFOIVJDVSZUMD-UHFFFAOYSA-N 0.000 description 5
- CQGSYZCULZMEDE-UHFFFAOYSA-N Leu-Gln-Pro Natural products CC(C)CC(N)C(=O)NC(CCC(N)=O)C(=O)N1CCCC1C(O)=O CQGSYZCULZMEDE-UHFFFAOYSA-N 0.000 description 5
- IWTBYNQNAPECCS-AVGNSLFASA-N Leu-Glu-His Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 IWTBYNQNAPECCS-AVGNSLFASA-N 0.000 description 5
- KOSWSHVQIVTVQF-ZPFDUUQYSA-N Leu-Ile-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O KOSWSHVQIVTVQF-ZPFDUUQYSA-N 0.000 description 5
- NFLFJGGKOHYZJF-BJDJZHNGSA-N Lys-Ala-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCCN NFLFJGGKOHYZJF-BJDJZHNGSA-N 0.000 description 5
- WSXTWLJHTLRFLW-SRVKXCTJSA-N Lys-Ala-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCCN)C(O)=O WSXTWLJHTLRFLW-SRVKXCTJSA-N 0.000 description 5
- SFQPJNQDUUYCLA-BJDJZHNGSA-N Lys-Cys-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CCCCN)N SFQPJNQDUUYCLA-BJDJZHNGSA-N 0.000 description 5
- UZVWDRPUTHXQAM-FXQIFTODSA-N Met-Asp-Ala Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O UZVWDRPUTHXQAM-FXQIFTODSA-N 0.000 description 5
- BEZJTLKUMFMITF-AVGNSLFASA-N Met-Lys-Arg Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CCCNC(N)=N BEZJTLKUMFMITF-AVGNSLFASA-N 0.000 description 5
- PEFJUUYFEGBXFA-BZSNNMDCSA-N Phe-Lys-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC1=CC=CC=C1 PEFJUUYFEGBXFA-BZSNNMDCSA-N 0.000 description 5
- RVEVENLSADZUMS-IHRRRGAJSA-N Phe-Pro-Asn Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O RVEVENLSADZUMS-IHRRRGAJSA-N 0.000 description 5
- CGBYDGAJHSOGFQ-LPEHRKFASA-N Pro-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@@H]2CCCN2 CGBYDGAJHSOGFQ-LPEHRKFASA-N 0.000 description 5
- ICTZKEXYDDZZFP-SRVKXCTJSA-N Pro-Arg-Pro Chemical compound N([C@@H](CCCN=C(N)N)C(=O)N1[C@@H](CCC1)C(O)=O)C(=O)[C@@H]1CCCN1 ICTZKEXYDDZZFP-SRVKXCTJSA-N 0.000 description 5
- TUYWCHPXKQTISF-LPEHRKFASA-N Pro-Cys-Pro Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CS)C(=O)N2CCC[C@@H]2C(=O)O TUYWCHPXKQTISF-LPEHRKFASA-N 0.000 description 5
- DTQIXTOJHKVEOH-DCAQKATOSA-N Pro-His-Cys Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC2=CN=CN2)C(=O)N[C@@H](CS)C(=O)O DTQIXTOJHKVEOH-DCAQKATOSA-N 0.000 description 5
- JLMZKEQFMVORMA-SRVKXCTJSA-N Pro-Pro-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 JLMZKEQFMVORMA-SRVKXCTJSA-N 0.000 description 5
- HOJUNFDJDAPVBI-BZSNNMDCSA-N Pro-Trp-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@@H]3CCCN3 HOJUNFDJDAPVBI-BZSNNMDCSA-N 0.000 description 5
- VAIZFHMTBFYJIA-ACZMJKKPSA-N Ser-Asp-Gln Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCC(N)=O VAIZFHMTBFYJIA-ACZMJKKPSA-N 0.000 description 5
- BGOWRLSWJCVYAQ-CIUDSAMLSA-N Ser-Asp-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O BGOWRLSWJCVYAQ-CIUDSAMLSA-N 0.000 description 5
- ZOHGLPQGEHSLPD-FXQIFTODSA-N Ser-Gln-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O ZOHGLPQGEHSLPD-FXQIFTODSA-N 0.000 description 5
- WBINSDOPZHQPPM-AVGNSLFASA-N Ser-Glu-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CO)N)O WBINSDOPZHQPPM-AVGNSLFASA-N 0.000 description 5
- KKKVOZNCLALMPV-XKBZYTNZSA-N Ser-Thr-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O KKKVOZNCLALMPV-XKBZYTNZSA-N 0.000 description 5
- ODRUTDLAONAVDV-IHRRRGAJSA-N Ser-Val-Tyr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O ODRUTDLAONAVDV-IHRRRGAJSA-N 0.000 description 5
- VTVVYQOXJCZVEB-WDCWCFNPSA-N Thr-Leu-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O VTVVYQOXJCZVEB-WDCWCFNPSA-N 0.000 description 5
- YRJOLUDFVAUXLI-GSSVUCPTSA-N Thr-Thr-Asp Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(O)=O YRJOLUDFVAUXLI-GSSVUCPTSA-N 0.000 description 5
- VYVBSMCZNHOZGD-RCWTZXSCSA-N Thr-Val-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C(C)C)C(O)=O VYVBSMCZNHOZGD-RCWTZXSCSA-N 0.000 description 5
- DVWAIHZOPSYMSJ-ZVZYQTTQSA-N Trp-Glu-Val Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O)=CNC2=C1 DVWAIHZOPSYMSJ-ZVZYQTTQSA-N 0.000 description 5
- NJNCVQYFNKZMAH-JYBASQMISA-N Trp-Thr-Cys Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CS)C(O)=O)=CNC2=C1 NJNCVQYFNKZMAH-JYBASQMISA-N 0.000 description 5
- VFJIWSJKZJTQII-SRVKXCTJSA-N Tyr-Asp-Ser Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O VFJIWSJKZJTQII-SRVKXCTJSA-N 0.000 description 5
- LOOCQRRBKZTPKO-AVGNSLFASA-N Tyr-Glu-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 LOOCQRRBKZTPKO-AVGNSLFASA-N 0.000 description 5
- BIWVVOHTKDLRMP-ULQDDVLXSA-N Tyr-Pro-Leu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O BIWVVOHTKDLRMP-ULQDDVLXSA-N 0.000 description 5
- HHSILIQTHXABKM-YDHLFZDLSA-N Val-Asp-Phe Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](Cc1ccccc1)C(O)=O HHSILIQTHXABKM-YDHLFZDLSA-N 0.000 description 5
- YQMILNREHKTFBS-IHRRRGAJSA-N Val-Phe-Cys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CS)C(=O)O)N YQMILNREHKTFBS-IHRRRGAJSA-N 0.000 description 5
- YLRAFVVWZRSZQC-DZKIICNBSA-N Val-Phe-Glu Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N YLRAFVVWZRSZQC-DZKIICNBSA-N 0.000 description 5
- GBIUHAYJGWVNLN-AEJSXWLSSA-N Val-Ser-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N1CCC[C@@H]1C(=O)O)N GBIUHAYJGWVNLN-AEJSXWLSSA-N 0.000 description 5
- GBIUHAYJGWVNLN-UHFFFAOYSA-N Val-Ser-Pro Natural products CC(C)C(N)C(=O)NC(CO)C(=O)N1CCCC1C(O)=O GBIUHAYJGWVNLN-UHFFFAOYSA-N 0.000 description 5
- JXCOEPXCBVCTRD-JYJNAYRXSA-N Val-Tyr-Arg Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N JXCOEPXCBVCTRD-JYJNAYRXSA-N 0.000 description 5
- 108010070783 alanyltyrosine Proteins 0.000 description 5
- 210000004027 cell Anatomy 0.000 description 5
- 108010054813 diprotin B Proteins 0.000 description 5
- 239000012636 effector Substances 0.000 description 5
- 108010080575 glutamyl-aspartyl-alanine Proteins 0.000 description 5
- 108010037389 glutamyl-cysteinyl-lysine Proteins 0.000 description 5
- 108010036413 histidylglycine Proteins 0.000 description 5
- 108010060857 isoleucyl-valyl-tyrosine Proteins 0.000 description 5
- 108010034529 leucyl-lysine Proteins 0.000 description 5
- 108010017391 lysylvaline Proteins 0.000 description 5
- 108010082795 phenylalanyl-arginyl-arginine Proteins 0.000 description 5
- 230000004044 response Effects 0.000 description 5
- 108010015385 valyl-prolyl-proline Proteins 0.000 description 5
- 108010073969 valyllysine Proteins 0.000 description 5
- 210000003462 vein Anatomy 0.000 description 5
- CNKBMTKICGGSCQ-ACRUOGEOSA-N (2S)-2-[[(2S)-2-[[(2S)-2,6-diamino-1-oxohexyl]amino]-1-oxo-3-phenylpropyl]amino]-3-(4-hydroxyphenyl)propanoic acid Chemical compound C([C@H](NC(=O)[C@@H](N)CCCCN)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=CC=C1 CNKBMTKICGGSCQ-ACRUOGEOSA-N 0.000 description 4
- YYSWCHMLFJLLBJ-ZLUOBGJFSA-N Ala-Ala-Ser Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O YYSWCHMLFJLLBJ-ZLUOBGJFSA-N 0.000 description 4
- HHRAXZAYZFFRAM-CIUDSAMLSA-N Ala-Leu-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O HHRAXZAYZFFRAM-CIUDSAMLSA-N 0.000 description 4
- CCDFBRZVTDDJNM-GUBZILKMSA-N Ala-Leu-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O CCDFBRZVTDDJNM-GUBZILKMSA-N 0.000 description 4
- MMLHRUJLOUSRJX-CIUDSAMLSA-N Ala-Ser-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCCN MMLHRUJLOUSRJX-CIUDSAMLSA-N 0.000 description 4
- GNYUVVJYGJFKHN-RVMXOQNASA-N Arg-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N GNYUVVJYGJFKHN-RVMXOQNASA-N 0.000 description 4
- VIINVRPKMUZYOI-DCAQKATOSA-N Arg-Met-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCC(O)=O)C(O)=O VIINVRPKMUZYOI-DCAQKATOSA-N 0.000 description 4
- FIQKRDXFTANIEJ-ULQDDVLXSA-N Arg-Phe-His Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N FIQKRDXFTANIEJ-ULQDDVLXSA-N 0.000 description 4
- NGYHSXDNNOFHNE-AVGNSLFASA-N Arg-Pro-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O NGYHSXDNNOFHNE-AVGNSLFASA-N 0.000 description 4
- SUIJFTJDTJKSRK-IHRRRGAJSA-N Asn-Pro-Tyr Chemical compound NC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 SUIJFTJDTJKSRK-IHRRRGAJSA-N 0.000 description 4
- VILLWIDTHYPSLC-PEFMBERDSA-N Asp-Glu-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O VILLWIDTHYPSLC-PEFMBERDSA-N 0.000 description 4
- CLUMZOKVGUWUFD-CIUDSAMLSA-N Asp-Leu-Asn Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O CLUMZOKVGUWUFD-CIUDSAMLSA-N 0.000 description 4
- NONWUQAWAANERO-BZSNNMDCSA-N Asp-Phe-Tyr Chemical compound C([C@H](NC(=O)[C@H](CC(O)=O)N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=CC=C1 NONWUQAWAANERO-BZSNNMDCSA-N 0.000 description 4
- DRCOAZZDQRCGGP-GHCJXIJMSA-N Asp-Ser-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O DRCOAZZDQRCGGP-GHCJXIJMSA-N 0.000 description 4
- VNXQRBXEQXLERQ-CIUDSAMLSA-N Asp-Ser-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CC(=O)O)N VNXQRBXEQXLERQ-CIUDSAMLSA-N 0.000 description 4
- OFYVKOXTTDCUIL-FXQIFTODSA-N Asp-Ser-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CC(=O)O)N OFYVKOXTTDCUIL-FXQIFTODSA-N 0.000 description 4
- 108020004705 Codon Proteins 0.000 description 4
- VBPGTULCFGKGTF-ACZMJKKPSA-N Cys-Glu-Asp Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O VBPGTULCFGKGTF-ACZMJKKPSA-N 0.000 description 4
- ANRWXLYGJRSQEQ-CIUDSAMLSA-N Cys-His-Asp Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(O)=O)C(O)=O ANRWXLYGJRSQEQ-CIUDSAMLSA-N 0.000 description 4
- DQGIAOGALAQBGK-BWBBJGPYSA-N Cys-Ser-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CS)N)O DQGIAOGALAQBGK-BWBBJGPYSA-N 0.000 description 4
- VRJZMZGGAKVSIQ-SRVKXCTJSA-N Cys-Tyr-Ser Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(O)=O VRJZMZGGAKVSIQ-SRVKXCTJSA-N 0.000 description 4
- HYPVLWGNBIYTNA-GUBZILKMSA-N Gln-Leu-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O HYPVLWGNBIYTNA-GUBZILKMSA-N 0.000 description 4
- GZWOBWMOMPFPCD-CIUDSAMLSA-N Glu-Asp-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCC(=O)O)N GZWOBWMOMPFPCD-CIUDSAMLSA-N 0.000 description 4
- PVBBEKPHARMPHX-DCAQKATOSA-N Glu-Gln-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CCC(O)=O PVBBEKPHARMPHX-DCAQKATOSA-N 0.000 description 4
- WRNAXCVRSBBKGS-BQBZGAKWSA-N Glu-Gly-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H](CCC(N)=O)C(O)=O WRNAXCVRSBBKGS-BQBZGAKWSA-N 0.000 description 4
- LGYCLOCORAEQSZ-PEFMBERDSA-N Glu-Ile-Asp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O LGYCLOCORAEQSZ-PEFMBERDSA-N 0.000 description 4
- PYTZFYUXZZHOAD-WHFBIAKZSA-N Gly-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)CN PYTZFYUXZZHOAD-WHFBIAKZSA-N 0.000 description 4
- LCNXZQROPKFGQK-WHFBIAKZSA-N Gly-Asp-Ser Chemical compound NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O LCNXZQROPKFGQK-WHFBIAKZSA-N 0.000 description 4
- NTBOEZICHOSJEE-YUMQZZPRSA-N Gly-Lys-Ser Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O NTBOEZICHOSJEE-YUMQZZPRSA-N 0.000 description 4
- OOCFXNOVSLSHAB-IUCAKERBSA-N Gly-Pro-Pro Chemical compound NCC(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 OOCFXNOVSLSHAB-IUCAKERBSA-N 0.000 description 4
- POJJAZJHBGXEGM-YUMQZZPRSA-N Gly-Ser-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)CN POJJAZJHBGXEGM-YUMQZZPRSA-N 0.000 description 4
- YDIDLLVFCYSXNY-RCOVLWMOSA-N Gly-Val-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)CN YDIDLLVFCYSXNY-RCOVLWMOSA-N 0.000 description 4
- DCRODRAURLJOFY-XPUUQOCRSA-N His-Ala-Gly Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(=O)NCC(O)=O DCRODRAURLJOFY-XPUUQOCRSA-N 0.000 description 4
- JBJNKUOMNZGQIM-PYJNHQTQSA-N His-Arg-Ile Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JBJNKUOMNZGQIM-PYJNHQTQSA-N 0.000 description 4
- WOAMZMXCLBBQKW-KKUMJFAQSA-N His-Cys-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CC2=CN=CN2)N)O WOAMZMXCLBBQKW-KKUMJFAQSA-N 0.000 description 4
- LCNNHVQNFNJLGK-AVGNSLFASA-N His-Gln-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N LCNNHVQNFNJLGK-AVGNSLFASA-N 0.000 description 4
- ZHMZWSFQRUGLEC-JYJNAYRXSA-N His-Tyr-Glu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O ZHMZWSFQRUGLEC-JYJNAYRXSA-N 0.000 description 4
- KDDKJKKQODQQBR-NHCYSSNCSA-N His-Val-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N KDDKJKKQODQQBR-NHCYSSNCSA-N 0.000 description 4
- VZSDQFZFTCVEGF-ZEWNOJEFSA-N Ile-Phe-Tyr Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](Cc1ccccc1)C(=O)N[C@@H](Cc1ccc(O)cc1)C(O)=O VZSDQFZFTCVEGF-ZEWNOJEFSA-N 0.000 description 4
- QHUREMVLLMNUAX-OSUNSFLBSA-N Ile-Thr-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)O)N QHUREMVLLMNUAX-OSUNSFLBSA-N 0.000 description 4
- CUXRXAIAVYLVFD-ULQDDVLXSA-N Leu-Arg-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 CUXRXAIAVYLVFD-ULQDDVLXSA-N 0.000 description 4
- POJPZSMTTMLSTG-SRVKXCTJSA-N Leu-Asn-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N POJPZSMTTMLSTG-SRVKXCTJSA-N 0.000 description 4
- WGNOPSQMIQERPK-UHFFFAOYSA-N Leu-Asn-Pro Natural products CC(C)CC(N)C(=O)NC(CC(=O)N)C(=O)N1CCCC1C(=O)O WGNOPSQMIQERPK-UHFFFAOYSA-N 0.000 description 4
- DLCOFDAHNMMQPP-SRVKXCTJSA-N Leu-Asp-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O DLCOFDAHNMMQPP-SRVKXCTJSA-N 0.000 description 4
- VFQOCUQGMUXTJR-DCAQKATOSA-N Leu-Cys-Met Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CCSC)C(=O)O)N VFQOCUQGMUXTJR-DCAQKATOSA-N 0.000 description 4
- WQWSMEOYXJTFRU-GUBZILKMSA-N Leu-Glu-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O WQWSMEOYXJTFRU-GUBZILKMSA-N 0.000 description 4
- RXGLHDWAZQECBI-SRVKXCTJSA-N Leu-Leu-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O RXGLHDWAZQECBI-SRVKXCTJSA-N 0.000 description 4
- ZJZNLRVCZWUONM-JXUBOQSCSA-N Leu-Thr-Ala Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O ZJZNLRVCZWUONM-JXUBOQSCSA-N 0.000 description 4
- CGHXMODRYJISSK-NHCYSSNCSA-N Leu-Val-Asp Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC(O)=O CGHXMODRYJISSK-NHCYSSNCSA-N 0.000 description 4
- VHFFQUSNFFIZBT-CIUDSAMLSA-N Lys-Ala-Cys Chemical compound C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCCCN)N VHFFQUSNFFIZBT-CIUDSAMLSA-N 0.000 description 4
- DCRWPTBMWMGADO-AVGNSLFASA-N Lys-Glu-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O DCRWPTBMWMGADO-AVGNSLFASA-N 0.000 description 4
- FGMHXLULNHTPID-KKUMJFAQSA-N Lys-His-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CCCCN)C(O)=O)CC1=CN=CN1 FGMHXLULNHTPID-KKUMJFAQSA-N 0.000 description 4
- AIRZWUMAHCDDHR-KKUMJFAQSA-N Lys-Leu-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O AIRZWUMAHCDDHR-KKUMJFAQSA-N 0.000 description 4
- ZJWIXBZTAAJERF-IHRRRGAJSA-N Lys-Lys-Arg Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CCCN=C(N)N ZJWIXBZTAAJERF-IHRRRGAJSA-N 0.000 description 4
- JQSIGLHQNSZZRL-KKUMJFAQSA-N Lys-Lys-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CCCCN)N JQSIGLHQNSZZRL-KKUMJFAQSA-N 0.000 description 4
- CTJUSALVKAWFFU-CIUDSAMLSA-N Lys-Ser-Cys Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)O)N CTJUSALVKAWFFU-CIUDSAMLSA-N 0.000 description 4
- 241001183012 Modified Vaccinia Ankara virus Species 0.000 description 4
- WUGMRIBZSVSJNP-UHFFFAOYSA-N N-L-alanyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)C)C(O)=O)=CNC2=C1 WUGMRIBZSVSJNP-UHFFFAOYSA-N 0.000 description 4
- 108700001237 Nucleic Acid-Based Vaccines Proteins 0.000 description 4
- ZENDEDYRYVHBEG-SRVKXCTJSA-N Phe-Asp-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 ZENDEDYRYVHBEG-SRVKXCTJSA-N 0.000 description 4
- JKJSIYKSGIDHPM-WBAXXEDZSA-N Phe-Phe-Ala Chemical compound C[C@H](NC(=O)[C@H](Cc1ccccc1)NC(=O)[C@@H](N)Cc1ccccc1)C(O)=O JKJSIYKSGIDHPM-WBAXXEDZSA-N 0.000 description 4
- DBALDZKOTNSBFM-FXQIFTODSA-N Pro-Ala-Asn Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(O)=O DBALDZKOTNSBFM-FXQIFTODSA-N 0.000 description 4
- UEHYFUCOGHWASA-HJGDQZAQSA-N Pro-Glu-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H]1CCCN1 UEHYFUCOGHWASA-HJGDQZAQSA-N 0.000 description 4
- PEYNRYREGPAOAK-LSJOCFKGSA-N Pro-His-Ala Chemical compound C([C@@H](C(=O)N[C@@H](C)C([O-])=O)NC(=O)[C@H]1[NH2+]CCC1)C1=CN=CN1 PEYNRYREGPAOAK-LSJOCFKGSA-N 0.000 description 4
- NFLNBHLMLYALOO-DCAQKATOSA-N Pro-Leu-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@@H]1CCCN1 NFLNBHLMLYALOO-DCAQKATOSA-N 0.000 description 4
- XYSXOCIWCPFOCG-IHRRRGAJSA-N Pro-Leu-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O XYSXOCIWCPFOCG-IHRRRGAJSA-N 0.000 description 4
- ULWBBFKQBDNGOY-RWMBFGLXSA-N Pro-Lys-Pro Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CCCCN)C(=O)N2CCC[C@@H]2C(=O)O ULWBBFKQBDNGOY-RWMBFGLXSA-N 0.000 description 4
- KYKKKSWGEPFUMR-NAKRPEOUSA-N Ser-Arg-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KYKKKSWGEPFUMR-NAKRPEOUSA-N 0.000 description 4
- OHKFXGKHSJKKAL-NRPADANISA-N Ser-Glu-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O OHKFXGKHSJKKAL-NRPADANISA-N 0.000 description 4
- PPNPDKGQRFSCAC-CIUDSAMLSA-N Ser-Lys-Asp Chemical compound NCCCC[C@H](NC(=O)[C@@H](N)CO)C(=O)N[C@@H](CC(O)=O)C(O)=O PPNPDKGQRFSCAC-CIUDSAMLSA-N 0.000 description 4
- LRWBCWGEUCKDTN-BJDJZHNGSA-N Ser-Lys-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O LRWBCWGEUCKDTN-BJDJZHNGSA-N 0.000 description 4
- ZVBCMFDJIMUELU-BZSNNMDCSA-N Ser-Tyr-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)NC(=O)[C@H](CO)N ZVBCMFDJIMUELU-BZSNNMDCSA-N 0.000 description 4
- 230000005867 T cell response Effects 0.000 description 4
- BBPCSGKKPJUYRB-UVOCVTCTSA-N Thr-Thr-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O BBPCSGKKPJUYRB-UVOCVTCTSA-N 0.000 description 4
- ZESGVALRVJIVLZ-VFCFLDTKSA-N Thr-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@@H]1C(=O)O)N)O ZESGVALRVJIVLZ-VFCFLDTKSA-N 0.000 description 4
- OJCSQAWRJKPKFM-TUSQITKMSA-N Trp-His-Trp Chemical compound N[C@@H](Cc1c[nH]c2ccccc12)C(=O)N[C@@H](Cc1cnc[nH]1)C(=O)N[C@@H](Cc1c[nH]c2ccccc12)C(O)=O OJCSQAWRJKPKFM-TUSQITKMSA-N 0.000 description 4
- WAPFQMXRSDEGOE-IHRRRGAJSA-N Tyr-Glu-Gln Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O WAPFQMXRSDEGOE-IHRRRGAJSA-N 0.000 description 4
- QAYSODICXVZUIA-WLTAIBSBSA-N Tyr-Gly-Thr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O QAYSODICXVZUIA-WLTAIBSBSA-N 0.000 description 4
- LVILBTSHPTWDGE-PMVMPFDFSA-N Tyr-Trp-Lys Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(=O)N[C@@H](CCCCN)C(O)=O)C1=CC=C(O)C=C1 LVILBTSHPTWDGE-PMVMPFDFSA-N 0.000 description 4
- BZWUSZGQOILYEU-STECZYCISA-N Val-Ile-Tyr Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 BZWUSZGQOILYEU-STECZYCISA-N 0.000 description 4
- XBJKAZATRJBDCU-GUBZILKMSA-N Val-Pro-Ala Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O XBJKAZATRJBDCU-GUBZILKMSA-N 0.000 description 4
- 108010044940 alanylglutamine Proteins 0.000 description 4
- 125000000539 amino acid group Chemical group 0.000 description 4
- 238000013459 approach Methods 0.000 description 4
- 108010040443 aspartyl-aspartic acid Proteins 0.000 description 4
- 108010068265 aspartyltyrosine Proteins 0.000 description 4
- 230000000740 bleeding effect Effects 0.000 description 4
- 238000010367 cloning Methods 0.000 description 4
- 230000004927 fusion Effects 0.000 description 4
- 108010063718 gamma-glutamylaspartic acid Proteins 0.000 description 4
- 230000028993 immune response Effects 0.000 description 4
- 208000015181 infectious disease Diseases 0.000 description 4
- 230000003902 lesion Effects 0.000 description 4
- 210000004698 lymphocyte Anatomy 0.000 description 4
- 108010009298 lysylglutamic acid Proteins 0.000 description 4
- 108010054155 lysyllysine Proteins 0.000 description 4
- 229940023146 nucleic acid vaccine Drugs 0.000 description 4
- 239000002773 nucleotide Substances 0.000 description 4
- 125000003729 nucleotide group Chemical group 0.000 description 4
- 108010048397 seryl-lysyl-leucine Proteins 0.000 description 4
- 208000024891 symptom Diseases 0.000 description 4
- 108010051110 tyrosyl-lysine Proteins 0.000 description 4
- 108010020532 tyrosyl-proline Proteins 0.000 description 4
- 108010003137 tyrosyltyrosine Proteins 0.000 description 4
- MZOFCQQQCNRIBI-VMXHOPILSA-N (3s)-4-[[(2s)-1-[[(2s)-1-[[(1s)-1-carboxy-2-hydroxyethyl]amino]-4-methyl-1-oxopentan-2-yl]amino]-5-(diaminomethylideneamino)-1-oxopentan-2-yl]amino]-3-[[2-[[(2s)-2,6-diaminohexanoyl]amino]acetyl]amino]-4-oxobutanoic acid Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC(O)=O)NC(=O)CNC(=O)[C@@H](N)CCCCN MZOFCQQQCNRIBI-VMXHOPILSA-N 0.000 description 3
- BUANFPRKJKJSRR-ACZMJKKPSA-N Ala-Ala-Gln Chemical compound C[C@H]([NH3+])C(=O)N[C@@H](C)C(=O)N[C@H](C([O-])=O)CCC(N)=O BUANFPRKJKJSRR-ACZMJKKPSA-N 0.000 description 3
- SKHCUBQVZJHOFM-NAKRPEOUSA-N Ala-Arg-Ile Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O SKHCUBQVZJHOFM-NAKRPEOUSA-N 0.000 description 3
- GORKKVHIBWAQHM-GCJQMDKQSA-N Ala-Asn-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GORKKVHIBWAQHM-GCJQMDKQSA-N 0.000 description 3
- LZRNYBIJOSKKRJ-XVYDVKMFSA-N Ala-Asp-His Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N LZRNYBIJOSKKRJ-XVYDVKMFSA-N 0.000 description 3
- IKKVASZHTMKJIR-ZKWXMUAHSA-N Ala-Asp-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O IKKVASZHTMKJIR-ZKWXMUAHSA-N 0.000 description 3
- HMRWQTHUDVXMGH-GUBZILKMSA-N Ala-Glu-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN HMRWQTHUDVXMGH-GUBZILKMSA-N 0.000 description 3
- CKLDHDOIYBVUNP-KBIXCLLPSA-N Ala-Ile-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(O)=O CKLDHDOIYBVUNP-KBIXCLLPSA-N 0.000 description 3
- MAZZQZWCCYJQGZ-GUBZILKMSA-N Ala-Pro-Arg Chemical compound [H]N[C@@H](C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(O)=O MAZZQZWCCYJQGZ-GUBZILKMSA-N 0.000 description 3
- DYJJJCHDHLEFDW-FXQIFTODSA-N Ala-Pro-Cys Chemical compound C[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CS)C(=O)O)N DYJJJCHDHLEFDW-FXQIFTODSA-N 0.000 description 3
- YCTIYBUTCKNOTI-UWJYBYFXSA-N Ala-Tyr-Asp Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N YCTIYBUTCKNOTI-UWJYBYFXSA-N 0.000 description 3
- BOKLLPVAQDSLHC-FXQIFTODSA-N Ala-Val-Cys Chemical compound C[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CS)C(=O)O)N BOKLLPVAQDSLHC-FXQIFTODSA-N 0.000 description 3
- KMSHNDWHPWXPEC-BQBZGAKWSA-N Arg-Asp-Gly Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O KMSHNDWHPWXPEC-BQBZGAKWSA-N 0.000 description 3
- NKBQZKVMKJJDLX-SRVKXCTJSA-N Arg-Glu-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O NKBQZKVMKJJDLX-SRVKXCTJSA-N 0.000 description 3
- LKDHUGLXOHYINY-XUXIUFHCSA-N Arg-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N LKDHUGLXOHYINY-XUXIUFHCSA-N 0.000 description 3
- HJDNZFIYILEIKR-OSUNSFLBSA-N Arg-Ile-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O HJDNZFIYILEIKR-OSUNSFLBSA-N 0.000 description 3
- YCYXHLZRUSJITQ-SRVKXCTJSA-N Arg-Pro-Pro Chemical compound NC(=N)NCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 YCYXHLZRUSJITQ-SRVKXCTJSA-N 0.000 description 3
- ISJWBVIYRBAXEB-CIUDSAMLSA-N Arg-Ser-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(O)=O ISJWBVIYRBAXEB-CIUDSAMLSA-N 0.000 description 3
- QHUOOCKNNURZSL-IHRRRGAJSA-N Arg-Tyr-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(O)=O QHUOOCKNNURZSL-IHRRRGAJSA-N 0.000 description 3
- XIDSGDJNUJRUHE-VEVYYDQMSA-N Asn-Thr-Met Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCSC)C(O)=O XIDSGDJNUJRUHE-VEVYYDQMSA-N 0.000 description 3
- AMGQTNHANMRPOE-LKXGYXEUSA-N Asn-Thr-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O AMGQTNHANMRPOE-LKXGYXEUSA-N 0.000 description 3
- GISFCCXBVJKGEO-QEJZJMRPSA-N Asp-Glu-Trp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O GISFCCXBVJKGEO-QEJZJMRPSA-N 0.000 description 3
- WBDWQKRLTVCDSY-WHFBIAKZSA-N Asp-Gly-Asp Chemical compound OC(=O)C[C@H](N)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O WBDWQKRLTVCDSY-WHFBIAKZSA-N 0.000 description 3
- RQHLMGCXCZUOGT-ZPFDUUQYSA-N Asp-Leu-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O RQHLMGCXCZUOGT-ZPFDUUQYSA-N 0.000 description 3
- LIVXPXUVXFRWNY-CIUDSAMLSA-N Asp-Lys-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O LIVXPXUVXFRWNY-CIUDSAMLSA-N 0.000 description 3
- YVHGKXAOSVBGJV-CIUDSAMLSA-N Asp-Lys-Cys Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC(=O)O)N YVHGKXAOSVBGJV-CIUDSAMLSA-N 0.000 description 3
- PCJOFZYFFMBZKC-PCBIJLKTSA-N Asp-Phe-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O PCJOFZYFFMBZKC-PCBIJLKTSA-N 0.000 description 3
- CUQDCPXNZPDYFQ-ZLUOBGJFSA-N Asp-Ser-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O CUQDCPXNZPDYFQ-ZLUOBGJFSA-N 0.000 description 3
- BPAUXFVCSYQDQX-JRQIVUDYSA-N Asp-Tyr-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CC(=O)O)N)O BPAUXFVCSYQDQX-JRQIVUDYSA-N 0.000 description 3
- -1 CD107 Proteins 0.000 description 3
- 206010008263 Cervical dysplasia Diseases 0.000 description 3
- NIPJKKSXHSBEMX-CIUDSAMLSA-N Cys-Asp-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CS)N NIPJKKSXHSBEMX-CIUDSAMLSA-N 0.000 description 3
- XABFFGOGKOORCG-CIUDSAMLSA-N Cys-Asp-Leu Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O XABFFGOGKOORCG-CIUDSAMLSA-N 0.000 description 3
- LDIKUWLAMDFHPU-FXQIFTODSA-N Cys-Cys-Arg Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O LDIKUWLAMDFHPU-FXQIFTODSA-N 0.000 description 3
- BVFQOPGFOQVZTE-ACZMJKKPSA-N Cys-Gln-Ala Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O BVFQOPGFOQVZTE-ACZMJKKPSA-N 0.000 description 3
- ZVNFONSZVUBRAV-CIUDSAMLSA-N Cys-Gln-Arg Chemical compound C(C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CS)N)CN=C(N)N ZVNFONSZVUBRAV-CIUDSAMLSA-N 0.000 description 3
- ZEXHDOQQYZKOIB-ACZMJKKPSA-N Cys-Glu-Ser Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O ZEXHDOQQYZKOIB-ACZMJKKPSA-N 0.000 description 3
- GUKYYUFHWYRMEU-WHFBIAKZSA-N Cys-Gly-Asp Chemical compound [H]N[C@@H](CS)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O GUKYYUFHWYRMEU-WHFBIAKZSA-N 0.000 description 3
- LKUCSUGWHYVYLP-GHCJXIJMSA-N Cys-Ile-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CS)N LKUCSUGWHYVYLP-GHCJXIJMSA-N 0.000 description 3
- MTNJRNQDDSWQQA-GQGQLFGLSA-N Cys-Ile-Trp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CS)N MTNJRNQDDSWQQA-GQGQLFGLSA-N 0.000 description 3
- CIVXDCMSSFGWAL-YUMQZZPRSA-N Cys-Lys-Gly Chemical compound C(CCN)C[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CS)N CIVXDCMSSFGWAL-YUMQZZPRSA-N 0.000 description 3
- TXCCRYAZQBUCOV-CIUDSAMLSA-N Cys-Pro-Gln Chemical compound [H]N[C@@H](CS)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O TXCCRYAZQBUCOV-CIUDSAMLSA-N 0.000 description 3
- XBELMDARIGXDKY-GUBZILKMSA-N Cys-Pro-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CS)N XBELMDARIGXDKY-GUBZILKMSA-N 0.000 description 3
- OEDPLIBVQGRKGZ-AVGNSLFASA-N Cys-Tyr-Glu Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O OEDPLIBVQGRKGZ-AVGNSLFASA-N 0.000 description 3
- UGPCUUWZXRMCIJ-KKUMJFAQSA-N Cys-Tyr-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CS)N UGPCUUWZXRMCIJ-KKUMJFAQSA-N 0.000 description 3
- VIOQRFNAZDMVLO-NRPADANISA-N Cys-Val-Glu Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O VIOQRFNAZDMVLO-NRPADANISA-N 0.000 description 3
- XOKGKOQWADCLFQ-GARJFASQSA-N Gln-Arg-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CCC(=O)N)N)C(=O)O XOKGKOQWADCLFQ-GARJFASQSA-N 0.000 description 3
- NPTGGVQJYRSMCM-GLLZPBPUSA-N Gln-Gln-Thr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O NPTGGVQJYRSMCM-GLLZPBPUSA-N 0.000 description 3
- SBHVGKBYOQKAEA-SDDRHHMPSA-N Gln-His-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CN=CN2)NC(=O)[C@H](CCC(=O)N)N)C(=O)O SBHVGKBYOQKAEA-SDDRHHMPSA-N 0.000 description 3
- PBYFVIQRFLNQCO-GUBZILKMSA-N Gln-Pro-Gln Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O PBYFVIQRFLNQCO-GUBZILKMSA-N 0.000 description 3
- ARYKRXHBIPLULY-XKBZYTNZSA-N Gln-Thr-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O ARYKRXHBIPLULY-XKBZYTNZSA-N 0.000 description 3
- STHSGOZLFLFGSS-SUSMZKCASA-N Gln-Thr-Thr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O STHSGOZLFLFGSS-SUSMZKCASA-N 0.000 description 3
- WPJDPEOQUIXXOY-AVGNSLFASA-N Gln-Tyr-Asn Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCC(=O)N)N)O WPJDPEOQUIXXOY-AVGNSLFASA-N 0.000 description 3
- JRCUFCXYZLPSDZ-ACZMJKKPSA-N Glu-Asp-Ser Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O JRCUFCXYZLPSDZ-ACZMJKKPSA-N 0.000 description 3
- SBCYJMOOHUDWDA-NUMRIWBASA-N Glu-Asp-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SBCYJMOOHUDWDA-NUMRIWBASA-N 0.000 description 3
- ZMVCLTGPGWJAEE-JYJNAYRXSA-N Glu-His-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CC2=CN=CN2)NC(=O)[C@H](CCC(=O)O)N)O ZMVCLTGPGWJAEE-JYJNAYRXSA-N 0.000 description 3
- VSRCAOIHMGCIJK-SRVKXCTJSA-N Glu-Leu-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O VSRCAOIHMGCIJK-SRVKXCTJSA-N 0.000 description 3
- HRBYTAIBKPNZKQ-AVGNSLFASA-N Glu-Lys-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCC(O)=O HRBYTAIBKPNZKQ-AVGNSLFASA-N 0.000 description 3
- MWTGQXBHVRTCOR-GLLZPBPUSA-N Glu-Thr-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(O)=O MWTGQXBHVRTCOR-GLLZPBPUSA-N 0.000 description 3
- CAQXJMUDOLSBPF-SUSMZKCASA-N Glu-Thr-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O CAQXJMUDOLSBPF-SUSMZKCASA-N 0.000 description 3
- QEJKKJNDDDPSMU-KKUMJFAQSA-N Glu-Tyr-Met Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCSC)C(O)=O QEJKKJNDDDPSMU-KKUMJFAQSA-N 0.000 description 3
- ZALGPUWUVHOGAE-GVXVVHGQSA-N Glu-Val-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CCC(=O)O)N ZALGPUWUVHOGAE-GVXVVHGQSA-N 0.000 description 3
- QRWPTXLWHHTOCO-DZKIICNBSA-N Glu-Val-Tyr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O QRWPTXLWHHTOCO-DZKIICNBSA-N 0.000 description 3
- LJPIRKICOISLKN-WHFBIAKZSA-N Gly-Ala-Ser Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O LJPIRKICOISLKN-WHFBIAKZSA-N 0.000 description 3
- DTRUBYPMMVPQPD-YUMQZZPRSA-N Gly-Gln-Arg Chemical compound [H]NCC(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O DTRUBYPMMVPQPD-YUMQZZPRSA-N 0.000 description 3
- QPTNELDXWKRIFX-YFKPBYRVSA-N Gly-Gly-Gln Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CCC(N)=O QPTNELDXWKRIFX-YFKPBYRVSA-N 0.000 description 3
- XPJBQTCXPJNIFE-ZETCQYMHSA-N Gly-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)CN XPJBQTCXPJNIFE-ZETCQYMHSA-N 0.000 description 3
- HAXARWKYFIIHKD-ZKWXMUAHSA-N Gly-Ile-Ser Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O HAXARWKYFIIHKD-ZKWXMUAHSA-N 0.000 description 3
- SCWYHUQOOFRVHP-MBLNEYKQSA-N Gly-Ile-Thr Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SCWYHUQOOFRVHP-MBLNEYKQSA-N 0.000 description 3
- YTSVAIMKVLZUDU-YUMQZZPRSA-N Gly-Leu-Asp Chemical compound [H]NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O YTSVAIMKVLZUDU-YUMQZZPRSA-N 0.000 description 3
- OHUKZZYSJBKFRR-WHFBIAKZSA-N Gly-Ser-Asp Chemical compound [H]NCC(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O OHUKZZYSJBKFRR-WHFBIAKZSA-N 0.000 description 3
- RVKIPWVMZANZLI-UHFFFAOYSA-N H-Lys-Trp-OH Natural products C1=CC=C2C(CC(NC(=O)C(N)CCCCN)C(O)=O)=CNC2=C1 RVKIPWVMZANZLI-UHFFFAOYSA-N 0.000 description 3
- TVTIDSMADMIHEU-KKUMJFAQSA-N His-Cys-Phe Chemical compound N[C@@H](Cc1cnc[nH]1)C(=O)N[C@@H](CS)C(=O)N[C@@H](Cc1ccccc1)C(O)=O TVTIDSMADMIHEU-KKUMJFAQSA-N 0.000 description 3
- NQKRILCJYCASDV-QWRGUYRKSA-N His-Gly-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC1=CN=CN1 NQKRILCJYCASDV-QWRGUYRKSA-N 0.000 description 3
- VTZYMXGGXOFBMX-DJFWLOJKSA-N His-Ile-Asp Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O VTZYMXGGXOFBMX-DJFWLOJKSA-N 0.000 description 3
- LBQAHBIVXQSBIR-HVTMNAMFSA-N His-Ile-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N LBQAHBIVXQSBIR-HVTMNAMFSA-N 0.000 description 3
- GNBHSMFBUNEWCJ-DCAQKATOSA-N His-Pro-Asn Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O GNBHSMFBUNEWCJ-DCAQKATOSA-N 0.000 description 3
- YERBCFWVWITTEJ-NAZCDGGXSA-N His-Trp-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@H](CC3=CN=CN3)N)O YERBCFWVWITTEJ-NAZCDGGXSA-N 0.000 description 3
- JATYGDHMDRAISQ-KKUMJFAQSA-N His-Tyr-Ser Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(O)=O JATYGDHMDRAISQ-KKUMJFAQSA-N 0.000 description 3
- WECYRWOMWSCWNX-XUXIUFHCSA-N Ile-Arg-Leu Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC(C)C)C(O)=O WECYRWOMWSCWNX-XUXIUFHCSA-N 0.000 description 3
- GYAFMRQGWHXMII-IUKAMOBKSA-N Ile-Asp-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N GYAFMRQGWHXMII-IUKAMOBKSA-N 0.000 description 3
- FADXGVVLSPPEQY-GHCJXIJMSA-N Ile-Cys-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(=O)N)C(=O)O)N FADXGVVLSPPEQY-GHCJXIJMSA-N 0.000 description 3
- LPXHYGGZJOCAFR-MNXVOIDGSA-N Ile-Glu-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC(C)C)C(=O)O)N LPXHYGGZJOCAFR-MNXVOIDGSA-N 0.000 description 3
- NZGTYCMLUGYMCV-XUXIUFHCSA-N Ile-Lys-Arg Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N NZGTYCMLUGYMCV-XUXIUFHCSA-N 0.000 description 3
- CAHCWMVNBZJVAW-NAKRPEOUSA-N Ile-Pro-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)O)N CAHCWMVNBZJVAW-NAKRPEOUSA-N 0.000 description 3
- REXAUQBGSGDEJY-IGISWZIWSA-N Ile-Tyr-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N REXAUQBGSGDEJY-IGISWZIWSA-N 0.000 description 3
- NUEHSWNAFIEBCQ-NAKRPEOUSA-N Ile-Val-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CS)C(=O)O)N NUEHSWNAFIEBCQ-NAKRPEOUSA-N 0.000 description 3
- APQYGMBHIVXFML-OSUNSFLBSA-N Ile-Val-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N APQYGMBHIVXFML-OSUNSFLBSA-N 0.000 description 3
- QSXSHZIRKTUXNG-STECZYCISA-N Ile-Val-Tyr Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 QSXSHZIRKTUXNG-STECZYCISA-N 0.000 description 3
- 102100037850 Interferon gamma Human genes 0.000 description 3
- 108010074328 Interferon-gamma Proteins 0.000 description 3
- OXKYZSRZKBTVEY-ZPFDUUQYSA-N Leu-Asn-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O OXKYZSRZKBTVEY-ZPFDUUQYSA-N 0.000 description 3
- OGCQGUIWMSBHRZ-CIUDSAMLSA-N Leu-Asn-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O OGCQGUIWMSBHRZ-CIUDSAMLSA-N 0.000 description 3
- ULXYQAJWJGLCNR-YUMQZZPRSA-N Leu-Asp-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O ULXYQAJWJGLCNR-YUMQZZPRSA-N 0.000 description 3
- HUEBCHPSXSQUGN-GARJFASQSA-N Leu-Cys-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)N1CCC[C@@H]1C(=O)O)N HUEBCHPSXSQUGN-GARJFASQSA-N 0.000 description 3
- ZTLGVASZOIKNIX-DCAQKATOSA-N Leu-Gln-Glu Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N ZTLGVASZOIKNIX-DCAQKATOSA-N 0.000 description 3
- HQUXQAMSWFIRET-AVGNSLFASA-N Leu-Glu-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN HQUXQAMSWFIRET-AVGNSLFASA-N 0.000 description 3
- CSFVADKICPDRRF-KKUMJFAQSA-N Leu-His-Leu Chemical compound CC(C)C[C@H]([NH3+])C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C([O-])=O)CC1=CN=CN1 CSFVADKICPDRRF-KKUMJFAQSA-N 0.000 description 3
- OYQUOLRTJHWVSQ-SRVKXCTJSA-N Leu-His-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(O)=O OYQUOLRTJHWVSQ-SRVKXCTJSA-N 0.000 description 3
- HGFGEMSVBMCFKK-MNXVOIDGSA-N Leu-Ile-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(O)=O HGFGEMSVBMCFKK-MNXVOIDGSA-N 0.000 description 3
- LXKNSJLSGPNHSK-KKUMJFAQSA-N Leu-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)O)N LXKNSJLSGPNHSK-KKUMJFAQSA-N 0.000 description 3
- XVZCXCTYGHPNEM-UHFFFAOYSA-N Leu-Leu-Pro Natural products CC(C)CC(N)C(=O)NC(CC(C)C)C(=O)N1CCCC1C(O)=O XVZCXCTYGHPNEM-UHFFFAOYSA-N 0.000 description 3
- BGZCJDGBBUUBHA-KKUMJFAQSA-N Leu-Lys-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O BGZCJDGBBUUBHA-KKUMJFAQSA-N 0.000 description 3
- WMIOEVKKYIMVKI-DCAQKATOSA-N Leu-Pro-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O WMIOEVKKYIMVKI-DCAQKATOSA-N 0.000 description 3
- XOWMDXHFSBCAKQ-SRVKXCTJSA-N Leu-Ser-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC(C)C XOWMDXHFSBCAKQ-SRVKXCTJSA-N 0.000 description 3
- BRTVHXHCUSXYRI-CIUDSAMLSA-N Leu-Ser-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O BRTVHXHCUSXYRI-CIUDSAMLSA-N 0.000 description 3
- AEDWWMMHUGYIFD-HJGDQZAQSA-N Leu-Thr-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(O)=O AEDWWMMHUGYIFD-HJGDQZAQSA-N 0.000 description 3
- HGLKOTPFWOMPOB-MEYUZBJRSA-N Leu-Thr-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 HGLKOTPFWOMPOB-MEYUZBJRSA-N 0.000 description 3
- SXOFUVGLPHCPRQ-KKUMJFAQSA-N Leu-Tyr-Cys Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CS)C(O)=O SXOFUVGLPHCPRQ-KKUMJFAQSA-N 0.000 description 3
- DGAAQRAUOFHBFJ-CIUDSAMLSA-N Lys-Asn-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(O)=O DGAAQRAUOFHBFJ-CIUDSAMLSA-N 0.000 description 3
- SSYOBDBNBQBSQE-SRVKXCTJSA-N Lys-Cys-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(O)=O SSYOBDBNBQBSQE-SRVKXCTJSA-N 0.000 description 3
- XNKDCYABMBBEKN-IUCAKERBSA-N Lys-Gly-Gln Chemical compound NCCCC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(N)=O XNKDCYABMBBEKN-IUCAKERBSA-N 0.000 description 3
- CANPXOLVTMKURR-WEDXCCLWSA-N Lys-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CCCCN CANPXOLVTMKURR-WEDXCCLWSA-N 0.000 description 3
- WOEDRPCHKPSFDT-MXAVVETBSA-N Lys-His-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CCCCN)N WOEDRPCHKPSFDT-MXAVVETBSA-N 0.000 description 3
- ZXFRGTAIIZHNHG-AJNGGQMLSA-N Lys-Ile-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)O)NC(=O)[C@H](CCCCN)N ZXFRGTAIIZHNHG-AJNGGQMLSA-N 0.000 description 3
- OIQSIMFSVLLWBX-VOAKCMCISA-N Lys-Leu-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OIQSIMFSVLLWBX-VOAKCMCISA-N 0.000 description 3
- XOQMURBBIXRRCR-SRVKXCTJSA-N Lys-Lys-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCCCN XOQMURBBIXRRCR-SRVKXCTJSA-N 0.000 description 3
- YUAXTFMFMOIMAM-QWRGUYRKSA-N Lys-Lys-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O YUAXTFMFMOIMAM-QWRGUYRKSA-N 0.000 description 3
- HYSVGEAWTGPMOA-IHRRRGAJSA-N Lys-Pro-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O HYSVGEAWTGPMOA-IHRRRGAJSA-N 0.000 description 3
- PNDCUTDWYVKBHX-IHRRRGAJSA-N Met-Asp-Tyr Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 PNDCUTDWYVKBHX-IHRRRGAJSA-N 0.000 description 3
- HLZORBMOISUNIV-DCAQKATOSA-N Met-Ser-Leu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC(C)C HLZORBMOISUNIV-DCAQKATOSA-N 0.000 description 3
- 108010067902 Peptide Library Proteins 0.000 description 3
- GYEPCBNTTRORKW-PCBIJLKTSA-N Phe-Ile-Asp Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O GYEPCBNTTRORKW-PCBIJLKTSA-N 0.000 description 3
- RORUIHAWOLADSH-HJWJTTGWSA-N Phe-Ile-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CC1=CC=CC=C1 RORUIHAWOLADSH-HJWJTTGWSA-N 0.000 description 3
- CYQQWUPHIZVCNY-GUBZILKMSA-N Pro-Arg-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O CYQQWUPHIZVCNY-GUBZILKMSA-N 0.000 description 3
- UPJGUQPLYWTISV-GUBZILKMSA-N Pro-Gln-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O UPJGUQPLYWTISV-GUBZILKMSA-N 0.000 description 3
- LCUOTSLIVGSGAU-AVGNSLFASA-N Pro-His-Arg Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O LCUOTSLIVGSGAU-AVGNSLFASA-N 0.000 description 3
- FMLRRBDLBJLJIK-DCAQKATOSA-N Pro-Leu-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H]1CCCN1 FMLRRBDLBJLJIK-DCAQKATOSA-N 0.000 description 3
- CLJLVCYFABNTHP-DCAQKATOSA-N Pro-Leu-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O CLJLVCYFABNTHP-DCAQKATOSA-N 0.000 description 3
- GFHOSBYCLACKEK-GUBZILKMSA-N Pro-Pro-Asn Chemical compound [H]N1CCC[C@H]1C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O GFHOSBYCLACKEK-GUBZILKMSA-N 0.000 description 3
- FDMCIBSQRKFSTJ-RHYQMDGZSA-N Pro-Thr-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O FDMCIBSQRKFSTJ-RHYQMDGZSA-N 0.000 description 3
- 101710132595 Protein E7 Proteins 0.000 description 3
- SFTZWNJFZYOLBD-ZDLURKLDSA-N Ser-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CO SFTZWNJFZYOLBD-ZDLURKLDSA-N 0.000 description 3
- MOINZPRHJGTCHZ-MMWGEVLESA-N Ser-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CO)N MOINZPRHJGTCHZ-MMWGEVLESA-N 0.000 description 3
- ZIFYDQAFEMIZII-GUBZILKMSA-N Ser-Leu-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O ZIFYDQAFEMIZII-GUBZILKMSA-N 0.000 description 3
- IXZHZUGGKLRHJD-DCAQKATOSA-N Ser-Leu-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O IXZHZUGGKLRHJD-DCAQKATOSA-N 0.000 description 3
- BVLGVLWFIZFEAH-BPUTZDHNSA-N Ser-Pro-Trp Chemical compound [H]N[C@@H](CO)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O BVLGVLWFIZFEAH-BPUTZDHNSA-N 0.000 description 3
- XGQKSRGHEZNWIS-IHRRRGAJSA-N Ser-Pro-Tyr Chemical compound N[C@@H](CO)C(=O)N1CCC[C@H]1C(=O)N[C@@H](Cc1ccc(O)cc1)C(O)=O XGQKSRGHEZNWIS-IHRRRGAJSA-N 0.000 description 3
- XDARBNMYXKUFOJ-GSSVUCPTSA-N Thr-Asp-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XDARBNMYXKUFOJ-GSSVUCPTSA-N 0.000 description 3
- UCCNDUPVIFOOQX-CUJWVEQBSA-N Thr-Cys-His Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 UCCNDUPVIFOOQX-CUJWVEQBSA-N 0.000 description 3
- UDQBCBUXAQIZAK-GLLZPBPUSA-N Thr-Glu-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O UDQBCBUXAQIZAK-GLLZPBPUSA-N 0.000 description 3
- QQWNRERCGGZOKG-WEDXCCLWSA-N Thr-Gly-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O QQWNRERCGGZOKG-WEDXCCLWSA-N 0.000 description 3
- XIULAFZYEKSGAJ-IXOXFDKPSA-N Thr-Leu-His Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CNC=N1 XIULAFZYEKSGAJ-IXOXFDKPSA-N 0.000 description 3
- LHNNQVXITHUCAB-QTKMDUPCSA-N Thr-Met-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N)O LHNNQVXITHUCAB-QTKMDUPCSA-N 0.000 description 3
- XKWABWFMQXMUMT-HJGDQZAQSA-N Thr-Pro-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O XKWABWFMQXMUMT-HJGDQZAQSA-N 0.000 description 3
- QYDKSNXSBXZPFK-ZJDVBMNYSA-N Thr-Thr-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O QYDKSNXSBXZPFK-ZJDVBMNYSA-N 0.000 description 3
- XGUAUKUYQHBUNY-SWRJLBSHSA-N Thr-Trp-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCC(O)=O)C(O)=O XGUAUKUYQHBUNY-SWRJLBSHSA-N 0.000 description 3
- VEENWOSZGWWKHW-SZZJOZGLSA-N Thr-Trp-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CC3=CN=CN3)C(=O)O)N)O VEENWOSZGWWKHW-SZZJOZGLSA-N 0.000 description 3
- LTLBNCDNXQCOLB-UBHSHLNASA-N Trp-Asp-Ser Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O)=CNC2=C1 LTLBNCDNXQCOLB-UBHSHLNASA-N 0.000 description 3
- XEHGAHOCTDKOKP-XIRDDKMYSA-N Trp-Cys-Lys Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CCCCN)C(=O)O)N XEHGAHOCTDKOKP-XIRDDKMYSA-N 0.000 description 3
- HTGJDTPQYFMKNC-VFAJRCTISA-N Trp-Thr-Leu Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C(O)=O)[C@@H](C)O)=CNC2=C1 HTGJDTPQYFMKNC-VFAJRCTISA-N 0.000 description 3
- 108060008682 Tumor Necrosis Factor Proteins 0.000 description 3
- 102000000852 Tumor Necrosis Factor-alpha Human genes 0.000 description 3
- AKLNEFNQWLHIGY-QWRGUYRKSA-N Tyr-Gly-Asp Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)NCC(=O)N[C@@H](CC(=O)O)C(=O)O)N)O AKLNEFNQWLHIGY-QWRGUYRKSA-N 0.000 description 3
- LTSIAOZUVISRAQ-QWRGUYRKSA-N Tyr-Gly-Cys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)NCC(=O)N[C@@H](CS)C(=O)O)N)O LTSIAOZUVISRAQ-QWRGUYRKSA-N 0.000 description 3
- BIVIUZRBCAUNPW-JRQIVUDYSA-N Tyr-Thr-Asn Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(O)=O BIVIUZRBCAUNPW-JRQIVUDYSA-N 0.000 description 3
- RGJZPXFZIUUQDN-BPNCWPANSA-N Tyr-Val-Ala Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O RGJZPXFZIUUQDN-BPNCWPANSA-N 0.000 description 3
- VJOWWOGRNXRQMF-UVBJJODRSA-N Val-Ala-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](C)NC(=O)[C@@H](N)C(C)C)C(O)=O)=CNC2=C1 VJOWWOGRNXRQMF-UVBJJODRSA-N 0.000 description 3
- PAPWZOJOLKZEFR-AVGNSLFASA-N Val-Arg-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCCN)C(=O)O)N PAPWZOJOLKZEFR-AVGNSLFASA-N 0.000 description 3
- ZQGPWORGSNRQLN-NHCYSSNCSA-N Val-Asp-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N ZQGPWORGSNRQLN-NHCYSSNCSA-N 0.000 description 3
- ZTKGDWOUYRRAOQ-ULQDDVLXSA-N Val-His-Phe Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CC2=CC=CC=C2)C(=O)O)N ZTKGDWOUYRRAOQ-ULQDDVLXSA-N 0.000 description 3
- RWOGENDAOGMHLX-DCAQKATOSA-N Val-Lys-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](C(C)C)N RWOGENDAOGMHLX-DCAQKATOSA-N 0.000 description 3
- DOFAQXCYFQKSHT-SRVKXCTJSA-N Val-Pro-Pro Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 DOFAQXCYFQKSHT-SRVKXCTJSA-N 0.000 description 3
- VTIAEOKFUJJBTC-YDHLFZDLSA-N Val-Tyr-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N VTIAEOKFUJJBTC-YDHLFZDLSA-N 0.000 description 3
- CFIBZQOLUDURST-IHRRRGAJSA-N Val-Tyr-Cys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CS)C(=O)O)N CFIBZQOLUDURST-IHRRRGAJSA-N 0.000 description 3
- OWFGFHQMSBTKLX-UFYCRDLUSA-N Val-Tyr-Tyr Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O)N OWFGFHQMSBTKLX-UFYCRDLUSA-N 0.000 description 3
- 108010024078 alanyl-glycyl-serine Proteins 0.000 description 3
- 108010041407 alanylaspartic acid Proteins 0.000 description 3
- 108010005233 alanylglutamic acid Proteins 0.000 description 3
- 108010070944 alanylhistidine Proteins 0.000 description 3
- 239000000427 antigen Substances 0.000 description 3
- 108091007433 antigens Proteins 0.000 description 3
- 102000036639 antigens Human genes 0.000 description 3
- 108010077245 asparaginyl-proline Proteins 0.000 description 3
- 108010093581 aspartyl-proline Proteins 0.000 description 3
- 208000007951 cervical intraepithelial neoplasia Diseases 0.000 description 3
- 210000003679 cervix uteri Anatomy 0.000 description 3
- 108010057083 glutamyl-aspartyl-leucine Proteins 0.000 description 3
- 108010055341 glutamyl-glutamic acid Proteins 0.000 description 3
- 108010070643 prolylglutamic acid Proteins 0.000 description 3
- 230000001681 protective effect Effects 0.000 description 3
- 238000006467 substitution reaction Methods 0.000 description 3
- 238000002560 therapeutic procedure Methods 0.000 description 3
- 108010009962 valyltyrosine Proteins 0.000 description 3
- XVZCXCTYGHPNEM-IHRRRGAJSA-N (2s)-1-[(2s)-2-[[(2s)-2-amino-4-methylpentanoyl]amino]-4-methylpentanoyl]pyrrolidine-2-carboxylic acid Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(O)=O XVZCXCTYGHPNEM-IHRRRGAJSA-N 0.000 description 2
- HHGYNJRJIINWAK-FXQIFTODSA-N Ala-Ala-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N HHGYNJRJIINWAK-FXQIFTODSA-N 0.000 description 2
- UWQJHXKARZWDIJ-ZLUOBGJFSA-N Ala-Ala-Cys Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CS)C(O)=O UWQJHXKARZWDIJ-ZLUOBGJFSA-N 0.000 description 2
- JBVSSSZFNTXJDX-YTLHQDLWSA-N Ala-Ala-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@H](C)N JBVSSSZFNTXJDX-YTLHQDLWSA-N 0.000 description 2
- KVWLTGNCJYDJET-LSJOCFKGSA-N Ala-Arg-His Chemical compound C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N KVWLTGNCJYDJET-LSJOCFKGSA-N 0.000 description 2
- BTYTYHBSJKQBQA-GCJQMDKQSA-N Ala-Asp-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](C)N)O BTYTYHBSJKQBQA-GCJQMDKQSA-N 0.000 description 2
- FRFDXQWNDZMREB-ACZMJKKPSA-N Ala-Cys-Gln Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCC(N)=O)C(O)=O FRFDXQWNDZMREB-ACZMJKKPSA-N 0.000 description 2
- LGFCAXJBAZESCF-ACZMJKKPSA-N Ala-Gln-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O LGFCAXJBAZESCF-ACZMJKKPSA-N 0.000 description 2
- VBRDBGCROKWTPV-XHNCKOQMSA-N Ala-Glu-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N VBRDBGCROKWTPV-XHNCKOQMSA-N 0.000 description 2
- VGPWRRFOPXVGOH-BYPYZUCNSA-N Ala-Gly-Gly Chemical compound C[C@H](N)C(=O)NCC(=O)NCC(O)=O VGPWRRFOPXVGOH-BYPYZUCNSA-N 0.000 description 2
- NBTGEURICRTMGL-WHFBIAKZSA-N Ala-Gly-Ser Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O NBTGEURICRTMGL-WHFBIAKZSA-N 0.000 description 2
- LNNSWWRRYJLGNI-NAKRPEOUSA-N Ala-Ile-Val Chemical compound C[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O LNNSWWRRYJLGNI-NAKRPEOUSA-N 0.000 description 2
- IPZQNYYAYVRKKK-FXQIFTODSA-N Ala-Pro-Ala Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O IPZQNYYAYVRKKK-FXQIFTODSA-N 0.000 description 2
- FEGOCLZUJUFCHP-CIUDSAMLSA-N Ala-Pro-Gln Chemical compound [H]N[C@@H](C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O FEGOCLZUJUFCHP-CIUDSAMLSA-N 0.000 description 2
- DCVYRWFAMZFSDA-ZLUOBGJFSA-N Ala-Ser-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O DCVYRWFAMZFSDA-ZLUOBGJFSA-N 0.000 description 2
- RTZCUEHYUQZIDE-WHFBIAKZSA-N Ala-Ser-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O RTZCUEHYUQZIDE-WHFBIAKZSA-N 0.000 description 2
- NZGRHTKZFSVPAN-BIIVOSGPSA-N Ala-Ser-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CO)C(=O)N1CCC[C@@H]1C(=O)O)N NZGRHTKZFSVPAN-BIIVOSGPSA-N 0.000 description 2
- LSMDIAAALJJLRO-XQXXSGGOSA-N Ala-Thr-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O LSMDIAAALJJLRO-XQXXSGGOSA-N 0.000 description 2
- DJAIOAKQIOGULM-DCAQKATOSA-N Arg-Glu-Met Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCSC)C(O)=O DJAIOAKQIOGULM-DCAQKATOSA-N 0.000 description 2
- DGFXIWKPTDKBLF-AVGNSLFASA-N Arg-His-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CCCN=C(N)N)N DGFXIWKPTDKBLF-AVGNSLFASA-N 0.000 description 2
- WKPXXXUSUHAXDE-SRVKXCTJSA-N Arg-Pro-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCN=C(N)N)C(O)=O WKPXXXUSUHAXDE-SRVKXCTJSA-N 0.000 description 2
- QHVRVUNEAIFTEK-SZMVWBNQSA-N Arg-Pro-Trp Chemical compound N[C@@H](CCCNC(N)=N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](Cc1c[nH]c2ccccc12)C(O)=O QHVRVUNEAIFTEK-SZMVWBNQSA-N 0.000 description 2
- VJIQPOJMISSUPO-BVSLBCMMSA-N Arg-Trp-Tyr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O VJIQPOJMISSUPO-BVSLBCMMSA-N 0.000 description 2
- QMQZYILAWUOLPV-JYJNAYRXSA-N Arg-Tyr-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CCCN=C(N)N)C(O)=O)CC1=CC=C(O)C=C1 QMQZYILAWUOLPV-JYJNAYRXSA-N 0.000 description 2
- SPIPSJXLZVTXJL-ZLUOBGJFSA-N Asn-Cys-Ser Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@@H](CO)C(O)=O SPIPSJXLZVTXJL-ZLUOBGJFSA-N 0.000 description 2
- MOHUTCNYQLMARY-GUBZILKMSA-N Asn-His-Gln Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N MOHUTCNYQLMARY-GUBZILKMSA-N 0.000 description 2
- GIQCDTKOIPUDSG-GARJFASQSA-N Asn-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)[C@H](CC(=O)N)N)C(=O)O GIQCDTKOIPUDSG-GARJFASQSA-N 0.000 description 2
- NLDNNZKUSLAYFW-NHCYSSNCSA-N Asn-Lys-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O NLDNNZKUSLAYFW-NHCYSSNCSA-N 0.000 description 2
- MYVBTYXSWILFCG-BQBZGAKWSA-N Asn-Met-Gly Chemical compound CSCC[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CC(=O)N)N MYVBTYXSWILFCG-BQBZGAKWSA-N 0.000 description 2
- SZNGQSBRHFMZLT-IHRRRGAJSA-N Asn-Pro-Phe Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O SZNGQSBRHFMZLT-IHRRRGAJSA-N 0.000 description 2
- REQUGIWGOGSOEZ-ZLUOBGJFSA-N Asn-Ser-Asn Chemical compound C([C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(=O)N)C(=O)O)N)C(=O)N REQUGIWGOGSOEZ-ZLUOBGJFSA-N 0.000 description 2
- NPZJLGMWMDNQDD-GHCJXIJMSA-N Asn-Ser-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O NPZJLGMWMDNQDD-GHCJXIJMSA-N 0.000 description 2
- ZUFPUBYQYWCMDB-NUMRIWBASA-N Asn-Thr-Glu Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O ZUFPUBYQYWCMDB-NUMRIWBASA-N 0.000 description 2
- ATHZHGQSAIJHQU-XIRDDKMYSA-N Asn-Trp-Lys Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)N)N ATHZHGQSAIJHQU-XIRDDKMYSA-N 0.000 description 2
- AECPDLSSUMDUAA-ZKWXMUAHSA-N Asn-Val-Cys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC(=O)N)N AECPDLSSUMDUAA-ZKWXMUAHSA-N 0.000 description 2
- PQKSVQSMTHPRIB-ZKWXMUAHSA-N Asn-Val-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O PQKSVQSMTHPRIB-ZKWXMUAHSA-N 0.000 description 2
- KRXIWXCXOARFNT-ZLUOBGJFSA-N Asp-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC(O)=O KRXIWXCXOARFNT-ZLUOBGJFSA-N 0.000 description 2
- QRULNKJGYQQZMW-ZLUOBGJFSA-N Asp-Asn-Asp Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O QRULNKJGYQQZMW-ZLUOBGJFSA-N 0.000 description 2
- RDRMWJBLOSRRAW-BYULHYEWSA-N Asp-Asn-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O RDRMWJBLOSRRAW-BYULHYEWSA-N 0.000 description 2
- ZRAOLTNMSCSCLN-ZLUOBGJFSA-N Asp-Cys-Asn Chemical compound C([C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(=O)N)C(=O)O)N)C(=O)O ZRAOLTNMSCSCLN-ZLUOBGJFSA-N 0.000 description 2
- APYNREQHZOGYHV-ACZMJKKPSA-N Asp-Cys-Gln Chemical compound C(CC(=O)N)[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CC(=O)O)N APYNREQHZOGYHV-ACZMJKKPSA-N 0.000 description 2
- YDJVIBMKAMQPPP-LAEOZQHASA-N Asp-Glu-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O YDJVIBMKAMQPPP-LAEOZQHASA-N 0.000 description 2
- PZXPWHFYZXTFBI-YUMQZZPRSA-N Asp-Gly-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC(O)=O PZXPWHFYZXTFBI-YUMQZZPRSA-N 0.000 description 2
- NRIFEOUAFLTMFJ-AAEUAGOBSA-N Asp-Gly-Trp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O NRIFEOUAFLTMFJ-AAEUAGOBSA-N 0.000 description 2
- WSGVTKZFVJSJOG-RCOVLWMOSA-N Asp-Gly-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O WSGVTKZFVJSJOG-RCOVLWMOSA-N 0.000 description 2
- WSXDIZFNQYTUJB-SRVKXCTJSA-N Asp-His-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(O)=O WSXDIZFNQYTUJB-SRVKXCTJSA-N 0.000 description 2
- HOBNTSHITVVNBN-ZPFDUUQYSA-N Asp-Ile-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)O)NC(=O)[C@H](CC(=O)O)N HOBNTSHITVVNBN-ZPFDUUQYSA-N 0.000 description 2
- MYOHQBFRJQFIDZ-KKUMJFAQSA-N Asp-Leu-Tyr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O MYOHQBFRJQFIDZ-KKUMJFAQSA-N 0.000 description 2
- XWSIYTYNLKCLJB-CIUDSAMLSA-N Asp-Lys-Asn Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O XWSIYTYNLKCLJB-CIUDSAMLSA-N 0.000 description 2
- HJCGDIGVVWETRO-ZPFDUUQYSA-N Asp-Lys-Ile Chemical compound CC[C@H](C)[C@H](NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC(O)=O)C(O)=O HJCGDIGVVWETRO-ZPFDUUQYSA-N 0.000 description 2
- GYWQGGUCMDCUJE-DLOVCJGASA-N Asp-Phe-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C)C(O)=O GYWQGGUCMDCUJE-DLOVCJGASA-N 0.000 description 2
- KGHLGJAXYSVNJP-WHFBIAKZSA-N Asp-Ser-Gly Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O KGHLGJAXYSVNJP-WHFBIAKZSA-N 0.000 description 2
- 238000011740 C57BL/6 mouse Methods 0.000 description 2
- 210000001266 CD8-positive T-lymphocyte Anatomy 0.000 description 2
- 206010008342 Cervix carcinoma Diseases 0.000 description 2
- ZOLXQKZHYOHHMD-DLOVCJGASA-N Cys-Ala-Phe Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CS)N ZOLXQKZHYOHHMD-DLOVCJGASA-N 0.000 description 2
- DEVDFMRWZASYOF-ZLUOBGJFSA-N Cys-Asn-Asp Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O DEVDFMRWZASYOF-ZLUOBGJFSA-N 0.000 description 2
- NQSUTVRXXBGVDQ-LKXGYXEUSA-N Cys-Asn-Thr Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O NQSUTVRXXBGVDQ-LKXGYXEUSA-N 0.000 description 2
- MGAWEOHYNIMOQJ-ACZMJKKPSA-N Cys-Gln-Asp Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CS)N MGAWEOHYNIMOQJ-ACZMJKKPSA-N 0.000 description 2
- VTJLJQGUMBWHBP-GUBZILKMSA-N Cys-His-Gln Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CS)N VTJLJQGUMBWHBP-GUBZILKMSA-N 0.000 description 2
- OHLLDUNVMPPUMD-DCAQKATOSA-N Cys-Leu-Val Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](C(C)C)C(=O)O)NC(=O)[C@H](CS)N OHLLDUNVMPPUMD-DCAQKATOSA-N 0.000 description 2
- VDUPGIDTWNQAJD-CIUDSAMLSA-N Cys-Lys-Cys Chemical compound NCCCC[C@H](NC(=O)[C@@H](N)CS)C(=O)N[C@@H](CS)C(O)=O VDUPGIDTWNQAJD-CIUDSAMLSA-N 0.000 description 2
- WTEJFWOJHCJDML-FXQIFTODSA-N Cys-Met-Cys Chemical compound SC[C@H](N)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CS)C(O)=O WTEJFWOJHCJDML-FXQIFTODSA-N 0.000 description 2
- UIKLEGZPIOXFHJ-DLOVCJGASA-N Cys-Phe-Ala Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C)C(O)=O UIKLEGZPIOXFHJ-DLOVCJGASA-N 0.000 description 2
- 238000011510 Elispot assay Methods 0.000 description 2
- INKFLNZBTSNFON-CIUDSAMLSA-N Gln-Ala-Arg Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O INKFLNZBTSNFON-CIUDSAMLSA-N 0.000 description 2
- KWUSGAIFNHQCBY-DCAQKATOSA-N Gln-Arg-Arg Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O KWUSGAIFNHQCBY-DCAQKATOSA-N 0.000 description 2
- JESJDAAGXULQOP-CIUDSAMLSA-N Gln-Arg-Ser Chemical compound C(C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CCC(=O)N)N)CN=C(N)N JESJDAAGXULQOP-CIUDSAMLSA-N 0.000 description 2
- ZQPOVSJFBBETHQ-CIUDSAMLSA-N Gln-Glu-Gln Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O ZQPOVSJFBBETHQ-CIUDSAMLSA-N 0.000 description 2
- HXOLDXKNWKLDMM-YVNDNENWSA-N Gln-Ile-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CCC(=O)N)N HXOLDXKNWKLDMM-YVNDNENWSA-N 0.000 description 2
- IOFDDSNZJDIGPB-GVXVVHGQSA-N Gln-Leu-Val Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O IOFDDSNZJDIGPB-GVXVVHGQSA-N 0.000 description 2
- SWDSRANUCKNBLA-AVGNSLFASA-N Gln-Phe-Asp Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CCC(=O)N)N SWDSRANUCKNBLA-AVGNSLFASA-N 0.000 description 2
- OACQOWPRWGNKTP-AVGNSLFASA-N Gln-Tyr-Asp Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O OACQOWPRWGNKTP-AVGNSLFASA-N 0.000 description 2
- SDSMVVSHLAAOJL-UKJIMTQDSA-N Gln-Val-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CCC(=O)N)N SDSMVVSHLAAOJL-UKJIMTQDSA-N 0.000 description 2
- RUFHOVYUYSNDNY-ACZMJKKPSA-N Glu-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(O)=O RUFHOVYUYSNDNY-ACZMJKKPSA-N 0.000 description 2
- FHPXTPQBODWBIY-CIUDSAMLSA-N Glu-Ala-Arg Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O FHPXTPQBODWBIY-CIUDSAMLSA-N 0.000 description 2
- WZZSKAJIHTUUSG-ACZMJKKPSA-N Glu-Ala-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(O)=O WZZSKAJIHTUUSG-ACZMJKKPSA-N 0.000 description 2
- NKSGKPWXSWBRRX-ACZMJKKPSA-N Glu-Asn-Cys Chemical compound C(CC(=O)O)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CS)C(=O)O)N NKSGKPWXSWBRRX-ACZMJKKPSA-N 0.000 description 2
- RJONUNZIMUXUOI-GUBZILKMSA-N Glu-Asn-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCC(=O)O)N RJONUNZIMUXUOI-GUBZILKMSA-N 0.000 description 2
- VAZZOGXDUQSVQF-NUMRIWBASA-N Glu-Asn-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCC(=O)O)N)O VAZZOGXDUQSVQF-NUMRIWBASA-N 0.000 description 2
- YSPJWDABFLRKDK-QAETUUGQSA-N Glu-Gln-Gln-Tyr Chemical compound N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](Cc1ccc(O)cc1)C(O)=O YSPJWDABFLRKDK-QAETUUGQSA-N 0.000 description 2
- QQLBPVKLJBAXBS-FXQIFTODSA-N Glu-Glu-Asn Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O QQLBPVKLJBAXBS-FXQIFTODSA-N 0.000 description 2
- IQACOVZVOMVILH-FXQIFTODSA-N Glu-Glu-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O IQACOVZVOMVILH-FXQIFTODSA-N 0.000 description 2
- ZHNHJYYFCGUZNQ-KBIXCLLPSA-N Glu-Ile-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCC(O)=O ZHNHJYYFCGUZNQ-KBIXCLLPSA-N 0.000 description 2
- GJBUAAAIZSRCDC-GVXVVHGQSA-N Glu-Leu-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O GJBUAAAIZSRCDC-GVXVVHGQSA-N 0.000 description 2
- AQNYKMCFCCZEEL-JYJNAYRXSA-N Glu-Lys-Tyr Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 AQNYKMCFCCZEEL-JYJNAYRXSA-N 0.000 description 2
- BFEZQZKEPRKKHV-SRVKXCTJSA-N Glu-Pro-Lys Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CCC(=O)O)N)C(=O)N[C@@H](CCCCN)C(=O)O BFEZQZKEPRKKHV-SRVKXCTJSA-N 0.000 description 2
- MRWYPDWDZSLWJM-ACZMJKKPSA-N Glu-Ser-Asp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O MRWYPDWDZSLWJM-ACZMJKKPSA-N 0.000 description 2
- JWNZHMSRZXXGTM-XKBZYTNZSA-N Glu-Ser-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O JWNZHMSRZXXGTM-XKBZYTNZSA-N 0.000 description 2
- VIPDPMHGICREIS-GVXVVHGQSA-N Glu-Val-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O VIPDPMHGICREIS-GVXVVHGQSA-N 0.000 description 2
- XIJOPMSILDNVNJ-ZVZYQTTQSA-N Glu-Val-Trp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O XIJOPMSILDNVNJ-ZVZYQTTQSA-N 0.000 description 2
- QXPRJQPCFXMCIY-NKWVEPMBSA-N Gly-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)CN QXPRJQPCFXMCIY-NKWVEPMBSA-N 0.000 description 2
- FUTAPPOITCCWTH-WHFBIAKZSA-N Gly-Asp-Asp Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O FUTAPPOITCCWTH-WHFBIAKZSA-N 0.000 description 2
- XBWMTPAIUQIWKA-BYULHYEWSA-N Gly-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)CN XBWMTPAIUQIWKA-BYULHYEWSA-N 0.000 description 2
- PMNHJLASAAWELO-FOHZUACHSA-N Gly-Asp-Thr Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PMNHJLASAAWELO-FOHZUACHSA-N 0.000 description 2
- VOCMRCVMAPSSAL-IUCAKERBSA-N Gly-Gln-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)CN VOCMRCVMAPSSAL-IUCAKERBSA-N 0.000 description 2
- LXXANCRPFBSSKS-IUCAKERBSA-N Gly-Gln-Leu Chemical compound [H]NCC(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O LXXANCRPFBSSKS-IUCAKERBSA-N 0.000 description 2
- DHNXGWVNLFPOMQ-KBPBESRZSA-N Gly-Phe-His Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)NC(=O)CN DHNXGWVNLFPOMQ-KBPBESRZSA-N 0.000 description 2
- NSVOVKWEKGEOQB-LURJTMIESA-N Gly-Pro-Gly Chemical compound NCC(=O)N1CCC[C@H]1C(=O)NCC(O)=O NSVOVKWEKGEOQB-LURJTMIESA-N 0.000 description 2
- TVTZEOHWHUVYCG-KYNKHSRBSA-N Gly-Thr-Thr Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O TVTZEOHWHUVYCG-KYNKHSRBSA-N 0.000 description 2
- QSLKWWDKIXMWJV-SRVKXCTJSA-N His-Cys-Lys Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CCCCN)C(=O)O)N QSLKWWDKIXMWJV-SRVKXCTJSA-N 0.000 description 2
- PGTISAJTWZPFGN-PEXQALLHSA-N His-Gly-Ile Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)NCC(=O)N[C@@H]([C@@H](C)CC)C(O)=O PGTISAJTWZPFGN-PEXQALLHSA-N 0.000 description 2
- BRZQWIIFIKTJDH-VGDYDELISA-N His-Ile-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N BRZQWIIFIKTJDH-VGDYDELISA-N 0.000 description 2
- SKOKHBGDXGTDDP-MELADBBJSA-N His-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CN=CN2)N SKOKHBGDXGTDDP-MELADBBJSA-N 0.000 description 2
- LVWIJITYHRZHBO-IXOXFDKPSA-N His-Leu-Thr Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LVWIJITYHRZHBO-IXOXFDKPSA-N 0.000 description 2
- FHGVHXCQMJWQPK-SRVKXCTJSA-N His-Lys-Asn Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O FHGVHXCQMJWQPK-SRVKXCTJSA-N 0.000 description 2
- HBGKOLSGLYMWSW-DCAQKATOSA-N His-Pro-Cys Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CC2=CN=CN2)N)C(=O)N[C@@H](CS)C(=O)O HBGKOLSGLYMWSW-DCAQKATOSA-N 0.000 description 2
- UNDGQKWQNSTPPW-CYDGBPFRSA-N Ile-Arg-Met Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCSC)C(=O)O)N UNDGQKWQNSTPPW-CYDGBPFRSA-N 0.000 description 2
- XLDYDEDTGMHUCZ-GHCJXIJMSA-N Ile-Asp-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CS)C(=O)O)N XLDYDEDTGMHUCZ-GHCJXIJMSA-N 0.000 description 2
- HGNUKGZQASSBKQ-PCBIJLKTSA-N Ile-Asp-Phe Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N HGNUKGZQASSBKQ-PCBIJLKTSA-N 0.000 description 2
- LLHYWBGDMBGNHA-VGDYDELISA-N Ile-Cys-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N LLHYWBGDMBGNHA-VGDYDELISA-N 0.000 description 2
- AWTDTFXPVCTHAK-BJDJZHNGSA-N Ile-Cys-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CCCCN)C(=O)O)N AWTDTFXPVCTHAK-BJDJZHNGSA-N 0.000 description 2
- PNDMHTTXXPUQJH-RWRJDSDZSA-N Ile-Glu-Thr Chemical compound N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H]([C@H](O)C)C(=O)O PNDMHTTXXPUQJH-RWRJDSDZSA-N 0.000 description 2
- LEHPJMKVGFPSSP-ZQINRCPSSA-N Ile-Glu-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)[C@@H](C)CC)C(O)=O)=CNC2=C1 LEHPJMKVGFPSSP-ZQINRCPSSA-N 0.000 description 2
- NUKXXNFEUZGPRO-BJDJZHNGSA-N Ile-Leu-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CS)C(=O)O)N NUKXXNFEUZGPRO-BJDJZHNGSA-N 0.000 description 2
- HUORUFRRJHELPD-MNXVOIDGSA-N Ile-Leu-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N HUORUFRRJHELPD-MNXVOIDGSA-N 0.000 description 2
- RQJUKVXWAKJDBW-SVSWQMSJSA-N Ile-Ser-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N RQJUKVXWAKJDBW-SVSWQMSJSA-N 0.000 description 2
- PRTZQMBYUZFSFA-XEGUGMAKSA-N Ile-Tyr-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)NCC(=O)O)N PRTZQMBYUZFSFA-XEGUGMAKSA-N 0.000 description 2
- WRDTXMBPHMBGIB-STECZYCISA-N Ile-Tyr-Val Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C(C)C)C(O)=O)CC1=CC=C(O)C=C1 WRDTXMBPHMBGIB-STECZYCISA-N 0.000 description 2
- 108010065920 Insulin Lispro Proteins 0.000 description 2
- FADYJNXDPBKVCA-UHFFFAOYSA-N L-Phenylalanyl-L-lysin Natural products NCCCCC(C(O)=O)NC(=O)C(N)CC1=CC=CC=C1 FADYJNXDPBKVCA-UHFFFAOYSA-N 0.000 description 2
- CQQGCWPXDHTTNF-GUBZILKMSA-N Leu-Ala-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCC(O)=O CQQGCWPXDHTTNF-GUBZILKMSA-N 0.000 description 2
- PVMPDMIKUVNOBD-CIUDSAMLSA-N Leu-Asp-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O PVMPDMIKUVNOBD-CIUDSAMLSA-N 0.000 description 2
- WCTCIIAGNMFYAO-DCAQKATOSA-N Leu-Cys-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CS)C(=O)N[C@@H](C(C)C)C(O)=O WCTCIIAGNMFYAO-DCAQKATOSA-N 0.000 description 2
- LOLUPZNNADDTAA-AVGNSLFASA-N Leu-Gln-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O LOLUPZNNADDTAA-AVGNSLFASA-N 0.000 description 2
- DZQMXBALGUHGJT-GUBZILKMSA-N Leu-Glu-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O DZQMXBALGUHGJT-GUBZILKMSA-N 0.000 description 2
- PRZVBIAOPFGAQF-SRVKXCTJSA-N Leu-Glu-Met Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCSC)C(O)=O PRZVBIAOPFGAQF-SRVKXCTJSA-N 0.000 description 2
- KXODZBLFVFSLAI-AVGNSLFASA-N Leu-His-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CC(C)C)CC1=CN=CN1 KXODZBLFVFSLAI-AVGNSLFASA-N 0.000 description 2
- JFSGIJSCJFQGSZ-MXAVVETBSA-N Leu-Ile-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC(C)C)N JFSGIJSCJFQGSZ-MXAVVETBSA-N 0.000 description 2
- YOKVEHGYYQEQOP-QWRGUYRKSA-N Leu-Leu-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O YOKVEHGYYQEQOP-QWRGUYRKSA-N 0.000 description 2
- IEWBEPKLKUXQBU-VOAKCMCISA-N Leu-Leu-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O IEWBEPKLKUXQBU-VOAKCMCISA-N 0.000 description 2
- ZRHDPZAAWLXXIR-SRVKXCTJSA-N Leu-Lys-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O ZRHDPZAAWLXXIR-SRVKXCTJSA-N 0.000 description 2
- CHJKEDSZNSONPS-DCAQKATOSA-N Leu-Pro-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O CHJKEDSZNSONPS-DCAQKATOSA-N 0.000 description 2
- LFSQWRSVPNKJGP-WDCWCFNPSA-N Leu-Thr-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCC(O)=O LFSQWRSVPNKJGP-WDCWCFNPSA-N 0.000 description 2
- AIMGJYMCTAABEN-GVXVVHGQSA-N Leu-Val-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O AIMGJYMCTAABEN-GVXVVHGQSA-N 0.000 description 2
- QESXLSQLQHHTIX-RHYQMDGZSA-N Leu-Val-Thr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O QESXLSQLQHHTIX-RHYQMDGZSA-N 0.000 description 2
- BYPMOIFBQPEWOH-CIUDSAMLSA-N Lys-Asn-Asp Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N BYPMOIFBQPEWOH-CIUDSAMLSA-N 0.000 description 2
- IBQMEXQYZMVIFU-SRVKXCTJSA-N Lys-Asp-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCCCN)N IBQMEXQYZMVIFU-SRVKXCTJSA-N 0.000 description 2
- WGCKDDHUFPQSMZ-ZPFDUUQYSA-N Lys-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CCCCN WGCKDDHUFPQSMZ-ZPFDUUQYSA-N 0.000 description 2
- IWWMPCPLFXFBAF-SRVKXCTJSA-N Lys-Asp-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O IWWMPCPLFXFBAF-SRVKXCTJSA-N 0.000 description 2
- ZAENPHCEQXALHO-GUBZILKMSA-N Lys-Cys-Glu Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCC(O)=O)C(O)=O ZAENPHCEQXALHO-GUBZILKMSA-N 0.000 description 2
- LPAJOCKCPRZEAG-MNXVOIDGSA-N Lys-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCCCN LPAJOCKCPRZEAG-MNXVOIDGSA-N 0.000 description 2
- CAVGLNOOIFHJOF-SRVKXCTJSA-N Lys-His-Cys Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCCCN)N CAVGLNOOIFHJOF-SRVKXCTJSA-N 0.000 description 2
- YXTKSLRSRXKXNV-IHRRRGAJSA-N Lys-His-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CCCCN)N YXTKSLRSRXKXNV-IHRRRGAJSA-N 0.000 description 2
- BEGQVWUZFXLNHZ-IHPCNDPISA-N Lys-Lys-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCCCN)C(O)=O)=CNC2=C1 BEGQVWUZFXLNHZ-IHPCNDPISA-N 0.000 description 2
- QBHGXFQJFPWJIH-XUXIUFHCSA-N Lys-Pro-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CCCCN QBHGXFQJFPWJIH-XUXIUFHCSA-N 0.000 description 2
- IOQWIOPSKJOEKI-SRVKXCTJSA-N Lys-Ser-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O IOQWIOPSKJOEKI-SRVKXCTJSA-N 0.000 description 2
- MIFFFXHMAHFACR-KATARQTJSA-N Lys-Ser-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CCCCN MIFFFXHMAHFACR-KATARQTJSA-N 0.000 description 2
- QVTDVTONTRSQMF-WDCWCFNPSA-N Lys-Thr-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H]([C@H](O)C)NC(=O)[C@@H](N)CCCCN QVTDVTONTRSQMF-WDCWCFNPSA-N 0.000 description 2
- IEIHKHYMBIYQTH-YESZJQIVSA-N Lys-Tyr-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=C(C=C2)O)NC(=O)[C@H](CCCCN)N)C(=O)O IEIHKHYMBIYQTH-YESZJQIVSA-N 0.000 description 2
- UKUMISIRZAVYOG-CIUDSAMLSA-N Met-Glu-Cys Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CS)C(O)=O UKUMISIRZAVYOG-CIUDSAMLSA-N 0.000 description 2
- BMHIFARYXOJDLD-WPRPVWTQSA-N Met-Gly-Val Chemical compound [H]N[C@@H](CCSC)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O BMHIFARYXOJDLD-WPRPVWTQSA-N 0.000 description 2
- MXEASDMFHUKOGE-ULQDDVLXSA-N Met-His-Tyr Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O)N MXEASDMFHUKOGE-ULQDDVLXSA-N 0.000 description 2
- OOXVBECOTYHTCK-WDSOQIARSA-N Met-Trp-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@H](CCSC)N OOXVBECOTYHTCK-WDSOQIARSA-N 0.000 description 2
- 108010087066 N2-tryptophyllysine Proteins 0.000 description 2
- BQVUABVGYYSDCJ-UHFFFAOYSA-N Nalpha-L-Leucyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)CC(C)C)C(O)=O)=CNC2=C1 BQVUABVGYYSDCJ-UHFFFAOYSA-N 0.000 description 2
- 206010028980 Neoplasm Diseases 0.000 description 2
- JIYJYFIXQTYDNF-YDHLFZDLSA-N Phe-Asn-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC1=CC=CC=C1)N JIYJYFIXQTYDNF-YDHLFZDLSA-N 0.000 description 2
- WPTYDQPGBMDUBI-QWRGUYRKSA-N Phe-Gly-Asn Chemical compound N[C@@H](Cc1ccccc1)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O WPTYDQPGBMDUBI-QWRGUYRKSA-N 0.000 description 2
- PPHFTNABKQRAJV-JYJNAYRXSA-N Phe-His-Gln Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N PPHFTNABKQRAJV-JYJNAYRXSA-N 0.000 description 2
- KBVJZCVLQWCJQN-KKUMJFAQSA-N Phe-Leu-Asn Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O KBVJZCVLQWCJQN-KKUMJFAQSA-N 0.000 description 2
- YTILBRIUASDGBL-BZSNNMDCSA-N Phe-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 YTILBRIUASDGBL-BZSNNMDCSA-N 0.000 description 2
- RAGOJJCBGXARPO-XVSYOHENSA-N Phe-Thr-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H]([C@H](O)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 RAGOJJCBGXARPO-XVSYOHENSA-N 0.000 description 2
- APMXLWHMIVWLLR-BZSNNMDCSA-N Phe-Tyr-Ser Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)N[C@@H](CO)C(O)=O)C1=CC=CC=C1 APMXLWHMIVWLLR-BZSNNMDCSA-N 0.000 description 2
- 108010047620 Phytohemagglutinins Proteins 0.000 description 2
- DZZCICYRSZASNF-FXQIFTODSA-N Pro-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1 DZZCICYRSZASNF-FXQIFTODSA-N 0.000 description 2
- IFMDQWDAJUMMJC-DCAQKATOSA-N Pro-Ala-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O IFMDQWDAJUMMJC-DCAQKATOSA-N 0.000 description 2
- XQLBWXHVZVBNJM-FXQIFTODSA-N Pro-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1 XQLBWXHVZVBNJM-FXQIFTODSA-N 0.000 description 2
- UVKNEILZSJMKSR-FXQIFTODSA-N Pro-Asn-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H]1CCCN1 UVKNEILZSJMKSR-FXQIFTODSA-N 0.000 description 2
- TXPUNZXZDVJUJQ-LPEHRKFASA-N Pro-Asn-Pro Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC(=O)N)C(=O)N2CCC[C@@H]2C(=O)O TXPUNZXZDVJUJQ-LPEHRKFASA-N 0.000 description 2
- SWXSLPHTJVAWDF-VEVYYDQMSA-N Pro-Asn-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SWXSLPHTJVAWDF-VEVYYDQMSA-N 0.000 description 2
- FISHYTLIMUYTQY-GUBZILKMSA-N Pro-Gln-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H]1CCCN1 FISHYTLIMUYTQY-GUBZILKMSA-N 0.000 description 2
- LQZZPNDMYNZPFT-KKUMJFAQSA-N Pro-Gln-Phe Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O LQZZPNDMYNZPFT-KKUMJFAQSA-N 0.000 description 2
- LXVLKXPFIDDHJG-CIUDSAMLSA-N Pro-Glu-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O LXVLKXPFIDDHJG-CIUDSAMLSA-N 0.000 description 2
- YYARMJSFDLIDFS-FKBYEOEOSA-N Pro-Phe-Trp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O YYARMJSFDLIDFS-FKBYEOEOSA-N 0.000 description 2
- SVXXJYJCRNKDDE-AVGNSLFASA-N Pro-Pro-His Chemical compound C([C@@H](C(=O)O)NC(=O)[C@H]1N(CCC1)C(=O)[C@H]1NCCC1)C1=CN=CN1 SVXXJYJCRNKDDE-AVGNSLFASA-N 0.000 description 2
- DWPXHLIBFQLKLK-CYDGBPFRSA-N Pro-Pro-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 DWPXHLIBFQLKLK-CYDGBPFRSA-N 0.000 description 2
- VVAWNPIOYXAMAL-KJEVXHAQSA-N Pro-Thr-Tyr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O VVAWNPIOYXAMAL-KJEVXHAQSA-N 0.000 description 2
- YHUBAXGAAYULJY-ULQDDVLXSA-N Pro-Tyr-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(O)=O YHUBAXGAAYULJY-ULQDDVLXSA-N 0.000 description 2
- 101710132597 Protein E5 Proteins 0.000 description 2
- JPIDMRXXNMIVKY-VZFHVOOUSA-N Ser-Ala-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O JPIDMRXXNMIVKY-VZFHVOOUSA-N 0.000 description 2
- XVAUJOAYHWWNQF-ZLUOBGJFSA-N Ser-Asn-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(O)=O XVAUJOAYHWWNQF-ZLUOBGJFSA-N 0.000 description 2
- UCXDHBORXLVBNC-ZLUOBGJFSA-N Ser-Asn-Cys Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CS)C(O)=O UCXDHBORXLVBNC-ZLUOBGJFSA-N 0.000 description 2
- MMAPOBOTRUVNKJ-ZLUOBGJFSA-N Ser-Asp-Ser Chemical compound C([C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CO)N)C(=O)O MMAPOBOTRUVNKJ-ZLUOBGJFSA-N 0.000 description 2
- UCOYFSCEIWQYNL-FXQIFTODSA-N Ser-Cys-Met Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCSC)C(O)=O UCOYFSCEIWQYNL-FXQIFTODSA-N 0.000 description 2
- YPUSXTWURJANKF-KBIXCLLPSA-N Ser-Gln-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O YPUSXTWURJANKF-KBIXCLLPSA-N 0.000 description 2
- DSGYZICNAMEJOC-AVGNSLFASA-N Ser-Glu-Phe Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O DSGYZICNAMEJOC-AVGNSLFASA-N 0.000 description 2
- BKZYBLLIBOBOOW-GHCJXIJMSA-N Ser-Ile-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O BKZYBLLIBOBOOW-GHCJXIJMSA-N 0.000 description 2
- DLPXTCTVNDTYGJ-JBDRJPRFSA-N Ser-Ile-Cys Chemical compound OC[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CS)C(O)=O DLPXTCTVNDTYGJ-JBDRJPRFSA-N 0.000 description 2
- FUMGHWDRRFCKEP-CIUDSAMLSA-N Ser-Leu-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O FUMGHWDRRFCKEP-CIUDSAMLSA-N 0.000 description 2
- KCNSGAMPBPYUAI-CIUDSAMLSA-N Ser-Leu-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O KCNSGAMPBPYUAI-CIUDSAMLSA-N 0.000 description 2
- VMLONWHIORGALA-SRVKXCTJSA-N Ser-Leu-Leu Chemical compound CC(C)C[C@@H](C([O-])=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H]([NH3+])CO VMLONWHIORGALA-SRVKXCTJSA-N 0.000 description 2
- PMCMLDNPAZUYGI-DCAQKATOSA-N Ser-Lys-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O PMCMLDNPAZUYGI-DCAQKATOSA-N 0.000 description 2
- AXVNLRQLPLSIPQ-FXQIFTODSA-N Ser-Met-Cys Chemical compound CSCC[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CO)N AXVNLRQLPLSIPQ-FXQIFTODSA-N 0.000 description 2
- ADJDNJCSPNFFPI-FXQIFTODSA-N Ser-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CO ADJDNJCSPNFFPI-FXQIFTODSA-N 0.000 description 2
- PYTKULIABVRXSC-BWBBJGPYSA-N Ser-Ser-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PYTKULIABVRXSC-BWBBJGPYSA-N 0.000 description 2
- VVKVHAOOUGNDPJ-SRVKXCTJSA-N Ser-Tyr-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(O)=O VVKVHAOOUGNDPJ-SRVKXCTJSA-N 0.000 description 2
- SGZVZUCRAVSPKQ-FXQIFTODSA-N Ser-Val-Cys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CO)N SGZVZUCRAVSPKQ-FXQIFTODSA-N 0.000 description 2
- 108091081024 Start codon Proteins 0.000 description 2
- WFUAUEQXPVNAEF-ZJDVBMNYSA-N Thr-Arg-Thr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H]([C@@H](C)O)C(O)=O)CCCN=C(N)N WFUAUEQXPVNAEF-ZJDVBMNYSA-N 0.000 description 2
- VBPDMBAFBRDZSK-HOUAVDHOSA-N Thr-Asn-Trp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N)O VBPDMBAFBRDZSK-HOUAVDHOSA-N 0.000 description 2
- YBXMGKCLOPDEKA-NUMRIWBASA-N Thr-Asp-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O YBXMGKCLOPDEKA-NUMRIWBASA-N 0.000 description 2
- KGKWKSSSQGGYAU-SUSMZKCASA-N Thr-Gln-Thr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N)O KGKWKSSSQGGYAU-SUSMZKCASA-N 0.000 description 2
- LHEZGZQRLDBSRR-WDCWCFNPSA-N Thr-Glu-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O LHEZGZQRLDBSRR-WDCWCFNPSA-N 0.000 description 2
- KBLYJPQSNGTDIU-LOKLDPHHSA-N Thr-Glu-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N)O KBLYJPQSNGTDIU-LOKLDPHHSA-N 0.000 description 2
- XOTBWOCSLMBGMF-SUSMZKCASA-N Thr-Glu-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XOTBWOCSLMBGMF-SUSMZKCASA-N 0.000 description 2
- BNGDYRRHRGOPHX-IFFSRLJSSA-N Thr-Glu-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)[C@@H](C)O)C(O)=O BNGDYRRHRGOPHX-IFFSRLJSSA-N 0.000 description 2
- GXUWHVZYDAHFSV-FLBSBUHZSA-N Thr-Ile-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GXUWHVZYDAHFSV-FLBSBUHZSA-N 0.000 description 2
- ODXKUIGEPAGKKV-KATARQTJSA-N Thr-Leu-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CS)C(=O)O)N)O ODXKUIGEPAGKKV-KATARQTJSA-N 0.000 description 2
- 108700019146 Transgenes Proteins 0.000 description 2
- BXKWZPXTTSCOMX-AQZXSJQPSA-N Trp-Asn-Thr Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O BXKWZPXTTSCOMX-AQZXSJQPSA-N 0.000 description 2
- WNZRNOGHEONFMS-PXDAIIFMSA-N Trp-Ile-Tyr Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O WNZRNOGHEONFMS-PXDAIIFMSA-N 0.000 description 2
- CXPJPTFWKXNDKV-NUTKFTJISA-N Trp-Leu-Ala Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O)=CNC2=C1 CXPJPTFWKXNDKV-NUTKFTJISA-N 0.000 description 2
- NLLARHRWSFNEMH-NUTKFTJISA-N Trp-Lys-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N NLLARHRWSFNEMH-NUTKFTJISA-N 0.000 description 2
- OWSRIUBVJOQHNY-IHPCNDPISA-N Trp-Lys-His Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC3=CN=CN3)C(=O)O)N OWSRIUBVJOQHNY-IHPCNDPISA-N 0.000 description 2
- GQEXFCQNAJHJTI-IHPCNDPISA-N Trp-Phe-Asp Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)N GQEXFCQNAJHJTI-IHPCNDPISA-N 0.000 description 2
- MICSYKFECRFCTJ-IHRRRGAJSA-N Tyr-Arg-Asp Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N)O MICSYKFECRFCTJ-IHRRRGAJSA-N 0.000 description 2
- DWJQKEZKLQCHKO-SRVKXCTJSA-N Tyr-Asn-Cys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CS)C(=O)O)N)O DWJQKEZKLQCHKO-SRVKXCTJSA-N 0.000 description 2
- DANHCMVVXDXOHN-SRVKXCTJSA-N Tyr-Asp-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 DANHCMVVXDXOHN-SRVKXCTJSA-N 0.000 description 2
- BODHJXJNRVRKFA-BZSNNMDCSA-N Tyr-Cys-Tyr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O BODHJXJNRVRKFA-BZSNNMDCSA-N 0.000 description 2
- QHLIUFUEUDFAOT-MGHWNKPDSA-N Tyr-Leu-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC1=CC=C(C=C1)O)N QHLIUFUEUDFAOT-MGHWNKPDSA-N 0.000 description 2
- LUMQYLVYUIRHHU-YJRXYDGGSA-N Tyr-Ser-Thr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LUMQYLVYUIRHHU-YJRXYDGGSA-N 0.000 description 2
- ANHVRCNNGJMJNG-BZSNNMDCSA-N Tyr-Tyr-Cys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)N[C@@H](CS)C(=O)O)N)O ANHVRCNNGJMJNG-BZSNNMDCSA-N 0.000 description 2
- 208000006105 Uterine Cervical Neoplasms Diseases 0.000 description 2
- BMGOFDMKDVVGJG-NHCYSSNCSA-N Val-Asp-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N BMGOFDMKDVVGJG-NHCYSSNCSA-N 0.000 description 2
- BWVHQINTNLVWGZ-ZKWXMUAHSA-N Val-Cys-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(=O)O)C(=O)O)N BWVHQINTNLVWGZ-ZKWXMUAHSA-N 0.000 description 2
- VXCAZHCVDBQMTP-NRPADANISA-N Val-Cys-Gln Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N VXCAZHCVDBQMTP-NRPADANISA-N 0.000 description 2
- AAOPYWQQBXHINJ-DZKIICNBSA-N Val-Gln-Tyr Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N AAOPYWQQBXHINJ-DZKIICNBSA-N 0.000 description 2
- KDKLLPMFFGYQJD-CYDGBPFRSA-N Val-Ile-Arg Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](C(C)C)N KDKLLPMFFGYQJD-CYDGBPFRSA-N 0.000 description 2
- APQIVBCUIUDSMB-OSUNSFLBSA-N Val-Ile-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](C(C)C)N APQIVBCUIUDSMB-OSUNSFLBSA-N 0.000 description 2
- DEGUERSKQBRZMZ-FXQIFTODSA-N Val-Ser-Ala Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O DEGUERSKQBRZMZ-FXQIFTODSA-N 0.000 description 2
- PGQUDQYHWICSAB-NAKRPEOUSA-N Val-Ser-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](C(C)C)N PGQUDQYHWICSAB-NAKRPEOUSA-N 0.000 description 2
- UJMCYJKPDFQLHX-XGEHTFHBSA-N Val-Ser-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](C(C)C)N)O UJMCYJKPDFQLHX-XGEHTFHBSA-N 0.000 description 2
- LZRWTJSPTJSWDN-FKBYEOEOSA-N Val-Trp-Phe Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CC3=CC=CC=C3)C(=O)O)N LZRWTJSPTJSWDN-FKBYEOEOSA-N 0.000 description 2
- NLNCNKIVJPEFBC-DLOVCJGASA-N Val-Val-Glu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCC(O)=O NLNCNKIVJPEFBC-DLOVCJGASA-N 0.000 description 2
- JSOXWWFKRJKTMT-WOPDTQHZSA-N Val-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N JSOXWWFKRJKTMT-WOPDTQHZSA-N 0.000 description 2
- 238000007792 addition Methods 0.000 description 2
- 108010045350 alanyl-tyrosyl-alanine Proteins 0.000 description 2
- 230000005540 biological transmission Effects 0.000 description 2
- 201000010881 cervical cancer Diseases 0.000 description 2
- 238000012217 deletion Methods 0.000 description 2
- 230000037430 deletion Effects 0.000 description 2
- 230000007613 environmental effect Effects 0.000 description 2
- 108010079547 glutamylmethionine Proteins 0.000 description 2
- 108010010147 glycylglutamine Proteins 0.000 description 2
- 108010040030 histidinoalanine Proteins 0.000 description 2
- 238000002649 immunization Methods 0.000 description 2
- 230000003053 immunization Effects 0.000 description 2
- 108010028930 invariant chain Proteins 0.000 description 2
- 108010064235 lysylglycine Proteins 0.000 description 2
- 108010056582 methionylglutamic acid Proteins 0.000 description 2
- 108010005942 methionylglycine Proteins 0.000 description 2
- 108010073101 phenylalanylleucine Proteins 0.000 description 2
- 230000001885 phytohemagglutinin Effects 0.000 description 2
- 108010031719 prolyl-serine Proteins 0.000 description 2
- 108010004914 prolylarginine Proteins 0.000 description 2
- 230000008707 rearrangement Effects 0.000 description 2
- 230000006798 recombination Effects 0.000 description 2
- 238000005215 recombination Methods 0.000 description 2
- 210000004989 spleen cell Anatomy 0.000 description 2
- 230000000638 stimulation Effects 0.000 description 2
- 238000012360 testing method Methods 0.000 description 2
- 108010029384 tryptophyl-histidine Proteins 0.000 description 2
- 241000990167 unclassified Simian adenoviruses Species 0.000 description 2
- 238000011144 upstream manufacturing Methods 0.000 description 2
- COEXAQSTZUWMRI-STQMWFEESA-N (2s)-1-[2-[[(2s)-2-amino-3-(4-hydroxyphenyl)propanoyl]amino]acetyl]pyrrolidine-2-carboxylic acid Chemical compound C([C@H](N)C(=O)NCC(=O)N1[C@@H](CCC1)C(O)=O)C1=CC=C(O)C=C1 COEXAQSTZUWMRI-STQMWFEESA-N 0.000 description 1
- YLTKNGYYPIWKHZ-ACZMJKKPSA-N Ala-Ala-Glu Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCC(O)=O YLTKNGYYPIWKHZ-ACZMJKKPSA-N 0.000 description 1
- RLMISHABBKUNFO-WHFBIAKZSA-N Ala-Ala-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O RLMISHABBKUNFO-WHFBIAKZSA-N 0.000 description 1
- WQVFQXXBNHHPLX-ZKWXMUAHSA-N Ala-Ala-His Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](Cc1cnc[nH]1)C(O)=O WQVFQXXBNHHPLX-ZKWXMUAHSA-N 0.000 description 1
- CXRCVCURMBFFOL-FXQIFTODSA-N Ala-Ala-Pro Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N1CCC[C@H]1C(O)=O CXRCVCURMBFFOL-FXQIFTODSA-N 0.000 description 1
- ZFXQNADNEBRERM-BJDJZHNGSA-N Ala-Ala-Pro-Pro Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 ZFXQNADNEBRERM-BJDJZHNGSA-N 0.000 description 1
- KQFRUSHJPKXBMB-BHDSKKPTSA-N Ala-Ala-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](C)NC(=O)[C@@H](N)C)C(O)=O)=CNC2=C1 KQFRUSHJPKXBMB-BHDSKKPTSA-N 0.000 description 1
- UGLPMYSCWHTZQU-AUTRQRHGSA-N Ala-Ala-Tyr Chemical compound C[C@H]([NH3+])C(=O)N[C@@H](C)C(=O)N[C@H](C([O-])=O)CC1=CC=C(O)C=C1 UGLPMYSCWHTZQU-AUTRQRHGSA-N 0.000 description 1
- DWINFPQUSSHSFS-UVBJJODRSA-N Ala-Arg-Trp Chemical compound N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CNC2=CC=CC=C12)C(=O)O DWINFPQUSSHSFS-UVBJJODRSA-N 0.000 description 1
- YBPLKDWJFYCZSV-ZLUOBGJFSA-N Ala-Asn-Cys Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CS)C(=O)O)N YBPLKDWJFYCZSV-ZLUOBGJFSA-N 0.000 description 1
- FXKNPWNXPQZLES-ZLUOBGJFSA-N Ala-Asn-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O FXKNPWNXPQZLES-ZLUOBGJFSA-N 0.000 description 1
- LSLIRHLIUDVNBN-CIUDSAMLSA-N Ala-Asp-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN LSLIRHLIUDVNBN-CIUDSAMLSA-N 0.000 description 1
- FUSPCLTUKXQREV-ACZMJKKPSA-N Ala-Glu-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O FUSPCLTUKXQREV-ACZMJKKPSA-N 0.000 description 1
- MPLOSMWGDNJSEV-WHFBIAKZSA-N Ala-Gly-Asp Chemical compound [H]N[C@@H](C)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O MPLOSMWGDNJSEV-WHFBIAKZSA-N 0.000 description 1
- OKEWAFFWMHBGPT-XPUUQOCRSA-N Ala-His-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CN=CN1 OKEWAFFWMHBGPT-XPUUQOCRSA-N 0.000 description 1
- NYDBKUNVSALYPX-NAKRPEOUSA-N Ala-Ile-Arg Chemical compound C[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CCCN=C(N)N NYDBKUNVSALYPX-NAKRPEOUSA-N 0.000 description 1
- TZDNWXDLYFIFPT-BJDJZHNGSA-N Ala-Ile-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O TZDNWXDLYFIFPT-BJDJZHNGSA-N 0.000 description 1
- MEFILNJXAVSUTO-JXUBOQSCSA-N Ala-Leu-Thr Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MEFILNJXAVSUTO-JXUBOQSCSA-N 0.000 description 1
- VCSABYLVNWQYQE-SRVKXCTJSA-N Ala-Lys-Lys Chemical compound NCCCC[C@H](NC(=O)[C@@H](N)C)C(=O)N[C@@H](CCCCN)C(O)=O VCSABYLVNWQYQE-SRVKXCTJSA-N 0.000 description 1
- VCSABYLVNWQYQE-UHFFFAOYSA-N Ala-Lys-Lys Natural products NCCCCC(NC(=O)C(N)C)C(=O)NC(CCCCN)C(O)=O VCSABYLVNWQYQE-UHFFFAOYSA-N 0.000 description 1
- 108010011667 Ala-Phe-Ala Proteins 0.000 description 1
- XRUJOVRWNMBAAA-NHCYSSNCSA-N Ala-Phe-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=CC=C1 XRUJOVRWNMBAAA-NHCYSSNCSA-N 0.000 description 1
- WEZNQZHACPSMEF-QEJZJMRPSA-N Ala-Phe-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=CC=C1 WEZNQZHACPSMEF-QEJZJMRPSA-N 0.000 description 1
- IORKCNUBHNIMKY-CIUDSAMLSA-N Ala-Pro-Glu Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O IORKCNUBHNIMKY-CIUDSAMLSA-N 0.000 description 1
- WQLDNOCHHRISMS-NAKRPEOUSA-N Ala-Pro-Ile Chemical compound [H]N[C@@H](C)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WQLDNOCHHRISMS-NAKRPEOUSA-N 0.000 description 1
- BTRULDJUUVGRNE-DCAQKATOSA-N Ala-Pro-Lys Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(O)=O BTRULDJUUVGRNE-DCAQKATOSA-N 0.000 description 1
- OLVCTPPSXNRGKV-GUBZILKMSA-N Ala-Pro-Pro Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 OLVCTPPSXNRGKV-GUBZILKMSA-N 0.000 description 1
- JNLDTVRGXMSYJC-UVBJJODRSA-N Ala-Pro-Trp Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](Cc1c[nH]c2ccccc12)C(O)=O JNLDTVRGXMSYJC-UVBJJODRSA-N 0.000 description 1
- YHBDGLZYNIARKJ-GUBZILKMSA-N Ala-Pro-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](C)N YHBDGLZYNIARKJ-GUBZILKMSA-N 0.000 description 1
- DYXOFPBJBAHWFY-JBDRJPRFSA-N Ala-Ser-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@H](C)N DYXOFPBJBAHWFY-JBDRJPRFSA-N 0.000 description 1
- HOVPGJUNRLMIOZ-CIUDSAMLSA-N Ala-Ser-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@H](C)N HOVPGJUNRLMIOZ-CIUDSAMLSA-N 0.000 description 1
- OEVCHROQUIVQFZ-YTLHQDLWSA-N Ala-Thr-Ala Chemical compound C[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](C)C(O)=O OEVCHROQUIVQFZ-YTLHQDLWSA-N 0.000 description 1
- XMIAMUXIMWREBJ-HERUPUMHSA-N Ala-Trp-Asn Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CC(=O)N)C(=O)O)N XMIAMUXIMWREBJ-HERUPUMHSA-N 0.000 description 1
- UBTKNYUAMYRMKE-GOPGUHFVSA-N Ala-Trp-His Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CC3=CN=CN3)C(=O)O)N UBTKNYUAMYRMKE-GOPGUHFVSA-N 0.000 description 1
- XPBVBZPVNFIHOA-UVBJJODRSA-N Ala-Trp-Val Chemical compound C1=CC=C2C(C[C@@H](C(=O)N[C@@H](C(C)C)C(O)=O)NC(=O)[C@H](C)N)=CNC2=C1 XPBVBZPVNFIHOA-UVBJJODRSA-N 0.000 description 1
- AENHOIXXHKNIQL-AUTRQRHGSA-N Ala-Tyr-Ala Chemical compound [O-]C(=O)[C@H](C)NC(=O)[C@@H](NC(=O)[C@@H]([NH3+])C)CC1=CC=C(O)C=C1 AENHOIXXHKNIQL-AUTRQRHGSA-N 0.000 description 1
- 208000007860 Anus Neoplasms Diseases 0.000 description 1
- GDVDRMUYICMNFJ-CIUDSAMLSA-N Arg-Cys-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCC(O)=O)C(O)=O GDVDRMUYICMNFJ-CIUDSAMLSA-N 0.000 description 1
- FLYANDHDFRGGTM-PYJNHQTQSA-N Arg-Ile-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N FLYANDHDFRGGTM-PYJNHQTQSA-N 0.000 description 1
- JEOCWTUOMKEEMF-RHYQMDGZSA-N Arg-Leu-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O JEOCWTUOMKEEMF-RHYQMDGZSA-N 0.000 description 1
- CLICCYPMVFGUOF-IHRRRGAJSA-N Arg-Lys-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O CLICCYPMVFGUOF-IHRRRGAJSA-N 0.000 description 1
- UULLJGQFCDXVTQ-CYDGBPFRSA-N Arg-Pro-Ile Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(O)=O UULLJGQFCDXVTQ-CYDGBPFRSA-N 0.000 description 1
- XRNXPIGJPQHCPC-RCWTZXSCSA-N Arg-Thr-Val Chemical compound CC(C)[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)CCCNC(N)=N)[C@@H](C)O)C(O)=O XRNXPIGJPQHCPC-RCWTZXSCSA-N 0.000 description 1
- XOZYYXMHMIEJET-XIRDDKMYSA-N Arg-Trp-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCC(O)=O)C(O)=O XOZYYXMHMIEJET-XIRDDKMYSA-N 0.000 description 1
- PYDIIVKGTBRIEL-SZMVWBNQSA-N Arg-Trp-Pro Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N1CCC[C@H]1C(O)=O PYDIIVKGTBRIEL-SZMVWBNQSA-N 0.000 description 1
- QCTOLCVIGRLMQS-HRCADAONSA-N Arg-Tyr-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=C(C=C2)O)NC(=O)[C@H](CCCN=C(N)N)N)C(=O)O QCTOLCVIGRLMQS-HRCADAONSA-N 0.000 description 1
- CNBIWSCSSCAINS-UFYCRDLUSA-N Arg-Tyr-Tyr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O CNBIWSCSSCAINS-UFYCRDLUSA-N 0.000 description 1
- QQEWINYJRFBLNN-DLOVCJGASA-N Asn-Ala-Phe Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 QQEWINYJRFBLNN-DLOVCJGASA-N 0.000 description 1
- BHQQRVARKXWXPP-ACZMJKKPSA-N Asn-Asp-Glu Chemical compound C(CC(=O)O)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC(=O)N)N BHQQRVARKXWXPP-ACZMJKKPSA-N 0.000 description 1
- UGXVKHRDGLYFKR-CIUDSAMLSA-N Asn-Asp-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC(N)=O UGXVKHRDGLYFKR-CIUDSAMLSA-N 0.000 description 1
- QISZHYWZHJRDAO-CIUDSAMLSA-N Asn-Asp-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC(=O)N)N QISZHYWZHJRDAO-CIUDSAMLSA-N 0.000 description 1
- KGCUOPPQTPZILL-CIUDSAMLSA-N Asn-Cys-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CC(=O)N)N KGCUOPPQTPZILL-CIUDSAMLSA-N 0.000 description 1
- XVAPVJNJGLWGCS-ACZMJKKPSA-N Asn-Glu-Asn Chemical compound C(CC(=O)O)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N XVAPVJNJGLWGCS-ACZMJKKPSA-N 0.000 description 1
- FVKHEKVYFTZWDX-GHCJXIJMSA-N Asn-Ile-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC(=O)N)N FVKHEKVYFTZWDX-GHCJXIJMSA-N 0.000 description 1
- ALHMNHZJBYBYHS-DCAQKATOSA-N Asn-Lys-Arg Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O ALHMNHZJBYBYHS-DCAQKATOSA-N 0.000 description 1
- BYLSYQASFJJBCL-DCAQKATOSA-N Asn-Pro-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O BYLSYQASFJJBCL-DCAQKATOSA-N 0.000 description 1
- XTMZYFMTYJNABC-ZLUOBGJFSA-N Asn-Ser-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CC(=O)N)N XTMZYFMTYJNABC-ZLUOBGJFSA-N 0.000 description 1
- MYTHOBCLNIOFBL-SRVKXCTJSA-N Asn-Ser-Tyr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O MYTHOBCLNIOFBL-SRVKXCTJSA-N 0.000 description 1
- NCXTYSVDWLAQGZ-ZKWXMUAHSA-N Asn-Ser-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC(N)=O NCXTYSVDWLAQGZ-ZKWXMUAHSA-N 0.000 description 1
- UPAGTDJAORYMEC-VHWLVUOQSA-N Asn-Trp-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@H](CC(=O)N)N UPAGTDJAORYMEC-VHWLVUOQSA-N 0.000 description 1
- YQPSDMUGFKJZHR-QRTARXTBSA-N Asn-Trp-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@H](CC(=O)N)N YQPSDMUGFKJZHR-QRTARXTBSA-N 0.000 description 1
- JNCRAQVYJZGIOW-QSFUFRPTSA-N Asn-Val-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JNCRAQVYJZGIOW-QSFUFRPTSA-N 0.000 description 1
- WQAOZCVOOYUWKG-LSJOCFKGSA-N Asn-Val-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)O)NC(=O)[C@H](CC(=O)N)N WQAOZCVOOYUWKG-LSJOCFKGSA-N 0.000 description 1
- ZELQAFZSJOBEQS-ACZMJKKPSA-N Asp-Asn-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O ZELQAFZSJOBEQS-ACZMJKKPSA-N 0.000 description 1
- FANQWNCPNFEPGZ-WHFBIAKZSA-N Asp-Asp-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O FANQWNCPNFEPGZ-WHFBIAKZSA-N 0.000 description 1
- BKXPJCBEHWFSTF-ACZMJKKPSA-N Asp-Gln-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O BKXPJCBEHWFSTF-ACZMJKKPSA-N 0.000 description 1
- VIRHEUMYXXLCBF-WDSKDSINSA-N Asp-Gly-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O VIRHEUMYXXLCBF-WDSKDSINSA-N 0.000 description 1
- KYQNAIMCTRZLNP-QSFUFRPTSA-N Asp-Ile-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O KYQNAIMCTRZLNP-QSFUFRPTSA-N 0.000 description 1
- PAYPSKIBMDHZPI-CIUDSAMLSA-N Asp-Leu-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O PAYPSKIBMDHZPI-CIUDSAMLSA-N 0.000 description 1
- AITKTFCQOBRJTG-CIUDSAMLSA-N Asp-Leu-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC(=O)O)N AITKTFCQOBRJTG-CIUDSAMLSA-N 0.000 description 1
- XLILXFRAKOYEJX-GUBZILKMSA-N Asp-Leu-Gln Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O XLILXFRAKOYEJX-GUBZILKMSA-N 0.000 description 1
- OEDJQRXNDRUGEU-SRVKXCTJSA-N Asp-Leu-His Chemical compound N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)O OEDJQRXNDRUGEU-SRVKXCTJSA-N 0.000 description 1
- UJGRZQYSNYTCAX-SRVKXCTJSA-N Asp-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC(O)=O UJGRZQYSNYTCAX-SRVKXCTJSA-N 0.000 description 1
- RVMXMLSYBTXCAV-VEVYYDQMSA-N Asp-Pro-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O RVMXMLSYBTXCAV-VEVYYDQMSA-N 0.000 description 1
- MGSVBZIBCCKGCY-ZLUOBGJFSA-N Asp-Ser-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O MGSVBZIBCCKGCY-ZLUOBGJFSA-N 0.000 description 1
- YIDFBWRHIYOYAA-LKXGYXEUSA-N Asp-Ser-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O YIDFBWRHIYOYAA-LKXGYXEUSA-N 0.000 description 1
- JSNWZMFSLIWAHS-HJGDQZAQSA-N Asp-Thr-Leu Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)O)NC(=O)[C@H](CC(=O)O)N)O JSNWZMFSLIWAHS-HJGDQZAQSA-N 0.000 description 1
- KKZHXOOZHFABQQ-UWJYBYFXSA-N Cys-Ala-Tyr Chemical compound SC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 KKZHXOOZHFABQQ-UWJYBYFXSA-N 0.000 description 1
- OCEHKDFAWQIBHH-FXQIFTODSA-N Cys-Arg-Cys Chemical compound C(C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CS)N)CN=C(N)N OCEHKDFAWQIBHH-FXQIFTODSA-N 0.000 description 1
- YMBAVNPKBWHDAW-CIUDSAMLSA-N Cys-Asp-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CS)N YMBAVNPKBWHDAW-CIUDSAMLSA-N 0.000 description 1
- LWTTURISBKEVAC-CIUDSAMLSA-N Cys-Cys-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CS)N LWTTURISBKEVAC-CIUDSAMLSA-N 0.000 description 1
- HAYVLBZZBDCKRA-SRVKXCTJSA-N Cys-His-Lys Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CS)N HAYVLBZZBDCKRA-SRVKXCTJSA-N 0.000 description 1
- DVIHGGUODLILFN-GHCJXIJMSA-N Cys-Ile-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CS)N DVIHGGUODLILFN-GHCJXIJMSA-N 0.000 description 1
- WVLZTXGTNGHPBO-SRVKXCTJSA-N Cys-Leu-Leu Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O WVLZTXGTNGHPBO-SRVKXCTJSA-N 0.000 description 1
- UCSXXFRXHGUXCQ-SRVKXCTJSA-N Cys-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CS)N UCSXXFRXHGUXCQ-SRVKXCTJSA-N 0.000 description 1
- IDFVDSBJNMPBSX-SRVKXCTJSA-N Cys-Lys-Leu Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O IDFVDSBJNMPBSX-SRVKXCTJSA-N 0.000 description 1
- ZHCCYSDALWJITB-SRVKXCTJSA-N Cys-Phe-Cys Chemical compound N[C@@H](CS)C(=O)N[C@@H](Cc1ccccc1)C(=O)N[C@@H](CS)C(O)=O ZHCCYSDALWJITB-SRVKXCTJSA-N 0.000 description 1
- IDZDFWJNPOOOHE-KKUMJFAQSA-N Cys-Phe-Lys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CS)N IDZDFWJNPOOOHE-KKUMJFAQSA-N 0.000 description 1
- SMEYEQDCCBHTEF-FXQIFTODSA-N Cys-Pro-Ala Chemical compound [H]N[C@@H](CS)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O SMEYEQDCCBHTEF-FXQIFTODSA-N 0.000 description 1
- MBRWOKXNHTUJMB-CIUDSAMLSA-N Cys-Pro-Glu Chemical compound [H]N[C@@H](CS)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O MBRWOKXNHTUJMB-CIUDSAMLSA-N 0.000 description 1
- NDNZRWUDUMTITL-FXQIFTODSA-N Cys-Ser-Val Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O NDNZRWUDUMTITL-FXQIFTODSA-N 0.000 description 1
- 241000702421 Dependoparvovirus Species 0.000 description 1
- 108700024394 Exon Proteins 0.000 description 1
- 241000287828 Gallus gallus Species 0.000 description 1
- YJIUYQKQBBQYHZ-ACZMJKKPSA-N Gln-Ala-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O YJIUYQKQBBQYHZ-ACZMJKKPSA-N 0.000 description 1
- LKUWAWGNJYJODH-KBIXCLLPSA-N Gln-Ala-Ile Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O LKUWAWGNJYJODH-KBIXCLLPSA-N 0.000 description 1
- RGRMOYQUIJVQQD-SRVKXCTJSA-N Gln-Arg-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CCC(=O)N)N RGRMOYQUIJVQQD-SRVKXCTJSA-N 0.000 description 1
- IKDOHQHEFPPGJG-FXQIFTODSA-N Gln-Asp-Glu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O IKDOHQHEFPPGJG-FXQIFTODSA-N 0.000 description 1
- KZEUVLLVULIPNX-GUBZILKMSA-N Gln-Asp-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCC(=O)N)N KZEUVLLVULIPNX-GUBZILKMSA-N 0.000 description 1
- FJAYYNIXQNERSO-ACZMJKKPSA-N Gln-Cys-Asp Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(=O)O)C(=O)O)N FJAYYNIXQNERSO-ACZMJKKPSA-N 0.000 description 1
- GNDJOCGXGLNCKY-ACZMJKKPSA-N Gln-Cys-Cys Chemical compound N[C@@H](CCC(N)=O)C(=O)N[C@@H](CS)C(=O)N[C@@H](CS)C(O)=O GNDJOCGXGLNCKY-ACZMJKKPSA-N 0.000 description 1
- COYGBRTZEVWZBW-XKBZYTNZSA-N Gln-Cys-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CS)NC(=O)[C@@H](N)CCC(N)=O COYGBRTZEVWZBW-XKBZYTNZSA-N 0.000 description 1
- LOJYQMFIIJVETK-WDSKDSINSA-N Gln-Gln Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(O)=O LOJYQMFIIJVETK-WDSKDSINSA-N 0.000 description 1
- LWDGZZGWDMHBOF-FXQIFTODSA-N Gln-Glu-Asn Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O LWDGZZGWDMHBOF-FXQIFTODSA-N 0.000 description 1
- KQOPMGBHNQBCEL-HVTMNAMFSA-N Gln-His-Ile Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KQOPMGBHNQBCEL-HVTMNAMFSA-N 0.000 description 1
- PSERKXGRRADTKA-MNXVOIDGSA-N Gln-Leu-Ile Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O PSERKXGRRADTKA-MNXVOIDGSA-N 0.000 description 1
- MLSKFHLRFVGNLL-WDCWCFNPSA-N Gln-Leu-Thr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MLSKFHLRFVGNLL-WDCWCFNPSA-N 0.000 description 1
- XZUUUKNKNWVPHQ-JYJNAYRXSA-N Gln-Phe-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O XZUUUKNKNWVPHQ-JYJNAYRXSA-N 0.000 description 1
- JUUNNOLZGVYCJT-JYJNAYRXSA-N Gln-Phe-Lys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)N)N JUUNNOLZGVYCJT-JYJNAYRXSA-N 0.000 description 1
- LGWNISYVKDNJRP-FXQIFTODSA-N Gln-Ser-Gln Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(O)=O LGWNISYVKDNJRP-FXQIFTODSA-N 0.000 description 1
- HGBHRZBXOOHRDH-JBACZVJFSA-N Gln-Tyr-Trp Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O HGBHRZBXOOHRDH-JBACZVJFSA-N 0.000 description 1
- VDMABHYXBULDGN-LAEOZQHASA-N Gln-Val-Asp Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O VDMABHYXBULDGN-LAEOZQHASA-N 0.000 description 1
- JZDHUJAFXGNDSB-WHFBIAKZSA-N Glu-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(O)=O JZDHUJAFXGNDSB-WHFBIAKZSA-N 0.000 description 1
- LKDIBBOKUAASNP-FXQIFTODSA-N Glu-Ala-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O LKDIBBOKUAASNP-FXQIFTODSA-N 0.000 description 1
- YYOBUPFZLKQUAX-FXQIFTODSA-N Glu-Asn-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O YYOBUPFZLKQUAX-FXQIFTODSA-N 0.000 description 1
- RDPOETHPAQEGDP-ACZMJKKPSA-N Glu-Asp-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O RDPOETHPAQEGDP-ACZMJKKPSA-N 0.000 description 1
- JVSBYEDSSRZQGV-GUBZILKMSA-N Glu-Asp-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CCC(O)=O JVSBYEDSSRZQGV-GUBZILKMSA-N 0.000 description 1
- ISXJHXGYMJKXOI-GUBZILKMSA-N Glu-Cys-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CS)NC(=O)[C@@H](N)CCC(O)=O ISXJHXGYMJKXOI-GUBZILKMSA-N 0.000 description 1
- CGOHAEBMDSEKFB-FXQIFTODSA-N Glu-Glu-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O CGOHAEBMDSEKFB-FXQIFTODSA-N 0.000 description 1
- NKLRYVLERDYDBI-FXQIFTODSA-N Glu-Glu-Asp Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O NKLRYVLERDYDBI-FXQIFTODSA-N 0.000 description 1
- KRRFFAHEAOCBCQ-SIUGBPQLSA-N Glu-Ile-Tyr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O KRRFFAHEAOCBCQ-SIUGBPQLSA-N 0.000 description 1
- DNPCBMNFQVTHMA-DCAQKATOSA-N Glu-Leu-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O DNPCBMNFQVTHMA-DCAQKATOSA-N 0.000 description 1
- ATVYZJGOZLVXDK-IUCAKERBSA-N Glu-Leu-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O ATVYZJGOZLVXDK-IUCAKERBSA-N 0.000 description 1
- ILWHFUZZCFYSKT-AVGNSLFASA-N Glu-Lys-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O ILWHFUZZCFYSKT-AVGNSLFASA-N 0.000 description 1
- HOIPREWORBVRLD-XIRDDKMYSA-N Glu-Met-Trp Chemical compound CSCC[C@H](NC(=O)[C@@H](N)CCC(O)=O)C(=O)N[C@@H](Cc1c[nH]c2ccccc12)C(O)=O HOIPREWORBVRLD-XIRDDKMYSA-N 0.000 description 1
- CBWKURKPYSLMJV-SOUVJXGZSA-N Glu-Phe-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)NC(=O)[C@H](CCC(=O)O)N)C(=O)O CBWKURKPYSLMJV-SOUVJXGZSA-N 0.000 description 1
- AAJHGGDRKHYSDH-GUBZILKMSA-N Glu-Pro-Gln Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CCC(=O)O)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O AAJHGGDRKHYSDH-GUBZILKMSA-N 0.000 description 1
- BPLNJYHNAJVLRT-ACZMJKKPSA-N Glu-Ser-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O BPLNJYHNAJVLRT-ACZMJKKPSA-N 0.000 description 1
- WIKMTDVSCUJIPJ-CIUDSAMLSA-N Glu-Ser-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCN=C(N)N WIKMTDVSCUJIPJ-CIUDSAMLSA-N 0.000 description 1
- IDEODOAVGCMUQV-GUBZILKMSA-N Glu-Ser-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O IDEODOAVGCMUQV-GUBZILKMSA-N 0.000 description 1
- BDISFWMLMNBTGP-NUMRIWBASA-N Glu-Thr-Asp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(O)=O BDISFWMLMNBTGP-NUMRIWBASA-N 0.000 description 1
- QCMVGXDELYMZET-GLLZPBPUSA-N Glu-Thr-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O QCMVGXDELYMZET-GLLZPBPUSA-N 0.000 description 1
- XRTDOIOIBMAXCT-NKWVEPMBSA-N Gly-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)CN)C(=O)O XRTDOIOIBMAXCT-NKWVEPMBSA-N 0.000 description 1
- XCLCVBYNGXEVDU-WHFBIAKZSA-N Gly-Asn-Ser Chemical compound NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O XCLCVBYNGXEVDU-WHFBIAKZSA-N 0.000 description 1
- LGQZOQRDEUIZJY-YUMQZZPRSA-N Gly-Cys-Lys Chemical compound NCCCC[C@H](NC(=O)[C@H](CS)NC(=O)CN)C(O)=O LGQZOQRDEUIZJY-YUMQZZPRSA-N 0.000 description 1
- VLIJYPMATZSOLL-YUMQZZPRSA-N Gly-Lys-Cys Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)CN VLIJYPMATZSOLL-YUMQZZPRSA-N 0.000 description 1
- WDEHMRNSGHVNOH-VHSXEESVSA-N Gly-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)CN)C(=O)O WDEHMRNSGHVNOH-VHSXEESVSA-N 0.000 description 1
- IBYOLNARKHMLBG-WHOFXGATSA-N Gly-Phe-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CC=CC=C1 IBYOLNARKHMLBG-WHOFXGATSA-N 0.000 description 1
- IRJWAYCXIYUHQE-WHFBIAKZSA-N Gly-Ser-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CO)NC(=O)CN IRJWAYCXIYUHQE-WHFBIAKZSA-N 0.000 description 1
- SOEGEPHNZOISMT-BYPYZUCNSA-N Gly-Ser-Gly Chemical compound NCC(=O)N[C@@H](CO)C(=O)NCC(O)=O SOEGEPHNZOISMT-BYPYZUCNSA-N 0.000 description 1
- WNGHUXFWEWTKAO-YUMQZZPRSA-N Gly-Ser-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)CN WNGHUXFWEWTKAO-YUMQZZPRSA-N 0.000 description 1
- RZEDHGORCKRINR-STQMWFEESA-N Gly-Trp-Cys Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)CN RZEDHGORCKRINR-STQMWFEESA-N 0.000 description 1
- UVTSZKIATYSKIR-RYUDHWBXSA-N Gly-Tyr-Glu Chemical compound [H]NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O UVTSZKIATYSKIR-RYUDHWBXSA-N 0.000 description 1
- BIAKMWKJMQLZOJ-ZKWXMUAHSA-N His-Ala-Ala Chemical compound C[C@H](NC(=O)[C@H](C)NC(=O)[C@@H](N)Cc1cnc[nH]1)C(O)=O BIAKMWKJMQLZOJ-ZKWXMUAHSA-N 0.000 description 1
- WYWBYSPRCFADBM-GARJFASQSA-N His-Cys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CS)NC(=O)[C@H](CC2=CN=CN2)N)C(=O)O WYWBYSPRCFADBM-GARJFASQSA-N 0.000 description 1
- WGHJXSONOOTTCZ-JYJNAYRXSA-N His-Glu-Tyr Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O WGHJXSONOOTTCZ-JYJNAYRXSA-N 0.000 description 1
- CCUSLCQWVMWTIS-IXOXFDKPSA-N His-Thr-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O CCUSLCQWVMWTIS-IXOXFDKPSA-N 0.000 description 1
- PDLQNLSEJXOQNQ-IHPCNDPISA-N His-Trp-Lys Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(=O)N[C@@H](CCCCN)C(O)=O)C1=CN=CN1 PDLQNLSEJXOQNQ-IHPCNDPISA-N 0.000 description 1
- YKUAGFAXQRYUQW-KKUMJFAQSA-N His-Tyr-Cys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC2=CN=CN2)N)O YKUAGFAXQRYUQW-KKUMJFAQSA-N 0.000 description 1
- FFYYUUWROYYKFY-IHRRRGAJSA-N His-Val-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O FFYYUUWROYYKFY-IHRRRGAJSA-N 0.000 description 1
- 241000598171 Human adenovirus sp. Species 0.000 description 1
- 108700039609 IRW peptide Proteins 0.000 description 1
- AQCUAZTZSPQJFF-ZKWXMUAHSA-N Ile-Ala-Gly Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O AQCUAZTZSPQJFF-ZKWXMUAHSA-N 0.000 description 1
- DXUJSRIVSWEOAG-NAKRPEOUSA-N Ile-Arg-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CS)C(=O)O)N DXUJSRIVSWEOAG-NAKRPEOUSA-N 0.000 description 1
- ASCFJMSGKUIRDU-ZPFDUUQYSA-N Ile-Arg-Gln Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(O)=O ASCFJMSGKUIRDU-ZPFDUUQYSA-N 0.000 description 1
- FVEWRQXNISSYFO-ZPFDUUQYSA-N Ile-Arg-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N FVEWRQXNISSYFO-ZPFDUUQYSA-N 0.000 description 1
- DMHGKBGOUAJRHU-RVMXOQNASA-N Ile-Arg-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N1CCC[C@@H]1C(=O)O)N DMHGKBGOUAJRHU-RVMXOQNASA-N 0.000 description 1
- DMHGKBGOUAJRHU-UHFFFAOYSA-N Ile-Arg-Pro Natural products CCC(C)C(N)C(=O)NC(CCCN=C(N)N)C(=O)N1CCCC1C(O)=O DMHGKBGOUAJRHU-UHFFFAOYSA-N 0.000 description 1
- RSDHVTMRXSABSV-GHCJXIJMSA-N Ile-Asn-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CS)C(=O)O)N RSDHVTMRXSABSV-GHCJXIJMSA-N 0.000 description 1
- CTHAJJYOHOBUDY-GHCJXIJMSA-N Ile-Cys-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(=O)O)C(=O)O)N CTHAJJYOHOBUDY-GHCJXIJMSA-N 0.000 description 1
- LDRALPZEVHVXEK-KBIXCLLPSA-N Ile-Cys-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N LDRALPZEVHVXEK-KBIXCLLPSA-N 0.000 description 1
- SYVMEYAPXRRXAN-MXAVVETBSA-N Ile-Cys-Phe Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N SYVMEYAPXRRXAN-MXAVVETBSA-N 0.000 description 1
- WUKLZPHVWAMZQV-UKJIMTQDSA-N Ile-Glu-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](C(C)C)C(=O)O)N WUKLZPHVWAMZQV-UKJIMTQDSA-N 0.000 description 1
- LBRCLQMZAHRTLV-ZKWXMUAHSA-N Ile-Gly-Ser Chemical compound CC[C@H](C)[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O LBRCLQMZAHRTLV-ZKWXMUAHSA-N 0.000 description 1
- PWUMCBLVWPCKNO-MGHWNKPDSA-N Ile-Leu-Tyr Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 PWUMCBLVWPCKNO-MGHWNKPDSA-N 0.000 description 1
- XHBYEMIUENPZLY-GMOBBJLQSA-N Ile-Pro-Asn Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O XHBYEMIUENPZLY-GMOBBJLQSA-N 0.000 description 1
- NLZVTPYXYXMCIP-XUXIUFHCSA-N Ile-Pro-Lys Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(O)=O NLZVTPYXYXMCIP-XUXIUFHCSA-N 0.000 description 1
- TWVKGYNQQAUNRN-ACZMJKKPSA-N Ile-Ser Chemical compound CC[C@H](C)[C@H]([NH3+])C(=O)N[C@@H](CO)C([O-])=O TWVKGYNQQAUNRN-ACZMJKKPSA-N 0.000 description 1
- ZNOBVZFCHNHKHA-KBIXCLLPSA-N Ile-Ser-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N ZNOBVZFCHNHKHA-KBIXCLLPSA-N 0.000 description 1
- RWHRUZORDWZESH-ZQINRCPSSA-N Ile-Trp-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N RWHRUZORDWZESH-ZQINRCPSSA-N 0.000 description 1
- RQZFWBLDTBDEOF-RNJOBUHISA-N Ile-Val-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N RQZFWBLDTBDEOF-RNJOBUHISA-N 0.000 description 1
- PMGDADKJMCOXHX-UHFFFAOYSA-N L-Arginyl-L-glutamin-acetat Natural products NC(=N)NCCCC(N)C(=O)NC(CCC(N)=O)C(O)=O PMGDADKJMCOXHX-UHFFFAOYSA-N 0.000 description 1
- QNAYBMKLOCPYGJ-REOHCLBHSA-N L-alanine Chemical compound C[C@H](N)C(O)=O QNAYBMKLOCPYGJ-REOHCLBHSA-N 0.000 description 1
- SITWEMZOJNKJCH-UHFFFAOYSA-N L-alanine-L-arginine Natural products CC(N)C(=O)NC(C(O)=O)CCCNC(N)=N SITWEMZOJNKJCH-UHFFFAOYSA-N 0.000 description 1
- SENJXOPIZNYLHU-UHFFFAOYSA-N L-leucyl-L-arginine Natural products CC(C)CC(N)C(=O)NC(C(O)=O)CCCN=C(N)N SENJXOPIZNYLHU-UHFFFAOYSA-N 0.000 description 1
- TYYLDKGBCJGJGW-UHFFFAOYSA-N L-tryptophan-L-tyrosine Natural products C=1NC2=CC=CC=C2C=1CC(N)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 TYYLDKGBCJGJGW-UHFFFAOYSA-N 0.000 description 1
- 241000713666 Lentivirus Species 0.000 description 1
- FJUKMPUELVROGK-IHRRRGAJSA-N Leu-Arg-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N FJUKMPUELVROGK-IHRRRGAJSA-N 0.000 description 1
- BAJIJEGGUYXZGC-CIUDSAMLSA-N Leu-Asn-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CS)C(=O)O)N BAJIJEGGUYXZGC-CIUDSAMLSA-N 0.000 description 1
- WGNOPSQMIQERPK-GARJFASQSA-N Leu-Asn-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N1CCC[C@@H]1C(=O)O)N WGNOPSQMIQERPK-GARJFASQSA-N 0.000 description 1
- JQSXWJXBASFONF-KKUMJFAQSA-N Leu-Asp-Phe Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O JQSXWJXBASFONF-KKUMJFAQSA-N 0.000 description 1
- MMEDVBWCMGRKKC-GARJFASQSA-N Leu-Asp-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N MMEDVBWCMGRKKC-GARJFASQSA-N 0.000 description 1
- IASQBRJGRVXNJI-YUMQZZPRSA-N Leu-Cys-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CS)C(=O)NCC(O)=O IASQBRJGRVXNJI-YUMQZZPRSA-N 0.000 description 1
- VPKIQULSKFVCSM-SRVKXCTJSA-N Leu-Gln-Arg Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O VPKIQULSKFVCSM-SRVKXCTJSA-N 0.000 description 1
- GLBNEGIOFRVRHO-JYJNAYRXSA-N Leu-Gln-Phe Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O GLBNEGIOFRVRHO-JYJNAYRXSA-N 0.000 description 1
- CQGSYZCULZMEDE-SRVKXCTJSA-N Leu-Gln-Pro Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N1CCC[C@H]1C(O)=O CQGSYZCULZMEDE-SRVKXCTJSA-N 0.000 description 1
- LAPSXOAUPNOINL-YUMQZZPRSA-N Leu-Gly-Asp Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC(O)=O LAPSXOAUPNOINL-YUMQZZPRSA-N 0.000 description 1
- CCQLQKZTXZBXTN-NHCYSSNCSA-N Leu-Gly-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)NCC(=O)N[C@@H]([C@@H](C)CC)C(O)=O CCQLQKZTXZBXTN-NHCYSSNCSA-N 0.000 description 1
- HYIFFZAQXPUEAU-QWRGUYRKSA-N Leu-Gly-Leu Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC(C)C HYIFFZAQXPUEAU-QWRGUYRKSA-N 0.000 description 1
- DDEMUMVXNFPDKC-SRVKXCTJSA-N Leu-His-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CS)C(=O)O)N DDEMUMVXNFPDKC-SRVKXCTJSA-N 0.000 description 1
- IAJFFZORSWOZPQ-SRVKXCTJSA-N Leu-Leu-Asn Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O IAJFFZORSWOZPQ-SRVKXCTJSA-N 0.000 description 1
- FOBUGKUBUJOWAD-IHPCNDPISA-N Leu-Leu-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC(C)C)C(O)=O)=CNC2=C1 FOBUGKUBUJOWAD-IHPCNDPISA-N 0.000 description 1
- DCGXHWINSHEPIR-SRVKXCTJSA-N Leu-Lys-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CS)C(=O)O)N DCGXHWINSHEPIR-SRVKXCTJSA-N 0.000 description 1
- QNTJIDXQHWUBKC-BZSNNMDCSA-N Leu-Lys-Phe Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O QNTJIDXQHWUBKC-BZSNNMDCSA-N 0.000 description 1
- INCJJHQRZGQLFC-KBPBESRZSA-N Leu-Phe-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)NCC(O)=O INCJJHQRZGQLFC-KBPBESRZSA-N 0.000 description 1
- IDGZVZJLYFTXSL-DCAQKATOSA-N Leu-Ser-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCN=C(N)N IDGZVZJLYFTXSL-DCAQKATOSA-N 0.000 description 1
- AMSSKPUHBUQBOQ-SRVKXCTJSA-N Leu-Ser-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)O)N AMSSKPUHBUQBOQ-SRVKXCTJSA-N 0.000 description 1
- PPGBXYKMUMHFBF-KATARQTJSA-N Leu-Ser-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PPGBXYKMUMHFBF-KATARQTJSA-N 0.000 description 1
- LJBVRCDPWOJOEK-PPCPHDFISA-N Leu-Thr-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O LJBVRCDPWOJOEK-PPCPHDFISA-N 0.000 description 1
- WUHBLPVELFTPQK-KKUMJFAQSA-N Leu-Tyr-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(O)=O WUHBLPVELFTPQK-KKUMJFAQSA-N 0.000 description 1
- FMFNIDICDKEMOE-XUXIUFHCSA-N Leu-Val-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O FMFNIDICDKEMOE-XUXIUFHCSA-N 0.000 description 1
- XOEDPXDZJHBQIX-ULQDDVLXSA-N Leu-Val-Phe Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 XOEDPXDZJHBQIX-ULQDDVLXSA-N 0.000 description 1
- 241000186779 Listeria monocytogenes Species 0.000 description 1
- FZIJIFCXUCZHOL-CIUDSAMLSA-N Lys-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCCN FZIJIFCXUCZHOL-CIUDSAMLSA-N 0.000 description 1
- MPOHDJKRBLVGCT-CIUDSAMLSA-N Lys-Ala-Asn Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCCCN)N MPOHDJKRBLVGCT-CIUDSAMLSA-N 0.000 description 1
- BTSXLXFPMZXVPR-DLOVCJGASA-N Lys-Ala-His Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CCCCN)N BTSXLXFPMZXVPR-DLOVCJGASA-N 0.000 description 1
- IRNSXVOWSXSULE-DCAQKATOSA-N Lys-Ala-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCCN IRNSXVOWSXSULE-DCAQKATOSA-N 0.000 description 1
- WALVCOOOKULCQM-ULQDDVLXSA-N Lys-Arg-Phe Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O WALVCOOOKULCQM-ULQDDVLXSA-N 0.000 description 1
- VLMNBMFYRMGEMB-QWRGUYRKSA-N Lys-His-Gly Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CNC=N1 VLMNBMFYRMGEMB-QWRGUYRKSA-N 0.000 description 1
- VUTWYNQUSJWBHO-BZSNNMDCSA-N Lys-Leu-Tyr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O VUTWYNQUSJWBHO-BZSNNMDCSA-N 0.000 description 1
- PLDJDCJLRCYPJB-VOAKCMCISA-N Lys-Lys-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PLDJDCJLRCYPJB-VOAKCMCISA-N 0.000 description 1
- BOJYMMBYBNOOGG-DCAQKATOSA-N Lys-Pro-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O BOJYMMBYBNOOGG-DCAQKATOSA-N 0.000 description 1
- ZOKVLMBYDSIDKG-CSMHCCOUSA-N Lys-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@@H](N)CCCCN ZOKVLMBYDSIDKG-CSMHCCOUSA-N 0.000 description 1
- RPWTZTBIFGENIA-VOAKCMCISA-N Lys-Thr-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O RPWTZTBIFGENIA-VOAKCMCISA-N 0.000 description 1
- XGZDDOKIHSYHTO-SZMVWBNQSA-N Lys-Trp-Glu Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](N)CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O)=CNC2=C1 XGZDDOKIHSYHTO-SZMVWBNQSA-N 0.000 description 1
- ZNAPAUSAUBHENO-IHPCNDPISA-N Lys-Trp-His Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC3=CN=CN3)C(=O)O)NC(=O)[C@H](CCCCN)N ZNAPAUSAUBHENO-IHPCNDPISA-N 0.000 description 1
- RQILLQOQXLZTCK-KBPBESRZSA-N Lys-Tyr-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(O)=O RQILLQOQXLZTCK-KBPBESRZSA-N 0.000 description 1
- TXTZMVNJIRZABH-ULQDDVLXSA-N Lys-Val-Phe Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 TXTZMVNJIRZABH-ULQDDVLXSA-N 0.000 description 1
- RPEPZINUYHUBKG-FXQIFTODSA-N Met-Cys-Ala Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CS)C(=O)N[C@@H](C)C(O)=O RPEPZINUYHUBKG-FXQIFTODSA-N 0.000 description 1
- TWTNGJMBFRTKEX-FXQIFTODSA-N Met-Cys-Cys Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@@H](CS)C(O)=O TWTNGJMBFRTKEX-FXQIFTODSA-N 0.000 description 1
- UROWNMBTQGGTHB-DCAQKATOSA-N Met-Leu-Asp Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O UROWNMBTQGGTHB-DCAQKATOSA-N 0.000 description 1
- ANCPZNHGZUCSSC-ULQDDVLXSA-N Met-Tyr-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CCSC)CC1=CC=C(O)C=C1 ANCPZNHGZUCSSC-ULQDDVLXSA-N 0.000 description 1
- 241001465754 Metazoa Species 0.000 description 1
- 208000003445 Mouth Neoplasms Diseases 0.000 description 1
- AUEJLPRZGVVDNU-UHFFFAOYSA-N N-L-tyrosyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CC1=CC=C(O)C=C1 AUEJLPRZGVVDNU-UHFFFAOYSA-N 0.000 description 1
- AJHCSUXXECOXOY-UHFFFAOYSA-N N-glycyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)CN)C(O)=O)=CNC2=C1 AJHCSUXXECOXOY-UHFFFAOYSA-N 0.000 description 1
- 108010066427 N-valyltryptophan Proteins 0.000 description 1
- 208000002471 Penile Neoplasms Diseases 0.000 description 1
- BJEYSVHMGIJORT-NHCYSSNCSA-N Phe-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=CC=C1 BJEYSVHMGIJORT-NHCYSSNCSA-N 0.000 description 1
- JNRFYJZCMHHGMH-UBHSHLNASA-N Phe-Ala-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=CC=C1 JNRFYJZCMHHGMH-UBHSHLNASA-N 0.000 description 1
- YQNBKXUTWBRQCS-BVSLBCMMSA-N Phe-Arg-Trp Chemical compound C([C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(O)=O)C1=CC=CC=C1 YQNBKXUTWBRQCS-BVSLBCMMSA-N 0.000 description 1
- HTTYNOXBBOWZTB-SRVKXCTJSA-N Phe-Asn-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)N)C(=O)O)N HTTYNOXBBOWZTB-SRVKXCTJSA-N 0.000 description 1
- CSYVXYQDIVCQNU-QWRGUYRKSA-N Phe-Asp-Gly Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O CSYVXYQDIVCQNU-QWRGUYRKSA-N 0.000 description 1
- CUMXHKAOHNWRFQ-BZSNNMDCSA-N Phe-Asp-Tyr Chemical compound C([C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=CC=C1 CUMXHKAOHNWRFQ-BZSNNMDCSA-N 0.000 description 1
- OWCLJDXHHZUNEL-IHRRRGAJSA-N Phe-Cys-Val Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CS)C(=O)N[C@@H](C(C)C)C(O)=O OWCLJDXHHZUNEL-IHRRRGAJSA-N 0.000 description 1
- OYQBFWWQSVIHBN-FHWLQOOXSA-N Phe-Glu-Phe Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O OYQBFWWQSVIHBN-FHWLQOOXSA-N 0.000 description 1
- NHCKESBLOMHIIE-IRXDYDNUSA-N Phe-Gly-Phe Chemical compound C([C@H](N)C(=O)NCC(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 NHCKESBLOMHIIE-IRXDYDNUSA-N 0.000 description 1
- PMKIMKUGCSVFSV-CQDKDKBSSA-N Phe-His-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CC2=CC=CC=C2)N PMKIMKUGCSVFSV-CQDKDKBSSA-N 0.000 description 1
- DBNGDEAQXGFGRA-ACRUOGEOSA-N Phe-Tyr-Lys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)N[C@@H](CCCCN)C(=O)O)N DBNGDEAQXGFGRA-ACRUOGEOSA-N 0.000 description 1
- 208000006399 Premature Obstetric Labor Diseases 0.000 description 1
- 206010036600 Premature labour Diseases 0.000 description 1
- VXCHGLYSIOOZIS-GUBZILKMSA-N Pro-Ala-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1 VXCHGLYSIOOZIS-GUBZILKMSA-N 0.000 description 1
- HFZNNDWPHBRNPV-KZVJFYERSA-N Pro-Ala-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O HFZNNDWPHBRNPV-KZVJFYERSA-N 0.000 description 1
- GRIRJQGZZJVANI-CYDGBPFRSA-N Pro-Arg-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@@H]1CCCN1 GRIRJQGZZJVANI-CYDGBPFRSA-N 0.000 description 1
- MTHRMUXESFIAMS-DCAQKATOSA-N Pro-Asn-Lys Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O MTHRMUXESFIAMS-DCAQKATOSA-N 0.000 description 1
- KQCCDMFIALWGTL-GUBZILKMSA-N Pro-Asn-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H]1CCCN1 KQCCDMFIALWGTL-GUBZILKMSA-N 0.000 description 1
- ZBAGOWGNNAXMOY-IHRRRGAJSA-N Pro-Cys-Phe Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CS)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O ZBAGOWGNNAXMOY-IHRRRGAJSA-N 0.000 description 1
- SHAQGFGGJSLLHE-BQBZGAKWSA-N Pro-Gln Chemical compound NC(=O)CC[C@@H](C([O-])=O)NC(=O)[C@@H]1CCC[NH2+]1 SHAQGFGGJSLLHE-BQBZGAKWSA-N 0.000 description 1
- KIPIKSXPPLABPN-CIUDSAMLSA-N Pro-Glu-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H]1CCCN1 KIPIKSXPPLABPN-CIUDSAMLSA-N 0.000 description 1
- CLNJSLSHKJECME-BQBZGAKWSA-N Pro-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)[C@@H]1CCCN1 CLNJSLSHKJECME-BQBZGAKWSA-N 0.000 description 1
- CDGABSWLRMECHC-IHRRRGAJSA-N Pro-Lys-His Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O CDGABSWLRMECHC-IHRRRGAJSA-N 0.000 description 1
- DWGFLKQSGRUQTI-IHRRRGAJSA-N Pro-Lys-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H]1CCCN1 DWGFLKQSGRUQTI-IHRRRGAJSA-N 0.000 description 1
- MHHQQZIFLWFZGR-DCAQKATOSA-N Pro-Lys-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O MHHQQZIFLWFZGR-DCAQKATOSA-N 0.000 description 1
- WFIVLLFYUZZWOD-RHYQMDGZSA-N Pro-Lys-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O WFIVLLFYUZZWOD-RHYQMDGZSA-N 0.000 description 1
- ZUZINZIJHJFJRN-UBHSHLNASA-N Pro-Phe-Ala Chemical compound C([C@@H](C(=O)N[C@@H](C)C(O)=O)NC(=O)[C@H]1NCCC1)C1=CC=CC=C1 ZUZINZIJHJFJRN-UBHSHLNASA-N 0.000 description 1
- SPLBRAKYXGOFSO-UNQGMJICSA-N Pro-Phe-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@@H]2CCCN2)O SPLBRAKYXGOFSO-UNQGMJICSA-N 0.000 description 1
- HWLKHNDRXWTFTN-GUBZILKMSA-N Pro-Pro-Cys Chemical compound C1C[C@H](NC1)C(=O)N2CCC[C@H]2C(=O)N[C@@H](CS)C(=O)O HWLKHNDRXWTFTN-GUBZILKMSA-N 0.000 description 1
- CGSOWZUPLOKYOR-AVGNSLFASA-N Pro-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 CGSOWZUPLOKYOR-AVGNSLFASA-N 0.000 description 1
- RCYUBVHMVUHEBM-RCWTZXSCSA-N Pro-Pro-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O RCYUBVHMVUHEBM-RCWTZXSCSA-N 0.000 description 1
- GMJDSFYVTAMIBF-FXQIFTODSA-N Pro-Ser-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O GMJDSFYVTAMIBF-FXQIFTODSA-N 0.000 description 1
- SNGZLPOXVRTNMB-LPEHRKFASA-N Pro-Ser-Pro Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CO)C(=O)N2CCC[C@@H]2C(=O)O SNGZLPOXVRTNMB-LPEHRKFASA-N 0.000 description 1
- PRKWBYCXBBSLSK-GUBZILKMSA-N Pro-Ser-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O PRKWBYCXBBSLSK-GUBZILKMSA-N 0.000 description 1
- AJJDPGVVNPUZCR-RHYQMDGZSA-N Pro-Thr-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@@H]1CCCN1)O AJJDPGVVNPUZCR-RHYQMDGZSA-N 0.000 description 1
- OIDKVWTWGDWMHY-RYUDHWBXSA-N Pro-Tyr Chemical compound C([C@@H](C(=O)O)NC(=O)[C@H]1NCCC1)C1=CC=C(O)C=C1 OIDKVWTWGDWMHY-RYUDHWBXSA-N 0.000 description 1
- DLZBBDSPTJBOOD-BPNCWPANSA-N Pro-Tyr-Ala Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C)C(O)=O DLZBBDSPTJBOOD-BPNCWPANSA-N 0.000 description 1
- VDHGTOHMHHQSKG-JYJNAYRXSA-N Pro-Val-Phe Chemical compound CC(C)[C@H](NC(=O)[C@@H]1CCCN1)C(=O)N[C@@H](Cc1ccccc1)C(O)=O VDHGTOHMHHQSKG-JYJNAYRXSA-N 0.000 description 1
- ZMLRZBWCXPQADC-TUAOUCFPSA-N Pro-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@@H]2CCCN2 ZMLRZBWCXPQADC-TUAOUCFPSA-N 0.000 description 1
- 101710185720 Putative ethidium bromide resistance protein Proteins 0.000 description 1
- MMGJPDWSIOAGTH-ACZMJKKPSA-N Ser-Ala-Gln Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(O)=O MMGJPDWSIOAGTH-ACZMJKKPSA-N 0.000 description 1
- NRCJWSGXMAPYQX-LPEHRKFASA-N Ser-Arg-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CO)N)C(=O)O NRCJWSGXMAPYQX-LPEHRKFASA-N 0.000 description 1
- OYEDZGNMSBZCIM-XGEHTFHBSA-N Ser-Arg-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OYEDZGNMSBZCIM-XGEHTFHBSA-N 0.000 description 1
- SQBLRDDJTUJDMV-ACZMJKKPSA-N Ser-Glu-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O SQBLRDDJTUJDMV-ACZMJKKPSA-N 0.000 description 1
- SNVIOQXAHVORQM-WDSKDSINSA-N Ser-Gly-Gln Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CCC(N)=O)C(O)=O SNVIOQXAHVORQM-WDSKDSINSA-N 0.000 description 1
- WSTIOCFMWXNOCX-YUMQZZPRSA-N Ser-Gly-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CO)N WSTIOCFMWXNOCX-YUMQZZPRSA-N 0.000 description 1
- KDGARKCAKHBEDB-NKWVEPMBSA-N Ser-Gly-Pro Chemical compound C1C[C@@H](N(C1)C(=O)CNC(=O)[C@H](CO)N)C(=O)O KDGARKCAKHBEDB-NKWVEPMBSA-N 0.000 description 1
- IFPBAGJBHSNYPR-ZKWXMUAHSA-N Ser-Ile-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O IFPBAGJBHSNYPR-ZKWXMUAHSA-N 0.000 description 1
- NFDYGNFETJVMSE-BQBZGAKWSA-N Ser-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](N)CO NFDYGNFETJVMSE-BQBZGAKWSA-N 0.000 description 1
- HEUVHBXOVZONPU-BJDJZHNGSA-N Ser-Leu-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O HEUVHBXOVZONPU-BJDJZHNGSA-N 0.000 description 1
- GVIGVIOEYBOTCB-XIRDDKMYSA-N Ser-Leu-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)CO)CC(C)C)C(O)=O)=CNC2=C1 GVIGVIOEYBOTCB-XIRDDKMYSA-N 0.000 description 1
- JWOBLHJRDADHLN-KKUMJFAQSA-N Ser-Leu-Tyr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O JWOBLHJRDADHLN-KKUMJFAQSA-N 0.000 description 1
- PPCZVWHJWJFTFN-ZLUOBGJFSA-N Ser-Ser-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O PPCZVWHJWJFTFN-ZLUOBGJFSA-N 0.000 description 1
- RXUOAOOZIWABBW-XGEHTFHBSA-N Ser-Thr-Arg Chemical compound OC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N RXUOAOOZIWABBW-XGEHTFHBSA-N 0.000 description 1
- PURRNJBBXDDWLX-ZDLURKLDSA-N Ser-Thr-Gly Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CO)N)O PURRNJBBXDDWLX-ZDLURKLDSA-N 0.000 description 1
- DYEGLQRVMBWQLD-IXOXFDKPSA-N Ser-Thr-Phe Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CO)N)O DYEGLQRVMBWQLD-IXOXFDKPSA-N 0.000 description 1
- ZSDXEKUKQAKZFE-XAVMHZPKSA-N Ser-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CO)N)O ZSDXEKUKQAKZFE-XAVMHZPKSA-N 0.000 description 1
- OJFFAQFRCVPHNN-JYBASQMISA-N Ser-Thr-Trp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O OJFFAQFRCVPHNN-JYBASQMISA-N 0.000 description 1
- 101710172711 Structural protein Proteins 0.000 description 1
- IGROJMCBGRFRGI-YTLHQDLWSA-N Thr-Ala-Ala Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O IGROJMCBGRFRGI-YTLHQDLWSA-N 0.000 description 1
- PXQUBKWZENPDGE-CIQUZCHMSA-N Thr-Ala-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](C)NC(=O)[C@H]([C@@H](C)O)N PXQUBKWZENPDGE-CIQUZCHMSA-N 0.000 description 1
- DGDCHPCRMWEOJR-FQPOAREZSA-N Thr-Ala-Tyr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 DGDCHPCRMWEOJR-FQPOAREZSA-N 0.000 description 1
- UNURFMVMXLENAZ-KJEVXHAQSA-N Thr-Arg-Tyr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O UNURFMVMXLENAZ-KJEVXHAQSA-N 0.000 description 1
- VGYBYGQXZJDZJU-XQXXSGGOSA-N Thr-Glu-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O VGYBYGQXZJDZJU-XQXXSGGOSA-N 0.000 description 1
- ONNSECRQFSTMCC-XKBZYTNZSA-N Thr-Glu-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O ONNSECRQFSTMCC-XKBZYTNZSA-N 0.000 description 1
- AQAMPXBRJJWPNI-JHEQGTHGSA-N Thr-Gly-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O AQAMPXBRJJWPNI-JHEQGTHGSA-N 0.000 description 1
- XPNSAQMEAVSQRD-FBCQKBJTSA-N Thr-Gly-Gly Chemical compound C[C@@H](O)[C@H](N)C(=O)NCC(=O)NCC(O)=O XPNSAQMEAVSQRD-FBCQKBJTSA-N 0.000 description 1
- BQBCIBCLXBKYHW-CSMHCCOUSA-N Thr-Leu Chemical compound CC(C)C[C@@H](C([O-])=O)NC(=O)[C@@H]([NH3+])[C@@H](C)O BQBCIBCLXBKYHW-CSMHCCOUSA-N 0.000 description 1
- VRUFCJZQDACGLH-UVOCVTCTSA-N Thr-Leu-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O VRUFCJZQDACGLH-UVOCVTCTSA-N 0.000 description 1
- KKPOGALELPLJTL-MEYUZBJRSA-N Thr-Lys-Tyr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 KKPOGALELPLJTL-MEYUZBJRSA-N 0.000 description 1
- XNTVWRJTUIOGQO-RHYQMDGZSA-N Thr-Met-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(O)=O XNTVWRJTUIOGQO-RHYQMDGZSA-N 0.000 description 1
- WYLAVUAWOUVUCA-XVSYOHENSA-N Thr-Phe-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O WYLAVUAWOUVUCA-XVSYOHENSA-N 0.000 description 1
- AHERARIZBPOMNU-KATARQTJSA-N Thr-Ser-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O AHERARIZBPOMNU-KATARQTJSA-N 0.000 description 1
- VUXIQSUQQYNLJP-XAVMHZPKSA-N Thr-Ser-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CO)C(=O)N1CCC[C@@H]1C(=O)O)N)O VUXIQSUQQYNLJP-XAVMHZPKSA-N 0.000 description 1
- WPSKTVVMQCXPRO-BWBBJGPYSA-N Thr-Ser-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O WPSKTVVMQCXPRO-BWBBJGPYSA-N 0.000 description 1
- MFMGPEKYBXFIRF-SUSMZKCASA-N Thr-Thr-Gln Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(O)=O MFMGPEKYBXFIRF-SUSMZKCASA-N 0.000 description 1
- COYHRQWNJDJCNA-NUJDXYNKSA-N Thr-Thr-Thr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O COYHRQWNJDJCNA-NUJDXYNKSA-N 0.000 description 1
- DIHPMRTXPYMDJZ-KAOXEZKKSA-N Thr-Tyr-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N2CCC[C@@H]2C(=O)O)N)O DIHPMRTXPYMDJZ-KAOXEZKKSA-N 0.000 description 1
- YOPQYBJJNSIQGZ-JNPHEJMOSA-N Thr-Tyr-Tyr Chemical compound C([C@H](NC(=O)[C@@H](N)[C@H](O)C)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=C(O)C=C1 YOPQYBJJNSIQGZ-JNPHEJMOSA-N 0.000 description 1
- QGVBFDIREUUSHX-IFFSRLJSSA-N Thr-Val-Gln Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O QGVBFDIREUUSHX-IFFSRLJSSA-N 0.000 description 1
- BRBCKMMXKONBAA-KWBADKCTSA-N Trp-Ala-Ala Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O)=CNC2=C1 BRBCKMMXKONBAA-KWBADKCTSA-N 0.000 description 1
- OENGVSDBQHHGBU-QEJZJMRPSA-N Trp-Glu-Asn Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O OENGVSDBQHHGBU-QEJZJMRPSA-N 0.000 description 1
- HJXOFWKCWLHYIJ-SZMVWBNQSA-N Trp-Lys-Glu Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O HJXOFWKCWLHYIJ-SZMVWBNQSA-N 0.000 description 1
- NWQCKAPDGQMZQN-IHPCNDPISA-N Trp-Lys-Leu Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O NWQCKAPDGQMZQN-IHPCNDPISA-N 0.000 description 1
- PALLCTDPFINNMM-JQHSSLGASA-N Trp-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)N PALLCTDPFINNMM-JQHSSLGASA-N 0.000 description 1
- VCXWRWYFJLXITF-AUTRQRHGSA-N Tyr-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 VCXWRWYFJLXITF-AUTRQRHGSA-N 0.000 description 1
- DXYWRYQRKPIGGU-BPNCWPANSA-N Tyr-Ala-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 DXYWRYQRKPIGGU-BPNCWPANSA-N 0.000 description 1
- WDIJBEWLXLQQKD-ULQDDVLXSA-N Tyr-Arg-His Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N)O WDIJBEWLXLQQKD-ULQDDVLXSA-N 0.000 description 1
- AYHSJESDFKREAR-KKUMJFAQSA-N Tyr-Asn-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 AYHSJESDFKREAR-KKUMJFAQSA-N 0.000 description 1
- AYPAIRCDLARHLM-KKUMJFAQSA-N Tyr-Asn-Lys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N)O AYPAIRCDLARHLM-KKUMJFAQSA-N 0.000 description 1
- XMNDQSYABVWZRK-BZSNNMDCSA-N Tyr-Asn-Phe Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O XMNDQSYABVWZRK-BZSNNMDCSA-N 0.000 description 1
- BEIGSKUPTIFYRZ-SRVKXCTJSA-N Tyr-Asp-Asp Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N)O BEIGSKUPTIFYRZ-SRVKXCTJSA-N 0.000 description 1
- YGKVNUAKYPGORG-AVGNSLFASA-N Tyr-Asp-Glu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O YGKVNUAKYPGORG-AVGNSLFASA-N 0.000 description 1
- QOIKZODVIPOPDD-AVGNSLFASA-N Tyr-Cys-Gln Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCC(N)=O)C(O)=O QOIKZODVIPOPDD-AVGNSLFASA-N 0.000 description 1
- FQNUWOHNGJWNLM-QWRGUYRKSA-N Tyr-Cys-Gly Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CS)C(=O)NCC(O)=O FQNUWOHNGJWNLM-QWRGUYRKSA-N 0.000 description 1
- FFCRCJZJARTYCG-KKUMJFAQSA-N Tyr-Cys-Lys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CCCCN)C(=O)O)N)O FFCRCJZJARTYCG-KKUMJFAQSA-N 0.000 description 1
- NQJDICVXXIMMMB-XDTLVQLUSA-N Tyr-Glu-Ala Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O NQJDICVXXIMMMB-XDTLVQLUSA-N 0.000 description 1
- CNLKDWSAORJEMW-KWQFWETISA-N Tyr-Gly-Ala Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(=O)N[C@@H](C)C(O)=O CNLKDWSAORJEMW-KWQFWETISA-N 0.000 description 1
- JJNXZIPLIXIGBX-HJPIBITLSA-N Tyr-Ile-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N JJNXZIPLIXIGBX-HJPIBITLSA-N 0.000 description 1
- FJBCEFPCVPHPPM-STECZYCISA-N Tyr-Ile-Val Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O FJBCEFPCVPHPPM-STECZYCISA-N 0.000 description 1
- KSCVLGXNQXKUAR-JYJNAYRXSA-N Tyr-Leu-Glu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O KSCVLGXNQXKUAR-JYJNAYRXSA-N 0.000 description 1
- OFHKXNKJXURPSY-ULQDDVLXSA-N Tyr-Met-Leu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(O)=O OFHKXNKJXURPSY-ULQDDVLXSA-N 0.000 description 1
- RWOKVQUCENPXGE-IHRRRGAJSA-N Tyr-Ser-Arg Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O RWOKVQUCENPXGE-IHRRRGAJSA-N 0.000 description 1
- GQVZBMROTPEPIF-SRVKXCTJSA-N Tyr-Ser-Asp Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O GQVZBMROTPEPIF-SRVKXCTJSA-N 0.000 description 1
- MQGGXGKQSVEQHR-KKUMJFAQSA-N Tyr-Ser-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 MQGGXGKQSVEQHR-KKUMJFAQSA-N 0.000 description 1
- NHOVZGFNTGMYMI-KKUMJFAQSA-N Tyr-Ser-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 NHOVZGFNTGMYMI-KKUMJFAQSA-N 0.000 description 1
- TYFLVOUZHQUBGM-IHRRRGAJSA-N Tyr-Ser-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 TYFLVOUZHQUBGM-IHRRRGAJSA-N 0.000 description 1
- JQOMHZMWQHXALX-FHWLQOOXSA-N Tyr-Tyr-Glu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O JQOMHZMWQHXALX-FHWLQOOXSA-N 0.000 description 1
- VKYDVKAKGDNZED-STECZYCISA-N Tyr-Val-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CC1=CC=C(C=C1)O)N VKYDVKAKGDNZED-STECZYCISA-N 0.000 description 1
- 108010026404 VGX-3100 Proteins 0.000 description 1
- 229940032099 VGX3100 Drugs 0.000 description 1
- 206010046865 Vaccinia virus infection Diseases 0.000 description 1
- AUMNPAUHKUNHHN-BYULHYEWSA-N Val-Asn-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N AUMNPAUHKUNHHN-BYULHYEWSA-N 0.000 description 1
- WBUOKGBHGDPYMH-GUBZILKMSA-N Val-Cys-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@H](CS)NC(=O)[C@@H](N)C(C)C WBUOKGBHGDPYMH-GUBZILKMSA-N 0.000 description 1
- DLYOEFGPYTZVSP-AEJSXWLSSA-N Val-Cys-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CS)C(=O)N1CCC[C@@H]1C(=O)O)N DLYOEFGPYTZVSP-AEJSXWLSSA-N 0.000 description 1
- AHHJARQXFFGOKF-NRPADANISA-N Val-Glu-Cys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CS)C(=O)O)N AHHJARQXFFGOKF-NRPADANISA-N 0.000 description 1
- ROLGIBMFNMZANA-GVXVVHGQSA-N Val-Glu-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](C(C)C)N ROLGIBMFNMZANA-GVXVVHGQSA-N 0.000 description 1
- DJQIUOKSNRBTSV-CYDGBPFRSA-N Val-Ile-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)O)NC(=O)[C@H](C(C)C)N DJQIUOKSNRBTSV-CYDGBPFRSA-N 0.000 description 1
- HGJRMXOWUWVUOA-GVXVVHGQSA-N Val-Leu-Gln Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](C(C)C)N HGJRMXOWUWVUOA-GVXVVHGQSA-N 0.000 description 1
- GVJUTBOZZBTBIG-AVGNSLFASA-N Val-Lys-Arg Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N GVJUTBOZZBTBIG-AVGNSLFASA-N 0.000 description 1
- KTEZUXISLQTDDQ-NHCYSSNCSA-N Val-Lys-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(=O)O)C(=O)O)N KTEZUXISLQTDDQ-NHCYSSNCSA-N 0.000 description 1
- FMQGYTMERWBMSI-HJWJTTGWSA-N Val-Phe-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](C(C)C)N FMQGYTMERWBMSI-HJWJTTGWSA-N 0.000 description 1
- YKNOJPJWNVHORX-UNQGMJICSA-N Val-Phe-Thr Chemical compound CC(C)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H]([C@@H](C)O)C(O)=O)CC1=CC=CC=C1 YKNOJPJWNVHORX-UNQGMJICSA-N 0.000 description 1
- NHXZRXLFOBFMDM-AVGNSLFASA-N Val-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)C(C)C NHXZRXLFOBFMDM-AVGNSLFASA-N 0.000 description 1
- VIKZGAUAKQZDOF-NRPADANISA-N Val-Ser-Glu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O VIKZGAUAKQZDOF-NRPADANISA-N 0.000 description 1
- JAIZPWVHPQRYOU-ZJDVBMNYSA-N Val-Thr-Thr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](C(C)C)N)O JAIZPWVHPQRYOU-ZJDVBMNYSA-N 0.000 description 1
- GTACFKZDQFTVAI-STECZYCISA-N Val-Tyr-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C(C)C)CC1=CC=C(O)C=C1 GTACFKZDQFTVAI-STECZYCISA-N 0.000 description 1
- 208000036142 Viral infection Diseases 0.000 description 1
- 206010047741 Vulval cancer Diseases 0.000 description 1
- 238000002679 ablation Methods 0.000 description 1
- 230000002159 abnormal effect Effects 0.000 description 1
- 235000004279 alanine Nutrition 0.000 description 1
- 108010086434 alanyl-seryl-glycine Proteins 0.000 description 1
- 108010050025 alpha-glutamyltryptophan Proteins 0.000 description 1
- 238000004458 analytical method Methods 0.000 description 1
- 238000010171 animal model Methods 0.000 description 1
- 108010080488 arginyl-arginyl-leucine Proteins 0.000 description 1
- 108010008355 arginyl-glutamine Proteins 0.000 description 1
- 108010068380 arginylarginine Proteins 0.000 description 1
- 238000003556 assay Methods 0.000 description 1
- 230000002238 attenuated effect Effects 0.000 description 1
- 230000015572 biosynthetic process Effects 0.000 description 1
- 230000036755 cellular response Effects 0.000 description 1
- 238000004590 computer program Methods 0.000 description 1
- 238000011461 current therapy Methods 0.000 description 1
- 231100000050 cytotoxic potential Toxicity 0.000 description 1
- 238000013461 design Methods 0.000 description 1
- 238000011161 development Methods 0.000 description 1
- 238000010586 diagram Methods 0.000 description 1
- 238000003114 enzyme-linked immunosorbent spot assay Methods 0.000 description 1
- 238000002474 experimental method Methods 0.000 description 1
- 210000002950 fibroblast Anatomy 0.000 description 1
- 238000009472 formulation Methods 0.000 description 1
- 108020001507 fusion proteins Proteins 0.000 description 1
- 102000037865 fusion proteins Human genes 0.000 description 1
- 108010082286 glycyl-seryl-alanine Proteins 0.000 description 1
- 108010015792 glycyllysine Proteins 0.000 description 1
- 108010084389 glycyltryptophan Proteins 0.000 description 1
- 108010092114 histidylphenylalanine Proteins 0.000 description 1
- 108010085325 histidylproline Proteins 0.000 description 1
- 230000006801 homologous recombination Effects 0.000 description 1
- 238000002744 homologous recombination Methods 0.000 description 1
- 208000021145 human papilloma virus infection Diseases 0.000 description 1
- 230000005847 immunogenicity Effects 0.000 description 1
- 206010022000 influenza Diseases 0.000 description 1
- 208000020082 intraepithelial neoplasia Diseases 0.000 description 1
- 108010078274 isoleucylvaline Proteins 0.000 description 1
- 206010023841 laryngeal neoplasm Diseases 0.000 description 1
- 108010000761 leucylarginine Proteins 0.000 description 1
- 108010025153 lysyl-alanyl-alanine Proteins 0.000 description 1
- 201000004792 malaria Diseases 0.000 description 1
- 210000001161 mammalian embryo Anatomy 0.000 description 1
- 238000013507 mapping Methods 0.000 description 1
- 239000013642 negative control Substances 0.000 description 1
- 238000010606 normalization Methods 0.000 description 1
- 239000002245 particle Substances 0.000 description 1
- 210000003899 penis Anatomy 0.000 description 1
- 239000008194 pharmaceutical composition Substances 0.000 description 1
- 108010073025 phenylalanylphenylalanine Proteins 0.000 description 1
- 108010083476 phenylalanyltryptophan Proteins 0.000 description 1
- 239000000902 placebo Substances 0.000 description 1
- 229940068196 placebo Drugs 0.000 description 1
- 239000013641 positive control Substances 0.000 description 1
- 208000026440 premature labor Diseases 0.000 description 1
- 230000008569 process Effects 0.000 description 1
- 108010020432 prolyl-prolylisoleucine Proteins 0.000 description 1
- 108010079317 prolyl-tyrosine Proteins 0.000 description 1
- 108010015796 prolylisoleucine Proteins 0.000 description 1
- 229940021993 prophylactic vaccine Drugs 0.000 description 1
- 238000005070 sampling Methods 0.000 description 1
- 238000012216 screening Methods 0.000 description 1
- 210000000952 spleen Anatomy 0.000 description 1
- 238000001356 surgical procedure Methods 0.000 description 1
- 230000004083 survival effect Effects 0.000 description 1
- 238000003786 synthesis reaction Methods 0.000 description 1
- 229940021747 therapeutic vaccine Drugs 0.000 description 1
- 210000001519 tissue Anatomy 0.000 description 1
- 108010080629 tryptophan-leucine Proteins 0.000 description 1
- 108010084932 tryptophyl-proline Proteins 0.000 description 1
- 108010044292 tryptophyltyrosine Proteins 0.000 description 1
- 108010078580 tyrosylleucine Proteins 0.000 description 1
- 229940125575 vaccine candidate Drugs 0.000 description 1
- 208000007089 vaccinia Diseases 0.000 description 1
- 208000013139 vaginal neoplasm Diseases 0.000 description 1
- IBIDRSSEHFLGSD-UHFFFAOYSA-N valinyl-arginine Natural products CC(C)C(N)C(=O)NC(C(O)=O)CCCN=C(N)N IBIDRSSEHFLGSD-UHFFFAOYSA-N 0.000 description 1
- 239000013598 vector Substances 0.000 description 1
- 230000009385 viral infection Effects 0.000 description 1
- 229940023147 viral vector vaccine Drugs 0.000 description 1
- 201000005102 vulva cancer Diseases 0.000 description 1
Classifications
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61K—PREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
- A61K39/00—Medicinal preparations containing antigens or antibodies
- A61K39/12—Viral antigens
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/11—DNA or RNA fragments; Modified forms thereof; Non-coding nucleic acids having a biological activity
- C12N15/62—DNA sequences coding for fusion proteins
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61K—PREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
- A61K39/00—Medicinal preparations containing antigens or antibodies
- A61K39/39—Medicinal preparations containing antigens or antibodies characterised by the immunostimulating additives, e.g. chemical adjuvants
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61P—SPECIFIC THERAPEUTIC ACTIVITY OF CHEMICAL COMPOUNDS OR MEDICINAL PREPARATIONS
- A61P31/00—Antiinfectives, i.e. antibiotics, antiseptics, chemotherapeutics
- A61P31/12—Antivirals
- A61P31/20—Antivirals for DNA viruses
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K14/00—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
- C07K14/005—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from viruses
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/63—Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
- C12N15/79—Vectors or expression systems specially adapted for eukaryotic hosts
- C12N15/85—Vectors or expression systems specially adapted for eukaryotic hosts for animal cells
- C12N15/86—Viral vectors
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N7/00—Viruses; Bacteriophages; Compositions thereof; Preparation or purification thereof
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61K—PREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
- A61K39/00—Medicinal preparations containing antigens or antibodies
- A61K2039/58—Medicinal preparations containing antigens or antibodies raising an immune response against a target which is not the antigen used for immunisation
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61K—PREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
- A61K39/00—Medicinal preparations containing antigens or antibodies
- A61K2039/70—Multivalent vaccine
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K2319/00—Fusion polypeptide
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N2710/00—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA dsDNA viruses
- C12N2710/00011—Details
- C12N2710/10011—Adenoviridae
- C12N2710/10041—Use of virus, viral particle or viral elements as a vector
- C12N2710/10043—Use of virus, viral particle or viral elements as a vector viral genome or elements thereof as genetic vector
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N2710/00—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA dsDNA viruses
- C12N2710/00011—Details
- C12N2710/20011—Papillomaviridae
- C12N2710/20022—New viral proteins or individual genes, new structural or functional aspects of known viral proteins or genes
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N2710/00—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA dsDNA viruses
- C12N2710/00011—Details
- C12N2710/20011—Papillomaviridae
- C12N2710/20034—Use of virus or viral component as vaccine, e.g. live-attenuated or inactivated virus, VLP, viral protein
Landscapes
- Health & Medical Sciences (AREA)
- Life Sciences & Earth Sciences (AREA)
- Chemical & Material Sciences (AREA)
- Genetics & Genomics (AREA)
- Organic Chemistry (AREA)
- Engineering & Computer Science (AREA)
- General Health & Medical Sciences (AREA)
- Virology (AREA)
- Medicinal Chemistry (AREA)
- Microbiology (AREA)
- Wood Science & Technology (AREA)
- Zoology (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Biotechnology (AREA)
- Biomedical Technology (AREA)
- Molecular Biology (AREA)
- Immunology (AREA)
- Biochemistry (AREA)
- Animal Behavior & Ethology (AREA)
- Pharmacology & Pharmacy (AREA)
- Veterinary Medicine (AREA)
- Public Health (AREA)
- General Engineering & Computer Science (AREA)
- Mycology (AREA)
- Epidemiology (AREA)
- Biophysics (AREA)
- General Chemical & Material Sciences (AREA)
- Chemical Kinetics & Catalysis (AREA)
- Proteomics, Peptides & Aminoacids (AREA)
- Gastroenterology & Hepatology (AREA)
- Physics & Mathematics (AREA)
- Plant Pathology (AREA)
- Oncology (AREA)
- Nuclear Medicine, Radiotherapy & Molecular Imaging (AREA)
- Communicable Diseases (AREA)
- Peptides Or Proteins (AREA)
- Medicines Containing Antibodies Or Antigens For Use As Internal Diagnostic Agents (AREA)
- Medicines That Contain Protein Lipid Enzymes And Other Medicines (AREA)
- Pharmaceuticals Containing Other Organic And Inorganic Compounds (AREA)
Abstract
本发明涉及一种核酸,其编码包含多个保守肽序列或其变体的多肽,其中所述保守序列在一个或多个HPV基因型16、18、31、52、53和58中是保守的;并且其中所述多肽包含HPV蛋白质E1、E2、E4、E5、E6和E7中的每一种的保守肽序列;并且涉及相关的疫苗、病毒载体、治疗和预防。
Description
技术领域
本发明涉及用于对HPV感染疫苗接种的病毒载体疫苗。
背景技术
人乳头瘤病毒感染是一种由人乳头瘤病毒(HPV)引起的感染。大多数HPV感染不引起症状且自发解决。然而,在一些情况下,它们持续存在并且这可能导致出现疣或癌前病变。癌前病变增加子宫颈、阴门、阴道、阴茎、肛门、口腔或咽喉癌的风险。
全世界每年发生约50万例HPV引起的子宫颈癌,并且这些中的一半以上是致命的。大约85%的病例发生在低/中收入国家,在这些国家,可用治疗有限或没有可用治疗。未接受过预防性疫苗的女性需要每3年进行一次筛查以识别和治疗宫颈上皮内瘤变(CIN)。筛查每年花费英国国家医疗服务体系约1.75亿英镑。
针对CIN的当前疗法是通过电灼术或外科手术切除异常的宫颈组织。当前不存在可用于消除HPV病毒感染的疗法。女性在治疗之后需要多次随访以确保没有复发。疗法还与早产风险增加有关。
已经开发出用于现有HPV感染疗法的HPV疫苗包括Inovio-VGX-3 100(编码E6、E7的DNA),类似于Genexine(GX-188E);编码E2、E6、E7融合蛋白的Janssen-Ad26/Ad35+/-MVA;合成长肽(E6、E7)和类似物,例如PepCan,GTL001;Advaxis-ADXS-HPV;以及编码E7的单核细胞性李斯特菌。然而,这类开发出的疫苗仅针对HPV 16和18,具有安全性问题和/或功效低。例如,在IIb期随机对照试验中,VGX-3100的净功效是18%(疫苗组中的48%对安慰剂组中的30%)。
发明内容
所需要的是一种安全、易于递送并且具有比迄今测试的治疗性疫苗候选物更高的功效的疫苗。因此,本发明的目的是提供一种针对HPV感染的改善疫苗。
根据本发明的第一方面,提供了一种核酸,其编码包含多个保守肽序列或其变体的多肽,
其中所述保守序列在一个或多个HPV基因型16、18、31、52、53和58中是保守的;并且
其中所述多肽包含HPV蛋白质E1、E2、E4、E5、E6和E7中的每一种的保守肽序列。
本发明有利地提供了一种新型的替代且更安全的疫苗接种方法,由此可以将T细胞诱导至病毒体相对保守的抗原。使用来自非结构蛋白的专门设计的保守病毒区段可以提供针对多种重要基因型的保护。
在一个实施方式中,多肽是融合多肽。多肽可以不是野生型多肽。多肽可以是合成的/人工的,例如,多肽可能不在自然界中存在。在一个实施方式中,多肽可以不包含完整的基因序列。多肽可以基本上由保守肽序列组成。在另一个实施方式中,多肽可以基本上由保守肽序列和肽佐剂序列组成。在另一个实施方式中,多肽可以基本上由保守肽序列和其间的一个或多个接头组成。在另一个实施方式中,多肽可以基本上由保守肽序列、肽佐剂序列和其间的一个或多个接头组成。在一个实施方式中,多肽是重组多肽,诸如重组融合多肽。
本文所用的术语“融合多肽”应理解为意指包含衍生自不同基因产物(例如不同的HPV蛋白)的序列的组合或来自相同基因产物(例如单一HPV蛋白)的序列的组合的多肽,其中这些序列来自野生型基因产物的不同/单独的区域。例如,融合多肽可以包含通常被野生型中的其它序列区段分开的序列的组合,并且已经移除了一个或多个分离序列。
本文所用的术语“保守肽序列”或“保守区段”定义为在一个或多个基因型中保守的序列,如下文所定义。在评估保守性之前,从NCBI蛋白质数据库(2014年访问)中收集了来自基因型16、18、31、52、53和58的HPV蛋白E1、E2、E4、E5、E6和E7的所有可用全长序列并且用作本发明方法的输入。使用所有可用序列以确保所选的保守肽序列将等效地代表整个环境种群(参见表1)。使用‘变体’方法识别(图1A)保守肽序列;在保守评估之前,将所有基因型比对并且加权每个基因型内的序列以确保基因型多样性的等效代表并因此确保疫苗候选物代表整个环境种群。然后使用15个氨基酸的滑动窗口评估基因型内的保守性(基因型内保守性),从而根据组合窗口内的氨基酸普遍性和对每个序列的值加权来确定每个窗口的保守值,以识别在每个基因型内保守的片段,以及针对每个窗口创建的标准化基因型内共有序列。‘标准化共有序列’意指代表基因型序列的加权集合的氨基酸序列,而不是每个位置上最常见的氨基酸。要被分类为保守,该窗口必须在蛋白质的所有窗口保守值的第一个四分位数内具有保守值。随后,识别所有基因型中相同位置处的保守基因型内窗口,而与基因型之间共享的基因型内标准化共有序列的同一性百分比无关(基因型间保守)。然后创建所得区域的系统发育并且组合树内群序列以产生具有高水平共享的共有同一性的基因型间共有序列。在此情况下,“基因型间共有序列”是指使用从每个基因型创建的标准化共有序列创建的共有序列。在某些情况下,创建‘经修饰变体’,其中识别了在所有蛋白质中处于相同位置的保守基因型内窗口,这些基因型内窗口在基因型之间共有大于60%共享的基因型内标准化共有序列同一性百分比。
如果从树内群中识别出的血清型间片段具有小于60%的同一性百分比,则这些序列被分类为“高度差异的”,与变体方法中使用的“差异较小”序列形成对比。对于基因型之间高度差异的蛋白质,使用“嵌合”方法识别保守肽序列(图1B);建立系统发育树且仅将基因型内群在一起比对,并且评估基因型内保守性。因此,仅评估内群之间的基因型间保守性并且选择基因型中具有大于60%共享的共有同一性的血清型内保守窗口。在一些情况下,基因型作为‘嵌合体’运行,但未评估基因间保守性(‘嵌合变体’)。
所述多个保守肽序列可以包含10个或更多个保守肽序列。在另一个实施方式中,所述多个保守肽序列可以包含15个或更多个保守肽序列。在另一个实施方式中,所述多个保守肽序列可以包含20个或更多个保守肽序列。在另一个实施方式中,所述多个保守肽序列可以包含25个或更多个保守肽序列。在另一个实施方式中,所述多个保守肽序列可以包含30个或更多个保守肽序列。所述多个保守序列可以包含35个或更多个保守肽序列。在另一个实施方式中,所述多个保守肽序列可以包含40个或更多个保守肽序列。所述多个保守肽序列可以包含45个或更多个保守肽序列。在另一个实施方式中,所述多个保守肽序列可以包含50个或更多个保守肽序列。在另一个实施方式中,所述多个保守肽序列可以包含55个或更多个保守肽序列。在一个实施方式中,所述多个保守肽序列由约56个或更多个保守肽序列组成。在一个实施方式中,所述多个保守肽序列由约57个或更多个保守肽序列组成。在一个实施方式中,所述多个保守肽序列由约58个或更多个保守肽序列组成。在一个实施方式中,所述多个保守肽序列由约59个或更多个保守肽序列组成。在一个实施方式中,所述多个保守肽序列由约59个保守肽序列组成。
经编码的多肽可以包含HPV蛋白质E1、E2、E4、E5、E6和E7中的每一种的至少3个不同的保守肽序列。经编码的多肽可以包含HPV蛋白质E1的至少3个不同的保守肽序列。经编码的多肽可以包含HPV蛋白质E2的至少3个不同的保守肽序列。经编码的多肽可以包含HPV蛋白质E4的至少3个不同的保守肽序列。经编码的多肽可以包含HPV蛋白质E5的至少3个不同的保守肽序列。经编码的多肽可以包含HPV蛋白质E6的至少3个不同的保守肽序列。经编码的多肽可以包含HPV蛋白质E7的至少3个不同的保守肽序列。
经编码的多肽可以包含HPV蛋白质E1的至少4个不同的保守肽序列。经编码的多肽可以包含HPV蛋白质E2的至少4个不同的保守肽序列。经编码的多肽可以包含HPV蛋白质E4的至少4个不同的保守肽序列。经编码的多肽可以包含HPV蛋白质E6的至少4个不同的保守肽序列。经编码的多肽可以包含HPV蛋白质E7的至少4个不同的保守肽序列。
经编码的多肽可以包含HPV蛋白质E1的至少5个不同的保守肽序列。经编码的多肽可以包含HPV蛋白质E2的至少5个不同的保守肽序列。经编码的多肽可以包含HPV蛋白质E4的至少5个不同的保守肽序列。经编码的多肽可以包含HPV蛋白质E6的至少5个不同的保守肽序列。
经编码的多肽可以包含HPV蛋白质E1的至少6个不同的保守肽序列。经编码的多肽可以包含HPV蛋白质E2的至少6个不同的保守肽序列。经编码的多肽可以包含HPV蛋白质E4的至少6个不同的保守肽序列。经编码的多肽可以包含HPV蛋白质E6的至少6个不同的保守肽序列。
经编码的多肽可以包含HPV蛋白质E1的至少7个不同的保守肽序列。经编码的多肽可以包含HPV蛋白质E2的至少7个不同的保守肽序列。经编码的多肽可以包含HPV蛋白质E4的至少7个不同的保守肽序列。经编码的多肽可以包含HPV蛋白质E6的至少7个不同的保守肽序列。
经编码的多肽可以包含HPV蛋白质E1的至少8个不同的保守肽序列。经编码的多肽可以包含HPV蛋白质E2的至少8个不同的保守肽序列。经编码的多肽可以包含HPV蛋白质E4的至少8个不同的保守肽序列。经编码的多肽可以包含HPV蛋白质E6的至少8个不同的保守肽序列。
经编码的多肽可以包含HPV蛋白质E1的至少9个不同的保守肽序列。经编码的多肽可以包含HPV蛋白质E2的至少9个不同的保守肽序列。经编码的多肽可以包含HPV蛋白质E4的至少9个不同的保守肽序列。
经编码的多肽可以包含HPV蛋白质E1的至少10个不同的保守肽序列。经编码的多肽可以包含HPV蛋白质E2的至少10个不同的保守肽序列。
经编码的多肽可以包含HPV蛋白质E1的至少11个不同的保守肽序列。经编码的多肽可以包含HPV蛋白质E2的至少11个不同的保守肽序列。
经编码的多肽可以包含HPV蛋白质E2的至少12、13、14、15、16、17、18、19、20、21、22、23、或24个不同的保守肽序列。
经编码的多肽可以包含以下或由其组成:HPV蛋白质E1的11个或更多个不同的保守肽序列、HPV蛋白质E2的24个或更多个不同的保守肽序列、HPV蛋白质E4的9个或更多个不同的保守肽序列、HPV蛋白质E5的3个或更多个不同的保守肽序列、HPV蛋白质E6的8个或更多个不同的保守肽序列、以及HPV蛋白质E7的4个或更多个不同的保守肽序列。
所述多个保守肽序列可以衍生自相对于彼此不同的序列区域(即非天然同时存在的)。例如,提及“不同的保守肽序列”可以包含衍生自相对于彼此不同的野生型序列区域(即非天然同时存在的)。例如,在野生型基因型中,保守序列在野生型基因型中可以由可变/非保守序列分开。所述多个保守肽序列可以彼此不重叠或不显著重叠。所述多个保守肽序列中的两个或更多个或全部可以在多肽中直接连接在一起,例如其间不包含任何非保守/可变残基。多肽序列可能在自然界中找不到。多肽可以不包含非保守序列或残基。保守肽序列在多肽序列中不会间隔超过1、2、3、4或5个残基,例如在其中保守肽序列之间存在接头/连接残基的实施方式中。可替代地,保守肽序列在多肽序列中不会间隔超过6、7、8、9或10个残基,例如在其中保守肽序列之间存在接头/连接残基的实施方式中。多肽可以不包含长于2、3、4、5、6、7、8、9或10个氨基酸的非保守序列。
在一个实施方式中,可以在一个或多个、或所有保守肽序列之间提供接头残基(例如在多肽中的保守肽序列之间提供连接)。接头残基可以包含随机氨基酸序列,或基于表位预测计算机程序或动物模型中的实验选择为非免疫原性的氨基酸。例如,如果预测或已知其为表位(即为了避免对在HPV中未发现的表位(例如人工表位)免疫应答),则不将其视为接头。接头可以是柔性的。接头可以包含K、G、P、A或S氨基酸残基或其组合或由其组成。在一个实施方式中,接头可以包含G和/或P氨基酸残基或由其组成。在一个实施方式中,接头可以包含一个或多个丙氨酸(A)氨基酸残基或由其组成。接头残基的长度可以在1与10个氨基酸之间。在另一个实施方式中,接头残基的长度可以在2与8个残基之间。在另一个实施方式中,接头残基的长度可以在1与6个残基之间。保守肽序列可以在多肽序列中间隔1至10个残基,例如在其中保守肽序列之间存在接头/连接残基的实施方式中。
在一个实施方式中,多肽可以基本上由保守肽序列和一个或多个接头组成,任选地其中一个或多个接头置于相邻的保守肽序列之间。
保守肽序列可以选自包含SEQ ID NO:1至59;其变体或其组合的组中的任一种。在另一个实施方式中,保守肽序列可以选自以任何顺序包含SEQ ID NO:1至59;其变体或其组合的组中的任一种。在一个实施方式中,保守肽序列可以由包含SEQ ID NO:1至59的组组成。
多肽可以包含选自SEQ ID NO:1-11中的任一种的一个或多个保守E1序列;选自SEQ ID NO:12-35中的任一种的一个或多个保守E2序列;选自SEQ ID NO:36-44中的任一种的一个或多个保守E4序列;选自SEQ ID NO:45-47中的任一种的一个或多个保守E5序列;选自SEQ ID NO:48-55中的任一种的一个或多个保守E6序列;以及选自SEQ ID NO:56-59中的任一种的一个或多个保守E7序列。
多肽可以包含选自SEQ ID NO:1-11中的任一种的两个或更多个保守E1序列;选自SEQ ID NO:12-35中的任一种的两个或更多个保守E2序列;选自SEQ ID NO:36-44中的任一种的两个或更多个保守E4序列;选自SEQ ID NO:45-47中的任一种的两个或更多个保守E5序列;选自SEQ ID NO:48-55中的任一种的两个或更多个保守E6序列;以及选自SEQ ID NO:56-59中的任一种的两个或更多个保守E7序列。
多肽可以包含选自SEQ ID NO:1-11中的任一种的三个或更多个保守E1序列;选自SEQ ID NO:12-35中的任一种的三个或更多个保守E2序列;选自SEQ ID NO:36-44中的任一种的三个或更多个保守E4序列;选自SEQ ID NO:45-47中的任一种的三个或更多个保守E5序列;选自SEQ ID NO:48-55中的任一种的三个或更多个保守E6序列;以及选自SEQ ID NO:56-59中的任一种的三个或更多个保守E7序列。
保守序列在HPV基因型16、18、31、52和58中的一个或多个中是保守的。保守序列在所有HPV基因型16、18、31、52和58中是保守的。
多肽可以包含:
选自SEQ ID NO:1-11中的任一种的一个或多个保守E1序列,其中所述基因型16、18、31、52、53和58中的每一种由至少一个保守E1序列表示;
选自SEQ ID NO:12-35中的任一种的一个或多个保守E2序列,其中所述基因型16、18、31、52、53和58中的每一种由至少一个保守E2序列表示;
选自SEQ ID NO:36-44中的任一种的一个或多个保守E4序列,其中所述基因型16、18、31、52、53和58中的每一种由至少一个保守E4序列表示;
选自SEQ ID NO:45-47中的任一种的一个或多个保守E5序列,其中所述基因型16、18、31、52、53和58中的每一种由至少一个保守E5序列表示;
选自SEQ ID NO:48-55中的任一种的一个或多个保守E6序列,其中所述基因型16、18、31、52、53和58中的每一种由至少一个保守E6序列表示;和
选自SEQ ID NO:56-59中的任一种的一个或多个保守E7序列,其中所述基因型16、18、31、52、53和58中的每一种由至少一个保守E7序列表示。
多肽可以包含:
选自SEQ ID NO:1-11中的任一种的一个或多个保守E1序列,其中所述基因型16、18、31、52和58中的每一种由至少一个保守E1序列表示;
选自SEQ ID NO:12-35中的任一种的一个或多个保守E2序列,其中所述基因型16、18、31、52和58中的每一种由至少一个保守E2序列表示;
选自SEQ ID NO:36-44中的任一种的一个或多个保守E4序列,其中所述基因型16、18、31、52和58中的每一种由至少一个保守E4序列表示;
选自SEQ ID NO:45-47中的任一种的一个或多个保守E5序列,其中所述基因型16、18、31、52和58中的每一种由至少一个保守E5序列表示;
选自SEQ ID NO:48-55中的任一种的一个或多个保守E6序列,其中所述基因型16、18、31、52和58中的每一种由至少一个保守E6序列表示;和
选自SEQ ID NO:56-59中的任一种的一个或多个保守E7序列,其中所述基因型16、18、31、52和58中的每一种由至少一个保守E7序列表示。
多肽可以包含:
选自SEQ ID NO:1-11中的任一种的两个或更多个保守E1序列,其中所述基因型16、18、31、52、53和58中的每一种在保守E1序列的组中表示;
选自SEQ ID NO:12-35中的任一种的两个或更多个保守E2序列,其中所述基因型16、18、31、52、53和58中的每一种在保守E2序列的组中表示;
选自SEQ ID NO:36-44中的任一种的两个或更多个保守E4序列,其中所述基因型16、18、31、52、53和58中的每一种在保守E4序列的组中表示;
选自SEQ ID NO:45-47中的任一种的两个或更多个保守E5序列,其中所述基因型16、18、31、52、53和58中的每一种在保守E5序列的组中表示;
选自SEQ ID NO:48-55中的任一种的两个或更多个保守E6序列,其中所述基因型16、18、31、52、53和58中的每一种在保守E6序列的组中表示;和
选自SEQ ID NO:56-59中的任一种的两个或更多个保守E7序列,其中所述基因型16、18、31、52、53和58中的每一种在保守E7序列的组中表示。
多肽可以包含:
选自SEQ ID NO:1-11中的任一种的两个或更多个保守E1序列,其中所述基因型16、18、31、52和58中的每一种在保守E1序列的组中表示;
选自SEQ ID NO:12-35中的任一种的两个或更多个保守E2序列,其中所述基因型16、18、31、52和58中的每一种在保守E2序列的组中表示;
选自SEQ ID NO:36-44中的任一种的两个或更多个保守E4序列,其中所述基因型16、18、31、52和58中的每一种在保守E4序列的组中表示;
选自SEQ ID NO:45-47中的任一种的两个或更多个保守E5序列,其中所述基因型16、18、31、52和58中的每一种在保守E5序列的组中表示;
选自SEQ ID NO:48-55中的任一种的两个或更多个保守E6序列,其中所述基因型16、18、31、52和58中的每一种在保守E6序列的组中表示;和
选自SEQ ID NO:56-59中的任一种的两个或更多个保守E7序列,其中所述基因型16、18、31、52和58中的每一种在保守E7序列的组中表示。
多肽可以包含:
选自SEQ ID NO:1-11中的任一种的三个或更多个保守E1序列,其中所述基因型16、18、31、52、53和58中的每一种在保守E1序列的组中表示;
选自SEQ ID NO:12-35中的任一种的三个或更多个保守E2序列,其中所述基因型16、18、31、52、53和58中的每一种在保守E2序列的组中表示;
选自SEQ ID NO:36-44中的任一种的三个或更多个保守E4序列,其中所述基因型16、18、31、52、53和58中的每一种在保守E4序列的组中表示;
选自SEQ ID NO:45-47的三个保守E5序列,其中所述基因型16、18、31、52、53和58中的每一种在保守E5序列的组中表示;
选自SEQ ID NO:48-55中的任一种的三个或更多个保守E6序列,其中所述基因型16、18、31、52、53和58中的每一种在保守E6序列的组中表示;和
选自SEQ ID NO:56-59中的任一种的三个或更多个保守E7序列,其中所述基因型16、18、31、52、53和58中的每一种在保守E7序列的组中表示。
多肽可以包含:
选自SEQ ID NO:1-11中的任一种的三个或更多个保守E1序列,其中所述基因型16、18、31、52和58中的每一种在保守E1序列的组中表示;
选自SEQ ID NO:12-35中的任一种的三个或更多个保守E2序列,其中所述基因型16、18、31、52和58中的每一种在保守E2序列的组中表示;
选自SEQ ID NO:36-44中的任一种的三个或更多个保守E4序列,其中所述基因型16、18、31、52和58中的每一种在保守E4序列的组中表示;
选自SEQ ID NO:45-47的三个保守E5序列,其中所述基因型16、18、31、52和58中的每一种在保守E5序列的组中表示;
选自SEQ ID NO:48-55中的任一种的三个或更多个保守E6序列,其中所述基因型16、18、31、52和58中的每一种在保守E6序列的组中表示;和
选自SEQ ID NO:56-59中的任一种的三个或更多个保守E7序列,其中所述基因型16、18、31、52和58中的每一种在保守E7序列的组中表示;
提及“表示基因型16、18、31、52、53和58中的每一种”或“表示基因型16、18、31、52和58中的每一种”旨在意指每个已识别的基因型已用于定义保守肽序列的至少一个共有序列。因此,给定组可以包含来自每个基因型的保守肽,或者保守肽可以衍生自两个或更多个基因型的共有序列。如果序列同一性足够相似,则所有基因型16、18、31、52、53和58或16、18、31、52和58可以由单一保守肽序列表示,该保守肽序列可以分别是所有基因型16、18、31、52、53和58或16、18、31、52和58的共有序列。然而,由于序列同一性的差异,单一保守肽可能无法表示来自所有基因型16、18、31、52、53和58或16、18、31、52和58的共有序列并且取代需要两个或更多个保守肽序列来覆盖/表示所有基因型16、18、31、52、53和58或16、18、31、52和58。例如(仅出于说明目的),一个保守E6肽序列可以表示E6基因型16和18,另一个可以表示E6基因型52,并且第三个可以表示E6基因型53和58,使得组中的全部三个保守E6肽序列表示全部E6基因型16、18、31、52、53和58或16、18、31、52和58。
核酸可以包含序列SEQ ID NO:60或其变体或由其组成。在另一个实施方式中,核酸可以包含序列SEQ ID NO:60或其变体或由其组成,并且不编码TPA前导序列。在另一个实施方式中,核酸可以包含序列SEQ ID NO:60或其变体或由其组成,编码不同/替代的肽佐剂而非TPA前导序列。在另一个实施方式中,核酸可以包含序列SEQ ID NO:65或其变体或由其组成。
核酸的变体可以包含与SEQ ID NO:60或65具有至少80%同一性的序列或由其组成。可替代地,核酸的变体可以包含与SEQ ID NO:60或65具有至少85%同一性的序列或由其组成。核酸的变体可以包含与SEQ ID NO:60或65具有至少90%同一性的序列或由其组成。核酸的变体可以包含与SEQ ID NO:60或65具有至少95%同一性的序列或由其组成。核酸的变体可以包含与SEQ ID NO:60或65具有至少98%同一性的序列或由其组成。核酸的变体可以包含与SEQ ID NO:60或65具有至少99%同一性的序列或由其组成。熟练人员将理解,核酸的变体可以包括编码与SEQ ID NO:60或65相同的肽的冗余密码子变体。
核酸可以包含序列SEQ ID NO:62或其变体或由其组成。在另一个实施方式中,核酸可以包含序列SEQ ID NO:62或其变体或由其组成,并且不编码TPA前导序列。在另一个实施方式中,核酸可以包含序列SEQ ID NO:62或其变体或由其组成,编码不同/替代的肽佐剂而非TPA前导序列。
核酸的变体可以包含与SEQ ID NO:62具有至少80%同一性的序列或由其组成。可替代地,核酸的变体可以包含与SEQ ID NO:62具有至少85%同一性的序列或由其组成。核酸的变体可以包含与SEQ ID NO:62具有至少90%同一性的序列或由其组成。核酸的变体可以包含与SEQ ID NO:62具有至少95%同一性的序列或由其组成。核酸的变体可以包含与SEQ ID NO:62具有至少98%同一性的序列或由其组成。核酸的变体可以包含与SEQ ID NO:62具有至少99%同一性的序列或由其组成。核酸的变体可以包含与SEQ ID NO:62具有至少99.5%同一性的序列或由其组成。核酸的变体可以包含与SEQ ID NO:62具有至少99.9%同一性的序列或由其组成。熟练人员将理解,核酸的变体可以包括编码与SEQ ID NO:62相同的病毒载体和/或肽的冗余密码子变体。
核酸可以包含序列SEQ ID NO:71、73或75或其变体或由其组成。在另一个实施方式中,核酸可以包含序列SEQ ID NO:71、73或75或其变体或由其组成,并且不编码TPA前导序列。在另一个实施方式中,核酸可以包含序列SEQ ID NO:71、73或75或其变体或由其组成,编码不同/替代的肽佐剂而非TPA前导序列。
核酸的变体可以包含与SEQ ID NO:71、73或75具有至少80%同一性的序列或由其组成。可替代地,核酸的变体可以包含与SEQ ID NO:71、73或75具有至少85%同一性的序列或由其组成。核酸的变体可以包含与SEQ ID NO:71、73或75具有至少90%同一性的序列或由其组成。核酸的变体可以包含与SEQ ID NO:71、73或75具有至少95%同一性的序列或由其组成。核酸的变体可以包含与SEQ ID NO:71、73或75具有至少98%同一性的序列或由其组成。核酸的变体可以包含与SEQ ID NO:71、73或75具有至少99%同一性的序列或由其组成。核酸的变体可以包含与SEQ ID NO:71、73或75具有至少99.5%同一性的序列或由其组成。核酸的变体可以包含与SEQ ID NO:71、73或75具有至少99.9%同一性的序列或由其组成。熟练人员将理解,核酸的变体可以包括编码与SEQ ID NO:71、73或75相同的病毒载体和/或肽的冗余密码子变体。
多肽可以包含序列SEQ ID NO:61或其变体或由其组成。在另一个实施方式中,多肽可以包含序列SEQ ID NO:61或其变体或由其组成,并且不编码TPA前导序列。在另一个实施方式中,多肽可以包含序列SEQ ID NO:61或其变体或由其组成,具有不同/替代的肽佐剂而非TPA前导序列。在另一个实施方式中,多肽可以包含序列SEQ ID NO:66或其变体或由其组成。
多肽可以包含序列SEQ ID NO:72、74或76或其变体或由其组成。在另一个实施方式中,多肽可以包含序列SEQ ID NO:72、74或76或其变体或由其组成,并且不编码TPA前导序列。在另一个实施方式中,多肽可以包含序列SEQ ID NO:72、74或76或其变体或由其组成,具有不同/替代的肽佐剂而非TPA前导序列。在另一个实施方式中,多肽可以包含序列SEQ ID NO:72、74或76或其变体或由其组成。
在一个实施方式中,多肽可以基本上由保守肽序列和肽佐剂组成。在一个实施方式中,多肽可以基本上由保守肽序列、一个或多个接头和肽佐剂组成。一个或多个接头可置于相邻的保守肽序列之间。肽佐剂可以是N-末端。
多肽的变体可以包含与SEQ ID NO:61、66、72、74或76具有至少80%同一性的序列或由其组成。可替代地,多肽的变体可以包含与SEQ ID NO:61、66、72、74或76具有至少85%同一性的序列或由其组成。多肽的变体可以包含与SEQ ID NO:61、66、72、74或76具有至少90%同一性的序列或由其组成。多肽的变体可以包含与SEQ ID NO:61、66、72、74或76具有至少95%同一性的序列或由其组成。多肽的变体可以包含与SEQ ID NO:61、66、72、74或76具有至少98%同一性的序列或由其组成。多肽的变体可以包含与SEQ ID NO:61、66、72、74或76具有至少99%同一性的序列或由其组成。
保守肽序列的变体可以包含保守肽序列的截短序列或由其组成。例如,本文的SEQID NO:1至59的任一个或多个序列可以被截短并且仍在多肽中提供免疫原性。截短序列可以包含足够数量的氨基酸以从SEQ ID NO:1至59的任一个序列内的序列形成可识别的表位(例如对于特异性T细胞识别至少最小数量的残基)。截短序列可以包含SEQ ID NO:1至59的序列的至少7个氨基酸。可替代地,截短序列可以包含SEQ ID NO:1至59的序列的至少8个氨基酸。可替代地,截短序列可以包含SEQ ID NO:1至59的序列的至少9、10、11或12个氨基酸。可以在SEQ ID NO:1至59的保守肽序列之一内提供多个截短序列。
在一个实施方式中,SEQ ID NO:1至59的任一个保守肽序列可以例如通过残基取代、添加或缺失而变化。在另一个实施方式中,SEQ ID NO:1至59的一些或全部保守肽序列可以例如通过残基取代、添加或缺失而变化。变体保守肽序列仍可起作用以提供可识别的HPV表位。熟练人员将理解,任何给定的群体中都存在天然变异,并且这些变异可以具有共有序列或SEQ ID NO:1至59中提供的示例患者序列的某些序列变异。因此,变体保守肽序列可以与SEQ ID NO:1至59中的任一个具有至少70%序列同一性。在另一个实施方式中,变体保守肽序列可以与SEQ ID NO:1至59中的任一个具有至少74%序列同一性。在另一个实施方式中,变体保守肽序列可以与SEQ ID NO:1至59中的任一个具有至少75%序列同一性。在另一个实施方式中,变体保守肽序列可以与SEQ ID NO:1至59中的任一个具有至少79%序列同一性。在另一个实施方式中,变体保守肽序列可以与SEQ ID NO:1至59中的任一个具有至少80%序列同一性。在另一个实施方式中,变体保守肽序列可以与SEQ ID NO:1至59中的任一个具有至少82%序列同一性。在另一个实施方式中,变体保守肽序列可以与SEQ IDNO:1至59中的任一个具有至少83%序列同一性。在另一个实施方式中,变体保守肽序列可以与SEQ ID NO:1至59中的任一个具有至少85%序列同一性。在另一个实施方式中,变体保守肽序列可以与SEQ ID NO:1至59中的任一个具有至少88%序列同一性。在另一个实施方式中,变体保守肽序列可以与SEQ ID NO:1至59中的任一个具有至少90%序列同一性。在另一个实施方式中,变体保守肽序列可以与SEQ ID NO:1至59中的任一个具有至少92%序列同一性。在另一个实施方式中,变体保守肽序列可以与SEQ ID NO:1至59中的任一个具有至少95%序列同一性。在另一个实施方式中,变体保守肽序列可以与SEQ ID NO:1至59中的任一个具有至少98%序列同一性。在另一个实施方式中,变体保守肽序列可以与SEQ ID NO:1至59中的任一个具有至少99%序列同一性。在另一个实施方式中,变体保守肽序列可以与SEQ ID NO:1至59中的任一个具有至少99.5%序列同一性。
提及本文所用的序列“同一性”可以指使用标准NCBI BLASTp参数的两个比对序列之间的同一性百分比(http://blast.ncbi.nlm.nih.gov)。
保守肽序列的长度可以变化,最小长度定义为形成可识别表位所需的最小残基数。因此,保守肽序列的长度可以是约7至250个氨基酸或更多。例如,至少一个保守肽序列的长度可以是至少约7个氨基酸。在另一个实施方式中,至少一个保守肽序列的长度可以是至少约8个氨基酸。在另一个实施方式中,至少一个保守肽序列的长度可以是至少约10个氨基酸。在另一个实施方式中,至少一个保守肽序列的长度可以是至少约15个氨基酸。在另一个实施方式中,至少一个保守肽序列的长度可以是至少约20个氨基酸。在另一个实施方式中,至少一个保守肽序列的长度可以是至少约30个氨基酸。在一个实施方式中,至少一个保守肽序列的长度可以在约20与约220个氨基酸之间。在一个实施方式中,至少一个保守肽序列的长度可以不超过约300个氨基酸。在另一个实施方式中,至少一个保守肽序列的长度可以不超过约250个氨基酸。在另一个实施方式中,至少一个保守肽序列的长度可以不超过约200个氨基酸。在另一个实施方式中,至少一个保守肽序列的长度可以不超过约150个氨基酸。在另一个实施方式中,至少一个保守肽序列的长度可以不超过约100个氨基酸。在另一个实施方式中,至少一个保守肽序列的长度可以不超过约55个氨基酸。在另一个实施方式中,至少一个保守肽序列的长度可以不超过约54个氨基酸。
保守肽序列可以是保守肽序列群中在约15与约50个氨基酸之间的平均长度。
在本发明的一些实施方式中,多肽可以进一步包含肽佐剂,诸如TPA(组织型纤溶酶原激活物)序列或其功能变体。TPA可以包含以下序列或由其组成:MDAMKRGLCCVLLLCGAVFVSPSQEIHARFRR(SEQ ID NO:63)或其功能变体。在一个实施方式中,肽佐剂可以包含例如序列SLLWGGVTVLAAMLIAGQVASSVVFLV(SEQ ID NO:64)或其功能变体的Shark不变链。肽佐剂可以是本发明的多肽上的N-末端。肽佐剂的功能变体可以是截短或突变肽变体,其仍可以充当佐剂,例如仍充当佐剂的TPA或shark不变链的截短或突变变体。熟练人员将了解,可以在不影响功能的情况下取代、添加或去除1、2、3、4、5或更多个氨基酸残基。例如,可以考虑保守取代。在提供(或适当时编码)肽佐剂的实施方式中,可以另外提供在肽佐剂与第一保守肽序列之间提供(或编码)的接头序列。在没有肽佐剂的实施方式中,可以不提供第一接头序列。
核酸的组合可以编码根据本发明的不同多肽,可以作为疫苗提供。例如,初免和/或加强疫苗制剂可以包含编码两种或更多种本发明的多肽的核酸或病毒载体,所述多肽可以相对于彼此不同。
核酸可以与另一种治疗或预防活性成分组合用于疫苗。核酸可以与佐剂组合用于疫苗。
根据本发明的另一方面,提供了一种包含多种根据本发明的不同核酸的组合物,任选地其中该组合物是药学上可接受的组合物。
根据本发明的另一方面,本文提供了一种由根据本发明的核酸编码的多肽。
在一个实施方式中,多肽是分离多肽。多肽、编码该多肽的核酸或相关的病毒颗粒可以在药学上可接受的载体中提供。
核酸可以是用于疫苗接种的质粒载体。核酸可以包含病毒载体序列。
根据本发明的另一方面,本文提供了一种病毒载体,该病毒载体包含根据本发明的核酸。
病毒载体可以包含病毒。病毒载体可以包含腺病毒,诸如人或猿猴腺病毒。当用于初免加强方案的初免疫苗中时,病毒载体可以包含腺病毒。病毒载体可以包含ChAdOx1(E群猿猴腺病毒,如在疟疾试验中安全使用的AdCh63载体)或ChAdOx2(如Morris等人2016.Future Virol 11(9),第649-659页中所述)。ChAdOx2序列可以包含本文所述的序列(例如SEQ ID NO:67+68)或由其组成。病毒载体可以包含AdCh63。病毒载体可以包含AdC3或AdH6。病毒载体可以是人血清型。病毒载体可以包含修饰的痘苗安卡拉(MVA)。病毒载体可以包含F11 MVA(例如在F11基因座处具有核酸构建体插入物的MVA)。本发明的核酸(HPV疫苗构建体插入物)可以在p7.5启动子的控制下,例如通过与p7.5 MVA穿梭质粒(SEQ ID NO:158)重组在亲本MVA病毒的TK基因座处插入。核酸可以包含序列SEQ ID NO:158,该序列具有如SEQ ID NO:158中提供的核酸疫苗构建体插入物(加下划线),或具有本文根据本发明的替代核酸疫苗构建体。在另一个实施方式中,本发明的核酸(HPV疫苗构建体插入物)可以在F11启动子的控制下,例如通过与F11穿梭质粒(SEQ ID NO:159)重组在亲本MVA病毒的F11基因座处插入。核酸可以包含序列SEQ ID NO:159,该序列具有如SEQ ID NO:159中提供的核酸疫苗构建体插入物(加下划线),或具有本文根据本发明的替代核酸疫苗构建体。MVA序列可以包含本文所述的序列(例如SEQ ID NO:69++70)或由其组成。当用作初免加强方案中的疫苗加强时,病毒载体可以包含MVA。病毒载体可以包含腺相关病毒(AAV)或慢病毒。病毒载体可以是减毒的病毒载体。可以将本发明的多肽序列克隆到已知引起良好免疫应答的任何合适的病毒载体中。合适的病毒载体已经描述于Dicks等人(Vaccine.2015年2月25日;33(9):1121-8.doi:10.1016/j.vaccine.2015.01.042.Epub 2015年1月25日)、Antrobus等人(Mol Ther.2014年3月;22(3):668-74.doi:10.1038/mt.2013.284.Epub 2013年12月30日),和Warimwe等人(Virol J.2013年12月5日;10:349.doi:10.1186/1743-422X-10-349),其通过引用并入本文。
根据本发明的另一方面,提供了一种组合物,该组合物包含以下中的一种或多种:
-根据本发明的多肽;
-根据本发明的核酸;和
-根据本发明的病毒载体。
该组合物例如在哺乳动物诸如人中可以是免疫原性的。该组合物可以进一步包含药学上可接受的载体。该组合物可以是包含药学上可接受的载体的药物组合物。该组合物可以用于预防或治疗HPV感染。
根据本发明的另一方面,提供了一种治疗或预防HPV感染的方法,该方法包括施用:
-根据本发明的多肽;
-根据本发明的核酸;
-根据本发明的组合物;或
-根据本发明的病毒载体。
治疗或预防HPV感染的方法可以是疫苗接种方法。
根据本发明的另一方面,提供了一种用于预防或治疗HPV感染的试剂,该试剂包含以下或由其组成:
-根据本发明的多肽;
-根据本发明的组合物;
-根据本发明的核酸;或
-根据本发明的病毒载体。
在一个实施方式中,治疗或预防HPV感染包括治疗或预防肛门生殖器HPV驱动的病变,诸如肛门、阴门、阴道或阴茎上皮内瘤变。另外或可替代地,治疗或预防HPV感染包括治疗或预防由HPV引起的口咽部病变。
根据本发明的另一方面,提供了根据本发明的多肽;根据本发明的组合物;根据本发明的核酸;或根据本发明的病毒载体;其用于疫苗或用作疫苗。
根据本发明的另一方面,提供了一种疫苗,该疫苗包含本发明的核酸;根据本发明的多肽;根据本发明的组合物;或根据本发明的病毒载体。
疫苗可以是初免疫苗。疫苗可以是加强疫苗。在初免疫苗后提供加强疫苗的情况下,多肽可以不同。例如,多肽可以包含保守肽序列的重排序列。保守肽序列可以是相同的,但是它们在多肽中提供的顺序可以改变。因此,本文的发明提供其中可以改变提供保守肽序列的顺序的本发明的任何序列/实施方式。这类实施方式还可以包括重排或不同的接头/连接序列。
有利地,在初免疫苗与加强疫苗之间多肽的保守肽序列的重排可以避免提供在一个保守肽序列与另一个保守肽序列的连接处形成的“假”表位,即相同的连接可能不会在重排的多肽中出现。
根据本发明的另一方面,提供了根据本发明的核酸或多肽用于疫苗或用作疫苗。
根据本发明的另一方面,提供了一种初免加强疫苗接种试剂盒,其包括:
-根据本发明的初免疫苗接种;
-根据本发明的加强疫苗接种;
初免和加强疫苗接种可以不同。初免和加强疫苗接种的多肽序列可能不同。初免和加强疫苗接种可以包含不同的病毒载体(即来自不同病毒科,诸如MVA与腺病毒)。
根据本发明的另一方面,本文提供了一种组合物,该组合物包含根据本发明的核酸和药学上可接受的载体。
该组合物可以不包含野生型HPV。该组合物可以不包含全长HPV蛋白序列。病毒载体或核酸可能不编码HPV的非保守蛋白/肽序列。
可以与药学上可接受的载体一起使用。另外或可替代地,可以与佐剂一起使用。
当应用于本发明的核酸、多肽或组合物时,术语“免疫原性”意指能够在人或动物体内引发免疫应答。免疫应答可以是保护性的。
术语“保护性”意指预防疾病、疾病感染、传播和/或进展的风险降低、疾病的严重程度降低、治愈病状或疾病、症状缓解、或者疾病或疾病症状的严重程度降低。
术语“预防”意指对疾病的预防或保护性治疗。预防可以包括降低疾病感染、传播和/或进展的风险,或降低疾病的严重程度。
术语“治疗”意指治愈病状或疾病、减轻症状、或降低疾病或疾病症状的严重程度。
熟练人员将理解,本发明的一个实施方式或方面的任选特征在合适时可以适用于本发明的其它实施方式或方面。
附图说明
现在将仅通过例示的方式参考以下附图更详细地描述本发明的实施方式。
图1-用于创建HPV候选物的嵌合和变体方法。
图2-保守区及其下文的所得变体。
图3-A)被识别为在用于形成嵌合体的两个基因型中保守的区域。B)修饰变体的保守图。
图4-A)被识别为在用于形成嵌合体的两个基因型中保守的区域。B)变体的保守图
图5-被识别为在用于形成嵌合体的两个基因型中保守的区域
图6-被识别为在用于形成嵌合变体的两个基因型中保守的区域
图7-被识别为在用于形成嵌合体的两个基因型中保守的区域
图8-在初免加强疫苗接种后诱导高频率的疫苗特异性T细胞。对来自C57BL/6小鼠(六只/组)的PBMC进行IFNγElispot,所述小鼠用DNA-HPV、MVA-HPV或ChAdOx1-5GHPV3肌肉内初免,并且然后在两周后用异源或同源疫苗肌肉内加强。初免后两周和加强后一周和两周通过尾静脉出血收集PBMC。
图9-初免加强疫苗接种能够诱导对免疫原中编码的所有抗原的应答。在加强两周时收集PBMC并且将其用于肽跨越整个免疫原序列的IFNγElispot中,根据蛋白质来源合并。由于存在大量的肽,将跨越免疫原E2区的肽分成两个库,并且将跨越E4和E5的区域的肽合并到一个库中。
图10-疫苗特异性CD8+T细胞可以在ChAdOx1-5GHPV3初免MVA-5GHPV3加强六周之后在高频下检测到。10a:使用PBMC的IFNγElispot,所述PBMC在加强(DM;DNA-5GHPV3初免MVA-5GHPV3加强,CM;ChAdOx1-5GHPV3初免MVA-5GHPV3加强)后两周、三周、四周和六周后通过尾静脉出血收集。10b:对在ChAdOx1-5GHPV3初免、MVA-5GHPV3加强后一周和六周获得的PBMC进行的ICS。用E6和E7肽库刺激PBMC。测量IFN-γ、CD107、TNF-α和IL2。
图11-HPV E6和E7特异性CD8+T细胞应答是多功能的并且具有细胞毒性潜力。在ChAdOx1-5GHPV3初免MVA-5GHPV3加强后一周通过尾静脉出血收集的PBMC用免疫显性肽库E6和E7以及子库22(其是E6内的显性子库)刺激。应答的CD8+T细胞主要表达三种功能(CD107、IFNγ和TNFα)。
图12–ChAdOx1-5GHPV3初免MVA-5GHPV3加强也引发HPV E6和E7特异性CD4+T细胞应答,这些应答在加强后六周仍可检测到。对在ChAdOx1-5GHPV3初免、MVA-5GHPV3加强后一周和六周获得的PBMC进行的ICS。用E6和E7肽库刺激PBMC。测量IFN-γ、CD107、TNF-α和IL2。
图13-大多数E6和E7特异性CD4+T细胞表达两种功能。在ChAdOx1-5GHPV3初免MVA-5GHPV3加强后一周通过尾静脉出血收集的PBMC用免疫显性肽库E6和E7以及子库22(其是E6内的显性子库)刺激。应答的CD4+T细胞主要表达两种功能(折扣CD107+单功能细胞)。
图14-HPV E6和E7特异性CD8+T细胞可以在子宫颈中检测到。对子宫颈阴道淋巴细胞进行ICS,所述子宫颈阴道淋巴细胞在ChAdOx1-5GHPV3初免MVA-5GHPV3加强并且用免疫显性肽库E6和E7以及子库22刺激后两周从小鼠分离。测量IFN-γ、CD107、TNF-α和IL2。
图15-子宫颈阴道HPV E6和E7特异性CD8+T细胞应答是多功能的。在ChAdOx1-5GHPV3初免MVA-5GHPV3加强后一周收集的子宫颈阴道淋巴细胞用免疫显性肽库E6和E7以及子库22(其是E6内的显性子库)刺激。应答的CD8+T细胞主要表达三种功能(CD107、IFNγ和TNFα)。
图16-子宫颈中疫苗诱导的E6和E7特异性CD8+和CD4+T细胞几乎完全是效应子表型。原始:CD44-CD62L+。抗原处理:中央记忆细胞-CD62L+、CD127+;效应记忆细胞-CD62L-、CD127+;效应细胞-CD62L-、CD127-。由于淋巴细胞数量低,将六只小鼠的子宫颈淋巴细胞合并成三对
图17-与子宫颈相比,脾脏中疫苗诱导的E6和E7特异性CD8+和CD4+T细胞包含效应细胞和效应记忆细胞群。
图18-用ChAdOx1-5GHPV3初免、MVA-5GHPV3加强疫苗接种的小鼠相比于对照小鼠展示存活率提高。在第0天给小鼠接种5x104个TC-1细胞,然后在第3天进行初免,并且在第17天加强。每两天用数字卡尺测量肿瘤并且当肿瘤在任一方向上达到10mm时将小鼠杀死。
图19-远交CD1小鼠的疫苗接种诱导高频T细胞应答。对来自CD1小鼠(十只/组)的PBMC进行IFNγElispot,所述小鼠用DNA-5GHPV3、MVA-5GHPV3或ChAdOx1-5GHPV3肌肉内初免,并且然后在两周后用异源或同源疫苗肌肉内加强。初免后两周和加强后两周和三周通过尾静脉出血收集PBMC。
图20-免疫接种CD1小鼠中的T细胞应答指向整个免疫原。在加强两周时收集PBMC并且将其用于肽跨越整个免疫原序列的IFNγElispot中,根据蛋白质来源合并。由于存在大量的肽,将跨越免疫原E2区的肽分成两个库,并且将跨越E4和E5的区域的肽合并到一个库中。
图21-上图:在离体IFN-γElispot分析中,基于高危HPV(hrHPV)的早期蛋白质,测试来自76位16-24岁女性的外周血单核细胞(PBMC)的肽识别。‘插入物’是由跨越hrHPV转基因的11个氨基酸重叠的15聚体肽库。‘参考’肽是基于来自HPV16和HPV52的早期蛋白质的肽库,所述早期蛋白质组合如下:E1/E2、E4/E5和E6/E7。‘FEC’(流感、EBV和CMV)肽和PHA(植物血凝素)用作阳性对照。所示数据是减去阴性对照值(模拟刺激的细胞)之后从肽刺激的孔中获得的斑点形成单位(SFU)。阳性应答的临界值设定为25SFU/百万个PBMC(从所有供体的模拟刺激值的平均值+2个标准偏差推导)。基于阴道采样同时测试了女性的hrHPV DNA:测试的26%呈阳性。数据显示,具有目前hrHPV感染或先前暴露的9/76名女性识别转基因中编码的HPV序列。下图:通过用插入物库的子库(左侧,每百万个PBMC的SFU),之后库内的单个肽(右侧,每个孔的SFU)测试PBMC,在一个应答的供体中进一步询问对插入物库的应答,因此证实存在真实的HPV特异性应答。SFU-报告为每孔(含有200000个PBMC)或每百万个PBMC的斑点形成单位。
图22-在ChAdOX1-5G-HPV3初免、MVA-5GHPV3加强后,C57BL/6小鼠(上图)和CD1小鼠(下图)中的子库绘图。在加强后两周将小鼠处死并分离脾细胞。将脾细胞用于IFNγElispot分析中,该分析使用覆盖免疫原序列的子库。子库21、25和26(例如)不含HPV53序列并且仍然获得高幅度应答,因此提供了不含HPV53片段的序列仍将具有免疫原性的证据。
具体实施方式
HPV免疫原设计
HPV免疫原由保守并且在蛋白质水平上地理上代表全局HPV群体的氨基酸片段构成。使用保守性算法创建每个片段,该保守算法已经用于创建基于嵌合或变体的片段。选择取决于所用每种HPV蛋白的特征。
核心保守性算法使用滑动窗口方法与标准化方法的组合,该标准化方法考虑了收集偏差以识别在基因型内部(基因型内)和基因型之间(基因型间)均保守的窗口。基因型内保守的窗口被归类为保守值小于整个序列所有窗口保守值之和的四分之一的窗口。在此过程中,为每个基因型创建标准化的代表性共有序列。随后,基因型间窗口被识别为每个基因型中相同位置的窗口,这些窗口是保守的并且具有>60%共享的共有同一性。
创建代表每种蛋白质的区域的所有选定基因型的序列的一个关键挑战在于许多基因型显示出独特的系统发育聚类。基因型中保守的区域的识别与这些区域之间共享的共有同一性百分比水平之间存在关键的平衡。在许多情况下,基因型间差异过大以致牺牲共享的共有同一性%。一些基因型彼此非常不同,意味着无法创建代表性序列。为了解决这个,原始序列以两种方式之一输入到核心算法中(图1)。(i)变体;将所有基因型的序列输入到算法中并且识别基因型间保守区与共享的共有同一性%无关。从所得区域创建系统发育并且组合内群序列以创建具有高水平共享的共有同一性的共有序列(图1b);(ii)嵌合体;在一些情况下,基因型之间的距离过于显著以致基因型无法正确比对。因此,仅将基因型内群一起输入到算法中,并且识别具有>60%共享的共有同一性的内群之间的保守区域(图1a)。
HPV候选物
使用从NCBI蛋白质数据库中收集的输入序列为六种HPV蛋白质(E1-2、E4-7)创建候选序列,进行比对并手动审核以除去不完整和质量较差的序列(表1)。
表1|用作输入后审核的序列数。
E1候选物
识别出E1蛋白内的5个片段,这些片段具有在其各自基因型内的相同位置处保守的窗口。支序分类学识别了提供高水平共享的共有同一性的最合适基因型组合(图2)。
E2候选物
由于比对不一致,基于系统发育创建了三个嵌合体(16和31、52和58、53和18)(图3a)。另外,所有基因型都以类似于创建变体的方式输入到算法中,但是没有禁用用于仅从每个基因型中选择具有大于60%共享的共有同一性%的保守窗口的程序过滤器。这识别出一个片段,称为‘修饰变体’(图3b)。
E4候选物
创建了四个嵌合体,其中两个片段是基因型特异性的(图4a)。另外,变体分析识别了一个区域(图4b)。
E5候选物
创建了三个嵌合体(图5)。
E6候选物
E6蛋白在基因型中显示出非常有限的保守性。取代地,处理内群以产生没有共享的共有同一性%过滤器的嵌合体,但是除了基因型52和58,未组合保守窗口以产生共有序列。(图6)
E7候选物
针对基因型16和31以及52和58创建了嵌合体(图7)。基因型53和18是嵌合变体。
具有SEQ ID NO.的疫苗片段的概述
HPV3核苷酸序列(SEQ ID NO:60)
从起始密码子,以用额外接头编码的TPA前导序列开始(TPA和连接子为粗体和下划线)。
不编码肽佐剂/TPA的HPV3核苷酸序列(SEQ ID NO:65)
HPV3多肽序列(SEQ ID NO:61)
从起始密码子,以TPA前导序列开始。
不含TPA/肽佐剂序列的HPV3多肽序列(SEQ ID NO:66)
具有免疫原编码序列插入物的病毒载体序列ChAdOxl.
免疫原编码序列插入物的起始和终止密码子带下划线。前导TPA序列和接头呈粗体。
(SEQ ID NO:62)
ChAdOx2序列(SEQ ID NO:67+68)
免疫原盒的ChAdOx2序列5’以SEQ ID NO:67提供,并且免疫原盒的ChAdOx2序列3’以SEQ ID NO:68提供。
MVA序列(SEQ ID NO:69+70)
免疫原盒的MVA序列5’以SEQ ID NO:69提供,并且免疫原盒的MVA序列3’以SEQ IDNO:70提供。
已经使用两种不同的穿梭质粒制备MVA疫苗:
1.P7.5穿梭质粒。HPV插入物具有与亲本MVA病毒中的TK基因座区域同源的上游和下游侧翼。插入物在p7.5启动子的控制下。
2.F11穿梭质粒。HPV插入物具有与亲本MVA病毒中的F11基因座区域同源的上游和下游侧翼。插入物在F11启动子的控制下。
然后将鸡胚胎成纤维细胞用MVA亲本病毒感染并且用p7.5或F11 MVA穿梭质粒转染,以使其与MVA基因组同源重组。因此获得亲本MVA,其感兴趣基因插入TK基因座(p7.5穿梭质粒)或F11基因座(F11穿梭质粒)处的MVA基因组中。产生两种版本的MVA疫苗。
P7.5穿梭质粒(SEQ ID NO:158)(插入物带下划线)
F11-HPV穿梭质粒(SEQ ID NO:159)(疫苗构建体插入物带下划线)
HPV2随机化核苷酸序列(片段呈随机顺序)(包括tPA前导序列和HindIII克隆接头,带下划线)(SEQ ID NO:71)
HPV2随机化序列多肽(包括tPA前导序列和HindIII克隆接头,带下划线)(SEQ IDNO:72)
HPV53del核苷酸序列(已除去来自HPV53的片段)(包括tPA前导序列和HindIII克隆接头,带下划线)(SEQ ID NO:73)
HPV53del多肽序列(包括tPA前导序列和HindIII克隆接头,带下划线)(SEQ IDNO:74)
HPV3接头核苷酸序列(包括tPA前导序列和HindIII克隆接头,带下划线)(SEQ IDNO:75)
HPV3接头多肽序列(包括tPA前导序列和HindIII克隆接头,带下划线)(SEQ IDNO:76)
序列表
<110> 牛津大学创新有限公司
<120> HPV疫苗
<130> JDM89042P.WOP
<150> GB1713163.2
<151> 2017-08-16
<160> 159
<170> PatentIn 版本3.5
<210> 1
<211> 20
<212> PRT
<213> 人工序列
<220>
<223> 共有序列
<400> 1
Asp Glu Asp Glu Thr Ala Tyr Asp Ser Gly Thr Asp Leu Ile Asp Phe
1 5 10 15
Ile Asp Asp Ser
20
<210> 2
<211> 19
<212> PRT
<213> 人工序列
<220>
<223> 共有序列
<400> 2
Asp Glu Asn Glu Asn Asp Ser Asp Thr Gly Glu Asp Met Val Asp Phe
1 5 10 15
Ile Asp Asn
<210> 3
<211> 19
<212> PRT
<213> 人工序列
<220>
<223> 共有序列
<400> 3
Asp Glu Thr Asp Glu Glu Ser Thr Glu Ser Asp Leu Asp Gly Phe Ile
1 5 10 15
Asp Asn Ser
<210> 4
<211> 15
<212> PRT
<213> 人工序列
<220>
<223> 共有序列
<400> 4
Ala Gln Leu Ala Asp Ser Asp Ser Asn Ala Cys Ala Phe Leu Lys
1 5 10 15
<210> 5
<211> 15
<212> PRT
<213> 人工序列
<220>
<223> 共有序列
<400> 5
Ala Gln Leu Ala Asp Val Asn Ser Asn Ala Ala Ala Phe Leu Lys
1 5 10 15
<210> 6
<211> 20
<212> PRT
<213> 人工序列
<220>
<223> 共有序列
<400> 6
Asn Cys Ile Leu Leu Tyr Gly Ala Ala Asn Thr Gly Lys Ser Leu Phe
1 5 10 15
Gly Met Ser Leu
20
<210> 7
<211> 20
<212> PRT
<213> 人工序列
<220>
<223> 共有序列
<400> 7
Asn Cys Leu Val Leu Cys Gly Pro Ala Asn Thr Gly Lys Ser Tyr Phe
1 5 10 15
Gly Met Ser Leu
20
<210> 8
<211> 20
<212> PRT
<213> 人工序列
<220>
<223> 共有序列
<400> 8
Asn Cys Leu Val Ile Tyr Gly Pro Pro Asn Thr Gly Lys Ser Cys Phe
1 5 10 15
Ala Met Ser Leu
20
<210> 9
<211> 17
<212> PRT
<213> 人工序列
<220>
<223> 共有序列
<400> 9
Trp Pro Tyr Leu His Ser Arg Leu Val Val Phe Thr Phe Pro Asn Pro
1 5 10 15
Phe
<210> 10
<211> 17
<212> PRT
<213> 人工序列
<220>
<223> 共有序列
<400> 10
Trp Pro Tyr Leu Glu Ser Arg Ile Thr Val Phe Glu Phe Pro Asn Ala
1 5 10 15
Phe
<210> 11
<211> 17
<212> PRT
<213> 人工序列
<220>
<223> 共有序列
<400> 11
Leu Arg Tyr Leu His Ser Arg Ile His Val Leu Gln Phe Leu Asn Pro
1 5 10 15
Phe
<210> 12
<211> 17
<212> PRT
<213> 人工序列
<220>
<223> 共有序列
<400> 12
Asn Val Cys Gln Asp Lys Ile Leu Glu His Tyr Glu Asn Asp Ser Lys
1 5 10 15
Asp
<210> 13
<211> 16
<212> PRT
<213> 人工序列
<220>
<223> 共有序列
<400> 13
Ile Leu Glu His Tyr Glu Asn Asp Ser Lys Asp Leu Cys Asp His Ile
1 5 10 15
<210> 14
<211> 21
<212> PRT
<213> 人工序列
<220>
<223> 共有序列
<400> 14
Cys Asp His Ile Asp Tyr Trp Lys His Ile Arg Leu Glu Cys Ala Ile
1 5 10 15
Met Tyr Lys Ala Arg
20
<210> 15
<211> 17
<212> PRT
<213> 人工序列
<220>
<223> 共有序列
<400> 15
Ile Arg Leu Glu Cys Ala Ile Met Tyr Lys Ala Arg Glu Met Gly Phe
1 5 10 15
His
<210> 16
<211> 17
<212> PRT
<213> 人工序列
<220>
<223> 共有序列
<400> 16
Ile Arg Leu Glu Cys Ala Ile Met Tyr Lys Ala Arg Glu Met Gly Phe
1 5 10 15
His
<210> 17
<211> 17
<212> PRT
<213> 人工序列
<220>
<223> 共有序列
<400> 17
Ile Tyr Ile Cys Glu Asp Ala Gln Cys Thr Val Val Glu Gly Gln Val
1 5 10 15
Asp
<210> 18
<211> 17
<212> PRT
<213> 人工序列
<220>
<223> 共有序列
<400> 18
Lys Lys Trp Glu Val His Ala Gly Gly Gln Val Ile Leu Cys Pro Glu
1 5 10 15
Ser
<210> 19
<211> 11
<212> PRT
<213> 人工序列
<220>
<223> 共有序列
<400> 19
Gly Gln Arg Arg Ile Lys Arg Pro Arg Ser Glu
1 5 10
<210> 20
<211> 8
<212> PRT
<213> 人工序列
<220>
<223> 共有序列
<400> 20
Asn Cys His Pro Asn Lys Leu Leu
1 5
<210> 21
<211> 15
<212> PRT
<213> 人工序列
<220>
<223> 共有序列
<400> 21
Ile Leu Lys Cys Leu Arg Tyr Arg Phe Lys Lys His Cys Lys Leu
1 5 10 15
<210> 22
<211> 14
<212> PRT
<213> 人工序列
<220>
<223> 共有序列
<400> 22
Ser Ser Thr Trp His Trp Thr Cys His Asp Gly Lys His Lys
1 5 10
<210> 23
<211> 19
<212> PRT
<213> 人工序列
<220>
<223> 共有序列
<400> 23
Trp His Trp Thr Cys His Asp Gly Lys His Lys Asn Ala Ile Val Thr
1 5 10 15
Leu Thr Tyr
<210> 24
<211> 34
<212> PRT
<213> 人工序列
<220>
<223> 共有序列
<400> 24
Tyr Glu Ala Asp Lys Asn Asp Leu Asn Ala Gln Ile Glu His Trp Lys
1 5 10 15
Leu Ile Arg Met Glu Cys Ala Ile Phe Tyr Lys Ala Lys Glu Leu Gly
20 25 30
Ile Ser
<210> 25
<211> 34
<212> PRT
<213> 人工序列
<220>
<223> 共有序列
<400> 25
Ile Cys His Gln Val Val Pro Pro Leu Ala Ala Ser Lys Ala Lys Ala
1 5 10 15
Cys Gln Ala Ile Glu Leu Gln Leu Ala Leu Glu Ala Leu Asn Ala Ser
20 25 30
Pro Tyr
<210> 26
<211> 18
<212> PRT
<213> 人工序列
<220>
<223> 共有序列
<400> 26
Asp Glu Trp Thr Leu Gln Gln Thr Ser Leu Glu Met Trp Leu Ala Glu
1 5 10 15
Pro Gln
<210> 27
<211> 29
<212> PRT
<213> 人工序列
<220>
<223> 共有序列
<400> 27
Phe Lys Lys His Gly Ile Thr Ile Thr Val Gln Tyr Asp Asn Asp Lys
1 5 10 15
Ala Asn Thr Met Asp Tyr Thr Asn Trp Lys Glu Ile Tyr
20 25
<210> 28
<211> 18
<212> PRT
<213> 人工序列
<220>
<223> 共有序列
<400> 28
Val Ile Val Cys Pro Ala Ser Ile Pro Ser Asp Glu Ile Ser Thr Glu
1 5 10 15
Glu Ala
<210> 29
<211> 20
<212> PRT
<213> 人工序列
<220>
<223> 共有序列
<400> 29
Asp His Ile Asp Tyr Trp Lys Ala Ile Arg Gln Glu Asn Ala Ile Phe
1 5 10 15
Phe Ala Ala Arg
20
<210> 30
<211> 19
<212> PRT
<213> 人工序列
<220>
<223> 共有序列
<400> 30
His Gln Val Val Pro Ala Leu Asn Ile Cys Lys Ala Lys Ala Cys Lys
1 5 10 15
Ala Ile Glu
<210> 31
<211> 21
<212> PRT
<213> 人工序列
<220>
<223> 共有序列
<400> 31
Trp Asn Thr Glu Pro Lys His Cys Phe Lys Lys Gly Gly Gln His Ile
1 5 10 15
Glu Val Trp Phe Asp
20
<210> 32
<211> 18
<212> PRT
<213> 人工序列
<220>
<223> 共有序列
<400> 32
Tyr Val Ala Trp Asp Ser Val Tyr Tyr Cys Gly Asp Asp Gly Trp Cys
1 5 10 15
Lys Thr
<210> 33
<211> 16
<212> PRT
<213> 人工序列
<220>
<223> 共有序列
<400> 33
Glu Ala Glu Lys Tyr Gly Cys Lys Gly Thr Trp Glu Val His Phe Gly
1 5 10 15
<210> 34
<211> 24
<212> PRT
<213> 人工序列
<220>
<223> 共有序列
<400> 34
Asn Ser Ile Asp Cys Asn Asp Ser Met Cys Ser Thr Phe Asp Asp Asn
1 5 10 15
Val Ser Ala Thr Glu Leu Val Lys
20
<210> 35
<211> 20
<212> PRT
<213> 人工序列
<220>
<223> 共有序列
<400> 35
Asp His Ile Asp Tyr Trp Lys Leu Ile Arg Leu Glu Cys Ala Ile Phe
1 5 10 15
Tyr Lys Ala Arg
20
<210> 36
<211> 11
<212> PRT
<213> 人工序列
<220>
<223> 共有序列
<400> 36
Arg Arg Leu Ser Ser Asp Gln Asp Gln Ser Gln
1 5 10
<210> 37
<211> 12
<212> PRT
<213> 人工序列
<220>
<223> 共有序列
<400> 37
Leu Val Thr Lys Tyr Pro Leu Leu Lys Leu Leu Ser
1 5 10
<210> 38
<211> 41
<212> PRT
<213> 人工序列
<220>
<223> 共有序列
<400> 38
Arg Pro Pro Asn Met Gly Val Lys Ala His Gly Lys Cys Ile Trp Glu
1 5 10 15
Asn Lys Val Phe Ile Val Pro Thr Leu Cys Pro Val Pro Leu Asp Pro
20 25 30
Thr Tyr Pro Leu Leu Lys Leu Leu Thr
35 40
<210> 39
<211> 29
<212> PRT
<213> 人工序列
<220>
<223> 共有序列
<400> 39
Thr Gln Thr Thr Thr Pro Glu Asn Thr Ser Leu Val Glu Leu Arg Val
1 5 10 15
Thr Thr Pro Lys Ser Thr Val Val Ile Arg Leu His Leu
20 25
<210> 40
<211> 31
<212> PRT
<213> 人工序列
<220>
<223> 共有序列
<400> 40
Thr Thr Arg Tyr Pro Leu Leu Ser Leu Leu Asn Ser Tyr Ser Thr Pro
1 5 10 15
Pro His Arg Ile Pro Ala Pro Cys Pro Trp Ala Pro Gln Arg Pro
20 25 30
<210> 41
<211> 10
<212> PRT
<213> 人工序列
<220>
<223> 共有序列
<400> 41
Pro Ile Pro Lys Pro Ser Pro Trp Ala Pro
1 5 10
<210> 42
<211> 10
<212> PRT
<213> 人工序列
<220>
<223> 共有序列
<400> 42
Pro Ile Pro Lys Pro Ser Pro Trp Ala Pro
1 5 10
<210> 43
<211> 10
<212> PRT
<213> 人工序列
<220>
<223> 共有序列
<400> 43
Pro Arg Pro Pro His Cys Pro Trp Val Pro
1 5 10
<210> 44
<211> 10
<212> PRT
<213> 人工序列
<220>
<223> 共有序列
<400> 44
Pro Pro Pro Pro Pro Arg Pro Trp Ala Pro
1 5 10
<210> 45
<211> 26
<212> PRT
<213> 人工序列
<220>
<223> 共有序列
<400> 45
Cys Phe Leu Leu Cys Phe Cys Val Leu Leu Cys Val Cys Leu Leu Ile
1 5 10 15
Arg Pro Leu Leu Leu Ser Val Ser Thr Tyr
20 25
<210> 46
<211> 27
<212> PRT
<213> 人工序列
<220>
<223> 共有序列
<400> 46
Leu Arg Pro Leu Leu Leu Ser Ile Ser Val Tyr Ala Gln Val Leu Val
1 5 10 15
Leu Val Leu Leu Leu Trp Val Ser Ile Gly Ser
20 25
<210> 47
<211> 28
<212> PRT
<213> 人工序列
<220>
<223> 共有序列
<400> 47
Leu Leu Pro Ser Val Cys Met Cys Ala Tyr Ala Trp Val Leu Val Phe
1 5 10 15
Val Tyr Ile Val Val Ile Thr Ser Pro Ala Thr Ala
20 25
<210> 48
<211> 54
<212> PRT
<213> 人工序列
<220>
<223> 共有序列
<400> 48
Ile Val Tyr Arg Asp Gly Asn Pro Tyr Ala Val Cys Asp Lys Cys Leu
1 5 10 15
Lys Phe Tyr Ser Lys Ile Ser Glu Tyr Arg His Tyr Cys Tyr Ser Leu
20 25 30
Tyr Gly Thr Thr Leu Glu Gln Gln Tyr Asn Lys Pro Leu Cys Asp Leu
35 40 45
Leu Ile Arg Cys Ile Asn
50
<210> 49
<211> 54
<212> PRT
<213> 人工序列
<220>
<223> 共有序列
<400> 49
Val Val Tyr Arg Asp Ser Ile Pro His Ala Ala Cys His Lys Cys Ile
1 5 10 15
Asp Phe Tyr Ser Arg Ile Arg Glu Leu Arg His Tyr Ser Asp Ser Val
20 25 30
Tyr Gly Asp Thr Leu Glu Lys Leu Thr Asn Thr Gly Leu Tyr Asn Leu
35 40 45
Leu Ile Arg Cys Leu Arg
50
<210> 50
<211> 26
<212> PRT
<213> 人工序列
<220>
<223> 共有序列
<400> 50
Val Phe Cys Lys Lys Ala Leu Thr Ala Ser Glu Val Tyr Asn Phe Ala
1 5 10 15
Tyr Thr Asp Leu Arg Val Val Tyr Arg Asp
20 25
<210> 51
<211> 18
<212> PRT
<213> 人工序列
<220>
<223> 共有序列
<400> 51
Ser Lys Val Arg Lys Leu Arg Tyr Tyr Asn Cys Ser Val Tyr Gly Ala
1 5 10 15
Ser Leu
<210> 52
<211> 26
<212> PRT
<213> 人工序列
<220>
<223> 共有序列
<400> 52
Val Tyr Cys Lys Gly Gln Leu Thr Glu Thr Glu Val Leu Asp Phe Ala
1 5 10 15
Phe Thr Asp Leu Thr Ile Val Tyr Arg Asp
20 25
<210> 53
<211> 18
<212> PRT
<213> 人工序列
<220>
<223> 共有序列
<400> 53
Ser Lys Val Ser Glu Phe Arg Trp Tyr Arg Tyr Ser Val Tyr Gly Thr
1 5 10 15
Thr Leu
<210> 54
<211> 15
<212> PRT
<213> 人工序列
<220>
<223> 共有序列
<400> 54
Cys Val Glu Cys Lys Lys Thr Leu Gln Arg Ser Glu Val Tyr Asp
1 5 10 15
<210> 55
<211> 21
<212> PRT
<213> 人工序列
<220>
<223> CQRPLCPQEKKRHVDLNKRFH
<400> 55
Cys Gln Arg Pro Leu Cys Pro Gln Glu Lys Lys Arg His Val Asp Leu
1 5 10 15
Asn Lys Arg Phe His
20
<210> 56
<211> 21
<212> PRT
<213> 人工序列
<220>
<223> 共有序列
<400> 56
Thr Leu His Glu Tyr Met Leu Asp Leu Gln Pro Glu Thr Thr Asp Leu
1 5 10 15
Tyr Cys Tyr Glu Gln
20
<210> 57
<211> 26
<212> PRT
<213> 人工序列
<220>
<223> 共有序列
<400> 57
Pro Glu Thr Thr Asp Leu His Cys Tyr Glu Gln Leu Gly Asp Ser Ser
1 5 10 15
Asp Glu Glu Asp Thr Gly Gly Leu Asp Gly
20 25
<210> 58
<211> 39
<212> PRT
<213> 人工序列
<220>
<223> 共有序列
<400> 58
Asp Glu Asp Glu Asp Glu Val Asp His Leu Gln Glu Gln Pro Gln Gln
1 5 10 15
Ala Arg Arg Asp Glu Gln His Pro Cys Tyr Leu Ile Glu Thr Gln Cys
20 25 30
Cys Arg Cys Glu Ser Leu Val
35
<210> 59
<211> 37
<212> PRT
<213> 人工序列
<220>
<223> 共有序列
<400> 59
Glu Glu Asn Asp Glu Ile Asp Gly Val Asn His Gln His Leu Pro Ala
1 5 10 15
Arg Arg Ala Glu Pro Gln Arg His Thr Met Leu Cys Met Cys Cys Lys
20 25 30
Cys Glu Ala Arg Ile
35
<210> 60
<211> 3960
<212> DNA
<213> 人工序列
<220>
<223> 疫苗构建体
<400> 60
atggatgcta tgaagagggg cctgtgctgc gtgctgctgc tgtgtggcgc cgtgtttgtg 60
tcccccagcc aggaaatcca cgcccggttc agaagaggca gcaagctggc cgacgaggac 120
gagacagcct acgacagcgg caccgacctg atcgacttca tcgacgacag cgacgagaat 180
gagaacgact ccgacaccgg cgaggacatg gtggatttca tcgacaacga cgaaaccgac 240
gaagagagca ccgagagcga cctggacggc tttatcgaca actccgccca gctggctgac 300
agcgacagca atgcctgcgc cttcctgaag gctcagctgg cagacgtgaa cagcaacgcc 360
gctgcttttc tgaagaactg catcctgctg tacggcgctg ccaacaccgg caagagcctg 420
ttcggcatga gcctgaactg cctggtgctg tgcggcccag ccaataccgg aaagtcctac 480
ttcggcatgt ccctgaattg tctcgtgatc tacggcccac ctaacacagg caagtcctgc 540
tttgccatgt ctctgtggcc ctacctgcac agcagactgg tggtgtttac cttccccaac 600
cccttctggc cttacctgga aagccggatc accgtgttcg agttccccaa tgcctttctg 660
agatacctgc actcccggat ccacgtgctg cagtttctga accccttcaa cgtgtgccag 720
gacaagatcc tggaacacta cgagaacgac agcaaggaca ttctggaaca ttatgagaat 780
gattccaagg acctgtgcga ccacatctgc gatcacatcg actactggaa gcacatccgg 840
ctggaatgcg ccatcatgta caaggcccgg atcagactgg aatgtgctat tatgtataag 900
gctcgcgaga tgggcttcca ccagttcgac ggcgacatct gcaacaccat gcactacacc 960
aactggatct atatctgcga ggacgcccag tgcaccgtgg tggaaggcca ggtggacaag 1020
aaatgggagg tgcacgctgg cggccaagtg atcctgtgtc ctgagagcgg ccagcggcgg 1080
atcaagaggc ccagaagcga gaactgccac cccaacaagc tgctgatcct gaagtgcctg 1140
cggtacagat tcaagaagca ctgcaagctg agcagcacct ggcactggac ctgccacgac 1200
ggcaagcaca agtggcattg gacatgtcac gatgggaaac acaagaacgc cattgtgacc 1260
ctgacctact acgaggccga caagaacgac ctgaacgccc agatcgagca ctggaaactg 1320
atccggatgg aatgtgcaat cttctataag gccaaagagc tgggcatcag catctgccac 1380
caggtggtgc ctccactggc cgcctctaaa gccaaagcct gccaggccat cgaactgcag 1440
ctggccctgg aagccctgaa tgccagcccc tacgatgagt ggaccctgca gcagaccagc 1500
ctggaaatgt ggctggccga gccccagttt aagaagcacg gcatcaccat caccgtgcag 1560
tacgacaatg acaaggccaa taccatggat tacacaaatt ggaaagaaat ctacgtgatc 1620
gtgtgccccg ccagcatccc ctccgatgag atcagcaccg aggaagccga ccacattgat 1680
tattggaaag ccatcaggca ggaaaacgcc atcttcttcg ccgccagaca ccaggtggtg 1740
cccgccctga atatctgcaa ggccaaggcc tgtaaagcca tcgagtggaa caccgagccc 1800
aagcactgct tcaagaaggg cggccagcac atcgaagtgt ggttcgacta cgtggcctgg 1860
gacagcgtgt actactgcgg cgacgatggc tggtgcaaga ccgaggccga gaagtacggc 1920
tgcaagggca cctgggaagt gcatttcggc aacagcatcg actgcaacga ctccatgtgc 1980
agcaccttcg acgacaacgt gtccgccacc gagctcgtga aggaccatat cgactattgg 2040
aagctgattc gcctggaatg tgccattttt tacaaggcca gacggcggct gtccagcgac 2100
caggatcagt ctcagctcgt gaccaagtac cccctgctga agctgctgtc cagacccccc 2160
aacatgggcg tgaaggccca cggcaagtgc atctgggaga acaaggtgtt catcgtgccc 2220
accctgtgcc ccgtgcctct ggatccaaca tatcctctgc tgaaactgct gaccacccag 2280
accaccaccc ccgagaatac ctccctggtg gaactgagag tgaccacccc caagagcaca 2340
gtcgtgatca ggctgcacct gaccaccaga tacccactgc tgtcactgct gaacagctac 2400
agcacccccc ctcaccggat ccctgctcca tgtccttggg ctcctcagag gccccccatc 2460
cctaagcctt ctccatgggc ccctagaatc cctgcccctt gcccctgggc acctcctaga 2520
cctccacact gtccatgggt gccccctcca cctcctccaa gaccttgggc cccttgcttc 2580
ctgctgtgct tttgtgtgct gctgtgcgtg tgcctgctga tcagacccct gctgctgagt 2640
gtgtccacct acctgaggcc tctgctgctg tctatcagcg tgtacgctca ggtgctggtg 2700
ctggtgctgc tgctgtgggt gtccatcgga agcctgctgc ccagcgtgtg catgtgtgcc 2760
tatgcctggg tgctggtgtt cgtgtacatc gtcgtgatta ccagccccgc caccgccatc 2820
gtgtaccggg atggcaatcc ttacgccgtg tgcgacaagt gcctgaagtt ctacagcaag 2880
atcagcgagt accggcacta ctgctacagc ctgtacggca ccaccctgga acagcagtac 2940
aacaagcccc tgtgcgatct gctgattcgg tgcatcaacg tggtgtacag agactccatc 3000
ccccacgccg cctgccacaa gtgtatcgac ttctactcca gaatcagaga gctgcggcac 3060
tacagcgact ccgtgtacgg cgataccctg gaaaagctga ccaacactgg cctgtacaac 3120
ctgctgatta gatgcctgcg ggtgttctgc aagaaggccc tgacagccag cgaggtgtac 3180
aacttcgcct acaccgatct gcgggtggtg tatcgggaca gcaaagtgcg gaagctgagg 3240
tactacaact gctctgtgta tggcgccagc ctggtgtatt gcaagggaca gctgaccgag 3300
acagaggtgc tggatttcgc cttcacagac ctgacaatcg tgtatcgcga ctccaaggtg 3360
tccgagttcc ggtggtacag atattccgtg tatggcacca cactgtgcgt ggaatgcaag 3420
aaaaccctgc agagatctga ggtgtacgac tgccagcggc cactgtgtcc gcaggaaaag 3480
aaaagacacg tggacctgaa caagcggttc cacaccctgc acgagtacat gctggatctg 3540
cagcccgaga caaccgacct gtactgctac gagcagcctg aaaccactga tctgcactgt 3600
tatgagcagc tgggagacag ctccgatgaa gaggacactg gcggcctgga tggggacgag 3660
gatgaggacg aagtggacca tctgcaggaa cagccccagc aggctagacg ggacgaacag 3720
cacccttgct atctgatcga gacacagtgc tgcagatgcg aatctctggt ggaagagaac 3780
gacgagatcg acggcgtgaa ccaccagcat ctgcccgcta gaagggccga gcctcagaga 3840
cacaccatgc tgtgtatgtg ctgcaagtgc gaggccagaa tcgccggctc tggacctggc 3900
gcctctggca agcctatccc caatccactg ctgggcctgg actccacccg gacctgataa 3960
<210> 61
<211> 1318
<212> PRT
<213> 人工序列
<220>
<223> 疫苗构建体
<400> 61
Met Asp Ala Met Lys Arg Gly Leu Cys Cys Val Leu Leu Leu Cys Gly
1 5 10 15
Ala Val Phe Val Ser Pro Ser Gln Glu Ile His Ala Arg Phe Arg Arg
20 25 30
Gly Ser Lys Leu Ala Asp Glu Asp Glu Thr Ala Tyr Asp Ser Gly Thr
35 40 45
Asp Leu Ile Asp Phe Ile Asp Asp Ser Asp Glu Asn Glu Asn Asp Ser
50 55 60
Asp Thr Gly Glu Asp Met Val Asp Phe Ile Asp Asn Asp Glu Thr Asp
65 70 75 80
Glu Glu Ser Thr Glu Ser Asp Leu Asp Gly Phe Ile Asp Asn Ser Ala
85 90 95
Gln Leu Ala Asp Ser Asp Ser Asn Ala Cys Ala Phe Leu Lys Ala Gln
100 105 110
Leu Ala Asp Val Asn Ser Asn Ala Ala Ala Phe Leu Lys Asn Cys Ile
115 120 125
Leu Leu Tyr Gly Ala Ala Asn Thr Gly Lys Ser Leu Phe Gly Met Ser
130 135 140
Leu Asn Cys Leu Val Leu Cys Gly Pro Ala Asn Thr Gly Lys Ser Tyr
145 150 155 160
Phe Gly Met Ser Leu Asn Cys Leu Val Ile Tyr Gly Pro Pro Asn Thr
165 170 175
Gly Lys Ser Cys Phe Ala Met Ser Leu Trp Pro Tyr Leu His Ser Arg
180 185 190
Leu Val Val Phe Thr Phe Pro Asn Pro Phe Trp Pro Tyr Leu Glu Ser
195 200 205
Arg Ile Thr Val Phe Glu Phe Pro Asn Ala Phe Leu Arg Tyr Leu His
210 215 220
Ser Arg Ile His Val Leu Gln Phe Leu Asn Pro Phe Asn Val Cys Gln
225 230 235 240
Asp Lys Ile Leu Glu His Tyr Glu Asn Asp Ser Lys Asp Ile Leu Glu
245 250 255
His Tyr Glu Asn Asp Ser Lys Asp Leu Cys Asp His Ile Cys Asp His
260 265 270
Ile Asp Tyr Trp Lys His Ile Arg Leu Glu Cys Ala Ile Met Tyr Lys
275 280 285
Ala Arg Ile Arg Leu Glu Cys Ala Ile Met Tyr Lys Ala Arg Glu Met
290 295 300
Gly Phe His Gln Phe Asp Gly Asp Ile Cys Asn Thr Met His Tyr Thr
305 310 315 320
Asn Trp Ile Tyr Ile Cys Glu Asp Ala Gln Cys Thr Val Val Glu Gly
325 330 335
Gln Val Asp Lys Lys Trp Glu Val His Ala Gly Gly Gln Val Ile Leu
340 345 350
Cys Pro Glu Ser Gly Gln Arg Arg Ile Lys Arg Pro Arg Ser Glu Asn
355 360 365
Cys His Pro Asn Lys Leu Leu Ile Leu Lys Cys Leu Arg Tyr Arg Phe
370 375 380
Lys Lys His Cys Lys Leu Ser Ser Thr Trp His Trp Thr Cys His Asp
385 390 395 400
Gly Lys His Lys Trp His Trp Thr Cys His Asp Gly Lys His Lys Asn
405 410 415
Ala Ile Val Thr Leu Thr Tyr Tyr Glu Ala Asp Lys Asn Asp Leu Asn
420 425 430
Ala Gln Ile Glu His Trp Lys Leu Ile Arg Met Glu Cys Ala Ile Phe
435 440 445
Tyr Lys Ala Lys Glu Leu Gly Ile Ser Ile Cys His Gln Val Val Pro
450 455 460
Pro Leu Ala Ala Ser Lys Ala Lys Ala Cys Gln Ala Ile Glu Leu Gln
465 470 475 480
Leu Ala Leu Glu Ala Leu Asn Ala Ser Pro Tyr Asp Glu Trp Thr Leu
485 490 495
Gln Gln Thr Ser Leu Glu Met Trp Leu Ala Glu Pro Gln Phe Lys Lys
500 505 510
His Gly Ile Thr Ile Thr Val Gln Tyr Asp Asn Asp Lys Ala Asn Thr
515 520 525
Met Asp Tyr Thr Asn Trp Lys Glu Ile Tyr Val Ile Val Cys Pro Ala
530 535 540
Ser Ile Pro Ser Asp Glu Ile Ser Thr Glu Glu Ala Asp His Ile Asp
545 550 555 560
Tyr Trp Lys Ala Ile Arg Gln Glu Asn Ala Ile Phe Phe Ala Ala Arg
565 570 575
His Gln Val Val Pro Ala Leu Asn Ile Cys Lys Ala Lys Ala Cys Lys
580 585 590
Ala Ile Glu Trp Asn Thr Glu Pro Lys His Cys Phe Lys Lys Gly Gly
595 600 605
Gln His Ile Glu Val Trp Phe Asp Tyr Val Ala Trp Asp Ser Val Tyr
610 615 620
Tyr Cys Gly Asp Asp Gly Trp Cys Lys Thr Glu Ala Glu Lys Tyr Gly
625 630 635 640
Cys Lys Gly Thr Trp Glu Val His Phe Gly Asn Ser Ile Asp Cys Asn
645 650 655
Asp Ser Met Cys Ser Thr Phe Asp Asp Asn Val Ser Ala Thr Glu Leu
660 665 670
Val Lys Asp His Ile Asp Tyr Trp Lys Leu Ile Arg Leu Glu Cys Ala
675 680 685
Ile Phe Tyr Lys Ala Arg Arg Arg Leu Ser Ser Asp Gln Asp Gln Ser
690 695 700
Gln Leu Val Thr Lys Tyr Pro Leu Leu Lys Leu Leu Ser Arg Pro Pro
705 710 715 720
Asn Met Gly Val Lys Ala His Gly Lys Cys Ile Trp Glu Asn Lys Val
725 730 735
Phe Ile Val Pro Thr Leu Cys Pro Val Pro Leu Asp Pro Thr Tyr Pro
740 745 750
Leu Leu Lys Leu Leu Thr Thr Gln Thr Thr Thr Pro Glu Asn Thr Ser
755 760 765
Leu Val Glu Leu Arg Val Thr Thr Pro Lys Ser Thr Val Val Ile Arg
770 775 780
Leu His Leu Thr Thr Arg Tyr Pro Leu Leu Ser Leu Leu Asn Ser Tyr
785 790 795 800
Ser Thr Pro Pro His Arg Ile Pro Ala Pro Cys Pro Trp Ala Pro Gln
805 810 815
Arg Pro Pro Ile Pro Lys Pro Ser Pro Trp Ala Pro Arg Ile Pro Ala
820 825 830
Pro Cys Pro Trp Ala Pro Pro Arg Pro Pro His Cys Pro Trp Val Pro
835 840 845
Pro Pro Pro Pro Pro Arg Pro Trp Ala Pro Cys Phe Leu Leu Cys Phe
850 855 860
Cys Val Leu Leu Cys Val Cys Leu Leu Ile Arg Pro Leu Leu Leu Ser
865 870 875 880
Val Ser Thr Tyr Leu Arg Pro Leu Leu Leu Ser Ile Ser Val Tyr Ala
885 890 895
Gln Val Leu Val Leu Val Leu Leu Leu Trp Val Ser Ile Gly Ser Leu
900 905 910
Leu Pro Ser Val Cys Met Cys Ala Tyr Ala Trp Val Leu Val Phe Val
915 920 925
Tyr Ile Val Val Ile Thr Ser Pro Ala Thr Ala Ile Val Tyr Arg Asp
930 935 940
Gly Asn Pro Tyr Ala Val Cys Asp Lys Cys Leu Lys Phe Tyr Ser Lys
945 950 955 960
Ile Ser Glu Tyr Arg His Tyr Cys Tyr Ser Leu Tyr Gly Thr Thr Leu
965 970 975
Glu Gln Gln Tyr Asn Lys Pro Leu Cys Asp Leu Leu Ile Arg Cys Ile
980 985 990
Asn Val Val Tyr Arg Asp Ser Ile Pro His Ala Ala Cys His Lys Cys
995 1000 1005
Ile Asp Phe Tyr Ser Arg Ile Arg Glu Leu Arg His Tyr Ser Asp
1010 1015 1020
Ser Val Tyr Gly Asp Thr Leu Glu Lys Leu Thr Asn Thr Gly Leu
1025 1030 1035
Tyr Asn Leu Leu Ile Arg Cys Leu Arg Val Phe Cys Lys Lys Ala
1040 1045 1050
Leu Thr Ala Ser Glu Val Tyr Asn Phe Ala Tyr Thr Asp Leu Arg
1055 1060 1065
Val Val Tyr Arg Asp Ser Lys Val Arg Lys Leu Arg Tyr Tyr Asn
1070 1075 1080
Cys Ser Val Tyr Gly Ala Ser Leu Val Tyr Cys Lys Gly Gln Leu
1085 1090 1095
Thr Glu Thr Glu Val Leu Asp Phe Ala Phe Thr Asp Leu Thr Ile
1100 1105 1110
Val Tyr Arg Asp Ser Lys Val Ser Glu Phe Arg Trp Tyr Arg Tyr
1115 1120 1125
Ser Val Tyr Gly Thr Thr Leu Cys Val Glu Cys Lys Lys Thr Leu
1130 1135 1140
Gln Arg Ser Glu Val Tyr Asp Cys Gln Arg Pro Leu Cys Pro Gln
1145 1150 1155
Glu Lys Lys Arg His Val Asp Leu Asn Lys Arg Phe His Thr Leu
1160 1165 1170
His Glu Tyr Met Leu Asp Leu Gln Pro Glu Thr Thr Asp Leu Tyr
1175 1180 1185
Cys Tyr Glu Gln Pro Glu Thr Thr Asp Leu His Cys Tyr Glu Gln
1190 1195 1200
Leu Gly Asp Ser Ser Asp Glu Glu Asp Thr Gly Gly Leu Asp Gly
1205 1210 1215
Asp Glu Asp Glu Asp Glu Val Asp His Leu Gln Glu Gln Pro Gln
1220 1225 1230
Gln Ala Arg Arg Asp Glu Gln His Pro Cys Tyr Leu Ile Glu Thr
1235 1240 1245
Gln Cys Cys Arg Cys Glu Ser Leu Val Glu Glu Asn Asp Glu Ile
1250 1255 1260
Asp Gly Val Asn His Gln His Leu Pro Ala Arg Arg Ala Glu Pro
1265 1270 1275
Gln Arg His Thr Met Leu Cys Met Cys Cys Lys Cys Glu Ala Arg
1280 1285 1290
Ile Ala Gly Ser Gly Pro Gly Ala Ser Gly Lys Pro Ile Pro Asn
1295 1300 1305
Pro Leu Leu Gly Leu Asp Ser Thr Arg Thr
1310 1315
<210> 62
<211> 42783
<212> DNA
<213> 人工序列
<220>
<223> 病毒载体构建体
<400> 62
gtttaaacgc ggccgccagg cctacccact agtcaattcg ggaggatcga aacggcagat 60
cgcaaaaaac agtacataca gaaggagaca tgaacatgaa catcaaaaaa attgtaaaac 120
aagccacagt tctgactttt acgactgcac ttctggcagg aggagcgact caagccttcg 180
cgaaagaaaa taaccaaaaa gcatacaaag aaacgtacgg cgtctctcat attacacgcc 240
atgatatgct gcagatccct aaacagcagc aaaacgaaaa ataccaagtg cctcaattcg 300
atcaatcaac gattaaaaat attgagtctg caaaaggact tgatgtgtgg gacagctggc 360
cgctgcaaaa cgctgacgga acagtagctg aatacaacgg ctatcacgtt gtgtttgctc 420
ttgcgggaag cccgaaagac gctgatgaca catcaatcta catgttttat caaaaggtcg 480
gcgacaactc aatcgacagc tggaaaaacg cgggccgtgt ctttaaagac agcgataagt 540
tcgacgccaa cgatccgatc ctgaaagatc agacgcaaga atggtccggt tctgcaacct 600
ttacatctga cggaaaaatc cgtttattct acactgacta ttccggtaaa cattacggca 660
aacaaagcct gacaacagcg caggtaaatg tgtcaaaatc tgatgacaca ctcaaaatca 720
acggagtgga agatcacaaa acgatttttg acggagacgg aaaaacatat cagaacgttc 780
agcagtttat cgatgaaggc aattatacat ccggcgacaa ccatacgctg agagaccctc 840
actacgttga agacaaaggc cataaatacc ttgtattcga agccaacacg ggaacagaaa 900
acggatacca aggcgaagaa tctttattta acaaagcgta ctacggcggc ggcacgaact 960
tcttccgtaa agaaagccag aagcttcagc agagcgctaa aaaacgcgat gctgagttag 1020
cgaacggcgc cctcggtatc atagagttaa ataatgatta cacattgaaa aaagtaatga 1080
agccgctgat cacttcaaac acggtaactg atgaaatcga gcgcgcgaat gttttcaaaa 1140
tgaacggcaa atggtacttg ttcactgatt cacgcggttc aaaaatgacg atcgatggta 1200
ttaactcaaa cgatatttac atgcttggtt atgtatcaaa ctctttaacc ggcccttaca 1260
agccgctgaa caaaacaggg cttgtgctgc aaatgggtct tgatccaaac gatgtgacat 1320
tcacttactc tcacttcgca gtgccgcaag ccaaaggcaa caatgtggtt atcacaagct 1380
acatgacaaa cagaggcttc ttcgaggata aaaaggcaac atttgcgcca agcttcttaa 1440
tgaacatcaa aggcaataaa acatccgttg tcaaaaacag catcctggag caaggacagc 1500
tgacagtcaa ctaataacag caaaaagaaa atgccgatac ttcattggca ttttctttta 1560
tttctcaaca agatggtgaa ttgactagtg ggtagatcca caggacgggt gtggtcgcca 1620
tgatcgcgta gtcgatagtg gctccaagta gcgaagcgag caggactggg cggcggccaa 1680
agcggtcgga cagtgctccg agaacgggtg cgcatagaaa ttgcatcaac gcatatagcg 1740
ctagcagcac gccatagtga ctggcgatgc tgtcggaatg gacgatatcc cgcaagaggc 1800
ccggcagtac cggcataacc aagcctatgc ctacagcatc cagggtgacg gtgccgagga 1860
tgacgatgag cgcattgtta gatttcatac acggtgcctg actgcgttag caatttaact 1920
gtgataaact accgcattaa agcttatcga tgataagctg tcaaacatga gaattgatcc 1980
ggaaccctta atataacttc gtataatgta tgctatacga agttattagg tccctcgact 2040
atagggtcac cgtcgacagc gacacacttg catcggatgc agcccggtta acgtgccggc 2100
acggcctggg taaccaggta ttttgtccac ataaccgtgc gcaaaatgtt gtggataagc 2160
aggacacagc agcaatccac agcaggcata caaccgcaca ccgaggttac tccgttctac 2220
aggttacgac gacatgtcaa tacttgccct tgacaggcat tgatggaatc gtagtctcac 2280
gctgatagtc tgatcgacaa tacaagtggg accgtggtcc cagaccgata atcagaccga 2340
crayacgagt gggaycgtgg tcccagacta ataatcagac cgacgatacg agtgggaccg 2400
tggtcccaga ctaataatca gaccgacgat acgagtggga ccgtggtycc agwctratwa 2460
tcagaccgac gatacragtg gracmgtggk cccagasaka atawtcagrc cgagwtaygc 2520
wktckggcct gtaacaaagg acattaagta aagacagata mrmgtgrgac taaaacgtgg 2580
tcccagtctg attatcagac cgacgatacg agtgggaccg tggtcccaga ctaataatca 2640
gaccgacgat acgagtggga ccgtggtccc agactaataa tcagaccgac gatacgagtg 2700
ggaccgtggt cccagtctga ttatcagacc gacgatacaa gtggaacagt gggcccagag 2760
agaatattca ggccagttat gctttctggc ctgtaacaaa ggacattaag taaagacaga 2820
taaacgtaga ctaaaacgtg gtcgcatcag ggtgctggct tttcaagttc cttaagaatg 2880
gcctcaattt tctctataca ctcagttgga acacgagacc tgtccaggtt aagcaccatt 2940
ttatcgccct tatacaatac tgtcgctcca ggagcaaact gatgtcgtga gcttaaacta 3000
gttcttgatg cagatgacgt tttaagcaca gaagttaaaa gagtgataac ttcttcagct 3060
tcaaatatca ccccagcttt tttctgctca tgaaggttag atgcctgctg cttaagtaat 3120
tcctctttat ctgtaaaggc tttttgaagt gcatcacctg accgggcaga tagttcaccg 3180
gggtgagaaa aaagagcaac aactgattta ggcaatttgg cggtgttgat acagcgggta 3240
ataatcttac gtgaaatatt ttccgcatca gccagcgcag aaatatttcc agcaaattca 3300
ttctgcaatc ggcttgcata acgctgacca cgttcataag cacttgttgg gcgataatcg 3360
ttacccaatc tggataatgc agccatctgc tcatcatcca gctcgccaac cagaacacga 3420
taatcacttt cggtaagtgc agcagcttta cgacggcgac tcccatcggc aatttctatg 3480
acaccagata ctcttcgacc gaacgccggt gtctgttgac cagtcagtag aaaagaaggg 3540
atgagatcat ccagtgcgtc ctcagtaagc agctcctggt cacgttcatt acctgaccat 3600
acccgagagg tcttctcaac actatcaccc cggagcactt caagagtaaa cttcacatcc 3660
cgaccacata caggcaaagt aatggcatta ccgcgagcca ttactcctac gcgcgcaatt 3720
aacgaatcca ccatcggggc agctggtgtc gataacgaag tatcttcaac cggttgagta 3780
ttgagcgtat gttttggaat aacaggcgca cgcttcatta tctaatctcc cagcgtggtt 3840
taatcagacg atcgaaaatt tcattgcaga caggttccca aatagaaaga gcatttctcc 3900
aggcaccagt tgaagagcgt tgatcaatgg cctgttcaaa aacagttctc atccggatct 3960
gacctttacc aacttcatcc gtttcacgta caacattttt tagaaccatg cttccccagg 4020
catcccgaat ttgctcctcc atccacgggg actgagagcc attactattg ctgtatttgg 4080
taagcaaaat acgtacatca ggctcgaacc ctttaagatc aacgttcttg agcagatcac 4140
gaagcatatc gaaaaactgc agtgcggagg tgtagtcaaa caactcagca ggcgtgggaa 4200
caatcagcac atcagcagca catacgacat taatcgtgcc gatacccagg ttaggcgcgc 4260
tgtcaataac tatgacatca tagtcatgag caacagtttc aatggccagt cggagcatca 4320
ggtgtggatc ggtgggcagt ttaccttcat caaatttgcc cattaactca gtttcaatac 4380
ggtgcagagc cagacaggaa ggaataatgt caagccccgg ccagcaagtg ggctttattg 4440
cataagtgac atcgtccttt tccccaagat agaaaggcag gagagtgtct tctgcatgaa 4500
tatgaagatc tggtacccat ccgtgataca ttgaggctgt tccctggggg tcgttacctt 4560
ccacgagcaa aacacgtagc cccttcagag ccagatcctg agcaagatga acagaaactg 4620
aggttttgta aacgccacct ttatgggcag caaccccgat caccggtgga aatacgtctt 4680
cagcacgtcg caatcgcgta ccaaacacat cacgcatatg attaatttgt tcaattgtat 4740
aaccaacacg ttgctcaacc cgtcctcgaa tttccatatc cgggtgcggt agtcgccctg 4800
ctttctcggc atctctgata gcctgagaag aaaccccaac taaatccgct gcttcaccta 4860
ttctccagcg ccgggttatt ttcctcgctt ccgggctgtc atcattaaac tgtgcaatgg 4920
cgatagcctt cgtcatttca tgaccagcgt ttatgcactg gttaagtgtt tccatgagtt 4980
tcattctgaa catcctttaa tcattgcttt gcgttttttt attaaatctt gcaatttact 5040
gcaaagcaac aacaaaatcg caaagtcatc aaaaaaccgc aaagttgttt aaaataagag 5100
caacactaca aaaggagata agaagagcac atacctcagt cacttattat cactagcgct 5160
cgccgcagcc gtgtaaccga gcatagcgag cgaactggcg aggaagcaaa gaagaactgt 5220
tctgtcagat agctcttacg ctcagcgcaa gaagaaatat ccaccgtggg aaaaactcca 5280
ggtagaggta cacacgcgga tagccaattc agagtaataa actgtgataa tcaaccctca 5340
tcaatgatga cgaactaacc cccgatatca ggtcacatga cgaagggaaa gagaaggaaa 5400
tcaactgtga caaactgccc tcaaatttgg cttccttaaa aattacagtt caaaaagtat 5460
gagaaaatcc atgcaggctg aaggaaacag caaaactgtg acaaattacc ctcagtaggt 5520
cagaacaaat gtgacgaacc accctcaaat ctgtgacaga taaccctcag actatcctgt 5580
cgtcatggaa gtgatatcgc ggaaggaaaa tacgatatga gtcgtctggc ggcctttctt 5640
tttctcaatg tatgagaggc gcattggagt tctgctgttg atctcattaa cacagacctg 5700
caggaagcgg cggcggaagt caggcatacg ctggtaactt tgaggcagct ggtaacgctc 5760
tatgatccag tcgattttca gagagacgat gcctgagcca tccggcttac gatactgaca 5820
cagggattcg tataaacgca tggcatacgg attggtgatt tcttttgttt cactaagccg 5880
aaactgcgta aaccggttct gtaacccgat aaagaaggga atgagatatg ggttgatatg 5940
tacactgtaa agccctctgg atggactgtg cgcacgtttg ataaaccaag gaaaagattc 6000
atagcctttt tcatcgccgg catcctcttc agggcgataa aaaaccactt ccttccccgc 6060
gaaactcttc aatgcctgcc gtatatcctt actggcttcc gcagaggtca atccgaatat 6120
ttcagcatat ttagcaacat ggatctcgca gataccgtca tgttcctgta gggtgccatc 6180
agattttctg atctggtcaa cgaacagata cagcatacgt ttttgatccc gggagagact 6240
atatgccgcc tcagtgaggt cgtttgactg gacgattcgc gggctatttt tacgtttctt 6300
gtgattgata accgctgttt ccgccatgac agatccatgt gaagtgtgac aagtttttag 6360
attgtcacac taaataaaaa agagtcaata agcagggata actttgtgaa aaaacagctt 6420
cttctgaggg caatttgtca cagggttaag ggcaatttgt cacagacagg actgtcattt 6480
gagggtgatt tgtcacactg aaagggcaat ttgtcacaac accttctcta gaaccagcat 6540
ggataaaggc ctacaaggcg ctctaaaaaa gaagatctaa aaactataaa aaaaataatt 6600
ataaaaatat ccccgtggat aagtggataa ccccaaggga agttttttca ggcatcgtgt 6660
gtaagcagaa tatataagtg ctgttccctg gtgcttcctc gctcactcga gggcttcgcc 6720
ctgtcgctca actgcggcga gcactactgg ctgtaaaagg acagaccaca tcatggttct 6780
gtgttcatta ggttgttctg tccattgctg acataatccg ctccacttca acgtaacacc 6840
gcacgaagat ttctattgtt cctgaaggca tattcaaatc gttttcgtta ccgcttgcag 6900
gcatcatgac agaacactac ttcctataaa cgctacacag gctcctgaga ttaataatgc 6960
ggatctctac gataatggga gattttcccg actgtttcgt tcgcttctca gtggataaca 7020
gccagcttct ctgtttaaca gacaaaaaca gcatatccac tcagttccac atttccatat 7080
aaaggccaag gcatttattc tcaggataat tgtttcagca tcgcaaccgc atcagactcc 7140
ggcatcgcaa actgcacccg gtgccgggca gccacatcca gcgcaaaaac cttcgtgtag 7200
acttccgttg aactgatgga cttatgtccc atcaggcttt gcagaacttt cagcggtata 7260
ccggcataca gcatgtgcat cgcataggaa tggcggaacg tatgtggtgt gaccggaaca 7320
gagaacgtca caccgtcagc agcagcggcg gcaaccgcct ccccaatcca ggtcctgacc 7380
gttctgtccg tcacttccca gatccgcgct ttctctgtcc ttcctgtgcg acggttacgc 7440
cgctccatga gcttatcgcg aataaatacc tgtgacggaa gatcacttcg cagaataaat 7500
aaatcctggt gtccctgttg ataccgggaa gccctgggcc aacttttggc gaaaatgaga 7560
cgttgatcgg cacgtaagag gttccaactt tcaccataat gaaataagat cactaccggg 7620
cgtatttttt gagttatcga gattttcagg agctaaggaa gctaaaatgg agaaaaaaat 7680
cactggatat accaccgttg atatatccca atggcatcgt aaagaacatt ttgaggcatt 7740
tcagtcagtt gctcaatgta cctataacca gaccgttcag ctggatatta cggccttttt 7800
aaagaccgta aagaaaaata agcacaagtt ttatccggcc tttattcaca ttcttgcccg 7860
cctgatgaat gctcatccgg agttccgtat ggcaatgaaa gacggtgagc tggtgatatg 7920
ggatagtgtt cacccttgtt acaccgtttt ccatgagcaa actgaaacgt tttcatcgct 7980
ctggagtgaa taccacgacg atttccggca gtttctacac atatattcgc aagatgtggc 8040
gtgttacggt gaaaacctgg cctatttccc taaagggttt attgagaata tgtttttcgt 8100
ctcagccaat ccctgggtga gtttcaccag ttttgattta aacgtggcca atatggacaa 8160
cttcttcgcc cccgttttca ccatgggcaa atattatacg caaggcgaca aggtgctgat 8220
gccgctggcg attcaggttc atcatgccgt ttgtgatggc ttccatgtcg gcagaatgct 8280
taatgaatta caacagtact gcgatgagtg gcagggcggg gcgtaatttt tttaaggcag 8340
ttattggtgc ccttaaacgc ctggttgcta cgcctgaata agtgataata agcggatgaa 8400
tggcagaaat tcgatgataa gctgtcaaac atgagaattg gtcgacggcg cgccaaagct 8460
tgcatgcctg cagccgcgta acctggcaaa atcggttacg gttgagtaat aaatggatgc 8520
cctgcgtaag cggggcacat ttcattacct ctttctccgc acccgacata gataataact 8580
tcgtatagta tacattatac gaagttatct agtagactta atcgcgttta aacccatcat 8640
caataatata cctcaaactt tttgtgcgcg ttaatatgca aatgaggcgt ttgaatttgg 8700
gaagggagga aggtgattgg ccgagagaag ggcgaccgtt aggggcgggg cgagtgacgt 8760
tttgatgacg tgaccgcgag gaggagccag tttgcaagtt ctcgtgggaa aagtgacgtc 8820
aaacgaggtg tggtttgaac acggaaatac tcaattttcc cgcgctctct gacaggaaat 8880
gaggtgtttc taggcggatg caagtgaaaa cgggccattt tcgcgcgaaa actgaatgag 8940
gaagtgaaaa tctgagtaat ttcgcgttta tgacagggag gagtatttgc cgagggccga 9000
gtagactttg accgattacg tgggggtttc gattaccgtg tttttcacct aaatttccgc 9060
gtacggtgtc aaagtccggt gtttttacgt aggtgtcagc tgatcgccag ggtatttaaa 9120
cctgcgctct ccagtcaaga ggccactctt gagtgccagc gagaagagtt ttctcctccg 9180
cgcgcgagtc agatctacac tttgaaaggc gatcgctagc gacatcgatc acaagtttgt 9240
acaaaaaagc aggctccacc atgggaacca attcagtcga gcctttcact cattagatgc 9300
atgtcgttac ataacttacg gtaaatggcc cgcctggctg accgcccaac gacccccgcc 9360
cattgacgtc aataatgacg tatgttccca tagtaacgcc aatagggact ttccattgac 9420
gtcaatgggt ggagtattta cggtaaactg cccacttggc agtacatcaa gtgtatcata 9480
tgccaagtac gccccctatt gacgtcaatg acggtaaatg gcccgcctgg cattatgccc 9540
agtacatgac cttatgggac tttcctactt ggcagtacat ctacgtatta gtcatcgcta 9600
ttaccatggt gatgcggttt tggcagtaca tcaatgggcg tggatagcgg tttgactcac 9660
ggggatttcc aagtctccac cccattgacg tcaatgggag tttgttttgg caccaaaatc 9720
aacgggactt tccaaaatgt cgtaacaact ccgccccatt gacgcaaatg ggcggtaggc 9780
gtgtacggtg ggaggtctat ataagcagag ctctccctat cagtgataga gatctcccta 9840
tcagtgatag agatcgtcga cgagctcgtt tagtgaaccg tcagatcgcc tggagacgcc 9900
atccacgctg ttttgacctc catagaagac accgggaccg atccagcctc cggttaagct 9960
cggtaccgct agccgcgccg ccaccatgga tgctatgaag aggggcctgt gctgcgtgct 10020
gctgctgtgt ggcgccgtgt ttgtgtcccc cagccaggaa atccacgccc ggttcagaag 10080
aggcagcaag ctggccgacg aggacgagac agcctacgac agcggcaccg acctgatcga 10140
cttcatcgac gacagcgacg agaatgagaa cgactccgac accggcgagg acatggtgga 10200
tttcatcgac aacgacgaaa ccgacgaaga gagcaccgag agcgacctgg acggctttat 10260
cgacaactcc gcccagctgg ctgacagcga cagcaatgcc tgcgccttcc tgaaggctca 10320
gctggcagac gtgaacagca acgccgctgc ttttctgaag aactgcatcc tgctgtacgg 10380
cgctgccaac accggcaaga gcctgttcgg catgagcctg aactgcctgg tgctgtgcgg 10440
cccagccaat accggaaagt cctacttcgg catgtccctg aattgtctcg tgatctacgg 10500
cccacctaac acaggcaagt cctgctttgc catgtctctg tggccctacc tgcacagcag 10560
actggtggtg tttaccttcc ccaacccctt ctggccttac ctggaaagcc ggatcaccgt 10620
gttcgagttc cccaatgcct ttctgagata cctgcactcc cggatccacg tgctgcagtt 10680
tctgaacccc ttcaacgtgt gccaggacaa gatcctggaa cactacgaga acgacagcaa 10740
ggacattctg gaacattatg agaatgattc caaggacctg tgcgaccaca tctgcgatca 10800
catcgactac tggaagcaca tccggctgga atgcgccatc atgtacaagg cccggatcag 10860
actggaatgt gctattatgt ataaggctcg cgagatgggc ttccaccagt tcgacggcga 10920
catctgcaac accatgcact acaccaactg gatctatatc tgcgaggacg cccagtgcac 10980
cgtggtggaa ggccaggtgg acaagaaatg ggaggtgcac gctggcggcc aagtgatcct 11040
gtgtcctgag agcggccagc ggcggatcaa gaggcccaga agcgagaact gccaccccaa 11100
caagctgctg atcctgaagt gcctgcggta cagattcaag aagcactgca agctgagcag 11160
cacctggcac tggacctgcc acgacggcaa gcacaagtgg cattggacat gtcacgatgg 11220
gaaacacaag aacgccattg tgaccctgac ctactacgag gccgacaaga acgacctgaa 11280
cgcccagatc gagcactgga aactgatccg gatggaatgt gcaatcttct ataaggccaa 11340
agagctgggc atcagcatct gccaccaggt ggtgcctcca ctggccgcct ctaaagccaa 11400
agcctgccag gccatcgaac tgcagctggc cctggaagcc ctgaatgcca gcccctacga 11460
tgagtggacc ctgcagcaga ccagcctgga aatgtggctg gccgagcccc agtttaagaa 11520
gcacggcatc accatcaccg tgcagtacga caatgacaag gccaatacca tggattacac 11580
aaattggaaa gaaatctacg tgatcgtgtg ccccgccagc atcccctccg atgagatcag 11640
caccgaggaa gccgaccaca ttgattattg gaaagccatc aggcaggaaa acgccatctt 11700
cttcgccgcc agacaccagg tggtgcccgc cctgaatatc tgcaaggcca aggcctgtaa 11760
agccatcgag tggaacaccg agcccaagca ctgcttcaag aagggcggcc agcacatcga 11820
agtgtggttc gactacgtgg cctgggacag cgtgtactac tgcggcgacg atggctggtg 11880
caagaccgag gccgagaagt acggctgcaa gggcacctgg gaagtgcatt tcggcaacag 11940
catcgactgc aacgactcca tgtgcagcac cttcgacgac aacgtgtccg ccaccgagct 12000
cgtgaaggac catatcgact attggaagct gattcgcctg gaatgtgcca ttttttacaa 12060
ggccagacgg cggctgtcca gcgaccagga tcagtctcag ctcgtgacca agtaccccct 12120
gctgaagctg ctgtccagac cccccaacat gggcgtgaag gcccacggca agtgcatctg 12180
ggagaacaag gtgttcatcg tgcccaccct gtgccccgtg cctctggatc caacatatcc 12240
tctgctgaaa ctgctgacca cccagaccac cacccccgag aatacctccc tggtggaact 12300
gagagtgacc acccccaaga gcacagtcgt gatcaggctg cacctgacca ccagataccc 12360
actgctgtca ctgctgaaca gctacagcac cccccctcac cggatccctg ctccatgtcc 12420
ttgggctcct cagaggcccc ccatccctaa gccttctcca tgggccccta gaatccctgc 12480
cccttgcccc tgggcacctc ctagacctcc acactgtcca tgggtgcccc ctccacctcc 12540
tccaagacct tgggcccctt gcttcctgct gtgcttttgt gtgctgctgt gcgtgtgcct 12600
gctgatcaga cccctgctgc tgagtgtgtc cacctacctg aggcctctgc tgctgtctat 12660
cagcgtgtac gctcaggtgc tggtgctggt gctgctgctg tgggtgtcca tcggaagcct 12720
gctgcccagc gtgtgcatgt gtgcctatgc ctgggtgctg gtgttcgtgt acatcgtcgt 12780
gattaccagc cccgccaccg ccatcgtgta ccgggatggc aatccttacg ccgtgtgcga 12840
caagtgcctg aagttctaca gcaagatcag cgagtaccgg cactactgct acagcctgta 12900
cggcaccacc ctggaacagc agtacaacaa gcccctgtgc gatctgctga ttcggtgcat 12960
caacgtggtg tacagagact ccatccccca cgccgcctgc cacaagtgta tcgacttcta 13020
ctccagaatc agagagctgc ggcactacag cgactccgtg tacggcgata ccctggaaaa 13080
gctgaccaac actggcctgt acaacctgct gattagatgc ctgcgggtgt tctgcaagaa 13140
ggccctgaca gccagcgagg tgtacaactt cgcctacacc gatctgcggg tggtgtatcg 13200
ggacagcaaa gtgcggaagc tgaggtacta caactgctct gtgtatggcg ccagcctggt 13260
gtattgcaag ggacagctga ccgagacaga ggtgctggat ttcgccttca cagacctgac 13320
aatcgtgtat cgcgactcca aggtgtccga gttccggtgg tacagatatt ccgtgtatgg 13380
caccacactg tgcgtggaat gcaagaaaac cctgcagaga tctgaggtgt acgactgcca 13440
gcggccactg tgtccgcagg aaaagaaaag acacgtggac ctgaacaagc ggttccacac 13500
cctgcacgag tacatgctgg atctgcagcc cgagacaacc gacctgtact gctacgagca 13560
gcctgaaacc actgatctgc actgttatga gcagctggga gacagctccg atgaagagga 13620
cactggcggc ctggatgggg acgaggatga ggacgaagtg gaccatctgc aggaacagcc 13680
ccagcaggct agacgggacg aacagcaccc ttgctatctg atcgagacac agtgctgcag 13740
atgcgaatct ctggtggaag agaacgacga gatcgacggc gtgaaccacc agcatctgcc 13800
cgctagaagg gccgagcctc agagacacac catgctgtgt atgtgctgca agtgcgaggc 13860
cagaatcgcc ggctctggac ctggcgcctc tggcaagcct atccccaatc cactgctggg 13920
cctggactcc acccggacct gataagcggc cgctcgagca tgcatctaga gggccctatt 13980
ctatagtgtc acctaaatgc tagagctcgc tgatcagcct cgactgtgcc ttctagttgc 14040
cagccatctg ttgtttgccc ctcccccgtg ccttccttga ccctggaagg tgccactccc 14100
actgtccttt cctaataaaa tgaggaaatt gcatcgcatt gtctgagtag gtgtcattct 14160
attctggggg gtggggtggg gcaggacagc aagggggagg attgggaaga caatagcagg 14220
catgctgggg atgcggtggg ctctatggct tctgaggcgg aaagaaccag ctggggctcg 14280
aggggggatc gatcccgtcg agatatctag acccagcttt cttgtacaaa gtggtgatcg 14340
attcgacaga tcgcgatcgc agtgagtagt gttctggggc gggggaggac ctgcatgagg 14400
gccagaatga ctgaaatctg tgcttttctg tgtgttgcag catcatgagc ggaagcggct 14460
cctttgaggg aggggtattc agcccttatc tgacggggcg tctcccctcc tgggcgggag 14520
tgcgtcagaa tgtgatggga tccacggtgg acggccggcc cgtgcagccc gcgaactctt 14580
caaccctgac ctatgcaacc ctgagctctt cgtcggtgga cgcagctgcc gccgcagctg 14640
ctgcatccgc cgccagcgcc gtgcgcggaa tggccatggg cgccggctac tacggcactc 14700
tggtggccaa ctcgagttcc accaataatc ccgccagcct gaacgaggag aagctgctgc 14760
tgctgatggc ccagcttgag gccttgaccc agcgcctggg cgagctgacc cagcaggtgg 14820
ctcagctgca ggagcagacg cgggccgcgg ttgccacggt gaaatccaaa taaaaaatga 14880
atcaataaat aaacggagac ggttgttgat tttaacacag agtctgaatc tttatttgat 14940
ttttcgcgcg cggtaggccc tggaccaccg gtctcgatca ttgagcaccc ggtggatctt 15000
ttccaggacc cggtagaggt gggcttggat gttgaggtac atgggcatga gcccgtcccg 15060
ggggtggagg tagctccatt gcagggcctc gtgctcgggg gtggtgttgt aaatcaccca 15120
gtcatagcag gggcgcaggg cgtggtgttg cacaatatct ttgaggagga gactgatggc 15180
cacgggcagc cctttggtgt aggtgtttac aaatctgttg agctgggagg gatgcatgcg 15240
gggggagatg aggtgcatct tggcctggat cttgagattg gcgatgttac cgcccagatc 15300
ccgcctgggg ttcatgttgt gcaggaccac cagcacggtg tatccggtgc acttggggaa 15360
tttatcatgc aacttggaag ggaaggcgtg aaagaatttg gcgacgccct tgtgtccgcc 15420
caggttttcc atgcactcat ccatgatgat ggcaatgggc ccgtgggcgg cggcctgggc 15480
aaagacgttt cgggggtcgg acacatcata gttgtggtcc tgggtgaggt catcataggc 15540
cattttaatg aatttggggc ggagggtgcc ggactggggg acaaaggtac cctcgatccc 15600
gggggcgtag ttcccctcac agatctgcat ctcccaggct ttgagctcag agggggggat 15660
catgtccacc tgcggggcga taaagaacac ggtttccggg gcgggggaga tgagctgggc 15720
cgaaagcaag ttccggagca gctgggactt gccgcagccg gtggggccgt aaatgacccc 15780
gatgaccggc tgcaggtggt agttgaggga gagacagctg ccgtcctccc ggaggagggg 15840
ggccacctcg ttcatcatct cgcgcacgtg catgttctcg cgcaccagtt ccgccaggag 15900
gcgctctccc cccagagata ggagctcctg gagcgaggcg aagtttttca gcggcttgag 15960
tccgtcggcc atgggcattt tggagagggt ctgttgcaag agttccaagc ggtcccagag 16020
ctcggtgatg tgctctacgg catctcgatc cagcagacct cctcgtttcg cgggttggga 16080
cgactgcggg agtagggcac cagacgatgg gcgtccagcg cagccagggt ccggtccttc 16140
cagggccgca gcgtccgcgt cagggtggtc tccgtcacgg tgaaggggtg cgcgccgggc 16200
tgggcgcttg cgagggtgcg cttcaggctc atccggctgg tcgaaaaccg ctcccgatcg 16260
gcgccctgcg cgtcggccag gtagcaattg accatgagtt cgtagttgag cgcctcggcc 16320
gcgtggcctt tggcgcggag cttacctttg gaagtctgcc cgcaggcggg acagaggagg 16380
gacttgaggg cgtagagctt gggggcgagg aagacggaat cgggggcgta ggcgtccgcg 16440
ccgcagtggg cgcagacggt ctcgcactcc acgagccagg tgaggtcggg ctggtcgggg 16500
tcaaaaacca gtttcccgcc gttctttttg atgcgtttct tacctttggt ctccatgagc 16560
tcgtgtcccc gctgggtgac aaagaggctg tccgtgtccc cgtagaccga ctttatgggc 16620
cggtcctcga gcggtgtgcc gcggtcctcc tcgtagagga accccgccca ctccgagacg 16680
aaagcccggg tccaggccag cacgaaggag gccacgtggg acgggtagcg gtcgttgtcc 16740
accagcgggt ccactttttc cagggtatgc aaacacatgt ccccctcgtc cacatccagg 16800
aaggtgattg gcttgtaagt gtaggccacg tgaccggggg tcccggccgg gggggtataa 16860
aagggggcgg gcccctgctc gtcctcactg tcttccggat cgctgtccag gagcgccagc 16920
tgttggggta ggtattccct ctcgaaggcg ggcatgacct cggcactcag gttgtcagtt 16980
tctagaaacg aggaggattt gatattgacg gtgccagcgg agatgccttt caagagcccc 17040
tcgtccatct ggtcagaaaa gacgattttt ttgttgtcga gcttggtggc gaaggagccg 17100
tagagggcgt tggaaaggag cttggcgatg gagcgcatgg tctggttttt ttccttgtcg 17160
gcgcgctcct tggccgcgat gttgagctgc acgtactcgc gcgccacgca cttccattcg 17220
gggaagacgg tggtcatctc gtcgggcacg attctgacct gccaacctcg attatgcagg 17280
gtgatgaggt ccacactggt ggccacctcg ccgcgcaggg gctcgttggt ccagcagagg 17340
cggccgccct tgcgcgagca gaaggggggc agagggtcca gcatgacctc gtcggggggg 17400
tcggcatcga tggtgaagat gccgggcagg agatcggggt cgaagtagct gatggaagtg 17460
gccagatcgt ccagggaagc ttgccattcg cgcacggcca gcgcgcgctc gtagggactg 17520
aggggcgtgc cccagggcat ggggtgggtg agcgcggagg cgtacatgcc gcagatgtcg 17580
tagacgtaga ggggctcctc gaggatgccg atgtaggtgg ggtagcagcg ccccccgcgg 17640
atgctggcgc gcacgtagtc atacagctcg tgcgagggcg cgaggagccc cgggcccagg 17700
ttggtgcgac tgggcttttc ggcgcggtag acgatctggc gaaagatggc atgcgagttg 17760
gaggagatgg tgggcctttg gaagatgttg aagtgggcgt gggggaggcc gaccgagtcg 17820
cggatgaagt gggcgtagga gtcttgcagt ttggcgacga gctcggcggt gacgaggacg 17880
tccagagcgc agtagtcgag ggtctcctgg atgatgtcat acttgagctg gcccttttgt 17940
ttccacagct cgcggttgag aaggaactct tcgcggtcct tccagtactc ttcgaggggg 18000
aacccgtcct gatctgcacg gtaagagcct agcatgtaga actggttgac ggccttgtag 18060
gcgcagcagc ccttctccac ggggagggcg taggcctggg cggccttgcg cagggaggtg 18120
tgcgtgaggg cgaaggtgtc cctgaccatg accttgagga actggtgctt gaaatcgata 18180
tcgtcgcagc ccccctgctc ccagagctgg aagtccgtgc gcttcttgta ggcggggttg 18240
ggcaaagcga aagtaacatc gttgaaaagg atcttgcccg cgcggggcat aaagttgcga 18300
gtgatgcgga aaggctgggg cacctcggcc cggttgttga tgacctgggc ggcgagcacg 18360
atctcgtcga aaccgttgat gttgtggccc acgatgtaga gttccacgaa tcgcgggcgg 18420
cccttgacgt ggggcagctt cttgagctcc tcgtaggtga gctcgtcggg gtcgctgaga 18480
ccgtgctgct cgagcgccca gtcggcgaga tgggggttgg cgcggaggaa ggaagtccag 18540
agatccacgg ccagggcggt ttgcagacgg tcccggtact gacggaactg ctgcccgacg 18600
gccatttttt cgggggtgac gcagtagaag gtgcgggggt ccccgtgcca gcggtcccat 18660
ttgagctgga gggcgagatc gagggcgagc tcgacgaggc ggtcgtcccc tgagagtttc 18720
atgaccagca tgaaggggac gagctgcttg ccgaaggacc ccatccaggt gtaggtttcc 18780
acatcgtagg tgaggaagag cctttcggtg cgaggatgcg agccgatggg gaagaactgg 18840
atctcctgcc accaattgga ggaatggctg ttgatgtgat ggaagtagaa atgccgacgg 18900
cgcgccgaac actcgtgctt gtgtttatac aagcggccac agtgctcgca acgctgcacg 18960
ggatgcacgt gctgcacgag ctgtacctga gttcctttga cgaggaattt cagtgggaag 19020
tggagtcgtg gcgcctgcat ctcgtgctgt actacgtcgt ggtggtcggc ctggccctct 19080
tctgcctcga tggtggtcat gctgacgagc ccgcgcggga ggcaggtcca gacctcggcg 19140
cgagcgggtc ggagagcgag gacgagggcg cgcaggccgg agctgtccag ggtcctgaga 19200
cgctgcggag tcaggtcagt gggcagcggc ggcgcgcggt tgacttgcag gagtttttcc 19260
agggcgcgcg ggaggtccag atggtacttg atctccaccg cgccgttggt ggcgacgtcg 19320
atggcttgca gggtcccgtg cccctggggt gtgaccaccg tcccccgttt cttcttgggc 19380
ggctggggcg acgggggcgg tgcctcttcc atggttagaa gcggcggcga ggacgcgcgc 19440
cgggcggcag aggcggctcg gggcccggag gcaggggcgg caggggcacg tcggcgccgc 19500
gcgcgggtag gttctggtac tgcgcccgga gaagactggc gtgagcgacg acgcgacggt 19560
tgacgtcctg gatctgacgc ctctgggtga aggccacggg acccgtgagt ttgaacctga 19620
aagagagttc gacagaatca atctcggtat cgttgacggc ggcctgccgc aggatctctt 19680
gcacgtcgcc cgagttgtcc tggtaggcga tctcggtcat gaactgctcg atctcctcct 19740
cctgaaggtc tccgcgaccg gcgcgctcca cggtggccgc gaggtcgttg gagatgcggc 19800
ccatgagctg cgagaaggcg ttcatgcccg cctcgttcca gacgcggctg tagaccacga 19860
cgccctcggg atcgcgggcg cgcatgacca cctgggcgag gttgagctcc acgtggcgcg 19920
tgaagaccgc gtagttgcag aggcgctggt agaggtagtt gagcgtggtg gcgatgtgct 19980
cggtgacgaa gaaatacatg atccagcggc ggagcggcat ctcgctgacg tcgcccagcg 20040
cctccaagcg ttccatggcc tcgtaaaagt ccacggcgaa gttgaaaaac tgggagttgc 20100
gcgccgagac ggtcaactcc tcctccagaa gacggatgag ctcggcgatg gtggcgcgca 20160
cctcgcgctc gaaggccccc gggagttcct ccacttcctc ctcttcttcc tcctccacta 20220
acatctcttc tacttcctcc tcaggcggtg gtggtggcgg gggagggggc ctgcgtcgcc 20280
ggcggcgcac gggcagacgg tcgatgaagc gctcgatggt ctcgccgcgc cggcgtcgca 20340
tggtctcggt gacggcgcgc ccgtcctcgc ggggccgcag cgtgaagacg ccgccgcgca 20400
tctccaggtg gccggggggg tccccgttgg gcagggagag ggcgctgacg atgcatctta 20460
tcaattgccc cgtagggact ccgcgcaagg acctgagcgt ctcgagatcc acgggatctg 20520
aaaaccgttg aacgaaggct tcgagccagt cgcagtcgca aggtaggctg agcacggttt 20580
cttctgccgg gtcatgttgg ggagcggggc gggcgatgct gctggtgatg aagttgaaat 20640
aggcggttct gagacggcgg atggtggcga ggagcaccag gtctttgggc ccggcttgct 20700
ggatgcgcag acggtcggcc atgccccagg cgtggtcctg acacctggcc aggtccttgt 20760
agtagtcctg catgagccgc tccacgggca cctcctcctc gcccgcgcgg ccgtgcatgc 20820
gcgtgagccc gaagccgcgc tggggctgga cgagcgccag gtcggcgacg acgcgctcgg 20880
cgaggatggc ctgctggatc tgggtgaggg tggtctggaa gtcgtcaaag tcgacgaagc 20940
ggtggtaggc tccggtgttg atggtgtagg agcagttggc catgacggac cagttgacgg 21000
tctggtggcc cggacgcacg agctcgtggt acttgaggcg cgagtaggcg cgcgtgtcga 21060
agatgtagtc gttgcaggtg cgcaccaggt actggtagcc gatgaggaag tgcggcggcg 21120
gctggcggta gagcggccat cgctcggtgg cgggggcgcc gggcgcgagg tcctcgagca 21180
tggtgcggtg gtagccgtag atgtacctgg acatccaggt gatgccggcg gcggtggtgg 21240
aggcgcgcgg gaactcgcgg acgcggttcc agatgttgcg cagcggcagg aagtagttca 21300
tggtgggcac ggtctggccc gtgaggcgcg cgcagtcgtg gatgctctat acgggcaaaa 21360
acgaaagcgg tcagcggctc gactccgtgg cctggaggct aagcgaacgg gttgggctgc 21420
gcgtgtaccc cggttcgaat ctcgaatcag gctggagccg cagctaacgt ggtactggca 21480
ctcccgtctc gacccaagcc tgcaccaacc ctccaggata cggaggcggg tcgttttgca 21540
actttttttg gaggccggaa atgaaactag taagcgcgga aagcggccga ccgcgatggc 21600
tcgctgccgt agtctggaga agaatcgcca gggttgcgtt gcggtgtgcc ccggttcgag 21660
gccggccgga ttccgcggct aacgagggcg tggctgcccc gtcgtttcca agaccccata 21720
gccagccgac ttctccagtt acggagcgag cccctctttt gttttgtttg tttttgccag 21780
atgcatcccg tactgcggca gatgcgcccc caccaccctc caccgcaaca acagccccct 21840
cctccacagc cggcgcttct gcccccgccc cagcagcagc agcaacttcc agccacgacc 21900
gccgcggccg ccgtgagcgg ggctggacag acttctcagt atgatcacct ggccttggaa 21960
gagggcgagg ggctggcgcg cctgggggcg tcgtcgccgg agcggcaccc gcgcgtgcag 22020
atgaaaaggg acgctcgcga ggcctacgtg cccaagcaga acctgttcag agacaggagc 22080
ggcgaggagc ccgaggagat gcgcgcggcc cggttccacg cggggcggga gctgcggcgc 22140
ggcctggacc gaaagagggt gctgagggac gaggatttcg aggcggacga gctgacgggg 22200
atcagccccg cgcgcgcgca cgtggccgcg gccaacctgg tcacggcgta cgagcagacc 22260
gtgaaggagg agagcaactt ccaaaaatcc ttcaacaacc acgtgcgcac cctgatcgcg 22320
cgcgaggagg tgaccctggg cctgatgcac ctgtgggacc tgctggaggc catcgtgcag 22380
aaccccacca gcaagccgct gacggcgcag ctgttcctgg tggtgcagca tagtcgggac 22440
aacgaggcgt tcagggaggc gctgctgaat atcaccgagc ccgagggccg ctggctcctg 22500
gacctggtga acattctgca gagcatcgtg gtgcaggagc gcgggctgcc gctgtccgag 22560
aagctggcgg ccatcaactt ctcggtgctg agtctgggca agtactacgc taggaagatc 22620
tacaagaccc cgtacgtgcc catagacaag gaggtgaaga tcgacgggtt ttacatgcgc 22680
atgaccctga aagtgctgac cctgagcgac gatctggggg tgtaccgcaa cgacaggatg 22740
caccgcgcgg tgagcgccag caggcggcgc gagctgagcg accaggagct gatgcacagc 22800
ctgcagcggg ccctgaccgg ggccgggacc gagggggaga gctactttga catgggcgcg 22860
gacctgcact ggcagcccag ccgccgggcc ttggaggcgg caggcggtcc cccctacata 22920
gaagaggtgg acgatgaggt ggacgaggag ggcgagtacc tggaagactg atggcgcgac 22980
cgtatttttg ctagatgcaa caacagccac ctcctgatcc cgcgatgcgg gcggcgctgc 23040
agagccagcc gtccggcatt aactcctcgg acgattggac ccaggccatg caacgcatca 23100
tggcgctgac gacccgcaac cccgaagcct ttagacagca gccccaggcc aaccggctct 23160
cggccatcct ggaggccgtg gtgccctcgc gctccaaccc cacgcacgag aaggtcctgg 23220
ccatcgtgaa cgcgctggtg gagaacaagg ccatccgcgg cgacgaggcc ggcctggtgt 23280
acaacgcgct gctggagcgc gtggcccgct acaacagcac caacgtgcag accaacctgg 23340
accgcatggt gaccgacgtg cgcgaggccg tggcccagcg cgagcggttc caccgcgagt 23400
ccaacctggg atccatggtg gcgctgaacg ccttcctcag cacccagccc gccaacgtgc 23460
cccggggcca ggaggactac accaacttca tcagcgccct gcgcctgatg gtgaccgagg 23520
tgccccagag cgaggtgtac cagtccgggc cggactactt cttccagacc agtcgccagg 23580
gcttgcagac cgtgaacctg agccaggcgt tcaagaactt gcagggcctg tggggcgtgc 23640
aggccccggt cggggaccgc gcgacggtgt cgagcctgct gacgccgaac tcgcgcctgc 23700
tgctgctgct ggtggccccc ttcacggaca gcggcagcat caaccgcaac tcgtacctgg 23760
gctacctgat taacctgtac cgcgaggcca tcggccaggc gcacgtggac gagcagacct 23820
accaggagat cacccacgtg agccgcgccc tgggccagga cgacccgggc aatctggaag 23880
ccaccctgaa ctttttgctg accaaccggt cgcagaagat cccgccccag tacacgctca 23940
gcgccgagga ggagcgcatc ctgcgatacg tgcagcagag cgtgggcctg ttcctgatgc 24000
aggagggggc cacccccagc gccgcgctcg acatgaccgc gcgcaacatg gagcccagca 24060
tgtacgccag caaccgcccg ttcatcaata aactgatgga ctacttgcat cgggcggccg 24120
ccatgaactc tgactatttc accaacgcca tcctgaatcc ccactggctc ccgccgccgg 24180
ggttctacac gggcgagtac gacatgcccg accccaatga cgggttcctg tgggacgatg 24240
tggacagcag cgtgttctcc ccccgaccgg gtgctaacga gcgccccttg tggaagaagg 24300
aaggcagcga ccgacgcccg tcctcggcgc tgtccggccg cgagggtgct gccgcggcgg 24360
tgcccgaggc cgccagtcct ttcccgagct tgcccttctc gctgaacagt attcgcagca 24420
gcgagctggg caggatcacg cgcccgcgct tgctgggcga ggaggagtac ttgaatgact 24480
cgctgttgag acccgagcgg gagaagaact tccccaataa cgggatagag agcctggtgg 24540
acaagatgag ccgctggaag acgtatgcgc aggagcacag ggacgatccg tcgcaggggg 24600
ccacgagccg gggcagcgcc gcccgtaaac gccggtggca cgacaggcag cggggactga 24660
tgtgggacga tgaggattcc gccgacgaca gcagcgtgtt ggacttgggt gggagtggta 24720
acccgttcgc tcacctgcgc ccccgcatcg ggcgcatgat gtaagagaaa ccgaaaataa 24780
atgatactca ccaaggccat ggcgaccagc gtgcgttcgt ttcttctctg ttgttgtatc 24840
tagtatgatg aggcgtgcgt acccggaggg tcctcctccc tcgtacgaga gcgtgatgca 24900
gcaggcgatg gcggcggcgg cggcgatgca gcccccgctg gaggctcctt acgtgccccc 24960
gcggtacctg gcgcctacgg aggggcggaa cagcattcgt tactcggagc tggcaccctt 25020
gtacgatacc acccggttgt acctggtgga caacaagtcg gcggacatcg cctcgctgaa 25080
ctaccagaac gaccacagca acttcctgac caccgtggtg cagaacaatg acttcacccc 25140
cacggaggcc agcacccaga ccatcaactt tgacgagcgc tcgcggtggg gcggtcagct 25200
gaaaaccatc atgcacacca acatgcccaa cgtgaacgag ttcatgtaca gcaacaagtt 25260
caaggcgcgg gtgatggtct cccgcaagac ccccaacggg gtgacagtga cagatggtag 25320
tcaggatatc ttggagtatg aatgggtgga gtttgagctg cccgaaggca acttctcggt 25380
gaccatgacc atcgacctga tgaacaacgc catcatcgac aattacttgg cggtggggcg 25440
gcagaacggg gtcctggaga gcgatatcgg cgtgaagttc gacactagga acttcaggct 25500
gggctgggac cccgtgaccg agctggtcat gcccggggtg tacaccaacg aggccttcca 25560
ccccgatatt gtcttgctgc ccggctgcgg ggtggacttc accgagagcc gcctcagcaa 25620
cctgctgggc attcgcaaga ggcagccctt ccaggagggc ttccagatca tgtacgagga 25680
tctggagggg ggcaacatcc ccgcgctcct ggatgtcgac gcctatgaga aaagcaagga 25740
ggagagcgcc gccgcggcga ctgcagctgt agccaccgcc tctaccgagg tcaggggcga 25800
taattttgcc agccctgcag cagtggcagc ggccgaggcg gctgaaaccg aaagtaagat 25860
agtcattcag ccggtggaga aggatagcaa ggacaggagc tacaacgtgc tgccggacaa 25920
gataaacacc gcctaccgca gctggtacct ggcctacaac tatggcgacc ccgagaaggg 25980
cgtgcgctcc tggacgctgc tcaccacctc ggacgtcacc tgcggcgtgg agcaagtcta 26040
ctggtcgctg cccgacatga tgcaagaccc ggtcaccttc cgctccacgc gtcaagttag 26100
caactacccg gtggtgggcg ccgagctcct gcccgtctac tccaagagct tcttcaacga 26160
gcaggccgtc tactcgcagc agctgcgcgc cttcacctcg ctcacgcacg tcttcaaccg 26220
cttccccgag aaccagatcc tcgtccgccc gcccgcgccc accattacca ccgtcagtga 26280
aaacgttcct gctctcacag atcacgggac cctgccgctg cgcagcagta tccggggagt 26340
ccagcgcgtg accgttactg acgccagacg ccgcacctgc ccctacgtct acaaggccct 26400
gggcatagtc gcgccgcgcg tcctctcgag ccgcaccttc taaaaaatgt ccattctcat 26460
ctcgcccagt aataacaccg gttggggcct gcgcgcgccc agcaagatgt acggaggcgc 26520
tcgccaacgc tccacgcaac accccgtgcg cgtgcgcggg cacttccgcg ctccctgggg 26580
cgccctcaag ggccgcgtgc ggtcgcgcac caccgtcgac gacgtgatcg accaggtggt 26640
ggccgacgcg cgcaactaca cccccgccgc cgcgcccgtc tccaccgtgg acgccgtcat 26700
cgacagcgtg gtggccgacg cgcgccggta cgcccgcgcc aagagccggc ggcggcgcat 26760
cgcccggcgg caccggagca cccccgccat gcgcgcggcg cgagccttgc tgcgcagggc 26820
caggcgcacg ggacgcaggg ccatgctcag ggcggccaga cgcgcggctt caggcgccag 26880
cgccggcagg acccggagac gcgcggccac ggcggcggca gcggccatcg ccagcatgtc 26940
ccgcccgcgg cgagggaacg tgtactgggt gcgcgacgcc gccaccggtg tgcgcgtgcc 27000
cgtgcgcacc cgcccccctc gcacttgaag atgttcactt cgcgatgttg atgtgtccca 27060
gcggcgagga ggatgtccaa gcgcaaattc aaggaagaga tgctccaggt catcgcgcct 27120
gagatctacg gccccgcggt ggtgaaggag gaaagaaagc cccgcaaaat caagcgggtc 27180
aaaaaggaca aaaaggaaga agatgacgat ctggtggagt ttgtgcgcga gttcgccccc 27240
cggcggcgcg tgcagtggcg cgggcggaaa gtgcacccgg tgctgagacc cggcaccacc 27300
gtggtcttca cgcccggcga gcgctccggc agcgcttcca agcgctccta cgacgaggtg 27360
tacggggacg aggacatcct cgagcaggcg gccgagcgcc tgggcgagtt tgcttacggc 27420
aagcgcagcc gccccgccct gaaggaagag gcggtgtcca tcccgctgga ccacggcaac 27480
cccacgccga gcctcaagcc cgtgaccctg cagcaggtgc tgccgagcgc agcgccgcgc 27540
cgggggttca agcgcgaggg cgaggatctg taccccacca tgcagctgat ggtgcccaag 27600
cgccagaagc tggaagacgt gctggagacc atgaaggtgg acccggacgt gcagcccgag 27660
gtcaaggtgc ggcccatcaa gcaggtggcc ccgggcctgg gcgtgcagac cgtggacatc 27720
aagatcccca cggagcccat ggaaacgcag accgagccca tgatcaagcc cagcaccagc 27780
accatggagg tgcagacgga tccctggatg ccatcggctc ctagccgaag accccggcgc 27840
aagtacggcg cggccagcct gctgatgccc aactacgcgc tgcatccttc catcatcccc 27900
acgccgggct accgcggcac gcgcttctac cgcggtcata caaccagccg ccgccgcaag 27960
accaccaccc gccgccgccg tcgccgcaca gccgctgcat ctacccctgc cgccctggtg 28020
cggagagtgt accgccgcgg ccgcgcgcct ctgaccctac cgcgcgcgcg ctaccacccg 28080
agcatcgcca tttaaacttt cgcctgcttt gcagatggcc ctcacatgcc gcctccgcgt 28140
tcccattacg ggctaccgag gaagaaaacc gcgccgtaga aggctggcgg ggaacgggat 28200
gcgtcgccac caccatcggc ggcggcgcgc catcagcaag cggttggggg gaggcttcct 28260
gcccgcgctg atccccatca tcgccgcggc gatcggggcg atccccggca ttgcttccgt 28320
ggcggtgcag gcctctcagc gccactgaga cacttggaaa acatcttgta ataaaccaat 28380
ggactctgac gctcctggtc ctgtgatgtg ttttcgtaga cagatggaag acatcaattt 28440
ttcgtccctg gctccgcgac acggcacgcg gccgttcatg ggcacctgga gcgacatcgg 28500
caccagccaa ctgaacgggg gcgccttcaa ttggagcagt ctctggagcg ggcttaagaa 28560
tttcgggtcc acgcttaaaa cctatggcag caaggcgtgg aacagcacca cagggcaggc 28620
gctgagggat aagctgaaag agcagaactt ccagcagaag gtggtcgatg ggctcgcctc 28680
gggcatcaac ggggtggtgg acctggccaa ccaggccgtg cagcggcaga tcaacagccg 28740
cctggacccg gtgccgcccg ccggctccgt ggagatgccg caggtggagg aggagctgcc 28800
tcccctggac aagcggggcg agaagcgacc ccgccccgac gcggaggaga cgctgctgac 28860
gcacacggac gagccgcccc cgtacgagga ggcggtgaaa ctgggtctgc ccaccacgcg 28920
gcccatcgcg cccctggcca ccggggtgct gaaacccgaa agtaataagc ccgcgaccct 28980
ggacttgcct cctcccgctt cccgcccctc tacagtggct aagcccctgc cgccggtggc 29040
cgtggcccgc gcgcgacccg ggggctccgc ccgccctcat gcgaactggc agagcactct 29100
gaacagcatc gtgggtctgg gagtgcagag tgtgaagcgc cgccgctgct attaaaccta 29160
ccgtagcgct taacttgctt gtctgtgtgt gtatgtatta tgtcgccgct gtccgccaga 29220
aggaggagtg aagaggcgcg tcgccgagtt gcaagatggc caccccatcg atgctgcccc 29280
agtgggcgta catgcacatc gccggacagg acgcttcgga gtacctgagt ccgggtctgg 29340
tgcagttcgc ccgcgccaca gacacctact tcagtctggg gaacaagttt aggaacccca 29400
cggtggcgcc cacgcacgat gtgaccaccg accgcagcca gcggctgacg ctgcgcttcg 29460
tgcccgtgga ccgcgaggac aacacctact cgtacaaagt gcgctacacg ctggccgtgg 29520
gcgacaaccg cgtgctggac atggccagca cctactttga catccgcggc gtgctggatc 29580
ggggccctag cttcaaaccc tactccggca ccgcctacaa cagcctggct cccaagggag 29640
cgcccaattc cagccagtgg gagcaaaaaa aggcaggcaa tggtgacact atggaaacac 29700
acacatttgg tgtggcccca atgggcggtg agaatattac aatcgacgga ttacaaattg 29760
gaactgacgc tacagctgat caggataaac caatttatgc tgacaaaaca ttccagcctg 29820
aacctcaagt aggagaagaa aattggcaag aaactgaaag cttttatggc ggtagggctc 29880
ttaaaaaaga cacaagcatg aaaccttgct atggctccta tgctagaccc accaatgtaa 29940
agggaggtca agctaaactt aaagttggag ctgatggagt tcctaccaaa gaatttgaca 30000
tagacctggc tttctttgat actcccggtg gcacagtgaa tggacaagat gagtataaag 30060
cagacattgt catgtatacc gaaaacacgt atctggaaac tccagacacg catgtggtat 30120
acaaaccagg caaggatgat gcaagttctg aaattaacct ggttcagcag tccatgccca 30180
atagacccaa ctatattggg ttcagagaca actttattgg gctcatgtat tacaacagta 30240
ctggcaatat gggggtgctg gctggtcagg cctcacagct gaatgctgtg gtcgacttgc 30300
aagacagaaa caccgagctg tcataccagc tcttgcttga ctctttgggt gacagaaccc 30360
ggtatttcag tatgtggaat caggcggtgg acagttatga tcctgatgtg cgcattattg 30420
aaaaccatgg tgtggaagac gaacttccca actattgctt ccccctggat gggtctggca 30480
ctaatgccgc ttaccaaggt gtgaaagtaa aaaatggtaa cgatggtgat gttgagagcg 30540
aatgggaaaa tgatgatact gtcgcagctc gaaatcaatt atgcaagggc aacatttttg 30600
ccatggaaat taacctccaa gccaacctgt ggagaagttt cctctactcg aacgtggccc 30660
tgtacctgcc cgactcttac aagtacacgc cagccaacat caccctgccc accaacacca 30720
acacttatga ttacatgaac gggagagtgg tgcctccctc gctggtggac gcctacatca 30780
acatcggggc gcgctggtcg ctggacccca tggacaacgt caatcccttc aaccaccacc 30840
gcaacgcggg cctgcgctac cgctccatgc tcctgggcaa cgggcgctac gtgcccttcc 30900
acatccaggt gccccagaaa tttttcgcca tcaagagcct cctgctcctg cccgggtcct 30960
acacctacga gtggaacttc cgcaaggacg tcaacatgat cctgcagagc tccctcggca 31020
acgacctgcg cacggacggg gcctccatct ccttcaccag catcaacctc tacgccacct 31080
tcttccccat ggcgcacaac acggcctcca cgctcgaggc catgctgcgc aacgacacca 31140
acgaccagtc cttcaacgac tacctctcgg cggccaacat gctctacccc atcccggcca 31200
acgccaccaa cgtgcccatc tccatcccct cgcgcaactg ggccgccttc cgcggctggt 31260
ccttcacgcg cctcaagacc aaggagacgc cctcgctggg ctccgggttc gacccctact 31320
tcgtctactc gggctccatc ccctacctcg acggcacctt ctacctcaac cacaccttca 31380
agaaggtctc catcaccttc gactcctccg tcagctggcc cggcaacgac cggctcctga 31440
cgcccaacga gttcgaaatc aagcgcaccg tcgacggcga gggatacaac gtggcccagt 31500
gcaacatgac caaggactgg ttcctggtcc agatgctggc ccactacaac atcggctacc 31560
agggcttcta cgtgcccgag ggctacaagg accgcatgta ctccttcttc cgcaacttcc 31620
agcccatgag ccgccaggtg gtggacgagg tcaactacaa ggactaccag gccgtcaccc 31680
tggcctacca gcacaacaac tcgggcttcg tcggctacct cgcgcccacc atgcgccagg 31740
gccagcccta ccccgccaac tacccgtacc cgctcatcgg caagagcgcc gtcaccagcg 31800
tcacccagaa aaagttcctc tgcgacaggg tcatgtggcg catccccttc tccagcaact 31860
tcatgtccat gggcgcgctc accgacctcg gccagaacat gctctatgcc aactccgccc 31920
acgcgctaga catgaatttc gaagtcgacc ccatggatga gtccaccctt ctctatgttg 31980
tcttcgaagt cttcgacgtc gtccgagtgc accagcccca ccgcggcgtc atcgaggccg 32040
tctacctgcg cacccccttc tcggccggta acgccaccac ctaaattgct acttgcatga 32100
tggctgagcc cacaggctcc ggcgagcagg agctcagggc catcatccgc gacctgggct 32160
gcgggcccta cttcctgggc accttcgata agcgcttccc gggattcatg gccccgcaca 32220
agctggcctg cgccatcgtc aacacggccg gccgcgagac cgggggcgag cactggctgg 32280
ccttcgcctg gaacccgcgc tcgaacacct gctacctctt cgaccccttc gggttctcgg 32340
acgagcgcct caagcagatc taccagttcg agtacgaggg cctgctgcgc cgtagcgccc 32400
tggccaccga ggaccgctgc gtcaccctgg aaaagtccac ccagaccgtg cagggtccgc 32460
gctcggccgc ctgcgggctc ttctgctgca tgttcctgca cgccttcgtg cactggcccg 32520
accgccccat ggacaagaac cccaccatga acttgctgac gggggtgccc aacggcatgc 32580
tccagtcgcc ccaggtggaa cccaccctgc gccgcaacca ggaggcgctc taccgcttcc 32640
tcaactccca ctccgcctac tttcgctccc accgcgcgcg catcgagaag gccaccgcct 32700
tcgaccgcat gaacaatcaa gacatgtaaa ccgtgtgtgt atgtttaaaa tatcttttaa 32760
taaacagcac tttaatgtta cacatgcatc tgagatgatt ttattttaga aatcgaaagg 32820
gttctgccgg gtctcggcat ggcccgcggg cagggacacg ttgcggaact ggtacttggc 32880
cagccacttg aactcgggga tcagcagttt gggcagcggg gtgtcgggga aggagtcggt 32940
ccacagcttc cgcgtcagct gcagggcgcc cagcaggtcg ggcgcggaga tcttgaaatc 33000
gcagttggga cccgcgttct gcgcgcgaga gttgcggtac acggggttgc agcactggaa 33060
caccatcagg gccgggtgct tcacgctcgc cagcaccgcc gcgtcggtga tgctctccac 33120
gtcgaggtcc tcggcgttgg ccatcccgaa gggggtcatc ttgcaggtct gccttcccat 33180
ggtgggcacg cacccgggct tgtggttgca atcgcagtgc agggggatca gcatcatctg 33240
ggcctggtcg gcgttcatcc ccgggtacat ggccttcatg aaagcctcca attgcctgaa 33300
cgcctgctgg gccttggctc cctcggtgaa gaagaccccg caggacttgc tagagaactg 33360
gttggtggca cagccggcat cgtgcacgca gcagcgcgcg tcgttgttgg ccagctgcac 33420
cacgctgcgc ccccagcggt tctgggtgat cttggcccgg tcggggttct ccttcagcgc 33480
gcgctgcccg ttctcgctcg ccacatccat ctcgatcatg tgctccttct ggatcatggt 33540
ggtcccgtgc aggcaccgca gtttgccctc ggcctcggtg cacccgtgca gccacagcgc 33600
gcacccggtg cactcccagt tcttgtgggc gatctgggaa tgcgcgtgca cgaacccttg 33660
caggaagcgg cccatcatgg tcgtcagggt cttgttgcta gtgaaggtca acgggatgcc 33720
gcggtgctcc tcgttgatgt acaggtggca gatgcggcgg tacacctcgc cctgctcggg 33780
catcagttgg aagttggctt tcaggtcggt ctccacgcgg tagcggtcca tcagcatagt 33840
catgatttcc atgcccttct cccaggccga gacgatgggc aggctcatag ggttcttcac 33900
catcatctta gcactagcag ccgcggccag ggggtcgctc tcatccaggg tctcaaagct 33960
ccgcttgccg tccttctcgg tgatccgcac cggggggtag ctgaagccca cggccgccag 34020
ctcctcctcg gcctgtcttt cgtcctcgct gtcctggctg acgtcctgca tgaccacatg 34080
cttggtcttg cggggtttct tcttgggcgg cagtggcggc ggagatgctt gtggcgaggg 34140
ggagcgcgag ttctcgctca ccactactat ctcttcctct tcttggtccg aggccacgcg 34200
gcggtaggta tgtctcttcg ggggcagagg cggaggcgac gggctctcgc cgccgcgact 34260
tggcggatgg ctggcagagc cccttccgcg ttcgggggtg cgctcccggc ggcgctctga 34320
ctgacttcct ccgcggccgg ccattgtgtt ctcctaggga ggaacaacaa gcatggagac 34380
tcagccatcg ccaacctcgc catctgcccc caccgccggc gacgagaagc agcagcagca 34440
gaatgaaagc ttaaccgccc cgccgcccag ccccgcctcc gacgcagccg cggtcccaga 34500
catgcaagag atggaggaat ccatcgagat tgacctgggc tatgtgacgc ccgcggagca 34560
tgaggaggag ctggcagtgc gctttcaatc gtcaagccag gaagataaag aacagccaga 34620
gcaggaagca gagaacgagc agagtcaggc tgggctcgag catggcgact acctccacct 34680
gagcggggag gaggacgcgc tcatcaagca tctggcccgg caggccacca tcgtcaagga 34740
cgcgctgctc gaccgcaccg aggtgcccct cagcgtggag gagctcagcc gcgcctacga 34800
gctcaacctc ttctcgccgc gcgtgccccc caagcgccag cccaacggca cctgcgagcc 34860
caacccccgc ctcaacttct acccggtctt cgcggtgccc gaggccctgg ccacctacca 34920
catctttttc aagaaccaaa agatccccgt ctcctgccgc gccaaccgca cccgcgccga 34980
cgccctcttc aacctgggtc ccggcgcccg cctacctgat atcgcctcct tggaagaggt 35040
tcccaagatc ttcgagggtc tgggcagcga cgagactcgg gccgcgaacg ctctgcaagg 35100
agaaggagga ggagagcatg agcaccacag cgccctggtc gagttggaag gcgacaacgc 35160
gcggctggcg gtgctcaaac gcacggtcga gctgacccat ttcgcctacc cggctctgaa 35220
cctgcccccg aaagtcatga gcgcggtcat ggaccaggtg ctcatcaagc gcgcgtcgcc 35280
catctccgag gacgagggca tgcaagactc cgaggagggc aagcccgtgg tcagcgacga 35340
gcagctggcc cggtggctgg gtcctaatgc tacccctcaa agtttggaag agcggcgcaa 35400
gctcatgatg gccgtggtcc tggtgaccgt ggagctggag tgcctgcgcc gcttcttcgc 35460
cgacgcggag accctgcgca aggtcgagga gaacctgcac tacctcttca ggcacgggtt 35520
cgtgcgccag gcctgcaaga tctccaacgt ggagctgacc aacctggtct cctacatggg 35580
catcttgcac gagaaccgcc tggggcagaa cgtgctgcac accaccctgc gcggggaggc 35640
ccgccgcgac tacatccgcg actgcgtcta cctctacctc tgccacacct ggcagacggg 35700
catgggcgtg tggcagcagt gtctggagga gcagaacctg aaagagctct gcaagctcct 35760
gcaaaagaac ctcaagggtc tgtggaccgg gttcgacgag cggaccaccg cctcggacct 35820
ggccgacctc atcttccccg agcgcctcag gctgacgctg cgcaacggcc tgcccgactt 35880
tatgagccaa agcatgttgc aaaactttcg ctctttcatc ctcgaacgct ccggaatcct 35940
gcccgccacc tgctccgcgc tgccctcgga cttcgtgccg ctgaccttcc gcgagtgccc 36000
cccgccgctg tggagccact gctacctgct gcgcctggcc aactacctgg cctaccactc 36060
ggacgtgatc gaggacgtca gcggcgaggg cctgctcgag tgccactgcc gctgcaacct 36120
ctgcacgccg caccgctccc tggcctgcaa cccccagctg ctgagcgaga cccagatcat 36180
cggcaccttc gagttgcaag ggcccagcga gggcgaggga gccaaggggg gtctgaaact 36240
caccccgggg ctgtggacct cggcctactt gcgcaagttc gtgcccgagg attaccatcc 36300
cttcgagatc aggttctacg aggaccaatc ccagccgccc aaggccgagc tgtcggcctg 36360
cgtcatcacc cagggggcga tcctggccca attgcaagcc atccagaaat cccgccaaga 36420
attcttgctg aaaaagggcc gcggggtcta cctcgacccc cagaccggtg aggagctcaa 36480
ccccggcttc ccccaggatg ccccgaggaa acaagaagct gaaagtggag ctgccgcccg 36540
tggaggattt ggaggaagac tgggagaaca gcagtcaggc agaggagatg gaggaagact 36600
gggacagcac tcaggcagag gaggacagcc tgcaagacag tctggaggaa gacgaggagg 36660
aggcagagga ggaggtggaa gaagcagccg ccgccagacc gtcgtcctcg gcgggggaga 36720
aagcaagcag cacggatacc atctccgctc cgggtcgggg tcccgctcgg ccccacagta 36780
gatgggacga gaccgggcga ttcccgaacc ccaccaccca gaccggtaag aaggagcggc 36840
agggatacaa gtcctggcgg gggcacaaaa acgccatcgt ctcctgcttg caggcctgcg 36900
ggggcaacat ctccttcacc cggcgctacc tgctcttcca ccgcggggtg aacttccccc 36960
gcaacatctt gcattactac cgtcacctcc acagccccta ctacttccaa gaagaggcag 37020
cagcagcaga aaaagaccag aaaaccagct agaaaatcca cagcggcggc agcggcaggt 37080
ggactgagga tcgcggcgaa cgagccggcg cagacccggg agctgaggaa ccggatcttt 37140
cccaccctct atgccatctt ccagcagagt cgggggcagg agcaggaact gaaagtcaag 37200
aaccgttctc tgcgctcgct cacccgcagt tgtctgtatc acaagagcga agaccaactt 37260
cagcgcactc tcgaggacgc cgaggctctc ttcaacaagt actgcgcgct cactcttaaa 37320
gagtagcccg cgcccgccca gtcgcagaaa aaggcgggaa ttacgtcacc tgtgcccttc 37380
gccctagccg cctccaccca gcaccgccat gagcaaagag attcccacgc cttacatgtg 37440
gagctaccag ccccagatgg gcctggccgc cggcgccgcc caggactact ccacccgcat 37500
gaattggctc agcgccgggc ccgcgatgat ctcacgggtg aatgacatcc gcgcccaccg 37560
aaaccagata ctcctagaac agtcagcgct caccgccacg ccccgcaatc acctcaatcc 37620
gcgtaattgg cccgccgccc tggtgtacca ggaaattccc cagcccacga ccgtactact 37680
tccgcgagac gcccaggccg aagtccagct gactaactca ggtgtccagc tggcgggcgg 37740
cgccaccctg tgtcgtcacc gccccgctca gggtataaag cggctggtga tccggggcag 37800
aggcacacag ctcaacgacg aggtggtgag ctcttcgctg ggtctgcgac ctgacggagt 37860
cttccaactc gccggatcgg ggagatcttc cttcacgcct cgtcaggcgg tcctgacttt 37920
ggagagttcg tcctcgcagc cccgctcggg cggcatcggc actctccagt tcgtggagga 37980
gttcactccc tcggtctact tcaacccctt ctccggctcc cccggccact acccggacga 38040
gttcatcccg aactttgacg ccatcagcga gtcggtggac ggctacgatt gattaattaa 38100
tcaactaacc ccttacccct ttaccctcca gtaaaaataa agattaaaaa tgattgaatt 38160
gatcaataaa gaatcactta cttgaaatct gaaaccaggt ctctgtccat gttttctgtc 38220
agcagcactt cactcccctc ttcccaactc tggtactgca ggccccggcg ggctgcaaac 38280
ttcctccaca ctctgaaggg gatgtcaaat tcctcctgtc cctcaatctt catttttatc 38340
ttctatcaga tgtccaaaaa gcgcgcgcgg gtggatgatg gcttcgaccc cgtgtacccc 38400
tacgatgcag acaacgcacc gactgtgccc ttcatcaacc ctcccttcgt ctcttcagat 38460
ggattccaag aaaagcccct gggggtgttg tccctgcgac tggccgaccc cgtcaccacc 38520
aagaatgggg ctgtcaccct caagctgggg gagggggtgg acctcgacga ctcgggaaaa 38580
ctcatctcca aaaatgccac caaggccact gcccctctca gtatttccaa cggcaccatt 38640
tcccttaaca tggctgcccc tttttacaac aacaatggaa cgttaagtct caatgtttct 38700
acaccattag cagtatttcc cacttttaac actttaggta tcagtcttgg aaacggtctt 38760
caaacttcta ataagttgct gactgtacag ttaactcatc ctcttacatt cagctcaaat 38820
agcatcacag taaaaacaga caaaggactc tatattaatt ctagtggaaa cagagggctt 38880
gaggctaaca taagcctaaa aagaggactg atttttgatg gtaatgctat tgcaacatac 38940
cttggaagtg gtttagacta tggatcctat gatagcgatg ggaaaacaag acccatcatc 39000
accaaaattg gagcaggttt gaattttgat gctaataatg ccatggctgt gaagctaggc 39060
acaggtttaa gttttgactc tgccggtgcc ttaacagctg gaaacaaaga ggatgacaag 39120
ctaacacttt ggactacacc tgacccaagc cctaattgtc aattactttc agacagagat 39180
gccaaattta ccctatgtct tacaaaatgc ggtagtcaaa tactaggcac tgttgcagta 39240
gctgctgtta ctgtaggttc agcactaaat ccaattaatg acacagtaaa aagcgccata 39300
gtattcctta gatttgactc tgacggtgtg ctcatgtcaa actcatcaat ggtaggtgat 39360
tactggaact ttagggaagg acagaccacc caaagtgtgg cctatacaaa tgctgtggga 39420
ttcatgccca atctaggtgc atatcctaaa acccaaagca aaacaccaaa aaatagtata 39480
gtaagtcagg tatatttaaa tggagaaact actatgccaa tgacactgac aataactttc 39540
aatggcactg atgaaaaaga cacaacacct gtgagcactt actccatgac ttttacatgg 39600
cagtggactg gagactataa ggacaagaat attacctttg ctaccaactc ctttactttc 39660
tcctacatgg cccaagaata aaccctgcat gccaacccca ttgttcccac cactatggaa 39720
aactctgaag cagaaaaaaa taaagttcaa gtgttttatt gattcaacag ttttctcaca 39780
gaaccctagt attcaacctg ccacctccct cccaacacac agagtacaca gtcctttctc 39840
cccggctggc cttaaaaagc atcatatcat gggtaacaga catattctta ggtgttatat 39900
tccacacggt ttcctgtcga gccaaacgct catcagtgat attaataaac tccccgggca 39960
gctcacttaa gttcatgtcg ctgtccagct gctgagccac aggctgctgt ccaacttgcg 40020
gttgcttaac gggcggcgaa ggagaagtcc acgcctacat gggggtagag tcataatcgt 40080
gcatcaggat agggcggtgg tgctgcagca gcgcgcgaat aaactgctgc cgccgccgct 40140
ccgtcctgca ggaatacaac atggcagtgg tctcctcagc gatgattcgc accgcccgca 40200
gcataaggcg ccttgtcctc cgggcacagc agcgcaccct gatctcactt aaatcagcac 40260
agtaactgca gcacagcacc acaatattgt tcaaaatccc acagtgcaag gcgctgtatc 40320
caaagctcat ggcggggacc acagaaccca cgtggccatc ataccacaag cgcaggtaga 40380
ttaagtggcg acccctcata aacacgctgg acataaacat tacctctttt ggcatgttgt 40440
aattcaccac ctcccggtac catataaacc tctgattaaa catggcgcca tccaccacca 40500
tcctaaacca gctggccaaa acctgcccgc cggctataca ctgcagggaa ccgggactgg 40560
aacaatgaca gtggagagcc caggactcgt aaccatggat catcatgctc gtcatgatat 40620
caatgttggc acaacacagg cacacgtgca tacacttcct caggattaca agctcctccc 40680
gcgttagaac catatcccag ggaacaaccc attcctgaat cagcgtaaat cccacactgc 40740
agggaagacc tcgcacgtaa ctcacgttgt gcattgtcaa agtgttacat tcgggcagca 40800
gcggatgatc ctccagtatg gtagcgcggg tttctgtctc aaaaggaggt agacgatccc 40860
tactgtacgg agtgcgccga gacaaccgag atcgtgttgg tcgtagtgtc atgccaaatg 40920
gaacgccgga cgtagtcata tttcctgaag caaaaccagg tgcgggcgtg acaaacagat 40980
ctgcgtctcc ggtctcgccg cttagatcgc tctgtgtagt agttgtagta tatccactct 41040
ctcaaagcat ccaggcgccc cctggcttcg ggttctatgt aaactccttc atgcgccgct 41100
gccctgataa catccaccac cgcagaataa gccacaccca gccaacctac acattcgttc 41160
tgcgagtcac acacgggagg agcgggaaga gctggaagaa ccatgattaa ctttattcca 41220
aacggtctcg gagcacttca aaatgcaggt cccggaggtg gcacctctcg cccccactgt 41280
gttggtggaa aataacagcc aggtcaaagg tgacacggtt ctcgagatgt tccacggtgg 41340
cttccagcaa agcctccacg cgcacatcca gaaacaagag gacagcgaaa gcgggagcgt 41400
tttctaattc ctcaatcatc atattacact cctgcaccat ccccagataa ttttcatttt 41460
tccagccttg aatgattcgt attagttcct gaggtaaatc caagccagcc atgataaaaa 41520
gctcgcgcag agcgccctcc accggcattc ttaagcacac cctcataatt ccaagagatt 41580
ctgctcctgg ttcacctgca gcagattaac aatgggaata tcaaaatctc tgccgcgatc 41640
cctaagctcc tccctcaaca ataactgtat gtaatctttc atatcatctc cgaaattttt 41700
agccataggg ccgccaggaa taagagcagg gcaagccaca ttacagataa agcgaagtcc 41760
tccccagtgw gcattgccaa atgtaagatt gaaataagca tgctggctag accctgtgat 41820
atcttccaga taactggaca gaaaatcagg caagcaattt ttaagaaaat caacaaaaga 41880
aaagtcgtcc aggtgcaggt ttagagcctc aggaacaacg atggaataag tgcaaggagt 41940
gcgttccagc atggttagtg tttttttggt gatctgtaga acaaaaaata aacatgcaat 42000
attaaaccat gctagcctgg cgaacaggtg ggtaaatcac tctttccagc accaggcagg 42060
ctacggggtc tccggcgcga ccctcgtaga agctgtcgcc atgattgaaa agcatcaccg 42120
agagaccttc ccggtggccg gcatggatga ttcgagaaga agcatacact ccgggaacat 42180
tggcatccgt gagtgaaaaa aagcgaccta taaagcctcg gggcactaca atgctcaatc 42240
tcaattccag caaagccacc ccatgcggat ggagcacaaa attggcaggt gcgtaaaaaa 42300
tgtaattact cccctcctgc acaggcagca aagcccccgc tccctccaga aacacataca 42360
aagcctcagc gtccatagct taccgagcac ggcaggcgca agagtcagag aaaaggctga 42420
gctctaacct gactgcccgc tcctgtgctc aatatatagc cctaacctac actgacgtaa 42480
aggccaaagt ctaaaaatac ccgccaaaat gacacacacg cccagcacac gcccagaaac 42540
cggtgacaca ctcaaaaaaa tacgtgcgct tcctcaaacg cccaaaccgg cgtcatttcc 42600
gggttcccac gctacgtcac cgctcagcga ctttcaaatt ccgtcgaccg ttaaaaacgt 42660
cactcgcccc gcccctaacg gtcgcccttc tctcggccaa tcaccttcct cccttcccaa 42720
attcaaacgc ctcatttgca tattaacgcg cacaaaaagt ttgaggtata tatttgaatg 42780
atg 42783
<210> 63
<211> 32
<212> PRT
<213> 人工序列
<220>
<223> 肽佐剂
<400> 63
Met Asp Ala Met Lys Arg Gly Leu Cys Cys Val Leu Leu Leu Cys Gly
1 5 10 15
Ala Val Phe Val Ser Pro Ser Gln Glu Ile His Ala Arg Phe Arg Arg
20 25 30
<210> 64
<211> 27
<212> PRT
<213> 人工序列
<220>
<223> 肽佐剂
<400> 64
Ser Leu Leu Trp Gly Gly Val Thr Val Leu Ala Ala Met Leu Ile Ala
1 5 10 15
Gly Gln Val Ala Ser Ser Val Val Phe Leu Val
20 25
<210> 65
<211> 3849
<212> DNA
<213> 人工序列
<220>
<223> 疫苗构建体
<400> 65
gacgaggacg agacagccta cgacagcggc accgacctga tcgacttcat cgacgacagc 60
gacgagaatg agaacgactc cgacaccggc gaggacatgg tggatttcat cgacaacgac 120
gaaaccgacg aagagagcac cgagagcgac ctggacggct ttatcgacaa ctccgcccag 180
ctggctgaca gcgacagcaa tgcctgcgcc ttcctgaagg ctcagctggc agacgtgaac 240
agcaacgccg ctgcttttct gaagaactgc atcctgctgt acggcgctgc caacaccggc 300
aagagcctgt tcggcatgag cctgaactgc ctggtgctgt gcggcccagc caataccgga 360
aagtcctact tcggcatgtc cctgaattgt ctcgtgatct acggcccacc taacacaggc 420
aagtcctgct ttgccatgtc tctgtggccc tacctgcaca gcagactggt ggtgtttacc 480
ttccccaacc ccttctggcc ttacctggaa agccggatca ccgtgttcga gttccccaat 540
gcctttctga gatacctgca ctcccggatc cacgtgctgc agtttctgaa ccccttcaac 600
gtgtgccagg acaagatcct ggaacactac gagaacgaca gcaaggacat tctggaacat 660
tatgagaatg attccaagga cctgtgcgac cacatctgcg atcacatcga ctactggaag 720
cacatccggc tggaatgcgc catcatgtac aaggcccgga tcagactgga atgtgctatt 780
atgtataagg ctcgcgagat gggcttccac cagttcgacg gcgacatctg caacaccatg 840
cactacacca actggatcta tatctgcgag gacgcccagt gcaccgtggt ggaaggccag 900
gtggacaaga aatgggaggt gcacgctggc ggccaagtga tcctgtgtcc tgagagcggc 960
cagcggcgga tcaagaggcc cagaagcgag aactgccacc ccaacaagct gctgatcctg 1020
aagtgcctgc ggtacagatt caagaagcac tgcaagctga gcagcacctg gcactggacc 1080
tgccacgacg gcaagcacaa gtggcattgg acatgtcacg atgggaaaca caagaacgcc 1140
attgtgaccc tgacctacta cgaggccgac aagaacgacc tgaacgccca gatcgagcac 1200
tggaaactga tccggatgga atgtgcaatc ttctataagg ccaaagagct gggcatcagc 1260
atctgccacc aggtggtgcc tccactggcc gcctctaaag ccaaagcctg ccaggccatc 1320
gaactgcagc tggccctgga agccctgaat gccagcccct acgatgagtg gaccctgcag 1380
cagaccagcc tggaaatgtg gctggccgag ccccagttta agaagcacgg catcaccatc 1440
accgtgcagt acgacaatga caaggccaat accatggatt acacaaattg gaaagaaatc 1500
tacgtgatcg tgtgccccgc cagcatcccc tccgatgaga tcagcaccga ggaagccgac 1560
cacattgatt attggaaagc catcaggcag gaaaacgcca tcttcttcgc cgccagacac 1620
caggtggtgc ccgccctgaa tatctgcaag gccaaggcct gtaaagccat cgagtggaac 1680
accgagccca agcactgctt caagaagggc ggccagcaca tcgaagtgtg gttcgactac 1740
gtggcctggg acagcgtgta ctactgcggc gacgatggct ggtgcaagac cgaggccgag 1800
aagtacggct gcaagggcac ctgggaagtg catttcggca acagcatcga ctgcaacgac 1860
tccatgtgca gcaccttcga cgacaacgtg tccgccaccg agctcgtgaa ggaccatatc 1920
gactattgga agctgattcg cctggaatgt gccatttttt acaaggccag acggcggctg 1980
tccagcgacc aggatcagtc tcagctcgtg accaagtacc ccctgctgaa gctgctgtcc 2040
agacccccca acatgggcgt gaaggcccac ggcaagtgca tctgggagaa caaggtgttc 2100
atcgtgccca ccctgtgccc cgtgcctctg gatccaacat atcctctgct gaaactgctg 2160
accacccaga ccaccacccc cgagaatacc tccctggtgg aactgagagt gaccaccccc 2220
aagagcacag tcgtgatcag gctgcacctg accaccagat acccactgct gtcactgctg 2280
aacagctaca gcaccccccc tcaccggatc cctgctccat gtccttgggc tcctcagagg 2340
ccccccatcc ctaagccttc tccatgggcc cctagaatcc ctgccccttg cccctgggca 2400
cctcctagac ctccacactg tccatgggtg ccccctccac ctcctccaag accttgggcc 2460
ccttgcttcc tgctgtgctt ttgtgtgctg ctgtgcgtgt gcctgctgat cagacccctg 2520
ctgctgagtg tgtccaccta cctgaggcct ctgctgctgt ctatcagcgt gtacgctcag 2580
gtgctggtgc tggtgctgct gctgtgggtg tccatcggaa gcctgctgcc cagcgtgtgc 2640
atgtgtgcct atgcctgggt gctggtgttc gtgtacatcg tcgtgattac cagccccgcc 2700
accgccatcg tgtaccggga tggcaatcct tacgccgtgt gcgacaagtg cctgaagttc 2760
tacagcaaga tcagcgagta ccggcactac tgctacagcc tgtacggcac caccctggaa 2820
cagcagtaca acaagcccct gtgcgatctg ctgattcggt gcatcaacgt ggtgtacaga 2880
gactccatcc cccacgccgc ctgccacaag tgtatcgact tctactccag aatcagagag 2940
ctgcggcact acagcgactc cgtgtacggc gataccctgg aaaagctgac caacactggc 3000
ctgtacaacc tgctgattag atgcctgcgg gtgttctgca agaaggccct gacagccagc 3060
gaggtgtaca acttcgccta caccgatctg cgggtggtgt atcgggacag caaagtgcgg 3120
aagctgaggt actacaactg ctctgtgtat ggcgccagcc tggtgtattg caagggacag 3180
ctgaccgaga cagaggtgct ggatttcgcc ttcacagacc tgacaatcgt gtatcgcgac 3240
tccaaggtgt ccgagttccg gtggtacaga tattccgtgt atggcaccac actgtgcgtg 3300
gaatgcaaga aaaccctgca gagatctgag gtgtacgact gccagcggcc actgtgtccg 3360
caggaaaaga aaagacacgt ggacctgaac aagcggttcc acaccctgca cgagtacatg 3420
ctggatctgc agcccgagac aaccgacctg tactgctacg agcagcctga aaccactgat 3480
ctgcactgtt atgagcagct gggagacagc tccgatgaag aggacactgg cggcctggat 3540
ggggacgagg atgaggacga agtggaccat ctgcaggaac agccccagca ggctagacgg 3600
gacgaacagc acccttgcta tctgatcgag acacagtgct gcagatgcga atctctggtg 3660
gaagagaacg acgagatcga cggcgtgaac caccagcatc tgcccgctag aagggccgag 3720
cctcagagac acaccatgct gtgtatgtgc tgcaagtgcg aggccagaat cgccggctct 3780
ggacctggcg cctctggcaa gcctatcccc aatccactgc tgggcctgga ctccacccgg 3840
acctgataa 3849
<210> 66
<211> 1281
<212> PRT
<213> 人工序列
<220>
<223> 疫苗构建体
<400> 66
Asp Glu Asp Glu Thr Ala Tyr Asp Ser Gly Thr Asp Leu Ile Asp Phe
1 5 10 15
Ile Asp Asp Ser Asp Glu Asn Glu Asn Asp Ser Asp Thr Gly Glu Asp
20 25 30
Met Val Asp Phe Ile Asp Asn Asp Glu Thr Asp Glu Glu Ser Thr Glu
35 40 45
Ser Asp Leu Asp Gly Phe Ile Asp Asn Ser Ala Gln Leu Ala Asp Ser
50 55 60
Asp Ser Asn Ala Cys Ala Phe Leu Lys Ala Gln Leu Ala Asp Val Asn
65 70 75 80
Ser Asn Ala Ala Ala Phe Leu Lys Asn Cys Ile Leu Leu Tyr Gly Ala
85 90 95
Ala Asn Thr Gly Lys Ser Leu Phe Gly Met Ser Leu Asn Cys Leu Val
100 105 110
Leu Cys Gly Pro Ala Asn Thr Gly Lys Ser Tyr Phe Gly Met Ser Leu
115 120 125
Asn Cys Leu Val Ile Tyr Gly Pro Pro Asn Thr Gly Lys Ser Cys Phe
130 135 140
Ala Met Ser Leu Trp Pro Tyr Leu His Ser Arg Leu Val Val Phe Thr
145 150 155 160
Phe Pro Asn Pro Phe Trp Pro Tyr Leu Glu Ser Arg Ile Thr Val Phe
165 170 175
Glu Phe Pro Asn Ala Phe Leu Arg Tyr Leu His Ser Arg Ile His Val
180 185 190
Leu Gln Phe Leu Asn Pro Phe Asn Val Cys Gln Asp Lys Ile Leu Glu
195 200 205
His Tyr Glu Asn Asp Ser Lys Asp Ile Leu Glu His Tyr Glu Asn Asp
210 215 220
Ser Lys Asp Leu Cys Asp His Ile Cys Asp His Ile Asp Tyr Trp Lys
225 230 235 240
His Ile Arg Leu Glu Cys Ala Ile Met Tyr Lys Ala Arg Ile Arg Leu
245 250 255
Glu Cys Ala Ile Met Tyr Lys Ala Arg Glu Met Gly Phe His Gln Phe
260 265 270
Asp Gly Asp Ile Cys Asn Thr Met His Tyr Thr Asn Trp Ile Tyr Ile
275 280 285
Cys Glu Asp Ala Gln Cys Thr Val Val Glu Gly Gln Val Asp Lys Lys
290 295 300
Trp Glu Val His Ala Gly Gly Gln Val Ile Leu Cys Pro Glu Ser Gly
305 310 315 320
Gln Arg Arg Ile Lys Arg Pro Arg Ser Glu Asn Cys His Pro Asn Lys
325 330 335
Leu Leu Ile Leu Lys Cys Leu Arg Tyr Arg Phe Lys Lys His Cys Lys
340 345 350
Leu Ser Ser Thr Trp His Trp Thr Cys His Asp Gly Lys His Lys Trp
355 360 365
His Trp Thr Cys His Asp Gly Lys His Lys Asn Ala Ile Val Thr Leu
370 375 380
Thr Tyr Tyr Glu Ala Asp Lys Asn Asp Leu Asn Ala Gln Ile Glu His
385 390 395 400
Trp Lys Leu Ile Arg Met Glu Cys Ala Ile Phe Tyr Lys Ala Lys Glu
405 410 415
Leu Gly Ile Ser Ile Cys His Gln Val Val Pro Pro Leu Ala Ala Ser
420 425 430
Lys Ala Lys Ala Cys Gln Ala Ile Glu Leu Gln Leu Ala Leu Glu Ala
435 440 445
Leu Asn Ala Ser Pro Tyr Asp Glu Trp Thr Leu Gln Gln Thr Ser Leu
450 455 460
Glu Met Trp Leu Ala Glu Pro Gln Phe Lys Lys His Gly Ile Thr Ile
465 470 475 480
Thr Val Gln Tyr Asp Asn Asp Lys Ala Asn Thr Met Asp Tyr Thr Asn
485 490 495
Trp Lys Glu Ile Tyr Val Ile Val Cys Pro Ala Ser Ile Pro Ser Asp
500 505 510
Glu Ile Ser Thr Glu Glu Ala Asp His Ile Asp Tyr Trp Lys Ala Ile
515 520 525
Arg Gln Glu Asn Ala Ile Phe Phe Ala Ala Arg His Gln Val Val Pro
530 535 540
Ala Leu Asn Ile Cys Lys Ala Lys Ala Cys Lys Ala Ile Glu Trp Asn
545 550 555 560
Thr Glu Pro Lys His Cys Phe Lys Lys Gly Gly Gln His Ile Glu Val
565 570 575
Trp Phe Asp Tyr Val Ala Trp Asp Ser Val Tyr Tyr Cys Gly Asp Asp
580 585 590
Gly Trp Cys Lys Thr Glu Ala Glu Lys Tyr Gly Cys Lys Gly Thr Trp
595 600 605
Glu Val His Phe Gly Asn Ser Ile Asp Cys Asn Asp Ser Met Cys Ser
610 615 620
Thr Phe Asp Asp Asn Val Ser Ala Thr Glu Leu Val Lys Asp His Ile
625 630 635 640
Asp Tyr Trp Lys Leu Ile Arg Leu Glu Cys Ala Ile Phe Tyr Lys Ala
645 650 655
Arg Arg Arg Leu Ser Ser Asp Gln Asp Gln Ser Gln Leu Val Thr Lys
660 665 670
Tyr Pro Leu Leu Lys Leu Leu Ser Arg Pro Pro Asn Met Gly Val Lys
675 680 685
Ala His Gly Lys Cys Ile Trp Glu Asn Lys Val Phe Ile Val Pro Thr
690 695 700
Leu Cys Pro Val Pro Leu Asp Pro Thr Tyr Pro Leu Leu Lys Leu Leu
705 710 715 720
Thr Thr Gln Thr Thr Thr Pro Glu Asn Thr Ser Leu Val Glu Leu Arg
725 730 735
Val Thr Thr Pro Lys Ser Thr Val Val Ile Arg Leu His Leu Thr Thr
740 745 750
Arg Tyr Pro Leu Leu Ser Leu Leu Asn Ser Tyr Ser Thr Pro Pro His
755 760 765
Arg Ile Pro Ala Pro Cys Pro Trp Ala Pro Gln Arg Pro Pro Ile Pro
770 775 780
Lys Pro Ser Pro Trp Ala Pro Arg Ile Pro Ala Pro Cys Pro Trp Ala
785 790 795 800
Pro Pro Arg Pro Pro His Cys Pro Trp Val Pro Pro Pro Pro Pro Pro
805 810 815
Arg Pro Trp Ala Pro Cys Phe Leu Leu Cys Phe Cys Val Leu Leu Cys
820 825 830
Val Cys Leu Leu Ile Arg Pro Leu Leu Leu Ser Val Ser Thr Tyr Leu
835 840 845
Arg Pro Leu Leu Leu Ser Ile Ser Val Tyr Ala Gln Val Leu Val Leu
850 855 860
Val Leu Leu Leu Trp Val Ser Ile Gly Ser Leu Leu Pro Ser Val Cys
865 870 875 880
Met Cys Ala Tyr Ala Trp Val Leu Val Phe Val Tyr Ile Val Val Ile
885 890 895
Thr Ser Pro Ala Thr Ala Ile Val Tyr Arg Asp Gly Asn Pro Tyr Ala
900 905 910
Val Cys Asp Lys Cys Leu Lys Phe Tyr Ser Lys Ile Ser Glu Tyr Arg
915 920 925
His Tyr Cys Tyr Ser Leu Tyr Gly Thr Thr Leu Glu Gln Gln Tyr Asn
930 935 940
Lys Pro Leu Cys Asp Leu Leu Ile Arg Cys Ile Asn Val Val Tyr Arg
945 950 955 960
Asp Ser Ile Pro His Ala Ala Cys His Lys Cys Ile Asp Phe Tyr Ser
965 970 975
Arg Ile Arg Glu Leu Arg His Tyr Ser Asp Ser Val Tyr Gly Asp Thr
980 985 990
Leu Glu Lys Leu Thr Asn Thr Gly Leu Tyr Asn Leu Leu Ile Arg Cys
995 1000 1005
Leu Arg Val Phe Cys Lys Lys Ala Leu Thr Ala Ser Glu Val Tyr
1010 1015 1020
Asn Phe Ala Tyr Thr Asp Leu Arg Val Val Tyr Arg Asp Ser Lys
1025 1030 1035
Val Arg Lys Leu Arg Tyr Tyr Asn Cys Ser Val Tyr Gly Ala Ser
1040 1045 1050
Leu Val Tyr Cys Lys Gly Gln Leu Thr Glu Thr Glu Val Leu Asp
1055 1060 1065
Phe Ala Phe Thr Asp Leu Thr Ile Val Tyr Arg Asp Ser Lys Val
1070 1075 1080
Ser Glu Phe Arg Trp Tyr Arg Tyr Ser Val Tyr Gly Thr Thr Leu
1085 1090 1095
Cys Val Glu Cys Lys Lys Thr Leu Gln Arg Ser Glu Val Tyr Asp
1100 1105 1110
Cys Gln Arg Pro Leu Cys Pro Gln Glu Lys Lys Arg His Val Asp
1115 1120 1125
Leu Asn Lys Arg Phe His Thr Leu His Glu Tyr Met Leu Asp Leu
1130 1135 1140
Gln Pro Glu Thr Thr Asp Leu Tyr Cys Tyr Glu Gln Pro Glu Thr
1145 1150 1155
Thr Asp Leu His Cys Tyr Glu Gln Leu Gly Asp Ser Ser Asp Glu
1160 1165 1170
Glu Asp Thr Gly Gly Leu Asp Gly Asp Glu Asp Glu Asp Glu Val
1175 1180 1185
Asp His Leu Gln Glu Gln Pro Gln Gln Ala Arg Arg Asp Glu Gln
1190 1195 1200
His Pro Cys Tyr Leu Ile Glu Thr Gln Cys Cys Arg Cys Glu Ser
1205 1210 1215
Leu Val Glu Glu Asn Asp Glu Ile Asp Gly Val Asn His Gln His
1220 1225 1230
Leu Pro Ala Arg Arg Ala Glu Pro Gln Arg His Thr Met Leu Cys
1235 1240 1245
Met Cys Cys Lys Cys Glu Ala Arg Ile Ala Gly Ser Gly Pro Gly
1250 1255 1260
Ala Ser Gly Lys Pro Ile Pro Asn Pro Leu Leu Gly Leu Asp Ser
1265 1270 1275
Thr Arg Thr
1280
<210> 67
<211> 2661
<212> DNA
<213> 人工序列
<220>
<223> 病毒构建体
<400> 67
ttaattaacc atcttcaata atatacctca aactttttgt gcgcgttaat atgcaaatga 60
ggcgtttgaa tttggggagg aagggcggtg attggtcgag ggatgagcga ccgttagggg 120
cggggcgagt gacgttttga tgacgtggtt gcgaggagga gccagtttgc aagttctcgt 180
gggaaaagtg acgtcaaacg aggtgtggtt tgaacacgga aatactcaat tttcccgcgc 240
tctctgacag gaaatgaggt gtttctgggc ggatgcaagt gaaaacgggc cattttcgcg 300
cgaaaactga atgaggaagt gaaaatctga gtaatttcgc gtttatggca gggaggagta 360
tttgccgagg gccgagtaga ctttgaccga ttacgtgggg gtttcgatta ccgtgttttt 420
cacctaaatt tccgcgtacg gtgtcaaagt ccggtgtttt tacgcgatcg ctagcgacat 480
cgatcacaag tttgtacaaa aaagcaggct ccaccatggg aacccgcgtt ttgagatttc 540
tgtcgccgac taaattcatg tcgcgcgata gtggtgttta tcgccgatag agatggcgat 600
attggaaaaa tcgatatttg aaaatatggc atattgaaaa tgtcgccgat gtgagtttct 660
gtgtaactga tatcgccatt tttccaaaag tgatttttgg gcatacgcga tatctggcga 720
tagcgcttat atcgtttacg ggggatggcg atagacgact ttggtgactt gggcgattct 780
gtgtgtcgca aatatcgcag tttcgatata ggtgacagac gatatgaggc tatatcgccg 840
atagaggcga catcaagctg gcacatggcc aatgcatatc gatctataca ttgaatcaat 900
attggccatt agccatatta ttcattggtt atatagcata aatcaatatt ggctattggc 960
cattgcatac gttgtatcca tatcataata tgtacattta tattggctca tgtccaacat 1020
taccgccatg ttgacattga ttattgacta gttattaata gtaatcaatt acggggtcat 1080
tagttcatag cccatatatg gagttccgcg ttacataact tacggtaaat ggcccgcctg 1140
gctgaccgcc caacgacccc cgcccattga cgtcaataat gacgtatgtt cccatagtaa 1200
cgccaatagg gactttccat tgacgtcaat gggtggagta tttacggtaa actgcccact 1260
tggcagtaca tcaagtgtat catatgccaa gtacgccccc tattgacgtc aatgacggta 1320
aatggcccgc ctggcattat gcccagtaca tgaccttatg ggactttcct acttggcagt 1380
acatctacgt attagtcatc gctattacca tggtgatgcg gttttggcag tacatcaatg 1440
ggcgtggata gcggtttgac tcacggggat ttccaagtct ccaccccatt gacgtcaatg 1500
ggagtttgtt ttggcaccaa aatcaacggg actttccaaa atgtcgtaac aactccgccc 1560
cattgacgca aatgggcggt aggcgtgtac ggtgggaggt ctatataagc agagctctcc 1620
ctatcagtga tagagatctc cctatcagtg atagagatcg tcgacgagct cgtttagtga 1680
accgtcagat cgcctggaga cgccatccac gctgttttga cctccataga agacaccggg 1740
accgatccag cctccgcggc cgggaacggt gcattggaac gcggattccc cgtgccaaga 1800
gtgacgtaag taccgcctat agagtctata ggcccacccc cttggcttct tatgcatgct 1860
atactgtttt tggcttgggg tctatacacc cccgcttcct catgttatag gtgatggtat 1920
agcttagcct ataggtgtgg gttattgacc attattgacc actcccctat tggtgacgat 1980
actttccatt actaatccat aacatggctc tttgccacaa ctctctttat tggctatatg 2040
ccaatacact gtccttcaga gactgacacg gactctgtat ttttacagga tggggtctca 2100
tttattattt acaaattcac atatacaaca ccaccgtccc cagtgcccgc agtttttatt 2160
aaacataacg tgggatctcc acgcgaatct cgggtacgtg ttccggacat gggctcttct 2220
ccggtagcgg cggagcttct acatccgagc cctgctccca tgcctccagc gactcatggt 2280
cgctcggcag ctccttgctc ctaacagtgg aggccagact taggcacagc acgatgccca 2340
ccaccaccag tgtgccgcac aaggccgtgg cggtagggta tgtgtctgaa aatgagctcg 2400
gggagcgggc ttgcaccgct gacgcatttg gaagacttaa ggcagcggca gaagaagatg 2460
caggcagctg agttgttgtg ttctgataag agtcagaggt aactcccgtt gcggtgctgt 2520
taacggtgga gggcagtgta gtctgagcag tactcgttgc tgccgcgcgc gccaccagac 2580
ataatagctg acagactaac agactgttcc tttccatggg tcttttctgc agtcaccgtc 2640
cttgacacga agcttggtac c 2661
<210> 68
<211> 37490
<212> DNA
<213> 人工序列
<220>
<223> 病毒构建体
<400> 68
gcggccgctc gagcatgcat ctagagggcc ctattctata gtgtcaccta aatgctagag 60
ctcgctgatc agcctcgact gtgccttcta gttgccagcc atctgttgtt tgcccctccc 120
ccgtgccttc cttgaccctg gaaggtgcca ctcccactgt cctttcctaa taaaatgagg 180
aaattgcatc gcattgtctg agtaggtgtc attctattct ggggggtggg gtggggcagg 240
acagcaaggg ggaggattgg gaagacaata gcaggcatgc tggggatgcg gtgggctcta 300
tggcttctga ggcggaaaga accagctggg gctcgagggg ggatcgatcc cgtcgagata 360
tctagaccca gctttcttgt acaaagtggt gatcgattcg acagatcgcg atcgcaagtg 420
agtagtgttc tggggcgggg gaggacctgc atgagggcca gaataactga aatctgtgct 480
tttctgtgtg ttgcagcagc atgagcggaa gcggctcctt tgagggaggg gtattcagcc 540
cttatctgac ggggcgtctc ccctcctggg cgggagtgcg tcagaatgtg atgggatcca 600
cggtggacgg ccggcccgtg cagcccgcga actcttcaac cctgacctat gcaaccctga 660
gctcttcgtc gttggacgca gctgccgccg cagctgctgc atctgccgcc agcgccgtgc 720
gcggaatggc catgggcgcc ggctactacg gcactctggt ggccaactcg agttccacca 780
ataatcccgc cagcctgaac gaggagaagc tgttgctgct gatggcccag ctcgaggcct 840
tgacccagcg cctgggcgag ctgacccagc aggtggctca gctgcaggag cagacgcggg 900
ccgcggttgc cacggtgaaa tccaaataaa aaatgaatca ataaataaac ggagacggtt 960
gttgatttta acacagagtc tgaatcttta tttgattttt cgcgcgcggt aggccctgga 1020
ccaccggtct cgatcattga gcacccggtg gatcttttcc aggacccggt agaggtgggc 1080
ttggatgttg aggtacatgg gcatgagccc gtcccggggg tggaggtagc tccattgcag 1140
ggcctcgtgc tcgggggtgg tgttgtaaat cacccagtca tagcaggggc gcagggcatg 1200
gtgttgcaca atatctttga ggaggagact gatggccacg ggcagccctt tggtgtaggt 1260
gtttacaaat ctgttgagct gggagggatg catgcggggg gagatgaggt gcatcttggc 1320
ctggatcttg agattggcga tgttaccgcc cagatcccgc ctggggttca tgttgtgcag 1380
gaccaccagc acggtgtatc cggtgcactt ggggaattta tcatgcaact tggaagggaa 1440
ggcgtgaaag aatttggcga cgcctttgtg cccgcccagg ttttccatgc actcatccat 1500
gatgatggcg atgggcccgt gggcggcggc ctgggcaaag acgtttcggg ggtcggacac 1560
atcatagttg tggtcctggg tgaggtcatc ataggccatt ttaatgaatt tggggcggag 1620
ggtgccggac tgggggacaa aggtaccctc gatcccgggg gcgtagttcc cctcacagat 1680
ctgcatctcc caggctttga gctcggaggg ggggatcatg tccacctgcg gggcgataaa 1740
gaacacggtt tccggggcgg gggagatgag ctgggccgaa agcaagttcc ggagcagctg 1800
ggacttgccg cagccggtgg ggccgtagat gaccccgatg accggctgca ggtggtagtt 1860
gagggagaga cagctgccgt cctcccggag gaggggggcc acctcgttca tcatctcgcg 1920
cacgtgcatg ttctcgcgca ccagttccgc caggaggcgc tctcccccca gggataggag 1980
ctcctggagc gaggcgaagt ttttcagcgg cttgagtccg tcggccatgg gcattttgga 2040
gagggtttgt tgcaagagtt ccaggcggtc ccagagctcg gtgatgtgct ctacggcatc 2100
tcgatccagc agacctcctc gtttcgcggg ttgggacggc tgcgggagta gggcaccaga 2160
cgatgggcgt ccagcgcagc cagggtccgg tccttccagg gtcgcagcgt ccgcgtcagg 2220
gtggtctccg tcacggtgaa ggggtgcgcg ccgggctggg cgcttgcgag ggtgcgcttc 2280
aggctcatcc ggctggtcga aaaccgctcc cgatcggcgc cctgcgcgtc ggccaggtag 2340
caattgacca tgagttcgta gttgagcgcc tcggccgcgt ggcctttggc gcggagctta 2400
cctttggaag tctgcccgca ggcgggacag aggagggact tgagggcgta gagcttgggg 2460
gcgaggaaga cggactcggg ggcgtaggcg tccgcgccgc agtgggcgca gacggtctcg 2520
cactccacga gccaggtgag gtcgggctgg tcggggtcaa aaaccagttt cccgccgttc 2580
tttttgatgc gtttcttacc tttggtctcc atgagctcgt gtccccgctg ggtgacaaag 2640
aggctgtccg tgtccccgta gaccgacttt atgggccggt cctcgagcgg tgtgccgcgg 2700
tcctcctcgt agaggaaccc cgcccactcc gagacgaaag cccgggtcca ggccagcacg 2760
aaggaggcca cgtgggacgg gtagcggtcg ttgtccacca gcgggtccac cttttccagg 2820
gtatgcaaac acatgtcccc ctcgtccaca tccaggaagg tgattggctt gtaagtgtag 2880
gccacgtgac cgggggtccc ggccgggggg gtataaaagg gtgcgggtcc ctgctcgtcc 2940
tcactgtctt ccggatcgct gtccaggagc gccagctgtt ggggtaggta ttccctctcg 3000
aaggcgggca tgacctcggc actcaggttg tcagtttcta gaaacgagga ggatttgata 3060
ttgacggtgc cggcggagat gcctttcaag agcccctcgt ccatctggtc agaaaagacg 3120
atctttttgt tgtcgagctt ggtggcgaag gagccgtaga gggcgttgga gaggagcttg 3180
gcgatggagc gcatggtctg gtttttttcc ttgtcggcgc gctccttggc ggcgatgttg 3240
agctgcacgt actcgcgcgc cacgcacttc cattcgggga agacggtggt cagctcgtcg 3300
ggcacgattc tgacctgcca gccccgatta tgcagggtga tgaggtccac actggtggcc 3360
acctcgccgc gcaggggctc attagtccag cagaggcgtc cgcccttgcg cgagcagaag 3420
gggggcaggg ggtccagcat gacctcgtcg ggggggtcgg catcgatggt gaagatgccg 3480
ggcaggaggt cggggtcaaa gtagctgatg gaagtggcca gatcgtccag ggcagcttgc 3540
cattcgcgca cggccagcgc gcgctcgtag ggactgaggg gcgtgcccca gggcatggga 3600
tgggtaagcg cggaggcgta catgccgcag atgtcgtaga cgtagagggg ctcctcgagg 3660
atgccgatgt aggtggggta gcagcgcccc ccgcggatgc tggcgcgcac gtagtcatac 3720
agctcgtgcg agggggcgag gagccccggg cccaggttgg tgcgactggg cttttcggcg 3780
cggtagacga tctggcggaa aatggcatgc gagttggagg agatggtggg cctttggaag 3840
atgttgaagt gggcgtgggg cagtccgacc gagtcgcgga tgaagtgggc gtaggagtct 3900
tgcagcttgg cgacgagctc ggcggtgact aggacgtcca gagcgcagta gtcgagggtc 3960
tcctggatga tgtcatactt gagctgtccc ttttgtttcc acagctcgcg gttgagaagg 4020
aactcttcgc ggtccttcca gtactcttcg agggggaacc cgtcctgatc tgcacggtaa 4080
gagcctagca tgtagaactg gttgacggcc ttgtaggcgc agcagccctt ctccacgggg 4140
agggcgtagg cctgggcggc cttgcgcagg gaggtgtgcg tgagggcgaa agtgtccctg 4200
accatgacct tgaggaactg gtgcttgaag tcgatatcgt cgcagccccc ctgctcccag 4260
agctggaagt ccgtgcgctt cttgtaggcg gggttgggca aagcgaaagt aacatcgttg 4320
aagaggatct tgcccgcgcg gggcataaag ttgcgagtga tgcggaaagg ttggggcacc 4380
tcggcccggt tgttgatgac ctgggcggcg agcacgatct cgtcgaagcc gttgatgttg 4440
tggcccacga tgtagagttc cacgaatcgc ggacggccct tgacgtgggg cagtttcttg 4500
agctcctcgt aggtgagctc gtcggggtcg ctgagcccgt gctgctcgag cgcccagtcg 4560
gcgagatggg ggttggcgcg gaggaaggaa gtccagagat ccacggccag ggcggtttgc 4620
agacggtccc ggtactgacg gaactgctgc ccgacggcca ttttttcggg ggtgacgcag 4680
tagaaggtgc gggggtcccc gtgccagcga tcccatttga gctggagggc gagatcgagg 4740
gcgagctcga cgagccggtc gtccccggag agtttcatga ccagcatgaa ggggacgagc 4800
tgcttgccga aggaccccat ccaggtgtag gtttccacat cgtaggtgag gaagagcctt 4860
tcggtgcgag gatgcgagcc gatggggaag aactggatct cctgccacca attggaggaa 4920
tggctgttga tgtgatggaa gtagaaatgc cgacggcgcg ccgaacactc gtgcttgtgt 4980
ttatacaagc ggccacagtg ctcgcaacgc tgcacgggat gcacgtgctg cacgagctgt 5040
acctgagttc ctttgacgag gaatttcagt gggaagtgga gtcgtggcgc ctgcatctcg 5100
tgctgtacta cgtcgtggtg gtcggcctgg ccctcttctg cctcgatggt ggtcatgctg 5160
acgagcccgc gcgggaggca ggtccagacc tcggcgcgag cgggtcggag agcgaggacg 5220
agggcgcgca ggccggagct gtccagggtc ctgagacgct gcggagtcag gtcagtgggc 5280
agcggcggcg cgcggttgac ttgcaggagt ttttccaggg cgcgcgggag gtccagatgg 5340
tacttgatct ccaccgcgcc attggtggcg acgtcgatgg cttgcagggt cccgtgcccc 5400
tggggtgtga ccaccgtccc ccgtttcttc ttgggcggct ggggcgacgg gggcggtgcc 5460
tcttccatgg ttagaagcgg cggcgaggac gcgcgccggg cggcaggggc ggctcggggc 5520
ccggaggcag gggcggcagg ggcacgtcgg cgccgcgcgc gggtaggttc tggtactgcg 5580
cccggagaag actggcgtga gcgacgacgc gacggttgac gtcctggatc tgacgcctct 5640
gggtgaaggc cacgggaccc gtgagtttga acctgaaaga gagttcgaca gaatcaatct 5700
cggtatcgtt gacggcggcc tgccgcagga tctcttgcac gtcgcccgag ttgtcctggt 5760
aggcgatctc ggtcatgaac tgctcgatct cctcctcttg aaggtctccg cggccggcgc 5820
gctccacggt ggccgcgagg tcgttggaga tgcggcccat gagctgcgag aaggcgttca 5880
tgcccgcctc gttccagacg cggctgtaga ccacgacgcc ctcgggatcg ccggcgcgca 5940
tgaccacctg ggcgaggttg agctccacgt ggcgcgtgaa gaccgcgtag ttgcagaggc 6000
gctggtagag gtagttgagc gtggtggcga tgtgctcggt gacgaagaaa tacatgatcc 6060
agcggcggag cggcatctcg ctgacgtcgc ccagcgcctc caaacgttcc atggcctcgt 6120
aaaagtccac ggcgaagttg aaaaactggg agttgcgcgc cgagacggtc aactcctcct 6180
ccagaagacg gatgagctcg gcgatggtgg cgcgcacctc gcgctcgaag gcccccggga 6240
gttcctccac ttcctcttct tcctcctcca ctaacatctc ttctacttcc tcctcaggcg 6300
gcagtggtgg cgggggaggg ggcctgcgtc gccggcggcg cacgggcaga cggtcgatga 6360
agcgctcgat ggtctcgccg cgccggcgtc gcatggtctc ggtgacggcg cgcccgtcct 6420
cgcggggccg cagcgtgaag acgccgccgc gcatctccag gtggccgggg gggtccccgt 6480
tgggcaggga gagggcgctg acgatgcatc ttatcaattg ccccgtaggg actccgcgca 6540
aggacctgag cgtctcgaga tccacgggat ctgaaaaccg ctgaacgaag gcttcgagcc 6600
agtcgcagtc gcaaggtagg ctgagcacgg tttcttctgg cgggtcatgt tggttgggag 6660
cggggcgggc gatgctgctg gtgatgaagt tgaaataggc ggttctgaga cggcggatgg 6720
tggcgaggag caccaggtct ttgggcccgg cttgctggat gcgcagacgg tcggccatgc 6780
cccaggcgtg gtcctgacac ctggccaggt ccttgtagta gtcctgcatg agccgctcca 6840
cgggcacctc ctcctcgccc gcgcggccgt gcatgcgcgt gagcccgaag ccgcgctggg 6900
gctggacgag cgccaggtcg gcgacgacgc gctcggcgag gatggcttgc tggatctggg 6960
tgagggtggt ctggaagtca tcaaagtcga cgaagcggtg gtaggctccg gtgttgatgg 7020
tgtaggagca gttggccatg acggaccagt tgacggtctg gtggcccgga cgcacgagct 7080
cgtggtactt gaggcgcgag taggcgcgcg tgtcgaagat gtagtcgttg caggtgcgca 7140
ccaggtactg gtagccgatg aggaagtgcg gcggcggctg gcggtagagc ggccatcgct 7200
cggtggcggg ggcgccgggc gcgaggtcct cgagcatggt gcggtggtag ccgtagatgt 7260
acctggacat ccaggtgatg ccggcggcgg tggtggaggc gcgcgggaac tcgcggacgc 7320
ggttccagat gttgcgcagc ggcaggaagt agttcatggt gggcacggtc tggcccgtga 7380
ggcgcgcgca gtcgtggatg ctctatacgg gcaaaaacga aagcggtcag cggctcgact 7440
ccgtggcctg gaggctaagc gaacgggttg ggctgcgcgt gtaccccggt tcgaatctcg 7500
aatcaggctg gagccgcagc taacgtggta ttggcactcc cgtctcgacc caagcctgca 7560
ccaaccctcc aggatacgga ggcgggtcgt tttgcaactt ttttttggag gccggatgag 7620
actagtaagc gcggaaagcg gccgaccgcg atggctcgct gccgtagtct ggagaagaat 7680
cgccagggtt gcgttgcggt gtgccccggt tcgaggccgg ccggattccg cggctaacga 7740
gggcgtggct gccccgtcgt ttccaagacc ccatagccag ccgacttctc cagttacgga 7800
gcgagcccct cttttgtttt gtttgttttt gccagatgca tcccgtactg cggcagatgc 7860
gcccccacca ccctccaccg caacaacagc cccctccaca gccggcgctt ctgcccccgc 7920
cccagcagca acttccagcc acgaccgccg cggccgccgt gagcggggct ggacagagtt 7980
atgatcacca gctggccttg gaagagggcg aggggctggc gcgcctgggg gcgtcgtcgc 8040
cggagcggca cccgcgcgtg cagatgaaaa gggacgctcg cgaggcctac gtgcccaagc 8100
agaacctgtt cagagacagg agcggcgagg agcccgagga gatgcgcgcg gcccggttcc 8160
acgcggggcg ggagctgcgg cgcggcctgg accgaaagag ggtgctgagg gacgaggatt 8220
tcgaggcgga cgagctgacg gggatcagcc ccgcgcgcgc gcacgtggcc gcggccaacc 8280
tggtcacggc gtacgagcag accgtgaagg aggagagcaa cttccaaaaa tccttcaaca 8340
accacgtgcg caccctgatc gcgcgcgagg aggtgaccct gggcctgatg cacctgtggg 8400
acctgctgga ggccatcgtg cagaacccca ccagcaagcc gctgacggcg cagctgttcc 8460
tggtggtgca gcatagtcgg gacaacgaag cgttcaggga ggcgctgctg aatatcaccg 8520
agcccgaggg ccgctggctc ctggacctgg tgaacattct gcagagcatc gtggtgcagg 8580
agcgcgggct gccgctgtcc gagaagctgg cggccatcaa cttctcggtg ctgagtttgg 8640
gcaagtacta cgctaggaag atctacaaga ccccgtacgt gcccatagac aaggaggtga 8700
agatcgacgg gttttacatg cgcatgaccc tgaaagtgct gaccctgagc gacgatctgg 8760
gggtgtaccg caacgacagg atgcaccgtg cggtgagcgc cagcaggcgg cgcgagctga 8820
gcgaccagga gctgatgcat agtctgcagc gggccctgac cggggccggg accgaggggg 8880
agagctactt tgacatgggc gcggacctgc actggcagcc cagccgccgg gccttggagg 8940
cggcggcagg accctacgta gaagaggtgg acgatgaggt ggacgaggag ggcgagtacc 9000
tggaagactg atggcgcgac cgtatttttg ctagatgcaa caacaacagc cacctcctga 9060
tcccgcgatg cgggcggcgc tgcagagcca gccgtccggc attaactcct cggacgattg 9120
gacccaggcc atgcaacgca tcatggcgct gacgacccgc aaccccgaag cctttagaca 9180
gcagccccag gccaaccggc tctcggccat cctggaggcc gtggtgccct cgcgctccaa 9240
ccccacgcac gagaaggtcc tggccatcgt gaacgcgctg gtggagaaca aggccatccg 9300
cggcgacgag gccggcctgg tgtacaacgc gctgctggag cgcgtggccc gctacaacag 9360
caccaacgtg cagaccaacc tggaccgcat ggtgaccgac gtgcgcgagg ccgtggccca 9420
gcgcgagcgg ttccaccgcg agtccaacct gggatccatg gtggcgctga acgccttcct 9480
cagcacccag cccgccaacg tgccccgggg ccaggaggac tacaccaact tcatcagcgc 9540
cctgcgcctg atggtgaccg aggtgcccca gagcgaggtg taccagtccg ggccggacta 9600
cttcttccag accagtcgcc agggcttgca gaccgtgaac ctgagccagg ctttcaagaa 9660
cttgcagggc ctgtggggcg tgcaggcccc ggtcggggac cgcgcgacgg tgtcgagcct 9720
gctgacgccg aactcgcgcc tgctgctgct gctggtggcc cccttcacgg acagcggcag 9780
catcaaccgc aactcgtacc tgggctacct gattaacctg taccgcgagg ccatcggcca 9840
ggcgcacgtg gacgagcaga cctaccagga gatcacccac gtgagccgcg ccctgggcca 9900
ggacgacccg ggcaacctgg aagccaccct gaactttttg ctgaccaacc ggtcgcagaa 9960
gatcccgccc cagtacgcgc tcagcaccga ggaggagcgc atcctgcgtt acgtgcagca 10020
gagcgtgggc ctgttcctga tgcaggaggg ggccaccccc agcgccgcgc tcgacatgac 10080
cgcgcgcaac atggagccca gcatgtacgc cagcaaccgc ccgttcatca ataaactgat 10140
ggactacttg catcgggcgg ccgccatgaa ctctgactat ttcaccaacg ccatcctgaa 10200
tccccactgg ctcccgccgc cggggttcta cacgggcgag tacgacatgc ccgaccccaa 10260
tgacgggttc ctgtgggacg atgtggacag cagcgtgttc tccccccgac cgggtgctaa 10320
cgagcgcccc ttgtggaaga aggaaggcag cgaccgacgc ccgtcctcgg cgctgtccgg 10380
ccgcgagggt gctgccgcgg cggtgcccga ggccgccagt cctttcccga gcttgccctt 10440
ctcgctgaac agtatccgca gcagcgagct gggcaggatc acgcgcccgc gcttgctggg 10500
cgaagaggag tacttgaatg actcgctgtt gagacccgag cgggagaaga acttccccaa 10560
taacgggata gaaagcctgg tggacaagat gagccgctgg aagacgtatg cgcaggagca 10620
cagggacgat ccccgggcgt cgcagggggc cacgagccgg ggcagcgccg cccgtaaacg 10680
ccggtggcac gacaggcagc ggggacagat gtgggacgat gaggactccg ccgacgacag 10740
cagcgtgttg gacttgggtg ggagtggtaa cccgttcgct cacctgcgcc cccgtatcgg 10800
gcgcatgatg taagagaaac cgaaaataaa tgatactcac caaggccatg gcgaccagcg 10860
tgcgttcgtt tcttctctgt tgttgttgta tctagtatga tgaggcgtgc gtacccggag 10920
ggtcctcctc cctcgtacga gagcgtgatg cagcaggcga tggcggcggc ggcgatgcag 10980
cccccgctgg aggctcctta cgtgcccccg cggtacctgg cgcctacgga ggggcggaac 11040
agcattcgtt actcggagct ggcacccttg tacgatacca cccggttgta cctggtggac 11100
aacaagtcgg cggacatcgc ctcgctgaac taccagaacg accacagcaa cttcctgacc 11160
accgtggtgc agaacaatga cttcaccccc acggaggcca gcacccagac catcaacttt 11220
gacgagcgct cgcggtgggg cggccagctg aaaaccatca tgcacaccaa catgcccaac 11280
gtgaacgagt tcatgtacag caacaagttc aaggcgcggg tgatggtctc ccgcaagacc 11340
cccaatgggg tgacagtgac agaggattat gatggtagtc aggatgagct gaagtatgaa 11400
tgggtggaat ttgagctgcc cgaaggcaac ttctcggtga ccatgaccat cgacctgatg 11460
aacaacgcca tcatcgacaa ttacttggcg gtggggcggc agaacggggt gctggagagc 11520
gacatcggcg tgaagttcga cactaggaac ttcaggctgg gctgggaccc cgtgaccgag 11580
ctggtcatgc ccggggtgta caccaacgag gctttccatc ccgatattgt cttgctgccc 11640
ggctgcgggg tggacttcac cgagagccgc ctcagcaacc tgctgggcat tcgcaagagg 11700
cagcccttcc aggaaggctt ccagatcatg tacgaggatc tggagggggg caacatcccc 11760
gcgctcctgg atgtcgacgc ctatgagaaa agcaaggagg atgcagcagc tgaagcaact 11820
gcagccgtag ctaccgcctc taccgaggtc aggggcgata attttgcaag cgccgcagca 11880
gtggcagcgg ccgaggcggc tgaaaccgaa agtaagatag tcattcagcc ggtggagaag 11940
gatagcaaga acaggagcta caacgtacta ccggacaaga taaacaccgc ctaccgcagc 12000
tggtacctag cctacaacta tggcgacccc gagaagggcg tgcgctcctg gacgctgctc 12060
accacctcgg acgtcacctg cggcgtggag caagtctact ggtcgctgcc cgacatgatg 12120
caagacccgg tcaccttccg ctccacgcgt caagttagca actacccggt ggtgggcgcc 12180
gagctcctgc ccgtctactc caagagcttc ttcaacgagc aggccgtcta ctcgcagcag 12240
ctgcgcgcct tcacctcgct tacgcacgtc ttcaaccgct tccccgagaa ccagatcctc 12300
gtccgcccgc ccgcgcccac cattaccacc gtcagtgaaa acgttcctgc tctcacagat 12360
cacgggaccc tgccgctgcg cagcagtatc cggggagtcc agcgcgtgac cgttactgac 12420
gccagacgcc gcacctgccc ctacgtctac aaggccctgg gcatagtcgc gccgcgcgtc 12480
ctctcgagcc gcaccttcta aatgtccatt ctcatctcgc ccagtaataa caccggttgg 12540
ggcctgcgcg cgcccagcaa gatgtacgga ggcgctcgcc aacgctccac gcaacacccc 12600
gtgcgcgtgc gcgggcactt ccgcgctccc tggggcgccc tcaagggccg cgtgcggtcg 12660
cgcaccaccg tcgacgacgt gatcgaccag gtggtggccg acgcgcgcaa ctacaccccc 12720
gccgccgcgc ccgtctccac cgtggacgcc gtcatcgaca gcgtggtggc ggacgcgcgc 12780
cggtacgccc gcgccaagag ccggcggcgg cgcatcgccc ggcggcaccg gagcaccccc 12840
gccatgcgcg cggcgcgagc cttgctgcgc agggccaggc gcacgggacg cagggccatg 12900
ctcagggcgg ccagacgcgc ggcttcaggc gccagcgccg gcaggacccg gagacgcgcg 12960
gccacggcgg cggcagcggc catcgccagc atgtcccgcc cgcggcgagg gaacgtgtac 13020
tgggtgcgcg acgccgccac cggtgtgcgc gtgcccgtgc gcacccgccc ccctcgcact 13080
tgaagatgtt cacttcgcga tgttgatgtg tcccagcggc gaggaggatg tccaagcgca 13140
aattcaagga agagatgctc caggtcatcg cgcctgagat ctacggccct gcggtggtga 13200
aggaggaaag aaagccccgc aaaatcaagc gggtcaaaaa ggacaaaaag gaagaagaaa 13260
gtgatgtgga cggattggtg gagtttgtgc gcgagttcgc cccccggcgg cgcgtgcagt 13320
ggcgcgggcg gaaggtgcaa ccggtgctga gacccggcac caccgtggtc ttcacgcccg 13380
gcgagcgctc cggcaccgct tccaagcgct cctacgacga ggtgtacggg gatgatgata 13440
ttctggagca ggcggccgag cgcctgggcg agtttgctta cggcaagcgc agccgttccg 13500
caccgaagga agaggcggtg tccatcccgc tggaccacgg caaccccacg ccgagcctca 13560
agcccgtgac cttgcagcag gtgctgccga ccgcggcgcc gcgccggggg ttcaagcgcg 13620
agggcgagga tctgtacccc accatgcagc tgatggtgcc caagcgccag aagctggaag 13680
acgtgctgga gaccatgaag gtggacccgg acgtgcagcc cgaggtcaag gtgcggccca 13740
tcaagcaggt ggccccgggc ctgggcgtgc agaccgtgga catcaagatt cccacggagc 13800
ccatggaaac gcagaccgag cccatgatca agcccagcac cagcaccatg gaggtgcaga 13860
cggatccctg gatgccatcg gctcctagtc gaagaccccg gcgcaagtac ggcgcggcca 13920
gcctgctgat gcccaactac gcgctgcatc cttccatcat ccccacgccg ggctaccgcg 13980
gcacgcgctt ctaccgcggt cataccagca gccgccgccg caagaccacc actcgccgcc 14040
gccgtcgccg caccgccgct gcaaccaccc ctgccgccct ggtgcggaga gtgtaccgcc 14100
gcggccgcgc acctctgacc ctgccgcgcg cgcgctacca cccgagcatc gccatttaaa 14160
ctttcgccag ctttgcagat caatggccct cacatgccgc cttcgcgttc ccattacggg 14220
ctaccgagga agaaaaccgc gccgtagaag gctggcgggg aacgggatgc gtcgccacca 14280
ccaccggcgg cggcgcgcca tcagcaagcg gttgggggga ggcttcctgc ccgcgctgat 14340
ccccatcatc gccgcggcga tcggggcgat ccccggcatt gcttccgtgg cggtgcaggc 14400
ctctcagcgc cactgagaca cacttggaaa catcttgtaa taaacccatg gactctgacg 14460
ctcctggtcc tgtgatgtgt tttcgtagac agatggaaga catcaatttt tcgtccctgg 14520
ctccgcgaca cggcacgcgg ccgttcatgg gcacctggag cgacatcggc accagccaac 14580
tgaacggggg cgccttcaat tggagcagtc tctggagcgg gcttaagaat ttcgggtcca 14640
cgcttaaaac ctatggcagc aaggcgtgga acagcaccac agggcaggcg ctgagggata 14700
agctgaaaga gcagaacttc cagcagaagg tggtcgatgg gctcgcctcg ggcatcaacg 14760
gggtggtgga cctggccaac caggccgtgc agcggcagat caacagccgc ctggacccgg 14820
tgccgcccgc cggctccgtg gagatgccgc aggtggagga ggagctgcct cccctggaca 14880
agcggggcga gaagcgaccc cgccccgatg cggaggagac gctgctgacg cacacggacg 14940
agccgccccc gtacgaggag gcggtgaaac tgggtctgcc caccacgcgg cccatcgcgc 15000
ccctggccac cggggtgctg aaacccgaaa agcccgcgac cctggacttg cctcctcccc 15060
agccttcccg cccctctaca gtggctaagc ccctgccgcc ggtggccgtg gcccgcgcgc 15120
gacccggggg caccgcccgc cctcatgcga actggcagag cactctgaac agcatcgtgg 15180
gtctgggagt gcagagtgtg aagcgccgcc gctgctatta aacctaccgt agcgcttaac 15240
ttgcttgtct gtgtgtgtat gtattatgtc gccgccgccg ctgtccacca gaaggaggag 15300
tgaagaggcg cgtcgccgag ttgcaagatg gccaccccat cgatgctgcc ccagtgggcg 15360
tacatgcaca tcgccggaca ggacgcttcg gagtacctga gtccgggtct ggtgcagttt 15420
gcccgcgcca cagacaccta cttcagtctg gggaacaagt ttaggaaccc cacggtggcg 15480
cccacgcacg atgtgaccac cgaccgcagc cagcggctga cgctgcgctt cgtgcccgtg 15540
gaccgcgagg acaacaccta ctcgtacaaa gtgcgctaca cgctggccgt gggcgacaac 15600
cgcgtgctgg acatggccag cacctacttt gacatccgcg gcgtgctgga tcggggccct 15660
agcttcaaac cctactccgg caccgcctac aacagtctgg cccccaaggg agcacccaac 15720
acttgtcagt ggacatataa agccgatggt gaaactgcca cagaaaaaac ctatacatat 15780
ggaaatgcac ccgtgcaggg cattaacatc acaaaagatg gtattcaact tggaactgac 15840
accgatgatc agccaatcta cgcagataaa acctatcagc ctgaacctca agtgggtgat 15900
gctgaatggc atgacatcac tggtactgat gaaaagtatg gaggcagagc tcttaagcct 15960
gataccaaaa tgaagccttg ttatggttct tttgccaagc ctactaataa agaaggaggt 16020
caggcaaatg tgaaaacagg aacaggcact actaaagaat atgacataga catggctttc 16080
tttgacaaca gaagtgcggc tgctgctggc ctagctccag aaattgtttt gtatactgaa 16140
aatgtggatt tggaaactcc agatacccat attgtataca aagcaggcac agatgacagc 16200
agctcttcta ttaatttggg tcagcaagcc atgcccaaca gacctaacta cattggtttc 16260
agagacaact ttatcgggct catgtactac aacagcactg gcaatatggg ggtgctggcc 16320
ggtcaggctt ctcagctgaa tgctgtggtt gacttgcaag acagaaacac cgagctgtcc 16380
taccagctct tgcttgactc tctgggtgac agaacccggt atttcagtat gtggaatcag 16440
gcggtggaca gctatgatcc tgatgtgcgc attattgaaa atcatggtgt ggaggatgaa 16500
cttcccaact attgtttccc tctggatgct gttggcagaa cagatactta tcagggaatt 16560
aaggctaatg gaactgatca aaccacatgg accaaagatg acagtgtcaa tgatgctaat 16620
gagataggca agggtaatcc attcgccatg gaaatcaaca tccaagccaa cctgtggagg 16680
aacttcctct acgccaacgt ggccctgtac ctgcccgact cttacaagta cacgccggcc 16740
aatgttaccc tgcccaccaa caccaacacc tacgattaca tgaacggccg ggtggtggcg 16800
ccctcgctgg tggactccta catcaacatc ggggcgcgct ggtcgctgga tcccatggac 16860
aacgtgaacc ccttcaacca ccaccgcaat gcggggctgc gctaccgctc catgctcctg 16920
ggcaacgggc gctacgtgcc cttccacatc caggtgcccc agaaattttt cgccatcaag 16980
agcctcctgc tcctgcccgg gtcctacacc tacgagtgga acttccgcaa ggacgtcaac 17040
atgatcctgc agagctccct cggcaacgac ctgcgcacgg acggggcctc catctccttc 17100
accagcatca acctctacgc caccttcttc cccatggcgc acaacacggc ctccacgctc 17160
gaggccatgc tgcgcaacga caccaacgac cagtccttca acgactacct ctcggcggcc 17220
aacatgctct accccatccc ggccaacgcc accaacgtgc ccatctccat cccctcgcgc 17280
aactgggccg ccttccgcgg ctggtccttc acgcgtctca agaccaagga gacgccctcg 17340
ctgggctccg ggttcgaccc ctacttcgtc tactcgggct ccatccccta cctcgacggc 17400
accttctacc tcaaccacac cttcaagaag gtctccatca ccttcgactc ctccgtcagc 17460
tggcccggca acgaccggct cctgacgccc aacgagttcg aaatcaagcg caccgtcgac 17520
ggcgagggct acaacgtggc ccagtgcaac atgaccaagg actggttcct ggtccagatg 17580
ctggcccact acaacatcgg ctaccagggc ttctacgtgc ccgagggcta caaggaccgc 17640
atgtactcct tcttccgcaa cttccagccc atgagccgcc aggtggtgga cgaggtcaac 17700
tacaaggact accaggccgt caccctggcc taccagcaca acaactcggg cttcgtcggc 17760
tacctcgcgc ccaccatgcg ccagggccag ccctaccccg ccaactaccc ctacccgctc 17820
atcggcaaga gcgccgtcac cagcgtcacc cagaaaaagt tcctctgcga cagggtcatg 17880
tggcgcatcc ccttctccag caacttcatg tccatgggcg cgctcaccga cctcggccag 17940
aacatgctct atgccaactc cgcccacgcg ctagacatga atttcgaagt cgaccccatg 18000
gatgagtcca cccttctcta tgttgtcttc gaagtcttcg acgtcgtccg agtgcaccag 18060
ccccaccgcg gcgtcatcga ggccgtctac ctgcgcaccc ccttctcggc cggtaacgcc 18120
accacctaag ctcttgcttc ttgcaagcca tggccgcggg ctccggcgag caggagctca 18180
gggccatcat ccgcgacctg ggctgcgggc cctacttcct gggcaccttc gataagcgct 18240
tcccgggatt catggccccg cacaagctgg cctgcgccat cgtcaacacg gccggccgcg 18300
agaccggggg cgagcactgg ctggccttcg cctggaaccc gcgctcgaac acctgctacc 18360
tcttcgaccc cttcgggttc tcggacgagc gcctcaagca gatctaccag ttcgagtacg 18420
agggcctgct gcgccgcagc gccctggcca ccgaggaccg ctgcgtcacc ctggaaaagt 18480
ccacccagac cgtgcagggt ccgcgctcgg ccgcctgcgg gctcttctgc tgcatgttcc 18540
tgcacgcctt cgtgcactgg cccgaccgcc ccatggacaa gaaccccacc atgaacttgc 18600
tgacgggggt gcccaacggc atgctccagt cgccccaggt ggaacccacc ctgcgccgca 18660
accaggaggc gctctaccgc ttcctcaact cccactccgc ctactttcgc tcccaccgcg 18720
cgcgcatcga gaaggccacc gccttcgacc gcatgaatca agacatgtaa accgtgtgtg 18780
tatgttaaat gtctttaata aacagcactt tcatgttaca catgcatctg agatgattta 18840
tttagaaatc gaaagggttc tgccgggtct cggcatggcc cgcgggcagg gacacgttgc 18900
ggaactggta cttggccagc cacttgaact cggggatcag cagtttgggc agcggggtgt 18960
cggggaagga gtcggtccac agcttccgcg tcagttgcag ggcgcccagc aggtcgggcg 19020
cggagatctt gaaatcgcag ttgggacccg cgttctgcgc gcgggagttg cggtacacgg 19080
ggttgcagca ctggaacacc atcagggccg ggtgcttcac gctcgccagc accgtcgcgt 19140
cggtgatgct ctccacgtcg aggtcctcgg cgttggccat cccgaagggg gtcatcttgc 19200
aggtctgcct tcccatggtg ggcacgcacc cgggcttgtg gttgcaatcg cagtgcaggg 19260
ggatcagcat catctgggcc tggtcggcgt tcatccccgg gtacatggcc ttcatgaaag 19320
cctccaattg cctgaacgcc tgctgggcct tggctccctc ggtgaagaag accccgcagg 19380
acttgctaga gaactggttg gtggcgcacc cggcgtcgtg cacgcagcag cgcgcgtcgt 19440
tgttggccag ctgcaccacg ctgcgccccc agcggttctg ggtgatcttg gcccggtcgg 19500
ggttctcctt cagcgcgcgc tgcccgttct cgctcgccac atccatctcg atcatgtgct 19560
ccttctggat catggtggtc ccgtgcaggc accgcagctt gccctcggcc tcggtgcacc 19620
cgtgcagcca cagcgcgcac ccggtgcact cccagttctt gtgggcgatc tgggaatgcg 19680
cgtgcacgaa gccctgcagg aagcggccca tcatggtggt cagggtcttg ttgctagtga 19740
aggtcagcgg aatgccgcgg tgctcctcgt tgatgtacag gtggcagatg cggcggtaca 19800
cctcgccctg ctcgggcatc agctggaagt tggctttcag gtcggtctcc acgcggtagc 19860
ggtccatcag catagtcatg atttccatac ccttctccca ggccgagacg atgggcaggc 19920
tcatagggtt cttcaccatc atcttagcgc tagcagccgc ggccaggggg tcgctctcgt 19980
ccagggtctc aaagctccgc ttgccgtcct tctcggtgat ccgcaccggg gggtagctga 20040
agcccacggc cgccagctcc tcctcggcct gtctttcgtc ctcgctgtcc tggctgacgt 20100
cctgcaggac cacatgcttg gtcttgcggg gtttcttctt gggcggcagc ggcggcggag 20160
atgttggaga tggcgagggg gagcgcgagt tctcgctcac cactactatc tcttcctctt 20220
cttggtccga ggccacgcgg cggtaggtat gtctcttcgg gggcagaggc ggaggcgacg 20280
ggctctcgcc gccgcgactt ggcggatggc tggcagagcc ccttccgcgt tcgggggtgc 20340
gctcccggcg gcgctctgac tgacttcctc cgcggccggc cattgtgttc tcctagggag 20400
gaacaacaag catggagact cagccatcgc caacctcgcc atctgccccc accgccgacg 20460
agaagcagca gcagcagaat gaaagcttaa ccgccccgcc gcccagcccc gccacctccg 20520
acgcggccgt cccagacatg caagagatgg aggaatccat cgagattgac ctgggctatg 20580
tgacgcccgc ggagcacgag gaggagctgg cagtgcgctt ttcacaagaa gagatacacc 20640
aagaacagcc agagcaggaa gcagagaatg agcagagtca ggctgggctc gagcatgacg 20700
gcgactacct ccacctgagc gggggggagg acgcgctcat caagcatctg gcccggcagg 20760
ccaccatcgt caaggatgcg ctgctcgacc gcaccgaggt gcccctcagc gtggaggagc 20820
tcagccgcgc ctacgagttg aacctcttct cgccgcgcgt gccccccaag cgccagccca 20880
atggcacctg cgagcccaac ccgcgcctca acttctaccc ggtcttcgcg gtgcccgagg 20940
ccctggccac ctaccacatc tttttcaaga accaaaagat ccccgtctcc tgccgcgcca 21000
accgcacccg cgccgacgcc cttttcaacc tgggtcccgg cgcccgccta cctgatatcg 21060
cctccttgga agaggttccc aagatcttcg agggtctggg cagcgacgag actcgggccg 21120
cgaacgctct gcaaggagaa ggaggagagc atgagcacca cagcgccctg gtcgagttgg 21180
aaggcgacaa cgcgcggctg gcggtgctca aacgcacggt cgagctgacc catttcgcct 21240
acccggctct gaacctgccc cccaaagtca tgagcgcggt catggaccag gtgctcatca 21300
agcgcgcgtc gcccatctcc gaggacgagg gcatgcaaga ctccgaggag ggcaagcccg 21360
tggtcagcga cgagcagctg gcccggtggc tgggtcctaa tgctagtccc cagagtttgg 21420
aagagcggcg caaactcatg atggccgtgg tcctggtgac cgtggagctg gagtgcctgc 21480
gccgcttctt cgccgacgcg gagaccctgc gcaaggtcga ggagaacctg cactacctct 21540
tcaggcacgg gttcgtgcgc caggcctgca agatctccaa cgtggagctg accaacctgg 21600
tctcctacat gggcatcttg cacgagaacc gcctggggca gaacgtgctg cacaccaccc 21660
tgcgcgggga ggcccggcgc gactacatcc gcgactgcgt ctacctctac ctctgccaca 21720
cctggcagac gggcatgggc gtgtggcagc agtgtctgga ggagcagaac ctgaaagagc 21780
tctgcaagct cctgcagaag aacctcaagg gtctgtggac cgggttcgac gagcgcacca 21840
ccgcctcgga cctggccgac ctcattttcc ccgagcgcct caggctgacg ctgcgcaacg 21900
gcctgcccga ctttatgagc caaagcatgt tgcaaaactt tcgctctttc atcctcgaac 21960
gctccggaat cctgcccgcc acctgctccg cgctgccctc ggacttcgtg ccgctgacct 22020
tccgcgagtg ccccccgccg ctgtggagcc actgctacct gctgcgcctg gccaactacc 22080
tggcctacca ctcggacgtg atcgaggacg tcagcggcga gggcctgctc gagtgccact 22140
gccgctgcaa cctctgcacg ccgcaccgct ccctggcctg caacccccag ctgctgagcg 22200
agacccagat catcggcacc ttcgagttgc aagggcccag cgaaggcgag ggttcagccg 22260
ccaagggggg tctgaaactc accccggggc tgtggacctc ggcctacttg cgcaagttcg 22320
tgcccgagga ctaccatccc ttcgagatca ggttctacga ggaccaatcc catccgccca 22380
aggccgagct gtcggcctgc gtcatcaccc agggggcgat cctggcccaa ttgcaagcca 22440
tccagaaatc ccgccaagaa ttcttgctga aaaagggccg cggggtctac ctcgaccccc 22500
agaccggtga ggagctcaac cccggcttcc cccaggatgc cccgaggaaa caagaagctg 22560
aaagtggagc tgccgcccgt ggaggatttg gaggaagact gggagaacag cagtcaggca 22620
gaggaggagg agatggagga agactgggac agcactcagg cagaggagga cagcctgcaa 22680
gacagtctgg aggaagacga ggaggaggca gaggaggagg tggaagaagc agccgccgcc 22740
agaccgtcgt cctcggcggg ggagaaagca agcagcacgg ataccatctc cgctccgggt 22800
cggggtcccg ctcgaccaca cagtagatgg gacgagaccg gacgattccc gaaccccacc 22860
acccagaccg gtaagaagga gcggcaggga tacaagtcct ggcgggggca caaaaacgcc 22920
atcgtctcct gcttgcaggc ctgcgggggc aacatctcct tcacccggcg ctacctgctc 22980
ttccaccgcg gggtgaactt tccccgcaac atcttgcatt actaccgtca cctccacagc 23040
ccctactact tccaagaaga ggcagcagca gcagaaaaag accagcagaa aaccagcagc 23100
tagaaaatcc acagcggcgg cagcaggtgg actgaggatc gcggcgaacg agccggcgca 23160
aacccgggag ctgaggaacc ggatctttcc caccctctat gccatcttcc agcagagtcg 23220
ggggcaggag caggaactga aagtcaagaa ccgttctctg cgctcgctca cccgcagttg 23280
tctgtatcac aagagcgaag accaacttca gcgcactctc gaggacgccg aggctctctt 23340
caacaagtac tgcgcgctca ctcttaaaga gtagcccgcg cccgcccagt cgcagaaaaa 23400
ggcgggaatt acgtcacctg tgcccttcgc cctagccgcc tccacccatc atcatgagca 23460
aagagattcc cacgccttac atgtggagct accagcccca gatgggcctg gccgccggtg 23520
ccgcccagga ctactccacc cgcatgaatt ggctcagcgc cgggcccgcg atgatctcac 23580
gggtgaatga catccgcgcc caccgaaacc agatactcct agaacagtca gcgctcaccg 23640
ccacgccccg caatcacctc aatccgcgta attggcccgc cgccctggtg taccaggaaa 23700
ttccccagcc cacgaccgta ctacttccgc gagacgccca ggccgaagtc cagctgacta 23760
actcaggtgt ccagctggcg ggcggcgcca ccctgtgtcg tcaccgcccc gctcagggta 23820
taaagcggct ggtgatccgg ggcagaggca cacagctcaa cgacgaggtg gtgagctctt 23880
cgctgggtct gcgacctgac ggagtcttcc aactcgccgg atcggggaga tcttccttca 23940
cgcctcgtca ggccgtcctg actttggaga gttcgtcctc gcagccccgc tcgggtggca 24000
tcggcactct ccagttcgtg gaggagttca ctccctcggt ctacttcaac cccttctccg 24060
gctcccccgg ccactacccg gacgagttca tcccgaactt cgacgccatc agcgagtcgg 24120
tggacggcta cgattgagtt taaactcacc cccttatcca gtgaaataaa gatcatattg 24180
atgatgattt tacagaaata aaaaataatc atttgatttg aaataaagat acaatcatat 24240
tgatgatttg agtttaacaa aaaaataaag aatcacttac ttgaaatctg ataccaggtc 24300
tctgtccatg ttttctgcca acaccacttc actcccctct tcccagctct ggtactgcag 24360
gccccggcgg gctgcaaact tcctccacac gctgaagggg atgtcaaatt cctcctgtcc 24420
ctcaatcttc attttatctt ctatcagatg tccaaaaagc gcgtccgggt ggatgatgac 24480
ttcgaccccg tctaccccta cgatgcagac aacgcaccga ccgtgccctt catcaacccc 24540
cccttcgtct cttcagatgg attccaagag aagcccctgg gggtgttgtc cctgcgactg 24600
gccgaccccg tcaccaccaa gaacggggaa atcaccctca agctgggaga gggggtggac 24660
ctcgattcct cgggaaaact catctccaac acggccacca aggccgccgc ccctctcagt 24720
ttttccaaca acaccatttc ccttaacatg gatcacccct tttacactaa agatggaaaa 24780
ttatccttac aagtttctcc accattaaat atactgagaa caagcattct aaacacacta 24840
gctttaggtt ttggatcagg tttaggactc cgtggctctg ccttggcagt acagttagtc 24900
tctccactta catttgatac tgatggaaac ataaagctta ccttagacag aggtttgcat 24960
gttacaacag gagatgcaat tgaaagcaac ataagctggg ctaaaggttt aaaatttgaa 25020
gatggagcca tagcaaccaa cattggaaat gggttagagt ttggaagcag tagtacagaa 25080
acaggtgttg atgatgctta cccaatccaa gttaaacttg gatctggcct tagctttgac 25140
agtacaggag ccataatggc tggtaacaaa gaagacgata aactcacttt gtggacaaca 25200
cctgatccat caccaaactg tcaaatactc gcagaaaatg atgcaaaact aacactttgc 25260
ttgactaaat gtggtagtca aatactggcc actgtgtcag tcttagttgt aggaagtgga 25320
aacctaaacc ccattactgg caccgtaagc agtgctcagg tgtttctacg ttttgatgca 25380
aacggtgttc ttttaacaga acattctaca ctaaaaaaat actgggggta taggcaggga 25440
gatagcatag atggcactcc atataccaat gctgtaggat tcatgcccaa tttaaaagct 25500
tatccaaagt cacaaagttc tactactaaa aataatatag tagggcaagt atacatgaat 25560
ggagatgttt caaaacctat gcttctcact ataaccctca atggtactga tgacagcaac 25620
agtacatatt caatgtcatt ttcatacacc tggactaatg gaagctatgt tggagcaaca 25680
tttggggcta actcttatac cttctcatac atcgcccaag aatgaacact gtatcccacc 25740
ctgcatgcca acccttccca ccccactctg tggaacaaac tctgaaacac aaaataaaat 25800
aaagttcaag tgttttattg attcaacagt ttcacagaac cctagtattc aacctgccac 25860
ctccctccca acacacagag tacacagtcc tttctccccg gctggcctta aaaagcatca 25920
tatcatgggt aacagacata ttcttaggtg ttatattcca cacggtttcc tgtcgagcca 25980
aacgctcatc agtgatatta ataaactccc cgggcagctc acttaagttc atgtcgctgt 26040
ccagctgctg agccacaggc tgctgtccaa cttgcggttg cttaacgggc ggcgaaggag 26100
aagtccacgc ctacatgggg gtagagtcat aatcgtgcat caggataggg cggtggtgct 26160
gcagcagcgc gcgaataaac tgctgccgcc gccgctccgt cctgcaggaa tacaacatgg 26220
cagtggtctc ctcagcgatg attcgcaccg cccgcagcat aaggcgcctt gtcctccggg 26280
cacagcagcg caccctgatc tcacttaaat cagcacagta actgcagcac agcaccacaa 26340
tattgttcaa aatcccacag tgcaaggcgc tgtatccaaa gctcatggcg gggaccacag 26400
aacccacgtg gccatcatac cacaagcgca ggtagattaa gtggcgaccc ctcataaaca 26460
cgctggacat aaacattacc tcttttggca tgttgtaatt caccacctcc cggtaccata 26520
taaacctctg attaaacatg gcgccatcca ccaccatcct aaaccagctg gccaaaacct 26580
gcccgccggc tatacactgc agggaaccgg gactggaaca atgacagtgg agagcccagg 26640
actcgtaacc atggatcatc atgctcgtca tgatatcaat gttggcacaa cacaggcaca 26700
cgtgcataca cttcctcagg attacaagct cctcccgcgt tagaaccata tcccagggaa 26760
caacccattc ctgaatcagc gtaaatccca cactgcaggg aagacctcgc acgtaactca 26820
cgttgtgcat tgtcaaagtg ttacattcgg gcagcagcgg atgatcctcc agtatggtag 26880
cgcgggtttc tgtctcaaaa ggaggtagac gatccctact gtacggagtg cgccgagaca 26940
accgagatcg tgttggtcgt agtgtcatgc caaatggaac gccggacgta gtcatatttc 27000
ctgaagcaaa accaggtgcg ggcgtgacaa acagatctgc gtctccggtc tcgccgctta 27060
gatcgctctg tgtagtagtt gtagtatatc cactctctca aagcatccag gcgccccctg 27120
gcttcgggtt ctatgtaaac tccttcatgc gccgctgccc tgataacatc caccaccgca 27180
gaataagcca cacccagcca acctacacat tcgttctgcg agtcacacac gggaggagcg 27240
ggaagagctg gaagaaccat gattaacttt attccaaacg gtctcggagc acttcaaaat 27300
gcaggtcccg gaggtggcac ctctcgcccc cactgtgttg gtggaaaata acagccaggt 27360
caaaggtgac acggttctcg agatgttcca cggtggcttc cagcaaagcc tccacgcgca 27420
catccagaaa caagaggaca gcgaaagcgg gagcgttttc taattcctca atcatcatat 27480
tacactcctg caccatcccc agataatttt catttttcca gccttgaatg attcgtatta 27540
gttcctgagg taaatccaag ccagccatga taaaaagctc gcgcagagcg ccctccaccg 27600
gcattcttaa gcacaccctc ataattccaa gagattctgc tcctggttca cctgcagcag 27660
attaacaatg ggaatatcaa aatctctgcc gcgatcccta agctcctccc tcaacaataa 27720
ctgtatgtaa tctttcatat catctccgaa atttttagcc atagggccgc caggaataag 27780
agcagggcaa gccacattac agataaagcg aagtcctccc cagtgwgcat tgccaaatgt 27840
aagattgaaa taagcatgct ggctagaccc tgtgatatct tccagataac tggacagaaa 27900
atcaggcaag caatttttaa gaaaatcaac aaaagaaaag tcgtccaggt gcaggtttag 27960
agcctcagga acaacgatgg aataagtgca aggagtgcgt tccagcatgg ttagtgtttt 28020
tttggtgatc tgtagaacaa aaaataaaca tgcaatatta aaccatgcta gcctggcgaa 28080
caggtgggta aatcactctt tccagcacca ggcaggctac ggggtctccg gcgcgaccct 28140
cgtagaagct gtcgccatga ttgaaaagca tcaccgagag accttcccgg tggccggcat 28200
ggatgattcg agaagaagca tacactccgg gaacattggc atccgtgagt gaaaaaaagc 28260
gacctataaa gcctcggggc actacaatgc tcaatctcaa ttccagcaaa gccaccccat 28320
gcggatggag cacaaaattg gcaggtgcgt aaaaaatgta attactcccc tcctgcacag 28380
gcagcaaagc ccccgctccc tccagaaaca catacaaagc ctcagcgtcc atagcttacc 28440
gagcacggca ggcgcaagag tcagagaaaa ggctgagctc taacctgact gcccgctcct 28500
gtgctcaata tatagcccta acctacactg acgtaaaggc caaagtctaa aaatacccgc 28560
caaataatca cacacgccca gcacacgccc agaaaccggt gacacactca aaaaaatacg 28620
cgcacttcct caaacgccca aaactgccgt catttccggg ttcccacgct acgtcatcaa 28680
aacacgactt tcaaattccg tcgaccgtta aaaacgtcac ccgccccgcc cctaacggtc 28740
gcccgtctct cagccaatca gcgccccgca tccccaaatt caaacacctc atttgcatat 28800
taacgcgcac aaaaagtttg aggtatatta ttgatgatgg ttaattaagg atccttctat 28860
agtgtcacct aaatgtcgac ggccaggcgg ccgccaggcc tacccactag tcaattcggg 28920
aggatcgaaa cggcagatcg caaaaaacag tacatacaga aggagacatg aacatgaaca 28980
tcaaaaaaat tgtaaaacaa gccacagttc tgacttttac gactgcactt ctggcaggag 29040
gagcgactca agccttcgcg aaagaaaata accaaaaagc atacaaagaa acgtacggcg 29100
tctctcatat tacacgccat gatatgctgc agatccctaa acagcagcaa aacgaaaaat 29160
accaagtgcc tcaattcgat caatcaacga ttaaaaatat tgagtctgca aaaggacttg 29220
atgtgtggga cagctggccg ctgcaaaacg ctgacggaac agtagctgaa tacaacggct 29280
atcacgttgt gtttgctctt gcgggaagcc cgaaagacgc tgatgacaca tcaatctaca 29340
tgttttatca aaaggtcggc gacaactcaa tcgacagctg gaaaaacgcg ggccgtgtct 29400
ttaaagacag cgataagttc gacgccaacg atccgatcct gaaagatcag acgcaagaat 29460
ggtccggttc tgcaaccttt acatctgacg gaaaaatccg tttattctac actgactatt 29520
ccggtaaaca ttacggcaaa caaagcctga caacagcgca ggtaaatgtg tcaaaatctg 29580
atgacacact caaaatcaac ggagtggaag atcacaaaac gatttttgac ggagacggaa 29640
aaacatatca gaacgttcag cagtttatcg atgaaggcaa ttatacatcc ggcgacaacc 29700
atacgctgag agaccctcac tacgttgaag acaaaggcca taaatacctt gtattcgaag 29760
ccaacacggg aacagaaaac ggataccaag gcgaagaatc tttatttaac aaagcgtact 29820
acggcggcgg cacgaacttc ttccgtaaag aaagccagaa gcttcagcag agcgctaaaa 29880
aacgcgatgc tgagttagcg aacggcgccc tcggtatcat agagttaaat aatgattaca 29940
cattgaaaaa agtaatgaag ccgctgatca cttcaaacac ggtaactgat gaaatcgagc 30000
gcgcgaatgt tttcaaaatg aacggcaaat ggtacttgtt cactgattca cgcggttcaa 30060
aaatgacgat cgatggtatt aactcaaacg atatttacat gcttggttat gtatcaaact 30120
ctttaaccgg cccttacaag ccgctgaaca aaacagggct tgtgctgcaa atgggtcttg 30180
atccaaacga tgtgacattc acttactctc acttcgcagt gccgcaagcc aaaggcaaca 30240
atgtggttat cacaagctac atgacaaaca gaggcttctt cgaggataaa aaggcaacat 30300
ttgcgccaag cttcttaatg aacatcaaag gcaataaaac atccgttgtc aaaaacagca 30360
tcctggagca aggacagctg acagtcaact aataacagca aaaagaaaat gccgatactt 30420
cattggcatt ttcttttatt tctcaacaag atggtgaatt gactagtggg tagatccaca 30480
ggacgggtgt ggtcgccatg atcgcgtagt cgatagtggc tccaagtagc gaagcgagca 30540
ggactgggcg gcggccaaag cggtcggaca gtgctccgag aacgggtgcg catagaaatt 30600
gcatcaacgc atatagcgct agcagcacgc catagtgact ggcgatgctg tcggaatgga 30660
cgatatcccg caagaggccc ggcagtaccg gcataaccaa gcctatgcct acagcatcca 30720
gggtgacggt gccgaggatg acgatgagcg cattgttaga tttcatacac ggtgcctgac 30780
tgcgttagca atttaactgt gataaactac cgcattaaag cttatcgatg ataagctgtc 30840
aaacatgaga attgatccgg aacccttaat ataacttcgt ataatgtatg ctatacgaag 30900
ttattaggtc cctcgactat agggtcaccg tcgacagcga cacacttgca tcggatgcag 30960
cccggttaac gtgccggcac ggcctgggta accaggtatt ttgtccacat aaccgtgcgc 31020
aaaatgttgt ggataagcag gacacagcag caatccacag caggcataca accgcacacc 31080
gaggttactc cgttctacag gttacgacga catgtcaata cttgcccttg acaggcattg 31140
atggaatcgt agtctcacgc tgatagtctg atcgacaata caagtgggac cgtggtccca 31200
gaccgataat cagaccgaca acacgagtgg gatcgtggtc ccagactaat aatcagaccg 31260
acgatacgag tgggaccgtg gtcccagact aataatcaga ccgacgatac gagtgggacc 31320
gtggttccag actaataatc agaccgacga tacgagtggg accgtggtcc cagactaata 31380
atcagaccga cgatacgagt gggaccatgg tcccagacta ataatcagac cgacgatacg 31440
agtgggaccg tggtcccagt ctgattatca gaccgacgat acgagtggga ccgtggtccc 31500
agactaataa tcagaccgac gatacgagtg ggaccgtggt cccagactaa taatcagacc 31560
gacgatacga gtgggaccgt ggtcccagtc tgattatcag accgacgata caagtggaac 31620
agtgggccca gagagaatat tcaggccagt tatgctttct ggcctgtaac aaaggacatt 31680
aagtaaagac agataaacgt agactaaaac gtggtcgcat cagggtgctg gcttttcaag 31740
ttccttaaga atggcctcaa ttttctctat acactcagtt ggaacacgag acctgtccag 31800
gttaagcacc attttatcgc ccttatacaa tactgtcgct ccaggagcaa actgatgtcg 31860
tgagcttaaa ctagttcttg atgcagatga cgttttaagc acagaagtta aaagagtgat 31920
aacttcttca gcttcaaata tcaccccagc ttttttctgc tcatgaaggt tagatgcctg 31980
ctgcttaagt aattcctctt tatctgtaaa ggctttttga agtgcatcac ctgaccgggc 32040
agatagttca ccggggtgag aaaaaagagc aacaactgat ttaggcaatt tggcggtgtt 32100
gatacagcgg gtaataatct tacgtgaaat attttccgca tcagccagcg cagaaatatt 32160
tccagcaaat tcattctgca atcggcttgc ataacgctga ccacgttcat aagcacttgt 32220
tgggcgataa tcgttaccca atctggataa tgcagccatc tgctcatcat ccagctcgcc 32280
aaccagaaca cgataatcac tttcggtaag tgcagcagct ttacgacggc gactcccatc 32340
ggcaatttct atgacaccag atactcttcg accgaacgcc ggtgtctgtt gaccagtcag 32400
tagaaaagaa gggatgagat catccagtgc gtcctcagta agcagctcct ggtcacgttc 32460
attacctgac catacccgag aggtcttctc aacactatca ccccggagca cttcaagagt 32520
aaacttcaca tcccgaccac atacaggcaa agtaatggca ttaccgcgag ccattactcc 32580
tacgcgcgca attaacgaat ccaccatcgg ggcagctggt gtcgataacg aagtatcttc 32640
aaccggttga gtattgagcg tatgttttgg aataacaggc gcacgcttca ttatctaatc 32700
tcccagcgtg gtttaatcag acgatcgaaa atttcattgc agacaggttc ccaaatagaa 32760
agagcatttc tccaggcacc agttgaagag cgttgatcaa tggcctgttc aaaaacagtt 32820
ctcatccgga tctgaccttt accaacttca tccgtttcac gtacaacatt ttttagaacc 32880
atgcttcccc aggcatcccg aatttgctcc tccatccacg gggactgaga gccattacta 32940
ttgctgtatt tggtaagcaa aatacgtaca tcaggctcga accctttaag atcaacgttc 33000
ttgagcagat cacgaagcat atcgaaaaac tgcagtgcgg aggtgtagtc aaacaactca 33060
gcaggcgtgg gaacaatcag cacatcagca gcacatacga cattaatcgt gccgataccc 33120
aggttaggcg cgctgtcaat aactatgaca tcatagtcat gagcaacagt ttcaatggcc 33180
agtcggagca tcaggtgtgg atcggtgggc agtttacctt catcaaattt gcccattaac 33240
tcagtttcaa tacggtgcag agccagacag gaaggaataa tgtcaagccc cggccagcaa 33300
gtgggcttta ttgcataagt gacatcgtcc ttttccccaa gatagaaagg caggagagtg 33360
tcttctgcat gaatatgaag atctggtacc catccgtgat acattgaggc tgttccctgg 33420
gggtcgttac cttccacgag caaaacacgt agccccttca gagccagatc ctgagcaaga 33480
tgaacagaaa ctgaggtttt gtaaacgcca cctttatggg cagcaacccc gatcaccggt 33540
ggaaatacgt cttcagcacg tcgcaatcgc gtaccaaaca catcacgcat atgattaatt 33600
tgttcaattg tataaccaac acgttgctca acccgtcctc gaatttccat atccgggtgc 33660
ggtagtcgcc ctgctttctc ggcatctctg atagcctgag aagaaacccc aactaaatcc 33720
gctgcttcac ctattctcca gcgccgggtt attttcctcg cttccgggct gtcatcatta 33780
aactgtgcaa tggcgatagc cttcgtcatt tcatgaccag cgtttatgca ctggttaagt 33840
gtttccatga gtttcattct gaacatcctt taatcattgc tttgcgtttt tttattaaat 33900
cttgcaattt actgcaaagc aacaacaaaa tcgcaaagtc atcaaaaaac cgcaaagttg 33960
tttaaaataa gagcaacact acaaaaggag ataagaagag cacatacctc agtcacttat 34020
tatcactagc gctcgccgca gccgtgtaac cgagcatagc gagcgaactg gcgaggaagc 34080
aaagaagaac tgttctgtca gatagctctt acgctcagcg caagaagaaa tatccaccgt 34140
gggaaaaact ccaggtagag gtacacacgc ggatagccaa ttcagagtaa taaactgtga 34200
taatcaaccc tcatcaatga tgacgaacta acccccgata tcaggtcaca tgacgaaggg 34260
aaagagaagg aaatcaactg tgacaaactg ccctcaaatt tggcttcctt aaaaattaca 34320
gttcaaaaag tatgagaaaa tccatgcagg ctgaaggaaa cagcaaaact gtgacaaatt 34380
accctcagta ggtcagaaca aatgtgacga accaccctca aatctgtgac agataaccct 34440
cagactatcc tgtcgtcatg gaagtgatat cgcggaagga aaatacgata tgagtcgtct 34500
ggcggccttt ctttttctca atgtatgaga ggcgcattgg agttctgctg ttgatctcat 34560
taacacagac ctgcaggaag cggcggcgga agtcaggcat acgctggtaa ctttgaggca 34620
gctggtaacg ctctatgatc cagtcgattt tcagagagac gatgcctgag ccatccggct 34680
tacgatactg acacagggat tcgtataaac gcatggcata cggattggtg atttcttttg 34740
tttcactaag ccgaaactgc gtaaaccggt tctgtaaccc gataaagaag ggaatgagat 34800
atgggttgat atgtacactg taaagccctc tggatggact gtgcgcacgt ttgataaacc 34860
aaggaaaaga ttcatagcct ttttcatcgc cggcatcctc ttcagggcga taaaaaacca 34920
cttccttccc cgcgaaactc ttcaatgcct gccgtatatc cttactggct tccgcagagg 34980
tcaatccgaa tatttcagca tatttagcaa catggatctc gcagataccg tcatgttcct 35040
gtagggtgcc atcagatttt ctgatctggt caacgaacag atacagcata cgtttttgat 35100
cccgggagag actatatgcc gcctcagtga ggtcgtttga ctggacgatt cgcgggctat 35160
ttttacgttt cttgtgattg ataaccgctg tttccgccat gacagatcca tgtgaagtgt 35220
gacaagtttt tagattgtca cactaaataa aaaagagtca ataagcaggg ataactttgt 35280
gaaaaaacag cttcttctga gggcaatttg tcacagggtt aagggcaatt tgtcacagac 35340
aggactgtca tttgagggtg atttgtcaca ctgaaagggc aatttgtcac aacaccttct 35400
ctagaaccag catggataaa ggcctacaag gcgctctaaa aaagaagatc taaaaactat 35460
aaaaaaaata attataaaaa tatccccgtg gataagtgga taaccccaag ggaagttttt 35520
tcaggcatcg tgtgtaagca gaatatataa gtgctgttcc ctggtgcttc ctcgctcact 35580
cgagggcttc gccctgtcgc tcaactgcgg cgagcactac tggctgtaaa aggacagacc 35640
acatcatggt tctgtgttca ttaggttgtt ctgtccattg ctgacataat ccgctccact 35700
tcaacgtaac accgcacgaa gatttctatt gttcctgaag gcatattcaa atcgttttcg 35760
ttaccgcttg caggcatcat gacagaacac tacttcctat aaacgctaca caggctcctg 35820
agattaataa tgcggatctc tacgataatg ggagattttc ccgactgttt cgttcgcttc 35880
tcagtggata acagccagct tctctgttta acagacaaaa acagcatatc cactcagttc 35940
cacatttcca tataaaggcc aaggcattta ttctcaggat aattgtttca gcatcgcaac 36000
cgcatcagac tccggcatcg caaactgcac ccggtgccgg gcagccacat ccagcgcaaa 36060
aaccttcgtg tagacttccg ttgaactgat ggacttatgt cccatcaggc tttgcagaac 36120
tttcagcggt ataccggcat acagcatgtg catcgcatag gaatggcgga acgtatgtgg 36180
tgtgaccgga acagagaacg tcacaccgtc agcagcagcg gcggcaaccg cctccccaat 36240
ccaggtcctg accgttctgt ccgtcacttc ccagatccgc gctttctctg tccttcctgt 36300
gcgacggtta cgccgctcca tgagcttatc gcgaataaat acctgtgacg gaagatcact 36360
tcgcagaata aataaatcct ggtgtccctg ttgataccgg gaagccctgg gccaactttt 36420
ggcgaaaatg agacgttgat cggcacgtaa gaggttccaa ctttcaccat aatgaaataa 36480
gatcactacc gggcgtattt tttgagttat cgagattttc aggagctaag gaagctaaaa 36540
tggagaaaaa aatcactgga tataccaccg ttgatatatc ccaatggcat cgtaaagaac 36600
attttgaggc atttcagtca gttgctcaat gtacctataa ccagaccgtt cagctggata 36660
ttacggcctt tttaaagacc gtaaagaaaa ataagcacaa gttttatccg gcctttattc 36720
acattcttgc ccgcctgatg aatgctcatc cggagttccg tatggcaatg aaagacggtg 36780
agctggtgat atgggatagt gttcaccctt gttacaccgt tttccatgag caaactgaaa 36840
cgttttcatc gctctggagt gaataccacg acgatttccg gcagtttcta cacatatatt 36900
cgcaagatgt ggcgtgttac ggtgaaaacc tggcctattt ccctaaaggg tttattgaga 36960
atatgttttt cgtctcagcc aatccctggg tgagtttcac cagttttgat ttaaacgtgg 37020
ccaatatgga caacttcttc gcccccgttt tcaccatggg caaatattat acgcaaggcg 37080
acaaggtgct gatgccgctg gcgattcagg ttcatcatgc cgtttgtgat ggcttccatg 37140
tcggcagaat gcttaatgaa ttacaacagt actgcgatga gtggcagggc ggggcgtaat 37200
ttttttaagg cagttattgg tgcccttaaa cgcctggttg ctacgcctga ataagtgata 37260
ataagcggat gaatggcaga aattcgatga taagctgtca aacatgagaa ttggtcgacg 37320
gcgcgccaaa gcttgcatgc ctgcagccgc gtaacctggc aaaatcggtt acggttgagt 37380
aataaatgga tgccctgcgt aagcggggca catttcatta cctctttctc cgcacccgac 37440
atagataata acttcgtata gtatacatta tacgaagtta tctagtagac 37490
<210> 69
<211> 143055
<212> DNA
<213> 人工序列
<220>
<223> 病毒构建体
<400> 69
tatttattta gtgtctagaa aaaaatgtgt gaccaacgac cgtaggaaac tctagagggt 60
aagaaaaatc aatcgcttta tagagaccat cagaaagagg tttaatattt ttgtgagacc 120
atcgaaggag aaagagataa aactttttta cgactccatc agaaagaggt ttaatatttt 180
tgtgagacca tcgaagagag aaagagataa aactttttta cgactccatc agaaagaggt 240
ttaatatttt tgtgagacca tcgaagagag aaagagataa aactttttta cgactccatc 300
agaaagaggt ttaatatttt tgtgagacca tcgaaggaga aagagataaa acttttttac 360
gactccatca gaaagaggtt taatattttt gtgagaccat cgaaggagaa agagataaaa 420
cttttttacg actccatcag aaagaggttt aatatttttg tgagaccatc gaaggagaaa 480
gagataaaac ttttttacga ctccatcaga aagaggttta atatttttgt gagaccatcg 540
aagagagaaa gagataaaac ttttttacga ctccatcaga aagaggttta atatttttgt 600
gagaccatcg aaggagaaag agataaaact tttttacgac tccatcagaa agaggtttaa 660
tatttttgtg agaccatcga agagagaaag agaataaaaa tattttagtg acaccatcag 720
aaagaggttt aatatttttg tgagaccatc gaagagagaa agagataaaa cttttttacg 780
actccatcag aaagaggttt aatatttttg tgagaccatc gaaggagaaa gagataaaac 840
ttttttacga ctccatcaga aagaggttta atatttttgt gagaccatcg aagagagaaa 900
gagataaaac ttttttacga ctccatcaga aagaggttta atatttttgt gagaccatcg 960
aagagagaaa gagataaaac ttttttacga ctccatcaga aagaccatcg aagagagaaa 1020
gagaaagaga tagttagtct agatattttt cttagtacaa aagtcaatgt tttaaaatat 1080
atggacaaga atttgtctgt ataaaaactt gtgtgaaatt ttgtaccaaa gaaaaaatgt 1140
gagcagtatc ccctacatgg attttactag atcatttata taccaaaaaa tattatacga 1200
tctacgtttt attatatgat tttaacgtgt aaattataaa cattatttta tgatatacaa 1260
ttgtctggta acctagatgg gcatagggga tgagtatatg ttgttggacg ttattgttta 1320
agaaatagtt gatgcatcag aaagaggttt aatatttttg tgagaccatc gaagagagaa 1380
agagataaaa cttttttacg actccatcag aaagaggttt aatatttttg tgagaccatc 1440
gaagagagaa agagataaaa cttttttatg actccattga agagagaatg agaataaaaa 1500
tattttagtg acaccatcag aaagaggttt aatatttttt atgagaccat caaagagaga 1560
aagagaataa aaatatttta tgactccatt gaagagagaa agagaaaatg agaataaaaa 1620
tattttagtg acaccatcag aaagaggttt aatatttttt atgagaccat caaagagaga 1680
aagagaataa aaatattttt gtaaaacttt ttttatgaga ccatcaaaga gagaaagaga 1740
ataaaaatat ttttgtaaaa ctttttttat gagaccatca aagagagaaa gagaataaaa 1800
atatttttgt aaaacttttt ttatgagacc atcaaagaga gaaagagaat aaaaatattt 1860
ttgtaaaact ttttttatga gaccatcaaa gagagaaaga gaataaaaat atttttgtaa 1920
aacttttttt atgagaccat caaagagaga aagagaataa aaatattttt gtaaaacttt 1980
ttttatgaga ccatcaaaga gagaaagaga ataaaaatat ttttgtaaaa ctttttttat 2040
gagaccatca aagagagaaa gagaataaaa atattttatg actccattga agagagaaag 2100
agaataaaaa tattttagtg acaccatcag aaagaggttt aatatttttg tgagaccatc 2160
gaagagagaa agagaataaa aatattttat gactccattg aagagagaaa gagaataaaa 2220
atattttagt gacaccatca gaaagaggtt taatattttt tatgagacca tcaaagagag 2280
aaagagaata aaaatatttt tgtaaaactt tttttatgag accatcaaag agagaaagag 2340
aataaaaata tttttgtaaa acttttttta tgagaccatc aaagagagaa agagaataaa 2400
aatatttttg taaaactttt tttatgagac catcaaagag agaaagagaa taaaaatatt 2460
tttgtaaaac tttttttatg agaccatcaa agagagaaag agaataaaaa tatttttgta 2520
aaactttttt tatgagacca tcaaagagag aaagagaata aaaatatttt atgactccat 2580
tgaagagaga atgagaataa aaatatttta gtgacaccat cagaaagagg tttaatattt 2640
ttgtgagacc atcgaagaga gaaagagaat aaaaatattt tatgactcca ttgaagagag 2700
aaagagaata aaaatatttt agtgacacca tcagaaagag gtttaatatt ttttatgaga 2760
ccatcaaaga gagaaagaga ataaaaatat ttttgtaaaa ctttttttat gagaccatca 2820
aagagagaaa gagaataaaa atatttttgt aaaacttttt ttatgagacc atcaaagaga 2880
gaaagagaat aaaaatattt ttgtaaaact ttttttatga gaccatcaaa gagagaaaga 2940
gaataaaaat atttttgtaa aacttttttt atgagaccat caaagagaga aagagaataa 3000
aaatattttt gtaaaacttt ttttatgaga ccatcaaaga gagaaagaga ataaaaatat 3060
ttttgtaaaa ctttttttat gagaccatca aagagagaaa gagaataaaa atatttttgt 3120
aaaacttttt ttatgagacc atcaaagaga gaaagagaat aaaaatattt tatgactcca 3180
ttgaagagag aatgagaata aaaatatttt agtgacacca tcagaaagag gtttaatatt 3240
tttgtgagac catcgaagag agaaagagaa taaaaatatt ttatgactcc attgaagaga 3300
gaatgagaat aaaaatattt tagtgacacc atcagaaaga ggtttaatat tttttatgag 3360
accatcaaag agagaaagag aataaaaata tttttgtaaa acttttttta tgagaccatc 3420
aaagagagaa agagaataaa aatatttttg taaaaattat aaacattatt ttatgatata 3480
caattgtctg gtaacctaga tgggcatagg ggatgttgat aagctcgacg agtatatgtt 3540
gttggacgtt attgtttaag aaatagttga tgcatcagaa agagaataaa aaatatttta 3600
gtgagaccat cgaagagaga aagagataaa acttttttac gactccatca gaaagaggtt 3660
taatattttt gtgagaccat cgaagagaga aagagataaa acttttttac gactccatca 3720
gaaagaggtt taatattttt gtgagaccat cgaaggagaa agagataaaa cttttttacg 3780
actccatcag aaagaggttt aatatttttg tgagaccatc aaagagagaa agagaataaa 3840
aatatttttg taaaactttt tttatgagac catcaaagag agaaagagaa taaaaatatt 3900
tttgtaaaac tttttttatg agaccatcaa agagagaaag agaataaaaa tatttttgta 3960
aaactttttt tatgagacca tcaaagagag aaagagaata aaaatatttt tgtaaaactt 4020
tttttatgag accatcaaag agagaaagag aataaaaata ttttatgact ccattgaaga 4080
gagaatgaga ataaaaatat tttagtgaca ccatcagaaa gaggtttaat atttttgtga 4140
gaccatcgaa gagagaaaga gaataaaaat attttatgac tccattgaag agagaatgag 4200
aataaaaata ttttagtgac accatcagaa agaggtttaa tattttttat gagaccatca 4260
aagagagaaa gagaataaaa atatttttgt aaaacttttt ttatgagacc atcaaagaga 4320
gaaagagaat aaaaatattt ttgtaaaact ttttttatga gaccatcaaa gagagaaaga 4380
gaataaaaat atttttgtaa aacttttttt atgagaccat caaagagaga aagagaataa 4440
aaatattttt gtaaaacttt ttttatgaga ccatcaaaga gagaaagaga ataaaaatat 4500
tttatgactc cattgaagag agaatgagaa taaaaatatt ttagtgacac catcagaaag 4560
aggtttaata tttttgtgag accatcgaag agagaaagag aataaaaata ttttatgact 4620
ccattgaaga gagaaagaga ataaaaatat tttagtgaca ccatcagaaa gaggtttaat 4680
attttttatg agaccatcaa agagagaaag agaataaaaa tatttttgta aaactttttt 4740
tatgagacca tcaaagagag aaagagaata aaaatatttt atgactccat tgaagagaga 4800
atgagaataa aaatatttta gtgacaccat cagaaagagg tttaatattt ttgtgagacc 4860
atcgaagaga gaaagagaat aaaaatattt tatgactcca ttgaagagag aatgagaata 4920
aaaatatttt agtgacacca tcagaaagag gtttaatatt tttgtgagac catcgaagag 4980
agaaagagaa taaaaatatt ttatgactcc attgaagaga gaatgagaat aaaaatattt 5040
tagtgacacc atcagaaaga ggtttaatat tttttatgag accatcaaag agagaaagag 5100
aataaaaata tttttgtaaa acttttttta tgagaccatc aaagagagaa agagaataaa 5160
aatatttttg taaaactttt tttatgagac catcaaagag agaaagagaa taaaaatatt 5220
tttgtaaaac tttttttatg agaccatcaa agagagaaag agaataaaaa tatttttgtg 5280
agaccatcaa agagagaaag agaataaaaa tatttttgtg agaccatcaa agagagaaag 5340
agaataaaaa tatttttgtg agaccatcaa agagagaaag agaataaaaa tattttatga 5400
ctccattgaa gagagaaaga gaataaaaat attttagtga caccatcaga aagaggttta 5460
atatttttgt gagaccatcg aagagagaaa gagaataaaa atattttatg actccattga 5520
agagagaaag agaataaaaa tattttagtg acaccatcag aaagaggttt aatatttttt 5580
atgagaccat caaagagaga aagagaataa aaatattttt tatgagacca tcaaagagag 5640
aaagagaata aaaatatttt atgagaccat caaagagaga aagagaataa aaatattttt 5700
tatgagacca tcaaagagag aaagagaata aaaatatttt atgagaccat caaagagaga 5760
aagagaataa aaatattttt tatgagacca tcaaagagag aaagagaata aaaatatttt 5820
tgtatgagac catcagaaag aggtttaata tttttgtgat accctgaaag gaaataggaa 5880
tagtgtcata atcgtatcac actattgaga cagaaaaaga agaagtcgcg agaggtaact 5940
ttttgttttg caaaccggaa tatagtgtcc ggtacacttt tttaattcgt ggtgtgcctg 6000
aatcgttcga ttaaccctac tcatccaatt tcagatgaat agagttatcg attcagacac 6060
acgctttgag ttttgttgaa tcgatgagtg aagtatcatc ggttgcacct tcagatgccg 6120
atccgtcgac atacttgacc tcaagttcag atgattcctt gcacatgtct ccgatacgaa 6180
cgctaaactc tagattcttg acacattttg tatcgacgat cgttgaaccg atgatatctt 6240
cgtaactcac tttcttatga gagatgttag acccgagtac tggatgggtc ttgatgtcgc 6300
tgtctttctc ttcttcgcta catctgatgt cgatagacac ctcacagtct ttccatcagc 6360
ggattctgag atggatttaa tctgaggaca tttggtgaat ccaaagttca ttctcagacc 6420
tccaccgatg atggagtaat aagtggtagg aggatctaca tcctcgactg attccacctc 6480
gggatctgga tctgactcgg actctgtaat ttccgttacg gattggcaaa tcttatcatc 6540
ggtcggtgtt tggtcttgct ttgtgacttt gataataaca tcgattccca tatgatgttt 6600
gttttcttct tccgtacacg atgaggatga ttgctgaaga ctggcaggca catgcatgcc 6660
agtacgatat attgtttcat gattgctatt gattgagtac tgttctttat gattctactt 6720
ccttaccgtg caataaatta gaatatattt tctactttta cgagaaatta attattgtat 6780
ttatgggtga aaaacttact ataaaaagcg ggtgggtttg gaattagtga tcagtttatg 6840
tatatcgcaa ctaccgggca tatggctaca ttacccacat gataagagat tgtatcagtt 6900
tcgtagtctt gagtattggt attactatat agtatataga tgtcgacgct agagttactg 6960
tctccgaatg cggcatgata gtatcattct ttgctttcgt taactgtttg gaggaagaat 7020
ctttgttatt gcatttaatc tcgaaattca gagtgcacac ctttctcctg taaagaaacc 7080
tgaagtcgct accttattaa gaagacggga tcgcagtctt tatgattcat agtaatagtt 7140
agttccgacg ttgagatgga ttcgctgaga ccggtagtgg tcgtccgagt acacgatgtg 7200
tcgttaactg gatacaggtt aatttccaca tcgatatagt taaaggtatt tctgggtacg 7260
ggttcgcatt tatctgcgga agagacggtg tgagaatatg ttccgagacc acacggagaa 7320
cagatgacgt ctccggatac tccgtatcct attccacatt ttgtttggga aacacatgcc 7380
ttgcatccat gatcgggaga gcattcacag attctattgt gagtcgtgtt acacgatcgc 7440
gtcgacattg ttgacagaaa cgtgaccttc attcttaccg tcgtccataa atacgttagg 7500
tatgtaccac atactgtcgc gaacgatgcg tccatctcat aatgatttac tttttcataa 7560
ttaaagatgt gaaagaaaac cgaacaatat atttttttag taatgtttat gcgagacata 7620
taaaataaac tccgtgttta tgatgccggt aaatgttttt atcatcttgg acggaatcga 7680
ttttgtaata tgccatggaa acaggacatt atcactccat gataaattat ttaatggagt 7740
cgatcctctc attgttcttt gcgtatctca atctgtggcg tttgcttcgt ttaaataata 7800
tatcaaacat ggagacgcct gatatgtagg cattcttcat tctattaatg tctgctctat 7860
agcgctttag ttccttatga cgaccggcga tatcatactt actttagaag gaaaatcatc 7920
atctaggatt aaggcgtatc tgatacaggc gaataatggt tcaggatata gatagcgtat 7980
atctctatta aatgcgtcaa tcatagtctc tagagtggga tggtaactca gtaataaatc 8040
aactagcttc tctttggtaa ctgcttttct ggatggccgt attgattatc gagcgtgaca 8100
ctcgctccat attccaataa ccgctttgca aattgtatat tattgacatc gaccgcgtaa 8160
tatagtagag ttatcgatca tatctatatc atccatgtac ttgcttagta tatcaaatac 8220
atcttcataa cagtgatacc cgcaattatt aaatctcgat aatatcagac cgtacataca 8280
tagacggcca ttgttagata tgtgatttac agccgcgtgt ccatattttc cacgataaac 8340
cttacgacgt ttacatcgac gagattatta ttaacaaagt tgttgtccgt cgtcttatcc 8400
aacatgcatt gaatgatagg tatacttacc atatcgccgt aatgtaagta gtttatcagt 8460
atggcttgta cgatggattc atcctgttgt ctaaatctct ttagaatgtt atcgatgatg 8520
tagtggttat attctctgga atcgtacgaa gtaatactac gcattacgtc gacaagagta 8580
tgacgtctct caataagaag attaacgatt tccatgtcta cattatatgg ggttactcta 8640
aatcgcttgt ttagataata cgcctctaat atagggctga cgtcgtatac tctacacgtg 8700
tccacatcct ttattaataa tctctatatc tatggttgag caagaccagt agtattggat 8760
ggaaacattg ttatcgatca aacatttaat tacatccttg gatagagatt ctctatgaga 8820
cgatatatag taatgaagag agttcttaca catatcactg ttgtacatac aggtacgaaa 8880
tacgtaaccg gtgctgtaac attctgattt aagaagccat agcaatactt ctggtctcgg 8940
attaggcgtc gttacgtata tatccaccaa tccgagacca ttgattgcat aattcgtatt 9000
cttggacgga cgtatccgtt tatccacaat taggtatttt agcagacgta agtcgaaatc 9060
atttatattc gacttgagtt cgttagagga attcgaatag ctggatatca gtagatgcac 9120
aatctgagat tttacgtatc tatgcttact gtatgctcct agcggagtta atccttcgtt 9180
gtttctacaa agtctctcga ctccgcgaga gagtaacagt cgaacaatct taatgtctgt 9240
atcgcattta ttggagacgt aacaatgtag cgcattgttt cctcgtctat ctatatgttt 9300
tgataagttg tgacacgttt caatttctag ttttattttt ttgtacgtca catcttcatc 9360
cagtagacga catagaatac atgtgcaatc catagctatt ctggtgctaa ttattcctca 9420
taagatgata aaaagtgtag tgagagagca tgaaggagat ttagtattta gcagtgcgga 9480
tatgatccaa gagggtgaga tagtcgttct cgttcagaat ctttcgcagc ataagtagta 9540
tgtcgatata cttatcgttg aagactcttc cagagacgat agctgattga gtacaaagtc 9600
caatgattgc acgaagttct tcggcggttt tcatggagtc atttaatgat ctccacggaa 9660
gtgaatcctt caactcacca ccaaagagct ccgttgcatc agttctgaaa gagatgagaa 9720
gcctgtagag agaccctgcg ctttctctat gggtccatct atgagaaacc cacaggatgt 9780
attcagtcag acaatgtctg acgtcggcca cggtattcag ggagtcctta gtagcgtggc 9840
aatgacaggg tctgaactgg gcacaaggaa aggccattgt aaaggtagac ctgtagccgt 9900
ttatgctaat agagggcttt aatttccatt ttttaatggg ttgtggatga ggaatgagag 9960
tgatatcata ttgagatacg tagttatgta gaggtgtatt tcctatatta tttactttcg 10020
gtttcatatt ttaccaactc tttaataaat ttcttttcac gatgcatctt attaaatgac 10080
gttttctcat aagtggacat atagatgcag aagtattacc tctatcatct acataattag 10140
ggtctgctcc taacttatac agtacgtagt agtagtttat cggttttaaa tcaagtctag 10200
aatatatagt ggattaatat atttttatat tagctaaagc atatcattct caacttcatc 10260
atgagttaaa tatttgtgtc tactagtttg tttatatcac agcattctac aaacagtcta 10320
aacaatagag aagacggaca gactttaacg tataaatgac acatgttatc gatattcgtt 10380
gataaatgat tctaacgaca tctctcgcta gagataaaat ctagtatcgt atcatactcg 10440
catagcatag tttttcataa ttaatacaat atttaaaaga cttattcgga aagtatttta 10500
atacatgtat catcgatgga gatccatatg aggagtcact tgtagttctt cagtagtgct 10560
atcatcgata gtataattat atgttgttgt aattggagta actgttggta gttcttccgt 10620
ggaatcaata attatactaa cagcaatagt ataattatat aaatatgttc cgttgatatc 10680
acatatttta atgaactcat ttctaaagcc gtacatccac atctattagg atctgatatt 10740
ttacacaact gtttaataga gtctacattt atatgttctc tatcggtgag atacaaatac 10800
ctagatagtc gcgttatagc acaaatacga tataaataat aataataatt attcagtaat 10860
gtatataaaa atgcattgtg tatttactcc aatactactg tagttgtaag ggttttttca 10920
aaataatagt tgtgtccacg acatttatat gtattaccta tatattcttc agtaacattt 10980
tcaaagtaca aggtcgcctc ggtgataccg cctctactag ttaaaacaga gtatacatca 11040
aaatcgaatc ctataagcca tccggatgga ttttcccatt caatcagtac atcgtcaatc 11100
aataatgacg ttgacacagc agtgcatgtt atattggcag gttctcctat cgttacgttg 11160
atttttggat ctagtattag tttaaacctg tggtcttgcg acggtataac cgtaagtatt 11220
ttacatcttg attctaacgt cgtcgtaatg aacgtaacag ttgtatcttc cactatcttc 11280
taactctgga ttatgaataa ttaatttctt tcccgtttgt gaatacttaa tatcgtcgat 11340
attaatttcc ttattatctt tataccaagt tatattatta taatgttttg cgtaaagaat 11400
tccacagtat aagtctatgc catacttatc atgagtacct agttcatatg tttttggaat 11460
gcatgaagga ggttttttaa tatgagatct aactataccc tgaacacagt caccattctt 11520
tgtagttacg gtacacaaat acctacggtt actgaattta gatgtatagt tggctatcca 11580
taaatcacca tgtttaacac gtttattaga aacctgtcgc cgtctatttt tttctagcct 11640
ctcccattta accacatagt ctttatatct gtgcgataaa agactgtctt caataggagg 11700
acactttgcg ctgaatggct ctcctagagc ggctatatca ttcattgtgc ctccgaacat 11760
acatgctgga ttcaaccatt tagagtcttt agctggtaga gtatctctca ttttattgaa 11820
gaattctgtg atttcatttt cgatgtctat ggcgtaactg tggaatagca ataacaataa 11880
tgatacgaaa tatatatgta ccatcatttt catcgtcatc ttcattatac ggcactaaaa 11940
tatttatata atatcagttt ttttacacac atcgcatgag aaaaatacaa ctatactttg 12000
gtaggtggat acgatatatt ataaagatcg ttaattgtca gcatgtataa tatttcgtat 12060
ataatttctg aaggtagtaa tgttagtaga caatttttat ctgtgaaaac aggaaatagt 12120
ttatcatata cctttgacac gtacatatct ttataatata cattaaggta tctctcattc 12180
ataatattag aatatatttt aaaggactta tcgttattat atttttttaa ctcgttgagg 12240
tatcttctta actttttcga attatggcat cttaaaactg catatagtgt agtgtctaga 12300
acagtggata gtttgagcaa tattgtttct ttatgacatt tagttattat accatcgtat 12360
ttatcaaacg acatgtattt tgaaaatgcg tatcgatacg ttttagaatt gcgattccta 12420
acgtgtctat tgtaaaatgt gtcatctaaa agtatacagt aggatatgaa tttctcaaat 12480
gattttatat cgaagatagg tagagatgga agttgatcca ccaacgatct tatcacattc 12540
ttatctacaa tatatctaag tatatacagt aaacaattaa tggtatatgg ggaattttct 12600
ttgttatcta taatgagttg tgtagcgtgt ttgttattgt cggttaacgc atttatatct 12660
ccgttgatac gtataagata atctattacg tgtgtattat tatattcaca tgctacgtgt 12720
aataatgtct ttccttcata gcgggtataa atatctatat ttttaaacct attcatcaaa 12780
taagttatcg aatctgccgg acgtgtagaa tatgttttga taaactcgaa tacgattata 12840
cgacgctcat ctatcggcat ttctccaaca tttgtttcat agtgatgtat taacataaca 12900
agaatatcgt gttgtatgta gtcggaagtt atataacata gtataggcgt taatccatga 12960
ttattacata ttttgaaatt cggattaaaa gtcaacagca ttttagtcat atgaatatta 13020
ttacacattt caatactaag atacaaatgg aatggtgtat taccacaatt atctaacgcg 13080
tttagattag ctccatgttg tataaacaaa gaaataattc gcttggcctt ttttatcgtt 13140
atcggttttc tacactcacc tggtttataa acgtgtgcga gacacaaata ataataatgt 13200
aacgctgtat atccgtcttg tttaaagtta ggatcgattc ccttatctaa taaagtggat 13260
actatgttct catcgatatc ctcttccttc aataacatgt aacgagattt tatatactct 13320
agtagtaggt tggaatagtc tctatgcgat aagtggtttt tatctttgtc tatcatatct 13380
atcactacat catgtgaaat attacaacat ctagtaagca atatataaac aggcatacgt 13440
ccgctactgg ttttcatcgt tacatccact ccatgattta ataatatctt cagtacatcg 13500
tttgtaaagt aattattata caaatagcag tgtagtgcag tatacccagt tacaggttgt 13560
cgtctgttta gatcatagtt tttaactaca aaatcaagtt ctgtagaagt atggtctata 13620
ttgcaatgac taaaatatgt atatatttca ttctcttgta ttttatgagt tgattcgcgg 13680
ttgatattta aaacataaat cagacgacga ctcattttta tgatgctttg tggtaaaagt 13740
cctcatataa ttgtttaata ttcattatta tagacgattc ccattaacta atctaacatc 13800
tttgatatac ccgtaaatat gtaaatatga tcctaaaata acacggattg taagatgtct 13860
agaaagttta tgcaggtgta tgaatatgac agagagcaat atctcgatga gttcattgaa 13920
gacagatata acgatagttt tatcactagt ccagaatact atagtgcgga aaaatacatg 13980
tgtagatata ctacactaaa tcacaattgt ataaacgtac gacgatgcgc gttagactcc 14040
aagttattac atgatatcat aaccaattgt aaaatatata acaatataga attagttagg 14100
gcgacaaaat ttgtttatta tctggatctg ataaaatgta attgggtatc taaggtaggt 14160
gattcagttc tatatcccgt tatatttata acacatacaa gcactagaaa tttagataaa 14220
gtctctgtaa aaacatacaa gggcgttaaa gtaaaaaaac ttaatagatg cgcggatcat 14280
gctattgtaa ttaatccatt cgtcaagttt aaactaacgt tgccgaacaa aacaagtcat 14340
gcaaaggtat tggttacatt ttgtaagtta agaacggata taacgcagat agaggcaccg 14400
ctttcgggca atgttttagt ttatacattt cctgacatta ataaaagaat tcctggatat 14460
atacatgtca acatagaagg atgtatcgat ggaatgattt atataaattc ttcaaagttc 14520
gcgtgtgttt taaaactaca tagatcaatg tatcgcattc caccctttcc tatagatatc 14580
tgctcttgtt gttcacaata tattaacgat gacatagaaa ttcccattca tgatttaata 14640
aaggatgtgg caatttttaa aaataaggag acggtatatt atctaaaatt aaataataaa 14700
actatagcta gatttacgta ctttaataat atagataccg caattacaca agaacatgag 14760
tatgtcaaaa tagcactagg tatagtctgc aagttaatga ttaataatat gcatagtatc 14820
gtgggagtta atcatagcaa tacgttcgtc aattgtttgt tggaagataa tgtataaaaa 14880
ttcttataaa ctcaattgac atggaaatgt aacaacatac attcacgtta tactaacagt 14940
aactgttttg ctgatgctag gaatagtaaa cgccatacac gtaaacgttg tagcatcttc 15000
ttccttgaca ggattaatgt ttaaccggga tgtaataaca cgtctcttat cggtcatata 15060
gattttattt gctacactta ttctaccgtc tccgtcccca tcatcttctt cgtaatacat 15120
accattactt atccaaaaga cgtctgcatc cgttgtggga ggtctcaacg atactctaca 15180
cgcaatagtc aaattactac ctattgaagt tacaacacct tctggtaatt gcatagtaga 15240
aggtattatt ttatcccgta cctctaattt tacaattctg gttacgttat atgttttgcc 15300
accgtatata tattctaaaa cacatgtata ataaccagca tcattttttc taacatcttc 15360
tatggtaata attccaggtg tccgttgttt aagtctctta tttctaaggc gtcgatgccc 15420
gctccatata atatctgcgt ttacgttact agcaataaat gcattaatat tgggacatac 15480
catttcgcca gtagatctct catttactat ttgtggatac gagataagat ctatatttga 15540
ttctgagaca gacacgattg tcaaatttaa cgacatcatg tcacagtagg tttcgttcgt 15600
ggtaatgcat atataaatac cagagtctga ttgtgtcggg ttcagaatta gcatattgct 15660
accattatct atcggtataa ttctatcatt atccgctcct cgtttttccc ataaaatatc 15720
taatatatta tatccggatg atagcgtatt tatttgagga catggtaaga ttactggctc 15780
gttttctaac tccatgaatg atgcaaaata ttgccctttg tcgatacatt caggcgcgtt 15840
aaaagtctga acgaatgaag aataaaaaaa tatagaaaga aatataacag gtagtatact 15900
cattttattt tatagtgggt aaaaaaagtt tgtttctggg aaagggagaa gagaagaaag 15960
aagaaaatgt caagatgtac aactcatcaa ttcatacgcc ggaatatgat gttataatcc 16020
acgtaattta gcatctcaag catcacaaac aatgcgttca gactgttact agtgggatgg 16080
tcttcacctc cccagtaagt agcagcatat gcacaaagtc cgatgatggt tttttgctca 16140
tccggtatat aacaacggat gatttctaca caatctttga catcatccca cggcatattt 16200
tttaccagct cgcgaatgtc gatgaacact aattccgcat catctctaaa agagaggagt 16260
gtggaataca attgtccagc gcattgcctg ttgcggtatg cataggacga ccaataaata 16320
tattcagtca aacattgttt gacgtcatca atactggtca gtctgtcaca tccacagtgc 16380
tgtggtgaaa agacgtgggt actaaagttg gccgtcatct taccaattgc aattggaaga 16440
ataacgacct gcactaatac cacatgtttt ttctattttt ttaaaaaaaa tggttaacaa 16500
ttagttgtcg gagagcaata tctaccaacg aaaaaaattt ttccatcaac atgcctaatc 16560
acatagatga acggatgatc tacacagaac tcatttgtaa ttgttgatgc acagtctgac 16620
accagtgcac aagttgctgc agctgcttct gtatactctt cattgacatc tatatacgtt 16680
ttgtggatca tagcgtcgac actcacatct aaattacaca tattgctata atctccagtt 16740
gaaccgaaca cctctgtcag tcctgacttt actagagtat ccaccagatt atacgagcct 16800
gttaccttaa acttgggaat gtgaacatcg ataaacgtag cttccagaga gttacaccat 16860
ttcttaaaat ttgtatctgt tagattttgt tctatggatt ctaatccatc aatcttgtct 16920
ggaagaatga ccatcatact agtatctcca acatatggca gttctatgat tgaaaagtta 16980
ccgaatgatt cttttacaga tgcgtgatta aatagctcgc cgtacataga catcatactt 17040
acgtctacca tttccgttgg tgatacgtaa aagggataat cactggtaaa ttccttttcg 17100
aatggcgtca accattttgc tttaaagtat acggcactaa ttgctaggag acaattgttc 17160
atccaatagt ggattgattt tcccctcagt aaagatatct acacacttgt tgattgcatc 17220
tatagtgcga caatcagtga agtcaacagt ttgaaactta tcgccaattt ttctcaaaaa 17280
ggaatcttta aacacggcag aatatcgccc atatacttta ttcatggatt tgaatgagat 17340
attctgagcg ctaaccttat ccgtgttctc ctccttttct acatattttg atagctgttc 17400
agcagtggat ccattagctc cataatacag tattgtcaat actgacgaga ttgacgctgg 17460
agaaatgaat acattctctc ctttcataga agatatccat agtaatcgat attggtcgtg 17520
tagcgcgata gagatagtct aatattaata ttagatatcc gtaacactac cacactctat 17580
aaaaaaagaa tatttcaatc ttgtataaac agtctacgta gtctgtcata attaggagtt 17640
tgagaataat ctaacgtgta tactaattct atatatctaa ctaattccag aggttcattt 17700
ccttcaggaa aacagtcctc aaagaaagta gctataaatt ttttatattc ttttttttgt 17760
tttattactt ttatactact ttcgtttttc catggaagtt tgccaccgaa ccattctatc 17820
atgcaatatc ccaacatttc taaatctcct cgttttgaaa ctgttgctcc aagatgattg 17880
tctacacaca tataattgat atttcctgaa gttatcatgt cctcgttgta atctatatgt 17940
gagtttccac tcttgtatag tttgttagtt ctagaatagt caattagtga aagacgttta 18000
tttctaatca gtatattcct cggttctatt tttccatggg taaatcctcg agagtgtata 18060
aactctaacg tgtttatcat agtgacgcat gcttcgaaga ctgattcagt atcctttggg 18120
gcgaatactc ttcccaaatt ctttataacg aagaacatat aatcatcggt ttctccaatt 18180
ccatacagat ctggaatagc caaatacttt ataccacgtt cccgtgtcca gttgtcgata 18240
gtcaaaggat ccaatacaga tatatagaat cgtatttcag acaacaatgg cttgtgtgat 18300
ttgtgatcta tcttcattac gtaattgtag aacgaactag tgaaattttt tctgacctta 18360
tagagtattg aattaccaga atataaagta tttccgataa tccatttctt gccatcgtta 18420
tcaaaacaat acttgaagga ttccatgcct gctatgtaat aacgagttgt tttttatgtt 18480
ttgtattgat tataaaaatt agtagatatg atctatattc ctacatcgtg actgatacat 18540
tctataactt ctttatcaat ggaatcacta ttactacttg tattaataac gagtgcttct 18600
ctaatatatt cattgggatg attcttgtta atgtttttat tattttcaat ttctacattt 18660
tttagtattt cttctttctc tctatctatt tcattcatga tatcttctac atttgtaaca 18720
tctgtatctg tatccatgtt agtagtaagc aataaaacta tagggtaata ctgctataca 18780
taaaaactat ttatattttc attttcaatc aataaggaga agttgataat gcctttatag 18840
cggggagacc atcatacttt tccacatatc caatgtcctt actaaataca tagaagacgt 18900
tatttagttt acacaatgat gatatggata tcttataaat agtatatgaa atatcaaacc 18960
acttattcgt tctaggattg taacaagata tctcgttgtt tttctctaaa ttaccatatt 19020
gttcggtaac ggatagacct ccggcaaaat ataacattcc aaatgtcgag tcgatgccgt 19080
gatataccct agccataggc atttctataa tcgaccattg tttgtcttta taaatgtcaa 19140
gtttttccat attgttaacc acgatcgtgc tattagagtt atttatgaat aatccgccag 19200
ttacgaaaat agtattatta acggaaacag cggacacatt gcttttttta taatttatcg 19260
gtgtttcgta tatccctgaa tccattgcct tcacttgcag ttgggagatg aacactgcaa 19320
tcagttcttt cgggatagaa atctttggca attatacaca accctatttt caatccccat 19380
gtttcctctt cagtcttctc acatcgtcta atagacatgg agtagatgat agaggggaac 19440
agaaggacta taatcaggga cctcatcttg aaaatggtta gagcctataa gggcgttaac 19500
cagtgtataa tatgcagttt tatttcgttt ttgtatcatt aataaaaatt agttatgaat 19560
atttagtcaa gttaagcatg ctaagaaaag tggtaacatc atttgatgtg ctaccgagaa 19620
atttagttaa attttctctt gataactctg gccattttgt ccttgaaatt gggaacatct 19680
tctttggatc taattgcgca tgtatataca ctctttttag gtgtcagaca tactttcgaa 19740
ctccatggag taattttttc gagaaaccct tctgtggctc cctgtgctgt cacgcacact 19800
ttttctgttg tgacaaagtc tatgctgcaa cctgtagacg tgcaatcgta gtcatcgata 19860
tcatatggta ctggttcttt agaatctccg aatgtgaagt tagccgtata caacaagtag 19920
atatcacaca tgtcaccgcg tttgtaaata ggaatctttt tgctacctct agtagcatac 19980
ggatggtcgt cgtattcagt taatgttaca gtcggtggtc ccgtacaata ctcttccact 20040
ttataatcgc cttgttcaaa tttaacccag aaagatacga aatcattctt tggaaatgcg 20100
gatatgtcgt atgttccttc tggtacattt gtatacattg tttgccacgt cttgatgcca 20160
taatttttaa gggatatatt gtatagacca tccccagtcc attcaatttt ggaatcaaaa 20220
ttgacggatt caaacttata actagttatt ttagcgtgta tactattaat gaacaaaact 20280
gcgagaatta taatatatct catggtgttg tttgttattt gactactgtc actgaagtga 20340
taatatattt taaattttta aaaatcatat tttgaataat atgtattact atgtccatga 20400
cgatcaaata tataagtaga tccaatttta agagtatgtt ttctcaatag agtatcatcg 20460
ttgatcatac ttatttcata ttttattcct ttatagcatt ttccctctgc agtaatagaa 20520
tattgtcgct catcaccatc atatttgtga atataagggt acttatcata atggcatgat 20580
aatttggtag atattctaca cgtgtgacct ctgcgttgaa tagtagacat atcaataatt 20640
ttatagtcgt ttataacatc tatatctatg gaatcaaatc cgacgtcgca tgactctgta 20700
ataatagaaa ttgtgtactc tttaaaattt aacaccaatt cattattatc atcagtatat 20760
gaattttcta tattatctac tccatctata tagagtttta cactcagttt attaaacgga 20820
gtgtactcat tattcgaata ggatgctaat gcacctatta caaataaaaa cgttattagt 20880
tttttataca tttaaaatct taaaatttta actagtatat tgaaataatt atttatacaa 20940
actaactaga tgcatcaaca gataataaca aacacctcca gcgatcgtgc caatagtagt 21000
tagagatgcg tatccgtaca acttgttaat catatttcta cggatgtata taccatcgtc 21060
gttaaaagcg cttctatatc tctcattagc tagaattata gatacgctat taattatatc 21120
atacattagt tgattgttga tactattttt attgtaatcg aaaaacattt ccatgaatag 21180
tattccggtg ccaacgctta cagaaacatc gcgttcattc atcattatac cattaagtgt 21240
atccatcacc cattgtttaa cacgtgattc atttggcata gtacgtacta tatcatcgaa 21300
cggtataacg gcgcatcttg taaatagcct gtatatatta tgtaatatgg cgttacttct 21360
tccatacaca ataaatctct tgttataata gtgcctgagt aaaaatgctc taacggcatc 21420
gtagatatca acatccactg aagaagacat ctcaattgat tctagctata agtctttaat 21480
cttttgatac ttgtttgtta ttaaattatt aattattaat tattcattat tttaacggat 21540
ttatattcac ggtagcaatt tatggaactt atattggtca ttatttttgt cacaggaaca 21600
aactaatact ataacggaga ttaaaaatat gacgcccata attgttaacg ccactatgat 21660
tatatgataa gttgcttcta acgattctat ttcttgttca tattgtacaa cgtctttcga 21720
gagtttgctc aaatctgtct catcgtcggg accatcatcc actggatcaa atttttcgtt 21780
agatcgtaca catgttggga gtatgggatt ccatttaccg tcgatacatg tggatgatgg 21840
agatcccgtt agtataaaac cacttttaca actaagatgt ataacgccac cgatagaaaa 21900
tgtagatccg gaaattaatc cattagatag agacggtata tcacattttt gttgacatga 21960
tggaataaca ttccaagaat tagctgtaca acttatgtac gaagcaccaa taacctcata 22020
tccaacatca cagttgatag ttatatattc cccaaatgag tatttttctt taactggttg 22080
acacgatccg tgttctaatt gaagaggttg acattccgca ttaggacacg taacagtatc 22140
attccaagaa gtatttccat ttttttcttc gcaacgaaaa tattttgttt cgccgttgca 22200
acttagtgtc atggtggaat tcacttcgta tagcggttta ttatatagtt cagagatgta 22260
atcagaaact gtgcacattt ttttgcatgg attttcgtat ttccatttat ctgtttcgca 22320
gacagcattt ggatccgaag aatgatatcc ctgatcacat gtaaacgtaa ctttctggtt 22380
attattaaac gatgtttcgg tagacgttaa tttagcgtta ttcatagtgg gtacagtaca 22440
tgttgaataa acaacagcag gtagtacgca taacaacgta acaacggaaa tcgttttcat 22500
ttttatttat gagcgttaaa aatagtatac actgtcgagc actaaaagga aacaatgatg 22560
tagtgtgatt ttatatttta atagtgttta taagattttt agatatgtgg acagttcgtt 22620
atcgttatat tttatttcgt taggaaaaca cgaccattta tctccaggat ccagcttctt 22680
ggataataga ctaattaatc tctgacgata tctaatagat gctatgatat tacgtacacg 22740
acttccgtat acgtcgagac tagtgcactt cacgaaagaa ggatgtttac cgtatctcat 22800
taaagtatta atgtctttga tacaaaaaac taattggaat acagttttat tttttatata 22860
acatctcttc atgatatcta tttcatcgaa acatcttaaa atataccatt tatattgctg 22920
tagtgattgt acatctataa gagtatcata atcggtcata cacgcagtat attttataca 22980
cattttcaat aaatctgcat tatgctgttt atgtttagta attgctatca tagactgtat 23040
cataattttc aaagatggtc gtttagacaa tagtacttcc attattattt tgttgttgtt 23100
tgcgactgct tccgaaatac atgtacatcc actagtagta atcgtctcaa aatcaccatt 23160
tctgtttaat agatagacca acgtattata cgcattataa ctgacagcgt cgtaaatagt 23220
acgatagtta atatcaataa ctcctctgga taatagataa tctaaagatt caacagcatc 23280
aaaaattatt aaatgacgaa ctatatgttc attatctatt tctaactctg tgtctgattc 23340
caaatatagt tttataagcg agataccgtg tataattcca caataaaatg gagtagttcc 23400
gaatttatta cgcgcattaa catttgctcc tttagaaatt aataacttta caaaatcata 23460
tttattacga catgagacat aatgaagagg tgtatatcta ttaaagtcta cagagtctat 23520
attagcattc tctaaacata tctcaaacaa ttccacgttg ttgatatcgt gtttacataa 23580
taggataaaa ggtgtgtttc cgtacgtatc ttttttatct acatctgatc cattatttaa 23640
caatatctta attaattcat tgtatgattc tgattctgat tctgatttct tatagtgata 23700
tatacatcta tgtaataatg tacaaccgta actgtcttcg gcatctatat aaaatacata 23760
tatagatatt aactgtatct gatatatagg acaaaaaatg tcttgttcta cgtccatttt 23820
caagcattag tcttatccta ttatctggat gatcattatt aactaattga tatacatatt 23880
cttttatatt aatggatgat gataacctga gattgtgtat aaatgacccg ttcttattaa 23940
tatttaatac tctatctaga aaaaaaatta taatattcgt attagcatca tccatagaat 24000
aaatatgtag aatattttcc ccatattcta aaatatggaa taaatttggc aacctagata 24060
aaaaatcaac tattgtttta tccactttct cgtatgttcg aacgagatta taatcctgta 24120
ttatatggga tgtggaaaaa ttggaaaaca cgcgtgctat ataatgaaga gataaatata 24180
cactccagtc aagtatttcc tttttaaaaa aaatccatat ataatttata ttctgtaaca 24240
tgttatccct tttcaattaa caatgttggt ttataaaaaa ttaaagaagc gaatcaatga 24300
ttaatagatg ttaagaacta taattacgat gtattaatag gtatagttag ttagttaaaa 24360
agagataaca gttactaatt aattgttagt tattgtctat atgatattac aacctattat 24420
ttgttctcta tagttacatt aattaaaatt ttatatgtga cacctattca tctggagaat 24480
acttcttgat accatgattc tggccaatct gcaaacacag cacaacagca tctttccact 24540
ttgatagcgc acacgtatgt cgaggtagcc tcatccccag gtttatatac cttgatgaat 24600
cgacacgtgt acttgatgtc ctctttcttc tcacagaaat acacaacaca aagtcttttg 24660
atatgctttt ctatatcatt ctctatcagt ctgaggtagt cgtaacccac cgtgaatcca 24720
aatactttgt gtgtattatt atcaactcca atgaacaaat atccaccctc tgtgttggca 24780
aaagatgaga gtatacgtgt tcttagttgc ttagctgaaa cagatgtatg tttaacatta 24840
atagatttac cagcctgaag ttctgatcta ttgaagaact cctctaccaa tctctcaatt 24900
gattcagtgt cttccactcc atctggatat tcaaattcct gcatttctgg tctgggactc 24960
catccacctg attccttcag ttcgctaaga tactcatttg aatgcatgat cccatagtct 25020
tcatgtacaa atttgttttt acagtactcg gtataagatt ttgattagac aaatgcgtat 25080
gcacataaca gcattcttta ccattcacaa tatcaaaatg atcatataca acagaccaca 25140
tatggcgtat agattccaaa cgtttcatga gtacatgatt cacactgtct tccagagagg 25200
tggttacctc gatggtgccg ccgatatagt agaagaaaga ttttctgtat cattctcccc 25260
aagtttaact tttaccagat ttgggatcgg aagcaccgca ccctttttga atctcattct 25320
cataatttcc tttccatgta cgtccacagc agtaacttga cagatacaat ctcctatcca 25380
gacgtggtaa tcgatgatag gtcctgaact gagtgcattc atttctttgg ccttctcttt 25440
gatagtagaa taatcatgtt gaacttttcc atatacagcg ttttctgttt caagtacgtg 25500
atgatgaatc cctactcctt ccaaacacaa gtctagactc tcgtacccga ttccgcttag 25560
agcatagtag tttttgtggc gatatgtgat ttccttcttg atcatggatc ggatctccaa 25620
tttgtagata ttcattgcac tcaaacatag gcagcagtgc tccaatatat ctcttgttca 25680
cctgactgta acaccacata tatttgtctt ttacaatgtc atacttgtta ttataaactg 25740
acatcattgg caacagtcga tgatattcca ggaaaggcat gaagattctc gtcgtaccca 25800
ccgagagcgt gtgcgtaaaa catcgccatg atttcggttg tacacacgag atcaataata 25860
aattaagtta ttttttaatt tttatcgaca aaaattttac atcaaccaaa caccacactt 25920
aataatatac accctgcatt aatatgtgcc gaaacttgtc gtaattgggt tcctcaaaat 25980
atgtcaaaga gtttaccatg gtaatatatt gcagcaattc tctaggtgca tattgcaaac 26040
tggtcattaa caaagtcgca gtattgttaa catatttctg ttttgtggca cttactaatg 26100
cacaattctt tgtttcagat atcttagtcc atggcaagat acctcccaac catctaatca 26160
tacaatatcc aagtgtttct agatctccac gtctagatac aacgtatcct ttatgcgaat 26220
ctataggtgt aaattctaga gtaccgttat ccattttatt tggatttctt ataaatggaa 26280
catgttcgcc attagacatg aatttagaaa ccaatccgta atccactaga tataatttat 26340
tcttatctat ttgatccaag actatattac tcgctttaat atctccgtga gaatatcctt 26400
gctcgtgcat aaattgtatg gtatttaaga tttcgattcc gatcaacatc accgaccttt 26460
ttggtaatct attattattg gctctgatca ccgcatctag atctgcacct aatctattaa 26520
ttaccaagaa tcgatattcc acattaatgg atttgtatag accaaatgcc ttgcacgtga 26580
taagacctac gtgctttata ttgtgagatt ttttccattc ttcgataacg gatggtttaa 26640
gtactctagt ataaaatgcc tgttcggtaa ataatgatcc gttagctttg ggctctattt 26700
ttactacata attattgtca ttagtagtat aaatactacc aaatccacct tttcctatta 26760
atggtccaac gacccattga tttttgcaat tgtcagttaa cacaagtcct tgaaagttca 26820
taatgtgtga tctatctgtc aatgaaatat cattttaaat tttaagtttt acgtggtaag 26880
ttttaatatt taactaatac attagacgtt gaaatagcca catataaaaa cgagttatat 26940
tattaattat caagttttaa gtcttaagtc tctaattagt gttaaaatac attctaatac 27000
ggtcctgtag tatctgaatt aacttactat atgctaaatt cacatcatct tcaatgataa 27060
tagtgtcgaa tagaccggct tcacctgcct catccatgtc agtttttgcc aacatcacac 27120
gacgatgaat ctcatcatcc gcttcagtgt ttctacaacg aagcttggtc tcaaccattt 27180
taagagaggt aggtcttata tacaccgagt aaggcattag gtaagtattt ttaaaacttc 27240
taacaccgtc gatgtttaga tccatcacac aaatacgatt attaatagcc gctgtattca 27300
cagcagtttt agaagttccg taaatatttc ctaaaaactc agtatgttcc ggcggctatt 27360
cccttccaga tggcctctct gttaacgtaa tggtaatcga caccttctcg ttccatagga 27420
cgaggaaatc tagtggtatg ggacaccaca aatccaaata tattcccata gtcttttggc 27480
tatggctgtc tttcctaaac cagatggccc gctcaatatg atagatttta ctatcccaga 27540
catttatgta agtcaaaaat ctagactttg ttctctgttt tgtatttacg tgaacgttta 27600
ttatatatat aatatgtaat acagaatatt gccacggccg acaatataat taatgcggta 27660
ataccaaata tttctacaaa gtccttggtt ttataattgc taatagaggt tgtactaccg 27720
cctacagtag ttgatggtac tgtatcatta tcattgtacg tatcataaag atccgcatca 27780
tcggtggttg atttagtagt gacaattcca gatgatgtac ttactgtagt gtatgagaca 27840
gtgtctgtaa ctgtatgatc ttcttcttta tcagtaattg gttccggagt ctcgtctgtt 27900
gtggattctc cagatgatgc acttactgta ttaatgctat cactagtgta tgtgacggtg 27960
tctgtatgat cttctacatt atcagtaatt ggttccggag tcgcgatttc gaataccgac 28020
gagcaattag aattatctat ataatcaggt ttctcagaac tagtttccgg tgaatgtgta 28080
gatccagata gtattatgtc tatagtcgat tcactatctg tatttacaat caactctgtg 28140
gagtattctt cataatctac tttatcagtg tcatttgtag gcgatgtcat aaagaatgca 28200
catacataag taccggcatc tctagcagtc aatgatttaa ttgtgatagt tgtaactaga 28260
tcatcgtatg gagagtcgta agatatttta tccttggtat aattatcaaa atacaagacg 28320
tcgcttttag cagctaaaag aataatggaa ttgggctcct tataccaagc actcataaca 28380
acgtagtcat ttgtattatt tcgattacat gataaagttg catcatcacc tattttttta 28440
gatgtctgag gaaaaggtgt agcgtatact aatgatatta gtaacaaaag tattggtaat 28500
cgtgtcatat tagtataaaa agtgatttat ttttacaaaa ttatgtattt tgttctatca 28560
actacctata aaactttcca aataccagcc accgaaagag caatcttaat catgtcaggt 28620
tcatatttcc ccaacatact agatccaaat tcgcctacat caggtaaatt catcataata 28680
caatgtctgt tcatatcaca cgatgatcca ttgagcatct catctctatc aagaatccta 28740
atctgtggtt caaaataaca gcatctacac tcatcgttaa ttgtagtatt gtctaatata 28800
tttttgctaa tatttgcgta agttctatta tcagctattg catgcatcac agatccatca 28860
acaaccatat ataatataga acaatagtcg gactttatac ttatgtaaaa cttgaaccaa 28920
ttggaactcg gaagctcgtc atgtagacgc tggtgtctag atataataac attattatcg 28980
gttacacttc ttaagagagg tgccgcatcg atagagaaat caaacaggag aataatcaat 29040
gatgcatttc ctttggtaaa aaaggaaaca tccatgggaa gaatggctac tttatatgaa 29100
tttaactcta tacacgcaca cgctttatca gatgagatta atagttcaca aacatctcta 29160
tcctttccta tggatataat aacaggaatg gcatctttag gtttaaaata attatataca 29220
ccagtaggag tcttgtcatc gtcatctatc tttatcaaat tagcaaatct ggatattctt 29280
gatacattct ttttatacag tgaattgcat acatcggata ccgcattatc catatatggc 29340
aaatctgcaa tcactgtatt gtttttagat tgtccgccaa tgtgaacgtt cttgactttt 29400
tcacaacatg gtttaatcat gaaatcattt tttatatgat ttatttcctc gccatgtttt 29460
actaacgcgt ttagacagta tacaataaca ccatccatgg cgaccaccaa ctaaatagtg 29520
ttattttaat ttgtatgtaa ctattaagat ttagtcaagt ttactaaatc gtttagatga 29580
gtagattctt tccacgtttt atcctctctt atcctagtaa atctaggaaa tctaatggat 29640
attccatttg cggtatggga cttggaagat gtaaactctg ctcctgaaat ttcccatatc 29700
tgagattgtt ttggatcctc tactacaaaa tcgggaatat agattttatt aactactaac 29760
cactctggaa tttttttggg atccttgtta attttaatca tctttaattg gtcttgcaac 29820
tcccttaacg tattatcatc gtgtcctgaa cacttggtaa ccgtcttcca tttaccggat 29880
tcatcgtcgt aacaacccat tagaaagact gccatgatac caccctttgc tcctttacca 29940
tagtaagcac ctagtactac taaatcggca gaatctgcca tggaaccctc gttcaaatag 30000
tctcgcttta tttttaacca tcttctcttt cccggttcgt atactccatt aatatctttt 30060
aagaccaatc cttctaattt tctcgttagt gcatcatcca atacgtcagt taactgagac 30120
tcgttactaa tattcgtcaa ctctgagaat actattctat tgggtatttc aaccataaca 30180
tctttgagaa aagatcttcg ttcgtacaat ggaatgtccg tcatatcgaa tccatcaaag 30240
tacaaacagt caaacacgaa caaacacatg ttagagtttt tatattcttt ctttttgtgt 30300
atacctaaac ttccaaacgg gagcggtaca ttatgttcgt ctacaagaac aatttcagaa 30360
tccaatacga tagacgtagc ttttttaaat gctttcggta tgtattcttt gagataatcc 30420
actttatgag agagtactgg tttcatgttt ctactaaaga aggcaaactc gttattattt 30480
ttatgaactt gtactctttc accatcgtat ttgacttccg caaacattcc tgatggaaat 30540
tttttaaacg ccttattgac agaatcacac gattccgcta acatgggatt aattggagtc 30600
ataacagaaa tagatataga gtctagattt tgtttagatg cattttctat tatctctttc 30660
aagttattag attttctaaa cacatcatag gcatgaggac taatagcgtt aagtacgtac 30720
cgagggcccg ctttaatttt tagatcttta tcaataagca tgactacaca ttttaaatca 30780
ttacatgtac aaacggatgc gatatcagtc aataatttta tttgatgcga ttctttagtt 30840
acggatgata acgtagttaa aaaactatcc acttcttcta aagttaaaat gcttttatct 30900
cgtggacgga tttctgtgtt ctctttgaag aaagtcctaa tagtgtctcc tatatatccg 30960
tatcctaaat cttgtagcat atcttcctga gattgtttaa atattatact atataatttt 31020
ataatttgtt tatcgttcat gttataaatt ctatcgtcta atccgggaag caatagctta 31080
atgatcaaat atttatcatc cctatctgtt ataaagtctc taattaattt agatttttct 31140
ttatatcctg atgcgtgata tatatcacag cataattttc taaattcgcg aagcgacgtc 31200
atttaataaa aaaagtattt ttttttaata ttttcacaaa tatcgttcgc ggatattatt 31260
agacaattgt agtatattct cacgtatcac ttgtttaata tctatatccg ctattctgga 31320
accgagtact tcggcatacg tagttttaaa atctggatta ttcattatat attttagagt 31380
aatgatagca tctactatat tttctaggtt atccttataa aagctcataa acttgttaac 31440
tatacaatcg ataatttcat tgtcgtctaa tgaattaatc aattcggtct gatagatctc 31500
ttgatctata atagtatata cagtatctag gaaagagata aatggattat ccaagtttga 31560
tctacacatg ataaaattta caatgtcctt gtttaatata tcattatcta tagtttctat 31620
cttggcaata actaattgag atattgatgc gagttcggta tgcatatcgg acacgtatcc 31680
gagtactgat tccaagttgc cagagtaata tgcttcatcc atttataaaa aatgtaattt 31740
cactattaca tccacagttg ccccactggt ccagtaaccg tgtgtatagc ctctataact 31800
atattcttaa tcaactcctt ctttacatct tcctcgaatt cagaagaaat aatttgccaa 31860
tgaatatctc cttcttcaat cattttttta tattcttgta ataccttttg ttggaatgta 31920
acatcttcat aaatttcctc gccgacgttt ctattaattt ctttgctacc agattccaag 31980
aatataacta agtcgggttt aggcaatcca gattcataac tcttactgag agtcattgac 32040
gcgcctttag cggcggcata cgctactcca gagaatgcgt atctatcaac tattaaagta 32100
attccctgtt ctagttgttc ttgtataaaa gatgcaaact cccatctatt tgcacaaaat 32160
aatagattaa ctatatgatc attataggtt tttttacgag ttagatagtc atctatcatc 32220
tttccagtga cagtggatct ctgaggaaag ttaagatatt ttatcgtgtt tgccggtata 32280
gattccatga tgttcataca ttgtgttgtt tttccagatt tgtccaatcc ttcaaaaacg 32340
attaatgccc cacgagacat ttttgtaaac ctaacatatt ttttacaatt tatgcgtata 32400
ataaaactga aaataaatat atggtcatcc gagacgatta ggtctacctt tatagatcat 32460
gggtaacaaa aatattaaac catctaagga aaatagactg tccatcttgt ccaaggataa 32520
gatggattca tttaagagag gatcctttag agaaaagtcg cgtgcaacca tccaaagatt 32580
ttcatctctt agacgagaac atattaaagt agaccatcct gacaagttcc tggagttaaa 32640
gagagggata tatgaaataa ttcagaaatc gtcgtctata gatgtggaca aacggactaa 32700
gctcatgtcc aacataaaaa cgatgatgat aaatccattc atgatcgagg gtttaatgac 32760
atctttagaa aacttggatc ccgataacaa gatgagctac tcatcggtga tgatattggg 32820
agaattcgac atcatcaata taagcgacaa tgaggcggca ttcgagttca taaacagtct 32880
gttgaaatct cttctcttgg aatactccat tagtaatgac ttgttgtatg cccacataaa 32940
tgcgttggag tatatcataa aaaatacatt taatgttcca gaacggcaac tgattctgag 33000
aggtcaatac ctaactccaa ttttcagtga tttgttaaag tatgcgggtc taaccataaa 33060
gtcaaacata cttatgtgga ataaacagtt tatcaaacca gtatctgacc tctatacatc 33120
tataagactc cttcattgtg ttacagaatc atataaggtg attggaatgg gataaataca 33180
acaattatat tttttatcat atccttcacg ctataaaaaa aattatacat ccgtttccct 33240
gtcggttacg gcagaacatg tagacgaatc atcgtcttca aaataatcaa attcatcctc 33300
tatgctctca gtgtgcccgc gtagatactt acatagattg tatcgtagtc tagatagctc 33360
agatattgga atactatcag atatatgttc ggctaatgcc atacacctat attttttatt 33420
tgtcaattcc caatgtttag atactagact agatagagct actactattc tagaatcaca 33480
acatatcata tcctttacaa agttgacaaa gtcaaatagt cccgaatcgg agtctacttt 33540
aagtttctca ttaaataata gatcaatgtc tccatactgt ttaatagcca aatcatctag 33600
tgtcatcaat gccgaataaa tattgccgga tatagtattt ctgtgtctta tatattctgc 33660
taaaatcagt ttaaaactat acttattata caagtaagtc atactaaccg gcgtattaac 33720
cgcgcaacca atattagttt cctctggaag cacccctctt atctctatgg agtcattatg 33780
tctattatat gaaataaacg tgtcaactac tttgtcccta tcaaattcga cagtgtagtg 33840
tgtatcatta acttcattac gtatgactaa tttattttgc tgagtagaaa agtaaactaa 33900
tgcatttatt gttttagacg cattaactga tatatcaaac gccattctcg ttaattgtaa 33960
gaaaatgtat tatcttttca tttatgtaag aaataccaat aactccacac gctaatctct 34020
gatcattttt tgaaatagat aacgcctttc caataattgt agatatattt acatctgtat 34080
ctaaataaac atatgctaca ccatatctgt ttacaaagat gttaccgata aatatttctg 34140
gactgcctat ggaatcacat ccttgactaa tatctccgta acgatgaatt atcaaactat 34200
acgttccgga ttttaatcca ataactttat cttttccatg gactggttca aagtaaataa 34260
ctcctctgat attatcgtgg tctattatac aaacagccat tatttagtaa aatagaataa 34320
gtagtctgat attatgagtg gcagcaatgg ccgtgtacgc ggttactggt ggtgccggat 34380
ttctaggcag gtatatagta aaactgttaa ttagtgcaga tgatgttcaa gaaatcagag 34440
tcatagatat tgtagaagat ccacaaccaa taacctcgaa agttaaggtt ataaactata 34500
tacaatgtga tataaacgac tttgataagg taagagaagc gctagatggg gtaaatctga 34560
ttattcatac agctgctcta gtggatgtat ttggaaaata caccgataat gaaatcatga 34620
aagtaaacta ttatggaaca cagactatat tggcagcttg tgtggaccta ggaatcaagt 34680
atttgatcta tactagtagc atggaagcaa taggacccaa taaacacggt gatccattca 34740
tcggccatga gcataccctt tatgatatat caccaggaca tgtatacgca aaaagtaaac 34800
gtatggccga gcaactggtt atgaaagcca ataattccgt aatcatgaat ggagcaaaat 34860
tgtatacttg ttgcctaaga cccactggaa tttacggaga aggagacaaa ttgatgaaag 34920
tcttttacga gcaatgtaag caacacggta acattatgta tcgtacagtc gatgataacg 34980
cggtacatag ccgggtatat gtaggaaatg ctgcatggat gcacgtgttg gctgcaaaat 35040
atatccagta tccgggatct gagattaaag gaaatgctta cttttgctac gattactctc 35100
catcgtgttc gtacgatatg tttaatcttc tattgatgaa accattggga atagaacaag 35160
gatctaggat tccaagatgg atgctaaaaa tgtacgcgtg caagaatgat atgaagagaa 35220
ttctatttag aaaaccatca ctactcaaca actatacgtt gaagatatcc aacactacat 35280
ttgaggtgcg taccaacaat gcagaactag atttcaacta ctcccctatc tttaacgtcg 35340
atgtggcatt cgaacgaaca cggaaatggc tagaagaatc agaataagta ttttttttta 35400
aaaaaataat cgagtagggt atcaccacta cacagtaaga gtacagtaaa taaaaagtat 35460
ttatatactt atttttatag atcgtattat gcgctttcta catccatatc atcgtcatct 35520
tcaatatcga taatattttt ttcttctacc ttgacatctt ttgcctcttc gacagcaact 35580
gattgcgcta ataatacatc cttttctttt ttctttttag gtgtgacatc aatagttatc 35640
ttttccaagt tatacgatat ggtaacaaca ttaccaattt ttgaatcaac cgtaattttg 35700
attttatcca tttctaatag catagcataa ttaagagttc atttttattt ttttaaacac 35760
gtatatggtc actatcaatg tagacgctat tagcatacat attgcagtaa aatcatagat 35820
gtcatttata tctttatacc atataataga atcgtatcct attatagtaa tacacgtgga 35880
ccgatgtaga gaataagtct tatgtttaga tgttaatata gtagatgtac tcgttgccaa 35940
atttttaatg agacacgata catttactga gcattgtata tcattatatt ttaagtttcc 36000
tgataatgag acaatcaaac taggattaag tgaaccatat ttgtgtttta aaaaataacc 36060
atgtgtaaaa aagacattat cgtctagttc atctacggtt aacgtgcgat taataaaatt 36120
atatctatat ggaatatgcg gattatattt tatatttact acattataga tcctatcaaa 36180
atacaaatgc ccataacata attccacatc ctctgaatga aatctaaaaa tcgatgaact 36240
gtatgccaat accggcatta ttgacatcgt caatatggat attatccatt tcatcatcat 36300
catttaaaat tgacactaca tatgaatatt actttttatt ttaattacca gttgcacgta 36360
cacgggtagc gtgatctcta cacgttctat acacgtcacg tctggatgtt gttggcttac 36420
ccatcaaaaa tataagtgcg gtatgagttc tcgctataac gatcggagat acaggggcgt 36480
acccagtaag caactccata gcatacgtgt tctcatccat tagagagtta gtatatacaa 36540
tacaatattt ctgaccaata agaggtttta gaatattacg attagtgatg agaggattaa 36600
tcttggcaaa tgttctgtta ggaatagcag ctagaacatt ctttgtagtc ttgtaatcaa 36660
cgatggcggc atcctcgaac ttattatttt ttgagatatc ctcgataatt ttatgccatt 36720
cagccattgt tatttattat atatgttttt taataagaca tctattggaa taaacttgac 36780
attagcattc tattcttact acaaaatata aaatataaaa taaaatatac aatccaatac 36840
tcacataatc caactcactc gaacactatt tttccaatta cgataacaat attgcagaat 36900
gtactcgtta ttatttatta ttttgatgtg tataccattt agttttcaaa cagtgtatga 36960
tgataaatcg gtatgcgatt ctgacaataa agaatatatg ggaatcgaag tttacgtaga 37020
agcaacacta gacgaacacc tcagacaaac aacgtgtgaa tccgaaatcc ataaatatgg 37080
cgcatctgta tcaaacggag gattaaatat ttctgttgat ttattaaact gttttcttaa 37140
ttttcataca gttggtgtat acactaatcg cgataccgta tacgcgaagt ttgctagttt 37200
ggatccatgg actacggaac ctataaattc tatgacccat gacgatctag taaaattaac 37260
agaagaatgt atagtggaca tttatttaaa atgtgaagtg gataaaacaa aggatttcat 37320
gaaaactaat ggcaatagat taaaaccaag agactttaaa actgttcctc cttctgatgt 37380
aggaagcatg atagaactac agtctgacta ttgcgtaaac gatgtgactg catacgtcaa 37440
aatatacgat gagtgcggaa acattaaaca gcattccatt ccaacactaa gagattattt 37500
taccaccaag aatggtcaac cacgtaaaat attaaagaaa aaatttgata attgttaatt 37560
gttattttta taaaaacaag aacggtacgg cgatatttat ttttttctaa aacatctaac 37620
cgaagtagtg gtatgataaa aatgtagttt gggagtattc gttgtactac aaatatattt 37680
ccgtttttta gttccattct tcgtggcatt tttagctata aaattatatg gggttgtctg 37740
gtttaatatt tcggattctc ctacccaata tccgcgtcta atgcttctta aaaaacttaa 37800
ctcgtttgga gtctctatct taattagatc cgaatttgga tttagagctt tgcatgcatt 37860
acgtccttcc tcccaggttt ttcgatcagt agataaatgg atacatttat tattatagct 37920
tatccagtca gtaggacaat cttctctaat atatgcatct tttatcgttt tatctattga 37980
tggtgtatga actaatttac gttctacaac ttttaatagt gtcgctgtaa aaataattcc 38040
aacaattaga ccgcatatta cgcagcaagc ataaccagca taatctgtct tatgtttgtt 38100
cattgtcgtg cggtgtactc gattaagatt acattttaag tatttttttc atttgtggca 38160
ttcctactcc tatccctata tctgctataa ataacggttt cgaagttacc attactgaaa 38220
atagtcgcgc atcgacttgc gagctacgag tttgtttaga ttttgtatcg aacgaatata 38280
tagaattttc ttttatgaag tatattctat attcccgatg ttcattttcc ttctccttta 38340
tatcgaacca tttaacacca gacggtcctt cgaagatagg ttggttagat aaaggcttta 38400
taatatcatc tagtacatta tatttttcta tgacttcaaa cgtggtatgt ggaacaactt 38460
ttccagctgg taaacatata ccaggagctg gagacggcaa ttgctttgta tatccttcca 38520
attttgacgt agaaaaagat tgtttaatgg tattcataga ataggtacat aatgcggact 38580
tggaataagg actatcgaag aatacatata gtatcgtatc attatctgtt tttatagttc 38640
tagaatgaat aatttgtcta taacttcttc cgtcgatatc acattctaat tcgactttga 38700
gaaacgtcga ccatctatga ctagacaatg atgatggacc accttcgtcg tttaggcaca 38760
tttgtgctat atacggaatt ttgacaattc tctttgagcc gatagtatca gtgaaaagaa 38820
tgtaaacttt gtcataagta ccatctttat cgacgaatgc tcctcgtaaa ccatcttttg 38880
gaattacgtt atccgccgtg tataaatcat aaccacatgg tccgtcaaat cttctccatc 38940
gtttaattcc ttcttttgat atgtttatgt cagatagtac acatccgttg tgactgatta 39000
tcgttacttt gctattttga taaggagcgt atcctctacc tctatgtttt gggtcgtctg 39060
aaccgtctat tttccaacat ttgggatttc cgttattggt tccgcatcct ctacttttat 39120
agatgttgtt atataattat tattagttaa accagttttg tttagtttat tatttgaaaa 39180
tgtgtatacc gccccattaa cacccgtgta taatacgtcg tctaataagt aagtagaaat 39240
tatttcttca ctcgtttcaa acttatgcca ttcaataccg ttagcaaaat agaataaaat 39300
aaataacaaa ggtatcattt taaataaata aaaaatgtca agagttagaa tatcgttgat 39360
atacctctat acgttggttg ttataacaac tacaaagacc atagagtata cagcatgtaa 39420
tgataccatc attattccgt gtactataga taatccgaca aagtatatta gatggaaatt 39480
ggataaccat gatattttaa cttataataa aacttccaag acgacaatat taagtaaatg 39540
gcatactagt gctagacttc attcgttatc agatagtgat gtctcattga ttatggaata 39600
taaagatatc ttaccaggta cttatacatg cggggataat actggaataa aatctactgt 39660
gaaattagtt caacttcata ctaattggtt taatgattac caaacaatgt tgatgtttat 39720
ctttacgggc attactttat tcttattatt tctcgagatc acttatacat cgatatccgt 39780
tgtattttct actaatttag gaatcttaca agtatttggt tgtgttattg ccatgataga 39840
gttatgcgga gcatttttgt tttatccgtc aatgtttact ctccggcata ttattggatt 39900
gttgatgatg acgttaccat ctatatttct tataattact aaagtatttt cattttggtt 39960
actgtgtaaa tcatcatgcg ctgtacacct cattatctac tatcaattgg ccggatacat 40020
tttaacggtt ttgggtttgg gattgagttt aaaggaatgt gttgatggta ctctgttatt 40080
atctgggttg ggaactatta tggtgtctga acattttagc ctgttatttc tagtctgctt 40140
tccgtcaacg caaagagact attactaggc atgtgtataa taccacgagc tagtagacgt 40200
ggaagaggaa ccaggaacta ttgttatttt ataataatcc ttgatcttat tcttattctt 40260
tttatcagat tttttgcttc cagttttatc agtagatata caacagagtt tcataaaaag 40320
agaagagaat gaatccattc cgtacagtat ttgataatag cttttttttc gttttatgag 40380
gtatttagag attagagatg attaatgatc cccatactag aaatctatta atccacagaa 40440
atactaatgc tcatatttat attatcaatt ctaccgtaca atatttgttc cttgataggt 40500
acagaattgt acgactgcat ttttttgtat actaagtcga atggatgctc gaggttacaa 40560
acatgtatta aagtatcttg ctcgatgcta gaatttatgt acttggcaga ggacatacct 40620
atacagtttg atggaatagg aaactgctta ttgtcgatgg ttataatgga ataccactta 40680
ttattcgcaa atatctgatc gcatatgtgt ctatccaaat agtcggtaac tcgtctaata 40740
atgaatacag acaacggcga ataattaatc agtgatagat tgtttacata cagtatgtcg 40800
tttgtaaacg atgttaccaa tcgtttgcta ggtatatccg ctttaacaac caatatggtt 40860
gtaggaccct tgctaactac gacacattta ctcagtgggg atatgattac agcatgatca 40920
atgggaaagg ttagaatatt tttgccattg acgtcattat ctatagagat aacacatcct 40980
ccgacagaac taatacagag tataatcgtc gataatctat gaggaataag atggtacatc 41040
tcattgtcat ttacaacttc tacgcttaag atgttggtaa gaagtaccgt ttcatccatc 41100
tgtctattga tctcatcaga tataattttt tgatgttctg tatctatata atatctacag 41160
tcattattag caataaaatt gctaatttta gaaatgccga atacagggaa tatctccatt 41220
tgtacaacgt gacggcagca ataatactaa atattacttc aattttataa aaaggaaact 41280
aatcacacca atgattctat atcatcgtgt tcatgatctt gttcttgctt attaagataa 41340
tcatcgctat gtgctcgtct aaacggatta ttctcactaa atttattgtt gagttctgta 41400
atgaacggat ttgacttgct acaaatactg gttttattat aatttacgaa aggattggat 41460
gaatagttag gattctgttt ggtatcttca ttaagtacta ctgtagtgtt ctgataaata 41520
gtctgttcat tacgatcatt atttattagc agcgtgcttc cagcaacact atcgtaaatg 41580
tgttctgtgc taggcgccat aacattggat tcgttatccc atattaaact accagcgaag 41640
ctatcatcgt ccaatatctc attcctagaa atattatcta cgtcattgtt ttgttccata 41700
gcactacagt gatcctccca gtctgattcg ctatcagttg atttactaga tttgctggaa 41760
ttagaactct ttatcttttt taattttgtc atgataattt tattgtcatt atagacagta 41820
cgtatctttt tcctacaaat atataatata taacatacta atattgttcc cgcaactacg 41880
gtcaccgtga taagaggtac cagcatcatt tctgattgct caatatacgt actactagtt 41940
caatatttat tgttttatta tttactttac aacttttatg ataaaaaatt actattgcat 42000
ttcccatcaa ttatgatcca tccattgccg tctgatatgc agaatgaaga attaaacgac 42060
aatttcattt tcagattatc tgaaggattg atgattagat gataatctcc agctgtatac 42120
aggttatacg cttctatgat tgtagcgtta aaaacatcaa tagatgggaa tagataaaat 42180
ggctgatgtt tatcttctat atctaacatg atatcatagt cgcatgctat gatgatggta 42240
ttttttgatc tgtatgcggc acaaatgcta taaatgttga atctagccaa atcattagaa 42300
tcctgaggaa ctattttacc cttatagcta aagtagcatt tttgcatgtc ggaatatttt 42360
atatcccgta catctatttg gacagttttt atgttaccta taatgtatgg tcctataaag 42420
tccatgattg agatatcgag atcatcatac aatgtatctg ttatagtcaa cacacccatt 42480
ggagtaataa caaacgcggc gtccatggcg gcgtacgtta acgacttatt attaattcat 42540
tttttgttgt cacttgtaga attttttaac acatagtaca gattgagtac ttttacatac 42600
tgttttaacc agttttccag acttgtatat ataacacgct tcagcatccg tcgtactgtt 42660
tagttgttta aaatttgtta atttactaat atctatatct ttatcattat taatatctaa 42720
ccatttatca ttggtctttt ttaaacttac ccaataatct ttataaaaaa tactaaacaa 42780
tactctcaga tgtctagtat ccggtctagg caatctggct cgtaatttac gacactgata 42840
aaccgcatta tctgtagaca ttttaatgtt agtatctaaa taacaatgtt tatcgtattg 42900
tatccatcca ttggcgcaag cactaggcat cagttcttct ttgtaatgca gaaatgttcc 42960
tatgccacta ataatggttg agagtatcat cattatagcg accggcaccg acaacttctt 43020
aaacctactt acagtttgtc tattaagcga tttcatttat taatgtacaa aaataaatat 43080
tagttcattg ttttaacaca aaaatacttt ctaacttctt gtgatacatc agaatcttga 43140
taatcggatg tagtttttgt aattggatta ccatcagatc cccatgtatc cttaacataa 43200
tcaatgagcc aggtagtcaa gacatcggat ttattgggta gtgttgatga ttccgcagtg 43260
caatttgctt tagcatccga gaataactgg tagtctgaat gtaatatata acaagaaccc 43320
tggtaatata aaccattaca gctttctttg tgatcatatt gcgtagtgct agacgcaacc 43380
tttctatgag tagatgatgc agcagcaacg gcaacagcgg cgtcagtaat agcagcctcg 43440
ttagcagaca tgcattgatt taggcgcact atgagaaacg cggacatggt aatcatagat 43500
agtagtgaaa taaccataga tattctaata catagaccaa tcacgcgttt gcgtttattc 43560
tttccctgaa ttttgtctct gtaaacagta gcggagaaca cagatgtctg ctcttcgtcg 43620
ttttctggtg tcatcatgat aataaatgtt atttatgtca cgatgtgcct tctttgttct 43680
cctccctact aacgacctta gttttccata ttttgattta ttatcaaatt aatttagtaa 43740
ctgtaaatat aattatgaat tgtttccaag aaaaacaatt ttcaagagaa aatttattaa 43800
aaatgccgtt tagaatggtt ttaacgggag gatctggatc tggaaaaact atctatttac 43860
tatctctgtt ttctacacta gttaaaaaat ataaacatgt attcttgttt acacccgttt 43920
ataatccaga ttatgatgga tacatttggc caaatcatat taatttcgtt agtagtcagg 43980
aatctctaga atataatctg atacgaacta aaagtaacat agaaaaatgt attgctgtcg 44040
cacaaaatca taaaaaatca gcacactttt tacttatttt tgatgatgta ggcgataaac 44100
tatcaaaatg caatactcta atagaattct taaactttgg aaggcattta aacacgtcta 44160
ttattctact atgccaaact tatagacacg taccaatatt aggacgggct aacattacgc 44220
atttttgtag ttttaacatt tccatctcag acgcggaaaa tatgctacga tcgatgcctg 44280
taaaggggaa acgaaaggat atattaaaca tgttgaatat gatacagaca gctagatcca 44340
ataatcgatt ggctattatt atcgaagact ccgtattttg tgaaggtgaa ttacgtatat 44400
gtaccgatac cgccgataag gacgttatag aacaaaagtt aaacatagat attttagtaa 44460
atcaatattc gcacatgaaa aagaatctaa acgctatatt agaaagtaaa aaaacaaaat 44520
tgtgcaatag cgatcaatca tcatcgtcaa aaaatgtatc ataattataa ttattaactt 44580
ttgtaacaat agtcctattt agagaaagtc tatcgataga cgatcccaat ttgtaaattg 44640
ttccactact tgtcactcca tgatatgata aatccatgta aaatagcatc atctttagat 44700
cattaattgt taccttcccc aatacaacca aatcatcatg atatatacct cctccagaca 44760
agtatttaac aacggtagaa tgctttggct tataaaatac aaatgacatt cccttatgtt 44820
taatcttaat cttttcttta gttattgaat cgttacaatt ataaaatgat gttttttcca 44880
aaaacctaag tgtatttaaa atagatgcca tgttaaaaat gtccgccgtc gactttttgg 44940
aacgattgat aaaagctggt gtttatattt atgttttacg gacaaagtgt gtaattgcag 45000
ctttactagt aaaaaactat tccataaaag acgaataaga tacaaacaca aatgtttata 45060
taatatttaa atggaagacc ttaacgaggc aaacttctca catttattga taaatttatc 45120
taataataaa gatatcgatg cgcaatacgc gtctacatta tccgtggtac atgaattgct 45180
atccgctata aattttaaaa tatttaatat aaacaaaaag tcgaaaaaga attccaaatc 45240
aatcgaacaa catcccgtcg ttcatcatgc agcatccgcg ggaagagaat tcaatcgtcg 45300
ttgaactcga accctcattg gctacattta tcaaacaagg atttaataat ctcgtaaaat 45360
ggcccttgtt aaacattgga atagttttgt ctaatacatc taccgctgtc aatgaggaat 45420
ggctaactgc ggtagagcat attcccacca tgaagatatt ttacaaacat atacataaga 45480
tacttactag agaaatgggg tttttagtct atttgaaaag atcccaatct gaacgcgata 45540
attatataac tttatacgat tttgattatt atattataga taaggataca aattctgtaa 45600
ctatggtaga taaaccgacc gagttaaagg aaactttgtt acatgtattt caagaatatc 45660
gtttaaagag ttctcaaaca atagagctta tagcgtttag ttcaggtacg gtaataaacg 45720
aagacatagt ttcaaaatta acatttttag atgtggaggt atttaataga gaatataata 45780
atgttaaaac tatcatagat ccggattttg tatttagatc tccatttata gttatttctc 45840
ctatgggtaa actaactttc ttcgtagaag tatattcgtg gtttgatttt aaatcgtgtt 45900
tcaaagatat tatagatttc ttagaaggtg ctctaatagc caatattcat aatcacatga 45960
ttaaggtagg taattgtgac gaaacagtat cgtcttataa tccagagtct ggaatgttgt 46020
ttgttaatga cttaatgact atgaacatag tcaacttttt cggatgtaat tctaggttag 46080
aatcatacca tcggttcgat atgacaaaag tagatgttga actatttatt aaagcattgt 46140
ctgatgcgtg taaaaaaatt ttgtcagctt ctaatagatt ataaatgaac tctctatcaa 46200
ttttttttat tgtggtagct acggctgcgg tgtgtttact ttttatccag ggttactcaa 46260
tatatgaaaa ttatggcaat attaaggaat ttaatgctac tcatgcagca ttcgaatatt 46320
caaaatctat aggtggaaca ccggcattag ataggagagt tcaagatgtc aacgacacaa 46380
tttctgatgt aaagcaaaag tggagatgtg tggtttatcc aggaaacggt tttgtatccg 46440
cttccatatt tggatttcag gcagaagttg gacccaataa tactagatcc attagaaaat 46500
ttaacacgat gcaacaatgt atagacttta cattttctga tgttattaac atcaatattt 46560
ataatccatg tgttgtacca aatataaata acgcagagtg tcagtttcta aaatctgtac 46620
tttaaatgga cggaactctt ttccccggag atgacgatct tgcaattcca gcaactgaat 46680
ttttttctac aacggctgct aaaaagccag aggctaaacg cgaagcaatt gttaaagccg 46740
atgaagacga caatgaggaa actctcaaac aacggctaac taatttggaa aaaaagatta 46800
ctaatgtaac aacaaagttt gaacaaatag aaaagtgttg taaacgcaac gatgaagttc 46860
tatttaggtt ggaaaatcac gctgaaactc taagagcggc tatgatatct ctggctaaaa 46920
agattgatgt tcagactgga cggcgtccat atgagtaact taactctttt gttaattaaa 46980
agtatattca aaaaatgagt tatataaatg gcgaacatta taaatttatg gaacggaatt 47040
gtaccaacgg ttcaagatgt taatgttgcg agcattactg cgtttaaatc tatgatagat 47100
gaaacatggg ataaaaaaat cgaagcaaat acatgcatca gtagaaaaca tagaaacatt 47160
attcacgaag ttattaggga ctttatgaaa gcctatccta aaatggatga gaataaaaaa 47220
tctccattag gagccccaat gcaatggcta acacaatatt atattttaaa gaatgaatat 47280
cataagacca tgctagcgta tgataatgga tcattgaata caaaatttaa aacgttaaac 47340
atttacatga ttactaacgt gggtcaatat attttatata tagtattttg tataatatct 47400
ggtaagaatc acgatggtac tccttatata tacgattctg aaataacgag caatgataaa 47460
aatcttatta atgatcgtat caagtatgca tgtaagcaaa tattacacgg tcaattaact 47520
atagctctga gaattagaaa taaattcatg tttataggat cacccatgta tttatggttt 47580
aacgtaaacg gatcacaggt atatcacgac atatatgatc gtaatgccgg ttttcataat 47640
aaagagatag gtagactact atacgcattt atgtactatc tatctatcta taagtggtag 47700
atttttgaat gatttcgcac tattaaagtt tacgtattta ggagaatcct ggacatttag 47760
tttgagtgtt cctgaatata tattatatgg tttaggatat tctgttttcg atactattga 47820
aaaatttagc aatgatgcta tactcgttta tattagaaca aacaatagaa atggatatga 47880
ttatgtagag tttaataaaa aaggaattgc taaggtgaca gaagctaaac ccgataacga 47940
taagcgaatt catgctataa gacgcatgaa ggctgaacgt gaaatcgctc gtaaaaactg 48000
cggaggtaac ccatgcgaac gtgaagaggt tggaatatca actagatgcg gagaaagaaa 48060
aagttaagtt ctacaaaaga gaactagaac gtgatcggta tctttctagt agatatctta 48120
cctcttcttc agatccacat gagaaaccat taccaaatta tacatttcct cgcattgaag 48180
tagctccgtt gatgactgag gataaagaac cagaacctgt agaagtggtg cctccatcgt 48240
ccacagacgt taccgaaccg attagtgatg tgacaccatc ggtggatgtc gaaccagaac 48300
atcccccagc tttctgaata tcagacttca gtatcccaag tagcagttac acctccacca 48360
aaacctaaaa ctccacagat tttcgaatat cagacgtccg attctatagt taacaatcca 48420
cgcccatttt ataattcgga tctcgaattt gatgatattg atatgtatct actaccaaac 48480
tagaatatta caccagaaaa gacggtttga gatcaacttt atctaatggt ttataaaacg 48540
aaggaggcct tcgttcgaaa tctaatttaa cttttacgcc tctggcgttc atttgagtaa 48600
gaaatacttt agatacgtgc gtggtatcaa tttttgttaa gagaggagag agattaagtt 48660
ttgaacatct aagacatgta ttaataccct tgatttgtgc tgctatgtct ccacaatttt 48720
cacaaacata cacatcttga taatcttctt ccgaatcttt caaaacttct gtaatagtat 48780
tggctgcacc atgcgctatt aaacagtctc tctccatttc tccgaacttg atacctcctc 48840
cacgttttcg tccctcattc gcctgtctaa tgagcttcgt ctttttacct ctacatctaa 48900
cggttgcctt gtcctgagtt aaatgcctca gacgcaagta ataaattggt ccaaaaaata 48960
ctttggatgc ataaggctta tccgtttcag gatcatagag aatcttttca caaaagattt 49020
tatccgataa ttcttcatca gacaatttcg gatttgaatg ctcataacat tgtttagcga 49080
attgcatata tgtatcgatg gatgtttcgt tactactagg aaaacagaca ggtcggtttt 49140
ctcccttatt gttgtacggc ttagcagaat atgcggctgt taaaataacc tctatcaaca 49200
tagatatagt ttttctagag aagatggatg tagaattaat aatgacatct ggtgtgatac 49260
cattttcatc gtatggaagt tccgtttcat ccgcgatata cgcgactgtt cccttttgac 49320
tcgttctagt ggtaaatttg tctcctagaa tgggtcttct ttctttcatg gttaatactc 49380
gtaccttaac tttgtcagta agttctactt gtactcgttc aacgcgagat ttgtacatat 49440
cggtatattt ttcggaaaca tcaaagctaa tctgattatc tctagcaaaa tcatcttcaa 49500
gagttcttga tgagatattt cgtgctatag cgtctccgga ttccaagaaa gcattcagtc 49560
taactaatcc attactttct aattttgaat aggcgttaga cctatctctt tctttgttat 49620
taaagttttc caacggaatt tctacttgat gtttcttggc tgtaacaata tcgagaccgc 49680
ctctctgaat aaattgtttt ttgatgataa ttccatcctc ttgattgata cctttgtacg 49740
acattaatgc tatagtaaca tgttggccga agcaattagc cgcaatcttt gaagtttcta 49800
aagccttact aatcacgatt ggcctctctg gatacatcaa atgaattcca ttgtctattt 49860
tatttcgtat atccgagctc agacaagaga tagcttgttt agcttgagca catccaagaa 49920
tagctctggg tccagaattg tgattgattc ccactagtga agatgctaca tatccatctc 49980
taaattcggc aggaaagtca cataaatcgt attgctttct ttcatccttt gacatcattc 50040
taaatttttg aaccgattca catacgttac taaaagtaaa ttgttctata tctaccattt 50100
cgatgacatg cggaaactct ttctgaatgt cagagaatgt catgtcgtct aatctgcttt 50160
ctaactccgg acacacatcc atcatgagct ctccgttatc cacaaccaag aatggtctga 50220
ctaatcttcc cgctccaata ttaatgcgaa tttcattcat gtgatcccta actaaagtaa 50280
tacctacctc caagttaccg aagaatccca tccgttttct acgtctaaag tcagttacaa 50340
aatcacatat catatttgga ttaagagatg cgactagagc attttctata gtgattggaa 50400
atcctgtttc aaagtaactt atatcatctt tataatatga tctgatatac tcacaaattt 50460
tcttttccaa atccaaatac tcagacgtta gtatatttgt aatggaactc aagacagaca 50520
attgagatac taaacctacc tgaggacctc tttccggaac gtccgatgaa caaaagtaaa 50580
gatattgact tggatggtat tttctgacag aaaacatctt tgaaatcttt acttgatccg 50640
gataaaatcc aacactccta ggaatagaaa tattctgcat ccatgaatag tgaggatgtg 50700
ttcgataact accgtctgac tttttgaact ttccactcaa tagactagaa aatgcatgat 50760
taagtccagg agttgttagt acatgaatgt ttaccgcgta agtgcctctg ttcttgtgat 50820
tgttcatgat atcgtttcgg atgttgccta tgtaattctc taattcatca tgtgccaacg 50880
tctcaaaata tttgccgtac gttaggatgc gatgacaaac catcgaatct ctatctggat 50940
atctagaagt gtggtagata cagtagagaa actttcttag taaagatatc atataaaatc 51000
cctttagttg atcgggtgta tagttcatat gaggtaaaaa gttatgcagc atttcatatt 51060
tgaactcgtt gatcgttaac tgagattttt gtttaatgtg ttcagcttcc accaggtcat 51120
taatgtatgt atcaatatta aagtcagtaa tagattcagt aattcgttta gcactatcta 51180
ttatcgcgtt tactagatag acaacttctg gaggcatatc gtatgataaa ctacgtctaa 51240
tgaattctag actcactcta gataaatatt gagacagcaa aacaatattg accgttataa 51300
atgtctttgt cgatgatata ttcaacgctt ccaattgtga tatatccaga gatatcttgt 51360
aatggcgata tcttgtcggt aatacattag gaggggatac ggaggaaaac gagaaagtga 51420
atgagtttgg cttaacaacc ctaaattttg gccatgttgt aatcttttct actagattaa 51480
ttcctacttt ttcgacagat tgtttattga taaagactcc acctatgaca ttaggaacaa 51540
gatacttggc tgtatcaagt ggatttttat tcccgtatcc tattagtagc ggaattttta 51600
tcaaatgaga atcttttccc tcataactac ttattttagt gatggatata ccctctttgg 51660
tcattacctc tttaaagata tttacagtga acgtggctaa tgcgtcataa ctcttacctt 51720
taatactcgc gattataggt gagtaatctg gaggagtcac tttaacattg ctaatttcta 51780
tcataattcg ttctgtatta ttcttaaacg ataagagtgg ccgcttgacg gtcaagattt 51840
catgcaatcg atgcaatata aaattagaat acgatacata ttggaaatgt aacggtctat 51900
aaaaaactcc ggctttagga tcaggcacca aaaacttata tccgagtcgt tgatccattt 51960
ctgaattagt gttttttttt cattttagaa gcaattcttt tagacgatct accgattccg 52020
tttctatatc tatgctattt aaaatagtcg atctttcctt catcatttct atcatattct 52080
cggatttagt cacgtaattt atacaatcct ccagagatcg taatgctaca atattgagct 52140
ttttcccctt aaaaaacttt attttgctat cctcatctaa cagagagggg aaaacggtat 52200
gatccaacga cttaataaat gtcaggttaa acatccctat tttgaaattg ctaactccat 52260
aaacatcgtg ttcagccatt ttcttaatta ggaataattt ctcatctaat gccgtagccg 52320
tgagtacgtg gtttacaaca ttagctactt ctttaaagta ctcgttttgg atgatatcct 52380
cgaacaacag aaataaatta taatagacat agaataattt gtcaatcatg tgtatacacc 52440
ccttttcatc cttgtccttg gatagaataa tgtgcatagt agaatccttg aatattttca 52500
cacatgttct attagtaaaa ataatttcta tagtttgtaa tatagagctt tttccacgcg 52560
ttactataga gtgattaatg atatctactc tcattctctt agaaatggtt ttgatccaag 52620
gttgattaat gttggacctg actgtagtgc tcagatactc atcagacttt ttaccggaaa 52680
agtaatttgc attagtgcct ccatgtacag ttttctttat ttccccataa ttgacctttc 52740
tcttcttatt ggtagtttta atatatgata caagagatga cttttccctc tcaaatatct 52800
ctactgtacg agagctaata ttgctattat ctgagtttat atcgaaataa tgacaacatt 52860
tcacttgtga agaatcgtta aataccggat acgtattctg attaataatg tgtttagtaa 52920
gttggatgtc gcaatacact agtttcttaa tggcatgatg tatatcttta ttatacacta 52980
tattatcaaa catatcctct gaggaaatgc gttctttcat aagaccatat atatctccta 53040
tttcatcgca acttattaga tgaaagttaa aaatagttct ggcatatcta tcttctattt 53100
catgtagaaa ggtaaataga ttatccataa tggcgttact aacgttatta catttttttt 53160
atgtaatttc tagatttaca cctattataa ggtgtataat tagtattcca tttatctaat 53220
acgtatctca tagccaaatt gaaactatcc gctacatcgt ctaatttgcg tctatccgga 53280
acggagtctc gcaatccgaa tgtgtccatc caatcaagaa atgcttcgac cgatctcttt 53340
tttcgatctc tatatgaatt accagacatg acaggcgaga cgcaaataac tttggcagcc 53400
gatgtatgat ataaaaagcc tttaataaaa tagataaatt tgacgtacgg cgaccttcta 53460
ggctgacgtt ctagaagaac tgtagtgtat tcatattgtg acaaatcttt agctatgtgc 53520
ctttcccaat cagaactcca gtctaatttt gatatatcca ataccctaac ggagttatcc 53580
ttgacttcta aaacagttct ggcaggattt tttgcaccta tatcaaaggc gcatattatt 53640
tcactcgaat aatctttttt tgacatcggt gacgatatta aagattgaga cgaactggtt 53700
aaagtttcca tatatagtct caatactcgg gttatttaaa caattagtat tttcaatatt 53760
gaaatttaca accttacaat tttctatttt gaaccgcatg ttattaacta acgttataaa 53820
atccgcttct ggatatccca aacattctct gcttaaatca aatgctgatt ggttttttat 53880
ttttcgtttt acagttccgg atttattaat ctcatcaact aactcatcga cagttgcgca 53940
ctcattaata tctaatttat tagtcaattg ctttaagcct gcactaccaa ccttagagaa 54000
atactttccg ttaacataat agtctcgtcc tatagtgtct ataattcgtt ttattacttc 54060
tcctctagtt tctgttccgt ttccatcata gtcgtataaa atggaaaatg tgtttttctt 54120
ttttactttt acaaatgtat gttctctaac cttggatcgt attaaatggt ctacatcttt 54180
tactaatatc ttttttcctg attttgttat aacgctagga ataaaaatat tatctcctat 54240
acgatctacc ttgatggact caatatacat gaatgagaat ttgaaaaagt ctacaacttg 54300
ccgcctaagt tctcccaact taatatacga tatggatatt tttggaaatt tatcatcgaa 54360
agagcgttct ataatagagt ataactcgtc gtcaaataat gtgtcttcct ctatttcaat 54420
ttctaaatat ttgggtatgt ttaatccgta ctcactagct agtttaaata tattaactgg 54480
agactgatat tttgatatag catcttccat tatccatcta ttgtttaaca aaacaaatct 54540
cagaaagtct attccgttat cgctgtctaa tactttgtta cacgcacttc gcatatcatc 54600
tggtatttga aaagagtttc cagatccaag ttctaccata tttttaccct tcgaataaat 54660
aaagacgctc ccaaaatata ccggtaagaa aatgtaatag cccggatcaa tattaaacgg 54720
tttatttttt acctcatcat aataatcaga tattgacgtc tcgacattag ctaccttttg 54780
cacgcctatt ttccagtaag tagatgttcc ccattctacc aaagaattat acttttctat 54840
agccgcagaa tctgaaaatt tcaaactttt gtacagacta agtaattctt ttaagttagt 54900
taaatcagcg ctagaagtca tgataacttt atttttaatc ctatgttatt tcattcttat 54960
ttttaatatt atagtacctg caatatctga aaaaatgaga cgcgaacgag ccgcatacgt 55020
aaactacaaa cgtctaaaca agaatttcat ttgtgtcgat gatagactgt ttagttataa 55080
ttttacaaca tctggaatta aggcaaaggt ggccgttgat aacaaaaatg ttcccattcc 55140
atgttccaag ataaacgagg tcaataataa taaagatgtc gatacactat attgtgataa 55200
agatagagac gatataccag gttttgcacg atcgtgctat agggcatatt ctgacttatt 55260
ttttactacc taaatggata gcaccaatgt gcgttccgga atgaagagcc gcaaaaagaa 55320
gcccaagact acagttatcg atgacgatga tgattgcatg acgtgttctg cctgtcagtc 55380
taaattggtt aagatttccg acatcacaaa agtatcattg gattatatta atactatgag 55440
gggtaataca ctggcctgcg cagcatgcgg atcgtcgctt aaacttctta acgattttgc 55500
gagttaaata ttctatttaa tacacgttct tctctggagg atgttgtaca tacagaagtg 55560
ggatcggatt cttgatgttt ccgataactt ttttgtttaa atcctagttt atctacagac 55620
agactaataa ttcgttgaac gaaatttcct atcatgtact ttatttcttt gatggatgtg 55680
ctaggaaata catataccgt cctatctaat agttctgttt ctcgacatac cctccctagt 55740
aattgctcta tttgcatatt gttgattact gccgagcaaa tgaacaaaga atccaaacta 55800
ggaatatcta aaccagtacc ggaataaaat aaggtggata cgaatataaa tctatttagt 55860
tccttgattg atttgaccat atctggagta cgtctatttt gggcgtctcc tataaataca 55920
acctctgatc cgaaaaaatc taataatcgt ttgtagaaga ataccatatg ttcacgtagt 55980
ttagtaataa ctaaaatgcg attaatagtt cctgacttga attcttctac cagggtatta 56040
agaataagtt gatttctagg ctcgtctaca gataataact tctcggtata tatatgatat 56100
ttattagatg gtccatctaa tcgttttatc atatgtctaa tattatctgt ggaatatggc 56160
tcaaaaaaac tatctactgc atagatagtt ttttttagat cggataactt ggcaatatta 56220
ataatactgt tacaataaat tctgttagat ggtctaggtg tagcagttaa aaaataacac 56280
atcatcggag gataatacgc taaaaatctt gtaactgctg tattgttcat cagattatac 56340
gtatgtgatt catccaagat gaacaaatca taatgcttat tgatatattt acaaaaggca 56400
tcgtttgtca gatgtctact gactactatt aatacatccg gactttgagt ctttagttcc 56460
tttaatagac tacttactcc atctatggat atcttatgtt ccaatccgac tgcctctacc 56520
tgtgtcttcc attgatgtat taacatttta ttgggtacgc aaatgacggt ttttctaccg 56580
tgtgtagcca taagataaca cgtggtaata gtcttaccaa atccacacgc caagtgaaga 56640
gtgatgtata gaggacgttt tgattcgatc attttcctca tattagaaac tacttcagat 56700
acgactttgc gttgaagagg atataacttt ggcgcatttg ttggattgat actagtaatt 56760
ttattatcgg tcgttgttaa tgtttcaaat actagactcg tctttacgga aggagaggcg 56820
taaaatccct taggtatcag aaacttaaag gatgatcccg gttcaacttc tacgaaatcc 56880
ccgtcttcgt taaaaagact taggggttga ccacaagtca ttttttttag ttcggcataa 56940
agattatact ccatctttaa tagtgacatt ttttaatata taaatgagtt atttaagata 57000
ttacaatatg cttgacgact tctctgcggg tgctggagtg cttgataaag atttatttac 57060
agaggaacag cagcaatcgt ttatgcctaa agatggaggt atgatgcaaa acgattatgg 57120
aggaatgaat gattatttgg gaatcttcaa aaataatgat gttagaacgt tactcggttt 57180
gattttgttc gtcttggctc tatatagccc tcctctaatc tctatattga tgatatttat 57240
ctcatctttt ctattgcctc ttactagctt agttattacc tattgcttag taactcaaat 57300
gtatcgtgga ggtaatggca acactgtggg aatgtctatt gtatgtattg tagctgctgt 57360
aattattatg gcaatcaatg tatttacgaa ttcacagata tttaatatta tttcttacat 57420
tattttgttt attctgttct ttgcatatgt gatgaacatc gaaagacagg actatagaaa 57480
aagtataaat gtaaccattc ctgaacagta tacctgcaac aaaccttata ctgcgggaaa 57540
taaggtaaat gttgatatac caacatttaa cagtttaaat actgacgatt attaaaaatg 57600
ggggcagccg ttactcttaa tagaatcaaa atagcaccag gaatagcaga tatacgagac 57660
aaatatatgg aattaggttt taattatcct gaatataata gagctgttaa gtttgcagaa 57720
gaaagttata cgtactatta tgaaacatct ccgggagaaa ttaaacccaa gttttgtttg 57780
atagatggta tgtcgataga tcattgtagt agttttatag ttcctgaatt tgctaaacaa 57840
tatgtattaa ttcatggaga accatgtagt tctttcaaat ttcgtcctgg atcattaatc 57900
tattatcaga acgaggtaac tcctgaatat attaaggatt taaaacacgc tactgattat 57960
atagcatccg ggcaacgatg tcattttata aaaaaggatt atctcctggg cgatagtgat 58020
agcgtggcaa aatgttgttc taagacaaat accaaacact gtccaaaaat atttaataat 58080
aattacaaga cagaacattg tgatgatttc atgactggat tttgtagaaa cgatcctgga 58140
aaccccaatt gtttagaatg gttacgtgca aaacggaaac cggctatgtc tacttattcc 58200
gatatttgct ccaaacatat ggatgcgaga tattgttctg agtttattag aattattcgt 58260
cctgattatt ttacttttgg ggatacggca ttatacgtct tttgtaacga tcataaagga 58320
aatagaaatt gttggtgcgc gaattatcca aaatctaatt ccggagataa atatttagga 58380
cctagggtat gttggttaca tgagtgcacc gacgaatcta gagatagaaa atggttatat 58440
tataatcaag atgttcaaag aactagatgt aaatacgttg ggtgcacgat taacgttaac 58500
tctttagcgt taaaaaattc ccaagcggaa cttacgtcta attgtactag aactacgtcc 58560
gccgttggtg acgtacatcc aggagaacct gtagtaaaag ataaaataaa actgcctacc 58620
tggttgggcg cggccataac attggttgta atatctgtta ttttctattt tatatctatt 58680
tattcgcgta ctaaaattaa aacaaatgat ataaatgttc gtagacgata attcgttgat 58740
aatttattct acatggccca gtacattgtc cgattcatcg ggtagagtca tcgttatgcc 58800
agataataga tcattcacgt ttaaggaagg gtttaaatta gatgaatcga taaaatctat 58860
attgttggta aacccgtcgt ctatagatct attaaagatt agagtatata aacatcgcat 58920
aaaatggatg ggtgatatat tcgtattatt tgagcaagaa aatatcccac caccttttcg 58980
tctagtaaat gataagtaat tacgagccgt tgctgctgtt agttataaca tgctgtgtac 59040
tactatttaa ttttaccata tcttcgaaaa caaaaataga tattattttt gcagtacaaa 59100
ctattgtttt tatatggttt atattccact ttgttcattc ggcgatttaa aatttttatt 59160
agttaaatgg atatgatgct tatgattgga aattattttt ccggcgtgct aatcgctgga 59220
atcattcttt tgattctttc gtgtatcttc gcctttattg actttagtaa gtctaccagt 59280
cccactcgta catggaaagt attgagtatt atggcgttta tacttggtat tattatcaca 59340
gtcggaatgc taatttattc tatgtgggga aagcactgcg cgccccacag agttagcgga 59400
gtcattcata ccaatcatag cgatatttcc atgaactaaa ttattatcgt ccatatatct 59460
cgacattgtt gaatcattat tactacttat ttagtgaaaa gatgatatat tgcatacttg 59520
atcaatagtg aagttattgt caataaatga ttggtattct tttgttgatc ggtatttgcg 59580
tagcagttac cgtcgccatc ctatatgcgc tgtataataa gatcaagaac ccacaaaatc 59640
caaatccaag tccgaattta aattcgcctc ctccagaacc aaaaaatacc aagtttgtaa 59700
ataatctgga aaaggatcat attagttcat tgtataatct agttaaatct tctgcataaa 59760
taaaaatatt tttagcttct aaatggcgga taaaaaaaat ttagccgtta gaagcagtta 59820
cgatgattat atcgaaacag ttaataagat tacaccacag cttaaaaatc tactagcgca 59880
aatcggtgga gatgcagccg tcaaaggagg caacaataat cttaattctc aaacagatgt 59940
gactgccggc gcatgtgata caaaatgtat tacatgtaaa ccaaaatcaa aatcctcgtc 60000
ttcttctaca tcaacatcca agggctccaa aaatacttct ggtgctccta gacgtagaac 60060
aacagttact actacatcgt acaatgcgat ggatggtcag attgtccaag ctgttactaa 60120
tgctggtaaa atagtttatg gtaccgtcag agacggccaa ttagaagttc gtggaatggt 60180
cggagagatc aatcacgatc ttctaggtat cgactcagtt aatgctggga aaaagaaacc 60240
atctaaaaag atgcctacta ataaaaagat taatatgtcg tccggtatga gacgacagga 60300
acagattaat ccagacgatt gttgtctgga tatgggaatg tattaaatta aataatttta 60360
attcgtttaa cgaatatctt gagtataaac aatacaatat taagaaccgg actgttaccg 60420
attcctagtt ttgatgctac agcatcgcag tcatctccga tctccacatc gataatttct 60480
gaagtgatat cggtactcag tcctttgatt tcttcaaatc cgagttttag caatgcttgc 60540
tcgataatat ttatgacaat aataatcacg ttagtgacga taatttttcg ttttctactt 60600
tcgtgatatt gtttaatcgt tttatagatt ctatctattt catcctctga acaaacatcc 60660
aagtcctcga ctgataaggg agacccgttc aaatttgcat aattaacgat ggctatacgc 60720
ttattaactt ctgttgattt atacggagtc ttaccaaata gtattctggt aaaattatct 60780
aaattatcag atgattttgt attagatcgt tgatttctaa gattaaccac ttcatcctcc 60840
aacatctgaa ttcttttatc tttatcataa actactccca aggatggtgt ttgtgttggc 60900
atagaagata tcatagatct agattccatt attgcttcag catttttcga tactgtaata 60960
ttaaatcttt tttctcgttg ttgacgttgt tccgatatca ataactctct acgaaacgta 61020
ggaggtggag tatcgtcaat aatagtagta tgaggttgta ctcgttcgat aatagtggcc 61080
ggagaatcat catctggaaa tagatcttca ttaagttgtg aggatatttc ctctggagat 61140
gacactacgg cggcattatg ttctctatct accacgttgt taacgtgagt tagaatagac 61200
atttgacgaa gagttatttc ataatttttg ttagatggat aattatcttc tgaaaactct 61260
gtaattaaat cgttttgtat atccgtcact ggtacggtcg tcatttaata ctaaataaat 61320
gatgcctatt aagtcaatag ttactcttga tcaattagag gactctgaat atttatttcg 61380
tatagtttct accgttcttc cgcatctatg tctagattac aaagtatgtg accaacttaa 61440
aacaaccttc gttcatccgt tcgatatatt gcttaataac tcattaggat ccgtaactaa 61500
acaagatgag cttcaggctg ctatatccaa attgggcatt aattatttaa ttgataccac 61560
gtcacgtgaa ttaaaactgt ttaatgttac acttaacgct ggaaatatag atattattaa 61620
taccccaatt aacattagtt cggaaactaa tcctatcatt aatactcaca gcttttacga 61680
tcttccacct ttcactcaac accttcttaa tattagattg acggatacag aatacagagc 61740
tagatttatc ggtggttata ttaaaccaga tggctccgac tcaatggatg ttctagcaga 61800
aaagaaatat ccagatctta actttgataa cacttatttg tttaacatcc tctataagga 61860
tgttattaat gcaccaataa aagaattcaa ggcaaaaatt gttaacggtg tattaagcag 61920
acaagatttt gataatctta taggtgttag acaatatata acagcacaag atcgaccccg 61980
ctttgacaac gcttataaca tcgcagatgc tgctagacat tatggagtta atcttaatac 62040
attgccatta ccaaacgtag atctcactac tatgccaaca tataaacatc tcatcatgtt 62100
tgaacagtac ttcatttata catatgacag agtggatatt tattacaatg gtaacaaaat 62160
gctcttagat gatgagatta tgaacttttg tatttctatg cgatatcaat ctcttattcc 62220
tagactggta gaattctttc cagatatacc agtaaacaat aacatcgtac ttcatactcg 62280
cgatcctcaa aatgctgcag tgaatgtaac cgtggcgctt ccaaacgtgc aatttgtaga 62340
cataggtaga aaccacaaat tctttattaa tttctttaac ctgttggcga aggaacaaag 62400
atctacggct atcaaagtta ccaaatccat gttttgggac ggtatggatt acgaggaata 62460
caagtctaaa aaccttcagg acatgatgtt tataaattct acctgttatg tattcggtct 62520
ttataatcac aataatacta cttattgctc tatcctttct gatattatct ccgcagagaa 62580
aacacctatt agagtttgtt tgttacctag agtagtcgga ggtaagactg ttactaatct 62640
tatttcagaa actttgaaga gtatttcatc tatgactata cgagagtttc ccaggaaaga 62700
taaatctatc atgcatatag gactttctga gacaggattc atgagattct tccaactact 62760
caggctcatg gctgataaac ctcatgaaac ggctattaaa gaggttgtta tggcttatgt 62820
gggtataaag ttgggtgaca aaggtagtcc gtactatatt agaaaggagt cataccaaga 62880
ctttatctat ctgctatttg catcaatggg ctttaaggtg actactagaa gatccattat 62940
gggaagtaat aatatctcta tcatcagtat tagaccaaga gtaactaaac aatacatcgt 63000
cactacattg atgaaaacta gttgtagtaa aaacgaggca gaaaaattaa ttacttcagc 63060
gtttgatctt ctcaatttca tggtatcagt tagtgacttt agagattatc agagttacag 63120
acagtataga aactattgtc ctagatattt ctatgcagga tctcccgaag gagaggaaac 63180
cattatctgt gactcggaac cgataagtat cttggataga attgatactc gtggtatctt 63240
ttctgcgtat actattaatg aaatgatgga cactgatatc ttttctccag agaataaggc 63300
atttaagaat aatctgagta gatttatcga gagtggaaat attacaggag aagatatttt 63360
ctgcgcaatg ccatacaaca tcttagatag gattattaca aatgctggta cgtgtaccgt 63420
atccataggt gatatgttgg ataacattac aacccagtca gactgtaata tgactaacga 63480
aatcacagat atgataaacg cctcattgaa gaatacaatt tctaaagata ataatatgct 63540
agtcagccaa gcattggact ctgtagctaa tcgttctaaa caaacgattg gagacttgag 63600
gcaatcatcg tgtaaaatgg cattgttgtt taaaaatctt gctacatcca tctacacaat 63660
agaacgtatt ttcaatgcta aagtaggcga tgatgttaag gcatcgatgt tggagaagta 63720
taaagtattc acagatattt ccatgtcatt gtataaagac ttgatagcta tggagaatct 63780
caaagcgatg ctatacatta ttcgacgaag cggatgcaga atagacgatg cacaaattac 63840
tactgacgat ctagtcaagt cttactcatt gatccgtcct aaaattctaa gtatgataaa 63900
ctattataat gaaatgagta gaggatactt tgaacacatg aaaaaaaatc taaatatgac 63960
agatggtgac tctgtctctt ttgatgatga ataaatgtca tgttatacag ctatattaaa 64020
atctgtagga ggactggcgc tatttcaagt agccaatggc gccatagatt tatgtagaca 64080
tttctttatg tatttttgtg aacaaaagct acgaccaaat tcattttggt tcgtcgttgt 64140
tagagccatt gcaagcatga taatgtattt agtattaggc atagcattgc tgtatatttc 64200
tgaacaagat aacaagaaga atactaataa tgataaacga aatgagtcgt ctataaattc 64260
taactccagt cctaagtaaa atattttagt agcgtatcct tatcgcgttt gcttatcttg 64320
aatgaactgg tcttttttaa ctcgttgatg aattttgtaa ataattcatt tctatcttta 64380
agtaatttta ttggaagttt tgattccaca gccaatgaac atgtaaattg tgaatcattt 64440
tctatgacca gttgtaactt ttcatacact gtaatcaata cagtatttac catttgatta 64500
attacagcgc ccgtactagc actccattga acaagatcag attttagatt aattagataa 64560
aatctatatg ttatttcaga taatgaactt accagataac tctcttgacg gatattaaaa 64620
atgccagatg aatgttcatc tcgaatagcc agagctactg tagagtaata cggttcaaaa 64680
tcataaacgt gatagtttcc aaactggtat ccgagttttt cttggatgat ggatacttga 64740
gagttgaatt taaatttctt actatgttca ttatatagtt ctggaaagaa tgcttcaatc 64800
attgtttctg taagtttgca tccttttgat gcgatagtta aaagtgctat ataaggcgcc 64860
acgataatgt tatttttttt ggtaatagtg acgtctttta cagcgtcaac gcatctgaac 64920
aaataatgaa tctctctatg atctaaaaaa taaagtcttt gtaaaaagaa tcgtagtgct 64980
agttttctct catcatcctt agatctatgg gcgaatagac gtctactacg ggatataaaa 65040
ccgctatttt cctttatcat aggtgttgtt tgatctatat ttacctcccc tagttctacg 65100
gaggcctcca aattaagatc tggtactggt tcgaacattg taagacttac atcatcggta 65160
gtagattttc actttacccc acgatataaa tatgcgatat atagtaagtc cgcaattggt 65220
attacaggtg ggtaaggggc aggaggtaga acgagcccta tatctcactc catatgatta 65280
catagatgag aagtcgccca tatattattt tttacgaagt catttaaata tacaacagcc 65340
ggaaatagtt aagagacata ttctattgac gcttcgaatg actcaattaa agggatattt 65400
aggaaatttg ttagatatta aggacgatat tattatctat tctcataaga ataatttgga 65460
atatagttac gttgataata ctatttttaa tcccttcgta tatactcaga aaaaaacact 65520
actaaaaaac gatagctttt tatacaatgt atatcctgga gcgtgtgact ttttggttat 65580
ctgggtggcc agagcgtgcg atacatctat tccggaattt ggatcgtatg aagatgtaga 65640
taataatatt atcaagtttg aaacaatgtt gatggaagta tttccacaac tagatttgga 65700
cattactgta gaatcaaagt ttaacaatat atttcgtacc aatctaaaac taactgggtt 65760
aaaaaagatc attcagcgag ttcaagactt ggacattaat tataagtcgt tgttatctag 65820
atacgatgaa cactttatta atatgaccgg taatcatttt attctaaacg atgaacagtt 65880
aaatctctcc atttgggact tggatggtac attagcgtta tctagcgacg gcgataccgt 65940
gatgattaat aacgtaaaac tatttacaga tcttgtgtcc gatatagata cacaaatgga 66000
acgcatcaag ggagatataa cgtataaggt acatttggcg actcctatca attctagaat 66060
aaaattggat atcgagacta gcttcatttt tatagagacg gcgactaata atattttact 66120
atcctcggat aaaaaaatat ctatcatttt ggccaaaaac catatatcta ttaaagtgaa 66180
aaaccatatt cctaacatag aaaaatattt tacattttta gttattgcca ttaatgccat 66240
gtttaatagc gttcaaaagt ctgctgattt taccaaagtg gaaactgttt actggtctag 66300
gatatgccaa aatacaaaga ataagaatag aaaacccatc attattaatt atctagatcc 66360
tggaatgaaa aaaatcagta acaactttta cagatccgat gagaaagaag tctttattaa 66420
tgataacggc ataatgttta catgcatgga tcctttgggg aaatataata aggtgggatt 66480
tcttaatata tttcatgata tgtggaaata ttgtatccct tgttgttttc tacatgatca 66540
atctcatcga agtacatttt catcgtgtgt tcatcaaatc gacgttgaga aaaagatagt 66600
aagtccgtat atccttaatt ttggtaaagt tgtaacagaa tccaaaatgt catttctccc 66660
tattatcttt gacgccttct taaatgatgg aatgactgct aatatggaac aagataataa 66720
acgactaaag gaaactagtg gatatcatat agttagatgt tgtgctggtg atgatatagt 66780
tcgtttacga actacatctg atattattca gtttgtaaac gaggataaaa atattcttat 66840
agttaacgac atggtatatt ttccaatgaa cgcgtccgat ataggaaaga aaatacatat 66900
actcattcaa gaaatagttc atgaggtaat gatagtaaaa aagaaagagt ctagtgataa 66960
aatcgatttt ttcccaccca actataagtt attgaaggat ctatttccaa aacaaactat 67020
tcaaactcct attcaatctg acgcgggaat ggtgttaaca accgatggat tctacataga 67080
tggaaaactt tttaacgaag atctgtcgtc taaatacgtt acatttacaa aaaatgttat 67140
tgcgtctgat gccgtagcta aatatttttc tcctttgttt aaatacgtta tttcagaagc 67200
taaagataga tttatcaaaa cgtggatgat taatattatg atacatatga acgtagatcc 67260
taataatata ataccgacgt tagaaaaata ctatcccaac tctggaagag cacaaataaa 67320
ttaaacaact aaatctgtaa ataaataatg gacaaactta gagttctata cgatgagttc 67380
gtcaccatta gtaaagataa tcttgaacgc gagactggtc ttagcgcatc agatgttgat 67440
atggattttg atttaaacat ttttatgacg ttggttccag tcttggagaa aaaggtatgc 67500
gctattacac caactataga agatgataaa atcgtaacta tgatgaaata ttgtagttat 67560
cagagttttt cattctggtt ccttaaatct ggtgccgttg tgaaatcggt atataataaa 67620
ctagatgatg tggaaaagga aaagtttgca gccacattta gagacatgtt gcttaatgta 67680
caaactctaa tttctcttaa ctctatgtat actagattgc gtcaagatac cgaagatatt 67740
gtatccgatt ccaaaaaaat aatggagatt gtttcccatt tgagagcgtc gactacagag 67800
aacgcggcgt atcaagttct ccaacaaaac aatagtttta tcatatctac actaaataaa 67860
atcttatctg atgaaaacta tctcttgaaa attattgcag tattcgactc taaactaatt 67920
tctgaaaaag agacattgaa tgaatacaaa caattgtaca ccatttcttc tgaaagtttg 67980
gtatatggaa tcagatgcgt tagtaatctg gatatatcat ctgttcaact gagtaacaat 68040
aaatacgttc tctttgttaa gaaaatgcta cctaaaatca tactgtttca gaataacgac 68100
atcaatgcac aacaattcgc taatgttatt tctaaaattt atacgttgat ttatagacaa 68160
ttgacgtcga atgtcgatgt tggatgtcta ttgacagata cgatagaatc tgccaaaact 68220
aaaatatctg tagaaaaaat taaacagacg ggtatcaata atgttcaaag tcttatcaaa 68280
ttcatatctg ataacaagaa agaatataag acaataatct ctgaagaata tctatcgaag 68340
gaagatagaa tcattactat tttgcaagat atcgttaatg aacacgatat aaagtacgac 68400
aataaattgc tgaacatgcg agacttgatt gtgacattta gagaacgata ttcgtataaa 68460
ttctaatatc gtttttgaag tttccaaatt ttgataatat agtctagatg gaattttaga 68520
ccatctttgt caaaatcgtt taccgacaac aattctccgt tcttttcgat gactatagga 68580
caagaaccct cctctatctc ttgtataatt acatgtagca tattttgctt catcgtttcg 68640
tcagaaaatt ctgaaactag aggtaatctt cctcgttgaa gcaagttata gctttctgct 68700
attatacccg ctatttcaaa tagacttata cgtctagtat agcgtcgttt aagagcagat 68760
atatgattcc ctatatgttt aagattggaa tgcgcatctt ctatatgagt ggatgctgat 68820
tctacaatct tataagaaga tttgggatct atatcactag tttctagtga ctctccatct 68880
tcttcctctt cttcatcatc ctcgtattcg gtgagatcat cggattcata gtcgataata 68940
tcgtctgtgt ctgccattta atcttataat cgcaattcaa ttttaaagcc ttaaatggac 69000
ttctttaaca agttctcaca ggggctggca gaatcctcta caccaaagtc gtcaatctat 69060
tattctgaag aaaaggatcc ggatacgaaa aaggatgaag cgattgaaat aggactaaag 69120
tctcaagagt cgtattatca aagacagttg cgagaacaac tagctagaga taatatgatg 69180
gccgccagca gacagcctat ccaaccgcta caaccaacta ttcatataac tccacagccg 69240
gttccaacag ctacaccggc tcctacacca aaaccacgac aacaaactaa tacatcatct 69300
gatatgtcta atctttttga ttggctgtct gcagatactg atgcgccggc gagttcactc 69360
cttccagcgt tgacgccgag caatgctgtt caggatatta tctctaaatt taataaagat 69420
caaaagacga cgacgccgcc atctacccaa ccttctcaga cgttaccaac aactacatgt 69480
acacaacaat cggatggaag tatttcttgt actactccaa cggttacacc tcctcaacct 69540
cctattgtgg ccactgtatg tactcctaca cctactggtg gtacagtatg tacaacagca 69600
caacaaaatc caaatccagg agcagcatct caacaaaatc tagacgatat ggcccttaag 69660
gatctcatgt cgaatgttga aagagatatg caccaacttc aggccgaaac aaacgatctg 69720
gtgacgaacg tatatgatgc aagggagtat acgcgtaggg caatagatca aattctacaa 69780
ttagtcaaag gttttgaacg attccaaaag taataagatt gaatattaaa atcacgcttt 69840
cgagtaaaaa ctacgaatat aaataatgga agccgtggtc aatagcgatg tttttttaac 69900
atctaacgca ggactaaaat ctagttatac taatcaaact ctttctttgg tagatgaaga 69960
tcatattcac acttctgata aatctttgtc ttgtagtgta tgcaattcat tatccaaaat 70020
tgtagacgat gactttatat ccgcaggggc tagaaatcaa cgtaccaaac ctaaacgtgc 70080
aggaaataat caatctcaac agcctatcaa aaaggattgt atggtttcca tcgacgaagt 70140
agcatccacg catgattgga gtacgagatt gagaaatgat gggaatgcaa ttgctaaata 70200
tctaactact aacaagtatg acacatctaa ctttactatt caggatatgc ttaacattat 70260
gaataaacta aatattgtca gaacaaatag aaacgagcta tttcaactcc ttacccatgt 70320
aaagagcaca ttgaacaatg ctagtgtttc tgtgaaatgt actcatcctt tagtacttat 70380
tcattctcga gctagtccta gaatcggtga ccaactcaaa gagttagata aaatatactc 70440
tccatctaat catcatattc ttctgtcgac tacacgattc caatccatgc attttaccga 70500
tatgtctagt tcacaagatt tgtcttttat ttatagaaaa ccagaaacta attactatat 70560
tcatcctatt ctgatggcac tattcggtat taaacttcct gcgctcgaga acgcgtatgt 70620
acatggagac acctatagcc taatccagca actttatgaa tttagaaaag taaagtctta 70680
taattatatg ttgttggtta atcgtcttac ggaggataat ccgatagtga ttacaggtgt 70740
atcagatcta atttccacag agattcagag agcaaacatg cataccatga ttagaaaagc 70800
aattatgaac attagaatgg gaatttttta ttgtaacgat gatgatgcgg tagatcccca 70860
tctaatgaag attattcata ctggatgctc tcaagttatg acagatgaag aacagatatt 70920
ggcttctatt ttgtctatag ttggatttag acctacgttg gtttctgtgg ctagacctat 70980
aaacggcatc agttacgata tgaaacttca ggcggcacca tacatagttg ttaatcctat 71040
gaagatgatc acaacatccg acagtccgat ttctatcaat tccaaggata tttattctat 71100
ggcattcgat ggcaatagtg gaagagtggt gttcgctcct cctaacatag gatatggaag 71160
atgttctgga gttacacaca ttgatccatt gggaactaat gtgatgggta gtgctgttca 71220
ttcccctgtt atcgttaatg gagcaatgat gttttatgta gaacgacgtc agaataagaa 71280
tatgtttggt ggagaatgtt acaccggctt tagatctcta atagatgata ctccgattga 71340
cgtatcacca gaaatcatgc taaacggtat catgtatagg ttaaagtccg cagtttgtta 71400
caaactcgga gaccaattct ttgattgtgg atcgtctgat atcttcttga agggacatta 71460
tacgattcta tttacagaaa atggaccctg gatgtacgat cctctttctg ttttcaatcc 71520
gggagctaga aatgctagat tgatgcgagc tctcaaaaac cagtacaaga aattatcaat 71580
ggattcagac gatggttttt atgaatggtt gaatggcgac ggttcagtat ttgctgcctc 71640
aaaacagcaa atgttgatga atcacgttgc taactttgac gacgatcttc taactatgga 71700
agaagccatg tcgatgattt cgagacattg ttgtatctta atttatgcac aggattatga 71760
tcaatatatt agcgctagac atattacaga actattttag attatgatat ttaaatgagt 71820
tggtacgaaa aatataacat tgtactgaat ccgcctaagc ggtgttcttc tgcatgtgcg 71880
gataatttaa ctactatatt ggcggaggac ggtaaccata ttagggcgat actttattca 71940
cagcccaaaa aactaaaaat attacaggat tttctggcaa cgtctagaaa taaaatgttt 72000
ttatataaaa tattggacga cgagatacgt agagtgttaa catgaatcta cgattatgta 72060
gcggttgtag acacaacggt atagtatctg aacaaggata tgaatattgt attttttgcg 72120
agtctgtatt tcaaaaatgt acaaaggtac aaaaaaagtc aaacttccat gtgtctaata 72180
aacttattca tttgagaaat gtattgcgga gattattgtc tcatcaatgt tctggagaaa 72240
ttatctcgga actcttggat atcatggaaa aaaatcaaat atccacggat gatgtagatg 72300
caaattttgt atctagtttt cttaaggcta acgagagaat aaataaaaag gattataagt 72360
tagtctttga aataatcaat caagtaaaag atgagaaact gaatctgagt acagaaaaga 72420
ttaatgaagt agtagaaata tttaagcact tggtattctt ttgccaagaa aacactcctt 72480
ctaagacaat taattattca ttctttttgg ataaaatatt cgatatcact tctgtaacta 72540
aaaatctaaa acctcaaact gttaaaaatt atacgaaaaa taatagtaac caattagtat 72600
gggaaaactt tttagcacat atgagatcta aaaaacgtgt aactatggta gaggattatg 72660
gacacgagta tgtttttgta gatgagaggt tttctacttg ctcattagaa gtataaaaaa 72720
atagttccgt aattaaatgg ctaagcgagt aagccttcca gatgtggtta tttcagcacc 72780
taaagcagtc tttaagcccg ctaaagaaga agcactcgct tgtatactac caaagtatta 72840
taaatctatg gcagatgtgt ctattaagac aaatagtgta attgataagt gttggttttg 72900
taatcaagat ttggttttta gacctattag tattgagaca ttcaagggtg gtgaagttgg 72960
gtatttctgt tctaaaatat gtagggattc gttggcttct atggttaagt ctcacgtagc 73020
tcttagagaa gaaccaaaaa tttctttgtt gcctttagta ttctatgaag ataaggaaaa 73080
ggtcataaat acaataaacc tactaagaga taaagacggc gtttacggaa gctgttactt 73140
taaggaaaac tcacaaatta tagatatttc tctacggagt ttattgtaag ctttttccat 73200
tttaaataga aaatgaataa tactatcatt aattctttga tcggtgggga tgactctatt 73260
aaacggtcta atgtcttcgc agtcgatagt caaattccaa ctttgtatat gccgcaatat 73320
atttctctat ccggagttat gacaaacgat ggtccagaca atcaggctat cgctagcttc 73380
gaaattaggg atcagtatat tactgcgctt aatcatttgg ttctgagttt ggaacttcca 73440
gaagttaaag gtatgggaag attcggttac gtaccatatg ttggatataa atgtattaat 73500
cacgtatcta tctcttcgtg taacggtgtt atttgggaaa ttgagggcga agaattatat 73560
aataattgta tcaataatac aattgctttg aaacactctg gatattctag tgaacttaat 73620
gatatttcta ttggcctaac tcctaatgac actattaaag aaccatctac agtatacgtt 73680
tatattaaaa ctccgtttga tgtggaagat acattcagca gtcttaaact atccgattca 73740
aaaattaccg taacggtaac cttcaatcca gtatccgata tcgttattcg tgactcttcg 73800
ttcgactttg aaacgttcaa caaagaattt gtttatgttc ctgaattgag ctttattgga 73860
tatatggtta agaatgtaca aattaaacca tcgtttatag agaaacctag gagagtaata 73920
ggtcaaataa accaaccaac ggcgactgta actgaagttc atgcggcaac atcgctctct 73980
gtttatacta aaccttatta tggaaatacg gataataaat ttatttcgta tccagggtac 74040
tcacaagatg aaaaagatta tatagatgca tatgtgagta gattgttgga tgatctagtt 74100
attgttagtg atggcccacc gactggttat ccggagtctg ccgagattgt tgaggttcca 74160
gaagatggta tcgtttctat tcaagatgct gatgtgtatg taaaaattga taatgttcct 74220
gataatatga gtgtttatct tcatactaat ctgctaatgt ttggaacacg aaaaaattct 74280
tttatatata acatttctaa aaagttttcc gccattactg gaacatatag tgatgccact 74340
aagagaacaa tctttgctca catatcacat agtatcaaca tcatcgatac atctattcct 74400
gtaagtcttt ggactagtca acgtaacgtc tataacggag ataatagatc agccgaatca 74460
aaggccaagg atttgttcat taacgatccc ttcatcaagg gaatagattt taagaataag 74520
accgatatta tttctagact agaagttaga tttggaaatg atgttctata ttcagagaac 74580
ggacccatct cgagaattta taatgaacta ctgacaaaaa gcaataatgg aacaagaacc 74640
ctaactttta actttacacc aaagatattc tttaggccga caactattac ggccaatgta 74700
tctaggggga aagataaact atctgttcga gtagtttatt ccaccatgga tgtcaaccat 74760
ccaatctatt atgtacaaaa acaattggta gttgtatgta atgacctgta taaggtatct 74820
tacgatcaag gggtaagtat taccaagatt atgggagata ataactaata ataatgaaaa 74880
caaactatag agttgtaaat ggatgaaatt gtaaaaaata tccgggaggg aacgcatgtc 74940
cttcttccat tttatgaaac attgccagaa cttaacttgt ctttaggtaa aagcccatta 75000
cctagtctgg aatacggagc taattacttt cttcagattt ctagagttaa tgatctaaat 75060
agaatgccga ccgacatgtt aaaacttttt acacatgata tcatgttacc agaaagcgat 75120
ctagataaag tctatgaaat tttaaagatt aatagcgtaa agtattatgg gaggagtact 75180
aaagcggacg ccgtagttgc cgacctcagc gcacacaata aactgttcaa acgtgaacga 75240
gatgctatta aatctaataa tcatctcact gaaaacaatc tatacattag cgattataag 75300
atgttaacct tcgacgtgtt tcgaccatta tttgattttg taaacgaaaa atattgtatt 75360
attaaacttc caactttatt cggtagaggt gtaatcgata ctatgagaat atattgtagt 75420
ctctttaaaa atgttagact gctaaaatgc gtaagcgata gctggttgaa agatagcgcc 75480
attatggtgg ctagtgatgt ttgtaaaaaa aatttggatt tatttatgtc tcatgttaag 75540
tccgtcacta agtcttcttc ttggaaggat gtgaacagtg ttcaatttag tattttaaac 75600
aatccagtgg atacggaatt cattaataag ttcttagagt tttcgaatag agtatacgaa 75660
gctctctatt acgttcactc gttgctttat tctagtatga cttctgattc aaaaagtatc 75720
gaaaacaaac atcagagaag actagttaaa ctactgctgt gatttttaaa acatagttat 75780
tacttatcac tcataaatga gtaaatcaca cgcggcctat atcgattatg cattgcgcag 75840
aactactaat atgcctgttg aaatgatggg aacagacgta gtacgcctca aggattatca 75900
acattttgta gcaagagttt tcttaggatt agacagtatg cattctcttt tattgttcca 75960
tgaaacgggt gttggtaaaa caatgactac tgtatatatt ctcaaacatc ttaaggatat 76020
ttatacgaat tgggctatta tcttattggt gaaaaaggct ttgatagaag atccttggat 76080
gaacactata ctcagatacg ctccagagat aacgaaggat tgtattttta ttaattacga 76140
tgatcaaaat tttagaaata aattttttac taatatcaaa actattaatt ccaagagtag 76200
aatatgcgtc attattgatg aatgtcataa cttcatttct aaatcattaa tcaaagaaga 76260
tggtaagatc cgtcctactc gttcagtata taatttttta tctaagacca tcgcattaaa 76320
aaaccataag atgatttgtt tatcggctac acctatcgtt aatagtgtgc aagaattcac 76380
catgttggtt aacttactac gaccaggatc cttacaacac caatcgctat ttgagaataa 76440
acgtctagtt gatgagaaag aattagtctc caaactagga ggcctatgtt cgtacatagt 76500
taataacgag ttttctattt ttgatgacgt agaagggtct gcatcattcg ctaagaaaac 76560
agtattaatg cgatacgtta atatgtcgaa aaagcaagaa gaaatttatc aaaaggctaa 76620
actcgctgaa ataaaaacag gtatatcatc atttagaatt ctgagacgta tggctactac 76680
gtttacgttc gatagctttc ctgaaagaca aaatcgtgat ccgggcgaat acgcgcaaga 76740
gatagcaaca ctatataatg attttaaaaa ttcattaaga gatagagagt tttctaaatc 76800
cgcattagat acctttaaaa agggagaact attgaaaggg gatgctagtg cggctgatat 76860
ctctctattt actgaattaa aagagaaaag cgtcaaattt atagatgtat gtttgggaat 76920
attagcatcc catggtaaat gtctagtctt tgaaccattt gttaatcagt caggaataga 76980
aatcttatta ctatatttca aagtctttgg tatctctaat atagagttct catctagaac 77040
aaaagatact agaatcaagg cggtggctga gtttaaccaa gaatcaaaca ctaacggaga 77100
atgcattaaa acatgcgtat tctcttctag tggaggcgag ggtattagtt ttttctcaat 77160
taatgatatc ttcattttag atatgacatg gaacgaggcg tctcttcgtc agatagtagg 77220
aagagccatt cgtctcaata gtcacgttct tactcctcca gaacgtagat atgtaaacgt 77280
gcactttata atggctagat tatctaatgg tatgcctact gtagacgaag acctatttga 77340
aatcattcaa agcaaatcaa aagaatttgt ccaattgttt agagtgttta aacatacatc 77400
attagaatgg attcatgcta atgaaaaaga cttctcaccg atcgacaatg agtccggttg 77460
gaaaaccttg gtttcaagag ccatcgatct atcgtctaaa aaaaatatta ccaataaact 77520
aattgagggt actaatattt ggtattccaa ttctaataga ttaatgtcaa taaatagagg 77580
atttaaaggc gtagatggtc gagtatacga tgtagacggt aactatctac atgatatgcc 77640
ggacaatccc gttataaaaa tacacgatgg taaattaatt tatattttct aatcaatcat 77700
cctcagttaa tttttttaat gattcgtaat aacatcctct atggccataa catttgagtt 77760
ttgcagtatc tagagcatat tttgcaattt cgtattggag tccattacct gaattcggat 77820
ctaaaaatat tagatccttg atttcatgat tacttttaaa tgtatcaatg atttgatcac 77880
tcgttaaaga tattcttccg acaaagaaga ttacctcaaa aaatttatta ataatggtat 77940
cttctatgat gccatgaata aaaaaccgag tgtctatgaa tacaaaagaa ttgtctatat 78000
taacttcttc tttaatttcc ctggataaac actctggaac attctcaccc cttttgggta 78060
tgccacccgg ataaatagca tctattctat cattatcagt agtagctgga tctagagaaa 78120
aaaatgacga tagtatactt ctttcctgtt tggacaaata attggaataa ttcagaaata 78180
atcgtttctt tctagacatg tttctagttc taattatttc agaatagaga aaactatctc 78240
gtctgttaca tactaatatt ttgttatcag tagttaataa aatagcggac acggatactc 78300
gtttatgaca ccataggcat tggttaacga atgccgtgag tgtaataatt tgagagtcat 78360
cctcgcaaat aatagactta gctagtcgtc tattatactt aataatctga ctaattatac 78420
tagatctgta aaagttcatt tactattaac tagcatatta taaatataag acagatattc 78480
gtatttatcg tcgttaatac aatcatttaa tgatttaatc tttctaattt ctacattgta 78540
aatactagta tgcgataaat cctccaacga ttgattaata aaacacgcca tgcaataact 78600
tatatatact ttattaaata atttatcccg tgttgtaagt tttagaatta catttccaaa 78660
ttcttttact gttatacgtt catcactttc ctcttttaat tctcttctta aacaatcttt 78720
aatagattcc tttttatcta gtttaccacc caataatatt aactcttcaa agtgaggatc 78780
gatatcgttg atagaacctt ttctcaaccg tctaaagatt tctcttagtt cattgtagta 78840
cataaaccgt aatagtttag tggatactct aaagatagaa tctgaatttt gttgagataa 78900
tatcgcttgg aatgcgaatg aagttcttct agctcctatt aacggatatc cgtcacttgt 78960
tatacacgca gcaaacacat gcgtgtcttt tgatcttgga atatctttta ttcgtttaat 79020
agatattaat tctctaggag tttcaaatat cacttcctca tccattgtaa ttcccatact 79080
aagagctatt tttaaacagt tatcatttca tttttactat gccgcaacaa ctatctccta 79140
ttaatataga aactaaaaaa gcaatttcta acgcgcgatt gaagccgtta gacatacatt 79200
ataatgagtc gaaaccaacc actatccaga acactggaaa actagtaagg attaatttta 79260
aaggaggata tataagtgga gggtttctcc ccaatgaata tgtgttatca tcactacgta 79320
tatattgggg aaaggaagac gattatggat ccaatcactt gatagatgtg tacaaatact 79380
ctggagagat taatcttgtt cattggaata agaaaaaata tagttcttat gaagaggcaa 79440
aaaaacacga tgatggactt atcattattt ctatattctt acaagtatcg gatcataaaa 79500
atgtatattt tcaaaagata gttaatcaat tggattccat tagatccacc aatacgtctg 79560
caccgtttga ttcagtattt tatctagaca atttgctgcc tagtaagttg gattatttta 79620
catatctagg aacaactatc aaccactctg cagacgctgt atggataatt tttccaacgc 79680
caataaacat tcattctgat caactatcta aatttagaac actattgtcg tcgtctaatc 79740
atgatggaaa accgcattat ataacagaga actatagaaa tccgtataaa ttgaacgacg 79800
acacgcaagt atattattct ggggagatta tacgagcagc aactacctct ccagcgcgcg 79860
agaactattt tatgagatgg ttgtccgatt tgagagagac atgtttttca tattatcaaa 79920
aatatatcga agggaataaa acattcgcaa ttattgccat agtattcgtg tttatactta 79980
ccgctattct cttttttatg agtcaacgat attcgcgaga aaaacaaaac tagattcgat 80040
accttgttga gcctccatta gaacggcagt gacttcgctg ccattgtcat acgcattacc 80100
atttcgaaaa aagcagtact ttgaatcgct aaatgataca gtacccgaat ctctacttag 80160
tttacagatt aaatctccac attgaatagt tacatttgat tcatcttcga tgtttaatgt 80220
tcctctgact atatccccaa cgtgataata cgcgtaggtt attacacacg gaacgtttat 80280
aactacagaa ttattaacta tttcgccgag aggtaattcc acgtcttcac gaatttctat 80340
tttcttggcc attttaccac taatttctct atggagatac gtcttatata cggcattcct 80400
aatattagtt tttatgtcta acgtcagctc gtgtggctcc aatgtaactg gaaggtatcc 80460
attggtaaca aagctcgaca tttatttctt tatatatctc atcagtttta tggagaagat 80520
accacgttat attctttacg aaagttaact ccccaaacaa atggttcgta ggaaagtcta 80580
aacggtactg taataatatc gttttcgtat ttgtataaaa gagatttatc aaatacttta 80640
ttcgatacaa agaagtgacc gtttacaata tcatctatgt aattcctagc gtcttcttta 80700
ttttttataa ctgatgtaac cattttaagt aacttggtat cgttatactt aatacgagaa 80760
ttattataaa aaaattgtct gaccaattct cccaataaaa cttttacaat tgatggatgt 80820
ggtggaagag aatacgtttc agacatctct tgaagaatag agtatattct attcgtttct 80880
ttagtcttaa attttagata caacagcttt ttgatgtcaa atggtaaaac attaattaat 80940
tcatcctgtg tgtaatcgtt taatgacgtc acttcgtcat tgaaatcgga atatacggcg 81000
gctaaaagat atacattaac tggttcagaa atatcggcgt aagagaattt tctaatagat 81060
cgtccaagaa tttggttgta ttgagaaaaa gtatctggga tagtcataaa ccaaatatgc 81120
cttacctctt tcagagtata ggattcggac ataatgtttg acgaaaacaa aaacatcaat 81180
tgactgccat catcgttttc aggagaatta tacacatcta atagatcctc taaagacgat 81240
ttcattttac tagtaacgat agcaaatgtt tttggtttgc cgtttatcat atgtggatta 81300
gttccctgag aaccattata ttcagaatat ccattactga gcatgatata tttaattacc 81360
aatccgccat atgtagaatt agaaaagtat ataaaatgtt ttccgttgag tgtctgtatc 81420
cgattaataa agtatttaaa tttggaacta atgtttaacg ttaccaattc ttctccgtat 81480
aacacgccat tatttatttt cagatttggg tacaattcct tatcctgttc ctgaaataaa 81540
gtatctaaat tattcatcag attaagttgt cccaatactg ccattgacac gttatacata 81600
tttttatcaa acatttcatg ataacatagc tgtcgtctag taatcatata atctctctct 81660
tgaagtttag acatgtgaca atatactact ctagtatcta gaaacttacg tccgtgatat 81720
cttatcgttg gtagatcttt atcaggcatt tcgtaatatg atattcttcc tttaagcaaa 81780
tccttaagta cattcacacc tcgttcgtta agaagtgtct gaattacttt cttaccacga 81840
ctaataattt caccaaaatc tatcgtctct tcggacatta aatctataat atgacccaga 81900
gtattaggtg tgttagtaat gggagatcca gacaatagta aaaaaggaat cttgttttta 81960
ttttttatca cggtcataag ttctccagta ttattcccaa agatattatg tgcctcatca 82020
acgataaaaa tagagttatt gtagcgagat aatccgttat aattaatgac gttatcgtta 82080
taattaagag aataaaaact tgttgtggaa tgaataaaga tattctcagc tatgaattcg 82140
tcattaaaca agttcatagc tacacccata ttataattaa aaattttcaa aatgttaata 82200
ttaggcacta gaatgtaaac ctttttaaat ctggaagcta ccaaggcgaa caacaaagcg 82260
attattgttt tacctgatcc cataatatgg aacaataaca cgcttctgtt ctcatctatg 82320
atagttctaa ctagataatc tagagtagct aactgatgag gtaatatagt tggtatacta 82380
tcaacatgat tatcaaataa atctatgatt ccggtattca tttagatatg tgatatcttc 82440
tataaatata tgagcatata tttacggaga tgaaatatcc tctatgaata tatattgcag 82500
atattcatca ctaacattgg caaatttctt atgtctattg atgaagctct ctatatcgtg 82560
tccaaatagt ctagaattaa aatacttgga tattttctgt tggaaagtag tcaacggaag 82620
agtaaccaca ttatcgtaca atatatatcc ctttttggag aggtccgtca ttaatggaat 82680
atgctttaca gagctagata ctaacagagt acctattttg agatagaatg caaagtccgg 82740
aatctcttcc ggtgtaggat atagtttcat aataggaaca tgatattttt tgtaccattt 82800
caccaacaag tatagaaatg cgaatctata tctattattt tgtattttac catctaaccc 82860
ctcgtctaat agtttgactt tatcgtacgc gtcattattt tcagcagcct ctctaccaga 82920
aggttgagaa aagtgtgttc tgaatcgcac gacggcaatt cttctcatta atgcgttatc 82980
tatcctatca aagacaggtt tgtaattagt atcgataatg attgtcgcat ggtttctatt 83040
attaatttta ttggagaaac acggtcttcc aatgacacaa ggttctgtca actttttaat 83100
attgtcagat ctaattttct ttgatccact acaggcaaaa tcaggtagtt cgctacagaa 83160
tacagatctt ttcaaatgca tgttagcgat aaatggatta ggtcctttat ccaatacatc 83220
tgttaaaatt gtttgacccg tctcaacaaa caggtcaccg atagcagact ttaacaaacg 83280
tttggttgtc gactttccag ttgcagtttc tccaaaaaag aatgttaaac atcctttggt 83340
agcaccgcat aaacaactag ataaagtttt ttcgtacaac tctctatttt tcttattttc 83400
atccgttaat ggttggatat cgttaatgat attcattaac tcttccattt ctggactgtc 83460
ttcgacgaac tttgtatcgt caaatttaaa tccggttgat acagtacacg tatatttttt 83520
agcatcatct ccagagtaaa acattccgtc taccaggtcc aatacaccat ttttaaacgg 83580
aagtttatcc ggataggtat cggtctccac tgaatctatt aacatgtctc gtatgttagc 83640
ttctacagtc tttcgtttcc tcggacagag taattcgctt gaatattcct taggtagttg 83700
atgtcttatt gacagaatta gttttgttat caagggttct tcgctgttaa atttccatga 83760
attattaatc caaactatat ggtctcctcg ttcggttaat aaaacagagt tagtgtctaa 83820
aattctttgt gcaatattaa acagtttatt accatccaac ggaacaattt taactttaca 83880
actatgtgga ttaccagttt tgtaaattct aatagcacca ttttccaacg atagttgatg 83940
cggatgttta tgcgatcgtt ttttacataa tgcacaaggt gttacgtaat ctatgaccag 84000
tggtaccgtt gtaaaattat tttcatcgag atcattaaag tttattatag aattaatgaa 84060
tatttttgaa actcttttta tagcgtcttc gaatgaaata aaccctggtt cccataactt 84120
atcaggaact aaatcctcca atcgtcgttg tagagaaaag taataactat tgttgttcat 84180
atccacgtaa gtgaataggt aatcttctat attatcatgc ggtggttgca ttacatgaat 84240
agtgtcgcaa tttggatttt tcctagtacc tacaacccga agagttgttt ttctcctata 84300
tacggcagtg tctatcgatc tggttagtgg attttcagat gatctactta attctaatag 84360
tgttcgtttc atagctatca atgtatccat agtggtatac gtgtctaaaa agataatatg 84420
aaaacttgtt ttatctctat ttgtagactt agtcaatgaa aaattagatc tcatggattt 84480
tattacattt tcatgaatgg caccgcattc tgtaaacgcg aatctagcta cacagtttga 84540
cacctcgata ataaaatctt gaatagccgt taaataatct atttcgtcta gacacgcgtc 84600
taaatctaca tccatgaaaa ttctgactat agagtatgct tcctcatccc taagactttc 84660
gaatagtgta cattctggat tattctcaat atatctttct aactcgtcgc atttaaatgc 84720
ttctacaaat cttggatctt cattttgtct gcacgctgac gggacaccta tagtcttaag 84780
aacaaagata acatcattac ctctaatagt tgcatccatt tagaacacaa gttaaaattt 84840
cactaaagca ttaataaata aacccttgag cccaatttat aggtgccttg ttgtctaatt 84900
ccagtaaaac gttgataatt tcaaatgatc tatctttctc gaattggcgg tctctagccg 84960
ctggatgata tccgactatg gtagttaccg gggattctaa ctttgcccgt atattcgaga 85020
aatctgtttt acccaaacaa taaagaacac taacgtgttt agttatatgc tgcagcagta 85080
acttggaaat cttatcccag tagatcgcgt gactttttgt ttctcctaat ttacaactta 85140
agtaataatt ccagggtata accccgtcta ttatattaag gttataacct ttataatcaa 85200
ttactccggt taatctagat atagatgaag ctatctcctt aattgatttt tttgtaaaat 85260
ttggtgattc gaacggtaca ccagttccat ctttcggata cggatctata ccacacacac 85320
atactcgttt atttctaagc ggttgtttca actgtataaa gaacttatca ggaataggcg 85380
acgtctcgtc tcgtagcagc caactggcta cttcgttata aaactctacc aattgactca 85440
ttactggttc ccaatcatcg tgataagtaa tagtatatgg cgcgtgtgat acagtcactg 85500
aattcattat atcaaattag ataccttttt atacgtatct agtacctttt cattttccga 85560
aacgccgtta aaccaagcga atacaaacgc cttaaactta tctctattat aacatattct 85620
cgttagattt aattcttcgg tatccttgta tctaactatt gttatatgtt tagaaactct 85680
atagtggctt ctaatcagat gttctaataa atattttaaa aatgaatctt gattaaaaat 85740
catatcattg accatttgtc tggctactga attacgataa tacggagaca ataacggatt 85800
agtcaaatag ctatcatctc ctaatgatgg aggcacctgt acaaatacca atcgtttatt 85860
cttgaacgaa tcaaacgtat aaatatcata aatgacaaat ttatcagtgt gcatatcttc 85920
agatatatga ttggatgtag aacaccatct aacttgtctt ccagcatcta taaattcctc 85980
cacctctata tatctaccac cgtgctgaaa actagaccca tttaattcga cgcataaaga 86040
tccatatgta tctggagtca caatatattc agaaaaaaat acatgctcct ttaattttgt 86100
taattttgat ctgataaagt cattgtgatt tcctaaaagt ataaatactt cgttatcgtc 86160
gttatcgact ttgggatact tattatcctt aactataaaa atgtccatca atatcgatat 86220
aaaaaaaata actgatttac tcaacagtag tattttattc ccagatgatg tgcaagaact 86280
tcttcgagag aaatatatag tattagaaag aaaatcaaat ggtacaccta cagtagctca 86340
catctataag acaatggcta gatttgataa taagagtata tatagaatcg ccaagttttt 86400
atttatgaac aggccagatg ttatcaaact tttattttta gaagacgtag aacctctgtt 86460
acccgacaaa agtattaata tatctattaa caatacagag tatccacagt tggaaggtcc 86520
tataggaaca aaaatcgctc tattggaatt atttaatgca tttagaacgg ggatatcaga 86580
acccatacca tattattatt taccgcttag aaaagacata aacaacatag taactaagta 86640
agtcttcgac atctaaacct tcacatttaa tggctcctct atttagttcg aaaaagtttt 86700
ttgtagacgg tctatcttcc attgtagatg cgccattaat aaacttttta cttcgttcta 86760
taattgtagc gaaatcaacg ttatctacaa gaacaaatcc gtattcgtta aacactctga 86820
ctatatcgtt ctttttgata atgtattcag tcattggagt agacattgtt gatggattat 86880
ataccactat tctatcatca gctatttttt ctacagacat atagttttcg ctactaggta 86940
aattcttatg aattataaaa gtctttttat ctgttaattt tgataatttg tctccgtcca 87000
tggtagtgat taataccttg cctccagaag cagttagttc ggataagtta ttcatgacgg 87060
tagcataatg tctcggatga aaagaataat ggatagcaaa ctgccagtcg atgatattaa 87120
actttccaaa atagaatact tctctgacac tagagacaaa tgtatcggat cgaatagttt 87180
cctgaatgta gtcaaatttg tagtacttgg ttttaattcc agagtttaat ttgttgtatc 87240
tttcatttcc tctagctata gcatcagcat ccggatccgt cgctaccaat aacgcaatct 87300
ctccataaaa gtatttttcc aggtccgcac cgtttccaaa atcaatcgcc aatacctttc 87360
gtttgttgga atcgtctaaa aatgttttgg aacaatacat agaaataaga agagtcttga 87420
cgtagtttga taaaattccc aacggtcctc tagttcgttt attcgtaaaa taactaactt 87480
ctggatttaa tctaaattta tcattattgg cgtattgatg tcccacatcc gatagtttat 87540
cctcgttaaa gatatctcct attttgatgc tttgatctct taaatgttcg actatgatat 87600
tatgttgatt tccataataa tcttctgagt taatatattt catggtttta tcaattctag 87660
gtttaagtat ttctccatta actaagaatt ctgctataaa cttaataggt acaaccacgg 87720
acttaatacc cacttcatta tgtgtattaa tatattccaa acaatagata ttatttagat 87780
aattaacgcc gttatataac acaatcttac cagaaccata ttctttagga aagcctttat 87840
cgttgctaaa tttcttatac tctacaaaga tagacgattc tccaaagata attggttcac 87900
tggacatgta cctaaatact acatttgcag tttggtctat agtattttcc tttttaattt 87960
taaaatcaat gttagattta ggtccctttg aatagaacag aataacacct tctggttgct 88020
ttggtaaata tgtagataac atatcgacga cttcactagt tgtagtaaac ggaccttcgt 88080
atttctttga cttgaatact atccgatcac aaatatccac tagtttagat tcaacatact 88140
tactttcttc tagtctatca ttgattgcat tcacaggctc tattagctta atgagatata 88200
cggtccagtt cttatcctta actgcctcac caaagactac tacttcggaa tctattattc 88260
tcttaacagg atatctaata atataaccaa gatgtgtaaa ataacaatac aacccttttg 88320
atgtaactct gatagttata ggaattccgt cagtcttagt tacggcatat agattttcca 88380
gatccaaacc tactatatct tgtttaggca acataaaggt ttttataggc gcgttaatag 88440
gcggagaaag aattacattc tctggagaag ccataaatat atgacgtgat agagtcgtca 88500
attcctttat tagttcatca tatggaactt tttcattgtc tctaggtgtg aattctattt 88560
ccagagatgt attaggcctt gactttggat gattaatagc gtgtaataaa gaacttttag 88620
actgggcacc ggatcctaga aaatatttta gcttaaagtc tatcgtaaag ttttttgttt 88680
ttgcttgaat aagattgact aattctagtc ggatagagga tccatatttc ttgtaatcca 88740
aaaatatatg acgttcctct gtcgatagtc tcaacaagca ttctttgtga agacgatttt 88800
ccgtcactaa tgatttcttt tcccaaacta tgttatctat agcatctact aactgtacat 88860
tttttacatc tagaccatgt accttagata atggaatctt agttctaatt ttaacacctt 88920
ccttattagt aacggtaaat cgaataaacg attcctgaat cgtagagata ttcactacat 88980
ttgtcaaagt aattaatggc ggcttaataa atactagttc caattcatta tttatttcgt 89040
atgcggtaga cctctgttct aattctgaag cattcttcgc taaagcgtct atatacgttg 89100
caatagtaga agatgatact atgttggcat ccatattgtg ttttattata aacgactagt 89160
tttttttcat ttacttatta acaagcgtct tttatatatt cgtaatctat gcctttagct 89220
agagctattt taagcttttc tgtatctcga ctgatacgag tatctgatct attgcggtat 89280
ttttttataa atcgtttaag acggggagta gttttgatat attcgctaat atcctcttta 89340
ataatatcca cacacgccaa ttgttcggct atactcgatg cgtgagtctt gcattcatct 89400
ccagatatcg gagtgagggt cagatccaaa tattgagaag ccttataatt atcgtattcg 89460
aaatcaatca taaactgacc atccttatcc actgaaaaaa tggtattgtt tggatggcgt 89520
ttaaatatag acattatcaa tgccataata tctaatgtgt ttagctctcc gaagaaagct 89580
gtcattgcga gagatttcat acgcttatcc atttccattt atcggtcttg taattatttg 89640
tgtaaagatc tatatcatcc atccgtagat gatttaacgt gatctacgac tatagatagg 89700
aattcatcga acgtagtttt agatactaca tctaaaaaat ttttatcctt taccatttct 89760
aaaatagtcg ttgccatata agctcttttt gaaattgatg gagtatgacc taccacttca 89820
gcagtttgtt tgatagttaa cgctattaac ttttttggtg acggaagagg agatatggac 89880
tttacatttg tccaaaaatt atataaaaac gtataattga ctccatacgt tcggagatcc 89940
ttgattctaa taccaaactg tttgatacat tcgtatacct ttcgttcact tagtttgttg 90000
aacagaaatt cttcgggact agaatcatcc gtcagtttca atagcggttt atatagtcta 90060
ttagacttat gaacaacaaa ttcatgtgaa actttgtcct ttcctacaaa cttgataact 90120
atttcatcgg gacttatttc tatgtgttta ttttttagtg ttaataaccc tactgtttca 90180
ttctccttaa gatatttcat tttaccaaat ctaataaaaa acatagtttc cattaacata 90240
aaaaccgcca actgataatt ggaatctgtg gacgatttct ttatattttt gtttataaaa 90300
caattaattc gtttcataac gttatatact ctaacaaaaa tacgatctct tttagcgttg 90360
cgattctgta catgcatttt tccgtaaaag tattgtctac gtccttttga atcacttccc 90420
acaaaaatta atctagttaa cgcctcctcc cacgtttgtt catatactac tacatctgtt 90480
aaatgagtag gaattttaac atgttgcaaa acctcatacg ctggattatc gtctgataca 90540
ggatttaaaa aattattatc ggtaaagagt ttaccatctt tataaaaaag tgcacgcatt 90600
tacttcttac aagttttaac ttttttacga acaactttag attttccctc ggtgactaga 90660
tcagatagtg ttgtaatagc tttagtcata gaagtaaatt gtctagagat accagctgct 90720
tgaacatcct ctagaaccgt cgatactgca gagattctag taataatttt cttaagatct 90780
ttaacgatat tgtcggtagc cacctttagg tcagaaagat cgcttctagc actatgattt 90840
actttaccag cttcaacttg taccataggt tcatcatccc cgtcgctatc atcgagctct 90900
acagcagcca cgctttctac aatgtcgctg actccaggag atggagaatt tttttcagtt 90960
gtttgatgat attcctctat aactacttct tcttccactt cctccttttt ggttgatctt 91020
ttagtagccg ctggtttacg aggagtagtg gctcgtttgg ttttgggctt agtagatgga 91080
attattacat cttccgggaa aatatcctcg tttttatctt tattttcagc gctatttttt 91140
agatgagctc tgatttcagc catctttgtg aagctactag tatccgcttt atttgtaatt 91200
gaccacgcca ttacgataca aacttaacgg atatcgcgat aatgaaataa tttatgatta 91260
tttctcgctt tcaatttaac acaaccctca agaacctttg tatttatttt cattttttaa 91320
gtatagaata aagaatctat aaaaactaaa aaaattatac atcataaacc aatttcctag 91380
ttgtttgtaa ctttaaatgg actctaaaga gactattcta attgagatca ttccaaaaat 91440
aaaagcatat ctactagacg cgaatataag tccaaaatcc tacgatgact ttatctcacg 91500
aaataaaaat attttcgtta tcaaccttta taacgtatcg actatcacag aagaagatat 91560
acgattgtta tacactacga tagaacagaa tattgacgcg gatgatcaaa cactggttgc 91620
tattttttcg tatataggat ataaatttga acaggctgtt aaagaagaga ttagtacgag 91680
tttatccttc aatgacaaga ataccacaga tgaaatgacg tataacttgt atgatctttt 91740
ttttaacaca ttagacatgt atttacgaca aaagaagatc agtattctgg taaatgatga 91800
tgttagaggt gatgtaatcg ttagttataa aaatagtgac ttagtttcat catttaatgc 91860
ggaactagaa ccagagatta agaagatacc gttcaatatg aaaaatctat taccgtactt 91920
ggaaaagaat ttggaccaac taagattctc taaaaaatat ttagactttg catatttatg 91980
tagacacatc ggtattccca tttccaaaaa aaagtataat gtgcgatatg tatttcttta 92040
taaaatagac ggattatcca ttcctattat cattaaggat tttttagatg ttaagtacgt 92100
atatttggaa aatactggaa aaatttataa aaattctttt tccgaagacc ataacaacag 92160
tctatctgat tggggtaaag tcatcatacc tctcttaaag gatcgtcatc tatatagcta 92220
catctttcta tctagttatc atttacatag ttactataca gatctcatcg cgagagacga 92280
gcctgtgttt gtgaaacgca aaaaactaga tattatagag atcgatgaac ctgaggcatg 92340
gaaaagggat gttagagtgg aattcgcacc gtgtgagcat caaattagat tgaaggaagc 92400
tatgaaagtt gacgctaact atttcactaa aattaataat tttgctaacg aatttattta 92460
ttatgaagat ggtgtggcat attgtagagt gtgtggaata aatataccta tatttaattt 92520
agatgccgct gacgtgatta aaaatacagt tatcgtttcc acgtttaaca agactatatt 92580
cttgagcgaa ccatatagct atttcgttca tagtcagcgc tttatcttta atattatcat 92640
gtcttttgat aatattatga aatctcaaac ttgggtaatg aaatacaaca ttaaccgact 92700
aattcttaac tttcttattg atataaactc tagacgtcag gaatacgaaa aaaagttttc 92760
ttctgaaatt aagagaggtc tgttctttct tcgtttgtct gcaaacttat tcgaaagtca 92820
agtatcgtct acagagttat tttatgtttc caagatgctt aatttaaact atatagttgc 92880
gttagtaatc attcttaaca gtagtgcgga ctttatagtt tcttatatga aatccaagaa 92940
caaaacggta gaagaatcca ctcttaaata cgccatctcc gtggttatat acgatttttt 93000
ggttaagact agaatttgcg agaagggatc gttggatact atagttttat ttaccgatgt 93060
atacacatct ataatgccgg aggaattgga tttacatttt cagagaatca cattagaact 93120
tagaaaacta gtatccattc agagatcggc gttagaaccc aattacgatg tagaaagtcg 93180
cggcgaagag cttccattat ctgcattaaa gtttttcgat acaagcacca ttatagttaa 93240
gacaatggct ccagtacata catacatcga acaaaaaatt gttgcaccta ctccatcggt 93300
cgaaccaact gatgcatctc ttaaaaactt caaagaacta acgtgtgacg aagatattaa 93360
gatctcgatt agagttcatg atactaatgc tacaaaatta gtcatttttc catcacatct 93420
aaaaatagaa atcgagagaa aaaaactaat tataccgcta aagagtttat atattaccaa 93480
tactctcaaa tattattatt ctaactccta tttatacgtt ttcagattcg gagatcctat 93540
gccattcgaa gaagaactca tagatcacga acatgtgcaa tacaaaataa attgttacaa 93600
tattctaaga tatcatttat tgccagacag tgacgtgttt gtatatttta gtaattcatt 93660
aaacagagaa gcattggaat acgcatttta tatctttttg tcgaaatatg taaatgtgaa 93720
acaatggata gacgaaaata taactcgtat taaagagttg tatatgatta atttcaataa 93780
ctaaatggcg gcggtgaaaa ctcctgttat tgttgtgcca gttattgata gacctccatc 93840
agaaacattt cctaatgttc atgagcatat taatgatcag aagttcgatg atgtaaagga 93900
caatgaagtt atgccagaaa aaagaaatgt tgtggtagtc aaggatgatc cagatcatta 93960
caaggattat gcgtttatac agtggactgg aggaaacatt agaaatgatg acaagtatac 94020
tcacttcttt tcagggtttt gtaacactat gtgtacagag gaaacgaaaa gaaatatcgc 94080
tagacattta gccctatggg attctaattt ttttaccgag ttagaaaata aaaaggtaga 94140
atatgtagtt attgtagaaa acgataacgt tattgaggat attacgtttc ttcgtcccgt 94200
cttgaaggca atgcatgaca aaaaaataga tatcctacag atgagagaaa ttattacagg 94260
caataaagtt aaaaccgagc ttgtaatgga caaaaatcat gccatattca catatacagg 94320
agggtatgat gttagcttat cagcctatat tattagagtt actacggcgc tgaacatcgt 94380
agatgaaatt ataaagtctg gaggtctatc atcgggattt tattttgaaa tagccagaat 94440
cgaaaacgaa atgaagatca ataggcagat actggataac gccgccaaat atgtagaaca 94500
cgatcctcga cttgttgcag aataccgttt cgaaaacatg aaaccgaatt tttggtctag 94560
aataggaacg gcagctgcta aacgttatcc aggagttatg tacgcgttta ctactccact 94620
gatttcattt tttggattgt ttgatattaa tgttataggt ttgattgtaa ttttgtttat 94680
tatgtttatg ctcatcttta acgttaaatc taagctgtta tggttcctta caggaacatt 94740
cgttaccgca tttatctaac actattccat attactaaaa tcggaacacc aatgcggtga 94800
cataaaataa ccgctataac ctaattcatt taacatctca ttaccacaag taataacatt 94860
attagacttg tgttttatca aatactgaca aaattgttga gcagatggat cgacctttgc 94920
cgccttttta accatccacg cgtctccagt acctcgccta atagcttgcg gcagatatgt 94980
tttcttatcc aatcgcatag ctataaaata ggcgccgaaa tccacacatt tgaattcgaa 95040
tatatcatcc ttaccagcgg ctagaagtct acctctatca ctttctaatt ttgttttgct 95100
atccgttaat gatttccaat cgttaaccgt atttttaatt cgcatatatc tcgttaattc 95160
attaaagact ggattatcag acgtctgaaa ccagagtaat agcgcactaa ttgccaatat 95220
aataacaaag aatataagtg ttgatgtttt ggctgcttgt acgcctacta tagccttttc 95280
tctaacgtat tctaaattac acgcgtttac cgataaagta gttttatcca tttgtacgtt 95340
ataaatggat aagaaaagtt tgtataaata cttactacta cgttcaactg gagatatgca 95400
caaagccaaa tctcccacta taatgacaag agtaaccaat aatgtgtatt tgggaaatta 95460
taaaaatgct atggatgcac catcatctga agttaagttc aaatatgttt taaatttgac 95520
gatggataaa tatacattac ctaactctaa tattaatatt attcatatac cgttggtaga 95580
tgatacaact accgatatta gtaaatattt tgacgacgta accgcctttt tatctaaatg 95640
tgatcaacga aacgagcccg tgttggttca ttgtgctgcg ggagtaaata gaagcggggc 95700
tatgattttg gcatatctaa tgtctaaaaa taaggagtca ttgcctatgc tatatttttt 95760
atacgtgtat cattctatga gggacttgag aggcgcattt gtggaaaatc catcgtttaa 95820
aagacagatc atagaaaaat atgttattga taagaattaa aagtcttctg tttcttccat 95880
ttccttgatc ttttgagaag atatcttatc agatagtttc ctttccattc tcatgagaag 95940
acccaagtcg ataaagtatt tgtaatatcc agttcctata ttagggacct tgctaaaaaa 96000
gtggcagcta ctattatcgt taataggttc tgacttttta tgaagagccg cgtttaacaa 96060
tgctttattg tctccgaacg tagctctctt aagagtacta gctgcgccga acttgaattt 96120
attcactgat tctggttcgt aactagcaca tagtaaacta gcgagaagat cacaaggctg 96180
atacagataa tcgaaccctt ctccataggt gtttaacatg gcttcgcaca agtattcacg 96240
agcggcctcg ataccgaaga tatcgtatgt attccatact ccaggataca cgttgacgtt 96300
ttccaaatcg aaagaaccca attcttttag attcatgagt tctacagtca tcttattgag 96360
cttttttgtt tgattgaagt cgtcatatcc cgtataatca gagataggaa tcttgaattt 96420
actaatcttg cccttgttgg cggcacccgg aagaaccatc ataaacttac taagattcaa 96480
ttcttccggt tcaacgaaat ttaggtagac agtaaatcta atattatcct catcctcaat 96540
gaatgtttcc ataccccatt cctttacaat gacgctaaag gagatgaatc gttcaatcat 96600
atattcgacg actaattcgg taatttctgc tctcttgatg tataatctat tgactattat 96660
atctactaca tacctatctg tttcttttcg aagagtgatg tttggattta attctcccaa 96720
acatacaaat tcgaaattaa tcttaacaga ttgaagttta gagatatcat cggataccag 96780
agtgataatt tcggtcttat tcttactcaa attagtcaag ttattaaact cgttgaaacc 96840
aagtttttgt ttgacggcac cacttttttc agtagtgtga aaactggaca gggcttgttg 96900
tgtaaacttc tcagacaata cctgtgcaga aataattcca ataggagttc caccacctag 96960
agtataattg agtttttcat agaacttttc aaagatagtg atagccgttt cttttgtaat 97020
tctaattcta gaaggattaa gatgcgtcaa gaatatatac tccataaaat ctatattaga 97080
taccgtaaag aagtatttct ctctcacatc atcaatgacg ttatgaatca tatcgtacag 97140
atccttaacc ttaatagcat tatcctcagt ggtgggtttg acgaatacta ggaaattaaa 97200
cggcgccaat gtcttttttg caagtttctg tttctgagag taaacgaatc cctgttttat 97260
tttattccac agagcactaa tttccaaata ccaagtcatg gactcatctg gatagataag 97320
atctacaggt ttacatactg agcctagaat tttggtataa ttggcggcgt acttgatgag 97380
cgtattacct ataactactt gtccgtatcc gtcgaccacc atatcctcca tctttttaat 97440
gatttttcta gccagtgttc cggtacgtga tgtttcacag acgatatcag tagattgaga 97500
tcttgcaacc agcatcgaaa agtaatattg agaacccgtt aatccttttg ttaaagaatt 97560
aagaatgtaa cctcttcctt ctggatcctt agagtctgga agatagtaag gtaagactct 97620
acccaatact cgagtctctg ctggttcacc atcaatcctc tgttgtccat aagtacctag 97680
aatatacatt agttctgtgg gatttacttt ataacccgct ttggccattt tcaggaggtt 97740
attatctgga tcatctatca gcgtttgtct catatgttct tctatctctc ggatattaag 97800
atttgtcaag ttggataaca tggattccac atagtccgcc tctaaagctt tagataatgg 97860
aactattttc ccgtctctta catcgttgag atatttggcg tatgcttctt tgataagttc 97920
tattttttct acgttgatgg cctccaattt attagtgaac gtcgaatttg gtctcagatc 97980
tttgaatgtc accccaaaac cgtaaataga tagatatctc ttaaaaacat aagatgactt 98040
gataataaag ttgatacctt ccacgttcga cttatgatcg gataagagtc cagccaatga 98100
cagatgcctc atagccacaa caaaattact atctacgtcg tttgcaataa tttcaccatc 98160
ctttaagaga cccggataat taatcttttc acctatcaag taagtataga tatctttacc 98220
gctgaattta catttacctt taggatcgaa ctctcttcca tatttcccca agatgttcaa 98280
tacttcatct aaacaaagat cttgtatcct aaacaatgaa tacgctgcta cgatttcatc 98340
ttgaatagat ccataaacgg gggctccatg aatatcgtgt ttgagtaacg tcgtcggata 98400
cataagaata ctttgttcaa ttacggcttt aggattttgc tccaatatca tccattcttc 98460
atctccgtcg aaatcagcat tttgagagtt gacaattccg ggagatattt tgatagtatc 98520
tccttcggta gctctgatag atgaagcgat gacgttgtat ctatgtagag acggctgtct 98580
tccaaaaata atacttgtat attcttgaac agctacttct acccaatcac caggcaataa 98640
atgtatttta tttttgataa actttccttg gcgtattctt gttaattggt ttaatcgttt 98700
attaaagtaa aatttaactt ggtttgacgc taatagttgt ttaactttat ccactgtaaa 98760
ggcattaaca aatatctttt ctgtaagtgt atttctaata tatgcgggca ttcctacctc 98820
attaacggtg atagatgtac tgggaccaat tacagatcta gcggtctgat cttttcgccg 98880
ggcgacgata taacttctaa tcatattatt tttgccggat gtgatatatg ataaattgat 98940
actggaagtg ttattagaaa taattttaat atcatcgtat tctattaccg ccttctggat 99000
aacctgttca tcagcattca agttacaatt cttaacgatc atacctaata agtaagttaa 99060
ttcattggtt tctttgggta tactatctat ccaaaaacta atagccggtc taataatcaa 99120
cggagggatg ggaaagtagt ctgtataaaa taagttagct ggatattgat gaatttctaa 99180
taatggccaa aacttttcat gaatagaaat taacttttga tagatgagag aattaggaac 99240
gttaatatca tccaacttgt tgacgaaaca aacctttttc tttgaaaaag taattttttg 99300
atacggttgc atacattcac tgttccaaca tgactttttc ttggataata ttttatcctt 99360
taatctccta agagcgtgtc ccgataactc ttttaggtta atatcgtcgg aatacggttc 99420
tcgtgaacgc aataatccgc agtgaataca tatataattc agtaaacgaa taatttctga 99480
aataaattca ggcttaacta tatgagtttt ataaatactt actttacccc agtgaccgaa 99540
acattccaat tccgttttcc cacaagtttt acataatgcc ccatccatag cacctagtct 99600
accatcttta acggtaccga tatcgtcgtc atttttaaca tgactaatga taatatctgt 99660
agcattaatc tctttttgat catatagact atacgtaacc ttagagatta cagccatttt 99720
tatcaagtca gtttctttta aagaaccgaa agtatacaat caaatttccc tttttattac 99780
aactataaaa taatagttat atttacactt taaattttta tcatgacgga cgaacaaatt 99840
tatgcattct gtgatgctaa caaagacgat atacgatgta aatgtattta tcctgataaa 99900
agcatagtac ggataggaat agatacaaga ttaccctatt attgttggta cgagccatgt 99960
aaacgaagcg atgcgttgtt accagcctct ttaaaaaaaa atataacaaa atgcaatgta 100020
tcggattgta ccatttcatt gggaaacgtt tccattacag atagtaaatt agatgtaaat 100080
aatgtttgtg attccaaacg agtagctacc gagaatatag ctgtccgcta tctgaatcag 100140
gaaattagat accctattat agatatcaaa tggcttccga ttggattact agcgttagct 100200
attttaatat tagcattttt ctaaacaaga tataagatat aaaatatatt attgattatt 100260
ataatgttct tatctcatct ctactaattg attaatcagc gactgaaata acagatctat 100320
cggctatctc tactccagtt accatgttat tgcggaaaaa tctaacaatt tttaatggta 100380
tattaggacg gtagagaatc ttgacaacta tttccgtctc taacattttg ggaagacgaa 100440
gagtcttttt accatcgcct gtttgtagta cactattaac tatattagtt tctgtagtat 100500
ctagtctaac aaagctagga aataagtcag atattaccac atctatgtat agaaaattta 100560
caggaaactt gttatgacct ttttgaagat acgatgttat aatgattggt tctccataat 100620
catcatcaac cggagtataa tttatggatg attcgttaaa tacctgaaac acagttttca 100680
tagaatcaat tttatgtcta acggttatta aagtaactaa atcattatac tctatatcgg 100740
tagtatatct cagtagtacg tttctattta ttactgcgta cggatctctt gctatttctg 100800
tttttagaca tagaattttt gctagatatt ttacgttgta ttggttcatg actaactttt 100860
tcagttgatg ttgttggaat atttaagaaa cgaaatatag attgttgtag aaatagtacc 100920
tttgctttag tagtaggaaa tgttttattg cagtacacgg tcctcagcat aaagtacatg 100980
tgaaaatagt catattcctg attaggataa tcaaagttaa caactacttt gttacggacg 101040
atcttattaa ggtagtacat ctttttttca taatttacag cgtctgattt ggtaactcga 101100
gtcagtctca tgttctcacc ggtataaata cttaataatc tcatttcagc tgaatatgaa 101160
ggagcaaaag gttgtaacat tttattaccg tgtgggatat aaaagtcctt gatccattga 101220
tctggaaacg ggcatctcca tttaagacta gacgccacgg ggtttaaaat actaatcatg 101280
acattttgta gagcgtaatt acttagtaaa tccgccgtac taggttcatt tcctcctcgt 101340
ttggatctca catcagaaat taaaataatc ttagaaggat gcagttgttt tttgatggat 101400
cgtagatatt cctcatcaac gaaccgagtc actagagtca catcacgcaa tccatttaaa 101460
ataggatcat gatggcggcc gtcaattagc atccatttga tgatcactcc taaattatag 101520
aaatgatctc tcaaataacg tatatgtgta ccgggagcag atcctatata cactacggtg 101580
gcaccatcta atataccgtg tcgctgtaac ttactaagaa aaaataattc tcctagtaat 101640
agttttaact gtccttgata cggcagtttt tttgcgacct catttgcact ttctggttcg 101700
taatctaact cattatcaat ttcctcaaaa tacataaacg gtttatctaa cgacacaaca 101760
tccattttta agtattatat taaaatttaa tcaatgttta tttttagttt tttagataaa 101820
aaatataata ttatgagccg acgtaacact ttctacacac cgattgatac atatcattac 101880
ctcctattat ctctatctcg gtttcctcac ccaatcgttt agaaaaggaa gcctccttaa 101940
agcatttcat acacacagca gttagtttta ccaccatttc agataatgga ataagattca 102000
aaatattatt aaacggttta cgttgaaatg tcccatcgag tgcggctact ataactattt 102060
ttccttcgtt tgccatacgc tcacagaatt caacaatgtc tggaaagaac tgtccttcat 102120
cgatacctat cacggagaaa tctgtaattg attccaagac atcacatagt ttagttgctt 102180
ccaatgcttc aaaattattc ttatcatgcg tccatagtcc cgttccgtat ctattatcgt 102240
tagaatattt tatagtcacg catttatatt gagctatttg ataacgtcta actcgtctaa 102300
ttaattctgt acttttacct gaaaacatgg ggccgattat caactgaata tgtccgccgt 102360
tcatgatgac aataaagaat taattattgt tcactttatt cgactttaat atatccatca 102420
cgttagaaaa tgcgatatcg cgacgaggat ctatgtatct aataggatct attgcggtgg 102480
tagctagaga ggattctttt ttgaatcgca tcaaactaat cacaaagtcg aacaaatatc 102540
ctttattaag tttgaccctt ccatctgtaa caatagggac cttgttaaac agttttttaa 102600
aatcttgaaa gtctgtgaat tttgtcaatt gtctgtattc ctctgaaaga gattcataac 102660
aatgacccac ggcttctaat ttattttttg attggatcaa taataataac agaaagtcta 102720
gatattgagt gatttgcaat atatcagata atgaagattc atcatcttga ctagccaaat 102780
acttaaaaaa tgaatcatca tctgcgaaga acatcgttaa gagatactgg ttgtgatcca 102840
tttattgatc gcaaaagctt tgcacaatct ttatacacta tcggtttact atttattgat 102900
aacgcagatg tttgagttgt catccatggt aatccataga tcattaattt atcgtcttca 102960
cacgctagat tagcacgtcg taatctatca ataggatcgg gtatttttcg tttaggcatg 103020
aagaacatat ttaattcaga tctaaaaaat acatatatta gaatgaatac aatgaatact 103080
tcaaataaaa ctattaatct gtgtttataa acacttaata aagaatgttt aaacgtgggc 103140
tctataaaca caggattaaa gtatacatta ggaacattct ccatttatag taatcaatcc 103200
tttgtcggaa tatctgttag aggaatattc tttttaacac attccaatag tcccagaaaa 103260
tcatttaacc taatgggttc ttgaagaggg ctagaactat acgattctag ttccttatta 103320
gtactaagtt tctctagttt atttttattg gtgaatccgt aaatggcatt caatctcatg 103380
gatgtggaag gagaatacac attcaggaca ttaattcccg catatttaga tatcaaagcc 103440
ttgttgtctc cgataggtgt atactcagtc gacgcggatt ccatattttc tttataaata 103500
ttaatctttt tacgagtttc gaaaatacac aagataggtg atgatccttg acggaaaata 103560
accatgttct tattctttct cctaccactt tttttaggaa tgacggcagt catatccgta 103620
ttattaatga taagaggcat aacctcattg ttaggatcca agaaatagtt ttcaatggct 103680
tcaatagtta gtttatcatc tcctagaccc aaattgtttc ttatattata cattgtggta 103740
aacattgccc ctgtgatatt ctcgttatga atcaactcca actcatattt tgacatcgtt 103800
ctaggtacta tattaaaaat agaaagcata gatcttggaa atttggattt tgcgccggca 103860
ataaccattt gtaaatcatc atactcagat atacttcctg caaaaaatat ggtatcttct 103920
tcgatgaggt tttctagcag tagactcatt tagagaagtt ttttttgtga taaatgaata 103980
cccgtaccga tgttacaaac gataatatag acaaaaatcc aaccaaacga ggtgatagaa 104040
atataccagg aagaaatgaa agatttaatg accaaaatag attcaacaac gatataccaa 104100
agcctaaacc aagactacag cctaatcagc caccgaaaca agataataaa tgcagagaag 104160
agaatggaga ttttatcaat attagattgt gtgcctacga aaaggaatat tgcaatgacg 104220
gatatctatc tcctgcctat tatatgttaa aacaggtgga tgatgaagaa ataagttgct 104280
ggtcagaact atcgtcgttg gtgagatcca gaaaggcggt gggatttcct ctattaaagg 104340
cggctaaacg tatttctcat ggatcgatgc tatattttga acagtttaaa aacagtaaag 104400
ttgtgaaatt aaccccgcaa gttaaatgtt taaatgatac tgttattttt caaactgtag 104460
ttattttata ttccatgtat aaacgtggca tatattctaa cgaattttgt tttgatctgg 104520
tttctattcc cagaacgaac attgtttttt ctgttaatca attaatgttt aacatttgta 104580
cagacatatt ggtagttcta tctatttgcg gcaaccggct ctatagaaca aatctaccac 104640
agtcgtgtta cttaaatttc atacacggcc atgagacaat agcccgtaga ggatatgaac 104700
actccaatta cttttttgag tggttgataa aaaatcacat atcgctattg accaagcaaa 104760
cgatggatat tctcaaggta aagaaaaagt atgctacagg agcaccagta aataggttgt 104820
tagaacctgg tacactggta tatgtgccca aagaagatta ttactttata ggcatatcac 104880
tcaccgatgt gtcaattagc gataatgtca gagtattatt ttccacagat ggaatagtgt 104940
tagaaataga agactttaat atcaagcatt tatttatggc aggtgagatg tttgttagaa 105000
gtcagtctag tactattata gtataaagta ataaaaaata gttaatgtga tgactagcgc 105060
caccaacgcc aacaacattt gataatttct acttactaga cgtaccgtaa aaatataaat 105120
tactataaca aataatagta tatcaataaa caacctaatt aatggtcgaa gtatagcagg 105180
acattgatgc tctagaccgt gtataacaaa atctacaaat ttttcatccg ctatattttg 105240
tttcactata tcgtctagac gatcagcgat aacttccatg ttaatctatt aaaatattat 105300
caatatattt tcagttttgc atatccgtgg tagcaataac catcggagaa gttctaaaga 105360
atgtgtccat gtaagtagtc caatggacgt tttccttatt ggctaaaata agtttgattt 105420
tatcattggt ggatgtgaac aacatacgct tggcatagta cataaacaac gctgccaata 105480
ttataacacc gataacaatc atataaaact gaactcctgt accagcaact tgtctaggtg 105540
ctatttgagt agtggcctta gtcgtcaatt gcatcaacgc tttaatggca caatttcctt 105600
tgctagatcc tgtattaata aattccaaat ttgttggaga tcctggggct ccgtaacatt 105660
catctatgat tacgttttgt atctttaatt tgttatcgac gaccgcgcta gaattacaag 105720
tctgtttcac ataattttca aaatctctaa caacagtgtt tacactcgtc tgaatgttta 105780
acgcagcagt aaacatagct ggtacgtatg ctttttgttc cggtgttaat ccactatatg 105840
tttctgtagc ggctgataac acagcatcca actgagcatc cgcgtccgca gagcacatat 105900
ttttaacagt gaggttacat ccatggtttt gtcggatata aaaatttccg atttctatat 105960
cacattttgt ttgagcacta gcgttcgctt cttgttctaa tttagacgag atacgttcgc 106020
tgagtgtatt caccgtcgtc tgtatgcttg ccgcggcacc catttaaata gctacaatta 106080
gtatccatat taccaagaga gataataaac tgatcaaatg caattttagg tcgaacgaat 106140
gtttaatatt atgttgaact acttttgctc tattgacggg aacagtagaa aatctatcac 106200
tattgcttaa tccacatgac aatcttaatg aggaagtttt atccatctgt aagttgttca 106260
cgctagtatt acatcgtaca atattgcaaa gtcctaaatt attataatta cgtgttagta 106320
agaaattaac attggcattc gaacactctg gatcccaaca ttctcgaggt tccgcatatt 106380
ttaatgactc ttctaactta tctctagtgg gataactaca tctcatatat ttctgtttaa 106440
agtccgcaga ctgttgtctt agaatataat cgatcatctc tttgctatct tctgtattgt 106500
gtgcgcgtaa atgatgcaaa aatgattcac atattggtac actagcatct ttactacata 106560
atgtttgcat cttattatac agatttatta acgattgttg accctctaca gttctatcac 106620
tcctattaaa ggctgaaccg atccactgat ggcatatgtt tctatcgaac gtatccccct 106680
gacaccagtc gaataaatca acatcgcatt ttccagtgtc gtgaacgtct ggccaacacg 106740
attctaatac tgcaccctct tcatacttat ctgatatctt tccatccttt ttccaataat 106800
gagtacgatt aaaagtgcga cagcaattgg gtgcggttcc attatacgat tgtcttaaca 106860
atgccgaaag accaccgggt cctgtgttga ctaatctaaa ctctggatat cttttcttta 106920
ctttatcctc tttatctttt gctagtggtc ctatatgcac atattctaaa agcttagcgg 106980
gagctatcac gtcatgcatt ttatccacgt ttaataacat ctcatcagtg ggtactcccg 107040
gaggcggatc ccgtttaggg agctcaacac ttactctgcc acccatattt atctcattga 107100
aagtattaat ctaaaaacgt cataaagatg ttgatcttaa aggattgaac tctatccgaa 107160
aacaacattc ctagaatgtt atcgtcatta tccattacga ttctagtttc aaaaacattg 107220
actctctttt tgaatcctcg tagtttgttg agagacgaga tagctatttt gaaagaaaac 107280
ttttgtagtt cttgagaaca ttcagtcata gaatattccc tggaaaacgc atcagtatta 107340
ccaggagtct tcataataat attgtcatct ttaaacataa tagccaaatg ctgatgctga 107400
ctaatacact tgataaagcc caacaaccat tctaaatgaa tgacggttct accacaacat 107460
ttttcttcat acttgtgaaa attaaacacg taagatttct tttgatctat acttagacaa 107520
atagtagtat ctgtcctaat aggcatcagt tccttgttac aatcgacact tactacatga 107580
taactagaaa gttttactag attattttcc agatcaggtt ctatatctat gatggcatca 107640
ttgtgaaaac tacacaaaca cgattttacc ttggacacag gaagattaaa cacaatgttt 107700
tcggctccgc ggtagaacac tgatgcactg agaggtataa tggcccaaat gtttacagat 107760
ccgcccaagg cggcaaaaat atacattaac tcatccgtcg agtctacatt tatagatact 107820
tcttcactga actctgaaaa atatgccaca atttggcgca gtttatcgat ttttatacgg 107880
atgctcattt taaatttttg taaattattt aaagttaaat ggctgcagaa cagcgtcgtt 107940
ctacaatttt tgacatagtt tcaaaatgta tagtgcaatc tgtattgaga gatatatcta 108000
ttaattctga atacatagag tccaaagcta aacaattgtg ctattgtccg gcatcgaaaa 108060
aggaatcagt gattaatggt atctacaatt gttgcgagtc aaatatagaa ataatggaca 108120
aagagcagct attaaaaata ttggacaatc ttcgatgtca ttcggctcat gtatgtaacg 108180
ccacagattt ctggagacta tataattcgt taaaacggtt tactcatact accgcattct 108240
ttaatacatg caagcccact attctagcca cgctaaacac tttgataacc ctgattttat 108300
ctaacaagtt attgtatgcg gcagaaatgg tagagtatct agagaaccaa ctagattcat 108360
caaataaatc aatgtctcaa gaactagcag aattattgga aatgaaatat gctctcatta 108420
atctggtaca atataggatt ttgccaatga tcatcggtga gcctattata gtagctggat 108480
tttctggtaa agaaccaatt tctgattatt ctgcagaagt ggaaaggcta atggaactac 108540
cagttaaaac tgatatagtg aataccacat atgacttctt agccagaaaa ggtattgata 108600
ctagcaacaa tatagcagaa tatatagccg gcttgaaaat agaagagatt gaaaaggtag 108660
aaaaatattt accagaagtt atatctacaa ttgccaatag taatataata aaaaataaaa 108720
aatctatctt tccggccaat atcaacgata aacagatcat ggaatgctct agaatgttag 108780
acacgagtga gaaatactct aaaggatata aaactgatgg agctgtgact agtccattga 108840
cgggaaataa tacaattaca acatttatac caatttctgc gtccgatatg caaaagttta 108900
ccattttaga atatctttac attatgagag tgatggcaaa caacgttaag aaaaagaacg 108960
agggaaaaaa caacggagga gtagttatgc atattaactc accctttaag gtaatcaatt 109020
tgccaaaatg ttaaattatt agaatcatat atctttatcc atctattatc atctgtgaag 109080
ttttggctca gaaaaatatc tttacctaat attcgtttag acgatgtatc aataccggca 109140
tttttataac attcagctac caatttaaaa caatacattc tattatgacc aaatccatac 109200
ggaataccca ataaagttaa tgacgtatca gctgctattt tcataacttg aatattctct 109260
aacatgtata cttttgcact aagatatcct tcaaaaaaac tatccaatgg gactattcga 109320
actccttttc tataggtaga ttcaactacc aagtgttccg ataataacgt accataataa 109380
atacccacat gtttttctat cgatggattc aaatgacttg ttaatgacat ggtataatta 109440
ataaaaataa tagaacctct aggcgcatat gtcttgataa aattaaccgg atccatatta 109500
gtttatatcc aatagaggtt gcactgttaa gttacgttga ggttctattt gtgtagataa 109560
ttttaaccta caacattcgt tctttacact gacgagtaca tcctttaatg atttttttcg 109620
tataatcaat ttatatcgtt cgtgagaaat atctttgccg cacgtagagc acacgagttg 109680
gaacaccatt tttaatctag taacctatat ttattatatt atcactttta gtattggtta 109740
ttaacagact atcgttaaag taaaatgaac tatttatact cagtagtaaa ttatccttat 109800
agatgccgaa cactatagtg ttacagtttt cgttagaata tccgaagatc gaagatatag 109860
attttataac atcgatatct gatttagtat tttcgaaatc tatataagat gataaagttt 109920
taacttctga aacatcaatc tccttgcgtg gttctaatgc atacattaat tggcagggta 109980
acggaacggt ctctaaataa gatgatttaa aattgttcca tttttcatct aatgcggaaa 110040
atataaattc ttgaacagtg cattgacaag gaggaacata cgagtagaca tcgtctaaat 110100
tagcgtaatc gtttataaac gtaacaatat tacgcacgtc tacggtagag ttagtctttc 110160
tataataatt tttaattgcc aaactagtga ctatattatc gattgtaaaa tcactaaaca 110220
attgtatttt gtttaagttg gtgggtgtta tagatgcccc atagagtcca ggaaagaaat 110280
cacatccatt cactaacgcc gttaattttg ctaaatagtt atcctctgcc gagggaacaa 110340
atttaaacag ttgagttaag ttttttatca tcttaggatg attatcagta gatgcaaata 110400
gcatagtatc ctgatcggta cttattatca acggccattc tcctgtggta gagaaatgtg 110460
ttttagcctc gagacacata acgaattctg cgtctcgttc atcacaataa aataatgtaa 110520
cattctcatt attgttgaaa tgtgttagta tctcattcaa tgatattttt atgttatcag 110580
aatcagataa atatatttga aatgtgagtt tatcgatttt taattgcatt tctgccttta 110640
tttcttcttc cataaatccg gtaacatttt gtatttcgga tgtacacttt tctaattcta 110700
gcatttttct gtccttggtt aatttagaat atttacgtct cttgtctcta acgtcttgtt 110760
taatttttat acttcctcta tcgataaaaa gggttacatg ccctcccttt tttacccatc 110820
cgtttacgta ttttatgaat accgtagtta actcttctaa gtttctgaca caattggcga 110880
cggctatata aatactcatt gtatccacaa atattccatt gtatactttg tataaattat 110940
catctaatat cgtcagtgat ttattttcca gcagtaacga ttttaagttt ttgataccca 111000
taaatgaaga acgtactgat tattttcggt aaaccgtatt gtagtatttg tgaaaatgtc 111060
agtgatgcag tagaagaatt aaaatccgag tatgatatac tccatgttga tatcttatca 111120
ttttttttaa aggatggtga ttcaagtatg ctgggtgacg taaagcgcgg aaccctaata 111180
ggaaactttg cagcgcatct atctaactac atcgtttcca ttttcaaata caatccacag 111240
acaaaacaaa tggcatttgt ggacattaat aaatccttgg atttcaccaa aaccgataaa 111300
tcgctggtga atttggaaat tctaaaatcc gaaatagaaa aagcaaatta tggagtttgg 111360
ccaccggtta ccgaataaaa tatcttcctg agtattttgt ttattgttcc gcttccttta 111420
tcaaataaat tttgaagtat ttgataattt attggatcga ctagttcatc tatacatttt 111480
ttggaaacga gaacatctac ctttttattt gttgttatat gcgacgttgt atacgccata 111540
aatgtaactt gagaaaatcc acaaagaatt aattttgaac cagttatggg gacgcgctca 111600
ttagaatatg tataaatctg aggagtgtat tccttgagga gatattctat caagtcatca 111660
tttgcttctc taatataaat attcctgtcc tttataacaa ctatgccttt tagattatct 111720
cctaatttat aacacggccg tatacacttg gaatacttgt acaatctaag atagactaaa 111780
tctctcagag ttgtatcatg gaattccata cttaattcct ccgataaatc tacatgttca 111840
aatactttag aaatatattt atggtatttc ctatctggaa agtaagttat caagtcatca 111900
taagatattt caaatcctct acataaggaa gacactatct ccaacgattc ctcatcttct 111960
gtaagtagga atttggacaa gttaaacaaa attagatctc taaatggcat ctttattata 112020
tcttatttta tttttgttat tcgtatgtat ttcttattat tttacatatt atccgaccaa 112080
taaacttcag gcagctgtaa tggaaacaga tagagaaaac gctattatta gacagcgaaa 112140
tgatgaaata ccgactagaa cattagatac agctatattt accgatgcat caaccgtcgc 112200
gagtgcgcaa atacacctat attataattc caatattggt aaaattataa tgtcacttaa 112260
tggtaaaaaa cacaccttta atttatacga tgataacgac atacgaacat tacttcctat 112320
tttactcctt agtaaatgat tgtcttaccg aataaagttc gtattttcat caacgatcgg 112380
atgaaaaagg atatctactt gggaatttct aatttcggat tcgagaatga tatagatgaa 112440
atcttgggaa ttgctcactt gttggaacat ctacttatat cctttgattc tactaatttt 112500
ttagcgaatg cttctacatc tagaagttat atgagttttt ggtgtaaatc cattaattca 112560
gcaacggaat cggacgcaat cagaacatta gtttcgtggt tcttttctaa cggaaaactc 112620
aaagataatt tttccctttc tagtatacga tttcacatta aagaattaga aaacgaatac 112680
tattttagaa atgaagtatt ccattgtatg gatatactaa cgtttcttag cggaggcgat 112740
ttatataacg gtgggagaat agacatgata gataatctta atatagttcg tgatatgctg 112800
gtaaatagaa tgcaaaggat atcgggatcg aatatcgtaa tttttgttaa gagattagga 112860
cctggaacat tggatttctt caaacagaca tttgggtctt taccagcatg tccggagatt 112920
attccttcgt ctattccagt aagtacaaac ggtaaaatag ttatgactcc gtctccattt 112980
tatacagtta tggtaaagat taatccaaca ttagataata ttttagggat tctgtatttg 113040
tacgaaactt accacttaat agactatgag actatcggca accagttata tttaacggta 113100
tcctttatcg atgaaactga atacgagagc tttcttcgtg gcgaggctat attacaaatt 113160
agtcaatgtc aacgtattaa tatgaattat agcgacgatt atatgatgaa catctattta 113220
aattttcctt ggctatcgca tgatttatat gattacatta cacgtattaa tgacgatagc 113280
aagtcgatac taatatcctt gacaaatgaa atatatgcat ctataattaa tagagatatc 113340
atagttattt acccaaactt tagtaaggcc atgtgtaaca ctagagatac ccaacaacat 113400
ccgatagtag ttcttgacgc aaccaatgat ggactgatta agaaacctta tagaagtata 113460
cccctaatga agcgtctaac atctaatgaa atatttatac gatacggaga cgcgtctctc 113520
atggacatga taactttatc attgtctaaa caagatatat cattaaaaag aaatgccgaa 113580
ggaatacgtg taaaacatag tttttcagct gatgatatac aggcaattat ggaatctgat 113640
tcgtttttaa agtatagtag atcaaaacca gctgcgatgt atcaatatat atttctatca 113700
ttttttgcta gtggtaattc catagatgac atattggcaa atagagattc taccttagaa 113760
ttttctaaaa gaactaaaag taaaattttg tttggtagga ataccagata cgacgtcact 113820
gcaaaatcta gttttgtatg tggtatagta cgaggtaaat cattggataa aacgtctctg 113880
gttgaaatga tgtgggatct caagaagaaa ggattaatat attctatgga atttaccaat 113940
ctattgagca agaatacctt ttatctgttc acatttacta tctacactga tgaagtatac 114000
gattatctaa acactaataa acttttttct gcaaaatgtt tagtcgtgtc tacaaaagga 114060
gatgtggaaa atttttcatc tctaaaaaaa gatgtggtca ttagagtttg atttttagtt 114120
attatctaca ggaacaaata tagtatctga aatcatattc atatatcccg ttagaggtct 114180
atgataatat atagtagcgt ttgttccgtt atagacaccg aataatattt tacaaaagtg 114240
tatatacgta tcatcatctt tatgtttaaa atttaaaatc ttaattcgta aatttagaga 114300
taaaatggct tcttgtacaa tactagttaa ttctcccgtc ctctcaaaat tatccaactc 114360
ctcagcgaga ataggactta gtacataaag tttagcatac tctatcatct tcatataata 114420
attggataat aatttattcc atgtttctgt actaatatcg aacgagtcta tatattcctt 114480
tgtacgccat agaatatcca aatttgtagg gattataaac aaatcttcgg ggagtgttag 114540
attaaactta ttagcgtaca atatataatt atgtagaaat tctgaatcta ttcgctgtat 114600
agacttcata taagacagat catagaaata tacgtatgtc ccaggattaa ctcttcctac 114660
tcttcctttt cgttgatctc tcatagattt agaaataaat tcttgcgatc ctccaaaagg 114720
agcggggaca aaaactctac ccatatcata aatgtgtgta acattgcgta tagtaacgct 114780
ggattccaaa taaggagtag aaataattat cgatacatta ggtgatgaat acactttttc 114840
taatatttcg tctatatcta agaccttacc atgaataata tacatatcat acggtaatct 114900
tttttctaaa tatgatttat attcgtgaca ctgtgcaacg gatgccacaa agactatacc 114960
ggatgatcca tcaggaggag tatacatctg tatagcagta actaaatttc tcttttcttc 115020
ttctatgtat gccattctgg aagatggatt tatcttatta tgaataaata cctcgctaat 115080
tttaaacagt gtatctccag gaatatgtat aaatgcggga ttaggtaaaa atacttttag 115140
ccgttccctg tcatcctcta acgtggcagt cattaaaaac atagaatcta ttttcgtatg 115200
atgctttctc gctactgcta taataatatc tcctatttga tcatgctcat gaacttcgtc 115260
tataataaga gtgccataac taaatagttt tgttagagat aacttatggg tagaaaatac 115320
aattccatat ttttttggtt gtttgtttat taattcttcc ggtatagatc cgtaccgtaa 115380
agaaatagga gatccatcta gtaccttaaa tcccaatgat tttaaaatgg tattgctatg 115440
caatctaact aaagctatcc taggaagaga tagaatgact ggtctttcgt gaaagtcagt 115500
gattttatct agagtagaga atccaccaaa taaataatta aaccaaagca ataacttggg 115560
tacctgtgac gtcttaccca ctccagttcc tccagttaaa actacaggtc tatgagaaat 115620
ccacgctgaa aatatctctc gttgtgcttt aggagttaat gaggccaatg gaattttgct 115680
aaatggatac ttattcccta gtacacctat agtatctgtt cgtttaccta cggaattgga 115740
aaatatctgt aaactattac cagcttctaa tagtcctctt aatatatact catcgaacga 115800
attgatttct gtatttgtta tacattttag aaaactataa cattcaaagc ttattgtgtg 115860
ttcactgtat atatattgct tcgcggttag gtttatgtgt gcgggttttc ctagtgatgg 115920
aggagatact ttttttccac gtgcttctgt actaactttg tatattcctt tatgttttac 115980
aacgtgtgcg ttatgccatc tatgttttat aactggaaac accgccaatg agaaactatc 116040
acgttccgtt ttactcatat tactgaattc atcttgtgaa tatttgtaag agaatacatt 116100
aacgcagttt ggaaaaaaga agatatctgg taaattcttt tccatgataa atggaaagat 116160
atacagattt agtaattagt aaaataccag aactaggatt taccaattta ttatgtcata 116220
tatattcact agctggatta tgtagcaata tagatgtatc taaattttta acaaattgta 116280
acggatatgt agtggagaaa tatgataaat ctacaaccgc cggcaaagtg tcttgtattc 116340
ctatcggtat gatgttggaa ctagtagagt cggggcacct gagcagaccc aatagtagcg 116400
acgaactcga tcaaaagaaa gagttaaccg acgagttaaa gacgcgttac cattctatat 116460
atgatgtctt tgagttacct actagtatac cgttagcgta tttctttaaa cctcgactac 116520
gggaaaaagt atctaaggcg atagacttct cacaaatgga tttgaaaatc gatgatttat 116580
cacgtaaagg aatacatact ggtgaaaatc caaaggtcgt caagatgaaa atagagcctg 116640
agagaggagc ctggatgagc aatcgaagta ttaagaactt agtctctcag tttgcttatg 116700
gatccgaagt ggattatata ggacaatttg acatgagatt cttaaactcc ttagcgattc 116760
atgaaaaatt tgacgcgttt atgaataaac atatcttatc gtatatactt aaagacaaaa 116820
ttaaaagttc tacctctaga tttgtaatgt ttggattttg ttatttgtct cattggaaat 116880
gtgtaattta tgataaaaaa caatgtttag tatcctttta tgactccgga ggcaatattc 116940
caactgaatt ccaccactat aataattttt atttttattc cttctccgat ggttttaaca 117000
cgaatcacaa acattctgta ttggataata caaattgtga catcgatgtt ttattcagat 117060
ttttcgaatg tacatttgga gcgaaaatag gctgtattaa tgtagaagtt aatcagctgt 117120
tggaatctga atgtggaatg tttattagtt tgtttatgat attgtgtact aggacaccac 117180
ctaagagttt caaatctctg aaaaaggttt atacattctt taaattttta gcggataaga 117240
aaatgacatt atttaagagc attctattta acttgcaaga tctatccctg gatataacgg 117300
aaacggataa cgcaggatta aaagaatata aacgtatgga aaaatggacc aaaaagtcaa 117360
ttaatgtgat atgtgataaa ttaactacaa aattaaatag aatagtaaac gacgatgaat 117420
aactttgtta aacaagtagc ttcaaagtct ctaaaaccta ccaaaaaatt gtctccgtca 117480
gatgaggtga tatctttaaa cgaatgcata atatccttta acttggataa cttttattat 117540
tgcaacgatg gactgtttac taagcccatt aatactccgg aggatgttct taaatcactc 117600
ttgatcatgg aatcattcgc ctacgagaag atgataatca aaggattgat aaaaatacta 117660
atatctagag catatattaa tgatatttat tttactccat tcggttggtt gacgggcgtc 117720
gacgatgatc ctgaaacaca cgtggtgata aaaataattt tcaattcatc actaatatct 117780
atcaagtctc aagttataga atatttaaaa ccatacaatg tcaataacct atcggtactt 117840
accacagaaa aagaattaag tattaatacg ttcaatgttc cagattctat acctatgtcg 117900
ataatttcgt ttttcccatt cgatacagat tttatactag ttattttgtt ttttggagta 117960
tataatgact cgtattgtgg aataagctat ataagtccga aagagagact accgtatatc 118020
atcgaaatat taaaaccgtt ggtgtcggaa attaacatgt tatcggatga aataggtaga 118080
acatcatcca ttagaatctt caattccact agcgtcaaaa aatttcctac taatacatta 118140
acatccattt gtgaaattgt ttattcgttt gatgaatcat cctttccgac gcctaagacg 118200
ttcactcctc taaacgcgag tccatacatt cctaaaaaga tagtttcact attggattta 118260
ccatctaatg tggaaataaa agcgatatct agaggcggtg tggatttcat cactcatatt 118320
aataataagc gtctaaacac aatcttggta atagcaaaag ataacttttt aaaaaattct 118380
acattttctg gaacttttat caaagagaat attatttgga agggtatcta tacttataga 118440
ataatcaagt ctagttttcc agttcctact attaagtcgg ttactaataa aaaaaaaata 118500
tgtaagaaac attgtttcgt caattctcag tatacaacta ggactttgtc acatattctt 118560
tgatctaatt tttagatata aatggtggat gctataaccg ttctaactgc gatcggcata 118620
actgtattaa tgcttttgat ggtaatttct ggcgccgcca tgatagtcaa ggagttaaat 118680
cctaatgata tattcactat gcaatcatta aagtttaatc gagccgtaac gattttcaaa 118740
tatataggac tctttatcta tataccagga acgataattt tgtacgctac atacgtcaaa 118800
tccctattaa tgaaaagtta aataattttt tttattacac caacaaaaat gtttgtcatt 118860
aaacgaaatg gatacaagga aaatgtcatg tttgataaaa tcacgtctcg tattagaaaa 118920
ttatgttatg gcttaaacac ggatcatata gatcctatta aaatagctat gaaggttatt 118980
caaggaatat ataatggagt aacaacggta gaattggaca ctctggcagc cgaaatagca 119040
gccacttgta ctacacaaca tccggattat gccattctag ccgccagaat agccgtatca 119100
aatctacaca aggaaacaaa aaaactattt agtgaagtga tggaggattt attcaactat 119160
gttaatccta aaaatgggaa acattctccg attatttcaa gtatcaccat ggatatagtt 119220
aacaaatata aggataaact caactcggtt attatttacg aacgagactt ttcatacaac 119280
tattttggtt ttaaaacttt ggaaaaatcc tacttgttga aaataaacaa caagatcgtt 119340
gaaagacctc agcacatgtt aatgcgtgtc gcagtaggaa ttcatcaatg ggatatagac 119400
tcagctattg agacgtacaa tctactttct gaaaaatggt ttacgcacgc ttctcctacc 119460
ttatttaatg cgggaactag tcgtcaccaa atgtctagct gttttctact taacatgatc 119520
gatgatagca tagagggtat ctatgacacg ttaaaacgat gcgcattaat ctctaaaatg 119580
gcagggggaa taggtctatc aattagtaat attcgtgcca gtggaagcta tatctccggt 119640
accaatggta tatcaaacgg tattattcca atgttgagag tttataataa caccgctaga 119700
tacatagatc agggaggaaa caaacggcct ggagttatgg ccatatactt ggaaccgtgg 119760
cattctgata ttatggcgtt cctcgatctt aaaaagaata caggaaacga ggaacataga 119820
accagagatc tatttatagc tctttggatt cctgatctct ttatgaaacg agtgaaggat 119880
gacggagagt ggtcgttgat gtgtccggat gaatgtcctg gattggacaa tgtttgggga 119940
gacgagttcg aacgattgta tacactatac gaaagagaaa ggagatacaa atctataata 120000
aaggctcgag tcgtctggaa agcgattata gaatctcaga ttgaaactgg tactccattc 120060
attctttata aggatgcgtg taacaaaaag agtaatcaac aaaatttagg aactatcaag 120120
tgtagtaatc tttgcactga gataatacaa tatgcggatg ctaatgaggt agccgtttgt 120180
aatctggcat ctgttgcctt gaacatgttt gtaatagatg ggcgatttga ttttctcaaa 120240
ctgaaggatg tggtcaaagt aattgtcaga aatctcaata aaattataga tattaattat 120300
tatcctattc cagaagctga aatctctaat aagagacata gacctatcgg tattggtgtt 120360
caaggattag cggacgcgtt tattctctta aattatccat ttgatagcct ggaagcacaa 120420
gatctaaata agaagatctt cgaaaccatt tattacggtg cattagaggc gagttgtgaa 120480
ctagctgaga aggaaggacc atacgataca tatgtaggat cgtacgccag taacggtatt 120540
ctacaatatg atctttggaa tgttgtaccg tcggatcttt ggaattggga acctctaaaa 120600
gataaaatca gaacatacgg tcttagaaat agtttattgg tggcacctat gccgactgca 120660
tcaactgctc aaattttggg aaataatgag tcggtggaac cgtataccag taatatttac 120720
actcggagag tattgtctgg agaatttcaa gtagttaatc cgcatctcct tagagtttta 120780
accgagagaa aattatggaa tgatgagatc aagaatagga ttatggcaga tggtggatcc 120840
attcagaata caaaccttcc agaagatatt aagcgagttt ataaaactat ttgggaaatt 120900
ccacaaaaga cgatcataaa aatggctgca gacaggggag ccttcatcga tcaaagtcaa 120960
tctatgaata tccatatagc agatccgagt tattccaaac taacgagtat gcatttttac 121020
ggatggagtc tcggtctaaa aacgggaatg tactatctac gtacgaaacc cgcatccgct 121080
cccattcaat tcacattgga caaggataaa ataaaaccac tggtggtttg cgattccgaa 121140
atctgtacat catgcagtgg ttaaacaaaa acatttttat tctcaaatga gataaagtga 121200
aaatatatat cattatatta caaagtacaa ttatttaggt ttaatcatga gtaaggtaat 121260
caagaagaga gttgaaactt caccaagacc tactgcatct agcgattctc tacagacttg 121320
tgcgggtgtt atagagtatg caaaatcgat tagtaaatct aatgcaaaat gtatcgaata 121380
cgttacacta aatgcttctc aatacgctaa ttgttcgtct atctctataa aacttactga 121440
tagtttatct agtcaaatga cttccacttt tattatgttg gaaggagaga ctaaacttta 121500
taaaaataaa tctaaacaag atagaagcga tggatacttt ctaaaaataa aagttaccgc 121560
ggctagtcct atgttgtatc aacttctaga agccgtctat ggaaacatta agcacaagga 121620
acgcattcca aattctttgc atagtctttc ggtggaaact attacagaga aaacatttaa 121680
ggatgaatcc atcttcatca acaaattaaa cggatccatg gtagaatatg tttcgactgg 121740
agaatcatcc attctcagat ctatagaagg tgaactagaa tcactcagta aaagagaaag 121800
acaattggcc aaggcaatta tcacacctat cgtcttctat agatccggaa cggaaacaaa 121860
aattacattc gcactcaaga aactaatcat tgatagagaa gtggtggcta acgttatcgg 121920
actctctgga gatagtgaac gtgtatcaat gactgaaaat gtagaagaag atctggctcg 121980
taatctggga cttgttgata ttgatgatga atatgatgaa gatagcgata aagaaaagcc 122040
aatattcaat gtataaatgg ataagttgta cgccgctata tttggtgtat ttatggggtc 122100
tccggaagat gatttgacag actttataga aattgttaaa tctgttctaa gtgatgagaa 122160
aacagtcaca tcaactaata ataccggttg ttggggttgg tattggttaa ttattatttt 122220
ttttatagtt cttattctac tactattgat atatttgtat ttaaaagttg tttggtgaac 122280
ttaaatggcg gaatttgaag atcaactcgt tttcaatagt atcagtgccc gtgcattgaa 122340
agcttatttc actgctaaaa tcaatgaaat ggtagatgag ttggtcacaa gaaaatgtcc 122400
acaaaagaaa aaatcacaag ctaagaaacc tgaattacgc attcctgtag atcttgtaaa 122460
gtctagtttt gttaaaaagt ttggattgtg caattatgga ggaatcctta tcagtcttat 122520
taatagtcta gtagaaaata atttctttac aaaggatgga aaactggatg atacaggcaa 122580
aaaggaattg gttttgacag atgtcgaaaa acgaattctt aataccatag ataaatcatc 122640
tcctttgtat atcgatatta gtgatgttaa agtattggct gctagactaa aaagaagcgc 122700
tacacaattt aactttaatg gacatacata tcatctggaa aatgataaaa tagaagatct 122760
cattaatcag ttggttaagg acgaatccat tcaactggat gaaaagagtt ctattaaaga 122820
tagtatgtat gtcattcccg atgaacttat cgatgttctc aaaactagat tgtttagatc 122880
tcctcaagtc aaggataata ttatttcgcg tactagattg tatgattatt ttactagagt 122940
tactaagaga gacgaatcgt caatctatgt gattctaaag gatcctagga tcgctagcat 123000
tttgtcacta gaaactgtta aaatgggcgc ctttatgtat acaaaacata gtatgttgac 123060
gaacgctatt tcatctagag tcgatagata ttctaaaaag tttcaagaat ctttttacga 123120
agatattgta gaatttgtta aagaaaatga gagagttaat gtatcgagag tggttgaatg 123180
tttgactgtg cctaatatta ctatatcaag taatgctgaa taaaaatatt tataaatatg 123240
ctcgtcgtaa ttatgttttt tatagcgttt gccttctgta gttggctatc atatagctat 123300
ctgcgtccat atatctcgac taaagagtta aataagtcga gatagtttta tatcacttaa 123360
atattaaaat ggccgaggaa tttgtacaac aaaggttggc caataacaaa gtgacaattt 123420
ttgtcaagta tacatgtcct ttttgtagaa atgcactgga tattctaaat aagtttagtt 123480
tcaaaagagg agcgtatgaa attgtcgata ttaaagaatt taaacccgaa aatgaattgc 123540
gtgactattt tgaacaaatt actggtggta gaactgttcc tagaatcttt tttggaaaaa 123600
cttctattgg tggatatagc gacctgttgg aaatagacaa catggacgca ttgggtgata 123660
ttctatcatc tattggggta ttgagaactt gttgagaaaa taatgaaaaa acaatactta 123720
atcatgtcgc cgacatgttc atgtatccgg aatttgcgag gaaggcttta tcaaagctta 123780
tttcaaaaaa attaaacatt gaaaaggtgt ctagcaagca ccagctcgtg ttactggatt 123840
atggattaca cggactattg ccaaaatcac tgtatctgga agctattaat tccgatattc 123900
tcaatgttag attctttcct cctgaaataa taaacgtcac tgatatcgtt aaggctctcc 123960
aaaattcttg tagagtagat gagtacctaa aatctgtttc cttatatcat aagaattctt 124020
taatggtatc gggaccaaat gtagtcaagc ttatgataga atataatctt cttacacaca 124080
gtgacttgga atggttaatt aatgagaatg tagtcaaggc tacatacctt ttaaaaatca 124140
atgcctatat gattaacttt aaaatagatc taacggttga tgaaatcatt gacttagtta 124200
aagatattcc tgtaggagct acgctacatc tatataatat attaaacaat atagatttgg 124260
acattgttct tcgtatatct gatgaatata atataccacc tgttcacgat attctgtcta 124320
aacttaccga tgaagagatg tgtataaaac tagttacaaa gtatcctatg gacaatgtta 124380
taaattttat taatcaagat gttagatata gtcccacctt catcaagaca attaaagatt 124440
ttgtcaacaa gcatcttcca accatgtacg atggattaaa tgattatcta cattctgtta 124500
ttatcgacga ggacttaata gaggaatata aaattaaatc cgttgccatg tttaatttgg 124560
aatacaaaac tgatgtagat actctaacat tggacgaaca gatatttgta gaggtaaaca 124620
tctcatatta tgattttaga tatagacaat ttgccgatga atttagagat tacattatga 124680
taaaagaaag aagacaaatc accatgcaat ctggtgatag aataagaagg tttagacgtc 124740
ccatgtcatt gagatccact atcatcaaaa aggatactga ttctctagag gatattctcg 124800
cacatataga taatgccaga aaaaatagca aggtatccat tgaagatgtt gagagaatca 124860
tttcatcttt ccgtcttaat ccttgtgttg tcagacgcac catgctgtct gatatagata 124920
tcaaaacaaa gattggaaat cttgtgctct gacactatca gccatcaaag gaattatggt 124980
aacagatacc atcaataccg tgttatccaa aattctgcat catcatagga atgtcttcaa 125040
gtatcttaca tctgtagaga ataaagaaat tgctgtctgt cgctcttcta tagagaatta 125100
aaaagtgtac gatgtgatct acgcacagat gatggattat tggataggct atacgatctg 125160
actagatacg ccttacacgg aaaaatcaat caaaacttaa tcggtcaacg atgttggggt 125220
ccgttgacag aaatgctgtt taacgagaat aaaaagaaaa aactaaataa tttaatggaa 125280
tacatcaaaa tatcagacat gttggtatac ggacactcta tcgagaagac gcttattcca 125340
attactgatt ctctttcatt caagctatct gttgatacca tgtctgtgtt aaatgatcaa 125400
tatgccaaga ttgtcatctt cttcaatacc atcatagaat atattatagc tactatctat 125460
tatagattga cagtcttgga caattatact aatgtcaaac attttgtatc caaagtgtta 125520
cacactgtca tggaagcatg tggcgtactg ttttcataca ttaaagttaa tgacaaaata 125580
gagcatgaat tggaggagat ggtggacaaa ggtaccgtac cttcttattt gtatcatctg 125640
tccatcaacg tcatttcaat aatattggat gatataaatg gaactcgtta atattttttt 125700
agaaacggat gctggaagag taaagtttgc cataaaaaat accgacgatg tatgtgcctc 125760
ggagttaata aataaatttg tggaactgtt aagtgaatac attcacattg accaatcaga 125820
attttatttg gtggtaaagg ataaggatat tttttatttt aagtgtgata gggggtctat 125880
ttcgattgta aacaatgagt tttatgtctt tgacgaaccc ttgctgtttg ttaaagattt 125940
cactaatgta acgggggttg aattcatagt tacagaaacc atgccgtgta gaattatacc 126000
aaaaaataat cacgcggtta tttcagtcgt gactaatcat aagttttata atgggttaag 126060
tttataaagg gttaaccttt gtcacatcga tcgcgtattt gggatcagat gccaaattgt 126120
taaataatct gatgaaaaaa taataaatat aattcagatc atcgctagac atgacattat 126180
tgtcctctat agcgatagtc gcgtgccgtc tacatgcagg acatggaaga gtgctgacta 126240
tagtatatag ttttcgttta cacgcttcta tgttgccgtc taaacccgct tgcgaaagta 126300
ctataaaaat aatggtccat acggctcttc cccaatgttt gggattcatt taaatgaaaa 126360
tatatttcta aattctataa atggatgttc ggtgcattaa ttggtttgaa agtcacggtg 126420
aaaacagatt tttatatctg aaatccagat gtcgaaatgg cgagaccgta tttatacgat 126480
ttcctcatta cttttattac gtagttacgg acgaaatata tcagtcattg tctcctcctc 126540
catttaatgc gaggccgttg ggaaagatga gaactataga cattgacgag acaataagtt 126600
ataatctaga tattaaagat agaaaatgct ccgtcgcaga tatgtggttg atagaagagc 126660
caaagaaacg cagcatacaa aatgccacca tggatgaatt tctcaatatt agttggtttt 126720
atatttctaa cgggatatct ccagacggat gttactcgtt ggacgagcaa tatttgacaa 126780
agattaacaa tggatgttat cattgtgacg atccacgtaa ctgtttcgct aaaaaaatac 126840
ctagattcga tatcccaaga tcgtacttat ttctagatat agagtgtcac ttcgataaga 126900
agtttccttc tgtatttatt aacccaatct cgcatacaag ttactgttat atcgatttaa 126960
gtggtaaacg attattgttt acgctcatta atgaagagat gttaacggaa caggaaatac 127020
aagaagccgt cgatagagga tgtttgagga tacagtcact aatggaaatg gattacgaac 127080
gagaactagt tctatgttct gaaatagttt tgttacgaat agctaaacaa ttgttggaac 127140
taacgttcga ctacgtcgtt acctttaacg gacataactt tgatctgaga tatattacta 127200
atcgtctaga gttattaaca ggagagaaga ttatctttag atctccggac aaaaaggaag 127260
ctgtacatct ctgtatttat gagagaaatc agtctagtca taagggagta ggcggcatgg 127320
ccaatactac gtttcacgtt aataacaata atggaactat atttttcgat ctatattcat 127380
tcattcaaaa atctgaaaaa ttggattcgt acaaattgga ttctatatcc aagaacgcgt 127440
tcagttgcat gggtaaagta ttaaatagag gagttagaga aatgacgttc atcggtgacg 127500
atactacgga cgcgaaaggc aaagccgctg catttgcaaa ggttttaacc acaggtaatt 127560
atgtgactgt tgatgaggat attatatgta aagtaattcg taaagatatt tgggaaaatg 127620
gatttaaagt cgtactatca tgtcctactt tacctaatga tacatataaa ttatctttcg 127680
gaaaggatga cgttgattta gctcagatgt ataaggatta taatctaaac atagctttag 127740
atatggctag atactgtatt catgatgctt gtttgtgtca gtatttgtgg gagtattatg 127800
gagtagaaac aaaaacagac gcgggtgcgt caacatatgt gcttcctcaa tccatggtat 127860
tcgaatatag agcgagtaca gtcatcaagg gtccactgtt aaagctattg ttggaaacta 127920
aaactatctt agttagatca gaaacaaaac aaaagtttcc ttatgaaggc ggtaaggtat 127980
ttgctccaaa acaaaaaatg tttagtaata atgtattaat ctttgattat aacagtctgt 128040
atcctaatgt gtgtatcttt ggaaatctat ctccggaaac attagtcggt gtcgttgtta 128100
gtaccaatag attggaagaa gaaataaata atcagctctt gcttcagaaa tatccacctc 128160
ctagatatat tacggttcat tgtgaaccta gactaccgaa cctcatctct gaaatagcaa 128220
ttttcgatag atcgatagaa ggaaccattc ctagactatt aagaacattt ttggcagaga 128280
gagccagata taaaaagatg ctaaaacagg ctaccagttc aactgaaaag gccatctatg 128340
attccatgca atatacgtac aagatagtag ccaactcagt atatggtctg atgggattta 128400
gaaatagtgc tctatactca tacgcttcgg ctaagagttg cacatccata ggacgtagaa 128460
tgatcttgta tctagaatcg gtactaaatg gagcagagtt atctaacggt atgttacggt 128520
ttgccaatcc attaagtaat ccattttata tggacgatag agatattaat ccgattgtga 128580
aaacatcgtt gcctatagat tacagatttc gttttcgtag cgtgtatgga gataccgact 128640
ccgtgtttac agagatagac agtcaagatg tcgataagtc tatagaaata gcgaaagagt 128700
tagaacgact gattaataat agagtattgt ttaataattt taaaatagag tttgaggcgg 128760
tatataagaa tctgattatg caatcgaaga agaaatatac aacgatgaaa tactcagcat 128820
cgtcgaattc aaaatctgta cctgagagaa ttaataaagg tactagtgaa actagaagag 128880
atgtttccaa gtttcataag aatatgatta agacatacaa gaccagactg tctgagatgt 128940
tgtctgaagg acggatgaat tctaatcagg tatgtataga tattctccgt tctttagaaa 129000
cagatttacg atccgaattt gatagtagat cgtctcctct agaattattt atgttgagtc 129060
gaatgcatca ctcaaattat aaatccgcag ataaccctaa tatgtatttg gttactgaat 129120
ataataaaaa taatccagaa actatagaac ttggagaacg atattatttt gcatatattt 129180
gtccggctaa tgtaccatgg accaaaaaac ttgtaaatat taaaacatat gaaacaatta 129240
tcgatagaag ttttaaactc ggcagtgatc aaagaatatt ttacgaagtt tactttaaac 129300
gattgacgtc cgaaatagtc aatctattgg ataataaagt tttatgcatc tcattctttg 129360
aaagaatgtt tggttcaaaa cctacatttt acgaagcata aaataattta caacagttgt 129420
acgtcgctct ttgttagatt cagtttatcc attagatatt ctacggctgg agtaattttt 129480
gtagtaattg aatacccagg tacgtaaaac aataaaaaca ttaataccgg atcatacttg 129540
gtaaagtaca atatagtact aatattagct aatgtctcat aatcggatgg acttcttgct 129600
ctactaagat tagggtgatc tatatctttg atacttattg gaacgggacc ggtaaatata 129660
taattctctt tgagacgact aaacaaaggc ttatagtctc ctgtttgatg catataccta 129720
gccaacgcca ggtgtttggc attatctgta tagacgtatc tgaataatgt gtgtttagta 129780
atatcaaact gagtattttt aaaccagaac aagatattat agattggaat ggaagtcagt 129840
aacaaaaaac taataaactc gtttgttgga agggcagaaa actttcgaag atatgatacg 129900
tatctaccag agaacgatcc attagcaatg aattcagtat ttactttttc ttcatatacg 129960
ggaataggac ctttatatag agacatgaga aactcgaact ttttaagtaa tcccaacgaa 130020
atgggataat agtcacttgc atctaagttt gcgtatctac tactccaaat tgcagtttct 130080
ggtatccatc catacgcgta attatcaaat agatatgtgt ttttcattaa cggtctactt 130140
agacctctac taaaaaatgt ttcttgatct agaattcttc tttgtgcatt tttgtacacg 130200
tcgtcaaaac gcggaacggt ggcagccata atatttatat cctaccgctt tttatcaact 130260
atgttatgtc ttttagttag gttaatattc taataagatg caggtaatac atcaggttaa 130320
agtattagaa tgggattata cttatatatt tagtttatct ttcatcagat aactaaaaaa 130380
tgtataaaac agacgcgtta cattgcctat gctacatgag ttccatgtgc ggagattgtt 130440
aaagtttagg gtagagagtt gttctagcat ccattcattg tctttacatc cgttatatgt 130500
aattatgtca cccatggaca ttataggttt taatagtact tcagatactg gaataagata 130560
ttgttcgtat atgtgtttga taatcatttg ttgcgtttct ggagaacttt tctctgcatg 130620
atttttacat agctttacaa actcgtgatc actttttata atgagagatc tatagtcttc 130680
gtatctgtta cgaaaatcaa tatattcagg attattttct gaatcacttg attcgtcact 130740
aatatacata actatatgat caacaatttt ttccaaaatc atttctgcat tttctttatt 130800
cattaatttg gtgggagtct gaattgttgc agcagttccc atttatcaaa aatgtgtgat 130860
aattatatct atgtatatac aaattattaa taatatattt ggtctgttct atgatctacc 130920
gtgtcttatc aattgaagta tatatttctt atccgtcttg gttagatgga tgcttttatc 130980
caagaattct tctacatgat atagattatc tctcaaaaac ctttgaaata agacgatgat 131040
ggaaatatta taactagcta aaatagtttt gattagcgat ggaagaatac tagggttttt 131100
atctgtagaa aatcgaacga atgcggatac atcctccaca gacttaagat ggtttatcat 131160
tttaactaac aagtctcctt tataatcatc taacacaaga cctagacgag ccaattgtcc 131220
gtctaaataa tattttaaaa tgtttttgct aaacaagttc agccgtctac tcttggattt 131280
tatagatgac agtttcttta gtattccgtt ctctatgacc atgggatccg actcgtgcaa 131340
aaacaggtat gtattgtatt ccttaacata atacgacgta tcctcgttaa agatttgaca 131400
cttcttcata tcatccattg tatctaatgt gatatcgttc acctcataaa ttatattaga 131460
tatataactg ttattaatat ctaactcatc aaatcgttta atgtccgcag cactcgtaga 131520
gaataaagac ataattcgtt cattatcgaa atattcggga tctatttgaa caacaactct 131580
gaataagtcc tcgttaaacg ttctatcact aataaatgta ttaccgttaa taaaattagc 131640
tttataaaaa ctaataaaat attttttcca atcgtcgttt tttgcatcaa taatagatag 131700
gatatctcca ggaatatcat atatgatttc tagcagaagt tctctgatat ctttttcagg 131760
ctcaaagata taaactaact taagcatgtt tttattaacg cgtccttgtt ttatctcaga 131820
gtatatataa ctaagaattt tatcatacat ggtaatggcg tgtctattaa gaacaatatc 131880
cgcatcatta gctatacttt cgtttaacca cacccaatag ctagtcttat ctccgggaaa 131940
gatataatct ctaaacgttt ctgttagatt gctatggact attctaccag tttgttgaaa 132000
tagagtgtat atatcttgtc tatcattatc cgcaaactct atgcctgtat ctttcttgct 132060
taggtaagcc caaataagta atcttataac acttaatttt ttaacaactt tgaagtctgt 132120
tttcttaaac agttttaaaa aaatgatttc catatctttg atgggttccc ctccgtgaaa 132180
taatggatta agaactaagt cataatttac gcctataagt tctcctaatt cgtccatgta 132240
ttcggcaacc tctaaaaatt tttcatcttg tgttagttta tgcacggttt ccttgtacga 132300
ttgtattgcg ttactaaacg ataaataatt cttagtcaat gctttaacgt acaacgtggg 132360
tgcatcaaat gcgtgtcgta aaaccgcttc gtacacttta caacatcgta ccatatgtat 132420
gaaaacaaaa aaattggtat tagctaatac atccttagta ataacgtgtt gaggaaaatt 132480
gctaactaga tacttgagag ctagtacatg attgagggta aaattgttta ctttacttaa 132540
tgtatcatcc tttaaaaaat ctatattatt ttctactgtg tatataagat actttctacg 132600
aataaaatcc atttttatag aagagagtta tctatgatac tacactctta ttctaataat 132660
tatttttact attttatcac taataactat ttatactatc cattattaat ttaacaattt 132720
gaattagtat tggagaaaga tgaaatgcct attcactaac aaagtcaatg cactcttgaa 132780
gatttttacc aataatatgt agcggattct caaagtattt atttaccatt tctttacaca 132840
gtatcttagc cactggccta cgaacaggca tacatggttc aactatatca tgaacaatgt 132900
ttaattttag tgtatccaat ccaataaatc cattttctac atcatcatcg tatctgtacg 132960
ctttgtattc tccactcttg aacaaatctc taaacagtat cttcaacact ttttccacca 133020
attcctggga tgtcacacat tgttctacca tctgtttgat aaagagggta tcttcttcag 133080
attttccctt gattctatac atgggtttta ggttatacct tgcatctatt gtatatttat 133140
cagattcagc tatcaacatt ttatcaactt gtgttccaat tattctccat ttattagatc 133200
tgcattcatt attatagaat ctttcttcca ctaatattct aataagatta agtttgaaag 133260
gagagaagat cttgtgtttg attttactca ttggatgcat tctataacga atgaatctat 133320
gatcatctcc aaacaaattg ggaaacatgt aaagcagcag agcatatacc atcttactcg 133380
gattctgtga tgctttgccg atggctgaaa tgtccgagaa tagtttataa gttttctgat 133440
tatttggaat agatggtgct atatcttcta gagtagtagt cctaatcatt ctcttaaatt 133500
ttatgtatcc tagtttcaat gtctcgtaat gagtttgtgc tgctcttatt gtctgattta 133560
tttcttttac cattttggct ctattctgaa actttatcct cttcttatcc atttttattg 133620
ttgactccgc actatcgatt tgatactttc ttttcagagt aaagctatcg tcattgatca 133680
tcggacactg acttccactc attatgaaat tgtagccgta taaccacaat acaattatta 133740
acatatatat tcacttttgt taatatcaac ataataatga aaaaatataa atgaacaaag 133800
ttaatacata agtgttataa atggaaaatg tatacattag tagttactca tccaatgaac 133860
aaacatcaat ggcggtagcc gctactgata tccgagaatt actatcacaa tatgtggatg 133920
atgccaactt ggaagactta atagaatggg ccatggaaaa atcatcaaag tactacatca 133980
agaatatagg taatacaaaa tctaatatcg aagaaactaa attcgaatca aagaataata 134040
ttggtataga atactcaaag gattccagaa acaaactatc gtatagaaat aaaccgtcta 134100
ttgccacaaa tttggaatat aaaacactat gtgatatgat taagggtact agcggcaccg 134160
aaaaagaatt ccttcgctat ctcttattcg gtataaaatg cattaagaaa ggagtagaat 134220
acaatataga taaaataaag gatgtgagtt acaacgatta ttttaacgtt ctcgacgaga 134280
aatacaatac accgtgtcct aactgtaaaa gtaggaatac tacgccgatg atgattcaaa 134340
ctagagccgc tgacgaacct ccactagtta gacatgcgtg tagagattgc aaacaacact 134400
ttaagcctcc caaatttaga gcatttcgca atcttaatgt tacaacgcaa tcgatacatg 134460
aaaacaagga aataacagag attcttccag ataataatcc atctcctcca gaatctccag 134520
aaccagcatc acctatagat gacgggttaa tcagatccac attcgataga aacgacgaac 134580
caccagagga tgatgaataa aaaaatgata aaataaatta gttttattgc tggttgtgtt 134640
agttctctct aaaaatgtct aagatctata ttgacgagcg ttctgacgca gagattgtgt 134700
gtgcggctat taaaaacatt ggaatcgaag gagctactgc tgcacaacta actagacaac 134760
ttaatatgga gaagcgagaa gttaataaag ctctgtacga tcttcaacgt agtgctatgg 134820
tgtacagctc cgacgatatt cctcctcgtt ggtttatgac aacggaggcg gataagccgg 134880
atgctgatgt tatggctgac gccataatag atgatgtatc ccgcgaaaaa tcaatgagag 134940
aggatcataa gtcttttgat gatgttattc cggctaaaaa aattattgat tggaaagatg 135000
ctaaccctgt caccattatt aatgagtact gccaaataac taagagagat tggtcttttc 135060
gtattgaatc agtggggcct agtaactctc ctacatttta tgcctgtgta gatatcgacg 135120
gaagagtatt cgataaggca gatggaaaat ctaaacgaga tgctaaaaat aatgcagcta 135180
aattggcagt agataaactt cttggttacg tcatcattag attctgattc tagttatcaa 135240
taacagttag tagtttagtt atacattgaa tcatacatat taattttttt attgagatag 135300
attaaaaaat acaaattgta gtactattaa cgcgactagt atattctcta aagatgatat 135360
ctgtcacaga tattcgtaga gcgtttctag acaatgaatg ccatactatc acaaaagcgt 135420
ttggatatct gcacgaggac aaggctatcg cattaattaa aataggattt catcccactt 135480
atctacccaa agtcctttat aataatgttg tagaattcgt tccagaaaaa ctatatctgt 135540
ttaagccaag aactgtagct ccattggatt tgatatctac tataacaaaa ttaaagaacg 135600
tggacaaatt tgcctcacac ataaattatc acaagaatag tatattgata acaggagaca 135660
agtctctaat tgttaaatgt atgccttaca tgattatttc agatgatgat atacgattca 135720
taagagaaca gtttgttggt acaaattcta ttgagtatat tctatccttc atcaacaagg 135780
aaagcatata tagaatgagt taccaatttt cagagaatga aatagtcact atcatcaata 135840
gagatcattt catgtatgaa ccaatatatg aacatcaggt cttagattct gactttctta 135900
aaactatgtt agatagatac ggaatagttc ccattaattc tggtataata gatgaattat 135960
gtccagaagc tataatagag atattaatgg cagtagttcg tcctagggac gctatccgtt 136020
tcttagatat agtgaataag aatcaattga cagaagatag tgtcaaaaac tatatcatta 136080
atgatatcag aagaggtaaa atagattatt atattccata cgttgaagat tttttagaag 136140
atagaactga agacttggga atatatgcga atatattttt tgaggatgct atagatatta 136200
cgaaactaga catcacaaag acagagttgg aacatatatc gaaatacatg aattattaca 136260
ctacttatat agatcacata gttaacatca tcttacaaaa taattatata gatatcttgg 136320
catctataat agattacgtg caagacgtat taacagaaga attatgtatt agaatagttt 136380
gcgaatcaac aaaccctgtt cccgttacat ctcttcctat acattctacg ttagtaatgg 136440
ttatgtgtat acaaatgaaa tatgtcgata tagttgaatt cttagacgag atcgacatcg 136500
atactttaat agaaaaagga gcagatccga taaccgaata cacatttaca acaagatggt 136560
acaataaaca caatgatttg atcactcttt acattaagaa atatggattc tgtccaatga 136620
tgatgaaacg gttaatgttc gaatatccat tgactaaaga agccagtgat catttactta 136680
aaaccatgga tgaaaacagg ggagctatta tgttttttcc gcgtactatt tgcacacttc 136740
cttatctatt atgttgtaat tataaactaa ttcaaaaacc tattccattc aaagaagaaa 136800
atcgtaacat cgtatataag aaaaccaaca gagtattatg ctttgactcg ttggagaact 136860
ctgcgtttaa aagcctcatt aaaatagatt ctattccagg attaaaaact tataacatga 136920
aagacattac atacgaaaag tctaataata taatttgtgt taggtttata cctcaagaat 136980
caattcataa tgaagagcga agaataaaat tacagttatt cgacattgct agattggcat 137040
cctatggact atattatatt ccctctagat atttatcatc gtggacacca gtagtgaaca 137100
tgatagaggg aagagagtac actaatccac aaaaaataga atgtctagtt attttggatt 137160
tattttcaga ggaattcata gaatatcaaa atctgggcaa tgcggtatct aataaatatg 137220
aactggaata tactatatct aattatcaag ctgccataaa ctgcctaatg agcacgttat 137280
taatatatct agttctagga tcaatcagat cgatatcaaa aactgaagat tttgtattat 137340
ctatattaaa tatcttctat aaaggactga aaattaatga attactttct gaaccagtat 137400
caggagtttg tatcgaatta aataaaataa aagatagagc gagctctgga gacagtagtt 137460
ttatatttct taagaaaaac gagttatcaa aaactctatc gctctgtgaa aaagtttgtg 137520
ttgagaccat attagacaat aatcagagtt ttaaatcctc aaaatgaata ggaatcctga 137580
tcagaatact tttcctaata ttacattaaa gattatagaa acctatttag gcagagtacc 137640
tagtgtgaac gaatatcata tgttaaaatt acaagctaga aatattcaga aaataactgt 137700
ttttaacaaa gacatatttg tatctttagt aaaaaagaat aaaaaaagat ttttttccga 137760
tgttgataca tctgcatcag aaataaaaga tcgtatactt agctactttt ctaaacagac 137820
tcaaacatat aatataggta aattatttac gattatagaa ctacaatctg tattagtgac 137880
cacatacacg gacatattag gagttcttac tattaaagct ccaaatgtaa tttcatctaa 137940
aatttcttat aatgtaacat caatggaaga attggcaaga gatatgctaa attctatgaa 138000
cgtcgcagta atagacaagg caaaagtaat gggacgtcat aatgtatctt ccctagtcaa 138060
aaatgttaat aagttgatgg aagaatatct tagacgccat aataaaagtt gtatatgtta 138120
cggatcatat tctctatatc taattaatcc aaatatacgg tacggcgata tagatattct 138180
tcaaactaat tcgaggactt ttcttataga tttggcattt ctaataaaat ttattacggg 138240
aaataatatt atattaagta aaatcccata tcttagaaac tatatggtga taaaagatga 138300
aaacgataat catatcattg atagttttaa tattcgccag gataccatga acgtagttcc 138360
taaaatcttt atagataata tctatatagt ggatccgacg tttcaactat tgaacatgat 138420
aaaaatgttt tctcaaatag atagattgga agatctatcc aaagatcctg aaaagtttaa 138480
tgcgcgtatg gcaaccatgc tagaatacgt tagatataca catggtatag tctttgatgg 138540
taagcgtaat aatatgccga tgaaatgtat catcgatgaa aataatcgca tagttactgt 138600
tactactaaa gactatttta gctttaaaaa atgtctagtg tatctagatg aaaatgtgtt 138660
atcgagtgat atattagatc ttaacgccga cacatcgtgt gatttcgaga gtgttacaaa 138720
ttctgtatat ctaattcatg ataatatcat gtatacatat ttctcaaata ctattctcct 138780
tagtgataag gggaaggtac atgaaataag tgccagaggt ttatgtgcac atatattgtt 138840
gtatcagatg ctgacatctg gagaatacaa acaatgttta tcggatctct taaattcgat 138900
gatgaataga gataaaatac ctatctattc acatactgaa agagataaaa aacctggacg 138960
acacggattt attaatatcg aaaaggatat aattgtattt taggacaaaa gtctagaagc 139020
tacattatcg cgattagccg cgaacatatt ttgtagcatg tccgtcctca taaacggaac 139080
ttgttttcct gggttattca ttctctcgca tctggaaggg gacgaaggtc ttctctcaca 139140
cgcgggtgat gagggtttat ctactttgag tacgcaggaa gctttacttc cttcgcattc 139200
tacagttcta acatcgcata ctttaacggc ttttagaacc agatatcttc cttctttggt 139260
attgatataa aacggagtat gagcagatgc aaaatgagaa ttcatttata gcatagaaaa 139320
aaaacaaaat gaaattctac tatattttta catatatata ttctaaatat gaaagtggtg 139380
attgtgacta gcgtagcatc gcttctagac gcatctattc agtttcaaaa aacggcatgt 139440
aggcatcact gtaattacct atctatgcaa gtagttaaag agatagaaga atttggtact 139500
atcaatgaaa aaaatttgga atttgacact tggaaagacg ttatacaaaa cgatgaaata 139560
gatgcattag tattttatag agtaaaacaa attagtattt ctacgggtgt tctatataaa 139620
tctatgatgc gcaatagaac aaaacctatt tccatgtact ttgtacgtga ttgtctggca 139680
tttgatggag atcctccgtc ttttagaatg acgtcttgca atatcaacgc atacaatcgt 139740
agtaagatta aagatttaat aatcctaatg aatatgaaaa catgtaataa aaaaattatc 139800
ggtgagttta taatagacaa ttttggaagc gtcgatgcat tactatcgat agttaattcc 139860
aatgttacgt ggattacatc agttataaat aatagtaacg gcaggggtat taatatcagg 139920
gtatcaaata ataaaatgtt aactataact agttttcgac gattcgtcaa taaacttaaa 139980
atgtacaaaa ctactaaatg cgcttctcaa ttggataatc tatgtaccga gatgaacaaa 140040
atggatatta tagacaaaaa atgaaacgta atgaggagta ttgcggggct acataaatta 140100
aaaatggaaa tttttaatgt agaagaattg ataaatatga aaccttttaa gaatatgaat 140160
aaaataacaa ttaatcaaaa tgataattgt atattagcaa atcgatgctt tgttaaaata 140220
gatactccta gatacatacc atcgacatcc attagcagtt ctaatatcat cagaatacgg 140280
aatcatgatt ttacattatc tgaattattg tattcaccgt ttcattttca acagcctcag 140340
tttcaatatc tccttcctgg gtttgtatta acgtgtattg ataaagtttc gaaacagcaa 140400
aaaaaatgta aatattgtat ctctaatcgt ggagatgatg atagtttaag cattaatcta 140460
tttattccga ctattaacaa gtctatatat attattatcg gtttacggat gaaaaatttt 140520
tggaagccta aattcgaaat agaataatgt ttttatatta tacatgttct aaaagaataa 140580
tcgatacagt ttaagtgaaa gctagagagg ggtttttaaa tggtcatcgg tttagtcata 140640
ttcgtgtctg tggcggccgc catcgtcggt gtgttgtcta acgtattgga catgtttatg 140700
tacgtagaag aaaataatga agaggatgct agaatcaagg aggagcaaga actactgttg 140760
ctatattgat acataattga aaatctacca acttaaatac accgcctata aatttacaat 140820
gaaacacaga ttgtattctg aaggattgag tattagtaat gatttaaact cgataatcgg 140880
tcaacaatct acaatggata cggatataga aatagacgaa gatgacatca tggaacttct 140940
taatatattg actgagttag gttgtgatgt cgactttgat gaaaatttta gcgatatagc 141000
cgatgatatt ctagaatcgt tgatagaaca ggatgtataa gtttttatgt taactaaatg 141060
tggccatttg caccggtacc tgcgggagca aaatgtaggc tggtagaaac actaccagaa 141120
aatatggatt ttagatccga tcatttaaca acatttgaat gttttaacga aattatcact 141180
ctagctaaga aatatatata catagcatct ttttgttgta atcctctgag tacgactagg 141240
ggagcgctta tttttgataa actaaaagag gcatctgaaa aagggattaa aataatagtt 141300
ttgctagatg aacgagggaa aagaaatctg ggagagctac aaagtcactg cccggatata 141360
aattttataa ccgttaatat agataaaaaa aataatgtgg gactactact cggttgtttt 141420
tgggtgtcag ataatgaaag atgttatgta ggaaacgcgt catttactgg aggatctata 141480
catacgatta aaacgttagg tgtatattct gattatcccc cgctggccac agatcttcgt 141540
agaagatttg atacttttaa agcctttaat agcgcaaaaa attcatggtt gaatttatgc 141600
tctgcggctt gttgtctgcc agttagcact gcgtatcata ttaagaatcc tataggtgga 141660
gtgttcttta ctgattctcc ggaacaccta ttgggatatt ctagagatct agacactgat 141720
gtagttattg ataaactcaa gtcggctaag actagtatag atattgaaca tttggccata 141780
gttcccacta cacgtgtcga cggtaatagc tactattggc ccgacattta caactccatt 141840
atagaagcag ccattaatag aggagttaag atcagacttc tagttggtaa ttgggataag 141900
aacgacgtat attctatggc aaccgccaga agtctagacg cgttgtgtgt tcaaaatgat 141960
ctatctgtga aggttttcac tattcagaat aatacaaaat tgttgatagt cgacgacgaa 142020
tatgttcata tcacttcggc aaatttcgac ggaacccatt accaaaatca cggattcgtc 142080
agttttaata gtatagataa acagcttgta agcgaggcta aaaaaatatt tgagagagat 142140
tgggtatcta gccacagtaa atcgttaaaa atttaaaaaa tagaaacgta tagaacgcca 142200
tcatgttaaa caggatacaa accttgatga aaacagctaa caattatgaa actattgaga 142260
tattgcgtaa ctatttaaga ctgtatatca ttttggcacg aaatgaagaa ggtcatggta 142320
tactaatata cgatgataac atagatagtg ttatgtcgat gatgaatatt acaatattag 142380
aagttatagg attgacgact cattgcacaa aattaagatc atcgcctcca attcctatgt 142440
ctagattgtt tatggacgaa atagatcatg agtcatatta ttctccaaaa acttcagatt 142500
atccgttgat cgatattata cgaaagcgtt ctcacgaaca gggagatata gcactggctt 142560
tagaacgata cggtattgag aatacagatt ccatatcaga aattaatgaa tggttgtcgt 142620
caaaaggttt agcatgttat agatttgtaa aatttaacga ttataggaaa cagatgtatc 142680
gtaagttctc taggtgtact atagttgaca gtatgataat agggcatata ggtcatcatt 142740
atatttggat taaaaattta gaaacatata cgcgtcccga aattgatgtg ttaccgtttg 142800
atattaaata catatctaga gatgaattgt gggcgcgaat ttcttcctcg ttagatcaaa 142860
cacatataaa aaccatcgcc gtatcagttt atggagctat tactgataat ggaccaatac 142920
catatatgat atccacgtat ccgggtaata cctttgttaa ctttaacagt gtaaaaaatc 142980
taattttaaa tttcttagat tggattaaag atattatgac tagtacacga actatcattc 143040
tagtaggtta catga 143055
<210> 70
<211> 31690
<212> DNA
<213> 人工序列
<220>
<223> 病毒构建体
<400> 70
tggctggaat cattaacaag aaaataaaaa acaatttcaa agttaaacat aactggtatt 60
acgatttcca tttctttgtt catactttat taaaaacata tccagaaatc gaaaaagata 120
tcgaatttag tacggcattg gaagaattca tcatgtgtac caaaacagac tgtgataaat 180
atagattaaa ggtttccatt cttcacccaa ttagtttctt ggaaaaattt attatgagag 240
acattttctc agactggata aatggcggaa actaaagagt ttaaaacttt gtataatctt 300
tttatagata gttatttaca aaaattagct caacattcta tccctactaa tgtcacttgt 360
gctattcata taggagaggt tataggacag tttaaaaatt gcgcgctccg aataactaac 420
aaatgcatga gtaattctcg acttagtttc acactcatgg ttgaatcatt tattgaagtg 480
atttcattgc ttccggaaaa ggatagaaga gctatcgctg aggaaatagg aatagatcta 540
gacgatgtac ctagtgcggt atccaagcta gaaaagaact gtaatgcgta tgcggaggtt 600
aataatatta tagatataca gaaattggat atcggagaat gttcggctcc gcccggtcaa 660
catatgcttt tacagatagt taatacagga tccgcggaag caaattgtgg tttacagaca 720
attgttaagt ccttaaataa aatatacgtt ccacctatta tcgaaaaccg attgccgtat 780
tacgatccgt ggtttctagt gggtgtagca attattctag ttatttttac tgtagctatt 840
tgttctatta gacgaaatct ggctcttaaa tacagatacg gaacgttttt atacgtttaa 900
ttaataaaaa aatttaatta caaggtataa aatagtactc catctacgca atcgcgataa 960
tggagggatc taaacgcaga cacgacagtc ggcgactaca acaagaacag gagcagcctc 1020
gtccacgtac accgccatca tatgaagaaa ttgcaaaata tggacactca tttaacgtga 1080
aaagatttac gaatgaagaa atgtgtctta agaatgatta tccacgaatt atatcatata 1140
atcctccacc aaaatagagt atatatatat catcatttca tgatgtatac tactgacata 1200
gtttcaatgt gaacttttca ctttcttgcc ggttatgaag aatattttta ttttaatggt 1260
cattactaat cgtatattat aattgaaaat ggattagttt aatatgacgc tcgtcatggg 1320
atcctgctgt ggtagattct gtgacgctaa gaataagaat aagaaggaag atgtagaaga 1380
gggaagagaa ggatgttaca attataagaa ccttaatgat ctggatgaat ccgaagcacg 1440
tgtagaattt ggaccattat atatgataaa tgaagaaaaa tcagacataa atacattgga 1500
tataaaaaga agatatagac acacgataga gtctgtatat ttctaaaagt ttttataaaa 1560
aatgagtaaa atactcacgt ttgttaaaaa taagataatt gacttgatta ataatgacca 1620
aattaaatat tctagagtta taatgataga agagtccgat agtcttttac cggttgatga 1680
ggtgcatgct aaccacggat ttgactgtgt ggagatgata gatgaaaata taagcaatga 1740
gaatatcgaa cagtataaaa ccgaatcttt ttttacaata aattgaaatc aaaacattta 1800
ttaaaccgca tcaagatggg tacgaacggc gttagagtat ttgtcatttt atatttgttg 1860
gctgtatgcg gatgtatcga atacgacgta gacgataatg tacatatttg tacccacact 1920
aacgtgtcac atattaatca cactagttgg tattataatg ataaggttat agcgctagcc 1980
accgaggata aaacttctgg ttatatatca tcattcataa aacgcgttaa tatctcatta 2040
acttgtttaa atatatcgag tttgaggtac gaagattctg gtacatacaa aggagtatcg 2100
catctaaaag atggagtcat cgttacgact actatgaata tatctgtaaa ggctaatatc 2160
attgacttga ctggtagagt gcgttatcta accagaaatt attgcgaagt taaaatacga 2220
tgcgaaataa catctttcgc gcttaatggt tctactacac caccacatat gatattagga 2280
acagtagata aatggaaata tcttccattt cctacagatg attatagata cgtaggggaa 2340
ctgaaaagat atatatctgg aaacccatat ccaacagagt cgctagcgtt agaaattagc 2400
tcgacgttta atcggtttac tatcgttaaa aatttgaacg atgacgagtt ttcttgttat 2460
ctgtttccat aaaatgttga acgcgcgcca tatttgtgaa tccgaatggg aggctttaaa 2520
taataataac gataattcat cctccatgcc cgcttcccac aacaatctcg caaacgattt 2580
atctagtatg atgtcacaat tacaaaatga taataacgat aatttaatta tgatagtact 2640
aataacaatg ctatcaataa tacttgtaat tattgtagtg attgcggcga tatcgatgta 2700
caaaagatcc aagtacaggc atatagataa ctgaaaaaaa atttattgtt attgttaatt 2760
tagttatgga acccatcctt gcaccaaatc caaatagatt tgttattttc ccaatccaat 2820
atcatgacat ctggaacatg tataaaaagg cagaggcatc attttggaca gtggaagaag 2880
tagatatatc taaagatatc aatgattgga ataaactaac accagacgaa aaatatttta 2940
taaaacatgt attggcgttt tttgcagcca gtgacggaat agtgaatgaa aatttggcgg 3000
aacgattttg tacagaagta cagattaccg aggctagatg tttctacgga tttcagatgg 3060
ccattgaaaa cattcattcg gaaatgtata gtcttttgat cgatacttat gttaaagata 3120
gtaatgaaaa aaactatctc tttaatgcca tagaaacgat gccttgtgta aaaaagaagg 3180
ccgattgggc tcaaaagtgg atacatgaca gcgccggtta tggagagaga cttattgcct 3240
ttgctgcagt agaaggaatc ttcttttccg gatcattcgc ttccatattt tggcttaaaa 3300
agcgtggcct aatgcccgga ctcacgtttt ccaacgaatt gattagtaga gacgagggtc 3360
tacactgcga cttcgcatgt ttgatgttta aacatttatt gcatccaccg agtgaagaaa 3420
ccgttagatc tattataaca gatgcggtat ccattgaaca agaatttctt actgcggctc 3480
ttccagttaa acttatagga atgaattgtg aaatgatgaa aacatatata gaattcgtcg 3540
cggatagatt gatttctgaa ttgggattta aaaaaattta taatgttacc aatccgtttg 3600
atttcatgga aaatatatca ttagaaggta aaactaattt tttcgaaaaa cgtgtgggtg 3660
aataccaaaa aatgggagtt atgtctcaag aagataatca tttttcttta gatgttgact 3720
tttaaagaaa cataaatgcc gatatttgtc aatactgtgt actgtaagaa tatattagca 3780
ttgtctatga ctaagaaatt caaaacaatt attgatgcca taggaggcaa tataatagtc 3840
aattctacga tattgaaaaa gttatctcct tactttcgca cacatttacg tcaaaaatac 3900
acgaaaaata aagatccagt tacgagggtt tgtctagacc ttgacattca cagtctaact 3960
tctatagtta tttactcata tactggaaag gtatatatag atagtcataa cgtcgtcaat 4020
ttattacgtg cttctatatt aacctctgta gaatttatca tctacacttg tataaacttt 4080
atcctacgag attttagaaa ggaatattgt gtcgagtgtt acatgatggg tatagaatac 4140
ggactatcca atctcttatg tcatactaaa aactttattg ccaaacactt tttggaactg 4200
gaagatgaca tcatagacaa ttttgattat ctatctatga aacttattct agaaagcgat 4260
gaactaaatg ttccagatga ggattttgtc attaagtggt atataaagcg aagaaataaa 4320
ttaggaaatc tgctactcct tatcaaaaat gtaatcaggt caaattatct ttctcccaga 4380
ggtataaata atgtaaaatg gatactagac tgtaccaaaa tatttcattg tgataaacaa 4440
ccacgcaaat catacaagta tccattcata gagtatccta tgaacatgga tcaaattata 4500
gatatattcc atatgtgtac aagtactcat gttggagaag tagtatatct catcggtgga 4560
tggatgaaca atgaaataca taacaatgct atagctgtaa attatatatc aaacaattgg 4620
attccaattc ctccgatgaa tagccccaga ctgtatgcta gcgggatacc cgctaacaat 4680
aaattatacg tagtaggagg tctaccaaat cccacatctg ttgagcgttg gttccacggg 4740
gatgctgctt gggttaatat gccgagtctt ctgaaaccta gatgtaatcc agcagtggca 4800
tccataaaca atgttatata cgtaatggga ggacattctg aaactgatac aactacagaa 4860
tatttgctac ccaatcatga tcagtggcag tttggaccat ccacttatta tcctcattat 4920
aaatcatgcg cgttagtgtt cggtagaagg ttattcttgg ttggtagaaa tgcggaattt 4980
tattgtgaat ccagcaatac atggactctg atagatgatc ctatttatcc gagggataat 5040
ccagaattga tcatagtgga taataaactg ctattgatag gaggatttta tcgtgaatcg 5100
tatatagata ctatagaagt gtacaatcat cacacttatt catggaatat atgggatggt 5160
aaataatttt gaaataaaat attagtttta tgttcaacat gaatattaac tcaccagtta 5220
gatttgttaa ggaaactaac agagctaaat ctcctactag gcaatcgccg ggtgctgccg 5280
gatatgattt gtatagcgct tacgattata ctatccctcc aggagaaaga cagttaatta 5340
agacagatat tagtatgtcc atgcctaaga tttgctatgg tagaatagct cctaggtctg 5400
gtctgtcact aaaaggcatt gatataggag gtggtgtaat agacgaagat tataggggaa 5460
acataggagt cattcttatt aataatggaa aatgtacgtt taatgtaaat actggagata 5520
gaatagctca gctaatctat caacgtatat attatccaga actggaagaa gtacaatctc 5580
tagatagtac aaatagagga gatcaagggt ttggatcaac aggacttaga taataaacaa 5640
tagtatgttg tcgatgttta tgtgtaataa tatcgtagat tatgtagatg gtatagtaca 5700
ggatatagaa gatgaggcta gcaataatgt tgatcacgac tatgtatatc cacttccaga 5760
aaatatggta tatagatttg acaagtccac taacatactc gattatctat caacggaacg 5820
ggaccatgta atgatggctg ttcgatacta tatgagtaaa caacgtttag acgacttgta 5880
tagacagttg cccacaaaga ctagatcata tatagatatt atcaacatat attgtgataa 5940
agttagtaat gattataata gggacatgaa tatcatgtat gatatggcat ctacaaaatc 6000
atttacagtt tatgacataa ataacgaagt taatactata ctaatggata acaaggggtt 6060
gggtgtaaga ttggcgacaa tttcattcat aaccgaattg ggtagacgat gtatgaatcc 6120
agtaaaaact ataaaaatgt ttactctact atcgcatact atatgcgatg attgttttgt 6180
agattatata acggacattt caccaccaga taataccatc cctaacacta gcacgcgtga 6240
atatctaaag cttattggca tcacagctat catgtttgct acatataaaa ctctcaaata 6300
catgatagga taattttttt taacacggat atagaatgct aacgtaataa ttatgcgtta 6360
tgaagacccc tatatcatca attcaatttt ttttctagaa aaagtatcaa gatgtttata 6420
ttatcgtcgg tcatcagatc tgtaatgttt cccaatgatt ggaatctaga ttctgaaatc 6480
tttttgtatc cccaatgttc agtgatttta gcacatattc ctatggtagc gaataaactt 6540
tcctgatcat ggacttttgt aaaattgatg tagtggtctc cttcgctcat agcttcgaca 6600
atctcattaa ttttatcaat accatagtac cgtatagcga catcatcgaa cttcatcaat 6660
tccttgtaca gtcttccaca actggtaata tctttgttaa acactataac atgatttctc 6720
cacgtaatat attcatctat tagatcgatg atggagtcgc gactacatat tttatcatca 6780
tccacaaagt aaaaaacagc atcctcataa tctaatttag tcgccatgac tatctcacaa 6840
aagacagtag ccgtctcctt cctctatatt gatttaattg tatgttttta caattatcaa 6900
taaaacataa aaataatatg atcatcaaac gaactgttaa tattgatagt tatataacgt 6960
gaatcatgag tgcaaactgt atgttcaatc tggacaatga ttacatatcc taaggcatta 7020
gtattcataa gtcatggagc tggtaaacat tctggacgtt atgacgaact agctgaaaac 7080
atatcatcgt taggaatttt agtattctca catgatcata ttggacatgg aagaagtaat 7140
ggtgaaaaaa tgatgattga tgactttggt acagcacgtg gtaactatta aatctactta 7200
ttgggtcatt ccatgggagc aacaatttct atactagcct cttacgataa tccaaacttg 7260
tttacagcaa tgattctaat gtctcctcta gttaatgcag atgctgtttc aagactgaat 7320
ctgctagctg ccaaacttat gggaaccatc acaccaaatg cgccagtcgg aaagctatgt 7380
ccagaatcag tatctagaga tatggataaa gtttataaat accaatacga cccattaatc 7440
aatcatgaaa aaattaaggc tggatttgct agtcaggtct tgaaggctac caacaaggtt 7500
agaaaaataa tttccaagat taacaccccg actctcatac tccagggaac aaacaatgag 7560
attagcgatg ttttaggtgc atattatttc atgcaacatg caaattgtaa tagagaaata 7620
aaaatttatg aaggtgccaa acatcatctt cataaggaaa cggatgaagt taaaaaatca 7680
gtcatgaaag aaatagaaac ttggattttt aatagagtga agtgatatag gattattctt 7740
ttaacaaata aaatgaatcc ggataataca atcgcagtga ttacagagac tattcctata 7800
ggtatgcaat ttgataaagt atatttgtct acatttaaca tgtggaggga aattctatcc 7860
aataccacaa aaacactaga tatatcatct ttttattgga gtttatcgga tgaagtgggt 7920
acgaatttcg gcacgataat attaaacgag attgtacaat tacccaaaag aggagtacga 7980
gttagagtag ccgtcaataa atctaacaaa ccattaaagg atgttgaaag actacaaatg 8040
gccggagttg aagtacgata catagatatt acaaatatcc taggaggagt tcttcataca 8100
aaattttgga tatctgataa tacacatatt tatttaggaa gcgctaacat ggattggaga 8160
tcactaactc aggtcaaaga attgggtatt gcgatcttca ataataggaa cttggcagcg 8220
gatctcactc aaatttttga ggtatactgg tatcttggag ttaacaatct accatataat 8280
tggaaaaact tttatccgtc gtattataat acagatcatc ctcttagtat taacgtaagt 8340
ggtgttccac actctgtatt tattgcttct gcaccgcaac aactatgtac tatggaaaga 8400
accaatgatt taaccgcttt attgtcatgt attagaaatg cgagtaaatt cgtttatgta 8460
tctgttatga actttatccc tattatttat tcgaaggcgg gtaaaatttt gttttggcct 8520
tatatagaag atgaattaag aagatccgct atagacagac aagtatccgt taagctattg 8580
attagttgct ggcaacgatc ttcgtttatc atgagaaact ttttaagatc tatcgctatg 8640
ctaaaatcta aaaacataaa tatagaagta aagctattta ttgtaccaga tgctgatcct 8700
cccattccgt attctagggt aaaccatgcc aaatatatgg taaccgataa aacggcctat 8760
ataggtacct caaattggac aggaaattac tttacggata catgtggagc atctattaat 8820
attacaccgg atgatggatt aggtcttcgt caacaattag aagatatttt tatgcgtgat 8880
tggaattcaa aatacagcta tgaattgtac gatactagtc ctactaaaag gtgtaaacta 8940
ttaaaaaata tgaaacaatg tacaaatgat atatactgcg atgagataca accggaaaaa 9000
gaaattcctg aatattctct tgaataaaat agatataaaa acataatttt tatcccaatt 9060
tacgagcccg ttaacaagat gcttgcattt tgttattcgt tgcccaatgc gggcgatgta 9120
ataaagggca gagtatacga gaatgattat gctctatata tttatctttt tgactatcct 9180
cactctgaag ctatcttggc agagagtgtt aagatgcata tggatagata tgttgaatat 9240
agggataaac tggtagggaa aactgtaaaa gttaaagtga ttagagttga ttatacaaaa 9300
ggatatatag atgtcaatta caaaaggatg tgtagacatc aataattttt ataccgaaca 9360
taaaaataag gttaattatt aataccataa aatcatgatt gcgttattga tactatcgtt 9420
aacgtgttca gcgtctacct atcgtctaca aggatttacc aatgccggta tagtagcgta 9480
taaaaatatt caagatgata atattgtctt ctcaccgttt ggttattcgt tttctatgtt 9540
tatgtcgcta ttgcctgcat caggtaatac tagaatagaa ttattgaaga ctatggattt 9600
gagaaaaaga gatctgggtc cagcatttac agaattaata tcaggattag ctaagctgaa 9660
aacatctaaa tatacgtaca ctgatctaac ttatcaaagt ttcgtagata atactgtgtg 9720
tattaaaccg tcgtattatc aacaatatca tagattcggc ctatatagat taaactttag 9780
acgagatgcg gttaataaaa ttaattctat agtagaacgt agatccggta tgtctaatgt 9840
agtagattct aatatgctcg acaataatac tctatgggca atcattaata ctatatattt 9900
taaaggtata tggcaatatc cgtttgatat cactaaaaca cgcaatgcta gttttactaa 9960
taagtacggt acgaaaacgg ttcccatgat gaacgtagtt actaaattgc aaggaaatac 10020
aatcacaatc gatgacaaag aatatgacat ggtacgcctt ccgtataagg atgctaatat 10080
tagtatgtac ctggcaatag gtgataatat gacccatttc acagattcta ttacggctgc 10140
aaaattagac tattggtcgt ttcaattagg gaataaagtg tacaatctta aactccctaa 10200
attttctatc gaaaataaga gggatattaa gtcgatagcc gaaatgatgg ctcctagtat 10260
gtttaatcca gataatgcgt cgtttaaaca tatgactagg gacccattat atatttataa 10320
aatgtttcag aatgcaaaga tagatgtcga cgaacaagga actgtagcag aggcatctac 10380
tattatggta gctacggcga gatcatctcc tgaaaaactg gaatttaata caccatttgt 10440
gttcatcatc agacatgata ttactggatt tatattgttt atgggtaagg tagaatctcc 10500
ttaatatggg tacggtgtaa ggaatcatta ttttatttat attgatgggt acgtgaaatc 10560
tgaattttct taataaatat tatttttatt aaatgtgtat atgttgtttt gcgatagcca 10620
tgtatctact aatcagatct attagagata ttattaattc tggtgcaata tgacaaaaat 10680
tatacactaa ttagcgtctc gtttcagaca tggatctgtc acgaattaat acttggaagt 10740
ctaagcagct gaaaagcttt ctctctagca aagatgcatt taaggcggat gtccatggac 10800
atagtgcctt gtattatgca atagctgata ataacgtgcg tctagtatgt acgttgttga 10860
acgctggagc attgaaaaat cttctagaga atgaatttcc attacatcag gcagccacat 10920
tggaagatac caaaatagta aagattttgc tattcagtgg actggatgat tcgtattcga 10980
tgattatttt taacaaaata acataaaaat aatatatttt tttaggatgc gatcatgacg 11040
tcctctgcaa tggataacaa tgaacctaaa gtactagaaa tggtatatga tgctacaatt 11100
ttacccgaag gtagtagcat ggattgtata aacagacaca tcaatatgtg tatacaacgc 11160
acctatagtt ctagtataat tgccatattg gatagattcc taatgatgaa caaggatgaa 11220
ctaaataata cacagtgtca tataattaaa gaatttatga catacgaaca aatggcgatt 11280
gaccattatg gagaatatgt aaacgctatt ctatatcaaa ttcgtaaaag acctaatcaa 11340
catcacacca ttaatctgtt taaaaaaata aaaagaaccc ggtatgacac ttttaaagtg 11400
gatcccgtag aattcgtaaa aaaagttatc ggatttgtat ctatcttgaa caaatataaa 11460
ccggtttata gttacgtcct gtacgagaac gtcctgtacg atgagttcaa atgtttcatt 11520
gactacgtgg aaactaagta tttctaaaat taatgatgca ttaatttttg tattgattct 11580
caatcctaaa aactaaaata tgaataagta ttaaacatag cggtgtacta attgatttaa 11640
cataaaaaat agttgttaac taatcatgag gactctactt attagatata ttctttggag 11700
aaatgacaac gatcaaacct attataatga tgattttaaa aagcttatgt tgttggatga 11760
attggtagat gacggcgatg tatgtacatt gattaagaac atgagaatga cgctgtccga 11820
cggtccattg ctagatagat tgaatcaacc agttaataat atagaagacg ctaagcgaat 11880
gatcgctatt agtgccaaag tggctagaga cattggtgaa cgttcagaaa ttagatatag 11940
agaaaatagc tccagaatac agacaatgct tacaggatct ataccatatg aaaattacgc 12000
gtcctagaca ctttgataac tagttatgtc attctcgttt ttatagagtg tactcgttac 12060
taaatattaa ttaatgaaat aataacaatt attaagacat actattcatc caacttaaac 12120
aacgaaaaac atttttcatt aagtttatca tgaatgcgta taataaagcc gattcgtttt 12180
ctttagagtc tgattctatc aaagatgtta tacacgatta tatttgttgg ctcagtatga 12240
ctgatgaaat gagaccatct atcggaaacg tctttaaagc gatggaaacg tttaagatag 12300
acgcggttag atattacgat ggtaacatat acgatttagc taaagatata aatgcgatgt 12360
cattcgacag ttttataaga tctctacaaa atatctcttc aaagaaagat aaactcactg 12420
tttatggaac catgggactg ctgtctattg tcgtagatat taacaaaggt tgtgatatat 12480
ccaatatcaa gttcgctgcc ggaataatca ttttaatgga gtatattttt gatgacacgg 12540
atatgtctca tcttaaagta gcactctatc gtagaataca gagacgtgat gatgtagata 12600
gatatttttt tttcctaaac tgatttctct gtttaaattc gtagcgatat ataaaacaac 12660
atgtaattaa ttaataaact ttaagacatg tgtgttatac taagatggtt ggcttattcc 12720
atagtagctt gtggaattta taaacttatg atagtaaaac tagtacccaa tatgtaaaga 12780
tgaaaaagta aattactatt aacgccgtcg gtattcgttc atccattcag tatgggtata 12840
cagcacgaat tcgacatcat tattaatgga gatatcgcgt tgagaaattt acagttacat 12900
aaaggggata actacggatg caaactaaaa attatttcga atgattacaa gaaattaaag 12960
tttagattca ttatacgccc agattggtcg gaaatcgacg aggtcaaagg attaaccgta 13020
tttgcaaaca actatgcggt gaaagttaat aaggtagatg acacgttcta ttacgtaata 13080
tatgaggctg taatacatct gtataacaaa aaaacagaga tattgattta ttctgatgat 13140
gagaacgaac tctttaaaca ctattaccca tacatcagtc taaatatgat tagtaaaaag 13200
tataaagtta aagaagaaaa ctactcatcc ccgtatatag aacatccgtt aatcccgtat 13260
agagattatg agtccatgga ttaatatgag tatagtgtta aatgacactt actaaatagc 13320
caaggtgatt attcgtattt ttttaaggag taaccatgtc cgcaattaga tttattgcat 13380
gtctatatct catttccatc ttcggaaatt gtcatgagga tccatattat caaccatttg 13440
ataaattaaa cattactcta gatatataca cttatgagga tctagtacca tacaccgtag 13500
acaatgacac aacttctttc gttaagatat actttaaaaa tttttggatt acggttatga 13560
ctaaatggtg tgctccgttt attgataccg ttagcgtata cacatctcat gataatctga 13620
atatacaatt ttatagtagg gacgaatatg atacacaaag cgaggataaa atttgtacca 13680
ttgatgttaa agcacgatgc aaacatctaa caaaacgaga agttacagta caacaagaag 13740
cctacagata ttcattatct tctgacctat cgtgttttga ttctatagat ctagagattg 13800
atcttattga aactaatagt actgacacta cagtactgaa atcatatgag ctcatgcttc 13860
ccaaacgtgc taaatccata cataactgaa atgaaagaaa ccaaaaaatg cgatagcatc 13920
aacaaccaat catggttaac gataagatac tctatgatag ttgtaaaaca tttaacatcg 13980
atgccagcag tgcacaatca ttgatagaaa gtggtgcaaa tccattatat gagtatgatg 14040
gtgaaactcc attaaaggca tacgttacca agaaaaataa taatatcaaa aacgatgttg 14100
tgattttgtt attgtcgtca gtcgactata aaaatatcaa tgattttgat atactcgaat 14160
atctatgttc tgataacatc gatatagact tattgaaatt actaatttcg aaaggtatag 14220
aaataaatag tatcaaaaat ggtattaata ttgtagaaaa atacgctaca acatcaaatc 14280
ccaatgtaga tgtgtttaaa ctattattgg ataaaggaat acctacatgt agcaacatac 14340
agtatggata caagatcaaa atagaacaga ttagacgtgc tggtgaatat tataattggg 14400
atgatgaatt agacaattac gattacgact acaccactga ttatgatgat agaatgggta 14460
aaacagttct ctattattat attattacta ggtcacaaga tggttatgct acatctttgg 14520
acgtaataaa ctatttaatt tcacacgaaa aagagatgcg ttattatact tatcgtgaac 14580
ataccacact ctattattat cttgacaaat gcgatattaa acgggaaata tttgacgcgt 14640
tattcgatag taactatagt ggtcatgaac taatgaatat tctatctaac tatttacgta 14700
aacagtttag gaagaaaaat cacaaaatcg ataattatat agttgatcaa ctattattcg 14760
accgtgatac gttttatatt ttagaattgt gtaatagttt acgtaataat atccacaatt 14820
cttaaaagat atacagattc tatacaagat ctattgttag aatatgtatc ttatcataca 14880
gtatacatca atgttattaa atgtatgatt gatgaaggag ctacattata tagatttaag 14940
catataaata aatattttca aaaatttggc aatagagatc ctaaagttgt cgagtatatt 15000
ttaaaaaatg gaaacttagt tgtagataat gacaatgatg ataacctaat aaatattatg 15060
ccattattcc ctaccttctc tatgcgtgag ttggatgtgt tatcgatact aaaactttgt 15120
aagccgtata ttgatgatat aaacatggat gtagtatact ttatcattgt attaagtcgc 15180
atagtgtcag cctagtagaa tggttaatag ataatggcgc agacattaat ataataacaa 15240
aatatgggtt tacatgtatt actatttgtg ttatactggc agataaatat atcccagaaa 15300
tagcagaatt atatattaag atattggaaa ttattctgag taaattacca accatcgaat 15360
gtattaagaa aacagttgat tacctagacg atcacaggta cttattcata ggtggtaata 15420
ataaatcgtt actgaaaata tgtatcaagt acttcatatt agtcgattat aagtacacat 15480
gtagcatgta tccatcatat atagaattta taaccgactg cgaaaaagaa attgcggata 15540
tgcgtcaaat taaaataaat ggtacggaca tgcttacagt gatgtacatg ttaaataaac 15600
ctacaaagaa acgatatgtt aataatccga tatttacaga ttgggctaat aagcaatata 15660
agttttataa tcaaataata tataatgcta ataagttaat agaacaaagt aagaaaatag 15720
acgacatgat agaggaggta tccattgacg ataatcgttt atcaacacta ccgttagaaa 15780
ttagacattt gattttctcg tacgcgttcc tataaaaata gaaactataa tcatataata 15840
gtgtaggttg gtagtattgc tcttgtgact agagacttta gttaaggtac tgtaaaaata 15900
gaaactataa tcatataata gtgtaggttg gtagtagggt actcgtgatt aattttattg 15960
ttaaacttgt ccttaagtct tattaatatg tcttctaaag ggggtagtgg cggcatgtgg 16020
agtgtcttta tccatggaca tgatggtagt aataaaggat ctaaaactta tacatctggt 16080
ggcggtggaa tgtggggagg aggatcgtcc agtggtgtaa aaagtggggt taacggaggt 16140
gtaaaatctg gaactggtaa aatttaaaca ctaaattatt tttattaata attgtacaag 16200
tttttgacat gatatttaat gacattagtt gtgtgggtgt atagagttca cagtagctca 16260
ttcagtcaaa atgtttgact atttggaaaa tgaggaggtg gctctcgatg aacttaaaca 16320
gatgttgaga gacagagatc ctaatgatac caggaaccaa ttcaagaata atgctttaca 16380
cgcatacctt tttaatgagc attgtaataa tgtcgaggtc gtcaaactac tactagacag 16440
tggcactaat ccattacgca aaaattggag acagctaccc cattagaaga atacacaaat 16500
agtagacatg ttaaagttaa aggatatagc gatggctcta ctagaagcca ctggatttag 16560
caacataaat gactttaata tattcagcta tatgaaatcc aaaaatgtag acgttgactt 16620
gataaaggtg ttggtagaac atggatttga cttgagtgtt aaatgtgaaa accatcgttc 16680
agttatagaa aattatgtaa tgacagatga tcctgttcct gaaattattg atttgttcat 16740
agaaaatggc tgcagtgttc tttatgagga cgagtactga tacgcgtatg atgattatca 16800
actacgaaat tgcggtaccg tattgcatct gtatatcatc tctcatctgt attcagagtc 16860
ggatacgaga gcatatgtgc gtccggaagt tgttaaatgt ctaattaatc acggaatcaa 16920
gccgtctttt atagataaaa actattgtac agctcttcaa tattatatta agtcatctca 16980
tatagatata gacatcgtta aattgttaat gaaaggaata gataacacgg cttattcata 17040
tatagacgat ctaacatgtt gcactcgagt aattatggct gattatctaa atagtgatta 17100
tagatacaat aaagatgtag atttggtcaa attgtttttg gaaaatggaa agccgcacgg 17160
aataatgtgt agtattgtac cactatggag aaatgataag gaaaccatct ttttgatatt 17220
gaaaacaatg aactcggatg tcctccaaca tatactaatt gagtatatga cattcggcga 17280
tatccctcta gtggaatatg gaactgtggt aaataaagag gctatacacg gatactttag 17340
aaatattaat attgattctt acacgatgaa atatctacta aaaaaggaag ggagatgcca 17400
tcaattatct cgatgatgga gagatcccga ttggacacct atgtaaatcc aactatgaat 17460
gttataattt ttacacttat acatacaaaa agggtctttg tgacatgtct tatgcttgcc 17520
caattcttag tactataaac atttgcctac cttatcttaa agacattaac atgattgaca 17580
aacgaggaga aacacttctt cacaaggctg ttagatataa taaacaatct ctagtatctt 17640
tactgctaga atccggttca gatgtcaaca ttagatcaaa taacggatat acatgtatag 17700
ccattgccat caacgaatct aaaaacattg aactgctgaa aatgctatta tgtcataaac 17760
ctacattaga ttgtgtgatt gattcattga gagaaatatc taacatcgta gataacgact 17820
atgctataaa acaatgtatt aaatatgcca tgattataga tgactgtaca tcgtctaaga 17880
ttccagagtc cataagtcaa cgctataatg attatataga tctttgcaat taagaattga 17940
atgagatgaa aaaaataatg gtaggtggta atactatgtt ctcattaata tttactgatc 18000
atggagctaa aattattcat agatatgtca ataatccaga attacgtgag tattatgagt 18060
taaaacaaaa taaaatatat gtggaagcat atgatattat ttccaacgca atagtgaaac 18120
atgatagaat acataaaacc atagaatcag ttgatgataa tacctacatt tctaatcttc 18180
cgtataccat caaatacaaa atattcgagc aacaataagt attttttata cctttaaaat 18240
tgataaataa attttttcta gtgatatttt ggcaagatga aaatcctatt tctcatcgct 18300
ttcatgtatg ggtgtgttca ctcatatgtt aacgcggttg aaaccaaatg tccaaatcta 18360
gacattgtaa catcttctgg agaatttcat tgttcaggat gtgtggaaca tatgcctgag 18420
tttagctata tgtattggtt ggcaaaggat accaagttta tagaacatct gggtgatggc 18480
atcaaagaag atgaaaccgt tcgtaccaca gatagtggaa tcaccactct acgtaaagtc 18540
cttcatgtaa ccgatactaa taaatttgct cattataggt tcacttgtgt cctcactacg 18600
atagatggcg tttcaaaaaa gaatatttgg ctgaagtagt gcgtgctact atttttattt 18660
atgatataat ctaatggaat taatttgaat tgatatttat ccaatactaa agattatatt 18720
agaatcaaat taatctttta tacgagaaaa aataacgaca tacgtcgtca acaaattaaa 18780
ctttttattt attagttaac ttgctcattg ttatgtttct aaaacgggta cgacatatag 18840
gacaattatc cgacgcaccg gtttctcttc gtgttctatg ccatatattg atgcatgtta 18900
tgcaaaatat atgattacac gaatccaata aaccaaagta tctatcgttt tgagtaaaca 18960
acttcatagc aaatttcaca ttctttttct ttacttactc tatacacgtc ctcgtattta 19020
tccagtattt tgatgatatc caactcagaa atggttgttg tattattggg tgtataggta 19080
ttattagcta tgtaccaatt taccaacctt cttaatattg attgataatc acatcggtta 19140
tccaattaat aactaaattg tagtgtatat atagaccata tatgtttcta tttttttgac 19200
agtttcagta agttttgatt gttgtattcc tgtatctcta gataagttag tcatatagtc 19260
ccttccggcg atacgttttt tccaagcccg aaattgatta gccaaatgtg gatttatttt 19320
tgtgataatg catactgtta gtcttatatc atttggttca tctatgtatt gtaatattgt 19380
tacatgatct atagatgatg tattgatttt ggcaggatcg aattccatat ccgcgactaa 19440
acagtgaaaa aaatgtaaat aattttaaat tagtaaaact tttttttatt ttttatgatt 19500
ccaaaaaaac tgaatacaaa gtcctaaatt ataaatatgg agatcatact accacaactt 19560
attattatgt atacaaggcc ggtgtaatag atagatatat ataattctat tacaccggca 19620
gacaattacc gatcggtatt tgtcgttacc aacataccgt ataatatgta atatacaatt 19680
ccataaccca ttgacagttg ttatacatca aaattgcaat tcttttgatt acgatgttat 19740
aagaatgtag ttaattgatg tatgatgtta atgtgtcctc tttcctctta taacatcgta 19800
atcaaaaact tttttataat atatacctaa taatgtgtct taatagttct cgtgattcgt 19860
caaacaatca ttcttataaa atataataaa gcaacgtaaa aacacataaa aataagcgta 19920
actaataaga caatggatat ttacgacgat aaaggtctac agactattaa actgtttaat 19980
aatgaatttg attgtataag gaatgacatc agagaattat ttaaacatgt aactgattcc 20040
gatagtatac aacttccgat ggaagacaat tctgatatta tagaaaatat cagaaaaata 20100
ctatatagac gattaaaaaa tgtagaatgt gttgacatcg atagtacaat aacttttatg 20160
aaatacgatc caaatgatga taataagcgt acgtgttcta attgggtacc cttaactaat 20220
aactatatgg aatattgtct agtaatatat ttggaaacac cgatatgtgg aggcaaaata 20280
aaattatacc accctacagg aaatataaag tcggataagg atattatgtt tgcaaagact 20340
ctagacttta aatcaacgaa agtgttaact ggacgtaaaa caattgccgt tctagacata 20400
tccgtttcat ataatagatc aatgactact attcactaca acgacgacgt tgatatagat 20460
atacatactg ataaaaacgg aaaagagtta tgttattgtt atataacaat agatgatcat 20520
tacttggttg atgtggaaac tataggagtt atagtcaata gatctggaaa atgtctgtta 20580
gtaaataacc atctaggtat aggtatcgtt aaagataaac gtataagcga tagttttgga 20640
gatgtatgta tggatacaat atttgacttt tctgaagcac gagagttatt ttcattaact 20700
aatgatgata acaggaatat agcatgggac gatgatacag atatatggac tcccgtcaca 20760
gaagatgatt acaaatttct ttctagacta gtattgtatg caaaatctca atcggatact 20820
gtattcgact attatgttct tactggtgat acggaaccac ccactgtatt cattttcaag 20880
gtaactagat tttactttaa tatgccgaaa taaaaaattt ttgtataata tctagaggta 20940
gaggtattgt ttagataaat acaaataaca tagatacatc gcatatttag catttttata 21000
aatatacata agacatacac tttatacatt tttgtaaaaa tactcataaa aaaaatttat 21060
aaaaattatg gcacaaccat atcttgtata ggtagtttag ttcgtcgagt gaacctataa 21120
acagataata gacaacacgt aataataata atgcctacta atacaagcat aataccggga 21180
gatgggatat atgacgttgt agtgtttggt ttttctgaac gttgatagtc tactaatact 21240
acatgctgac atctaatgcc tgtataacca tgagagcatc tacaatacat accgtcgata 21300
tctctagcgt ggatacagtc accgtgtaaa caatatccat ctccctctgg accgcataat 21360
ctgatagctg gaatatctgt tgtagcgttt gtaatttctg gcgatgtcgt ttcgatagcg 21420
ttaccactat cggcgaatga tctgattatc atagcagcga acaacaacat cagataattt 21480
atcaacattt ttgatggatt ttgtgtttat gctgtttctc agtgtgtgtt tatgacaaga 21540
ttgggaattt tatattatta attcagtaat ataaactaat aatatattgt taattgtgta 21600
aataatataa aaataacaat acaatattga atgtgttgct gttaaaaatg tatgtgttaa 21660
tataatagaa taaaataaat gagtatgatc attttagata acgattgatt ttatcattac 21720
cgcttcattc ttatattctt tgcttacgga acctatattt agaaacatct actaacgatt 21780
ttttatgctt gcattattaa tggtatgtaa taccaatttg ttaagtatga atacggggta 21840
caaacataaa ctgaaattta gatcattaaa tgtttcatca gaaatgactc catgaaaacc 21900
gccgaagaac ttcgtgcaat cattggactt tgtactcaat cagctatcgt ctctggaaga 21960
gtcttcaacg ataagtatat cgacatacta cttatgctgc gaaagattct gaacgagaac 22020
gactatctca ccctcttgga tcatatccgc actgctaaat actaaatctc cttcatgctc 22080
tctcactaca ctttttatca tcttatgagg aataattagc accagaatag ctatggattg 22140
cacatgtatt ctatgtcgtc tactggatga agatgtgacg tacaaaaaaa taaaactaga 22200
aattgaaacg tgtcacaact tatcaaaaca tatagataga cgaggaaaca atgcgctaca 22260
ttgttacgtc tccaataaat gcgatacaga cattaagatt gttcgactgt tactctctcg 22320
cggagtcgag agactttgta gaaacaacga aggattaact ccgctaggag catacagtaa 22380
gcatagatac gtaaaatctc agattgtgca tctactgata tccagctatt cgaattcctc 22440
taacgaactc aagtcgaata taaatgattt cgacttacgt ctgctaaaat acctaattgt 22500
ggataaacgg atacgtccgt ccaagaatac gaattatgca atcaatggtc tcggattggt 22560
ggatatatac gtaacgacgc ctaatccgag accagaagta ttgctatggc ttcttaaatc 22620
agaatgttac agcaccggtt acgtatttcg tacctgtatg tacaacagtg atatgtgtaa 22680
gaactctctt cattactata tatcgtctca tagagaatct ctatccaagg atgtaattaa 22740
atgtttgatc gataacaatg tttccatcca atactactgg tcttgctcaa ccatagatat 22800
agagattatt aataaaggat gtggacacgt gtagagtata cgacgtcagc cctatattag 22860
aggcgtatta tctaaacaag cgatttagag taaccccata taatgtagac atggaaatcg 22920
ttaatcttct tattgagaga cgtcatactc ttgtcgacgt aatgcgtagt attacttcgt 22980
acgattccag agaatataac cactacatca tcgataacat tctaaagaga tttagacaac 23040
aggatgaatc catcgtacaa gccatactga taaactactt acattacggc gatatggtaa 23100
gtatacctat cattcaatgc atgttggata agacgacgga caacaacttt gttaataata 23160
atctcgtcga tgtaaacgtc gtaaggttta tcgtggaaaa tatggacacg cggctgtaaa 23220
tcacatatct aacaatggcc gtctatgtat gtacggtctg atattatcga gatttaataa 23280
ttgcgggtat cactgttatg aagatgtatt tgatatacta agcaagtaca tggatgatat 23340
agatatgatc gataactcta ctatattacg cggtcgatgt caataatata caatttgcaa 23400
agcggttatt ggaatatgga gcgagtgtca cgctcgataa tcaatacggc catccagaaa 23460
agcagttacc aaagagaagc tagttgattt attactgagt taccatccca ctctagagac 23520
tatgattgac gcatttaata gagatatacg ctatctatat cctgaaccat tattcgcctg 23580
tatcagatac gccttaatcc tagatgatga ttttccttct aaagtaagta tgatatcgcc 23640
ggtcgtcata aggaactaaa gcgctataga gcagacatta atagaatgaa gaatgcctac 23700
atatcaggcg tctccatgtt tgatatatta tttaaacgaa gcaaacgcca cagattgaga 23760
tacgcaaaga acaatgagag gatcgactcc attaaataat ttatcatgga gtgataatgt 23820
cctgtttcca tggcatatta caaaatcgat tccgtccaag atgataaaaa catttaccgg 23880
catcataaac acggagttta ttttatatgt ctcgcataaa cattactaaa aaaatatatt 23940
gttcggtttt ctttcacatc tttaattatg aaaaagtaaa tcattatgag atggacgcat 24000
cgttcgcgac agtatgtggt acatacctaa cgtatttatg gacgacggta agaatgaagg 24060
tcacgtttct gtcaacaatg tcgacgcgat cgtgtaacac gactcacaat agaatctgtg 24120
aatgctctcc cgatcatgga tgcaaggcat gtgtttccca aacaaaatgt ggaataggat 24180
acggagtatc cggagacgtc atctgttctc cgtgtggtct cggaacatat tctcacaccg 24240
tctcttccgc agataaatgc gaacccgtac ccagaaatac ctttaactat atcgatgtgg 24300
aaattaacct gtatccagtt aacgacacat cgtgtactcg gacgaccact accggtctca 24360
gcgaatccat ctcaacgtcg gaactaacta ttactatgaa tcataaagac tgcgatcccg 24420
tcttcttaat aaggtagcga cttcaggttt ctttacagga gaaaggtgtg cactctgaat 24480
ttcgagatta aatgcaataa caaagattct tcctccaaac agttaacgaa agcaaagaat 24540
gatactatca tgccgcattc ggagacagta actctagcgt cgacatctat atactatata 24600
gtaataccaa tactcaagac tacgaaactg atacaatctc ttatcatgtg ggtaatgtag 24660
ccatatgccc ggtagttgcg atatacataa actgatcact aattccaaac ccacccgctt 24720
tttatagtaa gtttttcacc cataaataca ataattaatt tctcgtaaaa gtagaaaata 24780
tattctaatt tattgcacgg taaggaagta gaatcataaa gaacagtact caatcaatag 24840
caatcatgaa acaatatatc gtactggcat gcatgtgcct gccagtcttc agcaatcatc 24900
ctcatcgtgt acggaagaag aaaacaaaca tcatatggga atcgatgtta ttatcaaagt 24960
cacaaagcaa gaccaaacac cgaccgatga taagatttgc caatccgtaa cggaaattac 25020
agagtccgag tcagatccag atcccgaggt ggaatcagtc gaggatgtag atcctcctac 25080
cacttattac tccatcatcg gtggaggtct gagaatgaac tttggattca ccaaatgtcc 25140
tcagattaaa tccatctcag aatccgctga tggaaagact gtgaggtgtc tatcgacatc 25200
agatgtagcg aagaagagaa agacagcgac atcaagaccc atccagtact cgggtctaac 25260
atctctcata agaaagtgag ttacgaagat atcatcggtt caacgatcgt cgatacaaaa 25320
tgtgtcaaga atctagagtt tagcgttcgt atcggagaca tgtgcaagga atcatctgaa 25380
cttgaggtca agtatgtcga cggatcggca tctgaaggtg caaccgatga tacttcactc 25440
atcgattcaa caaaactcaa agcgtgtgtc tgaatcgata actctattca tctgaaattg 25500
gatgagtagg gttaatcgaa cgattcaggc acaccacgaa ttaaaaaagt gtaccggaca 25560
ctatattccg gtttgcaaaa caaaaagtta cctctcgcga cttcttcttt ttctgtctca 25620
atagtgtgat acgattatga cactattcct atttcctttc agggtatcac aaaaatatta 25680
aacctctttc tgatggtctc atacaaaaat atttttattc tctttctctc tttgatggtc 25740
tcataaaaaa tatttttatt ctctttctct ctttgatggt ctcataaaat atttttattc 25800
tctttctctc tttgatggtc tcataaaaaa tatttttatt ctctttctct ctttgatggt 25860
ctcataaaat atttttattc tctttctctc tttgatggtc tcataaaaaa tatttttatt 25920
ctctttctct ctttgatggt ctcataaaaa atattaaacc tctttctgat ggtgtcacta 25980
aaatattttt attctctttc tctcttcaat ggagtcataa aatattttta ttctctttct 26040
ctcttcgatg gtctcacaaa aatattaaac ctctttctga tggtgtcact aaaatatttt 26100
tattctcttt ctctcttcaa tggagtcata aaatattttt attctctttc tctctttgat 26160
ggtctcacaa aaatattttt attctctttc tctctttgat ggtctcacaa aaatattttt 26220
attctctttc tctctttgat ggtctcacaa aaatattttt attctctttc tctctttgat 26280
ggtctcataa aaaaagtttt acaaaaatat ttttattctc tttctctctt tgatggtctc 26340
ataaaaaaag ttttacaaaa atatttttat tctctttctc tctttgatgg tctcataaaa 26400
aaagttttac aaaaatattt ttattctctt tctctctttg atggtctcat aaaaaatatt 26460
aaacctcttt ctgatggtgt cactaaaata tttttattct cattctctct tcaatggagt 26520
cataaaatat ttttattctc tttctctctt cgatggtctc acaaaaatat taaacctctt 26580
tctgatggtg tcactaaaat atttttattc tcattctctc ttcaatggag tcataaaata 26640
tttttattct ctttctctct tcgatggtct cacaaaaata ttaaacctct ttctgatggt 26700
gtcactaaaa tatttttatt ctcattctct cttcaatgga gtcataaaat atttttattc 26760
tctttctctc tttgatggtc tcataaaaaa agttttacaa aaatattttt attctctttc 26820
tctctttgat ggtctcataa aaaatattaa acctctttct gatggtgtca ctaaaatatt 26880
tttattctct ttctctcttc aatggagtca taaaatattt ttattctctt tctctcttcg 26940
atggtctcac aaaaatatta aacctctttc tgatggtgtc actaaaatat ttttattctc 27000
attctctctt caatggagtc ataaaatatt tttattctct ttctctcttt gatggtctca 27060
taaaaaaagt tttacaaaaa tatttttatt ctctttctct ctttgatggt ctcataaaaa 27120
aagttttaca aaaatatttt tattctcttt ctctctttga tggtctcata aaaaaagttt 27180
tacaaaaata tttttattct ctttctctct ttgatggtct cataaaaaaa gttttacaaa 27240
aatattttta ttctctttct ctctttgatg gtctcataaa aaatattaaa cctctttctg 27300
atggtgtcac taaaatattt ttattctcat tctctcttca atggagtcat aaaatatttt 27360
tattctcttt ctctcttcga tggtctcaca aaaatattaa acctctttct gatggtgtca 27420
ctaaaatatt tttattctca ttctctcttc aatggagtca taaaatattt ttattctctt 27480
tctctctttg atggtctcat aaaaaaagtt ttacaaaaat atttttattc tctttctctc 27540
tttgatggtc tcataaaaaa agttttacaa aaatattttt attctctttc tctctttgat 27600
ggtctcataa aaaaagtttt acaaaaatat ttttattctc tttctctctt tgatggtctc 27660
ataaaaaaag ttttacaaaa atatttttat tctctttctc tctttgatgg tctcacaaaa 27720
atattaaacc tctttctgat ggagtcgtaa aaaagtttta tctctttctc cttcgatggt 27780
ctcacaaaaa tattaaacct ctttctgatg gagtcgtaaa aaagttttat ctctttctct 27840
cttcgatggt ctcacaaaaa tattaaacct ctttctgatg gagtcgtaaa aaagttttat 27900
ctctttctct cttcgatggt ctcactaaaa tattttttat tctctttctg atgcatcaac 27960
tatttcttaa acaataacgt ccaacaacat atactcgtcg agcttatcaa catcccctat 28020
gcccatctag gttaccagac aattgtatat cataaaataa tgtttataat ttttacaaaa 28080
atatttttat tctctttctc tctttgatgg tctcataaaa aaagttttac aaaaatattt 28140
ttattctctt tctctctttg atggtctcat aaaaaatatt aaacctcttt ctgatggtgt 28200
cactaaaata tttttattct cattctctct tcaatggagt cataaaatat ttttattctc 28260
tttctctctt cgatggtctc acaaaaatat taaacctctt tctgatggtg tcactaaaat 28320
atttttattc tcattctctc ttcaatggag tcataaaata tttttattct ctttctctct 28380
ttgatggtct cataaaaaaa gttttacaaa aatattttta ttctctttct ctctttgatg 28440
gtctcataaa aaaagtttta caaaaatatt tttattctct ttctctcttt gatggtctca 28500
taaaaaaagt tttacaaaaa tatttttatt ctctttctct ctttgatggt ctcataaaaa 28560
aagttttaca aaaatatttt tattctcttt ctctctttga tggtctcata aaaaaagttt 28620
tacaaaaata tttttattct ctttctctct ttgatggtct cataaaaaaa gttttacaaa 28680
aatattttta ttctctttct ctctttgatg gtctcataaa aaaagtttta caaaaatatt 28740
tttattctct ttctctcttt gatggtctca taaaaaatat taaacctctt tctgatggtg 28800
tcactaaaat atttttattc tctttctctc ttcaatggag tcataaaata tttttattct 28860
ctttctctct tcgatggtct cacaaaaata ttaaacctct ttctgatggt gtcactaaaa 28920
tatttttatt ctcattctct cttcaatgga gtcataaaat atttttattc tctttctctc 28980
tttgatggtc tcataaaaaa agttttacaa aaatattttt attctctttc tctctttgat 29040
ggtctcataa aaaaagtttt acaaaaatat ttttattctc tttctctctt tgatggtctc 29100
ataaaaaaag ttttacaaaa atatttttat tctctttctc tctttgatgg tctcataaaa 29160
aaagttttac aaaaatattt ttattctctt tctctctttg atggtctcat aaaaaaagtt 29220
ttacaaaaat atttttattc tctttctctc tttgatggtc tcataaaaaa tattaaacct 29280
ctttctgatg gtgtcactaa aatattttta ttctctttct ctcttcaatg gagtcataaa 29340
atatttttat tctctttctc tcttcgatgg tctcacaaaa atattaaacc tctttctgat 29400
ggtgtcacta aaatattttt attctctttc tctcttcaat ggagtcataa aatattttta 29460
ttctctttct ctctttgatg gtctcataaa aaaagtttta caaaaatatt tttattctct 29520
ttctctcttt gatggtctca taaaaaaagt tttacaaaaa tatttttatt ctctttctct 29580
ctttgatggt ctcataaaaa aagttttaca aaaatatttt tattctcttt ctctctttga 29640
tggtctcata aaaaaagttt tacaaaaata tttttattct ctttctctct ttgatggtct 29700
cataaaaaaa gttttacaaa aatattttta ttctctttct ctctttgatg gtctcataaa 29760
aaaagtttta caaaaatatt tttattctct ttctctcttt gatggtctca taaaaaaagt 29820
tttacaaaaa tatttttatt ctctttctct ctttgatggt ctcataaaaa atattaaacc 29880
tctttctgat ggtgtcacta aaatattttt attctcattt tctctttctc tcttcaatgg 29940
agtcataaaa tatttttatt ctctttctct ctttgatggt ctcataaaaa atattaaacc 30000
tctttctgat ggtgtcacta aaatattttt attctcattc tctcttcaat ggagtcataa 30060
aaaagtttta tctctttctc tcttcgatgg tctcacaaaa atattaaacc tctttctgat 30120
ggagtcgtaa aaaagtttta tctctttctc tcttcgatgg tctcacaaaa atattaaacc 30180
tctttctgat gcatcaacta tttcttaaac aataacgtcc aacaacatat actcatcccc 30240
tatgcccatc taggttacca gacaattgta tatcataaaa taatgtttat aatttacacg 30300
ttaaaatcat ataataaaac gtagatcgta taatattttt tggtatataa atgatctagt 30360
aaaatccatg taggggatac tgctcacatt ttttctttgg tacaaaattt cacacaagtt 30420
tttatacaga caaattcttg tccatatatt ttaaaacatt gacttttgta ctaagaaaaa 30480
tatctagact aactatctct ttctctttct ctcttcgatg gtctttctga tggagtcgta 30540
aaaaagtttt atctctttct ctcttcgatg gtctcacaaa aatattaaac ctctttctga 30600
tggagtcgta aaaaagtttt atctctttct ctcttcgatg gtctcacaaa aatattaaac 30660
ctctttctga tggagtcgta aaaaagtttt atctctttct ccttcgatgg tctcacaaaa 30720
atattaaacc tctttctgat ggagtcgtaa aaaagtttta tctctttctc tcttcgatgg 30780
tctcacaaaa atattaaacc tctttctgat ggtgtcacta aaatattttt attctctttc 30840
tctcttcgat ggtctcacaa aaatattaaa cctctttctg atggagtcgt aaaaaagttt 30900
tatctctttc tccttcgatg gtctcacaaa aatattaaac ctctttctga tggagtcgta 30960
aaaaagtttt atctctttct ctcttcgatg gtctcacaaa aatattaaac ctctttctga 31020
tggagtcgta aaaaagtttt atctctttct ccttcgatgg tctcacaaaa atattaaacc 31080
tctttctgat ggagtcgtaa aaaagtttta tctctttctc cttcgatggt ctcacaaaaa 31140
tattaaacct ctttctgatg gagtcgtaaa aaagttttat ctctttctcc ttcgatggtc 31200
tcacaaaaat attaaacctc tttctgatgg agtcgtaaaa aagttttatc tctttctctc 31260
ttcgatggtc tcacaaaaat attaaacctc tttctgatgg agtcgtaaaa aagttttatc 31320
tctttctctc ttcgatggtc tcacaaaaat attaaacctc tttctgatgg agtcgtaaaa 31380
aagttttatc tctttctcct tcgatggtct cacaaaaata ttaaacctct ttctgatggt 31440
ctctataaag cgattgattt ttcttaccct ctagagtttc ctacggtcgt tggtcacaca 31500
tttttttcta gacactaaat aaatatttaa aatataatat taatatacta aaatttatgt 31560
attattaatt tataatatta atatactaaa atttatgtat tattaattta tctaactaaa 31620
gttagtaaat catatacata attttataat taatattata tacataattt tataattaat 31680
ttaatcttac 31690
<210> 71
<211> 3948
<212> DNA
<213> 人工序列
<220>
<223> 疫苗构建体
<400> 71
atggatgcta tgaagagggg cctgtgctgc gtgctgctgc tgtgtggcgc cgtgtttgtg 60
tcccccagcc aggaaatcca cgcccggttc agaagaggca gcaagctggc tgctcagctg 120
gccgacagcg acagcaatgc ctgcgccttc ctgaagctga gatacctgca cagccggatc 180
cacgtgctgc agtttctgaa ccccttcacc ctgcacgagt acatgctgga cctgcagccc 240
gagacaaccg acctgtactg ctacgagcag gacgaggatg aggacgaggt ggaccatctg 300
caggaacagc cccagcaggc cagaagggat gagcagcacc cctgctacct gatcgagaca 360
cagtgctgca gatgcgagag cctggtggcc cagctggctg acgtgaactc taacgccgca 420
gcctttctga agaacagcat cgactgcaac gacagcatgt gcagcacctt cgacgacaac 480
gtgtccgcca ccgagctcgt gaagagaatc cctgcccctt gcccctgggc ccctgaggaa 540
aatgacgaga tcgacggcgt gaaccaccag catctgcccg ctagaagggc cgagcctcag 600
agacacacca tgctgtgcat gtgctgcaag tgcgaggccc ggatctgctt cctgctgtgc 660
ttttgtgtgc tgctgtgcgt gtgcctgctg atcagacccc tgctgctgag tgtgtccacc 720
tactgccagc ggccgctgtg tcctcaggaa aagaaacggc acgtggacct gaacaagcgg 780
ttccacatct acatctgcga ggacgcccag tgcaccgtgg tggaaggcca ggtggacagc 840
aaggtgtccg agttccggtg gtacagatac agcgtgtacg gcaccaccct gggccagcgg 900
agaatcaaga ggcccagatc cgaggtgtac tgcaagggac agctgaccga gacagaggtg 960
ctggacttcg ccttcaccga cctgaccatc gtgtaccggg acatccggct ggaatgcgcc 1020
atcatgtaca aggccagaga gatgggcttc cacgaccaca tcgactactg gaagctgatt 1080
agactggaat gtgctatctt ctacaaagcc cggatcctga agtgcctgcg gtacagattc 1140
aagaagcact gcaagctgta cgtggcctgg gactccgtgt actactgcgg cgacgatggc 1200
tggtgcaaga ccagctctac ctggcactgg acatgccacg acggcaagca caagaacgtg 1260
tgccaggaca agatcctgga acactacgag aacgactcca aggacgacga ggacgagaca 1320
gcctacgaca gcggcaccga tctgatcgac ttcatcgacg atagcaactg ccaccccaac 1380
aagctgctgc ggagactgag cagcgaccag gaccagtctc agaggccccc caacatggga 1440
gtgaaggccc acggcaagtg catctgggag aacaaggtgt tcatcgtgcc caccctgtgc 1500
cccgtgcctc tggatccaac ataccccctg ctgaagctgc tgacccccga aaccacagat 1560
ctgcactgtt atgagcagct gggcgactcc tccgacgaag aggatacagg cggcctggat 1620
ggctacgagg ccgacaagaa cgacctgaac gcccagatcg agcactggaa actgatccgg 1680
atggaatgtg caattttcta taaggccaaa gagctgggga tcagcgacga gaatgagaac 1740
gacagcgata ccggcgagga catggtggat ttcatcgaca atgaggccga gaagtacggc 1800
tgcaagggca cctgggaggt gcacttcggc tttaagaagc acggcatcac catcaccgtg 1860
cagtacgaca acgacaaggc caacaccatg gactacacca actggaaaga gatctacccc 1920
cctccccccc cacggccttg ggctcctcca attcctaagc cctctccatg ggcccctcag 1980
ttcgacggcg acatctgcaa taccatgcac tataccaatt gggtggtgta cagagacagc 2040
atcccccacg ccgcctgcca caagtgtatc gacttctaca gcagaatcag agagctgcgg 2100
cactacagcg actctgtgta cggcgatacc ctggaaaagc tgaccaacac cggcctgtac 2160
aatctgctga tccggtgcct gaggctcgtg accaagtatc ctctgctgaa actgctgtcc 2220
aactgcatcc tgtacggcgc tgccaatacc ggcaagagcc tgttcggcat gagcctgagc 2280
aaagtgcgga agctgaggta ctacaactgc tccgtgtatg gggccagcct gtgcgtggaa 2340
tgcaagaaaa ccctgcagcg gagcgaagtg tacgacgacg aaaccgacga ggaaagcacc 2400
gagagcgacc tggacggctt catcgataac agcgtgatcg tgtgccccgc ctccatcccc 2460
tccgatgaga tctctaccga ggaagccccc agaccccctc actgtccttg ggtgccagtg 2520
ttctgcaaga aggccctgac cgcctctgag gtgtacaatt ttgcctatac cgacctgcgc 2580
gtggtgtata gggacattct ggaacattat gagaatgata gcaaggacct gtgcgatcac 2640
atcaactgcc tcgtgatcta cggccctcct aacaccggca agtcctgctt cgccatgtcc 2700
ctgtggaaca ccgagcccaa gcactgcttc aagaagggcg gccagcacat cgaagtgtgg 2760
ttcgatattg tgtacaggga cggcaaccct tacgccgtgt gcgacaagtg cctgaagttc 2820
tactccaaga tcagcgagta ccgccactac tgctactccc tgtatggcac aacactggaa 2880
cagcagtaca acaagcccct gtgcgacctg ctgattcgct gcatcaacac caccagatac 2940
cctctgctgt ccctgctgaa cagctacagc accccccctc atcggattcc cgccccatgt 3000
ccatgggctc cacagaggcc tacccagacc accacccccg agaatacctc cctggtggaa 3060
ctgagagtga ccacccccaa gagcacagtc gtgatcaggc tgcacctgtg gccctacctg 3120
cactccagac tggtggtgtt caccttcccc aacccctttc accaggtggt gcccgccctg 3180
aatatctgca aggccaaggc ctgcaaagcc atcgagaaga aatgggaagt gcacgctggc 3240
ggccaagtga tcctgtgtcc tgagagcctg cggcctctgc tgctgtccat tagcgtgtac 3300
gcccaggtgc tggtgctggt gctgctgctg tgggtgtcca tcggcagcaa ctgtctggtg 3360
ctgtgcggcc ctgccaacac agggaagagt tacttcggca tgtctctgat ctgccatcag 3420
gtggtgcctc cactggccgc ctctaaggct aaagcctgtc aggccatcga actgcagctg 3480
gccctggaag ccctgaatgc cagcccctat gatcacattg attactggaa agccatccgg 3540
caggaaaatg ccatcttctt cgccgccaga tggcattgga cctgtcacga tggaaaacac 3600
aagaatgcca ttgtgaccct gacctacctg ctgcccagcg tgtgtatgtg cgcctacgct 3660
tgggtgctgg tgttcgtgta catcgtcgtg attaccagcc ccgccaccgc cgatgagtgg 3720
acactgcagc agacaagcct ggaaatgtgg ctggccgagc cccagtgtga ccatatcgat 3780
tattggaaac acatccgcct ggaatgtgct attatgtata aggcccggtg gccttacctg 3840
gaaagcagaa ccgtgttcga gttccccaat gccttcgccg gctctggacc tggcgcctct 3900
ggaaaaccca tccccaatcc actgctgggc ctggactcca cccggacc 3948
<210> 72
<211> 1316
<212> PRT
<213> 人工序列
<220>
<223> 疫苗构建体
<400> 72
Met Asp Ala Met Lys Arg Gly Leu Cys Cys Val Leu Leu Leu Cys Gly
1 5 10 15
Ala Val Phe Val Ser Pro Ser Gln Glu Ile His Ala Arg Phe Arg Arg
20 25 30
Gly Ser Lys Leu Ala Ala Gln Leu Ala Asp Ser Asp Ser Asn Ala Cys
35 40 45
Ala Phe Leu Lys Leu Arg Tyr Leu His Ser Arg Ile His Val Leu Gln
50 55 60
Phe Leu Asn Pro Phe Thr Leu His Glu Tyr Met Leu Asp Leu Gln Pro
65 70 75 80
Glu Thr Thr Asp Leu Tyr Cys Tyr Glu Gln Asp Glu Asp Glu Asp Glu
85 90 95
Val Asp His Leu Gln Glu Gln Pro Gln Gln Ala Arg Arg Asp Glu Gln
100 105 110
His Pro Cys Tyr Leu Ile Glu Thr Gln Cys Cys Arg Cys Glu Ser Leu
115 120 125
Val Ala Gln Leu Ala Asp Val Asn Ser Asn Ala Ala Ala Phe Leu Lys
130 135 140
Asn Ser Ile Asp Cys Asn Asp Ser Met Cys Ser Thr Phe Asp Asp Asn
145 150 155 160
Val Ser Ala Thr Glu Leu Val Lys Arg Ile Pro Ala Pro Cys Pro Trp
165 170 175
Ala Pro Glu Glu Asn Asp Glu Ile Asp Gly Val Asn His Gln His Leu
180 185 190
Pro Ala Arg Arg Ala Glu Pro Gln Arg His Thr Met Leu Cys Met Cys
195 200 205
Cys Lys Cys Glu Ala Arg Ile Cys Phe Leu Leu Cys Phe Cys Val Leu
210 215 220
Leu Cys Val Cys Leu Leu Ile Arg Pro Leu Leu Leu Ser Val Ser Thr
225 230 235 240
Tyr Cys Gln Arg Pro Leu Cys Pro Gln Glu Lys Lys Arg His Val Asp
245 250 255
Leu Asn Lys Arg Phe His Ile Tyr Ile Cys Glu Asp Ala Gln Cys Thr
260 265 270
Val Val Glu Gly Gln Val Asp Ser Lys Val Ser Glu Phe Arg Trp Tyr
275 280 285
Arg Tyr Ser Val Tyr Gly Thr Thr Leu Gly Gln Arg Arg Ile Lys Arg
290 295 300
Pro Arg Ser Glu Val Tyr Cys Lys Gly Gln Leu Thr Glu Thr Glu Val
305 310 315 320
Leu Asp Phe Ala Phe Thr Asp Leu Thr Ile Val Tyr Arg Asp Ile Arg
325 330 335
Leu Glu Cys Ala Ile Met Tyr Lys Ala Arg Glu Met Gly Phe His Asp
340 345 350
His Ile Asp Tyr Trp Lys Leu Ile Arg Leu Glu Cys Ala Ile Phe Tyr
355 360 365
Lys Ala Arg Ile Leu Lys Cys Leu Arg Tyr Arg Phe Lys Lys His Cys
370 375 380
Lys Leu Tyr Val Ala Trp Asp Ser Val Tyr Tyr Cys Gly Asp Asp Gly
385 390 395 400
Trp Cys Lys Thr Ser Ser Thr Trp His Trp Thr Cys His Asp Gly Lys
405 410 415
His Lys Asn Val Cys Gln Asp Lys Ile Leu Glu His Tyr Glu Asn Asp
420 425 430
Ser Lys Asp Asp Glu Asp Glu Thr Ala Tyr Asp Ser Gly Thr Asp Leu
435 440 445
Ile Asp Phe Ile Asp Asp Ser Asn Cys His Pro Asn Lys Leu Leu Arg
450 455 460
Arg Leu Ser Ser Asp Gln Asp Gln Ser Gln Arg Pro Pro Asn Met Gly
465 470 475 480
Val Lys Ala His Gly Lys Cys Ile Trp Glu Asn Lys Val Phe Ile Val
485 490 495
Pro Thr Leu Cys Pro Val Pro Leu Asp Pro Thr Tyr Pro Leu Leu Lys
500 505 510
Leu Leu Thr Pro Glu Thr Thr Asp Leu His Cys Tyr Glu Gln Leu Gly
515 520 525
Asp Ser Ser Asp Glu Glu Asp Thr Gly Gly Leu Asp Gly Tyr Glu Ala
530 535 540
Asp Lys Asn Asp Leu Asn Ala Gln Ile Glu His Trp Lys Leu Ile Arg
545 550 555 560
Met Glu Cys Ala Ile Phe Tyr Lys Ala Lys Glu Leu Gly Ile Ser Asp
565 570 575
Glu Asn Glu Asn Asp Ser Asp Thr Gly Glu Asp Met Val Asp Phe Ile
580 585 590
Asp Asn Glu Ala Glu Lys Tyr Gly Cys Lys Gly Thr Trp Glu Val His
595 600 605
Phe Gly Phe Lys Lys His Gly Ile Thr Ile Thr Val Gln Tyr Asp Asn
610 615 620
Asp Lys Ala Asn Thr Met Asp Tyr Thr Asn Trp Lys Glu Ile Tyr Pro
625 630 635 640
Pro Pro Pro Pro Arg Pro Trp Ala Pro Pro Ile Pro Lys Pro Ser Pro
645 650 655
Trp Ala Pro Gln Phe Asp Gly Asp Ile Cys Asn Thr Met His Tyr Thr
660 665 670
Asn Trp Val Val Tyr Arg Asp Ser Ile Pro His Ala Ala Cys His Lys
675 680 685
Cys Ile Asp Phe Tyr Ser Arg Ile Arg Glu Leu Arg His Tyr Ser Asp
690 695 700
Ser Val Tyr Gly Asp Thr Leu Glu Lys Leu Thr Asn Thr Gly Leu Tyr
705 710 715 720
Asn Leu Leu Ile Arg Cys Leu Arg Leu Val Thr Lys Tyr Pro Leu Leu
725 730 735
Lys Leu Leu Ser Asn Cys Ile Leu Tyr Gly Ala Ala Asn Thr Gly Lys
740 745 750
Ser Leu Phe Gly Met Ser Leu Ser Lys Val Arg Lys Leu Arg Tyr Tyr
755 760 765
Asn Cys Ser Val Tyr Gly Ala Ser Leu Cys Val Glu Cys Lys Lys Thr
770 775 780
Leu Gln Arg Ser Glu Val Tyr Asp Asp Glu Thr Asp Glu Glu Ser Thr
785 790 795 800
Glu Ser Asp Leu Asp Gly Phe Ile Asp Asn Ser Val Ile Val Cys Pro
805 810 815
Ala Ser Ile Pro Ser Asp Glu Ile Ser Thr Glu Glu Ala Pro Arg Pro
820 825 830
Pro His Cys Pro Trp Val Pro Val Phe Cys Lys Lys Ala Leu Thr Ala
835 840 845
Ser Glu Val Tyr Asn Phe Ala Tyr Thr Asp Leu Arg Val Val Tyr Arg
850 855 860
Asp Ile Leu Glu His Tyr Glu Asn Asp Ser Lys Asp Leu Cys Asp His
865 870 875 880
Ile Asn Cys Leu Val Ile Tyr Gly Pro Pro Asn Thr Gly Lys Ser Cys
885 890 895
Phe Ala Met Ser Leu Trp Asn Thr Glu Pro Lys His Cys Phe Lys Lys
900 905 910
Gly Gly Gln His Ile Glu Val Trp Phe Asp Ile Val Tyr Arg Asp Gly
915 920 925
Asn Pro Tyr Ala Val Cys Asp Lys Cys Leu Lys Phe Tyr Ser Lys Ile
930 935 940
Ser Glu Tyr Arg His Tyr Cys Tyr Ser Leu Tyr Gly Thr Thr Leu Glu
945 950 955 960
Gln Gln Tyr Asn Lys Pro Leu Cys Asp Leu Leu Ile Arg Cys Ile Asn
965 970 975
Thr Thr Arg Tyr Pro Leu Leu Ser Leu Leu Asn Ser Tyr Ser Thr Pro
980 985 990
Pro His Arg Ile Pro Ala Pro Cys Pro Trp Ala Pro Gln Arg Pro Thr
995 1000 1005
Gln Thr Thr Thr Pro Glu Asn Thr Ser Leu Val Glu Leu Arg Val
1010 1015 1020
Thr Thr Pro Lys Ser Thr Val Val Ile Arg Leu His Leu Trp Pro
1025 1030 1035
Tyr Leu His Ser Arg Leu Val Val Phe Thr Phe Pro Asn Pro Phe
1040 1045 1050
His Gln Val Val Pro Ala Leu Asn Ile Cys Lys Ala Lys Ala Cys
1055 1060 1065
Lys Ala Ile Glu Lys Lys Trp Glu Val His Ala Gly Gly Gln Val
1070 1075 1080
Ile Leu Cys Pro Glu Ser Leu Arg Pro Leu Leu Leu Ser Ile Ser
1085 1090 1095
Val Tyr Ala Gln Val Leu Val Leu Val Leu Leu Leu Trp Val Ser
1100 1105 1110
Ile Gly Ser Asn Cys Leu Val Leu Cys Gly Pro Ala Asn Thr Gly
1115 1120 1125
Lys Ser Tyr Phe Gly Met Ser Leu Ile Cys His Gln Val Val Pro
1130 1135 1140
Pro Leu Ala Ala Ser Lys Ala Lys Ala Cys Gln Ala Ile Glu Leu
1145 1150 1155
Gln Leu Ala Leu Glu Ala Leu Asn Ala Ser Pro Tyr Asp His Ile
1160 1165 1170
Asp Tyr Trp Lys Ala Ile Arg Gln Glu Asn Ala Ile Phe Phe Ala
1175 1180 1185
Ala Arg Trp His Trp Thr Cys His Asp Gly Lys His Lys Asn Ala
1190 1195 1200
Ile Val Thr Leu Thr Tyr Leu Leu Pro Ser Val Cys Met Cys Ala
1205 1210 1215
Tyr Ala Trp Val Leu Val Phe Val Tyr Ile Val Val Ile Thr Ser
1220 1225 1230
Pro Ala Thr Ala Asp Glu Trp Thr Leu Gln Gln Thr Ser Leu Glu
1235 1240 1245
Met Trp Leu Ala Glu Pro Gln Cys Asp His Ile Asp Tyr Trp Lys
1250 1255 1260
His Ile Arg Leu Glu Cys Ala Ile Met Tyr Lys Ala Arg Trp Pro
1265 1270 1275
Tyr Leu Glu Ser Arg Thr Val Phe Glu Phe Pro Asn Ala Phe Ala
1280 1285 1290
Gly Ser Gly Pro Gly Ala Ser Gly Lys Pro Ile Pro Asn Pro Leu
1295 1300 1305
Leu Gly Leu Asp Ser Thr Arg Thr
1310 1315
<210> 73
<211> 3321
<212> DNA
<213> 人工序列
<220>
<223> 疫苗构建体
<400> 73
atggatgcta tgaagcgagg actgtgctgc gtgctgctgc tgtgtggcgc tgtgtttgtg 60
tcccctagcc aagagatcca cgccagattc agacggggca gcaaactggc cgacgaggat 120
gagacagcct acgactctgg caccgacctg atcgacttca tcgacgacag cgacgagaac 180
gagaatgaca gcgacaccgg cgaggacatg gtggatttca tcgacaatgc ccagctggcc 240
gactccgact ctaatgcctg tgcctttctg aaggctcagc tggctgacgt gaacagcaat 300
gccgccgctt tcctgaagaa ctgcatcctg ctgtacggcg ctgccaacac aggcaagagc 360
ctgtttggca tgagcctgaa ctgcctggtg ctgtgcggac ctgccaatac cggcaaaagc 420
tacttcggca tgtccctgtg gccttacctg cacagcagac tggtggtgtt tacattcccc 480
aatcctttct ggccctacct ggaaagccgg atcaccgtgt tcgagttccc caacgccttc 540
aacgtgtgcc aggacaagat cctggaacac tatgagaacg acagcaagga catccttgag 600
cactacgaaa acgactccaa ggacctgtgc gaccacatct gcgatcacat cgactactgg 660
aagcacatcc ggctggaatg cgccatcatg tacaaggccc ggatcagact ggaatgtgct 720
attatgtata aggctcgcga gatgggcttc caccagttcg acggcgacat ctgcaacacc 780
atgcactaca ccaactggat ctatatctgc gaggacgccc agtgcaccgt ggtggaagga 840
caggtggaca agaaatggga agtgcacgct ggcggccaag tgattctgtg tcctgagagc 900
ggccagcgga gaatcaagag gcccagatcc gagaactgtc accccaacaa gctgctgatc 960
ctgaagtgcc tgcggtacag attcaagaag cactgcaagc tgagcagcac ctggcactgg 1020
acatgccacg atggcaagca caagtggcat tggacctgtc acgacgggaa acacaagaac 1080
gccatcgtga ccctgaccta ctacgaggcc gacaagaacg acctgaacgc ccagattgag 1140
cactggaaac tgatccggat ggaatgtgca atcttctata aggccaaaga gctggggatc 1200
agcatctgcc accaggtggt gcctccactg gctgcctcta aagccaaagc ctgtcaggcc 1260
atcgaactgc agctggccct ggaagccctg aacgctagcc cttacgatga gtggaccctg 1320
cagcagacca gcctggaaat gtggctggcc gagcctcagt ttaagaagca cggcatcacc 1380
atcaccgtgc agtacgacaa cgacaaggcc aataccatgg attacacgaa ttggaaagaa 1440
atctacgtga tcgtgtgccc cgccagcatt ccctccgatg agatctctac cgaggaagcc 1500
gaccacattg attattggaa ggccatccgg caagagaatg ccatcttctt cgccgccaga 1560
catcaggtgg tccccgctct gaatatctgc aaggccaagg cctgcaaagc catcgagtgg 1620
aacaccgagc ctaagcactg cttcaagaaa ggcggccagc acatcgaagt ttggttcgac 1680
tacgtggcct gggacagcgt gtactactgc ggagatgatg gctggtgcaa gaccgaggcc 1740
gagaagtacg gctgtaaagg cacctgggaa gtccacttcg gcaacagcat cgactgcaac 1800
gatagcatgt gcagcacctt cgacgacaac gtgtccgcca cagagctggt caaggaccat 1860
atagactatt ggaagctgat caggcttgag tgcgccattt tctacaaggc cagacggcgg 1920
ctgtccagcg accaggatca atctcagctc gtgaccaagt atcccctgct gaagctgctg 1980
tctacccaga ccaccacacc tgagaacaca agcctggtgg aactgagagt gaccacacct 2040
aagagcaccg tcgtgattcg gctgcacctg accacaagat accctctgct gagcctgctg 2100
aacagctaca gcacccctcc acacaggatc cccgctccat gtccttgggc tcctcagagg 2160
cctcctattc ctaagccttc tccatgggct cctagaatcc ccgcaccttg tccatgggca 2220
ccaccaagac ctccacattg cccttgggtg ccctgtttcc tgctgtgctt ttgcgtgctc 2280
ctgtgcgtgt gcctgctgat cagacctctg ctgctgagcg tgtccaccta ccttagacca 2340
ctgctcctgt ccatctccgt gtacgcacag gtgctggtgc tggtcctgct tctgtgggtg 2400
tccatcggaa gcctgctgcc tagcgtgtgc atgtgtgcct atgcttgggt gctcgtgttc 2460
gtgtacatcg tggtcatcac aagccccgcc acagccatcg tgtacagaga tggcaatccc 2520
tacgccgtgt gcgacaagtg cctgaagttc tacagcaaga tcagcgagta ccggcactac 2580
tgctacagcc tgtacggcac cacactggaa cagcagtaca acaagcccct gtgcgatctg 2640
ctgattcggt gcatcaacgt ggtgtaccgg gacagcattc ctcacgccgc ctgccacaag 2700
tgcatcgact tctactccag aatcagagag ctgcggcact acagcgactc tgtgtacggc 2760
gacaccctgg aaaagctgac caacaccggc ctgtacaacc tgctgattag atgcctgcgg 2820
gtgtactgca agggacagct gacagagaca gaggtgctgg acttcgcctt caccgatctg 2880
acaatcgtgt atcgggatag caaggtgtcc gagttccggt ggtacagata tagcgtgtac 2940
ggaacaaccc tgtgcgtcga gtgcaagaaa accctgcaga gaagcgaggt gtacgactgc 3000
cagaggccac tgtgccctca agagaagaaa cggcacgtgg acctgaacaa gcggtttcac 3060
accctgcacg agtacatgct ggacctgcag cctgagacaa ccgacctgta ctgctacgag 3120
cagcccgaaa ccacagatct gcactgttat gagcagctgg gcgacagcag cgacgaagag 3180
gatacaggcg gactggacgg cgaggaaaac gacgaaattg acggcgtgaa ccaccagcat 3240
ctccccgcca gaagggctga acctcagaga cacaccatgc tgtgtatgtg ctgcaagtgc 3300
gaggccagaa tcgcctgatg a 3321
<210> 74
<211> 1105
<212> PRT
<213> 人工序列
<220>
<223> 疫苗构建体
<400> 74
Met Asp Ala Met Lys Arg Gly Leu Cys Cys Val Leu Leu Leu Cys Gly
1 5 10 15
Ala Val Phe Val Ser Pro Ser Gln Glu Ile His Ala Arg Phe Arg Arg
20 25 30
Gly Ser Lys Leu Ala Asp Glu Asp Glu Thr Ala Tyr Asp Ser Gly Thr
35 40 45
Asp Leu Ile Asp Phe Ile Asp Asp Ser Asp Glu Asn Glu Asn Asp Ser
50 55 60
Asp Thr Gly Glu Asp Met Val Asp Phe Ile Asp Asn Ala Gln Leu Ala
65 70 75 80
Asp Ser Asp Ser Asn Ala Cys Ala Phe Leu Lys Ala Gln Leu Ala Asp
85 90 95
Val Asn Ser Asn Ala Ala Ala Phe Leu Lys Asn Cys Ile Leu Leu Tyr
100 105 110
Gly Ala Ala Asn Thr Gly Lys Ser Leu Phe Gly Met Ser Leu Asn Cys
115 120 125
Leu Val Leu Cys Gly Pro Ala Asn Thr Gly Lys Ser Tyr Phe Gly Met
130 135 140
Ser Leu Trp Pro Tyr Leu His Ser Arg Leu Val Val Phe Thr Phe Pro
145 150 155 160
Asn Pro Phe Trp Pro Tyr Leu Glu Ser Arg Ile Thr Val Phe Glu Phe
165 170 175
Pro Asn Ala Phe Asn Val Cys Gln Asp Lys Ile Leu Glu His Tyr Glu
180 185 190
Asn Asp Ser Lys Asp Ile Leu Glu His Tyr Glu Asn Asp Ser Lys Asp
195 200 205
Leu Cys Asp His Ile Cys Asp His Ile Asp Tyr Trp Lys His Ile Arg
210 215 220
Leu Glu Cys Ala Ile Met Tyr Lys Ala Arg Ile Arg Leu Glu Cys Ala
225 230 235 240
Ile Met Tyr Lys Ala Arg Glu Met Gly Phe His Gln Phe Asp Gly Asp
245 250 255
Ile Cys Asn Thr Met His Tyr Thr Asn Trp Ile Tyr Ile Cys Glu Asp
260 265 270
Ala Gln Cys Thr Val Val Glu Gly Gln Val Asp Lys Lys Trp Glu Val
275 280 285
His Ala Gly Gly Gln Val Ile Leu Cys Pro Glu Ser Gly Gln Arg Arg
290 295 300
Ile Lys Arg Pro Arg Ser Glu Asn Cys His Pro Asn Lys Leu Leu Ile
305 310 315 320
Leu Lys Cys Leu Arg Tyr Arg Phe Lys Lys His Cys Lys Leu Ser Ser
325 330 335
Thr Trp His Trp Thr Cys His Asp Gly Lys His Lys Trp His Trp Thr
340 345 350
Cys His Asp Gly Lys His Lys Asn Ala Ile Val Thr Leu Thr Tyr Tyr
355 360 365
Glu Ala Asp Lys Asn Asp Leu Asn Ala Gln Ile Glu His Trp Lys Leu
370 375 380
Ile Arg Met Glu Cys Ala Ile Phe Tyr Lys Ala Lys Glu Leu Gly Ile
385 390 395 400
Ser Ile Cys His Gln Val Val Pro Pro Leu Ala Ala Ser Lys Ala Lys
405 410 415
Ala Cys Gln Ala Ile Glu Leu Gln Leu Ala Leu Glu Ala Leu Asn Ala
420 425 430
Ser Pro Tyr Asp Glu Trp Thr Leu Gln Gln Thr Ser Leu Glu Met Trp
435 440 445
Leu Ala Glu Pro Gln Phe Lys Lys His Gly Ile Thr Ile Thr Val Gln
450 455 460
Tyr Asp Asn Asp Lys Ala Asn Thr Met Asp Tyr Thr Asn Trp Lys Glu
465 470 475 480
Ile Tyr Val Ile Val Cys Pro Ala Ser Ile Pro Ser Asp Glu Ile Ser
485 490 495
Thr Glu Glu Ala Asp His Ile Asp Tyr Trp Lys Ala Ile Arg Gln Glu
500 505 510
Asn Ala Ile Phe Phe Ala Ala Arg His Gln Val Val Pro Ala Leu Asn
515 520 525
Ile Cys Lys Ala Lys Ala Cys Lys Ala Ile Glu Trp Asn Thr Glu Pro
530 535 540
Lys His Cys Phe Lys Lys Gly Gly Gln His Ile Glu Val Trp Phe Asp
545 550 555 560
Tyr Val Ala Trp Asp Ser Val Tyr Tyr Cys Gly Asp Asp Gly Trp Cys
565 570 575
Lys Thr Glu Ala Glu Lys Tyr Gly Cys Lys Gly Thr Trp Glu Val His
580 585 590
Phe Gly Asn Ser Ile Asp Cys Asn Asp Ser Met Cys Ser Thr Phe Asp
595 600 605
Asp Asn Val Ser Ala Thr Glu Leu Val Lys Asp His Ile Asp Tyr Trp
610 615 620
Lys Leu Ile Arg Leu Glu Cys Ala Ile Phe Tyr Lys Ala Arg Arg Arg
625 630 635 640
Leu Ser Ser Asp Gln Asp Gln Ser Gln Leu Val Thr Lys Tyr Pro Leu
645 650 655
Leu Lys Leu Leu Ser Thr Gln Thr Thr Thr Pro Glu Asn Thr Ser Leu
660 665 670
Val Glu Leu Arg Val Thr Thr Pro Lys Ser Thr Val Val Ile Arg Leu
675 680 685
His Leu Thr Thr Arg Tyr Pro Leu Leu Ser Leu Leu Asn Ser Tyr Ser
690 695 700
Thr Pro Pro His Arg Ile Pro Ala Pro Cys Pro Trp Ala Pro Gln Arg
705 710 715 720
Pro Pro Ile Pro Lys Pro Ser Pro Trp Ala Pro Arg Ile Pro Ala Pro
725 730 735
Cys Pro Trp Ala Pro Pro Arg Pro Pro His Cys Pro Trp Val Pro Cys
740 745 750
Phe Leu Leu Cys Phe Cys Val Leu Leu Cys Val Cys Leu Leu Ile Arg
755 760 765
Pro Leu Leu Leu Ser Val Ser Thr Tyr Leu Arg Pro Leu Leu Leu Ser
770 775 780
Ile Ser Val Tyr Ala Gln Val Leu Val Leu Val Leu Leu Leu Trp Val
785 790 795 800
Ser Ile Gly Ser Leu Leu Pro Ser Val Cys Met Cys Ala Tyr Ala Trp
805 810 815
Val Leu Val Phe Val Tyr Ile Val Val Ile Thr Ser Pro Ala Thr Ala
820 825 830
Ile Val Tyr Arg Asp Gly Asn Pro Tyr Ala Val Cys Asp Lys Cys Leu
835 840 845
Lys Phe Tyr Ser Lys Ile Ser Glu Tyr Arg His Tyr Cys Tyr Ser Leu
850 855 860
Tyr Gly Thr Thr Leu Glu Gln Gln Tyr Asn Lys Pro Leu Cys Asp Leu
865 870 875 880
Leu Ile Arg Cys Ile Asn Val Val Tyr Arg Asp Ser Ile Pro His Ala
885 890 895
Ala Cys His Lys Cys Ile Asp Phe Tyr Ser Arg Ile Arg Glu Leu Arg
900 905 910
His Tyr Ser Asp Ser Val Tyr Gly Asp Thr Leu Glu Lys Leu Thr Asn
915 920 925
Thr Gly Leu Tyr Asn Leu Leu Ile Arg Cys Leu Arg Val Tyr Cys Lys
930 935 940
Gly Gln Leu Thr Glu Thr Glu Val Leu Asp Phe Ala Phe Thr Asp Leu
945 950 955 960
Thr Ile Val Tyr Arg Asp Ser Lys Val Ser Glu Phe Arg Trp Tyr Arg
965 970 975
Tyr Ser Val Tyr Gly Thr Thr Leu Cys Val Glu Cys Lys Lys Thr Leu
980 985 990
Gln Arg Ser Glu Val Tyr Asp Cys Gln Arg Pro Leu Cys Pro Gln Glu
995 1000 1005
Lys Lys Arg His Val Asp Leu Asn Lys Arg Phe His Thr Leu His
1010 1015 1020
Glu Tyr Met Leu Asp Leu Gln Pro Glu Thr Thr Asp Leu Tyr Cys
1025 1030 1035
Tyr Glu Gln Pro Glu Thr Thr Asp Leu His Cys Tyr Glu Gln Leu
1040 1045 1050
Gly Asp Ser Ser Asp Glu Glu Asp Thr Gly Gly Leu Asp Gly Glu
1055 1060 1065
Glu Asn Asp Glu Ile Asp Gly Val Asn His Gln His Leu Pro Ala
1070 1075 1080
Arg Arg Ala Glu Pro Gln Arg His Thr Met Leu Cys Met Cys Cys
1085 1090 1095
Lys Cys Glu Ala Arg Ile Ala
1100 1105
<210> 75
<211> 4470
<212> DNA
<213> 人工序列
<220>
<223> 疫苗构建体
<400> 75
atggatgcta tgaagagggg cctgtgctgc gtgctgctgc tgtgtggcgc cgtgtttgtg 60
tcccccagcc aggaaatcca cgcccggttc agaagaggca gcaagctggc cgacgaggac 120
gagacagcct acgacagcgg caccgacctg atcgacttca tcgacgatag cgccgctgcc 180
gacgagaatg agaacgacag cgataccggc gaggacatgg tggatttcat cgacaacgct 240
gccgccgacg aaaccgacga agagagcacc gagagcgacc tggacggctt tatcgacaac 300
agcgcagccg cccagctggc tgacagcgac tctaatgcct gcgccttcct gaaggccgct 360
gctcagctgg cagacgtgaa cagcaatgcc gccgcttttc tgaaggctgc cgccaactgc 420
atcctgctgt acggcgctgc caacaccggc aagagcctgt tcggcatgtc tctggccgca 480
gccaactgcc tggtgctgtg cggacctgcc aatactggca aaagctactt cggcatgagc 540
ctggcagccg ccaattgtct cgtgatctac ggccctccta ataccggcaa gtcctgcttt 600
gccatgagtc tggccgctgc ctggccctac ctgcactcta gactggtggt gttcaccttc 660
cccaacccct tcgctgccgc ttggccttac ctggaaagcc ggatcaccgt gttcgagttc 720
cccaatgcct tcgccgcagc cctgagatac ctgcacagca gaatccacgt gctgcagttt 780
ctgaacccct ttgccgccgc aaacgtgtgc caggacaaga tcctggaaca ctacgagaac 840
gactccaagg atgccgctgc cattctggaa cattatgaga atgatagcaa ggacctgtgc 900
gaccacattg ctgccgcctg cgatcacatc gactactgga agcacatccg gctggaatgc 960
gccatcatgt acaaggccag agccgccgct atcagactgg aatgtgctat tatgtataag 1020
gctcgcgaga tgggcttcca cgctgctgcc cagttcgacg gcgacatctg caacaccatg 1080
cactacacca actgggctgc cgctatctac atctgcgagg acgcccagtg caccgtggtg 1140
gaaggacagg tggacgccgc tgctaagaaa tgggaggtgc acgctggcgg ccaagtgatc 1200
ctgtgtccag agtctgctgc cgcaggccag cggagaatca agaggcctag aagcgaggca 1260
gccgctaact gccaccccaa caaactgctg gctgctgcca tcctgaagtg cctgcggtac 1320
agattcaaga agcactgcaa actggctgca gctagcagca cctggcactg gacctgtcac 1380
gacggcaagc acaaagccgc cgcatggcat tggacatgcc acgatggaaa acacaagaac 1440
gccatcgtga ccctgaccta tgcagccgcc tacgaggccg acaagaacga cctgaacgcc 1500
cagatcgagc actggaagct gatcaggatg gaatgtgcaa tcttctataa ggccaaagag 1560
ctgggcatca gcgctgccgc aatctgccac caggtggtgc ctccactggc cgcctctaaa 1620
gccaaagcct gccaggccat cgaactgcag ctggccctgg aagccctgaa tgccagccct 1680
tatgccgcag ccgatgagtg gaccctgcag cagaccagcc tggaaatgtg gctggccgaa 1740
cctcaggccg cagcttttaa gaagcacggc atcaccatca ccgtgcagta cgacaacgac 1800
aaggccaata ccatggatta caccaattgg aaagagatct acgccgcagc tgtgatcgtg 1860
tgccccgcca gcatccctag cgacgagatc agcacagagg aagcagccgc cgaccacatc 1920
gattattgga aagccatcag acaggaaaac gccatcttct tcgccgctag agccgctgcc 1980
caccaggtgg tgccagccct gaatatctgc aaggccaagg cctgtaaagc catcgaagcc 2040
gctgcttgga acaccgagcc caagcactgc ttcaagaagg gcggccagca catcgaagtg 2100
tggttcgacg ctgcagccta cgtggcctgg gacagcgtgt actactgtgg cgacgacggc 2160
tggtgcaaga ccgccgctgc agaggccgag aagtatggct gcaagggcac ctgggaagtg 2220
catttcggcg cagctgccaa ctccatcgac tgcaacgaca gcatgtgcag caccttcgac 2280
gacaacgtgt ccgccaccga gctcgtgaaa gctgccgctg accatattga ttactggaaa 2340
ctgattcgcc tggaatgcgc tattttctac aaagccaggg ccgcagcacg gcggctgtcc 2400
tcagatcagg atcagagcca ggctgctgca ctcgtgacca agtaccccct gctgaagctg 2460
ctgagcgccg cagcaagacc ccccaacatg ggagtgaagg cccacggcaa gtgcatctgg 2520
gagaacaagg tgttcatcgt gcccaccctg tgccccgtgc ctctggatcc aacatatcct 2580
ctgctgaaac tgctgaccgc tgccgccacc cagaccacca cacctgagaa tacctccctg 2640
gtggaactga gagtgaccac ccccaagagc acagtcgtga tcaggctgca cctggctgcc 2700
gcaaccacca gataccctct gctgtccctg ctgaacagct acagcacccc ccctcatcgg 2760
atccctgccc cttgtccttg ggctcctcag aggcctgccg ctgcacctat ccctaagcct 2820
tctccatggg cccctgccgc agctagaatc ccagctccat gtccatgggc accagctgct 2880
gctcccagac ctcctcattg cccttgggtg ccagcagccg ctcctccacc tcctcctaga 2940
ccttgggccc cagccgccgc ttgtttcctg ctgtgcttct gtgtgctgct gtgcgtgtgc 3000
ctgctgatca gacccctgct gctgagtgtg tccacctacg cagctgctct gcggccactg 3060
ctgctgtcca tctctgtgta cgcacaggtg ctggtgctgg tgctgctgct gtgggtgtcc 3120
atcggatctg ccgcagcact gctgccctcc gtgtgcatgt gtgcctatgc ctgggtgctg 3180
gtgttcgtgt acatcgtcgt gattaccagc cccgccaccg cagccgcaat cgtgtacagg 3240
gacggcaacc cttacgccgt gtgcgacaag tgcctgaagt tctacagcaa gatcagcgag 3300
taccgccact actgctacag cctgtacggc accaccctgg aacagcagta caacaagccc 3360
ctgtgcgatc tgctgatccg gtgcatcaac gcagccgctg tggtgtacag agacagcatc 3420
ccacacgccg cctgccacaa gtgtatcgac ttctactccc ggatcagaga gctgagacac 3480
tactccgact ccgtgtacgg cgataccctg gaaaagctga ccaataccgg cctgtacaac 3540
ctgctgatta gatgcctgcg ggcagccgca gtgttctgca agaaagccct gaccgccagc 3600
gaggtgtaca acttcgccta caccgatctg cgggtggtgt accgggatgc tgctgcctcc 3660
aaagtgcgga agctgcggta ctacaactgc tctgtgtatg gcgcctccct ggcagctgcc 3720
gtgtattgca agggacagct gaccgagaca gaggtgctgg atttcgcctt cacagacctg 3780
accatcgtgt atagagatgc agctgctagc aaggtgtccg agttccggtg gtacagatat 3840
agcgtgtacg gaacaacact ggcagcagct tgcgtggaat gcaagaaaac actgcagcgg 3900
agcgaagtgt acgatgctgc agcttgccag aggccgctgt gtcctcagga aaagaaaaga 3960
cacgtggacc tgaacaagcg gttccacgca gcagctaccc tgcacgagta catgctggac 4020
ctgcagcccg agacaaccga cctgtactgc tacgagcagg cagctgcacc cgaaaccaca 4080
gatctgcact gttatgagca gctgggagac agctccgatg aagaggacac cggcggactg 4140
gatgctgccg ctggggatga ggacgaggat gaggtggacc atctgcagga acagccccag 4200
caggccagaa gggatgagca gcacccctgc tatctgatcg agacacagtg ctgcagatgc 4260
gagagcctgg tggccgctgc tgaggaaaac gacgagatcg acggcgtgaa ccaccagcat 4320
ctgcccgcta gaagggccga gcctcagaga cacaccatgc tgtgtatgtg ttgcaagtgc 4380
gaggcccgga tcgccggatc tggacctggc gctagcggaa agcccatccc caatccactg 4440
ctgggcctgg actccacccg gacctgataa 4470
<210> 76
<211> 1488
<212> PRT
<213> 人工序列
<220>
<223> 疫苗构建体
<400> 76
Met Asp Ala Met Lys Arg Gly Leu Cys Cys Val Leu Leu Leu Cys Gly
1 5 10 15
Ala Val Phe Val Ser Pro Ser Gln Glu Ile His Ala Arg Phe Arg Arg
20 25 30
Gly Ser Lys Leu Ala Asp Glu Asp Glu Thr Ala Tyr Asp Ser Gly Thr
35 40 45
Asp Leu Ile Asp Phe Ile Asp Asp Ser Ala Ala Ala Asp Glu Asn Glu
50 55 60
Asn Asp Ser Asp Thr Gly Glu Asp Met Val Asp Phe Ile Asp Asn Ala
65 70 75 80
Ala Ala Asp Glu Thr Asp Glu Glu Ser Thr Glu Ser Asp Leu Asp Gly
85 90 95
Phe Ile Asp Asn Ser Ala Ala Ala Gln Leu Ala Asp Ser Asp Ser Asn
100 105 110
Ala Cys Ala Phe Leu Lys Ala Ala Ala Gln Leu Ala Asp Val Asn Ser
115 120 125
Asn Ala Ala Ala Phe Leu Lys Ala Ala Ala Asn Cys Ile Leu Leu Tyr
130 135 140
Gly Ala Ala Asn Thr Gly Lys Ser Leu Phe Gly Met Ser Leu Ala Ala
145 150 155 160
Ala Asn Cys Leu Val Leu Cys Gly Pro Ala Asn Thr Gly Lys Ser Tyr
165 170 175
Phe Gly Met Ser Leu Ala Ala Ala Asn Cys Leu Val Ile Tyr Gly Pro
180 185 190
Pro Asn Thr Gly Lys Ser Cys Phe Ala Met Ser Leu Ala Ala Ala Trp
195 200 205
Pro Tyr Leu His Ser Arg Leu Val Val Phe Thr Phe Pro Asn Pro Phe
210 215 220
Ala Ala Ala Trp Pro Tyr Leu Glu Ser Arg Ile Thr Val Phe Glu Phe
225 230 235 240
Pro Asn Ala Phe Ala Ala Ala Leu Arg Tyr Leu His Ser Arg Ile His
245 250 255
Val Leu Gln Phe Leu Asn Pro Phe Ala Ala Ala Asn Val Cys Gln Asp
260 265 270
Lys Ile Leu Glu His Tyr Glu Asn Asp Ser Lys Asp Ala Ala Ala Ile
275 280 285
Leu Glu His Tyr Glu Asn Asp Ser Lys Asp Leu Cys Asp His Ile Ala
290 295 300
Ala Ala Cys Asp His Ile Asp Tyr Trp Lys His Ile Arg Leu Glu Cys
305 310 315 320
Ala Ile Met Tyr Lys Ala Arg Ala Ala Ala Ile Arg Leu Glu Cys Ala
325 330 335
Ile Met Tyr Lys Ala Arg Glu Met Gly Phe His Ala Ala Ala Gln Phe
340 345 350
Asp Gly Asp Ile Cys Asn Thr Met His Tyr Thr Asn Trp Ala Ala Ala
355 360 365
Ile Tyr Ile Cys Glu Asp Ala Gln Cys Thr Val Val Glu Gly Gln Val
370 375 380
Asp Ala Ala Ala Lys Lys Trp Glu Val His Ala Gly Gly Gln Val Ile
385 390 395 400
Leu Cys Pro Glu Ser Ala Ala Ala Gly Gln Arg Arg Ile Lys Arg Pro
405 410 415
Arg Ser Glu Ala Ala Ala Asn Cys His Pro Asn Lys Leu Leu Ala Ala
420 425 430
Ala Ile Leu Lys Cys Leu Arg Tyr Arg Phe Lys Lys His Cys Lys Leu
435 440 445
Ala Ala Ala Ser Ser Thr Trp His Trp Thr Cys His Asp Gly Lys His
450 455 460
Lys Ala Ala Ala Trp His Trp Thr Cys His Asp Gly Lys His Lys Asn
465 470 475 480
Ala Ile Val Thr Leu Thr Tyr Ala Ala Ala Tyr Glu Ala Asp Lys Asn
485 490 495
Asp Leu Asn Ala Gln Ile Glu His Trp Lys Leu Ile Arg Met Glu Cys
500 505 510
Ala Ile Phe Tyr Lys Ala Lys Glu Leu Gly Ile Ser Ala Ala Ala Ile
515 520 525
Cys His Gln Val Val Pro Pro Leu Ala Ala Ser Lys Ala Lys Ala Cys
530 535 540
Gln Ala Ile Glu Leu Gln Leu Ala Leu Glu Ala Leu Asn Ala Ser Pro
545 550 555 560
Tyr Ala Ala Ala Asp Glu Trp Thr Leu Gln Gln Thr Ser Leu Glu Met
565 570 575
Trp Leu Ala Glu Pro Gln Ala Ala Ala Phe Lys Lys His Gly Ile Thr
580 585 590
Ile Thr Val Gln Tyr Asp Asn Asp Lys Ala Asn Thr Met Asp Tyr Thr
595 600 605
Asn Trp Lys Glu Ile Tyr Ala Ala Ala Val Ile Val Cys Pro Ala Ser
610 615 620
Ile Pro Ser Asp Glu Ile Ser Thr Glu Glu Ala Ala Ala Asp His Ile
625 630 635 640
Asp Tyr Trp Lys Ala Ile Arg Gln Glu Asn Ala Ile Phe Phe Ala Ala
645 650 655
Arg Ala Ala Ala His Gln Val Val Pro Ala Leu Asn Ile Cys Lys Ala
660 665 670
Lys Ala Cys Lys Ala Ile Glu Ala Ala Ala Trp Asn Thr Glu Pro Lys
675 680 685
His Cys Phe Lys Lys Gly Gly Gln His Ile Glu Val Trp Phe Asp Ala
690 695 700
Ala Ala Tyr Val Ala Trp Asp Ser Val Tyr Tyr Cys Gly Asp Asp Gly
705 710 715 720
Trp Cys Lys Thr Ala Ala Ala Glu Ala Glu Lys Tyr Gly Cys Lys Gly
725 730 735
Thr Trp Glu Val His Phe Gly Ala Ala Ala Asn Ser Ile Asp Cys Asn
740 745 750
Asp Ser Met Cys Ser Thr Phe Asp Asp Asn Val Ser Ala Thr Glu Leu
755 760 765
Val Lys Ala Ala Ala Asp His Ile Asp Tyr Trp Lys Leu Ile Arg Leu
770 775 780
Glu Cys Ala Ile Phe Tyr Lys Ala Arg Ala Ala Ala Arg Arg Leu Ser
785 790 795 800
Ser Asp Gln Asp Gln Ser Gln Ala Ala Ala Leu Val Thr Lys Tyr Pro
805 810 815
Leu Leu Lys Leu Leu Ser Ala Ala Ala Arg Pro Pro Asn Met Gly Val
820 825 830
Lys Ala His Gly Lys Cys Ile Trp Glu Asn Lys Val Phe Ile Val Pro
835 840 845
Thr Leu Cys Pro Val Pro Leu Asp Pro Thr Tyr Pro Leu Leu Lys Leu
850 855 860
Leu Thr Ala Ala Ala Thr Gln Thr Thr Thr Pro Glu Asn Thr Ser Leu
865 870 875 880
Val Glu Leu Arg Val Thr Thr Pro Lys Ser Thr Val Val Ile Arg Leu
885 890 895
His Leu Ala Ala Ala Thr Thr Arg Tyr Pro Leu Leu Ser Leu Leu Asn
900 905 910
Ser Tyr Ser Thr Pro Pro His Arg Ile Pro Ala Pro Cys Pro Trp Ala
915 920 925
Pro Gln Arg Pro Ala Ala Ala Pro Ile Pro Lys Pro Ser Pro Trp Ala
930 935 940
Pro Ala Ala Ala Arg Ile Pro Ala Pro Cys Pro Trp Ala Pro Ala Ala
945 950 955 960
Ala Pro Arg Pro Pro His Cys Pro Trp Val Pro Ala Ala Ala Pro Pro
965 970 975
Pro Pro Pro Arg Pro Trp Ala Pro Ala Ala Ala Cys Phe Leu Leu Cys
980 985 990
Phe Cys Val Leu Leu Cys Val Cys Leu Leu Ile Arg Pro Leu Leu Leu
995 1000 1005
Ser Val Ser Thr Tyr Ala Ala Ala Leu Arg Pro Leu Leu Leu Ser
1010 1015 1020
Ile Ser Val Tyr Ala Gln Val Leu Val Leu Val Leu Leu Leu Trp
1025 1030 1035
Val Ser Ile Gly Ser Ala Ala Ala Leu Leu Pro Ser Val Cys Met
1040 1045 1050
Cys Ala Tyr Ala Trp Val Leu Val Phe Val Tyr Ile Val Val Ile
1055 1060 1065
Thr Ser Pro Ala Thr Ala Ala Ala Ile Val Tyr Arg Asp Gly Asn
1070 1075 1080
Pro Tyr Ala Val Cys Asp Lys Cys Leu Lys Phe Tyr Ser Lys Ile
1085 1090 1095
Ser Glu Tyr Arg His Tyr Cys Tyr Ser Leu Tyr Gly Thr Thr Leu
1100 1105 1110
Glu Gln Gln Tyr Asn Lys Pro Leu Cys Asp Leu Leu Ile Arg Cys
1115 1120 1125
Ile Asn Ala Ala Ala Val Val Tyr Arg Asp Ser Ile Pro His Ala
1130 1135 1140
Ala Cys His Lys Cys Ile Asp Phe Tyr Ser Arg Ile Arg Glu Leu
1145 1150 1155
Arg His Tyr Ser Asp Ser Val Tyr Gly Asp Thr Leu Glu Lys Leu
1160 1165 1170
Thr Asn Thr Gly Leu Tyr Asn Leu Leu Ile Arg Cys Leu Arg Ala
1175 1180 1185
Ala Ala Val Phe Cys Lys Lys Ala Leu Thr Ala Ser Glu Val Tyr
1190 1195 1200
Asn Phe Ala Tyr Thr Asp Leu Arg Val Val Tyr Arg Asp Ala Ala
1205 1210 1215
Ala Ser Lys Val Arg Lys Leu Arg Tyr Tyr Asn Cys Ser Val Tyr
1220 1225 1230
Gly Ala Ser Leu Ala Ala Ala Val Tyr Cys Lys Gly Gln Leu Thr
1235 1240 1245
Glu Thr Glu Val Leu Asp Phe Ala Phe Thr Asp Leu Thr Ile Val
1250 1255 1260
Tyr Arg Asp Ala Ala Ala Ser Lys Val Ser Glu Phe Arg Trp Tyr
1265 1270 1275
Arg Tyr Ser Val Tyr Gly Thr Thr Leu Ala Ala Ala Cys Val Glu
1280 1285 1290
Cys Lys Lys Thr Leu Gln Arg Ser Glu Val Tyr Asp Ala Ala Ala
1295 1300 1305
Cys Gln Arg Pro Leu Cys Pro Gln Glu Lys Lys Arg His Val Asp
1310 1315 1320
Leu Asn Lys Arg Phe His Ala Ala Ala Thr Leu His Glu Tyr Met
1325 1330 1335
Leu Asp Leu Gln Pro Glu Thr Thr Asp Leu Tyr Cys Tyr Glu Gln
1340 1345 1350
Ala Ala Ala Pro Glu Thr Thr Asp Leu His Cys Tyr Glu Gln Leu
1355 1360 1365
Gly Asp Ser Ser Asp Glu Glu Asp Thr Gly Gly Leu Asp Ala Ala
1370 1375 1380
Ala Gly Asp Glu Asp Glu Asp Glu Val Asp His Leu Gln Glu Gln
1385 1390 1395
Pro Gln Gln Ala Arg Arg Asp Glu Gln His Pro Cys Tyr Leu Ile
1400 1405 1410
Glu Thr Gln Cys Cys Arg Cys Glu Ser Leu Val Ala Ala Ala Glu
1415 1420 1425
Glu Asn Asp Glu Ile Asp Gly Val Asn His Gln His Leu Pro Ala
1430 1435 1440
Arg Arg Ala Glu Pro Gln Arg His Thr Met Leu Cys Met Cys Cys
1445 1450 1455
Lys Cys Glu Ala Arg Ile Ala Gly Ser Gly Pro Gly Ala Ser Gly
1460 1465 1470
Lys Pro Ile Pro Asn Pro Leu Leu Gly Leu Asp Ser Thr Arg Thr
1475 1480 1485
<210> 77
<211> 20
<212> PRT
<213> 人工序列
<220>
<223> 共有序列
<220>
<221> 尚未归类的特征
<222> (11)..(11)
<223> Xaa可以是任何天然氨基酸
<400> 77
Asp Glu Asp Glu Asn Ala Ser Asp Thr Gly Xaa Asp Leu Val Asp Phe
1 5 10 15
Ile Asp Asn Ser
20
<210> 78
<211> 19
<212> PRT
<213> 人乳头瘤病毒16型
<400> 78
Asp Glu Asn Glu Asn Asp Ser Thr Gly Glu Asp Leu Val Asp Phe Ile
1 5 10 15
Val Asn Asp
<210> 79
<211> 20
<212> PRT
<213> 人乳头瘤病毒18型
<400> 79
Asp Glu Asp Glu Asn Ala Thr Asp Thr Gly Ser Asp Met Val Asp Phe
1 5 10 15
Ile Asp Thr Gln
20
<210> 80
<211> 20
<212> PRT
<213> 人乳头瘤病毒31型
<400> 80
Asp Glu Asn Glu Asp Ser Ser Asp Thr Gly Glu Asp Met Val Asp Phe
1 5 10 15
Ile Asp Asn Cys
20
<210> 81
<211> 20
<212> PRT
<213> 人乳头瘤病毒52型
<400> 81
Asp Glu Asp Glu Asn Ala Tyr Asp Ser Gly Thr Asp Leu Ile Asp Phe
1 5 10 15
Ile Asp Asp Ser
20
<210> 82
<211> 19
<212> PRT
<213> 人乳头瘤病毒53型
<400> 82
Asp Glu Thr Asp Glu Glu Ser Thr Glu Ser Asp Leu Asp Gly Phe Ile
1 5 10 15
Asp Asn Ser
<210> 83
<211> 20
<212> PRT
<213> 人乳头瘤病毒58型
<400> 83
Asp Glu Asp Glu Thr Ala Asp Asp Ser Gly Thr Asp Leu Ile Glu Phe
1 5 10 15
Ile Asp Asp Ser
20
<210> 84
<211> 15
<212> PRT
<213> 人工序列
<220>
<223> 共有序列
<400> 84
Ala Gln Leu Ala Asp Val Asn Ser Asn Ala Ala Ala Phe Leu Lys
1 5 10 15
<210> 85
<211> 15
<212> PRT
<213> 人乳头瘤病毒16型
<400> 85
Ala Gln Leu Ala Asp Thr Asn Ser Asn Ala Ser Ala Phe Leu Lys
1 5 10 15
<210> 86
<211> 15
<212> PRT
<213> 人乳头瘤病毒18型
<400> 86
Ala Leu Leu Ala Asp Ser Asn Ser Asn Ala Ala Ala Phe Leu Lys
1 5 10 15
<210> 87
<211> 15
<212> PRT
<213> 人乳头瘤病毒31型
<400> 87
Ala Gln Leu Ala Asp Ser Asp Ser Asn Ala Cys Ala Phe Leu Lys
1 5 10 15
<210> 88
<211> 15
<212> PRT
<213> 人乳头瘤病毒52型
<400> 88
Ala Gln Leu Ala Asp Val Asn Ser Asn Ala Ala Ala Phe Leu Lys
1 5 10 15
<210> 89
<211> 15
<212> PRT
<213> 人乳头瘤病毒53型
<400> 89
Ala Gln Leu Ala Asp Val Asp Ser Asn Ala Gln Ala Phe Leu Lys
1 5 10 15
<210> 90
<211> 15
<212> PRT
<213> 人乳头瘤病毒58型
<400> 90
Ala Gln Leu Ala Asp Val Asn Ser Asn Ala Ala Ala Phe Leu Arg
1 5 10 15
<210> 91
<211> 20
<212> PRT
<213> 人工序列
<220>
<223> 共有序列
<220>
<221> 尚未归类的特征
<222> (4)..(4)
<223> Xaa可以是任何天然氨基酸
<400> 91
Asn Cys Leu Xaa Leu Tyr Gly Pro Ala Asn Thr Gly Lys Ser Tyr Phe
1 5 10 15
Gly Met Ser Leu
20
<210> 92
<211> 20
<212> PRT
<213> 人乳头瘤病毒16型
<400> 92
Asn Cys Ile Leu Leu Tyr Gly Ala Ala Asn Thr Gly Lys Ser Leu Phe
1 5 10 15
Gly Met Ser Leu
20
<210> 93
<211> 20
<212> PRT
<213> 人乳头瘤病毒18型
<400> 93
Asn Cys Leu Val Phe Cys Gly Pro Ala Asn Thr Gly Lys Ser Tyr Phe
1 5 10 15
Gly Met Ser Phe
20
<210> 94
<211> 20
<212> PRT
<213> 人乳头瘤病毒31型
<400> 94
Asn Cys Ile Leu Ile His Gly Ala Pro Asn Thr Gly Lys Ser Tyr Phe
1 5 10 15
Gly Met Ser Leu
20
<210> 95
<211> 20
<212> PRT
<213> 人乳头瘤病毒52型
<400> 95
Asn Cys Leu Val Leu Tyr Gly Pro Ala Asn Thr Gly Lys Ser Tyr Phe
1 5 10 15
Gly Met Ser Leu
20
<210> 96
<211> 20
<212> PRT
<213> 人乳头瘤病毒53型
<400> 96
Asn Cys Leu Val Ile Tyr Gly Pro Pro Asn Thr Gly Lys Ser Cys Gly
1 5 10 15
Ala Met Ser Leu
20
<210> 97
<211> 20
<212> PRT
<213> 人乳头瘤病毒58型
<400> 97
Ser Cys Met Leu Leu Cys Gly Pro Ala Asn Thr Gly Lys Ser Tyr Phe
1 5 10 15
Gly Met Ser Leu
20
<210> 98
<211> 20
<212> PRT
<213> 人工序列
<220>
<223> 共有序列
<220>
<221> 尚未归类的特征
<222> (11)..(11)
<223> Xaa可以是任何天然氨基酸
<400> 98
Asp Glu Asp Glu Asn Ala Ser Asp Thr Gly Xaa Asp Leu Val Asp Phe
1 5 10 15
Ile Asp Asn Ser
20
<210> 99
<211> 20
<212> PRT
<213> 人乳头瘤病毒16型
<400> 99
Asp Glu Asn Glu Asn Asp Ser Asp Thr Gly Glu Asp Leu Val Asp Phe
1 5 10 15
Ile Val Asn Asp
20
<210> 100
<211> 20
<212> PRT
<213> 人乳头瘤病毒18型
<400> 100
Asp Glu Asp Glu Asn Ala Thr Asp Thr Gly Ser Asp Met Val Asp Phe
1 5 10 15
Ile Asp Thr Gln
20
<210> 101
<211> 20
<212> PRT
<213> 人乳头瘤病毒31型
<400> 101
Asp Glu Asn Glu Asp Ser Ser Asp Thr Gly Glu Asp Met Val Asp Phe
1 5 10 15
Ile Asp Asn Cys
20
<210> 102
<211> 20
<212> PRT
<213> 人乳头瘤病毒52型
<400> 102
Asp Glu Asp Glu Asn Ala Tyr Asp Ser Gly Thr Asp Leu Ile Asp Phe
1 5 10 15
Ile Asp Asp Ser
20
<210> 103
<211> 19
<212> PRT
<213> 人乳头瘤病毒53型
<400> 103
Asp Glu Thr Asp Glu Glu Ser Thr Glu Ser Asp Leu Asp Gly Phe Ile
1 5 10 15
Asp Asn Ser
<210> 104
<211> 20
<212> PRT
<213> 人乳头瘤病毒58型
<400> 104
Asp Glu Asp Glu Thr Ala Asp Asp Ser Gly Thr Asp Leu Ile Glu Phe
1 5 10 15
Ile Asp Asp Ser
20
<210> 105
<211> 20
<212> PRT
<213> 人工序列
<220>
<223> 共有序列
<220>
<221> 尚未归类的特征
<222> (5)..(5)
<223> Xaa可以是任何天然氨基酸
<220>
<221> 尚未归类的特征
<222> (7)..(7)
<223> Xaa可以是任何天然氨基酸
<220>
<221> 尚未归类的特征
<222> (15)..(15)
<223> Xaa可以是任何天然氨基酸
<400> 105
Asp Glu Asp Glu Xaa Ala Xaa Asp Ser Gly Thr Asp Leu Ile Xaa Phe
1 5 10 15
Ile Asp Asp Ser
20
<210> 106
<211> 20
<212> PRT
<213> 人乳头瘤病毒52型
<400> 106
Asp Glu Asp Glu Asn Ala Tyr Asp Ser Gly Thr Asp Leu Ile Asp Phe
1 5 10 15
Ile Asp Asp Ser
20
<210> 107
<211> 20
<212> PRT
<213> 人乳头瘤病毒58型
<400> 107
Asp Glu Asp Glu Thr Ala Asp Asp Ser Gly Thr Asp Leu Ile Glu Phe
1 5 10 15
Ile Asp Asp Ser
20
<210> 108
<211> 15
<212> PRT
<213> 人工序列
<220>
<223> 共有序列
<400> 108
Ala Gln Leu Ala Asp Ser Asp Ser Asn Ala Cys Ala Phe Leu Lys
1 5 10 15
<210> 109
<211> 15
<212> PRT
<213> 人乳头瘤病毒31型
<400> 109
Ala Gln Leu Ala Asp Ser Asp Ser Asn Ala Cys Ala Phe Leu Lys
1 5 10 15
<210> 110
<211> 15
<212> PRT
<213> 人乳头瘤病毒53型
<400> 110
Ala Gln Leu Ala Asp Val Asp Ser Asn Ala Gln Ala Phe Leu Lys
1 5 10 15
<210> 111
<211> 20
<212> PRT
<213> 人工序列
<220>
<223> 共有序列
<400> 111
Asn Cys Ile Leu Leu Tyr Gly Ala Ala Asn Thr Gly Lys Ser Leu Phe
1 5 10 15
Gly Met Ser Leu
20
<210> 112
<211> 20
<212> PRT
<213> 人乳头瘤病毒16型
<400> 112
Asn Cys Ile Leu Leu Tyr Gly Ala Ala Asn Thr Gly Lys Ser Leu Phe
1 5 10 15
Gly Met Ser Leu
20
<210> 113
<211> 20
<212> PRT
<213> 人乳头瘤病毒31型
<400> 113
Asn Cys Ile Leu Ile His Gly Ala Pro Asn Thr Gly Lys Ser Tyr Phe
1 5 10 15
Gly Met Ser Leu
20
<210> 114
<211> 17
<212> PRT
<213> 人工序列
<220>
<223> 共有序列
<400> 114
Trp Pro Tyr Leu His Ser Arg Leu Val Val Phe Thr Phe Pro Asn Pro
1 5 10 15
Phe
<210> 115
<211> 17
<212> PRT
<213> 人乳头瘤病毒16型
<400> 115
Trp Pro Tyr Leu His Asn Arg Leu Val Val Phe Thr Phe Pro Asn Glu
1 5 10 15
Phe
<210> 116
<211> 17
<212> PRT
<213> 人乳头瘤病毒31型
<400> 116
Trp Pro Tyr Leu His Ser Arg Leu Val Val Phe Thr Phe Pro Asn Pro
1 5 10 15
Phe
<210> 117
<211> 17
<212> PRT
<213> 人乳头瘤病毒52型
<400> 117
Trp Pro Tyr Leu His Ser Arg Leu Val Val Phe His Phe Lys Asn Pro
1 5 10 15
Phe
<210> 118
<211> 19
<212> PRT
<213> 人工序列
<220>
<223> 共有序列
<220>
<221> 尚未归类的特征
<222> (6)..(6)
<223> Xaa可以是任何天然氨基酸
<400> 118
Asp Glu Asn Glu Asn Xaa Ser Asp Thr Gly Glu Asp Met Val Asp Phe
1 5 10 15
Ile Asp Asn
<210> 119
<211> 19
<212> PRT
<213> 人乳头瘤病毒31型
<400> 119
Asp Glu Asn Glu Asp Ser Ser Asp Thr Gly Glu Asp Met Val Asp Phe
1 5 10 15
Ile Asp Asn
<210> 120
<211> 19
<212> PRT
<213> 人乳头瘤病毒16型
<400> 120
Asp Glu Asn Glu Asn Asp Ser Asp Thr Gly Glu Asp Leu Val Asp Phe
1 5 10 15
Ile Val Asn
<210> 121
<211> 19
<212> PRT
<213> 人乳头瘤病毒18型
<400> 121
Asp Glu Asp Glu Asn Ala Thr Asp Thr Gly Ser Asp Met Val Asp Phe
1 5 10 15
Ile Asp Thr
<210> 122
<211> 19
<212> PRT
<213> 人乳头瘤病毒53型
<400> 122
Asp Glu Thr Asp Glu Glu Ser Thr Glu Ser Asp Leu Asp Gly Phe Ile
1 5 10 15
Asp Asn Ser
<210> 123
<211> 15
<212> PRT
<213> 人工序列
<220>
<223> 共有序列
<400> 123
Ala Gln Leu Ala Asp Val Asn Ser Asn Ala Ala Ala Phe Leu Lys
1 5 10 15
<210> 124
<211> 15
<212> PRT
<213> 人乳头瘤病毒52型
<400> 124
Ala Gln Leu Ala Asp Val Asn Ser Asn Ala Ala Ala Phe Leu Lys
1 5 10 15
<210> 125
<211> 15
<212> PRT
<213> 人乳头瘤病毒58型
<400> 125
Ala Gln Leu Ala Asp Val Asn Ser Asn Ala Ala Ala Phe Leu Arg
1 5 10 15
<210> 126
<211> 15
<212> PRT
<213> 人乳头瘤病毒18型
<400> 126
Ala Leu Leu Ala Asp Ser Asn Ser Asn Ala Ala Ala Phe Leu Lys
1 5 10 15
<210> 127
<211> 15
<212> PRT
<213> 人乳头瘤病毒16型
<400> 127
Ala Gln Leu Ala Asp Thr Asn Ser Asn Ala Ser Ala Phe Leu Lys
1 5 10 15
<210> 128
<211> 20
<212> PRT
<213> 人工序列
<220>
<223> 共有序列
<400> 128
Asn Cys Leu Val Leu Cys Gly Pro Ala Asn Thr Gly Lys Ser Tyr Phe
1 5 10 15
Gly Met Ser Leu
20
<210> 129
<211> 20
<212> PRT
<213> 人乳头瘤病毒18型
<400> 129
Asn Cys Leu Val Phe Cys Gly Pro Ala Asn Thr Gly Lys Ser Tyr Phe
1 5 10 15
Gly Met Ser Phe
20
<210> 130
<211> 20
<212> PRT
<213> 人乳头瘤病毒52型
<400> 130
Asn Cys Leu Val Leu Tyr Gly Pro Ala Asn Thr Gly Lys Ser Tyr Phe
1 5 10 15
Gly Met Ser Leu
20
<210> 131
<211> 20
<212> PRT
<213> 人乳头瘤病毒58型
<400> 131
Ser Cys Met Leu Leu Cys Gly Pro Ala Asn Thr Gly Lys Ser Tyr Phe
1 5 10 15
Gly Met Ser Leu
20
<210> 132
<211> 20
<212> PRT
<213> 人乳头瘤病毒53型
<400> 132
Asn Cys Leu Val Ile Tyr Gly Pro Pro Asn Thr Gly Lys Ser Cys Phe
1 5 10 15
Ala Met Ser Leu
20
<210> 133
<211> 16
<212> PRT
<213> 人工序列
<220>
<223> 共有序列
<400> 133
Trp Pro Tyr Leu Glu Ser Arg Thr Val Phe Glu Phe Pro Asn Ala Phe
1 5 10 15
<210> 134
<211> 17
<212> PRT
<213> 人乳头瘤病毒18型
<400> 134
Trp Pro Tyr Leu Glu Ser Arg Ile Thr Val Phe Glu Phe Pro Asn Ala
1 5 10 15
Phe
<210> 135
<211> 17
<212> PRT
<213> 人乳头瘤病毒58型
<400> 135
Trp Pro Tyr Leu His Ser Arg Leu Thr Val Phe Glu Phe Asn Asn Pro
1 5 10 15
Phe
<210> 136
<211> 17
<212> PRT
<213> 人乳头瘤病毒53型
<400> 136
Leu Arg Tyr Leu His Ser Arg Ile His Val Leu Gln Phe Leu Asn Pro
1 5 10 15
Phe
<210> 137
<211> 20
<212> PRT
<213> 人工序列
<220>
<223> 共有序列
<400> 137
Asp His Ile Asp Tyr Trp Lys His Met Arg Leu Glu Cys Ala Ile Tyr
1 5 10 15
Tyr Lys Ala Arg
20
<210> 138
<211> 20
<212> PRT
<213> 人乳头瘤病毒16型
<400> 138
Asp His Ile Asp Tyr Trp Lys His Met Arg Leu Glu Cys Ala Ile Tyr
1 5 10 15
Tyr Lys Ala Arg
20
<210> 139
<211> 20
<212> PRT
<213> 人乳头瘤病毒18型
<400> 139
Ser Gln Ile Gln Tyr Trp Gln Leu Ile Arg Trp Glu Asn Ala Ile Phe
1 5 10 15
Phe Ala Ala Arg
20
<210> 140
<211> 20
<212> PRT
<213> 人乳头瘤病毒31型
<400> 140
Asp His Ile Asp Tyr Trp Lys His Ile Arg Leu Glu Cys Val Leu Met
1 5 10 15
Tyr Lys Ala Arg
20
<210> 141
<211> 20
<212> PRT
<213> 人乳头瘤病毒52型
<400> 141
Ala Gln Ile Glu His Trp Lys Leu Thr Arg Met Glu Cys Val Leu Phe
1 5 10 15
Tyr Lys Ala Lys
20
<210> 142
<211> 20
<212> PRT
<213> 人乳头瘤病毒53型
<400> 142
Asp His Ile Asp Tyr Trp Lys Ala Val Arg Gln Glu Asn Val Ile Tyr
1 5 10 15
Tyr Lys Ala Arg
20
<210> 143
<211> 20
<212> PRT
<213> 人乳头瘤病毒58型
<400> 143
Ser Gln Ile Glu His Trp Lys Leu Ile Arg Met Glu Cys Ala Ile Met
1 5 10 15
Tyr Thr Ala Arg
20
<210> 144
<211> 20
<212> PRT
<213> 人工序列
<220>
<223> 共有序列
<400> 144
Asp His Ile Asp Tyr Trp Lys Leu Ile Arg Leu Glu Cys Ala Ile Phe
1 5 10 15
Tyr Lys Ala Arg
20
<210> 145
<211> 15
<212> PRT
<213> 人工序列
<220>
<223> 共有序列
<220>
<221> 尚未归类的特征
<222> (1)..(3)
<223> Xaa可以是任何天然氨基酸
<400> 145
Xaa Xaa Xaa Pro Ile Pro Pro Pro Cys Pro Trp Ala Pro Lys Lys
1 5 10 15
<210> 146
<211> 16
<212> PRT
<213> 人乳头瘤病毒16型
<400> 146
Thr Pro Pro Arg Pro Ile Pro Lys Pro Ser Pro Trp Ala Pro Lys Lys
1 5 10 15
<210> 147
<211> 16
<212> PRT
<213> 人乳头瘤病毒18型
<400> 147
Thr Pro Pro His Arg Ile Pro Ala Pro Cys Pro Trp Ala Pro Gln Arg
1 5 10 15
<210> 148
<211> 16
<212> PRT
<213> 人乳头瘤病毒31型
<400> 148
Thr Pro Pro His Arg Ile Pro Lys Pro Ala Pro Trp Ala Pro Val Lys
1 5 10 15
<210> 149
<211> 12
<212> PRT
<213> 人乳头瘤病毒52型
<400> 149
Pro Arg Pro Pro His Cys Pro Trp Val Pro Lys Thr
1 5 10
<210> 150
<211> 12
<212> PRT
<213> 人乳头瘤病毒53型
<400> 150
Pro Pro Pro Pro Pro Arg Pro Trp Ala Pro Thr Lys
1 5 10
<210> 151
<211> 5
<212> PRT
<213> 人乳头瘤病毒58型
<400> 151
Pro Pro Thr Thr Lys
1 5
<210> 152
<211> 10
<212> PRT
<213> 人工序列
<220>
<223> 共有序列
<400> 152
Pro Ile Pro Lys Pro Ser Pro Trp Ala Pro
1 5 10
<210> 153
<211> 10
<212> PRT
<213> 人乳头瘤病毒16型
<400> 153
Pro Ile Pro Lys Pro Ser Pro Trp Ala Pro
1 5 10
<210> 154
<211> 10
<212> PRT
<213> 人乳头瘤病毒31型
<400> 154
Arg Ile Pro Lys Pro Ala Pro Trp Ala Pro
1 5 10
<210> 155
<211> 10
<212> PRT
<213> 人乳头瘤病毒18型
<400> 155
Arg Ile Pro Ala Pro Cys Pro Trp Ala Pro
1 5 10
<210> 156
<211> 10
<212> PRT
<213> 人乳头瘤病毒52型
<400> 156
Pro Arg Pro Pro His Cys Pro Trp Val Pro
1 5 10
<210> 157
<211> 10
<212> PRT
<213> 人乳头瘤病毒53型
<400> 157
Pro Pro Pro Pro Pro Arg Pro Trp Ala Pro
1 5 10
<210> 158
<211> 9713
<212> DNA
<213> 人工序列
<220>
<223> 质粒构建体
<400> 158
agcgcccaat acgcaaaccg cctctccccg cgcgttggcc gattcattaa tgcagctggc 60
acgacaggtt tcccgactgg aaagcgggca gtgagcgcaa cgcaattaat gtgagttagc 120
tcactcatta ggcaccccag gctttacact ttatgcttcc ggctcgtatg ttgtgtggaa 180
ttgtgagcgg ataacaattt cacacaggaa acagctatga ccatgattac gccaagcttg 240
catgcatctg gaaacgggca tctccattta agactagatg ccacggggtt taaaatacta 300
atcatgacat tttgtagagc gtaattactt agtaaatccg ccgtactagg ttcatttcct 360
cctcgtttgg atctcacatc agaaattaaa ataatcttag aaggatgcag ttgttttttg 420
atggatcgta gatattcctc atcaacgaac cgagtcacta gagtcacatc acgcaatcca 480
tttaaaatag gatcatgatg gcggccgtca attagcatcc atttgatgat cactcctaaa 540
ttatagaaat gatctctcaa ataacgtata tgtgtaccgg gagcagatcc tatatacact 600
acggtggcac catctaatat accgtgtcgc tgtaacttac taagaaaaaa taattctcct 660
agtaatagtt ttaactgtcc ttgatacggt agtttttttg cgacctcatt tgcactttct 720
ggttcgtaat ctaactcatt atcaatttcc tcaaaataca taaacggttt atctaacgac 780
acaacatcca tttttaagta ttatattaaa atttaatcaa tgtttatttt tagtttttta 840
gataaaaaat ataatattat gagtcgatgt aacactttct acacaccgat tgatacatat 900
cattacctcc tattatctct atctcggttt cctcacccaa tcgtttagaa aaggaagcct 960
ccttaaagca tttcatacac acagcagtta gttttaccac catttcagat aatggaataa 1020
gattcaaaat attattaaac ggtttacgtt gaaatgtccc atcgagtgcg gctactataa 1080
ctatttttcc ttcgtttgcc atacagatcc tacgtactcg agcggccgct tatcaggtcc 1140
gggtggagtc caggcccagc agtggattgg ggataggctt gccagaggcg ccaggtccag 1200
agccggcgat tctggcctcg cacttgcagc acatacacag catggtgtgt ctctgaggct 1260
cggcccttct agcgggcaga tgctggtggt tcacgccgtc gatctcgtcg ttctcttcca 1320
ccagagattc gcatctgcag cactgtgtct cgatcagata gcaagggtgc tgttcgtccc 1380
gtctagcctg ctggggctgt tcctgcagat ggtccacttc gtcctcatcc tcgtccccat 1440
ccaggccgcc agtgtcctct tcatcggagc tgtctcccag ctgctcataa cagtgcagat 1500
cagtggtttc aggctgctcg tagcagtaca ggtcggttgt ctcgggctgc agatccagca 1560
tgtactcgtg cagggtgtgg aaccgcttgt tcaggtccac gtgtcttttc ttttcctgcg 1620
gacacagtgg ccgctggcag tcgtacacct cagatctctg cagggttttc ttgcattcca 1680
cgcacagtgt ggtgccatac acggaatatc tgtaccaccg gaactcggac accttggagt 1740
cgcgatacac gattgtcagg tctgtgaagg cgaaatccag cacctctgtc tcggtcagct 1800
gtcccttgca atacaccagg ctggcgccat acacagagca gttgtagtac ctcagcttcc 1860
gcactttgct gtcccgatac accacccgca gatcggtgta ggcgaagttg tacacctcgc 1920
tggctgtcag ggccttcttg cagaacaccc gcaggcatct aatcagcagg ttgtacaggc 1980
cagtgttggt cagcttttcc agggtatcgc cgtacacgga gtcgctgtag tgccgcagct 2040
ctctgattct ggagtagaag tcgatacact tgtggcaggc ggcgtggggg atggagtctc 2100
tgtacaccac gttgatgcac cgaatcagca gatcgcacag gggcttgttg tactgctgtt 2160
ccagggtggt gccgtacagg ctgtagcagt agtgccggta ctcgctgatc ttgctgtaga 2220
acttcaggca cttgtcgcac acggcgtaag gattgccatc ccggtacacg atggcggtgg 2280
cggggctggt aatcacgacg atgtacacga acaccagcac ccaggcatag gcacacatgc 2340
acacgctggg cagcaggctt ccgatggaca cccacagcag cagcaccagc accagcacct 2400
gagcgtacac gctgatagac agcagcagag gcctcaggta ggtggacaca ctcagcagca 2460
ggggtctgat cagcaggcac acgcacagca gcacacaaaa gcacagcagg aagcaagggg 2520
cccaaggtct tggaggaggt ggagggggca cccatggaca gtgtggaggt ctaggaggtg 2580
cccaggggca aggggcaggg attctagggg cccatggaga aggcttaggg atggggggcc 2640
tctgaggagc ccaaggacat ggagcaggga tccggtgagg gggggtgctg tagctgttca 2700
gcagtgacag cagtgggtat ctggtggtca ggtgcagcct gatcacgact gtgctcttgg 2760
gggtggtcac tctcagttcc accagggagg tattctcggg ggtggtggtc tgggtggtca 2820
gcagtttcag cagaggatat gttggatcca gaggcacggg gcacagggtg ggcacgatga 2880
acaccttgtt ctcccagatg cacttgccgt gggccttcac gcccatgttg gggggtctgg 2940
acagcagctt cagcaggggg tacttggtca cgagctgaga ctgatcctgg tcgctggaca 3000
gccgccgtct ggccttgtaa aaaatggcac attccaggcg aatcagcttc caatagtcga 3060
tatggtcctt cacgagctcg gtggcggaca cgttgtcgtc gaaggtgctg cacatggagt 3120
cgttgcagtc gatgctgttg ccgaaatgca cttcccaggt gcccttgcag ccgtacttct 3180
cggcctcggt cttgcaccag ccatcgtcgc cgcagtagta cacgctgtcc caggccacgt 3240
agtcgaacca cacttcgatg tgctggccgc ccttcttgaa gcagtgcttg ggctcggtgt 3300
tccactcgat ggctttacag gccttggcct tgcagatatt cagggcgggc accacctggt 3360
gtctggcggc gaagaagatg gcgttttcct gcctgatggc tttccaataa tcaatgtggt 3420
cggcttcctc ggtgctgatc tcatcggagg ggatgctggc ggggcacacg atcacgtaga 3480
tttctttcca atttgtgtaa tccatggtat tggccttgtc attgtcgtac tgcacggtga 3540
tggtgatgcc gtgcttctta aactggggct cggccagcca catttccagg ctggtctgct 3600
gcagggtcca ctcatcgtag gggctggcat tcagggcttc cagggccagc tgcagttcga 3660
tggcctggca ggctttggct ttagaggcgg ccagtggagg caccacctgg tggcagatgc 3720
tgatgcccag ctctttggcc ttatagaaga ttgcacattc catccggatc agtttccagt 3780
gctcgatctg ggcgttcagg tcgttcttgt cggcctcgta gtaggtcagg gtcacaatgg 3840
cgttcttgtg tttcccatcg tgacatgtcc aatgccactt gtgcttgccg tcgtggcagg 3900
tccagtgcca ggtgctgctc agcttgcagt gcttcttgaa tctgtaccgc aggcacttca 3960
ggatcagcag cttgttgggg tggcagttct cgcttctggg cctcttgatc cgccgctggc 4020
cgctctcagg acacaggatc acttggccgc cagcgtgcac ctcccatttc ttgtccacct 4080
ggccttccac cacggtgcac tgggcgtcct cgcagatata gatccagttg gtgtagtgca 4140
tggtgttgca gatgtcgccg tcgaactggt ggaagcccat ctcgcgagcc ttatacataa 4200
tagcacattc cagtctgatc cgggccttgt acatgatggc gcattccagc cggatgtgct 4260
tccagtagtc gatgtgatcg cagatgtggt cgcacaggtc cttggaatca ttctcataat 4320
gttccagaat gtccttgctg tcgttctcgt agtgttccag gatcttgtcc tggcacacgt 4380
tgaaggggtt cagaaactgc agcacgtgga tccgggagtg caggtatctc agaaaggcat 4440
tggggaactc gaacacggtg atccggcttt ccaggtaagg ccagaagggg ttggggaagg 4500
taaacaccac cagtctgctg tgcaggtagg gccacagaga catggcaaag caggacttgc 4560
ctgtgttagg tgggccgtag atcacgagac aattcaggga catgccgaag taggactttc 4620
cggtattggc tgggccgcac agcaccaggc agttcaggct catgccgaac aggctcttgc 4680
cggtgttggc agcgccgtac agcaggatgc agttcttcag aaaagcagcg gcgttgctgt 4740
tcacgtctgc cagctgagcc ttcaggaagg cgcaggcatt gctgtcgctg tcagccagct 4800
gggcggagtt gtcgataaag ccgtccaggt cgctctcggt gctctcttcg tcggtttcgt 4860
cgttgtcgat gaaatccacc atgtcctcgc cggtgtcgga gtcgttctca ttctcgtcgc 4920
tgtcgtcgat gaagtcgatc aggtcggtgc cgctgtcgta ggctgtctcg tcctcgtcgg 4980
ccagcttgct gcctcttctg aaccgggcgt ggatttcctg gctgggggac acaaacacgg 5040
cgccacacag cagcagcacg cagcacaggc ccctcttcat agcatccatg gtggcggcgc 5100
ggctagcggt accggatcta gatggggatc cgtcactgtt ctttatgatt ctacttcctt 5160
accgtgcaat aaattagaat atattttcta cttttacgag aaattaatta ttgtatttat 5220
tatttatggg tgaaaaactt actataaaaa gcgggtgggt ttggaattag tgatcagttt 5280
atgtatatcg caactaccgg gcatatggct atcgacatcg agaacattac ccacatgata 5340
agagattgta tcagtttcgt agtcttgagt attggtatta ctatatagta tatagatgtc 5400
gacctgcagg tcgacgaagt tcctatactt tctagagaat aggaacttcg cagccaagct 5460
ggaattcatc cactttggat aagaaatctg catgataaat atattgatat cctaccacct 5520
attaaagtac cattatctaa tagcaataag atagataaac aaatgttttt tgatgaagtt 5580
attacgtgga taaatatata tcttcaggaa aagggtatta tgttaccaga tgatataaga 5640
gaactcagag atgctattat tccttaacta gttacgtctc tttaggtact tattttgata 5700
cgttacaagt aaaaaactat caaatataaa tggaatctga ttctaatata gcgattgaag 5760
aggatccacc ggtcgccacc atggtgagca agggcgagga gctgttcacc ggggtggtgc 5820
ccatcctggt cgagctggac ggcgacgtaa acggccacaa gttcagcgtg tccggcgagg 5880
gcgagggcga tgccacctac ggcaagctga ccctgaagtt catctgcacc accggcaagc 5940
tgcccgtgcc ctggcccacc ctcgtgacca ccctgaccta cggcgtgcag tgcttcagcc 6000
gctaccccga ccacatgaag cagcacgact tcttcaagtc cgccatgccc gaaggctacg 6060
tccaggagcg caccatcttc ttcaaggacg acggcaacta caagacccgc gccgaggtga 6120
agttcgaggg cgacaccctg gtgaaccgca tcgagctgaa gggcatcgac ttcaaggagg 6180
acggcaacat cctggggcac aagctggagt acaactacaa cagccacaac gtctatatca 6240
tggccgacaa gcagaagaac ggcatcaagg tgaacttcaa gatccgccac aacatcgagg 6300
acggcagcgt gcagctcgcc gaccactacc agcagaacac ccccatcggc gacggccccg 6360
tgctgctgcc cgacaaccac tacctgagca cccagtccgc cctgagcaaa gaccccaacg 6420
agaagcgcga tcacatggtc ctgctggagt tcgtgaccgc cgccgggatc actctcggca 6480
tggacgagct gtacaagtaa agcggccgcg aagttcctat actttctaga gaataggaac 6540
ttcaacaatg tctggaaaga actgtccttc atcgatacct atcacggaga aatctgtaat 6600
tgattccaag acatcacata gtttagttgc ttccaatgct tcaaaattat tcttatcatg 6660
cgtccatagt cccgttccgt atctattatc gttagaatat tttatagtca cgcatttata 6720
ttgagctatt tgataacgtc taactcgtct aattaattct gtacttttac ctgaaaacat 6780
ggggccgatt atcaactgaa tatgtccgcc gttcatgatg acaataaaga attaattatt 6840
gttcacttta ttcgacttta atatatccat cacgttagaa aatgcgatat cgcgacgagg 6900
atctatgtat ctaacaggat ctattgcggt ggtagctaga gctgattctt ttttgaatcg 6960
catcaaacta atcacaaagt cgaacaaata tcctttatta agtttgaccc ttccatctgt 7020
aacaataggg accttgttaa acagtttttt aaaatcttga gagtctgtga attttgtcaa 7080
ttgtctgtat tcctctgaaa gagattcata acaatgaccc acggcttcta atttattttt 7140
tgattggatc aataataata acagaaagtc tagatattga gtgatttgca atatatcaga 7200
taatgaagat tcatcatctt gactagccaa atacttaaaa aatgaatcat catctgcgaa 7260
gaacatcgtt aagagatact ggttgtgatc catttatgag ctcgcgaaag cttggcactg 7320
gccgtcgttt tacaacgtcg tgactgggaa aaccctggcg ttacccaact taatcgcctt 7380
gcagcacatc cccctttcgc cagctggcgt aatagcgaag aggcccgcac cgatcgccct 7440
tcccaacagt tgcgcagcct gaatggcgaa tggcgcctga tgcggtattt tctccttacg 7500
catctgtgcg gtatttcaca ccgcatatgg tgcactctca gtacaatctg ctctgatgcc 7560
gcatagttaa gccagccccg acacccgcca acacccgctg acgcgccctg acgggcttgt 7620
ctgctcccgg catccgctta cagacaagct gtgaccgtct ccgggagctg catgtgtcag 7680
aggttttcac cgtcatcacc gaaacgcgcg agacgaaagg gcctcgtgat acgcctattt 7740
ttataggtta atgtcatgat aataatggtt tcttagacgt caggtggcac ttttcgggga 7800
aatgtgcgcg gaacccctat ttgtttattt ttctaaatac attcaaatat gtatccgctc 7860
atgagacaat aaccctgata aatgcttcaa taatattgaa aaaggaagag tatgagtatt 7920
caacatttcc gtgtcgccct tattcccttt tttgcggcat tttgccttcc tgtttttgct 7980
cacccagaaa cgctggtgaa agtaaaagat gctgaagatc agttgggtgc acgagtgggt 8040
tacatcgaac tggatctcaa cagcggtaag atccttgaga gttttcgccc cgaagaacgt 8100
tttccaatga tgagcacttt taaagttctg ctatgtggcg cggtattatc ccgtattgac 8160
gccgggcaag agcaactcgg tcgccgcata cactattctc agaatgactt ggttgagtac 8220
tcaccagtca cagaaaagca tcttacggat ggcatgacag taagagaatt atgcagtgct 8280
gccataacca tgagtgataa cactgcggcc aacttacttc tgacaacgat cggaggaccg 8340
aaggagctaa ccgctttttt gcacaacatg ggggatcatg taactcgcct tgatcgttgg 8400
gaaccggagc tgaatgaagc cataccaaac gacgagcgtg acaccacgat gcctgtagca 8460
atggcaacaa cgttgcgcaa actattaact ggcgaactac ttactctagc ttcccggcaa 8520
caattaatag actggatgga ggcggataaa gttgcaggac cacttctgcg ctcggccctt 8580
ccggctggct ggtttattgc tgataaatct ggagccggtg agcgtgggtc tcgcggtatc 8640
attgcagcac tggggccaga tggtaagccc tcccgtatcg tagttatcta cacgacgggg 8700
agtcaggcaa ctatggatga acgaaataga cagatcgctg agataggtgc ctcactgatt 8760
aagcattggt aactgtcaga ccaagtttac tcatatatac tttagattga tttaaaactt 8820
catttttaat ttaaaaggat ctaggtgaag atcctttttg ataatctcat gaccaaaatc 8880
ccttaacgtg agttttcgtt ccactgagcg tcagaccccg tagaaaagat caaaggatct 8940
tcttgagatc ctttttttct gcgcgtaatc tgctgcttgc aaacaaaaaa accaccgcta 9000
ccagcggtgg tttgtttgcc ggatcaagag ctaccaactc tttttccgaa ggtaactggc 9060
ttcagcagag cgcagatacc aaatactgtt cttctagtgt agccgtagtt aggccaccac 9120
ttcaagaact ctgtagcacc gcctacatac ctcgctctgc taatcctgtt accagtggct 9180
gctgccagtg gcgataagtc gtgtcttacc gggttggact caagacgata gttaccggat 9240
aaggcgcagc ggtcgggctg aacggggggt tcgtgcacac agcccagctt ggagcgaacg 9300
acctacaccg aactgagata cctacagcgt gagctatgag aaagcgccac gcttcccgaa 9360
gggagaaagg cggacaggta tccggtaagc ggcagggtcg gaacaggaga gcgcacgagg 9420
gagcttccag ggggaaacgc ctggtatctt tatagtcctg tcgggtttcg ccacctctga 9480
cttgagcgtc gatttttgtg atgctcgtca ggggggcgga gcctatggaa aaacgccagc 9540
aacgcggcct ttttacggtt cctggccttt tgctggcctt ttgctcacat gttctttcct 9600
gcgttatccc ctgattctgt ggataaccgt attaccgcct ttgagtgagc tgataccgct 9660
cgccgcagcc gaacgaccga gcgcagcgag tcagtgagcg aggaagcgga aga 9713
<210> 159
<211> 10405
<212> DNA
<213> 人工序列
<220>
<223> 质粒构建体
<400> 159
agcgcccaat acgcaaaccg cctctccccg cgcgttggcc gattcattaa tgcagctggc 60
acgacaggtt tcccgactgg aaagcgggca gtgagcgcaa cgcaattaat gtgagttagc 120
tcactcatta ggcaccccag gctttacact ttatgcttcc ggctcgtatg ttgtgtggaa 180
ttgtgagcgg ataacaattt cacacaggaa acagctatga ccatgattac gccaagctat 240
ttaggtgaca ctatagaata ctcaagctat gcatcaagct tggtaccgag ctcggatcca 300
ctagtaacgg ccgccagtgt gctggaattc gcccttgtaa tctattcgat ataccgttgc 360
taacagtata ctggcccaat aactgtggat ggaaaatcta taataataca ttaatatcat 420
ccgatggtgc tagggttatt tggatggatg cgtataaatt ttcttgcggt ttatctttac 480
aagactattg ttatcattgg ggtagcaaac cagagagccg accattcgat ttaataaaaa 540
aatcagatgc taaacgcaat tctaaatcgt tggtcaaaga atctatggca tccttgaaat 600
ccttgtacga ggcattcgag acacaatcag gagcgttaga agttttaatg agtccatgta 660
ggatgttttc gttttctaga atagaagaca tgttcttaac tagtgtcatt aatagagtat 720
ccgagaatac tggaatgggg atgtattatc ctaccaacga tataccttct ctatttatcg 780
aatcatctat ctgtctagat tatattatag taaataatca ggaatccaac aaatatcgta 840
tcaaatctgt tctcgatatc atttcttcaa aacaataccc tgcaggacgt cccaactacg 900
ttaaaaatgg tacaaaagga aagttatata tcgcgttgtg taaagttacc gtacctacta 960
acgaccatat tccagtagtt tatcacgatg atgacaatac taccaccttt attacagtat 1020
tgacgtccgt cgatattgaa actgctatca gagcaggata ttcgatagtc gaattagggg 1080
ctttacaatg ggataataat attccagaac ttaaaaacgg tttactggat agtatcaaga 1140
tgatttatga cttgaacgca gttacaacaa ataatttatt ggaacagctc atagaaaata 1200
ttaactttaa caactctagt ataatttcgt tgttttatac atttgccatt agttattgcc 1260
gagcattcat ttactcaatt atggaaacca tagatccggt gtatatatct cagttcagtt 1320
ataaagaatt atacgttagt agctcttata aagatattaa tgaatccatg agtcagatgg 1380
taaaattata aaaagtgaaa aacaatatta tttttatcgt tggttgttac actatggatg 1440
ctatgaagag gggcctgtgc tgcgtgctgc tgctgtgtgg cgccgtgttt gtgtccccca 1500
gccaggaaat ccacgcccgg ttcagaagag gcagcaagct ggccgacgag gacgagacag 1560
cctacgacag cggcaccgac ctgatcgact tcatcgacga cagcgacgag aatgagaacg 1620
actccgacac cggcgaggac atggtggatt tcatcgacaa cgacgaaacc gacgaagaga 1680
gcaccgagag cgacctggac ggctttatcg acaactccgc ccagctggct gacagcgaca 1740
gcaatgcctg cgccttcctg aaggctcagc tggcagacgt gaacagcaac gccgctgctt 1800
ttctgaagaa ctgcatcctg ctgtacggcg ctgccaacac cggcaagagc ctgttcggca 1860
tgagcctgaa ctgcctggtg ctgtgcggcc cagccaatac cggaaagtcc tacttcggca 1920
tgtccctgaa ttgtctcgtg atctacggcc cacctaacac aggcaagtcc tgctttgcca 1980
tgtctctgtg gccctacctg cacagcagac tggtggtgtt taccttcccc aaccccttct 2040
ggccttacct ggaaagccgg atcaccgtgt tcgagttccc caatgccttt ctgagatacc 2100
tgcactcccg gatccacgtg ctgcagtttc tgaacccctt caacgtgtgc caggacaaga 2160
tcctggaaca ctacgagaac gacagcaagg acattctgga acattatgag aatgattcca 2220
aggacctgtg cgaccacatc tgcgatcaca tcgactactg gaagcacatc cggctggaat 2280
gcgccatcat gtacaaggcc cggatcagac tggaatgtgc tattatgtat aaggctcgcg 2340
agatgggctt ccaccagttc gacggcgaca tctgcaacac catgcactac accaactgga 2400
tctatatctg cgaggacgcc cagtgcaccg tggtggaagg ccaggtggac aagaaatggg 2460
aggtgcacgc tggcggccaa gtgatcctgt gtcctgagag cggccagcgg cggatcaaga 2520
ggcccagaag cgagaactgc caccccaaca agctgctgat cctgaagtgc ctgcggtaca 2580
gattcaagaa gcactgcaag ctgagcagca cctggcactg gacctgccac gacggcaagc 2640
acaagtggca ttggacatgt cacgatggga aacacaagaa cgccattgtg accctgacct 2700
actacgaggc cgacaagaac gacctgaacg cccagatcga gcactggaaa ctgatccgga 2760
tggaatgtgc aatcttctat aaggccaaag agctgggcat cagcatctgc caccaggtgg 2820
tgcctccact ggccgcctct aaagccaaag cctgccaggc catcgaactg cagctggccc 2880
tggaagccct gaatgccagc ccctacgatg agtggaccct gcagcagacc agcctggaaa 2940
tgtggctggc cgagccccag tttaagaagc acggcatcac catcaccgtg cagtacgaca 3000
atgacaaggc caataccatg gattacacaa attggaaaga aatctacgtg atcgtgtgcc 3060
ccgccagcat cccctccgat gagatcagca ccgaggaagc cgaccacatt gattattgga 3120
aagccatcag gcaggaaaac gccatcttct tcgccgccag acaccaggtg gtgcccgccc 3180
tgaatatctg caaggccaag gcctgtaaag ccatcgagtg gaacaccgag cccaagcact 3240
gcttcaagaa gggcggccag cacatcgaag tgtggttcga ctacgtggcc tgggacagcg 3300
tgtactactg cggcgacgat ggctggtgca agaccgaggc cgagaagtac ggctgcaagg 3360
gcacctggga agtgcatttc ggcaacagca tcgactgcaa cgactccatg tgcagcacct 3420
tcgacgacaa cgtgtccgcc accgagctcg tgaaggacca tatcgactat tggaagctga 3480
ttcgcctgga atgtgccatt ttttacaagg ccagacggcg gctgtccagc gaccaggatc 3540
agtctcagct cgtgaccaag taccccctgc tgaagctgct gtccagaccc cccaacatgg 3600
gcgtgaaggc ccacggcaag tgcatctggg agaacaaggt gttcatcgtg cccaccctgt 3660
gccccgtgcc tctggatcca acatatcctc tgctgaaact gctgaccacc cagaccacca 3720
cccccgagaa tacctccctg gtggaactga gagtgaccac ccccaagagc acagtcgtga 3780
tcaggctgca cctgaccacc agatacccac tgctgtcact gctgaacagc tacagcaccc 3840
cccctcaccg gatccctgct ccatgtcctt gggctcctca gaggcccccc atccctaagc 3900
cttctccatg ggcccctaga atccctgccc cttgcccctg ggcacctcct agacctccac 3960
actgtccatg ggtgccccct ccacctcctc caagaccttg ggccccttgc ttcctgctgt 4020
gcttttgtgt gctgctgtgc gtgtgcctgc tgatcagacc cctgctgctg agtgtgtcca 4080
cctacctgag gcctctgctg ctgtctatca gcgtgtacgc tcaggtgctg gtgctggtgc 4140
tgctgctgtg ggtgtccatc ggaagcctgc tgcccagcgt gtgcatgtgt gcctatgcct 4200
gggtgctggt gttcgtgtac atcgtcgtga ttaccagccc cgccaccgcc atcgtgtacc 4260
gggatggcaa tccttacgcc gtgtgcgaca agtgcctgaa gttctacagc aagatcagcg 4320
agtaccggca ctactgctac agcctgtacg gcaccaccct ggaacagcag tacaacaagc 4380
ccctgtgcga tctgctgatt cggtgcatca acgtggtgta cagagactcc atcccccacg 4440
ccgcctgcca caagtgtatc gacttctact ccagaatcag agagctgcgg cactacagcg 4500
actccgtgta cggcgatacc ctggaaaagc tgaccaacac tggcctgtac aacctgctga 4560
ttagatgcct gcgggtgttc tgcaagaagg ccctgacagc cagcgaggtg tacaacttcg 4620
cctacaccga tctgcgggtg gtgtatcggg acagcaaagt gcggaagctg aggtactaca 4680
actgctctgt gtatggcgcc agcctggtgt attgcaaggg acagctgacc gagacagagg 4740
tgctggattt cgccttcaca gacctgacaa tcgtgtatcg cgactccaag gtgtccgagt 4800
tccggtggta cagatattcc gtgtatggca ccacactgtg cgtggaatgc aagaaaaccc 4860
tgcagagatc tgaggtgtac gactgccagc ggccactgtg tccgcaggaa aagaaaagac 4920
acgtggacct gaacaagcgg ttccacaccc tgcacgagta catgctggat ctgcagcccg 4980
agacaaccga cctgtactgc tacgagcagc ctgaaaccac tgatctgcac tgttatgagc 5040
agctgggaga cagctccgat gaagaggaca ctggcggcct ggatggggac gaggatgagg 5100
acgaagtgga ccatctgcag gaacagcccc agcaggctag acgggacgaa cagcaccctt 5160
gctatctgat cgagacacag tgctgcagat gcgaatctct ggtggaagag aacgacgaga 5220
tcgacggcgt gaaccaccag catctgcccg ctagaagggc cgagcctcag agacacacca 5280
tgctgtgtat gtgctgcaag tgcgaggcca gaatcgccgg ctaattttta taaccgagtt 5340
tctgcattat tgtaattcgt atgctggcac catcaaagaa tcacttctaa aagatatcaa 5400
tatcacacat acaaatatta ctaccctatt gaatgagaca gccaaggtta tcaagttagt 5460
aaaatctctg gtagataaag aagatactga tattgtgaat aatttcatta ccaaagaaat 5520
taaaaacaga gacaaaatag ttaatagttt gtctctatca aacctggact ttcgtttgta 5580
aattggggct ttttgtacaa taaatgggtg ttgccaatga ttcatcccct gaatatcaat 5640
ggatgtctcc ccatagatta tcagatactg ttatattagg agactgtttg tattttaaca 5700
atataatgtc ccaattagat ttacaccaaa attgggctcc atcagttaga ttgttaaatt 5760
attttaagaa ttttaataag gaaacactac taaagataga agagaatgat tacattaatt 5820
catccttttt ccaacaaaag gataaacgat tttatcctat aaacgacgat ttttatcaca 5880
tatctacagg aggatatggt atagtcttta agatagataa ctatgtagta aaatttgtat 5940
tcgaggccac aaaattatat agtcccatgg aaactacggc ggagttcaca gtacccaaat 6000
ttctatacaa caatctaaag ggagatgaaa aaaaattaat cgtgtgtgcg tgggccatgg 6060
gattaaacta taaattaaca tttttacata ctctgtataa acgtgttctt catatgttgc 6120
tattattgat acaaactatg gatggtcagg aactatcatt gagatattct tctaaagttt 6180
ttttaaaggc gtttaacgag agaaaggaca gtatcaaatt cgtgaaatta ctatcccact 6240
tttatccggc agttattaac agtaatatta atgttataaa ctattttaac cgcatgtttc 6300
actttttcga acatgaaaag agaactaact acgaatacga aagaggaaat attataattt 6360
ttcccctagc actgtattcg gcagataaag tagataccga gctagctatc aaattaggat 6420
ttaaatcttt ggtacaatac ataaagttta tctttttaca gatggctctg ttatacatta 6480
aaatttacga actaccatgc tgcgacaact ttttacacgc agatcttaaa cccgataata 6540
tcttactttt tgattccaat gaaccaataa taattcatct aaaggataaa aagtttgttt 6600
ttaatgaacg tattaaatcg gcattaaacg actttgactt ttcccaagaa gggcgaattc 6660
tgcagatatc catcacactg gcggccgctt acttgtacag ctcgtccatg ccgagagtga 6720
tcccggcggc ggtcacgaac tccagcagga ccatgtgatc gcgcttctcg ttggggtctt 6780
tgctcagggc ggactgggtg ctcaggtagt ggttgtcggg cagcagcacg gggccgtcgc 6840
cgatgggggt gttctgctgg tagtggtcgg cgagctgcac gctgccgtcc tcgatgttgt 6900
ggcggatctt gaagttcacc ttgatgccgt tcttctgctt gtcggccatg atatagacgt 6960
tgtggctgtt gtagttgtac tccagcttgt gccccaggat gttgccgtcc tccttgaagt 7020
cgatgccctt cagctcgatg cggttcacca gggtatcgcc ctcgaacttc acctcggcgc 7080
gggtcttgta gttgccgtcg tccttgaaga agatggtgcg ctcctggacg tagccttcgg 7140
gcatggcgga cttgaagaag tcgtgctgct tcatgtggtc ggggtagcgg ctgaagcact 7200
gcacgccgta ggtcagggtg gtcacgaggg tcggccaggg cacgggcagc ttgccggtgg 7260
tgcagatgaa cttcagggtc agcttgccgt aggtggcatc gccctcgccc tcgccggaca 7320
cgctgaactt gtggccgttt acgtcgccgt ccagctcgac caggatgggc accaccccgg 7380
taaacagctc ctcgcccttg ctcaccatgt ttaaacttta tattccaaaa aaaaaaaata 7440
aaatttcaat ttttgtttaa acgttgtacg gcagtttaag gtttacacct ataaaagaga 7500
gagccgttat cgtctgtttg tggatgtaca gagtgatatt attgacacgc cggggcgacg 7560
gatggtgatc cccctggcca gtgcacgtct gctgtcagat aaagtctccc gtgaacttta 7620
cccggtggtg catatcgggg atgaaagctg gcgcatgatg accaccgata tggccagtgt 7680
gccggtctcc gttatcgggg aagaagtggc tgatctcagc caccgcgaaa atgacatcaa 7740
aaacgccatt aacctgatgt tctggggaat ataaatgtca ggcatgagat tatcaaaaag 7800
gatcttcacc tagatccttt tcacgtagaa agccagtccg cagaaacggt gctgaccccg 7860
gatgaatgtc agctactggg ctatctggac aagggaaaac gcaagcgcaa agagaaagca 7920
ggtagcttgc agtgggctta catggcgata gctagactgg gcggttttat ggacagcaag 7980
cgaaccggaa ttgccagctg gggcgccctc tggtaaggtt gggaagccct gcaaagtaaa 8040
ctggatggct ttctcgccgc caaggatctg atggcgcagg ggatcaagct ctgatcaaga 8100
gacaggatga ggatcgtttc gcatgattga acaagatgga ttgcacgcag gttctccggc 8160
cgcttgggtg gagaggctat tcggctatga ctgggcacaa cagacaatcg gctgctctga 8220
tgccgccgtg ttccggctgt cagcgcaggg gcgcccggtt ctttttgtca agaccgacct 8280
gtccggtgcc ctgaatgaac tgcaagacga ggcagcgcgg ctatcgtggc tggccacgac 8340
gggcgttcct tgcgcagctg tgctcgacgt tgtcactgaa gcgggaaggg actggctgct 8400
attgggcgaa gtgccggggc aggatctcct gtcatctcac cttgctcctg ccgagaaagt 8460
atccatcatg gctgatgcaa tgcggcggct gcatacgctt gatccggcta cctgcccatt 8520
cgaccaccaa gcgaaacatc gcatcgagcg agcacgtact cggatggaag ccggtcttgt 8580
cgatcaggat gatctggacg aagagcatca ggggctcgcg ccagccgaac tgttcgccag 8640
gctcaaggcg agcatgcccg acggcgagga tctcgtcgtg acccatggcg atgcctgctt 8700
gccgaatatc atggtggaaa atggccgctt ttctggattc atcgactgtg gccggctggg 8760
tgtggcggac cgctatcagg acatagcgtt ggctacccgt gatattgctg aagagcttgg 8820
cggcgaatgg gctgaccgct tcctcgtgct ttacggtatc gccgctcccg attcgcagcg 8880
catcgccttc tatcgccttc ttgacgagtt cttctgaatt attaacgctt acaatttcct 8940
gatgcggtat tttctcctta cgcatctgtg cggtatttca caccgcatac aggtggcact 9000
tttcggggaa atgtgcgcgg aacccctatt tgtttatttt tctaaataca ttcaaatatg 9060
tatccgctca tgagacaata accctgataa atgcttcaat aatagcacgt gaggagggcc 9120
accatggcca agttgaccag tgccgttccg gtgctcaccg cgcgcgacgt cgccggagcg 9180
gtcgagttct ggaccgaccg gctcgggttc tcccgggact tcgtggagga cgacttcgcc 9240
ggtgtggtcc gggacgacgt gaccctgttc atcagcgcgg tccaggacca ggtggtgccg 9300
gacaacaccc tggcctgggt gtgggtgcgc ggcctggacg agctgtacgc cgagtggtcg 9360
gaggtcgtgt ccacgaactt ccgggacgcc tccgggccgg ccatgaccga gatcggcgag 9420
cagccgtggg ggcgggagtt cgccctgcgc gacccggccg gcaactgcgt gcacttcgtg 9480
gccgaggagc aggactgaca cgtgctaaaa cttcattttt aatttaaaag gatctaggtg 9540
aagatccttt ttgataatct catgaccaaa atcccttaac gtgagttttc gttccactga 9600
gcgtcagacc ccgtagaaaa gatcaaagga tcttcttgag atcctttttt tctgcgcgta 9660
atctgctgct tgcaaacaaa aaaaccaccg ctaccagcgg tggtttgttt gccggatcaa 9720
gagctaccaa ctctttttcc gaaggtaact ggcttcagca gagcgcagat accaaatact 9780
gtccttctag tgtagccgta gttaggccac cacttcaaga actctgtagc accgcctaca 9840
tacctcgctc tgctaatcct gttaccagtg gctgctgcca gtggcgataa gtcgtgtctt 9900
accgggttgg actcaagacg atagttaccg gataaggcgc agcggtcggg ctgaacgggg 9960
ggttcgtgca cacagcccag cttggagcga acgacctaca ccgaactgag atacctacag 10020
cgtgagctat gagaaagcgc cacgcttccc gaagggagaa aggcggacag gtatccggta 10080
agcggcaggg tcggaacagg agagcgcacg agggagcttc cagggggaaa cgcctggtat 10140
ctttatagtc ctgtcgggtt tcgccacctc tgacttgagc gtcgattttt gtgatgctcg 10200
tcaggggggc ggagcctatg gaaaaacgcc agcaacgcgg cctttttacg gttcctgggc 10260
ttttgctggc cttttgctca catgttcttt cctgcgttat cccctgattc tgtggataac 10320
cgtattaccg cctttgagtg agctgatacc gctcgccgca gccgaacgac cgagcgcagc 10380
gagtcagtga gcgaggaagc ggaag 10405
Claims (12)
1.一种核酸,其编码包含多个保守肽序列或其变体的多肽,
其中所述多肽包含序列SEQ ID NO:61或SEQ ID NO:66;或者包含SEQ ID NO:66且进一步包含肽佐剂。
2.根据权利要求1所述的核酸,其中所述核酸包含序列SEQ ID NO:60或由序列SEQ IDNO:60组成,或者包含序列SEQ ID NO:65或由序列SEQ ID NO:65组成;或者包含序列SEQ IDNO:65或由序列SEQ ID NO:65组成,且进一步编码肽佐剂。
3.根据任一项前述权利要求所述的核酸,其中所述核酸包含序列SEQ ID NO:62或由序列SEQ ID NO:62组成,具有编码TPA前导序列ATGGATGCTATGAAGAGGGGCCTGTGCTGCGTGCTGCTGCTGTGTGGCGCC GTGTTTGTGTCCCCCAGCCAGGAAATCCACGCCCGGTTCAGAAGAGGCAG CAAGCTGGCC的序列;或者
所述核酸包含序列SEQ ID NO:62或由SEQ ID NO:62组成,且不编码TPA前导序列ATGGATGCTATGAAGAGGGGCCTGTGCTGCGTGCTGCTGCTGTGT GGCGCCGTGTTTGTGTCCCCCAGCCAGGAAATCCACGCCCGGTTCAGAAG AGGCAGCAAGCTGGCC;或者
所述核酸包含序列SEQ ID NO:62或由SEQ ID NO:62组成,且编码不同的肽佐剂而非所述TPA前导序列ATGGATGCTATGAAGAGGGGCCTGTGCTGCG TGCTGCTGCTGTGTGGCGCCGTGTTTGTGTCCCCCAGCCAGGAAATCCACG CCCGGTTCAGAAGAGGCAGCAAGCTGGCC。
4.根据权利要求1或2所述的核酸,
其中所述核酸是用于疫苗接种的质粒载体;和/或
其中所述核酸包含病毒载体序列。
5.一种疫苗,其包含根据任一项前述权利要求所述的核酸。
6.根据权利要求5所述的疫苗,其进一步包含另一种治疗或预防活性成分;和/或其进一步包含佐剂。
7.一种组合物,其包含多种根据权利要求1-4中任一项所述的不同核酸,任选地其中所述组合物是药学上可接受的组合物。
8.一种多肽,其由根据权利要求1-4中任一项所述的核酸编码。
9.一种病毒载体,其包含根据权利要求1或2所述的核酸;任选地其中所述病毒是腺病毒。
10.一种组合物,其包含以下中的一种或多种:
-根据权利要求1-4中任一项所述的核酸
-根据权利要求8所述的多肽;和/或
-根据权利要求9所述的病毒载体;
任选地,其中所述组合物在哺乳动物中是免疫原性的;和/或任选地,所述组合物进一步包含药学上可接受的载体。
11.根据权利要求1-4中任一项所述的核酸、根据权利要求5或6所述的疫苗、根据权利要求8所述的多肽、根据权利要求9所述的病毒载体或根据权利要求10所述的组合物,其用于预防或治疗HPV感染。
12.根据权利要求1-4中任一项所述的核酸、根据权利要求5或6所述的疫苗、根据权利要求8所述的多肽、根据权利要求9所述的病毒载体或根据权利要求10所述的组合物在制备用于治疗或预防HPV感染的试剂中的应用。
Applications Claiming Priority (3)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
GB1713163.2 | 2017-08-16 | ||
GBGB1713163.2A GB201713163D0 (en) | 2017-08-16 | 2017-08-16 | HPV vaccine |
PCT/GB2018/052335 WO2019034887A1 (en) | 2017-08-16 | 2018-08-16 | HPV VACCINE |
Publications (2)
Publication Number | Publication Date |
---|---|
CN111212659A CN111212659A (zh) | 2020-05-29 |
CN111212659B true CN111212659B (zh) | 2023-09-22 |
Family
ID=59895974
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201880053063.XA Active CN111212659B (zh) | 2017-08-16 | 2018-08-16 | Hpv疫苗 |
Country Status (10)
Country | Link |
---|---|
US (2) | US11179456B2 (zh) |
EP (1) | EP3668542A1 (zh) |
JP (1) | JP7551495B2 (zh) |
CN (1) | CN111212659B (zh) |
AU (1) | AU2018316734A1 (zh) |
CA (1) | CA3072536A1 (zh) |
GB (1) | GB201713163D0 (zh) |
MX (1) | MX2020001655A (zh) |
SG (1) | SG11202001244SA (zh) |
WO (1) | WO2019034887A1 (zh) |
Families Citing this family (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
GB201713163D0 (en) | 2017-08-16 | 2017-09-27 | Univ Oxford Innovation Ltd | HPV vaccine |
JOP20210186A1 (ar) * | 2019-01-10 | 2023-01-30 | Janssen Biotech Inc | مستضدات البروستاتا المستحدثة واستخداماتها |
GB202019767D0 (en) | 2020-12-15 | 2021-01-27 | Chain Biotechnology Ltd | Compostitions and methods |
GB202209115D0 (en) | 2022-06-21 | 2022-08-10 | Chain Biotechnology Ltd | Compositions and methods |
Citations (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2009059325A2 (en) * | 2007-11-02 | 2009-05-07 | The Johns Hopkins University | Multitype hpv peptide compositions and methods for treatment or prevention of human papillomavirus infection |
Family Cites Families (8)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US7026443B1 (en) | 1999-12-10 | 2006-04-11 | Epimmune Inc. | Inducing cellular immune responses to human Papillomavirus using peptide and nucleic acid compositions |
AU2005222776A1 (en) * | 2003-12-31 | 2005-09-29 | Genimmune N.V. | Inducing cellular immune responses to human papillomavirus using peptide and nucleic acid compositions |
EP2093410B1 (en) | 2008-02-21 | 2014-04-09 | Delphi International Operations Luxembourg S.à r.l. | A fuel injector with an improved valve control arrangement |
US20120141502A1 (en) | 2009-04-20 | 2012-06-07 | Eric Dixon | Antibodies specific to e6 proteins of hpv and use thereof |
JPWO2014103608A1 (ja) | 2012-12-25 | 2017-01-12 | 一般財団法人化学及血清療法研究所 | HPV/HBsキメラタンパク質を有効成分とするHPV感染症及び/又はB型肝炎用ワクチン |
CN105307678A (zh) | 2013-03-12 | 2016-02-03 | 宾夕法尼亚大学理事会 | 用于人乳头状瘤病毒的改进疫苗及其使用方法 |
US11466292B2 (en) | 2016-09-29 | 2022-10-11 | Glaxosmithkline Biologicals Sa | Compositions and methods of treatment |
GB201713163D0 (en) | 2017-08-16 | 2017-09-27 | Univ Oxford Innovation Ltd | HPV vaccine |
-
2017
- 2017-08-16 GB GBGB1713163.2A patent/GB201713163D0/en not_active Ceased
-
2018
- 2018-08-16 US US16/639,144 patent/US11179456B2/en active Active
- 2018-08-16 MX MX2020001655A patent/MX2020001655A/es unknown
- 2018-08-16 JP JP2020508007A patent/JP7551495B2/ja active Active
- 2018-08-16 CA CA3072536A patent/CA3072536A1/en active Pending
- 2018-08-16 SG SG11202001244SA patent/SG11202001244SA/en unknown
- 2018-08-16 CN CN201880053063.XA patent/CN111212659B/zh active Active
- 2018-08-16 WO PCT/GB2018/052335 patent/WO2019034887A1/en unknown
- 2018-08-16 AU AU2018316734A patent/AU2018316734A1/en active Pending
- 2018-08-16 EP EP18759700.0A patent/EP3668542A1/en active Pending
-
2021
- 2021-11-22 US US17/532,177 patent/US12064477B2/en active Active
Patent Citations (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2009059325A2 (en) * | 2007-11-02 | 2009-05-07 | The Johns Hopkins University | Multitype hpv peptide compositions and methods for treatment or prevention of human papillomavirus infection |
Non-Patent Citations (3)
Title |
---|
Induction of Antitumor Immunity In vivo Following Delivery of a Novel HPV-16 DNA Vaccine Encoding an E6/E7 Fusion antigen;Jian Yan等;《Vaccine》;第431–440页 * |
Sequence-based approach for rapid identification of cross-clade CD8+ T-cell vaccine candidates from all high-risk HPV strains;Krishna P. Singh等;《3 Biotech》;第1-10页 * |
宫颈癌HPV基因疫苗研究进展;侯萌;《中国肿瘤生物治疗杂志》(第01期);第71-74页 * |
Also Published As
Publication number | Publication date |
---|---|
MX2020001655A (es) | 2020-08-17 |
US11179456B2 (en) | 2021-11-23 |
GB201713163D0 (en) | 2017-09-27 |
CA3072536A1 (en) | 2019-02-21 |
US12064477B2 (en) | 2024-08-20 |
JP2020536489A (ja) | 2020-12-17 |
KR20200056384A (ko) | 2020-05-22 |
JP7551495B2 (ja) | 2024-09-17 |
US20200306358A1 (en) | 2020-10-01 |
EP3668542A1 (en) | 2020-06-24 |
AU2018316734A1 (en) | 2020-02-27 |
CN111212659A (zh) | 2020-05-29 |
WO2019034887A1 (en) | 2019-02-21 |
SG11202001244SA (en) | 2020-03-30 |
US20220152189A1 (en) | 2022-05-19 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
AU2020204194B2 (en) | Optimal soybean loci | |
KR102631985B1 (ko) | 게놈을 변형시키기 위한 조성물 및 방법 | |
KR102357051B1 (ko) | 신규 유전자 재조합 백시니아 바이러스 | |
KR102147007B1 (ko) | Fad3 성능 유전자좌 및 표적화 파단을 유도할 수 있는 상응하는 표적 부위 특이적 결합 단백질 | |
KR102243727B1 (ko) | 유전자 표적화 및 형질 스태킹을 위한 조작된 트랜스진 통합 플랫폼 (etip) | |
AU2021221448B2 (en) | Modified plant | |
AU2013312198B2 (en) | Fluorescence activated cell sorting (FACS) enrichment to generate plants | |
AU2022202248A1 (en) | Nucleic acid-guided nucleases | |
CN111212659B (zh) | Hpv疫苗 | |
KR101234062B1 (ko) | 림프구 내 유전자 도입용 재조합 바이러스 벡터 | |
AU2018232902A1 (en) | Complete genome sequence of the methanogen methanobrevibacter ruminantium | |
KR20210132002A (ko) | 변형된 오르토폭스 바이러스 벡터 | |
KR20220031028A (ko) | 아프리카 돼지 열병 백신 | |
CN113151310B (zh) | 非洲猪瘟基因缺失弱毒株的构建及其作为疫苗的应用 | |
KR20220165731A (ko) | Sars-cov-2 바이러스에 대한 재조합 폭스바이러스 기반 백신 | |
CN113025629A (zh) | 一种基因缺失的减毒非洲猪瘟病毒株及应用 | |
CN112899290B (zh) | 一种天然免疫抑制基因缺失的减毒非洲猪瘟病毒株及应用 | |
CN112543806A (zh) | 合成嵌合痘苗病毒 | |
CN115916803A (zh) | 非洲猪瘟疫苗组合物 | |
KR20220024508A (ko) | 생물학적으로 봉쇄된 박테리아 및 그의 용도 | |
CN111107872A (zh) | 有用于癌症免疫疗法的牛痘病毒突变体 | |
AU2021203084B2 (en) | Modified plant | |
CN112261951A (zh) | 包含合成嵌合痘苗病毒的干细胞及其使用方法 | |
KR102713224B1 (ko) | Hpv 백신 | |
KR20230136600A (ko) | 안정적인 세포주에서 효율적인 성장을 가능하게 하는아프리카 돼지 열병 백신의 게놈 결실 |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
GR01 | Patent grant |