KR20220164500A - 불활성화된 SARS-CoV-2 바이러스 백신 - Google Patents
불활성화된 SARS-CoV-2 바이러스 백신 Download PDFInfo
- Publication number
- KR20220164500A KR20220164500A KR1020227034302A KR20227034302A KR20220164500A KR 20220164500 A KR20220164500 A KR 20220164500A KR 1020227034302 A KR1020227034302 A KR 1020227034302A KR 20227034302 A KR20227034302 A KR 20227034302A KR 20220164500 A KR20220164500 A KR 20220164500A
- Authority
- KR
- South Korea
- Prior art keywords
- cov
- sars
- vaccine
- inactivated
- particles
- Prior art date
Links
- 241001678559 COVID-19 virus Species 0.000 title claims description 449
- 230000002155 anti-virotic effect Effects 0.000 title 1
- 229940022962 COVID-19 vaccine Drugs 0.000 claims abstract description 169
- 238000000034 method Methods 0.000 claims abstract description 164
- 229960005486 vaccine Drugs 0.000 claims abstract description 152
- 239000000203 mixture Substances 0.000 claims abstract description 82
- 239000002245 particle Substances 0.000 claims description 276
- 230000002779 inactivation Effects 0.000 claims description 101
- 108090000623 proteins and genes Proteins 0.000 claims description 100
- 102000004169 proteins and genes Human genes 0.000 claims description 99
- 235000018102 proteins Nutrition 0.000 claims description 96
- VEZXCJBBBCKRPI-UHFFFAOYSA-N beta-propiolactone Chemical compound O=C1CCO1 VEZXCJBBBCKRPI-UHFFFAOYSA-N 0.000 claims description 91
- 230000003612 virological effect Effects 0.000 claims description 87
- 239000002671 adjuvant Substances 0.000 claims description 83
- 229960000380 propiolactone Drugs 0.000 claims description 56
- 238000012986 modification Methods 0.000 claims description 54
- 230000004048 modification Effects 0.000 claims description 51
- 108090000765 processed proteins & peptides Proteins 0.000 claims description 47
- 102100031673 Corneodesmosin Human genes 0.000 claims description 43
- 108020000999 Viral RNA Proteins 0.000 claims description 36
- 239000000126 substance Substances 0.000 claims description 36
- 108091028043 Nucleic acid sequence Proteins 0.000 claims description 35
- 208000025721 COVID-19 Diseases 0.000 claims description 34
- 229940096437 Protein S Drugs 0.000 claims description 32
- 102000004196 processed proteins & peptides Human genes 0.000 claims description 31
- 238000011282 treatment Methods 0.000 claims description 31
- 102000007327 Protamines Human genes 0.000 claims description 30
- 108010007568 Protamines Proteins 0.000 claims description 30
- 229940046168 CpG oligodeoxynucleotide Drugs 0.000 claims description 29
- 230000003472 neutralizing effect Effects 0.000 claims description 29
- 229950008679 protamine sulfate Drugs 0.000 claims description 27
- 238000002965 ELISA Methods 0.000 claims description 26
- 230000000415 inactivating effect Effects 0.000 claims description 23
- 230000000937 inactivator Effects 0.000 claims description 23
- 208000037847 SARS-CoV-2-infection Diseases 0.000 claims description 22
- 108010031318 Vitronectin Proteins 0.000 claims description 22
- 238000004806 packaging method and process Methods 0.000 claims description 22
- 238000009472 formulation Methods 0.000 claims description 21
- 238000004519 manufacturing process Methods 0.000 claims description 21
- 125000003729 nucleotide group Chemical group 0.000 claims description 21
- 239000008194 pharmaceutical composition Substances 0.000 claims description 21
- 235000001014 amino acid Nutrition 0.000 claims description 19
- 101710198474 Spike protein Proteins 0.000 claims description 18
- 150000001413 amino acids Chemical class 0.000 claims description 18
- 201000010099 disease Diseases 0.000 claims description 18
- 208000037265 diseases, disorders, signs and symptoms Diseases 0.000 claims description 18
- 239000001963 growth medium Substances 0.000 claims description 18
- 238000003306 harvesting Methods 0.000 claims description 18
- 210000003501 vero cell Anatomy 0.000 claims description 18
- 108010052285 Membrane Proteins Proteins 0.000 claims description 17
- 125000000539 amino acid group Chemical group 0.000 claims description 17
- 229930006000 Sucrose Natural products 0.000 claims description 15
- CZMRCDWAGMRECN-UGDNZRGBSA-N Sucrose Chemical compound O[C@H]1[C@H](O)[C@@H](CO)O[C@@]1(CO)O[C@@H]1[C@H](O)[C@@H](O)[C@H](O)[C@@H](CO)O1 CZMRCDWAGMRECN-UGDNZRGBSA-N 0.000 claims description 15
- 239000012528 membrane Substances 0.000 claims description 15
- 239000005720 sucrose Substances 0.000 claims description 15
- 108091006905 Human Serum Albumin Proteins 0.000 claims description 14
- 102000008100 Human Serum Albumin Human genes 0.000 claims description 14
- 241000315672 SARS coronavirus Species 0.000 claims description 13
- 238000002156 mixing Methods 0.000 claims description 13
- 230000008569 process Effects 0.000 claims description 13
- 230000001419 dependent effect Effects 0.000 claims description 12
- 239000012634 fragment Substances 0.000 claims description 12
- 230000007062 hydrolysis Effects 0.000 claims description 12
- 238000006460 hydrolysis reaction Methods 0.000 claims description 12
- 239000007788 liquid Substances 0.000 claims description 12
- 229920001184 polypeptide Polymers 0.000 claims description 12
- 230000002829 reductive effect Effects 0.000 claims description 12
- YYGNTYWPHWGJRM-UHFFFAOYSA-N (6E,10E,14E,18E)-2,6,10,15,19,23-hexamethyltetracosa-2,6,10,14,18,22-hexaene Chemical compound CC(C)=CCCC(C)=CCCC(C)=CCCC=C(C)CCC=C(C)CCC=C(C)C YYGNTYWPHWGJRM-UHFFFAOYSA-N 0.000 claims description 11
- BHEOSNUKNHRBNM-UHFFFAOYSA-N Tetramethylsqualene Natural products CC(=C)C(C)CCC(=C)C(C)CCC(C)=CCCC=C(C)CCC(C)C(=C)CCC(C)C(C)=C BHEOSNUKNHRBNM-UHFFFAOYSA-N 0.000 claims description 11
- 108010067390 Viral Proteins Proteins 0.000 claims description 11
- WNROFYMDJYEPJX-UHFFFAOYSA-K aluminium hydroxide Chemical compound [OH-].[OH-].[OH-].[Al+3] WNROFYMDJYEPJX-UHFFFAOYSA-K 0.000 claims description 11
- PRAKJMSDJKAYCZ-UHFFFAOYSA-N dodecahydrosqualene Natural products CC(C)CCCC(C)CCCC(C)CCCCC(C)CCCC(C)CCCC(C)C PRAKJMSDJKAYCZ-UHFFFAOYSA-N 0.000 claims description 11
- 238000011534 incubation Methods 0.000 claims description 11
- 238000002955 isolation Methods 0.000 claims description 11
- 230000002265 prevention Effects 0.000 claims description 11
- 229940031439 squalene Drugs 0.000 claims description 11
- TUHBEKDERLKLEC-UHFFFAOYSA-N squalene Natural products CC(=CCCC(=CCCC(=CCCC=C(/C)CCC=C(/C)CC=C(C)C)C)C)C TUHBEKDERLKLEC-UHFFFAOYSA-N 0.000 claims description 11
- 108090000288 Glycoproteins Proteins 0.000 claims description 10
- 102000003886 Glycoproteins Human genes 0.000 claims description 10
- 108090000631 Trypsin Proteins 0.000 claims description 10
- 102000004142 Trypsin Human genes 0.000 claims description 10
- 239000012588 trypsin Substances 0.000 claims description 10
- AZDRQVAHHNSJOQ-UHFFFAOYSA-N alumane Chemical class [AlH3] AZDRQVAHHNSJOQ-UHFFFAOYSA-N 0.000 claims description 9
- 235000001815 DL-alpha-tocopherol Nutrition 0.000 claims description 8
- 108010034546 Serratia marcescens nuclease Proteins 0.000 claims description 8
- 229940035032 monophosphoryl lipid a Drugs 0.000 claims description 8
- 229940046166 oligodeoxynucleotide Drugs 0.000 claims description 8
- 239000001397 quillaja saponaria molina bark Substances 0.000 claims description 8
- 229930182490 saponin Natural products 0.000 claims description 8
- 150000007949 saponins Chemical class 0.000 claims description 8
- 108091032973 (ribonucleotides)n+m Proteins 0.000 claims description 7
- 239000011627 DL-alpha-tocopherol Substances 0.000 claims description 7
- 102000000447 Peptide-N4-(N-acetyl-beta-glucosaminyl) Asparagine Amidase Human genes 0.000 claims description 7
- 108010055817 Peptide-N4-(N-acetyl-beta-glucosaminyl) Asparagine Amidase Proteins 0.000 claims description 7
- GVJHHUAWPYXKBD-UHFFFAOYSA-N d-alpha-tocopherol Natural products OC1=C(C)C(C)=C2OC(CCCC(C)CCCC(C)CCCC(C)C)(C)CCC2=C1C GVJHHUAWPYXKBD-UHFFFAOYSA-N 0.000 claims description 7
- UYTPUPDQBNUYGX-UHFFFAOYSA-N guanine Chemical group O=C1NC(N)=NC2=C1N=CN2 UYTPUPDQBNUYGX-UHFFFAOYSA-N 0.000 claims description 7
- 229960002751 imiquimod Drugs 0.000 claims description 7
- DOUYETYNHWVLEO-UHFFFAOYSA-N imiquimod Chemical compound C1=CC=CC2=C3N(CC(C)C)C=NC3=C(N)N=C21 DOUYETYNHWVLEO-UHFFFAOYSA-N 0.000 claims description 7
- 210000004962 mammalian cell Anatomy 0.000 claims description 7
- 239000000546 pharmaceutical excipient Substances 0.000 claims description 7
- 238000001556 precipitation Methods 0.000 claims description 7
- 229960000984 tocofersolan Drugs 0.000 claims description 7
- GVJHHUAWPYXKBD-IEOSBIPESA-N α-tocopherol Chemical compound OC1=C(C)C(C)=C2O[C@@](CCC[C@H](C)CCC[C@H](C)CCCC(C)C)(C)CCC2=C1C GVJHHUAWPYXKBD-IEOSBIPESA-N 0.000 claims description 7
- KDCGOANMDULRCW-UHFFFAOYSA-N 7H-purine Chemical compound N1=CNC2=NC=NC2=C1 KDCGOANMDULRCW-UHFFFAOYSA-N 0.000 claims description 6
- 239000002253 acid Substances 0.000 claims description 6
- 238000005903 acid hydrolysis reaction Methods 0.000 claims description 6
- ILRRQNADMUWWFW-UHFFFAOYSA-K aluminium phosphate Chemical compound O1[Al]2OP1(=O)O2 ILRRQNADMUWWFW-UHFFFAOYSA-K 0.000 claims description 6
- 239000003814 drug Substances 0.000 claims description 6
- 238000004949 mass spectrometry Methods 0.000 claims description 6
- 235000010482 polyoxyethylene sorbitan monooleate Nutrition 0.000 claims description 6
- 229920000053 polysorbate 80 Polymers 0.000 claims description 6
- 239000006228 supernatant Substances 0.000 claims description 6
- FFEARJCKVFRZRR-BYPYZUCNSA-N L-methionine Chemical compound CSCC[C@H](N)C(O)=O FFEARJCKVFRZRR-BYPYZUCNSA-N 0.000 claims description 5
- 238000001914 filtration Methods 0.000 claims description 5
- 229930182817 methionine Natural products 0.000 claims description 5
- VGONTNSXDCQUGY-RRKCRQDMSA-N 2'-deoxyinosine Chemical compound C1[C@H](O)[C@@H](CO)O[C@H]1N1C(N=CNC2=O)=C2N=C1 VGONTNSXDCQUGY-RRKCRQDMSA-N 0.000 claims description 4
- 206010061598 Immunodeficiency Diseases 0.000 claims description 4
- 238000000432 density-gradient centrifugation Methods 0.000 claims description 4
- VGONTNSXDCQUGY-UHFFFAOYSA-N desoxyinosine Natural products C1C(O)C(CO)OC1N1C(NC=NC2=O)=C2N=C1 VGONTNSXDCQUGY-UHFFFAOYSA-N 0.000 claims description 4
- 230000003308 immunostimulating effect Effects 0.000 claims description 4
- 230000001376 precipitating effect Effects 0.000 claims description 4
- 238000011321 prophylaxis Methods 0.000 claims description 4
- PRXRUNOAOLTIEF-ADSICKODSA-N Sorbitan trioleate Chemical compound CCCCCCCC\C=C/CCCCCCCC(=O)OC[C@@H](OC(=O)CCCCCCC\C=C/CCCCCCCC)[C@H]1OC[C@H](O)[C@H]1OC(=O)CCCCCCC\C=C/CCCCCCCC PRXRUNOAOLTIEF-ADSICKODSA-N 0.000 claims description 3
- 125000002091 cationic group Chemical group 0.000 claims description 3
- 238000004587 chromatography analysis Methods 0.000 claims description 3
- 238000004132 cross linking Methods 0.000 claims description 3
- 238000000502 dialysis Methods 0.000 claims description 3
- 125000000487 histidyl group Chemical group [H]N([H])C(C(=O)O*)C([H])([H])C1=C([H])N([H])C([H])=N1 0.000 claims description 3
- 230000001939 inductive effect Effects 0.000 claims description 3
- 239000007764 o/w emulsion Substances 0.000 claims description 3
- 239000000244 polyoxyethylene sorbitan monooleate Substances 0.000 claims description 3
- 229940068968 polysorbate 80 Drugs 0.000 claims description 3
- 230000002950 deficient Effects 0.000 claims description 2
- 150000003212 purines Chemical class 0.000 claims description 2
- 230000010076 replication Effects 0.000 claims description 2
- 102400000368 Surface protein Human genes 0.000 claims 2
- 125000000151 cysteine group Chemical class N[C@@H](CS)C(=O)* 0.000 claims 1
- 230000002255 enzymatic effect Effects 0.000 claims 1
- 230000010355 oscillation Effects 0.000 claims 1
- 241000700605 Viruses Species 0.000 description 53
- WSFSSNUMVMOOMR-UHFFFAOYSA-N Formaldehyde Chemical compound O=C WSFSSNUMVMOOMR-UHFFFAOYSA-N 0.000 description 40
- 239000000427 antigen Substances 0.000 description 29
- 108091007433 antigens Proteins 0.000 description 29
- 102000036639 antigens Human genes 0.000 description 29
- 108010034529 leucyl-lysine Proteins 0.000 description 27
- 210000004027 cell Anatomy 0.000 description 25
- 125000003275 alpha amino acid group Chemical group 0.000 description 24
- WHUUTDBJXJRKMK-UHFFFAOYSA-N Glutamic acid Natural products OC(=O)C(N)CCC(O)=O WHUUTDBJXJRKMK-UHFFFAOYSA-N 0.000 description 23
- 201000003176 Severe Acute Respiratory Syndrome Diseases 0.000 description 23
- 210000002966 serum Anatomy 0.000 description 23
- 101710139375 Corneodesmosin Proteins 0.000 description 22
- VPZXBVLAVMBEQI-UHFFFAOYSA-N glycyl-DL-alpha-alanine Natural products OC(=O)C(C)NC(=O)CN VPZXBVLAVMBEQI-UHFFFAOYSA-N 0.000 description 22
- 230000028993 immune response Effects 0.000 description 21
- 238000003556 assay Methods 0.000 description 19
- 229940037003 alum Drugs 0.000 description 18
- 108010050848 glycylleucine Proteins 0.000 description 18
- 208000015181 infectious disease Diseases 0.000 description 18
- 241000699670 Mus sp. Species 0.000 description 17
- 108010037850 glycylvaline Proteins 0.000 description 17
- 230000004044 response Effects 0.000 description 17
- SITLTJHOQZFJGG-UHFFFAOYSA-N N-L-alpha-glutamyl-L-valine Natural products CC(C)C(C(O)=O)NC(=O)C(N)CCC(O)=O SITLTJHOQZFJGG-UHFFFAOYSA-N 0.000 description 16
- 102000018697 Membrane Proteins Human genes 0.000 description 15
- 108010079364 N-glycylalanine Proteins 0.000 description 15
- 239000000243 solution Substances 0.000 description 15
- 108020004414 DNA Proteins 0.000 description 14
- YBAFDPFAUTYYRW-UHFFFAOYSA-N N-L-alpha-glutamyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CCC(O)=O YBAFDPFAUTYYRW-UHFFFAOYSA-N 0.000 description 14
- 238000002360 preparation method Methods 0.000 description 14
- XBGGUPMXALFZOT-UHFFFAOYSA-N glycyl-L-tyrosine hemihydrate Natural products NCC(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 XBGGUPMXALFZOT-UHFFFAOYSA-N 0.000 description 13
- 238000012360 testing method Methods 0.000 description 13
- FADYJNXDPBKVCA-UHFFFAOYSA-N L-Phenylalanyl-L-lysin Natural products NCCCCC(C(O)=O)NC(=O)C(N)CC1=CC=CC=C1 FADYJNXDPBKVCA-UHFFFAOYSA-N 0.000 description 12
- 241000880493 Leptailurus serval Species 0.000 description 12
- 108091005774 SARS-CoV-2 proteins Proteins 0.000 description 12
- 101000629318 Severe acute respiratory syndrome coronavirus 2 Spike glycoprotein Proteins 0.000 description 12
- 239000000306 component Substances 0.000 description 12
- 230000003053 immunization Effects 0.000 description 12
- 238000002649 immunization Methods 0.000 description 12
- 230000002163 immunogen Effects 0.000 description 12
- 230000002458 infectious effect Effects 0.000 description 12
- 150000007523 nucleic acids Chemical group 0.000 description 12
- 108010061238 threonyl-glycine Proteins 0.000 description 12
- XMBSYZWANAQXEV-UHFFFAOYSA-N N-alpha-L-glutamyl-L-phenylalanine Natural products OC(=O)CCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 XMBSYZWANAQXEV-UHFFFAOYSA-N 0.000 description 11
- KOSRFJWDECSPRO-UHFFFAOYSA-N alpha-L-glutamyl-L-glutamic acid Natural products OC(=O)CCC(N)C(=O)NC(CCC(O)=O)C(O)=O KOSRFJWDECSPRO-UHFFFAOYSA-N 0.000 description 11
- 108010062796 arginyllysine Proteins 0.000 description 11
- 108010068265 aspartyltyrosine Proteins 0.000 description 11
- 108010016616 cysteinylglycine Proteins 0.000 description 11
- 238000001514 detection method Methods 0.000 description 11
- 238000002474 experimental method Methods 0.000 description 11
- 108010055341 glutamyl-glutamic acid Proteins 0.000 description 11
- 108010057821 leucylproline Proteins 0.000 description 11
- 239000000463 material Substances 0.000 description 11
- 108010090894 prolylleucine Proteins 0.000 description 11
- 238000001228 spectrum Methods 0.000 description 11
- 108010004073 cysteinylcysteine Proteins 0.000 description 10
- 229940088679 drug related substance Drugs 0.000 description 10
- 230000000694 effects Effects 0.000 description 10
- 108010092114 histidylphenylalanine Proteins 0.000 description 10
- 108010051242 phenylalanylserine Proteins 0.000 description 10
- 108010051110 tyrosyl-lysine Proteins 0.000 description 10
- 108010073969 valyllysine Proteins 0.000 description 10
- RCFDOSNHHZGBOY-UHFFFAOYSA-N L-isoleucyl-L-alanine Natural products CCC(C)C(N)C(=O)NC(C)C(O)=O RCFDOSNHHZGBOY-UHFFFAOYSA-N 0.000 description 9
- CQGSYZCULZMEDE-UHFFFAOYSA-N Leu-Gln-Pro Natural products CC(C)CC(N)C(=O)NC(CCC(N)=O)C(=O)N1CCCC1C(O)=O CQGSYZCULZMEDE-UHFFFAOYSA-N 0.000 description 9
- 108090001074 Nucleocapsid Proteins Proteins 0.000 description 9
- 239000008186 active pharmaceutical agent Substances 0.000 description 9
- 108010047495 alanylglycine Proteins 0.000 description 9
- 108010087924 alanylproline Proteins 0.000 description 9
- 108010047857 aspartylglycine Proteins 0.000 description 9
- 239000008280 blood Substances 0.000 description 9
- FSXRLASFHBWESK-UHFFFAOYSA-N dipeptide phenylalanyl-tyrosine Natural products C=1C=C(O)C=CC=1CC(C(O)=O)NC(=O)C(N)CC1=CC=CC=C1 FSXRLASFHBWESK-UHFFFAOYSA-N 0.000 description 9
- 238000002415 sodium dodecyl sulfate polyacrylamide gel electrophoresis Methods 0.000 description 9
- VCSABYLVNWQYQE-UHFFFAOYSA-N Ala-Lys-Lys Natural products NCCCCC(NC(=O)C(N)C)C(=O)NC(CCCCN)C(O)=O VCSABYLVNWQYQE-UHFFFAOYSA-N 0.000 description 8
- JBCLFWXMTIKCCB-UHFFFAOYSA-N H-Gly-Phe-OH Natural products NCC(=O)NC(C(O)=O)CC1=CC=CC=C1 JBCLFWXMTIKCCB-UHFFFAOYSA-N 0.000 description 8
- RXGLHDWAZQECBI-SRVKXCTJSA-N Leu-Leu-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O RXGLHDWAZQECBI-SRVKXCTJSA-N 0.000 description 8
- 108010069205 aspartyl-phenylalanine Proteins 0.000 description 8
- 210000004369 blood Anatomy 0.000 description 8
- 108010063718 gamma-glutamylaspartic acid Proteins 0.000 description 8
- 108010038320 lysylphenylalanine Proteins 0.000 description 8
- 239000002773 nucleotide Substances 0.000 description 8
- 239000012071 phase Substances 0.000 description 8
- 230000001681 protective effect Effects 0.000 description 8
- 108010048818 seryl-histidine Proteins 0.000 description 8
- GLWFAWNYGWBMOC-SRVKXCTJSA-N Asn-Leu-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O GLWFAWNYGWBMOC-SRVKXCTJSA-N 0.000 description 7
- KFKWRHQBZQICHA-STQMWFEESA-N L-leucyl-L-phenylalanine Natural products CC(C)C[C@H](N)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 KFKWRHQBZQICHA-STQMWFEESA-N 0.000 description 7
- JIHDFWWRYHSAQB-GUBZILKMSA-N Leu-Ser-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O JIHDFWWRYHSAQB-GUBZILKMSA-N 0.000 description 7
- YSPZCHGIWAQVKQ-AVGNSLFASA-N Lys-Pro-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CCCCN YSPZCHGIWAQVKQ-AVGNSLFASA-N 0.000 description 7
- 108010090054 Membrane Glycoproteins Proteins 0.000 description 7
- 102000012750 Membrane Glycoproteins Human genes 0.000 description 7
- KZNQNBZMBZJQJO-UHFFFAOYSA-N N-glycyl-L-proline Natural products NCC(=O)N1CCCC1C(O)=O KZNQNBZMBZJQJO-UHFFFAOYSA-N 0.000 description 7
- 241000702619 Porcine parvovirus Species 0.000 description 7
- 238000004458 analytical method Methods 0.000 description 7
- 108010038633 aspartylglutamate Proteins 0.000 description 7
- 238000002869 basic local alignment search tool Methods 0.000 description 7
- 108010049041 glutamylalanine Proteins 0.000 description 7
- 230000005847 immunogenicity Effects 0.000 description 7
- 108010044056 leucyl-phenylalanine Proteins 0.000 description 7
- 108010031719 prolyl-serine Proteins 0.000 description 7
- 230000009467 reduction Effects 0.000 description 7
- 238000003998 size exclusion chromatography high performance liquid chromatography Methods 0.000 description 7
- VCSABYLVNWQYQE-SRVKXCTJSA-N Ala-Lys-Lys Chemical compound NCCCC[C@H](NC(=O)[C@@H](N)C)C(=O)N[C@@H](CCCCN)C(O)=O VCSABYLVNWQYQE-SRVKXCTJSA-N 0.000 description 6
- CREYEAPXISDKSB-FQPOAREZSA-N Ala-Thr-Tyr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O CREYEAPXISDKSB-FQPOAREZSA-N 0.000 description 6
- IETUUAHKCHOQHP-KZVJFYERSA-N Ala-Thr-Val Chemical compound CC(C)[C@H](NC(=O)[C@@H](NC(=O)[C@H](C)N)[C@@H](C)O)C(O)=O IETUUAHKCHOQHP-KZVJFYERSA-N 0.000 description 6
- GCTANJIJJROSLH-GVARAGBVSA-N Ala-Tyr-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](C)N GCTANJIJJROSLH-GVARAGBVSA-N 0.000 description 6
- HGCNKOLVKRAVHD-UHFFFAOYSA-N L-Met-L-Phe Natural products CSCCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 HGCNKOLVKRAVHD-UHFFFAOYSA-N 0.000 description 6
- SITWEMZOJNKJCH-UHFFFAOYSA-N L-alanine-L-arginine Natural products CC(N)C(=O)NC(C(O)=O)CCCNC(N)=N SITWEMZOJNKJCH-UHFFFAOYSA-N 0.000 description 6
- YOZCKMXHBYKOMQ-IHRRRGAJSA-N Leu-Arg-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCCN)C(=O)O)N YOZCKMXHBYKOMQ-IHRRRGAJSA-N 0.000 description 6
- LXKNSJLSGPNHSK-KKUMJFAQSA-N Leu-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)O)N LXKNSJLSGPNHSK-KKUMJFAQSA-N 0.000 description 6
- OVZLLFONXILPDZ-VOAKCMCISA-N Leu-Lys-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OVZLLFONXILPDZ-VOAKCMCISA-N 0.000 description 6
- FDBTVENULFNTAL-XQQFMLRXSA-N Leu-Val-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N FDBTVENULFNTAL-XQQFMLRXSA-N 0.000 description 6
- LPAJOCKCPRZEAG-MNXVOIDGSA-N Lys-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCCCN LPAJOCKCPRZEAG-MNXVOIDGSA-N 0.000 description 6
- 241001465754 Metazoa Species 0.000 description 6
- 108010006232 Neuraminidase Proteins 0.000 description 6
- 102000005348 Neuraminidase Human genes 0.000 description 6
- 102000011931 Nucleoproteins Human genes 0.000 description 6
- 108010061100 Nucleoproteins Proteins 0.000 description 6
- DOXQMJCSSYZSNM-BZSNNMDCSA-N Phe-Lys-Leu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O DOXQMJCSSYZSNM-BZSNNMDCSA-N 0.000 description 6
- NIEWSKWFURSECR-FOHZUACHSA-N Thr-Gly-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O NIEWSKWFURSECR-FOHZUACHSA-N 0.000 description 6
- MECLEFZMPPOEAC-VOAKCMCISA-N Thr-Leu-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)O)N)O MECLEFZMPPOEAC-VOAKCMCISA-N 0.000 description 6
- YGCDFAJJCRVQKU-RCWTZXSCSA-N Thr-Pro-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)[C@@H](C)O YGCDFAJJCRVQKU-RCWTZXSCSA-N 0.000 description 6
- HGJRMXOWUWVUOA-GVXVVHGQSA-N Val-Leu-Gln Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](C(C)C)N HGJRMXOWUWVUOA-GVXVVHGQSA-N 0.000 description 6
- 108010076324 alanyl-glycyl-glycine Proteins 0.000 description 6
- 108010005233 alanylglutamic acid Proteins 0.000 description 6
- 108010044940 alanylglutamine Proteins 0.000 description 6
- 238000005119 centrifugation Methods 0.000 description 6
- 239000003795 chemical substances by application Substances 0.000 description 6
- 239000011248 coating agent Substances 0.000 description 6
- 238000000576 coating method Methods 0.000 description 6
- LOKCTEFSRHRXRJ-UHFFFAOYSA-I dipotassium trisodium dihydrogen phosphate hydrogen phosphate dichloride Chemical compound P(=O)(O)(O)[O-].[K+].P(=O)(O)([O-])[O-].[Na+].[Na+].[Cl-].[K+].[Cl-].[Na+] LOKCTEFSRHRXRJ-UHFFFAOYSA-I 0.000 description 6
- 108010015792 glycyllysine Proteins 0.000 description 6
- 108010077515 glycylproline Proteins 0.000 description 6
- 239000003446 ligand Substances 0.000 description 6
- 108010003700 lysyl aspartic acid Proteins 0.000 description 6
- 108010064235 lysylglycine Proteins 0.000 description 6
- 108010054155 lysyllysine Proteins 0.000 description 6
- 108010017391 lysylvaline Proteins 0.000 description 6
- 230000035772 mutation Effects 0.000 description 6
- 108010012581 phenylalanylglutamate Proteins 0.000 description 6
- 239000002953 phosphate buffered saline Substances 0.000 description 6
- 239000011148 porous material Substances 0.000 description 6
- 108010079317 prolyl-tyrosine Proteins 0.000 description 6
- 108010029020 prolylglycine Proteins 0.000 description 6
- 108020003175 receptors Proteins 0.000 description 6
- 102000005962 receptors Human genes 0.000 description 6
- XQNRANMFRPCFFW-GCJQMDKQSA-N Ala-Thr-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](C)N)O XQNRANMFRPCFFW-GCJQMDKQSA-N 0.000 description 5
- 208000002109 Argyria Diseases 0.000 description 5
- HBUJSDCLZCXXCW-YDHLFZDLSA-N Asn-Val-Tyr Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 HBUJSDCLZCXXCW-YDHLFZDLSA-N 0.000 description 5
- CLUMZOKVGUWUFD-CIUDSAMLSA-N Asp-Leu-Asn Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O CLUMZOKVGUWUFD-CIUDSAMLSA-N 0.000 description 5
- 241000711573 Coronaviridae Species 0.000 description 5
- PHONAZGUEGIOEM-GLLZPBPUSA-N Glu-Glu-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PHONAZGUEGIOEM-GLLZPBPUSA-N 0.000 description 5
- HILMIYALTUQTRC-XVKPBYJWSA-N Glu-Gly-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O HILMIYALTUQTRC-XVKPBYJWSA-N 0.000 description 5
- 101710154606 Hemagglutinin Proteins 0.000 description 5
- OVPYIUNCVSOVNF-ZPFDUUQYSA-N Ile-Gln-Pro Natural products CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N1CCC[C@H]1C(O)=O OVPYIUNCVSOVNF-ZPFDUUQYSA-N 0.000 description 5
- GLYJPWIRLBAIJH-UHFFFAOYSA-N Ile-Lys-Pro Natural products CCC(C)C(N)C(=O)NC(CCCCN)C(=O)N1CCCC1C(O)=O GLYJPWIRLBAIJH-UHFFFAOYSA-N 0.000 description 5
- ZYVTXBXHIKGZMD-QSFUFRPTSA-N Ile-Val-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(=O)N)C(=O)O)N ZYVTXBXHIKGZMD-QSFUFRPTSA-N 0.000 description 5
- SUPVSFFZWVOEOI-UHFFFAOYSA-N Leu-Ala-Tyr Natural products CC(C)CC(N)C(=O)NC(C)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 SUPVSFFZWVOEOI-UHFFFAOYSA-N 0.000 description 5
- RGUXWMDNCPMQFB-YUMQZZPRSA-N Leu-Ser-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O RGUXWMDNCPMQFB-YUMQZZPRSA-N 0.000 description 5
- DAYQSYGBCUKVKT-VOAKCMCISA-N Leu-Thr-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(O)=O DAYQSYGBCUKVKT-VOAKCMCISA-N 0.000 description 5
- JMNRXRPBHFGXQX-GUBZILKMSA-N Lys-Ser-Glu Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O JMNRXRPBHFGXQX-GUBZILKMSA-N 0.000 description 5
- 108010002311 N-glycylglutamic acid Proteins 0.000 description 5
- 101710093908 Outer capsid protein VP4 Proteins 0.000 description 5
- 101710135467 Outer capsid protein sigma-1 Proteins 0.000 description 5
- OSBADCBXAMSPQD-YESZJQIVSA-N Phe-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CC=CC=C2)N OSBADCBXAMSPQD-YESZJQIVSA-N 0.000 description 5
- GPLWGAYGROGDEN-BZSNNMDCSA-N Phe-Phe-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O GPLWGAYGROGDEN-BZSNNMDCSA-N 0.000 description 5
- 101710176177 Protein A56 Proteins 0.000 description 5
- WBINSDOPZHQPPM-AVGNSLFASA-N Ser-Glu-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CO)N)O WBINSDOPZHQPPM-AVGNSLFASA-N 0.000 description 5
- HBTCFCHYALPXME-HTFCKZLJSA-N Ser-Ile-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O HBTCFCHYALPXME-HTFCKZLJSA-N 0.000 description 5
- AHOLTQCAVBSUDP-PPCPHDFISA-N Thr-Ile-Lys Chemical compound CC[C@H](C)[C@H](NC(=O)[C@@H](N)[C@@H](C)O)C(=O)N[C@@H](CCCCN)C(O)=O AHOLTQCAVBSUDP-PPCPHDFISA-N 0.000 description 5
- FWTFAZKJORVTIR-VZFHVOOUSA-N Thr-Ser-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O FWTFAZKJORVTIR-VZFHVOOUSA-N 0.000 description 5
- VBMOVTMNHWPZJR-SUSMZKCASA-N Thr-Thr-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O VBMOVTMNHWPZJR-SUSMZKCASA-N 0.000 description 5
- PWONLXBUSVIZPH-RHYQMDGZSA-N Thr-Val-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCCN)C(=O)O)N)O PWONLXBUSVIZPH-RHYQMDGZSA-N 0.000 description 5
- QHDXUYOYTPWCSK-RCOVLWMOSA-N Val-Asp-Gly Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)NCC(=O)O)N QHDXUYOYTPWCSK-RCOVLWMOSA-N 0.000 description 5
- AEMPCGRFEZTWIF-IHRRRGAJSA-N Val-Leu-Lys Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(O)=O AEMPCGRFEZTWIF-IHRRRGAJSA-N 0.000 description 5
- LTTQCQRTSHJPPL-ZKWXMUAHSA-N Val-Ser-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(=O)O)C(=O)O)N LTTQCQRTSHJPPL-ZKWXMUAHSA-N 0.000 description 5
- VVIZITNVZUAEMI-DLOVCJGASA-N Val-Val-Gln Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCC(N)=O VVIZITNVZUAEMI-DLOVCJGASA-N 0.000 description 5
- 108010024078 alanyl-glycyl-serine Proteins 0.000 description 5
- 230000000890 antigenic effect Effects 0.000 description 5
- 238000013459 approach Methods 0.000 description 5
- 108010013835 arginine glutamate Proteins 0.000 description 5
- 108010077245 asparaginyl-proline Proteins 0.000 description 5
- 238000006243 chemical reaction Methods 0.000 description 5
- 238000013461 design Methods 0.000 description 5
- 238000011156 evaluation Methods 0.000 description 5
- 108010089804 glycyl-threonine Proteins 0.000 description 5
- 108010010147 glycylglutamine Proteins 0.000 description 5
- 108010081551 glycylphenylalanine Proteins 0.000 description 5
- 239000000185 hemagglutinin Substances 0.000 description 5
- 206010022000 influenza Diseases 0.000 description 5
- 239000002609 medium Substances 0.000 description 5
- 238000006386 neutralization reaction Methods 0.000 description 5
- 238000012856 packing Methods 0.000 description 5
- 108010084572 phenylalanyl-valine Proteins 0.000 description 5
- 108010073025 phenylalanylphenylalanine Proteins 0.000 description 5
- 108010070643 prolylglutamic acid Proteins 0.000 description 5
- 238000000746 purification Methods 0.000 description 5
- 238000011002 quantification Methods 0.000 description 5
- 238000011160 research Methods 0.000 description 5
- 108010071207 serylmethionine Proteins 0.000 description 5
- 238000001542 size-exclusion chromatography Methods 0.000 description 5
- QDRGPQWIVZNJQD-CIUDSAMLSA-N Ala-Arg-Gln Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(O)=O QDRGPQWIVZNJQD-CIUDSAMLSA-N 0.000 description 4
- HQJKCXHQNUCKMY-GHCJXIJMSA-N Ala-Ile-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](C)N HQJKCXHQNUCKMY-GHCJXIJMSA-N 0.000 description 4
- 108010011667 Ala-Phe-Ala Proteins 0.000 description 4
- XRUJOVRWNMBAAA-NHCYSSNCSA-N Ala-Phe-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=CC=C1 XRUJOVRWNMBAAA-NHCYSSNCSA-N 0.000 description 4
- RTZCUEHYUQZIDE-WHFBIAKZSA-N Ala-Ser-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O RTZCUEHYUQZIDE-WHFBIAKZSA-N 0.000 description 4
- KTXKIYXZQFWJKB-VZFHVOOUSA-N Ala-Thr-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O KTXKIYXZQFWJKB-VZFHVOOUSA-N 0.000 description 4
- UVTGNSWSRSCPLP-UHFFFAOYSA-N Arg-Tyr Natural products NC(CCNC(=N)N)C(=O)NC(Cc1ccc(O)cc1)C(=O)O UVTGNSWSRSCPLP-UHFFFAOYSA-N 0.000 description 4
- DAPLJWATMAXPPZ-CIUDSAMLSA-N Asn-Asn-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CC(N)=O DAPLJWATMAXPPZ-CIUDSAMLSA-N 0.000 description 4
- KXFCBAHYSLJCCY-ZLUOBGJFSA-N Asn-Asn-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O KXFCBAHYSLJCCY-ZLUOBGJFSA-N 0.000 description 4
- BSBNNPICFPXDNH-SRVKXCTJSA-N Asn-Phe-Cys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC(=O)N)N BSBNNPICFPXDNH-SRVKXCTJSA-N 0.000 description 4
- PIABYSIYPGLLDQ-XVSYOHENSA-N Asn-Thr-Phe Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O PIABYSIYPGLLDQ-XVSYOHENSA-N 0.000 description 4
- CBHVAFXKOYAHOY-NHCYSSNCSA-N Asn-Val-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O CBHVAFXKOYAHOY-NHCYSSNCSA-N 0.000 description 4
- LMIWYCWRJVMAIQ-NHCYSSNCSA-N Asn-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)N)N LMIWYCWRJVMAIQ-NHCYSSNCSA-N 0.000 description 4
- BUVNWKQBMZLCDW-UGYAYLCHSA-N Asp-Asn-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O BUVNWKQBMZLCDW-UGYAYLCHSA-N 0.000 description 4
- 108090000317 Chymotrypsin Proteins 0.000 description 4
- 241000494545 Cordyline virus 2 Species 0.000 description 4
- 208000001528 Coronaviridae Infections Diseases 0.000 description 4
- JLZCAZJGWNRXCI-XKBZYTNZSA-N Cys-Thr-Glu Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O JLZCAZJGWNRXCI-XKBZYTNZSA-N 0.000 description 4
- ALNKNYKSZPSLBD-ZDLURKLDSA-N Cys-Thr-Gly Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O ALNKNYKSZPSLBD-ZDLURKLDSA-N 0.000 description 4
- KZZYVYWSXMFYEC-DCAQKATOSA-N Cys-Val-Leu Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O KZZYVYWSXMFYEC-DCAQKATOSA-N 0.000 description 4
- 238000012286 ELISA Assay Methods 0.000 description 4
- 102000004190 Enzymes Human genes 0.000 description 4
- 108090000790 Enzymes Proteins 0.000 description 4
- JJKKWYQVHRUSDG-GUBZILKMSA-N Glu-Ala-Lys Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCCN)C(O)=O JJKKWYQVHRUSDG-GUBZILKMSA-N 0.000 description 4
- QJCKNLPMTPXXEM-AUTRQRHGSA-N Glu-Glu-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCC(O)=O QJCKNLPMTPXXEM-AUTRQRHGSA-N 0.000 description 4
- RAUDKMVXNOWDLS-WDSKDSINSA-N Glu-Gly-Ser Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O RAUDKMVXNOWDLS-WDSKDSINSA-N 0.000 description 4
- IVGJYOOGJLFKQE-AVGNSLFASA-N Glu-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)O)N IVGJYOOGJLFKQE-AVGNSLFASA-N 0.000 description 4
- XLFHCWHXKSFVIB-BQBZGAKWSA-N Gly-Gln-Gln Chemical compound NCC(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O XLFHCWHXKSFVIB-BQBZGAKWSA-N 0.000 description 4
- QITBQGJOXQYMOA-ZETCQYMHSA-N Gly-Gly-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)CNC(=O)CN QITBQGJOXQYMOA-ZETCQYMHSA-N 0.000 description 4
- OLPPXYMMIARYAL-QMMMGPOBSA-N Gly-Gly-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)CNC(=O)CN OLPPXYMMIARYAL-QMMMGPOBSA-N 0.000 description 4
- UUYBFNKHOCJCHT-VHSXEESVSA-N Gly-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)CN UUYBFNKHOCJCHT-VHSXEESVSA-N 0.000 description 4
- FGPLUIQCSKGLTI-WDSKDSINSA-N Gly-Ser-Glu Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O FGPLUIQCSKGLTI-WDSKDSINSA-N 0.000 description 4
- WNGHUXFWEWTKAO-YUMQZZPRSA-N Gly-Ser-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)CN WNGHUXFWEWTKAO-YUMQZZPRSA-N 0.000 description 4
- SBVMXEZQJVUARN-XPUUQOCRSA-N Gly-Val-Ser Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O SBVMXEZQJVUARN-XPUUQOCRSA-N 0.000 description 4
- BDFCIKANUNMFGB-PMVVWTBXSA-N His-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC1=CN=CN1 BDFCIKANUNMFGB-PMVVWTBXSA-N 0.000 description 4
- GVKKVHNRTUFCCE-BJDJZHNGSA-N Ile-Leu-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)O)N GVKKVHNRTUFCCE-BJDJZHNGSA-N 0.000 description 4
- HJDZMPFEXINXLO-QPHKQPEJSA-N Ile-Thr-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N HJDZMPFEXINXLO-QPHKQPEJSA-N 0.000 description 4
- 108010065920 Insulin Lispro Proteins 0.000 description 4
- 229940124956 Ixiaro Drugs 0.000 description 4
- BQSLGJHIAGOZCD-CIUDSAMLSA-N Leu-Ala-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O BQSLGJHIAGOZCD-CIUDSAMLSA-N 0.000 description 4
- IGUOAYLTQJLPPD-DCAQKATOSA-N Leu-Asn-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N IGUOAYLTQJLPPD-DCAQKATOSA-N 0.000 description 4
- ILJREDZFPHTUIE-GUBZILKMSA-N Leu-Asp-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O ILJREDZFPHTUIE-GUBZILKMSA-N 0.000 description 4
- ULXYQAJWJGLCNR-YUMQZZPRSA-N Leu-Asp-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O ULXYQAJWJGLCNR-YUMQZZPRSA-N 0.000 description 4
- PVMPDMIKUVNOBD-CIUDSAMLSA-N Leu-Asp-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O PVMPDMIKUVNOBD-CIUDSAMLSA-N 0.000 description 4
- PRZVBIAOPFGAQF-SRVKXCTJSA-N Leu-Glu-Met Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCSC)C(O)=O PRZVBIAOPFGAQF-SRVKXCTJSA-N 0.000 description 4
- LAPSXOAUPNOINL-YUMQZZPRSA-N Leu-Gly-Asp Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC(O)=O LAPSXOAUPNOINL-YUMQZZPRSA-N 0.000 description 4
- KOSWSHVQIVTVQF-ZPFDUUQYSA-N Leu-Ile-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O KOSWSHVQIVTVQF-ZPFDUUQYSA-N 0.000 description 4
- XVZCXCTYGHPNEM-UHFFFAOYSA-N Leu-Leu-Pro Natural products CC(C)CC(N)C(=O)NC(CC(C)C)C(=O)N1CCCC1C(O)=O XVZCXCTYGHPNEM-UHFFFAOYSA-N 0.000 description 4
- FBNPMTNBFFAMMH-AVGNSLFASA-N Leu-Val-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N FBNPMTNBFFAMMH-AVGNSLFASA-N 0.000 description 4
- FBNPMTNBFFAMMH-UHFFFAOYSA-N Leu-Val-Arg Natural products CC(C)CC(N)C(=O)NC(C(C)C)C(=O)NC(C(O)=O)CCCN=C(N)N FBNPMTNBFFAMMH-UHFFFAOYSA-N 0.000 description 4
- VHFFQUSNFFIZBT-CIUDSAMLSA-N Lys-Ala-Cys Chemical compound C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCCCN)N VHFFQUSNFFIZBT-CIUDSAMLSA-N 0.000 description 4
- YVMQJGWLHRWMDF-MNXVOIDGSA-N Lys-Gln-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCCCN)N YVMQJGWLHRWMDF-MNXVOIDGSA-N 0.000 description 4
- ITWQLSZTLBKWJM-YUMQZZPRSA-N Lys-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)[C@@H](N)CCCCN ITWQLSZTLBKWJM-YUMQZZPRSA-N 0.000 description 4
- MUXNCRWTWBMNHX-SRVKXCTJSA-N Lys-Leu-Asp Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O MUXNCRWTWBMNHX-SRVKXCTJSA-N 0.000 description 4
- OBZHNHBAAVEWKI-DCAQKATOSA-N Lys-Pro-Asn Chemical compound NCCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O OBZHNHBAAVEWKI-DCAQKATOSA-N 0.000 description 4
- DLCAXBGXGOVUCD-PPCPHDFISA-N Lys-Thr-Ile Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O DLCAXBGXGOVUCD-PPCPHDFISA-N 0.000 description 4
- WINFHLHJTRGLCV-BZSNNMDCSA-N Lys-Tyr-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CCCCN)C(O)=O)CC1=CC=C(O)C=C1 WINFHLHJTRGLCV-BZSNNMDCSA-N 0.000 description 4
- RIPJMCFGQHGHNP-RHYQMDGZSA-N Lys-Val-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CCCCN)N)O RIPJMCFGQHGHNP-RHYQMDGZSA-N 0.000 description 4
- PESQCPHRXOFIPX-UHFFFAOYSA-N N-L-methionyl-L-tyrosine Natural products CSCCC(N)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 PESQCPHRXOFIPX-UHFFFAOYSA-N 0.000 description 4
- HHOOEUSPFGPZFP-QWRGUYRKSA-N Phe-Asn-Gly Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O HHOOEUSPFGPZFP-QWRGUYRKSA-N 0.000 description 4
- JIYJYFIXQTYDNF-YDHLFZDLSA-N Phe-Asn-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC1=CC=CC=C1)N JIYJYFIXQTYDNF-YDHLFZDLSA-N 0.000 description 4
- ALHULIGNEXGFRM-QWRGUYRKSA-N Phe-Cys-Gly Chemical compound OC(=O)CNC(=O)[C@H](CS)NC(=O)[C@@H](N)CC1=CC=CC=C1 ALHULIGNEXGFRM-QWRGUYRKSA-N 0.000 description 4
- KRYSMKKRRRWOCZ-QEWYBTABSA-N Phe-Ile-Glu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(O)=O KRYSMKKRRRWOCZ-QEWYBTABSA-N 0.000 description 4
- BPCLGWHVPVTTFM-QWRGUYRKSA-N Phe-Ser-Gly Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)NCC(O)=O BPCLGWHVPVTTFM-QWRGUYRKSA-N 0.000 description 4
- 241000831652 Salinivibrio sharmensis Species 0.000 description 4
- YQHZVYJAGWMHES-ZLUOBGJFSA-N Ser-Ala-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O YQHZVYJAGWMHES-ZLUOBGJFSA-N 0.000 description 4
- PZZJMBYSYAKYPK-UWJYBYFXSA-N Ser-Ala-Tyr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O PZZJMBYSYAKYPK-UWJYBYFXSA-N 0.000 description 4
- SWSRFJZZMNLMLY-ZKWXMUAHSA-N Ser-Asp-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O SWSRFJZZMNLMLY-ZKWXMUAHSA-N 0.000 description 4
- XXXAXOWMBOKTRN-XPUUQOCRSA-N Ser-Gly-Val Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O XXXAXOWMBOKTRN-XPUUQOCRSA-N 0.000 description 4
- KCGIREHVWRXNDH-GARJFASQSA-N Ser-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CO)N KCGIREHVWRXNDH-GARJFASQSA-N 0.000 description 4
- SOACHCFYJMCMHC-BWBBJGPYSA-N Ser-Thr-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CO)N)O SOACHCFYJMCMHC-BWBBJGPYSA-N 0.000 description 4
- 101001110297 Severe acute respiratory syndrome coronavirus 2 Replicase polyprotein 1ab Proteins 0.000 description 4
- DDPVJPIGACCMEH-XQXXSGGOSA-N Thr-Ala-Gln Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(O)=O DDPVJPIGACCMEH-XQXXSGGOSA-N 0.000 description 4
- QWMPARMKIDVBLV-VZFHVOOUSA-N Thr-Cys-Ala Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@@H](C)C(O)=O QWMPARMKIDVBLV-VZFHVOOUSA-N 0.000 description 4
- LYGKYFKSZTUXGZ-ZDLURKLDSA-N Thr-Cys-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CS)C(=O)NCC(O)=O LYGKYFKSZTUXGZ-ZDLURKLDSA-N 0.000 description 4
- UDQBCBUXAQIZAK-GLLZPBPUSA-N Thr-Glu-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O UDQBCBUXAQIZAK-GLLZPBPUSA-N 0.000 description 4
- NCXVJIQMWSGRHY-KXNHARMFSA-N Thr-Leu-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N)O NCXVJIQMWSGRHY-KXNHARMFSA-N 0.000 description 4
- KZSYAEWQMJEGRZ-RHYQMDGZSA-N Thr-Leu-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O KZSYAEWQMJEGRZ-RHYQMDGZSA-N 0.000 description 4
- WNQJTLATMXYSEL-OEAJRASXSA-N Thr-Phe-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O WNQJTLATMXYSEL-OEAJRASXSA-N 0.000 description 4
- JAWUQFCGNVEDRN-MEYUZBJRSA-N Thr-Tyr-Leu Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC(C)C)C(=O)O)N)O JAWUQFCGNVEDRN-MEYUZBJRSA-N 0.000 description 4
- KPMIQCXJDVKWKO-IFFSRLJSSA-N Thr-Val-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O KPMIQCXJDVKWKO-IFFSRLJSSA-N 0.000 description 4
- HTHCZRWCFXMENJ-KKUMJFAQSA-N Tyr-Arg-Glu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O HTHCZRWCFXMENJ-KKUMJFAQSA-N 0.000 description 4
- DMWNPLOERDAHSY-MEYUZBJRSA-N Tyr-Leu-Thr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O DMWNPLOERDAHSY-MEYUZBJRSA-N 0.000 description 4
- AOIZTZRWMSPPAY-KAOXEZKKSA-N Tyr-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)N)O AOIZTZRWMSPPAY-KAOXEZKKSA-N 0.000 description 4
- WQOHKVRQDLNDIL-YJRXYDGGSA-N Tyr-Thr-Ser Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O WQOHKVRQDLNDIL-YJRXYDGGSA-N 0.000 description 4
- DNOOLPROHJWCSQ-RCWTZXSCSA-N Val-Arg-Thr Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O DNOOLPROHJWCSQ-RCWTZXSCSA-N 0.000 description 4
- BYOHPUZJVXWHAE-BYULHYEWSA-N Val-Asn-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)N)C(=O)O)N BYOHPUZJVXWHAE-BYULHYEWSA-N 0.000 description 4
- CGGVNFJRZJUVAE-BYULHYEWSA-N Val-Asp-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N CGGVNFJRZJUVAE-BYULHYEWSA-N 0.000 description 4
- OVLIFGQSBSNGHY-KKHAAJSZSA-N Val-Asp-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](C(C)C)N)O OVLIFGQSBSNGHY-KKHAAJSZSA-N 0.000 description 4
- OQWNEUXPKHIEJO-NRPADANISA-N Val-Glu-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CO)C(=O)O)N OQWNEUXPKHIEJO-NRPADANISA-N 0.000 description 4
- DJEVQCWNMQOABE-RCOVLWMOSA-N Val-Gly-Asp Chemical compound CC(C)[C@@H](C(=O)NCC(=O)N[C@@H](CC(=O)O)C(=O)O)N DJEVQCWNMQOABE-RCOVLWMOSA-N 0.000 description 4
- WFENBJPLZMPVAX-XVKPBYJWSA-N Val-Gly-Glu Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(O)=O WFENBJPLZMPVAX-XVKPBYJWSA-N 0.000 description 4
- PIFJAFRUVWZRKR-QMMMGPOBSA-N Val-Gly-Gly Chemical compound CC(C)[C@H]([NH3+])C(=O)NCC(=O)NCC([O-])=O PIFJAFRUVWZRKR-QMMMGPOBSA-N 0.000 description 4
- FEXILLGKGGTLRI-NHCYSSNCSA-N Val-Leu-Asn Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](C(C)C)N FEXILLGKGGTLRI-NHCYSSNCSA-N 0.000 description 4
- LYERIXUFCYVFFX-GVXVVHGQSA-N Val-Leu-Glu Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N LYERIXUFCYVFFX-GVXVVHGQSA-N 0.000 description 4
- UMPVMAYCLYMYGA-ONGXEEELSA-N Val-Leu-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O UMPVMAYCLYMYGA-ONGXEEELSA-N 0.000 description 4
- KISFXYYRKKNLOP-IHRRRGAJSA-N Val-Phe-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)O)N KISFXYYRKKNLOP-IHRRRGAJSA-N 0.000 description 4
- DFQZDQPLWBSFEJ-LSJOCFKGSA-N Val-Val-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(=O)N)C(=O)O)N DFQZDQPLWBSFEJ-LSJOCFKGSA-N 0.000 description 4
- AEFJNECXZCODJM-UWVGGRQHSA-N Val-Val-Gly Chemical compound CC(C)[C@H]([NH3+])C(=O)N[C@@H](C(C)C)C(=O)NCC([O-])=O AEFJNECXZCODJM-UWVGGRQHSA-N 0.000 description 4
- AOILQMZPNLUXCM-AVGNSLFASA-N Val-Val-Lys Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCCN AOILQMZPNLUXCM-AVGNSLFASA-N 0.000 description 4
- 238000007792 addition Methods 0.000 description 4
- 108010086434 alanyl-seryl-glycine Proteins 0.000 description 4
- 108010041407 alanylaspartic acid Proteins 0.000 description 4
- 108010008355 arginyl-glutamine Proteins 0.000 description 4
- 108010060035 arginylproline Proteins 0.000 description 4
- 108010093581 aspartyl-proline Proteins 0.000 description 4
- 239000000872 buffer Substances 0.000 description 4
- 238000004422 calculation algorithm Methods 0.000 description 4
- 229960002376 chymotrypsin Drugs 0.000 description 4
- 238000010790 dilution Methods 0.000 description 4
- 239000012895 dilution Substances 0.000 description 4
- 229940088598 enzyme Drugs 0.000 description 4
- 108010057083 glutamyl-aspartyl-leucine Proteins 0.000 description 4
- 108010087823 glycyltyrosine Proteins 0.000 description 4
- 238000004128 high performance liquid chromatography Methods 0.000 description 4
- 108010025306 histidylleucine Proteins 0.000 description 4
- 230000036039 immunity Effects 0.000 description 4
- 239000012535 impurity Substances 0.000 description 4
- 238000000338 in vitro Methods 0.000 description 4
- 239000010410 layer Substances 0.000 description 4
- 108010044311 leucyl-glycyl-glycine Proteins 0.000 description 4
- 108010047926 leucyl-lysyl-tyrosine Proteins 0.000 description 4
- 238000004811 liquid chromatography Methods 0.000 description 4
- 239000011159 matrix material Substances 0.000 description 4
- 108010005942 methionylglycine Proteins 0.000 description 4
- 108010068488 methionylphenylalanine Proteins 0.000 description 4
- 108010070409 phenylalanyl-glycyl-glycine Proteins 0.000 description 4
- 238000004321 preservation Methods 0.000 description 4
- 239000000047 product Substances 0.000 description 4
- 108700042769 prolyl-leucyl-glycine Proteins 0.000 description 4
- 108010026333 seryl-proline Proteins 0.000 description 4
- 238000007920 subcutaneous administration Methods 0.000 description 4
- 208000024891 symptom Diseases 0.000 description 4
- 108010003137 tyrosyltyrosine Proteins 0.000 description 4
- 210000002845 virion Anatomy 0.000 description 4
- XVZCXCTYGHPNEM-IHRRRGAJSA-N (2s)-1-[(2s)-2-[[(2s)-2-amino-4-methylpentanoyl]amino]-4-methylpentanoyl]pyrrolidine-2-carboxylic acid Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(O)=O XVZCXCTYGHPNEM-IHRRRGAJSA-N 0.000 description 3
- WEVYAHXRMPXWCK-UHFFFAOYSA-N Acetonitrile Chemical compound CC#N WEVYAHXRMPXWCK-UHFFFAOYSA-N 0.000 description 3
- AAQGRPOPTAUUBM-ZLUOBGJFSA-N Ala-Ala-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(O)=O AAQGRPOPTAUUBM-ZLUOBGJFSA-N 0.000 description 3
- YLTKNGYYPIWKHZ-ACZMJKKPSA-N Ala-Ala-Glu Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCC(O)=O YLTKNGYYPIWKHZ-ACZMJKKPSA-N 0.000 description 3
- UGLPMYSCWHTZQU-AUTRQRHGSA-N Ala-Ala-Tyr Chemical compound C[C@H]([NH3+])C(=O)N[C@@H](C)C(=O)N[C@H](C([O-])=O)CC1=CC=C(O)C=C1 UGLPMYSCWHTZQU-AUTRQRHGSA-N 0.000 description 3
- ZDYNWWQXFRUOEO-XDTLVQLUSA-N Ala-Gln-Tyr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O ZDYNWWQXFRUOEO-XDTLVQLUSA-N 0.000 description 3
- GGNHBHYDMUDXQB-KBIXCLLPSA-N Ala-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@H](C)N GGNHBHYDMUDXQB-KBIXCLLPSA-N 0.000 description 3
- YHKANGMVQWRMAP-DCAQKATOSA-N Ala-Leu-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N YHKANGMVQWRMAP-DCAQKATOSA-N 0.000 description 3
- NOGFDULFCFXBHB-CIUDSAMLSA-N Ala-Leu-Cys Chemical compound C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CS)C(=O)O)N NOGFDULFCFXBHB-CIUDSAMLSA-N 0.000 description 3
- SUMYEVXWCAYLLJ-GUBZILKMSA-N Ala-Leu-Gln Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O SUMYEVXWCAYLLJ-GUBZILKMSA-N 0.000 description 3
- MEFILNJXAVSUTO-JXUBOQSCSA-N Ala-Leu-Thr Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MEFILNJXAVSUTO-JXUBOQSCSA-N 0.000 description 3
- OQWQTGBOFPJOIF-DLOVCJGASA-N Ala-Lys-His Chemical compound C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N OQWQTGBOFPJOIF-DLOVCJGASA-N 0.000 description 3
- PMQXMXAASGFUDX-SRVKXCTJSA-N Ala-Lys-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](C)N)CCCCN PMQXMXAASGFUDX-SRVKXCTJSA-N 0.000 description 3
- VHEVVUZDDUCAKU-FXQIFTODSA-N Ala-Met-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(O)=O)C(O)=O VHEVVUZDDUCAKU-FXQIFTODSA-N 0.000 description 3
- CJQAEJMHBAOQHA-DLOVCJGASA-N Ala-Phe-Asn Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(=O)N)C(=O)O)N CJQAEJMHBAOQHA-DLOVCJGASA-N 0.000 description 3
- MTDDMSUUXNQMKK-BPNCWPANSA-N Ala-Tyr-Arg Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N MTDDMSUUXNQMKK-BPNCWPANSA-N 0.000 description 3
- BVLPIIBTWIYOML-ZKWXMUAHSA-N Ala-Val-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O BVLPIIBTWIYOML-ZKWXMUAHSA-N 0.000 description 3
- VHAQSYHSDKERBS-XPUUQOCRSA-N Ala-Val-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O VHAQSYHSDKERBS-XPUUQOCRSA-N 0.000 description 3
- SGYSTDWPNPKJPP-GUBZILKMSA-N Arg-Ala-Arg Chemical compound NC(=N)NCCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O SGYSTDWPNPKJPP-GUBZILKMSA-N 0.000 description 3
- OTOXOKCIIQLMFH-KZVJFYERSA-N Arg-Ala-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCN=C(N)N OTOXOKCIIQLMFH-KZVJFYERSA-N 0.000 description 3
- PNQWAUXQDBIJDY-GUBZILKMSA-N Arg-Glu-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O PNQWAUXQDBIJDY-GUBZILKMSA-N 0.000 description 3
- NKBQZKVMKJJDLX-SRVKXCTJSA-N Arg-Glu-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O NKBQZKVMKJJDLX-SRVKXCTJSA-N 0.000 description 3
- GOWZVQXTHUCNSQ-NHCYSSNCSA-N Arg-Glu-Val Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O GOWZVQXTHUCNSQ-NHCYSSNCSA-N 0.000 description 3
- CLICCYPMVFGUOF-IHRRRGAJSA-N Arg-Lys-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O CLICCYPMVFGUOF-IHRRRGAJSA-N 0.000 description 3
- ISJWBVIYRBAXEB-CIUDSAMLSA-N Arg-Ser-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(O)=O ISJWBVIYRBAXEB-CIUDSAMLSA-N 0.000 description 3
- FOWOZYAWODIRFZ-JYJNAYRXSA-N Arg-Tyr-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CCCN=C(N)N)N FOWOZYAWODIRFZ-JYJNAYRXSA-N 0.000 description 3
- ULBHWNVWSCJLCO-NHCYSSNCSA-N Arg-Val-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCN=C(N)N ULBHWNVWSCJLCO-NHCYSSNCSA-N 0.000 description 3
- DXVMJJNAOVECBA-WHFBIAKZSA-N Asn-Gly-Asn Chemical compound NC(=O)C[C@H](N)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O DXVMJJNAOVECBA-WHFBIAKZSA-N 0.000 description 3
- RAQMSGVCGSJKCL-FOHZUACHSA-N Asn-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC(N)=O RAQMSGVCGSJKCL-FOHZUACHSA-N 0.000 description 3
- JLNFZLNDHONLND-GARJFASQSA-N Asn-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)N)N JLNFZLNDHONLND-GARJFASQSA-N 0.000 description 3
- AYOAHKWVQLNPDM-HJGDQZAQSA-N Asn-Lys-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O AYOAHKWVQLNPDM-HJGDQZAQSA-N 0.000 description 3
- MJIJBEYEHBKTIM-BYULHYEWSA-N Asn-Val-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N MJIJBEYEHBKTIM-BYULHYEWSA-N 0.000 description 3
- XWSIYTYNLKCLJB-CIUDSAMLSA-N Asp-Lys-Asn Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O XWSIYTYNLKCLJB-CIUDSAMLSA-N 0.000 description 3
- IOXWDLNHXZOXQP-FXQIFTODSA-N Asp-Met-Ser Chemical compound CSCC[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC(=O)O)N IOXWDLNHXZOXQP-FXQIFTODSA-N 0.000 description 3
- BRRPVTUFESPTCP-ACZMJKKPSA-N Asp-Ser-Glu Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O BRRPVTUFESPTCP-ACZMJKKPSA-N 0.000 description 3
- MJJIHRWNWSQTOI-VEVYYDQMSA-N Asp-Thr-Arg Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O MJJIHRWNWSQTOI-VEVYYDQMSA-N 0.000 description 3
- ZUNMTUPRQMWMHX-LSJOCFKGSA-N Asp-Val-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C(C)C)C(O)=O ZUNMTUPRQMWMHX-LSJOCFKGSA-N 0.000 description 3
- 241000283707 Capra Species 0.000 description 3
- 241001502567 Chikungunya virus Species 0.000 description 3
- 206010011224 Cough Diseases 0.000 description 3
- 241000699800 Cricetinae Species 0.000 description 3
- CPTUXCUWQIBZIF-ZLUOBGJFSA-N Cys-Asn-Ser Chemical compound SC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O CPTUXCUWQIBZIF-ZLUOBGJFSA-N 0.000 description 3
- JRZMCSIUYGSJKP-ZKWXMUAHSA-N Cys-Val-Asn Chemical compound SC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O JRZMCSIUYGSJKP-ZKWXMUAHSA-N 0.000 description 3
- LFQSCWFLJHTTHZ-UHFFFAOYSA-N Ethanol Chemical compound CCO LFQSCWFLJHTTHZ-UHFFFAOYSA-N 0.000 description 3
- INFBPLSHYFALDE-ACZMJKKPSA-N Gln-Asn-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(O)=O INFBPLSHYFALDE-ACZMJKKPSA-N 0.000 description 3
- FGYPOQPQTUNESW-IUCAKERBSA-N Gln-Gly-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CCC(=O)N)N FGYPOQPQTUNESW-IUCAKERBSA-N 0.000 description 3
- HWEINOMSWQSJDC-SRVKXCTJSA-N Gln-Leu-Arg Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O HWEINOMSWQSJDC-SRVKXCTJSA-N 0.000 description 3
- MLSKFHLRFVGNLL-WDCWCFNPSA-N Gln-Leu-Thr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MLSKFHLRFVGNLL-WDCWCFNPSA-N 0.000 description 3
- OREPWMPAUWIIAM-ZPFDUUQYSA-N Gln-Pro-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CCC(=O)N)N OREPWMPAUWIIAM-ZPFDUUQYSA-N 0.000 description 3
- SOEXCCGNHQBFPV-DLOVCJGASA-N Gln-Val-Val Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C(C)C)C(O)=O SOEXCCGNHQBFPV-DLOVCJGASA-N 0.000 description 3
- CGOHAEBMDSEKFB-FXQIFTODSA-N Glu-Glu-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O CGOHAEBMDSEKFB-FXQIFTODSA-N 0.000 description 3
- QXDXIXFSFHUYAX-MNXVOIDGSA-N Glu-Ile-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCC(O)=O QXDXIXFSFHUYAX-MNXVOIDGSA-N 0.000 description 3
- NJCALAAIGREHDR-WDCWCFNPSA-N Glu-Leu-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O NJCALAAIGREHDR-WDCWCFNPSA-N 0.000 description 3
- CBEUFCJRFNZMCU-SRVKXCTJSA-N Glu-Met-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(O)=O CBEUFCJRFNZMCU-SRVKXCTJSA-N 0.000 description 3
- KIEICAOUSNYOLM-NRPADANISA-N Glu-Val-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O KIEICAOUSNYOLM-NRPADANISA-N 0.000 description 3
- GQGAFTPXAPKSCF-WHFBIAKZSA-N Gly-Ala-Cys Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@@H](CS)C(=O)O GQGAFTPXAPKSCF-WHFBIAKZSA-N 0.000 description 3
- VSVZIEVNUYDAFR-YUMQZZPRSA-N Gly-Ala-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)CN VSVZIEVNUYDAFR-YUMQZZPRSA-N 0.000 description 3
- RPLLQZBOVIVGMX-QWRGUYRKSA-N Gly-Asp-Phe Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O RPLLQZBOVIVGMX-QWRGUYRKSA-N 0.000 description 3
- IDOGEHIWMJMAHT-BYPYZUCNSA-N Gly-Gly-Cys Chemical compound NCC(=O)NCC(=O)N[C@@H](CS)C(O)=O IDOGEHIWMJMAHT-BYPYZUCNSA-N 0.000 description 3
- SWQALSGKVLYKDT-UHFFFAOYSA-N Gly-Ile-Ala Natural products NCC(=O)NC(C(C)CC)C(=O)NC(C)C(O)=O SWQALSGKVLYKDT-UHFFFAOYSA-N 0.000 description 3
- VBOBNHSVQKKTOT-YUMQZZPRSA-N Gly-Lys-Ala Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O VBOBNHSVQKKTOT-YUMQZZPRSA-N 0.000 description 3
- MTBIKIMYHUWBRX-QWRGUYRKSA-N Gly-Phe-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)CN MTBIKIMYHUWBRX-QWRGUYRKSA-N 0.000 description 3
- HFPVRZWORNJRRC-UWVGGRQHSA-N Gly-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)CN HFPVRZWORNJRRC-UWVGGRQHSA-N 0.000 description 3
- ZLCLYFGMKFCDCN-XPUUQOCRSA-N Gly-Ser-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CO)NC(=O)CN)C(O)=O ZLCLYFGMKFCDCN-XPUUQOCRSA-N 0.000 description 3
- TVTZEOHWHUVYCG-KYNKHSRBSA-N Gly-Thr-Thr Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O TVTZEOHWHUVYCG-KYNKHSRBSA-N 0.000 description 3
- PNUFMLXHOLFRLD-KBPBESRZSA-N Gly-Tyr-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CC=C(O)C=C1 PNUFMLXHOLFRLD-KBPBESRZSA-N 0.000 description 3
- 101710114810 Glycoprotein Proteins 0.000 description 3
- 206010020751 Hypersensitivity Diseases 0.000 description 3
- JRYQSFOFUFXPTB-RWRJDSDZSA-N Ile-Gln-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N JRYQSFOFUFXPTB-RWRJDSDZSA-N 0.000 description 3
- FZWVCYCYWCLQDH-NHCYSSNCSA-N Ile-Leu-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)NCC(=O)O)N FZWVCYCYWCLQDH-NHCYSSNCSA-N 0.000 description 3
- FFAUOCITXBMRBT-YTFOTSKYSA-N Ile-Lys-Ile Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O FFAUOCITXBMRBT-YTFOTSKYSA-N 0.000 description 3
- JZNVOBUNTWNZPW-GHCJXIJMSA-N Ile-Ser-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(=O)O)C(=O)O)N JZNVOBUNTWNZPW-GHCJXIJMSA-N 0.000 description 3
- PXKACEXYLPBMAD-JBDRJPRFSA-N Ile-Ser-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)O)N PXKACEXYLPBMAD-JBDRJPRFSA-N 0.000 description 3
- YCKPUHHMCFSUMD-IUKAMOBKSA-N Ile-Thr-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N YCKPUHHMCFSUMD-IUKAMOBKSA-N 0.000 description 3
- PMGDADKJMCOXHX-UHFFFAOYSA-N L-Arginyl-L-glutamin-acetat Natural products NC(=N)NCCCC(N)C(=O)NC(CCC(N)=O)C(O)=O PMGDADKJMCOXHX-UHFFFAOYSA-N 0.000 description 3
- LHSGPCFBGJHPCY-UHFFFAOYSA-N L-leucine-L-tyrosine Natural products CC(C)CC(N)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 LHSGPCFBGJHPCY-UHFFFAOYSA-N 0.000 description 3
- LJHGALIOHLRRQN-DCAQKATOSA-N Leu-Ala-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N LJHGALIOHLRRQN-DCAQKATOSA-N 0.000 description 3
- KVRKAGGMEWNURO-CIUDSAMLSA-N Leu-Ala-Cys Chemical compound C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC(C)C)N KVRKAGGMEWNURO-CIUDSAMLSA-N 0.000 description 3
- KWTVLKBOQATPHJ-SRVKXCTJSA-N Leu-Ala-Lys Chemical compound C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(C)C)N KWTVLKBOQATPHJ-SRVKXCTJSA-N 0.000 description 3
- HASRFYOMVPJRPU-SRVKXCTJSA-N Leu-Arg-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCC(O)=O)C(O)=O HASRFYOMVPJRPU-SRVKXCTJSA-N 0.000 description 3
- TWQIYNGNYNJUFM-NHCYSSNCSA-N Leu-Asn-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O TWQIYNGNYNJUFM-NHCYSSNCSA-N 0.000 description 3
- MYGQXVYRZMKRDB-SRVKXCTJSA-N Leu-Asp-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN MYGQXVYRZMKRDB-SRVKXCTJSA-N 0.000 description 3
- PPBKJAQJAUHZKX-SRVKXCTJSA-N Leu-Cys-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@H](C(O)=O)CC(C)C PPBKJAQJAUHZKX-SRVKXCTJSA-N 0.000 description 3
- KAFOIVJDVSZUMD-UHFFFAOYSA-N Leu-Gln-Gln Natural products CC(C)CC(N)C(=O)NC(CCC(N)=O)C(=O)NC(CCC(N)=O)C(O)=O KAFOIVJDVSZUMD-UHFFFAOYSA-N 0.000 description 3
- CQGSYZCULZMEDE-SRVKXCTJSA-N Leu-Gln-Pro Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N1CCC[C@H]1C(O)=O CQGSYZCULZMEDE-SRVKXCTJSA-N 0.000 description 3
- HPBCTWSUJOGJSH-MNXVOIDGSA-N Leu-Glu-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O HPBCTWSUJOGJSH-MNXVOIDGSA-N 0.000 description 3
- POZULHZYLPGXMR-ONGXEEELSA-N Leu-Gly-Val Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O POZULHZYLPGXMR-ONGXEEELSA-N 0.000 description 3
- FKQPWMZLIIATBA-AJNGGQMLSA-N Leu-Lys-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O FKQPWMZLIIATBA-AJNGGQMLSA-N 0.000 description 3
- BGZCJDGBBUUBHA-KKUMJFAQSA-N Leu-Lys-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O BGZCJDGBBUUBHA-KKUMJFAQSA-N 0.000 description 3
- RTIRBWJPYJYTLO-MELADBBJSA-N Leu-Lys-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N1CCC[C@@H]1C(=O)O)N RTIRBWJPYJYTLO-MELADBBJSA-N 0.000 description 3
- GNRPTBRHRRZCMA-RWMBFGLXSA-N Leu-Met-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCSC)C(=O)N1CCC[C@@H]1C(=O)O)N GNRPTBRHRRZCMA-RWMBFGLXSA-N 0.000 description 3
- YWKNKRAKOCLOLH-OEAJRASXSA-N Leu-Phe-Thr Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H]([C@@H](C)O)C(O)=O)CC1=CC=CC=C1 YWKNKRAKOCLOLH-OEAJRASXSA-N 0.000 description 3
- PPGBXYKMUMHFBF-KATARQTJSA-N Leu-Ser-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PPGBXYKMUMHFBF-KATARQTJSA-N 0.000 description 3
- ICYRCNICGBJLGM-HJGDQZAQSA-N Leu-Thr-Asp Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(O)=O ICYRCNICGBJLGM-HJGDQZAQSA-N 0.000 description 3
- LCNASHSOFMRYFO-WDCWCFNPSA-N Leu-Thr-Gln Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCC(N)=O LCNASHSOFMRYFO-WDCWCFNPSA-N 0.000 description 3
- QWWPYKKLXWOITQ-VOAKCMCISA-N Leu-Thr-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(C)C QWWPYKKLXWOITQ-VOAKCMCISA-N 0.000 description 3
- KLSUAWUZBMAZCL-RHYQMDGZSA-N Leu-Thr-Pro Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@H]1C(O)=O KLSUAWUZBMAZCL-RHYQMDGZSA-N 0.000 description 3
- ISSAURVGLGAPDK-KKUMJFAQSA-N Leu-Tyr-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O ISSAURVGLGAPDK-KKUMJFAQSA-N 0.000 description 3
- RDFIVFHPOSOXMW-ACRUOGEOSA-N Leu-Tyr-Phe Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O RDFIVFHPOSOXMW-ACRUOGEOSA-N 0.000 description 3
- NFLFJGGKOHYZJF-BJDJZHNGSA-N Lys-Ala-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCCN NFLFJGGKOHYZJF-BJDJZHNGSA-N 0.000 description 3
- KCXUCYYZNZFGLL-SRVKXCTJSA-N Lys-Ala-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O KCXUCYYZNZFGLL-SRVKXCTJSA-N 0.000 description 3
- LZWNAOIMTLNMDW-NHCYSSNCSA-N Lys-Asn-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCCCN)N LZWNAOIMTLNMDW-NHCYSSNCSA-N 0.000 description 3
- NKKFVJRLCCUJNA-QWRGUYRKSA-N Lys-Gly-Lys Chemical compound NCCCC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCCN NKKFVJRLCCUJNA-QWRGUYRKSA-N 0.000 description 3
- RFQATBGBLDAKGI-VHSXEESVSA-N Lys-Gly-Pro Chemical compound C1C[C@@H](N(C1)C(=O)CNC(=O)[C@H](CCCCN)N)C(=O)O RFQATBGBLDAKGI-VHSXEESVSA-N 0.000 description 3
- HAUUXTXKJNVIFY-ONGXEEELSA-N Lys-Gly-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O HAUUXTXKJNVIFY-ONGXEEELSA-N 0.000 description 3
- LJADEBULDNKJNK-IHRRRGAJSA-N Lys-Leu-Val Chemical compound CC(C)C[C@H](NC(=O)[C@@H](N)CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O LJADEBULDNKJNK-IHRRRGAJSA-N 0.000 description 3
- ALGGDNMLQNFVIZ-SRVKXCTJSA-N Lys-Lys-Asp Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(=O)O)C(=O)O)N ALGGDNMLQNFVIZ-SRVKXCTJSA-N 0.000 description 3
- YRNRVKTYDSLKMD-KKUMJFAQSA-N Lys-Ser-Tyr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O YRNRVKTYDSLKMD-KKUMJFAQSA-N 0.000 description 3
- MEQLGHAMAUPOSJ-DCAQKATOSA-N Lys-Ser-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O MEQLGHAMAUPOSJ-DCAQKATOSA-N 0.000 description 3
- MDDUIRLQCYVRDO-NHCYSSNCSA-N Lys-Val-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCCN MDDUIRLQCYVRDO-NHCYSSNCSA-N 0.000 description 3
- ZMYHJISLFYTQGK-FXQIFTODSA-N Met-Asp-Asn Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O ZMYHJISLFYTQGK-FXQIFTODSA-N 0.000 description 3
- WUGMRIBZSVSJNP-UHFFFAOYSA-N N-L-alanyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)C)C(O)=O)=CNC2=C1 WUGMRIBZSVSJNP-UHFFFAOYSA-N 0.000 description 3
- XZFYRXDAULDNFX-UHFFFAOYSA-N N-L-cysteinyl-L-phenylalanine Natural products SCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 XZFYRXDAULDNFX-UHFFFAOYSA-N 0.000 description 3
- AUEJLPRZGVVDNU-UHFFFAOYSA-N N-L-tyrosyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CC1=CC=C(O)C=C1 AUEJLPRZGVVDNU-UHFFFAOYSA-N 0.000 description 3
- 101001028244 Onchocerca volvulus Fatty-acid and retinol-binding protein 1 Proteins 0.000 description 3
- LXVFHIBXOWJTKZ-BZSNNMDCSA-N Phe-Asn-Tyr Chemical compound N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O LXVFHIBXOWJTKZ-BZSNNMDCSA-N 0.000 description 3
- SWZKMTDPQXLQRD-XVSYOHENSA-N Phe-Asp-Thr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SWZKMTDPQXLQRD-XVSYOHENSA-N 0.000 description 3
- UEHNWRNADDPYNK-DLOVCJGASA-N Phe-Cys-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CC1=CC=CC=C1)N UEHNWRNADDPYNK-DLOVCJGASA-N 0.000 description 3
- PSBJZLMFFTULDX-IXOXFDKPSA-N Phe-Cys-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CC1=CC=CC=C1)N)O PSBJZLMFFTULDX-IXOXFDKPSA-N 0.000 description 3
- KXUZHWXENMYOHC-QEJZJMRPSA-N Phe-Leu-Ala Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O KXUZHWXENMYOHC-QEJZJMRPSA-N 0.000 description 3
- SMFGCTXUBWEPKM-KBPBESRZSA-N Phe-Leu-Gly Chemical compound OC(=O)CNC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 SMFGCTXUBWEPKM-KBPBESRZSA-N 0.000 description 3
- OQTDZEJJWWAGJT-KKUMJFAQSA-N Phe-Lys-Asp Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O OQTDZEJJWWAGJT-KKUMJFAQSA-N 0.000 description 3
- DSXPMZMSJHOKKK-HJOGWXRNSA-N Phe-Phe-Tyr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O DSXPMZMSJHOKKK-HJOGWXRNSA-N 0.000 description 3
- WEDZFLRYSIDIRX-IHRRRGAJSA-N Phe-Ser-Arg Chemical compound NC(=N)NCCC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CC=CC=C1 WEDZFLRYSIDIRX-IHRRRGAJSA-N 0.000 description 3
- IAOZOFPONWDXNT-IXOXFDKPSA-N Phe-Ser-Thr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O IAOZOFPONWDXNT-IXOXFDKPSA-N 0.000 description 3
- GNRMAQSIROFNMI-IXOXFDKPSA-N Phe-Thr-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O GNRMAQSIROFNMI-IXOXFDKPSA-N 0.000 description 3
- YUPRIZTWANWWHK-DZKIICNBSA-N Phe-Val-Glu Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N YUPRIZTWANWWHK-DZKIICNBSA-N 0.000 description 3
- BRJGUPWVFXKBQI-XUXIUFHCSA-N Pro-Leu-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O BRJGUPWVFXKBQI-XUXIUFHCSA-N 0.000 description 3
- CHYAYDLYYIJCKY-OSUNSFLBSA-N Pro-Thr-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O CHYAYDLYYIJCKY-OSUNSFLBSA-N 0.000 description 3
- 108010079005 RDV peptide Proteins 0.000 description 3
- HRNQLKCLPVKZNE-CIUDSAMLSA-N Ser-Ala-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O HRNQLKCLPVKZNE-CIUDSAMLSA-N 0.000 description 3
- IDQFQFVEWMWRQQ-DLOVCJGASA-N Ser-Ala-Phe Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O IDQFQFVEWMWRQQ-DLOVCJGASA-N 0.000 description 3
- KNZQGAUEYZJUSQ-ZLUOBGJFSA-N Ser-Asp-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CO)N KNZQGAUEYZJUSQ-ZLUOBGJFSA-N 0.000 description 3
- ZHYMUFQVKGJNRM-ZLUOBGJFSA-N Ser-Cys-Asn Chemical compound OC[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@H](C(O)=O)CC(N)=O ZHYMUFQVKGJNRM-ZLUOBGJFSA-N 0.000 description 3
- MOQDPPUMFSMYOM-KKUMJFAQSA-N Ser-His-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CC2=CN=CN2)NC(=O)[C@H](CO)N MOQDPPUMFSMYOM-KKUMJFAQSA-N 0.000 description 3
- SRKMDKACHDVPMD-SRVKXCTJSA-N Ser-Lys-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CO)N SRKMDKACHDVPMD-SRVKXCTJSA-N 0.000 description 3
- UPLYXVPQLJVWMM-KKUMJFAQSA-N Ser-Phe-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O UPLYXVPQLJVWMM-KKUMJFAQSA-N 0.000 description 3
- JLKWJWPDXPKKHI-FXQIFTODSA-N Ser-Pro-Asn Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CO)N)C(=O)N[C@@H](CC(=O)N)C(=O)O JLKWJWPDXPKKHI-FXQIFTODSA-N 0.000 description 3
- BMKNXTJLHFIAAH-CIUDSAMLSA-N Ser-Ser-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O BMKNXTJLHFIAAH-CIUDSAMLSA-N 0.000 description 3
- XQJCEKXQUJQNNK-ZLUOBGJFSA-N Ser-Ser-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O XQJCEKXQUJQNNK-ZLUOBGJFSA-N 0.000 description 3
- DYEGLQRVMBWQLD-IXOXFDKPSA-N Ser-Thr-Phe Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CO)N)O DYEGLQRVMBWQLD-IXOXFDKPSA-N 0.000 description 3
- ZVBCMFDJIMUELU-BZSNNMDCSA-N Ser-Tyr-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)NC(=O)[C@H](CO)N ZVBCMFDJIMUELU-BZSNNMDCSA-N 0.000 description 3
- SGZVZUCRAVSPKQ-FXQIFTODSA-N Ser-Val-Cys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CO)N SGZVZUCRAVSPKQ-FXQIFTODSA-N 0.000 description 3
- JZRYFUGREMECBH-XPUUQOCRSA-N Ser-Val-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O JZRYFUGREMECBH-XPUUQOCRSA-N 0.000 description 3
- SIEBDTCABMZCLF-XGEHTFHBSA-N Ser-Val-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SIEBDTCABMZCLF-XGEHTFHBSA-N 0.000 description 3
- HSWXBJCBYSWBPT-GUBZILKMSA-N Ser-Val-Val Chemical compound CC(C)[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)CO)C(C)C)C(O)=O HSWXBJCBYSWBPT-GUBZILKMSA-N 0.000 description 3
- 101710167605 Spike glycoprotein Proteins 0.000 description 3
- 230000005867 T cell response Effects 0.000 description 3
- YLXAMFZYJTZXFH-OLHMAJIHSA-N Thr-Asn-Asp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N)O YLXAMFZYJTZXFH-OLHMAJIHSA-N 0.000 description 3
- TZKPNGDGUVREEB-FOHZUACHSA-N Thr-Asn-Gly Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O TZKPNGDGUVREEB-FOHZUACHSA-N 0.000 description 3
- OJRNZRROAIAHDL-LKXGYXEUSA-N Thr-Asn-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O OJRNZRROAIAHDL-LKXGYXEUSA-N 0.000 description 3
- MFEBUIFJVPNZLO-OLHMAJIHSA-N Thr-Asp-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O MFEBUIFJVPNZLO-OLHMAJIHSA-N 0.000 description 3
- NLSNVZAREYQMGR-HJGDQZAQSA-N Thr-Asp-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O NLSNVZAREYQMGR-HJGDQZAQSA-N 0.000 description 3
- KRPKYGOFYUNIGM-XVSYOHENSA-N Thr-Asp-Phe Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N)O KRPKYGOFYUNIGM-XVSYOHENSA-N 0.000 description 3
- QQWNRERCGGZOKG-WEDXCCLWSA-N Thr-Gly-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O QQWNRERCGGZOKG-WEDXCCLWSA-N 0.000 description 3
- IMDMLDSVUSMAEJ-HJGDQZAQSA-N Thr-Leu-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O IMDMLDSVUSMAEJ-HJGDQZAQSA-N 0.000 description 3
- MEJHFIOYJHTWMK-VOAKCMCISA-N Thr-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)[C@@H](C)O MEJHFIOYJHTWMK-VOAKCMCISA-N 0.000 description 3
- NZRUWPIYECBYRK-HTUGSXCWSA-N Thr-Phe-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O NZRUWPIYECBYRK-HTUGSXCWSA-N 0.000 description 3
- NQQMWWVVGIXUOX-SVSWQMSJSA-N Thr-Ser-Ile Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O NQQMWWVVGIXUOX-SVSWQMSJSA-N 0.000 description 3
- QJIODPFLAASXJC-JHYOHUSXSA-N Thr-Thr-Phe Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N)O QJIODPFLAASXJC-JHYOHUSXSA-N 0.000 description 3
- COYHRQWNJDJCNA-NUJDXYNKSA-N Thr-Thr-Thr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O COYHRQWNJDJCNA-NUJDXYNKSA-N 0.000 description 3
- LECUEEHKUFYOOV-ZJDVBMNYSA-N Thr-Thr-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)O)NC(=O)[C@@H](N)[C@@H](C)O LECUEEHKUFYOOV-ZJDVBMNYSA-N 0.000 description 3
- ILUOMMDDGREELW-OSUNSFLBSA-N Thr-Val-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)[C@@H](C)O ILUOMMDDGREELW-OSUNSFLBSA-N 0.000 description 3
- BXKWZPXTTSCOMX-AQZXSJQPSA-N Trp-Asn-Thr Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O BXKWZPXTTSCOMX-AQZXSJQPSA-N 0.000 description 3
- PWPJLBWYRTVYQS-PMVMPFDFSA-N Trp-Phe-Leu Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O PWPJLBWYRTVYQS-PMVMPFDFSA-N 0.000 description 3
- GFHYISDTIWZUSU-QWRGUYRKSA-N Tyr-Asn-Gly Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O GFHYISDTIWZUSU-QWRGUYRKSA-N 0.000 description 3
- GHUNBABNQPIETG-MELADBBJSA-N Tyr-Cys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CS)NC(=O)[C@H](CC2=CC=C(C=C2)O)N)C(=O)O GHUNBABNQPIETG-MELADBBJSA-N 0.000 description 3
- HKYTWJOWZTWBQB-AVGNSLFASA-N Tyr-Glu-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 HKYTWJOWZTWBQB-AVGNSLFASA-N 0.000 description 3
- HVPPEXXUDXAPOM-MGHWNKPDSA-N Tyr-Ile-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 HVPPEXXUDXAPOM-MGHWNKPDSA-N 0.000 description 3
- ARJASMXQBRNAGI-YESZJQIVSA-N Tyr-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)N ARJASMXQBRNAGI-YESZJQIVSA-N 0.000 description 3
- FMXFHNSFABRVFZ-BZSNNMDCSA-N Tyr-Lys-Leu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O FMXFHNSFABRVFZ-BZSNNMDCSA-N 0.000 description 3
- BGFCXQXETBDEHP-BZSNNMDCSA-N Tyr-Phe-Asn Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(N)=O)C(O)=O BGFCXQXETBDEHP-BZSNNMDCSA-N 0.000 description 3
- YYLHVUCSTXXKBS-IHRRRGAJSA-N Tyr-Pro-Ser Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O YYLHVUCSTXXKBS-IHRRRGAJSA-N 0.000 description 3
- TYFLVOUZHQUBGM-IHRRRGAJSA-N Tyr-Ser-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 TYFLVOUZHQUBGM-IHRRRGAJSA-N 0.000 description 3
- NWEGIYMHTZXVBP-JSGCOSHPSA-N Tyr-Val-Gly Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O NWEGIYMHTZXVBP-JSGCOSHPSA-N 0.000 description 3
- FZSPNKUFROZBSG-ZKWXMUAHSA-N Val-Ala-Asp Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC(O)=O FZSPNKUFROZBSG-ZKWXMUAHSA-N 0.000 description 3
- SLLKXDSRVAOREO-KZVJFYERSA-N Val-Ala-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](C)NC(=O)[C@H](C(C)C)N)O SLLKXDSRVAOREO-KZVJFYERSA-N 0.000 description 3
- UDNYEPLJTRDMEJ-RCOVLWMOSA-N Val-Asn-Gly Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)NCC(=O)O)N UDNYEPLJTRDMEJ-RCOVLWMOSA-N 0.000 description 3
- LNYOXPDEIZJDEI-NHCYSSNCSA-N Val-Asn-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](C(C)C)N LNYOXPDEIZJDEI-NHCYSSNCSA-N 0.000 description 3
- YODDULVCGFQRFZ-ZKWXMUAHSA-N Val-Asp-Ser Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O YODDULVCGFQRFZ-ZKWXMUAHSA-N 0.000 description 3
- IRLYZKKNBFPQBW-XGEHTFHBSA-N Val-Cys-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](C(C)C)N)O IRLYZKKNBFPQBW-XGEHTFHBSA-N 0.000 description 3
- CPTQYHDSVGVGDZ-UKJIMTQDSA-N Val-Gln-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](C(C)C)N CPTQYHDSVGVGDZ-UKJIMTQDSA-N 0.000 description 3
- XGJLNBNZNMVJRS-NRPADANISA-N Val-Glu-Ala Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O XGJLNBNZNMVJRS-NRPADANISA-N 0.000 description 3
- NZGOVKLVQNOEKP-YDHLFZDLSA-N Val-Phe-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(=O)N)C(=O)O)N NZGOVKLVQNOEKP-YDHLFZDLSA-N 0.000 description 3
- GBIUHAYJGWVNLN-UHFFFAOYSA-N Val-Ser-Pro Natural products CC(C)C(N)C(=O)NC(CO)C(=O)N1CCCC1C(O)=O GBIUHAYJGWVNLN-UHFFFAOYSA-N 0.000 description 3
- UJMCYJKPDFQLHX-XGEHTFHBSA-N Val-Ser-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](C(C)C)N)O UJMCYJKPDFQLHX-XGEHTFHBSA-N 0.000 description 3
- CEKSLIVSNNGOKH-KZVJFYERSA-N Val-Thr-Ala Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C)C(=O)O)NC(=O)[C@H](C(C)C)N)O CEKSLIVSNNGOKH-KZVJFYERSA-N 0.000 description 3
- BZDGLJPROOOUOZ-XGEHTFHBSA-N Val-Thr-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](C(C)C)N)O BZDGLJPROOOUOZ-XGEHTFHBSA-N 0.000 description 3
- WUFHZIRMAZZWRS-OSUNSFLBSA-N Val-Thr-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H]([C@@H](C)O)NC(=O)[C@H](C(C)C)N WUFHZIRMAZZWRS-OSUNSFLBSA-N 0.000 description 3
- GVNLOVJNNDZUHS-RHYQMDGZSA-N Val-Thr-Lys Chemical compound [H]N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(O)=O GVNLOVJNNDZUHS-RHYQMDGZSA-N 0.000 description 3
- JAIZPWVHPQRYOU-ZJDVBMNYSA-N Val-Thr-Thr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](C(C)C)N)O JAIZPWVHPQRYOU-ZJDVBMNYSA-N 0.000 description 3
- QPJSIBAOZBVELU-BPNCWPANSA-N Val-Tyr-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](C(C)C)N QPJSIBAOZBVELU-BPNCWPANSA-N 0.000 description 3
- VTIAEOKFUJJBTC-YDHLFZDLSA-N Val-Tyr-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N VTIAEOKFUJJBTC-YDHLFZDLSA-N 0.000 description 3
- ODUHAIXFXFACDY-SRVKXCTJSA-N Val-Val-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)C(C)C ODUHAIXFXFACDY-SRVKXCTJSA-N 0.000 description 3
- 241000907316 Zika virus Species 0.000 description 3
- 238000002835 absorbance Methods 0.000 description 3
- 238000010521 absorption reaction Methods 0.000 description 3
- 239000004480 active ingredient Substances 0.000 description 3
- 230000010933 acylation Effects 0.000 description 3
- 238000005917 acylation reaction Methods 0.000 description 3
- 108010011559 alanylphenylalanine Proteins 0.000 description 3
- 230000029936 alkylation Effects 0.000 description 3
- 238000005804 alkylation reaction Methods 0.000 description 3
- 108010052670 arginyl-glutamyl-glutamic acid Proteins 0.000 description 3
- 108010072041 arginyl-glycyl-aspartic acid Proteins 0.000 description 3
- 108010040443 aspartyl-aspartic acid Proteins 0.000 description 3
- 108010092854 aspartyllysine Proteins 0.000 description 3
- 238000007796 conventional method Methods 0.000 description 3
- 235000018417 cysteine Nutrition 0.000 description 3
- XUJNEKJLAYXESH-UHFFFAOYSA-N cysteine Natural products SCC(N)C(O)=O XUJNEKJLAYXESH-UHFFFAOYSA-N 0.000 description 3
- 108010060199 cysteinylproline Proteins 0.000 description 3
- 238000013467 fragmentation Methods 0.000 description 3
- 238000006062 fragmentation reaction Methods 0.000 description 3
- 108010008237 glutamyl-valyl-glycine Proteins 0.000 description 3
- 108010078326 glycyl-glycyl-valine Proteins 0.000 description 3
- 230000036541 health Effects 0.000 description 3
- 108010040030 histidinoalanine Proteins 0.000 description 3
- 108010018006 histidylserine Proteins 0.000 description 3
- 210000005260 human cell Anatomy 0.000 description 3
- 230000001965 increasing effect Effects 0.000 description 3
- 108010078274 isoleucylvaline Proteins 0.000 description 3
- 108010090333 leucyl-lysyl-proline Proteins 0.000 description 3
- 108010030617 leucyl-phenylalanyl-valine Proteins 0.000 description 3
- 108010073472 leucyl-prolyl-proline Proteins 0.000 description 3
- 108010012058 leucyltyrosine Proteins 0.000 description 3
- 108010057952 lysyl-phenylalanyl-lysine Proteins 0.000 description 3
- 108010045397 lysyl-tyrosyl-lysine Proteins 0.000 description 3
- 108010009298 lysylglutamic acid Proteins 0.000 description 3
- 108010085203 methionylmethionine Proteins 0.000 description 3
- 102000040430 polynucleotide Human genes 0.000 description 3
- 108091033319 polynucleotide Proteins 0.000 description 3
- 239000002157 polynucleotide Substances 0.000 description 3
- 238000012545 processing Methods 0.000 description 3
- 108010053725 prolylvaline Proteins 0.000 description 3
- 229940048914 protamine Drugs 0.000 description 3
- 230000009145 protein modification Effects 0.000 description 3
- 230000005180 public health Effects 0.000 description 3
- 210000000952 spleen Anatomy 0.000 description 3
- 238000010186 staining Methods 0.000 description 3
- 238000010561 standard procedure Methods 0.000 description 3
- 238000000856 sucrose gradient centrifugation Methods 0.000 description 3
- 108010031491 threonyl-lysyl-glutamic acid Proteins 0.000 description 3
- 210000001519 tissue Anatomy 0.000 description 3
- 108010080629 tryptophan-leucine Proteins 0.000 description 3
- 108010078580 tyrosylleucine Proteins 0.000 description 3
- 238000002255 vaccination Methods 0.000 description 3
- 229960004854 viral vaccine Drugs 0.000 description 3
- 102000040650 (ribonucleotides)n+m Human genes 0.000 description 2
- IKHGUXGNUITLKF-UHFFFAOYSA-N Acetaldehyde Chemical compound CC=O IKHGUXGNUITLKF-UHFFFAOYSA-N 0.000 description 2
- UWQJHXKARZWDIJ-ZLUOBGJFSA-N Ala-Ala-Cys Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CS)C(O)=O UWQJHXKARZWDIJ-ZLUOBGJFSA-N 0.000 description 2
- LGQPPBQRUBVTIF-JBDRJPRFSA-N Ala-Ala-Ile Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O LGQPPBQRUBVTIF-JBDRJPRFSA-N 0.000 description 2
- JBVSSSZFNTXJDX-YTLHQDLWSA-N Ala-Ala-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@H](C)N JBVSSSZFNTXJDX-YTLHQDLWSA-N 0.000 description 2
- WYPUMLRSQMKIJU-BPNCWPANSA-N Ala-Arg-Tyr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O WYPUMLRSQMKIJU-BPNCWPANSA-N 0.000 description 2
- PJNSIUPOXFBHDM-GUBZILKMSA-N Ala-Arg-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(O)=O PJNSIUPOXFBHDM-GUBZILKMSA-N 0.000 description 2
- LBJYAILUMSUTAM-ZLUOBGJFSA-N Ala-Asn-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O LBJYAILUMSUTAM-ZLUOBGJFSA-N 0.000 description 2
- CVGNCMIULZNYES-WHFBIAKZSA-N Ala-Asn-Gly Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O CVGNCMIULZNYES-WHFBIAKZSA-N 0.000 description 2
- STACJSVFHSEZJV-GHCJXIJMSA-N Ala-Asn-Ile Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O STACJSVFHSEZJV-GHCJXIJMSA-N 0.000 description 2
- XQGIRPGAVLFKBJ-CIUDSAMLSA-N Ala-Asn-Lys Chemical compound N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)O XQGIRPGAVLFKBJ-CIUDSAMLSA-N 0.000 description 2
- NHCPCLJZRSIDHS-ZLUOBGJFSA-N Ala-Asp-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O NHCPCLJZRSIDHS-ZLUOBGJFSA-N 0.000 description 2
- ZIWWTZWAKYBUOB-CIUDSAMLSA-N Ala-Asp-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O ZIWWTZWAKYBUOB-CIUDSAMLSA-N 0.000 description 2
- LSLIRHLIUDVNBN-CIUDSAMLSA-N Ala-Asp-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN LSLIRHLIUDVNBN-CIUDSAMLSA-N 0.000 description 2
- YSMPVONNIWLJML-FXQIFTODSA-N Ala-Asp-Pro Chemical compound C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(O)=O YSMPVONNIWLJML-FXQIFTODSA-N 0.000 description 2
- HWPXGQCMZITGFN-XVYDVKMFSA-N Ala-Cys-His Chemical compound C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N HWPXGQCMZITGFN-XVYDVKMFSA-N 0.000 description 2
- CVHJIWVKTFNGHT-ACZMJKKPSA-N Ala-Gln-Cys Chemical compound C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CS)C(=O)O)N CVHJIWVKTFNGHT-ACZMJKKPSA-N 0.000 description 2
- IFTVANMRTIHKML-WDSKDSINSA-N Ala-Gln-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O IFTVANMRTIHKML-WDSKDSINSA-N 0.000 description 2
- YIGLXQRFQVWFEY-NRPADANISA-N Ala-Gln-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O YIGLXQRFQVWFEY-NRPADANISA-N 0.000 description 2
- FUSPCLTUKXQREV-ACZMJKKPSA-N Ala-Glu-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O FUSPCLTUKXQREV-ACZMJKKPSA-N 0.000 description 2
- HXNNRBHASOSVPG-GUBZILKMSA-N Ala-Glu-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O HXNNRBHASOSVPG-GUBZILKMSA-N 0.000 description 2
- ROLXPVQSRCPVGK-XDTLVQLUSA-N Ala-Glu-Tyr Chemical compound N[C@@H](C)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O ROLXPVQSRCPVGK-XDTLVQLUSA-N 0.000 description 2
- ZVFVBBGVOILKPO-WHFBIAKZSA-N Ala-Gly-Ala Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@@H](C)C(O)=O ZVFVBBGVOILKPO-WHFBIAKZSA-N 0.000 description 2
- VGPWRRFOPXVGOH-BYPYZUCNSA-N Ala-Gly-Gly Chemical compound C[C@H](N)C(=O)NCC(=O)NCC(O)=O VGPWRRFOPXVGOH-BYPYZUCNSA-N 0.000 description 2
- NBTGEURICRTMGL-WHFBIAKZSA-N Ala-Gly-Ser Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O NBTGEURICRTMGL-WHFBIAKZSA-N 0.000 description 2
- JDIQCVUDDFENPU-ZKWXMUAHSA-N Ala-His-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CNC=N1 JDIQCVUDDFENPU-ZKWXMUAHSA-N 0.000 description 2
- JEPNLGMEZMCFEX-QSFUFRPTSA-N Ala-His-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](C)N JEPNLGMEZMCFEX-QSFUFRPTSA-N 0.000 description 2
- CFPQUJZTLUQUTJ-HTFCKZLJSA-N Ala-Ile-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@H](C)N CFPQUJZTLUQUTJ-HTFCKZLJSA-N 0.000 description 2
- QCTFKEJEIMPOLW-JURCDPSOSA-N Ala-Ile-Phe Chemical compound C[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 QCTFKEJEIMPOLW-JURCDPSOSA-N 0.000 description 2
- RUQBGIMJQUWXPP-CYDGBPFRSA-N Ala-Leu-Ala-Pro Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(=O)N1CCC[C@H]1C(O)=O RUQBGIMJQUWXPP-CYDGBPFRSA-N 0.000 description 2
- HHRAXZAYZFFRAM-CIUDSAMLSA-N Ala-Leu-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O HHRAXZAYZFFRAM-CIUDSAMLSA-N 0.000 description 2
- AWZKCUCQJNTBAD-SRVKXCTJSA-N Ala-Leu-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCCN AWZKCUCQJNTBAD-SRVKXCTJSA-N 0.000 description 2
- AJBVYEYZVYPFCF-CIUDSAMLSA-N Ala-Lys-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O AJBVYEYZVYPFCF-CIUDSAMLSA-N 0.000 description 2
- NINQYGGNRIBFSC-CIUDSAMLSA-N Ala-Lys-Ser Chemical compound NCCCC[C@H](NC(=O)[C@@H](N)C)C(=O)N[C@@H](CO)C(O)=O NINQYGGNRIBFSC-CIUDSAMLSA-N 0.000 description 2
- MDNAVFBZPROEHO-DCAQKATOSA-N Ala-Lys-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O MDNAVFBZPROEHO-DCAQKATOSA-N 0.000 description 2
- MDNAVFBZPROEHO-UHFFFAOYSA-N Ala-Lys-Val Natural products CC(C)C(C(O)=O)NC(=O)C(NC(=O)C(C)N)CCCCN MDNAVFBZPROEHO-UHFFFAOYSA-N 0.000 description 2
- PVQLRJRPUTXFFX-CIUDSAMLSA-N Ala-Met-Gln Chemical compound CSCC[C@H](NC(=O)[C@H](C)N)C(=O)N[C@@H](CCC(N)=O)C(O)=O PVQLRJRPUTXFFX-CIUDSAMLSA-N 0.000 description 2
- VEAPAYQQLSEKEM-GUBZILKMSA-N Ala-Met-Met Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCSC)C(O)=O VEAPAYQQLSEKEM-GUBZILKMSA-N 0.000 description 2
- DHBKYZYFEXXUAK-ONGXEEELSA-N Ala-Phe-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=CC=C1 DHBKYZYFEXXUAK-ONGXEEELSA-N 0.000 description 2
- ZBLQIYPCUWZSRZ-QEJZJMRPSA-N Ala-Phe-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](C)N)CC1=CC=CC=C1 ZBLQIYPCUWZSRZ-QEJZJMRPSA-N 0.000 description 2
- WQLDNOCHHRISMS-NAKRPEOUSA-N Ala-Pro-Ile Chemical compound [H]N[C@@H](C)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WQLDNOCHHRISMS-NAKRPEOUSA-N 0.000 description 2
- OLVCTPPSXNRGKV-GUBZILKMSA-N Ala-Pro-Pro Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 OLVCTPPSXNRGKV-GUBZILKMSA-N 0.000 description 2
- HOVPGJUNRLMIOZ-CIUDSAMLSA-N Ala-Ser-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@H](C)N HOVPGJUNRLMIOZ-CIUDSAMLSA-N 0.000 description 2
- PEEYDECOOVQKRZ-DLOVCJGASA-N Ala-Ser-Phe Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O PEEYDECOOVQKRZ-DLOVCJGASA-N 0.000 description 2
- ARHJJAAWNWOACN-FXQIFTODSA-N Ala-Ser-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O ARHJJAAWNWOACN-FXQIFTODSA-N 0.000 description 2
- OEVCHROQUIVQFZ-YTLHQDLWSA-N Ala-Thr-Ala Chemical compound C[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](C)C(O)=O OEVCHROQUIVQFZ-YTLHQDLWSA-N 0.000 description 2
- IOFVWPYSRSCWHI-JXUBOQSCSA-N Ala-Thr-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)O)NC(=O)[C@H](C)N IOFVWPYSRSCWHI-JXUBOQSCSA-N 0.000 description 2
- SAHQGRZIQVEJPF-JXUBOQSCSA-N Ala-Thr-Lys Chemical compound C[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@H](C(O)=O)CCCCN SAHQGRZIQVEJPF-JXUBOQSCSA-N 0.000 description 2
- KLKARCOHVHLAJP-UWJYBYFXSA-N Ala-Tyr-Cys Chemical compound C[C@H](N)C(=O)N[C@@H](Cc1ccc(O)cc1)C(=O)N[C@@H](CS)C(O)=O KLKARCOHVHLAJP-UWJYBYFXSA-N 0.000 description 2
- XAXMJQUMRJAFCH-CQDKDKBSSA-N Ala-Tyr-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=C(O)C=C1 XAXMJQUMRJAFCH-CQDKDKBSSA-N 0.000 description 2
- YEBZNKPPOHFZJM-BPNCWPANSA-N Ala-Tyr-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(O)=O YEBZNKPPOHFZJM-BPNCWPANSA-N 0.000 description 2
- XSLGWYYNOSUMRM-ZKWXMUAHSA-N Ala-Val-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O XSLGWYYNOSUMRM-ZKWXMUAHSA-N 0.000 description 2
- XCIGOVDXZULBBV-DCAQKATOSA-N Ala-Val-Lys Chemical compound CC(C)[C@H](NC(=O)[C@H](C)N)C(=O)N[C@@H](CCCCN)C(O)=O XCIGOVDXZULBBV-DCAQKATOSA-N 0.000 description 2
- XKHLBBQNPSOGPI-GUBZILKMSA-N Ala-Val-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](C)N XKHLBBQNPSOGPI-GUBZILKMSA-N 0.000 description 2
- DHONNEYAZPNGSG-UBHSHLNASA-N Ala-Val-Phe Chemical compound C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 DHONNEYAZPNGSG-UBHSHLNASA-N 0.000 description 2
- NLYYHIKRBRMAJV-AEJSXWLSSA-N Ala-Val-Pro Chemical compound C[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N NLYYHIKRBRMAJV-AEJSXWLSSA-N 0.000 description 2
- KWKQGHSSNHPGOW-BQBZGAKWSA-N Arg-Ala-Gly Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(=O)NCC(O)=O KWKQGHSSNHPGOW-BQBZGAKWSA-N 0.000 description 2
- GIVATXIGCXFQQA-FXQIFTODSA-N Arg-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCN=C(N)N GIVATXIGCXFQQA-FXQIFTODSA-N 0.000 description 2
- OTCJMMRQBVDQRK-DCAQKATOSA-N Arg-Asp-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O OTCJMMRQBVDQRK-DCAQKATOSA-N 0.000 description 2
- NXDXECQFKHXHAM-HJGDQZAQSA-N Arg-Glu-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O NXDXECQFKHXHAM-HJGDQZAQSA-N 0.000 description 2
- IYMAXBFPHPZYIK-BQBZGAKWSA-N Arg-Gly-Asp Chemical compound NC(N)=NCCC[C@H](N)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O IYMAXBFPHPZYIK-BQBZGAKWSA-N 0.000 description 2
- LKDHUGLXOHYINY-XUXIUFHCSA-N Arg-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N LKDHUGLXOHYINY-XUXIUFHCSA-N 0.000 description 2
- JEOCWTUOMKEEMF-RHYQMDGZSA-N Arg-Leu-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O JEOCWTUOMKEEMF-RHYQMDGZSA-N 0.000 description 2
- INXWADWANGLMPJ-JYJNAYRXSA-N Arg-Phe-Arg Chemical compound NC(=N)NCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CCCNC(N)=N)C(O)=O)CC1=CC=CC=C1 INXWADWANGLMPJ-JYJNAYRXSA-N 0.000 description 2
- UIUXXFIKWQVMEX-UFYCRDLUSA-N Arg-Phe-Tyr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O UIUXXFIKWQVMEX-UFYCRDLUSA-N 0.000 description 2
- KXOPYFNQLVUOAQ-FXQIFTODSA-N Arg-Ser-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O KXOPYFNQLVUOAQ-FXQIFTODSA-N 0.000 description 2
- ADPACBMPYWJJCE-FXQIFTODSA-N Arg-Ser-Asp Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O ADPACBMPYWJJCE-FXQIFTODSA-N 0.000 description 2
- URAUIUGLHBRPMF-NAKRPEOUSA-N Arg-Ser-Ile Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O URAUIUGLHBRPMF-NAKRPEOUSA-N 0.000 description 2
- KMFPQTITXUKJOV-DCAQKATOSA-N Arg-Ser-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O KMFPQTITXUKJOV-DCAQKATOSA-N 0.000 description 2
- OQPAZKMGCWPERI-GUBZILKMSA-N Arg-Ser-Val Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O OQPAZKMGCWPERI-GUBZILKMSA-N 0.000 description 2
- ASQKVGRCKOFKIU-KZVJFYERSA-N Arg-Thr-Ala Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N)O ASQKVGRCKOFKIU-KZVJFYERSA-N 0.000 description 2
- FTMRPIVPSDVGCC-GUBZILKMSA-N Arg-Val-Cys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N FTMRPIVPSDVGCC-GUBZILKMSA-N 0.000 description 2
- YNDLOUMBVDVALC-ZLUOBGJFSA-N Asn-Ala-Ala Chemical compound C[C@@H](C(=O)N[C@@H](C)C(=O)O)NC(=O)[C@H](CC(=O)N)N YNDLOUMBVDVALC-ZLUOBGJFSA-N 0.000 description 2
- IARGXWMWRFOQPG-GCJQMDKQSA-N Asn-Ala-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O IARGXWMWRFOQPG-GCJQMDKQSA-N 0.000 description 2
- XVVOVPFMILMHPX-ZLUOBGJFSA-N Asn-Asp-Asp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O XVVOVPFMILMHPX-ZLUOBGJFSA-N 0.000 description 2
- FUHFYEKSGWOWGZ-XHNCKOQMSA-N Asn-Gln-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC(=O)N)N)C(=O)O FUHFYEKSGWOWGZ-XHNCKOQMSA-N 0.000 description 2
- UDSVWSUXKYXSTR-QWRGUYRKSA-N Asn-Gly-Tyr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O UDSVWSUXKYXSTR-QWRGUYRKSA-N 0.000 description 2
- OOWSBIOUKIUWLO-RCOVLWMOSA-N Asn-Gly-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O OOWSBIOUKIUWLO-RCOVLWMOSA-N 0.000 description 2
- NKLRWRRVYGQNIH-GHCJXIJMSA-N Asn-Ile-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O NKLRWRRVYGQNIH-GHCJXIJMSA-N 0.000 description 2
- FVKHEKVYFTZWDX-GHCJXIJMSA-N Asn-Ile-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC(=O)N)N FVKHEKVYFTZWDX-GHCJXIJMSA-N 0.000 description 2
- NVWJMQNYLYWVNQ-BYULHYEWSA-N Asn-Ile-Gly Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O NVWJMQNYLYWVNQ-BYULHYEWSA-N 0.000 description 2
- YYSYDIYQTUPNQQ-SXTJYALSSA-N Asn-Ile-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O YYSYDIYQTUPNQQ-SXTJYALSSA-N 0.000 description 2
- XLZCLJRGGMBKLR-PCBIJLKTSA-N Asn-Ile-Phe Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 XLZCLJRGGMBKLR-PCBIJLKTSA-N 0.000 description 2
- IBLAOXSULLECQZ-IUKAMOBKSA-N Asn-Ile-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CC(N)=O IBLAOXSULLECQZ-IUKAMOBKSA-N 0.000 description 2
- MYCSPQIARXTUTP-SRVKXCTJSA-N Asn-Leu-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC(=O)N)N MYCSPQIARXTUTP-SRVKXCTJSA-N 0.000 description 2
- RCFGLXMZDYNRSC-CIUDSAMLSA-N Asn-Lys-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O RCFGLXMZDYNRSC-CIUDSAMLSA-N 0.000 description 2
- RZNAMKZJPBQWDJ-SRVKXCTJSA-N Asn-Lys-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC(=O)N)N RZNAMKZJPBQWDJ-SRVKXCTJSA-N 0.000 description 2
- NNDSLVWAQAUPPP-GUBZILKMSA-N Asn-Met-Met Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CC(=O)N)N NNDSLVWAQAUPPP-GUBZILKMSA-N 0.000 description 2
- VITDJIPIJZAVGC-VEVYYDQMSA-N Asn-Met-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O VITDJIPIJZAVGC-VEVYYDQMSA-N 0.000 description 2
- RTFWCVDISAMGEQ-SRVKXCTJSA-N Asn-Phe-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N RTFWCVDISAMGEQ-SRVKXCTJSA-N 0.000 description 2
- RVHGJNGNKGDCPX-KKUMJFAQSA-N Asn-Phe-Lys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)N)N RVHGJNGNKGDCPX-KKUMJFAQSA-N 0.000 description 2
- IDUUACUJKUXKKD-VEVYYDQMSA-N Asn-Pro-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O IDUUACUJKUXKKD-VEVYYDQMSA-N 0.000 description 2
- GZXOUBTUAUAVHD-ACZMJKKPSA-N Asn-Ser-Glu Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O GZXOUBTUAUAVHD-ACZMJKKPSA-N 0.000 description 2
- VLDRQOHCMKCXLY-SRVKXCTJSA-N Asn-Ser-Phe Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O VLDRQOHCMKCXLY-SRVKXCTJSA-N 0.000 description 2
- HNXWVVHIGTZTBO-LKXGYXEUSA-N Asn-Ser-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC(N)=O HNXWVVHIGTZTBO-LKXGYXEUSA-N 0.000 description 2
- NCXTYSVDWLAQGZ-ZKWXMUAHSA-N Asn-Ser-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC(N)=O NCXTYSVDWLAQGZ-ZKWXMUAHSA-N 0.000 description 2
- QTKYFZCMSQLYHI-UBHSHLNASA-N Asn-Trp-Asn Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC(N)=O)C(O)=O QTKYFZCMSQLYHI-UBHSHLNASA-N 0.000 description 2
- KSZHWTRZPOTIGY-AVGNSLFASA-N Asn-Tyr-Gln Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N)O KSZHWTRZPOTIGY-AVGNSLFASA-N 0.000 description 2
- DATSKXOXPUAOLK-KKUMJFAQSA-N Asn-Tyr-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(O)=O DATSKXOXPUAOLK-KKUMJFAQSA-N 0.000 description 2
- NJPLPRFQLBZAMH-IHRRRGAJSA-N Asn-Tyr-Met Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCSC)C(O)=O NJPLPRFQLBZAMH-IHRRRGAJSA-N 0.000 description 2
- LRCIOEVFVGXZKB-BZSNNMDCSA-N Asn-Tyr-Tyr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O LRCIOEVFVGXZKB-BZSNNMDCSA-N 0.000 description 2
- BLQBMRNMBAYREH-UWJYBYFXSA-N Asp-Ala-Tyr Chemical compound N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O BLQBMRNMBAYREH-UWJYBYFXSA-N 0.000 description 2
- UQBGYPFHWFZMCD-ZLUOBGJFSA-N Asp-Asn-Asn Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O UQBGYPFHWFZMCD-ZLUOBGJFSA-N 0.000 description 2
- ATYWBXGNXZYZGI-ACZMJKKPSA-N Asp-Asn-Gln Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O ATYWBXGNXZYZGI-ACZMJKKPSA-N 0.000 description 2
- UGKZHCBLMLSANF-CIUDSAMLSA-N Asp-Asn-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O UGKZHCBLMLSANF-CIUDSAMLSA-N 0.000 description 2
- UWOPETAWXDZUJR-ACZMJKKPSA-N Asp-Cys-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCC(O)=O)C(O)=O UWOPETAWXDZUJR-ACZMJKKPSA-N 0.000 description 2
- PJERDVUTUDZPGX-ZKWXMUAHSA-N Asp-Cys-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CS)NC(=O)[C@@H](N)CC(O)=O PJERDVUTUDZPGX-ZKWXMUAHSA-N 0.000 description 2
- DGKCOYGQLNWNCJ-ACZMJKKPSA-N Asp-Glu-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O DGKCOYGQLNWNCJ-ACZMJKKPSA-N 0.000 description 2
- HAFCJCDJGIOYPW-WDSKDSINSA-N Asp-Gly-Gln Chemical compound OC(=O)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(N)=O HAFCJCDJGIOYPW-WDSKDSINSA-N 0.000 description 2
- OMMIEVATLAGRCK-BYPYZUCNSA-N Asp-Gly-Gly Chemical compound OC(=O)C[C@H](N)C(=O)NCC(=O)NCC(O)=O OMMIEVATLAGRCK-BYPYZUCNSA-N 0.000 description 2
- SNDBKTFJWVEVPO-WHFBIAKZSA-N Asp-Gly-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H](CO)C(O)=O SNDBKTFJWVEVPO-WHFBIAKZSA-N 0.000 description 2
- NRIFEOUAFLTMFJ-AAEUAGOBSA-N Asp-Gly-Trp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O NRIFEOUAFLTMFJ-AAEUAGOBSA-N 0.000 description 2
- CMCIMCAQIULNDJ-CIUDSAMLSA-N Asp-His-Cys Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC(=O)O)N CMCIMCAQIULNDJ-CIUDSAMLSA-N 0.000 description 2
- CYCKJEFVFNRWEZ-UGYAYLCHSA-N Asp-Ile-Asn Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O CYCKJEFVFNRWEZ-UGYAYLCHSA-N 0.000 description 2
- QNFRBNZGVVKBNJ-PEFMBERDSA-N Asp-Ile-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)O)N QNFRBNZGVVKBNJ-PEFMBERDSA-N 0.000 description 2
- NHSDEZURHWEZPN-SXTJYALSSA-N Asp-Ile-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)NC(=O)[C@H](CC(=O)O)N NHSDEZURHWEZPN-SXTJYALSSA-N 0.000 description 2
- KYQNAIMCTRZLNP-QSFUFRPTSA-N Asp-Ile-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O KYQNAIMCTRZLNP-QSFUFRPTSA-N 0.000 description 2
- JNNVNVRBYUJYGS-CIUDSAMLSA-N Asp-Leu-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O JNNVNVRBYUJYGS-CIUDSAMLSA-N 0.000 description 2
- UJGRZQYSNYTCAX-SRVKXCTJSA-N Asp-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC(O)=O UJGRZQYSNYTCAX-SRVKXCTJSA-N 0.000 description 2
- IVPNEDNYYYFAGI-GARJFASQSA-N Asp-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)O)N IVPNEDNYYYFAGI-GARJFASQSA-N 0.000 description 2
- ORRJQLIATJDMQM-HJGDQZAQSA-N Asp-Leu-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC(O)=O ORRJQLIATJDMQM-HJGDQZAQSA-N 0.000 description 2
- MYOHQBFRJQFIDZ-KKUMJFAQSA-N Asp-Leu-Tyr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O MYOHQBFRJQFIDZ-KKUMJFAQSA-N 0.000 description 2
- LIVXPXUVXFRWNY-CIUDSAMLSA-N Asp-Lys-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O LIVXPXUVXFRWNY-CIUDSAMLSA-N 0.000 description 2
- NVFSJIXJZCDICF-SRVKXCTJSA-N Asp-Lys-Lys Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)O)N NVFSJIXJZCDICF-SRVKXCTJSA-N 0.000 description 2
- WOPJVEMFXYHZEE-SRVKXCTJSA-N Asp-Phe-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O WOPJVEMFXYHZEE-SRVKXCTJSA-N 0.000 description 2
- LIQNMKIBMPEOOP-IHRRRGAJSA-N Asp-Phe-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](CC(=O)O)N LIQNMKIBMPEOOP-IHRRRGAJSA-N 0.000 description 2
- RPUYTJJZXQBWDT-SRVKXCTJSA-N Asp-Phe-Ser Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC(=O)O)N RPUYTJJZXQBWDT-SRVKXCTJSA-N 0.000 description 2
- GPPIDDWYKJPRES-YDHLFZDLSA-N Asp-Phe-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(O)=O GPPIDDWYKJPRES-YDHLFZDLSA-N 0.000 description 2
- FAUPLTGRUBTXNU-FXQIFTODSA-N Asp-Pro-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O FAUPLTGRUBTXNU-FXQIFTODSA-N 0.000 description 2
- ZVGRHIRJLWBWGJ-ACZMJKKPSA-N Asp-Ser-Gln Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(O)=O ZVGRHIRJLWBWGJ-ACZMJKKPSA-N 0.000 description 2
- YIDFBWRHIYOYAA-LKXGYXEUSA-N Asp-Ser-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O YIDFBWRHIYOYAA-LKXGYXEUSA-N 0.000 description 2
- JSHWXQIZOCVWIA-ZKWXMUAHSA-N Asp-Ser-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O JSHWXQIZOCVWIA-ZKWXMUAHSA-N 0.000 description 2
- UTLCRGFJFSZWAW-OLHMAJIHSA-N Asp-Thr-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)O)N)O UTLCRGFJFSZWAW-OLHMAJIHSA-N 0.000 description 2
- JSNWZMFSLIWAHS-HJGDQZAQSA-N Asp-Thr-Leu Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)O)NC(=O)[C@H](CC(=O)O)N)O JSNWZMFSLIWAHS-HJGDQZAQSA-N 0.000 description 2
- OYSYWMMZGJSQRB-AVGNSLFASA-N Asp-Tyr-Gln Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O OYSYWMMZGJSQRB-AVGNSLFASA-N 0.000 description 2
- OTKUAVXGMREHRX-CFMVVWHZSA-N Asp-Tyr-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CC(O)=O)CC1=CC=C(O)C=C1 OTKUAVXGMREHRX-CFMVVWHZSA-N 0.000 description 2
- NWAHPBGBDIFUFD-KKUMJFAQSA-N Asp-Tyr-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(O)=O NWAHPBGBDIFUFD-KKUMJFAQSA-N 0.000 description 2
- ALMIMUZAWTUNIO-BZSNNMDCSA-N Asp-Tyr-Tyr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O ALMIMUZAWTUNIO-BZSNNMDCSA-N 0.000 description 2
- BYLPQJAWXJWUCJ-YDHLFZDLSA-N Asp-Tyr-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(O)=O BYLPQJAWXJWUCJ-YDHLFZDLSA-N 0.000 description 2
- GIKOVDMXBAFXDF-NHCYSSNCSA-N Asp-Val-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O GIKOVDMXBAFXDF-NHCYSSNCSA-N 0.000 description 2
- 108091003079 Bovine Serum Albumin Proteins 0.000 description 2
- QFMCHXSGIZPBKG-ZLUOBGJFSA-N Cys-Ala-Asp Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CS)N QFMCHXSGIZPBKG-ZLUOBGJFSA-N 0.000 description 2
- KKZHXOOZHFABQQ-UWJYBYFXSA-N Cys-Ala-Tyr Chemical compound SC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 KKZHXOOZHFABQQ-UWJYBYFXSA-N 0.000 description 2
- CLDCTNHPILWQCW-CIUDSAMLSA-N Cys-Arg-Glu Chemical compound C(C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CS)N)CN=C(N)N CLDCTNHPILWQCW-CIUDSAMLSA-N 0.000 description 2
- YRJICXCOIBUCRP-CIUDSAMLSA-N Cys-Asn-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CS)N YRJICXCOIBUCRP-CIUDSAMLSA-N 0.000 description 2
- XABFFGOGKOORCG-CIUDSAMLSA-N Cys-Asp-Leu Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O XABFFGOGKOORCG-CIUDSAMLSA-N 0.000 description 2
- ASHTVGGFIMESRD-LKXGYXEUSA-N Cys-Asp-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CS)N)O ASHTVGGFIMESRD-LKXGYXEUSA-N 0.000 description 2
- FIADUEYFRSCCIK-CIUDSAMLSA-N Cys-Glu-Arg Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O FIADUEYFRSCCIK-CIUDSAMLSA-N 0.000 description 2
- CFQVGYWKSLKWFX-KBIXCLLPSA-N Cys-Glu-Ile Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O CFQVGYWKSLKWFX-KBIXCLLPSA-N 0.000 description 2
- BSFFNUBDVYTDMV-WHFBIAKZSA-N Cys-Gly-Asn Chemical compound [H]N[C@@H](CS)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O BSFFNUBDVYTDMV-WHFBIAKZSA-N 0.000 description 2
- DZSICRGTVPDCRN-YUMQZZPRSA-N Cys-Gly-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CS)N DZSICRGTVPDCRN-YUMQZZPRSA-N 0.000 description 2
- DIHCYBRLTVEPBW-SRVKXCTJSA-N Cys-Leu-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CS)N DIHCYBRLTVEPBW-SRVKXCTJSA-N 0.000 description 2
- HJXSYJVCMUOUNY-SRVKXCTJSA-N Cys-Ser-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CS)N HJXSYJVCMUOUNY-SRVKXCTJSA-N 0.000 description 2
- JAHCWGSVNZXHRR-SVSWQMSJSA-N Cys-Thr-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H]([C@@H](C)O)NC(=O)[C@H](CS)N JAHCWGSVNZXHRR-SVSWQMSJSA-N 0.000 description 2
- JTEGHEWKBCTIAL-IXOXFDKPSA-N Cys-Thr-Phe Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CS)N)O JTEGHEWKBCTIAL-IXOXFDKPSA-N 0.000 description 2
- NAPULYCVEVVFRB-HEIBUPTGSA-N Cys-Thr-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)O)NC(=O)[C@@H](N)CS NAPULYCVEVVFRB-HEIBUPTGSA-N 0.000 description 2
- LHRCZIRWNFRIRG-SRVKXCTJSA-N Cys-Tyr-Asp Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CS)N)O LHRCZIRWNFRIRG-SRVKXCTJSA-N 0.000 description 2
- UGPCUUWZXRMCIJ-KKUMJFAQSA-N Cys-Tyr-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CS)N UGPCUUWZXRMCIJ-KKUMJFAQSA-N 0.000 description 2
- 108090000695 Cytokines Proteins 0.000 description 2
- 102000004127 Cytokines Human genes 0.000 description 2
- HHWQMFIGMMOVFK-WDSKDSINSA-N Gln-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(N)=O HHWQMFIGMMOVFK-WDSKDSINSA-N 0.000 description 2
- XEYMBRRKIFYQMF-GUBZILKMSA-N Gln-Asp-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O XEYMBRRKIFYQMF-GUBZILKMSA-N 0.000 description 2
- PKVWNYGXMNWJSI-CIUDSAMLSA-N Gln-Gln-Gln Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O PKVWNYGXMNWJSI-CIUDSAMLSA-N 0.000 description 2
- XFKUFUJECJUQTQ-CIUDSAMLSA-N Gln-Gln-Glu Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O XFKUFUJECJUQTQ-CIUDSAMLSA-N 0.000 description 2
- AJDMYLOISOCHHC-YVNDNENWSA-N Gln-Gln-Ile Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O AJDMYLOISOCHHC-YVNDNENWSA-N 0.000 description 2
- KCJJFESQRXGTGC-BQBZGAKWSA-N Gln-Glu-Gly Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O KCJJFESQRXGTGC-BQBZGAKWSA-N 0.000 description 2
- LFIVHGMKWFGUGK-IHRRRGAJSA-N Gln-Glu-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCC(=O)N)N LFIVHGMKWFGUGK-IHRRRGAJSA-N 0.000 description 2
- GLEGHWQNGPMKHO-DCAQKATOSA-N Gln-His-Glu Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CCC(=O)N)N GLEGHWQNGPMKHO-DCAQKATOSA-N 0.000 description 2
- ZNTDJIMJKNNSLR-RWRJDSDZSA-N Gln-Ile-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](CCC(=O)N)N ZNTDJIMJKNNSLR-RWRJDSDZSA-N 0.000 description 2
- FFVXLVGUJBCKRX-UKJIMTQDSA-N Gln-Ile-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)O)NC(=O)[C@H](CCC(=O)N)N FFVXLVGUJBCKRX-UKJIMTQDSA-N 0.000 description 2
- QBLMTCRYYTVUQY-GUBZILKMSA-N Gln-Leu-Asp Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O QBLMTCRYYTVUQY-GUBZILKMSA-N 0.000 description 2
- PSERKXGRRADTKA-MNXVOIDGSA-N Gln-Leu-Ile Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O PSERKXGRRADTKA-MNXVOIDGSA-N 0.000 description 2
- TWIAMTNJOMRDAK-GUBZILKMSA-N Gln-Lys-Asp Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O TWIAMTNJOMRDAK-GUBZILKMSA-N 0.000 description 2
- HPCOBEHVEHWREJ-DCAQKATOSA-N Gln-Lys-Glu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O HPCOBEHVEHWREJ-DCAQKATOSA-N 0.000 description 2
- LURQDGKYBFWWJA-MNXVOIDGSA-N Gln-Lys-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CCC(=O)N)N LURQDGKYBFWWJA-MNXVOIDGSA-N 0.000 description 2
- XGKNQFOKIBKFTR-CIUDSAMLSA-N Gln-Met-Cys Chemical compound SC[C@@H](C(O)=O)NC(=O)[C@H](CCSC)NC(=O)[C@@H](N)CCC(N)=O XGKNQFOKIBKFTR-CIUDSAMLSA-N 0.000 description 2
- XUMFMAVDHQDATI-DCAQKATOSA-N Gln-Pro-Arg Chemical compound NC(=O)CC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCN=C(N)N)C(O)=O XUMFMAVDHQDATI-DCAQKATOSA-N 0.000 description 2
- PBYFVIQRFLNQCO-GUBZILKMSA-N Gln-Pro-Gln Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O PBYFVIQRFLNQCO-GUBZILKMSA-N 0.000 description 2
- YPFFHGRJCUBXPX-NHCYSSNCSA-N Gln-Pro-Val Chemical compound CC(C)[C@H](NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CCC(N)=O)C(O)=O YPFFHGRJCUBXPX-NHCYSSNCSA-N 0.000 description 2
- UTOQQOMEJDPDMX-ACZMJKKPSA-N Gln-Ser-Asp Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O UTOQQOMEJDPDMX-ACZMJKKPSA-N 0.000 description 2
- UEILCTONAMOGBR-RWRJDSDZSA-N Gln-Thr-Ile Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O UEILCTONAMOGBR-RWRJDSDZSA-N 0.000 description 2
- RONJIBWTGKVKFY-HTUGSXCWSA-N Gln-Thr-Phe Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CCC(=O)N)N)O RONJIBWTGKVKFY-HTUGSXCWSA-N 0.000 description 2
- ARYKRXHBIPLULY-XKBZYTNZSA-N Gln-Thr-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O ARYKRXHBIPLULY-XKBZYTNZSA-N 0.000 description 2
- HLRLXVPRJJITSK-IFFSRLJSSA-N Gln-Thr-Val Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O HLRLXVPRJJITSK-IFFSRLJSSA-N 0.000 description 2
- SAHTWBLTLJWAQA-XIRDDKMYSA-N Gln-Trp-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@H](CCC(=O)N)N SAHTWBLTLJWAQA-XIRDDKMYSA-N 0.000 description 2
- RUFHOVYUYSNDNY-ACZMJKKPSA-N Glu-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(O)=O RUFHOVYUYSNDNY-ACZMJKKPSA-N 0.000 description 2
- UTKUTMJSWKKHEM-WDSKDSINSA-N Glu-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(O)=O UTKUTMJSWKKHEM-WDSKDSINSA-N 0.000 description 2
- IRDASPPCLZIERZ-XHNCKOQMSA-N Glu-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)O)N IRDASPPCLZIERZ-XHNCKOQMSA-N 0.000 description 2
- MXOODARRORARSU-ACZMJKKPSA-N Glu-Ala-Ser Chemical compound C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CCC(=O)O)N MXOODARRORARSU-ACZMJKKPSA-N 0.000 description 2
- NCWOMXABNYEPLY-NRPADANISA-N Glu-Ala-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O NCWOMXABNYEPLY-NRPADANISA-N 0.000 description 2
- PCBBLFVHTYNQGG-LAEOZQHASA-N Glu-Asn-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCC(=O)O)N PCBBLFVHTYNQGG-LAEOZQHASA-N 0.000 description 2
- XXCDTYBVGMPIOA-FXQIFTODSA-N Glu-Asp-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O XXCDTYBVGMPIOA-FXQIFTODSA-N 0.000 description 2
- JVSBYEDSSRZQGV-GUBZILKMSA-N Glu-Asp-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CCC(O)=O JVSBYEDSSRZQGV-GUBZILKMSA-N 0.000 description 2
- PBFGQTGPSKWHJA-QEJZJMRPSA-N Glu-Asp-Trp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O PBFGQTGPSKWHJA-QEJZJMRPSA-N 0.000 description 2
- KLJMRPIBBLTDGE-ACZMJKKPSA-N Glu-Cys-Asn Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(N)=O)C(O)=O KLJMRPIBBLTDGE-ACZMJKKPSA-N 0.000 description 2
- XKPOCESCRTVRPL-KBIXCLLPSA-N Glu-Cys-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CS)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O XKPOCESCRTVRPL-KBIXCLLPSA-N 0.000 description 2
- RQNYYRHRKSVKAB-GUBZILKMSA-N Glu-Cys-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(O)=O RQNYYRHRKSVKAB-GUBZILKMSA-N 0.000 description 2
- ZXQPJYWZSFGWJB-AVGNSLFASA-N Glu-Cys-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CCC(=O)O)N ZXQPJYWZSFGWJB-AVGNSLFASA-N 0.000 description 2
- OWVURWCRZZMAOZ-XHNCKOQMSA-N Glu-Cys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CS)NC(=O)[C@H](CCC(=O)O)N)C(=O)O OWVURWCRZZMAOZ-XHNCKOQMSA-N 0.000 description 2
- GFLQTABMFBXRIY-GUBZILKMSA-N Glu-Gln-Arg Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O GFLQTABMFBXRIY-GUBZILKMSA-N 0.000 description 2
- LVCHEMOPBORRLB-DCAQKATOSA-N Glu-Gln-Lys Chemical compound NCCCC[C@H](NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CCC(O)=O)C(O)=O LVCHEMOPBORRLB-DCAQKATOSA-N 0.000 description 2
- RFDHKPSHTXZKLL-IHRRRGAJSA-N Glu-Gln-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCC(=O)O)N RFDHKPSHTXZKLL-IHRRRGAJSA-N 0.000 description 2
- WLIPTFCZLHCNFD-LPEHRKFASA-N Glu-Gln-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCC(=O)O)N)C(=O)O WLIPTFCZLHCNFD-LPEHRKFASA-N 0.000 description 2
- HUFCEIHAFNVSNR-IHRRRGAJSA-N Glu-Gln-Tyr Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 HUFCEIHAFNVSNR-IHRRRGAJSA-N 0.000 description 2
- HNVFSTLPVJWIDV-CIUDSAMLSA-N Glu-Glu-Gln Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O HNVFSTLPVJWIDV-CIUDSAMLSA-N 0.000 description 2
- BUZMZDDKFCSKOT-CIUDSAMLSA-N Glu-Glu-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O BUZMZDDKFCSKOT-CIUDSAMLSA-N 0.000 description 2
- SJPMNHCEWPTRBR-BQBZGAKWSA-N Glu-Glu-Gly Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O SJPMNHCEWPTRBR-BQBZGAKWSA-N 0.000 description 2
- APHGWLWMOXGZRL-DCAQKATOSA-N Glu-Glu-His Chemical compound N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](Cc1cnc[nH]1)C(O)=O APHGWLWMOXGZRL-DCAQKATOSA-N 0.000 description 2
- AUTNXSQEVVHSJK-YVNDNENWSA-N Glu-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCC(O)=O AUTNXSQEVVHSJK-YVNDNENWSA-N 0.000 description 2
- LYCDZGLXQBPNQU-WDSKDSINSA-N Glu-Gly-Cys Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@@H](CS)C(O)=O LYCDZGLXQBPNQU-WDSKDSINSA-N 0.000 description 2
- OGNJZUXUTPQVBR-BQBZGAKWSA-N Glu-Gly-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O OGNJZUXUTPQVBR-BQBZGAKWSA-N 0.000 description 2
- CUXJIASLBRJOFV-LAEOZQHASA-N Glu-Gly-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H]([C@@H](C)CC)C(O)=O CUXJIASLBRJOFV-LAEOZQHASA-N 0.000 description 2
- ZWQVYZXPYSYPJD-RYUDHWBXSA-N Glu-Gly-Phe Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 ZWQVYZXPYSYPJD-RYUDHWBXSA-N 0.000 description 2
- WVTIBGWZUMJBFY-GUBZILKMSA-N Glu-His-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(O)=O WVTIBGWZUMJBFY-GUBZILKMSA-N 0.000 description 2
- ZWABFSSWTSAMQN-KBIXCLLPSA-N Glu-Ile-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O ZWABFSSWTSAMQN-KBIXCLLPSA-N 0.000 description 2
- ZCOJVESMNGBGLF-GRLWGSQLSA-N Glu-Ile-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O ZCOJVESMNGBGLF-GRLWGSQLSA-N 0.000 description 2
- GXMXPCXXKVWOSM-KQXIARHKSA-N Glu-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)O)N GXMXPCXXKVWOSM-KQXIARHKSA-N 0.000 description 2
- ZHNHJYYFCGUZNQ-KBIXCLLPSA-N Glu-Ile-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCC(O)=O ZHNHJYYFCGUZNQ-KBIXCLLPSA-N 0.000 description 2
- INGJLBQKTRJLFO-UKJIMTQDSA-N Glu-Ile-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCC(O)=O INGJLBQKTRJLFO-UKJIMTQDSA-N 0.000 description 2
- HVYWQYLBVXMXSV-GUBZILKMSA-N Glu-Leu-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O HVYWQYLBVXMXSV-GUBZILKMSA-N 0.000 description 2
- DNPCBMNFQVTHMA-DCAQKATOSA-N Glu-Leu-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O DNPCBMNFQVTHMA-DCAQKATOSA-N 0.000 description 2
- ATVYZJGOZLVXDK-IUCAKERBSA-N Glu-Leu-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O ATVYZJGOZLVXDK-IUCAKERBSA-N 0.000 description 2
- MWMJCGBSIORNCD-AVGNSLFASA-N Glu-Leu-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O MWMJCGBSIORNCD-AVGNSLFASA-N 0.000 description 2
- GJBUAAAIZSRCDC-GVXVVHGQSA-N Glu-Leu-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O GJBUAAAIZSRCDC-GVXVVHGQSA-N 0.000 description 2
- OHWJUIXZHVIXJJ-GUBZILKMSA-N Glu-Lys-Cys Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCC(=O)O)N OHWJUIXZHVIXJJ-GUBZILKMSA-N 0.000 description 2
- HRBYTAIBKPNZKQ-AVGNSLFASA-N Glu-Lys-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCC(O)=O HRBYTAIBKPNZKQ-AVGNSLFASA-N 0.000 description 2
- QDMVXRNLOPTPIE-WDCWCFNPSA-N Glu-Lys-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O QDMVXRNLOPTPIE-WDCWCFNPSA-N 0.000 description 2
- AQNYKMCFCCZEEL-JYJNAYRXSA-N Glu-Lys-Tyr Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 AQNYKMCFCCZEEL-JYJNAYRXSA-N 0.000 description 2
- ZTVGZOIBLRPQNR-KKUMJFAQSA-N Glu-Met-Tyr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O ZTVGZOIBLRPQNR-KKUMJFAQSA-N 0.000 description 2
- YRMZCZIRHYCNHX-RYUDHWBXSA-N Glu-Phe-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)NCC(O)=O YRMZCZIRHYCNHX-RYUDHWBXSA-N 0.000 description 2
- ZIYGTCDTJJCDDP-JYJNAYRXSA-N Glu-Phe-Lys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)O)N ZIYGTCDTJJCDDP-JYJNAYRXSA-N 0.000 description 2
- QOXDAWODGSIDDI-GUBZILKMSA-N Glu-Ser-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCC(=O)O)N QOXDAWODGSIDDI-GUBZILKMSA-N 0.000 description 2
- VNCNWQPIQYAMAK-ACZMJKKPSA-N Glu-Ser-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O VNCNWQPIQYAMAK-ACZMJKKPSA-N 0.000 description 2
- DMYACXMQUABZIQ-NRPADANISA-N Glu-Ser-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O DMYACXMQUABZIQ-NRPADANISA-N 0.000 description 2
- RGJKYNUINKGPJN-RWRJDSDZSA-N Glu-Thr-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H]([C@@H](C)O)NC(=O)[C@H](CCC(=O)O)N RGJKYNUINKGPJN-RWRJDSDZSA-N 0.000 description 2
- MXJYXYDREQWUMS-XKBZYTNZSA-N Glu-Thr-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O MXJYXYDREQWUMS-XKBZYTNZSA-N 0.000 description 2
- HAGKYCXGTRUUFI-RYUDHWBXSA-N Glu-Tyr-Gly Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CCC(=O)O)N)O HAGKYCXGTRUUFI-RYUDHWBXSA-N 0.000 description 2
- VXEFAWJTFAUDJK-AVGNSLFASA-N Glu-Tyr-Ser Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CCC(=O)O)N)O VXEFAWJTFAUDJK-AVGNSLFASA-N 0.000 description 2
- LSYFGBRDBIQYAQ-FHWLQOOXSA-N Glu-Tyr-Tyr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O LSYFGBRDBIQYAQ-FHWLQOOXSA-N 0.000 description 2
- LZEUDRYSAZAJIO-AUTRQRHGSA-N Glu-Val-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O LZEUDRYSAZAJIO-AUTRQRHGSA-N 0.000 description 2
- FGGKGJHCVMYGCD-UKJIMTQDSA-N Glu-Val-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O FGGKGJHCVMYGCD-UKJIMTQDSA-N 0.000 description 2
- ZYRXTRTUCAVNBQ-GVXVVHGQSA-N Glu-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)O)N ZYRXTRTUCAVNBQ-GVXVVHGQSA-N 0.000 description 2
- RLFSBAPJTYKSLG-WHFBIAKZSA-N Gly-Ala-Asp Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(O)=O RLFSBAPJTYKSLG-WHFBIAKZSA-N 0.000 description 2
- QSDKBRMVXSWAQE-BFHQHQDPSA-N Gly-Ala-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)CN QSDKBRMVXSWAQE-BFHQHQDPSA-N 0.000 description 2
- LERGJIVJIIODPZ-ZANVPECISA-N Gly-Ala-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](NC(=O)CN)C)C(O)=O)=CNC2=C1 LERGJIVJIIODPZ-ZANVPECISA-N 0.000 description 2
- QIZJOTQTCAGKPU-KWQFWETISA-N Gly-Ala-Tyr Chemical compound [NH3+]CC(=O)N[C@@H](C)C(=O)N[C@H](C([O-])=O)CC1=CC=C(O)C=C1 QIZJOTQTCAGKPU-KWQFWETISA-N 0.000 description 2
- OCQUNKSFDYDXBG-QXEWZRGKSA-N Gly-Arg-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCN=C(N)N OCQUNKSFDYDXBG-QXEWZRGKSA-N 0.000 description 2
- OVSKVOOUFAKODB-UWVGGRQHSA-N Gly-Arg-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCN=C(N)N OVSKVOOUFAKODB-UWVGGRQHSA-N 0.000 description 2
- DTPOVRRYXPJJAZ-FJXKBIBVSA-N Gly-Arg-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCN=C(N)N DTPOVRRYXPJJAZ-FJXKBIBVSA-N 0.000 description 2
- XZRZILPOZBVTDB-GJZGRUSLSA-N Gly-Arg-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CCCNC(N)=N)NC(=O)CN)C(O)=O)=CNC2=C1 XZRZILPOZBVTDB-GJZGRUSLSA-N 0.000 description 2
- DJTXYXZNNDDEOU-WHFBIAKZSA-N Gly-Asn-Cys Chemical compound C([C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)CN)C(=O)N DJTXYXZNNDDEOU-WHFBIAKZSA-N 0.000 description 2
- JVWPPCWUDRJGAE-YUMQZZPRSA-N Gly-Asn-Leu Chemical compound [H]NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O JVWPPCWUDRJGAE-YUMQZZPRSA-N 0.000 description 2
- FUTAPPOITCCWTH-WHFBIAKZSA-N Gly-Asp-Asp Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O FUTAPPOITCCWTH-WHFBIAKZSA-N 0.000 description 2
- QSTLUOIOYLYLLF-WDSKDSINSA-N Gly-Asp-Glu Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O QSTLUOIOYLYLLF-WDSKDSINSA-N 0.000 description 2
- LCNXZQROPKFGQK-WHFBIAKZSA-N Gly-Asp-Ser Chemical compound NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O LCNXZQROPKFGQK-WHFBIAKZSA-N 0.000 description 2
- QGZSAHIZRQHCEQ-QWRGUYRKSA-N Gly-Asp-Tyr Chemical compound NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 QGZSAHIZRQHCEQ-QWRGUYRKSA-N 0.000 description 2
- TZOVVRJYUDETQG-RCOVLWMOSA-N Gly-Asp-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)CN TZOVVRJYUDETQG-RCOVLWMOSA-N 0.000 description 2
- AQLHORCVPGXDJW-IUCAKERBSA-N Gly-Gln-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)CN AQLHORCVPGXDJW-IUCAKERBSA-N 0.000 description 2
- MBOAPAXLTUSMQI-JHEQGTHGSA-N Gly-Glu-Thr Chemical compound [H]NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MBOAPAXLTUSMQI-JHEQGTHGSA-N 0.000 description 2
- CCQOOWAONKGYKQ-BYPYZUCNSA-N Gly-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)CN CCQOOWAONKGYKQ-BYPYZUCNSA-N 0.000 description 2
- PDAWDNVHMUKWJR-ZETCQYMHSA-N Gly-Gly-His Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CC1=CNC=N1 PDAWDNVHMUKWJR-ZETCQYMHSA-N 0.000 description 2
- XMPXVJIDADUOQB-RCOVLWMOSA-N Gly-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C([O-])=O)NC(=O)CNC(=O)C[NH3+] XMPXVJIDADUOQB-RCOVLWMOSA-N 0.000 description 2
- YWAQATDNEKZFFK-BYPYZUCNSA-N Gly-Gly-Ser Chemical compound NCC(=O)NCC(=O)N[C@@H](CO)C(O)=O YWAQATDNEKZFFK-BYPYZUCNSA-N 0.000 description 2
- HPAIKDPJURGQLN-KBPBESRZSA-N Gly-His-Phe Chemical compound C([C@H](NC(=O)CN)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CNC=N1 HPAIKDPJURGQLN-KBPBESRZSA-N 0.000 description 2
- YNIMVVJTPWCUJH-KBPBESRZSA-N Gly-His-Tyr Chemical compound [H]NCC(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O YNIMVVJTPWCUJH-KBPBESRZSA-N 0.000 description 2
- SWQALSGKVLYKDT-ZKWXMUAHSA-N Gly-Ile-Ala Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O SWQALSGKVLYKDT-ZKWXMUAHSA-N 0.000 description 2
- HKSNHPVETYYJBK-LAEOZQHASA-N Gly-Ile-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)CN HKSNHPVETYYJBK-LAEOZQHASA-N 0.000 description 2
- IUZGUFAJDBHQQV-YUMQZZPRSA-N Gly-Leu-Asn Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O IUZGUFAJDBHQQV-YUMQZZPRSA-N 0.000 description 2
- YTSVAIMKVLZUDU-YUMQZZPRSA-N Gly-Leu-Asp Chemical compound [H]NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O YTSVAIMKVLZUDU-YUMQZZPRSA-N 0.000 description 2
- AFWYPMDMDYCKMD-KBPBESRZSA-N Gly-Leu-Tyr Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 AFWYPMDMDYCKMD-KBPBESRZSA-N 0.000 description 2
- MIIVFRCYJABHTQ-ONGXEEELSA-N Gly-Leu-Val Chemical compound [H]NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O MIIVFRCYJABHTQ-ONGXEEELSA-N 0.000 description 2
- WDEHMRNSGHVNOH-VHSXEESVSA-N Gly-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)CN)C(=O)O WDEHMRNSGHVNOH-VHSXEESVSA-N 0.000 description 2
- NTBOEZICHOSJEE-YUMQZZPRSA-N Gly-Lys-Ser Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O NTBOEZICHOSJEE-YUMQZZPRSA-N 0.000 description 2
- GAFKBWKVXNERFA-QWRGUYRKSA-N Gly-Phe-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CC=CC=C1 GAFKBWKVXNERFA-QWRGUYRKSA-N 0.000 description 2
- QVDGHDFFYHKJPN-QWRGUYRKSA-N Gly-Phe-Cys Chemical compound NCC(=O)N[C@@H](Cc1ccccc1)C(=O)N[C@@H](CS)C(O)=O QVDGHDFFYHKJPN-QWRGUYRKSA-N 0.000 description 2
- IGOYNRWLWHWAQO-JTQLQIEISA-N Gly-Phe-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)CN)CC1=CC=CC=C1 IGOYNRWLWHWAQO-JTQLQIEISA-N 0.000 description 2
- MXIULRKNFSCJHT-STQMWFEESA-N Gly-Phe-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CC=CC=C1 MXIULRKNFSCJHT-STQMWFEESA-N 0.000 description 2
- OHUKZZYSJBKFRR-WHFBIAKZSA-N Gly-Ser-Asp Chemical compound [H]NCC(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O OHUKZZYSJBKFRR-WHFBIAKZSA-N 0.000 description 2
- LBDXVCBAJJNJNN-WHFBIAKZSA-N Gly-Ser-Cys Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@@H](CS)C(O)=O LBDXVCBAJJNJNN-WHFBIAKZSA-N 0.000 description 2
- LCRDMSSAKLTKBU-ZDLURKLDSA-N Gly-Ser-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)CN LCRDMSSAKLTKBU-ZDLURKLDSA-N 0.000 description 2
- ZKJZBRHRWKLVSJ-ZDLURKLDSA-N Gly-Thr-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)CN)O ZKJZBRHRWKLVSJ-ZDLURKLDSA-N 0.000 description 2
- DBUNZBWUWCIELX-JHEQGTHGSA-N Gly-Thr-Glu Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O DBUNZBWUWCIELX-JHEQGTHGSA-N 0.000 description 2
- HUFUVTYGPOUCBN-MBLNEYKQSA-N Gly-Thr-Ile Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O HUFUVTYGPOUCBN-MBLNEYKQSA-N 0.000 description 2
- CUVBTVWFVIIDOC-YEPSODPASA-N Gly-Thr-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)O)NC(=O)CN CUVBTVWFVIIDOC-YEPSODPASA-N 0.000 description 2
- RIYIFUFFFBIOEU-KBPBESRZSA-N Gly-Tyr-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CC=C(O)C=C1 RIYIFUFFFBIOEU-KBPBESRZSA-N 0.000 description 2
- GJHWILMUOANXTG-WPRPVWTQSA-N Gly-Val-Arg Chemical compound [H]NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O GJHWILMUOANXTG-WPRPVWTQSA-N 0.000 description 2
- SYOJVRNQCXYEOV-XVKPBYJWSA-N Gly-Val-Glu Chemical compound [H]NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O SYOJVRNQCXYEOV-XVKPBYJWSA-N 0.000 description 2
- IZVICCORZOSGPT-JSGCOSHPSA-N Gly-Val-Tyr Chemical compound [H]NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O IZVICCORZOSGPT-JSGCOSHPSA-N 0.000 description 2
- KSOBNUBCYHGUKH-UWVGGRQHSA-N Gly-Val-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)CN KSOBNUBCYHGUKH-UWVGGRQHSA-N 0.000 description 2
- DHMQDGOQFOQNFH-UHFFFAOYSA-N Glycine Chemical compound NCC(O)=O DHMQDGOQFOQNFH-UHFFFAOYSA-N 0.000 description 2
- RVKIPWVMZANZLI-UHFFFAOYSA-N H-Lys-Trp-OH Natural products C1=CC=C2C(CC(NC(=O)C(N)CCCCN)C(O)=O)=CNC2=C1 RVKIPWVMZANZLI-UHFFFAOYSA-N 0.000 description 2
- MJNWEIMBXKKCSF-XVYDVKMFSA-N His-Ala-Asn Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N MJNWEIMBXKKCSF-XVYDVKMFSA-N 0.000 description 2
- KYMUEAZVLPRVAE-GUBZILKMSA-N His-Asn-Glu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O KYMUEAZVLPRVAE-GUBZILKMSA-N 0.000 description 2
- JWTKVPMQCCRPQY-SRVKXCTJSA-N His-Asn-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O JWTKVPMQCCRPQY-SRVKXCTJSA-N 0.000 description 2
- LYSVCKOXIDKEEL-SRVKXCTJSA-N His-Asn-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CC1=CN=CN1 LYSVCKOXIDKEEL-SRVKXCTJSA-N 0.000 description 2
- HRGGKHFHRSFSDE-CIUDSAMLSA-N His-Asn-Ser Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CO)C(=O)O)N HRGGKHFHRSFSDE-CIUDSAMLSA-N 0.000 description 2
- AKEDPWJFQULLPE-IUCAKERBSA-N His-Glu-Gly Chemical compound N[C@@H](Cc1cnc[nH]1)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O AKEDPWJFQULLPE-IUCAKERBSA-N 0.000 description 2
- XMENRVZYPBKBIL-AVGNSLFASA-N His-Glu-His Chemical compound N[C@@H](Cc1cnc[nH]1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](Cc1cnc[nH]1)C(O)=O XMENRVZYPBKBIL-AVGNSLFASA-N 0.000 description 2
- VTMLJMNQHKBPON-QWRGUYRKSA-N His-Gly-His Chemical compound C([C@H](N)C(=O)NCC(=O)N[C@@H](CC=1NC=NC=1)C(O)=O)C1=CN=CN1 VTMLJMNQHKBPON-QWRGUYRKSA-N 0.000 description 2
- RGPWUJOMKFYFSR-QWRGUYRKSA-N His-Gly-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O RGPWUJOMKFYFSR-QWRGUYRKSA-N 0.000 description 2
- VJJSDSNFXCWCEJ-DJFWLOJKSA-N His-Ile-Asn Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O VJJSDSNFXCWCEJ-DJFWLOJKSA-N 0.000 description 2
- VYUXYMRNGALHEA-DLOVCJGASA-N His-Leu-Ala Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O VYUXYMRNGALHEA-DLOVCJGASA-N 0.000 description 2
- RNMNYMDTESKEAJ-KKUMJFAQSA-N His-Leu-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CN=CN1 RNMNYMDTESKEAJ-KKUMJFAQSA-N 0.000 description 2
- MTHDIEPOBSRDIV-ULQDDVLXSA-N His-Met-Tyr Chemical compound C([C@H](N)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CN=CN1 MTHDIEPOBSRDIV-ULQDDVLXSA-N 0.000 description 2
- PZAJPILZRFPYJJ-SRVKXCTJSA-N His-Ser-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O PZAJPILZRFPYJJ-SRVKXCTJSA-N 0.000 description 2
- JUCZDDVZBMPKRT-IXOXFDKPSA-N His-Thr-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N)O JUCZDDVZBMPKRT-IXOXFDKPSA-N 0.000 description 2
- NBWATNYAUVSAEQ-ZEILLAHLSA-N His-Thr-Thr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N)O NBWATNYAUVSAEQ-ZEILLAHLSA-N 0.000 description 2
- WSXNWASHQNSMRX-GVXVVHGQSA-N His-Val-Gln Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N WSXNWASHQNSMRX-GVXVVHGQSA-N 0.000 description 2
- CGAMSLMBYJHMDY-ONGXEEELSA-N His-Val-Gly Chemical compound CC(C)[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CC1=CN=CN1)N CGAMSLMBYJHMDY-ONGXEEELSA-N 0.000 description 2
- MCGOGXFMKHPMSQ-AVGNSLFASA-N His-Val-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC1=CN=CN1 MCGOGXFMKHPMSQ-AVGNSLFASA-N 0.000 description 2
- DRKZDEFADVYTLU-AVGNSLFASA-N His-Val-Val Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C(C)C)C(O)=O DRKZDEFADVYTLU-AVGNSLFASA-N 0.000 description 2
- LQSBBHNVAVNZSX-GHCJXIJMSA-N Ile-Ala-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](CC(=O)N)C(=O)O)N LQSBBHNVAVNZSX-GHCJXIJMSA-N 0.000 description 2
- DPTBVFUDCPINIP-JURCDPSOSA-N Ile-Ala-Phe Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 DPTBVFUDCPINIP-JURCDPSOSA-N 0.000 description 2
- HDOYNXLPTRQLAD-JBDRJPRFSA-N Ile-Ala-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(=O)O)N HDOYNXLPTRQLAD-JBDRJPRFSA-N 0.000 description 2
- TZCGZYWNIDZZMR-UHFFFAOYSA-N Ile-Arg-Ala Natural products CCC(C)C(N)C(=O)NC(C(=O)NC(C)C(O)=O)CCCN=C(N)N TZCGZYWNIDZZMR-UHFFFAOYSA-N 0.000 description 2
- DXUJSRIVSWEOAG-NAKRPEOUSA-N Ile-Arg-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CS)C(=O)O)N DXUJSRIVSWEOAG-NAKRPEOUSA-N 0.000 description 2
- UAVQIQOOBXFKRC-BYULHYEWSA-N Ile-Asn-Gly Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O UAVQIQOOBXFKRC-BYULHYEWSA-N 0.000 description 2
- YPQDTQJBOFOTJQ-SXTJYALSSA-N Ile-Asn-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N YPQDTQJBOFOTJQ-SXTJYALSSA-N 0.000 description 2
- XENGULNPUDGALZ-ZPFDUUQYSA-N Ile-Asn-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(C)C)C(=O)O)N XENGULNPUDGALZ-ZPFDUUQYSA-N 0.000 description 2
- UKTUOMWSJPXODT-GUDRVLHUSA-N Ile-Asn-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N1CCC[C@@H]1C(=O)O)N UKTUOMWSJPXODT-GUDRVLHUSA-N 0.000 description 2
- QIHJTGSVGIPHIW-QSFUFRPTSA-N Ile-Asn-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](C(C)C)C(=O)O)N QIHJTGSVGIPHIW-QSFUFRPTSA-N 0.000 description 2
- XLDYDEDTGMHUCZ-GHCJXIJMSA-N Ile-Asp-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CS)C(=O)O)N XLDYDEDTGMHUCZ-GHCJXIJMSA-N 0.000 description 2
- BGZIJZJBXRVBGJ-SXTJYALSSA-N Ile-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N BGZIJZJBXRVBGJ-SXTJYALSSA-N 0.000 description 2
- QSPLUJGYOPZINY-ZPFDUUQYSA-N Ile-Asp-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N QSPLUJGYOPZINY-ZPFDUUQYSA-N 0.000 description 2
- NPROWIBAWYMPAZ-GUDRVLHUSA-N Ile-Asp-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N NPROWIBAWYMPAZ-GUDRVLHUSA-N 0.000 description 2
- AQTWDZDISVGCAC-CFMVVWHZSA-N Ile-Asp-Tyr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N AQTWDZDISVGCAC-CFMVVWHZSA-N 0.000 description 2
- WTOAPTKSZJJWKK-HTFCKZLJSA-N Ile-Cys-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N WTOAPTKSZJJWKK-HTFCKZLJSA-N 0.000 description 2
- BSWLQVGEVFYGIM-ZPFDUUQYSA-N Ile-Gln-Arg Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N BSWLQVGEVFYGIM-ZPFDUUQYSA-N 0.000 description 2
- JXMSHKFPDIUYGS-SIUGBPQLSA-N Ile-Glu-Tyr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N JXMSHKFPDIUYGS-SIUGBPQLSA-N 0.000 description 2
- WUKLZPHVWAMZQV-UKJIMTQDSA-N Ile-Glu-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](C(C)C)C(=O)O)N WUKLZPHVWAMZQV-UKJIMTQDSA-N 0.000 description 2
- CDGLBYSAZFIIJO-RCOVLWMOSA-N Ile-Gly-Gly Chemical compound CC[C@H](C)[C@H]([NH3+])C(=O)NCC(=O)NCC([O-])=O CDGLBYSAZFIIJO-RCOVLWMOSA-N 0.000 description 2
- MQFGXJNSUJTXDT-QSFUFRPTSA-N Ile-Gly-Ile Chemical compound N[C@@H]([C@@H](C)CC)C(=O)NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)O MQFGXJNSUJTXDT-QSFUFRPTSA-N 0.000 description 2
- VOBYAKCXGQQFLR-LSJOCFKGSA-N Ile-Gly-Val Chemical compound CC[C@H](C)[C@H](N)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O VOBYAKCXGQQFLR-LSJOCFKGSA-N 0.000 description 2
- KEKTTYCXKGBAAL-VGDYDELISA-N Ile-His-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CO)C(=O)O)N KEKTTYCXKGBAAL-VGDYDELISA-N 0.000 description 2
- KYLIZSDYWQQTFM-PEDHHIEDSA-N Ile-Ile-Arg Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CCCN=C(N)N KYLIZSDYWQQTFM-PEDHHIEDSA-N 0.000 description 2
- RIVKTKFVWXRNSJ-GRLWGSQLSA-N Ile-Ile-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N RIVKTKFVWXRNSJ-GRLWGSQLSA-N 0.000 description 2
- NZGTYCMLUGYMCV-XUXIUFHCSA-N Ile-Lys-Arg Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N NZGTYCMLUGYMCV-XUXIUFHCSA-N 0.000 description 2
- PARSHQDZROHERM-NHCYSSNCSA-N Ile-Lys-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)NCC(=O)O)N PARSHQDZROHERM-NHCYSSNCSA-N 0.000 description 2
- YSGBJIQXTIVBHZ-AJNGGQMLSA-N Ile-Lys-Leu Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O YSGBJIQXTIVBHZ-AJNGGQMLSA-N 0.000 description 2
- GLYJPWIRLBAIJH-FQUUOJAGSA-N Ile-Lys-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N1CCC[C@@H]1C(=O)O)N GLYJPWIRLBAIJH-FQUUOJAGSA-N 0.000 description 2
- UAELWXJFLZBKQS-WHOFXGATSA-N Ile-Phe-Gly Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](Cc1ccccc1)C(=O)NCC(O)=O UAELWXJFLZBKQS-WHOFXGATSA-N 0.000 description 2
- XLXPYSDGMXTTNQ-DKIMLUQUSA-N Ile-Phe-Leu Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](Cc1ccccc1)C(=O)N[C@@H](CC(C)C)C(O)=O XLXPYSDGMXTTNQ-DKIMLUQUSA-N 0.000 description 2
- XLXPYSDGMXTTNQ-UHFFFAOYSA-N Ile-Phe-Leu Natural products CCC(C)C(N)C(=O)NC(C(=O)NC(CC(C)C)C(O)=O)CC1=CC=CC=C1 XLXPYSDGMXTTNQ-UHFFFAOYSA-N 0.000 description 2
- CIDLJWVDMNDKPT-FIRPJDEBSA-N Ile-Phe-Phe Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC2=CC=CC=C2)C(=O)O)N CIDLJWVDMNDKPT-FIRPJDEBSA-N 0.000 description 2
- YKZAMJXNJUWFIK-JBDRJPRFSA-N Ile-Ser-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(=O)O)N YKZAMJXNJUWFIK-JBDRJPRFSA-N 0.000 description 2
- JODPUDMBQBIWCK-GHCJXIJMSA-N Ile-Ser-Asn Chemical compound [H]N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O JODPUDMBQBIWCK-GHCJXIJMSA-N 0.000 description 2
- XMYURPUVJSKTMC-KBIXCLLPSA-N Ile-Ser-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N XMYURPUVJSKTMC-KBIXCLLPSA-N 0.000 description 2
- PELCGFMHLZXWBQ-BJDJZHNGSA-N Ile-Ser-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)O)N PELCGFMHLZXWBQ-BJDJZHNGSA-N 0.000 description 2
- RQJUKVXWAKJDBW-SVSWQMSJSA-N Ile-Ser-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N RQJUKVXWAKJDBW-SVSWQMSJSA-N 0.000 description 2
- QHUREMVLLMNUAX-OSUNSFLBSA-N Ile-Thr-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)O)N QHUREMVLLMNUAX-OSUNSFLBSA-N 0.000 description 2
- ZFWISYLMLXFBSX-KKPKCPPISA-N Ile-Trp-Phe Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CC3=CC=CC=C3)C(=O)O)N ZFWISYLMLXFBSX-KKPKCPPISA-N 0.000 description 2
- GVEODXUBBFDBPW-MGHWNKPDSA-N Ile-Tyr-Leu Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C(O)=O)CC1=CC=C(O)C=C1 GVEODXUBBFDBPW-MGHWNKPDSA-N 0.000 description 2
- ZGKVPOSSTGHJAF-HJPIBITLSA-N Ile-Tyr-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CO)C(=O)O)N ZGKVPOSSTGHJAF-HJPIBITLSA-N 0.000 description 2
- YJRSIJZUIUANHO-NAKRPEOUSA-N Ile-Val-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(=O)O)N YJRSIJZUIUANHO-NAKRPEOUSA-N 0.000 description 2
- BCISUQVFDGYZBO-QSFUFRPTSA-N Ile-Val-Asp Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC(O)=O BCISUQVFDGYZBO-QSFUFRPTSA-N 0.000 description 2
- YWCJXQKATPNPOE-UKJIMTQDSA-N Ile-Val-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N YWCJXQKATPNPOE-UKJIMTQDSA-N 0.000 description 2
- WIYDLTIBHZSPKY-HJWJTTGWSA-N Ile-Val-Phe Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 WIYDLTIBHZSPKY-HJWJTTGWSA-N 0.000 description 2
- APQYGMBHIVXFML-OSUNSFLBSA-N Ile-Val-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N APQYGMBHIVXFML-OSUNSFLBSA-N 0.000 description 2
- XUJNEKJLAYXESH-REOHCLBHSA-N L-Cysteine Chemical compound SC[C@H](N)C(O)=O XUJNEKJLAYXESH-REOHCLBHSA-N 0.000 description 2
- IBMVEYRWAWIOTN-UHFFFAOYSA-N L-Leucyl-L-Arginyl-L-Proline Natural products CC(C)CC(N)C(=O)NC(CCCN=C(N)N)C(=O)N1CCCC1C(O)=O IBMVEYRWAWIOTN-UHFFFAOYSA-N 0.000 description 2
- WSGXUIQTEZDVHJ-GARJFASQSA-N Leu-Ala-Pro Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N1CCC[C@@H]1C(O)=O WSGXUIQTEZDVHJ-GARJFASQSA-N 0.000 description 2
- XBBKIIGCUMBKCO-JXUBOQSCSA-N Leu-Ala-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XBBKIIGCUMBKCO-JXUBOQSCSA-N 0.000 description 2
- SUPVSFFZWVOEOI-CQDKDKBSSA-N Leu-Ala-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 SUPVSFFZWVOEOI-CQDKDKBSSA-N 0.000 description 2
- IBMVEYRWAWIOTN-RWMBFGLXSA-N Leu-Arg-Pro Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N1CCC[C@@H]1C(O)=O IBMVEYRWAWIOTN-RWMBFGLXSA-N 0.000 description 2
- STAVRDQLZOTNKJ-RHYQMDGZSA-N Leu-Arg-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O STAVRDQLZOTNKJ-RHYQMDGZSA-N 0.000 description 2
- VCSBGUACOYUIGD-CIUDSAMLSA-N Leu-Asn-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O VCSBGUACOYUIGD-CIUDSAMLSA-N 0.000 description 2
- KKXDHFKZWKLYGB-GUBZILKMSA-N Leu-Asn-Glu Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N KKXDHFKZWKLYGB-GUBZILKMSA-N 0.000 description 2
- OIARJGNVARWKFP-YUMQZZPRSA-N Leu-Asn-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O OIARJGNVARWKFP-YUMQZZPRSA-N 0.000 description 2
- JKGHDYGZRDWHGA-SRVKXCTJSA-N Leu-Asn-Leu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O JKGHDYGZRDWHGA-SRVKXCTJSA-N 0.000 description 2
- POJPZSMTTMLSTG-SRVKXCTJSA-N Leu-Asn-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N POJPZSMTTMLSTG-SRVKXCTJSA-N 0.000 description 2
- ZURHXHNAEJJRNU-CIUDSAMLSA-N Leu-Asp-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O ZURHXHNAEJJRNU-CIUDSAMLSA-N 0.000 description 2
- DLFAACQHIRSQGG-CIUDSAMLSA-N Leu-Asp-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O DLFAACQHIRSQGG-CIUDSAMLSA-N 0.000 description 2
- PJYSOYLLTJKZHC-GUBZILKMSA-N Leu-Asp-Gln Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCC(N)=O PJYSOYLLTJKZHC-GUBZILKMSA-N 0.000 description 2
- KTFHTMHHKXUYPW-ZPFDUUQYSA-N Leu-Asp-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KTFHTMHHKXUYPW-ZPFDUUQYSA-N 0.000 description 2
- CLVUXCBGKUECIT-HJGDQZAQSA-N Leu-Asp-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O CLVUXCBGKUECIT-HJGDQZAQSA-N 0.000 description 2
- GBDMISNMNXVTNV-XIRDDKMYSA-N Leu-Asp-Trp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O GBDMISNMNXVTNV-XIRDDKMYSA-N 0.000 description 2
- LOLUPZNNADDTAA-AVGNSLFASA-N Leu-Gln-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O LOLUPZNNADDTAA-AVGNSLFASA-N 0.000 description 2
- GPICTNQYKHHHTH-GUBZILKMSA-N Leu-Gln-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O GPICTNQYKHHHTH-GUBZILKMSA-N 0.000 description 2
- KVMULWOHPPMHHE-DCAQKATOSA-N Leu-Glu-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O KVMULWOHPPMHHE-DCAQKATOSA-N 0.000 description 2
- WIDZHJTYKYBLSR-DCAQKATOSA-N Leu-Glu-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O WIDZHJTYKYBLSR-DCAQKATOSA-N 0.000 description 2
- HQUXQAMSWFIRET-AVGNSLFASA-N Leu-Glu-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN HQUXQAMSWFIRET-AVGNSLFASA-N 0.000 description 2
- OGUUKPXUTHOIAV-SDDRHHMPSA-N Leu-Glu-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N OGUUKPXUTHOIAV-SDDRHHMPSA-N 0.000 description 2
- KGCLIYGPQXUNLO-IUCAKERBSA-N Leu-Gly-Glu Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(O)=O KGCLIYGPQXUNLO-IUCAKERBSA-N 0.000 description 2
- VWHGTYCRDRBSFI-ZETCQYMHSA-N Leu-Gly-Gly Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)NCC(O)=O VWHGTYCRDRBSFI-ZETCQYMHSA-N 0.000 description 2
- CCQLQKZTXZBXTN-NHCYSSNCSA-N Leu-Gly-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)NCC(=O)N[C@@H]([C@@H](C)CC)C(O)=O CCQLQKZTXZBXTN-NHCYSSNCSA-N 0.000 description 2
- QPXBPQUGXHURGP-UWVGGRQHSA-N Leu-Gly-Met Chemical compound CC(C)C[C@@H](C(=O)NCC(=O)N[C@@H](CCSC)C(=O)O)N QPXBPQUGXHURGP-UWVGGRQHSA-N 0.000 description 2
- UCDHVOALNXENLC-KBPBESRZSA-N Leu-Gly-Tyr Chemical compound CC(C)C[C@H]([NH3+])C(=O)NCC(=O)N[C@H](C([O-])=O)CC1=CC=C(O)C=C1 UCDHVOALNXENLC-KBPBESRZSA-N 0.000 description 2
- AUBMZAMQCOYSIC-MNXVOIDGSA-N Leu-Ile-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(N)=O)C(O)=O AUBMZAMQCOYSIC-MNXVOIDGSA-N 0.000 description 2
- QJXHMYMRGDOHRU-NHCYSSNCSA-N Leu-Ile-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O QJXHMYMRGDOHRU-NHCYSSNCSA-N 0.000 description 2
- KUIDCYNIEJBZBU-AJNGGQMLSA-N Leu-Ile-Leu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O KUIDCYNIEJBZBU-AJNGGQMLSA-N 0.000 description 2
- LIINDKYIGYTDLG-PPCPHDFISA-N Leu-Ile-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LIINDKYIGYTDLG-PPCPHDFISA-N 0.000 description 2
- TVEOVCYCYGKVPP-HSCHXYMDSA-N Leu-Ile-Trp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CC(C)C)N TVEOVCYCYGKVPP-HSCHXYMDSA-N 0.000 description 2
- KYIIALJHAOIAHF-KKUMJFAQSA-N Leu-Leu-His Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 KYIIALJHAOIAHF-KKUMJFAQSA-N 0.000 description 2
- ZRHDPZAAWLXXIR-SRVKXCTJSA-N Leu-Lys-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O ZRHDPZAAWLXXIR-SRVKXCTJSA-N 0.000 description 2
- VCHVSKNMTXWIIP-SRVKXCTJSA-N Leu-Lys-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O VCHVSKNMTXWIIP-SRVKXCTJSA-N 0.000 description 2
- ONPJGOIVICHWBW-BZSNNMDCSA-N Leu-Lys-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 ONPJGOIVICHWBW-BZSNNMDCSA-N 0.000 description 2
- LZHJZLHSRGWBBE-IHRRRGAJSA-N Leu-Lys-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O LZHJZLHSRGWBBE-IHRRRGAJSA-N 0.000 description 2
- PKKMDPNFGULLNQ-AVGNSLFASA-N Leu-Met-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O PKKMDPNFGULLNQ-AVGNSLFASA-N 0.000 description 2
- NHRINZSPIUXYQZ-DCAQKATOSA-N Leu-Met-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CS)C(=O)O)N NHRINZSPIUXYQZ-DCAQKATOSA-N 0.000 description 2
- DDVHDMSBLRAKNV-IHRRRGAJSA-N Leu-Met-Leu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(O)=O DDVHDMSBLRAKNV-IHRRRGAJSA-N 0.000 description 2
- HDHQQEDVWQGBEE-DCAQKATOSA-N Leu-Met-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CO)C(O)=O HDHQQEDVWQGBEE-DCAQKATOSA-N 0.000 description 2
- KQFZKDITNUEVFJ-JYJNAYRXSA-N Leu-Phe-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CC(C)C)CC1=CC=CC=C1 KQFZKDITNUEVFJ-JYJNAYRXSA-N 0.000 description 2
- FYPWFNKQVVEELI-ULQDDVLXSA-N Leu-Phe-Val Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C(C)C)C(O)=O)CC1=CC=CC=C1 FYPWFNKQVVEELI-ULQDDVLXSA-N 0.000 description 2
- XWEVVRRSIOBJOO-SRVKXCTJSA-N Leu-Pro-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O XWEVVRRSIOBJOO-SRVKXCTJSA-N 0.000 description 2
- IDGZVZJLYFTXSL-DCAQKATOSA-N Leu-Ser-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCN=C(N)N IDGZVZJLYFTXSL-DCAQKATOSA-N 0.000 description 2
- IZPVWNSAVUQBGP-CIUDSAMLSA-N Leu-Ser-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O IZPVWNSAVUQBGP-CIUDSAMLSA-N 0.000 description 2
- ADJWHHZETYAAAX-SRVKXCTJSA-N Leu-Ser-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N ADJWHHZETYAAAX-SRVKXCTJSA-N 0.000 description 2
- IWMJFLJQHIDZQW-KKUMJFAQSA-N Leu-Ser-Phe Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 IWMJFLJQHIDZQW-KKUMJFAQSA-N 0.000 description 2
- SBANPBVRHYIMRR-UHFFFAOYSA-N Leu-Ser-Pro Natural products CC(C)CC(N)C(=O)NC(CO)C(=O)N1CCCC1C(O)=O SBANPBVRHYIMRR-UHFFFAOYSA-N 0.000 description 2
- SVBJIZVVYJYGLA-DCAQKATOSA-N Leu-Ser-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O SVBJIZVVYJYGLA-DCAQKATOSA-N 0.000 description 2
- LFSQWRSVPNKJGP-WDCWCFNPSA-N Leu-Thr-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCC(O)=O LFSQWRSVPNKJGP-WDCWCFNPSA-N 0.000 description 2
- WBRJVRXEGQIDRK-XIRDDKMYSA-N Leu-Trp-Ser Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](N)CC(C)C)C(=O)N[C@@H](CO)C(O)=O)=CNC2=C1 WBRJVRXEGQIDRK-XIRDDKMYSA-N 0.000 description 2
- WUHBLPVELFTPQK-KKUMJFAQSA-N Leu-Tyr-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(O)=O WUHBLPVELFTPQK-KKUMJFAQSA-N 0.000 description 2
- VHTIZYYHIUHMCA-JYJNAYRXSA-N Leu-Tyr-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O VHTIZYYHIUHMCA-JYJNAYRXSA-N 0.000 description 2
- ARNIBBOXIAWUOP-MGHWNKPDSA-N Leu-Tyr-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O ARNIBBOXIAWUOP-MGHWNKPDSA-N 0.000 description 2
- YIRIDPUGZKHMHT-ACRUOGEOSA-N Leu-Tyr-Tyr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O YIRIDPUGZKHMHT-ACRUOGEOSA-N 0.000 description 2
- AAKRWBIIGKPOKQ-ONGXEEELSA-N Leu-Val-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O AAKRWBIIGKPOKQ-ONGXEEELSA-N 0.000 description 2
- XOEDPXDZJHBQIX-ULQDDVLXSA-N Leu-Val-Phe Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 XOEDPXDZJHBQIX-ULQDDVLXSA-N 0.000 description 2
- FZIJIFCXUCZHOL-CIUDSAMLSA-N Lys-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCCN FZIJIFCXUCZHOL-CIUDSAMLSA-N 0.000 description 2
- YIBOAHAOAWACDK-QEJZJMRPSA-N Lys-Ala-Phe Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 YIBOAHAOAWACDK-QEJZJMRPSA-N 0.000 description 2
- CLBGMWIYPYAZPR-AVGNSLFASA-N Lys-Arg-Arg Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O CLBGMWIYPYAZPR-AVGNSLFASA-N 0.000 description 2
- ALSRJRIWBNENFY-DCAQKATOSA-N Lys-Arg-Asn Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(O)=O ALSRJRIWBNENFY-DCAQKATOSA-N 0.000 description 2
- GAOJCVKPIGHTGO-UWVGGRQHSA-N Lys-Arg-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O GAOJCVKPIGHTGO-UWVGGRQHSA-N 0.000 description 2
- NTSPQIONFJUMJV-AVGNSLFASA-N Lys-Arg-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(O)=O NTSPQIONFJUMJV-AVGNSLFASA-N 0.000 description 2
- DGAAQRAUOFHBFJ-CIUDSAMLSA-N Lys-Asn-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(O)=O DGAAQRAUOFHBFJ-CIUDSAMLSA-N 0.000 description 2
- HQVDJTYKCMIWJP-YUMQZZPRSA-N Lys-Asn-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O HQVDJTYKCMIWJP-YUMQZZPRSA-N 0.000 description 2
- FACUGMGEFUEBTI-SRVKXCTJSA-N Lys-Asn-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CCCCN FACUGMGEFUEBTI-SRVKXCTJSA-N 0.000 description 2
- YVSHZSUKQHNDHD-KKUMJFAQSA-N Lys-Asn-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCCCN)N YVSHZSUKQHNDHD-KKUMJFAQSA-N 0.000 description 2
- QUCDKEKDPYISNX-HJGDQZAQSA-N Lys-Asn-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O QUCDKEKDPYISNX-HJGDQZAQSA-N 0.000 description 2
- HKCCVDWHHTVVPN-CIUDSAMLSA-N Lys-Asp-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O HKCCVDWHHTVVPN-CIUDSAMLSA-N 0.000 description 2
- IWWMPCPLFXFBAF-SRVKXCTJSA-N Lys-Asp-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O IWWMPCPLFXFBAF-SRVKXCTJSA-N 0.000 description 2
- KWUKZRFFKPLUPE-HJGDQZAQSA-N Lys-Asp-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KWUKZRFFKPLUPE-HJGDQZAQSA-N 0.000 description 2
- ZAENPHCEQXALHO-GUBZILKMSA-N Lys-Cys-Glu Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCC(O)=O)C(O)=O ZAENPHCEQXALHO-GUBZILKMSA-N 0.000 description 2
- MWVUEPNEPWMFBD-SRVKXCTJSA-N Lys-Cys-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@H](C(O)=O)CCCCN MWVUEPNEPWMFBD-SRVKXCTJSA-N 0.000 description 2
- GGNOBVSOZPHLCE-GUBZILKMSA-N Lys-Gln-Asp Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O GGNOBVSOZPHLCE-GUBZILKMSA-N 0.000 description 2
- HWMZUBUEOYAQSC-DCAQKATOSA-N Lys-Gln-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O HWMZUBUEOYAQSC-DCAQKATOSA-N 0.000 description 2
- RZHLIPMZXOEJTL-AVGNSLFASA-N Lys-Gln-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCCCN)N RZHLIPMZXOEJTL-AVGNSLFASA-N 0.000 description 2
- NDORZBUHCOJQDO-GVXVVHGQSA-N Lys-Gln-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O NDORZBUHCOJQDO-GVXVVHGQSA-N 0.000 description 2
- ZXEUFAVXODIPHC-GUBZILKMSA-N Lys-Glu-Asn Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O ZXEUFAVXODIPHC-GUBZILKMSA-N 0.000 description 2
- LCMWVZLBCUVDAZ-IUCAKERBSA-N Lys-Gly-Glu Chemical compound [NH3+]CCCC[C@H]([NH3+])C(=O)NCC(=O)N[C@H](C([O-])=O)CCC([O-])=O LCMWVZLBCUVDAZ-IUCAKERBSA-N 0.000 description 2
- GQFDWEDHOQRNLC-QWRGUYRKSA-N Lys-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CCCCN GQFDWEDHOQRNLC-QWRGUYRKSA-N 0.000 description 2
- VLMNBMFYRMGEMB-QWRGUYRKSA-N Lys-His-Gly Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CNC=N1 VLMNBMFYRMGEMB-QWRGUYRKSA-N 0.000 description 2
- WOEDRPCHKPSFDT-MXAVVETBSA-N Lys-His-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CCCCN)N WOEDRPCHKPSFDT-MXAVVETBSA-N 0.000 description 2
- PGLGNCVOWIORQE-SRVKXCTJSA-N Lys-His-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(O)=O PGLGNCVOWIORQE-SRVKXCTJSA-N 0.000 description 2
- ZXFRGTAIIZHNHG-AJNGGQMLSA-N Lys-Ile-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)O)NC(=O)[C@H](CCCCN)N ZXFRGTAIIZHNHG-AJNGGQMLSA-N 0.000 description 2
- NJNRBRKHOWSGMN-SRVKXCTJSA-N Lys-Leu-Asn Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O NJNRBRKHOWSGMN-SRVKXCTJSA-N 0.000 description 2
- ONPDTSFZAIWMDI-AVGNSLFASA-N Lys-Leu-Gln Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O ONPDTSFZAIWMDI-AVGNSLFASA-N 0.000 description 2
- QKXZCUCBFPEXNK-KKUMJFAQSA-N Lys-Leu-His Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 QKXZCUCBFPEXNK-KKUMJFAQSA-N 0.000 description 2
- WVJNGSFKBKOKRV-AJNGGQMLSA-N Lys-Leu-Ile Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WVJNGSFKBKOKRV-AJNGGQMLSA-N 0.000 description 2
- RBEATVHTWHTHTJ-KKUMJFAQSA-N Lys-Leu-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(O)=O RBEATVHTWHTHTJ-KKUMJFAQSA-N 0.000 description 2
- YUAXTFMFMOIMAM-QWRGUYRKSA-N Lys-Lys-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O YUAXTFMFMOIMAM-QWRGUYRKSA-N 0.000 description 2
- LMGNWHDWJDIOPK-DKIMLUQUSA-N Lys-Phe-Ile Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O LMGNWHDWJDIOPK-DKIMLUQUSA-N 0.000 description 2
- AZOFEHCPMBRNFD-BZSNNMDCSA-N Lys-Phe-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CCCCN)C(O)=O)CC1=CC=CC=C1 AZOFEHCPMBRNFD-BZSNNMDCSA-N 0.000 description 2
- BOJYMMBYBNOOGG-DCAQKATOSA-N Lys-Pro-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O BOJYMMBYBNOOGG-DCAQKATOSA-N 0.000 description 2
- AFLBTVGQCQLOFJ-AVGNSLFASA-N Lys-Pro-Arg Chemical compound NCCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCN=C(N)N)C(O)=O AFLBTVGQCQLOFJ-AVGNSLFASA-N 0.000 description 2
- QBHGXFQJFPWJIH-XUXIUFHCSA-N Lys-Pro-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CCCCN QBHGXFQJFPWJIH-XUXIUFHCSA-N 0.000 description 2
- GHKXHCMRAUYLBS-CIUDSAMLSA-N Lys-Ser-Asn Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O GHKXHCMRAUYLBS-CIUDSAMLSA-N 0.000 description 2
- LKDXINHHSWFFJC-SRVKXCTJSA-N Lys-Ser-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCCCN)N LKDXINHHSWFFJC-SRVKXCTJSA-N 0.000 description 2
- SQXZLVXQXWILKW-KKUMJFAQSA-N Lys-Ser-Phe Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O SQXZLVXQXWILKW-KKUMJFAQSA-N 0.000 description 2
- UWHCKWNPWKTMBM-WDCWCFNPSA-N Lys-Thr-Gln Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(O)=O UWHCKWNPWKTMBM-WDCWCFNPSA-N 0.000 description 2
- RPWTZTBIFGENIA-VOAKCMCISA-N Lys-Thr-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O RPWTZTBIFGENIA-VOAKCMCISA-N 0.000 description 2
- BDFHWFUAQLIMJO-KXNHARMFSA-N Lys-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCCN)N)O BDFHWFUAQLIMJO-KXNHARMFSA-N 0.000 description 2
- CNXOBMMOYZPPGS-NUTKFTJISA-N Lys-Trp-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](C)C(O)=O CNXOBMMOYZPPGS-NUTKFTJISA-N 0.000 description 2
- YUTZYVTZDVZBJJ-IHPCNDPISA-N Lys-Trp-Lys Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](N)CCCCN)C(=O)N[C@@H](CCCCN)C(O)=O)=CNC2=C1 YUTZYVTZDVZBJJ-IHPCNDPISA-N 0.000 description 2
- UGCIQUYEJIEHKX-GVXVVHGQSA-N Lys-Val-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O UGCIQUYEJIEHKX-GVXVVHGQSA-N 0.000 description 2
- VKCPHIOZDWUFSW-ONGXEEELSA-N Lys-Val-Gly Chemical compound OC(=O)CNC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCCN VKCPHIOZDWUFSW-ONGXEEELSA-N 0.000 description 2
- DRRXXZBXDMLGFC-IHRRRGAJSA-N Lys-Val-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCCN DRRXXZBXDMLGFC-IHRRRGAJSA-N 0.000 description 2
- TXTZMVNJIRZABH-ULQDDVLXSA-N Lys-Val-Phe Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 TXTZMVNJIRZABH-ULQDDVLXSA-N 0.000 description 2
- OZVXDDFYCQOPFD-XQQFMLRXSA-N Lys-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCCN)N OZVXDDFYCQOPFD-XQQFMLRXSA-N 0.000 description 2
- IKXQOBUBZSOWDY-AVGNSLFASA-N Lys-Val-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)O)NC(=O)[C@H](CCCCN)N IKXQOBUBZSOWDY-AVGNSLFASA-N 0.000 description 2
- VHGIWFGJIHTASW-FXQIFTODSA-N Met-Ala-Asp Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC(O)=O VHGIWFGJIHTASW-FXQIFTODSA-N 0.000 description 2
- AHZNUGRZHMZGFL-GUBZILKMSA-N Met-Arg-Ser Chemical compound CSCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CO)C(O)=O)CCCNC(N)=N AHZNUGRZHMZGFL-GUBZILKMSA-N 0.000 description 2
- XBYKTPZCWQQSGB-IHRRRGAJSA-N Met-Cys-Tyr Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 XBYKTPZCWQQSGB-IHRRRGAJSA-N 0.000 description 2
- VOOINLQYUZOREH-SRVKXCTJSA-N Met-Gln-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCSC)N VOOINLQYUZOREH-SRVKXCTJSA-N 0.000 description 2
- HLQWFLJOJRFXHO-CIUDSAMLSA-N Met-Glu-Ser Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O HLQWFLJOJRFXHO-CIUDSAMLSA-N 0.000 description 2
- USBFEVBHEQBWDD-AVGNSLFASA-N Met-Leu-Val Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O USBFEVBHEQBWDD-AVGNSLFASA-N 0.000 description 2
- RATXDYWHIYNZLE-DCAQKATOSA-N Met-Lys-Cys Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CS)C(=O)O)N RATXDYWHIYNZLE-DCAQKATOSA-N 0.000 description 2
- IILAGWCGKJSBGB-IHRRRGAJSA-N Met-Phe-Asp Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(=O)O)C(=O)O)N IILAGWCGKJSBGB-IHRRRGAJSA-N 0.000 description 2
- BJPQKNHZHUCQNQ-SRVKXCTJSA-N Met-Pro-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CCSC)N BJPQKNHZHUCQNQ-SRVKXCTJSA-N 0.000 description 2
- SOAYQFDWEIWPPR-IHRRRGAJSA-N Met-Ser-Tyr Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O SOAYQFDWEIWPPR-IHRRRGAJSA-N 0.000 description 2
- NSMXRFMGZYTFEX-KJEVXHAQSA-N Met-Thr-Tyr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)[C@H](CCSC)N)O NSMXRFMGZYTFEX-KJEVXHAQSA-N 0.000 description 2
- OOXVBECOTYHTCK-WDSOQIARSA-N Met-Trp-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@H](CCSC)N OOXVBECOTYHTCK-WDSOQIARSA-N 0.000 description 2
- 108010087066 N2-tryptophyllysine Proteins 0.000 description 2
- 238000005481 NMR spectroscopy Methods 0.000 description 2
- BQVUABVGYYSDCJ-UHFFFAOYSA-N Nalpha-L-Leucyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)CC(C)C)C(O)=O)=CNC2=C1 BQVUABVGYYSDCJ-UHFFFAOYSA-N 0.000 description 2
- 101100342977 Neurospora crassa (strain ATCC 24698 / 74-OR23-1A / CBS 708.71 / DSM 1257 / FGSC 987) leu-1 gene Proteins 0.000 description 2
- 101710141454 Nucleoprotein Proteins 0.000 description 2
- LSXGADJXBDFXQU-DLOVCJGASA-N Phe-Ala-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=CC=C1 LSXGADJXBDFXQU-DLOVCJGASA-N 0.000 description 2
- MDHZEOMXGNBSIL-DLOVCJGASA-N Phe-Ala-Cys Chemical compound C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N MDHZEOMXGNBSIL-DLOVCJGASA-N 0.000 description 2
- KAHUBGWSIQNZQQ-KKUMJFAQSA-N Phe-Asn-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 KAHUBGWSIQNZQQ-KKUMJFAQSA-N 0.000 description 2
- WGXOKDLDIWSOCV-MELADBBJSA-N Phe-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC2=CC=CC=C2)N)C(=O)O WGXOKDLDIWSOCV-MELADBBJSA-N 0.000 description 2
- WMGVYPPIMZPWPN-SRVKXCTJSA-N Phe-Asp-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N WMGVYPPIMZPWPN-SRVKXCTJSA-N 0.000 description 2
- DDYIRGBOZVKRFR-AVGNSLFASA-N Phe-Asp-Glu Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N DDYIRGBOZVKRFR-AVGNSLFASA-N 0.000 description 2
- LXUJDHOKVUYHRC-KKUMJFAQSA-N Phe-Cys-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CC1=CC=CC=C1)N LXUJDHOKVUYHRC-KKUMJFAQSA-N 0.000 description 2
- UNLYPPYNDXHGDG-IHRRRGAJSA-N Phe-Gln-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 UNLYPPYNDXHGDG-IHRRRGAJSA-N 0.000 description 2
- KYYMILWEGJYPQZ-IHRRRGAJSA-N Phe-Glu-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 KYYMILWEGJYPQZ-IHRRRGAJSA-N 0.000 description 2
- PSKRILMFHNIUAO-JYJNAYRXSA-N Phe-Glu-Lys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N PSKRILMFHNIUAO-JYJNAYRXSA-N 0.000 description 2
- CSDMCMITJLKBAH-SOUVJXGZSA-N Phe-Glu-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC2=CC=CC=C2)N)C(=O)O CSDMCMITJLKBAH-SOUVJXGZSA-N 0.000 description 2
- JJHVFCUWLSKADD-ONGXEEELSA-N Phe-Gly-Ala Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)NCC(=O)N[C@@H](C)C(O)=O JJHVFCUWLSKADD-ONGXEEELSA-N 0.000 description 2
- APJPXSFJBMMOLW-KBPBESRZSA-N Phe-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC1=CC=CC=C1 APJPXSFJBMMOLW-KBPBESRZSA-N 0.000 description 2
- NPLGQVKZFGJWAI-QWHCGFSZSA-N Phe-Gly-Pro Chemical compound C1C[C@@H](N(C1)C(=O)CNC(=O)[C@H](CC2=CC=CC=C2)N)C(=O)O NPLGQVKZFGJWAI-QWHCGFSZSA-N 0.000 description 2
- GYEPCBNTTRORKW-PCBIJLKTSA-N Phe-Ile-Asp Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O GYEPCBNTTRORKW-PCBIJLKTSA-N 0.000 description 2
- XMQSOOJRRVEHRO-ULQDDVLXSA-N Phe-Leu-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 XMQSOOJRRVEHRO-ULQDDVLXSA-N 0.000 description 2
- KBVJZCVLQWCJQN-KKUMJFAQSA-N Phe-Leu-Asn Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O KBVJZCVLQWCJQN-KKUMJFAQSA-N 0.000 description 2
- METZZBCMDXHFMK-BZSNNMDCSA-N Phe-Leu-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC2=CC=CC=C2)N METZZBCMDXHFMK-BZSNNMDCSA-N 0.000 description 2
- LRBSWBVUCLLRLU-BZSNNMDCSA-N Phe-Leu-Lys Chemical compound CC(C)C[C@H](NC(=O)[C@@H](N)Cc1ccccc1)C(=O)N[C@@H](CCCCN)C(O)=O LRBSWBVUCLLRLU-BZSNNMDCSA-N 0.000 description 2
- KPEIBEPEUAZWNS-ULQDDVLXSA-N Phe-Leu-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 KPEIBEPEUAZWNS-ULQDDVLXSA-N 0.000 description 2
- INHMISZWLJZQGH-ULQDDVLXSA-N Phe-Leu-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 INHMISZWLJZQGH-ULQDDVLXSA-N 0.000 description 2
- DMEYUTSDVRCWRS-ULQDDVLXSA-N Phe-Lys-Arg Chemical compound NC(=N)NCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC1=CC=CC=C1 DMEYUTSDVRCWRS-ULQDDVLXSA-N 0.000 description 2
- RMKGXGPQIPLTFC-KKUMJFAQSA-N Phe-Lys-Asn Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O RMKGXGPQIPLTFC-KKUMJFAQSA-N 0.000 description 2
- PHJUFDQVVKVOPU-ULQDDVLXSA-N Phe-Lys-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC1=CC=CC=C1)N PHJUFDQVVKVOPU-ULQDDVLXSA-N 0.000 description 2
- WZEWCHQHNCMBEN-PMVMPFDFSA-N Phe-Lys-Trp Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC2=CNC3=CC=CC=C32)C(=O)O)N WZEWCHQHNCMBEN-PMVMPFDFSA-N 0.000 description 2
- GPSMLZQVIIYLDK-ULQDDVLXSA-N Phe-Lys-Val Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O GPSMLZQVIIYLDK-ULQDDVLXSA-N 0.000 description 2
- WKLMCMXFMQEKCX-SLFFLAALSA-N Phe-Phe-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)NC(=O)[C@H](CC3=CC=CC=C3)N)C(=O)O WKLMCMXFMQEKCX-SLFFLAALSA-N 0.000 description 2
- RVEVENLSADZUMS-IHRRRGAJSA-N Phe-Pro-Asn Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O RVEVENLSADZUMS-IHRRRGAJSA-N 0.000 description 2
- WWPAHTZOWURIMR-ULQDDVLXSA-N Phe-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CC1=CC=CC=C1 WWPAHTZOWURIMR-ULQDDVLXSA-N 0.000 description 2
- YDUGVDGFKNXFPL-IXOXFDKPSA-N Phe-Thr-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N)O YDUGVDGFKNXFPL-IXOXFDKPSA-N 0.000 description 2
- KLYYKKGCPOGDPE-OEAJRASXSA-N Phe-Thr-Leu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O KLYYKKGCPOGDPE-OEAJRASXSA-N 0.000 description 2
- SHUFSZDAIPLZLF-BEAPCOKYSA-N Phe-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CC=CC=C2)N)O SHUFSZDAIPLZLF-BEAPCOKYSA-N 0.000 description 2
- MSSXKZBDKZAHCX-UNQGMJICSA-N Phe-Thr-Val Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O MSSXKZBDKZAHCX-UNQGMJICSA-N 0.000 description 2
- GCFNFKNPCMBHNT-IRXDYDNUSA-N Phe-Tyr-Gly Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)NCC(=O)O)N GCFNFKNPCMBHNT-IRXDYDNUSA-N 0.000 description 2
- ZYNBEWGJFXTBDU-ACRUOGEOSA-N Phe-Tyr-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CC2=CC=CC=C2)N ZYNBEWGJFXTBDU-ACRUOGEOSA-N 0.000 description 2
- IPVPGAADZXRZSH-RNXOBYDBSA-N Phe-Tyr-Trp Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O IPVPGAADZXRZSH-RNXOBYDBSA-N 0.000 description 2
- JSGWNFKWZNPDAV-YDHLFZDLSA-N Phe-Val-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 JSGWNFKWZNPDAV-YDHLFZDLSA-N 0.000 description 2
- DXWNFNOPBYAFRM-IHRRRGAJSA-N Phe-Val-Cys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N DXWNFNOPBYAFRM-IHRRRGAJSA-N 0.000 description 2
- JTKGCYOOJLUETJ-ULQDDVLXSA-N Phe-Val-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 JTKGCYOOJLUETJ-ULQDDVLXSA-N 0.000 description 2
- VDTYRPWRWRCROL-UFYCRDLUSA-N Phe-Val-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 VDTYRPWRWRCROL-UFYCRDLUSA-N 0.000 description 2
- IEIFEYBAYFSRBQ-IHRRRGAJSA-N Phe-Val-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N IEIFEYBAYFSRBQ-IHRRRGAJSA-N 0.000 description 2
- ISWSIDIOOBJBQZ-UHFFFAOYSA-N Phenol Chemical compound OC1=CC=CC=C1 ISWSIDIOOBJBQZ-UHFFFAOYSA-N 0.000 description 2
- AJLVKXCNXIJHDV-CIUDSAMLSA-N Pro-Ala-Gln Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(O)=O AJLVKXCNXIJHDV-CIUDSAMLSA-N 0.000 description 2
- IFMDQWDAJUMMJC-DCAQKATOSA-N Pro-Ala-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O IFMDQWDAJUMMJC-DCAQKATOSA-N 0.000 description 2
- SSSFPISOZOLQNP-GUBZILKMSA-N Pro-Arg-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O SSSFPISOZOLQNP-GUBZILKMSA-N 0.000 description 2
- UVKNEILZSJMKSR-FXQIFTODSA-N Pro-Asn-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H]1CCCN1 UVKNEILZSJMKSR-FXQIFTODSA-N 0.000 description 2
- VOHFZDSRPZLXLH-IHRRRGAJSA-N Pro-Asn-Phe Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O VOHFZDSRPZLXLH-IHRRRGAJSA-N 0.000 description 2
- UTAUEDINXUMHLG-FXQIFTODSA-N Pro-Asp-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@@H]1CCCN1 UTAUEDINXUMHLG-FXQIFTODSA-N 0.000 description 2
- SGCZFWSQERRKBD-BQBZGAKWSA-N Pro-Asp-Gly Chemical compound OC(=O)CNC(=O)[C@H](CC(O)=O)NC(=O)[C@@H]1CCCN1 SGCZFWSQERRKBD-BQBZGAKWSA-N 0.000 description 2
- ZCXQTRXYZOSGJR-FXQIFTODSA-N Pro-Asp-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O ZCXQTRXYZOSGJR-FXQIFTODSA-N 0.000 description 2
- XJROSHJRQTXWAE-XGEHTFHBSA-N Pro-Cys-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CS)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XJROSHJRQTXWAE-XGEHTFHBSA-N 0.000 description 2
- FRKBNXCFJBPJOL-GUBZILKMSA-N Pro-Glu-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O FRKBNXCFJBPJOL-GUBZILKMSA-N 0.000 description 2
- QNZLIVROMORQFH-BQBZGAKWSA-N Pro-Gly-Cys Chemical compound C1C[C@H](NC1)C(=O)NCC(=O)N[C@@H](CS)C(=O)O QNZLIVROMORQFH-BQBZGAKWSA-N 0.000 description 2
- HAEGAELAYWSUNC-WPRPVWTQSA-N Pro-Gly-Val Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O HAEGAELAYWSUNC-WPRPVWTQSA-N 0.000 description 2
- AQGUSRZKDZYGGV-GMOBBJLQSA-N Pro-Ile-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O AQGUSRZKDZYGGV-GMOBBJLQSA-N 0.000 description 2
- XYHMFGGWNOFUOU-QXEWZRGKSA-N Pro-Ile-Gly Chemical compound OC(=O)CNC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H]1CCCN1 XYHMFGGWNOFUOU-QXEWZRGKSA-N 0.000 description 2
- RUDOLGWDSKQQFF-DCAQKATOSA-N Pro-Leu-Asn Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O RUDOLGWDSKQQFF-DCAQKATOSA-N 0.000 description 2
- CPRLKHJUFAXVTD-ULQDDVLXSA-N Pro-Leu-Tyr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O CPRLKHJUFAXVTD-ULQDDVLXSA-N 0.000 description 2
- XQPHBAKJJJZOBX-SRVKXCTJSA-N Pro-Lys-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O XQPHBAKJJJZOBX-SRVKXCTJSA-N 0.000 description 2
- XZBYTHCRAVAXQQ-DCAQKATOSA-N Pro-Met-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCC(O)=O)C(O)=O XZBYTHCRAVAXQQ-DCAQKATOSA-N 0.000 description 2
- ZUZINZIJHJFJRN-UBHSHLNASA-N Pro-Phe-Ala Chemical compound C([C@@H](C(=O)N[C@@H](C)C(O)=O)NC(=O)[C@H]1NCCC1)C1=CC=CC=C1 ZUZINZIJHJFJRN-UBHSHLNASA-N 0.000 description 2
- MHBSUKYVBZVQRW-HJWJTTGWSA-N Pro-Phe-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O MHBSUKYVBZVQRW-HJWJTTGWSA-N 0.000 description 2
- YYARMJSFDLIDFS-FKBYEOEOSA-N Pro-Phe-Trp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O YYARMJSFDLIDFS-FKBYEOEOSA-N 0.000 description 2
- ITUDDXVFGFEKPD-NAKRPEOUSA-N Pro-Ser-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O ITUDDXVFGFEKPD-NAKRPEOUSA-N 0.000 description 2
- UGDMQJSXSSZUKL-IHRRRGAJSA-N Pro-Ser-Tyr Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O UGDMQJSXSSZUKL-IHRRRGAJSA-N 0.000 description 2
- PRKWBYCXBBSLSK-GUBZILKMSA-N Pro-Ser-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O PRKWBYCXBBSLSK-GUBZILKMSA-N 0.000 description 2
- DCHQYSOGURGJST-FJXKBIBVSA-N Pro-Thr-Gly Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O DCHQYSOGURGJST-FJXKBIBVSA-N 0.000 description 2
- FDMCIBSQRKFSTJ-RHYQMDGZSA-N Pro-Thr-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O FDMCIBSQRKFSTJ-RHYQMDGZSA-N 0.000 description 2
- AJJDPGVVNPUZCR-RHYQMDGZSA-N Pro-Thr-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@@H]1CCCN1)O AJJDPGVVNPUZCR-RHYQMDGZSA-N 0.000 description 2
- RMJZWERKFFNNNS-XGEHTFHBSA-N Pro-Thr-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O RMJZWERKFFNNNS-XGEHTFHBSA-N 0.000 description 2
- BXHRXLMCYSZSIY-STECZYCISA-N Pro-Tyr-Ile Chemical compound CC[C@H](C)[C@H](NC(=O)[C@H](Cc1ccc(O)cc1)NC(=O)[C@@H]1CCCN1)C(O)=O BXHRXLMCYSZSIY-STECZYCISA-N 0.000 description 2
- DYJTXTCEXMCPBF-UFYCRDLUSA-N Pro-Tyr-Phe Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)N[C@@H](CC3=CC=CC=C3)C(=O)O DYJTXTCEXMCPBF-UFYCRDLUSA-N 0.000 description 2
- XDKKMRPRRCOELJ-GUBZILKMSA-N Pro-Val-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C(C)C)NC(=O)[C@@H]1CCCN1 XDKKMRPRRCOELJ-GUBZILKMSA-N 0.000 description 2
- ZMLRZBWCXPQADC-TUAOUCFPSA-N Pro-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@@H]2CCCN2 ZMLRZBWCXPQADC-TUAOUCFPSA-N 0.000 description 2
- YDTUEBLEAVANFH-RCWTZXSCSA-N Pro-Val-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H]1CCCN1 YDTUEBLEAVANFH-RCWTZXSCSA-N 0.000 description 2
- FHJQROWZEJFZPO-SRVKXCTJSA-N Pro-Val-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H]1CCCN1 FHJQROWZEJFZPO-SRVKXCTJSA-N 0.000 description 2
- 108010026552 Proteome Proteins 0.000 description 2
- 101100029566 Rattus norvegicus Rabggta gene Proteins 0.000 description 2
- 108091005634 SARS-CoV-2 receptor-binding domains Proteins 0.000 description 2
- 241000427251 Sabaeus Species 0.000 description 2
- FIXILCYTSAUERA-FXQIFTODSA-N Ser-Ala-Arg Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O FIXILCYTSAUERA-FXQIFTODSA-N 0.000 description 2
- IYCBDVBJWDXQRR-FXQIFTODSA-N Ser-Ala-Met Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CCSC)C(O)=O IYCBDVBJWDXQRR-FXQIFTODSA-N 0.000 description 2
- HBZBPFLJNDXRAY-FXQIFTODSA-N Ser-Ala-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O HBZBPFLJNDXRAY-FXQIFTODSA-N 0.000 description 2
- ZXLUWXWISXIFIX-ACZMJKKPSA-N Ser-Asn-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O ZXLUWXWISXIFIX-ACZMJKKPSA-N 0.000 description 2
- WXWDPFVKQRVJBJ-CIUDSAMLSA-N Ser-Asn-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CO)N WXWDPFVKQRVJBJ-CIUDSAMLSA-N 0.000 description 2
- SNNSYBWPPVAXQW-ZLUOBGJFSA-N Ser-Cys-Cys Chemical compound C([C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CS)C(=O)O)N)O SNNSYBWPPVAXQW-ZLUOBGJFSA-N 0.000 description 2
- MPPHJZYXDVDGOF-BWBBJGPYSA-N Ser-Cys-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CS)NC(=O)[C@@H](N)CO MPPHJZYXDVDGOF-BWBBJGPYSA-N 0.000 description 2
- RNMRYWZYFHHOEV-CIUDSAMLSA-N Ser-Gln-Arg Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O RNMRYWZYFHHOEV-CIUDSAMLSA-N 0.000 description 2
- XWCYBVBLJRWOFR-WDSKDSINSA-N Ser-Gln-Gly Chemical compound OC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O XWCYBVBLJRWOFR-WDSKDSINSA-N 0.000 description 2
- SMIDBHKWSYUBRZ-ACZMJKKPSA-N Ser-Glu-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O SMIDBHKWSYUBRZ-ACZMJKKPSA-N 0.000 description 2
- LALNXSXEYFUUDD-GUBZILKMSA-N Ser-Glu-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O LALNXSXEYFUUDD-GUBZILKMSA-N 0.000 description 2
- SNVIOQXAHVORQM-WDSKDSINSA-N Ser-Gly-Gln Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CCC(N)=O)C(O)=O SNVIOQXAHVORQM-WDSKDSINSA-N 0.000 description 2
- MIJWOJAXARLEHA-WDSKDSINSA-N Ser-Gly-Glu Chemical compound OC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(O)=O MIJWOJAXARLEHA-WDSKDSINSA-N 0.000 description 2
- GZFAWAQTEYDKII-YUMQZZPRSA-N Ser-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CO GZFAWAQTEYDKII-YUMQZZPRSA-N 0.000 description 2
- BEAFYHFQTOTVFS-VGDYDELISA-N Ser-Ile-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CO)N BEAFYHFQTOTVFS-VGDYDELISA-N 0.000 description 2
- MOINZPRHJGTCHZ-MMWGEVLESA-N Ser-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CO)N MOINZPRHJGTCHZ-MMWGEVLESA-N 0.000 description 2
- QYSFWUIXDFJUDW-DCAQKATOSA-N Ser-Leu-Arg Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O QYSFWUIXDFJUDW-DCAQKATOSA-N 0.000 description 2
- NLOAIFSWUUFQFR-CIUDSAMLSA-N Ser-Leu-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O NLOAIFSWUUFQFR-CIUDSAMLSA-N 0.000 description 2
- GJFYFGOEWLDQGW-GUBZILKMSA-N Ser-Leu-Gln Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CO)N GJFYFGOEWLDQGW-GUBZILKMSA-N 0.000 description 2
- VZQRNAYURWAEFE-KKUMJFAQSA-N Ser-Leu-Phe Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 VZQRNAYURWAEFE-KKUMJFAQSA-N 0.000 description 2
- NNFMANHDYSVNIO-DCAQKATOSA-N Ser-Lys-Arg Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O NNFMANHDYSVNIO-DCAQKATOSA-N 0.000 description 2
- GVMUJUPXFQFBBZ-GUBZILKMSA-N Ser-Lys-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O GVMUJUPXFQFBBZ-GUBZILKMSA-N 0.000 description 2
- XUDRHBPSPAPDJP-SRVKXCTJSA-N Ser-Lys-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CO XUDRHBPSPAPDJP-SRVKXCTJSA-N 0.000 description 2
- ZSLFCBHEINFXRS-LPEHRKFASA-N Ser-Met-Pro Chemical compound CSCC[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CO)N ZSLFCBHEINFXRS-LPEHRKFASA-N 0.000 description 2
- UGTZYIPOBYXWRW-SRVKXCTJSA-N Ser-Phe-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O UGTZYIPOBYXWRW-SRVKXCTJSA-N 0.000 description 2
- XVWDJUROVRQKAE-KKUMJFAQSA-N Ser-Phe-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CO)CC1=CC=CC=C1 XVWDJUROVRQKAE-KKUMJFAQSA-N 0.000 description 2
- ZKBKUWQVDWWSRI-BZSNNMDCSA-N Ser-Phe-Tyr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O ZKBKUWQVDWWSRI-BZSNNMDCSA-N 0.000 description 2
- QUGRFWPMPVIAPW-IHRRRGAJSA-N Ser-Pro-Phe Chemical compound OC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 QUGRFWPMPVIAPW-IHRRRGAJSA-N 0.000 description 2
- HHJFMHQYEAAOBM-ZLUOBGJFSA-N Ser-Ser-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O HHJFMHQYEAAOBM-ZLUOBGJFSA-N 0.000 description 2
- ILZAUMFXKSIUEF-SRVKXCTJSA-N Ser-Ser-Phe Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 ILZAUMFXKSIUEF-SRVKXCTJSA-N 0.000 description 2
- PYTKULIABVRXSC-BWBBJGPYSA-N Ser-Ser-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PYTKULIABVRXSC-BWBBJGPYSA-N 0.000 description 2
- OLKICIBQRVSQMA-SRVKXCTJSA-N Ser-Ser-Tyr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O OLKICIBQRVSQMA-SRVKXCTJSA-N 0.000 description 2
- SQHKXWODKJDZRC-LKXGYXEUSA-N Ser-Thr-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(O)=O SQHKXWODKJDZRC-LKXGYXEUSA-N 0.000 description 2
- PCJLFYBAQZQOFE-KATARQTJSA-N Ser-Thr-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CO)N)O PCJLFYBAQZQOFE-KATARQTJSA-N 0.000 description 2
- SNXUIBACCONSOH-BWBBJGPYSA-N Ser-Thr-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CO)C(O)=O SNXUIBACCONSOH-BWBBJGPYSA-N 0.000 description 2
- VLMIUSLQONKLDV-HEIBUPTGSA-N Ser-Thr-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O VLMIUSLQONKLDV-HEIBUPTGSA-N 0.000 description 2
- FZNNGIHSIPKFRE-QEJZJMRPSA-N Ser-Trp-Gln Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCC(N)=O)C(O)=O FZNNGIHSIPKFRE-QEJZJMRPSA-N 0.000 description 2
- FVFUOQIYDPAIJR-XIRDDKMYSA-N Ser-Trp-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@H](CO)N FVFUOQIYDPAIJR-XIRDDKMYSA-N 0.000 description 2
- FHXGMDRKJHKLKW-QWRGUYRKSA-N Ser-Tyr-Gly Chemical compound OC[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CC=C(O)C=C1 FHXGMDRKJHKLKW-QWRGUYRKSA-N 0.000 description 2
- PMTWIUBUQRGCSB-FXQIFTODSA-N Ser-Val-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O PMTWIUBUQRGCSB-FXQIFTODSA-N 0.000 description 2
- ANOQEBQWIAYIMV-AEJSXWLSSA-N Ser-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CO)N ANOQEBQWIAYIMV-AEJSXWLSSA-N 0.000 description 2
- JGUWRQWULDWNCM-FXQIFTODSA-N Ser-Val-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O JGUWRQWULDWNCM-FXQIFTODSA-N 0.000 description 2
- XSLXHSYIVPGEER-KZVJFYERSA-N Thr-Ala-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O XSLXHSYIVPGEER-KZVJFYERSA-N 0.000 description 2
- GLQFKOVWXPPFTP-VEVYYDQMSA-N Thr-Arg-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O GLQFKOVWXPPFTP-VEVYYDQMSA-N 0.000 description 2
- UKBSDLHIKIXJKH-HJGDQZAQSA-N Thr-Arg-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O UKBSDLHIKIXJKH-HJGDQZAQSA-N 0.000 description 2
- CEXFELBFVHLYDZ-XGEHTFHBSA-N Thr-Arg-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O CEXFELBFVHLYDZ-XGEHTFHBSA-N 0.000 description 2
- UNURFMVMXLENAZ-KJEVXHAQSA-N Thr-Arg-Tyr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O UNURFMVMXLENAZ-KJEVXHAQSA-N 0.000 description 2
- VIBXMCZWVUOZLA-OLHMAJIHSA-N Thr-Asn-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)N)C(=O)O)N)O VIBXMCZWVUOZLA-OLHMAJIHSA-N 0.000 description 2
- QNJZOAHSYPXTAB-VEVYYDQMSA-N Thr-Asn-Met Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCSC)C(O)=O QNJZOAHSYPXTAB-VEVYYDQMSA-N 0.000 description 2
- LMMDEZPNUTZJAY-GCJQMDKQSA-N Thr-Asp-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O LMMDEZPNUTZJAY-GCJQMDKQSA-N 0.000 description 2
- DCCGCVLVVSAJFK-NUMRIWBASA-N Thr-Asp-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N)O DCCGCVLVVSAJFK-NUMRIWBASA-N 0.000 description 2
- JXKMXEBNZCKSDY-JIOCBJNQSA-N Thr-Asp-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N)O JXKMXEBNZCKSDY-JIOCBJNQSA-N 0.000 description 2
- XDARBNMYXKUFOJ-GSSVUCPTSA-N Thr-Asp-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XDARBNMYXKUFOJ-GSSVUCPTSA-N 0.000 description 2
- DHPPWTOLRWYIDS-XKBZYTNZSA-N Thr-Cys-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCC(O)=O)C(O)=O DHPPWTOLRWYIDS-XKBZYTNZSA-N 0.000 description 2
- RKDFEMGVMMYYNG-WDCWCFNPSA-N Thr-Gln-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O RKDFEMGVMMYYNG-WDCWCFNPSA-N 0.000 description 2
- MQUZMZBFKCHVOB-HJGDQZAQSA-N Thr-Gln-Met Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCSC)C(O)=O MQUZMZBFKCHVOB-HJGDQZAQSA-N 0.000 description 2
- KGKWKSSSQGGYAU-SUSMZKCASA-N Thr-Gln-Thr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N)O KGKWKSSSQGGYAU-SUSMZKCASA-N 0.000 description 2
- RCEHMXVEMNXRIW-IRIUXVKKSA-N Thr-Gln-Tyr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N)O RCEHMXVEMNXRIW-IRIUXVKKSA-N 0.000 description 2
- DKDHTRVDOUZZTP-IFFSRLJSSA-N Thr-Gln-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)[C@@H](C)O)C(O)=O DKDHTRVDOUZZTP-IFFSRLJSSA-N 0.000 description 2
- CQNFRKAKGDSJFR-NUMRIWBASA-N Thr-Glu-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N)O CQNFRKAKGDSJFR-NUMRIWBASA-N 0.000 description 2
- LGNBRHZANHMZHK-NUMRIWBASA-N Thr-Glu-Asp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N)O LGNBRHZANHMZHK-NUMRIWBASA-N 0.000 description 2
- GKWNLDNXMMLRMC-GLLZPBPUSA-N Thr-Glu-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N)O GKWNLDNXMMLRMC-GLLZPBPUSA-N 0.000 description 2
- BNGDYRRHRGOPHX-IFFSRLJSSA-N Thr-Glu-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)[C@@H](C)O)C(O)=O BNGDYRRHRGOPHX-IFFSRLJSSA-N 0.000 description 2
- XFTYVCHLARBHBQ-FOHZUACHSA-N Thr-Gly-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O XFTYVCHLARBHBQ-FOHZUACHSA-N 0.000 description 2
- XPNSAQMEAVSQRD-FBCQKBJTSA-N Thr-Gly-Gly Chemical compound C[C@@H](O)[C@H](N)C(=O)NCC(=O)NCC(O)=O XPNSAQMEAVSQRD-FBCQKBJTSA-N 0.000 description 2
- DJDSEDOKJTZBAR-ZDLURKLDSA-N Thr-Gly-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O DJDSEDOKJTZBAR-ZDLURKLDSA-N 0.000 description 2
- JKGGPMOUIAAJAA-YEPSODPASA-N Thr-Gly-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O JKGGPMOUIAAJAA-YEPSODPASA-N 0.000 description 2
- KRGDDWVBBDLPSJ-CUJWVEQBSA-N Thr-His-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(O)=O KRGDDWVBBDLPSJ-CUJWVEQBSA-N 0.000 description 2
- AMXMBCAXAZUCFA-RHYQMDGZSA-N Thr-Leu-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O AMXMBCAXAZUCFA-RHYQMDGZSA-N 0.000 description 2
- VTVVYQOXJCZVEB-WDCWCFNPSA-N Thr-Leu-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O VTVVYQOXJCZVEB-WDCWCFNPSA-N 0.000 description 2
- RFKVQLIXNVEOMB-WEDXCCLWSA-N Thr-Leu-Gly Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)NCC(=O)O)N)O RFKVQLIXNVEOMB-WEDXCCLWSA-N 0.000 description 2
- FIFDDJFLNVAVMS-RHYQMDGZSA-N Thr-Leu-Met Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(O)=O FIFDDJFLNVAVMS-RHYQMDGZSA-N 0.000 description 2
- YOOAQCZYZHGUAZ-KATARQTJSA-N Thr-Leu-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O YOOAQCZYZHGUAZ-KATARQTJSA-N 0.000 description 2
- IJVNLNRVDUTWDD-MEYUZBJRSA-N Thr-Leu-Tyr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O IJVNLNRVDUTWDD-MEYUZBJRSA-N 0.000 description 2
- JLNMFGCJODTXDH-WEDXCCLWSA-N Thr-Lys-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O JLNMFGCJODTXDH-WEDXCCLWSA-N 0.000 description 2
- UUSQVWOVUYMLJA-PPCPHDFISA-N Thr-Lys-Ile Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O UUSQVWOVUYMLJA-PPCPHDFISA-N 0.000 description 2
- MGJLBZFUXUGMML-VOAKCMCISA-N Thr-Lys-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)O)N)O MGJLBZFUXUGMML-VOAKCMCISA-N 0.000 description 2
- GYUUYCIXELGTJS-MEYUZBJRSA-N Thr-Phe-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N)O GYUUYCIXELGTJS-MEYUZBJRSA-N 0.000 description 2
- ABWNZPOIUJMNKT-IXOXFDKPSA-N Thr-Phe-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O ABWNZPOIUJMNKT-IXOXFDKPSA-N 0.000 description 2
- MEBDIIKMUUNBSB-RPTUDFQQSA-N Thr-Phe-Tyr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O MEBDIIKMUUNBSB-RPTUDFQQSA-N 0.000 description 2
- NWECYMJLJGCBOD-UNQGMJICSA-N Thr-Phe-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(O)=O NWECYMJLJGCBOD-UNQGMJICSA-N 0.000 description 2
- MXDOAJQRJBMGMO-FJXKBIBVSA-N Thr-Pro-Gly Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O MXDOAJQRJBMGMO-FJXKBIBVSA-N 0.000 description 2
- DEGCBBCMYWNJNA-RHYQMDGZSA-N Thr-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)[C@@H](C)O DEGCBBCMYWNJNA-RHYQMDGZSA-N 0.000 description 2
- OLFOOYQTTQSSRK-UNQGMJICSA-N Thr-Pro-Phe Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 OLFOOYQTTQSSRK-UNQGMJICSA-N 0.000 description 2
- DOBIBIXIHJKVJF-XKBZYTNZSA-N Thr-Ser-Gln Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(N)=O DOBIBIXIHJKVJF-XKBZYTNZSA-N 0.000 description 2
- NDZYTIMDOZMECO-SHGPDSBTSA-N Thr-Thr-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O NDZYTIMDOZMECO-SHGPDSBTSA-N 0.000 description 2
- QYDKSNXSBXZPFK-ZJDVBMNYSA-N Thr-Thr-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O QYDKSNXSBXZPFK-ZJDVBMNYSA-N 0.000 description 2
- AAZOYLQUEQRUMZ-GSSVUCPTSA-N Thr-Thr-Asn Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(N)=O AAZOYLQUEQRUMZ-GSSVUCPTSA-N 0.000 description 2
- NHQVWACSJZJCGJ-FLBSBUHZSA-N Thr-Thr-Ile Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O NHQVWACSJZJCGJ-FLBSBUHZSA-N 0.000 description 2
- BBPCSGKKPJUYRB-UVOCVTCTSA-N Thr-Thr-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O BBPCSGKKPJUYRB-UVOCVTCTSA-N 0.000 description 2
- ZMYCLHFLHRVOEA-HEIBUPTGSA-N Thr-Thr-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O ZMYCLHFLHRVOEA-HEIBUPTGSA-N 0.000 description 2
- DKNYWNPPSZCWCJ-GBALPHGKSA-N Thr-Trp-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CS)C(=O)O)N)O DKNYWNPPSZCWCJ-GBALPHGKSA-N 0.000 description 2
- UMFLBPIPAJMNIM-LYARXQMPSA-N Thr-Trp-Phe Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CC3=CC=CC=C3)C(=O)O)N)O UMFLBPIPAJMNIM-LYARXQMPSA-N 0.000 description 2
- LXXCHJKHJYRMIY-FQPOAREZSA-N Thr-Tyr-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C)C(O)=O LXXCHJKHJYRMIY-FQPOAREZSA-N 0.000 description 2
- NJGMALCNYAMYCB-JRQIVUDYSA-N Thr-Tyr-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(O)=O NJGMALCNYAMYCB-JRQIVUDYSA-N 0.000 description 2
- QGVBFDIREUUSHX-IFFSRLJSSA-N Thr-Val-Gln Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O QGVBFDIREUUSHX-IFFSRLJSSA-N 0.000 description 2
- MNYNCKZAEIAONY-XGEHTFHBSA-N Thr-Val-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O MNYNCKZAEIAONY-XGEHTFHBSA-N 0.000 description 2
- DTQVDTLACAAQTR-UHFFFAOYSA-N Trifluoroacetic acid Chemical compound OC(=O)C(F)(F)F DTQVDTLACAAQTR-UHFFFAOYSA-N 0.000 description 2
- ADBFWLXCCKIXBQ-XIRDDKMYSA-N Trp-Asn-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N ADBFWLXCCKIXBQ-XIRDDKMYSA-N 0.000 description 2
- GTNCSPKYWCJZAC-XIRDDKMYSA-N Trp-Asp-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N GTNCSPKYWCJZAC-XIRDDKMYSA-N 0.000 description 2
- NKUIXQOJUAEIET-AQZXSJQPSA-N Trp-Asp-Thr Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@H](O)C)C(O)=O)=CNC2=C1 NKUIXQOJUAEIET-AQZXSJQPSA-N 0.000 description 2
- OFCKFBGRYHOKFP-IHPCNDPISA-N Trp-Asp-Tyr Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC3=CC=C(C=C3)O)C(=O)O)N OFCKFBGRYHOKFP-IHPCNDPISA-N 0.000 description 2
- RRVUOLRWIZXBRQ-IHPCNDPISA-N Trp-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N RRVUOLRWIZXBRQ-IHPCNDPISA-N 0.000 description 2
- UKWSFUSPGPBJGU-VFAJRCTISA-N Trp-Leu-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N)O UKWSFUSPGPBJGU-VFAJRCTISA-N 0.000 description 2
- GNCPKOZDOCQRAF-BPUTZDHNSA-N Trp-Ser-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N GNCPKOZDOCQRAF-BPUTZDHNSA-N 0.000 description 2
- ABRICLFKFRFDKS-IHPCNDPISA-N Trp-Ser-Tyr Chemical compound C([C@H](NC(=O)[C@H](CO)NC(=O)[C@H](CC=1C2=CC=CC=C2NC=1)N)C(O)=O)C1=CC=C(O)C=C1 ABRICLFKFRFDKS-IHPCNDPISA-N 0.000 description 2
- PALLCTDPFINNMM-JQHSSLGASA-N Trp-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)N PALLCTDPFINNMM-JQHSSLGASA-N 0.000 description 2
- ZWZOCUWOXSDYFZ-CQDKDKBSSA-N Tyr-Ala-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 ZWZOCUWOXSDYFZ-CQDKDKBSSA-N 0.000 description 2
- HSVPZJLMPLMPOX-BPNCWPANSA-N Tyr-Arg-Ala Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(O)=O HSVPZJLMPLMPOX-BPNCWPANSA-N 0.000 description 2
- GFZQWWDXJVGEMW-ULQDDVLXSA-N Tyr-Arg-Lys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCCN)C(=O)O)N)O GFZQWWDXJVGEMW-ULQDDVLXSA-N 0.000 description 2
- CKKFTIQYURNSEI-IHRRRGAJSA-N Tyr-Asn-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 CKKFTIQYURNSEI-IHRRRGAJSA-N 0.000 description 2
- BVWADTBVGZHSLW-IHRRRGAJSA-N Tyr-Asn-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC1=CC=C(C=C1)O)N BVWADTBVGZHSLW-IHRRRGAJSA-N 0.000 description 2
- NSTPFWRAIDTNGH-BZSNNMDCSA-N Tyr-Asn-Tyr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O NSTPFWRAIDTNGH-BZSNNMDCSA-N 0.000 description 2
- KLGFILUOTCBNLJ-IHRRRGAJSA-N Tyr-Cys-Arg Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N)O KLGFILUOTCBNLJ-IHRRRGAJSA-N 0.000 description 2
- BODHJXJNRVRKFA-BZSNNMDCSA-N Tyr-Cys-Tyr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O BODHJXJNRVRKFA-BZSNNMDCSA-N 0.000 description 2
- CWQZAUYFWRLITN-AVGNSLFASA-N Tyr-Gln-Cys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CS)C(=O)O)N)O CWQZAUYFWRLITN-AVGNSLFASA-N 0.000 description 2
- XQYHLZNPOTXRMQ-KKUMJFAQSA-N Tyr-Glu-Arg Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O XQYHLZNPOTXRMQ-KKUMJFAQSA-N 0.000 description 2
- SLCSPPCQWUHPPO-JYJNAYRXSA-N Tyr-Glu-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 SLCSPPCQWUHPPO-JYJNAYRXSA-N 0.000 description 2
- ZRPLVTZTKPPSBT-AVGNSLFASA-N Tyr-Glu-Ser Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O ZRPLVTZTKPPSBT-AVGNSLFASA-N 0.000 description 2
- CNLKDWSAORJEMW-KWQFWETISA-N Tyr-Gly-Ala Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(=O)N[C@@H](C)C(O)=O CNLKDWSAORJEMW-KWQFWETISA-N 0.000 description 2
- PMDWYLVWHRTJIW-STQMWFEESA-N Tyr-Gly-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC1=CC=C(O)C=C1 PMDWYLVWHRTJIW-STQMWFEESA-N 0.000 description 2
- BXPOOVDVGWEXDU-WZLNRYEVSA-N Tyr-Ile-Thr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O BXPOOVDVGWEXDU-WZLNRYEVSA-N 0.000 description 2
- GULIUBBXCYPDJU-CQDKDKBSSA-N Tyr-Leu-Ala Chemical compound [O-]C(=O)[C@H](C)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H]([NH3+])CC1=CC=C(O)C=C1 GULIUBBXCYPDJU-CQDKDKBSSA-N 0.000 description 2
- QHLIUFUEUDFAOT-MGHWNKPDSA-N Tyr-Leu-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC1=CC=C(C=C1)O)N QHLIUFUEUDFAOT-MGHWNKPDSA-N 0.000 description 2
- PRONOHBTMLNXCZ-BZSNNMDCSA-N Tyr-Leu-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 PRONOHBTMLNXCZ-BZSNNMDCSA-N 0.000 description 2
- WDGDKHLSDIOXQC-ACRUOGEOSA-N Tyr-Leu-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=C(O)C=C1 WDGDKHLSDIOXQC-ACRUOGEOSA-N 0.000 description 2
- CDKZJGMPZHPAJC-ULQDDVLXSA-N Tyr-Leu-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 CDKZJGMPZHPAJC-ULQDDVLXSA-N 0.000 description 2
- HSBZWINKRYZCSQ-KKUMJFAQSA-N Tyr-Lys-Asp Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O HSBZWINKRYZCSQ-KKUMJFAQSA-N 0.000 description 2
- BYAKMYBZADCNMN-JYJNAYRXSA-N Tyr-Lys-Gln Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(O)=O BYAKMYBZADCNMN-JYJNAYRXSA-N 0.000 description 2
- MXFPBNFKVBHIRW-BZSNNMDCSA-N Tyr-Lys-His Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N)O MXFPBNFKVBHIRW-BZSNNMDCSA-N 0.000 description 2
- PGEFRHBWGOJPJT-KKUMJFAQSA-N Tyr-Lys-Ser Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O PGEFRHBWGOJPJT-KKUMJFAQSA-N 0.000 description 2
- UBKKNELWDCBNCF-STQMWFEESA-N Tyr-Met-Gly Chemical compound OC(=O)CNC(=O)[C@H](CCSC)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 UBKKNELWDCBNCF-STQMWFEESA-N 0.000 description 2
- OFHKXNKJXURPSY-ULQDDVLXSA-N Tyr-Met-Leu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(O)=O OFHKXNKJXURPSY-ULQDDVLXSA-N 0.000 description 2
- WTTRJMAZPDHPGS-KKXDTOCCSA-N Tyr-Phe-Ala Chemical compound C[C@H](NC(=O)[C@H](Cc1ccccc1)NC(=O)[C@@H](N)Cc1ccc(O)cc1)C(O)=O WTTRJMAZPDHPGS-KKXDTOCCSA-N 0.000 description 2
- WPRVVBVWIUWLOH-UFYCRDLUSA-N Tyr-Phe-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](CC2=CC=C(C=C2)O)N WPRVVBVWIUWLOH-UFYCRDLUSA-N 0.000 description 2
- ARMNWLJYHCOSHE-KKUMJFAQSA-N Tyr-Pro-Gln Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O ARMNWLJYHCOSHE-KKUMJFAQSA-N 0.000 description 2
- SZEIFUXUTBBQFQ-STQMWFEESA-N Tyr-Pro-Gly Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O SZEIFUXUTBBQFQ-STQMWFEESA-N 0.000 description 2
- BIWVVOHTKDLRMP-ULQDDVLXSA-N Tyr-Pro-Leu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O BIWVVOHTKDLRMP-ULQDDVLXSA-N 0.000 description 2
- VXFXIBCCVLJCJT-JYJNAYRXSA-N Tyr-Pro-Pro Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N1CCC[C@H]1C(=O)N1CCC[C@H]1C(O)=O VXFXIBCCVLJCJT-JYJNAYRXSA-N 0.000 description 2
- IEWKKXZRJLTIOV-AVGNSLFASA-N Tyr-Ser-Gln Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(O)=O IEWKKXZRJLTIOV-AVGNSLFASA-N 0.000 description 2
- BIVIUZRBCAUNPW-JRQIVUDYSA-N Tyr-Thr-Asn Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(O)=O BIVIUZRBCAUNPW-JRQIVUDYSA-N 0.000 description 2
- LVFZXRQQQDTBQH-IRIUXVKKSA-N Tyr-Thr-Glu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O LVFZXRQQQDTBQH-IRIUXVKKSA-N 0.000 description 2
- CLEGSEJVGBYZBJ-MEYUZBJRSA-N Tyr-Thr-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H]([C@H](O)C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 CLEGSEJVGBYZBJ-MEYUZBJRSA-N 0.000 description 2
- KLQPIEVIKOQRAW-IZPVPAKOSA-N Tyr-Thr-Thr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N)O KLQPIEVIKOQRAW-IZPVPAKOSA-N 0.000 description 2
- NVJCMGGZHOJNBU-UFYCRDLUSA-N Tyr-Val-Phe Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)N NVJCMGGZHOJNBU-UFYCRDLUSA-N 0.000 description 2
- YFOCMOVJBQDBCE-NRPADANISA-N Val-Ala-Glu Chemical compound C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N YFOCMOVJBQDBCE-NRPADANISA-N 0.000 description 2
- LTFLDDDGWOVIHY-NAKRPEOUSA-N Val-Ala-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](C)NC(=O)[C@H](C(C)C)N LTFLDDDGWOVIHY-NAKRPEOUSA-N 0.000 description 2
- RUCNAYOMFXRIKJ-DCAQKATOSA-N Val-Ala-Lys Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCCN RUCNAYOMFXRIKJ-DCAQKATOSA-N 0.000 description 2
- AZSHAZJLOZQYAY-FXQIFTODSA-N Val-Ala-Ser Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O AZSHAZJLOZQYAY-FXQIFTODSA-N 0.000 description 2
- NMANTMWGQZASQN-QXEWZRGKSA-N Val-Arg-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N NMANTMWGQZASQN-QXEWZRGKSA-N 0.000 description 2
- JIODCDXKCJRMEH-NHCYSSNCSA-N Val-Arg-Gln Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N JIODCDXKCJRMEH-NHCYSSNCSA-N 0.000 description 2
- XPYNXORPPVTVQK-SRVKXCTJSA-N Val-Arg-Met Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCSC)C(=O)O)N XPYNXORPPVTVQK-SRVKXCTJSA-N 0.000 description 2
- ZMDCGGKHRKNWKD-LAEOZQHASA-N Val-Asn-Glu Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N ZMDCGGKHRKNWKD-LAEOZQHASA-N 0.000 description 2
- LIQJSDDOULTANC-QSFUFRPTSA-N Val-Asn-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](C(C)C)N LIQJSDDOULTANC-QSFUFRPTSA-N 0.000 description 2
- OGNMURQZFMHFFD-NHCYSSNCSA-N Val-Asn-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N OGNMURQZFMHFFD-NHCYSSNCSA-N 0.000 description 2
- JLFKWDAZBRYCGX-ZKWXMUAHSA-N Val-Asn-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CO)C(=O)O)N JLFKWDAZBRYCGX-ZKWXMUAHSA-N 0.000 description 2
- ISERLACIZUGCDX-ZKWXMUAHSA-N Val-Asp-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](C(C)C)N ISERLACIZUGCDX-ZKWXMUAHSA-N 0.000 description 2
- XLDYBRXERHITNH-QSFUFRPTSA-N Val-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)C(C)C XLDYBRXERHITNH-QSFUFRPTSA-N 0.000 description 2
- LHADRQBREKTRLR-DCAQKATOSA-N Val-Cys-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](C(C)C)N LHADRQBREKTRLR-DCAQKATOSA-N 0.000 description 2
- XIFAHCUNWWKUDE-DCAQKATOSA-N Val-Cys-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CCCCN)C(=O)O)N XIFAHCUNWWKUDE-DCAQKATOSA-N 0.000 description 2
- XJFXZQKJQGYFMM-GUBZILKMSA-N Val-Cys-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](C(C)C)C(=O)O)N XJFXZQKJQGYFMM-GUBZILKMSA-N 0.000 description 2
- XEYUMGGWQCIWAR-XVKPBYJWSA-N Val-Gln-Gly Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)NCC(=O)O)N XEYUMGGWQCIWAR-XVKPBYJWSA-N 0.000 description 2
- VFOHXOLPLACADK-GVXVVHGQSA-N Val-Gln-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](C(C)C)N VFOHXOLPLACADK-GVXVVHGQSA-N 0.000 description 2
- PGBJAZDAEWPDAA-NHCYSSNCSA-N Val-Gln-Met Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCSC)C(=O)O)N PGBJAZDAEWPDAA-NHCYSSNCSA-N 0.000 description 2
- CVIXTAITYJQMPE-LAEOZQHASA-N Val-Glu-Asn Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O CVIXTAITYJQMPE-LAEOZQHASA-N 0.000 description 2
- SZTTYWIUCGSURQ-AUTRQRHGSA-N Val-Glu-Glu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O SZTTYWIUCGSURQ-AUTRQRHGSA-N 0.000 description 2
- XWYUBUYQMOUFRQ-IFFSRLJSSA-N Val-Glu-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](C(C)C)N)O XWYUBUYQMOUFRQ-IFFSRLJSSA-N 0.000 description 2
- GMOLURHJBLOBFW-ONGXEEELSA-N Val-Gly-His Chemical compound CC(C)[C@@H](C(=O)NCC(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N GMOLURHJBLOBFW-ONGXEEELSA-N 0.000 description 2
- YTPLVNUZZOBFFC-SCZZXKLOSA-N Val-Gly-Pro Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N1CCC[C@@H]1C(O)=O YTPLVNUZZOBFFC-SCZZXKLOSA-N 0.000 description 2
- ZTKGDWOUYRRAOQ-ULQDDVLXSA-N Val-His-Phe Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CC2=CC=CC=C2)C(=O)O)N ZTKGDWOUYRRAOQ-ULQDDVLXSA-N 0.000 description 2
- UKEVLVBHRKWECS-LSJOCFKGSA-N Val-Ile-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](C(C)C)N UKEVLVBHRKWECS-LSJOCFKGSA-N 0.000 description 2
- JZWZACGUZVCQPS-RNJOBUHISA-N Val-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](C(C)C)N JZWZACGUZVCQPS-RNJOBUHISA-N 0.000 description 2
- APQIVBCUIUDSMB-OSUNSFLBSA-N Val-Ile-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](C(C)C)N APQIVBCUIUDSMB-OSUNSFLBSA-N 0.000 description 2
- BZOSBRIDWSSTFN-AVGNSLFASA-N Val-Leu-Met Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](C(C)C)N BZOSBRIDWSSTFN-AVGNSLFASA-N 0.000 description 2
- ZHQWPWQNVRCXAX-XQQFMLRXSA-N Val-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](C(C)C)N ZHQWPWQNVRCXAX-XQQFMLRXSA-N 0.000 description 2
- BTWMICVCQLKKNR-DCAQKATOSA-N Val-Leu-Ser Chemical compound CC(C)[C@H]([NH3+])C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C([O-])=O BTWMICVCQLKKNR-DCAQKATOSA-N 0.000 description 2
- SYSWVVCYSXBVJG-RHYQMDGZSA-N Val-Leu-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](C(C)C)N)O SYSWVVCYSXBVJG-RHYQMDGZSA-N 0.000 description 2
- RWOGENDAOGMHLX-DCAQKATOSA-N Val-Lys-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](C(C)C)N RWOGENDAOGMHLX-DCAQKATOSA-N 0.000 description 2
- GVJUTBOZZBTBIG-AVGNSLFASA-N Val-Lys-Arg Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N GVJUTBOZZBTBIG-AVGNSLFASA-N 0.000 description 2
- JAKHAONCJJZVHT-DCAQKATOSA-N Val-Lys-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)O)N JAKHAONCJJZVHT-DCAQKATOSA-N 0.000 description 2
- WHVSJHJTMUHYBT-SRVKXCTJSA-N Val-Met-Met Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCSC)C(=O)O)N WHVSJHJTMUHYBT-SRVKXCTJSA-N 0.000 description 2
- RSGHLMMKXJGCMK-JYJNAYRXSA-N Val-Met-Phe Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N RSGHLMMKXJGCMK-JYJNAYRXSA-N 0.000 description 2
- MJFSRZZJQWZHFQ-SRVKXCTJSA-N Val-Met-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C(C)C)C(=O)O)N MJFSRZZJQWZHFQ-SRVKXCTJSA-N 0.000 description 2
- AIWLHFZYOUUJGB-UFYCRDLUSA-N Val-Phe-Tyr Chemical compound C([C@H](NC(=O)[C@@H](N)C(C)C)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=CC=C1 AIWLHFZYOUUJGB-UFYCRDLUSA-N 0.000 description 2
- GQMNEJMFMCJJTD-NHCYSSNCSA-N Val-Pro-Gln Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O GQMNEJMFMCJJTD-NHCYSSNCSA-N 0.000 description 2
- QSPOLEBZTMESFY-SRVKXCTJSA-N Val-Pro-Val Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O QSPOLEBZTMESFY-SRVKXCTJSA-N 0.000 description 2
- PQSNETRGCRUOGP-KKHAAJSZSA-N Val-Thr-Asn Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(N)=O PQSNETRGCRUOGP-KKHAAJSZSA-N 0.000 description 2
- PDDJTOSAVNRJRH-UNQGMJICSA-N Val-Thr-Phe Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](C(C)C)N)O PDDJTOSAVNRJRH-UNQGMJICSA-N 0.000 description 2
- POFQRHFHYPSCOI-FHWLQOOXSA-N Val-Trp-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CC3=CN=CN3)C(=O)O)N POFQRHFHYPSCOI-FHWLQOOXSA-N 0.000 description 2
- JXCOEPXCBVCTRD-JYJNAYRXSA-N Val-Tyr-Arg Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N JXCOEPXCBVCTRD-JYJNAYRXSA-N 0.000 description 2
- CFIBZQOLUDURST-IHRRRGAJSA-N Val-Tyr-Cys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CS)C(=O)O)N CFIBZQOLUDURST-IHRRRGAJSA-N 0.000 description 2
- JXWGBRRVTRAZQA-ULQDDVLXSA-N Val-Tyr-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](C(C)C)N JXWGBRRVTRAZQA-ULQDDVLXSA-N 0.000 description 2
- ZLNYBMWGPOKSLW-LSJOCFKGSA-N Val-Val-Asp Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O ZLNYBMWGPOKSLW-LSJOCFKGSA-N 0.000 description 2
- XNLUVJPMPAZHCY-JYJNAYRXSA-N Val-Val-Phe Chemical compound CC(C)[C@H]([NH3+])C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C([O-])=O)CC1=CC=CC=C1 XNLUVJPMPAZHCY-JYJNAYRXSA-N 0.000 description 2
- JSOXWWFKRJKTMT-WOPDTQHZSA-N Val-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N JSOXWWFKRJKTMT-WOPDTQHZSA-N 0.000 description 2
- LLJLBRRXKZTTRD-GUBZILKMSA-N Val-Val-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(=O)O)N LLJLBRRXKZTTRD-GUBZILKMSA-N 0.000 description 2
- JVGDAEKKZKKZFO-RCWTZXSCSA-N Val-Val-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](C(C)C)N)O JVGDAEKKZKKZFO-RCWTZXSCSA-N 0.000 description 2
- 108010081404 acein-2 Proteins 0.000 description 2
- 150000007513 acids Chemical class 0.000 description 2
- 230000004913 activation Effects 0.000 description 2
- 230000002411 adverse Effects 0.000 description 2
- 108010069020 alanyl-prolyl-glycine Proteins 0.000 description 2
- 108010045023 alanyl-prolyl-tyrosine Proteins 0.000 description 2
- 230000002152 alkylating effect Effects 0.000 description 2
- 208000026935 allergic disease Diseases 0.000 description 2
- 238000010171 animal model Methods 0.000 description 2
- OHDRQQURAXLVGJ-HLVWOLMTSA-N azane;(2e)-3-ethyl-2-[(e)-(3-ethyl-6-sulfo-1,3-benzothiazol-2-ylidene)hydrazinylidene]-1,3-benzothiazole-6-sulfonic acid Chemical compound [NH4+].[NH4+].S/1C2=CC(S([O-])(=O)=O)=CC=C2N(CC)C\1=N/N=C1/SC2=CC(S([O-])(=O)=O)=CC=C2N1CC OHDRQQURAXLVGJ-HLVWOLMTSA-N 0.000 description 2
- 230000005540 biological transmission Effects 0.000 description 2
- 239000006143 cell culture medium Substances 0.000 description 2
- 230000008859 change Effects 0.000 description 2
- 238000012512 characterization method Methods 0.000 description 2
- 238000002983 circular dichroism Methods 0.000 description 2
- 150000001875 compounds Chemical class 0.000 description 2
- 238000010276 construction Methods 0.000 description 2
- 239000000356 contaminant Substances 0.000 description 2
- 230000037029 cross reaction Effects 0.000 description 2
- 239000013078 crystal Substances 0.000 description 2
- 108010069495 cysteinyltyrosine Proteins 0.000 description 2
- 238000011161 development Methods 0.000 description 2
- 108010054813 diprotin B Proteins 0.000 description 2
- 229940079593 drug Drugs 0.000 description 2
- -1 e.g. Substances 0.000 description 2
- 239000012091 fetal bovine serum Substances 0.000 description 2
- 230000002068 genetic effect Effects 0.000 description 2
- 108010078144 glutaminyl-glycine Proteins 0.000 description 2
- 108010079547 glutamylmethionine Proteins 0.000 description 2
- HPAIKDPJURGQLN-UHFFFAOYSA-N glycyl-L-histidyl-L-phenylalanine Natural products C=1C=CC=CC=1CC(C(O)=O)NC(=O)C(NC(=O)CN)CC1=CN=CN1 HPAIKDPJURGQLN-UHFFFAOYSA-N 0.000 description 2
- 108010000434 glycyl-alanyl-leucine Proteins 0.000 description 2
- XKUKSGPZAADMRA-UHFFFAOYSA-N glycyl-glycyl-glycine Natural products NCC(=O)NCC(=O)NCC(O)=O XKUKSGPZAADMRA-UHFFFAOYSA-N 0.000 description 2
- 108010077435 glycyl-phenylalanyl-glycine Proteins 0.000 description 2
- 108010059898 glycyl-tyrosyl-lysine Proteins 0.000 description 2
- 108010020688 glycylhistidine Proteins 0.000 description 2
- HNDVDQJCIGZPNO-UHFFFAOYSA-N histidine Natural products OC(=O)C(N)CC1=CN=CN1 HNDVDQJCIGZPNO-UHFFFAOYSA-N 0.000 description 2
- 230000009610 hypersensitivity Effects 0.000 description 2
- 210000004201 immune sera Anatomy 0.000 description 2
- 229940042743 immune sera Drugs 0.000 description 2
- 230000006872 improvement Effects 0.000 description 2
- 229940031551 inactivated vaccine Drugs 0.000 description 2
- 108010027338 isoleucylcysteine Proteins 0.000 description 2
- 230000000670 limiting effect Effects 0.000 description 2
- 210000004072 lung Anatomy 0.000 description 2
- 108010025153 lysyl-alanyl-alanine Proteins 0.000 description 2
- 230000014759 maintenance of location Effects 0.000 description 2
- 108020004999 messenger RNA Proteins 0.000 description 2
- 108010090114 methionyl-tyrosyl-lysine Proteins 0.000 description 2
- 238000010172 mouse model Methods 0.000 description 2
- 102000039446 nucleic acids Human genes 0.000 description 2
- 108020004707 nucleic acids Proteins 0.000 description 2
- 108010082795 phenylalanyl-arginyl-arginine Proteins 0.000 description 2
- 108010064486 phenylalanyl-leucyl-valine Proteins 0.000 description 2
- 108010073101 phenylalanylleucine Proteins 0.000 description 2
- 239000000902 placebo Substances 0.000 description 2
- 229940068196 placebo Drugs 0.000 description 2
- 238000005498 polishing Methods 0.000 description 2
- 108010004914 prolylarginine Proteins 0.000 description 2
- 230000001737 promoting effect Effects 0.000 description 2
- 230000000069 prophylactic effect Effects 0.000 description 2
- 230000003362 replicative effect Effects 0.000 description 2
- 239000007790 solid phase Substances 0.000 description 2
- 238000001179 sorption measurement Methods 0.000 description 2
- 229940031626 subunit vaccine Drugs 0.000 description 2
- 238000004885 tandem mass spectrometry Methods 0.000 description 2
- 108010072986 threonyl-seryl-lysine Proteins 0.000 description 2
- 239000013638 trimer Substances 0.000 description 2
- 229940031418 trivalent vaccine Drugs 0.000 description 2
- 108010015666 tryptophyl-leucyl-glutamic acid Proteins 0.000 description 2
- 108010035534 tyrosyl-leucyl-alanine Proteins 0.000 description 2
- 229940125575 vaccine candidate Drugs 0.000 description 2
- 239000013598 vector Substances 0.000 description 2
- 230000029812 viral genome replication Effects 0.000 description 2
- OZFAFGSSMRRTDW-UHFFFAOYSA-N (2,4-dichlorophenyl) benzenesulfonate Chemical compound ClC1=CC(Cl)=CC=C1OS(=O)(=O)C1=CC=CC=C1 OZFAFGSSMRRTDW-UHFFFAOYSA-N 0.000 description 1
- GJLXVWOMRRWCIB-MERZOTPQSA-N (2S)-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-acetamido-5-(diaminomethylideneamino)pentanoyl]amino]-3-(4-hydroxyphenyl)propanoyl]amino]-3-(4-hydroxyphenyl)propanoyl]amino]-5-(diaminomethylideneamino)pentanoyl]amino]-3-(1H-indol-3-yl)propanoyl]amino]-6-aminohexanoyl]amino]-6-aminohexanoyl]amino]-6-aminohexanoyl]amino]-6-aminohexanoyl]amino]-6-aminohexanoyl]amino]-6-aminohexanoyl]amino]-6-aminohexanamide Chemical compound C([C@H](NC(=O)[C@H](CCCN=C(N)N)NC(=O)C)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(N)=O)C1=CC=C(O)C=C1 GJLXVWOMRRWCIB-MERZOTPQSA-N 0.000 description 1
- CWFMWBHMIMNZLN-NAKRPEOUSA-N (2s)-1-[(2s)-2-[[(2s,3s)-2-amino-3-methylpentanoyl]amino]propanoyl]pyrrolidine-2-carboxylic acid Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C)C(=O)N1CCC[C@H]1C(O)=O CWFMWBHMIMNZLN-NAKRPEOUSA-N 0.000 description 1
- NTUPOKHATNSWCY-PMPSAXMXSA-N (2s)-2-[[(2s)-1-[(2r)-2-amino-3-phenylpropanoyl]pyrrolidine-2-carbonyl]amino]-5-(diaminomethylideneamino)pentanoic acid Chemical compound C([C@@H](N)C(=O)N1[C@@H](CCC1)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O)C1=CC=CC=C1 NTUPOKHATNSWCY-PMPSAXMXSA-N 0.000 description 1
- INOZZBHURUDQQR-AJNGGQMLSA-N (2s)-2-[[(2s)-2-[[(2s)-2-[[(2s)-2-amino-3-(1h-imidazol-5-yl)propanoyl]amino]-3-carboxypropanoyl]amino]-4-carboxybutanoyl]amino]-4-methylpentanoic acid Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC1=CN=CN1 INOZZBHURUDQQR-AJNGGQMLSA-N 0.000 description 1
- PSBDWGZCVUAZQS-UHFFFAOYSA-N (dimethylsulfonio)acetate Chemical compound C[S+](C)CC([O-])=O PSBDWGZCVUAZQS-UHFFFAOYSA-N 0.000 description 1
- JKMHFZQWWAIEOD-UHFFFAOYSA-N 2-[4-(2-hydroxyethyl)piperazin-1-yl]ethanesulfonic acid Chemical compound OCC[NH+]1CCN(CCS([O-])(=O)=O)CC1 JKMHFZQWWAIEOD-UHFFFAOYSA-N 0.000 description 1
- DQVAZKGVGKHQDS-UHFFFAOYSA-N 2-[[1-[2-[(2-amino-4-methylpentanoyl)amino]-4-methylpentanoyl]pyrrolidine-2-carbonyl]amino]-4-methylpentanoic acid Chemical compound CC(C)CC(N)C(=O)NC(CC(C)C)C(=O)N1CCCC1C(=O)NC(CC(C)C)C(O)=O DQVAZKGVGKHQDS-UHFFFAOYSA-N 0.000 description 1
- YRNWIFYIFSBPAU-UHFFFAOYSA-N 4-[4-(dimethylamino)phenyl]-n,n-dimethylaniline Chemical compound C1=CC(N(C)C)=CC=C1C1=CC=C(N(C)C)C=C1 YRNWIFYIFSBPAU-UHFFFAOYSA-N 0.000 description 1
- FWMNVWWHGCHHJJ-SKKKGAJSSA-N 4-amino-1-[(2r)-6-amino-2-[[(2r)-2-[[(2r)-2-[[(2r)-2-amino-3-phenylpropanoyl]amino]-3-phenylpropanoyl]amino]-4-methylpentanoyl]amino]hexanoyl]piperidine-4-carboxylic acid Chemical compound C([C@H](C(=O)N[C@H](CC(C)C)C(=O)N[C@H](CCCCN)C(=O)N1CCC(N)(CC1)C(O)=O)NC(=O)[C@H](N)CC=1C=CC=CC=1)C1=CC=CC=C1 FWMNVWWHGCHHJJ-SKKKGAJSSA-N 0.000 description 1
- HHGYNJRJIINWAK-FXQIFTODSA-N Ala-Ala-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N HHGYNJRJIINWAK-FXQIFTODSA-N 0.000 description 1
- DVWVZSJAYIJZFI-FXQIFTODSA-N Ala-Arg-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(O)=O DVWVZSJAYIJZFI-FXQIFTODSA-N 0.000 description 1
- SKHCUBQVZJHOFM-NAKRPEOUSA-N Ala-Arg-Ile Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O SKHCUBQVZJHOFM-NAKRPEOUSA-N 0.000 description 1
- JAMAWBXXKFGFGX-KZVJFYERSA-N Ala-Arg-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O JAMAWBXXKFGFGX-KZVJFYERSA-N 0.000 description 1
- YIQAOPNCIJVKDN-XKNYDFJKSA-N Ala-Asn-Asp-Pro Chemical compound C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(O)=O YIQAOPNCIJVKDN-XKNYDFJKSA-N 0.000 description 1
- NXSFUECZFORGOG-CIUDSAMLSA-N Ala-Asn-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O NXSFUECZFORGOG-CIUDSAMLSA-N 0.000 description 1
- JYEBJTDTPNKQJG-FXQIFTODSA-N Ala-Asn-Met Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCSC)C(=O)O)N JYEBJTDTPNKQJG-FXQIFTODSA-N 0.000 description 1
- XCVRVWZTXPCYJT-BIIVOSGPSA-N Ala-Asn-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N1CCC[C@@H]1C(=O)O)N XCVRVWZTXPCYJT-BIIVOSGPSA-N 0.000 description 1
- FXKNPWNXPQZLES-ZLUOBGJFSA-N Ala-Asn-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O FXKNPWNXPQZLES-ZLUOBGJFSA-N 0.000 description 1
- GORKKVHIBWAQHM-GCJQMDKQSA-N Ala-Asn-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GORKKVHIBWAQHM-GCJQMDKQSA-N 0.000 description 1
- XQJAFSDFQZPYCU-UWJYBYFXSA-N Ala-Asn-Tyr Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N XQJAFSDFQZPYCU-UWJYBYFXSA-N 0.000 description 1
- BUDNAJYVCUHLSV-ZLUOBGJFSA-N Ala-Asp-Ser Chemical compound C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O BUDNAJYVCUHLSV-ZLUOBGJFSA-N 0.000 description 1
- BTYTYHBSJKQBQA-GCJQMDKQSA-N Ala-Asp-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](C)N)O BTYTYHBSJKQBQA-GCJQMDKQSA-N 0.000 description 1
- IKKVASZHTMKJIR-ZKWXMUAHSA-N Ala-Asp-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O IKKVASZHTMKJIR-ZKWXMUAHSA-N 0.000 description 1
- AWAXZRDKUHOPBO-GUBZILKMSA-N Ala-Gln-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCCN)C(O)=O AWAXZRDKUHOPBO-GUBZILKMSA-N 0.000 description 1
- NJPMYXWVWQWCSR-ACZMJKKPSA-N Ala-Glu-Asn Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O NJPMYXWVWQWCSR-ACZMJKKPSA-N 0.000 description 1
- BVSGPHDECMJBDE-HGNGGELXSA-N Ala-Glu-His Chemical compound C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N BVSGPHDECMJBDE-HGNGGELXSA-N 0.000 description 1
- PUBLUECXJRHTBK-ACZMJKKPSA-N Ala-Glu-Ser Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O PUBLUECXJRHTBK-ACZMJKKPSA-N 0.000 description 1
- OMMDTNGURYRDAC-NRPADANISA-N Ala-Glu-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O OMMDTNGURYRDAC-NRPADANISA-N 0.000 description 1
- NHLAEBFGWPXFGI-WHFBIAKZSA-N Ala-Gly-Asn Chemical compound C[C@@H](C(=O)NCC(=O)N[C@@H](CC(=O)N)C(=O)O)N NHLAEBFGWPXFGI-WHFBIAKZSA-N 0.000 description 1
- MPLOSMWGDNJSEV-WHFBIAKZSA-N Ala-Gly-Asp Chemical compound [H]N[C@@H](C)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O MPLOSMWGDNJSEV-WHFBIAKZSA-N 0.000 description 1
- LJFNNUBZSZCZFN-WHFBIAKZSA-N Ala-Gly-Cys Chemical compound N[C@@H](C)C(=O)NCC(=O)N[C@@H](CS)C(=O)O LJFNNUBZSZCZFN-WHFBIAKZSA-N 0.000 description 1
- QHASENCZLDHBGX-ONGXEEELSA-N Ala-Gly-Phe Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 QHASENCZLDHBGX-ONGXEEELSA-N 0.000 description 1
- OBVSBEYOMDWLRJ-BFHQHQDPSA-N Ala-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@H](C)N OBVSBEYOMDWLRJ-BFHQHQDPSA-N 0.000 description 1
- PNALXAODQKTNLV-JBDRJPRFSA-N Ala-Ile-Ala Chemical compound C[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O PNALXAODQKTNLV-JBDRJPRFSA-N 0.000 description 1
- NYDBKUNVSALYPX-NAKRPEOUSA-N Ala-Ile-Arg Chemical compound C[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CCCN=C(N)N NYDBKUNVSALYPX-NAKRPEOUSA-N 0.000 description 1
- XCZXVTHYGSMQGH-NAKRPEOUSA-N Ala-Ile-Met Chemical compound C[C@H]([NH3+])C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCSC)C([O-])=O XCZXVTHYGSMQGH-NAKRPEOUSA-N 0.000 description 1
- LNNSWWRRYJLGNI-NAKRPEOUSA-N Ala-Ile-Val Chemical compound C[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O LNNSWWRRYJLGNI-NAKRPEOUSA-N 0.000 description 1
- MNZHHDPWDWQJCQ-YUMQZZPRSA-N Ala-Leu-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O MNZHHDPWDWQJCQ-YUMQZZPRSA-N 0.000 description 1
- DPNZTBKGAUAZQU-DLOVCJGASA-N Ala-Leu-His Chemical compound C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N DPNZTBKGAUAZQU-DLOVCJGASA-N 0.000 description 1
- WUHJHHGYVVJMQE-BJDJZHNGSA-N Ala-Leu-Ile Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WUHJHHGYVVJMQE-BJDJZHNGSA-N 0.000 description 1
- ZKEHTYWGPMMGBC-XUXIUFHCSA-N Ala-Leu-Leu-Ser Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O ZKEHTYWGPMMGBC-XUXIUFHCSA-N 0.000 description 1
- QPBSRMDNJOTFAL-AICCOOGYSA-N Ala-Leu-Leu-Thr Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O QPBSRMDNJOTFAL-AICCOOGYSA-N 0.000 description 1
- RGQCNKIDEQJEBT-CQDKDKBSSA-N Ala-Leu-Tyr Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 RGQCNKIDEQJEBT-CQDKDKBSSA-N 0.000 description 1
- NLOMBWNGESDVJU-GUBZILKMSA-N Ala-Met-Arg Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O NLOMBWNGESDVJU-GUBZILKMSA-N 0.000 description 1
- DEWWPUNXRNGMQN-LPEHRKFASA-N Ala-Met-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CCSC)C(=O)N1CCC[C@@H]1C(=O)O)N DEWWPUNXRNGMQN-LPEHRKFASA-N 0.000 description 1
- AWNAEZICPNGAJK-FXQIFTODSA-N Ala-Met-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CO)C(O)=O AWNAEZICPNGAJK-FXQIFTODSA-N 0.000 description 1
- BFMIRJBURUXDRG-DLOVCJGASA-N Ala-Phe-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=CC=C1 BFMIRJBURUXDRG-DLOVCJGASA-N 0.000 description 1
- KYDYGANDJHFBCW-DRZSPHRISA-N Ala-Phe-Gln Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N KYDYGANDJHFBCW-DRZSPHRISA-N 0.000 description 1
- CNQAFFMNJIQYGX-DRZSPHRISA-N Ala-Phe-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=CC=C1 CNQAFFMNJIQYGX-DRZSPHRISA-N 0.000 description 1
- HYIDEIQUCBKIPL-CQDKDKBSSA-N Ala-Phe-His Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N HYIDEIQUCBKIPL-CQDKDKBSSA-N 0.000 description 1
- WEZNQZHACPSMEF-QEJZJMRPSA-N Ala-Phe-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=CC=C1 WEZNQZHACPSMEF-QEJZJMRPSA-N 0.000 description 1
- RMAWDDRDTRSZIR-ZLUOBGJFSA-N Ala-Ser-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O RMAWDDRDTRSZIR-ZLUOBGJFSA-N 0.000 description 1
- AUFACLFHBAGZEN-ZLUOBGJFSA-N Ala-Ser-Cys Chemical compound N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)O AUFACLFHBAGZEN-ZLUOBGJFSA-N 0.000 description 1
- DYXOFPBJBAHWFY-JBDRJPRFSA-N Ala-Ser-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@H](C)N DYXOFPBJBAHWFY-JBDRJPRFSA-N 0.000 description 1
- WQKAQKZRDIZYNV-VZFHVOOUSA-N Ala-Ser-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O WQKAQKZRDIZYNV-VZFHVOOUSA-N 0.000 description 1
- SYIFFFHSXBNPMC-UWJYBYFXSA-N Ala-Ser-Tyr Chemical compound C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N SYIFFFHSXBNPMC-UWJYBYFXSA-N 0.000 description 1
- VRTOMXFZHGWHIJ-KZVJFYERSA-N Ala-Thr-Arg Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O VRTOMXFZHGWHIJ-KZVJFYERSA-N 0.000 description 1
- QKHWNPQNOHEFST-VZFHVOOUSA-N Ala-Thr-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](C)N)O QKHWNPQNOHEFST-VZFHVOOUSA-N 0.000 description 1
- LSMDIAAALJJLRO-XQXXSGGOSA-N Ala-Thr-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O LSMDIAAALJJLRO-XQXXSGGOSA-N 0.000 description 1
- WZGZDOXCDLLTHE-SYWGBEHUSA-N Ala-Trp-Ile Chemical compound C1=CC=C2C(C[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(O)=O)NC(=O)[C@H](C)N)=CNC2=C1 WZGZDOXCDLLTHE-SYWGBEHUSA-N 0.000 description 1
- YXXPVUOMPSZURS-ZLIFDBKOSA-N Ala-Trp-Leu Chemical compound C1=CC=C2C(C[C@@H](C(=O)N[C@@H](CC(C)C)C(O)=O)NC(=O)[C@H](C)N)=CNC2=C1 YXXPVUOMPSZURS-ZLIFDBKOSA-N 0.000 description 1
- CKIBTNMWVMKAHB-RWGOJESNSA-N Ala-Trp-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CC=3C4=CC=CC=C4NC=3)NC(=O)[C@@H](N)C)C(O)=O)=CNC2=C1 CKIBTNMWVMKAHB-RWGOJESNSA-N 0.000 description 1
- ZXKNLCPUNZPFGY-LEWSCRJBSA-N Ala-Tyr-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N2CCC[C@@H]2C(=O)O)N ZXKNLCPUNZPFGY-LEWSCRJBSA-N 0.000 description 1
- QRIYOHQJRDHFKF-UWJYBYFXSA-N Ala-Tyr-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=C(O)C=C1 QRIYOHQJRDHFKF-UWJYBYFXSA-N 0.000 description 1
- MUGAESARFRGOTQ-IGNZVWTISA-N Ala-Tyr-Tyr Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O)N MUGAESARFRGOTQ-IGNZVWTISA-N 0.000 description 1
- BOKLLPVAQDSLHC-FXQIFTODSA-N Ala-Val-Cys Chemical compound C[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CS)C(=O)O)N BOKLLPVAQDSLHC-FXQIFTODSA-N 0.000 description 1
- CLOMBHBBUKAUBP-LSJOCFKGSA-N Ala-Val-His Chemical compound C[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N CLOMBHBBUKAUBP-LSJOCFKGSA-N 0.000 description 1
- 102000053723 Angiotensin-converting enzyme 2 Human genes 0.000 description 1
- 108090000975 Angiotensin-converting enzyme 2 Proteins 0.000 description 1
- DFCIPNHFKOQAME-FXQIFTODSA-N Arg-Ala-Asn Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(O)=O DFCIPNHFKOQAME-FXQIFTODSA-N 0.000 description 1
- OVVUNXXROOFSIM-SDDRHHMPSA-N Arg-Arg-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CCCN=C(N)N)N)C(=O)O OVVUNXXROOFSIM-SDDRHHMPSA-N 0.000 description 1
- QPOARHANPULOTM-GMOBBJLQSA-N Arg-Asn-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCCN=C(N)N)N QPOARHANPULOTM-GMOBBJLQSA-N 0.000 description 1
- BVBKBQRPOJFCQM-DCAQKATOSA-N Arg-Asn-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O BVBKBQRPOJFCQM-DCAQKATOSA-N 0.000 description 1
- ITVINTQUZMQWJR-QXEWZRGKSA-N Arg-Asn-Val Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O ITVINTQUZMQWJR-QXEWZRGKSA-N 0.000 description 1
- KMSHNDWHPWXPEC-BQBZGAKWSA-N Arg-Asp-Gly Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O KMSHNDWHPWXPEC-BQBZGAKWSA-N 0.000 description 1
- JSHVMZANPXCDTL-GMOBBJLQSA-N Arg-Asp-Ile Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JSHVMZANPXCDTL-GMOBBJLQSA-N 0.000 description 1
- MFAMTAVAFBPXDC-LPEHRKFASA-N Arg-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCCN=C(N)N)N)C(=O)O MFAMTAVAFBPXDC-LPEHRKFASA-N 0.000 description 1
- VXXHDZKEQNGXNU-QXEWZRGKSA-N Arg-Asp-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CCCN=C(N)N VXXHDZKEQNGXNU-QXEWZRGKSA-N 0.000 description 1
- NAARDJBSSPUGCF-FXQIFTODSA-N Arg-Cys-Asn Chemical compound C(C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(=O)N)C(=O)O)N)CN=C(N)N NAARDJBSSPUGCF-FXQIFTODSA-N 0.000 description 1
- YWENWUYXQUWRHQ-LPEHRKFASA-N Arg-Cys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CS)NC(=O)[C@H](CCCN=C(N)N)N)C(=O)O YWENWUYXQUWRHQ-LPEHRKFASA-N 0.000 description 1
- LLZXKVAAEWBUPB-KKUMJFAQSA-N Arg-Gln-Phe Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O LLZXKVAAEWBUPB-KKUMJFAQSA-N 0.000 description 1
- HPKSHFSEXICTLI-CIUDSAMLSA-N Arg-Glu-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O HPKSHFSEXICTLI-CIUDSAMLSA-N 0.000 description 1
- SKTGPBFTMNLIHQ-KKUMJFAQSA-N Arg-Glu-Phe Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O SKTGPBFTMNLIHQ-KKUMJFAQSA-N 0.000 description 1
- RFXXUWGNVRJTNQ-QXEWZRGKSA-N Arg-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CCCN=C(N)N)N RFXXUWGNVRJTNQ-QXEWZRGKSA-N 0.000 description 1
- KRQSPVKUISQQFS-FJXKBIBVSA-N Arg-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CCCN=C(N)N KRQSPVKUISQQFS-FJXKBIBVSA-N 0.000 description 1
- SLNCSSWAIDUUGF-LSJOCFKGSA-N Arg-His-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(O)=O SLNCSSWAIDUUGF-LSJOCFKGSA-N 0.000 description 1
- DGFXIWKPTDKBLF-AVGNSLFASA-N Arg-His-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CCCN=C(N)N)N DGFXIWKPTDKBLF-AVGNSLFASA-N 0.000 description 1
- YKBHOXLMMPZPHQ-GMOBBJLQSA-N Arg-Ile-Asp Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O YKBHOXLMMPZPHQ-GMOBBJLQSA-N 0.000 description 1
- YQGZIRIYGHNSQO-ZPFDUUQYSA-N Arg-Ile-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N YQGZIRIYGHNSQO-ZPFDUUQYSA-N 0.000 description 1
- HJDNZFIYILEIKR-OSUNSFLBSA-N Arg-Ile-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O HJDNZFIYILEIKR-OSUNSFLBSA-N 0.000 description 1
- LVMUGODRNHFGRA-AVGNSLFASA-N Arg-Leu-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O LVMUGODRNHFGRA-AVGNSLFASA-N 0.000 description 1
- UHFUZWSZQKMDSX-DCAQKATOSA-N Arg-Leu-Asn Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N UHFUZWSZQKMDSX-DCAQKATOSA-N 0.000 description 1
- GMFAGHNRXPSSJS-SRVKXCTJSA-N Arg-Leu-Gln Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O GMFAGHNRXPSSJS-SRVKXCTJSA-N 0.000 description 1
- UZGFHWIJWPUPOH-IHRRRGAJSA-N Arg-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N UZGFHWIJWPUPOH-IHRRRGAJSA-N 0.000 description 1
- PZBSKYJGKNNYNK-ULQDDVLXSA-N Arg-Leu-Tyr Chemical compound CC(C)C[C@H](NC(=O)[C@@H](N)CCCN=C(N)N)C(=O)N[C@@H](Cc1ccc(O)cc1)C(O)=O PZBSKYJGKNNYNK-ULQDDVLXSA-N 0.000 description 1
- YVTHEZNOKSAWRW-DCAQKATOSA-N Arg-Lys-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O YVTHEZNOKSAWRW-DCAQKATOSA-N 0.000 description 1
- OISWSORSLQOGFV-AVGNSLFASA-N Arg-Met-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCSC)NC(=O)[C@@H](N)CCCN=C(N)N OISWSORSLQOGFV-AVGNSLFASA-N 0.000 description 1
- CZUHPNLXLWMYMG-UBHSHLNASA-N Arg-Phe-Ala Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C)C(O)=O)CC1=CC=CC=C1 CZUHPNLXLWMYMG-UBHSHLNASA-N 0.000 description 1
- VEAIMHJZTIDCIH-KKUMJFAQSA-N Arg-Phe-Gln Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O VEAIMHJZTIDCIH-KKUMJFAQSA-N 0.000 description 1
- LXMKTIZAGIBQRX-HRCADAONSA-N Arg-Phe-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)NC(=O)[C@H](CCCN=C(N)N)N)C(=O)O LXMKTIZAGIBQRX-HRCADAONSA-N 0.000 description 1
- BECXEHHOZNFFFX-IHRRRGAJSA-N Arg-Ser-Tyr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O BECXEHHOZNFFFX-IHRRRGAJSA-N 0.000 description 1
- MOGMYRUNTKYZFB-UNQGMJICSA-N Arg-Thr-Phe Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 MOGMYRUNTKYZFB-UNQGMJICSA-N 0.000 description 1
- BWMMKQPATDUYKB-IHRRRGAJSA-N Arg-Tyr-Asn Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(N)=O)C(O)=O)CC1=CC=C(O)C=C1 BWMMKQPATDUYKB-IHRRRGAJSA-N 0.000 description 1
- NMTANZXPDAHUKU-ULQDDVLXSA-N Arg-Tyr-Lys Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CCCCN)C(O)=O)CC1=CC=C(O)C=C1 NMTANZXPDAHUKU-ULQDDVLXSA-N 0.000 description 1
- LFWOQHSQNCKXRU-UFYCRDLUSA-N Arg-Tyr-Phe Chemical compound C([C@H](NC(=O)[C@H](CCCN=C(N)N)N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=C(O)C=C1 LFWOQHSQNCKXRU-UFYCRDLUSA-N 0.000 description 1
- JYHIVHINLJUIEG-BVSLBCMMSA-N Arg-Tyr-Trp Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O JYHIVHINLJUIEG-BVSLBCMMSA-N 0.000 description 1
- UTSMXMABBPFVJP-SZMVWBNQSA-N Arg-Val-Trp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N UTSMXMABBPFVJP-SZMVWBNQSA-N 0.000 description 1
- WHLDJYNHXOMGMU-JYJNAYRXSA-N Arg-Val-Tyr Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 WHLDJYNHXOMGMU-JYJNAYRXSA-N 0.000 description 1
- ANAHQDPQQBDOBM-UHFFFAOYSA-N Arg-Val-Tyr Natural products CC(C)C(NC(=O)C(N)CCNC(=N)N)C(=O)NC(Cc1ccc(O)cc1)C(=O)O ANAHQDPQQBDOBM-UHFFFAOYSA-N 0.000 description 1
- SWLOHUMCUDRTCL-ZLUOBGJFSA-N Asn-Ala-Asn Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N SWLOHUMCUDRTCL-ZLUOBGJFSA-N 0.000 description 1
- HZPSDHRYYIORKR-WHFBIAKZSA-N Asn-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H](N)CC(N)=O HZPSDHRYYIORKR-WHFBIAKZSA-N 0.000 description 1
- CMLGVVWQQHUXOZ-GHCJXIJMSA-N Asn-Ala-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O CMLGVVWQQHUXOZ-GHCJXIJMSA-N 0.000 description 1
- XYOVHPDDWCEUDY-CIUDSAMLSA-N Asn-Ala-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O XYOVHPDDWCEUDY-CIUDSAMLSA-N 0.000 description 1
- XWGJDUSDTRPQRK-ZLUOBGJFSA-N Asn-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC(N)=O XWGJDUSDTRPQRK-ZLUOBGJFSA-N 0.000 description 1
- VDCIPFYVCICPEC-FXQIFTODSA-N Asn-Arg-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(O)=O VDCIPFYVCICPEC-FXQIFTODSA-N 0.000 description 1
- HUZGPXBILPMCHM-IHRRRGAJSA-N Asn-Arg-Phe Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O HUZGPXBILPMCHM-IHRRRGAJSA-N 0.000 description 1
- JEPNYDRDYNSFIU-QXEWZRGKSA-N Asn-Arg-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@@H](N)CC(N)=O)C(O)=O JEPNYDRDYNSFIU-QXEWZRGKSA-N 0.000 description 1
- YNSCBOUZTAGIGO-ZLUOBGJFSA-N Asn-Asn-Cys Chemical compound C([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CS)C(=O)O)N)C(=O)N YNSCBOUZTAGIGO-ZLUOBGJFSA-N 0.000 description 1
- LJUOLNXOWSWGKF-ACZMJKKPSA-N Asn-Asn-Glu Chemical compound C(CC(=O)O)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC(=O)N)N LJUOLNXOWSWGKF-ACZMJKKPSA-N 0.000 description 1
- IOTKDTZEEBZNCM-UGYAYLCHSA-N Asn-Asn-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O IOTKDTZEEBZNCM-UGYAYLCHSA-N 0.000 description 1
- APHUDFFMXFYRKP-CIUDSAMLSA-N Asn-Asn-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC(=O)N)N APHUDFFMXFYRKP-CIUDSAMLSA-N 0.000 description 1
- QHBMKQWOIYJYMI-BYULHYEWSA-N Asn-Asn-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O QHBMKQWOIYJYMI-BYULHYEWSA-N 0.000 description 1
- XSGBIBGAMKTHMY-WHFBIAKZSA-N Asn-Asp-Gly Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O XSGBIBGAMKTHMY-WHFBIAKZSA-N 0.000 description 1
- UGXVKHRDGLYFKR-CIUDSAMLSA-N Asn-Asp-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC(N)=O UGXVKHRDGLYFKR-CIUDSAMLSA-N 0.000 description 1
- BGINHSZTXRJIPP-FXQIFTODSA-N Asn-Asp-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC(=O)N)N BGINHSZTXRJIPP-FXQIFTODSA-N 0.000 description 1
- JZRLLSOWDYUKOK-SRVKXCTJSA-N Asn-Asp-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC(=O)N)N JZRLLSOWDYUKOK-SRVKXCTJSA-N 0.000 description 1
- VYLVOMUVLMGCRF-ZLUOBGJFSA-N Asn-Asp-Ser Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O VYLVOMUVLMGCRF-ZLUOBGJFSA-N 0.000 description 1
- HLTLEIXYIJDFOY-ZLUOBGJFSA-N Asn-Cys-Asn Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(N)=O)C(O)=O HLTLEIXYIJDFOY-ZLUOBGJFSA-N 0.000 description 1
- ZPMNECSEJXXNBE-CIUDSAMLSA-N Asn-Cys-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(O)=O ZPMNECSEJXXNBE-CIUDSAMLSA-N 0.000 description 1
- CZIXHXIJJZLYRJ-SRVKXCTJSA-N Asn-Cys-Tyr Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 CZIXHXIJJZLYRJ-SRVKXCTJSA-N 0.000 description 1
- SQZIAWGBBUSSPJ-ZKWXMUAHSA-N Asn-Cys-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CC(=O)N)N SQZIAWGBBUSSPJ-ZKWXMUAHSA-N 0.000 description 1
- QNJIRRVTOXNGMH-GUBZILKMSA-N Asn-Gln-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CC(N)=O QNJIRRVTOXNGMH-GUBZILKMSA-N 0.000 description 1
- QPTAGIPWARILES-AVGNSLFASA-N Asn-Gln-Phe Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O QPTAGIPWARILES-AVGNSLFASA-N 0.000 description 1
- UBKOVSLDWIHYSY-ACZMJKKPSA-N Asn-Glu-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O UBKOVSLDWIHYSY-ACZMJKKPSA-N 0.000 description 1
- JZDZLBJVYWIIQU-AVGNSLFASA-N Asn-Glu-Tyr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O JZDZLBJVYWIIQU-AVGNSLFASA-N 0.000 description 1
- DDPXDCKYWDGZAL-BQBZGAKWSA-N Asn-Gly-Arg Chemical compound NC(=O)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCN=C(N)N DDPXDCKYWDGZAL-BQBZGAKWSA-N 0.000 description 1
- OWUCNXMFJRFOFI-BQBZGAKWSA-N Asn-Gly-Met Chemical compound [H]N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](CCSC)C(O)=O OWUCNXMFJRFOFI-BQBZGAKWSA-N 0.000 description 1
- FTCGGKNCJZOPNB-WHFBIAKZSA-N Asn-Gly-Ser Chemical compound NC(=O)C[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O FTCGGKNCJZOPNB-WHFBIAKZSA-N 0.000 description 1
- ANPFQTJEPONRPL-UGYAYLCHSA-N Asn-Ile-Asp Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O ANPFQTJEPONRPL-UGYAYLCHSA-N 0.000 description 1
- PHJPKNUWWHRAOC-PEFMBERDSA-N Asn-Ile-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N PHJPKNUWWHRAOC-PEFMBERDSA-N 0.000 description 1
- ACKNRKFVYUVWAC-ZPFDUUQYSA-N Asn-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)N)N ACKNRKFVYUVWAC-ZPFDUUQYSA-N 0.000 description 1
- SEKBHZJLARBNPB-GHCJXIJMSA-N Asn-Ile-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O SEKBHZJLARBNPB-GHCJXIJMSA-N 0.000 description 1
- ZMUQQMGITUJQTI-CIUDSAMLSA-N Asn-Leu-Asn Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O ZMUQQMGITUJQTI-CIUDSAMLSA-N 0.000 description 1
- UHGUKCOQUNPSKK-CIUDSAMLSA-N Asn-Leu-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC(=O)N)N UHGUKCOQUNPSKK-CIUDSAMLSA-N 0.000 description 1
- HFPXZWPUVFVNLL-GUBZILKMSA-N Asn-Leu-Gln Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O HFPXZWPUVFVNLL-GUBZILKMSA-N 0.000 description 1
- YVXRYLVELQYAEQ-SRVKXCTJSA-N Asn-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)N)N YVXRYLVELQYAEQ-SRVKXCTJSA-N 0.000 description 1
- FHETWELNCBMRMG-HJGDQZAQSA-N Asn-Leu-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O FHETWELNCBMRMG-HJGDQZAQSA-N 0.000 description 1
- TZFQICWZWFNIKU-KKUMJFAQSA-N Asn-Leu-Tyr Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 TZFQICWZWFNIKU-KKUMJFAQSA-N 0.000 description 1
- NCFJQJRLQJEECD-NHCYSSNCSA-N Asn-Leu-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O NCFJQJRLQJEECD-NHCYSSNCSA-N 0.000 description 1
- JWKDQOORUCYUIW-ZPFDUUQYSA-N Asn-Lys-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JWKDQOORUCYUIW-ZPFDUUQYSA-N 0.000 description 1
- ORJQQZIXTOYGGH-SRVKXCTJSA-N Asn-Lys-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O ORJQQZIXTOYGGH-SRVKXCTJSA-N 0.000 description 1
- OROMFUQQTSWUTI-IHRRRGAJSA-N Asn-Phe-Arg Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N OROMFUQQTSWUTI-IHRRRGAJSA-N 0.000 description 1
- YXVAESUIQFDBHN-SRVKXCTJSA-N Asn-Phe-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O YXVAESUIQFDBHN-SRVKXCTJSA-N 0.000 description 1
- BKFXFUPYETWGGA-XVSYOHENSA-N Asn-Phe-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O BKFXFUPYETWGGA-XVSYOHENSA-N 0.000 description 1
- RBOBTTLFPRSXKZ-BZSNNMDCSA-N Asn-Phe-Tyr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O RBOBTTLFPRSXKZ-BZSNNMDCSA-N 0.000 description 1
- SUIJFTJDTJKSRK-IHRRRGAJSA-N Asn-Pro-Tyr Chemical compound NC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 SUIJFTJDTJKSRK-IHRRRGAJSA-N 0.000 description 1
- VHQSGALUSWIYOD-QXEWZRGKSA-N Asn-Pro-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O VHQSGALUSWIYOD-QXEWZRGKSA-N 0.000 description 1
- UGXYFDQFLVCDFC-CIUDSAMLSA-N Asn-Ser-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC(N)=O UGXYFDQFLVCDFC-CIUDSAMLSA-N 0.000 description 1
- HPNDKUOLNRVRAY-BIIVOSGPSA-N Asn-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CC(=O)N)N)C(=O)O HPNDKUOLNRVRAY-BIIVOSGPSA-N 0.000 description 1
- QYRMBFWDSFGSFC-OLHMAJIHSA-N Asn-Thr-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N)O QYRMBFWDSFGSFC-OLHMAJIHSA-N 0.000 description 1
- HPASIOLTWSNMFB-OLHMAJIHSA-N Asn-Thr-Asp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(O)=O HPASIOLTWSNMFB-OLHMAJIHSA-N 0.000 description 1
- AMGQTNHANMRPOE-LKXGYXEUSA-N Asn-Thr-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O AMGQTNHANMRPOE-LKXGYXEUSA-N 0.000 description 1
- UXHYOWXTJLBEPG-GSSVUCPTSA-N Asn-Thr-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O UXHYOWXTJLBEPG-GSSVUCPTSA-N 0.000 description 1
- BCADFFUQHIMQAA-KKHAAJSZSA-N Asn-Thr-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O BCADFFUQHIMQAA-KKHAAJSZSA-N 0.000 description 1
- NSTBNYOKCZKOMI-AVGNSLFASA-N Asn-Tyr-Glu Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CC(=O)N)N)O NSTBNYOKCZKOMI-AVGNSLFASA-N 0.000 description 1
- BEHQTVDBCLSCBY-CFMVVWHZSA-N Asn-Tyr-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O BEHQTVDBCLSCBY-CFMVVWHZSA-N 0.000 description 1
- CBWCQCANJSGUOH-ZKWXMUAHSA-N Asn-Val-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O CBWCQCANJSGUOH-ZKWXMUAHSA-N 0.000 description 1
- LTDGPJKGJDIBQD-LAEOZQHASA-N Asn-Val-Gln Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O LTDGPJKGJDIBQD-LAEOZQHASA-N 0.000 description 1
- KBQOUDLMWYWXNP-YDHLFZDLSA-N Asn-Val-Phe Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CC(=O)N)N KBQOUDLMWYWXNP-YDHLFZDLSA-N 0.000 description 1
- XOQYDFCQPWAMSA-KKHAAJSZSA-N Asn-Val-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XOQYDFCQPWAMSA-KKHAAJSZSA-N 0.000 description 1
- WQAOZCVOOYUWKG-LSJOCFKGSA-N Asn-Val-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)O)NC(=O)[C@H](CC(=O)N)N WQAOZCVOOYUWKG-LSJOCFKGSA-N 0.000 description 1
- KRXIWXCXOARFNT-ZLUOBGJFSA-N Asp-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC(O)=O KRXIWXCXOARFNT-ZLUOBGJFSA-N 0.000 description 1
- WSWYMRLTJVKRCE-ZLUOBGJFSA-N Asp-Ala-Asp Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(O)=O WSWYMRLTJVKRCE-ZLUOBGJFSA-N 0.000 description 1
- PBVLJOIPOGUQQP-CIUDSAMLSA-N Asp-Ala-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O PBVLJOIPOGUQQP-CIUDSAMLSA-N 0.000 description 1
- KVMPVNGOKHTUHZ-GCJQMDKQSA-N Asp-Ala-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KVMPVNGOKHTUHZ-GCJQMDKQSA-N 0.000 description 1
- RGKKALNPOYURGE-ZKWXMUAHSA-N Asp-Ala-Val Chemical compound N[C@@H](CC(=O)O)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)O RGKKALNPOYURGE-ZKWXMUAHSA-N 0.000 description 1
- OERMIMJQPQUIPK-FXQIFTODSA-N Asp-Arg-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(O)=O OERMIMJQPQUIPK-FXQIFTODSA-N 0.000 description 1
- SOYOSFXLXYZNRG-CIUDSAMLSA-N Asp-Arg-Gln Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(O)=O SOYOSFXLXYZNRG-CIUDSAMLSA-N 0.000 description 1
- ILJQISGMGXRZQQ-IHRRRGAJSA-N Asp-Arg-Tyr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O ILJQISGMGXRZQQ-IHRRRGAJSA-N 0.000 description 1
- RDRMWJBLOSRRAW-BYULHYEWSA-N Asp-Asn-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O RDRMWJBLOSRRAW-BYULHYEWSA-N 0.000 description 1
- JGDBHIVECJGXJA-FXQIFTODSA-N Asp-Asp-Arg Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O JGDBHIVECJGXJA-FXQIFTODSA-N 0.000 description 1
- QOVWVLLHMMCFFY-ZLUOBGJFSA-N Asp-Asp-Asn Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O QOVWVLLHMMCFFY-ZLUOBGJFSA-N 0.000 description 1
- VPSHHQXIWLGVDD-ZLUOBGJFSA-N Asp-Asp-Asp Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O VPSHHQXIWLGVDD-ZLUOBGJFSA-N 0.000 description 1
- FANQWNCPNFEPGZ-WHFBIAKZSA-N Asp-Asp-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O FANQWNCPNFEPGZ-WHFBIAKZSA-N 0.000 description 1
- FTNVLGCFIJEMQT-CIUDSAMLSA-N Asp-Cys-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CC(=O)O)N FTNVLGCFIJEMQT-CIUDSAMLSA-N 0.000 description 1
- KVPHTGVUMJGMCX-BIIVOSGPSA-N Asp-Cys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CS)NC(=O)[C@H](CC(=O)O)N)C(=O)O KVPHTGVUMJGMCX-BIIVOSGPSA-N 0.000 description 1
- DZQKLNLLWFQONU-LKXGYXEUSA-N Asp-Cys-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CC(=O)O)N)O DZQKLNLLWFQONU-LKXGYXEUSA-N 0.000 description 1
- RYKWOUUZJFSJOH-FXQIFTODSA-N Asp-Gln-Glu Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CC(=O)O)N RYKWOUUZJFSJOH-FXQIFTODSA-N 0.000 description 1
- XAJRHVUUVUPFQL-ACZMJKKPSA-N Asp-Glu-Asp Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O XAJRHVUUVUPFQL-ACZMJKKPSA-N 0.000 description 1
- PGUYEUCYVNZGGV-QWRGUYRKSA-N Asp-Gly-Tyr Chemical compound OC(=O)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 PGUYEUCYVNZGGV-QWRGUYRKSA-N 0.000 description 1
- WSGVTKZFVJSJOG-RCOVLWMOSA-N Asp-Gly-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O WSGVTKZFVJSJOG-RCOVLWMOSA-N 0.000 description 1
- YRBGRUOSJROZEI-NHCYSSNCSA-N Asp-His-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C(C)C)C(O)=O YRBGRUOSJROZEI-NHCYSSNCSA-N 0.000 description 1
- HOBNTSHITVVNBN-ZPFDUUQYSA-N Asp-Ile-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)O)NC(=O)[C@H](CC(=O)O)N HOBNTSHITVVNBN-ZPFDUUQYSA-N 0.000 description 1
- SPKCGKRUYKMDHP-GUDRVLHUSA-N Asp-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)O)N SPKCGKRUYKMDHP-GUDRVLHUSA-N 0.000 description 1
- SPWXXPFDTMYTRI-IUKAMOBKSA-N Asp-Ile-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SPWXXPFDTMYTRI-IUKAMOBKSA-N 0.000 description 1
- XLILXFRAKOYEJX-GUBZILKMSA-N Asp-Leu-Gln Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O XLILXFRAKOYEJX-GUBZILKMSA-N 0.000 description 1
- AYFVRYXNDHBECD-YUMQZZPRSA-N Asp-Leu-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O AYFVRYXNDHBECD-YUMQZZPRSA-N 0.000 description 1
- CJUKAWUWBZCTDQ-SRVKXCTJSA-N Asp-Leu-Lys Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(O)=O CJUKAWUWBZCTDQ-SRVKXCTJSA-N 0.000 description 1
- UMHUHHJMEXNSIV-CIUDSAMLSA-N Asp-Leu-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC(O)=O UMHUHHJMEXNSIV-CIUDSAMLSA-N 0.000 description 1
- UZFHNLYQWMGUHU-DCAQKATOSA-N Asp-Lys-Arg Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O UZFHNLYQWMGUHU-DCAQKATOSA-N 0.000 description 1
- YVHGKXAOSVBGJV-CIUDSAMLSA-N Asp-Lys-Cys Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC(=O)O)N YVHGKXAOSVBGJV-CIUDSAMLSA-N 0.000 description 1
- YWLDTBBUHZJQHW-KKUMJFAQSA-N Asp-Lys-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC(=O)O)N YWLDTBBUHZJQHW-KKUMJFAQSA-N 0.000 description 1
- RXBGWGRSWXOBGK-KKUMJFAQSA-N Asp-Lys-Tyr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O RXBGWGRSWXOBGK-KKUMJFAQSA-N 0.000 description 1
- DJCAHYVLMSRBFR-QXEWZRGKSA-N Asp-Met-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CCSC)NC(=O)[C@@H](N)CC(O)=O DJCAHYVLMSRBFR-QXEWZRGKSA-N 0.000 description 1
- GYWQGGUCMDCUJE-DLOVCJGASA-N Asp-Phe-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C)C(O)=O GYWQGGUCMDCUJE-DLOVCJGASA-N 0.000 description 1
- IDDMGSKZQDEDGA-SRVKXCTJSA-N Asp-Phe-Asn Chemical compound OC(=O)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(N)=O)C(O)=O)CC1=CC=CC=C1 IDDMGSKZQDEDGA-SRVKXCTJSA-N 0.000 description 1
- WZUZGDANRQPCDD-SRVKXCTJSA-N Asp-Phe-Cys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC(=O)O)N WZUZGDANRQPCDD-SRVKXCTJSA-N 0.000 description 1
- USNJAPJZSGTTPX-XVSYOHENSA-N Asp-Phe-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O USNJAPJZSGTTPX-XVSYOHENSA-N 0.000 description 1
- NONWUQAWAANERO-BZSNNMDCSA-N Asp-Phe-Tyr Chemical compound C([C@H](NC(=O)[C@H](CC(O)=O)N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=CC=C1 NONWUQAWAANERO-BZSNNMDCSA-N 0.000 description 1
- KPSHWSWFPUDEGF-FXQIFTODSA-N Asp-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CC(O)=O KPSHWSWFPUDEGF-FXQIFTODSA-N 0.000 description 1
- XUVTWGPERWIERB-IHRRRGAJSA-N Asp-Pro-Phe Chemical compound N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](Cc1ccccc1)C(O)=O XUVTWGPERWIERB-IHRRRGAJSA-N 0.000 description 1
- VNXQRBXEQXLERQ-CIUDSAMLSA-N Asp-Ser-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CC(=O)O)N VNXQRBXEQXLERQ-CIUDSAMLSA-N 0.000 description 1
- OFYVKOXTTDCUIL-FXQIFTODSA-N Asp-Ser-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CC(=O)O)N OFYVKOXTTDCUIL-FXQIFTODSA-N 0.000 description 1
- ZQFRDAZBTSFGGW-SRVKXCTJSA-N Asp-Ser-Phe Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O ZQFRDAZBTSFGGW-SRVKXCTJSA-N 0.000 description 1
- IWLZBRTUIVXZJD-OLHMAJIHSA-N Asp-Thr-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(O)=O IWLZBRTUIVXZJD-OLHMAJIHSA-N 0.000 description 1
- PDIYGFYAMZZFCW-JIOCBJNQSA-N Asp-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)O)N)O PDIYGFYAMZZFCW-JIOCBJNQSA-N 0.000 description 1
- GCACQYDBDHRVGE-LKXGYXEUSA-N Asp-Thr-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H]([C@H](O)C)NC(=O)[C@@H](N)CC(O)=O GCACQYDBDHRVGE-LKXGYXEUSA-N 0.000 description 1
- RSMZEHCMIOKNMW-GSSVUCPTSA-N Asp-Thr-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O RSMZEHCMIOKNMW-GSSVUCPTSA-N 0.000 description 1
- LEYKQPDPZJIRTA-AQZXSJQPSA-N Asp-Trp-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LEYKQPDPZJIRTA-AQZXSJQPSA-N 0.000 description 1
- NJLLRXWFPQQPHV-SRVKXCTJSA-N Asp-Tyr-Asn Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(O)=O NJLLRXWFPQQPHV-SRVKXCTJSA-N 0.000 description 1
- BPAUXFVCSYQDQX-JRQIVUDYSA-N Asp-Tyr-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CC(=O)O)N)O BPAUXFVCSYQDQX-JRQIVUDYSA-N 0.000 description 1
- PLOKOIJSGCISHE-BYULHYEWSA-N Asp-Val-Asn Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O PLOKOIJSGCISHE-BYULHYEWSA-N 0.000 description 1
- SFJUYBCDQBAYAJ-YDHLFZDLSA-N Asp-Val-Phe Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 SFJUYBCDQBAYAJ-YDHLFZDLSA-N 0.000 description 1
- 208000031504 Asymptomatic Infections Diseases 0.000 description 1
- NOWKCMXCCJGMRR-UHFFFAOYSA-N Aziridine Chemical compound C1CN1 NOWKCMXCCJGMRR-UHFFFAOYSA-N 0.000 description 1
- 101150011571 BSL2 gene Proteins 0.000 description 1
- BVKZGUZCCUSVTD-UHFFFAOYSA-L Carbonate Chemical compound [O-]C([O-])=O BVKZGUZCCUSVTD-UHFFFAOYSA-L 0.000 description 1
- TVYMKYUSZSVOAG-ZLUOBGJFSA-N Cys-Ala-Ala Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O TVYMKYUSZSVOAG-ZLUOBGJFSA-N 0.000 description 1
- YFXFOZPXVFPBDH-VZFHVOOUSA-N Cys-Ala-Thr Chemical compound C[C@@H](O)[C@H](NC(=O)[C@H](C)NC(=O)[C@@H](N)CS)C(O)=O YFXFOZPXVFPBDH-VZFHVOOUSA-N 0.000 description 1
- OCEHKDFAWQIBHH-FXQIFTODSA-N Cys-Arg-Cys Chemical compound C(C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CS)N)CN=C(N)N OCEHKDFAWQIBHH-FXQIFTODSA-N 0.000 description 1
- NLCZGISONIGRQP-DCAQKATOSA-N Cys-Arg-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CS)N NLCZGISONIGRQP-DCAQKATOSA-N 0.000 description 1
- JTNKVWLMDHIUOG-IHRRRGAJSA-N Cys-Arg-Phe Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O JTNKVWLMDHIUOG-IHRRRGAJSA-N 0.000 description 1
- FWYBFUDWUUFLDN-FXQIFTODSA-N Cys-Asp-Arg Chemical compound C(C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CS)N)CN=C(N)N FWYBFUDWUUFLDN-FXQIFTODSA-N 0.000 description 1
- VZKXOWRNJDEGLZ-WHFBIAKZSA-N Cys-Asp-Gly Chemical compound SC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O VZKXOWRNJDEGLZ-WHFBIAKZSA-N 0.000 description 1
- LDIKUWLAMDFHPU-FXQIFTODSA-N Cys-Cys-Arg Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O LDIKUWLAMDFHPU-FXQIFTODSA-N 0.000 description 1
- ZIKWRNJXFIQECJ-CIUDSAMLSA-N Cys-Cys-Leu Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(O)=O ZIKWRNJXFIQECJ-CIUDSAMLSA-N 0.000 description 1
- BVFQOPGFOQVZTE-ACZMJKKPSA-N Cys-Gln-Ala Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O BVFQOPGFOQVZTE-ACZMJKKPSA-N 0.000 description 1
- PFAQXUDMZVMADG-AVGNSLFASA-N Cys-Gln-Tyr Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O PFAQXUDMZVMADG-AVGNSLFASA-N 0.000 description 1
- SFRQEQGPRTVDPO-NRPADANISA-N Cys-Gln-Val Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O SFRQEQGPRTVDPO-NRPADANISA-N 0.000 description 1
- GUKYYUFHWYRMEU-WHFBIAKZSA-N Cys-Gly-Asp Chemical compound [H]N[C@@H](CS)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O GUKYYUFHWYRMEU-WHFBIAKZSA-N 0.000 description 1
- SKSJPIBFNFPTJB-NKWVEPMBSA-N Cys-Gly-Pro Chemical compound C1C[C@@H](N(C1)C(=O)CNC(=O)[C@H](CS)N)C(=O)O SKSJPIBFNFPTJB-NKWVEPMBSA-N 0.000 description 1
- UXIYYUMGFNSGBK-XPUUQOCRSA-N Cys-Gly-Val Chemical compound [H]N[C@@H](CS)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O UXIYYUMGFNSGBK-XPUUQOCRSA-N 0.000 description 1
- ANRWXLYGJRSQEQ-CIUDSAMLSA-N Cys-His-Asp Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(O)=O)C(O)=O ANRWXLYGJRSQEQ-CIUDSAMLSA-N 0.000 description 1
- DVIHGGUODLILFN-GHCJXIJMSA-N Cys-Ile-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CS)N DVIHGGUODLILFN-GHCJXIJMSA-N 0.000 description 1
- UCSXXFRXHGUXCQ-SRVKXCTJSA-N Cys-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CS)N UCSXXFRXHGUXCQ-SRVKXCTJSA-N 0.000 description 1
- JXVFJOMFOLFPMP-KKUMJFAQSA-N Cys-Leu-Tyr Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O JXVFJOMFOLFPMP-KKUMJFAQSA-N 0.000 description 1
- VXLXATVURDNDCG-CIUDSAMLSA-N Cys-Lys-Asp Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CS)N VXLXATVURDNDCG-CIUDSAMLSA-N 0.000 description 1
- VDUPGIDTWNQAJD-CIUDSAMLSA-N Cys-Lys-Cys Chemical compound NCCCC[C@H](NC(=O)[C@@H](N)CS)C(=O)N[C@@H](CS)C(O)=O VDUPGIDTWNQAJD-CIUDSAMLSA-N 0.000 description 1
- NIXHTNJAGGFBAW-CIUDSAMLSA-N Cys-Lys-Ser Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CS)N NIXHTNJAGGFBAW-CIUDSAMLSA-N 0.000 description 1
- SNHRIJBANHPWMO-XGEHTFHBSA-N Cys-Met-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](CS)N)O SNHRIJBANHPWMO-XGEHTFHBSA-N 0.000 description 1
- UIKLEGZPIOXFHJ-DLOVCJGASA-N Cys-Phe-Ala Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C)C(O)=O UIKLEGZPIOXFHJ-DLOVCJGASA-N 0.000 description 1
- CHRCKSPMGYDLIA-SRVKXCTJSA-N Cys-Phe-Ser Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O CHRCKSPMGYDLIA-SRVKXCTJSA-N 0.000 description 1
- QQOWCDCBFFBRQH-IXOXFDKPSA-N Cys-Phe-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](CS)N)O QQOWCDCBFFBRQH-IXOXFDKPSA-N 0.000 description 1
- KVCJEMHFLGVINV-ZLUOBGJFSA-N Cys-Ser-Asn Chemical compound SC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC(N)=O KVCJEMHFLGVINV-ZLUOBGJFSA-N 0.000 description 1
- BCWIFCLVCRAIQK-ZLUOBGJFSA-N Cys-Ser-Cys Chemical compound C([C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CS)N)O BCWIFCLVCRAIQK-ZLUOBGJFSA-N 0.000 description 1
- UEHCDNYDBBCQEL-CIUDSAMLSA-N Cys-Ser-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CS)N UEHCDNYDBBCQEL-CIUDSAMLSA-N 0.000 description 1
- VCPHQVQGVSKDHY-FXQIFTODSA-N Cys-Ser-Met Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCSC)C(O)=O VCPHQVQGVSKDHY-FXQIFTODSA-N 0.000 description 1
- PNEAWXSKCKCHDK-XIRDDKMYSA-N Cys-Trp-His Chemical compound C([C@H](NC(=O)[C@H](CC=1C2=CC=CC=C2NC=1)NC(=O)[C@H](CS)N)C(O)=O)C1=CN=CN1 PNEAWXSKCKCHDK-XIRDDKMYSA-N 0.000 description 1
- KARBMKZDLYMMOW-JYBASQMISA-N Cys-Trp-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@H](CS)N)O KARBMKZDLYMMOW-JYBASQMISA-N 0.000 description 1
- AZDQAZRURQMSQD-XPUUQOCRSA-N Cys-Val-Gly Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O AZDQAZRURQMSQD-XPUUQOCRSA-N 0.000 description 1
- QQAYIVHVRFJICE-AEJSXWLSSA-N Cys-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CS)N QQAYIVHVRFJICE-AEJSXWLSSA-N 0.000 description 1
- 108010090461 DFG peptide Proteins 0.000 description 1
- 108010041986 DNA Vaccines Proteins 0.000 description 1
- 208000001490 Dengue Diseases 0.000 description 1
- 206010012310 Dengue fever Diseases 0.000 description 1
- 206010061818 Disease progression Diseases 0.000 description 1
- 239000012591 Dulbecco’s Phosphate Buffered Saline Substances 0.000 description 1
- 208000000059 Dyspnea Diseases 0.000 description 1
- 206010013975 Dyspnoeas Diseases 0.000 description 1
- 239000006145 Eagle's minimal essential medium Substances 0.000 description 1
- 206010015548 Euthanasia Diseases 0.000 description 1
- 241000710831 Flavivirus Species 0.000 description 1
- LKUWAWGNJYJODH-KBIXCLLPSA-N Gln-Ala-Ile Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O LKUWAWGNJYJODH-KBIXCLLPSA-N 0.000 description 1
- KVYVOGYEMPEXBT-GUBZILKMSA-N Gln-Ala-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(N)=O KVYVOGYEMPEXBT-GUBZILKMSA-N 0.000 description 1
- IGNGBUVODQLMRJ-CIUDSAMLSA-N Gln-Ala-Met Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCSC)C(O)=O IGNGBUVODQLMRJ-CIUDSAMLSA-N 0.000 description 1
- OVQXQLWWJSNYFV-XEGUGMAKSA-N Gln-Ala-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)CCC(N)=O)C)C(O)=O)=CNC2=C1 OVQXQLWWJSNYFV-XEGUGMAKSA-N 0.000 description 1
- PRBLYKYHAJEABA-SRVKXCTJSA-N Gln-Arg-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O PRBLYKYHAJEABA-SRVKXCTJSA-N 0.000 description 1
- SOBBAYVQSNXYPQ-ACZMJKKPSA-N Gln-Asn-Asn Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O SOBBAYVQSNXYPQ-ACZMJKKPSA-N 0.000 description 1
- MGJMFSBEMSNYJL-AVGNSLFASA-N Gln-Asn-Tyr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O MGJMFSBEMSNYJL-AVGNSLFASA-N 0.000 description 1
- LMPBBFWHCRURJD-LAEOZQHASA-N Gln-Asn-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCC(=O)N)N LMPBBFWHCRURJD-LAEOZQHASA-N 0.000 description 1
- CYTSBCIIEHUPDU-ACZMJKKPSA-N Gln-Asp-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O CYTSBCIIEHUPDU-ACZMJKKPSA-N 0.000 description 1
- WQWMZOIPXWSZNE-WDSKDSINSA-N Gln-Asp-Gly Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O WQWMZOIPXWSZNE-WDSKDSINSA-N 0.000 description 1
- WLODHVXYKYHLJD-ACZMJKKPSA-N Gln-Asp-Ser Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CO)C(=O)O)N WLODHVXYKYHLJD-ACZMJKKPSA-N 0.000 description 1
- IXFVOPOHSRKJNG-LAEOZQHASA-N Gln-Asp-Val Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O IXFVOPOHSRKJNG-LAEOZQHASA-N 0.000 description 1
- UZMWDBOHAOSCCH-ACZMJKKPSA-N Gln-Cys-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CS)NC(=O)[C@@H](N)CCC(N)=O UZMWDBOHAOSCCH-ACZMJKKPSA-N 0.000 description 1
- VVWWRZZMPSPVQU-KBIXCLLPSA-N Gln-Cys-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CCC(=O)N)N VVWWRZZMPSPVQU-KBIXCLLPSA-N 0.000 description 1
- MFLMFRZBAJSGHK-ACZMJKKPSA-N Gln-Cys-Ser Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CO)C(=O)O)N MFLMFRZBAJSGHK-ACZMJKKPSA-N 0.000 description 1
- ZDJZEGYVKANKED-NRPADANISA-N Gln-Cys-Val Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CS)C(=O)N[C@@H](C(C)C)C(O)=O ZDJZEGYVKANKED-NRPADANISA-N 0.000 description 1
- QFJPFPCSXOXMKI-BPUTZDHNSA-N Gln-Gln-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCC(=O)N)N QFJPFPCSXOXMKI-BPUTZDHNSA-N 0.000 description 1
- PNENQZWRFMUZOM-DCAQKATOSA-N Gln-Glu-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O PNENQZWRFMUZOM-DCAQKATOSA-N 0.000 description 1
- MAGNEQBFSBREJL-DCAQKATOSA-N Gln-Glu-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCC(=O)N)N MAGNEQBFSBREJL-DCAQKATOSA-N 0.000 description 1
- XKBASPWPBXNVLQ-WDSKDSINSA-N Gln-Gly-Asn Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O XKBASPWPBXNVLQ-WDSKDSINSA-N 0.000 description 1
- MFJAPSYJQJCQDN-BQBZGAKWSA-N Gln-Gly-Glu Chemical compound NC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O MFJAPSYJQJCQDN-BQBZGAKWSA-N 0.000 description 1
- VGTDBGYFVWOQTI-RYUDHWBXSA-N Gln-Gly-Phe Chemical compound NC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 VGTDBGYFVWOQTI-RYUDHWBXSA-N 0.000 description 1
- GXMBDEGTXHQBAO-NKIYYHGXSA-N Gln-His-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CCC(=O)N)N)O GXMBDEGTXHQBAO-NKIYYHGXSA-N 0.000 description 1
- FTIJVMLAGRAYMJ-MNXVOIDGSA-N Gln-Ile-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCC(N)=O FTIJVMLAGRAYMJ-MNXVOIDGSA-N 0.000 description 1
- VZRAXPGTUNDIDK-GUBZILKMSA-N Gln-Leu-Asn Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCC(=O)N)N VZRAXPGTUNDIDK-GUBZILKMSA-N 0.000 description 1
- CAXXTYYGFYTBPV-IUCAKERBSA-N Gln-Leu-Gly Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O CAXXTYYGFYTBPV-IUCAKERBSA-N 0.000 description 1
- ZBKUIQNCRIYVGH-SDDRHHMPSA-N Gln-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)N)N ZBKUIQNCRIYVGH-SDDRHHMPSA-N 0.000 description 1
- FKXCBKCOSVIGCT-AVGNSLFASA-N Gln-Lys-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O FKXCBKCOSVIGCT-AVGNSLFASA-N 0.000 description 1
- QKWBEMCLYTYBNI-GVXVVHGQSA-N Gln-Lys-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCC(N)=O QKWBEMCLYTYBNI-GVXVVHGQSA-N 0.000 description 1
- QMVCEWKHIUHTSD-GUBZILKMSA-N Gln-Met-Glu Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CCC(=O)N)N QMVCEWKHIUHTSD-GUBZILKMSA-N 0.000 description 1
- FALJZCPMTGJOHX-SRVKXCTJSA-N Gln-Met-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(O)=O FALJZCPMTGJOHX-SRVKXCTJSA-N 0.000 description 1
- LHMWTCWZARHLPV-CIUDSAMLSA-N Gln-Met-Ser Chemical compound CSCC[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CCC(=O)N)N LHMWTCWZARHLPV-CIUDSAMLSA-N 0.000 description 1
- QBEWLBKBGXVVPD-RYUDHWBXSA-N Gln-Phe-Gly Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CCC(=O)N)N QBEWLBKBGXVVPD-RYUDHWBXSA-N 0.000 description 1
- WBYHRQBKJGEBQJ-CIUDSAMLSA-N Gln-Pro-Cys Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CCC(=O)N)N)C(=O)N[C@@H](CS)C(=O)O WBYHRQBKJGEBQJ-CIUDSAMLSA-N 0.000 description 1
- FQCILXROGNOZON-YUMQZZPRSA-N Gln-Pro-Gly Chemical compound NC(=O)CC[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O FQCILXROGNOZON-YUMQZZPRSA-N 0.000 description 1
- MQJDLNRXBOELJW-KKUMJFAQSA-N Gln-Pro-Phe Chemical compound N[C@@H](CCC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](Cc1ccccc1)C(O)=O MQJDLNRXBOELJW-KKUMJFAQSA-N 0.000 description 1
- UWMDGPFFTKDUIY-HJGDQZAQSA-N Gln-Pro-Thr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O UWMDGPFFTKDUIY-HJGDQZAQSA-N 0.000 description 1
- DCWNCMRZIZSZBL-KKUMJFAQSA-N Gln-Pro-Tyr Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CCC(=O)N)N)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O DCWNCMRZIZSZBL-KKUMJFAQSA-N 0.000 description 1
- KUBFPYIMAGXGBT-ACZMJKKPSA-N Gln-Ser-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O KUBFPYIMAGXGBT-ACZMJKKPSA-N 0.000 description 1
- LPIKVBWNNVFHCQ-GUBZILKMSA-N Gln-Ser-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O LPIKVBWNNVFHCQ-GUBZILKMSA-N 0.000 description 1
- ZGHMRONFHDVXEF-AVGNSLFASA-N Gln-Ser-Phe Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O ZGHMRONFHDVXEF-AVGNSLFASA-N 0.000 description 1
- GHAXJVNBAKGWEJ-AVGNSLFASA-N Gln-Ser-Tyr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O GHAXJVNBAKGWEJ-AVGNSLFASA-N 0.000 description 1
- DYVMTEWCGAVKSE-HJGDQZAQSA-N Gln-Thr-Arg Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](CCC(=O)N)N)O DYVMTEWCGAVKSE-HJGDQZAQSA-N 0.000 description 1
- NHMRJKKAVMENKJ-WDCWCFNPSA-N Gln-Thr-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O NHMRJKKAVMENKJ-WDCWCFNPSA-N 0.000 description 1
- XKPACHRGOWQHFH-IRIUXVKKSA-N Gln-Thr-Tyr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O XKPACHRGOWQHFH-IRIUXVKKSA-N 0.000 description 1
- IIMZHVKZBGSEKZ-SZMVWBNQSA-N Gln-Trp-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC(C)C)C(O)=O IIMZHVKZBGSEKZ-SZMVWBNQSA-N 0.000 description 1
- SGVGIVDZLSHSEN-RYUDHWBXSA-N Gln-Tyr-Gly Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(O)=O SGVGIVDZLSHSEN-RYUDHWBXSA-N 0.000 description 1
- SDSMVVSHLAAOJL-UKJIMTQDSA-N Gln-Val-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CCC(=O)N)N SDSMVVSHLAAOJL-UKJIMTQDSA-N 0.000 description 1
- VEYGCDYMOXHJLS-GVXVVHGQSA-N Gln-Val-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O VEYGCDYMOXHJLS-GVXVVHGQSA-N 0.000 description 1
- ZMXZGYLINVNTKH-DZKIICNBSA-N Gln-Val-Phe Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 ZMXZGYLINVNTKH-DZKIICNBSA-N 0.000 description 1
- KBKGRMNVKPSQIF-XDTLVQLUSA-N Glu-Ala-Tyr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O KBKGRMNVKPSQIF-XDTLVQLUSA-N 0.000 description 1
- DIXKFOPPGWKZLY-CIUDSAMLSA-N Glu-Arg-Asp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O DIXKFOPPGWKZLY-CIUDSAMLSA-N 0.000 description 1
- LTUVYLVIZHJCOQ-KKUMJFAQSA-N Glu-Arg-Phe Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O LTUVYLVIZHJCOQ-KKUMJFAQSA-N 0.000 description 1
- SRZLHYPAOXBBSB-HJGDQZAQSA-N Glu-Arg-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SRZLHYPAOXBBSB-HJGDQZAQSA-N 0.000 description 1
- GLWXKFRTOHKGIT-ACZMJKKPSA-N Glu-Asn-Asn Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O GLWXKFRTOHKGIT-ACZMJKKPSA-N 0.000 description 1
- LXAUHIRMWXQRKI-XHNCKOQMSA-N Glu-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCC(=O)O)N)C(=O)O LXAUHIRMWXQRKI-XHNCKOQMSA-N 0.000 description 1
- JPHYJQHPILOKHC-ACZMJKKPSA-N Glu-Asp-Asp Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O JPHYJQHPILOKHC-ACZMJKKPSA-N 0.000 description 1
- SAEBUDRWKUXLOM-ACZMJKKPSA-N Glu-Cys-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CS)NC(=O)[C@@H](N)CCC(O)=O SAEBUDRWKUXLOM-ACZMJKKPSA-N 0.000 description 1
- FKGNJUCQKXQNRA-NRPADANISA-N Glu-Cys-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CS)NC(=O)[C@@H](N)CCC(O)=O FKGNJUCQKXQNRA-NRPADANISA-N 0.000 description 1
- OXEMJGCAJFFREE-FXQIFTODSA-N Glu-Gln-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O OXEMJGCAJFFREE-FXQIFTODSA-N 0.000 description 1
- UMIRPYLZFKOEOH-YVNDNENWSA-N Glu-Gln-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O UMIRPYLZFKOEOH-YVNDNENWSA-N 0.000 description 1
- ILGFBUGLBSAQQB-GUBZILKMSA-N Glu-Glu-Arg Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O ILGFBUGLBSAQQB-GUBZILKMSA-N 0.000 description 1
- MUSGDMDGNGXULI-DCAQKATOSA-N Glu-Glu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCC(O)=O MUSGDMDGNGXULI-DCAQKATOSA-N 0.000 description 1
- UHVIQGKBMXEVGN-WDSKDSINSA-N Glu-Gly-Asn Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O UHVIQGKBMXEVGN-WDSKDSINSA-N 0.000 description 1
- KRGZZKWSBGPLKL-IUCAKERBSA-N Glu-Gly-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CCC(=O)O)N KRGZZKWSBGPLKL-IUCAKERBSA-N 0.000 description 1
- NJPQBTJSYCKCNS-HVTMNAMFSA-N Glu-His-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CCC(=O)O)N NJPQBTJSYCKCNS-HVTMNAMFSA-N 0.000 description 1
- WVYJNPCWJYBHJG-YVNDNENWSA-N Glu-Ile-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(N)=O)C(O)=O WVYJNPCWJYBHJG-YVNDNENWSA-N 0.000 description 1
- KRRFFAHEAOCBCQ-SIUGBPQLSA-N Glu-Ile-Tyr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O KRRFFAHEAOCBCQ-SIUGBPQLSA-N 0.000 description 1
- WNRZUESNGGDCJX-JYJNAYRXSA-N Glu-Leu-Phe Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O WNRZUESNGGDCJX-JYJNAYRXSA-N 0.000 description 1
- OQXDUSZKISQQSS-GUBZILKMSA-N Glu-Lys-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O OQXDUSZKISQQSS-GUBZILKMSA-N 0.000 description 1
- LKOAAMXDJGEYMS-ZPFDUUQYSA-N Glu-Met-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O LKOAAMXDJGEYMS-ZPFDUUQYSA-N 0.000 description 1
- GMAGZGCAYLQBKF-NHCYSSNCSA-N Glu-Met-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C(C)C)C(O)=O GMAGZGCAYLQBKF-NHCYSSNCSA-N 0.000 description 1
- FQFWFZWOHOEVMZ-IHRRRGAJSA-N Glu-Phe-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O FQFWFZWOHOEVMZ-IHRRRGAJSA-N 0.000 description 1
- FGSGPLRPQCZBSQ-AVGNSLFASA-N Glu-Phe-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O FGSGPLRPQCZBSQ-AVGNSLFASA-N 0.000 description 1
- ITVBKCZZLJUUHI-HTUGSXCWSA-N Glu-Phe-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ITVBKCZZLJUUHI-HTUGSXCWSA-N 0.000 description 1
- AAJHGGDRKHYSDH-GUBZILKMSA-N Glu-Pro-Gln Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CCC(=O)O)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O AAJHGGDRKHYSDH-GUBZILKMSA-N 0.000 description 1
- DXVOKNVIKORTHQ-GUBZILKMSA-N Glu-Pro-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O DXVOKNVIKORTHQ-GUBZILKMSA-N 0.000 description 1
- ZKONLKQGTNVAPR-DCAQKATOSA-N Glu-Pro-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CCC(=O)O)N ZKONLKQGTNVAPR-DCAQKATOSA-N 0.000 description 1
- ALMBZBOCGSVSAI-ACZMJKKPSA-N Glu-Ser-Asn Chemical compound C(CC(=O)O)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(=O)N)C(=O)O)N ALMBZBOCGSVSAI-ACZMJKKPSA-N 0.000 description 1
- GMVCSRBOSIUTFC-FXQIFTODSA-N Glu-Ser-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(O)=O GMVCSRBOSIUTFC-FXQIFTODSA-N 0.000 description 1
- ZAPFAWQHBOHWLL-GUBZILKMSA-N Glu-Ser-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCC(=O)O)N ZAPFAWQHBOHWLL-GUBZILKMSA-N 0.000 description 1
- SYAYROHMAIHWFB-KBIXCLLPSA-N Glu-Ser-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O SYAYROHMAIHWFB-KBIXCLLPSA-N 0.000 description 1
- BDISFWMLMNBTGP-NUMRIWBASA-N Glu-Thr-Asp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(O)=O BDISFWMLMNBTGP-NUMRIWBASA-N 0.000 description 1
- YQAQQKPWFOBSMU-WDCWCFNPSA-N Glu-Thr-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O YQAQQKPWFOBSMU-WDCWCFNPSA-N 0.000 description 1
- DTLLNDVORUEOTM-WDCWCFNPSA-N Glu-Thr-Lys Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(O)=O DTLLNDVORUEOTM-WDCWCFNPSA-N 0.000 description 1
- CQGBSALYGOXQPE-HTUGSXCWSA-N Glu-Thr-Phe Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CCC(=O)O)N)O CQGBSALYGOXQPE-HTUGSXCWSA-N 0.000 description 1
- HVKAAUOFFTUSAA-XDTLVQLUSA-N Glu-Tyr-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C)C(O)=O HVKAAUOFFTUSAA-XDTLVQLUSA-N 0.000 description 1
- QOOFKCCZZWTCEP-AVGNSLFASA-N Glu-Tyr-Cys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCC(=O)O)N)O QOOFKCCZZWTCEP-AVGNSLFASA-N 0.000 description 1
- BKMOHWJHXQLFEX-IRIUXVKKSA-N Glu-Tyr-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CCC(=O)O)N)O BKMOHWJHXQLFEX-IRIUXVKKSA-N 0.000 description 1
- YQPFCZVKMUVZIN-AUTRQRHGSA-N Glu-Val-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O YQPFCZVKMUVZIN-AUTRQRHGSA-N 0.000 description 1
- HQTDNEZTGZUWSY-XVKPBYJWSA-N Glu-Val-Gly Chemical compound CC(C)[C@H](NC(=O)[C@@H](N)CCC(O)=O)C(=O)NCC(O)=O HQTDNEZTGZUWSY-XVKPBYJWSA-N 0.000 description 1
- FVGOGEGGQLNZGH-DZKIICNBSA-N Glu-Val-Phe Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 FVGOGEGGQLNZGH-DZKIICNBSA-N 0.000 description 1
- WKJKBELXHCTHIJ-WPRPVWTQSA-N Gly-Arg-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCN=C(N)N WKJKBELXHCTHIJ-WPRPVWTQSA-N 0.000 description 1
- FMNHBTKMRFVGRO-FOHZUACHSA-N Gly-Asn-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)CN FMNHBTKMRFVGRO-FOHZUACHSA-N 0.000 description 1
- KQDMENMTYNBWMR-WHFBIAKZSA-N Gly-Asp-Ala Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O KQDMENMTYNBWMR-WHFBIAKZSA-N 0.000 description 1
- IWAXHBCACVWNHT-BQBZGAKWSA-N Gly-Asp-Arg Chemical compound NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N IWAXHBCACVWNHT-BQBZGAKWSA-N 0.000 description 1
- XEJTYSCIXKYSHR-WDSKDSINSA-N Gly-Asp-Gln Chemical compound C(CC(=O)N)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)CN XEJTYSCIXKYSHR-WDSKDSINSA-N 0.000 description 1
- XBWMTPAIUQIWKA-BYULHYEWSA-N Gly-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)CN XBWMTPAIUQIWKA-BYULHYEWSA-N 0.000 description 1
- ZRZILYKEJBMFHY-BQBZGAKWSA-N Gly-Asp-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)CN ZRZILYKEJBMFHY-BQBZGAKWSA-N 0.000 description 1
- SABZDFAAOJATBR-QWRGUYRKSA-N Gly-Cys-Phe Chemical compound [H]NCC(=O)N[C@@H](CS)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O SABZDFAAOJATBR-QWRGUYRKSA-N 0.000 description 1
- CQZDZKRHFWJXDF-WDSKDSINSA-N Gly-Gln-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CCC(N)=O)NC(=O)CN CQZDZKRHFWJXDF-WDSKDSINSA-N 0.000 description 1
- BPQYBFAXRGMGGY-LAEOZQHASA-N Gly-Gln-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)CN BPQYBFAXRGMGGY-LAEOZQHASA-N 0.000 description 1
- NPSWCZIRBAYNSB-JHEQGTHGSA-N Gly-Gln-Thr Chemical compound [H]NCC(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O NPSWCZIRBAYNSB-JHEQGTHGSA-N 0.000 description 1
- MOJKRXIRAZPZLW-WDSKDSINSA-N Gly-Glu-Ala Chemical compound [H]NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O MOJKRXIRAZPZLW-WDSKDSINSA-N 0.000 description 1
- DHDOADIPGZTAHT-YUMQZZPRSA-N Gly-Glu-Arg Chemical compound NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N DHDOADIPGZTAHT-YUMQZZPRSA-N 0.000 description 1
- HQRHFUYMGCHHJS-LURJTMIESA-N Gly-Gly-Arg Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CCCN=C(N)N HQRHFUYMGCHHJS-LURJTMIESA-N 0.000 description 1
- XPJBQTCXPJNIFE-ZETCQYMHSA-N Gly-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)CN XPJBQTCXPJNIFE-ZETCQYMHSA-N 0.000 description 1
- QPCVIQJVRGXUSA-LURJTMIESA-N Gly-Gly-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)CNC(=O)CN QPCVIQJVRGXUSA-LURJTMIESA-N 0.000 description 1
- ALOBJFDJTMQQPW-ONGXEEELSA-N Gly-His-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)CN ALOBJFDJTMQQPW-ONGXEEELSA-N 0.000 description 1
- LUJVWKKYHSLULQ-ZKWXMUAHSA-N Gly-Ile-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)CN LUJVWKKYHSLULQ-ZKWXMUAHSA-N 0.000 description 1
- AAHSHTLISQUZJL-QSFUFRPTSA-N Gly-Ile-Ile Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O AAHSHTLISQUZJL-QSFUFRPTSA-N 0.000 description 1
- UESJMAMHDLEHGM-NHCYSSNCSA-N Gly-Ile-Leu Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O UESJMAMHDLEHGM-NHCYSSNCSA-N 0.000 description 1
- BHPQOIPBLYJNAW-NGZCFLSTSA-N Gly-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)CN BHPQOIPBLYJNAW-NGZCFLSTSA-N 0.000 description 1
- XVYKMNXXJXQKME-XEGUGMAKSA-N Gly-Ile-Tyr Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 XVYKMNXXJXQKME-XEGUGMAKSA-N 0.000 description 1
- PAWIVEIWWYGBAM-YUMQZZPRSA-N Gly-Leu-Ala Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O PAWIVEIWWYGBAM-YUMQZZPRSA-N 0.000 description 1
- YIFUFYZELCMPJP-YUMQZZPRSA-N Gly-Leu-Cys Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CS)C(O)=O YIFUFYZELCMPJP-YUMQZZPRSA-N 0.000 description 1
- LRQXRHGQEVWGPV-NHCYSSNCSA-N Gly-Leu-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)CN LRQXRHGQEVWGPV-NHCYSSNCSA-N 0.000 description 1
- LHYJCVCQPWRMKZ-WEDXCCLWSA-N Gly-Leu-Thr Chemical compound [H]NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LHYJCVCQPWRMKZ-WEDXCCLWSA-N 0.000 description 1
- JPAACTMBBBGAAR-HOTGVXAUSA-N Gly-Leu-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](NC(=O)CN)CC(C)C)C(O)=O)=CNC2=C1 JPAACTMBBBGAAR-HOTGVXAUSA-N 0.000 description 1
- FHQRLHFYVZAQHU-IUCAKERBSA-N Gly-Lys-Gln Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(O)=O FHQRLHFYVZAQHU-IUCAKERBSA-N 0.000 description 1
- PDUHNKAFQXQNLH-ZETCQYMHSA-N Gly-Lys-Gly Chemical compound NCCCC[C@H](NC(=O)CN)C(=O)NCC(O)=O PDUHNKAFQXQNLH-ZETCQYMHSA-N 0.000 description 1
- PTIIBFKSLCYQBO-NHCYSSNCSA-N Gly-Lys-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)CN PTIIBFKSLCYQBO-NHCYSSNCSA-N 0.000 description 1
- CVFOYJJOZYYEPE-KBPBESRZSA-N Gly-Lys-Tyr Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O CVFOYJJOZYYEPE-KBPBESRZSA-N 0.000 description 1
- OQQKUTVULYLCDG-ONGXEEELSA-N Gly-Lys-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CCCCN)NC(=O)CN)C(O)=O OQQKUTVULYLCDG-ONGXEEELSA-N 0.000 description 1
- IFHJOBKVXBESRE-YUMQZZPRSA-N Gly-Met-Gln Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)CN IFHJOBKVXBESRE-YUMQZZPRSA-N 0.000 description 1
- WMGHDYWNHNLGBV-ONGXEEELSA-N Gly-Phe-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H](NC(=O)CN)CC1=CC=CC=C1 WMGHDYWNHNLGBV-ONGXEEELSA-N 0.000 description 1
- FXLVSYVJDPCIHH-STQMWFEESA-N Gly-Phe-Arg Chemical compound [H]NCC(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O FXLVSYVJDPCIHH-STQMWFEESA-N 0.000 description 1
- WNZOCXUOGVYYBJ-CDMKHQONSA-N Gly-Phe-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)CN)O WNZOCXUOGVYYBJ-CDMKHQONSA-N 0.000 description 1
- IRJWAYCXIYUHQE-WHFBIAKZSA-N Gly-Ser-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CO)NC(=O)CN IRJWAYCXIYUHQE-WHFBIAKZSA-N 0.000 description 1
- NVTPVQLIZCOJFK-FOHZUACHSA-N Gly-Thr-Asp Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(O)=O NVTPVQLIZCOJFK-FOHZUACHSA-N 0.000 description 1
- JQFILXICXLDTRR-FBCQKBJTSA-N Gly-Thr-Gly Chemical compound NCC(=O)N[C@@H]([C@H](O)C)C(=O)NCC(O)=O JQFILXICXLDTRR-FBCQKBJTSA-N 0.000 description 1
- RHRLHXQWHCNJKR-PMVVWTBXSA-N Gly-Thr-His Chemical compound NCC(=O)N[C@@H]([C@H](O)C)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 RHRLHXQWHCNJKR-PMVVWTBXSA-N 0.000 description 1
- FFALDIDGPLUDKV-ZDLURKLDSA-N Gly-Thr-Ser Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O FFALDIDGPLUDKV-ZDLURKLDSA-N 0.000 description 1
- RIUZKUJUPVFAGY-HOTGVXAUSA-N Gly-Trp-His Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC3=CN=CN3)C(=O)O)NC(=O)CN RIUZKUJUPVFAGY-HOTGVXAUSA-N 0.000 description 1
- WTUSRDZLLWGYAT-KCTSRDHCSA-N Gly-Trp-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)CN WTUSRDZLLWGYAT-KCTSRDHCSA-N 0.000 description 1
- UMBDRSMLCUYIRI-DVJZZOLTSA-N Gly-Trp-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)CN)O UMBDRSMLCUYIRI-DVJZZOLTSA-N 0.000 description 1
- GWNIGUKSRJBIHX-STQMWFEESA-N Gly-Tyr-Arg Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)CN)O GWNIGUKSRJBIHX-STQMWFEESA-N 0.000 description 1
- HQSKKSLNLSTONK-JTQLQIEISA-N Gly-Tyr-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)CN)CC1=CC=C(O)C=C1 HQSKKSLNLSTONK-JTQLQIEISA-N 0.000 description 1
- KBBFOULZCHWGJX-KBPBESRZSA-N Gly-Tyr-His Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)NC(=O)CN)O KBBFOULZCHWGJX-KBPBESRZSA-N 0.000 description 1
- OCRQUYDOYKCOQG-IRXDYDNUSA-N Gly-Tyr-Phe Chemical compound C([C@H](NC(=O)CN)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=C(O)C=C1 OCRQUYDOYKCOQG-IRXDYDNUSA-N 0.000 description 1
- DUAWRXXTOQOECJ-JSGCOSHPSA-N Gly-Tyr-Val Chemical compound [H]NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(O)=O DUAWRXXTOQOECJ-JSGCOSHPSA-N 0.000 description 1
- YDIDLLVFCYSXNY-RCOVLWMOSA-N Gly-Val-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)CN YDIDLLVFCYSXNY-RCOVLWMOSA-N 0.000 description 1
- DKJWUIYLMLUBDX-XPUUQOCRSA-N Gly-Val-Cys Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CS)C(=O)O DKJWUIYLMLUBDX-XPUUQOCRSA-N 0.000 description 1
- ZVXMEWXHFBYJPI-LSJOCFKGSA-N Gly-Val-Ile Chemical compound [H]NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O ZVXMEWXHFBYJPI-LSJOCFKGSA-N 0.000 description 1
- AFMOTCMSEBITOE-YEPSODPASA-N Gly-Val-Thr Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O AFMOTCMSEBITOE-YEPSODPASA-N 0.000 description 1
- 239000004471 Glycine Substances 0.000 description 1
- 239000007995 HEPES buffer Substances 0.000 description 1
- 208000032843 Hemorrhage Diseases 0.000 description 1
- KZTLOHBDLMIFSH-XVYDVKMFSA-N His-Ala-Asp Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(O)=O KZTLOHBDLMIFSH-XVYDVKMFSA-N 0.000 description 1
- FLUVGKKRRMLNPU-CQDKDKBSSA-N His-Ala-Phe Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O FLUVGKKRRMLNPU-CQDKDKBSSA-N 0.000 description 1
- HTZKFIYQMHJWSQ-INTQDDNPSA-N His-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CN=CN2)N HTZKFIYQMHJWSQ-INTQDDNPSA-N 0.000 description 1
- AWASVTXPTOLPPP-MBLNEYKQSA-N His-Ala-Thr Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O AWASVTXPTOLPPP-MBLNEYKQSA-N 0.000 description 1
- OMNVOTCFQQLEQU-CIUDSAMLSA-N His-Asn-Asp Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N OMNVOTCFQQLEQU-CIUDSAMLSA-N 0.000 description 1
- WGVPDSNCHDEDBP-KKUMJFAQSA-N His-Asp-Phe Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O WGVPDSNCHDEDBP-KKUMJFAQSA-N 0.000 description 1
- NWGXCPUKPVISSJ-AVGNSLFASA-N His-Gln-Lys Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N NWGXCPUKPVISSJ-AVGNSLFASA-N 0.000 description 1
- TXLQHACKRLWYCM-DCAQKATOSA-N His-Glu-Glu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O TXLQHACKRLWYCM-DCAQKATOSA-N 0.000 description 1
- VBOFRJNDIOPNDO-YUMQZZPRSA-N His-Gly-Asn Chemical compound C1=C(NC=N1)C[C@@H](C(=O)NCC(=O)N[C@@H](CC(=O)N)C(=O)O)N VBOFRJNDIOPNDO-YUMQZZPRSA-N 0.000 description 1
- RAVLQPXCMRCLKT-KBPBESRZSA-N His-Gly-Phe Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)NCC(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O RAVLQPXCMRCLKT-KBPBESRZSA-N 0.000 description 1
- VTZYMXGGXOFBMX-DJFWLOJKSA-N His-Ile-Asp Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O VTZYMXGGXOFBMX-DJFWLOJKSA-N 0.000 description 1
- YAALVYQFVJNXIV-KKUMJFAQSA-N His-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CN=CN1 YAALVYQFVJNXIV-KKUMJFAQSA-N 0.000 description 1
- MJUUWJJEUOBDGW-IHRRRGAJSA-N His-Leu-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CN=CN1 MJUUWJJEUOBDGW-IHRRRGAJSA-N 0.000 description 1
- KHUFDBQXGLEIHC-BZSNNMDCSA-N His-Leu-Tyr Chemical compound C([C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CN=CN1 KHUFDBQXGLEIHC-BZSNNMDCSA-N 0.000 description 1
- TTYKEFZRLKQTHH-MELADBBJSA-N His-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)[C@H](CC2=CN=CN2)N)C(=O)O TTYKEFZRLKQTHH-MELADBBJSA-N 0.000 description 1
- HYWZHNUGAYVEEW-KKUMJFAQSA-N His-Phe-Ser Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC2=CN=CN2)N HYWZHNUGAYVEEW-KKUMJFAQSA-N 0.000 description 1
- STGQSBKUYSPPIG-CIUDSAMLSA-N His-Ser-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CN=CN1 STGQSBKUYSPPIG-CIUDSAMLSA-N 0.000 description 1
- CUEQQFOGARVNHU-VGDYDELISA-N His-Ser-Ile Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O CUEQQFOGARVNHU-VGDYDELISA-N 0.000 description 1
- ILUVWFTXAUYOBW-CUJWVEQBSA-N His-Ser-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CC1=CN=CN1)N)O ILUVWFTXAUYOBW-CUJWVEQBSA-N 0.000 description 1
- UWSMZKRTOZEGDD-CUJWVEQBSA-N His-Thr-Ser Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O UWSMZKRTOZEGDD-CUJWVEQBSA-N 0.000 description 1
- KECFCPNPPYCGBL-PMVMPFDFSA-N His-Trp-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)NC(=O)[C@H](CC4=CN=CN4)N KECFCPNPPYCGBL-PMVMPFDFSA-N 0.000 description 1
- CSTDQOOBZBAJKE-BWAGICSOSA-N His-Tyr-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CC2=CN=CN2)N)O CSTDQOOBZBAJKE-BWAGICSOSA-N 0.000 description 1
- BCSGDNGNHKBRRJ-ULQDDVLXSA-N His-Tyr-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CC2=CN=CN2)N BCSGDNGNHKBRRJ-ULQDDVLXSA-N 0.000 description 1
- KFQDSSNYWKZFOO-LSJOCFKGSA-N His-Val-Ala Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O KFQDSSNYWKZFOO-LSJOCFKGSA-N 0.000 description 1
- WYKXJGWSJUULSL-AVGNSLFASA-N His-Val-Arg Chemical compound CC(C)[C@H](NC(=O)[C@@H](N)Cc1cnc[nH]1)C(=O)N[C@@H](CCCNC(=N)N)C(=O)O WYKXJGWSJUULSL-AVGNSLFASA-N 0.000 description 1
- KDDKJKKQODQQBR-NHCYSSNCSA-N His-Val-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N KDDKJKKQODQQBR-NHCYSSNCSA-N 0.000 description 1
- QLBXWYXMLHAREM-PYJNHQTQSA-N His-Val-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CC1=CN=CN1)N QLBXWYXMLHAREM-PYJNHQTQSA-N 0.000 description 1
- 241000282412 Homo Species 0.000 description 1
- 101000929928 Homo sapiens Angiotensin-converting enzyme 2 Proteins 0.000 description 1
- QICVAHODWHIWIS-HTFCKZLJSA-N Ile-Ala-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N QICVAHODWHIWIS-HTFCKZLJSA-N 0.000 description 1
- TZCGZYWNIDZZMR-NAKRPEOUSA-N Ile-Arg-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](C)C(=O)O)N TZCGZYWNIDZZMR-NAKRPEOUSA-N 0.000 description 1
- ASCFJMSGKUIRDU-ZPFDUUQYSA-N Ile-Arg-Gln Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(O)=O ASCFJMSGKUIRDU-ZPFDUUQYSA-N 0.000 description 1
- YOTNPRLPIPHQSB-XUXIUFHCSA-N Ile-Arg-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCCN)C(=O)O)N YOTNPRLPIPHQSB-XUXIUFHCSA-N 0.000 description 1
- QADCTXFNLZBZAB-GHCJXIJMSA-N Ile-Asn-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](C)C(=O)O)N QADCTXFNLZBZAB-GHCJXIJMSA-N 0.000 description 1
- YKRIXHPEIZUDDY-GMOBBJLQSA-N Ile-Asn-Arg Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N YKRIXHPEIZUDDY-GMOBBJLQSA-N 0.000 description 1
- QYZYJFXHXYUZMZ-UGYAYLCHSA-N Ile-Asn-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)N)C(=O)O)N QYZYJFXHXYUZMZ-UGYAYLCHSA-N 0.000 description 1
- FJWYJQRCVNGEAQ-ZPFDUUQYSA-N Ile-Asn-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N FJWYJQRCVNGEAQ-ZPFDUUQYSA-N 0.000 description 1
- NCSIQAFSIPHVAN-IUKAMOBKSA-N Ile-Asn-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N NCSIQAFSIPHVAN-IUKAMOBKSA-N 0.000 description 1
- UMYZBHKAVTXWIW-GMOBBJLQSA-N Ile-Asp-Arg Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N UMYZBHKAVTXWIW-GMOBBJLQSA-N 0.000 description 1
- NKRJALPCDNXULF-BYULHYEWSA-N Ile-Asp-Gly Chemical compound [H]N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O NKRJALPCDNXULF-BYULHYEWSA-N 0.000 description 1
- RGSOCXHDOPQREB-ZPFDUUQYSA-N Ile-Asp-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(C)C)C(=O)O)N RGSOCXHDOPQREB-ZPFDUUQYSA-N 0.000 description 1
- HGNUKGZQASSBKQ-PCBIJLKTSA-N Ile-Asp-Phe Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N HGNUKGZQASSBKQ-PCBIJLKTSA-N 0.000 description 1
- LLZLRXBTOOFODM-QSFUFRPTSA-N Ile-Asp-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](C(C)C)C(=O)O)N LLZLRXBTOOFODM-QSFUFRPTSA-N 0.000 description 1
- LOXMWQOKYBGCHF-JBDRJPRFSA-N Ile-Cys-Ala Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@@H](C)C(O)=O LOXMWQOKYBGCHF-JBDRJPRFSA-N 0.000 description 1
- PFTFEWHJSAXGED-ZKWXMUAHSA-N Ile-Cys-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)NCC(=O)O)N PFTFEWHJSAXGED-ZKWXMUAHSA-N 0.000 description 1
- DURWCDDDAWVPOP-JBDRJPRFSA-N Ile-Cys-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CO)C(=O)O)N DURWCDDDAWVPOP-JBDRJPRFSA-N 0.000 description 1
- VQUCKIAECLVLAD-SVSWQMSJSA-N Ile-Cys-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N VQUCKIAECLVLAD-SVSWQMSJSA-N 0.000 description 1
- NZOCIWKZUVUNDW-ZKWXMUAHSA-N Ile-Gly-Ala Chemical compound CC[C@H](C)[C@H](N)C(=O)NCC(=O)N[C@@H](C)C(O)=O NZOCIWKZUVUNDW-ZKWXMUAHSA-N 0.000 description 1
- SLQVFYWBGNNOTK-BYULHYEWSA-N Ile-Gly-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N[C@@H](CC(=O)N)C(=O)O)N SLQVFYWBGNNOTK-BYULHYEWSA-N 0.000 description 1
- KFVUBLZRFSVDGO-BYULHYEWSA-N Ile-Gly-Asp Chemical compound CC[C@H](C)[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC(O)=O KFVUBLZRFSVDGO-BYULHYEWSA-N 0.000 description 1
- OEQKGSPBDVKYOC-ZKWXMUAHSA-N Ile-Gly-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N[C@@H](CS)C(=O)O)N OEQKGSPBDVKYOC-ZKWXMUAHSA-N 0.000 description 1
- NYEYYMLUABXDMC-NHCYSSNCSA-N Ile-Gly-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N[C@@H](CC(C)C)C(=O)O)N NYEYYMLUABXDMC-NHCYSSNCSA-N 0.000 description 1
- UAQSZXGJGLHMNV-XEGUGMAKSA-N Ile-Gly-Tyr Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N UAQSZXGJGLHMNV-XEGUGMAKSA-N 0.000 description 1
- APDIECQNNDGFPD-PYJNHQTQSA-N Ile-His-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](C(C)C)C(=O)O)N APDIECQNNDGFPD-PYJNHQTQSA-N 0.000 description 1
- WIZPFZKOFZXDQG-HTFCKZLJSA-N Ile-Ile-Ala Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O WIZPFZKOFZXDQG-HTFCKZLJSA-N 0.000 description 1
- PFPUFNLHBXKPHY-HTFCKZLJSA-N Ile-Ile-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(=O)O)N PFPUFNLHBXKPHY-HTFCKZLJSA-N 0.000 description 1
- AXNGDPAKKCEKGY-QPHKQPEJSA-N Ile-Ile-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N AXNGDPAKKCEKGY-QPHKQPEJSA-N 0.000 description 1
- PHRWFSFCNJPWRO-PPCPHDFISA-N Ile-Leu-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N PHRWFSFCNJPWRO-PPCPHDFISA-N 0.000 description 1
- RFMDODRWJZHZCR-BJDJZHNGSA-N Ile-Lys-Cys Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CS)C(O)=O RFMDODRWJZHZCR-BJDJZHNGSA-N 0.000 description 1
- CEPIAEUVRKGPGP-DSYPUSFNSA-N Ile-Lys-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)[C@@H](C)CC)C(O)=O)=CNC2=C1 CEPIAEUVRKGPGP-DSYPUSFNSA-N 0.000 description 1
- UDBPXJNOEWDBDF-XUXIUFHCSA-N Ile-Lys-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(=O)O)N UDBPXJNOEWDBDF-XUXIUFHCSA-N 0.000 description 1
- VEPIBPGLTLPBDW-URLPEUOOSA-N Ile-Phe-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N VEPIBPGLTLPBDW-URLPEUOOSA-N 0.000 description 1
- IITVUURPOYGCTD-NAKRPEOUSA-N Ile-Pro-Ala Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O IITVUURPOYGCTD-NAKRPEOUSA-N 0.000 description 1
- OWSWUWDMSNXTNE-GMOBBJLQSA-N Ile-Pro-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(=O)O)C(=O)O)N OWSWUWDMSNXTNE-GMOBBJLQSA-N 0.000 description 1
- IVXJIMGDOYRLQU-XUXIUFHCSA-N Ile-Pro-Leu Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O IVXJIMGDOYRLQU-XUXIUFHCSA-N 0.000 description 1
- CZWANIQKACCEKW-CYDGBPFRSA-N Ile-Pro-Met Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCSC)C(=O)O)N CZWANIQKACCEKW-CYDGBPFRSA-N 0.000 description 1
- JHNJNTMTZHEDLJ-NAKRPEOUSA-N Ile-Ser-Arg Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O JHNJNTMTZHEDLJ-NAKRPEOUSA-N 0.000 description 1
- ZLFNNVATRMCAKN-ZKWXMUAHSA-N Ile-Ser-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)NCC(=O)O)N ZLFNNVATRMCAKN-ZKWXMUAHSA-N 0.000 description 1
- QQVXERGIFIRCGW-NAKRPEOUSA-N Ile-Ser-Met Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCSC)C(=O)O)N QQVXERGIFIRCGW-NAKRPEOUSA-N 0.000 description 1
- HXIDVIFHRYRXLZ-NAKRPEOUSA-N Ile-Ser-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)O)N HXIDVIFHRYRXLZ-NAKRPEOUSA-N 0.000 description 1
- PZWBBXHHUSIGKH-OSUNSFLBSA-N Ile-Thr-Arg Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N PZWBBXHHUSIGKH-OSUNSFLBSA-N 0.000 description 1
- YBKKLDBBPFIXBQ-MBLNEYKQSA-N Ile-Thr-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(=O)O)N YBKKLDBBPFIXBQ-MBLNEYKQSA-N 0.000 description 1
- FXJLRZFMKGHYJP-CFMVVWHZSA-N Ile-Tyr-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N FXJLRZFMKGHYJP-CFMVVWHZSA-N 0.000 description 1
- ZUWSVOYKBCHLRR-MGHWNKPDSA-N Ile-Tyr-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCCCN)C(=O)O)N ZUWSVOYKBCHLRR-MGHWNKPDSA-N 0.000 description 1
- NGKPIPCGMLWHBX-WZLNRYEVSA-N Ile-Tyr-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N NGKPIPCGMLWHBX-WZLNRYEVSA-N 0.000 description 1
- KXUKTDGKLAOCQK-LSJOCFKGSA-N Ile-Val-Gly Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O KXUKTDGKLAOCQK-LSJOCFKGSA-N 0.000 description 1
- NJGXXYLPDMMFJB-XUXIUFHCSA-N Ile-Val-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCCN)C(=O)O)N NJGXXYLPDMMFJB-XUXIUFHCSA-N 0.000 description 1
- YHFPHRUWZMEOIX-CYDGBPFRSA-N Ile-Val-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C(C)C)C(=O)O)N YHFPHRUWZMEOIX-CYDGBPFRSA-N 0.000 description 1
- 241000371980 Influenza B virus (B/Shanghai/361/2002) Species 0.000 description 1
- 229940124873 Influenza virus vaccine Drugs 0.000 description 1
- 102100037850 Interferon gamma Human genes 0.000 description 1
- 108010074328 Interferon-gamma Proteins 0.000 description 1
- 229940124868 Japanese encephalitis virus vaccine Drugs 0.000 description 1
- 102000011782 Keratins Human genes 0.000 description 1
- 108010076876 Keratins Proteins 0.000 description 1
- CKLJMWTZIZZHCS-REOHCLBHSA-N L-aspartic acid Chemical compound OC(=O)[C@@H](N)CC(O)=O CKLJMWTZIZZHCS-REOHCLBHSA-N 0.000 description 1
- WHUUTDBJXJRKMK-VKHMYHEASA-N L-glutamic acid Chemical compound OC(=O)[C@@H](N)CCC(O)=O WHUUTDBJXJRKMK-VKHMYHEASA-N 0.000 description 1
- HNDVDQJCIGZPNO-YFKPBYRVSA-N L-histidine Chemical compound OC(=O)[C@@H](N)CC1=CN=CN1 HNDVDQJCIGZPNO-YFKPBYRVSA-N 0.000 description 1
- SENJXOPIZNYLHU-UHFFFAOYSA-N L-leucyl-L-arginine Natural products CC(C)CC(N)C(=O)NC(C(O)=O)CCCN=C(N)N SENJXOPIZNYLHU-UHFFFAOYSA-N 0.000 description 1
- OUYCCCASQSFEME-QMMMGPOBSA-N L-tyrosine Chemical compound OC(=O)[C@@H](N)CC1=CC=C(O)C=C1 OUYCCCASQSFEME-QMMMGPOBSA-N 0.000 description 1
- CZCSUZMIRKFFFA-CIUDSAMLSA-N Leu-Ala-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(O)=O CZCSUZMIRKFFFA-CIUDSAMLSA-N 0.000 description 1
- WNGVUZWBXZKQES-YUMQZZPRSA-N Leu-Ala-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O WNGVUZWBXZKQES-YUMQZZPRSA-N 0.000 description 1
- HXWALXSAVBLTPK-NUTKFTJISA-N Leu-Ala-Trp Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CC(C)C)N HXWALXSAVBLTPK-NUTKFTJISA-N 0.000 description 1
- CNNQBZRGQATKNY-DCAQKATOSA-N Leu-Arg-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CS)C(=O)O)N CNNQBZRGQATKNY-DCAQKATOSA-N 0.000 description 1
- FJUKMPUELVROGK-IHRRRGAJSA-N Leu-Arg-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N FJUKMPUELVROGK-IHRRRGAJSA-N 0.000 description 1
- DBVWMYGBVFCRBE-CIUDSAMLSA-N Leu-Asn-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O DBVWMYGBVFCRBE-CIUDSAMLSA-N 0.000 description 1
- OXKYZSRZKBTVEY-ZPFDUUQYSA-N Leu-Asn-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O OXKYZSRZKBTVEY-ZPFDUUQYSA-N 0.000 description 1
- WGNOPSQMIQERPK-GARJFASQSA-N Leu-Asn-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N1CCC[C@@H]1C(=O)O)N WGNOPSQMIQERPK-GARJFASQSA-N 0.000 description 1
- WGNOPSQMIQERPK-UHFFFAOYSA-N Leu-Asn-Pro Natural products CC(C)CC(N)C(=O)NC(CC(=O)N)C(=O)N1CCCC1C(=O)O WGNOPSQMIQERPK-UHFFFAOYSA-N 0.000 description 1
- FIJMQLGQLBLBOL-HJGDQZAQSA-N Leu-Asn-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O FIJMQLGQLBLBOL-HJGDQZAQSA-N 0.000 description 1
- BPANDPNDMJHFEV-CIUDSAMLSA-N Leu-Asp-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O BPANDPNDMJHFEV-CIUDSAMLSA-N 0.000 description 1
- MMEDVBWCMGRKKC-GARJFASQSA-N Leu-Asp-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N MMEDVBWCMGRKKC-GARJFASQSA-N 0.000 description 1
- KWURTLAFFDOTEQ-GUBZILKMSA-N Leu-Cys-Glu Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N KWURTLAFFDOTEQ-GUBZILKMSA-N 0.000 description 1
- VQPPIMUZCZCOIL-GUBZILKMSA-N Leu-Gln-Ala Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O VQPPIMUZCZCOIL-GUBZILKMSA-N 0.000 description 1
- ZYLJULGXQDNXDK-GUBZILKMSA-N Leu-Gln-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O ZYLJULGXQDNXDK-GUBZILKMSA-N 0.000 description 1
- KAFOIVJDVSZUMD-DCAQKATOSA-N Leu-Gln-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O KAFOIVJDVSZUMD-DCAQKATOSA-N 0.000 description 1
- DPWGZWUMUUJQDT-IUCAKERBSA-N Leu-Gln-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O DPWGZWUMUUJQDT-IUCAKERBSA-N 0.000 description 1
- BOFAFKVZQUMTID-AVGNSLFASA-N Leu-Gln-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N BOFAFKVZQUMTID-AVGNSLFASA-N 0.000 description 1
- CIVKXGPFXDIQBV-WDCWCFNPSA-N Leu-Gln-Thr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O CIVKXGPFXDIQBV-WDCWCFNPSA-N 0.000 description 1
- KUEVMUXNILMJTK-JYJNAYRXSA-N Leu-Gln-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 KUEVMUXNILMJTK-JYJNAYRXSA-N 0.000 description 1
- HFBCHNRFRYLZNV-GUBZILKMSA-N Leu-Glu-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O HFBCHNRFRYLZNV-GUBZILKMSA-N 0.000 description 1
- NEEOBPIXKWSBRF-IUCAKERBSA-N Leu-Glu-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O NEEOBPIXKWSBRF-IUCAKERBSA-N 0.000 description 1
- QVFGXCVIXXBFHO-AVGNSLFASA-N Leu-Glu-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O QVFGXCVIXXBFHO-AVGNSLFASA-N 0.000 description 1
- FIYMBBHGYNQFOP-IUCAKERBSA-N Leu-Gly-Gln Chemical compound CC(C)C[C@@H](C(=O)NCC(=O)N[C@@H](CCC(=O)N)C(=O)O)N FIYMBBHGYNQFOP-IUCAKERBSA-N 0.000 description 1
- APFJUBGRZGMQFF-QWRGUYRKSA-N Leu-Gly-Lys Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCCN APFJUBGRZGMQFF-QWRGUYRKSA-N 0.000 description 1
- YWYQSLOTVIRCFE-SRVKXCTJSA-N Leu-His-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(O)=O)C(O)=O YWYQSLOTVIRCFE-SRVKXCTJSA-N 0.000 description 1
- DDEMUMVXNFPDKC-SRVKXCTJSA-N Leu-His-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CS)C(=O)O)N DDEMUMVXNFPDKC-SRVKXCTJSA-N 0.000 description 1
- WRLPVDVHNWSSCL-MELADBBJSA-N Leu-His-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N2CCC[C@@H]2C(=O)O)N WRLPVDVHNWSSCL-MELADBBJSA-N 0.000 description 1
- OYQUOLRTJHWVSQ-SRVKXCTJSA-N Leu-His-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(O)=O OYQUOLRTJHWVSQ-SRVKXCTJSA-N 0.000 description 1
- SGIIOQQGLUUMDQ-IHRRRGAJSA-N Leu-His-Val Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](C(C)C)C(=O)O)N SGIIOQQGLUUMDQ-IHRRRGAJSA-N 0.000 description 1
- DBSLVQBXKVKDKJ-BJDJZHNGSA-N Leu-Ile-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O DBSLVQBXKVKDKJ-BJDJZHNGSA-N 0.000 description 1
- AVEGDIAXTDVBJS-XUXIUFHCSA-N Leu-Ile-Arg Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O AVEGDIAXTDVBJS-XUXIUFHCSA-N 0.000 description 1
- USLNHQZCDQJBOV-ZPFDUUQYSA-N Leu-Ile-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O USLNHQZCDQJBOV-ZPFDUUQYSA-N 0.000 description 1
- ORWTWZXGDBYVCP-BJDJZHNGSA-N Leu-Ile-Cys Chemical compound SC[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CC(C)C ORWTWZXGDBYVCP-BJDJZHNGSA-N 0.000 description 1
- OMHLATXVNQSALM-FQUUOJAGSA-N Leu-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(C)C)N OMHLATXVNQSALM-FQUUOJAGSA-N 0.000 description 1
- HRTRLSRYZZKPCO-BJDJZHNGSA-N Leu-Ile-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O HRTRLSRYZZKPCO-BJDJZHNGSA-N 0.000 description 1
- IAJFFZORSWOZPQ-SRVKXCTJSA-N Leu-Leu-Asn Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O IAJFFZORSWOZPQ-SRVKXCTJSA-N 0.000 description 1
- JNDYEOUZBLOVOF-AVGNSLFASA-N Leu-Leu-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O JNDYEOUZBLOVOF-AVGNSLFASA-N 0.000 description 1
- QNBVTHNJGCOVFA-AVGNSLFASA-N Leu-Leu-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCC(O)=O QNBVTHNJGCOVFA-AVGNSLFASA-N 0.000 description 1
- YOKVEHGYYQEQOP-QWRGUYRKSA-N Leu-Leu-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O YOKVEHGYYQEQOP-QWRGUYRKSA-N 0.000 description 1
- FAELBUXXFQLUAX-AJNGGQMLSA-N Leu-Leu-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC(C)C FAELBUXXFQLUAX-AJNGGQMLSA-N 0.000 description 1
- UBZGNBKMIJHOHL-BZSNNMDCSA-N Leu-Leu-Phe Chemical compound CC(C)C[C@H]([NH3+])C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C([O-])=O)CC1=CC=CC=C1 UBZGNBKMIJHOHL-BZSNNMDCSA-N 0.000 description 1
- IEWBEPKLKUXQBU-VOAKCMCISA-N Leu-Leu-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O IEWBEPKLKUXQBU-VOAKCMCISA-N 0.000 description 1
- HVHRPWQEQHIQJF-AVGNSLFASA-N Leu-Lys-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O HVHRPWQEQHIQJF-AVGNSLFASA-N 0.000 description 1
- LVTJJOJKDCVZGP-QWRGUYRKSA-N Leu-Lys-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O LVTJJOJKDCVZGP-QWRGUYRKSA-N 0.000 description 1
- REPBGZHJKYWFMJ-KKUMJFAQSA-N Leu-Lys-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N REPBGZHJKYWFMJ-KKUMJFAQSA-N 0.000 description 1
- IBSGMIPRBMPMHE-IHRRRGAJSA-N Leu-Met-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCCCN)C(O)=O IBSGMIPRBMPMHE-IHRRRGAJSA-N 0.000 description 1
- JVTYXRRFZCEPPK-RHYQMDGZSA-N Leu-Met-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](CC(C)C)N)O JVTYXRRFZCEPPK-RHYQMDGZSA-N 0.000 description 1
- LQUIENKUVKPNIC-ULQDDVLXSA-N Leu-Met-Tyr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O LQUIENKUVKPNIC-ULQDDVLXSA-N 0.000 description 1
- ZDBMWELMUCLUPL-QEJZJMRPSA-N Leu-Phe-Ala Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C)C(O)=O)CC1=CC=CC=C1 ZDBMWELMUCLUPL-QEJZJMRPSA-N 0.000 description 1
- GCXGCIYIHXSKAY-ULQDDVLXSA-N Leu-Phe-Arg Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O GCXGCIYIHXSKAY-ULQDDVLXSA-N 0.000 description 1
- YESNGRDJQWDYLH-KKUMJFAQSA-N Leu-Phe-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CS)C(=O)O)N YESNGRDJQWDYLH-KKUMJFAQSA-N 0.000 description 1
- PTRKPHUGYULXPU-KKUMJFAQSA-N Leu-Phe-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O PTRKPHUGYULXPU-KKUMJFAQSA-N 0.000 description 1
- MAXILRZVORNXBE-PMVMPFDFSA-N Leu-Phe-Trp Chemical compound C([C@H](NC(=O)[C@@H](N)CC(C)C)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(O)=O)C1=CC=CC=C1 MAXILRZVORNXBE-PMVMPFDFSA-N 0.000 description 1
- WMIOEVKKYIMVKI-DCAQKATOSA-N Leu-Pro-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O WMIOEVKKYIMVKI-DCAQKATOSA-N 0.000 description 1
- VULJUQZPSOASBZ-SRVKXCTJSA-N Leu-Pro-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O VULJUQZPSOASBZ-SRVKXCTJSA-N 0.000 description 1
- YUTNOGOMBNYPFH-XUXIUFHCSA-N Leu-Pro-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(O)=O YUTNOGOMBNYPFH-XUXIUFHCSA-N 0.000 description 1
- XXXXOVFBXRERQL-ULQDDVLXSA-N Leu-Pro-Phe Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 XXXXOVFBXRERQL-ULQDDVLXSA-N 0.000 description 1
- DPURXCQCHSQPAN-AVGNSLFASA-N Leu-Pro-Pro Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 DPURXCQCHSQPAN-AVGNSLFASA-N 0.000 description 1
- PWPBLZXWFXJFHE-RHYQMDGZSA-N Leu-Pro-Thr Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O PWPBLZXWFXJFHE-RHYQMDGZSA-N 0.000 description 1
- JDBQSGMJBMPNFT-AVGNSLFASA-N Leu-Pro-Val Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O JDBQSGMJBMPNFT-AVGNSLFASA-N 0.000 description 1
- IRMLZWSRWSGTOP-CIUDSAMLSA-N Leu-Ser-Ala Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O IRMLZWSRWSGTOP-CIUDSAMLSA-N 0.000 description 1
- AMSSKPUHBUQBOQ-SRVKXCTJSA-N Leu-Ser-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)O)N AMSSKPUHBUQBOQ-SRVKXCTJSA-N 0.000 description 1
- BRTVHXHCUSXYRI-CIUDSAMLSA-N Leu-Ser-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O BRTVHXHCUSXYRI-CIUDSAMLSA-N 0.000 description 1
- SIGZKCWZEBFNAK-QAETUUGQSA-N Leu-Ser-Ser-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 SIGZKCWZEBFNAK-QAETUUGQSA-N 0.000 description 1
- HWMQRQIFVGEAPH-XIRDDKMYSA-N Leu-Ser-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC(C)C)C(O)=O)=CNC2=C1 HWMQRQIFVGEAPH-XIRDDKMYSA-N 0.000 description 1
- LJBVRCDPWOJOEK-PPCPHDFISA-N Leu-Thr-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O LJBVRCDPWOJOEK-PPCPHDFISA-N 0.000 description 1
- ILDSIMPXNFWKLH-KATARQTJSA-N Leu-Thr-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O ILDSIMPXNFWKLH-KATARQTJSA-N 0.000 description 1
- AIQWYVFNBNNOLU-RHYQMDGZSA-N Leu-Thr-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O AIQWYVFNBNNOLU-RHYQMDGZSA-N 0.000 description 1
- CNWDWAMPKVYJJB-NUTKFTJISA-N Leu-Trp-Ala Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](N)CC(C)C)C(=O)N[C@@H](C)C(O)=O)=CNC2=C1 CNWDWAMPKVYJJB-NUTKFTJISA-N 0.000 description 1
- HQBOMRTVKVKFMN-WDSOQIARSA-N Leu-Trp-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](C(C)C)C(O)=O HQBOMRTVKVKFMN-WDSOQIARSA-N 0.000 description 1
- UCRJTSIIAYHOHE-ULQDDVLXSA-N Leu-Tyr-Arg Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N UCRJTSIIAYHOHE-ULQDDVLXSA-N 0.000 description 1
- VJGQRELPQWNURN-JYJNAYRXSA-N Leu-Tyr-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O VJGQRELPQWNURN-JYJNAYRXSA-N 0.000 description 1
- JGKHAFUAPZCCDU-BZSNNMDCSA-N Leu-Tyr-Leu Chemical compound CC(C)C[C@H]([NH3+])C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C([O-])=O)CC1=CC=C(O)C=C1 JGKHAFUAPZCCDU-BZSNNMDCSA-N 0.000 description 1
- XZNJZXJZBMBGGS-NHCYSSNCSA-N Leu-Val-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O XZNJZXJZBMBGGS-NHCYSSNCSA-N 0.000 description 1
- MVJRBCJCRYGCKV-GVXVVHGQSA-N Leu-Val-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O MVJRBCJCRYGCKV-GVXVVHGQSA-N 0.000 description 1
- QESXLSQLQHHTIX-RHYQMDGZSA-N Leu-Val-Thr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O QESXLSQLQHHTIX-RHYQMDGZSA-N 0.000 description 1
- MSFITIBEMPWCBD-ULQDDVLXSA-N Leu-Val-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 MSFITIBEMPWCBD-ULQDDVLXSA-N 0.000 description 1
- RVOMPSJXSRPFJT-DCAQKATOSA-N Lys-Ala-Arg Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O RVOMPSJXSRPFJT-DCAQKATOSA-N 0.000 description 1
- BTSXLXFPMZXVPR-DLOVCJGASA-N Lys-Ala-His Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CCCCN)N BTSXLXFPMZXVPR-DLOVCJGASA-N 0.000 description 1
- KNKHAVVBVXKOGX-JXUBOQSCSA-N Lys-Ala-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KNKHAVVBVXKOGX-JXUBOQSCSA-N 0.000 description 1
- GQUDMNDPQTXZRV-DCAQKATOSA-N Lys-Arg-Asp Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O GQUDMNDPQTXZRV-DCAQKATOSA-N 0.000 description 1
- BRSGXFITDXFMFF-IHRRRGAJSA-N Lys-Arg-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CCCCN)N BRSGXFITDXFMFF-IHRRRGAJSA-N 0.000 description 1
- VHNOAIFVYUQOOY-XUXIUFHCSA-N Lys-Arg-Ile Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O VHNOAIFVYUQOOY-XUXIUFHCSA-N 0.000 description 1
- WALVCOOOKULCQM-ULQDDVLXSA-N Lys-Arg-Phe Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O WALVCOOOKULCQM-ULQDDVLXSA-N 0.000 description 1
- ZQCVMVCVPFYXHZ-SRVKXCTJSA-N Lys-Asn-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CCCCN ZQCVMVCVPFYXHZ-SRVKXCTJSA-N 0.000 description 1
- LMVOVCYVZBBWQB-SRVKXCTJSA-N Lys-Asp-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN LMVOVCYVZBBWQB-SRVKXCTJSA-N 0.000 description 1
- SSJBMGCZZXCGJJ-DCAQKATOSA-N Lys-Asp-Met Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCSC)C(O)=O SSJBMGCZZXCGJJ-DCAQKATOSA-N 0.000 description 1
- QIJVAFLRMVBHMU-KKUMJFAQSA-N Lys-Asp-Phe Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O QIJVAFLRMVBHMU-KKUMJFAQSA-N 0.000 description 1
- XFBBBRDEQIPGNR-KATARQTJSA-N Lys-Cys-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CCCCN)N)O XFBBBRDEQIPGNR-KATARQTJSA-N 0.000 description 1
- DZQYZKPINJLLEN-KKUMJFAQSA-N Lys-Cys-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CCCCN)N)O DZQYZKPINJLLEN-KKUMJFAQSA-N 0.000 description 1
- VSRXPEHZMHSFKU-IUCAKERBSA-N Lys-Gln-Gly Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O VSRXPEHZMHSFKU-IUCAKERBSA-N 0.000 description 1
- PBIPLDMFHAICIP-DCAQKATOSA-N Lys-Glu-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O PBIPLDMFHAICIP-DCAQKATOSA-N 0.000 description 1
- DCRWPTBMWMGADO-AVGNSLFASA-N Lys-Glu-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O DCRWPTBMWMGADO-AVGNSLFASA-N 0.000 description 1
- VEGLGAOVLFODGC-GUBZILKMSA-N Lys-Glu-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O VEGLGAOVLFODGC-GUBZILKMSA-N 0.000 description 1
- DKTNGXVSCZULPO-YUMQZZPRSA-N Lys-Gly-Cys Chemical compound NCCCC[C@H](N)C(=O)NCC(=O)N[C@@H](CS)C(O)=O DKTNGXVSCZULPO-YUMQZZPRSA-N 0.000 description 1
- DTUZCYRNEJDKSR-NHCYSSNCSA-N Lys-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CCCCN DTUZCYRNEJDKSR-NHCYSSNCSA-N 0.000 description 1
- PBLLTSKBTAHDNA-KBPBESRZSA-N Lys-Gly-Phe Chemical compound [H]N[C@@H](CCCCN)C(=O)NCC(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O PBLLTSKBTAHDNA-KBPBESRZSA-N 0.000 description 1
- CANPXOLVTMKURR-WEDXCCLWSA-N Lys-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CCCCN CANPXOLVTMKURR-WEDXCCLWSA-N 0.000 description 1
- HQXSFFSLXFHWOX-IXOXFDKPSA-N Lys-His-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CCCCN)N)O HQXSFFSLXFHWOX-IXOXFDKPSA-N 0.000 description 1
- SLQJJFAVWSZLBL-BJDJZHNGSA-N Lys-Ile-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCCCN SLQJJFAVWSZLBL-BJDJZHNGSA-N 0.000 description 1
- IVFUVMSKSFSFBT-NHCYSSNCSA-N Lys-Ile-Gly Chemical compound OC(=O)CNC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCCCN IVFUVMSKSFSFBT-NHCYSSNCSA-N 0.000 description 1
- KEPWSUPUFAPBRF-DKIMLUQUSA-N Lys-Ile-Phe Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O KEPWSUPUFAPBRF-DKIMLUQUSA-N 0.000 description 1
- XREQQOATSMMAJP-MGHWNKPDSA-N Lys-Ile-Tyr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O XREQQOATSMMAJP-MGHWNKPDSA-N 0.000 description 1
- WAIHHELKYSFIQN-XUXIUFHCSA-N Lys-Ile-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O WAIHHELKYSFIQN-XUXIUFHCSA-N 0.000 description 1
- ORVFEGYUJITPGI-IHRRRGAJSA-N Lys-Leu-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CCCCN ORVFEGYUJITPGI-IHRRRGAJSA-N 0.000 description 1
- XIZQPFCRXLUNMK-BZSNNMDCSA-N Lys-Leu-Phe Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CCCCN)N XIZQPFCRXLUNMK-BZSNNMDCSA-N 0.000 description 1
- PFZWARWVRNTPBR-IHPCNDPISA-N Lys-Leu-Trp Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CCCCN)N PFZWARWVRNTPBR-IHPCNDPISA-N 0.000 description 1
- ATNKHRAIZCMCCN-BZSNNMDCSA-N Lys-Lys-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CCCCN)N ATNKHRAIZCMCCN-BZSNNMDCSA-N 0.000 description 1
- YXPJCVNIDDKGOE-MELADBBJSA-N Lys-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)[C@H](CCCCN)N)C(=O)O YXPJCVNIDDKGOE-MELADBBJSA-N 0.000 description 1
- QQPSCXKFDSORFT-IHRRRGAJSA-N Lys-Lys-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCCCN QQPSCXKFDSORFT-IHRRRGAJSA-N 0.000 description 1
- URGPVYGVWLIRGT-DCAQKATOSA-N Lys-Met-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C)C(O)=O URGPVYGVWLIRGT-DCAQKATOSA-N 0.000 description 1
- WKUXWMWQTOYTFI-SRVKXCTJSA-N Lys-Met-Gln Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CCCCN)N WKUXWMWQTOYTFI-SRVKXCTJSA-N 0.000 description 1
- JYVCOTWSRGFABJ-DCAQKATOSA-N Lys-Met-Ser Chemical compound CSCC[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CCCCN)N JYVCOTWSRGFABJ-DCAQKATOSA-N 0.000 description 1
- TWPCWKVOZDUYAA-KKUMJFAQSA-N Lys-Phe-Asp Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O TWPCWKVOZDUYAA-KKUMJFAQSA-N 0.000 description 1
- LNMKRJJLEFASGA-BZSNNMDCSA-N Lys-Phe-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O LNMKRJJLEFASGA-BZSNNMDCSA-N 0.000 description 1
- AEIIJFBQVGYVEV-YESZJQIVSA-N Lys-Phe-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)NC(=O)[C@H](CCCCN)N)C(=O)O AEIIJFBQVGYVEV-YESZJQIVSA-N 0.000 description 1
- UDXSLGLHFUBRRM-OEAJRASXSA-N Lys-Phe-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](CCCCN)N)O UDXSLGLHFUBRRM-OEAJRASXSA-N 0.000 description 1
- JCVOHUKUYSYBAD-DCAQKATOSA-N Lys-Pro-Cys Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CCCCN)N)C(=O)N[C@@H](CS)C(=O)O JCVOHUKUYSYBAD-DCAQKATOSA-N 0.000 description 1
- LOGFVTREOLYCPF-RHYQMDGZSA-N Lys-Pro-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CCCCN LOGFVTREOLYCPF-RHYQMDGZSA-N 0.000 description 1
- HKXSZKJMDBHOTG-CIUDSAMLSA-N Lys-Ser-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CCCCN HKXSZKJMDBHOTG-CIUDSAMLSA-N 0.000 description 1
- DNWBUCHHMRQWCZ-GUBZILKMSA-N Lys-Ser-Gln Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(N)=O DNWBUCHHMRQWCZ-GUBZILKMSA-N 0.000 description 1
- IOQWIOPSKJOEKI-SRVKXCTJSA-N Lys-Ser-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O IOQWIOPSKJOEKI-SRVKXCTJSA-N 0.000 description 1
- MIFFFXHMAHFACR-KATARQTJSA-N Lys-Ser-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CCCCN MIFFFXHMAHFACR-KATARQTJSA-N 0.000 description 1
- CUHGAUZONORRIC-HJGDQZAQSA-N Lys-Thr-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCCCN)N)O CUHGAUZONORRIC-HJGDQZAQSA-N 0.000 description 1
- QVTDVTONTRSQMF-WDCWCFNPSA-N Lys-Thr-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H]([C@H](O)C)NC(=O)[C@@H](N)CCCCN QVTDVTONTRSQMF-WDCWCFNPSA-N 0.000 description 1
- RMOKGALPSPOYKE-KATARQTJSA-N Lys-Thr-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O RMOKGALPSPOYKE-KATARQTJSA-N 0.000 description 1
- CAVRAQIDHUPECU-UVOCVTCTSA-N Lys-Thr-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O CAVRAQIDHUPECU-UVOCVTCTSA-N 0.000 description 1
- CFOLERIRBUAYAD-HOCLYGCPSA-N Lys-Trp-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)NCC(O)=O CFOLERIRBUAYAD-HOCLYGCPSA-N 0.000 description 1
- PPNCMJARTHYNEC-MEYUZBJRSA-N Lys-Tyr-Thr Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H]([C@H](O)C)C(O)=O)CC1=CC=C(O)C=C1 PPNCMJARTHYNEC-MEYUZBJRSA-N 0.000 description 1
- VWPJQIHBBOJWDN-DCAQKATOSA-N Lys-Val-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O VWPJQIHBBOJWDN-DCAQKATOSA-N 0.000 description 1
- QFSYGUMEANRNJE-DCAQKATOSA-N Lys-Val-Cys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCCCN)N QFSYGUMEANRNJE-DCAQKATOSA-N 0.000 description 1
- NYTDJEZBAAFLLG-IHRRRGAJSA-N Lys-Val-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCCN)C(O)=O NYTDJEZBAAFLLG-IHRRRGAJSA-N 0.000 description 1
- 241000282560 Macaca mulatta Species 0.000 description 1
- WXHHTBVYQOSYSL-FXQIFTODSA-N Met-Ala-Ser Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O WXHHTBVYQOSYSL-FXQIFTODSA-N 0.000 description 1
- ZEDVFJPQNNBMST-CYDGBPFRSA-N Met-Arg-Ile Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O ZEDVFJPQNNBMST-CYDGBPFRSA-N 0.000 description 1
- ZAJNRWKGHWGPDQ-SDDRHHMPSA-N Met-Arg-Pro Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N1CCC[C@@H]1C(=O)O)N ZAJNRWKGHWGPDQ-SDDRHHMPSA-N 0.000 description 1
- OSOLWRWQADPDIQ-DCAQKATOSA-N Met-Asp-Leu Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O OSOLWRWQADPDIQ-DCAQKATOSA-N 0.000 description 1
- RPEPZINUYHUBKG-FXQIFTODSA-N Met-Cys-Ala Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CS)C(=O)N[C@@H](C)C(O)=O RPEPZINUYHUBKG-FXQIFTODSA-N 0.000 description 1
- HGKJFNCLOHKEHS-FXQIFTODSA-N Met-Cys-Asp Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@H](C(O)=O)CC(O)=O HGKJFNCLOHKEHS-FXQIFTODSA-N 0.000 description 1
- AVTWKENDGGUWDC-BQBZGAKWSA-N Met-Cys-Gly Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CS)C(=O)NCC(O)=O AVTWKENDGGUWDC-BQBZGAKWSA-N 0.000 description 1
- YKWHHKDMBZBMLG-GUBZILKMSA-N Met-Cys-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CCSC)N YKWHHKDMBZBMLG-GUBZILKMSA-N 0.000 description 1
- OXHSZBRPUGNMKW-DCAQKATOSA-N Met-Gln-Arg Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O OXHSZBRPUGNMKW-DCAQKATOSA-N 0.000 description 1
- MYKLINMAGAIRPJ-CIUDSAMLSA-N Met-Gln-Asn Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O MYKLINMAGAIRPJ-CIUDSAMLSA-N 0.000 description 1
- UOENBSHXYCHSAU-YUMQZZPRSA-N Met-Gln-Gly Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O UOENBSHXYCHSAU-YUMQZZPRSA-N 0.000 description 1
- KLFPZIUIXZNEKY-DCAQKATOSA-N Met-Gln-Met Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCSC)C(O)=O KLFPZIUIXZNEKY-DCAQKATOSA-N 0.000 description 1
- HHCOOFPGNXKFGR-HJGDQZAQSA-N Met-Gln-Thr Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O HHCOOFPGNXKFGR-HJGDQZAQSA-N 0.000 description 1
- SJDQOYTYNGZZJX-SRVKXCTJSA-N Met-Glu-Leu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O SJDQOYTYNGZZJX-SRVKXCTJSA-N 0.000 description 1
- CUICVBQQHMKBRJ-LSJOCFKGSA-N Met-His-Ala Chemical compound CSCC[C@H](N)C(=O)N[C@@H](Cc1cnc[nH]1)C(=O)N[C@@H](C)C(O)=O CUICVBQQHMKBRJ-LSJOCFKGSA-N 0.000 description 1
- RXWPLVRJQNWXRQ-IHRRRGAJSA-N Met-His-His Chemical compound C([C@H](NC(=O)[C@@H](N)CCSC)C(=O)N[C@@H](CC=1N=CNC=1)C(O)=O)C1=CNC=N1 RXWPLVRJQNWXRQ-IHRRRGAJSA-N 0.000 description 1
- FWAHLGXNBLWIKB-NAKRPEOUSA-N Met-Ile-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCSC FWAHLGXNBLWIKB-NAKRPEOUSA-N 0.000 description 1
- PZUUMQPMHBJJKE-AVGNSLFASA-N Met-Leu-Arg Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCNC(N)=N PZUUMQPMHBJJKE-AVGNSLFASA-N 0.000 description 1
- UROWNMBTQGGTHB-DCAQKATOSA-N Met-Leu-Asp Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O UROWNMBTQGGTHB-DCAQKATOSA-N 0.000 description 1
- RBGLBUDVQVPTEG-DCAQKATOSA-N Met-Leu-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCSC)N RBGLBUDVQVPTEG-DCAQKATOSA-N 0.000 description 1
- KMSMNUFBNCHMII-IHRRRGAJSA-N Met-Leu-Lys Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCCN KMSMNUFBNCHMII-IHRRRGAJSA-N 0.000 description 1
- OCRSGGIJBDUXHU-WDSOQIARSA-N Met-Leu-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CCSC)C(O)=O)=CNC2=C1 OCRSGGIJBDUXHU-WDSOQIARSA-N 0.000 description 1
- VAGCEUUEMMXFEX-GUBZILKMSA-N Met-Met-Asn Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(N)=O)C(O)=O VAGCEUUEMMXFEX-GUBZILKMSA-N 0.000 description 1
- JOYFULUKJRJCSX-IUCAKERBSA-N Met-Met-Gly Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCSC)C(=O)NCC(O)=O JOYFULUKJRJCSX-IUCAKERBSA-N 0.000 description 1
- LLKWSEXLNFBKIF-CYDGBPFRSA-N Met-Met-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CCSC)NC(=O)[C@@H](N)CCSC LLKWSEXLNFBKIF-CYDGBPFRSA-N 0.000 description 1
- CNTNPWWHFWAZGA-JYJNAYRXSA-N Met-Met-Phe Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCSC)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 CNTNPWWHFWAZGA-JYJNAYRXSA-N 0.000 description 1
- RSOMVHWMIAZNLE-HJWJTTGWSA-N Met-Phe-Ile Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O RSOMVHWMIAZNLE-HJWJTTGWSA-N 0.000 description 1
- HUURTRNKPBHHKZ-JYJNAYRXSA-N Met-Phe-Val Chemical compound CSCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C(C)C)C(O)=O)CC1=CC=CC=C1 HUURTRNKPBHHKZ-JYJNAYRXSA-N 0.000 description 1
- VQILILSLEFDECU-GUBZILKMSA-N Met-Pro-Ala Chemical compound [H]N[C@@H](CCSC)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O VQILILSLEFDECU-GUBZILKMSA-N 0.000 description 1
- MPCKIRSXNKACRF-GUBZILKMSA-N Met-Pro-Asn Chemical compound CSCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O MPCKIRSXNKACRF-GUBZILKMSA-N 0.000 description 1
- YLDSJJOGQNEQJK-AVGNSLFASA-N Met-Pro-Leu Chemical compound CSCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O YLDSJJOGQNEQJK-AVGNSLFASA-N 0.000 description 1
- RDLSEGZJMYGFNS-FXQIFTODSA-N Met-Ser-Asp Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O RDLSEGZJMYGFNS-FXQIFTODSA-N 0.000 description 1
- FIZZULTXMVEIAA-IHRRRGAJSA-N Met-Ser-Phe Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 FIZZULTXMVEIAA-IHRRRGAJSA-N 0.000 description 1
- YDKYJRZWRJTILC-WDSOQIARSA-N Met-Trp-Lys Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](N)CCSC)C(=O)N[C@@H](CCCCN)C(O)=O)=CNC2=C1 YDKYJRZWRJTILC-WDSOQIARSA-N 0.000 description 1
- HOTNHEUETJELDL-BPNCWPANSA-N Met-Tyr-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CCSC)N HOTNHEUETJELDL-BPNCWPANSA-N 0.000 description 1
- OOLVTRHJJBCJKB-IHRRRGAJSA-N Met-Tyr-Asp Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N OOLVTRHJJBCJKB-IHRRRGAJSA-N 0.000 description 1
- LIIXIZKVWNYQHB-STECZYCISA-N Met-Tyr-Ile Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O LIIXIZKVWNYQHB-STECZYCISA-N 0.000 description 1
- PNHRPOWKRRJATF-IHRRRGAJSA-N Met-Tyr-Ser Chemical compound CSCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CO)C(O)=O)CC1=CC=C(O)C=C1 PNHRPOWKRRJATF-IHRRRGAJSA-N 0.000 description 1
- JHVNNUIQXOGAHI-KJEVXHAQSA-N Met-Tyr-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CCSC)N)O JHVNNUIQXOGAHI-KJEVXHAQSA-N 0.000 description 1
- QAVZUKIPOMBLMC-AVGNSLFASA-N Met-Val-Leu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC(C)C QAVZUKIPOMBLMC-AVGNSLFASA-N 0.000 description 1
- LBSWWNKMVPAXOI-GUBZILKMSA-N Met-Val-Ser Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O LBSWWNKMVPAXOI-GUBZILKMSA-N 0.000 description 1
- IIHMNTBFPMRJCN-RCWTZXSCSA-N Met-Val-Thr Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O IIHMNTBFPMRJCN-RCWTZXSCSA-N 0.000 description 1
- PVSPJQWHEIQTEH-JYJNAYRXSA-N Met-Val-Tyr Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 PVSPJQWHEIQTEH-JYJNAYRXSA-N 0.000 description 1
- IQJMEDDVOGMTKT-SRVKXCTJSA-N Met-Val-Val Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C(C)C)C(O)=O IQJMEDDVOGMTKT-SRVKXCTJSA-N 0.000 description 1
- 241000699666 Mus <mouse, genus> Species 0.000 description 1
- 241000699660 Mus musculus Species 0.000 description 1
- 108010047562 NGR peptide Proteins 0.000 description 1
- 241000701945 Parvoviridae Species 0.000 description 1
- BJEYSVHMGIJORT-NHCYSSNCSA-N Phe-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=CC=C1 BJEYSVHMGIJORT-NHCYSSNCSA-N 0.000 description 1
- AYPMIIKUMNADSU-IHRRRGAJSA-N Phe-Arg-Asn Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(O)=O AYPMIIKUMNADSU-IHRRRGAJSA-N 0.000 description 1
- MPGJIHFJCXTVEX-KKUMJFAQSA-N Phe-Arg-Glu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O MPGJIHFJCXTVEX-KKUMJFAQSA-N 0.000 description 1
- LJUUGSWZPQOJKD-JYJNAYRXSA-N Phe-Arg-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CCCNC(N)=N)NC(=O)[C@@H](N)Cc1ccccc1)C(O)=O LJUUGSWZPQOJKD-JYJNAYRXSA-N 0.000 description 1
- AWAYOWOUGVZXOB-BZSNNMDCSA-N Phe-Asn-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 AWAYOWOUGVZXOB-BZSNNMDCSA-N 0.000 description 1
- XMPUYNHKEPFERE-IHRRRGAJSA-N Phe-Asp-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 XMPUYNHKEPFERE-IHRRRGAJSA-N 0.000 description 1
- WIVCOAKLPICYGY-KKUMJFAQSA-N Phe-Asp-Lys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N WIVCOAKLPICYGY-KKUMJFAQSA-N 0.000 description 1
- OMHMIXFFRPMYHB-SRVKXCTJSA-N Phe-Cys-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(=O)N)C(=O)O)N OMHMIXFFRPMYHB-SRVKXCTJSA-N 0.000 description 1
- ZBYHVSHBZYHQBW-SRVKXCTJSA-N Phe-Cys-Asp Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(=O)O)C(=O)O)N ZBYHVSHBZYHQBW-SRVKXCTJSA-N 0.000 description 1
- WFDAEEUZPZSMOG-SRVKXCTJSA-N Phe-Cys-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CS)C(=O)N[C@@H](CO)C(O)=O WFDAEEUZPZSMOG-SRVKXCTJSA-N 0.000 description 1
- IDUCUXTUHHIQIP-SOUVJXGZSA-N Phe-Gln-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC2=CC=CC=C2)N)C(=O)O IDUCUXTUHHIQIP-SOUVJXGZSA-N 0.000 description 1
- NAXPHWZXEXNDIW-JTQLQIEISA-N Phe-Gly-Gly Chemical compound OC(=O)CNC(=O)CNC(=O)[C@@H](N)CC1=CC=CC=C1 NAXPHWZXEXNDIW-JTQLQIEISA-N 0.000 description 1
- QPVFUAUFEBPIPT-CDMKHQONSA-N Phe-Gly-Thr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O QPVFUAUFEBPIPT-CDMKHQONSA-N 0.000 description 1
- PMKIMKUGCSVFSV-CQDKDKBSSA-N Phe-His-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CC2=CC=CC=C2)N PMKIMKUGCSVFSV-CQDKDKBSSA-N 0.000 description 1
- VZFPYFRVHMSSNA-JURCDPSOSA-N Phe-Ile-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CC1=CC=CC=C1 VZFPYFRVHMSSNA-JURCDPSOSA-N 0.000 description 1
- MIICYIIBVYQNKE-QEWYBTABSA-N Phe-Ile-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N MIICYIIBVYQNKE-QEWYBTABSA-N 0.000 description 1
- WEMYTDDMDBLPMI-DKIMLUQUSA-N Phe-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N WEMYTDDMDBLPMI-DKIMLUQUSA-N 0.000 description 1
- BYAIIACBWBOJCU-URLPEUOOSA-N Phe-Ile-Thr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O BYAIIACBWBOJCU-URLPEUOOSA-N 0.000 description 1
- TXKWKTWYTIAZSV-KKUMJFAQSA-N Phe-Leu-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N TXKWKTWYTIAZSV-KKUMJFAQSA-N 0.000 description 1
- YTILBRIUASDGBL-BZSNNMDCSA-N Phe-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 YTILBRIUASDGBL-BZSNNMDCSA-N 0.000 description 1
- ZIQQNOXKEFDPBE-BZSNNMDCSA-N Phe-Lys-His Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N ZIQQNOXKEFDPBE-BZSNNMDCSA-N 0.000 description 1
- KLXQWABNAWDRAY-ACRUOGEOSA-N Phe-Lys-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 KLXQWABNAWDRAY-ACRUOGEOSA-N 0.000 description 1
- BSHMIVKDJQGLNT-ACRUOGEOSA-N Phe-Lys-Tyr Chemical compound C([C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=CC=C1 BSHMIVKDJQGLNT-ACRUOGEOSA-N 0.000 description 1
- FQUUYTNBMIBOHS-IHRRRGAJSA-N Phe-Met-Ser Chemical compound CSCC[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N FQUUYTNBMIBOHS-IHRRRGAJSA-N 0.000 description 1
- JKJSIYKSGIDHPM-WBAXXEDZSA-N Phe-Phe-Ala Chemical compound C[C@H](NC(=O)[C@H](Cc1ccccc1)NC(=O)[C@@H](N)Cc1ccccc1)C(O)=O JKJSIYKSGIDHPM-WBAXXEDZSA-N 0.000 description 1
- JDMKQHSHKJHAHR-UHFFFAOYSA-N Phe-Phe-Leu-Tyr Natural products C=1C=C(O)C=CC=1CC(C(O)=O)NC(=O)C(CC(C)C)NC(=O)C(NC(=O)C(N)CC=1C=CC=CC=1)CC1=CC=CC=C1 JDMKQHSHKJHAHR-UHFFFAOYSA-N 0.000 description 1
- RBRNEFJTEHPDSL-ACRUOGEOSA-N Phe-Phe-Lys Chemical compound C([C@@H](C(=O)N[C@@H](CCCCN)C(O)=O)NC(=O)[C@@H](N)CC=1C=CC=CC=1)C1=CC=CC=C1 RBRNEFJTEHPDSL-ACRUOGEOSA-N 0.000 description 1
- ZVRJWDUPIDMHDN-ULQDDVLXSA-N Phe-Pro-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CC1=CC=CC=C1 ZVRJWDUPIDMHDN-ULQDDVLXSA-N 0.000 description 1
- NJJBATPLUQHRBM-IHRRRGAJSA-N Phe-Pro-Ser Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)N)C(=O)N[C@@H](CO)C(=O)O NJJBATPLUQHRBM-IHRRRGAJSA-N 0.000 description 1
- ZLAKUZDMKVKFAI-JYJNAYRXSA-N Phe-Pro-Val Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O ZLAKUZDMKVKFAI-JYJNAYRXSA-N 0.000 description 1
- ILGCZYGFYQLSDZ-KKUMJFAQSA-N Phe-Ser-His Chemical compound N[C@@H](Cc1ccccc1)C(=O)N[C@@H](CO)C(=O)N[C@@H](Cc1cnc[nH]1)C(O)=O ILGCZYGFYQLSDZ-KKUMJFAQSA-N 0.000 description 1
- MVIJMIZJPHQGEN-IHRRRGAJSA-N Phe-Ser-Val Chemical compound CC(C)[C@@H](C([O-])=O)NC(=O)[C@H](CO)NC(=O)[C@@H]([NH3+])CC1=CC=CC=C1 MVIJMIZJPHQGEN-IHRRRGAJSA-N 0.000 description 1
- BSKMOCNNLNDIMU-CDMKHQONSA-N Phe-Thr-Gly Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O BSKMOCNNLNDIMU-CDMKHQONSA-N 0.000 description 1
- FGWUALWGCZJQDJ-URLPEUOOSA-N Phe-Thr-Ile Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O FGWUALWGCZJQDJ-URLPEUOOSA-N 0.000 description 1
- VGTJSEYTVMAASM-RPTUDFQQSA-N Phe-Thr-Tyr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O VGTJSEYTVMAASM-RPTUDFQQSA-N 0.000 description 1
- KCIKTPHTEYBXMG-BVSLBCMMSA-N Phe-Trp-Arg Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O KCIKTPHTEYBXMG-BVSLBCMMSA-N 0.000 description 1
- QTDBZORPVYTRJU-KKXDTOCCSA-N Phe-Tyr-Ala Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C)C(O)=O QTDBZORPVYTRJU-KKXDTOCCSA-N 0.000 description 1
- VFDRDMOMHBJGKD-UFYCRDLUSA-N Phe-Tyr-Arg Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N VFDRDMOMHBJGKD-UFYCRDLUSA-N 0.000 description 1
- DBNGDEAQXGFGRA-ACRUOGEOSA-N Phe-Tyr-Lys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)N[C@@H](CCCCN)C(=O)O)N DBNGDEAQXGFGRA-ACRUOGEOSA-N 0.000 description 1
- APMXLWHMIVWLLR-BZSNNMDCSA-N Phe-Tyr-Ser Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)N[C@@H](CO)C(O)=O)C1=CC=CC=C1 APMXLWHMIVWLLR-BZSNNMDCSA-N 0.000 description 1
- GOUWCZRDTWTODO-YDHLFZDLSA-N Phe-Val-Asn Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O GOUWCZRDTWTODO-YDHLFZDLSA-N 0.000 description 1
- MWQXFDIQXIXPMS-UNQGMJICSA-N Phe-Val-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CC1=CC=CC=C1)N)O MWQXFDIQXIXPMS-UNQGMJICSA-N 0.000 description 1
- 108010089430 Phosphoproteins Proteins 0.000 description 1
- 102000007982 Phosphoproteins Human genes 0.000 description 1
- OAICVXFJPJFONN-UHFFFAOYSA-N Phosphorus Chemical compound [P] OAICVXFJPJFONN-UHFFFAOYSA-N 0.000 description 1
- 229920001213 Polysorbate 20 Polymers 0.000 description 1
- FCCBQBZXIAZNIG-LSJOCFKGSA-N Pro-Ala-His Chemical compound C[C@H](NC(=O)[C@@H]1CCCN1)C(=O)N[C@@H](Cc1cnc[nH]1)C(O)=O FCCBQBZXIAZNIG-LSJOCFKGSA-N 0.000 description 1
- FZHBZMDRDASUHN-NAKRPEOUSA-N Pro-Ala-Ile Chemical compound CC[C@H](C)[C@H](NC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1)C(O)=O FZHBZMDRDASUHN-NAKRPEOUSA-N 0.000 description 1
- ONPFOYPPPOHMNH-UVBJJODRSA-N Pro-Ala-Trp Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@@H]3CCCN3 ONPFOYPPPOHMNH-UVBJJODRSA-N 0.000 description 1
- LCRSGSIRKLXZMZ-BPNCWPANSA-N Pro-Ala-Tyr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O LCRSGSIRKLXZMZ-BPNCWPANSA-N 0.000 description 1
- LNLNHXIQPGKRJQ-SRVKXCTJSA-N Pro-Arg-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@@H]1CCCN1 LNLNHXIQPGKRJQ-SRVKXCTJSA-N 0.000 description 1
- CYQQWUPHIZVCNY-GUBZILKMSA-N Pro-Arg-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O CYQQWUPHIZVCNY-GUBZILKMSA-N 0.000 description 1
- ZSKJPKFTPQCPIH-RCWTZXSCSA-N Pro-Arg-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ZSKJPKFTPQCPIH-RCWTZXSCSA-N 0.000 description 1
- INXAPZFIOVGHSV-CIUDSAMLSA-N Pro-Asn-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H]1CCCN1 INXAPZFIOVGHSV-CIUDSAMLSA-N 0.000 description 1
- TXPUNZXZDVJUJQ-LPEHRKFASA-N Pro-Asn-Pro Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC(=O)N)C(=O)N2CCC[C@@H]2C(=O)O TXPUNZXZDVJUJQ-LPEHRKFASA-N 0.000 description 1
- ILMLVTGTUJPQFP-FXQIFTODSA-N Pro-Asp-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O ILMLVTGTUJPQFP-FXQIFTODSA-N 0.000 description 1
- GDXZRWYXJSGWIV-GMOBBJLQSA-N Pro-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@@H]1CCCN1 GDXZRWYXJSGWIV-GMOBBJLQSA-N 0.000 description 1
- XKHCJJPNXFBADI-DCAQKATOSA-N Pro-Asp-Lys Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O XKHCJJPNXFBADI-DCAQKATOSA-N 0.000 description 1
- QVIZLAUEAMQKGS-GUBZILKMSA-N Pro-Asp-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@@H]1CCCN1 QVIZLAUEAMQKGS-GUBZILKMSA-N 0.000 description 1
- YFNOUBWUIIJQHF-LPEHRKFASA-N Pro-Asp-Pro Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC(=O)O)C(=O)N2CCC[C@@H]2C(=O)O YFNOUBWUIIJQHF-LPEHRKFASA-N 0.000 description 1
- XUSDDSLCRPUKLP-QXEWZRGKSA-N Pro-Asp-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H]1CCCN1 XUSDDSLCRPUKLP-QXEWZRGKSA-N 0.000 description 1
- AIZVVCMAFRREQS-GUBZILKMSA-N Pro-Cys-Arg Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CS)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O AIZVVCMAFRREQS-GUBZILKMSA-N 0.000 description 1
- SZZBUDVXWZZPDH-BQBZGAKWSA-N Pro-Cys-Gly Chemical compound OC(=O)CNC(=O)[C@H](CS)NC(=O)[C@@H]1CCCN1 SZZBUDVXWZZPDH-BQBZGAKWSA-N 0.000 description 1
- HQVPQXMCQKXARZ-FXQIFTODSA-N Pro-Cys-Ser Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CS)C(=O)N[C@@H](CO)C(=O)O HQVPQXMCQKXARZ-FXQIFTODSA-N 0.000 description 1
- WGAQWMRJUFQXMF-ZPFDUUQYSA-N Pro-Gln-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WGAQWMRJUFQXMF-ZPFDUUQYSA-N 0.000 description 1
- DIFXZGPHVCIVSQ-CIUDSAMLSA-N Pro-Gln-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O DIFXZGPHVCIVSQ-CIUDSAMLSA-N 0.000 description 1
- NXEYSLRNNPWCRN-SRVKXCTJSA-N Pro-Glu-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O NXEYSLRNNPWCRN-SRVKXCTJSA-N 0.000 description 1
- UEHYFUCOGHWASA-HJGDQZAQSA-N Pro-Glu-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H]1CCCN1 UEHYFUCOGHWASA-HJGDQZAQSA-N 0.000 description 1
- VPEVBAUSTBWQHN-NHCYSSNCSA-N Pro-Glu-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O VPEVBAUSTBWQHN-NHCYSSNCSA-N 0.000 description 1
- UUHXBJHVTVGSKM-BQBZGAKWSA-N Pro-Gly-Asn Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O UUHXBJHVTVGSKM-BQBZGAKWSA-N 0.000 description 1
- JMVQDLDPDBXAAX-YUMQZZPRSA-N Pro-Gly-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H]1CCCN1 JMVQDLDPDBXAAX-YUMQZZPRSA-N 0.000 description 1
- HAAQQNHQZBOWFO-LURJTMIESA-N Pro-Gly-Gly Chemical compound OC(=O)CNC(=O)CNC(=O)[C@@H]1CCCN1 HAAQQNHQZBOWFO-LURJTMIESA-N 0.000 description 1
- AFXCXDQNRXTSBD-FJXKBIBVSA-N Pro-Gly-Thr Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O AFXCXDQNRXTSBD-FJXKBIBVSA-N 0.000 description 1
- AJCRQOHDLCBHFA-SRVKXCTJSA-N Pro-His-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(O)=O)C(O)=O AJCRQOHDLCBHFA-SRVKXCTJSA-N 0.000 description 1
- VWXGFAIZUQBBBG-UWVGGRQHSA-N Pro-His-Gly Chemical compound C([C@@H](C(=O)NCC(=O)[O-])NC(=O)[C@H]1[NH2+]CCC1)C1=CN=CN1 VWXGFAIZUQBBBG-UWVGGRQHSA-N 0.000 description 1
- BODDREDDDRZUCF-QTKMDUPCSA-N Pro-His-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@@H]2CCCN2)O BODDREDDDRZUCF-QTKMDUPCSA-N 0.000 description 1
- BBFRBZYKHIKFBX-GMOBBJLQSA-N Pro-Ile-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@@H]1CCCN1 BBFRBZYKHIKFBX-GMOBBJLQSA-N 0.000 description 1
- BWCZJGJKOFUUCN-ZPFDUUQYSA-N Pro-Ile-Gln Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(N)=O)C(O)=O BWCZJGJKOFUUCN-ZPFDUUQYSA-N 0.000 description 1
- BCNRNJWSRFDPTQ-HJWJTTGWSA-N Pro-Ile-Phe Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O BCNRNJWSRFDPTQ-HJWJTTGWSA-N 0.000 description 1
- UREQLMJCKFLLHM-NAKRPEOUSA-N Pro-Ile-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O UREQLMJCKFLLHM-NAKRPEOUSA-N 0.000 description 1
- GURGCNUWVSDYTP-SRVKXCTJSA-N Pro-Leu-Gln Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O GURGCNUWVSDYTP-SRVKXCTJSA-N 0.000 description 1
- VTFXTWDFPTWNJY-RHYQMDGZSA-N Pro-Leu-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O VTFXTWDFPTWNJY-RHYQMDGZSA-N 0.000 description 1
- SUENWIFTSTWUKD-AVGNSLFASA-N Pro-Leu-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O SUENWIFTSTWUKD-AVGNSLFASA-N 0.000 description 1
- OFGUOWQVEGTVNU-DCAQKATOSA-N Pro-Lys-Ala Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O OFGUOWQVEGTVNU-DCAQKATOSA-N 0.000 description 1
- SXMSEHDMNIUTSP-DCAQKATOSA-N Pro-Lys-Asn Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O SXMSEHDMNIUTSP-DCAQKATOSA-N 0.000 description 1
- YAZNFQUKPUASKB-DCAQKATOSA-N Pro-Lys-Cys Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CS)C(=O)O YAZNFQUKPUASKB-DCAQKATOSA-N 0.000 description 1
- INDVYIOKMXFQFM-SRVKXCTJSA-N Pro-Lys-Gln Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(=O)N)C(=O)O INDVYIOKMXFQFM-SRVKXCTJSA-N 0.000 description 1
- WFIVLLFYUZZWOD-RHYQMDGZSA-N Pro-Lys-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O WFIVLLFYUZZWOD-RHYQMDGZSA-N 0.000 description 1
- VGVCNKSUVSZEIE-IHRRRGAJSA-N Pro-Phe-Asn Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(N)=O)C(O)=O VGVCNKSUVSZEIE-IHRRRGAJSA-N 0.000 description 1
- AWQGDZBKQTYNMN-IHRRRGAJSA-N Pro-Phe-Asp Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC2=CC=CC=C2)C(=O)N[C@@H](CC(=O)O)C(=O)O AWQGDZBKQTYNMN-IHRRRGAJSA-N 0.000 description 1
- MLKVIVZCFYRTIR-KKUMJFAQSA-N Pro-Phe-Gln Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O MLKVIVZCFYRTIR-KKUMJFAQSA-N 0.000 description 1
- JIWJRKNYLSHONY-KKUMJFAQSA-N Pro-Phe-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O JIWJRKNYLSHONY-KKUMJFAQSA-N 0.000 description 1
- AJBQTGZIZQXBLT-STQMWFEESA-N Pro-Phe-Gly Chemical compound C([C@@H](C(=O)NCC(=O)O)NC(=O)[C@H]1NCCC1)C1=CC=CC=C1 AJBQTGZIZQXBLT-STQMWFEESA-N 0.000 description 1
- DSGSTPRKNYHGCL-JYJNAYRXSA-N Pro-Phe-Met Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCSC)C(O)=O DSGSTPRKNYHGCL-JYJNAYRXSA-N 0.000 description 1
- XYAFCOJKICBRDU-JYJNAYRXSA-N Pro-Phe-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(O)=O XYAFCOJKICBRDU-JYJNAYRXSA-N 0.000 description 1
- FYKUEXMZYFIZKA-DCAQKATOSA-N Pro-Pro-Gln Chemical compound [H]N1CCC[C@H]1C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O FYKUEXMZYFIZKA-DCAQKATOSA-N 0.000 description 1
- LEIKGVHQTKHOLM-IUCAKERBSA-N Pro-Pro-Gly Chemical compound OC(=O)CNC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 LEIKGVHQTKHOLM-IUCAKERBSA-N 0.000 description 1
- DWPXHLIBFQLKLK-CYDGBPFRSA-N Pro-Pro-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 DWPXHLIBFQLKLK-CYDGBPFRSA-N 0.000 description 1
- SBVPYBFMIGDIDX-SRVKXCTJSA-N Pro-Pro-Pro Chemical compound OC(=O)[C@@H]1CCCN1C(=O)[C@H]1N(C(=O)[C@H]2NCCC2)CCC1 SBVPYBFMIGDIDX-SRVKXCTJSA-N 0.000 description 1
- RCYUBVHMVUHEBM-RCWTZXSCSA-N Pro-Pro-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O RCYUBVHMVUHEBM-RCWTZXSCSA-N 0.000 description 1
- GOMUXSCOIWIJFP-GUBZILKMSA-N Pro-Ser-Arg Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O GOMUXSCOIWIJFP-GUBZILKMSA-N 0.000 description 1
- LNICFEXCAHIJOR-DCAQKATOSA-N Pro-Ser-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O LNICFEXCAHIJOR-DCAQKATOSA-N 0.000 description 1
- WVXQQUWOKUZIEG-VEVYYDQMSA-N Pro-Thr-Asn Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(O)=O WVXQQUWOKUZIEG-VEVYYDQMSA-N 0.000 description 1
- MDAWMJUZHBQTBO-XGEHTFHBSA-N Pro-Thr-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@@H]1CCCN1)O MDAWMJUZHBQTBO-XGEHTFHBSA-N 0.000 description 1
- GBUNEGKQPSAMNK-QTKMDUPCSA-N Pro-Thr-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@@H]2CCCN2)O GBUNEGKQPSAMNK-QTKMDUPCSA-N 0.000 description 1
- CNUIHOAISPKQPY-HSHDSVGOSA-N Pro-Thr-Trp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O CNUIHOAISPKQPY-HSHDSVGOSA-N 0.000 description 1
- CXGLFEOYCJFKPR-RCWTZXSCSA-N Pro-Thr-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O CXGLFEOYCJFKPR-RCWTZXSCSA-N 0.000 description 1
- DIDLUFMLRUJLFB-FKBYEOEOSA-N Pro-Trp-Tyr Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC2=CNC3=CC=CC=C32)C(=O)N[C@@H](CC4=CC=C(C=C4)O)C(=O)O DIDLUFMLRUJLFB-FKBYEOEOSA-N 0.000 description 1
- 206010037660 Pyrexia Diseases 0.000 description 1
- 229940022005 RNA vaccine Drugs 0.000 description 1
- 238000011529 RT qPCR Methods 0.000 description 1
- 101500027983 Rattus norvegicus Octadecaneuropeptide Proteins 0.000 description 1
- 101710200092 Replicase polyprotein Proteins 0.000 description 1
- 101710151619 Replicase polyprotein 1ab Proteins 0.000 description 1
- 108091006197 SARS-CoV-2 Nucleocapsid Protein Proteins 0.000 description 1
- 108091005609 SARS-CoV-2 Spike Subunit S1 Proteins 0.000 description 1
- 241001678558 SARS-like-CoVZXC21 Species 0.000 description 1
- 238000012300 Sequence Analysis Methods 0.000 description 1
- ZUGXSSFMTXKHJS-ZLUOBGJFSA-N Ser-Ala-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O ZUGXSSFMTXKHJS-ZLUOBGJFSA-N 0.000 description 1
- MWMKFWJYRRGXOR-ZLUOBGJFSA-N Ser-Ala-Asn Chemical compound N[C@H](C(=O)N[C@H](C(=O)N[C@H](C(=O)O)CC(N)=O)C)CO MWMKFWJYRRGXOR-ZLUOBGJFSA-N 0.000 description 1
- MMGJPDWSIOAGTH-ACZMJKKPSA-N Ser-Ala-Gln Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(O)=O MMGJPDWSIOAGTH-ACZMJKKPSA-N 0.000 description 1
- SRTCFKGBYBZRHA-ACZMJKKPSA-N Ser-Ala-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O SRTCFKGBYBZRHA-ACZMJKKPSA-N 0.000 description 1
- BTKUIVBNGBFTTP-WHFBIAKZSA-N Ser-Ala-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)NCC(O)=O BTKUIVBNGBFTTP-WHFBIAKZSA-N 0.000 description 1
- WTWGOQRNRFHFQD-JBDRJPRFSA-N Ser-Ala-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WTWGOQRNRFHFQD-JBDRJPRFSA-N 0.000 description 1
- WTUJZHKANPDPIN-CIUDSAMLSA-N Ser-Ala-Lys Chemical compound C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CO)N WTUJZHKANPDPIN-CIUDSAMLSA-N 0.000 description 1
- KYKKKSWGEPFUMR-NAKRPEOUSA-N Ser-Arg-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KYKKKSWGEPFUMR-NAKRPEOUSA-N 0.000 description 1
- UCXDHBORXLVBNC-ZLUOBGJFSA-N Ser-Asn-Cys Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CS)C(O)=O UCXDHBORXLVBNC-ZLUOBGJFSA-N 0.000 description 1
- BCKYYTVFBXHPOG-ACZMJKKPSA-N Ser-Asn-Gln Chemical compound C(CC(=O)N)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CO)N BCKYYTVFBXHPOG-ACZMJKKPSA-N 0.000 description 1
- RDFQNDHEHVSONI-ZLUOBGJFSA-N Ser-Asn-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O RDFQNDHEHVSONI-ZLUOBGJFSA-N 0.000 description 1
- TYYBJUYSTWJHGO-ZKWXMUAHSA-N Ser-Asn-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O TYYBJUYSTWJHGO-ZKWXMUAHSA-N 0.000 description 1
- CNIIKZQXBBQHCX-FXQIFTODSA-N Ser-Asp-Arg Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O CNIIKZQXBBQHCX-FXQIFTODSA-N 0.000 description 1
- BNFVPSRLHHPQKS-WHFBIAKZSA-N Ser-Asp-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O BNFVPSRLHHPQKS-WHFBIAKZSA-N 0.000 description 1
- BYIROAKULFFTEK-CIUDSAMLSA-N Ser-Asp-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CO BYIROAKULFFTEK-CIUDSAMLSA-N 0.000 description 1
- KNCJWSPMTFFJII-ZLUOBGJFSA-N Ser-Cys-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(O)=O)C(O)=O KNCJWSPMTFFJII-ZLUOBGJFSA-N 0.000 description 1
- RNFKSBPHLTZHLU-WHFBIAKZSA-N Ser-Cys-Gly Chemical compound C([C@@H](C(=O)N[C@@H](CS)C(=O)NCC(=O)O)N)O RNFKSBPHLTZHLU-WHFBIAKZSA-N 0.000 description 1
- DSSOYPJWSWFOLK-CIUDSAMLSA-N Ser-Cys-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(O)=O DSSOYPJWSWFOLK-CIUDSAMLSA-N 0.000 description 1
- CDVFZMOFNJPUDD-ACZMJKKPSA-N Ser-Gln-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O CDVFZMOFNJPUDD-ACZMJKKPSA-N 0.000 description 1
- OJPHFSOMBZKQKQ-GUBZILKMSA-N Ser-Gln-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CO OJPHFSOMBZKQKQ-GUBZILKMSA-N 0.000 description 1
- FMDHKPRACUXATF-ACZMJKKPSA-N Ser-Gln-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O FMDHKPRACUXATF-ACZMJKKPSA-N 0.000 description 1
- HJEBZBMOTCQYDN-ACZMJKKPSA-N Ser-Glu-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O HJEBZBMOTCQYDN-ACZMJKKPSA-N 0.000 description 1
- BRGQQXQKPUCUJQ-KBIXCLLPSA-N Ser-Glu-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O BRGQQXQKPUCUJQ-KBIXCLLPSA-N 0.000 description 1
- DSGYZICNAMEJOC-AVGNSLFASA-N Ser-Glu-Phe Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O DSGYZICNAMEJOC-AVGNSLFASA-N 0.000 description 1
- UFKPDBLKLOBMRH-XHNCKOQMSA-N Ser-Glu-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CO)N)C(=O)O UFKPDBLKLOBMRH-XHNCKOQMSA-N 0.000 description 1
- SFTZWNJFZYOLBD-ZDLURKLDSA-N Ser-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CO SFTZWNJFZYOLBD-ZDLURKLDSA-N 0.000 description 1
- CXBFHZLODKPIJY-AAEUAGOBSA-N Ser-Gly-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CO)N CXBFHZLODKPIJY-AAEUAGOBSA-N 0.000 description 1
- QBUWQRKEHJXTOP-DCAQKATOSA-N Ser-His-Arg Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O QBUWQRKEHJXTOP-DCAQKATOSA-N 0.000 description 1
- JEHPKECJCALLRW-CUJWVEQBSA-N Ser-His-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O JEHPKECJCALLRW-CUJWVEQBSA-N 0.000 description 1
- SFTZTYBXIXLRGQ-JBDRJPRFSA-N Ser-Ile-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O SFTZTYBXIXLRGQ-JBDRJPRFSA-N 0.000 description 1
- BKZYBLLIBOBOOW-GHCJXIJMSA-N Ser-Ile-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O BKZYBLLIBOBOOW-GHCJXIJMSA-N 0.000 description 1
- DOSZISJPMCYEHT-NAKRPEOUSA-N Ser-Ile-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O DOSZISJPMCYEHT-NAKRPEOUSA-N 0.000 description 1
- XNCUYZKGQOCOQH-YUMQZZPRSA-N Ser-Leu-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O XNCUYZKGQOCOQH-YUMQZZPRSA-N 0.000 description 1
- VMLONWHIORGALA-SRVKXCTJSA-N Ser-Leu-Leu Chemical compound CC(C)C[C@@H](C([O-])=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H]([NH3+])CO VMLONWHIORGALA-SRVKXCTJSA-N 0.000 description 1
- UBRMZSHOOIVJPW-SRVKXCTJSA-N Ser-Leu-Lys Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(O)=O UBRMZSHOOIVJPW-SRVKXCTJSA-N 0.000 description 1
- JWOBLHJRDADHLN-KKUMJFAQSA-N Ser-Leu-Tyr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O JWOBLHJRDADHLN-KKUMJFAQSA-N 0.000 description 1
- JLPMFVAIQHCBDC-CIUDSAMLSA-N Ser-Lys-Cys Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CO)N JLPMFVAIQHCBDC-CIUDSAMLSA-N 0.000 description 1
- OWCVUSJMEBGMOK-YUMQZZPRSA-N Ser-Lys-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O OWCVUSJMEBGMOK-YUMQZZPRSA-N 0.000 description 1
- LRWBCWGEUCKDTN-BJDJZHNGSA-N Ser-Lys-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O LRWBCWGEUCKDTN-BJDJZHNGSA-N 0.000 description 1
- OCWWJBZQXGYQCA-DCAQKATOSA-N Ser-Lys-Met Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCSC)C(O)=O OCWWJBZQXGYQCA-DCAQKATOSA-N 0.000 description 1
- PTWIYDNFWPXQSD-GARJFASQSA-N Ser-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)[C@H](CO)N)C(=O)O PTWIYDNFWPXQSD-GARJFASQSA-N 0.000 description 1
- PMCMLDNPAZUYGI-DCAQKATOSA-N Ser-Lys-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O PMCMLDNPAZUYGI-DCAQKATOSA-N 0.000 description 1
- IFLVBVIYADZIQO-DCAQKATOSA-N Ser-Met-Lys Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CO)N IFLVBVIYADZIQO-DCAQKATOSA-N 0.000 description 1
- RXSWQCATLWVDLI-XGEHTFHBSA-N Ser-Met-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O RXSWQCATLWVDLI-XGEHTFHBSA-N 0.000 description 1
- JJUNLJTUIKFPRF-BPUTZDHNSA-N Ser-Met-Trp Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CO)N JJUNLJTUIKFPRF-BPUTZDHNSA-N 0.000 description 1
- HJAXVYLCKDPPDF-SRVKXCTJSA-N Ser-Phe-Cys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CO)N HJAXVYLCKDPPDF-SRVKXCTJSA-N 0.000 description 1
- XKFJENWJGHMDLI-QWRGUYRKSA-N Ser-Phe-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)NCC(O)=O XKFJENWJGHMDLI-QWRGUYRKSA-N 0.000 description 1
- FBLNYDYPCLFTSP-IXOXFDKPSA-N Ser-Phe-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O FBLNYDYPCLFTSP-IXOXFDKPSA-N 0.000 description 1
- XQAPEISNMXNKGE-FXQIFTODSA-N Ser-Pro-Cys Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CO)N)C(=O)N[C@@H](CS)C(=O)O XQAPEISNMXNKGE-FXQIFTODSA-N 0.000 description 1
- AZWNCEBQZXELEZ-FXQIFTODSA-N Ser-Pro-Ser Chemical compound OC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O AZWNCEBQZXELEZ-FXQIFTODSA-N 0.000 description 1
- XGQKSRGHEZNWIS-IHRRRGAJSA-N Ser-Pro-Tyr Chemical compound N[C@@H](CO)C(=O)N1CCC[C@H]1C(=O)N[C@@H](Cc1ccc(O)cc1)C(O)=O XGQKSRGHEZNWIS-IHRRRGAJSA-N 0.000 description 1
- CKDXFSPMIDSMGV-GUBZILKMSA-N Ser-Pro-Val Chemical compound [H]N[C@@H](CO)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O CKDXFSPMIDSMGV-GUBZILKMSA-N 0.000 description 1
- KQNDIKOYWZTZIX-FXQIFTODSA-N Ser-Ser-Arg Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCNC(N)=N KQNDIKOYWZTZIX-FXQIFTODSA-N 0.000 description 1
- WLJPJRGQRNCIQS-ZLUOBGJFSA-N Ser-Ser-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O WLJPJRGQRNCIQS-ZLUOBGJFSA-N 0.000 description 1
- PPCZVWHJWJFTFN-ZLUOBGJFSA-N Ser-Ser-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O PPCZVWHJWJFTFN-ZLUOBGJFSA-N 0.000 description 1
- GYDFRTRSSXOZCR-ACZMJKKPSA-N Ser-Ser-Glu Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O GYDFRTRSSXOZCR-ACZMJKKPSA-N 0.000 description 1
- SRSPTFBENMJHMR-WHFBIAKZSA-N Ser-Ser-Gly Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O SRSPTFBENMJHMR-WHFBIAKZSA-N 0.000 description 1
- VGQVAVQWKJLIRM-FXQIFTODSA-N Ser-Ser-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O VGQVAVQWKJLIRM-FXQIFTODSA-N 0.000 description 1
- SZRNDHWMVSFPSP-XKBZYTNZSA-N Ser-Thr-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CO)N)O SZRNDHWMVSFPSP-XKBZYTNZSA-N 0.000 description 1
- KKKVOZNCLALMPV-XKBZYTNZSA-N Ser-Thr-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O KKKVOZNCLALMPV-XKBZYTNZSA-N 0.000 description 1
- NADLKBTYNKUJEP-KATARQTJSA-N Ser-Thr-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O NADLKBTYNKUJEP-KATARQTJSA-N 0.000 description 1
- ZSDXEKUKQAKZFE-XAVMHZPKSA-N Ser-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CO)N)O ZSDXEKUKQAKZFE-XAVMHZPKSA-N 0.000 description 1
- AXKJPUBALUNJEO-UBHSHLNASA-N Ser-Trp-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC(N)=O)C(O)=O AXKJPUBALUNJEO-UBHSHLNASA-N 0.000 description 1
- FRPNVPKQVFHSQY-BPUTZDHNSA-N Ser-Trp-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@H](CO)N FRPNVPKQVFHSQY-BPUTZDHNSA-N 0.000 description 1
- HAUVENOGHPECML-BPUTZDHNSA-N Ser-Trp-Val Chemical compound C1=CC=C2C(C[C@@H](C(=O)N[C@@H](C(C)C)C(O)=O)NC(=O)[C@@H](N)CO)=CNC2=C1 HAUVENOGHPECML-BPUTZDHNSA-N 0.000 description 1
- PIQRHJQWEPWFJG-UWJYBYFXSA-N Ser-Tyr-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C)C(O)=O PIQRHJQWEPWFJG-UWJYBYFXSA-N 0.000 description 1
- VEVYMLNYMULSMS-AVGNSLFASA-N Ser-Tyr-Gln Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O VEVYMLNYMULSMS-AVGNSLFASA-N 0.000 description 1
- OSFZCEQJLWCIBG-BZSNNMDCSA-N Ser-Tyr-Tyr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O OSFZCEQJLWCIBG-BZSNNMDCSA-N 0.000 description 1
- HAYADTTXNZFUDM-IHRRRGAJSA-N Ser-Tyr-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(O)=O HAYADTTXNZFUDM-IHRRRGAJSA-N 0.000 description 1
- UKKROEYWYIHWBD-ZKWXMUAHSA-N Ser-Val-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O UKKROEYWYIHWBD-ZKWXMUAHSA-N 0.000 description 1
- BEBVVQPDSHHWQL-NRPADANISA-N Ser-Val-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O BEBVVQPDSHHWQL-NRPADANISA-N 0.000 description 1
- MFQMZDPAZRZAPV-NAKRPEOUSA-N Ser-Val-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CO)N MFQMZDPAZRZAPV-NAKRPEOUSA-N 0.000 description 1
- ODRUTDLAONAVDV-IHRRRGAJSA-N Ser-Val-Tyr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O ODRUTDLAONAVDV-IHRRRGAJSA-N 0.000 description 1
- 102000007562 Serum Albumin Human genes 0.000 description 1
- 108010071390 Serum Albumin Proteins 0.000 description 1
- 101001024637 Severe acute respiratory syndrome coronavirus 2 Nucleoprotein Proteins 0.000 description 1
- 101000992426 Severe acute respiratory syndrome coronavirus 2 ORF9b protein Proteins 0.000 description 1
- BQCADISMDOOEFD-UHFFFAOYSA-N Silver Chemical compound [Ag] BQCADISMDOOEFD-UHFFFAOYSA-N 0.000 description 1
- 239000004138 Stearyl citrate Substances 0.000 description 1
- 101710172711 Structural protein Proteins 0.000 description 1
- 239000012505 Superdex™ Substances 0.000 description 1
- IGROJMCBGRFRGI-YTLHQDLWSA-N Thr-Ala-Ala Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O IGROJMCBGRFRGI-YTLHQDLWSA-N 0.000 description 1
- NJEMRSFGDNECGF-GCJQMDKQSA-N Thr-Ala-Asp Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC(O)=O NJEMRSFGDNECGF-GCJQMDKQSA-N 0.000 description 1
- YRNBANYVJJBGDI-VZFHVOOUSA-N Thr-Ala-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](CS)C(=O)O)N)O YRNBANYVJJBGDI-VZFHVOOUSA-N 0.000 description 1
- STGXWWBXWXZOER-MBLNEYKQSA-N Thr-Ala-His Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 STGXWWBXWXZOER-MBLNEYKQSA-N 0.000 description 1
- BSNZTJXVDOINSR-JXUBOQSCSA-N Thr-Ala-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O BSNZTJXVDOINSR-JXUBOQSCSA-N 0.000 description 1
- KEGBFULVYKYJRD-LFSVMHDDSA-N Thr-Ala-Phe Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 KEGBFULVYKYJRD-LFSVMHDDSA-N 0.000 description 1
- DWYAUVCQDTZIJI-VZFHVOOUSA-N Thr-Ala-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O DWYAUVCQDTZIJI-VZFHVOOUSA-N 0.000 description 1
- DGDCHPCRMWEOJR-FQPOAREZSA-N Thr-Ala-Tyr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 DGDCHPCRMWEOJR-FQPOAREZSA-N 0.000 description 1
- JMZKMSTYXHFYAK-VEVYYDQMSA-N Thr-Arg-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC(=O)N)C(=O)O)N)O JMZKMSTYXHFYAK-VEVYYDQMSA-N 0.000 description 1
- JMQUAZXYFAEOIH-XGEHTFHBSA-N Thr-Arg-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CS)C(=O)O)N)O JMQUAZXYFAEOIH-XGEHTFHBSA-N 0.000 description 1
- TWLMXDWFVNEFFK-FJXKBIBVSA-N Thr-Arg-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O TWLMXDWFVNEFFK-FJXKBIBVSA-N 0.000 description 1
- VFEHSAJCWWHDBH-RHYQMDGZSA-N Thr-Arg-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O VFEHSAJCWWHDBH-RHYQMDGZSA-N 0.000 description 1
- WFUAUEQXPVNAEF-ZJDVBMNYSA-N Thr-Arg-Thr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H]([C@@H](C)O)C(O)=O)CCCN=C(N)N WFUAUEQXPVNAEF-ZJDVBMNYSA-N 0.000 description 1
- VASYSJHSMSBTDU-LKXGYXEUSA-N Thr-Asn-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CS)C(=O)O)N)O VASYSJHSMSBTDU-LKXGYXEUSA-N 0.000 description 1
- PQLXHSACXPGWPD-GSSVUCPTSA-N Thr-Asn-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PQLXHSACXPGWPD-GSSVUCPTSA-N 0.000 description 1
- PZVGOVRNGKEFCB-KKHAAJSZSA-N Thr-Asn-Val Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](C(C)C)C(=O)O)N)O PZVGOVRNGKEFCB-KKHAAJSZSA-N 0.000 description 1
- YOSLMIPKOUAHKI-OLHMAJIHSA-N Thr-Asp-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O YOSLMIPKOUAHKI-OLHMAJIHSA-N 0.000 description 1
- JEDIEMIJYSRUBB-FOHZUACHSA-N Thr-Asp-Gly Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O JEDIEMIJYSRUBB-FOHZUACHSA-N 0.000 description 1
- GKMYGVQDGVYCPC-IUKAMOBKSA-N Thr-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H]([C@@H](C)O)N GKMYGVQDGVYCPC-IUKAMOBKSA-N 0.000 description 1
- ZUUDNCOCILSYAM-KKHAAJSZSA-N Thr-Asp-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O ZUUDNCOCILSYAM-KKHAAJSZSA-N 0.000 description 1
- NRUPKQSXTJNQGD-XGEHTFHBSA-N Thr-Cys-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O NRUPKQSXTJNQGD-XGEHTFHBSA-N 0.000 description 1
- DGOJNGCGEYOBKN-BWBBJGPYSA-N Thr-Cys-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CS)C(=O)O)N)O DGOJNGCGEYOBKN-BWBBJGPYSA-N 0.000 description 1
- ZLNWJMRLHLGKFX-SVSWQMSJSA-N Thr-Cys-Ile Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CS)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O ZLNWJMRLHLGKFX-SVSWQMSJSA-N 0.000 description 1
- OYTNZCBFDXGQGE-XQXXSGGOSA-N Thr-Gln-Ala Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](C)C(=O)O)N)O OYTNZCBFDXGQGE-XQXXSGGOSA-N 0.000 description 1
- QILPDQCTQZDHFM-HJGDQZAQSA-N Thr-Gln-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O QILPDQCTQZDHFM-HJGDQZAQSA-N 0.000 description 1
- ZQUKYJOKQBRBCS-GLLZPBPUSA-N Thr-Gln-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N)O ZQUKYJOKQBRBCS-GLLZPBPUSA-N 0.000 description 1
- VUVCRYXYUUPGSB-GLLZPBPUSA-N Thr-Gln-Glu Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N)O VUVCRYXYUUPGSB-GLLZPBPUSA-N 0.000 description 1
- LIXBDERDAGNVAV-XKBZYTNZSA-N Thr-Gln-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O LIXBDERDAGNVAV-XKBZYTNZSA-N 0.000 description 1
- VGYBYGQXZJDZJU-XQXXSGGOSA-N Thr-Glu-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O VGYBYGQXZJDZJU-XQXXSGGOSA-N 0.000 description 1
- WDFPMSHYMRBLKM-NKIYYHGXSA-N Thr-Glu-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N)O WDFPMSHYMRBLKM-NKIYYHGXSA-N 0.000 description 1
- JMGJDTNUMAZNLX-RWRJDSDZSA-N Thr-Glu-Ile Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JMGJDTNUMAZNLX-RWRJDSDZSA-N 0.000 description 1
- XOTBWOCSLMBGMF-SUSMZKCASA-N Thr-Glu-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XOTBWOCSLMBGMF-SUSMZKCASA-N 0.000 description 1
- YZUWGFXVVZQJEI-PMVVWTBXSA-N Thr-Gly-His Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N)O YZUWGFXVVZQJEI-PMVVWTBXSA-N 0.000 description 1
- MPUMPERGHHJGRP-WEDXCCLWSA-N Thr-Gly-Lys Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)N[C@@H](CCCCN)C(=O)O)N)O MPUMPERGHHJGRP-WEDXCCLWSA-N 0.000 description 1
- KBBRNEDOYWMIJP-KYNKHSRBSA-N Thr-Gly-Thr Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(=O)O)N)O KBBRNEDOYWMIJP-KYNKHSRBSA-N 0.000 description 1
- IGGFFPOIFHZYKC-PBCZWWQYSA-N Thr-His-Asp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CC(=O)O)C(=O)O)N)O IGGFFPOIFHZYKC-PBCZWWQYSA-N 0.000 description 1
- FDALPRWYVKJCLL-PMVVWTBXSA-N Thr-His-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)NCC(O)=O FDALPRWYVKJCLL-PMVVWTBXSA-N 0.000 description 1
- SXAGUVRFGJSFKC-ZEILLAHLSA-N Thr-His-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SXAGUVRFGJSFKC-ZEILLAHLSA-N 0.000 description 1
- XTCNBOBTROGWMW-RWRJDSDZSA-N Thr-Ile-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H]([C@@H](C)O)N XTCNBOBTROGWMW-RWRJDSDZSA-N 0.000 description 1
- URPSJRMWHQTARR-MBLNEYKQSA-N Thr-Ile-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O URPSJRMWHQTARR-MBLNEYKQSA-N 0.000 description 1
- ADPHPKGWVDHWML-PPCPHDFISA-N Thr-Ile-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)O)NC(=O)[C@H]([C@@H](C)O)N ADPHPKGWVDHWML-PPCPHDFISA-N 0.000 description 1
- GXUWHVZYDAHFSV-FLBSBUHZSA-N Thr-Ile-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GXUWHVZYDAHFSV-FLBSBUHZSA-N 0.000 description 1
- IHAPJUHCZXBPHR-WZLNRYEVSA-N Thr-Ile-Tyr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)[C@H]([C@@H](C)O)N IHAPJUHCZXBPHR-WZLNRYEVSA-N 0.000 description 1
- RRRRCRYTLZVCEN-HJGDQZAQSA-N Thr-Leu-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O RRRRCRYTLZVCEN-HJGDQZAQSA-N 0.000 description 1
- HOVLHEKTGVIKAP-WDCWCFNPSA-N Thr-Leu-Gln Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O HOVLHEKTGVIKAP-WDCWCFNPSA-N 0.000 description 1
- FLPZMPOZGYPBEN-PPCPHDFISA-N Thr-Leu-Ile Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O FLPZMPOZGYPBEN-PPCPHDFISA-N 0.000 description 1
- ZSPQUTWLWGWTPS-HJGDQZAQSA-N Thr-Lys-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O ZSPQUTWLWGWTPS-HJGDQZAQSA-N 0.000 description 1
- HPQHHRLWSAMMKG-KATARQTJSA-N Thr-Lys-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CS)C(=O)O)N)O HPQHHRLWSAMMKG-KATARQTJSA-N 0.000 description 1
- SCSVNSNWUTYSFO-WDCWCFNPSA-N Thr-Lys-Glu Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O SCSVNSNWUTYSFO-WDCWCFNPSA-N 0.000 description 1
- SPVHQURZJCUDQC-VOAKCMCISA-N Thr-Lys-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O SPVHQURZJCUDQC-VOAKCMCISA-N 0.000 description 1
- QNCFWHZVRNXAKW-OEAJRASXSA-N Thr-Lys-Phe Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O QNCFWHZVRNXAKW-OEAJRASXSA-N 0.000 description 1
- WFAUDCSNCWJJAA-KXNHARMFSA-N Thr-Lys-Pro Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N1CCC[C@@H]1C(O)=O WFAUDCSNCWJJAA-KXNHARMFSA-N 0.000 description 1
- SIEZEMFJLYRUMK-YTWAJWBKSA-N Thr-Met-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCSC)C(=O)N1CCC[C@@H]1C(=O)O)N)O SIEZEMFJLYRUMK-YTWAJWBKSA-N 0.000 description 1
- KPNSNVTUVKSBFL-ZJDVBMNYSA-N Thr-Met-Thr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N)O KPNSNVTUVKSBFL-ZJDVBMNYSA-N 0.000 description 1
- BIBYEFRASCNLAA-CDMKHQONSA-N Thr-Phe-Gly Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CC=CC=C1 BIBYEFRASCNLAA-CDMKHQONSA-N 0.000 description 1
- WTMPKZWHRCMMMT-KZVJFYERSA-N Thr-Pro-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O WTMPKZWHRCMMMT-KZVJFYERSA-N 0.000 description 1
- NYQIZWROIMIQSL-VEVYYDQMSA-N Thr-Pro-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O NYQIZWROIMIQSL-VEVYYDQMSA-N 0.000 description 1
- XKWABWFMQXMUMT-HJGDQZAQSA-N Thr-Pro-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O XKWABWFMQXMUMT-HJGDQZAQSA-N 0.000 description 1
- GFRIEEKFXOVPIR-RHYQMDGZSA-N Thr-Pro-Lys Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(O)=O GFRIEEKFXOVPIR-RHYQMDGZSA-N 0.000 description 1
- STUAPCLEDMKXKL-LKXGYXEUSA-N Thr-Ser-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O STUAPCLEDMKXKL-LKXGYXEUSA-N 0.000 description 1
- BCYUHPXBHCUYBA-CUJWVEQBSA-N Thr-Ser-His Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](Cc1cnc[nH]1)C(O)=O BCYUHPXBHCUYBA-CUJWVEQBSA-N 0.000 description 1
- IQPWNQRRAJHOKV-KATARQTJSA-N Thr-Ser-Lys Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCCN IQPWNQRRAJHOKV-KATARQTJSA-N 0.000 description 1
- TZQWJCGVCIJDMU-HEIBUPTGSA-N Thr-Thr-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CS)C(=O)O)N)O TZQWJCGVCIJDMU-HEIBUPTGSA-N 0.000 description 1
- NLWDSYKZUPRMBJ-IEGACIPQSA-N Thr-Trp-Leu Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CC(C)C)C(=O)O)N)O NLWDSYKZUPRMBJ-IEGACIPQSA-N 0.000 description 1
- BEZTUFWTPVOROW-KJEVXHAQSA-N Thr-Tyr-Arg Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N)O BEZTUFWTPVOROW-KJEVXHAQSA-N 0.000 description 1
- CJEHCEOXPLASCK-MEYUZBJRSA-N Thr-Tyr-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)[C@H](O)C)CC1=CC=C(O)C=C1 CJEHCEOXPLASCK-MEYUZBJRSA-N 0.000 description 1
- XVHAUVJXBFGUPC-RPTUDFQQSA-N Thr-Tyr-Phe Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O XVHAUVJXBFGUPC-RPTUDFQQSA-N 0.000 description 1
- KVEWWQRTAVMOFT-KJEVXHAQSA-N Thr-Tyr-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(O)=O KVEWWQRTAVMOFT-KJEVXHAQSA-N 0.000 description 1
- OGOYMQWIWHGTGH-KZVJFYERSA-N Thr-Val-Ala Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O OGOYMQWIWHGTGH-KZVJFYERSA-N 0.000 description 1
- SPIFGZFZMVLPHN-UNQGMJICSA-N Thr-Val-Phe Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O SPIFGZFZMVLPHN-UNQGMJICSA-N 0.000 description 1
- BPGDJSUFQKWUBK-KJEVXHAQSA-N Thr-Val-Tyr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 BPGDJSUFQKWUBK-KJEVXHAQSA-N 0.000 description 1
- 239000007983 Tris buffer Substances 0.000 description 1
- QAXCHNZDPLSFPC-PJODQICGSA-N Trp-Ala-Arg Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O)=CNC2=C1 QAXCHNZDPLSFPC-PJODQICGSA-N 0.000 description 1
- IBBBOLAPFHRDHW-BPUTZDHNSA-N Trp-Asn-Arg Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N IBBBOLAPFHRDHW-BPUTZDHNSA-N 0.000 description 1
- UKINEYBQXPMOJO-UBHSHLNASA-N Trp-Asn-Ser Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CO)C(=O)O)N UKINEYBQXPMOJO-UBHSHLNASA-N 0.000 description 1
- XZSJDSBPEJBEFZ-QRTARXTBSA-N Trp-Asn-Val Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O XZSJDSBPEJBEFZ-QRTARXTBSA-N 0.000 description 1
- UJRIVCPPPMYCNA-HOCLYGCPSA-N Trp-Leu-Gly Chemical compound CC(C)C[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N UJRIVCPPPMYCNA-HOCLYGCPSA-N 0.000 description 1
- CCZXBOFIBYQLEV-IHPCNDPISA-N Trp-Leu-Leu Chemical compound CC(C)C[C@H](NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)Cc1c[nH]c2ccccc12)C(O)=O CCZXBOFIBYQLEV-IHPCNDPISA-N 0.000 description 1
- GRSCONMARGNYHA-PMVMPFDFSA-N Trp-Lys-Phe Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O GRSCONMARGNYHA-PMVMPFDFSA-N 0.000 description 1
- FHHYVSCGOMPLLO-IHPCNDPISA-N Trp-Tyr-Asp Chemical compound C([C@H](NC(=O)[C@H](CC=1C2=CC=CC=C2NC=1)N)C(=O)N[C@@H](CC(O)=O)C(O)=O)C1=CC=C(O)C=C1 FHHYVSCGOMPLLO-IHPCNDPISA-N 0.000 description 1
- NSOMQRHZMJMZIE-GVARAGBVSA-N Tyr-Ala-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 NSOMQRHZMJMZIE-GVARAGBVSA-N 0.000 description 1
- XGEUYEOEZYFHRL-KKXDTOCCSA-N Tyr-Ala-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=C(O)C=C1 XGEUYEOEZYFHRL-KKXDTOCCSA-N 0.000 description 1
- LGEYOIQBBIPHQN-UWJYBYFXSA-N Tyr-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 LGEYOIQBBIPHQN-UWJYBYFXSA-N 0.000 description 1
- NOXKHHXSHQFSGJ-FQPOAREZSA-N Tyr-Ala-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 NOXKHHXSHQFSGJ-FQPOAREZSA-N 0.000 description 1
- AKFLVKKWVZMFOT-IHRRRGAJSA-N Tyr-Arg-Asn Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(O)=O AKFLVKKWVZMFOT-IHRRRGAJSA-N 0.000 description 1
- DWJQKEZKLQCHKO-SRVKXCTJSA-N Tyr-Asn-Cys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CS)C(=O)O)N)O DWJQKEZKLQCHKO-SRVKXCTJSA-N 0.000 description 1
- PEVVXUGSAKEPEN-AVGNSLFASA-N Tyr-Asn-Glu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O PEVVXUGSAKEPEN-AVGNSLFASA-N 0.000 description 1
- AYHSJESDFKREAR-KKUMJFAQSA-N Tyr-Asn-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 AYHSJESDFKREAR-KKUMJFAQSA-N 0.000 description 1
- SCCKSNREWHMKOJ-SRVKXCTJSA-N Tyr-Asn-Ser Chemical compound N[C@@H](Cc1ccc(O)cc1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O SCCKSNREWHMKOJ-SRVKXCTJSA-N 0.000 description 1
- ZNFPUOSTMUMUDR-JRQIVUDYSA-N Tyr-Asn-Thr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ZNFPUOSTMUMUDR-JRQIVUDYSA-N 0.000 description 1
- GAYLGYUVTDMLKC-UWJYBYFXSA-N Tyr-Asp-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 GAYLGYUVTDMLKC-UWJYBYFXSA-N 0.000 description 1
- RCLOWEZASFJFEX-KKUMJFAQSA-N Tyr-Asp-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 RCLOWEZASFJFEX-KKUMJFAQSA-N 0.000 description 1
- WPVGRKLNHJJCEN-BZSNNMDCSA-N Tyr-Asp-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=C(O)C=C1 WPVGRKLNHJJCEN-BZSNNMDCSA-N 0.000 description 1
- TZXFLDNBYYGLKA-BZSNNMDCSA-N Tyr-Asp-Tyr Chemical compound C([C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=C(O)C=C1 TZXFLDNBYYGLKA-BZSNNMDCSA-N 0.000 description 1
- BVDHHLMIZFCAAU-BZSNNMDCSA-N Tyr-Cys-Phe Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O BVDHHLMIZFCAAU-BZSNNMDCSA-N 0.000 description 1
- NQJDICVXXIMMMB-XDTLVQLUSA-N Tyr-Glu-Ala Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O NQJDICVXXIMMMB-XDTLVQLUSA-N 0.000 description 1
- LOOCQRRBKZTPKO-AVGNSLFASA-N Tyr-Glu-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 LOOCQRRBKZTPKO-AVGNSLFASA-N 0.000 description 1
- IWRMTNJCCMEBEX-AVGNSLFASA-N Tyr-Glu-Cys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CS)C(=O)O)N)O IWRMTNJCCMEBEX-AVGNSLFASA-N 0.000 description 1
- CDHQEOXPWBDFPL-QWRGUYRKSA-N Tyr-Gly-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC1=CC=C(O)C=C1 CDHQEOXPWBDFPL-QWRGUYRKSA-N 0.000 description 1
- AKLNEFNQWLHIGY-QWRGUYRKSA-N Tyr-Gly-Asp Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)NCC(=O)N[C@@H](CC(=O)O)C(=O)O)N)O AKLNEFNQWLHIGY-QWRGUYRKSA-N 0.000 description 1
- FNWGDMZVYBVAGJ-XEGUGMAKSA-N Tyr-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CC1=CC=C(C=C1)O)N FNWGDMZVYBVAGJ-XEGUGMAKSA-N 0.000 description 1
- AZGZDDNKFFUDEH-QWRGUYRKSA-N Tyr-Gly-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC1=CC=C(O)C=C1 AZGZDDNKFFUDEH-QWRGUYRKSA-N 0.000 description 1
- FBHBVXUBTYVCRU-BZSNNMDCSA-N Tyr-His-Leu Chemical compound C([C@@H](C(=O)N[C@@H](CC(C)C)C(O)=O)NC(=O)[C@@H](N)CC=1C=CC(O)=CC=1)C1=CN=CN1 FBHBVXUBTYVCRU-BZSNNMDCSA-N 0.000 description 1
- WVGKPKDWYQXWLU-BZSNNMDCSA-N Tyr-His-Lys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)N[C@@H](CCCCN)C(=O)O)N)O WVGKPKDWYQXWLU-BZSNNMDCSA-N 0.000 description 1
- WSFXJLFSJSXGMQ-MGHWNKPDSA-N Tyr-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N WSFXJLFSJSXGMQ-MGHWNKPDSA-N 0.000 description 1
- NKUGCYDFQKFVOJ-JYJNAYRXSA-N Tyr-Leu-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 NKUGCYDFQKFVOJ-JYJNAYRXSA-N 0.000 description 1
- DWAMXBFJNZIHMC-KBPBESRZSA-N Tyr-Leu-Gly Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O DWAMXBFJNZIHMC-KBPBESRZSA-N 0.000 description 1
- ZOBLBMGJKVJVEV-BZSNNMDCSA-N Tyr-Lys-Lys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)O)N)O ZOBLBMGJKVJVEV-BZSNNMDCSA-N 0.000 description 1
- CWVHKVVKAQIJKY-ACRUOGEOSA-N Tyr-Lys-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC2=CC=C(C=C2)O)N CWVHKVVKAQIJKY-ACRUOGEOSA-N 0.000 description 1
- PMHLLBKTDHQMCY-ULQDDVLXSA-N Tyr-Lys-Val Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O PMHLLBKTDHQMCY-ULQDDVLXSA-N 0.000 description 1
- LMKKMCGTDANZTR-BZSNNMDCSA-N Tyr-Phe-Asp Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(=O)N[C@@H](CC(O)=O)C(O)=O)C1=CC=C(O)C=C1 LMKKMCGTDANZTR-BZSNNMDCSA-N 0.000 description 1
- OKDNSNWJEXAMSU-IRXDYDNUSA-N Tyr-Phe-Gly Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(=O)NCC(O)=O)C1=CC=C(O)C=C1 OKDNSNWJEXAMSU-IRXDYDNUSA-N 0.000 description 1
- FASACHWGQBNSRO-ZEWNOJEFSA-N Tyr-Phe-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](CC2=CC=C(C=C2)O)N FASACHWGQBNSRO-ZEWNOJEFSA-N 0.000 description 1
- FGVFBDZSGQTYQX-UFYCRDLUSA-N Tyr-Phe-Val Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(O)=O FGVFBDZSGQTYQX-UFYCRDLUSA-N 0.000 description 1
- CDBXVDXSLPLFMD-BPNCWPANSA-N Tyr-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CC1=CC=C(O)C=C1 CDBXVDXSLPLFMD-BPNCWPANSA-N 0.000 description 1
- PYJKETPLFITNKS-IHRRRGAJSA-N Tyr-Pro-Asn Chemical compound N[C@@H](Cc1ccc(O)cc1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O PYJKETPLFITNKS-IHRRRGAJSA-N 0.000 description 1
- XJPXTYLVMUZGNW-IHRRRGAJSA-N Tyr-Pro-Asp Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O XJPXTYLVMUZGNW-IHRRRGAJSA-N 0.000 description 1
- SOEGLGLDSUHWTI-STECZYCISA-N Tyr-Pro-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CC1=CC=C(O)C=C1 SOEGLGLDSUHWTI-STECZYCISA-N 0.000 description 1
- MNWINJDPGBNOED-ULQDDVLXSA-N Tyr-Pro-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CC1=CC=C(O)C=C1 MNWINJDPGBNOED-ULQDDVLXSA-N 0.000 description 1
- GQVZBMROTPEPIF-SRVKXCTJSA-N Tyr-Ser-Asp Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O GQVZBMROTPEPIF-SRVKXCTJSA-N 0.000 description 1
- MQGGXGKQSVEQHR-KKUMJFAQSA-N Tyr-Ser-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 MQGGXGKQSVEQHR-KKUMJFAQSA-N 0.000 description 1
- LUMQYLVYUIRHHU-YJRXYDGGSA-N Tyr-Ser-Thr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LUMQYLVYUIRHHU-YJRXYDGGSA-N 0.000 description 1
- RIVVDNTUSRVTQT-IRIUXVKKSA-N Tyr-Thr-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N)O RIVVDNTUSRVTQT-IRIUXVKKSA-N 0.000 description 1
- XFEMMSGONWQACR-KJEVXHAQSA-N Tyr-Thr-Met Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N)O XFEMMSGONWQACR-KJEVXHAQSA-N 0.000 description 1
- SQUMHUZLJDUROQ-YDHLFZDLSA-N Tyr-Val-Asp Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O SQUMHUZLJDUROQ-YDHLFZDLSA-N 0.000 description 1
- KLOZTPOXVVRVAQ-DZKIICNBSA-N Tyr-Val-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 KLOZTPOXVVRVAQ-DZKIICNBSA-N 0.000 description 1
- GOPQNCQSXBJAII-ULQDDVLXSA-N Tyr-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N GOPQNCQSXBJAII-ULQDDVLXSA-N 0.000 description 1
- RVGVIWNHABGIFH-IHRRRGAJSA-N Tyr-Val-Ser Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O RVGVIWNHABGIFH-IHRRRGAJSA-N 0.000 description 1
- CCEVJBJLPRNAFH-BVSLBCMMSA-N Tyr-Val-Trp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CC3=CC=C(C=C3)O)N CCEVJBJLPRNAFH-BVSLBCMMSA-N 0.000 description 1
- YKBUNNNRNZZUID-UFYCRDLUSA-N Tyr-Val-Tyr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O YKBUNNNRNZZUID-UFYCRDLUSA-N 0.000 description 1
- 108010064997 VPY tripeptide Proteins 0.000 description 1
- SMKXLHVZIFKQRB-GUBZILKMSA-N Val-Ala-Met Chemical compound C[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](C(C)C)N SMKXLHVZIFKQRB-GUBZILKMSA-N 0.000 description 1
- ZLFHAAGHGQBQQN-GUBZILKMSA-N Val-Ala-Pro Natural products CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N1CCC[C@H]1C(O)=O ZLFHAAGHGQBQQN-GUBZILKMSA-N 0.000 description 1
- UUYCNAXCCDNULB-QXEWZRGKSA-N Val-Arg-Asn Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC(N)=O)C(O)=O UUYCNAXCCDNULB-QXEWZRGKSA-N 0.000 description 1
- PFNZJEPSCBAVGX-CYDGBPFRSA-N Val-Arg-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](C(C)C)N PFNZJEPSCBAVGX-CYDGBPFRSA-N 0.000 description 1
- QPZMOUMNTGTEFR-ZKWXMUAHSA-N Val-Asn-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](C(C)C)N QPZMOUMNTGTEFR-ZKWXMUAHSA-N 0.000 description 1
- UDLYXGYWTVOIKU-QXEWZRGKSA-N Val-Asn-Arg Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N UDLYXGYWTVOIKU-QXEWZRGKSA-N 0.000 description 1
- DBOXBUDEAJVKRE-LSJOCFKGSA-N Val-Asn-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](C(C)C)C(=O)O)N DBOXBUDEAJVKRE-LSJOCFKGSA-N 0.000 description 1
- XQVRMLRMTAGSFJ-QXEWZRGKSA-N Val-Asp-Arg Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N XQVRMLRMTAGSFJ-QXEWZRGKSA-N 0.000 description 1
- HZYOWMGWKKRMBZ-BYULHYEWSA-N Val-Asp-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N HZYOWMGWKKRMBZ-BYULHYEWSA-N 0.000 description 1
- KXUKIBHIVRYOIP-ZKWXMUAHSA-N Val-Asp-Cys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CS)C(=O)O)N KXUKIBHIVRYOIP-ZKWXMUAHSA-N 0.000 description 1
- TZVUSFMQWPWHON-NHCYSSNCSA-N Val-Asp-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](C(C)C)N TZVUSFMQWPWHON-NHCYSSNCSA-N 0.000 description 1
- BMGOFDMKDVVGJG-NHCYSSNCSA-N Val-Asp-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N BMGOFDMKDVVGJG-NHCYSSNCSA-N 0.000 description 1
- XKVXSCHXGJOQND-ZOBUZTSGSA-N Val-Asp-Trp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N XKVXSCHXGJOQND-ZOBUZTSGSA-N 0.000 description 1
- SCBITHMBEJNRHC-LSJOCFKGSA-N Val-Asp-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](C(C)C)C(=O)O)N SCBITHMBEJNRHC-LSJOCFKGSA-N 0.000 description 1
- PFMAFMPJJSHNDW-ZKWXMUAHSA-N Val-Cys-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(=O)N)C(=O)O)N PFMAFMPJJSHNDW-ZKWXMUAHSA-N 0.000 description 1
- FPCIBLUVDNXPJO-XPUUQOCRSA-N Val-Cys-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CS)C(=O)NCC(O)=O FPCIBLUVDNXPJO-XPUUQOCRSA-N 0.000 description 1
- SRWWRLKBEJZFPW-IHRRRGAJSA-N Val-Cys-Phe Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N SRWWRLKBEJZFPW-IHRRRGAJSA-N 0.000 description 1
- HIZMLPKDJAXDRG-FXQIFTODSA-N Val-Cys-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CO)C(=O)O)N HIZMLPKDJAXDRG-FXQIFTODSA-N 0.000 description 1
- XXDVDTMEVBYRPK-XPUUQOCRSA-N Val-Gln Chemical compound CC(C)[C@H](N)C(=O)N[C@H](C(O)=O)CCC(N)=O XXDVDTMEVBYRPK-XPUUQOCRSA-N 0.000 description 1
- IWZYXFRGWKEKBJ-GVXVVHGQSA-N Val-Gln-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N IWZYXFRGWKEKBJ-GVXVVHGQSA-N 0.000 description 1
- VLDMQVZZWDOKQF-AUTRQRHGSA-N Val-Glu-Gln Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N VLDMQVZZWDOKQF-AUTRQRHGSA-N 0.000 description 1
- ZXAGTABZUOMUDO-GVXVVHGQSA-N Val-Glu-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N ZXAGTABZUOMUDO-GVXVVHGQSA-N 0.000 description 1
- UEHRGZCNLSWGHK-DLOVCJGASA-N Val-Glu-Val Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O UEHRGZCNLSWGHK-DLOVCJGASA-N 0.000 description 1
- MDYSKHBSPXUOPV-JSGCOSHPSA-N Val-Gly-Phe Chemical compound CC(C)[C@@H](C(=O)NCC(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N MDYSKHBSPXUOPV-JSGCOSHPSA-N 0.000 description 1
- KZKMBGXCNLPYKD-YEPSODPASA-N Val-Gly-Thr Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O KZKMBGXCNLPYKD-YEPSODPASA-N 0.000 description 1
- BVWPHWLFGRCECJ-JSGCOSHPSA-N Val-Gly-Tyr Chemical compound CC(C)[C@@H](C(=O)NCC(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N BVWPHWLFGRCECJ-JSGCOSHPSA-N 0.000 description 1
- FEFZWCSXEMVSPO-LSJOCFKGSA-N Val-His-Ala Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](Cc1cnc[nH]1)C(=O)N[C@@H](C)C(O)=O FEFZWCSXEMVSPO-LSJOCFKGSA-N 0.000 description 1
- OACSGBOREVRSME-NHCYSSNCSA-N Val-His-Asn Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](Cc1cnc[nH]1)C(=O)N[C@@H](CC(N)=O)C(O)=O OACSGBOREVRSME-NHCYSSNCSA-N 0.000 description 1
- ZIGZPYJXIWLQFC-QTKMDUPCSA-N Val-His-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](C(C)C)N)O ZIGZPYJXIWLQFC-QTKMDUPCSA-N 0.000 description 1
- XBRMBDFYOFARST-AVGNSLFASA-N Val-His-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](C(C)C)C(=O)O)N XBRMBDFYOFARST-AVGNSLFASA-N 0.000 description 1
- CPGJELLYDQEDRK-NAKRPEOUSA-N Val-Ile-Ala Chemical compound CC[C@H](C)[C@H](NC(=O)[C@@H](N)C(C)C)C(=O)N[C@@H](C)C(O)=O CPGJELLYDQEDRK-NAKRPEOUSA-N 0.000 description 1
- KDKLLPMFFGYQJD-CYDGBPFRSA-N Val-Ile-Arg Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](C(C)C)N KDKLLPMFFGYQJD-CYDGBPFRSA-N 0.000 description 1
- BZMIYHIJVVJPCK-QSFUFRPTSA-N Val-Ile-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](C(C)C)N BZMIYHIJVVJPCK-QSFUFRPTSA-N 0.000 description 1
- KNYHAWKHFQRYOX-PYJNHQTQSA-N Val-Ile-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](C(C)C)N KNYHAWKHFQRYOX-PYJNHQTQSA-N 0.000 description 1
- OTJMMKPMLUNTQT-AVGNSLFASA-N Val-Leu-Arg Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](C(C)C)N OTJMMKPMLUNTQT-AVGNSLFASA-N 0.000 description 1
- AGXGCFSECFQMKB-NHCYSSNCSA-N Val-Leu-Asp Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N AGXGCFSECFQMKB-NHCYSSNCSA-N 0.000 description 1
- RFKJNTRMXGCKFE-FHWLQOOXSA-N Val-Leu-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)C(C)C)CC(C)C)C(O)=O)=CNC2=C1 RFKJNTRMXGCKFE-FHWLQOOXSA-N 0.000 description 1
- XXWBHOWRARMUOC-NHCYSSNCSA-N Val-Lys-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(=O)N)C(=O)O)N XXWBHOWRARMUOC-NHCYSSNCSA-N 0.000 description 1
- WBAJDGWKRIHOAC-GVXVVHGQSA-N Val-Lys-Gln Chemical compound [H]N[C@@H](C(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(O)=O WBAJDGWKRIHOAC-GVXVVHGQSA-N 0.000 description 1
- DIOSYUIWOQCXNR-ONGXEEELSA-N Val-Lys-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O DIOSYUIWOQCXNR-ONGXEEELSA-N 0.000 description 1
- MLADEWAIYAPAAU-IHRRRGAJSA-N Val-Lys-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N MLADEWAIYAPAAU-IHRRRGAJSA-N 0.000 description 1
- ZRSZTKTVPNSUNA-IHRRRGAJSA-N Val-Lys-Leu Chemical compound CC(C)C[C@H](NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)C(C)C)C(O)=O ZRSZTKTVPNSUNA-IHRRRGAJSA-N 0.000 description 1
- IEBGHUMBJXIXHM-AVGNSLFASA-N Val-Lys-Met Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCSC)C(=O)O)N IEBGHUMBJXIXHM-AVGNSLFASA-N 0.000 description 1
- UOUIMEGEPSBZIV-ULQDDVLXSA-N Val-Lys-Tyr Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 UOUIMEGEPSBZIV-ULQDDVLXSA-N 0.000 description 1
- SBJCTAZFSZXWSR-AVGNSLFASA-N Val-Met-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N SBJCTAZFSZXWSR-AVGNSLFASA-N 0.000 description 1
- RQOMPQGUGBILAG-AVGNSLFASA-N Val-Met-Leu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(O)=O RQOMPQGUGBILAG-AVGNSLFASA-N 0.000 description 1
- WSUWDIVCPOJFCX-TUAOUCFPSA-N Val-Met-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N1CCC[C@@H]1C(=O)O)N WSUWDIVCPOJFCX-TUAOUCFPSA-N 0.000 description 1
- YDVDTCJGBBJGRT-GUBZILKMSA-N Val-Met-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CO)C(=O)O)N YDVDTCJGBBJGRT-GUBZILKMSA-N 0.000 description 1
- LJSZPMSUYKKKCP-UBHSHLNASA-N Val-Phe-Ala Chemical compound CC(C)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C)C(O)=O)CC1=CC=CC=C1 LJSZPMSUYKKKCP-UBHSHLNASA-N 0.000 description 1
- VNGKMNPAENRGDC-JYJNAYRXSA-N Val-Phe-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C(C)C)CC1=CC=CC=C1 VNGKMNPAENRGDC-JYJNAYRXSA-N 0.000 description 1
- FMQGYTMERWBMSI-HJWJTTGWSA-N Val-Phe-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](C(C)C)N FMQGYTMERWBMSI-HJWJTTGWSA-N 0.000 description 1
- MHHAWNPHDLCPLF-ULQDDVLXSA-N Val-Phe-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C(C)C)CC1=CC=CC=C1 MHHAWNPHDLCPLF-ULQDDVLXSA-N 0.000 description 1
- BCBFMJYTNKDALA-UFYCRDLUSA-N Val-Phe-Phe Chemical compound N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O BCBFMJYTNKDALA-UFYCRDLUSA-N 0.000 description 1
- YKNOJPJWNVHORX-UNQGMJICSA-N Val-Phe-Thr Chemical compound CC(C)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H]([C@@H](C)O)C(O)=O)CC1=CC=CC=C1 YKNOJPJWNVHORX-UNQGMJICSA-N 0.000 description 1
- XBJKAZATRJBDCU-GUBZILKMSA-N Val-Pro-Ala Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O XBJKAZATRJBDCU-GUBZILKMSA-N 0.000 description 1
- VSCIANXXVZOYOC-AVGNSLFASA-N Val-Pro-His Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N VSCIANXXVZOYOC-AVGNSLFASA-N 0.000 description 1
- NHXZRXLFOBFMDM-AVGNSLFASA-N Val-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)C(C)C NHXZRXLFOBFMDM-AVGNSLFASA-N 0.000 description 1
- QIVPZSWBBHRNBA-JYJNAYRXSA-N Val-Pro-Phe Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](Cc1ccccc1)C(O)=O QIVPZSWBBHRNBA-JYJNAYRXSA-N 0.000 description 1
- DEGUERSKQBRZMZ-FXQIFTODSA-N Val-Ser-Ala Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O DEGUERSKQBRZMZ-FXQIFTODSA-N 0.000 description 1
- VHIZXDZMTDVFGX-DCAQKATOSA-N Val-Ser-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](C(C)C)N VHIZXDZMTDVFGX-DCAQKATOSA-N 0.000 description 1
- PZTZYZUTCPZWJH-FXQIFTODSA-N Val-Ser-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)O)N PZTZYZUTCPZWJH-FXQIFTODSA-N 0.000 description 1
- DLRZGNXCXUGIDG-KKHAAJSZSA-N Val-Thr-Asp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N)O DLRZGNXCXUGIDG-KKHAAJSZSA-N 0.000 description 1
- UQMPYVLTQCGRSK-IFFSRLJSSA-N Val-Thr-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](C(C)C)N)O UQMPYVLTQCGRSK-IFFSRLJSSA-N 0.000 description 1
- TVGWMCTYUFBXAP-QTKMDUPCSA-N Val-Thr-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](C(C)C)N)O TVGWMCTYUFBXAP-QTKMDUPCSA-N 0.000 description 1
- LCHZBEUVGAVMKS-RHYQMDGZSA-N Val-Thr-Leu Chemical compound CC(C)C[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)C(C)C)[C@@H](C)O)C(O)=O LCHZBEUVGAVMKS-RHYQMDGZSA-N 0.000 description 1
- QTXGUIMEHKCPBH-FHWLQOOXSA-N Val-Trp-Lys Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](N)C(C)C)C(=O)N[C@@H](CCCCN)C(O)=O)=CNC2=C1 QTXGUIMEHKCPBH-FHWLQOOXSA-N 0.000 description 1
- MIAZWUMFUURQNP-YDHLFZDLSA-N Val-Tyr-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N MIAZWUMFUURQNP-YDHLFZDLSA-N 0.000 description 1
- PGBMPFKFKXYROZ-UFYCRDLUSA-N Val-Tyr-Phe Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC2=CC=CC=C2)C(=O)O)N PGBMPFKFKXYROZ-UFYCRDLUSA-N 0.000 description 1
- IECQJCJNPJVUSB-IHRRRGAJSA-N Val-Tyr-Ser Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](Cc1ccc(O)cc1)C(=O)N[C@@H](CO)C(O)=O IECQJCJNPJVUSB-IHRRRGAJSA-N 0.000 description 1
- BGTDGENDNWGMDQ-KJEVXHAQSA-N Val-Tyr-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](C(C)C)N)O BGTDGENDNWGMDQ-KJEVXHAQSA-N 0.000 description 1
- OWFGFHQMSBTKLX-UFYCRDLUSA-N Val-Tyr-Tyr Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O)N OWFGFHQMSBTKLX-UFYCRDLUSA-N 0.000 description 1
- RTJPAGFXOWEBAI-SRVKXCTJSA-N Val-Val-Arg Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N RTJPAGFXOWEBAI-SRVKXCTJSA-N 0.000 description 1
- NLNCNKIVJPEFBC-DLOVCJGASA-N Val-Val-Glu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCC(O)=O NLNCNKIVJPEFBC-DLOVCJGASA-N 0.000 description 1
- 229940124937 Vaqta Drugs 0.000 description 1
- 208000036142 Viral infection Diseases 0.000 description 1
- 230000010530 Virus Neutralization Effects 0.000 description 1
- 102000018265 Virus Receptors Human genes 0.000 description 1
- 108010066342 Virus Receptors Proteins 0.000 description 1
- 208000020329 Zika virus infectious disease Diseases 0.000 description 1
- 229940124743 Zika virus vaccine Drugs 0.000 description 1
- 238000009825 accumulation Methods 0.000 description 1
- 230000021736 acetylation Effects 0.000 description 1
- 238000006640 acetylation reaction Methods 0.000 description 1
- 239000013543 active substance Substances 0.000 description 1
- 238000013019 agitation Methods 0.000 description 1
- 108010028939 alanyl-alanyl-lysyl-alanine Proteins 0.000 description 1
- 108010045350 alanyl-tyrosyl-alanine Proteins 0.000 description 1
- 108010070944 alanylhistidine Proteins 0.000 description 1
- 108010070783 alanyltyrosine Proteins 0.000 description 1
- XAGFODPZIPBFFR-UHFFFAOYSA-N aluminium Chemical compound [Al] XAGFODPZIPBFFR-UHFFFAOYSA-N 0.000 description 1
- 229910052782 aluminium Inorganic materials 0.000 description 1
- 230000005875 antibody response Effects 0.000 description 1
- 239000007864 aqueous solution Substances 0.000 description 1
- 108010059459 arginyl-threonyl-phenylalanine Proteins 0.000 description 1
- 108010068380 arginylarginine Proteins 0.000 description 1
- 108010036533 arginylvaline Proteins 0.000 description 1
- 230000001174 ascending effect Effects 0.000 description 1
- 108010010430 asparagine-proline-alanine Proteins 0.000 description 1
- 235000003704 aspartic acid Nutrition 0.000 description 1
- 230000008901 benefit Effects 0.000 description 1
- OQFSQFPPLPISGP-UHFFFAOYSA-N beta-carboxyaspartic acid Natural products OC(=O)C(N)C(C(O)=O)C(O)=O OQFSQFPPLPISGP-UHFFFAOYSA-N 0.000 description 1
- 239000007844 bleaching agent Substances 0.000 description 1
- 230000000740 bleeding effect Effects 0.000 description 1
- 238000004364 calculation method Methods 0.000 description 1
- 239000002775 capsule Substances 0.000 description 1
- 229910052799 carbon Inorganic materials 0.000 description 1
- 239000012534 cell culture medium component Substances 0.000 description 1
- 239000013553 cell monolayer Substances 0.000 description 1
- 230000001413 cellular effect Effects 0.000 description 1
- QUWFSKKBMDKAHK-SBOJBMMISA-A chembl2103793 Chemical compound [Na+].[Na+].[Na+].[Na+].[Na+].[Na+].[Na+].[Na+].[Na+].[Na+].[Na+].[Na+].[Na+].[Na+].[Na+].[Na+].[Na+].[Na+].[Na+].[Na+].[Na+].[Na+].[Na+].O=C1NC(=O)C(C)=CN1[C@@H]1O[C@H](COP([O-])(=S)O[C@@H]2[C@H](O[C@H](C2)N2C(N=C(N)C=C2)=O)COP([O-])(=S)O[C@@H]2[C@H](O[C@H](C2)N2C3=C(C(NC(N)=N3)=O)N=C2)COP([O-])(=S)O[C@@H]2[C@H](O[C@H](C2)N2C(NC(=O)C(C)=C2)=O)COP([O-])(=S)O[C@@H]2[C@H](O[C@H](C2)N2C(N=C(N)C=C2)=O)COP([O-])(=S)O[C@@H]2[C@H](O[C@H](C2)N2C3=C(C(NC(N)=N3)=O)N=C2)COP([O-])(=S)O[C@@H]2[C@H](O[C@H](C2)N2C(NC(=O)C(C)=C2)=O)COP([O-])(=S)O[C@@H]2[C@H](O[C@H](C2)N2C(NC(=O)C(C)=C2)=O)COP([O-])(=S)O[C@@H]2[C@H](O[C@H](C2)N2C(NC(=O)C(C)=C2)=O)COP([O-])(=S)O[C@@H]2[C@H](O[C@H](C2)N2C(NC(=O)C(C)=C2)=O)COP([O-])(=S)O[C@@H]2[C@H](O[C@H](C2)N2C3=C(C(NC(N)=N3)=O)N=C2)COP([O-])(=S)O[C@@H]2[C@H](O[C@H](C2)N2C(NC(=O)C(C)=C2)=O)COP([O-])(=S)O[C@@H]2[C@H](O[C@H](C2)N2C(N=C(N)C=C2)=O)COP([O-])(=S)O[C@@H]2[C@H](O[C@H](C2)N2C3=C(C(NC(N)=N3)=O)N=C2)COP([O-])(=S)O[C@@H]2[C@H](O[C@H](C2)N2C(NC(=O)C(C)=C2)=O)COP([O-])(=S)O[C@@H]2[C@H](O[C@H](C2)N2C(NC(=O)C(C)=C2)=O)COP([O-])(=S)O[C@@H]2[C@H](O[C@H](C2)N2C(NC(=O)C(C)=C2)=O)COP([O-])(=S)O[C@@H]2[C@H](O[C@H](C2)N2C(NC(=O)C(C)=C2)=O)COP([O-])(=S)O[C@@H]2[C@H](O[C@H](C2)N2C3=C(C(NC(N)=N3)=O)N=C2)COP([O-])(=S)O[C@@H]2[C@H](O[C@H](C2)N2C(NC(=O)C(C)=C2)=O)COP([O-])(=S)O[C@@H]2[C@H](O[C@H](C2)N2C(N=C(N)C=C2)=O)COP([O-])(=S)O[C@@H]2[C@H](O[C@H](C2)N2C3=C(C(NC(N)=N3)=O)N=C2)COP([O-])(=S)O[C@@H]2[C@H](O[C@H](C2)N2C(NC(=O)C(C)=C2)=O)COP([O-])(=S)O[C@@H]2[C@H](O[C@H](C2)N2C(NC(=O)C(C)=C2)=O)CO)[C@@H](O)C1 QUWFSKKBMDKAHK-SBOJBMMISA-A 0.000 description 1
- 238000007385 chemical modification Methods 0.000 description 1
- 239000012501 chromatography medium Substances 0.000 description 1
- 238000003776 cleavage reaction Methods 0.000 description 1
- 238000004891 communication Methods 0.000 description 1
- 230000000295 complement effect Effects 0.000 description 1
- 239000002299 complementary DNA Substances 0.000 description 1
- 238000002591 computed tomography Methods 0.000 description 1
- 238000004883 computer application Methods 0.000 description 1
- 238000012790 confirmation Methods 0.000 description 1
- 238000011109 contamination Methods 0.000 description 1
- 230000009260 cross reactivity Effects 0.000 description 1
- 238000012136 culture method Methods 0.000 description 1
- 150000001944 cysteine derivatives Chemical class 0.000 description 1
- LEVWYRKDKASIDU-IMJSIDKUSA-N cystine group Chemical group C([C@@H](C(=O)O)N)SSC[C@@H](C(=O)O)N LEVWYRKDKASIDU-IMJSIDKUSA-N 0.000 description 1
- 230000034994 death Effects 0.000 description 1
- 231100000517 death Toxicity 0.000 description 1
- 230000005860 defense response to virus Effects 0.000 description 1
- 239000007857 degradation product Substances 0.000 description 1
- 239000008367 deionised water Substances 0.000 description 1
- 229910021641 deionized water Inorganic materials 0.000 description 1
- 208000025729 dengue disease Diseases 0.000 description 1
- 229960003964 deoxycholic acid Drugs 0.000 description 1
- 239000003599 detergent Substances 0.000 description 1
- 230000029087 digestion Effects 0.000 description 1
- 108010054812 diprotin A Proteins 0.000 description 1
- 230000005750 disease progression Effects 0.000 description 1
- 125000002228 disulfide group Chemical group 0.000 description 1
- 239000002552 dosage form Substances 0.000 description 1
- 238000000635 electron micrograph Methods 0.000 description 1
- 238000001493 electron microscopy Methods 0.000 description 1
- 238000005516 engineering process Methods 0.000 description 1
- 239000003623 enhancer Substances 0.000 description 1
- 238000007824 enzymatic assay Methods 0.000 description 1
- 230000006862 enzymatic digestion Effects 0.000 description 1
- 238000003114 enzyme-linked immunosorbent spot assay Methods 0.000 description 1
- 230000007717 exclusion Effects 0.000 description 1
- 230000029142 excretion Effects 0.000 description 1
- 108010068404 exorphin B4 Proteins 0.000 description 1
- 239000000706 filtrate Substances 0.000 description 1
- 238000009459 flexible packaging Methods 0.000 description 1
- 235000013922 glutamic acid Nutrition 0.000 description 1
- 239000004220 glutamic acid Substances 0.000 description 1
- 108010040856 glutamyl-cysteinyl-alanine Proteins 0.000 description 1
- 108010073628 glutamyl-valyl-phenylalanine Proteins 0.000 description 1
- 230000013595 glycosylation Effects 0.000 description 1
- 238000006206 glycosylation reaction Methods 0.000 description 1
- 108010090037 glycyl-alanyl-isoleucine Proteins 0.000 description 1
- 108010062266 glycyl-glycyl-argininal Proteins 0.000 description 1
- 108010028188 glycyl-histidyl-serine Proteins 0.000 description 1
- 108010066198 glycyl-leucyl-phenylalanine Proteins 0.000 description 1
- 108010050475 glycyl-leucyl-tyrosine Proteins 0.000 description 1
- 108010074027 glycyl-seryl-phenylalanine Proteins 0.000 description 1
- 108010048994 glycyl-tyrosyl-alanine Proteins 0.000 description 1
- 230000005484 gravity Effects 0.000 description 1
- 244000144993 groups of animals Species 0.000 description 1
- 229960000789 guanidine hydrochloride Drugs 0.000 description 1
- PJJJBBJSCAKJQF-UHFFFAOYSA-N guanidinium chloride Chemical compound [Cl-].NC(N)=[NH2+] PJJJBBJSCAKJQF-UHFFFAOYSA-N 0.000 description 1
- 238000010438 heat treatment Methods 0.000 description 1
- 208000021760 high fever Diseases 0.000 description 1
- 108010041601 histidyl-aspartyl-glutamyl-leucine Proteins 0.000 description 1
- 108010036413 histidylglycine Proteins 0.000 description 1
- 108010085325 histidylproline Proteins 0.000 description 1
- 102000048657 human ACE2 Human genes 0.000 description 1
- 229910052739 hydrogen Inorganic materials 0.000 description 1
- XXSMGPRMXLTPCZ-UHFFFAOYSA-N hydroxychloroquine Chemical compound ClC1=CC=C2C(NC(C)CCCN(CCO)CC)=CC=NC2=C1 XXSMGPRMXLTPCZ-UHFFFAOYSA-N 0.000 description 1
- 229960004171 hydroxychloroquine Drugs 0.000 description 1
- 210000002865 immune cell Anatomy 0.000 description 1
- 238000001727 in vivo Methods 0.000 description 1
- 239000005414 inactive ingredient Substances 0.000 description 1
- 230000006698 induction Effects 0.000 description 1
- 230000008595 infiltration Effects 0.000 description 1
- 238000001764 infiltration Methods 0.000 description 1
- 230000002757 inflammatory effect Effects 0.000 description 1
- 239000004615 ingredient Substances 0.000 description 1
- 230000005764 inhibitory process Effects 0.000 description 1
- 238000002347 injection Methods 0.000 description 1
- 239000007924 injection Substances 0.000 description 1
- 230000003834 intracellular effect Effects 0.000 description 1
- 238000007918 intramuscular administration Methods 0.000 description 1
- 108010060857 isoleucyl-valyl-tyrosine Proteins 0.000 description 1
- 108010051673 leucyl-glycyl-phenylalanine Proteins 0.000 description 1
- 108010000761 leucylarginine Proteins 0.000 description 1
- 108010091871 leucylmethionine Proteins 0.000 description 1
- 150000002632 lipids Chemical class 0.000 description 1
- 238000012006 liquid chromatography with tandem mass spectrometry Methods 0.000 description 1
- 229940124590 live attenuated vaccine Drugs 0.000 description 1
- 229940023012 live-attenuated vaccine Drugs 0.000 description 1
- 238000011068 loading method Methods 0.000 description 1
- 108010053062 lysyl-arginyl-phenylalanyl-lysine Proteins 0.000 description 1
- 108010064499 lysyl-leucyl-lysyl-leucyl-leucyl-leucyl-leucyl-leucyl-lysyl-leucyl-lysine Proteins 0.000 description 1
- 108700021021 mRNA Vaccine Proteins 0.000 description 1
- 238000012423 maintenance Methods 0.000 description 1
- 238000013507 mapping Methods 0.000 description 1
- 238000012768 mass vaccination Methods 0.000 description 1
- 238000013411 master cell bank Methods 0.000 description 1
- 238000005259 measurement Methods 0.000 description 1
- 238000000691 measurement method Methods 0.000 description 1
- 230000007246 mechanism Effects 0.000 description 1
- 230000001404 mediated effect Effects 0.000 description 1
- 230000034217 membrane fusion Effects 0.000 description 1
- MYWUZJCMWCOHBA-VIFPVBQESA-N methamphetamine Chemical compound CN[C@@H](C)CC1=CC=CC=C1 MYWUZJCMWCOHBA-VIFPVBQESA-N 0.000 description 1
- 108010016686 methionyl-alanyl-serine Proteins 0.000 description 1
- 108010056582 methionylglutamic acid Proteins 0.000 description 1
- 229920000609 methyl cellulose Polymers 0.000 description 1
- 239000001923 methylcellulose Substances 0.000 description 1
- 238000005065 mining Methods 0.000 description 1
- 239000013642 negative control Substances 0.000 description 1
- 239000002777 nucleoside Substances 0.000 description 1
- 125000003835 nucleoside group Chemical group 0.000 description 1
- 235000015097 nutrients Nutrition 0.000 description 1
- 239000003960 organic solvent Substances 0.000 description 1
- 230000003647 oxidation Effects 0.000 description 1
- 238000007254 oxidation reaction Methods 0.000 description 1
- TWNQGVIAIRXVLR-UHFFFAOYSA-N oxo(oxoalumanyloxy)alumane Chemical compound O=[Al]O[Al]=O TWNQGVIAIRXVLR-UHFFFAOYSA-N 0.000 description 1
- 230000001717 pathogenic effect Effects 0.000 description 1
- 230000007170 pathology Effects 0.000 description 1
- 230000002572 peristaltic effect Effects 0.000 description 1
- 230000000144 pharmacologic effect Effects 0.000 description 1
- 108010074082 phenylalanyl-alanyl-lysine Proteins 0.000 description 1
- 108010089198 phenylalanyl-prolyl-arginine Proteins 0.000 description 1
- 108010018625 phenylalanylarginine Proteins 0.000 description 1
- 229910052698 phosphorus Inorganic materials 0.000 description 1
- 239000011574 phosphorus Substances 0.000 description 1
- 230000004962 physiological condition Effects 0.000 description 1
- 210000002826 placenta Anatomy 0.000 description 1
- 239000000256 polyoxyethylene sorbitan monolaurate Substances 0.000 description 1
- 235000010486 polyoxyethylene sorbitan monolaurate Nutrition 0.000 description 1
- 229920000136 polysorbate Polymers 0.000 description 1
- 230000003389 potentiating effect Effects 0.000 description 1
- 239000002244 precipitate Substances 0.000 description 1
- 239000003755 preservative agent Substances 0.000 description 1
- 238000011809 primate model Methods 0.000 description 1
- 230000000750 progressive effect Effects 0.000 description 1
- 108010020755 prolyl-glycyl-glycine Proteins 0.000 description 1
- 108010077112 prolyl-proline Proteins 0.000 description 1
- 108010087846 prolyl-prolyl-glycine Proteins 0.000 description 1
- 108010020432 prolyl-prolylisoleucine Proteins 0.000 description 1
- 108010015796 prolylisoleucine Proteins 0.000 description 1
- 239000013639 protein trimer Substances 0.000 description 1
- 230000002685 pulmonary effect Effects 0.000 description 1
- 238000005086 pumping Methods 0.000 description 1
- 230000005855 radiation Effects 0.000 description 1
- 238000011084 recovery Methods 0.000 description 1
- 208000023504 respiratory system disease Diseases 0.000 description 1
- 239000012898 sample dilution Substances 0.000 description 1
- 230000007017 scission Effects 0.000 description 1
- 238000010845 search algorithm Methods 0.000 description 1
- 238000004062 sedimentation Methods 0.000 description 1
- 238000000926 separation method Methods 0.000 description 1
- 238000002864 sequence alignment Methods 0.000 description 1
- 238000012163 sequencing technique Methods 0.000 description 1
- 238000013207 serial dilution Methods 0.000 description 1
- 108010048397 seryl-lysyl-leucine Proteins 0.000 description 1
- 230000005582 sexual transmission Effects 0.000 description 1
- 208000013220 shortness of breath Diseases 0.000 description 1
- 229910052709 silver Inorganic materials 0.000 description 1
- 239000004332 silver Substances 0.000 description 1
- 239000002356 single layer Substances 0.000 description 1
- FHHPUSMSKHSNKW-SMOYURAASA-M sodium deoxycholate Chemical compound [Na+].C([C@H]1CC2)[C@H](O)CC[C@]1(C)[C@@H]1[C@@H]2[C@@H]2CC[C@H]([C@@H](CCC([O-])=O)C)[C@@]2(C)[C@@H](O)C1 FHHPUSMSKHSNKW-SMOYURAASA-M 0.000 description 1
- HRZFUMHJMZEROT-UHFFFAOYSA-L sodium disulfite Chemical compound [Na+].[Na+].[O-]S(=O)S([O-])(=O)=O HRZFUMHJMZEROT-UHFFFAOYSA-L 0.000 description 1
- 229940001584 sodium metabisulfite Drugs 0.000 description 1
- 235000010262 sodium metabisulphite Nutrition 0.000 description 1
- 239000007787 solid Substances 0.000 description 1
- 239000002904 solvent Substances 0.000 description 1
- 238000004611 spectroscopical analysis Methods 0.000 description 1
- 239000003381 stabilizer Substances 0.000 description 1
- 238000012289 standard assay Methods 0.000 description 1
- 230000000638 stimulation Effects 0.000 description 1
- 238000003756 stirring Methods 0.000 description 1
- 239000011550 stock solution Substances 0.000 description 1
- 239000007929 subcutaneous injection Substances 0.000 description 1
- 238000010254 subcutaneous injection Methods 0.000 description 1
- 239000000758 substrate Substances 0.000 description 1
- 229940117986 sulfobetaine Drugs 0.000 description 1
- 229910052717 sulfur Inorganic materials 0.000 description 1
- 230000004083 survival effect Effects 0.000 description 1
- 230000008685 targeting Effects 0.000 description 1
- 238000011287 therapeutic dose Methods 0.000 description 1
- 230000001225 therapeutic effect Effects 0.000 description 1
- 108010071097 threonyl-lysyl-proline Proteins 0.000 description 1
- 238000004448 titration Methods 0.000 description 1
- 230000001988 toxicity Effects 0.000 description 1
- 231100000419 toxicity Toxicity 0.000 description 1
- 238000012546 transfer Methods 0.000 description 1
- 230000009261 transgenic effect Effects 0.000 description 1
- 238000011830 transgenic mouse model Methods 0.000 description 1
- STCOOQWBFONSKY-UHFFFAOYSA-N tributyl phosphate Chemical compound CCCCOP(=O)(OCCCC)OCCCC STCOOQWBFONSKY-UHFFFAOYSA-N 0.000 description 1
- LENZDBCJOHFCAS-UHFFFAOYSA-N tris Chemical compound OCC(N)(CO)CO LENZDBCJOHFCAS-UHFFFAOYSA-N 0.000 description 1
- GPRLSGONYQIRFK-MNYXATJNSA-N triton Chemical compound [3H+] GPRLSGONYQIRFK-MNYXATJNSA-N 0.000 description 1
- 108010084932 tryptophyl-proline Proteins 0.000 description 1
- 108010038745 tryptophylglycine Proteins 0.000 description 1
- OUYCCCASQSFEME-UHFFFAOYSA-N tyrosine Natural products OC(=O)C(N)CC1=CC=C(O)C=C1 OUYCCCASQSFEME-UHFFFAOYSA-N 0.000 description 1
- 235000002374 tyrosine Nutrition 0.000 description 1
- 108010079202 tyrosyl-alanyl-cysteine Proteins 0.000 description 1
- 108010020532 tyrosyl-proline Proteins 0.000 description 1
- 238000000108 ultra-filtration Methods 0.000 description 1
- 238000009281 ultraviolet germicidal irradiation Methods 0.000 description 1
- 241001529453 unidentified herpesvirus Species 0.000 description 1
- 241000712461 unidentified influenza virus Species 0.000 description 1
- IBIDRSSEHFLGSD-UHFFFAOYSA-N valinyl-arginine Natural products CC(C)C(N)C(=O)NC(C(O)=O)CCCN=C(N)N IBIDRSSEHFLGSD-UHFFFAOYSA-N 0.000 description 1
- 108010009962 valyltyrosine Proteins 0.000 description 1
- 238000012795 verification Methods 0.000 description 1
- 230000008478 viral entry into host cell Effects 0.000 description 1
- 230000009385 viral infection Effects 0.000 description 1
- 210000000605 viral structure Anatomy 0.000 description 1
- 239000013603 viral vector Substances 0.000 description 1
- 238000012800 visualization Methods 0.000 description 1
- 238000010792 warming Methods 0.000 description 1
- 239000011534 wash buffer Substances 0.000 description 1
- XLYOFNOQVPJJNP-UHFFFAOYSA-N water Chemical compound O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 description 1
- 229910001868 water Inorganic materials 0.000 description 1
- 239000012224 working solution Substances 0.000 description 1
- 238000002424 x-ray crystallography Methods 0.000 description 1
Images
Classifications
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61K—PREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
- A61K39/00—Medicinal preparations containing antigens or antibodies
- A61K39/12—Viral antigens
- A61K39/215—Coronaviridae, e.g. avian infectious bronchitis virus
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61K—PREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
- A61K39/00—Medicinal preparations containing antigens or antibodies
- A61K39/12—Viral antigens
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61P—SPECIFIC THERAPEUTIC ACTIVITY OF CHEMICAL COMPOUNDS OR MEDICINAL PREPARATIONS
- A61P31/00—Antiinfectives, i.e. antibiotics, antiseptics, chemotherapeutics
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61P—SPECIFIC THERAPEUTIC ACTIVITY OF CHEMICAL COMPOUNDS OR MEDICINAL PREPARATIONS
- A61P31/00—Antiinfectives, i.e. antibiotics, antiseptics, chemotherapeutics
- A61P31/12—Antivirals
- A61P31/14—Antivirals for RNA viruses
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K14/00—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
- C07K14/005—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from viruses
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N7/00—Viruses; Bacteriophages; Compositions thereof; Preparation or purification thereof
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61K—PREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
- A61K39/00—Medicinal preparations containing antigens or antibodies
- A61K2039/51—Medicinal preparations containing antigens or antibodies comprising whole cells, viruses or DNA/RNA
- A61K2039/525—Virus
- A61K2039/5252—Virus inactivated (killed)
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61K—PREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
- A61K39/00—Medicinal preparations containing antigens or antibodies
- A61K2039/545—Medicinal preparations containing antigens or antibodies characterised by the dose, timing or administration schedule
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61K—PREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
- A61K39/00—Medicinal preparations containing antigens or antibodies
- A61K2039/55—Medicinal preparations containing antigens or antibodies characterised by the host/recipient, e.g. newborn with maternal antibodies
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61K—PREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
- A61K39/00—Medicinal preparations containing antigens or antibodies
- A61K2039/555—Medicinal preparations containing antigens or antibodies characterised by a specific combination antigen/adjuvant
- A61K2039/55505—Inorganic adjuvants
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61K—PREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
- A61K39/00—Medicinal preparations containing antigens or antibodies
- A61K2039/555—Medicinal preparations containing antigens or antibodies characterised by a specific combination antigen/adjuvant
- A61K2039/55511—Organic adjuvants
- A61K2039/55555—Liposomes; Vesicles, e.g. nanoparticles; Spheres, e.g. nanospheres; Polymers
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61K—PREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
- A61K39/00—Medicinal preparations containing antigens or antibodies
- A61K2039/555—Medicinal preparations containing antigens or antibodies characterised by a specific combination antigen/adjuvant
- A61K2039/55511—Organic adjuvants
- A61K2039/55561—CpG containing adjuvants; Oligonucleotide containing adjuvants
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61K—PREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
- A61K39/00—Medicinal preparations containing antigens or antibodies
- A61K2039/57—Medicinal preparations containing antigens or antibodies characterised by the type of response, e.g. Th1, Th2
- A61K2039/575—Medicinal preparations containing antigens or antibodies characterised by the type of response, e.g. Th1, Th2 humoral response
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N2770/00—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA ssRNA viruses positive-sense
- C12N2770/00011—Details
- C12N2770/20011—Coronaviridae
- C12N2770/20034—Use of virus or viral component as vaccine, e.g. live-attenuated or inactivated virus, VLP, viral protein
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N2770/00—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA ssRNA viruses positive-sense
- C12N2770/00011—Details
- C12N2770/20011—Coronaviridae
- C12N2770/20061—Methods of inactivation or attenuation
- C12N2770/20063—Methods of inactivation or attenuation by chemical treatment
Landscapes
- Health & Medical Sciences (AREA)
- Life Sciences & Earth Sciences (AREA)
- Chemical & Material Sciences (AREA)
- Virology (AREA)
- Medicinal Chemistry (AREA)
- General Health & Medical Sciences (AREA)
- Organic Chemistry (AREA)
- Public Health (AREA)
- Pharmacology & Pharmacy (AREA)
- Animal Behavior & Ethology (AREA)
- Veterinary Medicine (AREA)
- Communicable Diseases (AREA)
- Microbiology (AREA)
- Immunology (AREA)
- Genetics & Genomics (AREA)
- Oncology (AREA)
- Chemical Kinetics & Catalysis (AREA)
- General Chemical & Material Sciences (AREA)
- Nuclear Medicine, Radiotherapy & Molecular Imaging (AREA)
- Epidemiology (AREA)
- Wood Science & Technology (AREA)
- Mycology (AREA)
- Molecular Biology (AREA)
- Zoology (AREA)
- Engineering & Computer Science (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Biochemistry (AREA)
- Pulmonology (AREA)
- Biomedical Technology (AREA)
- Biotechnology (AREA)
- General Engineering & Computer Science (AREA)
- Gastroenterology & Hepatology (AREA)
- Proteomics, Peptides & Aminoacids (AREA)
- Biophysics (AREA)
- Medicines Containing Antibodies Or Antigens For Use As Internal Diagnostic Agents (AREA)
- Micro-Organisms Or Cultivation Processes Thereof (AREA)
- Medicinal Preparation (AREA)
- Peptides Or Proteins (AREA)
Abstract
SARS-CoV-2 백신과 조성물 및 상기 백신을 생성하고 이를 필요로 하는 대상체에게 투여하는 방법이 본 명세서에 개시되어 있다.
Description
개시내용은 SARS-CoV-2 백신 및 상기 백신을 생성하고 항-SARS-CoV-2 면역 반응의 생성을 위해 대상체에게 백신을 투여하기 위한 조성물 및 방법에 관한 것이다.
SARS-CoV-2(이하 "바이러스")는 2019년 11월경 중국에서 처음으로 검출되었다. 그런 다음 이 바이러스는 세계적인 대유행을 일으켰다. 천연 보유숙주는 박쥐이고 바이러스는 코로나바이러스과 패밀리인, 베타코로나바이러스(betaCoV) 속에 속한다. 바이러스는 9,860개 아미노산, 25개 비-구조 단백질 및 4개 구조 단백질을 인코딩하는 29,903bp(Wuhan-Hu-1: GenBank 참조 서열: NC_045512.2)인 ssRNA 게놈: 스파이크(S), 엔벨로프(E), 막(M), 뉴클레오캡사이드(N)를 가지고 있다. 바이러스는 직경이 60 내지 140nm 사이의 다양한 크기를 가지고 있다. 그것은 엔벨롭되고 UV, 열 및 지질 용매에 민감하다. 그것은 박쥐 SARS-유사-CoVZXC21과 89% 뉴클레오티드 동일성을 갖고 인간 SARS-CoV와 81% 뉴클레오티드 동일성을 가지고 있다. 증거에 따르면 이 바이러스는 감염된 사람이 기침할 때 - 바이러스로 팩킹된 - 작은 비말을 공기 중으로 퍼트린다. 이들은 숨을 들이쉬거나 닿은 표면을 만진 다음 눈, 코 또는 입을 만지면 감염을 일으킬 수 있다. 부가하여, 다른 병원 매개체가 존재할 수 있고 바이러스는 수혈에 의해, 태반으로 그리고 성적 전달을 통해 전달될 수 있다. SARS-CoV-2 바이러스 감염의 증상은 경미하고 전형적으로 발열과 기침을 포함하지만 무증상이거나 다른 극단적인 경우 치명적일 수도 있다. 주요 증상은 일반적으로 고열, 기침 및 호흡 곤란이다. 현재로 이 바이러스에 대한 특정 치료법이나 백신은 없고 유일한 예방 방법은 사회적 거리두기를 포함한다. SARS-CoV-2는 상당한 공중 보건 위협을 나타낸다. Imperial College COVID-19(SARS-CoV-2로 인한 질환) 대응팀은 2020년 3월 16일에 보고서를 발표하였으며 여기서 그들은 궁극적으로 의료 시스템의 붕괴와 영국에서만 수십만 명의 사망을 초래한 바이러스의 확산을 막거나 지연시키기 위해 가능한 모든 방법을 평가했다. 그들은 인구 전체의 사회적 거리두기만이 효과를 관리가능한 수준으로 줄일 수 있다고 언급했다. 이들 조치는 백신이 이용가능할 때까지 유지되어야 한다. 이 권장사항은 대부분의 인구가 최소 18개월 동안 격리됨을 의미한다. 그들은 대량-생산가능한 백신이 노인 인구를 기꺼이 희생하는 것 외에 이 대유행을 중단하는 유일한 옵션이라고 결론지었다. 극적인 상황을 고려할 때, SARS-CoV-2에 대한 효과적인 백신이 가능한 한 빨리 절대적인 긴급한 필요성이 있다. 더욱이, 상황을 더욱 악화시키는 다양한 탈출 돌연변이(예를 들어, UK_B.1.1.7; South African_B.1.351; Brazil_P.1 변종, 및 캘리포니아 변종 B.1.427 및 B.1.429; 또한 도 2 참조)가 출현하여 상황을 더욱 악화시켰고 따라서 이 불행한 전개도 해결되어야할 필요가 있다.
발명의 요약
따라서, 본 발명은 불활성화된 SARS-CoV-2 백신을 제공한다. SARS-CoV-2 백신을 개발하기 위해 전 세계의 연구 그룹에 의해 이미 광범위한 노력을 투자했지만, 대부분의 접근법은 아단위 백신(예를 들어, SARS-CoV-2 S 단백질 또는 이의 단편을 인코딩함), 약독화 생백신 또는 바이러스 단백질을 인코딩하는 재조합 DNA 또는 RNA 백신에 초점을 맞추었다. 그러나, 전체 바이러스, 불활성화된 백신 접근법에 대한 관심은 거의 없었고, 성공적인 불활성화된 SARS-CoV-2 백신은 아직 완전히 개발되지 않았다. 불활성화된 백신 접근법이 고려되는 한, 표준 조건 하에서 전형적인 불활성화제(예를 들어, 포름알데히드) 및 어쥬번트(예를 들어, 명반)의 사용은 효과적인 백신 후보의 개발을 방해하는 결점을 가질 수 있다. 더욱이, 그러한 백신 후보는 SARS-CoV-2 성분에 대한 과민성 반응으로 인해 가능하기로는 SARS-CoV-2 질환 및/또는 Th2 유형 면역병리의 항체-의존성 증진(ADE)을 초래할 수 있는 위험이 있다. 본 발명은 이들 문제를 해결하고 따라서 종래 기술의 단점을 극복한 안전하고 효과적인 전체 바이러스, 불활성화된 SARS-CoV-2 백신을 생성하는 것을 목적으로 한다.
따라서, 일 양태에서 본 발명은 베타-프로피오락톤-불활성화된 SARS-CoV-2 입자를 포함하는 SARS-CoV-2 백신을 제공하며; 여기서 백신은 인간 대상체에서 천연 SARS-CoV-2 입자에 대한 중화 항체를 생성할 수 있다. 바람직하게는 SARS-CoV-2 입자의 천연 표면 형태는 백신에서 보존된다.
또 다른 양태에서 본 발명은 불활성화된 SARS-CoV-2 입자를 포함하는 SARS-CoV-2 백신을 제공하며; 여기서 SARS-CoV-2 입자의 천연 표면 형태는 백신에서 보존되어 백신이 천연 SARS-CoV-2 입자에 대한 중화 항체 및/또는 인간 대상체에서 상기 백신접종된 인간 대상체의 50% 초과, 바람직하게는 60% 초과, 70% 초과, 80% 초과, 90% 초과를 부분적으로 또는 완전히 보호할 수 있는 기타 면역학적 반응을 생성할 수 있다.
특히, 본 발명은 인간 세포의 복제 및 감염이 불가능하지만 바이러스 표면 단백질의 면역원성 에피토프를 보유하고 따라서 백신접종된 대상체에서 보호적 면역을 생성하기에 적합한 최적으로 불활성화된 SARS-CoV-2 입자를 제공하는 것을 목적으로 한다. 적절한 어쥬번트의 선택을 포함하여 백신의 생성에서 불활성화 과정 및 기타 단계를 최적화함에 의해, SARS-CoV-2 입자의 천연 표면 형태를 보존하고 ADE 및 면역병리학과 같은 음성 효과의 위험을 줄이는 신규한 백신 조성물을 얻을 수 있다. 이러한 백신 조성물은 하기에 보다 상세하게 기재되어 있다.
추가의 특정 실시형태에서, 발명은 인간 세포의 복제 및 감염이 불가능하지만 바이러스 표면 단백질의 면역원성 에피토프를 보유하고 따라서 백신접종된 대상체에서 보호적 면역을 생성하기에 적합한 최적으로 불활성화된 상이한 SARS-CoV-2 입자의 최적 조합을 제공하는 것을 목적으로 한다. 상이하고 최적으로 불활성화된 SARS-CoV-2 입자의 최적 조합에 의해, 천연 SARS-CoV-2 입자에 대한 중화 항체 및/또는 인간 대상체에서 상기 백신접종된 인간 대상체의 50% 초과, 바람직하게는 60% 초과, 70% 초과, 80% 초과, 90% 초과를 부분적으로 또는 완전히 보호할 수 있는 기타 면역학적 반응을 생성할 수 있는 개선된 백신 조성물을 얻을 수 있다.
발명의 각각의 제한은 발명의 다양한 실시형태를 포괄할 수 있다. 따라서 임의의 하나의 요소 또는 요소들의 조합을 포함하는 발명의 각각의 제한이 발명의 각각의 양태에 포함될 수 있음이 예상된다. 본 발명은 다음의 설명에 제시되거나 도면에 예시된 구성요소의 배열 및 구성의 세부사항에 대한 적용에 제한되지 않는다. 발명은 다른 실시형태가 가능하고 다양한 방식으로 실시되거나 수행될 수 있다.
첨부되는 도면은 축척에 맞게 그려진 것으로 의도되지 않는다. 도면은 단지 예시적이고 개시내용을 가능하게 하기 위해 요구되지는 않는다. 명확성을 위해 모든 구성요소가 모든 도면에 표지되지 않을 수 있다. 도면에서:
도 1. 본 발명의 불활성화된 SARS-CoV-2 백신의 생성에 대한 공정. 단계는 Vero 숙주 세포의 세포 증강, SARS-CoV-2로 Vero 세포의 감염, 바이러스 수확, DNA 감소, 1차 불활성화, 정제, 선택적 2차 불활성화 및 어쥬번트로 제형화를 포함한다.
도 2. SARS-CoV-2 전염병의 경과 동안, UK B 1.1.7, 브라질 P1, 캘리포니아 B.1.427/B.1.429 및 남아프리카 B.1.351 계통과 같은 최근의 새로운 변이체 또는 계통을 포함하여 전 세계로부터의 단리물로부터 SARS-CoV-2 게놈 서열이 보고되었다. 완전한 SARS-CoV-2 게놈 서열의 수탁 번호와 기원은, 이용가능한 경우(- 또는 항목 없음 = 이용 불가), 상응하는 orf1ab 다단백질 및 S 단백질에 대한 수탁 번호와 함께 표의 형식으로 제공된다.
도 3. 발명의 SARS-CoV-2 백신에 대한 폴리싱 단계로 사용되는 수크로스 구배 원심분리를 위한 바람직한 설정.
도 4. SARS-CoV-2 백신에 반응한 총 IgG. 코팅 항원: S1(A), 스파이크 단백질(B) 및 핵단백질(C)의 수용체 결합 도메인. 평가변수 역가: 컷-오프로 사용된 3-배 블랭크의 흡광도(점선).
도 5. 명반으로 아쥬번트된 SARS-CoV-2 백신에 반응한 IgG1 및 IgG2a 역가. S1 단백질에 특이적인 항체 역가는 ELISA에 의해 결정되었다. 농도는 mAb 서브클래스 표준 곡선과의 비교에 의해 결정되었다.
도 6. 생성 공정은 고밀도이고 온전한 스파이크 단백질을 제공한다. 실시예 1에 따라 생성된 SARS-CoV-2 불활성화된 약물 물질의 전자 현미경 사진이 도시된다. AU당 약 1-1.5 107 바이러스 입자.
도 7. SARS-CoV-2 및 JEV 약물 물질의 크기-배제-크로마토그래피 및 SDS-PAGE 프로필의 비교. SDS-PAGE에 따른 고순도(>95%)(은색 얼룩, 환원됨) 및 SE-HPLC에 따른 단량체 바이러스(>95%). 바이러스 입자 크기에 기인한 머무름 시간에서의 차이(JEV(IXIARO) 약 50nm, SARS-CoV2 약 100nm)
도 8. NHP 챌린지 연구를 위한 연구 설계. 각각 8마리 동물의 3개 그룹; SARS-CoV-2 백신에 대한 2개 용량 그룹(10 AU & 40 AU, 투여 직전에 추가된 용량당 0.5mg/용량 Al3+ 및 1mg Th1-자극 어쥬번트로 제형화됨) 및 위약 그룹(DPBS). SARS-CoV-2 챌린지 균주는 BetaCoV/France/IDF/0372/2020이다(Maisonmasse et al., Hydroxychloroquine use against SARS-CoV-2 infection in non-human primates, 2020, Nature 585:584-587). 시험 방법 및 시기: d-28, d0, d7, d14, d21, d28, d35, d47, d49, d50, d51, d54, d62에 혈액학. d-28, d0, d14, d21, d28, d35, d47, d54, d62에 Ab 반응(ELISA, IFA). d-28, d0, d14, d35, d54, d62에 T 세포 반응(ICS, ELISPOT). d47, d49, d50, d51, d54, d62에 사이토카인 반응(LUMINEX). SWABS(바이러스 부하(qRT-PCR-게놈 + 서브게놈): d35, d49, d50, d51, d54, d57, d62에 비강 & 기관 면봉; 기준선 및 d2, d7, d15에 직장 면봉. BAL 바이러스 부하(qRT-PCR-게놈 + 서브게놈): d50. 안락사: 폐 수확, 바이러스 부하(qRT-PCR - 게놈 + 서브게놈): d54, d62. CT 스캔: d35, d50, d57.
도 9. 33개의 중화 mAb 또는 각각 클러스터 13, 4, 10, 2, 1, 3의 풋프린트 내 잔기 수. 중화 mAb의 풋프린트 내 및/또는 B.1.1.7, B.1.351 또는 P.1에 대한 돌연변이 위치("x"로 표시)를 정의하는 계통인 잔기가 나열된다. 예를 들어 S-단백질에서 아미노산 위치인 K417 및 E484 돌연변이는 남아프리카 및 브라질 계통에서만 발견된다.
도 10. 실시예 1(iCELLIS 500 생물반응기, 프로타민 설페이트 침전, BPL 불활성화)에 따른 SARS-CoV-2 후보의 2개 샘플의 SDS-PAGE, 은 염색. 밴드는 분명히 3가지 주요 바이러스 단백질(스파이크-단백질, 막-단백질, 핵단백질)뿐만 아니라 숙주 시스템으로부터 배경 단백질에 기인할 수 있다.
도 11. 영국 공중보건국(PHE)으로부터 균주 UK MIG457(B.1.1.7 계통) 및 균주 SA_P2(B.1.351 계통)의 스파이크 단백질 내 SARS-CoV-2 돌연변이.
도 1. 본 발명의 불활성화된 SARS-CoV-2 백신의 생성에 대한 공정. 단계는 Vero 숙주 세포의 세포 증강, SARS-CoV-2로 Vero 세포의 감염, 바이러스 수확, DNA 감소, 1차 불활성화, 정제, 선택적 2차 불활성화 및 어쥬번트로 제형화를 포함한다.
도 2. SARS-CoV-2 전염병의 경과 동안, UK B 1.1.7, 브라질 P1, 캘리포니아 B.1.427/B.1.429 및 남아프리카 B.1.351 계통과 같은 최근의 새로운 변이체 또는 계통을 포함하여 전 세계로부터의 단리물로부터 SARS-CoV-2 게놈 서열이 보고되었다. 완전한 SARS-CoV-2 게놈 서열의 수탁 번호와 기원은, 이용가능한 경우(- 또는 항목 없음 = 이용 불가), 상응하는 orf1ab 다단백질 및 S 단백질에 대한 수탁 번호와 함께 표의 형식으로 제공된다.
도 3. 발명의 SARS-CoV-2 백신에 대한 폴리싱 단계로 사용되는 수크로스 구배 원심분리를 위한 바람직한 설정.
도 4. SARS-CoV-2 백신에 반응한 총 IgG. 코팅 항원: S1(A), 스파이크 단백질(B) 및 핵단백질(C)의 수용체 결합 도메인. 평가변수 역가: 컷-오프로 사용된 3-배 블랭크의 흡광도(점선).
도 5. 명반으로 아쥬번트된 SARS-CoV-2 백신에 반응한 IgG1 및 IgG2a 역가. S1 단백질에 특이적인 항체 역가는 ELISA에 의해 결정되었다. 농도는 mAb 서브클래스 표준 곡선과의 비교에 의해 결정되었다.
도 6. 생성 공정은 고밀도이고 온전한 스파이크 단백질을 제공한다. 실시예 1에 따라 생성된 SARS-CoV-2 불활성화된 약물 물질의 전자 현미경 사진이 도시된다. AU당 약 1-1.5 107 바이러스 입자.
도 7. SARS-CoV-2 및 JEV 약물 물질의 크기-배제-크로마토그래피 및 SDS-PAGE 프로필의 비교. SDS-PAGE에 따른 고순도(>95%)(은색 얼룩, 환원됨) 및 SE-HPLC에 따른 단량체 바이러스(>95%). 바이러스 입자 크기에 기인한 머무름 시간에서의 차이(JEV(IXIARO) 약 50nm, SARS-CoV2 약 100nm)
도 8. NHP 챌린지 연구를 위한 연구 설계. 각각 8마리 동물의 3개 그룹; SARS-CoV-2 백신에 대한 2개 용량 그룹(10 AU & 40 AU, 투여 직전에 추가된 용량당 0.5mg/용량 Al3+ 및 1mg Th1-자극 어쥬번트로 제형화됨) 및 위약 그룹(DPBS). SARS-CoV-2 챌린지 균주는 BetaCoV/France/IDF/0372/2020이다(Maisonmasse et al., Hydroxychloroquine use against SARS-CoV-2 infection in non-human primates, 2020, Nature 585:584-587). 시험 방법 및 시기: d-28, d0, d7, d14, d21, d28, d35, d47, d49, d50, d51, d54, d62에 혈액학. d-28, d0, d14, d21, d28, d35, d47, d54, d62에 Ab 반응(ELISA, IFA). d-28, d0, d14, d35, d54, d62에 T 세포 반응(ICS, ELISPOT). d47, d49, d50, d51, d54, d62에 사이토카인 반응(LUMINEX). SWABS(바이러스 부하(qRT-PCR-게놈 + 서브게놈): d35, d49, d50, d51, d54, d57, d62에 비강 & 기관 면봉; 기준선 및 d2, d7, d15에 직장 면봉. BAL 바이러스 부하(qRT-PCR-게놈 + 서브게놈): d50. 안락사: 폐 수확, 바이러스 부하(qRT-PCR - 게놈 + 서브게놈): d54, d62. CT 스캔: d35, d50, d57.
도 9. 33개의 중화 mAb 또는 각각 클러스터 13, 4, 10, 2, 1, 3의 풋프린트 내 잔기 수. 중화 mAb의 풋프린트 내 및/또는 B.1.1.7, B.1.351 또는 P.1에 대한 돌연변이 위치("x"로 표시)를 정의하는 계통인 잔기가 나열된다. 예를 들어 S-단백질에서 아미노산 위치인 K417 및 E484 돌연변이는 남아프리카 및 브라질 계통에서만 발견된다.
도 10. 실시예 1(iCELLIS 500 생물반응기, 프로타민 설페이트 침전, BPL 불활성화)에 따른 SARS-CoV-2 후보의 2개 샘플의 SDS-PAGE, 은 염색. 밴드는 분명히 3가지 주요 바이러스 단백질(스파이크-단백질, 막-단백질, 핵단백질)뿐만 아니라 숙주 시스템으로부터 배경 단백질에 기인할 수 있다.
도 11. 영국 공중보건국(PHE)으로부터 균주 UK MIG457(B.1.1.7 계통) 및 균주 SA_P2(B.1.351 계통)의 스파이크 단백질 내 SARS-CoV-2 돌연변이.
본 발명의 실시형태는 SARS-CoV-2 백신 또는 불활성화된 SARS-CoV-2 입자를 포함하는 면역원성 조성물에 대한 것이다. 전형적으로, 불활성화된 SARS-CoV-2 입자는 전체 바이러스, 불활성화된 입자이고, 즉 불활성화된 바이러스 입자는 불활성화된 전체 천연 SARS-CoV-2 입자에서 유래된다. 본 명세서에 사용된 "SARS-CoV-2"는 SARS-CoV-2 바이러스를 지칭하고 "SARS-CoV-2 입자"는 전형적으로 전체 SARS-CoV-2 바이러스 입자, 즉 비리온을 지칭한다.
본 발명의 일부 실시형태에서, SARS-CoV-2 입자는 그의 표면 구조를 실질적으로 변형시키지 않고 불활성화된다. 환언하면, SARS-CoV-2 입자의 천연 표면 형태는 불활성화된 바이러스 입자에 유지된다. 불활성화 과정을 최적화함에 의해, 예를 들어, 베타-프로피오락톤을 사용함에 의해, 천연 SARS-CoV-2 입자의 감염성은 그의 항원성 및/또는 면역원성에 부정적으로 영향을 미치지 않으면서 실질적으로 폐기될 수 있다는 것이 놀랍게도 발견되었다. 따라서, 본 발명은 일 양태에서 SARS-CoV-2 감염에 대한 중화 항체 및/또는 보호 면역을 생성하는 불활성화된 바이러스 백신(예를 들어, 베타-프로피오락톤-불활성화된 바이러스 백신)을 제공한다.
일 실시형태에서, SARS-CoV-2 입자는 바이러스 RNA를 우선적으로 표적화하는 방법에 의해 불활성화된다. 이것은 예를 들어 불활성화 단계는 바이러스 단백질보다 바이러스 RNA를 더 많이 변형한다는 것을 의미한다. 따라서, 불활성화된 SARS-CoV-2 입자는 복제-결함 바이러스 RNA를 포함할 수 있으며, 즉 바이러스 RNA는 불활성화된 입자가 복제할 수 없도록 불활성화 단계에서 변형된다. 바이러스 RNA를 우선적으로 표적화하는 불활성화 방법을 이용함에 의해, 본 발명은 유리하게는 바이러스 표면 단백질에서 면역원성 에피토프의 보존을 허용한다.
바람직하게는, 불활성화 방법은 바이러스 RNA에 비해 바이러스(표면) 단백질을 스페어(spare)하고, 예를 들어 바이러스 표면 단백질(예를 들어, 스파이크(S) 단백질)은 바이러스 RNA와 비교하여 불활성화 단계로부터 기인하는 더 적거나 더 드문 변형을 포함할 수 있다. 예로써, 바이러스 표면 단백질(예를 들어, S 단백질)에서 아미노산 잔기의 더 낮은 비율은 바이러스 RNA에서 변형된 뉴클레오티드 잔기의 비율과 비교하여 불활성화 단계에 의해 변형될 수 있다. 일부 실시형태에서, 바이러스 표면 단백질(예를 들어, S 단백질)에서 변형된 아미노산 잔기의 비율은 바이러스 RNA에서 변형된 뉴클레오티드 잔기의 비율보다 적어도 5%, 10%, 20%, 30%, 50%, 70% 또는 90% 낮을 수 있다. "변형" 또는 "변형된 잔기"는 천연 SARS-CoV-2 입자에 존재하지 않는 비-천연 잔기, 예를 들어 불활성화 단계에 기인한 이러한 잔기의 화학적(공유) 변형을 지칭하는 것으로 의미된다.
일 실시형태에서, 바이러스 RNA는 알킬화 및/또는 아실화에 의해 불활성화되고, 즉 SARS-CoV-2 불활성화된 입자에서 변형은 알킬화 및/또는 아실화된 뉴클레오티드 잔기를 포함한다. 일부 실시형태에서, 변형은 퓨린(특히 구아닌) 잔기에 우선적으로 표적화되고, 예를 들어, SARS-CoV-2 불활성화된 입자는 하나 이상의 변형(예를 들어, 알킬화 또는 아실화)된 구아닌 잔기를 포함한다. 일부 경우에, 불활성화 단계는, 예를 들어, 바이러스 RNA에서 구아닌 잔기를 통해 바이러스 RNA와 바이러스 단백질의 가교로 이어질 수 있다. 불활성화 단계는 또한, 예를 들어, 바이러스 게놈의 단편화를 초래하는, 바이러스 RNA에 닉 또는 가닥 파손을 도입할 수 있다.
적합한 알킬화제 및/또는 아실화제는 당업계에 공지되어 있다. 일 실시형태에서, 불활성화제는 베타-프로피오락톤을 포함하고, 즉 백신은 베타-프로피오락톤-불활성화된 바이러스 입자를 포함한다. 임의의 경우에 있어, 특정 실시형태에서, 베타-프로피오락톤(본 명세서에서 "BPL"로도 지칭됨) 처리는 실질적으로 불활성이지만, 천연 SARS-CoV-2에 존재하는 중화 에피토프에 대해 높은 항원성과 면역원성을 유지하는 SARS-CoV-2 입자를 초래하기 때문에 본 발명에 따라 특히 바람직하다. 특히, 베타-프로피오락톤이 최소한의 단백질 변형으로 SARS-CoV-2 입자를 불활성화하는 데 사용될 수 있다는 것이 놀랍게도 발견되었다. 예로써, 하기 실시예 6 및 7에서 입증된 바와 같이, 베타-프로피오락톤을 사용한 SARS-CoV-2 입자의 불활성화는 베타-프로피오락톤에 의한 인플루엔자 입자의 불활성화와 비교하여 훨씬 더 적은 수의 바이러스 단백질의 변형을 초래한다. 따라서 베타-프로피오락톤-불활성화된 SARS-CoV-2 입자에서 바이러스 입자의 천연 표면 형태가 보존될 수 있다.
발명의 바람직한 실시형태에서, 바이러스 RNA는 최적화된 방식으로 불활성화되는데, 즉 더 이상 감염되지 않을 정도로 충분히 불활성화되지만 "과도하게"-불활성화되지 않아 특히 S-단백질에서 상이한 아미노산에서의 다수의 변형이 일어나도록 한다. 추가의 더욱 바람직한 실시형태에서, BPL 불활성화는 SARS-CoV-2 바이러스를 충분히 불활성화시킬 뿐만 아니라(과도하게-불활성화시키지는 않음) 제조 공정에서 공동-농후화되고 공동-배양될 수 있는 바이러스를 아주 충분하게 불활성화시킨다(예를 들어, 실험 부분 참조). 공동-배양되고 공동-농후화될 수 있는 불활성화하기 특히 어려운 바이러스는 PPV(돼지 파보바이러스)이다 - 실험 부분 참조.
불활성화 단계에서 베타-프로피오락톤의 농도는 바이러스에서 표면 단백질의 형태를 보존하면서 바이러스 복제의 완전한 억제를 보장하도록 최적화될 수 있다. 예를 들어, 불활성화 단계에서 베타-프로피오락톤의 농도는 예를 들어 0.01 내지 1 중량%, 바람직하게는 0.01 내지 0.1 중량%, 더욱 바람직하게는 약 0.03 중량%일 수 있다. BPL의 바람직한 양은 SARS-CoV-2 바이러스 뿐만 아니라 다른 관련 바이러스/불순물이 불활성화되는 반면 S-단백질의 아미노산 대부분이 보존되는 경우(즉, 변형되지 않음) 500ppm인것으로 밝혀졌다(즉, 적은 수의 아미노산만이 낮은 확률로 변형되는 것으로 나타났다).
일부 실시형태에서, 천연 SARS-CoV-2 입자는 적어도 5시간, 적어도 10시간, 적어도 24시간 또는 적어도 4일, 예를 들어 5 내지 24시간 또는 그 이상 예컨대 48시간 동안 베타-프로피오락톤과 접촉될 수 있다. 불활성화 단계는 약 0℃ 내지 약 25℃, 바람직하게는 약 4℃ 또는 약 22℃, 또는 예를 들어 약 18 내지 24℃에서 수행될 수 있다. 일 실시형태에서, 불활성화 단계(예를 들어, 베타-프로피오락톤 사용)는 2℃ 내지 8℃에서 24시간 동안 수행된다. 불활성화 단계는 선택적으로 그리고 바람직하게는 당업계에 공지된 바와 같이 불활성화제의 가수분해 단계가 뒤따를 수 있다(이는 베타-프로피오락톤에 대해 예를 들어 약 37℃+/- 2℃에서 2.5시간 +/- 0.5시간의 총 시간 동안 수행될 수 있다). 전형적으로, 불활성화 단계에서 더 긴 인큐베이션 시간 및/또는 더 높은 온도는 바이러스 불활성화를 향상시킬 수 있지만, 또한 바이러스 입자의 바람직하지 않은 표면 변형의 증가된 위험을 초래하여 면역원성을 감소시킬 수 있다. 따라서, 불활성화 단계는 예를 들어 완전히 불활성화된 바이러스 입자를 생성하기 위해 필요한 최단 시간 동안 수행될 수 있다. 가수분해의 완료 후, 불활성화된 바이러스 용액은 일 실시형태에서 즉시 5±3℃로 냉각되었고 불활성화가 대용량 플라크 검정 및 일련의 계대 검정에 의해 확인될 때까지 거기에 보관되었다.
SARS-CoV-2 입자의 베타-프로피오락톤 불활성화는 시스테인, 메티오닌 및/또는 히스티딘 잔기를 우선적으로 변형할 수 있다. 따라서 일부 실시형태에서, 불활성화된 SARS-CoV-2 입자는 하나 이상의 베타-프로피오락톤-변형된 시스테인, 메티오닌 및/또는 히스티딘 잔기를 포함한다. 그러나, 본 발명의 실시형태에서, 베타-프로피오락톤-불활성화된 SARS-CoV-2 입자는 비교적 적은 단백질 변형을 나타낸다. 따라서, 예를 들어, 백신에서 불활성화된 SARS-CoV-2 입자는 200, 100, 50, 30, 20, 15, 10, 9, 8, 7 또는 6개 미만의 베타-프로피오락톤-변형된 아미노산 잔기를 포함할 수 있다. 바람직하게는 불활성화된 SARS-CoV-2 입자의 스파이크(S) 단백질은 100, 50, 30, 20, 15, 10, 9, 8, 7 또는 6개 미만의 베타-프로피오락톤-변형된 아미노산 잔기를 포함한다. 더욱 바람직하게는 불활성화된 SARS-CoV-2 입자 또는 이의 스파이크 단백질은 20개 이하, 15개 이하, 10개 이하, 또는 5개 이하의 베타-프로피오락톤-변형된 아미노산 잔기를 포함한다. 가장 바람직하게는 불활성화된 SARS-CoV-2 입자 또는 이의 스파이크 단백질은 1 내지 100, 2 내지 70, 3 내지 50, 4 내지 30, 5 내지 25, 5 내지 20, 10 내지 20 또는 약 15개의 베타-프로피오락톤-변형된 아미노산 잔기를 포함한다.
또 다른 실시형태에서, SARS-CoV-2 폴리펩티드의 20%, 15%, 10%, 5% 또는 4% 미만이 베타-프로피오락톤-변형된다. 예로써, 입자 내 SARS-CoV-2 폴리펩티드의 0.1 내지 10%, 1 내지 8%, 2 내지 7% 또는 약 3%, 4%, 5% 또는 6%가 베타-프로피오락톤-변형될 수 있다. 백신에서 잔기 및/또는 폴리펩티드의 베타-프로피오락톤 변형은 질량 분석법, 예를 들어, 예로써 실시예 6 및 7에 기술된 방법을 사용하여 탠덤 질량 분석법(LC-MS-MS)과 함께 액체 크로마토그래피를 사용함에 의해 검출될 수 있다. 이러한 방법에서, SARS-CoV-2 입자는 LC-MS-MS 분석을 위해 단백질을 SARS-CoV-2 폴리펩티드로 단편화하기 위해 단리될 수 있다. 단리 단계는 임의의 적합한 효소 또는 효소의 조합에 의해, 예를 들어, 트립신, 키모트립신 및/또는 PNGase F(펩티드:N-글리코시다제 F)에 의해, 또는 예를 들어 산 가수분해에 의해 수행될 수 있다. 바람직하게는 효소 분해 또는 산 가수분해에 이은 LC-MS-MS에 의해 검출된 BPL-변형된 폴리펩티드의 백분율은 다음과 같다: (a) 트립신 단리, 1 내지 5%, 2 내지 4% 또는 약 3%; (b) 트립신 + PNGase F 단리, 1 내지 5%, 2 내지 4% 또는 약 3%; (c) 키모트립신, 1 내지 10%, 3 내지 8% 또는 약 6%; (d) 산 가수분해, 1 내지 6%, 2 내지 5% 또는 약 4%. 이 맥락에서, "베타-프로피오락톤-변형된" 폴리펩티드는 폴리펩티드가 적어도 하나의 베타-프로피오락톤 변형, 예를 들어 적어도 하나의 베타-프로피오락톤-변형된 잔기를 포함한다는 것을 의미한다.
일부 실시형태에서, 불활성화된 SARS-CoV-2 입자의 스파이크(S) 단백질은 다음의 잔기 중 하나 이상에서 베타-프로피오락톤 변형을 포함한다: 예를 들어, 서열번호: 3에서 49, 146, 166, 177, 207, 245, 379, 432, 519, 625, 1029, 1032, 1058, 1083, 1088, 1101, 1159 및/또는 1271, 또는 서열번호: 19, 21, 23, 25 또는 27에서 상응하는 위치. 바람직하게는 불활성화된 SARS-CoV-2 입자는 다음의 잔기 중 하나 이상에서 베타-프로피오락톤 변형을 포함한다: 예를 들어, 서열번호: 3에서 H49, H146, C166, M177, H207, H245, C432, H519, H625, M1029, H1058, H1083, H1088, H1101, H1159 및/또는 H1271, 또는 서열번호: 19, 21, 23, 25 또는 27에서 상응하는 위치. 다른 실시형태에서, 불활성화된 SARS-CoV-2 입자는 다음의 잔기 중 하나 이상에서 베타-프로피오락톤 변형을 포함한다: 예를 들어, 서열번호: 3에서 H207, H245, C379, M1029 및/또는 C1032, 또는 서열번호: 19, 21, 23, 25 또는 27에서 상응하는 위치. "상응하는 위치"는 예를 들어 서열번호: 19, 21, 23, 25 또는 27을 NCBI Basic Local Alignment Search Tool(BLAST)과 같은 프로그램을 사용하여 서열번호: 3과 정렬될 때 서열번호: 3에서 위치 H207, H245, C379, M1029 및/또는 C1032와 정렬하는 서열번호: 19, 21, 23, 25 또는 27에서의 위치를 의미한다.
예로써, 일부 실시형태에서, 서열번호: 3에서 H207, H245, C379, M1029 및 C1032에 상응하는 서열번호: 19, 21, 23, 25 또는 27에서의 위치가 하기에 나타나 있다:
일부 실시형태에서, 불활성화된 SARS-CoV-2 입자의 막(M) 당단백질은 다음의 잔기 중 하나 이상에서 베타-프로피오락톤 변형을 포함한다: 예를 들어, 서열번호: 29에서 125, 154, 155, 159 및/또는 210, 바람직하게는 H154, H155 , C159 및/또는 H210.
일부 실시형태에서, 불활성화된 SARS-CoV-2 입자의 뉴클레오캡시드(N) 단백질은, 예를 들어 서열번호: 28에서의 M234에서 베타-프로피오락톤 변형을 포함한다.
일부 실시형태에서, 불활성화된 SARS-CoV-2 입자에서 다음의 잔기 중 하나 이상의 30%, 20%, 10%, 5%, 3% 또는 1% 미만은 베타-프로피오락톤 변형된다: (i) 스파이크(S) 단백질에서, 예를 들어, 서열번호: 3, 또는 서열번호: 19, 21, 23, 25 또는 27에서 상응하는 위치에서: 잔기 49, 146, 166, 177, 207, 245, 379, 432, 519, 625, 1029, 1032, 1058, 1083, 1088, 1101, 1159 및/또는 1271; 바람직하게는 H49, H146, C166, M177, H207, H245, C432, H519, H625, M1029, H1058, H1083, H1088, H1101, H1159 및/또는 H1271; 대안적으로 H207, H245, C379, M1029 및/또는 C1032; (ii) 막(M) 당단백질에서, 예를 들어, 서열번호: 29에서: 잔기 125, 154, 155, 159 및/또는 210; 바람직하게는 H154, H155, C159 및/또는 H210; 및/또는 (iii) 예를 들어, 서열번호: 29에서 뉴클레오캡시드(N) 단백질의 M234. 바람직한 실시형태에서, 불활성화된 SARS-CoV-2 입자에서 상기 잔기의 적어도 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20 또는 각각의 30%, 20%, 10%, 5%, 3% 또는 1% 미만은 베타-프로피오락톤 변형된다. 이 단락에서, 변형된 잔기의 백분율은, 예를 들어, 하기 실시예 6 및/또는 7에 기술된 바와 같이 단백질 풍부도로 정규화된 동일한 변형 부위에 대한 변형된 펩티드 대 비변형된 펩티드의 비율인, 부위 점유를 지칭하기 위한 것이다.
또 다른 바람직한 실시형태에서, 불활성화된 SARS-CoV-2 입자에서의 다음 위치에서 베타-프로피오락톤-변형된 잔기(즉, 부위 점유)의 비율은 다음과 같다:
(i) 스파이크(S) 단백질(예를 들어, 서열번호: 3, 또는 서열번호: 19, 21, 23, 25 또는 27에서 상응하는 위치)에서:
(a) H207: 30% 미만, 바람직하게는 0.01 내지 25%; 및/또는
(b) H245: 10% 미만, 바람직하게는 0.1 내지 5%; 및/또는
(c) C379: 5% 미만, 1% 미만 또는 0.1% 미만; 및/또는
(d) M1029: 5% 미만, 1% 미만 또는 0.1% 미만; 및/또는
(e) C1032: 5% 미만, 1% 미만 또는 0.1% 미만; 및/또는
(ii) 막(M) 당단백질(예를 들어, 서열번호: 29)에서:
(f) H154: 5% 미만, 1% 미만 또는 0.1% 미만; 및/또는
(g) H155: 10% 미만, 바람직하게는 0.1 내지 5%; 및/또는
(h) C159: 5% 미만, 1% 미만 또는 0.1% 미만; 및/또는
(i) H210: 20% 미만, 바람직하게는 0.1 내지 10%; 및/또는
(iii) 뉴클레오캡시드(N) 단백질(예를 들어, 서열번호 28)에서:
(j) M234: 90% 미만, 10% 미만 또는 0.1% 미만.
또 다른 바람직한 실시형태에서, 불활성화된 SARS-CoV-2 입자의 스파이크(S) 단백질에서 다음 위치 각각(예를 들어, 서열번호: 3, 또는 서열번호: 19, 21, 23, 25 또는 27에서 상응하는 위치)에서 베타-프로피오락톤-변형된 잔기(즉, 부위 점유)의 비율은 다음과 같다:
(a) 잔기 H49, H146, C166, H207, H519, M1029, H1083, H1088, H1101, H1159 및/또는 H1271: 20% 미만, 바람직하게는 0.01 내지 10%, 더욱 바람직하게는 0.1 내지 5%; 및/또는
(b) 잔기 M177, C432, H625: 30% 미만, 바람직하게는 0.1 내지 20%, 더욱 바람직하게는 1 내지 10%; 및/또는
(c) 잔기 H245, H1058: 30% 미만, 바람직하게는 0.1 내지 20%, 더욱 바람직하게는 5 내지 15%;
일부 실시형태에서, 불활성화된 SARS-CoV-2 입자(또는 그의 스파이크(S) 단백질)에서 베타-프로피오락톤-변형된 아미노산 잔기의 비율은 베타-프로피오락톤-불활성화된 인플루엔자 입자(또는 이의 헤마글루티닌(HA) 또는 뉴라미니다제(NA) 단백질)에서, 예를 들어 SARS-CoV-2 입자에 유사한 조건 하에서 불활성화된 인플루엔자 입자에서 변형된 잔기의 비율보다 적어도 5%, 10%, 20%, 30%, 50%, 70% 또는 90% 더 낮을 수 있다.
대안적인 실시형태에서, 바이러스 RNA는 자외선(UV) 광으로 처리에 의해 불활성화될 수 있다. UV 처리는 바이러스 입자에서 (폴리펩티드와 비교하여) RNA를 우선적으로 표적화하는 데 사용될 수 있어, 예를 들어, 변형된 뉴클레오티드 및/또는 단편화를 초래한다. 일부 실시형태에서, UV 처리는 바이러스의 불활성화를 개선하기 위해 베타-프로피오락톤 처리와 조합될 수 있으며, 예를 들어 베타-프로피오락톤 처리 단계는 UV 처리 단계가 뒤따르거나 그 반대일 수 있거나, 또는 UV 처리 단계는 베타-프로피오락톤 처리 단계와 동시에 수행될 수 있다.
다른 실시형태에서, 천연 SARS-CoV-2 입자는 포름알데히드를 사용하여 불활성화될 수 있다. 그러나, 포름알데히드 불활성화는 바이러스 RNA를 우선적으로 표적화하고 바이러스 표면 단백질에서 면역원성 에피토프를 보존하는 데 덜 적합하기 때문에 전형적으로 본 발명에서 덜 바람직하다.
따라서 바람직한 실시형태에서, 불활성화 단계(들)(특히 포름알데히드를 사용할 때 뿐만 아니라 예를 들어 베타-프로피오락톤과 같은 다른 불활성화제를 사용할 때)는 표면 항원 완전성, 특히 S 단백질의 완전성을 보존하기 위해 온화한 조건 하에서 수행된다.
일 실시형태에서, 이러한 약한 불활성화 방법은 천연 SARS-CoV-2 입자를 포함하는 액체 조성물을 화학적 바이러스 불활성화제(예컨대 예를 들어, 상기에 열거된 임의의 화학적 불활성화제 또는 조합, 예로써 포름알데히드 또는 바람직하게는 베타-프로피오락톤)와 용기에서 접촉시키고, 난류 흐름이 아닌 층류 흐름의 조건 하에 화학적 바이러스 불활성화제 및 SARS-CoV-2 입자를 포함하는 액체 조성물을 혼합하고, 화학적 바이러스 불활성화제 및 SARS-CoV-2 입자를 포함하는 액체 조성물을 바이러스 입자를 불활성화하기에 충분한 시간 동안 인큐베이션하는 것을 포함한다. 약한 불활성화 단계는 선택적으로 유연한 생물반응기 백에서 수행된다. 약한 불활성화 단계는 바람직하게는 불활성화의 기간 동안 5회 이하의 용기 반전을 포함한다. 바람직하게는, 화학적 바이러스 불활성화제와 천연 SARS-CoV-2 입자를 포함하는 조성물의 혼합은 인큐베이션의 기간 동안 10rpm 이하에서 10분 이하 동안 용기를 흔들기, 회전, 궤도 진탕 또는 진동시키는 것을 포함한다.
적합한 약한 또는 부드러운 불활성화 방법은 하기 실시예에 기재되어 있다. 그러한 방법의 추가 세부사항은 또한 WO 2021/048221에 기술되어 있으며, 그 내용은 그 전체가 본 명세서에 포함된다.
전형적으로, 불활성화 단계는 불활성화된 SARS-CoV-2 입자에 의한 포유동물(예를 들어, 인간) 세포의 감염성을 실질적으로 제거한다. 예로써, 포유동물 세포의 감염성은 천연 SARS-CoV-2 입자와 비교하여 적어도 99%, 99.99% 또는 99.9999% 감소될 수 있거나, 불활성화된 SARS-CoV-2 입자에 의한 인간 세포의 감염성은 검출가능하지 않을 수 없다. 표준 검정, 예를 들어 플라크 검정, TCID50(50% 조직 배양 감염성 용량)의 결정은 잔류 감염성 및 효과적인 바이러스 역가를 결정하는 데 사용할 수 있다. 예로써, 포유동물 세포는 MDCK, COS 또는 Vero 세포일 수 있다.
본 발명의 바람직한 실시형태에서, SARS-CoV-2 입자의 천연 표면 형태는 불활성화된 바이러스 입자에서 보존된다. 이것은 예를 들어 하나 이상의 또는 모든 면역원성(중화) 에피토프는 불활성화된 바이러스 입자에 보유되어, 불활성화된 입자는 인간 대상체에게 투여될 때 천연 SARS-CoV-2 입자에 대한 중화 항체를 생성할 수 있다는 것을 의미한다. "천연 표면 형태"는 천연 SARS-CoV-2 입자, 즉 불활성화되지 않은 SARS-CoV-2 입자(비리온)에서 발견되는 표면 형태를 지칭하는 것으로 의미한다. 대상체에서 중화 항체를 생성할 때 백신 또는 불활성화된 SARS-CoV-2 입자의 특성은 예를 들어 플라크 환원 중화 시험(PRNT 검정), 예를 들어 당해 분야에 공지된 바와 같이 대상체로부터의 혈청 샘플을 사용하여 결정될 수 있다.
바람직한 실시형태에서, 본 발명은 (i) 스파이크(S) 단백질; (ii) 뉴클레오캡시드(N) 단백질; (iii) 막(M) 당단백질; 및/또는 (iv) 엔벨로프(E) 단백질의 천연 형태가 불활성화된 바이러스 입자에 보존된다는 것을 포함한다. 바람직하게는, 불활성화된 SARS-CoV-2 입자는 천연 형태 스파이크(S) 단백질을 포함한다. 따라서, 불활성화된 SARS-CoV-2 입자에서 S(및/또는 N 및/또는 M 및/또는 E) 단백질은 바람직하게는 천연 SARS-CoV-2 입자에 존재하는 하나 이상 또는 모든 (온전한) 면역원성(중화) 에피토프를 포함한다. 바람직하게는, 불활성화된 바이러스 입자에서 S(및/또는 N 및/또는 M 및/또는 E) 단백질은 불활성화 단계에 의해 변형되지 않거나 실질적으로 변형되지 않는다.
바이러스 입자의 표면 형태의 보존은 표준 기술을 사용하여 평가할 수 있다. 예로써, X선 결정학, MS 분석(변형에 의한 아미노산 질량의 이전) 및 저온-전자 현미경과 같은 방법을 사용하여 바이러스 표면을 시각화할 수 있다. 바이러스 입자의 표면에 존재하는 단백질의 2차 및 3차 구조는 원형 이색성(CD) 분광법(예를 들어, 원거리(190-250nm) UV 또는 근거리(250-300nm) UV 범위)과 같은 방법에 의해 분석할 수도 있다. 더욱이, 천연 표면 형태의 보존은, 예를 들어 S 단백질에서 천연 바이러스 표면 상에 존재하는 에피토프에 대한 항체를 사용하여 확인할 수 있다. 따라서 불활성화된 바이러스 입자와 천연 바이러스 입자 사이의 항-SARS-CoV-2 항체의 교차-반응은 백신에서 잠재적으로 중화 에피토프의 보유를 입증하는 데 사용될 수 있다.
SARS-CoV-2 비리온 및 특히 스파이크(S) 단백질의 표면 형태는 알려져 있으며 최근 여러 연구에서 발표되었다. 예로써, SARS-CoV-2 수용체 결합 도메인의 결정 구조를 기술하는, Shang, J. et al. (Structural basis of receptor recognition by SARS-CoV-2. Nature https://doi.org/10.1038/s41586-020-2179-y (2020))을 참고한다. 부가하여, Walls et al. (Structure, Function, and Antigenicity of the SARS-CoV-2 Spike Glycoprotein, Cell 180, 1-12 (2020), https://doi.org/10.1016/j.cell.2020.02.058)은 cryo-EM을 사용한 S 단백질 표면 형태의 상세한 설명을 제공하고 보존된 S 단백질 에피토프를 표적화하는 교차-중화 항체를 기술한다. 감염 및 회복기 환자의 혈청으로부터의 항체의 사용은 중요한 S 단백질 에피토프에 대해 추가로 밝혔다(Zhang B et al. Mining of epitopes on spike protein of SARS-CoV-2 from COVID-19 patients. 2020 Cell Research 30:702-704). 최근 연구는 또한 회복기 혈청을 사용한 연구에서 중요한 항원으로 확인된 SARS-CoV-2 뉴클레오캡시드(N) 단백질의 구조에 초점을 맞추고 있다(Zeng W et al. Biochemical characterization of SARS-CoV-2 nucleocapsid protein. 2020 BBRC 527(3): 618-623). 잠재적으로 중요한 SARS-CoV-2 에피토프에 관한 추가 지침은 코로나바이러스 에피토프 매핑 연구(http://biopharm.zju.edu.cn/coviedb/; Wu J COVIEdb: A Database for Potential Immune Epitopes of Coronaviruses. 2020 Front. Pharmacol. 11:572249; doi: 10.3389/fphar.2020.572249)로부터 정보의 모음인 COVIEdb 데이터베이스에서 이용가능하다.
SARS-CoV-2 표면 에피토프(S 단백질에 포함)에 대한 모노클로날 항체는 문헌(예를 들어, 상기에 언급한 바와 같음)에 기재되어 있으며, 상업적 출처에서 이용가능할 수 있고/있거나 실험 동물의 면역화와 같은 표준 기술을 사용하여 생성될 수 있다. 예를 들어, 2020년 9월 9일 현재, SARS-CoV-2에 대한 최소 169개의 서로 다른 항체가 캘리포니아주 샌디에고 소재의 MyBioSource, Inc.에서 이용가능하다(예를 들어, 카탈로그 번호 MBS8574747, www.MyBioSource.com 참조). 동일한 날짜에 SARS-CoV-2에 대한 최소 28개의 서로 다른 항체가 펜실바니아주 웨인 소재의 Sino Biological US Inc.에서 이용가능하다(예를 들어, 카탈로그 번호 40150-D006, https://www.sinobiological.com/ 참조). 추가 적합한 항체는 Ou et al. (Characterization of spike glycoprotein of SARS-CoV-2 on virus entry and its immune cross-reactivity with SARS-CoV, Nature Communications (2020) 11:1620; https://doi.org/10.1038/s41467-020-15562-9)에 기술되어 있다. 본 발명의 실시형태에서, 숙련가는 이러한 항체와 불활성화된 입자의 교차-반응을 통해 SARS-CoV-2(또는 예를 들어, 그의 S 또는 N 단백질)의 천연 표면 형태의 보존을 검출할 수 있다. 달리 말해서, 불활성화된 입자는 표면 에피토프에 대해 지향된 하나 이상의 항-SARS-CoV-2 항체, 바람직하게는 항-S-단백질 항체, 예를 들어, 천연 SARS-CoV-2 비리온에서 중화 에피토프에 대해 생성된 항체에 특이적으로 결합한다.
백신 조성물에서 SARS-CoV-2 입자는 임의의 공지된 SARS-CoV-2의 균주 또는 이의 변이체로부터 유래될 수 있다. 예로써, 바이러스는 도 2에 정의된 균주일 수 있거나, 그 안에 정의된 뉴클레오티드 또는 아미노산 서열, 또는 이들에 적어도 예를 들어 95% 서열 동일성을 갖는 변이체 서열을 포함할 수 있다. 예로써, 일 실시형태에서 SARS-CoV-2 입자는 (i)(NCBI 참조 서열 NC_045512.2에도 정의된) 서열번호: 1에 정의된 바와 같은 DNA 서열에 상응하는 RNA 서열을 포함한다. "에 상응하는"은 정의된 DNA 서열이 바이러스 RNA 서열의 등가물, 즉 바이러스 RNA 또는 바이러스 RNA에 상보적인 서열을 인코딩하는 DNA 또는 cDNA 서열인 것으로 이해될 것이다. 본 명세서에 기재된 바와 같이, 불활성화된 과정은 바이러스 RNA의 변형(예를 들어, 알킬화 또는 아실화) 및/또는 단편화를 초래할 수 있고, 따라서 불활성화된 바이러스 입자는 본 명세서에 정의된 온전한 RNA 서열을 포함하지 않을 수 있지만 오히려 그러한 서열을 포함하는 천연 바이러스 입자로부터 유래된다는 것으로 이해될 것이다.
SARS-CoV-2 입자는 또한 공지된 SARS-CoV-2 Wuhan-Hu-1 계통의 변이체를 포함하거나 또한, 예를 들어 서열번호: 1 및/또는 NCBI 참조 서열 NC_045512.2에 대해 적어도 85%, 적어도 90%, 적어도 95% 또는 적어도 99% 서열 동일성을 갖는 서열인, 참조 계통으로 지칭될 수 있다. 바람직하게는, 변이체 서열은 감염성 SARS-CoV-2 입자, 예를 들어 독성 SARS-CoV-2 바이러스를 팩킹할 수 있는 RNA 서열을 포함하는 천연(비-불활성화된) SARS-CoV-2 입자를 인코딩한다.
추가로 공지된 SARS-CoV-2 입자는 또한 공지된 SARS-CoV-2 남아프리카 계통 B.1.351의 변이체, 예를 들어 서열번호: 18 및/또는 NCBI 참조 서열 MW598408에 대해 적어도 85%, 적어도 90%, 적어도 95% 또는 적어도 99% 서열 동일성을 갖는 서열을 포함할 수 있다. 바람직하게는, 변이체 서열은 감염성 SARS-CoV-2 입자, 예를 들어 독성 SARS-CoV-2 바이러스를 팩킹할 수 있는 RNA 서열을 포함하는 천연(비-불활성화된) SARS-CoV-2 입자를 인코딩한다. 공지된 SARS-CoV-2 남아프리카 계통 B.1.351의 변이체의 추가 예는 도 2에 제공되어 있다.
추가로 공지된 SARS-CoV-2 입자는 또한 공지된 SARS-CoV-2 브라질 계통 P.1의 변이체, 예를 들어 서열번호: 20 및/또는 NCBI 참조 서열 MW520923에 대해 적어도 85%, 적어도 90%, 적어도 95% 또는 적어도 99% 서열 동일성을 갖는 서열을 포함할 수 있다. 바람직하게는, 변이체 서열은 감염성 SARS-CoV-2 입자, 예를 들어 독성 SARS-CoV-2 바이러스를 팩킹할 수 있는 RNA 서열을 포함하는 천연(비-불활성화된) SARS-CoV-2 입자를 인코딩한다. 공지된 SARS-CoV-2 브라질 계통 P.1의 변형의 추가 예는 도 2에 제공되어 있다.
추가로 공지된 SARS-CoV-2 입자는 또한 공지된 SARS-CoV-2 UK 계통 B.1.1.7의 변이체, 예를 들어 서열번호: 22 및/또는 NCBI 참조 서열 MW422256에 대해 적어도 85%, 적어도 90%, 적어도 95% 또는 적어도 99% 서열 동일성을 갖는 서열을 포함할 수 있다. 바람직하게는, 변이체 서열은 감염성 SARS-CoV-2 입자, 예를 들어 독성 SARS-CoV-2 바이러스를 팩킹할 수 있는 RNA 서열을 포함하는 천연(비-불활성화된) SARS-CoV-2 입자를 인코딩한다. 공지된 SARS-CoV-2 UK 계통 B.1.1.7의 변이체의 추가 예는 도 2에 제공되어 있다.
추가로 공지된 SARS-CoV-2 입자는 또한 공지된 SARS-CoV-2 캘리포니아 계통 B.1.427 및 B.1.429의 변이체, 예를 들어 서열번호: 24 및/또는 서열번호: 26에 대해 적어도 85%, 적어도 90%, 적어도 95% 또는 적어도 99% 서열 동일성을 갖는 서열을 포함할 수 있다. 바람직하게, 변이체 서열은 감염성 SARS-CoV-2 입자, 예를 들어 독성 SARS-CoV-2 바이러스를 팩킹할 수 있는 RNA 서열을 포함하는 천연(비-불활성화된) SARS-CoV-2 입자를 인코딩한다. 공지된 SARS-CoV-2 캘리포니아 계통의 변형의 추가 예는 GenBank에서 찾을 수 있다.
유사하게, 바람직한 실시형태에서 SARS-CoV-2 입자는 (i) 서열번호: 3에 정의된 바와 같은 아미노산 서열, 또는 (ii) 서열번호: 3에 대해 적어도 95%, 적어도 97% 또는 적어도 99% 동일성을 갖는 아미노산 서열을 포함하거나 이로 구성된 Wuhan 계통의 S 단백질을 포함한다.
추가의 바람직한 실시형태에서 SARS-CoV-2 입자는 (i) 서열번호: 19에 정의된 바와 같은 아미노산 서열, 또는 (ii) 서열번호: 19에 대해 적어도 95%, 적어도 97% 또는 적어도 99% 동일성을 갖는 아미노산 서열을 포함하거나 이로 구성된 남아프리카 B1.351 계통의 S 단백질을 포함한다.
추가의 바람직한 실시형태에서 SARS-CoV-2 입자는 (i) 서열번호: 21에 정의된 바와 같은 아미노산 서열, 또는 (ii) 서열번호: 21에 대해 적어도 95%, 적어도 97% 또는 적어도 99% 동일성을 갖는 아미노산 서열을 포함하거나 이로 구성된 브라질 P.1 계통의 S 단백질을 포함한다.
추가의 바람직한 실시형태에서 SARS-CoV-2 입자는 (i) 서열번호: 23에 정의된 바와 같은 아미노산 서열, 또는 (ii) 서열번호: 23에 대해 적어도 95%, 적어도 97% 또는 적어도 99% 동일성을 갖는 아미노산 서열을 포함하거나 이로 구성된 UK B.1.1.7 계통의 S 단백질을 포함한다.
추가의 바람직한 실시형태에서 SARS-CoV-2 입자는 (i) 서열번호: 25에 정의된 바와 같은 아미노산 서열, 또는 (ii) 서열번호: 25에 대해 적어도 95%, 적어도 97% 또는 적어도 99% 동일성을 갖는 아미노산 서열을 포함하거나 이로 구성된 캘리포니아 B.1.427 계통의 S 단백질을 포함한다.
추가의 바람직한 실시형태에서 SARS-CoV-2 입자는 (i) 서열번호: 27에 정의된 바와 같은 아미노산 서열, 또는 (ii) 서열번호: 27에 대해 적어도 95%, 적어도 97% 또는 적어도 99% 동일성을 갖는 아미노산 서열을 포함하거나 이로 구성된 캘리포니아 B.1.429 계통의 S 단백질을 포함한다.
일부 실시형태에서, 불활성화된 SARS-CoV-2 입자는 백신에서 다른 불활성화된 SARS-CoV-2 입자와 조합된다(다른 = 다른 서열).
일부 실시형태에서, 백신 내 SARS-CoV-2 입자의 조합은 i) 참조 Wuhan_1 계통 예컨대, 예를 들어, 서열번호: 1, 9, 12, 15; ii) 남아프리카 B.1.531 계통 예컨대, 예를 들어, 서열번호: 18; 브라질 P.1 계통 예컨대, 예를 들어, 서열번호: 20; UK B.1.1.7 계통 예컨대, 예를 들어, 서열번호: 22 및 캘리포니아 B.1.427 계통 예컨대, 예를 들어, 서열번호: 24 또는 B.1.429 계통 예컨대, 예를 들어, 서열번호: 26으로 구성된 군으로부터 선택된 적어도 2개의 SARS-CoV-2 입자를 포함하거나 이로 구성된다. 바람직한 실시형태는 i) Wuhan_1 계통 예컨대, 예를 들어, 서열번호 9; 및 ii) 남아프리카 B.1.531 계통 예컨대, 예를 들어, 서열번호: 18을 포함하는 조합이다.
추가 실시형태에서, 백신 내 SARS-CoV-2 입자의 조합은 i) 참조 Wuhan_1 계통 예컨대, 예를 들어, 서열번호: 1, 9, 12, 15; ii) 남아프리카 B.1.531 계통 예컨대, 예를 들어, 서열번호: 18; 브라질 P.1 계통 예컨대, 예를 들어, 서열번호: 20; UK B.1.1.7 계통 예컨대, 예를 들어, 서열번호: 22 및 캘리포니아 B.1.427 계통 예컨대, 예를 들어, 서열번호: 24 또는 B.1.429 계통 예컨대, 예를 들어, 서열번호: 26으로 구성된 군으로부터 선택된 적어도 3개, 예를 들어 3의 SARS-CoV-2 입자를 포함하거나 이로 구성된다. 이러한 3가 백신의 바람직한 실시형태는 i) Wuhan_1 계통 예컨대, 예를 들어, 서열번호 9; 및 ii) 남아프리카 B.1.531 계통 예컨대, 예를 들어, 서열번호: 18; 및 iii) UK B.1.1.7 계통 예컨대, 예를 들어, 서열번호: 22를 포함하는 조합이다. 이러한 3가 백신의 또 다른 바람직한 실시형태는 i) Wuhan_1 계통 예컨대, 예를 들어, 서열번호 9; 및 ii) 남아프리카 B.1.531 계통 예컨대, 예를 들어, 서열번호: 18; 및 iii) 브라질 P.1 계통 예컨대, 예를 들어, 서열번호: 20을 포함하는 조합이다.
아미노산 서열 및/또는 핵산 서열 간의 유사성은 서열 간의 유사성의 관점에서 표현되고, 그렇지 않으면 서열 동일성으로 지칭된다. 서열 동일성은 종종 백분율 동일성의 관점에서 측정된다; 백분율이 높을수록 두 서열이 더 유사하다. 폴리뉴클레오티드 또는 폴리펩티드의 상동체, 이종상동체, 또는 변이체는 표준 방법을 사용하여 정렬될 때 상대적으로 높은 정도의 서열 동일성을 가질 것이다.
비교를 위한 서열의 정렬 방법은 당업계에 잘 알려져 있다. 다양한 프로그램 및 정렬 알고리즘은 Smith & Waterman, Adv. Appl. Math. 2:482, 1981; Needleman & Wunsch, Mol. Biol. 48:443, 1970; Pearson & Lipman, Proc. Natl. Acad. Sci. USA 85:2444, 1988; Higgins & Sharp, Gene, 73:237-44, 1988; Higgins & Sharp, CABIOS 5: 151-3, 1989; Corpet et al., Nuc. Acids Res. 16: 10881-90, 1988; Huang et al. Computer Appls. in the Biosciences 8, 155-65, 1992; 및 Pearson et al., Meth. Mol. Bio. 24:307-31, 1994에 기술되어 있다. Altschul et al, J. Mol. Biol. 215:403-10, 1990은 서열 정렬 방법 및 상동성 계산에 대한 자세한 고려사항을 제시한다.
일단 정렬되면, 일치하는 수는 동일한 뉴클레오티드 또는 아미노산 잔기가 양 서열에 존재하는 위치의 수를 계수함에 의해 결정된다. 서열 동일성 퍼센트는 일치하는 수를 식별된 서열에 제시된 서열의 길이 또는 연결 길이(예컨대 식별된 서열에 제시된 서열로부터 100개 연속 뉴클레오티드 또는 아미노산 잔기)로 나누고, 그 다음 결과 값에 100을 곱함에 의해 결정된다. 바람직하게, 서열 동일성 백분율은 서열의 전체 길이에 걸쳐 결정된다. 예를 들어, 1166개의 일치를 갖는 펩티드 서열은 1554개의 아미노산을 갖는 테스트 서열과 정렬될 때 테스트 서열과 75.0% 동일하다(1166÷1554*100=75.0). 퍼센트 서열 동일성 값은 가장 가까운 10분의 1로 반올림된다. 예를 들어 75.11, 75.12, 75.13 및 75.14는 75.1로 반내림하고 반면 75.15, 75.16, 75.17, 75.18 및 75.19는 75.2로 반올림한다. 길이 값은 항상 정수일 것이다.
서열 분석 프로그램 BLASTP, BLASTN, BLASTX, TBLASTN 및 TBLASTX와 관련하여 사용하기 위한 NCBI Basic Local Alignment Search Tool(BLAST)(Altschul et al., Mol. Biol. 215:403, 1990)은 국립 생명공학 정보 센터(NCBI, 메릴랜드주 베데스다 소재) 및 인터넷을 포함한 여러 출처에서 이용가능하다. 이 프로그램을 사용하여 서열 동일성을 결정하는 방법에 대한 설명은 인터넷 상 NCBI 웹사이트에서 이용가능하다. BLAST 및 BLAST 2.0 알고리즘은 또한 Altschul et al., Nucleic Acids Res. 25:3389-3402, 1977에 기술되어 있다. BLAST 분석을 수행하기 위한 소프트웨어는 국립 생명공학 정보 센터(ncbi.nlm.nih.gov)를 통해 공개적으로 이용가능하다. BLASTN 프로그램(뉴클레오티드 서열용)은 기본적으로 단어 길이(W) 11, 정렬(B) 50, 기대값(E) 10, M=5, N=-4 및 양 가닥의 비교를 사용한다. BLASTP 프로그램(아미노산 서열용)은 기본값으로 단어 길이(W) 3, 기대값(E) 10, 및 BLOSUM62 스코어링 매트릭스를 사용한다(Henikoff & Henikoff, Proc. Natl. Acad. Sci. USA 89: 10915, 1989 참조).
폴리뉴클레오티드 또는 폴리펩티드의 상동체 및 변이체는 전형적으로 참조의 적어도 50, 100, 150, 250, 500, 1000, 2000, 5000 또는 10,000개의 뉴클레오티드 또는 아미노산 잔기에 걸쳐, 참조 서열의 전장에 걸쳐 또는 관심있는 참조 아미노산 서열과의 전장 정렬에 걸쳐 계수된 적어도 약 75%, 예를 들어 적어도 약 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% 또는 99% 서열 동일성의 소유를 특징으로 한다. 참조 서열과 훨씬 더 큰 유사성을 가진 폴리뉴클레오티드 또는 단백질은 이 방법에 의해 평가할 때 증가하는 동일성 백분율 예컨대 적어도 80%, 적어도 85%, 적어도 90%, 적어도 95%, 적어도 98%, 또는 적어도 99% 서열 동일성을 나타낼 것이다. 아미노산 또는 핵산 서열의 서열 비교를 위해, 전형적으로 하나의 서열은 테스트 서열이 비교되는 참조 서열로 작용한다. 서열 비교 알고리즘을 사용하는 경우, 테스트 서열과 참조 서열을 컴퓨터에 입력하고 필요에 따라 하위 서열 좌표를 지정하고 서열 알고리즘 프로그램 매개변수를 지정한다. 기본(default) 프로그램 매개변수가 사용된다.
유용한 알고리즘의 일 예는 PILEUP이다. PILEUP은 Feng & Doolittle, Mol. Evol. 35:351-360, 1987의 점진적 정렬 방법의 단순화를 사용한다. 사용된 방법은 Higgins & Sharp, CABIOS 5:151-153, 1989에 의해 기술된 방법과 유사하다. PILEUP을 사용하여 참조 서열을 다른 테스트 서열과 비교하여 다음 매개변수를 사용하여 퍼센트 서열 동일성 상관관계를 결정한다: 기본 간격 가중치(3.00), 기본 간격 길이 가중치(0.10) 및 가중치가 적용된 말단 간격. PILEUP은 GCG 서열 분석 소프트웨어 패키지, 예를 들어 버전 7.0으로부터 얻을 수 있다(Devereaux et al., Nuc. Acids Res. 12:387-395, 1984).
본 명세서에 사용된 "적어도 80% 동일성"에 대한 언급은 특정 참조 서열에 대해, 예를 들어, 참조 서열의 적어도 50, 100, 150, 250, 500, 1000, 5000 또는 10,000개 뉴클레오티드 또는 아미노산 잔기에 대해 또는 서열의 전장에 대해 적어도 80%, 적어도 85%, 적어도 90%, 적어도 91%, 적어도 92%, 적어도 93%, 적어도 94%, 적어도 95%, 적어도 96%, 적어도 97%, 적어도 98%, 적어도 99%, 또는 심지어 100% 동일성을 지칭한다. 본 명세서에 사용된 "적어도 90% 동일성"에 대한 언급은 특정 참조 서열에 대해, 예를 들어, 참조 서열의 적어도 50, 100, 150, 250, 500, 1000, 5000 또는 10,000개 뉴클레오티드 또는 아미노산 잔기에 대해 또는 서열의 전장에 대해 "적어도 90%, 적어도 91%, 적어도 92%, 적어도 93%, 적어도 94%, 적어도 95%, 적어도 96% , 적어도 97%, 적어도 98%, 적어도 99%, 또는 심지어 100% 동일성"을 지칭한다.
일부 실시형태에서, 불활성화된 SARS-CoV-2 입자는 백신에서 어쥬번트와 조합된다. 일부 실시형태에서, 어쥬번트는 Th1 반응-지향 어쥬번트이다. 이것은 백신이 대상체에게 투여될 때 어쥬번트가 대상체에서 (Th2 유형 반응보다는) 우세한 T 헬퍼 유형 1(즉, Th1) 면역 반응의 유도를 촉진한다는 것을 의미한다. 일반적으로 사용되는 백신의 Th1- 또는 Th2-지향 특성은 당업계에 공지되어 있다. 놀랍게도 주로 Th1 반응을 촉진하는 어쥬번트를 사용하면 백신의 면역원성을 개선하고 따라서 항바이러스 반응뿐만 아니라 (바이러스 성분에 대한 과민성에 기인해 가능하기로는 주로 Th2 유형 반응으로 인해 발생할 수 있는) 면역병리와 같은 불리한 영향의 위험을 감소시키는 것을 개선할 수 있다는 것이 발견되었다.
일부 실시형태에서, 어쥬번트는 3-O-데사실-4'-모노포스포릴 지질 A(MPL), 사포닌 QS-21, CpG-함유 올리고데옥시뉴클레오티드(CpG ODN), 스쿠알렌, DL-α-토코페롤, 양이온성 펩티드, 데옥시이노신-함유 면역자극 올리고데옥시핵산 분자(I-ODN) 및/또는 이미퀴모드를 포함한다. 예로써, 적합한 어쥬번트의 예는 3-O-데사실-4'-모노포스포릴 지질 A(MPL) 및 사포닌 QS-21을 포함하는 리포솜 제제인 어쥬번트 시스템 01(AS01); CpG 1018, 서열 5' TGACTGTGAACGTTCGAGATGA 3'(서열번호: 4)을 포함하는 CpG ODN; 스쿠알렌, DL-α-토코페롤 및 폴리소르베이트 80을 포함하는 어쥬번트 시스템 03(AS03); 서열 KLKL5KLK(서열번호: 5)를 포함하는 펩티드 및 올리고-d(IC)13(서열번호: 6)을 포함하는 I-ODN을 포함하는 IC31; 또는 MF59, 스쿠알렌, 트윈 80 및 스팬 85를 포함하는 수중유 에멀젼을 포함할 수 있다.
다른 실시형태에서, 백신 또는 어쥬번트는 CpG-함유 올리고데옥시뉴클레오티드(CpG ODN)를 포함하지 않는다. 또 다른 실시형태에서, 백신 또는 어쥬번트는 CpG 1018을 포함하지 않으며, 즉 백신 또는 어쥬번트는 서열 5' TGACTGTGAACGTTCGAGATGA 3'(서열번호: 4)을 포함하지 않는다.
일부 실시형태에서, 특히 AS01, AS03, MF59, 이미퀴모드 또는 CpG와 같은 Th1 촉진 어쥬번트의 투여량은 경험적으로 도달할 것이다. 일부 실시형태에서, Th1 촉진 어쥬번트의 투여량은 이전 연구로부터 결정될 것이다.
대안적인 실시형태에서, 어쥬번트는 알루미늄 염, 예를 들어, 산화알루미늄, 수산화알루미늄 또는 인산알루미늄을 포함할 수 있다. 바람직한 알루미늄 염은 Cu 함량이 감소된, 예를 들어 WO2013/083726 또는 Schlegl et al., Vaccine 33 (2015) 5989-5996에 상세히 기재된 백신 조성물의 중량을 기준으로 1.25 ppb보다 낮은 어쥬번트를 갖는 수산화알루미늄이다. 일부 실시형태에서, 명반 어쥬번트는 백신 조성물에서 유일한 어쥬번트이다. 본 명세서에서 언급된 바와 같이, 명반 성분의 중량은 사용된 알루미늄염의 유형에 관계없이 용액 내 Al3+의 중량을 지칭한다. 예를 들어, 0.5mg의 Al3+는 1.5mg 명반에 상응한다. 일 실시형태에서, SARS-CoV-2 백신 조성물에 존재하는 명반(Al3+) 양은 약 0.1 내지 2mg/mL, 약 0.2 내지 1.5mg/mL, 약 0.5 내지 1.3mg/mL, 특히 약 0.8 내지 1.2mg/mL, 가장 바람직하게는 약 1mg/mL, 즉, 0.5mg/용량이다. 그러나 알루미늄 어쥬번트 단독의 사용은 일반적으로 본 발명에서 덜 선호되는데, 이는 이들이 현저하게 Th2 유형 면역 반응을 지향하는 경향이 있기 때문이다. 따라서 백신이 알루미늄 염을 포함하는 실시형태에서, 백신이, 예를 들어, 상기에 기술된 바와 같이 Th1-지향 어쥬번트를 추가로 포함하는 것이 특히 바람직하다.
따라서 일 실시형태에서, 어쥬번트는 알루미늄 염 및 CpG ODN, 예를 들어, CpG 1018(서열번호: 4)을 포함할 수 있다. CpG 1018은 명반 상에 흡착될 수 있고 조합 어쥬번트로 사용될 때 Th1 및 Th2 반응 둘 모두(Tian. et al. 2017 Oncotarget 8(28)45951-45964); 즉, 보다 "균형있는" 면역 반응을 유도하는 것으로 나타났다. 특히, 명반과 조합하여 투여하는 경우, CpG는 면역 반응의 전반적인 크기를 증가시키고 명반과 같은 통상적인 어쥬번트에 의해 유도되는 Th2 편향을 감소시키는 것으로 나타났다(X.P. Ioannou et al. CpG-containing oligodeoxynucleotides, in combination with conventional adjuvants, enhance the magnitude and change the bias of the immune responses to a herpesvirus glycoprotein. 2002 Vaccine 21:127-137). 명반과 조합된 CpG의 용량 범위는 10μg 내지 3mg 사이일 수 있다.
전형적으로 어쥬번트는 백신 제품의 제조 동안 불활성화된 SARS-CoV-2 입자와 결합되며, 즉 제조된 백신 제품은 어쥬번트를 포함하고 이 형태로 판매/유통된다. 대안적인 실시형태에서, 어쥬번트는 사용 시점, 예를 들어, 백신의 임상 투여 직전(때때로 백신 성분의 "병상 혼합"으로 지칭됨)에 불활성된 SARS-CoV-2 입자와 조합될 수 있다. 따라서 본 발명은 불활성화된 SARS-CoV-2 입자 및 본 명세서에 기재된 어쥬번트를 포함하는 백신 제품, 뿐만 아니라 이의 개별 성분(예를 들어, 병상 혼합에 적합)을 포함하는 키트 둘 모두와, SARS-CoV-2 감염을 예방하거나 치료하는 백신의 개별 성분의 조합된 사용을 포함한다.
SARS-CoV-2 백신은 상술한 바와 같이 천연 SARS-CoV-2 입자의 불활성화의 단계를 포함하는 방법에 의해 생성될 수 있다. 일반적으로 천연 SARS-CoV-2 입자는 표준 배양 방법, 예를 들어 바람직하게는 Vero 세포를 사용하여 포유동물 세포에서 시험관내 생성에 의해에 의해 수득될 수 있다. 예로써, 천연 SARS-CoV-2 입자는 예를 들어 그의 내용이 그 전체로 본 명세서에 포함된 WO 2017/109225 및/또는 WO 2019/057793에 기재된 것과 유사한 방법을 사용하여 생성될 수 있으며, 이는 Vero 세포에서 지카 및 치쿤구니야 바이러스의 생성에 대한 방법을 기술한다. 이들 문헌에 기술된 계대, 수확, 침전, 투석, 여과 및 정제와 같은 단계는 SARS-CoV-2 입자를 생성하는 현재 공정에 동일하게 적용할 수 있다.
예로써, 일부 실시형태에서, 방법은 (i) 수크로스 밀도 구배 원심분리, (ii) 리간드-활성화 코어 및 기공을 포함한 불활성 쉘을 포함하는 컬럼에 패킹된 고체-상 매트릭스로서, 여기서 기공의 분자량 컷-오프는 바이러스 입자가 리간드-활성화된 코어로 들어가는 것을 배제하고, 여기서 기공의 분자량 컷-오프보다 더 작은 분자는 리간드-활성화된 코어에 들어갈 수 있고 바이러스 입자 수집하는, 매트릭스, 및/또는 (iii) 배치 또는 크기 배제 크로마토그래피와 같은 하나 이상의 크기 배제 방법에 의해 불활성화된 SARS-CoV-2 입자를 정제하는 것을 포함하여; 정제된 불활성화된 SARS-CoV-2 입자를 얻을 수 있다. 바람직하게는, 바이러스 입자의 생성된 정제된 제제에서, (i) 잔류 숙주 세포 DNA의 농도는 100ng/mL 미만이고; (ii) 잔류 숙주 세포 단백질의 농도는 1μg/mL 미만이고; (iii) 감염성 바이러스 입자의 잔류 응집체 농도는 1μg/mL 미만이다.
일부 실시형태에서, 방법은 SARS-CoV-2 입자를 포함하는 수확된 배양 배지를 침전시키고 이에 의해 상등액에서 천연 SARS-CoV-2 입자를 생성하는 단계를 포함할 수 있다. 침전 단계는 배양 배지를 프로타민 설페이트 또는 벤조나아제와 접촉시키는 것을 포함할 수 있다. 이러한 단계를 사용함에 의해, 숙주 세포에서 유래한 오염 DNA뿐만 아니라 미성숙 및 그렇지 않으면 비-감염성 바이러스 입자 둘 모두가 제제로부터 분리될 수 있다. 더욱이, 프로타민 설페이트는 바이러스 분획으로부터, 예를 들어 수크로스 밀도 원심분리 또는 리간드-활성화된 코어 및 기공을 포함한 불활성 쉘을 포함하는 컬럼에 패킹된 고체-상 매트릭스를 사용하여 매우 효율적으로 분리될 수 있으며, 여기서 기공은 바이러스 입자가 리간드-활성화된 코어에 들어가는 것을 배제하는 분자량 컷-오프를 포함하고, 기공의 분자량 컷-오프보다 작은 분자(예를 들어, 프로타민 설페이트)가 리간드-활성화된 코어에 들어갈 수 있어 보다 안전한 백신을 고수율로 생성할 수 있다.
따라서 수득된 바이러스 제제 또는 백신의 잔류 숙주 세포 DNA는 1μg/mL 미만, 특히 900, 800, 700, 600, 500, 400, 300 또는 200ng/mL 미만, 바람직하게는 150 또는 100ng/mL 미만일 수 있다. 바람직한 실시형태에서, 바이러스 제제 또는 백신의 잔류 숙주 세포 DNA는 40pg/mL 미만이다. 일부 실시형태에서, 바이러스 제제 또는 백신의 잔류 숙주 세포 단백질은 10μg/mL 미만, 특히 9, 8, 7, 6, 5, 4, 3 또는 2μg/mL 미만, 바람직하게는 1μg/mL 미만이다. 바람직한 실시형태에서, 바이러스 제제 또는 백신의 잔류 숙주 세포 단백질은 150ng/mL 미만이다. 일부 실시형태에서, 바이러스 제제 또는 백신의 잔류 비-감염성 바이러스 입자는 10μg/mL 미만, 특히 9, 8, 7, 6, 5, 4, 3 또는 2μg/mL 미만, 바람직하게는 1μg/mL 미만이다. 바람직한 실시형태에서, 바이러스 제제 또는 백신의 잔류 비-감염성 바이러스 입자의 함량은 100ng/mL 미만이다.
일부 실시형태에서, 백신 및/또는 SARS-CoV-2 입자는 잔류 프로타민(예를 들어, 프로타민 설페이트)을 전형적으로 미량으로 포함할 수 있다. 일부 실시형태에서, 바이러스 제제 또는 백신에서 잔류 프로타민(예를 들어, 프로타민 설페이트)은 2μg/mL 또는 1μg/mL 미만, 특히 900, 800, 700, 600, 500, 400, 300 또는 200ng/mL 미만, 바람직하게는 100ng/mL 미만이고, 더욱 바람직하게는 HPLC의 검출 한계 미만, 특히 최종 약물 물질에서의 검출 한계 미만이다. 일부 실시형태에서, PS 함량은 HPLC 또는 크기 배제 크로마토그래피(SEC)에 의해 시험된다. 예를 들어, HPLC는 일상적인 릴리스 검정으로 JEV 수크로스 구배 풀 샘플에서 PS 측정을 위해 검증되었고 매우 민감성이다(즉, 정량 한계(LOQ) 3μg/mL; 검출 한계(LOD) 1μg/mL). 본 발명에서, SARS-CoV-2 약물 물질의 PS 함량은 <LOD였다. 일 실시형태에서, PS 함량의 HPLC 평가는 30% 아세토니트릴, 0.1% 트리플루오로아세트산을 용매로서 0.6ml/분의 유속으로 25℃ 및 214nm에서 검출을 사용하여 Superdex Peptide 10/300GL 컬럼(GE: 17-5176-01)에서 수행할 수 있다. 정제된 바이러스 제제에서 잔류 프로타민에 대한 보다 민감한 측정 방법은 질량 분석법(MS)이다. 일부 실시형태에서, 지카 바이러스 제제에서 잔류 PS 수준은 MS 또는 기타 그러한 고도로 민감한 방법, 예를 들어 핵 자기 공명(NMR)에 의해 시험된다. 이 방법을 사용하면 잔류 PS뿐만 아니라 PS의 단편 및/또는 분해 생성물을 미량, 예컨대 전형적인 샘플 장입당 106, 107 또는 108개 분자만큼 낮은 수준으로 검출할 수 있다. 일부 실시형태에서, PS 수준은 의약품에서 시험된다. 일부 실시형태에서, PS 수준은 약물 물질에서 시험된다.
바람직하게는 의약품 또는 약물 물질(예를 들어, 백신 조성물)에서 불활성화제(예를 들어, 베타-프로피오락톤)의 양은 매우 낮으며, 예를 들어 100ppm 미만, 10ppm 미만, 또는 1ppm 미만(중량 기준)이다.
SARS-CoV-2 백신은 대상체, 바람직하게는 포유동물 대상체, 보다 바람직하게는 인간 대상체에게 투여될 수 있다. 전형적으로 SARS-CoV-2 백신은 예를 들어 SARS-CoV-2 감염을 예방하고/하거나 SARS-CoV-2 연관된 질환(COVID-19)을 예방하기 위해 SARS-CoV-2 감염의 위험이 있는 대상체에게 투여된다. 대상체는 바람직하게는 (i) 노인 대상체(예를 들어, 65세, 70세 또는 80세 이상 노인) (ii) 임신한 대상체 (iii) 면역저하 대상체 또는 (iv) 어린이(예를 들어, 18세, 16세, 14세, 12세, 10세, 8세, 6세, 4세, 2세 이하 또는 더 젊은 사람)이다. 본 명세서에 기술된 SARS-CoV-2 백신은 유리하게 SARS-CoV-2-이환율 또는 사망률에 특히 민감하거나 취약한 대상체, 즉 면역저하, 임산부 또는 노인 대상체에서 강력한 면역 반응을 생성할 수 있다. SARS-CoV-2 백신은 단일 용량 또는 2회 이상의 용량, 예를 들어 약 7, 14, 21 또는 28일의 간격으로 분리된 용량으로 대상체에게 투여될 수 있다.
바람직한 실시형태에서, 인간 대상체에 투여시 백신은 SARS-CoV-2-연관된 질환(COVID-19)의 항체-의존성 증진(ADE)을 유도하지 않는다. ADE는 바이러스-특이적 항체(예를 들어, 백신접종에 의해 생성됨)가 숙주 세포 안으로의 바이러스 진입 및/또는 바이러스 복제를 향상시킬 수 있는 현상이다. 본 명세서에 기술된 불활성화된 SARS-CoV-2 백신이 인간 대상체에서 낮은 또는 무 ADE를 나타내고 따라서 대량 백신 접종 목적으로 안전하게 사용할 수 있다는 것이 본 발명의 이점이다. 특히, 본 명세서에 기술된 백신은 고품질 면역원성 에피토프를 보유하고, 따라서 높은 중화 항체 역가를 초래하고 대상체에 투여시 ADE의 위험을 감소시킨다. ADE 발달의 위험은 실시예에 기술된 바와 같이 비-인간 영장류에서 평가될 수 있다(또한 Luo F, et al.(2018), Virologica Sinica 33:201-204 참조).
또 다른 바람직한 실시형태에서, 인간 대상체에 투여시 백신은 면역병리를 초래하지 않는다. 일부 상황 하에서 백신(예를 들어, SARS-CoV 백신)은 예를 들어 Th2-유형 면역병리학, 예를 들어 동물에서 SARS-CoV 성분에 대한 과민 반응을 초래할 수 있다는 것이 알려져 있다. 본 발명의 실시형태에서, 예를 들어 Th1-지향 어쥬번트(예를 들어, 본 명세서에 기재된 바와 같은 AS01 또는 다른 어쥬번트)의 사용에 의한 Th1 유형 반응이 선호된다. 특별히, Th2-자극성 어쥬번트, 예를 들어 명반을 Th1-자극성 어쥬번트와 조합하여 사용함에 의해 유도되는 것과 같은 균형잡힌 Th2/Th1-유형 면역 반응이 바람직하다. 면역병리 발생 위험은 Tseng C.T. et al. (2012) PLoS ONE 7(4):e35421에 기재된 바와 같이, 동물 모델에서 평가할 수 있다. 본 발명의 바람직한 실시형태에서, 발명의 백신은 명반이 보강된 백신과 비교하여 Th1-유형 면역 반응에 대한 Th2/Th1-유형 면역 반응에서의 전이를 나타낸다.
본 명세서에 기재된 임의의 SARS-CoV-2 백신 또는 조성물은 치료적으로 유효한 양 또는 치료적으로 유효한 양의 용량으로 대상체에게 투여될 수 있다. 본 명세서에 사용된 바와 같이, 백신의 "치료적으로 유효한 양"은 SARS-CoV-2에 대한 감염의 예방, 면역 반응 또는 강화된 면역 반응 또는 SARS-CoV-2 질환과 연관된 증상의 예방 또는 감소를 포함하지만 이에 제한되지 않는 본 명세서에 기재된 것과 같은 대상체에서 원하는 반응 또는 결과를 초래하는 임의의 양이다. 보다 구체적으로, 발명의 SARS-CoV-2 백신의 치료량은 약 0.05 내지 50μg, 보다 바람직하게는 약 0.5 내지 10μg의 총 바이러스 단백질 질량일 수 있다.
일부 실시형태에서, 본 명세서에 기재된 SARS-CoV-2 백신 또는 조성물의 치료적으로 유효한 양은 항원-특이적 항체(예를 들어, 항-SARS-CoV-2 항체)를 생성하기에 충분한 양이다. 일부 실시형태에서, 치료적으로 유효한 양은 적어도 70% 확률로 대상체를 혈청전환시키기에 충분하다. 일부 실시형태에서, 치료적으로 유효한 양은 적어도 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 또는 적어도 99% 확률로 대상체를 혈청전환시키기에 충분하다. 대상체가 혈청전환되었는지 여부는 대상체로부터 혈청 샘플을 수득하고 항-SARS-CoV-2 항체를 검출하기 위한 검정을 수행하는 것과 같은 당업계에 공지된 임의의 방법에 의해 평가될 수 있다. 일부 실시형태에서, 대상체로부터의 혈청 샘플이 역치 또는 미리결정된 기준선을 초과하는 양의 항-SARS-CoV-2 항체를 함유하는 경우 대상체는 혈청전환된다. 대상체로부터의 혈청 샘플에 존재하는 항-SARS-CoV-2 항체(즉, 항-SARS-CoV-2 S 단백질 IgG 항체)가 동일한 대상체로부터 이전에 채취한 혈청 샘플에 비해 적어도 4-배 증가가 있는 경우 대상체는 일반적으로 혈청전환된 것으로 간주된다.
일 실시형태에서, 본 발명의 백신 조성물에서 불활성화된 SARS-CoV-2 바이러스의 용량은 약 0.01 내지 25mAU(SEC-HPLC에 의해 평가된 밀리-흡수 단위 x 분), 바람직하게는 약 0.05 내지 10mAU, 보다 바람직하게는 약 0.1 내지 5mAU, 가장 바람직하게는 약 0.25 내지 2.5mAU이다. 일 실시형태에서, 용량은 (μ)BCA 검정에 의해 측정될 때 약 0.05 내지 50μg 총 단백질, 약 0.1 내지 25μg, 약 0.25 내지 12.5μg, 바람직하게는 약 0.5 내지 5μg 총 단백질이다. 보다 바람직하게는 백신 조성물에서 불활성화된 SARS-CoV-2 바이러스의 용량은 적어도 2.5μg 총 단백질, 적어도 3.5μg 총 단백질 또는 적어도 2.5μg 총 단백질이며, 예를 들어 백신 조성물은 2.5μg 내지 25μg, 3.5μg 내지 10μg 또는 4μg 내지 6μg 총 단백질/용량, 바람직하게는 약 5μg 총 단백질/용량을 포함한다. 일부 실시형태에서, 투여량은, 예를 들어 ELISA에 의해 분석된 바와 같이, 불활성화된 SARS-CoV-2 제형에서 S 단백질의 총량에 의해 결정된다. 항원의 질량은 또한 대략 2μg/ml 총 표면 단백질 및 대략 1μg/mL S-단백질인 것으로 추정되는 용량 당량당 SE-HPLC 피크 면적(밀리-흡수 단위 x 분으로 기록됨; mAU)을 분석함에 의해 평가될 수 있다. 일 실시형태에서, 용량은 ELISA에 의해 측정될 때 약 0.025 내지 25μg S-단백질, 약 0.05 내지 12.5μg, 약 0.125 내지 6.25μg, 바람직하게는 약 0.25 내지 2.5μg S-단백질이다.
바람직한 실시형태에서, SARS-CoV-2 백신에서 항원의 양은 ELISA에 의해 결정된다. 일 실시형태에서, ELISA는 SARS-CoV-2 단백질 또는 단백질의 일부, 예를 들어, 뉴클레오캡시드(N), 막(M) 또는 스파이크(S) 단백질을 측정하며; 즉, ELISA는 SARS-CoV-2 단백질 또는 단백질의 일부에 특이적인 코팅 항체를 이용한다. 바람직한 실시형태에서, 코팅 항체는 SARS-CoV-2 스파이크 단백질 S1 서브유닛, 예를 들어, 서열번호:3, 19, 21, 23, 25 또는 27의 S-단백질 서열의 잔기 14-685(또는 14-683), 또는 수용체 결합 도메인(RBD), 예를 들어 서열번호:3, 19, 21, 23, 25 또는 27의 S-단백질 서열의 잔기 331 내지 528(또는 319 내지 541)에 특이적이다(도 9 참조). 일 실시형태에서, ELISA 판독은 검출된 단백질의 단위 측정당 질량, 예를 들어, μg/mL S-단백질이다. 바람직한 실시형태에서, 사용된 표준은 스파이크 단백질 삼량체이고 SARS-CoV-2 ELISA의 결과는 표준 단백질의 ACE-2 결합 능력에 상응하는 "항원 단위"(AU)로 보고된다(제조업체에 의해 결정됨).
일 실시형태에서, 대상체에게 투여되는 SARS-CoV-2 항원의 양은 약 1 내지 100AU/용량, 바람직하게는 약 2 내지 75AU/용량, 바람직하게는 약 3 내지 60AU/용량, 보다 바람직하게는 약 3 내지 55AU/용량, 보다 바람직하게는 약 3 내지 53AU/용량이다. 훨씬 더 바람직한 실시형태에서, 대상체에게 투여되는 SARS-CoV-2 항원의 양은 용량당 3AU, 10AU 또는 40AU이고, 가장 바람직하게는 용량당 40AU이다. 추가의 바람직한 실시형태에서, 대상체에게 투여되는 SARS-CoV-2 항원의 양은 적어도 10AU/용량, 적어도 20AU/용량, 적어도 25AU/용량 또는 적어도 30AU/용량, 예를 들어 약 10 내지 60AU/용량, 20 내지 50AU/용량, 25 내지 45AU/용량 또는 30 내지 40AU/용량, 예를 들어 약 35AU/용량이다. SARS-CoV-2 항원의 양(예를 들어, AU/용량 단위)은 예를 들어 실시예 1에 기술된 바와 같이 SARS-CoV-2 ELISA 검정에 의해 평가될 수 있다. AU당 약 1 내지 1.5 x 107 바이러스 입자가 있고, 그리고 상기에서 기술된 SARS-CoV-2 항원의 양은 그에 따라 해석될 수 있다고 추정된다. 따라서 일부 실시형태에서, 대상체에게 투여되는 SARS-CoV-2 항원의 양은 약 1.5 x 107 내지 1.5 x 109 바이러스 입자/용량, 또는 약 4.5 x 107 내지 9.0 x 108 바이러스 입자/용량, 예를 들어 적어도 1.5 x 108 바이러스 입자/용량 또는 적어도 3.0 x 108 바이러스 입자/용량, 약 1.5 x 108 내지 7.5 x 108 바이러스 입자/용량 또는 약 4.5 x 108 내지 6.0 x 108 바이러스 입자/용량이다.
일부 실시형태에서, 대상체의 혈청전환은 플라크 감소 중화 시험(PRNT)을 수행함에 의해 평가된다. 간단히 말해서, PRNT는 대조군 혈청/항체와 비교하여 SARS-CoV-2 플라크의 수를 50%(PRNT50)까지 감소시키는데 필요한 혈청 역가를 결정하는데 사용된다. PRNT50은 Vero 세포의 단층 또는 SARS-CoV-2에 감염될 수 있는 임의의 다른 세포 유형/주를 사용하여 수행될 수 있다. 대상체로부터의 혈청을 희석하고 살아있는 비-불활성화된 SARS-CoV-2와 함께 인큐베이션한다. 혈청/바이러스 혼합물을 Vero 세포에 적용하고 일정 기간 동안 인큐베이션할 수 있다. Vero 세포 단층 상에 형성된 플라크를 계수하고 혈청 또는 대조군 항체가 없는 SARS-CoV-2에 의해 형성된 플라크의 수와 비교한다. PRNT50에서 혈청의 1:10 희석의 중화 항체의 역치는 일반적으로 JEV의 경우에 보호의 증거로 인정된다(Hombach et. al. Vaccine (2005) 23:5205-5211).
일부 실시형태에서, SARS-CoV-2 입자는 약학적 조성물과 같은 조성물로 투여하기 위해 제형화될 수 있다. 본 명세서에 사용된 용어 "약학적 조성물"은 불활성화된 SARS-CoV-2와 같은 적어도 하나의 활성 성분과 하나 이상의 약학적으로 허용되는 부형제를 포함할 수 있는 하나 이상의 불활성 성분의 혼합 또는 조합으로부터 생성된 생성물을 의미한다. 바람직한 약학적으로 허용가능한 부형제는 인간 혈청 알부민(HSA), 예컨대 특히 재조합 HSA(rHSA)이다. 일 실시형태에서, 발명의 SARS-CoV-2 백신은 약 10 내지 50μg HSA/용량, 바람직하게는 약 20 내지 40μg HSA/용량, 더욱 바람직하게는 약 25 내지 35μg HSA를 함유한다.
백신을 포함하는 발명의 약학적 조성물은 당업계에 잘 알려져 있고 일상적으로 실시되는 방법에 따라 제조될 수 있다(예를 들어, Remington: The Science and Practice of Pharmacy, Mack Publishing Co. 20th ed. 2000; 및 Ingredients of Vaccines - Fact Sheet from the Centers for Disease Control and Prevention 참조, 예를 들어, 백신이 그의 기능을 향상시키는데 도움이 되는 상기에서 기술된 바와 같은 어쥬번트 및 증강제, 백신이 변경되지 않은 상태로 유지되는 데 도움이 되는 보존제 및 안정제(예를 들어, 인간 혈청 알부민(HSA))와 같은 알부민 또는 재조합 HSA(rHSA), 페놀, 글리신)). 본 명세서에 사용된 용어 "백신"은 면역원성 조성물, 예를 들어, 항원에 대한 (예를 들어, SARS-CoV-2 항원에 대한)(인간) 대상체에서 면역 반응을 유도할 수 있는 조성물을 지칭한다. 예로써, 백신 또는 조성물은 SARS-CoV-2에 대한 중화 항체를 생성할 수 있다. 일부 실시형태에서, 백신 또는 조성물은 SARS-CoV-2 S(스파이크) 단백질에 대한 항체(예를 들어, IgG)를 생성할 수 있다. 일부 실시형태에서, 백신 또는 조성물은 SARS-CoV-2 단백질 또는 펩티드에 대한 T 세포 반응, 예로써 SARS-CoV-2 S-단백질, 막(M) 단백질 및/또는 뉴클레오캡시드(N) 단백질이나 이로부터 유래된 펩티드에 대한 T 세포 반응을 생성할 수 있다. 전형적으로 백신 또는 면역원성 조성물은 항원에 의해 유발되는 질환에 대한 보호 효과, 예를 들어 SARS-CoV-2 감염(예를 들어, 증상 및/또는 무증상 감염) 및/또는 COVID-19 질환에 대한 보호 효과를 유도할 수 있다).
약학적 조성물은 바람직하게는 GMP 조건 하에서 제조된다. 전형적으로, 불활성화된 SARS-CoV-2 백신 제제의 치료학적으로 유효한 용량은 발명의 약학적 조성물에 이용된다. 불활성화된 SARS-CoV-2 입자는 당업자에게 공지된 통상적인 방법에 의해 약학적으로 허용가능한 투여 형태로 제형화된다. 최적의 원하는 반응(예를 들어, 예방적 반응)을 제공하도록 투여량 요법을 조정한다.
본 발명의 약학적 조성물에서 활성 성분의 투여량은 대상체에 대해 독성 없이 특정 대상체, 조성물 및 투여의 방식에 대해 원하는 약학적 반응을 달성하는 데 효과적인 활성 성분의 양을 얻도록 다양할 수 있다. 선택된 투여량 수준은 이용된 본 발명의 특정 조성물의 활성, 투여의 경로, 투여의 시간, 이용되는 특정 화합물의 배설의 속도, 치료의 기간, 이용된 특정 조성물과 조합하여 사용되는 기타 약물, 화합물 및/또는 물질, 연령, 성별, 체중, 병태, 전반적인 건강 및 치료되는 대상체의 이전 병력 등을 포함한 다양한 약동학적 인자에 의존한다.
의사, 수의사 또는 기타 훈련된 개업의는 원하는 치료 효과를 달성하는데 필요한 수준보다 낮은 수준에서 약학적 조성물에 이용된 불활성화된 SARS-CoV-2 백신의 투여를 시작할 수 있고 원하는 효과(예를 들어, 항-SARS-CoV-2 바이러스 항체의 생성)이 달성될 때까지 투여량을 증가시킬 수 있다. 일반적으로, 본 명세서에 기재된 바와 같은 사람의 그룹의 예방적 치료를 위한 본 발명의 조성물의 유효한 용량은 투여의 수단, 표적 부위, 환자의 생리학적 상태, 환자가 인간인지 또는 동물인지 여부, 투여된 다른 약물 및 원하는 항-SARS-CoV-2 항체의 역가를 포함하는 많은 상이한 인자에 따라 달라진다. 안전성과 효능을 최적화하기 위해 투여량을 적정해야 한다. 일부 실시형태에서, 투여 요법은 0일차에 1회 및 약 7일차에 1회인, 2번의 불활성화된 SARS-CoV-2 백신의 용량의 피하 또는 근육내 투여를 수반한다. 일부 실시형태에서, 투여 요법은 0일차에 1회 및 약 14일차에 1회인, 2번의 불활성화된 SARS-CoV-2 백신의 용량의 피하 투여를 수반한다. 일부 실시형태에서, 투여 요법은 0일차에 1회 및 약 28일차에 1회인, 2번의 불활성화된 SARS-CoV-2 백신의 용량의 피하 투여를 수반한다. 일부 실시형태에서, 불활성화된 SARS-CoV-2 백신은 대상체에게 1회 투여된다. 바람직한 실시형태에서, SARS-CoV-2 백신은 대상체에게 1회 초과, 바람직하게는 2회 투여된다. 바람직한 실시형태에서, 백신은 0일차 및 21일차에 투여된다. 또 다른 바람직한 실시형태에서, 백신은 0일차 및 28일차에 투여된다.
추가 실시형태에서, 불활성화된 SARS-CoV-2 백신의 제1(프라임) 용량이 투여되고, 불활성화된 SARS-CoV-2 백신의 제2(부스트) 용량이 제1 용량 후 적어도 28일, 적어도 60일, 적어도 70일, 적어도 80일 또는 90일에 투여된다. 따라서 일부 실시형태에서, 불활성화된 SARS-CoV-2 백신의 제2 용량은 제1 용량 후 30 내지 120일 또는 1 내지 4개월(바람직하게는 약 3개월)에 투여된다.
다른 실시형태에서, 불활성화된 SARS-CoV-2 백신은 부스트 용량으로서만 투여되며, 예를 들어 (다른) SARS-CoV-2 백신의 제1(프라임) 용량이 투여된 다음 불활성화된 SARS-CoV-2 백신의 제2(부스트) 용량이 예를 들어 제1 용량 후 적어도 7, 14, 21, 28, 60 또는 90일에 투여된다. SARS-CoV-2 백신의 제1(프라임) 용량은 SARS-CoV-2 바이러스에 대한 대상체에서 면역 반응 및/또는 보호 효과를 자극하는 임의의 다른 백신 또는 면역원성 조성물을 포함할 수 있다. 예를 들어, SARS-CoV-2 백신의 제1 용량은 재조합 바이러스 벡터 또는 하나 이상의 SARS-CoV-2 단백질 및/또는 이의 단편, 예를 들어 SARS-CoV-2 스파이크(S) 단백질을 인코딩하는 mRNA 서열을 포함할 수 있다. 대안적으로 SARS-CoV-2 백신의 제1 용량은, 예를 들어 하나 이상의 SARS-CoV-2 단백질 및/또는 이의 단편, 예를 들어 SARS-CoV-2 스파이크(S) 단백질 또는 이의 단편을 포함하는 서브유닛 백신을 포함할 수 있다.
또한, 예를 들어 SARS-CoV-2 감염의 중증도를 예방하거나 감소시키기 위해 대상체에 예방적 투여에 사용하기 위한 키트가 본 개시내용의 범주 내에 있다. 이러한 키트는 불활성화된 SARS-CoV-2 백신과 같은 불활성화된 SARS-CoV-2를 함유하는 조성물을 포함하는 하나 이상의 용기를 포함할 수 있다. 일부 실시형태에서, 키트는 제2 조성물, 예컨대 제2 백신, 예를 들어, 제1 용량에서와 상이한 기술을 적용한 SARS-CoV-2 백신의 제2 종류를 포함하는 하나 이상의 추가 성분을 추가로 포함할 수 있다. 일부 실시형태에서, 제2 백신은 아르보바이러스에 대한 백신이다. 일부 실시형태에서, 제2 백신은 일본 뇌염 바이러스 백신, 지카 바이러스 백신, 뎅기열 바이러스 백신 및/또는 치쿤구니야 바이러스 백신이다.
일부 실시형태에서, 키트는 본 명세서에 기재된 임의의 방법에 따른 사용 설명서를 포함할 수 있다. 포함된 설명서는 SARS-CoV-2 감염을 예방하거나 발병을 지연시키거나 중증도를 감소시키기 위한 불활성화된 SARS-CoV-2 백신을 함유하는 조성물의 투여에 대한 설명을 포함할 수 있다. 키트는 대상체가 SARS-CoV-2에 노출되거나 SARS-CoV-2 감염에 걸릴 위험이 있는지 여부를 식별하는 것을 기반으로 하여 투여에 적합한 대상체 선택하는 설명을 추가로 포함할 수 있다. 또 다른 실시형태에서, 설명서는 SARS-CoV-2에 노출되거나 SARS-CoV-2 감염에 걸릴 위험이 있는 대상체에게 불활성화된 SARS-CoV-2 백신을 함유하는 조성물을 투여하는 것에 대한 설명을 포함한다.
불활성화된 SARS-CoV-2 백신을 함유하는 조성물의 사용과 관련된 설명서에는 일반적으로 의도된 치료를 위한 투여량, 복용 일정 및 투여의 경로에 대한 정보가 포함된다. 용기는 단위 용량, 벌크 패키지(예를 들어, 다중-용량 패키지) 또는 하위-단위 용량일 수 있다. 발명의 키트에 제공된 설명서는 전형적으로 라벨 또는 패키지 삽입물(예를 들어, 키트에 포함된 종이 시트)에 작성된 설명서이지만 기계-판독가능한 설명서도 허용가능하다.
본 개시내용의 키트는 적합한 포장에 있다. 적합한 포장은 바이알, 병, 항아리, 연포장 등을 포함하지만 이에 제한되지 않는다. 또한 주사기 또는 주입 장치와 같은 특정 장치와 조합하여 사용하기 위한 패키지가 고려된다. 용기는 멸균 접근 포트를 가질 수 있으며, 예를 들어 용기는 피하 주사 바늘로 뚫을 수 있는 마개가 있는 바이알일 수 있다. 조성물에서 적어도 하나의 활성제는 본 명세서에 기재된 바와 같은 불활성화된 SARS-CoV-2이다.
본 발명은 이하의 설명에 제시되거나 도면에 예시된 성분의 배열 및 구성의 세부사항에 대한 적용에 제한되지 않는다. 발명은 다른 실시형태가 가능하고 다양한 방식으로 실시되거나 수행될 수 있다. 또한, 본 명세서에서 사용된 어구 및 용어는 설명을 위한 것이고 제한적인 것으로 간주되어서는 안된다. 본 명세서에서 "포괄하는", "포함하는" 또는 "갖는", "함유하는", "관련하는" 및 이들의 변형의 사용은 이후에 나열된 항목 및 그 등가물뿐만 아니라 추가 항목을 포괄하는 것을 의미한다.
본 명세서에서 달리 정의되지 않는 한, 본 개시내용과 관련하여 사용되는 과학 및 기술 용어는 당업계의 통상인에게 일반적으로 이해되는 의미를 가질 것이다. 더욱이, 문맥상 달리 요구되지 않는 한, 단수 용어는 복수를 포함하고 복수 용어는 단수를 포함해야 한다. 본 개시내용의 방법 및 기술은 일반적으로 당업계에 잘-공지된 통상적인 방법에 따라 수행된다. 일반적으로, 본 명세서에 기재된 생화학, 효소학, 분자 및 세포 생물학, 미생물학, 바이러스학, 세포 또는 조직 배양, 유전학 및 단백질과 핵산 화학과 관련하여 사용되는 명명법 및 기술은 당업계에 잘-공지되고 일반적으로 사용되는 것이다. 본 개시내용의 방법 및 기술은 일반적으로 당업계에 잘 공지된 통상적인 방법에 따라, 그리고 달리 지시되지 않는 한 본 명세서 전반에 걸쳐 인용되고 논의되는 다양한 일반적이고 보다 구체적인 참고문헌에 기재된 바와 같이 수행된다.
본 발명은 다음의 실시예에 의해 추가로 예시되며, 이는 어떠한 방식으로도 추가 제한으로 해석되어서는 안된다. 본 출원 전반에 걸쳐 인용된 모든 참고문헌(문헌 참고문헌, 발행된 특허, 공개된 특허 출원 및 동시-계류 중인 특허 출원을 포함함)의 전체 내용은 특히 상기 본 명세서에서 참조된 교시를 위해 이에 의해 명백히 참고로 포함된다. 그러나, 임의의 참고문헌의 인용은 그 참고문헌이 선행 기술임을 인정하려는 의도가 아니다.
실시예
실시예 1. 약물 물질 생성
SARS-CoV-2의 생성을 위해, JEV 공정 플랫폼(Srivastava et al., Vaccine 19 (2001) 4557-4565; US 6,309,650B1)이 기본으로 사용되었으며, WO2017/109223A1(그 전체로 본 명세서에 포함됨)에 개시된 지카 바이러스 정제에 적용된 바와 같은 공정에서의 개선 또한 고려되었다. 간단히 말해서, 비-감염성 SARS-CoV-2 입자 응집체, 숙주 세포 단백질 및 기타 저분자량 불순물은 황산프로타민 침전 또는 벤조나아제 처리에 의해 제거되고 생성된 제제는 선택적으로 수크로스 구배 원심분리에 의해 추가로 정제된다. 생성 공정의 개요에 대해 도 1 참조.
이탈리아 로마의 국립 전염병 연구소 "Lazzaro Spallanzani" IRCCS에서 식별 및 특성화된 이탈리아로부터 최초의 SARS-CoV-2 단리물(수탁번호: MT066156), 서열번호: 9에 의해 제공된 DNA 서열에 상응하는 이의 RNA 서열은 본 명세서에 개시된 모든 실시예에서 사용되었다. 다른 신종 코로나바이러스 SARS-CoV-2 단리물은 또한 다음 출처로부터 획득될 수 있다:
1. -EVAg(유럽 바이러스 아카이브), 예를 들어 다음의 균주 중 하나:
BetaCoV/France/IDF0372/2020 (Ref-SKU:014V-03890, https://www.european-virus-archive.com/virus/human-2019-ncov-0); 2019-nCoV/Italy-INMI1, (Ref-SKU:008V-03893, 서열번호: 9; https://www.european-virus-archive.com/virus/human-2019-ncov-strain-2019-ncovitaly-inmi1); BetaCoV/Netherlands/01, (Ref-SKU: 010V-03903, https://www.european-virus-archive.com/virus/sars-cov-2-strain-nl2020)
2. -BEI 자원(생물방어 및 신흥 감염 연구 자원): 예를 들어 단리물 USA-WA1/2020, NIAID, NIH: SARS-관련된 코로나바이러스 2, NR-52281(GenBank 수탁 MN985325).
3. -PHE(영국 공중 보건국): https://www.gov.uk/government/collections/contacts-public-health-england-regions-local-centres-and-emergency: 예를 들어 UK B.1.1.7 (UK_MIG457: EVAg Ref-SKU: 004V-04032; 서열번호: 22) 또는 남아프리카 B.1.531 (SA_P2: EVAg Ref-SKU: 004V-04071; 서열번호: 18) 계통의 단리물
SARS-CoV-2로 세포 축적 및 감염. 본 명세서에 기술된 방법에 사용된 Vero 세포는 카탈로그 번호 88020401 하에서 Health Protection Agency 일반 세포 수집에서 얻은 VERO(WHO) 세포주였으며, 이로부터 마스터 세포 은행이 생성되었다. SARS-CoV-2(2019-nCoV/Italy-INMI1 사용 균주)의 연구 마스터 종자 은행(rMSB)을 Vero 세포에 준비하고 게놈 서열을 시퀀싱에 의해 검사했다. SARS-CoV-2의 생성을 위해, Vero 세포를 10% 태아 소 혈청(FBS)을 함유하는 이글의 최소 필수 배지(EMEM)에서 성장시키고 단층을 0.001 내지 1, 바람직하게는 0.01의 감염 다중도(moi), 세포당 플라크 형성 단위(pfu)에서 SARS-CoV-2로 감염시켰다. 바이러스 흡착을 허용한 후, 배양물을 PBS로 2-4회 세정하고, 무-혈청 EMEM을 공급하고, 바이러스 역가가 원하는 수준에 도달할 때까지 35℃에서 5% CO2로 인큐베이션하였다.
SARS-CoV-2 수확. 배양 배지를 2, 3, 5 및 7일차에 수확하고 수확물을 모아 표준 원심분리기에서 원심분리했다. 생성된 상등액을 여과하고 이어서 TFF 한외여과에 의해 세포 배양 배지 성분을 제거하고 배치 부피를 감소시켰다. 숙주 세포 DNA 및 단백질 감소뿐만 아니라 농축된 물질에서 비-감염성 바이러스 응집체의 감소는 프로타민 설페이트로 침전에 의해 달성되었다. 프로타민 설페이트를 정용여과된 SARS-CoV-2 물질에 ~2mg/mL의 최종 공칭 농도로 교반하면서 첨가하고 이어서 2-8℃에서 30분 동안 인큐베이션했다. 대안적으로, 정용여과된 SARS-CoV-2 물질을 벤조나아제로 처리했다.
선택적 일차 불활성화. SARS-CoV-2 바이러스는 BSL2에서 처리하기에 안전한 바이러스를 만들기 위해 Vero 세포로부터 바이러스-함유 세포 배양 배지를 제거한 직후 베타-프로피오락톤으로 처리에 의해 불활성화되었다. 그러나, 불활성화는 예를 들어 원심분리 후, 프로타민 설페이트 또는 벤조나아제로 처리하기 전, 동안 또는 후에 또는 수크로스 구배 원심분리 전 또는 후에와 같은 정제 공정에서 임의의 단계에서 가능하다. 불활성화는 화학적 불활성화제 예컨대 포름알데히드(포르말린); 효소; 베타-프로피오락톤; 에탄올; 트리플루오로아세트산; 아세토니트릴; 표백제; 요소; 구아니딘 염산염; 트리-n-부틸 포스페이트; 에틸렌-이민 또는 이의 유도체; 유기 용매, 선택적으로 트윈, 트리톤, 나트륨 데옥시콜레이트 또는 설포베타인; 또는 이의 조합의 사용에 의해 수행된다. 불활성화를 베타-프로피오락톤을 사용하여 수행하는 것이 특히 바람직하며, 이는 바이러스 표면 단백질 및 이의 면역원성 에피토프를 상대적으로 아끼면서 바이러스 RNA를 우선적으로 표적화한다. 불활성화는 또한 pH 변화(매우 높거나 낮은 pH), 열처리 또는 감마 조사 또는 UV 조사, 특히 UV-C 조사와 같은 조사에 의해 달성될 수 있다. SARS-CoV-2 바이러스는 선택적으로, 예를 들어 베타-프로피오락톤 처리 및 UV-C 조사와 같은 2가지 별도 불활성화 단계에 의해 불활성화된다.
고도로 내성인 모델 바이러스(PPV)의 불활성화를 위한 BPL 시작 농도의 평가. PPV 바이러스 불활성화 역학의 평가를 위한 예비 연구가 수행되어 본 발명자들의 제안된 SARS-CoV-2 BPL 불활성화 절차를 초기에 지원하였다. 돼지 파보바이러스(PPV)는 물리-화학적 불활성화에 대한 저항성이 높기 때문에 수성 용액에서 BPL의 불활성화 능력을 평가하기 위한 모델 바이러스로 선택되었다. BPL의 3가지 시작 농도가 평가되었다: 300ppm(1/3333), 500ppm(1/2000) 및 700ppm(1/1429). 바이러스 용액을 이들 농도에서의 BPL로 스파이킹하고 5±2℃에서 24시간 동안 인큐베이션했다. 0.5, 2, 6, 24h 및 BPL 가수분해 단계 후 역학 샘플을 취하고 잔존하는 감염성을 분석했다. 결과는 표 A에 나타나 있다.
*검출 한계 이하
**500ppm BPL에 대한 검출 한계는 700ppm BPL에 대한 것보다 낮다는 것을 주지한다
불활성화 유효성에 대한 초기 BPL 농도의 명확한 효과는 5±2℃(가수분해 전)에서 24시간 인큐베이션 후 3.3과 5.9 log10 사이의 감소로 관찰되었다. 다음 가수분해 단계는 평균으로 1.7 log10 첨가에 의해 역가를 추가로 감소시키는 반면, 유지 대조군 역가는 전반적인 절차에 걸쳐 일정하게 유지되었다. 이는 고도로 내성인 바이러스 오염의 경우 가수분해 단계가 추가 불활성화 단계로 작용할 수 있음을 나타낸다. 4.84(300ppm), 7.43(500ppm) 및 검출 한계 이하(700ppm)의 전반적인 감소 인자로, 적용된 BPL 처리는 농도 > 300ppm에서 파보바이러스과의 불활성화에 효과적인 것으로 간주되었다. 따라서 본 발명자들은 모든 추가 연구에서 SARS-CoV-2 바이러스 불활성화를 위해 500ppm을 선택하기로 결정했다.
BPL에 의한 SARS-CoV-2 바이러스 불활성화
BPL에 의한 모델 바이러스의 불활성화에 대한 기존 데이터(PPV 불활성화에 대한 상기 섹션 참조)를 기반으로 SARS-CoV-2 바이러스 수확 물질의 불활성화를 위해 500ppm(1/2000)의 BPL 농도가 선택되었다. 용액 내 BPL의 안정성은 고도로 온도 의존성이기 때문에 5±3℃의 인큐베이션 온도와 24시간의 인큐베이션 시간이 선택되어 전반적인 불활성화 전반에 걸쳐 충분한 BPL이 존재하도록 보장했다. 농축된 수확물에 BPL을 첨가하고 혼합한 후, 불활성화 용액은 제어된 조건 하에서 불활성화가 일어나는 신선한 용기로 이동된다. 이 이동은 초기 혼합 동안 잠재적인 사각-지점에 있는 바이러스 입자가 BPL과 접촉하지 않을 가능성을 배제한다.
BPL의 가수분해 동안 불활성화된 바이러스 용액의 pH를 안정화하기 위해, 5±3℃로 사전-냉각된 프로타민 설페이트(PS) 처리되고 농축된 수확물에 25mM HEPES pH 7.4를 보충한다. 불활성화된 후 남아있는 BPL을 줄이기 위해 용액을 37±2℃로 설정된 온도-조절된 인큐베이터에서 총 시간 2.5시간 ± 0.5시간 동안 32℃ 이상의 온도로 가온한다. 현재 공정 부피 약 1L에 대한 가수분해 단계의 총 시간은 32℃ 이상으로의 가온 및 인큐베이션을 포함하여 5시간 15분 내지 6시간 15분이었다.
가수분해의 완료 후, 불활성화된 바이러스 용액(IVS)을 즉시 온도-조절된 냉장고에서 5±3℃로 냉각하고 현재로는 총 18일이 소요되는 대용량 플라크 검정 및 일련의 계대 검정에 의해 불활성화가 확인될 때까지 그곳에 보관하였다. 불활성화 과정 전반에 걸친 바이러스 입자의 회수는 크기-배제 크로마토그래피에 의해 모니터링되었다.
15mL에서 최대 1000mL까지의 실험실 규모에서의 초기 연구는 BPL 추가 후 2시간 이내에 최대 8 log10 pfu/mL의 바이러스 역가가 검출가능한 수준 아래로 감소한 SARS-CoV-2에 대한 매우 빠른 불활성화 동역학을 나타냈다. 이들 결과는 대략 1L의 최종 불활성화 부피에서 GMP 생성 실행에 대해 확인되었다. 모델 바이러스에 대한 불활성화 데이터와 함께 고려할 때 적용된 BPL 처리는 효율적인 것으로 간주될 수 있고 SARS-CoV-2 농축된 수확 물질의 불활성화에 대한 상당한 안전 한계를 포함한다.
추가의 바람직한 실시형태에서, 불활성화 단계(들)는 표면 항원 완전성, 특히 S 단백질의 완전성을 보존하기 위해 특히 온화하다. 일 실시형태에서, 온화한 불활성화 방법은 용기에서 SARS-CoV-2 입자를 포함하는 액체 조성물을 화학적 바이러스 불활성화제(예컨대 예를 들어, 상기 열거된 임의의 화학적 불활성화제 또는 이의 조합, 바람직하게는 베타-프로피오락톤)와 접촉시키는 것, 난류가 아닌 층류의 조건 하에서 화학적 바이러스 불활성화제 및 SARS-CoV-2 입자를 포함하는 액체 조성물을 혼합하는 것, 및 화학적 바이러스 불활성화제 및 SARS-CoV-2 입자를 포함하는 액체 조성물을 바이러스를 불활성화하기에 충분한 시간 동안 인큐베이션하는 것을 포함한다. 온화한 불활성화 단계는 선택적으로 유연한 생물반응기 백에서 수행된다. 온화한 불활성화 단계는 바람직하게는 불활성화의 기간 동안 5회 이하의 용기 반전을 포함한다. 바람직하게는, 화학적 바이러스 불활성화제 및 SARS-CoV-2 입자를 포함하는 조성물의 혼합은 용기를 인큐베이션의 기간 동안 10rpm 이하에서 10분 이하 동안 요동, 회전, 궤도 진탕 또는 진동시키는 것을 포함한다.
SARS-CoV-2의 정제. 선택적으로, 물질은 ~1% CC700 또는 CC400의 최종 농도에서 Capto™ Core 700 또는 CC400 크로마토그래피 매질을 사용한 배치 흡착(본 명세서에서는 배치 크로마토그래피로도 알려짐)에 의해 즉시 추가 처리되었다. 물질을 자기 교반기를 사용하여 일정한 교반 하에 4℃에서 15분 동안 인큐베이션하였다. 인큐베이션 후, CC700 또는 CC400 고형물을 사용하는 경우 이를 중력에 의해 10분 동안 침전시키고 CaptoCore 입자에 의한 필터의 막힘을 피하기 위해 SARS-CoV-2 물질을 용액의 상단에서 제거한다. 임의의 잔존하는 CaptoCore 입자와 DNA 침전물은 그 다음 0.2μm Mini Kleenpak EKV 필터 캡슐(Pall)을 사용하여 여과에 의해 용액에서 제거했다. 풀링된 여과된 수확 물질을 양 성분의 스톡 용액을 사용하여 25mM Tris pH 7.5 및 10% 수크로스(w/w)의 최종 농도로 조정했다. 이를 통해 필요한 경우 <-65℃에서 농축된 수확물을 동결할 수 있다.
생성된 여액은 SARS-CoV-2 물질의 최종 농축 및 폴리싱을 위해 수크로스 밀도 구배 원심분리(본 명세서에서는 배치 원심분리로도 알려짐)에 의해 추가로 처리된다. 농축된 프로타민 설페이트(PS) 또는 벤조나아제, 바람직하게는 PS 처리된 수확물은 밀도가 다른 3개의 수크로스 층으로 구성된 용액의 상단 상에 장입되었다. 100mL 병 규모에서의 원심분리를 위한 개별 층의 부피는 표 1a에 나타나 있다.
[표 1a]: 수크로스 밀도 원심분리를 위한 부피.
수크로스 구배 병을 가장 낮은 수크로스 밀도(10% 수크로스(w/w))를 갖는 SARS-CoV-2 물질로 시작하여 병의 바닥 안으로 용액을 펌핑함에 의해 개별 수크로스 층을 층화하고, 이어서 오름 차순에 있는 다른 수크로스 용액에 의해 준비했다. 기술된 설정은 도 3에 나타나 있다. 준비된 SG 병을 4℃로 사전-냉각된 로터 안으로 이전하고 브레이크/감속 없이 적어도 20시간 동안 4℃에서 최대 ~11,000 RCF로 원심분리했다.
원심분리 후, 수크로스 구배의 일련의 2mL 분획의 수확을 바닥에서 위로 연동 펌프를 사용하여 수행한다. 충분히 높은 순도를 갖는 바이러스-함유 분획을 동정하기 위해 분획을 즉시 SDS-PAGE/은 염색에 의해 시험하였다. 따라서, 동정된 분획을 풀링하고 추가 처리했다. 정제된 SARS-CoV-2는 <-65℃에서 보관하거나 즉시 제형화했다.
어쥬번트로 SARS-CoV-2의 제형화. SARS-CoV-2 입자는 명반으로 제형화되었다. 선택적으로, Th1 어쥬번트도 제형화에 첨가되거나 병상 혼합을 위한 별도 조성물로 제공되었다.
SARS-CoV-2 ELISA 검정. 본 명세서에 기재된 제제에서 불활성화된 SARS-CoV-2 항원 함량(즉, 주요 항원성 단백질로서 S1의 함량)은 ELISA에 의해 결정(정량화)되었다. 본 명세서에 사용된 SARS-CoV-2 ELISA는 SARS-CoV-2 샘플이 첨가된 마이크로타이터 플레이트 상에 고정화된 SARS-CoV-2 스파이크 항체(AM001414; 코팅 항체)로 4-층 면역-효소적 검정이다. 코팅 항체에 대한 항원의 결합 시, 플레이트를 일차 항체(즉, AbFlex®SARS-COV-2 스파이크 항체(rAb)(AM002414))로 추가 처리하였다. 이어서 효소 연결된 접합체 항체(즉, 염소 항-마우스 IgG HRP 접합체)인 2차 항체를 첨가하였다. 플레이트는 임의의 비결합된 단백질 또는 항체를 제거하기 위해 순한 세제 용액(PBS-T)을 사용하여 다양한 단계 사이에 세정되었다. 플레이트는 테트라메틸 벤지딘(TMB) 기질의 첨가에 의하여 전개되었다. 가수분해된 TMB는 샘플에서의 항원 함량의 농도에 직접적으로 비례하는 안정한 유색 접합체를 형성한다. 항원 정량화는 자동화된 플레이트 판독기에서 생성된 표준 곡선을 참조로 사용하여 λ450nm(λ630nm 참조)에서 분광광도계 검출에 의해 수행되었다. 표준은 20 항원 단위(AU)/mL 스파이크 삼량체 작업 용액 순수로 시작하여 제조되어, 다음 표준 농도를 위해 1:2로 추가로 연속으로 희석되었다: 20AU/mL, 10AU/mL, 5AU/mL, 2.5AU/mL, 1.25AU/mL, 0.625AU/mL, 0.3125AU/mL 및 0.1263AU/mL. 각 희석은 플레이트당 이중으로 시험되었다. 공급업체(R&D Systems)에 따르면 스파이크 삼량체 표준의 "항원 단위"는 재조합 인간 ACE-2 His-tag를 사용한 기능적 ELISA에서의 그 결합 능력에 상응한다.
참조 표준 및 항체:
코팅 항체: SARS-CoV-2 스파이크 항체(AM001414)
스파이크 삼량체(S1+S2), His-tag(SARS-CoV-2)(예를 들어, BPS Lot# 200826; Cat#100728)
SARS-CoV-2 QC(예를 들어, RSQC240920AGR)
일차 검출 항체 AbFlex® SARS-CoV-2 스파이크 항체(rAb)(AM002414)
2차 검출 항체 염소 항-마우스 IgG HRP 접합체
코팅 완충액: 탄산염 완충액
ELISA 세정 완충액: PBS + 0.05% 트윈-20(PBS-T).
샘플 희석 완충액: PBS-T + 1% BSA.
생산 공정은 고밀도이고 온전한 스파이크 단백질을 전달했다(도 6 참조). AU당 약 1 내지 1.5 x 107 바이러스 입자로 추정되었다. 베타 프로피오락톤에 의한 불활성화 공정은 빠른 불활성화 동역학을 제공하고 S-단백질의 화학적 변형을 감지할 수 없었다. 핵심 매개변수 및 관련 공정 관련된 불순물은 상용 IXIARO® 생성 공정과 유사했다(표 1b 참조). 발명에 따른 SARS-CoV-2 약물 물질은 SDS-PAGE(은색 염색, 환원됨)에 따라 고순도(>95%)이고 SE-HPLC에 따라 응집체가 없었다(단량체 바이러스(>95%)(도 7 참조).
베타-프로피오락톤-불활성화된 SARS-CoV-2에 따른 S-단백질의 변형을 특성화하기 위한 추가 확인 연구는 S-단백질의 트립신 소화물의 질량 분광 분석에 의해 수행된다. 중요한 에피토프에서 아미노산의 변형은 최소화된다. S 단백질 내의 수용체 결합 도메인(RBD)의 초기 정렬 및 몇 가지 공지된 (교차)-중화 항체(SARS-CoV 및 SARS-CoV-2)의 hACE2 인터페이스 및 에피토프는 잠재적인 높은 전환을 갖는 이들 에피토프 및 잠재적인 낮은 전환율을 갖는 단지 소수 내에서 아미노산을 나타내지 않았다.
[표 1b]. SARS-CoV-2 약물 물질 및 IXIARO® 약물 물질의 핵심 매개변수 및 관련 공정 관련된 불순물의 비교.
실시예 2. 불활성화된 SARS-CoV-2 바이러스 조성물의 면역원성 및 보호능의
시험관내
및
생체내
평가
면역원성. 면역화에 앞서, 10마리 Balb/c 마우스의 실험군을 채혈하고 면역-전 혈청을 제조하였다. 마우스에 명반과 함께 제형화된 불활성화된 SARS-CoV-2의 용량 적정을 피하로 투여했다(표 2 참조). 면역화 후 2개의 상이한 간격(아래 참조)에서 혈액을 수집하고 면역 혈청을 준비하고 최종 채혈 시 비장을 수집했다. 모든 동물 실험은 오스트리아 법(BGB1 Nr. 501/1989)에 따라 수행되었으며 "Magistratsabteilung 58"에 의해 승인을 받았다. 혈청은 ELISA에 의해 총 IgG 및 서브클래스(IgG1/IgG2a)에 대해 평가되었고 PRNT에 의해 중화 항체에 대해 평가되었다. Th1/Th2 반응은 IFN-γ ELISpot 및 세포내 사이토카인 염색(CD4+/CD8+)에 의해 추가로 평가되었다.
- 일정 1: 면역화 0일차/7일차, 중간 채혈 14일차, 최종 채혈 및 비장 수확 28일차
- 일정 2: 면역화 0일차/21일차, 중간 채혈 14일차/28일차, 최종 채혈 및 비장 수확 35일차
[표 2]. 투여 실험의 설계, 10마리 마우스/군: 3개 투여량 군: 0.2 - 2μg 총 단백질; 실험 횟수: 3. 실험 목적을 위해 Th1 어쥬번트는 마우스의 면역화 전에 SARS-CoV-2/명반 제형에 직접적으로 첨가된다.
플라크 감소 중화 테스트(PRNT). 12-웰 조직 배양 플레이트의 각 웰에 Vero 세포를 시딩하고 3일 동안 5% CO2와 함께 35℃에서 인큐베이션했다. 각 처리 군으로부터 열-불활성화된 혈청의 풀에서 일련의 희석을 시험한다. 각 혈청 제제는 대략 50-80pfu의 SARS-CoV-2로 35℃에서 5% CO2와 함께 1시간 동안 인큐베이션되었다. 세포 배양 배지를 Vero 세포로부터 흡인하고 SARS-CoV-2/혈청 혼합물을 각 웰에 첨가하였다. 플레이트를 부드럽게 흔든 다음 5% CO2와 함께 35℃에서 2시간 동안 인큐베이션한다. 각 웰에 EMEM 및 영양소를 함유하는 2% 메틸셀룰로오스 용액 1mL를 첨가하고 플레이트를 5% CO2와 함께 35℃에서 4일 동안 추가로 인큐베이션하였다. 그런 다음 세포를 크리스탈 바이올렛/5% 포름알데히드로 1시간 동안 염색하고 탈이온수로 3회 세정했다. 플레이트를 공기 건조시키고 각 웰에서의 플라크 수를 수동으로 계수했다. 대안적으로, 예를 들어, TCID50과 같은 다른 방법을 적용할 수 있다.
[표 3]. 일정 및 수명 실험의 설계, 10마리 마우스/군; 부가이지만 표 2에 대해서와 같은 면역화 일정; 2차 면역화 후 2, 6, 10, 14, 18 및 22주에 중간 채혈; 2차 면역화 후 26주에 말단-채혈; 단지 바람직한 용량; 단지 피하 경로; 실험 횟수: 1. 실험 목적을 위해, Th1 어쥬번트를 마우스의 면역화 전에 SARS-CoV-2/명반 제형에 직접적으로 첨가했다.
보호능. 불활성화된 SARS-CoV-2의 보호능은 인간화된 ACE2 단백질을 발현하는 SARS-감수성 형질전환 마우스(Jackson Laboratory)(Tseng, C.-T.K. et al., Severe Acute Respiratory Syndrome Coronavirus Infection of Mice Transgenic for the Human Angiotensin-Converting Enzyme 2 Virus Receptor (2007) J of Virol 81:1162-1173) 또는 SARS-CoV-2 감염에 대해 개발된 NHP 모델을 사용하여 평가된다. 동물의 군은 음성 대조군으로서 어쥬번트 또는 PBS를 갖거나 갖지 않는 다양한 투여량의 불활성화된 SARS-CoV-2로 피하로(s.c.) 면역화된다. 마지막 용량 3주 후에 동물은 SARS-CoV-2로 공격되고 질환 진행 및 생존에 대해 모니터링된다. 부가하여, PRNT 검정에서 백신접종에 의해 유도된 중화 항체 역가를 결정하기 위해 혈청 샘플을 취한다.
[표 3A]. SARS-CoV-2 ELISA-결정된 투여량을 사용한 실험 4743을 투여하는 설계.
실험 4743 프로토콜. 암컷 Balb/c 마우스(10마리/군)를 표 3A에 설명된 바와 같은 용량 및 어쥬번트로 0일차 및 21일차에 2회 피하 주사로(100μL) 면역화시켰다. 실험의 판독값은 전체 IgG 및 서브클래스(IgG1/IgG2a) 및 바이러스 중화(PRNT)였다. 실험 4743에 사용된 백신 제형: 17μg Al3+(명반)/용량으로 PBS에서 제형화된 연구 바이러스 종자 은행(rVSB)에서 생성된 정제된 불활성화된 SARS-CoV-2.
SARS-CoV-2 단백질에 대한 항체 반응. 다양한 용량 및 어쥬번트 제형에 대한 마우스에서의 면역 반응을 총 IgG ELISA로 평가했다(도 4). 플레이트는 스파이크 당단백질 또는 핵단백질(도 4c)의 S1 부분(도 4a) 또는 수용체 결합 도메인(RBD)(도 4b)으로 코팅되었다. 28일 및 35일에 채취한 혈청을 분석했다. 플레이트를 2μg/mL 항원(S1, RBD 및 N 단백질)으로 코팅하고 마우스 혈청을 4-배 희석에서 1:50의 시작 희석으로 시험했다. 검출을 위해 2차 모노클로날 항체(HRP-접합된 염소 항-마우스 IgG)를 사용하고 ABTS로 전개하고 흡광도 405nm에서 판독했다. 웰은 각 단계 사이에 PBS-T로 세정되었다. 블랭크 3-배로 설정된 컷-오프를 사용하여 평가변수 역가를 결정했다.
IgG 서브클래스 면역 반응. 플레이트를 스파이크 당단백질의 S1 부분(도 4a)으로 코팅하고 35일차에 취한 혈청을 분석했다. HRP와 접합된 서브클래스 특이적 2차 항체(IgG1 및 IgG2a)를 검출에 사용했다. 상이한 IgG 서브클래스(IgG1 및 IgG2a)의 양 결정을 위한 표준 곡선(4-매개변수 회귀)으로서, 상이한 서브클래스를 갖는 모노클로날 항체가 사용되었다(IgG1 mAb 클론 43 및 IgG2a mAb 클론 CR3022). 결합된 HRP-접합된 2차 mAb는 ABTS로 전개되었고 흡광도 405nm에서 판독되었다. 웰은 각 단계 사이에 PBS-T로 세정되었다. 상대적인 IgG 서브클래스 농도는 도 5a에 도시되고 IgG2a/IgG1의 비율은 도 5b에 도시된다.
실험 4743으로부터의 관찰. 명반으로 제형화된 불활성화된 SARS-CoV-2는 S1 단백질, 수용체 결합 도메인(RBD) 및 뉴클레오캡시드 단백질(N)에 대한 항체를 측정하는 ELISA에 의해 검출된 SARS-CoV-2에 대한 마우스에서 항체를 유도했다(도 4a-c). 28일차와 35일차에 출혈 사이에 면역원성에서 증가가 관찰되었다. 가장 낮은 용량(0.3 AU)을 받은 그룹에서 위약보다 유의하게 크지 않은 더 작은 증가가 S1 및 RBD ELISA 역가에 대해 나타났다.
명반-어쥬번트된 불활성화된 SARS-CoV-2는 S1 ELISA에 의한 IgG 서브클래스의 정량화에 의해 입증된 바와 같이 Th1(IgG2a) 반응에 비해 Th2(IgG1) 쪽으로 면역 반응이 더 많이 이동하는 것을 기대한 바와 같이 촉진했다. 측정된 IgG2a 및 IgG1의 총량 및 처리군에서 IgG2a:IgG1의 비율은 각각 도 5a 및 5b에 도시되어 있다. Th1(IgG2a)에 대한 면역 반응에서 전이는 SARS-CoV-2 백신 조성물에 Th1-자극 어쥬번트의 추가에 의해 마찬가지로 예상된다.
추가 면역화 실험이 연구와 GMP 물질 사이의 가교로서 저용량(3, 1.2 및 0.3 AU)으로 GMP 물질을 사용하는 마우스에서 수행될뿐만 아니라 인간 용량(40, 10 및 3 AU)으로 마우스에서 GMP 물질의 분석이 수행된다.
추가로, 면역화된 비-인간 영장류(NHP)에서 공격 연구가 수행되고(도 8 참조) 발명의 SARS-CoV-2 백신 후보로 백신접종된 인간 대상로부터의 혈청을 사용하여 햄스터에서 수동 전이 연구가 수행된다(표 1c 참조).
[표 1c]. 햄스터에서 발명의 SARS-CoV-2 백신 후보의 수동 전이 연구.
실시예 3. 질환 및 면역병리의 항체-의존성 증진(ADE)에 대한 SARS-CoV-2 백신의 시험
메커니즘은 잘 이해되지 않지만 이전 코로나바이러스 감염 또는 백신접종에 대한 반응으로 생성된 항체는 후속 코로나바이러스 감염(들) 동안 1) 면역병리 및/또는 2) 질환의 항체-의존성 증진(ADE)에 대한 위험을 증가시킬 수 있다. 이와 같이 SARS-CoV-2에 대한 항체의 임의의 자극은 가상의 위험을 나타낸다. 이와 관련하여 현재 백신의 안전성을 보장하기 위해 여러 가지 접근법이 수행된다.
시험관내 항체-의존성 증강 검증. 불활성화된 SARS-CoV-2 백신접종된 마우스로부터의 면역 혈청을 시험관내에서 증강된 질환의 특징에 대해 평가한다. 이러한 검정은 예를 들어 Wang, S.-F., et al. 2014 (Antibody-dependent SARS coronavirus infection is mediated by antibodies against spike proteins (2014) BBRC 451:208-214)에 의해 기술되어 있다. 간단히 말해서, 감수성 세포 유형 또는 세포주는 면역 혈청과 함께 인큐베이션된 후 SARS-CoV-2로 감염된다. 세포는 세포변형 효과 및/또는 염증 마커의 생성에 대해 평가된다.
면역 병리학의 마우스 모델. 공격 시 백신-증강된 면역병리의 위험은 Tseng C.T. et al. (Immunization with SARS Coronavirus Vaccines Leads to Pulmonary Immunopathology on Challenge with the SARS Virus (2012) PLoS ONE 7(4):e35421)에 의해 기술된 바와 같이 Balb/c 마우스 모델에서 평가된다. 간단히 말해서, 마우스는 본 명세서에 기재된 바와 같이 제형화된 불활성화된 SARS-Cov-2로 2-주 간격으로 2회 면역화시킨 후 SARS-CoV-2로 공격한다. SARS-CoV-2 역가와 폐의 면역 세포 침윤을 시험한다.
ADE의 비-인간 영장류 모델. 비-인간 영장류에서 ADE 발달의 위험은 Luo F, et al. (Evaluation of Antibody-Dependent Enhancement of SARS-CoV Infection in Rhesus Macaques Immunized with an Inactivated SARS-CoV Vaccine (2018) Virologica Sinica 33:201-204)에 의해 기술된 바와 같이 평가된다. 간단히 말해서, NHP는 불활성화된 SARS-CoV-2로 면역화되고 이어서 SARS-CoV-2 공격 및 증상과 질환 병리의 평가가 뒤따른다.
실시예 4. 임상 페이스 1 연구
페이스 1 시험을 위한 불활성화된 SARS-CoV-2의 제형. 페이스 1 시험의 목적은 면역원성과 함께 백신의 안전성을 평가하고 최적의 용량과 어쥬번트(들)를 결정하는 것이다. 이와 같이 임상 페이스 1에서 3가지 항원 용량이 시험된다: 높음, 중간 및 낮음으로, 대략 3-배의 각 용량 사이의 거리를 가지고 스팬이 높은 용량과 낮은 용량 사이의 약 10-배 차이를 커버하도록 선택된다. 용량 범위는 Th1 어쥬번트의 임의의 잠재적 용량-절약 효과를 나타내기 위해 부분적으로 선택된다.
본 명세서에서 정제된 SARS-CoV-2 바이러스는 SDS-PAGE, SE-HPLC 및/또는 SARS-CoV-2 ELISA에 의해 평가된 바와 같이 >90%의 고순도를 갖는다(데이터는 나타내지 않음). 더욱이, 예비 연구에 따르면 바이러스가 통과하는 동안 유전적 이질성의 발현률이 낮고 특정 개별 돌연변이가 두드러지지 않는 것으로 나타났다(데이터는 나타내지 않음).
본 명세서에서 정제된 SARS-CoV-2 바이러스는 SDS-PAGE, SE-HPLC 및/또는 SARS-CoV-2 ELISA에 의해 평가된 바와 같이 >90%의 고순도를 갖는다(예를 들어, 도 7 참조). 더욱이 예비 연구에 따르면 바이러스가 통과하는 동안 유전적 이질성의 발현률이 낮고 특정 개별 돌연변이가 두드러지지 않는 것으로 나타났다(데이터는 나타내지 않음).
용량 범위에 도달하기 위해, SARS-CoV-2 바이러스를 JEV와 비교하여, 특히 용량 등가물당 SE-HPLC 피크 면적(밀리-흡수 단위 x 분으로 기록됨; mAU), 용량당 불활성화된 바이러스 입자의 총량 및 용량당 총 바이러스 표면 등가물을 평가한다(표 4 참조). 이 평가는 S(스파이크; SARS-CoV-2)와 E(엔벨로프; JEV) 단백질 사이의 유사한 표면 항원 밀도의 가정을 기반으로 하였다. 총 단백질은 μBCA 검정에 의해 결정되었다(표 4). 검정은 다양했지만 mL당 1mAU에서 ~2μg 총 단백질에 해당하는 것으로 관찰되었다. 실시예 1에 요약된 바와 같이 최적화된 SARS-CoV-2 S-단백질 ELISA를 사용한 또 다른 결정도 수행되었다.
[표 4]. JEV 및 SARS-CoV-2 정량화 매개변수 및 낮은, 중간 및 높은 SARS-CoV-2 투여량 군에서 총 단백질의 비교.
SARS-CoV-2 바이러스 입자(~92nm 직경)는 플라비바이러스 입자(~40nm)보다 훨씬 더 크며, 이는 입자당 대략 5-배 더 큰 바이러스 표면적에 해당하므로 동등하게 더 높은 항원 함량이 예상된다. 더욱이, JEV(IXIARO), TBE(Encepur) 및 HepA(VAQTA)를 포함한 다른 불활성화된 바이러스 백신 제제는 낮은 μg에서 ng 단백질 범위의 항원 용량을 보고했다. 이들 바이러스는 모두 포르말린 불활성화되어 있기 때문에, 본 발명의 BPL-불활성화된 SARS-CoV-2 바이러스는 더 잘 보존된 표면 항원 단백질, 즉 보다 우수한 품질의 항원을 가지고 더 낮은 총 단백질 용량을 요한다.
클리닉에 진입하기 위해 추가 항원 결정 검정(실시예 1에 기술된 SARS-CoV-2 ELISA 검정)이 전개되었고 페이스 1 시험에 진입하기 위한 백신 제형의 용량이 이 검정을 사용하여 결정되었다. 페이스 1 치료 군은 표 5에 제시되어 있다.
페이스 1 시험을 위한 SARS-CoV-2 백신의 제형(0.5mL/용량):
-항원(불활성화된 SARS-CoV-2) 표적 용량:
낮음: 3AU/0.5mL (6AU/mL)*
중간: 10AU/0.5mL (20AU/mL)
높음: 40AU/0.5mL (80AU/mL)
*실시예 1에 기재된 바와 같이 SARS-CoV-2 ELISA 검정에 의해 결정된 용량
- 수산화알루미늄(Al 3+ ): 0.5mg/용량 (1mg/mL)
-Th1 어쥬번트
-재조합 인간 혈청 알부민(rHSA): ~25μg/용량 (~50μg/mL)
-완충액: 인산완충식염수(PBS)
일부 경우에, 백신접종된 대상체에 감염성 용량의 살아있는 SARS-CoV-2 바이러스(아시아 및/또는 유럽 계통)로 공격한다.
[표 5]. 불활성화된 SARS-CoV-2 백신의 페이스 1 시험을 위한 처리군(낮은, 중간 및 높은 용량은 표 4에 제공된 것임).
실시예 5. 중화 검정으로 백신접종된 유기체의 혈청 시험
백신접종된 마우스, 햄스터, 비-인간 영장류 또는 인간의 혈청은 "Szurgot, I., Hanke, L., Sheward, D.J. et al. DNA-launched RNA replicon vaccines induce potent anti-SARS-CoV-2 immune responses in mice. Sci Rep 11, 3125 (2021). https://doi.org/10.1038/s41598-021-82498-5"에 기술된 바와 같은 중화 검정에서 시험할 수 있다.
시험의 판독은 백신접종된 대상체의 혈청이 신규한 변이체를 얼마나 잘 중화할 수 있는지 표시를 제공하고 따라서 백신의 설계를 안내한다.
실시예 6. 불활성화된 SARS-CoV-2의 탠덤 질량 분석법(LC-MS-MS) 분석을 사용한 액체 크로마토그래피
방법론:
2개의 샘플을 SDS-폴리아크릴아미드 겔 전기영동을 사용하여 분리하고 은 염색에 의해 밴드를 시각화했다. 밴드를 절단하고 트립신으로 겔-내 단리를 수행하고 생성된 펩티드를 고-해상도 정밀 질량 분석기에 연결된 나노-액체 크로마토그래피로 분석했다. 펩티드는 MaxQuant 소프트웨어 패키지와 SARS-CoV-2 및 클로로세부스 사바에우스에 대한 UniProt 참조 데이터베이스를 사용하여 원시 스펙트럼으로부터 식별되었다. 변형을 고려하기 위해 데이터는 특별히 β-프로피오락톤 변형에 대해 재-검색되었고 획득된 결과는 제2 독립적인 검색 알고리즘(Sequest in Proteome Discoverer suite)으로 확인되었다. 부가적으로, 데이터는 추가의 알려지지 않은 MS-감지가능한 변형을 고려하기 위해 FragPipe 패키지로 검색되었다.
결과:
단백질 식별:
밴드는 명확히 3가지 주요 바이러스 단백질(스파이크-단백질, 막-단백질, 핵단백질)뿐만 아니라 숙주 시스템으로부터 배경 단백질에 기인할 수 있었다(도 10 참조). SARS-CoV-2 ORF9b와 레플리카제 다단백질의 흔적도 감지할 수 있었지만, 이들 단백질은 가능하기로는 그 크기에 기인하여 겔에서 잘 분해되지 않았다. 겔 상의 분리 패턴은 숙주 단백질 밴드(밴드 2.3), 약간 다른 S-단백질 패턴(밴드 2.10-2.13) 및 샘플 중 하나에서 혈청 알부민의 예상된 강한 밴드(샘플 2)를 제외하고 샘플 둘 모두에 대해 매우 유사했다. 부가적으로, 인간 기원의 다수의 전형적인 실험실 오염물질(예를 들어, 케라틴)이 샘플 둘 모두의 배경에서 검출되었다. 스파이크-단백질의 처리(전체 길이에서 S1, S2 및 S2'까지)는 적용된 방법론으로 해결하기 어렵지만 샘플 둘 모두에서 밴드 9-13의 패턴에 의해 표시될 가능성이 높다.
변형 분석:
Uittenbogaard et al. (Reactions of β-Propiolactone with Nucleobase Analogues, Nucleosides, and Peptides, Protein Structure and Folding| Volume 286, ISSUE 42, P36198-36214, October 21, 2011)에 의한 공보에 기반하여, 시스테인, 메티오닌 및 히스티딘 상에서 β-프로피오락톤(BPL) 변형을 찾을 것으로 예상되었다. Uittenbogaard et al.은 변형의 유형, 예를 들어, 아실화, 알킬화와 함께 베타-프로피오락톤에 의해 변형 대상이 되는 아미노산을 연구했다. 그들은 BPL이 실제 pH에 따라 최대 9개의 다른 아미노산(C,H,M,D,E,Y,K,E,S)과 반응할 수 있음을 보여주었다. 그들의 연구에서 시스테인(>95%), 히스티딘(15-25%) 및 메티오닌(36%) 잔기에 대해 관련 pH 범위 7에서 9 내에서 더 높은 전환이 관찰되었다. 아스파르트산, 글루탐산 및 티로신에 대한 전환율은 대략 3-15% 범위에서 훨씬 낮았다. 시스틴 잔기에서 이황화기는 반응하지 않는 것으로 나타났다.
BPL-불활성화된 SARS-CoV-2 입자에서, BPL 변형이 감지될 수 있었지만(주로 +72 Da의 형태로) 존재비가 낮다. SARS-CoV-2 단백질에 대해 2894(샘플 1) 및 3086(샘플 2) 식별된 스펙트럼 중 각각 단지 73 및 110이 BPL 변형을 수행했으며, 이는 2.5 내지 3.6%로 해석된다(표 6 참조). 이는 또한 FragPipe를 사용한 공개 변형 검색에 의해서 확인되었는데, 이는 BPL-변형과 일치하는 질량 차이로 인해 유사한 낮은 분율의 스펙트럼에 기인했다.
[표 6]. 식별된 SARS-CoV-2 펩티드 스펙트럼의 수
SARS-CoV-2 단백질에 대해 보고된 모든 BPL-변형된 펩티드의 스펙트럼을 수동으로 검사했으며 그 중 샘플 1과 2에 대해 각각 6 내지 8개 부위가 확인되었다(표 6 참조). 이들 검증된 모든 부위에 대해 비변형된 펩티드도 확인되었으며 이는 BPL을 사용한 변형이 100%에 결코 도달하지 않았음을 시사한다. 본 발명자들은 각 밴드에 대한 단백질 풍부도로 정규화된 동일한 변형 부위에 대한 변형된 펩티드 대 비변형된 펩티드의 비율로 특정 부위의 변형(소위 부위 점유)의 정도를 추정했다. 본 발명자들은 그런 다음 부위 변형의 정도를 보수적으로 측정하기 위해 각 부위에 대해 최대 점유를 선택했다. 표 7에서 볼 수 있듯이, 점유는 식별된 스펙트럼의 총 수와 일치하여 식별된 부위에 대해 일반적으로 다소 낮았다. 유일한 예외인 핵단백질의 M234는 주의 깊게 해석되어야 하는데, 그 이유는 특정 펩티드 서열이 다른 부위에 비해 이 특정 펩티드에 대한 추정을 덜 정확하고 신뢰할 수 없게 만드는 문제가 있는 특징을 갖고 있기 때문이다.
[표 7]. 식별된 BPL-변형 부위 및 그의 점유
n.q. = 정량화되지 않음; n.d. = 감지되지 않음
*누락된 절단 및 산화에 기인한 정량화 불확실
예상된 변형과는 별도로 FragPipe 검색은 스펙트럼의 약 10%에서 발생하는 2가지 다른 변형(가장 가능하기로는 아세트알데히드 및 아세틸화)을 밝혔다. 이들 변형은 오염 단백질에서도 발생하기 때문에 가장 가능하기로는 겔 염색 및 샘플 준비 동안 도입된 인공물을 나타낸다.
요약:
상기에 기술된 결과를 기반으로 이들 샘플의 주요 성분은 SARS-CoV2 단백질에 해당한다고 결론지었다. BPL 변형은 검출가능했지만 낮은 것으로, 즉 전체 SARS-CoV-2 프로테옴 수준(즉, 식별된 모든 SARS-CoV-2 단백질)에서 약 3%인 것으로 나타났다. S-단백질의 5개 아미노산만이 변형된 것으로 밝혀졌고 이는 분석된 S-단백질의 소수에 대해서만 검출되었다(예를 들어, H207 아미노산에서 스파이크-단백질의 경우 약 16%, 즉 H207에서 변형을 가질 가능성은 약 16%였다). 두 샘플은 일부 배경 단백질에 관해 그리고 그의 변형의 정도에서 단지 약간 다를 뿐 샘플 1은 BPL-변형의 수준이 약간 더 낮게 나타난다. 스파이크 단백질의 아미노산 중 약 30 내지 40%만이 시험될 수 있었음을 주지한다.
결론:
이 데이터는 발명의 순한 불활성화 접근법이 S-단백질 내의 변형을 최소화하고 따라서 S-단백질의 천연 표면이 크게 보존된다는 견해를 뒷받침한다.
이에 비해, 독감 샘플의 BPL 불활성화에 의한 변형의 결정은 더 빈번했으며, 즉 일 샘플 독감 백신(NIBRG-121xp)의 경우 HA 상의 83개 부위와 NA 상의 43개 부위, 및 다른 샘플(NYMC-X181A)의 경우 HA 상의 99개 부위와 NA 상의 39개 부위가 변형되었으며, 여기서 HA 및 NA는 2가지 주요 막 당단백질, 즉 독감에 대한 일차 면역원이다(She Yi-Min et al., Surface modifications of influenza proteins upon virus inactivation by beta-propiolactone; Proteomics 2013, 13, 3537-3547, DOI 10.1002/pmic.201300096). 따라서 인플루엔자 바이러스의 BPL 불활성화는 막 융합에 영향을 미치는 일부를 포함하여 수많은 단백질 변형을 유발할 수 있다.
실시예 7. 불활성화된 SARS-CoV-2의 탠덤 질량 분석법(LC-MSMS) 분석을 사용한 추가 액체 크로마토그래피
방법론:
단백질의 더 큰 적용범위를 얻기 위하여 실시예 6에 기술된 바와 같이 BPL-불활성화된 SARS-CoV-2 입자의 추가 LC-MSMS 분석을 수행하였다. BPL-불활성화된 SARS-CoV-2 샘플의 5개 분취량을 SDS-PAGE 상에서 분리하고 밴드를 시각화를 위한 은 염색 또는 처리를 위한 쿠마시 염색에 의해 시각화했다. 스파이크 단백질에 해당하는 쿠마시-염색된 밴드(이전 분석 기반)는 트립신 또는 키모트립신을 사용한 겔-내 단리(in-gel digestion) 또는 산 가수분해에 적용되었다. 트립신 단리는 이전 PNGase F(펩티드:N-글리코시다제 F) 단리가 있는 한 번과 없는 한 번인, 두 번 수행하여 글리코실화에 의해 가려진 펩티드를 식별했다.
단리된 펩티드는 본질적으로 실시예 6에 기재된 바와 같이 LC-MSMS에 의해 분석되었다. 특히, 생성된 펩티드는 고-해상도 정밀 질량 분석기에 연결된 나노-액체 크로마토그래피로 분석하였다. 펩티드는 MaxQuant 소프트웨어 패키지와 SARS-CoV-2 및 클로로세부스 사바에우스에 대한 UniProt 참조 데이터베이스를 일반 실험실 오염물질의 데이터베이스와 함께 사용하여 원시 스펙트럼으로부터 식별되었다. 변형을 고려하기 위해 데이터는 특별히 β-프로피오락톤(BPL) 변형에 대해 검색되었고 SARS-CoV-2 스파이크 단백질의 모든 BPL-변형된 펩티드의 스펙트럼이 수동으로 검증되었다. 변형의 정도는 식별된 BPL-변형된 스펙트럼의 백분율로 전체적으로 추정되었고, 별도로 각 펩티드/부위에 대해 변형된 펩티드 대 비변형된 펩티드의 비율로부터 부위 점유를 계산함에 의해 부위-수준에서 추정되었다.
결과:
4가지 단리 방법(즉, (i) 트립신 (ii) 트립신 + PNGase F (iii) 키모트립신 및 (iv) 산 가수분해)의 조합을 사용한 특정 SARS-CoV-2 단백질의 총 적용범위는 다음과 같다:
스파이크(S) 단백질 - 91.5%
막(M) 단백질 - 60.36%
핵단백질(N) - 74.70%
각 단리 방법에 기반한, 불활성화된 SARS-CoV-2 입자에서 BPL-변형된 펩티드의 수는 하기 표 8에 나타나 있다:
[표 8]: 분석된 모든 밴드에 걸쳐 식별된 SARS-CoV-2 펩티드 스펙트럼의 수
실시예 6에 나타낸 바와 같이, 이는 단리 방법에 관계없이 BPL-변형된 펩티드의 백분율이 낮다는 것, 예를 들어 평균 7% 미만, 2 내지 7% 또는 약 2-5%인 것을 확인시켜준다.
상기에 기술된 4가지 단리 방법의 조합을 사용하면 SARS-CoV-2 단백질에서 아미노산 잔기의 더 큰 적용범위가 달성될 수 있다. 따라서, BPL-변형은 하기 표 9에 나타낸 스파이크(S) 및 막(M) 단백질에서의 위치에서 검출되었다. 상기 실시예 6에 기술된 바와 같이 각 부위에서 평균 점유 백분율도 표 9에 나타나 있다.
[표 9]. S 단백질에서 식별된 BPL-변형된 부위 및 그의 점유
표 9에서의 데이터로부터, 스파이크(S) 단백질에서 최대 약 16개 잔기가 변형될 수 있고, 막(M) 단백질에서 최대 4개 잔기가 변형될 수 있음을 알 수 있다. 각 부위에서 점유는, 예를 들어 20% 미만, 전형적으로 10% 미만으로 낮다. 따라서 불활성화된 SARS-CoV-2 입자는 낮은 정도의 BPL-변형을 나타낸다.
발명의 부가적인 양태
추가 양태에서, 본 발명은 다음을 제공한다:
A1. 최적으로(예를 들어, S-단백질의 천연 표면이 보존된) 불활성화된 SARS-CoV-2 입자를 포함하는 SARS-CoV-2 백신으로서, 여기서 SARS-CoV-2 입자는 SARS-CoV-2 백신이 투여된 대상체를 적어도 70% 확률로 혈청변환할 수 있는, SARS-CoV-2 백신.
A2. 양태 A1에 있어서, SARS-CoV-2 입자는 SARS-CoV-2 백신이 투여된 대상체를 적어도 80%, 85%, 90%, 또는 95% 확률로 혈청변환할 수 있는, SARS-CoV-2 백신.
A3. 양태 A1 또는 A2에 있어서, SARS-CoV-2 입자는 하기의 핵산 서열 중 어느 하나에 의해 제공되는 DNA 서열에 상응하는 RNA 게놈을 갖는, 백신
ㆍ서열번호: 1(Genbank NC_045512.2 참조), 또는 서열번호: 1과 적어도 85% 동일하고 독성 SARS-CoV-2를 패킹할 수 있는 변이 핵산 서열; 또는
ㆍ서열번호: 9(NCBI MT066156 참조), 또는 서열번호: 1과 적어도 85% 동일하고 독성 SARS-CoV-2를 패킹할 수 있는 변이 핵산 서열; 또는
ㆍ서열번호: 18(NCBI MW598408 참조). 또는 서열번호: 18과 적어도 85% 동일하고 독성 SARS-CoV-2를 패킹할 수 있는 변이 핵산 서열; 또는
ㆍ서열번호: 20(NCBI MW520923 참조). 또는 서열번호: 20과 적어도 85% 동일하고 독성 SARS-CoV-2를 패킹할 수 있는 변이 핵산 서열; 또는
ㆍ서열번호: 22(NCBI MW422256 참조). 또는 서열번호: 22와 적어도 85% 동일하고 독성 SARS-CoV-2를 패킹할 수 있는 변이 핵산 서열; 또는
ㆍ서열번호: 24(NCBI MW493681 참조). 또는 서열번호: 24와 적어도 85% 동일하고 독성 SARS-CoV-2를 패킹할 수 있는 변이 핵산 서열; 또는
ㆍ서열번호: 26(NCBI MW306426 참조). 또는 서열번호: 26과 적어도 85% 동일하고 독성 SARS-CoV-2를 패킹할 수 있는 변이 핵산 서열.
A4. 양태 A1-A3 중 어느 하나에 있어서, SARS-CoV-2 입자가 아미노산 서열에 의해 정의된 바와 같은 S 단백질을 갖는, 백신
ㆍ서열번호: 3, 또는 서열번호: 3과 적어도 95% 동일하고 독성 SARS-CoV-2를 패킹할 수 있는 변이 아미노산 서열; 또는
ㆍ서열번호: 11, 또는 서열번호: 11과 적어도 95% 동일하고 독성 SARS-CoV-2를 패킹할 수 있는 변이 아미노산 서열; 또는
ㆍ서열번호: 19, 또는 서열번호: 19와 적어도 95% 동일하고 독성 SARS-CoV-2를 패킹할 수 있는 변이 아미노산 서열; 또는
ㆍ서열번호: 21, 또는 서열번호: 21과 적어도 95% 동일하고 독성 SARS-CoV-2를 패킹할 수 있는 변이 아미노산 서열; 또는
ㆍ서열번호: 23, 또는 서열번호: 23과 적어도 95% 동일하고 독성 SARS-CoV-2를 패킹할 수 있는 변이 아미노산 서열; 또는
ㆍ서열번호: 25, 또는 서열번호: 25와 적어도 95% 동일하고 독성 SARS-CoV-2를 패킹할 수 있는 변이 아미노산 서열; 또는
ㆍ서열번호: 27, 또는 서열번호: 27과 적어도 95% 동일하고 독성 SARS-CoV-2를 패킹할 수 있는 변이 아미노산 서열.
A4.1 양태 A1-A4 중 어느 하나에 있어서, 제1 SARS-CoV-2 입자와 상이하고 서열번호: 1, 9, 18, 20, 22, 24 및 26으로 구성된 군으로부터 선택되는 제2 SARS-CoV-2 입자를 포함하는, 백신.
A5. 양태 A1-A4 및 A4.1 중 어느 하나에 있어서, SARS-CoV-2는 화학적 불활성화, 열적 불활성화, pH 불활성화, 또는 UV 불활성화된 또는 방사선 불활성화에 의해 불활성화되는, 백신.
A6. 양태 A5에 있어서, 화학적 불활성화는 플라크 검정에 의해 측정되거나 플라크 검정 플러스 일일에 의해 측정된 바와 같이 SARS-CoV-2를 완전하게 불활성화시키는 데 필요한 것보다 더 오래 동안 SARS-CoV-2 입자를 화학적 불활성화제와 접촉시키는 것을 포함하는, 백신.
A7. 양태 A6에 있어서, 화학적 불활성화는 SARS-CoV-2 입자를 포름알데히드 및/또는 베타-프로피오락톤, 바람직하게는 베타-프로피오락톤과 접촉시키는 것을 포함하는, 백신.
A8. 양태 A7에 있어서, 포름알데히드 및/또는 베타-프로피오락톤 불활성화는 SARS-CoV-2 입자를 2-10일 사이 동안 포름알데히드 및/또는 베타-프로피오락톤과 접촉시키는 것을 포함하는, 백신.
A9. 양태 A5-A8 중 어느 하나에 있어서, 화학적 활성화는 약 4℃ 또는 약 22℃에서 수행되는, 백신.
A10. 양태 A1-A9 중 어느 하나에 있어서, 어쥬번트를 추가로 포함하는, 백신.
A11. 양태 A10에 있어서, 어쥬번트는 선택적으로 AS01, AS03, MF59, 이미퀴모드 및/또는 CpG 1018과 조합된 알루미늄 염 어쥬번트인, 백신.
A12. 양태 A11에 있어서, 알루미늄 염 어쥬번트는 수산화알루미늄 또는 인산알루미늄 염인, 백신.
A13. A10-A12 중 어느 하나에 있어서, 백신은 펩티드 및 데옥시이노신-함유 면역자극 올리고데옥시핵산 분자(I-ODN)를 포함하는 어쥬번트를 포함하거나 추가로 포함하는, 백신.
A14. 양태 A13에 있어서, 펩티드는 서열 KLKL5KLK(서열번호: 5)를 포함하고 I-ODN은 올리고-d(IC)13(서열번호: 6)을 포함하는, 백신.
A15. 양태 A1-A14 중 어느 하나에 있어서, 하나 이상의 약학적으로 허용가능한 부형제를 추가로 포함하는, 백신.
B1. 양태 A1-A15 중 어느 하나의 SARS-CoV-2 백신을 포함하는 키트.
B2. 양태 B1에 있어서, 제2 백신을 추가로 포함하는, 키트.
B3. 양태 B2에 있어서, 제2 백신은 (예를 들어, mRNA 또는 아데노바이러스 벡터와 같은 또 다른 기술의) 또 다른 SARS-CoV-2 바이러스 백신, 인플루엔자 바이러스 백신 또는 치쿤구니야 바이러스 백신인, 키트.
C1. 양태 A1-A15 중 어느 하나의 치료적으로 유효한 양의 SARS-CoV-2 백신의 제1 용량을 이를 필요로 하는 대상체에게 투여하는 것을 포함하는, 방법.
C2. 양태 C1에 있어서, 치료적으로 유효한 양의 SARS-CoV-2 백신의 제2 용량을 투여하는 것을 추가로 포함하는, 방법.
C3. 양태 C1 또는 C2에 있어서, SARS-CoV-2 백신의 제2 용량은 SARS-CoV-2 백신의 제1 용량 후 약 7일에 투여되는, 방법.
C4. 양태 C1 또는 C2에 있어서, SARS-CoV-2 백신의 제2 용량은 SARS-CoV-2 백신의 제1 용량 후 약 14일에 투여되는, 방법.
C5. 양태 C1 또는 C2에 있어서, SARS-CoV-2 백신의 제2 용량은 SARS-CoV-2 백신의 제1 용량 후 약 21일에 투여되는, 방법.
C6. 양태 C1 또는 C2에 있어서, SARS-CoV-2 백신의 제2 용량은 SARS-CoV-2 백신의 제1 용량 후 약 28일에 투여되는, 방법.
C7. 양태 C1-C5 중 어느 하나에 있어서, 투여는 SARS-CoV-2 중화 항체의 생성을 초래하는, 방법.
D1. 하기를 포함하는, SARS-CoV-2 백신을 제조하는 방법
(i) Vero 세포 상에서 SARS-CoV-2를 계대하고, 이에 의해 SARS-CoV-2를 포함하는 배양 배지를 생성하는 단계;
(ii) (i)의 배양 배지를 수확하는 단계;
(iii) (ii)의 수확된 배양 배지를 침전시키고, 이에 의해 SARS-CoV-2 상등액을 생성하는 단계; 및
(iv) (iii)의 SARS-CoV-2 상등액에서 SARS-CoV-2를 최적으로 불활성화하고 이에 의해 불활성화된 SARS-CoV-2를 생성하는 단계.
D2. 양태 D1에 있어서, 단계 (iii) 이전에 (ii)의 배양 배지를 농축하는 것을 추가로 포함하는, 방법.
D3. 양태 D1 또는 D2에 있어서, (iii)의 침전은 (ii)의 배양 배지를 프로타민 설페이트 또는 벤조나아제와 접촉시키는 것을 포함하는, 방법.
D4. 양태 D1-D3 중 어느 하나에 있어서, (v) (iv)의 불활성화된 SARS-CoV-2를 투석하고 이에 의해 투석된 SARS-CoV-2를 생성하는 단계를 추가로 포함하는, 방법.
D5. 양태 D4에 있어서, (v)의 투석된 SARS-CoV-2를 여과하는 것을 포함하는 단계 (vi)를 추가로 포함하는, 방법.
D6. 양태 D1-D5 중 어느 하나에 있어서, 불활성화는 화학적 불활성화, 열적 불활성화, pH 불활성화, 또는 UV 불활성화에 의한 것인, 방법.
D7. 양태 D6에 있어서, 화학적 불활성화는 SARS-CoV-2 입자를 적어도 4일 동안 화학적 불활성화제와 접촉시키는 것을 포함하는, 방법.
D8. 양태 D6 또는 D7에 있어서, 화학적 불활성화제는 포름알데히드를 포함하는, 방법.
D9. 양태 D6-D8 중 어느 하나에 있어서, 화학적 활성화는 약 4℃ 또는 약 22℃에서 수행되는, 방법.
D10. 양태 D8 또는 D9에 있어서, 포름알데히드를 중화시키는 것을 추가로 포함하는, 방법.
D11. 양태 D10에 있어서, 중화는 메타중아황산나트륨으로 수행되는, 방법.
D12. 양태 D1-D11 중 어느 하나에 있어서, 화학적 불활성화는 바람직하게는 300 내지 700ppm, 더욱 바람직하게는 500ppm의 농도에서 BPL로 수행되고 약 1 내지 48시간, 바람직하게는 20 내지 28시간, 가장 바람직하게는 24시간 ± 2시간(예컨대 또한 ±1시간 또는 ±0.5시간) 동안 2℃ 내지 8℃에서 불활성화되는, 방법.
D13. 양태 D12에 있어서, 화학적 불활성화 후에 35℃ 내지 39℃, 바람직하게는 약 37℃에서 2.5시간 ± 0.5시간 동안 가수분해 단계가 이어지는, 방법.
E1. SARS-CoV-2 감염의 치료 및/또는 예방을 위한 양태 A1-A15 중 어느 하나의 최적으로 불활성화된 SARS-CoV-2 백신의 용도.
E2. 양태 E1에 있어서, 불활성화된 SARS-CoV-2 백신은 이를 필요로 하는 대상체에게 치료적으로 유효한 양의 제1 용량으로 투여되는, 용도.
E3. 양태 E2에 있어서, 불활성화된 SARS-CoV-2 백신은 대상체에게 치료적으로 유효한 양의 제2 용량으로 투여되는, 용도.
E4. 양태 E3에 있어서, 불활성화된 SARS-CoV-2 백신의 제2 용량은 SARS-CoV-2 백신의 제1 용량 후 약 7일에 투여되는, 용도.
E5. 양태 E3에 있어서, SARS-CoV-2 백신의 제2 용량은 SARS-CoV-2 백신의 제1 용량 후 약 14일에 투여되는, 용도.
E6. 양태 E3에 있어서, SARS-CoV-2 백신의 제2 용량은 SARS-CoV-2 백신의 제1 용량 후 약 21일에 투여되는, 용도.
E7. 양태 E3에 있어서, SARS-CoV-2 백신의 제2 용량은 SARS-CoV-2 백신의 제1 용량 후 약 28일에 투여되는, 용도.
E8. 양태 E1-E6 중 어느 하나에 있어서, 투여는 SARS-CoV-2 중화 항체의 생성을 초래하는, 용도.
F1. SARS-CoV-2 감염의 치료 및 예방에 사용하기 위한 약학적 조성물로서, 여기서 상기 약학적 조성물은 양태 A1-A15 중 어느 하나의 최적으로 불활성화된 SARS-CoV-2 백신을 포함하는, 약학적 조성물.
F2. 양태 F1에 있어서, 불활성화된 SARS-CoV-2 백신은 이를 필요로 하는 대상체에게 치료적으로 유효한 양의 제1 용량으로 투여되는, 약학적 조성물.
F3. 양태 F2에 있어서, 불활성화된 SARS-CoV-2 백신은 대상체에게 치료적으로 유효한 양의 제2 용량으로 투여되는, 용도.
F4. 양태 F3에 있어서, 불활성화된 SARS-CoV-2 백신의 제2 용량은 SARS-CoV-2 백신의 제1 용량 후 약 7일에 투여되는, 용도.
F5. 양태 F3에 있어서, SARS-CoV-2 백신의 제2 용량은 SARS-CoV-2 백신의 제1 용량 후 약 14일에 투여되는, 용도.
F6. 양태 F3에 있어서, SARS-CoV-2 백신의 제2 용량은 SARS-CoV-2 백신의 제1 용량 후 약 21일에 투여되는, 용도.
F7. 양태 F3에 있어서, SARS-CoV-2 백신의 제2 용량은 SARS-CoV-2 백신의 제1 용량 후 약 28일에 투여되는, 용도.
F8. 양태 F1-F6 중 어느 하나에 있어서, 투여는 SARS-CoV-2 중화 항체의 생성을 초래하는, 용도.
G1. 유효량의 항원을 포함하는 SARS-CoV-2 백신으로서, 여기서 상기 유효량은 SARS-CoV-2 백신이 투여된 대상체를 적어도 70% 확률로 혈청전환시킬 수 있는, SARS-CoV-2 백신.
G2. 양태 G1에 있어서, 상기 유효량은 SARS-CoV-2 백신이 투여된 대상체를 적어도 80%, 85%, 90%, 또는 95% 확률로 혈청전환시킬 수 있는, SARS-CoV-2 백신.
G3. 양태 G1 또는 G2에 있어서, 상기 유효량은 약 1 내지 100AU/용량, 바람직하게는 약 2 내지 75AU/용량, 바람직하게는 약 3 내지 60AU/용량, 보다 바람직하게는 약 3 내지 55AU/용량, 더욱 바람직하게는 약 3 내지 53AU/용량인, SARS-CoV-2 백신.
G4. 양태 G3에 있어서, 상기 유효량이 ELISA에 의해 결정되고 여기서 항원 단위(AU)는 표준으로서 사용된 스파이크 단백질의 ACE-2 결합 능력에 상응하는, SARS-CoV-2 백신.
H1. 불활성화된 SARS-CoV-2 입자를 포함하는 SARS-CoV-2 백신으로서; 여기서 SARS-CoV-2 입자의 천연 표면 형태는 백신이 인간 대상체에서 천연 SARS-CoV-2 입자에 대한 중화 항체를 생성할 수 있도록 백신에 보존되는, SARS-CoV-2 백신.
H2. 양태 H1에 있어서, 불활성화된 SARS-CoV-2 입자 내의 바이러스 RNA는 복제-결핍인, SARS-CoV-2 백신.
H3. 양태 H1 또는 H2에 있어서, 불활성화된 SARS-CoV-2 입자에서 바이러스 RNA는 (i) 알킬화 및/또는 아실화되고 (ii) 하나 이상의 변형된 퓨린(바람직하게는 구아닌) 잔기 또는 가닥 절단을 포함하고/하거나 (iii) 하나 이상의 바이러스 단백질과 가교-결합되는, SARS-CoV-2 백신.
H4. 임의의 선행하는 양태에 있어서, 불활성화된 SARS-CoV-2 입자는 바람직하게는 300 내지 700ppm, 더욱 바람직하게는 500ppm의 농도에서의 베타-프로피오락톤-불활성화된 SARS-CoV-2 입자이고, 2℃ 내지 8℃에서 약 1 내지 48시간, 바람직하게는 20 내지 28시간, 가장 바람직하게는 24시간 ± 2시간(예컨대 또한 ±1시간 또는 ±0.5시간) 동안 불활성화되고, 선택적으로 2.5시간 ± 0.5시간 동안 35℃ 내지 39℃, 바람직하게는 약 37℃에서 가수분해가 이어지는, SARS-CoV-2 백신.
H5. 임의의 선행하는 양태에 있어서, 불활성화된 SARS-CoV-2 입자는 자외선(UV)-불활성화된 SARS-CoV-2 입자인, SARS-CoV-2 백신.
H6. 임의의 선행하는 양태에 있어서, 불활성화된 SARS-CoV-2 입자에서 표면 단백질은 불활성화된 SARS-CoV-2 입자 내의 바이러스 RNA와 비교하여 감소된 변형을 포함하고, 바람직하게는 표면 단백질은 불활성화된 SARS-CoV-2 입자 내의 바이러스 RNA와 비교한 감소된 비율의 변형된 잔기를 포함하고; 상기 변형은 천연 SARS-CoV-2 입자에 관한 것이며, 바람직하게는 상기 변형은 알킬화 및/또는 아실화된 뉴클레오티드 또는 아미노산 잔기를 포함하는, SARS-CoV-2 백신.
H7. 임의의 선행하는 양태에 있어서, 불활성화된 SARS-CoV-2 입자는 (i) 스파이크(S) 단백질; (ii) 뉴클레오캡시드(N) 단백질; (iii) 막(M) 당단백질; 및/또는 (iv) 엔벨로프(E) 단백질을 포함하고; 바람직하게는 불활성화된 SARS-CoV-2 입자는 천연 형태 스파이크(S) 단백질을 포함하는, SARS-CoV-2 백신.
H8. 임의의 선행하는 양태에 있어서, 불활성화된 SARS-CoV-2 입자에 의한 포유동물 세포의 감염성은 천연 SARS-CoV-2 입자와 비교하여 적어도 99%, 99.99% 또는 99.9999% 감소되고, 또는 여기서 불활성화된 SARS-CoV-2 입자에 의한 포유동물 세포의 감염성은 검출불가능한, SARS-CoV-2 백신.
H9. 임의의 선행하는 양태에 있어서, 예를 들어 인간 혈청 알부민(HSA)과 같은 하나 이상의 약학적으로 허용가능한 부형제를 추가로 포함하는, SARS-CoV-2 백신.
H10. 임의의 선행하는 양태에 있어서, 어쥬번트를 추가로 포함하는, SARS-CoV-2 백신.
H11. 양태 H10에 있어서, 어쥬번트는 수산화알루미늄 또는 인산알루미늄을 포함하는, SARS-CoV-2 백신.
H12. 양태 H11에 있어서, 수산화알루미늄 또는 인산알루미늄은 백신에서 유일한 어쥬번트인, SARS-CoV-2 백신.
H13. 양태 H10 또는 11에 있어서, 어쥬번트는 Th1 반응-지향 어쥬번트를 포함하거나 추가로 포함하는, SARS-CoV-2 백신.
H14. 양태 H13에 있어서, Th1 반응-지향 어쥬번트는 3-O-데사실-4'-모노포스포릴 지질 A(MPL), 사포닌 QS-21, CpG-함유 올리고데옥시뉴클레오티드(CpG ODN), 스쿠알렌, DL-α-토코페롤, 양이온성 펩티드, 데옥시이노신-함유 면역자극 올리고데옥시핵산 분자(I-ODN) 및/또는 이미퀴모드를 포함하는, SARS-CoV-2 백신.
H15. 양태 H10에 있어서, 상기 어쥬번트는 다음을 포함하는, SARS-CoV-2 백신:
(i) 3-O-데사실-4'-모노포스포릴 지질 A(MPL) 및 사포닌 QS-21, 바람직하게는 어쥬번트 시스템 01을 포함하는 리포솜 제제;
(ii) 서열 5' TGACTGTGAACGTTCGAGATGA 3'(서열번호: 4), 바람직하게는 CpG 1018을 포함하는 CpG ODN;
(iii) 스쿠알렌, DL-α-토코페롤 및 폴리소르베이트 80(바람직하게는 어쥬번트 시스템 03);
(iv) 스쿠알렌, 트윈 80 및 스팬 85, 바람직하게는 MF59를 포함하는 수중유 에멀젼;
(v) 서열 KLKL5KLK(서열번호: 5) 및 올리고-d(IC)13(서열번호: 6)의 펩티드, 바람직하게는 IC31; 또는
(vi) 알루미늄 염 및 선택적으로 Th1-지향 어쥬번트.
H16. 임의의 선행하는 양태에 있어서, 백신은 SARS-CoV-2 백신이 투여된 대상체를 적어도 70% 확률로 혈청전환시킬 수 있는, SARS-CoV-2 백신.
H17. 양태 H16에 있어서, SARS-CoV-2 백신은 SARS-CoV-2 백신이 투여된 대상체를 적어도 80%, 85%, 90%, 또는 95% 확률로 혈청전환시킬 수 있는, SARS-CoV-2 백신.
H18. 선행하는 양태 중 어느 하나에 있어서, SARS-CoV-2 입자는 (i) 서열번호: 9에 의해 정의된 바와 같은; 또는 (ii) 서열번호: 9에 대해 적어도 80%, 적어도 85%, 적어도 90%, 적어도 95% 또는 적어도 99% 서열 동일성을 갖는 DNA 서열에 상응하는 RNA 서열(및/또는 이의 단편, 선택적으로 변형된(바람직하게는 알킬화된 또는 아실화된) 뉴클레오티드 잔기를 포함함)을 포함하고; 바람직하게 여기서 RNA 서열을 포함하는 천연(비-불활성화된) SARS-CoV-2 입자는 독성 SARS-CoV-2를 패킹할 수 있는, SARS-CoV-2 백신.
H19. 선행하는 양태 중 어느 하나에 있어서, 상기 백신은 (i) 서열번호: 18에 의해 정의된 바와 같은; 또는 (ii) 서열번호: 18에 대해 적어도 80%, 적어도 85%, 적어도 90%, 적어도 95% 또는 적어도 99% 서열 동일성을 갖는 DNA 서열에 상응하는 RNA 서열(및/또는 이의 단편, 선택적으로 변형된(바람직하게는 알킬화된 또는 아실화된) 뉴클레오티드 잔기를 포함함)을 포함하는 부가의 SARS-CoV-2 입자를 포함하고; 바람직하게 여기서 RNA 서열을 포함하는 천연(비-불활성화된) SARS-CoV-2 입자는 독성 SARS-CoV-2를 패킹할 수 있는, SARS-CoV-2 백신.
H20. 선행하는 양태 중 어느 하나에 있어서, 상기 백신은 (i) 서열번호: 22에 의해 정의된 바와 같은; 또는 (ii) 서열번호: 22에 대해 적어도 80%, 적어도 85%, 적어도 90%, 적어도 95% 또는 적어도 99% 서열 동일성을 갖는 DNA 서열에 상응하는 RNA 서열(및/또는 이의 단편, 선택적으로 변형된(바람직하게는 알킬화된 또는 아실화된) 뉴클레오티드 잔기를 포함함)을 포함하는 부가의 SARS-CoV-2 입자를 포함하고; 바람직하게 여기서 RNA 서열을 포함하는 천연(비-불활성화된) SARS-CoV-2 입자는 독성 SARS-CoV-2를 패킹할 수 있는, SARS-CoV-2 백신.
H21. 임의의 선행하는 양태에 있어서, 백신은 Vero 세포로부터 수득되거나 수득가능한, SARS-CoV-2 백신.
H22. 임의의 선행하는 양태에 있어서, 인간 대상체에 투여시, 백신은 (i) SARS-CoV-2-연관된 질환(COVID-19)의 항체-의존성 증진(ADE)을 유도하지 않고; 및/또는 (ii) 대상체에서 면역병리학을 유도하지 않는, SARS-CoV-2 백신.
H23. 예방적으로 또는 치료적으로 유효한 양의 임의의 선행하는 양태의 SARS-CoV-2 백신을 대상체에게 투여하는 것을 포함하는, 예방 또는 치료를 필요로 하는 인간 대상체에서 SARS-CoV-2 감염 및/또는 SARS-CoV-2-연관된 질환(COVID-19)을 예방 또는 치료하는 방법.
H24. 양태 H23에 있어서, 예방적으로 또는 치료적으로 유효한 양의 SARS-CoV-2 백신의 제2 용량을 투여하는 단계를 추가로 포함하고, 바람직하게는 여기서 백신의 제2 용량은 제1 용량과 동일한 제형인, 방법.
H25. 양태 H23 또는 H24에 있어서, 용량당 SARS-CoV-2 백신의 상기 예방적으로 또는 치료적으로 유효한 양은, ELISA에 의해 평가시, 약 1 내지 100AU/용량, 바람직하게는 약 2 내지 75AU/용량, 바람직하게는 약 3 내지 60AU/용량, 보다 바람직하게는 약 3 내지 55AU/용량, 더욱 바람직하게는 약 3 내지 53AU/용량, 훨씬 더 바람직하게는 약 3 내지 40AU/용량 예컨대 예를 들어 40AU/용량으로 한정되는, 방법.
H26. 양태 H23 또는 24에 있어서, SARS-CoV-2 백신의 용량당 상기 예방적으로 또는 치료적으로 유효한 양은 (μ)BCA에 의해 측정시, 약 0.05 내지 50μg 총 단백질, 약 0.1 내지 25μg, 약 0.25 내지 12.5μg, 바람직하게는 약 0.5 내지 5μg 총 단백질로 한정되는, 방법.
H27. 양태 H23 또는 H24에 있어서, SARS-CoV-2 백신의 용량당 상기 예방적으로 또는 치료적으로 유효한 양은 ELISA에 의해 측정시, 약 0.025 내지 25μg S-단백질, 약 0.05 내지 12.5μg, 약 0.125 내지 6.25μg, 바람직하게는 약 0.25 내지 2.5μg S-단백질로 한정되는, 방법.
H28. 양태 H24에 있어서, SARS-CoV-2 백신의 제2 용량은 SARS-CoV-2 백신의 제1 용량 후 약 7일, 약 14일, 약 21일, 또는 약 28일에 투여되고, 바람직하게는 여기서 백신의 제2 용량은 제1 용량과 동일한 제형인, 방법.
H29. 양태 H22 내지 H28 중 어느 하나에 있어서, 투여는 SARS-CoV-2 중화 항체의 생성을 초래하는, 방법.
H30. 다음을 포함하는 SARS-CoV-2 백신을 생성하는 방법:
(a) 천연 SARS-CoV-2 입자를 생성하는 단계;
(b) 천연 SARS-CoV-2 입자를 불활성화하여 불활성화된 SARS-CoV-2 입자를 얻는 단계;
(c) 불활성화된 SARS-CoV-2 입자를 백신 조성물에 혼입하는 단계;
여기서 SARS-CoV-2 입자의 천연 표면 형태는 불활성화 단계에서 보존되어 백신이 인간 대상체에서 천연 SARS-CoV-2 입자에 대한 중화 항체를 생성할 수 있음.
H31. 양태 H30에 있어서, 백신 조성물은 수산화알루미늄을 포함하는, 방법.
H32. 양태 H31에 있어서, 수산화알루미늄을 포함하는 SARS-CoV-2 백신은 1.25ppb 미만의 Cu를 함유하는, 방법.
H33. 양태 H32에 있어서, 불활성화 단계는 SARS-CoV-2 입자에서 바이러스 RNA를 우선적으로 표적화하는, 방법.
H34. 양태 H30 또는 H33에 있어서, 불활성화 단계는 (i) 바이러스 RNA를 알킬화 및/또는 아실화하는 것 (ii) 퓨린(바람직하게는 구아닌) 잔기를 변형하거나 가닥 파손을 바이러스 RNA 안으로 도입하는 것 및/또는 (iii) 하나 이상의 바이러스 단백질과 바이러스 RNA를 가교-결합하는 것을 포함하는, 방법.
H35. 양태 H30, H33 또는 H34 중 어느 하나에 있어서, 불활성화 단계는 천연 SARS-CoV-2 입자를 베타-프로피오락톤으로 처리하는 것을 포함하는, 방법.
H36. 양태 H35에 있어서, 불활성화 단계에서 베타-프로피오락톤의 농도는 0.01 내지 1 중량%, 바람직하게는 0.05 내지 0.5 중량%, 더욱 바람직하게는 약 0.1 중량%인, 방법.
H37. 양태 H35 또는 H36에 있어서, 천연 SARS-CoV-2 입자는 적어도 5시간, 적어도 10시간, 적어도 24시간 또는 적어도 4일 동안 베타-프로피오락톤과 접촉되는, 방법.
H38. 양태 H30 또는 H33 내지 H37 중 어느 하나에 있어서, 불활성화 단계는 약 0℃ 내지 약 25℃, 바람직하게는 약 4℃ 또는 약 22℃에서 수행되는, 방법.
H39. 양태 H30 또는 H33 내지 H38 중 어느 하나에 있어서, 불활성화 단계는 천연 SARS-CoV-2 입자를 자외선(UV) 광으로 처리하는 것을 포함하는, 방법.
H40. 양태 H30 또는 H33 내지 H39 중 어느 하나에 있어서, 단계 (a)는 하기 단계 중 하나 이상을 포함하는, 방법:
(i) Vero 세포 상에 SARS-CoV-2를 계대하고, 이에 의해 SARS-CoV-2를 포함하는 배양 배지를 생성하는 단계;
(ii) (i)의 배양 배지를 수확하는 단계;
(iii) (ii)의 수확된 배양 배지를 침전시키고, 이에 의해 상등액에서 천연 SARS-CoV-2 입자를 생성하는 단계.
H41. 양태 H40에 있어서, 단계 (iii) 이전에 (ii)의 배양 배지를 농축하는 것을 추가로 포함하는, 방법.
H42. 양태 H40 또는 H41에 있어서, (iii)의 침전은 (ii)의 배양 배지를 프로타민 설페이트 또는 벤조나아제와 접촉시키는 것을 포함하는, 방법.
H43. 양태 H30 또는 H33 내지 H42 중 어느 하나에 있어서, 불활성화된 SARS-CoV-2 입자를 투석하고, 이에 의해 투석된 SARS-CoV-2를 생성하는 것을 추가로 포함하는, 방법.
H44. 양태 H43에 있어서, 투석된 SARS-CoV-2를 여과하는 것을 추가로 포함하는, 방법.
H45. 양태 H30 또는 H33 내지 H44 중 어느 하나에 있어서, 불활성화 단계는 천연 SARS-CoV-2 입자를 포함하는 액체 조성물을 용기 내의 화학적 바이러스 불활성화제와 접촉시키는 단계, 화학적 바이러스성 불활성화제 및 SARS-CoV-2 입자를 포함하는 액체 조성물을 난류가 아닌 층류 흐름의 조건 하에서 혼합하는 단계, 및 바이러스 입자를 불활성화시키기에 충분한 시간 동안 화학적 바이러스 불활성화제 및 SARS-CoV-2 입자를 포함하는 액체 조성물을 인큐베이션하는 단계를 포함하는, 방법.
H46. 양태 H45에 있어서, 불활성화 단계는 가요성 생물반응기 백에서 수행되는, 방법.
H47. 양태 H45 또는 H46에 있어서, 불활성화 단계는 불활성화의 기간 동안 5회 이하의 용기 반전을 포함하는, 방법.
H48. 양태 H45 내지 H47 중 어느 하나에 있어서, 화학적 바이러스 불활성화제와 천연 SARS-CoV-2 입자를 포함하는 조성물의 혼합은 인큐베이션의 기간 동안 10rpm 이하에서 10분 이하 동안 용기를 흔들기, 회전, 궤도 진탕 또는 진동시키는 것을 포함하는, 방법.
H49. 양태 H30 또는 H33 내지 H48 중 어느 하나에 있어서, (i) 배치 크로마토그래피 및/또는 (ii) 수크로스 밀도 구배 원심분리로부터 선택되는 하나 이상의 방법에 의해 불활성화된 SARS-CoV-2 입자를 정제하는 것을 추가로 포함하는, 방법.
H50. 양태 H30 또는 H33 내지 H49 중 어느 하나에 있어서, 단계 (c)는 불활성화된 SARS-CoV-2 입자를 어쥬번트와 조합하는 것을 포함하는, 방법.
H51. 양태 H50에 있어서, 어쥬번트는 Th1 반응-지향 어쥬번트를 포함하는, 방법.
H52. 양태 H50 또는 H51에 있어서, 어쥬번트는 3-O-데사실-4'-모노포스포릴 지질 A(MPL), 사포닌 QS-21, CpG-함유 올리고데옥시뉴클레오티드(CpG ODN), 스쿠알렌, DL-α-토코페롤 및/또는 이미퀴모드를 포함하는, 방법.
H53. 양태 H30 또는 H33 내지 H52 중 어느 하나의 방법에 의해 수득되거나 수득가능한 SARS-CoV-2 백신.
H54. 대상체에서 SARS-CoV-2 감염의 치료 또는 예방을 위한 양태 H1 내지 H22 또는 H53 중 어느 하나의 SARS-CoV-2 백신의 용도.
H55. 대상체에서 SARS-CoV-2 감염의 예방 또는 치료에 사용하기 위한 약학적 조성물로서, 상기 약학적 조성물은 선택적으로 하나 이상의 약학적으로 허용가능한 부형제 및/또는 어쥬번트와 조합된 양태 H1 내지 H22 또는 H53 중 어느 하나에 정의된 바와 같은 불활성화된 SARS-CoV-2 백신인, 약학적 조성물.
H56. 의약으로서 사용하기 위한 양태 H1 내지 H22 또는 H53 중 어느 하나에 정의된 바와 같은 SARS-CoV-2 백신.
H57. 임의의 선행하는 양태에 따른 백신, 방법, 용도 또는 약학적 조성물로서, 여기서 대상체는 (i) 고령 대상체, 바람직하게는 65세 이상, 70세 이상 또는 80세 이상의 대상체; (ii) 면역저하 대상체; 또는 (iii) 임신한 대상체인, 백신, 방법, 용도 또는 약학적 조성물.
H58. (i) SARS-CoV-2-연관된 질환(COVID-19)의 항체-의존성 증진(ADE); 및/또는 (ii) 대상체에서 면역병리학의 유도 없이 SARS-CoV-2 감염의 예방 또는 치료에 사용하기 위한, 임의의 선행하는 양태에 따른 백신, 방법, 용도 또는 약학적 조성물.
본 출원은 EP20168324.0(2020년 4월 6일), EP 20202118.4(2020년 10월 15일), EP 20211853.5(2020년 12월 4일) EP21154647.8(2021년 2월 1일), PCT/US2021/20313(2021년 3월 1일) 및 EP 21160913.6(2021년 3월 5일)으로부터 우선권을 주장하며, 그 내용은 참고로 본 명세서에 포함된다. 상기 명세서에서 언급된 모든 간행물은 참고로 본 명세서에 포함된다. 본 발명의 기술된 실시형태의 다양한 변형 및 변경은 본 발명의 범주 및 사상을 벗어나지 않고 당업자에게 명백할 것이다. 본 발명이 특정 바람직한 실시형태와 관련하여 기술되었지만, 청구된 바와 같은 발명이 그러한 특정 실시형태에 과도하게 제한되어서는 안된다는 것을 이해해야 한다. 실제로, 당업자에게 자명한 발명을 수행하기 위한 기술된 양식의 다양한 변형은 다음 청구범위의 범주 내에 있는 것으로 의도된다.
서열
서열번호: 1
중증 급성 호흡기 증후군 코로나바이러스 2 (SARS-CoV-2) 단리물 우한-Hu-1, 완전 게놈 (GenBank: MN908947; Wu, F., et al. A new coronavirus associated with 인간 respiratory disease in China (2020) Nature 579:265-269)
ATTAAAGGTTTATACCTTCCCAGGTAACAAACCAACCAACTTTCGATCTCTTGTAGATCTGTTCTCTAAACGAACTTTAAAATCTGTGTGGCTGTCACTCGGCTGCATGCTTAGTGCACTCACGCAGTATAATTAATAACTAATTACTGTCGTTGACAGGACACGAGTAACTCGTCTATCTTCTGCAGGCTGCTTACGGTTTCGTCCGTGTTGCAGCCGATCATCAGCACATCTAGGTTTCGTCCGGGTGTGACCGAAAGGTAAGATGGAGAGCCTTGTCCCTGGTTTCAACGAGAAAACACACGTCCAACTCAGTTTGCCTGTTTTACAGGTTCGCGACGTGCTCGTACGTGGCTTTGGAGACTCCGTGGAGGAGGTCTTATCAGAGGCACGTCAACATCTTAAAGATGGCACTTGTGGCTTAGTAGAAGTTGAAAAAGGCGTTTTGCCTCAACTTGAACAGCCCTATGTGTTCATCAAACGTTCGGATGCTCGAACTGCACCTCATGGTCATGTTATGGTTGAGCTGGTAGCAGAACTCGAAGGCATTCAGTACGGTCGTAGTGGTGAGACACTTGGTGTCCTTGTCCCTCATGTGGGCGAAATACCAGTGGCTTACCGCAAGGTTCTTCTTCGTAAGAACGGTAATAAAGGAGCTGGTGGCCATAGTTACGGCGCCGATCTAAAGTCATTTGACTTAGGCGACGAGCTTGGCACTGATCCTTATGAAGATTTTCAAGAAAACTGGAACACTAAACATAGCAGTGGTGTTACCCGTGAACTCATGCGTGAGCTTAACGGAGGGGCATACACTCGCTATGTCGATAACAACTTCTGTGGCCCTGATGGCTACCCTCTTGAGTGCATTAAAGACCTTCTAGCACGTGCTGGTAAAGCTTCATGCACTTTGTCCGAACAACTGGACTTTATTGACACTAAGAGGGGTGTATACTGCTGCCGTGAACATGAGCATGAAATTGCTTGGTACACGGAACGTTCTGAAAAGAGCTATGAATTGCAGACACCTTTTGAAATTAAATTGGCAAAGAAATTTGACACCTTCAATGGGGAATGTCCAAATTTTGTATTTCCCTTAAATTCCATAATCAAGACTATTCAACCAAGGGTTGAAAAGAAAAAGCTTGATGGCTTTATGGGTAGAATTCGATCTGTCTATCCAGTTGCGTCACCAAATGAATGCAACCAAATGTGCCTTTCAACTCTCATGAAGTGTGATCATTGTGGTGAAACTTCATGGCAGACGGGCGATTTTGTTAAAGCCACTTGCGAATTTTGTGGCACTGAGAATTTGACTAAAGAAGGTGCCACTACTTGTGGTTACTTACCCCAAAATGCTGTTGTTAAAATTTATTGTCCAGCATGTCACAATTCAGAAGTAGGACCTGAGCATAGTCTTGCCGAATACCATAATGAATCTGGCTTGAAAACCATTCTTCGTAAGGGTGGTCGCACTATTGCCTTTGGAGGCTGTGTGTTCTCTTATGTTGGTTGCCATAACAAGTGTGCCTATTGGGTTCCACGTGCTAGCGCTAACATAGGTTGTAACCATACAGGTGTTGTTGGAGAAGGTTCCGAAGGTCTTAATGACAACCTTCTTGAAATACTCCAAAAAGAGAAAGTCAACATCAATATTGTTGGTGACTTTAAACTTAATGAAGAGATCGCCATTATTTTGGCATCTTTTTCTGCTTCCACAAGTGCTTTTGTGGAAACTGTGAAAGGTTTGGATTATAAAGCATTCAAACAAATTGTTGAATCCTGTGGTAATTTTAAAGTTACAAAAGGAAAAGCTAAAAAAGGTGCCTGGAATATTGGTGAACAGAAATCAATACTGAGTCCTCTTTATGCATTTGCATCAGAGGCTGCTCGTGTTGTACGATCAATTTTCTCCCGCACTCTTGAAACTGCTCAAAATTCTGTGCGTGTTTTACAGAAGGCCGCTATAACAATACTAGATGGAATTTCACAGTATTCACTGAGACTCATTGATGCTATGATGTTCACATCTGATTTGGCTACTAACAATCTAGTTGTAATGGCCTACATTACAGGTGGTGTTGTTCAGTTGACTTCGCAGTGGCTAACTAACATCTTTGGCACTGTTTATGAAAAACTCAAACCCGTCCTTGATTGGCTTGAAGAGAAGTTTAAGGAAGGTGTAGAGTTTCTTAGAGACGGTTGGGAAATTGTTAAATTTATCTCAACCTGTGCTTGTGAAATTGTCGGTGGACAAATTGTCACCTGTGCAAAGGAAATTAAGGAGAGTGTTCAGACATTCTTTAAGCTTGTAAATAAATTTTTGGCTTTGTGTGCTGACTCTATCATTATTGGTGGAGCTAAACTTAAAGCCTTGAATTTAGGTGAAACATTTGTCACGCACTCAAAGGGATTGTACAGAAAGTGTGTTAAATCCAGAGAAGAAACTGGCCTACTCATGCCTCTAAAAGCCCCAAAAGAAATTATCTTCTTAGAGGGAGAAACACTTCCCACAGAAGTGTTAACAGAGGAAGTTGTCTTGAAAACTGGTGATTTACAACCATTAGAACAACCTACTAGTGAAGCTGTTGAAGCTCCATTGGTTGGTACACCAGTTTGTATTAACGGGCTTATGTTGCTCGAAATCAAAGACACAGAAAAGTACTGTGCCCTTGCACCTAATATGATGGTAACAAACAATACCTTCACACTCAAAGGCGGTGCACCAACAAAGGTTACTTTTGGTGATGACACTGTGATAGAAGTGCAAGGTTACAAGAGTGTGAATATCACTTTTGAACTTGATGAAAGGATTGATAAAGTACTTAATGAGAAGTGCTCTGCCTATACAGTTGAACTCGGTACAGAAGTAAATGAGTTCGCCTGTGTTGTGGCAGATGCTGTCATAAAAACTTTGCAACCAGTATCTGAATTACTTACACCACTGGGCATTGATTTAGATGAGTGGAGTATGGCTACATACTACTTATTTGATGAGTCTGGTGAGTTTAAATTGGCTTCACATATGTATTGTTCTTTCTACCCTCCAGATGAGGATGAAGAAGAAGGTGATTGTGAAGAAGAAGAGTTTGAGCCATCAACTCAATATGAGTATGGTACTGAAGATGATTACCAAGGTAAACCTTTGGAATTTGGTGCCACTTCTGCTGCTCTTCAACCTGAAGAAGAGCAAGAAGAAGATTGGTTAGATGATGATAGTCAACAAACTGTTGGTCAACAAGACGGCAGTGAGGACAATCAGACAACTACTATTCAAACAATTGTTGAGGTTCAACCTCAATTAGAGATGGAACTTACACCAGTTGTTCAGACTATTGAAGTGAATAGTTTTAGTGGTTATTTAAAACTTACTGACAATGTATACATTAAAAATGCAGACATTGTGGAAGAAGCTAAAAAGGTAAAACCAACAGTGGTTGTTAATGCAGCCAATGTTTACCTTAAACATGGAGGAGGTGTTGCAGGAGCCTTAAATAAGGCTACTAACAATGCCATGCAAGTTGAATCTGATGATTACATAGCTACTAATGGACCACTTAAAGTGGGTGGTAGTTGTGTTTTAAGCGGACACAATCTTGCTAAACACTGTCTTCATGTTGTCGGCCCAAATGTTAACAAAGGTGAAGACATTCAACTTCTTAAGAGTGCTTATGAAAATTTTAATCAGCACGAAGTTCTACTTGCACCATTATTATCAGCTGGTATTTTTGGTGCTGACCCTATACATTCTTTAAGAGTTTGTGTAGATACTGTTCGCACAAATGTCTACTTAGCTGTCTTTGATAAAAATCTCTATGACAAACTTGTTTCAAGCTTTTTGGAAATGAAGAGTGAAAAGCAAGTTGAACAAAAGATCGCTGAGATTCCTAAAGAGGAAGTTAAGCCATTTATAACTGAAAGTAAACCTTCAGTTGAACAGAGAAAACAAGATGATAAGAAAATCAAAGCTTGTGTTGAAGAAGTTACAACAACTCTGGAAGAAACTAAGTTCCTCACAGAAAACTTGTTACTTTATATTGACATTAATGGCAATCTTCATCCAGATTCTGCCACTCTTGTTAGTGACATTGACATCACTTTCTTAAAGAAAGATGCTCCATATATAGTGGGTGATGTTGTTCAAGAGGGTGTTTTAACTGCTGTGGTTATACCTACTAAAAAGGCTGGTGGCACTACTGAAATGCTAGCGAAAGCTTTGAGAAAAGTGCCAACAGACAATTATATAACCACTTACCCGGGTCAGGGTTTAAATGGTTACACTGTAGAGGAGGCAAAGACAGTGCTTAAAAAGTGTAAAAGTGCCTTTTACATTCTACCATCTATTATCTCTAATGAGAAGCAAGAAATTCTTGGAACTGTTTCTTGGAATTTGCGAGAAATGCTTGCACATGCAGAAGAAACACGCAAATTAATGCCTGTCTGTGTGGAAACTAAAGCCATAGTTTCAACTATACAGCGTAAATATAAGGGTATTAAAATACAAGAGGGTGTGGTTGATTATGGTGCTAGATTTTACTTTTACACCAGTAAAACAACTGTAGCGTCACTTATCAACACACTTAACGATCTAAATGAAACTCTTGTTACAATGCCACTTGGCTATGTAACACATGGCTTAAATTTGGAAGAAGCTGCTCGGTATATGAGATCTCTCAAAGTGCCAGCTACAGTTTCTGTTTCTTCACCTGATGCTGTTACAGCGTATAATGGTTATCTTACTTCTTCTTCTAAAACACCTGAAGAACATTTTATTGAAACCATCTCACTTGCTGGTTCCTATAAAGATTGGTCCTATTCTGGACAATCTACACAACTAGGTATAGAATTTCTTAAGAGAGGTGATAAAAGTGTATATTACACTAGTAATCCTACCACATTCCACCTAGATGGTGAAGTTATCACCTTTGACAATCTTAAGACACTTCTTTCTTTGAGAGAAGTGAGGACTATTAAGGTGTTTACAACAGTAGACAACATTAACCTCCACACGCAAGTTGTGGACATGTCAATGACATATGGACAACAGTTTGGTCCAACTTATTTGGATGGAGCTGATGTTACTAAAATAAAACCTCATAATTCACATGAAGGTAAAACATTTTATGTTTTACCTAATGATGACACTCTACGTGTTGAGGCTTTTGAGTACTACCACACAACTGATCCTAGTTTTCTGGGTAGGTACATGTCAGCATTAAATCACACTAAAAAGTGGAAATACCCACAAGTTAATGGTTTAACTTCTATTAAATGGGCAGATAACAACTGTTATCTTGCCACTGCATTGTTAACACTCCAACAAATAGAGTTGAAGTTTAATCCACCTGCTCTACAAGATGCTTATTACAGAGCAAGGGCTGGTGAAGCTGCTAACTTTTGTGCACTTATCTTAGCCTACTGTAATAAGACAGTAGGTGAGTTAGGTGATGTTAGAGAAACAATGAGTTACTTGTTTCAACATGCCAATTTAGATTCTTGCAAAAGAGTCTTGAACGTGGTGTGTAAAACTTGTGGACAACAGCAGACAACCCTTAAGGGTGTAGAAGCTGTTATGTACATGGGCACACTTTCTTATGAACAATTTAAGAAAGGTGTTCAGATACCTTGTACGTGTGGTAAACAAGCTACAAAATATCTAGTACAACAGGAGTCACCTTTTGTTATGATGTCAGCACCACCTGCTCAGTATGAACTTAAGCATGGTACATTTACTTGTGCTAGTGAGTACACTGGTAATTACCAGTGTGGTCACTATAAACATATAACTTCTAAAGAAACTTTGTATTGCATAGACGGTGCTTTACTTACAAAGTCCTCAGAATACAAAGGTCCTATTACGGATGTTTTCTACAAAGAAAACAGTTACACAACAACCATAAAACCAGTTACTTATAAATTGGATGGTGTTGTTTGTACAGAAATTGACCCTAAGTTGGACAATTATTATAAGAAAGACAATTCTTATTTCACAGAGCAACCAATTGATCTTGTACCAAACCAACCATATCCAAACGCAAGCTTCGATAATTTTAAGTTTGTATGTGATAATATCAAATTTGCTGATGATTTAAACCAGTTAACTGGTTATAAGAAACCTGCTTCAAGAGAGCTTAAAGTTACATTTTTCCCTGACTTAAATGGTGATGTGGTGGCTATTGATTATAAACACTACACACCCTCTTTTAAGAAAGGAGCTAAATTGTTACATAAACCTATTGTTTGGCATGTTAACAATGCAACTAATAAAGCCACGTATAAACCAAATACCTGGTGTATACGTTGTCTTTGGAGCACAAAACCAGTTGAAACATCAAATTCGTTTGATGTACTGAAGTCAGAGGACGCGCAGGGAATGGATAATCTTGCCTGCGAAGATCTAAAACCAGTCTCTGAAGAAGTAGTGGAAAATCCTACCATACAGAAAGACGTTCTTGAGTGTAATGTGAAAACTACCGAAGTTGTAGGAGACATTATACTTAAACCAGCAAATAATAGTTTAAAAATTACAGAAGAGGTTGGCCACACAGATCTAATGGCTGCTTATGTAGACAATTCTAGTCTTACTATTAAGAAACCTAATGAATTATCTAGAGTATTAGGTTTGAAAACCCTTGCTACTCATGGTTTAGCTGCTGTTAATAGTGTCCCTTGGGATACTATAGCTAATTATGCTAAGCCTTTTCTTAACAAAGTTGTTAGTACAACTACTAACATAGTTACACGGTGTTTAAACCGTGTTTGTACTAATTATATGCCTTATTTCTTTACTTTATTGCTACAATTGTGTACTTTTACTAGAAGTACAAATTCTAGAATTAAAGCATCTATGCCGACTACTATAGCAAAGAATACTGTTAAGAGTGTCGGTAAATTTTGTCTAGAGGCTTCATTTAATTATTTGAAGTCACCTAATTTTTCTAAACTGATAAATATTATAATTTGGTTTTTACTATTAAGTGTTTGCCTAGGTTCTTTAATCTACTCAACCGCTGCTTTAGGTGTTTTAATGTCTAATTTAGGCATGCCTTCTTACTGTACTGGTTACAGAGAAGGCTATTTGAACTCTACTAATGTCACTATTGCAACCTACTGTACTGGTTCTATACCTTGTAGTGTTTGTCTTAGTGGTTTAGATTCTTTAGACACCTATCCTTCTTTAGAAACTATACAAATTACCATTTCATCTTTTAAATGGGATTTAACTGCTTTTGGCTTAGTTGCAGAGTGGTTTTTGGCATATATTCTTTTCACTAGGTTTTTCTATGTACTTGGATTGGCTGCAATCATGCAATTGTTTTTCAGCTATTTTGCAGTACATTTTATTAGTAATTCTTGGCTTATGTGGTTAATAATTAATCTTGTACAAATGGCCCCGATTTCAGCTATGGTTAGAATGTACATCTTCTTTGCATCATTTTATTATGTATGGAAAAGTTATGTGCATGTTGTAGACGGTTGTAATTCATCAACTTGTATGATGTGTTACAAACGTAATAGAGCAACAAGAGTCGAATGTACAACTATTGTTAATGGTGTTAGAAGGTCCTTTTATGTCTATGCTAATGGAGGTAAAGGCTTTTGCAAACTACACAATTGGAATTGTGTTAATTGTGATACATTCTGTGCTGGTAGTACATTTATTAGTGATGAAGTTGCGAGAGACTTGTCACTACAGTTTAAAAGACCAATAAATCCTACTGACCAGTCTTCTTACATCGTTGATAGTGTTACAGTGAAGAATGGTTCCATCCATCTTTACTTTGATAAAGCTGGTCAAAAGACTTATGAAAGACATTCTCTCTCTCATTTTGTTAACTTAGACAACCTGAGAGCTAATAACACTAAAGGTTCATTGCCTATTAATGTTATAGTTTTTGATGGTAAATCAAAATGTGAAGAATCATCTGCAAAATCAGCGTCTGTTTACTACAGTCAGCTTATGTGTCAACCTATACTGTTACTAGATCAGGCATTAGTGTCTGATGTTGGTGATAGTGCGGAAGTTGCAGTTAAAATGTTTGATGCTTACGTTAATACGTTTTCATCAACTTTTAACGTACCAATGGAAAAACTCAAAACACTAGTTGCAACTGCAGAAGCTGAACTTGCAAAGAATGTGTCCTTAGACAATGTCTTATCTACTTTTATTTCAGCAGCTCGGCAAGGGTTTGTTGATTCAGATGTAGAAACTAAAGATGTTGTTGAATGTCTTAAATTGTCACATCAATCTGACATAGAAGTTACTGGCGATAGTTGTAATAACTATATGCTCACCTATAACAAAGTTGAAAACATGACACCCCGTGACCTTGGTGCTTGTATTGACTGTAGTGCGCGTCATATTAATGCGCAGGTAGCAAAAAGTCACAACATTGCTTTGATATGGAACGTTAAAGATTTCATGTCATTGTCTGAACAACTACGAAAACAAATACGTAGTGCTGCTAAAAAGAATAACTTACCTTTTAAGTTGACATGTGCAACTACTAGACAAGTTGTTAATGTTGTAACAACAAAGATAGCACTTAAGGGTGGTAAAATTGTTAATAATTGGTTGAAGCAGTTAATTAAAGTTACACTTGTGTTCCTTTTTGTTGCTGCTATTTTCTATTTAATAACACCTGTTCATGTCATGTCTAAACATACTGACTTTTCAAGTGAAATCATAGGATACAAGGCTATTGATGGTGGTGTCACTCGTGACATAGCATCTACAGATACTTGTTTTGCTAACAAACATGCTGATTTTGACACATGGTTTAGCCAGCGTGGTGGTAGTTATACTAATGACAAAGCTTGCCCATTGATTGCTGCAGTCATAACAAGAGAAGTGGGTTTTGTCGTGCCTGGTTTGCCTGGCACGATATTACGCACAACTAATGGTGACTTTTTGCATTTCTTACCTAGAGTTTTTAGTGCAGTTGGTAACATCTGTTACACACCATCAAAACTTATAGAGTACACTGACTTTGCAACATCAGCTTGTGTTTTGGCTGCTGAATGTACAATTTTTAAAGATGCTTCTGGTAAGCCAGTACCATATTGTTATGATACCAATGTACTAGAAGGTTCTGTTGCTTATGAAAGTTTACGCCCTGACACACGTTATGTGCTCATGGATGGCTCTATTATTCAATTTCCTAACACCTACCTTGAAGGTTCTGTTAGAGTGGTAACAACTTTTGATTCTGAGTACTGTAGGCACGGCACTTGTGAAAGATCAGAAGCTGGTGTTTGTGTATCTACTAGTGGTAGATGGGTACTTAACAATGATTATTACAGATCTTTACCAGGAGTTTTCTGTGGTGTAGATGCTGTAAATTTACTTACTAATATGTTTACACCACTAATTCAACCTATTGGTGCTTTGGACATATCAGCATCTATAGTAGCTGGTGGTATTGTAGCTATCGTAGTAACATGCCTTGCCTACTATTTTATGAGGTTTAGAAGAGCTTTTGGTGAATACAGTCATGTAGTTGCCTTTAATACTTTACTATTCCTTATGTCATTCACTGTACTCTGTTTAACACCAGTTTACTCATTCTTACCTGGTGTTTATTCTGTTATTTACTTGTACTTGACATTTTATCTTACTAATGATGTTTCTTTTTTAGCACATATTCAGTGGATGGTTATGTTCACACCTTTAGTACCTTTCTGGATAACAATTGCTTATATCATTTGTATTTCCACAAAGCATTTCTATTGGTTCTTTAGTAATTACCTAAAGAGACGTGTAGTCTTTAATGGTGTTTCCTTTAGTACTTTTGAAGAAGCTGCGCTGTGCACCTTTTTGTTAAATAAAGAAATGTATCTAAAGTTGCGTAGTGATGTGCTATTACCTCTTACGCAATATAATAGATACTTAGCTCTTTATAATAAGTACAAGTATTTTAGTGGAGCAATGGATACAACTAGCTACAGAGAAGCTGCTTGTTGTCATCTCGCAAAGGCTCTCAATGACTTCAGTAACTCAGGTTCTGATGTTCTTTACCAACCACCACAAACCTCTATCACCTCAGCTGTTTTGCAGAGTGGTTTTAGAAAAATGGCATTCCCATCTGGTAAAGTTGAGGGTTGTATGGTACAAGTAACTTGTGGTACAACTACACTTAACGGTCTTTGGCTTGATGACGTAGTTTACTGTCCAAGACATGTGATCTGCACCTCTGAAGACATGCTTAACCCTAATTATGAAGATTTACTCATTCGTAAGTCTAATCATAATTTCTTGGTACAGGCTGGTAATGTTCAACTCAGGGTTATTGGACATTCTATGCAAAATTGTGTACTTAAGCTTAAGGTTGATACAGCCAATCCTAAGACACCTAAGTATAAGTTTGTTCGCATTCAACCAGGACAGACTTTTTCAGTGTTAGCTTGTTACAATGGTTCACCATCTGGTGTTTACCAATGTGCTATGAGGCCCAATTTCACTATTAAGGGTTCATTCCTTAATGGTTCATGTGGTAGTGTTGGTTTTAACATAGATTATGACTGTGTCTCTTTTTGTTACATGCACCATATGGAATTACCAACTGGAGTTCATGCTGGCACAGACTTAGAAGGTAACTTTTATGGACCTTTTGTTGACAGGCAAACAGCACAAGCAGCTGGTACGGACACAACTATTACAGTTAATGTTTTAGCTTGGTTGTACGCTGCTGTTATAAATGGAGACAGGTGGTTTCTCAATCGATTTACCACAACTCTTAATGACTTTAACCTTGTGGCTATGAAGTACAATTATGAACCTCTAACACAAGACCATGTTGACATACTAGGACCTCTTTCTGCTCAAACTGGAATTGCCGTTTTAGATATGTGTGCTTCATTAAAAGAATTACTGCAAAATGGTATGAATGGACGTACCATATTGGGTAGTGCTTTATTAGAAGATGAATTTACACCTTTTGATGTTGTTAGACAATGCTCAGGTGTTACTTTCCAAAGTGCAGTGAAAAGAACAATCAAGGGTACACACCACTGGTTGTTACTCACAATTTTGACTTCACTTTTAGTTTTAGTCCAGAGTACTCAATGGTCTTTGTTCTTTTTTTTGTATGAAAATGCCTTTTTACCTTTTGCTATGGGTATTATTGCTATGTCTGCTTTTGCAATGATGTTTGTCAAACATAAGCATGCATTTCTCTGTTTGTTTTTGTTACCTTCTCTTGCCACTGTAGCTTATTTTAATATGGTCTATATGCCTGCTAGTTGGGTGATGCGTATTATGACATGGTTGGATATGGTTGATACTAGTTTGTCTGGTTTTAAGCTAAAAGACTGTGTTATGTATGCATCAGCTGTAGTGTTACTAATCCTTATGACAGCAAGAACTGTGTATGATGATGGTGCTAGGAGAGTGTGGACACTTATGAATGTCTTGACACTCGTTTATAAAGTTTATTATGGTAATGCTTTAGATCAAGCCATTTCCATGTGGGCTCTTATAATCTCTGTTACTTCTAACTACTCAGGTGTAGTTACAACTGTCATGTTTTTGGCCAGAGGTATTGTTTTTATGTGTGTTGAGTATTGCCCTATTTTCTTCATAACTGGTAATACACTTCAGTGTATAATGCTAGTTTATTGTTTCTTAGGCTATTTTTGTACTTGTTACTTTGGCCTCTTTTGTTTACTCAACCGCTACTTTAGACTGACTCTTGGTGTTTATGATTACTTAGTTTCTACACAGGAGTTTAGATATATGAATTCACAGGGACTACTCCCACCCAAGAATAGCATAGATGCCTTCAAACTCAACATTAAATTGTTGGGTGTTGGTGGCAAACCTTGTATCAAAGTAGCCACTGTACAGTCTAAAATGTCAGATGTAAAGTGCACATCAGTAGTCTTACTCTCAGTTTTGCAACAACTCAGAGTAGAATCATCATCTAAATTGTGGGCTCAATGTGTCCAGTTACACAATGACATTCTCTTAGCTAAAGATACTACTGAAGCCTTTGAAAAAATGGTTTCACTACTTTCTGTTTTGCTTTCCATGCAGGGTGCTGTAGACATAAACAAGCTTTGTGAAGAAATGCTGGACAACAGGGCAACCTTACAAGCTATAGCCTCAGAGTTTAGTTCCCTTCCATCATATGCAGCTTTTGCTACTGCTCAAGAAGCTTATGAGCAGGCTGTTGCTAATGGTGATTCTGAAGTTGTTCTTAAAAAGTTGAAGAAGTCTTTGAATGTGGCTAAATCTGAATTTGACCGTGATGCAGCCATGCAACGTAAGTTGGAAAAGATGGCTGATCAAGCTATGACCCAAATGTATAAACAGGCTAGATCTGAGGACAAGAGGGCAAAAGTTACTAGTGCTATGCAGACAATGCTTTTCACTATGCTTAGAAAGTTGGATAATGATGCACTCAACAACATTATCAACAATGCAAGAGATGGTTGTGTTCCCTTGAACATAATACCTCTTACAACAGCAGCCAAACTAATGGTTGTCATACCAGACTATAACACATATAAAAATACGTGTGATGGTACAACATTTACTTATGCATCAGCATTGTGGGAAATCCAACAGGTTGTAGATGCAGATAGTAAAATTGTTCAACTTAGTGAAATTAGTATGGACAATTCACCTAATTTAGCATGGCCTCTTATTGTAACAGCTTTAAGGGCCAATTCTGCTGTCAAATTACAGAATAATGAGCTTAGTCCTGTTGCACTACGACAGATGTCTTGTGCTGCCGGTACTACACAAACTGCTTGCACTGATGACAATGCGTTAGCTTACTACAACACAACAAAGGGAGGTAGGTTTGTACTTGCACTGTTATCCGATTTACAGGATTTGAAATGGGCTAGATTCCCTAAGAGTGATGGAACTGGTACTATCTATACAGAACTGGAACCACCTTGTAGGTTTGTTACAGACACACCTAAAGGTCCTAAAGTGAAGTATTTATACTTTATTAAAGGATTAAACAACCTAAATAGAGGTATGGTACTTGGTAGTTTAGCTGCCACAGTACGTCTACAAGCTGGTAATGCAACAGAAGTGCCTGCCAATTCAACTGTATTATCTTTCTGTGCTTTTGCTGTAGATGCTGCTAAAGCTTACAAAGATTATCTAGCTAGTGGGGGACAACCAATCACTAATTGTGTTAAGATGTTGTGTACACACACTGGTACTGGTCAGGCAATAACAGTTACACCGGAAGCCAATATGGATCAAGAATCCTTTGGTGGTGCATCGTGTTGTCTGTACTGCCGTTGCCACATAGATCATCCAAATCCTAAAGGATTTTGTGACTTAAAAGGTAAGTATGTACAAATACCTACAACTTGTGCTAATGACCCTGTGGGTTTTACACTTAAAAACACAGTCTGTACCGTCTGCGGTATGTGGAAAGGTTATGGCTGTAGTTGTGATCAACTCCGCGAACCCATGCTTCAGTCAGCTGATGCACAATCGTTTTTAAACGGGTTTGCGGTGTAAGTGCAGCCCGTCTTACACCGTGCGGCACAGGCACTAGTACTGATGTCGTATACAGGGCTTTTGACATCTACAATGATAAAGTAGCTGGTTTTGCTAAATTCCTAAAAACTAATTGTTGTCGCTTCCAAGAAAAGGACGAAGATGACAATTTAATTGATTCTTACTTTGTAGTTAAGAGACACACTTTCTCTAACTACCAACATGAAGAAACAATTTATAATTTACTTAAGGATTGTCCAGCTGTTGCTAAACATGACTTCTTTAAGTTTAGAATAGACGGTGACATGGTACCACATATATCACGTCAACGTCTTACTAAATACACAATGGCAGACCTCGTCTATGCTTTAAGGCATTTTGATGAAGGTAATTGTGACACATTAAAAGAAATACTTGTCACATACAATTGTTGTGATGATGATTATTTCAATAAAAAGGACTGGTATGATTTTGTAGAAAACCCAGATATATTACGCGTATACGCCAACTTAGGTGAACGTGTACGCCAAGCTTTGTTAAAAACAGTACAATTCTGTGATGCCATGCGAAATGCTGGTATTGTTGGTGTACTGACATTAGATAATCAAGATCTCAATGGTAACTGGTATGATTTCGGTGATTTCATACAAACCACGCCAGGTAGTGGAGTTCCTGTTGTAGATTCTTATTATTCATTGTTAATGCCTATATTAACCTTGACCAGGGCTTTAACTGCAGAGTCACATGTTGACACTGACTTAACAAAGCCTTACATTAAGTGGGATTTGTTAAAATATGACTTCACGGAAGAGAGGTTAAAACTCTTTGACCGTTATTTTAAATATTGGGATCAGACATACCACCCAAATTGTGTTAACTGTTTGGATGACAGATGCATTCTGCATTGTGCAAACTTTAATGTTTTATTCTCTACAGTGTTCCCACCTACAAGTTTTGGACCACTAGTGAGAAAAATATTTGTTGATGGTGTTCCATTTGTAGTTTCAACTGGATACCACTTCAGAGAGCTAGGTGTTGTACATAATCAGGATGTAAACTTACATAGCTCTAGACTTAGTTTTAAGGAATTACTTGTGTATGCTGCTGACCCTGCTATGCACGCTGCTTCTGGTAATCTATTACTAGATAAACGCACTACGTGCTTTTCAGTAGCTGCACTTACTAACAATGTTGCTTTTCAAACTGTCAAACCCGGTAATTTTAACAAAGACTTCTATGACTTTGCTGTGTCTAAGGGTTTCTTTAAGGAAGGAAGTTCTGTTGAATTAAAACACTTCTTCTTTGCTCAGGATGGTAATGCTGCTATCAGCGATTATGACTACTATCGTTATAATCTACCAACAATGTGTGATATCAGACAACTACTATTTGTAGTTGAAGTTGTTGATAAGTACTTTGATTGTTACGATGGTGGCTGTATTAATGCTAACCAAGTCATCGTCAACAACCTAGACAAATCAGCTGGTTTTCCATTTAATAAATGGGGTAAGGCTAGACTTTATTATGATTCAATGAGTTATGAGGATCAAGATGCACTTTTCGCATATACAAAACGTAATGTCATCCCTACTATAACTCAAATGAATCTTAAGTATGCCATTAGTGCAAAGAATAGAGCTCGCACCGTAGCTGGTGTCTCTATCTGTAGTACTATGACCAATAGACAGTTTCATCAAAAATTATTGAAATCAATAGCCGCCACTAGAGGAGCTACTGTAGTAATTGGAACAAGCAAATTCTATGGTGGTTGGCACAACATGTTAAAAACTGTTTATAGTGATGTAGAAAACCCTCACCTTATGGGTTGGGATTATCCTAAATGTGATAGAGCCATGCCTAACATGCTTAGAATTATGGCCTCACTTGTTCTTGCTCGCAAACATACAACGTGTTGTAGCTTGTCACACCGTTTCTATAGATTAGCTAATGAGTGTGCTCAAGTATTGAGTGAAATGGTCATGTGTGGCGGTTCACTATATGTTAAACCAGGTGGAACCTCATCAGGAGATGCCACAACTGCTTATGCTAATAGTGTTTTTAACATTTGTCAAGCTGTCACGGCCAATGTTAATGCACTTTTATCTACTGATGGTAACAAAATTGCCGATAAGTATGTCCGCAATTTACAACACAGACTTTATGAGTGTCTCTATAGAAATAGAGATGTTGACACAGACTTTGTGAATGAGTTTTACGCATATTTGCGTAAACATTTCTCAATGATGATACTCTCTGACGATGCTGTTGTGTGTTTCAATAGCACTTATGCATCTCAAGGTCTAGTGGCTAGCATAAAGAACTTTAAGTCAGTTCTTTATTATCAAAACAATGTTTTTATGTCTGAAGCAAAATGTTGGACTGAGACTGACCTTACTAAAGGACCTCATGAATTTTGCTCTCAACATACAATGCTAGTTAAACAGGGTGATGATTATGTGTACCTTCCTTACCCAGATCCATCAAGAATCCTAGGGGCCGGCTGTTTTGTAGATGATATCGTAAAAACAGATGGTACACTTATGATTGAACGGTTCGTGTCTTTAGCTATAGATGCTTACCCACTTACTAAACATCCTAATCAGGAGTATGCTGATGTCTTTCATTTGTACTTACAATACATAAGAAAGCTACATGATGAGTTAACAGGACACATGTTAGACATGTATTCTGTTATGCTTACTAATGATAACACTTCAAGGTATTGGGAACCTGAGTTTTATGAGGCTATGTACACACCGCATACAGTCTTACAGGCTGTTGGGGCTTGTGTTCTTTGCAATTCACAGACTTCATTAAGATGTGGTGCTTGCATACGTAGACCATTCTTATGTTGTAAATGCTGTTACGACCATGTCATATCAACATCACATAAATTAGTCTTGTCTGTTAATCCGTATGTTTGCAATGCTCCAGGTTGTGATGTCACAGATGTGACTCAACTTTACTTAGGAGGTATGAGCTATTATTGTAAATCACATAAACCACCCATTAGTTTTCCATTGTGTGCTAATGGACAAGTTTTTGGTTTATATAAAAATACATGTGTTGGTAGCGATAATGTTACTGACTTTAATGCAATTGCAACATGTGACTGGACAAATGCTGGTGATTACATTTTAGCTAACACCTGTACTGAAAGACTCAAGCTTTTTGCAGCAGAAACGCTCAAAGCTACTGAGGAGACATTTAAACTGTCTTATGGTATTGCTACTGTACGTGAAGTGCTGTCTGACAGAGAATTACATCTTTCATGGGAAGTTGGTAAACCTAGACCACCACTTAACCGAAATTATGTCTTTACTGGTTATCGTGTAACTAAAAACAGTAAAGTACAAATAGGAGAGTACACCTTTGAAAAAGGTGACTATGGTGATGCTGTTGTTTACCGAGGTACAACAACTTACAAATTAAATGTTGGTGATTATTTTGTGCTGACATCACATACAGTAATGCCATTAAGTGCACCTACACTAGTGCCACAAGAGCACTATGTTAGAATTACTGGCTTATACCCAACACTCAATATCTCAGATGAGTTTTCTAGCAATGTTGCAAATTATCAAAAGGTTGGTATGCAAAAGTATTCTACACTCCAGGGACCACCTGGTACTGGTAAGAGTCATTTTGCTATTGGCCTAGCTCTCTACTACCCTTCTGCTCGCATAGTGTATACAGCTTGCTCTCATGCCGCTGTTGATGCACTATGTGAGAAGGCATTAAAATATTTGCCTATAGATAAATGTAGTAGAATTATACCTGCACGTGCTCGTGTAGAGTGTTTTGATAAATTCAAAGTGAATTCAACATTAGAACAGTATGTCTTTTGTACTGTAAATGCATTGCCTGAGACGACAGCAGATATAGTTGTCTTTGATGAAATTTCAATGGCCACAAATTATGATTTGAGTGTTGTCAATGCCAGATTACGTGCTAAGCACTATGTGTACATTGGCGACCCTGCTCAATTACCTGCACCACGCACATTGCTAACTAAGGGCACACTAGAACCAGAATATTTCAATTCAGTGTGTAGACTTATGAAAACTATAGGTCCAGACATGTTCCTCGGAACTTGTCGGCGTTGTCCTGCTGAAATTGTTGACACTGTGAGTGCTTTGGTTTATGATAATAAGCTTAAAGCACATAAAGACAAATCAGCTCAATGCTTTAAAATGTTTTATAAGGGTGTTATCACGCATGATGTTTCATCTGCAATTAACAGGCCACAAATAGGCGTGGTAAGAGAATTCCTTACACGTAACCCTGCTTGGAGAAAAGCTGTCTTTATTTCACCTTATAATTCACAGAATGCTGTAGCCTCAAAGATTTTGGGACTACCAACTCAAACTGTTGATTCATCACAGGGCTCAGAATATGACTATGTCATATTCACTCAAACCACTGAAACAGCTCACTCTTGTAATGTAAACAGATTTAATGTTGCTATTACCAGAGCAAAAGTAGGCATACTTTGCATAATGTCTGATAGAGACCTTTATGACAAGTTGCAATTTACAAGTCTTGAAATTCCACGTAGGAATGTGGCAACTTTACAAGCTGAAAATGTAACAGGACTCTTTAAAGATTGTAGTAAGGTAATCACTGGGTTACATCCTACACAGGCACCTACACACCTCAGTGTTGACACTAAATTCAAAACTGAAGGTTTATGTGTTGACATACCTGGCATACCTAAGGACATGACCTATAGAAGACTCATCTCTATGATGGGTTTTAAAATGAATTATCAAGTTAATGGTTACCCTAACATGTTTATCACCCGCGAAGAAGCTATAAGACATGTACGTGCATGGATTGGCTTCGATGTCGAGGGGTGTCATGCTACTAGAGAAGCTGTTGGTACCAATTTACCTTTACAGCTAGGTTTTTCTACAGGTGTTAACCTAGTTGCTGTACCTACAGGTTATGTTGATACACCTAATAATACAGATTTTTCCAGAGTTAGTGCTAAACCACCGCCTGGAGATCAATTTAAACACCTCATACCACTTATGTACAAAGGACTTCCTTGGAATGTAGTGCGTATAAAGATTGTACAAATGTTAAGTGACACACTTAAAAATCTCTCTGACAGAGTCGTATTTGTCTTATGGGCACATGGCTTTGAGTTGACATCTATGAAGTATTTTGTGAAAATAGGACCTGAGCGCACCTGTTGTCTATGTGATAGACGTGCCACATGCTTTTCCACTGCTTCAGACACTTATGCCTGTTGGCATCATTCTATTGGATTTGATTACGTCTATAATCCGTTTATGATTGATGTTCAACAATGGGGTTTTACAGGTAACCTACAAAGCAACCATGATCTGTATTGTCAAGTCCATGGTAATGCACATGTAGCTAGTTGTGATGCAATCATGACTAGGTGTCTAGCTGTCCACGAGTGCTTTGTTAAGCGTGTTGACTGGACTATTGAATATCCTATAATTGGTGATGAACTGAAGATTAATGCGGCTTGTAGAAAGGTTCAACACATGGTTGTTAAAGCTGCATTATTAGCAGACAAATTCCCAGTTCTTCACGACATTGGTAACCCTAAAGCTATTAAGTGTGTACCTCAAGCTGATGTAGAATGGAAGTTCTATGATGCACAGCCTTGTAGTGACAAAGCTTATAAAATAGAAGAATTATTCTATTCTTATGCCACACATTCTGACAAATTCACAGATGGTGTATGCCTATTTTGGAATTGCAATGTCGATAGATATCCTGCTAATTCCATTGTTTGTAGATTTGACACTAGAGTGCTATCTAACCTTAACTTGCCTGGTTGTGATGGTGGCAGTTTGTATGTAAATAAACATGCATTCCACACACCAGCTTTTGATAAAAGTGCTTTTGTTAATTTAAAACAATTACCATTTTTCTATTACTCTGACAGTCCATGTGAGTCTCATGGAAAACAAGTAGTGTCAGATATAGATTATGTACCACTAAAGTCTGCTACGTGTATAACACGTTGCAATTTAGGTGGTGCTGTCTGTAGACATCATGCTAATGAGTACAGATTGTATCTCGATGCTTATAACATGATGATCTCAGCTGGCTTTAGCTTGTGGGTTTACAAACAATTTGATACTTATAACCTCTGGAACACTTTTACAAGACTTCAGAGTTTAGAAAATGTGGCTTTTAATGTTGTAAATAAGGGACACTTTGATGGACAACAGGGTGAAGTACCAGTTTCTATCATTAATAACACTGTTTACACAAAAGTTGATGGTGTTGATGTAGAATTGTTTGAAAATAAAACAACATTACCTGTTAATGTAGCATTTGAGCTTTGGGCTAAGCGCAACATTAAACCAGTACCAGAGGTGAAAATACTCAATAATTTGGGTGTGGACATTGCTGCTAATACTGTGATCTGGGACTACAAAAGAGATGCTCCAGCACATATATCTACTATTGGTGTTTGTTCTATGACTGACATAGCCAAGAAACCAACTGAAACGATTTGTGCACCACTCACTGTCTTTTTTGATGGTAGAGTTGATGGTCAAGTAGACTTATTTAGAAATGCCCGTAATGGTGTTCTTATTACAGAAGGTAGTGTTAAAGGTTTACAACCATCTGTAGGTCCCAAACAAGCTAGTCTTAATGGAGTCACATTAATTGGAGAAGCCGTAAAAACACAGTTCAATTATTATAAGAAAGTTGATGGTGTTGTCCAACAATTACCTGAAACTTACTTTACTCAGAGTAGAAATTTACAAGAATTTAAACCCAGGAGTCAAATGGAAATTGATTTCTTAGAATTAGCTATGGATGAATTCATTGAACGGTATAAATTAGAAGGCTATGCCTTCGAACATATCGTTTATGGAGATTTTAGTCATAGTCAGTTAGGTGGTTTACATCTACTGATTGGACTAGCTAAACGTTTTAAGGAATCACCTTTTGAATTAGAAGATTTTATTCCTATGGACAGTACAGTTAAAAACTATTTCATAACAGATGCGCAAACAGGTTCATCTAAGTGTGTGTGTTCTGTTATTGATTTATTACTTGATGATTTTGTTGAAATAATAAAATCCCAAGATTTATCTGTAGTTTCTAAGGTTGTCAAAGTGACTATTGACTATACAGAAATTTCATTTATGCTTTGGTGTAAAGATGGCCATGTAGAAACATTTTACCCAAAATTACAATCTAGTCAAGCGTGGCAACCGGGTGTTGCTATGCCTAATCTTTACAAAATGCAAAGAATGCTATTAGAAAAGTGTGACCTTCAAAATTATGGTGATAGTGCAACATTACCTAAAGGCATAATGATGAATGTCGCAAAATATACTCAACTGTGTCAATATTTAAACACATTAACATTAGCTGTACCCTATAATATGAGAGTTATACATTTTGGTGCTGGTTCTGATAAAGGAGTTGCACCAGGTACAGCTGTTTTAAGACAGTGGTTGCCTACGGGTACGCTGCTTGTCGATTCAGATCTTAATGACTTTGTCTCTGATGCAGATTCAACTTTGATTGGTGATTGTGCAACTGTACATACAGCTAATAAATGGGATCTCATTATTAGTGATATGTACGACCCTAAGACTAAAAATGTTACAAAAGAAAATGACTCTAAAGAGGGTTTTTTCACTTACATTTGTGGGTTTATACAACAAAAGCTAGCTCTTGGAGGTTCCGTGGCTATAAAGATAACAGAACATTCTTGGAATGCTGATCTTTATAAGCTCATGGGACACTTCGCATGGTGGACAGCCTTTGTTACTAATGTGAATGCGTCATCATCTGAAGCATTTTTAATTGGATGTAATTATCTTGGCAAACCACGCGAACAAATAGATGGTTATGTCATGCATGCAAATTACATATTTTGGAGGAATACAAATCCAATTCAGTTGTCTTCCTATTCTTTATTTGACATGAGTAAATTTCCCCTTAAATTAAGGGGTACTGCTGTTATGTCTTTAAAAGAAGGTCAAATCAATGATATGATTTTATCTCTTCTTAGTAAAGGTAGACTTATAATTAGAGAAAACAACAGAGTTGTTATTTCTAGTGATGTTCTTGTTAACAACTAAACGAACAATGTTTGTTTTTCTTGTTTTATTGCCACTAGTCTCTAGTCAGTGTGTTAATCTTACAACCAGAACTCAATTACCCCCTGCATACACTAATTCTTTCACACGTGGTGTTTATTACCCTGACAAAGTTTTCAGATCCTCAGTTTTACATTCAACTCAGGACTTGTTCTTACCTTTCTTTTCCAATGTTACTTGGTTCCATGCTATACATGTCTCTGGGACCAATGGTACTAAGAGGTTTGATAACCCTGTCCTACCATTTAATGATGGTGTTTATTTTGCTTCCACTGAGAAGTCTAACATAATAAGAGGCTGGATTTTTGGTACTACTTTAGATTCGAAGACCCAGTCCCTACTTATTGTTAATAACGCTACTAATGTTGTTATTAAAGTCTGTGAATTTCAATTTTGTAATGATCCATTTTTGGGTGTTTATTACCACAAAAACAACAAAAGTTGGATGGAAAGTGAGTTCAGAGTTTATTCTAGTGCGAATAATTGCACTTTTGAATATGTCTCTCAGCCTTTTCTTATGGACCTTGAAGGAAAACAGGGTAATTTCAAAAATCTTAGGGAATTTGTGTTTAAGAATATTGATGGTTATTTTAAAATATATTCTAAGCACACGCCTATTAATTTAGTGCGTGATCTCCCTCAGGGTTTTTCGGCTTTAGAACCATTGGTAGATTTGCCAATAGGTATTAACATCACTAGGTTTCAAACTTTACTTGCTTTACATAGAAGTTATTTGACTCCTGGTGATTCTTCTTCAGGTTGGACAGCTGGTGCTGCAGCTTATTATGTGGGTTATCTTCAACCTAGGACTTTTCTATTAAAATATAATGAAAATGGAACCATTACAGATGCTGTAGACTGTGCACTTGACCCTCTCTCAGAAACAAAGTGTACGTTGAAATCCTTCACTGTAGAAAAAGGAATCTATCAAACTTCTAACTTTAGAGTCCAACCAACAGAATCTATTGTTAGATTTCCTAATATTACAAACTTGTGCCCTTTTGGTGAAGTTTTTAACGCCACCAGATTTGCATCTGTTTATGCTTGGAACAGGAAGAGAATCAGCAACTGTGTTGCTGATTATTCTGTCCTATATAATTCCGCATCATTTTCCACTTTTAAGTGTTATGGAGTGTCTCCTACTAAATTAAATGATCTCTGCTTTACTAATGTCTATGCAGATTCATTTGTAATTAGAGGTGATGAAGTCAGACAAATCGCTCCAGGGCAAACTGGAAAGATTGCTGATTATAATTATAAATTACCAGATGATTTTACAGGCTGCGTTATAGCTTGGAATTCTAACAATCTTGATTCTAAGGTTGGTGGTAATTATAATTACCTGTATAGATTGTTTAGGAAGTCTAATCTCAAACCTTTTGAGAGAGATATTTCAACTGAAATCTATCAGGCCGGTAGCACACCTTGTAATGGTGTTGAAGGTTTTAATTGTTACTTTCCTTTACAATCATATGGTTTCCAACCCACTAATGGTGTTGGTTACCAACCATACAGAGTAGTAGTACTTTCTTTTGAACTTCTACATGCACCAGCAACTGTTTGTGGACCTAAAAAGTCTACTAATTTGGTTAAAAACAAATGTGTCAATTTCAACTTCAATGGTTTAACAGGCACAGGTGTTCTTACTGAGTCTAACAAAAAGTTTCTGCCTTTCCAACAATTTGGCAGAGACATTGCTGACACTACTGATGCTGTCCGTGATCCACAGACACTTGAGATTCTTGACATTACACCATGTTCTTTTGGTGGTGTCAGTGTTATAACACCAGGAACAAATACTTCTAACCAGGTTGCTGTTCTTTATCAGGATGTTAACTGCACAGAAGTCCCTGTTGCTATTCATGCAGATCAACTTACTCCTACTTGGCGTGTTTATTCTACAGGTTCTAATGTTTTTCAAACACGTGCAGGCTGTTTAATAGGGGCTGAACATGTCAACAACTCATATGAGTGTGACATACCCATTGGTGCAGGTATATGCGCTAGTTATCAGACTCAGACTAATTCTCCTCGGCGGGCACGTAGTGTAGCTAGTCAATCCATCATTGCCTACACTATGTCACTTGGTGCAGAAAATTCAGTTGCTTACTCTAATAACTCTATTGCCATACCCACAAATTTTACTATTAGTGTTACCACAGAAATTCTACCAGTGTCTATGACCAAGACATCAGTAGATTGTACAATGTACATTTGTGGTGATTCAACTGAATGCAGCAATCTTTTGTTGCAATATGGCAGTTTTTGTACACAATTAAACCGTGCTTTAACTGGAATAGCTGTTGAACAAGACAAAAACACCCAAGAAGTTTTTGCACAAGTCAAACAAATTTACAAAACACCACCAATTAAAGATTTTGGTGGTTTTAATTTTTCACAAATATTACCAGATCCATCAAAACCAAGCAAGAGGTCATTTATTGAAGATCTACTTTTCAACAAAGTGACACTTGCAGATGCTGGCTTCATCAAACAATATGGTGATTGCCTTGGTGATATTGCTGCTAGAGACCTCATTTGTGCACAAAAGTTTAACGGCCTTACTGTTTTGCCACCTTTGCTCACAGATGAAATGATTGCTCAATACACTTCTGCACTGTTAGCGGGTACAATCACTTCTGGTTGGACCTTTGGTGCAGGTGCTGCATTACAAATACCATTTGCTATGCAAATGGCTTATAGGTTTAATGGTATTGGAGTTACACAGAATGTTCTCTATGAGAACCAAAAATTGATTGCCAACCAATTTAATAGTGCTATTGGCAAAATTCAAGACTCACTTTCTTCCACAGCAAGTGCACTTGGAAAACTTCAAGATGTGGTCAACCAAAATGCACAAGCTTTAAACACGCTTGTTAAACAACTTAGCTCCAATTTTGGTGCAATTTCAAGTGTTTTAAATGATATCCTTTCACGTCTTGACAAAGTTGAGGCTGAAGTGCAAATTGATAGGTTGATCACAGGCAGACTTCAAAGTTTGCAGACATATGTGACTCAACAATTAATTAGAGCTGCAGAAATCAGAGCTTCTGCTAATCTTGCTGCTACTAAAATGTCAGAGTGTGTACTTGGACAATCAAAAAGAGTTGATTTTTGTGGAAAGGGCTATCATCTTATGTCCTTCCCTCAGTCAGCACCTCATGGTGTAGTCTTCTTGCATGTGACTTATGTCCCTGCACAAGAAAAGAACTTCACAACTGCTCCTGCCATTTGTCATGATGGAAAAGCACACTTTCCTCGTGAAGGTGTCTTTGTTTCAAATGGCACACACTGGTTTGTAACACAAAGGAATTTTTATGAACCACAAATCATTACTACAGACAACACATTTGTGTCTGGTAACTGTGATGTTGTAATAGGAATTGTCAACAACACAGTTTATGATCCTTTGCAACCTGAATTAGACTCATTCAAGGAGGAGTTAGATAAATATTTTAAGAATCATACATCACCAGATGTTGATTTAGGTGACATCTCTGGCATTAATGCTTCAGTTGTAAACATTCAAAAAGAAATTGACCGCCTCAATGAGGTTGCCAAGAATTTAAATGAATCTCTCATCGATCTCCAAGAACTTGGAAAGTATGAGCAGTATATAAAATGGCCATGGTACATTTGGCTAGGTTTTATAGCTGGCTTGATTGCCATAGTAATGGTGACAATTATGCTTTGCTGTATGACCAGTTGCTGTAGTTGTCTCAAGGGCTGTTGTTCTTGTGGATCCTGCTGCAAATTTGATGAAGACGACTCTGAGCCAGTGCTCAAAGGAGTCAAATTACATTACACATAAACGAACTTATGGATTTGTTTATGAGAATCTTCACAATTGGAACTGTAACTTTGAAGCAAGGTGAAATCAAGGATGCTACTCCTTCAGATTTTGTTCGCGCTACTGCAACGATACCGATACAAGCCTCACTCCCTTTCGGATGGCTTATTGTTGGCGTTGCACTTCTTGCTGTTTTTCAGAGCGCTTCCAAAATCATAACCCTCAAAAAGAGATGGCAACTAGCACTCTCCAAGGGTGTTCACTTTGTTTGCAACTTGCTGTTGTTGTTTGTAACAGTTTACTCACACCTTTTGCTCGTTGCTGCTGGCCTTGAAGCCCCTTTTCTCTATCTTTATGCTTTAGTCTACTTCTTGCAGAGTATAAACTTTGTAAGAATAATAATGAGGCTTTGGCTTTGCTGGAAATGCCGTTCCAAAAACCCATTACTTTATGATGCCAACTATTTTCTTTGCTGGCATACTAATTGTTACGACTATTGTATACCTTACAATAGTGTAACTTCTTCAATTGTCATTACTTCAGGTGATGGCACAACAAGTCCTATTTCTGAACATGACTACCAGATTGGTGGTTATACTGAAAAATGGGAATCTGGAGTAAAAGACTGTGTTGTATTACACAGTTACTTCACTTCAGACTATTACCAGCTGTACTCAACTCAATTGAGTACAGACACTGGTGTTGAACATGTTACCTTCTTCATCTACAATAAAATTGTTGATGAGCCTGAAGAACATGTCCAAATTCACACAATCGACGGTTCATCCGGAGTTGTTAATCCAGTAATGGAACCAATTTATGATGAACCGACGACGACTACTAGCGTGCCTTTGTAAGCACAAGCTGATGAGTACGAACTTATGTACTCATTCGTTTCGGAAGAGACAGGTACGTTAATAGTTAATAGCGTACTTCTTTTTCTTGCTTTCGTGGTATTCTTGCTAGTTACACTAGCCATCCTTACTGCGCTTCGATTGTGTGCGTACTGCTGCAATATTGTTAACGTGAGTCTTGTAAAACCTTCTTTTTACGTTTACTCTCGTGTTAAAAATCTGAATTCTTCTAGAGTTCCTGATCTTCTGGTCTAAACGAACTAAATATTATATTAGTTTTTCTGTTTGGAACTTTAATTTTAGCCATGGCAGATTCCAACGGTACTATTACCGTTGAAGAGCTTAAAAAGCTCCTTGAACAATGGAACCTAGTAATAGGTTTCCTATTCCTTACATGGATTTGTCTTCTACAATTTGCCTATGCCAACAGGAATAGGTTTTTGTATATAATTAAGTTAATTTTCCTCTGGCTGTTATGGCCAGTAACTTTAGCTTGTTTTGTGCTTGCTGCTGTTTACAGAATAAATTGGATCACCGGTGGAATTGCTATCGCAATGGCTTGTCTTGTAGGCTTGATGTGGCTCAGCTACTTCATTGCTTCTTTCAGACTGTTTGCGCGTACGCGTTCCATGTGGTCATTCAATCCAGAAACTAACATTCTTCTCAACGTGCCACTCCATGGCACTATTCTGACCAGACCGCTTCTAGAAAGTGAACTCGTAATCGGAGCTGTGATCCTTCGTGGACATCTTCGTATTGCTGGACACCATCTAGGACGCTGTGACATCAAGGACCTGCCTAAAGAAATCACTGTTGCTACATCACGAACGCTTTCTTATTACAAATTGGGAGCTTCGCAGCGTGTAGCAGGTGACTCAGGTTTTGCTGCATACAGTCGCTACAGGATTGGCAACTATAAATTAAACACAGACCATTCCAGTAGCAGTGACAATATTGCTTTGCTTGTACAGTAAGTGACAACAGATGTTTCATCTCGTTGACTTTCAGGTTACTATAGCAGAGATATTACTAATTATTATGAGGACTTTTAAAGTTTCCATTTGGAATCTTGATTACATCATAAACCTCATAATTAAAAATTTATCTAAGTCACTAACTGAGAATAAATATTCTCAATTAGATGAAGAGCAACCAATGGAGATTGATTAAACGAACATGAAAATTATTCTTTTCTTGGCACTGATAACACTCGCTACTTGTGAGCTTTATCACTACCAAGAGTGTGTTAGAGGTACAACAGTACTTTTAAAAGAACCTTGCTCTTCTGGAACATACGAGGGCAATTCACCATTTCATCCTCTAGCTGATAACAAATTTGCACTGACTTGCTTTAGCACTCAATTTGCTTTTGCTTGTCCTGACGGCGTAAAACACGTCTATCAGTTACGTGCCAGATCAGTTTCACCTAAACTGTTCATCAGACAAGAGGAAGTTCAAGAACTTTACTCTCCAATTTTTCTTATTGTTGCGGCAATAGTGTTTATAACACTTTGCTTCACACTCAAAAGAAAGACAGAATGATTGAACTTTCATTAATTGACTTCTATTTGTGCTTTTTAGCCTTTCTGCTATTCCTTGTTTTAATTATGCTTATTATCTTTTGGTTCTCACTTGAACTGCAAGATCATAATGAAACTTGTCACGCCTAAACGAACATGAAATTTCTTGTTTTCTTAGGAATCATCACAACTGTAGCTGCATTTCACCAAGAATGTAGTTTACAGTCATGTACTCAACATCAACCATATGTAGTTGATGACCCGTGTCCTATTCACTTCTATTCTAAATGGTATATTAGAGTAGGAGCTAGAAAATCAGCACCTTTAATTGAATTGTGCGTGGATGAGGCTGGTTCTAAATCACCCATTCAGTACATCGATATCGGTAATTATACAGTTTCCTGTTTACCTTTTACAATTAATTGCCAGGAACCTAAATTGGGTAGTCTTGTAGTGCGTTGTTCGTTCTATGAAGACTTTTTAGAGTATCATGACGTTCGTGTTGTTTTAGATTTCATCTAAACGAACAAACTAAAATGTCTGATAATGGACCCCAAAATCAGCGAAATGCACCCCGCATTACGTTTGGTGGACCCTCAGATTCAACTGGCAGTAACCAGAATGGAGAACGCAGTGGGGCGCGATCAAAACAACGTCGGCCCCAAGGTTTACCCAATAATACTGCGTCTTGGTTCACCGCTCTCACTCAACATGGCAAGGAAGACCTTAAATTCCCTCGAGGACAAGGCGTTCCAATTAACACCAATAGCAGTCCAGATGACCAAATTGGCTACTACCGAAGAGCTACCAGACGAATTCGTGGTGGTGACGGTAAAATGAAAGATCTCAGTCCAAGATGGTATTTCTACTACCTAGGAACTGGGCCAGAAGCTGGACTTCCCTATGGTGCTAACAAAGACGGCATCATATGGGTTGCAACTGAGGGAGCCTTGAATACACCAAAAGATCACATTGGCACCCGCAATCCTGCTAACAATGCTGCAATCGTGCTACAACTTCCTCAAGGAACAACATTGCCAAAAGGCTTCTACGCAGAAGGGAGCAGAGGCGGCAGTCAAGCCTCTTCTCGTTCCTCATCACGTAGTCGCAACAGTTCAAGAAATTCAACTCCAGGCAGCAGTAGGGGAACTTCTCCTGCTAGAATGGCTGGCAATGGCGGTGATGCTGCTCTTGCTTTGCTGCTGCTTGACAGATTGAACCAGCTTGAGAGCAAAATGTCTGGTAAAGGCCAACAACAACAAGGCCAAACTGTCACTAAGAAATCTGCTGCTGAGGCTTCTAAGAAGCCTCGGCAAAAACGTACTGCCACTAAAGCATACAATGTAACACAAGCTTTCGGCAGACGTGGTCCAGAACAAACCCAAGGAAATTTTGGGGACCAGGAACTAATCAGACAAGGAACTGATTACAAACATTGGCCGCAAATTGCACAATTTGCCCCCAGCGCTTCAGCGTTCTTCGGAATGTCGCGCATTGGCATGGAAGTCACACCTTCGGGAACGTGGTTGACCTACACAGGTGCCATCAAATTGGATGACAAAGATCCAAATTTCAAAGATCAAGTCATTTTGCTGAATAAGCATATTGACGCATACAAAACATTCCCACCAACAGAGCCTAAAAAGGACAAAAAGAAGAAGGCTGATGAAACTCAAGCCTTACCGCAGAGACAGAAGAAACAGCAAACTGTGACTCTTCTTCCTGCTGCAGATTTGGATGATTTCTCCAAACAATTGCAACAATCCATGAGCAGTGCTGACTCAACTCAGGCCTAAACTCATGCAGACCACACAAGGCAGATGGGCTATATAAACGTTTTCGCTTTTCCGTTTACGATATATAGTCTACTCTTGTGCAGAATGAATTCTCGTAACTACATAGCACAAGTAGATGTAGTTAACTTTAATCTCACATAGCAATCTTTAATCAGTGTGTAACATTAGGGAGGACTTGAAAGAGCCACCACATTTTCACCGAGGCCACGCGGAGTACGATCGAGTGTACAGTGAACAATGCTAGGGAGAGCTGCCTATATGGAAGAGCCCTAATGTGTAAAATTAATTTTAGTAGTGCTATCCCCATGTGATTTTAATAGCTTCTTAGGAGAATGACAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA
서열번호: 2
중증 급성 호흡기 증후군 코로나바이러스 2 orf1ab 다단백질의 단리물 우한-Hu-1 (GenBank: QHD43415)
MESLVPGFNEKTHVQLSLPVLQVRDVLVRGFGDSVEEVLSEARQHLKDGTCGLVEVEKGVLPQLEQPYVFIKRSDARTAPHGHVMVELVAELEGIQYGRSGETLGVLVPHVGEIPVAYRKVLLRKNGNKGAGGHSYGADLKSFDLGDELGTDPYEDFQENWNTKHSSGVTRELMRELNGGAYTRYVDNNFCGPDGYPLECIKDLLARAGKASCTLSEQLDFIDTKRGVYCCREHEHEIAWYTERSEKSYELQTPFEIKLAKKFDTFNGECPNFVFPLNSIIKTIQPRVEKKKLDGFMGRIRSVYPVASPNECNQMCLSTLMKCDHCGETSWQTGDFVKATCEFCGTENLTKEGATTCGYLPQNAVVKIYCPACHNSEVGPEHSLAEYHNESGLKTILRKGGRTIAFGGCVFSYVGCHNKCAYWVPRASANIGCNHTGVVGEGSEGLNDNLLEILQKEKVNINIVGDFKLNEEIAIILASFSASTSAFVETVKGLDYKAFKQIVESCGNFKVTKGKAKKGAWNIGEQKSILSPLYAFASEAARVVRSIFSRTLETAQNSVRVLQKAAITILDGISQYSLRLIDAMMFTSDLATNNLVVMAYITGGVVQLTSQWLTNIFGTVYEKLKPVLDWLEEKFKEGVEFLRDGWEIVKFISTCACEIVGGQIVTCAKEIKESVQTFFKLVNKFLALCADSIIIGGAKLKALNLGETFVTHSKGLYRKCVKSREETGLLMPLKAPKEIIFLEGETLPTEVLTEEVVLKTGDLQPLEQPTSEAVEAPLVGTPVCINGLMLLEIKDTEKYCALAPNMMVTNNTFTLKGGAPTKVTFGDDTVIEVQGYKSVNITFELDERIDKVLNEKCSAYTVELGTEVNEFACVVADAVIKTLQPVSELLTPLGIDLDEWSMATYYLFDESGEFKLASHMYCSFYPPDEDEEEGDCEEEEFEPSTQYEYGTEDDYQGKPLEFGATSAALQPEEEQEEDWLDDDSQQTVGQQDGSEDNQTTTIQTIVEVQPQLEMELTPVVQTIEVNSFSGYLKLTDNVYIKNADIVEEAKKVKPTVVVNAANVYLKHGGGVAGALNKATNNAMQVESDDYIATNGPLKVGGSCVLSGHNLAKHCLHVVGPNVNKGEDIQLLKSAYENFNQHEVLLAPLLSAGIFGADPIHSLRVCVDTVRTNVYLAVFDKNLYDKLVSSFLEMKSEKQVEQKIAEIPKEEVKPFITESKPSVEQRKQDDKKIKACVEEVTTTLEETKFLTENLLLYIDINGNLHPDSATLVSDIDITFLKKDAPYIVGDVVQEGVLTAVVIPTKKAGGTTEMLAKALRKVPTDNYITTYPGQGLNGYTVEEAKTVLKKCKSAFYILPSIISNEKQEILGTVSWNLREMLAHAEETRKLMPVCVETKAIVSTIQRKYKGIKIQEGVVDYGARFYFYTSKTTVASLINTLNDLNETLVTMPLGYVTHGLNLEEAARYMRSLKVPATVSVSSPDAVTAYNGYLTSSSKTPEEHFIETISLAGSYKDWSYSGQSTQLGIEFLKRGDKSVYYTSNPTTFHLDGEVITFDNLKTLLSLREVRTIKVFTTVDNINLHTQVVDMSMTYGQQFGPTYLDGADVTKIKPHNSHEGKTFYVLPNDDTLRVEAFEYYHTTDPSFLGRYMSALNHTKKWKYPQVNGLTSIKWADNNCYLATALLTLQQIELKFNPPALQDAYYRARAGEAANFCALILAYCNKTVGELGDVRETMSYLFQHANLDSCKRVLNVVCKTCGQQQTTLKGVEAVMYMGTLSYEQFKKGVQIPCTCGKQATKYLVQQESPFVMMSAPPAQYELKHGTFTCASEYTGNYQCGHYKHITSKETLYCIDGALLTKSSEYKGPITDVFYKENSYTTTIKPVTYKLDGVVCTEIDPKLDNYYKKDNSYFTEQPIDLVPNQPYPNASFDNFKFVCDNIKFADDLNQLTGYKKPASRELKVTFFPDLNGDVVAIDYKHYTPSFKKGAKLLHKPIVWHVNNATNKATYKPNTWCIRCLWSTKPVETSNSFDVLKSEDAQGMDNLACEDLKPVSEEVVENPTIQKDVLECNVKTTEVVGDIILKPANNSLKITEEVGHTDLMAAYVDNSSLTIKKPNELSRVLGLKTLATHGLAAVNSVPWDTIANYAKPFLNKVVSTTTNIVTRCLNRVCTNYMPYFFTLLLQLCTFTRSTNSRIKASMPTTIAKNTVKSVGKFCLEASFNYLKSPNFSKLINIIIWFLLLSVCLGSLIYSTAALGVLMSNLGMPSYCTGYREGYLNSTNVTIATYCTGSIPCSVCLSGLDSLDTYPSLETIQITISSFKWDLTAFGLVAEWFLAYILFTRFFYVLGLAAIMQLFFSYFAVHFISNSWLMWLIINLVQMAPISAMVRMYIFFASFYYVWKSYVHVVDGCNSSTCMMCYKRNRATRVECTTIVNGVRRSFYVYANGGKGFCKLHNWNCVNCDTFCAGSTFISDEVARDLSLQFKRPINPTDQSSYIVDSVTVKNGSIHLYFDKAGQKTYERHSLSHFVNLDNLRANNTKGSLPINVIVFDGKSKCEESSAKSASVYYSQLMCQPILLLDQALVSDVGDSAEVAVKMFDAYVNTFSSTFNVPMEKLKTLVATAEAELAKNVSLDNVLSTFISAARQGFVDSDVETKDVVECLKLSHQSDIEVTGDSCNNYMLTYNKVENMTPRDLGACIDCSARHINAQVAKSHNIALIWNVKDFMSLSEQLRKQIRSAAKKNNLPFKLTCATTRQVVNVVTTKIALKGGKIVNNWLKQLIKVTLVFLFVAAIFYLITPVHVMSKHTDFSSEIIGYKAIDGGVTRDIASTDTCFANKHADFDTWFSQRGGSYTNDKACPLIAAVITREVGFVVPGLPGTILRTTNGDFLHFLPRVFSAVGNICYTPSKLIEYTDFATSACVLAAECTIFKDASGKPVPYCYDTNVLEGSVAYESLRPDTRYVLMDGSIIQFPNTYLEGSVRVVTTFDSEYCRHGTCERSEAGVCVSTSGRWVLNNDYYRSLPGVFCGVDAVNLLTNMFTPLIQPIGALDISASIVAGGIVAIVVTCLAYYFMRFRRAFGEYSHVVAFNTLLFLMSFTVLCLTPVYSFLPGVYSVIYLYLTFYLTNDVSFLAHIQWMVMFTPLVPFWITIAYIICISTKHFYWFFSNYLKRRVVFNGVSFSTFEEAALCTFLLNKEMYLKLRSDVLLPLTQYNRYLALYNKYKYFSGAMDTTSYREAACCHLAKALNDFSNSGSDVLYQPPQTSITSAVLQSGFRKMAFPSGKVEGCMVQVTCGTTTLNGLWLDDVVYCPRHVICTSEDMLNPNYEDLLIRKSNHNFLVQAGNVQLRVIGHSMQNCVLKLKVDTANPKTPKYKFVRIQPGQTFSVLACYNGSPSGVYQCAMRPNFTIKGSFLNGSCGSVGFNIDYDCVSFCYMHHMELPTGVHAGTDLEGNFYGPFVDRQTAQAAGTDTTITVNVLAWLYAAVINGDRWFLNRFTTTLNDFNLVAMKYNYEPLTQDHVDILGPLSAQTGIAVLDMCASLKELLQNGMNGRTILGSALLEDEFTPFDVVRQCSGVTFQSAVKRTIKGTHHWLLLTILTSLLVLVQSTQWSLFFFLYENAFLPFAMGIIAMSAFAMMFVKHKHAFLCLFLLPSLATVAYFNMVYMPASWVMRIMTWLDMVDTSLSGFKLKDCVMYASAVVLLILMTARTVYDDGARRVWTLMNVLTLVYKVYYGNALDQAISMWALIISVTSNYSGVVTTVMFLARGIVFMCVEYCPIFFITGNTLQCIMLVYCFLGYFCTCYFGLFCLLNRYFRLTLGVYDYLVSTQEFRYMNSQGLLPPKNSIDAFKLNIKLLGVGGKPCIKVATVQSKMSDVKCTSVVLLSVLQQLRVESSSKLWAQCVQLHNDILLAKDTTEAFEKMVSLLSVLLSMQGAVDINKLCEEMLDNRATLQAIASEFSSLPSYAAFATAQEAYEQAVANGDSEVVLKKLKKSLNVAKSEFDRDAAMQRKLEKMADQAMTQMYKQARSEDKRAKVTSAMQTMLFTMLRKLDNDALNNIINNARDGCVPLNIIPLTTAAKLMVVIPDYNTYKNTCDGTTFTYASALWEIQQVVDADSKIVQLSEISMDNSPNLAWPLIVTALRANSAVKLQNNELSPVALRQMSCAAGTTQTACTDDNALAYYNTTKGGRFVLALLSDLQDLKWARFPKSDGTGTIYTELEPPCRFVTDTPKGPKVKYLYFIKGLNNLNRGMVLGSLAATVRLQAGNATEVPANSTVLSFCAFAVDAAKAYKDYLASGGQPITNCVKMLCTHTGTGQAITVTPEANMDQESFGGASCCLYCRCHIDHPNPKGFCDLKGKYVQIPTTCANDPVGFTLKNTVCTVCGMWKGYGCSCDQLREPMLQSADAQSFLNRVCGVSAARLTPCGTGTSTDVVYRAFDIYNDKVAGFAKFLKTNCCRFQEKDEDDNLIDSYFVVKRHTFSNYQHEETIYNLLKDCPAVAKHDFFKFRIDGDMVPHISRQRLTKYTMADLVYALRHFDEGNCDTLKEILVTYNCCDDDYFNKKDWYDFVENPDILRVYANLGERVRQALLKTVQFCDAMRNAGIVGVLTLDNQDLNGNWYDFGDFIQTTPGSGVPVVDSYYSLLMPILTLTRALTAESHVDTDLTKPYIKWDLLKYDFTEERLKLFDRYFKYWDQTYHPNCVNCLDDRCILHCANFNVLFSTVFPPTSFGPLVRKIFVDGVPFVVSTGYHFRELGVVHNQDVNLHSSRLSFKELLVYAADPAMHAASGNLLLDKRTTCFSVAALTNNVAFQTVKPGNFNKDFYDFAVSKGFFKEGSSVELKHFFFAQDGNAAISDYDYYRYNLPTMCDIRQLLFVVEVVDKYFDCYDGGCINANQVIVNNLDKSAGFPFNKWGKARLYYDSMSYEDQDALFAYTKRNVIPTITQMNLKYAISAKNRARTVAGVSICSTMTNRQFHQKLLKSIAATRGATVVIGTSKFYGGWHNMLKTVYSDVENPHLMGWDYPKCDRAMPNMLRIMASLVLARKHTTCCSLSHRFYRLANECAQVLSEMVMCGGSLYVKPGGTSSGDATTAYANSVFNICQAVTANVNALLSTDGNKIADKYVRNLQHRLYECLYRNRDVDTDFVNEFYAYLRKHFSMMILSDDAVVCFNSTYASQGLVASIKNFKSVLYYQNNVFMSEAKCWTETDLTKGPHEFCSQHTMLVKQGDDYVYLPYPDPSRILGAGCFVDDIVKTDGTLMIERFVSLAIDAYPLTKHPNQEYADVFHLYLQYIRKLHDELTGHMLDMYSVMLTNDNTSRYWEPEFYEAMYTPHTVLQAVGACVLCNSQTSLRCGACIRRPFLCCKCCYDHVISTSHKLVLSVNPYVCNAPGCDVTDVTQLYLGGMSYYCKSHKPPISFPLCANGQVFGLYKNTCVGSDNVTDFNAIATCDWTNAGDYILANTCTERLKLFAAETLKATEETFKLSYGIATVREVLSDRELHLSWEVGKPRPPLNRNYVFTGYRVTKNSKVQIGEYTFEKGDYGDAVVYRGTTTYKLNVGDYFVLTSHTVMPLSAPTLVPQEHYVRITGLYPTLNISDEFSSNVANYQKVGMQKYSTLQGPPGTGKSHFAIGLALYYPSARIVYTACSHAAVDALCEKALKYLPIDKCSRIIPARARVECFDKFKVNSTLEQYVFCTVNALPETTADIVVFDEISMATNYDLSVVNARLRAKHYVYIGDPAQLPAPRTLLTKGTLEPEYFNSVCRLMKTIGPDMFLGTCRRCPAEIVDTVSALVYDNKLKAHKDKSAQCFKMFYKGVITHDVSSAINRPQIGVVREFLTRNPAWRKAVFISPYNSQNAVASKILGLPTQTVDSSQGSEYDYVIFTQTTETAHSCNVNRFNVAITRAKVGILCIMSDRDLYDKLQFTSLEIPRRNVATLQAENVTGLFKDCSKVITGLHPTQAPTHLSVDTKFKTEGLCVDIPGIPKDMTYRRLISMMGFKMNYQVNGYPNMFITREEAIRHVRAWIGFDVEGCHATREAVGTNLPLQLGFSTGVNLVAVPTGYVDTPNNTDFSRVSAKPPPGDQFKHLIPLMYKGLPWNVVRIKIVQMLSDTLKNLSDRVVFVLWAHGFELTSMKYFVKIGPERTCCLCDRRATCFSTASDTYACWHHSIGFDYVYNPFMIDVQQWGFTGNLQSNHDLYCQVHGNAHVASCDAIMTRCLAVHECFVKRVDWTIEYPIIGDELKINAACRKVQHMVVKAALLADKFPVLHDIGNPKAIKCVPQADVEWKFYDAQPCSDKAYKIEELFYSYATHSDKFTDGVCLFWNCNVDRYPANSIVCRFDTRVLSNLNLPGCDGGSLYVNKHAFHTPAFDKSAFVNLKQLPFFYYSDSPCESHGKQVVSDIDYVPLKSATCITRCNLGGAVCRHHANEYRLYLDAYNMMISAGFSLWVYKQFDTYNLWNTFTRLQSLENVAFNVVNKGHFDGQQGEVPVSIINNTVYTKVDGVDVELFENKTTLPVNVAFELWAKRNIKPVPEVKILNNLGVDIAANTVIWDYKRDAPAHISTIGVCSMTDIAKKPTETICAPLTVFFDGRVDGQVDLFRNARNGVLITEGSVKGLQPSVGPKQASLNGVTLIGEAVKTQFNYYKKVDGVVQQLPETYFTQSRNLQEFKPRSQMEIDFLELAMDEFIERYKLEGYAFEHIVYGDFSHSQLGGLHLLIGLAKRFKESPFELEDFIPMDSTVKNYFITDAQTGSSKCVCSVIDLLLDDFVEIIKSQDLSVVSKVVKVTIDYTEISFMLWCKDGHVETFYPKLQSSQAWQPGVAMPNLYKMQRMLLEKCDLQNYGDSATLPKGIMMNVAKYTQLCQYLNTLTLAVPYNMRVIHFGAGSDKGVAPGTAVLRQWLPTGTLLVDSDLNDFVSDADSTLIGDCATVHTANKWDLIISDMYDPKTKNVTKENDSKEGFFTYICGFIQQKLALGGSVAIKITEHSWNADLYKLMGHFAWWTAFVTNVNASSSEAFLIGCNYLGKPREQIDGYVMHANYIFWRNTNPIQLSSYSLFDMSKFPLKLRGTAVMSLKEGQINDMILSLLSKGRLIIRENNRVVISSDVLVNN
서열번호: 3
중증 급성 호흡기 증후군 코로나바이러스 2 표면 당단백질 (GenBank: QHD43416)
MFVFLVLLPLVSSQCVNLTTRTQLPPAYTNSFTRGVYYPDKVFRSSVLHSTQDLFLPFFSNVTWFHAIHVSGTNGTKRFDNPVLPFNDGVYFASTEKSNIIRGWIFGTTLDSKTQSLLIVNNATNVVIKVCEFQFCNDPFLGVYYHKNNKSWMESEFRVYSSANNCTFEYVSQPFLMDLEGKQGNFKNLREFVFKNIDGYFKIYSKHTPINLVRDLPQGFSALEPLVDLPIGINITRFQTLLALHRSYLTPGDSSSGWTAGAAAYYVGYLQPRTFLLKYNENGTITDAVDCALDPLSETKCTLKSFTVEKGIYQTSNFRVQPTESIVRFPNITNLCPFGEVFNATRFASVYAWNRKRISNCVADYSVLYNSASFSTFKCYGVSPTKLNDLCFTNVYADSFVIRGDEVRQIAPGQTGKIADYNYKLPDDFTGCVIAWNSNNLDSKVGGNYNYLYRLFRKSNLKPFERDISTEIYQAGSTPCNGVEGFNCYFPLQSYGFQPTNGVGYQPYRVVVLSFELLHAPATVCGPKKSTNLVKNKCVNFNFNGLTGTGVLTESNKKFLPFQQFGRDIADTTDAVRDPQTLEILDITPCSFGGVSVITPGTNTSNQVAVLYQDVNCTEVPVAIHADQLTPTWRVYSTGSNVFQTRAGCLIGAEHVNNSYECDIPIGAGICASYQTQTNSPRRARSVASQSIIAYTMSLGAENSVAYSNNSIAIPTNFTISVTTEILPVSMTKTSVDCTMYICGDSTECSNLLLQYGSFCTQLNRALTGIAVEQDKNTQEVFAQVKQIYKTPPIKDFGGFNFSQILPDPSKPSKRSFIEDLLFNKVTLADAGFIKQYGDCLGDIAARDLICAQKFNGLTVLPPLLTDEMIAQYTSALLAGTITSGWTFGAGAALQIPFAMQMAYRFNGIGVTQNVLYENQKLIANQFNSAIGKIQDSLSSTASALGKLQDVVNQNAQALNTLVKQLSSNFGAISSVLNDILSRLDKVEAEVQIDRLITGRLQSLQTYVTQQLIRAAEIRASANLAATKMSECVLGQSKRVDFCGKGYHLMSFPQSAPHGVVFLHVTYVPAQEKNFTTAPAICHDGKAHFPREGVFVSNGTHWFVTQRNFYEPQIITTDNTFVSGNCDVVIGIVNNTVYDPLQPELDSFKEELDKYFKNHTSPDVDLGDISGINASVVNIQKEIDRLNEVAKNLNESLIDLQELGKYEQYIKWPWYIWLGFIAGLIAIVMVTIMLCCMTSCCSCLKGCCSCGSCCKFDEDDSEPVLKGVKLHYT
서열번호: 4
CpG 1018
TGACTGTGAACGTTCGAGATGA
서열번호: 5
KLK 펩티드
KLKLLLLLKLK
서열번호: 6
Oligo-d(IC)13 (ODN1a)
ICICICICICICICICICICICICIC
서열번호: 7
CpG 1826
TCCATGACGTTCCTGACGTT
서열번호: 8
CpG 7909
TCGTCGTTTTGTCGTTTTGTCGTT
서열번호: 9
>hCoV-19/Italy/INMI1-isl/2020|EPI_ISL_410545|2020-01-29 (수탁번호: MT066156)
ATTAAAGGTTTATACCTTCCCAGGTAACAAACCAACCAACTTTCGATCTCTTGTAGATCTGTTCTCTAAACGAACTTTAAAATCTGTGTGGCTGTCACTCGGCTGCATGCTTAGTGCACTCACGCAGTATAATTAATAACTAATTACTGTCGTTGACAGGACACGAGTAACTCGTCTATCTTCTGCAGGCTGCTTACGGTTTCGTCCGTGTTGCAGCCGATCATCAGCACATCTAGGTTTCGTCCGGGTGTGACCGAAAGGTAAGATGGAGAGCCTTGTCCCTGGTTTCAACGAGAAAACACACGTCCAACTCAGTTTGCCTGTTTTACAGGTTCGCGACGTGCTCGTACGTGGCTTTGGAGACTCCGTGGAGGAGGTCTTATCAGAGGCACGTCAACATCTTAAAGATGGCACTTGTGGCTTAGTAGAAGTTGAAAAAGGCGTTTTGCCTCAACTTGAACAGCCCTATGTGTTCATCAAACGTTCGGATGCTCGAACTGCACCTCATGGTCATGTTATGGTTGAGCTGGTAGCAGAACTCGAAGGCATTCAGTACGGTCGTAGTGGTGAGACACTTGGTGTCCTTGTCCCTCATGTGGGCGAAATACCAGTGGCTTACCGCAAGGTTCTTCTTCGTAAGAACGGTAATAAAGGAGCTGGTGGCCATAGTTACGGCGCCGATCTAAAGTCATTTGACTTAGGCGACGAGCTTGGCACTGATCCTTATGAAGATTTTCAAGAAAACTGGAACACTAAACATAGCAGTGGTGTTACCCGTGAACTCATGCGTGAGCTTAACGGAGGGGCATACACTCGCTATGTCGATAACAACTTCTGTGGCCCTGATGGCTACCCTCTTGAGTGCATTAAAGACCTTCTAGCACGTGCTGGTAAAGCTTCATGCACTTTGTCCGAACAACTGGACTTTATTGACACTAAGAGGGGTGTATACTGCTGCCGTGAACATGAGCATGAAATTGCTTGGTACACGGAACGTTCTGAAAAGAGCTATGAATTGCAGACACCTTTTGAAATTAAATTGGCAAAGAAATTTGACACCTTCAATGGGGAATGTCCAAATTTTGTATTTCCCTTAAATTCCATAATCAAGACTATTCAACCAAGGGTTGAAAAGAAAAAGCTTGATGGCTTTATGGGTAGAATTCGATCTGTCTATCCAGTTGCGTCACCAAATGAATGCAACCAAATGTGCCTTTCAACTCTCATGAAGTGTGATCATTGTGGTGAAACTTCATGGCAGACGGGCGATTTTGTTAAAGCCACTTGCGAATTTTGTGGCACTGAGAATTTGACTAAAGAAGGTGCCACTACTTGTGGTTACTTACCCCAAAATGCTGTTGTTAAAATTTATTGTCCAGCATGTCACAATTCAGAAGTAGGACCTGAGCATAGTCTTGCCGAATACCATAATGAATCTGGCTTGAAAACCATTCTTCGTAAGGGTGGTCGCACTATTGCCTTTGGAGGCTGTGTGTTCTCTTATGTTGGTTGCCATAACAAGTGTGCCTATTGGGTTCCACGTGCTAGCGCTAACATAGGTTGTAACCATACAGGTGTTGTTGGAGAAGGTTCCGAAGGTCTTAATGACAACCTTCTTGAAATACTCCAAAAAGAGAAAGTCAACATCAATATTGTTGGTGACTTTAAACTTAATGAAGAGATCGCCATTATTTTGGCATCTTTTTCTGCTTCCACAAGTGCTTTTGTGGAAACTGTGAAAGGTTTGGATTATAAAGCATTCAAACAAATTGTTGAATCCTGTGGTAATTTTAAAGTTACAAAAGGAAAAGCTAAAAAAGGTGCCTGGAATATTGGTGAACAGAAATCAATACTGAGTCCTCTTTATGCATTTGCATCAGAGGCTGCTCGTGTTGTACGATCAATTTTCTCCCGCACTCTTGAAACTGCTCAAAATTCTGTGCGTGTTTTACAGAAGGCCGCTATAACAATACTAGATGGAATTTCACAGTATTCACTGAGACTCATTGATGCTATGATGTTCACATCTGATTTGGCTACTAACAATCTAGTTGTAATGGCCTACATTACAGGTGGTGTTGTTCAGTTGACTTCGCAGTGGCTAACTAACATCTTTGGCACTGTTTATGAAAAACTCAAACCCGTCCTTGATTGGCTTGAAGAGAAGTTTAAGGAAGGTGTAGAGTTTCTTAGAGACGGTTGGGAAATTGTTAAATTTATCTCAACCTGTGCTTGTGAAATTGTCGGTGGACAAATTGTCACCTGTGCTAAGGAAATTAAGGAGAGTGTTCAGACATTCTTTAAGCTTGTAAATAAATTTTTGGCTTTGTGTGCTGACTCTATCATTATTGGTGGAGCTAAACTTAAAGCCTTGAATTTAGGTGAAACATTTGTCACGCACTCAAAGGGATTGTACAGAAAGTGTGTTAAATCCAGAGAAGAAACTGGCCTACTCATGCCTCTAAAAGCCCCAAAAGAAATTATCTTCTTAGAGGGAGAAACACTTCCCACAGAAGTGTTAACAGAGGAAGTTGTCTTGAAAACTGGTGATTTACAACCATTAGAACAACCTACTAGTGAAGCTGTTGAAGCTCCATTGGTTGGTACACCAGTTTGTATTAACGGGCTTATGTTGCTCGAAATCAAAGACACAGAAAAGTACTGTGCCCTTGCACCTAATATGATGGTAACAAACAATACCTTCACACTCAAAGGCGGTGCACCAACAAAGGTTACTTTTGGTGATGACACTGTGATAGAAGTGCAAGGTTACAAGAGTGTGAATATCACTTTTGAACTTGATGAAAGGATTGATAAAGTACTTAATGAGAAGTGCTCTGCCTATACAGTTGAACTCGGTACAGAAGTAAATGAGTTCGCCTGTGTTGTGGCAGATGCTGTCATAAAAACTTTGCAACCAGTATCTGAATTACTTACACCACTGGGCATTGATTTAGATGAGTGGAGTATGGCTACATACTACTTATTTGATGAGTCTGGTGAGTTTAAATTGGCTTCACATATGTATTGTTCTTTCTACCCTCCAGATGAGGATGAAGAAGAAGGTGATTGTGAAGAAGAAGAGTTTGAGCCATCAACTCAATATGAGTATGGTACTGAAGATGATTACCAAGGTAAACCTTTGGAATTTGGTGCCACTTCTGCTGCTCTTCAACCTGAAGAAGAGCAAGAAGAAGATTGGTTAGATGATGATAGTCAACAAACTGTTGGTCAACAAGACGGCAGTGAGGACAATCAGACAACTACTATTCAAACAATTGTTGAGGTTCAACCTCAATTAGAGATGGAACTTACACCAGTTGTTCAGACTATTGAAGTGAATAGTTTTAGTGGTTATTTAAAACTTACTGACAATGTATACATTAAAAATGCAGACATTGTGGAAGAAGCTAAAAAGGTAAAACCAACAGTGGTTGTTAATGCAGCCAATGTTTACCTTAAACATGGAGGAGGTGTTGCAGGAGCCTTAAATAAGGCTACTAACAATGCCATGCAAGTTGAATCTGATGATTACATAGCTACTAATGGACCACTTAAAGTGGGTGGTAGTTGTGTTTTAAGCGGACACAATCTTGCTAAACACTGTCTTCATGTTGTCGGCCCAAATGTTAACAAAGGTGAAGACATTCAACTTCTTAAGAGTGCTTATGAAAATTTTAATCAGCACGAAGTTCTACTTGCACCATTATTATCAGCTGGTATTTTTGGTGCTGACCCTATACATTCTTTAAGAGTTTGTGTAGATACTGTTCGCACAAATGTCTACTTAGCTGTCTTTGATAAAAATCTCTATGACAAACTTGTTTCAAGCTTTTTGGAAATGAAGAGTGAAAAGCAAGTTGAACAAAAGATCGCTGAGATTCCTAAAGAGGAAGTTAAGCCATTTATAACTGAAAGTAAACCTTCAGTTGAACAGAGAAAACAAGATGATAAGAAAATCAAAGCTTGTGTTGAAGAAGTTACAACAACTCTGGAAGAAACTAAGTTCCTCACAGAAAACTTGTTACTTTATATTGACATTAATGGCAATCTTCATCCAGATTCTGCCACTCTTGTTAGTGACATTGACATCACTTTCTTAAAGAAAGATGCTCCATATATAGTGGGTGATGTTGTTCAAGAGGGTGTTTTAACTGCTGTGGTTATACCTACTAAAAAGGCTGGTGGCACTACTGAAATGCTAGCGAAAGCTTTGAGAAAAGTGCCAACAGACAATTATATAACCACTTACCCGGGTCAGGGTTTAAATGGTTACACTGTAGAGGAGGCAAAGACAGTGCTTAAAAAGTGTAAAAGTGCCTTTTACATTCTACCATCTATTATCTCTAATGAGAAGCAAGAAATTCTTGGAACTGTTTCTTGGAATTTGCGAGAAATGCTTGCACATGCAGAAGAAACACGCAAATTAATGCCTGTCTGTGTGGAAACTAAAGCCATAGTTTCAACTATACAGCGTAAATATAAGGGTATTAAAATACAAGAGGGTGTGGTTGATTATGGTGCTAGATTTTACTTTTACACCAGTAAAACAACTGTAGCGTCACTTATCAACACACTTAACGATCTAAATGAAACTCTTGTTACAATGCCACTTGGCTATGTAACACATGGCTTAAATTTGGAAGAAGCTGCTCGGTATATGAGATCTCTCAAAGTGCCAGCTACAGTTTCTGTTTCTTCACCTGATGCTGTTACAGCGTATAATGGTTATCTTACTTCTTCTTCTAAAACACCTGAAGAACATTTTATTGAAACCATCTCACTTGCTGGTTCCTATAAAGATTGGTCCTATTCTGGACAATCTACACAACTAGGTATAGAATTTCTTAAGAGAGGTGATAAAAGTGTATATTACACTAGTAATCCTACCACATTCCACCTAGATGGTGAAGTTATCACCTTTGACAATCTTAAGACACTTCTTTCTTTGAGAGAAGTGAGGACTATTAAGGTGTTTACAACAGTAGACAACATTAACCTCCACACGCAAGTTGTGGACATGTCAATGACATATGGACAACAGTTTGGTCCAACTTATTTGGATGGAGCTGATGTTACTAAAATAAAACCTCATAATTCACATGAAGGTAAAACATTTTATGTTTTACCTAATGATGACACTCTACGTGTTGAGGCTTTTGAGTACTACCACACAACTGATCCTAGTTTTCTGGGTAGGTACATGTCAGCATTAAATCACACTAAAAAGTGGAAATACCCACAAGTTAATGGTTTAACTTCTATTAAATGGGCAGATAACAACTGTTATCTTGCCACTGCATTGTTAACACTCCAACAAATAGAGTTGAAGTTTAATCCACCTGCTCTACAAGATGCTTATTACAGAGCAAGGGCTGGTGAAGCTGCTAACTTTTGTGCACTTATCTTAGCCTACTGTAATAAGACAGTAGGTGAGTTAGGTGATGTTAGAGAAACAATGAGTTACTTGTTTCAACATGCCAATTTAGATTCTTGCAAAAGAGTCTTGAACGTGGTGTGTAAAACTTGTGGACAACAGCAGACAACCCTTAAGGGTGTAGAAGCTGTTATGTACATGGGCACACTTTCTTATGAACAATTTAAGAAAGGTGTTCAGATACCTTGTACGTGTGGTAAACAAGCTACAAAATATCTAGTACAACAGGAGTCACCTTTTGTTATGATGTCAGCACCACCTGCTCAGTATGAACTTAAGCATGGTACATTTACTTGTGCTAGTGAGTACACTGGTAATTACCAGTGTGGTCACTATAAACATATAACTTCTAAAGAAACTTTGTATTGCATAGACGGTGCTTTACTTACAAAGTCCTCAGAATACAAAGGTCCTATTACGGATGTTTTCTACAAAGAAAACAGTTACACAACAACCATAAAACCAGTTACTTATAAATTGGATGGTGTTGTTTGTACAGAAATTGACCCTAAGTTGGACAATTATTATAAGAAAGACAATTCTTATTTCACAGAGCAACCAATTGATCTTGTACCAAACCAACCATATCCAAACGCAAGCTTCGATAATTTTAAGTTTGTATGTGATAATATCAAATTTGCTGATGATTTAAACCAGTTAACTGGTTATAAGAAACCTGCTTCAAGAGAGCTTAAAGTTACATTTTTCCCTGACTTAAATGGTGATGTGGTGGCTATTGATTATAAACACTACACACCCTCTTTTAAGAAAGGAGCTAAATTGTTACATAAACCTATTGTTTGGCATGTTAACAATGCAACTAATAAAGCCACGTATAAACCAAATACCTGGTGTATACGTTGTCTTTGGAGCACAAAACCAGTTGAAACATCAAATTCGTTTGATGTACTGAAGTCAGAGGACGCGCAGGGAATGGATAATCTTGCCTGCGAAGATCTAAAACCAGTCTCTGAAGAAGTAGTGGAAAATCCTACCATACAGAAAGACGTTCTTGAGTGTAATGTGAAAACTACCGAAGTTGTAGGAGACATTATACTTAAACCAGCAAATAATAGTTTAAAAATTACAGAAGAGGTTGGCCACACAGATCTAATGGCTGCTTATGTAGACAATTCTAGTCTTACTATTAAGAAACCTAATGAATTATCTAGAGTATTAGGTTTGAAAACCCTTGCTACTCATGGTTTAGCTGCTGTTAATAGTGTCCCTTGGGATACTATAGCTAATTATGCTAAGCCTTTTCTTAACAAAGTTGTTAGTACAACTACTAACATAGTTACACGGTGTTTAAACCGTGTTTGTACTAATTATATGCCTTATTTCTTTACTTTATTGCTACAATTGTGTACTTTTACTAGAAGTACAAATTCTAGAATTAAAGCATCTATGCCGACTACTATAGCAAAGAATACTGTTAAGAGTGTCGGTAAATTTTGTCTAGAGGCTTCATTTAATTATTTGAAGTCACCTAATTTTTCTAAACTGATAAATATTATAATTTGGTTTTTACTATTAAGTGTTTGCCTAGGTTCTTTAATCTACTCAACCGCTGCTTTAGGTGTTTTAATGTCTAATTTAGGCATGCCTTCTTACTGTACTGGTTACAGAGAAGGCTATTTGAACTCTACTAATGTCACTATTGCAACCTACTGTACTGGTTCTATACCTTGTAGTGTTTGTCTTAGTGGTTTAGATTCTTTAGACACCTATCCTTCTTTAGAAACTATACAAATTACCATTTCATCTTTTAAATGGGATTTAACTGCTTTTGGCTTAGTTGCAGAGTGGTTTTTGGCATATATTCTTTTCACTAGGTTTTTCTATGTACTTGGATTGGCTGCAATCATGCAATTGTTTTTCAGCTATTTTGCAGTACATTTTATTAGTAATTCTTGGCTTATGTGGTTAATAATTAATCTTGTACAAATGGCCCCGATTTCAGCTATGGTTAGAATGTACATCTTCTTTGCATCATTTTATTATGTATGGAAAAGTTATGTGCATGTTGTAGACGGTTGTAATTCATCAACTTGTATGATGTGTTACAAACGTAATAGAGCAACAAGAGTCGAATGTACAACTATTGTTAATGGTGTTAGAAGGTCCTTTTATGTCTATGCTAATGGAGGTAAAGGCTTTTGCAAACTACACAATTGGAATTGTGTTAATTGTGATACATTCTGTGCTGGTAGTACATTTATTAGTGATGAAGTTGCGAGAGACTTGTCACTACAGTTTAAAAGACCAATAAATCCTACTGACCAGTCTTCTTACATCGTTGATAGTGTTACAGTGAAGAATGGTTCCATCCATCTTTACTTTGATAAAGCTGGTCAAAAGACTTATGAAAGACATTCTCTCTCTCATTTTGTTAACTTAGACAACCTGAGAGCTAATAACACTAAAGGTTCATTGCCTATTAATGTTATAGTTTTTGATGGTAAATCAAAATGTGAAGAATCATCTGCAAAATCAGCGTCTGTTTACTACAGTCAGCTTATGTGTCAACCTATACTGTTACTAGATCAGGCATTAGTGTCTGATGTTGGTGATAGTGCGGAAGTTGCAGTTAAAATGTTTGATGCTTACGTTAATACGTTTTCATCAACTTTTAACGTACCAATGGAAAAACTCAAAACACTAGTTGCAACTGCAGAAGCTGAACTTGCAAAGAATGTGTCCTTAGACAATGTCTTATCTACTTTTATTTCAGCAGCTCGGCAAGGGTTTGTTGATTCAGATGTAGAAACTAAAGATGTTGTTGAATGTCTTAAATTGTCACATCAATCTGACATAGAAGTTACTGGCGATAGTTGTAATAACTATATGCTCACCTATAACAAAGTTGAAAACATGACACCCCGTGACCTTGGTGCTTGTATTGACTGTAGTGCGCGTCATATTAATGCGCAGGTAGCAAAAAGTCACAACATTGCTTTGATATGGAACGTTAAAGATTTCATGTCATTGTCTGAACAACTACGAAAACAAATACGTAGTGCTGCTAAAAAGAATAACTTACCTTTTAAGTTGACATGTGCAACTACTAGACAAGTTGTTAATGTTGTAACAACAAAGATAGCACTTAAGGGTGGTAAAATTGTTAATAATTGGTTGAAGCAGTTAATTAAAGTTACACTTGTGTTCCTTTTTGTTGCTGCTATTTTCTATTTAATAACACCTGTTCATGTCATGTCTAAACATACTGACTTTTCAAGTGAAATCATAGGATACAAGGCTATTGATGGTGGTGTCACTCGTGACATAGCATCTACAGATACTTGTTTTGCTAACAAACATGCTGATTTTGACACATGGTTTAGCCAGCGTGGTGGTAGTTATACTAATGACAAAGCTTGCCCATTGATTGCTGCAGTCATAACAAGAGAAGTGGGTTTTGTCGTGCCTGGTTTGCCTGGCACGATATTACGCACAACTAATGGTGACTTTTTGCATTTCTTACCTAGAGTTTTTAGTGCAGTTGGTAACATCTGTTACACACCATCAAAACTTATAGAGTACACTGACTTTGCAACATCAGCTTGTGTTTTGGCTGCTGAATGTACAATTTTTAAAGATGCTTCTGGTAAGCCAGTACCATATTGTTATGATACCAATGTACTAGAAGGTTCTGTTGCTTATGAAAGTTTACGCCCTGACACACGTTATGTGCTCATGGATGGCTCTATTATTCAATTTCCTAACACCTACCTTGAAGGTTCTGTTAGAGTGGTAACAACTTTTGATTCTGAGTACTGTAGGCACGGCACTTGTGAAAGATCAGAAGCTGGTGTTTGTGTATCTACTAGTGGTAGATGGGTACTTAACAATGATTATTACAGATCTTTACCAGGAGTTTTCTGTGGTGTAGATGCTGTAAATTTACTTACTAATATGTTTACACCACTAATTCAACCTATTGGTGCTTTGGACATATCAGCATCTATAGTAGCTGGTGGTATTGTAGCTATCGTAGTAACATGCCTTGCCTACTATTTTATGAGGTTTAGAAGAGCTTTTGGTGAATACAGTCATGTAGTTGCCTTTAATACTTTACTATTCCTTATGTCATTCACTGTACTCTGTTTAACACCAGTTTACTCATTCTTACCTGGTGTTTATTCTGTTATTTACTTGTACTTGACATTTTATCTTACTAATGATGTTTCTTTTTTAGCACATATTCAGTGGATGGTTATGTTCACACCTTTAGTACCTTTCTGGATAACAATTGCTTATATCATTTGTATTTCCACAAAGCATTTCTATTGGTTCTTTAGTAATTACCTAAAGAGACGTGTAGTCTTTAATGGTGTTTCCTTTAGTACTTTTGAAGAAGCTGCGCTGTGCACCTTTTTGTTAAATAAAGAAATGTATCTAAAGTTGCGTAGTGATGTGCTATTACCTCTTACGCAATATAATAGATACTTAGCTCTTTATAATAAGTACAAGTATTTTAGTGGAGCAATGGATACAACTAGCTACAGAGAAGCTGCTTGTTGTCATCTCGCAAAGGCTCTCAATGACTTCAGTAACTCAGGTTCTGATGTTCTTTACCAACCACCACAAACCTCTATCACCTCAGCTGTTTTGCAGAGTGGTTTTAGAAAAATGGCATTCCCATCTGGTAAAGTTGAGGGTTGTATGGTACAAGTAACTTGTGGTACAACTACACTTAACGGTCTTTGGCTTGATGACGTAGTTTACTGTCCAAGACATGTGATCTGCACCTCTGAAGACATGCTTAACCCTAATTATGAAGATTTACTCATTCGTAAGTCTAATCATAATTTCTTGGTACAGGCTGGTAATGTTCAACTCAGGGTTATTGGACATTCTATGCAAAATTGTGTACTTAAGCTTAAGGTTGATACAGCCAATCCTAAGACACCTAAGTATAAGTTTGTTCGCATTCAACCAGGACAGACTTTTTCAGTGTTAGCTTGTTACAATGGTTCACCATCTGGTGTTTACCAATGTGCTATGAGGCCCAATTTCACTATTAAGGGTTCATTCCTTAATGGTTCATGTGGTAGTGTTGGTTTTAACATAGATTATGACTGTGTCTCTTTTTGTTACATGCACCATATGGAATTACCAACTGGAGTTCATGCTGGCACAGACTTAGAAGGTAACTTTTATGGACCTTTTGTTGACAGGCAAACAGCACAAGCAGCTGGTACGGACACAACTATTACAGTTAATGTTTTAGCTTGGTTGTACGCTGCTGTTATAAATGGAGACAGGTGGTTTCTCAATCGATTTACCACAACTCTTAATGACTTTAACCTTGTGGCTATGAAGTACAATTATGAACCTCTAACACAAGACCATGTTGACATACTAGGACCTCTTTCTGCTCAAACTGGAATTGCCGTTTTAGATATGTGTGCTTCATTAAAAGAATTACTGCAAAATGGTATGAATGGACGTACCATATTGGGTAGTGCTTTATTAGAAGATGAATTTACACCTTTTGATGTTGTTAGACAATGCTCAGGTGTTACTTTCCAAAGTGCAGTGAAAAGAACAATCAAGGGTACACACCACTGGTTGTTACTCACAATTTTGACTTCACTTTTAGTTTTAGTCCAGAGTACTCAATGGTCTTTGTTCTTTTTTTTNTATGAAAATGCCTTTTTACCTTTTGCTATGGGTATTATTGCTATGTCTGCTTTTGCAATGATGTTTGTCAAACATAAGCATGCATTTCTCTGTTTGTTTTTGTTACCTTCTCTTGCCACTGTAGCTTATTTTAATATGGTCTATATGCCTGCTAGTTGGGTGATGCGTATTATGACATGGTTGGATATGGTTGATACTAGTTTGTCTGGTTTTAAGCTAAAAGACTGTGTTATGTATGCATCAGCTGTAGTGTTACTAATCCTTATGACAGCAAGAACTGTGTATGATGATGGTGCTAGGAGAGTGTGGACACTTATGAATGTCTTGACACTCGTTTATAAAGTTTATTATGGTAATGCTTTAGATCAAGCCATTTCCATGTGGGCTCTTATAATCTCTGTTACTTCTAACTACTCAGGTGTAGTTACAACTGTCATGTTTTTGGCCAGAGGTATTGTTTTTATGTGTGTTGAGTATTGCCCTATTTTCTTCATAACTGGTAATACACTTCAGTGTATAATGCTAGTTTATTGTTTCTTAGGCTATTTTTGTACTTGTTACTTTGGCCTCTTTTGTTTACTCAACCGCTACTTTAGACTGACTCTTGGTGTTTATGATTACTTAGTTTCTACACAGGAGTTTAGATATATGAATTCACAGGGACTACTCCCACCCAAGAATAGCATAGATGCCTTCAAACTCAACATTAAATTGTTGGGTGTTGGTGGCAAACCTTGTATCAAAGTAGCCACTGTACAGTCTAAAATGTCAGATGTAAAGTGCACATCAGTAGTCTTACTCTCAGTTTTGCAACAACTCAGAGTAGAATCATCATCTAAATTGTGGGCTCAATGTGTCCAGTTACACAATGACATTCTCTTAGCTAAAGATACTACTGAAGCCTTTGAAAAAATGGTTTCACTACTTTCTGTTTTGCTTTCCATGCAGGGTGCTGTAGACATAAACAAGCTTTGTGAAGAAATGCTGGACAACAGGGCAACCTTACAAGCTATAGCCTCAGAGTTTAGTTCCCTTCCATCATATGCAGCTTTTGCTACTGCTCAAGAAGCTTATGAGCAGGCTGTTGCTAATGGTGATTCTGAAGTTGTTCTTAAAAAGTTGAAGAAGTCTTTGAATGTGGCTAAATCTGAATTTGACCGTGATGCAGCCATGCAACGTAAGTTGGAAAAGATGGCTGATCAAGCTATGACCCAAATGTATAAACAGGCTAGATCTGAGGACAAGAGGGCAAAAGTTACTAGTGCTATGCAGACAATGCTTTTCACTATGCTTAGAAAGTTGGATAATGATGCACTCAACAACATTATCAACAATGCAAGAGATGGTTGTGTTCCCTTGAACATAATACCTCTTACAACAGCAGCCAAACTAATGGTTGTCATACCAGACTATAACACATATAAAAATACGTGTGATGGTACAACATTTACTTATGCATCAGCATTGTGGGAAATCCAACAGGTTGTAGATGCAGATAGTAAAATTGTTCAACTTAGTGAAATTAGTATGGACAATTCACCTAATTTAGCATGGCCTCTTATTGTAACAGCTTTAAGGGCCAATTCTGCTGTCAAATTACAGAATAATGAGCTTAGTCCTGTTGCACTACGACAGATGTCTTGTGCTGCCGGTACTACACAAACTGCTTGCACTGATGACAATGCGTTAGCTTACTACAACACAACAAAGGGAGGTAGGTTTGTACTTGCACTGTTATCCGATTTACAGGATTTGAAATGGGCTAGATTCCCTAAGAGTGATGGAACTGGTACTATCTATACAGAACTGGAACCACCTTGTAGGTTTGTTACAGACACACCTAAAGGTCCTAAAGTGAAGTATTTATACTTTATTAAAGGATTAAACAACCTAAATAGAGGTATGGTACTTGGTAGTTTAGCTGCCACAGTACGTCTACAAGCTGGTAATGCAACAGAAGTGCCTGCCAATTCAACTGTATTATCTTTCTGTGCTTTTGCTGTAGATGCTGCTAAAGCTTACAAAGATTATCTAGCTAGTGGGGGACAACCAATCACTAATTGTGTTAAGATGTTGTGTACACACACTGGTACTGGTCAGGCAATAACAGTTACACCGGAAGCCAATATGGATCAAGAATCCTTTGGTGGTGCATCGTGTTGTCTGTACTGCCGTTGCCACATAGATCATCCAAATCCTAAAGGATTTTGTGACTTAAAAGGTAAGTATGTACAAATACCTACAACTTGTGCTAATGACCCTGTGGGTTTTACACTTAAAAACACAGTCTGTACCGTCTGCGGTATGTGGAAAGGTTATGGCTGTAGTTGTGATCAACTCCGCGAACCCATGCTTCAGTCAGCTGATGCACAATCGTTTTTAAACGGGTTTGCGGTGTAAGTGCAGCCCGTCTTACACCGTGCGGCACAGGCACTAGTACTGATGTCGTATACAGGGCTTTTGACATCTACAATGATAAAGTAGCTGGTTTTGCTAAATTCCTAAAAACTAATTGTTGTCGCTTCCAAGAAAAGGACGAAGATGACAATTTAATTGATTCTTACTTTGTAGTTAAGAGACACACTTTCTCTAACTACCAACATGAAGAAACAATTTATAATTTACTTAAGGATTGTCCAGCTGTTGCTAAACATGACTTCTTTAAGTTTAGAATAGACGGTGACATGGTACCACATATATCACGTCAACGTCTTACTAAATACACAATGGCAGACCTCGTCTATGCTTTAAGGCATTTTGATGAAGGTAATTGTGACACATTAAAAGAAATACTTGTCACATACAATTGTTGTGATGATGATTATTTCAATAAAAAGGACTGGTATGATTTTGTAGAAAACCCAGATATATTACGCGTATACGCCAACTTAGGTGAACGTGTACGCCAAGCTTTGTTAAAAACAGTACAATTCTGTGATGCCATGCGAAATGCTGGTATTGTTGGTGTACTGACATTAGATAATCAAGATCTCAATGGTAACTGGTATGATTTCGGTGATTTCATACAAACCACGCCAGGTAGTGGAGTTCCTGTTGTAGATTCTTATTATTCATTGTTAATGCCTATATTAACCTTGACCAGGGCTTTAACTGCAGAGTCACATGTTGACACTGACTTAACAAAGCCTTACATTAAGTGGGATTTGTTAAAATATGACTTCACGGAAGAGAGGTTAAAACTCTTTGACCGTTATTTTAAATATTGGGATCAGACATACCACCCAAATTGTGTTAACTGTTTGGATGACAGATGCATTCTGCATTGTGCAAACTTTAATGTTTTATTCTCTACAGTGTTCCCACCTACAAGTTTTGGACCACTAGTGAGAAAAATATTTGTTGATGGTGTTCCATTTGTAGTTTCAACTGGATACCACTTCAGAGAGCTAGGTGTTGTACATAATCAGGATGTAAACTTACATAGCTCTAGACTTAGTTTTAAGGAATTACTTGTGTATGCTGCTGACCCTGCTATGCACGCTGCTTCTGGTAATCTATTACTAGATAAACGCACTACGTGCTTTTCAGTAGCTGCACTTACTAACAATGTTGCTTTTCAAACTGTCAAACCCGGTAATTTTAACAAAGACTTCTATGACTTTGCTGTGTCTAAGGGTTTCTTTAAGGAAGGAAGTTCTGTTGAATTAAAACACTTCTTCTTTGCTCAGGATGGTAATGCTGCTATCAGCGATTATGACTACTATCGTTATAATCTACCAACAATGTGTGATATCAGACAACTACTATTTGTAGTTGAAGTTGTTGATAAGTACTTTGATTGTTACGATGGTGGCTGTATTAATGCTAACCAAGTCATCGTCAACAACCTAGACAAATCAGCTGGTTTTCCATTTAATAAATGGGGTAAGGCTAGACTTTATTATGATTCAATGAGTTATGAGGATCAAGATGCACTTTTCGCATATACAAAACGTAATGTCATCCCTACTATAACTCAAATGAATCTTAAGTATGCCATTAGTGCAAAGAATAGAGCTCGCACCGTAGCTGGTGTCTCTATCTGTAGTACTATGACCAATAGACAGTTTCATCAAAAATTATTGAAATCAATAGCCGCCACTAGAGGAGCTACTGTAGTAATTGGAACAAGCAAATTCTATGGTGGTTGGCACAACATGTTAAAAACTGTTTATAGTGATGTAGAAAACCCTCACCTTATGGGTTGGGATTATCCTAAATGTGATAGAGCCATGCCTAACATGCTTAGAATTATGGCCTCACTTGTTCTTGCTCGCAAACATACAACGTGTTGTAGCTTGTCACACCGTTTCTATAGATTAGCTAATGAGTGTGCTCAAGTATTGAGTGAAATGGTCATGTGTGGCGGTTCACTATATGTTAAACCAGGTGGAACCTCATCAGGAGATGCCACAACTGCTTATGCTAATAGTGTTTTTAACATTTGTCAAGCTGTCACGGCCAATGTTAATGCACTTTTATCTACTGATGGTAACAAAATTGCCGATAAGTATGTCCGCAATTTACAACACAGACTTTATGAGTGTCTCTATAGAAATAGAGATGTTGACACAGACTTTGTGAATGAGTTTTACGCATATTTGCGTAAACATTTCTCAATGATGATACTCTCTGACGATGCTGTTGTGTGTTTCAATAGCACTTATGCATCTCAAGGTCTAGTGGCTAGCATAAAGAACTTTAAGTCAGTTCTTTATTATCAAAACAATGTTTTTATGTCTGAAGCAAAATGTTGGACTGAGACTGACCTTACTAAAGGACCTCATGAATTTTGCTCTCAACATACAATGCTAGTTAAACAGGGTGATGATTATGTGTACCTTCCTTACCCAGATCCATCAAGAATCCTAGGGGCCGGCTGTTTTGTAGATGATATCGTAAAAACAGATGGTACACTTATGATTGAACGGTTCGTGTCTTTAGCTATAGATGCTTACCCACTTACTAAACATCCTAATCAGGAGTATGCTGATGTCTTTCATTTGTACTTACAATACATAAGAAAGCTACATGATGAGTTAACAGGACACATGTTAGACATGTATTCTGTTATGCTTACTAATGATAACACTTCAAGGTATTGGGAACCTGAGTTTTATGAGGCTATGTACACACCGCATACAGTCTTACAGGCTGTTGGGGCTTGTGTTCTTTGCAATTCACAGACTTCATTAAGATGTGGTGCTTGCATACGTAGACCATTCTTATGTTGTAAATGCTGTTACGACCATGTCATATCAACATCACATAAATTAGTCTTGTCTGTTAATCCGTATGTTTGCAATGCTCCAGGTTGTGATGTCACAGATGTGACTCAACTTTACTTAGGAGGTATGAGCTATTATTGTAAATCACATAAACCACCCATTAGTTTTCCATTGTGTGCTAATGGACAAGTTTTTGGTTTATATAAAAATACATGTGTTGGTAGCGATAATGTTACTGACTTTAATGCAATTGCAACATGTGACTGGACAAATGCTGGTGATTACATTTTAGCTAACACCTGTACTGAAAGACTCAAGCTTTTTGCAGCAGAAACGCTCAAAGCTACTGAGGAGACATTTAAACTGTCTTATGGTATTGCTACTGTACGTGAAGTGCTGTCTGACAGAGAATTACATCTTTCATGGGAAGTTGGTAAACCTAGACCACCACTTAACCGAAATTATGTCTTTACTGGTTATCGTGTAACTAAAAACAGTAAAGTACAAATAGGAGAGTACACCTTTGAAAAAGGTGACTATGGTGATGCTGTTGTTTACCGAGGTACAACAACTTACAAATTAAATGTTGGTGATTATTTTGTGCTGACATCACATACAGTAATGCCATTAAGTGCACCTACACTAGTGCCACAAGAGCACTATGTTAGAATTACTGGCTTATACCCAACACTCAATATCTCAGATGAGTTTTCTAGCAATGTTGCAAATTATCAAAAGGTTGGTATGCAAAAGTATTCTACACTCCAGGGACCACCTGGTACTGGTAAGAGTCATTTTGCTATTGGCCTAGCTCTCTACTACCCTTCTGCTCGCATAGTGTATACAGCTTGCTCTCATGCCGCTGTTGATGCACTATGTGAGAAGGCATTAAAATATTTGCCTATAGATAAATGTAGTAGAATTATACCTGCACGTGCTCGTGTAGAGTGTTTTGATAAATTCAAAGTGAATTCAACATTAGAACAGTATGTCTTTTGTACTGTAAATGCATTGCCTGAGACGACAGCAGATATAGTTGTCTTTGATGAAATTTCAATGGCCACAAATTATGATTTGAGTGTTGTCAATGCCAGATTACGTGCTAAGCACTATGTGTACATTGGCGACCCTGCTCAATTACCTGCACCACGCACATTGCTAACTAAGGGCACACTAGAACCAGAATATTTCAATTCAGTGTGTAGACTTATGAAAACTATAGGTCCAGACATGTTCCTCGGAACTTGTCGGCGTTGTCCTGCTGAAATTGTTGACACTGTGAGTGCTTTGGTTTATGATAATAAGCTTAAAGCACATAAAGACAAATCAGCTCAATGCTTTAAAATGTTTTATAAGGGTGTTATCACGCATGATGTTTCATCTGCAATTAACAGGCCACAAATAGGCGTGGTAAGAGAATTCCTTACACGTAACCCTGCTTGGAGAAAAGCTGTCTTTATTTCACCTTATAATTCACAGAATGCTGTAGCCTCAAAGATTTTGGGACTACCAACTCAAACTGTTGATTCATCACAGGGCTCAGAATATGACTATGTCATATTCACTCAAACCACTGAAACAGCTCACTCTTGTAATGTAAACAGATTTAATGTTGCTATTACCAGAGCAAAAGTAGGCATACTTTGCATAATGTCTGATAGAGACCTTTATGACAAGTTGCAATTTACAAGTCTTGAAATTCCACGTAGGAATGTGGCAACTTTACAAGCTGAAAATGTAACAGGACTCTTTAAAGATTGTAGTAAGGTAATCACTGGGTTACATCCTACACAGGCACCTACACACCTCAGTGTTGACACTAAATTCAAAACTGAAGGTTTATGTGTTGACATACCTGGCATACCTAAGGACATGACCTATAGAAGACTCATCTCTATGATGGGTTTTAAAATGAATTATCAAGTTAATGGTTACCCTAACATGTTTATCACCCGCGAAGAAGCTATAAGACATGTACGTGCATGGATTGGCTTCGATGTCGAGGGGTGTCATGCTACTAGAGAAGCTGTTGGTACCAATTTACCTTTACAGCTAGGTTTTTCTACAGGTGTTAACCTAGTTGCTGTACCTACAGGTTATGTTGATACACCTAATAATACAGATTTTTCCAGAGTTAGTGCTAAACCACCGCCTGGAGATCAATTTAAACACCTCATACCACTTATGTACAAAGGACTTCCTTGGAATGTAGTGCGTATAAAGATTGTACAAATGTTAAGTGACACACTTAAAAATCTCTCTGACAGAGTCGTATTTGTCTTATGGGCACATGGCTTTGAGTTGACATCTATGAAGTATTTTGTGAAAATAGGACCTGAGCGCACCTGTTGTCTATGTGATAGACGTGCCACATGCTTTTCCACTGCTTCAGACACTTATGCCTGTTGGCATCATTCTATTGGATTTGATTACGTCTATAATCCGTTTATGATTGATGTTCAACAATGGGGTTTTACAGGTAACCTACAAAGCAACCATGATCTGTATTGTCAAGTCCATGGTAATGCACATGTAGCTAGTTGTGATGCAATCATGACTAGGTGTCTAGCTGTCCACGAGTGCTTTGTTAAGCGTGTTGACTGGACTATTGAATATCCTATAATTGGTGATGAACTGAAGATTAATGCGGCTTGTAGAAAGGTTCAACACATGGTTGTTAAAGCTGCATTATTAGCAGACAAATTCCCAGTTCTTCACGACATTGGTAACCCTAAAGCTATTAAGTGTGTACCTCAAGCTGATGTAGAATGGAAGTTCTATGATGCACAGCCTTGTAGTGACAAAGCTTATAAAATAGAAGAATTATTCTATTCTTATGCCACACATTCTGACAAATTCACAGATGGTGTATGCCTATTTTGGAATTGCAATGTCGATAGATATCCTGCTAATTCCATTGTTTGTAGATTTGACACTAGAGTGCTATCTAACCTTAACTTGCCTGGTTGTGATGGTGGCAGTTTGTATGTAAATAAACATGCATTCCACACACCAGCTTTTGATAAAAGTGCTTTTGTTAATTTAAAACAATTACCATTTTTCTATTACTCTGACAGTCCATGTGAGTCTCATGGAAAACAAGTAGTGTCAGATATAGATTATGTACCACTAAAGTCTGCTACGTGTATAACACGTTGCAATTTAGGTGGTGCTGTCTGTAGACATCATGCTAATGAGTACAGATTGTATCTCGATGCTTATAACATGATGATCTCAGCTGGCTTTAGCTTGTGGGTTTACAAACAATTTGATACTTATAACCTCTGGAACACTTTTACAAGACTTCAGAGTTTAGAAAATGTGGCTTTTAATGTTGTAAATAAGGGACACTTTGATGGACAACAGGGTGAAGTACCAGTTTCTATCATTAATAACACTGTTTACACAAAAGTTGATGGTGTTGATGTAGAATTGTTTGAAAATAAAACAACATTACCTGTTAATGTAGCATTTGAGCTTTGGGCTAAGCGCAACATTAAACCAGTACCAGAGGTGAAAATACTCAATAATTTGGGTGTGGACATTGCTGCTAATACTGTGATCTGGGACTACAAAAGAGATGCTCCAGCACATATATCTACTATTGGTGTTTGTTCTATGACTGACATAGCCAAGAAACCAACTGAAACGATTTGTGCACCACTCACTGTCTTTTTTGATGGTAGAGTTGATGGTCAAGTAGACTTATTTAGAAATGCCCGTAATGGTGTTCTTATTACAGAAGGTAGTGTTAAAGGTTTACAACCATCTGTAGGTCCCAAACAAGCTAGTCTTAATGGAGTCACATTAATTGGAGAAGCCGTAAAAACACAGTTCAATTATTATAAGAAAGTTGATGGTGTTGTCCAACAATTACCTGAAACTTACTTTACTCAGAGTAGAAATTTACAAGAATTTAAACCCAGGAGTCAAATGGAAATTGATTTCTTAGAATTAGCTATGGATGAATTCATTGAACGGTATAAATTAGAAGGCTATGCCTTCGAACATATCGTTTATGGAGATTTTAGTCATAGTCAGTTAGGTGGTTTACATCTACTGATTGGACTAGCTAAACGTTTTAAGGAATCACCTTTTGAATTAGAAGATTTTATTCCTATGGACAGTACAGTTAAAAACTATTTCATAACAGATGCGCAAACAGGTTCATCTAAGTGTGTGTGTTCTGTTATTGATTTATTACTTGATGATTTTGTTGAAATAATAAAATCCCAAGATTTATCTGTAGTTTCTAAGGTTGTCAAAGTGACTATTGACTATACAGAAATTTCATTTATGCTTTGGTGTAAAGATGGCCATGTAGAAACATTTTACCCAAAATTACAATCTAGTCAAGCGTGGCAACCGGGTGTTGCTATGCCTAATCTTTACAAAATGCAAAGAATGCTATTAGAAAAGTGTGACCTTCAAAATTATGGTGATAGTGCAACATTACCTAAAGGCATAATGATGAATGTCGCAAAATATACTCAACTGTGTCAATATTTAAACACATTAACATTAGCTGTACCCTATAATATGAGAGTTATACATTTTGGTGCTGGTTCTGATAAAGGAGTTGCACCAGGTACAGCTGTTTTAAGACAGTGGTTGCCTACGGGTACGCTGCTTGTCGATTCAGATCTTAATGACTTTGTCTCTGATGCAGATTCAACTTTGATTGGTGATTGTGCAACTGTACATACAGCTAATAAATGGGATCTCATTATTAGTGATATGTACGACCCTAAGACTAAAAATGTTACAAAAGAAAATGACTCTAAAGAGGGTTTTTTCACTTACATTTGTGGGTTTATACAACAAAAGCTAGCTCTTGGAGGTTCCGTGGCTATAAAGATAACAGAACATTCTTGGAATGCTGATCTTTATAAGCTCATGGGACACTTCGCATGGTGGACAGCCTTTGTTACTAATGTGAATGCGTCATCATCTGAAGCATTTTTAATTGGATGTAATTATCTTGGCAAACCACGCGAACAAATAGATGGTTATGTCATGCATGCAAATTACATATTTTGGAGGAATACAAATCCAATTCAGTTGTCTTCCTATTCTTTATTTGACATGAGTAAATTTCCCCTTAAATTAAGGGGTACTGCTGTTATGTCTTTAAAAGAAGGTCAAATCAATGATATGATTTTATCTCTTCTTAGTAAAGGTAGACTTATAATTAGAGAAAACAACAGAGTTGTTATTTCTAGTGATGTTCTTGTTAACAACTAAACGAACAATGTTTGTTTTTCTTGTTTTATTGCCACTAGTCTCTAGTCAGTGTGTTAATCTTACAACCAGAACTCAATTACCCCCTGCATACACTAATTCTTTCACACGTGGTGTTTATTACCCTGACAAAGTTTTCAGATCCTCAGTTTTACATTCAACTCAGGACTTGTTCTTACCTTTCTTTTCCAATGTTACTTGGTTCCATGCTATACATGTCTCTGGGACCAATGGTACTAAGAGGTTTGATAACCCTGTCCTACCATTTAATGATGGTGTTTATTTTGCTTCCACTGAGAAGTCTAACATAATAAGAGGCTGGATTTTTGGTACTACTTTAGATTCGAAGACCCAGTCCCTACTTATTGTTAATAACGCTACTAATGTTGTTATTAAAGTCTGTGAATTTCAATTTTGTAATGATCCATTTTTGGGTGTTTATTACCACAAAAACAACAAAAGTTGGATGGAAAGTGAGTTCAGAGTTTATTCTAGTGCGAATAATTGCACTTTTGAATATGTCTCTCAGCCTTTTCTTATGGACCTTGAAGGAAAACAGGGTAATTTCAAAAATCTTAGGGAATTTGTGTTTAAGAATATTGATGGTTATTTTAAAATATATTCTAAGCACACGCCTATTAATTTAGTGCGTGATCTCCCTCAGGGTTTTTCGGCTTTAGAACCATTGGTAGATTTGCCAATAGGTATTAACATCACTAGGTTTCAAACTTTACTTGCTTTACATAGAAGTTATTTGACTCCTGGTGATTCTTCTTCAGGTTGGACAGCTGGTGCTGCAGCTTATTATGTGGGTTATCTTCAACCTAGGACTTTTCTATTAAAATATAATGAAAATGGAACCATTACAGATGCTGTAGACTGTGCACTTGACCCTCTCTCAGAAACAAAGTGTACGTTGAAATCCTTCACTGTAGAAAAAGGAATCTATCAAACTTCTAACTTTAGAGTCCAACCAACAGAATCTATTGTTAGATTTCCTAATATTACAAACTTGTGCCCTTTTGGTGAAGTTTTTAACGCCACCAGATTTGCATCTGTTTATGCTTGGAACAGGAAGAGAATCAGCAACTGTGTTGCTGATTATTCTGTCCTATATAATTCCGCATCATTTTCCACTTTTAAGTGTTATGGAGTGTCTCCTACTAAATTAAATGATCTCTGCTTTACTAATGTCTATGCAGATTCATTTGTAATTAGAGGTGATGAAGTCAGACAAATCGCTCCAGGGCAAACTGGAAAGATTGCTGATTATAATTATAAATTACCAGATGATTTTACAGGCTGCGTTATAGCTTGGAATTCTAACAATCTTGATTCTAAGGTTGGTGGTAATTATAATTACCTGTATAGATTGTTTAGGAAGTCTAATCTCAAACCTTTTGAGAGAGATATTTCAACTGAAATCTATCAGGCCGGTAGCACACCTTGTAATGGTGTTGAAGGTTTTAATTGTTACTTTCCTTTACAATCATATGGTTTCCAACCCACTAATGGTGTTGGTTACCAACCATACAGAGTAGTAGTACTTTCTTTTGAACTTCTACATGCACCAGCAACTGTTTGTGGACCTAAAAAGTCTACTAATTTGGTTAAAAACAAATGTGTCAATTTCAACTTCAATGGTTTAACAGGCACAGGTGTTCTTACTGAGTCTAACAAAAAGTTTCTGCCTTTCCAACAATTTGGCAGAGACATTGCTGACACTACTGATGCTGTCCGTGATCCACAGACACTTGAGATTCTTGACATTACACCATGTTCTTTTGGTGGTGTCAGTGTTATAACACCAGGAACAAATACTTCTAACCAGGTTGCTGTTCTTTATCAGGATGTTAACTGCACAGAAGTCCCTGTTGCTATTCATGCAGATCAACTTACTCCTACTTGGCGTGTTTATTCTACAGGTTCTAATGTTTTTCAAACACGTGCAGGCTGTTTAATAGGGGCTGAACATGTCAACAACTCATATGAGTGTGACATACCCATTGGTGCAGGTATATGCGCTAGTTATCAGACTCAGACTAATTCTCCTCGGCGGGCACGTAGTGTAGCTAGTCAATCCATCATTGCCTACACTATGTCACTTGGTGCAGAAAATTCAGTTGCTTACTCTAATAACTCTATTGCCATACCCACAAATTTTACTATTAGTGTTACCACAGAAATTCTACCAGTGTCTATGACCAAGACATCAGTAGATTGTACAATGTACATTTGTGGTGATTCAACTGAATGCAGCAATCTTTTGTTGCAATATGGCAGTTTTTGTACACAATTAAACCGTGCTTTAACTGGAATAGCTGTTGAACAAGACAAAAACACCCAAGAAGTTTTTGCACAAGTCAAACAAATTTACAAAACACCACCAATTAAAGATTTTGGTGGTTTTAATTTTTCACAAATATTACCAGATCCATCAAAACCAAGCAAGAGGTCATTTATTGAAGATCTACTTTTCAACAAAGTGACACTTGCAGATGCTGGCTTCATCAAACAATATGGTGATTGCCTTGGTGATATTGCTGCTAGAGACCTCATTTGTGCACAAAAGTTTAACGGCCTTACTGTTTTGCCACCTTTGCTCACAGATGAAATGATTGCTCAATACACTTCTGCACTGTTAGCGGGTACAATCACTTCTGGTTGGACCTTTGGTGCAGGTGCTGCATTACAAATACCATTTGCTATGCAAATGGCTTATAGGTTTAATGGTATTGGAGTTACACAGAATGTTCTCTATGAGAACCAAAAATTGATTGCCAACCAATTTAATAGTGCTATTGGCAAAATTCAAGACTCACTTTCTTCCACAGCAAGTGCACTTGGAAAACTTCAAGATGTGGTCAACCAAAATGCACAAGCTTTAAACACGCTTGTTAAACAACTTAGCTCCAATTTTGGTGCAATTTCAAGTGTTTTAAATGATATCCTTTCACGTCTTGACAAAGTTGAGGCTGAAGTGCAAATTGATAGGTTGATCACAGGCAGACTTCAAAGTTTGCAGACATATGTGACTCAACAATTAATTAGAGCTGCAGAAATCAGAGCTTCTGCTAATCTTGCTGCTACTAAAATGTCAGAGTGTGTACTTGGACAATCAAAAAGAGTTGATTTTTGTGGAAAGGGCTATCATCTTATGTCCTTCCCTCAGTCAGCACCTCATGGTGTAGTCTTCTTGCATGTGACTTATGTCCCTGCACAAGAAAAGAACTTCACAACTGCTCCTGCCATTTGTCATGATGGAAAAGCACACTTTCCTCGTGAAGGTGTCTTTGTTTCAAATGGCACACACTGGTTTGTAACACAAAGGAATTTTTATGAACCACAAATCATTACTACAGACAACACATTTGTGTCTGGTAACTGTGATGTTGTAATAGGAATTGTCAACAACACAGTTTATGATCCTTTGCAACCTGAATTAGACTCATTCAAGGAGGAGTTAGATAAATATTTTAAGAATCATACATCACCAGATGTTGATTTAGGTGACATCTCTGGCATTAATGCTTCAGTTGTAAACATTCAAAAAGAAATTGACCGCCTCAATGAGGTTGCCAAGAATTTAAATGAATCTCTCATCGATCTCCAAGAACTTGGAAAGTATGAGCAGTATATAAAATGGCCATGGTACATTTGGCTAGGTTTTATAGCTGGCTTGATTGCCATAGTAATGGTGACAATTATGCTTTGCTGTATGACCAGTTGCTGTAGTTGTCTCAAGGGCTGTTGTTCTTGTGGATCCTGCTGCAAATTTGATGAAGACGACTCTGAGCCAGTGCTCAAAGGAGTCAAATTACATTACACATAAACGAACTTATGGATTTGTTTATGAGAATCTTCACAATTGGAACTGTAACTTTGAAGCAAGGTGAAATCAAGGATGCTACTCCTTCAGATTTTGTTCGCGCTACTGCAACGATACCGATACAAGCCTCACTCCCTTTCGGATGGCTTATTGTTGGCGTTGCACTTCTTGCTGTTTTTCAGAGCGCTTCCAAAATCATAACCCTCAAAAAGAGATGGCAACTAGCACTCTCCAAGGGTGTTCACTTTGTTTGCAACTTGCTGTTGTTGTTTGTAACAGTTTACTCACACCTTTTGCTCGTTGCTGCTGGCCTTGAAGCCCCTTTTCTCTATCTTTATGCTTTAGTCTACTTCTTGCAGAGTATAAACTTTGTAAGAATAATAATGAGGCTTTGGCTTTGCTGGAAATGCCGTTCCAAAAACCCATTACTTTATGATGCCAACTATTTTCTTTGCTGGCATACTAATTGTTACGACTATTGTATACCTTACAATAGTGTAACTTCTTCAATTGTCATTACTTCAGGTGATGGCACAACAAGTCCTATTTCTGAACATGACTACCAGATTGGTGGTTATACTGAAAAATGGGAATCTGGAGTAAAAGACTGTGTTGTATTACACAGTTACTTCACTTCAGACTATTACCAGCTGTACTCAACTCAATTGAGTACAGACACTGGTGTTGAACATGTTACCTTCTTCATCTACAATAAAATTGTTGATGAGCCTGAAGAACATGTCCAAATTCACACAATCGACGTTTCATCCGGAGTTGTTAATCCAGTAATGGAACCAATTTATGATGAACCGACGACGACTACTAGCGTGCCTTTGTAAGCACAAGCTGATGAGTACGAACTTATGTACTCATTCGTTTCGGAAGAGACAGGTACGTTAATAGTTAATAGCGTACTTCTTTTTCTTGCTTTCGTGGTATTCTTGCTAGTTACACTAGCCATCCTTACTGCGCTTCGATTGTGTGCGTACTGCTGCAATATTGTTAACGTGAGTCTTGTAAAACCTTCTTTTTACGTTTACTCTCGTGTTAAAAATCTGAATTCTTCTAGAGTTCCTGATCTTCTGGTCTAAACGAACTAAATATTATATTAGTTTTTCTGTTTGGAACTTTAATTTTAGCCATGGCAGATTCCAACGGTACTATTACCGTTGAAGAGCTTAAAAAGCTCCTTGAACAATGGAACCTAGTAATAGGTTTCCTATTCCTTACATGGATTTGTCTTCTACAATTTGCCTATGCCAACAGGAATAGGTTTTTGTATATAATTAAGTTAATTTTCCTCTGGCTGTTATGGCCAGTAACTTTAGCTTGTTTTGTGCTTGCTGCTGTTTACAGAATAAATTGGATCACCGGTGGAATTGCTATCGCAATGGCTTGTCTTGTAGGCTTGATGTGGCTCAGCTACTTCATTGCTTCTTTCAGACTGTTTGCGCGTACGCGTTCCATGTGGTCATTCAATCCAGAAACTAACATTCTTCTCAACGTGCCACTCCATGGCACTATTCTGACCAGACCGCTTCTAGAAAGTGAACTCGTAATCGGAGCTGTGATCCTTCGTGGACATCTTCGTATTGCTGGACACCATCTAGGACGCTGTGACATCAAGGACCTGCCTAAAGAAATCACTGTTGCTACATCACGAACGCTTTCTTATTACAAATTGGGAGCTTCGCAGCGTGTAGCAGGTGACTCAGGTTTTGCTGCATACAGTCGCTACAGGATTGGCAACTATAAATTAAACACAGACCATTCCAGTAGCAGTGACAATATTGCTTTGCTTGTACAGTAAGTGACAACAGATGTTTCATCTCGTTGACTTTCAGGTTACTATAGCAGAGATATTACTAATTATTATGAGGACTTTTAAAGTTTCCATTTGGAATCTTGATTACATCATAAACCTCATAATTAAAAATTTATCTAAGTCACTAACTGAGAATAAATATTCTCAATTAGATGAAGAGCAACCAATGGAGATTGATTAAACGAACATGAAAATTATTCTTTTCTTGGCACTGATAACACTCGCTACTTGTGAGCTTTATCACTACCAAGAGTGTGTTAGAGGTACAACAGTACTTTTAAAAGAACCTTGCTCTTCTGGAACATACGAGGGCAATTCACCATTTCATCCTCTAGCTGATAACAAATTTGCACTGACTTGCTTTAGCACTCAATTTGCTTTTGCTTGTCCTGACGGCGTAAAACACGTCTATCAGTTACGTGCCAGATCAGTTTCACCTAAACTGTTCATCAGACAAGAGGAAGTTCAAGAACTTTACTCTCCAATTTTTCTTATTGTTGCGGCAATAGTGTTTATAACACTTTGCTTCACACTCAAAAGAAAGACAGAATGATTGAACTTTCATTAATTGACTTCTATTTGTGCTTTTTAGCCTTTCTGCTATTCCTTGTTTTAATTATGCTTATTATCTTTTGGTTCTCACTTGAACTGCAAGATCATAATGAAACTTGTCACGCCTAAACGAACATGAAATTTCTTGTTTTCTTAGGAATCATCACAACTGTAGCTGCATTTCACCAAGAATGTAGTTTACAGTCATGTACTCAACATCAACCATATGTAGTTGATGACCCGTGTCCTATTCACTTCTATTCTAAATGGTATATTAGAGTAGGAGCTAGAAAATCAGCACCTTTAATTGAATTGTGCGTGGATGAGGCTGGTTCTAAATCACCCATTCAGTACATCGATATCGGTAATTATACAGTTTCCTGTTTACCTTTTACAATTAATTGCCAGGAACCTAAATTGGGTAGTCTTGTAGTGCGTTGTTCGTTCTATGAAGACTTTTTAGAGTATCATGACGTTCGTGTTGTTTTAGATTTCATCTAAACGAACAAACTAAAATGTCTGATAATGGACCCCAAAATCAGCGAAATGCACCCCGCATTACGTTTGGTGGACCCTCAGATTCAACTGGCAGTAACCAGAATGGAGAACGCAGTGGGGCGCGATCAAAACAACGTCGGCCCCAAGGTTTACCCAATAATACTGCGTCTTGGTTCACCGCTCTCACTCAACATGGCAAGGAAGACCTTAAATTCCCTCGAGGACAAGGCGTTCCAATTAACACCAATAGCAGTCCAGATGACCAAATTGGCTACTACCGAAGAGCTACCAGACGAATTCGTGGTGGTGACGGTAAAATGAAAGATCTCAGTCCAAGATGGTATTTCTACTACCTAGGAACTGGGCCAGAAGCTGGACTTCCCTATGGTGCTAACAAAGACGGCATCATATGGGTTGCAACTGAGGGAGCCTTGAATACACCAAAAGATCACATTGGCACCCGCAATCCTGCTAACAATGCTGCAATCGTGCTACAACTTCCTCAAGGAACAACATTGCCAAAAGGCTTCTACGCAGAAGGGAGCAGAGGCGGCAGTCAAGCCTCTTCTCGTTCCTCATCACGTAGTCGCAACAGTTCAAGAAATTCAACTCCAGGCAGCAGTAGGGGAACTTCTCCTGCTAGAATGGCTGGCAATGGCGGTGATGCTGCTCTTGCTTTGCTGCTGCTTGACAGATTGAACCAGCTTGAGAGCAAAATGTCTGGTAAAGGCCAACAACAACAAGGCCAAACTGTCACTAAGAAATCTGCTGCTGAGGCTTCTAAGAAGCCTCGGCAAAAACGTACTGCCACTAAAGCATACAATGTAACACAAGCTTTCGGCAGACGTGGTCCAGAACAAACCCAAGGAAATTTTGGGGACCAGGAACTAATCAGACAAGGAACTGATTACAAACATTGGCCGCAAATTGCACAATTTGCCCCCAGCGCTTCAGCGTTCTTCGGAATGTCGCGCATTGGCATGGAAGTCACACCTTCGGGAACGTGGTTGACCTACACAGGTGCCATCAAATTGGATGACAAAGATCCAAATTTCAAAGATCAAGTCATTTTGCTGAATAAGCATATTGACGCATACAAAACATTCCCACCAACAGAGCCTAAAAAGGACAAAAAGAAGAAGGCTGATGAAACTCAAGCCTTACCGCAGAGACAGAAGAAACAGCAAACTGTGACTCTTCTTCCTGCTGCAGATTTGGATGATTTCTCCAAACAATTGCAACAATCCATGAGCAGTGCTGACTCAACTCAGGCCTAAACTCATGCAGACCACACAAGGCAGATGGGCTATATAAACGTTTTCGCTTTTCCGTTTACGATATATAGTCTACTCTTGTGCAGAATGAATTCTCGTAACTACATAGCACAAGTAGATGTAGTTAACTTTAATCTCACATAGCAATCTTTAATCAGTGTGTAACATTAGGGAGGACTTGAAAGAGCCACCACATTTTCACCGAGGCCACGCGGAGTACGATCGAGTGTACAGTGAACAATGCTAGGGAGAGCTGCCTATATGGAAGAGCCCTAATGTGTAAAATTAATTTTAGTAGTGCTATCCCCATGTGATTTTAATAGCTTCTTAGGAGAAT
서열번호: 10
>중증 급성 호흡기 증후군 코로나바이러스 2 orf1ab 다단백질의 단리물 hCoV-19/Italy/INMI1-isl/2020 (유전자은행 수탁번호: QIA98553)
MESLVPGFNEKTHVQLSLPVLQVRDVLVRGFGDSVEEVLSEARQHLKDGTCGLVEVEKGVLPQLEQPYVFIKRSDARTAPHGHVMVELVAELEGIQYGRSGETLGVLVPHVGEIPVAYRKVLLRKNGNKGAGGHSYGADLKSFDLGDELGTDPYEDFQENWNTKHSSGVTRELMRELNGGAYTRYVDNNFCGPDGYPLECIKDLLARAGKASCTLSEQLDFIDTKRGVYCCREHEHEIAWYTERSEKSYELQTPFEIKLAKKFDTFNGECPNFVFPLNSIIKTIQPRVEKKKLDGFMGRIRSVYPVASPNECNQMCLSTLMKCDHCGETSWQTGDFVKATCEFCGTENLTKEGATTCGYLPQNAVVKIYCPACHNSEVGPEHSLAEYHNESGLKTILRKGGRTIAFGGCVFSYVGCHNKCAYWVPRASANIGCNHTGVVGEGSEGLNDNLLEILQKEKVNINIVGDFKLNEEIAIILASFSASTSAFVETVKGLDYKAFKQIVESCGNFKVTKGKAKKGAWNIGEQKSILSPLYAFASEAARVVRSIFSRTLETAQNSVRVLQKAAITILDGISQYSLRLIDAMMFTSDLATNNLVVMAYITGGVVQLTSQWLTNIFGTVYEKLKPVLDWLEEKFKEGVEFLRDGWEIVKFISTCACEIVGGQIVTCAKEIKESVQTFFKLVNKFLALCADSIIIGGAKLKALNLGETFVTHSKGLYRKCVKSREETGLLMPLKAPKEIIFLEGETLPTEVLTEEVVLKTGDLQPLEQPTSEAVEAPLVGTPVCINGLMLLEIKDTEKYCALAPNMMVTNNTFTLKGGAPTKVTFGDDTVIEVQGYKSVNITFELDERIDKVLNEKCSAYTVELGTEVNEFACVVADAVIKTLQPVSELLTPLGIDLDEWSMATYYLFDESGEFKLASHMYCSFYPPDEDEEEGDCEEEEFEPSTQYEYGTEDDYQGKPLEFGATSAALQPEEEQEEDWLDDDSQQTVGQQDGSEDNQTTTIQTIVEVQPQLEMELTPVVQTIEVNSFSGYLKLTDNVYIKNADIVEEAKKVKPTVVVNAANVYLKHGGGVAGALNKATNNAMQVESDDYIATNGPLKVGGSCVLSGHNLAKHCLHVVGPNVNKGEDIQLLKSAYENFNQHEVLLAPLLSAGIFGADPIHSLRVCVDTVRTNVYLAVFDKNLYDKLVSSFLEMKSEKQVEQKIAEIPKEEVKPFITESKPSVEQRKQDDKKIKACVEEVTTTLEETKFLTENLLLYIDINGNLHPDSATLVSDIDITFLKKDAPYIVGDVVQEGVLTAVVIPTKKAGGTTEMLAKALRKVPTDNYITTYPGQGLNGYTVEEAKTVLKKCKSAFYILPSIISNEKQEILGTVSWNLREMLAHAEETRKLMPVCVETKAIVSTIQRKYKGIKIQEGVVDYGARFYFYTSKTTVASLINTLNDLNETLVTMPLGYVTHGLNLEEAARYMRSLKVPATVSVSSPDAVTAYNGYLTSSSKTPEEHFIETISLAGSYKDWSYSGQSTQLGIEFLKRGDKSVYYTSNPTTFHLDGEVITFDNLKTLLSLREVRTIKVFTTVDNINLHTQVVDMSMTYGQQFGPTYLDGADVTKIKPHNSHEGKTFYVLPNDDTLRVEAFEYYHTTDPSFLGRYMSALNHTKKWKYPQVNGLTSIKWADNNCYLATALLTLQQIELKFNPPALQDAYYRARAGEAANFCALILAYCNKTVGELGDVRETMSYLFQHANLDSCKRVLNVVCKTCGQQQTTLKGVEAVMYMGTLSYEQFKKGVQIPCTCGKQATKYLVQQESPFVMMSAPPAQYELKHGTFTCASEYTGNYQCGHYKHITSKETLYCIDGALLTKSSEYKGPITDVFYKENSYTTTIKPVTYKLDGVVCTEIDPKLDNYYKKDNSYFTEQPIDLVPNQPYPNASFDNFKFVCDNIKFADDLNQLTGYKKPASRELKVTFFPDLNGDVVAIDYKHYTPSFKKGAKLLHKPIVWHVNNATNKATYKPNTWCIRCLWSTKPVETSNSFDVLKSEDAQGMDNLACEDLKPVSEEVVENPTIQKDVLECNVKTTEVVGDIILKPANNSLKITEEVGHTDLMAAYVDNSSLTIKKPNELSRVLGLKTLATHGLAAVNSVPWDTIANYAKPFLNKVVSTTTNIVTRCLNRVCTNYMPYFFTLLLQLCTFTRSTNSRIKASMPTTIAKNTVKSVGKFCLEASFNYLKSPNFSKLINIIIWFLLLSVCLGSLIYSTAALGVLMSNLGMPSYCTGYREGYLNSTNVTIATYCTGSIPCSVCLSGLDSLDTYPSLETIQITISSFKWDLTAFGLVAEWFLAYILFTRFFYVLGLAAIMQLFFSYFAVHFISNSWLMWLIINLVQMAPISAMVRMYIFFASFYYVWKSYVHVVDGCNSSTCMMCYKRNRATRVECTTIVNGVRRSFYVYANGGKGFCKLHNWNCVNCDTFCAGSTFISDEVARDLSLQFKRPINPTDQSSYIVDSVTVKNGSIHLYFDKAGQKTYERHSLSHFVNLDNLRANNTKGSLPINVIVFDGKSKCEESSAKSASVYYSQLMCQPILLLDQALVSDVGDSAEVAVKMFDAYVNTFSSTFNVPMEKLKTLVATAEAELAKNVSLDNVLSTFISAARQGFVDSDVETKDVVECLKLSHQSDIEVTGDSCNNYMLTYNKVENMTPRDLGACIDCSARHINAQVAKSHNIALIWNVKDFMSLSEQLRKQIRSAAKKNNLPFKLTCATTRQVVNVVTTKIALKGGKIVNNWLKQLIKVTLVFLFVAAIFYLITPVHVMSKHTDFSSEIIGYKAIDGGVTRDIASTDTCFANKHADFDTWFSQRGGSYTNDKACPLIAAVITREVGFVVPGLPGTILRTTNGDFLHFLPRVFSAVGNICYTPSKLIEYTDFATSACVLAAECTIFKDASGKPVPYCYDTNVLEGSVAYESLRPDTRYVLMDGSIIQFPNTYLEGSVRVVTTFDSEYCRHGTCERSEAGVCVSTSGRWVLNNDYYRSLPGVFCGVDAVNLLTNMFTPLIQPIGALDISASIVAGGIVAIVVTCLAYYFMRFRRAFGEYSHVVAFNTLLFLMSFTVLCLTPVYSFLPGVYSVIYLYLTFYLTNDVSFLAHIQWMVMFTPLVPFWITIAYIICISTKHFYWFFSNYLKRRVVFNGVSFSTFEEAALCTFLLNKEMYLKLRSDVLLPLTQYNRYLALYNKYKYFSGAMDTTSYREAACCHLAKALNDFSNSGSDVLYQPPQTSITSAVLQSGFRKMAFPSGKVEGCMVQVTCGTTTLNGLWLDDVVYCPRHVICTSEDMLNPNYEDLLIRKSNHNFLVQAGNVQLRVIGHSMQNCVLKLKVDTANPKTPKYKFVRIQPGQTFSVLACYNGSPSGVYQCAMRPNFTIKGSFLNGSCGSVGFNIDYDCVSFCYMHHMELPTGVHAGTDLEGNFYGPFVDRQTAQAAGTDTTITVNVLAWLYAAVINGDRWFLNRFTTTLNDFNLVAMKYNYEPLTQDHVDILGPLSAQTGIAVLDMCASLKELLQNGMNGRTILGSALLEDEFTPFDVVRQCSGVTFQSAVKRTIKGTHHWLLLTILTSLLVLVQSTQWSLFFFXYENAFLPFAMGIIAMSAFAMMFVKHKHAFLCLFLLPSLATVAYFNMVYMPASWVMRIMTWLDMVDTSLSGFKLKDCVMYASAVVLLILMTARTVYDDGARRVWTLMNVLTLVYKVYYGNALDQAISMWALIISVTSNYSGVVTTVMFLARGIVFMCVEYCPIFFITGNTLQCIMLVYCFLGYFCTCYFGLFCLLNRYFRLTLGVYDYLVSTQEFRYMNSQGLLPPKNSIDAFKLNIKLLGVGGKPCIKVATVQSKMSDVKCTSVVLLSVLQQLRVESSSKLWAQCVQLHNDILLAKDTTEAFEKMVSLLSVLLSMQGAVDINKLCEEMLDNRATLQAIASEFSSLPSYAAFATAQEAYEQAVANGDSEVVLKKLKKSLNVAKSEFDRDAAMQRKLEKMADQAMTQMYKQARSEDKRAKVTSAMQTMLFTMLRKLDNDALNNIINNARDGCVPLNIIPLTTAAKLMVVIPDYNTYKNTCDGTTFTYASALWEIQQVVDADSKIVQLSEISMDNSPNLAWPLIVTALRANSAVKLQNNELSPVALRQMSCAAGTTQTACTDDNALAYYNTTKGGRFVLALLSDLQDLKWARFPKSDGTGTIYTELEPPCRFVTDTPKGPKVKYLYFIKGLNNLNRGMVLGSLAATVRLQAGNATEVPANSTVLSFCAFAVDAAKAYKDYLASGGQPITNCVKMLCTHTGTGQAITVTPEANMDQESFGGASCCLYCRCHIDHPNPKGFCDLKGKYVQIPTTCANDPVGFTLKNTVCTVCGMWKGYGCSCDQLREPMLQSADAQSFLNRVCGVSAARLTPCGTGTSTDVVYRAFDIYNDKVAGFAKFLKTNCCRFQEKDEDDNLIDSYFVVKRHTFSNYQHEETIYNLLKDCPAVAKHDFFKFRIDGDMVPHISRQRLTKYTMADLVYALRHFDEGNCDTLKEILVTYNCCDDDYFNKKDWYDFVENPDILRVYANLGERVRQALLKTVQFCDAMRNAGIVGVLTLDNQDLNGNWYDFGDFIQTTPGSGVPVVDSYYSLLMPILTLTRALTAESHVDTDLTKPYIKWDLLKYDFTEERLKLFDRYFKYWDQTYHPNCVNCLDDRCILHCANFNVLFSTVFPPTSFGPLVRKIFVDGVPFVVSTGYHFRELGVVHNQDVNLHSSRLSFKELLVYAADPAMHAASGNLLLDKRTTCFSVAALTNNVAFQTVKPGNFNKDFYDFAVSKGFFKEGSSVELKHFFFAQDGNAAISDYDYYRYNLPTMCDIRQLLFVVEVVDKYFDCYDGGCINANQVIVNNLDKSAGFPFNKWGKARLYYDSMSYEDQDALFAYTKRNVIPTITQMNLKYAISAKNRARTVAGVSICSTMTNRQFHQKLLKSIAATRGATVVIGTSKFYGGWHNMLKTVYSDVENPHLMGWDYPKCDRAMPNMLRIMASLVLARKHTTCCSLSHRFYRLANECAQVLSEMVMCGGSLYVKPGGTSSGDATTAYANSVFNICQAVTANVNALLSTDGNKIADKYVRNLQHRLYECLYRNRDVDTDFVNEFYAYLRKHFSMMILSDDAVVCFNSTYASQGLVASIKNFKSVLYYQNNVFMSEAKCWTETDLTKGPHEFCSQHTMLVKQGDDYVYLPYPDPSRILGAGCFVDDIVKTDGTLMIERFVSLAIDAYPLTKHPNQEYADVFHLYLQYIRKLHDELTGHMLDMYSVMLTNDNTSRYWEPEFYEAMYTPHTVLQAVGACVLCNSQTSLRCGACIRRPFLCCKCCYDHVISTSHKLVLSVNPYVCNAPGCDVTDVTQLYLGGMSYYCKSHKPPISFPLCANGQVFGLYKNTCVGSDNVTDFNAIATCDWTNAGDYILANTCTERLKLFAAETLKATEETFKLSYGIATVREVLSDRELHLSWEVGKPRPPLNRNYVFTGYRVTKNSKVQIGEYTFEKGDYGDAVVYRGTTTYKLNVGDYFVLTSHTVMPLSAPTLVPQEHYVRITGLYPTLNISDEFSSNVANYQKVGMQKYSTLQGPPGTGKSHFAIGLALYYPSARIVYTACSHAAVDALCEKALKYLPIDKCSRIIPARARVECFDKFKVNSTLEQYVFCTVNALPETTADIVVFDEISMATNYDLSVVNARLRAKHYVYIGDPAQLPAPRTLLTKGTLEPEYFNSVCRLMKTIGPDMFLGTCRRCPAEIVDTVSALVYDNKLKAHKDKSAQCFKMFYKGVITHDVSSAINRPQIGVVREFLTRNPAWRKAVFISPYNSQNAVASKILGLPTQTVDSSQGSEYDYVIFTQTTETAHSCNVNRFNVAITRAKVGILCIMSDRDLYDKLQFTSLEIPRRNVATLQAENVTGLFKDCSKVITGLHPTQAPTHLSVDTKFKTEGLCVDIPGIPKDMTYRRLISMMGFKMNYQVNGYPNMFITREEAIRHVRAWIGFDVEGCHATREAVGTNLPLQLGFSTGVNLVAVPTGYVDTPNNTDFSRVSAKPPPGDQFKHLIPLMYKGLPWNVVRIKIVQMLSDTLKNLSDRVVFVLWAHGFELTSMKYFVKIGPERTCCLCDRRATCFSTASDTYACWHHSIGFDYVYNPFMIDVQQWGFTGNLQSNHDLYCQVHGNAHVASCDAIMTRCLAVHECFVKRVDWTIEYPIIGDELKINAACRKVQHMVVKAALLADKFPVLHDIGNPKAIKCVPQADVEWKFYDAQPCSDKAYKIEELFYSYATHSDKFTDGVCLFWNCNVDRYPANSIVCRFDTRVLSNLNLPGCDGGSLYVNKHAFHTPAFDKSAFVNLKQLPFFYYSDSPCESHGKQVVSDIDYVPLKSATCITRCNLGGAVCRHHANEYRLYLDAYNMMISAGFSLWVYKQFDTYNLWNTFTRLQSLENVAFNVVNKGHFDGQQGEVPVSIINNTVYTKVDGVDVELFENKTTLPVNVAFELWAKRNIKPVPEVKILNNLGVDIAANTVIWDYKRDAPAHISTIGVCSMTDIAKKPTETICAPLTVFFDGRVDGQVDLFRNARNGVLITEGSVKGLQPSVGPKQASLNGVTLIGEAVKTQFNYYKKVDGVVQQLPETYFTQSRNLQEFKPRSQMEIDFLELAMDEFIERYKLEGYAFEHIVYGDFSHSQLGGLHLLIGLAKRFKESPFELEDFIPMDSTVKNYFITDAQTGSSKCVCSVIDLLLDDFVEIIKSQDLSVVSKVVKVTIDYTEISFMLWCKDGHVETFYPKLQSSQAWQPGVAMPNLYKMQRMLLEKCDLQNYGDSATLPKGIMMNVAKYTQLCQYLNTLTLAVPYNMRVIHFGAGSDKGVAPGTAVLRQWLPTGTLLVDSDLNDFVSDADSTLIGDCATVHTANKWDLIISDMYDPKTKNVTKENDSKEGFFTYICGFIQQKLALGGSVAIKITEHSWNADLYKLMGHFAWWTAFVTNVNASSSEAFLIGCNYLGKPREQIDGYVMHANYIFWRNTNPIQLSSYSLFDMSKFPLKLRGTAVMSLKE
서열번호: 11
>단백질\S_2019-nCoV/Italy-INMI1 (S단백질_hCoV19ItalyINMI1isl2020)(유전자은행 수탁번호: QIA98554)
MFVFLVLLPLVSSQCVNLTTRTQLPPAYTNSFTRGVYYPDKVFRSSVLHSTQDLFLPFFSNVTWFHAIHVSGTNGTKRFDNPVLPFNDGVYFASTEKSNIIRGWIFGTTLDSKTQSLLIVNNATNVVIKVCEFQFCNDPFLGVYYHKNNKSWMESEFRVYSSANNCTFEYVSQPFLMDLEGKQGNFKNLREFVFKNIDGYFKIYSKHTPINLVRDLPQGFSALEPLVDLPIGINITRFQTLLALHRSYLTPGDSSSGWTAGAAAYYVGYLQPRTFLLKYNENGTITDAVDCALDPLSETKCTLKSFTVEKGIYQTSNFRVQPTESIVRFPNITNLCPFGEVFNATRFASVYAWNRKRISNCVADYSVLYNSASFSTFKCYGVSPTKLNDLCFTNVYADSFVIRGDEVRQIAPGQTGKIADYNYKLPDDFTGCVIAWNSNNLDSKVGGNYNYLYRLFRKSNLKPFERDISTEIYQAGSTPCNGVEGFNCYFPLQSYGFQPTNGVGYQPYRVVVLSFELLHAPATVCGPKKSTNLVKNKCVNFNFNGLTGTGVLTESNKKFLPFQQFGRDIADTTDAVRDPQTLEILDITPCSFGGVSVITPGTNTSNQVAVLYQDVNCTEVPVAIHADQLTPTWRVYSTGSNVFQTRAGCLIGAEHVNNSYECDIPIGAGICASYQTQTNSPRRARSVASQSIIAYTMSLGAENSVAYSNNSIAIPTNFTISVTTEILPVSMTKTSVDCTMYICGDSTECSNLLLQYGSFCTQLNRALTGIAVEQDKNTQEVFAQVKQIYKTPPIKDFGGFNFSQILPDPSKPSKRSFIEDLLFNKVTLADAGFIKQYGDCLGDIAARDLICAQKFNGLTVLPPLLTDEMIAQYTSALLAGTITSGWTFGAGAALQIPFAMQMAYRFNGIGVTQNVLYENQKLIANQFNSAIGKIQDSLSSTASALGKLQDVVNQNAQALNTLVKQLSSNFGAISSVLNDILSRLDKVEAEVQIDRLITGRLQSLQTYVTQQLIRAAEIRASANLAATKMSECVLGQSKRVDFCGKGYHLMSFPQSAPHGVVFLHVTYVPAQEKNFTTAPAICHDGKAHFPREGVFVSNGTHWFVTQRNFYEPQIITTDNTFVSGNCDVVIGIVNNTVYDPLQPELDSFKEELDKYFKNHTSPDVDLGDISGINASVVNIQKEIDRLNEVAKNLNESLIDLQELGKYEQYIKWPWYIWLGFIAGLIAIVMVTIMLCCMTSCCSCLKGCCSCGSCCKFDEDDSEPVLKGVKLHYT
서열번호: 12
>hCoV-19/France/IDF0372-isl/2020|EPI_ISL_410720|2020-01-23
ATTAAAGGTTTATACCTTCCCAGGTAACAAACCAACCAACTTTCGATCTCTTGTAGATCTGTTCTCTAAACGAACTTTAAAATCTGTGTGGCTGTCACTCGGCTGCATGCTTAGTGCACTCACGCAGTATAATTAATAACTAATTACTGTCGTTGACAGGACACGAGTAACTCGTCTATCTTCTGCAGGCTGCTTACGGTTTCGTCCGTGTTGCAGCCGATCATCAGCACATCTAGGTTTCGTCCGGGTGTGACCGAAAGGTAAGATGGAGAGCCTTGTCCCTGGTTTCAACGAGAAAACACACGTCCAACTCAGTTTGCCTGTTTTACAGGTTCGCGACGTGCTCGTACGTGGCTTTGGAGACTCCGTGGAGGAGGTCTTATCAGAGGCACGTCAACATCTTAAAGATGGCACTTGTGGCTTAGTAGAAGTTGAAAAAGGCGTTTTGCCTCAACTTGAACAGCCCTATGTGTTCATCAAACGTTCGGATGCTCGAACTGCACCTCATGGTCATGTTATGGTTGAGCTGGTAGCAGAACTCGAAGGCATTCAGTACGGTCGTAGTGGTGAGACACTTGGTGTCCTTGTCCCTCATGTGGGCGAAATACCAGTGGCTTACCGCAAGGTTCTTCTTCGTAAGAACGGTAATAAAGGAGCTGGTGGCCATAGTTACGGCGCCGATCTAAAGTCATTTGACTTAGGCGACGAGCTTGGCACTGATCCTTATGAAGATTTTCAAGAAAACTGGAACACTAAACATAGCAGTGGTGTTACCCGTGAACTCATGCGTGAGCTTAACGGAGGGGCATACACTCGCTATGTCGATAACAACTTCTGTGGCCCTGATGGCTACCCTCTTGAGTGCATTAAAGACCTTCTAGCACGTGCTGGTAAAGCTTCATGCACTTTGTCCGAACAACTGGACTTTATTGACACTAAGAGGGGTGTATACTGCTGCCGTGAACATGAGCATGAAATTGCTTGGTACACGGAACGTTCTGAAAAGAGCTATGAATTGCAGACACCTTTTGAAATTAAATTGGCAAAGAAATTTGACACCTTCAATGGGGAATGTCCAAATTTTGTATTTCCCTTAAATTCCATAATCAAGACTATTCAACCAAGGGTTGAAAAGAAAAAGCTTGATGGCTTTATGGGTAGAATTCGATCTGTCTATCCAGTTGCGTCACCAAATGAATGCAACCAAATGTGCCTTTCAACTCTCATGAAGTGTGATCATTGTGGTGAAACTTCATGGCAGACGGGCGATTTTGTTAAAGCCACTTGCGAATTTTGTGGCACTGAGAATTTGACTAAAGAAGGTGCCACTACTTGTGGTTACTTACCCCAAAATGCTGTTGTTAAAATTTATTGTCCAGCATGTCACAATTCAGAAGTAGGACCTGAGCATAGTCTTGCCGAATACCATAATGAATCTGGCTTGAAAACCATTCTTCGTAAGGGTGGTCGCACTATTGCCTTTGGAGGCTGTGTGTTCTCTTATGTTGGTTGCCATAACAAGTGTGCCTATTGGGTTCCACGTGCTAGCGCTAACATAGGTTGTAACCATACAGGTGTTGTTGGAGAAGGTTCCGAAGGTCTTAATGACAACCTTCTTGAAATACTCCAAAAAGAGAAAGTCAACATCAATATTGTTGGTGACTTTAAACTTAATGAAGAGATCGCCATTATTTTGGCATCTTTTTCTGCTTCCACAAGTGCTTTTGTGGAAACTGTGAAAGGTTTGGATTATAAAGCATTCAAACAAATTGTTGAATCCTGTGGTAATTTTAAAGTTACAAAAGGAAAAGCTAAAAAAGGTGCCTGGAATATTGGTGAACAGAAATCAATACTGAGTCCTCTTTATGCATTTGCATCAGAGGCTGCTCGTGTTGTACGATCAATTTTCTCCCGCACTCTTGAAACTGCTCAAAATTCTGTGCGTGTTTTACAGAAGGCCGCTATAACAATACTAGATGGAATTTCACAGTATTCACTGAGACTCATTGATGCTATGATGTTCACATCTGATTTGGCTACTAACAATCTAGTTGTAATGGCCTACATTACAGGTGGTGTTGTTCAGTTGACTTCGCAGTGGCTAACTAACATCTTTGGCACTGTTTATGAAAAACTCAAACCCGTCCTTGATTGGCTTGAAGAGAAGTTTAAGGAAGGTGTAGAGTTTCTTAGAGACGGTTGGGAAATTGTTAAATTTATCTCAACCTGTGCTTGTGAAATTGTCGGTGGACAAATTGTCACCTGTGCAAAGGAAATTAAGGAGAGTGTTCAGACATTCTTTAAGCTTGTAAATAAATTTTTGGCTTTGTGTGCTGACTCTATCATTATTGGTGGAGCTAAACTTAAAGCCTTGAATTTAGGTGAAACATTTGTCACGCACTCAAAGGGATTGTACAGAAAGTGTGTTAAATCCAGAGAAGAAACTGGCCTACTCATGCCTCTAAAAGCCCCAAAAGAAATTATCTTCTTAGAGGGAGAAACACTTCCCACAGAAGTGTTAACAGAGGAAGTTGTCTTGAAAACTGGTGATTTACAACCATTAGAACAACCTACTAGTGAAGCTGTTGAAGCTCCATTGGTTGGTACACCAGTTTGTATTAACGGGCTTATGTTGCTCGAAATCAAAGACACAGAAAAGTACTGTGCCCTTGCACCTAATATGATGGTAACAAACAATACCTTCACACTCAAAGGCGGTGCACCAACAAAGGTTACTTTTGGTGATGACACTGTGATAGAAGTGCAAGGTTACAAGAGTGTGAATATCACTTTTGAACTTGATGAAAGGATTGATAAAGTACTTAATGAGAAGTGCTCTGCCTATACAGTTGAACTCGGTACAGAAGTAAATGAGTTCGCCTGTGTTGTGGCAGATGCTGTCATAAAAACTTTGCAACCAGTATCTGAATTACTTACACCACTGGGCATTGATTTAGATGAGTGGAGTATGGCTACATACTACTTATTTGATGAGTCTGGTGAGTTTAAATTGGCTTCACATATGTATTGTTCTTTCTACCCTCCAGATGAGGATGAAGAAGAAGGTGATTGTGAAGAAGAAGAGTTTGAGCCATCAACTCAATATGAGTATGGTACTGAAGATGATTACCAAGGTAAACCTTTGGAATTTGGTGCCACTTCTGCTGCTCTTCAACCTGAAGAAGAGCAAGAAGAAGATTGGTTAGATGATGATAGTCAACAAACTGTTGGTCAACAAGACGGCAGTGAGGACAATCAGACAACTACTATTCAAACAATTGTTGAGGTTCAACCTCAATTAGAGATGGAACTTACACCAGTTGTTCAGACTATTGAAGTGAATAGTTTTAGTGGTTATTTAAAACTTACTGACAATGTATACATTAAAAATGCAGACATTGTGGAAGAAGCTAAAAAGGTAAAACCAACAGTGGTTGTTAATGCAGCCAATGTTTACCTTAAACATGGAGGAGGTGTTGCAGGAGCCTTAAATAAGGCTACTAACAATGCCATGCAAGTTGAATCTGATGATTACATAGCTACTAATGGACCACTTAAAGTGGGTGGTAGTTGTGTTTTAAGCGGACACAATCTTGCTAAACACTGTCTTCATGTTGTCGGCCCAAATGTTAACAAAGGTGAAGACATTCAACTTCTTAAGAGTGCTTATGAAAATTTTAATCAGCACGAAGTTCTACTTGCACCATTATTATCAGCTGGTATTTTTGGTGCTGACCCTATACATTCTTTAAGAGTTTGTGTAGATACTGTTCGCACAAATGTCTACTTAGCTGTCTTTGATAAAAATCTCTATGACAAACTTGTTTCAAGCTTTTTGGAAATGAAGAGTGAAAAGCAAGTTGAACAAAAGATCGCTGAGATTCCTAAAGAGGAAGTTAAGCCATTTATAACTGAAAGTAAACCTTCAGTTGAACAGAGAAAACAAGATGATAAGAAAATCAAAGCTTGTGTTGAAGAAGTTACAACAACTCTGGAAGAAACTAAGTTCCTCACAGAAAACTTGTTACTTTATATTGACATTAATGGCAATCTTCATCCAGATTCTGCCACTCTTGTTAGTGACATTGACATCACTTTCTTAAAGAAAGATGCTCCATATATAGTGGGTGATGTTGTTCAAGAGGGTGTTTTAACTGCTGTGGTTATACCTACTAAAAAGGCTGGTGGCACTACTGAAATGCTAGCGAAAGCTTTGAGAAAAGTGCCAACAGACAATTATATAACCACTTACCCGGGTCAGGGTTTAAATGGTTACACTGTAGAGGAGGCAAAGACAGTGCTTAAAAAGTGTAAAAGTGCCTTTTACATTCTACCATCTATTATCTCTAATGAGAAGCAAGAAATTCTTGGAACTGTTTCTTGGAATTTGCGAGAAATGCTTGCACATGCAGAAGAAACACGCAAATTAATGCCTGTCTGTGTGGAAACTAAAGCCATAGTTTCAACTATACAGCGTAAATATAAGGGTATTAAAATACAAGAGGGTGTGGTTGATTATGGTGCTAGATTTTACTTTTACACCAGTAAAACAACTGTAGCGTCACTTATCAACACACTTAACGATCTAAATGAAACTCTTGTTACAATGCCACTTGGCTATGTAACACATGGCTTAAATTTGGAAGAAGCTGCTCGGTATATGAGATCTCTCAAAGTGCCAGCTACAGTTTCTGTTTCTTCACCTGATGCTGTTACAGCGTATAATGGTTATCTTACTTCTTCTTCTAAAACACCTGAAGAACATTTTATTGAAACCATCTCACTTGCTGGTTCCTATAAAGATTGGTCCTATTCTGGACAATCTACACAACTAGGTATAGAATTTCTTAAGAGAGGTGATAAAAGTGTATATTACACTAGTAATCCTACCACATTCCACCTAGATGGTGAAGTTATCACCTTTGACAATCTTAAGACACTTCTTTCTTTGAGAGAAGTGAGGACTATTAAGGTGTTTACAACAGTAGACAACATTAACCTCCACACGCAAGTTGTGGACATGTCAATGACATATGGACAACAGTTTGGTCCAACTTATTTGGATGGAGCTGATGTTACTAAAATAAAACCTCATAATTCACATGAAGGTAAAACATTTTATGTTTTACCTAATGATGACACTCTACGTGTTGAGGCTTTTGAGTACTACCACACAACTGATCCTAGTTTTCTGGGTAGGTACATGTCAGCATTAAATCACACTAAAAAGTGGAAATACCCACAAGTTAATGGTTTAACTTCTATTAAATGGGCAGATAACAACTGTTATCTTGCCACTGCATTGTTAACACTCCAACAAATAGAGTTGAAGTTTAATCCACCTGCTCTACAAGATGCTTATTACAGAGCAAGGGCTGGTGAAGCTGCTAACTTTTGTGCACTTATCTTAGCCTACTGTAATAAGACAGTAGGTGAGTTAGGTGATGTTAGAGAAACAATGAGTTACTTGTTTCAACATGCCAATTTAGATTCTTGCAAAAGAGTCTTGAACGTGGTGTGTAAAACTTGTGGACAACAGCAGACAACCCTTAAGGGTGTAGAAGCTGTTATGTACATGGGCACACTTTCTTATGAACAATTTAAGAAAGGTGTTCAGATACCTTGTACGTGTGGTAAACAAGCTACAAAATATCTAGTACAACAGGAGTCACCTTTTGTTATGATGTCAGCACCACCTGCTCAGTATGAACTTAAGCATGGTACATTTACTTGTGCTAGTGAGTACACTGGTAATTACCAGTGTGGTCACTATAAACATATAACTTCTAAAGAAACTTTGTATTGCATAGACGGTGCTTTACTTACAAAGTCCTCAGAATACAAAGGTCCTATTACGGATGTTTTCTACAAAGAAAACAGTTACACAACAACCATAAAACCAGTTACTTATAAATTGGATGGTGTTGTTTGTACAGAAATTGACCCTAAGTTGGACAATTATTATAAGAAAGACAATTCTTATTTCACAGAGCAACCAATTGATCTTGTACCAAACCAACCATATCCAAACGCAAGCTTCGATAATTTTAAGTTTGTATGTGATAATATCAAATTTGCTGATGATTTAAACCAGTTAACTGGTTATAAGAAACCTGCTTCAAGAGAGCTTAAAGTTACATTTTTCCCTGACTTAAATGGTGATGTGGTGGCTATTGATTATAAACACTACACACCCTCTTTTAAGAAAGGAGCTAAATTGTTACATAAACCTATTGTTTGGCATGTTAACAATGCAACTAATAAAGCCACGTATAAACCAAATACCTGGTGTATACGTTGTCTTTGGAGCACAAAACCAGTTGAAACATCAAATTCGTTTGATGTACTGAAGTCAGAGGACGCGCAGGGAATGGATAATCTTGCCTGCGAAGATCTAAAACCAGTCTCTGAAGAAGTAGTGGAAAATCCTACCATACAGAAAGACGTTCTTGAGTGTAATGTGAAAACTACCGAAGTTGTAGGAGACATTATACTTAAACCAGCAAATAATAGTTTAAAAATTACAGAAGAGGTTGGCCACACAGATCTAATGGCTGCTTATGTAGACAATTCTAGTCTTACTATTAAGAAACCTAATGAATTATCTAGAGTATTAGGTTTGAAAACCCTTGCTACTCATGGTTTAGCTGCTGTTAATAGTGTCCCTTGGGATACTATAGCTAATTATGCTAAGCCTTTTCTTAACAAAGTTGTTAGTACAACTACTAACATAGTTACACGGTGTTTAAACCGTGTTTGTACTAATTATATGCCTTATTTCTTTACTTTATTGCTACAATTGTGTACTTTTACTAGAAGTACAAATTCTAGAATTAAAGCATCTATGCCGACTACTATAGCAAAGAATACTGTTAAGAGTGTCGGTAAATTTTGTCTAGAGGCTTCATTTAATTATTTGAAGTCACCTAATTTTTCTAAACTGATAAATATTATAATTTGGTTTTTACTATTAAGTGTTTGCCTAGGTTCTTTAATCTACTCAACCGCTGCTTTAGGTGTTTTAATGTCTAATTTAGGCATGCCTTCTTACTGTACTGGTTACAGAGAAGGCTATTTGAACTCTACTAATGTCACTATTGCAACCTACTGTACTGGTTCTATACCTTGTAGTGTTTGTCTTAGTGGTTTAGATTCTTTAGACACCTATCCTTCTTTAGAAACTATACAAATTACCATTTCATCTTTTAAATGGGATTTAACTGCTTTTGGCTTAGTTGCAGAGTGGTTTTTGGCATATATTCTTTTCACTAGGTTTTTCTATGTACTTGGATTGGCTGCAATCATGCAATTGTTTTTCAGCTATTTTGCAGTACATTTTATTAGTAATTCTTGGCTTATGTGGTTAATAATTAATCTTGTACAAATGGCCCCGATTTCAGCTATGGTTAGAATGTACATCTTCTTTGCATCATTTTATTATGTATGGAAAAGTTATGTGCATGTTGTAGACGGTTGTAATTCATCAACTTGTATGATGTGTTACAAACGTAATAGAGCAACAAGAGTCGAATGTACAACTATTGTTAATGGTGTTAGAAGGTCCTTTTATGTCTATGCTAATGGAGGTAAAGGCTTTTGCAAACTACACAATTGGAATTGTGTTAATTGTGATACATTCTGTGCTGGTAGTACATTTATTAGTGATGAAGTTGCGAGAGACTTGTCACTACAGTTTAAAAGACCAATAAATCCTACTGACCAGTCTTCTTACATCGTTGATAGTGTTACAGTGAAGAATGGTTCCATCCATCTTTACTTTGATAAAGCTGGTCAAAAGACTTATGAAAGACATTCTCTCTCTCATTTTGTTAACTTAGACAACCTGAGAGCTAATAACACTAAAGGTTCATTGCCTATTAATGTTATAGTTTTTGATGGTAAATCAAAATGTGAAGAATCATCTGCAAAATCAGCGTCTGTTTACTACAGTCAGCTTATGTGTCAACCTATACTGTTACTAGATCAGGCATTAGTGTCTGATGTTGGTGATAGTGCGGAAGTTGCAGTTAAAATGTTTGATGCTTACGTTAATACGTTTTCATCAACTTTTAACGTACCAATGGAAAAACTCAAAACACTAGTTGCAACTGCAGAAGCTGAACTTGCAAAGAATGTGTCCTTAGACAATGTCTTATCTACTTTTATTTCAGCAGCTCGGCAAGGGTTTGTTGATTCAGATGTAGAAACTAAAGATGTTGTTGAATGTCTTAAATTGTCACATCAATCTGACATAGAAGTTACTGGCGATAGTTGTAATAACTATATGCTCACCTATAACAAAGTTGAAAACATGACACCCCGTGACCTTGGTGCTTGTATTGACTGTAGTGCGCGTCATATTAATGCGCAGGTAGCAAAAAGTCACAACATTGCTTTGATATGGAACGTTAAAGATTTCATGTCATTGTCTGAACAACTACGAAAACAAATACGTAGTGCTGCTAAAAAGAATAACTTACCTTTTAAGTTGACATGTGCAACTACTAGACAAGTTGTTAATGTTGTAACAACAAAGATAGCACTTAAGGGTGGTAAAATTGTTAATAATTGGTTGAAGCAGTTAATTAAAGTTACACTTGTGTTCCTTTTTGTTGCTGCTATTTTCTATTTAATAACACCTGTTCATGTCATGTCTAAACATACTGACTTTTCAAGTGAAATCATAGGATACAAGGCTATTGATGGTGGTGTCACTCGTGACATAGCATCTACAGATACTTGTTTTGCTAACAAACATGCTGATTTTGACACATGGTTTAGCCAGCGTGGTGGTAGTTATACTAATGACAAAGCTTGCCCATTGATTGCTGCAGTCATAACAAGAGAAGTGGGTTTTGTCGTGCCTGGTTTGCCTGGCACGATATTACGCACAACTAATGGTGACTTTTTGCATTTCTTACCTAGAGTTTTTAGTGCAGTTGGTAACATCTGTTACACACCATCAAAACTTATAGAGTACACTGACTTTGCAACATCAGCTTGTGTTTTGGCTGCTGAATGTACAATTTTTAAAGATGCTTCTGGTAAGCCAGTACCATATTGTTATGATACCAATGTACTAGAAGGTTCTGTTGCTTATGAAAGTTTACGCCCTGACACACGTTATGTGCTCATGGATGGCTCTATTATTCAATTTCCTAACACCTACCTTGAAGGTTCTGTTAGAGTGGTAACAACTTTTGATTCTGAGTACTGTAGGCACGGCACTTGTGAAAGATCAGAAGCTGGTGTTTGTGTATCTACTAGTGGTAGATGGGTACTTAACAATGATTATTACAGATCTTTACCAGGAGTTTTCTGTGGTGTAGATGCTGTAAATTTACTTACTAATATGTTTACACCACTAATTCAACCTATTGGTGCTTTGGACATATCAGCATCTATAGTAGCTGGTGGTATTGTAGCTATCGTAGTAACATGCCTTGCCTACTATTTTATGAGGTTTAGAAGAGCTTTTGGTGAATACAGTCATGTAGTTGCCTTTAATACTTTACTATTCCTTATGTCATTCACTGTACTCTGTTTAACACCAGTTTACTCATTCTTACCTGGTGTTTATTCTGTTATTTACTTGTACTTGACATTTTATCTTACTAATGATGTTTCTTTTTTAGCACATATTCAGTGGATGGTTATGTTCACACCTTTAGTACCTTTCTGGATAACAATTGCTTATATCATTTGTATTTCCACAAAGCATTTCTATTGGTTCTTTAGTAATTACCTAAAGAGACGTGTAGTCTTTAATGGTGTTTCCTTTAGTACTTTTGAAGAAGCTGCGCTGTGCACCTTTTTGTTAAATAAAGAAATGTATCTAAAGTTGCGTAGTGATGTGCTATTACCTCTTACGCAATATAATAGATACTTAGCTCTTTATAATAAGTACAAGTATTTTAGTGGAGCAATGGATACAACTAGCTACAGAGAAGCTGCTTGTTGTCATCTCGCAAAGGCTCTCAATGACTTCAGTAACTCAGGTTCTGATGTTCTTTACCAACCACCACAAACCTCTATCACCTCAGCTGTTTTGCAGAGTGGTTTTAGAAAAATGGCATTCCCATCTGGTAAAGTTGAGGGTTGTATGGTACAAGTAACTTGTGGTACAACTACACTTAACGGTCTTTGGCTTGATGACGTAGTTTACTGTCCAAGACATGTGATCTGCACCTCTGAAGACATGCTTAACCCTAATTATGAAGATTTACTCATTCGTAAGTCTAATCATAATTTCTTGGTACAGGCTGGTAATGTTCAACTCAGGGTTATTGGACATTCTATGCAAAATTGTGTACTTAAGCTTAAGGTTGATACAGCCAATCCTAAGACACCTAAGTATAAGTTTGTTCGCATTCAACCAGGACAGACTTTTTCAGTGTTAGCTTGTTACAATGGTTCACCATCTGGTGTTTACCAATGTGCTATGAGGCCCAATTTCACTATTAAGGGTTCATTCCTTAATGGTTCATGTGGTAGTGTTGGTTTTAACATAGATTATGACTGTGTCTCTTTTTGTTACATGCACCATATGGAATTACCAACTGGAGTTCATGCTGGCACAGACTTAGAAGGTAACTTTTATGGACCTTTTGTTGACAGGCAAACAGCACAAGCAGCTGGTACGGACACAACTATTACAGTTAATGTTTTAGCTTGGTTGTACGCTGCTGTTATAAATGGAGACAGGTGGTTTCTCAATCGATTTACCACAACTCTTAATGACTTTAACCTTGTGGCTATGAAGTACAATTATGAACCTCTAACACAAGACCATGTTGACATACTAGGACCTCTTTCTGCTCAAACTGGAATTGCCGTTTTAGATATGTGTGCTTCATTAAAAGAATTACTGCAAAATGGTATGAATGGACGTACCATATTGGGTAGTGCTTTATTAGAAGATGAATTTACACCTTTTGATGTTGTTAGACAATGCTCAGGTGTTACTTTCCAAAGTGCAGTGAAAAGAACAATCAAGGGTACACACCACTGGTTGTTACTCACAATTTTGACTTCACTTTTAGTTTTAGTCCAGAGTACTCAATGGTCTTTGTTCTTTTTTTTGTATGAAAATGCCTTTTTACCTTTTGCTATGGGTATTATTGCTATGTCTGCTTTTGCAATGATGTTTGTCAAACATAAGCATGCATTTCTCTGTTTGTTTTTGTTACCTTCTCTTGCCACTGTAGCTTATTTTAATATGGTCTATATGCCTGCTAGTTGGGTGATGCGTATTATGACATGGTTGGATATGGTTGATACTAGTTTGTCTGGTTTTAAGCTAAAAGACTGTGTTATGTATGCATCAGCTGTAGTGTTACTAATCCTTATGACAGCAAGAACTGTGTATGATGATGGTGCTAGGAGAGTGTGGACACTTATGAATGTCTTGACACTCGTTTATAAAGTTTATTATGGTAATGCTTTAGATCAAGCCATTTCCATGTGGGCTCTTATAATCTCTGTTACTTCTAACTACTCAGGTGTAGTTACAACTGTCATGTTTTTGGCCAGAGGTATTGTTTTTATGTGTGTTGAGTATTGCCCTATTTTCTTCATAACTGGTAATACACTTCAGTGTATAATGCTAGTTTATTGTTTCTTAGGCTATTTTTGTACTTGTTACTTTGGCCTCTTTTGTTTACTCAACCGCTACTTTAGACTGACTCTTGGTGTTTATGATTACTTAGTTTCTACACAGGAGTTTAGATATATGAATTCACAGGGACTACTCCCACCCAAGAATAGCATAGATGCCTTCAAACTCAACATTAAATTGTTGGGTGTTGGTGGCAAACCTTGTATCAAAGTAGCCACTGTACAGTCTAAAATGTCAGATGTAAAGTGCACATCAGTAGTCTTACTCTCAGTTTTGCAACAACTCAGAGTAGAATCATCATCTAAATTGTGGGCTCAATGTGTCCAGTTACACAATGACATTCTCTTAGCTAAAGATACTACTGAAGCCTTTGAAAAAATGGTTTCACTACTTTCTGTTTTGCTTTCCATGCAGGGTGCTGTAGACATAAACAAGCTTTGTGAAGAAATGCTGGACAACAGGGCAACCTTACAAGCTATAGCCTCAGAGTTTAGTTCCCTTCCATCATATGCAGCTTTTGCTACTGCTCAAGAAGCTTATGAGCAGGCTGTTGCTAATGGTGATTCTGAAGTTGTTCTTAAAAAGTTGAAGAAGTCTTTGAATGTGGCTAAATCTGAATTTGACCGTGATGCAGCCATGCAACGTAAGTTGGAAAAGATGGCTGATCAAGCTATGACCCAAATGTATAAACAGGCTAGATCTGAGGACAAGAGGGCAAAAGTTACTAGTGCTATGCAGACAATGCTTTTCACTATGCTTAGAAAGTTGGATAATGATGCACTCAACAACATTATCAACAATGCAAGAGATGGTTGTGTTCCCTTGAACATAATACCTCTTACAACAGCAGCCAAACTAATGGTTGTCATACCAGACTATAACACATATAAAAATACGTGTGATGGTACAACATTTACTTATGCATCAGCATTGTGGGAAATCCAACAGGTTGTAGATGCAGATAGTAAAATTGTTCAACTTAGTGAAATTAGTATGGACAATTCACCTAATTTAGCATGGCCTCTTATTGTAACAGCTTTAAGGGCCAATTCTGCTGTCAAATTACAGAATAATGAGCTTAGTCCTGTTGCACTACGACAGATGTCTTGTGCTGCCGGTACTACACAAACTGCTTGCACTGATGACAATGCGTTAGCTTACTACAACACAACAAAGGGAGGTAGGTTTGTACTTGCACTGTTATCCGATTTACAGGATTTGAAATGGGCTAGATTCCCTAAGAGTGATGGAACTGGTACTATCTATACAGAACTGGAACCACCTTGTAGGTTTGTTACAGACACACCTAAAGGTCCTAAAGTGAAGTATTTATACTTTATTAAAGGATTAAACAACCTAAATAGAGGTATGGTACTTGGTAGTTTAGCTGCCACAGTACGTCTACAAGCTGGTAATGCAACAGAAGTGCCTGCCAATTCAACTGTATTATCTTTCTGTGCTTTTGCTGTAGATGCTGCTAAAGCTTACAAAGATTATCTAGCTAGTGGGGGACAACCAATCACTAATTGTGTTAAGATGTTGTGTACACACACTGGTACTGGTCAGGCAATAACAGTTACACCGGAAGCCAATATGGATCAAGAATCCTTTGGTGGTGCATCGTGTTGTCTGTACTGCCGTTGCCACATAGATCATCCAAATCCTAAAGGATTTTGTGACTTAAAAGGTAAGTATGTACAAATACCTACAACTTGTGCTAATGACCCTGTGGGTTTTACACTTAAAAACACAGTCTGTACCGTCTGCGGTATGTGGAAAGGTTATGGCTGTAGTTGTGATCAACTCCGCGAACCCATGCTTCAGTCAGCTGATGCACAATCGTTTTTAAACGGGTTTGCGGTGTAAGTGCAGCCCGTCTTACACCGTGCGGCACAGGCACTAGTACTGATGTCGTATACAGGGCTTTTGACATCTACAATGATAAAGTAGCTGGTTTTGCTAAATTCCTAAAAACTAATTGTTGTCGCTTCCAAGAAAAGGACGAAGATGACAATTTAATTGATTCTTACTTTGTAGTTAAGAGACACACTTTCTCTAACTACCAACATGAAGAAACAATTTATAATTTACTTAAGGATTGTCCAGCTGTTGCTAAACATGACTTCTTTAAGTTTAGAATAGACGGTGACATGGTACCACATATATCACGTCAACGTCTTACTAAATACACAATGGCAGACCTCGTCTATGCTTTAAGGCATTTTGATGAAGGTAATTGTGACACATTAAAAGAAATACTTGTCACATACAATTGTTGTGATGATGATTATTTCAATAAAAAGGACTGGTATGATTTTGTAGAAAACCCAGATATATTACGCGTATACGCCAACTTAGGTGAACGTGTACGCCAAGCTTTGTTAAAAACAGTACAATTCTGTGATGCCATGCGAAATGCTGGTATTGTTGGTGTACTGACATTAGATAATCAAGATCTCAATGGTAACTGGTATGATTTCGGTGATTTCATACAAACCACGCCAGGTAGTGGAGTTCCTGTTGTAGATTCTTATTATTCATTGTTAATGCCTATATTAACCTTGACCAGGGCTTTAACTGCAGAGTCACATGTTGACACTGACTTAACAAAGCCTTACATTAAGTGGGATTTGTTAAAATATGACTTCACGGAAGAGAGGTTAAAACTCTTTGACCGTTATTTTAAATATTGGGATCAGACATACCACCCAAATTGTGTTAACTGTTTGGATGACAGATGCATTCTGCATTGTGCAAACTTTAATGTTTTATTCTCTACAGTGTTCCCACCTACAAGTTTTGGACCACTAGTGAGAAAAATATTTGTTGATGGTGTTCCATTTGTAGTTTCAACTGGATACCACTTCAGAGAGCTAGGTGTTGTACATAATCAGGATGTAAACTTACATAGCTCTAGACTTAGTTTTAAGGAATTACTTGTGTATGCTGCTGACCCTGCTATGCACGCTGCTTCTGGTAATCTATTACTAGATAAACGCACTACGTGCTTTTCAGTAGCTGCACTTACTAACAATGTTGCTTTTCAAACTGTCAAACCCGGTAATTTTAACAAAGACTTCTATGACTTTGCTGTGTCTAAGGGTTTCTTTAAGGAAGGAAGTTCTGTTGAATTAAAACACTTCTTCTTTGCTCAGGATGGTAATGCTGCTATCAGCGATTATGACTACTATCGTTATAATCTACCAACAATGTGTGATATCAGACAACTACTATTTGTAGTTGAAGTTGTTGATAAGTACTTTGATTGTTACGATGGTGGCTGTATTAATGCTAACCAAGTCATCGTCAACAACCTAGACAAATCAGCTGGTTTTCCATTTAATAAATGGGGTAAGGCTAGACTTTATTATGATTCAATGAGTTATGAGGATCAAGATGCACTTTTCGCATATACAAAACGTAATGTCATCCCTACTATAACTCAAATGAATCTTAAGTATGCCATTAGTGCAAAGAATAGAGCTCGCACCGTAGCTGGTGTCTCTATCTGTAGTACTATGACCAATAGACAGTTTCATCAAAAATTATTGAAATCAATAGCCGCCACTAGAGGAGCTACTGTAGTAATTGGAACAAGCAAATTCTATGGTGGTTGGCACAACATGTTAAAAACTGTTTATAGTGATGTAGAAAACCCTCACCTTATGGGTTGGGATTATCCTAAATGTGATAGAGCCATGCCTAACATGCTTAGAATTATGGCCTCACTTGTTCTTGCTCGCAAACATACAACGTGTTGTAGCTTGTCACACCGTTTCTATAGATTAGCTAATGAGTGTGCTCAAGTATTGAGTGAAATGGTCATGTGTGGCGGTTCACTATATGTTAAACCAGGTGGAACCTCATCAGGAGATGCCACAACTGCTTATGCTAATAGTGTTTTTAACATTTGTCAAGCTGTCACGGCCAATGTTAATGCACTTTTATCTACTGATGGTAACAAAATTGCCGATAAGTATGTCCGCAATTTACAACACAGACTTTATGAGTGTCTCTATAGAAATAGAGATGTTGACACAGACTTTGTGAATGAGTTTTACGCATATTTGCGTAAACATTTCTCAATGATGATACTCTCTGACGATGCTGTTGTGTGTTTCAATAGCACTTATGCATCTCAAGGTCTAGTGGCTAGCATAAAGAACTTTAAGTCAGTTCTTTATTATCAAAACAATGTTTTTATGTCTGAAGCAAAATGTTGGACTGAGACTGACCTTACTAAAGGACCTCATGAATTTTGCTCTCAACATACAATGCTAGTTAAACAGGGTGATGATTATGTGTACCTTCCTTACCCAGATCCATCAAGAATCCTAGGGGCCGGCTGTTTTGTAGATGATATCGTAAAAACAGATGGTACACTTATGATTGAACGGTTCGTGTCTTTAGCTATAGATGCTTACCCACTTACTAAACATCCTAATCAGGAGTATGCTGATGTCTTTCATTTGTACTTACAATACATAAGAAAGCTACATGATGAGTTAACAGGACACATGTTAGACATGTATTCTGTTATGCTTACTAATGATAACACTTCAAGGTATTGGGAACCTGAGTTTTATGAGGCTATGTACACACCGCATACAGTCTTACAGGCTGTTGGGGCTTGTGTTCTTTGCAATTCACAGACTTCATTAAGATGTGGTGCTTGCATACGTAGACCATTCTTATGTTGTAAATGCTGTTACGACCATGTCATATCAACATCACATAAATTAGTCTTGTCTGTTAATCCGTATGTTTGCAATGCTCCAGGTTGTGATGTCACAGATGTGACTCAACTTTACTTAGGAGGTATGAGCTATTATTGTAAATCACATAAACCACCCATTAGTTTTCCATTGTGTGCTAATGGACAAGTTTTTGGTTTATATAAAAATACATGTGTTGGTAGCGATAATGTTACTGACTTTAATGCAATTGCAACATGTGACTGGACAAATGCTGGTGATTACATTTTAGCTAACACCTGTACTGAAAGACTCAAGCTTTTTGCAGCAGAAACGCTCAAAGCTACTGAGGAGACATTTAAACTGTCTTATGGTATTGCTACTGTACGTGAAGTGCTGTCTGACAGAGAATTACATCTTTCATGGGAAGTTGGTAAACCTAGACCACCACTTAACCGAAATTATGTCTTTACTGGTTATCGTGTAACTAAAAACAGTAAAGTACAAATAGGAGAGTACACCTTTGAAAAAGGTGACTATGGTGATGCTGTTGTTTACCGAGGTACAACAACTTACAAATTAAATGTTGGTGATTATTTTGTGCTGACATCACATACAGTAATGCCATTAAGTGCACCTACACTAGTGCCACAAGAGCACTATGTTAGAATTACTGGCTTATACCCAACACTCAATATCTCAGATGAGTTTTCTAGCAATGTTGCAAATTATCAAAAGGTTGGTATGCAAAAGTATTCTACACTCCAGGGACCACCTGGTACTGGTAAGAGTCATTTTGCTATTGGCCTAGCTCTCTACTACCCTTCTGCTCGCATAGTGTATACAGCTTGCTCTCATGCCGCTGTTGATGCACTATGTGAGAAGGCATTAAAATATTTGCCTATAGATAAATGTAGTAGAATTATACCTGCACGTGCTCGTGTAGAGTGTTTTGATAAATTCAAAGTGAATTCAACATTAGAACAGTATGTCTTTTGTACTGTAAATGCATTGCCTGAGACGACAGCAGATATAGTTGTCTTTGATGAAATTTCAATGGCCACAAATTATGATTTGAGTGTTGTCAATGCCAGATTACGTGCTAAGCACTATGTGTACATTGGCGACCCTGCTCAATTACCTGCACCACGCACATTGCTAACTAAGGGCACACTAGAACCAGAATATTTCAATTCAGTGTGTAGACTTATGAAAACTATAGGTCCAGACATGTTCCTCGGAACTTGTCGGCGTTGTCCTGCTGAAATTGTTGACACTGTGAGTGCTTTGGTTTATGATAATAAGCTTAAAGCACATAAAGACAAATCAGCTCAATGCTTTAAAATGTTTTATAAGGGTGTTATCACGCATGATGTTTCATCTGCAATTAACAGGCCACAAATAGGCGTGGTAAGAGAATTCCTTACACGTAACCCTGCTTGGAGAAAAGCTGTCTTTATTTCACCTTATAATTCACAGAATGCTGTAGCCTCAAAGATTTTGGGACTACCAACTCAAACTGTTGATTCATCACAGGGCTCAGAATATGACTATGTCATATTCACTCAAACCACTGAAACAGCTCACTCTTGTAATGTAAACAGATTTAATGTTGCTATTACCAGAGCAAAAGTAGGCATACTTTGCATAATGTCTGATAGAGACCTTTATGACAAGTTGCAATTTACAAGTCTTGAAATTCCACGTAGGAATGTGGCAACTTTACAAGCTGAAAATGTAACAGGACTCTTTAAAGATTGTAGTAAGGTAATCACTGGGTTACATCCTACACAGGCACCTACACACCTCAGTGTTGACACTAAATTCAAAACTGAAGGTTTATGTGTTGACATACCTGGCATACCTAAGGACATGACCTATAGAAGACTCATCTCTATGATGGGTTTTAAAATGAATTATCAAGTTAATGGTTACCCTAACATGTTTATCACCCGCGAAGAAGCTATAAGACATGTACGTGCATGGATTGGCTTCGATGTCGAGGGGTGTCATGCTACTAGAGAAGCTGTTGGTACCAATTTACCTTTACAGCTAGGTTTTTCTACAGGTGTTAACCTAGTTGCTGTACCTACAGGTTATGTTGATACACCTAATAATACAGATTTTTCCAGAGTTAGTGCTAAACCACCGCCTGGAGATCAATTTAAACACCTCATACCACTTATGTACAAAGGACTTCCTTGGAATGTAGTGCGTATAAAGATTGTACAAATGTTAAGTGACACACTTAAAAATCTCTCTGACAGAGTCGTATTTGTCTTATGGGCACATGGCTTTGAGTTGACATCTATGAAGTATTTTGTGAAAATAGGACCTGAGCGCACCTGTTGTCTATGTGATAGACGTGCCACATGCTTTTCCACTGCTTCAGACACTTATGCCTGTTGGCATCATTCTATTGGATTTGATTACGTCTATAATCCGTTTATGATTGATGTTCAACAATGGGGTTTTACAGGTAACCTACAAAGCAACCATGATCTGTATTGTCAAGTCCATGGTAATGCACATGTAGCTAGTTGTGATGCAATCATGACTAGGTGTCTAGCTGTCCACGAGTGCTTTGTTAAGCGTGTTGACTGGACTATTGAATATCCTATAATTGGTGATGAACTGAAGATTAATGCGGCTTGTAGAAAGGTTCAACACATGGTTGTTAAAGCTGCATTATTAGCAGACAAATTCCCAGTTCTTCACGACATTGGTAACCCTAAAGCTATTAAGTGTGTACCTCAAGCTGATGTAGAATGGAAGTTCTATGATGCACAGCCTTGTAGTGACAAAGCTTATAAAATAGAAGAATTATTCTATTCTTATGCCACACATTCTGACAAATTCACAGATGGTGTATGCCTATTTTGGAATTGCAATGTCGATAGATATCCTGCTAATTCCATTGTTTGTAGATTTGACACTAGAGTGCTATCTAACCTTAACTTGCCTGGTTGTGATGGTGGCAGTTTGTATGTAAATAAACATGCATTCCACACACCAGCTTTTGATAAAAGTGCTTTTGTTAATTTAAAACAATTACCATTTTTCTATTACTCTGACAGTCCATGTGAGTCTCATGGAAAACAAGTAGTGTCAGATATAGATTATGTACCACTAAAGTCTGCTACGTGTATAACACGTTGCAATTTAGGTGGTGCTGTCTGTAGACATCATGCTAATGAGTACAGATTGTATCTCGATGCTTATAACATGATGATCTCAGCTGGCTTTAGCTTGTGGGTTTACAAACAATTTGATACTTATAACCTCTGGAACACTTTTACAAGACTTCAGAGTTTAGAAAATGTGGCTTTTAATGTTGTAAATAAGGGACACTTTGATGGACAACAGGGTGAAGTACCAGTTTCTATCATTAATAACACTGTTTACACAAAAGTTGATGGTGTTGATGTAGAATTGTTTGAAAATAAAACAACATTACCTGTTAATGTAGCATTTGAGCTTTGGGCTAAGCGCAACATTAAACCAGTACCAGAGGTGAAAATACTCAATAATTTGGGTGTGGACATTGCTGCTAATACTGTGATCTGGGACTACAAAAGAGATGCTCCAGCACATATATCTACTATTGGTGTTTGTTCTATGACTGACATAGCCAAGAAACCAACTGAAACGATTTGTGCACCACTCACTGTCTTTTTTGATGGTAGAGTTGATGGTCAAGTAGACTTATTTAGAAATGCCCGTAATGGTGTTCTTATTACAGAAGGTAGTGTTAAAGGTTTACAACCATCTGTAGGTCCCAAACAAGCTAGTCTTAATGGAGTCACATTAATTGGAGAAGCCGTAAAAACACAGTTCAATTATTATAAGAAAGTTGATGGTGTTGTCCAACAATTACCTGAAACTTACTTTACTCAGAGTAGAAATTTACAAGAATTTAAACCCAGGAGTCAAATGGAAATTGATTTCTTAGAATTAGCTATGGATGAATTCATTGAACGGTATAAATTAGAAGGCTATGCCTTCGAACATATCGTTTATGGAGATTTTAGTCATAGTCAGTTAGGTGGTTTACATCTACTGATTGGACTAGCTAAACGTTTTAAGGAATCACCTTTTGAATTAGAAGATTTTATTCCTATGGACAGTACAGTTAAAAACTATTTCATAACAGATGCGCAAACAGGTTCATCTAAGTGTGTGTGTTCTGTTATTGATTTATTACTTGATGATTTTGTTGAAATAATAAAATCCCAAGATTTATCTGTAGTTTCTAAGGTTGTCAAAGTGACTATTGACTATACAGAAATTTCATTTATGCTTTGGTGTAAAGATGGCCATGTAGAAACATTTTACCCAAAATTACAATCTAGTCAAGCGTGGCAACCGGGTGTTGCTATGCCTAATCTTTACAAAATGCAAAGAATGCTATTAGAAAAGTGTGACCTTCAAAATTATGGTGATAGTGCAACATTACCTAAAGGCATAATGATGAATGTCGCAAAATATACTCAACTGTGTCAATATTTAAACACATTAACATTAGCTGTACCCTATAATATGAGAGTTATACATTTTGGTGCTGGTTCTGATAAAGGAGTTGCACCAGGTACAGCTGTTTTAAGACAGTGGTTGCCTACGGGTACGCTGCTTGTCGATTCAGATCTTAATGACTTTGTCTCTGATGCAGATTCAACTTTGATTGGTGATTGTGCAACTGTACATACAGCTAATAAATGGGATCTCATTATTAGTGATATGTACGACCCTAAGACTAAAAATGTTACAAAAGAAAATGACTCTAAAGAGGGTTTTTTCACTTACATTTGTGGGTTTATACAACAAAAGCTAGCTCTTGGAGGTTCCGTGGCTATAAAGATAACAGAACATTCTTGGAATGCTGATCTTTATAAGCTCATGGGACACTTCGCATGGTGGACAGCCTTTGTTACTAATGTGAATGCGTCATCATCTGAAGCATTTTTAATTGGATGTAATTATCTTGGCAAACCACGCGAACAAATAGATGGTTATGTCATGCATGCAAATTACATATTTTGGAGGAATACAAATCCAATTCAGTTGTCTTCCTATTCTTTATTTGACATGAGTAAATTTCCCCTTAAATTAAGGGGTACTGCTGTTATGTCTTTAAAAGAAGGTCAAATCAATGATATGATTTTATCTCTTCTTAGTAAAGGTAGACTTATAATTAGAGAAAACAACAGAGTTGTTATTTCTAGTGATGTTCTTGTTAACAACTAAACGAACAATGTTTGTTTTTCTTGTTTTATTGCCACTAGTCTCTAGTCAGTGTGTTAATCTTACAACCAGAACTCAATTACCCCCTGCATACACTAATTCTTTCACACGTGGTGTTTATTACCCTGACAAAGTTTTCAGATCCTCAGTTTTACATTCAACTCAGGACTTGTTCTTACCTTTCTTTTCCAATGTTACTTGGTTCCATGCTATACATGTCTCTGGGACCAATGGTACTAAGAGGTTTGATAACCCTGTCCTACCATTTAATGATGGTGTTTATTTTGCTTCCACTGAGAAGTCTAACATAATAAGAGGCTGGATTTTTGGTACTACTTTAGATTCGAAGACCCAGTCCCTACTTATTGTTAATAACGCTACTAATGTTGTTATTAAAGTCTGTGAATTTCAATTTTGTAATGATCCATTTTTGGGTGTTTATTACCACAAAAACAACAAAAGTTGGATGGAAAGTGAGTTCAGAGTTTATTCTAGTGCGAATAATTGCACTTTTGAATATGTCTCTCAGCCTTTTCTTATGGACCTTGAAGGAAAACAGGGTAATTTCAAAAATCTTAGGGAATTTGTGTTTAAGAATATTGATGGTTATTTTAAAATATATTCTAAGCACACGCCTATTAATTTAGTGCGTGATCTCCCTCAGGGTTTTTCGGCTTTAGAACCATTGGTAGATTTGCCAATAGGTATTAACATCACTAGGTTTCAAACTTTACTTGCTTTACATAGAAGTTATTTGACTCCTGGTGATTCTTCTTCAGGTTGGACAGCTGGTGCTGCAGCTTATTATGTGGGTTATCTTCAACCTAGGACTTTTCTATTAAAATATAATGAAAATGGAACCATTACAGATGCTGTAGACTGTGCACTTGACCCTCTCTCAGAAACAAAGTGTACGTTGAAATCCTTCACTGTAGAAAAAGGAATCTATCAAACTTCTAACTTTAGAGTCCAACCAACAGAATCTATTGTTAGATTTCCTAATATTACAAACTTGTGCCCTTTTGGTGAAGTTTTTAACGCCACCAGATTTGCATCTGTTTATGCTTGGAACAGGAAGAGAATCAGCAACTGTGTTGCTGATTATTCTTTCCTATATAATTCCGCATCATTTTCCACTTTTAAGTGTTATGGAGTGTCTCCTACTAAATTAAATGATCTCTGCTTTACTAATGTCTATGCAGATTCATTTGTAATTAGAGGTGATGAAGTCAGACAAATCGCTCCAGGGCAAACTGGAAAGATTGCTGATTATAATTATAAATTACCAGATGATTTTACAGGCTGCGTTATAGCTTGGAATTCTAACAATCTTGATTCTAAGGTTGGTGGTAATTATAATTACCTGTATAGATTGTTTAGGAAGTCTAATCTCAAACCTTTTGAGAGAGATATTTCAACTGAAATCTATCAGGCCGGTAGCACACCTTGTAATGGTGTTGAAGGTTTTAATTGTTACTTTCCTTTACAATCATATGGTTTCCAACCCACTAATGGTGTTGGTTACCAACCATACAGAGTAGTAGTACTTTCTTTTGAACTTCTACATGCACCAGCAACTGTTTGTGGACCTAAAAAGTCTACTAATTTGGTTAAAAACAAATGTGTCAATTTCAACTTCAATGGTTTAACAGGCACAGGTGTTCTTACTGAGTCTAACAAAAAGTTTCTGCCTTTCCAACAATTTGGCAGAGACATTGCTGACACTACTGATGCTGTCCGTGATCCACAGACACTTGAGATTCTTGACATTACACCATGTTCTTTTGGTGGTGTCAGTGTTATAACACCAGGAACAAATACTTCTAACCAGGTTGCTGTTCTTTATCAGGATGTTAACTGCACAGAAGTCCCTGTTGCTATTCATGCAGATCAACTTACTCCTACTTGGCGTGTTTATTCTACAGGTTCTAATGTTTTTCAAACACGTGCAGGCTGTTTAATAGGGGCTGAACATGTCAACAACTCATATGAGTGTGACATACCCATTGGTGCAGGTATATGCGCTAGTTATCAGACTCAGACTAATTCTCCTCGGCGGGCACGTAGTGTAGCTAGTCAATCCATCATTGCCTACACTATGTCACTTGGTGCAGAAAATTCAGTTGCTTACTCTAATAACTCTATTGCCATACCCACAAATTTTACTATTAGTGTTACCACAGAAATTCTACCAGTGTCTATGACCAAGACATCAGTAGATTGTACAATGTACATTTGTGGTGATTCAACTGAATGCAGCAATCTTTTGTTGCAATATGGCAGTTTTTGTACACAATTAAACCGTGCTTTAACTGGAATAGCTGTTGAACAAGACAAAAACACCCAAGAAGTTTTTGCACAAGTCAAACAAATTTACAAAACACCACCAATTAAAGATTTTGGTGGTTTTAATTTTTCACAAATATTACCAGATCCATCAAAACCAAGCAAGAGGTCATTTATTGAAGATCTACTTTTCAACAAAGTGACACTTGCAGATGCTGGCTTCATCAAACAATATGGTGATTGCCTTGGTGATATTGCTGCTAGAGACCTCATTTGTGCACAAAAGTTTAACGGCCTTACTGTTTTGCCACCTTTGCTCACAGATGAAATGATTGCTCAATACACTTCTGCACTGTTAGCGGGTACAATCACTTCTGGTTGGACCTTTGGTGCAGGTGCTGCATTACAAATACCATTTGCTATGCAAATGGCTTATAGGTTTAATGGTATTGGAGTTACACAGAATGTTCTCTATGAGAACCAAAAATTGATTGCCAACCAATTTAATAGTGCTATTGGCAAAATTCAAGACTCACTTTCTTCCACAGCAAGTGCACTTGGAAAACTTCAAGATGTGGTCAACCAAAATGCACAAGCTTTAAACACGCTTGTTAAACAACTTAGCTCCAATTTTGGTGCAATTTCAAGTGTTTTAAATGATATCCTTTCACGTCTTGACAAAGTTGAGGCTGAAGTGCAAATTGATAGGTTGATCACAGGCAGACTTCAAAGTTTGCAGACATATGTGACTCAACAATTAATTAGAGCTGCAGAAATCAGAGCTTCTGCTAATCTTGCTGCTACTAAAATGTCAGAGTGTGTACTTGGACAATCAAAAAGAGTTGATTTTTGTGGAAAGGGCTATCATCTTATGTCCTTCCCTCAGTCAGCACCTCATGGTGTAGTCTTCTTGCATGTGACTTATGTCCCTGCACAAGAAAAGAACTTCACAACTGCTCCTGCCATTTGTCATGATGGAAAAGCACACTTTCCTCGTGAAGGTGTCTTTGTTTCAAATGGCACACACTGGTTTGTAACACAAAGGAATTTTTATGAACCACAAATCATTACTACAGACAACACATTTGTGTCTGGTAACTGTGATGTTGTAATAGGAATTGTCAACAACACAGTTTATGATCCTTTGCAACCTGAATTAGACTCATTCAAGGAGGAGTTAGATAAATATTTTAAGAATCATACATCACCAGATGTTGATTTAGGTGACATCTCTGGCATTAATGCTTCAGTTGTAAACATTCAAAAAGAAATTGACCGCCTCAATGAGGTTGCCAAGAATTTAAATGAATCTCTCATCGATCTCCAAGAACTTGGAAAGTATGAGCAGTATATAAAATGGCCATGGTACATTTGGCTAGGTTTTATAGCTGGCTTGATTGCCATAGTAATGGTGACAATTATGCTTTGCTGTATGACCAGTTGCTGTAGTTGTCTCAAGGGCTGTTGTTCTTGTGGATCCTGCTGCAAATTTGATGAAGACGACTCTGAGCCAGTGCTCAAAGGAGTCAAATTACATTACACATAAACGAACTTATGGATTTGTTTATGAGAATCTTCACAATTGGAACTGTAACTTTGAAGCAAGGTGAAATCAAGGATGCTACTCCTTCAGATTTTGTTCGCGCTACTGCAACGATACCGATACAAGCCTCACTCCCTTTCGGATGGCTTATTGTTGGCGTTGCACTTCTTGCTGTTTTTCAGAGCGCTTCCAAAATCATAACCCTCAAAAAGAGATGGCAACTAGCACTCTCCAAGGGTGTTCACTTTGTTTGCAACTTGCTGTTGTTGTTTGTAACAGTTTACTCACACCTTTTGCTCGTTGCTGCTGGCCTTGAAGCCCCTTTTCTCTATCTTTATGCTTTAGTCTACTTCTTGCAGAGTATAAACTTTGTAAGAATAATAATGAGGCTTTGGCTTTGCTGGAAATGCCGTTCCAAAAACCCATTACTTTATGATGCCAACTATTTTCTTTGCTGGCATACTAATTGTTACGACTATTGTATACCTTACAATAGTGTAACTTCTTCAATTGTCATTACTTCAGGTGATGGCACAACAAGTCCTATTTCTGAACATGACTACCAGATTGGTGGTTATACTGAAAAATGGGAATCTGGAGTAAAAGACTGTGTTGTATTACACAGTTACTTCACTTCAGACTATTACCAGCTGTACTCAACTCAATTGAGTACAGACACTGGTGTTGAACATGTTACCTTCTTCATCTACAATAAAATTGTTGATGAGCCTGAAGAACATGTCCAAATTCACACAATCGACGTTTCATCCGGAGTTGTTAATCCAGTAATGGAACCAATTTATGATGAACCGACGACGACTACTAGCGTGCCTTTGTAAGCACAAGCTGATGAGTACGAACTTATGTACTCATTCGTTTCGGAAGAGACAGGTACGTTAATAGTTAATAGCGTACTTCTTTTTCTTGCTTTCGTGGTATTCTTGCTAGTTACACTAGCCATCCTTACTGCGCTTCGATTGTGTGCGTACTGCTGCAATATTGTTAACGTGAGTCTTGTAAAACCTTCTTTTTACGTTTACTCTCGTGTTAAAAATCTGAATTCTTCTAGAGTTCCTGATCTTCTGGTCTAAACGAACTAAATATTATATTAGTTTTTCTGTTTGGAACTTTAATTTTAGCCATGGCAGATTCCAACGGTACTATTACCGTTGAAGAGCTTAAAAAGCTCCTTGAACAATGGAACCTAGTAATAGGTTTCCTATTCCTTACATGGATTTGTCTTCTACAATTTGCCTATGCCAACAGGAATAGGTTTTTGTATATAATTAAGTTAATTTTCCTCTGGCTGTTATGGCCAGTAACTTTAGCTTGTTTTGTGCTTGCTGCTGTTTACAGAATAAATTGGATCACCGGTGGAATTGCTATCGCAATGGCTTGTCTTGTAGGCTTGATGTGGCTCAGCTACTTCATTGCTTCTTTCAGACTGTTTGCGCGTACGCGTTCCATGTGGTCATTCAATCCAGAAACTAACATTCTTCTCAACGTGCCACTCCATGGCACTATTCTGACCAGACCGCTTCTAGAAAGTGAACTCGTAATCGGAGCTGTGATCCTTCGTGGACATCTTCGTATTGCTGGACACCATCTAGGACGCTGTGACATCAAGGACCTGCCTAAAGAAATCACTGTTGCTACATCACGAACGCTTTCTTATTACAAATTGGGAGCTTCGCAGCGTGTAGCAGGTGACTCAGGTTTTGCTGCATACAGTCGCTACAGGATTGGCAACTATAAATTAAACACAGACCATTCCAGTAGCAGTGACAATATTGCTTTGCTTGTACAGTAAGTGACAACAGATGTTTCATCTCGTTGACTTTCAGGTTACTATAGCAGAGATATTACTAATTATTATGAGGACTTTTAAAGTTTCCATTTGGAATCTTGATTACATCATAAACCTCATAATTAAAAATTTATCTAAGTCACTAACTGAGAATAAATATTCTCAATTAGATGAAGAGCAACCAATGGAGATTGATTAAACGAACATGAAAATTATTCTTTTCTTGGCACTGATAACACTCGCTACTTGTGAGCTTTATCACTACCAAGAGTGTGTTAGAGGTACAACAGTACTTTTAAAAGAACCTTGCTCTTCTGGAACATACGAGGGCAATTCACCATTTCATCCTCTAGCTGATAACAAATTTGCACTGACTTGCTTTAGCACTCAATTTGCTTTTGCTTGTCCTGACGGCGTAAAACACGTCTATCAGTTACGTGCCAGATCAGTTTCACCTAAACTGTTCATCAGACAAGAGGAAGTTCAAGAACTTTACTCTCCAATTTTTCTTATTGTTGCGGCAATAGTGTTTATAACACTTTGCTTCACACTCAAAAGAAAGACAGAATGATTGAACTTTCATTAATTGACTTCTATTTGTGCTTTTTAGCCTTTCTGCTATTCCTTGTTTTAATTATGCTTATTATCTTTTGGTTCTCACTTGAACTGCAAGATCATAATGAAACTTGTCACGCCTAAACGAACATGAAATTTCTTGTTTTCTTAGGAATCATCACAACTGTAGCTGCATTTCACCAAGAATGTAGTTTACAGTCATGTACTCAACATCAACCATATGTAGTTGATGACCCGTGTCCTATTCACTTCTATTCTAAATGGTATATTAGAGTAGGAGCTAGAAAATCAGCACCTTTAATTGAATTGTGCGTGGATGAGGCTGGTTCTAAATCACCCATTCAGTACATCGATATCGGTAATTATACAGTTTCCTGTTTACCTTTTACAATTAATTGCCAGGAACCTAAATTGGGTAGTCTTGTAGTGCGTTGTTCGTTCTATGAAGACTTTTTAGAGTATCATGACGTTCGTGTTGTTTTAGATTTCATCTAAACGAACAAACTAAAATGTCTGATAATGGACCCCAAAATCAGCGAAATGCACCCCGCATTACGTTTGGTGGACCCTCAGATTCAACTGGCAGTAACCAGAATGGAGAACGCAGTGGGGCGCGATCAAAACAACGTCGGCCCCAAGGTTTACCCAATAATACTGCGTCTTGGTTCACCGCTCTCACTCAACATGGCAAGGAAGACCTTAAATTCCCTCGAGGACAAGGCGTTCCAATTAACACCAATAGCAGTCCAGATGACCAAATTGGCTACTACCGAAGAGCTACCAGACGAATTCGTGGTGGTGACGGTAAAATGAAAGATCTCAGTCCAAGATGGTATTTCTACTACCTAGGAACTGGGCCAGAAGCTGGACTTCCCTATGGTGCTAACAAAGACGGCATCATATGGGTTGCAACTGAGGGAGCCTTGAATACACCAAAAGATCACATTGGCACCCGCAATCCTGCTAACAATGCTGCAATCGTGCTACAACTTCCTCAAGGAACAACATTGCCAAAAGGCTTCTACGCAGAAGGGAGCAGAGGCGGCAGTCAAGCCTCTTCTCGTTCCTCATCACGTAGTCGCAACAGTTCAAGAAATTCAACTCCAGGCAGCAGTAGGGGAACTTCTCCTGCTAGAATGGCTGGCAATGGCGGTGATGCTGCTCTTGCTTTGCTGCTGCTTGACAGATTGAACCAGCTTGAGAGCAAAATGTCTGGTAAAGGCCAACAACAACAAGGCCAAACTGTCACTAAGAAATCTGCTGCTGAGGCTTCTAAGAAGCCTCGGCAAAAACGTACTGCCACTAAAGCATACAATGTAACACAAGCTTTCGGCAGACGTGGTCCAGAACAAACCCAAGGAAATTTTGGGGACCAGGAACTAATCAGACAAGGAACTGATTACAAACATTGGCCGCAAATTGCACAATTTGCCCCCAGCGCTTCAGCGTTCTTCGGAATGTCGCGCATTGGCATGGAAGTCACACCTTCGGGAACGTGGTTGACCTACACAGGTGCCATCAAATTGGATGACAAAGATCCAAATTTCAAAGATCAAGTCATTTTGCTGAATAAGCATATTGACGCATACAAAACATTCCCACCAACAGAGCCTAAAAAGGACAAAAAGAAGAAGGCTGATGAAACTCAAGCCTTACCGCAGAGACAGAAGAAACAGCAAACTGTGACTCTTCTTCCTGCTGCAGATTTGGATGATTTCTCCAAACAATTGCAACAATCCATGAGCAGTGCTGACTCAACTCAGGCCTAAACTCATGCAGACCACACAAGGCAGATGGGCTATATAAACGTTTTCGCTTTTCCGTTTACGATATATAGTCTACTCTTGTGCAGAATGAATTCTCGTAACTACATAGCACAAGTAGATGTAGTTAACTTTAATCTCACATAGCAATCTTTAATCAGTGTGTAACATTAGGGAGGACTTGAAAGAGCCACCACATTTTCACCGAGGCCACGCGGAGTACGATCGAGTGTACAGTGAACAATGCTAGGGAGAGCTGCCTATATGGAAGAGCCCTAATGTGTAAAATTAATTTTAGTAGTGCTATCCCCATGTGATTTTAATAGCTTCTTAGGAGAATGACAAAA
서열번호: 13
>중증 급성 호흡기 증후군 코로나바이러스 2 orf1ab 다단백질의 단리물 hCoV-19/France/IDF0372-isl/2020
MESLVPGFNEKTHVQLSLPVLQVRDVLVRGFGDSVEEVLSEARQHLKDGTCGLVEVEKGVLPQLEQPYVFIKRSDARTAPHGHVMVELVAELEGIQYGRSGETLGVLVPHVGEIPVAYRKVLLRKNGNKGAGGHSYGADLKSFDLGDELGTDPYEDFQENWNTKHSSGVTRELMRELNGGAYTRYVDNNFCGPDGYPLECIKDLLARAGKASCTLSEQLDFIDTKRGVYCCREHEHEIAWYTERSEKSYELQTPFEIKLAKKFDTFNGECPNFVFPLNSIIKTIQPRVEKKKLDGFMGRIRSVYPVASPNECNQMCLSTLMKCDHCGETSWQTGDFVKATCEFCGTENLTKEGATTCGYLPQNAVVKIYCPACHNSEVGPEHSLAEYHNESGLKTILRKGGRTIAFGGCVFSYVGCHNKCAYWVPRASANIGCNHTGVVGEGSEGLNDNLLEILQKEKVNINIVGDFKLNEEIAIILASFSASTSAFVETVKGLDYKAFKQIVESCGNFKVTKGKAKKGAWNIGEQKSILSPLYAFASEAARVVRSIFSRTLETAQNSVRVLQKAAITILDGISQYSLRLIDAMMFTSDLATNNLVVMAYITGGVVQLTSQWLTNIFGTVYEKLKPVLDWLEEKFKEGVEFLRDGWEIVKFISTCACEIVGGQIVTCAKEIKESVQTFFKLVNKFLALCADSIIIGGAKLKALNLGETFVTHSKGLYRKCVKSREETGLLMPLKAPKEIIFLEGETLPTEVLTEEVVLKTGDLQPLEQPTSEAVEAPLVGTPVCINGLMLLEIKDTEKYCALAPNMMVTNNTFTLKGGAPTKVTFGDDTVIEVQGYKSVNITFELDERIDKVLNEKCSAYTVELGTEVNEFACVVADAVIKTLQPVSELLTPLGIDLDEWSMATYYLFDESGEFKLASHMYCSFYPPDEDEEEGDCEEEEFEPSTQYEYGTEDDYQGKPLEFGATSAALQPEEEQEEDWLDDDSQQTVGQQDGSEDNQTTTIQTIVEVQPQLEMELTPVVQTIEVNSFSGYLKLTDNVYIKNADIVEEAKKVKPTVVVNAANVYLKHGGGVAGALNKATNNAMQVESDDYIATNGPLKVGGSCVLSGHNLAKHCLHVVGPNVNKGEDIQLLKSAYENFNQHEVLLAPLLSAGIFGADPIHSLRVCVDTVRTNVYLAVFDKNLYDKLVSSFLEMKSEKQVEQKIAEIPKEEVKPFITESKPSVEQRKQDDKKIKACVEEVTTTLEETKFLTENLLLYIDINGNLHPDSATLVSDIDITFLKKDAPYIVGDVVQEGVLTAVVIPTKKAGGTTEMLAKALRKVPTDNYITTYPGQGLNGYTVEEAKTVLKKCKSAFYILPSIISNEKQEILGTVSWNLREMLAHAEETRKLMPVCVETKAIVSTIQRKYKGIKIQEGVVDYGARFYFYTSKTTVASLINTLNDLNETLVTMPLGYVTHGLNLEEAARYMRSLKVPATVSVSSPDAVTAYNGYLTSSSKTPEEHFIETISLAGSYKDWSYSGQSTQLGIEFLKRGDKSVYYTSNPTTFHLDGEVITFDNLKTLLSLREVRTIKVFTTVDNINLHTQVVDMSMTYGQQFGPTYLDGADVTKIKPHNSHEGKTFYVLPNDDTLRVEAFEYYHTTDPSFLGRYMSALNHTKKWKYPQVNGLTSIKWADNNCYLATALLTLQQIELKFNPPALQDAYYRARAGEAANFCALILAYCNKTVGELGDVRETMSYLFQHANLDSCKRVLNVVCKTCGQQQTTLKGVEAVMYMGTLSYEQFKKGVQIPCTCGKQATKYLVQQESPFVMMSAPPAQYELKHGTFTCASEYTGNYQCGHYKHITSKETLYCIDGALLTKSSEYKGPITDVFYKENSYTTTIKPVTYKLDGVVCTEIDPKLDNYYKKDNSYFTEQPIDLVPNQPYPNASFDNFKFVCDNIKFADDLNQLTGYKKPASRELKVTFFPDLNGDVVAIDYKHYTPSFKKGAKLLHKPIVWHVNNATNKATYKPNTWCIRCLWSTKPVETSNSFDVLKSEDAQGMDNLACEDLKPVSEEVVENPTIQKDVLECNVKTTEVVGDIILKPANNSLKITEEVGHTDLMAAYVDNSSLTIKKPNELSRVLGLKTLATHGLAAVNSVPWDTIANYAKPFLNKVVSTTTNIVTRCLNRVCTNYMPYFFTLLLQLCTFTRSTNSRIKASMPTTIAKNTVKSVGKFCLEASFNYLKSPNFSKLINIIIWFLLLSVCLGSLIYSTAALGVLMSNLGMPSYCTGYREGYLNSTNVTIATYCTGSIPCSVCLSGLDSLDTYPSLETIQITISSFKWDLTAFGLVAEWFLAYILFTRFFYVLGLAAIMQLFFSYFAVHFISNSWLMWLIINLVQMAPISAMVRMYIFFASFYYVWKSYVHVVDGCNSSTCMMCYKRNRATRVECTTIVNGVRRSFYVYANGGKGFCKLHNWNCVNCDTFCAGSTFISDEVARDLSLQFKRPINPTDQSSYIVDSVTVKNGSIHLYFDKAGQKTYERHSLSHFVNLDNLRANNTKGSLPINVIVFDGKSKCEESSAKSASVYYSQLMCQPILLLDQALVSDVGDSAEVAVKMFDAYVNTFSSTFNVPMEKLKTLVATAEAELAKNVSLDNVLSTFISAARQGFVDSDVETKDVVECLKLSHQSDIEVTGDSCNNYMLTYNKVENMTPRDLGACIDCSARHINAQVAKSHNIALIWNVKDFMSLSEQLRKQIRSAAKKNNLPFKLTCATTRQVVNVVTTKIALKGGKIVNNWLKQLIKVTLVFLFVAAIFYLITPVHVMSKHTDFSSEIIGYKAIDGGVTRDIASTDTCFANKHADFDTWFSQRGGSYTNDKACPLIAAVITREVGFVVPGLPGTILRTTNGDFLHFLPRVFSAVGNICYTPSKLIEYTDFATSACVLAAECTIFKDASGKPVPYCYDTNVLEGSVAYESLRPDTRYVLMDGSIIQFPNTYLEGSVRVVTTFDSEYCRHGTCERSEAGVCVSTSGRWVLNNDYYRSLPGVFCGVDAVNLLTNMFTPLIQPIGALDISASIVAGGIVAIVVTCLAYYFMRFRRAFGEYSHVVAFNTLLFLMSFTVLCLTPVYSFLPGVYSVIYLYLTFYLTNDVSFLAHIQWMVMFTPLVPFWITIAYIICISTKHFYWFFSNYLKRRVVFNGVSFSTFEEAALCTFLLNKEMYLKLRSDVLLPLTQYNRYLALYNKYKYFSGAMDTTSYREAACCHLAKALNDFSNSGSDVLYQPPQTSITSAVLQSGFRKMAFPSGKVEGCMVQVTCGTTTLNGLWLDDVVYCPRHVICTSEDMLNPNYEDLLIRKSNHNFLVQAGNVQLRVIGHSMQNCVLKLKVDTANPKTPKYKFVRIQPGQTFSVLACYNGSPSGVYQCAMRPNFTIKGSFLNGSCGSVGFNIDYDCVSFCYMHHMELPTGVHAGTDLEGNFYGPFVDRQTAQAAGTDTTITVNVLAWLYAAVINGDRWFLNRFTTTLNDFNLVAMKYNYEPLTQDHVDILGPLSAQTGIAVLDMCASLKELLQNGMNGRTILGSALLEDEFTPFDVVRQCSGVTFQSAVKRTIKGTHHWLLLTILTSLLVLVQSTQWSLFFFLYENAFLPFAMGIIAMSAFAMMFVKHKHAFLCLFLLPSLATVAYFNMVYMPASWVMRIMTWLDMVDTSLSGFKLKDCVMYASAVVLLILMTARTVYDDGARRVWTLMNVLTLVYKVYYGNALDQAISMWALIISVTSNYSGVVTTVMFLARGIVFMCVEYCPIFFITGNTLQCIMLVYCFLGYFCTCYFGLFCLLNRYFRLTLGVYDYLVSTQEFRYMNSQGLLPPKNSIDAFKLNIKLLGVGGKPCIKVATVQSKMSDVKCTSVVLLSVLQQLRVESSSKLWAQCVQLHNDILLAKDTTEAFEKMVSLLSVLLSMQGAVDINKLCEEMLDNRATLQAIASEFSSLPSYAAFATAQEAYEQAVANGDSEVVLKKLKKSLNVAKSEFDRDAAMQRKLEKMADQAMTQMYKQARSEDKRAKVTSAMQTMLFTMLRKLDNDALNNIINNARDGCVPLNIIPLTTAAKLMVVIPDYNTYKNTCDGTTFTYASALWEIQQVVDADSKIVQLSEISMDNSPNLAWPLIVTALRANSAVKLQNNELSPVALRQMSCAAGTTQTACTDDNALAYYNTTKGGRFVLALLSDLQDLKWARFPKSDGTGTIYTELEPPCRFVTDTPKGPKVKYLYFIKGLNNLNRGMVLGSLAATVRLQAGNATEVPANSTVLSFCAFAVDAAKAYKDYLASGGQPITNCVKMLCTHTGTGQAITVTPEANMDQESFGGASCCLYCRCHIDHPNPKGFCDLKGKYVQIPTTCANDPVGFTLKNTVCTVCGMWKGYGCSCDQLREPMLQSADAQSFLNGFAV
서열번호: 14
>단백질\S_인간\2019-nCoV (S단백질_hCoV19FranceIDF0372isl2020)
MFVFLVLLPLVSSQCVNLTTRTQLPPAYTNSFTRGVYYPDKVFRSSVLHSTQDLFLPFFSNVTWFHAIHVSGTNGTKRFDNPVLPFNDGVYFASTEKSNIIRGWIFGTTLDSKTQSLLIVNNATNVVIKVCEFQFCNDPFLGVYYHKNNKSWMESEFRVYSSANNCTFEYVSQPFLMDLEGKQGNFKNLREFVFKNIDGYFKIYSKHTPINLVRDLPQGFSALEPLVDLPIGINITRFQTLLALHRSYLTPGDSSSGWTAGAAAYYVGYLQPRTFLLKYNENGTITDAVDCALDPLSETKCTLKSFTVEKGIYQTSNFRVQPTESIVRFPNITNLCPFGEVFNATRFASVYAWNRKRISNCVADYSFLYNSASFSTFKCYGVSPTKLNDLCFTNVYADSFVIRGDEVRQIAPGQTGKIADYNYKLPDDFTGCVIAWNSNNLDSKVGGNYNYLYRLFRKSNLKPFERDISTEIYQAGSTPCNGVEGFNCYFPLQSYGFQPTNGVGYQPYRVVVLSFELLHAPATVCGPKKSTNLVKNKCVNFNFNGLTGTGVLTESNKKFLPFQQFGRDIADTTDAVRDPQTLEILDITPCSFGGVSVITPGTNTSNQVAVLYQDVNCTEVPVAIHADQLTPTWRVYSTGSNVFQTRAGCLIGAEHVNNSYECDIPIGAGICASYQTQTNSPRRARSVASQSIIAYTMSLGAENSVAYSNNSIAIPTNFTISVTTEILPVSMTKTSVDCTMYICGDSTECSNLLLQYGSFCTQLNRALTGIAVEQDKNTQEVFAQVKQIYKTPPIKDFGGFNFSQILPDPSKPSKRSFIEDLLFNKVTLADAGFIKQYGDCLGDIAARDLICAQKFNGLTVLPPLLTDEMIAQYTSALLAGTITSGWTFGAGAALQIPFAMQMAYRFNGIGVTQNVLYENQKLIANQFNSAIGKIQDSLSSTASALGKLQDVVNQNAQALNTLVKQLSSNFGAISSVLNDILSRLDKVEAEVQIDRLITGRLQSLQTYVTQQLIRAAEIRASANLAATKMSECVLGQSKRVDFCGKGYHLMSFPQSAPHGVVFLHVTYVPAQEKNFTTAPAICHDGKAHFPREGVFVSNGTHWFVTQRNFYEPQIITTDNTFVSGNCDVVIGIVNNTVYDPLQPELDSFKEELDKYFKNHTSPDVDLGDISGINASVVNIQKEIDRLNEVAKNLNESLIDLQELGKYEQYIKWPWYIWLGFIAGLIAIVMVTIMLCCMTSCCSCLKGCCSCGSCCKFDEDDSEPVLKGVKLHYT
서열번호: 15
>hCoV-19/Austria/CeMM0360/2020|EPI_ISL_438123|2020-04-05
NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNACCAACCAACTTTCGATCTCTTGTAGATCTGTTCTCTAAACGAACTTTAAAATCTGTGTGGCTGTCACTCGGCTGCATGCTTAGTGCACTCACGCAGTATAATTAATAACTAATTACTGTCGTTGACAGGACACGAGTAACTCGTCTATCTTCTGCAGGCTGCTTACGGTTTCGTCCGTGTTGCAGCCGATCATCAGCACATCTAGGTTTTGTCCGGGTGTGACCGAAAGGTAAGATGGAGAGCCTTGTCCCTGGTTTCAACGAGAAAACACACGTCCAACTCAGTTTGCCTGTTTTACAGGTTCGCGACGTGCTCGTACGTGGCTTTGGAGACTCCGTGGAGGAGGTCTTATCAGAGGCACGTCAACATCTTAAAGATGGCACTTGTGGCTTAGTAGAAGTTGAAAAAGGCGTTTTGCCTCAACTTGAACAGCCCTATGTGTTCATCAAACGTTCGGATGCTCGAACTGCACCTCATGGTCATGTTATGGTTGAGCTGGTAGCAGAACTCGAAGGCATTCAGTACGGTCGTAGTGGTGAGACACTTGGTGTCCTTGTCCCTCATGTGGGCGAAATACCAGTGGCTTACCGCAAGGTTCTTCTTCGTAAGAACGGTAATAAAGGAGCTGGTGGCCATAGTTACGGCGCCGATCTAAAGTCATTTGACTTAGGCGACGAGCTTGGCACTGATCCTTATGAAGATTTTCAAGAAAACTGGAACACTAAACATAGCAGTGGTGTTACCCGTGAACTCATGCGTGAGCTTAACGGAGGGGCATACACTCGCTATGTCGATAACAACTTCTGTGGCCCTGATGGCTACCCTCTTGAGTGCATTAAAGACCTTCTAGCACGTGCTGGTAAAGCTTCATGCACTTTGTCCGAACAACTGGACTTTATTGACACTAAGAGGGGTGTATACTGCTGCCGTGAACATGAGCATGAAATTGCTTGGTACACGGAACGTTCTGAAAAGAGCTATGAATTGCAGACACCTTTTGAAATTAAATTGGCAAAGAAATTTGACACCTTCAATGGGGAATGTCCAAATTTTGTATTTCCCTTAAATTCCATAATCAAGACTATTCAACCAAGGGTTGAAAAGAAAAAGCTTGATGGCTTTATGGGTAGAATTCGATCTGTCTATCCAGTTGCGTCACCAAATGAATGCAACCAAATGTGCCTTTCAACTCTCATGAAGTGTGATCATTGTGGTGAAACTTCATGGCAGACGGGCGATTTTGTTAAAGCCACTTGCGAATTTTGTGGCACTGAGAATTTGACTAAAGAAGGTGCCACTACTTGTGGTTACTTACCCCAAAATGCTGTTGTTAAAATTTATTGTCCAGCATGTCACAATTCAGAAGTAGGACCTGAGCATAGTCTTGCCGAATACCATAATGAATCTGGCTTGAAAACCATTCTTCGTAAGGGTGGTCGCACTATTGCCTTTGGAGGCTGTGTGTTCTCTTATGTTGGTTGCCATAACAAGTGTGCCTATTGGGTTCCACGTGCTAGCGCTAACATAGGTTGTAACCATACAGGTGTTGTTGGAGAAGGTTCCGAAGGTCTTAATGACAACCTTCTTGAAATACTCCAAAAAGAGAAAGTCAACATCAATATTGTTGGTGACTTTAAACTTAATGAAGAGATCGCCATTATTTTGGCATCTTTTTCTGCTTCCACAAGTGCTTTTGTGGAAACTGTGAAAGGTTTGGATTATAAAGCATTCAAACAAATTGTTGAATCCTGTGGTAATTTTAAAGTTACAAAAGGAAAAGCTAAAAAAGGTGCCTGGAATATTGGTGAACAGAAATCAATACTGAGTCCTCTTTATGCATTTGCATCAGAGGCTGCTCGTGTTGTACGATCAATTTTCTCCCGCACTCTTGAAACTGCTCAAAATTCTGTGCGTGTTTTACAGAAGGCCGCTATAACAATACTAGATGGAATTTCACAGTATTCACTGAGACTCATTGATGCTATGATGTTCACATCTGATTTGGCTACTAACAATCTAGTTGTAATGGCCTACATTACAGGTGGTGTTGTTCAGTTGACTTCGCAGTGGCTAACTAACATCTTTGGCACTGTTTATGAAAAACTCAAACCCGTCCTTGATTGGCTTGAAGAGAAGTTTAAGGAAGGTGTAGAGTTTCTTAGAGACGGTTGGGAAATTGTTAAATTTATCTCAACCTGTGCTTGTGAAATTGTCGGTGGACAAATTGTCACCTGTGCAAAGGAAATTAAGGAGAGTGTTCAGACATTCTTTAAGCTTGTAAATAAATTTTTGGCTTTGTGTGCTGACTCTATCATTATTGGTGGAGCTAAACTTAAAGCCTTGAATTTAGGTGAAACATTTGTCACGCACTCAAAGGGATTGTACAGAAAGTGTGTTAAATCCAGAGAAGAAACTGGCCTACTCATGCCTCTAAAAGCCCCAAAAGAAATTATCTTCTTAGAGGGAGAAACACTTCCCACAGAAGTGTTAACAGAGGAAGTTGTCTTGAAAACTGGTGATTTACAACCATTAGAACAACCTACTAGTGAAGCTGTTGAAGCTCCATTGGTTGGTACACCAGTTTGTATTAACGGGCTTATGTTGCTCGAAATCAAAGACACAGAAAAGTACTGTGCCCTTGCACCTAATATGATGGTAACAAACAATACCTTCACACTCAAAGGCGGTGCACCAACAAAGGTTACTTTTGGTGATGACACTGTGATAGAAGTGCAAGGTTACAAGAGTGTGAATATCACTTTTGAACTTGATGAAAGGATTGATAAAGTACTTAATGAGAAGTGCTCTGCCTATACAGTTGAACTCGGTACAGAAGTAAATGAGTTCGCCTGTGTTGTGGCAGATGCTGTCATAAAAACTTTGCAACCAGTATCTGAATTACTTACACCACTGGGCATTGATTTAGATGAGTGGAGTATGGCTACATACTACTTATTTGATGAGTCTGGTGAGTTTAAATTGGCTTCACATATGTATTGTTCTTTTTACCCTCCAGATGAGGATGAAGAAGAAGGTGATTGTGAAGAAGAAGAGTTTGAGCCATCAACTCAATATGAGTATGGTACTGAAGATGATTACCAAGGTAAACCTTTGGAATTTGGTGCCACTTCTGCTGCTCTTCAACCTGAAGAAGAGCAAGAAGAAGATTGGTTAGATGATGATAGTCAACAAACTGTTGGTCAACAAGACGGCAGTGAGGACAATCAGACAACTACTATTCAAACAATTGTTGAGGTTCAACCTCAATTAGAGATGGAACTTACACCAGTTGTTCAGACTATTGAAGTGAATAGTTTTAGTGGTTATTTAAAACTTACTGACAATGTATACATTAAAAATGCAGACATTGTGGAAGAAGCTAAAAAGGTAAAACCAACAGTGGTTGTTAATGCAGCCAATGTTTACCTTAAACATGGAGGAGGTGTTGCAGGAGCCTTAAATAAGGCTACTAACAATGCCATGCAAGTTGAATCTGATGATTACATAGCTACTAATGGACCACTTAAAGTGGGTGGTAGTTGTGTTTTAAGCGGACACAATCTTGCTAAACACTGTCTTCATGTTGTCGGCCCAAATGTTAACAAAGGTGAAGACATTCAACTTCTTAAGAGTGCTTATGAAAATTTTAATCAGCACGAAGTTCTACTTGCACCATTATTATCAGCTGGTATTTTTGGTGCTGACCCTATACATTCTTTAAGAGTTTGTGTAGATACTGTTCGCACAAATGTCTACTTAGCTGTCTTTGATAAAAATCTCTATGACAAACTTGTTTCAAGCTTTTTGGAAATGAAGAGTGAAAAGCAAGTTGAACAAAAGATCGCTGAGATTCCTAAAGAGGAAGTTAAGCCATTTATAACTGAAAGTAAACCTTCAGTTGAACAGAGAAAACAAGATGATAAGAAAATCAAAGCTTGTGTTGAAGAAGTTACAACAACTCTGGAAGAAACTAAGTTCCTCACAGAAAACTTGTTACTTTATATTGACATTAATGGCAATCTTCATCCAGATTCTGCCACTCTTGTTAGTGACATTGACATCACTTTCTTAAAGAAAGATGCTCCATATATAGTGGGTGATGTTGTTCAAGAGGGTGTTTTAACTGCTGTGGTTATACCTACTAAAAAGGCTGGTGGCACTACTGAAATGCTAGCGAAAGCTTTGAGAAAAGTGCCAACAGACAATTATATAACCACTTACCCGGGTCAGGGTTTAAATGGTTACACTGTAGAGGAGGCAAAGACAGTGCTTAAAAAGTGTAAAAGTGCCTTTTACATTCTACCATCTATTATCTCTAATGAGAAGCAAGAAATTCTTGGAACTGTTTCTTGGAATTTGCGAGAAATGCTTGCACATGCAGAAGAAACACGCAAATTAATGCCTGTCTGTGTGGAAACTAAAGCCATAGTTTCAACTATACAGCGTAAATATAAGGGTATTAAAATACAAGAGGGTGTGGTTGATTATGGTGCTAGATTTTACTTTTACACCAGTAAAACAACTGTAGCGTCACTTATCAACACACTTAACGATCTAAATGAAACTCTTGTTACAATGCCACTTGGCTATGTAACACATGGCTTAAATTTGGAAGAAGCTGCTCGGTATATGAGATCTCTCAAAGTGCCAGCTACAGTTTCTGTTTCTTCACCTGATGCTGTTACAGCGTATAATGGTTATCTTACTTCTTCTTCTAAAACACCTGAAGAACATTTTATTGAAACCATCTCACTTGCTGGTTCCTATAAAGATTGGTCCTATTCTGGACAATCTACACAACTAGGTATAGAATTTCTTAAGAGAGGTGATAAAAGTGTATATTACACTAGTAATCCTACCACATTCCACCTAGATGGTGAAGTTATCACCTTTGACAATCTTAAGACACTTCTTTCTTTGAGAGAAGTGAGGACTATTAAGGTGTTTACAACAGTAGACAACATTAACCTCCACACGCAAGTTGTGGACATGTCAATGACATATGGACAACAGTTTGGTCCAACTTATTTGGATGGAGCTGATGTTACTAAAATAAAACCTCATAATTCACATGAAGGTAAAACATTTTATGTTTTACCTAATGATGACACTCTACGTGTTGAGGCTTTTGAGTACTACCACACAACTGATCCTAGTTTTCTGGGTAGGTACATGTCAGCATTAAATCACACTAAAAAGTGGAAATACCCACAAGTTAATGGTTTAACTTCTATTAAATGGGCAGATAACAACTGTTATCTTGCCACTGCATTGTTAACACTCCAACAAATAGAGTTGAAGTTTAATCCACCTGCTCTACAAGATGCTTATTACAGAGCAAGGGCTGGTGAAGCTGCTAACTTTTGTGCACTTATCTTAGCCTACTGTAATAAGACAGTAGGTGAGTTAGGTGATGTTAGAGAAACAATGAGTTACTTGTTTCAACATGCCAATTTAGATTCTTGCAAAAGAGTCTTGAACGTGGTGTGTAAAACTTGTGGACAACAGCAGACAACCCTTAAGGGTGTAGAAGCTGTTATGTACATGGGCACACTTTCTTATGAACAATTTAAGAAAGGTGTTCAGATACCTTGTACGTGTGGTAAACAAGCTACAAAATATCTAGTACAACAGGAGTCACCTTTTGTTATGATGTCAGCACCACCTGCTCAGTATGAACTTAAGCATGGTACATTTACTTGTGCTAGTGAGTACACTGGTAATTACCAGTGTGGTCACTATAAACATATAACTTCTAAAGAAACTTTGTATTGCATAGACGGTGCTTTACTTACAAAGTCCTCAGAATACAAAGGTCCTATTACGGATGTTTTCTACAAAGAAAACAGTTACACAACAACCATAAAACCAGTTACTTATAAATTGGATGGTGTTGTTTGTACAGAAATTGACCCTAAGTTGGACAATTATTATAAGAAAGACAATTCTTATTTCACAGAGCAACCAATTGATCTTGTACCAAACCAACCATATCCAAACGCAAGCTTCGATAATTTTAAGTTTGTATGTGATAATATCAAATTTGCTGATGATTTAAACCAGTTAACTGGTTATAAGAAACCTGCTTCAAGAGAGCTTAAAGTTACATTTTTCCCTGACTTAAATGGTGATGTGGTGGCTATTGATTATAAACACTACACACCCTCTTTTAAGAAAGGAGCTAAATTGTTACATAAACCTATTGTTTGGCATGTTAACAATGCAACTAATAAAGCCACGTATAAACCAAATACCTGGTGTATACGTTGTCTTTGGAGCACAAAACCAGTTGAAACATCAAATTCGTTTGATGTACTGAAGTCAGAGGACGCGCAGGGAATGGATAATCTTGCCTGCGAAGATCTAAAACCAGTCTCTGAAGAAGTAGTGGAAAATCCTACCATACAGAAAGACGTTCTTGAGTGTAATGTGAAAACTACCGAAGTTGTAGGAGACATTATACTTAAACCAGCAAATAATAGTTTAAAAATTACAGAAGAGGTTGGCCACACAGATCTAATGGCTGCTTATGTAGACAATTCTAGTCTTACTATTAAGAAACCTAATGAATTATCTAGAGTATTAGGTTTGAAAACCCTTGCTACTCATGGTTTAGCTGCTGTTAATAGTGTCCCTTGGGATACTATAGCTAATTATGCTAAGCCTTTTCTTAACAAAGTTGTTAGTACAACTACTAACATAGTTACACGGTGTTTAAACCGTGTTTGTACTAATTATATGCCTTATTTCTTTACTTTATTGCTACAATTGTGTACTTTTACTAGAAGTACAAATTCTAGAATTAAAGCATCTATGCCGACTACTATAGCAAAGAATACTGTTAAGAGTGTCGGTAAATTTTGTCTAGAGGCTTCATTTAATTATTTGAAGTCACCTAATTTTTCTAAACTGATAAATATTATAATTTGGTTTTTACTATTAAGTGTTTGCCTAGGTTCTTTAATCTACTCAACCGCTGCTTTAGGTGTTTTAATGTCTAATTTAGGCATGCCTTCTTACTGTACTGGTTACAGAGAAGGCTATTTGAACTCTACTAATGTCACTATTGCAACCTACTGTACTGGTTCTATACCTTGTAGTGTTTGTCTTAGTGGTTTAGATTCTTTAGACACCTATCCTTCTTTAGAAACTATACAAATTACCATTTCATCTTTTAAATGGGATTTAACTGCTTTTGGCTTAGTTGCAGAGTGGTTTTTGGCATATATTCTTTTCACTAGGTTTTTCTATGTACTTGGATTGGCTGCAATCATGCAATTGTTTTTCAGCTATTTTGCAGTACATTTTATTAGTAATTCTTGGCTTATGTGGTTAATAATTAATCTTGTACAAATGGCCCCGATTTCAGCTATGGTTAGAATGTACATCTTCTTTGCATCATTTTATTATGTATGGAAAAGTTATGTGCATGTTGTAGACGGTTGTAATTCATCAACTTGTATGATGTGTTACAAACGTAATAGAGCAACAAGAGTCGAATGTACAACTATTGTTAATGGTGTTAGAAGGTCCTTTTATGTCTATGCTAATGGAGGTAAAGGCTTTTGCAAACTACACAATTGGAATTGTGTTAATTGTGATACATTCTGTGCTGGTAGTACATTTATTAGTGATGAAGTTGCGAGAGACTTGTCACTACAGTTTAAAAGACCAATAAATCCTACTGACCAGTCTTCTTACATCGTTGATAGTGTTACAGTGAAGAATGGTTCCATCCATCTTTACTTTGATAAAGCTGGTCAAAAGACTTATGAAAGACATTCTCTCTCTCATTTTGTTAACTTAGACAACCTGAGAGCTAATAACACTAAAGGTTCATTGCCTATTAATGTTATAGTTTTTGATGGTAAATCAAAATGTGAAGAATCATCTGCAAAATCAGCGTCTGTTTACTACAGTCAGCTTATGTGTCAACCTATACTGTTACTAGATCAGGCATTAGTGTCTGATGTTGGTGATAGTGCGGAAGTTGCAGTTAAAATGTTTGATGCTTACGTTAATACGTTTTCATCAACTTTTAACGTACCAATGGAAAAACTCAAAACACTAGTTGCAACTGCAGAAGCTGAACTTGCAAAGAATGTGTCCTTAGACAATGTCTTATCTACTTTTATTTCAGCAGCTCGGCAAGGGTTTGTTGATTCAGATGTAGAAACTAAAGATGTTGTTGAATGTCTTAAATTGTCACATCAATCTGACATAGAAGTTACTGGCGATAGTTGTAATAACTATATGCTCACCTATAACAAAGTTGAAAACATGACACCCCGTGACCTTGGTGCTTGTATTGACTGTAGTGCGCGTCATATTAATGCGCAGGTAGCAAAAAGTCACAACATTGCTTTGATATGGAACGTTAAAGATTTCATGTCATTGTCTGAACAACTACGAAAACAAATACGTAGTGCTGCTAAAAAGAATAACTTACCTTTTAAGTTGACATGTGCAACTACTAGACAAGTTGTTAATGTTGTAACAACAAAGATAGCACTTAAGGGTGGTAAAATTGTTAATAATTGGTTGAAGCAGTTAATTAAAGTTACACTTGTGTTCCTTTTTGTTGCTGCTATTTTCTATTTAATAACACCTGTTCATGTCATGTCTAAACATACTGACTTTTCAAGTGAAATCATAGGATACAAGGCTATTGATGGTGGTGTCACTCGTGACATAGCATCTACAGATACTTGTTTTGCTAACAAACATGCTGATTTTGACACATGGTTTAGCCAGCGTGGTGGTAGTTATACTAATGACAAAGCTTGCCCATTGATTGCTGCAGTCATAACAAGAGAAGTGGGTTTTGTCGTGCCTGGTTTGCCTGGCACGATATTACGCACAACTAATGGTGACTTTTTGCATTTCTTACCTAGAGTTTTTAGTGCAGTTGGTAACATCTGTTACACACCATCAAAACTTATAGAGTACACTGACTTTGCAACATCAGCTTGTGTTTTGGCTGCTGAATGTACAATTTTTAAAGATGCTTCTGGTAAGCCAGTACCATATTGTTATGATACCAATGTACTAGAAGGTTCTGTTGCTTATGAAAGTTTACGCCCTGACACACGTTATGTGCTCATGGATGGCTCTATTATTCAATTTCCTAACACCTACCTTGAAGGTTCTGTTAGAGTGGTAACAACTTTTGATTCTGAGTACTGTAGGCACGGCACTTGTGAAAGATCAGAAGCTGGTGTTTGTGTATCTACTAGTGGTAGATGGGTACTTAACAATGATTATTACAGATCTTTACCAGGAGTTTTCTGTGGTGTAGATGCTGTAAATTTACTTACTAATATGTTTACACCACTAATTCAACCTATTGGTGCTTTGGACATATCAGCATCTATAGTAGCTGGTGGTATTGTAGCTATCGTAGTAACATGCCTTGCCTACTATTTTATGAGGTTTAGAAGAGCTTTTGGTGAATACAGTCATGTAGTTGCCTTTAATACTTTACTATTCCTTATGTCATTCACTGTACTCTGTTTAACACCAGTTTACTCATTCTTACCTGGTGTTTATTCTGTTATTTACTTGTACTTGACATTTTATCTTACTAATGATGTTTCTTTTTTAGCACATATTCAGTGGATGGTTATGTTCACACCTTTAGTACCTTTCTGGATAACAATTGCTTATATCATTTGTATTTCCACAAAGCATTTCTATTGGTTCTTTAGTAATTACCTAAAGAGACGTGTAGTCTTTAATGGTGTTTCCTTTAGTACTTTTGAAGAAGCTGCGCTGTGCACCTTTTTGTTAAATAAAGAAATGTATCTAAAGTTGCGTAGTGATGTGCTATTACCTCTTACGCAATATAATAGATACTTAGCTCTTTATAATAAGTACAAGTATTTTAGTGGAGCAATGGATACAACTAGCTACAGAGAAGCTGCTTGTTGTCATCTCGCAAAGGCTCTCAATGACTTCAGTAACTCAGGTTCTGATGTTCTTTACCAACCACCACAAACCTCTATCACCTCAGCTGTTTTGCAGAGTGGTTTTAGAAAAATGGCATTCCCATCTGGTAAAGTTGAGGGTTGTATGGTACAAGTAACTTGTGGTACAACTACACTTAACGGTCTTTGGCTTGATGACGTAGTTTACTGTCCAAGACATGTGATCTGCACCTCTGAAGACATGCTTAACCCTAATTATGAAGATTTACTCATTCGTAAGTCTAATCATAATTTCTTGGTACAGGCTGGTAATGTTCAACTCAGGGTTATTGGACATTCTATGCAAAATTGTGTACTTAAGCTTAAGGTTGATACAGCCAATCCTAAGACACCTAAGTATAAGTTTGTTCGCATTCAACCAGGACAGACTTTTTCAGTGTTAGCTTGTTACAATGGTTCACCATCTGGTGTTTACCAATGTGCTATGAGGCCCAATTTCACTATTAAGGGTTCATTCCTTAATGGTTCATGTGGTAGTGTTGGTTTTAACATAGATTATGACTGTGTCTCTTTTTGTTACATGCACCATATGGAATTACCAACTGGAGTTCATGCTGGCACAGACTTAGAAGGTAACTTTTATGGACCTTTTGTTGACAGGCAAACAGCACAAGCAGCTGGTACGGACACAACTATTACAGTTAATGTTTTAGCTTGGTTGTACGCTGCTGTTATAAATGGAGACAGGTGGTTTCTCAATCGATTTACCACAACTCTTAATGACTTTAACCTTGTGGCTATGAAGTACAATTATGAACCTCTAACACAAGACCATGTTGACATACTAGGACCTCTTTCTGCTCAAACTGGAATTGCCGTTTTAGATATGTGTGCTTCATTAAAAGAATTACTGCAAAATGGTATGAATGGACGTACCATATTGGGTAGTGCTTTATTAGAAGATGAATTTACACCTTTTGATGTTGTTAGACAATGCTCAGGTGTTACTTTCCAAAGTGCAGTGAAAAGAACAATCAAGGGTACACACCACTGGTTGTTACTCACAATTTTGACTTCACTTTTAGTTTTAGTCCAGAGTACTCAATGGTCTTTGTTCTTTTTTTTGTATGAAAATGCCTTTTTACCTTTTGCTATGGGTATTATTGCTATGTCTGCTTTTGCAATGATGTTTGTCAAACATAAGCATGCATTTCTCTGTTTGTTTTTGTTACCTTCTCTTGCCACTGTAGCTTATTTTAATATGGTCTATATGCCTGCTAGTTGGGTGATGCGTATTATGACATGGTTGGATATGGTTGATACTAGTTTGTCTGGTTTTAAGCTAAAAGACTGTGTTATGTATGCATCAGCTGTAGTGTTACTAATCCTTATGACAGCAAGAACTGTGTATGATGATGGTGCTAGGAGAGTGTGGACACTTATGAATGTCTTGACACTCGTTTATAAAGTTTATTATGGTAATGCTTTAGATCAAGCCATTTCCATGTGGGCTCTTATAATCTCTGTTACTTCTAACTACTCAGGTGTAGTTACAACTGTCATGTTTTTGGCCAGAGGTATTGTTTTTATGTGTGTTGAGTATTGCCCTATTTTCTTCATAACTGGTAATACACTTCAGTGTATAATGCTAGTTTATTGTTTCTTAGGCTATTTTTGTACTTGTTACTTTGGCCTCTTTTGTTTACTCAACCGCTACTTTAGACTGACTCTTGGTGTTTATGATTACTTAGTTTCTACACAGGAGTTTAGATATATGAATTCACAGGGACTACTCCCACCCAAGAATAGCATAGATGCCTTCAAACTCAACATTAAATTGTTGGGTGTTGGTGGCAAACCTTGTATCAAAGTAGCCACTGTACAGTCTAAAATGTCAGATGTAAAGTGCACATCAGTAGTCTTACTCTCAGTTTTGCAACAACTCAGAGTAGAATCATCATCTAAATTGTGGGCTCAATGTGTCCAGTTACACAATGACATTCTCTTAGCTAAAGATACTACTGAAGCCTTTGAAAAAATGGTTTCACTACTTTCTGTTTTGCTTTCCATGCAGGGTGCTGTAGACATAAACAAGCTTTGTGAAGAAATGCTGGACAACAGGGCAACCTTACAAGCTATAGCCTCAGAGTTTAGTTCCCTTCCATCATATGCAGCTTTTGCTACTGCTCAAGAAGCTTATGAGCAGGCTGTTGCTAATGGTGATTCTGAAGTTGTTCTTAAAAAGTTGAAGAAGTCTTTGAATGTGGCTAAATCTGAATTTGACCGTGATGCAGCCATGCAACGTAAGTTGGAAAAGATGGCTGATCAAGCTATGACCCAAATGTATAAACAGGCTAGATCTGAGGACAAGAGGGCAAAAGTTACTAGTGCTATGCAGACAATGCTTTTCACTATGCTTAGAAAGTTGGATAATGATGCACTCAACAACATTATCAACAATGCAAGAGATGGTTGTGTTCCCTTGAACATAATACCTCTTACAACAGCAGCCAAACTAATGGTTGTCATACCAGACTATAACACATATAAAAATACGTGTGATGGTACAACATTTACTTATGCATCAGCATTGTGGGAAATCCAACAGGTTGTAGATGCAGATAGTAAAATTGTTCAACTTAGTGAAATTAGTATGGACAATTCACCTAATTTAGCATGGCCTCTTATTGTAACAGCTTTAAGGGCCAATTCTGCTGTCAAATTACAGAATAATGAGCTTAGTCCTGTTGCACTACGACAGATGTCTTGTGCTGCCGGTACTACACAAACTGCTTGCACTGATGACAATGCGTTAGCTTACTACAACACAACAAAGGGAGGTAGGTTTGTACTTGCACTGTTATCCGATTTACAGGATTTGAAATGGGCTAGATTCCCTAAGAGTGATGGAACTGGTACTATCTATACAGAACTGGAACCACCTTGTAGGTTTGTTACAGACACACCTAAAGGTCCTAAAGTGAAGTATTTATACTTTATTAAAGGATTAAACAACCTAAATAGAGGTATGGTACTTGGTAGTTTAGCTGCCACAGTACGTCTACAAGCTGGTAATGCAACAGAAGTGCCTGCCAATTCAACTGTATTATCTTTCTGTGCTTTTGCTGTAGATGCTGCTAAAGCTTACAAAGATTATCTAGCTAGTGGGGGACAACCAATCACTAATTGTGTTAAGATGTTGTGTACACACACTGGTACTGGTCAGGCAATAACAGTTACACCGGAAGCCAATATGGATCAAGAATCCTTTGGTGGTGCATCGTGTTGTCTGTACTGCCGTTGCCACATAGATCATCCAAATCCTAAAGGATTTTGTGACTTAAAAGGTAAGTATGTACAAATACCTACAACTTGTGCTAATGACCCTGTGGGTTTTACACTTAAAAACACAGTCTGTACCGTCTGCGGTATGTGGAAAGGTTATGGCTGTAGTTGTGATCAACTCCGCGAACCCATGCTTCAGTCAGCTGATGCACAATCGTTTTTAAACGGGTTTGCGGTGTAAGTGCAGCCCGTCTTACACCGTGCGGCACAGGCACTAGTACTGATGTCGTATACAGGGCTTTTGACATCTACAATGATAAAGTAGCTGGTTTTGCTAAATTCCTAAAAACTAATTGTTGTCGCTTCCAAGAAAAGGACGAAGATGACAATTTAATTGATTCTTACTTTGTAGTTAAGAGACACACTTTCTCTAACTACCAACATGAAGAAACAATTTATAATTTACTTAAGGATTGTCCAGCTGTTGCTAAACATGACTTCTTTAAGTTTAGAATAGACGGTGACATGGTACCACATATATCACGTCAACGTCTTACTAAATACACAATGGCAGACCTCGTCTATGCTTTAAGGCATTTTGATGAAGGTAATTGTGACACATTAAAAGAAATACTTGTCACATACAATTGTTGTGATGATGATTATTTCAATAAAAAGGACTGGTATGATTTTGTAGAAAACCCAGATATATTACGCGTATACGCCAACTTAGGTGAACGTGTACGCCAAGCTTTGTTAAAAACAGTACAATTCTGTGATGCCATGCGAAATGCTGGTATTGTTGGTGTACTGACATTAGATAATCAAGATCTCAATGGTAACTGGTATGATTTCGGTGATTTCATACAAACCACGCCAGGTAGTGGAGTTCCTGTTGTAGATTCTTATTATTCATTGTTAATGCCTATATTAACCTTGACCAGGGCTTTAACTGCAGAGTCACATGTTGACACTGACTTAACAAAGCCTTACATTAAGTGGGATTTGTTAAAATATGACTTCACGGAAGAGAGGTTAAAACTCTTTGACCGTTATTTTAAATATTGGGATCAGACATACCACCCAAATTGTGTTAACTGTTTGGATGACAGATGCATTCTGCATTGTGCAAACTTTAATGTTTTATTCTCTACAGTGTTCCCACTTACAAGTTTTGGACCACTAGTGAGAAAAATATTTGTTGATGGTGTTCCATTTGTAGTTTCAACTGGATACCACTTCAGAGAGCTAGGTGTTGTACATAATCAGGATGTAAACTTACATAGCTCTAGACTTAGTTTTAAGGAATTACTTGTGTATGCTGCTGACCCTGCTATGCACGCTGCTTCTGGTAATCTATTACTAGATAAACGCACTACGTGCTTTTCAGTAGCTGCACTTACTAACAATGTTGCTTTTCAAACTGTCAAACCCGGTAATTTTAACAAAGACTTCTATGACTTTGCTGTGTCTAAGGGTTTCTTTAAGGAAGGAAGTTCTGTTGAATTAAAACACTTCTTCTTTGCTCAGGATGGTAATGCTGCTATCAGCGATTATGACTACTATCGTTATAATCTACCAACAATGTGTGATATCAGACAACTACTATTTGTAGTTGAAGTTGTTGATAAGTACTTTGATTGTTACGATGGTGGCTGTATTAATGCTAACCAAGTCATCGTCAACAACCTAGACAAATCAGCTGGTTTTCCATTTAATAAATGGGGTAAGGCTAGACTTTATTATGATTCAATGAGTTATGAGGATCAAGATGCACTTTTCGCATATACAAAACGTAATGTCATCCCTACTATAACTCAAATGAATCTTAAGTATGCCATTAGTGCAAAGAATAGAGCTCGCACCGTAGCTGGTGTCTCTATCTGTAGTACTATGACCAATAGACAGTTTCATCAAAAATTATTGAAATCAATAGCCGCCACTAGAGGAGCTACTGTAGTAATTGGAACAAGCAAATTCTATGGTGGTTGGCACAACATGTTAAAAACTGTTTATAGTGATGTAGAAAACCCTCACCTTATGGGTTGGGATTATCCTAAATGTGATAGAGCCATGCCTAACATGCTTAGAATTATGGCCTCACTTGTTCTTGCTCGCAAACATACAACGTGTTGTAGCTTGTCACACCGTTTCTATAGATTAGCTAATGAGTGTGCTCAAGTATTGAGTGAAATGGTCATGTGTGGCGGTTCACTATATGTTAAACCAGGTGGAACCTCATCAGGAGATGCCACAACTGCTTATGCTAATAGTGTTTTTAACATTTGTCAAGCTGTCACGGCCAATGTTAATGCACTTTTATCTACTGATGGTAACAAAATTGCCGATAAGTATGTCCGCAATTTACAACACAGACTTTATGAGTGTCTCTATAGAAATAGAGATGTTGACACAGACTTTGTGAATGAGTTTTACGCATATTTGCGTAAACATTTCTCAATGATGATACTCTCTGACGATGCTGTTGTGTGTTTCAATAGCACTTATGCATCTCAAGGTCTAGTGGCTAGCATAAAGAACTTTAAGTCAGTTCTTTATTATCAAAACAATGTTTTTATGTCTGAAGCAAAATGTTGGACTGAGACTGACCTTACTAAAGGACCTCATGAATTTTGCTCTCAACATACAATGCTAGTTAAACAGGGTGATGATTATGTGTACCTTCCTTACCCAGATCCATCAAGAATCCTAGGGGCCGGCTGTTTTGTAGATGATATCGTAAAAACAGATGGTACACTTATGATTGAACGGTTCGTGTCTTTAGCTATAGATGCTTACCCACTTACTAAACATCCTAATCAGGAGTATGCTGATGTCTTTCATTTGTACTTACAATACATAAGAAAGCTACATGATGAGTTAACAGGACACATGTTAGACATGTATTCTGTTATGCTTACTAATGATAACACTTCAAGGTATTGGGAACCTGAGTTTTATGAGGCTATGTACACACCGCATACAGTCTTACAGGCTGTTGGGGCTTGTGTTCTTTGCAATTCACAGACTTCATTAAGATGTGGTGCTTGCATACGTAGACCATTCTTATGTTGTAAATGCTGTTACGACCATGTCATATCAACATCACATAAATTAGTCTTGTCTGTTAATCCGTATGTTTGCAATGCTCCAGGTTGTGATGTCACAGATGTGACTCAACTTTACTTAGGAGGTATGAGCTATTATTGTAAATCACATAAACCACCCATTAGTTTTCCATTGTGTGCTAATGGACAAGTTTTTGGTTTATATAAAAATACATGTGTTGGTAGCGATAATGTTACTGACTTTAATGCAATTGCAACATGTGACTGGACAAATGCTGGTGATTACATTTTAGCTAACACCTGTACTGAAAGACTCAAGCTTTTTGCAGCAGAAACGCTCAAAGCTACTGAGGAGACATTTAAACTGTCTTATGGTATTGCTACTGTACGTGAAGTGCTGTCTGACAGAGAATTACATCTTTCATGGGAAGTTGGTAAACCTAGACCACCACTTAACCGAAATTATGTCTTTACTGGTTATCGTGTAACTAAAAACAGTAAAGTACAAATAGGAGAGTACACCTTTGAAAAAGGTGACTATGGTGATGCTGTTGTTTACCGAGGTACAACAACTTACAAATTAAATGTTGGTGATTATTTTGTGCTGACATCACATACAGTAATGCCATTAAGTGCACCTACACTAGTGCCACAAGAGCACTATGTTAGAATTACTGGCTTATACCCAACACTCAATATCTCAGATGAGTTTTCTAGCAATGTTGCAAATTATCAAAAGGTTGGTATGCAAAAGTATTCTACACTCCAGGGACCACCTGGTACTGGTAAGAGTCATTTTGCTATTGGCCTAGCTCTCTACTACCCTTCTGCTCGCATAGTGTATACAGCTTGCTCTCATGCCGCTGTTGATGCACTATGTGAGAAGGCATTAAAATATTTGCCTATAGATAAATGTAGTAGAATTATACCTGCACGTGCTCGTGTAGAGTGTTTTGATAAATTCAAAGTGAATTCAACATTAGAACAGTATGTCTTTTGTACTGTAAATGCATTGCCTGAGACGACAGCAGATATAGTTGTCTTTGATGAAATTTCAATGGCCACAAATTATGATTTGAGTGTTGTCAATGCCAGATTACGTGCTAAGCACTATGTGTACATTGGCGACCCTGCTCAATTACCTGCACCACGCACATTGCTAACTAAGGGCACACTAGAACCAGAATATTTCAATTCAGTGTGTAGACTTATGAAAACTATAGGTCCAGACATGTTCCTCGGAACTTGTCGGCGTTGTCCTGCTGAAATTGTTGACACTGTGAGTGCTTTGGTTTATGATAATAAGCTTAAAGCACATAAAGACAAATCAGCTCAATGCTTTAAAATGTTTTATAAGGGTGTTATCACGCATGATGTTTCATCTGCAATTAACAGGCCACAAATAGGCGTGGTAAGAGAATTCCTTACACGTAACCCTGCTTGGAGAAAAGCTGTCTTTATTTCACCTTATAATTCACAGAATGCTGTAGCCTCAAAGATTTTGGGACTACCAACTCAAACTGTTGATTCATCACAGGGCTCAGAATATGACTATGTCATATTCACTCAAACCACTGAAACAGCTCACTCTTGTAATGTAAACAGATTTAATGTTGCTATTACCAGAGCAAAAGTAGGCATACTTTGCATAATGTCTGATAGAGACCTTTATGACAAGTTGCAATTTACAAGTCTTGAAATTCCACGTAGGAATGTGGCAACTTTACAAGCTGAAAATGTAACAGGACTCTTTAAAGATTGTAGTAAGGTAATCACTGGGTTACATCCTACACAGGCACCTACACACCTCAGTGTTGACACTAAATTCAAAACTGAAGGTTTATGTGTTGACATACCTGGCATACCTAAGGACATGACCTATAGAAGACTCATCTCTATGATGGGTTTTAAAATGAATTATCAAGTTAATGGTTACCCTAACATGTTTATCACCCGCGAAGAAGCTATAAGACATGTACGTGCATGGATTGGCTTCGATGTCGAGGGGTGTCATGCTACTAGAGAAGCTGTTGGTACCAATTTACCTTTACAGCTAGGTTTTTCTACAGGTGTTAACCTAGTTGCTGTACCTACAGGTTATGTTGATACACCTAATAATACAGATTTTTCCAGAGTTAGTGCTAAACCACCGCCTGGAGATCAATTTAAACACCTCATACCACTTATGTACAAAGGACTTCCTTGGAATGTAGTGCGTATAAAGATTGTACAAATGTTAAGTGACACACTTAAAAATCTCTCTGACAGAGTCGTATTTGTCTTATGGGCACATGGCTTTGAGTTGACATCTATGAAGTATTTTGTGAAAATAGGACCTGAGCGCACCTGTTGTCTATGTGATAGACGTGCCACATGCTTTTCCACTGCTTCAGACACTTATGCCTGTTGGCATCATTCTATTGGATTTGATTACGTCTATAATCCGTTTATGATTGATGTTCAACAATGGGGTTTTACAGGTAACCTACAAAGCAACCATGATCTGTATTGTCAAGTCCATGGTAATGCACATGTAGCTAGTTGTGATGCAATCATGACTAGGTGTCTAGCTGTCCACGAGTGCTTTGTTAAGCGTGTTGACTGGACTATTGAATATCCTATAATTGGTGATGAACTGAAGATTAATGCGGCTTGTAGAAAGGTTCAACACATGGTTGTTAAAGCTGCATTATTAGCAGACAAATTCCCAGTTCTTCACGACATTGGTAACCCTAAAGCTATTAAGTGTGTACCTCAAGCTGATGTAGAATGGAAGTTCTATGATGCACAGCCTTGTAGTGACAAAGCTTATAAAATAGAAGAATTATTCTATTCTTATGCCACACATTCTGACAAATTCACAGATGGTGTATGCCTATTTTGGAATTGCAATGTCGATAGATATCCTGCTAATTCCATTGTTTGTAGATTTGACACTAGAGTGCTATCTAACCTTAACTTGCCTGGTTGTGATGGTGGCAGTTTGTATGTAAATAAACATGCATTCCACACACCAGCTTTTGATAAAAGTGCTTTTGTTAATTTAAAACAATTACCATTTTTCTATTACTCTGACAGTCCATGTGAGTCTCATGGAAAACAAGTAGTGTCAGATATAGATTATGTACCACTAAAGTCTGCTACGTGTATAACACGTTGCAATTTAGGTGGTGCTGTCTGTAGACATCATGCTAATGAGTACAGATTGTATCTCGATGCTTATAACATGATGATCTCAGCTGGCTTTAGCTTGTGGGTTTACAAACAATTTGATACTTATAACCTCTGGAACACTTTTACAAGACTTCAGAGTTTAGAAAATGTGGCTTTTAATGTTGTAAATAAGGGACACTTTGATGGACAACAGGGTGAAGTACCAGTTTCTATCATTAATAACACTGTTTACACAAAAGTTGATGGTGTTGATGTAGAATTGTTTGAAAATAAAACAACATTACCTGTTAATGTAGCATTTGAGCTTTGGGCTAAGCGCAACATTAAACCAGTACCAGAGGTGAAAATACTCAATAATTTGGGTGTGGACATTGCTGCTAATACTGTGATCTGGGACTACAAAAGAGATGCTCCAGCACATATATCTACTATTGGTGTTTGTTCTATGACTGACATAGCCAAGAAACCAACTGAAACGATTTGTGCACCACTCACTGTCTTTTTTGATGGTAGAGTTGATGGTCAAGTAGACTTATTTAGAAATGCCCGTAATGGTGTTCTTATTACAGAAGGTAGTGTTAAAGGTTTACAACCATCTGTAGGTCCCAAACAAGCTAGTCTTAATGGAGTCACATTAATTGGAGAAGCCGTAAAAACACAGTTCAATTATTATAAGAAAGTTGATGGTGTTGTCCAACAATTACCTGAAACTTACTTTACTCAGAGTAGAAATTTACAAGAATTTAAACCCAGGAGTCAAATGGAAATTGATTTCTTAGAATTAGCTATGGATGAATTCATTGAACGGTATAAATTAGAAGGCTATGCCTTCGAACATATCGTTTATGGAGATTTTAGTCATAGTCAGTTAGGTGGTTTACATCTACTGATTGGACTAGCTAAACGTTTTAAGGAATCACCTTTTGAATTAGAAGATTTTATTCCTATGGACAGTACAGTTAAAAACTATTTCATAACAGATGCGCAAACAGGTTCATCTAAGTGTGTGTGTTCTGTTATTGATTTATTACTTGATGATTTTGTTGAAATAATAAAATCCCAAGATTTATCTGTAGTTTCTAAGGTTGTCAAAGTGACTATTGACTATACAGAAATTTCATTTATGCTTTGGTGTAAAGATGGCCATGTAGAAACATTTTACCCAAAATTACAATCTAGTCAAGCGTGGCAACCGGGTGTTGCTATGCCTAATCTTTACAAAATGCAAAGAATGCTATTAGAAAAGTGTGACCTTCAAAATTATGGTGATAGTGCAACATTACCTAAAGGCATAATGATGAATGTCGCAAAATATACTCAACTGTGTCAATATTTAAACACATTAACATTAGCTGTACCCTATAATATGAGAGTTATACATTTTGGTGCTGGTTCTGATAAAGGAGTTGCACCAGGTACAGCTGTTTTAAGACAGTGGTTGCCTACGGGTACGCTGCTTGTCGATTCAGATCTTAATGACTTTGTCTCTGATGCAGATTCAACTTTGATTGGTGATTGTGCAACTGTACATACAGCTAATAAATGGGATCTCATTATTAGTGATATGTACGACCCTAAGACTAAAAATGTTACAAAAGAAAATGACTCTAAAGAGGGTTTTTTCACTTACATTTGTGGGTTTATACAACAAAAGCTAGCTCTTGGAGGTTCCGTGGCTATAAAGATAACAGAACATTCTTGGAATGCTGATCTTTATAAGCTCATGGGACACTTCGCATGGTGGACAGCCTTTGTTACTAATGTGAATGCGTCATCATCTGAAGCATTTTTAATTGGATGTAATTATCTTGGCAAACCACGCGAACAAATAGATGGTTATGTCATGCATGCAAATTACATATTTTGGAGGAATACAAATCCAATTCAGTTGTCTTCCTATTCTTTATTTGACATGAGTAAATTTCCCCTTAAATTAAGGGGTACTGCTGTTATGTCTTTAAAAGAAGGTCAAATCAATGATATGATTTTATCTCTTCTTAGTAAAGGTAGACTTATAATTAGAGAAAACAACAGAGTTGTTATTTCTAGTGATGTTCTTGTTAACAACTAAACGAACAATGTTTGTTTTTCTTGTTTTATTGCCACTAGTCTCTAGTCAGTGTGTTAATCTTACAACCAGAACTCAATTACCCCCTGCATACACTAATTCTTTCACACGTGGTGTTTATTACCCTGACAAAGTTTTCAGATCCTCAGTTTTACATTCAACTCAGGACTTGTTCTTACCTTTCTTTTCCAATGTTACTTGGTTCCATGCTATACATGTCTCTGGGACCAATGGTACTAAGAGGTTTGATAACCCTGTCCTACCATTTAATGATGGTGTTTATTTTGCTTCCACTGAGAAGTCTAACATAATAAGAGGCTGGATTTTTGGTACTACTTTAGATTCGAAGACCCAGTCCCTACTTATTGTTAATAACGCTACTAATGTTGTTATTAAAGTCTGTGAATTTCAATTTTGTAATGATCCATTTTTGGGTGTTTATTACCACAAAAACAACAAAAGTTGGATGGAAAGTGAGTTCAGAGTTTATTCTAGTGCGAATAATTGCACTTTTGAATATGTCTCTCAGCCTTTTCTTATGGACCTTGAAGGAAAACAGGGTAATTTCAAAAATCTTAGGGAATTTGTGTTTAAGAATATTGATGGTTATTTTAAAATATATTCTAAGCACACGCCTATTAATTTAGTGCGTGATCTCCCTCAGGGTTTTTCGGCTTTAGAACCATTGGTAGATTTGCCAATAGGTATTAACATCACTAGGTTTCAAACTTTACTTGCTTTACATAGAAGTTATTTGACTCCTGGTGATTCTTCTTCAGGTTGGACAGCTGGTGCTGCAGCTTATTATGTGGGTTATCTTCAACCTAGGACTTTTCTATTAAAATATAATGAAAATGGAACCATTACAGATGCTGTAGACTGTGCACTTGACCCTCTCTCAGAAACAAAGTGTACGTTGAAATCCTTCACTGTAGAAAAAGGAATCTATCAAACTTCTAACTTTAGAGKCCAACCAACAGAATCTATTGTTAGATTTCCTAATATTACAAACTTGTGCCCTTTTGGTGAAGTTTTTAACGCCACCAGATTTGCATCTGTTTATGCTTGGAACAGGAAGAGAATCAGCAACTGTGTTGCTGATTATTCTGTCCTATATAATTCCGCATCATTTTCCACTTTTAAGTGTTATGGAGTGTCTCCTACTAAATTAAATGATCTCTGCTTTACTAATGTCTATGCAGATTCATTTGTAATTAGAGGTGATGAAGTCAGACAAATCGCTCCAGGGCAAACTGGAAAGATTGCTGATTATAATTATAAATTACCAGATGATTTTACAGGCTGCGTTATAGCTTGGAATTCTAACAATCTTGATTCTAAGGTTGGTGGTAATTATAATTACCTGTATAGATTGTTTAGGAAGTCTAATCTCAAACCTTTTGAGAGAGATATTTCAACTGAAATCTATCAGGCCGGTAGCACACCTTGTAATGGTGTTGAAGGTTTTAATTGTTACTTTCCTTTACAATCATATGGTTTCCAACCCACTAATGGTGTTGGTTACCAACCATACAGAGTAGTAGTACTTTCTTTTGAACTTCTACATGCACCAGCAACTGTTTGTGGACCTAAAAAGTCTACTAATTTGGTTAAAAACAAATGTGTCAATTTCAACTTCAATGGTTTAACAGGCACAGGTGTTCTTACTGAGTCTAACAAAAAGTTTCTGCCTTTCCAACAATTTGGCAGAGACATTGCTGACACTACTGATGCTGTCCGTGATCCACAGACACTTGAGATTCTTGACATTACACCATGTTCTTTTGGTGGTGTCAGTGTTATAACACCAGGAACAAATACTTCTAACCAGGTTGCTGTTCTTTATCAGGGTGTTAACTGCACAGAAGTCCCTGTTGCTATTCATGCAGATCAACTTACTCCTACTTGGCGTGTTTATTCTACAGGTTCTAATGTTTTTCAAACACGTGCAGGCTGTTTAATAGGGGCTGAACATGTCAACAACTCATATGAGTGTGACATACCCATTGGTGCAGGTATATGCGCTAGTTATCAGACTCAGACTAATTCTCCTCGGCGGGCACGTAGTGTAGCTAGTCAATCCATCATTGCCTACACTATGTCACTTGGTGCAGAAAATTCAGTTGCTTACTCTRATAACTCTATTGCCATACCCACAAATTTTACTATTAGTGTTACCACAGAAATTCTACCAGTGTCTATGACCAAGACATCAGTAGATTGTACAATGTACATTTGTGGTGATTCAACTGAATGCAGCAATCTTTTGTTGCAATATGGCAGTTTTTGTACACAATTAAACCGTGCTTTAACTGGAATAGCTGTTGAACAAGACAAAAACACCCAAGAAGTTTTTGCACAAGTCAAACAAATTTACAAAACACCACCAATTAAAGATTTTGGTGGTTTTAATTTTTCACAAATATTACCAGATCCATCAAAACCAAGCAAGAGGTCATTTATTGAAGATCTACTTTTCAACAAAGTGACACTTGCAGATGCTGGCTTCATCAAACAATATGGTGATTGCCTTGGTGATATTGCTGCTAGAGACCTCATTTGTGCACAAAAGTTTAACGGCCTTACTGTTTTGCCACCTTTGCTCACAGATGAAATGATTGCTCAATACACTTCTGCACTGTTAGCGGGTACAATCACTTCTGGTTGGACCTTTGGTGCAGGTGCTGCATTACAAATACCATTTGCTATGCAAATGGCTTATAGGTTTAATGGTATTGGAGTTACACAGAATGTTCTCTATGAGAACCAAAAATTGATTGCCAACCAATTTAATAGTGCTATTGGCAAAATTCAAGACTCACTTTCTTCCACAGCAAGTGCACTTGGAAAACTTCAAGATGTGGTCAACCAAAATGCACAAGCTTTAAACACGCTTGTTAAACAACTTAGCTCCAATTTTGGTGCAATTTCAAGTGTTTTAAATGATATCCTTTCACGTCTTGACAAAGTTGAGGCTGAAGTGCAAATTGATAGGTTGATCACAGGCAGACTTCAAAGTTTGCAGACATATGTGACTCAACAATTAATTAGAGCTGCAGAAATCAGAGTTTCTGCTAATCTTGCTGCTACTAAAATGTCAGAGTGTGTACTTGGACAATCAAAAAGAGTTGATTTTTGTGGAAAGGGCTATCATCTTATGTCCTTCCCTCAGTCAGCACCTCATGGTGTAGTCTTCTTGCATGTGACTTATGTCCCTGCACAAGAAAAGAACTTCACAACTGCTCCTGCCATTTGTCATGATGGAAAAGCACACTTTCCTCGTGAAGGTGTCTTTGTTTCAAATGGCACACACTGGTTTGTAACACAAAGGAATTTTTATGAACCACAAATCATTACTACAGACAACACATTTGTGTCTGGTAACTGTGATGTTGTAATAGGAATTGTCAACAACACAGTTTATGATCCTTTGCAACCTGAATTAGACTCATTCAAGGAGGAGTTAGATAAATATTTTAAGAATCATACATCACCAGATGTTGATTTAGGTGACATCTCTGGCATTAATGCTTCAGTTGTAAACATTCAAAAAGAAATTGACCGCCTCAATGAGGTTGCCAAGAATTTAAATGAATCTCTCATCGATCTCCAAGAACTTGGAAAGTATGAGCAGTATATAAAATGGCCATGGTACATTTGGCTAGGTTTTATAGCTGGCTTGATTGCCATAGTAATGGTGACAATTATGCTTTGCTGTATGACCAGTTGCTGTAGTTGTCTCAAGGGCTGTTGTTCTTGTGGATCCTGCTGCAAATTTGATGAAGACGACTCTGAGCCAGTGCTCAAAGGAGTCAAATTACATTACACATAAACGAACTTATGGATTTGTTTATGAGAATCTTCACAATTGGAACTGTAACTTTGAAGCAAGGTGAAATCAAGGATGCTACTCCTTCAGATTTTGTTCGCGCTACTGCAACGATACCGATACAAGCCTCACTCCCTTTCGGATGGCTTATTGTTGGCGTTGCACTTCTTGCTGTTTTTCAGAGCGCTTCCAAAATCATAACCCTCAAAAAGAGATGGCAACTAGCACTCTCCAAGGGTGTTCACTTTGTTTGCAACTTGCTGTTGTTGTTTGTAACAGTTTACTCACACCTTTTGCTCGTTGCTGCTGGCCTTGAAGCCCCTTTTCTCTATCTTTATGCTTTAGTCTACTTCTTGCAGAGTATAAACTTTGTAAGAATAATAATGAGGCTTTGGCTTTGCTGGAAATGCCGTTCCAAAAACCCATTACTTTATGATGCCAACTATTTTCTTTGCTGGCATACTAATTGTTACGACTATTGTATACCTTACAATAGTGTAACTTCTTCAATTGTCATTACTTCAGGTGATGGCACAACAAGTCCTATTTCTGAACATGACTACCAGATTGGTGGTTATACTGAAAAATGGGAATCTGGAGTAAAAGACTGTGTTGTATTACACAGTTACTTCACTTCAGACTATTACCAGCTGTACTCAACTCAATTGAGTACAGACACTGGTGTTGAACATGTTACCTTCTTCATCTACAATAAAATTGTTGATGAGCCTGAAGAACATGTCCAAATTCACACAATCGACGGTTCATCCGGAGTTGTTAATCCAGTAATGGAACCAATTTATGATGAACCGACGACGACTACTAGCGTGCCTTTGTAAGCACAAGCTGATGAGTACGAACTTATGTACTCATTCGTTTCGGAAGAGMCAGGTACGTTAATAGTTAATAGCGTACTTCTTTTTCTTGCTTTCGTGGTATTCTTGCTAGTTACACTAGCCATCCTTACTGCGCTTCGATTGTGTGCGTACTGCTGCAATATTGTTAACGTGAGTCTTGTAAAACCTTCTTTTTACGTTTACTCTCGTGTTAAAAATCTGAATTCTTCTAGAGTTCCTGATCTTCTGGTCTAAACGAACTAAATATTATATTAGTTTTTCTGTTTGGAACTTTAATTTTAGCCATGGCAGATTCCAACGGTACTATTACCGTTGAAGAGCTTAAAAAGCTCCTTGAACAATGGAACCTAGTAATAGGTTTCCTATTCCTTACATGGATTTGTCTTCTACAATTTGCCTATGCCAACAGGAATAGGTTTTTGTATATAATTAAGTTAATTTTCCTCTGGCTGTTATGGCCAGTAACTTTAGCTTGTTTTGTGCTTGCTGCTGTTTACAGAATAAATTGGATCACCGGTGGAATTGCTATCGCAATGGCTTGTCTTGTAGGCTTGATGTGGCTCAGCTACTTCATTGCTTCTTTCAGACTGTTTGCGCGTACGCGTTCCATGTGGTCATTCAATCCAGAAACTAACATTCTTCTCAACGTGCCACTCCATGGCACTATTCTGACCAGACCGCTTCTAGAAAGTGAACTCGTAATCGGAGCTGTGATCCTTCGTGGACATCTTCGTATTGCTGGACACCATCTAGGACGCTGTGACATCAAGGACCTGCCTAAAGAAATCACTGTTGCTACATCACGAATGCTTTCTTATTACAAATTGGGAGCTTCGCAGCGTGTAGCAGGTGACTCAGGTTTTGCTGCATACAGTCGCTACAGGATTGGCAACTATAAATTAAACACAGACCATTCCAGTAGCAGTGACAATATTGCTTTGCTTGTACAGTAAGTGACAACAGATGTTTCATCTCGTTGACTTTCAGGTTACTATAGCAGAGATATTACTAATTATTATGAGGACTTTTAAAGTTTCCATTTGGAATCTTGATTACATCATAAACCTCATAATTAAAAATTTATCTAAGTCACTAACTGAGAATAAATATTCTCAATTAGATGAAGAGCAACCAATGGAGATTGATTAAACGAACATGAAAATTATTCTTTTCTTGGCACTGATAACACTCGCTACTTGTGAGCTTTATCACTACCAAGAGTGTGTTAGAGGTACAACAGTACTTTTAAAAGAACCTTGCTCTTCTGGAACATACGAGGGCAATTCACCATTTCATCCTCTAGCTGATAACAAATTTGCACTGACTTGCTTTAGCACTCAATTTGCTTTTGCTTGTCCTGACGGCGTAAAACACGTCTATCAGTTACGTGCCAGATCAGTTTCACCTAAACTGTTCATCAGACAAGAGGAAGTTCAAGAACTTTACTCTCCAATTTTTCTTATTGTTGCGGCAATAGTGTTTATAACACTTTGCTTCACACTCAAAAGAAAGACAGAATGATTGAACTTTCATTAATTGACTTCTATTTGTGCTTTTTAGCCTTTCTGCTATTCCTTGTTTTAATTATGCTTATTATCTTTTGGTTCTCACTTGAACTGCAAGATCATAATGAAACTTGTCACGCCTAAACGAACATGAAATTTCTTGTTTTCTTAGGAATCATCACAACTGTAGCTGCATTTCACCAAGAATGTAGTTTACAGTCATGTACTCAACATCAACCATATGTAGTTGATGACCCGTGTCCTATTCACTTCTATTCTAAATGGTATATTAGAGTAGGAGCTAGAAAATCAGCACCTTTAATTGAATTGTGCGTGGATGAGGCTGGTTCTAAATCACCCATTCAGTACATCGATATCGGTAATTATACAGTTTCCTGTTTACCTTTTACAATTAATTGCCAGGAACCTAAATTGGGTAGTCTTGTAGTGCGTTGTTCGTTCTATGAAGACTTTTTAGAGTATCATGACGTTCGTGTTGTTTTAGATTTCATCTAAACGAACAAACTAAAATGTCTGATAATGGACCCCAAAATCAGCGAAATGCACCCCGCATTACGTTTGGTGGACCCTCAGATTCAACTGGCAGTAACCAGAATGGAGAACGCAGTGGGGCGCGATCAAAACAACGTCGGCCCCAAGGTTTACCCAATAATACTGCGTCTTGGTTCACCGCTCTCACTCAACATGGCAAGGAAGACCTTAAATTCCCTCGAGGACAAGGCGTTCCAATTAACACCAATAGCAGTCCAGATGACCAAATTGGCTACTACCGAAGAGCTACCAGACGAATTCGTGGTGGTGACGGTAAAATGAAAGATCTCAGTCCAAGATGGTATTTCTACTACCTAGGAACTGGGCCAGAAGCTGGACTTCCCTATGGTGCTAACAAAGACGGCATCATATGGGTTGCAACTGAGGGAGCCTTGAATACACCAAAAGATCACATTGGCACCCGCAATCCTGCTAACAATGCTGCAATCGTGCTACAACTTCCTCAAGGAACAACATTGCCAAAAGGCTTCTACGCAGAAGGGAGCAGAGGCGGCAGTCAAGCCTCTTCTCGTTCCTCATCACGTAGTCGCAACAGTTCAAGAAATTCAACTCCAGGCAGCAGTAAACGAACTTCTCCTGCTAGAATGGCTGGCAATGGCGGTGATGCTGCTCTTGCTTTGCTGCTGCTTGACAGATTGAACCAGCTTGAGAGCAAAATGTCTGGTAAAGGCCAACAACAACAAGGCCAAACTGTCACTAAGAAATCTGCTGCTGAGGCTTCTAAGAAGCCTCGGCAAAAACGTACTGCCACTAAAGCATACAATGTAACACAAGCTTTCGGCAGACGTGGTCCAGAACAAACCCAAGGAAATTTTGGGGACCAGGAACTAATCAGACAAGGAACTGATTACAAACATTGGCCGCAAATTGCACAATTTGCCCCCAGCGCTTCAGCGTTCTTCGGAATGTCGCGCATTGGCATGGAAGTCACACCTTCGGGAACGTGGTTGACCTACACAGGTGCCATCAAATTGGATGACAAAGATCCAAATTTCAAAGATCAAGTCATTTTGCTGAATAAGCATATTGACGCATACAAAACATTCCCACCAACAGAGCCTAAAAAGGACAAAAAGAAGAAGGCTGATGAAACTCAAGCCTTACCGCAGAGACAGAAGAAACAGCAAACTGTGACTCTTCTTCCTGCTGCAGATTTGGATGATTTCTCCAAACAATTGCAACAATCCATGAGCAGTGCTGACTCAACTCAGGCCTAAACTCATGCAGACCACACAAGGCAGATGGGCTATATAAACGTTTTCGCTTTTCCGTTTACGATATATAGTCTACTCTTGTGCAGAATGAATTCTCGTAACTACATAGCACAAGTAGATGTAGTTAACTTTAATCTCACATAGCAATCTTTAATCAGTGTGTAACATTAGGGAGGACTTGAAAGAGCCACCACATTTTCACCGAGGCCACGCGGAGTACGATCGAGTGTACAGTGAACAATGCTAGGGAGAGCTGCCTATATGGAAGAGCCCTAATGTGTAAAATTAATTTTAGTAGTGCTATCCCCATGTGATTTTAATAGCTTCTTAGGAGNATGACANNNNNNNNNNNNNN
서열번호: 16
>중증 급성 호흡기 증후군 코로나바이러스 2 orf1ab 다단백질의 단리물 hCoV-19/Austria/CeMM0360/2020
MESLVPGFNEKTHVQLSLPVLQVRDVLVRGFGDSVEEVLSEARQHLKDGTCGLVEVEKGVLPQLEQPYVFIKRSDARTAPHGHVMVELVAELEGIQYGRSGETLGVLVPHVGEIPVAYRKVLLRKNGNKGAGGHSYGADLKSFDLGDELGTDPYEDFQENWNTKHSSGVTRELMRELNGGAYTRYVDNNFCGPDGYPLECIKDLLARAGKASCTLSEQLDFIDTKRGVYCCREHEHEIAWYTERSEKSYELQTPFEIKLAKKFDTFNGECPNFVFPLNSIIKTIQPRVEKKKLDGFMGRIRSVYPVASPNECNQMCLSTLMKCDHCGETSWQTGDFVKATCEFCGTENLTKEGATTCGYLPQNAVVKIYCPACHNSEVGPEHSLAEYHNESGLKTILRKGGRTIAFGGCVFSYVGCHNKCAYWVPRASANIGCNHTGVVGEGSEGLNDNLLEILQKEKVNINIVGDFKLNEEIAIILASFSASTSAFVETVKGLDYKAFKQIVESCGNFKVTKGKAKKGAWNIGEQKSILSPLYAFASEAARVVRSIFSRTLETAQNSVRVLQKAAITILDGISQYSLRLIDAMMFTSDLATNNLVVMAYITGGVVQLTSQWLTNIFGTVYEKLKPVLDWLEEKFKEGVEFLRDGWEIVKFISTCACEIVGGQIVTCAKEIKESVQTFFKLVNKFLALCADSIIIGGAKLKALNLGETFVTHSKGLYRKCVKSREETGLLMPLKAPKEIIFLEGETLPTEVLTEEVVLKTGDLQPLEQPTSEAVEAPLVGTPVCINGLMLLEIKDTEKYCALAPNMMVTNNTFTLKGGAPTKVTFGDDTVIEVQGYKSVNITFELDERIDKVLNEKCSAYTVELGTEVNEFACVVADAVIKTLQPVSELLTPLGIDLDEWSMATYYLFDESGEFKLASHMYCSFYPPDEDEEEGDCEEEEFEPSTQYEYGTEDDYQGKPLEFGATSAALQPEEEQEEDWLDDDSQQTVGQQDGSEDNQTTTIQTIVEVQPQLEMELTPVVQTIEVNSFSGYLKLTDNVYIKNADIVEEAKKVKPTVVVNAANVYLKHGGGVAGALNKATNNAMQVESDDYIATNGPLKVGGSCVLSGHNLAKHCLHVVGPNVNKGEDIQLLKSAYENFNQHEVLLAPLLSAGIFGADPIHSLRVCVDTVRTNVYLAVFDKNLYDKLVSSFLEMKSEKQVEQKIAEIPKEEVKPFITESKPSVEQRKQDDKKIKACVEEVTTTLEETKFLTENLLLYIDINGNLHPDSATLVSDIDITFLKKDAPYIVGDVVQEGVLTAVVIPTKKAGGTTEMLAKALRKVPTDNYITTYPGQGLNGYTVEEAKTVLKKCKSAFYILPSIISNEKQEILGTVSWNLREMLAHAEETRKLMPVCVETKAIVSTIQRKYKGIKIQEGVVDYGARFYFYTSKTTVASLINTLNDLNETLVTMPLGYVTHGLNLEEAARYMRSLKVPATVSVSSPDAVTAYNGYLTSSSKTPEEHFIETISLAGSYKDWSYSGQSTQLGIEFLKRGDKSVYYTSNPTTFHLDGEVITFDNLKTLLSLREVRTIKVFTTVDNINLHTQVVDMSMTYGQQFGPTYLDGADVTKIKPHNSHEGKTFYVLPNDDTLRVEAFEYYHTTDPSFLGRYMSALNHTKKWKYPQVNGLTSIKWADNNCYLATALLTLQQIELKFNPPALQDAYYRARAGEAANFCALILAYCNKTVGELGDVRETMSYLFQHANLDSCKRVLNVVCKTCGQQQTTLKGVEAVMYMGTLSYEQFKKGVQIPCTCGKQATKYLVQQESPFVMMSAPPAQYELKHGTFTCASEYTGNYQCGHYKHITSKETLYCIDGALLTKSSEYKGPITDVFYKENSYTTTIKPVTYKLDGVVCTEIDPKLDNYYKKDNSYFTEQPIDLVPNQPYPNASFDNFKFVCDNIKFADDLNQLTGYKKPASRELKVTFFPDLNGDVVAIDYKHYTPSFKKGAKLLHKPIVWHVNNATNKATYKPNTWCIRCLWSTKPVETSNSFDVLKSEDAQGMDNLACEDLKPVSEEVVENPTIQKDVLECNVKTTEVVGDIILKPANNSLKITEEVGHTDLMAAYVDNSSLTIKKPNELSRVLGLKTLATHGLAAVNSVPWDTIANYAKPFLNKVVSTTTNIVTRCLNRVCTNYMPYFFTLLLQLCTFTRSTNSRIKASMPTTIAKNTVKSVGKFCLEASFNYLKSPNFSKLINIIIWFLLLSVCLGSLIYSTAALGVLMSNLGMPSYCTGYREGYLNSTNVTIATYCTGSIPCSVCLSGLDSLDTYPSLETIQITISSFKWDLTAFGLVAEWFLAYILFTRFFYVLGLAAIMQLFFSYFAVHFISNSWLMWLIINLVQMAPISAMVRMYIFFASFYYVWKSYVHVVDGCNSSTCMMCYKRNRATRVECTTIVNGVRRSFYVYANGGKGFCKLHNWNCVNCDTFCAGSTFISDEVARDLSLQFKRPINPTDQSSYIVDSVTVKNGSIHLYFDKAGQKTYERHSLSHFVNLDNLRANNTKGSLPINVIVFDGKSKCEESSAKSASVYYSQLMCQPILLLDQALVSDVGDSAEVAVKMFDAYVNTFSSTFNVPMEKLKTLVATAEAELAKNVSLDNVLSTFISAARQGFVDSDVETKDVVECLKLSHQSDIEVTGDSCNNYMLTYNKVENMTPRDLGACIDCSARHINAQVAKSHNIALIWNVKDFMSLSEQLRKQIRSAAKKNNLPFKLTCATTRQVVNVVTTKIALKGGKIVNNWLKQLIKVTLVFLFVAAIFYLITPVHVMSKHTDFSSEIIGYKAIDGGVTRDIASTDTCFANKHADFDTWFSQRGGSYTNDKACPLIAAVITREVGFVVPGLPGTILRTTNGDFLHFLPRVFSAVGNICYTPSKLIEYTDFATSACVLAAECTIFKDASGKPVPYCYDTNVLEGSVAYESLRPDTRYVLMDGSIIQFPNTYLEGSVRVVTTFDSEYCRHGTCERSEAGVCVSTSGRWVLNNDYYRSLPGVFCGVDAVNLLTNMFTPLIQPIGALDISASIVAGGIVAIVVTCLAYYFMRFRRAFGEYSHVVAFNTLLFLMSFTVLCLTPVYSFLPGVYSVIYLYLTFYLTNDVSFLAHIQWMVMFTPLVPFWITIAYIICISTKHFYWFFSNYLKRRVVFNGVSFSTFEEAALCTFLLNKEMYLKLRSDVLLPLTQYNRYLALYNKYKYFSGAMDTTSYREAACCHLAKALNDFSNSGSDVLYQPPQTSITSAVLQSGFRKMAFPSGKVEGCMVQVTCGTTTLNGLWLDDVVYCPRHVICTSEDMLNPNYEDLLIRKSNHNFLVQAGNVQLRVIGHSMQNCVLKLKVDTANPKTPKYKFVRIQPGQTFSVLACYNGSPSGVYQCAMRPNFTIKGSFLNGSCGSVGFNIDYDCVSFCYMHHMELPTGVHAGTDLEGNFYGPFVDRQTAQAAGTDTTITVNVLAWLYAAVINGDRWFLNRFTTTLNDFNLVAMKYNYEPLTQDHVDILGPLSAQTGIAVLDMCASLKELLQNGMNGRTILGSALLEDEFTPFDVVRQCSGVTFQSAVKRTIKGTHHWLLLTILTSLLVLVQSTQWSLFFFLYENAFLPFAMGIIAMSAFAMMFVKHKHAFLCLFLLPSLATVAYFNMVYMPASWVMRIMTWLDMVDTSLSGFKLKDCVMYASAVVLLILMTARTVYDDGARRVWTLMNVLTLVYKVYYGNALDQAISMWALIISVTSNYSGVVTTVMFLARGIVFMCVEYCPIFFITGNTLQCIMLVYCFLGYFCTCYFGLFCLLNRYFRLTLGVYDYLVSTQEFRYMNSQGLLPPKNSIDAFKLNIKLLGVGGKPCIKVATVQSKMSDVKCTSVVLLSVLQQLRVESSSKLWAQCVQLHNDILLAKDTTEAFEKMVSLLSVLLSMQGAVDINKLCEEMLDNRATLQAIASEFSSLPSYAAFATAQEAYEQAVANGDSEVVLKKLKKSLNVAKSEFDRDAAMQRKLEKMADQAMTQMYKQARSEDKRAKVTSAMQTMLFTMLRKLDNDALNNIINNARDGCVPLNIIPLTTAAKLMVVIPDYNTYKNTCDGTTFTYASALWEIQQVVDADSKIVQLSEISMDNSPNLAWPLIVTALRANSAVKLQNNELSPVALRQMSCAAGTTQTACTDDNALAYYNTTKGGRFVLALLSDLQDLKWARFPKSDGTGTIYTELEPPCRFVTDTPKGPKVKYLYFIKGLNNLNRGMVLGSLAATVRLQAGNATEVPANSTVLSFCAFAVDAAKAYKDYLASGGQPITNCVKMLCTHTGTGQAITVTPEANMDQESFGGASCCLYCRCHIDHPNPKGFCDLKGKYVQIPTTCANDPVGFTLKNTVCTVCGMWKGYGCSCDQLREPMLQSADAQSFLNGFAV
서열번호: 17
>SARS-CoV-2_S_중간UniWien (S단백질_hCoV19AustriaCeMM03602020)
MFVFLVLLPLVSSQCVNLTTRTQLPPAYTNSFTRGVYYPDKVFRSSVLHSTQDLFLPFFSNVTWFHAIHVSGTNGTKRFDNPVLPFNDGVYFASTEKSNIIRGWIFGTTLDSKTQSLLIVNNATNVVIKVCEFQFCNDPFLGVYYHKNNKSWMESEFRVYSSANNCTFEYVSQPFLMDLEGKQGNFKNLREFVFKNIDGYFKIYSKHTPINLVRDLPQGFSALEPLVDLPIGINITRFQTLLALHRSYLTPGDSSSGWTAGAAAYYVGYLQPRTFLLKYNENGTITDAVDCALDPLSETKCTLKSFTVEKGIYQTSNFRXQPTESIVRFPNITNLCPFGEVFNATRFASVYAWNRKRISNCVADYSVLYNSASFSTFKCYGVSPTKLNDLCFTNVYADSFVIRGDEVRQIAPGQTGKIADYNYKLPDDFTGCVIAWNSNNLDSKVGGNYNYLYRLFRKSNLKPFERDISTEIYQAGSTPCNGVEGFNCYFPLQSYGFQPTNGVGYQPYRVVVLSFELLHAPATVCGPKKSTNLVKNKCVNFNFNGLTGTGVLTESNKKFLPFQQFGRDIADTTDAVRDPQTLEILDITPCSFGGVSVITPGTNTSNQVAVLYQGVNCTEVPVAIHADQLTPTWRVYSTGSNVFQTRAGCLIGAEHVNNSYECDIPIGAGICASYQTQTNSPRRARSVASQSIIAYTMSLGAENSVAYSXNSIAIPTNFTISVTTEILPVSMTKTSVDCTMYICGDSTECSNLLLQYGSFCTQLNRALTGIAVEQDKNTQEVFAQVKQIYKTPPIKDFGGFNFSQILPDPSKPSKRSFIEDLLFNKVTLADAGFIKQYGDCLGDIAARDLICAQKFNGLTVLPPLLTDEMIAQYTSALLAGTITSGWTFGAGAALQIPFAMQMAYRFNGIGVTQNVLYENQKLIANQFNSAIGKIQDSLSSTASALGKLQDVVNQNAQALNTLVKQLSSNFGAISSVLNDILSRLDKVEAEVQIDRLITGRLQSLQTYVTQQLIRAAEIRVSANLAATKMSECVLGQSKRVDFCGKGYHLMSFPQSAPHGVVFLHVTYVPAQEKNFTTAPAICHDGKAHFPREGVFVSNGTHWFVTQRNFYEPQIITTDNTFVSGNCDVVIGIVNNTVYDPLQPELDSFKEELDKYFKNHTSPDVDLGDISGINASVVNIQKEIDRLNEVAKNLNESLIDLQELGKYEQYIKWPWYIWLGFIAGLIAIVMVTIMLCCMTSCCSCLKGCCSCGSCCKFDEDDSEPVLKGVKLHYT
서열번호: 18
> 중증 급성 호흡기 증후군 코로나바이러스 2 단리물 SARS-CoV-2/인간/England/ex-SA/2021, EVAg Ref-SKU:004V-04071 (SA_P2) 완전 게놈. South-African B.1.351 계통
ATTAAAGGTTTATACCTTCCCAGGTAACAAACCAACCAACTTTCGATCTCTTGTAGATCTGTTCTCTAAACGAACTTTAAAATCTGTGTGGCTGTCACTCGGCTGCATGCTTAGTGCACTCACGCAGTATAATTAATAACTAATTACTGTCGTTGACAGGACACGAGTAACTCTTCTATCTTCTGCAGGCTGCTTACGGTTTCGTCCGTGTTGCAGCCGATCATCAGCACATCTAGGTTTTGTCCGGGTGTGACCGAAAGGTAAGATGGAGAGCCTTGTCCCTGGTTTCAACGAGAAAACACACGTCCAACTCAGTTTGCCTGTTTTACAGGTTCGCGACGTGCTCGTACGTGGCTTTGGAGACTCCGTGGAGGAGGTCTTATCAGAGGCACGTCAACATCTTAAAGATGGCACTTGTGGCTTAGTAGAAGTTGAAAAAGGCGTTTTGCCTCAACTTGAACAGCCCTATGTGTTCATCAAACGTTCGGATGCTCGAACTGCACCTCATGGTCATGTTATGGTTGAGCTGGTAGCAGAACTCGAAGGCATTCAGTACGGTCGTAGTGGTGAGACACTTGGTGTCCTTGTCCCTCATGTGGGCGAAATACCAGTGGCTTACCGCAAGGTTCTTCTTCGTAAGAACGGTAATAAAGGAGCTGGTGGCCATAGTTACGGCGCCGATCTAAAGTCATTTGACTTAGGCGACGAGCTTGGCACTGATCCTTATGAAGATTTTCAAGAAAACTGGAACACTAAACATAGCAGTGGTGTTACCCGTGAACTCATGCGTGAGCTTAACGGAGGGGCATACACTCGCTATGTCGATAACAACTTCTGTGGCCCTGATGGCTACCCTCTTGAGTGCATTAAAGACCTTCTAGCACGTGCTGGTAAAGCTTCATGCACTTTGTCCGAACAACTGGACTTTATTGACACTAAGAGGGGTGTATACTGCTGCCGTGAACATGAGCATGAAATTGCTTGGTACACGGAACGTTCTGAAAAGAGCTATGAATTGCAGACACCTTTTGAAATTAAATTGGCAAAGAAATTTGACATCTTCAATGGGGAATGTCCAAATTTTGTATTTCCCTTAAATTCCATAATCAAGACTATTCAACCAAGGGTTGAAAAGAAAAAGCTTGATGGCTTTATGGGTAGAATTCGATCTGTCTATCCAGTTGCGTCACCAAATGAATGCAACCAAATGTGCCTTTCAACTCTCATGAAGTGTGATCATTGTGGTGAAACTTCATGGCAGACGGGCGATTTTGTTAAAGCCACTTGCGAATTTTGTGGCACTGAGAATTTGACTAAAGAAGGTGCCACTACTTGTGGTTACTTACCCCAAAATGCTGTTGTTAAAATTTATTGTCCAGCATGTCACAATTCAGAAGTAGGACCTGAGCATAGTCTTGCCGAATACCATAATGAATCTGGCTTGAAAACCATTCTTCGTAAGGGTGGTCGCACTATTGCCTTTGGAGGCTGTGTGTTCTCTTATGTTGGTTGCCATAACAAGTGTGCCTATTGGGTTCCACGTGCTAGCGCTAACATAGGTTGTAACCATACAGGTGTTGTTGGAGAAGGTTCCGAAGGTCTTAATGACAACCTTCTTGAAATACTCCAAAAAGAGAAAGTCAACATCAATATTGTTGGTGACTTTAAACTTAATGAAGAGATCGCCATTATTTTGGCATCTTTTTCTGCTTCCACAAGTGCTTTTGTGGAAACTGTGAAAGGTTTGGATTATAAAGCATTCAAACAAATTGTTGAATCCTGTGGTAATTTTAAAGTTACAAAAGGAAAAGCTAAAAAAGGTGCCTGGAATATTGGTGAACAGAAATCAATACTGAGTCCTCTTTATGCATTTGCATCAGAGGCTGCTCGTGTTGTACGATCAATTTTCTCCCGCACTCTTGAAACTGCTCAAAATTCTGTGCGTGTTTTACAGAAGGCCGCTATAACAATACTAGATGGAATTTCACAGTATTCACTGAGACTCATTGATGCTATGATGTTCACATCTGATTTGGCTACTAACAATCTAGTTGTAATGGCCTACATTACAGGTGGTGTTGTTCAGTTGACTTCGCAGTGGCTAACTAACATCTTTGGCACTGTTTATGAAAAACTCAAACCCGTCCTTGATTGGCTTGAAGAGAAGTTTAAGGAAGGTGTAGAGTTTCTTAGAGACGGTTGGGAAATTGTTAAATTTATCTCAACCTGTGCTTGTGAAATTGTCGGTGGACAAATTGTCACCTGTGCAAAGGAAATTAAGGAGAGTGTTCAGACATTCTTTAAGCTTGTAAATAAATTTTTGGCTTTGTGTGCTGACTCTATCATTATTGGTGGAGCTAAACTTAAAGCCTTGAATTTAGGTGAAACATTTGTCACGCACTCAAAGGGATTGTACAGAAAGTGTGTTAAATCCAGAGAAGAAACTGGCCTACTCATGCCTCTAAAAGCCCCAAAAGAAATTATCTTCTTAGAGGGAGAAACACTTCCCACAGAAGTGTTAACAGAGGAAGTTGTCTTGAAAACTGGTGATTTACAACCATTAGAACAACCTACTAGTGAAGCTGTTGAAGCTCCATTGGTTGGTACACCAGTTTGTATTAACGGGCTTATGTTGCTCGAAATCAAAGACACAGAAAAGTACTGTGCCCTTGCACCTAATATGATGGTAACTAACAATACCTTCACACTCAAAGGCGGTGCACCAACAAAGGTTACTTTTGGTGATGACACTGTGATAGAAGTGCAAGGTTACAAGAGTGTGAATATCACTTTTGAACTTGATGAAAGGATTGATAAAGTACTTAATGAGAAGTGCTCTGCCTATACAGTTGAACTCGGTACAGAAGTAAATGAGTTCGCCTGTGTTGTGGCAGATGCTGTCATAAAAACTTTGCAACCAGTATCTGAATTACTTACACCACTGGGCATTGATTTAGATGAGTGGAGTATGGCTACATACTACTTATTTGATGAGTCTGGTGAGTTTAAATTGGCTTCACATATGTATTGTTCTTTTTACCCTCCAGATGAGGATGAAGAAGAAGGTGATTGTGAAGAAGAAGAGTTTGAGCCATCAACTCAATATGAGTATGGTACTGAAGATGATTACCAAGGTAAACCTTTGGAATTTGGTGCCACTTCTGCTGCTCTTCAACCTGAAGAAGAGCAAGAAGAAGATTGGTTAGATGATGATAGTCAACAAACTGTTGGTCAACAAGACGGCAGTGAGGACAATCAGACAACTACTATTCAAACAATTGTTGAGGTTCAACCTCAATTAGAGATGGAACTTACACCAGTTGTTCAGACTATTGAAGTGAATAGTTTTAGTGGTTATTTAAAACTTACTGACAATGTATACATTAAAAATGCAGACATTGTGGAAGAAGCTAAAAAGGTAAAACCAACAGTGGTTGTTAATGCAGCCAATGTTTACCTTAAACATGGAGGAGGTGTTGCAGGAGCCTTAAATAAGGCTACTAACAATGCCATGCAAGTTGAATCTGATGATTACATAGCTACTAATGGACCACTTAAAGTGGGTGGTAGTTGTGTTTTAAGCGGACACAATCTTGCTAAACACTGTCTTCATGTTGTCGGCCCAAATGTTAACAAAGGTGAAGACATTCAACTTCTTAAGAGTGCTTATGAAAATTTTAATCAGCACGAAGTTCTACTTGCACCATTATTATCAGCTGGTATTTTTGGTGCTGACCCTATACATTCTTTAAGAGTTTGTGTAGATACTGTTCGCACAAATGTCTACTTAGCTGTCTTTGATAAAAATCTCTATGACAAACTTGTTTCAAGCTTTTTGGAAATGAAGAGTGAAAAGCAAGTTGAACAAAAGATCGCTGAGATTCCTAAAGAGGAAGTTAAGCCATTTATAACTGAAAGTAAACCTTCAGTTGAACAGAGAAAACAAGATGATAAGAAAATCAAAGCTTGTGTTGAAGAAGTTACAACAACTCTGGAAGAAACTAAGTTCCTCACAGAAAACTTGTTACTTTATATTGACATTAATGGCAATCTTCATCCAGATTCTGCCACTCTTGTTAGTGACATTGACATCACTTTCTTAAAGAAAGATGCTCCATATATAGTGGGTGATGTTGTTCAAGAGGGTGTTTTAACTGCTGTGGTTATACCTACTAAAAAGGCTGGTGGCACTACTGAAATGTTAGCGAAAGCTTTGAGAAAAGTGCCAACAGACAATTATATAACCACTTACCCGGGTCAGGGTTTAAATGGTTACACTGTAGAGGAGGCAAAGACAGTGCTTAAAAAGTGTAAAAGTGCCTTTTACATTCTACCATCTATTATCTCTAATGAGAAGCAAGAAATTCTTGGAACTGTTTCTTGGAATTTGCGAGAAATGCTTGCACATGCAGAAGAAACACGCAAATTAATGCCTGTCTGTGTGGAAACTAAAGCCATAGTTTCAACTATACAGCGTAAATATAAGGGTATTAAAATACAAGAGGGTGTGGTTGATTATGGTGCTAGATTTTACTTTTACACCAGTAAAACAACTGTAGCGTCACTTATCAACACACTTAACGATCTAAATGAAACTCTTGTTACAATGCCACTTGGCTATGTAACACATGGCTTAAATTTGGAAGAAGCTGCTCGGTATATGAGATCTCTCAAAGTGCCAGCTACAGTTTCTGTTTCTTCACCTGATGCTGTTACAGCGTATAATGGTTATCTTACTTCTTCTTCTAAAACACCTGAAGAACATTTTATTGAAACCATCTCACTTGCTGGTTCCTATAAAGATTGGTCCTATTCTGGACAATCTACACAACTAGGTATAGAATTTCTTAAGAGAGGTGATAAAAGTGTATATTACACTAGTAATCCTACCACATTCCACCTAGATGGTGAAGTTATCACCTTTGACAATCTTAAGACACTTCTTTCTTTGAGAGAAGTGAGGACTATTAAGGTGTTTACAACAGTAGACAACATTAACCTCCACACGCAAGTTGTGGACATGTCAATGACATATGGACAACAGTTTGGTCCAACTTATTTGGATGGAGCTGATGTTACTAAAATAAAACCTCATAATTCACATGAAGGTAAAACATTTTATGTTTTACCTAATGATGACACTCTACGTGTTGAGGCTTTTGAGTACTACCACACAACTGATCCTAGTTTTCTGGGTAGGTACATGTCAGCATTAAATCACACTAAAAATTGGAAATACCCACAAGTTAATGGTTTAACTTCTATTAAATGGGCAGATAACAACTGTTATCTTGCCACTGCATTGTTAACACTCCAACAAATAGAGTTGAAGTTTAATCCACCTGCTCTACAAGATGCTTATTACAGAGCAAGGGCTGGTGAAGCTGCTAACTTTTGTGCACTTATCTTAGCCTACTGTAATAAGACAGTAGGTGAGTTAGGTGATGTTAGAGAAACAATGAGTTACTTGTTTCAACATGCCAATTTAGATTCTTGCAAAAGAGTCTTGAACGTGGTGTGTAAAACTTGTGGACAACAGCAGACAACCCTTAAGGGTGTAGAAGCTGTTATGTACATGGGCACACTTTCTTATGAACAATTTAAGAAAGGTGTTCAGATACCTTGTACGTGTGGTAAACAAGCTACAAAATATCTAGTACAACAGGAGTCACCTTTTGTTATGATGTCAGCACCACCTGCTCAGTATGAACTTAAGCATGGTACATTTACTTGTGCTAGTGAGTACACTGGTAATTACCAGTGTGGTCACTATAAACATATAACTTCTAAAGAAACTTTGTATTGCATAGACGGTGCTTTACTTACAAAGTCCTCAGAATACAAAGGTCCTATTACGGATGTTTTCTACAAAGAAAACAGTTACACAACAACCATAAAACCAGTTACTTATAAATTGGATGGTGTTGTTTGTACAGAAATTGACCCTAAGTTGGACAATTATTATAAGAAAGACAATTCTTATTTCACAGAGCAACCAATTGATCTTGTACCAAACCAACCATATCCAAACGCAAGCTTCGATAATTTTAAGTTTGTATGTGATAATATCAAATTTGCTGATGATTTAAACCAGTTAACTGGTTACAAGAAACCTGCTTCAAGAGAGCTTAAAGTTACATTTTTCCCTGACTTAAATGGTGATGTGGTGGCTATTGATTATAAACACTACACACCCTCTTTTAAGAAAGGAGCTAAATTGTTACATAAACCTATTGTTTGGCATGTTAACAATGCAACTAATAAAGCCACGTATAAACCAAATACCTGGTGTATACGTTGTCTTTGGAGCACAAAACCAGTTGAAACATCAAATTCGTTTGATGTACTGAAGTCAGAGGACGCGCAGGGAATGGATAATCTTGCCTGCGAAGATCTAAAACCAGTCTCTGAAGAAGTAGTGGAAAATCCTACCATACAGAAAGACGTTCTTGAGTGTAATGTGAAAACTACCGAAGTTGTAGGAGACATTATACTTAAACCAGCAAATAATAGTTTAAAAATTACAGAAGAGGTTGGCCACACAGATCTAATGGCTGCTTATGTAGACAATTCTAGTCTTACTATTAAGAAACCTAATGAATTATCTAGAGTATTAGGTTTGAAAACCCTTGCTACTCATGGTTTAGCTGCTGTTAATAGTGTCCCTTGGGATACTATAGCTAATTATGCTAAGCCTTTTCTTAACAAAGTTGTTAGTACAACTACTAACATAGTTACACGGTGTTTAAACCGTGTTTGTACTAATTATATGCCTTATTTCTTTACTTTATTGCTACAATTGTGTACTTTTACTAGAAGTACAAATTCTAGAATTAAAGCATCTATGCCGACTACTATAGCAAAGAATACTGTTAAGAGTGTCGGTAAATTTTGTCTAGAGGCTTCATTTAATTATTTGAAGTCACCTAATTTTTCTAAACTGATAAATATTATAATTTGGTTTTTACTATTAAGTGTTTGCCTAGGTTCTTTAATMTACTCAACCGCTGCTTTAGGTGTTTTAATGTCTAATTTAGGCATGCCTTCTTACTGTACTGGTTACAGAGAAGGCTATTTGAACTCTACTAATGTCACTATTGCAACCTACTGTACTGGTTCTATACCTTGTAGTGTTTGTCTTAGTGGTTTAGATTCTTTAGACACCTATCCTTCTTTAGAAACTATACAAATTACCATTTCATCTTTTAAATGGGATTTAACTGCTTTTGGCTTAGTTGCAGAGTGGTTTTTGGCATATATTCTTTTCACTAGGTTTTTCTATGTACTTGGATTGGCTGCAATCATGCAATTGTTTTTCAGCTATTTTGCAGTACATTTTATTAGTAATTCTTGGCTTATGTGGTTAATAATTAATCTTGTACAAATGGCCCCGATTTCAGCTATGGTTAGAATGTACATCTTCTTTGCATCATTTTATTATGTATGGAAAAGTTATGTGCATGTTGTAGACGGTTGTAATTCATCAACTTGTATGATGTGTTACAAACGTAATAGAGCAACAAGAGTCGAATGTACAACTATTGTTAATGGTGTTAGAAGGTCCTTTTATGTCTATGCTAATGGAGGTAAAGGCTTTTGCAAACTACACAATTGGAATTGTGTTAATTGTGATACATTCTGTGCTGGTAGTACATTTATTAGTGATGAAGTTGCGAGAGACTTGTCACTACAGTTTAAAAGACCAATAAATCCTACTGACCAGTCTTCTTACATCGTTGATAGTGTTACAGTGAAGAATGGTTCCATCCATCTTTACTTTGATAAAGCTGGTCAAAAGACTTATGAAAGACATTCTCTCTCTCATTTTGTTAACTTAGACAACCTGAGAGCTAATAACACTAAAGGTTCATTGCCTATTAATGTTATAGTTTTTGATGGTAAATCAAAATGTGAAGAATCATCTGCAAAATCAGCGTCTGTTTACTACAGTCAGCTTATGTGTCAACCTATACTGTTACTAGATCAGGCATTAGTGTCTGATGTTGGTGATAGTGCGGAAGTTGCAGTTAAAATGTTTGATGCTTACGTTAATACGTTTTCATCAACTTTTAACGTACCAATGGAAAAACTCAAAACACTAGTTGCAACTGCAGAAGCTGAACTTGCAAAGAATGTGTCCTTAGACAATGTCTTATCTACTTTTATTTCAGCAGCTCGGCAAGGGTTTGTTGATTCAGATGTAGAAACTAAAGATGTTGTTGAATGTCTTAAATTGTCACATCAATCTGACATAGAAGTTACTGGCGATAGTTGTAATAACTATATGCTCACCTATAACAAAGTTGAAAACATGACACCCCGTGACCTTGGTGCTTGTATTGACTGTAGTGCGCGTCATATTAATGCGCAGGTAGCAAAAAGTCACAACATTGCTTTGATATGGAACGTTAAAGATTTCATGTCATTGTCTGAACAACTACGAAAACAAATACGTAGTGCTGCTAAAAAGAATAACTTACCTTTTAAGTTGACATGTGCAACTACTAGACAAGTTGTTAATGTTGTAACAACAAAGATAGCACTTAAGGGTGGTAAAATTGTTAATAATTGGTTGAAGCAGTTAATTAAAGTTACACTTGTGTTCCTTTTTGTTGCTGCTATTTTCTATTTAATAACACCTGTTCATGTCATGTCTAAACATACTGACTTTTCAAGTGAAATCATAGGATACAAGGCTATTGATGGTGGTGTCACTCGTGACATAGCATCTACAGATACTTGTTTTGCTAACAAACATGCTGATTTTGACACATGGTTTAGCCAGCGTGGTGGTAGTTATACTAATGACAAAGCTTGCCCATTGATTGCTGCAGTCATAACAAGAGAAGTGGGTTTTGTCGTGCCTGGTTTGCCTGGCACGATATTACGCACAACTAATGGTGACTTTTTGCATTTCTTACCTAGAGTTTTTAGTGCAGTTGGTAACATCTGTTACACACCATCAAAACTTATAGAGTACACTGACTTTGCAACATCAGCTTGTGTTTTGGCTGCTGAATGTACAATTTTTAAAGATGCTTCTGGTAAGCCAGTACCATATTGTTATGATACCAATGTACTAGAAGGTTCTGTTGCTTATGAAAGTTTACGCCCTGACACACGTTATGTGCTCATGGATGGCTCTATTATTCAATTTCCTAACACCTACCTTGAAGGTTCTGTTAGAGTGGTAACAACTTTTGATTCTGAGTACTGTAGGCACGGCACTTGTGAAAGATCAGAAGCTGGTGTTTGTGTATCTACTAGTGGTAGATGGGTACTTAACAATGATTATTACAGATCTTTACCAGGAGTTTTCTGTGGTGTAGATGCTGTAAATTTATTTACTAATATGTTTACACCACTAATTCAACCTATTGGTGCTTTGGACATATCAGCATCTATAGTAGCTGGTGGTATTGTAGCTATCGTAGTAACATGCCTTGCCTACTATTTTATGAGGTTTAGAAGAGCTTTTGGTGAATACAGTCATGTAGTTGCCTTTAATACTTTACTATTCCTTATGTCATTCACTGTACTCTGTTTAACACCAGTTTACTCATTCTTACCTGGTGTTTATTCTGTTATTTACTTGTACTTGACATTTTATCTTACTAATGATGTTTCTTTTTTAGCACATATTCAGTGGATGGTTATGTTCACACCTTTAGTACCTTTCTGGATAACAATTGCTTATATCATTTGTATTTCCACAAAGCATTTCTATTGGTTCTTTAGTAATTACCTAAAGAGACGTGTAGTCTTTAATGGTGTTTCCTTTAGTACTTTTGAAGAAGCTGCGCTGTGCACCTTTTTGTTAAATAAAGAAATGTATCTAAAGTTGCGTAGTGATGTGCTATTACCTCTTACGCAATATAATAGATACTTAGCTCTTTATAATAAGTACAAGTATTTTAGTGGAGCAATGGATACAACTAGCTACAGAGAAGCTGCTTGTTGTCATCTCGCAAAGGCTCTCAATGACTTCAGTAACTCAGGTTCTGATGTTCTTTACCAACCACCACAAACCTCTATCACCTCAGCTGTTTTGCAGAGTGGTTTTAGAAAAATGGCATTCCCATCTGGTAAAGTTGAGGGTTGTATGGTACAAGTAACTTGTGGTACAACTACACTTAACGGTCTTTGGCTTGATGACGTAGTTTACTGTCCAAGACATGTGATCTGCACCTCTGAAGACATGCTTAACCCTAATTATGAAGATTTACTCATTCGTAAGTCTAATCATAATTTCTTGGTACAGGCTGGTAATGTTCAACTCAGGGTTATTGGACATTCTATGCAAAATTGTGTACTTAAGCTTAGGGTTGATACAGCCAATCCTAAGACACCTAAGTATAAGTTTGTTCGCATTCAACCAGGACAGACTTTTTCAGTGTTAGCTTGTTACAATGGTTCACCATCTGGTGTTTACCAATGTGCTATGAGGCCCAATTTCACTATTAAGGGTTCATTCCTTAATGGTTCATGTGGTAGTGTTGGTTTTAACATAGATTATGACTGTGTCTCTTTTTKTTACATGCACCATATGGAATTACCAACTGGAGTTCATGCTGGCACAGACTTAGAAGGTAACTTTTATGGACCTTTTGTTGACAGGCAAACAGCACAAGCAGCTGGTACGGACACAACTATTACAGTTAATGTTTTAGCTTGGTTGTACGCTGCTGTTATAAATGGAGACAGGTGGTTTCTCAATCGATTTACCACAACTCTTAATGACTTTAACCTTGTGGCTATGAAGTACAATTATGAACYTCTAACACAAGACCATGTTGACATACTAGGACCTCTTTCTGCTCAAACTGGAATTGCCGTTTTAGATATGTGTGCTTCATTAAAAGAATTACTGCAAAATGGTATGAATGGACGTACCATATTGGGTAGTGCTTTATTAGAAGATGAATTTACACCTTTTGATGTTGTTAGACAATGCTCAGGTGTTACTTTCCAAAGTGCAGTGAAAAGAACAATCAAGGGTACACACCACTGGTTGTTACTCACAATTTTGACTTCACTTTTAGTTTTAGTCCAGAGTACTCAATGGTCTTTGTTCTTTTTTTTGTATGAAAATGCCTTTTTACCTTTTGCTATGGGTATTATTGCTATGTCTGCTTTTGCAATGATGTTTGTCAAACATAAGCATGCATTTCTCTGTTTGTTTTTGTTACCTTCTCTTGCCACTGTAGCTTATTTTAATATGGTCTATATGCCTGCTAGTTGGGTGATGCGTATTATGACATGGTTGGATATGGTTGATACTAGTTTGNNNNNNAAGCTAAAAGACTGTGTTATGTATGCATCAGCTGTAGTGTTACTAATCCTTATGACAGCAAGAACTGTGTATGATGATGGTGCTAGGAGAGTGTGGACACTTATGAATGTCTTGACACTCGTTTATAAAGTTTATTATGGTAATGCTTTAGATCAAGCCATTTCCATGTGGGCTCTTATAATCTCTGTTACTTCTAACTACTCAGGTGTAGTTACAACTGTCATGTTTTTGGCCAGAGGTATTGTTTTTATGTGTGTTGAGTATTGCCCTATTTTCTTCATAACTGGTAATACACTTCAGTGTATAATGCTAGTTTATTGTTTCTTAGGCTATTTTTGTACTTGTTACTTTGGCCTCTTTTGTTTACTCAACCGCTACTTTAGACTGACTCTTGGTGTTTATGATTACTTAGTTTCTACACAGGAGTTTAGATATATGAATTCACAGGGACTAYTCCCACCCAAGAATAGCATAGATGCCTTCAAACTCAACATTAAATTGTTGGGTGTTGGTGGCAAACCTTGTATCAAAGTAGCCACTGTACAGTCTAAAATGTCAGATGTAAAGTGCACATCAGTAGTCTTACTCTCAGTTTTGCAACAACTCAGAGTAGAATCATCATCTAAATTGTGGGCTCAATGTGTCCAGTTACACAATGACATTCTCTTAGCTAAAGATACTACTGAAGCCTTTGAAAAAATGGTTTCACTACTTTCTGTTTTGCTTTCCATGCAGGGTGCTGTAGACATAAACAAGCTTTGTGAAGAAATGCTGGACAACAGGGCAACCTTACAAGCTATAGCCTCAGAGTTTAGTTCCCTTCCATCATATGCAGCTTTTGCTACTGCTCAAGAAGCTTATGAGCAGGCTGTTGCTAATGGTGATTCTGAAGTTGTTCTTAAAAAGTTGAAGAAGTCTTTGAATGTGGCTAAATCTGAATTTGACCGTGATGCAGCCATGCAACGTAAGTTGGAAAAGATGGCTGATCAAGCTATGACCCAAATGTATAAACAGGCTAGATCTGAGGACAAGAGGGCAAAAGTTACTAGTGCTATGCAGACAATGCTTTTCACTATGCTTAGAAAGTTGGATAATGATGCACTCAACAACATTATCAACAATGCAAGAGATGGTTGTGTTCCCTTGAACATAATACCTCTTACAACAGCAGCCAAACTAATGGTTGTCATACCAGACTATAACACATATAAAAATACGTGTGATGGTACAACATTTACTTATGCATCAGCATTGTGGGAAATCCAACAGGTTGTAGATGCAGATAGTAAAATTGTTCAACTTAGTGAAATTAGTATGGACAATTCACCTAATTTAGCATGGCCTCTTATTGTAACAGCTTTAAGGGCCAATTCTGCTGTCAAATTACAGAATAATGAGCTTAGTCCTGTTGCACTACGACAGATGTCTTGTGCTGCCGGTACTACACAAACTGCTTGCACTGATGACAATGCGTTAGCTTACTACAACACAACAAAGGGAGGTAGGTTTGTACTTGCACTGTTATCCGATTTACAGGATTTGAAATGGGCTAGATTCCCTAAGAGTGATGGAACTGGTACTATCTATACAGAACTGGAACCACCTTGTAGGTTTGTTACAGACACACCTAAAGGTCCTAAAGTGAAGTATTTATACTTTATTAAAGGATTAAACAACCTAAATAGAGGTATGGTACTTGGTAGTTTAGCTGCCACAGTACGTCTACAAGCTGGTAATGCAACAGAAGTGCCTGCCAATTCAACTGTATTATCTTTCTGTGCTTTTGCTGTAGATGCTGCTAAAGCTTACAAAGATTATCTAGCTAGTGGGGGACAACCAATCACTAATTGTGTTAAGATGTTGTGTA
CACACACTGGTACTGGTCAGGCAATAACAGTTACACCGGAAGCCAATATGGATCAAGAATCCTTTGGTGGTGCATCGTGTTGTCTGTACTGCCGTTGCCACATAGATCATCCAAATCCTAAAGGATTTTGTGACTTAAAAGGTAAGTATGTACAAATACCTACAACTTGTGCTAATGACCCTGTGGGTTTTACACTTAAAAACACAGTCTGTACCGTCTGCGGTATGTGGAAAGGTTATGGCTGTAGTTGTGATCAACTCCGCGAACCCATGCTTCAGTCAGCTGATGCACAATCGTTTTTAAACGGGTTTGCGGTGTAAGTGCAGCCCGTCTTACACCGTGCGGCACAGGCACTAGTACTGATGTCGTATACAGGGCTTTTGACATCTACAATGATAAAGTAGCTGGTTTTGCTAAATTCCTAAAAACTAATTGTTGTCGCTTCCAAGAAAAGGACGAAGATGACAATTTAATTGATTCTTACTTTGTAGTTAAGAGACACACTTTCTCTAACTACCAACATGAAGAAACAATTTATAATTTACTTAAGGATTGTCCAGCTGTTGCTAAACATGACTTCTTTAAGTTTAGAATAGACGGTGACATGGTACCACATATATCACGTCAACGTCTTACTAAATACACAATGGCAGACCTCGTCTATGCTTTAAGGCATTTTGATGAAGGTAATTGTGACACATTAAAAGAAATACTTGTCACATACAATTGTTGTGATGATGATTATTTCAATAAAAAGGACTGGTATGATTTTGTAGAAAACCCAGATATATTACGCGTATACGCCAACTTAGGTGAACGTGTACGCCAAGCTTTGTTAAAAACAGTACAATTCTGTGATGCCATGCGAAATGCTGGTATTGTTGGTGTACTGACATTAGATAATCAAGATCTCAATGGTAACTGGTATGATTTCGGTGATTTCATACAAACCACGCCAGGTAGTGGAGTTCCTGTTGTAGATTCTTATTATTCATTGTTAATGCCTATATTAACCTTGACCAGGGCTTTAACTGCAGAGTCACATGTTGACACTGACTTAACAAAGCCTTACATTAAGTGGGATTTGTTAAAATATGACTTCACGGAAGAGAGGTTAAAACTCTTTGACCGTTATTTTAAATATTGGGATCAGACATACCACCCAAATTGTGTTAACTGTTTGGATGACAGATGCATTCTGCATTGTGCAAACTTTAATGTTTTATTCTCTACAGTGTTCCCACTTACAAGTTTTGGACCACTAGTGAGAAAAATATTTGTTGATGGTGTTCCATTTGTAGTTTCAACTGGATACCACTTCAGAGAGCTAGGTGTTGTACATAATCAGGATGTAAACTTACATAGCTCTAGACTTAGTTTTAAGGAATTACTTGTGTATGCTGCTGACCCTGCTATGCACGCTGCTTCTGGTAATCTATTACTAGATAAACGCACTACGTGCTTTTCAGT
AGCTGCACTTACTAACAATGTTGCTTTTCAAACTGTCAAACCCGGTAATTTTAACAAAGACTTCTATGACTTTGCTGTGTCTAAGGGTTTCTTTAAGGAAGGAAGTTCTGTTGAATTAAAACACTTCTTCTTTGCTCAGGATGGTAATGCTGCTATCAGCGATTATGACTACTATCGTTATAATCTACCAACAATGTGTGATATCAGACAACTACTATTTGTAGTTGAAGTTGTTGATAAGTACTTTGATTGTTACGATGGTGGCTGTATTAATGCTAACCAAGTCATCGTCAACAACCTAGACAAATCAGCTGGTTTTCCATTTAATAAATGGGGTAAGGCTAGACTTTATTATGATTCAATGAGTTATGAGGATCAAGATGCACTTTTCGCATATACAAAACGTAATGTCATCCCTACTATAACTCAAATGAATCTTAAGTATGCCATTAGTGCAAAGAATAGAGCTCGCACCGTAGCTGGTGTCTCTATCTGTAGTACTATGACCAATAGACAGTTTCATCAAAAATTATTGAAATCAATAGCCGCCACTAGAGGAGCTACTGTAGTAATTGGAACAAGCAAATTCTATGGTGGTTGGCACAACATGTTAAAAACTGTTTATAGTGATGTAGAAAACCCTCACCTTATGGGTTGGGATTATCCTAAATGTGATAGAGCCATGCCTAACATGCTTAGAATTATGGCCTCACTTGTTCTTGCTCGCAAACATACAACGTGTTGTAGCTTGTCACACCGTTTCTATAGATTAGCTAATGAGTGTGCTCAAGTATTGAGTGAAATGGTCATGTGTGGCGGTTCACTATATGTTAAACCAGGTGGAACCTCATCAGGAGATGCCACAACTGCTTATGCTAATAGTGTTTTTAACATTTGTCAAGCTGTCACGGCCAATGTTAATGCACTTTTATCTACTGATGGTAACAAAATTGCCGATAAGTATGTCCGCAATTTACAAC
ACAGACTTTATGAGTGTCTCTATAGAAATAGAGATGTTGACACAGACTTTGTGAATGAGTTTTACGCATATTTGCGTAAACATTTCTCAATGATGATACTCTCTGACGATGCTGTTGTGTGTTTCAATAGCACTTATGCATCTCAAGGTCTAGTGGCTAGCATAAAGAACTTTAAGTCAGTTCTTTATTATCAAAACAATGTTTTTATGTCTGAAGCAAAATGTTGGACTGAGACTGACCTTACTAAAGGACCTCATGAATTTTGCTCTCAACATACAATGCTAGTTAAACAGGGTGATGATTATGTGTACCTTCCTTACCCAGATCCATCAAGAATCCTAGGGGCCGGCTGTTTTGTAGATGATATCGTAAAAACAGATGGTACACTTATGATTGAACGGTTCGTGTCTTTAGCTATAGATGCTTACCCACTTACTAAACATCCTAATCAGGAGTATGCTGATGTCTTTCATTTGTACTTACAATACATAAGAAAGCTACATGATGAGTTAACAGGACACATGTTAGACATGTATTCTGTTATGCTTACTAATGATAACACTTCAAGGTATTGGGAACCTGAGTTTTATGAGGCTATGTACACACCGCATACAGTCTTACAGGCTGTTGGGGCTTGTGTTCTTTGCAATTCACAGACTTCATTAAGATGTGGTGCTTGCATACGTAGACCATTCTTATGTTGTAAATGCTGTTACGACCATGTCATATCAACATCACATAAATTAGTCTTGTCTGTTAATCCGTATGTTTGCAATGCTTCAGGTTGTGATGTCACAGATGTGACTCAACTTTACTTAGGAGGTATGAGCTATTATTGTAAATCACATAAACCACCCATTAGTTTTCCATTGTGTGCTAATGGACAAGTTTTTGGTTTATATAAAAATACATGTGTTGGTAGCGATAATGTTACTGACTTTAATGCAATTGCAACATGTGACTGGACAAATGCTGGTGATTACATTTTAGCTAACACCTGTACTGAAAGACTCAAGCTTTTTGCAGCAGAAACGCTCAAAGCTACTGAGGAGACATTTAAACTGTCTTATGGTATTGCTACTGTACGTGAAGTGCTGTCTGACAGAGAATTACATCTTTCATGGGAAGTTGGTAAACCTAGACCACCACTTAACCGAAATTATGTCTTTACTGGTTATCGTGTAACTAAAAACAGTAAAGTACAAATAGGAGAGTACACCTTTGAAAAAGGTGACTATGGTGATGCTGTTGTTTACCGAGGTACAACAACTTACAAATTAAATGTTGGTGATTATTTTGTGCTGACATCACATACAGTAATGCCATTAAGTGCACCTACACTAGTGCCACAAGAGCACTATGTTAGAATTACTGGCTTATACCCAACACTCAATATCTCAGATGAGTTTTCTAGCAATGTTGCAAATTATCAAAAGGTTGGTATGCAAAAGTATTCTACACTCCAGGGACCACCTGGTACTGGTAAGAGTCATTTTGCTATTGGCCTAGCTCTCTACTACCCTTCTGCTCGCATAGTGTATACAGCTTGCTCTCATGCCGCTGTTGATGCACTATGTGAGAAGGCATTAAAATATTTGCCTATAGATAAATGTAGTAGAATTATACCTGCACGTGCTCGTGTAGAGTGTTTTGATAAATTCAAAGTGAATTCAACATTAGAACAGTATGTCTTTTGTACTGTAAATGCATTGCCTGAGACGACAGCAGATATAGTTGTCTTTGATGAAATTTCAATGGCCACAAATTATGATTTGAGTGTTGTCAATGCCAGATTACGTGCTAAGCACTATGTGTACATTGGCGACCCTGCTCAATTACCTGCACCACGCACATTGCTAACTAAGGGCACACTAGAACCAGAATATTTCAATTCAGTGTGTAGACTTATGAAAACTATAGGTCCAGACATGTTCCTCGGAACTTGTCGGCGTTGTCCTGCTGAAATTGTTGACACTGTGAGTGCTTTGGTTTATGATAATAAGCTTAAAGCACATAAAGACAAATCAGCTCAATGCTTTAAAATGTTTTATAAGGGTGTTATCACGCATGATGTTTCATCTGCAATTAACAGGCCACAAATAGGCGTGGTAAGAGAATTCCTTACACGTAACCCTGCTTGGAGAAAAGCTGTCTTTATTTCACCTTATAATTCACAGAATGCTGTAGCCTCAAAGATTTTGGGACTACCAACTCAAACTGTTGATTCATCACAGGGCTCAGAATATGACTATGTCATATTCACTCAAACCACTGAAACAGCTCACTCTTGTAATGTAAACAGATTTAATGTTGCTATTACCAGAGCAAAAGTAGGCATACTTTGCATAATGTCTGATAGAGACCTTTATGACAAGTTGCAATTTACAAGTCTTGAAATTCCACGTAGGAATGTGGCAACTTTACAAGCTGAAAATGTAACAGGACTCTTTAAAGATTGTAGTAAGGTAATCACTGGGTTACATCCTACACAGGCACCTACACACCTCAGTGTTGACACTAAATTCAAAACTGAAGGTTTATGTGTTGACATACCTGGCATACCTAAGGACATGACCTATAGAAGACTCATCTCTATGATGGGTTTTAAAATGAATTATCAAGTTAATGGTTACCCTAACATGTTTATCACCCGCGAAGAAGCTATAAGACATGTACGTGCATGGATTGGCTTCGATGTCGAGGGGTGTCATGCTACTAGAGAAGCTGTTGGTACCAATTTACCTTTACAGCTAGGTTTTTCTACAGGTGTTAACCTAGTTGCTGTACCTACAGGTTATGTTGATACACCTAATAATACAGATTTTTCCAGAGTTAGTGCTAAACCACCGCCTGGAGATCAATTTAAACACCTCATACCACTTATGTACAAAGGACTTCCTTGGAATGTAGTGCGTATAAAGATTGTACAAATGTTAAGTG
ACACACTTAAAAATCTCTCTGACAGAGTCGTATTTGTCTTATGGGCACATGGCTTTGAGTTGACATCTATGAAGTATTTTGTGAAAATAGGACCTGAGCGCACCTGTTGTCTATGTGATAGACGTGCCACATGCTTTTCCACTGCTTCAGACACTTATGCCTGTTGGCATCATTCTATTGGATTTGATTACGTCTATAATCCGTTTATGATTGATGTTCAACAATGGGGTTTTACAGGTAACCTACAAAGCAACCATGATCTGTATTGTCAAGTCCATGGTAATGCACATGTAGCTAGTTGTGATGCAATCATGACTAGGTGTCTAGCTGTCCACGAGTGCTTTGTTAAGCGTGTTGACTGGACTATTGAATATCCTATAATTGGTGATGAACTGAAGATTAATGCGGCTTGTAGAAAGGTTCAACACATGGTTGTTAAAGCTGCATTATTAGCAGACAAATTCCCAGTTCTTCACGACATTGGTAACCC
TAAAGCTATTAAGTGTGTACCTCAAGCTGATGTAGAATGGAAGTTCTATGATGCACAGCCTTGTAGTGACAAAGCTTATAAAATAGAAGAATTATTCTATTCTTATGCCACACATTCTGACAAATTCACAGATGGTGTATGCCTATTTTGGAATTGCAATGTCGATAGATATCCTGCTAATTCCATTGTTTGTAGATTTGACACTAGAGTGCTATCTAACCTTAACTTGCCTGGTTGTGATGGTGGCAGTTTGTATGTAAATAAACATGCATTCCACACACCAGCTTTTGATAAAAGTGCTTTTGTTAATTTAAAACAATTACCATTTTTCTATTACTCTGACAGTCCATGTGAGTCTCATGGAAAACAAGTAGTGTCAGATATAGATTATGTACCACTAAAGTCTGCTACGTGTATAACACGTTGCAATTTAGGTGGTGCTGTCTGTAGACATCATGCTAATGAGTACAGATTGTATCTCGATGCTTATAACATGATGATCTCAGCTGGCTTTAGCTTGTGGGTTTACAAACAATTTGATACTTATAACCTCTGGAACACTTTTACAAGACTTCAGAGTTTAGAAAATGTGGCTTTTAATGTTGTAAATAAGGGACACTTTGATGGACAACAGGGTGAAGTACCAGTTTCTATCATTAATAACACTGTTTACACAAAAGTTGATGGTGTTGATGTAGAATTGTTTGAAAATAAAACAACATTACCTGTTAATGTAGCATTTGAGCTTTGGGCTAAGCGCAACATTAAACCAGTACCAGAGGTGAAAATACTCAATAATTTGGGTGTGGACATTGCTGCTAATACTGTGATCTGGGACTACAAAAGAGATGCTCCAGCACATATATCTACTATTGGTGTTTGTTCTATGACTGACATAGCCAAGAAACCAACTGAAACGATTTGTGCACCACTCACTGTCTTTTTTGATGGTAGAGTTGATGGTCAAGTAGACTTATTTAGAAATGCCCGTAATGGTGTTCTTATTACAGAAGGTAGTGTTAAAGGTTTACAACCATCTGTAGGTCCCAAACAAGCTAGTCTTAATGGAGTCACATTAATTGGAGAAGCCGTAAAAACACAGTTCAATTATTATAAGAAAGTTGATGGTGTTGTCCAACAATTACCTGAAACTTACTTTACTCAGAGTAGAAATTTACAAGAATTTAAACCCAGGAGTCAAATGGAAATTGATTTCTTAGAATTAGCTATGGATGAATTCATTGAACGGTATAAATTAGAAGGCTATGCCTTCGAACATATCGTTTATGGAGATTTTAGTCATAGTCAGTTAGGTGGTTTACATCTACTGATTGGACTAGCTAAACGTTTTAAGGAATCACCTTTTGAATTAGAAGATTTTATTCCTATGGACAGTACAGTTAAAAACTATTTCATAACAGATGCGCAAACAGGTTCATCTAAGTGTGTGTGTTCTGTTATTGATTTATTACTTGATGATTTTGTTGAAATAATAAAATCCCAAGATTTATCTGTAGTTTCTAAGGTTGTCAAAGTGACTATTGACTATACAGAAATTTCATTTATGCTTTGGTGTAAAGATGGCCATGTAGAAACATTTTACCCAAAATTACAATCTAGTCAAGCGTGGCAACCGGGTGTTGCTATGCCTAATCTTTACAAAATGCAAAGAATGCTATTAGAAAAGTGTGACCTTCAAAATTATGGTGATAGTGCAACATTACCTAAAGGCATAATGATGAATGTCGCAAAATATACTCAACTGTGTCAATATTTAAACACATTAACATTAGCTGTACCCTATAATATGAGAGTTATACATTTTGGTGCTGGTTCTGATAAAGGAGTTGCACCAGGTACAGCTGTTTTAAGACAGTGGTTGCCTACGGGTACGCTGCTTGTCGATTCAGATCTTAATGACTTTGTCTCTGATGCAGATTCAACTTTGATTGGTGATTGTGCAACTGTACATACAGCTAATAAATGGGATCTCATTATTAGTGATATGTACGACCCTAAGACTAAAAATGTTACAAAAGAAAATGACTCTAAAGAGGGTTTTTTCACTTACATTTGTGGGTTTATACAACAAAAGCTAGCTCTTGGAGGTTCCGTGGCTATAAAGATAACAGAACATTCTTGGAATGCTGATCTTTATAAGCTCATGGGACACTTCGCATGGTGGACAGCCTTTGTTACTAATGTGAATGCGTCATCATCTGAAGCATTTTTAATTGGATGTAATTATCTTGGCAAACCACGCGAACAAATAGATGGTTATGTCATGCATGCAAATTACATATTTTGGAGGAATACAAATCCAATTCAGTTGTCTTCCTATTCTTTATTTGACATGAGTAAATTTCCCCTTAAATTAAGGGGTACTGCTGTTATGTCTTTAAAAGAAGGTCAAATCAATGATATGATTTTATCTCTTCTTAGTAAAGGTAGACTTATAATTAGAGAAAACAACAGAGTTGTTATTTCTAGTGATGTTCTTGTTAACAACTAAACGAACAATGTTTGTTTTTCTTGTTTTATTGCCACTAGTCTCTAGTCAGTGTGTTAATCTTACAACCAGAACTCAATTACCCCCTGCATACACTAATTCTTTCACACGTGGTGTTTATTACCCTGACAAAGTTTTCAGATCCTCAGTTTTACATTCAACTCAGGACTTGTTCTTACCTTTCTTTTCCAATGTTACTTGGTTCCATGCTATACATGTCTCTGGGACCAATGGTACTAAGAGGTTTGCTAACCCTGTCCTACCATTTAATGATGGTGTTTATTTTGCTTCCACTGAGAAGTCTAACATAATAAGAGGCTGGATTTTTGGTACTACTTTAGATTCGAAGACCCAGTCCCTACTTATTGTTAATAACGCTACTAATGTTGTTATTAAAGTCTGTGAATTTCAATTTTGTAATGATCCATTTTTGGGTGTTTATTACCACAAAAACAACAAAAGTTGGATGGAAAGTGAGTTCAGAGTTTATTCTAGTGCGAATAATTGCACTTTTGAATATGTCTCTCAGCCTTTTCTTATGGACCTTGAAGGAAAACAGGGTAATTTCAAAAATCTTAGGGAATTTGTGTTTAAGAATATTGATGGTTATTTTAAAATATATTCTAAGCACACGCCTATTAATTTAGTGCGTGGTCTCCCTCAGGGTTTTTCGGCTTTAGAACCATTGGTAGATTTGCCAATAGGTATTAACATCACTAGGTTTCAAANNNNNNCTTTACATAGAAGTTATTTGACTCCTGGTGATTCTTCTTCAGGTTGGACAGCTGGTGCTGCAGCTTATTATGTGGGTTATCTTCAACCTAGGACTTTTCTATTAAAATATAATGAAA
ATGGAACCATTACAGATGCTGTAGACTGTGCACTTGACCCTCTCTCAGAAACAAAGTGTACGTTGAAATCCTTCACTGTAGAAAAAGGAATCTATCAAACTTCTAACTTTAGAGTCCAACCAACAGAATCTATTGTTAGATTTCCTAATATTACAAACTTGTGCCCTTTTGGTGAAGTTTTTAACGCCACCAGATTTGCATCTGTTTATGCTTGGAACAGGAAGAGAATCAGCAACTGTGTTGCTGATTATTCTGTCCTATATAATTCCGCATCATTTTCCACTTTTAAGTGTTATGGAGTGTCTCCTACTAAATTAAATGATCTCTGCTTTACTAATGTCTATGCAGATTCATTTGTAATTAGAGGTGATGAAGTCAGACAAATCGCTCCAGGGCAAACTGGAAATATTGCTGATTATAATTATAAATTACCAGATGATTTTACAGGCTGCGTTATAGCTTGGAATTCTAACAATCTTGATTCTAAGGTTGGTGGTAATTATAATTACCTGTATAGATTGTTTAGGAAGTCTAATCTCAAACCTTTTGAGAGAGATATTTCAACTGAAATCTATCAGGCCGGTAGCACACCTTGTAATGGTGTTAAAGGTTTTAATTGTTACTTTCCTTTACAATCATATGGTTTCCAACCCACTTATGGTGTTGGTTACCAACCATACAGAGTAGTAGTACTTTCTTTTGAACTTCTACATGCACCAGCAACTGTTTGTGGACCTAAAAAGTCTACTAATTTGGTTAAAAACAAATGTGTCAATTTCAACTTCAATGGTTTAACAGGCACAGGTGTTCTTACTGAGTCTAACAAAAAGTTTCTGCCTTTCCAACAATTTGGCAGAGACATTGCTGACACTACTGATGCTGTCCGTGATCCACAGACACTTGAGATTCTTGACATTACACCATGTTCTTTTGGTGGTGTCAGTGTTATAACACCAGGAACAAATACTTCTAACCAGGTTGCTGTTCTTTATCAGGGTGTTAACTGCACAGAAGTCCCTGTTGCTATTCATGCAGATCAACTTACTCCTACTTGGCGTGTTTATTCTACAGGTTCTAATGTTTTTCAAACACGTGCAGGCTGTTTAATAGGGGCTGAACATGTCAACAACTCATATGAGTGTGACATACCCATTGGTGCAGGTATATGCGCTAGTTATCAGACTCAGACTAATTCTCCTCGGCGGGCACGTAGTGTAGCTAGTCAATCCATCATTGCCTACACTATGTCACTTGGTGTAGAAAATTCAGTTGCTTACTCTAATAACTCTATTGCCATACCCACAAATTTTACTATTAGTGTTACCACAGAAATTCTACCAGTGTCTATGACCAAGACATCAGTAGATTGTACAATGTACATTTGTGGTGATTCAACTGAATGCAGCAATCTTTTGTTGCAATATGGCAGTTTTTGTACACAATTAAACCGTGCTTTAACTGGAATAGCTGTTGAACAAGACAAAAACACCCAAGAAGTTTTTGCACAAGTCAAACAAATTTACAAAACACCACCAATTAAAGATTTTGGTGGTTTTAATTTTTCACAAATATTACCAGATCCATCAAAACCAAGCAAGAGGTCATTTATTGAAGATCTACTTTTCAACAAAGTGACACTTGCAGATGCTGGCTTCATCAAACAATATGGTGATTGCCTTGGTGATATTGCTGCTAGAGACCTCATTTGTGCACAAAAGTTTAACGGCCTTACTGTTTTGCCACCTTTGCTCACAGATGAAATGATTGCTCAATACACTTCTGCACTGTTAGCGGGTACAATCACTTCTGGTTGGACCTTTGGTGCAGGTGCTGCATTACAAATACCATTTGCTATGCAAATGGCTTATAGGTTTAATGGTATTGGAGTTACACAGAATGTTCTCTATGAGAACCAAAAATTGATTGCCAACCAATTTAATAGTGCTATTGGCAAAATTCAAGACTCACTTTCTTCCACAGCAAGTGCACTTGGAAAACTTCAAGATGTGGTCAACCAAAATGCACAAGCTTTAAACACGCTTGTTAAACAACTTAGCTCCAATTTTGGTGCAATTTCAAGTGTTTTAAATGATATCCTTTCACGTCTTGACAAAGTTGAGGCTGAAGTGCAAATTGATAGGTTGATCACAGGCAGACTTCAAAGTTTGCAGACATATGTGACTCAACAATTAATTAGAGCTGCAGAAATCAGAGCTTCTGCTAATCTTGCTGCTACTAAAATGTCAGAGTGTGTACTTGGACAATCAAAAAGAGTTGATTTTTGTGGAAAGGGCTATCATCTTATGTCCTTCCCTCAGTCAGCACCTCATGGTGTAGTCTTCTTGCATGTGACTTATGTCCCTGCACAAGAAAAGAACTTCACAACTGCTCCTGCCATTTGTCATGATGGAAAAGCACACTTTCCTCGTGAAGGTGTCTTTGTTTCAAATGGCACACACTGGTTTGTAACACAAAGGAATTTTTATGAACCACAAATCATTACTACAGACAACACATTTGTGTCTGGTAACTGTGATGTTGTAATAGGAATTGTCAACAACACAGTTTATGATCCTTTGCAACCTGAATTAGACTCATTCAAGGAGGAGTTAGATAAATATTTTAAGAATCATACATCACCAGATGTTGATTTAGGTGACATCTCTGGCATTAATGCTTCAGTTGTAAACATTCAAAAAGAAATTGACCGCCTCAATGAGGTTGCCAAGAATTTAAATGAATCTCTCATCGATCTCCAAGAACTTGGAAAGTATGAGCAGTATATAAAATGGCCATGGTACATTTGGCTAGGTTTTATAGCTGGCTTGATTGCCATAGTAATGGTGACAATTATGCTTTGCTGTATGACCAGTTGCTGTAGTTGTCTCAAGGGCTGTTGTTCTTGTGGATCCTGCTGCAAATTTGATGAAGACGACTCTGAGCCAGTGCTCAAAGGAGTCAAATTACATTACACATAAACGAACTTATGGATTTGTTTATGAGAATCTTCACAATTGGAACTGTAACTTTGAAGCAAGGTGAAATCAAGGATGCTACTCCTTCAGATTTTGTTCGCGCTACTGCAACGATACCGATACAAGCCTCACTCCCTTTCGGATGGCTTATTGTTGGCGTTGCACTTCTTGCTGTTTTTCATAGCGCTTCCAAAATCATAACCCTCAAAAAGAGATGGCAACTAGCACTCTCCAAGGGTGTTCACTTTGTTTGCAACTTGCTGTTGTTGTTTGTAACAGTTTACTCACACCTTTTGCTCGTTGCTGCTGGCCTTGAAGCCCCTTTTCTCTATCTTTATGCTTTAGTCTACTTCTTGCAGAGTATAAACTTTGTAAGAATAATAATGAGGCTTTGGCTTTGCTGGAAATGCCGTTCCAAAAACCCATTACTTTATGATGCCAACTATTTTCTTTGCTGGCATACTAATTGTTACGACTATTGTATACCTTACAATAGTGTAACTTCTTCAATTGTCATTACTTTAGGTGATGGCACAACAAGTCCTATTTCTGAACATGACTACCAGATTGGTGGTTATACTGAAAAATGGGAATCTGGAGTAAAAGACTGTGTTGTATTACACAGTTACTTCACTTCAGACTATTACCAGCTGTACTCAACTCAATTGAGTACAGACACTGGTGTTGAACATGTTACCTTCTTCATCTACAATAAAATTGTTGATGAGCCTGAAGAACATGTCCAAATTCACACAATCGACGGTTCATCCGGAGTTGTTAATCCAGTAATGGAACCAATTTATGATGAACCGACGACGACTACTAGCGTGCCTTTGTAAGCACAAGCTGATGAGTACGAACTTATGTACTCATTC
GTTTCGGAAGAGACAGGTACGTTAATAGTTAATAGCGTACTTCTTTTTCTTGCTTTCGTGGTATTCTTGCTAGTTACACTAGCCATCCTTACTGCGCTTCGATTGTGTGCGTACTGCTGCAATATTGTTAACGTGAGTCTTGTAAAACCTTCTTTTTACGTTTACTCTCGTGTTAAAAATCTGAATTCTTCTAGAGTTCTTGATCTTCTGGTCTAAACGAACTAAATATTATATTAGTTTTTCTGTTTGGAACTTTAATTTTAGCCATGGCAGATTCCAACGGTACTATTACCGTTGAAGAGCTTAAAAAGCTCCTTGAACAATGGAACCTAGTAATAGGTTTCCTATTCCTTACATGGATTTGTCTTCTACAATTTGCCTATGCCAACAGGAATAGGTTTTTGTATATAATTAAGTTAATTTTCCTCTGGCTGTTATGGCCAGTAACTTTAGCTTGTTTTGTGCTTGCTGCTGTTTACAGAATAAATTGGATCACCGGTGGAATTGCTATCGCAATGGCTTGTCTTGTAGGCTTGATGTGGCTCAGCTACTTCATTGCTTCTTTCAGACTGTTTGCGCGTACGCGTTCCATGTGGTCATTCAATCCAGAAACTAACATTCTTCTCAACGTGCCACTCCATGGCACTATTCTGACCAGACCGCTTCTAGAAAGTGAACTCGTAATCGGAGCTGTGATCCTTCGTGGACATCTTCGTATTGCTGGACACCATCTAGGACGCTGTGACATCAAGGACCTGCCTAAAGAAATCACTGTTGCTACATCACGAACGCTTTCTTATTACAAATTGGGAGCTTCGCAGCGTGTAGCAGGTGACTCAGGTTTTGCTGCATACAGTCGCTACAGGATTGGCAACTATAAATTAAACACAGACCATTCCAGTAGCAGTGACAATATTGCTTTGCTTGTACAGTAAGCGACAACAGATGTTTCATCTCGTTGACTTTCAGGTTACTATAGCAGAGATATTACTAATTATTATGAGGACTTTTAAAGTTTCCATTTGGAATCTTGATTACATCATAAACCTCATAATTAAAAATTTATCTAAGTCACTAACTGAGAATAAATATTCTCAATTAGATGAAGAGCAACCAATGGAGATTGATTAAACGAACATGAAAATTATTCTTTTCTTGGCACTGATAACACTCGCTACTTGTGAGCTTTATCACTACCAAGAGTGTGTTAGAGGTACAACAGTACTTTTAAAAGAACCTTGCTCTTCTGGAACATACGAGGGCAATTCACCATTTCATCCTCTAGCTGATAACAAATTTGCACTGACTTGCTTTAGCACTCAATTTGCTTTTGCTTGTCCTGACGGCGTAAAACACGTCTATCAGTTACGTGCCAGATCAGTTTCACCTAAACTGTTCATCAGACAAGAGGAAGTTCAAGAACTTTACTCTCCAATTTTTCTTATTGTTGCGGCAATAGTGTTTATAACACTTTGCTTCACACTCAAAAGAAAGACAGAATGATTGAACTTTCATTAATTGACTTCTATTTGTGCTTTTTAGCCTTTCTGCTATTCCTTGTTTTAATTATGCTTATTATCTTTTGGTTCTCACTTGAACTGCAAGATCATAATGAAACTTGTCACGCCTAAACGAACATGAAATTTCTTGTTTTCTTAGGAATCATCACAACTGTAGCTGCATTTCACCAAGAATGTAGTTTACAGTCATGTACTCAACATCAACCATATGTAGTTGATGACCCGTGTCCTATTCACTTCTATTCTAAATGGTATATTAGAGTAGGAGCTAGAAAATCAGCACCTTTAATTGAATTGTGCGTGGATGAGGCTGGTTCTAAATCACCCATTCAGTACATCGATATCGGTAATTATACAGTTTCCTGTTTACCTTTTACAATTAATTGCCAGGAACCTAAATTGGGTAGTCTTGTAGTGCGTTGTTCGTTCTATGAAGACTTTTTAGAGTATCATGACGTTCGTGTTGTTTTAGATTTTATCTAAACGAACAAACTAAAATGTCTGATAATGGACCCCAAAATCAGCGAAATGCACCCCGCATTACGTTTGGTGGACCCTCAGATTCAACTGGCAGTAACCAGAATGGAGAACGCAGTGGGGCGCGATCAAAACAACGTCGGCCCCAAGGTTTACCCAATAATACTGCGTCTTGGTTCACCGCTCTCACTCAACATGGCAAGGAAGACCTTAAATTCCCTCGAGGACAAGGCGTTCCAATTAACACCAATAGCAGTCCAGATGACCAAATTGGCTACTACCGAAGAGCTACCAGACGAATTCGTGGTGGTGACGGTAAAATGAAAGATCTCAGTCCAAGATGGTATTTCTACTACCTAGGAACTGGGCCAGAAGCTGGACTT
CCCTATGGTGCTAACAAAGACGGCATCATATGGGTTGCAACTGAGGGAGCCTTGAATACACCAAAAGATCACATTGGCACCCGCAATCCTGCTAACAATGCTGCAATCGTGCTACAACTTCCTCAAGGAACAACATTGCCAAAAGGCTTCTACGCAGAAGGGAGCAGAGGCGGCAGTCAAGCCTCTTCTCGTTCCTCATCACGTAGTCGCAACAGTTCAAGAAATTCAACTCCAGGCAGCAGTAGGGGAATTTCTCCTGCTAGAATGGCTGGCAATGGCGGTGATGCTGCTCTTGCTTTGCTGCTGCTTGACAGATTGAACCAGCTTGAGAGCAAAATGTCTGGTAAAGGCCAACAACAACAAGGCCAAACTGTCACTAAGAAATCTGCTGCTGAGGCTTCTAAGAAGCCTCGGCAAAAACGTACTGCCACTAAAGCATACAATGTAACACAAGCTTTCGGCAGACGTGGTCCAGAACAAACCCAAGGAAATTTTGGGGACCAGGAACTAATCAGACAAGGAACTGATTACAAACATTGGCCGCAAATTGCACAATTTGCCCCCAGCGCTTCAGCGTTCTTCGGAATGTCGCGCATTGGCATGGAAGTCACACCTTCGGGAACGTGGTTGACCTACACAGGTGCCATCAAATTGGATGACAAAGATCCAAATTTCAAAGATCAAGTCATTTTGCTGAATAAGCATATTGACGCATACAAAACATTCCCACCAACAGAGCCTAAAAAGGACAAAAAGAAGAAGGCTGATGAAACTCAAGCCTTACCGCAGAGACAGAAGAAACAGCAAACTGTGACTCTTCTTCCTGCTGCAGATTTGGATGATTTCTCCAAACAATTGCAACAATCCATGAGCAGTGCTGACTCAACTCAGGCCTAAACTCATGCAGACCACACAAGGCAGATGGGCTATATAAACGTTTTCGCTTTTCCGTTTACGATATATAGTCTACTCTTGTGCAGAATGAATTCTCGTAACTACATAGCACAAGTAGATGTAGTTAACTTTAATCTCACATAGCAATCTTTAATC
AGTGTGTAACATTAGGGAGGACTTGAAAGAGCCACCACATTTTCACCGAGGCCACGCGGAGTACGATCGAGTGTACAGTGAACAATGCTAGGGAGAGCTGCCTATATGGAAGAGCCCTAATGTGTAAAATTAATTTTAGTAGTGCTATCCCCATGNNNNNNNNNNNNNNNNNNNNNNNNNNNNNAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA
서열번호: 19
>SA_P2_gp02 표면 당단백질 , 게놈 수탁 SA_P2_t0.9_q20 유래
MFVFLVLLPLVSSQCVNLTTRTQLPPAYTNSFTRGVYYPDKVFRSSVLHSTQDLFLPFFSNVTWFHAIHVSGTNGTKRFANPVLPFNDGVYFASTEKSNIIRGWIFGTTLDSKTQSLLIVNNATNVVIKVCEFQFCNDPFLGVYYHKNNKSWMESEFRVYSSANNCTFEYVSQPFLMDLEGKQGNFKNLREFVFKNIDGYFKIYSKHTPINLVRGLPQGFSALEPLVDLPIGINITRFQXXXLHRSYLTPGDSSSGWTAGAAAYYVGYLQPRTFLLKYNENGTITDAVDCALDPLSETKCTLKSFTVEKGIYQTSNFRVQPTESIVRFPNITNLCPFGEVFNATRFASVYAWNRKRISNCVADYSVLYNSASFSTFKCYGVSPTKLNDLCFTNVYADSFVIRGDEVRQIAPGQTGNIADYNYKLPDDFTGCVIAWNSNNLDSKVGGNYNYLYRLFRKSNLKPFERDISTEIYQAGSTPCNGVKGFNCYFPLQSYGFQPTYGVGYQPYRVVVLSFELLHAPATVCGPKKSTNLVKNKCVNFNFNGLTGTGVLTESNKKFLPFQQFGRDIADTTDAVRDPQTLEILDITPCSFGGVSVITPGTNTSNQVAVLYQGVNCTEVPVAIHADQLTPTWRVYSTGSNVFQTRAGCLIGAEHVNNSYECDIPIGAGICASYQTQTNSPRRARSVASQSIIAYTMSLGVENSVAYSNNSIAIPTNFTISVTTEILPVSMTKTSVDCTMYICGDSTECSNLLLQYGSFCTQLNRALTGIAVEQDKNTQEVFAQVKQIYKTPPIKDFGGFNFSQILPDPSKPSKRSFIEDLLFNKVTLADAGFIKQYGDCLGDIAARDLICAQKFNGLTVLPPLLTDEMIAQYTSALLAGTITSGWTFGAGAALQIPFAMQMAYRFNGIGVTQNVLYENQKLIANQFNSAIGKIQDSLSSTASALGKLQDVVNQNAQALNTLVKQLSSNFGAISSVLNDILSRLDKVEAEVQIDRLITGRLQSLQTYVTQQLIRAAEIRASANLAATKMSECVLGQSKRVDFCGKGYHLMSFPQSAPHGVVFLHVTYVPAQEKNFTTAPAICHDGKAHFPREGVFVSNGTHWFVTQRNFYEPQIITTDNTFVSGNCDVVIGIVNNTVYDPLQPELDSFKEELDKYFKNHTSPDVDLGDISGINASVVNIQKEIDRLNEVAKNLNESLIDLQELGKYEQYIKWPWYIWLGFIAGLIAIVMVTIMLCCMTSCCSCLKGCCSCGSCCKFDEDDSEPVLKGVKLHYT
서열번호: 20
>MW520923.1 중증 급성 호흡기 증후군 코로나바이러스 2 단리물 SARS-CoV-2/인간/USA/MN-MDH-2399/2021, 완전 게놈, 브라질 P1 계통의 예.
CAACTTTCGATCTCTTGTAGATCTGTTCTCTAAACGAACTTTAAAATCTGTGTGGCTGTCACTCGGCTGCATGCTTAGTG
CACTCACGCAGTATAATTAATAACTAATTACTGTCGTTGACAGGACACGAGTAACTCGTCTATCTTCTGCAGGCTGCTTA
CGGTTTCGTCCGTGTTGCAGCCGATCATCAGCACATCTAGGTTTTGTCCGGGTGTGACCGAAAGGTAAGATGGAGAGCCT
TGTCCCTGGTTTCAACGAGAAAACACACGTCCAACTCAGTTTGCCTGTTTTACAGGTTCGCGACGTGCTCGTACGTGGCT
TTGGAGACTCCGTGGAGGAGGTCTTATCAGAGGCACGTCAACATCTTAAAGATGGCACTTGTGGCTTAGTAGAAGTTGAA
AAAGGCGTTTTGCCTCAACTTGAACAGCCCTATGTGTTCATCAAACGTTCGGATGCTCGAACTGCACCTCATGGTCATGT
TATGGTTGAGCTGGTAGCAGAACTCGAAGGCATTCAGTACGGTCGTAGTGGTGAGACACTTGGTGTCCTTGTCCCTCATG
TGGGCGAAATACCAGTGGCTTACCGCAAGGTTCTTCTTCGTAAGAACGGTAATAAAGGAGCTGGTGGCCATAGTTACGGC
GCCGATCTAAAGTCATTTGACTTAGGCGACGAGCTTGGCACTGATCCTTATGAAGACTTTCAAGAAAACTGGAACACTAA
ACATAGCAGTGGTGTTACCCGTGAACTCATGCGTGAGCTTAACGGAGGGGCATACACTCGCTATGTCGATAACAACTTCT
GTGGCCCTGATGGCTACCCTCTTGAGTGCATTAAAGACCTTCTAGCACGTGCTGGTAAAGCTTCATGCACTTTGTCCGAA
CAACTGGACTTTATTGACACTAAGAGGGGTGTATACTGCTGCCGTGAACATGAGCATGAAATTGCTTGGTACACGGAACG
TTCTGAAAAGAGCTATGAATTGCAGACACCTTTTGAAATTAAATTGGCAAAGAAATTTGACACCTTCAATGGGGAATGTC
CAAATTTTGTATTTCCCTTAAATTCCATAATCAAGACTATTCAACCAAGGGTTGAAAAGAAAAAGCTTGATGGCTTTATG
GGTAGAATTCGATCTGTCTATCCAGTTGCGTCACCAAATGAATGCAACCAAATGTGCCTTTCAACTCTCATGAAGTGTGA
TCATTGTGGTGAAACTTCATGGCAGACGGGCGATTTTGTTAAAGCCACTTGCGAATTTTGTGGCACTGAGAATTTGACTA
AAGAAGGTGCCACTACTTGTGGTTACTTACCCCAAAATGCTGTTGTTAAAATTTATTGTCCAGCATGTCACAATTCAGAA
GTAGGACCTGAGCATAGTCTTGCCGAATACCATAATGAATCTGGCTTGAAAACCATTCTTCGTAAGGGTGGTCGCACTAT
TGCCTTTGGAGGCTGTGTGTTCTCTTATGTTGGTTGCCATAACAAGTGTGCCTATTGGGTTCCACGTGCTAGCGCTAACA
TAGGTTGTAACCATACAGGTGTTGTTGGAGAAGGTTCCGAAGGTCTTAATGACAACCTTCTTGAAATACTCCAAAAAGAG
AAAGTCAACATCAATATTGTTGGTGACTTTAAACTTAATGAAGAGATCGCCATTATTTTGGCATCTTTTTCTGCTTCCAC
AAGTGCTTTTGTGGAAACTGTGAAAGGTTTGGATTATAAAGCATTCAAACAAATTGTTGAATCCTGTGGTAATTTTAAAG
TTACAAAAGGAAAAGCTAAAAAAGGTGCCTGGAATATTGGTGAACAGAAATCAATACTGAGTCCTCTTTATGCATTTGCA
TCAGAGGCTGCTCGTGTTGTACGATCAATTTTCTCCCGCACTCTTGAAACTGCTCAAAATTCTGTGCGTGTTTTACAGAA
GGCCGCTATAACAATACTAGATGGAATTTCACAGTATTCACTGAGACTCATTGATGCTATGATGTTCACATCTGATTTGG
CTACTAACAATCTAGTTGTAATGGCCTACATTACAGGTGGTGTTGTTCAGTTGACTTCGCAGTGGCTAACTAACATCTTT
GGCACTGTTTATGAAAAACTCAAACCCGTCCTTGATTGGCTTGAAGAGAAGTTTAAGGAAGGTGTAGAGTTTCTTAGAGA
CGGTTGGGAAATTGTTAAATTTATCTCAACCTGTGCTTGTGAAATTGTCGGTGGACAAATTGTCACCTGTGCAAAGGAAA
TTAAGGAGAGTGTTCAGACATTCTTTAAGCTTGTAAATAAATTTTTGGCTTTGTGTGCTGACTCTATCATTATTGGTGGA
GCTAAACTTAAAGCCTTGAATTTAGGTGAAACATTTGTCACGCACTCAAAGGGATTGTACAGAAAGTGTGTTAAATCCAG
AGAAGAAACTGGCCTACTCATGCCTCTAAAAGCCCCAAAAGAAATTATCTTCTTAGAGGGAGAAACACTTCCCACAGAAG
TGTTAACAGAGGAAGTTGTCTTGAAAACTGGTGATTTACAACCATTAGAACAACCTACTAGTGAAGCTGTTGAAGCTCCA
TTGGTTGGTACACCAGTTTGTATTAACGGGCTTATGTTGCTCGAAATCAAAGACACAGAAAAGTACTGTGCCCTTGCACC
TAATATGATGGTAACAAACAATACCTTCACACTCAAAGGCGGTGCACCAACAAAGGTTACTTTTGGTGATGATACTGTGA
TAGAAGTGCAAGGTTACAAGAGTGTGAATATCACTTTTGAACTTGATGAAAGGATTGATAAAGTACTTAATGAGAAGTGC
TCTGCCTATACAGTTGAACTCGGTACAGAAGTAAATGAGTTCGCCTGTGTTGTGGCAGATGCTGTCATAAAAACTTTGCA
ACCAGTATCTGAATTACTTACACCACTGGGCATTGATTTAGATGAGTGGAGTATGGCTACATACTACTTATTTGATGAGT
CTGGTGAGTTTAAATTGGCTTCACATATGTATTGTTCTTTTTACCCTCCAGATGAGGATGAAGAAGAAGGTGATTGTGAA
GAAGAAGAGTTTGAGCCATCAACTCAATATGAGTATGGTACTGAAGATGATTACCAAGGTAAACCTTTGGAATTTGGTGC
CACTTCTGCTGCTCTTCAACCTGAAGAAGAGCAAGAAGAAGATTGGTTAGATGATGATAGTCAACAAACTGTTGGTCAAC
AAGACGGCAGTGAGGACAATCAGACAACTACTATTCAAACAATTGTTGAGGTTCAACCTCAATTAGAGATGGAACTTACA
CCAGTTGTTCAGACTATTGAAGTGAATAGTTTTAGTGGTTATTTAAAACTTACTGACAATGTATACATTAAAAATGCAGA
CATTGTGGAAGAAGCTAAAAAGGTAAAACCAACAGTGGTTGTTAATGCAGCCAATGTTTACCTTAAACATGGAGGAGGTG
TTGCAGGAGCCTTAAATAAGGCTACTAACAATGCCATGCAAGTTGAATCTGATGATTACATAGCTACTAATGGACCACTT
AAAGTGGGTGGTAGTTGTGTTTTAAGCGGACACAATCTTGCTAAACACTGTCTTCATGTTGTCGGCCCAAATGTTAACAA
AGGTGAAGACATTCAACTTCTTAAGAGTGCTTATGAAAATTTTAATCAGCACGAAGTTCTACTTGCACCATTATTATCAG
CTGGTATTTTTGGTGCTGACCCTATACATTCTTTAAGAGTTTGTGTAGATACTGTTCGCACAAATGTCTACTTAGCTGTC
TTTGATAAAAATCTCTATGACAAACTTGTTTTAAGCTTTTTGGAAATGAAGAGTGAAAAGCAAGTTGAACAAAAGATCGC
TGAGATTCCTAAAGAGGAAGTTAAGCCATTTATAACTGAAAGTAAACCTTCAGTTGAACAGAGAAAACAAGATGATAAGA
AAATCAAAGCTTGTGTTGAAGAAGTTACAACAACTCTGGAAGAAACTAAGTTCCTCACAGAAAACTTGTTACTTTATATT
GACATTAATGGCAATCTTCATCCAGATTCTGCCACTCTTGTTAGTGACATTGACATCACTTTCTTAAAGAAAGATGCTCC
ATATATAGTGGGTGATGTTGTTCAAGAGGGTGTTTTAACTGCTGTGGTTATACCTACTAAAAAGGCTGGTGGCACTACTG
AAATGCTAGCGAAAGCTTTGAGAAAAGTGCCAACAGACAATTATATAACCACTTACCCGGGTCAGGGTTTAAATGGTTAC
ACTGTAGAGGAGGCAAAGACAGTGCTTAAAAAGTGTAAAAGTGCCTTTTACATTCTACCATCTATTATCTCTAATGAGAA
GCAAGAAATTCTTGGAACTGTTTCTTGGAATTTGCGAGAAATGCTTGCACATGCAGAAGAAACACGCAAATTAATGCCTG
TCTGTGTGGAAACTAAAGCCATAGTTTCAACTATACAGCGTAAATATAAGGGTATTAAAATACAAGAGGGTGTGGTTGAT
TATGGTGCTAGATTTTACTTTTACACCAGTAAAACAACTGTAGCGTCACTTATCAACACACTTAACGATCTAAATGAAAC
TCTTGTTACAATGCCACTTGGCTATGTAACACATGGCTTAAATTTGGAAGAAGCTGCTCGGTATATGAGATCTCTCAAAG
TGCCAGCTACAGTTTCTGTTTCTTCACCTGATGCTGTTACAGCGTATAATGGTTATCTTACTTCTTCTTCTAAAACACCT
GAAGAACATTTTATTGAAACCATCTCACTTGCTGGTTCCTATAAAGATTGGTCCTATTCTGGACAATCTACACAACTAGG
TATAGAATTTCTTAAGAGAGGTGATAAAAGTGTATATTACACTAGTAATCCTACCACATTCCACCTAGATGGTGAAGTTA
TCACCTTTGACAATCTTAAGACACTTCTTTCTTTGAGAGAAGTGAGGACTATTAAGGTGTTTACAACAGTAGACAACATT
AACCTCCACACGCAAGTTGTGGACATGTCAATGACATATGGACAACAGTTTGGTCCAACTTATTTGGATGGAGCTGATGT
TACTAAAATAAAACCTCATAATTCACATGAAGGTAAAACATTTTATGTTTTACCTAATGATGACACTCTACGTGTTGAGG
CTTTTGAGTACTACCACACAACTGATCCTAGTTTTCTGGGTAGGTACATGTCAGCATTAAATCACACTAAAAAGTGGAAA
TACCCACAAGTTAATGGTTTAACTTCTATTAAATGGGCAGATAACAACTGTTATCTTGCCACTGCATTGTTAACACTCCA
ACAAATAGAGTTGAAGTTTAATCCACCTGCTCTACAAGATGCTTATTACAGAGCAAGGGCTGGTGAAGCTGCTAACTTTT
GTGCACTTATCTTAGCCTACTGTAATAAGACAGTAGGTGAGTTAGGTGATGTTAGAGAAACAATGAGTTACTTGTTTCAA
CATGCCAATTTAGATTCTTGCAAAAGAGTCTTGAACGTGGTGTGTAAAACTTGTGGACAACAGCAGACAACCCTTAAGGG
TGTAGAAGCTGTTATGTACATGGGCACACTTTCTTATGAACAATTTAAGAAAGGTGTTCAGATACCTTGTACGTGTGGTA
AACAAGCTACACAATATCTAGTACAACAGGAGTCACCTTTTGTTATGATGTCAGCACCACCTGCTCAGTATGAACTTAAG
CATGGTACATTTACTTGTGCTAGTGAGTACACTGGTAATTACCAGTGTGGTCACTATAAACATATAACTTCTAAAGAAAC
TTTGTATTGCATAGACGGTGCTTTACTTACAAAGTCCTCAGAATACAAAGGTCCTATTACGGATGTTTTCTACAAAGAAA
ACAGTTACACAACAACCATAAAACCAGTTACTTATAAATTGGATGGTGTTGTTTGTACAGAAATTGACCCTAAGTTGGAC
AATTATTATAAGAAAGACAATTCTTATTTCACAGAGCAACCAATTGATCTTGTACCAAACCAACCATATCCAAACGCAAG
CTTCGATAATTTTAAGTTTGTATGTGATAATATCAAATTTGCTGATGATTTAAACCAGTTAACTGGTTATAAGAAACCTG
CTTCAAGAGAGCTTAAAGTTACATTTTTCCCTGACTTAAATGGTGATGTGGTGGCTATTGATTATAAACACTACACACCC
TCTTTTAAGAAAGGAGCTAAATTGTTACATAAACCTATTGTTTGGCATGTTAACAATGCAACTAATAAAGCCACGTATAA
ACCAAATACCTGGTGTATACGTTGTCTTTGGAGCACAAAACCGGTTGAAACATCAAATTCGTTTGATGTACTGAAGTCAG
AGGACGCGCAGGGAATGGATAATCTTGCCTGCGAAGATCTAAAACCAGTCTCTGAAGAAGTAGTGGAAAATCCTACCATA
CAGAAAGACGTTCTTGAGTGTAATGTGAAAACTACCGAAGTTGTAGGAGACATTATACTTAAACCAGCAAATAATAGTTT
AAAAATTACAGAAGAGGTTGGCCACACAGATCTAATGGCTGCTTATGTAGACAATTCTAGTCTTACTATTAAGAAACCTA
ATGAATTATCTAGAGTGTTAGGTTTGAAAACCCTTGCTACTCATGGTTTAGCTGCTGTTAATAGTGTCCCTTGGGATACT
ATAGCTAATTATGCTAAGCCTTTTCTTAACAAAGTTGTTAGTACAACTACTAACATAGTTACACGGTGTTTAAACCGTGT
TTGTACTAATTATATGCCTTATTTCTTTACTTTATTGCTACAATTGTGTACTTTTACTAGAAGTACAAATTCTAGAATTA
AAGCATCTATGCCGACTACTATAGCAAAGAATACTGTTAAGAGTGTCGGTAAATTTTGTCTAGAGGCTTCATTTAATTAT
TTGAAGTCACCTAATTTTTCTAAACTGATAAATATTATAATTTGGTTTTTACTATTAAGTGTTTGCCTAGGTTCTTTAAT
CTACTCAACCGCTGCTTTAGGTGTTTTAATGTCTAATTTAGGCATGCCTTCTTACTGTACTGGTTACAGAGAAGGCTATT
TGAACTCTACTAATGTCACTATTGCAACCTACTGTACTGGTTCTATACCTTGTAGTGTTTGTCTTAGTGGTTTAGATTCT
TTAGACACCTATCCTTCTTTAGAAACTATACAAATTACCATTTCATCTTTTAAATGGGATTTAACTGCTTTTGGCTTAGT
TGCAGAGTGGTTTTTGGCATATATTCTTTTCACTAGGTTTTTCTATGTACTTGGATTGGCTGCAATCATGCAATTGTTTT
TCAGCTATTTTGCAGTACATTTTATTAGTAATTCTTGGCTTATGTGGTTAATAATTAATCTTGTACAAATGGCCCCGATT
TCAGCTATGGTTAGAATGTACATCTTCTTTGCATCATTTTATTATGTATGGAAAAGTTATGTGCATGTTGTAGACGGTTG
TAATTCATCAACTTGTATGATGTGTTACAAACGTAATAGAGCAACAAGAGTCGAATGTACAACTATTGTTAATGGTGTTA
GAAGGTCCTTTTATGTCTATGCTAATGGAGGTAAAGGCTTTTGCAAACTACACAATTGGAATTGTGTTAATTGTGATACA
TTCTGTGCTGGTAGTACATTTATTAGTGATGAAGTTGCGAGAGACTTGTCACTACAGTTTAAAAGACCAATAAATCCTAC
TGACCAGTCTTCTTACATCGTTGATAGTGTTACAGTGAAGAATGGTTCCATCCATCTTTACTTTGATAAAGCTGGTCAAA
AGACTTATGAAAGACATTCTCTCTCTCATTTTGTTAACTTAGACAACCTGAGAGCTAATAACACTAAAGGTTCATTGCCT
ATTAATGTTATAGTTTTTGATGGTAAATCAAAATGTGAAGAATCATCTGCAAAATCAGCGTCTGTTTACTACAGTCAGCT
TATGTGTCAACCTATACTGTTACTAGATCAGGCATTAGTGTCTGATGTTGGTGATAGTGCGGAAGTTGCAGTTAAAATGT
TTGATGCTTACGTTAATACGTTTTCATCAACTTTTAACGTACCAATGGAAAAACTCAAAACACTAGTTGCAACTGCAGAA
GCTGAACTTGCAAAGAATGTGTCCTTAGACAATGTCTTATCTACTTTTATTTCAGCAGCTCGGCAAGGGTTTGTTGATTC
AGATGTAGAAACTAAAGATGTTGTTGAATGTCTTAAATTGTCACATCAATCTGACATAGAAGTTACTGGCGATAGTTGTA
ATAACTATATGCTCACCTATAACAAAGTTGAAAACATGACACCCCGTGACCTTGGTGCTTGTATTGACTGTAGTGCGCGT
CATATTAATGCGCAGGTAGCAAAAAGTCACAACATTGCTTTGATATGGAACGTTAAAGATTTCATGTCATTGTCTGAACA
ACTACGAAAACAAATACGTAGTGCTGCTAAAAAGAATAACTTACCTTTTAAGTTGACATGTGCAACTACTAGACAAGTTG
TTAATGTTGTAACAACAAAGATAGCACTTAAGGGTGGTAAAATTGTTAATAATTGGTTGAAGCAGTTAATTAAAGTTACA
CTTGTGTTCCTTTTTGTTGCTGCTATTTTCTATTTAATAACACCTGTTCATGTCATGTCTAAACATACTGACTTTTCAAG
TGAAATCATAGGATACAAGGCTATTGATGGTGGTGTCACTCGTGACATAGCATCTACAGATACTTGTTTTGCTAACAAAC
ATGCTGATTTTGACACATGGTTTAGCCAGCGTGGTGGTAGTTATACTAATGACAAAGCTTGCCCATTGATTGCTGCAGTC
ATAACAAGAGAAGTGGGTTTTGTCGTGCCTGGTTTGCCTGGCACGATATTACGCACAACTAATGGTGACTTTTTGCATTT
CTTACCTAGAGTTTTTAGTGCAGTTGGTAACATCTGTTACACACCATCAAAACTTATAGAGTACACTGACTTTGCAACAT
CAGCTTGTGTTTTGGCTGCTGAATGTACAATTTTTAAAGATGCTTCTGGTAAGCCAGTACCATATTGTTATGATACCAAT
GTACTAGAAGGTTCTGTTGCTTATGAAAATTTACGCCCTGACACACGTTATGTGCTCATGGATGGCTCTATTATTCAATT
TCCTAACACCTACCTTGAAGGTTCTGTTAGAGTGGTAACAACTTTTGATTCTGAGTACTGTAGGCACGGCACTTGTGAAA
GATCAGAAGCTGGTGTTTGTGTATCTACTAGTGGTAGATGGGTACTTAACAATGATTATTACAGATCTTTACCAGGAGTT
TTCTGTGGTGTAGATGCTGTAAATTTACTTACTAATATGTTTACACCACTAATTCAACCTATTGGTGCTTTGGACATATC
AGCATCTATAGTAGCTGGTGGTATTGTAGCTATCGTAGTAACATGCCTTGCCTACTATTTTATGAGGTTTAGAAGAGCTT
TTGGTGAATACAGTCATGTAGTTGCCTTTAATACTTTACTATTCCTTATGTCATTCACTGTACTCTGTTTAACACCAGTT
TACTCATTCTTACCTGGTGTTTATTCTGTTATTTACTTGTACTTGACATTTTATCTTACTAATGATGTTTCTTTTTTAGC
ACATATTCAGTGGATGGTTATGTTCACACCTTTAGTACCTTTCTGGATAACAATTGCTTATATCATTTGTATTTCCACAA
AGCATTTCTATTGGTTCTTTAGTAATTACCTAAAGAGACGTGTAGTCTTTAATGGTGTTTCCTTTAGTACTTTTGAAGAA
GCTGCGCTGTGCACCTTTTTGTTAAATAAAGAAATGTATCTAAAGTTGCGTAGTGATGTGCTATTACCTCTTACGCAATA
TAATAGATACTTAGCTCTTTATAATAAGTACAAGTATTTTAGTGGAGCAATGGATACAACTAGCTACAGAGAAGCTGCTT
GTTGTCATCTCGCAAAGGCTCTCAATGACTTCAGTAACTCAGGTTCTGATGTTCTTTACCAACCACCACAAACCTCTATC
ACCTCAGCTGTTTTGCAGAGTGGTTTTAGAAAAATGGCATTCCCATCTGGTAAAGTTGAGGGTTGTATGGTACAAGTAAC
TTGTGGTACAACTACACTTAACGGTCTTTGGCTTGATGACGTAGTTTACTGTCCAAGACATGTGATCTGCACCTCTGAAG
ACATGCTTAACCCTAATTATGAAGATTTACTCATTCGTAAGTCTAATCATAATTTCTTGGTACAGGCTGGTAATGTTCAA
CTCAGGGTTATTGGACATTCTATGCAAAATTGTGTACTTAAGCTTAAGGTTGATACAGCCAATCCTAAGACACCTAAGTA
TAAGTTTGTTCGCATTCAACCAGGACAGACTTTTTCAGTGTTAGCTTGTTACAATGGTTCACCATCTGGTGTTTACCAAT
GTGCTATGAGGCCCAATTTCACTATTAAGGGTTCATTCCTTAATGGTTCATGTGGTAGTGTTGGTTTTAACATAGATTAT
GACTGTGTCTCTTTTTGTTACATGCACCATATGGAATTACCAACTGGAGTTCATGCTGGCACAGACTTAGAAGGTAACTT
TTATGGACCTTTTGTTGACAGGCAAACAGCACAAGCAGCTGGTACGGACACAACTATTACAGTTAATGTTTTAGCTTGGT
TGTACGCTGCTGTTATAAATGGAGACAGGTGGTTTCTCAATCGATTTACCACAACTCTTAATGACTTTAACCTTGTGGCT
ATGAAGTACAATTATGAACCTCTAACACAAGACCATGTTGACATACTAGGACCTCTTTCTGCTCAAACTGGAATTGTCGT
TTTAGATATGTGTGCTTCATTAAAAGAATTACTGCAAAATGGTATGAATGGACGTACCATATTGGGTAGTGCTTTATTAG
AAGATGAATTTACACCTTTTGATGTTGTTAGACAATGCTCAGGTGTTACTTTCCAAAGTGCAGTGAAAAGAACAATCAAG
GGTACACACCACTGGTTGTTACTCACAATTTTGACTTCACTTTTAGTTTTAGTCCAGAGTACTCAATGGTCTTTGTTCTT
TTTTTTGTATGAAAATGCCTTTTTACCTTTTGCTATGGGTATTATTGCTATGTCTGCTTTTGCAATGATGTTTGTCAAAC
ATAAGCATGCATTTCTCTGTTTGTTTTTGTTACCTTCTCTTGCCACTGTAGCTTATTTTAATATGGTCTATATGCCTGCT
AGTTGGGTGATGCGTATTATGACATGGTTGGATATGGTTGATACTAGTTTGAAGCTAAAAGACTGTGTTATGTATGCATC
AGCTGTAGTGTTACTAATCCTTATGACAGCAAGAACTGTGTATGATGATGGTGCTAGGAGAGTGTGGACACTTATGAATG
TCTTGACACTCGTTTATAAAGTTTATTATGGTAATGCTTTAGATCAAGCCATTTCCATGTGGGCTCTTATAATCTCTGTT
ACTTCTAACTACTCAGGTGTAGTTACAACTGTCATGTTTTTGGCCAGAGGTATTGTTTTTATGTGTGTTGAGTATTGCCC
TATTTTCTTCATAACTGGTAATACACTTCAGTGTATAATGCTAGTTTATTGTTTCTTAGGCTATTTTTGTACTTGTTACT
TTGGCCTCTTTTGTTTACTCAACCGCTACTTTAGACTGACTCTTGGTGTTTATGATTACTTAGTTTCTACACAGGAGTTT
AGATATATGAATTCACAGGGACTACTCCCACCCAAGAATAGCATAGATGCCTTCAAACTCAACATTAAATTGTTGGGTGT
TGGTGGCAAACCTTGTATCAAAGTAGCCACTGTACAGTCTAAAATGTCAGATGTAAAGTGCACATCAGTAGTCTTACTCT
CAGTTTTGCAACAACTCAGAGTAGAATCATCATCTAAATTGTGGGCTCAATGTGTCCAGTTACACAATGACATTCTCTTA
GCTAAAGATACTACTGAAGCCTTTGAAAAAATGGTTTCACTACTTTCTGTTTTGCTTTCCATGCAGGGTGCTGTAGACAT
AAACAAGCTTTGTGAAGAAATGCTGGACAACAGGGCAACCTTACAAGCTATAGCCTCAGAGTTTAGTTCCCTTCCATCAT
ATGCAGCTTTTGCTACTGCTCAAGAAGCTTATGAGCAGGCTGTTGCTAATGGTGATTCTGAAGTTGTTCTTAAAAAGTTG
AAGAAGTCTTTGAATGTGGCTAAATCTGAATTTGACCGTGATGCAGCCATGCAACGTAAGTTGGAAAAGATGGCTGATCA
AGCTATGACCCAAATGTATAAACAGGCTAGATCTGAGGACAAGAGGGCAAAAGTTACTAGTGCTATGCAGACAATGCTTT
TCACTATGCTTAGAAAGTTGGATAATGATGCACTCAACAACATTATCAACAATGCAAGAGATGGTTGTGTTCCCTTGAAC
ATAATACCTCTTACAACAGCAGCCAAACTAATGGTTGTCATACCAGACTATAACACATATAAAAATACGTGTGATGGTAC
AACATTTACTTATGCATCAGCATTGTGGGAAATCCAACAGGTTGTAGATGCAGATAGTAAAATTGTTCAACTTAGTGAAA
TTAGTATGGACAATTCACCTAATTTAGCATGGCCTCTTATTGTAACAGCTTTAAGGGCCAATTCTGCTGTCAAATTACAG
AATAATGAGCTTAGTCCTGTTGCACTACGACAGATGTCTTGTGCTGCCGGTACTACACAAACTGCTTGCACTGATGACAA
TGCGTTAGCTTATTACAACACAACAAAGGGAGGTAGGTTTGTACTTGCACTGTTATCCGATTTACAGGATTTGAAATGGG
CTAGATTCCCTAAGAGTGATGGAACTGGTACTATCTATACAGAACTGGAACCACCTTGTAGGTTTGTTACAGACACACCT
AAAGGTCCTAAAGTGAAGTATTTATACTTTATTAAAGGATTAAACAACCTAAATAGAGGTATGGTACTTGGTAGTTTAGC
TGCCACAGTACGTCTACAAGCTGGTAATGCAACAGAAGTGCCTGCCAATTCAACTGTATTATCTTTCTGTGCTTTTGCTG
TAGATGCTGCTAAAGCTTACAAAGATTATCTAGCTAGTGGGGGACAACCAATCACTAATTGTGTTAAGATGTTGTGTACA
CACACTGGTACTGGTCAGGCAATAACAGTTACACCGGAAGCCAATATGGATCAAGAATCCTTTGGTGGTGCATCGTGTTG
TCTGTACTGCCGTTGCCACATAGATCATCCAAATCCTAAAGGATTTTGTGACTTAAAAGGTAAGTATGTACAAATACCTA
CAACTTGTGCTAATGACCCTGTGGGTTTTACACTTAAAAACACAGTCTGTACCGTCTGCGGTATGTGGAAAGGTTATGGC
TGTAGTTGTGATCAACTCCGCGAACCCATGCTTCAGTCAGCTGATGCACAATCGTTTTTAAACGGGTTTGCGGTGTAAGT
GCAGCCCGTCTTACACCGTGCGGCACAGGCACTAGTACTGATGTCGTATACAGGGCTTTTGACATCTACAATGATAAAGT
AGCTGGTTTTGCTAAATTCCTAAAAACTAATTGTTGTCGCTTCCAAGAAAAGGACGAAGATGACAATTTAATTGATTCTT
ACTTTGTAGTTAAGAGACACACTTTCTCTAACTACCAACATGAAGAAACAATTTATAATTTACTTAAGGATTGTCCAGCT
GTTGCTAAACATGACTTCTTTAAGTTTAGAATAGACGGTGACATGGTACCACATATATCACGTCAACGTCTTACTAAATA
CACAATGGCAGACCTCGTCTATGCTTTAAGGCATTTTGATGAAGGTAATTGTGATACATTAAAAGAAATACTTGTCACAT
ACAATTGTTGTGATGATGATTATTTCAATAAAAAGGACTGGTATGATTTTGTAGAAAACCCAGATATATTACGCGTATAC
GCCAACTTAGGTGAACGTGTACGCCAAGCTTTGTTAAAAACAGTACAATTCTGTGATGCCATGCGAAATGCTGGTATTGT
TGGTGTACTGACATTAGATAATCAAGATCTCAATGGTAACTGGTATGATTTCGGTGATTTCATACAAACCACGCCAGGTA
GTGGAGTTCCTGTTGTAGATTCTTATTATTCATTGTTAATGCCTATATTAACCTTGACCAGGGCTTTAACTGCAGAGTCA
CATGTTGACACTGACTTAACAAAGCCTTACATTAAGTGGGATTTGTTAAAATATGACTTCACGGAAGAGAGGTTAAAACT
CTTTGACCGTTATTTTAAATATTGGGATCAGACATACCACCCAAATTGTGTTAACTGTTTGGATGACAGATGCATTCTGC
ATTGTGCAAACTTTAATGTTTTATTCTCTACAGTGTTCCCACTTACAAGTTTTGGACCACTAGTGAGAAAAATATTTGTT
GATGGTGTTCCATTTGTAGTTTCAACTGGATACCACTTCAGAGAGCTAGGTGTTGTACATAATCAGGATGTAAACTTACA
TAGCTCTAGACTTAGTTTTAAGGAATTACTTGTGTATGCTGCTGACCCTGCTATGCACGCTGCTTCTGGTAATCTATTAC
TAGATAAACGCACTACGTGCTTTTCAGTAGCTGCACTTACTAACAATGTTGCTTTTCAAACTGTCAAACCCGGTAATTTT
AACAAAGACTTCTATGACTTTGCTGTGTCTAAGGGTTTCTTTAAGGAAGGAAGTTCTGTTGAATTAAAACACTTCTTCTT
TGCTCAGGATGGTAATGCTGCTATCAGCGATTATGACTACTATCGTTATAATCTACCAACAATGTGTGATATCAGACAAC
TACTATTTGTAGTTGAAGTTGTTGATAAGTACTTTGATTGTTACGATGGTGGCTGTATTAATGCTAACCAAGTCATCGTC
AACAACCTAGACAAATCAGCTGGTTTTCCATTTAATAAATGGGGTAAGGCTAGACTTTATTATGATTCAATGAGTTATGA
GGATCAAGATGCACTTTTCGCATATACAAAACGTAATGTCATCCCTACTATAACTCAAATGAATCTTAAGTATGCCATTA
GTGCAAAGAATAGAGCTCGCACCGTAGCTGGTGTCTCTATCTGTAGTACTATGACCAATAGACAGTTTCATCAAAAATTA
TTGAAATCAATAGCCGCCACTAGAGGAGCTACTGTAGTAATTGGAACAAGCAAATTCTATGGTGGTTGGCACAACATGTT
AAAAACTGTTTATAGTGATGTAGAAAACCCTCACCTTATGGGTTGGGATTATCCTAAATGTGATAGAGCCATGCCTAACA
TGCTTAGAATTATGGCCTCACTTGTTCTTGCTCGCAAACATACAACGTGTTGTAGCTTGTCACACCGTTTCTATAGATTA
GCTAATGAGTGTGCTCAAGTATTGAGTGAAATGGTCATGTGTGGCGGTTCACTATATGTTAAACCAGGTGGAACCTCATC
AGGAGATGCCACAACTGCTTATGCTAATAGTGTTTTTAACATTTGTCAAGCTGTCACGGCCAATGTTAATGCACTTTTAT
CTACTGATGGTAACAAAATTGCCGATAAGTATGTCCGCAATTTACAACACAGACTTTATGAGTGTCTCTATAGAAATAGA
GATGTTGACACAGACTTTGTGAATGAGTTTTACGCATATTTGCGTAAACATTTCTCAATGATGATACTCTCTGACGATGC
TGTTGTGTGTTTCAATAGCACTTATGCATCTCAAGGTCTAGTGGCTAGCATAAAGAACTTTAAGTCAGTTCTTTATTATC
AAAACAATGTTTTTATGTCTGAAGCAAAATGTTGGACTGAGACTGACCTTACTAAAGGACCTCATGAATTTTGCTCTCAA
CATACAATGCTAGTTAAACAGGGTGATGATTATGTGTACCTTCCTTACCCAGATCCATCAAGAATCCTAGGGGCCGGCTG
TTTTGTAGATGATATCGTAAAAACAGATGGTACACTTATGATTGAACGGTTCGTGTCTTTAGCTATAGATGCTTACCCAC
TTACTAAACATCCTAATCAGGAGTATGCTGATGTCTTTCATTTGTACTTACAATACATAAGAAAGCTACATGATGAGTTA
ACAGGACACATGTTAGACATGTATTCTGTTATGCTTACTAATGATAACACTTCAAGGTATTGGGAACCTGAGTTTTATGA
GGCTATGTACACACCGCATACAGTCTTACAGGCTGTTGGGGCTTGTGTTCTTTGCAATTCACAGACTTCATTAAGATGTG
GTGCTTGCATACGTAGACCATTCTTATGTTGTAAATGCTGTTACGACCATGTCATATCAACATCACATAAATTAGTCTTG
TCTGTTAATCCGTATGTTTGCAATGCTCCAGGTTGTGATGTCACAGATGTGACTCAACTTTACTTAGGAGGTATGAGCTA
TTATTGTAAATCACATAAACCACCCATTAGTTTTCCATTGTGTGCTAATGGACAAGTTTTTGGTTTATATAAAAATACAT
GTGTTGGTAGCGATAATGTTACTGACTTTAATGCAATTGCAACATGTGACTGGACAAATGCTGGTGATTACATTTTAGCT
AACACCTGTACTGAAAGACTCAAGCTTTTTGCAGCAGAAACGCTCAAAGCTACTGAGGAGACATTTAAACTGTCTTATGG
TATTGCTACTGTACGTGAAGTGCTGTCTGACAGAGAATTACATCTTTCATGGGAAGTTGGTAAACCTAGACCACCACTTA
ACCGAAATTATGTCTTTACTGGTTATCGTGTAACTAAAAACAGTAAAGTACAAATAGGAGAGTACACCTTTGAAAAAGGT
GACTATGGTGATGCTGTTGTTTACCGAGGTACAACAACTTACAAATTAAATGTTGGTGATTATTTTGTGCTGACATCACA
TACAGTAATGCCATTAAGTGCACCTACACTAGTGCCACAAGAGCACTATGTTAGAATTACTGGCTTATACCCAACACTCA
ATATCTCAGATGAGTTTTCTAGCAATGTTGCAAATTATCAAAAGGTTGGTATGCAAAAGTATTCTACACTCCAGGGACCA
CCTGGTACTGGTAAGAGTCATTTTGCTATTGGCCTAGCTCTCTACTACCCTTCTGCTCGCATAGTGTATACAGCTTGCTC
TCATGCCGCTGTTGATGCACTATGTGAGAAGGCATTAAAATATTTGCCTATAGATAAATGTAGTAGAATTATACCTGCAC
GTGCTCGTGTAGATTGTTTTGATAAATTCAAAGTGAATTCAACATTAGAACAGTATGTCTTTTGTACTGTAAATGCATTG
CCTGAGACGACAGCAGATATAGTTGTCTTTGATGAAATTTCAATGGCCACAAATTATGATTTGAGTGTTGTCAATGCCAG
ATTACGTGCTAAGCACTATGTGTACATTGGCGACCCTGCTCAATTACCTGCACCACGCACATTGCTAACTAAGGGCACAC
TAGAACCAGAATATTTCAATTCAGTGTGTAGACTTATGAAAACTATAGGTCCAGACATGTTCCTCGGAACTTGTCGGCGT
TGTCCTGCTGAAATTGTTGACACTGTGAGTGCTTTGGTTTATGATAATAAGCTTAAAGCACATAAAGACAAATCAGCTCA
ATGCTTTAAAATGTTTTATAAGGGTGTTATCACGCATGATGTTTCATCTGCAATTAACAGGCCACAAATAGGCGTGGTAA
GAGAATTCCTTACACGTAACCCTGCTTGGAGAAAAGCTGTCTTTATTTCACCTTATAATTCACAGAATGCTGTAGCCTCA
AAGATTTTGGGACTACCAACTCAAACTGTTGATTCATCACAGGGCTCAGAATATGACTATGTCATATTCACTCAAACCAC
TGAAACAGCTCACTCTTGTAATGTAAACAGATTTAATGTTGCTATTACCAGAGCAAAAGTAGGCATACTTTGCATAATGT
CTGATAGAGACCTTTATGACAAGTTGCAATTTACAAGTCTTGAAATTCCACGTAGGAATGTGGCAACTTTACAAGCTGAA
AATGTAACAGGACTCTTTAAAGATTGTAGTAAGGTAATCACTGGGTTACATCCTACACAGGCACCTACACACCTCAGTGT
TGACACTAAATTCAAAACTGAAGGTTTATGTGTTGACATACCTGGCATACCTAAGGACATGACCTATAGAAGACTCATCT
CTATGATGGGTTTTAAAATGAATTATCAAGTTAATGGTTACCCTAACATGTTTATCACCCGCGAAGAAGCTATAAGACAT
GTACGTGCATGGATTGGCTTCGATGTCGAGGGGTGTCATGCTACTAGAGAAGCTGTTGGTACCAATTTACCTTTACAGCT
AGGTTTTTCTACAGGTGTTAACCTAGTTGCTGTACCTACAGGTTATGTTGATACACCTAATAATACAGATTTTTCCAGAG
TTAGTGCTAAACCACCGCCTGGAGATCAATTTAAACACCTCATACCACTTATGTACAAAGGACTTCCTTGGAATGTAGTG
CGTATAAAGATTGTACAAATGTTAAGTGACACACTTAAAAATCTCTCTGACAGAGTCGTATTTGTCTTATGGGCACATGG
CTTTGAGTTGACATCTATGAAGTATTTTGTGAAAATAGGACCTGAGCGCACCTGTTGTCTATGTGATAGACGTGCCACAT
GCTTTTCCACTGCTTCAGACACTTATGCCTGTTGGCATCATTCTATTGGATTTGATTACGTCTATAATCCGTTTATGATT
GATGTTCAACAATGGGGTTTTACAGGTAACCTACAAAGCAACCATGATCTGTATTGTCAAGTCCATGGTAATGCACATGT
AGCTAGTTGTGATGCAATCATGACTAGGTGTCTAGCTGTCCACGAGTGCTTTGTTAAGCGTGTTGACTGGACTATTGAAT
ATCCTATAATTGGTGATGAACTGAAGATTAATGCGGCTTGTAGAAAGGTTCAACACATGGTTGTTAAAGCTGCATTATTA
GCAGACAAATTCCCAGTTCTTCACGACATTGGTAACCCTAAAGCTATTAAGTGTGTACCTCAAGCTGATGTAGAATGGAA
GTTCTATGATGCACAGCCTTGTAGTGACAAAGCTTATAAAATAGAAGAATTATTCTATTCTTATGCCACACATTCTGACA
AATTCACAGATGGTGTATGCCTATTTTGGAATTGCAATGTCGATAGATATCCTGCTAATTCCATTGTTTGTAGATTTGAC
ACTAGAGTGCTATCTAACCTTAACTTGCCTGGTTGTGATGGTGGCAGTTTGTATGTAAATAAACATGCATTCCACACACC
AGCTTTTGATAAAAGTGCTTTTGTTAATTTAAAACAATTACCATTTTTCTATTACTCTGACAGTCCATGTGAGTCTCATG
GAAAACAAGTAGTGTCAGATATAGATTATGTACCACTAAAGTCTGCTACGTGTATAACACGTTGCAATTTAGGTGGTGCT
GTCTGTAGACATCATGCTAATGAGTACAGATTGTATCTCGATGCTTATAACATGATGATCTCAGCTGGCTTTAGCTTGTG
GGTTTACAAACAATTTGATACTTATAACCTCTGGAACACTTTTACAAGACTTCAGAGTTTAGAAAATGTGGCTTTTAATG
TTGTAAATAAGGGACACTTTGATGGACAACAGGGTGAAGTACCAGTTTCTATCATTAATAACACTGTTTACACAAAAGTT
GATGGTGTTGATGTAGAATTGTTTGAAAATAAAACAACATTACCTGTTAATGTAGCATTTGAGCTTTGGGCTAAGCGCAA
CATTAAACCAGTACCAGAGGTGAAAATACTCAATAATTTGGGTGTGGACATTGCTGCTAATACTGTGATCTGGGACTACA
AAAGAGATGCTCCAGCACATATATCTACTATTGGTGTTTGTTCTATGACTGACATAGCCAAGAAACCAACTGAAACGATT
TGTGCACCACTCACTGTCTTTTTTGATGGTAGAGTTGATGGTCAAGTAGACTTATTTAGAAATGCCCGTAATGGTGTTCT
TATTACAGAAGGTAGTGTTAAAGGTTTACAACCATCTGTAGGTCCCAAACAAGCTAGTCTTAATGGAGTCACATTAATTG
GAGAAGCCGTAAAAACACAGTTCAATTATTATAAGAAAGTTGATGGTGTTGTCCAACAATTACCTGAAACTTACTTTACT
CAGAGTAGAAATTTACAAGAATTTAAACCCAGGAGTCAAATGGAAATTGATTTCTTAGAATTAGCTATGGATGAATTCAT
TGAACGGTATAAATTAGAAGGCTATGCCTTCGAACATATCGTTTATGGAGATTTTAGTCATAGTCAGTTAGGTGGTTTAC
ATCTACTGATTGGACTAGCTAAACGTTTTAAGGAATCACCTTTTGAATTAGAAGATTTTATTCCTATGGACAGTACAGTT
AAAAACTATTTCATAACAGATGCGCAAACAGGTTCATCTAAGTGTGTGTGTTCTGTTATTGATTTATTACTTGATGATTT
TGTTGAAATAATAAAATCCCAAGATTTATCTGTAGTTTCTAAGGTTGTCAAAGTGACTATTGACTATACAGAAATTTCAT
TTATGCTTTGGTGTAAAGATGGCCATGTAGAAACATTTTACCCAAAATTACAATCTAGTCAAGCGTGGCAACCGGGTGTT
GCTATGCCTAATCTTTACAAAATGCAAAGAATGCTATTAGAAAAGTGTGACCTTCAAAATTATGGTGATAGTGCAACATT
ACCTAAAGGCATAATGATGAATGTCGCAAAATATACTCAACTGTGTCAATATTTAAACACATTAACATTAGCTGTACCCT
ATAATATGAGAGTTATACATTTTGGTGCTGGTTCTGATAAAGGAGTTGCACCAGGTACAGCTGTTTTAAGACAGTGGTTG
CCTACGGGTACGCTGCTTGTCGATTCAGATCTTAATGACTTTGTCTCTGATGCAGATTCAACTTTGATTGGTGATTGTGC
AACTGTACATACAGCTAATAAATGGGATCTCATTATTAGTGATATGTACGACCCTAAGACTAAAAATGTTACAAAAGAAA
ATGACTCTAAAGAGGGTTTTTTCACTTACATTTGTGGGTTTATACAACAAAAGCTAGCTCTTGGAGGTTCCGTGGCTATA
AAGATAACAGAACATTCTTGGAATGCTGATCTTTATAAGCTCATGGGACACTTCGCATGGTGGACAGCCTTTGTTACTAA
TGTGAATGCGTCATCATCTGAAGCATTTTTAATTGGATGTAATTATCTTGGCAAACCACGCGAACAAATAGATGGTTATG
TCATGCATGCAAATTACATATTTTGGAGGAATACAAATCCAATTCAGTTGTCTTCCTATTCTTTATTTGACATGAGTAAA
TTTCCCCTTAAATTAAGGGGTACTGCTGTTATGTCTTTAAAAGAAGGTCAAATCAATGATATGATTTTATCTCTTCTTAG
TAAAGGTAGACTTATAATTAGAGAAAACAACAGAGTTGTTATTTCTAGTGATGTTCTTGTTAACAACTAAACGAACAATG
TTTGTTTTTCTTGTTTTATTGCCACTAGTCTCTAGTCAGTGTGTTAATTTTACAAACAGAACTCAATTACCCTCTGCATA
CACTAATTCTTTCACACGTGGTGTTTATTACCCTGACAAAGTTTTCAGATCCTCAGTTTTACATTCAACTCAGGACTTGT
TCTTACCTTTCTTTTCCAATGTTACTTGGTTCCATGCTATACATGTCTCTGGGACCAATGGTACTAAGAGGTTTGATAAC
CCTGTCCTACCATTTAATGATGGTGTTTATTTTGCTTCCACTGAGAAGTCTAACATAATAAGAGGCTGGATTTTTGGTAC
TACTTTAGATTCGAAGACCCAGTCCCTACTTATTGTTAATAACGCTACTAATGTTGTTATTAAAGTCTGTGAATTTCAAT
TTTGTAATTATCCATTTTTGGGTGTTTATTACCACAAAAACAACAAAAGTTGGATGGAAAGTGAGTTCAGAGTTTATTCT
AGTGCGAATAATTGCACTTTTGAATATGTCTCTCAGCCTTTTCTTATGGACCTTGAAGGAAAACAGGGTAATTTCAAAAA
TCTTAGTGAATTTGTGTTTAAGAATATTGATGGTTATTTTAAAATATATTCTAAGCACACGCCTATTAATTTAGTGCGTG
ATCTCCCTCAGGGTTTTTCGGCTTTAGAACCATTGGTAGATTTGCCAATAGGTATTAACATCACTAGGTTTCAAACTTTA
CTTGCTTTACATAGAAGTTATTTGACTCCTGGTGATTCTTCTTCAGGTTGGACAGCTGGTGCTGCAGCTTATTATGTGGG
TTATCTTCAACCTAGGACTTTTCTATTAAAATATAATGAAAATGGAACCATTACAGATGCTGTAGACTGTGCACTTGACC
CTCTCTCAGAAACAAAGTGTACGTTGAAATCCTTCACTGTAGAAAAAGGAATCTATCAAACTTCTAACTTTAGAGTCCAA
CCAACAGAATCTATTGTTAGATTTCCTAATATTACAAACTTGTGCCCTTTTGGTGAAGTTTTTAACGCCACCAGATTTGC
ATCTGTTTATGCTTGGAACAGGAAGAGAATCAGCAACTGTGTTGCTGATTATTCTGTCCTATATAATTCCGCATCATTTT
CCACTTTTAAGTGTTATGGAGTGTCTCCTACTAAATTAAATGATCTCTGCTTTACTAATGTCTATGCAGATTCATTTGTA
ATTAGAGGTGATGAAGTCAGACAAATCGCTCCAGGGCAAACTGGAACGATTGCTGATTATAATTATAAATTACCAGATGA
TTTTACAGGCTGCGTTATAGCTTGGAATTCTAACAATCTTGATTCTAAGGTTGGTGGTAATTATAATTACCTGTATAGAT
TGTTTAGGAAGTCTAATCTCAAACCTTTTGAGAGAGATATTTCAACTGAAATCTATCAGGCCGGTAGCACACCTTGTAAT
GGTGTTAAAGGTTTTAATTGTTACTTTCCTTTACAATCATATGGTTTCCAACCCACTTATGGTGTTGGTTACCAACCATA
CAGAGTAGTAGTACTTTCTTTTGAACTTCTACATGCACCAGCAACTGTTTGTGGACCTAAAAAGTCTACTAATTTGGTTA
AAAACAAATGTGTCAATTTCAACTTCAATGGTTTAACAGGCACAGGTGTTCTTACTGAGTCTAACAAAAAGTTTCTGCCT
TTCCAACAATTTGGCAGAGACATTGCTGACACTACTGATGCTGTCCGTGATCCACAGACACTTGAGATTCTTGACATTAC
ACCATGTTCTTTTGGTGGTGTCAGTGTTATAACACCAGGAACAAATACTTCTAATCAGGTTGCTGTTCTTTATCAGGGTG
TTAACTGCACAGAAGTCCCTGTTGCTATTCATGCAGATCAACTTACTCCTACTTGGCGTGTTTATTCTACAGGTTCTAAT
GTTTTTCAAACACGTGCAGGCTGTTTAATAGGGGCTGAATATGTCAACAACTCATATGAGTGTGACATACCCATTGGTGC
AGGTATATGCGCTAGTTATCAGACTCAGACTAATTCTCCTCGGCGGGCACGTAGTGTAGCTAGTCAATCCATCATTGCCT
ACACTATGTCACTTGGTGCAGAAAATTCAGTTGCTTACTCTAATAACTCTATTGCCATACCCACAAATTTTACTATTAGT
GTTACCACAGAAATTCTACCAGTGTCTATGACCAAGACATCAGTAGATTGTACAATGTACATTTGTGGTGATTCAACTGA
ATGCAGCAATCTTTTGTTGCAATATGGCAGTTTTTGTACACAATTAAACCGTGCTTTAACTGGAATAGCTGTTGAACAAG
ACAAAAACACCCAAGAAGTTTTTGCACAAGTCAAACAAATTTACAAAACACCACCAATTAAAGATTTTGGTGGTTTTAAT
TTTTCACAAATATTACCAGATCCATCAAAACCAAGCAAGAGGTCATTTATTGAAGATCTACTTTTCAACAAAGTGACACT
TGCAGATGCTGGCTTCATCAAACAATATGGTGATTGCCTTGGTGATATTGCTGCTAGAGACCTCATTTGTGCACAAAAGT
TTAACGGCCTTACTGTTTTGCCACCTTTGCTCACAGATGAAATGATTGCTCAATACACTTCTGCACTGTTAGCGGGTACA
ATCACTTCTGGTTGGACCTTTGGTGCAGGTGCTGCATTACAAATACCATTTGCTATGCAAATGGCTTATAGGTTTAATGG
TATTGGAGTTACACAGAATGTTCTCTATGAGAACCAAAAATTGATTGCCAACCAATTTAATAGTGCTATTGGCAAAATTC
AAGACTCACTTTCTTCCACAGCAAGTGCACTTGGAAAACTTCAAGATGTGGTCAACCAAAATGCACAAGCTTTAAACACG
CTTGTTAAACAACTTAGCTCCAATTTTGGTGCAATTTCAAGTGTTTTAAATGATATCCTTTCACGTCTTGACAAAGTTGA
GGCTGAAGTGCAAATTGATAGGTTGATCACAGGCAGACTTCAAAGTTTGCAGACATATGTGACTCAACAATTAATTAGAG
CTGCAGAAATCAGAGCTTCTGCTAATCTTGCTGCTATTAAAATGTCAGAGTGTGTACTTGGACAATCAAAAAGAGTTGAT
TTTTGTGGAAAGGGCTATCATCTTATGTCCTTCCCTCAGTCAGCACCTCATGGTGTAGTCTTCTTGCATGTGACTTATGT
CCCTGCACAAGAAAAGAACTTCACAACTGCTCCTGCCATTTGTCATGATGGAAAAGCACACTTTCCTCGTGAAGGTGTCT
TTGTTTCAAATGGCACACACTGGTTTGTAACACAAAGGAATTTTTATGAACCACAAATCATTACTACAGACAACACATTT
GTGTCTGGTAACTGTGATGTTGTAATAGGAATTGTCAACAACACAGTTTATGATCCTTTGCAACCTGAATTAGACTCATT
CAAGGAGGAGTTAGATAAATATTTTAAGAATCATACATCACCAGATGTTGATTTAGGTGACATCTCTGGCATTAATGCTT
CATTTGTAAACATTCAAAAAGAAATTGACCGCCTCAATGAGGTTGCCAAGAATTTAAATGAATCTCTCATCGATCTCCAA
GAACTTGGAAAGTATGAGCAGTATATAAAATGGCCATGGTACATTTGGCTAGGTTTTATAGCTGGCTTGATTGCCATAGT
AATGGTGACAATTATGCTTTGCTGTATGACCAGTTGCTGTAGTTGTCTCAAGGGCTGTTGTTCTTGTGGATCCTGCTGCA
AATTTGATGAAGACGACTCTGAGCCAGTGCTCAAAGGAGTCAAATTACATTACACATAAACGAACTTATGGATTTGTTTA
TGAGAATCTTCACAATTGGAACTGTAACTTTGAAGCAAGGTGAAATCAAGGATGCTACTCCTTCAGATTTTGTTCGCGCT
ACTGCAACGATACCGATACAAGCCTCACTCCCTTTCGGATGGCTTATTGTTGGCGTTGCACTTCTTGCTGTTTTTCAGAG
CGCTTCCAAAATCATAACCCTCAAAAAGAGATGGCAACTAGCACTCTCCAAGGGTGTTCACTTTGTTTGCAACTTGCTGT
TGTTGTTTGTAACAGTTTACTCACACCTTTTGCTCGTTGCTGCTGGCCTTGAAGCCCCTTTTCTCTATCTTTATGCTTTA
GTCTACTTCTTGCAGAGTATAAACTTTGTAAGAATAATAATGAGGCTTTGGCTTTGCTGGAAATGCCGTTCCAAAAACCC
ATTACTTTATGATGCCAACTATTTTCTTTGCTGGCATACTAATTGTTACGACTATTGTATACCTTACAATAGTGTAACTT
CTTCAATTGTCATTACTTCAGGTGATGGCACAACAAGTCCTATTTCTGAACATGACTACCAGATTGGTGGTTATACTGAA
AAATGGGAATCTGGAGTAAAAGACTGTGTTGTATTACACAGTTACTTCACTTCAGACTATTACCAGCTGTACTCAACTCA
ATTGAGTACAGACACTGGTGTTGAACATGTTACCTTCTTCATCTACAATAAAATTGTTGATGAGCCTGAAGAACATGTCC
AAATTCACACAATCGACGGTTCACCCGGAGTTGTTAATCCAGTAATGGAACCAATTTATGATGAACCGACGACGACTACT
AGCGTGCCTTTGTAAGCACAAGCTGATGAGTACGAACTTATGTACTCATTCGTTTCGGAAGAGACAGGTACGTTAATAGT
TAATAGCGTACTTCTTTTTCTTGCTTTCGTGGTATTCTTGCTAGTTACACTAGCCATCCTTACTGCGCTTCGATTGTGTG
CGTACTGCTGCAATATTGTTAACGTGAGTCTTGTAAAACCTTCTTTTTACGTTTACTCTCGTGTTAAAAATCTGAATTCT
TCTAGAGTTCCTGATCTTCTGGTCTAAACGAACTAAATATTATATTAGTTTTTCTGTTTGGAACTTTAATTTTAGCCATG
GCAGATTCCAACGGTACTATTACCGTTGAAGAGCTTAAAAAGCTCCTTGAACAATGGAACCTAGTAATAGGTTTCCTATT
CCTTACATGGATTTGTCTTCTACAATTTGCCTATGCCAACAGGAATAGGTTTTTGTATATAATTAAGTTAATTTTCCTCT
GGCTGTTATGGCCAGTAACTTTAGCTTGTTTTGTGCTTGCTGCTGTTTACAGAATAAATTGGATCACCGGTGGAATTGCT
ATCGCAATGGCTTGTCTTGTAGGCTTGATGTGGCTCAGCTACTTCATTGCTTCTTTCAGACTGTTTGCGCGTACGCGTTC
CATGTGGTCATTCAATCCAGAAACTAACATTCTTCTCAACGTGCCACTCCATGGCACTATTCTGACCAGACCGCTTCTAG
AAAGTGAACTCGTAATCGGAGCTGTGATCCTTCGTGGACATCTTCGTATTGCTGGACACCATCTAGGACGCTGTGACATC
AAGGACCTGCCTAAAGAAATCACTGTTGCTACATCACGAACGCTTTCTTATTACAAATTGGGAGCTTCGCAGCGTGTAGC
AGGTGACTCAGGTTTTGCTGCATACAGTCGCTACAGGATTGGCAACTATAAATTAAACACAGACCATTCCAGTAGCAGTG
ACAATATTGCTTTGCTTGTACAGTAAGTGACAACAGATGTTTCATCTCGTTGACTTTCAGGTTACTATAGCAGAGATATT
ACTAATTATTATGAGGACTTTTAAAGTTTCCATTTGGAATCTTGATTACATCATAAACCTCATAATTAAAAATTTATCTA
AGTCACTAACTGAGAATAAATATTCTCAATTAGATGAAGAGCAACCAATGGAGATTGATTAAACGAACATGAAAATTATT
CTTTTCTTGGCACTGATAACACTCGCTACTTGTGAGCTTTATCACTACCAAGAGTGTGTTAGAGGTACAACAGTACTTTT
AAAAGAACCTTGCTCTTCTGGAACATACGAGGGCAATTCACCATTTCATCCTCTAGCTGATAACAAATTTGCACTGACTT
GCTTTAGCACTCAATTTGCTTTTGCTTGTCCTGACGGCGTAAAACACGTCTATCAGTTACGTGCCAGATCAGTTTCACCT
AAACTGTTCATCAGACAAGAGGAAGTTCAAGAACTTTACTCTCCAATTTTTCTTATTGTTGCGGCAATAGTGTTTATAAC
ACTTTGCTTCACACTCAAAAGAAAGACAGAATGATTGAACTTTCATTAATTGACTTCTATTTGTGCTTTTTAGCCTTTCT
GCTATTCCTTGTTTTAATTATGCTTATTATCTTTTGGTTCTCACTTGAACTGCAAGATCATAATGAAACTTGTCACGCCT
AAACGAACATGAAATTTCTTGTTTTCTTAGGAATCATCACAACTGTAGCTGCATTTCACCAAGAATGTAGTTTACAGTCA
TGTACTCAACATCAACCATATGTAGTTGATGACCCGTGTCCTATTCACTTCTATTCTAAATGGTATATTAGAGTAGGAGC
TAGAAAATCAGCACCTTTAATTGAATTGTGCGTGGATGAGGCTGGTTCTAAATCACCCATTCAGTACATCGATATCGGTA
ATTATACAGTTTCCTGTTTACCTTTTACAATTAATTGCCAGAAACCTAAATTGGGTAGTCTTGTAGTGCGTTGTTCGTTC
TATGAAGACTTTTTAGAGTATCATGACGTTCGTGTTGTTTTAGATTTCATCTAAACGAACAAACAAACTAAAATGTCTGA
TAATGGACCCCAAAATCAGCGAAATGCACCCCGCATTACGTTTGGTGGACCCTCAGATTCAACTGGCAGTAACCAGAATG
GAGAACGCAGTGGGGCGCGATCAAAACAACGTCGGCCCCAAGGTTTACCCAATAATACTGCGTCTTGGTTCACCGCTCTC
ACTCAACATGGCAAGGAAGACCTTAAATTCCCTCGAGGACAAGGCGTTCCAATTAACACCAATAGCAGTCGAGATGACCA
AATTGGCTACTACCGAAGAGCTACCAGACGAATTCGTGGTGGTGACGGTAAAATGAAAGATCTCAGTCCAAGATGGTATT
TCTACTACCTAGGAACTGGGCCAGAAGCTGGACTTCCCTATGGTGCTAACAAAGACGGCATCATATGGGTTGCAACTGAG
GGAGCCTTGAATACACCAAAAGATCACATTGGCACCCGCAATCCTGCTAACAATGCTGCAATCGTGCTACAACTTCCTCA
AGGAACAACATTGCCAAAAGGCTTCTACGCAGAAGGGAGCAGAGGCGGCAGTCAAGCCTCTTCTCGTTCCTCATCACGTA
GTCGCAACAGTTCAAGAAATTCAACTCCAGGCAGCTCTAAACGAACTTCTCCTGCTAGAATGGCTGGCAATGGCGGTGAT
GCTGCTCTTGCTTTGCTGCTGCTTGACAGATTGAACCAGCTTGAGAGCAAAATGTCTGGTAAAGGCCAACAACAACAAGG
CCAAACTGTCACTAAGAAATCTGCTGCTGAGGCTTCTAAGAAGCCTCGGCAAAAACGTACTGCCACTAAAGCATACAATG
TAACACAAGCTTTCGGCAGACGTGGTCCAGAACAAACCCAAGGAAATTTTGGGGACCAGGAACTAATCAGACAAGGAACT
GATTACAAACATTGGCCGCAAATTGCACAATTTGCCCCCAGCGCTTCAGCGTTCTTCGGAATGTCGCGCATTGGCATGGA
AGTCACACCTTCGGGAACGTGGTTGACCTACACAGGTGCCATCAAATTGGATGACAAAGATCCAAATTTCAAAGATCAAG
TCATTTTGCTGAATAAGCATATTGACGCATACAAAACATTCCCACCAACAGAGCCTAAAAAGGACAAAAAGAAGAAGGCT
GATGAAACTCAAGCCTTACCGCAGAGACAGAAGAAACAGCAAACTGTGACTCTTCTTCCTGCTGCAGATTTGGATGATTT
CTCCAAACAATTGCAACAATCCATGAGCAGTGCTGACTCAACTCAGGCCTAAACTCATGCAGACCACACAAGGCAGATGG
GCTATATAAACGTTTTCGCTTTTCCGTTTACGATATATAGTCTACTCTTGTGCAGAATGAATTCTCGTAACTACATAGCA
CAAGTAGATGTAGTTAACTTTAATCTCACATAGCAATCTTTAATCAGTGTGTAACATTAGGGAGGACTTGAAAGAGCCAC
CACATTTTCACCGAGGCCACGCGGAGTACGATCGAGTGTACAGTGAACAATGCTAGGGAGAGCTGCCTATATGGAAGAGC
CCTAATGTGTAAAATTAATTTTAGTAGTGCTAACCCCATGTGATTTTAATAGCTTCTTA
서열번호: 21
>QQX12069.1 표면 당단백질, 게놈 수탁 MW520923 유래
MFVFLVLLPLVSSQCVNFTNRTQLPSAYTNSFTRGVYYPDKVFRSSVLHSTQDLFLPFFSNVTWFHAIHVSGTNGTKRFDNPVLPFNDGVYFASTEKSNIIRGWIFGTTLDSKTQSLLIVNNATNVVIKVCEFQFCNYPFLGVYYHKNNKSWMESEFRVYSSANNCTFEYVSQPFLMDLEGKQGNFKNLSEFVFKNIDGYFKIYSKHTPINLVRDLPQGFSALEPLVDLPIGINITRFQTLLALHRSYLTPGDSSSGWTAGAAAYYVGYLQPRTFLLKYNENGTITDAVDCALDPLSETKCTLKSFTVEKGIYQTSNFRVQPTESIVRFPNITNLCPFGEVFNATRFASVYAWNRKRISNCVADYSVLYNSASFSTFKCYGVSPTKLNDLCFTNVYADSFVIRGDEVRQIAPGQTGTIADYNYKLPDDFTGCVIAWNSNNLDSKVGGNYNYLYRLFRKSNLKPFERDISTEIYQAGSTPCNGVKGFNCYFPLQSYGFQPTYGVGYQPYRVVVLSFELLHAPATVCGPKKSTNLVKNKCVNFNFNGLTGTGVLTESNKKFLPFQQFGRDIADTTDAVRDPQTLEILDITPCSFGGVSVITPGTNTSNQVAVLYQGVNCTEVPVAIHADQLTPTWRVYSTGSNVFQTRAGCLIGAEYVNNSYECDIPIGAGICASYQTQTNSPRRARSVASQSIIAYTMSLGAENSVAYSNNSIAIPTNFTISVTTEILPVSMTKTSVDCTMYICGDSTECSNLLLQYGSFCTQLNRALTGIAVEQDKNTQEVFAQVKQIYKTPPIKDFGGFNFSQILPDPSKPSKRSFIEDLLFNKVTLADAGFIKQYGDCLGDIAARDLICAQKFNGLTVLPPLLTDEMIAQYTSALLAGTITSGWTFGAGAALQIPFAMQMAYRFNGIGVTQNVLYENQKLIANQFNSAIGKIQDSLSSTASALGKLQDVVNQNAQALNTLVKQLSSNFGAISSVLNDILSRLDKVEAEVQIDRLITGRLQSLQTYVTQQLIRAAEIRASANLAAIKMSECVLGQSKRVDFCGKGYHLMSFPQSAPHGVVFLHVTYVPAQEKNFTTAPAICHDGKAHFPREGVFVSNGTHWFVTQRNFYEPQIITTDNTFVSGNCDVVIGIVNNTVYDPLQPELDSFKEELDKYFKNHTSPDVDLGDISGINASFVNIQKEIDRLNEVAKNLNESLIDLQELGKYEQYIKWPWYIWLGFIAGLIAIVMVTIMLCCMTSCCSCLKGCCSCGSCCKFDEDDSEPVLKGVKLHYT
서열번호: 22
> 중증 급성 호흡기 증후군 코로나바이러스 2 단리물 SARS-CoV-2/인간/England/MIG457/2020, EVAg Ref-SKU:004V-04032, 완전 게놈. UK B 1.1.7 계통
ATTAAAGGTTTATACCTTCCCAGGTAACAAACCAACCAACTTTCGATCTCTTGTAGATCTGTTCTCTAAACGAACTTTAAAATCTGTGTGGCTGTCACTCGGCTGCATGCTTAGTGCACTCACGCAGTATAATTAATAACTAATTACTGTCGTTGACAGGACACGAGTAACTCGTCTATCTTCTGCAGGCTGCTTACGGTTTCGTCCGTGTTGCAGCCGATCATCAGCACATCTAGGTTTTGTCCGGGTGTGACCGAAAGGTAAGATGGAGAGCCTTGTCCCTGGTTTCAACGAGAAAACACACGTCCAACTCAGTTTGCCTGTTTTACAGGTTCGCGACGTGCTCGTACGTGGCTTTGGAGACTCCGTGGAGGAGGTCTTATCAGAGGCACGTCAACATCTTAAAGATGGCACTTGTGGCTTAGTAGAAGTTGAAAAAGGCGTTTTGCCTCAACTTGAACAGCCCTATGTGTTCATCAAACGTTCGGATGCTCGAACTGCACCTCATGGTCATGTTATGGTTGAGCTGGTAGCAGAACTCGAAGGCATTCAGTACGGTCGTAGTGGTGAGACACTTGGTGTCCTTGTCCCTCATGTGGGCGAAATACCAGTGGCTTACCGCAAGGTTCTTCTTCGTAAGAACGGTAATAAAGGAGCTGGTGGCCATAGTTACGGCGCCGATCTAAAGTCATTTGACTTAGGCGACGAGCTTGGCACTGATCCTTATGAAGATTTTCAAGAAAACTGGAACACTAAACATAGCAGTGGTGTTACCCGTGAACTCATGCGTGAGCTTAACGGAGGGGCATACACTCGCTATGTCGATAACAACTTCTGTGGCCCTGATGGCTACCCTCTTGAGTGCATTAAAGACCTTCTAGCACGTGCTGGTAAAGCTTCATGCACTTTGTCTGAACAACTGGACTTTATTGACACTAAGAGGGGTGTATACTGCTGCCGTGAACATGAGCATGAAATTGCTTGGTACACGGAACGTTCTGAAAAGAGCTATGAATTGCAGACACCTTTTGAAATTAAATTGGCAAAGAAATTTGACACCTTCAATGGGGAATGTCCAAATTTTGTATTTCCCTTAAATTCCATAATCAAGACTATTCAACCAAGGGTTGAAAAGAAAAAGCTTGATGGCTTTATGGGTAGAATTCGATCTGTCTATCCAGTTGCGTCACCAAATGAATGCAACCAAATGTGCCTTTCAACTCTCATGAAGTGTGATCATTGTGGTGAAACTTCATGGCAGACGGGCGATTTTGTTAAAGCCACTTGCGAATTTTGTGGCACTGAGAATTTGACTAAAGAAGGTGCCACTACTTGTGGTTACTTACCCCAAAATGCTGTTGTTAAAATTTATTGTCCAGCATGTCACAATTCAGAAGTAGGACCTGAGCATAGTCTTGCCGAATACCATAATGAATCTGGCTTGAAAACCATTCTTCGTAAGGGTGGTCGCACTATTGCCTTTGGAGGCTGTGTGTTCTCTTATGTTGGTTGCCATAACAAGTGTGCCTATTGGGTTCCACGTGCTAGCGCTAACATAGGTTGTAACCATACAGGTGTTGTTGGAGAAGGTTCCGAAGGTCTTAATGACAACCTTCTTGAAATACTCCAAAAAGAGAAAGTCAACATCAATATTGTTGGTGACTTTAAACTTAATGAAGAGATCGCCATTATTTTGGCATCTTTTTCTGCTTCCACAAGTGCTTTTGTGGAAACTGTGAAAGGTTTGGATTATAAAGCATTCAAACAAATTGTTGAATCCTGTGGTAATTTTAAAGTTACAAAAGGAAAAGCTAAAAAAGGTGCCTGGAATATTGGTGAACAGAAATCAATACTGAGTCCTCTTTATGCATTTGCATCAGAGGCTGCTCGTGTTGTACGATCAATTTTCTCCCGCACTCTTGAAACTGCTCAAAATTCTGTGCGTGTTTTACAGAAGGCCGCTATAACAATACTAGATGGAATTTCACAGTATTCACTGAGACTCATTGATGCTATGATGTTCACATCTGATTTGGCTACTAACAATCTAGTTGTAATGGCCTACATTACAGGTGGTGTTGTTCAGTTGACTTCGCAGTGGCTAACTAACATCTTTGGCACTGTTTATGAAAAACTCAAACCCGTCCTTGATTGGCTTGAAGAGAAGTTTAAGGAAGGTGTAGAGTTTCTTAGAGACGGTTGGGAAATTGTTAAATTTATCTCAACCTGTGCTTGTGAAATTGTCGGTGGACAAATTGTCACCTGTGCAAAGGAAATTAAGGAGAGTGTTCAGACATTCTTTAAGCTTGTAAATAAATTTTTGGCTTTGTGTGCTGACTCTATCATTATTGGTGGAGCTAAACTTAAAGCCTTGAATTTAGGTGAAACATTTGTCACGCACTCAAAGGGATTGTACAGAAAGTGTGTTAAATCCAGAGAAGAAACTGGCCTACTCATGCCTCTAAAAGCCCCAAAAGAAATTATCTTCTTAGAGGGAGAAACACTTCCCACAGAAGTGTTAACAGAGGAAGTTGTCTTGAAAACTGGTGATTTACAACCATTAGAACAACCTACTAGTGAAGCTGTTGAAGCTCCATTGGTTGGTACACCAGTTTGTATTAACGGGCTTATGTTGCTCGAAATCAAAGACACAGAAAAGTACTGTGCCCTTGCACCTAATATGATGGTAACAAACAATACCTTCACACTCAAAGGCGGTGCACCAACAAAGGTTACTTTTGGTGATGACACTGTGATAGAAGTGCAAGGTTACAAGAGTGTGAATATCACTTTTGAACTTGATGAAAGGATTGATAAAGTACTTAATGAGAAGTGCTCTGCCTATACAGTTGAACTCGGTACAGAAGTAAATGAGTTCGCCTGTGTTGTGGCAGATGCTGTCATAAAAACTTTGCAACCAGTATCTGAATTACTTACACCACTGGGCATTGATTTAGATGAGTGGAGTATGGCTACATACTACTTATTTGATGAGTCTGGTGAGTTTAAATTGGCTTCACATATGTATTGTTCTTTTTACCCTCCAGATGAGGATGAAGAAGAAGGTGATTGTGAAGAAGAAGAGTTTGAGCCATCAACTCAATATGAGTATGGTACTGAAGATGATTACCAAGGTAAACCTTTGGAATTTGGTGCCACTTCTGCTGCTCTTCAACCTGAAGAAGAGCAAGAAGAAGATTGGTTAGATGATGATAGTCAACAAACTGTTGGTCAACAAGACGGCAGTGAGGACAATCAGACAACTATTATTCAAACAATTGTTGAGGTTCAACCTCAATTAGAGATGGAACTTACACCAGTTGTTCAGACTATTGAAGTGAATAGTTTTAGTGGTTATTTAAAACTTACTGACAATGTATACATTAAAAATGCAGACATTGTGGAAGAAGCTAAAAAGGTAAAACCAACAGTGGTTGTTAATGCAGCCAATGTTTACCTTAAACATGGAGGAGGTGTTGCAGGAGCCTTAAATAAGGCTACTAACAATGCCATGCAAGTTGAATCTGATGATTACATAGCTACTAATGGACCACTTAAAGTGGGTGGTAGTTGTGTTTTAAGCGGACACAATCTTGCTAAACACTGTCTTCATGTTGTCGGCCCAAATGTTAACAAAGGTGAAGACATTCAACTTCTTAAGAGTGCTTATGAAAATTTTAATCAGCACGAAGTTCTACTTGCACCATTATTATCAGCTGGTATTTTTGGTGCTGACCCTATACATTCTTTAAGAGTTTGTGTAGATACTGTTCGCACAAATGTCTACTTAGCTGTCTTTGATAAAAATCTCTATGACAAACTTGTTTCAAGCTTTTTGGAAATGAAGAGTGAAAAGCAAGTTGAACAAAAGATCGCTGAGATTCCTAAAGAGGAAGTTAAGCCATTTATAACTGAAAGTAAACCTTCAGTTGAACAGAGAAAACAAGATGATAAGAAAATCAAAGCTTGTGTTGAAGAAGTTACAACAACTCTGGAAGAAACTAAGTTCCTCACAGAAAACTTGTTACTTTATATTGACATTAATGGCAATCTTCATCCAGATTCTGCCACTCTTGTTAGTGACATTGACATCACTTTCTTAAAGAAAGATGCTCCATATATAGTGGGTGATGTTGTTCAAGAGGGTGTTTTAACTGCTGTGGTTATACCTACTAAAAAGGCTGGTGGCACTACTGAAATGCTAGCGAAAGCTTTGAGAAAAGTGCCAACAGACAATTATATAACCACTTACCCGGGTCAGGGTTTAAATGGTTACACTGTAGAGGAGGCAAAGACAGTGCTTAAAAAGTGTAAAAGTGCCTTTTACATTCTACCATCTATTATCTCTAATGAGAAGCAAGAAATTCTTGGAACTGTTTCTTGGAATTTGCGAGAAATGCTTGCACATGCAGAAGAAACACGCAAATTAATGCCTGTCTGTGTGGAAACTAAAGCCATAGTTTCAACTATACAGCGTAAATATAAGGGTATTAAAATACAAGAGGGTGTGGTTGATTATGGTGCTAGATTTTACTTTTACACCAGTAAAACAACTGTAGCGTCACTTATCAACACACTTAACGATCTAAATGAAACTCTTGTTACAATGCCACTTGGCTATGTAACACATGGCTTAAATTTGGAAGAAGCTGCTCGGTATATGAGATCTCTCAAAGTGCCAGCTACAGTTTCTGTTTCTTCACCTGATGCTGTTACAGCGTATAATGGTTATCTTACTTCTTCTTCTAAAACACCTGAAGAACATTTTATTGAAACCATCTCACTTGCTGGTTCCTATAAAGATTGGTCCTATTCTGGACAATCTACACAACTAGGTATAGAATTTCTTAAGAGAGGTGATAAAAGTGTATATTACACTAGTAATCCTACCACATTCCACCTAGATGGTGAAGTTATCACCTTTGACAATCTTAAGACACTTCTTTCTTTGAGAGAAGTGAGGACTATTAAGGTGTTTACAACAGTAGACAACATTAATCTCCACACGCAAGTTGTGGACATGTCAATGACATATGGACAACAGTTTGGTCCAACTTATTTGGATGGAGCTGATGTTACTAAAATAAAACCTCATAATTCACATGAAGGTAAAACATTTTATGTTTTACCTAATGATGACACTCTACGTGTTGAGGCTTTTGAGTACTACCACACAACTGATCCTAGTTTTCTGGGTAGGTACATGTCAGCATTAAATCACACTAAAAAGTGGAAATACCCACAAGTTAATGGTTTAACTTCTATTAAATGGGCAGATAACAACTGTTATCTTGCCACTGCATTGTTAACACTCCAACAAATAGAGTTGAAGTTTAATCCACCTGCTCTACAAGATGCTTATTACAGAGCAAGGGCTGGTGAAGCTGATAACTTTTGTGCACTTATCTTAGCCTACTGTAATAAGACAGTAGGTGAGTTAGGTGATGTTAGAGAAACAATGAGTTACTTGTTTCAACATGCCAATTTAGATTCTTGCAAAAGAGTCTTGAACGTGGTGTGTAAAACTTGTGGACAACAGCAGACAACCCTTAAGGGTGTAGAAGCTGTTATGTACATGGGCACACTTTCTTATGAACAATTTAAGAAAGGTGTTCAGATACCTTGTACGTGTGGTAAACAAGCTACAAAATATCTAGTACAACAGGAGTCACCTTTTGTTATGATGTCAGCACCACCTGCTCAGTATGAACTTAAGCATGGTACATTTACTTGTGCTAGTGAGTACACTGGTAATTACCAGTGTGGTCACTATAAACATATAACTTCTAAAGAAACTTTGTATTGCATAGACGGTGCTTTACTTACAAAGTCCTCAGAATACAAAGGTCCTATTACGGATGTTTTCTACAAAGAAAACAGTTACACAACAACCATAAAACCAGTTACTTATAAATTGGATGGTGTTGTTTGTACAGAAATTGACCCTAAGTTGGACAATTATTATAAGAAAGACAATTCTTATTTTACAGAGCAACCAATTGATCTTGTACCAAACCAACCATATCCAAACGCAAGCTTCGATAATTTTAAGTTTGTATGTGATAATATCAAATTTGCTGATGATTTAAACCAGTTAACTGGTTATAAGAAACCTGCTTCAAGAGAGCTTAAAGTTACATTTTTCCCTGACTTAAATGGTGATGTGGTGGCTATTGATTATAAACACTACACACCCTCTTTTAAGAAAGGAGCTAAATTGTTACATAAACCTATTGTTTGGCATGTTAACAATGCAACTAATAAAGCCACGTATAAACCAAATACCTGGTGTATACGTTGTCTTTGGAGCACAAAACCAGTTGAAACATCAAATTCGTTTGATGTACTGAAGTCAGAGGACGCGCAGGGAATGGATAATCTTGCCTGCGAAGATCTAAAACCAGTCTCTGAAGAAGTAGTGGAAAATCCTACCATACAGAAAGACGTTCTTGAGTGTAATGTGAAAACTACCGAAGTTGTAGGAGACATTATACTTAAACCAGCAAATAATAGTTTAAAAATTACAGAAGAGGTTGGCCACACAGATCTAATGGCTGCTTATGTAGACAATTCTAGTCTTACTATTAAGAAACCTAATGAATTATCTAGAGTATTAGGTTTGAAAACCCTTGTTACTCATGGTTTAGCTGCTGTTAATAGTGTCCCTTGGGATACTATAGCTAATTATGCTAAGCCTTTTCTTAACAAAGTTGTTAGTACAACTACTAACATAGTTACACGGTGTTTAAACCGTGTTTGTACTAATTATATGCCTTATTTCTTTACTTTATTGCTACAATTGTGTACTTTTACTAGAAGTACAAATTCTAGAATTAAAGCATCTATGCCGACTACTATAGCAAAGAATACTGTTAAGAGTGTCGGTAAATTTTGTCTAGAGGCTTCATTTAATTATTTGAAGTCACCTAATTTTTCTAAACTGATAAATATTACAATTTGGTTTTTACTATTAAGTGTTTGCCTAGGTTCTTTAATCTACTCAACCGCTGCTTTAGGTGTTTTAATGTCTAATTTAGGCATGCCTTCTTACTGTACTGGTTACAGAGAAGGCTATTTGAACTCTACTAATGTCACTATTGCAACCTACTGTACTGGTTCTATACCTTGTAGTGTTTGTCTTAGTGGTTTAGATTCTTTAGACACCTATCCTTCTTTAGAAACTATACAAATTACCATTTCATCTTTTAAATGGGATTTAACTGCTTTTGGCTTAGTTGCAGAGTGGTTTTTGGCATATATTCTTTTCACTAGGTTTTTCTATGTACTTGGATTGGCTGCAATCATGCAATTGTTTTTCAGCTATTTTGCAGTACATTTTATTAGTAATTCTTGGCTTATGTGGTTAATAATTAATCTTGTACAAATGGCCCCGATTTCAGCTATGGTTAGAATGTACATCTTCTTTGCATCATTTTATTATGTATGGAAAAGTTATGTGCATGTTGTAGACGGTTGTAATTCATCAACTTGTATGATGTGTTACAAACGTAATAGAGCAACAAGAGTCGAATGTACAACTATTGTTAATGGTGTTAGAAGGTCCTTTTATGTCTATGCTAATGGAGGTAAAGGCTTTTGCAAACTACACAATTGGAATTGTGTTAATTGTGATACATTCTGTGCTGGTAGTACATTTATTAGTGATGAAGTTGCGAGAGACTTGTCACTACAGTTTAAAAGACCAATAAATCCTACTGACCAGTCTTCTTACATCGTTGATAGTGTTACAGTGAAGAATGGTTCCATCCATCTTTACTTTGATAAAGCTGGTCAAAAGACTTATGAAAGACATTCTCTCTCTCATTTTGTTAACTTAGACAACCTGAGAGCTAATAACACTAAAGGTTCATTGCCTATTAATGTTATAGTTTTTGATGGTAAATCAAAATGTGAAGAATCATCTGCAAAATCAGCGTCTGTTTACTACAGTCAGCTTATGTGTCAACCTATACTGTTACTAGATCAGGCATTAGTGTCTGATGTTGGTGATAGTGCGGAAGTTGCAGTTAAAATGTTTGATGCTTACGTTAATACGTTTTCATCAACTTTTAACGTACCAATGGAAAAACTCAAAACACTAGTTGCAACTGCAGAAGCTGAACTTGCAAAGAATGTGTCCTTAGACAATGTCTTATCTACTTTTATTTCAGCAGCTCGGCAAGGGTTTGTTGATTCAGATGTAGAAACTAAAGATGTTGTTGAATGTCTTAAATTGTCACATCAATCTGACATAGAAGTTACTGGCGATAGTTGTAATAACTATATGCTCACCTATAACAAAGTTGAAAACATGACACCCCGTGACCTTGGTGCTTGTATTGACTGTAGTGCGCGTCATATTAATGCGCAGGTAGCAAAAAGTCACAACATTGCTTTGATATGGAACGTTAAAGATTTCATGTCATTGTCTGAACAACTACGAAAACAAATACGTAGTGCTGCTAAAAAGAATAACTTACCTTTTAAGTTGACATGTGCAACTACTAGACAAGTTGTTAATGTTGTAACAACAAAGATAGCACTTAAGGGTGGTAAAATTGTTAATAATTGGTTGAAGCAGTTAATTAAAGTTACACTTGTGTTCCTTTTTGTTGCTGCTATTTTCTATTTAATAACACCTGTTCATGTCATGTCTAAACATACTGACTTTTCAAGTGAAATCATAGGATACAAGGCTATTGATGGTGGTGTCACTCGTGACATAGCATCTACAGATACTTGTTTTGCTAACAAACATGCTGATTTTGACACATGGTTTAGCCAGCGTGGTGGTAGTTATACTAATGACAAAGCTTGCCCATTGATTGCTGCAGTCATAACAAGAGAAGTGGGTTTTGTCGTGCCTGGTTTGCCTGGCACGATATTACGCACAACTAATGGTGACTTTTTGCATTTCTTACCTAGAGTTTTTAGTGCAGTTGGTAACATCTGTTACACACCATCAAAACTTATAGAGTACACTGACTTTGCAACATCAGCTTGTGTTTTGGCTGCTGAATGTACAATTTTTAAAGATGCTTCTGGTAAGCCAGTACCATATTGTTATGATACCAATGTACTAGAAGGTTCTGTTGCTTATGAAAGTTTACGCCCTGACACACGTTATGTGCTCATGGATGGCTCTATTATTCAATTTCCTAACACCTACCTTGAAGGTTCTGTTAGAGTGGTAACAACTTTTGATTCTGAGTACTGTAGGCACGGCACTTGTGAAAGATCAGAAGCTGGTGTTTGTGTATCTACTAGTGGTAGATGGGTACTTAACAATGATTATTACAGATCTTTACCAGGAGTTTTCTGTGGTGTAGATGCTGTAAATTTACTTACTAATATGTTTACACCACTAATTCAACCTATTGGTGCTTTGGACATATCAGCATCTATAGTAGCTGGTGGTATTGTAGCTATCGTAGTAACATGCCTTGCCTACTATTTTATGAGGTTTAGAAGAGCTTTTGGTGAATACAGTCATGTAGTTGCCTTTAATACTTTACTATTCCTTATGTCATTCACTGTACTCTGTTTAACACCAGTTTACTCATTCTTACCTGGTGTTTATTCTGTTATTTACTTGTACTTGACATTTTATCTTACTAATGATGTTTCTTTTTTAGCACATATTCAGTGGATGGTTATGTTCACACCTTTAGTACCTTTCTGGATAACAATTGCTTATATCATTTGTATTTCCACAAAGCATTTCTATTGGTTCTTTAGTAATTACCTAAAGAGACGTGTAGTCTTTAATGGTGTTTCCTTTAGTACTTTTGAAGAAGCTGCGCTGTGCACCTTTTTGTTAAATAAAGAAATGTATCTAAAGTTGCGTAGTGATGTGCTATTACCTCTTACGCAATATAATAGATACTTAGCTCTTTATAATAAGTACAAGTATTTTAGTGGAGCAATGGATACAACTAGCTACAGAGAAGCTGCTTGTTGTCATCTCGCAAAGGCTCTCAATGACTTCAGTAACTCAGGTTCTGATGTTCTTTACCAACCACCACAAACCTCTATCACCTCAGCTGTTTTGCAGAGTGGTTTTAGAAAAATGGCATTCCCATCTGGTAAAGTTGAGGGTTGTATGGTACAAGTAACTTGTGGTACAACTACACTTAACGGTCTTTGGCTTGATGACGTAGTTTACTGTCCAAGACATGTGATCTGCACCTCTGAAGACATGCTTAACCCTAATTATGAAGATTTACTCATTCGTAAGTCTAATCATAATTTCTTGGTACAGGCTGGTAATGTTCAACTCAGGGTTATTGGACATTCTATGCAAAATTGTGTACTTAAGCTTAAGGTTGATACAGCCAATCCTAAGACACCTAAGTATAAGTTTGTTCGCATTCAACCAGGACAGACTTTTTCAGTGTTAGCTTGTTACAATGGTTCACCATCTGGTGTTTACCAATGTGCTATGAGGCCCAATTTCACTATTAAGGGTTCATTCCTTAATGGTTCATGTGGTAGTGTTGGTTTTAACATAGATTATGACTGTGTCTCTTTTTGTTACATGCACCATATGGAATTACCAACTGGAGTTCATGCTGGCACAGACTTAGAAGGTAACTTTTATGGACCTTTTGTTGACAGGCAAACAGCACAAGCAGCTGGTACGGACACAACTATTACAGTTAATGTTTTAGCTTGGTTGTACGCTGCTGTTATAAATGGAGACAGGTGGTTTCTCAATCGATTTACCACAACTCTTAATGACTTTAACCTTGTGGCTATGAAGTACAATTATGAACCTCTAACACAAGACCATGTTGACATACTAGGACCTCTTTCTGCTCAAACTGGAATTGCCGTTTTAGATATGTGTGCTTCATTAAAAGAATTACTGCAAAATGGTATGAATGGACGTACCATATTGGGTAGTGCTTTATTAGAAGATGAATTTACACCTTTTGATGTTGTTAGACAATGCTCAGGTGTTACTTTCCAAAGTGCAGTGAAAAGAACAATCAAGGGTACACACCACTGGTTGTTACTCACAATTTTGACTTCACTTTTAGTTTTAGTCCAGAGTACTCAATGGTCTTTGTTCTTTTTTTTGTATGAAAATGCCTTTTTACCTTTTGCTATGGGTATTATTGCTATGTCTGCTTTTGCAATGATGTTTGTCAAACATAAGCATGCATTTCTCTGTTTGTTTTTGTTACCTTCTCTTGCCACTGTAGCTTATTTTAATATGGTCTATATGCCTGCTAGTTGGGTGATGCGTATTATGACATGGTTGGATATGGTTGATACTAGTTTGAAGCTAAAAGACTGTGTTATGTATGCATCAGCTGTAGTGTTACTAATCCTTATGACAGCAAGAACTGTGTATGATGATGGTGCTAGGAGAGTGTGGACACTTATGAATGTCTTGACACTCGTTTATAAAGTTTATTATGGTAATGCTTTAGATCAAGCCATTTCCATGTGGGCTCTTATAATCTCTGTTACTTCTAACTACTCAGGTGTAGTTACAACTGTCATGTTTTTGGCCAGAGGTATTGTTTTTATGTGTGTTGAGTATTGCCCTATTTTCTTCATAACTGGTAATACACTTCAGTGTATAATGCTAGTTTATTGTTTCTTAGGCTATTTTTGTACTTGTTACTTTGGCCTCTTTTGTTTACTCAACCGCTACTTTAGACTGACTCTTGGTGTTTATGATTACTTAGTTTCTACACAGGAGTTTAGATATATGAATTCACAGGGACTACTCCCACCCAAGAATAGCATAGATGCCTTCAAACTCAACATTAAATTGTTGGGTGTTGGTGGCAAACCTTGTATCAAAGTAGCCACTGTACAGTCTAAAATGTCAGATGTAAAGTGCACATCARTAGTCTTACTCTCAGTTTTGCAACAACTCAGAGTAGAATCATCATCTAAATTGTGGGCTCAATGTGTCCAGTTACACAATGACATTCTCTTAGCTAAAGATACTACTGAAGCCTTTGAAAAAATGGTTTCACTACTTTCTGTTTTGCTTTCCATGCAGGGTGCTGTAGACATAAACAAGCTTTGTGAAGAAATGCTGGACAACAGGGCAACCTTACAAGCTATAGCCTCAGAGTTTAGTTCCCTTC
CATCATATGCAGCTTTTGCTACTGCTCAAGAAGCTTATGAGCAGGCTGTTGCTAATGGTGATTCTGAAGTTGTTCTTAAAAAGTTGAAGAAGTCTTTGAATGTGGCTAAATCTGAATTTGACCGTGATGCAGCCATGCAACGTAAGTTGGAAAAGATGGCTGATCAAGCTATGACCCAAATGTATAAACAGGCTAGATCTGAGGACAAGAGGGCAAAAGTTACTAGTGCTATGCAGACAATGCTTTTCACTATGCTTAGAAAGTTGGATAATGATGCACTCAACAACATTATCAACAATGCAAGAGATGGTTGTGTTCCCTTGAACATAATACCTCTTACAACAGCAGCCAAACTAATGGTTGTCATACCAGACTATAACACATATAAAAATACGTGTGATGGTACAACATTTACTTATGCATCAGCATTGTGGGAAATCCAACAGGTTGTAGATGCAGATAGTAAAATTGTTCAACTTAGTGAAATTAGTATGGACAATTCACCTAATTTAGCATGGCCTCTTATTGTAACAGCTTTAAGGGCCAATTCTGCTGTCAAATTACAGAATAATGAGCTTAGTCCTGTTGCACTACGACAGATGTCTTGTGCTGCCGGTACTACACAAACTGCTTGCACTGATGACAATGCGTTAGCTTACTACAACACAACAAAGGGAGGTAGGTTTGTACTTGCACTGTTATCCGATTTACAGGATTTGAAATGGGCTAGATTCCCTAAGAGTGATGGAACTGGTACTATCTATACAGAACTGGAACCACCTTGTAGGTTTGTTACAGACACACCTAAAGGTCCTAAAGTGAAGTATTTATACTTTATTAAAGGATTAAACAACCTAAATAGAGGTATGGTACTTGGTAGTTTAGCTGCCACAGTACGTCTACAAGCTGGTAATGCAACAGAAGTGCCTGCCAATTCAACTGTATTATCTTTCTGTGCTTTTGCTGTAGATGCTGCTAAAGCTTACAAAGATTATCTAGCTAGTGGGGGACAACCAATCACTAATTGTGTTAAGATGTTGTGTACACACACTGGTACTGGTCAGGCAATAACAGTTACACCGGAAGCCAATATGGATCAAGAATCCTTTGGTGGTGCATCGTGTTGTCTGTACTGCCGTTGCCACATAGATCATCCAAATCCTAAAGGATTTTGTGACTTAAAAGGTAAGTATGTACAAATACCTACAACTTGTGCTAATGACCCTGTGGGTTTTACACTTAAAAACACAGTCTGTACCGTCTGCGGTATGTGGAAAGGTTATGGCTGTAGTTGTGATCAACTCCGCGAACCCATGCTTCAGTCAGCTGATGCACAATCGTTTTTAAACGGGTTTGCGGTGTAAGTGCAGCCCGTCTTACACCGTGCGGCACAGGCACTAGTACTGATGTCGTATACAGGGCTTTTGACATCTACAATGATAAAGTAGCTGGTTTTGCTAAATTCCTAAAAACTAATTGTTGTCGCTTCCAAGAAAAGGACGAAGATGACAATTTAATTGATTCTTACTTTGTAGTTAAGAGACACACTTTCTCTAACTACCAACATGAAGAAACAATTTATAATTTACTTAAGGATTGTCCAGCTGTTGCTAAACATGACTTCTTTAAGTTTAGAATAGACGGTGACATGGTACCACATATATCACGTCAACGTCTTACTAAATACACAATGGCAGACCTCGTCTATGCTTTAAGGCATTTTGATGAAGGTAATTGTGACACATTAAAAGAAATACTTGTCACATACAATTGTTGTGATGATGATTATTTCAATAAAAAGGACTGGTATGATTTTGTAGAAAACCCAGATATATTACGCGTATACGCCAACTTAGGTGAACGTGTACGCCAAGCTTTGTTAAAAACAGTACAATTCTGTGATGCCATGCGAAATGCTGGTATTGTTGGTGTACTGACATTAGATAATCAAGATCTCAATGGTAACTGGTATGATTTCGGTGATTTCATACAAACCACGCCAGGTAGTGGAGTTCCTGTTGTAGATTCTTATTATTCATTGTTAATGCCTATATTAACCTTGACCAGGGCTTTAACTGCAGAGTCACATGTTGACACTGACTTAACAAAGCCTTACATTAAGTGGGATTTGTTAAAATATGACTTCACGGAAGAGAGGTTAAAACTCTTTGACCGTTATTTTAAATATTGGGATCAGACATACCACCCAAATTGTGTTAACTGTTTGGATGACAGATGCATTCTGCATTGTGCAAACTTTAATGTTTTATTCTCTACAGTGTTCCCACTTACAAGTTTTGGACCACTAGTGAGAAAAATATTTGTTGATGGTGTTCCATTTGTAGTTTCAACTGGATACCACTTCAGAGAGCTAGGTGTTGTACATAATCAGGATGTAAACTTACATAGCTCTAGACTTAGTTTTAAGGAATTACTTGTGTATGCTGCTGACCCTGCTATGCACGCTGCTTCTGGTAATCTATTACTAGATAAACGCACTACGTGCTTTTCAGTAGCTGCACTTACTAACAATGTTGCTTTTCAAACTGTCAAACCTGGTAATTTTAACAAAGACTTCTATGACTTTGCTGTGTCTAAGGGTTTCTTTAAGGAAGGAAGTTCTGTTGAATTAAAACACTTCTTCTTTGCTCAGGATGGTAATGCTGCTATCAGCGATTATGACTACTATCGTTATAATCTACCAACAATGTGTGATATCAGACAACTACTATTTGTAGTTGAAGTTGTTGATAAGTACTTTGATTGTTACGATGGTGGCTGTATTAATGCTAACCAAGTCATCGTCAACAACCTAGACAAATCAGCTGGTTTTCCATTTAATAAATGGGGTAAGGCTAGACTTTATTATGATTCAATGAGTTATGAGGATCAAGATGCACTTTTCGCATATACAAAACGTAATGTCATCCCTACTATAACTCAAATGAATCTTAAGTATGCCATTAGTGCAAAGAATAGAGCTCGCACCGTAGCTGGTGTCTCTATCTGTAGTACTATGACCAATAGACAGTTTCATCAAAAATTATTGAAATCAATAGCCGCCACTAGAGGAGCTACTGTAGTAATTGGAACAAGCAAATTCTATGGTGGTTGGCACAACATGTTAAAAACTGTTTATAGTGATGTAGAAAACCCTCATCTTATGGGTTGGGATTATCCTAAATGTGATAGAGCCATGCCTAACATGCTTAGAATTATGGCCTCACTTGTTCTTGCTCGCAAACATACAACGTGTTGTAGCTTGTCACACCGTTTCTATAGATTAGCTAATGAGTGTGCTCAAGTATTGAGTGAAATGGTCATGTGTGGCGGTTCACTATATGTTAAACCAGGTGGAACCTCATCAGGAGATGCCACAACTGCTTATGCTAATAGTGTTTTTAACATTTGTCAAGCTGTCACGGCCAATGTTAATGCACTTTTATCTACTGATGGTAACAAAATTGCCGATAAGTATGTCCGCAATTTACAACACAGACTTTATGAGTGTCTCTATAGAAATAGAGATGTTGACACAGACTTTGTGAATGAGTTTTACGCATATTTGCGTAAACATTTCTCAATGATGATACTCTCTGACGATGCTGTTGTGTGTTTCAATAGCACTTATGCATCTCAAGGTCTAGTGGCTAGCATAAAGAACTTTAAGTCAGTTCTTTATTATCAAAACAATGTTTTTATGTCTGAAGCAAAATGTTGGACTGAGACTGACCTTACTAAAGGACCTCATGAATTTTGCTCTCAACATACAATGCTAGTTAAACAGGGTGATGATTATGTGTACCTTCCTTACCCAGATCCATCAAGAATCCTAGGGGCCGGCTGTTTTGTAGATGATATCGTAAAAACAGATGGTACACTTATGATTGAACGGTTCGTGTCTTTAGCTATAGATGCTTACCCACTTACTAAACATCCTAATCAGGAGTATGCTGATGTCTTTCATTTGTACTTACAATACATAAGAAAGCTACATGATGAGTTAACAGGACACATGTTAGACATGTATTCTGTTATGCTTACTAATGATAACACCTCAAGGTATTGGGAACCTGAGTTTTATGAGGCTATGTACACACCGCATACAGTCTTACAGGCTGTTGGGGCTTGTGTTCTTTGCAATTCACAGACTTCATTAAGATGTGGTGCTTGCATACGTAGACCATTCTTATGTTGTAAATGCTGTTACGACCATGTCATATCAACATCACATAAATTAGTCTTGTCTGTTAATCCGTATGTTTGCAATGCTCCAGGTTGTGATGTCACAGATGTGACTCAACTTTACTTAGGAGGTATGAGCTATTATTGTAAATCACATAAACCACCCATTAGTTTTCCATTGTGTGCTAATGGACAAGTTTTTGGTTTATATAAAAATACATGTGTTGGTAGCGATAATGTTACTGACTTTAATGCAATTGCAACATGTGACTGGACAAATGCTGGTGATTACATTTTAGCTAACACCTGTACTGAAAGACTCAAGCTTTTTGCAGCAGAAACGCTCAAAGCTACTGAGGAGACATTTAAACTGTCTTATGGTATTGCTACTGTACGTGAAGTGCTGTCTGACAGAGAATTACATCTTTCATGGGAAGTTGGTAAACCTAGACCACCACTTAACCGAAATTATGTCTTTACTGGTTATCGTGTAACTAAAAACAGTAAAGTACAAATAGGAGAGTACACCTTTGAAAAAGGTGACTATGGTGATGCTGTTGTTTACCGAGGTACAACAACTTACAAATTAAATGTTGGTGATTATTTTGTGCTGACATCACATACAGTAATGCCATTAAGTGCACCTACACTAGTGCCACAAGAGCACTATGTTAGAATTACTGGCTTATACCCAACACTCAATATCTCAGATGAG
TTTTCTAGCAATGTTGCAAATTATCAAAAGGTTGGTATGCAAAAGTATTCTACACTCCAGGGACCACCTGGTACTGGTAAGAGTCATTTTGCTATTGGCCTAGCTCTCTACTACCCTTCTGCTCGCATAGTGTATACAGCTTGCTCTCATGCCGCTGTTGATGCACTATGTGAGAAGGCATTAAAATATTTGCCTATAGATAAATGTAGTAGAATTATACCTGCACGTGCTCGTGTAGAGTGTTTTGATAAATTCAAAGTGAATTCAACATTAGAACAGTATGTCTTTTGTACTGTAAATGCATTGCCTGAGACGACAGCAGATATAGTTGTCTTTGATGAAATTTCAATGGCCACAAATTATGATTTGAGTGTTGTCAATGCCAGATTACGTGCTAAGCACTATGTGTACATTGGCGACCCTGCTCAATTACCTGCACCACGCACATTGCTAACTAAGGGCACACTAGAACCAGAATATTTCAATTCAG
TGTGTAGACTTATGAAAACTATAGGTCCAGACATGTTCCTCGGAACTTGTCGGCGTTGTCCTGCTGAAATTGTTGACACTGTGAGTGCTTTGGTTTATGATAATAGGCTTAAAGCACATAAAGACAAATCAGCTCAATGCTTTAAAATGTTTTATAAGGGTGTTATCACGCATGATGTTTCATCTGCAATTAACAGGCCACAAATAGGCGTGGTAAGAGAATTCCTTACACGTAACCCTGCTTGGAGAAAAGCTGTCTTTATTTCACCTTATAATTCACAGAATGCTGTAGCCTCAAAGATTTTGGGACTACCAACTCAAACTGTTGATTCATCACAGGGCTCAGAATATGACTATGTCATATTCACTCAAACCACTGAAACAGCTCACTCTTGTAATGTAAACAGATTTAATGTTGCTATTACCAGAGCAAAAGTAGGCATACTTTGCATAATGTCTGATAGAGACCTTTATGACAAGTTGCAATTTAC
AAGTCTTGAAATTCCACGTAGGAATGTGGCAACTTTACAAGCTGAAAATGTAACAGGACTCTTTAAAGATTGTAGTAAGGTAATCACTGGGTTACATCCTACACAGGCACCTACACACCTCAGTGTTGACACTAAATTCAAAACTGAAGGTTTATGTGTTGACATACCTGGCATACCTAAGGACATGACCTATAGAAGACTCATCTCTATGATGGGTTTTAAAATGAATTATCAAGTTAATGGTTACCCTAACATGTTTATCACCCGCGAAGAAGCTATAAGACATGTACGTGCATGGATTGGCTTCGATGTCGAGGGGTGTCATGCTACTAGAGAAGCTGTTGGTACCAATTTACCTTTACAGCTAGGTTTTTCTACAGGTGTTAACCTAGTTGCTGTACCTACAGGTTATGTTGATACACCTAATAATACAGATTTTTCCAGAGTTAGTGCTAAACCACCGCCTGGAGATCAATTTAAACACCTCATA
CCACTTATGTACAAAGGACTTCCTTGGAATGTAGTGCGTATAAAGATTGTACAAATGTTAAGTGACACACTTAAAAATCTCTCTGACAGAGTCGTATTTGTCTTATGGGCACATGGCTTTGAGTTGACATCTATGAAGTATTTTGTGAAAATAGGACCTGAGCGCACCTGTTGTCTATGTGATAGACGTGCCACATGCTTTTCCACTGCTTCAGACACTTATGCCTGTTGGCATCATTCTATTGGATTTGATTACGTCTATAATCCGTTTATGATTGATGTTCAACAATGGGGTTTTACAGGTAACCTACAAAGCAACCATGATCTGTATTGTCAAGTCCATGGTAATGCACATGTAGCTAGTTGTGATGCAATCATGACTAGGTGTCTAGCTGTCCACGAGTGCTTTGTTAAGCGTGTTGACTGGACTATTGAATATCCTATAATTGGTGATGAACTGAAGATTAATGCGGCTTGTAGAAAGGTTCAAC
ACATGGTTGTTAAAGCTGCATTATTAGCAGACAAATTCCCAGTTCTTCACGACATTGGTAACCCTAAAGCTATTAAGTGTGTACCTCAAGCTGATGTAGGATGGAAGTTCTATGATGCACAGCCTTGTAGTGACAAAGCTTATAAAATAGAAGAATTATTCTATTCTTATGCCACACATTCTGACAAATTCACAGATGGTGTATGCCTATTTTGGAATTGCAATGTCGATAGATATCCTGCTAATTCCATTGTTTGTAGATTTGACACTAGAGTGCTATCTAACCTTAACTTGCCTGGTTGTGATGGTGGCAGTTTGTATGTAAATAAACATGCATTCCACACACCAGCTTTTGATAAAAGTGCTTTTGTTAATTTAAAACAATTACCATTTTTCTATTACTCTGACAGTCCATGTGAGTCTCATGGAAAACAAGTAGTGTCAGATATAGATTATGTACCACTAAAGTCTGCTACGTGTATAACACGTTGCAATTTAGGTGGTGCTGTCTGTAGACATCATGCTAATGAGTACAGATTGTATCTCGATGCTTATAACATGATGATCTCAGCTGGCTTTAGCTTGTGGGTTTACAAACAATTTGATACTTATAACCTCTGGAACACTTTTACAAGACTTCAGAGTTTAGAAAATGTGGCTTTTAATGTTGTAAATAAGGGACACTTTGATGGACAACAGGGTGAAGTACCAGTTTCTATCATTAATAACACTGTTTACACAAAAGTTGATGGTGTTGATGTAGAATTGTTTGAAAATAAAACAACATTACCTGTTAATGTAGCATTTGAGCTTTGGGCTAAGCGCAACATTAAACCAGTACCAGAGGTGAAAATACTCAATAATTTGGGTGTGGATATTGCTGCTAATACTGTGATCTGGGACTACAAAAGAGATGCTCCAGCACATATATCTACTATTGGTGTTTGTTCTATGACTGACATAGCCAAGAAACCAACTGAAACGATTTGTGCACCACTCACTGTCTTTTTTGATGGTAGAGTTGATGGTCAAGTAGACTTATTTAGAAATGCCCGTAATGGTGTTCTTATTACAGAAGGTAGTGTTAAAGGTTTACAACCATCTGTAGGTCCCAAACAAGCTAGTCTTAATGGAGTCACATTAATTGGAGAAGCCGTAAAAACACAGTTCAATTATTATAAGAAAGTTGATGGTGTTGTCCAACAATTACCTGAAACTTACTTTACTCAGAGTAGAAATTTACAAGAATTTAAACCCAGGAGTCAAATGGAAATTGATTTCTTAGAATTAGCTATGGATGAATTCATTGAACGGTATAAATTAGAAGGCTATGCCTTCGAACATATCGTTTATGGAGATTTTAGTCATAGTCAGTTAGGTGGTTTACATCTACTGATTGGACTAGCTAAACGTTTTAAGGAATCACCTTTTGAATTAGAAGATTTTATTCCTATGGACAGTACAGTTAAAAACTATTTCATAACAGATGCGCAAACAGGTTCATCTAAGTGTGTGTGTTCTGTTATTGATTTATTACTTGATGATTTTGTTGAAATAATAAAATCCCAAGATTTATCTGTAGTTTCTAAGGTTGTCAAAGTGACTATTGACTATACAGAAATTTCATTTATGCTTTGTGTAAAGATGGCCATGTAGAAACATTTTACCCAAAATTACAATCTAGTCAAGCGTGGCAACCGGGTGTTGCTATGCCTAATCTTTACAAAATGCAAAGAATGCTATTAGAAAAGTGTGACCTTCAAAATTATGGTGATAGTGCAACATTACCTAAAGGCATAATGATGAATGTCGCAAAATATACTCAACTGTGTCAATATTTAAACACATTAACATTAGCTGTACCCTATAATATGAGAGTTATACATTTTGGTGCTGGTTCTGATAAAGGAGTTGCACCAGGTACAGCTGTTTTAAGACAGTGGTTGCCTACGGGTACGCTGCTTGTCGATTCAGATCTTAATGACTTTGTCTCTGATGCAGATTCAACTTTGATTGGTGATTGTGCAACTGTACATACAGCTAATAAATGGGATCTCATTATTAGTGATATGTACGACCCTAAGACTAAAAATGTTACAAAAGAAAATGACTCTAAAGAGGGTTTTTTCACTTACATTTGTGGGTTTATACAACAAAAGCTAGCTCTTGGAGGTTCCGTGGCTATAAAGATAACAGAACATTCTTGGAATGCTGATCTTTATAAGCTCATGGGACACTTCGCATGGTGGACAGCCTTTGTTACTAATGTGAATGCGTCATCATCTGAAGCATTTTTAATTGGATGTAATTATCTTGGCAAACCACGCGAACAAATAGATGGTTATGTCATGCATGCAAATTACATATTTTGGAGGAATACAAATCCAATTCAGTTGTCTTCCTATTCTTTATTTGACATGAGTAAATTTCCCCTTAAATTAAGGGGTACTGCTGTTATGTCTTTAAAAGAAGGTCAAATCAATGATATGATTTTATCTCTTCTTAGTAAAGGTAGACTTATAATTAGAGAAAACAACAGAGTTGTTATTTCTAGTGATGTTCTTGTTAACAACTAAACGAACAATGTTTGTTTTTCTTGTTTTATTGCCACTAGTCTCTAGTCAGTGTGTTAATCTTACAACCAGAACTCAATTACCCCCTGCATACACTAATTCTTTCACACGTGGTGTTTATTACCCTGACAAAGTTTTCAGATCCTCAGTTTTACATTCAACTCAGGACTTGTTCTTACCTTTCTTTTCCAATGTTACTTGGTTCCATGCTATCTCTGGGACCAATGGTACTAAGAGGTTTGATAACCCTGTCCTACCATTTAATGATGGTGTTTATTTTGCTTCCACTGAGAAGTC
TAACATAATAAGAGGCTGGATTTTTGGTACTACTTTAGATTCGAAGACCCAGTCCCTACTTATTGTTAATAACGCTACTAATGTTGTTATTAAAGTCTGTGAATTTCAATTTTGTAATGATCCATTTTTGGGTGTTTACCACAAAAACAACAAAAGTTGGATGGAAAGTGAGTTCAGAGTTTATTCTAGTGCGAATAATTGCACTTTTGAATATGTCTCTCAGCCTTTTCTTATGGACCTTGAAGGAAAACAGGGTAATTTCAAAAATCTTAGGGAATTTGTGTTTAAGAATATTGATGGTTATTTTAAAATATATTCTAAGCACACGCCTATTAATTTAGTGCGTGATCTCCCTCAGGGTTTTTCGGCTTTAGAACCATTGGTAGATTTGCCAATAGGTATTAACATCACTAGGTTTCAAACTTTACTTGCTTTACATAGAAGTTATTTGACTCCTGGTGATTCTTCTTCAGGTTGGACAGCTGGTGCTGCAGCTTATTATGTGGGTTATCTTCAACCTAGGACTTTTCTATTAAAATATAATGAAAATGGAACCATTACAGATGCTGTAGACTGTGCACTTGACCCTCTCTCAGAAACAAAGTGTACGTTGAAATCCTTCACTGTAGAAAAAGGAATCTATCAAACTTCTAACTTTAGAGTCCAACCAACAGAATCTATTGTTAGATTTCCTAATATTACAAACTTGTGCCCTTTTGGTGAAGTTTTTAACGCCACCAGATTTGCATCTGTTTATGCTTGGAACAGGAAGAGAATCAGCAACTGTGTTGCTGATTATTCTGTCCTATATAATTCCGCATCATTTTCCACTTTTAAGTGTTATGGAGTGTCTCCTACTAAATTAAATGATCTCTGCTTTACTAATGTCTATGCAGATTCATTTGTAATTAGAGGTGATGAAGTCAGACAAATCGCTCCAGGGCAAACTGGAAAGATTGCTGATTATAATTATAAATTACCAGATGATTTTACAGGCTGCGTTATAGCTTGGAATTCTAACAATCTTGATTCTAAGGTTGGTGGTAATTATAATTACCTGTATAGATTGTTTAGGAAGTCTAATCTCAAACCTTTTGAGAGAGATATTTCAACTGAAATCTATCAGGCCGGTAGCACACCTTGTAATGGTGTTGAAGGTTTTAATTGTTACTTTCCTTTACAATCATATGGTTTCCAACCCACTTATGGTGTTGGTTACCAACCATACAGAGTAGTAGTACTTTCTTTTGAACTTCTACATGCACCAGCAACTGTTTGTGGACCTAAAAAGTCTACTAATTTGGTTAAAAACAAATGTGTCAATTTCAACTTCAATGGTTTAACAGGCACAGGTGTTCTTACTGAGTCTAACAAAAAGTTTCTGCCTTTCCAACAATTTGGCAGAGACATTGATGACACTACTGATGCTGTCCGTGATCCACAGACACTTGAGATTCTTGACATTACACCATGTTCTTTTGGTGGTGTCAGTGTTATAACACCAGGAACAAATACTTCTAACCAGGTTGCTGTTCTTTATCAGGGTGTTAACTGCACAGAAGTCCCTGTTGCTATTCATGCAGATCAACTTACTCCTACTTGGCGTGTTTATTCTACAGGTTCTAATGTTTTTCAAACACGTGCAGGCTGTTTAATAGGGGCTGAACATGTCAACAACTCATATGAGTGTGACATACCCATTGGTGCAGGTATATGCGCTAGTTATCAGACTCAGACTAATTCTCATCGGCGGGCACGTAGTGTAGCTAGTCAATCCATCATTGCCTACACTATGTCACTTGGTGCAGAAAATTCAGTTGCTTACTCTAATAACTCTATTGCCATACCCATAAATTTTACTATTAGTGTTACCACAGAAATTCTACCAGTGTCTATGACCAAGACATCAGTAGATTGTACAATGTACATTTGTGGTGATTCAACTGAATGCAGCAATCTTTTGTTGCAATATGGCAGTTTTTGTACACAATTAAACCGTGCTTTAACTGGAATAGCTGTTGAACAAGACAAAAACACCCAAGAAGTTTTTGCACAAGTCAAACAAATTTACAAAACACCACCAATTAAAGATTTTGGTGGTTTTAATTTTTCACAAATATTACCAGATCCATCAAAACCAAGCAAGAGGTCATTTATTGAAGATCTACTTTTCAACAAAGTGACACTTGCAGATGCTGGCTTCATCAAACAATATGGTGATTGCCTTGGTGATATTGCTGCTAGAGACCTCATTTGTGCACAAAAGTTTAACGGCCTTACTGTTTTGCCACCTTTGCTCACAGATGAAATGATTGCTCAATACACTTCTGCACTGTTAGCGGGTACAATCACTTCTGGTTGGACCTTTGGTGCAGGTGCTGCATTACAAATACCATTTGCTATGCAAATGGCTTATAGGTTTAATGGTATTGGAGTTACACAGAATGTTCTCTATGAGAACCAAAAATTGATTGCCAACCAATTTAATAGTGCTATTGGCAAAATTCAAGACTCACTTTCTTCCACAGCAAGTGCACTTGGAAAACTTCAAGATGTGGTCAACCAAAATGCACAAGCTTTAAACACGCTTGTTAAACAACTTAGCTCCAATTTTGGTGCAATTTCAAGTGTTTTAAATGATATCCTTGCACGTCTTGACAAAGTTGAGGCTGAAGTGCAAATTGATAGGTTGATCACAGGCAGACTTCAAAGTTTGCAGACATATGTGACTCAACAATTAATTAGAGCTGCAGAAATCAGAGCTTCTGCTAATCTTGCTGCTACTAAAATGTCAGAGTGTGTACTTGGACAATCAAAAAGAGTTGATTTTTGTGGAAAGGGCTATCATCTTATGTCCTTCCCTCAGTCAGCACCTCATGGTGTAGTCTTCTTGCATGTGACTTATGTCCCTGCACAAGAAAAGAACTTCACAACTGCTCCTGCCATTTGTCATGATGGAAAAGCACACTTTCCTCGTGAAGGTGTCTTTGTTTCAAATGGCACACACTGGTTTGTAACACAAAGGAATTTTTATGAACCACAAATCATTACTACACACAACACATTTGTGTCTGGTAACTGTGATGTTGTAATAGGAATTGTCAACAACACAGTTTATGATCCTTTGCAACCTGAATTAGACTCATTCAAGGAGGAGTTAGATAAATATTTTAAGAATCATACATCACCAGATGTTGATTTAGGTGACATCTCTGGCATTAATGCTTCAGTTGTAAACATTCAAAAAGAAATTGACCGCCTCAATGAGGTTGCCAAGAATTTAAATGAATCTCTCATCGATCTCCAAGAACTTGGAAAGTATGAGCAGTATATAAAATGGCCATGGTACATTTGGCTAGGTTTTATAGCTGGCTTGATTGCCATAGTAATGGTGACAATTATGCTTTGCTGTATGACCAGTTGCTGTAGTTGTCTCAAGGGCTGTTGTTCTTGTGGATCCTGCTGCAAATTTGATGAAGACGACTCTGAGCCAGTGCTCAAAGGAGTCAAATTACATTACACATAAACGAACTTATGGATTTGTTTATGAGAATCTTCACAATTGGAACTGTAACTTTGAAGCAAGGTGAAATCAAGGATGCTACTCCTTCAGATTTTGTTCGCGCTACTGCAACGATACCGATACAAGCCTCACTCCCTTTCGGATGGCTTATTGTTGGCGTTGCACTTCTTGCTGTTTTTCAGAGCGCTTCCAAAATCATAACCCTCAAAAAGAGATGGCAACTAGCACTCTCCAAGGGTGTTCACTTTGTTTGCAACTTGCTGTTGTTGTTTGTAACAGTTTACTCACACCTTTTGCTCGTTGCTGCTGGCCTTGAAGCCCCTTTTCTCTATCTTTATGCTTTAGTCTACTTCTTGCAGAGTATAAACTTTGTAAGAATAATAATGAGGCTTTGGCTTTGCTGGAAATGCCGTTCCAAAAACCCATTACTTTATGATGCCAACTATTTTCTTTGCTGGCATACTAATTGTTACGACTATTGTATACCTTACAATAGTGTAACTTCTTCAATTGTCATTACTTCAGGTGATGGCACAACAAGTCCTATTTCTGAACATGACTACCAGATTGGTGGTTATACTGAAAAATGGGAATCTGGAGTAAAAGACTGTGTTGTATTACACAGTTACTTCACTTCAGACTATTACCAGCTGTACTCAACTCAATTGAGTACAGACACTGGTGTTGAACATGTTACCTTCTTCATCTACAATAAAATTGTTGATGAGCCTGAAGAACATGTCCAAATTCACACAATCGACGGTTCATCCGGAGTTGTTAATCCAGTAATGGAACCAATTTATGATGAACCGACGACGACTACTAGCGTGCCTTTGTAAGCACAAGCTGATGAGTACGAACTTATGTACTCATTCGTTTCGGAAGAGACAGGTACGTTAATAGTTAATAGCGTACTTCTTTTTCTTGCTTTCGTGGTATTCTTGCTAGTTACACTAGCCATCCTTACTGCGCTTCGATTGTGTGCGTACTGCTGCAATATTGTTAACGTGAGTCTTGTAAAACCTTCTTTTTACGTTTACTCTCGTGTTAAAAATCTGAATTCTTCTAGAGTTCCTGATCTTCTGGTCTAAACGAACTAAATATTATATTAGTTTTTCTGTTTGGAACTTTAATTTTAGCCATGGCAGATTCCAACGGTACTATTACCGTTGAAGAGCTTAAAAAGCTCCTTGAACAATGGAACCTAGTAATAGGTTTCCTATTCCTTACATGGATTTGTCTTCTACAATTTGCCTATGCCAACAGGAATAGGTTTTTGTATATAATTAAGTTAATTTTCCTCTGGCTGTTATGGCCAGTAACTTTAGCTTGTTTTGTGCTTGCTGCTGTTTACAGAATAAATTGGATCACCGGTGGAATTGCTATCGCAATGGCTTGTCTTGTAGGCTTGATGTGGCTCAGCTACTTCATTGCTTCTTTCAGACTGTTTGCGCGTACGCGTTCCATGTGGTCATTCAATCCAGAAACTAACATTCTTCTCAACGTGCCACTCCATGGCACTATTCTGACCAGACCGCTTCTAGAAAGTGAACTCGTAATCGGAGCTGTGATCCTTCGTGGACATCTTCGTATTGCTGGACACCATCTAGGACGCTGTGACATCAAGGACCTGCCTAAAGAAATCACTGTTGCTACATCACGAACGCTTTCTTATTACAAATTGGGAGCTTCGCAGCGTGTAGCAGGTGACTCAGGTTTTGCTGCAT
ACAGTCGCTACAGGATTGGCAACTATAAATTAAACACAGACCATTCCAGTAGCAGTGACAATATTGCTTTGCTTGTACAGTAAGTGACAACAGATGTTTCATCTCGTTGACTTTCAGGTTACTATAGCAGAGATATTACTAATTATTATGAGGACTTTTAAAGTTTCCATTTGGAATCTTGATTACATCATAAACCTCATAATTAAAAATTTATCTAAGTCACTAACTGAGAATAAATATTCTCAATTAGATGAAGAGCAACCAATGGAGATTGATTAAACGAACATGAAAATTATTCTTTTCTTGGCACTGATAACACTCGCTACTTGTGAGCTTTATCACTACCAAGAGTGTGTTAGAGGTACAACAGTACTTTTAAAAGAACCTTGCTCTTCTGGAACATACGAGGGCAATTCACCATTTCATCCTCTAGCTGATAACAAATTTGCACTGACTTGCTTTAGCACTCAATTTGCTTTTGCTTGTCCTGACGGCGTAAAACACGTCTATCAGTTACGTGCCAGATCAGTTTCACCTAAACTGTTCATCAGACAAGAGGAAGTTCAAGAACTTTACTCTCCAATTTTTCTTATTGTTGCGGCAATAGTGTTTATAACACTTTGCTTCACACTCAAAAGAAAGACAGAATGATTGAACTTTCATTAATTGACTTCTATTTGTGCTTTTTAGCCTTTCTGCTATTCCTTGTTTTAATTATGCTTATTATCTTTTGGTTCTCACTTGAACTGCAAGATCATAATGAAACTTGTCACGCCTAAACGAACATGAAATTTCTTGTTTTCTTAGGAATCATCACAACTGTAGCTGCATTTCACCAAGAATGTAGTTTACAGTCATGTACTTAACATCAACCATATGTAGTTGATGACCCGTGTCCTATTCACTTCTATTCTAAATGGTATATTAGAGTAGGAGCTATAAAATCAGCACCTTTAATTGAATTGTGCGTGGATGAGGCTGGTTCTAAATCACCCATTCAGTGCATCGATATCGGTAATTATACAGTTTCCTGTTTACCTTTTACAATTAATTGCCAGGAACCTAAATTGGGTAGTCTTGTAGTGCGTTGTTCGTTCTATGAAGACTTTTTAGAGTATCATGACGTTCGTGTTGTTTTAGATTTCATCTAAACGAACAAACTAAATGTCTCTAAATGGACCCCAAAATCAGCGAAATGCACCCCGCATTACGTTTGGTGGACCCTCAGATTCAACTGGCAGTAACCAGAATGGAGAACGCAGTGGGGCGCGATCAAAACAACGTCGGCCCCAAGGTTTACCCAATAATACTGCGTCTTGGTTCACCGCTCTCACTCAACATGGCAAGGAAGACCTTAAATTCCCTCGAGGACAAGGCGTTCCAATTAACACCAATAGCAGTCCAGATGACCAAATTGGCTACTACCGAAGAGCTACCAGACGAATTCGTGGTGGTGACGGTAAAATGAAAGATCTCAGTCCAAGATGGTATTTCTACTACCTAGGAACTGGGCCAGAAGCTGGACTTCCCTATGGTGCTAACAAAGACGGCATCATATGGGTTGCAACTGAGGGAGCCTTGAATACACCAAAAGATCACATTGGCACCCGCAATCCTGCTAACAATGCTGCAATCGTGCTACAACTTCCTCAAGGAACAACATTGCCAAAAGGCTTCTACGCAGAAGGGAGCAGAGGCGGCAGTCAAGCCTCTTCTCGTTCCTCATCACGTAGTCGCAACAGTTCAAGAAATTCAACTCCAGGCAGCAGTAAACGAACTTCTCCTGCTAGAATGGCTGGCAATGGCGGTGATGCTGCTCTTGCTTTGCTGCTGCTTGACAGATTGAACCAGCTTGAGAGCAAAATGTTTGGTAAAGGCCAACAACAACAAGGCCAAACTGTCACTAAGAAATCTGCTGCTGAGGCTTCTAAGAAGCCTCGGCAAAAACGTACTGCCACTA
AAGCATACAATGTAACACAAGCTTTCGGCAGACGTGGTCCAGAACAAACCCAAGGAAATTTTGGGGACCAGGAACTAATCAGACAAGGAACTGATTACAAACATTGGCCGCAAATTGCACAATTTGCCCCCAGCGCTTCAGCGTTCTTCGGAATGTCGCGCATTGGCATGGAAGTCACACCTTCGGGAACGTGGTTGACCTACACAGGTGCCATCAAATTGGATGACAAAGATCCAAATTTCAAAGATCAAGTCATTTTGCTGAATAAGCATATTGACGCATACAAAACATTCCCACCAACAGAGCCTAAAAAGGACAAAAAGAAGAAGGCTGATGAAACTCAAGCCTTACCGCAGAGACAGAAGAAACAGCAAACTGTGACTCTTCTTCCTGCTGCAGATTTGGATGATTTCTCCAAACAATTGCAACAATCCATGAGCAGTGCTGACTCAACTCAGGCCTAAACTCATGCAGACCACACAAGGCAGATGGGCTATATAAACGTTTTCGCTTTTCCGTTTACGATATATAGTCTACTCTTGTGCAGAATGAATTCTCGTAACTACATAGCACAAGTAGATGTAGTTAACTTTAATCTCACATAGCAATCTTTAATCAGTGTGTAACATTAGGGAGGACTTGAAAGAGCCACCACATTTTCACCGAGGCCACGCGGAGTACGATCGAGTGTACAGTGAACAATGCTAGGGAGAGCTGCCTATATGGAAGAGCCCTAATGTGTAAAATTAATTTTAGTAGTGCTATCCCCATGTGATTTTAATAGCTTCTTAGGAGAATGNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN
서열번호: 23
> MIG457_gp02 표면 당단백질 , UK_MIG457 유래
MFVFLVLLPLVSSQCVNLTTRTQLPPAYTNSFTRGVYYPDKVFRSSVLHSTQDLFLPFFSNVTWFHAISGTNGTKRFDNPVLPFNDGVYFASTEKSNIIRGWIFGTTLDSKTQSLLIVNNATNVVIKVCEFQFCNDPFLGVYHKNNKSWMESEFRVYSSANNCTFEYVSQPFLMDLEGKQGNFKNLREFVFKNIDGYFKIYSKHTPINLVRDLPQGFSALEPLVDLPIGINITRFQTLLALHRSYLTPGDSSSGWTAGAAAYYVGYLQPRTFLLKYNENGTITDAVDCALDPLSETKCTLKSFTVEKGIYQTSNFRVQPTESIVRFPNITNLCPFGEVFNATRFASVYAWNRKRISNCVADYSVLYNSASFSTFKCYGVSPTKLNDLCFTNVYADSFVIRGDEVRQIAPGQTGKIADYNYKLPDDFTGCVIAWNSNNLDSKVGGNYNYLYRLFRKSNLKPFERDISTEIYQAGSTPCNGVEGFNCYFPLQSYGFQPTYGVGYQPYRVVVLSFELLHAPATVCGPKKSTNLVKNKCVNFNFNGLTGTGVLTESNKKFLPFQQFGRDIDDTTDAVRDPQTLEILDITPCSFGGVSVITPGTNTSNQVAVLYQGVNCTEVPVAIHADQLTPTWRVYSTGSNVFQTRAGCLIGAEHVNNSYECDIPIGAGICASYQTQTNSHRRARSVASQSIIAYTMSLGAENSVAYSNNSIAIPINFTISVTTEILPVSMTKTSVDCTMYICGDSTECSNLLLQYGSFCTQLNRALTGIAVEQDKNTQEVFAQVKQIYKTPPIKDFGGFNFSQILPDPSKPSKRSFIEDLLFNKVTLADAGFIKQYGDCLGDIAARDLICAQKFNGLTVLPPLLTDEMIAQYTSALLAGTITSGWTFGAGAALQIPFAMQMAYRFNGIGVTQNVLYENQKLIANQFNSAIGKIQDSLSSTASALGKLQDVVNQNAQALNTLVKQLSSNFGAISSVLNDILARLDKVEAEVQIDRLITGRLQSLQTYVTQQLIRAAEIRASANLAATKMSECVLGQSKRVDFCGKGYHLMSFPQSAPHGVVFLHVTYVPAQEKNFTTAPAICHDGKAHFPREGVFVSNGTHWFVTQRNFYEPQIITTHNTFVSGNCDVVIGIVNNTVYDPLQPELDSFKEELDKYFKNHTSPDVDLGDISGINASVVNIQKEIDRLNEVAKNLNESLIDLQELGKYEQYIKWPWYIWLGFIAGLIAIVMVTIMLCCMTSCCSCLKGCCSCGSCCKFDEDDSEPVLKGVKLHYT
서열번호: 24
>MW493681.1 중증 급성 호흡기 증후군 코로나바이러스 2 단리물 SARS-CoV-2/인간/USA/NMDOH-2021013232/2021, 완전 게놈. [중증 급성 호흡기 증후군 코로나바이러스 2 (SARS-CoV-2)], 캘리포니아 B.1.427 계통
AAGGTTTATACCTTCCCAGGTAACAAACCAACCAACTTTCGATCTCTTGTAGATCTGTTCTCTAAACGAACTTTAAAATCTGTGTGGCTGTCACTCGGCTGCATGCTTAGTGCACTCACGCAGTATAATTAATAACTAATTACTGTCGTTGACAGGACACGAGTAACTCGTCTATCTTCTGCAGGCTGCTTACGGTTTCGTCCGTGTTGCAGCCGATCATCAGCACATCTAGGTTTTGTCCGGGTGTGACCGAAAGGTAAGATGGAGAGCCTTGTCCCTGGTTTCAACGAGAAAACACACGTCCAACTCAGTTTGCCTGTTTTACAGGTTCGCGACGTGCTCGTACGTGGCTTTGGAGACTCCGTGGAGGAGGTCTTATCAGAGGCACGTCAACATCTTAAAGATGGCACTTGTGGCTTAGTAGAAGTTGAAAAAGGCGTTTTGCCTCAACTTGAACAGCCCTATGTGTTCATCAAACGTTCGGATGCTCGAACTGCACCTCATGGTCATGTTATGGTTGAGCTGGTAGCAGAACTCGAAGGCATTCAGTACGGTCGTAGTGGTGAGACACTTGGTGTCCTTGTCCCTCATGTGGGCGAAATACCAGTGGCTTACCGCAAGGTTCTTCTTCGTAAGAACGGTAATAAAGGAGCTGGTGGCCATAGTTACGGCGCCGATCTAAAGTCATTTGACTTAGGCGACGAGCTTGGCACTGATCCTTATGAAGATTTTCAAGAAAACTGGAACACTAAACATAGCAGTGGTGTTACCCGTGAACTCATGCGTGAGCTTAACGGAGGGGCATACACTCGCTATGTCGATAACAACTTCTGTGGCCCTGATGGCTACCCTCTTGAGTGCATTAAAGACCTTCTAGCACGTGCTGGTAAAGCTTCATGCACTTTGTCCGAACAACTGGACTTTATTGACACTAAGAGGGGTGTATACTGCTGCCGTGAACATGAGCATGAAATTGCTTGGTACACGGAACGTTCTGAAAAGAGCTATGAATTGCAGACACCTTTTGAAATTAAATTGGCAAAGAAATTT
GACATCTTCAATGGGGAATGTCCAAATTTTGTATTTCCCTTAAATTCCATAATCAAGACTATTCAACCAAGGGTTGAAAAGAAAAAGCTTGATGGCTTTATGGGTAGAATTCGATCTGTCTATCCAGTTGCGTCACCAAATGAATGCAACCAAATGTGCCTTTCAACTCTCATGAAGTGTGATCATTGTGGTGAAACTTCATGGCAGACGGGCGATTTTGTTAAAGCCACTTGCGAATTTTGTGGCACTGAGAATTTGACTAAAGAAGGTGCCACTACTTGTGGTTACTTACCCCAAAATGCTGTTGTTAAAATTTATTGTCCAGCATGTCACAATTCAGAAGTAGGACCTGAGCATAGTCTTGCCGAATACCATAATGAATCTGGCTTGAAAACCATTCTTCGTAAGGGTGGTCGCACTATTGCCTTTGGAGGCTGTGTGTTCTCTTATGTTGGTTGCCATAACAAGTGTGCCTATTGGGTTCCACGTG
CTAGCGCTAACATAGGTTGTAACCATACAGGTGTTGTTGGAGAAGGTTCCGAAGGTCTTAATGACAACCTTCTTGAAATACTCCAAAAAGAGAAAGTCAACATCAATATTGTTGGTGACTTTAAACTTAATGAAGAGATCGCCATTATTTTGGCATCTTTTTCTGCTTCCACAAGTGCTTTTGTGGAAACTGTGAAAGGTTTGGATTATAAAGCATTCAAACAAATTGTTGAATCCTGTGGTAATTTTAAAGTTACAAAAGGAAAAGCTAAAAAAGGTGCCTGGAATATTGGTGAACAGAAATCAATACTGAGTCCTCTTTATGCATTTGCATCAGAGGCTGCTCGTGTTGTACGATCAATTTTCTCCCGCACTCTTGAAACTGCTCAAAATTCTGTGCGTGTTTTACAGAAGGCCGCTATAACAATACTAGATGGAATTTCACAGTATTCACTGAGACTCATTGATGCTATGATGTTCACATCTGATTT
GGCTACTAACAATCTAGTTGTAATGGCCTACATTACAGGTGGTGTTGTTCAGTTGACTTCGCAGTGGCTAACTAACATCTTTGGCACTGTTTATGAAAAACTCAAACCCGTCCTTGATTGGCTTGAAGAGAAGTTTAAGGAAGGTGTAGAGTTTCTTAGAGACGGTTGGGAAATTGTTAAATTTATCTCAACCTGTGCTTGTGAAATTGTCGGTGGACAAATTGTCACCTGTGCAAAGGAAATTAAGGAGAGTGTTCAGACATTCTTTAAGCTTGTAAATAAATTTTTGGCTTTGTGTGCTGACTCTATCATTATTGGTGGAGCTAAACTTAAAGCCTTGAATTTAGGTGAAACATTTGTCACGCACTCAAAGGGATTGTACAGAAAGTGTGTTAAATCCAGAGAAGAAACTGGCCTACTCATGCCTCTAAAAGCCCCAAAAGAAATTATCTTCTTAGAGGGAGAAACACTTCCCACAGAAGTGTTAACAGAGGAAGTTGTCTTGAAAACTGGTGATTTACAACCATTAGAACAACCTACTAGTGAAGCTGTTGAAGCTCCATTGGTTGGTACACCAGTTTGTATTAACGGGCTTATGTTGCTCGAAATCAAAGACACAGAAAAGTACTGTGCCCTTGCACCTAATATGATGGTAACAAACAATACCTTCACACTCAAAGGCGGTGCACCAACAAAGGTTACTTTTGGTGATGACACTGTGATAGAAGTGCAAGGTTACAAGAGTGTGAATATCACTTTTGAACTTGATGAAAGGATTGATAAAGTACTTAATGAGAAGTGCTCTGCCTATACAGTTGAACTCGGTACAGAAGTAAATGAGTTCGCCTGTGTTGTGGCAGATGCTGTCATAAAAACTTTGCAACCAGTATCTGAATTACTTACACCACTGGGCATTGATTTAGATGAGTGGAGTATGGCTACATACTACTTATTTGATGAGTCTGGTGAGTTTAAATTGGCTTCACATATGTATTGTTCTTTTTACCCTCCAGATGAGGATGAAGAAGAAGGTGATTGTGAAGAAGAAGAGTTTGAGCCATCAACTCAATATGAGTATGGTACTGAAGATGATTACCAAGGTAAACCTTTGGAATTTGGTGCCACTTCTGCTGCTCTTCAACCTGAAGAAGAGCAAGAAGAAGATTGGTTAGATGATGATAGTCAACAAACTGTTGGTCAACAAGACGGCAGTGAGGACAATCAGACAACTACTATTCAAACAATTGTTGAGGTTCAACCTCAATTAGAGATGGAACTTACACCAGTTGTTCAGACTATTGAAGTGAATAGTTTTAGTGGTTATTTAAAACTTACTGACAATGTATACATTAAAAATGCAGACATTGTGGAAGAAGCTAAAAAGGTAAAACCAACAGTGGTTGTTAATGCAGCCAATGTTTACCTTAAACATGGAGGAGGTGTTGCAGGAGCCTTAAATAAGGCTACTAACAATGCCATGCAAGTTGAATCTGATGATTACATAGCTACTAATGGACCACTTAAAGTGGGTGGTAGTTGT
GTTTTAAGCGGACACAATCTTGCTAAACACTGTCTTCATGTTGTCGGCCCAAATGTTAACAAAGGTGAAGACATTCAACTTCTTAAGAGTGCTTATGAAAATTTTAATCAGCACGAAGTTCTACTTGCACCATTATTATCAGCTGGTATTTTTGGTGCTGACCCTATACATTCTTTAAGAGTTTGTGTAGATACTGTTCGCACAAATGTCTACTTAGCTGTCTTTGATAAAAATCTCTATGACAAACTTGTTTCAAGCTTTTTGGAAATGAAGAGTGAAAAGCAAGTTGAACAAAAGATCGCTGAGATTCCTAAAGAGGAAGTTAAGCCATTTATAACTGAAAGTAAACCTTCAGTTGAACAGAGAAAACAAGATGATAAGAAAATCAAAGCTTGTGTTGAAGAAGTTACAACAACTCTGGAAGAAACTAAGTTCCTCACAGAAAACTTGTTACTTTATATTGACATTAATGGCAATCTTCATCCAGATT
CTGCCACTCTTGTTAGTGACATTGACATCACTTTCTTAAAGAAAGATGCTCCATATATAGTGGGTGATGTTGTTCAAGAGGGTGTTTTAACTGCTGTGGTTATACCTACTAAAAAGGCTGGTGGCACTACTGAAATGCTAGCGAAAGCTTTGAGAAAAGTGCCAACAGACAATTATATAACCACTTACCCGGGTCAGGGTTTAAATGGTTACACTGTAGAGGAGGCAAAGACAGTGCTTAAAAAGTGTAAAAGTGCCTTTTACATTCTACCATCTATTATCTCTAATGAGAAGCAAGAAATTCTTGGAACTGTTTCTTGGAATTTGCGAGAAATGCTTGCACATGCAGAAGAAACACGCAAATTAATGCCTGTCTGTGTGGAAACTAAAGCCATAGTTTCAACTATACAGCGTAAATATAAGGGTATTAAAATACAAGAGGGTGTGGTTGATTATGGTGCTAGATTTTACTTTTACACCAGTAAAACAACTGTAGCGTCACTTATCAACACACTTAACGATCTAAATGAAACTCTTGTTACAATGCCACTTGGCTATGTAACACATGGCTTAAATTTGGAAGAAGCTGCTCGGTATATGAGATCTCTCAAAGTGCCAGCTACAGTTTCTGTTTCTTCACCTGATGCTGTTACAGCGTATAATGGTTATCTTACTTCTTCTTCTAAAACACCTGAAGAACATTTTATTGAAACCATCTCACTTGCTGGTTCCTATAAAGATTGGTCCTATTCTGGACAATCTACACAACTAGGTATAGAATTTCTTAAGAGAGGTGATAAAAGTGTATATTACACTAGTAATCCTACCACATTCCACCTAGATGGTGAAGTTATCACCTTTGACAATCTTAAGACACTTCTTTCTTTGAGAGAAGTGAGGACTATTAAGGTGTTTACAACAGTAGACAACATTAACCTCCACACGCAAGTTGTGGACATGTCAATGACATATGGACAACAGTTTGGTCCAACTTATTTGGATGGAGCTGATGTTACTAAAATAAAACCTCATAATTCACATGAAGGTAAAACATTTTATGTTTTACCTAATGATGACACTCTACGTGTTGAGGCTTTTGAGTACTACCACACAACTGATCCTAGTTTTCTGGGTAGGTACATGTCAGCATTAAATCACACTAAAAAGTGGAAATACCCACAAGTTAATGGTTTAACTTCTATTAAATGGGCAGATAACAACTGTTATCTTGCCACTGCATTGTTAACACTCCAACAAATAGAGTTGAAGTTTAATCCACCTGCTCTACAAGATGCTTATTACAGAGCAAGGGCTGGTGAAGCTGCTAACTTTTGTGCACTTATCTTAGCCTACTGTAATAAGACAGTAGGTGAGTTAGGTGATGTTAGAGAAACAATGAGTTACTTGTTTCAACATGCCAATTTAGATTCTTGCAAAAGAGTCTTGAACGTGGTGTGTAAAACTTGTGGACAACAGCAGACAACCCTTAAGGGTGTAGAAGCTGTTATGTACATGGGCACACTTTCTTATGAACAATTTAAGAAAGGTGTTCAGATACCTTGTACGTGTGGTAAACAAGCTACAAAATATCTAGTACAACAGGAGTCACCTTTTGTTATGATGTCAGCACCACCTGCTCAGTATGAACTTAAGCATGGTACATTTACTTGTGCTAGTGAGTACACTGGTAATTACCAGTGTGGTCACTATAAACATATAACTTCTAAAGAAACTTTGTATTGCATAGACGGTGCTTTACTTACAAAGTCCTCAGAATACAAAGGTCCTATTACGGATGTTTTCTACAAAGAAAACAGTTACACAACAACCATAAAACCAGTTACTTATAAATTGGATGGTGTTGTTTGTACAGAAATTGACCCTAAGTTGGACAATTATTATAAGAAAGACAATTCTTATTTCACAGAGCAACCAATTGATCTTGTACCAAACCAACCATATCCAAACGCAAGCTTCGATAATTTTAAGTTTGTATGTGATAATATCAAATTTGCTGATGATTTAAACCAGTTAACTGGTTATAAGAAACCTGCTTCAAGAGAGCTTAAAGTTACATTTTTCCCTGACTTAAATGGTGATGTGGTGGCTATTGATTATAAACACTACACACCCTCTTTTAAGAAAGGAGCTAAATTGTTACATAAACCTATTGTTTGGCATGTTAACAATGCAACTAATAAAGCCACGTATAAACCAAATACCTGGTGTATACGTTGTCTTTGGAGCACAAAACCAGTTGAAACATCAAATTCGTTTGATGTACTGAAGTCAGAGGACGCGCAGGGAATGGATAATCTTGTCTGCGAAGATCTAAAACCAGTCTCTGAAGAAGTAGTGGAAAATCCTACCATACAGAAAGACGTTCTTGAGTGTAATGTGAAAACTACCGAAGTTGTAGGAGACATTATACTTAAACCAGCAAATAATAGTTTAAAAATTACAGAAGAGGTTGGCCACACAGATCTAATGGCTGCTTATGTAGACAATTCTAGTCTTACTATTAAGAAACCTAATGAATTATCTAGAGTATTAGGTTTGAAAACCCTTGCTACTCATGGTTTAGCTGCTGTTAATAGTGTCCCTTGGGATACTATAGCTAATTATGCTAAGCCTTTTCTTAACAAAGTTGTTAGTACAACTACTAACATAGTTACACGGTGTTTAAACCGTGTTTGTACTAATTATATGCCTTATTTCTTTACTTTATTGCTACAATTGTGTACTTTTACTAGAAGTACAAATTCTAGAATTAAAGCATCTATGCCGACTACTATAGCAAAGAATACTGTTAAGAGTGTCGGTAAATTTTGTCTAGAGGCTTCATTTAATTATTTGAAGTCACCTAATTTTTCTAAACTGATAAATATTATAATTTGGTTTTTACTATTAAGTGTTTGCCTAGGTTCTTTAATCTACTCAACCGCTGCTTTAGGTGTTTTAATGTCTAATTTAGGCATGCCTTCTTACTGTACTGGTTACAGAGAAGGCTATTTGAACTCTACTAATGTCACTATTGCAACCTACTGTACTGGTTCTATACCTTGTAGTGTTTGTCTTAGTGGTTTAGATTCTTTAGACACCTATCCTTCTTTAGAAACTATACAAATTACCATTTCATCTTTTAAATGGGATTTAACTGCTTTTGGCTTAGTTGCAGAGTGGTTTTTGGCATATATTCTTTTCACTAGGTTTTTCTATGTACTTGGATTGGCTGCAATCATGCAATTGTTTTTCAGCTATTTTGCAGTACATTTTATTAGTAATTCTTGGCTTATGTGGTTAATAATTAATCTTGTACAAATGGCCCCGATTTCAGCTATGGTTAGAATGTACATCTTCT
TTGCATCATTTTATTATGTATGGAAAAGTTATGTGCATGTTGTAGACGGTTGTAATTCATCAACTTGTATGATGTGTTACAAACGTAATAGAGCAACAAGAGTCGAATGTACAACTATTGTTAATGGTGTTAGAAGGTCCTTTTATGTCTATGCTAATGGAGGTAAAGGCTTTTGCAAACTACACAATTGGAATTGTGTTAATTGTGATACATTCTGTGCTGGTAGTACATTTATTAGTGATGAAGTTGCGAGAGACTTGTCACTACAGTTTAAAAGACCAATAAATCCTACTGACCAGTCTTCTTACATCGTTGATAGTGTTACAGTGAAGAATGGTTCCATCCATCTTTACTTTGATAAAGCTGGTCAAAAGACTTATGAAAGACATTCTCTCTCTCATTTTGTTAACTTAGACAACCTGAGAGCTAATAACACTAAAGGTTCATTGCCTATTAATGTTATAGTTTTTGATGGTAAATCAAAATGTGAAGAATCATCTGCAAAATCAGCGTCTGTTTACTACAGTCAGCTTATGTGTCAACCTATACTGTTACTAGATCAGGCATTAGTGTCTGATGTTGGTGATAGTGCGGAAGTTGCAGTTAAAATGTTTGATGCTTACGTTAATACGTTTTCATCAACTTTTAACGTACCAATGGAAAAACTCAAAACACTAGTTGCAACTGCAGAAGCTGAACTTGCAAAGAATGTGTCCTTAGACAATGTCTTATCTACTTTTATTTCAGCAGCTCGGCAAGGGTTTGTTGATTCAGATGTAGAAACTAAAGATGTTGTTGAATGTCTTAAATTGTCACATCAATCTGACATAGAAGTTACTGGCGATAGTTGTAATAACTATATGCTCACCTATAACAAAGTTGAAAACATGACACCCCGTGACCTTGGTGCTTGTATTGACTGTAGTGCGCGTCATATTAATGCGCAGGTAGCAAAAAGTCACAACATTGCTTTGATATGGAACGTTAAAGATTTCATGTCATTGTCTGAACAACTACGAAAACAAATACGTAGTGCTGCTAAAAAGAATAACTTACCTTTTAAGTTGACATGTGCAACTACTAGACAAGTTGTTAATGTTGTAACAACAAAGATAGCACTTAAGGGTGGTAAAATTGTTAATAATTGGTTGAAGCAGTTAATTAAAGTTACACTTGTGTTCCTTTTTGTTGCTGCTATTTTCTATTTAATAACACCTGTTCATGTCATGTCTAAACATACTGACTTTTCAAGTGAAATCATAGGATACAAGGCTATTGATGGTGGTGTCACTCGTGACATAGCATCTACAGATACTTGTTTTGCTAACAAACATGCTGATTTTGACACATGGTTTAGCCAGCGTGGTGGTAGTTATACTAATGACAAAGCTTGCCCATTGATTGCTGCAGTCATAACAAGAGAAGTGGGTTTTGTCGTGCCTGGTTTGCCTGGCACGATATTACGCACAACTAATGGTGACTTTTTGCATTTCTTACCTAGAGTTTTTAGTGCAGTTGGTAACATCTGTTACACACCATCAAAACTTATAGAGTACACTGACTTTGCAACATCAGCTTGTGTTTTGGCTGCTGAATGTACAATTTTTAAAGATGCTTCTGGTAAGCCAGTACCATATTGTTATGATACCAATGTACTAGAAGGTTCTGTTGCTTATGAAAGTTTACGCCCTGACACACGTTATGTGCTCATGGATGGCTCTATTATTCAATTTCCTAACACCTACCTTGAAGGTTCTGTTAGAGTGGTAACAACTTTTGATTCTGAGTACTGTAGGCACGGCACTTGTGAAAGATCAGAAGCTGGTGTTTGTGTATCTACTAGTGGTAGATGGGTACTTAACAATGATTATTACAGATCTTTACCAGGAGTTTTCTGTGGTGTAGATGCTGTAAATTTACTTACTAATATGTTTACACCACTAATTCAACCTATTGGTGCTTTGGACATATCAGCATCTATAGTAGCTGGTGGTATTGTAGCTATCGTAGTAACATGCCTTGCCTACTATTTTATGAGGTTTAGAAGAGCTTTTGGTGAATACAGTCATGTAGTTGCCTTTAATACTTTACTATTCCTTATGTCATTCACTGTACTCTGTTTAACACCAGTTTACTCATTCTTACCTGGTGTTTATTCTGTTATTTACTTGTACTTGACATTTTATCTTACTAATGATGTTTCTTTTTTAGCACATATTCAGTGGATGGTTATGTTCACACCTTTAGTACCTTTCTGGATAACAATTGCTTATATCATTTGTATTTCCACAAAGCATTTCTATTGGTTCTTTACTAATTACCTAAAGAGACGTGTAGTCTTTAATGGTGTTTCCTTTAGTACTTTTGAAGAAGCTGCGCTGTGCACCTTTTTGTTAAATAAAGAAATGTATCTAAAGTTGCGTAGTGATGTGCTATTACCTCTTACGCAATATAATAGATACTTAGCTCTTTATAATAAGTACAAGTATTTTAGTGGAGCAATGGATACAACTAGCTACAGAGAAGCTGCTTGTTGTCATCTCGCAAAGGCTCTCAATGACTTCAGTAACTCAGGTTCTGATGTTCTTTACCAACCACCACAAACCTCTATCACCTCAGCTGTTTTGCAGAGTGGTTTTAGAAAAATGGCATTCCCATCTGGTAAAGTTGAGGGTTGTATGGTACAAGTAACTTGTGGTACAACTACACTTAACGGTCTTTGGCTTGATGACGTAGTTTACTGTCCAAGACATGTGATCTGCACCTCTGAAGACATGCTTAACCCTAATTATGAAGATTTACTCATTCGTAAGTCTAATCATAATTTCTTGGTACAGGCTGGTAATGTTCAACTCAGGGTTATTGGACATTCTATGCAAAATTGTGTACTTAAGCTTAAGGTTGATACAGCCAATCCTAAGACACCTAAGTATAAGTTTGTTCGCATTCAACCAGGACAGACTTTTTCAGTGTTAGCTTGTTACAATGGTTCACCATCTGGTGTTTACCAATGTGCTATGAGGCCCAATTTCACTATTAAGGGTTCATTCCTTAATGGTTCATGTGGTAGTGTTGGTTTTAACATAGATTATGACTGTGTCTCTTTTTGTTACATGCACCATATGGAATTACCAACTGGAGTTCATGCTGGCACAGACTTAGAAGGTAACTTTTATGGACCTTTTGTTGACAGGCAAACAGCACAAGCAGCTGGTACGGACACAACTATTACAGTTAATGTTTTAGCTTGGTTGTACGCTGCTGTTATAAATGGAGACAGGTGGTTTCTCAATCGATTTACCACAACTCTTAATGACTTTAACCTTGTGGCTATGAAGTACAATTATGAACCTCTAACACAAGACCATGTTGACATACTAGGACCTCTTTCTGCTCAAACTGGAATTGCCGTTTTAGATATGTGTGCTTCATTAAAAGAATTACTGCAAAATGGTATGAATGGACGTACCATATTGGGTAGTGCTTTATTAGAAGATGAATTTACACCTTTTGATGTTGTTAGACAATGCTCAGGTGTTACTTTCCAAAGTGCAGTGAAAAGAACAATCAAGGGTACACACCACTGGTTGTTACTCACAATTTTGACTTCACTTTTAGTTTTAGTCCAGAGTACTCAATGGTCTTTGTTCTTTTTTTTGTATGAAAATGCCTTTTTACCTTTTGCTATGGGTATTATTGCTATGTCTGCTTTTGCAATGATGTTTGTCAAACATAAGCATGCATTTCTCTGTTTGTTTTTGTTACCTTCTCTTGCCACTG
TAGCTTATTTTAATATGGTCTATATGCCTGCTAGTTGGGTGATGCGTATTATGACATGGTTGGATATGGTTGATACTAGTTTGTCTGGTTTTAAGCTAAAAGACTGTGTTATGTATGCATCAGCTGTAGTGTTACTAATCCTTATGACAGCAAGAACTGTGTATGATGATGGTGCTAGGAGAGTGTGGACACTTATGAATGTCTTGACACTCGTTTATAAAGTTTATTATGGTAATGCTTTAGATCAAGCCATTTCCATGTGGGCTCTTATAATCTCTGTTACTTCTAACTACTCAGGTGTAGTTACAACTGTCATGTTTTTGGCCAAAGGTATTGTTTTTATGTGTGTTGAGTATTGCCCTATTTTCTTCATAACTGGTAATACACTTCAGTGTATAATGCTAGTTTATTGTTTCTTAGGCTATTTTTGTACTTGTTACTTTGGCCTCTTTTGTTTACTCAACCGCTACTTTAGACTGACTCTTGGTGTTTATGATTACTTAGTTTCTACACAGGAGTTTAGATATATGAATTCACAGGGACTACTCCCACCCAAGAATAGCATAGATGCCTTCAAACTCAACATTAAATTGTTGGGTGTTGGTGGCAAACCTTGTATCAAAGTAGCCACTGTACAGTCTAAAATGTCAGATGTAAAGTGCACATCAGTAGTCTTACTCTCAGTTTTGCAACAACTCAGAGTAGAATCATCATCTAAATTGTGGGCTCAATGTGTCCAGTTACACAATGACATTCTCTTAGCTAAAGATACTACTGAAGCCTTTGAAAAAATGGTTTCACTACTTTCTGTTTTGCTTTCCATGCAGGGTGCTGTAGACATAAACAAGCTTTGTGAAGAAATGCTGGACAACAGGGCAACCTTACAAGCTATAGCCTCAGAGTTTAGTTCCCTTCCATCATATGCAGCTTTTGCTACTGCTCAAGAAGCTTATGAGCAGGCTGTTGCTAATGGTGATTCTGAAGTTGTTCTTAAAAAGTTGAAGAAGTCTTTGAATGTGGCTAAATCTGAATTTGACCGTGATGCAGCCATGCAACGTAAGTTGGAAAAGATGGCTGATCAAGCTATGACCCAAATGTATAAACAGGCTAGATCTGAGGACAAGAGGGCAAAAGTTACTAGTGCTATGCAGACAATGCTTTTCACTATGCTTAGAAAGTTGGATAATGATGCACTCAACAACATTATCAACAATGCAAGAGATGGTTGTGTTCCCTTGAACATAATACCTCTTACAACAGCAGCCAAATTAATGGTTGTCATACCAGACTATAACACATATAAAAATACGTGTGATGGTACAACATTTACTTATGCATCAGCATTGTGGGAAATCCAACAGGTTGTAGATGCAGATAGTAAAATTGTTCAACTTAGTGAAATTAGTATGGACAATTCACCTAATTTAGCATGGCCTCTTATTGTAACAGCTTTAAGGGCCAATTCTGCTGTCAAATTACAGAATAATGAGCTTAGTCCTGTTGCACTACGACAGATGTCTTGTGCTGCCGGTACTACACAAACTGCTTGCACTGATGACAATGCGTTAGCTTACTACAACACAACAAAGGGAGGTAGGTTTGTACTTGCACTGTTATCCGATTTACAGGATTTGAAATGGGCTAGATTCCCTAAGAGTGATGGAACTGGTACTATCTATACAGAACTGGAACCACCTTGTAGGTTTGTTACAGACACACCTAAAGGTCCTAAAGTGAAGTATTTATACTTTATTAAAGGATTAAACAACCTAAATAGAGGTATGGTACTTGGTAGTTTAGCTGCCACAGTACGTCTACAAGCTGGTAATGCAACAGAAGTGCCTGCCAATTCAACTGTATTATCTTTCTGTGCTTTTGCTGTAGATGCTGCTAAAGCTTACAAAGATTATCTAGCTAGTGGGGGACAACCAATCACTAATTGTGTTAAGATGTTGTGTACACACACTGGTACTGGTCAGGCAATAACAGTTACACCGGAAGCCAATATGGATCAAGAATCCTTTGGTGGTGCATCGTGTTGTCTGTACTGCCGTTGCCACATAGATCATCCAAATCCTAAAGGATTTTGTGACTTAAAAGGTAAGTATGTACAAATACCTACAACTTGTGCTAATGACCCTGTGGGTTTTACACTTAAAAACACAGTCTGTACCGTCTGCGGTATGTGGAAAGGTTATGGCTGTAGTTGTGATCAACTCCGCGAACCCATGCTTCAGTCAGCTGATGCACAATCGTTTTTAAACGGGTTTGCGGTGTAAGTGCAGCCCGTCTTACACCGTGCGGCACAGGCACTAGTACTGATGTCGTATACAGGGCTTTTGACATCTACAATGATAAAGTAGCTGGTTTTGCTAAATTCCTAAAAACTAATTGTTGTCGCTTCCAAGAAAAGGACGAAGATGACAATTTAATTGATTCTTACTTTGTAGTTAAGAGACACACTTTCTCTAACTACCAACATGAAGAAACAATTTATAATTTACTTAAAGATTGTCCAGCTGTTGCTAAACATGACTTCTTTAAGTTTAGAATAGACGGTGACATGGTACCACATATATCACGTCAACGTCTTACTAAATACACAATGGCAGACCTCGTCTATGCTTTAAGGCATTTTGATGAAGGTAATTGTGACACATTAAAAGAAATACTTGTCACATACAATTGTTGTGATGATGATTATTTCAATAAAAAGGACTGGTATGATTTTGTAGAAAACCCAGATATATTACGCGTATACGCCAACTTAGGTGAACGTGTACGCCAAGCTTTGTTAAAAACAGTACAATTCTGTGATGCCATGCGAAATGCTGGTATTGTTGGTGTACTGACATTAGATAATCAAGATCTCAATGGTAACTGGTATGATTTCGGTGATTTCATACAAACCACGCCAGGTAGTGGAGTTCCTGTTGTAGATTCTTATTATTCATTGTTAATGCCTATATTAACCTTGACCAGGGCTTTAACTGCAGAGTCACATGTTGACACTGACTTAACAAAGCCTTACATTAAGTGGGATTTGTTAAAATATGACTTCACGGAAGAGAGGTTAAAACTCTTTGACCGTTATTTTAAATATTGGGATCAGACATACCACCCAAATTGTGTTAACTGTTTGGATGACAGATGCATTCTGCATTGTGCAAACTTTAATGTTTTATTCTCTACAGTGTTCCCACTTACAAGTTTTGGACCACTAGTGAGAAAAATATTTGTTGATGGTGTTCCATTTGTAGTTTCAACTGGATACCACTTCAGAGAGCTAGGTGTTGTACATAATCAGGATGTAAACTTACATAGCTCTAGACTTAGTTTTAAGGAATTACTTGTGTATGCTGCTGACCCTGCTATGCACGCTGCTTCTGGTAATCTATTACTAGATAAACGCACTACGTGCTTTTCAGTA
GCTGCACTTACTAACAATGTTGCTTTTCAAACTGTCAAACCCGGTAATTTTAACAAAGACTTCTATGACTTTGCTGTGTCTAAGGGTTTCTTTAAGGAAGGAAGTTCTGTTGAATTAAAACACTTCTTCTTTGCTCAGGATGGTAATGCTGCTATCAGCGATTATGACTACTATCGTTATAATCTACCAACAATGTGTGATATCAGACAACTACTATTTGTAGTTGAAGTTGTTGATAAGTACTTTGATTGTTACGATGGTGGCTGTATTAATGCTAACCAAGTCATCGTCAACAACCTAGACAAATCAGCTGGTTTTCCATTTAATAAATGGGGTAAGGCTAGACTTTATTATGATTCAATGAGTTATGAGGATCAAGATGCACTTTTCGCATATACAAAACGTAATGTCATCCCTACTATAACTCAAATGAATCTTAAGTATGCCATTAGTGCAAAGAATAGAGCTCGCACCGTAGCTGGTGTCTCTATCTGTAGTACTATGACCAATAGACAGTTTCATCAAAAATTATTGAAATCAATAGCCGCCACTAGAGGAGCTACTGTAGTAATTGGAACAAGCAAATTCTATGGTGGTTGGCACAACATGTTAAAAACTGTTTATAGTGATGTAGAAAACCCTCACCTTATGGGTTGGGATTATCCTAAATGTGATAGAGCCATGCCTAACATGCTTAGAATTATGGCCTCACTTGTTCTTGCTCGCAAACATACAACGTGTTGTAGCTTGTCACACCGTTTCTATAGATTAGCTAATGAGTGTGCTCAAGTATTGAGTGAAATGGTCATGTGTGGCGGTTCACTATATGTTAAACCAGGTGGAACCTCATCAGGAGATGCCACAACTGCTTATGCTAATAGTGTTTTTAACATTTGTCAAGCTGTCACGGCCAATGTTAATGCACTTTTATCTACTGATGGTAACAAAATTGCCGATAAGTATGTCCGCAATTTACAACACAGACTTTATGAGTGTCTCTATAGAAATAGAGATGTTGACACAGACTTTGTGAATGAGTTTTACGCATATTTGCGTAAACATTTCTCAATGATGATACTCTCTGACGATGCTGTTGTGTGTTTCAATAGCACTTATGCATCTCAAGGTCTAGTGGCTAGCATAAAGAACTTTAAGTCAGTTCTTTATTATCAAAACAATGTTTTTATGTCTGAAGCAAAATGTTGGACTGAGACTGACCTTACTAAAGGACCTCATGAATTTTGCTCTCAACATACAATGCTAGTTAAACAGGGTGATGATTATGTGTACCTTCCTTACCCAGATCCATCAAGAATCCTAGGGGCCGGCTGTTTTGTAGATGATATCGTAAAAACAGATGGTACACTTATGATTGAACGGTTCGTGTCTTTAGCTATAGATGCTTACCCACTTACTAAACATCCTAATCAGGAGTATGCTGATGTCTTTCATTTGTACTTACAATACATAAGAAAGCTACATGATGAGTTAACAGGACACATGTTAGACATGTATTCTGTTATGCTTACTAATGATAACACTTCAAGGTATTGGGAACCTGAGTTTTATGAGGCTATGTACACACCGCATACAGTCTTACAGGCTGTTGGGGCTTGTGTTCTTTGCAATTCACAGACTTCATTAAGATGTGGTGCTTGCATACGTAGACCATTCTTATGTTGTAAATGCTGTTACGACCATGTCATATCAACATCACATAAATTAGTCTTGTCTGTTAATCCGTATGTTTGCAATGCTCTAGGTTGTGATGTCACAGATGTGACTCAACTTTACTTAGGAGGTATGAGCTATTATTGTAAATCACATAAACCACCCATTAGTTTTCCATTGTGTGCTAATGGACAAGTTTTTGGTTTATATAAAAATACATGTGTTGGTAGCGATAATGTTACTGACTTTAATGCAATTGCAACATGTGACTGGACAAATGCTGGTGATTACATTTTAGCTAACACCTGTACTGAAAGACTCAAGCTTTTTGCAGCAGAAACGCTCAAAGCTACTGAGGAGACATTTAAACTGTCTTATGGTATTGCTACTGTACGTGAAGTGCTGTCTGACAGAGAATTACATCTTTCATGGGAAGTTGGTAAACCTAGACCACCACTTAACCGAAATTATGTCTTTACTGGTTATCGTGTAACTAAAAACAGTAAAGTACAAATAGGAGAGTACACCTTTGAAAAAGGTGACTATGGTGATGCTGTTGTTTACCGAGGTACAACAACTTACAAATTAAATGTTGGTGATTATTTTGTGCTGACATCACATACAGTAATGCCATTAAGTGCACCTACACTAGTGCCACAAGAGCACTATGTTAGAATTACTGGCTTATACCCAACACTCAATATCTCATATGAGTTTTCTAGCAATGTTGCAAATTATCAAAAGGTTGGTATGCAAAAGTATTCTACACTCCAGGGACCACCTGGTACTGGTAAGAGTCATTTTGCTATTGGCCTAGCTCTCTACTACCCTTCTGCTCGCATAGTGTATACAGCTTGCTCTCATGCCGCTGTTGATGCACTATGTGAGAAGGCATTAAAATATTTGCCTATAGATAAATGTAGTAGAATTATACCTGCACGTGCTCGTGTAGAGTGTTTTGATAAATTCAAAGTGAATTCAACATTAGAACAGTATGTCTTTTGTACTGTAAATGCATTGCCTGAGACGACAGCAGATATAGTTGTCTTTGATGAAATTTCAATGGCCACAAATTATGATTTGAGTGTTGTCAATGCCAGATTACGTGCTAAGCACTATGTGTACATTGGCGACCCTGCTCAATTACCTGCACCACGCACATTGCTAACTAAGGGCACACTAGAACCAGAATATTTCAATTCAGTGTGTAGACTTATGAAAACTATAGGTCCAGACATGTTCCTCGGAACTTGTCGGCGTTGTCCTGCTGAAATTGTTGACACTGTGAGTGCTTTGGTTTATGATAATAAGCTTAAAGCACATAAAGACAAATCAGCTCAATGCTTTAAAATGTTTTATAAGGGTGTTATCACGCATGATGTTTCATCTGCAATTAACAGGCCACAAATAGGCGTGGTAAGAGAATTCCTTACACGTAACCCTGCTTGGAGAAAAGCTGTCTTTATTTCACCTTATAATTCACAGAATGCTGTAGCCTCAAAGATTTTGGGACTACCAACTCAAACTGTTGATTCATCACAGGGCTCAGAATATGACTATGTCATATTCACTCAAACCACTGAAACAGCTCACTCTTGTAATGTAAACAGATTTAATGTTGCTATTACCAGAGCAAAAGTAGGCATACTTTGCATAATGTCTGATAGAGACCTTTATGACAAGTTGCAATTTACAAGTCTTGAAATTCCACGTAGGAATGTGGCAACTTTACAAGCTGAAAATGTAACAGGACTCTTTAAAGATTGTAGTAAGGTAATCACTGGGTTACATCCTACACAGGCACCTACACACCTCAGTGTTGACACTAAATTCAAAACTGAAGGTTTATGTGTTGACATACCTGGCATACCTAAGGACATGACCTATAGAAGACTCATCTCTATGATGGGTTTTAAAATGAATTATCAAGTTAATGGTTACCCTAACATGTTTATCACCCGCGAAGAAGCTATAAGACATGTACGTGCATGGATTGGCTTCGATGTCGAGGGGTGTCATGCTACTAGAGAAGCTGTTGGTACCAATTTACCTTTACAGCTAGGTTTTTCTACAGGTGTTAACCTAGTTGCTGTACCTACAGGTTATGTTGATACACCTAATAATACAGATTTTTCCAGAGTTAGTGCTAAACCACCGCCTGGAGATCAATTTAAACACCTCATACCACTTATGTACAAAGGACTTCCTTGGAATGTAGTGCGTATAAAGATTGTACAAATGTTAAGTGACACACTTAAAAATCTCTCTGACAGAGTCGTATTTGTCTTATGGGCACATGGCTTTGAGTTGACATCTATGAAGTATTTTGTGAAAATAGGACCTGAGCGCACCTGTTGTCTATGTGATAGACGTGCCACATGCTTTTCCACTGCTTCAGACACTTATGCCTGTTGGCATCATTCTATTGGATTTGATTACGTCTATAATCCGTTTATGATTGATGTTCAACAATGGGGTTTTACAGGTAACCTACAAAGCAACCATGATCTGTATTGTCAAGTCCATGGTAATGCACATGTAGCTAGTTGTGATGCAATCATGACTAGGTGTCTAGCTGTCCACGAGTGCTTTGTTAAGCGTGTTGACTGGACTATTGAATATCCTATAATTGGTGATGAACTGAAGATTAATGCGGCTTGTAGAAAGGTTCAACACATGGTTGTTAAAGCTGCATTATTAGCAGACAAATTCCCAGTTCTTCACGACATTGGTAACCCTAAAGCTATTAAGTGTGTACCTCAAGCTGATGTAGAATGGAAGTTCTATGATGCACAGCCTTGTAGTGACAAAGCTTATAAAATAGAAGAATTATTCTATTCTTATGCCACACATTCTGACAAATTCACAGATGGTGTATGCCTATTTTGGAATTGCAATGTCGATAGATATCCTGCTAATTCCATTGTTTGTAGATTTGACACTAGAGTGCTATCTAACCTTAACTTGCCTGGTTGTGATGGTGGCAGTTTGTATGTAAATAAACATGCATTCCACACACCAGCTTTTGATAAAAGTGCTTTTGTTAATTTAAAACAATTACCATTTTTCTATTACTCTGACAGTCCATGTGAGTCTCATGGAAAACAAGTAGTGTCAGATATAGATTATGTACCACTAAAGTCTGCTACGTGTATAACACGTTGCAATTTAGGTGGTGCTGTCTGTAGACATCATGCTAATGAGTACAGATTGTATCTCGATGCTTATAACATGATGATCTCAGCTGGCTTTAGCTTGTGGGTTTACAAACAATTTGATACTTATAACCTCTGGAACACTTTTACAAGACTTCAGAGTTTAGAAAATGTGGCTTTTAATGTTGTAAATAAGGGACACTTTGATGGACAACAGGGTGAAGTACCAGTTTCTATCATTAATAACACTGTTTACACAAAAGTTGATGGTGTTGATGTAGAATTGTTTGAAAATAAAACAACATTACCTGTTAATGTAGCATTTGAGCTTTGGGCTAAGCGCAACATTAAACCAGTACCAGAGGTGAAAATACTCAATAATTTGGGTGTGGACATTGCTGCTAATACTGTGATCTGGGACTACAAAAGAGATGCTCCAGCACATATATCTACTATTGGTGTTTGTTCTATGACTGACATAGCCAAGAAACCAACTGAAACGATTTGTGCACCACTCACTGTCTTTTTTGATGGTAGAGTTGATGGTCAAGTAGACTTATTTAGAAATGCCCGTAATGGTGTTCTTATTACAGAAGGTAGTGTTAAAGGTTTACAACCATCTGTAGGTCCCAAACAAGCTAGTCTTAATGGAGTCACATTAATTGGAGAAGCCGTAAAAACACAGTTCAATTATTATAAGAAAGTTGATGGTGTTGTCCAACAATTACCTGAAACTTACTTTACTCAGAGTAGAAATTTACAAGAATTTAAACCCAGGAGTCAAATGGAAATTGATTTCTTAGAATTAGCTATGGATGAATTCATTGAACGGTATAAATTAGAAGGCTATGCCTTCGAACATATCGTTTATGGAGATTTTAGTCATAGTCAGTTAGGTGGTTTACATCTACTGATTGGACTAGCTAAACGTTTTAAGGAATCACCTTTTGAATTAGAAGATTTTATTCCTATGGACAGTACAGTTAAAAACTATTTCATAACAGATGCGCAAACAGGTTCATCTAAGTGTGTGTGTTCTGTTATTGATTTATTACTTGATGATTTTGTTGAAATAATAAAATCCCAAGATTTATCTGTAGTTTCTAAGGTTGTCAAAGTGACTATTGACTATACAGAAATTTCATTTATGCTTTGGTGTAAAGATGGCCATGTAGAAACATTTTACCCAAAATTACAATCTAGTCAAGCGTGGCAACCGGGTGTTGCTATGCCTAATCTTTACAAAATGCAAAGAATGCTATTAGAAAAGTGTGACCTTCAAAATTATGGTGATAGTGCAACATTACCTAAAGGCATAATGATGAATGTCGCAAAATATACTCAACTGTGTCAATATTTAAACACATTAACATTAGCTGTACCCTATAATATGAGAGTTATACATTTTGGTGCTGGTTCTGATAAAGGAGTTGCACCAGGTACAGCTGTTTTAAGACAGTGGTTGCCTACGGGTACGCTGCTTGTCGATTCAGATCTTAATGACTTTGTCTCTGATGCAGATTCAACTTTGATTGGTGATTGTGCAACTGTACATACAGCTAATAAATGGGATCTCATTATTAGTGATATGTACGACCCTAAGACTAAAAATGTTACAAAAGAAAATGACTCTAAAGAGGGTTTTTTCACTTACATTTGTGGGTTTATACAACAAAAGCTAGCTCTTGGAGGTTCCGTGGCTATAAAGATAACAGAACATTCTTGGAATGCTGATCTTTATAAGCTCATGGGACACTTCGCATGGTGGACAGCCTTTGTTACTAATGTGAATGCGTCATCATCTGAAGCATTTTTAATTGGATGTAATTATCTTGGCAAACCACGCGAACAAATAGATGGTTATGTCATGCATGCAAATTACATATTTTGGAGGAATACAAATCCAATTCAGTTGTCTTCCTATTCTTTATTTGACATGAGTAAATTTCCCCTTAAATTAAGGGGTACTGCTGTTATGTCTTTAAAAGAAGGTCAAATCAATGATATGATTTTATCTCTTCTTAGTAAAGGTAGACTTATAATTAGAGAAAACAACAGAGTTGTTATTTCTAGTGATGTTCTTGTTAACAACTAAACGAACAATGTTTGTTTTTCTTGTTTTATTGCCACTAGTCTCTATTCAGTGTGTTAATCTTACAACCAGAACTCAATTACCCCCTGCATACACTAATTCTTTCACACGTGGTGTTTATTACCCTGACAAAGTTTTCAGATCCTCAGTTTTACATTCAACTCAGGACTTGTTCTTACCTTTCTTTTCCAATGTTACTTGGTTCCATGCTATACATGTCTCTGGGACCAATGGTACTAAGAGGTTTGATAACCCTGTCCTACCATTTAATGATGGTGTTTATTTTGCTTCCACTGAGAAGTCTAACATAATAAGAGGCTGGATTTTTGGTACTACTTTAGATTCGAAGACCCAGTCCCTACTTATTGTTAATAACGCTACTAATGTTGTTATTAAAGTCTGTGAATTTCAATTTTGTAATGATCCATTTTTGGGTGTTTATTACCACAAAAACAACAAAAGTTGTATGGAAAGTGAGTTCAGAGTTTATTCTAGTGCGAATAATTGCACTTTTGAATATGTCTCTCAGCCTTTTCTTATGGACCTTGAAGGAAAACAGGGTAATTTCAAAAATCTTAGGGAATTTGTGTTTAAGAATATTGATGGTTATTTTAAAATATATTCTAAGCACACGCCTATTAATTTAGTGCGTGATCTCCCTCAGGGTTTTTCGGCTTTAGAACCATTGGTAGATTTGCCAATAGGTATTAACATCACTAGGTTTCAAACTTTACTTGCTTTACATAGAAGTTATTTGACTCCTGGTGATTCTTCTTCAGGTTGGACAGCTGGTGCTGCAGCTTATTATGTGGGTTATCTTCAACCTAGGACTTTTCTATTAAAATATAATGAAAATGGAACCATTACAGATGCTGTAGACTGTGCACTTGACCCTCTCTCAGAAACAAAGTGTACGTTGAAATCCTTCACTGTAGAAAAAGGAATCTATCAAACTTCTAACTTTAGAGTCCAACCAACAGAATCTATTGTTAGATTTCCTAATATTACAAACTTGTGCCCTTTTGGTGAAGTTTTTAACGCCACCAGATTTGCATCTGTTTATGCTTGGAACAGGAAGAGAATCAGCAACTGTGTTGCTGATTATTCTGTCCTATATAATTCCGCATCATTTTCCACTTTTAAGTGTTATGGAGTGTCTCCTACTAAATTAAATGATCTCTGCTTTACTAATGTCTATGCAGATTCATTTGTAATTAGAGGTGATGAAGTCAGACAAATCGCTCCAGGGCAAACTGGAAAGATTGCTGATTA
TAATTATAAATTACCAGATGATTTTACAGGCTGCGTTATAGCTTGGAATTCTAACAATCTTGATTCTAAGGTTGGTGGTAATTATAATTACCGGTATAGATTGTTTAGGAAGTCTAATCTCAAACCTTTTGAGAGAGATATTTCAACTGAAATCTATCAGGCCGGTAGCACACCTTGTAATGGTGTTGAAGGTTTTAATTGTTACTTTCCTTTACAATCATATGGTTTCCAACCCACTAATGGTGTTGGTTACCAACCATACAGAGTAGTAGTACTTTCTTTTGAACTTCTACATGCACCAGCAACTGTTTGTGGACCTAAAAAGTCTACTAATTTGGTTAAAAACAAATGTGTCAATTTCAACTTCAATGGTTTAACAGGCACAGGTGTTCTTACTGAGTCTAACAAAAAGTTTCTGCCTTTCCAACAATTTGGCAGAGACATTGCTGACACTACTGATGCTGTCCGTGATCCACAGACACTTGAGATTCTTGACATTACACCATGTTCTTTTGGTGGTGTCAGTGTTATAACACCAGGAACAAATACTTCTAACCAGGTTGCTGTTCTTTATCAGGGTGTTAACTGCACAGAAGTCCCTGTTGCTATTCATGCAGATCAACTTACTCCTACTTGGCGTGTTTATTCTACAGGTTCTAATGTTTTTCAAACACGTGCAGGCTGTTTAATAGGGGCTGAACATGTCAACAACTCATATGAGTGTGACATACCCATTGGTGCAGGTATATGCGCTAGTTATCAGACTCAGACTAATTCTCCTCGGCGGGCACGTAGTGTAGCTAGTCAATCCATCATTGCCTACACTATGTCACTTGGTGCAGAAAATTCAGTTGCTTACTCTAATAACTCTATTGCCATACCCACAAATTTTACTATTAGTGTTACCACAGAAATTCTACCAGTGTCTATGACCAAGACATCAGTAGATTGTACAATGTACATTTGTGGTGATTCAACTGAATGCAGCAATCTTTTGTTGCAATATGGCAGTTTTTGTACACAATTAAACCGTGCTTTAACTGGAATAGCTGTTGAACAAGACAAAAACACCCAAGAAGTTTTTGCACAAGTCAAACAAATTTACAAAACACCACCAATTAAAGATTTTGGTGGTTTTAATTTTTCACAAATATTACCAGATCCATCAAAACCAAGCAAGAGGTCATTTATTGAAGATCTACTTTTCAACAAAGTGACACTTGCAGATGCTGGCTTCATCAAACAATATGGTGATTGCCTTGGTGATATTGCTGCTAGAGACCTCATTTGTGCACAAAAGTTTAACGGCCTTACTGTTTTGCCACCTTTGCTCACAGATGAAATGATTGCTCAATACACTTCTGCACTGTTAGCGGGTACAATCACTTCTGGTTGGACCTTTGGTGCAGGTGCTGCATTACAAATACCATTTGCTATGCAAATGGCTTATAGGTTTAATGGTATTGGAGTTACACAGAATGTTCTCTATGAGAACCAAAAATTGATTGCCAACCAATTTAATAGTGCTATTGGCAAAATTCAAGACTCACTTTCTTCCACAGCAAGTGCACTTGGAAAACTTCAAGATGTGGTCAACCAAAATGCACAAGCTTTAAACACGCTTGTTAAACAACTTAGCTCCAATTTTGGTGCAATTTCAAGTGTTTTAAATGATATCCTTTCACGTCTTGACAAAGTTGAGGCTGAAGTGCAAATTGATAGGTTGATCACAGGCAGACTTCAAAGTTTGCAGACATATGTGACTCAACAATTAATTAGAGCTGCAGAAATCAGAGCTTCTGCTAATCTTGCTGCTACTAAAATGTCAGAGTGTGTACTTGGACAATCAAAAAGAGTTGATTTTTGTGGAAAGGGCTATCATCTTATGTCCTTCCCTCAGTCAGCACCTCATGGTGTAGTCTTCTTGCATGTGACTTATGTCCCTGCACAAGAAAAGAACTTCACAACTGCTCCTGCCATTTGTCATGATGGAAAAGCACACTTTCCTCGTGAAGGTGTCTTTGTTTCAAATGGCACACACTGGTTTGTAACACAAAGGAATTTTTATGAACCACAAATCATTACTACAGACAACACATTTGTGTCTGGTAACTGTGATGTTGTAATAGGAATTGTCAACAACACAGTTTATGATCCTTTGCAACCTGAATTAGACTCATTCAAGGAGGAGTTAGATAAATATTTTAAGAATCATACATCACCAGATGTTGATTTAGGTGACATCTCTGGCATTAATGCTTCAGTTGTAAACATTCAAAAAGAAATTGACCGCCTCAATGAGGTTGCCAAGAATTTAAATGAATCTCTCATCGATCTCCAAGAACTTGGAAAGTATGAGCAGTATATAAAATGGCCATGGTACATTTGGCTAGGTTTTATAGCTGGCTTGATTGCCATAGTAATGGTGACAATTATGCTTTGCTGTATGACCAGTTGCTGTAGTTGTCTCAAGGGCTGTTGTTCTTGTGGATCCTGCTGCAAATTTGATGAAGACGACTCTGAGCCAGTGCTCAAAGGAGTCAAATTACATTACACATAAACGAACTTATGGATTTGTTTATGAGAATCTTCACAATTGGAACTGTAACTTTGAAGCAAGGTGAAATCAAGGATGCTACTCCTTCAGATTTTGTTCGCGCTACTGCAACGATACCGATACAAGCCTCACTCCCTTTCGGATGGCTTATTGTTGGCGTTGCACTTCTTGCTGTTTTTCATAGCGCTTCCAAAATCATAACCCTCAAAAAGAGATGGCAACTAGCACTCTCCAAGGGTGTTCACTTTGTTTGCAACTTGCTGTTGTTGTTTGTAACAGTTTACTCACACCTTTTGCTCGTTGCTGCTGGCCTTGAAGCCCCTTTTCTCTATCTTTATGCTTTAGTCTACTTCTTGCAGAGTATAAACTTTGTAAGAATAATAATGAGGCTTTGGCTTTGCTGGAAATGCCGTTCCAAAAACCCATTACTTTATGATGCCAACTATTTTCTTTGCTGGCATACTAATTGTTACGACTATTGTATACCTTACAATAGTGTAACTTCTTCAATTGTCATTACTTCAGGTGATGGCACAACAAGTCCTATTTCTGAACATGACTACCAGATTGGTGGTTATACTGAAAAATGGGAATCTGGAGTAAAAGACTGTGTTGTATTACACAGTTACTTCACTTCAGACTATTACCAGCTGTACTCAACTCAATTGAGTACAGACACTGGTGTTGAACATGTTACCTTCTTCATCTACAATAAAATTGTTGATGAGCCTGAAGAACATGTCCAAATTCACACAATCGACGGTTCATCCGGAGTTGTTAATCCAGTAATGGAACCAATTTAT
GATGAACCGACGACGACTACTAGCGTGCCTTTGTAAGCACAAGCTGATGAGTACGAACTTATGTACTCATTCGTTTCGGAAGAGACAGGTACGTTAATAGTTAATAGCGTACTTCTTTTTCTTGCTTTCGTGGTATTCTTGCTAGTTACACTAGCCATCCTTACTGCGCTTCGATTGTGTGCGTACTGCTGCAATATTGTTAACGTGAGTCTTGTAAAACCTTCTTTTTACGTTTACTCTCGTGTTAAAAATCTGAATTCTTCTAGAGTTCCTGATCTTCTGGTCTAAACGAACTAAATATTATATTAGTTTTTCTGTTTGGAACTTTAATTTTAGCCATGGTAGATTCCAACGGTACTATTACCGTTGAAGAGCTTAAAAAGCTCCTTGAACAATGGAACCTAGTAATAGGTTTCCTATTCCTTACATGGATTTGTCTTCTACAATTTGCCTATGCCAACAGGAATAGGTTTTTGTATATAATTAAGTTAATTTTTCTCTGGCTGTTATGGCCAGTAACTTTAGCTTGTTTTGTGCTTGCTGCTGTTTACAGAATAAATTGGATCACCGGTGGAATTGCTATCGCAATGGCTTGTCTTGTAGGCTTGATGTGGCTCAGCTACTTCATTGCTTCTTTCAGACTGTTTGCGCGTACGCGTTCCATGTGGTCATTCAATCCAGAAACTAACATTCTTCTCAACGTGCCACTCCATGGCACTATTCTGACCAGACCGCTTCTAGAAAGTGAACTCGTAATCGGAGCTGTGATCCTTCGTGGACATCTTCGTATTGCTGGACACCATCTAGGACGCTGTGACATCAAGGACCTGCCTAAAGAAATCACTGTTGCTACATCACGAACGCTTTCTTATTACAAATTGGGAGCTTCGCAGCGTGTAGCAGGTGACTCAGGTTTTGCTGCATACAGTCGCTACAGGATTGGCAACTATAAATTAAACACAGACCATTCCAGTAGCAGTGACAATATTGCTTTGCTTGTACAGTAAGTGACAACAGATGTTTCATCTCGTTGACTTTCAGGTTACTATAGCAGAGATATTACTAATTATTATGAGGACTTTTAAAGTTTCCATTTGGAATCTTGATTACATCATAAACCTCATAATTAAAAATTTATCTAAGTCACTAACTGAGAATAAATATTCTCAATTAGATGAAGAGCAACCAATGGAGATTGATTAAACGAACATGAAAATTATTCTTTTCTTGGCACTGATAACACTCGCTACTTGTGAGCTTTATCACTACCAAGAGTGTGTTAGAGGTACAACAGTACTTTTAAAAGAACCTTGCTCTTCTGGAACATACGAGGGCAATTCACCATTTCATCCTCTAGCTGATAACAAATTTGCACTGACTTGCTTTAGCACTCAATTTGCTTTTGCTTGTCCTGACGGCGTAAAACACGTCTATCAGTTACGTGCCAGATCAGTTTCACCTAAACTGTTCATCAGACAAGAGGAAGTTCAAGAACTTTACTCTCCAATTTTTCTTATTGTTGCGGCAATAGTGTTTATAACACTTTGCTTCACACTCAAAAGAAAGACAGAATGATTGAACTTTCATTAATTGACTTCTATTTGTGCTTTTTAGCCTTTCTGCTATTCCTTGTTTTAATTATGCTTATTATCTTTTGGTTCTCACTTGAACTGCAAGATCATAATGAAACTTGTCACGCCTAAACGAACATGAAATTTCTTGTTTTCTTAGGAATCATCACAACTGTAGCTGCATTTCACCAAGAATGTAGTTTACAGTCATGTACTCAACATCAACCATATGTAGTTGATGACCCGTGTCCTATTCACTTCTATTCTAAATGGTATATTAGAGTAGGAGCTAGAAAATCAGCACCTTTAATTGAATTGTGCGTGGATGAGGCTGGTTCTAAATCACCCATTCAGTACATCGATATCGGTAATTATACAGTTTCCTGTTTACCTTTTACAATTAATTGCCAGGAACCTAAATTGGGTAGTCTTGTAGTGCGTTGTTCGTTCTATGAAGACTTTTTAGAGTATCATGACGTTCGTGTTGTTTTAGATTTCATCTAAACGAACAAACTATAATGTCTGATAATGGACCCCAAAATCAGCGAAATGCACCCCGCATTACGTTTGGTGGACCCTCAGATTCAACTGGCAGTAACCAGAATGGAGAACGCAGTGGGGCGCGATCAAAACAACGTCGGCCCCAAGGTTTACCCAATAATACTGCGTCTTGGTTCACCGCTCTCACTCAACATGGCAAGGAAGACCTTAAATTCCCTCGAGGACAAGGCGTTCCAATTAACACCAATAGCAGTCCAGATGACCAAATTGGCTACTACCGAAGAGCTACCAGACGAATTCGTGGTGGT
GACGGTAAAATGAAAGATCTCAGTCCAAGATGGTATTTCTACTACCTAGGAACTGGGCCAGAAGCTGGACTTCCCTATGGTGCTAACAAAGACGGCATCATATGGGTTGCAACTGAGGGAGCCTTGAATACACCAAAAGATCACATTGGCACCCGCAATCCTGCTAACAATGCTGCAATCGTGCTACAACTTCCTCAAGGAACAACATTGCCAAAAGGCTTCTACGCAGAAGGGAGCAGAGGCGGCAGTCAAGCCTCTTCTCGTTCCTCATCACGTAGTCGCAACAGTTCAAGAAATTCAACTCCAGGCAGCAGTAGGGGAATTTCTCCTGCTAGAATGGCTGGCAATGGCGGTGATGCTGCTCTTGCTTTGCTGCTGCTTGACAGATTGAACCAGCTTGAGAGCAAAATGTCTGGTAAAGGCCAACAACAACAAGGCCAAACTGTCACTAAGAAATCTGCTGCTGAGGCTTCTAAGAAGCCTCGGCAAAAACGTACTGCCACTAAAGCATACAATGTAACACAAGCTTTCGGCAGACGTGGTCCAGAACAAACCCAAGG
AAATTTTGGGGACCAGGAACTAATCAGACAAGGAACTGATTACAAACATTGGCCGCAAATTGCACAATTTGCCCCCAGCGCTTCAGCGTTCTTCGGAATGTCGCGCATTGGCATGGAAGTCACACCTTCGGGAACGTGGTTGACCTACACAGGTGCCATCAAATTGGATGACAAAGATCCAAATTTCAAAGATCAAGTCATTTTGCTGAATAAGCATATTGACGCATACAAAACATTTCCACCAACAGAGCCTAAAAAGGACAAAAAGAAGAAGGCTGATGAAACTCAAGCCTTACCGCAGAGACAGAAGAAACAGCAAACTGTGACTCTTCTTCCTGCTGCAGATTTGGATGATTTCTCCAAACAATTGCAACAATCCATGAGCAGTGCTGACTCAACTCAGGCCTAAACTCATGCAGACCACACAAGGCAGATGGGCTATATAAACGTTTTCGCTTTTCCGTTTACGATATATAGTCTACTCTTGTGCAGAATGAATTCTCGTAACTACATAGCACAAGTAGATGTAGTTAACTTTAATCTCACATAGCAATCTTTAATCAGTGTGTAACATTAGGGAGGACTTGAAAGAGCCACCACATTTTCACCGAGGCCACGCGGAGTACGATCGAGTGTACAGTGAACAATGCTAGGGAGAGCTGCCTATATGGAAGAGCCCTAATGTGTAAAATTAATTTTAGTAGTGCTATCC
서열번호: 25
>QQV21856.1: S 표면 단백질
MFVFLVLLPLVSIQCVNLTTRTQLPPAYTNSFTRGVYYPDKVFRSSVLHSTQDLFLPFFSNVTWFHAIHVSGTNGTKRFDNPVLPFNDGVYFASTEKSNIIRGWIFGTTLDSKTQSLLIVNNATNVVIKVCEFQFCNDPFLGVYYHKNNKSCMESEFRVYSSANNCTFEYVSQPFLMDLEGKQGNFKNLREFVFKNIDGYFKIYSKHTPINLVRDLPQGFSALEPLVDLPIGINITRFQTLLALHRSYLTPGDSSSGWTAGAAAYYVGYLQPRTFLLKYNENGTITDAVDCALDPLSETKCTLKSFTVEKGIYQTSNFRVQPTESIVRFPNITNLCPFGEVFNATRFASVYAWNRKRISNCVADYSVLYNSASFSTFKCYGVSPTKLNDLCFTNVYADSFVIRGDEVRQIAPGQTGKIADYNYKLPDDFTGCVIAWNSNNLDSKVGGNYNYRYRLFRKSNLKPFERDISTEIYQAGSTPCNGVEGFNCYFPLQSYGFQPTNGVGYQPYRVVVLSFELLHAPATVCGPKKSTNLVKNKCVNFNFNGLTGTGVLTESNKKFLPFQQFGRDIADTTDAVRDPQTLEILDITPCSFGGVSVITPGTNTSNQVAVLYQGVNCTEVPVAIHADQLTPTWRVYSTGSNVFQTRAGCLIGAEHVNNSYECDIPIGAGICASYQTQTNSPRRARSVASQSIIAYTMSLGAENSVAYSNNSIAIPTNFTISVTTEILPVSMTKTSVDCTMYICGDSTECSNLLLQYGSFCTQLNRALTGIAVEQDKNTQEVFAQVKQIYKTPPIKDFGGFNFSQILPDPSKPSKRSFIEDLLFNKVTLADAGFIKQYGDCLGDIAARDLICAQKFNGLTVLPPLLTDEMIAQYTSALLAGTITSGWTFGAGAALQIPFAMQMAYRFNGIGVTQNVLYENQKLIANQFNSAIGKIQDSLSSTASALGKLQDVVNQNAQALNTLVKQLSSNFGAISSVLNDILSRLDKVEAEVQIDRLITGRLQSLQTYVTQQLIRAAEIRASANLAATKMSECVLGQSKRVDFCGKGYHLMSFPQSAPHGVVFLHVTYVPAQEKNFTTAPAICHDGKAHFPREGVFVSNGTHWFVTQRNFYEPQIITTDNTFVSGNCDVVIGIVNNTVYDPLQPELDSFKEELDKYFKNHTSPDVDLGDISGINASVVNIQKEIDRLNEVAKNLNESLIDLQELGKYEQYIKWPWYIWLGFIAGLIAIVMVTIMLCCMTSCCSCLKGCCSCGSCCKFDEDDSEPVLKGVKLHYT
서열번호: 26
>MW306426.1 중증 급성 호흡기 증후군 코로나바이러스 2 단리물 SARS-CoV-2/인간/USA/CA-CZB-12872/2020, 완전 게놈. [중증 급성 호흡기 증후군 코로나바이러스 2 (SARS-CoV-2)]. 캘리포니아 B.1.429 계통
ACTTTCGATCTCTTGTAGATCTGTTCTCTAAACGAACTTTAAAATCTGTGTGGCTGTCACTCGGCTGCATGCTTAGTGCACTCACGCAGTATAATTAATAACTAATTACTGTCGTTGACAGGACACGAGTAACTCGTCTATCTTCTGCAGGCTGCTTACGGTTTCGTCCGTGTTGCAGCCGATCATCAGCACATCTAGGTTTTGTCCGGGTGTGACCGAAAGGTAAGATGGAGAGCCTTGTCCCTGGTTTCAACGAGAAAACACACGTCCAACTCAGTTTGCCTGTTTTACAGGTTCGCGACGTGCTCGTACGTGGCTTTGGAGACTCCGTGGAGGAGGTCTTATCAGAGGCACGTCAACATCTTAAAGATGGCACTTGTGGCTTAGTAGAAGTTGAAAAAGGCGTTTTGCCTCAACTTGAACAGCCCTATGTGTTCATCAAACGTTCGGATGCTCGAACTGCACCTCATGGTCATGTTATGGTTGAGCTGGTAGCAGAACTCGAAGGCATTCAGTACGGTCGTAGTGGTGAGACACTTGGTGTCCTTGTCCCTCATGTGGGCGAAATACCAGTGGCTTACCGCAAGGTTCTTCTTCGTAAGAACGGTAATAAAGGAGCTGGTGGCCATAGTTACGGCGCCGATCTAAAGTCATTTGACTTAGGCGACGAGCTTGGCACTGATCCTTATGAAGATTTTCAAGAAAACTGGAACACTAAACATAGCAGTGGTGTTACCCGTGAACTCATGCGTGAGCTTAACGGAGGGGCATACACTCGCTATGTCGATAACAACTTCTGTGGCCCTGATGGCTACCCTCTTGAGTGCATTAAAGACCTTCTAGCACGTGCTGGTAAAGCTTCATGCACTTTGTCCGAACAACTGGACTTTATTGACACTAAGAGGGGTGTATACTGCTGCCGTGAACATGAGCATGAAATTGCTTGGTACACGGAACGTTCTGAAAAGAGCTATGAATTGCAGACACCTTTTGAAATTAAATTGGCAAAGAAATTTGACATCTTCAATGGGGAATGTCCAAATTTTGTATTTCCCTTAAATTCCATAATCAAGACTATTCAACCAAGGGTTGAAAAGAAAAAGCTTGATGGCTTTATGGGTAGAATTCGATCTGTCTATCCAGTTGCGTCACCAAATGAATGCAACCAAATGTGCCTTTCAACTCTCATGAAGTGTGATCATTGTGGTGAAACTTCATGGCAGACGGGCGATTTTGTTAAAGCCACTTGCGAATTTTGTGGCACTGAGAATTTGACTAAAGAAGGTGCCACTACTTGTGGTTACTTACCCCAAAATGCTGTTGTTAAAATTTATTGTCCAGCATGTCACAATTCAGAAGTAGGACCTGAGCATAGTCTTGCCGAATACCATAATGAATCTGGCTTGAAAACCATTCTTCGTAAGGGTGGTCGCACTATTGCCTTTGGAGGCTGTGTGTTCTCTTATGTTGGTTGCCATAACAAGTGTGCCTATTGGGTTCCACGTGCTAGCGCTAACATAGGTTGTAACCATACAGGTGTTGTTGGAGAAGGTTCCGAAGGTCTTAATGACAACCTTCTTGAAATACTCCAAAAAGAGAAAGTCAACATCAATATTGTTGGTGACTTTAAACTTAATGAAGAGATCGCCATTATTTTGGCATCTTTTTCTGCTTCCACAAGTGCTTTTGTGGAAACTGTGAAAGGTTTGGATTATAAAGCATTCAAACAAATTGTTGAATCCTGTGGTAATTTTAAAGTTACAAAAGGAAAAGCTAAAAAAGGTGCCTGGAATATTGGTGAACAGAAATCAATACTGAGTCCTCTTTATGCATTTGCATCAGAGGCTGCTCGTGTTGTACGATCAATTTTCTCCCGCACTCTTGAAACTGCTCAAAATTCTGTGCGTGTTTTACAGAAGGCCGCTATAACAATACTAGATGGAATTTCACAGTATTCACTGAGACTCATTGATGCTATGATGTTCACATCTGATTTGGCTACTAACAATCTAGTTGTAATGGCCTACATTACAGGTGGTGTTGTTCAGTTGACTTCGCAGTGGCTAACTAACATCTTTGGCACTGTTTATGAAAAACTCAAACCCGTCCTTGATTGGCTTGAAGAGAAGTTTAAGGAAGGTGTAGAGTTTCTTAGAGACGGTTGGGAAATTGTTAAATTTATCTCAACCTGTGCTTGTGAAATTGTCGGTGGACAAATTGTCACCTGTGCAAAGGAAATTAAGGAGAGTGTTCAGACATTCTTTAAGCTTGTAAATAAATTTTTGGCTTTGTGTGCTGACTCTATCATTATTGGTGGAGCTAAACTTAAAGCCTTGAATTTAGGTGAAACATTTGTTACGCACTCAAAGGGATTGTACAGAAAGTGTGTTAAATCCAGAGAAGAAACTGGCCTACTCATGCCTCTAAAAGCCCCAAAAGAAATTATCTTCTTAGAGGGAGAAACACTTCCCACAGAAGTGTTAACAGAGGAAGTTGTCTTGAAAACTGGTGATTTACAACCATTAGAACAACCTACTAGTGAAGCTGTTGAAGCTCCACTGGTTGGTACACCAGTTTGTATTAACGGGCTTATGTTGCTCGAAATCAAAGACACAGAAAAGTACTGTGCCCTTGCACCTAATATGATGGTAACAAACAATACCTTCACACTCAAAGGCGGTGCACCAACAAAGGTTACTTTTGGTGATGACACTGTGATAGAAGTGCAAGGTTACAAGAGTGTGAATATCACTTTTGAACTTGATGAAAGGATTGATAAAGTACTTAATGAGAAGTGCTCTGCCTATACAGTTGAACTCGGTACAGAAGTAAATGAGTTCGCCTGTGTTGTGGCAGATGCTGTCATAAAAACTTTGCAACCAGTATCTGAATTACTTACACCACTGGGCATTGATTTAGATGAGTGGAGTATGGCTACATACTACTTATTTGATGAGTCTGGTGAGTTTAAATTGGCTTCACATATGTATTGTTCTTTTTACCCTCCAGA
TGAGGATGAAGAAGAAGGTGATTGTGAAGAAGAAGAGTTTGAGCCATCAACTCAATATGAGTATGGTACTGAAGATGATTACCAAGGTAAACCTTTGGAATTTGGTGCCACTTCTGCTGCTCTTCAACCTGAAGAAGAGCAAGAAGAAGATTGGTTAGATGATGATAGTCAACAAACTGTTGGTCAACAAGACGGCAGTGAGGACAATCAGACAACTACTATTCAAACAATTGTTGAGGTTCAACCTCAATTAGAGATGGAACTTACACCAGTTGTTCAGACTATTGAAGTGAATAGTTTTAGTGGTTATTTAAAACTTACTGACAATGTATACATTAAAAATGCAGACATTGTGGAAGAAGCTAAAAAGGTAAAACCAACAGTGGTTGTTAATGCAGCCAATGTTTACCTTAAACATGGAGGAGGTGTTGCAGGAGCCTTAAATAAGGCTACTAACAATGCCATGCAAGTTGAATCTGATGATTACATAGCTACTAATGGACCACTTAAAGTGGGTGGTAGTTGTGTTTTAAGCGGACACAATCTTGCTAAACACTGTCTTCATGTTGTCGGCCCAAATGTTAACAAAGGTGAAGACATTCAACTTCTTAAGAGTGCTTATGAAAATTTTAATCAGCACGAAGTTCTACTTGCACCATTATTATCAGCTGGTATTTTTGGTGCTGACCCTATACATTCTTTAAGAGTTTGTGTAGATACTGTTCGCACAAATGTCTACTTAGCTGTCTTTGATAAAAATCTCTATGACAAACTTGTTTCAAGCTTTTTGGAAATGAAGAGTGAAAAGCAAGTTGAACAAAAGATCGCTGAGATTCCTAAAGAGGAAGTTAAGCCATTTATAACTGAAAGTAAACCTTCAGTTGAACAGAGAAAACAAGATGATAAGAAAATCAAAGCTTGTGTTGAAGAAGTTACAACAACTCTGGAAGAAACTAAGTTCCTCACAGAAAACTTGTTACTTTATATTGACATTAATGGCAATCTTCATCCAGATTCTGCCACTCTTGTTAGTGACATTGACATCACTTTCTTAAAGAAAGATGCTCCATATATAGTGGGTGATGTTGTTCAAGAGGGTGTTTTAACTGCTGTGGTTATACCTACTAAAAAGGCTGGTGGCACTACTGAAATGCTAGCGAAAGCTTTGAGAAAAGTGCCAACAGACAATTATATAACCACTTACCCGGGTCAGGGTTTAAATGGTTACACTGTAGAGGAGGCAAAGACAGTGCTTAAAAAGTGTAAAAGTGCCTTTTACATTCTACCATCTATTATCTCTAATGAGAAGCAAGAAATTCTTGGAACTGTTTCTTGGAATTTGCGAGAAATGCTTGCACATGCAGAAGAAACACGCAAATTAATGCCTGTCTGTGTGGAAACTAAAGCCATAGTTTCAACTATACAGCGTAAATATAAGGGTATTAAAATACAAGAGGGTGTGGTTGATTATGGTGCTAGATTTTACTTTTACACCAGTAAAACAACTGTAGCGTCACTTATCAACACACTTAACGATCTAAATGAAACTCTTGTTACAATGCCACTTGGCTATGTAACACATGGCTTAAATTTGGAAGAAGCTGCTCGGTATATGAGATCTCTCAAAGTGCCAGCTACAGTTTCTGTTTCTTCACCTGATGCTGTTACAGCGTATAATGGTTATCTTACTTCTTCTTCTAAAACACCTGAAGAACATTTTATTGAAACCATCTCACTTGCTGGTTCCTATAAAGATTGGTCCTATTCTGGACAATCTACACAACTAGGTATAGAATTTCTTAAGAGAGGTGATAAAAGTGTATATTACACTAGTAATCCTACCACATTCCACCTAGATGGTGAAGTTATCACCTTTGACAATCTTAAGACACTTCTTTCTTTGAGAGAAGTGAGGACTATTAAGGTGTTTACAACAGTAGACAACATTAACCTCCACACGCAAGTTGTGGACATGTCAATGACATATGGACAACAGTTTGGTCCAACTTATTTGGATGGAGCTGATGTTACTAAAATAAAACCTCATAATTCACATGAAGGTAAAACATTTTATGTTTTACCTAATGATGACACTCTACGTGTTGAGGCTTTTGAGTACTACCACACAACTGATCCTAGTTTTCTGGGTAGGTACATGTCAGCATTAAATCACACTAAAAAGTGGAAATACCCACAAGTTAATGGTTTAACTTCTATTAAATGGGCAGATAACAACTGTTATCTTGCCACTGCATTGTTAACACTCCAACAAATAGAGTTGAAGTTTAATCCACCTGCTCTACAAGATGCTTATTACAGAGCAAGGGCTGGTGAAGCTGCTAACTTTTGTGCACTTATCTTAGCCTACTGTAATAAGACAGTAGGTGAGTTAGGTGATGTTAGAGAAACAATGAGTTACTTGTTTCAACATGCCAATTTAGATTCTTGCAAAAGAGTCTTGAACGTGGTGTGTAAAACTTGTGGACAACAGCAGACAACCCTTAAGGGTGTAGAAGCTGTTATGTACATGGGCACACTTTCTTATGAACAATTTAAGAAAGGTGTTCAGATACCTTGTACGTGTGGTAAACAAGCTACAAAATATCTAGTACAACAGGAGTCACCTTTTGTTATGATGTCAGCACCACCTGCTCAGTATGAACTTAAGCATGGTACATTTACTTGTGCTAGTGAGTACACTGGTAATTACCAGTGTGGTCACTATAAACATATAACTTCTAAAGAAACTTTGTATTGCATAGACGGTGCTTTACTTACAAAGTCCTCAGAATACAAAGGTCCTATTACGGATGTTTTCTACAAAGAAAACAGTTACACAACAACCATAAAACCAGTTACTTATAAATTGGATGGTGTTGTTTGTACAGAAATTGACCCTAAGTTGGACAATTATTATAAGAAAGACAATTCTTATTTCACAGAGCAACCAATTGATCTTGTACCAAACCAACCATATCCAAACGCAAGCTTCGATAATTTTAAGTTTGTA
TGTGATAATATCAAATTTGCTGATGATTTAAACCAGTTAACTGGTTATAAGAAACCTGCTTCAAGAGAGCTTAAAGTTACATTTTTCCCTGACTTAAATGGTGATGTGGTGGCTATTGATTATAAACACTACACACCCTCTTTTAAGAAAGGAGCTAAATTGTTACATAAACCTATTGTTTGGCATGTTAACAATGCAACTAATAAAGCCACGTATAAACCAAATACCTGGTGTATACGTTGTCTTTGGAGCACAAAACCAGTTGAAACATCAAATTCGTTTGATGTACTGAAGTCAGAGGACGCGCAGGGAATGGATAATCTTGCCTGCGAAGATCTAAAACCAGTCTCTGAAGAAGTAGTGGAAAATCCTACCATACAGAAAGACGTTCTTGAGTGTAATGTGAAAACTACCGAAGTTGTAGGAGACATTATACTTAAACCAGCAAATAATAGTTTAAAAATTACAGAAGAGGTTGGCCACACAGATCTAATGGCTGCTTATGTAGACAATTCTAGTCTTACTATTAAGAAACCTAATGAATTATCTAGAGTATTAGGTTTGAAAACCCTTGCTACTCATGGTTTAGCTGCTGTTAATAGTGTCCCTTGGGATACTATAGCTAATTATGCTAAGCCTTTTCTTAACAAAGTTGTTAGTACAACTACTAACATAGTTACACGGTGTTTAAACCGTGTTTGTACTAATTATATGCCTTATTTCTTTACTTTATTGCTACAATTGTGTACTTTTACTAGAAGTACAAATTCTAGAATTAAAGCATCTATGCCGACTACTATAGCAAAGAATACTGTTAAGAGTGTCGGTAAATTTTGTCTAGAGGCTTCATTTAATTATTTGAAGTCACCTAATTTTTCTAAACTGATAAATATTATAATTTGGTTTTTACTATTAAGTGTTTGCCTAGGTTCTTTAATCTACTCAACCGCTGCTTTAGGTGTTTTAATGTCTAATTTAGGCATGCCTTCTTACTGTACTGGTTACAGAGAAGGCTATTTGAACTCTACTAATGTCACTATTGCAACCTACTGTACTGGTTCTATACCTTGTAGTGTTTGTCTTAGTGGTTTAGATTCTTTAGACACCTATCCTTCTTTAGAAACTATACAAATTACCATTTCATCTTTTAAATGGGATTTAACTGCTTTTGGCTTAGTTGCAGAGTGGTTTTTGGCATATATTCTTTTCACTAGGTTTTTCTATGTACTTGGATTGGCTGCAATCATGCAATTGTTTTTCAGCTATTTTGCAGTACATTTTATTAGTAATTCTTGGCTTATGTGGTTAATAATTAATCTTGTACAAATGGCCCCGATTTCAGCTATGGTTAGAATGTACATCTTCTTTGCATCATTTTATTATGTATGGAAAAGTTATGTGCATGTTGTAGACGGTTGTAATTCATCAACTTGTATGATGTGTTACAAACGTAATAGAGCAACAAGAGTCGAATGTACAACTATTGTTAATGGTGTTAGAAGGTCCTTTTATGTCTATGCTAATGGAGGTAAAGGCTTTTGCAAACTACACAATTGGAATTGTGTTAATTGTGATACATTCTGTGCTGGTAGTACATTTATTAGTGATGAAGTTGCGAGAGACTTGTCACTACAGTTTAAAAGACCAATAAATCCTACTGACCAGTCTTCTTACATCGTTGATAGTGTTACAGTGAAGAATGGTTCCATCCATCTTTACTTTGATAAAGCTGGTCAAAAGACTTATGAAAGACATTCTCTCTCTCATTTTGTTAACTTAGACAACCTGAGAGCTAATAACACTAAAGGTTCATTGCCTATTAATGTTATAGTTTTTGATGGTAAATCAAAATGTGAAGAATCATCTGCAAAATCAGCGTCTGTTTACTACAGTCAGCTTATGTGTCAACCTATACTGTTACTAGATCAGGCATTAGTGTCTGATGTTGGTGATAGTGCGGAAGTTGCAGTTAAAATGTTTGATGCTTACGTTAATACGTTTTCATCAACTTTTAACGTACCAATGGAAAAACTCAAAACACTAGTTGCAACTGCAGAAGCTGAACTTGCAAAGAATGTGTCCTTAGACAATGTCTTATCTACTTTTATTTCAGCAGCTCGGCAAGGGTTTGTTGATTCAGATGTAGAAACTAAAGATGTTGTTGAATGTCTTAAATTGTCACATCAATCTGACATAGAAGTTACTGGCGATAGTTGTAATAACTATATGCTCACCTATAACAAAGTTGAAAACATGACACCCCGTGACCTTGGTGCTTGTATTGACTGTAGTGCGCGTCATATTAATGCGCAGGTAGCAAAAAGTCACAACATTGCTTTGATATGGAACGTTAAAGATTTCATGTCATTGTCTGAACAACTACGAAAACAAATACGTAGTGCTGCTAAAAAGAATAACTTACCTTTTAAGTTGACATGTGCAACTACTAGACAAGTTGTTAATGTTGTAACAACAAAGATAGCACTTAAGGGTGGTAAAATTGTTAATAATTGGTTGAAGCAGTTAATTAAAGTTACACTTGTGTTCCTTTTTGTTGCTGCTATTTTCTATTTAATAACACCTGTTCATGTCATGTCTAAACATACTGACTTTTCAAGTGAAATCATAGGATACAAGGCTATTGATGGTGGTGTCACTCGTGACATAGCATCTACAGATACTTGTTTTGCTAACAAACATGCTGATTTTGACACATGGTTTAGCCAGCGTGGTGGTAGTTATACTAATGACAAAGCTTGCCCATTGATTGCTGCAGTCATAACAAGAGAAGTGGGTTTTGTCGTGCCTGGTTTGCCTGGCACGATATTACGCACAACTAATGGTGACTTTTTGCATTTCTTACCTAGAGTTTTTAGTGCAGTTGGTAATATCTGTTACACACCATCAAAACTTATAGAGTACACTGACTTTGCAACATCAGCTTGTGTTTTGGCTGCTGAATGTACAATTTTTAAAGATGCTTCTGGTAAGCCAGTACCATATTGTTATGATACCAATGTACTAGAAGGTTCTGTTGCTTATGAAAGTTTACGCCCTGACACACGTTATGTGCTCATGGATGGCTCTATTATTCAATTTCCTAACACCTACCTTGAAGGTTCTGTTAGAGTGGTAACAACTTTTGATTCTGAGTACTGTAGGCACGGCACTTGTGAAAGATCAGAAGCTGGTGTTTGTGTATCTACTAGTGGTAGATGGGTACTTAACAATGATTATTACAGATCTTTACCAGGAGTTTTCTGTGGTGTAGATGCTGTAAATTTACTTACTAATATGTTTACACCACTAATTCAACCTATTGGTGCTTTGGACATATCAGCATCTATAGTAGCTGGTGGTATTGTAGCTATCGTAGTAACATGCCTTGCCTACTATTTTATGAGGTTTAGAAGAGCTTTTGGTGAATACAGTCATGTAGTTGCCTTTAATACTTTACTATTCCTTATGTCATTCACTGTACTCTGTTTAACACCAGTTTACTCATTCTTACCTGGTGTTTATTCTGTTATTTACTTGTACTTGACATTTTATCTTACTAATGATGTTTCTTTTTTAGCACATATTCAGTGGATGGTTATGTTCACACCTTTAGTACCTTTCTGGATAACAATTGCTTATATCATTTGTATTTCCACAAAGCATTTCTATTGGTTCTTTAGTAATTACCTAAAGAGACGTGTAGTCTTTAATGGTGTTTCCTTTAGTACTTTTGAAGAAGCTGCGCTGTGCACCTTTTTGTTAAATAAAGAAATGTATCTAAAGTTGCGTAGTGATGTGCTATTACCTCTTACGCAATATAATAGATACTTAGCTCTTTATAATAAGTACAAGTATTTTAGTGGAGCAATGGATACAACTAGCTACAGAGAAGCTGCTTGTTGTCATCTCGCAAAGGCTCTCAATGACTTCAGTAACTCAGGTTCTGATGTTCTTTACCAACCACCACAAACCTCTATCACCTCAGCTGTTTTGCAGAGTGGTTTTAGAAAAATGGCATTCCCATCTGGTAAAGTTGAGGGTTGTATGGTACAAGTAACTTGTGGTACAACTACACTTAACGGTCTTTGGCTTGATGACGTAGTTTACTGTCCAAGACATGTGATCTGCACCTCTGAAGACATGCTTAACCCTAATTATGAAGATTTACTCATTCGTAAGTCTAATCATAATTTCTTGGTACAGGCTGGTAATGTTCAACTCAGGGTTATTGGACATTCTATGCAAAATTGTGTACTTAAGCTTAAGGTTGATACAGCCAATCCTAAGACACCTAAGTATAAGTTTGTTCGCATTCAACCAGGACAGACTTTTTCAGTGTTAGCTTGTTACAATGGTTCACCATCTGGTGTTTACCAATGTGCTATGAGGCCCAATTTCACTATTAAGGGTTCATTCCTTAATGGTTCATGTGGTAGTGTTGGTTTTAACATAGATTATGACTGTGTCTCTTTTTGTTACATGCACCATATGGAATTACCAACTGGAGTTCATGCTGGCACAGACTTAGAAGGTAACTTTTATGGACCTTTTGTTGACAGGCAAACAGCACAAGCAGCTGGTACGGACACAACTATTACAGTTAATGTTTTAGCTTGGTTGTACGCTGCTGTTATAAATGGAGACAGGTGGTTTCTCAATCGATTTACCACAACTCTTAATGACTTTAACCTTGTGGCTATGAAGTACAATTATGAACCTCTAACACAAGACCATGTTGACATACTAGGACCTCTTTCTGCTCAAACTGGAATTGCCGTTTTAGATATGTGTGCTTCATTAAAAGAATTACTGCAAAATGGTATGAATGGACGTACCATATTGGGTAGTGCTTTATTAGAAGATGAATTTACACCTTTTGATGTTGTTAGACAATGCTCAGGTGTTACTTTCCAAAGTGCAGTGAAAAGAACAATCAAGGGTACACACCACTGGTTGTTACTCACAATTTTGACTTCACTTTTAGTTTTAGTCCAGAGTACTCAATGGTCTTTGTTCTTTTTTTTGTATGAAAATGCCTTTTTACCTTTTGCTATGGGTATTATTGCTATGTCTGCTTTTGCAATGATGTTTGTCAAACATAAGCATGCATTTCTCTGTTTGTTTTTGTTACCTTCTCTTGCCACTGTAGCTTATTTTAATATGGTCTATATGCCTGCTAGTTGGGTGATGCGTATTATGACATGGTTGGATATGGTTGATACTAGTTTGTCTGGTTTTAAGCTAAAAGACTGTGTTATGTATGCATCAGCTGTAGTGTTACTAATCCTTATGACAGCAAGAACTGTGTATGATGATGGTGCTAGGAGAGTGTGGACACTTATGAATGTCTTGACACTCGTTTATAAAGTTTATTATGGTAATGCTTTAGATCAAGCCATTTCCATGTGGGCTCTTATAATCTCTGTTACTTCTAACTACTCAGGTGTAGTTACAACTGTCATGTTTTTGGCCAGAGGTATTGTTTTTATGTGTGTTGAGTATTGCCCTATTTTCTTCATAACTGGTAATACACTTCAGTGTATAATGCTAGTTTATTGTTTCTTAGGCTATTTTTGTACTTGTTACTTTGGCCTCTTTTGTTTACTCAACCGCTACTTTAGACTGACTCTTGGTGTTTATGATTACTTAGTTTCTACACAGGAGTTTAGATATATGAATTCACAGGGACTACTCCCACCCAAGAATAGCATAGATGCCTTCAAACTCAACATTAAATTGTTGGGTGTTGGTGGCAAACCTTGTATCAAAGTAGCCACTGTACAGTCTAAAATGTCAGATGTAAAGTGCACATCAGTAGTCTTACTCTCAGTTTTGCAACAACTCAGAGTAGAATCATCATCTAAATTGTGGGCTCAATGTGTCCAGTTACACAATGACATTCTCTTAGCTAAAGATACTACTGAAGCCTTTGAAAAAATGGTTTCACTACTTTCTGTTTTGCTTTCCATGCAGGGTGCTGTAGACATAAACAAGCTTTGTGAAGAAATGCTGGACAACAGGGCAACCTTACAAGCTATAGCTTCAGAGTTTAGTTCCCTTCCATCATATGCAGCTTTTGCTACTGCTCAAGAAGCTTATGAGCAGGCTGTTGCTAATGGTGATTCTGAAGTTGTTCTTAAAAAGTTGAAGAAGTCTTTGAATGTGGCTAAATCTGAATTTGACCGTGATGCAGCCATGCAACGTAAGTTGGAAAAGATGGCTGATCAAGCTATGACCCAAATGTATAAACAGGCTAGATCTGAGGACAAGAGGGCAAAAGTTACTAGTGCTATGCAGACAATGCTTTTCACTATGCTTAGAAAGTTGGATAATGATGCACTCAACAACATTATCAACAATGCAAGAGATGGTTGTGTTCCCTTGAACATAATACCTCTTACAACAGCAGCCAAACTAATGGTTGTCATACCAGACTATAACACATATAAAAATACGTGTGATGGTACAACATTTACTTATGCATCAGCATTGTGGGAAATCCAACAGGTTGTAGATGCAGATAGTAAAATTGTTCAACTTAGTGAAATTAGTATGGACAATTCACCTAATTTAGCATGGCCTCTTATTGTAACAGCTTTAAGGGCCAATTCTGCTGTCAAATTACAGAATAATGAGCTTAGTCCTGTTGCACTACGACAGATGTCTTGTGCTGCCGGTACTACACAAACTGCTTGCACTGATGACAATGCGTTAGCTTACTACAACACAACAAAGGGAGGTAGGTTTGTACTTGCACTGTTATCCGATTTACAGGATTTGAAATGGGCTAGATTCCCTAAGAGTGATGGAACTGGTACTGTCTATACAGAACTGGAACCACCTTGTAGGTTTGTTACAGACACACCTAAAGGTCCTAAAGTGAAGTATTTATACTTTATTAAAGGATTAAACAACCTAAATAGAGGTATGGTACTTGGTAGTTTAGCTGCCACAGTACGTCTACAAGCTGGTAATGCAACAGAAGTGCCTGCCAATTCAACTGTATTATCTTTCTGTGCTTTTGCTGTAGATGCTGCTAAAGCTTACAAAGATTATCTAGCTAGTGGGGGACAACCAATCACTAATTGTGTTAAGATGTTGTGTACACACACTGGTACTGGTCAGGCAATAACAGTTACA
CCGGAAGCCAATATGGATCAAGAATCCTTTGGTGGTGCATCGTGTTGTCTGTACTGCCGTTGCCACATAGATCATCCAAATCCTAAAGGATTTTGTGACTTAAAAGGTAAGTATGTACAAATACCTACAACTTGTGCTAATGACCCTGTGGGTTTTACACTTAAAAACACAGTCTGTACCGTCTGCGGTATGTGGAAAGGTTATGGCTGTAGTTGTGATCAACTCCGCGAACCCATGCTTCAGTCAGCTGATGCACAATCGTTTTTAAACGGGTTTGCGGTGTAAGTGCAGCCCGTCTTACACCGTGCGGCACAGGCACTAGTACTGATGTCGTATACAGGGCTTTTGACATCTACAATGATAAAGTAGCTGGTTTTGCTAAATTCCTAAAAACTAATTGTTGTCGCTTCCAAGAAAAGGACGAAGATGACAATTTAATTGATTCTTACTTTGTAGTTAAGAGACACACTTTCTCTAACTACCAACATGAAGAAACAATTTATAATTTACTTAAGGATTGTCCAGCTGTTGCTAAACATGACTTCTTTAAGTTTAGAATAGACGGTGACATGGTACCACATATATCACGTCAACGTCTTACTAAATACACAATGGCAGACCTCGTCTATGCTTTAAGGCATTTTGATGAAGGTAATTGTGACACATTAAAAGAAATACTTGTCACATACAATTGTTGTGATGATGATTATTTCAATAAAAAGGACTGGTATGATTTTGTAGAAAACCCAGATATATTACGCGTATACGCCAACTTAGGTGAACGTGTACGCCAAGCTTTGTTAAAAACAGTACAATTCTGTGATGCCATGCGAAATGCTGGTATTGTTGGTGTACTGACATTAGATAATCAAGATCTCAATGGTAACTGGTATGATTTCGGTGATTTCATACAAACCACGCCAGGTAGTGGAGTTCCTGTTGTAGATTCTTATTATTCATTGTTAATGCCTATATTAACC
TTGACCAGGGCTTTAACTGCAGAGTCACATGTTGACACTGACTTAACAAAGCCTTACATTAAGTGGGATTTGTTAAAATATGACTTCACGGAAGAGAGGTTAAAACTCTTTGACCGTTATTTTAAATATTGGGATCAGACATACCACCCAAATTGTGTTAACTGTTTGGATGACAGATGCATTCTGCATTGTGCAAACTTTAATGTTTTATTCTCTACAGTGTTCCCACTTACAAGTTTTGGACCACTAGTGAGAAAAATATTTGTTGATGGTGTTCCATTTGTAGTTTCAACTGGATACCACTTCAGAGAGCTAGGTGTTGTACATAATCAGGATGTAAACTTACATAGCTCTAGACTTAGTTTTAAGGAATTACTTGTGTATGCTGCTGACCCTGCTATGCACGCTGCTTCTGGTAATCTATTACTAGATAAACGCACTACGTGCTTTTCAGTAGCTGCACTTACTAACAATGTTGCTTTTCAAACTGTCAAACCCGGTAATTTTAACAAAGACTTCTATGACTTTGCTGTGTCTAAGGGTTTCTTTAAGGAAGGAAGTTCTGTTGAATTAAAACACTTCTTCTTTGCTCAGGATGGTAATGCTGCTATCAGCGATTATGACTACTATCGTTATAATCTACCAACAATGTGTGATATCAGACAACTACTATTTGTAGTTGAAGTTGTTGATAAGTACTTTGATTGTTACGATGGTGGCTGTATTAATGCTAACCAAGTCATCGTCAACAACCTAGACAAATCAGCTGGTTTTCCATTTAATAAATGGGGTAAGGCTAGACTTTATTATGATTCAATGAGTTATGAGGATCAAGATGCACTTTTCGCATATACAAAACGTAATGTCATCCCTACTATAACTCAAATGAATCTTAAGTATGCCATTAGTGCAAAGAATAGAGCTCGCACCGTAGCTGGTGTCTCTATCTGTAGTACTATGACCAATAGACAGTTTCATCAAAAATTATTGAAATCAATAGCCGCCACTAGAGGAGCTACTGTAGTAATTGGAACAAGCAAATTCTATGGTGGTTGGCACAACATGTTAAAAACTGTTTATAGTGATGTAGAAAACCCTCACCTTATGGGTTGGGATTATCCTAAATGTGATAGAGCCATGCCTAACATGCTTAGAATTATGGCCTCACTTGTTCTTGCTCGCAAACATACAACGTGTTGTAGCTTGTCACACCGTTTCTATAGATTAGCTAATGAGTGTGCTCAAGTATTGAGTGAAATGGTCATGTGTGGCGGTTCACTATATGTTAAACCAGGTGGAACCTCATCAGGAGATGCCACAACTGCTTATGCTAATAGTGTTTTTAACATTTGTCAAGCTGTCACGGCCAATGTTAATGCACTTTTATCTACTGATGGTAACAAAATTGCCGATAAGTATGTCCGCAATTTACAACACAGACTTTATGAGTGTCTCTATAGAAATAGAGATGTTGACACAGACTTTGTGAATGAGTTTTACGCATATTTGCGTAAACATTTCTCAATGATGATACTCTCTGACGATGCTGTTGTGTGTTTCAATAGCACTTATGCATCTCAAGGTCTAGTGGCTAGCATAAAGAACTTTAAGTCAGTTCTTTATTATCAAAACAATGTTTTTATGTCTGAAGCAAAATGTTGGACTGAGACTGACCTTACTAAAGGACCTCATGAATTTTGCTCTCAACATACAATGCTAGTTAAACAGGGTGATGATTATGTGTACCTTCCTTACCCAGATCCATCAAGAATCCTAGGGGCCGGCTGTTTTGTAGATGATATCGTAAAAACAGATGGTACACTTATGATTGAACGGTTCGTGTCTTTAGCTATAGATGCTTACCCACTTACTAAACATCCTAATCAGGAGTATGCTGATGTCTTTCATTTGTACTTACAATACATAAGAAAGCTACATGATGAGTTAACAGGACACATGTTAGACATGTATTCTGTTATGCTTACTAATGATAACACTTCAAGGTATTGGGAACCTGAGTTTTATGAGGCTATGTACACACCGCATACAGTCTTACAGGCTGTTGGGGCTTGTGTTCTTTGCAATTCACAGACTTCATTAAGATGTGGTGCTTGCATACGTAGACCATTCTTATGTTGTAAATGCTGTTACGACCATGTCATATCAACATCACATAAATTAGTCTTGTCTGTTAATCCGTATGTTTGCAATGCTCCAGGTTGTGATGTCACAGATGTGACTCAACTTTACTTAGGAGGTATGAGCTATTATTGTAAATCACATAAACCACCCATTAGTTTTCCATTGTGTGCTAATGGACAAGTTTTTGGTTTATATAAAAATACATGTGTTGGTAGCGATAATGTTACTGACTTTAATGCAATTGCAACATGTGACTGGACAAATGCTGGTGATTACATTTTAGCTAACACCTGTACTGAAAGACTCAAGCTTTTTGCAGCAGAAACGCTCAAAGCTACTGAGGAGACATTTAAACTGTCTTATGGTATTGCTACTGTACGTGAAGTGCTGTCTGACAGAGAATTACATCTTTCATGGGAAGTTGGTAAACCTAGACCACCACTTAACCGAAATTATGTCTTTACTGGTTATCGTGTAACTAAAAACAGTAAAGTACAAATAGGAGAGTACACCTTTGAAAAAGGTGACTATGGTGATGCTGTTGTTTACCGAGGTACAACAACTTACAAATTAAATGTTGGTGATTATTTTGTGCTGACATCACATACAGTAATGCCATTAAGTGCACCTACACTAGTGCCACAAGAGCACTATGTTAGAATTACTGGCTTATACCCAACACTCAATATCTCATATGAGTTTTCTAGCAATGTTGCAAATTATCAAAAGGTTGGTATGCAAAAGTATTCTACACTCCAGGGACCACCTGGTACTGGTAAGAGTCATTTTGCTATTGGCCTAGCTCTCTACTACCCTTCTGCTCGCATAGTGTATACAGCTTGCTCTCATGCCGCTGTTGATGCACTATGTGAGAAGGCATTAAAATATTTGCCTATAGATAAATGTAGTAGAATTATACCTGCACGTGCTCGTGTAGAGTGTTTTGATAAATTCAAAGTGAATTCAACATTAGAACAGTATGTCTTTTGTACTGTAAATGCATTGCCTGAGACGACAGCAGATATAGTTGTCTTTGATGAAATTTCAATGGCCACAAATTATGATTTGAGTGTTGTCAATGCCAGATTACGTGCTAAGCACTATGTGTACATTGGCGACCCTGCTCAATTACCTGCACCACGCACATTGCTAACTAAGGGCACACTAGAACCAGAATATTTCAATTCAGTGTGTAGACTTATGAAAACTATAGGTCCAGACATGTTCCTCGGAACTTGTCGGCGTTGTCCTGCTGAAATTGTTGACACTGTGAGTGCTTTGGTTTATGATAATAAGCTTAAAGCACATAAAGACAAATCAGCTCAATGCTTTAAAATGTTTTATAAGGGTGTTATCACGCATGATGTTTCATCTGCAATTAACAGGCCACAAATAGGCGTGGTAAGAGAATTCCTTACACGTAACCCTGCTTGGAGAAAAGCTGTCTTTATTTCACCTTATAATTCACAGAATGCTGTAGCCTCAAAGATTTTGGGACTACCAACTCAAACTGTTGATTCATCACAGGGCTCAGAATATGACTATGTCATATTCACTCAAACCACTGAAACAGCTCACTCTTGTAATGTAAACAGATTTAATGTTGCTATTACCAGAGCAAAAGTAGGCATACTTTGCATAATGTCTGATAGAGACCTTTATGACAAGTTGCAATTTACAAGTCTTGAAATTCCACGTAGGAATGTGGCAACTTTACAAGCTGAAAATGTAACAGGACTCTTTAAAGATTGTAGTAAGGTAATCACTGGGTTACATCCTACACAGGCACCTACACACCTCAGTGTTGACACTAAATTCAAAACTGAAGGTTTATGTGTTGACATACCTGGCATACCTAAGGACATGACCTATAGAAGACTCATCTCTATGATGGGTTTTAAAATGAATTATCAAGTTAATGGTTACCCTAACATGTTTATCACCCGCGAAGAAGCTATAAGACATGTACGTGCATGGATTGGCTTCGATGTCGAGGGGTGTCATGCTACTAGAGAAGCTGTTGGTACCAATTTACCTTTACAGCTAGGTTTTTCTACAGGTGTTAACCTAGTTGCTGTACCTACAGGTTATGTTGATACACCTAATAATACAGATTTTTCCAGAGTTAGTGCTAAACCACCGCCTGGAGATCAATTTAAACACCTCATACCACTTATGTACAAAGGACTTCCTTGGAATGTAGTGCGTATAAAGATTGTACAAATGTTAAGTGACACACTTAAAAATCTCTCTGACAGAGTCGTATTTGTCTTATGGGCACATGGCTTTGAGTTGACATCTATGAAGTATTTTGTGAAAATAGGACCTGAGCGCACCTGTTGTCTATGTGATAGACGTGCCACATGCTTTTCCACTGCTTCAGACACTTATGCCTGTTGGCATCATTCTATTGGATTTGATTACGTCTATAATCCGTTTATGATTGATGTTCAACAATGGGGTTTTACAGGTAACCTACAAAGCAACCATGATCTGTATTGTCAAGTCCATGGTAATGCACATGTAGCTAGTTGTGATGCAATCATGACTAGGTGTCTAGCTGTCCACGAGTGCTTTGTTAAGCGTGTTGACTGGACTATTGAATATCCTATAATTGGTGATGAACTGAAGATTAATGCGGCTTGTAGAAAGGTTCAACACATGGTTGTTAAAGCTGCATTATTAGCAGACAAATTCCCAGTTCTTCACGACATTGGTAACCCTAAAGCTATTAAGTGTGTACCTCAAGCTGATGTAGAATGGAAGTTCTATGATGCACAGCCTTGTAGTGACAAAGCTTATAAAATAGAAGAATTATTCTATTCTTATGCCACACATTCTGACAAATTCACAGATGGTGTATGCCTATTTTGGAATTGCAATGTCGATAGATATCCTGCTAATTCCATTGTTTGTAGATTTGACACTAGAGTGCTATCTAACCTTAACTTGCCTGGTTGTGATGGTGGCAGTTTGTATGTAAATAAACATGCATTCCACACACCAGCTTTTGATAAAAGTGCTTTTGTTAATTTAAAACAATTACCATTTTTCTATTACTCTGACAGTCCATGTGAGTCTCATGGAAAACAAGTAGTGTCAGATATAGATTATGTACCACTAAAGTCTGCTACGTGTATAACACGTTGCAATTTAGGTGGTGCTGTCTGTAGACATCATGCTAATGAGTACAGATTGTATCTCGATGCTTATAACATGATGATCTCAGCTGGCTTTAGCTTGTGGGTTTACAAACAATTTGATACTTATAACCTCTGGAACACTTTTACAAGACTTCAGAGTTTAGAAAATGTGGCTTTTAATGTTGTAAATAAGGGACACTTTGATGGACAACAGGGTGAAGTACCAGTTTCTATCATTAATAACACTGTTTACACAAAAGTTGATGGTGTTGATGTAGAATTGTTTGAAAATAAAACAACATTACCTGTTAATGTAGCATTTGAGCTTTGGGCTAAGCGCAACATTAAACCAGTACCAGAGGTGAAAATACTCAATAATTTGGGTGTGGACATTGCTGCTAATACTGTGATCTGGGACTACAAAAGAGATGCTCCAGCACATATATCTACTATTGGTGTTTGTTCTATGACTGACATAGCCAAGAAACCAACTGAAACGATTTGTGCACCACTCACTGTCTTTTTTGATGGTAGAGTTGATGGTCAAGTAGACTTATTTAGAAATGCCCGTAATGGTGTTCTTATTACAGAAGGTAGTGTTAAAGGTTTACAACCATCTGTAGGTCCCAAACAAGCTAGTCTTAATGGAGTCACATTAATTGGAGAAGCCGTAAAAACACAGTTCAATTATTATAAGAAAGTTGATGGTGTTGTCCAACAATTACCTGAAACTTACTTTACTCAGAGTAGAAATTTACAAGAATTTAAACCCAGGAGTCAAATGGAAATTGATTTCTTAGAATTAGCTATGGATGAATTCATTGAACGGTATAAATTAGAAGGCTATGCCTTCGAACATATCGTTTATGGAGATTTTAGTCATAGTCAGTTAGGTGGTTTACATCTACTGATTGGACTAGCTAAACGTTTTAAGGAATCACCTTTTGAATTAGAAGATTTTATTCCTATGGACAGTACAGTTAAAAACTATTTCATAACAGATGCGCAAACAGGTTCATCTAAGTGTGTGTGTTCTGTTATTGATTTATTACTTGATGATTTTGTTGAAATAATAAAATCCCAAGATTTATCTGTAGTTTCTAAGGTTGTCAAAGTGACTATTGACTATACAGAAATTTCATTTATGCTTTGGTGTAAAGATGGCCATGTAGAAACATTTTACCCAAAATTACAATCTAGTCAAGCGTGGCAACCGGGTGTTGCTATGCCTAATCTTTACAAAATGCAAAGAATGCTATTAGAAAAGTGTGACCTTCAAAATTATGGTGATAGTGCAACATTACCTAAAGGCATAATGATGAATGTCGCAAAATATACTCAACTGTGTCAATATTTAAACACATTAACATTAGCTGTACCCTATAATATGAGAGTTATACATTTTGGTGCTGGTTCTGATAAAGGAGTTGCACCAGGTACAGCTGTTTTAAGACAGTGGTTGCCTACGGGTACGCTGCTTGTCGATTCAGATCTTAATGACTTTGTCTCTGATGCAGATTCAACTTTGATTGGTGATTGTGCAACTGTACATACAGCTAATAAATGGGATCTCATTATTAGTGATATGTACGACCCTAAGACTAAAAATGTTACAAAAGAAAATGACTCTAAAGAGGGTTTTTTCACTTACATTTGTGGGTTTATACAACAAAAGCTAGCTCTTGGAGGTTCCGTGGCTATAAAGATAACAGAACATTCTTGGAATGCTGATCTTTATAAGCTCATGGGACACTTCGCATGGTGGACAGCCTTTGTTACTAATGTGAATGCGTCATCATCTGAAGCATTTTTAATTGGATGTAATTATCTTGGCAAACCACGCGAACAAATAGATGGTTATGTCATGCATGCAAATTACATATTTTGGAGGAATACAAATCCAATTCAGTTGTCTTCCTATTCTTTATTTGACATGAGTAAATTTCCCCTTAAATTAAGGGGTACTGCTGTTATGTCTTTAAAAGAAGGTCAAATCAATGATATGATTTTATCTCTTCTTAGTAAAGGTAGACTTATAATTAGAGAAAACAACAGAGTTGTTATTTCTAGTGATGTTCTTGTTAACAACTAAACGAACAATGTTTGTTTTTCTTGTTTTATTGCCACTAGTCTCTATTCAGTGTGTTAATCTTACAACCAGAACTCAATTACCCCCTGCATACACTAATTCTTTCACACGTGGTGTTTATTACCCTGACAAAGTTTTCAGATCCTCAGTTTTACATTCAACTCAGGACTTGTTCTTACCTTTCTTTTCCAATGTTACTTGGTTCCATGCTATACATGTCTCTGGGACCAATGGTACTAAGAGGTTTGATAACCCTGTCCTACCATTTAATGATGGTGTTTATTTTGCTTCCACTGAGAAGTCTAACATAATAAGAGGCTGGATTTTTGGTACTACTTTAGATTCGAAGACCCAGTCCCTACTTATTGTTAATAACGCTACTAATGTTGTTATTAAAGTCTGTGAATTTCAATTTTGTAATGATCCATTTTTGGGTGTTTATTACCACAAAAACAACAAAAGTTGTATGGAAAGTGAGTTCAGAGTTTATTCTAGTGCGAATAATTGCACTTTTGAATATGTCTCTCAGCCTTTTCTTATGGACCTTGAAGGAAAACAGGGTAATTTCAAAAATCTTAGGGAATTTGTGTTTAAGAATATTGATGGTTATTTTAAAATATATTCTAAGCACACGCCTATTAATTTAGTGCGTGATCTCCCTCAGGGTTTTTCGGCTTTAGAACCATTGGTAGATTTGCCAATAGGTATTAACATCACTAGGTTTCAAACTTTACTTGCTTTACATAGAAGTTATTTGACTCCTGGTGATTCTTCTTCAGGTTGGACAGCTGGTGCTGCAGCTTATTATGTGGGTTATCTTCAACCTAGGACTTTTCTATTAAAATATAATGAAAATGGAACCATTACAGATGCTGTAGACTGTGCACTTGACCCTCTCTCAGAAACAAAGTGTACGTTGAAATCCTTCACTGTAGAAAAAGGAATCTATCAAACTTCTAACTTTAGAGTCCAACCAACAGAATCTATTGTTAGATTTCCTAATATTACAAACTTGTGCCCTTTTGGTGAAGTTTTTAACGCCACCAGATTTGCATCTGTTTATGCTTGGAACAGGAAGAGAATCAGCAACTGTGTTGCTGATTATTCTGTCCTATATAATTCCGCATCATTTTCCACTTTTAAGTGTTATGGAGTGTCTCCTACTAAATTAAATGATCTCTGCTTTACTAATGTCTATGCAGATTCATTTGTAATTAGAGGTGATGAAGTCAGACAAATCGCTCCAGGGCAAACTGGAAAGATTGCTGATTATAATTATAAATTACCAGATGATTTTACAGGCTGC
GTTATAGCTTGGAATTCTAACAATCTTGATTCTAAGGTTGGTGGTAATTATAATTACCGGTATAGATTGTTTAGGAAGTCTAATCTCAAACCTTTTGAGAGAGATATTTCAACTGAAATCTATCAGGCCGGTAGCACACCTTGTAATGGTGTTGAAGGTTTTAATTGTTACTTTCCTTTACAATCATATGGTTTCCAACCCACTAATGGTGTTGGTTACCAACCATACAGAGTAGTAGTACTTTCTTTTGAACTTCTACATGCACCAGCAACTGTTTGTGGACCTAAAAAGTCTACTAATTTGGTTAAAAACAAATGTGTCAATTTCAACTTCAATGGTTTAACAGGCACAGGTGTTCTTACTGAGTCTAACAAAAAGTTTCTGCCTTTCCAACAATTTGGCAGAGACATTGCTGACACTACTGATGCTGTCCGTGATCCACAGACACTTGAGATTCTTGACATTACACCATGTTCTTTTGGTGGTGTCAGTGTTATAACACCAGGAACAAATACTTCTAACCAGGTTGCTGTTCTTTATCAGGGTGTTAACTGCACAGAAGTCCCTGTTGCTATTCATGCAGATCAACTTACTCCTACTTGGCGTGTTTATTCTACAGGTTCTAATGTTTTTCAAACACGTGCAGGCTGTTTAATAGGGGCTGAACATGTCAACAACTCATATGAGTGTGACATACCCATTGGTGCAGGTATATGCGCTAGTTATCAGACTCAGACTAATTCTCCTCGGCGGGCACGTAGTGTAGCTAGTCAATCCATCATTGCCTACACTATGTCACTTGGTGCAGAAAATTCAGTTGCTTACTCTAATAACTCTATTGCCATACCCACAAATTTTACTATTAGTGTTACCACAGAAATTCTACCAGTGTCTATGACCAAGACATCAGTAGATTGTACAATGTACATTTGTGGTGATTCAACTGAATGCAGCAATCTTTTGTTGCAATATGGCAGTTTTTGTACACAATTAAACCGTGCTTTAACTGGAATAGCTGTTGAACAAGACAAAAACACCCAAGAAGTTTTTGCACAAGTCAAACAAATTTACAAAACACCACCAATTAAAGATTTTGGTGGTTTTAATTTTTCACAAATATTACCAGATCCATCAAAACCAAGCAAGAGGTCATTTATTGAAGATCTACTTTTCAACAAAGTGACACTTGCAGATGCTGGCTTCATCAAACAATATGGTGATTGCCTTGGTGATATTGCTGCTAGAGACCTCATTTGTGCACAAAAGTTTAACGGCCTTACTGTTTTGCCACCTTTGCTCACAGATGAAATGATTGCTCAATACACTTCTGCACTGTTAGCGGGTACAATCACTTCTGGTTGGACCTTTGGTGCAGGTGCTGCATTACAAATACCATTTGCTATGCAAATGGCTTATAGGTTTAATGGTATTGGAGTTACACAGAATGTTCTCTATGAGAACCAAAAATTGATTGCCAACCAATTTAATAGCGCTATTGGCAAAATTCAAGACTCACTTTCTTCCACAGCAAGTGCACTTGGAAAACTTCAAGATGTGGTCAACCAAAATGCACAAGCTTTAAACACGCTTGTTAAACAACTTAGCTCCAATTTTGGTGCAATTTCAAGTGTTTTAAATGATATCCTTTCACGTCTTGACAAAGTTGAGGCTGAAGTGCAAATTGATAGGTTGATCACAGGCAGACTTCAAAGTTTGCAGACATATGTGACTCAACAATTAATTAGAGCTGCAGAAATCAGAGCTTCTGCTAATCTTGCTGCTACTAAAATGTCAGAGTGTGTACTTGGACAATCAAAAAGAGTTGATTTTTGTGGAAAGGGCTATCATCTTATGTCCTTCCCTCAGTCAGCACCTCATGGTGTAGTCTTCTTGCATGTGACTTATGTCCCTGCACAAGAAAAGAACTTCACAACTGCTCCTGCCATTTGTCATGATGGAAAAGCACACTTTCCTCGTGAAGGTGTCTTTGTTTCAAATGGCACACACTGGTTTGTAACACAAAGGAATTTTTATGAACCACAAATCATTACTACAGACAACACATTTGTGTCTGGTAACTGTGATGTTGTAATAGGAATTGTCAACAACACAGTTTATGATCCTTTGCAACCTGAATTAGACTCATTCAAGGAGGAGTTAGATAAATATTTTAAGAATCATACATCACCAGATGTTGATTTAGGTGACATCTCTGGCATTAATGCTTCAGTTGTAAACATTCAAAAAGAAATTGACCGCCTCAATGAGGTTGCCAAGAATTTAAATGAATCTCTCATCGATCTCCAAGAACTTGGAAAGTATGAGCAGTATATAAAATGGCCATGGTACATTTGGCTAGGTTTTATAGCTGGCTTGATTGCCATAGTAATGGTGACAATTATGCTTTGCTGTATGACCAGTTGCTGTAGTTGTCTCAAGGGCTGTTGTTCTTGTGGATCCTGCTGCAAATTTGATGAAGACGACTCTGAGCCAGTGCTCAAAGGAGTCAAATTACATTACACATAAACGAACTTATGGATTTGTTTATGAGAATCTTCACAATTGGAACTGTAACTTTGAAGCAAGGTGAAATCAAGGATGCTACTCCTTCAGATTTTGTTCGCGCTACTGCAACGATACCGATACAAGCCTCACTCCCTTTCGGATGGCTTATTGTTGGCGTTGCACTTCTTGCTGTTTTTCATAGCGCTTCCAAAATCATAACCCTCAAAAAGAGATGGCAACTAGCACTCTCCAAGGGTGTTCACTTTGTTTGCAACTTGCTGTTGTTGTTTGTAACAGTTTACTCACACCTTTTGCTCGTTGCTGTTGGCCTTGAAGCCCCTTTTCTCTATCTTTATGCTTTAGTC
TACTTCTTGCAGAGTATAAACTTTGTAAGAATAATAATGAGGCTTTGGCTTTGCTGGAAATGCCGTTCCAAAAACCCATTACTTTATGATGCCAACTATTTTCTTTGCTGGCATACTAATTGTTACGACTATTGTATACCTTACAATAGTGTAACTTCTTCAATTGTCATTACTTCAGGTGATGGCACAACAAGTCCTATTTCTGAACATGACTACCAGATTGGTGGTTATACTGAAAAATGGGAATCTGGAGTAAAAGACTGTGTTGTATTACACAGTTACTTCACTTCAGACTATTACCAGCTGTACTCAACTCAATTGAGTACAGACACTGGTGTTGAACATGTTACCTTCTTCATCTACAATAAAATTGTTGATGAGCCTGAAGAACATGTCCAAATTCACACAATCGACGGTTCATCCGGAGTTGTTAATCCAGTAATGGAACCAATTTATGATGAACCGACGACGACTACTAGCGTGCCTTTGTAAGCACAAGCTGATGAGTACGAACTTATGTACTCATTCGTTTCGGAAGAGACAGGTACGTTAATAGTTAATAGCGTACTTCTTTTTCTTGCTTTCGTGGTATTCTTGCTAGTTACACTAGCCATCCTTACTGCGCTTCGATTGTGTGCGTACTGCTGCAATATTGTTAACGTGAGTCTTGTAAAACCTTCTTTTTACGTTTACTCTCGTGTTAAAAATCTGAATTCTTCTAGAGTTCCTGATCTTCTGGTCTAAACGAACTAAATATTATATTAGTTTTTCTGTTTGGAACTTTAATTTTAGCCATGGCAGATTCCAACGGTACTATTACCGTTGAAGAGCTTAAAAAGCTCCTTGAACAATGGAACCTAGTAATAGGTTTCCTATTCCTTACATGGATTTGTCTTCTACAATTTGCCTATGCCAACAGGAATAGGTTTTTGTATATAATTAAGTTAATTTTTCTCTGGCTGTTATGGCCAGTAACTTTAGCTTGTTTTGTGCTTGCTGCTGTTTACAGAATAAATTGGATCACCGGTGGAATTGCTATCGCAATGGCTTGTCTTGTAGGCTTGATGTGGCTCAGCTACTTCATTGCTTCTTTCAGACTGTTTGCGCGTACGCGTTCCATGTGGTCATTCAATCCAGAAACTAACATTCTTCTCAACGTGCCACTCCATGGCACTATTCTGACCAGACCGCTTCTAGAAAGTGAACTCGTAATCGGAGCTGTGATCCTTCGTGGACATCTTCGTATTGCTGGACACCATCTAGGACGCTGTGACATCAAGGACCTGCCTAAAGAAATCACTGTTGCTACATCACGAACGCTTTCTTATTACAAATTGGGAGCTTCGCAGCGTGTAGCAGGTGACTCAGGTTTTGCTGCATACAGTCGCTACAGGATTGGCAACTATAAATTAAACACAGACCATTCCAGTAGCAGTGACAATATTGCTTTGCTTGTACAGTAAGTGACAACAGATGTTTCATCTCGTTGACTTTCAGGTTACTATAGCAGAGATATTACTAATTATTATGAGGACTTTTAAAGTTTCCATTTGGAATCTTGATTACATCATAAACCTCATAATTAAAAATTTATCTAAGTCACTAACTGAGAATAAATATTCTCAATTAGATGAAGAGCAACCAATGGAGATTGATTAAACGAACATGAAAATTATTCTTTTCTTGGCACTGATAACACTCGCTACTTGTGAGCTTTATCACTACCAAGAGTGTGTTAGAGGTACAACAGTACTTTTAAAAGAACCTTGCTCTTCTGGAACATACGAGGGCAATTCACCATTTCATCCTCTAGCTGATAACAAATTTGCACTGACTTGCTTTAGCACTCAATTTGCTTTTGCTTGTCCTGACGGCGTAAAACACGTCTATCAGTTACGTGCCAGATCAGTTTCACCTAAACTGTTCATCAGACAAGAGGAAGTTCAAGAACTTTACTCTCCAATTTTTCTTATTGTTGCGGCAATAGTGTTTATAACACTTTGCTTCACACTCAAAAGAAAGACAGAATGATTGAACTTTCATTAATTGACTTCTATTTGTGCTTTTTAGCCTTTCTGCTATTCCTTGTTTTAATTATGCTTATTATCTTTTGGTTCTCACTTGAACTGCAAGATCATAATGAAACTTGTCACGCCTAAACTAACATGAAATTTCTTGTTTTCTTAGGAATCATCACAACTGTAGCTGCATTTCACCAAGAATGTAGTTTACAGTCATGTACTCAACATCAACCATATGTAGTTGATGACCCGTGTCCTATTCACTTCTATTCTAAATGGTATATTAGAGTAGGAGCTAGAAAATCAGCACCTTTAATTGAATTGTGCGTGGATGAGGCTGGTTCTAAATCACCCATTCAGTACATCGATATCGGTAATTATACAGTTTCCTGTTTACCTTTTACAATTAATTGCCAGGAACCTAAATTGGGTAGTCTTGTAGTGCGTTGTTCGTTCTATGAAGACTTTTTAGAGTATCATGACGTTCGTGTTGTTTTAGATTTCATCTAAACGAACAAACTATAATGTCTGATAATGGACCCCAAAATCAGCGAAATGCACCCCGCATTACGTTTGGTGGACCCTCAGATTCAACTGGCAGTAACCAGAATGGAGAACGCAGTGGGGCGCGATCAAAACAACGTCGGCCCCAAGGTTTACCCAATAATACTGCGTCTTGGTTCACCGCTCTCACTCAACATGGCAAGGAAGACCTTAAATTCCCTCGAGGACAAGGCGTTCCAATTAACACCAATAGCAGTCCAGATGACCAAATTGGCTACTACCGAAGAGCTACCAGACGAATTCGTGGTGGTGACGGTAAAATGAAAGATCTCAGTCCAAGATGGT
ATTTCTACTACCTAGGAACTGGGCCAGAAGCTGGACTTCCCTATGGTGCTAACAAAGACGGCATCATATGGGTTGCAACTGAGGGAGCCTTGAATACACCAAAAGATCACATTGGCACCCGCAATCCTGCTAACAATGCTGCAATCGTGCTACAACTTCCTCAAGGAACAACATTGCCAAAAGGCTTCTACGCAGAAGGGAGCAGAGGCGGCAGTCAAGCCTCTTCTCGTTCCTCATCACGTAGTCGCAACAGTTCAAGAAATTCAACTCCAGGCAGCAGTAGGGGAATTTCTCCTGCTAGAATGGCTGGCAATGGCGGTGATGCTGCTCTTGCTTTGCTGCTGCTTGACAGATTGAACCAGCTTGAGAGCAAAATGTCTGGTAAAGGCCAACAACAACAAGGCCAAACTGTCACTAAGAAATCTGCTGCTGAGGCTTCTAAGAAGCCTCGGCAAAAACGTACTGCCACTAAAGCATACAATGTAACACAAGCTTTCGGCAGACGTGGTCCAGAACAAACCCAAGGAAATTTTGGGGACCAGGAACTAATCAGACAAGGAACTGATTACAAACATTGGCCGCAAATTGCACAATTTGCCCCCAGCGCTTCAGCGTTCTTCGGAATGTCGCGCATTGGCATGGAAGTCACACCTTCGGGAACGTGGTTGACCTACACAGGTGCCATCAAATTGGATGACAAAGATCCAAATTTCAAAGATCAAGTCATTTTGCTGAATAAGCATATTGACGCATACAAAACATTTCCACCAACAGAGCCTAAAAAGGACAAAAAGAAGAAGGCTGATGAAACTCAAGCCTTACCGCAGAGACAGAAGAAACAGCAAACTGTGACTCTTCTTCCTGCTGCAGATTTGGATGATTTCTCCAAACAATTGCAACAATCCATGAGCAGTGCTGACTCAACTCAGGCCTAAACTCATGCAGACCACACAAGGCAGATGGGCTATATAAACGTTTTCGCTTTTCCGTTTACGATATATAGTCTACTCTTGTGCAGAATGAATTCTCGTAACTACATAGCACAAGTAGATGTAGTTAACTTTAATCTCACATAGCAATCTTTAATCAGTGTGTAACATTAGGGAGGACTTGAAAGAGCCACCACATTTTCACCGAGGCCACGCGGAGTACGATCGAGTGTACAGTGAACAATGCTAGGGAGAGCTGCCTATATGGAAGAGCCCTAATGTGTAAAATTAATTTTAGTAGTGCTATCCCCATGTGATTTTAATAGC
서열번호: 27
>QPJ72086.1. S-단백질 표면 당단백질
MFVFLVLLPLVSIQCVNLTTRTQLPPAYTNSFTRGVYYPDKVFRSSVLHSTQDLFLPFFSNVTWFHAIHVSGTNGTKRFDNPVLPFNDGVYFASTEKSNIIRGWIFGTTLDSKTQSLLIVNNATNVVIKVCEFQFCNDPFLGVYYHKNNKSCMESEFRVYSSANNCTFEYVSQPFLMDLEGKQGNFKNLREFVFKNIDGYFKIYSKHTPINLVRDLPQGFSALEPLVDLPIGINITRFQTLLALHRSYLTPGDSSSGWTAGAAAYYVGYLQPRTFLLKYNENGTITDAVDCALDPLSETKCTLKSFTVEKGIYQTSNFRVQPTESIVRFPNITNLCPFGEVFNATRFASVYAWNRKRISNCVADYSVLYNSASFSTFKCYGVSPTKLNDLCFTNVYADSFVIRGDEVRQIAPGQTGKIADYNYKLPDDFTGCVIAWNSNNLDSKVGGNYNYRYRLFRKSNLKPFERDISTEIYQAGSTPCNGVEGFNCYFPLQSYGFQPTNGVGYQPYRVVVLSFELLHAPATVCGPKKSTNLVKNKCVNFNFNGLTGTGVLTESNKKFLPFQQFGRDIADTTDAVRDPQTLEILDITPCSFGGVSVITPGTNTSNQVAVLYQGVNCTEVPVAIHADQLTPTWRVYSTGSNVFQTRAGCLIGAEHVNNSYECDIPIGAGICASYQTQTNSPRRARSVASQSIIAYTMSLGAENSVAYSNNSIAIPTNFTISVTTEILPVSMTKTSVDCTMYICGDSTECSNLLLQYGSFCTQLNRALTGIAVEQDKNTQEVFAQVKQIYKTPPIKDFGGFNFSQILPDPSKPSKRSFIEDLLFNKVTLADAGFIKQYGDCLGDIAARDLICAQKFNGLTVLPPLLTDEMIAQYTSALLAGTITSGWTFGAGAALQIPFAMQMAYRFNGIGVTQNVLYENQKLIANQFNSAIGKIQDSLSSTASALGKLQDVVNQNAQALNTLVKQLSSNFGAISSVLNDILSRLDKVEAEVQIDRLITGRLQSLQTYVTQQLIRAAEIRASANLAATKMSECVLGQSKRVDFCGKGYHLMSFPQSAPHGVVFLHVTYVPAQEKNFTTAPAICHDGKAHFPREGVFVSNGTHWFVTQRNFYEPQIITTDNTFVSGNCDVVIGIVNNTVYDPLQPELDSFKEELDKYFKNHTSPDVDLGDISGINASVVNIQKEIDRLNEVAKNLNESLIDLQELGKYEQYIKWPWYIWLGFIAGLIAIVMVTIMLCCMTSCCSCLKGCCSCGSCCKFDEDDSEPVLKGVKLHYT
서열번호: 28
>뉴클레오캡시드 인단백질 [중증 급성 호흡기 증후군 코로나바이러스 2] (수탁번호: QIA98561)
MSDNGPQNQRNAPRITFGGPSDSTGSNQNGERSGARSKQRRPQGLPNNTASWFTALTQHGKEDLKFPRGQGVPINTNSSPDDQIGYYRRATRRIRGGDGKMKDLSPRWYFYYLGTGPEAGLPYGANKDGIIWVATEGALNTPKDHIGTRNPANNAAIVLQLPQGTTLPKGFYAEGSRGGSQASSRSSSRSRNSSRNSTPGSSRGTSPARMAGNGGDAALALLLLDRLNQLESKMSGKGQQQQGQTVTKKSAAEASKKPRQKRTATKAYNVTQAFGRRGPEQTQGNFGDQELIRQGTDYKHWPQIAQFAPSASAFFGMSRIGMEVTPSGTWLTYTGAIKLDDKDPNFKDQVILLNKHIDAYKTFPPTEPKKDKKKKADETQALPQRQKKQQTVTLLPAADLDDFSKQLQQSMSSADSTQA
서열번호: 29
>막 당단백질 [중증 급성 호흡기 증후군 코로나바이러스 2] (수탁번호: QIA98557)
MADSNGTITVEELKKLLEQWNLVIGFLFLTWICLLQFAYANRNRFLYIIKLIFLWLLWPVTLACFVLAAVYRINWITGGIAIAMACLVGLMWLSYFIASFRLFARTRSMWSFNPETNILLNVPLHGTILTRPLLESELVIGAVILRGHLRIAGHHLGRCDIKDLPKEITVATSRTLSYYKLGASQRVAGD SGFAAYSRYRIGNYKLNTDHSSSSDNIALLVQ
<110> Valneva Austria GmbH
<120> Inactivated SARS-CoV-2 Virus Vaccine
<130> P6132WO00
<150> EP 20168324.0
<151> 2020-04-06
<150> EP 20202118.4
<151> 2020-10-15
<150> EP 20211853.5
<151> 2020-12-04
<150> EP 21154647.8
<151> 2021-02-01
<150> EP 21160913.6
<151> 2021-03-05
<150> US 21/20313
<151> 2021-03-01
<160> 29
<170> PatentIn version 3.5
<210> 1
<211> 29903
<212> DNA
<213> SARS-CoV2
<400> 1
attaaaggtt tataccttcc caggtaacaa accaaccaac tttcgatctc ttgtagatct 60
gttctctaaa cgaactttaa aatctgtgtg gctgtcactc ggctgcatgc ttagtgcact 120
cacgcagtat aattaataac taattactgt cgttgacagg acacgagtaa ctcgtctatc 180
ttctgcaggc tgcttacggt ttcgtccgtg ttgcagccga tcatcagcac atctaggttt 240
cgtccgggtg tgaccgaaag gtaagatgga gagccttgtc cctggtttca acgagaaaac 300
acacgtccaa ctcagtttgc ctgttttaca ggttcgcgac gtgctcgtac gtggctttgg 360
agactccgtg gaggaggtct tatcagaggc acgtcaacat cttaaagatg gcacttgtgg 420
cttagtagaa gttgaaaaag gcgttttgcc tcaacttgaa cagccctatg tgttcatcaa 480
acgttcggat gctcgaactg cacctcatgg tcatgttatg gttgagctgg tagcagaact 540
cgaaggcatt cagtacggtc gtagtggtga gacacttggt gtccttgtcc ctcatgtggg 600
cgaaatacca gtggcttacc gcaaggttct tcttcgtaag aacggtaata aaggagctgg 660
tggccatagt tacggcgccg atctaaagtc atttgactta ggcgacgagc ttggcactga 720
tccttatgaa gattttcaag aaaactggaa cactaaacat agcagtggtg ttacccgtga 780
actcatgcgt gagcttaacg gaggggcata cactcgctat gtcgataaca acttctgtgg 840
ccctgatggc taccctcttg agtgcattaa agaccttcta gcacgtgctg gtaaagcttc 900
atgcactttg tccgaacaac tggactttat tgacactaag aggggtgtat actgctgccg 960
tgaacatgag catgaaattg cttggtacac ggaacgttct gaaaagagct atgaattgca 1020
gacacctttt gaaattaaat tggcaaagaa atttgacacc ttcaatgggg aatgtccaaa 1080
ttttgtattt cccttaaatt ccataatcaa gactattcaa ccaagggttg aaaagaaaaa 1140
gcttgatggc tttatgggta gaattcgatc tgtctatcca gttgcgtcac caaatgaatg 1200
caaccaaatg tgcctttcaa ctctcatgaa gtgtgatcat tgtggtgaaa cttcatggca 1260
gacgggcgat tttgttaaag ccacttgcga attttgtggc actgagaatt tgactaaaga 1320
aggtgccact acttgtggtt acttacccca aaatgctgtt gttaaaattt attgtccagc 1380
atgtcacaat tcagaagtag gacctgagca tagtcttgcc gaataccata atgaatctgg 1440
cttgaaaacc attcttcgta agggtggtcg cactattgcc tttggaggct gtgtgttctc 1500
ttatgttggt tgccataaca agtgtgccta ttgggttcca cgtgctagcg ctaacatagg 1560
ttgtaaccat acaggtgttg ttggagaagg ttccgaaggt cttaatgaca accttcttga 1620
aatactccaa aaagagaaag tcaacatcaa tattgttggt gactttaaac ttaatgaaga 1680
gatcgccatt attttggcat ctttttctgc ttccacaagt gcttttgtgg aaactgtgaa 1740
aggtttggat tataaagcat tcaaacaaat tgttgaatcc tgtggtaatt ttaaagttac 1800
aaaaggaaaa gctaaaaaag gtgcctggaa tattggtgaa cagaaatcaa tactgagtcc 1860
tctttatgca tttgcatcag aggctgctcg tgttgtacga tcaattttct cccgcactct 1920
tgaaactgct caaaattctg tgcgtgtttt acagaaggcc gctataacaa tactagatgg 1980
aatttcacag tattcactga gactcattga tgctatgatg ttcacatctg atttggctac 2040
taacaatcta gttgtaatgg cctacattac aggtggtgtt gttcagttga cttcgcagtg 2100
gctaactaac atctttggca ctgtttatga aaaactcaaa cccgtccttg attggcttga 2160
agagaagttt aaggaaggtg tagagtttct tagagacggt tgggaaattg ttaaatttat 2220
ctcaacctgt gcttgtgaaa ttgtcggtgg acaaattgtc acctgtgcaa aggaaattaa 2280
ggagagtgtt cagacattct ttaagcttgt aaataaattt ttggctttgt gtgctgactc 2340
tatcattatt ggtggagcta aacttaaagc cttgaattta ggtgaaacat ttgtcacgca 2400
ctcaaaggga ttgtacagaa agtgtgttaa atccagagaa gaaactggcc tactcatgcc 2460
tctaaaagcc ccaaaagaaa ttatcttctt agagggagaa acacttccca cagaagtgtt 2520
aacagaggaa gttgtcttga aaactggtga tttacaacca ttagaacaac ctactagtga 2580
agctgttgaa gctccattgg ttggtacacc agtttgtatt aacgggctta tgttgctcga 2640
aatcaaagac acagaaaagt actgtgccct tgcacctaat atgatggtaa caaacaatac 2700
cttcacactc aaaggcggtg caccaacaaa ggttactttt ggtgatgaca ctgtgataga 2760
agtgcaaggt tacaagagtg tgaatatcac ttttgaactt gatgaaagga ttgataaagt 2820
acttaatgag aagtgctctg cctatacagt tgaactcggt acagaagtaa atgagttcgc 2880
ctgtgttgtg gcagatgctg tcataaaaac tttgcaacca gtatctgaat tacttacacc 2940
actgggcatt gatttagatg agtggagtat ggctacatac tacttatttg atgagtctgg 3000
tgagtttaaa ttggcttcac atatgtattg ttctttctac cctccagatg aggatgaaga 3060
agaaggtgat tgtgaagaag aagagtttga gccatcaact caatatgagt atggtactga 3120
agatgattac caaggtaaac ctttggaatt tggtgccact tctgctgctc ttcaacctga 3180
agaagagcaa gaagaagatt ggttagatga tgatagtcaa caaactgttg gtcaacaaga 3240
cggcagtgag gacaatcaga caactactat tcaaacaatt gttgaggttc aacctcaatt 3300
agagatggaa cttacaccag ttgttcagac tattgaagtg aatagtttta gtggttattt 3360
aaaacttact gacaatgtat acattaaaaa tgcagacatt gtggaagaag ctaaaaaggt 3420
aaaaccaaca gtggttgtta atgcagccaa tgtttacctt aaacatggag gaggtgttgc 3480
aggagcctta aataaggcta ctaacaatgc catgcaagtt gaatctgatg attacatagc 3540
tactaatgga ccacttaaag tgggtggtag ttgtgtttta agcggacaca atcttgctaa 3600
acactgtctt catgttgtcg gcccaaatgt taacaaaggt gaagacattc aacttcttaa 3660
gagtgcttat gaaaatttta atcagcacga agttctactt gcaccattat tatcagctgg 3720
tatttttggt gctgacccta tacattcttt aagagtttgt gtagatactg ttcgcacaaa 3780
tgtctactta gctgtctttg ataaaaatct ctatgacaaa cttgtttcaa gctttttgga 3840
aatgaagagt gaaaagcaag ttgaacaaaa gatcgctgag attcctaaag aggaagttaa 3900
gccatttata actgaaagta aaccttcagt tgaacagaga aaacaagatg ataagaaaat 3960
caaagcttgt gttgaagaag ttacaacaac tctggaagaa actaagttcc tcacagaaaa 4020
cttgttactt tatattgaca ttaatggcaa tcttcatcca gattctgcca ctcttgttag 4080
tgacattgac atcactttct taaagaaaga tgctccatat atagtgggtg atgttgttca 4140
agagggtgtt ttaactgctg tggttatacc tactaaaaag gctggtggca ctactgaaat 4200
gctagcgaaa gctttgagaa aagtgccaac agacaattat ataaccactt acccgggtca 4260
gggtttaaat ggttacactg tagaggaggc aaagacagtg cttaaaaagt gtaaaagtgc 4320
cttttacatt ctaccatcta ttatctctaa tgagaagcaa gaaattcttg gaactgtttc 4380
ttggaatttg cgagaaatgc ttgcacatgc agaagaaaca cgcaaattaa tgcctgtctg 4440
tgtggaaact aaagccatag tttcaactat acagcgtaaa tataagggta ttaaaataca 4500
agagggtgtg gttgattatg gtgctagatt ttacttttac accagtaaaa caactgtagc 4560
gtcacttatc aacacactta acgatctaaa tgaaactctt gttacaatgc cacttggcta 4620
tgtaacacat ggcttaaatt tggaagaagc tgctcggtat atgagatctc tcaaagtgcc 4680
agctacagtt tctgtttctt cacctgatgc tgttacagcg tataatggtt atcttacttc 4740
ttcttctaaa acacctgaag aacattttat tgaaaccatc tcacttgctg gttcctataa 4800
agattggtcc tattctggac aatctacaca actaggtata gaatttctta agagaggtga 4860
taaaagtgta tattacacta gtaatcctac cacattccac ctagatggtg aagttatcac 4920
ctttgacaat cttaagacac ttctttcttt gagagaagtg aggactatta aggtgtttac 4980
aacagtagac aacattaacc tccacacgca agttgtggac atgtcaatga catatggaca 5040
acagtttggt ccaacttatt tggatggagc tgatgttact aaaataaaac ctcataattc 5100
acatgaaggt aaaacatttt atgttttacc taatgatgac actctacgtg ttgaggcttt 5160
tgagtactac cacacaactg atcctagttt tctgggtagg tacatgtcag cattaaatca 5220
cactaaaaag tggaaatacc cacaagttaa tggtttaact tctattaaat gggcagataa 5280
caactgttat cttgccactg cattgttaac actccaacaa atagagttga agtttaatcc 5340
acctgctcta caagatgctt attacagagc aagggctggt gaagctgcta acttttgtgc 5400
acttatctta gcctactgta ataagacagt aggtgagtta ggtgatgtta gagaaacaat 5460
gagttacttg tttcaacatg ccaatttaga ttcttgcaaa agagtcttga acgtggtgtg 5520
taaaacttgt ggacaacagc agacaaccct taagggtgta gaagctgtta tgtacatggg 5580
cacactttct tatgaacaat ttaagaaagg tgttcagata ccttgtacgt gtggtaaaca 5640
agctacaaaa tatctagtac aacaggagtc accttttgtt atgatgtcag caccacctgc 5700
tcagtatgaa cttaagcatg gtacatttac ttgtgctagt gagtacactg gtaattacca 5760
gtgtggtcac tataaacata taacttctaa agaaactttg tattgcatag acggtgcttt 5820
acttacaaag tcctcagaat acaaaggtcc tattacggat gttttctaca aagaaaacag 5880
ttacacaaca accataaaac cagttactta taaattggat ggtgttgttt gtacagaaat 5940
tgaccctaag ttggacaatt attataagaa agacaattct tatttcacag agcaaccaat 6000
tgatcttgta ccaaaccaac catatccaaa cgcaagcttc gataatttta agtttgtatg 6060
tgataatatc aaatttgctg atgatttaaa ccagttaact ggttataaga aacctgcttc 6120
aagagagctt aaagttacat ttttccctga cttaaatggt gatgtggtgg ctattgatta 6180
taaacactac acaccctctt ttaagaaagg agctaaattg ttacataaac ctattgtttg 6240
gcatgttaac aatgcaacta ataaagccac gtataaacca aatacctggt gtatacgttg 6300
tctttggagc acaaaaccag ttgaaacatc aaattcgttt gatgtactga agtcagagga 6360
cgcgcaggga atggataatc ttgcctgcga agatctaaaa ccagtctctg aagaagtagt 6420
ggaaaatcct accatacaga aagacgttct tgagtgtaat gtgaaaacta ccgaagttgt 6480
aggagacatt atacttaaac cagcaaataa tagtttaaaa attacagaag aggttggcca 6540
cacagatcta atggctgctt atgtagacaa ttctagtctt actattaaga aacctaatga 6600
attatctaga gtattaggtt tgaaaaccct tgctactcat ggtttagctg ctgttaatag 6660
tgtcccttgg gatactatag ctaattatgc taagcctttt cttaacaaag ttgttagtac 6720
aactactaac atagttacac ggtgtttaaa ccgtgtttgt actaattata tgccttattt 6780
ctttacttta ttgctacaat tgtgtacttt tactagaagt acaaattcta gaattaaagc 6840
atctatgccg actactatag caaagaatac tgttaagagt gtcggtaaat tttgtctaga 6900
ggcttcattt aattatttga agtcacctaa tttttctaaa ctgataaata ttataatttg 6960
gtttttacta ttaagtgttt gcctaggttc tttaatctac tcaaccgctg ctttaggtgt 7020
tttaatgtct aatttaggca tgccttctta ctgtactggt tacagagaag gctatttgaa 7080
ctctactaat gtcactattg caacctactg tactggttct ataccttgta gtgtttgtct 7140
tagtggttta gattctttag acacctatcc ttctttagaa actatacaaa ttaccatttc 7200
atcttttaaa tgggatttaa ctgcttttgg cttagttgca gagtggtttt tggcatatat 7260
tcttttcact aggtttttct atgtacttgg attggctgca atcatgcaat tgtttttcag 7320
ctattttgca gtacatttta ttagtaattc ttggcttatg tggttaataa ttaatcttgt 7380
acaaatggcc ccgatttcag ctatggttag aatgtacatc ttctttgcat cattttatta 7440
tgtatggaaa agttatgtgc atgttgtaga cggttgtaat tcatcaactt gtatgatgtg 7500
ttacaaacgt aatagagcaa caagagtcga atgtacaact attgttaatg gtgttagaag 7560
gtccttttat gtctatgcta atggaggtaa aggcttttgc aaactacaca attggaattg 7620
tgttaattgt gatacattct gtgctggtag tacatttatt agtgatgaag ttgcgagaga 7680
cttgtcacta cagtttaaaa gaccaataaa tcctactgac cagtcttctt acatcgttga 7740
tagtgttaca gtgaagaatg gttccatcca tctttacttt gataaagctg gtcaaaagac 7800
ttatgaaaga cattctctct ctcattttgt taacttagac aacctgagag ctaataacac 7860
taaaggttca ttgcctatta atgttatagt ttttgatggt aaatcaaaat gtgaagaatc 7920
atctgcaaaa tcagcgtctg tttactacag tcagcttatg tgtcaaccta tactgttact 7980
agatcaggca ttagtgtctg atgttggtga tagtgcggaa gttgcagtta aaatgtttga 8040
tgcttacgtt aatacgtttt catcaacttt taacgtacca atggaaaaac tcaaaacact 8100
agttgcaact gcagaagctg aacttgcaaa gaatgtgtcc ttagacaatg tcttatctac 8160
ttttatttca gcagctcggc aagggtttgt tgattcagat gtagaaacta aagatgttgt 8220
tgaatgtctt aaattgtcac atcaatctga catagaagtt actggcgata gttgtaataa 8280
ctatatgctc acctataaca aagttgaaaa catgacaccc cgtgaccttg gtgcttgtat 8340
tgactgtagt gcgcgtcata ttaatgcgca ggtagcaaaa agtcacaaca ttgctttgat 8400
atggaacgtt aaagatttca tgtcattgtc tgaacaacta cgaaaacaaa tacgtagtgc 8460
tgctaaaaag aataacttac cttttaagtt gacatgtgca actactagac aagttgttaa 8520
tgttgtaaca acaaagatag cacttaaggg tggtaaaatt gttaataatt ggttgaagca 8580
gttaattaaa gttacacttg tgttcctttt tgttgctgct attttctatt taataacacc 8640
tgttcatgtc atgtctaaac atactgactt ttcaagtgaa atcataggat acaaggctat 8700
tgatggtggt gtcactcgtg acatagcatc tacagatact tgttttgcta acaaacatgc 8760
tgattttgac acatggttta gccagcgtgg tggtagttat actaatgaca aagcttgccc 8820
attgattgct gcagtcataa caagagaagt gggttttgtc gtgcctggtt tgcctggcac 8880
gatattacgc acaactaatg gtgacttttt gcatttctta cctagagttt ttagtgcagt 8940
tggtaacatc tgttacacac catcaaaact tatagagtac actgactttg caacatcagc 9000
ttgtgttttg gctgctgaat gtacaatttt taaagatgct tctggtaagc cagtaccata 9060
ttgttatgat accaatgtac tagaaggttc tgttgcttat gaaagtttac gccctgacac 9120
acgttatgtg ctcatggatg gctctattat tcaatttcct aacacctacc ttgaaggttc 9180
tgttagagtg gtaacaactt ttgattctga gtactgtagg cacggcactt gtgaaagatc 9240
agaagctggt gtttgtgtat ctactagtgg tagatgggta cttaacaatg attattacag 9300
atctttacca ggagttttct gtggtgtaga tgctgtaaat ttacttacta atatgtttac 9360
accactaatt caacctattg gtgctttgga catatcagca tctatagtag ctggtggtat 9420
tgtagctatc gtagtaacat gccttgccta ctattttatg aggtttagaa gagcttttgg 9480
tgaatacagt catgtagttg cctttaatac tttactattc cttatgtcat tcactgtact 9540
ctgtttaaca ccagtttact cattcttacc tggtgtttat tctgttattt acttgtactt 9600
gacattttat cttactaatg atgtttcttt tttagcacat attcagtgga tggttatgtt 9660
cacaccttta gtacctttct ggataacaat tgcttatatc atttgtattt ccacaaagca 9720
tttctattgg ttctttagta attacctaaa gagacgtgta gtctttaatg gtgtttcctt 9780
tagtactttt gaagaagctg cgctgtgcac ctttttgtta aataaagaaa tgtatctaaa 9840
gttgcgtagt gatgtgctat tacctcttac gcaatataat agatacttag ctctttataa 9900
taagtacaag tattttagtg gagcaatgga tacaactagc tacagagaag ctgcttgttg 9960
tcatctcgca aaggctctca atgacttcag taactcaggt tctgatgttc tttaccaacc 10020
accacaaacc tctatcacct cagctgtttt gcagagtggt tttagaaaaa tggcattccc 10080
atctggtaaa gttgagggtt gtatggtaca agtaacttgt ggtacaacta cacttaacgg 10140
tctttggctt gatgacgtag tttactgtcc aagacatgtg atctgcacct ctgaagacat 10200
gcttaaccct aattatgaag atttactcat tcgtaagtct aatcataatt tcttggtaca 10260
ggctggtaat gttcaactca gggttattgg acattctatg caaaattgtg tacttaagct 10320
taaggttgat acagccaatc ctaagacacc taagtataag tttgttcgca ttcaaccagg 10380
acagactttt tcagtgttag cttgttacaa tggttcacca tctggtgttt accaatgtgc 10440
tatgaggccc aatttcacta ttaagggttc attccttaat ggttcatgtg gtagtgttgg 10500
ttttaacata gattatgact gtgtctcttt ttgttacatg caccatatgg aattaccaac 10560
tggagttcat gctggcacag acttagaagg taacttttat ggaccttttg ttgacaggca 10620
aacagcacaa gcagctggta cggacacaac tattacagtt aatgttttag cttggttgta 10680
cgctgctgtt ataaatggag acaggtggtt tctcaatcga tttaccacaa ctcttaatga 10740
ctttaacctt gtggctatga agtacaatta tgaacctcta acacaagacc atgttgacat 10800
actaggacct ctttctgctc aaactggaat tgccgtttta gatatgtgtg cttcattaaa 10860
agaattactg caaaatggta tgaatggacg taccatattg ggtagtgctt tattagaaga 10920
tgaatttaca ccttttgatg ttgttagaca atgctcaggt gttactttcc aaagtgcagt 10980
gaaaagaaca atcaagggta cacaccactg gttgttactc acaattttga cttcactttt 11040
agttttagtc cagagtactc aatggtcttt gttctttttt ttgtatgaaa atgccttttt 11100
accttttgct atgggtatta ttgctatgtc tgcttttgca atgatgtttg tcaaacataa 11160
gcatgcattt ctctgtttgt ttttgttacc ttctcttgcc actgtagctt attttaatat 11220
ggtctatatg cctgctagtt gggtgatgcg tattatgaca tggttggata tggttgatac 11280
tagtttgtct ggttttaagc taaaagactg tgttatgtat gcatcagctg tagtgttact 11340
aatccttatg acagcaagaa ctgtgtatga tgatggtgct aggagagtgt ggacacttat 11400
gaatgtcttg acactcgttt ataaagttta ttatggtaat gctttagatc aagccatttc 11460
catgtgggct cttataatct ctgttacttc taactactca ggtgtagtta caactgtcat 11520
gtttttggcc agaggtattg tttttatgtg tgttgagtat tgccctattt tcttcataac 11580
tggtaataca cttcagtgta taatgctagt ttattgtttc ttaggctatt tttgtacttg 11640
ttactttggc ctcttttgtt tactcaaccg ctactttaga ctgactcttg gtgtttatga 11700
ttacttagtt tctacacagg agtttagata tatgaattca cagggactac tcccacccaa 11760
gaatagcata gatgccttca aactcaacat taaattgttg ggtgttggtg gcaaaccttg 11820
tatcaaagta gccactgtac agtctaaaat gtcagatgta aagtgcacat cagtagtctt 11880
actctcagtt ttgcaacaac tcagagtaga atcatcatct aaattgtggg ctcaatgtgt 11940
ccagttacac aatgacattc tcttagctaa agatactact gaagcctttg aaaaaatggt 12000
ttcactactt tctgttttgc tttccatgca gggtgctgta gacataaaca agctttgtga 12060
agaaatgctg gacaacaggg caaccttaca agctatagcc tcagagttta gttcccttcc 12120
atcatatgca gcttttgcta ctgctcaaga agcttatgag caggctgttg ctaatggtga 12180
ttctgaagtt gttcttaaaa agttgaagaa gtctttgaat gtggctaaat ctgaatttga 12240
ccgtgatgca gccatgcaac gtaagttgga aaagatggct gatcaagcta tgacccaaat 12300
gtataaacag gctagatctg aggacaagag ggcaaaagtt actagtgcta tgcagacaat 12360
gcttttcact atgcttagaa agttggataa tgatgcactc aacaacatta tcaacaatgc 12420
aagagatggt tgtgttccct tgaacataat acctcttaca acagcagcca aactaatggt 12480
tgtcatacca gactataaca catataaaaa tacgtgtgat ggtacaacat ttacttatgc 12540
atcagcattg tgggaaatcc aacaggttgt agatgcagat agtaaaattg ttcaacttag 12600
tgaaattagt atggacaatt cacctaattt agcatggcct cttattgtaa cagctttaag 12660
ggccaattct gctgtcaaat tacagaataa tgagcttagt cctgttgcac tacgacagat 12720
gtcttgtgct gccggtacta cacaaactgc ttgcactgat gacaatgcgt tagcttacta 12780
caacacaaca aagggaggta ggtttgtact tgcactgtta tccgatttac aggatttgaa 12840
atgggctaga ttccctaaga gtgatggaac tggtactatc tatacagaac tggaaccacc 12900
ttgtaggttt gttacagaca cacctaaagg tcctaaagtg aagtatttat actttattaa 12960
aggattaaac aacctaaata gaggtatggt acttggtagt ttagctgcca cagtacgtct 13020
acaagctggt aatgcaacag aagtgcctgc caattcaact gtattatctt tctgtgcttt 13080
tgctgtagat gctgctaaag cttacaaaga ttatctagct agtgggggac aaccaatcac 13140
taattgtgtt aagatgttgt gtacacacac tggtactggt caggcaataa cagttacacc 13200
ggaagccaat atggatcaag aatcctttgg tggtgcatcg tgttgtctgt actgccgttg 13260
ccacatagat catccaaatc ctaaaggatt ttgtgactta aaaggtaagt atgtacaaat 13320
acctacaact tgtgctaatg accctgtggg ttttacactt aaaaacacag tctgtaccgt 13380
ctgcggtatg tggaaaggtt atggctgtag ttgtgatcaa ctccgcgaac ccatgcttca 13440
gtcagctgat gcacaatcgt ttttaaacgg gtttgcggtg taagtgcagc ccgtcttaca 13500
ccgtgcggca caggcactag tactgatgtc gtatacaggg cttttgacat ctacaatgat 13560
aaagtagctg gttttgctaa attcctaaaa actaattgtt gtcgcttcca agaaaaggac 13620
gaagatgaca atttaattga ttcttacttt gtagttaaga gacacacttt ctctaactac 13680
caacatgaag aaacaattta taatttactt aaggattgtc cagctgttgc taaacatgac 13740
ttctttaagt ttagaataga cggtgacatg gtaccacata tatcacgtca acgtcttact 13800
aaatacacaa tggcagacct cgtctatgct ttaaggcatt ttgatgaagg taattgtgac 13860
acattaaaag aaatacttgt cacatacaat tgttgtgatg atgattattt caataaaaag 13920
gactggtatg attttgtaga aaacccagat atattacgcg tatacgccaa cttaggtgaa 13980
cgtgtacgcc aagctttgtt aaaaacagta caattctgtg atgccatgcg aaatgctggt 14040
attgttggtg tactgacatt agataatcaa gatctcaatg gtaactggta tgatttcggt 14100
gatttcatac aaaccacgcc aggtagtgga gttcctgttg tagattctta ttattcattg 14160
ttaatgccta tattaacctt gaccagggct ttaactgcag agtcacatgt tgacactgac 14220
ttaacaaagc cttacattaa gtgggatttg ttaaaatatg acttcacgga agagaggtta 14280
aaactctttg accgttattt taaatattgg gatcagacat accacccaaa ttgtgttaac 14340
tgtttggatg acagatgcat tctgcattgt gcaaacttta atgttttatt ctctacagtg 14400
ttcccaccta caagttttgg accactagtg agaaaaatat ttgttgatgg tgttccattt 14460
gtagtttcaa ctggatacca cttcagagag ctaggtgttg tacataatca ggatgtaaac 14520
ttacatagct ctagacttag ttttaaggaa ttacttgtgt atgctgctga ccctgctatg 14580
cacgctgctt ctggtaatct attactagat aaacgcacta cgtgcttttc agtagctgca 14640
cttactaaca atgttgcttt tcaaactgtc aaacccggta attttaacaa agacttctat 14700
gactttgctg tgtctaaggg tttctttaag gaaggaagtt ctgttgaatt aaaacacttc 14760
ttctttgctc aggatggtaa tgctgctatc agcgattatg actactatcg ttataatcta 14820
ccaacaatgt gtgatatcag acaactacta tttgtagttg aagttgttga taagtacttt 14880
gattgttacg atggtggctg tattaatgct aaccaagtca tcgtcaacaa cctagacaaa 14940
tcagctggtt ttccatttaa taaatggggt aaggctagac tttattatga ttcaatgagt 15000
tatgaggatc aagatgcact tttcgcatat acaaaacgta atgtcatccc tactataact 15060
caaatgaatc ttaagtatgc cattagtgca aagaatagag ctcgcaccgt agctggtgtc 15120
tctatctgta gtactatgac caatagacag tttcatcaaa aattattgaa atcaatagcc 15180
gccactagag gagctactgt agtaattgga acaagcaaat tctatggtgg ttggcacaac 15240
atgttaaaaa ctgtttatag tgatgtagaa aaccctcacc ttatgggttg ggattatcct 15300
aaatgtgata gagccatgcc taacatgctt agaattatgg cctcacttgt tcttgctcgc 15360
aaacatacaa cgtgttgtag cttgtcacac cgtttctata gattagctaa tgagtgtgct 15420
caagtattga gtgaaatggt catgtgtggc ggttcactat atgttaaacc aggtggaacc 15480
tcatcaggag atgccacaac tgcttatgct aatagtgttt ttaacatttg tcaagctgtc 15540
acggccaatg ttaatgcact tttatctact gatggtaaca aaattgccga taagtatgtc 15600
cgcaatttac aacacagact ttatgagtgt ctctatagaa atagagatgt tgacacagac 15660
tttgtgaatg agttttacgc atatttgcgt aaacatttct caatgatgat actctctgac 15720
gatgctgttg tgtgtttcaa tagcacttat gcatctcaag gtctagtggc tagcataaag 15780
aactttaagt cagttcttta ttatcaaaac aatgttttta tgtctgaagc aaaatgttgg 15840
actgagactg accttactaa aggacctcat gaattttgct ctcaacatac aatgctagtt 15900
aaacagggtg atgattatgt gtaccttcct tacccagatc catcaagaat cctaggggcc 15960
ggctgttttg tagatgatat cgtaaaaaca gatggtacac ttatgattga acggttcgtg 16020
tctttagcta tagatgctta cccacttact aaacatccta atcaggagta tgctgatgtc 16080
tttcatttgt acttacaata cataagaaag ctacatgatg agttaacagg acacatgtta 16140
gacatgtatt ctgttatgct tactaatgat aacacttcaa ggtattggga acctgagttt 16200
tatgaggcta tgtacacacc gcatacagtc ttacaggctg ttggggcttg tgttctttgc 16260
aattcacaga cttcattaag atgtggtgct tgcatacgta gaccattctt atgttgtaaa 16320
tgctgttacg accatgtcat atcaacatca cataaattag tcttgtctgt taatccgtat 16380
gtttgcaatg ctccaggttg tgatgtcaca gatgtgactc aactttactt aggaggtatg 16440
agctattatt gtaaatcaca taaaccaccc attagttttc cattgtgtgc taatggacaa 16500
gtttttggtt tatataaaaa tacatgtgtt ggtagcgata atgttactga ctttaatgca 16560
attgcaacat gtgactggac aaatgctggt gattacattt tagctaacac ctgtactgaa 16620
agactcaagc tttttgcagc agaaacgctc aaagctactg aggagacatt taaactgtct 16680
tatggtattg ctactgtacg tgaagtgctg tctgacagag aattacatct ttcatgggaa 16740
gttggtaaac ctagaccacc acttaaccga aattatgtct ttactggtta tcgtgtaact 16800
aaaaacagta aagtacaaat aggagagtac acctttgaaa aaggtgacta tggtgatgct 16860
gttgtttacc gaggtacaac aacttacaaa ttaaatgttg gtgattattt tgtgctgaca 16920
tcacatacag taatgccatt aagtgcacct acactagtgc cacaagagca ctatgttaga 16980
attactggct tatacccaac actcaatatc tcagatgagt tttctagcaa tgttgcaaat 17040
tatcaaaagg ttggtatgca aaagtattct acactccagg gaccacctgg tactggtaag 17100
agtcattttg ctattggcct agctctctac tacccttctg ctcgcatagt gtatacagct 17160
tgctctcatg ccgctgttga tgcactatgt gagaaggcat taaaatattt gcctatagat 17220
aaatgtagta gaattatacc tgcacgtgct cgtgtagagt gttttgataa attcaaagtg 17280
aattcaacat tagaacagta tgtcttttgt actgtaaatg cattgcctga gacgacagca 17340
gatatagttg tctttgatga aatttcaatg gccacaaatt atgatttgag tgttgtcaat 17400
gccagattac gtgctaagca ctatgtgtac attggcgacc ctgctcaatt acctgcacca 17460
cgcacattgc taactaaggg cacactagaa ccagaatatt tcaattcagt gtgtagactt 17520
atgaaaacta taggtccaga catgttcctc ggaacttgtc ggcgttgtcc tgctgaaatt 17580
gttgacactg tgagtgcttt ggtttatgat aataagctta aagcacataa agacaaatca 17640
gctcaatgct ttaaaatgtt ttataagggt gttatcacgc atgatgtttc atctgcaatt 17700
aacaggccac aaataggcgt ggtaagagaa ttccttacac gtaaccctgc ttggagaaaa 17760
gctgtcttta tttcacctta taattcacag aatgctgtag cctcaaagat tttgggacta 17820
ccaactcaaa ctgttgattc atcacagggc tcagaatatg actatgtcat attcactcaa 17880
accactgaaa cagctcactc ttgtaatgta aacagattta atgttgctat taccagagca 17940
aaagtaggca tactttgcat aatgtctgat agagaccttt atgacaagtt gcaatttaca 18000
agtcttgaaa ttccacgtag gaatgtggca actttacaag ctgaaaatgt aacaggactc 18060
tttaaagatt gtagtaaggt aatcactggg ttacatccta cacaggcacc tacacacctc 18120
agtgttgaca ctaaattcaa aactgaaggt ttatgtgttg acatacctgg catacctaag 18180
gacatgacct atagaagact catctctatg atgggtttta aaatgaatta tcaagttaat 18240
ggttacccta acatgtttat cacccgcgaa gaagctataa gacatgtacg tgcatggatt 18300
ggcttcgatg tcgaggggtg tcatgctact agagaagctg ttggtaccaa tttaccttta 18360
cagctaggtt tttctacagg tgttaaccta gttgctgtac ctacaggtta tgttgataca 18420
cctaataata cagatttttc cagagttagt gctaaaccac cgcctggaga tcaatttaaa 18480
cacctcatac cacttatgta caaaggactt ccttggaatg tagtgcgtat aaagattgta 18540
caaatgttaa gtgacacact taaaaatctc tctgacagag tcgtatttgt cttatgggca 18600
catggctttg agttgacatc tatgaagtat tttgtgaaaa taggacctga gcgcacctgt 18660
tgtctatgtg atagacgtgc cacatgcttt tccactgctt cagacactta tgcctgttgg 18720
catcattcta ttggatttga ttacgtctat aatccgttta tgattgatgt tcaacaatgg 18780
ggttttacag gtaacctaca aagcaaccat gatctgtatt gtcaagtcca tggtaatgca 18840
catgtagcta gttgtgatgc aatcatgact aggtgtctag ctgtccacga gtgctttgtt 18900
aagcgtgttg actggactat tgaatatcct ataattggtg atgaactgaa gattaatgcg 18960
gcttgtagaa aggttcaaca catggttgtt aaagctgcat tattagcaga caaattccca 19020
gttcttcacg acattggtaa ccctaaagct attaagtgtg tacctcaagc tgatgtagaa 19080
tggaagttct atgatgcaca gccttgtagt gacaaagctt ataaaataga agaattattc 19140
tattcttatg ccacacattc tgacaaattc acagatggtg tatgcctatt ttggaattgc 19200
aatgtcgata gatatcctgc taattccatt gtttgtagat ttgacactag agtgctatct 19260
aaccttaact tgcctggttg tgatggtggc agtttgtatg taaataaaca tgcattccac 19320
acaccagctt ttgataaaag tgcttttgtt aatttaaaac aattaccatt tttctattac 19380
tctgacagtc catgtgagtc tcatggaaaa caagtagtgt cagatataga ttatgtacca 19440
ctaaagtctg ctacgtgtat aacacgttgc aatttaggtg gtgctgtctg tagacatcat 19500
gctaatgagt acagattgta tctcgatgct tataacatga tgatctcagc tggctttagc 19560
ttgtgggttt acaaacaatt tgatacttat aacctctgga acacttttac aagacttcag 19620
agtttagaaa atgtggcttt taatgttgta aataagggac actttgatgg acaacagggt 19680
gaagtaccag tttctatcat taataacact gtttacacaa aagttgatgg tgttgatgta 19740
gaattgtttg aaaataaaac aacattacct gttaatgtag catttgagct ttgggctaag 19800
cgcaacatta aaccagtacc agaggtgaaa atactcaata atttgggtgt ggacattgct 19860
gctaatactg tgatctggga ctacaaaaga gatgctccag cacatatatc tactattggt 19920
gtttgttcta tgactgacat agccaagaaa ccaactgaaa cgatttgtgc accactcact 19980
gtcttttttg atggtagagt tgatggtcaa gtagacttat ttagaaatgc ccgtaatggt 20040
gttcttatta cagaaggtag tgttaaaggt ttacaaccat ctgtaggtcc caaacaagct 20100
agtcttaatg gagtcacatt aattggagaa gccgtaaaaa cacagttcaa ttattataag 20160
aaagttgatg gtgttgtcca acaattacct gaaacttact ttactcagag tagaaattta 20220
caagaattta aacccaggag tcaaatggaa attgatttct tagaattagc tatggatgaa 20280
ttcattgaac ggtataaatt agaaggctat gccttcgaac atatcgttta tggagatttt 20340
agtcatagtc agttaggtgg tttacatcta ctgattggac tagctaaacg ttttaaggaa 20400
tcaccttttg aattagaaga ttttattcct atggacagta cagttaaaaa ctatttcata 20460
acagatgcgc aaacaggttc atctaagtgt gtgtgttctg ttattgattt attacttgat 20520
gattttgttg aaataataaa atcccaagat ttatctgtag tttctaaggt tgtcaaagtg 20580
actattgact atacagaaat ttcatttatg ctttggtgta aagatggcca tgtagaaaca 20640
ttttacccaa aattacaatc tagtcaagcg tggcaaccgg gtgttgctat gcctaatctt 20700
tacaaaatgc aaagaatgct attagaaaag tgtgaccttc aaaattatgg tgatagtgca 20760
acattaccta aaggcataat gatgaatgtc gcaaaatata ctcaactgtg tcaatattta 20820
aacacattaa cattagctgt accctataat atgagagtta tacattttgg tgctggttct 20880
gataaaggag ttgcaccagg tacagctgtt ttaagacagt ggttgcctac gggtacgctg 20940
cttgtcgatt cagatcttaa tgactttgtc tctgatgcag attcaacttt gattggtgat 21000
tgtgcaactg tacatacagc taataaatgg gatctcatta ttagtgatat gtacgaccct 21060
aagactaaaa atgttacaaa agaaaatgac tctaaagagg gttttttcac ttacatttgt 21120
gggtttatac aacaaaagct agctcttgga ggttccgtgg ctataaagat aacagaacat 21180
tcttggaatg ctgatcttta taagctcatg ggacacttcg catggtggac agcctttgtt 21240
actaatgtga atgcgtcatc atctgaagca tttttaattg gatgtaatta tcttggcaaa 21300
ccacgcgaac aaatagatgg ttatgtcatg catgcaaatt acatattttg gaggaataca 21360
aatccaattc agttgtcttc ctattcttta tttgacatga gtaaatttcc ccttaaatta 21420
aggggtactg ctgttatgtc tttaaaagaa ggtcaaatca atgatatgat tttatctctt 21480
cttagtaaag gtagacttat aattagagaa aacaacagag ttgttatttc tagtgatgtt 21540
cttgttaaca actaaacgaa caatgtttgt ttttcttgtt ttattgccac tagtctctag 21600
tcagtgtgtt aatcttacaa ccagaactca attaccccct gcatacacta attctttcac 21660
acgtggtgtt tattaccctg acaaagtttt cagatcctca gttttacatt caactcagga 21720
cttgttctta cctttctttt ccaatgttac ttggttccat gctatacatg tctctgggac 21780
caatggtact aagaggtttg ataaccctgt cctaccattt aatgatggtg tttattttgc 21840
ttccactgag aagtctaaca taataagagg ctggattttt ggtactactt tagattcgaa 21900
gacccagtcc ctacttattg ttaataacgc tactaatgtt gttattaaag tctgtgaatt 21960
tcaattttgt aatgatccat ttttgggtgt ttattaccac aaaaacaaca aaagttggat 22020
ggaaagtgag ttcagagttt attctagtgc gaataattgc acttttgaat atgtctctca 22080
gccttttctt atggaccttg aaggaaaaca gggtaatttc aaaaatctta gggaatttgt 22140
gtttaagaat attgatggtt attttaaaat atattctaag cacacgccta ttaatttagt 22200
gcgtgatctc cctcagggtt tttcggcttt agaaccattg gtagatttgc caataggtat 22260
taacatcact aggtttcaaa ctttacttgc tttacataga agttatttga ctcctggtga 22320
ttcttcttca ggttggacag ctggtgctgc agcttattat gtgggttatc ttcaacctag 22380
gacttttcta ttaaaatata atgaaaatgg aaccattaca gatgctgtag actgtgcact 22440
tgaccctctc tcagaaacaa agtgtacgtt gaaatccttc actgtagaaa aaggaatcta 22500
tcaaacttct aactttagag tccaaccaac agaatctatt gttagatttc ctaatattac 22560
aaacttgtgc ccttttggtg aagtttttaa cgccaccaga tttgcatctg tttatgcttg 22620
gaacaggaag agaatcagca actgtgttgc tgattattct gtcctatata attccgcatc 22680
attttccact tttaagtgtt atggagtgtc tcctactaaa ttaaatgatc tctgctttac 22740
taatgtctat gcagattcat ttgtaattag aggtgatgaa gtcagacaaa tcgctccagg 22800
gcaaactgga aagattgctg attataatta taaattacca gatgatttta caggctgcgt 22860
tatagcttgg aattctaaca atcttgattc taaggttggt ggtaattata attacctgta 22920
tagattgttt aggaagtcta atctcaaacc ttttgagaga gatatttcaa ctgaaatcta 22980
tcaggccggt agcacacctt gtaatggtgt tgaaggtttt aattgttact ttcctttaca 23040
atcatatggt ttccaaccca ctaatggtgt tggttaccaa ccatacagag tagtagtact 23100
ttcttttgaa cttctacatg caccagcaac tgtttgtgga cctaaaaagt ctactaattt 23160
ggttaaaaac aaatgtgtca atttcaactt caatggttta acaggcacag gtgttcttac 23220
tgagtctaac aaaaagtttc tgcctttcca acaatttggc agagacattg ctgacactac 23280
tgatgctgtc cgtgatccac agacacttga gattcttgac attacaccat gttcttttgg 23340
tggtgtcagt gttataacac caggaacaaa tacttctaac caggttgctg ttctttatca 23400
ggatgttaac tgcacagaag tccctgttgc tattcatgca gatcaactta ctcctacttg 23460
gcgtgtttat tctacaggtt ctaatgtttt tcaaacacgt gcaggctgtt taataggggc 23520
tgaacatgtc aacaactcat atgagtgtga catacccatt ggtgcaggta tatgcgctag 23580
ttatcagact cagactaatt ctcctcggcg ggcacgtagt gtagctagtc aatccatcat 23640
tgcctacact atgtcacttg gtgcagaaaa ttcagttgct tactctaata actctattgc 23700
catacccaca aattttacta ttagtgttac cacagaaatt ctaccagtgt ctatgaccaa 23760
gacatcagta gattgtacaa tgtacatttg tggtgattca actgaatgca gcaatctttt 23820
gttgcaatat ggcagttttt gtacacaatt aaaccgtgct ttaactggaa tagctgttga 23880
acaagacaaa aacacccaag aagtttttgc acaagtcaaa caaatttaca aaacaccacc 23940
aattaaagat tttggtggtt ttaatttttc acaaatatta ccagatccat caaaaccaag 24000
caagaggtca tttattgaag atctactttt caacaaagtg acacttgcag atgctggctt 24060
catcaaacaa tatggtgatt gccttggtga tattgctgct agagacctca tttgtgcaca 24120
aaagtttaac ggccttactg ttttgccacc tttgctcaca gatgaaatga ttgctcaata 24180
cacttctgca ctgttagcgg gtacaatcac ttctggttgg acctttggtg caggtgctgc 24240
attacaaata ccatttgcta tgcaaatggc ttataggttt aatggtattg gagttacaca 24300
gaatgttctc tatgagaacc aaaaattgat tgccaaccaa tttaatagtg ctattggcaa 24360
aattcaagac tcactttctt ccacagcaag tgcacttgga aaacttcaag atgtggtcaa 24420
ccaaaatgca caagctttaa acacgcttgt taaacaactt agctccaatt ttggtgcaat 24480
ttcaagtgtt ttaaatgata tcctttcacg tcttgacaaa gttgaggctg aagtgcaaat 24540
tgataggttg atcacaggca gacttcaaag tttgcagaca tatgtgactc aacaattaat 24600
tagagctgca gaaatcagag cttctgctaa tcttgctgct actaaaatgt cagagtgtgt 24660
acttggacaa tcaaaaagag ttgatttttg tggaaagggc tatcatctta tgtccttccc 24720
tcagtcagca cctcatggtg tagtcttctt gcatgtgact tatgtccctg cacaagaaaa 24780
gaacttcaca actgctcctg ccatttgtca tgatggaaaa gcacactttc ctcgtgaagg 24840
tgtctttgtt tcaaatggca cacactggtt tgtaacacaa aggaattttt atgaaccaca 24900
aatcattact acagacaaca catttgtgtc tggtaactgt gatgttgtaa taggaattgt 24960
caacaacaca gtttatgatc ctttgcaacc tgaattagac tcattcaagg aggagttaga 25020
taaatatttt aagaatcata catcaccaga tgttgattta ggtgacatct ctggcattaa 25080
tgcttcagtt gtaaacattc aaaaagaaat tgaccgcctc aatgaggttg ccaagaattt 25140
aaatgaatct ctcatcgatc tccaagaact tggaaagtat gagcagtata taaaatggcc 25200
atggtacatt tggctaggtt ttatagctgg cttgattgcc atagtaatgg tgacaattat 25260
gctttgctgt atgaccagtt gctgtagttg tctcaagggc tgttgttctt gtggatcctg 25320
ctgcaaattt gatgaagacg actctgagcc agtgctcaaa ggagtcaaat tacattacac 25380
ataaacgaac ttatggattt gtttatgaga atcttcacaa ttggaactgt aactttgaag 25440
caaggtgaaa tcaaggatgc tactccttca gattttgttc gcgctactgc aacgataccg 25500
atacaagcct cactcccttt cggatggctt attgttggcg ttgcacttct tgctgttttt 25560
cagagcgctt ccaaaatcat aaccctcaaa aagagatggc aactagcact ctccaagggt 25620
gttcactttg tttgcaactt gctgttgttg tttgtaacag tttactcaca ccttttgctc 25680
gttgctgctg gccttgaagc cccttttctc tatctttatg ctttagtcta cttcttgcag 25740
agtataaact ttgtaagaat aataatgagg ctttggcttt gctggaaatg ccgttccaaa 25800
aacccattac tttatgatgc caactatttt ctttgctggc atactaattg ttacgactat 25860
tgtatacctt acaatagtgt aacttcttca attgtcatta cttcaggtga tggcacaaca 25920
agtcctattt ctgaacatga ctaccagatt ggtggttata ctgaaaaatg ggaatctgga 25980
gtaaaagact gtgttgtatt acacagttac ttcacttcag actattacca gctgtactca 26040
actcaattga gtacagacac tggtgttgaa catgttacct tcttcatcta caataaaatt 26100
gttgatgagc ctgaagaaca tgtccaaatt cacacaatcg acggttcatc cggagttgtt 26160
aatccagtaa tggaaccaat ttatgatgaa ccgacgacga ctactagcgt gcctttgtaa 26220
gcacaagctg atgagtacga acttatgtac tcattcgttt cggaagagac aggtacgtta 26280
atagttaata gcgtacttct ttttcttgct ttcgtggtat tcttgctagt tacactagcc 26340
atccttactg cgcttcgatt gtgtgcgtac tgctgcaata ttgttaacgt gagtcttgta 26400
aaaccttctt tttacgttta ctctcgtgtt aaaaatctga attcttctag agttcctgat 26460
cttctggtct aaacgaacta aatattatat tagtttttct gtttggaact ttaattttag 26520
ccatggcaga ttccaacggt actattaccg ttgaagagct taaaaagctc cttgaacaat 26580
ggaacctagt aataggtttc ctattcctta catggatttg tcttctacaa tttgcctatg 26640
ccaacaggaa taggtttttg tatataatta agttaatttt cctctggctg ttatggccag 26700
taactttagc ttgttttgtg cttgctgctg tttacagaat aaattggatc accggtggaa 26760
ttgctatcgc aatggcttgt cttgtaggct tgatgtggct cagctacttc attgcttctt 26820
tcagactgtt tgcgcgtacg cgttccatgt ggtcattcaa tccagaaact aacattcttc 26880
tcaacgtgcc actccatggc actattctga ccagaccgct tctagaaagt gaactcgtaa 26940
tcggagctgt gatccttcgt ggacatcttc gtattgctgg acaccatcta ggacgctgtg 27000
acatcaagga cctgcctaaa gaaatcactg ttgctacatc acgaacgctt tcttattaca 27060
aattgggagc ttcgcagcgt gtagcaggtg actcaggttt tgctgcatac agtcgctaca 27120
ggattggcaa ctataaatta aacacagacc attccagtag cagtgacaat attgctttgc 27180
ttgtacagta agtgacaaca gatgtttcat ctcgttgact ttcaggttac tatagcagag 27240
atattactaa ttattatgag gacttttaaa gtttccattt ggaatcttga ttacatcata 27300
aacctcataa ttaaaaattt atctaagtca ctaactgaga ataaatattc tcaattagat 27360
gaagagcaac caatggagat tgattaaacg aacatgaaaa ttattctttt cttggcactg 27420
ataacactcg ctacttgtga gctttatcac taccaagagt gtgttagagg tacaacagta 27480
cttttaaaag aaccttgctc ttctggaaca tacgagggca attcaccatt tcatcctcta 27540
gctgataaca aatttgcact gacttgcttt agcactcaat ttgcttttgc ttgtcctgac 27600
ggcgtaaaac acgtctatca gttacgtgcc agatcagttt cacctaaact gttcatcaga 27660
caagaggaag ttcaagaact ttactctcca atttttctta ttgttgcggc aatagtgttt 27720
ataacacttt gcttcacact caaaagaaag acagaatgat tgaactttca ttaattgact 27780
tctatttgtg ctttttagcc tttctgctat tccttgtttt aattatgctt attatctttt 27840
ggttctcact tgaactgcaa gatcataatg aaacttgtca cgcctaaacg aacatgaaat 27900
ttcttgtttt cttaggaatc atcacaactg tagctgcatt tcaccaagaa tgtagtttac 27960
agtcatgtac tcaacatcaa ccatatgtag ttgatgaccc gtgtcctatt cacttctatt 28020
ctaaatggta tattagagta ggagctagaa aatcagcacc tttaattgaa ttgtgcgtgg 28080
atgaggctgg ttctaaatca cccattcagt acatcgatat cggtaattat acagtttcct 28140
gtttaccttt tacaattaat tgccaggaac ctaaattggg tagtcttgta gtgcgttgtt 28200
cgttctatga agacttttta gagtatcatg acgttcgtgt tgttttagat ttcatctaaa 28260
cgaacaaact aaaatgtctg ataatggacc ccaaaatcag cgaaatgcac cccgcattac 28320
gtttggtgga ccctcagatt caactggcag taaccagaat ggagaacgca gtggggcgcg 28380
atcaaaacaa cgtcggcccc aaggtttacc caataatact gcgtcttggt tcaccgctct 28440
cactcaacat ggcaaggaag accttaaatt ccctcgagga caaggcgttc caattaacac 28500
caatagcagt ccagatgacc aaattggcta ctaccgaaga gctaccagac gaattcgtgg 28560
tggtgacggt aaaatgaaag atctcagtcc aagatggtat ttctactacc taggaactgg 28620
gccagaagct ggacttccct atggtgctaa caaagacggc atcatatggg ttgcaactga 28680
gggagccttg aatacaccaa aagatcacat tggcacccgc aatcctgcta acaatgctgc 28740
aatcgtgcta caacttcctc aaggaacaac attgccaaaa ggcttctacg cagaagggag 28800
cagaggcggc agtcaagcct cttctcgttc ctcatcacgt agtcgcaaca gttcaagaaa 28860
ttcaactcca ggcagcagta ggggaacttc tcctgctaga atggctggca atggcggtga 28920
tgctgctctt gctttgctgc tgcttgacag attgaaccag cttgagagca aaatgtctgg 28980
taaaggccaa caacaacaag gccaaactgt cactaagaaa tctgctgctg aggcttctaa 29040
gaagcctcgg caaaaacgta ctgccactaa agcatacaat gtaacacaag ctttcggcag 29100
acgtggtcca gaacaaaccc aaggaaattt tggggaccag gaactaatca gacaaggaac 29160
tgattacaaa cattggccgc aaattgcaca atttgccccc agcgcttcag cgttcttcgg 29220
aatgtcgcgc attggcatgg aagtcacacc ttcgggaacg tggttgacct acacaggtgc 29280
catcaaattg gatgacaaag atccaaattt caaagatcaa gtcattttgc tgaataagca 29340
tattgacgca tacaaaacat tcccaccaac agagcctaaa aaggacaaaa agaagaaggc 29400
tgatgaaact caagccttac cgcagagaca gaagaaacag caaactgtga ctcttcttcc 29460
tgctgcagat ttggatgatt tctccaaaca attgcaacaa tccatgagca gtgctgactc 29520
aactcaggcc taaactcatg cagaccacac aaggcagatg ggctatataa acgttttcgc 29580
ttttccgttt acgatatata gtctactctt gtgcagaatg aattctcgta actacatagc 29640
acaagtagat gtagttaact ttaatctcac atagcaatct ttaatcagtg tgtaacatta 29700
gggaggactt gaaagagcca ccacattttc accgaggcca cgcggagtac gatcgagtgt 29760
acagtgaaca atgctaggga gagctgccta tatggaagag ccctaatgtg taaaattaat 29820
tttagtagtg ctatccccat gtgattttaa tagcttctta ggagaatgac aaaaaaaaaa 29880
aaaaaaaaaa aaaaaaaaaa aaa 29903
<210> 2
<211> 7096
<212> PRT
<213> SARS-CoV2
<400> 2
Met Glu Ser Leu Val Pro Gly Phe Asn Glu Lys Thr His Val Gln Leu
1 5 10 15
Ser Leu Pro Val Leu Gln Val Arg Asp Val Leu Val Arg Gly Phe Gly
20 25 30
Asp Ser Val Glu Glu Val Leu Ser Glu Ala Arg Gln His Leu Lys Asp
35 40 45
Gly Thr Cys Gly Leu Val Glu Val Glu Lys Gly Val Leu Pro Gln Leu
50 55 60
Glu Gln Pro Tyr Val Phe Ile Lys Arg Ser Asp Ala Arg Thr Ala Pro
65 70 75 80
His Gly His Val Met Val Glu Leu Val Ala Glu Leu Glu Gly Ile Gln
85 90 95
Tyr Gly Arg Ser Gly Glu Thr Leu Gly Val Leu Val Pro His Val Gly
100 105 110
Glu Ile Pro Val Ala Tyr Arg Lys Val Leu Leu Arg Lys Asn Gly Asn
115 120 125
Lys Gly Ala Gly Gly His Ser Tyr Gly Ala Asp Leu Lys Ser Phe Asp
130 135 140
Leu Gly Asp Glu Leu Gly Thr Asp Pro Tyr Glu Asp Phe Gln Glu Asn
145 150 155 160
Trp Asn Thr Lys His Ser Ser Gly Val Thr Arg Glu Leu Met Arg Glu
165 170 175
Leu Asn Gly Gly Ala Tyr Thr Arg Tyr Val Asp Asn Asn Phe Cys Gly
180 185 190
Pro Asp Gly Tyr Pro Leu Glu Cys Ile Lys Asp Leu Leu Ala Arg Ala
195 200 205
Gly Lys Ala Ser Cys Thr Leu Ser Glu Gln Leu Asp Phe Ile Asp Thr
210 215 220
Lys Arg Gly Val Tyr Cys Cys Arg Glu His Glu His Glu Ile Ala Trp
225 230 235 240
Tyr Thr Glu Arg Ser Glu Lys Ser Tyr Glu Leu Gln Thr Pro Phe Glu
245 250 255
Ile Lys Leu Ala Lys Lys Phe Asp Thr Phe Asn Gly Glu Cys Pro Asn
260 265 270
Phe Val Phe Pro Leu Asn Ser Ile Ile Lys Thr Ile Gln Pro Arg Val
275 280 285
Glu Lys Lys Lys Leu Asp Gly Phe Met Gly Arg Ile Arg Ser Val Tyr
290 295 300
Pro Val Ala Ser Pro Asn Glu Cys Asn Gln Met Cys Leu Ser Thr Leu
305 310 315 320
Met Lys Cys Asp His Cys Gly Glu Thr Ser Trp Gln Thr Gly Asp Phe
325 330 335
Val Lys Ala Thr Cys Glu Phe Cys Gly Thr Glu Asn Leu Thr Lys Glu
340 345 350
Gly Ala Thr Thr Cys Gly Tyr Leu Pro Gln Asn Ala Val Val Lys Ile
355 360 365
Tyr Cys Pro Ala Cys His Asn Ser Glu Val Gly Pro Glu His Ser Leu
370 375 380
Ala Glu Tyr His Asn Glu Ser Gly Leu Lys Thr Ile Leu Arg Lys Gly
385 390 395 400
Gly Arg Thr Ile Ala Phe Gly Gly Cys Val Phe Ser Tyr Val Gly Cys
405 410 415
His Asn Lys Cys Ala Tyr Trp Val Pro Arg Ala Ser Ala Asn Ile Gly
420 425 430
Cys Asn His Thr Gly Val Val Gly Glu Gly Ser Glu Gly Leu Asn Asp
435 440 445
Asn Leu Leu Glu Ile Leu Gln Lys Glu Lys Val Asn Ile Asn Ile Val
450 455 460
Gly Asp Phe Lys Leu Asn Glu Glu Ile Ala Ile Ile Leu Ala Ser Phe
465 470 475 480
Ser Ala Ser Thr Ser Ala Phe Val Glu Thr Val Lys Gly Leu Asp Tyr
485 490 495
Lys Ala Phe Lys Gln Ile Val Glu Ser Cys Gly Asn Phe Lys Val Thr
500 505 510
Lys Gly Lys Ala Lys Lys Gly Ala Trp Asn Ile Gly Glu Gln Lys Ser
515 520 525
Ile Leu Ser Pro Leu Tyr Ala Phe Ala Ser Glu Ala Ala Arg Val Val
530 535 540
Arg Ser Ile Phe Ser Arg Thr Leu Glu Thr Ala Gln Asn Ser Val Arg
545 550 555 560
Val Leu Gln Lys Ala Ala Ile Thr Ile Leu Asp Gly Ile Ser Gln Tyr
565 570 575
Ser Leu Arg Leu Ile Asp Ala Met Met Phe Thr Ser Asp Leu Ala Thr
580 585 590
Asn Asn Leu Val Val Met Ala Tyr Ile Thr Gly Gly Val Val Gln Leu
595 600 605
Thr Ser Gln Trp Leu Thr Asn Ile Phe Gly Thr Val Tyr Glu Lys Leu
610 615 620
Lys Pro Val Leu Asp Trp Leu Glu Glu Lys Phe Lys Glu Gly Val Glu
625 630 635 640
Phe Leu Arg Asp Gly Trp Glu Ile Val Lys Phe Ile Ser Thr Cys Ala
645 650 655
Cys Glu Ile Val Gly Gly Gln Ile Val Thr Cys Ala Lys Glu Ile Lys
660 665 670
Glu Ser Val Gln Thr Phe Phe Lys Leu Val Asn Lys Phe Leu Ala Leu
675 680 685
Cys Ala Asp Ser Ile Ile Ile Gly Gly Ala Lys Leu Lys Ala Leu Asn
690 695 700
Leu Gly Glu Thr Phe Val Thr His Ser Lys Gly Leu Tyr Arg Lys Cys
705 710 715 720
Val Lys Ser Arg Glu Glu Thr Gly Leu Leu Met Pro Leu Lys Ala Pro
725 730 735
Lys Glu Ile Ile Phe Leu Glu Gly Glu Thr Leu Pro Thr Glu Val Leu
740 745 750
Thr Glu Glu Val Val Leu Lys Thr Gly Asp Leu Gln Pro Leu Glu Gln
755 760 765
Pro Thr Ser Glu Ala Val Glu Ala Pro Leu Val Gly Thr Pro Val Cys
770 775 780
Ile Asn Gly Leu Met Leu Leu Glu Ile Lys Asp Thr Glu Lys Tyr Cys
785 790 795 800
Ala Leu Ala Pro Asn Met Met Val Thr Asn Asn Thr Phe Thr Leu Lys
805 810 815
Gly Gly Ala Pro Thr Lys Val Thr Phe Gly Asp Asp Thr Val Ile Glu
820 825 830
Val Gln Gly Tyr Lys Ser Val Asn Ile Thr Phe Glu Leu Asp Glu Arg
835 840 845
Ile Asp Lys Val Leu Asn Glu Lys Cys Ser Ala Tyr Thr Val Glu Leu
850 855 860
Gly Thr Glu Val Asn Glu Phe Ala Cys Val Val Ala Asp Ala Val Ile
865 870 875 880
Lys Thr Leu Gln Pro Val Ser Glu Leu Leu Thr Pro Leu Gly Ile Asp
885 890 895
Leu Asp Glu Trp Ser Met Ala Thr Tyr Tyr Leu Phe Asp Glu Ser Gly
900 905 910
Glu Phe Lys Leu Ala Ser His Met Tyr Cys Ser Phe Tyr Pro Pro Asp
915 920 925
Glu Asp Glu Glu Glu Gly Asp Cys Glu Glu Glu Glu Phe Glu Pro Ser
930 935 940
Thr Gln Tyr Glu Tyr Gly Thr Glu Asp Asp Tyr Gln Gly Lys Pro Leu
945 950 955 960
Glu Phe Gly Ala Thr Ser Ala Ala Leu Gln Pro Glu Glu Glu Gln Glu
965 970 975
Glu Asp Trp Leu Asp Asp Asp Ser Gln Gln Thr Val Gly Gln Gln Asp
980 985 990
Gly Ser Glu Asp Asn Gln Thr Thr Thr Ile Gln Thr Ile Val Glu Val
995 1000 1005
Gln Pro Gln Leu Glu Met Glu Leu Thr Pro Val Val Gln Thr Ile Glu
1010 1015 1020
Val Asn Ser Phe Ser Gly Tyr Leu Lys Leu Thr Asp Asn Val Tyr Ile
1025 1030 1035 1040
Lys Asn Ala Asp Ile Val Glu Glu Ala Lys Lys Val Lys Pro Thr Val
1045 1050 1055
Val Val Asn Ala Ala Asn Val Tyr Leu Lys His Gly Gly Gly Val Ala
1060 1065 1070
Gly Ala Leu Asn Lys Ala Thr Asn Asn Ala Met Gln Val Glu Ser Asp
1075 1080 1085
Asp Tyr Ile Ala Thr Asn Gly Pro Leu Lys Val Gly Gly Ser Cys Val
1090 1095 1100
Leu Ser Gly His Asn Leu Ala Lys His Cys Leu His Val Val Gly Pro
1105 1110 1115 1120
Asn Val Asn Lys Gly Glu Asp Ile Gln Leu Leu Lys Ser Ala Tyr Glu
1125 1130 1135
Asn Phe Asn Gln His Glu Val Leu Leu Ala Pro Leu Leu Ser Ala Gly
1140 1145 1150
Ile Phe Gly Ala Asp Pro Ile His Ser Leu Arg Val Cys Val Asp Thr
1155 1160 1165
Val Arg Thr Asn Val Tyr Leu Ala Val Phe Asp Lys Asn Leu Tyr Asp
1170 1175 1180
Lys Leu Val Ser Ser Phe Leu Glu Met Lys Ser Glu Lys Gln Val Glu
1185 1190 1195 1200
Gln Lys Ile Ala Glu Ile Pro Lys Glu Glu Val Lys Pro Phe Ile Thr
1205 1210 1215
Glu Ser Lys Pro Ser Val Glu Gln Arg Lys Gln Asp Asp Lys Lys Ile
1220 1225 1230
Lys Ala Cys Val Glu Glu Val Thr Thr Thr Leu Glu Glu Thr Lys Phe
1235 1240 1245
Leu Thr Glu Asn Leu Leu Leu Tyr Ile Asp Ile Asn Gly Asn Leu His
1250 1255 1260
Pro Asp Ser Ala Thr Leu Val Ser Asp Ile Asp Ile Thr Phe Leu Lys
1265 1270 1275 1280
Lys Asp Ala Pro Tyr Ile Val Gly Asp Val Val Gln Glu Gly Val Leu
1285 1290 1295
Thr Ala Val Val Ile Pro Thr Lys Lys Ala Gly Gly Thr Thr Glu Met
1300 1305 1310
Leu Ala Lys Ala Leu Arg Lys Val Pro Thr Asp Asn Tyr Ile Thr Thr
1315 1320 1325
Tyr Pro Gly Gln Gly Leu Asn Gly Tyr Thr Val Glu Glu Ala Lys Thr
1330 1335 1340
Val Leu Lys Lys Cys Lys Ser Ala Phe Tyr Ile Leu Pro Ser Ile Ile
1345 1350 1355 1360
Ser Asn Glu Lys Gln Glu Ile Leu Gly Thr Val Ser Trp Asn Leu Arg
1365 1370 1375
Glu Met Leu Ala His Ala Glu Glu Thr Arg Lys Leu Met Pro Val Cys
1380 1385 1390
Val Glu Thr Lys Ala Ile Val Ser Thr Ile Gln Arg Lys Tyr Lys Gly
1395 1400 1405
Ile Lys Ile Gln Glu Gly Val Val Asp Tyr Gly Ala Arg Phe Tyr Phe
1410 1415 1420
Tyr Thr Ser Lys Thr Thr Val Ala Ser Leu Ile Asn Thr Leu Asn Asp
1425 1430 1435 1440
Leu Asn Glu Thr Leu Val Thr Met Pro Leu Gly Tyr Val Thr His Gly
1445 1450 1455
Leu Asn Leu Glu Glu Ala Ala Arg Tyr Met Arg Ser Leu Lys Val Pro
1460 1465 1470
Ala Thr Val Ser Val Ser Ser Pro Asp Ala Val Thr Ala Tyr Asn Gly
1475 1480 1485
Tyr Leu Thr Ser Ser Ser Lys Thr Pro Glu Glu His Phe Ile Glu Thr
1490 1495 1500
Ile Ser Leu Ala Gly Ser Tyr Lys Asp Trp Ser Tyr Ser Gly Gln Ser
1505 1510 1515 1520
Thr Gln Leu Gly Ile Glu Phe Leu Lys Arg Gly Asp Lys Ser Val Tyr
1525 1530 1535
Tyr Thr Ser Asn Pro Thr Thr Phe His Leu Asp Gly Glu Val Ile Thr
1540 1545 1550
Phe Asp Asn Leu Lys Thr Leu Leu Ser Leu Arg Glu Val Arg Thr Ile
1555 1560 1565
Lys Val Phe Thr Thr Val Asp Asn Ile Asn Leu His Thr Gln Val Val
1570 1575 1580
Asp Met Ser Met Thr Tyr Gly Gln Gln Phe Gly Pro Thr Tyr Leu Asp
1585 1590 1595 1600
Gly Ala Asp Val Thr Lys Ile Lys Pro His Asn Ser His Glu Gly Lys
1605 1610 1615
Thr Phe Tyr Val Leu Pro Asn Asp Asp Thr Leu Arg Val Glu Ala Phe
1620 1625 1630
Glu Tyr Tyr His Thr Thr Asp Pro Ser Phe Leu Gly Arg Tyr Met Ser
1635 1640 1645
Ala Leu Asn His Thr Lys Lys Trp Lys Tyr Pro Gln Val Asn Gly Leu
1650 1655 1660
Thr Ser Ile Lys Trp Ala Asp Asn Asn Cys Tyr Leu Ala Thr Ala Leu
1665 1670 1675 1680
Leu Thr Leu Gln Gln Ile Glu Leu Lys Phe Asn Pro Pro Ala Leu Gln
1685 1690 1695
Asp Ala Tyr Tyr Arg Ala Arg Ala Gly Glu Ala Ala Asn Phe Cys Ala
1700 1705 1710
Leu Ile Leu Ala Tyr Cys Asn Lys Thr Val Gly Glu Leu Gly Asp Val
1715 1720 1725
Arg Glu Thr Met Ser Tyr Leu Phe Gln His Ala Asn Leu Asp Ser Cys
1730 1735 1740
Lys Arg Val Leu Asn Val Val Cys Lys Thr Cys Gly Gln Gln Gln Thr
1745 1750 1755 1760
Thr Leu Lys Gly Val Glu Ala Val Met Tyr Met Gly Thr Leu Ser Tyr
1765 1770 1775
Glu Gln Phe Lys Lys Gly Val Gln Ile Pro Cys Thr Cys Gly Lys Gln
1780 1785 1790
Ala Thr Lys Tyr Leu Val Gln Gln Glu Ser Pro Phe Val Met Met Ser
1795 1800 1805
Ala Pro Pro Ala Gln Tyr Glu Leu Lys His Gly Thr Phe Thr Cys Ala
1810 1815 1820
Ser Glu Tyr Thr Gly Asn Tyr Gln Cys Gly His Tyr Lys His Ile Thr
1825 1830 1835 1840
Ser Lys Glu Thr Leu Tyr Cys Ile Asp Gly Ala Leu Leu Thr Lys Ser
1845 1850 1855
Ser Glu Tyr Lys Gly Pro Ile Thr Asp Val Phe Tyr Lys Glu Asn Ser
1860 1865 1870
Tyr Thr Thr Thr Ile Lys Pro Val Thr Tyr Lys Leu Asp Gly Val Val
1875 1880 1885
Cys Thr Glu Ile Asp Pro Lys Leu Asp Asn Tyr Tyr Lys Lys Asp Asn
1890 1895 1900
Ser Tyr Phe Thr Glu Gln Pro Ile Asp Leu Val Pro Asn Gln Pro Tyr
1905 1910 1915 1920
Pro Asn Ala Ser Phe Asp Asn Phe Lys Phe Val Cys Asp Asn Ile Lys
1925 1930 1935
Phe Ala Asp Asp Leu Asn Gln Leu Thr Gly Tyr Lys Lys Pro Ala Ser
1940 1945 1950
Arg Glu Leu Lys Val Thr Phe Phe Pro Asp Leu Asn Gly Asp Val Val
1955 1960 1965
Ala Ile Asp Tyr Lys His Tyr Thr Pro Ser Phe Lys Lys Gly Ala Lys
1970 1975 1980
Leu Leu His Lys Pro Ile Val Trp His Val Asn Asn Ala Thr Asn Lys
1985 1990 1995 2000
Ala Thr Tyr Lys Pro Asn Thr Trp Cys Ile Arg Cys Leu Trp Ser Thr
2005 2010 2015
Lys Pro Val Glu Thr Ser Asn Ser Phe Asp Val Leu Lys Ser Glu Asp
2020 2025 2030
Ala Gln Gly Met Asp Asn Leu Ala Cys Glu Asp Leu Lys Pro Val Ser
2035 2040 2045
Glu Glu Val Val Glu Asn Pro Thr Ile Gln Lys Asp Val Leu Glu Cys
2050 2055 2060
Asn Val Lys Thr Thr Glu Val Val Gly Asp Ile Ile Leu Lys Pro Ala
2065 2070 2075 2080
Asn Asn Ser Leu Lys Ile Thr Glu Glu Val Gly His Thr Asp Leu Met
2085 2090 2095
Ala Ala Tyr Val Asp Asn Ser Ser Leu Thr Ile Lys Lys Pro Asn Glu
2100 2105 2110
Leu Ser Arg Val Leu Gly Leu Lys Thr Leu Ala Thr His Gly Leu Ala
2115 2120 2125
Ala Val Asn Ser Val Pro Trp Asp Thr Ile Ala Asn Tyr Ala Lys Pro
2130 2135 2140
Phe Leu Asn Lys Val Val Ser Thr Thr Thr Asn Ile Val Thr Arg Cys
2145 2150 2155 2160
Leu Asn Arg Val Cys Thr Asn Tyr Met Pro Tyr Phe Phe Thr Leu Leu
2165 2170 2175
Leu Gln Leu Cys Thr Phe Thr Arg Ser Thr Asn Ser Arg Ile Lys Ala
2180 2185 2190
Ser Met Pro Thr Thr Ile Ala Lys Asn Thr Val Lys Ser Val Gly Lys
2195 2200 2205
Phe Cys Leu Glu Ala Ser Phe Asn Tyr Leu Lys Ser Pro Asn Phe Ser
2210 2215 2220
Lys Leu Ile Asn Ile Ile Ile Trp Phe Leu Leu Leu Ser Val Cys Leu
2225 2230 2235 2240
Gly Ser Leu Ile Tyr Ser Thr Ala Ala Leu Gly Val Leu Met Ser Asn
2245 2250 2255
Leu Gly Met Pro Ser Tyr Cys Thr Gly Tyr Arg Glu Gly Tyr Leu Asn
2260 2265 2270
Ser Thr Asn Val Thr Ile Ala Thr Tyr Cys Thr Gly Ser Ile Pro Cys
2275 2280 2285
Ser Val Cys Leu Ser Gly Leu Asp Ser Leu Asp Thr Tyr Pro Ser Leu
2290 2295 2300
Glu Thr Ile Gln Ile Thr Ile Ser Ser Phe Lys Trp Asp Leu Thr Ala
2305 2310 2315 2320
Phe Gly Leu Val Ala Glu Trp Phe Leu Ala Tyr Ile Leu Phe Thr Arg
2325 2330 2335
Phe Phe Tyr Val Leu Gly Leu Ala Ala Ile Met Gln Leu Phe Phe Ser
2340 2345 2350
Tyr Phe Ala Val His Phe Ile Ser Asn Ser Trp Leu Met Trp Leu Ile
2355 2360 2365
Ile Asn Leu Val Gln Met Ala Pro Ile Ser Ala Met Val Arg Met Tyr
2370 2375 2380
Ile Phe Phe Ala Ser Phe Tyr Tyr Val Trp Lys Ser Tyr Val His Val
2385 2390 2395 2400
Val Asp Gly Cys Asn Ser Ser Thr Cys Met Met Cys Tyr Lys Arg Asn
2405 2410 2415
Arg Ala Thr Arg Val Glu Cys Thr Thr Ile Val Asn Gly Val Arg Arg
2420 2425 2430
Ser Phe Tyr Val Tyr Ala Asn Gly Gly Lys Gly Phe Cys Lys Leu His
2435 2440 2445
Asn Trp Asn Cys Val Asn Cys Asp Thr Phe Cys Ala Gly Ser Thr Phe
2450 2455 2460
Ile Ser Asp Glu Val Ala Arg Asp Leu Ser Leu Gln Phe Lys Arg Pro
2465 2470 2475 2480
Ile Asn Pro Thr Asp Gln Ser Ser Tyr Ile Val Asp Ser Val Thr Val
2485 2490 2495
Lys Asn Gly Ser Ile His Leu Tyr Phe Asp Lys Ala Gly Gln Lys Thr
2500 2505 2510
Tyr Glu Arg His Ser Leu Ser His Phe Val Asn Leu Asp Asn Leu Arg
2515 2520 2525
Ala Asn Asn Thr Lys Gly Ser Leu Pro Ile Asn Val Ile Val Phe Asp
2530 2535 2540
Gly Lys Ser Lys Cys Glu Glu Ser Ser Ala Lys Ser Ala Ser Val Tyr
2545 2550 2555 2560
Tyr Ser Gln Leu Met Cys Gln Pro Ile Leu Leu Leu Asp Gln Ala Leu
2565 2570 2575
Val Ser Asp Val Gly Asp Ser Ala Glu Val Ala Val Lys Met Phe Asp
2580 2585 2590
Ala Tyr Val Asn Thr Phe Ser Ser Thr Phe Asn Val Pro Met Glu Lys
2595 2600 2605
Leu Lys Thr Leu Val Ala Thr Ala Glu Ala Glu Leu Ala Lys Asn Val
2610 2615 2620
Ser Leu Asp Asn Val Leu Ser Thr Phe Ile Ser Ala Ala Arg Gln Gly
2625 2630 2635 2640
Phe Val Asp Ser Asp Val Glu Thr Lys Asp Val Val Glu Cys Leu Lys
2645 2650 2655
Leu Ser His Gln Ser Asp Ile Glu Val Thr Gly Asp Ser Cys Asn Asn
2660 2665 2670
Tyr Met Leu Thr Tyr Asn Lys Val Glu Asn Met Thr Pro Arg Asp Leu
2675 2680 2685
Gly Ala Cys Ile Asp Cys Ser Ala Arg His Ile Asn Ala Gln Val Ala
2690 2695 2700
Lys Ser His Asn Ile Ala Leu Ile Trp Asn Val Lys Asp Phe Met Ser
2705 2710 2715 2720
Leu Ser Glu Gln Leu Arg Lys Gln Ile Arg Ser Ala Ala Lys Lys Asn
2725 2730 2735
Asn Leu Pro Phe Lys Leu Thr Cys Ala Thr Thr Arg Gln Val Val Asn
2740 2745 2750
Val Val Thr Thr Lys Ile Ala Leu Lys Gly Gly Lys Ile Val Asn Asn
2755 2760 2765
Trp Leu Lys Gln Leu Ile Lys Val Thr Leu Val Phe Leu Phe Val Ala
2770 2775 2780
Ala Ile Phe Tyr Leu Ile Thr Pro Val His Val Met Ser Lys His Thr
2785 2790 2795 2800
Asp Phe Ser Ser Glu Ile Ile Gly Tyr Lys Ala Ile Asp Gly Gly Val
2805 2810 2815
Thr Arg Asp Ile Ala Ser Thr Asp Thr Cys Phe Ala Asn Lys His Ala
2820 2825 2830
Asp Phe Asp Thr Trp Phe Ser Gln Arg Gly Gly Ser Tyr Thr Asn Asp
2835 2840 2845
Lys Ala Cys Pro Leu Ile Ala Ala Val Ile Thr Arg Glu Val Gly Phe
2850 2855 2860
Val Val Pro Gly Leu Pro Gly Thr Ile Leu Arg Thr Thr Asn Gly Asp
2865 2870 2875 2880
Phe Leu His Phe Leu Pro Arg Val Phe Ser Ala Val Gly Asn Ile Cys
2885 2890 2895
Tyr Thr Pro Ser Lys Leu Ile Glu Tyr Thr Asp Phe Ala Thr Ser Ala
2900 2905 2910
Cys Val Leu Ala Ala Glu Cys Thr Ile Phe Lys Asp Ala Ser Gly Lys
2915 2920 2925
Pro Val Pro Tyr Cys Tyr Asp Thr Asn Val Leu Glu Gly Ser Val Ala
2930 2935 2940
Tyr Glu Ser Leu Arg Pro Asp Thr Arg Tyr Val Leu Met Asp Gly Ser
2945 2950 2955 2960
Ile Ile Gln Phe Pro Asn Thr Tyr Leu Glu Gly Ser Val Arg Val Val
2965 2970 2975
Thr Thr Phe Asp Ser Glu Tyr Cys Arg His Gly Thr Cys Glu Arg Ser
2980 2985 2990
Glu Ala Gly Val Cys Val Ser Thr Ser Gly Arg Trp Val Leu Asn Asn
2995 3000 3005
Asp Tyr Tyr Arg Ser Leu Pro Gly Val Phe Cys Gly Val Asp Ala Val
3010 3015 3020
Asn Leu Leu Thr Asn Met Phe Thr Pro Leu Ile Gln Pro Ile Gly Ala
3025 3030 3035 3040
Leu Asp Ile Ser Ala Ser Ile Val Ala Gly Gly Ile Val Ala Ile Val
3045 3050 3055
Val Thr Cys Leu Ala Tyr Tyr Phe Met Arg Phe Arg Arg Ala Phe Gly
3060 3065 3070
Glu Tyr Ser His Val Val Ala Phe Asn Thr Leu Leu Phe Leu Met Ser
3075 3080 3085
Phe Thr Val Leu Cys Leu Thr Pro Val Tyr Ser Phe Leu Pro Gly Val
3090 3095 3100
Tyr Ser Val Ile Tyr Leu Tyr Leu Thr Phe Tyr Leu Thr Asn Asp Val
3105 3110 3115 3120
Ser Phe Leu Ala His Ile Gln Trp Met Val Met Phe Thr Pro Leu Val
3125 3130 3135
Pro Phe Trp Ile Thr Ile Ala Tyr Ile Ile Cys Ile Ser Thr Lys His
3140 3145 3150
Phe Tyr Trp Phe Phe Ser Asn Tyr Leu Lys Arg Arg Val Val Phe Asn
3155 3160 3165
Gly Val Ser Phe Ser Thr Phe Glu Glu Ala Ala Leu Cys Thr Phe Leu
3170 3175 3180
Leu Asn Lys Glu Met Tyr Leu Lys Leu Arg Ser Asp Val Leu Leu Pro
3185 3190 3195 3200
Leu Thr Gln Tyr Asn Arg Tyr Leu Ala Leu Tyr Asn Lys Tyr Lys Tyr
3205 3210 3215
Phe Ser Gly Ala Met Asp Thr Thr Ser Tyr Arg Glu Ala Ala Cys Cys
3220 3225 3230
His Leu Ala Lys Ala Leu Asn Asp Phe Ser Asn Ser Gly Ser Asp Val
3235 3240 3245
Leu Tyr Gln Pro Pro Gln Thr Ser Ile Thr Ser Ala Val Leu Gln Ser
3250 3255 3260
Gly Phe Arg Lys Met Ala Phe Pro Ser Gly Lys Val Glu Gly Cys Met
3265 3270 3275 3280
Val Gln Val Thr Cys Gly Thr Thr Thr Leu Asn Gly Leu Trp Leu Asp
3285 3290 3295
Asp Val Val Tyr Cys Pro Arg His Val Ile Cys Thr Ser Glu Asp Met
3300 3305 3310
Leu Asn Pro Asn Tyr Glu Asp Leu Leu Ile Arg Lys Ser Asn His Asn
3315 3320 3325
Phe Leu Val Gln Ala Gly Asn Val Gln Leu Arg Val Ile Gly His Ser
3330 3335 3340
Met Gln Asn Cys Val Leu Lys Leu Lys Val Asp Thr Ala Asn Pro Lys
3345 3350 3355 3360
Thr Pro Lys Tyr Lys Phe Val Arg Ile Gln Pro Gly Gln Thr Phe Ser
3365 3370 3375
Val Leu Ala Cys Tyr Asn Gly Ser Pro Ser Gly Val Tyr Gln Cys Ala
3380 3385 3390
Met Arg Pro Asn Phe Thr Ile Lys Gly Ser Phe Leu Asn Gly Ser Cys
3395 3400 3405
Gly Ser Val Gly Phe Asn Ile Asp Tyr Asp Cys Val Ser Phe Cys Tyr
3410 3415 3420
Met His His Met Glu Leu Pro Thr Gly Val His Ala Gly Thr Asp Leu
3425 3430 3435 3440
Glu Gly Asn Phe Tyr Gly Pro Phe Val Asp Arg Gln Thr Ala Gln Ala
3445 3450 3455
Ala Gly Thr Asp Thr Thr Ile Thr Val Asn Val Leu Ala Trp Leu Tyr
3460 3465 3470
Ala Ala Val Ile Asn Gly Asp Arg Trp Phe Leu Asn Arg Phe Thr Thr
3475 3480 3485
Thr Leu Asn Asp Phe Asn Leu Val Ala Met Lys Tyr Asn Tyr Glu Pro
3490 3495 3500
Leu Thr Gln Asp His Val Asp Ile Leu Gly Pro Leu Ser Ala Gln Thr
3505 3510 3515 3520
Gly Ile Ala Val Leu Asp Met Cys Ala Ser Leu Lys Glu Leu Leu Gln
3525 3530 3535
Asn Gly Met Asn Gly Arg Thr Ile Leu Gly Ser Ala Leu Leu Glu Asp
3540 3545 3550
Glu Phe Thr Pro Phe Asp Val Val Arg Gln Cys Ser Gly Val Thr Phe
3555 3560 3565
Gln Ser Ala Val Lys Arg Thr Ile Lys Gly Thr His His Trp Leu Leu
3570 3575 3580
Leu Thr Ile Leu Thr Ser Leu Leu Val Leu Val Gln Ser Thr Gln Trp
3585 3590 3595 3600
Ser Leu Phe Phe Phe Leu Tyr Glu Asn Ala Phe Leu Pro Phe Ala Met
3605 3610 3615
Gly Ile Ile Ala Met Ser Ala Phe Ala Met Met Phe Val Lys His Lys
3620 3625 3630
His Ala Phe Leu Cys Leu Phe Leu Leu Pro Ser Leu Ala Thr Val Ala
3635 3640 3645
Tyr Phe Asn Met Val Tyr Met Pro Ala Ser Trp Val Met Arg Ile Met
3650 3655 3660
Thr Trp Leu Asp Met Val Asp Thr Ser Leu Ser Gly Phe Lys Leu Lys
3665 3670 3675 3680
Asp Cys Val Met Tyr Ala Ser Ala Val Val Leu Leu Ile Leu Met Thr
3685 3690 3695
Ala Arg Thr Val Tyr Asp Asp Gly Ala Arg Arg Val Trp Thr Leu Met
3700 3705 3710
Asn Val Leu Thr Leu Val Tyr Lys Val Tyr Tyr Gly Asn Ala Leu Asp
3715 3720 3725
Gln Ala Ile Ser Met Trp Ala Leu Ile Ile Ser Val Thr Ser Asn Tyr
3730 3735 3740
Ser Gly Val Val Thr Thr Val Met Phe Leu Ala Arg Gly Ile Val Phe
3745 3750 3755 3760
Met Cys Val Glu Tyr Cys Pro Ile Phe Phe Ile Thr Gly Asn Thr Leu
3765 3770 3775
Gln Cys Ile Met Leu Val Tyr Cys Phe Leu Gly Tyr Phe Cys Thr Cys
3780 3785 3790
Tyr Phe Gly Leu Phe Cys Leu Leu Asn Arg Tyr Phe Arg Leu Thr Leu
3795 3800 3805
Gly Val Tyr Asp Tyr Leu Val Ser Thr Gln Glu Phe Arg Tyr Met Asn
3810 3815 3820
Ser Gln Gly Leu Leu Pro Pro Lys Asn Ser Ile Asp Ala Phe Lys Leu
3825 3830 3835 3840
Asn Ile Lys Leu Leu Gly Val Gly Gly Lys Pro Cys Ile Lys Val Ala
3845 3850 3855
Thr Val Gln Ser Lys Met Ser Asp Val Lys Cys Thr Ser Val Val Leu
3860 3865 3870
Leu Ser Val Leu Gln Gln Leu Arg Val Glu Ser Ser Ser Lys Leu Trp
3875 3880 3885
Ala Gln Cys Val Gln Leu His Asn Asp Ile Leu Leu Ala Lys Asp Thr
3890 3895 3900
Thr Glu Ala Phe Glu Lys Met Val Ser Leu Leu Ser Val Leu Leu Ser
3905 3910 3915 3920
Met Gln Gly Ala Val Asp Ile Asn Lys Leu Cys Glu Glu Met Leu Asp
3925 3930 3935
Asn Arg Ala Thr Leu Gln Ala Ile Ala Ser Glu Phe Ser Ser Leu Pro
3940 3945 3950
Ser Tyr Ala Ala Phe Ala Thr Ala Gln Glu Ala Tyr Glu Gln Ala Val
3955 3960 3965
Ala Asn Gly Asp Ser Glu Val Val Leu Lys Lys Leu Lys Lys Ser Leu
3970 3975 3980
Asn Val Ala Lys Ser Glu Phe Asp Arg Asp Ala Ala Met Gln Arg Lys
3985 3990 3995 4000
Leu Glu Lys Met Ala Asp Gln Ala Met Thr Gln Met Tyr Lys Gln Ala
4005 4010 4015
Arg Ser Glu Asp Lys Arg Ala Lys Val Thr Ser Ala Met Gln Thr Met
4020 4025 4030
Leu Phe Thr Met Leu Arg Lys Leu Asp Asn Asp Ala Leu Asn Asn Ile
4035 4040 4045
Ile Asn Asn Ala Arg Asp Gly Cys Val Pro Leu Asn Ile Ile Pro Leu
4050 4055 4060
Thr Thr Ala Ala Lys Leu Met Val Val Ile Pro Asp Tyr Asn Thr Tyr
4065 4070 4075 4080
Lys Asn Thr Cys Asp Gly Thr Thr Phe Thr Tyr Ala Ser Ala Leu Trp
4085 4090 4095
Glu Ile Gln Gln Val Val Asp Ala Asp Ser Lys Ile Val Gln Leu Ser
4100 4105 4110
Glu Ile Ser Met Asp Asn Ser Pro Asn Leu Ala Trp Pro Leu Ile Val
4115 4120 4125
Thr Ala Leu Arg Ala Asn Ser Ala Val Lys Leu Gln Asn Asn Glu Leu
4130 4135 4140
Ser Pro Val Ala Leu Arg Gln Met Ser Cys Ala Ala Gly Thr Thr Gln
4145 4150 4155 4160
Thr Ala Cys Thr Asp Asp Asn Ala Leu Ala Tyr Tyr Asn Thr Thr Lys
4165 4170 4175
Gly Gly Arg Phe Val Leu Ala Leu Leu Ser Asp Leu Gln Asp Leu Lys
4180 4185 4190
Trp Ala Arg Phe Pro Lys Ser Asp Gly Thr Gly Thr Ile Tyr Thr Glu
4195 4200 4205
Leu Glu Pro Pro Cys Arg Phe Val Thr Asp Thr Pro Lys Gly Pro Lys
4210 4215 4220
Val Lys Tyr Leu Tyr Phe Ile Lys Gly Leu Asn Asn Leu Asn Arg Gly
4225 4230 4235 4240
Met Val Leu Gly Ser Leu Ala Ala Thr Val Arg Leu Gln Ala Gly Asn
4245 4250 4255
Ala Thr Glu Val Pro Ala Asn Ser Thr Val Leu Ser Phe Cys Ala Phe
4260 4265 4270
Ala Val Asp Ala Ala Lys Ala Tyr Lys Asp Tyr Leu Ala Ser Gly Gly
4275 4280 4285
Gln Pro Ile Thr Asn Cys Val Lys Met Leu Cys Thr His Thr Gly Thr
4290 4295 4300
Gly Gln Ala Ile Thr Val Thr Pro Glu Ala Asn Met Asp Gln Glu Ser
4305 4310 4315 4320
Phe Gly Gly Ala Ser Cys Cys Leu Tyr Cys Arg Cys His Ile Asp His
4325 4330 4335
Pro Asn Pro Lys Gly Phe Cys Asp Leu Lys Gly Lys Tyr Val Gln Ile
4340 4345 4350
Pro Thr Thr Cys Ala Asn Asp Pro Val Gly Phe Thr Leu Lys Asn Thr
4355 4360 4365
Val Cys Thr Val Cys Gly Met Trp Lys Gly Tyr Gly Cys Ser Cys Asp
4370 4375 4380
Gln Leu Arg Glu Pro Met Leu Gln Ser Ala Asp Ala Gln Ser Phe Leu
4385 4390 4395 4400
Asn Arg Val Cys Gly Val Ser Ala Ala Arg Leu Thr Pro Cys Gly Thr
4405 4410 4415
Gly Thr Ser Thr Asp Val Val Tyr Arg Ala Phe Asp Ile Tyr Asn Asp
4420 4425 4430
Lys Val Ala Gly Phe Ala Lys Phe Leu Lys Thr Asn Cys Cys Arg Phe
4435 4440 4445
Gln Glu Lys Asp Glu Asp Asp Asn Leu Ile Asp Ser Tyr Phe Val Val
4450 4455 4460
Lys Arg His Thr Phe Ser Asn Tyr Gln His Glu Glu Thr Ile Tyr Asn
4465 4470 4475 4480
Leu Leu Lys Asp Cys Pro Ala Val Ala Lys His Asp Phe Phe Lys Phe
4485 4490 4495
Arg Ile Asp Gly Asp Met Val Pro His Ile Ser Arg Gln Arg Leu Thr
4500 4505 4510
Lys Tyr Thr Met Ala Asp Leu Val Tyr Ala Leu Arg His Phe Asp Glu
4515 4520 4525
Gly Asn Cys Asp Thr Leu Lys Glu Ile Leu Val Thr Tyr Asn Cys Cys
4530 4535 4540
Asp Asp Asp Tyr Phe Asn Lys Lys Asp Trp Tyr Asp Phe Val Glu Asn
4545 4550 4555 4560
Pro Asp Ile Leu Arg Val Tyr Ala Asn Leu Gly Glu Arg Val Arg Gln
4565 4570 4575
Ala Leu Leu Lys Thr Val Gln Phe Cys Asp Ala Met Arg Asn Ala Gly
4580 4585 4590
Ile Val Gly Val Leu Thr Leu Asp Asn Gln Asp Leu Asn Gly Asn Trp
4595 4600 4605
Tyr Asp Phe Gly Asp Phe Ile Gln Thr Thr Pro Gly Ser Gly Val Pro
4610 4615 4620
Val Val Asp Ser Tyr Tyr Ser Leu Leu Met Pro Ile Leu Thr Leu Thr
4625 4630 4635 4640
Arg Ala Leu Thr Ala Glu Ser His Val Asp Thr Asp Leu Thr Lys Pro
4645 4650 4655
Tyr Ile Lys Trp Asp Leu Leu Lys Tyr Asp Phe Thr Glu Glu Arg Leu
4660 4665 4670
Lys Leu Phe Asp Arg Tyr Phe Lys Tyr Trp Asp Gln Thr Tyr His Pro
4675 4680 4685
Asn Cys Val Asn Cys Leu Asp Asp Arg Cys Ile Leu His Cys Ala Asn
4690 4695 4700
Phe Asn Val Leu Phe Ser Thr Val Phe Pro Pro Thr Ser Phe Gly Pro
4705 4710 4715 4720
Leu Val Arg Lys Ile Phe Val Asp Gly Val Pro Phe Val Val Ser Thr
4725 4730 4735
Gly Tyr His Phe Arg Glu Leu Gly Val Val His Asn Gln Asp Val Asn
4740 4745 4750
Leu His Ser Ser Arg Leu Ser Phe Lys Glu Leu Leu Val Tyr Ala Ala
4755 4760 4765
Asp Pro Ala Met His Ala Ala Ser Gly Asn Leu Leu Leu Asp Lys Arg
4770 4775 4780
Thr Thr Cys Phe Ser Val Ala Ala Leu Thr Asn Asn Val Ala Phe Gln
4785 4790 4795 4800
Thr Val Lys Pro Gly Asn Phe Asn Lys Asp Phe Tyr Asp Phe Ala Val
4805 4810 4815
Ser Lys Gly Phe Phe Lys Glu Gly Ser Ser Val Glu Leu Lys His Phe
4820 4825 4830
Phe Phe Ala Gln Asp Gly Asn Ala Ala Ile Ser Asp Tyr Asp Tyr Tyr
4835 4840 4845
Arg Tyr Asn Leu Pro Thr Met Cys Asp Ile Arg Gln Leu Leu Phe Val
4850 4855 4860
Val Glu Val Val Asp Lys Tyr Phe Asp Cys Tyr Asp Gly Gly Cys Ile
4865 4870 4875 4880
Asn Ala Asn Gln Val Ile Val Asn Asn Leu Asp Lys Ser Ala Gly Phe
4885 4890 4895
Pro Phe Asn Lys Trp Gly Lys Ala Arg Leu Tyr Tyr Asp Ser Met Ser
4900 4905 4910
Tyr Glu Asp Gln Asp Ala Leu Phe Ala Tyr Thr Lys Arg Asn Val Ile
4915 4920 4925
Pro Thr Ile Thr Gln Met Asn Leu Lys Tyr Ala Ile Ser Ala Lys Asn
4930 4935 4940
Arg Ala Arg Thr Val Ala Gly Val Ser Ile Cys Ser Thr Met Thr Asn
4945 4950 4955 4960
Arg Gln Phe His Gln Lys Leu Leu Lys Ser Ile Ala Ala Thr Arg Gly
4965 4970 4975
Ala Thr Val Val Ile Gly Thr Ser Lys Phe Tyr Gly Gly Trp His Asn
4980 4985 4990
Met Leu Lys Thr Val Tyr Ser Asp Val Glu Asn Pro His Leu Met Gly
4995 5000 5005
Trp Asp Tyr Pro Lys Cys Asp Arg Ala Met Pro Asn Met Leu Arg Ile
5010 5015 5020
Met Ala Ser Leu Val Leu Ala Arg Lys His Thr Thr Cys Cys Ser Leu
5025 5030 5035 5040
Ser His Arg Phe Tyr Arg Leu Ala Asn Glu Cys Ala Gln Val Leu Ser
5045 5050 5055
Glu Met Val Met Cys Gly Gly Ser Leu Tyr Val Lys Pro Gly Gly Thr
5060 5065 5070
Ser Ser Gly Asp Ala Thr Thr Ala Tyr Ala Asn Ser Val Phe Asn Ile
5075 5080 5085
Cys Gln Ala Val Thr Ala Asn Val Asn Ala Leu Leu Ser Thr Asp Gly
5090 5095 5100
Asn Lys Ile Ala Asp Lys Tyr Val Arg Asn Leu Gln His Arg Leu Tyr
5105 5110 5115 5120
Glu Cys Leu Tyr Arg Asn Arg Asp Val Asp Thr Asp Phe Val Asn Glu
5125 5130 5135
Phe Tyr Ala Tyr Leu Arg Lys His Phe Ser Met Met Ile Leu Ser Asp
5140 5145 5150
Asp Ala Val Val Cys Phe Asn Ser Thr Tyr Ala Ser Gln Gly Leu Val
5155 5160 5165
Ala Ser Ile Lys Asn Phe Lys Ser Val Leu Tyr Tyr Gln Asn Asn Val
5170 5175 5180
Phe Met Ser Glu Ala Lys Cys Trp Thr Glu Thr Asp Leu Thr Lys Gly
5185 5190 5195 5200
Pro His Glu Phe Cys Ser Gln His Thr Met Leu Val Lys Gln Gly Asp
5205 5210 5215
Asp Tyr Val Tyr Leu Pro Tyr Pro Asp Pro Ser Arg Ile Leu Gly Ala
5220 5225 5230
Gly Cys Phe Val Asp Asp Ile Val Lys Thr Asp Gly Thr Leu Met Ile
5235 5240 5245
Glu Arg Phe Val Ser Leu Ala Ile Asp Ala Tyr Pro Leu Thr Lys His
5250 5255 5260
Pro Asn Gln Glu Tyr Ala Asp Val Phe His Leu Tyr Leu Gln Tyr Ile
5265 5270 5275 5280
Arg Lys Leu His Asp Glu Leu Thr Gly His Met Leu Asp Met Tyr Ser
5285 5290 5295
Val Met Leu Thr Asn Asp Asn Thr Ser Arg Tyr Trp Glu Pro Glu Phe
5300 5305 5310
Tyr Glu Ala Met Tyr Thr Pro His Thr Val Leu Gln Ala Val Gly Ala
5315 5320 5325
Cys Val Leu Cys Asn Ser Gln Thr Ser Leu Arg Cys Gly Ala Cys Ile
5330 5335 5340
Arg Arg Pro Phe Leu Cys Cys Lys Cys Cys Tyr Asp His Val Ile Ser
5345 5350 5355 5360
Thr Ser His Lys Leu Val Leu Ser Val Asn Pro Tyr Val Cys Asn Ala
5365 5370 5375
Pro Gly Cys Asp Val Thr Asp Val Thr Gln Leu Tyr Leu Gly Gly Met
5380 5385 5390
Ser Tyr Tyr Cys Lys Ser His Lys Pro Pro Ile Ser Phe Pro Leu Cys
5395 5400 5405
Ala Asn Gly Gln Val Phe Gly Leu Tyr Lys Asn Thr Cys Val Gly Ser
5410 5415 5420
Asp Asn Val Thr Asp Phe Asn Ala Ile Ala Thr Cys Asp Trp Thr Asn
5425 5430 5435 5440
Ala Gly Asp Tyr Ile Leu Ala Asn Thr Cys Thr Glu Arg Leu Lys Leu
5445 5450 5455
Phe Ala Ala Glu Thr Leu Lys Ala Thr Glu Glu Thr Phe Lys Leu Ser
5460 5465 5470
Tyr Gly Ile Ala Thr Val Arg Glu Val Leu Ser Asp Arg Glu Leu His
5475 5480 5485
Leu Ser Trp Glu Val Gly Lys Pro Arg Pro Pro Leu Asn Arg Asn Tyr
5490 5495 5500
Val Phe Thr Gly Tyr Arg Val Thr Lys Asn Ser Lys Val Gln Ile Gly
5505 5510 5515 5520
Glu Tyr Thr Phe Glu Lys Gly Asp Tyr Gly Asp Ala Val Val Tyr Arg
5525 5530 5535
Gly Thr Thr Thr Tyr Lys Leu Asn Val Gly Asp Tyr Phe Val Leu Thr
5540 5545 5550
Ser His Thr Val Met Pro Leu Ser Ala Pro Thr Leu Val Pro Gln Glu
5555 5560 5565
His Tyr Val Arg Ile Thr Gly Leu Tyr Pro Thr Leu Asn Ile Ser Asp
5570 5575 5580
Glu Phe Ser Ser Asn Val Ala Asn Tyr Gln Lys Val Gly Met Gln Lys
5585 5590 5595 5600
Tyr Ser Thr Leu Gln Gly Pro Pro Gly Thr Gly Lys Ser His Phe Ala
5605 5610 5615
Ile Gly Leu Ala Leu Tyr Tyr Pro Ser Ala Arg Ile Val Tyr Thr Ala
5620 5625 5630
Cys Ser His Ala Ala Val Asp Ala Leu Cys Glu Lys Ala Leu Lys Tyr
5635 5640 5645
Leu Pro Ile Asp Lys Cys Ser Arg Ile Ile Pro Ala Arg Ala Arg Val
5650 5655 5660
Glu Cys Phe Asp Lys Phe Lys Val Asn Ser Thr Leu Glu Gln Tyr Val
5665 5670 5675 5680
Phe Cys Thr Val Asn Ala Leu Pro Glu Thr Thr Ala Asp Ile Val Val
5685 5690 5695
Phe Asp Glu Ile Ser Met Ala Thr Asn Tyr Asp Leu Ser Val Val Asn
5700 5705 5710
Ala Arg Leu Arg Ala Lys His Tyr Val Tyr Ile Gly Asp Pro Ala Gln
5715 5720 5725
Leu Pro Ala Pro Arg Thr Leu Leu Thr Lys Gly Thr Leu Glu Pro Glu
5730 5735 5740
Tyr Phe Asn Ser Val Cys Arg Leu Met Lys Thr Ile Gly Pro Asp Met
5745 5750 5755 5760
Phe Leu Gly Thr Cys Arg Arg Cys Pro Ala Glu Ile Val Asp Thr Val
5765 5770 5775
Ser Ala Leu Val Tyr Asp Asn Lys Leu Lys Ala His Lys Asp Lys Ser
5780 5785 5790
Ala Gln Cys Phe Lys Met Phe Tyr Lys Gly Val Ile Thr His Asp Val
5795 5800 5805
Ser Ser Ala Ile Asn Arg Pro Gln Ile Gly Val Val Arg Glu Phe Leu
5810 5815 5820
Thr Arg Asn Pro Ala Trp Arg Lys Ala Val Phe Ile Ser Pro Tyr Asn
5825 5830 5835 5840
Ser Gln Asn Ala Val Ala Ser Lys Ile Leu Gly Leu Pro Thr Gln Thr
5845 5850 5855
Val Asp Ser Ser Gln Gly Ser Glu Tyr Asp Tyr Val Ile Phe Thr Gln
5860 5865 5870
Thr Thr Glu Thr Ala His Ser Cys Asn Val Asn Arg Phe Asn Val Ala
5875 5880 5885
Ile Thr Arg Ala Lys Val Gly Ile Leu Cys Ile Met Ser Asp Arg Asp
5890 5895 5900
Leu Tyr Asp Lys Leu Gln Phe Thr Ser Leu Glu Ile Pro Arg Arg Asn
5905 5910 5915 5920
Val Ala Thr Leu Gln Ala Glu Asn Val Thr Gly Leu Phe Lys Asp Cys
5925 5930 5935
Ser Lys Val Ile Thr Gly Leu His Pro Thr Gln Ala Pro Thr His Leu
5940 5945 5950
Ser Val Asp Thr Lys Phe Lys Thr Glu Gly Leu Cys Val Asp Ile Pro
5955 5960 5965
Gly Ile Pro Lys Asp Met Thr Tyr Arg Arg Leu Ile Ser Met Met Gly
5970 5975 5980
Phe Lys Met Asn Tyr Gln Val Asn Gly Tyr Pro Asn Met Phe Ile Thr
5985 5990 5995 6000
Arg Glu Glu Ala Ile Arg His Val Arg Ala Trp Ile Gly Phe Asp Val
6005 6010 6015
Glu Gly Cys His Ala Thr Arg Glu Ala Val Gly Thr Asn Leu Pro Leu
6020 6025 6030
Gln Leu Gly Phe Ser Thr Gly Val Asn Leu Val Ala Val Pro Thr Gly
6035 6040 6045
Tyr Val Asp Thr Pro Asn Asn Thr Asp Phe Ser Arg Val Ser Ala Lys
6050 6055 6060
Pro Pro Pro Gly Asp Gln Phe Lys His Leu Ile Pro Leu Met Tyr Lys
6065 6070 6075 6080
Gly Leu Pro Trp Asn Val Val Arg Ile Lys Ile Val Gln Met Leu Ser
6085 6090 6095
Asp Thr Leu Lys Asn Leu Ser Asp Arg Val Val Phe Val Leu Trp Ala
6100 6105 6110
His Gly Phe Glu Leu Thr Ser Met Lys Tyr Phe Val Lys Ile Gly Pro
6115 6120 6125
Glu Arg Thr Cys Cys Leu Cys Asp Arg Arg Ala Thr Cys Phe Ser Thr
6130 6135 6140
Ala Ser Asp Thr Tyr Ala Cys Trp His His Ser Ile Gly Phe Asp Tyr
6145 6150 6155 6160
Val Tyr Asn Pro Phe Met Ile Asp Val Gln Gln Trp Gly Phe Thr Gly
6165 6170 6175
Asn Leu Gln Ser Asn His Asp Leu Tyr Cys Gln Val His Gly Asn Ala
6180 6185 6190
His Val Ala Ser Cys Asp Ala Ile Met Thr Arg Cys Leu Ala Val His
6195 6200 6205
Glu Cys Phe Val Lys Arg Val Asp Trp Thr Ile Glu Tyr Pro Ile Ile
6210 6215 6220
Gly Asp Glu Leu Lys Ile Asn Ala Ala Cys Arg Lys Val Gln His Met
6225 6230 6235 6240
Val Val Lys Ala Ala Leu Leu Ala Asp Lys Phe Pro Val Leu His Asp
6245 6250 6255
Ile Gly Asn Pro Lys Ala Ile Lys Cys Val Pro Gln Ala Asp Val Glu
6260 6265 6270
Trp Lys Phe Tyr Asp Ala Gln Pro Cys Ser Asp Lys Ala Tyr Lys Ile
6275 6280 6285
Glu Glu Leu Phe Tyr Ser Tyr Ala Thr His Ser Asp Lys Phe Thr Asp
6290 6295 6300
Gly Val Cys Leu Phe Trp Asn Cys Asn Val Asp Arg Tyr Pro Ala Asn
6305 6310 6315 6320
Ser Ile Val Cys Arg Phe Asp Thr Arg Val Leu Ser Asn Leu Asn Leu
6325 6330 6335
Pro Gly Cys Asp Gly Gly Ser Leu Tyr Val Asn Lys His Ala Phe His
6340 6345 6350
Thr Pro Ala Phe Asp Lys Ser Ala Phe Val Asn Leu Lys Gln Leu Pro
6355 6360 6365
Phe Phe Tyr Tyr Ser Asp Ser Pro Cys Glu Ser His Gly Lys Gln Val
6370 6375 6380
Val Ser Asp Ile Asp Tyr Val Pro Leu Lys Ser Ala Thr Cys Ile Thr
6385 6390 6395 6400
Arg Cys Asn Leu Gly Gly Ala Val Cys Arg His His Ala Asn Glu Tyr
6405 6410 6415
Arg Leu Tyr Leu Asp Ala Tyr Asn Met Met Ile Ser Ala Gly Phe Ser
6420 6425 6430
Leu Trp Val Tyr Lys Gln Phe Asp Thr Tyr Asn Leu Trp Asn Thr Phe
6435 6440 6445
Thr Arg Leu Gln Ser Leu Glu Asn Val Ala Phe Asn Val Val Asn Lys
6450 6455 6460
Gly His Phe Asp Gly Gln Gln Gly Glu Val Pro Val Ser Ile Ile Asn
6465 6470 6475 6480
Asn Thr Val Tyr Thr Lys Val Asp Gly Val Asp Val Glu Leu Phe Glu
6485 6490 6495
Asn Lys Thr Thr Leu Pro Val Asn Val Ala Phe Glu Leu Trp Ala Lys
6500 6505 6510
Arg Asn Ile Lys Pro Val Pro Glu Val Lys Ile Leu Asn Asn Leu Gly
6515 6520 6525
Val Asp Ile Ala Ala Asn Thr Val Ile Trp Asp Tyr Lys Arg Asp Ala
6530 6535 6540
Pro Ala His Ile Ser Thr Ile Gly Val Cys Ser Met Thr Asp Ile Ala
6545 6550 6555 6560
Lys Lys Pro Thr Glu Thr Ile Cys Ala Pro Leu Thr Val Phe Phe Asp
6565 6570 6575
Gly Arg Val Asp Gly Gln Val Asp Leu Phe Arg Asn Ala Arg Asn Gly
6580 6585 6590
Val Leu Ile Thr Glu Gly Ser Val Lys Gly Leu Gln Pro Ser Val Gly
6595 6600 6605
Pro Lys Gln Ala Ser Leu Asn Gly Val Thr Leu Ile Gly Glu Ala Val
6610 6615 6620
Lys Thr Gln Phe Asn Tyr Tyr Lys Lys Val Asp Gly Val Val Gln Gln
6625 6630 6635 6640
Leu Pro Glu Thr Tyr Phe Thr Gln Ser Arg Asn Leu Gln Glu Phe Lys
6645 6650 6655
Pro Arg Ser Gln Met Glu Ile Asp Phe Leu Glu Leu Ala Met Asp Glu
6660 6665 6670
Phe Ile Glu Arg Tyr Lys Leu Glu Gly Tyr Ala Phe Glu His Ile Val
6675 6680 6685
Tyr Gly Asp Phe Ser His Ser Gln Leu Gly Gly Leu His Leu Leu Ile
6690 6695 6700
Gly Leu Ala Lys Arg Phe Lys Glu Ser Pro Phe Glu Leu Glu Asp Phe
6705 6710 6715 6720
Ile Pro Met Asp Ser Thr Val Lys Asn Tyr Phe Ile Thr Asp Ala Gln
6725 6730 6735
Thr Gly Ser Ser Lys Cys Val Cys Ser Val Ile Asp Leu Leu Leu Asp
6740 6745 6750
Asp Phe Val Glu Ile Ile Lys Ser Gln Asp Leu Ser Val Val Ser Lys
6755 6760 6765
Val Val Lys Val Thr Ile Asp Tyr Thr Glu Ile Ser Phe Met Leu Trp
6770 6775 6780
Cys Lys Asp Gly His Val Glu Thr Phe Tyr Pro Lys Leu Gln Ser Ser
6785 6790 6795 6800
Gln Ala Trp Gln Pro Gly Val Ala Met Pro Asn Leu Tyr Lys Met Gln
6805 6810 6815
Arg Met Leu Leu Glu Lys Cys Asp Leu Gln Asn Tyr Gly Asp Ser Ala
6820 6825 6830
Thr Leu Pro Lys Gly Ile Met Met Asn Val Ala Lys Tyr Thr Gln Leu
6835 6840 6845
Cys Gln Tyr Leu Asn Thr Leu Thr Leu Ala Val Pro Tyr Asn Met Arg
6850 6855 6860
Val Ile His Phe Gly Ala Gly Ser Asp Lys Gly Val Ala Pro Gly Thr
6865 6870 6875 6880
Ala Val Leu Arg Gln Trp Leu Pro Thr Gly Thr Leu Leu Val Asp Ser
6885 6890 6895
Asp Leu Asn Asp Phe Val Ser Asp Ala Asp Ser Thr Leu Ile Gly Asp
6900 6905 6910
Cys Ala Thr Val His Thr Ala Asn Lys Trp Asp Leu Ile Ile Ser Asp
6915 6920 6925
Met Tyr Asp Pro Lys Thr Lys Asn Val Thr Lys Glu Asn Asp Ser Lys
6930 6935 6940
Glu Gly Phe Phe Thr Tyr Ile Cys Gly Phe Ile Gln Gln Lys Leu Ala
6945 6950 6955 6960
Leu Gly Gly Ser Val Ala Ile Lys Ile Thr Glu His Ser Trp Asn Ala
6965 6970 6975
Asp Leu Tyr Lys Leu Met Gly His Phe Ala Trp Trp Thr Ala Phe Val
6980 6985 6990
Thr Asn Val Asn Ala Ser Ser Ser Glu Ala Phe Leu Ile Gly Cys Asn
6995 7000 7005
Tyr Leu Gly Lys Pro Arg Glu Gln Ile Asp Gly Tyr Val Met His Ala
7010 7015 7020
Asn Tyr Ile Phe Trp Arg Asn Thr Asn Pro Ile Gln Leu Ser Ser Tyr
7025 7030 7035 7040
Ser Leu Phe Asp Met Ser Lys Phe Pro Leu Lys Leu Arg Gly Thr Ala
7045 7050 7055
Val Met Ser Leu Lys Glu Gly Gln Ile Asn Asp Met Ile Leu Ser Leu
7060 7065 7070
Leu Ser Lys Gly Arg Leu Ile Ile Arg Glu Asn Asn Arg Val Val Ile
7075 7080 7085
Ser Ser Asp Val Leu Val Asn Asn
7090 7095
<210> 3
<211> 1273
<212> PRT
<213> SARS-CoV2
<400> 3
Met Phe Val Phe Leu Val Leu Leu Pro Leu Val Ser Ser Gln Cys Val
1 5 10 15
Asn Leu Thr Thr Arg Thr Gln Leu Pro Pro Ala Tyr Thr Asn Ser Phe
20 25 30
Thr Arg Gly Val Tyr Tyr Pro Asp Lys Val Phe Arg Ser Ser Val Leu
35 40 45
His Ser Thr Gln Asp Leu Phe Leu Pro Phe Phe Ser Asn Val Thr Trp
50 55 60
Phe His Ala Ile His Val Ser Gly Thr Asn Gly Thr Lys Arg Phe Asp
65 70 75 80
Asn Pro Val Leu Pro Phe Asn Asp Gly Val Tyr Phe Ala Ser Thr Glu
85 90 95
Lys Ser Asn Ile Ile Arg Gly Trp Ile Phe Gly Thr Thr Leu Asp Ser
100 105 110
Lys Thr Gln Ser Leu Leu Ile Val Asn Asn Ala Thr Asn Val Val Ile
115 120 125
Lys Val Cys Glu Phe Gln Phe Cys Asn Asp Pro Phe Leu Gly Val Tyr
130 135 140
Tyr His Lys Asn Asn Lys Ser Trp Met Glu Ser Glu Phe Arg Val Tyr
145 150 155 160
Ser Ser Ala Asn Asn Cys Thr Phe Glu Tyr Val Ser Gln Pro Phe Leu
165 170 175
Met Asp Leu Glu Gly Lys Gln Gly Asn Phe Lys Asn Leu Arg Glu Phe
180 185 190
Val Phe Lys Asn Ile Asp Gly Tyr Phe Lys Ile Tyr Ser Lys His Thr
195 200 205
Pro Ile Asn Leu Val Arg Asp Leu Pro Gln Gly Phe Ser Ala Leu Glu
210 215 220
Pro Leu Val Asp Leu Pro Ile Gly Ile Asn Ile Thr Arg Phe Gln Thr
225 230 235 240
Leu Leu Ala Leu His Arg Ser Tyr Leu Thr Pro Gly Asp Ser Ser Ser
245 250 255
Gly Trp Thr Ala Gly Ala Ala Ala Tyr Tyr Val Gly Tyr Leu Gln Pro
260 265 270
Arg Thr Phe Leu Leu Lys Tyr Asn Glu Asn Gly Thr Ile Thr Asp Ala
275 280 285
Val Asp Cys Ala Leu Asp Pro Leu Ser Glu Thr Lys Cys Thr Leu Lys
290 295 300
Ser Phe Thr Val Glu Lys Gly Ile Tyr Gln Thr Ser Asn Phe Arg Val
305 310 315 320
Gln Pro Thr Glu Ser Ile Val Arg Phe Pro Asn Ile Thr Asn Leu Cys
325 330 335
Pro Phe Gly Glu Val Phe Asn Ala Thr Arg Phe Ala Ser Val Tyr Ala
340 345 350
Trp Asn Arg Lys Arg Ile Ser Asn Cys Val Ala Asp Tyr Ser Val Leu
355 360 365
Tyr Asn Ser Ala Ser Phe Ser Thr Phe Lys Cys Tyr Gly Val Ser Pro
370 375 380
Thr Lys Leu Asn Asp Leu Cys Phe Thr Asn Val Tyr Ala Asp Ser Phe
385 390 395 400
Val Ile Arg Gly Asp Glu Val Arg Gln Ile Ala Pro Gly Gln Thr Gly
405 410 415
Lys Ile Ala Asp Tyr Asn Tyr Lys Leu Pro Asp Asp Phe Thr Gly Cys
420 425 430
Val Ile Ala Trp Asn Ser Asn Asn Leu Asp Ser Lys Val Gly Gly Asn
435 440 445
Tyr Asn Tyr Leu Tyr Arg Leu Phe Arg Lys Ser Asn Leu Lys Pro Phe
450 455 460
Glu Arg Asp Ile Ser Thr Glu Ile Tyr Gln Ala Gly Ser Thr Pro Cys
465 470 475 480
Asn Gly Val Glu Gly Phe Asn Cys Tyr Phe Pro Leu Gln Ser Tyr Gly
485 490 495
Phe Gln Pro Thr Asn Gly Val Gly Tyr Gln Pro Tyr Arg Val Val Val
500 505 510
Leu Ser Phe Glu Leu Leu His Ala Pro Ala Thr Val Cys Gly Pro Lys
515 520 525
Lys Ser Thr Asn Leu Val Lys Asn Lys Cys Val Asn Phe Asn Phe Asn
530 535 540
Gly Leu Thr Gly Thr Gly Val Leu Thr Glu Ser Asn Lys Lys Phe Leu
545 550 555 560
Pro Phe Gln Gln Phe Gly Arg Asp Ile Ala Asp Thr Thr Asp Ala Val
565 570 575
Arg Asp Pro Gln Thr Leu Glu Ile Leu Asp Ile Thr Pro Cys Ser Phe
580 585 590
Gly Gly Val Ser Val Ile Thr Pro Gly Thr Asn Thr Ser Asn Gln Val
595 600 605
Ala Val Leu Tyr Gln Asp Val Asn Cys Thr Glu Val Pro Val Ala Ile
610 615 620
His Ala Asp Gln Leu Thr Pro Thr Trp Arg Val Tyr Ser Thr Gly Ser
625 630 635 640
Asn Val Phe Gln Thr Arg Ala Gly Cys Leu Ile Gly Ala Glu His Val
645 650 655
Asn Asn Ser Tyr Glu Cys Asp Ile Pro Ile Gly Ala Gly Ile Cys Ala
660 665 670
Ser Tyr Gln Thr Gln Thr Asn Ser Pro Arg Arg Ala Arg Ser Val Ala
675 680 685
Ser Gln Ser Ile Ile Ala Tyr Thr Met Ser Leu Gly Ala Glu Asn Ser
690 695 700
Val Ala Tyr Ser Asn Asn Ser Ile Ala Ile Pro Thr Asn Phe Thr Ile
705 710 715 720
Ser Val Thr Thr Glu Ile Leu Pro Val Ser Met Thr Lys Thr Ser Val
725 730 735
Asp Cys Thr Met Tyr Ile Cys Gly Asp Ser Thr Glu Cys Ser Asn Leu
740 745 750
Leu Leu Gln Tyr Gly Ser Phe Cys Thr Gln Leu Asn Arg Ala Leu Thr
755 760 765
Gly Ile Ala Val Glu Gln Asp Lys Asn Thr Gln Glu Val Phe Ala Gln
770 775 780
Val Lys Gln Ile Tyr Lys Thr Pro Pro Ile Lys Asp Phe Gly Gly Phe
785 790 795 800
Asn Phe Ser Gln Ile Leu Pro Asp Pro Ser Lys Pro Ser Lys Arg Ser
805 810 815
Phe Ile Glu Asp Leu Leu Phe Asn Lys Val Thr Leu Ala Asp Ala Gly
820 825 830
Phe Ile Lys Gln Tyr Gly Asp Cys Leu Gly Asp Ile Ala Ala Arg Asp
835 840 845
Leu Ile Cys Ala Gln Lys Phe Asn Gly Leu Thr Val Leu Pro Pro Leu
850 855 860
Leu Thr Asp Glu Met Ile Ala Gln Tyr Thr Ser Ala Leu Leu Ala Gly
865 870 875 880
Thr Ile Thr Ser Gly Trp Thr Phe Gly Ala Gly Ala Ala Leu Gln Ile
885 890 895
Pro Phe Ala Met Gln Met Ala Tyr Arg Phe Asn Gly Ile Gly Val Thr
900 905 910
Gln Asn Val Leu Tyr Glu Asn Gln Lys Leu Ile Ala Asn Gln Phe Asn
915 920 925
Ser Ala Ile Gly Lys Ile Gln Asp Ser Leu Ser Ser Thr Ala Ser Ala
930 935 940
Leu Gly Lys Leu Gln Asp Val Val Asn Gln Asn Ala Gln Ala Leu Asn
945 950 955 960
Thr Leu Val Lys Gln Leu Ser Ser Asn Phe Gly Ala Ile Ser Ser Val
965 970 975
Leu Asn Asp Ile Leu Ser Arg Leu Asp Lys Val Glu Ala Glu Val Gln
980 985 990
Ile Asp Arg Leu Ile Thr Gly Arg Leu Gln Ser Leu Gln Thr Tyr Val
995 1000 1005
Thr Gln Gln Leu Ile Arg Ala Ala Glu Ile Arg Ala Ser Ala Asn Leu
1010 1015 1020
Ala Ala Thr Lys Met Ser Glu Cys Val Leu Gly Gln Ser Lys Arg Val
1025 1030 1035 1040
Asp Phe Cys Gly Lys Gly Tyr His Leu Met Ser Phe Pro Gln Ser Ala
1045 1050 1055
Pro His Gly Val Val Phe Leu His Val Thr Tyr Val Pro Ala Gln Glu
1060 1065 1070
Lys Asn Phe Thr Thr Ala Pro Ala Ile Cys His Asp Gly Lys Ala His
1075 1080 1085
Phe Pro Arg Glu Gly Val Phe Val Ser Asn Gly Thr His Trp Phe Val
1090 1095 1100
Thr Gln Arg Asn Phe Tyr Glu Pro Gln Ile Ile Thr Thr Asp Asn Thr
1105 1110 1115 1120
Phe Val Ser Gly Asn Cys Asp Val Val Ile Gly Ile Val Asn Asn Thr
1125 1130 1135
Val Tyr Asp Pro Leu Gln Pro Glu Leu Asp Ser Phe Lys Glu Glu Leu
1140 1145 1150
Asp Lys Tyr Phe Lys Asn His Thr Ser Pro Asp Val Asp Leu Gly Asp
1155 1160 1165
Ile Ser Gly Ile Asn Ala Ser Val Val Asn Ile Gln Lys Glu Ile Asp
1170 1175 1180
Arg Leu Asn Glu Val Ala Lys Asn Leu Asn Glu Ser Leu Ile Asp Leu
1185 1190 1195 1200
Gln Glu Leu Gly Lys Tyr Glu Gln Tyr Ile Lys Trp Pro Trp Tyr Ile
1205 1210 1215
Trp Leu Gly Phe Ile Ala Gly Leu Ile Ala Ile Val Met Val Thr Ile
1220 1225 1230
Met Leu Cys Cys Met Thr Ser Cys Cys Ser Cys Leu Lys Gly Cys Cys
1235 1240 1245
Ser Cys Gly Ser Cys Cys Lys Phe Asp Glu Asp Asp Ser Glu Pro Val
1250 1255 1260
Leu Lys Gly Val Lys Leu His Tyr Thr
1265 1270
<210> 4
<211> 22
<212> DNA
<213> Artificial Sequence
<220>
<223> Artificial sequence
<400> 4
tgactgtgaa cgttcgagat ga 22
<210> 5
<211> 11
<212> PRT
<213> Artificial Sequence
<220>
<223> Artificial sequence
<400> 5
Lys Leu Lys Leu Leu Leu Leu Leu Lys Leu Lys
1 5 10
<210> 6
<211> 26
<212> DNA
<213> Artificial Sequence
<220>
<223> Artificial sequence
<220>
<221> misc_feature
<222> (1)..(26)
<223> n=deoxyinosine
<400> 6
ncncncncnc ncncncncnc ncncnc 26
<210> 7
<211> 20
<212> DNA
<213> Artificial Sequence
<220>
<223> Artificial sequence
<400> 7
tccatgacgt tcctgacgtt 20
<210> 8
<211> 24
<212> DNA
<213> Artificial Sequence
<220>
<223> Artificial sequence
<400> 8
tcgtcgtttt gtcgttttgt cgtt 24
<210> 9
<211> 29867
<212> DNA
<213> SARS-CoV2
<220>
<221> misc_feature
<222> (11083)
<223> n is a, c, g, or t
<400> 9
attaaaggtt tataccttcc caggtaacaa accaaccaac tttcgatctc ttgtagatct 60
gttctctaaa cgaactttaa aatctgtgtg gctgtcactc ggctgcatgc ttagtgcact 120
cacgcagtat aattaataac taattactgt cgttgacagg acacgagtaa ctcgtctatc 180
ttctgcaggc tgcttacggt ttcgtccgtg ttgcagccga tcatcagcac atctaggttt 240
cgtccgggtg tgaccgaaag gtaagatgga gagccttgtc cctggtttca acgagaaaac 300
acacgtccaa ctcagtttgc ctgttttaca ggttcgcgac gtgctcgtac gtggctttgg 360
agactccgtg gaggaggtct tatcagaggc acgtcaacat cttaaagatg gcacttgtgg 420
cttagtagaa gttgaaaaag gcgttttgcc tcaacttgaa cagccctatg tgttcatcaa 480
acgttcggat gctcgaactg cacctcatgg tcatgttatg gttgagctgg tagcagaact 540
cgaaggcatt cagtacggtc gtagtggtga gacacttggt gtccttgtcc ctcatgtggg 600
cgaaatacca gtggcttacc gcaaggttct tcttcgtaag aacggtaata aaggagctgg 660
tggccatagt tacggcgccg atctaaagtc atttgactta ggcgacgagc ttggcactga 720
tccttatgaa gattttcaag aaaactggaa cactaaacat agcagtggtg ttacccgtga 780
actcatgcgt gagcttaacg gaggggcata cactcgctat gtcgataaca acttctgtgg 840
ccctgatggc taccctcttg agtgcattaa agaccttcta gcacgtgctg gtaaagcttc 900
atgcactttg tccgaacaac tggactttat tgacactaag aggggtgtat actgctgccg 960
tgaacatgag catgaaattg cttggtacac ggaacgttct gaaaagagct atgaattgca 1020
gacacctttt gaaattaaat tggcaaagaa atttgacacc ttcaatgggg aatgtccaaa 1080
ttttgtattt cccttaaatt ccataatcaa gactattcaa ccaagggttg aaaagaaaaa 1140
gcttgatggc tttatgggta gaattcgatc tgtctatcca gttgcgtcac caaatgaatg 1200
caaccaaatg tgcctttcaa ctctcatgaa gtgtgatcat tgtggtgaaa cttcatggca 1260
gacgggcgat tttgttaaag ccacttgcga attttgtggc actgagaatt tgactaaaga 1320
aggtgccact acttgtggtt acttacccca aaatgctgtt gttaaaattt attgtccagc 1380
atgtcacaat tcagaagtag gacctgagca tagtcttgcc gaataccata atgaatctgg 1440
cttgaaaacc attcttcgta agggtggtcg cactattgcc tttggaggct gtgtgttctc 1500
ttatgttggt tgccataaca agtgtgccta ttgggttcca cgtgctagcg ctaacatagg 1560
ttgtaaccat acaggtgttg ttggagaagg ttccgaaggt cttaatgaca accttcttga 1620
aatactccaa aaagagaaag tcaacatcaa tattgttggt gactttaaac ttaatgaaga 1680
gatcgccatt attttggcat ctttttctgc ttccacaagt gcttttgtgg aaactgtgaa 1740
aggtttggat tataaagcat tcaaacaaat tgttgaatcc tgtggtaatt ttaaagttac 1800
aaaaggaaaa gctaaaaaag gtgcctggaa tattggtgaa cagaaatcaa tactgagtcc 1860
tctttatgca tttgcatcag aggctgctcg tgttgtacga tcaattttct cccgcactct 1920
tgaaactgct caaaattctg tgcgtgtttt acagaaggcc gctataacaa tactagatgg 1980
aatttcacag tattcactga gactcattga tgctatgatg ttcacatctg atttggctac 2040
taacaatcta gttgtaatgg cctacattac aggtggtgtt gttcagttga cttcgcagtg 2100
gctaactaac atctttggca ctgtttatga aaaactcaaa cccgtccttg attggcttga 2160
agagaagttt aaggaaggtg tagagtttct tagagacggt tgggaaattg ttaaatttat 2220
ctcaacctgt gcttgtgaaa ttgtcggtgg acaaattgtc acctgtgcta aggaaattaa 2280
ggagagtgtt cagacattct ttaagcttgt aaataaattt ttggctttgt gtgctgactc 2340
tatcattatt ggtggagcta aacttaaagc cttgaattta ggtgaaacat ttgtcacgca 2400
ctcaaaggga ttgtacagaa agtgtgttaa atccagagaa gaaactggcc tactcatgcc 2460
tctaaaagcc ccaaaagaaa ttatcttctt agagggagaa acacttccca cagaagtgtt 2520
aacagaggaa gttgtcttga aaactggtga tttacaacca ttagaacaac ctactagtga 2580
agctgttgaa gctccattgg ttggtacacc agtttgtatt aacgggctta tgttgctcga 2640
aatcaaagac acagaaaagt actgtgccct tgcacctaat atgatggtaa caaacaatac 2700
cttcacactc aaaggcggtg caccaacaaa ggttactttt ggtgatgaca ctgtgataga 2760
agtgcaaggt tacaagagtg tgaatatcac ttttgaactt gatgaaagga ttgataaagt 2820
acttaatgag aagtgctctg cctatacagt tgaactcggt acagaagtaa atgagttcgc 2880
ctgtgttgtg gcagatgctg tcataaaaac tttgcaacca gtatctgaat tacttacacc 2940
actgggcatt gatttagatg agtggagtat ggctacatac tacttatttg atgagtctgg 3000
tgagtttaaa ttggcttcac atatgtattg ttctttctac cctccagatg aggatgaaga 3060
agaaggtgat tgtgaagaag aagagtttga gccatcaact caatatgagt atggtactga 3120
agatgattac caaggtaaac ctttggaatt tggtgccact tctgctgctc ttcaacctga 3180
agaagagcaa gaagaagatt ggttagatga tgatagtcaa caaactgttg gtcaacaaga 3240
cggcagtgag gacaatcaga caactactat tcaaacaatt gttgaggttc aacctcaatt 3300
agagatggaa cttacaccag ttgttcagac tattgaagtg aatagtttta gtggttattt 3360
aaaacttact gacaatgtat acattaaaaa tgcagacatt gtggaagaag ctaaaaaggt 3420
aaaaccaaca gtggttgtta atgcagccaa tgtttacctt aaacatggag gaggtgttgc 3480
aggagcctta aataaggcta ctaacaatgc catgcaagtt gaatctgatg attacatagc 3540
tactaatgga ccacttaaag tgggtggtag ttgtgtttta agcggacaca atcttgctaa 3600
acactgtctt catgttgtcg gcccaaatgt taacaaaggt gaagacattc aacttcttaa 3660
gagtgcttat gaaaatttta atcagcacga agttctactt gcaccattat tatcagctgg 3720
tatttttggt gctgacccta tacattcttt aagagtttgt gtagatactg ttcgcacaaa 3780
tgtctactta gctgtctttg ataaaaatct ctatgacaaa cttgtttcaa gctttttgga 3840
aatgaagagt gaaaagcaag ttgaacaaaa gatcgctgag attcctaaag aggaagttaa 3900
gccatttata actgaaagta aaccttcagt tgaacagaga aaacaagatg ataagaaaat 3960
caaagcttgt gttgaagaag ttacaacaac tctggaagaa actaagttcc tcacagaaaa 4020
cttgttactt tatattgaca ttaatggcaa tcttcatcca gattctgcca ctcttgttag 4080
tgacattgac atcactttct taaagaaaga tgctccatat atagtgggtg atgttgttca 4140
agagggtgtt ttaactgctg tggttatacc tactaaaaag gctggtggca ctactgaaat 4200
gctagcgaaa gctttgagaa aagtgccaac agacaattat ataaccactt acccgggtca 4260
gggtttaaat ggttacactg tagaggaggc aaagacagtg cttaaaaagt gtaaaagtgc 4320
cttttacatt ctaccatcta ttatctctaa tgagaagcaa gaaattcttg gaactgtttc 4380
ttggaatttg cgagaaatgc ttgcacatgc agaagaaaca cgcaaattaa tgcctgtctg 4440
tgtggaaact aaagccatag tttcaactat acagcgtaaa tataagggta ttaaaataca 4500
agagggtgtg gttgattatg gtgctagatt ttacttttac accagtaaaa caactgtagc 4560
gtcacttatc aacacactta acgatctaaa tgaaactctt gttacaatgc cacttggcta 4620
tgtaacacat ggcttaaatt tggaagaagc tgctcggtat atgagatctc tcaaagtgcc 4680
agctacagtt tctgtttctt cacctgatgc tgttacagcg tataatggtt atcttacttc 4740
ttcttctaaa acacctgaag aacattttat tgaaaccatc tcacttgctg gttcctataa 4800
agattggtcc tattctggac aatctacaca actaggtata gaatttctta agagaggtga 4860
taaaagtgta tattacacta gtaatcctac cacattccac ctagatggtg aagttatcac 4920
ctttgacaat cttaagacac ttctttcttt gagagaagtg aggactatta aggtgtttac 4980
aacagtagac aacattaacc tccacacgca agttgtggac atgtcaatga catatggaca 5040
acagtttggt ccaacttatt tggatggagc tgatgttact aaaataaaac ctcataattc 5100
acatgaaggt aaaacatttt atgttttacc taatgatgac actctacgtg ttgaggcttt 5160
tgagtactac cacacaactg atcctagttt tctgggtagg tacatgtcag cattaaatca 5220
cactaaaaag tggaaatacc cacaagttaa tggtttaact tctattaaat gggcagataa 5280
caactgttat cttgccactg cattgttaac actccaacaa atagagttga agtttaatcc 5340
acctgctcta caagatgctt attacagagc aagggctggt gaagctgcta acttttgtgc 5400
acttatctta gcctactgta ataagacagt aggtgagtta ggtgatgtta gagaaacaat 5460
gagttacttg tttcaacatg ccaatttaga ttcttgcaaa agagtcttga acgtggtgtg 5520
taaaacttgt ggacaacagc agacaaccct taagggtgta gaagctgtta tgtacatggg 5580
cacactttct tatgaacaat ttaagaaagg tgttcagata ccttgtacgt gtggtaaaca 5640
agctacaaaa tatctagtac aacaggagtc accttttgtt atgatgtcag caccacctgc 5700
tcagtatgaa cttaagcatg gtacatttac ttgtgctagt gagtacactg gtaattacca 5760
gtgtggtcac tataaacata taacttctaa agaaactttg tattgcatag acggtgcttt 5820
acttacaaag tcctcagaat acaaaggtcc tattacggat gttttctaca aagaaaacag 5880
ttacacaaca accataaaac cagttactta taaattggat ggtgttgttt gtacagaaat 5940
tgaccctaag ttggacaatt attataagaa agacaattct tatttcacag agcaaccaat 6000
tgatcttgta ccaaaccaac catatccaaa cgcaagcttc gataatttta agtttgtatg 6060
tgataatatc aaatttgctg atgatttaaa ccagttaact ggttataaga aacctgcttc 6120
aagagagctt aaagttacat ttttccctga cttaaatggt gatgtggtgg ctattgatta 6180
taaacactac acaccctctt ttaagaaagg agctaaattg ttacataaac ctattgtttg 6240
gcatgttaac aatgcaacta ataaagccac gtataaacca aatacctggt gtatacgttg 6300
tctttggagc acaaaaccag ttgaaacatc aaattcgttt gatgtactga agtcagagga 6360
cgcgcaggga atggataatc ttgcctgcga agatctaaaa ccagtctctg aagaagtagt 6420
ggaaaatcct accatacaga aagacgttct tgagtgtaat gtgaaaacta ccgaagttgt 6480
aggagacatt atacttaaac cagcaaataa tagtttaaaa attacagaag aggttggcca 6540
cacagatcta atggctgctt atgtagacaa ttctagtctt actattaaga aacctaatga 6600
attatctaga gtattaggtt tgaaaaccct tgctactcat ggtttagctg ctgttaatag 6660
tgtcccttgg gatactatag ctaattatgc taagcctttt cttaacaaag ttgttagtac 6720
aactactaac atagttacac ggtgtttaaa ccgtgtttgt actaattata tgccttattt 6780
ctttacttta ttgctacaat tgtgtacttt tactagaagt acaaattcta gaattaaagc 6840
atctatgccg actactatag caaagaatac tgttaagagt gtcggtaaat tttgtctaga 6900
ggcttcattt aattatttga agtcacctaa tttttctaaa ctgataaata ttataatttg 6960
gtttttacta ttaagtgttt gcctaggttc tttaatctac tcaaccgctg ctttaggtgt 7020
tttaatgtct aatttaggca tgccttctta ctgtactggt tacagagaag gctatttgaa 7080
ctctactaat gtcactattg caacctactg tactggttct ataccttgta gtgtttgtct 7140
tagtggttta gattctttag acacctatcc ttctttagaa actatacaaa ttaccatttc 7200
atcttttaaa tgggatttaa ctgcttttgg cttagttgca gagtggtttt tggcatatat 7260
tcttttcact aggtttttct atgtacttgg attggctgca atcatgcaat tgtttttcag 7320
ctattttgca gtacatttta ttagtaattc ttggcttatg tggttaataa ttaatcttgt 7380
acaaatggcc ccgatttcag ctatggttag aatgtacatc ttctttgcat cattttatta 7440
tgtatggaaa agttatgtgc atgttgtaga cggttgtaat tcatcaactt gtatgatgtg 7500
ttacaaacgt aatagagcaa caagagtcga atgtacaact attgttaatg gtgttagaag 7560
gtccttttat gtctatgcta atggaggtaa aggcttttgc aaactacaca attggaattg 7620
tgttaattgt gatacattct gtgctggtag tacatttatt agtgatgaag ttgcgagaga 7680
cttgtcacta cagtttaaaa gaccaataaa tcctactgac cagtcttctt acatcgttga 7740
tagtgttaca gtgaagaatg gttccatcca tctttacttt gataaagctg gtcaaaagac 7800
ttatgaaaga cattctctct ctcattttgt taacttagac aacctgagag ctaataacac 7860
taaaggttca ttgcctatta atgttatagt ttttgatggt aaatcaaaat gtgaagaatc 7920
atctgcaaaa tcagcgtctg tttactacag tcagcttatg tgtcaaccta tactgttact 7980
agatcaggca ttagtgtctg atgttggtga tagtgcggaa gttgcagtta aaatgtttga 8040
tgcttacgtt aatacgtttt catcaacttt taacgtacca atggaaaaac tcaaaacact 8100
agttgcaact gcagaagctg aacttgcaaa gaatgtgtcc ttagacaatg tcttatctac 8160
ttttatttca gcagctcggc aagggtttgt tgattcagat gtagaaacta aagatgttgt 8220
tgaatgtctt aaattgtcac atcaatctga catagaagtt actggcgata gttgtaataa 8280
ctatatgctc acctataaca aagttgaaaa catgacaccc cgtgaccttg gtgcttgtat 8340
tgactgtagt gcgcgtcata ttaatgcgca ggtagcaaaa agtcacaaca ttgctttgat 8400
atggaacgtt aaagatttca tgtcattgtc tgaacaacta cgaaaacaaa tacgtagtgc 8460
tgctaaaaag aataacttac cttttaagtt gacatgtgca actactagac aagttgttaa 8520
tgttgtaaca acaaagatag cacttaaggg tggtaaaatt gttaataatt ggttgaagca 8580
gttaattaaa gttacacttg tgttcctttt tgttgctgct attttctatt taataacacc 8640
tgttcatgtc atgtctaaac atactgactt ttcaagtgaa atcataggat acaaggctat 8700
tgatggtggt gtcactcgtg acatagcatc tacagatact tgttttgcta acaaacatgc 8760
tgattttgac acatggttta gccagcgtgg tggtagttat actaatgaca aagcttgccc 8820
attgattgct gcagtcataa caagagaagt gggttttgtc gtgcctggtt tgcctggcac 8880
gatattacgc acaactaatg gtgacttttt gcatttctta cctagagttt ttagtgcagt 8940
tggtaacatc tgttacacac catcaaaact tatagagtac actgactttg caacatcagc 9000
ttgtgttttg gctgctgaat gtacaatttt taaagatgct tctggtaagc cagtaccata 9060
ttgttatgat accaatgtac tagaaggttc tgttgcttat gaaagtttac gccctgacac 9120
acgttatgtg ctcatggatg gctctattat tcaatttcct aacacctacc ttgaaggttc 9180
tgttagagtg gtaacaactt ttgattctga gtactgtagg cacggcactt gtgaaagatc 9240
agaagctggt gtttgtgtat ctactagtgg tagatgggta cttaacaatg attattacag 9300
atctttacca ggagttttct gtggtgtaga tgctgtaaat ttacttacta atatgtttac 9360
accactaatt caacctattg gtgctttgga catatcagca tctatagtag ctggtggtat 9420
tgtagctatc gtagtaacat gccttgccta ctattttatg aggtttagaa gagcttttgg 9480
tgaatacagt catgtagttg cctttaatac tttactattc cttatgtcat tcactgtact 9540
ctgtttaaca ccagtttact cattcttacc tggtgtttat tctgttattt acttgtactt 9600
gacattttat cttactaatg atgtttcttt tttagcacat attcagtgga tggttatgtt 9660
cacaccttta gtacctttct ggataacaat tgcttatatc atttgtattt ccacaaagca 9720
tttctattgg ttctttagta attacctaaa gagacgtgta gtctttaatg gtgtttcctt 9780
tagtactttt gaagaagctg cgctgtgcac ctttttgtta aataaagaaa tgtatctaaa 9840
gttgcgtagt gatgtgctat tacctcttac gcaatataat agatacttag ctctttataa 9900
taagtacaag tattttagtg gagcaatgga tacaactagc tacagagaag ctgcttgttg 9960
tcatctcgca aaggctctca atgacttcag taactcaggt tctgatgttc tttaccaacc 10020
accacaaacc tctatcacct cagctgtttt gcagagtggt tttagaaaaa tggcattccc 10080
atctggtaaa gttgagggtt gtatggtaca agtaacttgt ggtacaacta cacttaacgg 10140
tctttggctt gatgacgtag tttactgtcc aagacatgtg atctgcacct ctgaagacat 10200
gcttaaccct aattatgaag atttactcat tcgtaagtct aatcataatt tcttggtaca 10260
ggctggtaat gttcaactca gggttattgg acattctatg caaaattgtg tacttaagct 10320
taaggttgat acagccaatc ctaagacacc taagtataag tttgttcgca ttcaaccagg 10380
acagactttt tcagtgttag cttgttacaa tggttcacca tctggtgttt accaatgtgc 10440
tatgaggccc aatttcacta ttaagggttc attccttaat ggttcatgtg gtagtgttgg 10500
ttttaacata gattatgact gtgtctcttt ttgttacatg caccatatgg aattaccaac 10560
tggagttcat gctggcacag acttagaagg taacttttat ggaccttttg ttgacaggca 10620
aacagcacaa gcagctggta cggacacaac tattacagtt aatgttttag cttggttgta 10680
cgctgctgtt ataaatggag acaggtggtt tctcaatcga tttaccacaa ctcttaatga 10740
ctttaacctt gtggctatga agtacaatta tgaacctcta acacaagacc atgttgacat 10800
actaggacct ctttctgctc aaactggaat tgccgtttta gatatgtgtg cttcattaaa 10860
agaattactg caaaatggta tgaatggacg taccatattg ggtagtgctt tattagaaga 10920
tgaatttaca ccttttgatg ttgttagaca atgctcaggt gttactttcc aaagtgcagt 10980
gaaaagaaca atcaagggta cacaccactg gttgttactc acaattttga cttcactttt 11040
agttttagtc cagagtactc aatggtcttt gttctttttt ttntatgaaa atgccttttt 11100
accttttgct atgggtatta ttgctatgtc tgcttttgca atgatgtttg tcaaacataa 11160
gcatgcattt ctctgtttgt ttttgttacc ttctcttgcc actgtagctt attttaatat 11220
ggtctatatg cctgctagtt gggtgatgcg tattatgaca tggttggata tggttgatac 11280
tagtttgtct ggttttaagc taaaagactg tgttatgtat gcatcagctg tagtgttact 11340
aatccttatg acagcaagaa ctgtgtatga tgatggtgct aggagagtgt ggacacttat 11400
gaatgtcttg acactcgttt ataaagttta ttatggtaat gctttagatc aagccatttc 11460
catgtgggct cttataatct ctgttacttc taactactca ggtgtagtta caactgtcat 11520
gtttttggcc agaggtattg tttttatgtg tgttgagtat tgccctattt tcttcataac 11580
tggtaataca cttcagtgta taatgctagt ttattgtttc ttaggctatt tttgtacttg 11640
ttactttggc ctcttttgtt tactcaaccg ctactttaga ctgactcttg gtgtttatga 11700
ttacttagtt tctacacagg agtttagata tatgaattca cagggactac tcccacccaa 11760
gaatagcata gatgccttca aactcaacat taaattgttg ggtgttggtg gcaaaccttg 11820
tatcaaagta gccactgtac agtctaaaat gtcagatgta aagtgcacat cagtagtctt 11880
actctcagtt ttgcaacaac tcagagtaga atcatcatct aaattgtggg ctcaatgtgt 11940
ccagttacac aatgacattc tcttagctaa agatactact gaagcctttg aaaaaatggt 12000
ttcactactt tctgttttgc tttccatgca gggtgctgta gacataaaca agctttgtga 12060
agaaatgctg gacaacaggg caaccttaca agctatagcc tcagagttta gttcccttcc 12120
atcatatgca gcttttgcta ctgctcaaga agcttatgag caggctgttg ctaatggtga 12180
ttctgaagtt gttcttaaaa agttgaagaa gtctttgaat gtggctaaat ctgaatttga 12240
ccgtgatgca gccatgcaac gtaagttgga aaagatggct gatcaagcta tgacccaaat 12300
gtataaacag gctagatctg aggacaagag ggcaaaagtt actagtgcta tgcagacaat 12360
gcttttcact atgcttagaa agttggataa tgatgcactc aacaacatta tcaacaatgc 12420
aagagatggt tgtgttccct tgaacataat acctcttaca acagcagcca aactaatggt 12480
tgtcatacca gactataaca catataaaaa tacgtgtgat ggtacaacat ttacttatgc 12540
atcagcattg tgggaaatcc aacaggttgt agatgcagat agtaaaattg ttcaacttag 12600
tgaaattagt atggacaatt cacctaattt agcatggcct cttattgtaa cagctttaag 12660
ggccaattct gctgtcaaat tacagaataa tgagcttagt cctgttgcac tacgacagat 12720
gtcttgtgct gccggtacta cacaaactgc ttgcactgat gacaatgcgt tagcttacta 12780
caacacaaca aagggaggta ggtttgtact tgcactgtta tccgatttac aggatttgaa 12840
atgggctaga ttccctaaga gtgatggaac tggtactatc tatacagaac tggaaccacc 12900
ttgtaggttt gttacagaca cacctaaagg tcctaaagtg aagtatttat actttattaa 12960
aggattaaac aacctaaata gaggtatggt acttggtagt ttagctgcca cagtacgtct 13020
acaagctggt aatgcaacag aagtgcctgc caattcaact gtattatctt tctgtgcttt 13080
tgctgtagat gctgctaaag cttacaaaga ttatctagct agtgggggac aaccaatcac 13140
taattgtgtt aagatgttgt gtacacacac tggtactggt caggcaataa cagttacacc 13200
ggaagccaat atggatcaag aatcctttgg tggtgcatcg tgttgtctgt actgccgttg 13260
ccacatagat catccaaatc ctaaaggatt ttgtgactta aaaggtaagt atgtacaaat 13320
acctacaact tgtgctaatg accctgtggg ttttacactt aaaaacacag tctgtaccgt 13380
ctgcggtatg tggaaaggtt atggctgtag ttgtgatcaa ctccgcgaac ccatgcttca 13440
gtcagctgat gcacaatcgt ttttaaacgg gtttgcggtg taagtgcagc ccgtcttaca 13500
ccgtgcggca caggcactag tactgatgtc gtatacaggg cttttgacat ctacaatgat 13560
aaagtagctg gttttgctaa attcctaaaa actaattgtt gtcgcttcca agaaaaggac 13620
gaagatgaca atttaattga ttcttacttt gtagttaaga gacacacttt ctctaactac 13680
caacatgaag aaacaattta taatttactt aaggattgtc cagctgttgc taaacatgac 13740
ttctttaagt ttagaataga cggtgacatg gtaccacata tatcacgtca acgtcttact 13800
aaatacacaa tggcagacct cgtctatgct ttaaggcatt ttgatgaagg taattgtgac 13860
acattaaaag aaatacttgt cacatacaat tgttgtgatg atgattattt caataaaaag 13920
gactggtatg attttgtaga aaacccagat atattacgcg tatacgccaa cttaggtgaa 13980
cgtgtacgcc aagctttgtt aaaaacagta caattctgtg atgccatgcg aaatgctggt 14040
attgttggtg tactgacatt agataatcaa gatctcaatg gtaactggta tgatttcggt 14100
gatttcatac aaaccacgcc aggtagtgga gttcctgttg tagattctta ttattcattg 14160
ttaatgccta tattaacctt gaccagggct ttaactgcag agtcacatgt tgacactgac 14220
ttaacaaagc cttacattaa gtgggatttg ttaaaatatg acttcacgga agagaggtta 14280
aaactctttg accgttattt taaatattgg gatcagacat accacccaaa ttgtgttaac 14340
tgtttggatg acagatgcat tctgcattgt gcaaacttta atgttttatt ctctacagtg 14400
ttcccaccta caagttttgg accactagtg agaaaaatat ttgttgatgg tgttccattt 14460
gtagtttcaa ctggatacca cttcagagag ctaggtgttg tacataatca ggatgtaaac 14520
ttacatagct ctagacttag ttttaaggaa ttacttgtgt atgctgctga ccctgctatg 14580
cacgctgctt ctggtaatct attactagat aaacgcacta cgtgcttttc agtagctgca 14640
cttactaaca atgttgcttt tcaaactgtc aaacccggta attttaacaa agacttctat 14700
gactttgctg tgtctaaggg tttctttaag gaaggaagtt ctgttgaatt aaaacacttc 14760
ttctttgctc aggatggtaa tgctgctatc agcgattatg actactatcg ttataatcta 14820
ccaacaatgt gtgatatcag acaactacta tttgtagttg aagttgttga taagtacttt 14880
gattgttacg atggtggctg tattaatgct aaccaagtca tcgtcaacaa cctagacaaa 14940
tcagctggtt ttccatttaa taaatggggt aaggctagac tttattatga ttcaatgagt 15000
tatgaggatc aagatgcact tttcgcatat acaaaacgta atgtcatccc tactataact 15060
caaatgaatc ttaagtatgc cattagtgca aagaatagag ctcgcaccgt agctggtgtc 15120
tctatctgta gtactatgac caatagacag tttcatcaaa aattattgaa atcaatagcc 15180
gccactagag gagctactgt agtaattgga acaagcaaat tctatggtgg ttggcacaac 15240
atgttaaaaa ctgtttatag tgatgtagaa aaccctcacc ttatgggttg ggattatcct 15300
aaatgtgata gagccatgcc taacatgctt agaattatgg cctcacttgt tcttgctcgc 15360
aaacatacaa cgtgttgtag cttgtcacac cgtttctata gattagctaa tgagtgtgct 15420
caagtattga gtgaaatggt catgtgtggc ggttcactat atgttaaacc aggtggaacc 15480
tcatcaggag atgccacaac tgcttatgct aatagtgttt ttaacatttg tcaagctgtc 15540
acggccaatg ttaatgcact tttatctact gatggtaaca aaattgccga taagtatgtc 15600
cgcaatttac aacacagact ttatgagtgt ctctatagaa atagagatgt tgacacagac 15660
tttgtgaatg agttttacgc atatttgcgt aaacatttct caatgatgat actctctgac 15720
gatgctgttg tgtgtttcaa tagcacttat gcatctcaag gtctagtggc tagcataaag 15780
aactttaagt cagttcttta ttatcaaaac aatgttttta tgtctgaagc aaaatgttgg 15840
actgagactg accttactaa aggacctcat gaattttgct ctcaacatac aatgctagtt 15900
aaacagggtg atgattatgt gtaccttcct tacccagatc catcaagaat cctaggggcc 15960
ggctgttttg tagatgatat cgtaaaaaca gatggtacac ttatgattga acggttcgtg 16020
tctttagcta tagatgctta cccacttact aaacatccta atcaggagta tgctgatgtc 16080
tttcatttgt acttacaata cataagaaag ctacatgatg agttaacagg acacatgtta 16140
gacatgtatt ctgttatgct tactaatgat aacacttcaa ggtattggga acctgagttt 16200
tatgaggcta tgtacacacc gcatacagtc ttacaggctg ttggggcttg tgttctttgc 16260
aattcacaga cttcattaag atgtggtgct tgcatacgta gaccattctt atgttgtaaa 16320
tgctgttacg accatgtcat atcaacatca cataaattag tcttgtctgt taatccgtat 16380
gtttgcaatg ctccaggttg tgatgtcaca gatgtgactc aactttactt aggaggtatg 16440
agctattatt gtaaatcaca taaaccaccc attagttttc cattgtgtgc taatggacaa 16500
gtttttggtt tatataaaaa tacatgtgtt ggtagcgata atgttactga ctttaatgca 16560
attgcaacat gtgactggac aaatgctggt gattacattt tagctaacac ctgtactgaa 16620
agactcaagc tttttgcagc agaaacgctc aaagctactg aggagacatt taaactgtct 16680
tatggtattg ctactgtacg tgaagtgctg tctgacagag aattacatct ttcatgggaa 16740
gttggtaaac ctagaccacc acttaaccga aattatgtct ttactggtta tcgtgtaact 16800
aaaaacagta aagtacaaat aggagagtac acctttgaaa aaggtgacta tggtgatgct 16860
gttgtttacc gaggtacaac aacttacaaa ttaaatgttg gtgattattt tgtgctgaca 16920
tcacatacag taatgccatt aagtgcacct acactagtgc cacaagagca ctatgttaga 16980
attactggct tatacccaac actcaatatc tcagatgagt tttctagcaa tgttgcaaat 17040
tatcaaaagg ttggtatgca aaagtattct acactccagg gaccacctgg tactggtaag 17100
agtcattttg ctattggcct agctctctac tacccttctg ctcgcatagt gtatacagct 17160
tgctctcatg ccgctgttga tgcactatgt gagaaggcat taaaatattt gcctatagat 17220
aaatgtagta gaattatacc tgcacgtgct cgtgtagagt gttttgataa attcaaagtg 17280
aattcaacat tagaacagta tgtcttttgt actgtaaatg cattgcctga gacgacagca 17340
gatatagttg tctttgatga aatttcaatg gccacaaatt atgatttgag tgttgtcaat 17400
gccagattac gtgctaagca ctatgtgtac attggcgacc ctgctcaatt acctgcacca 17460
cgcacattgc taactaaggg cacactagaa ccagaatatt tcaattcagt gtgtagactt 17520
atgaaaacta taggtccaga catgttcctc ggaacttgtc ggcgttgtcc tgctgaaatt 17580
gttgacactg tgagtgcttt ggtttatgat aataagctta aagcacataa agacaaatca 17640
gctcaatgct ttaaaatgtt ttataagggt gttatcacgc atgatgtttc atctgcaatt 17700
aacaggccac aaataggcgt ggtaagagaa ttccttacac gtaaccctgc ttggagaaaa 17760
gctgtcttta tttcacctta taattcacag aatgctgtag cctcaaagat tttgggacta 17820
ccaactcaaa ctgttgattc atcacagggc tcagaatatg actatgtcat attcactcaa 17880
accactgaaa cagctcactc ttgtaatgta aacagattta atgttgctat taccagagca 17940
aaagtaggca tactttgcat aatgtctgat agagaccttt atgacaagtt gcaatttaca 18000
agtcttgaaa ttccacgtag gaatgtggca actttacaag ctgaaaatgt aacaggactc 18060
tttaaagatt gtagtaaggt aatcactggg ttacatccta cacaggcacc tacacacctc 18120
agtgttgaca ctaaattcaa aactgaaggt ttatgtgttg acatacctgg catacctaag 18180
gacatgacct atagaagact catctctatg atgggtttta aaatgaatta tcaagttaat 18240
ggttacccta acatgtttat cacccgcgaa gaagctataa gacatgtacg tgcatggatt 18300
ggcttcgatg tcgaggggtg tcatgctact agagaagctg ttggtaccaa tttaccttta 18360
cagctaggtt tttctacagg tgttaaccta gttgctgtac ctacaggtta tgttgataca 18420
cctaataata cagatttttc cagagttagt gctaaaccac cgcctggaga tcaatttaaa 18480
cacctcatac cacttatgta caaaggactt ccttggaatg tagtgcgtat aaagattgta 18540
caaatgttaa gtgacacact taaaaatctc tctgacagag tcgtatttgt cttatgggca 18600
catggctttg agttgacatc tatgaagtat tttgtgaaaa taggacctga gcgcacctgt 18660
tgtctatgtg atagacgtgc cacatgcttt tccactgctt cagacactta tgcctgttgg 18720
catcattcta ttggatttga ttacgtctat aatccgttta tgattgatgt tcaacaatgg 18780
ggttttacag gtaacctaca aagcaaccat gatctgtatt gtcaagtcca tggtaatgca 18840
catgtagcta gttgtgatgc aatcatgact aggtgtctag ctgtccacga gtgctttgtt 18900
aagcgtgttg actggactat tgaatatcct ataattggtg atgaactgaa gattaatgcg 18960
gcttgtagaa aggttcaaca catggttgtt aaagctgcat tattagcaga caaattccca 19020
gttcttcacg acattggtaa ccctaaagct attaagtgtg tacctcaagc tgatgtagaa 19080
tggaagttct atgatgcaca gccttgtagt gacaaagctt ataaaataga agaattattc 19140
tattcttatg ccacacattc tgacaaattc acagatggtg tatgcctatt ttggaattgc 19200
aatgtcgata gatatcctgc taattccatt gtttgtagat ttgacactag agtgctatct 19260
aaccttaact tgcctggttg tgatggtggc agtttgtatg taaataaaca tgcattccac 19320
acaccagctt ttgataaaag tgcttttgtt aatttaaaac aattaccatt tttctattac 19380
tctgacagtc catgtgagtc tcatggaaaa caagtagtgt cagatataga ttatgtacca 19440
ctaaagtctg ctacgtgtat aacacgttgc aatttaggtg gtgctgtctg tagacatcat 19500
gctaatgagt acagattgta tctcgatgct tataacatga tgatctcagc tggctttagc 19560
ttgtgggttt acaaacaatt tgatacttat aacctctgga acacttttac aagacttcag 19620
agtttagaaa atgtggcttt taatgttgta aataagggac actttgatgg acaacagggt 19680
gaagtaccag tttctatcat taataacact gtttacacaa aagttgatgg tgttgatgta 19740
gaattgtttg aaaataaaac aacattacct gttaatgtag catttgagct ttgggctaag 19800
cgcaacatta aaccagtacc agaggtgaaa atactcaata atttgggtgt ggacattgct 19860
gctaatactg tgatctggga ctacaaaaga gatgctccag cacatatatc tactattggt 19920
gtttgttcta tgactgacat agccaagaaa ccaactgaaa cgatttgtgc accactcact 19980
gtcttttttg atggtagagt tgatggtcaa gtagacttat ttagaaatgc ccgtaatggt 20040
gttcttatta cagaaggtag tgttaaaggt ttacaaccat ctgtaggtcc caaacaagct 20100
agtcttaatg gagtcacatt aattggagaa gccgtaaaaa cacagttcaa ttattataag 20160
aaagttgatg gtgttgtcca acaattacct gaaacttact ttactcagag tagaaattta 20220
caagaattta aacccaggag tcaaatggaa attgatttct tagaattagc tatggatgaa 20280
ttcattgaac ggtataaatt agaaggctat gccttcgaac atatcgttta tggagatttt 20340
agtcatagtc agttaggtgg tttacatcta ctgattggac tagctaaacg ttttaaggaa 20400
tcaccttttg aattagaaga ttttattcct atggacagta cagttaaaaa ctatttcata 20460
acagatgcgc aaacaggttc atctaagtgt gtgtgttctg ttattgattt attacttgat 20520
gattttgttg aaataataaa atcccaagat ttatctgtag tttctaaggt tgtcaaagtg 20580
actattgact atacagaaat ttcatttatg ctttggtgta aagatggcca tgtagaaaca 20640
ttttacccaa aattacaatc tagtcaagcg tggcaaccgg gtgttgctat gcctaatctt 20700
tacaaaatgc aaagaatgct attagaaaag tgtgaccttc aaaattatgg tgatagtgca 20760
acattaccta aaggcataat gatgaatgtc gcaaaatata ctcaactgtg tcaatattta 20820
aacacattaa cattagctgt accctataat atgagagtta tacattttgg tgctggttct 20880
gataaaggag ttgcaccagg tacagctgtt ttaagacagt ggttgcctac gggtacgctg 20940
cttgtcgatt cagatcttaa tgactttgtc tctgatgcag attcaacttt gattggtgat 21000
tgtgcaactg tacatacagc taataaatgg gatctcatta ttagtgatat gtacgaccct 21060
aagactaaaa atgttacaaa agaaaatgac tctaaagagg gttttttcac ttacatttgt 21120
gggtttatac aacaaaagct agctcttgga ggttccgtgg ctataaagat aacagaacat 21180
tcttggaatg ctgatcttta taagctcatg ggacacttcg catggtggac agcctttgtt 21240
actaatgtga atgcgtcatc atctgaagca tttttaattg gatgtaatta tcttggcaaa 21300
ccacgcgaac aaatagatgg ttatgtcatg catgcaaatt acatattttg gaggaataca 21360
aatccaattc agttgtcttc ctattcttta tttgacatga gtaaatttcc ccttaaatta 21420
aggggtactg ctgttatgtc tttaaaagaa ggtcaaatca atgatatgat tttatctctt 21480
cttagtaaag gtagacttat aattagagaa aacaacagag ttgttatttc tagtgatgtt 21540
cttgttaaca actaaacgaa caatgtttgt ttttcttgtt ttattgccac tagtctctag 21600
tcagtgtgtt aatcttacaa ccagaactca attaccccct gcatacacta attctttcac 21660
acgtggtgtt tattaccctg acaaagtttt cagatcctca gttttacatt caactcagga 21720
cttgttctta cctttctttt ccaatgttac ttggttccat gctatacatg tctctgggac 21780
caatggtact aagaggtttg ataaccctgt cctaccattt aatgatggtg tttattttgc 21840
ttccactgag aagtctaaca taataagagg ctggattttt ggtactactt tagattcgaa 21900
gacccagtcc ctacttattg ttaataacgc tactaatgtt gttattaaag tctgtgaatt 21960
tcaattttgt aatgatccat ttttgggtgt ttattaccac aaaaacaaca aaagttggat 22020
ggaaagtgag ttcagagttt attctagtgc gaataattgc acttttgaat atgtctctca 22080
gccttttctt atggaccttg aaggaaaaca gggtaatttc aaaaatctta gggaatttgt 22140
gtttaagaat attgatggtt attttaaaat atattctaag cacacgccta ttaatttagt 22200
gcgtgatctc cctcagggtt tttcggcttt agaaccattg gtagatttgc caataggtat 22260
taacatcact aggtttcaaa ctttacttgc tttacataga agttatttga ctcctggtga 22320
ttcttcttca ggttggacag ctggtgctgc agcttattat gtgggttatc ttcaacctag 22380
gacttttcta ttaaaatata atgaaaatgg aaccattaca gatgctgtag actgtgcact 22440
tgaccctctc tcagaaacaa agtgtacgtt gaaatccttc actgtagaaa aaggaatcta 22500
tcaaacttct aactttagag tccaaccaac agaatctatt gttagatttc ctaatattac 22560
aaacttgtgc ccttttggtg aagtttttaa cgccaccaga tttgcatctg tttatgcttg 22620
gaacaggaag agaatcagca actgtgttgc tgattattct gtcctatata attccgcatc 22680
attttccact tttaagtgtt atggagtgtc tcctactaaa ttaaatgatc tctgctttac 22740
taatgtctat gcagattcat ttgtaattag aggtgatgaa gtcagacaaa tcgctccagg 22800
gcaaactgga aagattgctg attataatta taaattacca gatgatttta caggctgcgt 22860
tatagcttgg aattctaaca atcttgattc taaggttggt ggtaattata attacctgta 22920
tagattgttt aggaagtcta atctcaaacc ttttgagaga gatatttcaa ctgaaatcta 22980
tcaggccggt agcacacctt gtaatggtgt tgaaggtttt aattgttact ttcctttaca 23040
atcatatggt ttccaaccca ctaatggtgt tggttaccaa ccatacagag tagtagtact 23100
ttcttttgaa cttctacatg caccagcaac tgtttgtgga cctaaaaagt ctactaattt 23160
ggttaaaaac aaatgtgtca atttcaactt caatggttta acaggcacag gtgttcttac 23220
tgagtctaac aaaaagtttc tgcctttcca acaatttggc agagacattg ctgacactac 23280
tgatgctgtc cgtgatccac agacacttga gattcttgac attacaccat gttcttttgg 23340
tggtgtcagt gttataacac caggaacaaa tacttctaac caggttgctg ttctttatca 23400
ggatgttaac tgcacagaag tccctgttgc tattcatgca gatcaactta ctcctacttg 23460
gcgtgtttat tctacaggtt ctaatgtttt tcaaacacgt gcaggctgtt taataggggc 23520
tgaacatgtc aacaactcat atgagtgtga catacccatt ggtgcaggta tatgcgctag 23580
ttatcagact cagactaatt ctcctcggcg ggcacgtagt gtagctagtc aatccatcat 23640
tgcctacact atgtcacttg gtgcagaaaa ttcagttgct tactctaata actctattgc 23700
catacccaca aattttacta ttagtgttac cacagaaatt ctaccagtgt ctatgaccaa 23760
gacatcagta gattgtacaa tgtacatttg tggtgattca actgaatgca gcaatctttt 23820
gttgcaatat ggcagttttt gtacacaatt aaaccgtgct ttaactggaa tagctgttga 23880
acaagacaaa aacacccaag aagtttttgc acaagtcaaa caaatttaca aaacaccacc 23940
aattaaagat tttggtggtt ttaatttttc acaaatatta ccagatccat caaaaccaag 24000
caagaggtca tttattgaag atctactttt caacaaagtg acacttgcag atgctggctt 24060
catcaaacaa tatggtgatt gccttggtga tattgctgct agagacctca tttgtgcaca 24120
aaagtttaac ggccttactg ttttgccacc tttgctcaca gatgaaatga ttgctcaata 24180
cacttctgca ctgttagcgg gtacaatcac ttctggttgg acctttggtg caggtgctgc 24240
attacaaata ccatttgcta tgcaaatggc ttataggttt aatggtattg gagttacaca 24300
gaatgttctc tatgagaacc aaaaattgat tgccaaccaa tttaatagtg ctattggcaa 24360
aattcaagac tcactttctt ccacagcaag tgcacttgga aaacttcaag atgtggtcaa 24420
ccaaaatgca caagctttaa acacgcttgt taaacaactt agctccaatt ttggtgcaat 24480
ttcaagtgtt ttaaatgata tcctttcacg tcttgacaaa gttgaggctg aagtgcaaat 24540
tgataggttg atcacaggca gacttcaaag tttgcagaca tatgtgactc aacaattaat 24600
tagagctgca gaaatcagag cttctgctaa tcttgctgct actaaaatgt cagagtgtgt 24660
acttggacaa tcaaaaagag ttgatttttg tggaaagggc tatcatctta tgtccttccc 24720
tcagtcagca cctcatggtg tagtcttctt gcatgtgact tatgtccctg cacaagaaaa 24780
gaacttcaca actgctcctg ccatttgtca tgatggaaaa gcacactttc ctcgtgaagg 24840
tgtctttgtt tcaaatggca cacactggtt tgtaacacaa aggaattttt atgaaccaca 24900
aatcattact acagacaaca catttgtgtc tggtaactgt gatgttgtaa taggaattgt 24960
caacaacaca gtttatgatc ctttgcaacc tgaattagac tcattcaagg aggagttaga 25020
taaatatttt aagaatcata catcaccaga tgttgattta ggtgacatct ctggcattaa 25080
tgcttcagtt gtaaacattc aaaaagaaat tgaccgcctc aatgaggttg ccaagaattt 25140
aaatgaatct ctcatcgatc tccaagaact tggaaagtat gagcagtata taaaatggcc 25200
atggtacatt tggctaggtt ttatagctgg cttgattgcc atagtaatgg tgacaattat 25260
gctttgctgt atgaccagtt gctgtagttg tctcaagggc tgttgttctt gtggatcctg 25320
ctgcaaattt gatgaagacg actctgagcc agtgctcaaa ggagtcaaat tacattacac 25380
ataaacgaac ttatggattt gtttatgaga atcttcacaa ttggaactgt aactttgaag 25440
caaggtgaaa tcaaggatgc tactccttca gattttgttc gcgctactgc aacgataccg 25500
atacaagcct cactcccttt cggatggctt attgttggcg ttgcacttct tgctgttttt 25560
cagagcgctt ccaaaatcat aaccctcaaa aagagatggc aactagcact ctccaagggt 25620
gttcactttg tttgcaactt gctgttgttg tttgtaacag tttactcaca ccttttgctc 25680
gttgctgctg gccttgaagc cccttttctc tatctttatg ctttagtcta cttcttgcag 25740
agtataaact ttgtaagaat aataatgagg ctttggcttt gctggaaatg ccgttccaaa 25800
aacccattac tttatgatgc caactatttt ctttgctggc atactaattg ttacgactat 25860
tgtatacctt acaatagtgt aacttcttca attgtcatta cttcaggtga tggcacaaca 25920
agtcctattt ctgaacatga ctaccagatt ggtggttata ctgaaaaatg ggaatctgga 25980
gtaaaagact gtgttgtatt acacagttac ttcacttcag actattacca gctgtactca 26040
actcaattga gtacagacac tggtgttgaa catgttacct tcttcatcta caataaaatt 26100
gttgatgagc ctgaagaaca tgtccaaatt cacacaatcg acgtttcatc cggagttgtt 26160
aatccagtaa tggaaccaat ttatgatgaa ccgacgacga ctactagcgt gcctttgtaa 26220
gcacaagctg atgagtacga acttatgtac tcattcgttt cggaagagac aggtacgtta 26280
atagttaata gcgtacttct ttttcttgct ttcgtggtat tcttgctagt tacactagcc 26340
atccttactg cgcttcgatt gtgtgcgtac tgctgcaata ttgttaacgt gagtcttgta 26400
aaaccttctt tttacgttta ctctcgtgtt aaaaatctga attcttctag agttcctgat 26460
cttctggtct aaacgaacta aatattatat tagtttttct gtttggaact ttaattttag 26520
ccatggcaga ttccaacggt actattaccg ttgaagagct taaaaagctc cttgaacaat 26580
ggaacctagt aataggtttc ctattcctta catggatttg tcttctacaa tttgcctatg 26640
ccaacaggaa taggtttttg tatataatta agttaatttt cctctggctg ttatggccag 26700
taactttagc ttgttttgtg cttgctgctg tttacagaat aaattggatc accggtggaa 26760
ttgctatcgc aatggcttgt cttgtaggct tgatgtggct cagctacttc attgcttctt 26820
tcagactgtt tgcgcgtacg cgttccatgt ggtcattcaa tccagaaact aacattcttc 26880
tcaacgtgcc actccatggc actattctga ccagaccgct tctagaaagt gaactcgtaa 26940
tcggagctgt gatccttcgt ggacatcttc gtattgctgg acaccatcta ggacgctgtg 27000
acatcaagga cctgcctaaa gaaatcactg ttgctacatc acgaacgctt tcttattaca 27060
aattgggagc ttcgcagcgt gtagcaggtg actcaggttt tgctgcatac agtcgctaca 27120
ggattggcaa ctataaatta aacacagacc attccagtag cagtgacaat attgctttgc 27180
ttgtacagta agtgacaaca gatgtttcat ctcgttgact ttcaggttac tatagcagag 27240
atattactaa ttattatgag gacttttaaa gtttccattt ggaatcttga ttacatcata 27300
aacctcataa ttaaaaattt atctaagtca ctaactgaga ataaatattc tcaattagat 27360
gaagagcaac caatggagat tgattaaacg aacatgaaaa ttattctttt cttggcactg 27420
ataacactcg ctacttgtga gctttatcac taccaagagt gtgttagagg tacaacagta 27480
cttttaaaag aaccttgctc ttctggaaca tacgagggca attcaccatt tcatcctcta 27540
gctgataaca aatttgcact gacttgcttt agcactcaat ttgcttttgc ttgtcctgac 27600
ggcgtaaaac acgtctatca gttacgtgcc agatcagttt cacctaaact gttcatcaga 27660
caagaggaag ttcaagaact ttactctcca atttttctta ttgttgcggc aatagtgttt 27720
ataacacttt gcttcacact caaaagaaag acagaatgat tgaactttca ttaattgact 27780
tctatttgtg ctttttagcc tttctgctat tccttgtttt aattatgctt attatctttt 27840
ggttctcact tgaactgcaa gatcataatg aaacttgtca cgcctaaacg aacatgaaat 27900
ttcttgtttt cttaggaatc atcacaactg tagctgcatt tcaccaagaa tgtagtttac 27960
agtcatgtac tcaacatcaa ccatatgtag ttgatgaccc gtgtcctatt cacttctatt 28020
ctaaatggta tattagagta ggagctagaa aatcagcacc tttaattgaa ttgtgcgtgg 28080
atgaggctgg ttctaaatca cccattcagt acatcgatat cggtaattat acagtttcct 28140
gtttaccttt tacaattaat tgccaggaac ctaaattggg tagtcttgta gtgcgttgtt 28200
cgttctatga agacttttta gagtatcatg acgttcgtgt tgttttagat ttcatctaaa 28260
cgaacaaact aaaatgtctg ataatggacc ccaaaatcag cgaaatgcac cccgcattac 28320
gtttggtgga ccctcagatt caactggcag taaccagaat ggagaacgca gtggggcgcg 28380
atcaaaacaa cgtcggcccc aaggtttacc caataatact gcgtcttggt tcaccgctct 28440
cactcaacat ggcaaggaag accttaaatt ccctcgagga caaggcgttc caattaacac 28500
caatagcagt ccagatgacc aaattggcta ctaccgaaga gctaccagac gaattcgtgg 28560
tggtgacggt aaaatgaaag atctcagtcc aagatggtat ttctactacc taggaactgg 28620
gccagaagct ggacttccct atggtgctaa caaagacggc atcatatggg ttgcaactga 28680
gggagccttg aatacaccaa aagatcacat tggcacccgc aatcctgcta acaatgctgc 28740
aatcgtgcta caacttcctc aaggaacaac attgccaaaa ggcttctacg cagaagggag 28800
cagaggcggc agtcaagcct cttctcgttc ctcatcacgt agtcgcaaca gttcaagaaa 28860
ttcaactcca ggcagcagta ggggaacttc tcctgctaga atggctggca atggcggtga 28920
tgctgctctt gctttgctgc tgcttgacag attgaaccag cttgagagca aaatgtctgg 28980
taaaggccaa caacaacaag gccaaactgt cactaagaaa tctgctgctg aggcttctaa 29040
gaagcctcgg caaaaacgta ctgccactaa agcatacaat gtaacacaag ctttcggcag 29100
acgtggtcca gaacaaaccc aaggaaattt tggggaccag gaactaatca gacaaggaac 29160
tgattacaaa cattggccgc aaattgcaca atttgccccc agcgcttcag cgttcttcgg 29220
aatgtcgcgc attggcatgg aagtcacacc ttcgggaacg tggttgacct acacaggtgc 29280
catcaaattg gatgacaaag atccaaattt caaagatcaa gtcattttgc tgaataagca 29340
tattgacgca tacaaaacat tcccaccaac agagcctaaa aaggacaaaa agaagaaggc 29400
tgatgaaact caagccttac cgcagagaca gaagaaacag caaactgtga ctcttcttcc 29460
tgctgcagat ttggatgatt tctccaaaca attgcaacaa tccatgagca gtgctgactc 29520
aactcaggcc taaactcatg cagaccacac aaggcagatg ggctatataa acgttttcgc 29580
ttttccgttt acgatatata gtctactctt gtgcagaatg aattctcgta actacatagc 29640
acaagtagat gtagttaact ttaatctcac atagcaatct ttaatcagtg tgtaacatta 29700
gggaggactt gaaagagcca ccacattttc accgaggcca cgcggagtac gatcgagtgt 29760
acagtgaaca atgctaggga gagctgccta tatggaagag ccctaatgtg taaaattaat 29820
tttagtagtg ctatccccat gtgattttaa tagcttctta ggagaat 29867
<210> 10
<211> 7062
<212> PRT
<213> SARS-CoV2
<220>
<221> MISC_FEATURE
<222> (3606)
<223> Xaa can be any naturally occurring amino acid
<400> 10
Met Glu Ser Leu Val Pro Gly Phe Asn Glu Lys Thr His Val Gln Leu
1 5 10 15
Ser Leu Pro Val Leu Gln Val Arg Asp Val Leu Val Arg Gly Phe Gly
20 25 30
Asp Ser Val Glu Glu Val Leu Ser Glu Ala Arg Gln His Leu Lys Asp
35 40 45
Gly Thr Cys Gly Leu Val Glu Val Glu Lys Gly Val Leu Pro Gln Leu
50 55 60
Glu Gln Pro Tyr Val Phe Ile Lys Arg Ser Asp Ala Arg Thr Ala Pro
65 70 75 80
His Gly His Val Met Val Glu Leu Val Ala Glu Leu Glu Gly Ile Gln
85 90 95
Tyr Gly Arg Ser Gly Glu Thr Leu Gly Val Leu Val Pro His Val Gly
100 105 110
Glu Ile Pro Val Ala Tyr Arg Lys Val Leu Leu Arg Lys Asn Gly Asn
115 120 125
Lys Gly Ala Gly Gly His Ser Tyr Gly Ala Asp Leu Lys Ser Phe Asp
130 135 140
Leu Gly Asp Glu Leu Gly Thr Asp Pro Tyr Glu Asp Phe Gln Glu Asn
145 150 155 160
Trp Asn Thr Lys His Ser Ser Gly Val Thr Arg Glu Leu Met Arg Glu
165 170 175
Leu Asn Gly Gly Ala Tyr Thr Arg Tyr Val Asp Asn Asn Phe Cys Gly
180 185 190
Pro Asp Gly Tyr Pro Leu Glu Cys Ile Lys Asp Leu Leu Ala Arg Ala
195 200 205
Gly Lys Ala Ser Cys Thr Leu Ser Glu Gln Leu Asp Phe Ile Asp Thr
210 215 220
Lys Arg Gly Val Tyr Cys Cys Arg Glu His Glu His Glu Ile Ala Trp
225 230 235 240
Tyr Thr Glu Arg Ser Glu Lys Ser Tyr Glu Leu Gln Thr Pro Phe Glu
245 250 255
Ile Lys Leu Ala Lys Lys Phe Asp Thr Phe Asn Gly Glu Cys Pro Asn
260 265 270
Phe Val Phe Pro Leu Asn Ser Ile Ile Lys Thr Ile Gln Pro Arg Val
275 280 285
Glu Lys Lys Lys Leu Asp Gly Phe Met Gly Arg Ile Arg Ser Val Tyr
290 295 300
Pro Val Ala Ser Pro Asn Glu Cys Asn Gln Met Cys Leu Ser Thr Leu
305 310 315 320
Met Lys Cys Asp His Cys Gly Glu Thr Ser Trp Gln Thr Gly Asp Phe
325 330 335
Val Lys Ala Thr Cys Glu Phe Cys Gly Thr Glu Asn Leu Thr Lys Glu
340 345 350
Gly Ala Thr Thr Cys Gly Tyr Leu Pro Gln Asn Ala Val Val Lys Ile
355 360 365
Tyr Cys Pro Ala Cys His Asn Ser Glu Val Gly Pro Glu His Ser Leu
370 375 380
Ala Glu Tyr His Asn Glu Ser Gly Leu Lys Thr Ile Leu Arg Lys Gly
385 390 395 400
Gly Arg Thr Ile Ala Phe Gly Gly Cys Val Phe Ser Tyr Val Gly Cys
405 410 415
His Asn Lys Cys Ala Tyr Trp Val Pro Arg Ala Ser Ala Asn Ile Gly
420 425 430
Cys Asn His Thr Gly Val Val Gly Glu Gly Ser Glu Gly Leu Asn Asp
435 440 445
Asn Leu Leu Glu Ile Leu Gln Lys Glu Lys Val Asn Ile Asn Ile Val
450 455 460
Gly Asp Phe Lys Leu Asn Glu Glu Ile Ala Ile Ile Leu Ala Ser Phe
465 470 475 480
Ser Ala Ser Thr Ser Ala Phe Val Glu Thr Val Lys Gly Leu Asp Tyr
485 490 495
Lys Ala Phe Lys Gln Ile Val Glu Ser Cys Gly Asn Phe Lys Val Thr
500 505 510
Lys Gly Lys Ala Lys Lys Gly Ala Trp Asn Ile Gly Glu Gln Lys Ser
515 520 525
Ile Leu Ser Pro Leu Tyr Ala Phe Ala Ser Glu Ala Ala Arg Val Val
530 535 540
Arg Ser Ile Phe Ser Arg Thr Leu Glu Thr Ala Gln Asn Ser Val Arg
545 550 555 560
Val Leu Gln Lys Ala Ala Ile Thr Ile Leu Asp Gly Ile Ser Gln Tyr
565 570 575
Ser Leu Arg Leu Ile Asp Ala Met Met Phe Thr Ser Asp Leu Ala Thr
580 585 590
Asn Asn Leu Val Val Met Ala Tyr Ile Thr Gly Gly Val Val Gln Leu
595 600 605
Thr Ser Gln Trp Leu Thr Asn Ile Phe Gly Thr Val Tyr Glu Lys Leu
610 615 620
Lys Pro Val Leu Asp Trp Leu Glu Glu Lys Phe Lys Glu Gly Val Glu
625 630 635 640
Phe Leu Arg Asp Gly Trp Glu Ile Val Lys Phe Ile Ser Thr Cys Ala
645 650 655
Cys Glu Ile Val Gly Gly Gln Ile Val Thr Cys Ala Lys Glu Ile Lys
660 665 670
Glu Ser Val Gln Thr Phe Phe Lys Leu Val Asn Lys Phe Leu Ala Leu
675 680 685
Cys Ala Asp Ser Ile Ile Ile Gly Gly Ala Lys Leu Lys Ala Leu Asn
690 695 700
Leu Gly Glu Thr Phe Val Thr His Ser Lys Gly Leu Tyr Arg Lys Cys
705 710 715 720
Val Lys Ser Arg Glu Glu Thr Gly Leu Leu Met Pro Leu Lys Ala Pro
725 730 735
Lys Glu Ile Ile Phe Leu Glu Gly Glu Thr Leu Pro Thr Glu Val Leu
740 745 750
Thr Glu Glu Val Val Leu Lys Thr Gly Asp Leu Gln Pro Leu Glu Gln
755 760 765
Pro Thr Ser Glu Ala Val Glu Ala Pro Leu Val Gly Thr Pro Val Cys
770 775 780
Ile Asn Gly Leu Met Leu Leu Glu Ile Lys Asp Thr Glu Lys Tyr Cys
785 790 795 800
Ala Leu Ala Pro Asn Met Met Val Thr Asn Asn Thr Phe Thr Leu Lys
805 810 815
Gly Gly Ala Pro Thr Lys Val Thr Phe Gly Asp Asp Thr Val Ile Glu
820 825 830
Val Gln Gly Tyr Lys Ser Val Asn Ile Thr Phe Glu Leu Asp Glu Arg
835 840 845
Ile Asp Lys Val Leu Asn Glu Lys Cys Ser Ala Tyr Thr Val Glu Leu
850 855 860
Gly Thr Glu Val Asn Glu Phe Ala Cys Val Val Ala Asp Ala Val Ile
865 870 875 880
Lys Thr Leu Gln Pro Val Ser Glu Leu Leu Thr Pro Leu Gly Ile Asp
885 890 895
Leu Asp Glu Trp Ser Met Ala Thr Tyr Tyr Leu Phe Asp Glu Ser Gly
900 905 910
Glu Phe Lys Leu Ala Ser His Met Tyr Cys Ser Phe Tyr Pro Pro Asp
915 920 925
Glu Asp Glu Glu Glu Gly Asp Cys Glu Glu Glu Glu Phe Glu Pro Ser
930 935 940
Thr Gln Tyr Glu Tyr Gly Thr Glu Asp Asp Tyr Gln Gly Lys Pro Leu
945 950 955 960
Glu Phe Gly Ala Thr Ser Ala Ala Leu Gln Pro Glu Glu Glu Gln Glu
965 970 975
Glu Asp Trp Leu Asp Asp Asp Ser Gln Gln Thr Val Gly Gln Gln Asp
980 985 990
Gly Ser Glu Asp Asn Gln Thr Thr Thr Ile Gln Thr Ile Val Glu Val
995 1000 1005
Gln Pro Gln Leu Glu Met Glu Leu Thr Pro Val Val Gln Thr Ile Glu
1010 1015 1020
Val Asn Ser Phe Ser Gly Tyr Leu Lys Leu Thr Asp Asn Val Tyr Ile
1025 1030 1035 1040
Lys Asn Ala Asp Ile Val Glu Glu Ala Lys Lys Val Lys Pro Thr Val
1045 1050 1055
Val Val Asn Ala Ala Asn Val Tyr Leu Lys His Gly Gly Gly Val Ala
1060 1065 1070
Gly Ala Leu Asn Lys Ala Thr Asn Asn Ala Met Gln Val Glu Ser Asp
1075 1080 1085
Asp Tyr Ile Ala Thr Asn Gly Pro Leu Lys Val Gly Gly Ser Cys Val
1090 1095 1100
Leu Ser Gly His Asn Leu Ala Lys His Cys Leu His Val Val Gly Pro
1105 1110 1115 1120
Asn Val Asn Lys Gly Glu Asp Ile Gln Leu Leu Lys Ser Ala Tyr Glu
1125 1130 1135
Asn Phe Asn Gln His Glu Val Leu Leu Ala Pro Leu Leu Ser Ala Gly
1140 1145 1150
Ile Phe Gly Ala Asp Pro Ile His Ser Leu Arg Val Cys Val Asp Thr
1155 1160 1165
Val Arg Thr Asn Val Tyr Leu Ala Val Phe Asp Lys Asn Leu Tyr Asp
1170 1175 1180
Lys Leu Val Ser Ser Phe Leu Glu Met Lys Ser Glu Lys Gln Val Glu
1185 1190 1195 1200
Gln Lys Ile Ala Glu Ile Pro Lys Glu Glu Val Lys Pro Phe Ile Thr
1205 1210 1215
Glu Ser Lys Pro Ser Val Glu Gln Arg Lys Gln Asp Asp Lys Lys Ile
1220 1225 1230
Lys Ala Cys Val Glu Glu Val Thr Thr Thr Leu Glu Glu Thr Lys Phe
1235 1240 1245
Leu Thr Glu Asn Leu Leu Leu Tyr Ile Asp Ile Asn Gly Asn Leu His
1250 1255 1260
Pro Asp Ser Ala Thr Leu Val Ser Asp Ile Asp Ile Thr Phe Leu Lys
1265 1270 1275 1280
Lys Asp Ala Pro Tyr Ile Val Gly Asp Val Val Gln Glu Gly Val Leu
1285 1290 1295
Thr Ala Val Val Ile Pro Thr Lys Lys Ala Gly Gly Thr Thr Glu Met
1300 1305 1310
Leu Ala Lys Ala Leu Arg Lys Val Pro Thr Asp Asn Tyr Ile Thr Thr
1315 1320 1325
Tyr Pro Gly Gln Gly Leu Asn Gly Tyr Thr Val Glu Glu Ala Lys Thr
1330 1335 1340
Val Leu Lys Lys Cys Lys Ser Ala Phe Tyr Ile Leu Pro Ser Ile Ile
1345 1350 1355 1360
Ser Asn Glu Lys Gln Glu Ile Leu Gly Thr Val Ser Trp Asn Leu Arg
1365 1370 1375
Glu Met Leu Ala His Ala Glu Glu Thr Arg Lys Leu Met Pro Val Cys
1380 1385 1390
Val Glu Thr Lys Ala Ile Val Ser Thr Ile Gln Arg Lys Tyr Lys Gly
1395 1400 1405
Ile Lys Ile Gln Glu Gly Val Val Asp Tyr Gly Ala Arg Phe Tyr Phe
1410 1415 1420
Tyr Thr Ser Lys Thr Thr Val Ala Ser Leu Ile Asn Thr Leu Asn Asp
1425 1430 1435 1440
Leu Asn Glu Thr Leu Val Thr Met Pro Leu Gly Tyr Val Thr His Gly
1445 1450 1455
Leu Asn Leu Glu Glu Ala Ala Arg Tyr Met Arg Ser Leu Lys Val Pro
1460 1465 1470
Ala Thr Val Ser Val Ser Ser Pro Asp Ala Val Thr Ala Tyr Asn Gly
1475 1480 1485
Tyr Leu Thr Ser Ser Ser Lys Thr Pro Glu Glu His Phe Ile Glu Thr
1490 1495 1500
Ile Ser Leu Ala Gly Ser Tyr Lys Asp Trp Ser Tyr Ser Gly Gln Ser
1505 1510 1515 1520
Thr Gln Leu Gly Ile Glu Phe Leu Lys Arg Gly Asp Lys Ser Val Tyr
1525 1530 1535
Tyr Thr Ser Asn Pro Thr Thr Phe His Leu Asp Gly Glu Val Ile Thr
1540 1545 1550
Phe Asp Asn Leu Lys Thr Leu Leu Ser Leu Arg Glu Val Arg Thr Ile
1555 1560 1565
Lys Val Phe Thr Thr Val Asp Asn Ile Asn Leu His Thr Gln Val Val
1570 1575 1580
Asp Met Ser Met Thr Tyr Gly Gln Gln Phe Gly Pro Thr Tyr Leu Asp
1585 1590 1595 1600
Gly Ala Asp Val Thr Lys Ile Lys Pro His Asn Ser His Glu Gly Lys
1605 1610 1615
Thr Phe Tyr Val Leu Pro Asn Asp Asp Thr Leu Arg Val Glu Ala Phe
1620 1625 1630
Glu Tyr Tyr His Thr Thr Asp Pro Ser Phe Leu Gly Arg Tyr Met Ser
1635 1640 1645
Ala Leu Asn His Thr Lys Lys Trp Lys Tyr Pro Gln Val Asn Gly Leu
1650 1655 1660
Thr Ser Ile Lys Trp Ala Asp Asn Asn Cys Tyr Leu Ala Thr Ala Leu
1665 1670 1675 1680
Leu Thr Leu Gln Gln Ile Glu Leu Lys Phe Asn Pro Pro Ala Leu Gln
1685 1690 1695
Asp Ala Tyr Tyr Arg Ala Arg Ala Gly Glu Ala Ala Asn Phe Cys Ala
1700 1705 1710
Leu Ile Leu Ala Tyr Cys Asn Lys Thr Val Gly Glu Leu Gly Asp Val
1715 1720 1725
Arg Glu Thr Met Ser Tyr Leu Phe Gln His Ala Asn Leu Asp Ser Cys
1730 1735 1740
Lys Arg Val Leu Asn Val Val Cys Lys Thr Cys Gly Gln Gln Gln Thr
1745 1750 1755 1760
Thr Leu Lys Gly Val Glu Ala Val Met Tyr Met Gly Thr Leu Ser Tyr
1765 1770 1775
Glu Gln Phe Lys Lys Gly Val Gln Ile Pro Cys Thr Cys Gly Lys Gln
1780 1785 1790
Ala Thr Lys Tyr Leu Val Gln Gln Glu Ser Pro Phe Val Met Met Ser
1795 1800 1805
Ala Pro Pro Ala Gln Tyr Glu Leu Lys His Gly Thr Phe Thr Cys Ala
1810 1815 1820
Ser Glu Tyr Thr Gly Asn Tyr Gln Cys Gly His Tyr Lys His Ile Thr
1825 1830 1835 1840
Ser Lys Glu Thr Leu Tyr Cys Ile Asp Gly Ala Leu Leu Thr Lys Ser
1845 1850 1855
Ser Glu Tyr Lys Gly Pro Ile Thr Asp Val Phe Tyr Lys Glu Asn Ser
1860 1865 1870
Tyr Thr Thr Thr Ile Lys Pro Val Thr Tyr Lys Leu Asp Gly Val Val
1875 1880 1885
Cys Thr Glu Ile Asp Pro Lys Leu Asp Asn Tyr Tyr Lys Lys Asp Asn
1890 1895 1900
Ser Tyr Phe Thr Glu Gln Pro Ile Asp Leu Val Pro Asn Gln Pro Tyr
1905 1910 1915 1920
Pro Asn Ala Ser Phe Asp Asn Phe Lys Phe Val Cys Asp Asn Ile Lys
1925 1930 1935
Phe Ala Asp Asp Leu Asn Gln Leu Thr Gly Tyr Lys Lys Pro Ala Ser
1940 1945 1950
Arg Glu Leu Lys Val Thr Phe Phe Pro Asp Leu Asn Gly Asp Val Val
1955 1960 1965
Ala Ile Asp Tyr Lys His Tyr Thr Pro Ser Phe Lys Lys Gly Ala Lys
1970 1975 1980
Leu Leu His Lys Pro Ile Val Trp His Val Asn Asn Ala Thr Asn Lys
1985 1990 1995 2000
Ala Thr Tyr Lys Pro Asn Thr Trp Cys Ile Arg Cys Leu Trp Ser Thr
2005 2010 2015
Lys Pro Val Glu Thr Ser Asn Ser Phe Asp Val Leu Lys Ser Glu Asp
2020 2025 2030
Ala Gln Gly Met Asp Asn Leu Ala Cys Glu Asp Leu Lys Pro Val Ser
2035 2040 2045
Glu Glu Val Val Glu Asn Pro Thr Ile Gln Lys Asp Val Leu Glu Cys
2050 2055 2060
Asn Val Lys Thr Thr Glu Val Val Gly Asp Ile Ile Leu Lys Pro Ala
2065 2070 2075 2080
Asn Asn Ser Leu Lys Ile Thr Glu Glu Val Gly His Thr Asp Leu Met
2085 2090 2095
Ala Ala Tyr Val Asp Asn Ser Ser Leu Thr Ile Lys Lys Pro Asn Glu
2100 2105 2110
Leu Ser Arg Val Leu Gly Leu Lys Thr Leu Ala Thr His Gly Leu Ala
2115 2120 2125
Ala Val Asn Ser Val Pro Trp Asp Thr Ile Ala Asn Tyr Ala Lys Pro
2130 2135 2140
Phe Leu Asn Lys Val Val Ser Thr Thr Thr Asn Ile Val Thr Arg Cys
2145 2150 2155 2160
Leu Asn Arg Val Cys Thr Asn Tyr Met Pro Tyr Phe Phe Thr Leu Leu
2165 2170 2175
Leu Gln Leu Cys Thr Phe Thr Arg Ser Thr Asn Ser Arg Ile Lys Ala
2180 2185 2190
Ser Met Pro Thr Thr Ile Ala Lys Asn Thr Val Lys Ser Val Gly Lys
2195 2200 2205
Phe Cys Leu Glu Ala Ser Phe Asn Tyr Leu Lys Ser Pro Asn Phe Ser
2210 2215 2220
Lys Leu Ile Asn Ile Ile Ile Trp Phe Leu Leu Leu Ser Val Cys Leu
2225 2230 2235 2240
Gly Ser Leu Ile Tyr Ser Thr Ala Ala Leu Gly Val Leu Met Ser Asn
2245 2250 2255
Leu Gly Met Pro Ser Tyr Cys Thr Gly Tyr Arg Glu Gly Tyr Leu Asn
2260 2265 2270
Ser Thr Asn Val Thr Ile Ala Thr Tyr Cys Thr Gly Ser Ile Pro Cys
2275 2280 2285
Ser Val Cys Leu Ser Gly Leu Asp Ser Leu Asp Thr Tyr Pro Ser Leu
2290 2295 2300
Glu Thr Ile Gln Ile Thr Ile Ser Ser Phe Lys Trp Asp Leu Thr Ala
2305 2310 2315 2320
Phe Gly Leu Val Ala Glu Trp Phe Leu Ala Tyr Ile Leu Phe Thr Arg
2325 2330 2335
Phe Phe Tyr Val Leu Gly Leu Ala Ala Ile Met Gln Leu Phe Phe Ser
2340 2345 2350
Tyr Phe Ala Val His Phe Ile Ser Asn Ser Trp Leu Met Trp Leu Ile
2355 2360 2365
Ile Asn Leu Val Gln Met Ala Pro Ile Ser Ala Met Val Arg Met Tyr
2370 2375 2380
Ile Phe Phe Ala Ser Phe Tyr Tyr Val Trp Lys Ser Tyr Val His Val
2385 2390 2395 2400
Val Asp Gly Cys Asn Ser Ser Thr Cys Met Met Cys Tyr Lys Arg Asn
2405 2410 2415
Arg Ala Thr Arg Val Glu Cys Thr Thr Ile Val Asn Gly Val Arg Arg
2420 2425 2430
Ser Phe Tyr Val Tyr Ala Asn Gly Gly Lys Gly Phe Cys Lys Leu His
2435 2440 2445
Asn Trp Asn Cys Val Asn Cys Asp Thr Phe Cys Ala Gly Ser Thr Phe
2450 2455 2460
Ile Ser Asp Glu Val Ala Arg Asp Leu Ser Leu Gln Phe Lys Arg Pro
2465 2470 2475 2480
Ile Asn Pro Thr Asp Gln Ser Ser Tyr Ile Val Asp Ser Val Thr Val
2485 2490 2495
Lys Asn Gly Ser Ile His Leu Tyr Phe Asp Lys Ala Gly Gln Lys Thr
2500 2505 2510
Tyr Glu Arg His Ser Leu Ser His Phe Val Asn Leu Asp Asn Leu Arg
2515 2520 2525
Ala Asn Asn Thr Lys Gly Ser Leu Pro Ile Asn Val Ile Val Phe Asp
2530 2535 2540
Gly Lys Ser Lys Cys Glu Glu Ser Ser Ala Lys Ser Ala Ser Val Tyr
2545 2550 2555 2560
Tyr Ser Gln Leu Met Cys Gln Pro Ile Leu Leu Leu Asp Gln Ala Leu
2565 2570 2575
Val Ser Asp Val Gly Asp Ser Ala Glu Val Ala Val Lys Met Phe Asp
2580 2585 2590
Ala Tyr Val Asn Thr Phe Ser Ser Thr Phe Asn Val Pro Met Glu Lys
2595 2600 2605
Leu Lys Thr Leu Val Ala Thr Ala Glu Ala Glu Leu Ala Lys Asn Val
2610 2615 2620
Ser Leu Asp Asn Val Leu Ser Thr Phe Ile Ser Ala Ala Arg Gln Gly
2625 2630 2635 2640
Phe Val Asp Ser Asp Val Glu Thr Lys Asp Val Val Glu Cys Leu Lys
2645 2650 2655
Leu Ser His Gln Ser Asp Ile Glu Val Thr Gly Asp Ser Cys Asn Asn
2660 2665 2670
Tyr Met Leu Thr Tyr Asn Lys Val Glu Asn Met Thr Pro Arg Asp Leu
2675 2680 2685
Gly Ala Cys Ile Asp Cys Ser Ala Arg His Ile Asn Ala Gln Val Ala
2690 2695 2700
Lys Ser His Asn Ile Ala Leu Ile Trp Asn Val Lys Asp Phe Met Ser
2705 2710 2715 2720
Leu Ser Glu Gln Leu Arg Lys Gln Ile Arg Ser Ala Ala Lys Lys Asn
2725 2730 2735
Asn Leu Pro Phe Lys Leu Thr Cys Ala Thr Thr Arg Gln Val Val Asn
2740 2745 2750
Val Val Thr Thr Lys Ile Ala Leu Lys Gly Gly Lys Ile Val Asn Asn
2755 2760 2765
Trp Leu Lys Gln Leu Ile Lys Val Thr Leu Val Phe Leu Phe Val Ala
2770 2775 2780
Ala Ile Phe Tyr Leu Ile Thr Pro Val His Val Met Ser Lys His Thr
2785 2790 2795 2800
Asp Phe Ser Ser Glu Ile Ile Gly Tyr Lys Ala Ile Asp Gly Gly Val
2805 2810 2815
Thr Arg Asp Ile Ala Ser Thr Asp Thr Cys Phe Ala Asn Lys His Ala
2820 2825 2830
Asp Phe Asp Thr Trp Phe Ser Gln Arg Gly Gly Ser Tyr Thr Asn Asp
2835 2840 2845
Lys Ala Cys Pro Leu Ile Ala Ala Val Ile Thr Arg Glu Val Gly Phe
2850 2855 2860
Val Val Pro Gly Leu Pro Gly Thr Ile Leu Arg Thr Thr Asn Gly Asp
2865 2870 2875 2880
Phe Leu His Phe Leu Pro Arg Val Phe Ser Ala Val Gly Asn Ile Cys
2885 2890 2895
Tyr Thr Pro Ser Lys Leu Ile Glu Tyr Thr Asp Phe Ala Thr Ser Ala
2900 2905 2910
Cys Val Leu Ala Ala Glu Cys Thr Ile Phe Lys Asp Ala Ser Gly Lys
2915 2920 2925
Pro Val Pro Tyr Cys Tyr Asp Thr Asn Val Leu Glu Gly Ser Val Ala
2930 2935 2940
Tyr Glu Ser Leu Arg Pro Asp Thr Arg Tyr Val Leu Met Asp Gly Ser
2945 2950 2955 2960
Ile Ile Gln Phe Pro Asn Thr Tyr Leu Glu Gly Ser Val Arg Val Val
2965 2970 2975
Thr Thr Phe Asp Ser Glu Tyr Cys Arg His Gly Thr Cys Glu Arg Ser
2980 2985 2990
Glu Ala Gly Val Cys Val Ser Thr Ser Gly Arg Trp Val Leu Asn Asn
2995 3000 3005
Asp Tyr Tyr Arg Ser Leu Pro Gly Val Phe Cys Gly Val Asp Ala Val
3010 3015 3020
Asn Leu Leu Thr Asn Met Phe Thr Pro Leu Ile Gln Pro Ile Gly Ala
3025 3030 3035 3040
Leu Asp Ile Ser Ala Ser Ile Val Ala Gly Gly Ile Val Ala Ile Val
3045 3050 3055
Val Thr Cys Leu Ala Tyr Tyr Phe Met Arg Phe Arg Arg Ala Phe Gly
3060 3065 3070
Glu Tyr Ser His Val Val Ala Phe Asn Thr Leu Leu Phe Leu Met Ser
3075 3080 3085
Phe Thr Val Leu Cys Leu Thr Pro Val Tyr Ser Phe Leu Pro Gly Val
3090 3095 3100
Tyr Ser Val Ile Tyr Leu Tyr Leu Thr Phe Tyr Leu Thr Asn Asp Val
3105 3110 3115 3120
Ser Phe Leu Ala His Ile Gln Trp Met Val Met Phe Thr Pro Leu Val
3125 3130 3135
Pro Phe Trp Ile Thr Ile Ala Tyr Ile Ile Cys Ile Ser Thr Lys His
3140 3145 3150
Phe Tyr Trp Phe Phe Ser Asn Tyr Leu Lys Arg Arg Val Val Phe Asn
3155 3160 3165
Gly Val Ser Phe Ser Thr Phe Glu Glu Ala Ala Leu Cys Thr Phe Leu
3170 3175 3180
Leu Asn Lys Glu Met Tyr Leu Lys Leu Arg Ser Asp Val Leu Leu Pro
3185 3190 3195 3200
Leu Thr Gln Tyr Asn Arg Tyr Leu Ala Leu Tyr Asn Lys Tyr Lys Tyr
3205 3210 3215
Phe Ser Gly Ala Met Asp Thr Thr Ser Tyr Arg Glu Ala Ala Cys Cys
3220 3225 3230
His Leu Ala Lys Ala Leu Asn Asp Phe Ser Asn Ser Gly Ser Asp Val
3235 3240 3245
Leu Tyr Gln Pro Pro Gln Thr Ser Ile Thr Ser Ala Val Leu Gln Ser
3250 3255 3260
Gly Phe Arg Lys Met Ala Phe Pro Ser Gly Lys Val Glu Gly Cys Met
3265 3270 3275 3280
Val Gln Val Thr Cys Gly Thr Thr Thr Leu Asn Gly Leu Trp Leu Asp
3285 3290 3295
Asp Val Val Tyr Cys Pro Arg His Val Ile Cys Thr Ser Glu Asp Met
3300 3305 3310
Leu Asn Pro Asn Tyr Glu Asp Leu Leu Ile Arg Lys Ser Asn His Asn
3315 3320 3325
Phe Leu Val Gln Ala Gly Asn Val Gln Leu Arg Val Ile Gly His Ser
3330 3335 3340
Met Gln Asn Cys Val Leu Lys Leu Lys Val Asp Thr Ala Asn Pro Lys
3345 3350 3355 3360
Thr Pro Lys Tyr Lys Phe Val Arg Ile Gln Pro Gly Gln Thr Phe Ser
3365 3370 3375
Val Leu Ala Cys Tyr Asn Gly Ser Pro Ser Gly Val Tyr Gln Cys Ala
3380 3385 3390
Met Arg Pro Asn Phe Thr Ile Lys Gly Ser Phe Leu Asn Gly Ser Cys
3395 3400 3405
Gly Ser Val Gly Phe Asn Ile Asp Tyr Asp Cys Val Ser Phe Cys Tyr
3410 3415 3420
Met His His Met Glu Leu Pro Thr Gly Val His Ala Gly Thr Asp Leu
3425 3430 3435 3440
Glu Gly Asn Phe Tyr Gly Pro Phe Val Asp Arg Gln Thr Ala Gln Ala
3445 3450 3455
Ala Gly Thr Asp Thr Thr Ile Thr Val Asn Val Leu Ala Trp Leu Tyr
3460 3465 3470
Ala Ala Val Ile Asn Gly Asp Arg Trp Phe Leu Asn Arg Phe Thr Thr
3475 3480 3485
Thr Leu Asn Asp Phe Asn Leu Val Ala Met Lys Tyr Asn Tyr Glu Pro
3490 3495 3500
Leu Thr Gln Asp His Val Asp Ile Leu Gly Pro Leu Ser Ala Gln Thr
3505 3510 3515 3520
Gly Ile Ala Val Leu Asp Met Cys Ala Ser Leu Lys Glu Leu Leu Gln
3525 3530 3535
Asn Gly Met Asn Gly Arg Thr Ile Leu Gly Ser Ala Leu Leu Glu Asp
3540 3545 3550
Glu Phe Thr Pro Phe Asp Val Val Arg Gln Cys Ser Gly Val Thr Phe
3555 3560 3565
Gln Ser Ala Val Lys Arg Thr Ile Lys Gly Thr His His Trp Leu Leu
3570 3575 3580
Leu Thr Ile Leu Thr Ser Leu Leu Val Leu Val Gln Ser Thr Gln Trp
3585 3590 3595 3600
Ser Leu Phe Phe Phe Xaa Tyr Glu Asn Ala Phe Leu Pro Phe Ala Met
3605 3610 3615
Gly Ile Ile Ala Met Ser Ala Phe Ala Met Met Phe Val Lys His Lys
3620 3625 3630
His Ala Phe Leu Cys Leu Phe Leu Leu Pro Ser Leu Ala Thr Val Ala
3635 3640 3645
Tyr Phe Asn Met Val Tyr Met Pro Ala Ser Trp Val Met Arg Ile Met
3650 3655 3660
Thr Trp Leu Asp Met Val Asp Thr Ser Leu Ser Gly Phe Lys Leu Lys
3665 3670 3675 3680
Asp Cys Val Met Tyr Ala Ser Ala Val Val Leu Leu Ile Leu Met Thr
3685 3690 3695
Ala Arg Thr Val Tyr Asp Asp Gly Ala Arg Arg Val Trp Thr Leu Met
3700 3705 3710
Asn Val Leu Thr Leu Val Tyr Lys Val Tyr Tyr Gly Asn Ala Leu Asp
3715 3720 3725
Gln Ala Ile Ser Met Trp Ala Leu Ile Ile Ser Val Thr Ser Asn Tyr
3730 3735 3740
Ser Gly Val Val Thr Thr Val Met Phe Leu Ala Arg Gly Ile Val Phe
3745 3750 3755 3760
Met Cys Val Glu Tyr Cys Pro Ile Phe Phe Ile Thr Gly Asn Thr Leu
3765 3770 3775
Gln Cys Ile Met Leu Val Tyr Cys Phe Leu Gly Tyr Phe Cys Thr Cys
3780 3785 3790
Tyr Phe Gly Leu Phe Cys Leu Leu Asn Arg Tyr Phe Arg Leu Thr Leu
3795 3800 3805
Gly Val Tyr Asp Tyr Leu Val Ser Thr Gln Glu Phe Arg Tyr Met Asn
3810 3815 3820
Ser Gln Gly Leu Leu Pro Pro Lys Asn Ser Ile Asp Ala Phe Lys Leu
3825 3830 3835 3840
Asn Ile Lys Leu Leu Gly Val Gly Gly Lys Pro Cys Ile Lys Val Ala
3845 3850 3855
Thr Val Gln Ser Lys Met Ser Asp Val Lys Cys Thr Ser Val Val Leu
3860 3865 3870
Leu Ser Val Leu Gln Gln Leu Arg Val Glu Ser Ser Ser Lys Leu Trp
3875 3880 3885
Ala Gln Cys Val Gln Leu His Asn Asp Ile Leu Leu Ala Lys Asp Thr
3890 3895 3900
Thr Glu Ala Phe Glu Lys Met Val Ser Leu Leu Ser Val Leu Leu Ser
3905 3910 3915 3920
Met Gln Gly Ala Val Asp Ile Asn Lys Leu Cys Glu Glu Met Leu Asp
3925 3930 3935
Asn Arg Ala Thr Leu Gln Ala Ile Ala Ser Glu Phe Ser Ser Leu Pro
3940 3945 3950
Ser Tyr Ala Ala Phe Ala Thr Ala Gln Glu Ala Tyr Glu Gln Ala Val
3955 3960 3965
Ala Asn Gly Asp Ser Glu Val Val Leu Lys Lys Leu Lys Lys Ser Leu
3970 3975 3980
Asn Val Ala Lys Ser Glu Phe Asp Arg Asp Ala Ala Met Gln Arg Lys
3985 3990 3995 4000
Leu Glu Lys Met Ala Asp Gln Ala Met Thr Gln Met Tyr Lys Gln Ala
4005 4010 4015
Arg Ser Glu Asp Lys Arg Ala Lys Val Thr Ser Ala Met Gln Thr Met
4020 4025 4030
Leu Phe Thr Met Leu Arg Lys Leu Asp Asn Asp Ala Leu Asn Asn Ile
4035 4040 4045
Ile Asn Asn Ala Arg Asp Gly Cys Val Pro Leu Asn Ile Ile Pro Leu
4050 4055 4060
Thr Thr Ala Ala Lys Leu Met Val Val Ile Pro Asp Tyr Asn Thr Tyr
4065 4070 4075 4080
Lys Asn Thr Cys Asp Gly Thr Thr Phe Thr Tyr Ala Ser Ala Leu Trp
4085 4090 4095
Glu Ile Gln Gln Val Val Asp Ala Asp Ser Lys Ile Val Gln Leu Ser
4100 4105 4110
Glu Ile Ser Met Asp Asn Ser Pro Asn Leu Ala Trp Pro Leu Ile Val
4115 4120 4125
Thr Ala Leu Arg Ala Asn Ser Ala Val Lys Leu Gln Asn Asn Glu Leu
4130 4135 4140
Ser Pro Val Ala Leu Arg Gln Met Ser Cys Ala Ala Gly Thr Thr Gln
4145 4150 4155 4160
Thr Ala Cys Thr Asp Asp Asn Ala Leu Ala Tyr Tyr Asn Thr Thr Lys
4165 4170 4175
Gly Gly Arg Phe Val Leu Ala Leu Leu Ser Asp Leu Gln Asp Leu Lys
4180 4185 4190
Trp Ala Arg Phe Pro Lys Ser Asp Gly Thr Gly Thr Ile Tyr Thr Glu
4195 4200 4205
Leu Glu Pro Pro Cys Arg Phe Val Thr Asp Thr Pro Lys Gly Pro Lys
4210 4215 4220
Val Lys Tyr Leu Tyr Phe Ile Lys Gly Leu Asn Asn Leu Asn Arg Gly
4225 4230 4235 4240
Met Val Leu Gly Ser Leu Ala Ala Thr Val Arg Leu Gln Ala Gly Asn
4245 4250 4255
Ala Thr Glu Val Pro Ala Asn Ser Thr Val Leu Ser Phe Cys Ala Phe
4260 4265 4270
Ala Val Asp Ala Ala Lys Ala Tyr Lys Asp Tyr Leu Ala Ser Gly Gly
4275 4280 4285
Gln Pro Ile Thr Asn Cys Val Lys Met Leu Cys Thr His Thr Gly Thr
4290 4295 4300
Gly Gln Ala Ile Thr Val Thr Pro Glu Ala Asn Met Asp Gln Glu Ser
4305 4310 4315 4320
Phe Gly Gly Ala Ser Cys Cys Leu Tyr Cys Arg Cys His Ile Asp His
4325 4330 4335
Pro Asn Pro Lys Gly Phe Cys Asp Leu Lys Gly Lys Tyr Val Gln Ile
4340 4345 4350
Pro Thr Thr Cys Ala Asn Asp Pro Val Gly Phe Thr Leu Lys Asn Thr
4355 4360 4365
Val Cys Thr Val Cys Gly Met Trp Lys Gly Tyr Gly Cys Ser Cys Asp
4370 4375 4380
Gln Leu Arg Glu Pro Met Leu Gln Ser Ala Asp Ala Gln Ser Phe Leu
4385 4390 4395 4400
Asn Arg Val Cys Gly Val Ser Ala Ala Arg Leu Thr Pro Cys Gly Thr
4405 4410 4415
Gly Thr Ser Thr Asp Val Val Tyr Arg Ala Phe Asp Ile Tyr Asn Asp
4420 4425 4430
Lys Val Ala Gly Phe Ala Lys Phe Leu Lys Thr Asn Cys Cys Arg Phe
4435 4440 4445
Gln Glu Lys Asp Glu Asp Asp Asn Leu Ile Asp Ser Tyr Phe Val Val
4450 4455 4460
Lys Arg His Thr Phe Ser Asn Tyr Gln His Glu Glu Thr Ile Tyr Asn
4465 4470 4475 4480
Leu Leu Lys Asp Cys Pro Ala Val Ala Lys His Asp Phe Phe Lys Phe
4485 4490 4495
Arg Ile Asp Gly Asp Met Val Pro His Ile Ser Arg Gln Arg Leu Thr
4500 4505 4510
Lys Tyr Thr Met Ala Asp Leu Val Tyr Ala Leu Arg His Phe Asp Glu
4515 4520 4525
Gly Asn Cys Asp Thr Leu Lys Glu Ile Leu Val Thr Tyr Asn Cys Cys
4530 4535 4540
Asp Asp Asp Tyr Phe Asn Lys Lys Asp Trp Tyr Asp Phe Val Glu Asn
4545 4550 4555 4560
Pro Asp Ile Leu Arg Val Tyr Ala Asn Leu Gly Glu Arg Val Arg Gln
4565 4570 4575
Ala Leu Leu Lys Thr Val Gln Phe Cys Asp Ala Met Arg Asn Ala Gly
4580 4585 4590
Ile Val Gly Val Leu Thr Leu Asp Asn Gln Asp Leu Asn Gly Asn Trp
4595 4600 4605
Tyr Asp Phe Gly Asp Phe Ile Gln Thr Thr Pro Gly Ser Gly Val Pro
4610 4615 4620
Val Val Asp Ser Tyr Tyr Ser Leu Leu Met Pro Ile Leu Thr Leu Thr
4625 4630 4635 4640
Arg Ala Leu Thr Ala Glu Ser His Val Asp Thr Asp Leu Thr Lys Pro
4645 4650 4655
Tyr Ile Lys Trp Asp Leu Leu Lys Tyr Asp Phe Thr Glu Glu Arg Leu
4660 4665 4670
Lys Leu Phe Asp Arg Tyr Phe Lys Tyr Trp Asp Gln Thr Tyr His Pro
4675 4680 4685
Asn Cys Val Asn Cys Leu Asp Asp Arg Cys Ile Leu His Cys Ala Asn
4690 4695 4700
Phe Asn Val Leu Phe Ser Thr Val Phe Pro Pro Thr Ser Phe Gly Pro
4705 4710 4715 4720
Leu Val Arg Lys Ile Phe Val Asp Gly Val Pro Phe Val Val Ser Thr
4725 4730 4735
Gly Tyr His Phe Arg Glu Leu Gly Val Val His Asn Gln Asp Val Asn
4740 4745 4750
Leu His Ser Ser Arg Leu Ser Phe Lys Glu Leu Leu Val Tyr Ala Ala
4755 4760 4765
Asp Pro Ala Met His Ala Ala Ser Gly Asn Leu Leu Leu Asp Lys Arg
4770 4775 4780
Thr Thr Cys Phe Ser Val Ala Ala Leu Thr Asn Asn Val Ala Phe Gln
4785 4790 4795 4800
Thr Val Lys Pro Gly Asn Phe Asn Lys Asp Phe Tyr Asp Phe Ala Val
4805 4810 4815
Ser Lys Gly Phe Phe Lys Glu Gly Ser Ser Val Glu Leu Lys His Phe
4820 4825 4830
Phe Phe Ala Gln Asp Gly Asn Ala Ala Ile Ser Asp Tyr Asp Tyr Tyr
4835 4840 4845
Arg Tyr Asn Leu Pro Thr Met Cys Asp Ile Arg Gln Leu Leu Phe Val
4850 4855 4860
Val Glu Val Val Asp Lys Tyr Phe Asp Cys Tyr Asp Gly Gly Cys Ile
4865 4870 4875 4880
Asn Ala Asn Gln Val Ile Val Asn Asn Leu Asp Lys Ser Ala Gly Phe
4885 4890 4895
Pro Phe Asn Lys Trp Gly Lys Ala Arg Leu Tyr Tyr Asp Ser Met Ser
4900 4905 4910
Tyr Glu Asp Gln Asp Ala Leu Phe Ala Tyr Thr Lys Arg Asn Val Ile
4915 4920 4925
Pro Thr Ile Thr Gln Met Asn Leu Lys Tyr Ala Ile Ser Ala Lys Asn
4930 4935 4940
Arg Ala Arg Thr Val Ala Gly Val Ser Ile Cys Ser Thr Met Thr Asn
4945 4950 4955 4960
Arg Gln Phe His Gln Lys Leu Leu Lys Ser Ile Ala Ala Thr Arg Gly
4965 4970 4975
Ala Thr Val Val Ile Gly Thr Ser Lys Phe Tyr Gly Gly Trp His Asn
4980 4985 4990
Met Leu Lys Thr Val Tyr Ser Asp Val Glu Asn Pro His Leu Met Gly
4995 5000 5005
Trp Asp Tyr Pro Lys Cys Asp Arg Ala Met Pro Asn Met Leu Arg Ile
5010 5015 5020
Met Ala Ser Leu Val Leu Ala Arg Lys His Thr Thr Cys Cys Ser Leu
5025 5030 5035 5040
Ser His Arg Phe Tyr Arg Leu Ala Asn Glu Cys Ala Gln Val Leu Ser
5045 5050 5055
Glu Met Val Met Cys Gly Gly Ser Leu Tyr Val Lys Pro Gly Gly Thr
5060 5065 5070
Ser Ser Gly Asp Ala Thr Thr Ala Tyr Ala Asn Ser Val Phe Asn Ile
5075 5080 5085
Cys Gln Ala Val Thr Ala Asn Val Asn Ala Leu Leu Ser Thr Asp Gly
5090 5095 5100
Asn Lys Ile Ala Asp Lys Tyr Val Arg Asn Leu Gln His Arg Leu Tyr
5105 5110 5115 5120
Glu Cys Leu Tyr Arg Asn Arg Asp Val Asp Thr Asp Phe Val Asn Glu
5125 5130 5135
Phe Tyr Ala Tyr Leu Arg Lys His Phe Ser Met Met Ile Leu Ser Asp
5140 5145 5150
Asp Ala Val Val Cys Phe Asn Ser Thr Tyr Ala Ser Gln Gly Leu Val
5155 5160 5165
Ala Ser Ile Lys Asn Phe Lys Ser Val Leu Tyr Tyr Gln Asn Asn Val
5170 5175 5180
Phe Met Ser Glu Ala Lys Cys Trp Thr Glu Thr Asp Leu Thr Lys Gly
5185 5190 5195 5200
Pro His Glu Phe Cys Ser Gln His Thr Met Leu Val Lys Gln Gly Asp
5205 5210 5215
Asp Tyr Val Tyr Leu Pro Tyr Pro Asp Pro Ser Arg Ile Leu Gly Ala
5220 5225 5230
Gly Cys Phe Val Asp Asp Ile Val Lys Thr Asp Gly Thr Leu Met Ile
5235 5240 5245
Glu Arg Phe Val Ser Leu Ala Ile Asp Ala Tyr Pro Leu Thr Lys His
5250 5255 5260
Pro Asn Gln Glu Tyr Ala Asp Val Phe His Leu Tyr Leu Gln Tyr Ile
5265 5270 5275 5280
Arg Lys Leu His Asp Glu Leu Thr Gly His Met Leu Asp Met Tyr Ser
5285 5290 5295
Val Met Leu Thr Asn Asp Asn Thr Ser Arg Tyr Trp Glu Pro Glu Phe
5300 5305 5310
Tyr Glu Ala Met Tyr Thr Pro His Thr Val Leu Gln Ala Val Gly Ala
5315 5320 5325
Cys Val Leu Cys Asn Ser Gln Thr Ser Leu Arg Cys Gly Ala Cys Ile
5330 5335 5340
Arg Arg Pro Phe Leu Cys Cys Lys Cys Cys Tyr Asp His Val Ile Ser
5345 5350 5355 5360
Thr Ser His Lys Leu Val Leu Ser Val Asn Pro Tyr Val Cys Asn Ala
5365 5370 5375
Pro Gly Cys Asp Val Thr Asp Val Thr Gln Leu Tyr Leu Gly Gly Met
5380 5385 5390
Ser Tyr Tyr Cys Lys Ser His Lys Pro Pro Ile Ser Phe Pro Leu Cys
5395 5400 5405
Ala Asn Gly Gln Val Phe Gly Leu Tyr Lys Asn Thr Cys Val Gly Ser
5410 5415 5420
Asp Asn Val Thr Asp Phe Asn Ala Ile Ala Thr Cys Asp Trp Thr Asn
5425 5430 5435 5440
Ala Gly Asp Tyr Ile Leu Ala Asn Thr Cys Thr Glu Arg Leu Lys Leu
5445 5450 5455
Phe Ala Ala Glu Thr Leu Lys Ala Thr Glu Glu Thr Phe Lys Leu Ser
5460 5465 5470
Tyr Gly Ile Ala Thr Val Arg Glu Val Leu Ser Asp Arg Glu Leu His
5475 5480 5485
Leu Ser Trp Glu Val Gly Lys Pro Arg Pro Pro Leu Asn Arg Asn Tyr
5490 5495 5500
Val Phe Thr Gly Tyr Arg Val Thr Lys Asn Ser Lys Val Gln Ile Gly
5505 5510 5515 5520
Glu Tyr Thr Phe Glu Lys Gly Asp Tyr Gly Asp Ala Val Val Tyr Arg
5525 5530 5535
Gly Thr Thr Thr Tyr Lys Leu Asn Val Gly Asp Tyr Phe Val Leu Thr
5540 5545 5550
Ser His Thr Val Met Pro Leu Ser Ala Pro Thr Leu Val Pro Gln Glu
5555 5560 5565
His Tyr Val Arg Ile Thr Gly Leu Tyr Pro Thr Leu Asn Ile Ser Asp
5570 5575 5580
Glu Phe Ser Ser Asn Val Ala Asn Tyr Gln Lys Val Gly Met Gln Lys
5585 5590 5595 5600
Tyr Ser Thr Leu Gln Gly Pro Pro Gly Thr Gly Lys Ser His Phe Ala
5605 5610 5615
Ile Gly Leu Ala Leu Tyr Tyr Pro Ser Ala Arg Ile Val Tyr Thr Ala
5620 5625 5630
Cys Ser His Ala Ala Val Asp Ala Leu Cys Glu Lys Ala Leu Lys Tyr
5635 5640 5645
Leu Pro Ile Asp Lys Cys Ser Arg Ile Ile Pro Ala Arg Ala Arg Val
5650 5655 5660
Glu Cys Phe Asp Lys Phe Lys Val Asn Ser Thr Leu Glu Gln Tyr Val
5665 5670 5675 5680
Phe Cys Thr Val Asn Ala Leu Pro Glu Thr Thr Ala Asp Ile Val Val
5685 5690 5695
Phe Asp Glu Ile Ser Met Ala Thr Asn Tyr Asp Leu Ser Val Val Asn
5700 5705 5710
Ala Arg Leu Arg Ala Lys His Tyr Val Tyr Ile Gly Asp Pro Ala Gln
5715 5720 5725
Leu Pro Ala Pro Arg Thr Leu Leu Thr Lys Gly Thr Leu Glu Pro Glu
5730 5735 5740
Tyr Phe Asn Ser Val Cys Arg Leu Met Lys Thr Ile Gly Pro Asp Met
5745 5750 5755 5760
Phe Leu Gly Thr Cys Arg Arg Cys Pro Ala Glu Ile Val Asp Thr Val
5765 5770 5775
Ser Ala Leu Val Tyr Asp Asn Lys Leu Lys Ala His Lys Asp Lys Ser
5780 5785 5790
Ala Gln Cys Phe Lys Met Phe Tyr Lys Gly Val Ile Thr His Asp Val
5795 5800 5805
Ser Ser Ala Ile Asn Arg Pro Gln Ile Gly Val Val Arg Glu Phe Leu
5810 5815 5820
Thr Arg Asn Pro Ala Trp Arg Lys Ala Val Phe Ile Ser Pro Tyr Asn
5825 5830 5835 5840
Ser Gln Asn Ala Val Ala Ser Lys Ile Leu Gly Leu Pro Thr Gln Thr
5845 5850 5855
Val Asp Ser Ser Gln Gly Ser Glu Tyr Asp Tyr Val Ile Phe Thr Gln
5860 5865 5870
Thr Thr Glu Thr Ala His Ser Cys Asn Val Asn Arg Phe Asn Val Ala
5875 5880 5885
Ile Thr Arg Ala Lys Val Gly Ile Leu Cys Ile Met Ser Asp Arg Asp
5890 5895 5900
Leu Tyr Asp Lys Leu Gln Phe Thr Ser Leu Glu Ile Pro Arg Arg Asn
5905 5910 5915 5920
Val Ala Thr Leu Gln Ala Glu Asn Val Thr Gly Leu Phe Lys Asp Cys
5925 5930 5935
Ser Lys Val Ile Thr Gly Leu His Pro Thr Gln Ala Pro Thr His Leu
5940 5945 5950
Ser Val Asp Thr Lys Phe Lys Thr Glu Gly Leu Cys Val Asp Ile Pro
5955 5960 5965
Gly Ile Pro Lys Asp Met Thr Tyr Arg Arg Leu Ile Ser Met Met Gly
5970 5975 5980
Phe Lys Met Asn Tyr Gln Val Asn Gly Tyr Pro Asn Met Phe Ile Thr
5985 5990 5995 6000
Arg Glu Glu Ala Ile Arg His Val Arg Ala Trp Ile Gly Phe Asp Val
6005 6010 6015
Glu Gly Cys His Ala Thr Arg Glu Ala Val Gly Thr Asn Leu Pro Leu
6020 6025 6030
Gln Leu Gly Phe Ser Thr Gly Val Asn Leu Val Ala Val Pro Thr Gly
6035 6040 6045
Tyr Val Asp Thr Pro Asn Asn Thr Asp Phe Ser Arg Val Ser Ala Lys
6050 6055 6060
Pro Pro Pro Gly Asp Gln Phe Lys His Leu Ile Pro Leu Met Tyr Lys
6065 6070 6075 6080
Gly Leu Pro Trp Asn Val Val Arg Ile Lys Ile Val Gln Met Leu Ser
6085 6090 6095
Asp Thr Leu Lys Asn Leu Ser Asp Arg Val Val Phe Val Leu Trp Ala
6100 6105 6110
His Gly Phe Glu Leu Thr Ser Met Lys Tyr Phe Val Lys Ile Gly Pro
6115 6120 6125
Glu Arg Thr Cys Cys Leu Cys Asp Arg Arg Ala Thr Cys Phe Ser Thr
6130 6135 6140
Ala Ser Asp Thr Tyr Ala Cys Trp His His Ser Ile Gly Phe Asp Tyr
6145 6150 6155 6160
Val Tyr Asn Pro Phe Met Ile Asp Val Gln Gln Trp Gly Phe Thr Gly
6165 6170 6175
Asn Leu Gln Ser Asn His Asp Leu Tyr Cys Gln Val His Gly Asn Ala
6180 6185 6190
His Val Ala Ser Cys Asp Ala Ile Met Thr Arg Cys Leu Ala Val His
6195 6200 6205
Glu Cys Phe Val Lys Arg Val Asp Trp Thr Ile Glu Tyr Pro Ile Ile
6210 6215 6220
Gly Asp Glu Leu Lys Ile Asn Ala Ala Cys Arg Lys Val Gln His Met
6225 6230 6235 6240
Val Val Lys Ala Ala Leu Leu Ala Asp Lys Phe Pro Val Leu His Asp
6245 6250 6255
Ile Gly Asn Pro Lys Ala Ile Lys Cys Val Pro Gln Ala Asp Val Glu
6260 6265 6270
Trp Lys Phe Tyr Asp Ala Gln Pro Cys Ser Asp Lys Ala Tyr Lys Ile
6275 6280 6285
Glu Glu Leu Phe Tyr Ser Tyr Ala Thr His Ser Asp Lys Phe Thr Asp
6290 6295 6300
Gly Val Cys Leu Phe Trp Asn Cys Asn Val Asp Arg Tyr Pro Ala Asn
6305 6310 6315 6320
Ser Ile Val Cys Arg Phe Asp Thr Arg Val Leu Ser Asn Leu Asn Leu
6325 6330 6335
Pro Gly Cys Asp Gly Gly Ser Leu Tyr Val Asn Lys His Ala Phe His
6340 6345 6350
Thr Pro Ala Phe Asp Lys Ser Ala Phe Val Asn Leu Lys Gln Leu Pro
6355 6360 6365
Phe Phe Tyr Tyr Ser Asp Ser Pro Cys Glu Ser His Gly Lys Gln Val
6370 6375 6380
Val Ser Asp Ile Asp Tyr Val Pro Leu Lys Ser Ala Thr Cys Ile Thr
6385 6390 6395 6400
Arg Cys Asn Leu Gly Gly Ala Val Cys Arg His His Ala Asn Glu Tyr
6405 6410 6415
Arg Leu Tyr Leu Asp Ala Tyr Asn Met Met Ile Ser Ala Gly Phe Ser
6420 6425 6430
Leu Trp Val Tyr Lys Gln Phe Asp Thr Tyr Asn Leu Trp Asn Thr Phe
6435 6440 6445
Thr Arg Leu Gln Ser Leu Glu Asn Val Ala Phe Asn Val Val Asn Lys
6450 6455 6460
Gly His Phe Asp Gly Gln Gln Gly Glu Val Pro Val Ser Ile Ile Asn
6465 6470 6475 6480
Asn Thr Val Tyr Thr Lys Val Asp Gly Val Asp Val Glu Leu Phe Glu
6485 6490 6495
Asn Lys Thr Thr Leu Pro Val Asn Val Ala Phe Glu Leu Trp Ala Lys
6500 6505 6510
Arg Asn Ile Lys Pro Val Pro Glu Val Lys Ile Leu Asn Asn Leu Gly
6515 6520 6525
Val Asp Ile Ala Ala Asn Thr Val Ile Trp Asp Tyr Lys Arg Asp Ala
6530 6535 6540
Pro Ala His Ile Ser Thr Ile Gly Val Cys Ser Met Thr Asp Ile Ala
6545 6550 6555 6560
Lys Lys Pro Thr Glu Thr Ile Cys Ala Pro Leu Thr Val Phe Phe Asp
6565 6570 6575
Gly Arg Val Asp Gly Gln Val Asp Leu Phe Arg Asn Ala Arg Asn Gly
6580 6585 6590
Val Leu Ile Thr Glu Gly Ser Val Lys Gly Leu Gln Pro Ser Val Gly
6595 6600 6605
Pro Lys Gln Ala Ser Leu Asn Gly Val Thr Leu Ile Gly Glu Ala Val
6610 6615 6620
Lys Thr Gln Phe Asn Tyr Tyr Lys Lys Val Asp Gly Val Val Gln Gln
6625 6630 6635 6640
Leu Pro Glu Thr Tyr Phe Thr Gln Ser Arg Asn Leu Gln Glu Phe Lys
6645 6650 6655
Pro Arg Ser Gln Met Glu Ile Asp Phe Leu Glu Leu Ala Met Asp Glu
6660 6665 6670
Phe Ile Glu Arg Tyr Lys Leu Glu Gly Tyr Ala Phe Glu His Ile Val
6675 6680 6685
Tyr Gly Asp Phe Ser His Ser Gln Leu Gly Gly Leu His Leu Leu Ile
6690 6695 6700
Gly Leu Ala Lys Arg Phe Lys Glu Ser Pro Phe Glu Leu Glu Asp Phe
6705 6710 6715 6720
Ile Pro Met Asp Ser Thr Val Lys Asn Tyr Phe Ile Thr Asp Ala Gln
6725 6730 6735
Thr Gly Ser Ser Lys Cys Val Cys Ser Val Ile Asp Leu Leu Leu Asp
6740 6745 6750
Asp Phe Val Glu Ile Ile Lys Ser Gln Asp Leu Ser Val Val Ser Lys
6755 6760 6765
Val Val Lys Val Thr Ile Asp Tyr Thr Glu Ile Ser Phe Met Leu Trp
6770 6775 6780
Cys Lys Asp Gly His Val Glu Thr Phe Tyr Pro Lys Leu Gln Ser Ser
6785 6790 6795 6800
Gln Ala Trp Gln Pro Gly Val Ala Met Pro Asn Leu Tyr Lys Met Gln
6805 6810 6815
Arg Met Leu Leu Glu Lys Cys Asp Leu Gln Asn Tyr Gly Asp Ser Ala
6820 6825 6830
Thr Leu Pro Lys Gly Ile Met Met Asn Val Ala Lys Tyr Thr Gln Leu
6835 6840 6845
Cys Gln Tyr Leu Asn Thr Leu Thr Leu Ala Val Pro Tyr Asn Met Arg
6850 6855 6860
Val Ile His Phe Gly Ala Gly Ser Asp Lys Gly Val Ala Pro Gly Thr
6865 6870 6875 6880
Ala Val Leu Arg Gln Trp Leu Pro Thr Gly Thr Leu Leu Val Asp Ser
6885 6890 6895
Asp Leu Asn Asp Phe Val Ser Asp Ala Asp Ser Thr Leu Ile Gly Asp
6900 6905 6910
Cys Ala Thr Val His Thr Ala Asn Lys Trp Asp Leu Ile Ile Ser Asp
6915 6920 6925
Met Tyr Asp Pro Lys Thr Lys Asn Val Thr Lys Glu Asn Asp Ser Lys
6930 6935 6940
Glu Gly Phe Phe Thr Tyr Ile Cys Gly Phe Ile Gln Gln Lys Leu Ala
6945 6950 6955 6960
Leu Gly Gly Ser Val Ala Ile Lys Ile Thr Glu His Ser Trp Asn Ala
6965 6970 6975
Asp Leu Tyr Lys Leu Met Gly His Phe Ala Trp Trp Thr Ala Phe Val
6980 6985 6990
Thr Asn Val Asn Ala Ser Ser Ser Glu Ala Phe Leu Ile Gly Cys Asn
6995 7000 7005
Tyr Leu Gly Lys Pro Arg Glu Gln Ile Asp Gly Tyr Val Met His Ala
7010 7015 7020
Asn Tyr Ile Phe Trp Arg Asn Thr Asn Pro Ile Gln Leu Ser Ser Tyr
7025 7030 7035 7040
Ser Leu Phe Asp Met Ser Lys Phe Pro Leu Lys Leu Arg Gly Thr Ala
7045 7050 7055
Val Met Ser Leu Lys Glu
7060
<210> 11
<211> 1273
<212> PRT
<213> SARS-CoV2
<400> 11
Met Phe Val Phe Leu Val Leu Leu Pro Leu Val Ser Ser Gln Cys Val
1 5 10 15
Asn Leu Thr Thr Arg Thr Gln Leu Pro Pro Ala Tyr Thr Asn Ser Phe
20 25 30
Thr Arg Gly Val Tyr Tyr Pro Asp Lys Val Phe Arg Ser Ser Val Leu
35 40 45
His Ser Thr Gln Asp Leu Phe Leu Pro Phe Phe Ser Asn Val Thr Trp
50 55 60
Phe His Ala Ile His Val Ser Gly Thr Asn Gly Thr Lys Arg Phe Asp
65 70 75 80
Asn Pro Val Leu Pro Phe Asn Asp Gly Val Tyr Phe Ala Ser Thr Glu
85 90 95
Lys Ser Asn Ile Ile Arg Gly Trp Ile Phe Gly Thr Thr Leu Asp Ser
100 105 110
Lys Thr Gln Ser Leu Leu Ile Val Asn Asn Ala Thr Asn Val Val Ile
115 120 125
Lys Val Cys Glu Phe Gln Phe Cys Asn Asp Pro Phe Leu Gly Val Tyr
130 135 140
Tyr His Lys Asn Asn Lys Ser Trp Met Glu Ser Glu Phe Arg Val Tyr
145 150 155 160
Ser Ser Ala Asn Asn Cys Thr Phe Glu Tyr Val Ser Gln Pro Phe Leu
165 170 175
Met Asp Leu Glu Gly Lys Gln Gly Asn Phe Lys Asn Leu Arg Glu Phe
180 185 190
Val Phe Lys Asn Ile Asp Gly Tyr Phe Lys Ile Tyr Ser Lys His Thr
195 200 205
Pro Ile Asn Leu Val Arg Asp Leu Pro Gln Gly Phe Ser Ala Leu Glu
210 215 220
Pro Leu Val Asp Leu Pro Ile Gly Ile Asn Ile Thr Arg Phe Gln Thr
225 230 235 240
Leu Leu Ala Leu His Arg Ser Tyr Leu Thr Pro Gly Asp Ser Ser Ser
245 250 255
Gly Trp Thr Ala Gly Ala Ala Ala Tyr Tyr Val Gly Tyr Leu Gln Pro
260 265 270
Arg Thr Phe Leu Leu Lys Tyr Asn Glu Asn Gly Thr Ile Thr Asp Ala
275 280 285
Val Asp Cys Ala Leu Asp Pro Leu Ser Glu Thr Lys Cys Thr Leu Lys
290 295 300
Ser Phe Thr Val Glu Lys Gly Ile Tyr Gln Thr Ser Asn Phe Arg Val
305 310 315 320
Gln Pro Thr Glu Ser Ile Val Arg Phe Pro Asn Ile Thr Asn Leu Cys
325 330 335
Pro Phe Gly Glu Val Phe Asn Ala Thr Arg Phe Ala Ser Val Tyr Ala
340 345 350
Trp Asn Arg Lys Arg Ile Ser Asn Cys Val Ala Asp Tyr Ser Val Leu
355 360 365
Tyr Asn Ser Ala Ser Phe Ser Thr Phe Lys Cys Tyr Gly Val Ser Pro
370 375 380
Thr Lys Leu Asn Asp Leu Cys Phe Thr Asn Val Tyr Ala Asp Ser Phe
385 390 395 400
Val Ile Arg Gly Asp Glu Val Arg Gln Ile Ala Pro Gly Gln Thr Gly
405 410 415
Lys Ile Ala Asp Tyr Asn Tyr Lys Leu Pro Asp Asp Phe Thr Gly Cys
420 425 430
Val Ile Ala Trp Asn Ser Asn Asn Leu Asp Ser Lys Val Gly Gly Asn
435 440 445
Tyr Asn Tyr Leu Tyr Arg Leu Phe Arg Lys Ser Asn Leu Lys Pro Phe
450 455 460
Glu Arg Asp Ile Ser Thr Glu Ile Tyr Gln Ala Gly Ser Thr Pro Cys
465 470 475 480
Asn Gly Val Glu Gly Phe Asn Cys Tyr Phe Pro Leu Gln Ser Tyr Gly
485 490 495
Phe Gln Pro Thr Asn Gly Val Gly Tyr Gln Pro Tyr Arg Val Val Val
500 505 510
Leu Ser Phe Glu Leu Leu His Ala Pro Ala Thr Val Cys Gly Pro Lys
515 520 525
Lys Ser Thr Asn Leu Val Lys Asn Lys Cys Val Asn Phe Asn Phe Asn
530 535 540
Gly Leu Thr Gly Thr Gly Val Leu Thr Glu Ser Asn Lys Lys Phe Leu
545 550 555 560
Pro Phe Gln Gln Phe Gly Arg Asp Ile Ala Asp Thr Thr Asp Ala Val
565 570 575
Arg Asp Pro Gln Thr Leu Glu Ile Leu Asp Ile Thr Pro Cys Ser Phe
580 585 590
Gly Gly Val Ser Val Ile Thr Pro Gly Thr Asn Thr Ser Asn Gln Val
595 600 605
Ala Val Leu Tyr Gln Asp Val Asn Cys Thr Glu Val Pro Val Ala Ile
610 615 620
His Ala Asp Gln Leu Thr Pro Thr Trp Arg Val Tyr Ser Thr Gly Ser
625 630 635 640
Asn Val Phe Gln Thr Arg Ala Gly Cys Leu Ile Gly Ala Glu His Val
645 650 655
Asn Asn Ser Tyr Glu Cys Asp Ile Pro Ile Gly Ala Gly Ile Cys Ala
660 665 670
Ser Tyr Gln Thr Gln Thr Asn Ser Pro Arg Arg Ala Arg Ser Val Ala
675 680 685
Ser Gln Ser Ile Ile Ala Tyr Thr Met Ser Leu Gly Ala Glu Asn Ser
690 695 700
Val Ala Tyr Ser Asn Asn Ser Ile Ala Ile Pro Thr Asn Phe Thr Ile
705 710 715 720
Ser Val Thr Thr Glu Ile Leu Pro Val Ser Met Thr Lys Thr Ser Val
725 730 735
Asp Cys Thr Met Tyr Ile Cys Gly Asp Ser Thr Glu Cys Ser Asn Leu
740 745 750
Leu Leu Gln Tyr Gly Ser Phe Cys Thr Gln Leu Asn Arg Ala Leu Thr
755 760 765
Gly Ile Ala Val Glu Gln Asp Lys Asn Thr Gln Glu Val Phe Ala Gln
770 775 780
Val Lys Gln Ile Tyr Lys Thr Pro Pro Ile Lys Asp Phe Gly Gly Phe
785 790 795 800
Asn Phe Ser Gln Ile Leu Pro Asp Pro Ser Lys Pro Ser Lys Arg Ser
805 810 815
Phe Ile Glu Asp Leu Leu Phe Asn Lys Val Thr Leu Ala Asp Ala Gly
820 825 830
Phe Ile Lys Gln Tyr Gly Asp Cys Leu Gly Asp Ile Ala Ala Arg Asp
835 840 845
Leu Ile Cys Ala Gln Lys Phe Asn Gly Leu Thr Val Leu Pro Pro Leu
850 855 860
Leu Thr Asp Glu Met Ile Ala Gln Tyr Thr Ser Ala Leu Leu Ala Gly
865 870 875 880
Thr Ile Thr Ser Gly Trp Thr Phe Gly Ala Gly Ala Ala Leu Gln Ile
885 890 895
Pro Phe Ala Met Gln Met Ala Tyr Arg Phe Asn Gly Ile Gly Val Thr
900 905 910
Gln Asn Val Leu Tyr Glu Asn Gln Lys Leu Ile Ala Asn Gln Phe Asn
915 920 925
Ser Ala Ile Gly Lys Ile Gln Asp Ser Leu Ser Ser Thr Ala Ser Ala
930 935 940
Leu Gly Lys Leu Gln Asp Val Val Asn Gln Asn Ala Gln Ala Leu Asn
945 950 955 960
Thr Leu Val Lys Gln Leu Ser Ser Asn Phe Gly Ala Ile Ser Ser Val
965 970 975
Leu Asn Asp Ile Leu Ser Arg Leu Asp Lys Val Glu Ala Glu Val Gln
980 985 990
Ile Asp Arg Leu Ile Thr Gly Arg Leu Gln Ser Leu Gln Thr Tyr Val
995 1000 1005
Thr Gln Gln Leu Ile Arg Ala Ala Glu Ile Arg Ala Ser Ala Asn Leu
1010 1015 1020
Ala Ala Thr Lys Met Ser Glu Cys Val Leu Gly Gln Ser Lys Arg Val
1025 1030 1035 1040
Asp Phe Cys Gly Lys Gly Tyr His Leu Met Ser Phe Pro Gln Ser Ala
1045 1050 1055
Pro His Gly Val Val Phe Leu His Val Thr Tyr Val Pro Ala Gln Glu
1060 1065 1070
Lys Asn Phe Thr Thr Ala Pro Ala Ile Cys His Asp Gly Lys Ala His
1075 1080 1085
Phe Pro Arg Glu Gly Val Phe Val Ser Asn Gly Thr His Trp Phe Val
1090 1095 1100
Thr Gln Arg Asn Phe Tyr Glu Pro Gln Ile Ile Thr Thr Asp Asn Thr
1105 1110 1115 1120
Phe Val Ser Gly Asn Cys Asp Val Val Ile Gly Ile Val Asn Asn Thr
1125 1130 1135
Val Tyr Asp Pro Leu Gln Pro Glu Leu Asp Ser Phe Lys Glu Glu Leu
1140 1145 1150
Asp Lys Tyr Phe Lys Asn His Thr Ser Pro Asp Val Asp Leu Gly Asp
1155 1160 1165
Ile Ser Gly Ile Asn Ala Ser Val Val Asn Ile Gln Lys Glu Ile Asp
1170 1175 1180
Arg Leu Asn Glu Val Ala Lys Asn Leu Asn Glu Ser Leu Ile Asp Leu
1185 1190 1195 1200
Gln Glu Leu Gly Lys Tyr Glu Gln Tyr Ile Lys Trp Pro Trp Tyr Ile
1205 1210 1215
Trp Leu Gly Phe Ile Ala Gly Leu Ile Ala Ile Val Met Val Thr Ile
1220 1225 1230
Met Leu Cys Cys Met Thr Ser Cys Cys Ser Cys Leu Lys Gly Cys Cys
1235 1240 1245
Ser Cys Gly Ser Cys Cys Lys Phe Asp Glu Asp Asp Ser Glu Pro Val
1250 1255 1260
Leu Lys Gly Val Lys Leu His Tyr Thr
1265 1270
<210> 12
<211> 29874
<212> DNA
<213> SARS-CoV2
<400> 12
attaaaggtt tataccttcc caggtaacaa accaaccaac tttcgatctc ttgtagatct 60
gttctctaaa cgaactttaa aatctgtgtg gctgtcactc ggctgcatgc ttagtgcact 120
cacgcagtat aattaataac taattactgt cgttgacagg acacgagtaa ctcgtctatc 180
ttctgcaggc tgcttacggt ttcgtccgtg ttgcagccga tcatcagcac atctaggttt 240
cgtccgggtg tgaccgaaag gtaagatgga gagccttgtc cctggtttca acgagaaaac 300
acacgtccaa ctcagtttgc ctgttttaca ggttcgcgac gtgctcgtac gtggctttgg 360
agactccgtg gaggaggtct tatcagaggc acgtcaacat cttaaagatg gcacttgtgg 420
cttagtagaa gttgaaaaag gcgttttgcc tcaacttgaa cagccctatg tgttcatcaa 480
acgttcggat gctcgaactg cacctcatgg tcatgttatg gttgagctgg tagcagaact 540
cgaaggcatt cagtacggtc gtagtggtga gacacttggt gtccttgtcc ctcatgtggg 600
cgaaatacca gtggcttacc gcaaggttct tcttcgtaag aacggtaata aaggagctgg 660
tggccatagt tacggcgccg atctaaagtc atttgactta ggcgacgagc ttggcactga 720
tccttatgaa gattttcaag aaaactggaa cactaaacat agcagtggtg ttacccgtga 780
actcatgcgt gagcttaacg gaggggcata cactcgctat gtcgataaca acttctgtgg 840
ccctgatggc taccctcttg agtgcattaa agaccttcta gcacgtgctg gtaaagcttc 900
atgcactttg tccgaacaac tggactttat tgacactaag aggggtgtat actgctgccg 960
tgaacatgag catgaaattg cttggtacac ggaacgttct gaaaagagct atgaattgca 1020
gacacctttt gaaattaaat tggcaaagaa atttgacacc ttcaatgggg aatgtccaaa 1080
ttttgtattt cccttaaatt ccataatcaa gactattcaa ccaagggttg aaaagaaaaa 1140
gcttgatggc tttatgggta gaattcgatc tgtctatcca gttgcgtcac caaatgaatg 1200
caaccaaatg tgcctttcaa ctctcatgaa gtgtgatcat tgtggtgaaa cttcatggca 1260
gacgggcgat tttgttaaag ccacttgcga attttgtggc actgagaatt tgactaaaga 1320
aggtgccact acttgtggtt acttacccca aaatgctgtt gttaaaattt attgtccagc 1380
atgtcacaat tcagaagtag gacctgagca tagtcttgcc gaataccata atgaatctgg 1440
cttgaaaacc attcttcgta agggtggtcg cactattgcc tttggaggct gtgtgttctc 1500
ttatgttggt tgccataaca agtgtgccta ttgggttcca cgtgctagcg ctaacatagg 1560
ttgtaaccat acaggtgttg ttggagaagg ttccgaaggt cttaatgaca accttcttga 1620
aatactccaa aaagagaaag tcaacatcaa tattgttggt gactttaaac ttaatgaaga 1680
gatcgccatt attttggcat ctttttctgc ttccacaagt gcttttgtgg aaactgtgaa 1740
aggtttggat tataaagcat tcaaacaaat tgttgaatcc tgtggtaatt ttaaagttac 1800
aaaaggaaaa gctaaaaaag gtgcctggaa tattggtgaa cagaaatcaa tactgagtcc 1860
tctttatgca tttgcatcag aggctgctcg tgttgtacga tcaattttct cccgcactct 1920
tgaaactgct caaaattctg tgcgtgtttt acagaaggcc gctataacaa tactagatgg 1980
aatttcacag tattcactga gactcattga tgctatgatg ttcacatctg atttggctac 2040
taacaatcta gttgtaatgg cctacattac aggtggtgtt gttcagttga cttcgcagtg 2100
gctaactaac atctttggca ctgtttatga aaaactcaaa cccgtccttg attggcttga 2160
agagaagttt aaggaaggtg tagagtttct tagagacggt tgggaaattg ttaaatttat 2220
ctcaacctgt gcttgtgaaa ttgtcggtgg acaaattgtc acctgtgcaa aggaaattaa 2280
ggagagtgtt cagacattct ttaagcttgt aaataaattt ttggctttgt gtgctgactc 2340
tatcattatt ggtggagcta aacttaaagc cttgaattta ggtgaaacat ttgtcacgca 2400
ctcaaaggga ttgtacagaa agtgtgttaa atccagagaa gaaactggcc tactcatgcc 2460
tctaaaagcc ccaaaagaaa ttatcttctt agagggagaa acacttccca cagaagtgtt 2520
aacagaggaa gttgtcttga aaactggtga tttacaacca ttagaacaac ctactagtga 2580
agctgttgaa gctccattgg ttggtacacc agtttgtatt aacgggctta tgttgctcga 2640
aatcaaagac acagaaaagt actgtgccct tgcacctaat atgatggtaa caaacaatac 2700
cttcacactc aaaggcggtg caccaacaaa ggttactttt ggtgatgaca ctgtgataga 2760
agtgcaaggt tacaagagtg tgaatatcac ttttgaactt gatgaaagga ttgataaagt 2820
acttaatgag aagtgctctg cctatacagt tgaactcggt acagaagtaa atgagttcgc 2880
ctgtgttgtg gcagatgctg tcataaaaac tttgcaacca gtatctgaat tacttacacc 2940
actgggcatt gatttagatg agtggagtat ggctacatac tacttatttg atgagtctgg 3000
tgagtttaaa ttggcttcac atatgtattg ttctttctac cctccagatg aggatgaaga 3060
agaaggtgat tgtgaagaag aagagtttga gccatcaact caatatgagt atggtactga 3120
agatgattac caaggtaaac ctttggaatt tggtgccact tctgctgctc ttcaacctga 3180
agaagagcaa gaagaagatt ggttagatga tgatagtcaa caaactgttg gtcaacaaga 3240
cggcagtgag gacaatcaga caactactat tcaaacaatt gttgaggttc aacctcaatt 3300
agagatggaa cttacaccag ttgttcagac tattgaagtg aatagtttta gtggttattt 3360
aaaacttact gacaatgtat acattaaaaa tgcagacatt gtggaagaag ctaaaaaggt 3420
aaaaccaaca gtggttgtta atgcagccaa tgtttacctt aaacatggag gaggtgttgc 3480
aggagcctta aataaggcta ctaacaatgc catgcaagtt gaatctgatg attacatagc 3540
tactaatgga ccacttaaag tgggtggtag ttgtgtttta agcggacaca atcttgctaa 3600
acactgtctt catgttgtcg gcccaaatgt taacaaaggt gaagacattc aacttcttaa 3660
gagtgcttat gaaaatttta atcagcacga agttctactt gcaccattat tatcagctgg 3720
tatttttggt gctgacccta tacattcttt aagagtttgt gtagatactg ttcgcacaaa 3780
tgtctactta gctgtctttg ataaaaatct ctatgacaaa cttgtttcaa gctttttgga 3840
aatgaagagt gaaaagcaag ttgaacaaaa gatcgctgag attcctaaag aggaagttaa 3900
gccatttata actgaaagta aaccttcagt tgaacagaga aaacaagatg ataagaaaat 3960
caaagcttgt gttgaagaag ttacaacaac tctggaagaa actaagttcc tcacagaaaa 4020
cttgttactt tatattgaca ttaatggcaa tcttcatcca gattctgcca ctcttgttag 4080
tgacattgac atcactttct taaagaaaga tgctccatat atagtgggtg atgttgttca 4140
agagggtgtt ttaactgctg tggttatacc tactaaaaag gctggtggca ctactgaaat 4200
gctagcgaaa gctttgagaa aagtgccaac agacaattat ataaccactt acccgggtca 4260
gggtttaaat ggttacactg tagaggaggc aaagacagtg cttaaaaagt gtaaaagtgc 4320
cttttacatt ctaccatcta ttatctctaa tgagaagcaa gaaattcttg gaactgtttc 4380
ttggaatttg cgagaaatgc ttgcacatgc agaagaaaca cgcaaattaa tgcctgtctg 4440
tgtggaaact aaagccatag tttcaactat acagcgtaaa tataagggta ttaaaataca 4500
agagggtgtg gttgattatg gtgctagatt ttacttttac accagtaaaa caactgtagc 4560
gtcacttatc aacacactta acgatctaaa tgaaactctt gttacaatgc cacttggcta 4620
tgtaacacat ggcttaaatt tggaagaagc tgctcggtat atgagatctc tcaaagtgcc 4680
agctacagtt tctgtttctt cacctgatgc tgttacagcg tataatggtt atcttacttc 4740
ttcttctaaa acacctgaag aacattttat tgaaaccatc tcacttgctg gttcctataa 4800
agattggtcc tattctggac aatctacaca actaggtata gaatttctta agagaggtga 4860
taaaagtgta tattacacta gtaatcctac cacattccac ctagatggtg aagttatcac 4920
ctttgacaat cttaagacac ttctttcttt gagagaagtg aggactatta aggtgtttac 4980
aacagtagac aacattaacc tccacacgca agttgtggac atgtcaatga catatggaca 5040
acagtttggt ccaacttatt tggatggagc tgatgttact aaaataaaac ctcataattc 5100
acatgaaggt aaaacatttt atgttttacc taatgatgac actctacgtg ttgaggcttt 5160
tgagtactac cacacaactg atcctagttt tctgggtagg tacatgtcag cattaaatca 5220
cactaaaaag tggaaatacc cacaagttaa tggtttaact tctattaaat gggcagataa 5280
caactgttat cttgccactg cattgttaac actccaacaa atagagttga agtttaatcc 5340
acctgctcta caagatgctt attacagagc aagggctggt gaagctgcta acttttgtgc 5400
acttatctta gcctactgta ataagacagt aggtgagtta ggtgatgtta gagaaacaat 5460
gagttacttg tttcaacatg ccaatttaga ttcttgcaaa agagtcttga acgtggtgtg 5520
taaaacttgt ggacaacagc agacaaccct taagggtgta gaagctgtta tgtacatggg 5580
cacactttct tatgaacaat ttaagaaagg tgttcagata ccttgtacgt gtggtaaaca 5640
agctacaaaa tatctagtac aacaggagtc accttttgtt atgatgtcag caccacctgc 5700
tcagtatgaa cttaagcatg gtacatttac ttgtgctagt gagtacactg gtaattacca 5760
gtgtggtcac tataaacata taacttctaa agaaactttg tattgcatag acggtgcttt 5820
acttacaaag tcctcagaat acaaaggtcc tattacggat gttttctaca aagaaaacag 5880
ttacacaaca accataaaac cagttactta taaattggat ggtgttgttt gtacagaaat 5940
tgaccctaag ttggacaatt attataagaa agacaattct tatttcacag agcaaccaat 6000
tgatcttgta ccaaaccaac catatccaaa cgcaagcttc gataatttta agtttgtatg 6060
tgataatatc aaatttgctg atgatttaaa ccagttaact ggttataaga aacctgcttc 6120
aagagagctt aaagttacat ttttccctga cttaaatggt gatgtggtgg ctattgatta 6180
taaacactac acaccctctt ttaagaaagg agctaaattg ttacataaac ctattgtttg 6240
gcatgttaac aatgcaacta ataaagccac gtataaacca aatacctggt gtatacgttg 6300
tctttggagc acaaaaccag ttgaaacatc aaattcgttt gatgtactga agtcagagga 6360
cgcgcaggga atggataatc ttgcctgcga agatctaaaa ccagtctctg aagaagtagt 6420
ggaaaatcct accatacaga aagacgttct tgagtgtaat gtgaaaacta ccgaagttgt 6480
aggagacatt atacttaaac cagcaaataa tagtttaaaa attacagaag aggttggcca 6540
cacagatcta atggctgctt atgtagacaa ttctagtctt actattaaga aacctaatga 6600
attatctaga gtattaggtt tgaaaaccct tgctactcat ggtttagctg ctgttaatag 6660
tgtcccttgg gatactatag ctaattatgc taagcctttt cttaacaaag ttgttagtac 6720
aactactaac atagttacac ggtgtttaaa ccgtgtttgt actaattata tgccttattt 6780
ctttacttta ttgctacaat tgtgtacttt tactagaagt acaaattcta gaattaaagc 6840
atctatgccg actactatag caaagaatac tgttaagagt gtcggtaaat tttgtctaga 6900
ggcttcattt aattatttga agtcacctaa tttttctaaa ctgataaata ttataatttg 6960
gtttttacta ttaagtgttt gcctaggttc tttaatctac tcaaccgctg ctttaggtgt 7020
tttaatgtct aatttaggca tgccttctta ctgtactggt tacagagaag gctatttgaa 7080
ctctactaat gtcactattg caacctactg tactggttct ataccttgta gtgtttgtct 7140
tagtggttta gattctttag acacctatcc ttctttagaa actatacaaa ttaccatttc 7200
atcttttaaa tgggatttaa ctgcttttgg cttagttgca gagtggtttt tggcatatat 7260
tcttttcact aggtttttct atgtacttgg attggctgca atcatgcaat tgtttttcag 7320
ctattttgca gtacatttta ttagtaattc ttggcttatg tggttaataa ttaatcttgt 7380
acaaatggcc ccgatttcag ctatggttag aatgtacatc ttctttgcat cattttatta 7440
tgtatggaaa agttatgtgc atgttgtaga cggttgtaat tcatcaactt gtatgatgtg 7500
ttacaaacgt aatagagcaa caagagtcga atgtacaact attgttaatg gtgttagaag 7560
gtccttttat gtctatgcta atggaggtaa aggcttttgc aaactacaca attggaattg 7620
tgttaattgt gatacattct gtgctggtag tacatttatt agtgatgaag ttgcgagaga 7680
cttgtcacta cagtttaaaa gaccaataaa tcctactgac cagtcttctt acatcgttga 7740
tagtgttaca gtgaagaatg gttccatcca tctttacttt gataaagctg gtcaaaagac 7800
ttatgaaaga cattctctct ctcattttgt taacttagac aacctgagag ctaataacac 7860
taaaggttca ttgcctatta atgttatagt ttttgatggt aaatcaaaat gtgaagaatc 7920
atctgcaaaa tcagcgtctg tttactacag tcagcttatg tgtcaaccta tactgttact 7980
agatcaggca ttagtgtctg atgttggtga tagtgcggaa gttgcagtta aaatgtttga 8040
tgcttacgtt aatacgtttt catcaacttt taacgtacca atggaaaaac tcaaaacact 8100
agttgcaact gcagaagctg aacttgcaaa gaatgtgtcc ttagacaatg tcttatctac 8160
ttttatttca gcagctcggc aagggtttgt tgattcagat gtagaaacta aagatgttgt 8220
tgaatgtctt aaattgtcac atcaatctga catagaagtt actggcgata gttgtaataa 8280
ctatatgctc acctataaca aagttgaaaa catgacaccc cgtgaccttg gtgcttgtat 8340
tgactgtagt gcgcgtcata ttaatgcgca ggtagcaaaa agtcacaaca ttgctttgat 8400
atggaacgtt aaagatttca tgtcattgtc tgaacaacta cgaaaacaaa tacgtagtgc 8460
tgctaaaaag aataacttac cttttaagtt gacatgtgca actactagac aagttgttaa 8520
tgttgtaaca acaaagatag cacttaaggg tggtaaaatt gttaataatt ggttgaagca 8580
gttaattaaa gttacacttg tgttcctttt tgttgctgct attttctatt taataacacc 8640
tgttcatgtc atgtctaaac atactgactt ttcaagtgaa atcataggat acaaggctat 8700
tgatggtggt gtcactcgtg acatagcatc tacagatact tgttttgcta acaaacatgc 8760
tgattttgac acatggttta gccagcgtgg tggtagttat actaatgaca aagcttgccc 8820
attgattgct gcagtcataa caagagaagt gggttttgtc gtgcctggtt tgcctggcac 8880
gatattacgc acaactaatg gtgacttttt gcatttctta cctagagttt ttagtgcagt 8940
tggtaacatc tgttacacac catcaaaact tatagagtac actgactttg caacatcagc 9000
ttgtgttttg gctgctgaat gtacaatttt taaagatgct tctggtaagc cagtaccata 9060
ttgttatgat accaatgtac tagaaggttc tgttgcttat gaaagtttac gccctgacac 9120
acgttatgtg ctcatggatg gctctattat tcaatttcct aacacctacc ttgaaggttc 9180
tgttagagtg gtaacaactt ttgattctga gtactgtagg cacggcactt gtgaaagatc 9240
agaagctggt gtttgtgtat ctactagtgg tagatgggta cttaacaatg attattacag 9300
atctttacca ggagttttct gtggtgtaga tgctgtaaat ttacttacta atatgtttac 9360
accactaatt caacctattg gtgctttgga catatcagca tctatagtag ctggtggtat 9420
tgtagctatc gtagtaacat gccttgccta ctattttatg aggtttagaa gagcttttgg 9480
tgaatacagt catgtagttg cctttaatac tttactattc cttatgtcat tcactgtact 9540
ctgtttaaca ccagtttact cattcttacc tggtgtttat tctgttattt acttgtactt 9600
gacattttat cttactaatg atgtttcttt tttagcacat attcagtgga tggttatgtt 9660
cacaccttta gtacctttct ggataacaat tgcttatatc atttgtattt ccacaaagca 9720
tttctattgg ttctttagta attacctaaa gagacgtgta gtctttaatg gtgtttcctt 9780
tagtactttt gaagaagctg cgctgtgcac ctttttgtta aataaagaaa tgtatctaaa 9840
gttgcgtagt gatgtgctat tacctcttac gcaatataat agatacttag ctctttataa 9900
taagtacaag tattttagtg gagcaatgga tacaactagc tacagagaag ctgcttgttg 9960
tcatctcgca aaggctctca atgacttcag taactcaggt tctgatgttc tttaccaacc 10020
accacaaacc tctatcacct cagctgtttt gcagagtggt tttagaaaaa tggcattccc 10080
atctggtaaa gttgagggtt gtatggtaca agtaacttgt ggtacaacta cacttaacgg 10140
tctttggctt gatgacgtag tttactgtcc aagacatgtg atctgcacct ctgaagacat 10200
gcttaaccct aattatgaag atttactcat tcgtaagtct aatcataatt tcttggtaca 10260
ggctggtaat gttcaactca gggttattgg acattctatg caaaattgtg tacttaagct 10320
taaggttgat acagccaatc ctaagacacc taagtataag tttgttcgca ttcaaccagg 10380
acagactttt tcagtgttag cttgttacaa tggttcacca tctggtgttt accaatgtgc 10440
tatgaggccc aatttcacta ttaagggttc attccttaat ggttcatgtg gtagtgttgg 10500
ttttaacata gattatgact gtgtctcttt ttgttacatg caccatatgg aattaccaac 10560
tggagttcat gctggcacag acttagaagg taacttttat ggaccttttg ttgacaggca 10620
aacagcacaa gcagctggta cggacacaac tattacagtt aatgttttag cttggttgta 10680
cgctgctgtt ataaatggag acaggtggtt tctcaatcga tttaccacaa ctcttaatga 10740
ctttaacctt gtggctatga agtacaatta tgaacctcta acacaagacc atgttgacat 10800
actaggacct ctttctgctc aaactggaat tgccgtttta gatatgtgtg cttcattaaa 10860
agaattactg caaaatggta tgaatggacg taccatattg ggtagtgctt tattagaaga 10920
tgaatttaca ccttttgatg ttgttagaca atgctcaggt gttactttcc aaagtgcagt 10980
gaaaagaaca atcaagggta cacaccactg gttgttactc acaattttga cttcactttt 11040
agttttagtc cagagtactc aatggtcttt gttctttttt ttgtatgaaa atgccttttt 11100
accttttgct atgggtatta ttgctatgtc tgcttttgca atgatgtttg tcaaacataa 11160
gcatgcattt ctctgtttgt ttttgttacc ttctcttgcc actgtagctt attttaatat 11220
ggtctatatg cctgctagtt gggtgatgcg tattatgaca tggttggata tggttgatac 11280
tagtttgtct ggttttaagc taaaagactg tgttatgtat gcatcagctg tagtgttact 11340
aatccttatg acagcaagaa ctgtgtatga tgatggtgct aggagagtgt ggacacttat 11400
gaatgtcttg acactcgttt ataaagttta ttatggtaat gctttagatc aagccatttc 11460
catgtgggct cttataatct ctgttacttc taactactca ggtgtagtta caactgtcat 11520
gtttttggcc agaggtattg tttttatgtg tgttgagtat tgccctattt tcttcataac 11580
tggtaataca cttcagtgta taatgctagt ttattgtttc ttaggctatt tttgtacttg 11640
ttactttggc ctcttttgtt tactcaaccg ctactttaga ctgactcttg gtgtttatga 11700
ttacttagtt tctacacagg agtttagata tatgaattca cagggactac tcccacccaa 11760
gaatagcata gatgccttca aactcaacat taaattgttg ggtgttggtg gcaaaccttg 11820
tatcaaagta gccactgtac agtctaaaat gtcagatgta aagtgcacat cagtagtctt 11880
actctcagtt ttgcaacaac tcagagtaga atcatcatct aaattgtggg ctcaatgtgt 11940
ccagttacac aatgacattc tcttagctaa agatactact gaagcctttg aaaaaatggt 12000
ttcactactt tctgttttgc tttccatgca gggtgctgta gacataaaca agctttgtga 12060
agaaatgctg gacaacaggg caaccttaca agctatagcc tcagagttta gttcccttcc 12120
atcatatgca gcttttgcta ctgctcaaga agcttatgag caggctgttg ctaatggtga 12180
ttctgaagtt gttcttaaaa agttgaagaa gtctttgaat gtggctaaat ctgaatttga 12240
ccgtgatgca gccatgcaac gtaagttgga aaagatggct gatcaagcta tgacccaaat 12300
gtataaacag gctagatctg aggacaagag ggcaaaagtt actagtgcta tgcagacaat 12360
gcttttcact atgcttagaa agttggataa tgatgcactc aacaacatta tcaacaatgc 12420
aagagatggt tgtgttccct tgaacataat acctcttaca acagcagcca aactaatggt 12480
tgtcatacca gactataaca catataaaaa tacgtgtgat ggtacaacat ttacttatgc 12540
atcagcattg tgggaaatcc aacaggttgt agatgcagat agtaaaattg ttcaacttag 12600
tgaaattagt atggacaatt cacctaattt agcatggcct cttattgtaa cagctttaag 12660
ggccaattct gctgtcaaat tacagaataa tgagcttagt cctgttgcac tacgacagat 12720
gtcttgtgct gccggtacta cacaaactgc ttgcactgat gacaatgcgt tagcttacta 12780
caacacaaca aagggaggta ggtttgtact tgcactgtta tccgatttac aggatttgaa 12840
atgggctaga ttccctaaga gtgatggaac tggtactatc tatacagaac tggaaccacc 12900
ttgtaggttt gttacagaca cacctaaagg tcctaaagtg aagtatttat actttattaa 12960
aggattaaac aacctaaata gaggtatggt acttggtagt ttagctgcca cagtacgtct 13020
acaagctggt aatgcaacag aagtgcctgc caattcaact gtattatctt tctgtgcttt 13080
tgctgtagat gctgctaaag cttacaaaga ttatctagct agtgggggac aaccaatcac 13140
taattgtgtt aagatgttgt gtacacacac tggtactggt caggcaataa cagttacacc 13200
ggaagccaat atggatcaag aatcctttgg tggtgcatcg tgttgtctgt actgccgttg 13260
ccacatagat catccaaatc ctaaaggatt ttgtgactta aaaggtaagt atgtacaaat 13320
acctacaact tgtgctaatg accctgtggg ttttacactt aaaaacacag tctgtaccgt 13380
ctgcggtatg tggaaaggtt atggctgtag ttgtgatcaa ctccgcgaac ccatgcttca 13440
gtcagctgat gcacaatcgt ttttaaacgg gtttgcggtg taagtgcagc ccgtcttaca 13500
ccgtgcggca caggcactag tactgatgtc gtatacaggg cttttgacat ctacaatgat 13560
aaagtagctg gttttgctaa attcctaaaa actaattgtt gtcgcttcca agaaaaggac 13620
gaagatgaca atttaattga ttcttacttt gtagttaaga gacacacttt ctctaactac 13680
caacatgaag aaacaattta taatttactt aaggattgtc cagctgttgc taaacatgac 13740
ttctttaagt ttagaataga cggtgacatg gtaccacata tatcacgtca acgtcttact 13800
aaatacacaa tggcagacct cgtctatgct ttaaggcatt ttgatgaagg taattgtgac 13860
acattaaaag aaatacttgt cacatacaat tgttgtgatg atgattattt caataaaaag 13920
gactggtatg attttgtaga aaacccagat atattacgcg tatacgccaa cttaggtgaa 13980
cgtgtacgcc aagctttgtt aaaaacagta caattctgtg atgccatgcg aaatgctggt 14040
attgttggtg tactgacatt agataatcaa gatctcaatg gtaactggta tgatttcggt 14100
gatttcatac aaaccacgcc aggtagtgga gttcctgttg tagattctta ttattcattg 14160
ttaatgccta tattaacctt gaccagggct ttaactgcag agtcacatgt tgacactgac 14220
ttaacaaagc cttacattaa gtgggatttg ttaaaatatg acttcacgga agagaggtta 14280
aaactctttg accgttattt taaatattgg gatcagacat accacccaaa ttgtgttaac 14340
tgtttggatg acagatgcat tctgcattgt gcaaacttta atgttttatt ctctacagtg 14400
ttcccaccta caagttttgg accactagtg agaaaaatat ttgttgatgg tgttccattt 14460
gtagtttcaa ctggatacca cttcagagag ctaggtgttg tacataatca ggatgtaaac 14520
ttacatagct ctagacttag ttttaaggaa ttacttgtgt atgctgctga ccctgctatg 14580
cacgctgctt ctggtaatct attactagat aaacgcacta cgtgcttttc agtagctgca 14640
cttactaaca atgttgcttt tcaaactgtc aaacccggta attttaacaa agacttctat 14700
gactttgctg tgtctaaggg tttctttaag gaaggaagtt ctgttgaatt aaaacacttc 14760
ttctttgctc aggatggtaa tgctgctatc agcgattatg actactatcg ttataatcta 14820
ccaacaatgt gtgatatcag acaactacta tttgtagttg aagttgttga taagtacttt 14880
gattgttacg atggtggctg tattaatgct aaccaagtca tcgtcaacaa cctagacaaa 14940
tcagctggtt ttccatttaa taaatggggt aaggctagac tttattatga ttcaatgagt 15000
tatgaggatc aagatgcact tttcgcatat acaaaacgta atgtcatccc tactataact 15060
caaatgaatc ttaagtatgc cattagtgca aagaatagag ctcgcaccgt agctggtgtc 15120
tctatctgta gtactatgac caatagacag tttcatcaaa aattattgaa atcaatagcc 15180
gccactagag gagctactgt agtaattgga acaagcaaat tctatggtgg ttggcacaac 15240
atgttaaaaa ctgtttatag tgatgtagaa aaccctcacc ttatgggttg ggattatcct 15300
aaatgtgata gagccatgcc taacatgctt agaattatgg cctcacttgt tcttgctcgc 15360
aaacatacaa cgtgttgtag cttgtcacac cgtttctata gattagctaa tgagtgtgct 15420
caagtattga gtgaaatggt catgtgtggc ggttcactat atgttaaacc aggtggaacc 15480
tcatcaggag atgccacaac tgcttatgct aatagtgttt ttaacatttg tcaagctgtc 15540
acggccaatg ttaatgcact tttatctact gatggtaaca aaattgccga taagtatgtc 15600
cgcaatttac aacacagact ttatgagtgt ctctatagaa atagagatgt tgacacagac 15660
tttgtgaatg agttttacgc atatttgcgt aaacatttct caatgatgat actctctgac 15720
gatgctgttg tgtgtttcaa tagcacttat gcatctcaag gtctagtggc tagcataaag 15780
aactttaagt cagttcttta ttatcaaaac aatgttttta tgtctgaagc aaaatgttgg 15840
actgagactg accttactaa aggacctcat gaattttgct ctcaacatac aatgctagtt 15900
aaacagggtg atgattatgt gtaccttcct tacccagatc catcaagaat cctaggggcc 15960
ggctgttttg tagatgatat cgtaaaaaca gatggtacac ttatgattga acggttcgtg 16020
tctttagcta tagatgctta cccacttact aaacatccta atcaggagta tgctgatgtc 16080
tttcatttgt acttacaata cataagaaag ctacatgatg agttaacagg acacatgtta 16140
gacatgtatt ctgttatgct tactaatgat aacacttcaa ggtattggga acctgagttt 16200
tatgaggcta tgtacacacc gcatacagtc ttacaggctg ttggggcttg tgttctttgc 16260
aattcacaga cttcattaag atgtggtgct tgcatacgta gaccattctt atgttgtaaa 16320
tgctgttacg accatgtcat atcaacatca cataaattag tcttgtctgt taatccgtat 16380
gtttgcaatg ctccaggttg tgatgtcaca gatgtgactc aactttactt aggaggtatg 16440
agctattatt gtaaatcaca taaaccaccc attagttttc cattgtgtgc taatggacaa 16500
gtttttggtt tatataaaaa tacatgtgtt ggtagcgata atgttactga ctttaatgca 16560
attgcaacat gtgactggac aaatgctggt gattacattt tagctaacac ctgtactgaa 16620
agactcaagc tttttgcagc agaaacgctc aaagctactg aggagacatt taaactgtct 16680
tatggtattg ctactgtacg tgaagtgctg tctgacagag aattacatct ttcatgggaa 16740
gttggtaaac ctagaccacc acttaaccga aattatgtct ttactggtta tcgtgtaact 16800
aaaaacagta aagtacaaat aggagagtac acctttgaaa aaggtgacta tggtgatgct 16860
gttgtttacc gaggtacaac aacttacaaa ttaaatgttg gtgattattt tgtgctgaca 16920
tcacatacag taatgccatt aagtgcacct acactagtgc cacaagagca ctatgttaga 16980
attactggct tatacccaac actcaatatc tcagatgagt tttctagcaa tgttgcaaat 17040
tatcaaaagg ttggtatgca aaagtattct acactccagg gaccacctgg tactggtaag 17100
agtcattttg ctattggcct agctctctac tacccttctg ctcgcatagt gtatacagct 17160
tgctctcatg ccgctgttga tgcactatgt gagaaggcat taaaatattt gcctatagat 17220
aaatgtagta gaattatacc tgcacgtgct cgtgtagagt gttttgataa attcaaagtg 17280
aattcaacat tagaacagta tgtcttttgt actgtaaatg cattgcctga gacgacagca 17340
gatatagttg tctttgatga aatttcaatg gccacaaatt atgatttgag tgttgtcaat 17400
gccagattac gtgctaagca ctatgtgtac attggcgacc ctgctcaatt acctgcacca 17460
cgcacattgc taactaaggg cacactagaa ccagaatatt tcaattcagt gtgtagactt 17520
atgaaaacta taggtccaga catgttcctc ggaacttgtc ggcgttgtcc tgctgaaatt 17580
gttgacactg tgagtgcttt ggtttatgat aataagctta aagcacataa agacaaatca 17640
gctcaatgct ttaaaatgtt ttataagggt gttatcacgc atgatgtttc atctgcaatt 17700
aacaggccac aaataggcgt ggtaagagaa ttccttacac gtaaccctgc ttggagaaaa 17760
gctgtcttta tttcacctta taattcacag aatgctgtag cctcaaagat tttgggacta 17820
ccaactcaaa ctgttgattc atcacagggc tcagaatatg actatgtcat attcactcaa 17880
accactgaaa cagctcactc ttgtaatgta aacagattta atgttgctat taccagagca 17940
aaagtaggca tactttgcat aatgtctgat agagaccttt atgacaagtt gcaatttaca 18000
agtcttgaaa ttccacgtag gaatgtggca actttacaag ctgaaaatgt aacaggactc 18060
tttaaagatt gtagtaaggt aatcactggg ttacatccta cacaggcacc tacacacctc 18120
agtgttgaca ctaaattcaa aactgaaggt ttatgtgttg acatacctgg catacctaag 18180
gacatgacct atagaagact catctctatg atgggtttta aaatgaatta tcaagttaat 18240
ggttacccta acatgtttat cacccgcgaa gaagctataa gacatgtacg tgcatggatt 18300
ggcttcgatg tcgaggggtg tcatgctact agagaagctg ttggtaccaa tttaccttta 18360
cagctaggtt tttctacagg tgttaaccta gttgctgtac ctacaggtta tgttgataca 18420
cctaataata cagatttttc cagagttagt gctaaaccac cgcctggaga tcaatttaaa 18480
cacctcatac cacttatgta caaaggactt ccttggaatg tagtgcgtat aaagattgta 18540
caaatgttaa gtgacacact taaaaatctc tctgacagag tcgtatttgt cttatgggca 18600
catggctttg agttgacatc tatgaagtat tttgtgaaaa taggacctga gcgcacctgt 18660
tgtctatgtg atagacgtgc cacatgcttt tccactgctt cagacactta tgcctgttgg 18720
catcattcta ttggatttga ttacgtctat aatccgttta tgattgatgt tcaacaatgg 18780
ggttttacag gtaacctaca aagcaaccat gatctgtatt gtcaagtcca tggtaatgca 18840
catgtagcta gttgtgatgc aatcatgact aggtgtctag ctgtccacga gtgctttgtt 18900
aagcgtgttg actggactat tgaatatcct ataattggtg atgaactgaa gattaatgcg 18960
gcttgtagaa aggttcaaca catggttgtt aaagctgcat tattagcaga caaattccca 19020
gttcttcacg acattggtaa ccctaaagct attaagtgtg tacctcaagc tgatgtagaa 19080
tggaagttct atgatgcaca gccttgtagt gacaaagctt ataaaataga agaattattc 19140
tattcttatg ccacacattc tgacaaattc acagatggtg tatgcctatt ttggaattgc 19200
aatgtcgata gatatcctgc taattccatt gtttgtagat ttgacactag agtgctatct 19260
aaccttaact tgcctggttg tgatggtggc agtttgtatg taaataaaca tgcattccac 19320
acaccagctt ttgataaaag tgcttttgtt aatttaaaac aattaccatt tttctattac 19380
tctgacagtc catgtgagtc tcatggaaaa caagtagtgt cagatataga ttatgtacca 19440
ctaaagtctg ctacgtgtat aacacgttgc aatttaggtg gtgctgtctg tagacatcat 19500
gctaatgagt acagattgta tctcgatgct tataacatga tgatctcagc tggctttagc 19560
ttgtgggttt acaaacaatt tgatacttat aacctctgga acacttttac aagacttcag 19620
agtttagaaa atgtggcttt taatgttgta aataagggac actttgatgg acaacagggt 19680
gaagtaccag tttctatcat taataacact gtttacacaa aagttgatgg tgttgatgta 19740
gaattgtttg aaaataaaac aacattacct gttaatgtag catttgagct ttgggctaag 19800
cgcaacatta aaccagtacc agaggtgaaa atactcaata atttgggtgt ggacattgct 19860
gctaatactg tgatctggga ctacaaaaga gatgctccag cacatatatc tactattggt 19920
gtttgttcta tgactgacat agccaagaaa ccaactgaaa cgatttgtgc accactcact 19980
gtcttttttg atggtagagt tgatggtcaa gtagacttat ttagaaatgc ccgtaatggt 20040
gttcttatta cagaaggtag tgttaaaggt ttacaaccat ctgtaggtcc caaacaagct 20100
agtcttaatg gagtcacatt aattggagaa gccgtaaaaa cacagttcaa ttattataag 20160
aaagttgatg gtgttgtcca acaattacct gaaacttact ttactcagag tagaaattta 20220
caagaattta aacccaggag tcaaatggaa attgatttct tagaattagc tatggatgaa 20280
ttcattgaac ggtataaatt agaaggctat gccttcgaac atatcgttta tggagatttt 20340
agtcatagtc agttaggtgg tttacatcta ctgattggac tagctaaacg ttttaaggaa 20400
tcaccttttg aattagaaga ttttattcct atggacagta cagttaaaaa ctatttcata 20460
acagatgcgc aaacaggttc atctaagtgt gtgtgttctg ttattgattt attacttgat 20520
gattttgttg aaataataaa atcccaagat ttatctgtag tttctaaggt tgtcaaagtg 20580
actattgact atacagaaat ttcatttatg ctttggtgta aagatggcca tgtagaaaca 20640
ttttacccaa aattacaatc tagtcaagcg tggcaaccgg gtgttgctat gcctaatctt 20700
tacaaaatgc aaagaatgct attagaaaag tgtgaccttc aaaattatgg tgatagtgca 20760
acattaccta aaggcataat gatgaatgtc gcaaaatata ctcaactgtg tcaatattta 20820
aacacattaa cattagctgt accctataat atgagagtta tacattttgg tgctggttct 20880
gataaaggag ttgcaccagg tacagctgtt ttaagacagt ggttgcctac gggtacgctg 20940
cttgtcgatt cagatcttaa tgactttgtc tctgatgcag attcaacttt gattggtgat 21000
tgtgcaactg tacatacagc taataaatgg gatctcatta ttagtgatat gtacgaccct 21060
aagactaaaa atgttacaaa agaaaatgac tctaaagagg gttttttcac ttacatttgt 21120
gggtttatac aacaaaagct agctcttgga ggttccgtgg ctataaagat aacagaacat 21180
tcttggaatg ctgatcttta taagctcatg ggacacttcg catggtggac agcctttgtt 21240
actaatgtga atgcgtcatc atctgaagca tttttaattg gatgtaatta tcttggcaaa 21300
ccacgcgaac aaatagatgg ttatgtcatg catgcaaatt acatattttg gaggaataca 21360
aatccaattc agttgtcttc ctattcttta tttgacatga gtaaatttcc ccttaaatta 21420
aggggtactg ctgttatgtc tttaaaagaa ggtcaaatca atgatatgat tttatctctt 21480
cttagtaaag gtagacttat aattagagaa aacaacagag ttgttatttc tagtgatgtt 21540
cttgttaaca actaaacgaa caatgtttgt ttttcttgtt ttattgccac tagtctctag 21600
tcagtgtgtt aatcttacaa ccagaactca attaccccct gcatacacta attctttcac 21660
acgtggtgtt tattaccctg acaaagtttt cagatcctca gttttacatt caactcagga 21720
cttgttctta cctttctttt ccaatgttac ttggttccat gctatacatg tctctgggac 21780
caatggtact aagaggtttg ataaccctgt cctaccattt aatgatggtg tttattttgc 21840
ttccactgag aagtctaaca taataagagg ctggattttt ggtactactt tagattcgaa 21900
gacccagtcc ctacttattg ttaataacgc tactaatgtt gttattaaag tctgtgaatt 21960
tcaattttgt aatgatccat ttttgggtgt ttattaccac aaaaacaaca aaagttggat 22020
ggaaagtgag ttcagagttt attctagtgc gaataattgc acttttgaat atgtctctca 22080
gccttttctt atggaccttg aaggaaaaca gggtaatttc aaaaatctta gggaatttgt 22140
gtttaagaat attgatggtt attttaaaat atattctaag cacacgccta ttaatttagt 22200
gcgtgatctc cctcagggtt tttcggcttt agaaccattg gtagatttgc caataggtat 22260
taacatcact aggtttcaaa ctttacttgc tttacataga agttatttga ctcctggtga 22320
ttcttcttca ggttggacag ctggtgctgc agcttattat gtgggttatc ttcaacctag 22380
gacttttcta ttaaaatata atgaaaatgg aaccattaca gatgctgtag actgtgcact 22440
tgaccctctc tcagaaacaa agtgtacgtt gaaatccttc actgtagaaa aaggaatcta 22500
tcaaacttct aactttagag tccaaccaac agaatctatt gttagatttc ctaatattac 22560
aaacttgtgc ccttttggtg aagtttttaa cgccaccaga tttgcatctg tttatgcttg 22620
gaacaggaag agaatcagca actgtgttgc tgattattct ttcctatata attccgcatc 22680
attttccact tttaagtgtt atggagtgtc tcctactaaa ttaaatgatc tctgctttac 22740
taatgtctat gcagattcat ttgtaattag aggtgatgaa gtcagacaaa tcgctccagg 22800
gcaaactgga aagattgctg attataatta taaattacca gatgatttta caggctgcgt 22860
tatagcttgg aattctaaca atcttgattc taaggttggt ggtaattata attacctgta 22920
tagattgttt aggaagtcta atctcaaacc ttttgagaga gatatttcaa ctgaaatcta 22980
tcaggccggt agcacacctt gtaatggtgt tgaaggtttt aattgttact ttcctttaca 23040
atcatatggt ttccaaccca ctaatggtgt tggttaccaa ccatacagag tagtagtact 23100
ttcttttgaa cttctacatg caccagcaac tgtttgtgga cctaaaaagt ctactaattt 23160
ggttaaaaac aaatgtgtca atttcaactt caatggttta acaggcacag gtgttcttac 23220
tgagtctaac aaaaagtttc tgcctttcca acaatttggc agagacattg ctgacactac 23280
tgatgctgtc cgtgatccac agacacttga gattcttgac attacaccat gttcttttgg 23340
tggtgtcagt gttataacac caggaacaaa tacttctaac caggttgctg ttctttatca 23400
ggatgttaac tgcacagaag tccctgttgc tattcatgca gatcaactta ctcctacttg 23460
gcgtgtttat tctacaggtt ctaatgtttt tcaaacacgt gcaggctgtt taataggggc 23520
tgaacatgtc aacaactcat atgagtgtga catacccatt ggtgcaggta tatgcgctag 23580
ttatcagact cagactaatt ctcctcggcg ggcacgtagt gtagctagtc aatccatcat 23640
tgcctacact atgtcacttg gtgcagaaaa ttcagttgct tactctaata actctattgc 23700
catacccaca aattttacta ttagtgttac cacagaaatt ctaccagtgt ctatgaccaa 23760
gacatcagta gattgtacaa tgtacatttg tggtgattca actgaatgca gcaatctttt 23820
gttgcaatat ggcagttttt gtacacaatt aaaccgtgct ttaactggaa tagctgttga 23880
acaagacaaa aacacccaag aagtttttgc acaagtcaaa caaatttaca aaacaccacc 23940
aattaaagat tttggtggtt ttaatttttc acaaatatta ccagatccat caaaaccaag 24000
caagaggtca tttattgaag atctactttt caacaaagtg acacttgcag atgctggctt 24060
catcaaacaa tatggtgatt gccttggtga tattgctgct agagacctca tttgtgcaca 24120
aaagtttaac ggccttactg ttttgccacc tttgctcaca gatgaaatga ttgctcaata 24180
cacttctgca ctgttagcgg gtacaatcac ttctggttgg acctttggtg caggtgctgc 24240
attacaaata ccatttgcta tgcaaatggc ttataggttt aatggtattg gagttacaca 24300
gaatgttctc tatgagaacc aaaaattgat tgccaaccaa tttaatagtg ctattggcaa 24360
aattcaagac tcactttctt ccacagcaag tgcacttgga aaacttcaag atgtggtcaa 24420
ccaaaatgca caagctttaa acacgcttgt taaacaactt agctccaatt ttggtgcaat 24480
ttcaagtgtt ttaaatgata tcctttcacg tcttgacaaa gttgaggctg aagtgcaaat 24540
tgataggttg atcacaggca gacttcaaag tttgcagaca tatgtgactc aacaattaat 24600
tagagctgca gaaatcagag cttctgctaa tcttgctgct actaaaatgt cagagtgtgt 24660
acttggacaa tcaaaaagag ttgatttttg tggaaagggc tatcatctta tgtccttccc 24720
tcagtcagca cctcatggtg tagtcttctt gcatgtgact tatgtccctg cacaagaaaa 24780
gaacttcaca actgctcctg ccatttgtca tgatggaaaa gcacactttc ctcgtgaagg 24840
tgtctttgtt tcaaatggca cacactggtt tgtaacacaa aggaattttt atgaaccaca 24900
aatcattact acagacaaca catttgtgtc tggtaactgt gatgttgtaa taggaattgt 24960
caacaacaca gtttatgatc ctttgcaacc tgaattagac tcattcaagg aggagttaga 25020
taaatatttt aagaatcata catcaccaga tgttgattta ggtgacatct ctggcattaa 25080
tgcttcagtt gtaaacattc aaaaagaaat tgaccgcctc aatgaggttg ccaagaattt 25140
aaatgaatct ctcatcgatc tccaagaact tggaaagtat gagcagtata taaaatggcc 25200
atggtacatt tggctaggtt ttatagctgg cttgattgcc atagtaatgg tgacaattat 25260
gctttgctgt atgaccagtt gctgtagttg tctcaagggc tgttgttctt gtggatcctg 25320
ctgcaaattt gatgaagacg actctgagcc agtgctcaaa ggagtcaaat tacattacac 25380
ataaacgaac ttatggattt gtttatgaga atcttcacaa ttggaactgt aactttgaag 25440
caaggtgaaa tcaaggatgc tactccttca gattttgttc gcgctactgc aacgataccg 25500
atacaagcct cactcccttt cggatggctt attgttggcg ttgcacttct tgctgttttt 25560
cagagcgctt ccaaaatcat aaccctcaaa aagagatggc aactagcact ctccaagggt 25620
gttcactttg tttgcaactt gctgttgttg tttgtaacag tttactcaca ccttttgctc 25680
gttgctgctg gccttgaagc cccttttctc tatctttatg ctttagtcta cttcttgcag 25740
agtataaact ttgtaagaat aataatgagg ctttggcttt gctggaaatg ccgttccaaa 25800
aacccattac tttatgatgc caactatttt ctttgctggc atactaattg ttacgactat 25860
tgtatacctt acaatagtgt aacttcttca attgtcatta cttcaggtga tggcacaaca 25920
agtcctattt ctgaacatga ctaccagatt ggtggttata ctgaaaaatg ggaatctgga 25980
gtaaaagact gtgttgtatt acacagttac ttcacttcag actattacca gctgtactca 26040
actcaattga gtacagacac tggtgttgaa catgttacct tcttcatcta caataaaatt 26100
gttgatgagc ctgaagaaca tgtccaaatt cacacaatcg acgtttcatc cggagttgtt 26160
aatccagtaa tggaaccaat ttatgatgaa ccgacgacga ctactagcgt gcctttgtaa 26220
gcacaagctg atgagtacga acttatgtac tcattcgttt cggaagagac aggtacgtta 26280
atagttaata gcgtacttct ttttcttgct ttcgtggtat tcttgctagt tacactagcc 26340
atccttactg cgcttcgatt gtgtgcgtac tgctgcaata ttgttaacgt gagtcttgta 26400
aaaccttctt tttacgttta ctctcgtgtt aaaaatctga attcttctag agttcctgat 26460
cttctggtct aaacgaacta aatattatat tagtttttct gtttggaact ttaattttag 26520
ccatggcaga ttccaacggt actattaccg ttgaagagct taaaaagctc cttgaacaat 26580
ggaacctagt aataggtttc ctattcctta catggatttg tcttctacaa tttgcctatg 26640
ccaacaggaa taggtttttg tatataatta agttaatttt cctctggctg ttatggccag 26700
taactttagc ttgttttgtg cttgctgctg tttacagaat aaattggatc accggtggaa 26760
ttgctatcgc aatggcttgt cttgtaggct tgatgtggct cagctacttc attgcttctt 26820
tcagactgtt tgcgcgtacg cgttccatgt ggtcattcaa tccagaaact aacattcttc 26880
tcaacgtgcc actccatggc actattctga ccagaccgct tctagaaagt gaactcgtaa 26940
tcggagctgt gatccttcgt ggacatcttc gtattgctgg acaccatcta ggacgctgtg 27000
acatcaagga cctgcctaaa gaaatcactg ttgctacatc acgaacgctt tcttattaca 27060
aattgggagc ttcgcagcgt gtagcaggtg actcaggttt tgctgcatac agtcgctaca 27120
ggattggcaa ctataaatta aacacagacc attccagtag cagtgacaat attgctttgc 27180
ttgtacagta agtgacaaca gatgtttcat ctcgttgact ttcaggttac tatagcagag 27240
atattactaa ttattatgag gacttttaaa gtttccattt ggaatcttga ttacatcata 27300
aacctcataa ttaaaaattt atctaagtca ctaactgaga ataaatattc tcaattagat 27360
gaagagcaac caatggagat tgattaaacg aacatgaaaa ttattctttt cttggcactg 27420
ataacactcg ctacttgtga gctttatcac taccaagagt gtgttagagg tacaacagta 27480
cttttaaaag aaccttgctc ttctggaaca tacgagggca attcaccatt tcatcctcta 27540
gctgataaca aatttgcact gacttgcttt agcactcaat ttgcttttgc ttgtcctgac 27600
ggcgtaaaac acgtctatca gttacgtgcc agatcagttt cacctaaact gttcatcaga 27660
caagaggaag ttcaagaact ttactctcca atttttctta ttgttgcggc aatagtgttt 27720
ataacacttt gcttcacact caaaagaaag acagaatgat tgaactttca ttaattgact 27780
tctatttgtg ctttttagcc tttctgctat tccttgtttt aattatgctt attatctttt 27840
ggttctcact tgaactgcaa gatcataatg aaacttgtca cgcctaaacg aacatgaaat 27900
ttcttgtttt cttaggaatc atcacaactg tagctgcatt tcaccaagaa tgtagtttac 27960
agtcatgtac tcaacatcaa ccatatgtag ttgatgaccc gtgtcctatt cacttctatt 28020
ctaaatggta tattagagta ggagctagaa aatcagcacc tttaattgaa ttgtgcgtgg 28080
atgaggctgg ttctaaatca cccattcagt acatcgatat cggtaattat acagtttcct 28140
gtttaccttt tacaattaat tgccaggaac ctaaattggg tagtcttgta gtgcgttgtt 28200
cgttctatga agacttttta gagtatcatg acgttcgtgt tgttttagat ttcatctaaa 28260
cgaacaaact aaaatgtctg ataatggacc ccaaaatcag cgaaatgcac cccgcattac 28320
gtttggtgga ccctcagatt caactggcag taaccagaat ggagaacgca gtggggcgcg 28380
atcaaaacaa cgtcggcccc aaggtttacc caataatact gcgtcttggt tcaccgctct 28440
cactcaacat ggcaaggaag accttaaatt ccctcgagga caaggcgttc caattaacac 28500
caatagcagt ccagatgacc aaattggcta ctaccgaaga gctaccagac gaattcgtgg 28560
tggtgacggt aaaatgaaag atctcagtcc aagatggtat ttctactacc taggaactgg 28620
gccagaagct ggacttccct atggtgctaa caaagacggc atcatatggg ttgcaactga 28680
gggagccttg aatacaccaa aagatcacat tggcacccgc aatcctgcta acaatgctgc 28740
aatcgtgcta caacttcctc aaggaacaac attgccaaaa ggcttctacg cagaagggag 28800
cagaggcggc agtcaagcct cttctcgttc ctcatcacgt agtcgcaaca gttcaagaaa 28860
ttcaactcca ggcagcagta ggggaacttc tcctgctaga atggctggca atggcggtga 28920
tgctgctctt gctttgctgc tgcttgacag attgaaccag cttgagagca aaatgtctgg 28980
taaaggccaa caacaacaag gccaaactgt cactaagaaa tctgctgctg aggcttctaa 29040
gaagcctcgg caaaaacgta ctgccactaa agcatacaat gtaacacaag ctttcggcag 29100
acgtggtcca gaacaaaccc aaggaaattt tggggaccag gaactaatca gacaaggaac 29160
tgattacaaa cattggccgc aaattgcaca atttgccccc agcgcttcag cgttcttcgg 29220
aatgtcgcgc attggcatgg aagtcacacc ttcgggaacg tggttgacct acacaggtgc 29280
catcaaattg gatgacaaag atccaaattt caaagatcaa gtcattttgc tgaataagca 29340
tattgacgca tacaaaacat tcccaccaac agagcctaaa aaggacaaaa agaagaaggc 29400
tgatgaaact caagccttac cgcagagaca gaagaaacag caaactgtga ctcttcttcc 29460
tgctgcagat ttggatgatt tctccaaaca attgcaacaa tccatgagca gtgctgactc 29520
aactcaggcc taaactcatg cagaccacac aaggcagatg ggctatataa acgttttcgc 29580
ttttccgttt acgatatata gtctactctt gtgcagaatg aattctcgta actacatagc 29640
acaagtagat gtagttaact ttaatctcac atagcaatct ttaatcagtg tgtaacatta 29700
gggaggactt gaaagagcca ccacattttc accgaggcca cgcggagtac gatcgagtgt 29760
acagtgaaca atgctaggga gagctgccta tatggaagag ccctaatgtg taaaattaat 29820
tttagtagtg ctatccccat gtgattttaa tagcttctta ggagaatgac aaaa 29874
<210> 13
<211> 4405
<212> PRT
<213> SARS-CoV2
<400> 13
Met Glu Ser Leu Val Pro Gly Phe Asn Glu Lys Thr His Val Gln Leu
1 5 10 15
Ser Leu Pro Val Leu Gln Val Arg Asp Val Leu Val Arg Gly Phe Gly
20 25 30
Asp Ser Val Glu Glu Val Leu Ser Glu Ala Arg Gln His Leu Lys Asp
35 40 45
Gly Thr Cys Gly Leu Val Glu Val Glu Lys Gly Val Leu Pro Gln Leu
50 55 60
Glu Gln Pro Tyr Val Phe Ile Lys Arg Ser Asp Ala Arg Thr Ala Pro
65 70 75 80
His Gly His Val Met Val Glu Leu Val Ala Glu Leu Glu Gly Ile Gln
85 90 95
Tyr Gly Arg Ser Gly Glu Thr Leu Gly Val Leu Val Pro His Val Gly
100 105 110
Glu Ile Pro Val Ala Tyr Arg Lys Val Leu Leu Arg Lys Asn Gly Asn
115 120 125
Lys Gly Ala Gly Gly His Ser Tyr Gly Ala Asp Leu Lys Ser Phe Asp
130 135 140
Leu Gly Asp Glu Leu Gly Thr Asp Pro Tyr Glu Asp Phe Gln Glu Asn
145 150 155 160
Trp Asn Thr Lys His Ser Ser Gly Val Thr Arg Glu Leu Met Arg Glu
165 170 175
Leu Asn Gly Gly Ala Tyr Thr Arg Tyr Val Asp Asn Asn Phe Cys Gly
180 185 190
Pro Asp Gly Tyr Pro Leu Glu Cys Ile Lys Asp Leu Leu Ala Arg Ala
195 200 205
Gly Lys Ala Ser Cys Thr Leu Ser Glu Gln Leu Asp Phe Ile Asp Thr
210 215 220
Lys Arg Gly Val Tyr Cys Cys Arg Glu His Glu His Glu Ile Ala Trp
225 230 235 240
Tyr Thr Glu Arg Ser Glu Lys Ser Tyr Glu Leu Gln Thr Pro Phe Glu
245 250 255
Ile Lys Leu Ala Lys Lys Phe Asp Thr Phe Asn Gly Glu Cys Pro Asn
260 265 270
Phe Val Phe Pro Leu Asn Ser Ile Ile Lys Thr Ile Gln Pro Arg Val
275 280 285
Glu Lys Lys Lys Leu Asp Gly Phe Met Gly Arg Ile Arg Ser Val Tyr
290 295 300
Pro Val Ala Ser Pro Asn Glu Cys Asn Gln Met Cys Leu Ser Thr Leu
305 310 315 320
Met Lys Cys Asp His Cys Gly Glu Thr Ser Trp Gln Thr Gly Asp Phe
325 330 335
Val Lys Ala Thr Cys Glu Phe Cys Gly Thr Glu Asn Leu Thr Lys Glu
340 345 350
Gly Ala Thr Thr Cys Gly Tyr Leu Pro Gln Asn Ala Val Val Lys Ile
355 360 365
Tyr Cys Pro Ala Cys His Asn Ser Glu Val Gly Pro Glu His Ser Leu
370 375 380
Ala Glu Tyr His Asn Glu Ser Gly Leu Lys Thr Ile Leu Arg Lys Gly
385 390 395 400
Gly Arg Thr Ile Ala Phe Gly Gly Cys Val Phe Ser Tyr Val Gly Cys
405 410 415
His Asn Lys Cys Ala Tyr Trp Val Pro Arg Ala Ser Ala Asn Ile Gly
420 425 430
Cys Asn His Thr Gly Val Val Gly Glu Gly Ser Glu Gly Leu Asn Asp
435 440 445
Asn Leu Leu Glu Ile Leu Gln Lys Glu Lys Val Asn Ile Asn Ile Val
450 455 460
Gly Asp Phe Lys Leu Asn Glu Glu Ile Ala Ile Ile Leu Ala Ser Phe
465 470 475 480
Ser Ala Ser Thr Ser Ala Phe Val Glu Thr Val Lys Gly Leu Asp Tyr
485 490 495
Lys Ala Phe Lys Gln Ile Val Glu Ser Cys Gly Asn Phe Lys Val Thr
500 505 510
Lys Gly Lys Ala Lys Lys Gly Ala Trp Asn Ile Gly Glu Gln Lys Ser
515 520 525
Ile Leu Ser Pro Leu Tyr Ala Phe Ala Ser Glu Ala Ala Arg Val Val
530 535 540
Arg Ser Ile Phe Ser Arg Thr Leu Glu Thr Ala Gln Asn Ser Val Arg
545 550 555 560
Val Leu Gln Lys Ala Ala Ile Thr Ile Leu Asp Gly Ile Ser Gln Tyr
565 570 575
Ser Leu Arg Leu Ile Asp Ala Met Met Phe Thr Ser Asp Leu Ala Thr
580 585 590
Asn Asn Leu Val Val Met Ala Tyr Ile Thr Gly Gly Val Val Gln Leu
595 600 605
Thr Ser Gln Trp Leu Thr Asn Ile Phe Gly Thr Val Tyr Glu Lys Leu
610 615 620
Lys Pro Val Leu Asp Trp Leu Glu Glu Lys Phe Lys Glu Gly Val Glu
625 630 635 640
Phe Leu Arg Asp Gly Trp Glu Ile Val Lys Phe Ile Ser Thr Cys Ala
645 650 655
Cys Glu Ile Val Gly Gly Gln Ile Val Thr Cys Ala Lys Glu Ile Lys
660 665 670
Glu Ser Val Gln Thr Phe Phe Lys Leu Val Asn Lys Phe Leu Ala Leu
675 680 685
Cys Ala Asp Ser Ile Ile Ile Gly Gly Ala Lys Leu Lys Ala Leu Asn
690 695 700
Leu Gly Glu Thr Phe Val Thr His Ser Lys Gly Leu Tyr Arg Lys Cys
705 710 715 720
Val Lys Ser Arg Glu Glu Thr Gly Leu Leu Met Pro Leu Lys Ala Pro
725 730 735
Lys Glu Ile Ile Phe Leu Glu Gly Glu Thr Leu Pro Thr Glu Val Leu
740 745 750
Thr Glu Glu Val Val Leu Lys Thr Gly Asp Leu Gln Pro Leu Glu Gln
755 760 765
Pro Thr Ser Glu Ala Val Glu Ala Pro Leu Val Gly Thr Pro Val Cys
770 775 780
Ile Asn Gly Leu Met Leu Leu Glu Ile Lys Asp Thr Glu Lys Tyr Cys
785 790 795 800
Ala Leu Ala Pro Asn Met Met Val Thr Asn Asn Thr Phe Thr Leu Lys
805 810 815
Gly Gly Ala Pro Thr Lys Val Thr Phe Gly Asp Asp Thr Val Ile Glu
820 825 830
Val Gln Gly Tyr Lys Ser Val Asn Ile Thr Phe Glu Leu Asp Glu Arg
835 840 845
Ile Asp Lys Val Leu Asn Glu Lys Cys Ser Ala Tyr Thr Val Glu Leu
850 855 860
Gly Thr Glu Val Asn Glu Phe Ala Cys Val Val Ala Asp Ala Val Ile
865 870 875 880
Lys Thr Leu Gln Pro Val Ser Glu Leu Leu Thr Pro Leu Gly Ile Asp
885 890 895
Leu Asp Glu Trp Ser Met Ala Thr Tyr Tyr Leu Phe Asp Glu Ser Gly
900 905 910
Glu Phe Lys Leu Ala Ser His Met Tyr Cys Ser Phe Tyr Pro Pro Asp
915 920 925
Glu Asp Glu Glu Glu Gly Asp Cys Glu Glu Glu Glu Phe Glu Pro Ser
930 935 940
Thr Gln Tyr Glu Tyr Gly Thr Glu Asp Asp Tyr Gln Gly Lys Pro Leu
945 950 955 960
Glu Phe Gly Ala Thr Ser Ala Ala Leu Gln Pro Glu Glu Glu Gln Glu
965 970 975
Glu Asp Trp Leu Asp Asp Asp Ser Gln Gln Thr Val Gly Gln Gln Asp
980 985 990
Gly Ser Glu Asp Asn Gln Thr Thr Thr Ile Gln Thr Ile Val Glu Val
995 1000 1005
Gln Pro Gln Leu Glu Met Glu Leu Thr Pro Val Val Gln Thr Ile Glu
1010 1015 1020
Val Asn Ser Phe Ser Gly Tyr Leu Lys Leu Thr Asp Asn Val Tyr Ile
1025 1030 1035 1040
Lys Asn Ala Asp Ile Val Glu Glu Ala Lys Lys Val Lys Pro Thr Val
1045 1050 1055
Val Val Asn Ala Ala Asn Val Tyr Leu Lys His Gly Gly Gly Val Ala
1060 1065 1070
Gly Ala Leu Asn Lys Ala Thr Asn Asn Ala Met Gln Val Glu Ser Asp
1075 1080 1085
Asp Tyr Ile Ala Thr Asn Gly Pro Leu Lys Val Gly Gly Ser Cys Val
1090 1095 1100
Leu Ser Gly His Asn Leu Ala Lys His Cys Leu His Val Val Gly Pro
1105 1110 1115 1120
Asn Val Asn Lys Gly Glu Asp Ile Gln Leu Leu Lys Ser Ala Tyr Glu
1125 1130 1135
Asn Phe Asn Gln His Glu Val Leu Leu Ala Pro Leu Leu Ser Ala Gly
1140 1145 1150
Ile Phe Gly Ala Asp Pro Ile His Ser Leu Arg Val Cys Val Asp Thr
1155 1160 1165
Val Arg Thr Asn Val Tyr Leu Ala Val Phe Asp Lys Asn Leu Tyr Asp
1170 1175 1180
Lys Leu Val Ser Ser Phe Leu Glu Met Lys Ser Glu Lys Gln Val Glu
1185 1190 1195 1200
Gln Lys Ile Ala Glu Ile Pro Lys Glu Glu Val Lys Pro Phe Ile Thr
1205 1210 1215
Glu Ser Lys Pro Ser Val Glu Gln Arg Lys Gln Asp Asp Lys Lys Ile
1220 1225 1230
Lys Ala Cys Val Glu Glu Val Thr Thr Thr Leu Glu Glu Thr Lys Phe
1235 1240 1245
Leu Thr Glu Asn Leu Leu Leu Tyr Ile Asp Ile Asn Gly Asn Leu His
1250 1255 1260
Pro Asp Ser Ala Thr Leu Val Ser Asp Ile Asp Ile Thr Phe Leu Lys
1265 1270 1275 1280
Lys Asp Ala Pro Tyr Ile Val Gly Asp Val Val Gln Glu Gly Val Leu
1285 1290 1295
Thr Ala Val Val Ile Pro Thr Lys Lys Ala Gly Gly Thr Thr Glu Met
1300 1305 1310
Leu Ala Lys Ala Leu Arg Lys Val Pro Thr Asp Asn Tyr Ile Thr Thr
1315 1320 1325
Tyr Pro Gly Gln Gly Leu Asn Gly Tyr Thr Val Glu Glu Ala Lys Thr
1330 1335 1340
Val Leu Lys Lys Cys Lys Ser Ala Phe Tyr Ile Leu Pro Ser Ile Ile
1345 1350 1355 1360
Ser Asn Glu Lys Gln Glu Ile Leu Gly Thr Val Ser Trp Asn Leu Arg
1365 1370 1375
Glu Met Leu Ala His Ala Glu Glu Thr Arg Lys Leu Met Pro Val Cys
1380 1385 1390
Val Glu Thr Lys Ala Ile Val Ser Thr Ile Gln Arg Lys Tyr Lys Gly
1395 1400 1405
Ile Lys Ile Gln Glu Gly Val Val Asp Tyr Gly Ala Arg Phe Tyr Phe
1410 1415 1420
Tyr Thr Ser Lys Thr Thr Val Ala Ser Leu Ile Asn Thr Leu Asn Asp
1425 1430 1435 1440
Leu Asn Glu Thr Leu Val Thr Met Pro Leu Gly Tyr Val Thr His Gly
1445 1450 1455
Leu Asn Leu Glu Glu Ala Ala Arg Tyr Met Arg Ser Leu Lys Val Pro
1460 1465 1470
Ala Thr Val Ser Val Ser Ser Pro Asp Ala Val Thr Ala Tyr Asn Gly
1475 1480 1485
Tyr Leu Thr Ser Ser Ser Lys Thr Pro Glu Glu His Phe Ile Glu Thr
1490 1495 1500
Ile Ser Leu Ala Gly Ser Tyr Lys Asp Trp Ser Tyr Ser Gly Gln Ser
1505 1510 1515 1520
Thr Gln Leu Gly Ile Glu Phe Leu Lys Arg Gly Asp Lys Ser Val Tyr
1525 1530 1535
Tyr Thr Ser Asn Pro Thr Thr Phe His Leu Asp Gly Glu Val Ile Thr
1540 1545 1550
Phe Asp Asn Leu Lys Thr Leu Leu Ser Leu Arg Glu Val Arg Thr Ile
1555 1560 1565
Lys Val Phe Thr Thr Val Asp Asn Ile Asn Leu His Thr Gln Val Val
1570 1575 1580
Asp Met Ser Met Thr Tyr Gly Gln Gln Phe Gly Pro Thr Tyr Leu Asp
1585 1590 1595 1600
Gly Ala Asp Val Thr Lys Ile Lys Pro His Asn Ser His Glu Gly Lys
1605 1610 1615
Thr Phe Tyr Val Leu Pro Asn Asp Asp Thr Leu Arg Val Glu Ala Phe
1620 1625 1630
Glu Tyr Tyr His Thr Thr Asp Pro Ser Phe Leu Gly Arg Tyr Met Ser
1635 1640 1645
Ala Leu Asn His Thr Lys Lys Trp Lys Tyr Pro Gln Val Asn Gly Leu
1650 1655 1660
Thr Ser Ile Lys Trp Ala Asp Asn Asn Cys Tyr Leu Ala Thr Ala Leu
1665 1670 1675 1680
Leu Thr Leu Gln Gln Ile Glu Leu Lys Phe Asn Pro Pro Ala Leu Gln
1685 1690 1695
Asp Ala Tyr Tyr Arg Ala Arg Ala Gly Glu Ala Ala Asn Phe Cys Ala
1700 1705 1710
Leu Ile Leu Ala Tyr Cys Asn Lys Thr Val Gly Glu Leu Gly Asp Val
1715 1720 1725
Arg Glu Thr Met Ser Tyr Leu Phe Gln His Ala Asn Leu Asp Ser Cys
1730 1735 1740
Lys Arg Val Leu Asn Val Val Cys Lys Thr Cys Gly Gln Gln Gln Thr
1745 1750 1755 1760
Thr Leu Lys Gly Val Glu Ala Val Met Tyr Met Gly Thr Leu Ser Tyr
1765 1770 1775
Glu Gln Phe Lys Lys Gly Val Gln Ile Pro Cys Thr Cys Gly Lys Gln
1780 1785 1790
Ala Thr Lys Tyr Leu Val Gln Gln Glu Ser Pro Phe Val Met Met Ser
1795 1800 1805
Ala Pro Pro Ala Gln Tyr Glu Leu Lys His Gly Thr Phe Thr Cys Ala
1810 1815 1820
Ser Glu Tyr Thr Gly Asn Tyr Gln Cys Gly His Tyr Lys His Ile Thr
1825 1830 1835 1840
Ser Lys Glu Thr Leu Tyr Cys Ile Asp Gly Ala Leu Leu Thr Lys Ser
1845 1850 1855
Ser Glu Tyr Lys Gly Pro Ile Thr Asp Val Phe Tyr Lys Glu Asn Ser
1860 1865 1870
Tyr Thr Thr Thr Ile Lys Pro Val Thr Tyr Lys Leu Asp Gly Val Val
1875 1880 1885
Cys Thr Glu Ile Asp Pro Lys Leu Asp Asn Tyr Tyr Lys Lys Asp Asn
1890 1895 1900
Ser Tyr Phe Thr Glu Gln Pro Ile Asp Leu Val Pro Asn Gln Pro Tyr
1905 1910 1915 1920
Pro Asn Ala Ser Phe Asp Asn Phe Lys Phe Val Cys Asp Asn Ile Lys
1925 1930 1935
Phe Ala Asp Asp Leu Asn Gln Leu Thr Gly Tyr Lys Lys Pro Ala Ser
1940 1945 1950
Arg Glu Leu Lys Val Thr Phe Phe Pro Asp Leu Asn Gly Asp Val Val
1955 1960 1965
Ala Ile Asp Tyr Lys His Tyr Thr Pro Ser Phe Lys Lys Gly Ala Lys
1970 1975 1980
Leu Leu His Lys Pro Ile Val Trp His Val Asn Asn Ala Thr Asn Lys
1985 1990 1995 2000
Ala Thr Tyr Lys Pro Asn Thr Trp Cys Ile Arg Cys Leu Trp Ser Thr
2005 2010 2015
Lys Pro Val Glu Thr Ser Asn Ser Phe Asp Val Leu Lys Ser Glu Asp
2020 2025 2030
Ala Gln Gly Met Asp Asn Leu Ala Cys Glu Asp Leu Lys Pro Val Ser
2035 2040 2045
Glu Glu Val Val Glu Asn Pro Thr Ile Gln Lys Asp Val Leu Glu Cys
2050 2055 2060
Asn Val Lys Thr Thr Glu Val Val Gly Asp Ile Ile Leu Lys Pro Ala
2065 2070 2075 2080
Asn Asn Ser Leu Lys Ile Thr Glu Glu Val Gly His Thr Asp Leu Met
2085 2090 2095
Ala Ala Tyr Val Asp Asn Ser Ser Leu Thr Ile Lys Lys Pro Asn Glu
2100 2105 2110
Leu Ser Arg Val Leu Gly Leu Lys Thr Leu Ala Thr His Gly Leu Ala
2115 2120 2125
Ala Val Asn Ser Val Pro Trp Asp Thr Ile Ala Asn Tyr Ala Lys Pro
2130 2135 2140
Phe Leu Asn Lys Val Val Ser Thr Thr Thr Asn Ile Val Thr Arg Cys
2145 2150 2155 2160
Leu Asn Arg Val Cys Thr Asn Tyr Met Pro Tyr Phe Phe Thr Leu Leu
2165 2170 2175
Leu Gln Leu Cys Thr Phe Thr Arg Ser Thr Asn Ser Arg Ile Lys Ala
2180 2185 2190
Ser Met Pro Thr Thr Ile Ala Lys Asn Thr Val Lys Ser Val Gly Lys
2195 2200 2205
Phe Cys Leu Glu Ala Ser Phe Asn Tyr Leu Lys Ser Pro Asn Phe Ser
2210 2215 2220
Lys Leu Ile Asn Ile Ile Ile Trp Phe Leu Leu Leu Ser Val Cys Leu
2225 2230 2235 2240
Gly Ser Leu Ile Tyr Ser Thr Ala Ala Leu Gly Val Leu Met Ser Asn
2245 2250 2255
Leu Gly Met Pro Ser Tyr Cys Thr Gly Tyr Arg Glu Gly Tyr Leu Asn
2260 2265 2270
Ser Thr Asn Val Thr Ile Ala Thr Tyr Cys Thr Gly Ser Ile Pro Cys
2275 2280 2285
Ser Val Cys Leu Ser Gly Leu Asp Ser Leu Asp Thr Tyr Pro Ser Leu
2290 2295 2300
Glu Thr Ile Gln Ile Thr Ile Ser Ser Phe Lys Trp Asp Leu Thr Ala
2305 2310 2315 2320
Phe Gly Leu Val Ala Glu Trp Phe Leu Ala Tyr Ile Leu Phe Thr Arg
2325 2330 2335
Phe Phe Tyr Val Leu Gly Leu Ala Ala Ile Met Gln Leu Phe Phe Ser
2340 2345 2350
Tyr Phe Ala Val His Phe Ile Ser Asn Ser Trp Leu Met Trp Leu Ile
2355 2360 2365
Ile Asn Leu Val Gln Met Ala Pro Ile Ser Ala Met Val Arg Met Tyr
2370 2375 2380
Ile Phe Phe Ala Ser Phe Tyr Tyr Val Trp Lys Ser Tyr Val His Val
2385 2390 2395 2400
Val Asp Gly Cys Asn Ser Ser Thr Cys Met Met Cys Tyr Lys Arg Asn
2405 2410 2415
Arg Ala Thr Arg Val Glu Cys Thr Thr Ile Val Asn Gly Val Arg Arg
2420 2425 2430
Ser Phe Tyr Val Tyr Ala Asn Gly Gly Lys Gly Phe Cys Lys Leu His
2435 2440 2445
Asn Trp Asn Cys Val Asn Cys Asp Thr Phe Cys Ala Gly Ser Thr Phe
2450 2455 2460
Ile Ser Asp Glu Val Ala Arg Asp Leu Ser Leu Gln Phe Lys Arg Pro
2465 2470 2475 2480
Ile Asn Pro Thr Asp Gln Ser Ser Tyr Ile Val Asp Ser Val Thr Val
2485 2490 2495
Lys Asn Gly Ser Ile His Leu Tyr Phe Asp Lys Ala Gly Gln Lys Thr
2500 2505 2510
Tyr Glu Arg His Ser Leu Ser His Phe Val Asn Leu Asp Asn Leu Arg
2515 2520 2525
Ala Asn Asn Thr Lys Gly Ser Leu Pro Ile Asn Val Ile Val Phe Asp
2530 2535 2540
Gly Lys Ser Lys Cys Glu Glu Ser Ser Ala Lys Ser Ala Ser Val Tyr
2545 2550 2555 2560
Tyr Ser Gln Leu Met Cys Gln Pro Ile Leu Leu Leu Asp Gln Ala Leu
2565 2570 2575
Val Ser Asp Val Gly Asp Ser Ala Glu Val Ala Val Lys Met Phe Asp
2580 2585 2590
Ala Tyr Val Asn Thr Phe Ser Ser Thr Phe Asn Val Pro Met Glu Lys
2595 2600 2605
Leu Lys Thr Leu Val Ala Thr Ala Glu Ala Glu Leu Ala Lys Asn Val
2610 2615 2620
Ser Leu Asp Asn Val Leu Ser Thr Phe Ile Ser Ala Ala Arg Gln Gly
2625 2630 2635 2640
Phe Val Asp Ser Asp Val Glu Thr Lys Asp Val Val Glu Cys Leu Lys
2645 2650 2655
Leu Ser His Gln Ser Asp Ile Glu Val Thr Gly Asp Ser Cys Asn Asn
2660 2665 2670
Tyr Met Leu Thr Tyr Asn Lys Val Glu Asn Met Thr Pro Arg Asp Leu
2675 2680 2685
Gly Ala Cys Ile Asp Cys Ser Ala Arg His Ile Asn Ala Gln Val Ala
2690 2695 2700
Lys Ser His Asn Ile Ala Leu Ile Trp Asn Val Lys Asp Phe Met Ser
2705 2710 2715 2720
Leu Ser Glu Gln Leu Arg Lys Gln Ile Arg Ser Ala Ala Lys Lys Asn
2725 2730 2735
Asn Leu Pro Phe Lys Leu Thr Cys Ala Thr Thr Arg Gln Val Val Asn
2740 2745 2750
Val Val Thr Thr Lys Ile Ala Leu Lys Gly Gly Lys Ile Val Asn Asn
2755 2760 2765
Trp Leu Lys Gln Leu Ile Lys Val Thr Leu Val Phe Leu Phe Val Ala
2770 2775 2780
Ala Ile Phe Tyr Leu Ile Thr Pro Val His Val Met Ser Lys His Thr
2785 2790 2795 2800
Asp Phe Ser Ser Glu Ile Ile Gly Tyr Lys Ala Ile Asp Gly Gly Val
2805 2810 2815
Thr Arg Asp Ile Ala Ser Thr Asp Thr Cys Phe Ala Asn Lys His Ala
2820 2825 2830
Asp Phe Asp Thr Trp Phe Ser Gln Arg Gly Gly Ser Tyr Thr Asn Asp
2835 2840 2845
Lys Ala Cys Pro Leu Ile Ala Ala Val Ile Thr Arg Glu Val Gly Phe
2850 2855 2860
Val Val Pro Gly Leu Pro Gly Thr Ile Leu Arg Thr Thr Asn Gly Asp
2865 2870 2875 2880
Phe Leu His Phe Leu Pro Arg Val Phe Ser Ala Val Gly Asn Ile Cys
2885 2890 2895
Tyr Thr Pro Ser Lys Leu Ile Glu Tyr Thr Asp Phe Ala Thr Ser Ala
2900 2905 2910
Cys Val Leu Ala Ala Glu Cys Thr Ile Phe Lys Asp Ala Ser Gly Lys
2915 2920 2925
Pro Val Pro Tyr Cys Tyr Asp Thr Asn Val Leu Glu Gly Ser Val Ala
2930 2935 2940
Tyr Glu Ser Leu Arg Pro Asp Thr Arg Tyr Val Leu Met Asp Gly Ser
2945 2950 2955 2960
Ile Ile Gln Phe Pro Asn Thr Tyr Leu Glu Gly Ser Val Arg Val Val
2965 2970 2975
Thr Thr Phe Asp Ser Glu Tyr Cys Arg His Gly Thr Cys Glu Arg Ser
2980 2985 2990
Glu Ala Gly Val Cys Val Ser Thr Ser Gly Arg Trp Val Leu Asn Asn
2995 3000 3005
Asp Tyr Tyr Arg Ser Leu Pro Gly Val Phe Cys Gly Val Asp Ala Val
3010 3015 3020
Asn Leu Leu Thr Asn Met Phe Thr Pro Leu Ile Gln Pro Ile Gly Ala
3025 3030 3035 3040
Leu Asp Ile Ser Ala Ser Ile Val Ala Gly Gly Ile Val Ala Ile Val
3045 3050 3055
Val Thr Cys Leu Ala Tyr Tyr Phe Met Arg Phe Arg Arg Ala Phe Gly
3060 3065 3070
Glu Tyr Ser His Val Val Ala Phe Asn Thr Leu Leu Phe Leu Met Ser
3075 3080 3085
Phe Thr Val Leu Cys Leu Thr Pro Val Tyr Ser Phe Leu Pro Gly Val
3090 3095 3100
Tyr Ser Val Ile Tyr Leu Tyr Leu Thr Phe Tyr Leu Thr Asn Asp Val
3105 3110 3115 3120
Ser Phe Leu Ala His Ile Gln Trp Met Val Met Phe Thr Pro Leu Val
3125 3130 3135
Pro Phe Trp Ile Thr Ile Ala Tyr Ile Ile Cys Ile Ser Thr Lys His
3140 3145 3150
Phe Tyr Trp Phe Phe Ser Asn Tyr Leu Lys Arg Arg Val Val Phe Asn
3155 3160 3165
Gly Val Ser Phe Ser Thr Phe Glu Glu Ala Ala Leu Cys Thr Phe Leu
3170 3175 3180
Leu Asn Lys Glu Met Tyr Leu Lys Leu Arg Ser Asp Val Leu Leu Pro
3185 3190 3195 3200
Leu Thr Gln Tyr Asn Arg Tyr Leu Ala Leu Tyr Asn Lys Tyr Lys Tyr
3205 3210 3215
Phe Ser Gly Ala Met Asp Thr Thr Ser Tyr Arg Glu Ala Ala Cys Cys
3220 3225 3230
His Leu Ala Lys Ala Leu Asn Asp Phe Ser Asn Ser Gly Ser Asp Val
3235 3240 3245
Leu Tyr Gln Pro Pro Gln Thr Ser Ile Thr Ser Ala Val Leu Gln Ser
3250 3255 3260
Gly Phe Arg Lys Met Ala Phe Pro Ser Gly Lys Val Glu Gly Cys Met
3265 3270 3275 3280
Val Gln Val Thr Cys Gly Thr Thr Thr Leu Asn Gly Leu Trp Leu Asp
3285 3290 3295
Asp Val Val Tyr Cys Pro Arg His Val Ile Cys Thr Ser Glu Asp Met
3300 3305 3310
Leu Asn Pro Asn Tyr Glu Asp Leu Leu Ile Arg Lys Ser Asn His Asn
3315 3320 3325
Phe Leu Val Gln Ala Gly Asn Val Gln Leu Arg Val Ile Gly His Ser
3330 3335 3340
Met Gln Asn Cys Val Leu Lys Leu Lys Val Asp Thr Ala Asn Pro Lys
3345 3350 3355 3360
Thr Pro Lys Tyr Lys Phe Val Arg Ile Gln Pro Gly Gln Thr Phe Ser
3365 3370 3375
Val Leu Ala Cys Tyr Asn Gly Ser Pro Ser Gly Val Tyr Gln Cys Ala
3380 3385 3390
Met Arg Pro Asn Phe Thr Ile Lys Gly Ser Phe Leu Asn Gly Ser Cys
3395 3400 3405
Gly Ser Val Gly Phe Asn Ile Asp Tyr Asp Cys Val Ser Phe Cys Tyr
3410 3415 3420
Met His His Met Glu Leu Pro Thr Gly Val His Ala Gly Thr Asp Leu
3425 3430 3435 3440
Glu Gly Asn Phe Tyr Gly Pro Phe Val Asp Arg Gln Thr Ala Gln Ala
3445 3450 3455
Ala Gly Thr Asp Thr Thr Ile Thr Val Asn Val Leu Ala Trp Leu Tyr
3460 3465 3470
Ala Ala Val Ile Asn Gly Asp Arg Trp Phe Leu Asn Arg Phe Thr Thr
3475 3480 3485
Thr Leu Asn Asp Phe Asn Leu Val Ala Met Lys Tyr Asn Tyr Glu Pro
3490 3495 3500
Leu Thr Gln Asp His Val Asp Ile Leu Gly Pro Leu Ser Ala Gln Thr
3505 3510 3515 3520
Gly Ile Ala Val Leu Asp Met Cys Ala Ser Leu Lys Glu Leu Leu Gln
3525 3530 3535
Asn Gly Met Asn Gly Arg Thr Ile Leu Gly Ser Ala Leu Leu Glu Asp
3540 3545 3550
Glu Phe Thr Pro Phe Asp Val Val Arg Gln Cys Ser Gly Val Thr Phe
3555 3560 3565
Gln Ser Ala Val Lys Arg Thr Ile Lys Gly Thr His His Trp Leu Leu
3570 3575 3580
Leu Thr Ile Leu Thr Ser Leu Leu Val Leu Val Gln Ser Thr Gln Trp
3585 3590 3595 3600
Ser Leu Phe Phe Phe Leu Tyr Glu Asn Ala Phe Leu Pro Phe Ala Met
3605 3610 3615
Gly Ile Ile Ala Met Ser Ala Phe Ala Met Met Phe Val Lys His Lys
3620 3625 3630
His Ala Phe Leu Cys Leu Phe Leu Leu Pro Ser Leu Ala Thr Val Ala
3635 3640 3645
Tyr Phe Asn Met Val Tyr Met Pro Ala Ser Trp Val Met Arg Ile Met
3650 3655 3660
Thr Trp Leu Asp Met Val Asp Thr Ser Leu Ser Gly Phe Lys Leu Lys
3665 3670 3675 3680
Asp Cys Val Met Tyr Ala Ser Ala Val Val Leu Leu Ile Leu Met Thr
3685 3690 3695
Ala Arg Thr Val Tyr Asp Asp Gly Ala Arg Arg Val Trp Thr Leu Met
3700 3705 3710
Asn Val Leu Thr Leu Val Tyr Lys Val Tyr Tyr Gly Asn Ala Leu Asp
3715 3720 3725
Gln Ala Ile Ser Met Trp Ala Leu Ile Ile Ser Val Thr Ser Asn Tyr
3730 3735 3740
Ser Gly Val Val Thr Thr Val Met Phe Leu Ala Arg Gly Ile Val Phe
3745 3750 3755 3760
Met Cys Val Glu Tyr Cys Pro Ile Phe Phe Ile Thr Gly Asn Thr Leu
3765 3770 3775
Gln Cys Ile Met Leu Val Tyr Cys Phe Leu Gly Tyr Phe Cys Thr Cys
3780 3785 3790
Tyr Phe Gly Leu Phe Cys Leu Leu Asn Arg Tyr Phe Arg Leu Thr Leu
3795 3800 3805
Gly Val Tyr Asp Tyr Leu Val Ser Thr Gln Glu Phe Arg Tyr Met Asn
3810 3815 3820
Ser Gln Gly Leu Leu Pro Pro Lys Asn Ser Ile Asp Ala Phe Lys Leu
3825 3830 3835 3840
Asn Ile Lys Leu Leu Gly Val Gly Gly Lys Pro Cys Ile Lys Val Ala
3845 3850 3855
Thr Val Gln Ser Lys Met Ser Asp Val Lys Cys Thr Ser Val Val Leu
3860 3865 3870
Leu Ser Val Leu Gln Gln Leu Arg Val Glu Ser Ser Ser Lys Leu Trp
3875 3880 3885
Ala Gln Cys Val Gln Leu His Asn Asp Ile Leu Leu Ala Lys Asp Thr
3890 3895 3900
Thr Glu Ala Phe Glu Lys Met Val Ser Leu Leu Ser Val Leu Leu Ser
3905 3910 3915 3920
Met Gln Gly Ala Val Asp Ile Asn Lys Leu Cys Glu Glu Met Leu Asp
3925 3930 3935
Asn Arg Ala Thr Leu Gln Ala Ile Ala Ser Glu Phe Ser Ser Leu Pro
3940 3945 3950
Ser Tyr Ala Ala Phe Ala Thr Ala Gln Glu Ala Tyr Glu Gln Ala Val
3955 3960 3965
Ala Asn Gly Asp Ser Glu Val Val Leu Lys Lys Leu Lys Lys Ser Leu
3970 3975 3980
Asn Val Ala Lys Ser Glu Phe Asp Arg Asp Ala Ala Met Gln Arg Lys
3985 3990 3995 4000
Leu Glu Lys Met Ala Asp Gln Ala Met Thr Gln Met Tyr Lys Gln Ala
4005 4010 4015
Arg Ser Glu Asp Lys Arg Ala Lys Val Thr Ser Ala Met Gln Thr Met
4020 4025 4030
Leu Phe Thr Met Leu Arg Lys Leu Asp Asn Asp Ala Leu Asn Asn Ile
4035 4040 4045
Ile Asn Asn Ala Arg Asp Gly Cys Val Pro Leu Asn Ile Ile Pro Leu
4050 4055 4060
Thr Thr Ala Ala Lys Leu Met Val Val Ile Pro Asp Tyr Asn Thr Tyr
4065 4070 4075 4080
Lys Asn Thr Cys Asp Gly Thr Thr Phe Thr Tyr Ala Ser Ala Leu Trp
4085 4090 4095
Glu Ile Gln Gln Val Val Asp Ala Asp Ser Lys Ile Val Gln Leu Ser
4100 4105 4110
Glu Ile Ser Met Asp Asn Ser Pro Asn Leu Ala Trp Pro Leu Ile Val
4115 4120 4125
Thr Ala Leu Arg Ala Asn Ser Ala Val Lys Leu Gln Asn Asn Glu Leu
4130 4135 4140
Ser Pro Val Ala Leu Arg Gln Met Ser Cys Ala Ala Gly Thr Thr Gln
4145 4150 4155 4160
Thr Ala Cys Thr Asp Asp Asn Ala Leu Ala Tyr Tyr Asn Thr Thr Lys
4165 4170 4175
Gly Gly Arg Phe Val Leu Ala Leu Leu Ser Asp Leu Gln Asp Leu Lys
4180 4185 4190
Trp Ala Arg Phe Pro Lys Ser Asp Gly Thr Gly Thr Ile Tyr Thr Glu
4195 4200 4205
Leu Glu Pro Pro Cys Arg Phe Val Thr Asp Thr Pro Lys Gly Pro Lys
4210 4215 4220
Val Lys Tyr Leu Tyr Phe Ile Lys Gly Leu Asn Asn Leu Asn Arg Gly
4225 4230 4235 4240
Met Val Leu Gly Ser Leu Ala Ala Thr Val Arg Leu Gln Ala Gly Asn
4245 4250 4255
Ala Thr Glu Val Pro Ala Asn Ser Thr Val Leu Ser Phe Cys Ala Phe
4260 4265 4270
Ala Val Asp Ala Ala Lys Ala Tyr Lys Asp Tyr Leu Ala Ser Gly Gly
4275 4280 4285
Gln Pro Ile Thr Asn Cys Val Lys Met Leu Cys Thr His Thr Gly Thr
4290 4295 4300
Gly Gln Ala Ile Thr Val Thr Pro Glu Ala Asn Met Asp Gln Glu Ser
4305 4310 4315 4320
Phe Gly Gly Ala Ser Cys Cys Leu Tyr Cys Arg Cys His Ile Asp His
4325 4330 4335
Pro Asn Pro Lys Gly Phe Cys Asp Leu Lys Gly Lys Tyr Val Gln Ile
4340 4345 4350
Pro Thr Thr Cys Ala Asn Asp Pro Val Gly Phe Thr Leu Lys Asn Thr
4355 4360 4365
Val Cys Thr Val Cys Gly Met Trp Lys Gly Tyr Gly Cys Ser Cys Asp
4370 4375 4380
Gln Leu Arg Glu Pro Met Leu Gln Ser Ala Asp Ala Gln Ser Phe Leu
4385 4390 4395 4400
Asn Gly Phe Ala Val
4405
<210> 14
<211> 1273
<212> PRT
<213> SARS-CoV2
<400> 14
Met Phe Val Phe Leu Val Leu Leu Pro Leu Val Ser Ser Gln Cys Val
1 5 10 15
Asn Leu Thr Thr Arg Thr Gln Leu Pro Pro Ala Tyr Thr Asn Ser Phe
20 25 30
Thr Arg Gly Val Tyr Tyr Pro Asp Lys Val Phe Arg Ser Ser Val Leu
35 40 45
His Ser Thr Gln Asp Leu Phe Leu Pro Phe Phe Ser Asn Val Thr Trp
50 55 60
Phe His Ala Ile His Val Ser Gly Thr Asn Gly Thr Lys Arg Phe Asp
65 70 75 80
Asn Pro Val Leu Pro Phe Asn Asp Gly Val Tyr Phe Ala Ser Thr Glu
85 90 95
Lys Ser Asn Ile Ile Arg Gly Trp Ile Phe Gly Thr Thr Leu Asp Ser
100 105 110
Lys Thr Gln Ser Leu Leu Ile Val Asn Asn Ala Thr Asn Val Val Ile
115 120 125
Lys Val Cys Glu Phe Gln Phe Cys Asn Asp Pro Phe Leu Gly Val Tyr
130 135 140
Tyr His Lys Asn Asn Lys Ser Trp Met Glu Ser Glu Phe Arg Val Tyr
145 150 155 160
Ser Ser Ala Asn Asn Cys Thr Phe Glu Tyr Val Ser Gln Pro Phe Leu
165 170 175
Met Asp Leu Glu Gly Lys Gln Gly Asn Phe Lys Asn Leu Arg Glu Phe
180 185 190
Val Phe Lys Asn Ile Asp Gly Tyr Phe Lys Ile Tyr Ser Lys His Thr
195 200 205
Pro Ile Asn Leu Val Arg Asp Leu Pro Gln Gly Phe Ser Ala Leu Glu
210 215 220
Pro Leu Val Asp Leu Pro Ile Gly Ile Asn Ile Thr Arg Phe Gln Thr
225 230 235 240
Leu Leu Ala Leu His Arg Ser Tyr Leu Thr Pro Gly Asp Ser Ser Ser
245 250 255
Gly Trp Thr Ala Gly Ala Ala Ala Tyr Tyr Val Gly Tyr Leu Gln Pro
260 265 270
Arg Thr Phe Leu Leu Lys Tyr Asn Glu Asn Gly Thr Ile Thr Asp Ala
275 280 285
Val Asp Cys Ala Leu Asp Pro Leu Ser Glu Thr Lys Cys Thr Leu Lys
290 295 300
Ser Phe Thr Val Glu Lys Gly Ile Tyr Gln Thr Ser Asn Phe Arg Val
305 310 315 320
Gln Pro Thr Glu Ser Ile Val Arg Phe Pro Asn Ile Thr Asn Leu Cys
325 330 335
Pro Phe Gly Glu Val Phe Asn Ala Thr Arg Phe Ala Ser Val Tyr Ala
340 345 350
Trp Asn Arg Lys Arg Ile Ser Asn Cys Val Ala Asp Tyr Ser Phe Leu
355 360 365
Tyr Asn Ser Ala Ser Phe Ser Thr Phe Lys Cys Tyr Gly Val Ser Pro
370 375 380
Thr Lys Leu Asn Asp Leu Cys Phe Thr Asn Val Tyr Ala Asp Ser Phe
385 390 395 400
Val Ile Arg Gly Asp Glu Val Arg Gln Ile Ala Pro Gly Gln Thr Gly
405 410 415
Lys Ile Ala Asp Tyr Asn Tyr Lys Leu Pro Asp Asp Phe Thr Gly Cys
420 425 430
Val Ile Ala Trp Asn Ser Asn Asn Leu Asp Ser Lys Val Gly Gly Asn
435 440 445
Tyr Asn Tyr Leu Tyr Arg Leu Phe Arg Lys Ser Asn Leu Lys Pro Phe
450 455 460
Glu Arg Asp Ile Ser Thr Glu Ile Tyr Gln Ala Gly Ser Thr Pro Cys
465 470 475 480
Asn Gly Val Glu Gly Phe Asn Cys Tyr Phe Pro Leu Gln Ser Tyr Gly
485 490 495
Phe Gln Pro Thr Asn Gly Val Gly Tyr Gln Pro Tyr Arg Val Val Val
500 505 510
Leu Ser Phe Glu Leu Leu His Ala Pro Ala Thr Val Cys Gly Pro Lys
515 520 525
Lys Ser Thr Asn Leu Val Lys Asn Lys Cys Val Asn Phe Asn Phe Asn
530 535 540
Gly Leu Thr Gly Thr Gly Val Leu Thr Glu Ser Asn Lys Lys Phe Leu
545 550 555 560
Pro Phe Gln Gln Phe Gly Arg Asp Ile Ala Asp Thr Thr Asp Ala Val
565 570 575
Arg Asp Pro Gln Thr Leu Glu Ile Leu Asp Ile Thr Pro Cys Ser Phe
580 585 590
Gly Gly Val Ser Val Ile Thr Pro Gly Thr Asn Thr Ser Asn Gln Val
595 600 605
Ala Val Leu Tyr Gln Asp Val Asn Cys Thr Glu Val Pro Val Ala Ile
610 615 620
His Ala Asp Gln Leu Thr Pro Thr Trp Arg Val Tyr Ser Thr Gly Ser
625 630 635 640
Asn Val Phe Gln Thr Arg Ala Gly Cys Leu Ile Gly Ala Glu His Val
645 650 655
Asn Asn Ser Tyr Glu Cys Asp Ile Pro Ile Gly Ala Gly Ile Cys Ala
660 665 670
Ser Tyr Gln Thr Gln Thr Asn Ser Pro Arg Arg Ala Arg Ser Val Ala
675 680 685
Ser Gln Ser Ile Ile Ala Tyr Thr Met Ser Leu Gly Ala Glu Asn Ser
690 695 700
Val Ala Tyr Ser Asn Asn Ser Ile Ala Ile Pro Thr Asn Phe Thr Ile
705 710 715 720
Ser Val Thr Thr Glu Ile Leu Pro Val Ser Met Thr Lys Thr Ser Val
725 730 735
Asp Cys Thr Met Tyr Ile Cys Gly Asp Ser Thr Glu Cys Ser Asn Leu
740 745 750
Leu Leu Gln Tyr Gly Ser Phe Cys Thr Gln Leu Asn Arg Ala Leu Thr
755 760 765
Gly Ile Ala Val Glu Gln Asp Lys Asn Thr Gln Glu Val Phe Ala Gln
770 775 780
Val Lys Gln Ile Tyr Lys Thr Pro Pro Ile Lys Asp Phe Gly Gly Phe
785 790 795 800
Asn Phe Ser Gln Ile Leu Pro Asp Pro Ser Lys Pro Ser Lys Arg Ser
805 810 815
Phe Ile Glu Asp Leu Leu Phe Asn Lys Val Thr Leu Ala Asp Ala Gly
820 825 830
Phe Ile Lys Gln Tyr Gly Asp Cys Leu Gly Asp Ile Ala Ala Arg Asp
835 840 845
Leu Ile Cys Ala Gln Lys Phe Asn Gly Leu Thr Val Leu Pro Pro Leu
850 855 860
Leu Thr Asp Glu Met Ile Ala Gln Tyr Thr Ser Ala Leu Leu Ala Gly
865 870 875 880
Thr Ile Thr Ser Gly Trp Thr Phe Gly Ala Gly Ala Ala Leu Gln Ile
885 890 895
Pro Phe Ala Met Gln Met Ala Tyr Arg Phe Asn Gly Ile Gly Val Thr
900 905 910
Gln Asn Val Leu Tyr Glu Asn Gln Lys Leu Ile Ala Asn Gln Phe Asn
915 920 925
Ser Ala Ile Gly Lys Ile Gln Asp Ser Leu Ser Ser Thr Ala Ser Ala
930 935 940
Leu Gly Lys Leu Gln Asp Val Val Asn Gln Asn Ala Gln Ala Leu Asn
945 950 955 960
Thr Leu Val Lys Gln Leu Ser Ser Asn Phe Gly Ala Ile Ser Ser Val
965 970 975
Leu Asn Asp Ile Leu Ser Arg Leu Asp Lys Val Glu Ala Glu Val Gln
980 985 990
Ile Asp Arg Leu Ile Thr Gly Arg Leu Gln Ser Leu Gln Thr Tyr Val
995 1000 1005
Thr Gln Gln Leu Ile Arg Ala Ala Glu Ile Arg Ala Ser Ala Asn Leu
1010 1015 1020
Ala Ala Thr Lys Met Ser Glu Cys Val Leu Gly Gln Ser Lys Arg Val
1025 1030 1035 1040
Asp Phe Cys Gly Lys Gly Tyr His Leu Met Ser Phe Pro Gln Ser Ala
1045 1050 1055
Pro His Gly Val Val Phe Leu His Val Thr Tyr Val Pro Ala Gln Glu
1060 1065 1070
Lys Asn Phe Thr Thr Ala Pro Ala Ile Cys His Asp Gly Lys Ala His
1075 1080 1085
Phe Pro Arg Glu Gly Val Phe Val Ser Asn Gly Thr His Trp Phe Val
1090 1095 1100
Thr Gln Arg Asn Phe Tyr Glu Pro Gln Ile Ile Thr Thr Asp Asn Thr
1105 1110 1115 1120
Phe Val Ser Gly Asn Cys Asp Val Val Ile Gly Ile Val Asn Asn Thr
1125 1130 1135
Val Tyr Asp Pro Leu Gln Pro Glu Leu Asp Ser Phe Lys Glu Glu Leu
1140 1145 1150
Asp Lys Tyr Phe Lys Asn His Thr Ser Pro Asp Val Asp Leu Gly Asp
1155 1160 1165
Ile Ser Gly Ile Asn Ala Ser Val Val Asn Ile Gln Lys Glu Ile Asp
1170 1175 1180
Arg Leu Asn Glu Val Ala Lys Asn Leu Asn Glu Ser Leu Ile Asp Leu
1185 1190 1195 1200
Gln Glu Leu Gly Lys Tyr Glu Gln Tyr Ile Lys Trp Pro Trp Tyr Ile
1205 1210 1215
Trp Leu Gly Phe Ile Ala Gly Leu Ile Ala Ile Val Met Val Thr Ile
1220 1225 1230
Met Leu Cys Cys Met Thr Ser Cys Cys Ser Cys Leu Lys Gly Cys Cys
1235 1240 1245
Ser Cys Gly Ser Cys Cys Lys Phe Asp Glu Asp Asp Ser Glu Pro Val
1250 1255 1260
Leu Lys Gly Val Lys Leu His Tyr Thr
1265 1270
<210> 15
<211> 29885
<212> DNA
<213> SARS-CoV2
<220>
<221> misc_feature
<222> (1)..(30)
<223> n is a, c, g, or t
<220>
<221> misc_feature
<222> (29865)
<223> n is a, c, g, or t
<220>
<221> misc_feature
<222> (29872)..(29885)
<223> n is a, c, g, or t
<400> 15
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn accaaccaac tttcgatctc ttgtagatct 60
gttctctaaa cgaactttaa aatctgtgtg gctgtcactc ggctgcatgc ttagtgcact 120
cacgcagtat aattaataac taattactgt cgttgacagg acacgagtaa ctcgtctatc 180
ttctgcaggc tgcttacggt ttcgtccgtg ttgcagccga tcatcagcac atctaggttt 240
tgtccgggtg tgaccgaaag gtaagatgga gagccttgtc cctggtttca acgagaaaac 300
acacgtccaa ctcagtttgc ctgttttaca ggttcgcgac gtgctcgtac gtggctttgg 360
agactccgtg gaggaggtct tatcagaggc acgtcaacat cttaaagatg gcacttgtgg 420
cttagtagaa gttgaaaaag gcgttttgcc tcaacttgaa cagccctatg tgttcatcaa 480
acgttcggat gctcgaactg cacctcatgg tcatgttatg gttgagctgg tagcagaact 540
cgaaggcatt cagtacggtc gtagtggtga gacacttggt gtccttgtcc ctcatgtggg 600
cgaaatacca gtggcttacc gcaaggttct tcttcgtaag aacggtaata aaggagctgg 660
tggccatagt tacggcgccg atctaaagtc atttgactta ggcgacgagc ttggcactga 720
tccttatgaa gattttcaag aaaactggaa cactaaacat agcagtggtg ttacccgtga 780
actcatgcgt gagcttaacg gaggggcata cactcgctat gtcgataaca acttctgtgg 840
ccctgatggc taccctcttg agtgcattaa agaccttcta gcacgtgctg gtaaagcttc 900
atgcactttg tccgaacaac tggactttat tgacactaag aggggtgtat actgctgccg 960
tgaacatgag catgaaattg cttggtacac ggaacgttct gaaaagagct atgaattgca 1020
gacacctttt gaaattaaat tggcaaagaa atttgacacc ttcaatgggg aatgtccaaa 1080
ttttgtattt cccttaaatt ccataatcaa gactattcaa ccaagggttg aaaagaaaaa 1140
gcttgatggc tttatgggta gaattcgatc tgtctatcca gttgcgtcac caaatgaatg 1200
caaccaaatg tgcctttcaa ctctcatgaa gtgtgatcat tgtggtgaaa cttcatggca 1260
gacgggcgat tttgttaaag ccacttgcga attttgtggc actgagaatt tgactaaaga 1320
aggtgccact acttgtggtt acttacccca aaatgctgtt gttaaaattt attgtccagc 1380
atgtcacaat tcagaagtag gacctgagca tagtcttgcc gaataccata atgaatctgg 1440
cttgaaaacc attcttcgta agggtggtcg cactattgcc tttggaggct gtgtgttctc 1500
ttatgttggt tgccataaca agtgtgccta ttgggttcca cgtgctagcg ctaacatagg 1560
ttgtaaccat acaggtgttg ttggagaagg ttccgaaggt cttaatgaca accttcttga 1620
aatactccaa aaagagaaag tcaacatcaa tattgttggt gactttaaac ttaatgaaga 1680
gatcgccatt attttggcat ctttttctgc ttccacaagt gcttttgtgg aaactgtgaa 1740
aggtttggat tataaagcat tcaaacaaat tgttgaatcc tgtggtaatt ttaaagttac 1800
aaaaggaaaa gctaaaaaag gtgcctggaa tattggtgaa cagaaatcaa tactgagtcc 1860
tctttatgca tttgcatcag aggctgctcg tgttgtacga tcaattttct cccgcactct 1920
tgaaactgct caaaattctg tgcgtgtttt acagaaggcc gctataacaa tactagatgg 1980
aatttcacag tattcactga gactcattga tgctatgatg ttcacatctg atttggctac 2040
taacaatcta gttgtaatgg cctacattac aggtggtgtt gttcagttga cttcgcagtg 2100
gctaactaac atctttggca ctgtttatga aaaactcaaa cccgtccttg attggcttga 2160
agagaagttt aaggaaggtg tagagtttct tagagacggt tgggaaattg ttaaatttat 2220
ctcaacctgt gcttgtgaaa ttgtcggtgg acaaattgtc acctgtgcaa aggaaattaa 2280
ggagagtgtt cagacattct ttaagcttgt aaataaattt ttggctttgt gtgctgactc 2340
tatcattatt ggtggagcta aacttaaagc cttgaattta ggtgaaacat ttgtcacgca 2400
ctcaaaggga ttgtacagaa agtgtgttaa atccagagaa gaaactggcc tactcatgcc 2460
tctaaaagcc ccaaaagaaa ttatcttctt agagggagaa acacttccca cagaagtgtt 2520
aacagaggaa gttgtcttga aaactggtga tttacaacca ttagaacaac ctactagtga 2580
agctgttgaa gctccattgg ttggtacacc agtttgtatt aacgggctta tgttgctcga 2640
aatcaaagac acagaaaagt actgtgccct tgcacctaat atgatggtaa caaacaatac 2700
cttcacactc aaaggcggtg caccaacaaa ggttactttt ggtgatgaca ctgtgataga 2760
agtgcaaggt tacaagagtg tgaatatcac ttttgaactt gatgaaagga ttgataaagt 2820
acttaatgag aagtgctctg cctatacagt tgaactcggt acagaagtaa atgagttcgc 2880
ctgtgttgtg gcagatgctg tcataaaaac tttgcaacca gtatctgaat tacttacacc 2940
actgggcatt gatttagatg agtggagtat ggctacatac tacttatttg atgagtctgg 3000
tgagtttaaa ttggcttcac atatgtattg ttctttttac cctccagatg aggatgaaga 3060
agaaggtgat tgtgaagaag aagagtttga gccatcaact caatatgagt atggtactga 3120
agatgattac caaggtaaac ctttggaatt tggtgccact tctgctgctc ttcaacctga 3180
agaagagcaa gaagaagatt ggttagatga tgatagtcaa caaactgttg gtcaacaaga 3240
cggcagtgag gacaatcaga caactactat tcaaacaatt gttgaggttc aacctcaatt 3300
agagatggaa cttacaccag ttgttcagac tattgaagtg aatagtttta gtggttattt 3360
aaaacttact gacaatgtat acattaaaaa tgcagacatt gtggaagaag ctaaaaaggt 3420
aaaaccaaca gtggttgtta atgcagccaa tgtttacctt aaacatggag gaggtgttgc 3480
aggagcctta aataaggcta ctaacaatgc catgcaagtt gaatctgatg attacatagc 3540
tactaatgga ccacttaaag tgggtggtag ttgtgtttta agcggacaca atcttgctaa 3600
acactgtctt catgttgtcg gcccaaatgt taacaaaggt gaagacattc aacttcttaa 3660
gagtgcttat gaaaatttta atcagcacga agttctactt gcaccattat tatcagctgg 3720
tatttttggt gctgacccta tacattcttt aagagtttgt gtagatactg ttcgcacaaa 3780
tgtctactta gctgtctttg ataaaaatct ctatgacaaa cttgtttcaa gctttttgga 3840
aatgaagagt gaaaagcaag ttgaacaaaa gatcgctgag attcctaaag aggaagttaa 3900
gccatttata actgaaagta aaccttcagt tgaacagaga aaacaagatg ataagaaaat 3960
caaagcttgt gttgaagaag ttacaacaac tctggaagaa actaagttcc tcacagaaaa 4020
cttgttactt tatattgaca ttaatggcaa tcttcatcca gattctgcca ctcttgttag 4080
tgacattgac atcactttct taaagaaaga tgctccatat atagtgggtg atgttgttca 4140
agagggtgtt ttaactgctg tggttatacc tactaaaaag gctggtggca ctactgaaat 4200
gctagcgaaa gctttgagaa aagtgccaac agacaattat ataaccactt acccgggtca 4260
gggtttaaat ggttacactg tagaggaggc aaagacagtg cttaaaaagt gtaaaagtgc 4320
cttttacatt ctaccatcta ttatctctaa tgagaagcaa gaaattcttg gaactgtttc 4380
ttggaatttg cgagaaatgc ttgcacatgc agaagaaaca cgcaaattaa tgcctgtctg 4440
tgtggaaact aaagccatag tttcaactat acagcgtaaa tataagggta ttaaaataca 4500
agagggtgtg gttgattatg gtgctagatt ttacttttac accagtaaaa caactgtagc 4560
gtcacttatc aacacactta acgatctaaa tgaaactctt gttacaatgc cacttggcta 4620
tgtaacacat ggcttaaatt tggaagaagc tgctcggtat atgagatctc tcaaagtgcc 4680
agctacagtt tctgtttctt cacctgatgc tgttacagcg tataatggtt atcttacttc 4740
ttcttctaaa acacctgaag aacattttat tgaaaccatc tcacttgctg gttcctataa 4800
agattggtcc tattctggac aatctacaca actaggtata gaatttctta agagaggtga 4860
taaaagtgta tattacacta gtaatcctac cacattccac ctagatggtg aagttatcac 4920
ctttgacaat cttaagacac ttctttcttt gagagaagtg aggactatta aggtgtttac 4980
aacagtagac aacattaacc tccacacgca agttgtggac atgtcaatga catatggaca 5040
acagtttggt ccaacttatt tggatggagc tgatgttact aaaataaaac ctcataattc 5100
acatgaaggt aaaacatttt atgttttacc taatgatgac actctacgtg ttgaggcttt 5160
tgagtactac cacacaactg atcctagttt tctgggtagg tacatgtcag cattaaatca 5220
cactaaaaag tggaaatacc cacaagttaa tggtttaact tctattaaat gggcagataa 5280
caactgttat cttgccactg cattgttaac actccaacaa atagagttga agtttaatcc 5340
acctgctcta caagatgctt attacagagc aagggctggt gaagctgcta acttttgtgc 5400
acttatctta gcctactgta ataagacagt aggtgagtta ggtgatgtta gagaaacaat 5460
gagttacttg tttcaacatg ccaatttaga ttcttgcaaa agagtcttga acgtggtgtg 5520
taaaacttgt ggacaacagc agacaaccct taagggtgta gaagctgtta tgtacatggg 5580
cacactttct tatgaacaat ttaagaaagg tgttcagata ccttgtacgt gtggtaaaca 5640
agctacaaaa tatctagtac aacaggagtc accttttgtt atgatgtcag caccacctgc 5700
tcagtatgaa cttaagcatg gtacatttac ttgtgctagt gagtacactg gtaattacca 5760
gtgtggtcac tataaacata taacttctaa agaaactttg tattgcatag acggtgcttt 5820
acttacaaag tcctcagaat acaaaggtcc tattacggat gttttctaca aagaaaacag 5880
ttacacaaca accataaaac cagttactta taaattggat ggtgttgttt gtacagaaat 5940
tgaccctaag ttggacaatt attataagaa agacaattct tatttcacag agcaaccaat 6000
tgatcttgta ccaaaccaac catatccaaa cgcaagcttc gataatttta agtttgtatg 6060
tgataatatc aaatttgctg atgatttaaa ccagttaact ggttataaga aacctgcttc 6120
aagagagctt aaagttacat ttttccctga cttaaatggt gatgtggtgg ctattgatta 6180
taaacactac acaccctctt ttaagaaagg agctaaattg ttacataaac ctattgtttg 6240
gcatgttaac aatgcaacta ataaagccac gtataaacca aatacctggt gtatacgttg 6300
tctttggagc acaaaaccag ttgaaacatc aaattcgttt gatgtactga agtcagagga 6360
cgcgcaggga atggataatc ttgcctgcga agatctaaaa ccagtctctg aagaagtagt 6420
ggaaaatcct accatacaga aagacgttct tgagtgtaat gtgaaaacta ccgaagttgt 6480
aggagacatt atacttaaac cagcaaataa tagtttaaaa attacagaag aggttggcca 6540
cacagatcta atggctgctt atgtagacaa ttctagtctt actattaaga aacctaatga 6600
attatctaga gtattaggtt tgaaaaccct tgctactcat ggtttagctg ctgttaatag 6660
tgtcccttgg gatactatag ctaattatgc taagcctttt cttaacaaag ttgttagtac 6720
aactactaac atagttacac ggtgtttaaa ccgtgtttgt actaattata tgccttattt 6780
ctttacttta ttgctacaat tgtgtacttt tactagaagt acaaattcta gaattaaagc 6840
atctatgccg actactatag caaagaatac tgttaagagt gtcggtaaat tttgtctaga 6900
ggcttcattt aattatttga agtcacctaa tttttctaaa ctgataaata ttataatttg 6960
gtttttacta ttaagtgttt gcctaggttc tttaatctac tcaaccgctg ctttaggtgt 7020
tttaatgtct aatttaggca tgccttctta ctgtactggt tacagagaag gctatttgaa 7080
ctctactaat gtcactattg caacctactg tactggttct ataccttgta gtgtttgtct 7140
tagtggttta gattctttag acacctatcc ttctttagaa actatacaaa ttaccatttc 7200
atcttttaaa tgggatttaa ctgcttttgg cttagttgca gagtggtttt tggcatatat 7260
tcttttcact aggtttttct atgtacttgg attggctgca atcatgcaat tgtttttcag 7320
ctattttgca gtacatttta ttagtaattc ttggcttatg tggttaataa ttaatcttgt 7380
acaaatggcc ccgatttcag ctatggttag aatgtacatc ttctttgcat cattttatta 7440
tgtatggaaa agttatgtgc atgttgtaga cggttgtaat tcatcaactt gtatgatgtg 7500
ttacaaacgt aatagagcaa caagagtcga atgtacaact attgttaatg gtgttagaag 7560
gtccttttat gtctatgcta atggaggtaa aggcttttgc aaactacaca attggaattg 7620
tgttaattgt gatacattct gtgctggtag tacatttatt agtgatgaag ttgcgagaga 7680
cttgtcacta cagtttaaaa gaccaataaa tcctactgac cagtcttctt acatcgttga 7740
tagtgttaca gtgaagaatg gttccatcca tctttacttt gataaagctg gtcaaaagac 7800
ttatgaaaga cattctctct ctcattttgt taacttagac aacctgagag ctaataacac 7860
taaaggttca ttgcctatta atgttatagt ttttgatggt aaatcaaaat gtgaagaatc 7920
atctgcaaaa tcagcgtctg tttactacag tcagcttatg tgtcaaccta tactgttact 7980
agatcaggca ttagtgtctg atgttggtga tagtgcggaa gttgcagtta aaatgtttga 8040
tgcttacgtt aatacgtttt catcaacttt taacgtacca atggaaaaac tcaaaacact 8100
agttgcaact gcagaagctg aacttgcaaa gaatgtgtcc ttagacaatg tcttatctac 8160
ttttatttca gcagctcggc aagggtttgt tgattcagat gtagaaacta aagatgttgt 8220
tgaatgtctt aaattgtcac atcaatctga catagaagtt actggcgata gttgtaataa 8280
ctatatgctc acctataaca aagttgaaaa catgacaccc cgtgaccttg gtgcttgtat 8340
tgactgtagt gcgcgtcata ttaatgcgca ggtagcaaaa agtcacaaca ttgctttgat 8400
atggaacgtt aaagatttca tgtcattgtc tgaacaacta cgaaaacaaa tacgtagtgc 8460
tgctaaaaag aataacttac cttttaagtt gacatgtgca actactagac aagttgttaa 8520
tgttgtaaca acaaagatag cacttaaggg tggtaaaatt gttaataatt ggttgaagca 8580
gttaattaaa gttacacttg tgttcctttt tgttgctgct attttctatt taataacacc 8640
tgttcatgtc atgtctaaac atactgactt ttcaagtgaa atcataggat acaaggctat 8700
tgatggtggt gtcactcgtg acatagcatc tacagatact tgttttgcta acaaacatgc 8760
tgattttgac acatggttta gccagcgtgg tggtagttat actaatgaca aagcttgccc 8820
attgattgct gcagtcataa caagagaagt gggttttgtc gtgcctggtt tgcctggcac 8880
gatattacgc acaactaatg gtgacttttt gcatttctta cctagagttt ttagtgcagt 8940
tggtaacatc tgttacacac catcaaaact tatagagtac actgactttg caacatcagc 9000
ttgtgttttg gctgctgaat gtacaatttt taaagatgct tctggtaagc cagtaccata 9060
ttgttatgat accaatgtac tagaaggttc tgttgcttat gaaagtttac gccctgacac 9120
acgttatgtg ctcatggatg gctctattat tcaatttcct aacacctacc ttgaaggttc 9180
tgttagagtg gtaacaactt ttgattctga gtactgtagg cacggcactt gtgaaagatc 9240
agaagctggt gtttgtgtat ctactagtgg tagatgggta cttaacaatg attattacag 9300
atctttacca ggagttttct gtggtgtaga tgctgtaaat ttacttacta atatgtttac 9360
accactaatt caacctattg gtgctttgga catatcagca tctatagtag ctggtggtat 9420
tgtagctatc gtagtaacat gccttgccta ctattttatg aggtttagaa gagcttttgg 9480
tgaatacagt catgtagttg cctttaatac tttactattc cttatgtcat tcactgtact 9540
ctgtttaaca ccagtttact cattcttacc tggtgtttat tctgttattt acttgtactt 9600
gacattttat cttactaatg atgtttcttt tttagcacat attcagtgga tggttatgtt 9660
cacaccttta gtacctttct ggataacaat tgcttatatc atttgtattt ccacaaagca 9720
tttctattgg ttctttagta attacctaaa gagacgtgta gtctttaatg gtgtttcctt 9780
tagtactttt gaagaagctg cgctgtgcac ctttttgtta aataaagaaa tgtatctaaa 9840
gttgcgtagt gatgtgctat tacctcttac gcaatataat agatacttag ctctttataa 9900
taagtacaag tattttagtg gagcaatgga tacaactagc tacagagaag ctgcttgttg 9960
tcatctcgca aaggctctca atgacttcag taactcaggt tctgatgttc tttaccaacc 10020
accacaaacc tctatcacct cagctgtttt gcagagtggt tttagaaaaa tggcattccc 10080
atctggtaaa gttgagggtt gtatggtaca agtaacttgt ggtacaacta cacttaacgg 10140
tctttggctt gatgacgtag tttactgtcc aagacatgtg atctgcacct ctgaagacat 10200
gcttaaccct aattatgaag atttactcat tcgtaagtct aatcataatt tcttggtaca 10260
ggctggtaat gttcaactca gggttattgg acattctatg caaaattgtg tacttaagct 10320
taaggttgat acagccaatc ctaagacacc taagtataag tttgttcgca ttcaaccagg 10380
acagactttt tcagtgttag cttgttacaa tggttcacca tctggtgttt accaatgtgc 10440
tatgaggccc aatttcacta ttaagggttc attccttaat ggttcatgtg gtagtgttgg 10500
ttttaacata gattatgact gtgtctcttt ttgttacatg caccatatgg aattaccaac 10560
tggagttcat gctggcacag acttagaagg taacttttat ggaccttttg ttgacaggca 10620
aacagcacaa gcagctggta cggacacaac tattacagtt aatgttttag cttggttgta 10680
cgctgctgtt ataaatggag acaggtggtt tctcaatcga tttaccacaa ctcttaatga 10740
ctttaacctt gtggctatga agtacaatta tgaacctcta acacaagacc atgttgacat 10800
actaggacct ctttctgctc aaactggaat tgccgtttta gatatgtgtg cttcattaaa 10860
agaattactg caaaatggta tgaatggacg taccatattg ggtagtgctt tattagaaga 10920
tgaatttaca ccttttgatg ttgttagaca atgctcaggt gttactttcc aaagtgcagt 10980
gaaaagaaca atcaagggta cacaccactg gttgttactc acaattttga cttcactttt 11040
agttttagtc cagagtactc aatggtcttt gttctttttt ttgtatgaaa atgccttttt 11100
accttttgct atgggtatta ttgctatgtc tgcttttgca atgatgtttg tcaaacataa 11160
gcatgcattt ctctgtttgt ttttgttacc ttctcttgcc actgtagctt attttaatat 11220
ggtctatatg cctgctagtt gggtgatgcg tattatgaca tggttggata tggttgatac 11280
tagtttgtct ggttttaagc taaaagactg tgttatgtat gcatcagctg tagtgttact 11340
aatccttatg acagcaagaa ctgtgtatga tgatggtgct aggagagtgt ggacacttat 11400
gaatgtcttg acactcgttt ataaagttta ttatggtaat gctttagatc aagccatttc 11460
catgtgggct cttataatct ctgttacttc taactactca ggtgtagtta caactgtcat 11520
gtttttggcc agaggtattg tttttatgtg tgttgagtat tgccctattt tcttcataac 11580
tggtaataca cttcagtgta taatgctagt ttattgtttc ttaggctatt tttgtacttg 11640
ttactttggc ctcttttgtt tactcaaccg ctactttaga ctgactcttg gtgtttatga 11700
ttacttagtt tctacacagg agtttagata tatgaattca cagggactac tcccacccaa 11760
gaatagcata gatgccttca aactcaacat taaattgttg ggtgttggtg gcaaaccttg 11820
tatcaaagta gccactgtac agtctaaaat gtcagatgta aagtgcacat cagtagtctt 11880
actctcagtt ttgcaacaac tcagagtaga atcatcatct aaattgtggg ctcaatgtgt 11940
ccagttacac aatgacattc tcttagctaa agatactact gaagcctttg aaaaaatggt 12000
ttcactactt tctgttttgc tttccatgca gggtgctgta gacataaaca agctttgtga 12060
agaaatgctg gacaacaggg caaccttaca agctatagcc tcagagttta gttcccttcc 12120
atcatatgca gcttttgcta ctgctcaaga agcttatgag caggctgttg ctaatggtga 12180
ttctgaagtt gttcttaaaa agttgaagaa gtctttgaat gtggctaaat ctgaatttga 12240
ccgtgatgca gccatgcaac gtaagttgga aaagatggct gatcaagcta tgacccaaat 12300
gtataaacag gctagatctg aggacaagag ggcaaaagtt actagtgcta tgcagacaat 12360
gcttttcact atgcttagaa agttggataa tgatgcactc aacaacatta tcaacaatgc 12420
aagagatggt tgtgttccct tgaacataat acctcttaca acagcagcca aactaatggt 12480
tgtcatacca gactataaca catataaaaa tacgtgtgat ggtacaacat ttacttatgc 12540
atcagcattg tgggaaatcc aacaggttgt agatgcagat agtaaaattg ttcaacttag 12600
tgaaattagt atggacaatt cacctaattt agcatggcct cttattgtaa cagctttaag 12660
ggccaattct gctgtcaaat tacagaataa tgagcttagt cctgttgcac tacgacagat 12720
gtcttgtgct gccggtacta cacaaactgc ttgcactgat gacaatgcgt tagcttacta 12780
caacacaaca aagggaggta ggtttgtact tgcactgtta tccgatttac aggatttgaa 12840
atgggctaga ttccctaaga gtgatggaac tggtactatc tatacagaac tggaaccacc 12900
ttgtaggttt gttacagaca cacctaaagg tcctaaagtg aagtatttat actttattaa 12960
aggattaaac aacctaaata gaggtatggt acttggtagt ttagctgcca cagtacgtct 13020
acaagctggt aatgcaacag aagtgcctgc caattcaact gtattatctt tctgtgcttt 13080
tgctgtagat gctgctaaag cttacaaaga ttatctagct agtgggggac aaccaatcac 13140
taattgtgtt aagatgttgt gtacacacac tggtactggt caggcaataa cagttacacc 13200
ggaagccaat atggatcaag aatcctttgg tggtgcatcg tgttgtctgt actgccgttg 13260
ccacatagat catccaaatc ctaaaggatt ttgtgactta aaaggtaagt atgtacaaat 13320
acctacaact tgtgctaatg accctgtggg ttttacactt aaaaacacag tctgtaccgt 13380
ctgcggtatg tggaaaggtt atggctgtag ttgtgatcaa ctccgcgaac ccatgcttca 13440
gtcagctgat gcacaatcgt ttttaaacgg gtttgcggtg taagtgcagc ccgtcttaca 13500
ccgtgcggca caggcactag tactgatgtc gtatacaggg cttttgacat ctacaatgat 13560
aaagtagctg gttttgctaa attcctaaaa actaattgtt gtcgcttcca agaaaaggac 13620
gaagatgaca atttaattga ttcttacttt gtagttaaga gacacacttt ctctaactac 13680
caacatgaag aaacaattta taatttactt aaggattgtc cagctgttgc taaacatgac 13740
ttctttaagt ttagaataga cggtgacatg gtaccacata tatcacgtca acgtcttact 13800
aaatacacaa tggcagacct cgtctatgct ttaaggcatt ttgatgaagg taattgtgac 13860
acattaaaag aaatacttgt cacatacaat tgttgtgatg atgattattt caataaaaag 13920
gactggtatg attttgtaga aaacccagat atattacgcg tatacgccaa cttaggtgaa 13980
cgtgtacgcc aagctttgtt aaaaacagta caattctgtg atgccatgcg aaatgctggt 14040
attgttggtg tactgacatt agataatcaa gatctcaatg gtaactggta tgatttcggt 14100
gatttcatac aaaccacgcc aggtagtgga gttcctgttg tagattctta ttattcattg 14160
ttaatgccta tattaacctt gaccagggct ttaactgcag agtcacatgt tgacactgac 14220
ttaacaaagc cttacattaa gtgggatttg ttaaaatatg acttcacgga agagaggtta 14280
aaactctttg accgttattt taaatattgg gatcagacat accacccaaa ttgtgttaac 14340
tgtttggatg acagatgcat tctgcattgt gcaaacttta atgttttatt ctctacagtg 14400
ttcccactta caagttttgg accactagtg agaaaaatat ttgttgatgg tgttccattt 14460
gtagtttcaa ctggatacca cttcagagag ctaggtgttg tacataatca ggatgtaaac 14520
ttacatagct ctagacttag ttttaaggaa ttacttgtgt atgctgctga ccctgctatg 14580
cacgctgctt ctggtaatct attactagat aaacgcacta cgtgcttttc agtagctgca 14640
cttactaaca atgttgcttt tcaaactgtc aaacccggta attttaacaa agacttctat 14700
gactttgctg tgtctaaggg tttctttaag gaaggaagtt ctgttgaatt aaaacacttc 14760
ttctttgctc aggatggtaa tgctgctatc agcgattatg actactatcg ttataatcta 14820
ccaacaatgt gtgatatcag acaactacta tttgtagttg aagttgttga taagtacttt 14880
gattgttacg atggtggctg tattaatgct aaccaagtca tcgtcaacaa cctagacaaa 14940
tcagctggtt ttccatttaa taaatggggt aaggctagac tttattatga ttcaatgagt 15000
tatgaggatc aagatgcact tttcgcatat acaaaacgta atgtcatccc tactataact 15060
caaatgaatc ttaagtatgc cattagtgca aagaatagag ctcgcaccgt agctggtgtc 15120
tctatctgta gtactatgac caatagacag tttcatcaaa aattattgaa atcaatagcc 15180
gccactagag gagctactgt agtaattgga acaagcaaat tctatggtgg ttggcacaac 15240
atgttaaaaa ctgtttatag tgatgtagaa aaccctcacc ttatgggttg ggattatcct 15300
aaatgtgata gagccatgcc taacatgctt agaattatgg cctcacttgt tcttgctcgc 15360
aaacatacaa cgtgttgtag cttgtcacac cgtttctata gattagctaa tgagtgtgct 15420
caagtattga gtgaaatggt catgtgtggc ggttcactat atgttaaacc aggtggaacc 15480
tcatcaggag atgccacaac tgcttatgct aatagtgttt ttaacatttg tcaagctgtc 15540
acggccaatg ttaatgcact tttatctact gatggtaaca aaattgccga taagtatgtc 15600
cgcaatttac aacacagact ttatgagtgt ctctatagaa atagagatgt tgacacagac 15660
tttgtgaatg agttttacgc atatttgcgt aaacatttct caatgatgat actctctgac 15720
gatgctgttg tgtgtttcaa tagcacttat gcatctcaag gtctagtggc tagcataaag 15780
aactttaagt cagttcttta ttatcaaaac aatgttttta tgtctgaagc aaaatgttgg 15840
actgagactg accttactaa aggacctcat gaattttgct ctcaacatac aatgctagtt 15900
aaacagggtg atgattatgt gtaccttcct tacccagatc catcaagaat cctaggggcc 15960
ggctgttttg tagatgatat cgtaaaaaca gatggtacac ttatgattga acggttcgtg 16020
tctttagcta tagatgctta cccacttact aaacatccta atcaggagta tgctgatgtc 16080
tttcatttgt acttacaata cataagaaag ctacatgatg agttaacagg acacatgtta 16140
gacatgtatt ctgttatgct tactaatgat aacacttcaa ggtattggga acctgagttt 16200
tatgaggcta tgtacacacc gcatacagtc ttacaggctg ttggggcttg tgttctttgc 16260
aattcacaga cttcattaag atgtggtgct tgcatacgta gaccattctt atgttgtaaa 16320
tgctgttacg accatgtcat atcaacatca cataaattag tcttgtctgt taatccgtat 16380
gtttgcaatg ctccaggttg tgatgtcaca gatgtgactc aactttactt aggaggtatg 16440
agctattatt gtaaatcaca taaaccaccc attagttttc cattgtgtgc taatggacaa 16500
gtttttggtt tatataaaaa tacatgtgtt ggtagcgata atgttactga ctttaatgca 16560
attgcaacat gtgactggac aaatgctggt gattacattt tagctaacac ctgtactgaa 16620
agactcaagc tttttgcagc agaaacgctc aaagctactg aggagacatt taaactgtct 16680
tatggtattg ctactgtacg tgaagtgctg tctgacagag aattacatct ttcatgggaa 16740
gttggtaaac ctagaccacc acttaaccga aattatgtct ttactggtta tcgtgtaact 16800
aaaaacagta aagtacaaat aggagagtac acctttgaaa aaggtgacta tggtgatgct 16860
gttgtttacc gaggtacaac aacttacaaa ttaaatgttg gtgattattt tgtgctgaca 16920
tcacatacag taatgccatt aagtgcacct acactagtgc cacaagagca ctatgttaga 16980
attactggct tatacccaac actcaatatc tcagatgagt tttctagcaa tgttgcaaat 17040
tatcaaaagg ttggtatgca aaagtattct acactccagg gaccacctgg tactggtaag 17100
agtcattttg ctattggcct agctctctac tacccttctg ctcgcatagt gtatacagct 17160
tgctctcatg ccgctgttga tgcactatgt gagaaggcat taaaatattt gcctatagat 17220
aaatgtagta gaattatacc tgcacgtgct cgtgtagagt gttttgataa attcaaagtg 17280
aattcaacat tagaacagta tgtcttttgt actgtaaatg cattgcctga gacgacagca 17340
gatatagttg tctttgatga aatttcaatg gccacaaatt atgatttgag tgttgtcaat 17400
gccagattac gtgctaagca ctatgtgtac attggcgacc ctgctcaatt acctgcacca 17460
cgcacattgc taactaaggg cacactagaa ccagaatatt tcaattcagt gtgtagactt 17520
atgaaaacta taggtccaga catgttcctc ggaacttgtc ggcgttgtcc tgctgaaatt 17580
gttgacactg tgagtgcttt ggtttatgat aataagctta aagcacataa agacaaatca 17640
gctcaatgct ttaaaatgtt ttataagggt gttatcacgc atgatgtttc atctgcaatt 17700
aacaggccac aaataggcgt ggtaagagaa ttccttacac gtaaccctgc ttggagaaaa 17760
gctgtcttta tttcacctta taattcacag aatgctgtag cctcaaagat tttgggacta 17820
ccaactcaaa ctgttgattc atcacagggc tcagaatatg actatgtcat attcactcaa 17880
accactgaaa cagctcactc ttgtaatgta aacagattta atgttgctat taccagagca 17940
aaagtaggca tactttgcat aatgtctgat agagaccttt atgacaagtt gcaatttaca 18000
agtcttgaaa ttccacgtag gaatgtggca actttacaag ctgaaaatgt aacaggactc 18060
tttaaagatt gtagtaaggt aatcactggg ttacatccta cacaggcacc tacacacctc 18120
agtgttgaca ctaaattcaa aactgaaggt ttatgtgttg acatacctgg catacctaag 18180
gacatgacct atagaagact catctctatg atgggtttta aaatgaatta tcaagttaat 18240
ggttacccta acatgtttat cacccgcgaa gaagctataa gacatgtacg tgcatggatt 18300
ggcttcgatg tcgaggggtg tcatgctact agagaagctg ttggtaccaa tttaccttta 18360
cagctaggtt tttctacagg tgttaaccta gttgctgtac ctacaggtta tgttgataca 18420
cctaataata cagatttttc cagagttagt gctaaaccac cgcctggaga tcaatttaaa 18480
cacctcatac cacttatgta caaaggactt ccttggaatg tagtgcgtat aaagattgta 18540
caaatgttaa gtgacacact taaaaatctc tctgacagag tcgtatttgt cttatgggca 18600
catggctttg agttgacatc tatgaagtat tttgtgaaaa taggacctga gcgcacctgt 18660
tgtctatgtg atagacgtgc cacatgcttt tccactgctt cagacactta tgcctgttgg 18720
catcattcta ttggatttga ttacgtctat aatccgttta tgattgatgt tcaacaatgg 18780
ggttttacag gtaacctaca aagcaaccat gatctgtatt gtcaagtcca tggtaatgca 18840
catgtagcta gttgtgatgc aatcatgact aggtgtctag ctgtccacga gtgctttgtt 18900
aagcgtgttg actggactat tgaatatcct ataattggtg atgaactgaa gattaatgcg 18960
gcttgtagaa aggttcaaca catggttgtt aaagctgcat tattagcaga caaattccca 19020
gttcttcacg acattggtaa ccctaaagct attaagtgtg tacctcaagc tgatgtagaa 19080
tggaagttct atgatgcaca gccttgtagt gacaaagctt ataaaataga agaattattc 19140
tattcttatg ccacacattc tgacaaattc acagatggtg tatgcctatt ttggaattgc 19200
aatgtcgata gatatcctgc taattccatt gtttgtagat ttgacactag agtgctatct 19260
aaccttaact tgcctggttg tgatggtggc agtttgtatg taaataaaca tgcattccac 19320
acaccagctt ttgataaaag tgcttttgtt aatttaaaac aattaccatt tttctattac 19380
tctgacagtc catgtgagtc tcatggaaaa caagtagtgt cagatataga ttatgtacca 19440
ctaaagtctg ctacgtgtat aacacgttgc aatttaggtg gtgctgtctg tagacatcat 19500
gctaatgagt acagattgta tctcgatgct tataacatga tgatctcagc tggctttagc 19560
ttgtgggttt acaaacaatt tgatacttat aacctctgga acacttttac aagacttcag 19620
agtttagaaa atgtggcttt taatgttgta aataagggac actttgatgg acaacagggt 19680
gaagtaccag tttctatcat taataacact gtttacacaa aagttgatgg tgttgatgta 19740
gaattgtttg aaaataaaac aacattacct gttaatgtag catttgagct ttgggctaag 19800
cgcaacatta aaccagtacc agaggtgaaa atactcaata atttgggtgt ggacattgct 19860
gctaatactg tgatctggga ctacaaaaga gatgctccag cacatatatc tactattggt 19920
gtttgttcta tgactgacat agccaagaaa ccaactgaaa cgatttgtgc accactcact 19980
gtcttttttg atggtagagt tgatggtcaa gtagacttat ttagaaatgc ccgtaatggt 20040
gttcttatta cagaaggtag tgttaaaggt ttacaaccat ctgtaggtcc caaacaagct 20100
agtcttaatg gagtcacatt aattggagaa gccgtaaaaa cacagttcaa ttattataag 20160
aaagttgatg gtgttgtcca acaattacct gaaacttact ttactcagag tagaaattta 20220
caagaattta aacccaggag tcaaatggaa attgatttct tagaattagc tatggatgaa 20280
ttcattgaac ggtataaatt agaaggctat gccttcgaac atatcgttta tggagatttt 20340
agtcatagtc agttaggtgg tttacatcta ctgattggac tagctaaacg ttttaaggaa 20400
tcaccttttg aattagaaga ttttattcct atggacagta cagttaaaaa ctatttcata 20460
acagatgcgc aaacaggttc atctaagtgt gtgtgttctg ttattgattt attacttgat 20520
gattttgttg aaataataaa atcccaagat ttatctgtag tttctaaggt tgtcaaagtg 20580
actattgact atacagaaat ttcatttatg ctttggtgta aagatggcca tgtagaaaca 20640
ttttacccaa aattacaatc tagtcaagcg tggcaaccgg gtgttgctat gcctaatctt 20700
tacaaaatgc aaagaatgct attagaaaag tgtgaccttc aaaattatgg tgatagtgca 20760
acattaccta aaggcataat gatgaatgtc gcaaaatata ctcaactgtg tcaatattta 20820
aacacattaa cattagctgt accctataat atgagagtta tacattttgg tgctggttct 20880
gataaaggag ttgcaccagg tacagctgtt ttaagacagt ggttgcctac gggtacgctg 20940
cttgtcgatt cagatcttaa tgactttgtc tctgatgcag attcaacttt gattggtgat 21000
tgtgcaactg tacatacagc taataaatgg gatctcatta ttagtgatat gtacgaccct 21060
aagactaaaa atgttacaaa agaaaatgac tctaaagagg gttttttcac ttacatttgt 21120
gggtttatac aacaaaagct agctcttgga ggttccgtgg ctataaagat aacagaacat 21180
tcttggaatg ctgatcttta taagctcatg ggacacttcg catggtggac agcctttgtt 21240
actaatgtga atgcgtcatc atctgaagca tttttaattg gatgtaatta tcttggcaaa 21300
ccacgcgaac aaatagatgg ttatgtcatg catgcaaatt acatattttg gaggaataca 21360
aatccaattc agttgtcttc ctattcttta tttgacatga gtaaatttcc ccttaaatta 21420
aggggtactg ctgttatgtc tttaaaagaa ggtcaaatca atgatatgat tttatctctt 21480
cttagtaaag gtagacttat aattagagaa aacaacagag ttgttatttc tagtgatgtt 21540
cttgttaaca actaaacgaa caatgtttgt ttttcttgtt ttattgccac tagtctctag 21600
tcagtgtgtt aatcttacaa ccagaactca attaccccct gcatacacta attctttcac 21660
acgtggtgtt tattaccctg acaaagtttt cagatcctca gttttacatt caactcagga 21720
cttgttctta cctttctttt ccaatgttac ttggttccat gctatacatg tctctgggac 21780
caatggtact aagaggtttg ataaccctgt cctaccattt aatgatggtg tttattttgc 21840
ttccactgag aagtctaaca taataagagg ctggattttt ggtactactt tagattcgaa 21900
gacccagtcc ctacttattg ttaataacgc tactaatgtt gttattaaag tctgtgaatt 21960
tcaattttgt aatgatccat ttttgggtgt ttattaccac aaaaacaaca aaagttggat 22020
ggaaagtgag ttcagagttt attctagtgc gaataattgc acttttgaat atgtctctca 22080
gccttttctt atggaccttg aaggaaaaca gggtaatttc aaaaatctta gggaatttgt 22140
gtttaagaat attgatggtt attttaaaat atattctaag cacacgccta ttaatttagt 22200
gcgtgatctc cctcagggtt tttcggcttt agaaccattg gtagatttgc caataggtat 22260
taacatcact aggtttcaaa ctttacttgc tttacataga agttatttga ctcctggtga 22320
ttcttcttca ggttggacag ctggtgctgc agcttattat gtgggttatc ttcaacctag 22380
gacttttcta ttaaaatata atgaaaatgg aaccattaca gatgctgtag actgtgcact 22440
tgaccctctc tcagaaacaa agtgtacgtt gaaatccttc actgtagaaa aaggaatcta 22500
tcaaacttct aactttagag kccaaccaac agaatctatt gttagatttc ctaatattac 22560
aaacttgtgc ccttttggtg aagtttttaa cgccaccaga tttgcatctg tttatgcttg 22620
gaacaggaag agaatcagca actgtgttgc tgattattct gtcctatata attccgcatc 22680
attttccact tttaagtgtt atggagtgtc tcctactaaa ttaaatgatc tctgctttac 22740
taatgtctat gcagattcat ttgtaattag aggtgatgaa gtcagacaaa tcgctccagg 22800
gcaaactgga aagattgctg attataatta taaattacca gatgatttta caggctgcgt 22860
tatagcttgg aattctaaca atcttgattc taaggttggt ggtaattata attacctgta 22920
tagattgttt aggaagtcta atctcaaacc ttttgagaga gatatttcaa ctgaaatcta 22980
tcaggccggt agcacacctt gtaatggtgt tgaaggtttt aattgttact ttcctttaca 23040
atcatatggt ttccaaccca ctaatggtgt tggttaccaa ccatacagag tagtagtact 23100
ttcttttgaa cttctacatg caccagcaac tgtttgtgga cctaaaaagt ctactaattt 23160
ggttaaaaac aaatgtgtca atttcaactt caatggttta acaggcacag gtgttcttac 23220
tgagtctaac aaaaagtttc tgcctttcca acaatttggc agagacattg ctgacactac 23280
tgatgctgtc cgtgatccac agacacttga gattcttgac attacaccat gttcttttgg 23340
tggtgtcagt gttataacac caggaacaaa tacttctaac caggttgctg ttctttatca 23400
gggtgttaac tgcacagaag tccctgttgc tattcatgca gatcaactta ctcctacttg 23460
gcgtgtttat tctacaggtt ctaatgtttt tcaaacacgt gcaggctgtt taataggggc 23520
tgaacatgtc aacaactcat atgagtgtga catacccatt ggtgcaggta tatgcgctag 23580
ttatcagact cagactaatt ctcctcggcg ggcacgtagt gtagctagtc aatccatcat 23640
tgcctacact atgtcacttg gtgcagaaaa ttcagttgct tactctrata actctattgc 23700
catacccaca aattttacta ttagtgttac cacagaaatt ctaccagtgt ctatgaccaa 23760
gacatcagta gattgtacaa tgtacatttg tggtgattca actgaatgca gcaatctttt 23820
gttgcaatat ggcagttttt gtacacaatt aaaccgtgct ttaactggaa tagctgttga 23880
acaagacaaa aacacccaag aagtttttgc acaagtcaaa caaatttaca aaacaccacc 23940
aattaaagat tttggtggtt ttaatttttc acaaatatta ccagatccat caaaaccaag 24000
caagaggtca tttattgaag atctactttt caacaaagtg acacttgcag atgctggctt 24060
catcaaacaa tatggtgatt gccttggtga tattgctgct agagacctca tttgtgcaca 24120
aaagtttaac ggccttactg ttttgccacc tttgctcaca gatgaaatga ttgctcaata 24180
cacttctgca ctgttagcgg gtacaatcac ttctggttgg acctttggtg caggtgctgc 24240
attacaaata ccatttgcta tgcaaatggc ttataggttt aatggtattg gagttacaca 24300
gaatgttctc tatgagaacc aaaaattgat tgccaaccaa tttaatagtg ctattggcaa 24360
aattcaagac tcactttctt ccacagcaag tgcacttgga aaacttcaag atgtggtcaa 24420
ccaaaatgca caagctttaa acacgcttgt taaacaactt agctccaatt ttggtgcaat 24480
ttcaagtgtt ttaaatgata tcctttcacg tcttgacaaa gttgaggctg aagtgcaaat 24540
tgataggttg atcacaggca gacttcaaag tttgcagaca tatgtgactc aacaattaat 24600
tagagctgca gaaatcagag tttctgctaa tcttgctgct actaaaatgt cagagtgtgt 24660
acttggacaa tcaaaaagag ttgatttttg tggaaagggc tatcatctta tgtccttccc 24720
tcagtcagca cctcatggtg tagtcttctt gcatgtgact tatgtccctg cacaagaaaa 24780
gaacttcaca actgctcctg ccatttgtca tgatggaaaa gcacactttc ctcgtgaagg 24840
tgtctttgtt tcaaatggca cacactggtt tgtaacacaa aggaattttt atgaaccaca 24900
aatcattact acagacaaca catttgtgtc tggtaactgt gatgttgtaa taggaattgt 24960
caacaacaca gtttatgatc ctttgcaacc tgaattagac tcattcaagg aggagttaga 25020
taaatatttt aagaatcata catcaccaga tgttgattta ggtgacatct ctggcattaa 25080
tgcttcagtt gtaaacattc aaaaagaaat tgaccgcctc aatgaggttg ccaagaattt 25140
aaatgaatct ctcatcgatc tccaagaact tggaaagtat gagcagtata taaaatggcc 25200
atggtacatt tggctaggtt ttatagctgg cttgattgcc atagtaatgg tgacaattat 25260
gctttgctgt atgaccagtt gctgtagttg tctcaagggc tgttgttctt gtggatcctg 25320
ctgcaaattt gatgaagacg actctgagcc agtgctcaaa ggagtcaaat tacattacac 25380
ataaacgaac ttatggattt gtttatgaga atcttcacaa ttggaactgt aactttgaag 25440
caaggtgaaa tcaaggatgc tactccttca gattttgttc gcgctactgc aacgataccg 25500
atacaagcct cactcccttt cggatggctt attgttggcg ttgcacttct tgctgttttt 25560
cagagcgctt ccaaaatcat aaccctcaaa aagagatggc aactagcact ctccaagggt 25620
gttcactttg tttgcaactt gctgttgttg tttgtaacag tttactcaca ccttttgctc 25680
gttgctgctg gccttgaagc cccttttctc tatctttatg ctttagtcta cttcttgcag 25740
agtataaact ttgtaagaat aataatgagg ctttggcttt gctggaaatg ccgttccaaa 25800
aacccattac tttatgatgc caactatttt ctttgctggc atactaattg ttacgactat 25860
tgtatacctt acaatagtgt aacttcttca attgtcatta cttcaggtga tggcacaaca 25920
agtcctattt ctgaacatga ctaccagatt ggtggttata ctgaaaaatg ggaatctgga 25980
gtaaaagact gtgttgtatt acacagttac ttcacttcag actattacca gctgtactca 26040
actcaattga gtacagacac tggtgttgaa catgttacct tcttcatcta caataaaatt 26100
gttgatgagc ctgaagaaca tgtccaaatt cacacaatcg acggttcatc cggagttgtt 26160
aatccagtaa tggaaccaat ttatgatgaa ccgacgacga ctactagcgt gcctttgtaa 26220
gcacaagctg atgagtacga acttatgtac tcattcgttt cggaagagmc aggtacgtta 26280
atagttaata gcgtacttct ttttcttgct ttcgtggtat tcttgctagt tacactagcc 26340
atccttactg cgcttcgatt gtgtgcgtac tgctgcaata ttgttaacgt gagtcttgta 26400
aaaccttctt tttacgttta ctctcgtgtt aaaaatctga attcttctag agttcctgat 26460
cttctggtct aaacgaacta aatattatat tagtttttct gtttggaact ttaattttag 26520
ccatggcaga ttccaacggt actattaccg ttgaagagct taaaaagctc cttgaacaat 26580
ggaacctagt aataggtttc ctattcctta catggatttg tcttctacaa tttgcctatg 26640
ccaacaggaa taggtttttg tatataatta agttaatttt cctctggctg ttatggccag 26700
taactttagc ttgttttgtg cttgctgctg tttacagaat aaattggatc accggtggaa 26760
ttgctatcgc aatggcttgt cttgtaggct tgatgtggct cagctacttc attgcttctt 26820
tcagactgtt tgcgcgtacg cgttccatgt ggtcattcaa tccagaaact aacattcttc 26880
tcaacgtgcc actccatggc actattctga ccagaccgct tctagaaagt gaactcgtaa 26940
tcggagctgt gatccttcgt ggacatcttc gtattgctgg acaccatcta ggacgctgtg 27000
acatcaagga cctgcctaaa gaaatcactg ttgctacatc acgaatgctt tcttattaca 27060
aattgggagc ttcgcagcgt gtagcaggtg actcaggttt tgctgcatac agtcgctaca 27120
ggattggcaa ctataaatta aacacagacc attccagtag cagtgacaat attgctttgc 27180
ttgtacagta agtgacaaca gatgtttcat ctcgttgact ttcaggttac tatagcagag 27240
atattactaa ttattatgag gacttttaaa gtttccattt ggaatcttga ttacatcata 27300
aacctcataa ttaaaaattt atctaagtca ctaactgaga ataaatattc tcaattagat 27360
gaagagcaac caatggagat tgattaaacg aacatgaaaa ttattctttt cttggcactg 27420
ataacactcg ctacttgtga gctttatcac taccaagagt gtgttagagg tacaacagta 27480
cttttaaaag aaccttgctc ttctggaaca tacgagggca attcaccatt tcatcctcta 27540
gctgataaca aatttgcact gacttgcttt agcactcaat ttgcttttgc ttgtcctgac 27600
ggcgtaaaac acgtctatca gttacgtgcc agatcagttt cacctaaact gttcatcaga 27660
caagaggaag ttcaagaact ttactctcca atttttctta ttgttgcggc aatagtgttt 27720
ataacacttt gcttcacact caaaagaaag acagaatgat tgaactttca ttaattgact 27780
tctatttgtg ctttttagcc tttctgctat tccttgtttt aattatgctt attatctttt 27840
ggttctcact tgaactgcaa gatcataatg aaacttgtca cgcctaaacg aacatgaaat 27900
ttcttgtttt cttaggaatc atcacaactg tagctgcatt tcaccaagaa tgtagtttac 27960
agtcatgtac tcaacatcaa ccatatgtag ttgatgaccc gtgtcctatt cacttctatt 28020
ctaaatggta tattagagta ggagctagaa aatcagcacc tttaattgaa ttgtgcgtgg 28080
atgaggctgg ttctaaatca cccattcagt acatcgatat cggtaattat acagtttcct 28140
gtttaccttt tacaattaat tgccaggaac ctaaattggg tagtcttgta gtgcgttgtt 28200
cgttctatga agacttttta gagtatcatg acgttcgtgt tgttttagat ttcatctaaa 28260
cgaacaaact aaaatgtctg ataatggacc ccaaaatcag cgaaatgcac cccgcattac 28320
gtttggtgga ccctcagatt caactggcag taaccagaat ggagaacgca gtggggcgcg 28380
atcaaaacaa cgtcggcccc aaggtttacc caataatact gcgtcttggt tcaccgctct 28440
cactcaacat ggcaaggaag accttaaatt ccctcgagga caaggcgttc caattaacac 28500
caatagcagt ccagatgacc aaattggcta ctaccgaaga gctaccagac gaattcgtgg 28560
tggtgacggt aaaatgaaag atctcagtcc aagatggtat ttctactacc taggaactgg 28620
gccagaagct ggacttccct atggtgctaa caaagacggc atcatatggg ttgcaactga 28680
gggagccttg aatacaccaa aagatcacat tggcacccgc aatcctgcta acaatgctgc 28740
aatcgtgcta caacttcctc aaggaacaac attgccaaaa ggcttctacg cagaagggag 28800
cagaggcggc agtcaagcct cttctcgttc ctcatcacgt agtcgcaaca gttcaagaaa 28860
ttcaactcca ggcagcagta aacgaacttc tcctgctaga atggctggca atggcggtga 28920
tgctgctctt gctttgctgc tgcttgacag attgaaccag cttgagagca aaatgtctgg 28980
taaaggccaa caacaacaag gccaaactgt cactaagaaa tctgctgctg aggcttctaa 29040
gaagcctcgg caaaaacgta ctgccactaa agcatacaat gtaacacaag ctttcggcag 29100
acgtggtcca gaacaaaccc aaggaaattt tggggaccag gaactaatca gacaaggaac 29160
tgattacaaa cattggccgc aaattgcaca atttgccccc agcgcttcag cgttcttcgg 29220
aatgtcgcgc attggcatgg aagtcacacc ttcgggaacg tggttgacct acacaggtgc 29280
catcaaattg gatgacaaag atccaaattt caaagatcaa gtcattttgc tgaataagca 29340
tattgacgca tacaaaacat tcccaccaac agagcctaaa aaggacaaaa agaagaaggc 29400
tgatgaaact caagccttac cgcagagaca gaagaaacag caaactgtga ctcttcttcc 29460
tgctgcagat ttggatgatt tctccaaaca attgcaacaa tccatgagca gtgctgactc 29520
aactcaggcc taaactcatg cagaccacac aaggcagatg ggctatataa acgttttcgc 29580
ttttccgttt acgatatata gtctactctt gtgcagaatg aattctcgta actacatagc 29640
acaagtagat gtagttaact ttaatctcac atagcaatct ttaatcagtg tgtaacatta 29700
gggaggactt gaaagagcca ccacattttc accgaggcca cgcggagtac gatcgagtgt 29760
acagtgaaca atgctaggga gagctgccta tatggaagag ccctaatgtg taaaattaat 29820
tttagtagtg ctatccccat gtgattttaa tagcttctta ggagnatgac annnnnnnnn 29880
nnnnn 29885
<210> 16
<211> 4405
<212> PRT
<213> SARS-CoV2
<400> 16
Met Glu Ser Leu Val Pro Gly Phe Asn Glu Lys Thr His Val Gln Leu
1 5 10 15
Ser Leu Pro Val Leu Gln Val Arg Asp Val Leu Val Arg Gly Phe Gly
20 25 30
Asp Ser Val Glu Glu Val Leu Ser Glu Ala Arg Gln His Leu Lys Asp
35 40 45
Gly Thr Cys Gly Leu Val Glu Val Glu Lys Gly Val Leu Pro Gln Leu
50 55 60
Glu Gln Pro Tyr Val Phe Ile Lys Arg Ser Asp Ala Arg Thr Ala Pro
65 70 75 80
His Gly His Val Met Val Glu Leu Val Ala Glu Leu Glu Gly Ile Gln
85 90 95
Tyr Gly Arg Ser Gly Glu Thr Leu Gly Val Leu Val Pro His Val Gly
100 105 110
Glu Ile Pro Val Ala Tyr Arg Lys Val Leu Leu Arg Lys Asn Gly Asn
115 120 125
Lys Gly Ala Gly Gly His Ser Tyr Gly Ala Asp Leu Lys Ser Phe Asp
130 135 140
Leu Gly Asp Glu Leu Gly Thr Asp Pro Tyr Glu Asp Phe Gln Glu Asn
145 150 155 160
Trp Asn Thr Lys His Ser Ser Gly Val Thr Arg Glu Leu Met Arg Glu
165 170 175
Leu Asn Gly Gly Ala Tyr Thr Arg Tyr Val Asp Asn Asn Phe Cys Gly
180 185 190
Pro Asp Gly Tyr Pro Leu Glu Cys Ile Lys Asp Leu Leu Ala Arg Ala
195 200 205
Gly Lys Ala Ser Cys Thr Leu Ser Glu Gln Leu Asp Phe Ile Asp Thr
210 215 220
Lys Arg Gly Val Tyr Cys Cys Arg Glu His Glu His Glu Ile Ala Trp
225 230 235 240
Tyr Thr Glu Arg Ser Glu Lys Ser Tyr Glu Leu Gln Thr Pro Phe Glu
245 250 255
Ile Lys Leu Ala Lys Lys Phe Asp Thr Phe Asn Gly Glu Cys Pro Asn
260 265 270
Phe Val Phe Pro Leu Asn Ser Ile Ile Lys Thr Ile Gln Pro Arg Val
275 280 285
Glu Lys Lys Lys Leu Asp Gly Phe Met Gly Arg Ile Arg Ser Val Tyr
290 295 300
Pro Val Ala Ser Pro Asn Glu Cys Asn Gln Met Cys Leu Ser Thr Leu
305 310 315 320
Met Lys Cys Asp His Cys Gly Glu Thr Ser Trp Gln Thr Gly Asp Phe
325 330 335
Val Lys Ala Thr Cys Glu Phe Cys Gly Thr Glu Asn Leu Thr Lys Glu
340 345 350
Gly Ala Thr Thr Cys Gly Tyr Leu Pro Gln Asn Ala Val Val Lys Ile
355 360 365
Tyr Cys Pro Ala Cys His Asn Ser Glu Val Gly Pro Glu His Ser Leu
370 375 380
Ala Glu Tyr His Asn Glu Ser Gly Leu Lys Thr Ile Leu Arg Lys Gly
385 390 395 400
Gly Arg Thr Ile Ala Phe Gly Gly Cys Val Phe Ser Tyr Val Gly Cys
405 410 415
His Asn Lys Cys Ala Tyr Trp Val Pro Arg Ala Ser Ala Asn Ile Gly
420 425 430
Cys Asn His Thr Gly Val Val Gly Glu Gly Ser Glu Gly Leu Asn Asp
435 440 445
Asn Leu Leu Glu Ile Leu Gln Lys Glu Lys Val Asn Ile Asn Ile Val
450 455 460
Gly Asp Phe Lys Leu Asn Glu Glu Ile Ala Ile Ile Leu Ala Ser Phe
465 470 475 480
Ser Ala Ser Thr Ser Ala Phe Val Glu Thr Val Lys Gly Leu Asp Tyr
485 490 495
Lys Ala Phe Lys Gln Ile Val Glu Ser Cys Gly Asn Phe Lys Val Thr
500 505 510
Lys Gly Lys Ala Lys Lys Gly Ala Trp Asn Ile Gly Glu Gln Lys Ser
515 520 525
Ile Leu Ser Pro Leu Tyr Ala Phe Ala Ser Glu Ala Ala Arg Val Val
530 535 540
Arg Ser Ile Phe Ser Arg Thr Leu Glu Thr Ala Gln Asn Ser Val Arg
545 550 555 560
Val Leu Gln Lys Ala Ala Ile Thr Ile Leu Asp Gly Ile Ser Gln Tyr
565 570 575
Ser Leu Arg Leu Ile Asp Ala Met Met Phe Thr Ser Asp Leu Ala Thr
580 585 590
Asn Asn Leu Val Val Met Ala Tyr Ile Thr Gly Gly Val Val Gln Leu
595 600 605
Thr Ser Gln Trp Leu Thr Asn Ile Phe Gly Thr Val Tyr Glu Lys Leu
610 615 620
Lys Pro Val Leu Asp Trp Leu Glu Glu Lys Phe Lys Glu Gly Val Glu
625 630 635 640
Phe Leu Arg Asp Gly Trp Glu Ile Val Lys Phe Ile Ser Thr Cys Ala
645 650 655
Cys Glu Ile Val Gly Gly Gln Ile Val Thr Cys Ala Lys Glu Ile Lys
660 665 670
Glu Ser Val Gln Thr Phe Phe Lys Leu Val Asn Lys Phe Leu Ala Leu
675 680 685
Cys Ala Asp Ser Ile Ile Ile Gly Gly Ala Lys Leu Lys Ala Leu Asn
690 695 700
Leu Gly Glu Thr Phe Val Thr His Ser Lys Gly Leu Tyr Arg Lys Cys
705 710 715 720
Val Lys Ser Arg Glu Glu Thr Gly Leu Leu Met Pro Leu Lys Ala Pro
725 730 735
Lys Glu Ile Ile Phe Leu Glu Gly Glu Thr Leu Pro Thr Glu Val Leu
740 745 750
Thr Glu Glu Val Val Leu Lys Thr Gly Asp Leu Gln Pro Leu Glu Gln
755 760 765
Pro Thr Ser Glu Ala Val Glu Ala Pro Leu Val Gly Thr Pro Val Cys
770 775 780
Ile Asn Gly Leu Met Leu Leu Glu Ile Lys Asp Thr Glu Lys Tyr Cys
785 790 795 800
Ala Leu Ala Pro Asn Met Met Val Thr Asn Asn Thr Phe Thr Leu Lys
805 810 815
Gly Gly Ala Pro Thr Lys Val Thr Phe Gly Asp Asp Thr Val Ile Glu
820 825 830
Val Gln Gly Tyr Lys Ser Val Asn Ile Thr Phe Glu Leu Asp Glu Arg
835 840 845
Ile Asp Lys Val Leu Asn Glu Lys Cys Ser Ala Tyr Thr Val Glu Leu
850 855 860
Gly Thr Glu Val Asn Glu Phe Ala Cys Val Val Ala Asp Ala Val Ile
865 870 875 880
Lys Thr Leu Gln Pro Val Ser Glu Leu Leu Thr Pro Leu Gly Ile Asp
885 890 895
Leu Asp Glu Trp Ser Met Ala Thr Tyr Tyr Leu Phe Asp Glu Ser Gly
900 905 910
Glu Phe Lys Leu Ala Ser His Met Tyr Cys Ser Phe Tyr Pro Pro Asp
915 920 925
Glu Asp Glu Glu Glu Gly Asp Cys Glu Glu Glu Glu Phe Glu Pro Ser
930 935 940
Thr Gln Tyr Glu Tyr Gly Thr Glu Asp Asp Tyr Gln Gly Lys Pro Leu
945 950 955 960
Glu Phe Gly Ala Thr Ser Ala Ala Leu Gln Pro Glu Glu Glu Gln Glu
965 970 975
Glu Asp Trp Leu Asp Asp Asp Ser Gln Gln Thr Val Gly Gln Gln Asp
980 985 990
Gly Ser Glu Asp Asn Gln Thr Thr Thr Ile Gln Thr Ile Val Glu Val
995 1000 1005
Gln Pro Gln Leu Glu Met Glu Leu Thr Pro Val Val Gln Thr Ile Glu
1010 1015 1020
Val Asn Ser Phe Ser Gly Tyr Leu Lys Leu Thr Asp Asn Val Tyr Ile
1025 1030 1035 1040
Lys Asn Ala Asp Ile Val Glu Glu Ala Lys Lys Val Lys Pro Thr Val
1045 1050 1055
Val Val Asn Ala Ala Asn Val Tyr Leu Lys His Gly Gly Gly Val Ala
1060 1065 1070
Gly Ala Leu Asn Lys Ala Thr Asn Asn Ala Met Gln Val Glu Ser Asp
1075 1080 1085
Asp Tyr Ile Ala Thr Asn Gly Pro Leu Lys Val Gly Gly Ser Cys Val
1090 1095 1100
Leu Ser Gly His Asn Leu Ala Lys His Cys Leu His Val Val Gly Pro
1105 1110 1115 1120
Asn Val Asn Lys Gly Glu Asp Ile Gln Leu Leu Lys Ser Ala Tyr Glu
1125 1130 1135
Asn Phe Asn Gln His Glu Val Leu Leu Ala Pro Leu Leu Ser Ala Gly
1140 1145 1150
Ile Phe Gly Ala Asp Pro Ile His Ser Leu Arg Val Cys Val Asp Thr
1155 1160 1165
Val Arg Thr Asn Val Tyr Leu Ala Val Phe Asp Lys Asn Leu Tyr Asp
1170 1175 1180
Lys Leu Val Ser Ser Phe Leu Glu Met Lys Ser Glu Lys Gln Val Glu
1185 1190 1195 1200
Gln Lys Ile Ala Glu Ile Pro Lys Glu Glu Val Lys Pro Phe Ile Thr
1205 1210 1215
Glu Ser Lys Pro Ser Val Glu Gln Arg Lys Gln Asp Asp Lys Lys Ile
1220 1225 1230
Lys Ala Cys Val Glu Glu Val Thr Thr Thr Leu Glu Glu Thr Lys Phe
1235 1240 1245
Leu Thr Glu Asn Leu Leu Leu Tyr Ile Asp Ile Asn Gly Asn Leu His
1250 1255 1260
Pro Asp Ser Ala Thr Leu Val Ser Asp Ile Asp Ile Thr Phe Leu Lys
1265 1270 1275 1280
Lys Asp Ala Pro Tyr Ile Val Gly Asp Val Val Gln Glu Gly Val Leu
1285 1290 1295
Thr Ala Val Val Ile Pro Thr Lys Lys Ala Gly Gly Thr Thr Glu Met
1300 1305 1310
Leu Ala Lys Ala Leu Arg Lys Val Pro Thr Asp Asn Tyr Ile Thr Thr
1315 1320 1325
Tyr Pro Gly Gln Gly Leu Asn Gly Tyr Thr Val Glu Glu Ala Lys Thr
1330 1335 1340
Val Leu Lys Lys Cys Lys Ser Ala Phe Tyr Ile Leu Pro Ser Ile Ile
1345 1350 1355 1360
Ser Asn Glu Lys Gln Glu Ile Leu Gly Thr Val Ser Trp Asn Leu Arg
1365 1370 1375
Glu Met Leu Ala His Ala Glu Glu Thr Arg Lys Leu Met Pro Val Cys
1380 1385 1390
Val Glu Thr Lys Ala Ile Val Ser Thr Ile Gln Arg Lys Tyr Lys Gly
1395 1400 1405
Ile Lys Ile Gln Glu Gly Val Val Asp Tyr Gly Ala Arg Phe Tyr Phe
1410 1415 1420
Tyr Thr Ser Lys Thr Thr Val Ala Ser Leu Ile Asn Thr Leu Asn Asp
1425 1430 1435 1440
Leu Asn Glu Thr Leu Val Thr Met Pro Leu Gly Tyr Val Thr His Gly
1445 1450 1455
Leu Asn Leu Glu Glu Ala Ala Arg Tyr Met Arg Ser Leu Lys Val Pro
1460 1465 1470
Ala Thr Val Ser Val Ser Ser Pro Asp Ala Val Thr Ala Tyr Asn Gly
1475 1480 1485
Tyr Leu Thr Ser Ser Ser Lys Thr Pro Glu Glu His Phe Ile Glu Thr
1490 1495 1500
Ile Ser Leu Ala Gly Ser Tyr Lys Asp Trp Ser Tyr Ser Gly Gln Ser
1505 1510 1515 1520
Thr Gln Leu Gly Ile Glu Phe Leu Lys Arg Gly Asp Lys Ser Val Tyr
1525 1530 1535
Tyr Thr Ser Asn Pro Thr Thr Phe His Leu Asp Gly Glu Val Ile Thr
1540 1545 1550
Phe Asp Asn Leu Lys Thr Leu Leu Ser Leu Arg Glu Val Arg Thr Ile
1555 1560 1565
Lys Val Phe Thr Thr Val Asp Asn Ile Asn Leu His Thr Gln Val Val
1570 1575 1580
Asp Met Ser Met Thr Tyr Gly Gln Gln Phe Gly Pro Thr Tyr Leu Asp
1585 1590 1595 1600
Gly Ala Asp Val Thr Lys Ile Lys Pro His Asn Ser His Glu Gly Lys
1605 1610 1615
Thr Phe Tyr Val Leu Pro Asn Asp Asp Thr Leu Arg Val Glu Ala Phe
1620 1625 1630
Glu Tyr Tyr His Thr Thr Asp Pro Ser Phe Leu Gly Arg Tyr Met Ser
1635 1640 1645
Ala Leu Asn His Thr Lys Lys Trp Lys Tyr Pro Gln Val Asn Gly Leu
1650 1655 1660
Thr Ser Ile Lys Trp Ala Asp Asn Asn Cys Tyr Leu Ala Thr Ala Leu
1665 1670 1675 1680
Leu Thr Leu Gln Gln Ile Glu Leu Lys Phe Asn Pro Pro Ala Leu Gln
1685 1690 1695
Asp Ala Tyr Tyr Arg Ala Arg Ala Gly Glu Ala Ala Asn Phe Cys Ala
1700 1705 1710
Leu Ile Leu Ala Tyr Cys Asn Lys Thr Val Gly Glu Leu Gly Asp Val
1715 1720 1725
Arg Glu Thr Met Ser Tyr Leu Phe Gln His Ala Asn Leu Asp Ser Cys
1730 1735 1740
Lys Arg Val Leu Asn Val Val Cys Lys Thr Cys Gly Gln Gln Gln Thr
1745 1750 1755 1760
Thr Leu Lys Gly Val Glu Ala Val Met Tyr Met Gly Thr Leu Ser Tyr
1765 1770 1775
Glu Gln Phe Lys Lys Gly Val Gln Ile Pro Cys Thr Cys Gly Lys Gln
1780 1785 1790
Ala Thr Lys Tyr Leu Val Gln Gln Glu Ser Pro Phe Val Met Met Ser
1795 1800 1805
Ala Pro Pro Ala Gln Tyr Glu Leu Lys His Gly Thr Phe Thr Cys Ala
1810 1815 1820
Ser Glu Tyr Thr Gly Asn Tyr Gln Cys Gly His Tyr Lys His Ile Thr
1825 1830 1835 1840
Ser Lys Glu Thr Leu Tyr Cys Ile Asp Gly Ala Leu Leu Thr Lys Ser
1845 1850 1855
Ser Glu Tyr Lys Gly Pro Ile Thr Asp Val Phe Tyr Lys Glu Asn Ser
1860 1865 1870
Tyr Thr Thr Thr Ile Lys Pro Val Thr Tyr Lys Leu Asp Gly Val Val
1875 1880 1885
Cys Thr Glu Ile Asp Pro Lys Leu Asp Asn Tyr Tyr Lys Lys Asp Asn
1890 1895 1900
Ser Tyr Phe Thr Glu Gln Pro Ile Asp Leu Val Pro Asn Gln Pro Tyr
1905 1910 1915 1920
Pro Asn Ala Ser Phe Asp Asn Phe Lys Phe Val Cys Asp Asn Ile Lys
1925 1930 1935
Phe Ala Asp Asp Leu Asn Gln Leu Thr Gly Tyr Lys Lys Pro Ala Ser
1940 1945 1950
Arg Glu Leu Lys Val Thr Phe Phe Pro Asp Leu Asn Gly Asp Val Val
1955 1960 1965
Ala Ile Asp Tyr Lys His Tyr Thr Pro Ser Phe Lys Lys Gly Ala Lys
1970 1975 1980
Leu Leu His Lys Pro Ile Val Trp His Val Asn Asn Ala Thr Asn Lys
1985 1990 1995 2000
Ala Thr Tyr Lys Pro Asn Thr Trp Cys Ile Arg Cys Leu Trp Ser Thr
2005 2010 2015
Lys Pro Val Glu Thr Ser Asn Ser Phe Asp Val Leu Lys Ser Glu Asp
2020 2025 2030
Ala Gln Gly Met Asp Asn Leu Ala Cys Glu Asp Leu Lys Pro Val Ser
2035 2040 2045
Glu Glu Val Val Glu Asn Pro Thr Ile Gln Lys Asp Val Leu Glu Cys
2050 2055 2060
Asn Val Lys Thr Thr Glu Val Val Gly Asp Ile Ile Leu Lys Pro Ala
2065 2070 2075 2080
Asn Asn Ser Leu Lys Ile Thr Glu Glu Val Gly His Thr Asp Leu Met
2085 2090 2095
Ala Ala Tyr Val Asp Asn Ser Ser Leu Thr Ile Lys Lys Pro Asn Glu
2100 2105 2110
Leu Ser Arg Val Leu Gly Leu Lys Thr Leu Ala Thr His Gly Leu Ala
2115 2120 2125
Ala Val Asn Ser Val Pro Trp Asp Thr Ile Ala Asn Tyr Ala Lys Pro
2130 2135 2140
Phe Leu Asn Lys Val Val Ser Thr Thr Thr Asn Ile Val Thr Arg Cys
2145 2150 2155 2160
Leu Asn Arg Val Cys Thr Asn Tyr Met Pro Tyr Phe Phe Thr Leu Leu
2165 2170 2175
Leu Gln Leu Cys Thr Phe Thr Arg Ser Thr Asn Ser Arg Ile Lys Ala
2180 2185 2190
Ser Met Pro Thr Thr Ile Ala Lys Asn Thr Val Lys Ser Val Gly Lys
2195 2200 2205
Phe Cys Leu Glu Ala Ser Phe Asn Tyr Leu Lys Ser Pro Asn Phe Ser
2210 2215 2220
Lys Leu Ile Asn Ile Ile Ile Trp Phe Leu Leu Leu Ser Val Cys Leu
2225 2230 2235 2240
Gly Ser Leu Ile Tyr Ser Thr Ala Ala Leu Gly Val Leu Met Ser Asn
2245 2250 2255
Leu Gly Met Pro Ser Tyr Cys Thr Gly Tyr Arg Glu Gly Tyr Leu Asn
2260 2265 2270
Ser Thr Asn Val Thr Ile Ala Thr Tyr Cys Thr Gly Ser Ile Pro Cys
2275 2280 2285
Ser Val Cys Leu Ser Gly Leu Asp Ser Leu Asp Thr Tyr Pro Ser Leu
2290 2295 2300
Glu Thr Ile Gln Ile Thr Ile Ser Ser Phe Lys Trp Asp Leu Thr Ala
2305 2310 2315 2320
Phe Gly Leu Val Ala Glu Trp Phe Leu Ala Tyr Ile Leu Phe Thr Arg
2325 2330 2335
Phe Phe Tyr Val Leu Gly Leu Ala Ala Ile Met Gln Leu Phe Phe Ser
2340 2345 2350
Tyr Phe Ala Val His Phe Ile Ser Asn Ser Trp Leu Met Trp Leu Ile
2355 2360 2365
Ile Asn Leu Val Gln Met Ala Pro Ile Ser Ala Met Val Arg Met Tyr
2370 2375 2380
Ile Phe Phe Ala Ser Phe Tyr Tyr Val Trp Lys Ser Tyr Val His Val
2385 2390 2395 2400
Val Asp Gly Cys Asn Ser Ser Thr Cys Met Met Cys Tyr Lys Arg Asn
2405 2410 2415
Arg Ala Thr Arg Val Glu Cys Thr Thr Ile Val Asn Gly Val Arg Arg
2420 2425 2430
Ser Phe Tyr Val Tyr Ala Asn Gly Gly Lys Gly Phe Cys Lys Leu His
2435 2440 2445
Asn Trp Asn Cys Val Asn Cys Asp Thr Phe Cys Ala Gly Ser Thr Phe
2450 2455 2460
Ile Ser Asp Glu Val Ala Arg Asp Leu Ser Leu Gln Phe Lys Arg Pro
2465 2470 2475 2480
Ile Asn Pro Thr Asp Gln Ser Ser Tyr Ile Val Asp Ser Val Thr Val
2485 2490 2495
Lys Asn Gly Ser Ile His Leu Tyr Phe Asp Lys Ala Gly Gln Lys Thr
2500 2505 2510
Tyr Glu Arg His Ser Leu Ser His Phe Val Asn Leu Asp Asn Leu Arg
2515 2520 2525
Ala Asn Asn Thr Lys Gly Ser Leu Pro Ile Asn Val Ile Val Phe Asp
2530 2535 2540
Gly Lys Ser Lys Cys Glu Glu Ser Ser Ala Lys Ser Ala Ser Val Tyr
2545 2550 2555 2560
Tyr Ser Gln Leu Met Cys Gln Pro Ile Leu Leu Leu Asp Gln Ala Leu
2565 2570 2575
Val Ser Asp Val Gly Asp Ser Ala Glu Val Ala Val Lys Met Phe Asp
2580 2585 2590
Ala Tyr Val Asn Thr Phe Ser Ser Thr Phe Asn Val Pro Met Glu Lys
2595 2600 2605
Leu Lys Thr Leu Val Ala Thr Ala Glu Ala Glu Leu Ala Lys Asn Val
2610 2615 2620
Ser Leu Asp Asn Val Leu Ser Thr Phe Ile Ser Ala Ala Arg Gln Gly
2625 2630 2635 2640
Phe Val Asp Ser Asp Val Glu Thr Lys Asp Val Val Glu Cys Leu Lys
2645 2650 2655
Leu Ser His Gln Ser Asp Ile Glu Val Thr Gly Asp Ser Cys Asn Asn
2660 2665 2670
Tyr Met Leu Thr Tyr Asn Lys Val Glu Asn Met Thr Pro Arg Asp Leu
2675 2680 2685
Gly Ala Cys Ile Asp Cys Ser Ala Arg His Ile Asn Ala Gln Val Ala
2690 2695 2700
Lys Ser His Asn Ile Ala Leu Ile Trp Asn Val Lys Asp Phe Met Ser
2705 2710 2715 2720
Leu Ser Glu Gln Leu Arg Lys Gln Ile Arg Ser Ala Ala Lys Lys Asn
2725 2730 2735
Asn Leu Pro Phe Lys Leu Thr Cys Ala Thr Thr Arg Gln Val Val Asn
2740 2745 2750
Val Val Thr Thr Lys Ile Ala Leu Lys Gly Gly Lys Ile Val Asn Asn
2755 2760 2765
Trp Leu Lys Gln Leu Ile Lys Val Thr Leu Val Phe Leu Phe Val Ala
2770 2775 2780
Ala Ile Phe Tyr Leu Ile Thr Pro Val His Val Met Ser Lys His Thr
2785 2790 2795 2800
Asp Phe Ser Ser Glu Ile Ile Gly Tyr Lys Ala Ile Asp Gly Gly Val
2805 2810 2815
Thr Arg Asp Ile Ala Ser Thr Asp Thr Cys Phe Ala Asn Lys His Ala
2820 2825 2830
Asp Phe Asp Thr Trp Phe Ser Gln Arg Gly Gly Ser Tyr Thr Asn Asp
2835 2840 2845
Lys Ala Cys Pro Leu Ile Ala Ala Val Ile Thr Arg Glu Val Gly Phe
2850 2855 2860
Val Val Pro Gly Leu Pro Gly Thr Ile Leu Arg Thr Thr Asn Gly Asp
2865 2870 2875 2880
Phe Leu His Phe Leu Pro Arg Val Phe Ser Ala Val Gly Asn Ile Cys
2885 2890 2895
Tyr Thr Pro Ser Lys Leu Ile Glu Tyr Thr Asp Phe Ala Thr Ser Ala
2900 2905 2910
Cys Val Leu Ala Ala Glu Cys Thr Ile Phe Lys Asp Ala Ser Gly Lys
2915 2920 2925
Pro Val Pro Tyr Cys Tyr Asp Thr Asn Val Leu Glu Gly Ser Val Ala
2930 2935 2940
Tyr Glu Ser Leu Arg Pro Asp Thr Arg Tyr Val Leu Met Asp Gly Ser
2945 2950 2955 2960
Ile Ile Gln Phe Pro Asn Thr Tyr Leu Glu Gly Ser Val Arg Val Val
2965 2970 2975
Thr Thr Phe Asp Ser Glu Tyr Cys Arg His Gly Thr Cys Glu Arg Ser
2980 2985 2990
Glu Ala Gly Val Cys Val Ser Thr Ser Gly Arg Trp Val Leu Asn Asn
2995 3000 3005
Asp Tyr Tyr Arg Ser Leu Pro Gly Val Phe Cys Gly Val Asp Ala Val
3010 3015 3020
Asn Leu Leu Thr Asn Met Phe Thr Pro Leu Ile Gln Pro Ile Gly Ala
3025 3030 3035 3040
Leu Asp Ile Ser Ala Ser Ile Val Ala Gly Gly Ile Val Ala Ile Val
3045 3050 3055
Val Thr Cys Leu Ala Tyr Tyr Phe Met Arg Phe Arg Arg Ala Phe Gly
3060 3065 3070
Glu Tyr Ser His Val Val Ala Phe Asn Thr Leu Leu Phe Leu Met Ser
3075 3080 3085
Phe Thr Val Leu Cys Leu Thr Pro Val Tyr Ser Phe Leu Pro Gly Val
3090 3095 3100
Tyr Ser Val Ile Tyr Leu Tyr Leu Thr Phe Tyr Leu Thr Asn Asp Val
3105 3110 3115 3120
Ser Phe Leu Ala His Ile Gln Trp Met Val Met Phe Thr Pro Leu Val
3125 3130 3135
Pro Phe Trp Ile Thr Ile Ala Tyr Ile Ile Cys Ile Ser Thr Lys His
3140 3145 3150
Phe Tyr Trp Phe Phe Ser Asn Tyr Leu Lys Arg Arg Val Val Phe Asn
3155 3160 3165
Gly Val Ser Phe Ser Thr Phe Glu Glu Ala Ala Leu Cys Thr Phe Leu
3170 3175 3180
Leu Asn Lys Glu Met Tyr Leu Lys Leu Arg Ser Asp Val Leu Leu Pro
3185 3190 3195 3200
Leu Thr Gln Tyr Asn Arg Tyr Leu Ala Leu Tyr Asn Lys Tyr Lys Tyr
3205 3210 3215
Phe Ser Gly Ala Met Asp Thr Thr Ser Tyr Arg Glu Ala Ala Cys Cys
3220 3225 3230
His Leu Ala Lys Ala Leu Asn Asp Phe Ser Asn Ser Gly Ser Asp Val
3235 3240 3245
Leu Tyr Gln Pro Pro Gln Thr Ser Ile Thr Ser Ala Val Leu Gln Ser
3250 3255 3260
Gly Phe Arg Lys Met Ala Phe Pro Ser Gly Lys Val Glu Gly Cys Met
3265 3270 3275 3280
Val Gln Val Thr Cys Gly Thr Thr Thr Leu Asn Gly Leu Trp Leu Asp
3285 3290 3295
Asp Val Val Tyr Cys Pro Arg His Val Ile Cys Thr Ser Glu Asp Met
3300 3305 3310
Leu Asn Pro Asn Tyr Glu Asp Leu Leu Ile Arg Lys Ser Asn His Asn
3315 3320 3325
Phe Leu Val Gln Ala Gly Asn Val Gln Leu Arg Val Ile Gly His Ser
3330 3335 3340
Met Gln Asn Cys Val Leu Lys Leu Lys Val Asp Thr Ala Asn Pro Lys
3345 3350 3355 3360
Thr Pro Lys Tyr Lys Phe Val Arg Ile Gln Pro Gly Gln Thr Phe Ser
3365 3370 3375
Val Leu Ala Cys Tyr Asn Gly Ser Pro Ser Gly Val Tyr Gln Cys Ala
3380 3385 3390
Met Arg Pro Asn Phe Thr Ile Lys Gly Ser Phe Leu Asn Gly Ser Cys
3395 3400 3405
Gly Ser Val Gly Phe Asn Ile Asp Tyr Asp Cys Val Ser Phe Cys Tyr
3410 3415 3420
Met His His Met Glu Leu Pro Thr Gly Val His Ala Gly Thr Asp Leu
3425 3430 3435 3440
Glu Gly Asn Phe Tyr Gly Pro Phe Val Asp Arg Gln Thr Ala Gln Ala
3445 3450 3455
Ala Gly Thr Asp Thr Thr Ile Thr Val Asn Val Leu Ala Trp Leu Tyr
3460 3465 3470
Ala Ala Val Ile Asn Gly Asp Arg Trp Phe Leu Asn Arg Phe Thr Thr
3475 3480 3485
Thr Leu Asn Asp Phe Asn Leu Val Ala Met Lys Tyr Asn Tyr Glu Pro
3490 3495 3500
Leu Thr Gln Asp His Val Asp Ile Leu Gly Pro Leu Ser Ala Gln Thr
3505 3510 3515 3520
Gly Ile Ala Val Leu Asp Met Cys Ala Ser Leu Lys Glu Leu Leu Gln
3525 3530 3535
Asn Gly Met Asn Gly Arg Thr Ile Leu Gly Ser Ala Leu Leu Glu Asp
3540 3545 3550
Glu Phe Thr Pro Phe Asp Val Val Arg Gln Cys Ser Gly Val Thr Phe
3555 3560 3565
Gln Ser Ala Val Lys Arg Thr Ile Lys Gly Thr His His Trp Leu Leu
3570 3575 3580
Leu Thr Ile Leu Thr Ser Leu Leu Val Leu Val Gln Ser Thr Gln Trp
3585 3590 3595 3600
Ser Leu Phe Phe Phe Leu Tyr Glu Asn Ala Phe Leu Pro Phe Ala Met
3605 3610 3615
Gly Ile Ile Ala Met Ser Ala Phe Ala Met Met Phe Val Lys His Lys
3620 3625 3630
His Ala Phe Leu Cys Leu Phe Leu Leu Pro Ser Leu Ala Thr Val Ala
3635 3640 3645
Tyr Phe Asn Met Val Tyr Met Pro Ala Ser Trp Val Met Arg Ile Met
3650 3655 3660
Thr Trp Leu Asp Met Val Asp Thr Ser Leu Ser Gly Phe Lys Leu Lys
3665 3670 3675 3680
Asp Cys Val Met Tyr Ala Ser Ala Val Val Leu Leu Ile Leu Met Thr
3685 3690 3695
Ala Arg Thr Val Tyr Asp Asp Gly Ala Arg Arg Val Trp Thr Leu Met
3700 3705 3710
Asn Val Leu Thr Leu Val Tyr Lys Val Tyr Tyr Gly Asn Ala Leu Asp
3715 3720 3725
Gln Ala Ile Ser Met Trp Ala Leu Ile Ile Ser Val Thr Ser Asn Tyr
3730 3735 3740
Ser Gly Val Val Thr Thr Val Met Phe Leu Ala Arg Gly Ile Val Phe
3745 3750 3755 3760
Met Cys Val Glu Tyr Cys Pro Ile Phe Phe Ile Thr Gly Asn Thr Leu
3765 3770 3775
Gln Cys Ile Met Leu Val Tyr Cys Phe Leu Gly Tyr Phe Cys Thr Cys
3780 3785 3790
Tyr Phe Gly Leu Phe Cys Leu Leu Asn Arg Tyr Phe Arg Leu Thr Leu
3795 3800 3805
Gly Val Tyr Asp Tyr Leu Val Ser Thr Gln Glu Phe Arg Tyr Met Asn
3810 3815 3820
Ser Gln Gly Leu Leu Pro Pro Lys Asn Ser Ile Asp Ala Phe Lys Leu
3825 3830 3835 3840
Asn Ile Lys Leu Leu Gly Val Gly Gly Lys Pro Cys Ile Lys Val Ala
3845 3850 3855
Thr Val Gln Ser Lys Met Ser Asp Val Lys Cys Thr Ser Val Val Leu
3860 3865 3870
Leu Ser Val Leu Gln Gln Leu Arg Val Glu Ser Ser Ser Lys Leu Trp
3875 3880 3885
Ala Gln Cys Val Gln Leu His Asn Asp Ile Leu Leu Ala Lys Asp Thr
3890 3895 3900
Thr Glu Ala Phe Glu Lys Met Val Ser Leu Leu Ser Val Leu Leu Ser
3905 3910 3915 3920
Met Gln Gly Ala Val Asp Ile Asn Lys Leu Cys Glu Glu Met Leu Asp
3925 3930 3935
Asn Arg Ala Thr Leu Gln Ala Ile Ala Ser Glu Phe Ser Ser Leu Pro
3940 3945 3950
Ser Tyr Ala Ala Phe Ala Thr Ala Gln Glu Ala Tyr Glu Gln Ala Val
3955 3960 3965
Ala Asn Gly Asp Ser Glu Val Val Leu Lys Lys Leu Lys Lys Ser Leu
3970 3975 3980
Asn Val Ala Lys Ser Glu Phe Asp Arg Asp Ala Ala Met Gln Arg Lys
3985 3990 3995 4000
Leu Glu Lys Met Ala Asp Gln Ala Met Thr Gln Met Tyr Lys Gln Ala
4005 4010 4015
Arg Ser Glu Asp Lys Arg Ala Lys Val Thr Ser Ala Met Gln Thr Met
4020 4025 4030
Leu Phe Thr Met Leu Arg Lys Leu Asp Asn Asp Ala Leu Asn Asn Ile
4035 4040 4045
Ile Asn Asn Ala Arg Asp Gly Cys Val Pro Leu Asn Ile Ile Pro Leu
4050 4055 4060
Thr Thr Ala Ala Lys Leu Met Val Val Ile Pro Asp Tyr Asn Thr Tyr
4065 4070 4075 4080
Lys Asn Thr Cys Asp Gly Thr Thr Phe Thr Tyr Ala Ser Ala Leu Trp
4085 4090 4095
Glu Ile Gln Gln Val Val Asp Ala Asp Ser Lys Ile Val Gln Leu Ser
4100 4105 4110
Glu Ile Ser Met Asp Asn Ser Pro Asn Leu Ala Trp Pro Leu Ile Val
4115 4120 4125
Thr Ala Leu Arg Ala Asn Ser Ala Val Lys Leu Gln Asn Asn Glu Leu
4130 4135 4140
Ser Pro Val Ala Leu Arg Gln Met Ser Cys Ala Ala Gly Thr Thr Gln
4145 4150 4155 4160
Thr Ala Cys Thr Asp Asp Asn Ala Leu Ala Tyr Tyr Asn Thr Thr Lys
4165 4170 4175
Gly Gly Arg Phe Val Leu Ala Leu Leu Ser Asp Leu Gln Asp Leu Lys
4180 4185 4190
Trp Ala Arg Phe Pro Lys Ser Asp Gly Thr Gly Thr Ile Tyr Thr Glu
4195 4200 4205
Leu Glu Pro Pro Cys Arg Phe Val Thr Asp Thr Pro Lys Gly Pro Lys
4210 4215 4220
Val Lys Tyr Leu Tyr Phe Ile Lys Gly Leu Asn Asn Leu Asn Arg Gly
4225 4230 4235 4240
Met Val Leu Gly Ser Leu Ala Ala Thr Val Arg Leu Gln Ala Gly Asn
4245 4250 4255
Ala Thr Glu Val Pro Ala Asn Ser Thr Val Leu Ser Phe Cys Ala Phe
4260 4265 4270
Ala Val Asp Ala Ala Lys Ala Tyr Lys Asp Tyr Leu Ala Ser Gly Gly
4275 4280 4285
Gln Pro Ile Thr Asn Cys Val Lys Met Leu Cys Thr His Thr Gly Thr
4290 4295 4300
Gly Gln Ala Ile Thr Val Thr Pro Glu Ala Asn Met Asp Gln Glu Ser
4305 4310 4315 4320
Phe Gly Gly Ala Ser Cys Cys Leu Tyr Cys Arg Cys His Ile Asp His
4325 4330 4335
Pro Asn Pro Lys Gly Phe Cys Asp Leu Lys Gly Lys Tyr Val Gln Ile
4340 4345 4350
Pro Thr Thr Cys Ala Asn Asp Pro Val Gly Phe Thr Leu Lys Asn Thr
4355 4360 4365
Val Cys Thr Val Cys Gly Met Trp Lys Gly Tyr Gly Cys Ser Cys Asp
4370 4375 4380
Gln Leu Arg Glu Pro Met Leu Gln Ser Ala Asp Ala Gln Ser Phe Leu
4385 4390 4395 4400
Asn Gly Phe Ala Val
4405
<210> 17
<211> 1273
<212> PRT
<213> SARS-CoV2
<220>
<221> MISC_FEATURE
<222> (320)
<223> Xaa can be any naturally occurring amino acid
<220>
<221> MISC_FEATURE
<222> (709)
<223> Xaa can be any naturally occurring amino acid
<400> 17
Met Phe Val Phe Leu Val Leu Leu Pro Leu Val Ser Ser Gln Cys Val
1 5 10 15
Asn Leu Thr Thr Arg Thr Gln Leu Pro Pro Ala Tyr Thr Asn Ser Phe
20 25 30
Thr Arg Gly Val Tyr Tyr Pro Asp Lys Val Phe Arg Ser Ser Val Leu
35 40 45
His Ser Thr Gln Asp Leu Phe Leu Pro Phe Phe Ser Asn Val Thr Trp
50 55 60
Phe His Ala Ile His Val Ser Gly Thr Asn Gly Thr Lys Arg Phe Asp
65 70 75 80
Asn Pro Val Leu Pro Phe Asn Asp Gly Val Tyr Phe Ala Ser Thr Glu
85 90 95
Lys Ser Asn Ile Ile Arg Gly Trp Ile Phe Gly Thr Thr Leu Asp Ser
100 105 110
Lys Thr Gln Ser Leu Leu Ile Val Asn Asn Ala Thr Asn Val Val Ile
115 120 125
Lys Val Cys Glu Phe Gln Phe Cys Asn Asp Pro Phe Leu Gly Val Tyr
130 135 140
Tyr His Lys Asn Asn Lys Ser Trp Met Glu Ser Glu Phe Arg Val Tyr
145 150 155 160
Ser Ser Ala Asn Asn Cys Thr Phe Glu Tyr Val Ser Gln Pro Phe Leu
165 170 175
Met Asp Leu Glu Gly Lys Gln Gly Asn Phe Lys Asn Leu Arg Glu Phe
180 185 190
Val Phe Lys Asn Ile Asp Gly Tyr Phe Lys Ile Tyr Ser Lys His Thr
195 200 205
Pro Ile Asn Leu Val Arg Asp Leu Pro Gln Gly Phe Ser Ala Leu Glu
210 215 220
Pro Leu Val Asp Leu Pro Ile Gly Ile Asn Ile Thr Arg Phe Gln Thr
225 230 235 240
Leu Leu Ala Leu His Arg Ser Tyr Leu Thr Pro Gly Asp Ser Ser Ser
245 250 255
Gly Trp Thr Ala Gly Ala Ala Ala Tyr Tyr Val Gly Tyr Leu Gln Pro
260 265 270
Arg Thr Phe Leu Leu Lys Tyr Asn Glu Asn Gly Thr Ile Thr Asp Ala
275 280 285
Val Asp Cys Ala Leu Asp Pro Leu Ser Glu Thr Lys Cys Thr Leu Lys
290 295 300
Ser Phe Thr Val Glu Lys Gly Ile Tyr Gln Thr Ser Asn Phe Arg Xaa
305 310 315 320
Gln Pro Thr Glu Ser Ile Val Arg Phe Pro Asn Ile Thr Asn Leu Cys
325 330 335
Pro Phe Gly Glu Val Phe Asn Ala Thr Arg Phe Ala Ser Val Tyr Ala
340 345 350
Trp Asn Arg Lys Arg Ile Ser Asn Cys Val Ala Asp Tyr Ser Val Leu
355 360 365
Tyr Asn Ser Ala Ser Phe Ser Thr Phe Lys Cys Tyr Gly Val Ser Pro
370 375 380
Thr Lys Leu Asn Asp Leu Cys Phe Thr Asn Val Tyr Ala Asp Ser Phe
385 390 395 400
Val Ile Arg Gly Asp Glu Val Arg Gln Ile Ala Pro Gly Gln Thr Gly
405 410 415
Lys Ile Ala Asp Tyr Asn Tyr Lys Leu Pro Asp Asp Phe Thr Gly Cys
420 425 430
Val Ile Ala Trp Asn Ser Asn Asn Leu Asp Ser Lys Val Gly Gly Asn
435 440 445
Tyr Asn Tyr Leu Tyr Arg Leu Phe Arg Lys Ser Asn Leu Lys Pro Phe
450 455 460
Glu Arg Asp Ile Ser Thr Glu Ile Tyr Gln Ala Gly Ser Thr Pro Cys
465 470 475 480
Asn Gly Val Glu Gly Phe Asn Cys Tyr Phe Pro Leu Gln Ser Tyr Gly
485 490 495
Phe Gln Pro Thr Asn Gly Val Gly Tyr Gln Pro Tyr Arg Val Val Val
500 505 510
Leu Ser Phe Glu Leu Leu His Ala Pro Ala Thr Val Cys Gly Pro Lys
515 520 525
Lys Ser Thr Asn Leu Val Lys Asn Lys Cys Val Asn Phe Asn Phe Asn
530 535 540
Gly Leu Thr Gly Thr Gly Val Leu Thr Glu Ser Asn Lys Lys Phe Leu
545 550 555 560
Pro Phe Gln Gln Phe Gly Arg Asp Ile Ala Asp Thr Thr Asp Ala Val
565 570 575
Arg Asp Pro Gln Thr Leu Glu Ile Leu Asp Ile Thr Pro Cys Ser Phe
580 585 590
Gly Gly Val Ser Val Ile Thr Pro Gly Thr Asn Thr Ser Asn Gln Val
595 600 605
Ala Val Leu Tyr Gln Gly Val Asn Cys Thr Glu Val Pro Val Ala Ile
610 615 620
His Ala Asp Gln Leu Thr Pro Thr Trp Arg Val Tyr Ser Thr Gly Ser
625 630 635 640
Asn Val Phe Gln Thr Arg Ala Gly Cys Leu Ile Gly Ala Glu His Val
645 650 655
Asn Asn Ser Tyr Glu Cys Asp Ile Pro Ile Gly Ala Gly Ile Cys Ala
660 665 670
Ser Tyr Gln Thr Gln Thr Asn Ser Pro Arg Arg Ala Arg Ser Val Ala
675 680 685
Ser Gln Ser Ile Ile Ala Tyr Thr Met Ser Leu Gly Ala Glu Asn Ser
690 695 700
Val Ala Tyr Ser Xaa Asn Ser Ile Ala Ile Pro Thr Asn Phe Thr Ile
705 710 715 720
Ser Val Thr Thr Glu Ile Leu Pro Val Ser Met Thr Lys Thr Ser Val
725 730 735
Asp Cys Thr Met Tyr Ile Cys Gly Asp Ser Thr Glu Cys Ser Asn Leu
740 745 750
Leu Leu Gln Tyr Gly Ser Phe Cys Thr Gln Leu Asn Arg Ala Leu Thr
755 760 765
Gly Ile Ala Val Glu Gln Asp Lys Asn Thr Gln Glu Val Phe Ala Gln
770 775 780
Val Lys Gln Ile Tyr Lys Thr Pro Pro Ile Lys Asp Phe Gly Gly Phe
785 790 795 800
Asn Phe Ser Gln Ile Leu Pro Asp Pro Ser Lys Pro Ser Lys Arg Ser
805 810 815
Phe Ile Glu Asp Leu Leu Phe Asn Lys Val Thr Leu Ala Asp Ala Gly
820 825 830
Phe Ile Lys Gln Tyr Gly Asp Cys Leu Gly Asp Ile Ala Ala Arg Asp
835 840 845
Leu Ile Cys Ala Gln Lys Phe Asn Gly Leu Thr Val Leu Pro Pro Leu
850 855 860
Leu Thr Asp Glu Met Ile Ala Gln Tyr Thr Ser Ala Leu Leu Ala Gly
865 870 875 880
Thr Ile Thr Ser Gly Trp Thr Phe Gly Ala Gly Ala Ala Leu Gln Ile
885 890 895
Pro Phe Ala Met Gln Met Ala Tyr Arg Phe Asn Gly Ile Gly Val Thr
900 905 910
Gln Asn Val Leu Tyr Glu Asn Gln Lys Leu Ile Ala Asn Gln Phe Asn
915 920 925
Ser Ala Ile Gly Lys Ile Gln Asp Ser Leu Ser Ser Thr Ala Ser Ala
930 935 940
Leu Gly Lys Leu Gln Asp Val Val Asn Gln Asn Ala Gln Ala Leu Asn
945 950 955 960
Thr Leu Val Lys Gln Leu Ser Ser Asn Phe Gly Ala Ile Ser Ser Val
965 970 975
Leu Asn Asp Ile Leu Ser Arg Leu Asp Lys Val Glu Ala Glu Val Gln
980 985 990
Ile Asp Arg Leu Ile Thr Gly Arg Leu Gln Ser Leu Gln Thr Tyr Val
995 1000 1005
Thr Gln Gln Leu Ile Arg Ala Ala Glu Ile Arg Val Ser Ala Asn Leu
1010 1015 1020
Ala Ala Thr Lys Met Ser Glu Cys Val Leu Gly Gln Ser Lys Arg Val
1025 1030 1035 1040
Asp Phe Cys Gly Lys Gly Tyr His Leu Met Ser Phe Pro Gln Ser Ala
1045 1050 1055
Pro His Gly Val Val Phe Leu His Val Thr Tyr Val Pro Ala Gln Glu
1060 1065 1070
Lys Asn Phe Thr Thr Ala Pro Ala Ile Cys His Asp Gly Lys Ala His
1075 1080 1085
Phe Pro Arg Glu Gly Val Phe Val Ser Asn Gly Thr His Trp Phe Val
1090 1095 1100
Thr Gln Arg Asn Phe Tyr Glu Pro Gln Ile Ile Thr Thr Asp Asn Thr
1105 1110 1115 1120
Phe Val Ser Gly Asn Cys Asp Val Val Ile Gly Ile Val Asn Asn Thr
1125 1130 1135
Val Tyr Asp Pro Leu Gln Pro Glu Leu Asp Ser Phe Lys Glu Glu Leu
1140 1145 1150
Asp Lys Tyr Phe Lys Asn His Thr Ser Pro Asp Val Asp Leu Gly Asp
1155 1160 1165
Ile Ser Gly Ile Asn Ala Ser Val Val Asn Ile Gln Lys Glu Ile Asp
1170 1175 1180
Arg Leu Asn Glu Val Ala Lys Asn Leu Asn Glu Ser Leu Ile Asp Leu
1185 1190 1195 1200
Gln Glu Leu Gly Lys Tyr Glu Gln Tyr Ile Lys Trp Pro Trp Tyr Ile
1205 1210 1215
Trp Leu Gly Phe Ile Ala Gly Leu Ile Ala Ile Val Met Val Thr Ile
1220 1225 1230
Met Leu Cys Cys Met Thr Ser Cys Cys Ser Cys Leu Lys Gly Cys Cys
1235 1240 1245
Ser Cys Gly Ser Cys Cys Lys Phe Asp Glu Asp Asp Ser Glu Pro Val
1250 1255 1260
Leu Lys Gly Val Lys Leu His Tyr Thr
1265 1270
<210> 18
<211> 29897
<212> DNA
<213> SARS-CoV2
<220>
<221> misc_feature
<222> (11288)..(11293)
<223> n is a, c, g, or t
<220>
<221> misc_feature
<222> (22278)..(22283)
<223> n is a, c, g, or t
<220>
<221> misc_feature
<222> (29836)..(29864)
<223> n is a, c, g, or t
<400> 18
attaaaggtt tataccttcc caggtaacaa accaaccaac tttcgatctc ttgtagatct 60
gttctctaaa cgaactttaa aatctgtgtg gctgtcactc ggctgcatgc ttagtgcact 120
cacgcagtat aattaataac taattactgt cgttgacagg acacgagtaa ctcttctatc 180
ttctgcaggc tgcttacggt ttcgtccgtg ttgcagccga tcatcagcac atctaggttt 240
tgtccgggtg tgaccgaaag gtaagatgga gagccttgtc cctggtttca acgagaaaac 300
acacgtccaa ctcagtttgc ctgttttaca ggttcgcgac gtgctcgtac gtggctttgg 360
agactccgtg gaggaggtct tatcagaggc acgtcaacat cttaaagatg gcacttgtgg 420
cttagtagaa gttgaaaaag gcgttttgcc tcaacttgaa cagccctatg tgttcatcaa 480
acgttcggat gctcgaactg cacctcatgg tcatgttatg gttgagctgg tagcagaact 540
cgaaggcatt cagtacggtc gtagtggtga gacacttggt gtccttgtcc ctcatgtggg 600
cgaaatacca gtggcttacc gcaaggttct tcttcgtaag aacggtaata aaggagctgg 660
tggccatagt tacggcgccg atctaaagtc atttgactta ggcgacgagc ttggcactga 720
tccttatgaa gattttcaag aaaactggaa cactaaacat agcagtggtg ttacccgtga 780
actcatgcgt gagcttaacg gaggggcata cactcgctat gtcgataaca acttctgtgg 840
ccctgatggc taccctcttg agtgcattaa agaccttcta gcacgtgctg gtaaagcttc 900
atgcactttg tccgaacaac tggactttat tgacactaag aggggtgtat actgctgccg 960
tgaacatgag catgaaattg cttggtacac ggaacgttct gaaaagagct atgaattgca 1020
gacacctttt gaaattaaat tggcaaagaa atttgacatc ttcaatgggg aatgtccaaa 1080
ttttgtattt cccttaaatt ccataatcaa gactattcaa ccaagggttg aaaagaaaaa 1140
gcttgatggc tttatgggta gaattcgatc tgtctatcca gttgcgtcac caaatgaatg 1200
caaccaaatg tgcctttcaa ctctcatgaa gtgtgatcat tgtggtgaaa cttcatggca 1260
gacgggcgat tttgttaaag ccacttgcga attttgtggc actgagaatt tgactaaaga 1320
aggtgccact acttgtggtt acttacccca aaatgctgtt gttaaaattt attgtccagc 1380
atgtcacaat tcagaagtag gacctgagca tagtcttgcc gaataccata atgaatctgg 1440
cttgaaaacc attcttcgta agggtggtcg cactattgcc tttggaggct gtgtgttctc 1500
ttatgttggt tgccataaca agtgtgccta ttgggttcca cgtgctagcg ctaacatagg 1560
ttgtaaccat acaggtgttg ttggagaagg ttccgaaggt cttaatgaca accttcttga 1620
aatactccaa aaagagaaag tcaacatcaa tattgttggt gactttaaac ttaatgaaga 1680
gatcgccatt attttggcat ctttttctgc ttccacaagt gcttttgtgg aaactgtgaa 1740
aggtttggat tataaagcat tcaaacaaat tgttgaatcc tgtggtaatt ttaaagttac 1800
aaaaggaaaa gctaaaaaag gtgcctggaa tattggtgaa cagaaatcaa tactgagtcc 1860
tctttatgca tttgcatcag aggctgctcg tgttgtacga tcaattttct cccgcactct 1920
tgaaactgct caaaattctg tgcgtgtttt acagaaggcc gctataacaa tactagatgg 1980
aatttcacag tattcactga gactcattga tgctatgatg ttcacatctg atttggctac 2040
taacaatcta gttgtaatgg cctacattac aggtggtgtt gttcagttga cttcgcagtg 2100
gctaactaac atctttggca ctgtttatga aaaactcaaa cccgtccttg attggcttga 2160
agagaagttt aaggaaggtg tagagtttct tagagacggt tgggaaattg ttaaatttat 2220
ctcaacctgt gcttgtgaaa ttgtcggtgg acaaattgtc acctgtgcaa aggaaattaa 2280
ggagagtgtt cagacattct ttaagcttgt aaataaattt ttggctttgt gtgctgactc 2340
tatcattatt ggtggagcta aacttaaagc cttgaattta ggtgaaacat ttgtcacgca 2400
ctcaaaggga ttgtacagaa agtgtgttaa atccagagaa gaaactggcc tactcatgcc 2460
tctaaaagcc ccaaaagaaa ttatcttctt agagggagaa acacttccca cagaagtgtt 2520
aacagaggaa gttgtcttga aaactggtga tttacaacca ttagaacaac ctactagtga 2580
agctgttgaa gctccattgg ttggtacacc agtttgtatt aacgggctta tgttgctcga 2640
aatcaaagac acagaaaagt actgtgccct tgcacctaat atgatggtaa ctaacaatac 2700
cttcacactc aaaggcggtg caccaacaaa ggttactttt ggtgatgaca ctgtgataga 2760
agtgcaaggt tacaagagtg tgaatatcac ttttgaactt gatgaaagga ttgataaagt 2820
acttaatgag aagtgctctg cctatacagt tgaactcggt acagaagtaa atgagttcgc 2880
ctgtgttgtg gcagatgctg tcataaaaac tttgcaacca gtatctgaat tacttacacc 2940
actgggcatt gatttagatg agtggagtat ggctacatac tacttatttg atgagtctgg 3000
tgagtttaaa ttggcttcac atatgtattg ttctttttac cctccagatg aggatgaaga 3060
agaaggtgat tgtgaagaag aagagtttga gccatcaact caatatgagt atggtactga 3120
agatgattac caaggtaaac ctttggaatt tggtgccact tctgctgctc ttcaacctga 3180
agaagagcaa gaagaagatt ggttagatga tgatagtcaa caaactgttg gtcaacaaga 3240
cggcagtgag gacaatcaga caactactat tcaaacaatt gttgaggttc aacctcaatt 3300
agagatggaa cttacaccag ttgttcagac tattgaagtg aatagtttta gtggttattt 3360
aaaacttact gacaatgtat acattaaaaa tgcagacatt gtggaagaag ctaaaaaggt 3420
aaaaccaaca gtggttgtta atgcagccaa tgtttacctt aaacatggag gaggtgttgc 3480
aggagcctta aataaggcta ctaacaatgc catgcaagtt gaatctgatg attacatagc 3540
tactaatgga ccacttaaag tgggtggtag ttgtgtttta agcggacaca atcttgctaa 3600
acactgtctt catgttgtcg gcccaaatgt taacaaaggt gaagacattc aacttcttaa 3660
gagtgcttat gaaaatttta atcagcacga agttctactt gcaccattat tatcagctgg 3720
tatttttggt gctgacccta tacattcttt aagagtttgt gtagatactg ttcgcacaaa 3780
tgtctactta gctgtctttg ataaaaatct ctatgacaaa cttgtttcaa gctttttgga 3840
aatgaagagt gaaaagcaag ttgaacaaaa gatcgctgag attcctaaag aggaagttaa 3900
gccatttata actgaaagta aaccttcagt tgaacagaga aaacaagatg ataagaaaat 3960
caaagcttgt gttgaagaag ttacaacaac tctggaagaa actaagttcc tcacagaaaa 4020
cttgttactt tatattgaca ttaatggcaa tcttcatcca gattctgcca ctcttgttag 4080
tgacattgac atcactttct taaagaaaga tgctccatat atagtgggtg atgttgttca 4140
agagggtgtt ttaactgctg tggttatacc tactaaaaag gctggtggca ctactgaaat 4200
gttagcgaaa gctttgagaa aagtgccaac agacaattat ataaccactt acccgggtca 4260
gggtttaaat ggttacactg tagaggaggc aaagacagtg cttaaaaagt gtaaaagtgc 4320
cttttacatt ctaccatcta ttatctctaa tgagaagcaa gaaattcttg gaactgtttc 4380
ttggaatttg cgagaaatgc ttgcacatgc agaagaaaca cgcaaattaa tgcctgtctg 4440
tgtggaaact aaagccatag tttcaactat acagcgtaaa tataagggta ttaaaataca 4500
agagggtgtg gttgattatg gtgctagatt ttacttttac accagtaaaa caactgtagc 4560
gtcacttatc aacacactta acgatctaaa tgaaactctt gttacaatgc cacttggcta 4620
tgtaacacat ggcttaaatt tggaagaagc tgctcggtat atgagatctc tcaaagtgcc 4680
agctacagtt tctgtttctt cacctgatgc tgttacagcg tataatggtt atcttacttc 4740
ttcttctaaa acacctgaag aacattttat tgaaaccatc tcacttgctg gttcctataa 4800
agattggtcc tattctggac aatctacaca actaggtata gaatttctta agagaggtga 4860
taaaagtgta tattacacta gtaatcctac cacattccac ctagatggtg aagttatcac 4920
ctttgacaat cttaagacac ttctttcttt gagagaagtg aggactatta aggtgtttac 4980
aacagtagac aacattaacc tccacacgca agttgtggac atgtcaatga catatggaca 5040
acagtttggt ccaacttatt tggatggagc tgatgttact aaaataaaac ctcataattc 5100
acatgaaggt aaaacatttt atgttttacc taatgatgac actctacgtg ttgaggcttt 5160
tgagtactac cacacaactg atcctagttt tctgggtagg tacatgtcag cattaaatca 5220
cactaaaaat tggaaatacc cacaagttaa tggtttaact tctattaaat gggcagataa 5280
caactgttat cttgccactg cattgttaac actccaacaa atagagttga agtttaatcc 5340
acctgctcta caagatgctt attacagagc aagggctggt gaagctgcta acttttgtgc 5400
acttatctta gcctactgta ataagacagt aggtgagtta ggtgatgtta gagaaacaat 5460
gagttacttg tttcaacatg ccaatttaga ttcttgcaaa agagtcttga acgtggtgtg 5520
taaaacttgt ggacaacagc agacaaccct taagggtgta gaagctgtta tgtacatggg 5580
cacactttct tatgaacaat ttaagaaagg tgttcagata ccttgtacgt gtggtaaaca 5640
agctacaaaa tatctagtac aacaggagtc accttttgtt atgatgtcag caccacctgc 5700
tcagtatgaa cttaagcatg gtacatttac ttgtgctagt gagtacactg gtaattacca 5760
gtgtggtcac tataaacata taacttctaa agaaactttg tattgcatag acggtgcttt 5820
acttacaaag tcctcagaat acaaaggtcc tattacggat gttttctaca aagaaaacag 5880
ttacacaaca accataaaac cagttactta taaattggat ggtgttgttt gtacagaaat 5940
tgaccctaag ttggacaatt attataagaa agacaattct tatttcacag agcaaccaat 6000
tgatcttgta ccaaaccaac catatccaaa cgcaagcttc gataatttta agtttgtatg 6060
tgataatatc aaatttgctg atgatttaaa ccagttaact ggttacaaga aacctgcttc 6120
aagagagctt aaagttacat ttttccctga cttaaatggt gatgtggtgg ctattgatta 6180
taaacactac acaccctctt ttaagaaagg agctaaattg ttacataaac ctattgtttg 6240
gcatgttaac aatgcaacta ataaagccac gtataaacca aatacctggt gtatacgttg 6300
tctttggagc acaaaaccag ttgaaacatc aaattcgttt gatgtactga agtcagagga 6360
cgcgcaggga atggataatc ttgcctgcga agatctaaaa ccagtctctg aagaagtagt 6420
ggaaaatcct accatacaga aagacgttct tgagtgtaat gtgaaaacta ccgaagttgt 6480
aggagacatt atacttaaac cagcaaataa tagtttaaaa attacagaag aggttggcca 6540
cacagatcta atggctgctt atgtagacaa ttctagtctt actattaaga aacctaatga 6600
attatctaga gtattaggtt tgaaaaccct tgctactcat ggtttagctg ctgttaatag 6660
tgtcccttgg gatactatag ctaattatgc taagcctttt cttaacaaag ttgttagtac 6720
aactactaac atagttacac ggtgtttaaa ccgtgtttgt actaattata tgccttattt 6780
ctttacttta ttgctacaat tgtgtacttt tactagaagt acaaattcta gaattaaagc 6840
atctatgccg actactatag caaagaatac tgttaagagt gtcggtaaat tttgtctaga 6900
ggcttcattt aattatttga agtcacctaa tttttctaaa ctgataaata ttataatttg 6960
gtttttacta ttaagtgttt gcctaggttc tttaatmtac tcaaccgctg ctttaggtgt 7020
tttaatgtct aatttaggca tgccttctta ctgtactggt tacagagaag gctatttgaa 7080
ctctactaat gtcactattg caacctactg tactggttct ataccttgta gtgtttgtct 7140
tagtggttta gattctttag acacctatcc ttctttagaa actatacaaa ttaccatttc 7200
atcttttaaa tgggatttaa ctgcttttgg cttagttgca gagtggtttt tggcatatat 7260
tcttttcact aggtttttct atgtacttgg attggctgca atcatgcaat tgtttttcag 7320
ctattttgca gtacatttta ttagtaattc ttggcttatg tggttaataa ttaatcttgt 7380
acaaatggcc ccgatttcag ctatggttag aatgtacatc ttctttgcat cattttatta 7440
tgtatggaaa agttatgtgc atgttgtaga cggttgtaat tcatcaactt gtatgatgtg 7500
ttacaaacgt aatagagcaa caagagtcga atgtacaact attgttaatg gtgttagaag 7560
gtccttttat gtctatgcta atggaggtaa aggcttttgc aaactacaca attggaattg 7620
tgttaattgt gatacattct gtgctggtag tacatttatt agtgatgaag ttgcgagaga 7680
cttgtcacta cagtttaaaa gaccaataaa tcctactgac cagtcttctt acatcgttga 7740
tagtgttaca gtgaagaatg gttccatcca tctttacttt gataaagctg gtcaaaagac 7800
ttatgaaaga cattctctct ctcattttgt taacttagac aacctgagag ctaataacac 7860
taaaggttca ttgcctatta atgttatagt ttttgatggt aaatcaaaat gtgaagaatc 7920
atctgcaaaa tcagcgtctg tttactacag tcagcttatg tgtcaaccta tactgttact 7980
agatcaggca ttagtgtctg atgttggtga tagtgcggaa gttgcagtta aaatgtttga 8040
tgcttacgtt aatacgtttt catcaacttt taacgtacca atggaaaaac tcaaaacact 8100
agttgcaact gcagaagctg aacttgcaaa gaatgtgtcc ttagacaatg tcttatctac 8160
ttttatttca gcagctcggc aagggtttgt tgattcagat gtagaaacta aagatgttgt 8220
tgaatgtctt aaattgtcac atcaatctga catagaagtt actggcgata gttgtaataa 8280
ctatatgctc acctataaca aagttgaaaa catgacaccc cgtgaccttg gtgcttgtat 8340
tgactgtagt gcgcgtcata ttaatgcgca ggtagcaaaa agtcacaaca ttgctttgat 8400
atggaacgtt aaagatttca tgtcattgtc tgaacaacta cgaaaacaaa tacgtagtgc 8460
tgctaaaaag aataacttac cttttaagtt gacatgtgca actactagac aagttgttaa 8520
tgttgtaaca acaaagatag cacttaaggg tggtaaaatt gttaataatt ggttgaagca 8580
gttaattaaa gttacacttg tgttcctttt tgttgctgct attttctatt taataacacc 8640
tgttcatgtc atgtctaaac atactgactt ttcaagtgaa atcataggat acaaggctat 8700
tgatggtggt gtcactcgtg acatagcatc tacagatact tgttttgcta acaaacatgc 8760
tgattttgac acatggttta gccagcgtgg tggtagttat actaatgaca aagcttgccc 8820
attgattgct gcagtcataa caagagaagt gggttttgtc gtgcctggtt tgcctggcac 8880
gatattacgc acaactaatg gtgacttttt gcatttctta cctagagttt ttagtgcagt 8940
tggtaacatc tgttacacac catcaaaact tatagagtac actgactttg caacatcagc 9000
ttgtgttttg gctgctgaat gtacaatttt taaagatgct tctggtaagc cagtaccata 9060
ttgttatgat accaatgtac tagaaggttc tgttgcttat gaaagtttac gccctgacac 9120
acgttatgtg ctcatggatg gctctattat tcaatttcct aacacctacc ttgaaggttc 9180
tgttagagtg gtaacaactt ttgattctga gtactgtagg cacggcactt gtgaaagatc 9240
agaagctggt gtttgtgtat ctactagtgg tagatgggta cttaacaatg attattacag 9300
atctttacca ggagttttct gtggtgtaga tgctgtaaat ttatttacta atatgtttac 9360
accactaatt caacctattg gtgctttgga catatcagca tctatagtag ctggtggtat 9420
tgtagctatc gtagtaacat gccttgccta ctattttatg aggtttagaa gagcttttgg 9480
tgaatacagt catgtagttg cctttaatac tttactattc cttatgtcat tcactgtact 9540
ctgtttaaca ccagtttact cattcttacc tggtgtttat tctgttattt acttgtactt 9600
gacattttat cttactaatg atgtttcttt tttagcacat attcagtgga tggttatgtt 9660
cacaccttta gtacctttct ggataacaat tgcttatatc atttgtattt ccacaaagca 9720
tttctattgg ttctttagta attacctaaa gagacgtgta gtctttaatg gtgtttcctt 9780
tagtactttt gaagaagctg cgctgtgcac ctttttgtta aataaagaaa tgtatctaaa 9840
gttgcgtagt gatgtgctat tacctcttac gcaatataat agatacttag ctctttataa 9900
taagtacaag tattttagtg gagcaatgga tacaactagc tacagagaag ctgcttgttg 9960
tcatctcgca aaggctctca atgacttcag taactcaggt tctgatgttc tttaccaacc 10020
accacaaacc tctatcacct cagctgtttt gcagagtggt tttagaaaaa tggcattccc 10080
atctggtaaa gttgagggtt gtatggtaca agtaacttgt ggtacaacta cacttaacgg 10140
tctttggctt gatgacgtag tttactgtcc aagacatgtg atctgcacct ctgaagacat 10200
gcttaaccct aattatgaag atttactcat tcgtaagtct aatcataatt tcttggtaca 10260
ggctggtaat gttcaactca gggttattgg acattctatg caaaattgtg tacttaagct 10320
tagggttgat acagccaatc ctaagacacc taagtataag tttgttcgca ttcaaccagg 10380
acagactttt tcagtgttag cttgttacaa tggttcacca tctggtgttt accaatgtgc 10440
tatgaggccc aatttcacta ttaagggttc attccttaat ggttcatgtg gtagtgttgg 10500
ttttaacata gattatgact gtgtctcttt ttkttacatg caccatatgg aattaccaac 10560
tggagttcat gctggcacag acttagaagg taacttttat ggaccttttg ttgacaggca 10620
aacagcacaa gcagctggta cggacacaac tattacagtt aatgttttag cttggttgta 10680
cgctgctgtt ataaatggag acaggtggtt tctcaatcga tttaccacaa ctcttaatga 10740
ctttaacctt gtggctatga agtacaatta tgaacytcta acacaagacc atgttgacat 10800
actaggacct ctttctgctc aaactggaat tgccgtttta gatatgtgtg cttcattaaa 10860
agaattactg caaaatggta tgaatggacg taccatattg ggtagtgctt tattagaaga 10920
tgaatttaca ccttttgatg ttgttagaca atgctcaggt gttactttcc aaagtgcagt 10980
gaaaagaaca atcaagggta cacaccactg gttgttactc acaattttga cttcactttt 11040
agttttagtc cagagtactc aatggtcttt gttctttttt ttgtatgaaa atgccttttt 11100
accttttgct atgggtatta ttgctatgtc tgcttttgca atgatgtttg tcaaacataa 11160
gcatgcattt ctctgtttgt ttttgttacc ttctcttgcc actgtagctt attttaatat 11220
ggtctatatg cctgctagtt gggtgatgcg tattatgaca tggttggata tggttgatac 11280
tagtttgnnn nnnaagctaa aagactgtgt tatgtatgca tcagctgtag tgttactaat 11340
ccttatgaca gcaagaactg tgtatgatga tggtgctagg agagtgtgga cacttatgaa 11400
tgtcttgaca ctcgtttata aagtttatta tggtaatgct ttagatcaag ccatttccat 11460
gtgggctctt ataatctctg ttacttctaa ctactcaggt gtagttacaa ctgtcatgtt 11520
tttggccaga ggtattgttt ttatgtgtgt tgagtattgc cctattttct tcataactgg 11580
taatacactt cagtgtataa tgctagttta ttgtttctta ggctattttt gtacttgtta 11640
ctttggcctc ttttgtttac tcaaccgcta ctttagactg actcttggtg tttatgatta 11700
cttagtttct acacaggagt ttagatatat gaattcacag ggactaytcc cacccaagaa 11760
tagcatagat gccttcaaac tcaacattaa attgttgggt gttggtggca aaccttgtat 11820
caaagtagcc actgtacagt ctaaaatgtc agatgtaaag tgcacatcag tagtcttact 11880
ctcagttttg caacaactca gagtagaatc atcatctaaa ttgtgggctc aatgtgtcca 11940
gttacacaat gacattctct tagctaaaga tactactgaa gcctttgaaa aaatggtttc 12000
actactttct gttttgcttt ccatgcaggg tgctgtagac ataaacaagc tttgtgaaga 12060
aatgctggac aacagggcaa ccttacaagc tatagcctca gagtttagtt cccttccatc 12120
atatgcagct tttgctactg ctcaagaagc ttatgagcag gctgttgcta atggtgattc 12180
tgaagttgtt cttaaaaagt tgaagaagtc tttgaatgtg gctaaatctg aatttgaccg 12240
tgatgcagcc atgcaacgta agttggaaaa gatggctgat caagctatga cccaaatgta 12300
taaacaggct agatctgagg acaagagggc aaaagttact agtgctatgc agacaatgct 12360
tttcactatg cttagaaagt tggataatga tgcactcaac aacattatca acaatgcaag 12420
agatggttgt gttcccttga acataatacc tcttacaaca gcagccaaac taatggttgt 12480
cataccagac tataacacat ataaaaatac gtgtgatggt acaacattta cttatgcatc 12540
agcattgtgg gaaatccaac aggttgtaga tgcagatagt aaaattgttc aacttagtga 12600
aattagtatg gacaattcac ctaatttagc atggcctctt attgtaacag ctttaagggc 12660
caattctgct gtcaaattac agaataatga gcttagtcct gttgcactac gacagatgtc 12720
ttgtgctgcc ggtactacac aaactgcttg cactgatgac aatgcgttag cttactacaa 12780
cacaacaaag ggaggtaggt ttgtacttgc actgttatcc gatttacagg atttgaaatg 12840
ggctagattc cctaagagtg atggaactgg tactatctat acagaactgg aaccaccttg 12900
taggtttgtt acagacacac ctaaaggtcc taaagtgaag tatttatact ttattaaagg 12960
attaaacaac ctaaatagag gtatggtact tggtagttta gctgccacag tacgtctaca 13020
agctggtaat gcaacagaag tgcctgccaa ttcaactgta ttatctttct gtgcttttgc 13080
tgtagatgct gctaaagctt acaaagatta tctagctagt gggggacaac caatcactaa 13140
ttgtgttaag atgttgtgta cacacactgg tactggtcag gcaataacag ttacaccgga 13200
agccaatatg gatcaagaat cctttggtgg tgcatcgtgt tgtctgtact gccgttgcca 13260
catagatcat ccaaatccta aaggattttg tgacttaaaa ggtaagtatg tacaaatacc 13320
tacaacttgt gctaatgacc ctgtgggttt tacacttaaa aacacagtct gtaccgtctg 13380
cggtatgtgg aaaggttatg gctgtagttg tgatcaactc cgcgaaccca tgcttcagtc 13440
agctgatgca caatcgtttt taaacgggtt tgcggtgtaa gtgcagcccg tcttacaccg 13500
tgcggcacag gcactagtac tgatgtcgta tacagggctt ttgacatcta caatgataaa 13560
gtagctggtt ttgctaaatt cctaaaaact aattgttgtc gcttccaaga aaaggacgaa 13620
gatgacaatt taattgattc ttactttgta gttaagagac acactttctc taactaccaa 13680
catgaagaaa caatttataa tttacttaag gattgtccag ctgttgctaa acatgacttc 13740
tttaagttta gaatagacgg tgacatggta ccacatatat cacgtcaacg tcttactaaa 13800
tacacaatgg cagacctcgt ctatgcttta aggcattttg atgaaggtaa ttgtgacaca 13860
ttaaaagaaa tacttgtcac atacaattgt tgtgatgatg attatttcaa taaaaaggac 13920
tggtatgatt ttgtagaaaa cccagatata ttacgcgtat acgccaactt aggtgaacgt 13980
gtacgccaag ctttgttaaa aacagtacaa ttctgtgatg ccatgcgaaa tgctggtatt 14040
gttggtgtac tgacattaga taatcaagat ctcaatggta actggtatga tttcggtgat 14100
ttcatacaaa ccacgccagg tagtggagtt cctgttgtag attcttatta ttcattgtta 14160
atgcctatat taaccttgac cagggcttta actgcagagt cacatgttga cactgactta 14220
acaaagcctt acattaagtg ggatttgtta aaatatgact tcacggaaga gaggttaaaa 14280
ctctttgacc gttattttaa atattgggat cagacatacc acccaaattg tgttaactgt 14340
ttggatgaca gatgcattct gcattgtgca aactttaatg ttttattctc tacagtgttc 14400
ccacttacaa gttttggacc actagtgaga aaaatatttg ttgatggtgt tccatttgta 14460
gtttcaactg gataccactt cagagagcta ggtgttgtac ataatcagga tgtaaactta 14520
catagctcta gacttagttt taaggaatta cttgtgtatg ctgctgaccc tgctatgcac 14580
gctgcttctg gtaatctatt actagataaa cgcactacgt gcttttcagt agctgcactt 14640
actaacaatg ttgcttttca aactgtcaaa cccggtaatt ttaacaaaga cttctatgac 14700
tttgctgtgt ctaagggttt ctttaaggaa ggaagttctg ttgaattaaa acacttcttc 14760
tttgctcagg atggtaatgc tgctatcagc gattatgact actatcgtta taatctacca 14820
acaatgtgtg atatcagaca actactattt gtagttgaag ttgttgataa gtactttgat 14880
tgttacgatg gtggctgtat taatgctaac caagtcatcg tcaacaacct agacaaatca 14940
gctggttttc catttaataa atggggtaag gctagacttt attatgattc aatgagttat 15000
gaggatcaag atgcactttt cgcatataca aaacgtaatg tcatccctac tataactcaa 15060
atgaatctta agtatgccat tagtgcaaag aatagagctc gcaccgtagc tggtgtctct 15120
atctgtagta ctatgaccaa tagacagttt catcaaaaat tattgaaatc aatagccgcc 15180
actagaggag ctactgtagt aattggaaca agcaaattct atggtggttg gcacaacatg 15240
ttaaaaactg tttatagtga tgtagaaaac cctcacctta tgggttggga ttatcctaaa 15300
tgtgatagag ccatgcctaa catgcttaga attatggcct cacttgttct tgctcgcaaa 15360
catacaacgt gttgtagctt gtcacaccgt ttctatagat tagctaatga gtgtgctcaa 15420
gtattgagtg aaatggtcat gtgtggcggt tcactatatg ttaaaccagg tggaacctca 15480
tcaggagatg ccacaactgc ttatgctaat agtgttttta acatttgtca agctgtcacg 15540
gccaatgtta atgcactttt atctactgat ggtaacaaaa ttgccgataa gtatgtccgc 15600
aatttacaac acagacttta tgagtgtctc tatagaaata gagatgttga cacagacttt 15660
gtgaatgagt tttacgcata tttgcgtaaa catttctcaa tgatgatact ctctgacgat 15720
gctgttgtgt gtttcaatag cacttatgca tctcaaggtc tagtggctag cataaagaac 15780
tttaagtcag ttctttatta tcaaaacaat gtttttatgt ctgaagcaaa atgttggact 15840
gagactgacc ttactaaagg acctcatgaa ttttgctctc aacatacaat gctagttaaa 15900
cagggtgatg attatgtgta ccttccttac ccagatccat caagaatcct aggggccggc 15960
tgttttgtag atgatatcgt aaaaacagat ggtacactta tgattgaacg gttcgtgtct 16020
ttagctatag atgcttaccc acttactaaa catcctaatc aggagtatgc tgatgtcttt 16080
catttgtact tacaatacat aagaaagcta catgatgagt taacaggaca catgttagac 16140
atgtattctg ttatgcttac taatgataac acttcaaggt attgggaacc tgagttttat 16200
gaggctatgt acacaccgca tacagtctta caggctgttg gggcttgtgt tctttgcaat 16260
tcacagactt cattaagatg tggtgcttgc atacgtagac cattcttatg ttgtaaatgc 16320
tgttacgacc atgtcatatc aacatcacat aaattagtct tgtctgttaa tccgtatgtt 16380
tgcaatgctt caggttgtga tgtcacagat gtgactcaac tttacttagg aggtatgagc 16440
tattattgta aatcacataa accacccatt agttttccat tgtgtgctaa tggacaagtt 16500
tttggtttat ataaaaatac atgtgttggt agcgataatg ttactgactt taatgcaatt 16560
gcaacatgtg actggacaaa tgctggtgat tacattttag ctaacacctg tactgaaaga 16620
ctcaagcttt ttgcagcaga aacgctcaaa gctactgagg agacatttaa actgtcttat 16680
ggtattgcta ctgtacgtga agtgctgtct gacagagaat tacatctttc atgggaagtt 16740
ggtaaaccta gaccaccact taaccgaaat tatgtcttta ctggttatcg tgtaactaaa 16800
aacagtaaag tacaaatagg agagtacacc tttgaaaaag gtgactatgg tgatgctgtt 16860
gtttaccgag gtacaacaac ttacaaatta aatgttggtg attattttgt gctgacatca 16920
catacagtaa tgccattaag tgcacctaca ctagtgccac aagagcacta tgttagaatt 16980
actggcttat acccaacact caatatctca gatgagtttt ctagcaatgt tgcaaattat 17040
caaaaggttg gtatgcaaaa gtattctaca ctccagggac cacctggtac tggtaagagt 17100
cattttgcta ttggcctagc tctctactac ccttctgctc gcatagtgta tacagcttgc 17160
tctcatgccg ctgttgatgc actatgtgag aaggcattaa aatatttgcc tatagataaa 17220
tgtagtagaa ttatacctgc acgtgctcgt gtagagtgtt ttgataaatt caaagtgaat 17280
tcaacattag aacagtatgt cttttgtact gtaaatgcat tgcctgagac gacagcagat 17340
atagttgtct ttgatgaaat ttcaatggcc acaaattatg atttgagtgt tgtcaatgcc 17400
agattacgtg ctaagcacta tgtgtacatt ggcgaccctg ctcaattacc tgcaccacgc 17460
acattgctaa ctaagggcac actagaacca gaatatttca attcagtgtg tagacttatg 17520
aaaactatag gtccagacat gttcctcgga acttgtcggc gttgtcctgc tgaaattgtt 17580
gacactgtga gtgctttggt ttatgataat aagcttaaag cacataaaga caaatcagct 17640
caatgcttta aaatgtttta taagggtgtt atcacgcatg atgtttcatc tgcaattaac 17700
aggccacaaa taggcgtggt aagagaattc cttacacgta accctgcttg gagaaaagct 17760
gtctttattt caccttataa ttcacagaat gctgtagcct caaagatttt gggactacca 17820
actcaaactg ttgattcatc acagggctca gaatatgact atgtcatatt cactcaaacc 17880
actgaaacag ctcactcttg taatgtaaac agatttaatg ttgctattac cagagcaaaa 17940
gtaggcatac tttgcataat gtctgataga gacctttatg acaagttgca atttacaagt 18000
cttgaaattc cacgtaggaa tgtggcaact ttacaagctg aaaatgtaac aggactcttt 18060
aaagattgta gtaaggtaat cactgggtta catcctacac aggcacctac acacctcagt 18120
gttgacacta aattcaaaac tgaaggttta tgtgttgaca tacctggcat acctaaggac 18180
atgacctata gaagactcat ctctatgatg ggttttaaaa tgaattatca agttaatggt 18240
taccctaaca tgtttatcac ccgcgaagaa gctataagac atgtacgtgc atggattggc 18300
ttcgatgtcg aggggtgtca tgctactaga gaagctgttg gtaccaattt acctttacag 18360
ctaggttttt ctacaggtgt taacctagtt gctgtaccta caggttatgt tgatacacct 18420
aataatacag atttttccag agttagtgct aaaccaccgc ctggagatca atttaaacac 18480
ctcataccac ttatgtacaa aggacttcct tggaatgtag tgcgtataaa gattgtacaa 18540
atgttaagtg acacacttaa aaatctctct gacagagtcg tatttgtctt atgggcacat 18600
ggctttgagt tgacatctat gaagtatttt gtgaaaatag gacctgagcg cacctgttgt 18660
ctatgtgata gacgtgccac atgcttttcc actgcttcag acacttatgc ctgttggcat 18720
cattctattg gatttgatta cgtctataat ccgtttatga ttgatgttca acaatggggt 18780
tttacaggta acctacaaag caaccatgat ctgtattgtc aagtccatgg taatgcacat 18840
gtagctagtt gtgatgcaat catgactagg tgtctagctg tccacgagtg ctttgttaag 18900
cgtgttgact ggactattga atatcctata attggtgatg aactgaagat taatgcggct 18960
tgtagaaagg ttcaacacat ggttgttaaa gctgcattat tagcagacaa attcccagtt 19020
cttcacgaca ttggtaaccc taaagctatt aagtgtgtac ctcaagctga tgtagaatgg 19080
aagttctatg atgcacagcc ttgtagtgac aaagcttata aaatagaaga attattctat 19140
tcttatgcca cacattctga caaattcaca gatggtgtat gcctattttg gaattgcaat 19200
gtcgatagat atcctgctaa ttccattgtt tgtagatttg acactagagt gctatctaac 19260
cttaacttgc ctggttgtga tggtggcagt ttgtatgtaa ataaacatgc attccacaca 19320
ccagcttttg ataaaagtgc ttttgttaat ttaaaacaat taccattttt ctattactct 19380
gacagtccat gtgagtctca tggaaaacaa gtagtgtcag atatagatta tgtaccacta 19440
aagtctgcta cgtgtataac acgttgcaat ttaggtggtg ctgtctgtag acatcatgct 19500
aatgagtaca gattgtatct cgatgcttat aacatgatga tctcagctgg ctttagcttg 19560
tgggtttaca aacaatttga tacttataac ctctggaaca cttttacaag acttcagagt 19620
ttagaaaatg tggcttttaa tgttgtaaat aagggacact ttgatggaca acagggtgaa 19680
gtaccagttt ctatcattaa taacactgtt tacacaaaag ttgatggtgt tgatgtagaa 19740
ttgtttgaaa ataaaacaac attacctgtt aatgtagcat ttgagctttg ggctaagcgc 19800
aacattaaac cagtaccaga ggtgaaaata ctcaataatt tgggtgtgga cattgctgct 19860
aatactgtga tctgggacta caaaagagat gctccagcac atatatctac tattggtgtt 19920
tgttctatga ctgacatagc caagaaacca actgaaacga tttgtgcacc actcactgtc 19980
ttttttgatg gtagagttga tggtcaagta gacttattta gaaatgcccg taatggtgtt 20040
cttattacag aaggtagtgt taaaggttta caaccatctg taggtcccaa acaagctagt 20100
cttaatggag tcacattaat tggagaagcc gtaaaaacac agttcaatta ttataagaaa 20160
gttgatggtg ttgtccaaca attacctgaa acttacttta ctcagagtag aaatttacaa 20220
gaatttaaac ccaggagtca aatggaaatt gatttcttag aattagctat ggatgaattc 20280
attgaacggt ataaattaga aggctatgcc ttcgaacata tcgtttatgg agattttagt 20340
catagtcagt taggtggttt acatctactg attggactag ctaaacgttt taaggaatca 20400
ccttttgaat tagaagattt tattcctatg gacagtacag ttaaaaacta tttcataaca 20460
gatgcgcaaa caggttcatc taagtgtgtg tgttctgtta ttgatttatt acttgatgat 20520
tttgttgaaa taataaaatc ccaagattta tctgtagttt ctaaggttgt caaagtgact 20580
attgactata cagaaatttc atttatgctt tggtgtaaag atggccatgt agaaacattt 20640
tacccaaaat tacaatctag tcaagcgtgg caaccgggtg ttgctatgcc taatctttac 20700
aaaatgcaaa gaatgctatt agaaaagtgt gaccttcaaa attatggtga tagtgcaaca 20760
ttacctaaag gcataatgat gaatgtcgca aaatatactc aactgtgtca atatttaaac 20820
acattaacat tagctgtacc ctataatatg agagttatac attttggtgc tggttctgat 20880
aaaggagttg caccaggtac agctgtttta agacagtggt tgcctacggg tacgctgctt 20940
gtcgattcag atcttaatga ctttgtctct gatgcagatt caactttgat tggtgattgt 21000
gcaactgtac atacagctaa taaatgggat ctcattatta gtgatatgta cgaccctaag 21060
actaaaaatg ttacaaaaga aaatgactct aaagagggtt ttttcactta catttgtggg 21120
tttatacaac aaaagctagc tcttggaggt tccgtggcta taaagataac agaacattct 21180
tggaatgctg atctttataa gctcatggga cacttcgcat ggtggacagc ctttgttact 21240
aatgtgaatg cgtcatcatc tgaagcattt ttaattggat gtaattatct tggcaaacca 21300
cgcgaacaaa tagatggtta tgtcatgcat gcaaattaca tattttggag gaatacaaat 21360
ccaattcagt tgtcttccta ttctttattt gacatgagta aatttcccct taaattaagg 21420
ggtactgctg ttatgtcttt aaaagaaggt caaatcaatg atatgatttt atctcttctt 21480
agtaaaggta gacttataat tagagaaaac aacagagttg ttatttctag tgatgttctt 21540
gttaacaact aaacgaacaa tgtttgtttt tcttgtttta ttgccactag tctctagtca 21600
gtgtgttaat cttacaacca gaactcaatt accccctgca tacactaatt ctttcacacg 21660
tggtgtttat taccctgaca aagttttcag atcctcagtt ttacattcaa ctcaggactt 21720
gttcttacct ttcttttcca atgttacttg gttccatgct atacatgtct ctgggaccaa 21780
tggtactaag aggtttgcta accctgtcct accatttaat gatggtgttt attttgcttc 21840
cactgagaag tctaacataa taagaggctg gatttttggt actactttag attcgaagac 21900
ccagtcccta cttattgtta ataacgctac taatgttgtt attaaagtct gtgaatttca 21960
attttgtaat gatccatttt tgggtgttta ttaccacaaa aacaacaaaa gttggatgga 22020
aagtgagttc agagtttatt ctagtgcgaa taattgcact tttgaatatg tctctcagcc 22080
ttttcttatg gaccttgaag gaaaacaggg taatttcaaa aatcttaggg aatttgtgtt 22140
taagaatatt gatggttatt ttaaaatata ttctaagcac acgcctatta atttagtgcg 22200
tggtctccct cagggttttt cggctttaga accattggta gatttgccaa taggtattaa 22260
catcactagg tttcaaannn nnnctttaca tagaagttat ttgactcctg gtgattcttc 22320
ttcaggttgg acagctggtg ctgcagctta ttatgtgggt tatcttcaac ctaggacttt 22380
tctattaaaa tataatgaaa atggaaccat tacagatgct gtagactgtg cacttgaccc 22440
tctctcagaa acaaagtgta cgttgaaatc cttcactgta gaaaaaggaa tctatcaaac 22500
ttctaacttt agagtccaac caacagaatc tattgttaga tttcctaata ttacaaactt 22560
gtgccctttt ggtgaagttt ttaacgccac cagatttgca tctgtttatg cttggaacag 22620
gaagagaatc agcaactgtg ttgctgatta ttctgtccta tataattccg catcattttc 22680
cacttttaag tgttatggag tgtctcctac taaattaaat gatctctgct ttactaatgt 22740
ctatgcagat tcatttgtaa ttagaggtga tgaagtcaga caaatcgctc cagggcaaac 22800
tggaaatatt gctgattata attataaatt accagatgat tttacaggct gcgttatagc 22860
ttggaattct aacaatcttg attctaaggt tggtggtaat tataattacc tgtatagatt 22920
gtttaggaag tctaatctca aaccttttga gagagatatt tcaactgaaa tctatcaggc 22980
cggtagcaca ccttgtaatg gtgttaaagg ttttaattgt tactttcctt tacaatcata 23040
tggtttccaa cccacttatg gtgttggtta ccaaccatac agagtagtag tactttcttt 23100
tgaacttcta catgcaccag caactgtttg tggacctaaa aagtctacta atttggttaa 23160
aaacaaatgt gtcaatttca acttcaatgg tttaacaggc acaggtgttc ttactgagtc 23220
taacaaaaag tttctgcctt tccaacaatt tggcagagac attgctgaca ctactgatgc 23280
tgtccgtgat ccacagacac ttgagattct tgacattaca ccatgttctt ttggtggtgt 23340
cagtgttata acaccaggaa caaatacttc taaccaggtt gctgttcttt atcagggtgt 23400
taactgcaca gaagtccctg ttgctattca tgcagatcaa cttactccta cttggcgtgt 23460
ttattctaca ggttctaatg tttttcaaac acgtgcaggc tgtttaatag gggctgaaca 23520
tgtcaacaac tcatatgagt gtgacatacc cattggtgca ggtatatgcg ctagttatca 23580
gactcagact aattctcctc ggcgggcacg tagtgtagct agtcaatcca tcattgccta 23640
cactatgtca cttggtgtag aaaattcagt tgcttactct aataactcta ttgccatacc 23700
cacaaatttt actattagtg ttaccacaga aattctacca gtgtctatga ccaagacatc 23760
agtagattgt acaatgtaca tttgtggtga ttcaactgaa tgcagcaatc ttttgttgca 23820
atatggcagt ttttgtacac aattaaaccg tgctttaact ggaatagctg ttgaacaaga 23880
caaaaacacc caagaagttt ttgcacaagt caaacaaatt tacaaaacac caccaattaa 23940
agattttggt ggttttaatt tttcacaaat attaccagat ccatcaaaac caagcaagag 24000
gtcatttatt gaagatctac ttttcaacaa agtgacactt gcagatgctg gcttcatcaa 24060
acaatatggt gattgccttg gtgatattgc tgctagagac ctcatttgtg cacaaaagtt 24120
taacggcctt actgttttgc cacctttgct cacagatgaa atgattgctc aatacacttc 24180
tgcactgtta gcgggtacaa tcacttctgg ttggaccttt ggtgcaggtg ctgcattaca 24240
aataccattt gctatgcaaa tggcttatag gtttaatggt attggagtta cacagaatgt 24300
tctctatgag aaccaaaaat tgattgccaa ccaatttaat agtgctattg gcaaaattca 24360
agactcactt tcttccacag caagtgcact tggaaaactt caagatgtgg tcaaccaaaa 24420
tgcacaagct ttaaacacgc ttgttaaaca acttagctcc aattttggtg caatttcaag 24480
tgttttaaat gatatccttt cacgtcttga caaagttgag gctgaagtgc aaattgatag 24540
gttgatcaca ggcagacttc aaagtttgca gacatatgtg actcaacaat taattagagc 24600
tgcagaaatc agagcttctg ctaatcttgc tgctactaaa atgtcagagt gtgtacttgg 24660
acaatcaaaa agagttgatt tttgtggaaa gggctatcat cttatgtcct tccctcagtc 24720
agcacctcat ggtgtagtct tcttgcatgt gacttatgtc cctgcacaag aaaagaactt 24780
cacaactgct cctgccattt gtcatgatgg aaaagcacac tttcctcgtg aaggtgtctt 24840
tgtttcaaat ggcacacact ggtttgtaac acaaaggaat ttttatgaac cacaaatcat 24900
tactacagac aacacatttg tgtctggtaa ctgtgatgtt gtaataggaa ttgtcaacaa 24960
cacagtttat gatcctttgc aacctgaatt agactcattc aaggaggagt tagataaata 25020
ttttaagaat catacatcac cagatgttga tttaggtgac atctctggca ttaatgcttc 25080
agttgtaaac attcaaaaag aaattgaccg cctcaatgag gttgccaaga atttaaatga 25140
atctctcatc gatctccaag aacttggaaa gtatgagcag tatataaaat ggccatggta 25200
catttggcta ggttttatag ctggcttgat tgccatagta atggtgacaa ttatgctttg 25260
ctgtatgacc agttgctgta gttgtctcaa gggctgttgt tcttgtggat cctgctgcaa 25320
atttgatgaa gacgactctg agccagtgct caaaggagtc aaattacatt acacataaac 25380
gaacttatgg atttgtttat gagaatcttc acaattggaa ctgtaacttt gaagcaaggt 25440
gaaatcaagg atgctactcc ttcagatttt gttcgcgcta ctgcaacgat accgatacaa 25500
gcctcactcc ctttcggatg gcttattgtt ggcgttgcac ttcttgctgt ttttcatagc 25560
gcttccaaaa tcataaccct caaaaagaga tggcaactag cactctccaa gggtgttcac 25620
tttgtttgca acttgctgtt gttgtttgta acagtttact cacacctttt gctcgttgct 25680
gctggccttg aagccccttt tctctatctt tatgctttag tctacttctt gcagagtata 25740
aactttgtaa gaataataat gaggctttgg ctttgctgga aatgccgttc caaaaaccca 25800
ttactttatg atgccaacta ttttctttgc tggcatacta attgttacga ctattgtata 25860
ccttacaata gtgtaacttc ttcaattgtc attactttag gtgatggcac aacaagtcct 25920
atttctgaac atgactacca gattggtggt tatactgaaa aatgggaatc tggagtaaaa 25980
gactgtgttg tattacacag ttacttcact tcagactatt accagctgta ctcaactcaa 26040
ttgagtacag acactggtgt tgaacatgtt accttcttca tctacaataa aattgttgat 26100
gagcctgaag aacatgtcca aattcacaca atcgacggtt catccggagt tgttaatcca 26160
gtaatggaac caatttatga tgaaccgacg acgactacta gcgtgccttt gtaagcacaa 26220
gctgatgagt acgaacttat gtactcattc gtttcggaag agacaggtac gttaatagtt 26280
aatagcgtac ttctttttct tgctttcgtg gtattcttgc tagttacact agccatcctt 26340
actgcgcttc gattgtgtgc gtactgctgc aatattgtta acgtgagtct tgtaaaacct 26400
tctttttacg tttactctcg tgttaaaaat ctgaattctt ctagagttct tgatcttctg 26460
gtctaaacga actaaatatt atattagttt ttctgtttgg aactttaatt ttagccatgg 26520
cagattccaa cggtactatt accgttgaag agcttaaaaa gctccttgaa caatggaacc 26580
tagtaatagg tttcctattc cttacatgga tttgtcttct acaatttgcc tatgccaaca 26640
ggaataggtt tttgtatata attaagttaa ttttcctctg gctgttatgg ccagtaactt 26700
tagcttgttt tgtgcttgct gctgtttaca gaataaattg gatcaccggt ggaattgcta 26760
tcgcaatggc ttgtcttgta ggcttgatgt ggctcagcta cttcattgct tctttcagac 26820
tgtttgcgcg tacgcgttcc atgtggtcat tcaatccaga aactaacatt cttctcaacg 26880
tgccactcca tggcactatt ctgaccagac cgcttctaga aagtgaactc gtaatcggag 26940
ctgtgatcct tcgtggacat cttcgtattg ctggacacca tctaggacgc tgtgacatca 27000
aggacctgcc taaagaaatc actgttgcta catcacgaac gctttcttat tacaaattgg 27060
gagcttcgca gcgtgtagca ggtgactcag gttttgctgc atacagtcgc tacaggattg 27120
gcaactataa attaaacaca gaccattcca gtagcagtga caatattgct ttgcttgtac 27180
agtaagcgac aacagatgtt tcatctcgtt gactttcagg ttactatagc agagatatta 27240
ctaattatta tgaggacttt taaagtttcc atttggaatc ttgattacat cataaacctc 27300
ataattaaaa atttatctaa gtcactaact gagaataaat attctcaatt agatgaagag 27360
caaccaatgg agattgatta aacgaacatg aaaattattc ttttcttggc actgataaca 27420
ctcgctactt gtgagcttta tcactaccaa gagtgtgtta gaggtacaac agtactttta 27480
aaagaacctt gctcttctgg aacatacgag ggcaattcac catttcatcc tctagctgat 27540
aacaaatttg cactgacttg ctttagcact caatttgctt ttgcttgtcc tgacggcgta 27600
aaacacgtct atcagttacg tgccagatca gtttcaccta aactgttcat cagacaagag 27660
gaagttcaag aactttactc tccaattttt cttattgttg cggcaatagt gtttataaca 27720
ctttgcttca cactcaaaag aaagacagaa tgattgaact ttcattaatt gacttctatt 27780
tgtgcttttt agcctttctg ctattccttg ttttaattat gcttattatc ttttggttct 27840
cacttgaact gcaagatcat aatgaaactt gtcacgccta aacgaacatg aaatttcttg 27900
ttttcttagg aatcatcaca actgtagctg catttcacca agaatgtagt ttacagtcat 27960
gtactcaaca tcaaccatat gtagttgatg acccgtgtcc tattcacttc tattctaaat 28020
ggtatattag agtaggagct agaaaatcag cacctttaat tgaattgtgc gtggatgagg 28080
ctggttctaa atcacccatt cagtacatcg atatcggtaa ttatacagtt tcctgtttac 28140
cttttacaat taattgccag gaacctaaat tgggtagtct tgtagtgcgt tgttcgttct 28200
atgaagactt tttagagtat catgacgttc gtgttgtttt agattttatc taaacgaaca 28260
aactaaaatg tctgataatg gaccccaaaa tcagcgaaat gcaccccgca ttacgtttgg 28320
tggaccctca gattcaactg gcagtaacca gaatggagaa cgcagtgggg cgcgatcaaa 28380
acaacgtcgg ccccaaggtt tacccaataa tactgcgtct tggttcaccg ctctcactca 28440
acatggcaag gaagacctta aattccctcg aggacaaggc gttccaatta acaccaatag 28500
cagtccagat gaccaaattg gctactaccg aagagctacc agacgaattc gtggtggtga 28560
cggtaaaatg aaagatctca gtccaagatg gtatttctac tacctaggaa ctgggccaga 28620
agctggactt ccctatggtg ctaacaaaga cggcatcata tgggttgcaa ctgagggagc 28680
cttgaataca ccaaaagatc acattggcac ccgcaatcct gctaacaatg ctgcaatcgt 28740
gctacaactt cctcaaggaa caacattgcc aaaaggcttc tacgcagaag ggagcagagg 28800
cggcagtcaa gcctcttctc gttcctcatc acgtagtcgc aacagttcaa gaaattcaac 28860
tccaggcagc agtaggggaa tttctcctgc tagaatggct ggcaatggcg gtgatgctgc 28920
tcttgctttg ctgctgcttg acagattgaa ccagcttgag agcaaaatgt ctggtaaagg 28980
ccaacaacaa caaggccaaa ctgtcactaa gaaatctgct gctgaggctt ctaagaagcc 29040
tcggcaaaaa cgtactgcca ctaaagcata caatgtaaca caagctttcg gcagacgtgg 29100
tccagaacaa acccaaggaa attttgggga ccaggaacta atcagacaag gaactgatta 29160
caaacattgg ccgcaaattg cacaatttgc ccccagcgct tcagcgttct tcggaatgtc 29220
gcgcattggc atggaagtca caccttcggg aacgtggttg acctacacag gtgccatcaa 29280
attggatgac aaagatccaa atttcaaaga tcaagtcatt ttgctgaata agcatattga 29340
cgcatacaaa acattcccac caacagagcc taaaaaggac aaaaagaaga aggctgatga 29400
aactcaagcc ttaccgcaga gacagaagaa acagcaaact gtgactcttc ttcctgctgc 29460
agatttggat gatttctcca aacaattgca acaatccatg agcagtgctg actcaactca 29520
ggcctaaact catgcagacc acacaaggca gatgggctat ataaacgttt tcgcttttcc 29580
gtttacgata tatagtctac tcttgtgcag aatgaattct cgtaactaca tagcacaagt 29640
agatgtagtt aactttaatc tcacatagca atctttaatc agtgtgtaac attagggagg 29700
acttgaaaga gccaccacat tttcaccgag gccacgcgga gtacgatcga gtgtacagtg 29760
aacaatgcta gggagagctg cctatatgga agagccctaa tgtgtaaaat taattttagt 29820
agtgctatcc ccatgnnnnn nnnnnnnnnn nnnnnnnnnn nnnnaaaaaa aaaaaaaaaa 29880
aaaaaaaaaa aaaaaaa 29897
<210> 19
<211> 1272
<212> PRT
<213> SARS-CoV2
<220>
<221> MISC_FEATURE
<222> (240)..(242)
<223> Xaa can be any naturally occurring amino acid
<400> 19
Met Phe Val Phe Leu Val Leu Leu Pro Leu Val Ser Ser Gln Cys Val
1 5 10 15
Asn Leu Thr Thr Arg Thr Gln Leu Pro Pro Ala Tyr Thr Asn Ser Phe
20 25 30
Thr Arg Gly Val Tyr Tyr Pro Asp Lys Val Phe Arg Ser Ser Val Leu
35 40 45
His Ser Thr Gln Asp Leu Phe Leu Pro Phe Phe Ser Asn Val Thr Trp
50 55 60
Phe His Ala Ile His Val Ser Gly Thr Asn Gly Thr Lys Arg Phe Ala
65 70 75 80
Asn Pro Val Leu Pro Phe Asn Asp Gly Val Tyr Phe Ala Ser Thr Glu
85 90 95
Lys Ser Asn Ile Ile Arg Gly Trp Ile Phe Gly Thr Thr Leu Asp Ser
100 105 110
Lys Thr Gln Ser Leu Leu Ile Val Asn Asn Ala Thr Asn Val Val Ile
115 120 125
Lys Val Cys Glu Phe Gln Phe Cys Asn Asp Pro Phe Leu Gly Val Tyr
130 135 140
Tyr His Lys Asn Asn Lys Ser Trp Met Glu Ser Glu Phe Arg Val Tyr
145 150 155 160
Ser Ser Ala Asn Asn Cys Thr Phe Glu Tyr Val Ser Gln Pro Phe Leu
165 170 175
Met Asp Leu Glu Gly Lys Gln Gly Asn Phe Lys Asn Leu Arg Glu Phe
180 185 190
Val Phe Lys Asn Ile Asp Gly Tyr Phe Lys Ile Tyr Ser Lys His Thr
195 200 205
Pro Ile Asn Leu Val Arg Gly Leu Pro Gln Gly Phe Ser Ala Leu Glu
210 215 220
Pro Leu Val Asp Leu Pro Ile Gly Ile Asn Ile Thr Arg Phe Gln Xaa
225 230 235 240
Xaa Xaa Leu His Arg Ser Tyr Leu Thr Pro Gly Asp Ser Ser Ser Gly
245 250 255
Trp Thr Ala Gly Ala Ala Ala Tyr Tyr Val Gly Tyr Leu Gln Pro Arg
260 265 270
Thr Phe Leu Leu Lys Tyr Asn Glu Asn Gly Thr Ile Thr Asp Ala Val
275 280 285
Asp Cys Ala Leu Asp Pro Leu Ser Glu Thr Lys Cys Thr Leu Lys Ser
290 295 300
Phe Thr Val Glu Lys Gly Ile Tyr Gln Thr Ser Asn Phe Arg Val Gln
305 310 315 320
Pro Thr Glu Ser Ile Val Arg Phe Pro Asn Ile Thr Asn Leu Cys Pro
325 330 335
Phe Gly Glu Val Phe Asn Ala Thr Arg Phe Ala Ser Val Tyr Ala Trp
340 345 350
Asn Arg Lys Arg Ile Ser Asn Cys Val Ala Asp Tyr Ser Val Leu Tyr
355 360 365
Asn Ser Ala Ser Phe Ser Thr Phe Lys Cys Tyr Gly Val Ser Pro Thr
370 375 380
Lys Leu Asn Asp Leu Cys Phe Thr Asn Val Tyr Ala Asp Ser Phe Val
385 390 395 400
Ile Arg Gly Asp Glu Val Arg Gln Ile Ala Pro Gly Gln Thr Gly Asn
405 410 415
Ile Ala Asp Tyr Asn Tyr Lys Leu Pro Asp Asp Phe Thr Gly Cys Val
420 425 430
Ile Ala Trp Asn Ser Asn Asn Leu Asp Ser Lys Val Gly Gly Asn Tyr
435 440 445
Asn Tyr Leu Tyr Arg Leu Phe Arg Lys Ser Asn Leu Lys Pro Phe Glu
450 455 460
Arg Asp Ile Ser Thr Glu Ile Tyr Gln Ala Gly Ser Thr Pro Cys Asn
465 470 475 480
Gly Val Lys Gly Phe Asn Cys Tyr Phe Pro Leu Gln Ser Tyr Gly Phe
485 490 495
Gln Pro Thr Tyr Gly Val Gly Tyr Gln Pro Tyr Arg Val Val Val Leu
500 505 510
Ser Phe Glu Leu Leu His Ala Pro Ala Thr Val Cys Gly Pro Lys Lys
515 520 525
Ser Thr Asn Leu Val Lys Asn Lys Cys Val Asn Phe Asn Phe Asn Gly
530 535 540
Leu Thr Gly Thr Gly Val Leu Thr Glu Ser Asn Lys Lys Phe Leu Pro
545 550 555 560
Phe Gln Gln Phe Gly Arg Asp Ile Ala Asp Thr Thr Asp Ala Val Arg
565 570 575
Asp Pro Gln Thr Leu Glu Ile Leu Asp Ile Thr Pro Cys Ser Phe Gly
580 585 590
Gly Val Ser Val Ile Thr Pro Gly Thr Asn Thr Ser Asn Gln Val Ala
595 600 605
Val Leu Tyr Gln Gly Val Asn Cys Thr Glu Val Pro Val Ala Ile His
610 615 620
Ala Asp Gln Leu Thr Pro Thr Trp Arg Val Tyr Ser Thr Gly Ser Asn
625 630 635 640
Val Phe Gln Thr Arg Ala Gly Cys Leu Ile Gly Ala Glu His Val Asn
645 650 655
Asn Ser Tyr Glu Cys Asp Ile Pro Ile Gly Ala Gly Ile Cys Ala Ser
660 665 670
Tyr Gln Thr Gln Thr Asn Ser Pro Arg Arg Ala Arg Ser Val Ala Ser
675 680 685
Gln Ser Ile Ile Ala Tyr Thr Met Ser Leu Gly Val Glu Asn Ser Val
690 695 700
Ala Tyr Ser Asn Asn Ser Ile Ala Ile Pro Thr Asn Phe Thr Ile Ser
705 710 715 720
Val Thr Thr Glu Ile Leu Pro Val Ser Met Thr Lys Thr Ser Val Asp
725 730 735
Cys Thr Met Tyr Ile Cys Gly Asp Ser Thr Glu Cys Ser Asn Leu Leu
740 745 750
Leu Gln Tyr Gly Ser Phe Cys Thr Gln Leu Asn Arg Ala Leu Thr Gly
755 760 765
Ile Ala Val Glu Gln Asp Lys Asn Thr Gln Glu Val Phe Ala Gln Val
770 775 780
Lys Gln Ile Tyr Lys Thr Pro Pro Ile Lys Asp Phe Gly Gly Phe Asn
785 790 795 800
Phe Ser Gln Ile Leu Pro Asp Pro Ser Lys Pro Ser Lys Arg Ser Phe
805 810 815
Ile Glu Asp Leu Leu Phe Asn Lys Val Thr Leu Ala Asp Ala Gly Phe
820 825 830
Ile Lys Gln Tyr Gly Asp Cys Leu Gly Asp Ile Ala Ala Arg Asp Leu
835 840 845
Ile Cys Ala Gln Lys Phe Asn Gly Leu Thr Val Leu Pro Pro Leu Leu
850 855 860
Thr Asp Glu Met Ile Ala Gln Tyr Thr Ser Ala Leu Leu Ala Gly Thr
865 870 875 880
Ile Thr Ser Gly Trp Thr Phe Gly Ala Gly Ala Ala Leu Gln Ile Pro
885 890 895
Phe Ala Met Gln Met Ala Tyr Arg Phe Asn Gly Ile Gly Val Thr Gln
900 905 910
Asn Val Leu Tyr Glu Asn Gln Lys Leu Ile Ala Asn Gln Phe Asn Ser
915 920 925
Ala Ile Gly Lys Ile Gln Asp Ser Leu Ser Ser Thr Ala Ser Ala Leu
930 935 940
Gly Lys Leu Gln Asp Val Val Asn Gln Asn Ala Gln Ala Leu Asn Thr
945 950 955 960
Leu Val Lys Gln Leu Ser Ser Asn Phe Gly Ala Ile Ser Ser Val Leu
965 970 975
Asn Asp Ile Leu Ser Arg Leu Asp Lys Val Glu Ala Glu Val Gln Ile
980 985 990
Asp Arg Leu Ile Thr Gly Arg Leu Gln Ser Leu Gln Thr Tyr Val Thr
995 1000 1005
Gln Gln Leu Ile Arg Ala Ala Glu Ile Arg Ala Ser Ala Asn Leu Ala
1010 1015 1020
Ala Thr Lys Met Ser Glu Cys Val Leu Gly Gln Ser Lys Arg Val Asp
1025 1030 1035 1040
Phe Cys Gly Lys Gly Tyr His Leu Met Ser Phe Pro Gln Ser Ala Pro
1045 1050 1055
His Gly Val Val Phe Leu His Val Thr Tyr Val Pro Ala Gln Glu Lys
1060 1065 1070
Asn Phe Thr Thr Ala Pro Ala Ile Cys His Asp Gly Lys Ala His Phe
1075 1080 1085
Pro Arg Glu Gly Val Phe Val Ser Asn Gly Thr His Trp Phe Val Thr
1090 1095 1100
Gln Arg Asn Phe Tyr Glu Pro Gln Ile Ile Thr Thr Asp Asn Thr Phe
1105 1110 1115 1120
Val Ser Gly Asn Cys Asp Val Val Ile Gly Ile Val Asn Asn Thr Val
1125 1130 1135
Tyr Asp Pro Leu Gln Pro Glu Leu Asp Ser Phe Lys Glu Glu Leu Asp
1140 1145 1150
Lys Tyr Phe Lys Asn His Thr Ser Pro Asp Val Asp Leu Gly Asp Ile
1155 1160 1165
Ser Gly Ile Asn Ala Ser Val Val Asn Ile Gln Lys Glu Ile Asp Arg
1170 1175 1180
Leu Asn Glu Val Ala Lys Asn Leu Asn Glu Ser Leu Ile Asp Leu Gln
1185 1190 1195 1200
Glu Leu Gly Lys Tyr Glu Gln Tyr Ile Lys Trp Pro Trp Tyr Ile Trp
1205 1210 1215
Leu Gly Phe Ile Ala Gly Leu Ile Ala Ile Val Met Val Thr Ile Met
1220 1225 1230
Leu Cys Cys Met Thr Ser Cys Cys Ser Cys Leu Lys Gly Cys Cys Ser
1235 1240 1245
Cys Gly Ser Cys Cys Lys Phe Asp Glu Asp Asp Ser Glu Pro Val Leu
1250 1255 1260
Lys Gly Val Lys Leu His Tyr Thr
1265 1270
<210> 20
<211> 29819
<212> DNA
<213> SARS-CoV2
<400> 20
caactttcga tctcttgtag atctgttctc taaacgaact ttaaaatctg tgtggctgtc 60
actcggctgc atgcttagtg cactcacgca gtataattaa taactaatta ctgtcgttga 120
caggacacga gtaactcgtc tatcttctgc aggctgctta cggtttcgtc cgtgttgcag 180
ccgatcatca gcacatctag gttttgtccg ggtgtgaccg aaaggtaaga tggagagcct 240
tgtccctggt ttcaacgaga aaacacacgt ccaactcagt ttgcctgttt tacaggttcg 300
cgacgtgctc gtacgtggct ttggagactc cgtggaggag gtcttatcag aggcacgtca 360
acatcttaaa gatggcactt gtggcttagt agaagttgaa aaaggcgttt tgcctcaact 420
tgaacagccc tatgtgttca tcaaacgttc ggatgctcga actgcacctc atggtcatgt 480
tatggttgag ctggtagcag aactcgaagg cattcagtac ggtcgtagtg gtgagacact 540
tggtgtcctt gtccctcatg tgggcgaaat accagtggct taccgcaagg ttcttcttcg 600
taagaacggt aataaaggag ctggtggcca tagttacggc gccgatctaa agtcatttga 660
cttaggcgac gagcttggca ctgatcctta tgaagacttt caagaaaact ggaacactaa 720
acatagcagt ggtgttaccc gtgaactcat gcgtgagctt aacggagggg catacactcg 780
ctatgtcgat aacaacttct gtggccctga tggctaccct cttgagtgca ttaaagacct 840
tctagcacgt gctggtaaag cttcatgcac tttgtccgaa caactggact ttattgacac 900
taagaggggt gtatactgct gccgtgaaca tgagcatgaa attgcttggt acacggaacg 960
ttctgaaaag agctatgaat tgcagacacc ttttgaaatt aaattggcaa agaaatttga 1020
caccttcaat ggggaatgtc caaattttgt atttccctta aattccataa tcaagactat 1080
tcaaccaagg gttgaaaaga aaaagcttga tggctttatg ggtagaattc gatctgtcta 1140
tccagttgcg tcaccaaatg aatgcaacca aatgtgcctt tcaactctca tgaagtgtga 1200
tcattgtggt gaaacttcat ggcagacggg cgattttgtt aaagccactt gcgaattttg 1260
tggcactgag aatttgacta aagaaggtgc cactacttgt ggttacttac cccaaaatgc 1320
tgttgttaaa atttattgtc cagcatgtca caattcagaa gtaggacctg agcatagtct 1380
tgccgaatac cataatgaat ctggcttgaa aaccattctt cgtaagggtg gtcgcactat 1440
tgcctttgga ggctgtgtgt tctcttatgt tggttgccat aacaagtgtg cctattgggt 1500
tccacgtgct agcgctaaca taggttgtaa ccatacaggt gttgttggag aaggttccga 1560
aggtcttaat gacaaccttc ttgaaatact ccaaaaagag aaagtcaaca tcaatattgt 1620
tggtgacttt aaacttaatg aagagatcgc cattattttg gcatcttttt ctgcttccac 1680
aagtgctttt gtggaaactg tgaaaggttt ggattataaa gcattcaaac aaattgttga 1740
atcctgtggt aattttaaag ttacaaaagg aaaagctaaa aaaggtgcct ggaatattgg 1800
tgaacagaaa tcaatactga gtcctcttta tgcatttgca tcagaggctg ctcgtgttgt 1860
acgatcaatt ttctcccgca ctcttgaaac tgctcaaaat tctgtgcgtg ttttacagaa 1920
ggccgctata acaatactag atggaatttc acagtattca ctgagactca ttgatgctat 1980
gatgttcaca tctgatttgg ctactaacaa tctagttgta atggcctaca ttacaggtgg 2040
tgttgttcag ttgacttcgc agtggctaac taacatcttt ggcactgttt atgaaaaact 2100
caaacccgtc cttgattggc ttgaagagaa gtttaaggaa ggtgtagagt ttcttagaga 2160
cggttgggaa attgttaaat ttatctcaac ctgtgcttgt gaaattgtcg gtggacaaat 2220
tgtcacctgt gcaaaggaaa ttaaggagag tgttcagaca ttctttaagc ttgtaaataa 2280
atttttggct ttgtgtgctg actctatcat tattggtgga gctaaactta aagccttgaa 2340
tttaggtgaa acatttgtca cgcactcaaa gggattgtac agaaagtgtg ttaaatccag 2400
agaagaaact ggcctactca tgcctctaaa agccccaaaa gaaattatct tcttagaggg 2460
agaaacactt cccacagaag tgttaacaga ggaagttgtc ttgaaaactg gtgatttaca 2520
accattagaa caacctacta gtgaagctgt tgaagctcca ttggttggta caccagtttg 2580
tattaacggg cttatgttgc tcgaaatcaa agacacagaa aagtactgtg cccttgcacc 2640
taatatgatg gtaacaaaca ataccttcac actcaaaggc ggtgcaccaa caaaggttac 2700
ttttggtgat gatactgtga tagaagtgca aggttacaag agtgtgaata tcacttttga 2760
acttgatgaa aggattgata aagtacttaa tgagaagtgc tctgcctata cagttgaact 2820
cggtacagaa gtaaatgagt tcgcctgtgt tgtggcagat gctgtcataa aaactttgca 2880
accagtatct gaattactta caccactggg cattgattta gatgagtgga gtatggctac 2940
atactactta tttgatgagt ctggtgagtt taaattggct tcacatatgt attgttcttt 3000
ttaccctcca gatgaggatg aagaagaagg tgattgtgaa gaagaagagt ttgagccatc 3060
aactcaatat gagtatggta ctgaagatga ttaccaaggt aaacctttgg aatttggtgc 3120
cacttctgct gctcttcaac ctgaagaaga gcaagaagaa gattggttag atgatgatag 3180
tcaacaaact gttggtcaac aagacggcag tgaggacaat cagacaacta ctattcaaac 3240
aattgttgag gttcaacctc aattagagat ggaacttaca ccagttgttc agactattga 3300
agtgaatagt tttagtggtt atttaaaact tactgacaat gtatacatta aaaatgcaga 3360
cattgtggaa gaagctaaaa aggtaaaacc aacagtggtt gttaatgcag ccaatgttta 3420
ccttaaacat ggaggaggtg ttgcaggagc cttaaataag gctactaaca atgccatgca 3480
agttgaatct gatgattaca tagctactaa tggaccactt aaagtgggtg gtagttgtgt 3540
tttaagcgga cacaatcttg ctaaacactg tcttcatgtt gtcggcccaa atgttaacaa 3600
aggtgaagac attcaacttc ttaagagtgc ttatgaaaat tttaatcagc acgaagttct 3660
acttgcacca ttattatcag ctggtatttt tggtgctgac cctatacatt ctttaagagt 3720
ttgtgtagat actgttcgca caaatgtcta cttagctgtc tttgataaaa atctctatga 3780
caaacttgtt ttaagctttt tggaaatgaa gagtgaaaag caagttgaac aaaagatcgc 3840
tgagattcct aaagaggaag ttaagccatt tataactgaa agtaaacctt cagttgaaca 3900
gagaaaacaa gatgataaga aaatcaaagc ttgtgttgaa gaagttacaa caactctgga 3960
agaaactaag ttcctcacag aaaacttgtt actttatatt gacattaatg gcaatcttca 4020
tccagattct gccactcttg ttagtgacat tgacatcact ttcttaaaga aagatgctcc 4080
atatatagtg ggtgatgttg ttcaagaggg tgttttaact gctgtggtta tacctactaa 4140
aaaggctggt ggcactactg aaatgctagc gaaagctttg agaaaagtgc caacagacaa 4200
ttatataacc acttacccgg gtcagggttt aaatggttac actgtagagg aggcaaagac 4260
agtgcttaaa aagtgtaaaa gtgcctttta cattctacca tctattatct ctaatgagaa 4320
gcaagaaatt cttggaactg tttcttggaa tttgcgagaa atgcttgcac atgcagaaga 4380
aacacgcaaa ttaatgcctg tctgtgtgga aactaaagcc atagtttcaa ctatacagcg 4440
taaatataag ggtattaaaa tacaagaggg tgtggttgat tatggtgcta gattttactt 4500
ttacaccagt aaaacaactg tagcgtcact tatcaacaca cttaacgatc taaatgaaac 4560
tcttgttaca atgccacttg gctatgtaac acatggctta aatttggaag aagctgctcg 4620
gtatatgaga tctctcaaag tgccagctac agtttctgtt tcttcacctg atgctgttac 4680
agcgtataat ggttatctta cttcttcttc taaaacacct gaagaacatt ttattgaaac 4740
catctcactt gctggttcct ataaagattg gtcctattct ggacaatcta cacaactagg 4800
tatagaattt cttaagagag gtgataaaag tgtatattac actagtaatc ctaccacatt 4860
ccacctagat ggtgaagtta tcacctttga caatcttaag acacttcttt ctttgagaga 4920
agtgaggact attaaggtgt ttacaacagt agacaacatt aacctccaca cgcaagttgt 4980
ggacatgtca atgacatatg gacaacagtt tggtccaact tatttggatg gagctgatgt 5040
tactaaaata aaacctcata attcacatga aggtaaaaca ttttatgttt tacctaatga 5100
tgacactcta cgtgttgagg cttttgagta ctaccacaca actgatccta gttttctggg 5160
taggtacatg tcagcattaa atcacactaa aaagtggaaa tacccacaag ttaatggttt 5220
aacttctatt aaatgggcag ataacaactg ttatcttgcc actgcattgt taacactcca 5280
acaaatagag ttgaagttta atccacctgc tctacaagat gcttattaca gagcaagggc 5340
tggtgaagct gctaactttt gtgcacttat cttagcctac tgtaataaga cagtaggtga 5400
gttaggtgat gttagagaaa caatgagtta cttgtttcaa catgccaatt tagattcttg 5460
caaaagagtc ttgaacgtgg tgtgtaaaac ttgtggacaa cagcagacaa cccttaaggg 5520
tgtagaagct gttatgtaca tgggcacact ttcttatgaa caatttaaga aaggtgttca 5580
gataccttgt acgtgtggta aacaagctac acaatatcta gtacaacagg agtcaccttt 5640
tgttatgatg tcagcaccac ctgctcagta tgaacttaag catggtacat ttacttgtgc 5700
tagtgagtac actggtaatt accagtgtgg tcactataaa catataactt ctaaagaaac 5760
tttgtattgc atagacggtg ctttacttac aaagtcctca gaatacaaag gtcctattac 5820
ggatgttttc tacaaagaaa acagttacac aacaaccata aaaccagtta cttataaatt 5880
ggatggtgtt gtttgtacag aaattgaccc taagttggac aattattata agaaagacaa 5940
ttcttatttc acagagcaac caattgatct tgtaccaaac caaccatatc caaacgcaag 6000
cttcgataat tttaagtttg tatgtgataa tatcaaattt gctgatgatt taaaccagtt 6060
aactggttat aagaaacctg cttcaagaga gcttaaagtt acatttttcc ctgacttaaa 6120
tggtgatgtg gtggctattg attataaaca ctacacaccc tcttttaaga aaggagctaa 6180
attgttacat aaacctattg tttggcatgt taacaatgca actaataaag ccacgtataa 6240
accaaatacc tggtgtatac gttgtctttg gagcacaaaa ccggttgaaa catcaaattc 6300
gtttgatgta ctgaagtcag aggacgcgca gggaatggat aatcttgcct gcgaagatct 6360
aaaaccagtc tctgaagaag tagtggaaaa tcctaccata cagaaagacg ttcttgagtg 6420
taatgtgaaa actaccgaag ttgtaggaga cattatactt aaaccagcaa ataatagttt 6480
aaaaattaca gaagaggttg gccacacaga tctaatggct gcttatgtag acaattctag 6540
tcttactatt aagaaaccta atgaattatc tagagtgtta ggtttgaaaa cccttgctac 6600
tcatggttta gctgctgtta atagtgtccc ttgggatact atagctaatt atgctaagcc 6660
ttttcttaac aaagttgtta gtacaactac taacatagtt acacggtgtt taaaccgtgt 6720
ttgtactaat tatatgcctt atttctttac tttattgcta caattgtgta cttttactag 6780
aagtacaaat tctagaatta aagcatctat gccgactact atagcaaaga atactgttaa 6840
gagtgtcggt aaattttgtc tagaggcttc atttaattat ttgaagtcac ctaatttttc 6900
taaactgata aatattataa tttggttttt actattaagt gtttgcctag gttctttaat 6960
ctactcaacc gctgctttag gtgttttaat gtctaattta ggcatgcctt cttactgtac 7020
tggttacaga gaaggctatt tgaactctac taatgtcact attgcaacct actgtactgg 7080
ttctatacct tgtagtgttt gtcttagtgg tttagattct ttagacacct atccttcttt 7140
agaaactata caaattacca tttcatcttt taaatgggat ttaactgctt ttggcttagt 7200
tgcagagtgg tttttggcat atattctttt cactaggttt ttctatgtac ttggattggc 7260
tgcaatcatg caattgtttt tcagctattt tgcagtacat tttattagta attcttggct 7320
tatgtggtta ataattaatc ttgtacaaat ggccccgatt tcagctatgg ttagaatgta 7380
catcttcttt gcatcatttt attatgtatg gaaaagttat gtgcatgttg tagacggttg 7440
taattcatca acttgtatga tgtgttacaa acgtaataga gcaacaagag tcgaatgtac 7500
aactattgtt aatggtgtta gaaggtcctt ttatgtctat gctaatggag gtaaaggctt 7560
ttgcaaacta cacaattgga attgtgttaa ttgtgataca ttctgtgctg gtagtacatt 7620
tattagtgat gaagttgcga gagacttgtc actacagttt aaaagaccaa taaatcctac 7680
tgaccagtct tcttacatcg ttgatagtgt tacagtgaag aatggttcca tccatcttta 7740
ctttgataaa gctggtcaaa agacttatga aagacattct ctctctcatt ttgttaactt 7800
agacaacctg agagctaata acactaaagg ttcattgcct attaatgtta tagtttttga 7860
tggtaaatca aaatgtgaag aatcatctgc aaaatcagcg tctgtttact acagtcagct 7920
tatgtgtcaa cctatactgt tactagatca ggcattagtg tctgatgttg gtgatagtgc 7980
ggaagttgca gttaaaatgt ttgatgctta cgttaatacg ttttcatcaa cttttaacgt 8040
accaatggaa aaactcaaaa cactagttgc aactgcagaa gctgaacttg caaagaatgt 8100
gtccttagac aatgtcttat ctacttttat ttcagcagct cggcaagggt ttgttgattc 8160
agatgtagaa actaaagatg ttgttgaatg tcttaaattg tcacatcaat ctgacataga 8220
agttactggc gatagttgta ataactatat gctcacctat aacaaagttg aaaacatgac 8280
accccgtgac cttggtgctt gtattgactg tagtgcgcgt catattaatg cgcaggtagc 8340
aaaaagtcac aacattgctt tgatatggaa cgttaaagat ttcatgtcat tgtctgaaca 8400
actacgaaaa caaatacgta gtgctgctaa aaagaataac ttacctttta agttgacatg 8460
tgcaactact agacaagttg ttaatgttgt aacaacaaag atagcactta agggtggtaa 8520
aattgttaat aattggttga agcagttaat taaagttaca cttgtgttcc tttttgttgc 8580
tgctattttc tatttaataa cacctgttca tgtcatgtct aaacatactg acttttcaag 8640
tgaaatcata ggatacaagg ctattgatgg tggtgtcact cgtgacatag catctacaga 8700
tacttgtttt gctaacaaac atgctgattt tgacacatgg tttagccagc gtggtggtag 8760
ttatactaat gacaaagctt gcccattgat tgctgcagtc ataacaagag aagtgggttt 8820
tgtcgtgcct ggtttgcctg gcacgatatt acgcacaact aatggtgact ttttgcattt 8880
cttacctaga gtttttagtg cagttggtaa catctgttac acaccatcaa aacttataga 8940
gtacactgac tttgcaacat cagcttgtgt tttggctgct gaatgtacaa tttttaaaga 9000
tgcttctggt aagccagtac catattgtta tgataccaat gtactagaag gttctgttgc 9060
ttatgaaaat ttacgccctg acacacgtta tgtgctcatg gatggctcta ttattcaatt 9120
tcctaacacc taccttgaag gttctgttag agtggtaaca acttttgatt ctgagtactg 9180
taggcacggc acttgtgaaa gatcagaagc tggtgtttgt gtatctacta gtggtagatg 9240
ggtacttaac aatgattatt acagatcttt accaggagtt ttctgtggtg tagatgctgt 9300
aaatttactt actaatatgt ttacaccact aattcaacct attggtgctt tggacatatc 9360
agcatctata gtagctggtg gtattgtagc tatcgtagta acatgccttg cctactattt 9420
tatgaggttt agaagagctt ttggtgaata cagtcatgta gttgccttta atactttact 9480
attccttatg tcattcactg tactctgttt aacaccagtt tactcattct tacctggtgt 9540
ttattctgtt atttacttgt acttgacatt ttatcttact aatgatgttt cttttttagc 9600
acatattcag tggatggtta tgttcacacc tttagtacct ttctggataa caattgctta 9660
tatcatttgt atttccacaa agcatttcta ttggttcttt agtaattacc taaagagacg 9720
tgtagtcttt aatggtgttt cctttagtac ttttgaagaa gctgcgctgt gcaccttttt 9780
gttaaataaa gaaatgtatc taaagttgcg tagtgatgtg ctattacctc ttacgcaata 9840
taatagatac ttagctcttt ataataagta caagtatttt agtggagcaa tggatacaac 9900
tagctacaga gaagctgctt gttgtcatct cgcaaaggct ctcaatgact tcagtaactc 9960
aggttctgat gttctttacc aaccaccaca aacctctatc acctcagctg ttttgcagag 10020
tggttttaga aaaatggcat tcccatctgg taaagttgag ggttgtatgg tacaagtaac 10080
ttgtggtaca actacactta acggtctttg gcttgatgac gtagtttact gtccaagaca 10140
tgtgatctgc acctctgaag acatgcttaa ccctaattat gaagatttac tcattcgtaa 10200
gtctaatcat aatttcttgg tacaggctgg taatgttcaa ctcagggtta ttggacattc 10260
tatgcaaaat tgtgtactta agcttaaggt tgatacagcc aatcctaaga cacctaagta 10320
taagtttgtt cgcattcaac caggacagac tttttcagtg ttagcttgtt acaatggttc 10380
accatctggt gtttaccaat gtgctatgag gcccaatttc actattaagg gttcattcct 10440
taatggttca tgtggtagtg ttggttttaa catagattat gactgtgtct ctttttgtta 10500
catgcaccat atggaattac caactggagt tcatgctggc acagacttag aaggtaactt 10560
ttatggacct tttgttgaca ggcaaacagc acaagcagct ggtacggaca caactattac 10620
agttaatgtt ttagcttggt tgtacgctgc tgttataaat ggagacaggt ggtttctcaa 10680
tcgatttacc acaactctta atgactttaa ccttgtggct atgaagtaca attatgaacc 10740
tctaacacaa gaccatgttg acatactagg acctctttct gctcaaactg gaattgtcgt 10800
tttagatatg tgtgcttcat taaaagaatt actgcaaaat ggtatgaatg gacgtaccat 10860
attgggtagt gctttattag aagatgaatt tacacctttt gatgttgtta gacaatgctc 10920
aggtgttact ttccaaagtg cagtgaaaag aacaatcaag ggtacacacc actggttgtt 10980
actcacaatt ttgacttcac ttttagtttt agtccagagt actcaatggt ctttgttctt 11040
ttttttgtat gaaaatgcct ttttaccttt tgctatgggt attattgcta tgtctgcttt 11100
tgcaatgatg tttgtcaaac ataagcatgc atttctctgt ttgtttttgt taccttctct 11160
tgccactgta gcttatttta atatggtcta tatgcctgct agttgggtga tgcgtattat 11220
gacatggttg gatatggttg atactagttt gaagctaaaa gactgtgtta tgtatgcatc 11280
agctgtagtg ttactaatcc ttatgacagc aagaactgtg tatgatgatg gtgctaggag 11340
agtgtggaca cttatgaatg tcttgacact cgtttataaa gtttattatg gtaatgcttt 11400
agatcaagcc atttccatgt gggctcttat aatctctgtt acttctaact actcaggtgt 11460
agttacaact gtcatgtttt tggccagagg tattgttttt atgtgtgttg agtattgccc 11520
tattttcttc ataactggta atacacttca gtgtataatg ctagtttatt gtttcttagg 11580
ctatttttgt acttgttact ttggcctctt ttgtttactc aaccgctact ttagactgac 11640
tcttggtgtt tatgattact tagtttctac acaggagttt agatatatga attcacaggg 11700
actactccca cccaagaata gcatagatgc cttcaaactc aacattaaat tgttgggtgt 11760
tggtggcaaa ccttgtatca aagtagccac tgtacagtct aaaatgtcag atgtaaagtg 11820
cacatcagta gtcttactct cagttttgca acaactcaga gtagaatcat catctaaatt 11880
gtgggctcaa tgtgtccagt tacacaatga cattctctta gctaaagata ctactgaagc 11940
ctttgaaaaa atggtttcac tactttctgt tttgctttcc atgcagggtg ctgtagacat 12000
aaacaagctt tgtgaagaaa tgctggacaa cagggcaacc ttacaagcta tagcctcaga 12060
gtttagttcc cttccatcat atgcagcttt tgctactgct caagaagctt atgagcaggc 12120
tgttgctaat ggtgattctg aagttgttct taaaaagttg aagaagtctt tgaatgtggc 12180
taaatctgaa tttgaccgtg atgcagccat gcaacgtaag ttggaaaaga tggctgatca 12240
agctatgacc caaatgtata aacaggctag atctgaggac aagagggcaa aagttactag 12300
tgctatgcag acaatgcttt tcactatgct tagaaagttg gataatgatg cactcaacaa 12360
cattatcaac aatgcaagag atggttgtgt tcccttgaac ataatacctc ttacaacagc 12420
agccaaacta atggttgtca taccagacta taacacatat aaaaatacgt gtgatggtac 12480
aacatttact tatgcatcag cattgtggga aatccaacag gttgtagatg cagatagtaa 12540
aattgttcaa cttagtgaaa ttagtatgga caattcacct aatttagcat ggcctcttat 12600
tgtaacagct ttaagggcca attctgctgt caaattacag aataatgagc ttagtcctgt 12660
tgcactacga cagatgtctt gtgctgccgg tactacacaa actgcttgca ctgatgacaa 12720
tgcgttagct tattacaaca caacaaaggg aggtaggttt gtacttgcac tgttatccga 12780
tttacaggat ttgaaatggg ctagattccc taagagtgat ggaactggta ctatctatac 12840
agaactggaa ccaccttgta ggtttgttac agacacacct aaaggtccta aagtgaagta 12900
tttatacttt attaaaggat taaacaacct aaatagaggt atggtacttg gtagtttagc 12960
tgccacagta cgtctacaag ctggtaatgc aacagaagtg cctgccaatt caactgtatt 13020
atctttctgt gcttttgctg tagatgctgc taaagcttac aaagattatc tagctagtgg 13080
gggacaacca atcactaatt gtgttaagat gttgtgtaca cacactggta ctggtcaggc 13140
aataacagtt acaccggaag ccaatatgga tcaagaatcc tttggtggtg catcgtgttg 13200
tctgtactgc cgttgccaca tagatcatcc aaatcctaaa ggattttgtg acttaaaagg 13260
taagtatgta caaataccta caacttgtgc taatgaccct gtgggtttta cacttaaaaa 13320
cacagtctgt accgtctgcg gtatgtggaa aggttatggc tgtagttgtg atcaactccg 13380
cgaacccatg cttcagtcag ctgatgcaca atcgttttta aacgggtttg cggtgtaagt 13440
gcagcccgtc ttacaccgtg cggcacaggc actagtactg atgtcgtata cagggctttt 13500
gacatctaca atgataaagt agctggtttt gctaaattcc taaaaactaa ttgttgtcgc 13560
ttccaagaaa aggacgaaga tgacaattta attgattctt actttgtagt taagagacac 13620
actttctcta actaccaaca tgaagaaaca atttataatt tacttaagga ttgtccagct 13680
gttgctaaac atgacttctt taagtttaga atagacggtg acatggtacc acatatatca 13740
cgtcaacgtc ttactaaata cacaatggca gacctcgtct atgctttaag gcattttgat 13800
gaaggtaatt gtgatacatt aaaagaaata cttgtcacat acaattgttg tgatgatgat 13860
tatttcaata aaaaggactg gtatgatttt gtagaaaacc cagatatatt acgcgtatac 13920
gccaacttag gtgaacgtgt acgccaagct ttgttaaaaa cagtacaatt ctgtgatgcc 13980
atgcgaaatg ctggtattgt tggtgtactg acattagata atcaagatct caatggtaac 14040
tggtatgatt tcggtgattt catacaaacc acgccaggta gtggagttcc tgttgtagat 14100
tcttattatt cattgttaat gcctatatta accttgacca gggctttaac tgcagagtca 14160
catgttgaca ctgacttaac aaagccttac attaagtggg atttgttaaa atatgacttc 14220
acggaagaga ggttaaaact ctttgaccgt tattttaaat attgggatca gacataccac 14280
ccaaattgtg ttaactgttt ggatgacaga tgcattctgc attgtgcaaa ctttaatgtt 14340
ttattctcta cagtgttccc acttacaagt tttggaccac tagtgagaaa aatatttgtt 14400
gatggtgttc catttgtagt ttcaactgga taccacttca gagagctagg tgttgtacat 14460
aatcaggatg taaacttaca tagctctaga cttagtttta aggaattact tgtgtatgct 14520
gctgaccctg ctatgcacgc tgcttctggt aatctattac tagataaacg cactacgtgc 14580
ttttcagtag ctgcacttac taacaatgtt gcttttcaaa ctgtcaaacc cggtaatttt 14640
aacaaagact tctatgactt tgctgtgtct aagggtttct ttaaggaagg aagttctgtt 14700
gaattaaaac acttcttctt tgctcaggat ggtaatgctg ctatcagcga ttatgactac 14760
tatcgttata atctaccaac aatgtgtgat atcagacaac tactatttgt agttgaagtt 14820
gttgataagt actttgattg ttacgatggt ggctgtatta atgctaacca agtcatcgtc 14880
aacaacctag acaaatcagc tggttttcca tttaataaat ggggtaaggc tagactttat 14940
tatgattcaa tgagttatga ggatcaagat gcacttttcg catatacaaa acgtaatgtc 15000
atccctacta taactcaaat gaatcttaag tatgccatta gtgcaaagaa tagagctcgc 15060
accgtagctg gtgtctctat ctgtagtact atgaccaata gacagtttca tcaaaaatta 15120
ttgaaatcaa tagccgccac tagaggagct actgtagtaa ttggaacaag caaattctat 15180
ggtggttggc acaacatgtt aaaaactgtt tatagtgatg tagaaaaccc tcaccttatg 15240
ggttgggatt atcctaaatg tgatagagcc atgcctaaca tgcttagaat tatggcctca 15300
cttgttcttg ctcgcaaaca tacaacgtgt tgtagcttgt cacaccgttt ctatagatta 15360
gctaatgagt gtgctcaagt attgagtgaa atggtcatgt gtggcggttc actatatgtt 15420
aaaccaggtg gaacctcatc aggagatgcc acaactgctt atgctaatag tgtttttaac 15480
atttgtcaag ctgtcacggc caatgttaat gcacttttat ctactgatgg taacaaaatt 15540
gccgataagt atgtccgcaa tttacaacac agactttatg agtgtctcta tagaaataga 15600
gatgttgaca cagactttgt gaatgagttt tacgcatatt tgcgtaaaca tttctcaatg 15660
atgatactct ctgacgatgc tgttgtgtgt ttcaatagca cttatgcatc tcaaggtcta 15720
gtggctagca taaagaactt taagtcagtt ctttattatc aaaacaatgt ttttatgtct 15780
gaagcaaaat gttggactga gactgacctt actaaaggac ctcatgaatt ttgctctcaa 15840
catacaatgc tagttaaaca gggtgatgat tatgtgtacc ttccttaccc agatccatca 15900
agaatcctag gggccggctg ttttgtagat gatatcgtaa aaacagatgg tacacttatg 15960
attgaacggt tcgtgtcttt agctatagat gcttacccac ttactaaaca tcctaatcag 16020
gagtatgctg atgtctttca tttgtactta caatacataa gaaagctaca tgatgagtta 16080
acaggacaca tgttagacat gtattctgtt atgcttacta atgataacac ttcaaggtat 16140
tgggaacctg agttttatga ggctatgtac acaccgcata cagtcttaca ggctgttggg 16200
gcttgtgttc tttgcaattc acagacttca ttaagatgtg gtgcttgcat acgtagacca 16260
ttcttatgtt gtaaatgctg ttacgaccat gtcatatcaa catcacataa attagtcttg 16320
tctgttaatc cgtatgtttg caatgctcca ggttgtgatg tcacagatgt gactcaactt 16380
tacttaggag gtatgagcta ttattgtaaa tcacataaac cacccattag ttttccattg 16440
tgtgctaatg gacaagtttt tggtttatat aaaaatacat gtgttggtag cgataatgtt 16500
actgacttta atgcaattgc aacatgtgac tggacaaatg ctggtgatta cattttagct 16560
aacacctgta ctgaaagact caagcttttt gcagcagaaa cgctcaaagc tactgaggag 16620
acatttaaac tgtcttatgg tattgctact gtacgtgaag tgctgtctga cagagaatta 16680
catctttcat gggaagttgg taaacctaga ccaccactta accgaaatta tgtctttact 16740
ggttatcgtg taactaaaaa cagtaaagta caaataggag agtacacctt tgaaaaaggt 16800
gactatggtg atgctgttgt ttaccgaggt acaacaactt acaaattaaa tgttggtgat 16860
tattttgtgc tgacatcaca tacagtaatg ccattaagtg cacctacact agtgccacaa 16920
gagcactatg ttagaattac tggcttatac ccaacactca atatctcaga tgagttttct 16980
agcaatgttg caaattatca aaaggttggt atgcaaaagt attctacact ccagggacca 17040
cctggtactg gtaagagtca ttttgctatt ggcctagctc tctactaccc ttctgctcgc 17100
atagtgtata cagcttgctc tcatgccgct gttgatgcac tatgtgagaa ggcattaaaa 17160
tatttgccta tagataaatg tagtagaatt atacctgcac gtgctcgtgt agattgtttt 17220
gataaattca aagtgaattc aacattagaa cagtatgtct tttgtactgt aaatgcattg 17280
cctgagacga cagcagatat agttgtcttt gatgaaattt caatggccac aaattatgat 17340
ttgagtgttg tcaatgccag attacgtgct aagcactatg tgtacattgg cgaccctgct 17400
caattacctg caccacgcac attgctaact aagggcacac tagaaccaga atatttcaat 17460
tcagtgtgta gacttatgaa aactataggt ccagacatgt tcctcggaac ttgtcggcgt 17520
tgtcctgctg aaattgttga cactgtgagt gctttggttt atgataataa gcttaaagca 17580
cataaagaca aatcagctca atgctttaaa atgttttata agggtgttat cacgcatgat 17640
gtttcatctg caattaacag gccacaaata ggcgtggtaa gagaattcct tacacgtaac 17700
cctgcttgga gaaaagctgt ctttatttca ccttataatt cacagaatgc tgtagcctca 17760
aagattttgg gactaccaac tcaaactgtt gattcatcac agggctcaga atatgactat 17820
gtcatattca ctcaaaccac tgaaacagct cactcttgta atgtaaacag atttaatgtt 17880
gctattacca gagcaaaagt aggcatactt tgcataatgt ctgatagaga cctttatgac 17940
aagttgcaat ttacaagtct tgaaattcca cgtaggaatg tggcaacttt acaagctgaa 18000
aatgtaacag gactctttaa agattgtagt aaggtaatca ctgggttaca tcctacacag 18060
gcacctacac acctcagtgt tgacactaaa ttcaaaactg aaggtttatg tgttgacata 18120
cctggcatac ctaaggacat gacctataga agactcatct ctatgatggg ttttaaaatg 18180
aattatcaag ttaatggtta ccctaacatg tttatcaccc gcgaagaagc tataagacat 18240
gtacgtgcat ggattggctt cgatgtcgag gggtgtcatg ctactagaga agctgttggt 18300
accaatttac ctttacagct aggtttttct acaggtgtta acctagttgc tgtacctaca 18360
ggttatgttg atacacctaa taatacagat ttttccagag ttagtgctaa accaccgcct 18420
ggagatcaat ttaaacacct cataccactt atgtacaaag gacttccttg gaatgtagtg 18480
cgtataaaga ttgtacaaat gttaagtgac acacttaaaa atctctctga cagagtcgta 18540
tttgtcttat gggcacatgg ctttgagttg acatctatga agtattttgt gaaaatagga 18600
cctgagcgca cctgttgtct atgtgataga cgtgccacat gcttttccac tgcttcagac 18660
acttatgcct gttggcatca ttctattgga tttgattacg tctataatcc gtttatgatt 18720
gatgttcaac aatggggttt tacaggtaac ctacaaagca accatgatct gtattgtcaa 18780
gtccatggta atgcacatgt agctagttgt gatgcaatca tgactaggtg tctagctgtc 18840
cacgagtgct ttgttaagcg tgttgactgg actattgaat atcctataat tggtgatgaa 18900
ctgaagatta atgcggcttg tagaaaggtt caacacatgg ttgttaaagc tgcattatta 18960
gcagacaaat tcccagttct tcacgacatt ggtaacccta aagctattaa gtgtgtacct 19020
caagctgatg tagaatggaa gttctatgat gcacagcctt gtagtgacaa agcttataaa 19080
atagaagaat tattctattc ttatgccaca cattctgaca aattcacaga tggtgtatgc 19140
ctattttgga attgcaatgt cgatagatat cctgctaatt ccattgtttg tagatttgac 19200
actagagtgc tatctaacct taacttgcct ggttgtgatg gtggcagttt gtatgtaaat 19260
aaacatgcat tccacacacc agcttttgat aaaagtgctt ttgttaattt aaaacaatta 19320
ccatttttct attactctga cagtccatgt gagtctcatg gaaaacaagt agtgtcagat 19380
atagattatg taccactaaa gtctgctacg tgtataacac gttgcaattt aggtggtgct 19440
gtctgtagac atcatgctaa tgagtacaga ttgtatctcg atgcttataa catgatgatc 19500
tcagctggct ttagcttgtg ggtttacaaa caatttgata cttataacct ctggaacact 19560
tttacaagac ttcagagttt agaaaatgtg gcttttaatg ttgtaaataa gggacacttt 19620
gatggacaac agggtgaagt accagtttct atcattaata acactgttta cacaaaagtt 19680
gatggtgttg atgtagaatt gtttgaaaat aaaacaacat tacctgttaa tgtagcattt 19740
gagctttggg ctaagcgcaa cattaaacca gtaccagagg tgaaaatact caataatttg 19800
ggtgtggaca ttgctgctaa tactgtgatc tgggactaca aaagagatgc tccagcacat 19860
atatctacta ttggtgtttg ttctatgact gacatagcca agaaaccaac tgaaacgatt 19920
tgtgcaccac tcactgtctt ttttgatggt agagttgatg gtcaagtaga cttatttaga 19980
aatgcccgta atggtgttct tattacagaa ggtagtgtta aaggtttaca accatctgta 20040
ggtcccaaac aagctagtct taatggagtc acattaattg gagaagccgt aaaaacacag 20100
ttcaattatt ataagaaagt tgatggtgtt gtccaacaat tacctgaaac ttactttact 20160
cagagtagaa atttacaaga atttaaaccc aggagtcaaa tggaaattga tttcttagaa 20220
ttagctatgg atgaattcat tgaacggtat aaattagaag gctatgcctt cgaacatatc 20280
gtttatggag attttagtca tagtcagtta ggtggtttac atctactgat tggactagct 20340
aaacgtttta aggaatcacc ttttgaatta gaagatttta ttcctatgga cagtacagtt 20400
aaaaactatt tcataacaga tgcgcaaaca ggttcatcta agtgtgtgtg ttctgttatt 20460
gatttattac ttgatgattt tgttgaaata ataaaatccc aagatttatc tgtagtttct 20520
aaggttgtca aagtgactat tgactataca gaaatttcat ttatgctttg gtgtaaagat 20580
ggccatgtag aaacatttta cccaaaatta caatctagtc aagcgtggca accgggtgtt 20640
gctatgccta atctttacaa aatgcaaaga atgctattag aaaagtgtga ccttcaaaat 20700
tatggtgata gtgcaacatt acctaaaggc ataatgatga atgtcgcaaa atatactcaa 20760
ctgtgtcaat atttaaacac attaacatta gctgtaccct ataatatgag agttatacat 20820
tttggtgctg gttctgataa aggagttgca ccaggtacag ctgttttaag acagtggttg 20880
cctacgggta cgctgcttgt cgattcagat cttaatgact ttgtctctga tgcagattca 20940
actttgattg gtgattgtgc aactgtacat acagctaata aatgggatct cattattagt 21000
gatatgtacg accctaagac taaaaatgtt acaaaagaaa atgactctaa agagggtttt 21060
ttcacttaca tttgtgggtt tatacaacaa aagctagctc ttggaggttc cgtggctata 21120
aagataacag aacattcttg gaatgctgat ctttataagc tcatgggaca cttcgcatgg 21180
tggacagcct ttgttactaa tgtgaatgcg tcatcatctg aagcattttt aattggatgt 21240
aattatcttg gcaaaccacg cgaacaaata gatggttatg tcatgcatgc aaattacata 21300
ttttggagga atacaaatcc aattcagttg tcttcctatt ctttatttga catgagtaaa 21360
tttcccctta aattaagggg tactgctgtt atgtctttaa aagaaggtca aatcaatgat 21420
atgattttat ctcttcttag taaaggtaga cttataatta gagaaaacaa cagagttgtt 21480
atttctagtg atgttcttgt taacaactaa acgaacaatg tttgtttttc ttgttttatt 21540
gccactagtc tctagtcagt gtgttaattt tacaaacaga actcaattac cctctgcata 21600
cactaattct ttcacacgtg gtgtttatta ccctgacaaa gttttcagat cctcagtttt 21660
acattcaact caggacttgt tcttaccttt cttttccaat gttacttggt tccatgctat 21720
acatgtctct gggaccaatg gtactaagag gtttgataac cctgtcctac catttaatga 21780
tggtgtttat tttgcttcca ctgagaagtc taacataata agaggctgga tttttggtac 21840
tactttagat tcgaagaccc agtccctact tattgttaat aacgctacta atgttgttat 21900
taaagtctgt gaatttcaat tttgtaatta tccatttttg ggtgtttatt accacaaaaa 21960
caacaaaagt tggatggaaa gtgagttcag agtttattct agtgcgaata attgcacttt 22020
tgaatatgtc tctcagcctt ttcttatgga ccttgaagga aaacagggta atttcaaaaa 22080
tcttagtgaa tttgtgttta agaatattga tggttatttt aaaatatatt ctaagcacac 22140
gcctattaat ttagtgcgtg atctccctca gggtttttcg gctttagaac cattggtaga 22200
tttgccaata ggtattaaca tcactaggtt tcaaacttta cttgctttac atagaagtta 22260
tttgactcct ggtgattctt cttcaggttg gacagctggt gctgcagctt attatgtggg 22320
ttatcttcaa cctaggactt ttctattaaa atataatgaa aatggaacca ttacagatgc 22380
tgtagactgt gcacttgacc ctctctcaga aacaaagtgt acgttgaaat ccttcactgt 22440
agaaaaagga atctatcaaa cttctaactt tagagtccaa ccaacagaat ctattgttag 22500
atttcctaat attacaaact tgtgcccttt tggtgaagtt tttaacgcca ccagatttgc 22560
atctgtttat gcttggaaca ggaagagaat cagcaactgt gttgctgatt attctgtcct 22620
atataattcc gcatcatttt ccacttttaa gtgttatgga gtgtctccta ctaaattaaa 22680
tgatctctgc tttactaatg tctatgcaga ttcatttgta attagaggtg atgaagtcag 22740
acaaatcgct ccagggcaaa ctggaacgat tgctgattat aattataaat taccagatga 22800
ttttacaggc tgcgttatag cttggaattc taacaatctt gattctaagg ttggtggtaa 22860
ttataattac ctgtatagat tgtttaggaa gtctaatctc aaaccttttg agagagatat 22920
ttcaactgaa atctatcagg ccggtagcac accttgtaat ggtgttaaag gttttaattg 22980
ttactttcct ttacaatcat atggtttcca acccacttat ggtgttggtt accaaccata 23040
cagagtagta gtactttctt ttgaacttct acatgcacca gcaactgttt gtggacctaa 23100
aaagtctact aatttggtta aaaacaaatg tgtcaatttc aacttcaatg gtttaacagg 23160
cacaggtgtt cttactgagt ctaacaaaaa gtttctgcct ttccaacaat ttggcagaga 23220
cattgctgac actactgatg ctgtccgtga tccacagaca cttgagattc ttgacattac 23280
accatgttct tttggtggtg tcagtgttat aacaccagga acaaatactt ctaatcaggt 23340
tgctgttctt tatcagggtg ttaactgcac agaagtccct gttgctattc atgcagatca 23400
acttactcct acttggcgtg tttattctac aggttctaat gtttttcaaa cacgtgcagg 23460
ctgtttaata ggggctgaat atgtcaacaa ctcatatgag tgtgacatac ccattggtgc 23520
aggtatatgc gctagttatc agactcagac taattctcct cggcgggcac gtagtgtagc 23580
tagtcaatcc atcattgcct acactatgtc acttggtgca gaaaattcag ttgcttactc 23640
taataactct attgccatac ccacaaattt tactattagt gttaccacag aaattctacc 23700
agtgtctatg accaagacat cagtagattg tacaatgtac atttgtggtg attcaactga 23760
atgcagcaat cttttgttgc aatatggcag tttttgtaca caattaaacc gtgctttaac 23820
tggaatagct gttgaacaag acaaaaacac ccaagaagtt tttgcacaag tcaaacaaat 23880
ttacaaaaca ccaccaatta aagattttgg tggttttaat ttttcacaaa tattaccaga 23940
tccatcaaaa ccaagcaaga ggtcatttat tgaagatcta cttttcaaca aagtgacact 24000
tgcagatgct ggcttcatca aacaatatgg tgattgcctt ggtgatattg ctgctagaga 24060
cctcatttgt gcacaaaagt ttaacggcct tactgttttg ccacctttgc tcacagatga 24120
aatgattgct caatacactt ctgcactgtt agcgggtaca atcacttctg gttggacctt 24180
tggtgcaggt gctgcattac aaataccatt tgctatgcaa atggcttata ggtttaatgg 24240
tattggagtt acacagaatg ttctctatga gaaccaaaaa ttgattgcca accaatttaa 24300
tagtgctatt ggcaaaattc aagactcact ttcttccaca gcaagtgcac ttggaaaact 24360
tcaagatgtg gtcaaccaaa atgcacaagc tttaaacacg cttgttaaac aacttagctc 24420
caattttggt gcaatttcaa gtgttttaaa tgatatcctt tcacgtcttg acaaagttga 24480
ggctgaagtg caaattgata ggttgatcac aggcagactt caaagtttgc agacatatgt 24540
gactcaacaa ttaattagag ctgcagaaat cagagcttct gctaatcttg ctgctattaa 24600
aatgtcagag tgtgtacttg gacaatcaaa aagagttgat ttttgtggaa agggctatca 24660
tcttatgtcc ttccctcagt cagcacctca tggtgtagtc ttcttgcatg tgacttatgt 24720
ccctgcacaa gaaaagaact tcacaactgc tcctgccatt tgtcatgatg gaaaagcaca 24780
ctttcctcgt gaaggtgtct ttgtttcaaa tggcacacac tggtttgtaa cacaaaggaa 24840
tttttatgaa ccacaaatca ttactacaga caacacattt gtgtctggta actgtgatgt 24900
tgtaatagga attgtcaaca acacagttta tgatcctttg caacctgaat tagactcatt 24960
caaggaggag ttagataaat attttaagaa tcatacatca ccagatgttg atttaggtga 25020
catctctggc attaatgctt catttgtaaa cattcaaaaa gaaattgacc gcctcaatga 25080
ggttgccaag aatttaaatg aatctctcat cgatctccaa gaacttggaa agtatgagca 25140
gtatataaaa tggccatggt acatttggct aggttttata gctggcttga ttgccatagt 25200
aatggtgaca attatgcttt gctgtatgac cagttgctgt agttgtctca agggctgttg 25260
ttcttgtgga tcctgctgca aatttgatga agacgactct gagccagtgc tcaaaggagt 25320
caaattacat tacacataaa cgaacttatg gatttgttta tgagaatctt cacaattgga 25380
actgtaactt tgaagcaagg tgaaatcaag gatgctactc cttcagattt tgttcgcgct 25440
actgcaacga taccgataca agcctcactc cctttcggat ggcttattgt tggcgttgca 25500
cttcttgctg tttttcagag cgcttccaaa atcataaccc tcaaaaagag atggcaacta 25560
gcactctcca agggtgttca ctttgtttgc aacttgctgt tgttgtttgt aacagtttac 25620
tcacaccttt tgctcgttgc tgctggcctt gaagcccctt ttctctatct ttatgcttta 25680
gtctacttct tgcagagtat aaactttgta agaataataa tgaggctttg gctttgctgg 25740
aaatgccgtt ccaaaaaccc attactttat gatgccaact attttctttg ctggcatact 25800
aattgttacg actattgtat accttacaat agtgtaactt cttcaattgt cattacttca 25860
ggtgatggca caacaagtcc tatttctgaa catgactacc agattggtgg ttatactgaa 25920
aaatgggaat ctggagtaaa agactgtgtt gtattacaca gttacttcac ttcagactat 25980
taccagctgt actcaactca attgagtaca gacactggtg ttgaacatgt taccttcttc 26040
atctacaata aaattgttga tgagcctgaa gaacatgtcc aaattcacac aatcgacggt 26100
tcacccggag ttgttaatcc agtaatggaa ccaatttatg atgaaccgac gacgactact 26160
agcgtgcctt tgtaagcaca agctgatgag tacgaactta tgtactcatt cgtttcggaa 26220
gagacaggta cgttaatagt taatagcgta cttctttttc ttgctttcgt ggtattcttg 26280
ctagttacac tagccatcct tactgcgctt cgattgtgtg cgtactgctg caatattgtt 26340
aacgtgagtc ttgtaaaacc ttctttttac gtttactctc gtgttaaaaa tctgaattct 26400
tctagagttc ctgatcttct ggtctaaacg aactaaatat tatattagtt tttctgtttg 26460
gaactttaat tttagccatg gcagattcca acggtactat taccgttgaa gagcttaaaa 26520
agctccttga acaatggaac ctagtaatag gtttcctatt ccttacatgg atttgtcttc 26580
tacaatttgc ctatgccaac aggaataggt ttttgtatat aattaagtta attttcctct 26640
ggctgttatg gccagtaact ttagcttgtt ttgtgcttgc tgctgtttac agaataaatt 26700
ggatcaccgg tggaattgct atcgcaatgg cttgtcttgt aggcttgatg tggctcagct 26760
acttcattgc ttctttcaga ctgtttgcgc gtacgcgttc catgtggtca ttcaatccag 26820
aaactaacat tcttctcaac gtgccactcc atggcactat tctgaccaga ccgcttctag 26880
aaagtgaact cgtaatcgga gctgtgatcc ttcgtggaca tcttcgtatt gctggacacc 26940
atctaggacg ctgtgacatc aaggacctgc ctaaagaaat cactgttgct acatcacgaa 27000
cgctttctta ttacaaattg ggagcttcgc agcgtgtagc aggtgactca ggttttgctg 27060
catacagtcg ctacaggatt ggcaactata aattaaacac agaccattcc agtagcagtg 27120
acaatattgc tttgcttgta cagtaagtga caacagatgt ttcatctcgt tgactttcag 27180
gttactatag cagagatatt actaattatt atgaggactt ttaaagtttc catttggaat 27240
cttgattaca tcataaacct cataattaaa aatttatcta agtcactaac tgagaataaa 27300
tattctcaat tagatgaaga gcaaccaatg gagattgatt aaacgaacat gaaaattatt 27360
cttttcttgg cactgataac actcgctact tgtgagcttt atcactacca agagtgtgtt 27420
agaggtacaa cagtactttt aaaagaacct tgctcttctg gaacatacga gggcaattca 27480
ccatttcatc ctctagctga taacaaattt gcactgactt gctttagcac tcaatttgct 27540
tttgcttgtc ctgacggcgt aaaacacgtc tatcagttac gtgccagatc agtttcacct 27600
aaactgttca tcagacaaga ggaagttcaa gaactttact ctccaatttt tcttattgtt 27660
gcggcaatag tgtttataac actttgcttc acactcaaaa gaaagacaga atgattgaac 27720
tttcattaat tgacttctat ttgtgctttt tagcctttct gctattcctt gttttaatta 27780
tgcttattat cttttggttc tcacttgaac tgcaagatca taatgaaact tgtcacgcct 27840
aaacgaacat gaaatttctt gttttcttag gaatcatcac aactgtagct gcatttcacc 27900
aagaatgtag tttacagtca tgtactcaac atcaaccata tgtagttgat gacccgtgtc 27960
ctattcactt ctattctaaa tggtatatta gagtaggagc tagaaaatca gcacctttaa 28020
ttgaattgtg cgtggatgag gctggttcta aatcacccat tcagtacatc gatatcggta 28080
attatacagt ttcctgttta ccttttacaa ttaattgcca gaaacctaaa ttgggtagtc 28140
ttgtagtgcg ttgttcgttc tatgaagact ttttagagta tcatgacgtt cgtgttgttt 28200
tagatttcat ctaaacgaac aaacaaacta aaatgtctga taatggaccc caaaatcagc 28260
gaaatgcacc ccgcattacg tttggtggac cctcagattc aactggcagt aaccagaatg 28320
gagaacgcag tggggcgcga tcaaaacaac gtcggcccca aggtttaccc aataatactg 28380
cgtcttggtt caccgctctc actcaacatg gcaaggaaga ccttaaattc cctcgaggac 28440
aaggcgttcc aattaacacc aatagcagtc gagatgacca aattggctac taccgaagag 28500
ctaccagacg aattcgtggt ggtgacggta aaatgaaaga tctcagtcca agatggtatt 28560
tctactacct aggaactggg ccagaagctg gacttcccta tggtgctaac aaagacggca 28620
tcatatgggt tgcaactgag ggagccttga atacaccaaa agatcacatt ggcacccgca 28680
atcctgctaa caatgctgca atcgtgctac aacttcctca aggaacaaca ttgccaaaag 28740
gcttctacgc agaagggagc agaggcggca gtcaagcctc ttctcgttcc tcatcacgta 28800
gtcgcaacag ttcaagaaat tcaactccag gcagctctaa acgaacttct cctgctagaa 28860
tggctggcaa tggcggtgat gctgctcttg ctttgctgct gcttgacaga ttgaaccagc 28920
ttgagagcaa aatgtctggt aaaggccaac aacaacaagg ccaaactgtc actaagaaat 28980
ctgctgctga ggcttctaag aagcctcggc aaaaacgtac tgccactaaa gcatacaatg 29040
taacacaagc tttcggcaga cgtggtccag aacaaaccca aggaaatttt ggggaccagg 29100
aactaatcag acaaggaact gattacaaac attggccgca aattgcacaa tttgccccca 29160
gcgcttcagc gttcttcgga atgtcgcgca ttggcatgga agtcacacct tcgggaacgt 29220
ggttgaccta cacaggtgcc atcaaattgg atgacaaaga tccaaatttc aaagatcaag 29280
tcattttgct gaataagcat attgacgcat acaaaacatt cccaccaaca gagcctaaaa 29340
aggacaaaaa gaagaaggct gatgaaactc aagccttacc gcagagacag aagaaacagc 29400
aaactgtgac tcttcttcct gctgcagatt tggatgattt ctccaaacaa ttgcaacaat 29460
ccatgagcag tgctgactca actcaggcct aaactcatgc agaccacaca aggcagatgg 29520
gctatataaa cgttttcgct tttccgttta cgatatatag tctactcttg tgcagaatga 29580
attctcgtaa ctacatagca caagtagatg tagttaactt taatctcaca tagcaatctt 29640
taatcagtgt gtaacattag ggaggacttg aaagagccac cacattttca ccgaggccac 29700
gcggagtacg atcgagtgta cagtgaacaa tgctagggag agctgcctat atggaagagc 29760
cctaatgtgt aaaattaatt ttagtagtgc taaccccatg tgattttaat agcttctta 29819
<210> 21
<211> 1273
<212> PRT
<213> SARS-CoV2
<400> 21
Met Phe Val Phe Leu Val Leu Leu Pro Leu Val Ser Ser Gln Cys Val
1 5 10 15
Asn Phe Thr Asn Arg Thr Gln Leu Pro Ser Ala Tyr Thr Asn Ser Phe
20 25 30
Thr Arg Gly Val Tyr Tyr Pro Asp Lys Val Phe Arg Ser Ser Val Leu
35 40 45
His Ser Thr Gln Asp Leu Phe Leu Pro Phe Phe Ser Asn Val Thr Trp
50 55 60
Phe His Ala Ile His Val Ser Gly Thr Asn Gly Thr Lys Arg Phe Asp
65 70 75 80
Asn Pro Val Leu Pro Phe Asn Asp Gly Val Tyr Phe Ala Ser Thr Glu
85 90 95
Lys Ser Asn Ile Ile Arg Gly Trp Ile Phe Gly Thr Thr Leu Asp Ser
100 105 110
Lys Thr Gln Ser Leu Leu Ile Val Asn Asn Ala Thr Asn Val Val Ile
115 120 125
Lys Val Cys Glu Phe Gln Phe Cys Asn Tyr Pro Phe Leu Gly Val Tyr
130 135 140
Tyr His Lys Asn Asn Lys Ser Trp Met Glu Ser Glu Phe Arg Val Tyr
145 150 155 160
Ser Ser Ala Asn Asn Cys Thr Phe Glu Tyr Val Ser Gln Pro Phe Leu
165 170 175
Met Asp Leu Glu Gly Lys Gln Gly Asn Phe Lys Asn Leu Ser Glu Phe
180 185 190
Val Phe Lys Asn Ile Asp Gly Tyr Phe Lys Ile Tyr Ser Lys His Thr
195 200 205
Pro Ile Asn Leu Val Arg Asp Leu Pro Gln Gly Phe Ser Ala Leu Glu
210 215 220
Pro Leu Val Asp Leu Pro Ile Gly Ile Asn Ile Thr Arg Phe Gln Thr
225 230 235 240
Leu Leu Ala Leu His Arg Ser Tyr Leu Thr Pro Gly Asp Ser Ser Ser
245 250 255
Gly Trp Thr Ala Gly Ala Ala Ala Tyr Tyr Val Gly Tyr Leu Gln Pro
260 265 270
Arg Thr Phe Leu Leu Lys Tyr Asn Glu Asn Gly Thr Ile Thr Asp Ala
275 280 285
Val Asp Cys Ala Leu Asp Pro Leu Ser Glu Thr Lys Cys Thr Leu Lys
290 295 300
Ser Phe Thr Val Glu Lys Gly Ile Tyr Gln Thr Ser Asn Phe Arg Val
305 310 315 320
Gln Pro Thr Glu Ser Ile Val Arg Phe Pro Asn Ile Thr Asn Leu Cys
325 330 335
Pro Phe Gly Glu Val Phe Asn Ala Thr Arg Phe Ala Ser Val Tyr Ala
340 345 350
Trp Asn Arg Lys Arg Ile Ser Asn Cys Val Ala Asp Tyr Ser Val Leu
355 360 365
Tyr Asn Ser Ala Ser Phe Ser Thr Phe Lys Cys Tyr Gly Val Ser Pro
370 375 380
Thr Lys Leu Asn Asp Leu Cys Phe Thr Asn Val Tyr Ala Asp Ser Phe
385 390 395 400
Val Ile Arg Gly Asp Glu Val Arg Gln Ile Ala Pro Gly Gln Thr Gly
405 410 415
Thr Ile Ala Asp Tyr Asn Tyr Lys Leu Pro Asp Asp Phe Thr Gly Cys
420 425 430
Val Ile Ala Trp Asn Ser Asn Asn Leu Asp Ser Lys Val Gly Gly Asn
435 440 445
Tyr Asn Tyr Leu Tyr Arg Leu Phe Arg Lys Ser Asn Leu Lys Pro Phe
450 455 460
Glu Arg Asp Ile Ser Thr Glu Ile Tyr Gln Ala Gly Ser Thr Pro Cys
465 470 475 480
Asn Gly Val Lys Gly Phe Asn Cys Tyr Phe Pro Leu Gln Ser Tyr Gly
485 490 495
Phe Gln Pro Thr Tyr Gly Val Gly Tyr Gln Pro Tyr Arg Val Val Val
500 505 510
Leu Ser Phe Glu Leu Leu His Ala Pro Ala Thr Val Cys Gly Pro Lys
515 520 525
Lys Ser Thr Asn Leu Val Lys Asn Lys Cys Val Asn Phe Asn Phe Asn
530 535 540
Gly Leu Thr Gly Thr Gly Val Leu Thr Glu Ser Asn Lys Lys Phe Leu
545 550 555 560
Pro Phe Gln Gln Phe Gly Arg Asp Ile Ala Asp Thr Thr Asp Ala Val
565 570 575
Arg Asp Pro Gln Thr Leu Glu Ile Leu Asp Ile Thr Pro Cys Ser Phe
580 585 590
Gly Gly Val Ser Val Ile Thr Pro Gly Thr Asn Thr Ser Asn Gln Val
595 600 605
Ala Val Leu Tyr Gln Gly Val Asn Cys Thr Glu Val Pro Val Ala Ile
610 615 620
His Ala Asp Gln Leu Thr Pro Thr Trp Arg Val Tyr Ser Thr Gly Ser
625 630 635 640
Asn Val Phe Gln Thr Arg Ala Gly Cys Leu Ile Gly Ala Glu Tyr Val
645 650 655
Asn Asn Ser Tyr Glu Cys Asp Ile Pro Ile Gly Ala Gly Ile Cys Ala
660 665 670
Ser Tyr Gln Thr Gln Thr Asn Ser Pro Arg Arg Ala Arg Ser Val Ala
675 680 685
Ser Gln Ser Ile Ile Ala Tyr Thr Met Ser Leu Gly Ala Glu Asn Ser
690 695 700
Val Ala Tyr Ser Asn Asn Ser Ile Ala Ile Pro Thr Asn Phe Thr Ile
705 710 715 720
Ser Val Thr Thr Glu Ile Leu Pro Val Ser Met Thr Lys Thr Ser Val
725 730 735
Asp Cys Thr Met Tyr Ile Cys Gly Asp Ser Thr Glu Cys Ser Asn Leu
740 745 750
Leu Leu Gln Tyr Gly Ser Phe Cys Thr Gln Leu Asn Arg Ala Leu Thr
755 760 765
Gly Ile Ala Val Glu Gln Asp Lys Asn Thr Gln Glu Val Phe Ala Gln
770 775 780
Val Lys Gln Ile Tyr Lys Thr Pro Pro Ile Lys Asp Phe Gly Gly Phe
785 790 795 800
Asn Phe Ser Gln Ile Leu Pro Asp Pro Ser Lys Pro Ser Lys Arg Ser
805 810 815
Phe Ile Glu Asp Leu Leu Phe Asn Lys Val Thr Leu Ala Asp Ala Gly
820 825 830
Phe Ile Lys Gln Tyr Gly Asp Cys Leu Gly Asp Ile Ala Ala Arg Asp
835 840 845
Leu Ile Cys Ala Gln Lys Phe Asn Gly Leu Thr Val Leu Pro Pro Leu
850 855 860
Leu Thr Asp Glu Met Ile Ala Gln Tyr Thr Ser Ala Leu Leu Ala Gly
865 870 875 880
Thr Ile Thr Ser Gly Trp Thr Phe Gly Ala Gly Ala Ala Leu Gln Ile
885 890 895
Pro Phe Ala Met Gln Met Ala Tyr Arg Phe Asn Gly Ile Gly Val Thr
900 905 910
Gln Asn Val Leu Tyr Glu Asn Gln Lys Leu Ile Ala Asn Gln Phe Asn
915 920 925
Ser Ala Ile Gly Lys Ile Gln Asp Ser Leu Ser Ser Thr Ala Ser Ala
930 935 940
Leu Gly Lys Leu Gln Asp Val Val Asn Gln Asn Ala Gln Ala Leu Asn
945 950 955 960
Thr Leu Val Lys Gln Leu Ser Ser Asn Phe Gly Ala Ile Ser Ser Val
965 970 975
Leu Asn Asp Ile Leu Ser Arg Leu Asp Lys Val Glu Ala Glu Val Gln
980 985 990
Ile Asp Arg Leu Ile Thr Gly Arg Leu Gln Ser Leu Gln Thr Tyr Val
995 1000 1005
Thr Gln Gln Leu Ile Arg Ala Ala Glu Ile Arg Ala Ser Ala Asn Leu
1010 1015 1020
Ala Ala Ile Lys Met Ser Glu Cys Val Leu Gly Gln Ser Lys Arg Val
1025 1030 1035 1040
Asp Phe Cys Gly Lys Gly Tyr His Leu Met Ser Phe Pro Gln Ser Ala
1045 1050 1055
Pro His Gly Val Val Phe Leu His Val Thr Tyr Val Pro Ala Gln Glu
1060 1065 1070
Lys Asn Phe Thr Thr Ala Pro Ala Ile Cys His Asp Gly Lys Ala His
1075 1080 1085
Phe Pro Arg Glu Gly Val Phe Val Ser Asn Gly Thr His Trp Phe Val
1090 1095 1100
Thr Gln Arg Asn Phe Tyr Glu Pro Gln Ile Ile Thr Thr Asp Asn Thr
1105 1110 1115 1120
Phe Val Ser Gly Asn Cys Asp Val Val Ile Gly Ile Val Asn Asn Thr
1125 1130 1135
Val Tyr Asp Pro Leu Gln Pro Glu Leu Asp Ser Phe Lys Glu Glu Leu
1140 1145 1150
Asp Lys Tyr Phe Lys Asn His Thr Ser Pro Asp Val Asp Leu Gly Asp
1155 1160 1165
Ile Ser Gly Ile Asn Ala Ser Phe Val Asn Ile Gln Lys Glu Ile Asp
1170 1175 1180
Arg Leu Asn Glu Val Ala Lys Asn Leu Asn Glu Ser Leu Ile Asp Leu
1185 1190 1195 1200
Gln Glu Leu Gly Lys Tyr Glu Gln Tyr Ile Lys Trp Pro Trp Tyr Ile
1205 1210 1215
Trp Leu Gly Phe Ile Ala Gly Leu Ile Ala Ile Val Met Val Thr Ile
1220 1225 1230
Met Leu Cys Cys Met Thr Ser Cys Cys Ser Cys Leu Lys Gly Cys Cys
1235 1240 1245
Ser Cys Gly Ser Cys Cys Lys Phe Asp Glu Asp Asp Ser Glu Pro Val
1250 1255 1260
Leu Lys Gly Val Lys Leu His Tyr Thr
1265 1270
<210> 22
<211> 29883
<212> DNA
<213> SARS-CoV2
<220>
<221> misc_feature
<222> (29849)..(29883)
<223> n is a, c, g, or t
<400> 22
attaaaggtt tataccttcc caggtaacaa accaaccaac tttcgatctc ttgtagatct 60
gttctctaaa cgaactttaa aatctgtgtg gctgtcactc ggctgcatgc ttagtgcact 120
cacgcagtat aattaataac taattactgt cgttgacagg acacgagtaa ctcgtctatc 180
ttctgcaggc tgcttacggt ttcgtccgtg ttgcagccga tcatcagcac atctaggttt 240
tgtccgggtg tgaccgaaag gtaagatgga gagccttgtc cctggtttca acgagaaaac 300
acacgtccaa ctcagtttgc ctgttttaca ggttcgcgac gtgctcgtac gtggctttgg 360
agactccgtg gaggaggtct tatcagaggc acgtcaacat cttaaagatg gcacttgtgg 420
cttagtagaa gttgaaaaag gcgttttgcc tcaacttgaa cagccctatg tgttcatcaa 480
acgttcggat gctcgaactg cacctcatgg tcatgttatg gttgagctgg tagcagaact 540
cgaaggcatt cagtacggtc gtagtggtga gacacttggt gtccttgtcc ctcatgtggg 600
cgaaatacca gtggcttacc gcaaggttct tcttcgtaag aacggtaata aaggagctgg 660
tggccatagt tacggcgccg atctaaagtc atttgactta ggcgacgagc ttggcactga 720
tccttatgaa gattttcaag aaaactggaa cactaaacat agcagtggtg ttacccgtga 780
actcatgcgt gagcttaacg gaggggcata cactcgctat gtcgataaca acttctgtgg 840
ccctgatggc taccctcttg agtgcattaa agaccttcta gcacgtgctg gtaaagcttc 900
atgcactttg tctgaacaac tggactttat tgacactaag aggggtgtat actgctgccg 960
tgaacatgag catgaaattg cttggtacac ggaacgttct gaaaagagct atgaattgca 1020
gacacctttt gaaattaaat tggcaaagaa atttgacacc ttcaatgggg aatgtccaaa 1080
ttttgtattt cccttaaatt ccataatcaa gactattcaa ccaagggttg aaaagaaaaa 1140
gcttgatggc tttatgggta gaattcgatc tgtctatcca gttgcgtcac caaatgaatg 1200
caaccaaatg tgcctttcaa ctctcatgaa gtgtgatcat tgtggtgaaa cttcatggca 1260
gacgggcgat tttgttaaag ccacttgcga attttgtggc actgagaatt tgactaaaga 1320
aggtgccact acttgtggtt acttacccca aaatgctgtt gttaaaattt attgtccagc 1380
atgtcacaat tcagaagtag gacctgagca tagtcttgcc gaataccata atgaatctgg 1440
cttgaaaacc attcttcgta agggtggtcg cactattgcc tttggaggct gtgtgttctc 1500
ttatgttggt tgccataaca agtgtgccta ttgggttcca cgtgctagcg ctaacatagg 1560
ttgtaaccat acaggtgttg ttggagaagg ttccgaaggt cttaatgaca accttcttga 1620
aatactccaa aaagagaaag tcaacatcaa tattgttggt gactttaaac ttaatgaaga 1680
gatcgccatt attttggcat ctttttctgc ttccacaagt gcttttgtgg aaactgtgaa 1740
aggtttggat tataaagcat tcaaacaaat tgttgaatcc tgtggtaatt ttaaagttac 1800
aaaaggaaaa gctaaaaaag gtgcctggaa tattggtgaa cagaaatcaa tactgagtcc 1860
tctttatgca tttgcatcag aggctgctcg tgttgtacga tcaattttct cccgcactct 1920
tgaaactgct caaaattctg tgcgtgtttt acagaaggcc gctataacaa tactagatgg 1980
aatttcacag tattcactga gactcattga tgctatgatg ttcacatctg atttggctac 2040
taacaatcta gttgtaatgg cctacattac aggtggtgtt gttcagttga cttcgcagtg 2100
gctaactaac atctttggca ctgtttatga aaaactcaaa cccgtccttg attggcttga 2160
agagaagttt aaggaaggtg tagagtttct tagagacggt tgggaaattg ttaaatttat 2220
ctcaacctgt gcttgtgaaa ttgtcggtgg acaaattgtc acctgtgcaa aggaaattaa 2280
ggagagtgtt cagacattct ttaagcttgt aaataaattt ttggctttgt gtgctgactc 2340
tatcattatt ggtggagcta aacttaaagc cttgaattta ggtgaaacat ttgtcacgca 2400
ctcaaaggga ttgtacagaa agtgtgttaa atccagagaa gaaactggcc tactcatgcc 2460
tctaaaagcc ccaaaagaaa ttatcttctt agagggagaa acacttccca cagaagtgtt 2520
aacagaggaa gttgtcttga aaactggtga tttacaacca ttagaacaac ctactagtga 2580
agctgttgaa gctccattgg ttggtacacc agtttgtatt aacgggctta tgttgctcga 2640
aatcaaagac acagaaaagt actgtgccct tgcacctaat atgatggtaa caaacaatac 2700
cttcacactc aaaggcggtg caccaacaaa ggttactttt ggtgatgaca ctgtgataga 2760
agtgcaaggt tacaagagtg tgaatatcac ttttgaactt gatgaaagga ttgataaagt 2820
acttaatgag aagtgctctg cctatacagt tgaactcggt acagaagtaa atgagttcgc 2880
ctgtgttgtg gcagatgctg tcataaaaac tttgcaacca gtatctgaat tacttacacc 2940
actgggcatt gatttagatg agtggagtat ggctacatac tacttatttg atgagtctgg 3000
tgagtttaaa ttggcttcac atatgtattg ttctttttac cctccagatg aggatgaaga 3060
agaaggtgat tgtgaagaag aagagtttga gccatcaact caatatgagt atggtactga 3120
agatgattac caaggtaaac ctttggaatt tggtgccact tctgctgctc ttcaacctga 3180
agaagagcaa gaagaagatt ggttagatga tgatagtcaa caaactgttg gtcaacaaga 3240
cggcagtgag gacaatcaga caactattat tcaaacaatt gttgaggttc aacctcaatt 3300
agagatggaa cttacaccag ttgttcagac tattgaagtg aatagtttta gtggttattt 3360
aaaacttact gacaatgtat acattaaaaa tgcagacatt gtggaagaag ctaaaaaggt 3420
aaaaccaaca gtggttgtta atgcagccaa tgtttacctt aaacatggag gaggtgttgc 3480
aggagcctta aataaggcta ctaacaatgc catgcaagtt gaatctgatg attacatagc 3540
tactaatgga ccacttaaag tgggtggtag ttgtgtttta agcggacaca atcttgctaa 3600
acactgtctt catgttgtcg gcccaaatgt taacaaaggt gaagacattc aacttcttaa 3660
gagtgcttat gaaaatttta atcagcacga agttctactt gcaccattat tatcagctgg 3720
tatttttggt gctgacccta tacattcttt aagagtttgt gtagatactg ttcgcacaaa 3780
tgtctactta gctgtctttg ataaaaatct ctatgacaaa cttgtttcaa gctttttgga 3840
aatgaagagt gaaaagcaag ttgaacaaaa gatcgctgag attcctaaag aggaagttaa 3900
gccatttata actgaaagta aaccttcagt tgaacagaga aaacaagatg ataagaaaat 3960
caaagcttgt gttgaagaag ttacaacaac tctggaagaa actaagttcc tcacagaaaa 4020
cttgttactt tatattgaca ttaatggcaa tcttcatcca gattctgcca ctcttgttag 4080
tgacattgac atcactttct taaagaaaga tgctccatat atagtgggtg atgttgttca 4140
agagggtgtt ttaactgctg tggttatacc tactaaaaag gctggtggca ctactgaaat 4200
gctagcgaaa gctttgagaa aagtgccaac agacaattat ataaccactt acccgggtca 4260
gggtttaaat ggttacactg tagaggaggc aaagacagtg cttaaaaagt gtaaaagtgc 4320
cttttacatt ctaccatcta ttatctctaa tgagaagcaa gaaattcttg gaactgtttc 4380
ttggaatttg cgagaaatgc ttgcacatgc agaagaaaca cgcaaattaa tgcctgtctg 4440
tgtggaaact aaagccatag tttcaactat acagcgtaaa tataagggta ttaaaataca 4500
agagggtgtg gttgattatg gtgctagatt ttacttttac accagtaaaa caactgtagc 4560
gtcacttatc aacacactta acgatctaaa tgaaactctt gttacaatgc cacttggcta 4620
tgtaacacat ggcttaaatt tggaagaagc tgctcggtat atgagatctc tcaaagtgcc 4680
agctacagtt tctgtttctt cacctgatgc tgttacagcg tataatggtt atcttacttc 4740
ttcttctaaa acacctgaag aacattttat tgaaaccatc tcacttgctg gttcctataa 4800
agattggtcc tattctggac aatctacaca actaggtata gaatttctta agagaggtga 4860
taaaagtgta tattacacta gtaatcctac cacattccac ctagatggtg aagttatcac 4920
ctttgacaat cttaagacac ttctttcttt gagagaagtg aggactatta aggtgtttac 4980
aacagtagac aacattaatc tccacacgca agttgtggac atgtcaatga catatggaca 5040
acagtttggt ccaacttatt tggatggagc tgatgttact aaaataaaac ctcataattc 5100
acatgaaggt aaaacatttt atgttttacc taatgatgac actctacgtg ttgaggcttt 5160
tgagtactac cacacaactg atcctagttt tctgggtagg tacatgtcag cattaaatca 5220
cactaaaaag tggaaatacc cacaagttaa tggtttaact tctattaaat gggcagataa 5280
caactgttat cttgccactg cattgttaac actccaacaa atagagttga agtttaatcc 5340
acctgctcta caagatgctt attacagagc aagggctggt gaagctgata acttttgtgc 5400
acttatctta gcctactgta ataagacagt aggtgagtta ggtgatgtta gagaaacaat 5460
gagttacttg tttcaacatg ccaatttaga ttcttgcaaa agagtcttga acgtggtgtg 5520
taaaacttgt ggacaacagc agacaaccct taagggtgta gaagctgtta tgtacatggg 5580
cacactttct tatgaacaat ttaagaaagg tgttcagata ccttgtacgt gtggtaaaca 5640
agctacaaaa tatctagtac aacaggagtc accttttgtt atgatgtcag caccacctgc 5700
tcagtatgaa cttaagcatg gtacatttac ttgtgctagt gagtacactg gtaattacca 5760
gtgtggtcac tataaacata taacttctaa agaaactttg tattgcatag acggtgcttt 5820
acttacaaag tcctcagaat acaaaggtcc tattacggat gttttctaca aagaaaacag 5880
ttacacaaca accataaaac cagttactta taaattggat ggtgttgttt gtacagaaat 5940
tgaccctaag ttggacaatt attataagaa agacaattct tattttacag agcaaccaat 6000
tgatcttgta ccaaaccaac catatccaaa cgcaagcttc gataatttta agtttgtatg 6060
tgataatatc aaatttgctg atgatttaaa ccagttaact ggttataaga aacctgcttc 6120
aagagagctt aaagttacat ttttccctga cttaaatggt gatgtggtgg ctattgatta 6180
taaacactac acaccctctt ttaagaaagg agctaaattg ttacataaac ctattgtttg 6240
gcatgttaac aatgcaacta ataaagccac gtataaacca aatacctggt gtatacgttg 6300
tctttggagc acaaaaccag ttgaaacatc aaattcgttt gatgtactga agtcagagga 6360
cgcgcaggga atggataatc ttgcctgcga agatctaaaa ccagtctctg aagaagtagt 6420
ggaaaatcct accatacaga aagacgttct tgagtgtaat gtgaaaacta ccgaagttgt 6480
aggagacatt atacttaaac cagcaaataa tagtttaaaa attacagaag aggttggcca 6540
cacagatcta atggctgctt atgtagacaa ttctagtctt actattaaga aacctaatga 6600
attatctaga gtattaggtt tgaaaaccct tgttactcat ggtttagctg ctgttaatag 6660
tgtcccttgg gatactatag ctaattatgc taagcctttt cttaacaaag ttgttagtac 6720
aactactaac atagttacac ggtgtttaaa ccgtgtttgt actaattata tgccttattt 6780
ctttacttta ttgctacaat tgtgtacttt tactagaagt acaaattcta gaattaaagc 6840
atctatgccg actactatag caaagaatac tgttaagagt gtcggtaaat tttgtctaga 6900
ggcttcattt aattatttga agtcacctaa tttttctaaa ctgataaata ttacaatttg 6960
gtttttacta ttaagtgttt gcctaggttc tttaatctac tcaaccgctg ctttaggtgt 7020
tttaatgtct aatttaggca tgccttctta ctgtactggt tacagagaag gctatttgaa 7080
ctctactaat gtcactattg caacctactg tactggttct ataccttgta gtgtttgtct 7140
tagtggttta gattctttag acacctatcc ttctttagaa actatacaaa ttaccatttc 7200
atcttttaaa tgggatttaa ctgcttttgg cttagttgca gagtggtttt tggcatatat 7260
tcttttcact aggtttttct atgtacttgg attggctgca atcatgcaat tgtttttcag 7320
ctattttgca gtacatttta ttagtaattc ttggcttatg tggttaataa ttaatcttgt 7380
acaaatggcc ccgatttcag ctatggttag aatgtacatc ttctttgcat cattttatta 7440
tgtatggaaa agttatgtgc atgttgtaga cggttgtaat tcatcaactt gtatgatgtg 7500
ttacaaacgt aatagagcaa caagagtcga atgtacaact attgttaatg gtgttagaag 7560
gtccttttat gtctatgcta atggaggtaa aggcttttgc aaactacaca attggaattg 7620
tgttaattgt gatacattct gtgctggtag tacatttatt agtgatgaag ttgcgagaga 7680
cttgtcacta cagtttaaaa gaccaataaa tcctactgac cagtcttctt acatcgttga 7740
tagtgttaca gtgaagaatg gttccatcca tctttacttt gataaagctg gtcaaaagac 7800
ttatgaaaga cattctctct ctcattttgt taacttagac aacctgagag ctaataacac 7860
taaaggttca ttgcctatta atgttatagt ttttgatggt aaatcaaaat gtgaagaatc 7920
atctgcaaaa tcagcgtctg tttactacag tcagcttatg tgtcaaccta tactgttact 7980
agatcaggca ttagtgtctg atgttggtga tagtgcggaa gttgcagtta aaatgtttga 8040
tgcttacgtt aatacgtttt catcaacttt taacgtacca atggaaaaac tcaaaacact 8100
agttgcaact gcagaagctg aacttgcaaa gaatgtgtcc ttagacaatg tcttatctac 8160
ttttatttca gcagctcggc aagggtttgt tgattcagat gtagaaacta aagatgttgt 8220
tgaatgtctt aaattgtcac atcaatctga catagaagtt actggcgata gttgtaataa 8280
ctatatgctc acctataaca aagttgaaaa catgacaccc cgtgaccttg gtgcttgtat 8340
tgactgtagt gcgcgtcata ttaatgcgca ggtagcaaaa agtcacaaca ttgctttgat 8400
atggaacgtt aaagatttca tgtcattgtc tgaacaacta cgaaaacaaa tacgtagtgc 8460
tgctaaaaag aataacttac cttttaagtt gacatgtgca actactagac aagttgttaa 8520
tgttgtaaca acaaagatag cacttaaggg tggtaaaatt gttaataatt ggttgaagca 8580
gttaattaaa gttacacttg tgttcctttt tgttgctgct attttctatt taataacacc 8640
tgttcatgtc atgtctaaac atactgactt ttcaagtgaa atcataggat acaaggctat 8700
tgatggtggt gtcactcgtg acatagcatc tacagatact tgttttgcta acaaacatgc 8760
tgattttgac acatggttta gccagcgtgg tggtagttat actaatgaca aagcttgccc 8820
attgattgct gcagtcataa caagagaagt gggttttgtc gtgcctggtt tgcctggcac 8880
gatattacgc acaactaatg gtgacttttt gcatttctta cctagagttt ttagtgcagt 8940
tggtaacatc tgttacacac catcaaaact tatagagtac actgactttg caacatcagc 9000
ttgtgttttg gctgctgaat gtacaatttt taaagatgct tctggtaagc cagtaccata 9060
ttgttatgat accaatgtac tagaaggttc tgttgcttat gaaagtttac gccctgacac 9120
acgttatgtg ctcatggatg gctctattat tcaatttcct aacacctacc ttgaaggttc 9180
tgttagagtg gtaacaactt ttgattctga gtactgtagg cacggcactt gtgaaagatc 9240
agaagctggt gtttgtgtat ctactagtgg tagatgggta cttaacaatg attattacag 9300
atctttacca ggagttttct gtggtgtaga tgctgtaaat ttacttacta atatgtttac 9360
accactaatt caacctattg gtgctttgga catatcagca tctatagtag ctggtggtat 9420
tgtagctatc gtagtaacat gccttgccta ctattttatg aggtttagaa gagcttttgg 9480
tgaatacagt catgtagttg cctttaatac tttactattc cttatgtcat tcactgtact 9540
ctgtttaaca ccagtttact cattcttacc tggtgtttat tctgttattt acttgtactt 9600
gacattttat cttactaatg atgtttcttt tttagcacat attcagtgga tggttatgtt 9660
cacaccttta gtacctttct ggataacaat tgcttatatc atttgtattt ccacaaagca 9720
tttctattgg ttctttagta attacctaaa gagacgtgta gtctttaatg gtgtttcctt 9780
tagtactttt gaagaagctg cgctgtgcac ctttttgtta aataaagaaa tgtatctaaa 9840
gttgcgtagt gatgtgctat tacctcttac gcaatataat agatacttag ctctttataa 9900
taagtacaag tattttagtg gagcaatgga tacaactagc tacagagaag ctgcttgttg 9960
tcatctcgca aaggctctca atgacttcag taactcaggt tctgatgttc tttaccaacc 10020
accacaaacc tctatcacct cagctgtttt gcagagtggt tttagaaaaa tggcattccc 10080
atctggtaaa gttgagggtt gtatggtaca agtaacttgt ggtacaacta cacttaacgg 10140
tctttggctt gatgacgtag tttactgtcc aagacatgtg atctgcacct ctgaagacat 10200
gcttaaccct aattatgaag atttactcat tcgtaagtct aatcataatt tcttggtaca 10260
ggctggtaat gttcaactca gggttattgg acattctatg caaaattgtg tacttaagct 10320
taaggttgat acagccaatc ctaagacacc taagtataag tttgttcgca ttcaaccagg 10380
acagactttt tcagtgttag cttgttacaa tggttcacca tctggtgttt accaatgtgc 10440
tatgaggccc aatttcacta ttaagggttc attccttaat ggttcatgtg gtagtgttgg 10500
ttttaacata gattatgact gtgtctcttt ttgttacatg caccatatgg aattaccaac 10560
tggagttcat gctggcacag acttagaagg taacttttat ggaccttttg ttgacaggca 10620
aacagcacaa gcagctggta cggacacaac tattacagtt aatgttttag cttggttgta 10680
cgctgctgtt ataaatggag acaggtggtt tctcaatcga tttaccacaa ctcttaatga 10740
ctttaacctt gtggctatga agtacaatta tgaacctcta acacaagacc atgttgacat 10800
actaggacct ctttctgctc aaactggaat tgccgtttta gatatgtgtg cttcattaaa 10860
agaattactg caaaatggta tgaatggacg taccatattg ggtagtgctt tattagaaga 10920
tgaatttaca ccttttgatg ttgttagaca atgctcaggt gttactttcc aaagtgcagt 10980
gaaaagaaca atcaagggta cacaccactg gttgttactc acaattttga cttcactttt 11040
agttttagtc cagagtactc aatggtcttt gttctttttt ttgtatgaaa atgccttttt 11100
accttttgct atgggtatta ttgctatgtc tgcttttgca atgatgtttg tcaaacataa 11160
gcatgcattt ctctgtttgt ttttgttacc ttctcttgcc actgtagctt attttaatat 11220
ggtctatatg cctgctagtt gggtgatgcg tattatgaca tggttggata tggttgatac 11280
tagtttgaag ctaaaagact gtgttatgta tgcatcagct gtagtgttac taatccttat 11340
gacagcaaga actgtgtatg atgatggtgc taggagagtg tggacactta tgaatgtctt 11400
gacactcgtt tataaagttt attatggtaa tgctttagat caagccattt ccatgtgggc 11460
tcttataatc tctgttactt ctaactactc aggtgtagtt acaactgtca tgtttttggc 11520
cagaggtatt gtttttatgt gtgttgagta ttgccctatt ttcttcataa ctggtaatac 11580
acttcagtgt ataatgctag tttattgttt cttaggctat ttttgtactt gttactttgg 11640
cctcttttgt ttactcaacc gctactttag actgactctt ggtgtttatg attacttagt 11700
ttctacacag gagtttagat atatgaattc acagggacta ctcccaccca agaatagcat 11760
agatgccttc aaactcaaca ttaaattgtt gggtgttggt ggcaaacctt gtatcaaagt 11820
agccactgta cagtctaaaa tgtcagatgt aaagtgcaca tcartagtct tactctcagt 11880
tttgcaacaa ctcagagtag aatcatcatc taaattgtgg gctcaatgtg tccagttaca 11940
caatgacatt ctcttagcta aagatactac tgaagccttt gaaaaaatgg tttcactact 12000
ttctgttttg ctttccatgc agggtgctgt agacataaac aagctttgtg aagaaatgct 12060
ggacaacagg gcaaccttac aagctatagc ctcagagttt agttcccttc catcatatgc 12120
agcttttgct actgctcaag aagcttatga gcaggctgtt gctaatggtg attctgaagt 12180
tgttcttaaa aagttgaaga agtctttgaa tgtggctaaa tctgaatttg accgtgatgc 12240
agccatgcaa cgtaagttgg aaaagatggc tgatcaagct atgacccaaa tgtataaaca 12300
ggctagatct gaggacaaga gggcaaaagt tactagtgct atgcagacaa tgcttttcac 12360
tatgcttaga aagttggata atgatgcact caacaacatt atcaacaatg caagagatgg 12420
ttgtgttccc ttgaacataa tacctcttac aacagcagcc aaactaatgg ttgtcatacc 12480
agactataac acatataaaa atacgtgtga tggtacaaca tttacttatg catcagcatt 12540
gtgggaaatc caacaggttg tagatgcaga tagtaaaatt gttcaactta gtgaaattag 12600
tatggacaat tcacctaatt tagcatggcc tcttattgta acagctttaa gggccaattc 12660
tgctgtcaaa ttacagaata atgagcttag tcctgttgca ctacgacaga tgtcttgtgc 12720
tgccggtact acacaaactg cttgcactga tgacaatgcg ttagcttact acaacacaac 12780
aaagggaggt aggtttgtac ttgcactgtt atccgattta caggatttga aatgggctag 12840
attccctaag agtgatggaa ctggtactat ctatacagaa ctggaaccac cttgtaggtt 12900
tgttacagac acacctaaag gtcctaaagt gaagtattta tactttatta aaggattaaa 12960
caacctaaat agaggtatgg tacttggtag tttagctgcc acagtacgtc tacaagctgg 13020
taatgcaaca gaagtgcctg ccaattcaac tgtattatct ttctgtgctt ttgctgtaga 13080
tgctgctaaa gcttacaaag attatctagc tagtggggga caaccaatca ctaattgtgt 13140
taagatgttg tgtacacaca ctggtactgg tcaggcaata acagttacac cggaagccaa 13200
tatggatcaa gaatcctttg gtggtgcatc gtgttgtctg tactgccgtt gccacataga 13260
tcatccaaat cctaaaggat tttgtgactt aaaaggtaag tatgtacaaa tacctacaac 13320
ttgtgctaat gaccctgtgg gttttacact taaaaacaca gtctgtaccg tctgcggtat 13380
gtggaaaggt tatggctgta gttgtgatca actccgcgaa cccatgcttc agtcagctga 13440
tgcacaatcg tttttaaacg ggtttgcggt gtaagtgcag cccgtcttac accgtgcggc 13500
acaggcacta gtactgatgt cgtatacagg gcttttgaca tctacaatga taaagtagct 13560
ggttttgcta aattcctaaa aactaattgt tgtcgcttcc aagaaaagga cgaagatgac 13620
aatttaattg attcttactt tgtagttaag agacacactt tctctaacta ccaacatgaa 13680
gaaacaattt ataatttact taaggattgt ccagctgttg ctaaacatga cttctttaag 13740
tttagaatag acggtgacat ggtaccacat atatcacgtc aacgtcttac taaatacaca 13800
atggcagacc tcgtctatgc tttaaggcat tttgatgaag gtaattgtga cacattaaaa 13860
gaaatacttg tcacatacaa ttgttgtgat gatgattatt tcaataaaaa ggactggtat 13920
gattttgtag aaaacccaga tatattacgc gtatacgcca acttaggtga acgtgtacgc 13980
caagctttgt taaaaacagt acaattctgt gatgccatgc gaaatgctgg tattgttggt 14040
gtactgacat tagataatca agatctcaat ggtaactggt atgatttcgg tgatttcata 14100
caaaccacgc caggtagtgg agttcctgtt gtagattctt attattcatt gttaatgcct 14160
atattaacct tgaccagggc tttaactgca gagtcacatg ttgacactga cttaacaaag 14220
ccttacatta agtgggattt gttaaaatat gacttcacgg aagagaggtt aaaactcttt 14280
gaccgttatt ttaaatattg ggatcagaca taccacccaa attgtgttaa ctgtttggat 14340
gacagatgca ttctgcattg tgcaaacttt aatgttttat tctctacagt gttcccactt 14400
acaagttttg gaccactagt gagaaaaata tttgttgatg gtgttccatt tgtagtttca 14460
actggatacc acttcagaga gctaggtgtt gtacataatc aggatgtaaa cttacatagc 14520
tctagactta gttttaagga attacttgtg tatgctgctg accctgctat gcacgctgct 14580
tctggtaatc tattactaga taaacgcact acgtgctttt cagtagctgc acttactaac 14640
aatgttgctt ttcaaactgt caaacctggt aattttaaca aagacttcta tgactttgct 14700
gtgtctaagg gtttctttaa ggaaggaagt tctgttgaat taaaacactt cttctttgct 14760
caggatggta atgctgctat cagcgattat gactactatc gttataatct accaacaatg 14820
tgtgatatca gacaactact atttgtagtt gaagttgttg ataagtactt tgattgttac 14880
gatggtggct gtattaatgc taaccaagtc atcgtcaaca acctagacaa atcagctggt 14940
tttccattta ataaatgggg taaggctaga ctttattatg attcaatgag ttatgaggat 15000
caagatgcac ttttcgcata tacaaaacgt aatgtcatcc ctactataac tcaaatgaat 15060
cttaagtatg ccattagtgc aaagaataga gctcgcaccg tagctggtgt ctctatctgt 15120
agtactatga ccaatagaca gtttcatcaa aaattattga aatcaatagc cgccactaga 15180
ggagctactg tagtaattgg aacaagcaaa ttctatggtg gttggcacaa catgttaaaa 15240
actgtttata gtgatgtaga aaaccctcat cttatgggtt gggattatcc taaatgtgat 15300
agagccatgc ctaacatgct tagaattatg gcctcacttg ttcttgctcg caaacataca 15360
acgtgttgta gcttgtcaca ccgtttctat agattagcta atgagtgtgc tcaagtattg 15420
agtgaaatgg tcatgtgtgg cggttcacta tatgttaaac caggtggaac ctcatcagga 15480
gatgccacaa ctgcttatgc taatagtgtt tttaacattt gtcaagctgt cacggccaat 15540
gttaatgcac ttttatctac tgatggtaac aaaattgccg ataagtatgt ccgcaattta 15600
caacacagac tttatgagtg tctctataga aatagagatg ttgacacaga ctttgtgaat 15660
gagttttacg catatttgcg taaacatttc tcaatgatga tactctctga cgatgctgtt 15720
gtgtgtttca atagcactta tgcatctcaa ggtctagtgg ctagcataaa gaactttaag 15780
tcagttcttt attatcaaaa caatgttttt atgtctgaag caaaatgttg gactgagact 15840
gaccttacta aaggacctca tgaattttgc tctcaacata caatgctagt taaacagggt 15900
gatgattatg tgtaccttcc ttacccagat ccatcaagaa tcctaggggc cggctgtttt 15960
gtagatgata tcgtaaaaac agatggtaca cttatgattg aacggttcgt gtctttagct 16020
atagatgctt acccacttac taaacatcct aatcaggagt atgctgatgt ctttcatttg 16080
tacttacaat acataagaaa gctacatgat gagttaacag gacacatgtt agacatgtat 16140
tctgttatgc ttactaatga taacacctca aggtattggg aacctgagtt ttatgaggct 16200
atgtacacac cgcatacagt cttacaggct gttggggctt gtgttctttg caattcacag 16260
acttcattaa gatgtggtgc ttgcatacgt agaccattct tatgttgtaa atgctgttac 16320
gaccatgtca tatcaacatc acataaatta gtcttgtctg ttaatccgta tgtttgcaat 16380
gctccaggtt gtgatgtcac agatgtgact caactttact taggaggtat gagctattat 16440
tgtaaatcac ataaaccacc cattagtttt ccattgtgtg ctaatggaca agtttttggt 16500
ttatataaaa atacatgtgt tggtagcgat aatgttactg actttaatgc aattgcaaca 16560
tgtgactgga caaatgctgg tgattacatt ttagctaaca cctgtactga aagactcaag 16620
ctttttgcag cagaaacgct caaagctact gaggagacat ttaaactgtc ttatggtatt 16680
gctactgtac gtgaagtgct gtctgacaga gaattacatc tttcatggga agttggtaaa 16740
cctagaccac cacttaaccg aaattatgtc tttactggtt atcgtgtaac taaaaacagt 16800
aaagtacaaa taggagagta cacctttgaa aaaggtgact atggtgatgc tgttgtttac 16860
cgaggtacaa caacttacaa attaaatgtt ggtgattatt ttgtgctgac atcacataca 16920
gtaatgccat taagtgcacc tacactagtg ccacaagagc actatgttag aattactggc 16980
ttatacccaa cactcaatat ctcagatgag ttttctagca atgttgcaaa ttatcaaaag 17040
gttggtatgc aaaagtattc tacactccag ggaccacctg gtactggtaa gagtcatttt 17100
gctattggcc tagctctcta ctacccttct gctcgcatag tgtatacagc ttgctctcat 17160
gccgctgttg atgcactatg tgagaaggca ttaaaatatt tgcctataga taaatgtagt 17220
agaattatac ctgcacgtgc tcgtgtagag tgttttgata aattcaaagt gaattcaaca 17280
ttagaacagt atgtcttttg tactgtaaat gcattgcctg agacgacagc agatatagtt 17340
gtctttgatg aaatttcaat ggccacaaat tatgatttga gtgttgtcaa tgccagatta 17400
cgtgctaagc actatgtgta cattggcgac cctgctcaat tacctgcacc acgcacattg 17460
ctaactaagg gcacactaga accagaatat ttcaattcag tgtgtagact tatgaaaact 17520
ataggtccag acatgttcct cggaacttgt cggcgttgtc ctgctgaaat tgttgacact 17580
gtgagtgctt tggtttatga taataggctt aaagcacata aagacaaatc agctcaatgc 17640
tttaaaatgt tttataaggg tgttatcacg catgatgttt catctgcaat taacaggcca 17700
caaataggcg tggtaagaga attccttaca cgtaaccctg cttggagaaa agctgtcttt 17760
atttcacctt ataattcaca gaatgctgta gcctcaaaga ttttgggact accaactcaa 17820
actgttgatt catcacaggg ctcagaatat gactatgtca tattcactca aaccactgaa 17880
acagctcact cttgtaatgt aaacagattt aatgttgcta ttaccagagc aaaagtaggc 17940
atactttgca taatgtctga tagagacctt tatgacaagt tgcaatttac aagtcttgaa 18000
attccacgta ggaatgtggc aactttacaa gctgaaaatg taacaggact ctttaaagat 18060
tgtagtaagg taatcactgg gttacatcct acacaggcac ctacacacct cagtgttgac 18120
actaaattca aaactgaagg tttatgtgtt gacatacctg gcatacctaa ggacatgacc 18180
tatagaagac tcatctctat gatgggtttt aaaatgaatt atcaagttaa tggttaccct 18240
aacatgttta tcacccgcga agaagctata agacatgtac gtgcatggat tggcttcgat 18300
gtcgaggggt gtcatgctac tagagaagct gttggtacca atttaccttt acagctaggt 18360
ttttctacag gtgttaacct agttgctgta cctacaggtt atgttgatac acctaataat 18420
acagattttt ccagagttag tgctaaacca ccgcctggag atcaatttaa acacctcata 18480
ccacttatgt acaaaggact tccttggaat gtagtgcgta taaagattgt acaaatgtta 18540
agtgacacac ttaaaaatct ctctgacaga gtcgtatttg tcttatgggc acatggcttt 18600
gagttgacat ctatgaagta ttttgtgaaa ataggacctg agcgcacctg ttgtctatgt 18660
gatagacgtg ccacatgctt ttccactgct tcagacactt atgcctgttg gcatcattct 18720
attggatttg attacgtcta taatccgttt atgattgatg ttcaacaatg gggttttaca 18780
ggtaacctac aaagcaacca tgatctgtat tgtcaagtcc atggtaatgc acatgtagct 18840
agttgtgatg caatcatgac taggtgtcta gctgtccacg agtgctttgt taagcgtgtt 18900
gactggacta ttgaatatcc tataattggt gatgaactga agattaatgc ggcttgtaga 18960
aaggttcaac acatggttgt taaagctgca ttattagcag acaaattccc agttcttcac 19020
gacattggta accctaaagc tattaagtgt gtacctcaag ctgatgtagg atggaagttc 19080
tatgatgcac agccttgtag tgacaaagct tataaaatag aagaattatt ctattcttat 19140
gccacacatt ctgacaaatt cacagatggt gtatgcctat tttggaattg caatgtcgat 19200
agatatcctg ctaattccat tgtttgtaga tttgacacta gagtgctatc taaccttaac 19260
ttgcctggtt gtgatggtgg cagtttgtat gtaaataaac atgcattcca cacaccagct 19320
tttgataaaa gtgcttttgt taatttaaaa caattaccat ttttctatta ctctgacagt 19380
ccatgtgagt ctcatggaaa acaagtagtg tcagatatag attatgtacc actaaagtct 19440
gctacgtgta taacacgttg caatttaggt ggtgctgtct gtagacatca tgctaatgag 19500
tacagattgt atctcgatgc ttataacatg atgatctcag ctggctttag cttgtgggtt 19560
tacaaacaat ttgatactta taacctctgg aacactttta caagacttca gagtttagaa 19620
aatgtggctt ttaatgttgt aaataaggga cactttgatg gacaacaggg tgaagtacca 19680
gtttctatca ttaataacac tgtttacaca aaagttgatg gtgttgatgt agaattgttt 19740
gaaaataaaa caacattacc tgttaatgta gcatttgagc tttgggctaa gcgcaacatt 19800
aaaccagtac cagaggtgaa aatactcaat aatttgggtg tggatattgc tgctaatact 19860
gtgatctggg actacaaaag agatgctcca gcacatatat ctactattgg tgtttgttct 19920
atgactgaca tagccaagaa accaactgaa acgatttgtg caccactcac tgtctttttt 19980
gatggtagag ttgatggtca agtagactta tttagaaatg cccgtaatgg tgttcttatt 20040
acagaaggta gtgttaaagg tttacaacca tctgtaggtc ccaaacaagc tagtcttaat 20100
ggagtcacat taattggaga agccgtaaaa acacagttca attattataa gaaagttgat 20160
ggtgttgtcc aacaattacc tgaaacttac tttactcaga gtagaaattt acaagaattt 20220
aaacccagga gtcaaatgga aattgatttc ttagaattag ctatggatga attcattgaa 20280
cggtataaat tagaaggcta tgccttcgaa catatcgttt atggagattt tagtcatagt 20340
cagttaggtg gtttacatct actgattgga ctagctaaac gttttaagga atcacctttt 20400
gaattagaag attttattcc tatggacagt acagttaaaa actatttcat aacagatgcg 20460
caaacaggtt catctaagtg tgtgtgttct gttattgatt tattacttga tgattttgtt 20520
gaaataataa aatcccaaga tttatctgta gtttctaagg ttgtcaaagt gactattgac 20580
tatacagaaa tttcatttat gctttgtgta aagatggcca tgtagaaaca ttttacccaa 20640
aattacaatc tagtcaagcg tggcaaccgg gtgttgctat gcctaatctt tacaaaatgc 20700
aaagaatgct attagaaaag tgtgaccttc aaaattatgg tgatagtgca acattaccta 20760
aaggcataat gatgaatgtc gcaaaatata ctcaactgtg tcaatattta aacacattaa 20820
cattagctgt accctataat atgagagtta tacattttgg tgctggttct gataaaggag 20880
ttgcaccagg tacagctgtt ttaagacagt ggttgcctac gggtacgctg cttgtcgatt 20940
cagatcttaa tgactttgtc tctgatgcag attcaacttt gattggtgat tgtgcaactg 21000
tacatacagc taataaatgg gatctcatta ttagtgatat gtacgaccct aagactaaaa 21060
atgttacaaa agaaaatgac tctaaagagg gttttttcac ttacatttgt gggtttatac 21120
aacaaaagct agctcttgga ggttccgtgg ctataaagat aacagaacat tcttggaatg 21180
ctgatcttta taagctcatg ggacacttcg catggtggac agcctttgtt actaatgtga 21240
atgcgtcatc atctgaagca tttttaattg gatgtaatta tcttggcaaa ccacgcgaac 21300
aaatagatgg ttatgtcatg catgcaaatt acatattttg gaggaataca aatccaattc 21360
agttgtcttc ctattcttta tttgacatga gtaaatttcc ccttaaatta aggggtactg 21420
ctgttatgtc tttaaaagaa ggtcaaatca atgatatgat tttatctctt cttagtaaag 21480
gtagacttat aattagagaa aacaacagag ttgttatttc tagtgatgtt cttgttaaca 21540
actaaacgaa caatgtttgt ttttcttgtt ttattgccac tagtctctag tcagtgtgtt 21600
aatcttacaa ccagaactca attaccccct gcatacacta attctttcac acgtggtgtt 21660
tattaccctg acaaagtttt cagatcctca gttttacatt caactcagga cttgttctta 21720
cctttctttt ccaatgttac ttggttccat gctatctctg ggaccaatgg tactaagagg 21780
tttgataacc ctgtcctacc atttaatgat ggtgtttatt ttgcttccac tgagaagtct 21840
aacataataa gaggctggat ttttggtact actttagatt cgaagaccca gtccctactt 21900
attgttaata acgctactaa tgttgttatt aaagtctgtg aatttcaatt ttgtaatgat 21960
ccatttttgg gtgtttacca caaaaacaac aaaagttgga tggaaagtga gttcagagtt 22020
tattctagtg cgaataattg cacttttgaa tatgtctctc agccttttct tatggacctt 22080
gaaggaaaac agggtaattt caaaaatctt agggaatttg tgtttaagaa tattgatggt 22140
tattttaaaa tatattctaa gcacacgcct attaatttag tgcgtgatct ccctcagggt 22200
ttttcggctt tagaaccatt ggtagatttg ccaataggta ttaacatcac taggtttcaa 22260
actttacttg ctttacatag aagttatttg actcctggtg attcttcttc aggttggaca 22320
gctggtgctg cagcttatta tgtgggttat cttcaaccta ggacttttct attaaaatat 22380
aatgaaaatg gaaccattac agatgctgta gactgtgcac ttgaccctct ctcagaaaca 22440
aagtgtacgt tgaaatcctt cactgtagaa aaaggaatct atcaaacttc taactttaga 22500
gtccaaccaa cagaatctat tgttagattt cctaatatta caaacttgtg cccttttggt 22560
gaagttttta acgccaccag atttgcatct gtttatgctt ggaacaggaa gagaatcagc 22620
aactgtgttg ctgattattc tgtcctatat aattccgcat cattttccac ttttaagtgt 22680
tatggagtgt ctcctactaa attaaatgat ctctgcttta ctaatgtcta tgcagattca 22740
tttgtaatta gaggtgatga agtcagacaa atcgctccag ggcaaactgg aaagattgct 22800
gattataatt ataaattacc agatgatttt acaggctgcg ttatagcttg gaattctaac 22860
aatcttgatt ctaaggttgg tggtaattat aattacctgt atagattgtt taggaagtct 22920
aatctcaaac cttttgagag agatatttca actgaaatct atcaggccgg tagcacacct 22980
tgtaatggtg ttgaaggttt taattgttac tttcctttac aatcatatgg tttccaaccc 23040
acttatggtg ttggttacca accatacaga gtagtagtac tttcttttga acttctacat 23100
gcaccagcaa ctgtttgtgg acctaaaaag tctactaatt tggttaaaaa caaatgtgtc 23160
aatttcaact tcaatggttt aacaggcaca ggtgttctta ctgagtctaa caaaaagttt 23220
ctgcctttcc aacaatttgg cagagacatt gatgacacta ctgatgctgt ccgtgatcca 23280
cagacacttg agattcttga cattacacca tgttcttttg gtggtgtcag tgttataaca 23340
ccaggaacaa atacttctaa ccaggttgct gttctttatc agggtgttaa ctgcacagaa 23400
gtccctgttg ctattcatgc agatcaactt actcctactt ggcgtgttta ttctacaggt 23460
tctaatgttt ttcaaacacg tgcaggctgt ttaatagggg ctgaacatgt caacaactca 23520
tatgagtgtg acatacccat tggtgcaggt atatgcgcta gttatcagac tcagactaat 23580
tctcatcggc gggcacgtag tgtagctagt caatccatca ttgcctacac tatgtcactt 23640
ggtgcagaaa attcagttgc ttactctaat aactctattg ccatacccat aaattttact 23700
attagtgtta ccacagaaat tctaccagtg tctatgacca agacatcagt agattgtaca 23760
atgtacattt gtggtgattc aactgaatgc agcaatcttt tgttgcaata tggcagtttt 23820
tgtacacaat taaaccgtgc tttaactgga atagctgttg aacaagacaa aaacacccaa 23880
gaagtttttg cacaagtcaa acaaatttac aaaacaccac caattaaaga ttttggtggt 23940
tttaattttt cacaaatatt accagatcca tcaaaaccaa gcaagaggtc atttattgaa 24000
gatctacttt tcaacaaagt gacacttgca gatgctggct tcatcaaaca atatggtgat 24060
tgccttggtg atattgctgc tagagacctc atttgtgcac aaaagtttaa cggccttact 24120
gttttgccac ctttgctcac agatgaaatg attgctcaat acacttctgc actgttagcg 24180
ggtacaatca cttctggttg gacctttggt gcaggtgctg cattacaaat accatttgct 24240
atgcaaatgg cttataggtt taatggtatt ggagttacac agaatgttct ctatgagaac 24300
caaaaattga ttgccaacca atttaatagt gctattggca aaattcaaga ctcactttct 24360
tccacagcaa gtgcacttgg aaaacttcaa gatgtggtca accaaaatgc acaagcttta 24420
aacacgcttg ttaaacaact tagctccaat tttggtgcaa tttcaagtgt tttaaatgat 24480
atccttgcac gtcttgacaa agttgaggct gaagtgcaaa ttgataggtt gatcacaggc 24540
agacttcaaa gtttgcagac atatgtgact caacaattaa ttagagctgc agaaatcaga 24600
gcttctgcta atcttgctgc tactaaaatg tcagagtgtg tacttggaca atcaaaaaga 24660
gttgattttt gtggaaaggg ctatcatctt atgtccttcc ctcagtcagc acctcatggt 24720
gtagtcttct tgcatgtgac ttatgtccct gcacaagaaa agaacttcac aactgctcct 24780
gccatttgtc atgatggaaa agcacacttt cctcgtgaag gtgtctttgt ttcaaatggc 24840
acacactggt ttgtaacaca aaggaatttt tatgaaccac aaatcattac tacacacaac 24900
acatttgtgt ctggtaactg tgatgttgta ataggaattg tcaacaacac agtttatgat 24960
cctttgcaac ctgaattaga ctcattcaag gaggagttag ataaatattt taagaatcat 25020
acatcaccag atgttgattt aggtgacatc tctggcatta atgcttcagt tgtaaacatt 25080
caaaaagaaa ttgaccgcct caatgaggtt gccaagaatt taaatgaatc tctcatcgat 25140
ctccaagaac ttggaaagta tgagcagtat ataaaatggc catggtacat ttggctaggt 25200
tttatagctg gcttgattgc catagtaatg gtgacaatta tgctttgctg tatgaccagt 25260
tgctgtagtt gtctcaaggg ctgttgttct tgtggatcct gctgcaaatt tgatgaagac 25320
gactctgagc cagtgctcaa aggagtcaaa ttacattaca cataaacgaa cttatggatt 25380
tgtttatgag aatcttcaca attggaactg taactttgaa gcaaggtgaa atcaaggatg 25440
ctactccttc agattttgtt cgcgctactg caacgatacc gatacaagcc tcactccctt 25500
tcggatggct tattgttggc gttgcacttc ttgctgtttt tcagagcgct tccaaaatca 25560
taaccctcaa aaagagatgg caactagcac tctccaaggg tgttcacttt gtttgcaact 25620
tgctgttgtt gtttgtaaca gtttactcac accttttgct cgttgctgct ggccttgaag 25680
ccccttttct ctatctttat gctttagtct acttcttgca gagtataaac tttgtaagaa 25740
taataatgag gctttggctt tgctggaaat gccgttccaa aaacccatta ctttatgatg 25800
ccaactattt tctttgctgg catactaatt gttacgacta ttgtatacct tacaatagtg 25860
taacttcttc aattgtcatt acttcaggtg atggcacaac aagtcctatt tctgaacatg 25920
actaccagat tggtggttat actgaaaaat gggaatctgg agtaaaagac tgtgttgtat 25980
tacacagtta cttcacttca gactattacc agctgtactc aactcaattg agtacagaca 26040
ctggtgttga acatgttacc ttcttcatct acaataaaat tgttgatgag cctgaagaac 26100
atgtccaaat tcacacaatc gacggttcat ccggagttgt taatccagta atggaaccaa 26160
tttatgatga accgacgacg actactagcg tgcctttgta agcacaagct gatgagtacg 26220
aacttatgta ctcattcgtt tcggaagaga caggtacgtt aatagttaat agcgtacttc 26280
tttttcttgc tttcgtggta ttcttgctag ttacactagc catccttact gcgcttcgat 26340
tgtgtgcgta ctgctgcaat attgttaacg tgagtcttgt aaaaccttct ttttacgttt 26400
actctcgtgt taaaaatctg aattcttcta gagttcctga tcttctggtc taaacgaact 26460
aaatattata ttagtttttc tgtttggaac tttaatttta gccatggcag attccaacgg 26520
tactattacc gttgaagagc ttaaaaagct ccttgaacaa tggaacctag taataggttt 26580
cctattcctt acatggattt gtcttctaca atttgcctat gccaacagga ataggttttt 26640
gtatataatt aagttaattt tcctctggct gttatggcca gtaactttag cttgttttgt 26700
gcttgctgct gtttacagaa taaattggat caccggtgga attgctatcg caatggcttg 26760
tcttgtaggc ttgatgtggc tcagctactt cattgcttct ttcagactgt ttgcgcgtac 26820
gcgttccatg tggtcattca atccagaaac taacattctt ctcaacgtgc cactccatgg 26880
cactattctg accagaccgc ttctagaaag tgaactcgta atcggagctg tgatccttcg 26940
tggacatctt cgtattgctg gacaccatct aggacgctgt gacatcaagg acctgcctaa 27000
agaaatcact gttgctacat cacgaacgct ttcttattac aaattgggag cttcgcagcg 27060
tgtagcaggt gactcaggtt ttgctgcata cagtcgctac aggattggca actataaatt 27120
aaacacagac cattccagta gcagtgacaa tattgctttg cttgtacagt aagtgacaac 27180
agatgtttca tctcgttgac tttcaggtta ctatagcaga gatattacta attattatga 27240
ggacttttaa agtttccatt tggaatcttg attacatcat aaacctcata attaaaaatt 27300
tatctaagtc actaactgag aataaatatt ctcaattaga tgaagagcaa ccaatggaga 27360
ttgattaaac gaacatgaaa attattcttt tcttggcact gataacactc gctacttgtg 27420
agctttatca ctaccaagag tgtgttagag gtacaacagt acttttaaaa gaaccttgct 27480
cttctggaac atacgagggc aattcaccat ttcatcctct agctgataac aaatttgcac 27540
tgacttgctt tagcactcaa tttgcttttg cttgtcctga cggcgtaaaa cacgtctatc 27600
agttacgtgc cagatcagtt tcacctaaac tgttcatcag acaagaggaa gttcaagaac 27660
tttactctcc aatttttctt attgttgcgg caatagtgtt tataacactt tgcttcacac 27720
tcaaaagaaa gacagaatga ttgaactttc attaattgac ttctatttgt gctttttagc 27780
ctttctgcta ttccttgttt taattatgct tattatcttt tggttctcac ttgaactgca 27840
agatcataat gaaacttgtc acgcctaaac gaacatgaaa tttcttgttt tcttaggaat 27900
catcacaact gtagctgcat ttcaccaaga atgtagttta cagtcatgta cttaacatca 27960
accatatgta gttgatgacc cgtgtcctat tcacttctat tctaaatggt atattagagt 28020
aggagctata aaatcagcac ctttaattga attgtgcgtg gatgaggctg gttctaaatc 28080
acccattcag tgcatcgata tcggtaatta tacagtttcc tgtttacctt ttacaattaa 28140
ttgccaggaa cctaaattgg gtagtcttgt agtgcgttgt tcgttctatg aagacttttt 28200
agagtatcat gacgttcgtg ttgttttaga tttcatctaa acgaacaaac taaatgtctc 28260
taaatggacc ccaaaatcag cgaaatgcac cccgcattac gtttggtgga ccctcagatt 28320
caactggcag taaccagaat ggagaacgca gtggggcgcg atcaaaacaa cgtcggcccc 28380
aaggtttacc caataatact gcgtcttggt tcaccgctct cactcaacat ggcaaggaag 28440
accttaaatt ccctcgagga caaggcgttc caattaacac caatagcagt ccagatgacc 28500
aaattggcta ctaccgaaga gctaccagac gaattcgtgg tggtgacggt aaaatgaaag 28560
atctcagtcc aagatggtat ttctactacc taggaactgg gccagaagct ggacttccct 28620
atggtgctaa caaagacggc atcatatggg ttgcaactga gggagccttg aatacaccaa 28680
aagatcacat tggcacccgc aatcctgcta acaatgctgc aatcgtgcta caacttcctc 28740
aaggaacaac attgccaaaa ggcttctacg cagaagggag cagaggcggc agtcaagcct 28800
cttctcgttc ctcatcacgt agtcgcaaca gttcaagaaa ttcaactcca ggcagcagta 28860
aacgaacttc tcctgctaga atggctggca atggcggtga tgctgctctt gctttgctgc 28920
tgcttgacag attgaaccag cttgagagca aaatgtttgg taaaggccaa caacaacaag 28980
gccaaactgt cactaagaaa tctgctgctg aggcttctaa gaagcctcgg caaaaacgta 29040
ctgccactaa agcatacaat gtaacacaag ctttcggcag acgtggtcca gaacaaaccc 29100
aaggaaattt tggggaccag gaactaatca gacaaggaac tgattacaaa cattggccgc 29160
aaattgcaca atttgccccc agcgcttcag cgttcttcgg aatgtcgcgc attggcatgg 29220
aagtcacacc ttcgggaacg tggttgacct acacaggtgc catcaaattg gatgacaaag 29280
atccaaattt caaagatcaa gtcattttgc tgaataagca tattgacgca tacaaaacat 29340
tcccaccaac agagcctaaa aaggacaaaa agaagaaggc tgatgaaact caagccttac 29400
cgcagagaca gaagaaacag caaactgtga ctcttcttcc tgctgcagat ttggatgatt 29460
tctccaaaca attgcaacaa tccatgagca gtgctgactc aactcaggcc taaactcatg 29520
cagaccacac aaggcagatg ggctatataa acgttttcgc ttttccgttt acgatatata 29580
gtctactctt gtgcagaatg aattctcgta actacatagc acaagtagat gtagttaact 29640
ttaatctcac atagcaatct ttaatcagtg tgtaacatta gggaggactt gaaagagcca 29700
ccacattttc accgaggcca cgcggagtac gatcgagtgt acagtgaaca atgctaggga 29760
gagctgccta tatggaagag ccctaatgtg taaaattaat tttagtagtg ctatccccat 29820
gtgattttaa tagcttctta ggagaatgnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 29880
nnn 29883
<210> 23
<211> 1270
<212> PRT
<213> SARS-CoV2
<400> 23
Met Phe Val Phe Leu Val Leu Leu Pro Leu Val Ser Ser Gln Cys Val
1 5 10 15
Asn Leu Thr Thr Arg Thr Gln Leu Pro Pro Ala Tyr Thr Asn Ser Phe
20 25 30
Thr Arg Gly Val Tyr Tyr Pro Asp Lys Val Phe Arg Ser Ser Val Leu
35 40 45
His Ser Thr Gln Asp Leu Phe Leu Pro Phe Phe Ser Asn Val Thr Trp
50 55 60
Phe His Ala Ile Ser Gly Thr Asn Gly Thr Lys Arg Phe Asp Asn Pro
65 70 75 80
Val Leu Pro Phe Asn Asp Gly Val Tyr Phe Ala Ser Thr Glu Lys Ser
85 90 95
Asn Ile Ile Arg Gly Trp Ile Phe Gly Thr Thr Leu Asp Ser Lys Thr
100 105 110
Gln Ser Leu Leu Ile Val Asn Asn Ala Thr Asn Val Val Ile Lys Val
115 120 125
Cys Glu Phe Gln Phe Cys Asn Asp Pro Phe Leu Gly Val Tyr His Lys
130 135 140
Asn Asn Lys Ser Trp Met Glu Ser Glu Phe Arg Val Tyr Ser Ser Ala
145 150 155 160
Asn Asn Cys Thr Phe Glu Tyr Val Ser Gln Pro Phe Leu Met Asp Leu
165 170 175
Glu Gly Lys Gln Gly Asn Phe Lys Asn Leu Arg Glu Phe Val Phe Lys
180 185 190
Asn Ile Asp Gly Tyr Phe Lys Ile Tyr Ser Lys His Thr Pro Ile Asn
195 200 205
Leu Val Arg Asp Leu Pro Gln Gly Phe Ser Ala Leu Glu Pro Leu Val
210 215 220
Asp Leu Pro Ile Gly Ile Asn Ile Thr Arg Phe Gln Thr Leu Leu Ala
225 230 235 240
Leu His Arg Ser Tyr Leu Thr Pro Gly Asp Ser Ser Ser Gly Trp Thr
245 250 255
Ala Gly Ala Ala Ala Tyr Tyr Val Gly Tyr Leu Gln Pro Arg Thr Phe
260 265 270
Leu Leu Lys Tyr Asn Glu Asn Gly Thr Ile Thr Asp Ala Val Asp Cys
275 280 285
Ala Leu Asp Pro Leu Ser Glu Thr Lys Cys Thr Leu Lys Ser Phe Thr
290 295 300
Val Glu Lys Gly Ile Tyr Gln Thr Ser Asn Phe Arg Val Gln Pro Thr
305 310 315 320
Glu Ser Ile Val Arg Phe Pro Asn Ile Thr Asn Leu Cys Pro Phe Gly
325 330 335
Glu Val Phe Asn Ala Thr Arg Phe Ala Ser Val Tyr Ala Trp Asn Arg
340 345 350
Lys Arg Ile Ser Asn Cys Val Ala Asp Tyr Ser Val Leu Tyr Asn Ser
355 360 365
Ala Ser Phe Ser Thr Phe Lys Cys Tyr Gly Val Ser Pro Thr Lys Leu
370 375 380
Asn Asp Leu Cys Phe Thr Asn Val Tyr Ala Asp Ser Phe Val Ile Arg
385 390 395 400
Gly Asp Glu Val Arg Gln Ile Ala Pro Gly Gln Thr Gly Lys Ile Ala
405 410 415
Asp Tyr Asn Tyr Lys Leu Pro Asp Asp Phe Thr Gly Cys Val Ile Ala
420 425 430
Trp Asn Ser Asn Asn Leu Asp Ser Lys Val Gly Gly Asn Tyr Asn Tyr
435 440 445
Leu Tyr Arg Leu Phe Arg Lys Ser Asn Leu Lys Pro Phe Glu Arg Asp
450 455 460
Ile Ser Thr Glu Ile Tyr Gln Ala Gly Ser Thr Pro Cys Asn Gly Val
465 470 475 480
Glu Gly Phe Asn Cys Tyr Phe Pro Leu Gln Ser Tyr Gly Phe Gln Pro
485 490 495
Thr Tyr Gly Val Gly Tyr Gln Pro Tyr Arg Val Val Val Leu Ser Phe
500 505 510
Glu Leu Leu His Ala Pro Ala Thr Val Cys Gly Pro Lys Lys Ser Thr
515 520 525
Asn Leu Val Lys Asn Lys Cys Val Asn Phe Asn Phe Asn Gly Leu Thr
530 535 540
Gly Thr Gly Val Leu Thr Glu Ser Asn Lys Lys Phe Leu Pro Phe Gln
545 550 555 560
Gln Phe Gly Arg Asp Ile Asp Asp Thr Thr Asp Ala Val Arg Asp Pro
565 570 575
Gln Thr Leu Glu Ile Leu Asp Ile Thr Pro Cys Ser Phe Gly Gly Val
580 585 590
Ser Val Ile Thr Pro Gly Thr Asn Thr Ser Asn Gln Val Ala Val Leu
595 600 605
Tyr Gln Gly Val Asn Cys Thr Glu Val Pro Val Ala Ile His Ala Asp
610 615 620
Gln Leu Thr Pro Thr Trp Arg Val Tyr Ser Thr Gly Ser Asn Val Phe
625 630 635 640
Gln Thr Arg Ala Gly Cys Leu Ile Gly Ala Glu His Val Asn Asn Ser
645 650 655
Tyr Glu Cys Asp Ile Pro Ile Gly Ala Gly Ile Cys Ala Ser Tyr Gln
660 665 670
Thr Gln Thr Asn Ser His Arg Arg Ala Arg Ser Val Ala Ser Gln Ser
675 680 685
Ile Ile Ala Tyr Thr Met Ser Leu Gly Ala Glu Asn Ser Val Ala Tyr
690 695 700
Ser Asn Asn Ser Ile Ala Ile Pro Ile Asn Phe Thr Ile Ser Val Thr
705 710 715 720
Thr Glu Ile Leu Pro Val Ser Met Thr Lys Thr Ser Val Asp Cys Thr
725 730 735
Met Tyr Ile Cys Gly Asp Ser Thr Glu Cys Ser Asn Leu Leu Leu Gln
740 745 750
Tyr Gly Ser Phe Cys Thr Gln Leu Asn Arg Ala Leu Thr Gly Ile Ala
755 760 765
Val Glu Gln Asp Lys Asn Thr Gln Glu Val Phe Ala Gln Val Lys Gln
770 775 780
Ile Tyr Lys Thr Pro Pro Ile Lys Asp Phe Gly Gly Phe Asn Phe Ser
785 790 795 800
Gln Ile Leu Pro Asp Pro Ser Lys Pro Ser Lys Arg Ser Phe Ile Glu
805 810 815
Asp Leu Leu Phe Asn Lys Val Thr Leu Ala Asp Ala Gly Phe Ile Lys
820 825 830
Gln Tyr Gly Asp Cys Leu Gly Asp Ile Ala Ala Arg Asp Leu Ile Cys
835 840 845
Ala Gln Lys Phe Asn Gly Leu Thr Val Leu Pro Pro Leu Leu Thr Asp
850 855 860
Glu Met Ile Ala Gln Tyr Thr Ser Ala Leu Leu Ala Gly Thr Ile Thr
865 870 875 880
Ser Gly Trp Thr Phe Gly Ala Gly Ala Ala Leu Gln Ile Pro Phe Ala
885 890 895
Met Gln Met Ala Tyr Arg Phe Asn Gly Ile Gly Val Thr Gln Asn Val
900 905 910
Leu Tyr Glu Asn Gln Lys Leu Ile Ala Asn Gln Phe Asn Ser Ala Ile
915 920 925
Gly Lys Ile Gln Asp Ser Leu Ser Ser Thr Ala Ser Ala Leu Gly Lys
930 935 940
Leu Gln Asp Val Val Asn Gln Asn Ala Gln Ala Leu Asn Thr Leu Val
945 950 955 960
Lys Gln Leu Ser Ser Asn Phe Gly Ala Ile Ser Ser Val Leu Asn Asp
965 970 975
Ile Leu Ala Arg Leu Asp Lys Val Glu Ala Glu Val Gln Ile Asp Arg
980 985 990
Leu Ile Thr Gly Arg Leu Gln Ser Leu Gln Thr Tyr Val Thr Gln Gln
995 1000 1005
Leu Ile Arg Ala Ala Glu Ile Arg Ala Ser Ala Asn Leu Ala Ala Thr
1010 1015 1020
Lys Met Ser Glu Cys Val Leu Gly Gln Ser Lys Arg Val Asp Phe Cys
1025 1030 1035 1040
Gly Lys Gly Tyr His Leu Met Ser Phe Pro Gln Ser Ala Pro His Gly
1045 1050 1055
Val Val Phe Leu His Val Thr Tyr Val Pro Ala Gln Glu Lys Asn Phe
1060 1065 1070
Thr Thr Ala Pro Ala Ile Cys His Asp Gly Lys Ala His Phe Pro Arg
1075 1080 1085
Glu Gly Val Phe Val Ser Asn Gly Thr His Trp Phe Val Thr Gln Arg
1090 1095 1100
Asn Phe Tyr Glu Pro Gln Ile Ile Thr Thr His Asn Thr Phe Val Ser
1105 1110 1115 1120
Gly Asn Cys Asp Val Val Ile Gly Ile Val Asn Asn Thr Val Tyr Asp
1125 1130 1135
Pro Leu Gln Pro Glu Leu Asp Ser Phe Lys Glu Glu Leu Asp Lys Tyr
1140 1145 1150
Phe Lys Asn His Thr Ser Pro Asp Val Asp Leu Gly Asp Ile Ser Gly
1155 1160 1165
Ile Asn Ala Ser Val Val Asn Ile Gln Lys Glu Ile Asp Arg Leu Asn
1170 1175 1180
Glu Val Ala Lys Asn Leu Asn Glu Ser Leu Ile Asp Leu Gln Glu Leu
1185 1190 1195 1200
Gly Lys Tyr Glu Gln Tyr Ile Lys Trp Pro Trp Tyr Ile Trp Leu Gly
1205 1210 1215
Phe Ile Ala Gly Leu Ile Ala Ile Val Met Val Thr Ile Met Leu Cys
1220 1225 1230
Cys Met Thr Ser Cys Cys Ser Cys Leu Lys Gly Cys Cys Ser Cys Gly
1235 1240 1245
Ser Cys Cys Lys Phe Asp Glu Asp Asp Ser Glu Pro Val Leu Lys Gly
1250 1255 1260
Val Lys Leu His Tyr Thr
1265 1270
<210> 24
<211> 29832
<212> DNA
<213> SARS-CoV2
<400> 24
aaggtttata ccttcccagg taacaaacca accaactttc gatctcttgt agatctgttc 60
tctaaacgaa ctttaaaatc tgtgtggctg tcactcggct gcatgcttag tgcactcacg 120
cagtataatt aataactaat tactgtcgtt gacaggacac gagtaactcg tctatcttct 180
gcaggctgct tacggtttcg tccgtgttgc agccgatcat cagcacatct aggttttgtc 240
cgggtgtgac cgaaaggtaa gatggagagc cttgtccctg gtttcaacga gaaaacacac 300
gtccaactca gtttgcctgt tttacaggtt cgcgacgtgc tcgtacgtgg ctttggagac 360
tccgtggagg aggtcttatc agaggcacgt caacatctta aagatggcac ttgtggctta 420
gtagaagttg aaaaaggcgt tttgcctcaa cttgaacagc cctatgtgtt catcaaacgt 480
tcggatgctc gaactgcacc tcatggtcat gttatggttg agctggtagc agaactcgaa 540
ggcattcagt acggtcgtag tggtgagaca cttggtgtcc ttgtccctca tgtgggcgaa 600
ataccagtgg cttaccgcaa ggttcttctt cgtaagaacg gtaataaagg agctggtggc 660
catagttacg gcgccgatct aaagtcattt gacttaggcg acgagcttgg cactgatcct 720
tatgaagatt ttcaagaaaa ctggaacact aaacatagca gtggtgttac ccgtgaactc 780
atgcgtgagc ttaacggagg ggcatacact cgctatgtcg ataacaactt ctgtggccct 840
gatggctacc ctcttgagtg cattaaagac cttctagcac gtgctggtaa agcttcatgc 900
actttgtccg aacaactgga ctttattgac actaagaggg gtgtatactg ctgccgtgaa 960
catgagcatg aaattgcttg gtacacggaa cgttctgaaa agagctatga attgcagaca 1020
ccttttgaaa ttaaattggc aaagaaattt gacatcttca atggggaatg tccaaatttt 1080
gtatttccct taaattccat aatcaagact attcaaccaa gggttgaaaa gaaaaagctt 1140
gatggcttta tgggtagaat tcgatctgtc tatccagttg cgtcaccaaa tgaatgcaac 1200
caaatgtgcc tttcaactct catgaagtgt gatcattgtg gtgaaacttc atggcagacg 1260
ggcgattttg ttaaagccac ttgcgaattt tgtggcactg agaatttgac taaagaaggt 1320
gccactactt gtggttactt accccaaaat gctgttgtta aaatttattg tccagcatgt 1380
cacaattcag aagtaggacc tgagcatagt cttgccgaat accataatga atctggcttg 1440
aaaaccattc ttcgtaaggg tggtcgcact attgcctttg gaggctgtgt gttctcttat 1500
gttggttgcc ataacaagtg tgcctattgg gttccacgtg ctagcgctaa cataggttgt 1560
aaccatacag gtgttgttgg agaaggttcc gaaggtctta atgacaacct tcttgaaata 1620
ctccaaaaag agaaagtcaa catcaatatt gttggtgact ttaaacttaa tgaagagatc 1680
gccattattt tggcatcttt ttctgcttcc acaagtgctt ttgtggaaac tgtgaaaggt 1740
ttggattata aagcattcaa acaaattgtt gaatcctgtg gtaattttaa agttacaaaa 1800
ggaaaagcta aaaaaggtgc ctggaatatt ggtgaacaga aatcaatact gagtcctctt 1860
tatgcatttg catcagaggc tgctcgtgtt gtacgatcaa ttttctcccg cactcttgaa 1920
actgctcaaa attctgtgcg tgttttacag aaggccgcta taacaatact agatggaatt 1980
tcacagtatt cactgagact cattgatgct atgatgttca catctgattt ggctactaac 2040
aatctagttg taatggccta cattacaggt ggtgttgttc agttgacttc gcagtggcta 2100
actaacatct ttggcactgt ttatgaaaaa ctcaaacccg tccttgattg gcttgaagag 2160
aagtttaagg aaggtgtaga gtttcttaga gacggttggg aaattgttaa atttatctca 2220
acctgtgctt gtgaaattgt cggtggacaa attgtcacct gtgcaaagga aattaaggag 2280
agtgttcaga cattctttaa gcttgtaaat aaatttttgg ctttgtgtgc tgactctatc 2340
attattggtg gagctaaact taaagccttg aatttaggtg aaacatttgt cacgcactca 2400
aagggattgt acagaaagtg tgttaaatcc agagaagaaa ctggcctact catgcctcta 2460
aaagccccaa aagaaattat cttcttagag ggagaaacac ttcccacaga agtgttaaca 2520
gaggaagttg tcttgaaaac tggtgattta caaccattag aacaacctac tagtgaagct 2580
gttgaagctc cattggttgg tacaccagtt tgtattaacg ggcttatgtt gctcgaaatc 2640
aaagacacag aaaagtactg tgcccttgca cctaatatga tggtaacaaa caataccttc 2700
acactcaaag gcggtgcacc aacaaaggtt acttttggtg atgacactgt gatagaagtg 2760
caaggttaca agagtgtgaa tatcactttt gaacttgatg aaaggattga taaagtactt 2820
aatgagaagt gctctgccta tacagttgaa ctcggtacag aagtaaatga gttcgcctgt 2880
gttgtggcag atgctgtcat aaaaactttg caaccagtat ctgaattact tacaccactg 2940
ggcattgatt tagatgagtg gagtatggct acatactact tatttgatga gtctggtgag 3000
tttaaattgg cttcacatat gtattgttct ttttaccctc cagatgagga tgaagaagaa 3060
ggtgattgtg aagaagaaga gtttgagcca tcaactcaat atgagtatgg tactgaagat 3120
gattaccaag gtaaaccttt ggaatttggt gccacttctg ctgctcttca acctgaagaa 3180
gagcaagaag aagattggtt agatgatgat agtcaacaaa ctgttggtca acaagacggc 3240
agtgaggaca atcagacaac tactattcaa acaattgttg aggttcaacc tcaattagag 3300
atggaactta caccagttgt tcagactatt gaagtgaata gttttagtgg ttatttaaaa 3360
cttactgaca atgtatacat taaaaatgca gacattgtgg aagaagctaa aaaggtaaaa 3420
ccaacagtgg ttgttaatgc agccaatgtt taccttaaac atggaggagg tgttgcagga 3480
gccttaaata aggctactaa caatgccatg caagttgaat ctgatgatta catagctact 3540
aatggaccac ttaaagtggg tggtagttgt gttttaagcg gacacaatct tgctaaacac 3600
tgtcttcatg ttgtcggccc aaatgttaac aaaggtgaag acattcaact tcttaagagt 3660
gcttatgaaa attttaatca gcacgaagtt ctacttgcac cattattatc agctggtatt 3720
tttggtgctg accctataca ttctttaaga gtttgtgtag atactgttcg cacaaatgtc 3780
tacttagctg tctttgataa aaatctctat gacaaacttg tttcaagctt tttggaaatg 3840
aagagtgaaa agcaagttga acaaaagatc gctgagattc ctaaagagga agttaagcca 3900
tttataactg aaagtaaacc ttcagttgaa cagagaaaac aagatgataa gaaaatcaaa 3960
gcttgtgttg aagaagttac aacaactctg gaagaaacta agttcctcac agaaaacttg 4020
ttactttata ttgacattaa tggcaatctt catccagatt ctgccactct tgttagtgac 4080
attgacatca ctttcttaaa gaaagatgct ccatatatag tgggtgatgt tgttcaagag 4140
ggtgttttaa ctgctgtggt tatacctact aaaaaggctg gtggcactac tgaaatgcta 4200
gcgaaagctt tgagaaaagt gccaacagac aattatataa ccacttaccc gggtcagggt 4260
ttaaatggtt acactgtaga ggaggcaaag acagtgctta aaaagtgtaa aagtgccttt 4320
tacattctac catctattat ctctaatgag aagcaagaaa ttcttggaac tgtttcttgg 4380
aatttgcgag aaatgcttgc acatgcagaa gaaacacgca aattaatgcc tgtctgtgtg 4440
gaaactaaag ccatagtttc aactatacag cgtaaatata agggtattaa aatacaagag 4500
ggtgtggttg attatggtgc tagattttac ttttacacca gtaaaacaac tgtagcgtca 4560
cttatcaaca cacttaacga tctaaatgaa actcttgtta caatgccact tggctatgta 4620
acacatggct taaatttgga agaagctgct cggtatatga gatctctcaa agtgccagct 4680
acagtttctg tttcttcacc tgatgctgtt acagcgtata atggttatct tacttcttct 4740
tctaaaacac ctgaagaaca ttttattgaa accatctcac ttgctggttc ctataaagat 4800
tggtcctatt ctggacaatc tacacaacta ggtatagaat ttcttaagag aggtgataaa 4860
agtgtatatt acactagtaa tcctaccaca ttccacctag atggtgaagt tatcaccttt 4920
gacaatctta agacacttct ttctttgaga gaagtgagga ctattaaggt gtttacaaca 4980
gtagacaaca ttaacctcca cacgcaagtt gtggacatgt caatgacata tggacaacag 5040
tttggtccaa cttatttgga tggagctgat gttactaaaa taaaacctca taattcacat 5100
gaaggtaaaa cattttatgt tttacctaat gatgacactc tacgtgttga ggcttttgag 5160
tactaccaca caactgatcc tagttttctg ggtaggtaca tgtcagcatt aaatcacact 5220
aaaaagtgga aatacccaca agttaatggt ttaacttcta ttaaatgggc agataacaac 5280
tgttatcttg ccactgcatt gttaacactc caacaaatag agttgaagtt taatccacct 5340
gctctacaag atgcttatta cagagcaagg gctggtgaag ctgctaactt ttgtgcactt 5400
atcttagcct actgtaataa gacagtaggt gagttaggtg atgttagaga aacaatgagt 5460
tacttgtttc aacatgccaa tttagattct tgcaaaagag tcttgaacgt ggtgtgtaaa 5520
acttgtggac aacagcagac aacccttaag ggtgtagaag ctgttatgta catgggcaca 5580
ctttcttatg aacaatttaa gaaaggtgtt cagatacctt gtacgtgtgg taaacaagct 5640
acaaaatatc tagtacaaca ggagtcacct tttgttatga tgtcagcacc acctgctcag 5700
tatgaactta agcatggtac atttacttgt gctagtgagt acactggtaa ttaccagtgt 5760
ggtcactata aacatataac ttctaaagaa actttgtatt gcatagacgg tgctttactt 5820
acaaagtcct cagaatacaa aggtcctatt acggatgttt tctacaaaga aaacagttac 5880
acaacaacca taaaaccagt tacttataaa ttggatggtg ttgtttgtac agaaattgac 5940
cctaagttgg acaattatta taagaaagac aattcttatt tcacagagca accaattgat 6000
cttgtaccaa accaaccata tccaaacgca agcttcgata attttaagtt tgtatgtgat 6060
aatatcaaat ttgctgatga tttaaaccag ttaactggtt ataagaaacc tgcttcaaga 6120
gagcttaaag ttacattttt ccctgactta aatggtgatg tggtggctat tgattataaa 6180
cactacacac cctcttttaa gaaaggagct aaattgttac ataaacctat tgtttggcat 6240
gttaacaatg caactaataa agccacgtat aaaccaaata cctggtgtat acgttgtctt 6300
tggagcacaa aaccagttga aacatcaaat tcgtttgatg tactgaagtc agaggacgcg 6360
cagggaatgg ataatcttgt ctgcgaagat ctaaaaccag tctctgaaga agtagtggaa 6420
aatcctacca tacagaaaga cgttcttgag tgtaatgtga aaactaccga agttgtagga 6480
gacattatac ttaaaccagc aaataatagt ttaaaaatta cagaagaggt tggccacaca 6540
gatctaatgg ctgcttatgt agacaattct agtcttacta ttaagaaacc taatgaatta 6600
tctagagtat taggtttgaa aacccttgct actcatggtt tagctgctgt taatagtgtc 6660
ccttgggata ctatagctaa ttatgctaag ccttttctta acaaagttgt tagtacaact 6720
actaacatag ttacacggtg tttaaaccgt gtttgtacta attatatgcc ttatttcttt 6780
actttattgc tacaattgtg tacttttact agaagtacaa attctagaat taaagcatct 6840
atgccgacta ctatagcaaa gaatactgtt aagagtgtcg gtaaattttg tctagaggct 6900
tcatttaatt atttgaagtc acctaatttt tctaaactga taaatattat aatttggttt 6960
ttactattaa gtgtttgcct aggttcttta atctactcaa ccgctgcttt aggtgtttta 7020
atgtctaatt taggcatgcc ttcttactgt actggttaca gagaaggcta tttgaactct 7080
actaatgtca ctattgcaac ctactgtact ggttctatac cttgtagtgt ttgtcttagt 7140
ggtttagatt ctttagacac ctatccttct ttagaaacta tacaaattac catttcatct 7200
tttaaatggg atttaactgc ttttggctta gttgcagagt ggtttttggc atatattctt 7260
ttcactaggt ttttctatgt acttggattg gctgcaatca tgcaattgtt tttcagctat 7320
tttgcagtac attttattag taattcttgg cttatgtggt taataattaa tcttgtacaa 7380
atggccccga tttcagctat ggttagaatg tacatcttct ttgcatcatt ttattatgta 7440
tggaaaagtt atgtgcatgt tgtagacggt tgtaattcat caacttgtat gatgtgttac 7500
aaacgtaata gagcaacaag agtcgaatgt acaactattg ttaatggtgt tagaaggtcc 7560
ttttatgtct atgctaatgg aggtaaaggc ttttgcaaac tacacaattg gaattgtgtt 7620
aattgtgata cattctgtgc tggtagtaca tttattagtg atgaagttgc gagagacttg 7680
tcactacagt ttaaaagacc aataaatcct actgaccagt cttcttacat cgttgatagt 7740
gttacagtga agaatggttc catccatctt tactttgata aagctggtca aaagacttat 7800
gaaagacatt ctctctctca ttttgttaac ttagacaacc tgagagctaa taacactaaa 7860
ggttcattgc ctattaatgt tatagttttt gatggtaaat caaaatgtga agaatcatct 7920
gcaaaatcag cgtctgttta ctacagtcag cttatgtgtc aacctatact gttactagat 7980
caggcattag tgtctgatgt tggtgatagt gcggaagttg cagttaaaat gtttgatgct 8040
tacgttaata cgttttcatc aacttttaac gtaccaatgg aaaaactcaa aacactagtt 8100
gcaactgcag aagctgaact tgcaaagaat gtgtccttag acaatgtctt atctactttt 8160
atttcagcag ctcggcaagg gtttgttgat tcagatgtag aaactaaaga tgttgttgaa 8220
tgtcttaaat tgtcacatca atctgacata gaagttactg gcgatagttg taataactat 8280
atgctcacct ataacaaagt tgaaaacatg acaccccgtg accttggtgc ttgtattgac 8340
tgtagtgcgc gtcatattaa tgcgcaggta gcaaaaagtc acaacattgc tttgatatgg 8400
aacgttaaag atttcatgtc attgtctgaa caactacgaa aacaaatacg tagtgctgct 8460
aaaaagaata acttaccttt taagttgaca tgtgcaacta ctagacaagt tgttaatgtt 8520
gtaacaacaa agatagcact taagggtggt aaaattgtta ataattggtt gaagcagtta 8580
attaaagtta cacttgtgtt cctttttgtt gctgctattt tctatttaat aacacctgtt 8640
catgtcatgt ctaaacatac tgacttttca agtgaaatca taggatacaa ggctattgat 8700
ggtggtgtca ctcgtgacat agcatctaca gatacttgtt ttgctaacaa acatgctgat 8760
tttgacacat ggtttagcca gcgtggtggt agttatacta atgacaaagc ttgcccattg 8820
attgctgcag tcataacaag agaagtgggt tttgtcgtgc ctggtttgcc tggcacgata 8880
ttacgcacaa ctaatggtga ctttttgcat ttcttaccta gagtttttag tgcagttggt 8940
aacatctgtt acacaccatc aaaacttata gagtacactg actttgcaac atcagcttgt 9000
gttttggctg ctgaatgtac aatttttaaa gatgcttctg gtaagccagt accatattgt 9060
tatgatacca atgtactaga aggttctgtt gcttatgaaa gtttacgccc tgacacacgt 9120
tatgtgctca tggatggctc tattattcaa tttcctaaca cctaccttga aggttctgtt 9180
agagtggtaa caacttttga ttctgagtac tgtaggcacg gcacttgtga aagatcagaa 9240
gctggtgttt gtgtatctac tagtggtaga tgggtactta acaatgatta ttacagatct 9300
ttaccaggag ttttctgtgg tgtagatgct gtaaatttac ttactaatat gtttacacca 9360
ctaattcaac ctattggtgc tttggacata tcagcatcta tagtagctgg tggtattgta 9420
gctatcgtag taacatgcct tgcctactat tttatgaggt ttagaagagc ttttggtgaa 9480
tacagtcatg tagttgcctt taatacttta ctattcctta tgtcattcac tgtactctgt 9540
ttaacaccag tttactcatt cttacctggt gtttattctg ttatttactt gtacttgaca 9600
ttttatctta ctaatgatgt ttctttttta gcacatattc agtggatggt tatgttcaca 9660
cctttagtac ctttctggat aacaattgct tatatcattt gtatttccac aaagcatttc 9720
tattggttct ttactaatta cctaaagaga cgtgtagtct ttaatggtgt ttcctttagt 9780
acttttgaag aagctgcgct gtgcaccttt ttgttaaata aagaaatgta tctaaagttg 9840
cgtagtgatg tgctattacc tcttacgcaa tataatagat acttagctct ttataataag 9900
tacaagtatt ttagtggagc aatggataca actagctaca gagaagctgc ttgttgtcat 9960
ctcgcaaagg ctctcaatga cttcagtaac tcaggttctg atgttcttta ccaaccacca 10020
caaacctcta tcacctcagc tgttttgcag agtggtttta gaaaaatggc attcccatct 10080
ggtaaagttg agggttgtat ggtacaagta acttgtggta caactacact taacggtctt 10140
tggcttgatg acgtagttta ctgtccaaga catgtgatct gcacctctga agacatgctt 10200
aaccctaatt atgaagattt actcattcgt aagtctaatc ataatttctt ggtacaggct 10260
ggtaatgttc aactcagggt tattggacat tctatgcaaa attgtgtact taagcttaag 10320
gttgatacag ccaatcctaa gacacctaag tataagtttg ttcgcattca accaggacag 10380
actttttcag tgttagcttg ttacaatggt tcaccatctg gtgtttacca atgtgctatg 10440
aggcccaatt tcactattaa gggttcattc cttaatggtt catgtggtag tgttggtttt 10500
aacatagatt atgactgtgt ctctttttgt tacatgcacc atatggaatt accaactgga 10560
gttcatgctg gcacagactt agaaggtaac ttttatggac cttttgttga caggcaaaca 10620
gcacaagcag ctggtacgga cacaactatt acagttaatg ttttagcttg gttgtacgct 10680
gctgttataa atggagacag gtggtttctc aatcgattta ccacaactct taatgacttt 10740
aaccttgtgg ctatgaagta caattatgaa cctctaacac aagaccatgt tgacatacta 10800
ggacctcttt ctgctcaaac tggaattgcc gttttagata tgtgtgcttc attaaaagaa 10860
ttactgcaaa atggtatgaa tggacgtacc atattgggta gtgctttatt agaagatgaa 10920
tttacacctt ttgatgttgt tagacaatgc tcaggtgtta ctttccaaag tgcagtgaaa 10980
agaacaatca agggtacaca ccactggttg ttactcacaa ttttgacttc acttttagtt 11040
ttagtccaga gtactcaatg gtctttgttc ttttttttgt atgaaaatgc ctttttacct 11100
tttgctatgg gtattattgc tatgtctgct tttgcaatga tgtttgtcaa acataagcat 11160
gcatttctct gtttgttttt gttaccttct cttgccactg tagcttattt taatatggtc 11220
tatatgcctg ctagttgggt gatgcgtatt atgacatggt tggatatggt tgatactagt 11280
ttgtctggtt ttaagctaaa agactgtgtt atgtatgcat cagctgtagt gttactaatc 11340
cttatgacag caagaactgt gtatgatgat ggtgctagga gagtgtggac acttatgaat 11400
gtcttgacac tcgtttataa agtttattat ggtaatgctt tagatcaagc catttccatg 11460
tgggctctta taatctctgt tacttctaac tactcaggtg tagttacaac tgtcatgttt 11520
ttggccaaag gtattgtttt tatgtgtgtt gagtattgcc ctattttctt cataactggt 11580
aatacacttc agtgtataat gctagtttat tgtttcttag gctatttttg tacttgttac 11640
tttggcctct tttgtttact caaccgctac tttagactga ctcttggtgt ttatgattac 11700
ttagtttcta cacaggagtt tagatatatg aattcacagg gactactccc acccaagaat 11760
agcatagatg ccttcaaact caacattaaa ttgttgggtg ttggtggcaa accttgtatc 11820
aaagtagcca ctgtacagtc taaaatgtca gatgtaaagt gcacatcagt agtcttactc 11880
tcagttttgc aacaactcag agtagaatca tcatctaaat tgtgggctca atgtgtccag 11940
ttacacaatg acattctctt agctaaagat actactgaag cctttgaaaa aatggtttca 12000
ctactttctg ttttgctttc catgcagggt gctgtagaca taaacaagct ttgtgaagaa 12060
atgctggaca acagggcaac cttacaagct atagcctcag agtttagttc ccttccatca 12120
tatgcagctt ttgctactgc tcaagaagct tatgagcagg ctgttgctaa tggtgattct 12180
gaagttgttc ttaaaaagtt gaagaagtct ttgaatgtgg ctaaatctga atttgaccgt 12240
gatgcagcca tgcaacgtaa gttggaaaag atggctgatc aagctatgac ccaaatgtat 12300
aaacaggcta gatctgagga caagagggca aaagttacta gtgctatgca gacaatgctt 12360
ttcactatgc ttagaaagtt ggataatgat gcactcaaca acattatcaa caatgcaaga 12420
gatggttgtg ttcccttgaa cataatacct cttacaacag cagccaaatt aatggttgtc 12480
ataccagact ataacacata taaaaatacg tgtgatggta caacatttac ttatgcatca 12540
gcattgtggg aaatccaaca ggttgtagat gcagatagta aaattgttca acttagtgaa 12600
attagtatgg acaattcacc taatttagca tggcctctta ttgtaacagc tttaagggcc 12660
aattctgctg tcaaattaca gaataatgag cttagtcctg ttgcactacg acagatgtct 12720
tgtgctgccg gtactacaca aactgcttgc actgatgaca atgcgttagc ttactacaac 12780
acaacaaagg gaggtaggtt tgtacttgca ctgttatccg atttacagga tttgaaatgg 12840
gctagattcc ctaagagtga tggaactggt actatctata cagaactgga accaccttgt 12900
aggtttgtta cagacacacc taaaggtcct aaagtgaagt atttatactt tattaaagga 12960
ttaaacaacc taaatagagg tatggtactt ggtagtttag ctgccacagt acgtctacaa 13020
gctggtaatg caacagaagt gcctgccaat tcaactgtat tatctttctg tgcttttgct 13080
gtagatgctg ctaaagctta caaagattat ctagctagtg ggggacaacc aatcactaat 13140
tgtgttaaga tgttgtgtac acacactggt actggtcagg caataacagt tacaccggaa 13200
gccaatatgg atcaagaatc ctttggtggt gcatcgtgtt gtctgtactg ccgttgccac 13260
atagatcatc caaatcctaa aggattttgt gacttaaaag gtaagtatgt acaaatacct 13320
acaacttgtg ctaatgaccc tgtgggtttt acacttaaaa acacagtctg taccgtctgc 13380
ggtatgtgga aaggttatgg ctgtagttgt gatcaactcc gcgaacccat gcttcagtca 13440
gctgatgcac aatcgttttt aaacgggttt gcggtgtaag tgcagcccgt cttacaccgt 13500
gcggcacagg cactagtact gatgtcgtat acagggcttt tgacatctac aatgataaag 13560
tagctggttt tgctaaattc ctaaaaacta attgttgtcg cttccaagaa aaggacgaag 13620
atgacaattt aattgattct tactttgtag ttaagagaca cactttctct aactaccaac 13680
atgaagaaac aatttataat ttacttaaag attgtccagc tgttgctaaa catgacttct 13740
ttaagtttag aatagacggt gacatggtac cacatatatc acgtcaacgt cttactaaat 13800
acacaatggc agacctcgtc tatgctttaa ggcattttga tgaaggtaat tgtgacacat 13860
taaaagaaat acttgtcaca tacaattgtt gtgatgatga ttatttcaat aaaaaggact 13920
ggtatgattt tgtagaaaac ccagatatat tacgcgtata cgccaactta ggtgaacgtg 13980
tacgccaagc tttgttaaaa acagtacaat tctgtgatgc catgcgaaat gctggtattg 14040
ttggtgtact gacattagat aatcaagatc tcaatggtaa ctggtatgat ttcggtgatt 14100
tcatacaaac cacgccaggt agtggagttc ctgttgtaga ttcttattat tcattgttaa 14160
tgcctatatt aaccttgacc agggctttaa ctgcagagtc acatgttgac actgacttaa 14220
caaagcctta cattaagtgg gatttgttaa aatatgactt cacggaagag aggttaaaac 14280
tctttgaccg ttattttaaa tattgggatc agacatacca cccaaattgt gttaactgtt 14340
tggatgacag atgcattctg cattgtgcaa actttaatgt tttattctct acagtgttcc 14400
cacttacaag ttttggacca ctagtgagaa aaatatttgt tgatggtgtt ccatttgtag 14460
tttcaactgg ataccacttc agagagctag gtgttgtaca taatcaggat gtaaacttac 14520
atagctctag acttagtttt aaggaattac ttgtgtatgc tgctgaccct gctatgcacg 14580
ctgcttctgg taatctatta ctagataaac gcactacgtg cttttcagta gctgcactta 14640
ctaacaatgt tgcttttcaa actgtcaaac ccggtaattt taacaaagac ttctatgact 14700
ttgctgtgtc taagggtttc tttaaggaag gaagttctgt tgaattaaaa cacttcttct 14760
ttgctcagga tggtaatgct gctatcagcg attatgacta ctatcgttat aatctaccaa 14820
caatgtgtga tatcagacaa ctactatttg tagttgaagt tgttgataag tactttgatt 14880
gttacgatgg tggctgtatt aatgctaacc aagtcatcgt caacaaccta gacaaatcag 14940
ctggttttcc atttaataaa tggggtaagg ctagacttta ttatgattca atgagttatg 15000
aggatcaaga tgcacttttc gcatatacaa aacgtaatgt catccctact ataactcaaa 15060
tgaatcttaa gtatgccatt agtgcaaaga atagagctcg caccgtagct ggtgtctcta 15120
tctgtagtac tatgaccaat agacagtttc atcaaaaatt attgaaatca atagccgcca 15180
ctagaggagc tactgtagta attggaacaa gcaaattcta tggtggttgg cacaacatgt 15240
taaaaactgt ttatagtgat gtagaaaacc ctcaccttat gggttgggat tatcctaaat 15300
gtgatagagc catgcctaac atgcttagaa ttatggcctc acttgttctt gctcgcaaac 15360
atacaacgtg ttgtagcttg tcacaccgtt tctatagatt agctaatgag tgtgctcaag 15420
tattgagtga aatggtcatg tgtggcggtt cactatatgt taaaccaggt ggaacctcat 15480
caggagatgc cacaactgct tatgctaata gtgtttttaa catttgtcaa gctgtcacgg 15540
ccaatgttaa tgcactttta tctactgatg gtaacaaaat tgccgataag tatgtccgca 15600
atttacaaca cagactttat gagtgtctct atagaaatag agatgttgac acagactttg 15660
tgaatgagtt ttacgcatat ttgcgtaaac atttctcaat gatgatactc tctgacgatg 15720
ctgttgtgtg tttcaatagc acttatgcat ctcaaggtct agtggctagc ataaagaact 15780
ttaagtcagt tctttattat caaaacaatg tttttatgtc tgaagcaaaa tgttggactg 15840
agactgacct tactaaagga cctcatgaat tttgctctca acatacaatg ctagttaaac 15900
agggtgatga ttatgtgtac cttccttacc cagatccatc aagaatccta ggggccggct 15960
gttttgtaga tgatatcgta aaaacagatg gtacacttat gattgaacgg ttcgtgtctt 16020
tagctataga tgcttaccca cttactaaac atcctaatca ggagtatgct gatgtctttc 16080
atttgtactt acaatacata agaaagctac atgatgagtt aacaggacac atgttagaca 16140
tgtattctgt tatgcttact aatgataaca cttcaaggta ttgggaacct gagttttatg 16200
aggctatgta cacaccgcat acagtcttac aggctgttgg ggcttgtgtt ctttgcaatt 16260
cacagacttc attaagatgt ggtgcttgca tacgtagacc attcttatgt tgtaaatgct 16320
gttacgacca tgtcatatca acatcacata aattagtctt gtctgttaat ccgtatgttt 16380
gcaatgctct aggttgtgat gtcacagatg tgactcaact ttacttagga ggtatgagct 16440
attattgtaa atcacataaa ccacccatta gttttccatt gtgtgctaat ggacaagttt 16500
ttggtttata taaaaataca tgtgttggta gcgataatgt tactgacttt aatgcaattg 16560
caacatgtga ctggacaaat gctggtgatt acattttagc taacacctgt actgaaagac 16620
tcaagctttt tgcagcagaa acgctcaaag ctactgagga gacatttaaa ctgtcttatg 16680
gtattgctac tgtacgtgaa gtgctgtctg acagagaatt acatctttca tgggaagttg 16740
gtaaacctag accaccactt aaccgaaatt atgtctttac tggttatcgt gtaactaaaa 16800
acagtaaagt acaaatagga gagtacacct ttgaaaaagg tgactatggt gatgctgttg 16860
tttaccgagg tacaacaact tacaaattaa atgttggtga ttattttgtg ctgacatcac 16920
atacagtaat gccattaagt gcacctacac tagtgccaca agagcactat gttagaatta 16980
ctggcttata cccaacactc aatatctcat atgagttttc tagcaatgtt gcaaattatc 17040
aaaaggttgg tatgcaaaag tattctacac tccagggacc acctggtact ggtaagagtc 17100
attttgctat tggcctagct ctctactacc cttctgctcg catagtgtat acagcttgct 17160
ctcatgccgc tgttgatgca ctatgtgaga aggcattaaa atatttgcct atagataaat 17220
gtagtagaat tatacctgca cgtgctcgtg tagagtgttt tgataaattc aaagtgaatt 17280
caacattaga acagtatgtc ttttgtactg taaatgcatt gcctgagacg acagcagata 17340
tagttgtctt tgatgaaatt tcaatggcca caaattatga tttgagtgtt gtcaatgcca 17400
gattacgtgc taagcactat gtgtacattg gcgaccctgc tcaattacct gcaccacgca 17460
cattgctaac taagggcaca ctagaaccag aatatttcaa ttcagtgtgt agacttatga 17520
aaactatagg tccagacatg ttcctcggaa cttgtcggcg ttgtcctgct gaaattgttg 17580
acactgtgag tgctttggtt tatgataata agcttaaagc acataaagac aaatcagctc 17640
aatgctttaa aatgttttat aagggtgtta tcacgcatga tgtttcatct gcaattaaca 17700
ggccacaaat aggcgtggta agagaattcc ttacacgtaa ccctgcttgg agaaaagctg 17760
tctttatttc accttataat tcacagaatg ctgtagcctc aaagattttg ggactaccaa 17820
ctcaaactgt tgattcatca cagggctcag aatatgacta tgtcatattc actcaaacca 17880
ctgaaacagc tcactcttgt aatgtaaaca gatttaatgt tgctattacc agagcaaaag 17940
taggcatact ttgcataatg tctgatagag acctttatga caagttgcaa tttacaagtc 18000
ttgaaattcc acgtaggaat gtggcaactt tacaagctga aaatgtaaca ggactcttta 18060
aagattgtag taaggtaatc actgggttac atcctacaca ggcacctaca cacctcagtg 18120
ttgacactaa attcaaaact gaaggtttat gtgttgacat acctggcata cctaaggaca 18180
tgacctatag aagactcatc tctatgatgg gttttaaaat gaattatcaa gttaatggtt 18240
accctaacat gtttatcacc cgcgaagaag ctataagaca tgtacgtgca tggattggct 18300
tcgatgtcga ggggtgtcat gctactagag aagctgttgg taccaattta cctttacagc 18360
taggtttttc tacaggtgtt aacctagttg ctgtacctac aggttatgtt gatacaccta 18420
ataatacaga tttttccaga gttagtgcta aaccaccgcc tggagatcaa tttaaacacc 18480
tcataccact tatgtacaaa ggacttcctt ggaatgtagt gcgtataaag attgtacaaa 18540
tgttaagtga cacacttaaa aatctctctg acagagtcgt atttgtctta tgggcacatg 18600
gctttgagtt gacatctatg aagtattttg tgaaaatagg acctgagcgc acctgttgtc 18660
tatgtgatag acgtgccaca tgcttttcca ctgcttcaga cacttatgcc tgttggcatc 18720
attctattgg atttgattac gtctataatc cgtttatgat tgatgttcaa caatggggtt 18780
ttacaggtaa cctacaaagc aaccatgatc tgtattgtca agtccatggt aatgcacatg 18840
tagctagttg tgatgcaatc atgactaggt gtctagctgt ccacgagtgc tttgttaagc 18900
gtgttgactg gactattgaa tatcctataa ttggtgatga actgaagatt aatgcggctt 18960
gtagaaaggt tcaacacatg gttgttaaag ctgcattatt agcagacaaa ttcccagttc 19020
ttcacgacat tggtaaccct aaagctatta agtgtgtacc tcaagctgat gtagaatgga 19080
agttctatga tgcacagcct tgtagtgaca aagcttataa aatagaagaa ttattctatt 19140
cttatgccac acattctgac aaattcacag atggtgtatg cctattttgg aattgcaatg 19200
tcgatagata tcctgctaat tccattgttt gtagatttga cactagagtg ctatctaacc 19260
ttaacttgcc tggttgtgat ggtggcagtt tgtatgtaaa taaacatgca ttccacacac 19320
cagcttttga taaaagtgct tttgttaatt taaaacaatt accatttttc tattactctg 19380
acagtccatg tgagtctcat ggaaaacaag tagtgtcaga tatagattat gtaccactaa 19440
agtctgctac gtgtataaca cgttgcaatt taggtggtgc tgtctgtaga catcatgcta 19500
atgagtacag attgtatctc gatgcttata acatgatgat ctcagctggc tttagcttgt 19560
gggtttacaa acaatttgat acttataacc tctggaacac ttttacaaga cttcagagtt 19620
tagaaaatgt ggcttttaat gttgtaaata agggacactt tgatggacaa cagggtgaag 19680
taccagtttc tatcattaat aacactgttt acacaaaagt tgatggtgtt gatgtagaat 19740
tgtttgaaaa taaaacaaca ttacctgtta atgtagcatt tgagctttgg gctaagcgca 19800
acattaaacc agtaccagag gtgaaaatac tcaataattt gggtgtggac attgctgcta 19860
atactgtgat ctgggactac aaaagagatg ctccagcaca tatatctact attggtgttt 19920
gttctatgac tgacatagcc aagaaaccaa ctgaaacgat ttgtgcacca ctcactgtct 19980
tttttgatgg tagagttgat ggtcaagtag acttatttag aaatgcccgt aatggtgttc 20040
ttattacaga aggtagtgtt aaaggtttac aaccatctgt aggtcccaaa caagctagtc 20100
ttaatggagt cacattaatt ggagaagccg taaaaacaca gttcaattat tataagaaag 20160
ttgatggtgt tgtccaacaa ttacctgaaa cttactttac tcagagtaga aatttacaag 20220
aatttaaacc caggagtcaa atggaaattg atttcttaga attagctatg gatgaattca 20280
ttgaacggta taaattagaa ggctatgcct tcgaacatat cgtttatgga gattttagtc 20340
atagtcagtt aggtggttta catctactga ttggactagc taaacgtttt aaggaatcac 20400
cttttgaatt agaagatttt attcctatgg acagtacagt taaaaactat ttcataacag 20460
atgcgcaaac aggttcatct aagtgtgtgt gttctgttat tgatttatta cttgatgatt 20520
ttgttgaaat aataaaatcc caagatttat ctgtagtttc taaggttgtc aaagtgacta 20580
ttgactatac agaaatttca tttatgcttt ggtgtaaaga tggccatgta gaaacatttt 20640
acccaaaatt acaatctagt caagcgtggc aaccgggtgt tgctatgcct aatctttaca 20700
aaatgcaaag aatgctatta gaaaagtgtg accttcaaaa ttatggtgat agtgcaacat 20760
tacctaaagg cataatgatg aatgtcgcaa aatatactca actgtgtcaa tatttaaaca 20820
cattaacatt agctgtaccc tataatatga gagttataca ttttggtgct ggttctgata 20880
aaggagttgc accaggtaca gctgttttaa gacagtggtt gcctacgggt acgctgcttg 20940
tcgattcaga tcttaatgac tttgtctctg atgcagattc aactttgatt ggtgattgtg 21000
caactgtaca tacagctaat aaatgggatc tcattattag tgatatgtac gaccctaaga 21060
ctaaaaatgt tacaaaagaa aatgactcta aagagggttt tttcacttac atttgtgggt 21120
ttatacaaca aaagctagct cttggaggtt ccgtggctat aaagataaca gaacattctt 21180
ggaatgctga tctttataag ctcatgggac acttcgcatg gtggacagcc tttgttacta 21240
atgtgaatgc gtcatcatct gaagcatttt taattggatg taattatctt ggcaaaccac 21300
gcgaacaaat agatggttat gtcatgcatg caaattacat attttggagg aatacaaatc 21360
caattcagtt gtcttcctat tctttatttg acatgagtaa atttcccctt aaattaaggg 21420
gtactgctgt tatgtcttta aaagaaggtc aaatcaatga tatgatttta tctcttctta 21480
gtaaaggtag acttataatt agagaaaaca acagagttgt tatttctagt gatgttcttg 21540
ttaacaacta aacgaacaat gtttgttttt cttgttttat tgccactagt ctctattcag 21600
tgtgttaatc ttacaaccag aactcaatta ccccctgcat acactaattc tttcacacgt 21660
ggtgtttatt accctgacaa agttttcaga tcctcagttt tacattcaac tcaggacttg 21720
ttcttacctt tcttttccaa tgttacttgg ttccatgcta tacatgtctc tgggaccaat 21780
ggtactaaga ggtttgataa ccctgtccta ccatttaatg atggtgttta ttttgcttcc 21840
actgagaagt ctaacataat aagaggctgg atttttggta ctactttaga ttcgaagacc 21900
cagtccctac ttattgttaa taacgctact aatgttgtta ttaaagtctg tgaatttcaa 21960
ttttgtaatg atccattttt gggtgtttat taccacaaaa acaacaaaag ttgtatggaa 22020
agtgagttca gagtttattc tagtgcgaat aattgcactt ttgaatatgt ctctcagcct 22080
tttcttatgg accttgaagg aaaacagggt aatttcaaaa atcttaggga atttgtgttt 22140
aagaatattg atggttattt taaaatatat tctaagcaca cgcctattaa tttagtgcgt 22200
gatctccctc agggtttttc ggctttagaa ccattggtag atttgccaat aggtattaac 22260
atcactaggt ttcaaacttt acttgcttta catagaagtt atttgactcc tggtgattct 22320
tcttcaggtt ggacagctgg tgctgcagct tattatgtgg gttatcttca acctaggact 22380
tttctattaa aatataatga aaatggaacc attacagatg ctgtagactg tgcacttgac 22440
cctctctcag aaacaaagtg tacgttgaaa tccttcactg tagaaaaagg aatctatcaa 22500
acttctaact ttagagtcca accaacagaa tctattgtta gatttcctaa tattacaaac 22560
ttgtgccctt ttggtgaagt ttttaacgcc accagatttg catctgttta tgcttggaac 22620
aggaagagaa tcagcaactg tgttgctgat tattctgtcc tatataattc cgcatcattt 22680
tccactttta agtgttatgg agtgtctcct actaaattaa atgatctctg ctttactaat 22740
gtctatgcag attcatttgt aattagaggt gatgaagtca gacaaatcgc tccagggcaa 22800
actggaaaga ttgctgatta taattataaa ttaccagatg attttacagg ctgcgttata 22860
gcttggaatt ctaacaatct tgattctaag gttggtggta attataatta ccggtataga 22920
ttgtttagga agtctaatct caaacctttt gagagagata tttcaactga aatctatcag 22980
gccggtagca caccttgtaa tggtgttgaa ggttttaatt gttactttcc tttacaatca 23040
tatggtttcc aacccactaa tggtgttggt taccaaccat acagagtagt agtactttct 23100
tttgaacttc tacatgcacc agcaactgtt tgtggaccta aaaagtctac taatttggtt 23160
aaaaacaaat gtgtcaattt caacttcaat ggtttaacag gcacaggtgt tcttactgag 23220
tctaacaaaa agtttctgcc tttccaacaa tttggcagag acattgctga cactactgat 23280
gctgtccgtg atccacagac acttgagatt cttgacatta caccatgttc ttttggtggt 23340
gtcagtgtta taacaccagg aacaaatact tctaaccagg ttgctgttct ttatcagggt 23400
gttaactgca cagaagtccc tgttgctatt catgcagatc aacttactcc tacttggcgt 23460
gtttattcta caggttctaa tgtttttcaa acacgtgcag gctgtttaat aggggctgaa 23520
catgtcaaca actcatatga gtgtgacata cccattggtg caggtatatg cgctagttat 23580
cagactcaga ctaattctcc tcggcgggca cgtagtgtag ctagtcaatc catcattgcc 23640
tacactatgt cacttggtgc agaaaattca gttgcttact ctaataactc tattgccata 23700
cccacaaatt ttactattag tgttaccaca gaaattctac cagtgtctat gaccaagaca 23760
tcagtagatt gtacaatgta catttgtggt gattcaactg aatgcagcaa tcttttgttg 23820
caatatggca gtttttgtac acaattaaac cgtgctttaa ctggaatagc tgttgaacaa 23880
gacaaaaaca cccaagaagt ttttgcacaa gtcaaacaaa tttacaaaac accaccaatt 23940
aaagattttg gtggttttaa tttttcacaa atattaccag atccatcaaa accaagcaag 24000
aggtcattta ttgaagatct acttttcaac aaagtgacac ttgcagatgc tggcttcatc 24060
aaacaatatg gtgattgcct tggtgatatt gctgctagag acctcatttg tgcacaaaag 24120
tttaacggcc ttactgtttt gccacctttg ctcacagatg aaatgattgc tcaatacact 24180
tctgcactgt tagcgggtac aatcacttct ggttggacct ttggtgcagg tgctgcatta 24240
caaataccat ttgctatgca aatggcttat aggtttaatg gtattggagt tacacagaat 24300
gttctctatg agaaccaaaa attgattgcc aaccaattta atagtgctat tggcaaaatt 24360
caagactcac tttcttccac agcaagtgca cttggaaaac ttcaagatgt ggtcaaccaa 24420
aatgcacaag ctttaaacac gcttgttaaa caacttagct ccaattttgg tgcaatttca 24480
agtgttttaa atgatatcct ttcacgtctt gacaaagttg aggctgaagt gcaaattgat 24540
aggttgatca caggcagact tcaaagtttg cagacatatg tgactcaaca attaattaga 24600
gctgcagaaa tcagagcttc tgctaatctt gctgctacta aaatgtcaga gtgtgtactt 24660
ggacaatcaa aaagagttga tttttgtgga aagggctatc atcttatgtc cttccctcag 24720
tcagcacctc atggtgtagt cttcttgcat gtgacttatg tccctgcaca agaaaagaac 24780
ttcacaactg ctcctgccat ttgtcatgat ggaaaagcac actttcctcg tgaaggtgtc 24840
tttgtttcaa atggcacaca ctggtttgta acacaaagga atttttatga accacaaatc 24900
attactacag acaacacatt tgtgtctggt aactgtgatg ttgtaatagg aattgtcaac 24960
aacacagttt atgatccttt gcaacctgaa ttagactcat tcaaggagga gttagataaa 25020
tattttaaga atcatacatc accagatgtt gatttaggtg acatctctgg cattaatgct 25080
tcagttgtaa acattcaaaa agaaattgac cgcctcaatg aggttgccaa gaatttaaat 25140
gaatctctca tcgatctcca agaacttgga aagtatgagc agtatataaa atggccatgg 25200
tacatttggc taggttttat agctggcttg attgccatag taatggtgac aattatgctt 25260
tgctgtatga ccagttgctg tagttgtctc aagggctgtt gttcttgtgg atcctgctgc 25320
aaatttgatg aagacgactc tgagccagtg ctcaaaggag tcaaattaca ttacacataa 25380
acgaacttat ggatttgttt atgagaatct tcacaattgg aactgtaact ttgaagcaag 25440
gtgaaatcaa ggatgctact ccttcagatt ttgttcgcgc tactgcaacg ataccgatac 25500
aagcctcact ccctttcgga tggcttattg ttggcgttgc acttcttgct gtttttcata 25560
gcgcttccaa aatcataacc ctcaaaaaga gatggcaact agcactctcc aagggtgttc 25620
actttgtttg caacttgctg ttgttgtttg taacagttta ctcacacctt ttgctcgttg 25680
ctgctggcct tgaagcccct tttctctatc tttatgcttt agtctacttc ttgcagagta 25740
taaactttgt aagaataata atgaggcttt ggctttgctg gaaatgccgt tccaaaaacc 25800
cattacttta tgatgccaac tattttcttt gctggcatac taattgttac gactattgta 25860
taccttacaa tagtgtaact tcttcaattg tcattacttc aggtgatggc acaacaagtc 25920
ctatttctga acatgactac cagattggtg gttatactga aaaatgggaa tctggagtaa 25980
aagactgtgt tgtattacac agttacttca cttcagacta ttaccagctg tactcaactc 26040
aattgagtac agacactggt gttgaacatg ttaccttctt catctacaat aaaattgttg 26100
atgagcctga agaacatgtc caaattcaca caatcgacgg ttcatccgga gttgttaatc 26160
cagtaatgga accaatttat gatgaaccga cgacgactac tagcgtgcct ttgtaagcac 26220
aagctgatga gtacgaactt atgtactcat tcgtttcgga agagacaggt acgttaatag 26280
ttaatagcgt acttcttttt cttgctttcg tggtattctt gctagttaca ctagccatcc 26340
ttactgcgct tcgattgtgt gcgtactgct gcaatattgt taacgtgagt cttgtaaaac 26400
cttcttttta cgtttactct cgtgttaaaa atctgaattc ttctagagtt cctgatcttc 26460
tggtctaaac gaactaaata ttatattagt ttttctgttt ggaactttaa ttttagccat 26520
ggtagattcc aacggtacta ttaccgttga agagcttaaa aagctccttg aacaatggaa 26580
cctagtaata ggtttcctat tccttacatg gatttgtctt ctacaatttg cctatgccaa 26640
caggaatagg tttttgtata taattaagtt aatttttctc tggctgttat ggccagtaac 26700
tttagcttgt tttgtgcttg ctgctgttta cagaataaat tggatcaccg gtggaattgc 26760
tatcgcaatg gcttgtcttg taggcttgat gtggctcagc tacttcattg cttctttcag 26820
actgtttgcg cgtacgcgtt ccatgtggtc attcaatcca gaaactaaca ttcttctcaa 26880
cgtgccactc catggcacta ttctgaccag accgcttcta gaaagtgaac tcgtaatcgg 26940
agctgtgatc cttcgtggac atcttcgtat tgctggacac catctaggac gctgtgacat 27000
caaggacctg cctaaagaaa tcactgttgc tacatcacga acgctttctt attacaaatt 27060
gggagcttcg cagcgtgtag caggtgactc aggttttgct gcatacagtc gctacaggat 27120
tggcaactat aaattaaaca cagaccattc cagtagcagt gacaatattg ctttgcttgt 27180
acagtaagtg acaacagatg tttcatctcg ttgactttca ggttactata gcagagatat 27240
tactaattat tatgaggact tttaaagttt ccatttggaa tcttgattac atcataaacc 27300
tcataattaa aaatttatct aagtcactaa ctgagaataa atattctcaa ttagatgaag 27360
agcaaccaat ggagattgat taaacgaaca tgaaaattat tcttttcttg gcactgataa 27420
cactcgctac ttgtgagctt tatcactacc aagagtgtgt tagaggtaca acagtacttt 27480
taaaagaacc ttgctcttct ggaacatacg agggcaattc accatttcat cctctagctg 27540
ataacaaatt tgcactgact tgctttagca ctcaatttgc ttttgcttgt cctgacggcg 27600
taaaacacgt ctatcagtta cgtgccagat cagtttcacc taaactgttc atcagacaag 27660
aggaagttca agaactttac tctccaattt ttcttattgt tgcggcaata gtgtttataa 27720
cactttgctt cacactcaaa agaaagacag aatgattgaa ctttcattaa ttgacttcta 27780
tttgtgcttt ttagcctttc tgctattcct tgttttaatt atgcttatta tcttttggtt 27840
ctcacttgaa ctgcaagatc ataatgaaac ttgtcacgcc taaacgaaca tgaaatttct 27900
tgttttctta ggaatcatca caactgtagc tgcatttcac caagaatgta gtttacagtc 27960
atgtactcaa catcaaccat atgtagttga tgacccgtgt cctattcact tctattctaa 28020
atggtatatt agagtaggag ctagaaaatc agcaccttta attgaattgt gcgtggatga 28080
ggctggttct aaatcaccca ttcagtacat cgatatcggt aattatacag tttcctgttt 28140
accttttaca attaattgcc aggaacctaa attgggtagt cttgtagtgc gttgttcgtt 28200
ctatgaagac tttttagagt atcatgacgt tcgtgttgtt ttagatttca tctaaacgaa 28260
caaactataa tgtctgataa tggaccccaa aatcagcgaa atgcaccccg cattacgttt 28320
ggtggaccct cagattcaac tggcagtaac cagaatggag aacgcagtgg ggcgcgatca 28380
aaacaacgtc ggccccaagg tttacccaat aatactgcgt cttggttcac cgctctcact 28440
caacatggca aggaagacct taaattccct cgaggacaag gcgttccaat taacaccaat 28500
agcagtccag atgaccaaat tggctactac cgaagagcta ccagacgaat tcgtggtggt 28560
gacggtaaaa tgaaagatct cagtccaaga tggtatttct actacctagg aactgggcca 28620
gaagctggac ttccctatgg tgctaacaaa gacggcatca tatgggttgc aactgaggga 28680
gccttgaata caccaaaaga tcacattggc acccgcaatc ctgctaacaa tgctgcaatc 28740
gtgctacaac ttcctcaagg aacaacattg ccaaaaggct tctacgcaga agggagcaga 28800
ggcggcagtc aagcctcttc tcgttcctca tcacgtagtc gcaacagttc aagaaattca 28860
actccaggca gcagtagggg aatttctcct gctagaatgg ctggcaatgg cggtgatgct 28920
gctcttgctt tgctgctgct tgacagattg aaccagcttg agagcaaaat gtctggtaaa 28980
ggccaacaac aacaaggcca aactgtcact aagaaatctg ctgctgaggc ttctaagaag 29040
cctcggcaaa aacgtactgc cactaaagca tacaatgtaa cacaagcttt cggcagacgt 29100
ggtccagaac aaacccaagg aaattttggg gaccaggaac taatcagaca aggaactgat 29160
tacaaacatt ggccgcaaat tgcacaattt gcccccagcg cttcagcgtt cttcggaatg 29220
tcgcgcattg gcatggaagt cacaccttcg ggaacgtggt tgacctacac aggtgccatc 29280
aaattggatg acaaagatcc aaatttcaaa gatcaagtca ttttgctgaa taagcatatt 29340
gacgcataca aaacatttcc accaacagag cctaaaaagg acaaaaagaa gaaggctgat 29400
gaaactcaag ccttaccgca gagacagaag aaacagcaaa ctgtgactct tcttcctgct 29460
gcagatttgg atgatttctc caaacaattg caacaatcca tgagcagtgc tgactcaact 29520
caggcctaaa ctcatgcaga ccacacaagg cagatgggct atataaacgt tttcgctttt 29580
ccgtttacga tatatagtct actcttgtgc agaatgaatt ctcgtaacta catagcacaa 29640
gtagatgtag ttaactttaa tctcacatag caatctttaa tcagtgtgta acattaggga 29700
ggacttgaaa gagccaccac attttcaccg aggccacgcg gagtacgatc gagtgtacag 29760
tgaacaatgc tagggagagc tgcctatatg gaagagccct aatgtgtaaa attaatttta 29820
gtagtgctat cc 29832
<210> 25
<211> 1273
<212> PRT
<213> SARS-CoV2
<400> 25
Met Phe Val Phe Leu Val Leu Leu Pro Leu Val Ser Ile Gln Cys Val
1 5 10 15
Asn Leu Thr Thr Arg Thr Gln Leu Pro Pro Ala Tyr Thr Asn Ser Phe
20 25 30
Thr Arg Gly Val Tyr Tyr Pro Asp Lys Val Phe Arg Ser Ser Val Leu
35 40 45
His Ser Thr Gln Asp Leu Phe Leu Pro Phe Phe Ser Asn Val Thr Trp
50 55 60
Phe His Ala Ile His Val Ser Gly Thr Asn Gly Thr Lys Arg Phe Asp
65 70 75 80
Asn Pro Val Leu Pro Phe Asn Asp Gly Val Tyr Phe Ala Ser Thr Glu
85 90 95
Lys Ser Asn Ile Ile Arg Gly Trp Ile Phe Gly Thr Thr Leu Asp Ser
100 105 110
Lys Thr Gln Ser Leu Leu Ile Val Asn Asn Ala Thr Asn Val Val Ile
115 120 125
Lys Val Cys Glu Phe Gln Phe Cys Asn Asp Pro Phe Leu Gly Val Tyr
130 135 140
Tyr His Lys Asn Asn Lys Ser Cys Met Glu Ser Glu Phe Arg Val Tyr
145 150 155 160
Ser Ser Ala Asn Asn Cys Thr Phe Glu Tyr Val Ser Gln Pro Phe Leu
165 170 175
Met Asp Leu Glu Gly Lys Gln Gly Asn Phe Lys Asn Leu Arg Glu Phe
180 185 190
Val Phe Lys Asn Ile Asp Gly Tyr Phe Lys Ile Tyr Ser Lys His Thr
195 200 205
Pro Ile Asn Leu Val Arg Asp Leu Pro Gln Gly Phe Ser Ala Leu Glu
210 215 220
Pro Leu Val Asp Leu Pro Ile Gly Ile Asn Ile Thr Arg Phe Gln Thr
225 230 235 240
Leu Leu Ala Leu His Arg Ser Tyr Leu Thr Pro Gly Asp Ser Ser Ser
245 250 255
Gly Trp Thr Ala Gly Ala Ala Ala Tyr Tyr Val Gly Tyr Leu Gln Pro
260 265 270
Arg Thr Phe Leu Leu Lys Tyr Asn Glu Asn Gly Thr Ile Thr Asp Ala
275 280 285
Val Asp Cys Ala Leu Asp Pro Leu Ser Glu Thr Lys Cys Thr Leu Lys
290 295 300
Ser Phe Thr Val Glu Lys Gly Ile Tyr Gln Thr Ser Asn Phe Arg Val
305 310 315 320
Gln Pro Thr Glu Ser Ile Val Arg Phe Pro Asn Ile Thr Asn Leu Cys
325 330 335
Pro Phe Gly Glu Val Phe Asn Ala Thr Arg Phe Ala Ser Val Tyr Ala
340 345 350
Trp Asn Arg Lys Arg Ile Ser Asn Cys Val Ala Asp Tyr Ser Val Leu
355 360 365
Tyr Asn Ser Ala Ser Phe Ser Thr Phe Lys Cys Tyr Gly Val Ser Pro
370 375 380
Thr Lys Leu Asn Asp Leu Cys Phe Thr Asn Val Tyr Ala Asp Ser Phe
385 390 395 400
Val Ile Arg Gly Asp Glu Val Arg Gln Ile Ala Pro Gly Gln Thr Gly
405 410 415
Lys Ile Ala Asp Tyr Asn Tyr Lys Leu Pro Asp Asp Phe Thr Gly Cys
420 425 430
Val Ile Ala Trp Asn Ser Asn Asn Leu Asp Ser Lys Val Gly Gly Asn
435 440 445
Tyr Asn Tyr Arg Tyr Arg Leu Phe Arg Lys Ser Asn Leu Lys Pro Phe
450 455 460
Glu Arg Asp Ile Ser Thr Glu Ile Tyr Gln Ala Gly Ser Thr Pro Cys
465 470 475 480
Asn Gly Val Glu Gly Phe Asn Cys Tyr Phe Pro Leu Gln Ser Tyr Gly
485 490 495
Phe Gln Pro Thr Asn Gly Val Gly Tyr Gln Pro Tyr Arg Val Val Val
500 505 510
Leu Ser Phe Glu Leu Leu His Ala Pro Ala Thr Val Cys Gly Pro Lys
515 520 525
Lys Ser Thr Asn Leu Val Lys Asn Lys Cys Val Asn Phe Asn Phe Asn
530 535 540
Gly Leu Thr Gly Thr Gly Val Leu Thr Glu Ser Asn Lys Lys Phe Leu
545 550 555 560
Pro Phe Gln Gln Phe Gly Arg Asp Ile Ala Asp Thr Thr Asp Ala Val
565 570 575
Arg Asp Pro Gln Thr Leu Glu Ile Leu Asp Ile Thr Pro Cys Ser Phe
580 585 590
Gly Gly Val Ser Val Ile Thr Pro Gly Thr Asn Thr Ser Asn Gln Val
595 600 605
Ala Val Leu Tyr Gln Gly Val Asn Cys Thr Glu Val Pro Val Ala Ile
610 615 620
His Ala Asp Gln Leu Thr Pro Thr Trp Arg Val Tyr Ser Thr Gly Ser
625 630 635 640
Asn Val Phe Gln Thr Arg Ala Gly Cys Leu Ile Gly Ala Glu His Val
645 650 655
Asn Asn Ser Tyr Glu Cys Asp Ile Pro Ile Gly Ala Gly Ile Cys Ala
660 665 670
Ser Tyr Gln Thr Gln Thr Asn Ser Pro Arg Arg Ala Arg Ser Val Ala
675 680 685
Ser Gln Ser Ile Ile Ala Tyr Thr Met Ser Leu Gly Ala Glu Asn Ser
690 695 700
Val Ala Tyr Ser Asn Asn Ser Ile Ala Ile Pro Thr Asn Phe Thr Ile
705 710 715 720
Ser Val Thr Thr Glu Ile Leu Pro Val Ser Met Thr Lys Thr Ser Val
725 730 735
Asp Cys Thr Met Tyr Ile Cys Gly Asp Ser Thr Glu Cys Ser Asn Leu
740 745 750
Leu Leu Gln Tyr Gly Ser Phe Cys Thr Gln Leu Asn Arg Ala Leu Thr
755 760 765
Gly Ile Ala Val Glu Gln Asp Lys Asn Thr Gln Glu Val Phe Ala Gln
770 775 780
Val Lys Gln Ile Tyr Lys Thr Pro Pro Ile Lys Asp Phe Gly Gly Phe
785 790 795 800
Asn Phe Ser Gln Ile Leu Pro Asp Pro Ser Lys Pro Ser Lys Arg Ser
805 810 815
Phe Ile Glu Asp Leu Leu Phe Asn Lys Val Thr Leu Ala Asp Ala Gly
820 825 830
Phe Ile Lys Gln Tyr Gly Asp Cys Leu Gly Asp Ile Ala Ala Arg Asp
835 840 845
Leu Ile Cys Ala Gln Lys Phe Asn Gly Leu Thr Val Leu Pro Pro Leu
850 855 860
Leu Thr Asp Glu Met Ile Ala Gln Tyr Thr Ser Ala Leu Leu Ala Gly
865 870 875 880
Thr Ile Thr Ser Gly Trp Thr Phe Gly Ala Gly Ala Ala Leu Gln Ile
885 890 895
Pro Phe Ala Met Gln Met Ala Tyr Arg Phe Asn Gly Ile Gly Val Thr
900 905 910
Gln Asn Val Leu Tyr Glu Asn Gln Lys Leu Ile Ala Asn Gln Phe Asn
915 920 925
Ser Ala Ile Gly Lys Ile Gln Asp Ser Leu Ser Ser Thr Ala Ser Ala
930 935 940
Leu Gly Lys Leu Gln Asp Val Val Asn Gln Asn Ala Gln Ala Leu Asn
945 950 955 960
Thr Leu Val Lys Gln Leu Ser Ser Asn Phe Gly Ala Ile Ser Ser Val
965 970 975
Leu Asn Asp Ile Leu Ser Arg Leu Asp Lys Val Glu Ala Glu Val Gln
980 985 990
Ile Asp Arg Leu Ile Thr Gly Arg Leu Gln Ser Leu Gln Thr Tyr Val
995 1000 1005
Thr Gln Gln Leu Ile Arg Ala Ala Glu Ile Arg Ala Ser Ala Asn Leu
1010 1015 1020
Ala Ala Thr Lys Met Ser Glu Cys Val Leu Gly Gln Ser Lys Arg Val
1025 1030 1035 1040
Asp Phe Cys Gly Lys Gly Tyr His Leu Met Ser Phe Pro Gln Ser Ala
1045 1050 1055
Pro His Gly Val Val Phe Leu His Val Thr Tyr Val Pro Ala Gln Glu
1060 1065 1070
Lys Asn Phe Thr Thr Ala Pro Ala Ile Cys His Asp Gly Lys Ala His
1075 1080 1085
Phe Pro Arg Glu Gly Val Phe Val Ser Asn Gly Thr His Trp Phe Val
1090 1095 1100
Thr Gln Arg Asn Phe Tyr Glu Pro Gln Ile Ile Thr Thr Asp Asn Thr
1105 1110 1115 1120
Phe Val Ser Gly Asn Cys Asp Val Val Ile Gly Ile Val Asn Asn Thr
1125 1130 1135
Val Tyr Asp Pro Leu Gln Pro Glu Leu Asp Ser Phe Lys Glu Glu Leu
1140 1145 1150
Asp Lys Tyr Phe Lys Asn His Thr Ser Pro Asp Val Asp Leu Gly Asp
1155 1160 1165
Ile Ser Gly Ile Asn Ala Ser Val Val Asn Ile Gln Lys Glu Ile Asp
1170 1175 1180
Arg Leu Asn Glu Val Ala Lys Asn Leu Asn Glu Ser Leu Ile Asp Leu
1185 1190 1195 1200
Gln Glu Leu Gly Lys Tyr Glu Gln Tyr Ile Lys Trp Pro Trp Tyr Ile
1205 1210 1215
Trp Leu Gly Phe Ile Ala Gly Leu Ile Ala Ile Val Met Val Thr Ile
1220 1225 1230
Met Leu Cys Cys Met Thr Ser Cys Cys Ser Cys Leu Lys Gly Cys Cys
1235 1240 1245
Ser Cys Gly Ser Cys Cys Lys Phe Asp Glu Asp Asp Ser Glu Pro Val
1250 1255 1260
Leu Lys Gly Val Lys Leu His Tyr Thr
1265 1270
<210> 26
<211> 29816
<212> DNA
<213> SARS-CoV2
<400> 26
actttcgatc tcttgtagat ctgttctcta aacgaacttt aaaatctgtg tggctgtcac 60
tcggctgcat gcttagtgca ctcacgcagt ataattaata actaattact gtcgttgaca 120
ggacacgagt aactcgtcta tcttctgcag gctgcttacg gtttcgtccg tgttgcagcc 180
gatcatcagc acatctaggt tttgtccggg tgtgaccgaa aggtaagatg gagagccttg 240
tccctggttt caacgagaaa acacacgtcc aactcagttt gcctgtttta caggttcgcg 300
acgtgctcgt acgtggcttt ggagactccg tggaggaggt cttatcagag gcacgtcaac 360
atcttaaaga tggcacttgt ggcttagtag aagttgaaaa aggcgttttg cctcaacttg 420
aacagcccta tgtgttcatc aaacgttcgg atgctcgaac tgcacctcat ggtcatgtta 480
tggttgagct ggtagcagaa ctcgaaggca ttcagtacgg tcgtagtggt gagacacttg 540
gtgtccttgt ccctcatgtg ggcgaaatac cagtggctta ccgcaaggtt cttcttcgta 600
agaacggtaa taaaggagct ggtggccata gttacggcgc cgatctaaag tcatttgact 660
taggcgacga gcttggcact gatccttatg aagattttca agaaaactgg aacactaaac 720
atagcagtgg tgttacccgt gaactcatgc gtgagcttaa cggaggggca tacactcgct 780
atgtcgataa caacttctgt ggccctgatg gctaccctct tgagtgcatt aaagaccttc 840
tagcacgtgc tggtaaagct tcatgcactt tgtccgaaca actggacttt attgacacta 900
agaggggtgt atactgctgc cgtgaacatg agcatgaaat tgcttggtac acggaacgtt 960
ctgaaaagag ctatgaattg cagacacctt ttgaaattaa attggcaaag aaatttgaca 1020
tcttcaatgg ggaatgtcca aattttgtat ttcccttaaa ttccataatc aagactattc 1080
aaccaagggt tgaaaagaaa aagcttgatg gctttatggg tagaattcga tctgtctatc 1140
cagttgcgtc accaaatgaa tgcaaccaaa tgtgcctttc aactctcatg aagtgtgatc 1200
attgtggtga aacttcatgg cagacgggcg attttgttaa agccacttgc gaattttgtg 1260
gcactgagaa tttgactaaa gaaggtgcca ctacttgtgg ttacttaccc caaaatgctg 1320
ttgttaaaat ttattgtcca gcatgtcaca attcagaagt aggacctgag catagtcttg 1380
ccgaatacca taatgaatct ggcttgaaaa ccattcttcg taagggtggt cgcactattg 1440
cctttggagg ctgtgtgttc tcttatgttg gttgccataa caagtgtgcc tattgggttc 1500
cacgtgctag cgctaacata ggttgtaacc atacaggtgt tgttggagaa ggttccgaag 1560
gtcttaatga caaccttctt gaaatactcc aaaaagagaa agtcaacatc aatattgttg 1620
gtgactttaa acttaatgaa gagatcgcca ttattttggc atctttttct gcttccacaa 1680
gtgcttttgt ggaaactgtg aaaggtttgg attataaagc attcaaacaa attgttgaat 1740
cctgtggtaa ttttaaagtt acaaaaggaa aagctaaaaa aggtgcctgg aatattggtg 1800
aacagaaatc aatactgagt cctctttatg catttgcatc agaggctgct cgtgttgtac 1860
gatcaatttt ctcccgcact cttgaaactg ctcaaaattc tgtgcgtgtt ttacagaagg 1920
ccgctataac aatactagat ggaatttcac agtattcact gagactcatt gatgctatga 1980
tgttcacatc tgatttggct actaacaatc tagttgtaat ggcctacatt acaggtggtg 2040
ttgttcagtt gacttcgcag tggctaacta acatctttgg cactgtttat gaaaaactca 2100
aacccgtcct tgattggctt gaagagaagt ttaaggaagg tgtagagttt cttagagacg 2160
gttgggaaat tgttaaattt atctcaacct gtgcttgtga aattgtcggt ggacaaattg 2220
tcacctgtgc aaaggaaatt aaggagagtg ttcagacatt ctttaagctt gtaaataaat 2280
ttttggcttt gtgtgctgac tctatcatta ttggtggagc taaacttaaa gccttgaatt 2340
taggtgaaac atttgttacg cactcaaagg gattgtacag aaagtgtgtt aaatccagag 2400
aagaaactgg cctactcatg cctctaaaag ccccaaaaga aattatcttc ttagagggag 2460
aaacacttcc cacagaagtg ttaacagagg aagttgtctt gaaaactggt gatttacaac 2520
cattagaaca acctactagt gaagctgttg aagctccact ggttggtaca ccagtttgta 2580
ttaacgggct tatgttgctc gaaatcaaag acacagaaaa gtactgtgcc cttgcaccta 2640
atatgatggt aacaaacaat accttcacac tcaaaggcgg tgcaccaaca aaggttactt 2700
ttggtgatga cactgtgata gaagtgcaag gttacaagag tgtgaatatc acttttgaac 2760
ttgatgaaag gattgataaa gtacttaatg agaagtgctc tgcctataca gttgaactcg 2820
gtacagaagt aaatgagttc gcctgtgttg tggcagatgc tgtcataaaa actttgcaac 2880
cagtatctga attacttaca ccactgggca ttgatttaga tgagtggagt atggctacat 2940
actacttatt tgatgagtct ggtgagttta aattggcttc acatatgtat tgttcttttt 3000
accctccaga tgaggatgaa gaagaaggtg attgtgaaga agaagagttt gagccatcaa 3060
ctcaatatga gtatggtact gaagatgatt accaaggtaa acctttggaa tttggtgcca 3120
cttctgctgc tcttcaacct gaagaagagc aagaagaaga ttggttagat gatgatagtc 3180
aacaaactgt tggtcaacaa gacggcagtg aggacaatca gacaactact attcaaacaa 3240
ttgttgaggt tcaacctcaa ttagagatgg aacttacacc agttgttcag actattgaag 3300
tgaatagttt tagtggttat ttaaaactta ctgacaatgt atacattaaa aatgcagaca 3360
ttgtggaaga agctaaaaag gtaaaaccaa cagtggttgt taatgcagcc aatgtttacc 3420
ttaaacatgg aggaggtgtt gcaggagcct taaataaggc tactaacaat gccatgcaag 3480
ttgaatctga tgattacata gctactaatg gaccacttaa agtgggtggt agttgtgttt 3540
taagcggaca caatcttgct aaacactgtc ttcatgttgt cggcccaaat gttaacaaag 3600
gtgaagacat tcaacttctt aagagtgctt atgaaaattt taatcagcac gaagttctac 3660
ttgcaccatt attatcagct ggtatttttg gtgctgaccc tatacattct ttaagagttt 3720
gtgtagatac tgttcgcaca aatgtctact tagctgtctt tgataaaaat ctctatgaca 3780
aacttgtttc aagctttttg gaaatgaaga gtgaaaagca agttgaacaa aagatcgctg 3840
agattcctaa agaggaagtt aagccattta taactgaaag taaaccttca gttgaacaga 3900
gaaaacaaga tgataagaaa atcaaagctt gtgttgaaga agttacaaca actctggaag 3960
aaactaagtt cctcacagaa aacttgttac tttatattga cattaatggc aatcttcatc 4020
cagattctgc cactcttgtt agtgacattg acatcacttt cttaaagaaa gatgctccat 4080
atatagtggg tgatgttgtt caagagggtg ttttaactgc tgtggttata cctactaaaa 4140
aggctggtgg cactactgaa atgctagcga aagctttgag aaaagtgcca acagacaatt 4200
atataaccac ttacccgggt cagggtttaa atggttacac tgtagaggag gcaaagacag 4260
tgcttaaaaa gtgtaaaagt gccttttaca ttctaccatc tattatctct aatgagaagc 4320
aagaaattct tggaactgtt tcttggaatt tgcgagaaat gcttgcacat gcagaagaaa 4380
cacgcaaatt aatgcctgtc tgtgtggaaa ctaaagccat agtttcaact atacagcgta 4440
aatataaggg tattaaaata caagagggtg tggttgatta tggtgctaga ttttactttt 4500
acaccagtaa aacaactgta gcgtcactta tcaacacact taacgatcta aatgaaactc 4560
ttgttacaat gccacttggc tatgtaacac atggcttaaa tttggaagaa gctgctcggt 4620
atatgagatc tctcaaagtg ccagctacag tttctgtttc ttcacctgat gctgttacag 4680
cgtataatgg ttatcttact tcttcttcta aaacacctga agaacatttt attgaaacca 4740
tctcacttgc tggttcctat aaagattggt cctattctgg acaatctaca caactaggta 4800
tagaatttct taagagaggt gataaaagtg tatattacac tagtaatcct accacattcc 4860
acctagatgg tgaagttatc acctttgaca atcttaagac acttctttct ttgagagaag 4920
tgaggactat taaggtgttt acaacagtag acaacattaa cctccacacg caagttgtgg 4980
acatgtcaat gacatatgga caacagtttg gtccaactta tttggatgga gctgatgtta 5040
ctaaaataaa acctcataat tcacatgaag gtaaaacatt ttatgtttta cctaatgatg 5100
acactctacg tgttgaggct tttgagtact accacacaac tgatcctagt tttctgggta 5160
ggtacatgtc agcattaaat cacactaaaa agtggaaata cccacaagtt aatggtttaa 5220
cttctattaa atgggcagat aacaactgtt atcttgccac tgcattgtta acactccaac 5280
aaatagagtt gaagtttaat ccacctgctc tacaagatgc ttattacaga gcaagggctg 5340
gtgaagctgc taacttttgt gcacttatct tagcctactg taataagaca gtaggtgagt 5400
taggtgatgt tagagaaaca atgagttact tgtttcaaca tgccaattta gattcttgca 5460
aaagagtctt gaacgtggtg tgtaaaactt gtggacaaca gcagacaacc cttaagggtg 5520
tagaagctgt tatgtacatg ggcacacttt cttatgaaca atttaagaaa ggtgttcaga 5580
taccttgtac gtgtggtaaa caagctacaa aatatctagt acaacaggag tcaccttttg 5640
ttatgatgtc agcaccacct gctcagtatg aacttaagca tggtacattt acttgtgcta 5700
gtgagtacac tggtaattac cagtgtggtc actataaaca tataacttct aaagaaactt 5760
tgtattgcat agacggtgct ttacttacaa agtcctcaga atacaaaggt cctattacgg 5820
atgttttcta caaagaaaac agttacacaa caaccataaa accagttact tataaattgg 5880
atggtgttgt ttgtacagaa attgacccta agttggacaa ttattataag aaagacaatt 5940
cttatttcac agagcaacca attgatcttg taccaaacca accatatcca aacgcaagct 6000
tcgataattt taagtttgta tgtgataata tcaaatttgc tgatgattta aaccagttaa 6060
ctggttataa gaaacctgct tcaagagagc ttaaagttac atttttccct gacttaaatg 6120
gtgatgtggt ggctattgat tataaacact acacaccctc ttttaagaaa ggagctaaat 6180
tgttacataa acctattgtt tggcatgtta acaatgcaac taataaagcc acgtataaac 6240
caaatacctg gtgtatacgt tgtctttgga gcacaaaacc agttgaaaca tcaaattcgt 6300
ttgatgtact gaagtcagag gacgcgcagg gaatggataa tcttgcctgc gaagatctaa 6360
aaccagtctc tgaagaagta gtggaaaatc ctaccataca gaaagacgtt cttgagtgta 6420
atgtgaaaac taccgaagtt gtaggagaca ttatacttaa accagcaaat aatagtttaa 6480
aaattacaga agaggttggc cacacagatc taatggctgc ttatgtagac aattctagtc 6540
ttactattaa gaaacctaat gaattatcta gagtattagg tttgaaaacc cttgctactc 6600
atggtttagc tgctgttaat agtgtccctt gggatactat agctaattat gctaagcctt 6660
ttcttaacaa agttgttagt acaactacta acatagttac acggtgttta aaccgtgttt 6720
gtactaatta tatgccttat ttctttactt tattgctaca attgtgtact tttactagaa 6780
gtacaaattc tagaattaaa gcatctatgc cgactactat agcaaagaat actgttaaga 6840
gtgtcggtaa attttgtcta gaggcttcat ttaattattt gaagtcacct aatttttcta 6900
aactgataaa tattataatt tggtttttac tattaagtgt ttgcctaggt tctttaatct 6960
actcaaccgc tgctttaggt gttttaatgt ctaatttagg catgccttct tactgtactg 7020
gttacagaga aggctatttg aactctacta atgtcactat tgcaacctac tgtactggtt 7080
ctataccttg tagtgtttgt cttagtggtt tagattcttt agacacctat ccttctttag 7140
aaactataca aattaccatt tcatctttta aatgggattt aactgctttt ggcttagttg 7200
cagagtggtt tttggcatat attcttttca ctaggttttt ctatgtactt ggattggctg 7260
caatcatgca attgtttttc agctattttg cagtacattt tattagtaat tcttggctta 7320
tgtggttaat aattaatctt gtacaaatgg ccccgatttc agctatggtt agaatgtaca 7380
tcttctttgc atcattttat tatgtatgga aaagttatgt gcatgttgta gacggttgta 7440
attcatcaac ttgtatgatg tgttacaaac gtaatagagc aacaagagtc gaatgtacaa 7500
ctattgttaa tggtgttaga aggtcctttt atgtctatgc taatggaggt aaaggctttt 7560
gcaaactaca caattggaat tgtgttaatt gtgatacatt ctgtgctggt agtacattta 7620
ttagtgatga agttgcgaga gacttgtcac tacagtttaa aagaccaata aatcctactg 7680
accagtcttc ttacatcgtt gatagtgtta cagtgaagaa tggttccatc catctttact 7740
ttgataaagc tggtcaaaag acttatgaaa gacattctct ctctcatttt gttaacttag 7800
acaacctgag agctaataac actaaaggtt cattgcctat taatgttata gtttttgatg 7860
gtaaatcaaa atgtgaagaa tcatctgcaa aatcagcgtc tgtttactac agtcagctta 7920
tgtgtcaacc tatactgtta ctagatcagg cattagtgtc tgatgttggt gatagtgcgg 7980
aagttgcagt taaaatgttt gatgcttacg ttaatacgtt ttcatcaact tttaacgtac 8040
caatggaaaa actcaaaaca ctagttgcaa ctgcagaagc tgaacttgca aagaatgtgt 8100
ccttagacaa tgtcttatct acttttattt cagcagctcg gcaagggttt gttgattcag 8160
atgtagaaac taaagatgtt gttgaatgtc ttaaattgtc acatcaatct gacatagaag 8220
ttactggcga tagttgtaat aactatatgc tcacctataa caaagttgaa aacatgacac 8280
cccgtgacct tggtgcttgt attgactgta gtgcgcgtca tattaatgcg caggtagcaa 8340
aaagtcacaa cattgctttg atatggaacg ttaaagattt catgtcattg tctgaacaac 8400
tacgaaaaca aatacgtagt gctgctaaaa agaataactt accttttaag ttgacatgtg 8460
caactactag acaagttgtt aatgttgtaa caacaaagat agcacttaag ggtggtaaaa 8520
ttgttaataa ttggttgaag cagttaatta aagttacact tgtgttcctt tttgttgctg 8580
ctattttcta tttaataaca cctgttcatg tcatgtctaa acatactgac ttttcaagtg 8640
aaatcatagg atacaaggct attgatggtg gtgtcactcg tgacatagca tctacagata 8700
cttgttttgc taacaaacat gctgattttg acacatggtt tagccagcgt ggtggtagtt 8760
atactaatga caaagcttgc ccattgattg ctgcagtcat aacaagagaa gtgggttttg 8820
tcgtgcctgg tttgcctggc acgatattac gcacaactaa tggtgacttt ttgcatttct 8880
tacctagagt ttttagtgca gttggtaata tctgttacac accatcaaaa cttatagagt 8940
acactgactt tgcaacatca gcttgtgttt tggctgctga atgtacaatt tttaaagatg 9000
cttctggtaa gccagtacca tattgttatg ataccaatgt actagaaggt tctgttgctt 9060
atgaaagttt acgccctgac acacgttatg tgctcatgga tggctctatt attcaatttc 9120
ctaacaccta ccttgaaggt tctgttagag tggtaacaac ttttgattct gagtactgta 9180
ggcacggcac ttgtgaaaga tcagaagctg gtgtttgtgt atctactagt ggtagatggg 9240
tacttaacaa tgattattac agatctttac caggagtttt ctgtggtgta gatgctgtaa 9300
atttacttac taatatgttt acaccactaa ttcaacctat tggtgctttg gacatatcag 9360
catctatagt agctggtggt attgtagcta tcgtagtaac atgccttgcc tactatttta 9420
tgaggtttag aagagctttt ggtgaataca gtcatgtagt tgcctttaat actttactat 9480
tccttatgtc attcactgta ctctgtttaa caccagttta ctcattctta cctggtgttt 9540
attctgttat ttacttgtac ttgacatttt atcttactaa tgatgtttct tttttagcac 9600
atattcagtg gatggttatg ttcacacctt tagtaccttt ctggataaca attgcttata 9660
tcatttgtat ttccacaaag catttctatt ggttctttag taattaccta aagagacgtg 9720
tagtctttaa tggtgtttcc tttagtactt ttgaagaagc tgcgctgtgc acctttttgt 9780
taaataaaga aatgtatcta aagttgcgta gtgatgtgct attacctctt acgcaatata 9840
atagatactt agctctttat aataagtaca agtattttag tggagcaatg gatacaacta 9900
gctacagaga agctgcttgt tgtcatctcg caaaggctct caatgacttc agtaactcag 9960
gttctgatgt tctttaccaa ccaccacaaa cctctatcac ctcagctgtt ttgcagagtg 10020
gttttagaaa aatggcattc ccatctggta aagttgaggg ttgtatggta caagtaactt 10080
gtggtacaac tacacttaac ggtctttggc ttgatgacgt agtttactgt ccaagacatg 10140
tgatctgcac ctctgaagac atgcttaacc ctaattatga agatttactc attcgtaagt 10200
ctaatcataa tttcttggta caggctggta atgttcaact cagggttatt ggacattcta 10260
tgcaaaattg tgtacttaag cttaaggttg atacagccaa tcctaagaca cctaagtata 10320
agtttgttcg cattcaacca ggacagactt tttcagtgtt agcttgttac aatggttcac 10380
catctggtgt ttaccaatgt gctatgaggc ccaatttcac tattaagggt tcattcctta 10440
atggttcatg tggtagtgtt ggttttaaca tagattatga ctgtgtctct ttttgttaca 10500
tgcaccatat ggaattacca actggagttc atgctggcac agacttagaa ggtaactttt 10560
atggaccttt tgttgacagg caaacagcac aagcagctgg tacggacaca actattacag 10620
ttaatgtttt agcttggttg tacgctgctg ttataaatgg agacaggtgg tttctcaatc 10680
gatttaccac aactcttaat gactttaacc ttgtggctat gaagtacaat tatgaacctc 10740
taacacaaga ccatgttgac atactaggac ctctttctgc tcaaactgga attgccgttt 10800
tagatatgtg tgcttcatta aaagaattac tgcaaaatgg tatgaatgga cgtaccatat 10860
tgggtagtgc tttattagaa gatgaattta caccttttga tgttgttaga caatgctcag 10920
gtgttacttt ccaaagtgca gtgaaaagaa caatcaaggg tacacaccac tggttgttac 10980
tcacaatttt gacttcactt ttagttttag tccagagtac tcaatggtct ttgttctttt 11040
ttttgtatga aaatgccttt ttaccttttg ctatgggtat tattgctatg tctgcttttg 11100
caatgatgtt tgtcaaacat aagcatgcat ttctctgttt gtttttgtta ccttctcttg 11160
ccactgtagc ttattttaat atggtctata tgcctgctag ttgggtgatg cgtattatga 11220
catggttgga tatggttgat actagtttgt ctggttttaa gctaaaagac tgtgttatgt 11280
atgcatcagc tgtagtgtta ctaatcctta tgacagcaag aactgtgtat gatgatggtg 11340
ctaggagagt gtggacactt atgaatgtct tgacactcgt ttataaagtt tattatggta 11400
atgctttaga tcaagccatt tccatgtggg ctcttataat ctctgttact tctaactact 11460
caggtgtagt tacaactgtc atgtttttgg ccagaggtat tgtttttatg tgtgttgagt 11520
attgccctat tttcttcata actggtaata cacttcagtg tataatgcta gtttattgtt 11580
tcttaggcta tttttgtact tgttactttg gcctcttttg tttactcaac cgctacttta 11640
gactgactct tggtgtttat gattacttag tttctacaca ggagtttaga tatatgaatt 11700
cacagggact actcccaccc aagaatagca tagatgcctt caaactcaac attaaattgt 11760
tgggtgttgg tggcaaacct tgtatcaaag tagccactgt acagtctaaa atgtcagatg 11820
taaagtgcac atcagtagtc ttactctcag ttttgcaaca actcagagta gaatcatcat 11880
ctaaattgtg ggctcaatgt gtccagttac acaatgacat tctcttagct aaagatacta 11940
ctgaagcctt tgaaaaaatg gtttcactac tttctgtttt gctttccatg cagggtgctg 12000
tagacataaa caagctttgt gaagaaatgc tggacaacag ggcaacctta caagctatag 12060
cttcagagtt tagttccctt ccatcatatg cagcttttgc tactgctcaa gaagcttatg 12120
agcaggctgt tgctaatggt gattctgaag ttgttcttaa aaagttgaag aagtctttga 12180
atgtggctaa atctgaattt gaccgtgatg cagccatgca acgtaagttg gaaaagatgg 12240
ctgatcaagc tatgacccaa atgtataaac aggctagatc tgaggacaag agggcaaaag 12300
ttactagtgc tatgcagaca atgcttttca ctatgcttag aaagttggat aatgatgcac 12360
tcaacaacat tatcaacaat gcaagagatg gttgtgttcc cttgaacata atacctctta 12420
caacagcagc caaactaatg gttgtcatac cagactataa cacatataaa aatacgtgtg 12480
atggtacaac atttacttat gcatcagcat tgtgggaaat ccaacaggtt gtagatgcag 12540
atagtaaaat tgttcaactt agtgaaatta gtatggacaa ttcacctaat ttagcatggc 12600
ctcttattgt aacagcttta agggccaatt ctgctgtcaa attacagaat aatgagctta 12660
gtcctgttgc actacgacag atgtcttgtg ctgccggtac tacacaaact gcttgcactg 12720
atgacaatgc gttagcttac tacaacacaa caaagggagg taggtttgta cttgcactgt 12780
tatccgattt acaggatttg aaatgggcta gattccctaa gagtgatgga actggtactg 12840
tctatacaga actggaacca ccttgtaggt ttgttacaga cacacctaaa ggtcctaaag 12900
tgaagtattt atactttatt aaaggattaa acaacctaaa tagaggtatg gtacttggta 12960
gtttagctgc cacagtacgt ctacaagctg gtaatgcaac agaagtgcct gccaattcaa 13020
ctgtattatc tttctgtgct tttgctgtag atgctgctaa agcttacaaa gattatctag 13080
ctagtggggg acaaccaatc actaattgtg ttaagatgtt gtgtacacac actggtactg 13140
gtcaggcaat aacagttaca ccggaagcca atatggatca agaatccttt ggtggtgcat 13200
cgtgttgtct gtactgccgt tgccacatag atcatccaaa tcctaaagga ttttgtgact 13260
taaaaggtaa gtatgtacaa atacctacaa cttgtgctaa tgaccctgtg ggttttacac 13320
ttaaaaacac agtctgtacc gtctgcggta tgtggaaagg ttatggctgt agttgtgatc 13380
aactccgcga acccatgctt cagtcagctg atgcacaatc gtttttaaac gggtttgcgg 13440
tgtaagtgca gcccgtctta caccgtgcgg cacaggcact agtactgatg tcgtatacag 13500
ggcttttgac atctacaatg ataaagtagc tggttttgct aaattcctaa aaactaattg 13560
ttgtcgcttc caagaaaagg acgaagatga caatttaatt gattcttact ttgtagttaa 13620
gagacacact ttctctaact accaacatga agaaacaatt tataatttac ttaaggattg 13680
tccagctgtt gctaaacatg acttctttaa gtttagaata gacggtgaca tggtaccaca 13740
tatatcacgt caacgtctta ctaaatacac aatggcagac ctcgtctatg ctttaaggca 13800
ttttgatgaa ggtaattgtg acacattaaa agaaatactt gtcacataca attgttgtga 13860
tgatgattat ttcaataaaa aggactggta tgattttgta gaaaacccag atatattacg 13920
cgtatacgcc aacttaggtg aacgtgtacg ccaagctttg ttaaaaacag tacaattctg 13980
tgatgccatg cgaaatgctg gtattgttgg tgtactgaca ttagataatc aagatctcaa 14040
tggtaactgg tatgatttcg gtgatttcat acaaaccacg ccaggtagtg gagttcctgt 14100
tgtagattct tattattcat tgttaatgcc tatattaacc ttgaccaggg ctttaactgc 14160
agagtcacat gttgacactg acttaacaaa gccttacatt aagtgggatt tgttaaaata 14220
tgacttcacg gaagagaggt taaaactctt tgaccgttat tttaaatatt gggatcagac 14280
ataccaccca aattgtgtta actgtttgga tgacagatgc attctgcatt gtgcaaactt 14340
taatgtttta ttctctacag tgttcccact tacaagtttt ggaccactag tgagaaaaat 14400
atttgttgat ggtgttccat ttgtagtttc aactggatac cacttcagag agctaggtgt 14460
tgtacataat caggatgtaa acttacatag ctctagactt agttttaagg aattacttgt 14520
gtatgctgct gaccctgcta tgcacgctgc ttctggtaat ctattactag ataaacgcac 14580
tacgtgcttt tcagtagctg cacttactaa caatgttgct tttcaaactg tcaaacccgg 14640
taattttaac aaagacttct atgactttgc tgtgtctaag ggtttcttta aggaaggaag 14700
ttctgttgaa ttaaaacact tcttctttgc tcaggatggt aatgctgcta tcagcgatta 14760
tgactactat cgttataatc taccaacaat gtgtgatatc agacaactac tatttgtagt 14820
tgaagttgtt gataagtact ttgattgtta cgatggtggc tgtattaatg ctaaccaagt 14880
catcgtcaac aacctagaca aatcagctgg ttttccattt aataaatggg gtaaggctag 14940
actttattat gattcaatga gttatgagga tcaagatgca cttttcgcat atacaaaacg 15000
taatgtcatc cctactataa ctcaaatgaa tcttaagtat gccattagtg caaagaatag 15060
agctcgcacc gtagctggtg tctctatctg tagtactatg accaatagac agtttcatca 15120
aaaattattg aaatcaatag ccgccactag aggagctact gtagtaattg gaacaagcaa 15180
attctatggt ggttggcaca acatgttaaa aactgtttat agtgatgtag aaaaccctca 15240
ccttatgggt tgggattatc ctaaatgtga tagagccatg cctaacatgc ttagaattat 15300
ggcctcactt gttcttgctc gcaaacatac aacgtgttgt agcttgtcac accgtttcta 15360
tagattagct aatgagtgtg ctcaagtatt gagtgaaatg gtcatgtgtg gcggttcact 15420
atatgttaaa ccaggtggaa cctcatcagg agatgccaca actgcttatg ctaatagtgt 15480
ttttaacatt tgtcaagctg tcacggccaa tgttaatgca cttttatcta ctgatggtaa 15540
caaaattgcc gataagtatg tccgcaattt acaacacaga ctttatgagt gtctctatag 15600
aaatagagat gttgacacag actttgtgaa tgagttttac gcatatttgc gtaaacattt 15660
ctcaatgatg atactctctg acgatgctgt tgtgtgtttc aatagcactt atgcatctca 15720
aggtctagtg gctagcataa agaactttaa gtcagttctt tattatcaaa acaatgtttt 15780
tatgtctgaa gcaaaatgtt ggactgagac tgaccttact aaaggacctc atgaattttg 15840
ctctcaacat acaatgctag ttaaacaggg tgatgattat gtgtaccttc cttacccaga 15900
tccatcaaga atcctagggg ccggctgttt tgtagatgat atcgtaaaaa cagatggtac 15960
acttatgatt gaacggttcg tgtctttagc tatagatgct tacccactta ctaaacatcc 16020
taatcaggag tatgctgatg tctttcattt gtacttacaa tacataagaa agctacatga 16080
tgagttaaca ggacacatgt tagacatgta ttctgttatg cttactaatg ataacacttc 16140
aaggtattgg gaacctgagt tttatgaggc tatgtacaca ccgcatacag tcttacaggc 16200
tgttggggct tgtgttcttt gcaattcaca gacttcatta agatgtggtg cttgcatacg 16260
tagaccattc ttatgttgta aatgctgtta cgaccatgtc atatcaacat cacataaatt 16320
agtcttgtct gttaatccgt atgtttgcaa tgctccaggt tgtgatgtca cagatgtgac 16380
tcaactttac ttaggaggta tgagctatta ttgtaaatca cataaaccac ccattagttt 16440
tccattgtgt gctaatggac aagtttttgg tttatataaa aatacatgtg ttggtagcga 16500
taatgttact gactttaatg caattgcaac atgtgactgg acaaatgctg gtgattacat 16560
tttagctaac acctgtactg aaagactcaa gctttttgca gcagaaacgc tcaaagctac 16620
tgaggagaca tttaaactgt cttatggtat tgctactgta cgtgaagtgc tgtctgacag 16680
agaattacat ctttcatggg aagttggtaa acctagacca ccacttaacc gaaattatgt 16740
ctttactggt tatcgtgtaa ctaaaaacag taaagtacaa ataggagagt acacctttga 16800
aaaaggtgac tatggtgatg ctgttgttta ccgaggtaca acaacttaca aattaaatgt 16860
tggtgattat tttgtgctga catcacatac agtaatgcca ttaagtgcac ctacactagt 16920
gccacaagag cactatgtta gaattactgg cttataccca acactcaata tctcatatga 16980
gttttctagc aatgttgcaa attatcaaaa ggttggtatg caaaagtatt ctacactcca 17040
gggaccacct ggtactggta agagtcattt tgctattggc ctagctctct actacccttc 17100
tgctcgcata gtgtatacag cttgctctca tgccgctgtt gatgcactat gtgagaaggc 17160
attaaaatat ttgcctatag ataaatgtag tagaattata cctgcacgtg ctcgtgtaga 17220
gtgttttgat aaattcaaag tgaattcaac attagaacag tatgtctttt gtactgtaaa 17280
tgcattgcct gagacgacag cagatatagt tgtctttgat gaaatttcaa tggccacaaa 17340
ttatgatttg agtgttgtca atgccagatt acgtgctaag cactatgtgt acattggcga 17400
ccctgctcaa ttacctgcac cacgcacatt gctaactaag ggcacactag aaccagaata 17460
tttcaattca gtgtgtagac ttatgaaaac tataggtcca gacatgttcc tcggaacttg 17520
tcggcgttgt cctgctgaaa ttgttgacac tgtgagtgct ttggtttatg ataataagct 17580
taaagcacat aaagacaaat cagctcaatg ctttaaaatg ttttataagg gtgttatcac 17640
gcatgatgtt tcatctgcaa ttaacaggcc acaaataggc gtggtaagag aattccttac 17700
acgtaaccct gcttggagaa aagctgtctt tatttcacct tataattcac agaatgctgt 17760
agcctcaaag attttgggac taccaactca aactgttgat tcatcacagg gctcagaata 17820
tgactatgtc atattcactc aaaccactga aacagctcac tcttgtaatg taaacagatt 17880
taatgttgct attaccagag caaaagtagg catactttgc ataatgtctg atagagacct 17940
ttatgacaag ttgcaattta caagtcttga aattccacgt aggaatgtgg caactttaca 18000
agctgaaaat gtaacaggac tctttaaaga ttgtagtaag gtaatcactg ggttacatcc 18060
tacacaggca cctacacacc tcagtgttga cactaaattc aaaactgaag gtttatgtgt 18120
tgacatacct ggcataccta aggacatgac ctatagaaga ctcatctcta tgatgggttt 18180
taaaatgaat tatcaagtta atggttaccc taacatgttt atcacccgcg aagaagctat 18240
aagacatgta cgtgcatgga ttggcttcga tgtcgagggg tgtcatgcta ctagagaagc 18300
tgttggtacc aatttacctt tacagctagg tttttctaca ggtgttaacc tagttgctgt 18360
acctacaggt tatgttgata cacctaataa tacagatttt tccagagtta gtgctaaacc 18420
accgcctgga gatcaattta aacacctcat accacttatg tacaaaggac ttccttggaa 18480
tgtagtgcgt ataaagattg tacaaatgtt aagtgacaca cttaaaaatc tctctgacag 18540
agtcgtattt gtcttatggg cacatggctt tgagttgaca tctatgaagt attttgtgaa 18600
aataggacct gagcgcacct gttgtctatg tgatagacgt gccacatgct tttccactgc 18660
ttcagacact tatgcctgtt ggcatcattc tattggattt gattacgtct ataatccgtt 18720
tatgattgat gttcaacaat ggggttttac aggtaaccta caaagcaacc atgatctgta 18780
ttgtcaagtc catggtaatg cacatgtagc tagttgtgat gcaatcatga ctaggtgtct 18840
agctgtccac gagtgctttg ttaagcgtgt tgactggact attgaatatc ctataattgg 18900
tgatgaactg aagattaatg cggcttgtag aaaggttcaa cacatggttg ttaaagctgc 18960
attattagca gacaaattcc cagttcttca cgacattggt aaccctaaag ctattaagtg 19020
tgtacctcaa gctgatgtag aatggaagtt ctatgatgca cagccttgta gtgacaaagc 19080
ttataaaata gaagaattat tctattctta tgccacacat tctgacaaat tcacagatgg 19140
tgtatgccta ttttggaatt gcaatgtcga tagatatcct gctaattcca ttgtttgtag 19200
atttgacact agagtgctat ctaaccttaa cttgcctggt tgtgatggtg gcagtttgta 19260
tgtaaataaa catgcattcc acacaccagc ttttgataaa agtgcttttg ttaatttaaa 19320
acaattacca tttttctatt actctgacag tccatgtgag tctcatggaa aacaagtagt 19380
gtcagatata gattatgtac cactaaagtc tgctacgtgt ataacacgtt gcaatttagg 19440
tggtgctgtc tgtagacatc atgctaatga gtacagattg tatctcgatg cttataacat 19500
gatgatctca gctggcttta gcttgtgggt ttacaaacaa tttgatactt ataacctctg 19560
gaacactttt acaagacttc agagtttaga aaatgtggct tttaatgttg taaataaggg 19620
acactttgat ggacaacagg gtgaagtacc agtttctatc attaataaca ctgtttacac 19680
aaaagttgat ggtgttgatg tagaattgtt tgaaaataaa acaacattac ctgttaatgt 19740
agcatttgag ctttgggcta agcgcaacat taaaccagta ccagaggtga aaatactcaa 19800
taatttgggt gtggacattg ctgctaatac tgtgatctgg gactacaaaa gagatgctcc 19860
agcacatata tctactattg gtgtttgttc tatgactgac atagccaaga aaccaactga 19920
aacgatttgt gcaccactca ctgtcttttt tgatggtaga gttgatggtc aagtagactt 19980
atttagaaat gcccgtaatg gtgttcttat tacagaaggt agtgttaaag gtttacaacc 20040
atctgtaggt cccaaacaag ctagtcttaa tggagtcaca ttaattggag aagccgtaaa 20100
aacacagttc aattattata agaaagttga tggtgttgtc caacaattac ctgaaactta 20160
ctttactcag agtagaaatt tacaagaatt taaacccagg agtcaaatgg aaattgattt 20220
cttagaatta gctatggatg aattcattga acggtataaa ttagaaggct atgccttcga 20280
acatatcgtt tatggagatt ttagtcatag tcagttaggt ggtttacatc tactgattgg 20340
actagctaaa cgttttaagg aatcaccttt tgaattagaa gattttattc ctatggacag 20400
tacagttaaa aactatttca taacagatgc gcaaacaggt tcatctaagt gtgtgtgttc 20460
tgttattgat ttattacttg atgattttgt tgaaataata aaatcccaag atttatctgt 20520
agtttctaag gttgtcaaag tgactattga ctatacagaa atttcattta tgctttggtg 20580
taaagatggc catgtagaaa cattttaccc aaaattacaa tctagtcaag cgtggcaacc 20640
gggtgttgct atgcctaatc tttacaaaat gcaaagaatg ctattagaaa agtgtgacct 20700
tcaaaattat ggtgatagtg caacattacc taaaggcata atgatgaatg tcgcaaaata 20760
tactcaactg tgtcaatatt taaacacatt aacattagct gtaccctata atatgagagt 20820
tatacatttt ggtgctggtt ctgataaagg agttgcacca ggtacagctg ttttaagaca 20880
gtggttgcct acgggtacgc tgcttgtcga ttcagatctt aatgactttg tctctgatgc 20940
agattcaact ttgattggtg attgtgcaac tgtacataca gctaataaat gggatctcat 21000
tattagtgat atgtacgacc ctaagactaa aaatgttaca aaagaaaatg actctaaaga 21060
gggttttttc acttacattt gtgggtttat acaacaaaag ctagctcttg gaggttccgt 21120
ggctataaag ataacagaac attcttggaa tgctgatctt tataagctca tgggacactt 21180
cgcatggtgg acagcctttg ttactaatgt gaatgcgtca tcatctgaag catttttaat 21240
tggatgtaat tatcttggca aaccacgcga acaaatagat ggttatgtca tgcatgcaaa 21300
ttacatattt tggaggaata caaatccaat tcagttgtct tcctattctt tatttgacat 21360
gagtaaattt ccccttaaat taaggggtac tgctgttatg tctttaaaag aaggtcaaat 21420
caatgatatg attttatctc ttcttagtaa aggtagactt ataattagag aaaacaacag 21480
agttgttatt tctagtgatg ttcttgttaa caactaaacg aacaatgttt gtttttcttg 21540
ttttattgcc actagtctct attcagtgtg ttaatcttac aaccagaact caattacccc 21600
ctgcatacac taattctttc acacgtggtg tttattaccc tgacaaagtt ttcagatcct 21660
cagttttaca ttcaactcag gacttgttct tacctttctt ttccaatgtt acttggttcc 21720
atgctataca tgtctctggg accaatggta ctaagaggtt tgataaccct gtcctaccat 21780
ttaatgatgg tgtttatttt gcttccactg agaagtctaa cataataaga ggctggattt 21840
ttggtactac tttagattcg aagacccagt ccctacttat tgttaataac gctactaatg 21900
ttgttattaa agtctgtgaa tttcaatttt gtaatgatcc atttttgggt gtttattacc 21960
acaaaaacaa caaaagttgt atggaaagtg agttcagagt ttattctagt gcgaataatt 22020
gcacttttga atatgtctct cagccttttc ttatggacct tgaaggaaaa cagggtaatt 22080
tcaaaaatct tagggaattt gtgtttaaga atattgatgg ttattttaaa atatattcta 22140
agcacacgcc tattaattta gtgcgtgatc tccctcaggg tttttcggct ttagaaccat 22200
tggtagattt gccaataggt attaacatca ctaggtttca aactttactt gctttacata 22260
gaagttattt gactcctggt gattcttctt caggttggac agctggtgct gcagcttatt 22320
atgtgggtta tcttcaacct aggacttttc tattaaaata taatgaaaat ggaaccatta 22380
cagatgctgt agactgtgca cttgaccctc tctcagaaac aaagtgtacg ttgaaatcct 22440
tcactgtaga aaaaggaatc tatcaaactt ctaactttag agtccaacca acagaatcta 22500
ttgttagatt tcctaatatt acaaacttgt gcccttttgg tgaagttttt aacgccacca 22560
gatttgcatc tgtttatgct tggaacagga agagaatcag caactgtgtt gctgattatt 22620
ctgtcctata taattccgca tcattttcca cttttaagtg ttatggagtg tctcctacta 22680
aattaaatga tctctgcttt actaatgtct atgcagattc atttgtaatt agaggtgatg 22740
aagtcagaca aatcgctcca gggcaaactg gaaagattgc tgattataat tataaattac 22800
cagatgattt tacaggctgc gttatagctt ggaattctaa caatcttgat tctaaggttg 22860
gtggtaatta taattaccgg tatagattgt ttaggaagtc taatctcaaa ccttttgaga 22920
gagatatttc aactgaaatc tatcaggccg gtagcacacc ttgtaatggt gttgaaggtt 22980
ttaattgtta ctttccttta caatcatatg gtttccaacc cactaatggt gttggttacc 23040
aaccatacag agtagtagta ctttcttttg aacttctaca tgcaccagca actgtttgtg 23100
gacctaaaaa gtctactaat ttggttaaaa acaaatgtgt caatttcaac ttcaatggtt 23160
taacaggcac aggtgttctt actgagtcta acaaaaagtt tctgcctttc caacaatttg 23220
gcagagacat tgctgacact actgatgctg tccgtgatcc acagacactt gagattcttg 23280
acattacacc atgttctttt ggtggtgtca gtgttataac accaggaaca aatacttcta 23340
accaggttgc tgttctttat cagggtgtta actgcacaga agtccctgtt gctattcatg 23400
cagatcaact tactcctact tggcgtgttt attctacagg ttctaatgtt tttcaaacac 23460
gtgcaggctg tttaataggg gctgaacatg tcaacaactc atatgagtgt gacataccca 23520
ttggtgcagg tatatgcgct agttatcaga ctcagactaa ttctcctcgg cgggcacgta 23580
gtgtagctag tcaatccatc attgcctaca ctatgtcact tggtgcagaa aattcagttg 23640
cttactctaa taactctatt gccataccca caaattttac tattagtgtt accacagaaa 23700
ttctaccagt gtctatgacc aagacatcag tagattgtac aatgtacatt tgtggtgatt 23760
caactgaatg cagcaatctt ttgttgcaat atggcagttt ttgtacacaa ttaaaccgtg 23820
ctttaactgg aatagctgtt gaacaagaca aaaacaccca agaagttttt gcacaagtca 23880
aacaaattta caaaacacca ccaattaaag attttggtgg ttttaatttt tcacaaatat 23940
taccagatcc atcaaaacca agcaagaggt catttattga agatctactt ttcaacaaag 24000
tgacacttgc agatgctggc ttcatcaaac aatatggtga ttgccttggt gatattgctg 24060
ctagagacct catttgtgca caaaagttta acggccttac tgttttgcca cctttgctca 24120
cagatgaaat gattgctcaa tacacttctg cactgttagc gggtacaatc acttctggtt 24180
ggacctttgg tgcaggtgct gcattacaaa taccatttgc tatgcaaatg gcttataggt 24240
ttaatggtat tggagttaca cagaatgttc tctatgagaa ccaaaaattg attgccaacc 24300
aatttaatag cgctattggc aaaattcaag actcactttc ttccacagca agtgcacttg 24360
gaaaacttca agatgtggtc aaccaaaatg cacaagcttt aaacacgctt gttaaacaac 24420
ttagctccaa ttttggtgca atttcaagtg ttttaaatga tatcctttca cgtcttgaca 24480
aagttgaggc tgaagtgcaa attgataggt tgatcacagg cagacttcaa agtttgcaga 24540
catatgtgac tcaacaatta attagagctg cagaaatcag agcttctgct aatcttgctg 24600
ctactaaaat gtcagagtgt gtacttggac aatcaaaaag agttgatttt tgtggaaagg 24660
gctatcatct tatgtccttc cctcagtcag cacctcatgg tgtagtcttc ttgcatgtga 24720
cttatgtccc tgcacaagaa aagaacttca caactgctcc tgccatttgt catgatggaa 24780
aagcacactt tcctcgtgaa ggtgtctttg tttcaaatgg cacacactgg tttgtaacac 24840
aaaggaattt ttatgaacca caaatcatta ctacagacaa cacatttgtg tctggtaact 24900
gtgatgttgt aataggaatt gtcaacaaca cagtttatga tcctttgcaa cctgaattag 24960
actcattcaa ggaggagtta gataaatatt ttaagaatca tacatcacca gatgttgatt 25020
taggtgacat ctctggcatt aatgcttcag ttgtaaacat tcaaaaagaa attgaccgcc 25080
tcaatgaggt tgccaagaat ttaaatgaat ctctcatcga tctccaagaa cttggaaagt 25140
atgagcagta tataaaatgg ccatggtaca tttggctagg ttttatagct ggcttgattg 25200
ccatagtaat ggtgacaatt atgctttgct gtatgaccag ttgctgtagt tgtctcaagg 25260
gctgttgttc ttgtggatcc tgctgcaaat ttgatgaaga cgactctgag ccagtgctca 25320
aaggagtcaa attacattac acataaacga acttatggat ttgtttatga gaatcttcac 25380
aattggaact gtaactttga agcaaggtga aatcaaggat gctactcctt cagattttgt 25440
tcgcgctact gcaacgatac cgatacaagc ctcactccct ttcggatggc ttattgttgg 25500
cgttgcactt cttgctgttt ttcatagcgc ttccaaaatc ataaccctca aaaagagatg 25560
gcaactagca ctctccaagg gtgttcactt tgtttgcaac ttgctgttgt tgtttgtaac 25620
agtttactca caccttttgc tcgttgctgt tggccttgaa gccccttttc tctatcttta 25680
tgctttagtc tacttcttgc agagtataaa ctttgtaaga ataataatga ggctttggct 25740
ttgctggaaa tgccgttcca aaaacccatt actttatgat gccaactatt ttctttgctg 25800
gcatactaat tgttacgact attgtatacc ttacaatagt gtaacttctt caattgtcat 25860
tacttcaggt gatggcacaa caagtcctat ttctgaacat gactaccaga ttggtggtta 25920
tactgaaaaa tgggaatctg gagtaaaaga ctgtgttgta ttacacagtt acttcacttc 25980
agactattac cagctgtact caactcaatt gagtacagac actggtgttg aacatgttac 26040
cttcttcatc tacaataaaa ttgttgatga gcctgaagaa catgtccaaa ttcacacaat 26100
cgacggttca tccggagttg ttaatccagt aatggaacca atttatgatg aaccgacgac 26160
gactactagc gtgcctttgt aagcacaagc tgatgagtac gaacttatgt actcattcgt 26220
ttcggaagag acaggtacgt taatagttaa tagcgtactt ctttttcttg ctttcgtggt 26280
attcttgcta gttacactag ccatccttac tgcgcttcga ttgtgtgcgt actgctgcaa 26340
tattgttaac gtgagtcttg taaaaccttc tttttacgtt tactctcgtg ttaaaaatct 26400
gaattcttct agagttcctg atcttctggt ctaaacgaac taaatattat attagttttt 26460
ctgtttggaa ctttaatttt agccatggca gattccaacg gtactattac cgttgaagag 26520
cttaaaaagc tccttgaaca atggaaccta gtaataggtt tcctattcct tacatggatt 26580
tgtcttctac aatttgccta tgccaacagg aataggtttt tgtatataat taagttaatt 26640
tttctctggc tgttatggcc agtaacttta gcttgttttg tgcttgctgc tgtttacaga 26700
ataaattgga tcaccggtgg aattgctatc gcaatggctt gtcttgtagg cttgatgtgg 26760
ctcagctact tcattgcttc tttcagactg tttgcgcgta cgcgttccat gtggtcattc 26820
aatccagaaa ctaacattct tctcaacgtg ccactccatg gcactattct gaccagaccg 26880
cttctagaaa gtgaactcgt aatcggagct gtgatccttc gtggacatct tcgtattgct 26940
ggacaccatc taggacgctg tgacatcaag gacctgccta aagaaatcac tgttgctaca 27000
tcacgaacgc tttcttatta caaattggga gcttcgcagc gtgtagcagg tgactcaggt 27060
tttgctgcat acagtcgcta caggattggc aactataaat taaacacaga ccattccagt 27120
agcagtgaca atattgcttt gcttgtacag taagtgacaa cagatgtttc atctcgttga 27180
ctttcaggtt actatagcag agatattact aattattatg aggactttta aagtttccat 27240
ttggaatctt gattacatca taaacctcat aattaaaaat ttatctaagt cactaactga 27300
gaataaatat tctcaattag atgaagagca accaatggag attgattaaa cgaacatgaa 27360
aattattctt ttcttggcac tgataacact cgctacttgt gagctttatc actaccaaga 27420
gtgtgttaga ggtacaacag tacttttaaa agaaccttgc tcttctggaa catacgaggg 27480
caattcacca tttcatcctc tagctgataa caaatttgca ctgacttgct ttagcactca 27540
atttgctttt gcttgtcctg acggcgtaaa acacgtctat cagttacgtg ccagatcagt 27600
ttcacctaaa ctgttcatca gacaagagga agttcaagaa ctttactctc caatttttct 27660
tattgttgcg gcaatagtgt ttataacact ttgcttcaca ctcaaaagaa agacagaatg 27720
attgaacttt cattaattga cttctatttg tgctttttag cctttctgct attccttgtt 27780
ttaattatgc ttattatctt ttggttctca cttgaactgc aagatcataa tgaaacttgt 27840
cacgcctaaa ctaacatgaa atttcttgtt ttcttaggaa tcatcacaac tgtagctgca 27900
tttcaccaag aatgtagttt acagtcatgt actcaacatc aaccatatgt agttgatgac 27960
ccgtgtccta ttcacttcta ttctaaatgg tatattagag taggagctag aaaatcagca 28020
cctttaattg aattgtgcgt ggatgaggct ggttctaaat cacccattca gtacatcgat 28080
atcggtaatt atacagtttc ctgtttacct tttacaatta attgccagga acctaaattg 28140
ggtagtcttg tagtgcgttg ttcgttctat gaagactttt tagagtatca tgacgttcgt 28200
gttgttttag atttcatcta aacgaacaaa ctataatgtc tgataatgga ccccaaaatc 28260
agcgaaatgc accccgcatt acgtttggtg gaccctcaga ttcaactggc agtaaccaga 28320
atggagaacg cagtggggcg cgatcaaaac aacgtcggcc ccaaggttta cccaataata 28380
ctgcgtcttg gttcaccgct ctcactcaac atggcaagga agaccttaaa ttccctcgag 28440
gacaaggcgt tccaattaac accaatagca gtccagatga ccaaattggc tactaccgaa 28500
gagctaccag acgaattcgt ggtggtgacg gtaaaatgaa agatctcagt ccaagatggt 28560
atttctacta cctaggaact gggccagaag ctggacttcc ctatggtgct aacaaagacg 28620
gcatcatatg ggttgcaact gagggagcct tgaatacacc aaaagatcac attggcaccc 28680
gcaatcctgc taacaatgct gcaatcgtgc tacaacttcc tcaaggaaca acattgccaa 28740
aaggcttcta cgcagaaggg agcagaggcg gcagtcaagc ctcttctcgt tcctcatcac 28800
gtagtcgcaa cagttcaaga aattcaactc caggcagcag taggggaatt tctcctgcta 28860
gaatggctgg caatggcggt gatgctgctc ttgctttgct gctgcttgac agattgaacc 28920
agcttgagag caaaatgtct ggtaaaggcc aacaacaaca aggccaaact gtcactaaga 28980
aatctgctgc tgaggcttct aagaagcctc ggcaaaaacg tactgccact aaagcataca 29040
atgtaacaca agctttcggc agacgtggtc cagaacaaac ccaaggaaat tttggggacc 29100
aggaactaat cagacaagga actgattaca aacattggcc gcaaattgca caatttgccc 29160
ccagcgcttc agcgttcttc ggaatgtcgc gcattggcat ggaagtcaca ccttcgggaa 29220
cgtggttgac ctacacaggt gccatcaaat tggatgacaa agatccaaat ttcaaagatc 29280
aagtcatttt gctgaataag catattgacg catacaaaac atttccacca acagagccta 29340
aaaaggacaa aaagaagaag gctgatgaaa ctcaagcctt accgcagaga cagaagaaac 29400
agcaaactgt gactcttctt cctgctgcag atttggatga tttctccaaa caattgcaac 29460
aatccatgag cagtgctgac tcaactcagg cctaaactca tgcagaccac acaaggcaga 29520
tgggctatat aaacgttttc gcttttccgt ttacgatata tagtctactc ttgtgcagaa 29580
tgaattctcg taactacata gcacaagtag atgtagttaa ctttaatctc acatagcaat 29640
ctttaatcag tgtgtaacat tagggaggac ttgaaagagc caccacattt tcaccgaggc 29700
cacgcggagt acgatcgagt gtacagtgaa caatgctagg gagagctgcc tatatggaag 29760
agccctaatg tgtaaaatta attttagtag tgctatcccc atgtgatttt aatagc 29816
<210> 27
<211> 1273
<212> PRT
<213> SARS-CoV2
<400> 27
Met Phe Val Phe Leu Val Leu Leu Pro Leu Val Ser Ile Gln Cys Val
1 5 10 15
Asn Leu Thr Thr Arg Thr Gln Leu Pro Pro Ala Tyr Thr Asn Ser Phe
20 25 30
Thr Arg Gly Val Tyr Tyr Pro Asp Lys Val Phe Arg Ser Ser Val Leu
35 40 45
His Ser Thr Gln Asp Leu Phe Leu Pro Phe Phe Ser Asn Val Thr Trp
50 55 60
Phe His Ala Ile His Val Ser Gly Thr Asn Gly Thr Lys Arg Phe Asp
65 70 75 80
Asn Pro Val Leu Pro Phe Asn Asp Gly Val Tyr Phe Ala Ser Thr Glu
85 90 95
Lys Ser Asn Ile Ile Arg Gly Trp Ile Phe Gly Thr Thr Leu Asp Ser
100 105 110
Lys Thr Gln Ser Leu Leu Ile Val Asn Asn Ala Thr Asn Val Val Ile
115 120 125
Lys Val Cys Glu Phe Gln Phe Cys Asn Asp Pro Phe Leu Gly Val Tyr
130 135 140
Tyr His Lys Asn Asn Lys Ser Cys Met Glu Ser Glu Phe Arg Val Tyr
145 150 155 160
Ser Ser Ala Asn Asn Cys Thr Phe Glu Tyr Val Ser Gln Pro Phe Leu
165 170 175
Met Asp Leu Glu Gly Lys Gln Gly Asn Phe Lys Asn Leu Arg Glu Phe
180 185 190
Val Phe Lys Asn Ile Asp Gly Tyr Phe Lys Ile Tyr Ser Lys His Thr
195 200 205
Pro Ile Asn Leu Val Arg Asp Leu Pro Gln Gly Phe Ser Ala Leu Glu
210 215 220
Pro Leu Val Asp Leu Pro Ile Gly Ile Asn Ile Thr Arg Phe Gln Thr
225 230 235 240
Leu Leu Ala Leu His Arg Ser Tyr Leu Thr Pro Gly Asp Ser Ser Ser
245 250 255
Gly Trp Thr Ala Gly Ala Ala Ala Tyr Tyr Val Gly Tyr Leu Gln Pro
260 265 270
Arg Thr Phe Leu Leu Lys Tyr Asn Glu Asn Gly Thr Ile Thr Asp Ala
275 280 285
Val Asp Cys Ala Leu Asp Pro Leu Ser Glu Thr Lys Cys Thr Leu Lys
290 295 300
Ser Phe Thr Val Glu Lys Gly Ile Tyr Gln Thr Ser Asn Phe Arg Val
305 310 315 320
Gln Pro Thr Glu Ser Ile Val Arg Phe Pro Asn Ile Thr Asn Leu Cys
325 330 335
Pro Phe Gly Glu Val Phe Asn Ala Thr Arg Phe Ala Ser Val Tyr Ala
340 345 350
Trp Asn Arg Lys Arg Ile Ser Asn Cys Val Ala Asp Tyr Ser Val Leu
355 360 365
Tyr Asn Ser Ala Ser Phe Ser Thr Phe Lys Cys Tyr Gly Val Ser Pro
370 375 380
Thr Lys Leu Asn Asp Leu Cys Phe Thr Asn Val Tyr Ala Asp Ser Phe
385 390 395 400
Val Ile Arg Gly Asp Glu Val Arg Gln Ile Ala Pro Gly Gln Thr Gly
405 410 415
Lys Ile Ala Asp Tyr Asn Tyr Lys Leu Pro Asp Asp Phe Thr Gly Cys
420 425 430
Val Ile Ala Trp Asn Ser Asn Asn Leu Asp Ser Lys Val Gly Gly Asn
435 440 445
Tyr Asn Tyr Arg Tyr Arg Leu Phe Arg Lys Ser Asn Leu Lys Pro Phe
450 455 460
Glu Arg Asp Ile Ser Thr Glu Ile Tyr Gln Ala Gly Ser Thr Pro Cys
465 470 475 480
Asn Gly Val Glu Gly Phe Asn Cys Tyr Phe Pro Leu Gln Ser Tyr Gly
485 490 495
Phe Gln Pro Thr Asn Gly Val Gly Tyr Gln Pro Tyr Arg Val Val Val
500 505 510
Leu Ser Phe Glu Leu Leu His Ala Pro Ala Thr Val Cys Gly Pro Lys
515 520 525
Lys Ser Thr Asn Leu Val Lys Asn Lys Cys Val Asn Phe Asn Phe Asn
530 535 540
Gly Leu Thr Gly Thr Gly Val Leu Thr Glu Ser Asn Lys Lys Phe Leu
545 550 555 560
Pro Phe Gln Gln Phe Gly Arg Asp Ile Ala Asp Thr Thr Asp Ala Val
565 570 575
Arg Asp Pro Gln Thr Leu Glu Ile Leu Asp Ile Thr Pro Cys Ser Phe
580 585 590
Gly Gly Val Ser Val Ile Thr Pro Gly Thr Asn Thr Ser Asn Gln Val
595 600 605
Ala Val Leu Tyr Gln Gly Val Asn Cys Thr Glu Val Pro Val Ala Ile
610 615 620
His Ala Asp Gln Leu Thr Pro Thr Trp Arg Val Tyr Ser Thr Gly Ser
625 630 635 640
Asn Val Phe Gln Thr Arg Ala Gly Cys Leu Ile Gly Ala Glu His Val
645 650 655
Asn Asn Ser Tyr Glu Cys Asp Ile Pro Ile Gly Ala Gly Ile Cys Ala
660 665 670
Ser Tyr Gln Thr Gln Thr Asn Ser Pro Arg Arg Ala Arg Ser Val Ala
675 680 685
Ser Gln Ser Ile Ile Ala Tyr Thr Met Ser Leu Gly Ala Glu Asn Ser
690 695 700
Val Ala Tyr Ser Asn Asn Ser Ile Ala Ile Pro Thr Asn Phe Thr Ile
705 710 715 720
Ser Val Thr Thr Glu Ile Leu Pro Val Ser Met Thr Lys Thr Ser Val
725 730 735
Asp Cys Thr Met Tyr Ile Cys Gly Asp Ser Thr Glu Cys Ser Asn Leu
740 745 750
Leu Leu Gln Tyr Gly Ser Phe Cys Thr Gln Leu Asn Arg Ala Leu Thr
755 760 765
Gly Ile Ala Val Glu Gln Asp Lys Asn Thr Gln Glu Val Phe Ala Gln
770 775 780
Val Lys Gln Ile Tyr Lys Thr Pro Pro Ile Lys Asp Phe Gly Gly Phe
785 790 795 800
Asn Phe Ser Gln Ile Leu Pro Asp Pro Ser Lys Pro Ser Lys Arg Ser
805 810 815
Phe Ile Glu Asp Leu Leu Phe Asn Lys Val Thr Leu Ala Asp Ala Gly
820 825 830
Phe Ile Lys Gln Tyr Gly Asp Cys Leu Gly Asp Ile Ala Ala Arg Asp
835 840 845
Leu Ile Cys Ala Gln Lys Phe Asn Gly Leu Thr Val Leu Pro Pro Leu
850 855 860
Leu Thr Asp Glu Met Ile Ala Gln Tyr Thr Ser Ala Leu Leu Ala Gly
865 870 875 880
Thr Ile Thr Ser Gly Trp Thr Phe Gly Ala Gly Ala Ala Leu Gln Ile
885 890 895
Pro Phe Ala Met Gln Met Ala Tyr Arg Phe Asn Gly Ile Gly Val Thr
900 905 910
Gln Asn Val Leu Tyr Glu Asn Gln Lys Leu Ile Ala Asn Gln Phe Asn
915 920 925
Ser Ala Ile Gly Lys Ile Gln Asp Ser Leu Ser Ser Thr Ala Ser Ala
930 935 940
Leu Gly Lys Leu Gln Asp Val Val Asn Gln Asn Ala Gln Ala Leu Asn
945 950 955 960
Thr Leu Val Lys Gln Leu Ser Ser Asn Phe Gly Ala Ile Ser Ser Val
965 970 975
Leu Asn Asp Ile Leu Ser Arg Leu Asp Lys Val Glu Ala Glu Val Gln
980 985 990
Ile Asp Arg Leu Ile Thr Gly Arg Leu Gln Ser Leu Gln Thr Tyr Val
995 1000 1005
Thr Gln Gln Leu Ile Arg Ala Ala Glu Ile Arg Ala Ser Ala Asn Leu
1010 1015 1020
Ala Ala Thr Lys Met Ser Glu Cys Val Leu Gly Gln Ser Lys Arg Val
1025 1030 1035 1040
Asp Phe Cys Gly Lys Gly Tyr His Leu Met Ser Phe Pro Gln Ser Ala
1045 1050 1055
Pro His Gly Val Val Phe Leu His Val Thr Tyr Val Pro Ala Gln Glu
1060 1065 1070
Lys Asn Phe Thr Thr Ala Pro Ala Ile Cys His Asp Gly Lys Ala His
1075 1080 1085
Phe Pro Arg Glu Gly Val Phe Val Ser Asn Gly Thr His Trp Phe Val
1090 1095 1100
Thr Gln Arg Asn Phe Tyr Glu Pro Gln Ile Ile Thr Thr Asp Asn Thr
1105 1110 1115 1120
Phe Val Ser Gly Asn Cys Asp Val Val Ile Gly Ile Val Asn Asn Thr
1125 1130 1135
Val Tyr Asp Pro Leu Gln Pro Glu Leu Asp Ser Phe Lys Glu Glu Leu
1140 1145 1150
Asp Lys Tyr Phe Lys Asn His Thr Ser Pro Asp Val Asp Leu Gly Asp
1155 1160 1165
Ile Ser Gly Ile Asn Ala Ser Val Val Asn Ile Gln Lys Glu Ile Asp
1170 1175 1180
Arg Leu Asn Glu Val Ala Lys Asn Leu Asn Glu Ser Leu Ile Asp Leu
1185 1190 1195 1200
Gln Glu Leu Gly Lys Tyr Glu Gln Tyr Ile Lys Trp Pro Trp Tyr Ile
1205 1210 1215
Trp Leu Gly Phe Ile Ala Gly Leu Ile Ala Ile Val Met Val Thr Ile
1220 1225 1230
Met Leu Cys Cys Met Thr Ser Cys Cys Ser Cys Leu Lys Gly Cys Cys
1235 1240 1245
Ser Cys Gly Ser Cys Cys Lys Phe Asp Glu Asp Asp Ser Glu Pro Val
1250 1255 1260
Leu Lys Gly Val Lys Leu His Tyr Thr
1265 1270
<210> 28
<211> 419
<212> PRT
<213> SARS-CoV2
<400> 28
Met Ser Asp Asn Gly Pro Gln Asn Gln Arg Asn Ala Pro Arg Ile Thr
1 5 10 15
Phe Gly Gly Pro Ser Asp Ser Thr Gly Ser Asn Gln Asn Gly Glu Arg
20 25 30
Ser Gly Ala Arg Ser Lys Gln Arg Arg Pro Gln Gly Leu Pro Asn Asn
35 40 45
Thr Ala Ser Trp Phe Thr Ala Leu Thr Gln His Gly Lys Glu Asp Leu
50 55 60
Lys Phe Pro Arg Gly Gln Gly Val Pro Ile Asn Thr Asn Ser Ser Pro
65 70 75 80
Asp Asp Gln Ile Gly Tyr Tyr Arg Arg Ala Thr Arg Arg Ile Arg Gly
85 90 95
Gly Asp Gly Lys Met Lys Asp Leu Ser Pro Arg Trp Tyr Phe Tyr Tyr
100 105 110
Leu Gly Thr Gly Pro Glu Ala Gly Leu Pro Tyr Gly Ala Asn Lys Asp
115 120 125
Gly Ile Ile Trp Val Ala Thr Glu Gly Ala Leu Asn Thr Pro Lys Asp
130 135 140
His Ile Gly Thr Arg Asn Pro Ala Asn Asn Ala Ala Ile Val Leu Gln
145 150 155 160
Leu Pro Gln Gly Thr Thr Leu Pro Lys Gly Phe Tyr Ala Glu Gly Ser
165 170 175
Arg Gly Gly Ser Gln Ala Ser Ser Arg Ser Ser Ser Arg Ser Arg Asn
180 185 190
Ser Ser Arg Asn Ser Thr Pro Gly Ser Ser Arg Gly Thr Ser Pro Ala
195 200 205
Arg Met Ala Gly Asn Gly Gly Asp Ala Ala Leu Ala Leu Leu Leu Leu
210 215 220
Asp Arg Leu Asn Gln Leu Glu Ser Lys Met Ser Gly Lys Gly Gln Gln
225 230 235 240
Gln Gln Gly Gln Thr Val Thr Lys Lys Ser Ala Ala Glu Ala Ser Lys
245 250 255
Lys Pro Arg Gln Lys Arg Thr Ala Thr Lys Ala Tyr Asn Val Thr Gln
260 265 270
Ala Phe Gly Arg Arg Gly Pro Glu Gln Thr Gln Gly Asn Phe Gly Asp
275 280 285
Gln Glu Leu Ile Arg Gln Gly Thr Asp Tyr Lys His Trp Pro Gln Ile
290 295 300
Ala Gln Phe Ala Pro Ser Ala Ser Ala Phe Phe Gly Met Ser Arg Ile
305 310 315 320
Gly Met Glu Val Thr Pro Ser Gly Thr Trp Leu Thr Tyr Thr Gly Ala
325 330 335
Ile Lys Leu Asp Asp Lys Asp Pro Asn Phe Lys Asp Gln Val Ile Leu
340 345 350
Leu Asn Lys His Ile Asp Ala Tyr Lys Thr Phe Pro Pro Thr Glu Pro
355 360 365
Lys Lys Asp Lys Lys Lys Lys Ala Asp Glu Thr Gln Ala Leu Pro Gln
370 375 380
Arg Gln Lys Lys Gln Gln Thr Val Thr Leu Leu Pro Ala Ala Asp Leu
385 390 395 400
Asp Asp Phe Ser Lys Gln Leu Gln Gln Ser Met Ser Ser Ala Asp Ser
405 410 415
Thr Gln Ala
<210> 29
<211> 222
<212> PRT
<213> SARS-CoV2
<400> 29
Met Ala Asp Ser Asn Gly Thr Ile Thr Val Glu Glu Leu Lys Lys Leu
1 5 10 15
Leu Glu Gln Trp Asn Leu Val Ile Gly Phe Leu Phe Leu Thr Trp Ile
20 25 30
Cys Leu Leu Gln Phe Ala Tyr Ala Asn Arg Asn Arg Phe Leu Tyr Ile
35 40 45
Ile Lys Leu Ile Phe Leu Trp Leu Leu Trp Pro Val Thr Leu Ala Cys
50 55 60
Phe Val Leu Ala Ala Val Tyr Arg Ile Asn Trp Ile Thr Gly Gly Ile
65 70 75 80
Ala Ile Ala Met Ala Cys Leu Val Gly Leu Met Trp Leu Ser Tyr Phe
85 90 95
Ile Ala Ser Phe Arg Leu Phe Ala Arg Thr Arg Ser Met Trp Ser Phe
100 105 110
Asn Pro Glu Thr Asn Ile Leu Leu Asn Val Pro Leu His Gly Thr Ile
115 120 125
Leu Thr Arg Pro Leu Leu Glu Ser Glu Leu Val Ile Gly Ala Val Ile
130 135 140
Leu Arg Gly His Leu Arg Ile Ala Gly His His Leu Gly Arg Cys Asp
145 150 155 160
Ile Lys Asp Leu Pro Lys Glu Ile Thr Val Ala Thr Ser Arg Thr Leu
165 170 175
Ser Tyr Tyr Lys Leu Gly Ala Ser Gln Arg Val Ala Gly Asp Ser Gly
180 185 190
Phe Ala Ala Tyr Ser Arg Tyr Arg Ile Gly Asn Tyr Lys Leu Asn Thr
195 200 205
Asp His Ser Ser Ser Ser Asp Asn Ile Ala Leu Leu Val Gln
210 215 220
Claims (64)
- 베타-프로피오락톤-불활성화된 SARS-CoV-2 입자를 포함하는 SARS-CoV-2 백신으로서, 여기서 백신은 인간 대상체에서 천연 SARS-CoV-2 입자에 대한 중화 항체를 생성할 수 있는, SARS-CoV-2 백신.
- 제1항에 있어서, SARS-CoV-2 입자의 천연 표면 형태는 백신에서 보존되는, SARS-CoV-2 백신.
- 제1항 또는 제2항에 있어서, 불활성화된 SARS-CoV-2 입자에서 바이러스 RNA는 복제-결핍이고, 바람직하게는 불활성화된 SARS-CoV-2 입자에서 바이러스 RNA는 (i) 알킬화 및/또는 아실화되고 (ii) 하나 이상의 변형된 퓨린(바람직하게는 구아닌) 잔기 및/또는 가닥 절단을 포함하고/하거나 (iii) 하나 이상의 바이러스 단백질과 가교-결합되는, SARS-CoV-2 백신.
- 제1항 내지 제3항 중 어느 한 항에 있어서, SARS-CoV-2 입자는 300 내지 700ppm, 보다 바람직하게는 500ppm의 농도에서 베타-프로피오락톤-불활성화되고 약 1 내지 48시간, 바람직하게는 20 내지 28시간, 가장 바람직하게는 24시간 ± 2시간(예컨대 또한 ± 1시간 또는 ± 0.5시간) 동안 2℃ 내지 8℃에서 불활성화되고, 선택적으로 35℃ 내지 39℃, 바람직하게는 약 37℃에서 2.5시간 ± 0.5시간 동안 가수분해가 이어지는, SARS-CoV-2 백신.
- 제1항 내지 제4항 중 어느 한 항에 있어서, 자외선(UV)-불활성화된 SARS-CoV-2 입자를 추가로 포함하는, SARS-CoV-2 백신.
- 제1항 내지 제5항 중 어느 한 항에 있어서, 불활성화된 SARS-CoV-2 입자에서 표면 단백질은 불활성화된 SARS-CoV-2 입자에서 바이러스 RNA와 비교하여 감소된 변형을 포함하고, 바람직하게는 표면 단백질은 불활성화된 SARS-CoV-2 입자에서 바이러스 RNA와 비교하여 변형된 잔기의 감소된 비율을 포함하며; 상기 변형은 천연 SARS-CoV-2 입자에 관한 것이며, 바람직하게 상기 변형은 알킬화 및/또는 아실화된 뉴클레오티드 또는 아미노산 잔기를 포함하는, SARS-CoV-2 백신.
- 제1항 내지 제6항 중 어느 한 항에 있어서, 불활성화된 SARS-CoV-2 입자는 (i) 스파이크(S) 단백질; (ii) 뉴클레오캡시드(N) 단백질; (iii) 막(M) 당단백질; 및/또는 (iv) 엔벨로프(E) 단백질을 포함하며; 바람직하게는 불활성화된 SARS-CoV-2 입자는 천연 형태 스파이크(S) 단백질을 포함하는, SARS-CoV-2 백신.
- 제1항 내지 제7항 중 어느 한 항에 있어서, 불활성화된 SARS-CoV-2 입자는 하나 이상의 베타-프로피오락톤-변형된 시스테인, 메티오닌 및/또는 히스티딘 잔기를 포함하는, SARS-CoV-2 백신.
- 제1항 내지 제8항 중 어느 한 항에 있어서, 불활성화된 SARS-CoV-2 입자는 200, 100, 50, 30, 20, 15, 10, 9, 8, 7 또는 6개 미만의 베타-프로피오락톤-변형된 아미노산 잔기를 포함하고; 바람직하게는 불활성화된 SARS-CoV-2 입자의 스파이크(S) 단백질은 100, 50, 30, 20, 15, 10, 9, 8, 7 또는 6개 미만의 베타-프로피오락톤-변형된 아미노산 잔기를 포함하고; 보다 바람직하게는 불활성화된 SARS-CoV-2 입자 또는 그의 스파이크 단백질은 15개 이하의 베타-프로피오락톤-변형된 아미노산 잔기를 포함하고; 가장 바람직하게는 불활성화된 SARS-CoV-2 입자 또는 그의 스파이크 단백질은 1 내지 100, 2 내지 50, 3 내지 30, 5 내지 20 또는 약 15개의 베타-프로피오락톤-변형된 아미노산 잔기를 포함하는, SARS-CoV-2 백신.
- 제1항 내지 제9항 중 어느 한 항에 있어서, 입자에서 SARS-CoV-2 폴리펩티드의 20%, 15%, 10%, 5% 또는 4% 미만은 베타-프로피오락톤-변형되고; 바람직하게는 입자에서 SARS-CoV-2 폴리펩티드의 0.1 내지 10%, 보다 바람직하게는 1 내지 5%, 더욱 바람직하게는 2 내지 8% 또는 약 3-6%는, 바람직하게는 선택적으로 트립신, 키모트립신 및/또는 PNGase F로 효소적 단리 또는 산 가수분해에 이어 질량 분광법에 의해 백신에서 검출된 바와 같이; 적어도 하나의 베타-프로피오락톤 변형을 포함하는, SARS-CoV-2 백신.
- 제1항 내지 제10항 중 어느 한 항에 있어서,
(i) 불활성화된 SARS-CoV-2 입자의 스파이크(S) 단백질은 예를 들어. 서열번호: 3에서 다음 잔기: 49, 146, 166, 177, 207, 245, 379, 432, 519, 625, 1029, 1032, 1058, 1083, 1088, 1101, 1159 및/또는 1271; 바람직하게는 H49, H146, C166, M177, H207, H245, C432, H519, H625, M1029, H1058, H1083, H1088, H1101, H1159 및/또는 H1271; 또는 H207, H245, C379, M1029 및/또는 C1032, 또는 서열번호: 19, 21, 23, 25 또는 27에서 상응하는 위치 중 하나 이상에서 베타-프로피오락톤 변형을 포함하고/하거나;
(ii) 불활성화된 SARS-CoV-2 입자의 막(M) 당단백질은 다음 잔기: 125, 154, 155, 159 및/또는 210; 바람직하게는 예를 들어 서열번호: 29에서 H154, H155, C159 및/또는 H210 중 하나 이상에서 베타-프로피오락톤 변형을 포함하고;
(iii) 불활성화된 SARS-CoV-2 입자의 뉴클레오캡시드(N) 단백질은 예를 들어 서열번호: 28에서 M234에서 베타-프로피오락톤 변형을 포함하는, SARS-CoV-2 백신. - 제1항 내지 제11항 중 어느 한 항에 있어서, 불활성화된 SARS-CoV-2 입자에서, 다음 잔기 중 하나 이상의 30%, 20%, 10%, 5%, 3% 또는 1% 미만, 바람직하게는 다음 잔기 중 적어도 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19 또는 모두가 베타-프로피오락톤 변형되는, SARS-CoV-2 백신:
(i) 스파이크(S) 단백질에서, 잔기 49, 146, 166, 177, 207, 245, 379, 432, 519, 625, 1029, 1032, 1058, 1083, 1088, 1101, 1159 및/또는 1271; 바람직하게는 H49, H146, C166, M177, H207, H245, C432, H519, H625, M1029, H1058, H1083, H1088, H1101, H1159 및/또는 H1271; 또는 H207, H245, C379, M1029 및/또는 C1032; 예를 들어 서열번호: 3, 또는 서열번호: 19, 21, 23, 25 또는 27의 상응하는 위치 내; 및/또는 (ii) 막(M) 당단백질에서, 잔기 125, 154, 155, 159 및/또는 210; 바람직하게는 H154, H155, C159 및/또는 H210; 예를 들어 서열번호: 29 내; 및/또는 (iii) 예를 들어 서열번호: 28 내에서 뉴클레오캡시드(N) 단백질의 M234. - 제1항 내지 제12항 중 어느 한 항에 있어서, 불활성화된 SARS-CoV-2 입자에서 다음 위치의 각각에서 베타-프로피오락톤-변형된 잔기의 비율이
(i) 스파이크(S) 단백질(예를 들어, 서열번호: 3, 또는 서열번호: 19, 21, 23, 25 또는 27에서 상응하는 위치)에서:
(a) 잔기 H49, H146, C166, H207, H519, M1029, H1083, H1088, H1101, H1159 및/또는 H1271: 20% 미만, 바람직하게는 0.01 내지 10%, 더욱 바람직하게는 0.1 내지 5%; 및/또는
(b) 잔기 M177, C432, H625: 30% 미만, 바람직하게는 0.1 내지 20%, 더욱 바람직하게는 1 내지 10%; 및/또는
(c) 잔기 H245, H1058: 30% 미만, 바람직하게는 0.1 내지 20%, 보다 바람직하게는 5 내지 15%;
(ii) 막(M) 당단백질(예를 들어, 서열번호: 29)에서:
(f) H154: 5% 미만, 1% 미만 또는 0.1% 미만; 및/또는
(g) H155: 10% 미만, 바람직하게는 0.1 내지 5%; 및/또는
(h) C159: 5% 미만, 1% 미만 또는 0.1% 미만; 및/또는
(i) H210: 20% 미만, 바람직하게는 0.1 내지 10%; 및/또는
(iii) 뉴클레오캡시드(N) 단백질(예를 들어, 서열번호 28)에서:
(j) M234: 90% 미만, 10% 미만 또는 0.1% 미만인, SARS-CoV-2 백신. - 제1항 내지 제13항 중 어느 한 항에 있어서, 불활성화된 SARS-CoV-2 입자에 의한 포유동물 세포의 감염성은 천연 SARS-CoV-2 입자에 비해 적어도 99%, 99.99% 또는 99.9999% 감소되거나, 또는 불활성화된 SARS-CoV-2 입자에 의한 포유동물 세포의 감염성은 검출불가능한, SARS-CoV-2 백신.
- 제1항 내지 제14항 중 어느 한 항에 있어서, 하나 이상의 약학적으로 허용가능한 부형제, 예컨대 예를 들어 인간 혈청 알부민(HSA)을 추가로 포함하는, SARS-CoV-2 백신.
- 제1항 내지 제15항 중 어느 한 항에 있어서, 어쥬번트를 추가로 포함하는, SARS-CoV-2 백신.
- 제16항에 있어서, 어쥬번트는 수산화알루미늄 또는 인산알루미늄을 포함하는, SARS-CoV-2 백신.
- 제17항에 있어서, 수산화알루미늄 또는 인산알루미늄은 백신에서 유일한 어쥬번트인, SARS-CoV-2 백신.
- 제16항 또는 제17항에 있어서, 어쥬번트는 Th1 반응-지향 어쥬번트를 포함하거나 추가로 포함하는, SARS-CoV-2 백신.
- 제19항에 있어서, Th1 반응-지향 어쥬번트는 3-O-데사실-4'-모노포스포릴 지질 A(MPL), 사포닌 QS-21, CpG-함유 올리고데옥시뉴클레오티드(CpG ODN), 스쿠알렌, DL-α-토코페롤, 양이온성 펩티드, 데옥시이노신-함유 면역자극 올리고데옥시핵산 분자(I-ODN) 및/또는 이미퀴모드를 포함하는, SARS-CoV-2 백신.
- 제16항에 있어서, 어쥬번트는 하기를 포함하는, SARS-CoV-2 백신:
(i) 3-O-데사실-4'-모노포스포릴 지질 A(MPL) 및 사포닌 QS-21, 바람직하게는 어쥬번트 시스템 01을 포함하는 리포솜 제제;
(ii) 서열 5' TGACTGTGAACGTTCGAGATGA 3'(서열번호: 4), 바람직하게는 CpG 1018을 포함하는 CpG ODN;
(iii) 스쿠알렌, DL-α-토코페롤 및 폴리소르베이트 80(바람직하게는 어쥬번트 시스템 03);
(iv) 스쿠알렌, 트윈 80 및 스팬 85, 바람직하게는 MF59를 포함하는 수중유 에멀젼;
(v) 서열 KLKL5KLK(서열번호: 5) 및 올리고-d(IC)13(서열번호: 6)의 펩티드, 바람직하게는 IC31; 또는
(vi) 알루미늄 염 및 선택적으로 Th1-지향 어쥬번트. - 제1항 내지 제21항 중 어느 한 항에 있어서, 백신은 SARS-CoV-2 백신이 투여된 대상체를 적어도 70% 확률로 혈청전환시킬 수 있는, SARS-CoV-2 백신.
- 제22항에 있어서, SARS-CoV-2 백신은 SARS-CoV-2 백신이 투여된 대상체를 적어도 80%, 85%, 90%, 또는 95% 확률로 혈청전환시킬 수 있는, SARS-CoV-2 백신.
- 제1항 내지 제23항 중 어느 한 항에 있어서, SARS-CoV-2 입자는 (i) 서열번호: 9에 의해 정의된 바와 같은; 또는 (ii) 서열번호: 9에 대해 적어도 80%, 적어도 85%, 적어도 90%, 적어도 95% 또는 적어도 99% 서열 동일성을 갖는 DNA 서열에 상응하는 RNA 서열(및/또는 이의 단편, 변형된(바람직하게는 알킬화된 또는 아실화된) 뉴클레오티드 잔기를 선택적으로 포함함)을 포함하고; 바람직하게 RNA 서열을 포함하는 천연(비-불활성화된) SARS-CoV-2 입자는 독성 SARS-CoV-2를 패킹할 수 있는, SARS-CoV-2 백신.
- 제1항 내지 제24항 중 어느 한 항에 있어서, 상기 백신은 (i) 서열번호: 18에 의해 정의된 바와 같은; 또는 (ii) 서열번호: 18에 대해 적어도 80%, 적어도 85%, 적어도 90%, 적어도 95% 또는 적어도 99% 서열 동일성을 갖는 DNA 서열에 상응하는 RNA 서열(및/또는 이의 단편, 변형된(바람직하게는 알킬화된 또는 아실화된) 뉴클레오티드 잔기를 선택적으로 포함함)을 포함하는 부가의 SARS-CoV-2 입자를 포함하고; 바람직하게 RNA 서열을 포함하는 천연(비-불활성화된) SARS-CoV-2 입자는 독성 SARS-CoV-2를 패킹할 수 있는, SARS-CoV-2 백신.
- 제1항 내지 제25항 중 어느 한 항에 있어서, 상기 백신은 (i) 서열번호: 22에 의해 정의된 바와 같은; 또는 (ii) 서열번호: 22에 대해 적어도 80%, 적어도 85%, 적어도 90%, 적어도 95% 또는 적어도 99% 서열 동일성을 갖는 DNA 서열에 상응하는 RNA 서열(및/또는 이의 단편, 변형된(바람직하게는 알킬화된 또는 아실화된) 뉴클레오티드 잔기를 선택적으로 포함함)을 포함하는 부가의 SARS-CoV-2 입자를 포함하고; 바람직하게 RNA 서열을 포함하는 천연(비-불활성화된) SARS-CoV-2 입자는 독성 SARS-CoV-2를 패킹할 수 있는, SARS-CoV-2 백신.
- 제1항 내지 제26항 중 어느 한 항에 있어서, 백신은 Vero 세포로부터 수득되거나 수득가능한, SARS-CoV-2 백신.
- 제1항 내지 제27항 중 어느 한 항에 있어서, 인간 대상체에게 투여시 백신은 (i) SARS-CoV-2-연관된 질환(COVID-19)의 항체-의존성 증진(ADE)을 유도하지 않고/않거나; (ii) 대상체에서 면역병리학을 유도하지 않는, SARS-CoV-2 백신.
- 예방적으로 또는 치료적으로 유효한 양의 제1항 내지 제28항 중 어느 한 항의 SARS-CoV-2 백신을 대상체에게 투여하는 것을 포함하는, SARS-CoV-2 감염 및/또는 SARS-CoV-2-연관된 질환(COVID-19)의 예방 또는 치료를 필요로 하는 인간 대상체에서 SARS-CoV-2 감염 및/또는 SARS-CoV-2-연관된 질환(COVID-19)을 예방 또는 치료하는 방법.
- 제29항에 있어서, 예방적으로 또는 치료적으로 유효한 양의 SARS-CoV-2 백신의 제2 용량을 투여하는 단계를 추가로 포함하고, 바람직하게 백신의 제2 용량은 제1 용량과 동일한 제형인, 방법.
- 제29항 또는 제30항에 있어서, 용량당 SARS-CoV-2 백신의 상기 예방적으로 또는 치료적으로 유효한 양은, ELISA에 의해 평가시, 약 1 내지 100AU/용량, 바람직하게는 약 2 내지 75AU/용량, 바람직하게는 약 3 내지 60AU/용량, 보다 바람직하게는 약 3 내지 55AU/용량, 더욱 바람직하게는 약 3 내지 53AU/용량, 훨씬 더 바람직하게는 약 3 내지 40AU/용량, 더욱 바람직하게는 약 10 내지 60AU/용량, 20 내지 50AU/용량, 25 내지 45AU/용량 또는 30 내지 40AU/용량, 예컨대 예를 들어 35AU/용량 또는 40AU/용량으로 한정되는, 방법.
- 제29항 또는 제30항에 있어서, SARS-CoV-2 백신의 용량당 상기 예방적으로 또는 치료적으로 유효한 양은, 예를 들어 (μ)BCA에 의해 측정시, 약 0.05 내지 50μg 총 단백질, 약 0.1 내지 25μg, 약 0.25 내지 12.5μg, 바람직하게는 약 0.5 내지 5μg 총 단백질, 더욱 바람직하게는 적어도 2.5μg 총 단백질, 적어도 3.5μg 총 단백질 또는 적어도 2.5μg 총 단백질, 훨씬 더 바람직하게는 2.5μg 내지 25μg, 3.5μg 내지 10μg 또는 4μg 내지 6μg 총 단백질/용량, 가장 바람직하게는 약 5μg 총 단백질/용량으로 한정되는, 방법.
- 제29항 또는 제30항에 있어서, SARS-CoV-2 백신의 용량당 상기 예방적으로 또는 치료적으로 유효한 양은, ELISA에 의해 측정시, 약 0.025 내지 25μg S-단백질, 약 0.05 내지 12.5μg, 약 0.125 내지 6.25μg, 바람직하게는 약 0.25 내지 2.5μg S-단백질로 한정되는, 방법.
- 제30항에 있어서, SARS-CoV-2 백신의 제2 용량은 SARS-CoV-2 백신의 제1 용량 후 약 7일, 약 14일, 약 21일, 또는 약 28일에 투여되고, 바람직하게는 백신의 제2 용량은 제1 용량과 동일한 제형인, 방법.
- 제28항 내지 제34항 중 어느 한 항에 있어서, 투여는 SARS-CoV-2 중화 항체의 생성을 초래하는, 방법.
- SARS-CoV-2 백신을 제조하는 방법으로,
(a) 천연 SARS-CoV-2 입자를 생성하는 단계;
(b) 천연 SARS-CoV-2 입자를 불활성화하여 불활성화된 SARS-CoV-2 입자를 얻는 단계;
(c) 불활성화된 SARS-CoV-2 입자를 백신 조성물에 혼입하는 단계
를 포함하고,
여기서 SARS-CoV-2 입자의 천연 표면 형태는 불활성화 단계에서 보존되어 백신이 인간 대상체에서 천연 SARS-CoV-2 입자에 대한 중화 항체를 생성할 수 있는, 방법. - 제36항에 있어서, 백신 조성물은 수산화알루미늄을 포함하는, 방법.
- 제37항에 있어서, 수산화알루미늄을 포함하는 SARS-CoV-2 백신은 1.25ppb 미만의 Cu를 함유하는, 방법.
- 제36항 내지 제38항 중 어느 한 항에 있어서, 불활성화 단계는 SARS-CoV-2 입자에서 바이러스 RNA를 우선적으로 표적화하는, 방법.
- 제36항 또는 제39항에 있어서, 불활성화 단계는 (i) 바이러스 RNA를 알킬화 및/또는 아실화하는 것 (ii) 퓨린(바람직하게는 구아닌) 잔기를 변형하거나 가닥 절단을 바이러스 RNA 안으로 도입하는 것 및/또는 (iii) 하나 이상의 바이러스 단백질과 바이러스 RNA를 가교-결합하는 것을 포함하는, 방법.
- 제36항, 제39항 및 제40항 중 어느 한 항에 있어서, 불활성화 단계는 천연 SARS-CoV-2 입자를 베타-프로피오락톤으로 처리하는 것을 포함하는, 방법.
- 제41항에 있어서, 불활성화 단계에서 베타-프로피오락톤의 농도는 0.01 내지 1 중량%, 바람직하게는 0.05 내지 0.5 중량%, 보다 바람직하게는 약 0.1 중량%인, 방법.
- 제41항 또는 제42항에 있어서, 천연 SARS-CoV-2 입자는 적어도 5시간, 적어도 10시간, 적어도 24시간 또는 적어도 4일 동안 베타-프로피오락톤과 접촉되는, 방법.
- 제36항 및 제39항 내지 제43항 중 어느 한 항에 있어서, 불활성화 단계는 약 0℃ 내지 약 25℃, 바람직하게는 약 4℃ 또는 약 22℃에서 수행되는, 방법.
- 제36항 및 제39항 내지 제44항 중 어느 한 항에 있어서, 불활성화 단계는 천연 SARS-CoV-2 입자를 자외선(UV) 광으로 처리하는 것을 포함하는, 방법.
- 제36항 및 제39항 내지 제45항 중 어느 한 항에 있어서, 단계 (a)는 다음 단계 중 하나 이상을 포함하는, 방법:
(i) Vero 세포 상에서 SARS-CoV-2를 계대하고, 이에 의해 SARS-CoV-2를 포함하는 배양 배지를 생성하는 단계;
(ii) (i)의 배양 배지를 수확하는 단계;
(iii) (ii)의 수확된 배양 배지를 침전시키고, 이에 의해 상등액에서 천연 SARS-CoV-2 입자를 생성하는 단계. - 제46항에 있어서, 단계 (iii) 이전에 (ii)의 배양 배지를 농축하는 것을 추가로 포함하는, 방법.
- 제46항 또는 제47항에 있어서, (iii)의 침전은 (ii)의 배양 배지를 프로타민 설페이트 또는 벤조나아제와 접촉시키는 것을 포함하는, 방법.
- 제36항 및 제39항 내지 제48항 중 어느 한 항에 있어서, 불활성화된 SARS-CoV-2 입자를 투석하고, 이에 의해 투석된 SARS-CoV-2를 생성하는 것을 추가로 포함하는, 방법.
- 제49항에 있어서, 투석된 SARS-CoV-2를 여과하는 것을 추가로 포함하는, 방법.
- 제36항 및 제39항 내지 제50항 중 어느 한 항에 있어서, 불활성화 단계는 천연 SARS-CoV-2 입자를 포함하는 액체 조성물을 용기 내의 화학적 바이러스 불활성화제와 접촉시키기, SARS-CoV-2 입자를 포함하는 액체 조성물과 화학적 바이러스성 불활성화제를 난류가 아닌 층류 흐름의 조건 하에서 혼합하기, 및 바이러스 입자를 불활성화시키기에 충분한 시간 동안 화학적 바이러스 불활성화제 및 SARS-CoV-2 입자를 포함하는 액체 조성물을 인큐베이션하기를 포함하는, 방법.
- 제51항에 있어서, 불활성화 단계는 가요성 생물반응기 백에서 수행되는, 방법.
- 제51항 또는 제52항에 있어서, 불활성화 단계는 불활성화 기간 동안 5회 이하의 용기 반전(inversion)을 포함하는, 방법.
- 제51항 내지 제53항 중 어느 한 항에 있어서, 천연 SARS-CoV-2 입자를 포함하는 조성물과 화학적 바이러스 불활성화제의 혼합은 인큐베이션의 기간 동안 10rpm 이하에서 10분 이하 동안 용기를 흔들기(rocking), 회전(rotation), 궤도 진탕(orbital shaking) 또는 진동(oscillation)시키는 것을 포함하는, 방법.
- 제36항 및 제39항 내지 제54항 중 어느 한 항에 있어서, (i) 배치 크로마토그래피 및/또는 (ii) 수크로스 밀도 구배 원심분리로부터 선택되는 하나 이상의 방법에 의해 불활성화된 SARS-CoV-2 입자를 정제하는 것을 추가로 포함하는, 방법.
- 제36항 및 제39항 내지 제55항 중 어느 한 항에 있어서, 단계 (c)는 불활성화된 SARS-CoV-2 입자를 어쥬번트와 조합하는 것을 포함하는, 방법.
- 제56항에 있어서, 어쥬번트는 Th1 반응-지향 어쥬번트를 포함하는, 방법.
- 제56항 또는 제57항에 있어서, 어쥬번트는 3-O-데사실-4'-모노포스포릴 지질 A(MPL), 사포닌 QS-21, CpG-함유 올리고데옥시뉴클레오티드(CpG ODN), 스쿠알렌, DL-α-토코페롤 및/또는 이미퀴모드를 포함하는, 방법.
- 제36항 및 제39항 내지 제58항 중 어느 한 항의 방법에 의해 수득되거나 수득가능한 SARS-CoV-2 백신.
- 대상체에서 SARS-CoV-2 감염의 치료 또는 예방을 위한 제1항 내지 제28항 및 제59항 중 어느 한 항의 SARS-CoV-2 백신의 용도.
- 대상체에서 SARS-CoV-2 감염의 예방 또는 치료에 사용하기 위한 약학적 조성물로서, 상기 약학적 조성물은 선택적으로 하나 이상의 약학적으로 허용가능한 부형제 및/또는 어쥬번트와 조합된 제1항 내지 제28항 및 제59항 중 어느 한 항에 정의된 불활성화된 SARS-CoV-2 백신인, 약학적 조성물.
- 의약으로서 사용하기 위한 제1항 내지 제28항 및 제59항 중 어느 한 항에 정의된 SARS-CoV-2 백신.
- 제1항 내지 제62항 중 어느 한 항에 있어서, 대상체는 (i) 고령 대상체, 바람직하게는 65세 이상, 70세 이상 또는 80세 이상의 대상체; (ii) 면역저하 대상체; 또는 (iii) 임신한 대상체인, 백신, 방법, 용도 또는 약학적 조성물.
- (i) SARS-CoV-2-연관된 질환(COVID-19)의 항체-의존성 증진(ADE); 및/또는 (ii) 대상체에서 면역병리학의 유도 없이 SARS-CoV-2 감염의 예방 또는 치료에 사용하기 위한, 제1항 내지 제63항 중 어느 한 항에 따른 백신, 방법, 용도 또는 약학적 조성물.
Applications Claiming Priority (13)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
EP20168324 | 2020-04-06 | ||
EP20168324.0 | 2020-04-06 | ||
EP20202118 | 2020-10-15 | ||
EP20202118.4 | 2020-10-15 | ||
EP20211853.5 | 2020-12-04 | ||
EP20211853 | 2020-12-04 | ||
EP21154647 | 2021-02-01 | ||
EP21154647.8 | 2021-02-01 | ||
USPCT/US2021/020313 | 2021-03-01 | ||
PCT/US2021/020313 WO2021178318A1 (en) | 2020-03-01 | 2021-03-01 | Coronavirus vaccines comprising a tlr9 agonist |
EP21160913.6 | 2021-03-05 | ||
EP21160913 | 2021-03-05 | ||
PCT/EP2021/058974 WO2021204825A2 (en) | 2020-04-06 | 2021-04-06 | INACTIVATED SARS-CoV-2 VIRUS VACCINE |
Publications (1)
Publication Number | Publication Date |
---|---|
KR20220164500A true KR20220164500A (ko) | 2022-12-13 |
Family
ID=79566178
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
KR1020227034302A KR20220164500A (ko) | 2020-04-06 | 2021-04-06 | 불활성화된 SARS-CoV-2 바이러스 백신 |
Country Status (14)
Country | Link |
---|---|
US (1) | US20240293531A1 (ko) |
EP (1) | EP3955959A2 (ko) |
JP (1) | JP2023520521A (ko) |
KR (1) | KR20220164500A (ko) |
CN (1) | CN115768469A (ko) |
AU (1) | AU2021253605A1 (ko) |
BR (1) | BR112022020100A2 (ko) |
CA (1) | CA3168784A1 (ko) |
CL (1) | CL2022002365A1 (ko) |
CO (1) | CO2022013715A2 (ko) |
EC (1) | ECSP22072590A (ko) |
IL (1) | IL296072A (ko) |
MX (1) | MX2022012447A (ko) |
ZA (1) | ZA202209826B (ko) |
-
2021
- 2021-04-06 IL IL296072A patent/IL296072A/en unknown
- 2021-04-06 MX MX2022012447A patent/MX2022012447A/es unknown
- 2021-04-06 AU AU2021253605A patent/AU2021253605A1/en active Pending
- 2021-04-06 BR BR112022020100A patent/BR112022020100A2/pt not_active Application Discontinuation
- 2021-04-06 KR KR1020227034302A patent/KR20220164500A/ko unknown
- 2021-04-06 EP EP21716442.5A patent/EP3955959A2/en active Pending
- 2021-04-06 CN CN202180026748.7A patent/CN115768469A/zh active Pending
- 2021-04-06 CA CA3168784A patent/CA3168784A1/en active Pending
- 2021-04-06 US US17/913,638 patent/US20240293531A1/en active Pending
- 2021-04-06 JP JP2022560229A patent/JP2023520521A/ja active Pending
-
2022
- 2022-08-30 CL CL2022002365A patent/CL2022002365A1/es unknown
- 2022-09-02 ZA ZA2022/09826A patent/ZA202209826B/en unknown
- 2022-09-16 EC ECSENADI202272590A patent/ECSP22072590A/es unknown
- 2022-09-23 CO CONC2022/0013715A patent/CO2022013715A2/es unknown
Also Published As
Publication number | Publication date |
---|---|
IL296072A (en) | 2022-11-01 |
EP3955959A2 (en) | 2022-02-23 |
CL2022002365A1 (es) | 2023-02-03 |
BR112022020100A2 (pt) | 2022-11-29 |
AU2021253605A1 (en) | 2022-10-06 |
CN115768469A (zh) | 2023-03-07 |
CO2022013715A2 (es) | 2022-12-30 |
MX2022012447A (es) | 2022-10-27 |
JP2023520521A (ja) | 2023-05-17 |
ZA202209826B (en) | 2023-05-31 |
ECSP22072590A (es) | 2022-10-31 |
US20240293531A1 (en) | 2024-09-05 |
CA3168784A1 (en) | 2021-10-14 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN108697785B (zh) | Zika病毒疫苗 | |
CN113321739B (zh) | 一种covid-19亚单位疫苗及其制备方法与应用 | |
JP7088841B2 (ja) | 安定化された可溶性融合前rsv fタンパク質 | |
Guo et al. | Foot-and-mouth disease virus-like particles produced by a SUMO fusion protein system in Escherichia coli induce potent protective immune responses in guinea pigs, swine and cattle | |
CN112575008B (zh) | 编码新型冠状病毒的结构蛋白的核酸分子以及新型冠状病毒疫苗 | |
TWI688652B (zh) | 作為對抗腸病毒感染之免疫原的病毒顆粒及其製造 | |
CN113151312A (zh) | 新型冠状病毒SARS-CoV-2 mRNA疫苗及其制备方法和应用 | |
CN113666990B (zh) | 一种诱导广谱抗冠状病毒的t细胞疫苗免疫原及其应用 | |
TW202203967A (zh) | 不活化SARS—CoV—2病毒疫苗 | |
KR20230012583A (ko) | 합성 변형된 백시니아 앙카라 (sMVA) 기반 코로나바이러스 백신 | |
KR20230005814A (ko) | Cpg-어쥬번트된 sars-cov-2 바이러스 백신 | |
WO2021222639A2 (en) | Recombinant human metapneumovirus f proteins and their use | |
CN108210921A (zh) | 一种寨卡病毒疫苗及其制备方法 | |
CN112175913A (zh) | SARS-CoV-2减毒株及其在预防新冠肺炎中的应用 | |
CN109923125B (zh) | 免疫原性组合物 | |
TW201938578A (zh) | 腸病毒疫苗 | |
KR20220164500A (ko) | 불활성화된 SARS-CoV-2 바이러스 백신 | |
Li et al. | The CDE region of feline Calicivirus VP1 protein is a potential candidate subunit vaccine | |
RU2290204C1 (ru) | Рекомбинантный гибридный белок, препарат для иммунотерапии на его основе и способ иммунотерапии рецидивирующего папилломатоза гортани | |
RU2813150C2 (ru) | Выделенный рекомбинантный вирус на основе вируса гриппа для индукции специфического иммунитета к вирусу гриппа и/или профилактики заболеваний, вызванных вирусом гриппа | |
KR102680105B1 (ko) | 코로나19 바이러스 및 인플루엔자 h1n1 바이러스에 대한 바이러스 유사입자 및 이의 용도 | |
CN115836124A (zh) | SARS-CoV-2灭活疫苗及其制备 | |
CN111712575B (zh) | 表达外源猫副粘病毒基因的重组病毒载体系统及由其制备的疫苗 | |
EP0949333B1 (en) | Measles virus mutant antigen and gene encoding the same | |
MXPA01010481A (es) | Acidos nucleicos y polipeptidos de lisavirus quimerico. |