KR20220150323A - 코로나바이러스를 방지하는 백신 생산을 위한 완전 합성 장쇄 핵산 - Google Patents
코로나바이러스를 방지하는 백신 생산을 위한 완전 합성 장쇄 핵산 Download PDFInfo
- Publication number
- KR20220150323A KR20220150323A KR1020227033430A KR20227033430A KR20220150323A KR 20220150323 A KR20220150323 A KR 20220150323A KR 1020227033430 A KR1020227033430 A KR 1020227033430A KR 20227033430 A KR20227033430 A KR 20227033430A KR 20220150323 A KR20220150323 A KR 20220150323A
- Authority
- KR
- South Korea
- Prior art keywords
- sequence
- seq
- leu
- val
- ser
- Prior art date
Links
- 150000007523 nucleic acids Chemical class 0.000 title claims abstract description 226
- 102000039446 nucleic acids Human genes 0.000 title claims abstract description 214
- 108020004707 nucleic acids Proteins 0.000 title claims abstract description 214
- 229960005486 vaccine Drugs 0.000 title claims abstract description 92
- 238000004519 manufacturing process Methods 0.000 title claims abstract description 68
- 241000004176 Alphacoronavirus Species 0.000 title 1
- 241001678559 COVID-19 virus Species 0.000 claims abstract description 76
- 210000004779 membrane envelope Anatomy 0.000 claims abstract description 73
- 239000012634 fragment Substances 0.000 claims abstract description 44
- 235000004252 protein component Nutrition 0.000 claims description 109
- 108090000623 proteins and genes Proteins 0.000 claims description 93
- 108020004414 DNA Proteins 0.000 claims description 91
- 102000053602 DNA Human genes 0.000 claims description 91
- 108010003533 Viral Envelope Proteins Proteins 0.000 claims description 90
- 102000004169 proteins and genes Human genes 0.000 claims description 81
- 235000018102 proteins Nutrition 0.000 claims description 80
- 239000013598 vector Substances 0.000 claims description 72
- 238000013452 biotechnological production Methods 0.000 claims description 51
- 229920002477 rna polymer Polymers 0.000 claims description 47
- 239000002773 nucleotide Substances 0.000 claims description 45
- 125000003729 nucleotide group Chemical group 0.000 claims description 45
- 239000002253 acid Substances 0.000 claims description 38
- 238000000034 method Methods 0.000 claims description 38
- 101710091045 Envelope protein Proteins 0.000 claims description 30
- 101710188315 Protein X Proteins 0.000 claims description 30
- 102100021696 Syncytin-1 Human genes 0.000 claims description 30
- 230000014509 gene expression Effects 0.000 claims description 27
- 239000013612 plasmid Substances 0.000 claims description 23
- 108010052285 Membrane Proteins Proteins 0.000 claims description 18
- 102000018697 Membrane Proteins Human genes 0.000 claims description 18
- FWMNVWWHGCHHJJ-SKKKGAJSSA-N 4-amino-1-[(2r)-6-amino-2-[[(2r)-2-[[(2r)-2-[[(2r)-2-amino-3-phenylpropanoyl]amino]-3-phenylpropanoyl]amino]-4-methylpentanoyl]amino]hexanoyl]piperidine-4-carboxylic acid Chemical compound C([C@H](C(=O)N[C@H](CC(C)C)C(=O)N[C@H](CCCCN)C(=O)N1CCC(N)(CC1)C(O)=O)NC(=O)[C@H](N)CC=1C=CC=CC=1)C1=CC=CC=C1 FWMNVWWHGCHHJJ-SKKKGAJSSA-N 0.000 claims description 12
- 101800001632 Envelope protein E Proteins 0.000 claims description 12
- 230000003321 amplification Effects 0.000 claims description 12
- 238000003199 nucleic acid amplification method Methods 0.000 claims description 12
- 108010089430 Phosphoproteins Proteins 0.000 claims description 11
- 102000007982 Phosphoproteins Human genes 0.000 claims description 11
- 101710087110 ORF6 protein Proteins 0.000 claims description 9
- 101710198378 Uncharacterized 10.8 kDa protein in cox-rep intergenic region Proteins 0.000 claims description 9
- 101710095001 Uncharacterized protein in nifU 5'region Proteins 0.000 claims description 9
- 108020004999 messenger RNA Proteins 0.000 claims description 9
- 102100031673 Corneodesmosin Human genes 0.000 claims description 8
- 238000013519 translation Methods 0.000 claims description 8
- 101710139375 Corneodesmosin Proteins 0.000 claims description 7
- 101000779242 Severe acute respiratory syndrome coronavirus 2 ORF3a protein Proteins 0.000 claims description 7
- 101000596353 Severe acute respiratory syndrome coronavirus 2 ORF7a protein Proteins 0.000 claims description 7
- 239000013600 plasmid vector Substances 0.000 claims description 5
- 101000833492 Homo sapiens Jouberin Proteins 0.000 claims description 2
- 101000651236 Homo sapiens NCK-interacting protein with SH3 domain Proteins 0.000 claims description 2
- 102100024407 Jouberin Human genes 0.000 claims description 2
- 229940023143 protein vaccine Drugs 0.000 claims description 2
- 238000004806 packaging method and process Methods 0.000 claims 1
- 230000003612 virological effect Effects 0.000 abstract description 13
- 208000025721 COVID-19 Diseases 0.000 abstract description 5
- 201000010099 disease Diseases 0.000 abstract description 5
- 208000037265 diseases, disorders, signs and symptoms Diseases 0.000 abstract description 5
- 108700002856 Coronavirus Envelope Proteins Proteins 0.000 abstract 1
- 241000008910 Severe acute respiratory syndrome-related coronavirus Species 0.000 abstract 1
- 210000004027 cell Anatomy 0.000 description 66
- 241000700605 Viruses Species 0.000 description 33
- 239000000047 product Substances 0.000 description 30
- 241000711573 Coronaviridae Species 0.000 description 23
- 241000880493 Leptailurus serval Species 0.000 description 23
- 108010050848 glycylleucine Proteins 0.000 description 22
- 230000010076 replication Effects 0.000 description 21
- 108010061238 threonyl-glycine Proteins 0.000 description 21
- 230000004048 modification Effects 0.000 description 20
- 238000012986 modification Methods 0.000 description 20
- VPZXBVLAVMBEQI-UHFFFAOYSA-N glycyl-DL-alpha-alanine Natural products OC(=O)C(C)NC(=O)CN VPZXBVLAVMBEQI-UHFFFAOYSA-N 0.000 description 18
- 108010057821 leucylproline Proteins 0.000 description 18
- 108010073969 valyllysine Proteins 0.000 description 18
- JBCLFWXMTIKCCB-UHFFFAOYSA-N H-Gly-Phe-OH Natural products NCC(=O)NC(C(O)=O)CC1=CC=CC=C1 JBCLFWXMTIKCCB-UHFFFAOYSA-N 0.000 description 17
- 230000008569 process Effects 0.000 description 17
- FADYJNXDPBKVCA-UHFFFAOYSA-N L-Phenylalanyl-L-lysin Natural products NCCCCC(C(O)=O)NC(=O)C(N)CC1=CC=CC=C1 FADYJNXDPBKVCA-UHFFFAOYSA-N 0.000 description 16
- YBAFDPFAUTYYRW-UHFFFAOYSA-N N-L-alpha-glutamyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CCC(O)=O YBAFDPFAUTYYRW-UHFFFAOYSA-N 0.000 description 16
- 108090001074 Nucleocapsid Proteins Proteins 0.000 description 15
- 108010037850 glycylvaline Proteins 0.000 description 15
- 125000003275 alpha amino acid group Chemical group 0.000 description 14
- 108010038633 aspartylglutamate Proteins 0.000 description 14
- 238000012217 deletion Methods 0.000 description 14
- 230000037430 deletion Effects 0.000 description 14
- SITLTJHOQZFJGG-UHFFFAOYSA-N N-L-alpha-glutamyl-L-valine Natural products CC(C)C(C(O)=O)NC(=O)C(N)CCC(O)=O SITLTJHOQZFJGG-UHFFFAOYSA-N 0.000 description 13
- 108010047857 aspartylglycine Proteins 0.000 description 13
- 108010089804 glycyl-threonine Proteins 0.000 description 13
- 108010026333 seryl-proline Proteins 0.000 description 13
- 238000003786 synthesis reaction Methods 0.000 description 13
- KOSRFJWDECSPRO-UHFFFAOYSA-N alpha-L-glutamyl-L-glutamic acid Natural products OC(=O)CCC(N)C(=O)NC(CCC(O)=O)C(O)=O KOSRFJWDECSPRO-UHFFFAOYSA-N 0.000 description 12
- FSXRLASFHBWESK-UHFFFAOYSA-N dipeptide phenylalanyl-tyrosine Natural products C=1C=C(O)C=CC=1CC(C(O)=O)NC(=O)C(N)CC1=CC=CC=C1 FSXRLASFHBWESK-UHFFFAOYSA-N 0.000 description 12
- XBGGUPMXALFZOT-UHFFFAOYSA-N glycyl-L-tyrosine hemihydrate Natural products NCC(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 XBGGUPMXALFZOT-UHFFFAOYSA-N 0.000 description 12
- XMBSYZWANAQXEV-UHFFFAOYSA-N N-alpha-L-glutamyl-L-phenylalanine Natural products OC(=O)CCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 XMBSYZWANAQXEV-UHFFFAOYSA-N 0.000 description 11
- 108010069205 aspartyl-phenylalanine Proteins 0.000 description 11
- 230000015572 biosynthetic process Effects 0.000 description 11
- LHSGPCFBGJHPCY-UHFFFAOYSA-N L-leucine-L-tyrosine Natural products CC(C)CC(N)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 LHSGPCFBGJHPCY-UHFFFAOYSA-N 0.000 description 10
- 239000002671 adjuvant Substances 0.000 description 10
- 108010047495 alanylglycine Proteins 0.000 description 10
- 108010087823 glycyltyrosine Proteins 0.000 description 10
- 108010034529 leucyl-lysine Proteins 0.000 description 10
- 108010012058 leucyltyrosine Proteins 0.000 description 10
- 108010017391 lysylvaline Proteins 0.000 description 10
- 241000282326 Felis catus Species 0.000 description 9
- 108010079364 N-glycylalanine Proteins 0.000 description 9
- 108091028043 Nucleic acid sequence Proteins 0.000 description 9
- 240000004808 Saccharomyces cerevisiae Species 0.000 description 9
- 235000014680 Saccharomyces cerevisiae Nutrition 0.000 description 9
- 108010005233 alanylglutamic acid Proteins 0.000 description 9
- 238000012824 chemical production Methods 0.000 description 9
- 108010016616 cysteinylglycine Proteins 0.000 description 9
- 108010069495 cysteinyltyrosine Proteins 0.000 description 9
- 230000028993 immune response Effects 0.000 description 9
- 108010030617 leucyl-phenylalanyl-valine Proteins 0.000 description 9
- 108010003700 lysyl aspartic acid Proteins 0.000 description 9
- 239000002245 particle Substances 0.000 description 9
- 230000000890 antigenic effect Effects 0.000 description 8
- 108010004073 cysteinylcysteine Proteins 0.000 description 8
- 108010055341 glutamyl-glutamic acid Proteins 0.000 description 8
- 108010049041 glutamylalanine Proteins 0.000 description 8
- 108010081551 glycylphenylalanine Proteins 0.000 description 8
- 108010025306 histidylleucine Proteins 0.000 description 8
- 108010064235 lysylglycine Proteins 0.000 description 8
- 108010051242 phenylalanylserine Proteins 0.000 description 8
- 108010071207 serylmethionine Proteins 0.000 description 8
- 238000006467 substitution reaction Methods 0.000 description 8
- 108010003137 tyrosyltyrosine Proteins 0.000 description 8
- PMGDADKJMCOXHX-UHFFFAOYSA-N L-Arginyl-L-glutamin-acetat Natural products NC(=N)NCCCC(N)C(=O)NC(CCC(N)=O)C(O)=O PMGDADKJMCOXHX-UHFFFAOYSA-N 0.000 description 7
- RCFDOSNHHZGBOY-UHFFFAOYSA-N L-isoleucyl-L-alanine Natural products CCC(C)C(N)C(=O)NC(C)C(O)=O RCFDOSNHHZGBOY-UHFFFAOYSA-N 0.000 description 7
- 108060004795 Methyltransferase Proteins 0.000 description 7
- XZFYRXDAULDNFX-UHFFFAOYSA-N N-L-cysteinyl-L-phenylalanine Natural products SCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 XZFYRXDAULDNFX-UHFFFAOYSA-N 0.000 description 7
- KZNQNBZMBZJQJO-UHFFFAOYSA-N N-glycyl-L-proline Natural products NCC(=O)N1CCCC1C(O)=O KZNQNBZMBZJQJO-UHFFFAOYSA-N 0.000 description 7
- 108010041407 alanylaspartic acid Proteins 0.000 description 7
- 108010040443 aspartyl-aspartic acid Proteins 0.000 description 7
- 239000013613 expression plasmid Substances 0.000 description 7
- 108010078144 glutaminyl-glycine Proteins 0.000 description 7
- 238000003780 insertion Methods 0.000 description 7
- 230000037431 insertion Effects 0.000 description 7
- 108010027338 isoleucylcysteine Proteins 0.000 description 7
- 108010038320 lysylphenylalanine Proteins 0.000 description 7
- 108010031719 prolyl-serine Proteins 0.000 description 7
- 238000001890 transfection Methods 0.000 description 7
- 238000011282 treatment Methods 0.000 description 7
- 108010051110 tyrosyl-lysine Proteins 0.000 description 7
- WSGVTKZFVJSJOG-RCOVLWMOSA-N Asp-Gly-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O WSGVTKZFVJSJOG-RCOVLWMOSA-N 0.000 description 6
- RVKIPWVMZANZLI-UHFFFAOYSA-N H-Lys-Trp-OH Natural products C1=CC=C2C(CC(NC(=O)C(N)CCCCN)C(O)=O)=CNC2=C1 RVKIPWVMZANZLI-UHFFFAOYSA-N 0.000 description 6
- 108010065920 Insulin Lispro Proteins 0.000 description 6
- FBNPMTNBFFAMMH-UHFFFAOYSA-N Leu-Val-Arg Natural products CC(C)CC(N)C(=O)NC(C(C)C)C(=O)NC(C(O)=O)CCCN=C(N)N FBNPMTNBFFAMMH-UHFFFAOYSA-N 0.000 description 6
- AJHCSUXXECOXOY-UHFFFAOYSA-N N-glycyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)CN)C(O)=O)=CNC2=C1 AJHCSUXXECOXOY-UHFFFAOYSA-N 0.000 description 6
- ASQFIHTXXMFENG-XPUUQOCRSA-N Val-Ala-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O ASQFIHTXXMFENG-XPUUQOCRSA-N 0.000 description 6
- 108010024078 alanyl-glycyl-serine Proteins 0.000 description 6
- 108010013835 arginine glutamate Proteins 0.000 description 6
- 108010008355 arginyl-glutamine Proteins 0.000 description 6
- 108010062796 arginyllysine Proteins 0.000 description 6
- 108010092854 aspartyllysine Proteins 0.000 description 6
- 238000011161 development Methods 0.000 description 6
- 230000018109 developmental process Effects 0.000 description 6
- 108010054813 diprotin B Proteins 0.000 description 6
- 210000005260 human cell Anatomy 0.000 description 6
- 108010044374 isoleucyl-tyrosine Proteins 0.000 description 6
- -1 lipoplexes Polymers 0.000 description 6
- 108010009298 lysylglutamic acid Proteins 0.000 description 6
- 239000000203 mixture Substances 0.000 description 6
- 108010070409 phenylalanyl-glycyl-glycine Proteins 0.000 description 6
- 108090000765 processed proteins & peptides Proteins 0.000 description 6
- 108010029020 prolylglycine Proteins 0.000 description 6
- 238000000746 purification Methods 0.000 description 6
- MXOODARRORARSU-ACZMJKKPSA-N Glu-Ala-Ser Chemical compound C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CCC(=O)O)N MXOODARRORARSU-ACZMJKKPSA-N 0.000 description 5
- JQFILXICXLDTRR-FBCQKBJTSA-N Gly-Thr-Gly Chemical compound NCC(=O)N[C@@H]([C@H](O)C)C(=O)NCC(O)=O JQFILXICXLDTRR-FBCQKBJTSA-N 0.000 description 5
- DUAWRXXTOQOECJ-JSGCOSHPSA-N Gly-Tyr-Val Chemical compound [H]NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(O)=O DUAWRXXTOQOECJ-JSGCOSHPSA-N 0.000 description 5
- TZCGZYWNIDZZMR-UHFFFAOYSA-N Ile-Arg-Ala Natural products CCC(C)C(N)C(=O)NC(C(=O)NC(C)C(O)=O)CCCN=C(N)N TZCGZYWNIDZZMR-UHFFFAOYSA-N 0.000 description 5
- QIHJTGSVGIPHIW-QSFUFRPTSA-N Ile-Asn-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](C(C)C)C(=O)O)N QIHJTGSVGIPHIW-QSFUFRPTSA-N 0.000 description 5
- DFFTXLCCDFYRKD-MBLNEYKQSA-N Ile-Gly-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(=O)O)N DFFTXLCCDFYRKD-MBLNEYKQSA-N 0.000 description 5
- KWTVLKBOQATPHJ-SRVKXCTJSA-N Leu-Ala-Lys Chemical compound C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(C)C)N KWTVLKBOQATPHJ-SRVKXCTJSA-N 0.000 description 5
- MYGQXVYRZMKRDB-SRVKXCTJSA-N Leu-Asp-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN MYGQXVYRZMKRDB-SRVKXCTJSA-N 0.000 description 5
- POZULHZYLPGXMR-ONGXEEELSA-N Leu-Gly-Val Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O POZULHZYLPGXMR-ONGXEEELSA-N 0.000 description 5
- IAJFFZORSWOZPQ-SRVKXCTJSA-N Leu-Leu-Asn Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O IAJFFZORSWOZPQ-SRVKXCTJSA-N 0.000 description 5
- YQFZRHYZLARWDY-IHRRRGAJSA-N Leu-Val-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCCN YQFZRHYZLARWDY-IHRRRGAJSA-N 0.000 description 5
- 108010087066 N2-tryptophyllysine Proteins 0.000 description 5
- BQVUABVGYYSDCJ-UHFFFAOYSA-N Nalpha-L-Leucyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)CC(C)C)C(O)=O)=CNC2=C1 BQVUABVGYYSDCJ-UHFFFAOYSA-N 0.000 description 5
- 101710141454 Nucleoprotein Proteins 0.000 description 5
- PMTWIUBUQRGCSB-FXQIFTODSA-N Ser-Val-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O PMTWIUBUQRGCSB-FXQIFTODSA-N 0.000 description 5
- 201000003176 Severe Acute Respiratory Syndrome Diseases 0.000 description 5
- 101150088517 TCTA gene Proteins 0.000 description 5
- KXUKIBHIVRYOIP-ZKWXMUAHSA-N Val-Asp-Cys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CS)C(=O)O)N KXUKIBHIVRYOIP-ZKWXMUAHSA-N 0.000 description 5
- TZVUSFMQWPWHON-NHCYSSNCSA-N Val-Asp-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](C(C)C)N TZVUSFMQWPWHON-NHCYSSNCSA-N 0.000 description 5
- UKEVLVBHRKWECS-LSJOCFKGSA-N Val-Ile-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](C(C)C)N UKEVLVBHRKWECS-LSJOCFKGSA-N 0.000 description 5
- SYSWVVCYSXBVJG-RHYQMDGZSA-N Val-Leu-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](C(C)C)N)O SYSWVVCYSXBVJG-RHYQMDGZSA-N 0.000 description 5
- GBIUHAYJGWVNLN-UHFFFAOYSA-N Val-Ser-Pro Natural products CC(C)C(N)C(=O)NC(CO)C(=O)N1CCCC1C(O)=O GBIUHAYJGWVNLN-UHFFFAOYSA-N 0.000 description 5
- 108010045350 alanyl-tyrosyl-alanine Proteins 0.000 description 5
- 108010044940 alanylglutamine Proteins 0.000 description 5
- 108010087924 alanylproline Proteins 0.000 description 5
- 230000008827 biological function Effects 0.000 description 5
- 230000000694 effects Effects 0.000 description 5
- 108010027668 glycyl-alanyl-valine Proteins 0.000 description 5
- 108010010147 glycylglutamine Proteins 0.000 description 5
- 108010018006 histidylserine Proteins 0.000 description 5
- 108010078274 isoleucylvaline Proteins 0.000 description 5
- 108010054155 lysyllysine Proteins 0.000 description 5
- 230000035772 mutation Effects 0.000 description 5
- 102000004196 processed proteins & peptides Human genes 0.000 description 5
- 108010004914 prolylarginine Proteins 0.000 description 5
- 108010015796 prolylisoleucine Proteins 0.000 description 5
- 108010090894 prolylleucine Proteins 0.000 description 5
- 108010053725 prolylvaline Proteins 0.000 description 5
- 238000000926 separation method Methods 0.000 description 5
- CVGNCMIULZNYES-WHFBIAKZSA-N Ala-Asn-Gly Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O CVGNCMIULZNYES-WHFBIAKZSA-N 0.000 description 4
- LNNSWWRRYJLGNI-NAKRPEOUSA-N Ala-Ile-Val Chemical compound C[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O LNNSWWRRYJLGNI-NAKRPEOUSA-N 0.000 description 4
- SUMYEVXWCAYLLJ-GUBZILKMSA-N Ala-Leu-Gln Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O SUMYEVXWCAYLLJ-GUBZILKMSA-N 0.000 description 4
- MDNAVFBZPROEHO-UHFFFAOYSA-N Ala-Lys-Val Natural products CC(C)C(C(O)=O)NC(=O)C(NC(=O)C(C)N)CCCCN MDNAVFBZPROEHO-UHFFFAOYSA-N 0.000 description 4
- ARHJJAAWNWOACN-FXQIFTODSA-N Ala-Ser-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O ARHJJAAWNWOACN-FXQIFTODSA-N 0.000 description 4
- NVUIWHJLPSZZQC-CYDGBPFRSA-N Arg-Ile-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O NVUIWHJLPSZZQC-CYDGBPFRSA-N 0.000 description 4
- FRBAHXABMQXSJQ-FXQIFTODSA-N Arg-Ser-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O FRBAHXABMQXSJQ-FXQIFTODSA-N 0.000 description 4
- RAQMSGVCGSJKCL-FOHZUACHSA-N Asn-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC(N)=O RAQMSGVCGSJKCL-FOHZUACHSA-N 0.000 description 4
- HDHZCEDPLTVHFZ-GUBZILKMSA-N Asn-Leu-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O HDHZCEDPLTVHFZ-GUBZILKMSA-N 0.000 description 4
- NCFJQJRLQJEECD-NHCYSSNCSA-N Asn-Leu-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O NCFJQJRLQJEECD-NHCYSSNCSA-N 0.000 description 4
- WQAOZCVOOYUWKG-LSJOCFKGSA-N Asn-Val-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)O)NC(=O)[C@H](CC(=O)N)N WQAOZCVOOYUWKG-LSJOCFKGSA-N 0.000 description 4
- BRRPVTUFESPTCP-ACZMJKKPSA-N Asp-Ser-Glu Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O BRRPVTUFESPTCP-ACZMJKKPSA-N 0.000 description 4
- BPAUXFVCSYQDQX-JRQIVUDYSA-N Asp-Tyr-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CC(=O)O)N)O BPAUXFVCSYQDQX-JRQIVUDYSA-N 0.000 description 4
- WAEDSQFVZJUHLI-BYULHYEWSA-N Asp-Val-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O WAEDSQFVZJUHLI-BYULHYEWSA-N 0.000 description 4
- FCXJJTRGVAZDER-FXQIFTODSA-N Cys-Val-Ala Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O FCXJJTRGVAZDER-FXQIFTODSA-N 0.000 description 4
- 238000007702 DNA assembly Methods 0.000 description 4
- IAZDPXIOMUYVGZ-UHFFFAOYSA-N Dimethylsulphoxide Chemical compound CS(C)=O IAZDPXIOMUYVGZ-UHFFFAOYSA-N 0.000 description 4
- 101710204837 Envelope small membrane protein Proteins 0.000 description 4
- YWAQATDNEKZFFK-BYPYZUCNSA-N Gly-Gly-Ser Chemical compound NCC(=O)NCC(=O)N[C@@H](CO)C(O)=O YWAQATDNEKZFFK-BYPYZUCNSA-N 0.000 description 4
- SWQALSGKVLYKDT-UHFFFAOYSA-N Gly-Ile-Ala Natural products NCC(=O)NC(C(C)CC)C(=O)NC(C)C(O)=O SWQALSGKVLYKDT-UHFFFAOYSA-N 0.000 description 4
- ZLCLYFGMKFCDCN-XPUUQOCRSA-N Gly-Ser-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CO)NC(=O)CN)C(O)=O ZLCLYFGMKFCDCN-XPUUQOCRSA-N 0.000 description 4
- KSOBNUBCYHGUKH-UWVGGRQHSA-N Gly-Val-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)CN KSOBNUBCYHGUKH-UWVGGRQHSA-N 0.000 description 4
- 101710114810 Glycoprotein Proteins 0.000 description 4
- OUUCIIJSBIBCHB-ZPFDUUQYSA-N Ile-Leu-Asp Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O OUUCIIJSBIBCHB-ZPFDUUQYSA-N 0.000 description 4
- GVKKVHNRTUFCCE-BJDJZHNGSA-N Ile-Leu-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)O)N GVKKVHNRTUFCCE-BJDJZHNGSA-N 0.000 description 4
- XLXPYSDGMXTTNQ-UHFFFAOYSA-N Ile-Phe-Leu Natural products CCC(C)C(N)C(=O)NC(C(=O)NC(CC(C)C)C(O)=O)CC1=CC=CC=C1 XLXPYSDGMXTTNQ-UHFFFAOYSA-N 0.000 description 4
- ZYVTXBXHIKGZMD-QSFUFRPTSA-N Ile-Val-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(=O)N)C(=O)O)N ZYVTXBXHIKGZMD-QSFUFRPTSA-N 0.000 description 4
- SENJXOPIZNYLHU-UHFFFAOYSA-N L-leucyl-L-arginine Natural products CC(C)CC(N)C(=O)NC(C(O)=O)CCCN=C(N)N SENJXOPIZNYLHU-UHFFFAOYSA-N 0.000 description 4
- KFKWRHQBZQICHA-STQMWFEESA-N L-leucyl-L-phenylalanine Natural products CC(C)C[C@H](N)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 KFKWRHQBZQICHA-STQMWFEESA-N 0.000 description 4
- POJPZSMTTMLSTG-SRVKXCTJSA-N Leu-Asn-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N POJPZSMTTMLSTG-SRVKXCTJSA-N 0.000 description 4
- DLFAACQHIRSQGG-CIUDSAMLSA-N Leu-Asp-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O DLFAACQHIRSQGG-CIUDSAMLSA-N 0.000 description 4
- CQGSYZCULZMEDE-UHFFFAOYSA-N Leu-Gln-Pro Natural products CC(C)CC(N)C(=O)NC(CCC(N)=O)C(=O)N1CCCC1C(O)=O CQGSYZCULZMEDE-UHFFFAOYSA-N 0.000 description 4
- HYMLKESRWLZDBR-WEDXCCLWSA-N Leu-Gly-Thr Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O HYMLKESRWLZDBR-WEDXCCLWSA-N 0.000 description 4
- UBZGNBKMIJHOHL-BZSNNMDCSA-N Leu-Leu-Phe Chemical compound CC(C)C[C@H]([NH3+])C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C([O-])=O)CC1=CC=CC=C1 UBZGNBKMIJHOHL-BZSNNMDCSA-N 0.000 description 4
- RTIRBWJPYJYTLO-MELADBBJSA-N Leu-Lys-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N1CCC[C@@H]1C(=O)O)N RTIRBWJPYJYTLO-MELADBBJSA-N 0.000 description 4
- FYPWFNKQVVEELI-ULQDDVLXSA-N Leu-Phe-Val Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C(C)C)C(O)=O)CC1=CC=CC=C1 FYPWFNKQVVEELI-ULQDDVLXSA-N 0.000 description 4
- SBANPBVRHYIMRR-UHFFFAOYSA-N Leu-Ser-Pro Natural products CC(C)CC(N)C(=O)NC(CO)C(=O)N1CCCC1C(O)=O SBANPBVRHYIMRR-UHFFFAOYSA-N 0.000 description 4
- AAKRWBIIGKPOKQ-ONGXEEELSA-N Leu-Val-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O AAKRWBIIGKPOKQ-ONGXEEELSA-N 0.000 description 4
- QESXLSQLQHHTIX-RHYQMDGZSA-N Leu-Val-Thr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O QESXLSQLQHHTIX-RHYQMDGZSA-N 0.000 description 4
- 239000006137 Luria-Bertani broth Substances 0.000 description 4
- RZHLIPMZXOEJTL-AVGNSLFASA-N Lys-Gln-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCCCN)N RZHLIPMZXOEJTL-AVGNSLFASA-N 0.000 description 4
- 101710145006 Lysis protein Proteins 0.000 description 4
- WUGMRIBZSVSJNP-UHFFFAOYSA-N N-L-alanyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)C)C(O)=O)=CNC2=C1 WUGMRIBZSVSJNP-UHFFFAOYSA-N 0.000 description 4
- 101150001779 ORF1a gene Proteins 0.000 description 4
- 108091034117 Oligonucleotide Proteins 0.000 description 4
- DOXQMJCSSYZSNM-BZSNNMDCSA-N Phe-Lys-Leu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O DOXQMJCSSYZSNM-BZSNNMDCSA-N 0.000 description 4
- RAGOJJCBGXARPO-XVSYOHENSA-N Phe-Thr-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H]([C@H](O)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 RAGOJJCBGXARPO-XVSYOHENSA-N 0.000 description 4
- IEIFEYBAYFSRBQ-IHRRRGAJSA-N Phe-Val-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N IEIFEYBAYFSRBQ-IHRRRGAJSA-N 0.000 description 4
- ZPPVJIJMIKTERM-YUMQZZPRSA-N Pro-Gln-Gly Chemical compound OC(=O)CNC(=O)[C@H](CCC(=O)N)NC(=O)[C@@H]1CCCN1 ZPPVJIJMIKTERM-YUMQZZPRSA-N 0.000 description 4
- 208000037847 SARS-CoV-2-infection Diseases 0.000 description 4
- MWMKFWJYRRGXOR-ZLUOBGJFSA-N Ser-Ala-Asn Chemical compound N[C@H](C(=O)N[C@H](C(=O)N[C@H](C(=O)O)CC(N)=O)C)CO MWMKFWJYRRGXOR-ZLUOBGJFSA-N 0.000 description 4
- YQHZVYJAGWMHES-ZLUOBGJFSA-N Ser-Ala-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O YQHZVYJAGWMHES-ZLUOBGJFSA-N 0.000 description 4
- WDXYVIIVDIDOSX-DCAQKATOSA-N Ser-Arg-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CO)CCCN=C(N)N WDXYVIIVDIDOSX-DCAQKATOSA-N 0.000 description 4
- KQNDIKOYWZTZIX-FXQIFTODSA-N Ser-Ser-Arg Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCNC(N)=N KQNDIKOYWZTZIX-FXQIFTODSA-N 0.000 description 4
- BMKNXTJLHFIAAH-CIUDSAMLSA-N Ser-Ser-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O BMKNXTJLHFIAAH-CIUDSAMLSA-N 0.000 description 4
- HSWXBJCBYSWBPT-GUBZILKMSA-N Ser-Val-Val Chemical compound CC(C)[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)CO)C(C)C)C(O)=O HSWXBJCBYSWBPT-GUBZILKMSA-N 0.000 description 4
- 101710167605 Spike glycoprotein Proteins 0.000 description 4
- 241000169093 Tacca Species 0.000 description 4
- DWYAUVCQDTZIJI-VZFHVOOUSA-N Thr-Ala-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O DWYAUVCQDTZIJI-VZFHVOOUSA-N 0.000 description 4
- DJDSEDOKJTZBAR-ZDLURKLDSA-N Thr-Gly-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O DJDSEDOKJTZBAR-ZDLURKLDSA-N 0.000 description 4
- WPVGRKLNHJJCEN-BZSNNMDCSA-N Tyr-Asp-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=C(O)C=C1 WPVGRKLNHJJCEN-BZSNNMDCSA-N 0.000 description 4
- OLYXUGBVBGSZDN-ACRUOGEOSA-N Tyr-Leu-Tyr Chemical compound C([C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=C(O)C=C1 OLYXUGBVBGSZDN-ACRUOGEOSA-N 0.000 description 4
- FMXFHNSFABRVFZ-BZSNNMDCSA-N Tyr-Lys-Leu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O FMXFHNSFABRVFZ-BZSNNMDCSA-N 0.000 description 4
- NWEGIYMHTZXVBP-JSGCOSHPSA-N Tyr-Val-Gly Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O NWEGIYMHTZXVBP-JSGCOSHPSA-N 0.000 description 4
- ISAKRJDGNUQOIC-UHFFFAOYSA-N Uracil Chemical compound O=C1C=CNC(=O)N1 ISAKRJDGNUQOIC-UHFFFAOYSA-N 0.000 description 4
- QPZMOUMNTGTEFR-ZKWXMUAHSA-N Val-Asn-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](C(C)C)N QPZMOUMNTGTEFR-ZKWXMUAHSA-N 0.000 description 4
- XLDYBRXERHITNH-QSFUFRPTSA-N Val-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)C(C)C XLDYBRXERHITNH-QSFUFRPTSA-N 0.000 description 4
- ZHQWPWQNVRCXAX-XQQFMLRXSA-N Val-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](C(C)C)N ZHQWPWQNVRCXAX-XQQFMLRXSA-N 0.000 description 4
- BTWMICVCQLKKNR-DCAQKATOSA-N Val-Leu-Ser Chemical compound CC(C)[C@H]([NH3+])C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C([O-])=O BTWMICVCQLKKNR-DCAQKATOSA-N 0.000 description 4
- JAKHAONCJJZVHT-DCAQKATOSA-N Val-Lys-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)O)N JAKHAONCJJZVHT-DCAQKATOSA-N 0.000 description 4
- QSPOLEBZTMESFY-SRVKXCTJSA-N Val-Pro-Val Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O QSPOLEBZTMESFY-SRVKXCTJSA-N 0.000 description 4
- GVNLOVJNNDZUHS-RHYQMDGZSA-N Val-Thr-Lys Chemical compound [H]N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(O)=O GVNLOVJNNDZUHS-RHYQMDGZSA-N 0.000 description 4
- AOILQMZPNLUXCM-AVGNSLFASA-N Val-Val-Lys Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCCN AOILQMZPNLUXCM-AVGNSLFASA-N 0.000 description 4
- 108010069020 alanyl-prolyl-glycine Proteins 0.000 description 4
- 108010070944 alanylhistidine Proteins 0.000 description 4
- 230000004075 alteration Effects 0.000 description 4
- 235000001014 amino acid Nutrition 0.000 description 4
- 108010018691 arginyl-threonyl-arginine Proteins 0.000 description 4
- 108010010430 asparagine-proline-alanine Proteins 0.000 description 4
- 108010077245 asparaginyl-proline Proteins 0.000 description 4
- 238000005119 centrifugation Methods 0.000 description 4
- 108010060199 cysteinylproline Proteins 0.000 description 4
- 108010074027 glycyl-seryl-phenylalanine Proteins 0.000 description 4
- 108010015792 glycyllysine Proteins 0.000 description 4
- 108010084389 glycyltryptophan Proteins 0.000 description 4
- 108010076756 leucyl-alanyl-phenylalanine Proteins 0.000 description 4
- 108010090333 leucyl-lysyl-proline Proteins 0.000 description 4
- 108010047926 leucyl-lysyl-tyrosine Proteins 0.000 description 4
- 108010044056 leucyl-phenylalanine Proteins 0.000 description 4
- 108010073472 leucyl-prolyl-proline Proteins 0.000 description 4
- 108010000761 leucylarginine Proteins 0.000 description 4
- XIXADJRWDQXREU-UHFFFAOYSA-M lithium acetate Chemical compound [Li+].CC([O-])=O XIXADJRWDQXREU-UHFFFAOYSA-M 0.000 description 4
- 238000011160 research Methods 0.000 description 4
- 108010048397 seryl-lysyl-leucine Proteins 0.000 description 4
- 238000010561 standard procedure Methods 0.000 description 4
- 239000000126 substance Substances 0.000 description 4
- RWQNBRDOKXIBIV-UHFFFAOYSA-N thymine Chemical compound CC1=CNC(=O)NC1=O RWQNBRDOKXIBIV-UHFFFAOYSA-N 0.000 description 4
- 238000013518 transcription Methods 0.000 description 4
- 230000035897 transcription Effects 0.000 description 4
- 108010020532 tyrosyl-proline Proteins 0.000 description 4
- GJLXVWOMRRWCIB-MERZOTPQSA-N (2S)-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-acetamido-5-(diaminomethylideneamino)pentanoyl]amino]-3-(4-hydroxyphenyl)propanoyl]amino]-3-(4-hydroxyphenyl)propanoyl]amino]-5-(diaminomethylideneamino)pentanoyl]amino]-3-(1H-indol-3-yl)propanoyl]amino]-6-aminohexanoyl]amino]-6-aminohexanoyl]amino]-6-aminohexanoyl]amino]-6-aminohexanoyl]amino]-6-aminohexanoyl]amino]-6-aminohexanoyl]amino]-6-aminohexanamide Chemical compound C([C@H](NC(=O)[C@H](CCCN=C(N)N)NC(=O)C)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(N)=O)C1=CC=C(O)C=C1 GJLXVWOMRRWCIB-MERZOTPQSA-N 0.000 description 3
- QKNYBSVHEMOAJP-UHFFFAOYSA-N 2-amino-2-(hydroxymethyl)propane-1,3-diol;hydron;chloride Chemical compound Cl.OCC(N)(CO)CO QKNYBSVHEMOAJP-UHFFFAOYSA-N 0.000 description 3
- 108020005345 3' Untranslated Regions Proteins 0.000 description 3
- 108020003589 5' Untranslated Regions Proteins 0.000 description 3
- AAQGRPOPTAUUBM-ZLUOBGJFSA-N Ala-Ala-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(O)=O AAQGRPOPTAUUBM-ZLUOBGJFSA-N 0.000 description 3
- DKJPOZOEBONHFS-ZLUOBGJFSA-N Ala-Ala-Asp Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC(O)=O DKJPOZOEBONHFS-ZLUOBGJFSA-N 0.000 description 3
- YLTKNGYYPIWKHZ-ACZMJKKPSA-N Ala-Ala-Glu Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCC(O)=O YLTKNGYYPIWKHZ-ACZMJKKPSA-N 0.000 description 3
- RLMISHABBKUNFO-WHFBIAKZSA-N Ala-Ala-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O RLMISHABBKUNFO-WHFBIAKZSA-N 0.000 description 3
- ZIBWKCRKNFYTPT-ZKWXMUAHSA-N Ala-Asn-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O ZIBWKCRKNFYTPT-ZKWXMUAHSA-N 0.000 description 3
- NHCPCLJZRSIDHS-ZLUOBGJFSA-N Ala-Asp-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O NHCPCLJZRSIDHS-ZLUOBGJFSA-N 0.000 description 3
- BUDNAJYVCUHLSV-ZLUOBGJFSA-N Ala-Asp-Ser Chemical compound C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O BUDNAJYVCUHLSV-ZLUOBGJFSA-N 0.000 description 3
- BTYTYHBSJKQBQA-GCJQMDKQSA-N Ala-Asp-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](C)N)O BTYTYHBSJKQBQA-GCJQMDKQSA-N 0.000 description 3
- ZVFVBBGVOILKPO-WHFBIAKZSA-N Ala-Gly-Ala Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@@H](C)C(O)=O ZVFVBBGVOILKPO-WHFBIAKZSA-N 0.000 description 3
- TZDNWXDLYFIFPT-BJDJZHNGSA-N Ala-Ile-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O TZDNWXDLYFIFPT-BJDJZHNGSA-N 0.000 description 3
- RZZMZYZXNJRPOJ-BJDJZHNGSA-N Ala-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](C)N RZZMZYZXNJRPOJ-BJDJZHNGSA-N 0.000 description 3
- LXAARTARZJJCMB-CIQUZCHMSA-N Ala-Ile-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LXAARTARZJJCMB-CIQUZCHMSA-N 0.000 description 3
- IPZQNYYAYVRKKK-FXQIFTODSA-N Ala-Pro-Ala Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O IPZQNYYAYVRKKK-FXQIFTODSA-N 0.000 description 3
- IYKVSFNGSWTTNZ-GUBZILKMSA-N Ala-Val-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N IYKVSFNGSWTTNZ-GUBZILKMSA-N 0.000 description 3
- SGYSTDWPNPKJPP-GUBZILKMSA-N Arg-Ala-Arg Chemical compound NC(=N)NCCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O SGYSTDWPNPKJPP-GUBZILKMSA-N 0.000 description 3
- OTOXOKCIIQLMFH-KZVJFYERSA-N Arg-Ala-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCN=C(N)N OTOXOKCIIQLMFH-KZVJFYERSA-N 0.000 description 3
- OBFTYSPXDRROQO-SRVKXCTJSA-N Arg-Gln-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CCCN=C(N)N OBFTYSPXDRROQO-SRVKXCTJSA-N 0.000 description 3
- GMFAGHNRXPSSJS-SRVKXCTJSA-N Arg-Leu-Gln Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O GMFAGHNRXPSSJS-SRVKXCTJSA-N 0.000 description 3
- IIAXFBUTKIDDIP-ULQDDVLXSA-N Arg-Leu-Phe Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O IIAXFBUTKIDDIP-ULQDDVLXSA-N 0.000 description 3
- COXMUHNBYCVVRG-DCAQKATOSA-N Arg-Leu-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O COXMUHNBYCVVRG-DCAQKATOSA-N 0.000 description 3
- BECXEHHOZNFFFX-IHRRRGAJSA-N Arg-Ser-Tyr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O BECXEHHOZNFFFX-IHRRRGAJSA-N 0.000 description 3
- MOGMYRUNTKYZFB-UNQGMJICSA-N Arg-Thr-Phe Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 MOGMYRUNTKYZFB-UNQGMJICSA-N 0.000 description 3
- UGXVKHRDGLYFKR-CIUDSAMLSA-N Asn-Asp-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC(N)=O UGXVKHRDGLYFKR-CIUDSAMLSA-N 0.000 description 3
- QPTAGIPWARILES-AVGNSLFASA-N Asn-Gln-Phe Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O QPTAGIPWARILES-AVGNSLFASA-N 0.000 description 3
- HYQYLOSCICEYTR-YUMQZZPRSA-N Asn-Gly-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O HYQYLOSCICEYTR-YUMQZZPRSA-N 0.000 description 3
- OOWSBIOUKIUWLO-RCOVLWMOSA-N Asn-Gly-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O OOWSBIOUKIUWLO-RCOVLWMOSA-N 0.000 description 3
- RVHGJNGNKGDCPX-KKUMJFAQSA-N Asn-Phe-Lys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)N)N RVHGJNGNKGDCPX-KKUMJFAQSA-N 0.000 description 3
- PLTGTJAZQRGMPP-FXQIFTODSA-N Asn-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CC(N)=O PLTGTJAZQRGMPP-FXQIFTODSA-N 0.000 description 3
- SNYCNNPOFYBCEK-ZLUOBGJFSA-N Asn-Ser-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O SNYCNNPOFYBCEK-ZLUOBGJFSA-N 0.000 description 3
- NCXTYSVDWLAQGZ-ZKWXMUAHSA-N Asn-Ser-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC(N)=O NCXTYSVDWLAQGZ-ZKWXMUAHSA-N 0.000 description 3
- JBDLMLZNDRLDIX-HJGDQZAQSA-N Asn-Thr-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O JBDLMLZNDRLDIX-HJGDQZAQSA-N 0.000 description 3
- CBWCQCANJSGUOH-ZKWXMUAHSA-N Asn-Val-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O CBWCQCANJSGUOH-ZKWXMUAHSA-N 0.000 description 3
- GHWWTICYPDKPTE-NGZCFLSTSA-N Asn-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)N)N GHWWTICYPDKPTE-NGZCFLSTSA-N 0.000 description 3
- PQKSVQSMTHPRIB-ZKWXMUAHSA-N Asn-Val-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O PQKSVQSMTHPRIB-ZKWXMUAHSA-N 0.000 description 3
- SVFOIXMRMLROHO-SRVKXCTJSA-N Asp-Asp-Phe Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 SVFOIXMRMLROHO-SRVKXCTJSA-N 0.000 description 3
- PXLNPFOJZQMXAT-BYULHYEWSA-N Asp-Asp-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC(O)=O PXLNPFOJZQMXAT-BYULHYEWSA-N 0.000 description 3
- JNNVNVRBYUJYGS-CIUDSAMLSA-N Asp-Leu-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O JNNVNVRBYUJYGS-CIUDSAMLSA-N 0.000 description 3
- IVPNEDNYYYFAGI-GARJFASQSA-N Asp-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)O)N IVPNEDNYYYFAGI-GARJFASQSA-N 0.000 description 3
- MYLZFUMPZCPJCJ-NHCYSSNCSA-N Asp-Lys-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O MYLZFUMPZCPJCJ-NHCYSSNCSA-N 0.000 description 3
- QSFHZPQUAAQHAQ-CIUDSAMLSA-N Asp-Ser-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O QSFHZPQUAAQHAQ-CIUDSAMLSA-N 0.000 description 3
- JSHWXQIZOCVWIA-ZKWXMUAHSA-N Asp-Ser-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O JSHWXQIZOCVWIA-ZKWXMUAHSA-N 0.000 description 3
- BYLPQJAWXJWUCJ-YDHLFZDLSA-N Asp-Tyr-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(O)=O BYLPQJAWXJWUCJ-YDHLFZDLSA-N 0.000 description 3
- PLOKOIJSGCISHE-BYULHYEWSA-N Asp-Val-Asn Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O PLOKOIJSGCISHE-BYULHYEWSA-N 0.000 description 3
- ZUNMTUPRQMWMHX-LSJOCFKGSA-N Asp-Val-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C(C)C)C(O)=O ZUNMTUPRQMWMHX-LSJOCFKGSA-N 0.000 description 3
- 101710117545 C protein Proteins 0.000 description 3
- 108091026890 Coding region Proteins 0.000 description 3
- OIMUAKUQOUEPCZ-WHFBIAKZSA-N Cys-Asn-Gly Chemical compound SC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O OIMUAKUQOUEPCZ-WHFBIAKZSA-N 0.000 description 3
- YYLBXQJGWOQZOU-IHRRRGAJSA-N Cys-Phe-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](CS)N YYLBXQJGWOQZOU-IHRRRGAJSA-N 0.000 description 3
- LKHMGNHQULEPFY-ACZMJKKPSA-N Cys-Ser-Glu Chemical compound SC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O LKHMGNHQULEPFY-ACZMJKKPSA-N 0.000 description 3
- 230000006820 DNA synthesis Effects 0.000 description 3
- KCXVZYZYPLLWCC-UHFFFAOYSA-N EDTA Chemical compound OC(=O)CN(CC(O)=O)CCN(CC(O)=O)CC(O)=O KCXVZYZYPLLWCC-UHFFFAOYSA-N 0.000 description 3
- 241000588724 Escherichia coli Species 0.000 description 3
- PNENQZWRFMUZOM-DCAQKATOSA-N Gln-Glu-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O PNENQZWRFMUZOM-DCAQKATOSA-N 0.000 description 3
- LPIKVBWNNVFHCQ-GUBZILKMSA-N Gln-Ser-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O LPIKVBWNNVFHCQ-GUBZILKMSA-N 0.000 description 3
- YRHZWVKUFWCEPW-GLLZPBPUSA-N Gln-Thr-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CCC(=O)N)N)O YRHZWVKUFWCEPW-GLLZPBPUSA-N 0.000 description 3
- HLRLXVPRJJITSK-IFFSRLJSSA-N Gln-Thr-Val Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O HLRLXVPRJJITSK-IFFSRLJSSA-N 0.000 description 3
- DIXKFOPPGWKZLY-CIUDSAMLSA-N Glu-Arg-Asp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O DIXKFOPPGWKZLY-CIUDSAMLSA-N 0.000 description 3
- QXDXIXFSFHUYAX-MNXVOIDGSA-N Glu-Ile-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCC(O)=O QXDXIXFSFHUYAX-MNXVOIDGSA-N 0.000 description 3
- IVGJYOOGJLFKQE-AVGNSLFASA-N Glu-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)O)N IVGJYOOGJLFKQE-AVGNSLFASA-N 0.000 description 3
- SWDNPSMMEWRNOH-HJGDQZAQSA-N Glu-Pro-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O SWDNPSMMEWRNOH-HJGDQZAQSA-N 0.000 description 3
- MLILEEIVMRUYBX-NHCYSSNCSA-N Glu-Val-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O MLILEEIVMRUYBX-NHCYSSNCSA-N 0.000 description 3
- RLFSBAPJTYKSLG-WHFBIAKZSA-N Gly-Ala-Asp Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(O)=O RLFSBAPJTYKSLG-WHFBIAKZSA-N 0.000 description 3
- UPOJUWHGMDJUQZ-IUCAKERBSA-N Gly-Arg-Arg Chemical compound NC(=N)NCCC[C@H](NC(=O)CN)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O UPOJUWHGMDJUQZ-IUCAKERBSA-N 0.000 description 3
- LURCIJSJAKFCRO-QWRGUYRKSA-N Gly-Asn-Tyr Chemical compound [H]NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O LURCIJSJAKFCRO-QWRGUYRKSA-N 0.000 description 3
- TZOVVRJYUDETQG-RCOVLWMOSA-N Gly-Asp-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)CN TZOVVRJYUDETQG-RCOVLWMOSA-N 0.000 description 3
- CCQOOWAONKGYKQ-BYPYZUCNSA-N Gly-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)CN CCQOOWAONKGYKQ-BYPYZUCNSA-N 0.000 description 3
- HMHRTKOWRUPPNU-RCOVLWMOSA-N Gly-Ile-Gly Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O HMHRTKOWRUPPNU-RCOVLWMOSA-N 0.000 description 3
- PDUHNKAFQXQNLH-ZETCQYMHSA-N Gly-Lys-Gly Chemical compound NCCCC[C@H](NC(=O)CN)C(=O)NCC(O)=O PDUHNKAFQXQNLH-ZETCQYMHSA-N 0.000 description 3
- LBDXVCBAJJNJNN-WHFBIAKZSA-N Gly-Ser-Cys Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@@H](CS)C(O)=O LBDXVCBAJJNJNN-WHFBIAKZSA-N 0.000 description 3
- HUFUVTYGPOUCBN-MBLNEYKQSA-N Gly-Thr-Ile Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O HUFUVTYGPOUCBN-MBLNEYKQSA-N 0.000 description 3
- FNXSYBOHALPRHV-ONGXEEELSA-N Gly-Val-Lys Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCCN FNXSYBOHALPRHV-ONGXEEELSA-N 0.000 description 3
- IZVICCORZOSGPT-JSGCOSHPSA-N Gly-Val-Tyr Chemical compound [H]NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O IZVICCORZOSGPT-JSGCOSHPSA-N 0.000 description 3
- QICVAHODWHIWIS-HTFCKZLJSA-N Ile-Ala-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N QICVAHODWHIWIS-HTFCKZLJSA-N 0.000 description 3
- CDGLBYSAZFIIJO-RCOVLWMOSA-N Ile-Gly-Gly Chemical compound CC[C@H](C)[C@H]([NH3+])C(=O)NCC(=O)NCC([O-])=O CDGLBYSAZFIIJO-RCOVLWMOSA-N 0.000 description 3
- PHRWFSFCNJPWRO-PPCPHDFISA-N Ile-Leu-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N PHRWFSFCNJPWRO-PPCPHDFISA-N 0.000 description 3
- RMNMUUCYTMLWNA-ZPFDUUQYSA-N Ile-Lys-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(=O)O)C(=O)O)N RMNMUUCYTMLWNA-ZPFDUUQYSA-N 0.000 description 3
- PARSHQDZROHERM-NHCYSSNCSA-N Ile-Lys-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)NCC(=O)O)N PARSHQDZROHERM-NHCYSSNCSA-N 0.000 description 3
- YCKPUHHMCFSUMD-IUKAMOBKSA-N Ile-Thr-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N YCKPUHHMCFSUMD-IUKAMOBKSA-N 0.000 description 3
- HGCNKOLVKRAVHD-UHFFFAOYSA-N L-Met-L-Phe Natural products CSCCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 HGCNKOLVKRAVHD-UHFFFAOYSA-N 0.000 description 3
- TYYLDKGBCJGJGW-UHFFFAOYSA-N L-tryptophan-L-tyrosine Natural products C=1NC2=CC=CC=C2C=1CC(N)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 TYYLDKGBCJGJGW-UHFFFAOYSA-N 0.000 description 3
- XIRYQRLFHWWWTC-QEJZJMRPSA-N Leu-Ala-Phe Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 XIRYQRLFHWWWTC-QEJZJMRPSA-N 0.000 description 3
- ULXYQAJWJGLCNR-YUMQZZPRSA-N Leu-Asp-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O ULXYQAJWJGLCNR-YUMQZZPRSA-N 0.000 description 3
- GLBNEGIOFRVRHO-JYJNAYRXSA-N Leu-Gln-Phe Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O GLBNEGIOFRVRHO-JYJNAYRXSA-N 0.000 description 3
- HFBCHNRFRYLZNV-GUBZILKMSA-N Leu-Glu-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O HFBCHNRFRYLZNV-GUBZILKMSA-N 0.000 description 3
- WQWSMEOYXJTFRU-GUBZILKMSA-N Leu-Glu-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O WQWSMEOYXJTFRU-GUBZILKMSA-N 0.000 description 3
- OXRLYTYUXAQTHP-YUMQZZPRSA-N Leu-Gly-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)NCC(=O)N[C@@H](C)C(O)=O OXRLYTYUXAQTHP-YUMQZZPRSA-N 0.000 description 3
- APFJUBGRZGMQFF-QWRGUYRKSA-N Leu-Gly-Lys Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCCN APFJUBGRZGMQFF-QWRGUYRKSA-N 0.000 description 3
- QJXHMYMRGDOHRU-NHCYSSNCSA-N Leu-Ile-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O QJXHMYMRGDOHRU-NHCYSSNCSA-N 0.000 description 3
- HNDWYLYAYNBWMP-AJNGGQMLSA-N Leu-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(C)C)N HNDWYLYAYNBWMP-AJNGGQMLSA-N 0.000 description 3
- LIINDKYIGYTDLG-PPCPHDFISA-N Leu-Ile-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LIINDKYIGYTDLG-PPCPHDFISA-N 0.000 description 3
- QNBVTHNJGCOVFA-AVGNSLFASA-N Leu-Leu-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCC(O)=O QNBVTHNJGCOVFA-AVGNSLFASA-N 0.000 description 3
- LXKNSJLSGPNHSK-KKUMJFAQSA-N Leu-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)O)N LXKNSJLSGPNHSK-KKUMJFAQSA-N 0.000 description 3
- KPYAOIVPJKPIOU-KKUMJFAQSA-N Leu-Lys-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(O)=O KPYAOIVPJKPIOU-KKUMJFAQSA-N 0.000 description 3
- YRRCOJOXAJNSAX-IHRRRGAJSA-N Leu-Pro-Lys Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)O)N YRRCOJOXAJNSAX-IHRRRGAJSA-N 0.000 description 3
- XXXXOVFBXRERQL-ULQDDVLXSA-N Leu-Pro-Phe Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 XXXXOVFBXRERQL-ULQDDVLXSA-N 0.000 description 3
- BRTVHXHCUSXYRI-CIUDSAMLSA-N Leu-Ser-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O BRTVHXHCUSXYRI-CIUDSAMLSA-N 0.000 description 3
- FPFOYSCDUWTZBF-IHPCNDPISA-N Leu-Trp-Leu Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H]([NH3+])CC(C)C)C(=O)N[C@@H](CC(C)C)C([O-])=O)=CNC2=C1 FPFOYSCDUWTZBF-IHPCNDPISA-N 0.000 description 3
- VHTIZYYHIUHMCA-JYJNAYRXSA-N Leu-Tyr-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O VHTIZYYHIUHMCA-JYJNAYRXSA-N 0.000 description 3
- JGKHAFUAPZCCDU-BZSNNMDCSA-N Leu-Tyr-Leu Chemical compound CC(C)C[C@H]([NH3+])C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C([O-])=O)CC1=CC=C(O)C=C1 JGKHAFUAPZCCDU-BZSNNMDCSA-N 0.000 description 3
- FBNPMTNBFFAMMH-AVGNSLFASA-N Leu-Val-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N FBNPMTNBFFAMMH-AVGNSLFASA-N 0.000 description 3
- VKVDRTGWLVZJOM-DCAQKATOSA-N Leu-Val-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O VKVDRTGWLVZJOM-DCAQKATOSA-N 0.000 description 3
- VHXMZJGOKIMETG-CQDKDKBSSA-N Lys-Ala-Tyr Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)[C@H](CCCCN)N VHXMZJGOKIMETG-CQDKDKBSSA-N 0.000 description 3
- OPTCSTACHGNULU-DCAQKATOSA-N Lys-Cys-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CS)NC(=O)[C@@H](N)CCCCN OPTCSTACHGNULU-DCAQKATOSA-N 0.000 description 3
- HWMZUBUEOYAQSC-DCAQKATOSA-N Lys-Gln-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O HWMZUBUEOYAQSC-DCAQKATOSA-N 0.000 description 3
- NDORZBUHCOJQDO-GVXVVHGQSA-N Lys-Gln-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O NDORZBUHCOJQDO-GVXVVHGQSA-N 0.000 description 3
- LPAJOCKCPRZEAG-MNXVOIDGSA-N Lys-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCCCN LPAJOCKCPRZEAG-MNXVOIDGSA-N 0.000 description 3
- DKTNGXVSCZULPO-YUMQZZPRSA-N Lys-Gly-Cys Chemical compound NCCCC[C@H](N)C(=O)NCC(=O)N[C@@H](CS)C(O)=O DKTNGXVSCZULPO-YUMQZZPRSA-N 0.000 description 3
- LJADEBULDNKJNK-IHRRRGAJSA-N Lys-Leu-Val Chemical compound CC(C)C[C@H](NC(=O)[C@@H](N)CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O LJADEBULDNKJNK-IHRRRGAJSA-N 0.000 description 3
- ALGGDNMLQNFVIZ-SRVKXCTJSA-N Lys-Lys-Asp Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(=O)O)C(=O)O)N ALGGDNMLQNFVIZ-SRVKXCTJSA-N 0.000 description 3
- JYVCOTWSRGFABJ-DCAQKATOSA-N Lys-Met-Ser Chemical compound CSCC[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CCCCN)N JYVCOTWSRGFABJ-DCAQKATOSA-N 0.000 description 3
- ODTZHNZPINULEU-KKUMJFAQSA-N Lys-Phe-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCCCN)N ODTZHNZPINULEU-KKUMJFAQSA-N 0.000 description 3
- GHKXHCMRAUYLBS-CIUDSAMLSA-N Lys-Ser-Asn Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O GHKXHCMRAUYLBS-CIUDSAMLSA-N 0.000 description 3
- QFSYGUMEANRNJE-DCAQKATOSA-N Lys-Val-Cys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCCCN)N QFSYGUMEANRNJE-DCAQKATOSA-N 0.000 description 3
- RPWQJSBMXJSCPD-XUXIUFHCSA-N Lys-Val-Ile Chemical compound CC[C@H](C)[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)CCCCN)C(C)C)C(O)=O RPWQJSBMXJSCPD-XUXIUFHCSA-N 0.000 description 3
- JACAKCWAOHKQBV-UWVGGRQHSA-N Met-Gly-Lys Chemical compound CSCC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCCN JACAKCWAOHKQBV-UWVGGRQHSA-N 0.000 description 3
- LBSWWNKMVPAXOI-GUBZILKMSA-N Met-Val-Ser Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O LBSWWNKMVPAXOI-GUBZILKMSA-N 0.000 description 3
- AUEJLPRZGVVDNU-UHFFFAOYSA-N N-L-tyrosyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CC1=CC=C(O)C=C1 AUEJLPRZGVVDNU-UHFFFAOYSA-N 0.000 description 3
- 108010002311 N-glycylglutamic acid Proteins 0.000 description 3
- 108010047562 NGR peptide Proteins 0.000 description 3
- 101100342977 Neurospora crassa (strain ATCC 24698 / 74-OR23-1A / CBS 708.71 / DSM 1257 / FGSC 987) leu-1 gene Proteins 0.000 description 3
- LBSARGIQACMGDF-WBAXXEDZSA-N Phe-Ala-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 LBSARGIQACMGDF-WBAXXEDZSA-N 0.000 description 3
- PSBJZLMFFTULDX-IXOXFDKPSA-N Phe-Cys-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CC1=CC=CC=C1)N)O PSBJZLMFFTULDX-IXOXFDKPSA-N 0.000 description 3
- NAXPHWZXEXNDIW-JTQLQIEISA-N Phe-Gly-Gly Chemical compound OC(=O)CNC(=O)CNC(=O)[C@@H](N)CC1=CC=CC=C1 NAXPHWZXEXNDIW-JTQLQIEISA-N 0.000 description 3
- MSHZERMPZKCODG-ACRUOGEOSA-N Phe-Leu-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 MSHZERMPZKCODG-ACRUOGEOSA-N 0.000 description 3
- JSGWNFKWZNPDAV-YDHLFZDLSA-N Phe-Val-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 JSGWNFKWZNPDAV-YDHLFZDLSA-N 0.000 description 3
- 108010076039 Polyproteins Proteins 0.000 description 3
- ILMLVTGTUJPQFP-FXQIFTODSA-N Pro-Asp-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O ILMLVTGTUJPQFP-FXQIFTODSA-N 0.000 description 3
- TUYWCHPXKQTISF-LPEHRKFASA-N Pro-Cys-Pro Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CS)C(=O)N2CCC[C@@H]2C(=O)O TUYWCHPXKQTISF-LPEHRKFASA-N 0.000 description 3
- RCYUBVHMVUHEBM-RCWTZXSCSA-N Pro-Pro-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O RCYUBVHMVUHEBM-RCWTZXSCSA-N 0.000 description 3
- GMJDSFYVTAMIBF-FXQIFTODSA-N Pro-Ser-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O GMJDSFYVTAMIBF-FXQIFTODSA-N 0.000 description 3
- SXJOPONICMGFCR-DCAQKATOSA-N Pro-Ser-Lys Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)O SXJOPONICMGFCR-DCAQKATOSA-N 0.000 description 3
- 229940096437 Protein S Drugs 0.000 description 3
- ZUGXSSFMTXKHJS-ZLUOBGJFSA-N Ser-Ala-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O ZUGXSSFMTXKHJS-ZLUOBGJFSA-N 0.000 description 3
- WTWGOQRNRFHFQD-JBDRJPRFSA-N Ser-Ala-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WTWGOQRNRFHFQD-JBDRJPRFSA-N 0.000 description 3
- QVOGDCQNGLBNCR-FXQIFTODSA-N Ser-Arg-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O QVOGDCQNGLBNCR-FXQIFTODSA-N 0.000 description 3
- BCKYYTVFBXHPOG-ACZMJKKPSA-N Ser-Asn-Gln Chemical compound C(CC(=O)N)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CO)N BCKYYTVFBXHPOG-ACZMJKKPSA-N 0.000 description 3
- VAUMZJHYZQXZBQ-WHFBIAKZSA-N Ser-Asn-Gly Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O VAUMZJHYZQXZBQ-WHFBIAKZSA-N 0.000 description 3
- SNNSYBWPPVAXQW-ZLUOBGJFSA-N Ser-Cys-Cys Chemical compound C([C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CS)C(=O)O)N)O SNNSYBWPPVAXQW-ZLUOBGJFSA-N 0.000 description 3
- UBRMZSHOOIVJPW-SRVKXCTJSA-N Ser-Leu-Lys Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(O)=O UBRMZSHOOIVJPW-SRVKXCTJSA-N 0.000 description 3
- JWOBLHJRDADHLN-KKUMJFAQSA-N Ser-Leu-Tyr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O JWOBLHJRDADHLN-KKUMJFAQSA-N 0.000 description 3
- NNFMANHDYSVNIO-DCAQKATOSA-N Ser-Lys-Arg Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O NNFMANHDYSVNIO-DCAQKATOSA-N 0.000 description 3
- RXSWQCATLWVDLI-XGEHTFHBSA-N Ser-Met-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O RXSWQCATLWVDLI-XGEHTFHBSA-N 0.000 description 3
- BUYHXYIUQUBEQP-AVGNSLFASA-N Ser-Phe-Glu Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CO)N BUYHXYIUQUBEQP-AVGNSLFASA-N 0.000 description 3
- XVWDJUROVRQKAE-KKUMJFAQSA-N Ser-Phe-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CO)CC1=CC=CC=C1 XVWDJUROVRQKAE-KKUMJFAQSA-N 0.000 description 3
- OSFZCEQJLWCIBG-BZSNNMDCSA-N Ser-Tyr-Tyr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O OSFZCEQJLWCIBG-BZSNNMDCSA-N 0.000 description 3
- 101710172711 Structural protein Proteins 0.000 description 3
- CAJFZCICSVBOJK-SHGPDSBTSA-N Thr-Ala-Thr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O CAJFZCICSVBOJK-SHGPDSBTSA-N 0.000 description 3
- TZKPNGDGUVREEB-FOHZUACHSA-N Thr-Asn-Gly Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O TZKPNGDGUVREEB-FOHZUACHSA-N 0.000 description 3
- OJRNZRROAIAHDL-LKXGYXEUSA-N Thr-Asn-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O OJRNZRROAIAHDL-LKXGYXEUSA-N 0.000 description 3
- PZVGOVRNGKEFCB-KKHAAJSZSA-N Thr-Asn-Val Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](C(C)C)C(=O)O)N)O PZVGOVRNGKEFCB-KKHAAJSZSA-N 0.000 description 3
- NCXVJIQMWSGRHY-KXNHARMFSA-N Thr-Leu-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N)O NCXVJIQMWSGRHY-KXNHARMFSA-N 0.000 description 3
- SGAOHNPSEPVAFP-ZDLURKLDSA-N Thr-Ser-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)NCC(O)=O SGAOHNPSEPVAFP-ZDLURKLDSA-N 0.000 description 3
- BBPCSGKKPJUYRB-UVOCVTCTSA-N Thr-Thr-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O BBPCSGKKPJUYRB-UVOCVTCTSA-N 0.000 description 3
- KPMIQCXJDVKWKO-IFFSRLJSSA-N Thr-Val-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O KPMIQCXJDVKWKO-IFFSRLJSSA-N 0.000 description 3
- AZGZDDNKFFUDEH-QWRGUYRKSA-N Tyr-Gly-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC1=CC=C(O)C=C1 AZGZDDNKFFUDEH-QWRGUYRKSA-N 0.000 description 3
- MQGGXGKQSVEQHR-KKUMJFAQSA-N Tyr-Ser-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 MQGGXGKQSVEQHR-KKUMJFAQSA-N 0.000 description 3
- TYFLVOUZHQUBGM-IHRRRGAJSA-N Tyr-Ser-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 TYFLVOUZHQUBGM-IHRRRGAJSA-N 0.000 description 3
- 108091023045 Untranslated Region Proteins 0.000 description 3
- ISERLACIZUGCDX-ZKWXMUAHSA-N Val-Asp-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](C(C)C)N ISERLACIZUGCDX-ZKWXMUAHSA-N 0.000 description 3
- HZYOWMGWKKRMBZ-BYULHYEWSA-N Val-Asp-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N HZYOWMGWKKRMBZ-BYULHYEWSA-N 0.000 description 3
- HHSILIQTHXABKM-YDHLFZDLSA-N Val-Asp-Phe Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](Cc1ccccc1)C(O)=O HHSILIQTHXABKM-YDHLFZDLSA-N 0.000 description 3
- XGJLNBNZNMVJRS-NRPADANISA-N Val-Glu-Ala Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O XGJLNBNZNMVJRS-NRPADANISA-N 0.000 description 3
- AGXGCFSECFQMKB-NHCYSSNCSA-N Val-Leu-Asp Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N AGXGCFSECFQMKB-NHCYSSNCSA-N 0.000 description 3
- ZRSZTKTVPNSUNA-IHRRRGAJSA-N Val-Lys-Leu Chemical compound CC(C)C[C@H](NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)C(C)C)C(O)=O ZRSZTKTVPNSUNA-IHRRRGAJSA-N 0.000 description 3
- VNGKMNPAENRGDC-JYJNAYRXSA-N Val-Phe-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C(C)C)CC1=CC=CC=C1 VNGKMNPAENRGDC-JYJNAYRXSA-N 0.000 description 3
- HJSLDXZAZGFPDK-ULQDDVLXSA-N Val-Phe-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](C(C)C)N HJSLDXZAZGFPDK-ULQDDVLXSA-N 0.000 description 3
- ZXYPHBKIZLAQTL-QXEWZRGKSA-N Val-Pro-Asp Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(=O)O)C(=O)O)N ZXYPHBKIZLAQTL-QXEWZRGKSA-N 0.000 description 3
- NHXZRXLFOBFMDM-AVGNSLFASA-N Val-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)C(C)C NHXZRXLFOBFMDM-AVGNSLFASA-N 0.000 description 3
- PZTZYZUTCPZWJH-FXQIFTODSA-N Val-Ser-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)O)N PZTZYZUTCPZWJH-FXQIFTODSA-N 0.000 description 3
- LCHZBEUVGAVMKS-RHYQMDGZSA-N Val-Thr-Leu Chemical compound CC(C)C[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)C(C)C)[C@@H](C)O)C(O)=O LCHZBEUVGAVMKS-RHYQMDGZSA-N 0.000 description 3
- NLNCNKIVJPEFBC-DLOVCJGASA-N Val-Val-Glu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCC(O)=O NLNCNKIVJPEFBC-DLOVCJGASA-N 0.000 description 3
- LLJLBRRXKZTTRD-GUBZILKMSA-N Val-Val-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(=O)O)N LLJLBRRXKZTTRD-GUBZILKMSA-N 0.000 description 3
- 108010067390 Viral Proteins Proteins 0.000 description 3
- JLCPHMBAVCMARE-UHFFFAOYSA-N [3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-hydroxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methyl [5-(6-aminopurin-9-yl)-2-(hydroxymethyl)oxolan-3-yl] hydrogen phosphate Polymers Cc1cn(C2CC(OP(O)(=O)OCC3OC(CC3OP(O)(=O)OCC3OC(CC3O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c3nc(N)[nH]c4=O)C(COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3CO)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cc(C)c(=O)[nH]c3=O)n3cc(C)c(=O)[nH]c3=O)n3ccc(N)nc3=O)n3cc(C)c(=O)[nH]c3=O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)O2)c(=O)[nH]c1=O JLCPHMBAVCMARE-UHFFFAOYSA-N 0.000 description 3
- 238000000246 agarose gel electrophoresis Methods 0.000 description 3
- 125000000539 amino acid group Chemical group 0.000 description 3
- 150000001413 amino acids Chemical class 0.000 description 3
- 108010072041 arginyl-glycyl-aspartic acid Proteins 0.000 description 3
- 108010091092 arginyl-glycyl-proline Proteins 0.000 description 3
- 108010059459 arginyl-threonyl-phenylalanine Proteins 0.000 description 3
- 108010093581 aspartyl-proline Proteins 0.000 description 3
- 239000003153 chemical reaction reagent Substances 0.000 description 3
- 239000003795 chemical substances by application Substances 0.000 description 3
- 238000004587 chromatography analysis Methods 0.000 description 3
- 238000013461 design Methods 0.000 description 3
- 108010054812 diprotin A Proteins 0.000 description 3
- 230000006870 function Effects 0.000 description 3
- 239000000499 gel Substances 0.000 description 3
- 230000002068 genetic effect Effects 0.000 description 3
- 108010090037 glycyl-alanyl-isoleucine Proteins 0.000 description 3
- 108010000434 glycyl-alanyl-leucine Proteins 0.000 description 3
- 108010072405 glycyl-aspartyl-glycine Proteins 0.000 description 3
- XKUKSGPZAADMRA-UHFFFAOYSA-N glycyl-glycyl-glycine Chemical compound NCC(=O)NCC(=O)NCC(O)=O XKUKSGPZAADMRA-UHFFFAOYSA-N 0.000 description 3
- 108010078326 glycyl-glycyl-valine Proteins 0.000 description 3
- 108010020688 glycylhistidine Proteins 0.000 description 3
- 108010077515 glycylproline Proteins 0.000 description 3
- 108010040030 histidinoalanine Proteins 0.000 description 3
- 208000015181 infectious disease Diseases 0.000 description 3
- 108010053037 kyotorphin Proteins 0.000 description 3
- 108010044311 leucyl-glycyl-glycine Proteins 0.000 description 3
- 239000002609 medium Substances 0.000 description 3
- 108010056582 methionylglutamic acid Proteins 0.000 description 3
- 108010012581 phenylalanylglutamate Proteins 0.000 description 3
- 108010073025 phenylalanylphenylalanine Proteins 0.000 description 3
- 108010083476 phenylalanyltryptophan Proteins 0.000 description 3
- 229920001184 polypeptide Polymers 0.000 description 3
- 230000004481 post-translational protein modification Effects 0.000 description 3
- 108010079317 prolyl-tyrosine Proteins 0.000 description 3
- 108010070643 prolylglutamic acid Proteins 0.000 description 3
- 108020003175 receptors Proteins 0.000 description 3
- 238000002560 therapeutic procedure Methods 0.000 description 3
- 108010044292 tryptophyltyrosine Proteins 0.000 description 3
- 108010078580 tyrosylleucine Proteins 0.000 description 3
- 108010009962 valyltyrosine Proteins 0.000 description 3
- 230000007502 viral entry Effects 0.000 description 3
- YBJHBAHKTGYVGT-ZKWXMUAHSA-N (+)-Biotin Chemical group N1C(=O)N[C@@H]2[C@H](CCCCC(=O)O)SC[C@@H]21 YBJHBAHKTGYVGT-ZKWXMUAHSA-N 0.000 description 2
- 108091032973 (ribonucleotides)n+m Proteins 0.000 description 2
- 102000040650 (ribonucleotides)n+m Human genes 0.000 description 2
- DQVAZKGVGKHQDS-UHFFFAOYSA-N 2-[[1-[2-[(2-amino-4-methylpentanoyl)amino]-4-methylpentanoyl]pyrrolidine-2-carbonyl]amino]-4-methylpentanoic acid Chemical compound CC(C)CC(N)C(=O)NC(CC(C)C)C(=O)N1CCCC1C(=O)NC(CC(C)C)C(O)=O DQVAZKGVGKHQDS-UHFFFAOYSA-N 0.000 description 2
- KDCGOANMDULRCW-UHFFFAOYSA-N 7H-purine Chemical compound N1=CNC2=NC=NC2=C1 KDCGOANMDULRCW-UHFFFAOYSA-N 0.000 description 2
- HHGYNJRJIINWAK-FXQIFTODSA-N Ala-Ala-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N HHGYNJRJIINWAK-FXQIFTODSA-N 0.000 description 2
- YYSWCHMLFJLLBJ-ZLUOBGJFSA-N Ala-Ala-Ser Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O YYSWCHMLFJLLBJ-ZLUOBGJFSA-N 0.000 description 2
- JBVSSSZFNTXJDX-YTLHQDLWSA-N Ala-Ala-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@H](C)N JBVSSSZFNTXJDX-YTLHQDLWSA-N 0.000 description 2
- UGLPMYSCWHTZQU-AUTRQRHGSA-N Ala-Ala-Tyr Chemical compound C[C@H]([NH3+])C(=O)N[C@@H](C)C(=O)N[C@H](C([O-])=O)CC1=CC=C(O)C=C1 UGLPMYSCWHTZQU-AUTRQRHGSA-N 0.000 description 2
- TTXMOJWKNRJWQJ-FXQIFTODSA-N Ala-Arg-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CCCN=C(N)N TTXMOJWKNRJWQJ-FXQIFTODSA-N 0.000 description 2
- WDIYWDJLXOCGRW-ACZMJKKPSA-N Ala-Asp-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O WDIYWDJLXOCGRW-ACZMJKKPSA-N 0.000 description 2
- BLGHHPHXVJWCNK-GUBZILKMSA-N Ala-Gln-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O BLGHHPHXVJWCNK-GUBZILKMSA-N 0.000 description 2
- JPGBXANAQYHTLA-DRZSPHRISA-N Ala-Gln-Phe Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 JPGBXANAQYHTLA-DRZSPHRISA-N 0.000 description 2
- PAIHPOGPJVUFJY-WDSKDSINSA-N Ala-Glu-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O PAIHPOGPJVUFJY-WDSKDSINSA-N 0.000 description 2
- LMFXXZPPZDCPTA-ZKWXMUAHSA-N Ala-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@H](C)N LMFXXZPPZDCPTA-ZKWXMUAHSA-N 0.000 description 2
- PCIFXPRIFWKWLK-YUMQZZPRSA-N Ala-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@H](C)N PCIFXPRIFWKWLK-YUMQZZPRSA-N 0.000 description 2
- NBTGEURICRTMGL-WHFBIAKZSA-N Ala-Gly-Ser Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O NBTGEURICRTMGL-WHFBIAKZSA-N 0.000 description 2
- PNALXAODQKTNLV-JBDRJPRFSA-N Ala-Ile-Ala Chemical compound C[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O PNALXAODQKTNLV-JBDRJPRFSA-N 0.000 description 2
- XCZXVTHYGSMQGH-NAKRPEOUSA-N Ala-Ile-Met Chemical compound C[C@H]([NH3+])C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCSC)C([O-])=O XCZXVTHYGSMQGH-NAKRPEOUSA-N 0.000 description 2
- OKIKVSXTXVVFDV-MMWGEVLESA-N Ala-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](C)N OKIKVSXTXVVFDV-MMWGEVLESA-N 0.000 description 2
- VNYMOTCMNHJGTG-JBDRJPRFSA-N Ala-Ile-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O VNYMOTCMNHJGTG-JBDRJPRFSA-N 0.000 description 2
- YHKANGMVQWRMAP-DCAQKATOSA-N Ala-Leu-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N YHKANGMVQWRMAP-DCAQKATOSA-N 0.000 description 2
- MNZHHDPWDWQJCQ-YUMQZZPRSA-N Ala-Leu-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O MNZHHDPWDWQJCQ-YUMQZZPRSA-N 0.000 description 2
- DPNZTBKGAUAZQU-DLOVCJGASA-N Ala-Leu-His Chemical compound C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N DPNZTBKGAUAZQU-DLOVCJGASA-N 0.000 description 2
- SOBIAADAMRHGKH-CIUDSAMLSA-N Ala-Leu-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O SOBIAADAMRHGKH-CIUDSAMLSA-N 0.000 description 2
- AJBVYEYZVYPFCF-CIUDSAMLSA-N Ala-Lys-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O AJBVYEYZVYPFCF-CIUDSAMLSA-N 0.000 description 2
- IAUSCRHURCZUJP-CIUDSAMLSA-N Ala-Lys-Cys Chemical compound NCCCC[C@H](NC(=O)[C@@H](N)C)C(=O)N[C@@H](CS)C(O)=O IAUSCRHURCZUJP-CIUDSAMLSA-N 0.000 description 2
- DEWWPUNXRNGMQN-LPEHRKFASA-N Ala-Met-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CCSC)C(=O)N1CCC[C@@H]1C(=O)O)N DEWWPUNXRNGMQN-LPEHRKFASA-N 0.000 description 2
- CJQAEJMHBAOQHA-DLOVCJGASA-N Ala-Phe-Asn Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(=O)N)C(=O)O)N CJQAEJMHBAOQHA-DLOVCJGASA-N 0.000 description 2
- DHBKYZYFEXXUAK-ONGXEEELSA-N Ala-Phe-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=CC=C1 DHBKYZYFEXXUAK-ONGXEEELSA-N 0.000 description 2
- JAQNUEWEJWBVAY-WBAXXEDZSA-N Ala-Phe-Phe Chemical compound C([C@H](NC(=O)[C@@H](N)C)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 JAQNUEWEJWBVAY-WBAXXEDZSA-N 0.000 description 2
- DXTYEWAQOXYRHZ-KKXDTOCCSA-N Ala-Phe-Tyr Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O)N DXTYEWAQOXYRHZ-KKXDTOCCSA-N 0.000 description 2
- IHMCQESUJVZTKW-UBHSHLNASA-N Ala-Phe-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](C)N)CC1=CC=CC=C1 IHMCQESUJVZTKW-UBHSHLNASA-N 0.000 description 2
- XAXHGSOBFPIRFG-LSJOCFKGSA-N Ala-Pro-His Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](Cc1cnc[nH]1)C(O)=O XAXHGSOBFPIRFG-LSJOCFKGSA-N 0.000 description 2
- ADSGHMXEAZJJNF-DCAQKATOSA-N Ala-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](C)N ADSGHMXEAZJJNF-DCAQKATOSA-N 0.000 description 2
- DCVYRWFAMZFSDA-ZLUOBGJFSA-N Ala-Ser-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O DCVYRWFAMZFSDA-ZLUOBGJFSA-N 0.000 description 2
- KLALXKYLOMZDQT-ZLUOBGJFSA-N Ala-Ser-Asn Chemical compound C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC(N)=O KLALXKYLOMZDQT-ZLUOBGJFSA-N 0.000 description 2
- YYAVDNKUWLAFCV-ACZMJKKPSA-N Ala-Ser-Gln Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(O)=O YYAVDNKUWLAFCV-ACZMJKKPSA-N 0.000 description 2
- DYXOFPBJBAHWFY-JBDRJPRFSA-N Ala-Ser-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@H](C)N DYXOFPBJBAHWFY-JBDRJPRFSA-N 0.000 description 2
- HOVPGJUNRLMIOZ-CIUDSAMLSA-N Ala-Ser-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@H](C)N HOVPGJUNRLMIOZ-CIUDSAMLSA-N 0.000 description 2
- NCQMBSJGJMYKCK-ZLUOBGJFSA-N Ala-Ser-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O NCQMBSJGJMYKCK-ZLUOBGJFSA-N 0.000 description 2
- WQKAQKZRDIZYNV-VZFHVOOUSA-N Ala-Ser-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O WQKAQKZRDIZYNV-VZFHVOOUSA-N 0.000 description 2
- QKHWNPQNOHEFST-VZFHVOOUSA-N Ala-Thr-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](C)N)O QKHWNPQNOHEFST-VZFHVOOUSA-N 0.000 description 2
- LSMDIAAALJJLRO-XQXXSGGOSA-N Ala-Thr-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O LSMDIAAALJJLRO-XQXXSGGOSA-N 0.000 description 2
- KUFVXLQLDHJVOG-SHGPDSBTSA-N Ala-Thr-Thr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](C)N)O KUFVXLQLDHJVOG-SHGPDSBTSA-N 0.000 description 2
- IETUUAHKCHOQHP-KZVJFYERSA-N Ala-Thr-Val Chemical compound CC(C)[C@H](NC(=O)[C@@H](NC(=O)[C@H](C)N)[C@@H](C)O)C(O)=O IETUUAHKCHOQHP-KZVJFYERSA-N 0.000 description 2
- XMIAMUXIMWREBJ-HERUPUMHSA-N Ala-Trp-Asn Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CC(=O)N)C(=O)O)N XMIAMUXIMWREBJ-HERUPUMHSA-N 0.000 description 2
- MTDDMSUUXNQMKK-BPNCWPANSA-N Ala-Tyr-Arg Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N MTDDMSUUXNQMKK-BPNCWPANSA-N 0.000 description 2
- QRIYOHQJRDHFKF-UWJYBYFXSA-N Ala-Tyr-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=C(O)C=C1 QRIYOHQJRDHFKF-UWJYBYFXSA-N 0.000 description 2
- BVLPIIBTWIYOML-ZKWXMUAHSA-N Ala-Val-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O BVLPIIBTWIYOML-ZKWXMUAHSA-N 0.000 description 2
- YJHKTAMKPGFJCT-NRPADANISA-N Ala-Val-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O YJHKTAMKPGFJCT-NRPADANISA-N 0.000 description 2
- XCIGOVDXZULBBV-DCAQKATOSA-N Ala-Val-Lys Chemical compound CC(C)[C@H](NC(=O)[C@H](C)N)C(=O)N[C@@H](CCCCN)C(O)=O XCIGOVDXZULBBV-DCAQKATOSA-N 0.000 description 2
- DHONNEYAZPNGSG-UBHSHLNASA-N Ala-Val-Phe Chemical compound C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 DHONNEYAZPNGSG-UBHSHLNASA-N 0.000 description 2
- NLYYHIKRBRMAJV-AEJSXWLSSA-N Ala-Val-Pro Chemical compound C[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N NLYYHIKRBRMAJV-AEJSXWLSSA-N 0.000 description 2
- 101100165660 Alternaria brassicicola bsc6 gene Proteins 0.000 description 2
- 102000053723 Angiotensin-converting enzyme 2 Human genes 0.000 description 2
- 108090000975 Angiotensin-converting enzyme 2 Proteins 0.000 description 2
- KWKQGHSSNHPGOW-BQBZGAKWSA-N Arg-Ala-Gly Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(=O)NCC(O)=O KWKQGHSSNHPGOW-BQBZGAKWSA-N 0.000 description 2
- IIABBYGHLYWVOS-FXQIFTODSA-N Arg-Asn-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O IIABBYGHLYWVOS-FXQIFTODSA-N 0.000 description 2
- MFAMTAVAFBPXDC-LPEHRKFASA-N Arg-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCCN=C(N)N)N)C(=O)O MFAMTAVAFBPXDC-LPEHRKFASA-N 0.000 description 2
- IYMAXBFPHPZYIK-BQBZGAKWSA-N Arg-Gly-Asp Chemical compound NC(N)=NCCC[C@H](N)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O IYMAXBFPHPZYIK-BQBZGAKWSA-N 0.000 description 2
- VRZDJJWOFXMFRO-ZFWWWQNUSA-N Arg-Gly-Trp Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)NCC(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O VRZDJJWOFXMFRO-ZFWWWQNUSA-N 0.000 description 2
- CLICCYPMVFGUOF-IHRRRGAJSA-N Arg-Lys-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O CLICCYPMVFGUOF-IHRRRGAJSA-N 0.000 description 2
- NPAVRDPEFVKELR-DCAQKATOSA-N Arg-Lys-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O NPAVRDPEFVKELR-DCAQKATOSA-N 0.000 description 2
- JOADBFCFJGNIKF-GUBZILKMSA-N Arg-Met-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C)C(O)=O JOADBFCFJGNIKF-GUBZILKMSA-N 0.000 description 2
- YTMKMRSYXHBGER-IHRRRGAJSA-N Arg-Phe-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N YTMKMRSYXHBGER-IHRRRGAJSA-N 0.000 description 2
- SLQQPJBDBVPVQV-JYJNAYRXSA-N Arg-Phe-Val Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(O)=O SLQQPJBDBVPVQV-JYJNAYRXSA-N 0.000 description 2
- VENMDXUVHSKEIN-GUBZILKMSA-N Arg-Ser-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O VENMDXUVHSKEIN-GUBZILKMSA-N 0.000 description 2
- DNLQVHBBMPZUGJ-BQBZGAKWSA-N Arg-Ser-Gly Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O DNLQVHBBMPZUGJ-BQBZGAKWSA-N 0.000 description 2
- JQHASVQBAKRJKD-GUBZILKMSA-N Arg-Ser-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCCN=C(N)N)N JQHASVQBAKRJKD-GUBZILKMSA-N 0.000 description 2
- AIFHRTPABBBHKU-RCWTZXSCSA-N Arg-Thr-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O AIFHRTPABBBHKU-RCWTZXSCSA-N 0.000 description 2
- CGWVCWFQGXOUSJ-ULQDDVLXSA-N Arg-Tyr-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(O)=O CGWVCWFQGXOUSJ-ULQDDVLXSA-N 0.000 description 2
- LFWOQHSQNCKXRU-UFYCRDLUSA-N Arg-Tyr-Phe Chemical compound C([C@H](NC(=O)[C@H](CCCN=C(N)N)N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=C(O)C=C1 LFWOQHSQNCKXRU-UFYCRDLUSA-N 0.000 description 2
- ISVACHFCVRKIDG-SRVKXCTJSA-N Arg-Val-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O ISVACHFCVRKIDG-SRVKXCTJSA-N 0.000 description 2
- PSUXEQYPYZLNER-QXEWZRGKSA-N Arg-Val-Asn Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O PSUXEQYPYZLNER-QXEWZRGKSA-N 0.000 description 2
- ANAHQDPQQBDOBM-UHFFFAOYSA-N Arg-Val-Tyr Natural products CC(C)C(NC(=O)C(N)CCNC(=N)N)C(=O)NC(Cc1ccc(O)cc1)C(=O)O ANAHQDPQQBDOBM-UHFFFAOYSA-N 0.000 description 2
- HZPSDHRYYIORKR-WHFBIAKZSA-N Asn-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H](N)CC(N)=O HZPSDHRYYIORKR-WHFBIAKZSA-N 0.000 description 2
- IARGXWMWRFOQPG-GCJQMDKQSA-N Asn-Ala-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O IARGXWMWRFOQPG-GCJQMDKQSA-N 0.000 description 2
- VDCIPFYVCICPEC-FXQIFTODSA-N Asn-Arg-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(O)=O VDCIPFYVCICPEC-FXQIFTODSA-N 0.000 description 2
- ZZXMOQIUIJJOKZ-ZLUOBGJFSA-N Asn-Asn-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CC(N)=O ZZXMOQIUIJJOKZ-ZLUOBGJFSA-N 0.000 description 2
- APHUDFFMXFYRKP-CIUDSAMLSA-N Asn-Asn-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC(=O)N)N APHUDFFMXFYRKP-CIUDSAMLSA-N 0.000 description 2
- KXFCBAHYSLJCCY-ZLUOBGJFSA-N Asn-Asn-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O KXFCBAHYSLJCCY-ZLUOBGJFSA-N 0.000 description 2
- QHBMKQWOIYJYMI-BYULHYEWSA-N Asn-Asn-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O QHBMKQWOIYJYMI-BYULHYEWSA-N 0.000 description 2
- XQQVCUIBGYFKDC-OLHMAJIHSA-N Asn-Asp-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XQQVCUIBGYFKDC-OLHMAJIHSA-N 0.000 description 2
- IYVSIZAXNLOKFQ-BYULHYEWSA-N Asn-Asp-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O IYVSIZAXNLOKFQ-BYULHYEWSA-N 0.000 description 2
- SQZIAWGBBUSSPJ-ZKWXMUAHSA-N Asn-Cys-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CC(=O)N)N SQZIAWGBBUSSPJ-ZKWXMUAHSA-N 0.000 description 2
- QNJIRRVTOXNGMH-GUBZILKMSA-N Asn-Gln-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CC(N)=O QNJIRRVTOXNGMH-GUBZILKMSA-N 0.000 description 2
- ULRPXVNMIIYDDJ-ACZMJKKPSA-N Asn-Glu-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC(=O)N)N ULRPXVNMIIYDDJ-ACZMJKKPSA-N 0.000 description 2
- DDPXDCKYWDGZAL-BQBZGAKWSA-N Asn-Gly-Arg Chemical compound NC(=O)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCN=C(N)N DDPXDCKYWDGZAL-BQBZGAKWSA-N 0.000 description 2
- NKLRWRRVYGQNIH-GHCJXIJMSA-N Asn-Ile-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O NKLRWRRVYGQNIH-GHCJXIJMSA-N 0.000 description 2
- IBLAOXSULLECQZ-IUKAMOBKSA-N Asn-Ile-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CC(N)=O IBLAOXSULLECQZ-IUKAMOBKSA-N 0.000 description 2
- SPCONPVIDFMDJI-QSFUFRPTSA-N Asn-Ile-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O SPCONPVIDFMDJI-QSFUFRPTSA-N 0.000 description 2
- UHGUKCOQUNPSKK-CIUDSAMLSA-N Asn-Leu-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC(=O)N)N UHGUKCOQUNPSKK-CIUDSAMLSA-N 0.000 description 2
- JEEFEQCRXKPQHC-KKUMJFAQSA-N Asn-Leu-Phe Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O JEEFEQCRXKPQHC-KKUMJFAQSA-N 0.000 description 2
- FHETWELNCBMRMG-HJGDQZAQSA-N Asn-Leu-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O FHETWELNCBMRMG-HJGDQZAQSA-N 0.000 description 2
- NTWOPSIUJBMNRI-KKUMJFAQSA-N Asn-Lys-Tyr Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 NTWOPSIUJBMNRI-KKUMJFAQSA-N 0.000 description 2
- PPCORQFLAZWUNO-QWRGUYRKSA-N Asn-Phe-Gly Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CC(=O)N)N PPCORQFLAZWUNO-QWRGUYRKSA-N 0.000 description 2
- HZZIFFOVHLWGCS-KKUMJFAQSA-N Asn-Phe-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O HZZIFFOVHLWGCS-KKUMJFAQSA-N 0.000 description 2
- YXVAESUIQFDBHN-SRVKXCTJSA-N Asn-Phe-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O YXVAESUIQFDBHN-SRVKXCTJSA-N 0.000 description 2
- UYCPJVYQYARFGB-YDHLFZDLSA-N Asn-Phe-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(O)=O UYCPJVYQYARFGB-YDHLFZDLSA-N 0.000 description 2
- VHQSGALUSWIYOD-QXEWZRGKSA-N Asn-Pro-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O VHQSGALUSWIYOD-QXEWZRGKSA-N 0.000 description 2
- HPBNLFLSSQDFQW-WHFBIAKZSA-N Asn-Ser-Gly Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O HPBNLFLSSQDFQW-WHFBIAKZSA-N 0.000 description 2
- HPNDKUOLNRVRAY-BIIVOSGPSA-N Asn-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CC(=O)N)N)C(=O)O HPNDKUOLNRVRAY-BIIVOSGPSA-N 0.000 description 2
- HNXWVVHIGTZTBO-LKXGYXEUSA-N Asn-Ser-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC(N)=O HNXWVVHIGTZTBO-LKXGYXEUSA-N 0.000 description 2
- WLVLIYYBPPONRJ-GCJQMDKQSA-N Asn-Thr-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O WLVLIYYBPPONRJ-GCJQMDKQSA-N 0.000 description 2
- QNNBHTFDFFFHGC-KKUMJFAQSA-N Asn-Tyr-Lys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)N)N)O QNNBHTFDFFFHGC-KKUMJFAQSA-N 0.000 description 2
- MJIJBEYEHBKTIM-BYULHYEWSA-N Asn-Val-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N MJIJBEYEHBKTIM-BYULHYEWSA-N 0.000 description 2
- CBHVAFXKOYAHOY-NHCYSSNCSA-N Asn-Val-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O CBHVAFXKOYAHOY-NHCYSSNCSA-N 0.000 description 2
- XOQYDFCQPWAMSA-KKHAAJSZSA-N Asn-Val-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XOQYDFCQPWAMSA-KKHAAJSZSA-N 0.000 description 2
- HBUJSDCLZCXXCW-YDHLFZDLSA-N Asn-Val-Tyr Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 HBUJSDCLZCXXCW-YDHLFZDLSA-N 0.000 description 2
- IXIWEFWRKIUMQX-DCAQKATOSA-N Asp-Arg-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@@H](N)CC(O)=O IXIWEFWRKIUMQX-DCAQKATOSA-N 0.000 description 2
- GWTLRDMPMJCNMH-WHFBIAKZSA-N Asp-Asn-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O GWTLRDMPMJCNMH-WHFBIAKZSA-N 0.000 description 2
- SBHUBSDEZQFJHJ-CIUDSAMLSA-N Asp-Asp-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC(O)=O SBHUBSDEZQFJHJ-CIUDSAMLSA-N 0.000 description 2
- ZSJFGGSPCCHMNE-LAEOZQHASA-N Asp-Gln-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC(=O)O)N ZSJFGGSPCCHMNE-LAEOZQHASA-N 0.000 description 2
- XAJRHVUUVUPFQL-ACZMJKKPSA-N Asp-Glu-Asp Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O XAJRHVUUVUPFQL-ACZMJKKPSA-N 0.000 description 2
- SPKCGKRUYKMDHP-GUDRVLHUSA-N Asp-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)O)N SPKCGKRUYKMDHP-GUDRVLHUSA-N 0.000 description 2
- KLYPOCBLKMPBIQ-GHCJXIJMSA-N Asp-Ile-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC(=O)O)N KLYPOCBLKMPBIQ-GHCJXIJMSA-N 0.000 description 2
- SPWXXPFDTMYTRI-IUKAMOBKSA-N Asp-Ile-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SPWXXPFDTMYTRI-IUKAMOBKSA-N 0.000 description 2
- RQHLMGCXCZUOGT-ZPFDUUQYSA-N Asp-Leu-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O RQHLMGCXCZUOGT-ZPFDUUQYSA-N 0.000 description 2
- UJGRZQYSNYTCAX-SRVKXCTJSA-N Asp-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC(O)=O UJGRZQYSNYTCAX-SRVKXCTJSA-N 0.000 description 2
- UMHUHHJMEXNSIV-CIUDSAMLSA-N Asp-Leu-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC(O)=O UMHUHHJMEXNSIV-CIUDSAMLSA-N 0.000 description 2
- QNMKWNONJGKJJC-NHCYSSNCSA-N Asp-Leu-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O QNMKWNONJGKJJC-NHCYSSNCSA-N 0.000 description 2
- NVFSJIXJZCDICF-SRVKXCTJSA-N Asp-Lys-Lys Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)O)N NVFSJIXJZCDICF-SRVKXCTJSA-N 0.000 description 2
- YWLDTBBUHZJQHW-KKUMJFAQSA-N Asp-Lys-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC(=O)O)N YWLDTBBUHZJQHW-KKUMJFAQSA-N 0.000 description 2
- DPNWSMBUYCLEDG-CIUDSAMLSA-N Asp-Lys-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O DPNWSMBUYCLEDG-CIUDSAMLSA-N 0.000 description 2
- RXBGWGRSWXOBGK-KKUMJFAQSA-N Asp-Lys-Tyr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O RXBGWGRSWXOBGK-KKUMJFAQSA-N 0.000 description 2
- NONWUQAWAANERO-BZSNNMDCSA-N Asp-Phe-Tyr Chemical compound C([C@H](NC(=O)[C@H](CC(O)=O)N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=CC=C1 NONWUQAWAANERO-BZSNNMDCSA-N 0.000 description 2
- YFGUZQQCSDZRBN-DCAQKATOSA-N Asp-Pro-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O YFGUZQQCSDZRBN-DCAQKATOSA-N 0.000 description 2
- WMLFFCRUSPNENW-ZLUOBGJFSA-N Asp-Ser-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O WMLFFCRUSPNENW-ZLUOBGJFSA-N 0.000 description 2
- XXAMCEGRCZQGEM-ZLUOBGJFSA-N Asp-Ser-Asn Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O XXAMCEGRCZQGEM-ZLUOBGJFSA-N 0.000 description 2
- YIDFBWRHIYOYAA-LKXGYXEUSA-N Asp-Ser-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O YIDFBWRHIYOYAA-LKXGYXEUSA-N 0.000 description 2
- NJLLRXWFPQQPHV-SRVKXCTJSA-N Asp-Tyr-Asn Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(O)=O NJLLRXWFPQQPHV-SRVKXCTJSA-N 0.000 description 2
- OQMGSMNZVHYDTQ-ZKWXMUAHSA-N Asp-Val-Cys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC(=O)O)N OQMGSMNZVHYDTQ-ZKWXMUAHSA-N 0.000 description 2
- XMKXONRMGJXCJV-LAEOZQHASA-N Asp-Val-Glu Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O XMKXONRMGJXCJV-LAEOZQHASA-N 0.000 description 2
- GGBQDSHTXKQSLP-NHCYSSNCSA-N Asp-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)O)N GGBQDSHTXKQSLP-NHCYSSNCSA-N 0.000 description 2
- SFJUYBCDQBAYAJ-YDHLFZDLSA-N Asp-Val-Phe Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 SFJUYBCDQBAYAJ-YDHLFZDLSA-N 0.000 description 2
- 241000972773 Aulopiformes Species 0.000 description 2
- 101100499295 Bacillus subtilis (strain 168) disA gene Proteins 0.000 description 2
- HEDRZPFGACZZDS-UHFFFAOYSA-N Chloroform Chemical compound ClC(Cl)Cl HEDRZPFGACZZDS-UHFFFAOYSA-N 0.000 description 2
- 108020004705 Codon Proteins 0.000 description 2
- 241000699802 Cricetulus griseus Species 0.000 description 2
- XMTDCXXLDZKAGI-ACZMJKKPSA-N Cys-Ala-Gln Chemical compound C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CS)N XMTDCXXLDZKAGI-ACZMJKKPSA-N 0.000 description 2
- DCXGXDGGXVZVMY-GHCJXIJMSA-N Cys-Asn-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CS DCXGXDGGXVZVMY-GHCJXIJMSA-N 0.000 description 2
- NDUSUIGBMZCOIL-ZKWXMUAHSA-N Cys-Asn-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CS)N NDUSUIGBMZCOIL-ZKWXMUAHSA-N 0.000 description 2
- GUKYYUFHWYRMEU-WHFBIAKZSA-N Cys-Gly-Asp Chemical compound [H]N[C@@H](CS)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O GUKYYUFHWYRMEU-WHFBIAKZSA-N 0.000 description 2
- UXIYYUMGFNSGBK-XPUUQOCRSA-N Cys-Gly-Val Chemical compound [H]N[C@@H](CS)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O UXIYYUMGFNSGBK-XPUUQOCRSA-N 0.000 description 2
- NLDWTJBJFVWBDQ-KKUMJFAQSA-N Cys-Lys-Phe Chemical compound NCCCC[C@H](NC(=O)[C@@H](N)CS)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 NLDWTJBJFVWBDQ-KKUMJFAQSA-N 0.000 description 2
- QQOWCDCBFFBRQH-IXOXFDKPSA-N Cys-Phe-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](CS)N)O QQOWCDCBFFBRQH-IXOXFDKPSA-N 0.000 description 2
- BCWIFCLVCRAIQK-ZLUOBGJFSA-N Cys-Ser-Cys Chemical compound C([C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CS)N)O BCWIFCLVCRAIQK-ZLUOBGJFSA-N 0.000 description 2
- ZGERHCJBLPQPGV-ACZMJKKPSA-N Cys-Ser-Gln Chemical compound C(CC(=O)N)[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CS)N ZGERHCJBLPQPGV-ACZMJKKPSA-N 0.000 description 2
- JLZCAZJGWNRXCI-XKBZYTNZSA-N Cys-Thr-Glu Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O JLZCAZJGWNRXCI-XKBZYTNZSA-N 0.000 description 2
- IZJLAQMWJHCHTN-BPUTZDHNSA-N Cys-Trp-Arg Chemical compound N[C@@H](CS)C(=O)N[C@@H](Cc1c[nH]c2ccccc12)C(=O)N[C@@H](CCCNC(=N)N)C(=O)O IZJLAQMWJHCHTN-BPUTZDHNSA-N 0.000 description 2
- ZKAUCGZIIXXWJQ-BZSNNMDCSA-N Cys-Tyr-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O)NC(=O)[C@H](CS)N)O ZKAUCGZIIXXWJQ-BZSNNMDCSA-N 0.000 description 2
- KVYVOGYEMPEXBT-GUBZILKMSA-N Gln-Ala-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(N)=O KVYVOGYEMPEXBT-GUBZILKMSA-N 0.000 description 2
- JFOKLAPFYCTNHW-SRVKXCTJSA-N Gln-Arg-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CCC(=O)N)N JFOKLAPFYCTNHW-SRVKXCTJSA-N 0.000 description 2
- LJEPDHWNQXPXMM-NHCYSSNCSA-N Gln-Arg-Val Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(O)=O LJEPDHWNQXPXMM-NHCYSSNCSA-N 0.000 description 2
- ZPDVKYLJTOFQJV-WDSKDSINSA-N Gln-Asn-Gly Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O ZPDVKYLJTOFQJV-WDSKDSINSA-N 0.000 description 2
- XEYMBRRKIFYQMF-GUBZILKMSA-N Gln-Asp-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O XEYMBRRKIFYQMF-GUBZILKMSA-N 0.000 description 2
- PKVWNYGXMNWJSI-CIUDSAMLSA-N Gln-Gln-Gln Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O PKVWNYGXMNWJSI-CIUDSAMLSA-N 0.000 description 2
- MADFVRSKEIEZHZ-DCAQKATOSA-N Gln-Gln-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCC(=O)N)N MADFVRSKEIEZHZ-DCAQKATOSA-N 0.000 description 2
- JXFLPKSDLDEOQK-JHEQGTHGSA-N Gln-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CCC(N)=O JXFLPKSDLDEOQK-JHEQGTHGSA-N 0.000 description 2
- JXBZEDIQFFCHPZ-PEFMBERDSA-N Gln-Ile-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CCC(=O)N)N JXBZEDIQFFCHPZ-PEFMBERDSA-N 0.000 description 2
- FTIJVMLAGRAYMJ-MNXVOIDGSA-N Gln-Ile-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCC(N)=O FTIJVMLAGRAYMJ-MNXVOIDGSA-N 0.000 description 2
- VZRAXPGTUNDIDK-GUBZILKMSA-N Gln-Leu-Asn Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCC(=O)N)N VZRAXPGTUNDIDK-GUBZILKMSA-N 0.000 description 2
- ZBKUIQNCRIYVGH-SDDRHHMPSA-N Gln-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)N)N ZBKUIQNCRIYVGH-SDDRHHMPSA-N 0.000 description 2
- YPMDZWPZFOZYFG-GUBZILKMSA-N Gln-Leu-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O YPMDZWPZFOZYFG-GUBZILKMSA-N 0.000 description 2
- LURQDGKYBFWWJA-MNXVOIDGSA-N Gln-Lys-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CCC(=O)N)N LURQDGKYBFWWJA-MNXVOIDGSA-N 0.000 description 2
- JNVGVECJCOZHCN-DRZSPHRISA-N Gln-Phe-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C)C(O)=O JNVGVECJCOZHCN-DRZSPHRISA-N 0.000 description 2
- MFORDNZDKAVNSR-SRVKXCTJSA-N Gln-Pro-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CCC(N)=O MFORDNZDKAVNSR-SRVKXCTJSA-N 0.000 description 2
- DCWNCMRZIZSZBL-KKUMJFAQSA-N Gln-Pro-Tyr Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CCC(=O)N)N)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O DCWNCMRZIZSZBL-KKUMJFAQSA-N 0.000 description 2
- MFHVAWMMKZBSRQ-ACZMJKKPSA-N Gln-Ser-Cys Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)O)N MFHVAWMMKZBSRQ-ACZMJKKPSA-N 0.000 description 2
- NHMRJKKAVMENKJ-WDCWCFNPSA-N Gln-Thr-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O NHMRJKKAVMENKJ-WDCWCFNPSA-N 0.000 description 2
- XKPACHRGOWQHFH-IRIUXVKKSA-N Gln-Thr-Tyr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O XKPACHRGOWQHFH-IRIUXVKKSA-N 0.000 description 2
- IIMZHVKZBGSEKZ-SZMVWBNQSA-N Gln-Trp-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC(C)C)C(O)=O IIMZHVKZBGSEKZ-SZMVWBNQSA-N 0.000 description 2
- VEYGCDYMOXHJLS-GVXVVHGQSA-N Gln-Val-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O VEYGCDYMOXHJLS-GVXVVHGQSA-N 0.000 description 2
- RLZBLVSJDFHDBL-KBIXCLLPSA-N Glu-Ala-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O RLZBLVSJDFHDBL-KBIXCLLPSA-N 0.000 description 2
- JJKKWYQVHRUSDG-GUBZILKMSA-N Glu-Ala-Lys Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCCN)C(O)=O JJKKWYQVHRUSDG-GUBZILKMSA-N 0.000 description 2
- ATRHMOJQJWPVBQ-DRZSPHRISA-N Glu-Ala-Phe Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O ATRHMOJQJWPVBQ-DRZSPHRISA-N 0.000 description 2
- JPHYJQHPILOKHC-ACZMJKKPSA-N Glu-Asp-Asp Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O JPHYJQHPILOKHC-ACZMJKKPSA-N 0.000 description 2
- DSPQRJXOIXHOHK-WDSKDSINSA-N Glu-Asp-Gly Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O DSPQRJXOIXHOHK-WDSKDSINSA-N 0.000 description 2
- JVSBYEDSSRZQGV-GUBZILKMSA-N Glu-Asp-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CCC(O)=O JVSBYEDSSRZQGV-GUBZILKMSA-N 0.000 description 2
- ZZIFPJZQHRJERU-WDSKDSINSA-N Glu-Cys-Gly Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CS)C(=O)NCC(O)=O ZZIFPJZQHRJERU-WDSKDSINSA-N 0.000 description 2
- KVBPDJIFRQUQFY-ACZMJKKPSA-N Glu-Cys-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CS)C(=O)N[C@@H](CO)C(O)=O KVBPDJIFRQUQFY-ACZMJKKPSA-N 0.000 description 2
- FKGNJUCQKXQNRA-NRPADANISA-N Glu-Cys-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CS)NC(=O)[C@@H](N)CCC(O)=O FKGNJUCQKXQNRA-NRPADANISA-N 0.000 description 2
- QJCKNLPMTPXXEM-AUTRQRHGSA-N Glu-Glu-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCC(O)=O QJCKNLPMTPXXEM-AUTRQRHGSA-N 0.000 description 2
- ZWQVYZXPYSYPJD-RYUDHWBXSA-N Glu-Gly-Phe Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 ZWQVYZXPYSYPJD-RYUDHWBXSA-N 0.000 description 2
- RAUDKMVXNOWDLS-WDSKDSINSA-N Glu-Gly-Ser Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O RAUDKMVXNOWDLS-WDSKDSINSA-N 0.000 description 2
- HILMIYALTUQTRC-XVKPBYJWSA-N Glu-Gly-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O HILMIYALTUQTRC-XVKPBYJWSA-N 0.000 description 2
- INGJLBQKTRJLFO-UKJIMTQDSA-N Glu-Ile-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCC(O)=O INGJLBQKTRJLFO-UKJIMTQDSA-N 0.000 description 2
- HVYWQYLBVXMXSV-GUBZILKMSA-N Glu-Leu-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O HVYWQYLBVXMXSV-GUBZILKMSA-N 0.000 description 2
- DNPCBMNFQVTHMA-DCAQKATOSA-N Glu-Leu-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O DNPCBMNFQVTHMA-DCAQKATOSA-N 0.000 description 2
- MWMJCGBSIORNCD-AVGNSLFASA-N Glu-Leu-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O MWMJCGBSIORNCD-AVGNSLFASA-N 0.000 description 2
- WNRZUESNGGDCJX-JYJNAYRXSA-N Glu-Leu-Phe Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O WNRZUESNGGDCJX-JYJNAYRXSA-N 0.000 description 2
- NJCALAAIGREHDR-WDCWCFNPSA-N Glu-Leu-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O NJCALAAIGREHDR-WDCWCFNPSA-N 0.000 description 2
- GJBUAAAIZSRCDC-GVXVVHGQSA-N Glu-Leu-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O GJBUAAAIZSRCDC-GVXVVHGQSA-N 0.000 description 2
- SUIAHERNFYRBDZ-GVXVVHGQSA-N Glu-Lys-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O SUIAHERNFYRBDZ-GVXVVHGQSA-N 0.000 description 2
- FQFWFZWOHOEVMZ-IHRRRGAJSA-N Glu-Phe-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O FQFWFZWOHOEVMZ-IHRRRGAJSA-N 0.000 description 2
- GMVCSRBOSIUTFC-FXQIFTODSA-N Glu-Ser-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(O)=O GMVCSRBOSIUTFC-FXQIFTODSA-N 0.000 description 2
- JVYNYWXHZWVJEF-NUMRIWBASA-N Glu-Thr-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCC(=O)O)N)O JVYNYWXHZWVJEF-NUMRIWBASA-N 0.000 description 2
- HBMRTXJZQDVRFT-DZKIICNBSA-N Glu-Tyr-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(O)=O HBMRTXJZQDVRFT-DZKIICNBSA-N 0.000 description 2
- FVGOGEGGQLNZGH-DZKIICNBSA-N Glu-Val-Phe Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 FVGOGEGGQLNZGH-DZKIICNBSA-N 0.000 description 2
- QXUPRMQJDWJDFR-NRPADANISA-N Glu-Val-Ser Chemical compound CC(C)[C@H](NC(=O)[C@@H](N)CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O QXUPRMQJDWJDFR-NRPADANISA-N 0.000 description 2
- MFVQGXGQRIXBPK-WDSKDSINSA-N Gly-Ala-Glu Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O MFVQGXGQRIXBPK-WDSKDSINSA-N 0.000 description 2
- UGVQELHRNUDMAA-BYPYZUCNSA-N Gly-Ala-Gly Chemical compound [NH3+]CC(=O)N[C@@H](C)C(=O)NCC([O-])=O UGVQELHRNUDMAA-BYPYZUCNSA-N 0.000 description 2
- VSVZIEVNUYDAFR-YUMQZZPRSA-N Gly-Ala-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)CN VSVZIEVNUYDAFR-YUMQZZPRSA-N 0.000 description 2
- JRDYDYXZKFNNRQ-XPUUQOCRSA-N Gly-Ala-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)CN JRDYDYXZKFNNRQ-XPUUQOCRSA-N 0.000 description 2
- PYUCNHJQQVSPGN-BQBZGAKWSA-N Gly-Arg-Cys Chemical compound C(C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)CN)CN=C(N)N PYUCNHJQQVSPGN-BQBZGAKWSA-N 0.000 description 2
- OVSKVOOUFAKODB-UWVGGRQHSA-N Gly-Arg-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCN=C(N)N OVSKVOOUFAKODB-UWVGGRQHSA-N 0.000 description 2
- GWCRIHNSVMOBEQ-BQBZGAKWSA-N Gly-Arg-Ser Chemical compound [H]NCC(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O GWCRIHNSVMOBEQ-BQBZGAKWSA-N 0.000 description 2
- OCDLPQDYTJPWNG-YUMQZZPRSA-N Gly-Asn-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)CN OCDLPQDYTJPWNG-YUMQZZPRSA-N 0.000 description 2
- JVACNFOPSUPDTK-QWRGUYRKSA-N Gly-Asn-Phe Chemical compound NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 JVACNFOPSUPDTK-QWRGUYRKSA-N 0.000 description 2
- XRTDOIOIBMAXCT-NKWVEPMBSA-N Gly-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)CN)C(=O)O XRTDOIOIBMAXCT-NKWVEPMBSA-N 0.000 description 2
- QSTLUOIOYLYLLF-WDSKDSINSA-N Gly-Asp-Glu Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O QSTLUOIOYLYLLF-WDSKDSINSA-N 0.000 description 2
- FZQLXNIMCPJVJE-YUMQZZPRSA-N Gly-Asp-Leu Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O FZQLXNIMCPJVJE-YUMQZZPRSA-N 0.000 description 2
- LCNXZQROPKFGQK-WHFBIAKZSA-N Gly-Asp-Ser Chemical compound NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O LCNXZQROPKFGQK-WHFBIAKZSA-N 0.000 description 2
- NPSWCZIRBAYNSB-JHEQGTHGSA-N Gly-Gln-Thr Chemical compound [H]NCC(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O NPSWCZIRBAYNSB-JHEQGTHGSA-N 0.000 description 2
- UFPXDFOYHVEIPI-BYPYZUCNSA-N Gly-Gly-Asp Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CC(O)=O UFPXDFOYHVEIPI-BYPYZUCNSA-N 0.000 description 2
- OLPPXYMMIARYAL-QMMMGPOBSA-N Gly-Gly-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)CNC(=O)CN OLPPXYMMIARYAL-QMMMGPOBSA-N 0.000 description 2
- SWQALSGKVLYKDT-ZKWXMUAHSA-N Gly-Ile-Ala Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O SWQALSGKVLYKDT-ZKWXMUAHSA-N 0.000 description 2
- COVXELOAORHTND-LSJOCFKGSA-N Gly-Ile-Val Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O COVXELOAORHTND-LSJOCFKGSA-N 0.000 description 2
- YIFUFYZELCMPJP-YUMQZZPRSA-N Gly-Leu-Cys Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CS)C(O)=O YIFUFYZELCMPJP-YUMQZZPRSA-N 0.000 description 2
- LHYJCVCQPWRMKZ-WEDXCCLWSA-N Gly-Leu-Thr Chemical compound [H]NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LHYJCVCQPWRMKZ-WEDXCCLWSA-N 0.000 description 2
- PTIIBFKSLCYQBO-NHCYSSNCSA-N Gly-Lys-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)CN PTIIBFKSLCYQBO-NHCYSSNCSA-N 0.000 description 2
- PCPOYRCAHPJXII-UWVGGRQHSA-N Gly-Lys-Met Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCSC)C(O)=O PCPOYRCAHPJXII-UWVGGRQHSA-N 0.000 description 2
- CVFOYJJOZYYEPE-KBPBESRZSA-N Gly-Lys-Tyr Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O CVFOYJJOZYYEPE-KBPBESRZSA-N 0.000 description 2
- OQQKUTVULYLCDG-ONGXEEELSA-N Gly-Lys-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CCCCN)NC(=O)CN)C(O)=O OQQKUTVULYLCDG-ONGXEEELSA-N 0.000 description 2
- GAFKBWKVXNERFA-QWRGUYRKSA-N Gly-Phe-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CC=CC=C1 GAFKBWKVXNERFA-QWRGUYRKSA-N 0.000 description 2
- IBYOLNARKHMLBG-WHOFXGATSA-N Gly-Phe-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CC=CC=C1 IBYOLNARKHMLBG-WHOFXGATSA-N 0.000 description 2
- SCJJPCQUJYPHRZ-BQBZGAKWSA-N Gly-Pro-Asn Chemical compound NCC(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O SCJJPCQUJYPHRZ-BQBZGAKWSA-N 0.000 description 2
- JJGBXTYGTKWGAT-YUMQZZPRSA-N Gly-Pro-Glu Chemical compound NCC(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O JJGBXTYGTKWGAT-YUMQZZPRSA-N 0.000 description 2
- IRJWAYCXIYUHQE-WHFBIAKZSA-N Gly-Ser-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CO)NC(=O)CN IRJWAYCXIYUHQE-WHFBIAKZSA-N 0.000 description 2
- YOBGUCWZPXJHTN-BQBZGAKWSA-N Gly-Ser-Arg Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCN=C(N)N YOBGUCWZPXJHTN-BQBZGAKWSA-N 0.000 description 2
- CSMYMGFCEJWALV-WDSKDSINSA-N Gly-Ser-Gln Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(N)=O CSMYMGFCEJWALV-WDSKDSINSA-N 0.000 description 2
- POJJAZJHBGXEGM-YUMQZZPRSA-N Gly-Ser-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)CN POJJAZJHBGXEGM-YUMQZZPRSA-N 0.000 description 2
- ZZWUYQXMIFTIIY-WEDXCCLWSA-N Gly-Thr-Leu Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O ZZWUYQXMIFTIIY-WEDXCCLWSA-N 0.000 description 2
- FFALDIDGPLUDKV-ZDLURKLDSA-N Gly-Thr-Ser Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O FFALDIDGPLUDKV-ZDLURKLDSA-N 0.000 description 2
- TVTZEOHWHUVYCG-KYNKHSRBSA-N Gly-Thr-Thr Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O TVTZEOHWHUVYCG-KYNKHSRBSA-N 0.000 description 2
- KBBFOULZCHWGJX-KBPBESRZSA-N Gly-Tyr-His Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)NC(=O)CN)O KBBFOULZCHWGJX-KBPBESRZSA-N 0.000 description 2
- GBYYQVBXFVDJPJ-WLTAIBSBSA-N Gly-Tyr-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)CN)O GBYYQVBXFVDJPJ-WLTAIBSBSA-N 0.000 description 2
- GWCJMBNBFYBQCV-XPUUQOCRSA-N Gly-Val-Ala Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O GWCJMBNBFYBQCV-XPUUQOCRSA-N 0.000 description 2
- BAYQNCWLXIDLHX-ONGXEEELSA-N Gly-Val-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)CN BAYQNCWLXIDLHX-ONGXEEELSA-N 0.000 description 2
- YGHSQRJSHKYUJY-SCZZXKLOSA-N Gly-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)CN YGHSQRJSHKYUJY-SCZZXKLOSA-N 0.000 description 2
- SBVMXEZQJVUARN-XPUUQOCRSA-N Gly-Val-Ser Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O SBVMXEZQJVUARN-XPUUQOCRSA-N 0.000 description 2
- KZTLOHBDLMIFSH-XVYDVKMFSA-N His-Ala-Asp Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(O)=O KZTLOHBDLMIFSH-XVYDVKMFSA-N 0.000 description 2
- HTZKFIYQMHJWSQ-INTQDDNPSA-N His-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CN=CN2)N HTZKFIYQMHJWSQ-INTQDDNPSA-N 0.000 description 2
- PMWSGVRIMIFXQH-KKUMJFAQSA-N His-His-Leu Chemical compound C([C@@H](C(=O)N[C@@H](CC(C)C)C(O)=O)NC(=O)[C@@H](N)CC=1NC=NC=1)C1=CN=CN1 PMWSGVRIMIFXQH-KKUMJFAQSA-N 0.000 description 2
- SKYULSWNBYAQMG-IHRRRGAJSA-N His-Leu-Arg Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O SKYULSWNBYAQMG-IHRRRGAJSA-N 0.000 description 2
- RLAOTFTXBFQJDV-KKUMJFAQSA-N His-Phe-Asp Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(=O)N[C@@H](CC(O)=O)C(O)=O)C1=CN=CN1 RLAOTFTXBFQJDV-KKUMJFAQSA-N 0.000 description 2
- MDOBWSFNSNPENN-PMVVWTBXSA-N His-Thr-Gly Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O MDOBWSFNSNPENN-PMVVWTBXSA-N 0.000 description 2
- CSTDQOOBZBAJKE-BWAGICSOSA-N His-Tyr-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CC2=CN=CN2)N)O CSTDQOOBZBAJKE-BWAGICSOSA-N 0.000 description 2
- 101000595467 Homo sapiens T-complex protein 1 subunit gamma Proteins 0.000 description 2
- NKVZTQVGUNLLQW-JBDRJPRFSA-N Ile-Ala-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(=O)O)N NKVZTQVGUNLLQW-JBDRJPRFSA-N 0.000 description 2
- YKRYHWJRQUSTKG-KBIXCLLPSA-N Ile-Ala-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N YKRYHWJRQUSTKG-KBIXCLLPSA-N 0.000 description 2
- JRHFQUPIZOYKQP-KBIXCLLPSA-N Ile-Ala-Glu Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCC(O)=O JRHFQUPIZOYKQP-KBIXCLLPSA-N 0.000 description 2
- AQCUAZTZSPQJFF-ZKWXMUAHSA-N Ile-Ala-Gly Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O AQCUAZTZSPQJFF-ZKWXMUAHSA-N 0.000 description 2
- HDOYNXLPTRQLAD-JBDRJPRFSA-N Ile-Ala-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(=O)O)N HDOYNXLPTRQLAD-JBDRJPRFSA-N 0.000 description 2
- TZCGZYWNIDZZMR-NAKRPEOUSA-N Ile-Arg-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](C)C(=O)O)N TZCGZYWNIDZZMR-NAKRPEOUSA-N 0.000 description 2
- QYOGJYIRKACXEP-SLBDDTMCSA-N Ile-Asn-Trp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N QYOGJYIRKACXEP-SLBDDTMCSA-N 0.000 description 2
- KUHFPGIVBOCRMV-MNXVOIDGSA-N Ile-Gln-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC(C)C)C(=O)O)N KUHFPGIVBOCRMV-MNXVOIDGSA-N 0.000 description 2
- NZOCIWKZUVUNDW-ZKWXMUAHSA-N Ile-Gly-Ala Chemical compound CC[C@H](C)[C@H](N)C(=O)NCC(=O)N[C@@H](C)C(O)=O NZOCIWKZUVUNDW-ZKWXMUAHSA-N 0.000 description 2
- SLQVFYWBGNNOTK-BYULHYEWSA-N Ile-Gly-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N[C@@H](CC(=O)N)C(=O)O)N SLQVFYWBGNNOTK-BYULHYEWSA-N 0.000 description 2
- LPFBXFILACZHIB-LAEOZQHASA-N Ile-Gly-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N[C@@H](CCC(=O)O)C(=O)O)N LPFBXFILACZHIB-LAEOZQHASA-N 0.000 description 2
- MQFGXJNSUJTXDT-QSFUFRPTSA-N Ile-Gly-Ile Chemical compound N[C@@H]([C@@H](C)CC)C(=O)NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)O MQFGXJNSUJTXDT-QSFUFRPTSA-N 0.000 description 2
- PDTMWFVVNZYWTR-NHCYSSNCSA-N Ile-Gly-Lys Chemical compound CC[C@H](C)[C@H](N)C(=O)NCC(=O)N[C@@H](CCCCN)C(O)=O PDTMWFVVNZYWTR-NHCYSSNCSA-N 0.000 description 2
- LWWILHPVAKKLQS-QXEWZRGKSA-N Ile-Gly-Met Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N[C@@H](CCSC)C(=O)O)N LWWILHPVAKKLQS-QXEWZRGKSA-N 0.000 description 2
- KYLIZSDYWQQTFM-PEDHHIEDSA-N Ile-Ile-Arg Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CCCN=C(N)N KYLIZSDYWQQTFM-PEDHHIEDSA-N 0.000 description 2
- TWPSALMCEHCIOY-YTFOTSKYSA-N Ile-Ile-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(=O)O)N TWPSALMCEHCIOY-YTFOTSKYSA-N 0.000 description 2
- CSQNHSGHAPRGPQ-YTFOTSKYSA-N Ile-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCCN)C(=O)O)N CSQNHSGHAPRGPQ-YTFOTSKYSA-N 0.000 description 2
- KLBVGHCGHUNHEA-BJDJZHNGSA-N Ile-Leu-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(=O)O)N KLBVGHCGHUNHEA-BJDJZHNGSA-N 0.000 description 2
- YSGBJIQXTIVBHZ-AJNGGQMLSA-N Ile-Lys-Leu Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O YSGBJIQXTIVBHZ-AJNGGQMLSA-N 0.000 description 2
- UDBPXJNOEWDBDF-XUXIUFHCSA-N Ile-Lys-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(=O)O)N UDBPXJNOEWDBDF-XUXIUFHCSA-N 0.000 description 2
- XLXPYSDGMXTTNQ-DKIMLUQUSA-N Ile-Phe-Leu Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](Cc1ccccc1)C(=O)N[C@@H](CC(C)C)C(O)=O XLXPYSDGMXTTNQ-DKIMLUQUSA-N 0.000 description 2
- VEPIBPGLTLPBDW-URLPEUOOSA-N Ile-Phe-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N VEPIBPGLTLPBDW-URLPEUOOSA-N 0.000 description 2
- XQLGNKLSPYCRMZ-HJWJTTGWSA-N Ile-Phe-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(=O)O)N XQLGNKLSPYCRMZ-HJWJTTGWSA-N 0.000 description 2
- YKZAMJXNJUWFIK-JBDRJPRFSA-N Ile-Ser-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(=O)O)N YKZAMJXNJUWFIK-JBDRJPRFSA-N 0.000 description 2
- JODPUDMBQBIWCK-GHCJXIJMSA-N Ile-Ser-Asn Chemical compound [H]N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O JODPUDMBQBIWCK-GHCJXIJMSA-N 0.000 description 2
- HXIDVIFHRYRXLZ-NAKRPEOUSA-N Ile-Ser-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)O)N HXIDVIFHRYRXLZ-NAKRPEOUSA-N 0.000 description 2
- YBKKLDBBPFIXBQ-MBLNEYKQSA-N Ile-Thr-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(=O)O)N YBKKLDBBPFIXBQ-MBLNEYKQSA-N 0.000 description 2
- WXLYNEHOGRYNFU-URLPEUOOSA-N Ile-Thr-Phe Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N WXLYNEHOGRYNFU-URLPEUOOSA-N 0.000 description 2
- ZUWSVOYKBCHLRR-MGHWNKPDSA-N Ile-Tyr-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCCCN)C(=O)O)N ZUWSVOYKBCHLRR-MGHWNKPDSA-N 0.000 description 2
- BCISUQVFDGYZBO-QSFUFRPTSA-N Ile-Val-Asp Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC(O)=O BCISUQVFDGYZBO-QSFUFRPTSA-N 0.000 description 2
- SITWEMZOJNKJCH-UHFFFAOYSA-N L-alanine-L-arginine Natural products CC(N)C(=O)NC(C(O)=O)CCCNC(N)=N SITWEMZOJNKJCH-UHFFFAOYSA-N 0.000 description 2
- LJHGALIOHLRRQN-DCAQKATOSA-N Leu-Ala-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N LJHGALIOHLRRQN-DCAQKATOSA-N 0.000 description 2
- CZCSUZMIRKFFFA-CIUDSAMLSA-N Leu-Ala-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(O)=O CZCSUZMIRKFFFA-CIUDSAMLSA-N 0.000 description 2
- MJOZZTKJZQFKDK-GUBZILKMSA-N Leu-Ala-Gln Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCC(N)=O MJOZZTKJZQFKDK-GUBZILKMSA-N 0.000 description 2
- WNGVUZWBXZKQES-YUMQZZPRSA-N Leu-Ala-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O WNGVUZWBXZKQES-YUMQZZPRSA-N 0.000 description 2
- BQSLGJHIAGOZCD-CIUDSAMLSA-N Leu-Ala-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O BQSLGJHIAGOZCD-CIUDSAMLSA-N 0.000 description 2
- XBBKIIGCUMBKCO-JXUBOQSCSA-N Leu-Ala-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XBBKIIGCUMBKCO-JXUBOQSCSA-N 0.000 description 2
- SUPVSFFZWVOEOI-UHFFFAOYSA-N Leu-Ala-Tyr Natural products CC(C)CC(N)C(=O)NC(C)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 SUPVSFFZWVOEOI-UHFFFAOYSA-N 0.000 description 2
- KSZCCRIGNVSHFH-UWVGGRQHSA-N Leu-Arg-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O KSZCCRIGNVSHFH-UWVGGRQHSA-N 0.000 description 2
- UCOCBWDBHCUPQP-DCAQKATOSA-N Leu-Arg-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O UCOCBWDBHCUPQP-DCAQKATOSA-N 0.000 description 2
- STAVRDQLZOTNKJ-RHYQMDGZSA-N Leu-Arg-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O STAVRDQLZOTNKJ-RHYQMDGZSA-N 0.000 description 2
- VCSBGUACOYUIGD-CIUDSAMLSA-N Leu-Asn-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O VCSBGUACOYUIGD-CIUDSAMLSA-N 0.000 description 2
- OGCQGUIWMSBHRZ-CIUDSAMLSA-N Leu-Asn-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O OGCQGUIWMSBHRZ-CIUDSAMLSA-N 0.000 description 2
- YKNBJXOJTURHCU-DCAQKATOSA-N Leu-Asp-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N YKNBJXOJTURHCU-DCAQKATOSA-N 0.000 description 2
- ZURHXHNAEJJRNU-CIUDSAMLSA-N Leu-Asp-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O ZURHXHNAEJJRNU-CIUDSAMLSA-N 0.000 description 2
- MMEDVBWCMGRKKC-GARJFASQSA-N Leu-Asp-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N MMEDVBWCMGRKKC-GARJFASQSA-N 0.000 description 2
- PVMPDMIKUVNOBD-CIUDSAMLSA-N Leu-Asp-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O PVMPDMIKUVNOBD-CIUDSAMLSA-N 0.000 description 2
- QLQHWWCSCLZUMA-KKUMJFAQSA-N Leu-Asp-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 QLQHWWCSCLZUMA-KKUMJFAQSA-N 0.000 description 2
- PPTAQBNUFKTJKA-BJDJZHNGSA-N Leu-Cys-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CS)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O PPTAQBNUFKTJKA-BJDJZHNGSA-N 0.000 description 2
- KAFOIVJDVSZUMD-UHFFFAOYSA-N Leu-Gln-Gln Natural products CC(C)CC(N)C(=O)NC(CCC(N)=O)C(=O)NC(CCC(N)=O)C(O)=O KAFOIVJDVSZUMD-UHFFFAOYSA-N 0.000 description 2
- BOFAFKVZQUMTID-AVGNSLFASA-N Leu-Gln-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N BOFAFKVZQUMTID-AVGNSLFASA-N 0.000 description 2
- GPICTNQYKHHHTH-GUBZILKMSA-N Leu-Gln-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O GPICTNQYKHHHTH-GUBZILKMSA-N 0.000 description 2
- CIVKXGPFXDIQBV-WDCWCFNPSA-N Leu-Gln-Thr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O CIVKXGPFXDIQBV-WDCWCFNPSA-N 0.000 description 2
- DZQMXBALGUHGJT-GUBZILKMSA-N Leu-Glu-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O DZQMXBALGUHGJT-GUBZILKMSA-N 0.000 description 2
- QVFGXCVIXXBFHO-AVGNSLFASA-N Leu-Glu-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O QVFGXCVIXXBFHO-AVGNSLFASA-N 0.000 description 2
- OGUUKPXUTHOIAV-SDDRHHMPSA-N Leu-Glu-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N OGUUKPXUTHOIAV-SDDRHHMPSA-N 0.000 description 2
- FIYMBBHGYNQFOP-IUCAKERBSA-N Leu-Gly-Gln Chemical compound CC(C)C[C@@H](C(=O)NCC(=O)N[C@@H](CCC(=O)N)C(=O)O)N FIYMBBHGYNQFOP-IUCAKERBSA-N 0.000 description 2
- CCQLQKZTXZBXTN-NHCYSSNCSA-N Leu-Gly-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)NCC(=O)N[C@@H]([C@@H](C)CC)C(O)=O CCQLQKZTXZBXTN-NHCYSSNCSA-N 0.000 description 2
- YFBBUHJJUXXZOF-UWVGGRQHSA-N Leu-Gly-Pro Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N1CCC[C@H]1C(O)=O YFBBUHJJUXXZOF-UWVGGRQHSA-N 0.000 description 2
- DBSLVQBXKVKDKJ-BJDJZHNGSA-N Leu-Ile-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O DBSLVQBXKVKDKJ-BJDJZHNGSA-N 0.000 description 2
- AVEGDIAXTDVBJS-XUXIUFHCSA-N Leu-Ile-Arg Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O AVEGDIAXTDVBJS-XUXIUFHCSA-N 0.000 description 2
- ORWTWZXGDBYVCP-BJDJZHNGSA-N Leu-Ile-Cys Chemical compound SC[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CC(C)C ORWTWZXGDBYVCP-BJDJZHNGSA-N 0.000 description 2
- HGFGEMSVBMCFKK-MNXVOIDGSA-N Leu-Ile-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(O)=O HGFGEMSVBMCFKK-MNXVOIDGSA-N 0.000 description 2
- JKSIBWITFMQTOA-XUXIUFHCSA-N Leu-Ile-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O JKSIBWITFMQTOA-XUXIUFHCSA-N 0.000 description 2
- DSFYPIUSAMSERP-IHRRRGAJSA-N Leu-Leu-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N DSFYPIUSAMSERP-IHRRRGAJSA-N 0.000 description 2
- JNDYEOUZBLOVOF-AVGNSLFASA-N Leu-Leu-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O JNDYEOUZBLOVOF-AVGNSLFASA-N 0.000 description 2
- XVZCXCTYGHPNEM-UHFFFAOYSA-N Leu-Leu-Pro Natural products CC(C)CC(N)C(=O)NC(CC(C)C)C(=O)N1CCCC1C(O)=O XVZCXCTYGHPNEM-UHFFFAOYSA-N 0.000 description 2
- IEWBEPKLKUXQBU-VOAKCMCISA-N Leu-Leu-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O IEWBEPKLKUXQBU-VOAKCMCISA-N 0.000 description 2
- JLWZLIQRYCTYBD-IHRRRGAJSA-N Leu-Lys-Arg Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O JLWZLIQRYCTYBD-IHRRRGAJSA-N 0.000 description 2
- LVTJJOJKDCVZGP-QWRGUYRKSA-N Leu-Lys-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O LVTJJOJKDCVZGP-QWRGUYRKSA-N 0.000 description 2
- VCHVSKNMTXWIIP-SRVKXCTJSA-N Leu-Lys-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O VCHVSKNMTXWIIP-SRVKXCTJSA-N 0.000 description 2
- ONPJGOIVICHWBW-BZSNNMDCSA-N Leu-Lys-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 ONPJGOIVICHWBW-BZSNNMDCSA-N 0.000 description 2
- FLNPJLDPGMLWAU-UWVGGRQHSA-N Leu-Met-Gly Chemical compound OC(=O)CNC(=O)[C@H](CCSC)NC(=O)[C@@H](N)CC(C)C FLNPJLDPGMLWAU-UWVGGRQHSA-N 0.000 description 2
- HDHQQEDVWQGBEE-DCAQKATOSA-N Leu-Met-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CO)C(O)=O HDHQQEDVWQGBEE-DCAQKATOSA-N 0.000 description 2
- GCXGCIYIHXSKAY-ULQDDVLXSA-N Leu-Phe-Arg Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O GCXGCIYIHXSKAY-ULQDDVLXSA-N 0.000 description 2
- SYRTUBLKWNDSDK-DKIMLUQUSA-N Leu-Phe-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O SYRTUBLKWNDSDK-DKIMLUQUSA-N 0.000 description 2
- DRWMRVFCKKXHCH-BZSNNMDCSA-N Leu-Phe-Leu Chemical compound CC(C)C[C@H]([NH3+])C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C([O-])=O)CC1=CC=CC=C1 DRWMRVFCKKXHCH-BZSNNMDCSA-N 0.000 description 2
- RRVCZCNFXIFGRA-DCAQKATOSA-N Leu-Pro-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O RRVCZCNFXIFGRA-DCAQKATOSA-N 0.000 description 2
- BMVFXOQHDQZAQU-DCAQKATOSA-N Leu-Pro-Asp Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(=O)O)C(=O)O)N BMVFXOQHDQZAQU-DCAQKATOSA-N 0.000 description 2
- XWEVVRRSIOBJOO-SRVKXCTJSA-N Leu-Pro-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O XWEVVRRSIOBJOO-SRVKXCTJSA-N 0.000 description 2
- KWLWZYMNUZJKMZ-IHRRRGAJSA-N Leu-Pro-Leu Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O KWLWZYMNUZJKMZ-IHRRRGAJSA-N 0.000 description 2
- PWPBLZXWFXJFHE-RHYQMDGZSA-N Leu-Pro-Thr Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O PWPBLZXWFXJFHE-RHYQMDGZSA-N 0.000 description 2
- UCXQIIIFOOGYEM-ULQDDVLXSA-N Leu-Pro-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 UCXQIIIFOOGYEM-ULQDDVLXSA-N 0.000 description 2
- JDBQSGMJBMPNFT-AVGNSLFASA-N Leu-Pro-Val Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O JDBQSGMJBMPNFT-AVGNSLFASA-N 0.000 description 2
- JIHDFWWRYHSAQB-GUBZILKMSA-N Leu-Ser-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O JIHDFWWRYHSAQB-GUBZILKMSA-N 0.000 description 2
- RGUXWMDNCPMQFB-YUMQZZPRSA-N Leu-Ser-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O RGUXWMDNCPMQFB-YUMQZZPRSA-N 0.000 description 2
- XOWMDXHFSBCAKQ-SRVKXCTJSA-N Leu-Ser-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC(C)C XOWMDXHFSBCAKQ-SRVKXCTJSA-N 0.000 description 2
- IWMJFLJQHIDZQW-KKUMJFAQSA-N Leu-Ser-Phe Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 IWMJFLJQHIDZQW-KKUMJFAQSA-N 0.000 description 2
- SQUFDMCWMFOEBA-KKUMJFAQSA-N Leu-Ser-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 SQUFDMCWMFOEBA-KKUMJFAQSA-N 0.000 description 2
- SVBJIZVVYJYGLA-DCAQKATOSA-N Leu-Ser-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O SVBJIZVVYJYGLA-DCAQKATOSA-N 0.000 description 2
- ICYRCNICGBJLGM-HJGDQZAQSA-N Leu-Thr-Asp Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(O)=O ICYRCNICGBJLGM-HJGDQZAQSA-N 0.000 description 2
- KLSUAWUZBMAZCL-RHYQMDGZSA-N Leu-Thr-Pro Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@H]1C(O)=O KLSUAWUZBMAZCL-RHYQMDGZSA-N 0.000 description 2
- ILDSIMPXNFWKLH-KATARQTJSA-N Leu-Thr-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O ILDSIMPXNFWKLH-KATARQTJSA-N 0.000 description 2
- RNYLNYTYMXACRI-VFAJRCTISA-N Leu-Thr-Trp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O RNYLNYTYMXACRI-VFAJRCTISA-N 0.000 description 2
- HGLKOTPFWOMPOB-MEYUZBJRSA-N Leu-Thr-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 HGLKOTPFWOMPOB-MEYUZBJRSA-N 0.000 description 2
- YWFZWQKWNDOWPA-XIRDDKMYSA-N Leu-Trp-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC(N)=O)C(O)=O YWFZWQKWNDOWPA-XIRDDKMYSA-N 0.000 description 2
- VJGQRELPQWNURN-JYJNAYRXSA-N Leu-Tyr-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O VJGQRELPQWNURN-JYJNAYRXSA-N 0.000 description 2
- UFPLDOKWDNTTRP-ULQDDVLXSA-N Leu-Tyr-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CC(C)C)CC1=CC=C(O)C=C1 UFPLDOKWDNTTRP-ULQDDVLXSA-N 0.000 description 2
- VQHUBNVKFFLWRP-ULQDDVLXSA-N Leu-Tyr-Val Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C(C)C)C(O)=O)CC1=CC=C(O)C=C1 VQHUBNVKFFLWRP-ULQDDVLXSA-N 0.000 description 2
- XZNJZXJZBMBGGS-NHCYSSNCSA-N Leu-Val-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O XZNJZXJZBMBGGS-NHCYSSNCSA-N 0.000 description 2
- MVJRBCJCRYGCKV-GVXVVHGQSA-N Leu-Val-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O MVJRBCJCRYGCKV-GVXVVHGQSA-N 0.000 description 2
- AIMGJYMCTAABEN-GVXVVHGQSA-N Leu-Val-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O AIMGJYMCTAABEN-GVXVVHGQSA-N 0.000 description 2
- FDBTVENULFNTAL-XQQFMLRXSA-N Leu-Val-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N FDBTVENULFNTAL-XQQFMLRXSA-N 0.000 description 2
- VHFFQUSNFFIZBT-CIUDSAMLSA-N Lys-Ala-Cys Chemical compound C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCCCN)N VHFFQUSNFFIZBT-CIUDSAMLSA-N 0.000 description 2
- FACUGMGEFUEBTI-SRVKXCTJSA-N Lys-Asn-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CCCCN FACUGMGEFUEBTI-SRVKXCTJSA-N 0.000 description 2
- AAORVPFVUIHEAB-YUMQZZPRSA-N Lys-Asp-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O AAORVPFVUIHEAB-YUMQZZPRSA-N 0.000 description 2
- IWWMPCPLFXFBAF-SRVKXCTJSA-N Lys-Asp-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O IWWMPCPLFXFBAF-SRVKXCTJSA-N 0.000 description 2
- QIJVAFLRMVBHMU-KKUMJFAQSA-N Lys-Asp-Phe Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O QIJVAFLRMVBHMU-KKUMJFAQSA-N 0.000 description 2
- YEIYAQQKADPIBJ-GARJFASQSA-N Lys-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCCCN)N)C(=O)O YEIYAQQKADPIBJ-GARJFASQSA-N 0.000 description 2
- ZAWOJFFMBANLGE-CIUDSAMLSA-N Lys-Cys-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CCCCN)N ZAWOJFFMBANLGE-CIUDSAMLSA-N 0.000 description 2
- RDIILCRAWOSDOQ-CIUDSAMLSA-N Lys-Cys-Asp Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(=O)O)C(=O)O)N RDIILCRAWOSDOQ-CIUDSAMLSA-N 0.000 description 2
- XNKDCYABMBBEKN-IUCAKERBSA-N Lys-Gly-Gln Chemical compound NCCCC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(N)=O XNKDCYABMBBEKN-IUCAKERBSA-N 0.000 description 2
- DTUZCYRNEJDKSR-NHCYSSNCSA-N Lys-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CCCCN DTUZCYRNEJDKSR-NHCYSSNCSA-N 0.000 description 2
- NKKFVJRLCCUJNA-QWRGUYRKSA-N Lys-Gly-Lys Chemical compound NCCCC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCCN NKKFVJRLCCUJNA-QWRGUYRKSA-N 0.000 description 2
- CANPXOLVTMKURR-WEDXCCLWSA-N Lys-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CCCCN CANPXOLVTMKURR-WEDXCCLWSA-N 0.000 description 2
- HAUUXTXKJNVIFY-ONGXEEELSA-N Lys-Gly-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O HAUUXTXKJNVIFY-ONGXEEELSA-N 0.000 description 2
- SLQJJFAVWSZLBL-BJDJZHNGSA-N Lys-Ile-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCCCN SLQJJFAVWSZLBL-BJDJZHNGSA-N 0.000 description 2
- QOJDBRUCOXQSSK-AJNGGQMLSA-N Lys-Ile-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCCN)C(O)=O QOJDBRUCOXQSSK-AJNGGQMLSA-N 0.000 description 2
- NJNRBRKHOWSGMN-SRVKXCTJSA-N Lys-Leu-Asn Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O NJNRBRKHOWSGMN-SRVKXCTJSA-N 0.000 description 2
- PINHPJWGVBKQII-SRVKXCTJSA-N Lys-Leu-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCCCN)N PINHPJWGVBKQII-SRVKXCTJSA-N 0.000 description 2
- SKRGVGLIRUGANF-AVGNSLFASA-N Lys-Leu-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O SKRGVGLIRUGANF-AVGNSLFASA-N 0.000 description 2
- ATNKHRAIZCMCCN-BZSNNMDCSA-N Lys-Lys-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CCCCN)N ATNKHRAIZCMCCN-BZSNNMDCSA-N 0.000 description 2
- YXPJCVNIDDKGOE-MELADBBJSA-N Lys-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)[C@H](CCCCN)N)C(=O)O YXPJCVNIDDKGOE-MELADBBJSA-N 0.000 description 2
- ALEVUGKHINJNIF-QEJZJMRPSA-N Lys-Phe-Ala Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C)C(O)=O)CC1=CC=CC=C1 ALEVUGKHINJNIF-QEJZJMRPSA-N 0.000 description 2
- TWPCWKVOZDUYAA-KKUMJFAQSA-N Lys-Phe-Asp Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O TWPCWKVOZDUYAA-KKUMJFAQSA-N 0.000 description 2
- PDIDTSZKKFEDMB-UWVGGRQHSA-N Lys-Pro-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O PDIDTSZKKFEDMB-UWVGGRQHSA-N 0.000 description 2
- IOQWIOPSKJOEKI-SRVKXCTJSA-N Lys-Ser-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O IOQWIOPSKJOEKI-SRVKXCTJSA-N 0.000 description 2
- YCJCEMKOZOYBEF-OEAJRASXSA-N Lys-Thr-Phe Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O YCJCEMKOZOYBEF-OEAJRASXSA-N 0.000 description 2
- VVURYEVJJTXWNE-ULQDDVLXSA-N Lys-Tyr-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(O)=O VVURYEVJJTXWNE-ULQDDVLXSA-N 0.000 description 2
- VKCPHIOZDWUFSW-ONGXEEELSA-N Lys-Val-Gly Chemical compound OC(=O)CNC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCCN VKCPHIOZDWUFSW-ONGXEEELSA-N 0.000 description 2
- DRRXXZBXDMLGFC-IHRRRGAJSA-N Lys-Val-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCCN DRRXXZBXDMLGFC-IHRRRGAJSA-N 0.000 description 2
- GILLQRYAWOMHED-DCAQKATOSA-N Lys-Val-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCCN GILLQRYAWOMHED-DCAQKATOSA-N 0.000 description 2
- IKXQOBUBZSOWDY-AVGNSLFASA-N Lys-Val-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)O)NC(=O)[C@H](CCCCN)N IKXQOBUBZSOWDY-AVGNSLFASA-N 0.000 description 2
- WDTLNWHPIPCMMP-AVGNSLFASA-N Met-Arg-Leu Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O WDTLNWHPIPCMMP-AVGNSLFASA-N 0.000 description 2
- ACYHZNZHIZWLQF-BQBZGAKWSA-N Met-Asn-Gly Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O ACYHZNZHIZWLQF-BQBZGAKWSA-N 0.000 description 2
- OSOLWRWQADPDIQ-DCAQKATOSA-N Met-Asp-Leu Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O OSOLWRWQADPDIQ-DCAQKATOSA-N 0.000 description 2
- VOOINLQYUZOREH-SRVKXCTJSA-N Met-Gln-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCSC)N VOOINLQYUZOREH-SRVKXCTJSA-N 0.000 description 2
- HLQWFLJOJRFXHO-CIUDSAMLSA-N Met-Glu-Ser Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O HLQWFLJOJRFXHO-CIUDSAMLSA-N 0.000 description 2
- AWGBEIYZPAXXSX-RWMBFGLXSA-N Met-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCSC)N AWGBEIYZPAXXSX-RWMBFGLXSA-N 0.000 description 2
- BJPQKNHZHUCQNQ-SRVKXCTJSA-N Met-Pro-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CCSC)N BJPQKNHZHUCQNQ-SRVKXCTJSA-N 0.000 description 2
- JACMWNXOOUYXCD-JYJNAYRXSA-N Met-Val-Phe Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 JACMWNXOOUYXCD-JYJNAYRXSA-N 0.000 description 2
- IIHMNTBFPMRJCN-RCWTZXSCSA-N Met-Val-Thr Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O IIHMNTBFPMRJCN-RCWTZXSCSA-N 0.000 description 2
- 241001465754 Metazoa Species 0.000 description 2
- PESQCPHRXOFIPX-UHFFFAOYSA-N N-L-methionyl-L-tyrosine Natural products CSCCC(N)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 PESQCPHRXOFIPX-UHFFFAOYSA-N 0.000 description 2
- 101150007210 ORF6 gene Proteins 0.000 description 2
- MDHZEOMXGNBSIL-DLOVCJGASA-N Phe-Ala-Cys Chemical compound C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N MDHZEOMXGNBSIL-DLOVCJGASA-N 0.000 description 2
- QMMRHASQEVCJGR-UBHSHLNASA-N Phe-Ala-Pro Chemical compound C([C@H](N)C(=O)N[C@@H](C)C(=O)N1[C@@H](CCC1)C(O)=O)C1=CC=CC=C1 QMMRHASQEVCJGR-UBHSHLNASA-N 0.000 description 2
- HXSUFWQYLPKEHF-IHRRRGAJSA-N Phe-Asn-Arg Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N HXSUFWQYLPKEHF-IHRRRGAJSA-N 0.000 description 2
- UUWCIPUVJJIEEP-SRVKXCTJSA-N Phe-Asn-Cys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CS)C(=O)O)N UUWCIPUVJJIEEP-SRVKXCTJSA-N 0.000 description 2
- KIEPQOIQHFKQLK-PCBIJLKTSA-N Phe-Asn-Ile Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KIEPQOIQHFKQLK-PCBIJLKTSA-N 0.000 description 2
- AWAYOWOUGVZXOB-BZSNNMDCSA-N Phe-Asn-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 AWAYOWOUGVZXOB-BZSNNMDCSA-N 0.000 description 2
- WGXOKDLDIWSOCV-MELADBBJSA-N Phe-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC2=CC=CC=C2)N)C(=O)O WGXOKDLDIWSOCV-MELADBBJSA-N 0.000 description 2
- UEXCHCYDPAIVDE-SRVKXCTJSA-N Phe-Asp-Cys Chemical compound SC[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 UEXCHCYDPAIVDE-SRVKXCTJSA-N 0.000 description 2
- MGECUMGTSHYHEJ-QEWYBTABSA-N Phe-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 MGECUMGTSHYHEJ-QEWYBTABSA-N 0.000 description 2
- JJHVFCUWLSKADD-ONGXEEELSA-N Phe-Gly-Ala Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)NCC(=O)N[C@@H](C)C(O)=O JJHVFCUWLSKADD-ONGXEEELSA-N 0.000 description 2
- HBGFEEQFVBWYJQ-KBPBESRZSA-N Phe-Gly-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC1=CC=CC=C1 HBGFEEQFVBWYJQ-KBPBESRZSA-N 0.000 description 2
- QPVFUAUFEBPIPT-CDMKHQONSA-N Phe-Gly-Thr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O QPVFUAUFEBPIPT-CDMKHQONSA-N 0.000 description 2
- VZFPYFRVHMSSNA-JURCDPSOSA-N Phe-Ile-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CC1=CC=CC=C1 VZFPYFRVHMSSNA-JURCDPSOSA-N 0.000 description 2
- MJQFZGOIVBDIMZ-WHOFXGATSA-N Phe-Ile-Gly Chemical compound N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(=O)O MJQFZGOIVBDIMZ-WHOFXGATSA-N 0.000 description 2
- LRBSWBVUCLLRLU-BZSNNMDCSA-N Phe-Leu-Lys Chemical compound CC(C)C[C@H](NC(=O)[C@@H](N)Cc1ccccc1)C(=O)N[C@@H](CCCCN)C(O)=O LRBSWBVUCLLRLU-BZSNNMDCSA-N 0.000 description 2
- RMKGXGPQIPLTFC-KKUMJFAQSA-N Phe-Lys-Asn Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O RMKGXGPQIPLTFC-KKUMJFAQSA-N 0.000 description 2
- OQTDZEJJWWAGJT-KKUMJFAQSA-N Phe-Lys-Asp Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O OQTDZEJJWWAGJT-KKUMJFAQSA-N 0.000 description 2
- IEOHQGFKHXUALJ-JYJNAYRXSA-N Phe-Met-Arg Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O IEOHQGFKHXUALJ-JYJNAYRXSA-N 0.000 description 2
- MGLBSROLWAWCKN-FCLVOEFKSA-N Phe-Phe-Thr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MGLBSROLWAWCKN-FCLVOEFKSA-N 0.000 description 2
- AFNJAQVMTIQTCB-DLOVCJGASA-N Phe-Ser-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CC=CC=C1 AFNJAQVMTIQTCB-DLOVCJGASA-N 0.000 description 2
- UNBFGVQVQGXXCK-KKUMJFAQSA-N Phe-Ser-Leu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O UNBFGVQVQGXXCK-KKUMJFAQSA-N 0.000 description 2
- MCIXMYKSPQUMJG-SRVKXCTJSA-N Phe-Ser-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O MCIXMYKSPQUMJG-SRVKXCTJSA-N 0.000 description 2
- IAOZOFPONWDXNT-IXOXFDKPSA-N Phe-Ser-Thr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O IAOZOFPONWDXNT-IXOXFDKPSA-N 0.000 description 2
- BSTPNLNKHKBONJ-HTUGSXCWSA-N Phe-Thr-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N)O BSTPNLNKHKBONJ-HTUGSXCWSA-N 0.000 description 2
- BSKMOCNNLNDIMU-CDMKHQONSA-N Phe-Thr-Gly Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O BSKMOCNNLNDIMU-CDMKHQONSA-N 0.000 description 2
- BPIMVBKDLSBKIJ-FCLVOEFKSA-N Phe-Thr-Phe Chemical compound C([C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 BPIMVBKDLSBKIJ-FCLVOEFKSA-N 0.000 description 2
- SJRQWEDYTKYHHL-SLFFLAALSA-N Phe-Tyr-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=C(C=C2)O)NC(=O)[C@H](CC3=CC=CC=C3)N)C(=O)O SJRQWEDYTKYHHL-SLFFLAALSA-N 0.000 description 2
- KIQUCMUULDXTAZ-HJOGWXRNSA-N Phe-Tyr-Tyr Chemical compound N[C@@H](Cc1ccccc1)C(=O)N[C@@H](Cc1ccc(O)cc1)C(=O)N[C@@H](Cc1ccc(O)cc1)C(O)=O KIQUCMUULDXTAZ-HJOGWXRNSA-N 0.000 description 2
- CDHURCQGUDNBMA-UBHSHLNASA-N Phe-Val-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 CDHURCQGUDNBMA-UBHSHLNASA-N 0.000 description 2
- GOUWCZRDTWTODO-YDHLFZDLSA-N Phe-Val-Asn Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O GOUWCZRDTWTODO-YDHLFZDLSA-N 0.000 description 2
- YUPRIZTWANWWHK-DZKIICNBSA-N Phe-Val-Glu Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N YUPRIZTWANWWHK-DZKIICNBSA-N 0.000 description 2
- MWQXFDIQXIXPMS-UNQGMJICSA-N Phe-Val-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CC1=CC=CC=C1)N)O MWQXFDIQXIXPMS-UNQGMJICSA-N 0.000 description 2
- 101100226894 Phomopsis amygdali PaGT gene Proteins 0.000 description 2
- FZHBZMDRDASUHN-NAKRPEOUSA-N Pro-Ala-Ile Chemical compound CC[C@H](C)[C@H](NC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1)C(O)=O FZHBZMDRDASUHN-NAKRPEOUSA-N 0.000 description 2
- HFZNNDWPHBRNPV-KZVJFYERSA-N Pro-Ala-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O HFZNNDWPHBRNPV-KZVJFYERSA-N 0.000 description 2
- LCRSGSIRKLXZMZ-BPNCWPANSA-N Pro-Ala-Tyr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O LCRSGSIRKLXZMZ-BPNCWPANSA-N 0.000 description 2
- SGCZFWSQERRKBD-BQBZGAKWSA-N Pro-Asp-Gly Chemical compound OC(=O)CNC(=O)[C@H](CC(O)=O)NC(=O)[C@@H]1CCCN1 SGCZFWSQERRKBD-BQBZGAKWSA-N 0.000 description 2
- KPDRZQUWJKTMBP-DCAQKATOSA-N Pro-Asp-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@@H]1CCCN1 KPDRZQUWJKTMBP-DCAQKATOSA-N 0.000 description 2
- YFNOUBWUIIJQHF-LPEHRKFASA-N Pro-Asp-Pro Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC(=O)O)C(=O)N2CCC[C@@H]2C(=O)O YFNOUBWUIIJQHF-LPEHRKFASA-N 0.000 description 2
- HQVPQXMCQKXARZ-FXQIFTODSA-N Pro-Cys-Ser Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CS)C(=O)N[C@@H](CO)C(=O)O HQVPQXMCQKXARZ-FXQIFTODSA-N 0.000 description 2
- BBFRBZYKHIKFBX-GMOBBJLQSA-N Pro-Ile-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@@H]1CCCN1 BBFRBZYKHIKFBX-GMOBBJLQSA-N 0.000 description 2
- XYHMFGGWNOFUOU-QXEWZRGKSA-N Pro-Ile-Gly Chemical compound OC(=O)CNC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H]1CCCN1 XYHMFGGWNOFUOU-QXEWZRGKSA-N 0.000 description 2
- VZKBJNBZMZHKRC-XUXIUFHCSA-N Pro-Ile-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O VZKBJNBZMZHKRC-XUXIUFHCSA-N 0.000 description 2
- XYSXOCIWCPFOCG-IHRRRGAJSA-N Pro-Leu-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O XYSXOCIWCPFOCG-IHRRRGAJSA-N 0.000 description 2
- RMODQFBNDDENCP-IHRRRGAJSA-N Pro-Lys-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O RMODQFBNDDENCP-IHRRRGAJSA-N 0.000 description 2
- RPLMFKUKFZOTER-AVGNSLFASA-N Pro-Met-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@@H]1CCCN1 RPLMFKUKFZOTER-AVGNSLFASA-N 0.000 description 2
- AJBQTGZIZQXBLT-STQMWFEESA-N Pro-Phe-Gly Chemical compound C([C@@H](C(=O)NCC(=O)O)NC(=O)[C@H]1NCCC1)C1=CC=CC=C1 AJBQTGZIZQXBLT-STQMWFEESA-N 0.000 description 2
- WVXQQUWOKUZIEG-VEVYYDQMSA-N Pro-Thr-Asn Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(O)=O WVXQQUWOKUZIEG-VEVYYDQMSA-N 0.000 description 2
- FDMCIBSQRKFSTJ-RHYQMDGZSA-N Pro-Thr-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O FDMCIBSQRKFSTJ-RHYQMDGZSA-N 0.000 description 2
- YDTUEBLEAVANFH-RCWTZXSCSA-N Pro-Val-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H]1CCCN1 YDTUEBLEAVANFH-RCWTZXSCSA-N 0.000 description 2
- 108010003201 RGH 0205 Proteins 0.000 description 2
- 229940022005 RNA vaccine Drugs 0.000 description 2
- HBZBPFLJNDXRAY-FXQIFTODSA-N Ser-Ala-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O HBZBPFLJNDXRAY-FXQIFTODSA-N 0.000 description 2
- HQTKVSCNCDLXSX-BQBZGAKWSA-N Ser-Arg-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O HQTKVSCNCDLXSX-BQBZGAKWSA-N 0.000 description 2
- OYEDZGNMSBZCIM-XGEHTFHBSA-N Ser-Arg-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OYEDZGNMSBZCIM-XGEHTFHBSA-N 0.000 description 2
- HBOABDXGTMMDSE-GUBZILKMSA-N Ser-Arg-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(O)=O HBOABDXGTMMDSE-GUBZILKMSA-N 0.000 description 2
- UGJRQLURDVGULT-LKXGYXEUSA-N Ser-Asn-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O UGJRQLURDVGULT-LKXGYXEUSA-N 0.000 description 2
- TYYBJUYSTWJHGO-ZKWXMUAHSA-N Ser-Asn-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O TYYBJUYSTWJHGO-ZKWXMUAHSA-N 0.000 description 2
- BGOWRLSWJCVYAQ-CIUDSAMLSA-N Ser-Asp-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O BGOWRLSWJCVYAQ-CIUDSAMLSA-N 0.000 description 2
- DSSOYPJWSWFOLK-CIUDSAMLSA-N Ser-Cys-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(O)=O DSSOYPJWSWFOLK-CIUDSAMLSA-N 0.000 description 2
- YPUSXTWURJANKF-KBIXCLLPSA-N Ser-Gln-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O YPUSXTWURJANKF-KBIXCLLPSA-N 0.000 description 2
- SMIDBHKWSYUBRZ-ACZMJKKPSA-N Ser-Glu-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O SMIDBHKWSYUBRZ-ACZMJKKPSA-N 0.000 description 2
- GZBKRJVCRMZAST-XKBZYTNZSA-N Ser-Glu-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GZBKRJVCRMZAST-XKBZYTNZSA-N 0.000 description 2
- BPMRXBZYPGYPJN-WHFBIAKZSA-N Ser-Gly-Asn Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O BPMRXBZYPGYPJN-WHFBIAKZSA-N 0.000 description 2
- SVWQEIRZHHNBIO-WHFBIAKZSA-N Ser-Gly-Cys Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CS)C(O)=O SVWQEIRZHHNBIO-WHFBIAKZSA-N 0.000 description 2
- JFWDJFULOLKQFY-QWRGUYRKSA-N Ser-Gly-Phe Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O JFWDJFULOLKQFY-QWRGUYRKSA-N 0.000 description 2
- SFTZWNJFZYOLBD-ZDLURKLDSA-N Ser-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CO SFTZWNJFZYOLBD-ZDLURKLDSA-N 0.000 description 2
- CXBFHZLODKPIJY-AAEUAGOBSA-N Ser-Gly-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CO)N CXBFHZLODKPIJY-AAEUAGOBSA-N 0.000 description 2
- XXXAXOWMBOKTRN-XPUUQOCRSA-N Ser-Gly-Val Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O XXXAXOWMBOKTRN-XPUUQOCRSA-N 0.000 description 2
- JEHPKECJCALLRW-CUJWVEQBSA-N Ser-His-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O JEHPKECJCALLRW-CUJWVEQBSA-N 0.000 description 2
- RIAKPZVSNBBNRE-BJDJZHNGSA-N Ser-Ile-Leu Chemical compound OC[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O RIAKPZVSNBBNRE-BJDJZHNGSA-N 0.000 description 2
- GJFYFGOEWLDQGW-GUBZILKMSA-N Ser-Leu-Gln Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CO)N GJFYFGOEWLDQGW-GUBZILKMSA-N 0.000 description 2
- ZIFYDQAFEMIZII-GUBZILKMSA-N Ser-Leu-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O ZIFYDQAFEMIZII-GUBZILKMSA-N 0.000 description 2
- VMLONWHIORGALA-SRVKXCTJSA-N Ser-Leu-Leu Chemical compound CC(C)C[C@@H](C([O-])=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H]([NH3+])CO VMLONWHIORGALA-SRVKXCTJSA-N 0.000 description 2
- YUJLIIRMIAGMCQ-CIUDSAMLSA-N Ser-Leu-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O YUJLIIRMIAGMCQ-CIUDSAMLSA-N 0.000 description 2
- IXZHZUGGKLRHJD-DCAQKATOSA-N Ser-Leu-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O IXZHZUGGKLRHJD-DCAQKATOSA-N 0.000 description 2
- PMCMLDNPAZUYGI-DCAQKATOSA-N Ser-Lys-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O PMCMLDNPAZUYGI-DCAQKATOSA-N 0.000 description 2
- VIIJCAQMJBHSJH-FXQIFTODSA-N Ser-Met-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CO)C(O)=O VIIJCAQMJBHSJH-FXQIFTODSA-N 0.000 description 2
- ASGYVPAVFNDZMA-GUBZILKMSA-N Ser-Met-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](CO)N ASGYVPAVFNDZMA-GUBZILKMSA-N 0.000 description 2
- ZKBKUWQVDWWSRI-BZSNNMDCSA-N Ser-Phe-Tyr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O ZKBKUWQVDWWSRI-BZSNNMDCSA-N 0.000 description 2
- QMCDMHWAKMUGJE-IHRRRGAJSA-N Ser-Phe-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(O)=O QMCDMHWAKMUGJE-IHRRRGAJSA-N 0.000 description 2
- NUEHQDHDLDXCRU-GUBZILKMSA-N Ser-Pro-Arg Chemical compound OC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCN=C(N)N)C(O)=O NUEHQDHDLDXCRU-GUBZILKMSA-N 0.000 description 2
- HHJFMHQYEAAOBM-ZLUOBGJFSA-N Ser-Ser-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O HHJFMHQYEAAOBM-ZLUOBGJFSA-N 0.000 description 2
- PPCZVWHJWJFTFN-ZLUOBGJFSA-N Ser-Ser-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O PPCZVWHJWJFTFN-ZLUOBGJFSA-N 0.000 description 2
- FZXOPYUEQGDGMS-ACZMJKKPSA-N Ser-Ser-Gln Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(O)=O FZXOPYUEQGDGMS-ACZMJKKPSA-N 0.000 description 2
- SRSPTFBENMJHMR-WHFBIAKZSA-N Ser-Ser-Gly Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O SRSPTFBENMJHMR-WHFBIAKZSA-N 0.000 description 2
- VGQVAVQWKJLIRM-FXQIFTODSA-N Ser-Ser-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O VGQVAVQWKJLIRM-FXQIFTODSA-N 0.000 description 2
- PURRNJBBXDDWLX-ZDLURKLDSA-N Ser-Thr-Gly Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CO)N)O PURRNJBBXDDWLX-ZDLURKLDSA-N 0.000 description 2
- NADLKBTYNKUJEP-KATARQTJSA-N Ser-Thr-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O NADLKBTYNKUJEP-KATARQTJSA-N 0.000 description 2
- BDMWLJLPPUCLNV-XGEHTFHBSA-N Ser-Thr-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O BDMWLJLPPUCLNV-XGEHTFHBSA-N 0.000 description 2
- VAIWUNAAPZZGRI-IHPCNDPISA-N Ser-Trp-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)NC(=O)[C@H](CO)N VAIWUNAAPZZGRI-IHPCNDPISA-N 0.000 description 2
- VEVYMLNYMULSMS-AVGNSLFASA-N Ser-Tyr-Gln Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O VEVYMLNYMULSMS-AVGNSLFASA-N 0.000 description 2
- ZVBCMFDJIMUELU-BZSNNMDCSA-N Ser-Tyr-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)NC(=O)[C@H](CO)N ZVBCMFDJIMUELU-BZSNNMDCSA-N 0.000 description 2
- UKKROEYWYIHWBD-ZKWXMUAHSA-N Ser-Val-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O UKKROEYWYIHWBD-ZKWXMUAHSA-N 0.000 description 2
- SGZVZUCRAVSPKQ-FXQIFTODSA-N Ser-Val-Cys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CO)N SGZVZUCRAVSPKQ-FXQIFTODSA-N 0.000 description 2
- JZRYFUGREMECBH-XPUUQOCRSA-N Ser-Val-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O JZRYFUGREMECBH-XPUUQOCRSA-N 0.000 description 2
- YEDSOSIKVUMIJE-DCAQKATOSA-N Ser-Val-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O YEDSOSIKVUMIJE-DCAQKATOSA-N 0.000 description 2
- JGUWRQWULDWNCM-FXQIFTODSA-N Ser-Val-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O JGUWRQWULDWNCM-FXQIFTODSA-N 0.000 description 2
- 102100036049 T-complex protein 1 subunit gamma Human genes 0.000 description 2
- TYVAWPFQYFPSBR-BFHQHQDPSA-N Thr-Ala-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)NCC(O)=O TYVAWPFQYFPSBR-BFHQHQDPSA-N 0.000 description 2
- BSNZTJXVDOINSR-JXUBOQSCSA-N Thr-Ala-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O BSNZTJXVDOINSR-JXUBOQSCSA-N 0.000 description 2
- JMQUAZXYFAEOIH-XGEHTFHBSA-N Thr-Arg-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CS)C(=O)O)N)O JMQUAZXYFAEOIH-XGEHTFHBSA-N 0.000 description 2
- TWLMXDWFVNEFFK-FJXKBIBVSA-N Thr-Arg-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O TWLMXDWFVNEFFK-FJXKBIBVSA-N 0.000 description 2
- UTSWGQNAQRIHAI-UNQGMJICSA-N Thr-Arg-Phe Chemical compound NC(N)=NCCC[C@H](NC(=O)[C@@H](N)[C@H](O)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 UTSWGQNAQRIHAI-UNQGMJICSA-N 0.000 description 2
- CEXFELBFVHLYDZ-XGEHTFHBSA-N Thr-Arg-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O CEXFELBFVHLYDZ-XGEHTFHBSA-N 0.000 description 2
- NOWXWJLVGTVJKM-PBCZWWQYSA-N Thr-Asp-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N)O NOWXWJLVGTVJKM-PBCZWWQYSA-N 0.000 description 2
- GKMYGVQDGVYCPC-IUKAMOBKSA-N Thr-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H]([C@@H](C)O)N GKMYGVQDGVYCPC-IUKAMOBKSA-N 0.000 description 2
- ZUUDNCOCILSYAM-KKHAAJSZSA-N Thr-Asp-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O ZUUDNCOCILSYAM-KKHAAJSZSA-N 0.000 description 2
- KZUJCMPVNXOBAF-LKXGYXEUSA-N Thr-Cys-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(O)=O)C(O)=O KZUJCMPVNXOBAF-LKXGYXEUSA-N 0.000 description 2
- OYTNZCBFDXGQGE-XQXXSGGOSA-N Thr-Gln-Ala Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](C)C(=O)O)N)O OYTNZCBFDXGQGE-XQXXSGGOSA-N 0.000 description 2
- GUZGCDIZVGODML-NKIYYHGXSA-N Thr-Gln-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N)O GUZGCDIZVGODML-NKIYYHGXSA-N 0.000 description 2
- JMGJDTNUMAZNLX-RWRJDSDZSA-N Thr-Glu-Ile Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JMGJDTNUMAZNLX-RWRJDSDZSA-N 0.000 description 2
- ONNSECRQFSTMCC-XKBZYTNZSA-N Thr-Glu-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O ONNSECRQFSTMCC-XKBZYTNZSA-N 0.000 description 2
- BNGDYRRHRGOPHX-IFFSRLJSSA-N Thr-Glu-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)[C@@H](C)O)C(O)=O BNGDYRRHRGOPHX-IFFSRLJSSA-N 0.000 description 2
- MPUMPERGHHJGRP-WEDXCCLWSA-N Thr-Gly-Lys Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)N[C@@H](CCCCN)C(=O)O)N)O MPUMPERGHHJGRP-WEDXCCLWSA-N 0.000 description 2
- JKGGPMOUIAAJAA-YEPSODPASA-N Thr-Gly-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O JKGGPMOUIAAJAA-YEPSODPASA-N 0.000 description 2
- GXUWHVZYDAHFSV-FLBSBUHZSA-N Thr-Ile-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GXUWHVZYDAHFSV-FLBSBUHZSA-N 0.000 description 2
- XYFISNXATOERFZ-OSUNSFLBSA-N Thr-Ile-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)O)NC(=O)[C@H]([C@@H](C)O)N XYFISNXATOERFZ-OSUNSFLBSA-N 0.000 description 2
- RRRRCRYTLZVCEN-HJGDQZAQSA-N Thr-Leu-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O RRRRCRYTLZVCEN-HJGDQZAQSA-N 0.000 description 2
- MEJHFIOYJHTWMK-VOAKCMCISA-N Thr-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)[C@@H](C)O MEJHFIOYJHTWMK-VOAKCMCISA-N 0.000 description 2
- YOOAQCZYZHGUAZ-KATARQTJSA-N Thr-Leu-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O YOOAQCZYZHGUAZ-KATARQTJSA-N 0.000 description 2
- KZSYAEWQMJEGRZ-RHYQMDGZSA-N Thr-Leu-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O KZSYAEWQMJEGRZ-RHYQMDGZSA-N 0.000 description 2
- KRDSCBLRHORMRK-JXUBOQSCSA-N Thr-Lys-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O KRDSCBLRHORMRK-JXUBOQSCSA-N 0.000 description 2
- TZJSEJOXAIWOST-RHYQMDGZSA-N Thr-Lys-Arg Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CCCN=C(N)N TZJSEJOXAIWOST-RHYQMDGZSA-N 0.000 description 2
- HPQHHRLWSAMMKG-KATARQTJSA-N Thr-Lys-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CS)C(=O)O)N)O HPQHHRLWSAMMKG-KATARQTJSA-N 0.000 description 2
- SPVHQURZJCUDQC-VOAKCMCISA-N Thr-Lys-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O SPVHQURZJCUDQC-VOAKCMCISA-N 0.000 description 2
- FDQXPJCLVPFKJW-KJEVXHAQSA-N Thr-Met-Tyr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N)O FDQXPJCLVPFKJW-KJEVXHAQSA-N 0.000 description 2
- KZURUCDWKDEAFZ-XVSYOHENSA-N Thr-Phe-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(=O)N)C(=O)O)N)O KZURUCDWKDEAFZ-XVSYOHENSA-N 0.000 description 2
- NZRUWPIYECBYRK-HTUGSXCWSA-N Thr-Phe-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O NZRUWPIYECBYRK-HTUGSXCWSA-N 0.000 description 2
- BIBYEFRASCNLAA-CDMKHQONSA-N Thr-Phe-Gly Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CC=CC=C1 BIBYEFRASCNLAA-CDMKHQONSA-N 0.000 description 2
- HSQXHRIRJSFDOH-URLPEUOOSA-N Thr-Phe-Ile Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O HSQXHRIRJSFDOH-URLPEUOOSA-N 0.000 description 2
- MXNAOGFNFNKUPD-JHYOHUSXSA-N Thr-Phe-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MXNAOGFNFNKUPD-JHYOHUSXSA-N 0.000 description 2
- MXDOAJQRJBMGMO-FJXKBIBVSA-N Thr-Pro-Gly Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O MXDOAJQRJBMGMO-FJXKBIBVSA-N 0.000 description 2
- GFRIEEKFXOVPIR-RHYQMDGZSA-N Thr-Pro-Lys Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(O)=O GFRIEEKFXOVPIR-RHYQMDGZSA-N 0.000 description 2
- PRTHQBSMXILLPC-XGEHTFHBSA-N Thr-Ser-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O PRTHQBSMXILLPC-XGEHTFHBSA-N 0.000 description 2
- STUAPCLEDMKXKL-LKXGYXEUSA-N Thr-Ser-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O STUAPCLEDMKXKL-LKXGYXEUSA-N 0.000 description 2
- IVDFVBVIVLJJHR-LKXGYXEUSA-N Thr-Ser-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O IVDFVBVIVLJJHR-LKXGYXEUSA-N 0.000 description 2
- NDZYTIMDOZMECO-SHGPDSBTSA-N Thr-Thr-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O NDZYTIMDOZMECO-SHGPDSBTSA-N 0.000 description 2
- YRJOLUDFVAUXLI-GSSVUCPTSA-N Thr-Thr-Asp Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(O)=O YRJOLUDFVAUXLI-GSSVUCPTSA-N 0.000 description 2
- VBMOVTMNHWPZJR-SUSMZKCASA-N Thr-Thr-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O VBMOVTMNHWPZJR-SUSMZKCASA-N 0.000 description 2
- VGNLMPBYWWNQFS-ZEILLAHLSA-N Thr-Thr-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N)O VGNLMPBYWWNQFS-ZEILLAHLSA-N 0.000 description 2
- QJIODPFLAASXJC-JHYOHUSXSA-N Thr-Thr-Phe Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N)O QJIODPFLAASXJC-JHYOHUSXSA-N 0.000 description 2
- COYHRQWNJDJCNA-NUJDXYNKSA-N Thr-Thr-Thr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O COYHRQWNJDJCNA-NUJDXYNKSA-N 0.000 description 2
- OGOYMQWIWHGTGH-KZVJFYERSA-N Thr-Val-Ala Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O OGOYMQWIWHGTGH-KZVJFYERSA-N 0.000 description 2
- BKVICMPZWRNWOC-RHYQMDGZSA-N Thr-Val-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)[C@@H](C)O BKVICMPZWRNWOC-RHYQMDGZSA-N 0.000 description 2
- VYVBSMCZNHOZGD-RCWTZXSCSA-N Thr-Val-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C(C)C)C(O)=O VYVBSMCZNHOZGD-RCWTZXSCSA-N 0.000 description 2
- GWBWCGITOYODER-YTQUADARSA-N Trp-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)N GWBWCGITOYODER-YTQUADARSA-N 0.000 description 2
- RWAYYYOZMHMEGD-XIRDDKMYSA-N Trp-Leu-Ser Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O)=CNC2=C1 RWAYYYOZMHMEGD-XIRDDKMYSA-N 0.000 description 2
- UIRVSEPRMWDVEW-RNXOBYDBSA-N Trp-Tyr-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)NC(=O)[C@H](CC3=CNC4=CC=CC=C43)N UIRVSEPRMWDVEW-RNXOBYDBSA-N 0.000 description 2
- UGFOSENEZHEQKX-PJODQICGSA-N Trp-Val-Ala Chemical compound CC(C)[C@H](NC(=O)[C@@H](N)Cc1c[nH]c2ccccc12)C(=O)N[C@@H](C)C(O)=O UGFOSENEZHEQKX-PJODQICGSA-N 0.000 description 2
- JONPRIHUYSPIMA-UWJYBYFXSA-N Tyr-Ala-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 JONPRIHUYSPIMA-UWJYBYFXSA-N 0.000 description 2
- IELISNUVHBKYBX-XDTLVQLUSA-N Tyr-Ala-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 IELISNUVHBKYBX-XDTLVQLUSA-N 0.000 description 2
- ZWZOCUWOXSDYFZ-CQDKDKBSSA-N Tyr-Ala-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 ZWZOCUWOXSDYFZ-CQDKDKBSSA-N 0.000 description 2
- LGEYOIQBBIPHQN-UWJYBYFXSA-N Tyr-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 LGEYOIQBBIPHQN-UWJYBYFXSA-N 0.000 description 2
- ADBDQGBDNUTRDB-ULQDDVLXSA-N Tyr-Arg-Leu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O ADBDQGBDNUTRDB-ULQDDVLXSA-N 0.000 description 2
- SCCKSNREWHMKOJ-SRVKXCTJSA-N Tyr-Asn-Ser Chemical compound N[C@@H](Cc1ccc(O)cc1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O SCCKSNREWHMKOJ-SRVKXCTJSA-N 0.000 description 2
- JRXKIVGWMMIIOF-YDHLFZDLSA-N Tyr-Asn-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC1=CC=C(C=C1)O)N JRXKIVGWMMIIOF-YDHLFZDLSA-N 0.000 description 2
- GAYLGYUVTDMLKC-UWJYBYFXSA-N Tyr-Asp-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 GAYLGYUVTDMLKC-UWJYBYFXSA-N 0.000 description 2
- FFCRCJZJARTYCG-KKUMJFAQSA-N Tyr-Cys-Lys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CCCCN)C(=O)O)N)O FFCRCJZJARTYCG-KKUMJFAQSA-N 0.000 description 2
- TWAVEIJGFCBWCG-JYJNAYRXSA-N Tyr-Gln-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC1=CC=C(C=C1)O)N TWAVEIJGFCBWCG-JYJNAYRXSA-N 0.000 description 2
- ZRPLVTZTKPPSBT-AVGNSLFASA-N Tyr-Glu-Ser Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O ZRPLVTZTKPPSBT-AVGNSLFASA-N 0.000 description 2
- LTSIAOZUVISRAQ-QWRGUYRKSA-N Tyr-Gly-Cys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)NCC(=O)N[C@@H](CS)C(=O)O)N)O LTSIAOZUVISRAQ-QWRGUYRKSA-N 0.000 description 2
- CTDPLKMBVALCGN-JSGCOSHPSA-N Tyr-Gly-Val Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O CTDPLKMBVALCGN-JSGCOSHPSA-N 0.000 description 2
- NKUGCYDFQKFVOJ-JYJNAYRXSA-N Tyr-Leu-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 NKUGCYDFQKFVOJ-JYJNAYRXSA-N 0.000 description 2
- DWAMXBFJNZIHMC-KBPBESRZSA-N Tyr-Leu-Gly Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O DWAMXBFJNZIHMC-KBPBESRZSA-N 0.000 description 2
- CNNVVEPJTFOGHI-ACRUOGEOSA-N Tyr-Lys-Tyr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O CNNVVEPJTFOGHI-ACRUOGEOSA-N 0.000 description 2
- SCZJKZLFSSPJDP-ACRUOGEOSA-N Tyr-Phe-Leu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O SCZJKZLFSSPJDP-ACRUOGEOSA-N 0.000 description 2
- WURLIFOWSMBUAR-SLFFLAALSA-N Tyr-Phe-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)NC(=O)[C@H](CC3=CC=C(C=C3)O)N)C(=O)O WURLIFOWSMBUAR-SLFFLAALSA-N 0.000 description 2
- ZPFLBLFITJCBTP-QWRGUYRKSA-N Tyr-Ser-Gly Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(=O)NCC(O)=O ZPFLBLFITJCBTP-QWRGUYRKSA-N 0.000 description 2
- NHOVZGFNTGMYMI-KKUMJFAQSA-N Tyr-Ser-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 NHOVZGFNTGMYMI-KKUMJFAQSA-N 0.000 description 2
- HRHYJNLMIJWGLF-BZSNNMDCSA-N Tyr-Ser-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=C(O)C=C1 HRHYJNLMIJWGLF-BZSNNMDCSA-N 0.000 description 2
- LUMQYLVYUIRHHU-YJRXYDGGSA-N Tyr-Ser-Thr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LUMQYLVYUIRHHU-YJRXYDGGSA-N 0.000 description 2
- BIVIUZRBCAUNPW-JRQIVUDYSA-N Tyr-Thr-Asn Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(O)=O BIVIUZRBCAUNPW-JRQIVUDYSA-N 0.000 description 2
- UUBKSZNKJUJQEJ-JRQIVUDYSA-N Tyr-Thr-Asp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N)O UUBKSZNKJUJQEJ-JRQIVUDYSA-N 0.000 description 2
- CLEGSEJVGBYZBJ-MEYUZBJRSA-N Tyr-Thr-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H]([C@H](O)C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 CLEGSEJVGBYZBJ-MEYUZBJRSA-N 0.000 description 2
- XFEMMSGONWQACR-KJEVXHAQSA-N Tyr-Thr-Met Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N)O XFEMMSGONWQACR-KJEVXHAQSA-N 0.000 description 2
- GPLTZEMVOCZVAV-UFYCRDLUSA-N Tyr-Tyr-Arg Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O)C1=CC=C(O)C=C1 GPLTZEMVOCZVAV-UFYCRDLUSA-N 0.000 description 2
- RVGVIWNHABGIFH-IHRRRGAJSA-N Tyr-Val-Ser Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O RVGVIWNHABGIFH-IHRRRGAJSA-N 0.000 description 2
- ZLFHAAGHGQBQQN-GUBZILKMSA-N Val-Ala-Pro Natural products CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N1CCC[C@H]1C(O)=O ZLFHAAGHGQBQQN-GUBZILKMSA-N 0.000 description 2
- VJOWWOGRNXRQMF-UVBJJODRSA-N Val-Ala-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](C)NC(=O)[C@@H](N)C(C)C)C(O)=O)=CNC2=C1 VJOWWOGRNXRQMF-UVBJJODRSA-N 0.000 description 2
- JOQSQZFKFYJKKJ-GUBZILKMSA-N Val-Arg-Cys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CS)C(=O)O)N JOQSQZFKFYJKKJ-GUBZILKMSA-N 0.000 description 2
- PAPWZOJOLKZEFR-AVGNSLFASA-N Val-Arg-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCCN)C(=O)O)N PAPWZOJOLKZEFR-AVGNSLFASA-N 0.000 description 2
- CVUDMNSZAIZFAE-TUAOUCFPSA-N Val-Arg-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N1CCC[C@@H]1C(=O)O)N CVUDMNSZAIZFAE-TUAOUCFPSA-N 0.000 description 2
- CVUDMNSZAIZFAE-UHFFFAOYSA-N Val-Arg-Pro Natural products NC(N)=NCCCC(NC(=O)C(N)C(C)C)C(=O)N1CCCC1C(O)=O CVUDMNSZAIZFAE-UHFFFAOYSA-N 0.000 description 2
- BYOHPUZJVXWHAE-BYULHYEWSA-N Val-Asn-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)N)C(=O)O)N BYOHPUZJVXWHAE-BYULHYEWSA-N 0.000 description 2
- AUMNPAUHKUNHHN-BYULHYEWSA-N Val-Asn-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N AUMNPAUHKUNHHN-BYULHYEWSA-N 0.000 description 2
- QGFPYRPIUXBYGR-YDHLFZDLSA-N Val-Asn-Phe Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N QGFPYRPIUXBYGR-YDHLFZDLSA-N 0.000 description 2
- VLOYGOZDPGYWFO-LAEOZQHASA-N Val-Asp-Glu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O VLOYGOZDPGYWFO-LAEOZQHASA-N 0.000 description 2
- QHDXUYOYTPWCSK-RCOVLWMOSA-N Val-Asp-Gly Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)NCC(=O)O)N QHDXUYOYTPWCSK-RCOVLWMOSA-N 0.000 description 2
- FPCIBLUVDNXPJO-XPUUQOCRSA-N Val-Cys-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CS)C(=O)NCC(O)=O FPCIBLUVDNXPJO-XPUUQOCRSA-N 0.000 description 2
- LHADRQBREKTRLR-DCAQKATOSA-N Val-Cys-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](C(C)C)N LHADRQBREKTRLR-DCAQKATOSA-N 0.000 description 2
- HIZMLPKDJAXDRG-FXQIFTODSA-N Val-Cys-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CO)C(=O)O)N HIZMLPKDJAXDRG-FXQIFTODSA-N 0.000 description 2
- AGKDVLSDNSTLFA-UMNHJUIQSA-N Val-Gln-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N1CCC[C@@H]1C(=O)O)N AGKDVLSDNSTLFA-UMNHJUIQSA-N 0.000 description 2
- VVZDBPBZHLQPPB-XVKPBYJWSA-N Val-Glu-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O VVZDBPBZHLQPPB-XVKPBYJWSA-N 0.000 description 2
- ZXAGTABZUOMUDO-GVXVVHGQSA-N Val-Glu-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N ZXAGTABZUOMUDO-GVXVVHGQSA-N 0.000 description 2
- DJEVQCWNMQOABE-RCOVLWMOSA-N Val-Gly-Asp Chemical compound CC(C)[C@@H](C(=O)NCC(=O)N[C@@H](CC(=O)O)C(=O)O)N DJEVQCWNMQOABE-RCOVLWMOSA-N 0.000 description 2
- PIFJAFRUVWZRKR-QMMMGPOBSA-N Val-Gly-Gly Chemical compound CC(C)[C@H]([NH3+])C(=O)NCC(=O)NCC([O-])=O PIFJAFRUVWZRKR-QMMMGPOBSA-N 0.000 description 2
- CPGJELLYDQEDRK-NAKRPEOUSA-N Val-Ile-Ala Chemical compound CC[C@H](C)[C@H](NC(=O)[C@@H](N)C(C)C)C(=O)N[C@@H](C)C(O)=O CPGJELLYDQEDRK-NAKRPEOUSA-N 0.000 description 2
- BZWUSZGQOILYEU-STECZYCISA-N Val-Ile-Tyr Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 BZWUSZGQOILYEU-STECZYCISA-N 0.000 description 2
- OTJMMKPMLUNTQT-AVGNSLFASA-N Val-Leu-Arg Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](C(C)C)N OTJMMKPMLUNTQT-AVGNSLFASA-N 0.000 description 2
- FEXILLGKGGTLRI-NHCYSSNCSA-N Val-Leu-Asn Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](C(C)C)N FEXILLGKGGTLRI-NHCYSSNCSA-N 0.000 description 2
- BMOFUVHDBROBSE-DCAQKATOSA-N Val-Leu-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](C(C)C)N BMOFUVHDBROBSE-DCAQKATOSA-N 0.000 description 2
- HGJRMXOWUWVUOA-GVXVVHGQSA-N Val-Leu-Gln Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](C(C)C)N HGJRMXOWUWVUOA-GVXVVHGQSA-N 0.000 description 2
- UMPVMAYCLYMYGA-ONGXEEELSA-N Val-Leu-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O UMPVMAYCLYMYGA-ONGXEEELSA-N 0.000 description 2
- XTDDIVQWDXMRJL-IHRRRGAJSA-N Val-Leu-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](C(C)C)N XTDDIVQWDXMRJL-IHRRRGAJSA-N 0.000 description 2
- RWOGENDAOGMHLX-DCAQKATOSA-N Val-Lys-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](C(C)C)N RWOGENDAOGMHLX-DCAQKATOSA-N 0.000 description 2
- WBAJDGWKRIHOAC-GVXVVHGQSA-N Val-Lys-Gln Chemical compound [H]N[C@@H](C(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(O)=O WBAJDGWKRIHOAC-GVXVVHGQSA-N 0.000 description 2
- DIOSYUIWOQCXNR-ONGXEEELSA-N Val-Lys-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O DIOSYUIWOQCXNR-ONGXEEELSA-N 0.000 description 2
- CXWJFWAZIVWBOS-XQQFMLRXSA-N Val-Lys-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N1CCC[C@@H]1C(=O)O)N CXWJFWAZIVWBOS-XQQFMLRXSA-N 0.000 description 2
- VPGCVZRRBYOGCD-AVGNSLFASA-N Val-Lys-Val Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O VPGCVZRRBYOGCD-AVGNSLFASA-N 0.000 description 2
- FMQGYTMERWBMSI-HJWJTTGWSA-N Val-Phe-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](C(C)C)N FMQGYTMERWBMSI-HJWJTTGWSA-N 0.000 description 2
- MHHAWNPHDLCPLF-ULQDDVLXSA-N Val-Phe-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C(C)C)CC1=CC=CC=C1 MHHAWNPHDLCPLF-ULQDDVLXSA-N 0.000 description 2
- XBJKAZATRJBDCU-GUBZILKMSA-N Val-Pro-Ala Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O XBJKAZATRJBDCU-GUBZILKMSA-N 0.000 description 2
- SJRUJQFQVLMZFW-WPRPVWTQSA-N Val-Pro-Gly Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O SJRUJQFQVLMZFW-WPRPVWTQSA-N 0.000 description 2
- LTTQCQRTSHJPPL-ZKWXMUAHSA-N Val-Ser-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(=O)O)C(=O)O)N LTTQCQRTSHJPPL-ZKWXMUAHSA-N 0.000 description 2
- PGQUDQYHWICSAB-NAKRPEOUSA-N Val-Ser-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](C(C)C)N PGQUDQYHWICSAB-NAKRPEOUSA-N 0.000 description 2
- VHIZXDZMTDVFGX-DCAQKATOSA-N Val-Ser-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](C(C)C)N VHIZXDZMTDVFGX-DCAQKATOSA-N 0.000 description 2
- NZYNRRGJJVSSTJ-GUBZILKMSA-N Val-Ser-Val Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O NZYNRRGJJVSSTJ-GUBZILKMSA-N 0.000 description 2
- DLRZGNXCXUGIDG-KKHAAJSZSA-N Val-Thr-Asp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N)O DLRZGNXCXUGIDG-KKHAAJSZSA-N 0.000 description 2
- UQMPYVLTQCGRSK-IFFSRLJSSA-N Val-Thr-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](C(C)C)N)O UQMPYVLTQCGRSK-IFFSRLJSSA-N 0.000 description 2
- QPJSIBAOZBVELU-BPNCWPANSA-N Val-Tyr-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](C(C)C)N QPJSIBAOZBVELU-BPNCWPANSA-N 0.000 description 2
- JXCOEPXCBVCTRD-JYJNAYRXSA-N Val-Tyr-Arg Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N JXCOEPXCBVCTRD-JYJNAYRXSA-N 0.000 description 2
- VTIAEOKFUJJBTC-YDHLFZDLSA-N Val-Tyr-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N VTIAEOKFUJJBTC-YDHLFZDLSA-N 0.000 description 2
- DOBHJKVVACOQTN-DZKIICNBSA-N Val-Tyr-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C(C)C)CC1=CC=C(O)C=C1 DOBHJKVVACOQTN-DZKIICNBSA-N 0.000 description 2
- JPBGMZDTPVGGMQ-ULQDDVLXSA-N Val-Tyr-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N JPBGMZDTPVGGMQ-ULQDDVLXSA-N 0.000 description 2
- IECQJCJNPJVUSB-IHRRRGAJSA-N Val-Tyr-Ser Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](Cc1ccc(O)cc1)C(=O)N[C@@H](CO)C(O)=O IECQJCJNPJVUSB-IHRRRGAJSA-N 0.000 description 2
- OWFGFHQMSBTKLX-UFYCRDLUSA-N Val-Tyr-Tyr Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O)N OWFGFHQMSBTKLX-UFYCRDLUSA-N 0.000 description 2
- ZNGPROMGGGFOAA-JYJNAYRXSA-N Val-Tyr-Val Chemical compound CC(C)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C(C)C)C(O)=O)CC1=CC=C(O)C=C1 ZNGPROMGGGFOAA-JYJNAYRXSA-N 0.000 description 2
- RTJPAGFXOWEBAI-SRVKXCTJSA-N Val-Val-Arg Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N RTJPAGFXOWEBAI-SRVKXCTJSA-N 0.000 description 2
- DFQZDQPLWBSFEJ-LSJOCFKGSA-N Val-Val-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(=O)N)C(=O)O)N DFQZDQPLWBSFEJ-LSJOCFKGSA-N 0.000 description 2
- ZHWZDZFWBXWPDW-GUBZILKMSA-N Val-Val-Cys Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CS)C(O)=O ZHWZDZFWBXWPDW-GUBZILKMSA-N 0.000 description 2
- XNLUVJPMPAZHCY-JYJNAYRXSA-N Val-Val-Phe Chemical compound CC(C)[C@H]([NH3+])C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C([O-])=O)CC1=CC=CC=C1 XNLUVJPMPAZHCY-JYJNAYRXSA-N 0.000 description 2
- 108010086434 alanyl-seryl-glycine Proteins 0.000 description 2
- 108010011559 alanylphenylalanine Proteins 0.000 description 2
- 108010060035 arginylproline Proteins 0.000 description 2
- 108010068265 aspartyltyrosine Proteins 0.000 description 2
- 230000001580 bacterial effect Effects 0.000 description 2
- 230000037429 base substitution Effects 0.000 description 2
- 230000008901 benefit Effects 0.000 description 2
- 239000000872 buffer Substances 0.000 description 2
- 238000006243 chemical reaction Methods 0.000 description 2
- 238000010367 cloning Methods 0.000 description 2
- 238000009295 crossflow filtration Methods 0.000 description 2
- 230000001419 dependent effect Effects 0.000 description 2
- 238000001514 detection method Methods 0.000 description 2
- 230000029087 digestion Effects 0.000 description 2
- 239000002552 dosage form Substances 0.000 description 2
- 239000003814 drug Substances 0.000 description 2
- 239000003623 enhancer Substances 0.000 description 2
- 230000006862 enzymatic digestion Effects 0.000 description 2
- 230000002255 enzymatic effect Effects 0.000 description 2
- 210000003527 eukaryotic cell Anatomy 0.000 description 2
- 108010006664 gamma-glutamyl-glycyl-glycine Proteins 0.000 description 2
- 108010063718 gamma-glutamylaspartic acid Proteins 0.000 description 2
- 108010057083 glutamyl-aspartyl-leucine Proteins 0.000 description 2
- 108010073628 glutamyl-valyl-phenylalanine Proteins 0.000 description 2
- HPAIKDPJURGQLN-UHFFFAOYSA-N glycyl-L-histidyl-L-phenylalanine Natural products C=1C=CC=CC=1CC(C(O)=O)NC(=O)C(NC(=O)CN)CC1=CN=CN1 HPAIKDPJURGQLN-UHFFFAOYSA-N 0.000 description 2
- 108010075431 glycyl-alanyl-phenylalanine Proteins 0.000 description 2
- 108010081985 glycyl-cystinyl-aspartic acid Proteins 0.000 description 2
- 108010026364 glycyl-glycyl-leucine Proteins 0.000 description 2
- 108010066198 glycyl-leucyl-phenylalanine Proteins 0.000 description 2
- 108010050475 glycyl-leucyl-tyrosine Proteins 0.000 description 2
- 108010079413 glycyl-prolyl-glutamic acid Proteins 0.000 description 2
- 108010082286 glycyl-seryl-alanine Proteins 0.000 description 2
- 239000001963 growth medium Substances 0.000 description 2
- 210000002443 helper t lymphocyte Anatomy 0.000 description 2
- 108010028403 hemagglutinin esterase Proteins 0.000 description 2
- 108010036413 histidylglycine Proteins 0.000 description 2
- 230000036039 immunity Effects 0.000 description 2
- 230000010354 integration Effects 0.000 description 2
- 230000003993 interaction Effects 0.000 description 2
- 108010083708 leucyl-aspartyl-valine Proteins 0.000 description 2
- 108010051673 leucyl-glycyl-phenylalanine Proteins 0.000 description 2
- 108010091871 leucylmethionine Proteins 0.000 description 2
- 239000003446 ligand Substances 0.000 description 2
- 108010025153 lysyl-alanyl-alanine Proteins 0.000 description 2
- 108010044348 lysyl-glutamyl-aspartic acid Proteins 0.000 description 2
- 108700021021 mRNA Vaccine Proteins 0.000 description 2
- 239000003550 marker Substances 0.000 description 2
- 108010090114 methionyl-tyrosyl-lysine Proteins 0.000 description 2
- 108010068488 methionylphenylalanine Proteins 0.000 description 2
- 229960005030 other vaccine in atc Drugs 0.000 description 2
- 210000001672 ovary Anatomy 0.000 description 2
- 244000052769 pathogen Species 0.000 description 2
- 108010064486 phenylalanyl-leucyl-valine Proteins 0.000 description 2
- 108010089198 phenylalanyl-prolyl-arginine Proteins 0.000 description 2
- 108010024607 phenylalanylalanine Proteins 0.000 description 2
- 108010018625 phenylalanylarginine Proteins 0.000 description 2
- 108010073101 phenylalanylleucine Proteins 0.000 description 2
- 229920001223 polyethylene glycol Polymers 0.000 description 2
- 238000001556 precipitation Methods 0.000 description 2
- 108010020432 prolyl-prolylisoleucine Proteins 0.000 description 2
- 238000011321 prophylaxis Methods 0.000 description 2
- 230000009467 reduction Effects 0.000 description 2
- 108091008146 restriction endonucleases Proteins 0.000 description 2
- 235000019515 salmon Nutrition 0.000 description 2
- 150000003839 salts Chemical class 0.000 description 2
- 108010048818 seryl-histidine Proteins 0.000 description 2
- 108010007375 seryl-seryl-seryl-arginine Proteins 0.000 description 2
- 239000003381 stabilizer Substances 0.000 description 2
- 108010031491 threonyl-lysyl-glutamic acid Proteins 0.000 description 2
- 229940113082 thymine Drugs 0.000 description 2
- 238000010361 transduction Methods 0.000 description 2
- 230000026683 transduction Effects 0.000 description 2
- 230000009466 transformation Effects 0.000 description 2
- 230000014621 translational initiation Effects 0.000 description 2
- 238000005199 ultracentrifugation Methods 0.000 description 2
- 229940035893 uracil Drugs 0.000 description 2
- 238000012795 verification Methods 0.000 description 2
- 210000003501 vero cell Anatomy 0.000 description 2
- 230000029812 viral genome replication Effects 0.000 description 2
- 108010027345 wheylin-1 peptide Proteins 0.000 description 2
- 210000005253 yeast cell Anatomy 0.000 description 2
- CNKBMTKICGGSCQ-ACRUOGEOSA-N (2S)-2-[[(2S)-2-[[(2S)-2,6-diamino-1-oxohexyl]amino]-1-oxo-3-phenylpropyl]amino]-3-(4-hydroxyphenyl)propanoic acid Chemical compound C([C@H](NC(=O)[C@@H](N)CCCCN)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=CC=C1 CNKBMTKICGGSCQ-ACRUOGEOSA-N 0.000 description 1
- XVZCXCTYGHPNEM-IHRRRGAJSA-N (2s)-1-[(2s)-2-[[(2s)-2-amino-4-methylpentanoyl]amino]-4-methylpentanoyl]pyrrolidine-2-carboxylic acid Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(O)=O XVZCXCTYGHPNEM-IHRRRGAJSA-N 0.000 description 1
- CWFMWBHMIMNZLN-NAKRPEOUSA-N (2s)-1-[(2s)-2-[[(2s,3s)-2-amino-3-methylpentanoyl]amino]propanoyl]pyrrolidine-2-carboxylic acid Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C)C(=O)N1CCC[C@H]1C(O)=O CWFMWBHMIMNZLN-NAKRPEOUSA-N 0.000 description 1
- NTUPOKHATNSWCY-PMPSAXMXSA-N (2s)-2-[[(2s)-1-[(2r)-2-amino-3-phenylpropanoyl]pyrrolidine-2-carbonyl]amino]-5-(diaminomethylideneamino)pentanoic acid Chemical compound C([C@@H](N)C(=O)N1[C@@H](CCC1)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O)C1=CC=CC=C1 NTUPOKHATNSWCY-PMPSAXMXSA-N 0.000 description 1
- AXFMEGAFCUULFV-BLFANLJRSA-N (2s)-2-[[(2s)-1-[(2s,3r)-2-amino-3-methylpentanoyl]pyrrolidine-2-carbonyl]amino]pentanedioic acid Chemical compound CC[C@@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O AXFMEGAFCUULFV-BLFANLJRSA-N 0.000 description 1
- PQFMROVJTOPVDF-JBDRJPRFSA-N (2s)-2-[[(2s)-2-[[(2s)-2-[[(2s)-2-amino-3-carboxypropanoyl]amino]-3-carboxypropanoyl]amino]-4-carboxybutanoyl]amino]butanedioic acid Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O PQFMROVJTOPVDF-JBDRJPRFSA-N 0.000 description 1
- RRBGTUQJDFBWNN-MUGJNUQGSA-N (2s)-6-amino-2-[[(2s)-6-amino-2-[[(2s)-6-amino-2-[[(2s)-2,6-diaminohexanoyl]amino]hexanoyl]amino]hexanoyl]amino]hexanoic acid Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(O)=O RRBGTUQJDFBWNN-MUGJNUQGSA-N 0.000 description 1
- NOUIAHOPEGZYFE-JPLJXNOCSA-N (3S)-4-[[(2S)-1-[[(1S)-1-carboxy-2-(4-hydroxyphenyl)ethyl]amino]-3-methyl-1-oxobutan-2-yl]amino]-3-[[(2S)-2,6-diaminohexanoyl]amino]-4-oxobutanoic acid Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O NOUIAHOPEGZYFE-JPLJXNOCSA-N 0.000 description 1
- YYGNTYWPHWGJRM-UHFFFAOYSA-N (6E,10E,14E,18E)-2,6,10,15,19,23-hexamethyltetracosa-2,6,10,14,18,22-hexaene Chemical compound CC(C)=CCCC(C)=CCCC(C)=CCCC=C(C)CCC=C(C)CCC=C(C)C YYGNTYWPHWGJRM-UHFFFAOYSA-N 0.000 description 1
- MKRXAIMALGQSHI-UHFFFAOYSA-N 2-[[2-[[2-[(2-amino-3-methylpentanoyl)amino]-3-methylpentanoyl]amino]-3-methylbutanoyl]amino]-3-methylbutanoic acid Chemical compound CCC(C)C(N)C(=O)NC(C(C)CC)C(=O)NC(C(C)C)C(=O)NC(C(C)C)C(O)=O MKRXAIMALGQSHI-UHFFFAOYSA-N 0.000 description 1
- ZLOIGESWDJYCTF-UHFFFAOYSA-N 4-Thiouridine Natural products OC1C(O)C(CO)OC1N1C(=O)NC(=S)C=C1 ZLOIGESWDJYCTF-UHFFFAOYSA-N 0.000 description 1
- ZLOIGESWDJYCTF-XVFCMESISA-N 4-thiouridine Chemical group O[C@@H]1[C@H](O)[C@@H](CO)O[C@H]1N1C(=O)NC(=S)C=C1 ZLOIGESWDJYCTF-XVFCMESISA-N 0.000 description 1
- LQLQRFGHAALLLE-UHFFFAOYSA-N 5-bromouracil Chemical group BrC1=CNC(=O)NC1=O LQLQRFGHAALLLE-UHFFFAOYSA-N 0.000 description 1
- KSNXJLQDQOIRIP-UHFFFAOYSA-N 5-iodouracil Chemical group IC1=CNC(=O)NC1=O KSNXJLQDQOIRIP-UHFFFAOYSA-N 0.000 description 1
- UWQJHXKARZWDIJ-ZLUOBGJFSA-N Ala-Ala-Cys Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CS)C(O)=O UWQJHXKARZWDIJ-ZLUOBGJFSA-N 0.000 description 1
- LGQPPBQRUBVTIF-JBDRJPRFSA-N Ala-Ala-Ile Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O LGQPPBQRUBVTIF-JBDRJPRFSA-N 0.000 description 1
- SDMAQFGBPOJFOM-GUBZILKMSA-N Ala-Arg-Arg Chemical compound NC(=N)NCCC[C@H](NC(=O)[C@@H](N)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O SDMAQFGBPOJFOM-GUBZILKMSA-N 0.000 description 1
- DVWVZSJAYIJZFI-FXQIFTODSA-N Ala-Arg-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(O)=O DVWVZSJAYIJZFI-FXQIFTODSA-N 0.000 description 1
- FSBCNCKIQZZASN-GUBZILKMSA-N Ala-Arg-Met Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCSC)C(O)=O FSBCNCKIQZZASN-GUBZILKMSA-N 0.000 description 1
- JAMAWBXXKFGFGX-KZVJFYERSA-N Ala-Arg-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O JAMAWBXXKFGFGX-KZVJFYERSA-N 0.000 description 1
- YAXNATKKPOWVCP-ZLUOBGJFSA-N Ala-Asn-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(O)=O YAXNATKKPOWVCP-ZLUOBGJFSA-N 0.000 description 1
- LBJYAILUMSUTAM-ZLUOBGJFSA-N Ala-Asn-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O LBJYAILUMSUTAM-ZLUOBGJFSA-N 0.000 description 1
- PXKLCFFSVLKOJM-ACZMJKKPSA-N Ala-Asn-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O PXKLCFFSVLKOJM-ACZMJKKPSA-N 0.000 description 1
- STACJSVFHSEZJV-GHCJXIJMSA-N Ala-Asn-Ile Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O STACJSVFHSEZJV-GHCJXIJMSA-N 0.000 description 1
- XQGIRPGAVLFKBJ-CIUDSAMLSA-N Ala-Asn-Lys Chemical compound N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)O XQGIRPGAVLFKBJ-CIUDSAMLSA-N 0.000 description 1
- JYEBJTDTPNKQJG-FXQIFTODSA-N Ala-Asn-Met Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCSC)C(=O)O)N JYEBJTDTPNKQJG-FXQIFTODSA-N 0.000 description 1
- XCVRVWZTXPCYJT-BIIVOSGPSA-N Ala-Asn-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N1CCC[C@@H]1C(=O)O)N XCVRVWZTXPCYJT-BIIVOSGPSA-N 0.000 description 1
- MBWYUTNBYSSUIQ-HERUPUMHSA-N Ala-Asn-Trp Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N MBWYUTNBYSSUIQ-HERUPUMHSA-N 0.000 description 1
- XQJAFSDFQZPYCU-UWJYBYFXSA-N Ala-Asn-Tyr Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N XQJAFSDFQZPYCU-UWJYBYFXSA-N 0.000 description 1
- GSCLWXDNIMNIJE-ZLUOBGJFSA-N Ala-Asp-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O GSCLWXDNIMNIJE-ZLUOBGJFSA-N 0.000 description 1
- LZRNYBIJOSKKRJ-XVYDVKMFSA-N Ala-Asp-His Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N LZRNYBIJOSKKRJ-XVYDVKMFSA-N 0.000 description 1
- ZIWWTZWAKYBUOB-CIUDSAMLSA-N Ala-Asp-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O ZIWWTZWAKYBUOB-CIUDSAMLSA-N 0.000 description 1
- FOWHQTWRLFTELJ-FXQIFTODSA-N Ala-Asp-Met Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCSC)C(=O)O)N FOWHQTWRLFTELJ-FXQIFTODSA-N 0.000 description 1
- MKZCBYZBCINNJN-DLOVCJGASA-N Ala-Asp-Phe Chemical compound C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 MKZCBYZBCINNJN-DLOVCJGASA-N 0.000 description 1
- YSMPVONNIWLJML-FXQIFTODSA-N Ala-Asp-Pro Chemical compound C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(O)=O YSMPVONNIWLJML-FXQIFTODSA-N 0.000 description 1
- NFDVJAKFMXHJEQ-HERUPUMHSA-N Ala-Asp-Trp Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N NFDVJAKFMXHJEQ-HERUPUMHSA-N 0.000 description 1
- KUDREHRZRIVKHS-UWJYBYFXSA-N Ala-Asp-Tyr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O KUDREHRZRIVKHS-UWJYBYFXSA-N 0.000 description 1
- IKKVASZHTMKJIR-ZKWXMUAHSA-N Ala-Asp-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O IKKVASZHTMKJIR-ZKWXMUAHSA-N 0.000 description 1
- MIPWEZAIMPYQST-FXQIFTODSA-N Ala-Cys-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CS)C(=O)N[C@@H](C(C)C)C(O)=O MIPWEZAIMPYQST-FXQIFTODSA-N 0.000 description 1
- LGFCAXJBAZESCF-ACZMJKKPSA-N Ala-Gln-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O LGFCAXJBAZESCF-ACZMJKKPSA-N 0.000 description 1
- ZODMADSIQZZBSQ-FXQIFTODSA-N Ala-Gln-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O ZODMADSIQZZBSQ-FXQIFTODSA-N 0.000 description 1
- AWAXZRDKUHOPBO-GUBZILKMSA-N Ala-Gln-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCCN)C(O)=O AWAXZRDKUHOPBO-GUBZILKMSA-N 0.000 description 1
- ZDYNWWQXFRUOEO-XDTLVQLUSA-N Ala-Gln-Tyr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O ZDYNWWQXFRUOEO-XDTLVQLUSA-N 0.000 description 1
- YIGLXQRFQVWFEY-NRPADANISA-N Ala-Gln-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O YIGLXQRFQVWFEY-NRPADANISA-N 0.000 description 1
- FUSPCLTUKXQREV-ACZMJKKPSA-N Ala-Glu-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O FUSPCLTUKXQREV-ACZMJKKPSA-N 0.000 description 1
- NJPMYXWVWQWCSR-ACZMJKKPSA-N Ala-Glu-Asn Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O NJPMYXWVWQWCSR-ACZMJKKPSA-N 0.000 description 1
- WKOBSJOZRJJVRZ-FXQIFTODSA-N Ala-Glu-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O WKOBSJOZRJJVRZ-FXQIFTODSA-N 0.000 description 1
- BVSGPHDECMJBDE-HGNGGELXSA-N Ala-Glu-His Chemical compound C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N BVSGPHDECMJBDE-HGNGGELXSA-N 0.000 description 1
- HXNNRBHASOSVPG-GUBZILKMSA-N Ala-Glu-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O HXNNRBHASOSVPG-GUBZILKMSA-N 0.000 description 1
- XYTNPQNAZREREP-XQXXSGGOSA-N Ala-Glu-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XYTNPQNAZREREP-XQXXSGGOSA-N 0.000 description 1
- NHLAEBFGWPXFGI-WHFBIAKZSA-N Ala-Gly-Asn Chemical compound C[C@@H](C(=O)NCC(=O)N[C@@H](CC(=O)N)C(=O)O)N NHLAEBFGWPXFGI-WHFBIAKZSA-N 0.000 description 1
- MPLOSMWGDNJSEV-WHFBIAKZSA-N Ala-Gly-Asp Chemical compound [H]N[C@@H](C)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O MPLOSMWGDNJSEV-WHFBIAKZSA-N 0.000 description 1
- LJFNNUBZSZCZFN-WHFBIAKZSA-N Ala-Gly-Cys Chemical compound N[C@@H](C)C(=O)NCC(=O)N[C@@H](CS)C(=O)O LJFNNUBZSZCZFN-WHFBIAKZSA-N 0.000 description 1
- QHASENCZLDHBGX-ONGXEEELSA-N Ala-Gly-Phe Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 QHASENCZLDHBGX-ONGXEEELSA-N 0.000 description 1
- NIZKGBJVCMRDKO-KWQFWETISA-N Ala-Gly-Tyr Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 NIZKGBJVCMRDKO-KWQFWETISA-N 0.000 description 1
- JDIQCVUDDFENPU-ZKWXMUAHSA-N Ala-His-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CNC=N1 JDIQCVUDDFENPU-ZKWXMUAHSA-N 0.000 description 1
- IVKWMMGFLAMMKJ-XVYDVKMFSA-N Ala-His-Asn Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CC(=O)N)C(=O)O)N IVKWMMGFLAMMKJ-XVYDVKMFSA-N 0.000 description 1
- HJGZVLLLBJLXFC-LSJOCFKGSA-N Ala-His-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C(C)C)C(O)=O HJGZVLLLBJLXFC-LSJOCFKGSA-N 0.000 description 1
- DVJSJDDYCYSMFR-ZKWXMUAHSA-N Ala-Ile-Gly Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O DVJSJDDYCYSMFR-ZKWXMUAHSA-N 0.000 description 1
- NMXKFWOEASXOGB-QSFUFRPTSA-N Ala-Ile-His Chemical compound C[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 NMXKFWOEASXOGB-QSFUFRPTSA-N 0.000 description 1
- CFPQUJZTLUQUTJ-HTFCKZLJSA-N Ala-Ile-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@H](C)N CFPQUJZTLUQUTJ-HTFCKZLJSA-N 0.000 description 1
- QCTFKEJEIMPOLW-JURCDPSOSA-N Ala-Ile-Phe Chemical compound C[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 QCTFKEJEIMPOLW-JURCDPSOSA-N 0.000 description 1
- HHRAXZAYZFFRAM-CIUDSAMLSA-N Ala-Leu-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O HHRAXZAYZFFRAM-CIUDSAMLSA-N 0.000 description 1
- LBYMZCVBOKYZNS-CIUDSAMLSA-N Ala-Leu-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O LBYMZCVBOKYZNS-CIUDSAMLSA-N 0.000 description 1
- NOGFDULFCFXBHB-CIUDSAMLSA-N Ala-Leu-Cys Chemical compound C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CS)C(=O)O)N NOGFDULFCFXBHB-CIUDSAMLSA-N 0.000 description 1
- VGMNWQOPSFBBBG-XUXIUFHCSA-N Ala-Leu-Leu-Asp Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O VGMNWQOPSFBBBG-XUXIUFHCSA-N 0.000 description 1
- OPZJWMJPCNNZNT-DCAQKATOSA-N Ala-Leu-Met Chemical compound C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(=O)O)N OPZJWMJPCNNZNT-DCAQKATOSA-N 0.000 description 1
- OYJCVIGKMXUVKB-GARJFASQSA-N Ala-Leu-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N OYJCVIGKMXUVKB-GARJFASQSA-N 0.000 description 1
- MEFILNJXAVSUTO-JXUBOQSCSA-N Ala-Leu-Thr Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MEFILNJXAVSUTO-JXUBOQSCSA-N 0.000 description 1
- MLNSNVLOEIYJIU-ZUDIRPEPSA-N Ala-Leu-Thr-Gln Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(O)=O MLNSNVLOEIYJIU-ZUDIRPEPSA-N 0.000 description 1
- QUIGLPSHIFPEOV-CIUDSAMLSA-N Ala-Lys-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O QUIGLPSHIFPEOV-CIUDSAMLSA-N 0.000 description 1
- LDLSENBXQNDTPB-DCAQKATOSA-N Ala-Lys-Arg Chemical compound NCCCC[C@H](NC(=O)[C@@H](N)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N LDLSENBXQNDTPB-DCAQKATOSA-N 0.000 description 1
- MFMDKJIPHSWSBM-GUBZILKMSA-N Ala-Lys-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O MFMDKJIPHSWSBM-GUBZILKMSA-N 0.000 description 1
- SUHLZMHFRALVSY-YUMQZZPRSA-N Ala-Lys-Gly Chemical compound NCCCC[C@H](NC(=O)[C@@H](N)C)C(=O)NCC(O)=O SUHLZMHFRALVSY-YUMQZZPRSA-N 0.000 description 1
- XHNLCGXYBXNRIS-BJDJZHNGSA-N Ala-Lys-Ile Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O XHNLCGXYBXNRIS-BJDJZHNGSA-N 0.000 description 1
- PMQXMXAASGFUDX-SRVKXCTJSA-N Ala-Lys-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](C)N)CCCCN PMQXMXAASGFUDX-SRVKXCTJSA-N 0.000 description 1
- VCSABYLVNWQYQE-UHFFFAOYSA-N Ala-Lys-Lys Natural products NCCCCC(NC(=O)C(N)C)C(=O)NC(CCCCN)C(O)=O VCSABYLVNWQYQE-UHFFFAOYSA-N 0.000 description 1
- NINQYGGNRIBFSC-CIUDSAMLSA-N Ala-Lys-Ser Chemical compound NCCCC[C@H](NC(=O)[C@@H](N)C)C(=O)N[C@@H](CO)C(O)=O NINQYGGNRIBFSC-CIUDSAMLSA-N 0.000 description 1
- KQESEZXHYOUIIM-CQDKDKBSSA-N Ala-Lys-Tyr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O KQESEZXHYOUIIM-CQDKDKBSSA-N 0.000 description 1
- MDNAVFBZPROEHO-DCAQKATOSA-N Ala-Lys-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O MDNAVFBZPROEHO-DCAQKATOSA-N 0.000 description 1
- XUCHENWTTBFODJ-FXQIFTODSA-N Ala-Met-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C)C(O)=O XUCHENWTTBFODJ-FXQIFTODSA-N 0.000 description 1
- NLOMBWNGESDVJU-GUBZILKMSA-N Ala-Met-Arg Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O NLOMBWNGESDVJU-GUBZILKMSA-N 0.000 description 1
- PVQLRJRPUTXFFX-CIUDSAMLSA-N Ala-Met-Gln Chemical compound CSCC[C@H](NC(=O)[C@H](C)N)C(=O)N[C@@H](CCC(N)=O)C(O)=O PVQLRJRPUTXFFX-CIUDSAMLSA-N 0.000 description 1
- DWYROCSXOOMOEU-CIUDSAMLSA-N Ala-Met-Glu Chemical compound C[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N DWYROCSXOOMOEU-CIUDSAMLSA-N 0.000 description 1
- 108010011667 Ala-Phe-Ala Proteins 0.000 description 1
- XRUJOVRWNMBAAA-NHCYSSNCSA-N Ala-Phe-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=CC=C1 XRUJOVRWNMBAAA-NHCYSSNCSA-N 0.000 description 1
- BFMIRJBURUXDRG-DLOVCJGASA-N Ala-Phe-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=CC=C1 BFMIRJBURUXDRG-DLOVCJGASA-N 0.000 description 1
- KYDYGANDJHFBCW-DRZSPHRISA-N Ala-Phe-Gln Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N KYDYGANDJHFBCW-DRZSPHRISA-N 0.000 description 1
- CNQAFFMNJIQYGX-DRZSPHRISA-N Ala-Phe-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=CC=C1 CNQAFFMNJIQYGX-DRZSPHRISA-N 0.000 description 1
- HYIDEIQUCBKIPL-CQDKDKBSSA-N Ala-Phe-His Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N HYIDEIQUCBKIPL-CQDKDKBSSA-N 0.000 description 1
- ZBLQIYPCUWZSRZ-QEJZJMRPSA-N Ala-Phe-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](C)N)CC1=CC=CC=C1 ZBLQIYPCUWZSRZ-QEJZJMRPSA-N 0.000 description 1
- WEZNQZHACPSMEF-QEJZJMRPSA-N Ala-Phe-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=CC=C1 WEZNQZHACPSMEF-QEJZJMRPSA-N 0.000 description 1
- CYBJZLQSUJEMAS-LFSVMHDDSA-N Ala-Phe-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](C)N)O CYBJZLQSUJEMAS-LFSVMHDDSA-N 0.000 description 1
- CUOMGDPDITUMIJ-HZZBMVKVSA-N Ala-Phe-Thr-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H]([C@H](O)C)NC(=O)[C@@H](NC(=O)[C@H](C)N)CC1=CC=CC=C1 CUOMGDPDITUMIJ-HZZBMVKVSA-N 0.000 description 1
- MAZZQZWCCYJQGZ-GUBZILKMSA-N Ala-Pro-Arg Chemical compound [H]N[C@@H](C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(O)=O MAZZQZWCCYJQGZ-GUBZILKMSA-N 0.000 description 1
- OLVCTPPSXNRGKV-GUBZILKMSA-N Ala-Pro-Pro Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 OLVCTPPSXNRGKV-GUBZILKMSA-N 0.000 description 1
- XWFWAXPOLRTDFZ-FXQIFTODSA-N Ala-Pro-Ser Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O XWFWAXPOLRTDFZ-FXQIFTODSA-N 0.000 description 1
- RTZCUEHYUQZIDE-WHFBIAKZSA-N Ala-Ser-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O RTZCUEHYUQZIDE-WHFBIAKZSA-N 0.000 description 1
- MMLHRUJLOUSRJX-CIUDSAMLSA-N Ala-Ser-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCCN MMLHRUJLOUSRJX-CIUDSAMLSA-N 0.000 description 1
- OMCKWYSDUQBYCN-FXQIFTODSA-N Ala-Ser-Met Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCSC)C(O)=O OMCKWYSDUQBYCN-FXQIFTODSA-N 0.000 description 1
- PEEYDECOOVQKRZ-DLOVCJGASA-N Ala-Ser-Phe Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O PEEYDECOOVQKRZ-DLOVCJGASA-N 0.000 description 1
- SYIFFFHSXBNPMC-UWJYBYFXSA-N Ala-Ser-Tyr Chemical compound C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N SYIFFFHSXBNPMC-UWJYBYFXSA-N 0.000 description 1
- OEVCHROQUIVQFZ-YTLHQDLWSA-N Ala-Thr-Ala Chemical compound C[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](C)C(O)=O OEVCHROQUIVQFZ-YTLHQDLWSA-N 0.000 description 1
- VRTOMXFZHGWHIJ-KZVJFYERSA-N Ala-Thr-Arg Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O VRTOMXFZHGWHIJ-KZVJFYERSA-N 0.000 description 1
- WNHNMKOFKCHKKD-BFHQHQDPSA-N Ala-Thr-Gly Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O WNHNMKOFKCHKKD-BFHQHQDPSA-N 0.000 description 1
- VNFSAYFQLXPHPY-CIQUZCHMSA-N Ala-Thr-Ile Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O VNFSAYFQLXPHPY-CIQUZCHMSA-N 0.000 description 1
- JJHBEVZAZXZREW-LFSVMHDDSA-N Ala-Thr-Phe Chemical compound C[C@@H](O)[C@H](NC(=O)[C@H](C)N)C(=O)N[C@@H](Cc1ccccc1)C(O)=O JJHBEVZAZXZREW-LFSVMHDDSA-N 0.000 description 1
- QOIGKCBMXUCDQU-KDXUFGMBSA-N Ala-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](C)N)O QOIGKCBMXUCDQU-KDXUFGMBSA-N 0.000 description 1
- YXXPVUOMPSZURS-ZLIFDBKOSA-N Ala-Trp-Leu Chemical compound C1=CC=C2C(C[C@@H](C(=O)N[C@@H](CC(C)C)C(O)=O)NC(=O)[C@H](C)N)=CNC2=C1 YXXPVUOMPSZURS-ZLIFDBKOSA-N 0.000 description 1
- XPBVBZPVNFIHOA-UVBJJODRSA-N Ala-Trp-Val Chemical compound C1=CC=C2C(C[C@@H](C(=O)N[C@@H](C(C)C)C(O)=O)NC(=O)[C@H](C)N)=CNC2=C1 XPBVBZPVNFIHOA-UVBJJODRSA-N 0.000 description 1
- AENHOIXXHKNIQL-AUTRQRHGSA-N Ala-Tyr-Ala Chemical compound [O-]C(=O)[C@H](C)NC(=O)[C@@H](NC(=O)[C@@H]([NH3+])C)CC1=CC=C(O)C=C1 AENHOIXXHKNIQL-AUTRQRHGSA-N 0.000 description 1
- AOAKQKVICDWCLB-UWJYBYFXSA-N Ala-Tyr-Asn Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N AOAKQKVICDWCLB-UWJYBYFXSA-N 0.000 description 1
- KLKARCOHVHLAJP-UWJYBYFXSA-N Ala-Tyr-Cys Chemical compound C[C@H](N)C(=O)N[C@@H](Cc1ccc(O)cc1)C(=O)N[C@@H](CS)C(O)=O KLKARCOHVHLAJP-UWJYBYFXSA-N 0.000 description 1
- BHFOJPDOQPWJRN-XDTLVQLUSA-N Ala-Tyr-Gln Chemical compound C[C@H](N)C(=O)N[C@@H](Cc1ccc(O)cc1)C(=O)N[C@@H](CCC(N)=O)C(O)=O BHFOJPDOQPWJRN-XDTLVQLUSA-N 0.000 description 1
- BGGAIXWIZCIFSG-XDTLVQLUSA-N Ala-Tyr-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O BGGAIXWIZCIFSG-XDTLVQLUSA-N 0.000 description 1
- XAXMJQUMRJAFCH-CQDKDKBSSA-N Ala-Tyr-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=C(O)C=C1 XAXMJQUMRJAFCH-CQDKDKBSSA-N 0.000 description 1
- JNJHNBXBGNJESC-KKXDTOCCSA-N Ala-Tyr-Phe Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O JNJHNBXBGNJESC-KKXDTOCCSA-N 0.000 description 1
- JPOQZCHGOTWRTM-FQPOAREZSA-N Ala-Tyr-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O JPOQZCHGOTWRTM-FQPOAREZSA-N 0.000 description 1
- MUGAESARFRGOTQ-IGNZVWTISA-N Ala-Tyr-Tyr Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O)N MUGAESARFRGOTQ-IGNZVWTISA-N 0.000 description 1
- YEBZNKPPOHFZJM-BPNCWPANSA-N Ala-Tyr-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(O)=O YEBZNKPPOHFZJM-BPNCWPANSA-N 0.000 description 1
- XSLGWYYNOSUMRM-ZKWXMUAHSA-N Ala-Val-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O XSLGWYYNOSUMRM-ZKWXMUAHSA-N 0.000 description 1
- DDPKBJZLAXLQGZ-KBIXCLLPSA-N Ala-Val-Asp-Ser Chemical compound C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O DDPKBJZLAXLQGZ-KBIXCLLPSA-N 0.000 description 1
- ZCUFMRIQCPNOHZ-NRPADANISA-N Ala-Val-Gln Chemical compound C[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N ZCUFMRIQCPNOHZ-NRPADANISA-N 0.000 description 1
- CLOMBHBBUKAUBP-LSJOCFKGSA-N Ala-Val-His Chemical compound C[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N CLOMBHBBUKAUBP-LSJOCFKGSA-N 0.000 description 1
- LYILPUNCKACNGF-NAKRPEOUSA-N Ala-Val-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](C)N LYILPUNCKACNGF-NAKRPEOUSA-N 0.000 description 1
- OMSKGWFGWCQFBD-KZVJFYERSA-N Ala-Val-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OMSKGWFGWCQFBD-KZVJFYERSA-N 0.000 description 1
- ZDILXFDENZVOTL-BPNCWPANSA-N Ala-Val-Tyr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O ZDILXFDENZVOTL-BPNCWPANSA-N 0.000 description 1
- 101100165663 Alternaria brassicicola bsc8 gene Proteins 0.000 description 1
- VKKYFICVTYKFIO-CIUDSAMLSA-N Arg-Ala-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCN=C(N)N VKKYFICVTYKFIO-CIUDSAMLSA-N 0.000 description 1
- VBFJESQBIWCWRL-DCAQKATOSA-N Arg-Ala-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCNC(N)=N VBFJESQBIWCWRL-DCAQKATOSA-N 0.000 description 1
- SBVJJNJLFWSJOV-UBHSHLNASA-N Arg-Ala-Phe Chemical compound NC(=N)NCCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 SBVJJNJLFWSJOV-UBHSHLNASA-N 0.000 description 1
- IASNWHAGGYTEKX-IUCAKERBSA-N Arg-Arg-Gly Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)NCC(O)=O IASNWHAGGYTEKX-IUCAKERBSA-N 0.000 description 1
- HJVGMOYJDDXLMI-AVGNSLFASA-N Arg-Arg-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCNC(N)=N)NC(=O)[C@@H](N)CCCNC(N)=N HJVGMOYJDDXLMI-AVGNSLFASA-N 0.000 description 1
- DCGLNNVKIZXQOJ-FXQIFTODSA-N Arg-Asn-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCCN=C(N)N)N DCGLNNVKIZXQOJ-FXQIFTODSA-N 0.000 description 1
- DPXDVGDLWJYZBH-GUBZILKMSA-N Arg-Asn-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O DPXDVGDLWJYZBH-GUBZILKMSA-N 0.000 description 1
- NONSEUUPKITYQT-BQBZGAKWSA-N Arg-Asn-Gly Chemical compound C(C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)NCC(=O)O)N)CN=C(N)N NONSEUUPKITYQT-BQBZGAKWSA-N 0.000 description 1
- KWTVWJPNHAOREN-IHRRRGAJSA-N Arg-Asn-Phe Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O KWTVWJPNHAOREN-IHRRRGAJSA-N 0.000 description 1
- PQWTZSNVWSOFFK-FXQIFTODSA-N Arg-Asp-Asn Chemical compound C(C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N)CN=C(N)N PQWTZSNVWSOFFK-FXQIFTODSA-N 0.000 description 1
- OZNSCVPYWZRQPY-CIUDSAMLSA-N Arg-Asp-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O OZNSCVPYWZRQPY-CIUDSAMLSA-N 0.000 description 1
- KMSHNDWHPWXPEC-BQBZGAKWSA-N Arg-Asp-Gly Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O KMSHNDWHPWXPEC-BQBZGAKWSA-N 0.000 description 1
- JSHVMZANPXCDTL-GMOBBJLQSA-N Arg-Asp-Ile Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JSHVMZANPXCDTL-GMOBBJLQSA-N 0.000 description 1
- OTCJMMRQBVDQRK-DCAQKATOSA-N Arg-Asp-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O OTCJMMRQBVDQRK-DCAQKATOSA-N 0.000 description 1
- SQKPKIJVWHAWNF-DCAQKATOSA-N Arg-Asp-Lys Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(O)=O SQKPKIJVWHAWNF-DCAQKATOSA-N 0.000 description 1
- HKRXJBBCQBAGIM-FXQIFTODSA-N Arg-Asp-Ser Chemical compound C(C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CO)C(=O)O)N)CN=C(N)N HKRXJBBCQBAGIM-FXQIFTODSA-N 0.000 description 1
- VXXHDZKEQNGXNU-QXEWZRGKSA-N Arg-Asp-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CCCN=C(N)N VXXHDZKEQNGXNU-QXEWZRGKSA-N 0.000 description 1
- YUGFLWBWAJFGKY-BQBZGAKWSA-N Arg-Cys-Gly Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CS)C(=O)NCC(O)=O YUGFLWBWAJFGKY-BQBZGAKWSA-N 0.000 description 1
- IGULQRCJLQQPSM-DCAQKATOSA-N Arg-Cys-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(O)=O IGULQRCJLQQPSM-DCAQKATOSA-N 0.000 description 1
- VDBKFYYIBLXEIF-GUBZILKMSA-N Arg-Gln-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O VDBKFYYIBLXEIF-GUBZILKMSA-N 0.000 description 1
- VNFWDYWTSHFRRG-SRVKXCTJSA-N Arg-Gln-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O VNFWDYWTSHFRRG-SRVKXCTJSA-N 0.000 description 1
- PBSOQGZLPFVXPU-YUMQZZPRSA-N Arg-Glu-Gly Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O PBSOQGZLPFVXPU-YUMQZZPRSA-N 0.000 description 1
- NKBQZKVMKJJDLX-SRVKXCTJSA-N Arg-Glu-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O NKBQZKVMKJJDLX-SRVKXCTJSA-N 0.000 description 1
- JAYIQMNQDMOBFY-KKUMJFAQSA-N Arg-Glu-Tyr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O JAYIQMNQDMOBFY-KKUMJFAQSA-N 0.000 description 1
- GOWZVQXTHUCNSQ-NHCYSSNCSA-N Arg-Glu-Val Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O GOWZVQXTHUCNSQ-NHCYSSNCSA-N 0.000 description 1
- YNSGXDWWPCGGQS-YUMQZZPRSA-N Arg-Gly-Gln Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)NCC(=O)N[C@@H](CCC(N)=O)C(O)=O YNSGXDWWPCGGQS-YUMQZZPRSA-N 0.000 description 1
- CYXCAHZVPFREJD-LURJTMIESA-N Arg-Gly-Gly Chemical compound NC(=N)NCCC[C@H](N)C(=O)NCC(=O)NCC(O)=O CYXCAHZVPFREJD-LURJTMIESA-N 0.000 description 1
- QKSAZKCRVQYYGS-UWVGGRQHSA-N Arg-Gly-His Chemical compound N[C@@H](CCCN=C(N)N)C(=O)NCC(=O)N[C@@H](Cc1cnc[nH]1)C(O)=O QKSAZKCRVQYYGS-UWVGGRQHSA-N 0.000 description 1
- QEHMMRSQJMOYNO-DCAQKATOSA-N Arg-His-Asn Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N QEHMMRSQJMOYNO-DCAQKATOSA-N 0.000 description 1
- BMNVSPMWMICFRV-DCAQKATOSA-N Arg-His-Asp Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(O)=O)C(O)=O)CC1=CN=CN1 BMNVSPMWMICFRV-DCAQKATOSA-N 0.000 description 1
- DGFXIWKPTDKBLF-AVGNSLFASA-N Arg-His-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CCCN=C(N)N)N DGFXIWKPTDKBLF-AVGNSLFASA-N 0.000 description 1
- AGVNTAUPLWIQEN-ZPFDUUQYSA-N Arg-Ile-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N AGVNTAUPLWIQEN-ZPFDUUQYSA-N 0.000 description 1
- FFEUXEAKYRCACT-PEDHHIEDSA-N Arg-Ile-Ile Chemical compound CC[C@H](C)[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)CCCNC(N)=N)[C@@H](C)CC)C(O)=O FFEUXEAKYRCACT-PEDHHIEDSA-N 0.000 description 1
- LLUGJARLJCGLAR-CYDGBPFRSA-N Arg-Ile-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N LLUGJARLJCGLAR-CYDGBPFRSA-N 0.000 description 1
- UHFUZWSZQKMDSX-DCAQKATOSA-N Arg-Leu-Asn Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N UHFUZWSZQKMDSX-DCAQKATOSA-N 0.000 description 1
- NIUDXSFNLBIWOB-DCAQKATOSA-N Arg-Leu-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N NIUDXSFNLBIWOB-DCAQKATOSA-N 0.000 description 1
- WMEVEPXNCMKNGH-IHRRRGAJSA-N Arg-Leu-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N WMEVEPXNCMKNGH-IHRRRGAJSA-N 0.000 description 1
- PZBSKYJGKNNYNK-ULQDDVLXSA-N Arg-Leu-Tyr Chemical compound CC(C)C[C@H](NC(=O)[C@@H](N)CCCN=C(N)N)C(=O)N[C@@H](Cc1ccc(O)cc1)C(O)=O PZBSKYJGKNNYNK-ULQDDVLXSA-N 0.000 description 1
- YVTHEZNOKSAWRW-DCAQKATOSA-N Arg-Lys-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O YVTHEZNOKSAWRW-DCAQKATOSA-N 0.000 description 1
- FSNVAJOPUDVQAR-AVGNSLFASA-N Arg-Lys-Arg Chemical compound NC(=N)NCCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O FSNVAJOPUDVQAR-AVGNSLFASA-N 0.000 description 1
- NGTYEHIRESTSRX-UWVGGRQHSA-N Arg-Lys-Gly Chemical compound NCCCC[C@@H](C(=O)NCC(O)=O)NC(=O)[C@@H](N)CCCN=C(N)N NGTYEHIRESTSRX-UWVGGRQHSA-N 0.000 description 1
- GRRXPUAICOGISM-RWMBFGLXSA-N Arg-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)[C@H](CCCN=C(N)N)N)C(=O)O GRRXPUAICOGISM-RWMBFGLXSA-N 0.000 description 1
- PYZPXCZNQSEHDT-GUBZILKMSA-N Arg-Met-Asn Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N PYZPXCZNQSEHDT-GUBZILKMSA-N 0.000 description 1
- OISWSORSLQOGFV-AVGNSLFASA-N Arg-Met-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCSC)NC(=O)[C@@H](N)CCCN=C(N)N OISWSORSLQOGFV-AVGNSLFASA-N 0.000 description 1
- DTBPLQNKYCYUOM-JYJNAYRXSA-N Arg-Met-Phe Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCSC)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 DTBPLQNKYCYUOM-JYJNAYRXSA-N 0.000 description 1
- LCBSSOCDWUTQQV-SDDRHHMPSA-N Arg-Met-Pro Chemical compound CSCC[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N LCBSSOCDWUTQQV-SDDRHHMPSA-N 0.000 description 1
- JCROZIFVIYMXHM-GUBZILKMSA-N Arg-Met-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](CCSC)NC(=O)[C@@H](N)CCCN=C(N)N JCROZIFVIYMXHM-GUBZILKMSA-N 0.000 description 1
- CZUHPNLXLWMYMG-UBHSHLNASA-N Arg-Phe-Ala Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C)C(O)=O)CC1=CC=CC=C1 CZUHPNLXLWMYMG-UBHSHLNASA-N 0.000 description 1
- FKQITMVNILRUCQ-IHRRRGAJSA-N Arg-Phe-Asp Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O FKQITMVNILRUCQ-IHRRRGAJSA-N 0.000 description 1
- VEAIMHJZTIDCIH-KKUMJFAQSA-N Arg-Phe-Gln Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O VEAIMHJZTIDCIH-KKUMJFAQSA-N 0.000 description 1
- UGZUVYDKAYNCII-ULQDDVLXSA-N Arg-Phe-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O UGZUVYDKAYNCII-ULQDDVLXSA-N 0.000 description 1
- LXMKTIZAGIBQRX-HRCADAONSA-N Arg-Phe-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)NC(=O)[C@H](CCCN=C(N)N)N)C(=O)O LXMKTIZAGIBQRX-HRCADAONSA-N 0.000 description 1
- DNBMCNQKNOKOSD-DCAQKATOSA-N Arg-Pro-Gln Chemical compound NC(N)=NCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O DNBMCNQKNOKOSD-DCAQKATOSA-N 0.000 description 1
- UULLJGQFCDXVTQ-CYDGBPFRSA-N Arg-Pro-Ile Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(O)=O UULLJGQFCDXVTQ-CYDGBPFRSA-N 0.000 description 1
- NGYHSXDNNOFHNE-AVGNSLFASA-N Arg-Pro-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O NGYHSXDNNOFHNE-AVGNSLFASA-N 0.000 description 1
- VUGWHBXPMAHEGZ-SRVKXCTJSA-N Arg-Pro-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CCCN=C(N)N VUGWHBXPMAHEGZ-SRVKXCTJSA-N 0.000 description 1
- AUIJUTGLPVHIRT-FXQIFTODSA-N Arg-Ser-Cys Chemical compound C(C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)O)N)CN=C(N)N AUIJUTGLPVHIRT-FXQIFTODSA-N 0.000 description 1
- ISJWBVIYRBAXEB-CIUDSAMLSA-N Arg-Ser-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(O)=O ISJWBVIYRBAXEB-CIUDSAMLSA-N 0.000 description 1
- KMFPQTITXUKJOV-DCAQKATOSA-N Arg-Ser-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O KMFPQTITXUKJOV-DCAQKATOSA-N 0.000 description 1
- JOTRDIXZHNQYGP-DCAQKATOSA-N Arg-Ser-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCCN=C(N)N)N JOTRDIXZHNQYGP-DCAQKATOSA-N 0.000 description 1
- ICRHGPYYXMWHIE-LPEHRKFASA-N Arg-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CCCN=C(N)N)N)C(=O)O ICRHGPYYXMWHIE-LPEHRKFASA-N 0.000 description 1
- LRPZJPMQGKGHSG-XGEHTFHBSA-N Arg-Ser-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCCN=C(N)N)N)O LRPZJPMQGKGHSG-XGEHTFHBSA-N 0.000 description 1
- ASQKVGRCKOFKIU-KZVJFYERSA-N Arg-Thr-Ala Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N)O ASQKVGRCKOFKIU-KZVJFYERSA-N 0.000 description 1
- AUZAXCPWMDBWEE-HJGDQZAQSA-N Arg-Thr-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O AUZAXCPWMDBWEE-HJGDQZAQSA-N 0.000 description 1
- ZJBUILVYSXQNSW-YTWAJWBKSA-N Arg-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N)O ZJBUILVYSXQNSW-YTWAJWBKSA-N 0.000 description 1
- ZUVMUOOHJYNJPP-XIRDDKMYSA-N Arg-Trp-Gln Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCC(N)=O)C(O)=O ZUVMUOOHJYNJPP-XIRDDKMYSA-N 0.000 description 1
- VJIQPOJMISSUPO-BVSLBCMMSA-N Arg-Trp-Tyr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O VJIQPOJMISSUPO-BVSLBCMMSA-N 0.000 description 1
- UVTGNSWSRSCPLP-UHFFFAOYSA-N Arg-Tyr Natural products NC(CCNC(=N)N)C(=O)NC(Cc1ccc(O)cc1)C(=O)O UVTGNSWSRSCPLP-UHFFFAOYSA-N 0.000 description 1
- QMQZYILAWUOLPV-JYJNAYRXSA-N Arg-Tyr-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CCCN=C(N)N)C(O)=O)CC1=CC=C(O)C=C1 QMQZYILAWUOLPV-JYJNAYRXSA-N 0.000 description 1
- BWMMKQPATDUYKB-IHRRRGAJSA-N Arg-Tyr-Asn Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(N)=O)C(O)=O)CC1=CC=C(O)C=C1 BWMMKQPATDUYKB-IHRRRGAJSA-N 0.000 description 1
- AOJYORNRFWWEIV-IHRRRGAJSA-N Arg-Tyr-Asp Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(O)=O)C(O)=O)CC1=CC=C(O)C=C1 AOJYORNRFWWEIV-IHRRRGAJSA-N 0.000 description 1
- VLIJAPRTSXSGFY-STQMWFEESA-N Arg-Tyr-Gly Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CC=C(O)C=C1 VLIJAPRTSXSGFY-STQMWFEESA-N 0.000 description 1
- FOWOZYAWODIRFZ-JYJNAYRXSA-N Arg-Tyr-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CCCN=C(N)N)N FOWOZYAWODIRFZ-JYJNAYRXSA-N 0.000 description 1
- WOZDCBHUGJVJPL-AVGNSLFASA-N Arg-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N WOZDCBHUGJVJPL-AVGNSLFASA-N 0.000 description 1
- FXGMURPOWCKNAZ-JYJNAYRXSA-N Arg-Val-Phe Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 FXGMURPOWCKNAZ-JYJNAYRXSA-N 0.000 description 1
- QLSRIZIDQXDQHK-RCWTZXSCSA-N Arg-Val-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O QLSRIZIDQXDQHK-RCWTZXSCSA-N 0.000 description 1
- WHLDJYNHXOMGMU-JYJNAYRXSA-N Arg-Val-Tyr Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 WHLDJYNHXOMGMU-JYJNAYRXSA-N 0.000 description 1
- YNDLOUMBVDVALC-ZLUOBGJFSA-N Asn-Ala-Ala Chemical compound C[C@@H](C(=O)N[C@@H](C)C(=O)O)NC(=O)[C@H](CC(=O)N)N YNDLOUMBVDVALC-ZLUOBGJFSA-N 0.000 description 1
- BRCVLJZIIFBSPF-ZLUOBGJFSA-N Asn-Ala-Cys Chemical compound C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC(=O)N)N BRCVLJZIIFBSPF-ZLUOBGJFSA-N 0.000 description 1
- SLKLLQWZQHXYSV-CIUDSAMLSA-N Asn-Ala-Lys Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCCN)C(O)=O SLKLLQWZQHXYSV-CIUDSAMLSA-N 0.000 description 1
- QQEWINYJRFBLNN-DLOVCJGASA-N Asn-Ala-Phe Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 QQEWINYJRFBLNN-DLOVCJGASA-N 0.000 description 1
- XWGJDUSDTRPQRK-ZLUOBGJFSA-N Asn-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC(N)=O XWGJDUSDTRPQRK-ZLUOBGJFSA-N 0.000 description 1
- AKEBUSZTMQLNIX-UWJYBYFXSA-N Asn-Ala-Tyr Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)[C@H](CC(=O)N)N AKEBUSZTMQLNIX-UWJYBYFXSA-N 0.000 description 1
- QEYJFBMTSMLPKZ-ZKWXMUAHSA-N Asn-Ala-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O QEYJFBMTSMLPKZ-ZKWXMUAHSA-N 0.000 description 1
- GMRGSBAMMMVDGG-GUBZILKMSA-N Asn-Arg-Arg Chemical compound C(C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N)CN=C(N)N GMRGSBAMMMVDGG-GUBZILKMSA-N 0.000 description 1
- XHFXZQHTLJVZBN-FXQIFTODSA-N Asn-Arg-Asn Chemical compound C(C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N)CN=C(N)N XHFXZQHTLJVZBN-FXQIFTODSA-N 0.000 description 1
- YJRORCOAFUZVKA-FXQIFTODSA-N Asn-Arg-Cys Chemical compound C(C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC(=O)N)N)CN=C(N)N YJRORCOAFUZVKA-FXQIFTODSA-N 0.000 description 1
- POOCJCRBHHMAOS-FXQIFTODSA-N Asn-Arg-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O POOCJCRBHHMAOS-FXQIFTODSA-N 0.000 description 1
- JEPNYDRDYNSFIU-QXEWZRGKSA-N Asn-Arg-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@@H](N)CC(N)=O)C(O)=O JEPNYDRDYNSFIU-QXEWZRGKSA-N 0.000 description 1
- ACRYGQFHAQHDSF-ZLUOBGJFSA-N Asn-Asn-Asn Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O ACRYGQFHAQHDSF-ZLUOBGJFSA-N 0.000 description 1
- YNSCBOUZTAGIGO-ZLUOBGJFSA-N Asn-Asn-Cys Chemical compound C([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CS)C(=O)O)N)C(=O)N YNSCBOUZTAGIGO-ZLUOBGJFSA-N 0.000 description 1
- RCENDENBBJFJHZ-ACZMJKKPSA-N Asn-Asn-Gln Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O RCENDENBBJFJHZ-ACZMJKKPSA-N 0.000 description 1
- PCKRJVZAQZWNKM-WHFBIAKZSA-N Asn-Asn-Gly Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O PCKRJVZAQZWNKM-WHFBIAKZSA-N 0.000 description 1
- DAPLJWATMAXPPZ-CIUDSAMLSA-N Asn-Asn-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CC(N)=O DAPLJWATMAXPPZ-CIUDSAMLSA-N 0.000 description 1
- VKCOHFFSTKCXEQ-OLHMAJIHSA-N Asn-Asn-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O VKCOHFFSTKCXEQ-OLHMAJIHSA-N 0.000 description 1
- BVLIJXXSXBUGEC-SRVKXCTJSA-N Asn-Asn-Tyr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O BVLIJXXSXBUGEC-SRVKXCTJSA-N 0.000 description 1
- GMCOADLDNLGOFE-ZLUOBGJFSA-N Asn-Asp-Cys Chemical compound C([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CS)C(=O)O)N)C(=O)N GMCOADLDNLGOFE-ZLUOBGJFSA-N 0.000 description 1
- XSGBIBGAMKTHMY-WHFBIAKZSA-N Asn-Asp-Gly Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O XSGBIBGAMKTHMY-WHFBIAKZSA-N 0.000 description 1
- ZDOQDYFZNGASEY-BIIVOSGPSA-N Asn-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC(=O)N)N)C(=O)O ZDOQDYFZNGASEY-BIIVOSGPSA-N 0.000 description 1
- HLTLEIXYIJDFOY-ZLUOBGJFSA-N Asn-Cys-Asn Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(N)=O)C(O)=O HLTLEIXYIJDFOY-ZLUOBGJFSA-N 0.000 description 1
- ZMWDUIIACVLIHK-GHCJXIJMSA-N Asn-Cys-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CC(=O)N)N ZMWDUIIACVLIHK-GHCJXIJMSA-N 0.000 description 1
- CZIXHXIJJZLYRJ-SRVKXCTJSA-N Asn-Cys-Tyr Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 CZIXHXIJJZLYRJ-SRVKXCTJSA-N 0.000 description 1
- FAEFJTCTNZTPHX-ACZMJKKPSA-N Asn-Gln-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O FAEFJTCTNZTPHX-ACZMJKKPSA-N 0.000 description 1
- AYKKKGFJXIDYLX-ACZMJKKPSA-N Asn-Gln-Asn Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O AYKKKGFJXIDYLX-ACZMJKKPSA-N 0.000 description 1
- HJRBIWRXULGMOA-ACZMJKKPSA-N Asn-Gln-Asp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O HJRBIWRXULGMOA-ACZMJKKPSA-N 0.000 description 1
- WPOLSNAQGVHROR-GUBZILKMSA-N Asn-Gln-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC(=O)N)N WPOLSNAQGVHROR-GUBZILKMSA-N 0.000 description 1
- BZMWJLLUAKSIMH-FXQIFTODSA-N Asn-Glu-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O BZMWJLLUAKSIMH-FXQIFTODSA-N 0.000 description 1
- MSBDSTRUMZFSEU-PEFMBERDSA-N Asn-Glu-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O MSBDSTRUMZFSEU-PEFMBERDSA-N 0.000 description 1
- JREOBWLIZLXRIS-GUBZILKMSA-N Asn-Glu-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O JREOBWLIZLXRIS-GUBZILKMSA-N 0.000 description 1
- UBKOVSLDWIHYSY-ACZMJKKPSA-N Asn-Glu-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O UBKOVSLDWIHYSY-ACZMJKKPSA-N 0.000 description 1
- DMLSCRJBWUEALP-LAEOZQHASA-N Asn-Glu-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O DMLSCRJBWUEALP-LAEOZQHASA-N 0.000 description 1
- OPEPUCYIGFEGSW-WDSKDSINSA-N Asn-Gly-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O OPEPUCYIGFEGSW-WDSKDSINSA-N 0.000 description 1
- PBSQFBAJKPLRJY-BYULHYEWSA-N Asn-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CC(=O)N)N PBSQFBAJKPLRJY-BYULHYEWSA-N 0.000 description 1
- GJFYPBDMUGGLFR-NKWVEPMBSA-N Asn-Gly-Pro Chemical compound C1C[C@@H](N(C1)C(=O)CNC(=O)[C@H](CC(=O)N)N)C(=O)O GJFYPBDMUGGLFR-NKWVEPMBSA-N 0.000 description 1
- RAKKBBHMTJSXOY-XVYDVKMFSA-N Asn-His-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(O)=O RAKKBBHMTJSXOY-XVYDVKMFSA-N 0.000 description 1
- SUEIIIFUBHDCCS-PBCZWWQYSA-N Asn-His-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SUEIIIFUBHDCCS-PBCZWWQYSA-N 0.000 description 1
- PTSDPWIHOYMRGR-UGYAYLCHSA-N Asn-Ile-Asn Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O PTSDPWIHOYMRGR-UGYAYLCHSA-N 0.000 description 1
- ANPFQTJEPONRPL-UGYAYLCHSA-N Asn-Ile-Asp Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O ANPFQTJEPONRPL-UGYAYLCHSA-N 0.000 description 1
- PHJPKNUWWHRAOC-PEFMBERDSA-N Asn-Ile-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N PHJPKNUWWHRAOC-PEFMBERDSA-N 0.000 description 1
- YYSYDIYQTUPNQQ-SXTJYALSSA-N Asn-Ile-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O YYSYDIYQTUPNQQ-SXTJYALSSA-N 0.000 description 1
- GQRDIVQPSMPQME-ZPFDUUQYSA-N Asn-Ile-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O GQRDIVQPSMPQME-ZPFDUUQYSA-N 0.000 description 1
- NLRJGXZWTKXRHP-DCAQKATOSA-N Asn-Leu-Arg Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O NLRJGXZWTKXRHP-DCAQKATOSA-N 0.000 description 1
- ZMUQQMGITUJQTI-CIUDSAMLSA-N Asn-Leu-Asn Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O ZMUQQMGITUJQTI-CIUDSAMLSA-N 0.000 description 1
- BXUHCIXDSWRSBS-CIUDSAMLSA-N Asn-Leu-Asp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O BXUHCIXDSWRSBS-CIUDSAMLSA-N 0.000 description 1
- WIDVAWAQBRAKTI-YUMQZZPRSA-N Asn-Leu-Gly Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O WIDVAWAQBRAKTI-YUMQZZPRSA-N 0.000 description 1
- GLWFAWNYGWBMOC-SRVKXCTJSA-N Asn-Leu-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O GLWFAWNYGWBMOC-SRVKXCTJSA-N 0.000 description 1
- UBGGJTMETLEXJD-DCAQKATOSA-N Asn-Leu-Met Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(O)=O UBGGJTMETLEXJD-DCAQKATOSA-N 0.000 description 1
- JLNFZLNDHONLND-GARJFASQSA-N Asn-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)N)N JLNFZLNDHONLND-GARJFASQSA-N 0.000 description 1
- DJIMLSXHXKWADV-CIUDSAMLSA-N Asn-Leu-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC(N)=O DJIMLSXHXKWADV-CIUDSAMLSA-N 0.000 description 1
- LZLCLRQMUQWUHJ-GUBZILKMSA-N Asn-Lys-Gln Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N LZLCLRQMUQWUHJ-GUBZILKMSA-N 0.000 description 1
- JWKDQOORUCYUIW-ZPFDUUQYSA-N Asn-Lys-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JWKDQOORUCYUIW-ZPFDUUQYSA-N 0.000 description 1
- ORJQQZIXTOYGGH-SRVKXCTJSA-N Asn-Lys-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O ORJQQZIXTOYGGH-SRVKXCTJSA-N 0.000 description 1
- ZYPWIUFLYMQZBS-SRVKXCTJSA-N Asn-Lys-Lys Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)N)N ZYPWIUFLYMQZBS-SRVKXCTJSA-N 0.000 description 1
- WXVGISRWSYGEDK-KKUMJFAQSA-N Asn-Lys-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC(=O)N)N WXVGISRWSYGEDK-KKUMJFAQSA-N 0.000 description 1
- AYOAHKWVQLNPDM-HJGDQZAQSA-N Asn-Lys-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O AYOAHKWVQLNPDM-HJGDQZAQSA-N 0.000 description 1
- NLDNNZKUSLAYFW-NHCYSSNCSA-N Asn-Lys-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O NLDNNZKUSLAYFW-NHCYSSNCSA-N 0.000 description 1
- QDXQWFBLUVTOFL-FXQIFTODSA-N Asn-Met-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](CC(=O)N)N QDXQWFBLUVTOFL-FXQIFTODSA-N 0.000 description 1
- NNDSLVWAQAUPPP-GUBZILKMSA-N Asn-Met-Met Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CC(=O)N)N NNDSLVWAQAUPPP-GUBZILKMSA-N 0.000 description 1
- VITDJIPIJZAVGC-VEVYYDQMSA-N Asn-Met-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O VITDJIPIJZAVGC-VEVYYDQMSA-N 0.000 description 1
- OROMFUQQTSWUTI-IHRRRGAJSA-N Asn-Phe-Arg Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N OROMFUQQTSWUTI-IHRRRGAJSA-N 0.000 description 1
- RTFWCVDISAMGEQ-SRVKXCTJSA-N Asn-Phe-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N RTFWCVDISAMGEQ-SRVKXCTJSA-N 0.000 description 1
- ZJIFRAPZHAGLGR-MELADBBJSA-N Asn-Phe-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)NC(=O)[C@H](CC(=O)N)N)C(=O)O ZJIFRAPZHAGLGR-MELADBBJSA-N 0.000 description 1
- BKFXFUPYETWGGA-XVSYOHENSA-N Asn-Phe-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O BKFXFUPYETWGGA-XVSYOHENSA-N 0.000 description 1
- RBOBTTLFPRSXKZ-BZSNNMDCSA-N Asn-Phe-Tyr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O RBOBTTLFPRSXKZ-BZSNNMDCSA-N 0.000 description 1
- JTXVXGXTRXMOFJ-FXQIFTODSA-N Asn-Pro-Asn Chemical compound NC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O JTXVXGXTRXMOFJ-FXQIFTODSA-N 0.000 description 1
- YUOXLJYVSZYPBJ-CIUDSAMLSA-N Asn-Pro-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O YUOXLJYVSZYPBJ-CIUDSAMLSA-N 0.000 description 1
- BYLSYQASFJJBCL-DCAQKATOSA-N Asn-Pro-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O BYLSYQASFJJBCL-DCAQKATOSA-N 0.000 description 1
- SZNGQSBRHFMZLT-IHRRRGAJSA-N Asn-Pro-Phe Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O SZNGQSBRHFMZLT-IHRRRGAJSA-N 0.000 description 1
- IDUUACUJKUXKKD-VEVYYDQMSA-N Asn-Pro-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O IDUUACUJKUXKKD-VEVYYDQMSA-N 0.000 description 1
- XTMZYFMTYJNABC-ZLUOBGJFSA-N Asn-Ser-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CC(=O)N)N XTMZYFMTYJNABC-ZLUOBGJFSA-N 0.000 description 1
- JWQWPRCDYWNVNM-ACZMJKKPSA-N Asn-Ser-Gln Chemical compound C(CC(=O)N)[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CC(=O)N)N JWQWPRCDYWNVNM-ACZMJKKPSA-N 0.000 description 1
- NPZJLGMWMDNQDD-GHCJXIJMSA-N Asn-Ser-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O NPZJLGMWMDNQDD-GHCJXIJMSA-N 0.000 description 1
- JXMREEPBRANWBY-VEVYYDQMSA-N Asn-Thr-Arg Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O JXMREEPBRANWBY-VEVYYDQMSA-N 0.000 description 1
- GOPFMQJUQDLUFW-LKXGYXEUSA-N Asn-Thr-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC(=O)N)N)O GOPFMQJUQDLUFW-LKXGYXEUSA-N 0.000 description 1
- QUMKPKWYDVMGNT-NUMRIWBASA-N Asn-Thr-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N)O QUMKPKWYDVMGNT-NUMRIWBASA-N 0.000 description 1
- FMNBYVSGRCXWEK-FOHZUACHSA-N Asn-Thr-Gly Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O FMNBYVSGRCXWEK-FOHZUACHSA-N 0.000 description 1
- WUQXMTITJLFXAU-JIOCBJNQSA-N Asn-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)N)N)O WUQXMTITJLFXAU-JIOCBJNQSA-N 0.000 description 1
- AMGQTNHANMRPOE-LKXGYXEUSA-N Asn-Thr-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O AMGQTNHANMRPOE-LKXGYXEUSA-N 0.000 description 1
- RDLYUKRPEJERMM-XIRDDKMYSA-N Asn-Trp-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC(C)C)C(O)=O RDLYUKRPEJERMM-XIRDDKMYSA-N 0.000 description 1
- CPYHLXSGDBDULY-IHPCNDPISA-N Asn-Trp-Phe Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O CPYHLXSGDBDULY-IHPCNDPISA-N 0.000 description 1
- RTFXPCYMDYBZNQ-SRVKXCTJSA-N Asn-Tyr-Asn Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(O)=O RTFXPCYMDYBZNQ-SRVKXCTJSA-N 0.000 description 1
- NSTBNYOKCZKOMI-AVGNSLFASA-N Asn-Tyr-Glu Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CC(=O)N)N)O NSTBNYOKCZKOMI-AVGNSLFASA-N 0.000 description 1
- NJPLPRFQLBZAMH-IHRRRGAJSA-N Asn-Tyr-Met Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCSC)C(O)=O NJPLPRFQLBZAMH-IHRRRGAJSA-N 0.000 description 1
- XEGZSHSPQNDNRH-JRQIVUDYSA-N Asn-Tyr-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XEGZSHSPQNDNRH-JRQIVUDYSA-N 0.000 description 1
- DPSUVAPLRQDWAO-YDHLFZDLSA-N Asn-Tyr-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CC(=O)N)N DPSUVAPLRQDWAO-YDHLFZDLSA-N 0.000 description 1
- MYRLSKYSMXNLLA-LAEOZQHASA-N Asn-Val-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O MYRLSKYSMXNLLA-LAEOZQHASA-N 0.000 description 1
- JNCRAQVYJZGIOW-QSFUFRPTSA-N Asn-Val-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JNCRAQVYJZGIOW-QSFUFRPTSA-N 0.000 description 1
- KBQOUDLMWYWXNP-YDHLFZDLSA-N Asn-Val-Phe Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CC(=O)N)N KBQOUDLMWYWXNP-YDHLFZDLSA-N 0.000 description 1
- GBAWQWASNGUNQF-ZLUOBGJFSA-N Asp-Ala-Cys Chemical compound C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC(=O)O)N GBAWQWASNGUNQF-ZLUOBGJFSA-N 0.000 description 1
- XEDQMTWEYFBOIK-ACZMJKKPSA-N Asp-Ala-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O XEDQMTWEYFBOIK-ACZMJKKPSA-N 0.000 description 1
- HPNDBHLITCHRSO-WHFBIAKZSA-N Asp-Ala-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(=O)NCC(O)=O HPNDBHLITCHRSO-WHFBIAKZSA-N 0.000 description 1
- SLHOOKXYTYAJGQ-XVYDVKMFSA-N Asp-Ala-His Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CNC=N1 SLHOOKXYTYAJGQ-XVYDVKMFSA-N 0.000 description 1
- XBQSLMACWDXWLJ-GHCJXIJMSA-N Asp-Ala-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O XBQSLMACWDXWLJ-GHCJXIJMSA-N 0.000 description 1
- NECWUSYTYSIFNC-DLOVCJGASA-N Asp-Ala-Phe Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 NECWUSYTYSIFNC-DLOVCJGASA-N 0.000 description 1
- NJIKKGUVGUBICV-ZLUOBGJFSA-N Asp-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC(O)=O NJIKKGUVGUBICV-ZLUOBGJFSA-N 0.000 description 1
- BLQBMRNMBAYREH-UWJYBYFXSA-N Asp-Ala-Tyr Chemical compound N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O BLQBMRNMBAYREH-UWJYBYFXSA-N 0.000 description 1
- RGKKALNPOYURGE-ZKWXMUAHSA-N Asp-Ala-Val Chemical compound N[C@@H](CC(=O)O)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)O RGKKALNPOYURGE-ZKWXMUAHSA-N 0.000 description 1
- OERMIMJQPQUIPK-FXQIFTODSA-N Asp-Arg-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(O)=O OERMIMJQPQUIPK-FXQIFTODSA-N 0.000 description 1
- ICAYWNTWHRRAQP-FXQIFTODSA-N Asp-Arg-Cys Chemical compound C(C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC(=O)O)N)CN=C(N)N ICAYWNTWHRRAQP-FXQIFTODSA-N 0.000 description 1
- WSOKZUVWBXVJHX-CIUDSAMLSA-N Asp-Arg-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O WSOKZUVWBXVJHX-CIUDSAMLSA-N 0.000 description 1
- MRQQMVZUHXUPEV-IHRRRGAJSA-N Asp-Arg-Phe Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O MRQQMVZUHXUPEV-IHRRRGAJSA-N 0.000 description 1
- FAEIQWHBRBWUBN-FXQIFTODSA-N Asp-Arg-Ser Chemical compound C(C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC(=O)O)N)CN=C(N)N FAEIQWHBRBWUBN-FXQIFTODSA-N 0.000 description 1
- BUVNWKQBMZLCDW-UGYAYLCHSA-N Asp-Asn-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O BUVNWKQBMZLCDW-UGYAYLCHSA-N 0.000 description 1
- JDHOJQJMWBKHDB-CIUDSAMLSA-N Asp-Asn-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC(=O)O)N JDHOJQJMWBKHDB-CIUDSAMLSA-N 0.000 description 1
- UGIBTKGQVWFTGX-BIIVOSGPSA-N Asp-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC(=O)O)N)C(=O)O UGIBTKGQVWFTGX-BIIVOSGPSA-N 0.000 description 1
- XACXDSRQIXRMNS-OLHMAJIHSA-N Asp-Asn-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC(=O)O)N)O XACXDSRQIXRMNS-OLHMAJIHSA-N 0.000 description 1
- NAPNAGZWHQHZLG-ZLUOBGJFSA-N Asp-Asp-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC(=O)O)N NAPNAGZWHQHZLG-ZLUOBGJFSA-N 0.000 description 1
- FRSGNOZCTWDVFZ-ACZMJKKPSA-N Asp-Asp-Gln Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O FRSGNOZCTWDVFZ-ACZMJKKPSA-N 0.000 description 1
- CELPEWWLSXMVPH-CIUDSAMLSA-N Asp-Asp-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC(O)=O CELPEWWLSXMVPH-CIUDSAMLSA-N 0.000 description 1
- MJKBOVWWADWLHV-ZLUOBGJFSA-N Asp-Cys-Asp Chemical compound C([C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(=O)O)C(=O)O)N)C(=O)O MJKBOVWWADWLHV-ZLUOBGJFSA-N 0.000 description 1
- FTNVLGCFIJEMQT-CIUDSAMLSA-N Asp-Cys-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CC(=O)O)N FTNVLGCFIJEMQT-CIUDSAMLSA-N 0.000 description 1
- WEDGJJRCJNHYSF-SRVKXCTJSA-N Asp-Cys-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CC(=O)O)N WEDGJJRCJNHYSF-SRVKXCTJSA-N 0.000 description 1
- DZQKLNLLWFQONU-LKXGYXEUSA-N Asp-Cys-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CC(=O)O)N)O DZQKLNLLWFQONU-LKXGYXEUSA-N 0.000 description 1
- PJERDVUTUDZPGX-ZKWXMUAHSA-N Asp-Cys-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CS)NC(=O)[C@@H](N)CC(O)=O PJERDVUTUDZPGX-ZKWXMUAHSA-N 0.000 description 1
- BKXPJCBEHWFSTF-ACZMJKKPSA-N Asp-Gln-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O BKXPJCBEHWFSTF-ACZMJKKPSA-N 0.000 description 1
- RYKWOUUZJFSJOH-FXQIFTODSA-N Asp-Gln-Glu Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CC(=O)O)N RYKWOUUZJFSJOH-FXQIFTODSA-N 0.000 description 1
- PMEHKVHZQKJACS-PEFMBERDSA-N Asp-Gln-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O PMEHKVHZQKJACS-PEFMBERDSA-N 0.000 description 1
- HRGGPWBIMIQANI-GUBZILKMSA-N Asp-Gln-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O HRGGPWBIMIQANI-GUBZILKMSA-N 0.000 description 1
- OEUQMKNNOWJREN-AVGNSLFASA-N Asp-Gln-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC(=O)O)N OEUQMKNNOWJREN-AVGNSLFASA-N 0.000 description 1
- KIJLEFNHWSXHRU-NUMRIWBASA-N Asp-Gln-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KIJLEFNHWSXHRU-NUMRIWBASA-N 0.000 description 1
- JRBVWZLHBGYZNY-QEJZJMRPSA-N Asp-Gln-Trp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O JRBVWZLHBGYZNY-QEJZJMRPSA-N 0.000 description 1
- XJQRWGXKUSDEFI-ACZMJKKPSA-N Asp-Glu-Asn Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O XJQRWGXKUSDEFI-ACZMJKKPSA-N 0.000 description 1
- VFUXXFVCYZPOQG-WDSKDSINSA-N Asp-Glu-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O VFUXXFVCYZPOQG-WDSKDSINSA-N 0.000 description 1
- VILLWIDTHYPSLC-PEFMBERDSA-N Asp-Glu-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O VILLWIDTHYPSLC-PEFMBERDSA-N 0.000 description 1
- LTXGDRFJRZSZAV-CIUDSAMLSA-N Asp-Glu-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC(=O)O)N LTXGDRFJRZSZAV-CIUDSAMLSA-N 0.000 description 1
- XDGBFDYXZCMYEX-NUMRIWBASA-N Asp-Glu-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC(=O)O)N)O XDGBFDYXZCMYEX-NUMRIWBASA-N 0.000 description 1
- YDJVIBMKAMQPPP-LAEOZQHASA-N Asp-Glu-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O YDJVIBMKAMQPPP-LAEOZQHASA-N 0.000 description 1
- JUWZKMBALYLZCK-WHFBIAKZSA-N Asp-Gly-Asn Chemical compound OC(=O)C[C@H](N)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O JUWZKMBALYLZCK-WHFBIAKZSA-N 0.000 description 1
- HAFCJCDJGIOYPW-WDSKDSINSA-N Asp-Gly-Gln Chemical compound OC(=O)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(N)=O HAFCJCDJGIOYPW-WDSKDSINSA-N 0.000 description 1
- PSLSTUMPZILTAH-BYULHYEWSA-N Asp-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC(O)=O PSLSTUMPZILTAH-BYULHYEWSA-N 0.000 description 1
- QCVXMEHGFUMKCO-YUMQZZPRSA-N Asp-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC(O)=O QCVXMEHGFUMKCO-YUMQZZPRSA-N 0.000 description 1
- PZXPWHFYZXTFBI-YUMQZZPRSA-N Asp-Gly-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC(O)=O PZXPWHFYZXTFBI-YUMQZZPRSA-N 0.000 description 1
- KHGPWGKPYHPOIK-QWRGUYRKSA-N Asp-Gly-Phe Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O KHGPWGKPYHPOIK-QWRGUYRKSA-N 0.000 description 1
- SNDBKTFJWVEVPO-WHFBIAKZSA-N Asp-Gly-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H](CO)C(O)=O SNDBKTFJWVEVPO-WHFBIAKZSA-N 0.000 description 1
- SVABRQFIHCSNCI-FOHZUACHSA-N Asp-Gly-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O SVABRQFIHCSNCI-FOHZUACHSA-N 0.000 description 1
- PGUYEUCYVNZGGV-QWRGUYRKSA-N Asp-Gly-Tyr Chemical compound OC(=O)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 PGUYEUCYVNZGGV-QWRGUYRKSA-N 0.000 description 1
- TVIZQBFURPLQDV-DJFWLOJKSA-N Asp-His-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CC(=O)O)N TVIZQBFURPLQDV-DJFWLOJKSA-N 0.000 description 1
- WSXDIZFNQYTUJB-SRVKXCTJSA-N Asp-His-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(O)=O WSXDIZFNQYTUJB-SRVKXCTJSA-N 0.000 description 1
- ICZWAZVKLACMKR-CIUDSAMLSA-N Asp-His-Ser Chemical compound OC(=O)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CO)C(O)=O)CC1=CN=CN1 ICZWAZVKLACMKR-CIUDSAMLSA-N 0.000 description 1
- HOBNTSHITVVNBN-ZPFDUUQYSA-N Asp-Ile-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)O)NC(=O)[C@H](CC(=O)O)N HOBNTSHITVVNBN-ZPFDUUQYSA-N 0.000 description 1
- YFSLJHLQOALGSY-ZPFDUUQYSA-N Asp-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)O)N YFSLJHLQOALGSY-ZPFDUUQYSA-N 0.000 description 1
- PYXXJFRXIYAESU-PCBIJLKTSA-N Asp-Ile-Phe Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O PYXXJFRXIYAESU-PCBIJLKTSA-N 0.000 description 1
- KYQNAIMCTRZLNP-QSFUFRPTSA-N Asp-Ile-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O KYQNAIMCTRZLNP-QSFUFRPTSA-N 0.000 description 1
- CLUMZOKVGUWUFD-CIUDSAMLSA-N Asp-Leu-Asn Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O CLUMZOKVGUWUFD-CIUDSAMLSA-N 0.000 description 1
- AYFVRYXNDHBECD-YUMQZZPRSA-N Asp-Leu-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O AYFVRYXNDHBECD-YUMQZZPRSA-N 0.000 description 1
- ORRJQLIATJDMQM-HJGDQZAQSA-N Asp-Leu-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC(O)=O ORRJQLIATJDMQM-HJGDQZAQSA-N 0.000 description 1
- XWSIYTYNLKCLJB-CIUDSAMLSA-N Asp-Lys-Asn Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O XWSIYTYNLKCLJB-CIUDSAMLSA-N 0.000 description 1
- CTWCFPWFIGRAEP-CIUDSAMLSA-N Asp-Lys-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O CTWCFPWFIGRAEP-CIUDSAMLSA-N 0.000 description 1
- LBOVBQONZJRWPV-YUMQZZPRSA-N Asp-Lys-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O LBOVBQONZJRWPV-YUMQZZPRSA-N 0.000 description 1
- GKWFMNNNYZHJHV-SRVKXCTJSA-N Asp-Lys-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC(O)=O GKWFMNNNYZHJHV-SRVKXCTJSA-N 0.000 description 1
- DONWIPDSZZJHHK-HJGDQZAQSA-N Asp-Lys-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC(=O)O)N)O DONWIPDSZZJHHK-HJGDQZAQSA-N 0.000 description 1
- VMVUDJUXJKDGNR-FXQIFTODSA-N Asp-Met-Asn Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)O)N VMVUDJUXJKDGNR-FXQIFTODSA-N 0.000 description 1
- HSGOFISJLFDMBJ-CIUDSAMLSA-N Asp-Met-Gln Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)O)N HSGOFISJLFDMBJ-CIUDSAMLSA-N 0.000 description 1
- WQSXAPPYLGNMQL-IHRRRGAJSA-N Asp-Met-Tyr Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)[C@H](CC(=O)O)N WQSXAPPYLGNMQL-IHRRRGAJSA-N 0.000 description 1
- IDDMGSKZQDEDGA-SRVKXCTJSA-N Asp-Phe-Asn Chemical compound OC(=O)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(N)=O)C(O)=O)CC1=CC=CC=C1 IDDMGSKZQDEDGA-SRVKXCTJSA-N 0.000 description 1
- WZUZGDANRQPCDD-SRVKXCTJSA-N Asp-Phe-Cys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC(=O)O)N WZUZGDANRQPCDD-SRVKXCTJSA-N 0.000 description 1
- USNJAPJZSGTTPX-XVSYOHENSA-N Asp-Phe-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O USNJAPJZSGTTPX-XVSYOHENSA-N 0.000 description 1
- BWJZSLQJNBSUPM-FXQIFTODSA-N Asp-Pro-Asn Chemical compound OC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O BWJZSLQJNBSUPM-FXQIFTODSA-N 0.000 description 1
- MVRGBQGZSDJBSM-GMOBBJLQSA-N Asp-Pro-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CC(=O)O)N MVRGBQGZSDJBSM-GMOBBJLQSA-N 0.000 description 1
- UAXIKORUDGGIGA-DCAQKATOSA-N Asp-Pro-Lys Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CC(=O)O)N)C(=O)N[C@@H](CCCCN)C(=O)O UAXIKORUDGGIGA-DCAQKATOSA-N 0.000 description 1
- XUVTWGPERWIERB-IHRRRGAJSA-N Asp-Pro-Phe Chemical compound N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](Cc1ccccc1)C(O)=O XUVTWGPERWIERB-IHRRRGAJSA-N 0.000 description 1
- BKOIIURTQAJHAT-GUBZILKMSA-N Asp-Pro-Pro Chemical compound OC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 BKOIIURTQAJHAT-GUBZILKMSA-N 0.000 description 1
- RVMXMLSYBTXCAV-VEVYYDQMSA-N Asp-Pro-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O RVMXMLSYBTXCAV-VEVYYDQMSA-N 0.000 description 1
- CUQDCPXNZPDYFQ-ZLUOBGJFSA-N Asp-Ser-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O CUQDCPXNZPDYFQ-ZLUOBGJFSA-N 0.000 description 1
- FIAKNCXQFFKSSI-ZLUOBGJFSA-N Asp-Ser-Cys Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CS)C(O)=O FIAKNCXQFFKSSI-ZLUOBGJFSA-N 0.000 description 1
- KGHLGJAXYSVNJP-WHFBIAKZSA-N Asp-Ser-Gly Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O KGHLGJAXYSVNJP-WHFBIAKZSA-N 0.000 description 1
- VNXQRBXEQXLERQ-CIUDSAMLSA-N Asp-Ser-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CC(=O)O)N VNXQRBXEQXLERQ-CIUDSAMLSA-N 0.000 description 1
- ZQFRDAZBTSFGGW-SRVKXCTJSA-N Asp-Ser-Phe Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O ZQFRDAZBTSFGGW-SRVKXCTJSA-N 0.000 description 1
- XYPJXLLXNSAWHZ-SRVKXCTJSA-N Asp-Ser-Tyr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O XYPJXLLXNSAWHZ-SRVKXCTJSA-N 0.000 description 1
- MJJIHRWNWSQTOI-VEVYYDQMSA-N Asp-Thr-Arg Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O MJJIHRWNWSQTOI-VEVYYDQMSA-N 0.000 description 1
- YODBPLSWNJMZOJ-BPUTZDHNSA-N Asp-Trp-Arg Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](CC(=O)O)N YODBPLSWNJMZOJ-BPUTZDHNSA-N 0.000 description 1
- PLNJUJGNLDSFOP-UWJYBYFXSA-N Asp-Tyr-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C)C(O)=O PLNJUJGNLDSFOP-UWJYBYFXSA-N 0.000 description 1
- OYSYWMMZGJSQRB-AVGNSLFASA-N Asp-Tyr-Gln Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O OYSYWMMZGJSQRB-AVGNSLFASA-N 0.000 description 1
- BJDHEININLSZOT-KKUMJFAQSA-N Asp-Tyr-Lys Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCCN)C(O)=O BJDHEININLSZOT-KKUMJFAQSA-N 0.000 description 1
- VHUKCUHLFMRHOD-MELADBBJSA-N Asp-Tyr-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=C(C=C2)O)NC(=O)[C@H](CC(=O)O)N)C(=O)O VHUKCUHLFMRHOD-MELADBBJSA-N 0.000 description 1
- CZIVKMOEXPILDK-SRVKXCTJSA-N Asp-Tyr-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(O)=O CZIVKMOEXPILDK-SRVKXCTJSA-N 0.000 description 1
- GIKOVDMXBAFXDF-NHCYSSNCSA-N Asp-Val-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O GIKOVDMXBAFXDF-NHCYSSNCSA-N 0.000 description 1
- QOJJMJKTMKNFEF-ZKWXMUAHSA-N Asp-Val-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC(O)=O QOJJMJKTMKNFEF-ZKWXMUAHSA-N 0.000 description 1
- 241000894006 Bacteria Species 0.000 description 1
- 101100512078 Caenorhabditis elegans lys-1 gene Proteins 0.000 description 1
- 102000020313 Cell-Penetrating Peptides Human genes 0.000 description 1
- 108010051109 Cell-Penetrating Peptides Proteins 0.000 description 1
- 108091035707 Consensus sequence Proteins 0.000 description 1
- 241000494545 Cordyline virus 2 Species 0.000 description 1
- PLBJMUUEGBBHRH-ZLUOBGJFSA-N Cys-Ala-Asn Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(O)=O PLBJMUUEGBBHRH-ZLUOBGJFSA-N 0.000 description 1
- FMDCYTBSPZMPQE-JBDRJPRFSA-N Cys-Ala-Ile Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O FMDCYTBSPZMPQE-JBDRJPRFSA-N 0.000 description 1
- PRXCTTWKGJAPMT-ZLUOBGJFSA-N Cys-Ala-Ser Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O PRXCTTWKGJAPMT-ZLUOBGJFSA-N 0.000 description 1
- RRIJEABIXPKSGP-FXQIFTODSA-N Cys-Ala-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CS RRIJEABIXPKSGP-FXQIFTODSA-N 0.000 description 1
- MBPKYKSYUAPLMY-DCAQKATOSA-N Cys-Arg-Leu Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O MBPKYKSYUAPLMY-DCAQKATOSA-N 0.000 description 1
- NLCZGISONIGRQP-DCAQKATOSA-N Cys-Arg-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CS)N NLCZGISONIGRQP-DCAQKATOSA-N 0.000 description 1
- GEEXORWTBTUOHC-FXQIFTODSA-N Cys-Arg-Ser Chemical compound C(C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CS)N)CN=C(N)N GEEXORWTBTUOHC-FXQIFTODSA-N 0.000 description 1
- XGIAHEUULGOZHH-GUBZILKMSA-N Cys-Arg-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CS)N XGIAHEUULGOZHH-GUBZILKMSA-N 0.000 description 1
- UPJGYXRAPJWIHD-CIUDSAMLSA-N Cys-Asn-Leu Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O UPJGYXRAPJWIHD-CIUDSAMLSA-N 0.000 description 1
- CPTUXCUWQIBZIF-ZLUOBGJFSA-N Cys-Asn-Ser Chemical compound SC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O CPTUXCUWQIBZIF-ZLUOBGJFSA-N 0.000 description 1
- NQSUTVRXXBGVDQ-LKXGYXEUSA-N Cys-Asn-Thr Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O NQSUTVRXXBGVDQ-LKXGYXEUSA-N 0.000 description 1
- VZKXOWRNJDEGLZ-WHFBIAKZSA-N Cys-Asp-Gly Chemical compound SC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O VZKXOWRNJDEGLZ-WHFBIAKZSA-N 0.000 description 1
- NIPJKKSXHSBEMX-CIUDSAMLSA-N Cys-Asp-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CS)N NIPJKKSXHSBEMX-CIUDSAMLSA-N 0.000 description 1
- WDQXKVCQXRNOSI-GHCJXIJMSA-N Cys-Asp-Ile Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WDQXKVCQXRNOSI-GHCJXIJMSA-N 0.000 description 1
- QYKJOVAXAKTKBR-FXQIFTODSA-N Cys-Asp-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CS)N QYKJOVAXAKTKBR-FXQIFTODSA-N 0.000 description 1
- WXKWQSDHEXKKNC-ZKWXMUAHSA-N Cys-Asp-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CS)N WXKWQSDHEXKKNC-ZKWXMUAHSA-N 0.000 description 1
- LDIKUWLAMDFHPU-FXQIFTODSA-N Cys-Cys-Arg Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O LDIKUWLAMDFHPU-FXQIFTODSA-N 0.000 description 1
- SMYXEYRYCLIPIL-ZLUOBGJFSA-N Cys-Cys-Cys Chemical compound SC[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@@H](CS)C(O)=O SMYXEYRYCLIPIL-ZLUOBGJFSA-N 0.000 description 1
- ZIKWRNJXFIQECJ-CIUDSAMLSA-N Cys-Cys-Leu Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(O)=O ZIKWRNJXFIQECJ-CIUDSAMLSA-N 0.000 description 1
- LWTTURISBKEVAC-CIUDSAMLSA-N Cys-Cys-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CS)N LWTTURISBKEVAC-CIUDSAMLSA-N 0.000 description 1
- WYZLWZNAWQNLGQ-FXQIFTODSA-N Cys-Cys-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CS)N WYZLWZNAWQNLGQ-FXQIFTODSA-N 0.000 description 1
- QJUDRFBUWAGUSG-SRVKXCTJSA-N Cys-Cys-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CS)N QJUDRFBUWAGUSG-SRVKXCTJSA-N 0.000 description 1
- BVFQOPGFOQVZTE-ACZMJKKPSA-N Cys-Gln-Ala Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O BVFQOPGFOQVZTE-ACZMJKKPSA-N 0.000 description 1
- MBILEVLLOHJZMG-FXQIFTODSA-N Cys-Gln-Glu Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CS)N MBILEVLLOHJZMG-FXQIFTODSA-N 0.000 description 1
- YZKOXEJTLWZOQL-GUBZILKMSA-N Cys-Gln-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CS)N YZKOXEJTLWZOQL-GUBZILKMSA-N 0.000 description 1
- SFRQEQGPRTVDPO-NRPADANISA-N Cys-Gln-Val Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O SFRQEQGPRTVDPO-NRPADANISA-N 0.000 description 1
- RWGDABDXVXRLLH-ACZMJKKPSA-N Cys-Glu-Asn Chemical compound C(CC(=O)O)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CS)N RWGDABDXVXRLLH-ACZMJKKPSA-N 0.000 description 1
- VBPGTULCFGKGTF-ACZMJKKPSA-N Cys-Glu-Asp Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O VBPGTULCFGKGTF-ACZMJKKPSA-N 0.000 description 1
- KABHAOSDMIYXTR-GUBZILKMSA-N Cys-Glu-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CS)N KABHAOSDMIYXTR-GUBZILKMSA-N 0.000 description 1
- UYYZZJXUVIZTMH-AVGNSLFASA-N Cys-Glu-Phe Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O UYYZZJXUVIZTMH-AVGNSLFASA-N 0.000 description 1
- ZEXHDOQQYZKOIB-ACZMJKKPSA-N Cys-Glu-Ser Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O ZEXHDOQQYZKOIB-ACZMJKKPSA-N 0.000 description 1
- HQZGVYJBRSISDT-BQBZGAKWSA-N Cys-Gly-Arg Chemical compound [H]N[C@@H](CS)C(=O)NCC(=O)N[C@@H](CCCNC(N)=N)C(O)=O HQZGVYJBRSISDT-BQBZGAKWSA-N 0.000 description 1
- BSFFNUBDVYTDMV-WHFBIAKZSA-N Cys-Gly-Asn Chemical compound [H]N[C@@H](CS)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O BSFFNUBDVYTDMV-WHFBIAKZSA-N 0.000 description 1
- DZLQXIFVQFTFJY-BYPYZUCNSA-N Cys-Gly-Gly Chemical compound SC[C@H](N)C(=O)NCC(=O)NCC(O)=O DZLQXIFVQFTFJY-BYPYZUCNSA-N 0.000 description 1
- LBOLGUYQEPZSKM-YUMQZZPRSA-N Cys-Gly-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CS)N LBOLGUYQEPZSKM-YUMQZZPRSA-N 0.000 description 1
- DZSICRGTVPDCRN-YUMQZZPRSA-N Cys-Gly-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CS)N DZSICRGTVPDCRN-YUMQZZPRSA-N 0.000 description 1
- SKSJPIBFNFPTJB-NKWVEPMBSA-N Cys-Gly-Pro Chemical compound C1C[C@@H](N(C1)C(=O)CNC(=O)[C@H](CS)N)C(=O)O SKSJPIBFNFPTJB-NKWVEPMBSA-N 0.000 description 1
- ANRWXLYGJRSQEQ-CIUDSAMLSA-N Cys-His-Asp Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(O)=O)C(O)=O ANRWXLYGJRSQEQ-CIUDSAMLSA-N 0.000 description 1
- WAJDEKCJRKGRPG-CIUDSAMLSA-N Cys-His-Ser Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CS)N WAJDEKCJRKGRPG-CIUDSAMLSA-N 0.000 description 1
- LKUCSUGWHYVYLP-GHCJXIJMSA-N Cys-Ile-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CS)N LKUCSUGWHYVYLP-GHCJXIJMSA-N 0.000 description 1
- ODDOYXKAHLKKQY-MMWGEVLESA-N Cys-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CS)N ODDOYXKAHLKKQY-MMWGEVLESA-N 0.000 description 1
- MRVSLWQRNWEROS-SVSWQMSJSA-N Cys-Ile-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](CS)N MRVSLWQRNWEROS-SVSWQMSJSA-N 0.000 description 1
- MTNJRNQDDSWQQA-GQGQLFGLSA-N Cys-Ile-Trp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CS)N MTNJRNQDDSWQQA-GQGQLFGLSA-N 0.000 description 1
- KXUKWRVYDYIPSQ-CIUDSAMLSA-N Cys-Leu-Ala Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O KXUKWRVYDYIPSQ-CIUDSAMLSA-N 0.000 description 1
- ABLJDBFJPUWQQB-DCAQKATOSA-N Cys-Leu-Arg Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](CS)N ABLJDBFJPUWQQB-DCAQKATOSA-N 0.000 description 1
- DYBIDOHFRRUMLW-CIUDSAMLSA-N Cys-Leu-Cys Chemical compound CC(C)C[C@H](NC(=O)[C@@H](N)CS)C(=O)N[C@@H](CS)C(O)=O DYBIDOHFRRUMLW-CIUDSAMLSA-N 0.000 description 1
- IZUNQDRIAOLWCN-YUMQZZPRSA-N Cys-Leu-Gly Chemical compound CC(C)C[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CS)N IZUNQDRIAOLWCN-YUMQZZPRSA-N 0.000 description 1
- VPQZSNQICFCCSO-BJDJZHNGSA-N Cys-Leu-Ile Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O VPQZSNQICFCCSO-BJDJZHNGSA-N 0.000 description 1
- WVLZTXGTNGHPBO-SRVKXCTJSA-N Cys-Leu-Leu Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O WVLZTXGTNGHPBO-SRVKXCTJSA-N 0.000 description 1
- UCSXXFRXHGUXCQ-SRVKXCTJSA-N Cys-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CS)N UCSXXFRXHGUXCQ-SRVKXCTJSA-N 0.000 description 1
- VTBGVPWSWJBERH-DCAQKATOSA-N Cys-Leu-Met Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CS)N VTBGVPWSWJBERH-DCAQKATOSA-N 0.000 description 1
- CIBLYQCAZRYWHY-UHFFFAOYSA-N Cys-Leu-Phe-Cys Chemical compound SCC(N)C(=O)NC(CC(C)C)C(=O)NC(C(=O)NC(CS)C(O)=O)CC1=CC=CC=C1 CIBLYQCAZRYWHY-UHFFFAOYSA-N 0.000 description 1
- OHLLDUNVMPPUMD-DCAQKATOSA-N Cys-Leu-Val Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](C(C)C)C(=O)O)NC(=O)[C@H](CS)N OHLLDUNVMPPUMD-DCAQKATOSA-N 0.000 description 1
- BNCKELUXXUYRNY-GUBZILKMSA-N Cys-Lys-Glu Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CS)N BNCKELUXXUYRNY-GUBZILKMSA-N 0.000 description 1
- XMVZMBGFIOQONW-GARJFASQSA-N Cys-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)[C@H](CS)N)C(=O)O XMVZMBGFIOQONW-GARJFASQSA-N 0.000 description 1
- NIXHTNJAGGFBAW-CIUDSAMLSA-N Cys-Lys-Ser Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CS)N NIXHTNJAGGFBAW-CIUDSAMLSA-N 0.000 description 1
- LWYKPOCGGTYAIH-FXQIFTODSA-N Cys-Met-Asp Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CS)N LWYKPOCGGTYAIH-FXQIFTODSA-N 0.000 description 1
- MTNUYDIILCWPEP-GUBZILKMSA-N Cys-Met-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@H](CCSC)NC(=O)[C@@H](N)CS MTNUYDIILCWPEP-GUBZILKMSA-N 0.000 description 1
- SNHRIJBANHPWMO-XGEHTFHBSA-N Cys-Met-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](CS)N)O SNHRIJBANHPWMO-XGEHTFHBSA-N 0.000 description 1
- UIKLEGZPIOXFHJ-DLOVCJGASA-N Cys-Phe-Ala Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C)C(O)=O UIKLEGZPIOXFHJ-DLOVCJGASA-N 0.000 description 1
- WTEACWBAULENKE-SRVKXCTJSA-N Cys-Phe-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CS)N WTEACWBAULENKE-SRVKXCTJSA-N 0.000 description 1
- CHRCKSPMGYDLIA-SRVKXCTJSA-N Cys-Phe-Ser Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O CHRCKSPMGYDLIA-SRVKXCTJSA-N 0.000 description 1
- JEKIARHEWURQRJ-BZSNNMDCSA-N Cys-Phe-Tyr Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O)NC(=O)[C@H](CS)N JEKIARHEWURQRJ-BZSNNMDCSA-N 0.000 description 1
- CNAMJJOZGXPDHW-IHRRRGAJSA-N Cys-Pro-Phe Chemical compound N[C@@H](CS)C(=O)N1CCC[C@H]1C(=O)N[C@@H](Cc1ccccc1)C(O)=O CNAMJJOZGXPDHW-IHRRRGAJSA-N 0.000 description 1
- XBELMDARIGXDKY-GUBZILKMSA-N Cys-Pro-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CS)N XBELMDARIGXDKY-GUBZILKMSA-N 0.000 description 1
- CMYVIUWVYHOLRD-ZLUOBGJFSA-N Cys-Ser-Ala Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O CMYVIUWVYHOLRD-ZLUOBGJFSA-N 0.000 description 1
- BCFXQBXXDSEHRS-FXQIFTODSA-N Cys-Ser-Arg Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O BCFXQBXXDSEHRS-FXQIFTODSA-N 0.000 description 1
- KVCJEMHFLGVINV-ZLUOBGJFSA-N Cys-Ser-Asn Chemical compound SC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC(N)=O KVCJEMHFLGVINV-ZLUOBGJFSA-N 0.000 description 1
- SRZZZTMJARUVPI-JBDRJPRFSA-N Cys-Ser-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CS)N SRZZZTMJARUVPI-JBDRJPRFSA-N 0.000 description 1
- GGRDJANMZPGMNS-CIUDSAMLSA-N Cys-Ser-Leu Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O GGRDJANMZPGMNS-CIUDSAMLSA-N 0.000 description 1
- WZJLBUPPZRZNTO-CIUDSAMLSA-N Cys-Ser-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CS)N WZJLBUPPZRZNTO-CIUDSAMLSA-N 0.000 description 1
- HJXSYJVCMUOUNY-SRVKXCTJSA-N Cys-Ser-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CS)N HJXSYJVCMUOUNY-SRVKXCTJSA-N 0.000 description 1
- YNJBLTDKTMKEET-ZLUOBGJFSA-N Cys-Ser-Ser Chemical compound SC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O YNJBLTDKTMKEET-ZLUOBGJFSA-N 0.000 description 1
- NDNZRWUDUMTITL-FXQIFTODSA-N Cys-Ser-Val Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O NDNZRWUDUMTITL-FXQIFTODSA-N 0.000 description 1
- YWEHYKGJWHPGPY-XGEHTFHBSA-N Cys-Thr-Arg Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](CS)N)O YWEHYKGJWHPGPY-XGEHTFHBSA-N 0.000 description 1
- JIVJQYNNAYFXDG-LKXGYXEUSA-N Cys-Thr-Asn Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(O)=O JIVJQYNNAYFXDG-LKXGYXEUSA-N 0.000 description 1
- FTTZLFIEUQHLHH-BWBBJGPYSA-N Cys-Thr-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CS)N)O FTTZLFIEUQHLHH-BWBBJGPYSA-N 0.000 description 1
- MWVDDZUTWXFYHL-XKBZYTNZSA-N Cys-Thr-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CS)N)O MWVDDZUTWXFYHL-XKBZYTNZSA-N 0.000 description 1
- IRKLTAKLAFUTLA-KATARQTJSA-N Cys-Thr-Lys Chemical compound C[C@@H](O)[C@H](NC(=O)[C@@H](N)CS)C(=O)N[C@@H](CCCCN)C(O)=O IRKLTAKLAFUTLA-KATARQTJSA-N 0.000 description 1
- JTEGHEWKBCTIAL-IXOXFDKPSA-N Cys-Thr-Phe Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CS)N)O JTEGHEWKBCTIAL-IXOXFDKPSA-N 0.000 description 1
- NAPULYCVEVVFRB-HEIBUPTGSA-N Cys-Thr-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)O)NC(=O)[C@@H](N)CS NAPULYCVEVVFRB-HEIBUPTGSA-N 0.000 description 1
- KFYPRIGJTICABD-XGEHTFHBSA-N Cys-Thr-Val Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C(C)C)C(=O)O)NC(=O)[C@H](CS)N)O KFYPRIGJTICABD-XGEHTFHBSA-N 0.000 description 1
- PNEAWXSKCKCHDK-XIRDDKMYSA-N Cys-Trp-His Chemical compound C([C@H](NC(=O)[C@H](CC=1C2=CC=CC=C2NC=1)NC(=O)[C@H](CS)N)C(O)=O)C1=CN=CN1 PNEAWXSKCKCHDK-XIRDDKMYSA-N 0.000 description 1
- MSWBLPLBSLQVME-XIRDDKMYSA-N Cys-Trp-Leu Chemical compound C1=CC=C2C(C[C@@H](C(=O)N[C@@H](CC(C)C)C(O)=O)NC(=O)[C@@H](N)CS)=CNC2=C1 MSWBLPLBSLQVME-XIRDDKMYSA-N 0.000 description 1
- IWVNIQXKTIQXCT-SRVKXCTJSA-N Cys-Tyr-Asn Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CS)N)O IWVNIQXKTIQXCT-SRVKXCTJSA-N 0.000 description 1
- LHRCZIRWNFRIRG-SRVKXCTJSA-N Cys-Tyr-Asp Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CS)N)O LHRCZIRWNFRIRG-SRVKXCTJSA-N 0.000 description 1
- JIZRUFJGHPIYPS-SRVKXCTJSA-N Cys-Tyr-Cys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CS)N)O JIZRUFJGHPIYPS-SRVKXCTJSA-N 0.000 description 1
- ZFHXNNXMNLWKJH-HJPIBITLSA-N Cys-Tyr-Ile Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O ZFHXNNXMNLWKJH-HJPIBITLSA-N 0.000 description 1
- VRJZMZGGAKVSIQ-SRVKXCTJSA-N Cys-Tyr-Ser Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(O)=O VRJZMZGGAKVSIQ-SRVKXCTJSA-N 0.000 description 1
- VXDXZGYXHIADHF-YJRXYDGGSA-N Cys-Tyr-Thr Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O VXDXZGYXHIADHF-YJRXYDGGSA-N 0.000 description 1
- MHYHLWUGWUBUHF-GUBZILKMSA-N Cys-Val-Arg Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](CS)N MHYHLWUGWUBUHF-GUBZILKMSA-N 0.000 description 1
- JRZMCSIUYGSJKP-ZKWXMUAHSA-N Cys-Val-Asn Chemical compound SC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O JRZMCSIUYGSJKP-ZKWXMUAHSA-N 0.000 description 1
- DGQJGBDBFVGLGL-ZKWXMUAHSA-N Cys-Val-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CS)N DGQJGBDBFVGLGL-ZKWXMUAHSA-N 0.000 description 1
- IOLWXFWVYYCVTJ-NRPADANISA-N Cys-Val-Gln Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CS)N IOLWXFWVYYCVTJ-NRPADANISA-N 0.000 description 1
- NGOIQDYZMIKCOK-NAKRPEOUSA-N Cys-Val-Ile Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O NGOIQDYZMIKCOK-NAKRPEOUSA-N 0.000 description 1
- 108090000695 Cytokines Proteins 0.000 description 1
- 102000004127 Cytokines Human genes 0.000 description 1
- 108010090461 DFG peptide Proteins 0.000 description 1
- 102000012410 DNA Ligases Human genes 0.000 description 1
- 108010061982 DNA Ligases Proteins 0.000 description 1
- 238000001712 DNA sequencing Methods 0.000 description 1
- 108090000626 DNA-directed RNA polymerases Proteins 0.000 description 1
- 102000004163 DNA-directed RNA polymerases Human genes 0.000 description 1
- 206010061818 Disease progression Diseases 0.000 description 1
- 108010067770 Endopeptidase K Proteins 0.000 description 1
- 102000004190 Enzymes Human genes 0.000 description 1
- 108090000790 Enzymes Proteins 0.000 description 1
- 241000620209 Escherichia coli DH5[alpha] Species 0.000 description 1
- 108060002716 Exonuclease Proteins 0.000 description 1
- 108010092526 GKPV peptide Proteins 0.000 description 1
- 241001200922 Gagata Species 0.000 description 1
- 101000834253 Gallus gallus Actin, cytoplasmic 1 Proteins 0.000 description 1
- HHWQMFIGMMOVFK-WDSKDSINSA-N Gln-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(N)=O HHWQMFIGMMOVFK-WDSKDSINSA-N 0.000 description 1
- NUMFTVCBONFQIQ-DRZSPHRISA-N Gln-Ala-Phe Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O NUMFTVCBONFQIQ-DRZSPHRISA-N 0.000 description 1
- OYTPNWYZORARHL-XHNCKOQMSA-N Gln-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)N)N OYTPNWYZORARHL-XHNCKOQMSA-N 0.000 description 1
- RGXXLQWXBFNXTG-CIUDSAMLSA-N Gln-Arg-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(O)=O RGXXLQWXBFNXTG-CIUDSAMLSA-N 0.000 description 1
- KWUSGAIFNHQCBY-DCAQKATOSA-N Gln-Arg-Arg Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O KWUSGAIFNHQCBY-DCAQKATOSA-N 0.000 description 1
- DLOHWQXXGMEZDW-CIUDSAMLSA-N Gln-Arg-Asn Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(O)=O DLOHWQXXGMEZDW-CIUDSAMLSA-N 0.000 description 1
- WOACHWLUOFZLGJ-GUBZILKMSA-N Gln-Arg-Gln Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(O)=O WOACHWLUOFZLGJ-GUBZILKMSA-N 0.000 description 1
- INFBPLSHYFALDE-ACZMJKKPSA-N Gln-Asn-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(O)=O INFBPLSHYFALDE-ACZMJKKPSA-N 0.000 description 1
- OETQLUYCMBARHJ-CIUDSAMLSA-N Gln-Asn-Arg Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O OETQLUYCMBARHJ-CIUDSAMLSA-N 0.000 description 1
- SOBBAYVQSNXYPQ-ACZMJKKPSA-N Gln-Asn-Asn Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O SOBBAYVQSNXYPQ-ACZMJKKPSA-N 0.000 description 1
- TWHDOEYLXXQYOZ-FXQIFTODSA-N Gln-Asn-Gln Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N TWHDOEYLXXQYOZ-FXQIFTODSA-N 0.000 description 1
- PHZYLYASFWHLHJ-FXQIFTODSA-N Gln-Asn-Glu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O PHZYLYASFWHLHJ-FXQIFTODSA-N 0.000 description 1
- PONUFVLSGMQFAI-AVGNSLFASA-N Gln-Asn-Phe Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O PONUFVLSGMQFAI-AVGNSLFASA-N 0.000 description 1
- MGJMFSBEMSNYJL-AVGNSLFASA-N Gln-Asn-Tyr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O MGJMFSBEMSNYJL-AVGNSLFASA-N 0.000 description 1
- LMPBBFWHCRURJD-LAEOZQHASA-N Gln-Asn-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCC(=O)N)N LMPBBFWHCRURJD-LAEOZQHASA-N 0.000 description 1
- CYTSBCIIEHUPDU-ACZMJKKPSA-N Gln-Asp-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O CYTSBCIIEHUPDU-ACZMJKKPSA-N 0.000 description 1
- ULXXDWZMMSQBDC-ACZMJKKPSA-N Gln-Asp-Asp Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N ULXXDWZMMSQBDC-ACZMJKKPSA-N 0.000 description 1
- KZEUVLLVULIPNX-GUBZILKMSA-N Gln-Asp-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCC(=O)N)N KZEUVLLVULIPNX-GUBZILKMSA-N 0.000 description 1
- WLODHVXYKYHLJD-ACZMJKKPSA-N Gln-Asp-Ser Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CO)C(=O)O)N WLODHVXYKYHLJD-ACZMJKKPSA-N 0.000 description 1
- PZVJDMJHKUWSIV-AVGNSLFASA-N Gln-Cys-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CCC(=O)N)N)O PZVJDMJHKUWSIV-AVGNSLFASA-N 0.000 description 1
- ZDJZEGYVKANKED-NRPADANISA-N Gln-Cys-Val Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CS)C(=O)N[C@@H](C(C)C)C(O)=O ZDJZEGYVKANKED-NRPADANISA-N 0.000 description 1
- RRBLZNIIMHSHQF-FXQIFTODSA-N Gln-Gln-Cys Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CS)C(=O)O)N RRBLZNIIMHSHQF-FXQIFTODSA-N 0.000 description 1
- NVEASDQHBRZPSU-BQBZGAKWSA-N Gln-Gln-Gly Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O NVEASDQHBRZPSU-BQBZGAKWSA-N 0.000 description 1
- LVNILKSSFHCSJZ-IHRRRGAJSA-N Gln-Gln-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCC(=O)N)N LVNILKSSFHCSJZ-IHRRRGAJSA-N 0.000 description 1
- QFJPFPCSXOXMKI-BPUTZDHNSA-N Gln-Gln-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCC(=O)N)N QFJPFPCSXOXMKI-BPUTZDHNSA-N 0.000 description 1
- MCAVASRGVBVPMX-FXQIFTODSA-N Gln-Glu-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O MCAVASRGVBVPMX-FXQIFTODSA-N 0.000 description 1
- SNLOOPZHAQDMJG-CIUDSAMLSA-N Gln-Glu-Glu Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O SNLOOPZHAQDMJG-CIUDSAMLSA-N 0.000 description 1
- VOLVNCMGXWDDQY-LPEHRKFASA-N Gln-Glu-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCC(=O)N)N)C(=O)O VOLVNCMGXWDDQY-LPEHRKFASA-N 0.000 description 1
- DRDSQGHKTLSNEA-GLLZPBPUSA-N Gln-Glu-Thr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O DRDSQGHKTLSNEA-GLLZPBPUSA-N 0.000 description 1
- XJKAKYXMFHUIHT-AUTRQRHGSA-N Gln-Glu-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCC(=O)N)N XJKAKYXMFHUIHT-AUTRQRHGSA-N 0.000 description 1
- XKBASPWPBXNVLQ-WDSKDSINSA-N Gln-Gly-Asn Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O XKBASPWPBXNVLQ-WDSKDSINSA-N 0.000 description 1
- MFJAPSYJQJCQDN-BQBZGAKWSA-N Gln-Gly-Glu Chemical compound NC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O MFJAPSYJQJCQDN-BQBZGAKWSA-N 0.000 description 1
- XSBGUANSZDGULP-IUCAKERBSA-N Gln-Gly-Lys Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)NCC(=O)N[C@@H](CCCCN)C(O)=O XSBGUANSZDGULP-IUCAKERBSA-N 0.000 description 1
- VGTDBGYFVWOQTI-RYUDHWBXSA-N Gln-Gly-Phe Chemical compound NC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 VGTDBGYFVWOQTI-RYUDHWBXSA-N 0.000 description 1
- ORYMMTRPKVTGSJ-XVKPBYJWSA-N Gln-Gly-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CCC(N)=O ORYMMTRPKVTGSJ-XVKPBYJWSA-N 0.000 description 1
- HDUDGCZEOZEFOA-KBIXCLLPSA-N Gln-Ile-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)O)NC(=O)[C@H](CCC(=O)N)N HDUDGCZEOZEFOA-KBIXCLLPSA-N 0.000 description 1
- GQZDDFRXSDGUNG-YVNDNENWSA-N Gln-Ile-Gln Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(N)=O)C(O)=O GQZDDFRXSDGUNG-YVNDNENWSA-N 0.000 description 1
- YRWWJCDWLVXTHN-LAEOZQHASA-N Gln-Ile-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CCC(=O)N)N YRWWJCDWLVXTHN-LAEOZQHASA-N 0.000 description 1
- RGAOLBZBLOJUTP-GRLWGSQLSA-N Gln-Ile-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)NC(=O)[C@H](CCC(=O)N)N RGAOLBZBLOJUTP-GRLWGSQLSA-N 0.000 description 1
- JKGHMESJHRTHIC-SIUGBPQLSA-N Gln-Ile-Tyr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)[C@H](CCC(=O)N)N JKGHMESJHRTHIC-SIUGBPQLSA-N 0.000 description 1
- HYPVLWGNBIYTNA-GUBZILKMSA-N Gln-Leu-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O HYPVLWGNBIYTNA-GUBZILKMSA-N 0.000 description 1
- HWEINOMSWQSJDC-SRVKXCTJSA-N Gln-Leu-Arg Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O HWEINOMSWQSJDC-SRVKXCTJSA-N 0.000 description 1
- LGIKBBLQVSWUGK-DCAQKATOSA-N Gln-Leu-Gln Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O LGIKBBLQVSWUGK-DCAQKATOSA-N 0.000 description 1
- QKCZZAZNMMVICF-DCAQKATOSA-N Gln-Leu-Glu Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O QKCZZAZNMMVICF-DCAQKATOSA-N 0.000 description 1
- PSERKXGRRADTKA-MNXVOIDGSA-N Gln-Leu-Ile Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O PSERKXGRRADTKA-MNXVOIDGSA-N 0.000 description 1
- OAOOXBSVCJEIFY-QAETUUGQSA-N Gln-Leu-Leu-Pro Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(O)=O OAOOXBSVCJEIFY-QAETUUGQSA-N 0.000 description 1
- MLSKFHLRFVGNLL-WDCWCFNPSA-N Gln-Leu-Thr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MLSKFHLRFVGNLL-WDCWCFNPSA-N 0.000 description 1
- HSHCEAUPUPJPTE-JYJNAYRXSA-N Gln-Leu-Tyr Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)[C@H](CCC(=O)N)N HSHCEAUPUPJPTE-JYJNAYRXSA-N 0.000 description 1
- GURIQZQSTBBHRV-SRVKXCTJSA-N Gln-Lys-Arg Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O GURIQZQSTBBHRV-SRVKXCTJSA-N 0.000 description 1
- TWIAMTNJOMRDAK-GUBZILKMSA-N Gln-Lys-Asp Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O TWIAMTNJOMRDAK-GUBZILKMSA-N 0.000 description 1
- ATTWDCRXQNKRII-GUBZILKMSA-N Gln-Lys-Cys Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCC(=O)N)N ATTWDCRXQNKRII-GUBZILKMSA-N 0.000 description 1
- HPCOBEHVEHWREJ-DCAQKATOSA-N Gln-Lys-Glu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O HPCOBEHVEHWREJ-DCAQKATOSA-N 0.000 description 1
- SXGMGNZEHFORAV-IUCAKERBSA-N Gln-Lys-Gly Chemical compound C(CCN)C[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CCC(=O)N)N SXGMGNZEHFORAV-IUCAKERBSA-N 0.000 description 1
- FKXCBKCOSVIGCT-AVGNSLFASA-N Gln-Lys-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O FKXCBKCOSVIGCT-AVGNSLFASA-N 0.000 description 1
- QKWBEMCLYTYBNI-GVXVVHGQSA-N Gln-Lys-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCC(N)=O QKWBEMCLYTYBNI-GVXVVHGQSA-N 0.000 description 1
- MSHXWFKYXJTLEZ-CIUDSAMLSA-N Gln-Met-Asn Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCC(=O)N)N MSHXWFKYXJTLEZ-CIUDSAMLSA-N 0.000 description 1
- DSRVQBZAMPGEKU-AVGNSLFASA-N Gln-Phe-Cys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCC(=O)N)N DSRVQBZAMPGEKU-AVGNSLFASA-N 0.000 description 1
- QBEWLBKBGXVVPD-RYUDHWBXSA-N Gln-Phe-Gly Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CCC(=O)N)N QBEWLBKBGXVVPD-RYUDHWBXSA-N 0.000 description 1
- WHVLABLIJYGVEK-QEWYBTABSA-N Gln-Phe-Ile Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WHVLABLIJYGVEK-QEWYBTABSA-N 0.000 description 1
- DBNLXHGDGBUCDV-KKUMJFAQSA-N Gln-Phe-Met Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCSC)C(O)=O DBNLXHGDGBUCDV-KKUMJFAQSA-N 0.000 description 1
- OZEQPCDLCDRCGY-SOUVJXGZSA-N Gln-Phe-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)NC(=O)[C@H](CCC(=O)N)N)C(=O)O OZEQPCDLCDRCGY-SOUVJXGZSA-N 0.000 description 1
- QFXNFFZTMFHPST-DZKIICNBSA-N Gln-Phe-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](CCC(=O)N)N QFXNFFZTMFHPST-DZKIICNBSA-N 0.000 description 1
- FNAJNWPDTIXYJN-CIUDSAMLSA-N Gln-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CCC(N)=O FNAJNWPDTIXYJN-CIUDSAMLSA-N 0.000 description 1
- HMIXCETWRYDVMO-GUBZILKMSA-N Gln-Pro-Glu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O HMIXCETWRYDVMO-GUBZILKMSA-N 0.000 description 1
- MQJDLNRXBOELJW-KKUMJFAQSA-N Gln-Pro-Phe Chemical compound N[C@@H](CCC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](Cc1ccccc1)C(O)=O MQJDLNRXBOELJW-KKUMJFAQSA-N 0.000 description 1
- UWMDGPFFTKDUIY-HJGDQZAQSA-N Gln-Pro-Thr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O UWMDGPFFTKDUIY-HJGDQZAQSA-N 0.000 description 1
- FGWRYRAVBVOHIB-XIRDDKMYSA-N Gln-Pro-Trp Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CCC(=O)N)N)C(=O)N[C@@H](CC2=CNC3=CC=CC=C32)C(=O)O FGWRYRAVBVOHIB-XIRDDKMYSA-N 0.000 description 1
- NYCVMJGIJYQWDO-CIUDSAMLSA-N Gln-Ser-Arg Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O NYCVMJGIJYQWDO-CIUDSAMLSA-N 0.000 description 1
- OSCLNNWLKKIQJM-WDSKDSINSA-N Gln-Ser-Gly Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)NCC(O)=O OSCLNNWLKKIQJM-WDSKDSINSA-N 0.000 description 1
- SXFPZRRVWSUYII-KBIXCLLPSA-N Gln-Ser-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCC(=O)N)N SXFPZRRVWSUYII-KBIXCLLPSA-N 0.000 description 1
- KPNWAJMEMRCLAL-GUBZILKMSA-N Gln-Ser-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCC(=O)N)N KPNWAJMEMRCLAL-GUBZILKMSA-N 0.000 description 1
- OKQLXOYFUPVEHI-CIUDSAMLSA-N Gln-Ser-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCC(=O)N)N OKQLXOYFUPVEHI-CIUDSAMLSA-N 0.000 description 1
- ZGHMRONFHDVXEF-AVGNSLFASA-N Gln-Ser-Phe Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O ZGHMRONFHDVXEF-AVGNSLFASA-N 0.000 description 1
- GHAXJVNBAKGWEJ-AVGNSLFASA-N Gln-Ser-Tyr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O GHAXJVNBAKGWEJ-AVGNSLFASA-N 0.000 description 1
- PAOHIZNRJNIXQY-XQXXSGGOSA-N Gln-Thr-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O PAOHIZNRJNIXQY-XQXXSGGOSA-N 0.000 description 1
- DYVMTEWCGAVKSE-HJGDQZAQSA-N Gln-Thr-Arg Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](CCC(=O)N)N)O DYVMTEWCGAVKSE-HJGDQZAQSA-N 0.000 description 1
- VLOLPWWCNKWRNB-LOKLDPHHSA-N Gln-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)N)N)O VLOLPWWCNKWRNB-LOKLDPHHSA-N 0.000 description 1
- ARYKRXHBIPLULY-XKBZYTNZSA-N Gln-Thr-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O ARYKRXHBIPLULY-XKBZYTNZSA-N 0.000 description 1
- RNPGPFAVRLERPP-QEJZJMRPSA-N Gln-Trp-Asn Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC(N)=O)C(O)=O RNPGPFAVRLERPP-QEJZJMRPSA-N 0.000 description 1
- OEIDWQHTRYEYGG-QEJZJMRPSA-N Gln-Trp-Asp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CCC(=O)N)N OEIDWQHTRYEYGG-QEJZJMRPSA-N 0.000 description 1
- OACQOWPRWGNKTP-AVGNSLFASA-N Gln-Tyr-Asp Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O OACQOWPRWGNKTP-AVGNSLFASA-N 0.000 description 1
- YLABFXCRQQMMHS-AVGNSLFASA-N Gln-Tyr-Cys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCC(=O)N)N)O YLABFXCRQQMMHS-AVGNSLFASA-N 0.000 description 1
- SGVGIVDZLSHSEN-RYUDHWBXSA-N Gln-Tyr-Gly Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(O)=O SGVGIVDZLSHSEN-RYUDHWBXSA-N 0.000 description 1
- HPBKQFJXDUVNQV-FHWLQOOXSA-N Gln-Tyr-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O)NC(=O)[C@H](CCC(=O)N)N)O HPBKQFJXDUVNQV-FHWLQOOXSA-N 0.000 description 1
- VDMABHYXBULDGN-LAEOZQHASA-N Gln-Val-Asp Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O VDMABHYXBULDGN-LAEOZQHASA-N 0.000 description 1
- KHHDJQRWIFHXHS-NRPADANISA-N Gln-Val-Cys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCC(=O)N)N KHHDJQRWIFHXHS-NRPADANISA-N 0.000 description 1
- SDSMVVSHLAAOJL-UKJIMTQDSA-N Gln-Val-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CCC(=O)N)N SDSMVVSHLAAOJL-UKJIMTQDSA-N 0.000 description 1
- MKRDNSWGJWTBKZ-GVXVVHGQSA-N Gln-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)N)N MKRDNSWGJWTBKZ-GVXVVHGQSA-N 0.000 description 1
- GJLXZITZLUUXMJ-NHCYSSNCSA-N Gln-Val-Met Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CCC(=O)N)N GJLXZITZLUUXMJ-NHCYSSNCSA-N 0.000 description 1
- VYOILACOFPPNQH-UMNHJUIQSA-N Gln-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)N)N VYOILACOFPPNQH-UMNHJUIQSA-N 0.000 description 1
- CSMHMEATMDCQNY-DZKIICNBSA-N Gln-Val-Tyr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O CSMHMEATMDCQNY-DZKIICNBSA-N 0.000 description 1
- SOEXCCGNHQBFPV-DLOVCJGASA-N Gln-Val-Val Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C(C)C)C(O)=O SOEXCCGNHQBFPV-DLOVCJGASA-N 0.000 description 1
- RUFHOVYUYSNDNY-ACZMJKKPSA-N Glu-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(O)=O RUFHOVYUYSNDNY-ACZMJKKPSA-N 0.000 description 1
- WZZSKAJIHTUUSG-ACZMJKKPSA-N Glu-Ala-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(O)=O WZZSKAJIHTUUSG-ACZMJKKPSA-N 0.000 description 1
- LKDIBBOKUAASNP-FXQIFTODSA-N Glu-Ala-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O LKDIBBOKUAASNP-FXQIFTODSA-N 0.000 description 1
- UTKUTMJSWKKHEM-WDSKDSINSA-N Glu-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(O)=O UTKUTMJSWKKHEM-WDSKDSINSA-N 0.000 description 1
- BPDVTFBJZNBHEU-HGNGGELXSA-N Glu-Ala-His Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 BPDVTFBJZNBHEU-HGNGGELXSA-N 0.000 description 1
- ITYRYNUZHPNCIK-GUBZILKMSA-N Glu-Ala-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O ITYRYNUZHPNCIK-GUBZILKMSA-N 0.000 description 1
- NLKVNZUFDPWPNL-YUMQZZPRSA-N Glu-Arg-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O NLKVNZUFDPWPNL-YUMQZZPRSA-N 0.000 description 1
- KKCUFHUTMKQQCF-SRVKXCTJSA-N Glu-Arg-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O KKCUFHUTMKQQCF-SRVKXCTJSA-N 0.000 description 1
- SRZLHYPAOXBBSB-HJGDQZAQSA-N Glu-Arg-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SRZLHYPAOXBBSB-HJGDQZAQSA-N 0.000 description 1
- CKRUHITYRFNUKW-WDSKDSINSA-N Glu-Asn-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O CKRUHITYRFNUKW-WDSKDSINSA-N 0.000 description 1
- ZOXBSICWUDAOHX-GUBZILKMSA-N Glu-Asn-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CCC(O)=O ZOXBSICWUDAOHX-GUBZILKMSA-N 0.000 description 1
- LXAUHIRMWXQRKI-XHNCKOQMSA-N Glu-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCC(=O)O)N)C(=O)O LXAUHIRMWXQRKI-XHNCKOQMSA-N 0.000 description 1
- RTOOAKXIJADOLL-GUBZILKMSA-N Glu-Asp-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCC(=O)O)N RTOOAKXIJADOLL-GUBZILKMSA-N 0.000 description 1
- PAQUJCSYVIBPLC-AVGNSLFASA-N Glu-Asp-Phe Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 PAQUJCSYVIBPLC-AVGNSLFASA-N 0.000 description 1
- CKOFNWCLWRYUHK-XHNCKOQMSA-N Glu-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCC(=O)O)N)C(=O)O CKOFNWCLWRYUHK-XHNCKOQMSA-N 0.000 description 1
- JRCUFCXYZLPSDZ-ACZMJKKPSA-N Glu-Asp-Ser Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O JRCUFCXYZLPSDZ-ACZMJKKPSA-N 0.000 description 1
- KIMXNQXJJWWVIN-AVGNSLFASA-N Glu-Cys-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CCC(=O)O)N)O KIMXNQXJJWWVIN-AVGNSLFASA-N 0.000 description 1
- GFLQTABMFBXRIY-GUBZILKMSA-N Glu-Gln-Arg Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O GFLQTABMFBXRIY-GUBZILKMSA-N 0.000 description 1
- LVCHEMOPBORRLB-DCAQKATOSA-N Glu-Gln-Lys Chemical compound NCCCC[C@H](NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CCC(O)=O)C(O)=O LVCHEMOPBORRLB-DCAQKATOSA-N 0.000 description 1
- WLIPTFCZLHCNFD-LPEHRKFASA-N Glu-Gln-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCC(=O)O)N)C(=O)O WLIPTFCZLHCNFD-LPEHRKFASA-N 0.000 description 1
- HUFCEIHAFNVSNR-IHRRRGAJSA-N Glu-Gln-Tyr Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 HUFCEIHAFNVSNR-IHRRRGAJSA-N 0.000 description 1
- CGOHAEBMDSEKFB-FXQIFTODSA-N Glu-Glu-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O CGOHAEBMDSEKFB-FXQIFTODSA-N 0.000 description 1
- NKLRYVLERDYDBI-FXQIFTODSA-N Glu-Glu-Asp Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O NKLRYVLERDYDBI-FXQIFTODSA-N 0.000 description 1
- HNVFSTLPVJWIDV-CIUDSAMLSA-N Glu-Glu-Gln Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O HNVFSTLPVJWIDV-CIUDSAMLSA-N 0.000 description 1
- SJPMNHCEWPTRBR-BQBZGAKWSA-N Glu-Glu-Gly Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O SJPMNHCEWPTRBR-BQBZGAKWSA-N 0.000 description 1
- APHGWLWMOXGZRL-DCAQKATOSA-N Glu-Glu-His Chemical compound N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](Cc1cnc[nH]1)C(O)=O APHGWLWMOXGZRL-DCAQKATOSA-N 0.000 description 1
- LGYZYFFDELZWRS-DCAQKATOSA-N Glu-Glu-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCC(O)=O LGYZYFFDELZWRS-DCAQKATOSA-N 0.000 description 1
- IQACOVZVOMVILH-FXQIFTODSA-N Glu-Glu-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O IQACOVZVOMVILH-FXQIFTODSA-N 0.000 description 1
- PHONAZGUEGIOEM-GLLZPBPUSA-N Glu-Glu-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PHONAZGUEGIOEM-GLLZPBPUSA-N 0.000 description 1
- OGNJZUXUTPQVBR-BQBZGAKWSA-N Glu-Gly-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O OGNJZUXUTPQVBR-BQBZGAKWSA-N 0.000 description 1
- OAGVHWYIBZMWLA-YFKPBYRVSA-N Glu-Gly-Gly Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)NCC(O)=O OAGVHWYIBZMWLA-YFKPBYRVSA-N 0.000 description 1
- KRGZZKWSBGPLKL-IUCAKERBSA-N Glu-Gly-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CCC(=O)O)N KRGZZKWSBGPLKL-IUCAKERBSA-N 0.000 description 1
- VGOFRWOTSXVPAU-SDDRHHMPSA-N Glu-His-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CN=CN2)NC(=O)[C@H](CCC(=O)O)N)C(=O)O VGOFRWOTSXVPAU-SDDRHHMPSA-N 0.000 description 1
- ZWABFSSWTSAMQN-KBIXCLLPSA-N Glu-Ile-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O ZWABFSSWTSAMQN-KBIXCLLPSA-N 0.000 description 1
- VGUYMZGLJUJRBV-YVNDNENWSA-N Glu-Ile-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(O)=O VGUYMZGLJUJRBV-YVNDNENWSA-N 0.000 description 1
- ZSWGJYOZWBHROQ-RWRJDSDZSA-N Glu-Ile-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ZSWGJYOZWBHROQ-RWRJDSDZSA-N 0.000 description 1
- KRRFFAHEAOCBCQ-SIUGBPQLSA-N Glu-Ile-Tyr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O KRRFFAHEAOCBCQ-SIUGBPQLSA-N 0.000 description 1
- VGBSZQSKQRMLHD-MNXVOIDGSA-N Glu-Leu-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O VGBSZQSKQRMLHD-MNXVOIDGSA-N 0.000 description 1
- DWBBKNPKDHXIAC-SRVKXCTJSA-N Glu-Leu-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CCC(O)=O DWBBKNPKDHXIAC-SRVKXCTJSA-N 0.000 description 1
- PDLGMYVCPJOYAR-DKIMLUQUSA-N Glu-Leu-Phe-Ala Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(=O)N[C@@H](C)C(O)=O)CC1=CC=CC=C1 PDLGMYVCPJOYAR-DKIMLUQUSA-N 0.000 description 1
- OQXDUSZKISQQSS-GUBZILKMSA-N Glu-Lys-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O OQXDUSZKISQQSS-GUBZILKMSA-N 0.000 description 1
- SWRVAQHFBRZVNX-GUBZILKMSA-N Glu-Lys-Asn Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O SWRVAQHFBRZVNX-GUBZILKMSA-N 0.000 description 1
- CUPSDFQZTVVTSK-GUBZILKMSA-N Glu-Lys-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCC(O)=O CUPSDFQZTVVTSK-GUBZILKMSA-N 0.000 description 1
- OCJRHJZKGGSPRW-IUCAKERBSA-N Glu-Lys-Gly Chemical compound NCCCC[C@@H](C(=O)NCC(O)=O)NC(=O)[C@@H](N)CCC(O)=O OCJRHJZKGGSPRW-IUCAKERBSA-N 0.000 description 1
- ILWHFUZZCFYSKT-AVGNSLFASA-N Glu-Lys-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O ILWHFUZZCFYSKT-AVGNSLFASA-N 0.000 description 1
- RBXSZQRSEGYDFG-GUBZILKMSA-N Glu-Lys-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O RBXSZQRSEGYDFG-GUBZILKMSA-N 0.000 description 1
- LKOAAMXDJGEYMS-ZPFDUUQYSA-N Glu-Met-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O LKOAAMXDJGEYMS-ZPFDUUQYSA-N 0.000 description 1
- CBEUFCJRFNZMCU-SRVKXCTJSA-N Glu-Met-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(O)=O CBEUFCJRFNZMCU-SRVKXCTJSA-N 0.000 description 1
- SOEPMWQCTJITPZ-SRVKXCTJSA-N Glu-Met-Lys Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)O)N SOEPMWQCTJITPZ-SRVKXCTJSA-N 0.000 description 1
- KJBGAZSLZAQDPV-KKUMJFAQSA-N Glu-Phe-Arg Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](CCC(=O)O)N KJBGAZSLZAQDPV-KKUMJFAQSA-N 0.000 description 1
- UERORLSAFUHDGU-AVGNSLFASA-N Glu-Phe-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCC(=O)O)N UERORLSAFUHDGU-AVGNSLFASA-N 0.000 description 1
- WVWZIPOJECFDAG-AVGNSLFASA-N Glu-Phe-Cys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCC(=O)O)N WVWZIPOJECFDAG-AVGNSLFASA-N 0.000 description 1
- YTRBQAQSUDSIQE-FHWLQOOXSA-N Glu-Phe-Phe Chemical compound C([C@H](NC(=O)[C@H](CCC(O)=O)N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 YTRBQAQSUDSIQE-FHWLQOOXSA-N 0.000 description 1
- KXTAGESXNQEZKB-DZKIICNBSA-N Glu-Phe-Val Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C(C)C)C(O)=O)CC1=CC=CC=C1 KXTAGESXNQEZKB-DZKIICNBSA-N 0.000 description 1
- UDEPRBFQTWGLCW-CIUDSAMLSA-N Glu-Pro-Asp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O UDEPRBFQTWGLCW-CIUDSAMLSA-N 0.000 description 1
- HLYCMRDRWGSTPZ-CIUDSAMLSA-N Glu-Pro-Cys Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CCC(=O)O)N)C(=O)N[C@@H](CS)C(=O)O HLYCMRDRWGSTPZ-CIUDSAMLSA-N 0.000 description 1
- AAJHGGDRKHYSDH-GUBZILKMSA-N Glu-Pro-Gln Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CCC(=O)O)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O AAJHGGDRKHYSDH-GUBZILKMSA-N 0.000 description 1
- SYWCGQOIIARSIX-SRVKXCTJSA-N Glu-Pro-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O SYWCGQOIIARSIX-SRVKXCTJSA-N 0.000 description 1
- NNQDRRUXFJYCCJ-NHCYSSNCSA-N Glu-Pro-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O NNQDRRUXFJYCCJ-NHCYSSNCSA-N 0.000 description 1
- ALMBZBOCGSVSAI-ACZMJKKPSA-N Glu-Ser-Asn Chemical compound C(CC(=O)O)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(=O)N)C(=O)O)N ALMBZBOCGSVSAI-ACZMJKKPSA-N 0.000 description 1
- SYAYROHMAIHWFB-KBIXCLLPSA-N Glu-Ser-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O SYAYROHMAIHWFB-KBIXCLLPSA-N 0.000 description 1
- QOXDAWODGSIDDI-GUBZILKMSA-N Glu-Ser-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCC(=O)O)N QOXDAWODGSIDDI-GUBZILKMSA-N 0.000 description 1
- VNCNWQPIQYAMAK-ACZMJKKPSA-N Glu-Ser-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O VNCNWQPIQYAMAK-ACZMJKKPSA-N 0.000 description 1
- WXONSNSSBYQGNN-AVGNSLFASA-N Glu-Ser-Tyr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O WXONSNSSBYQGNN-AVGNSLFASA-N 0.000 description 1
- DDXZHOHEABQXSE-NKIYYHGXSA-N Glu-Thr-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CCC(=O)O)N)O DDXZHOHEABQXSE-NKIYYHGXSA-N 0.000 description 1
- DTLLNDVORUEOTM-WDCWCFNPSA-N Glu-Thr-Lys Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(O)=O DTLLNDVORUEOTM-WDCWCFNPSA-N 0.000 description 1
- CQGBSALYGOXQPE-HTUGSXCWSA-N Glu-Thr-Phe Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CCC(=O)O)N)O CQGBSALYGOXQPE-HTUGSXCWSA-N 0.000 description 1
- RZMXBFUSQNLEQF-QEJZJMRPSA-N Glu-Trp-Asn Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCC(=O)O)N RZMXBFUSQNLEQF-QEJZJMRPSA-N 0.000 description 1
- JLCYOCDGIUZMKQ-JBACZVJFSA-N Glu-Trp-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)NC(=O)[C@H](CCC(=O)O)N JLCYOCDGIUZMKQ-JBACZVJFSA-N 0.000 description 1
- NTHIHAUEXVTXQG-KKUMJFAQSA-N Glu-Tyr-Arg Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](CCC(=O)O)N)O NTHIHAUEXVTXQG-KKUMJFAQSA-N 0.000 description 1
- RXJFSLQVMGYQEL-IHRRRGAJSA-N Glu-Tyr-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CCC(O)=O)C(O)=O)CC1=CC=C(O)C=C1 RXJFSLQVMGYQEL-IHRRRGAJSA-N 0.000 description 1
- HJTSRYLPAYGEEC-SIUGBPQLSA-N Glu-Tyr-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CCC(=O)O)N HJTSRYLPAYGEEC-SIUGBPQLSA-N 0.000 description 1
- UUTGYDAKPISJAO-JYJNAYRXSA-N Glu-Tyr-Leu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C(O)=O)CC1=CC=C(O)C=C1 UUTGYDAKPISJAO-JYJNAYRXSA-N 0.000 description 1
- MFYLRRCYBBJYPI-JYJNAYRXSA-N Glu-Tyr-Lys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)O)N)O MFYLRRCYBBJYPI-JYJNAYRXSA-N 0.000 description 1
- KIEICAOUSNYOLM-NRPADANISA-N Glu-Val-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O KIEICAOUSNYOLM-NRPADANISA-N 0.000 description 1
- YPHPEHMXOYTEQG-LAEOZQHASA-N Glu-Val-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCC(O)=O YPHPEHMXOYTEQG-LAEOZQHASA-N 0.000 description 1
- YQPFCZVKMUVZIN-AUTRQRHGSA-N Glu-Val-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O YQPFCZVKMUVZIN-AUTRQRHGSA-N 0.000 description 1
- ZALGPUWUVHOGAE-GVXVVHGQSA-N Glu-Val-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CCC(=O)O)N ZALGPUWUVHOGAE-GVXVVHGQSA-N 0.000 description 1
- FGGKGJHCVMYGCD-UKJIMTQDSA-N Glu-Val-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O FGGKGJHCVMYGCD-UKJIMTQDSA-N 0.000 description 1
- VIPDPMHGICREIS-GVXVVHGQSA-N Glu-Val-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O VIPDPMHGICREIS-GVXVVHGQSA-N 0.000 description 1
- ZYRXTRTUCAVNBQ-GVXVVHGQSA-N Glu-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)O)N ZYRXTRTUCAVNBQ-GVXVVHGQSA-N 0.000 description 1
- WGYHAAXZWPEBDQ-IFFSRLJSSA-N Glu-Val-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O WGYHAAXZWPEBDQ-IFFSRLJSSA-N 0.000 description 1
- BRFJMRSRMOMIMU-WHFBIAKZSA-N Gly-Ala-Asn Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(O)=O BRFJMRSRMOMIMU-WHFBIAKZSA-N 0.000 description 1
- MZZSCEANQDPJER-ONGXEEELSA-N Gly-Ala-Phe Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 MZZSCEANQDPJER-ONGXEEELSA-N 0.000 description 1
- LJPIRKICOISLKN-WHFBIAKZSA-N Gly-Ala-Ser Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O LJPIRKICOISLKN-WHFBIAKZSA-N 0.000 description 1
- JXYMPBCYRKWJEE-BQBZGAKWSA-N Gly-Arg-Ala Chemical compound [H]NCC(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(O)=O JXYMPBCYRKWJEE-BQBZGAKWSA-N 0.000 description 1
- CLODWIOAKCSBAN-BQBZGAKWSA-N Gly-Arg-Asp Chemical compound NC(N)=NCCC[C@H](NC(=O)CN)C(=O)N[C@@H](CC(O)=O)C(O)=O CLODWIOAKCSBAN-BQBZGAKWSA-N 0.000 description 1
- VXKCPBPQEKKERH-IUCAKERBSA-N Gly-Arg-Pro Chemical compound NC(N)=NCCC[C@H](NC(=O)CN)C(=O)N1CCC[C@H]1C(O)=O VXKCPBPQEKKERH-IUCAKERBSA-N 0.000 description 1
- KKBWDNZXYLGJEY-UHFFFAOYSA-N Gly-Arg-Pro Natural products NCC(=O)NC(CCNC(=N)N)C(=O)N1CCCC1C(=O)O KKBWDNZXYLGJEY-UHFFFAOYSA-N 0.000 description 1
- DJTXYXZNNDDEOU-WHFBIAKZSA-N Gly-Asn-Cys Chemical compound C([C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)CN)C(=O)N DJTXYXZNNDDEOU-WHFBIAKZSA-N 0.000 description 1
- WJZLEENECIOOSA-WDSKDSINSA-N Gly-Asn-Gln Chemical compound NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)O WJZLEENECIOOSA-WDSKDSINSA-N 0.000 description 1
- GGEJHJIXRBTJPD-BYPYZUCNSA-N Gly-Asn-Gly Chemical compound NCC(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O GGEJHJIXRBTJPD-BYPYZUCNSA-N 0.000 description 1
- JVWPPCWUDRJGAE-YUMQZZPRSA-N Gly-Asn-Leu Chemical compound [H]NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O JVWPPCWUDRJGAE-YUMQZZPRSA-N 0.000 description 1
- XCLCVBYNGXEVDU-WHFBIAKZSA-N Gly-Asn-Ser Chemical compound NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O XCLCVBYNGXEVDU-WHFBIAKZSA-N 0.000 description 1
- GRIRDMVMJJDZKV-RCOVLWMOSA-N Gly-Asn-Val Chemical compound [H]NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O GRIRDMVMJJDZKV-RCOVLWMOSA-N 0.000 description 1
- KQDMENMTYNBWMR-WHFBIAKZSA-N Gly-Asp-Ala Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O KQDMENMTYNBWMR-WHFBIAKZSA-N 0.000 description 1
- SUDUYJOBLHQAMI-WHFBIAKZSA-N Gly-Asp-Cys Chemical compound NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CS)C(O)=O SUDUYJOBLHQAMI-WHFBIAKZSA-N 0.000 description 1
- XEJTYSCIXKYSHR-WDSKDSINSA-N Gly-Asp-Gln Chemical compound C(CC(=O)N)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)CN XEJTYSCIXKYSHR-WDSKDSINSA-N 0.000 description 1
- XBWMTPAIUQIWKA-BYULHYEWSA-N Gly-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)CN XBWMTPAIUQIWKA-BYULHYEWSA-N 0.000 description 1
- RPLLQZBOVIVGMX-QWRGUYRKSA-N Gly-Asp-Phe Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O RPLLQZBOVIVGMX-QWRGUYRKSA-N 0.000 description 1
- LXXLEUBUOMCAMR-NKWVEPMBSA-N Gly-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)CN)C(=O)O LXXLEUBUOMCAMR-NKWVEPMBSA-N 0.000 description 1
- GVVKYKCOFMMTKZ-WHFBIAKZSA-N Gly-Cys-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CS)NC(=O)CN GVVKYKCOFMMTKZ-WHFBIAKZSA-N 0.000 description 1
- YDWZGVCXMVLDQH-WHFBIAKZSA-N Gly-Cys-Asn Chemical compound NCC(=O)N[C@@H](CS)C(=O)N[C@H](C(O)=O)CC(N)=O YDWZGVCXMVLDQH-WHFBIAKZSA-N 0.000 description 1
- LEGMTEAZGRRIMY-ZKWXMUAHSA-N Gly-Cys-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)CN LEGMTEAZGRRIMY-ZKWXMUAHSA-N 0.000 description 1
- IXKRSKPKSLXIHN-YUMQZZPRSA-N Gly-Cys-Leu Chemical compound [H]NCC(=O)N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(O)=O IXKRSKPKSLXIHN-YUMQZZPRSA-N 0.000 description 1
- SABZDFAAOJATBR-QWRGUYRKSA-N Gly-Cys-Phe Chemical compound [H]NCC(=O)N[C@@H](CS)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O SABZDFAAOJATBR-QWRGUYRKSA-N 0.000 description 1
- QCTLGOYODITHPQ-WHFBIAKZSA-N Gly-Cys-Ser Chemical compound [H]NCC(=O)N[C@@H](CS)C(=O)N[C@@H](CO)C(O)=O QCTLGOYODITHPQ-WHFBIAKZSA-N 0.000 description 1
- VNBNZUAPOYGRDB-ZDLURKLDSA-N Gly-Cys-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)CN)O VNBNZUAPOYGRDB-ZDLURKLDSA-N 0.000 description 1
- KTSZUNRRYXPZTK-BQBZGAKWSA-N Gly-Gln-Glu Chemical compound NCC(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O KTSZUNRRYXPZTK-BQBZGAKWSA-N 0.000 description 1
- AQLHORCVPGXDJW-IUCAKERBSA-N Gly-Gln-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)CN AQLHORCVPGXDJW-IUCAKERBSA-N 0.000 description 1
- PABFFPWEJMEVEC-JGVFFNPUSA-N Gly-Gln-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)N)NC(=O)CN)C(=O)O PABFFPWEJMEVEC-JGVFFNPUSA-N 0.000 description 1
- BIRKKBCSAIHDDF-WDSKDSINSA-N Gly-Glu-Cys Chemical compound NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CS)C(O)=O BIRKKBCSAIHDDF-WDSKDSINSA-N 0.000 description 1
- HFXJIZNEXNIZIJ-BQBZGAKWSA-N Gly-Glu-Gln Chemical compound NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O HFXJIZNEXNIZIJ-BQBZGAKWSA-N 0.000 description 1
- YYPFZVIXAVDHIK-IUCAKERBSA-N Gly-Glu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)CN YYPFZVIXAVDHIK-IUCAKERBSA-N 0.000 description 1
- JSNNHGHYGYMVCK-XVKPBYJWSA-N Gly-Glu-Val Chemical compound [H]NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O JSNNHGHYGYMVCK-XVKPBYJWSA-N 0.000 description 1
- KMSGYZQRXPUKGI-BYPYZUCNSA-N Gly-Gly-Asn Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CC(N)=O KMSGYZQRXPUKGI-BYPYZUCNSA-N 0.000 description 1
- XPJBQTCXPJNIFE-ZETCQYMHSA-N Gly-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)CN XPJBQTCXPJNIFE-ZETCQYMHSA-N 0.000 description 1
- QITBQGJOXQYMOA-ZETCQYMHSA-N Gly-Gly-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)CNC(=O)CN QITBQGJOXQYMOA-ZETCQYMHSA-N 0.000 description 1
- QPCVIQJVRGXUSA-LURJTMIESA-N Gly-Gly-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)CNC(=O)CN QPCVIQJVRGXUSA-LURJTMIESA-N 0.000 description 1
- UQJNXZSSGQIPIQ-FBCQKBJTSA-N Gly-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)CN UQJNXZSSGQIPIQ-FBCQKBJTSA-N 0.000 description 1
- ORXZVPZCPMKHNR-IUCAKERBSA-N Gly-His-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CNC=N1 ORXZVPZCPMKHNR-IUCAKERBSA-N 0.000 description 1
- FSPVILZGHUJOHS-QWRGUYRKSA-N Gly-His-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CNC=N1 FSPVILZGHUJOHS-QWRGUYRKSA-N 0.000 description 1
- LPCKHUXOGVNZRS-YUMQZZPRSA-N Gly-His-Ser Chemical compound [H]NCC(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(O)=O LPCKHUXOGVNZRS-YUMQZZPRSA-N 0.000 description 1
- DGKBSGNCMCLDSL-BYULHYEWSA-N Gly-Ile-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)CN DGKBSGNCMCLDSL-BYULHYEWSA-N 0.000 description 1
- UTYGDAHJBBDPBA-BYULHYEWSA-N Gly-Ile-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)CN UTYGDAHJBBDPBA-BYULHYEWSA-N 0.000 description 1
- LUJVWKKYHSLULQ-ZKWXMUAHSA-N Gly-Ile-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)CN LUJVWKKYHSLULQ-ZKWXMUAHSA-N 0.000 description 1
- AAHSHTLISQUZJL-QSFUFRPTSA-N Gly-Ile-Ile Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O AAHSHTLISQUZJL-QSFUFRPTSA-N 0.000 description 1
- XVYKMNXXJXQKME-XEGUGMAKSA-N Gly-Ile-Tyr Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 XVYKMNXXJXQKME-XEGUGMAKSA-N 0.000 description 1
- YTSVAIMKVLZUDU-YUMQZZPRSA-N Gly-Leu-Asp Chemical compound [H]NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O YTSVAIMKVLZUDU-YUMQZZPRSA-N 0.000 description 1
- LRQXRHGQEVWGPV-NHCYSSNCSA-N Gly-Leu-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)CN LRQXRHGQEVWGPV-NHCYSSNCSA-N 0.000 description 1
- LLZXNUUIBOALNY-QWRGUYRKSA-N Gly-Leu-Lys Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCCN LLZXNUUIBOALNY-QWRGUYRKSA-N 0.000 description 1
- YSDLIYZLOTZZNP-UWVGGRQHSA-N Gly-Leu-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)CN YSDLIYZLOTZZNP-UWVGGRQHSA-N 0.000 description 1
- TVUWMSBGMVAHSJ-KBPBESRZSA-N Gly-Leu-Phe Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 TVUWMSBGMVAHSJ-KBPBESRZSA-N 0.000 description 1
- UUYBFNKHOCJCHT-VHSXEESVSA-N Gly-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)CN UUYBFNKHOCJCHT-VHSXEESVSA-N 0.000 description 1
- VBOBNHSVQKKTOT-YUMQZZPRSA-N Gly-Lys-Ala Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O VBOBNHSVQKKTOT-YUMQZZPRSA-N 0.000 description 1
- CLNSYANKYVMZNM-UWVGGRQHSA-N Gly-Lys-Arg Chemical compound NCCCC[C@H](NC(=O)CN)C(=O)N[C@H](C(O)=O)CCCN=C(N)N CLNSYANKYVMZNM-UWVGGRQHSA-N 0.000 description 1
- FHQRLHFYVZAQHU-IUCAKERBSA-N Gly-Lys-Gln Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(O)=O FHQRLHFYVZAQHU-IUCAKERBSA-N 0.000 description 1
- GMTXWRIDLGTVFC-IUCAKERBSA-N Gly-Lys-Glu Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O GMTXWRIDLGTVFC-IUCAKERBSA-N 0.000 description 1
- VEPBEGNDJYANCF-QWRGUYRKSA-N Gly-Lys-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCCN VEPBEGNDJYANCF-QWRGUYRKSA-N 0.000 description 1
- MHZXESQPPXOING-KBPBESRZSA-N Gly-Lys-Phe Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O MHZXESQPPXOING-KBPBESRZSA-N 0.000 description 1
- FXGRXIATVXUAHO-WEDXCCLWSA-N Gly-Lys-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCCN FXGRXIATVXUAHO-WEDXCCLWSA-N 0.000 description 1
- SJLKKOZFHSJJAW-YUMQZZPRSA-N Gly-Met-Glu Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)CN SJLKKOZFHSJJAW-YUMQZZPRSA-N 0.000 description 1
- OMOZPGCHVWOXHN-BQBZGAKWSA-N Gly-Met-Ser Chemical compound CSCC[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)CN OMOZPGCHVWOXHN-BQBZGAKWSA-N 0.000 description 1
- TTYVAUJGNMVTRN-GJZGRUSLSA-N Gly-Met-Trp Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)CN TTYVAUJGNMVTRN-GJZGRUSLSA-N 0.000 description 1
- WMGHDYWNHNLGBV-ONGXEEELSA-N Gly-Phe-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H](NC(=O)CN)CC1=CC=CC=C1 WMGHDYWNHNLGBV-ONGXEEELSA-N 0.000 description 1
- YYXJFBMCOUSYSF-RYUDHWBXSA-N Gly-Phe-Gln Chemical compound [H]NCC(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O YYXJFBMCOUSYSF-RYUDHWBXSA-N 0.000 description 1
- WZSHYFGOLPXPLL-RYUDHWBXSA-N Gly-Phe-Glu Chemical compound NCC(=O)N[C@@H](Cc1ccccc1)C(=O)N[C@@H](CCC(O)=O)C(O)=O WZSHYFGOLPXPLL-RYUDHWBXSA-N 0.000 description 1
- IGOYNRWLWHWAQO-JTQLQIEISA-N Gly-Phe-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)CN)CC1=CC=CC=C1 IGOYNRWLWHWAQO-JTQLQIEISA-N 0.000 description 1
- YLEIWGJJBFBFHC-KBPBESRZSA-N Gly-Phe-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CC=CC=C1 YLEIWGJJBFBFHC-KBPBESRZSA-N 0.000 description 1
- FEUPVVCGQLNXNP-IRXDYDNUSA-N Gly-Phe-Phe Chemical compound C([C@H](NC(=O)CN)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 FEUPVVCGQLNXNP-IRXDYDNUSA-N 0.000 description 1
- GGAPHLIUUTVYMX-QWRGUYRKSA-N Gly-Phe-Ser Chemical compound OC[C@@H](C([O-])=O)NC(=O)[C@@H](NC(=O)C[NH3+])CC1=CC=CC=C1 GGAPHLIUUTVYMX-QWRGUYRKSA-N 0.000 description 1
- NZOAFWHVAFJERA-OALUTQOASA-N Gly-Phe-Trp Chemical compound [H]NCC(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O NZOAFWHVAFJERA-OALUTQOASA-N 0.000 description 1
- QAMMIGULQSIRCD-IRXDYDNUSA-N Gly-Phe-Tyr Chemical compound C([C@H](NC(=O)C[NH3+])C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C([O-])=O)C1=CC=CC=C1 QAMMIGULQSIRCD-IRXDYDNUSA-N 0.000 description 1
- WDXLKVQATNEAJQ-BQBZGAKWSA-N Gly-Pro-Asp Chemical compound NCC(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O WDXLKVQATNEAJQ-BQBZGAKWSA-N 0.000 description 1
- ZZJVYSAQQMDIRD-UWVGGRQHSA-N Gly-Pro-His Chemical compound NCC(=O)N1CCC[C@H]1C(=O)N[C@@H](Cc1cnc[nH]1)C(O)=O ZZJVYSAQQMDIRD-UWVGGRQHSA-N 0.000 description 1
- OOCFXNOVSLSHAB-IUCAKERBSA-N Gly-Pro-Pro Chemical compound NCC(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 OOCFXNOVSLSHAB-IUCAKERBSA-N 0.000 description 1
- HAOUOFNNJJLVNS-BQBZGAKWSA-N Gly-Pro-Ser Chemical compound NCC(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O HAOUOFNNJJLVNS-BQBZGAKWSA-N 0.000 description 1
- IALQAMYQJBZNSK-WHFBIAKZSA-N Gly-Ser-Asn Chemical compound [H]NCC(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O IALQAMYQJBZNSK-WHFBIAKZSA-N 0.000 description 1
- OHUKZZYSJBKFRR-WHFBIAKZSA-N Gly-Ser-Asp Chemical compound [H]NCC(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O OHUKZZYSJBKFRR-WHFBIAKZSA-N 0.000 description 1
- FGPLUIQCSKGLTI-WDSKDSINSA-N Gly-Ser-Glu Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O FGPLUIQCSKGLTI-WDSKDSINSA-N 0.000 description 1
- WNGHUXFWEWTKAO-YUMQZZPRSA-N Gly-Ser-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)CN WNGHUXFWEWTKAO-YUMQZZPRSA-N 0.000 description 1
- WCORRBXVISTKQL-WHFBIAKZSA-N Gly-Ser-Ser Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O WCORRBXVISTKQL-WHFBIAKZSA-N 0.000 description 1
- LCRDMSSAKLTKBU-ZDLURKLDSA-N Gly-Ser-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)CN LCRDMSSAKLTKBU-ZDLURKLDSA-N 0.000 description 1
- FFJQHWKSGAWSTJ-BFHQHQDPSA-N Gly-Thr-Ala Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O FFJQHWKSGAWSTJ-BFHQHQDPSA-N 0.000 description 1
- YXTFLTJYLIAZQG-FJXKBIBVSA-N Gly-Thr-Arg Chemical compound NCC(=O)N[C@@H]([C@H](O)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N YXTFLTJYLIAZQG-FJXKBIBVSA-N 0.000 description 1
- FKESCSGWBPUTPN-FOHZUACHSA-N Gly-Thr-Asn Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(O)=O FKESCSGWBPUTPN-FOHZUACHSA-N 0.000 description 1
- NVTPVQLIZCOJFK-FOHZUACHSA-N Gly-Thr-Asp Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(O)=O NVTPVQLIZCOJFK-FOHZUACHSA-N 0.000 description 1
- LLWQVJNHMYBLLK-CDMKHQONSA-N Gly-Thr-Phe Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O LLWQVJNHMYBLLK-CDMKHQONSA-N 0.000 description 1
- MYXNLWDWWOTERK-BHNWBGBOSA-N Gly-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)CN)O MYXNLWDWWOTERK-BHNWBGBOSA-N 0.000 description 1
- WTUSRDZLLWGYAT-KCTSRDHCSA-N Gly-Trp-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)CN WTUSRDZLLWGYAT-KCTSRDHCSA-N 0.000 description 1
- UMBDRSMLCUYIRI-DVJZZOLTSA-N Gly-Trp-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)CN)O UMBDRSMLCUYIRI-DVJZZOLTSA-N 0.000 description 1
- GWNIGUKSRJBIHX-STQMWFEESA-N Gly-Tyr-Arg Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)CN)O GWNIGUKSRJBIHX-STQMWFEESA-N 0.000 description 1
- HQSKKSLNLSTONK-JTQLQIEISA-N Gly-Tyr-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)CN)CC1=CC=C(O)C=C1 HQSKKSLNLSTONK-JTQLQIEISA-N 0.000 description 1
- KOYUSMBPJOVSOO-XEGUGMAKSA-N Gly-Tyr-Ile Chemical compound [H]NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KOYUSMBPJOVSOO-XEGUGMAKSA-N 0.000 description 1
- OCRQUYDOYKCOQG-IRXDYDNUSA-N Gly-Tyr-Phe Chemical compound C([C@H](NC(=O)CN)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=C(O)C=C1 OCRQUYDOYKCOQG-IRXDYDNUSA-N 0.000 description 1
- JYGYNWYVKXENNE-OALUTQOASA-N Gly-Tyr-Trp Chemical compound [H]NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O JYGYNWYVKXENNE-OALUTQOASA-N 0.000 description 1
- NGBGZCUWFVVJKC-IRXDYDNUSA-N Gly-Tyr-Tyr Chemical compound C([C@H](NC(=O)CN)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=C(O)C=C1 NGBGZCUWFVVJKC-IRXDYDNUSA-N 0.000 description 1
- GJHWILMUOANXTG-WPRPVWTQSA-N Gly-Val-Arg Chemical compound [H]NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O GJHWILMUOANXTG-WPRPVWTQSA-N 0.000 description 1
- DNVDEMWIYLVIQU-RCOVLWMOSA-N Gly-Val-Asp Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O DNVDEMWIYLVIQU-RCOVLWMOSA-N 0.000 description 1
- RYAOJUMWLWUGNW-QMMMGPOBSA-N Gly-Val-Gly Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O RYAOJUMWLWUGNW-QMMMGPOBSA-N 0.000 description 1
- MUGLKCQHTUFLGF-WPRPVWTQSA-N Gly-Val-Met Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)CN MUGLKCQHTUFLGF-WPRPVWTQSA-N 0.000 description 1
- BNMRSWQOHIQTFL-JSGCOSHPSA-N Gly-Val-Phe Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 BNMRSWQOHIQTFL-JSGCOSHPSA-N 0.000 description 1
- 235000015842 Hesperis Nutrition 0.000 description 1
- BIAKMWKJMQLZOJ-ZKWXMUAHSA-N His-Ala-Ala Chemical compound C[C@H](NC(=O)[C@H](C)NC(=O)[C@@H](N)Cc1cnc[nH]1)C(O)=O BIAKMWKJMQLZOJ-ZKWXMUAHSA-N 0.000 description 1
- AFPFGFUGETYOSY-HGNGGELXSA-N His-Ala-Glu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O AFPFGFUGETYOSY-HGNGGELXSA-N 0.000 description 1
- VSLXGYMEHVAJBH-DLOVCJGASA-N His-Ala-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O VSLXGYMEHVAJBH-DLOVCJGASA-N 0.000 description 1
- HXKZJLWGSWQKEA-LSJOCFKGSA-N His-Ala-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CN=CN1 HXKZJLWGSWQKEA-LSJOCFKGSA-N 0.000 description 1
- SYMSVYVUSPSAAO-IHRRRGAJSA-N His-Arg-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O SYMSVYVUSPSAAO-IHRRRGAJSA-N 0.000 description 1
- MAABHGXCIBEYQR-XVYDVKMFSA-N His-Asn-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC1=CN=CN1)N MAABHGXCIBEYQR-XVYDVKMFSA-N 0.000 description 1
- KYMUEAZVLPRVAE-GUBZILKMSA-N His-Asn-Glu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O KYMUEAZVLPRVAE-GUBZILKMSA-N 0.000 description 1
- JFFAPRNXXLRINI-NHCYSSNCSA-N His-Asp-Val Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O JFFAPRNXXLRINI-NHCYSSNCSA-N 0.000 description 1
- SWSVTNGMKBDTBM-DCAQKATOSA-N His-Gln-Glu Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N SWSVTNGMKBDTBM-DCAQKATOSA-N 0.000 description 1
- IIVZNQCUUMBBKF-GVXVVHGQSA-N His-Gln-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CC1=CN=CN1 IIVZNQCUUMBBKF-GVXVVHGQSA-N 0.000 description 1
- WEIYKCOEVBUJQC-JYJNAYRXSA-N His-Glu-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC2=CN=CN2)N WEIYKCOEVBUJQC-JYJNAYRXSA-N 0.000 description 1
- STWGDDDFLUFCCA-GVXVVHGQSA-N His-Glu-Val Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O STWGDDDFLUFCCA-GVXVVHGQSA-N 0.000 description 1
- NQKRILCJYCASDV-QWRGUYRKSA-N His-Gly-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC1=CN=CN1 NQKRILCJYCASDV-QWRGUYRKSA-N 0.000 description 1
- BDFCIKANUNMFGB-PMVVWTBXSA-N His-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC1=CN=CN1 BDFCIKANUNMFGB-PMVVWTBXSA-N 0.000 description 1
- VJJSDSNFXCWCEJ-DJFWLOJKSA-N His-Ile-Asn Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O VJJSDSNFXCWCEJ-DJFWLOJKSA-N 0.000 description 1
- VTZYMXGGXOFBMX-DJFWLOJKSA-N His-Ile-Asp Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O VTZYMXGGXOFBMX-DJFWLOJKSA-N 0.000 description 1
- NDKSHNQINMRKHT-PEXQALLHSA-N His-Ile-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CC1=CN=CN1)N NDKSHNQINMRKHT-PEXQALLHSA-N 0.000 description 1
- LJUIEESLIAZSFR-SRVKXCTJSA-N His-Leu-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N LJUIEESLIAZSFR-SRVKXCTJSA-N 0.000 description 1
- BXOLYFJYQQRQDJ-MXAVVETBSA-N His-Leu-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC1=CN=CN1)N BXOLYFJYQQRQDJ-MXAVVETBSA-N 0.000 description 1
- LYDKQVYYCMYNMC-SRVKXCTJSA-N His-Lys-Cys Chemical compound NCCCC[C@@H](C(=O)N[C@@H](CS)C(O)=O)NC(=O)[C@@H](N)CC1=CN=CN1 LYDKQVYYCMYNMC-SRVKXCTJSA-N 0.000 description 1
- XKIYNCLILDLGRS-QWRGUYRKSA-N His-Lys-Gly Chemical compound NCCCC[C@@H](C(=O)NCC(O)=O)NC(=O)[C@@H](N)CC1=CN=CN1 XKIYNCLILDLGRS-QWRGUYRKSA-N 0.000 description 1
- DEOQGJUXUQGUJN-KKUMJFAQSA-N His-Lys-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N DEOQGJUXUQGUJN-KKUMJFAQSA-N 0.000 description 1
- LDFWDDVELNOGII-MXAVVETBSA-N His-Lys-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC1=CN=CN1)N LDFWDDVELNOGII-MXAVVETBSA-N 0.000 description 1
- CKRJBQJIGOEKMC-SRVKXCTJSA-N His-Lys-Ser Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O CKRJBQJIGOEKMC-SRVKXCTJSA-N 0.000 description 1
- AYUOWUNWZGTNKB-ULQDDVLXSA-N His-Phe-Arg Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O AYUOWUNWZGTNKB-ULQDDVLXSA-N 0.000 description 1
- BSVLMPMIXPQNKC-KBPBESRZSA-N His-Phe-Gly Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)NCC(O)=O BSVLMPMIXPQNKC-KBPBESRZSA-N 0.000 description 1
- ZFDKSLBEWYCOCS-BZSNNMDCSA-N His-Phe-Lys Chemical compound C([C@@H](C(=O)N[C@@H](CCCCN)C(O)=O)NC(=O)[C@@H](N)CC=1NC=NC=1)C1=CC=CC=C1 ZFDKSLBEWYCOCS-BZSNNMDCSA-N 0.000 description 1
- ULRFSEJGSHYLQI-YESZJQIVSA-N His-Phe-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)NC(=O)[C@H](CC3=CN=CN3)N)C(=O)O ULRFSEJGSHYLQI-YESZJQIVSA-N 0.000 description 1
- HYWZHNUGAYVEEW-KKUMJFAQSA-N His-Phe-Ser Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC2=CN=CN2)N HYWZHNUGAYVEEW-KKUMJFAQSA-N 0.000 description 1
- VDHOMPFVSABJKU-ULQDDVLXSA-N His-Phe-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](CC2=CN=CN2)N VDHOMPFVSABJKU-ULQDDVLXSA-N 0.000 description 1
- ZVKDCQVQTGYBQT-LSJOCFKGSA-N His-Pro-Ala Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O ZVKDCQVQTGYBQT-LSJOCFKGSA-N 0.000 description 1
- PYNPBMCLAKTHJL-SRVKXCTJSA-N His-Pro-Glu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O PYNPBMCLAKTHJL-SRVKXCTJSA-N 0.000 description 1
- CHIAUHSHDARFBD-ULQDDVLXSA-N His-Pro-Tyr Chemical compound C([C@H](N)C(=O)N1[C@@H](CCC1)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CN=CN1 CHIAUHSHDARFBD-ULQDDVLXSA-N 0.000 description 1
- ILUVWFTXAUYOBW-CUJWVEQBSA-N His-Ser-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CC1=CN=CN1)N)O ILUVWFTXAUYOBW-CUJWVEQBSA-N 0.000 description 1
- FFKJUTZARGRVTH-KKUMJFAQSA-N His-Ser-Tyr Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O FFKJUTZARGRVTH-KKUMJFAQSA-N 0.000 description 1
- JGFWUKYIQAEYAH-DCAQKATOSA-N His-Ser-Val Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O JGFWUKYIQAEYAH-DCAQKATOSA-N 0.000 description 1
- XVZJRZQIHJMUBG-TUBUOCAGSA-N His-Thr-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H]([C@@H](C)O)NC(=O)[C@H](CC1=CN=CN1)N XVZJRZQIHJMUBG-TUBUOCAGSA-N 0.000 description 1
- CCUSLCQWVMWTIS-IXOXFDKPSA-N His-Thr-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O CCUSLCQWVMWTIS-IXOXFDKPSA-N 0.000 description 1
- AHEBIAHEZWQVHB-QTKMDUPCSA-N His-Thr-Met Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N)O AHEBIAHEZWQVHB-QTKMDUPCSA-N 0.000 description 1
- UWSMZKRTOZEGDD-CUJWVEQBSA-N His-Thr-Ser Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O UWSMZKRTOZEGDD-CUJWVEQBSA-N 0.000 description 1
- KECFCPNPPYCGBL-PMVMPFDFSA-N His-Trp-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)NC(=O)[C@H](CC4=CN=CN4)N KECFCPNPPYCGBL-PMVMPFDFSA-N 0.000 description 1
- PBJOQLUVSGXRSW-YTQUADARSA-N His-Trp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CNC3=CC=CC=C32)NC(=O)[C@H](CC4=CN=CN4)N)C(=O)O PBJOQLUVSGXRSW-YTQUADARSA-N 0.000 description 1
- KFQDSSNYWKZFOO-LSJOCFKGSA-N His-Val-Ala Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O KFQDSSNYWKZFOO-LSJOCFKGSA-N 0.000 description 1
- KDDKJKKQODQQBR-NHCYSSNCSA-N His-Val-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N KDDKJKKQODQQBR-NHCYSSNCSA-N 0.000 description 1
- WSXNWASHQNSMRX-GVXVVHGQSA-N His-Val-Gln Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N WSXNWASHQNSMRX-GVXVVHGQSA-N 0.000 description 1
- PUFNQIPSRXVLQJ-IHRRRGAJSA-N His-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N PUFNQIPSRXVLQJ-IHRRRGAJSA-N 0.000 description 1
- MCGOGXFMKHPMSQ-AVGNSLFASA-N His-Val-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC1=CN=CN1 MCGOGXFMKHPMSQ-AVGNSLFASA-N 0.000 description 1
- GBMSSORHVHAYLU-QTKMDUPCSA-N His-Val-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CC1=CN=CN1)N)O GBMSSORHVHAYLU-QTKMDUPCSA-N 0.000 description 1
- 241000282412 Homo Species 0.000 description 1
- 101000929928 Homo sapiens Angiotensin-converting enzyme 2 Proteins 0.000 description 1
- 101000878605 Homo sapiens Low affinity immunoglobulin epsilon Fc receptor Proteins 0.000 description 1
- 206010020751 Hypersensitivity Diseases 0.000 description 1
- 235000012633 Iberis amara Nutrition 0.000 description 1
- VSZALHITQINTGC-GHCJXIJMSA-N Ile-Ala-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](CC(=O)O)C(=O)O)N VSZALHITQINTGC-GHCJXIJMSA-N 0.000 description 1
- SACHLUOUHCVIKI-GMOBBJLQSA-N Ile-Arg-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N SACHLUOUHCVIKI-GMOBBJLQSA-N 0.000 description 1
- ASCFJMSGKUIRDU-ZPFDUUQYSA-N Ile-Arg-Gln Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(O)=O ASCFJMSGKUIRDU-ZPFDUUQYSA-N 0.000 description 1
- FVEWRQXNISSYFO-ZPFDUUQYSA-N Ile-Arg-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N FVEWRQXNISSYFO-ZPFDUUQYSA-N 0.000 description 1
- VZIFYHYNQDIPLI-HJWJTTGWSA-N Ile-Arg-Phe Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N VZIFYHYNQDIPLI-HJWJTTGWSA-N 0.000 description 1
- DMHGKBGOUAJRHU-RVMXOQNASA-N Ile-Arg-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N1CCC[C@@H]1C(=O)O)N DMHGKBGOUAJRHU-RVMXOQNASA-N 0.000 description 1
- DMHGKBGOUAJRHU-UHFFFAOYSA-N Ile-Arg-Pro Natural products CCC(C)C(N)C(=O)NC(CCCN=C(N)N)C(=O)N1CCCC1C(O)=O DMHGKBGOUAJRHU-UHFFFAOYSA-N 0.000 description 1
- QADCTXFNLZBZAB-GHCJXIJMSA-N Ile-Asn-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](C)C(=O)O)N QADCTXFNLZBZAB-GHCJXIJMSA-N 0.000 description 1
- QYZYJFXHXYUZMZ-UGYAYLCHSA-N Ile-Asn-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)N)C(=O)O)N QYZYJFXHXYUZMZ-UGYAYLCHSA-N 0.000 description 1
- IIXDMJNYALIKGP-DJFWLOJKSA-N Ile-Asn-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N IIXDMJNYALIKGP-DJFWLOJKSA-N 0.000 description 1
- YPQDTQJBOFOTJQ-SXTJYALSSA-N Ile-Asn-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N YPQDTQJBOFOTJQ-SXTJYALSSA-N 0.000 description 1
- XENGULNPUDGALZ-ZPFDUUQYSA-N Ile-Asn-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(C)C)C(=O)O)N XENGULNPUDGALZ-ZPFDUUQYSA-N 0.000 description 1
- HDODQNPMSHDXJT-GHCJXIJMSA-N Ile-Asn-Ser Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O HDODQNPMSHDXJT-GHCJXIJMSA-N 0.000 description 1
- NCSIQAFSIPHVAN-IUKAMOBKSA-N Ile-Asn-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N NCSIQAFSIPHVAN-IUKAMOBKSA-N 0.000 description 1
- RPZFUIQVAPZLRH-GHCJXIJMSA-N Ile-Asp-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](C)C(=O)O)N RPZFUIQVAPZLRH-GHCJXIJMSA-N 0.000 description 1
- UMYZBHKAVTXWIW-GMOBBJLQSA-N Ile-Asp-Arg Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N UMYZBHKAVTXWIW-GMOBBJLQSA-N 0.000 description 1
- NKRJALPCDNXULF-BYULHYEWSA-N Ile-Asp-Gly Chemical compound [H]N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O NKRJALPCDNXULF-BYULHYEWSA-N 0.000 description 1
- BGZIJZJBXRVBGJ-SXTJYALSSA-N Ile-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N BGZIJZJBXRVBGJ-SXTJYALSSA-N 0.000 description 1
- CCHSQWLCOOZREA-GMOBBJLQSA-N Ile-Asp-Met Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCSC)C(=O)O)N CCHSQWLCOOZREA-GMOBBJLQSA-N 0.000 description 1
- HGNUKGZQASSBKQ-PCBIJLKTSA-N Ile-Asp-Phe Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N HGNUKGZQASSBKQ-PCBIJLKTSA-N 0.000 description 1
- DCQMJRSOGCYKTR-GHCJXIJMSA-N Ile-Asp-Ser Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O DCQMJRSOGCYKTR-GHCJXIJMSA-N 0.000 description 1
- FADXGVVLSPPEQY-GHCJXIJMSA-N Ile-Cys-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(=O)N)C(=O)O)N FADXGVVLSPPEQY-GHCJXIJMSA-N 0.000 description 1
- CTHAJJYOHOBUDY-GHCJXIJMSA-N Ile-Cys-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(=O)O)C(=O)O)N CTHAJJYOHOBUDY-GHCJXIJMSA-N 0.000 description 1
- PFTFEWHJSAXGED-ZKWXMUAHSA-N Ile-Cys-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)NCC(=O)O)N PFTFEWHJSAXGED-ZKWXMUAHSA-N 0.000 description 1
- LLHYWBGDMBGNHA-VGDYDELISA-N Ile-Cys-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N LLHYWBGDMBGNHA-VGDYDELISA-N 0.000 description 1
- PPSQSIDMOVPKPI-BJDJZHNGSA-N Ile-Cys-Leu Chemical compound N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(=O)O PPSQSIDMOVPKPI-BJDJZHNGSA-N 0.000 description 1
- SYVMEYAPXRRXAN-MXAVVETBSA-N Ile-Cys-Phe Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N SYVMEYAPXRRXAN-MXAVVETBSA-N 0.000 description 1
- ZDNORQNHCJUVOV-KBIXCLLPSA-N Ile-Gln-Ala Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O ZDNORQNHCJUVOV-KBIXCLLPSA-N 0.000 description 1
- GECLQMBTZCPAFY-PEFMBERDSA-N Ile-Gln-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N GECLQMBTZCPAFY-PEFMBERDSA-N 0.000 description 1
- HTDRTKMNJRRYOJ-SIUGBPQLSA-N Ile-Gln-Tyr Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 HTDRTKMNJRRYOJ-SIUGBPQLSA-N 0.000 description 1
- KIMHKBDJQQYLHU-PEFMBERDSA-N Ile-Glu-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N KIMHKBDJQQYLHU-PEFMBERDSA-N 0.000 description 1
- LPXHYGGZJOCAFR-MNXVOIDGSA-N Ile-Glu-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC(C)C)C(=O)O)N LPXHYGGZJOCAFR-MNXVOIDGSA-N 0.000 description 1
- SPQWWEZBHXHUJN-KBIXCLLPSA-N Ile-Glu-Ser Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O SPQWWEZBHXHUJN-KBIXCLLPSA-N 0.000 description 1
- WUKLZPHVWAMZQV-UKJIMTQDSA-N Ile-Glu-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](C(C)C)C(=O)O)N WUKLZPHVWAMZQV-UKJIMTQDSA-N 0.000 description 1
- KIAOPHMUNPPGEN-PEXQALLHSA-N Ile-Gly-His Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N KIAOPHMUNPPGEN-PEXQALLHSA-N 0.000 description 1
- NYEYYMLUABXDMC-NHCYSSNCSA-N Ile-Gly-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N[C@@H](CC(C)C)C(=O)O)N NYEYYMLUABXDMC-NHCYSSNCSA-N 0.000 description 1
- GQKSJYINYYWPMR-NGZCFLSTSA-N Ile-Gly-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N1CCC[C@@H]1C(=O)O)N GQKSJYINYYWPMR-NGZCFLSTSA-N 0.000 description 1
- UAQSZXGJGLHMNV-XEGUGMAKSA-N Ile-Gly-Tyr Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N UAQSZXGJGLHMNV-XEGUGMAKSA-N 0.000 description 1
- JLWLMGADIQFKRD-QSFUFRPTSA-N Ile-His-Ala Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C)C(O)=O)CC1=CN=CN1 JLWLMGADIQFKRD-QSFUFRPTSA-N 0.000 description 1
- CMNMPCTVCWWYHY-MXAVVETBSA-N Ile-His-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CC(C)C)C(=O)O)N CMNMPCTVCWWYHY-MXAVVETBSA-N 0.000 description 1
- CCYGNFBYUNHFSC-MGHWNKPDSA-N Ile-His-Phe Chemical compound [H]N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O CCYGNFBYUNHFSC-MGHWNKPDSA-N 0.000 description 1
- APDIECQNNDGFPD-PYJNHQTQSA-N Ile-His-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](C(C)C)C(=O)O)N APDIECQNNDGFPD-PYJNHQTQSA-N 0.000 description 1
- WIZPFZKOFZXDQG-HTFCKZLJSA-N Ile-Ile-Ala Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O WIZPFZKOFZXDQG-HTFCKZLJSA-N 0.000 description 1
- HUWYGQOISIJNMK-SIGLWIIPSA-N Ile-Ile-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N HUWYGQOISIJNMK-SIGLWIIPSA-N 0.000 description 1
- MTONDYJJCIBZTK-PEDHHIEDSA-N Ile-Ile-Met Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCSC)C(=O)O)N MTONDYJJCIBZTK-PEDHHIEDSA-N 0.000 description 1
- DMSVBUWGDLYNLC-IAVJCBSLSA-N Ile-Ile-Phe Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 DMSVBUWGDLYNLC-IAVJCBSLSA-N 0.000 description 1
- AXNGDPAKKCEKGY-QPHKQPEJSA-N Ile-Ile-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N AXNGDPAKKCEKGY-QPHKQPEJSA-N 0.000 description 1
- HPCFRQWLTRDGHT-AJNGGQMLSA-N Ile-Leu-Leu Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O HPCFRQWLTRDGHT-AJNGGQMLSA-N 0.000 description 1
- IOVUXUSIGXCREV-DKIMLUQUSA-N Ile-Leu-Phe Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 IOVUXUSIGXCREV-DKIMLUQUSA-N 0.000 description 1
- PMMMQRVUMVURGJ-XUXIUFHCSA-N Ile-Leu-Pro Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(O)=O PMMMQRVUMVURGJ-XUXIUFHCSA-N 0.000 description 1
- DSDPLOODKXISDT-XUXIUFHCSA-N Ile-Leu-Val Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O DSDPLOODKXISDT-XUXIUFHCSA-N 0.000 description 1
- UIEZQYNXCYHMQS-BJDJZHNGSA-N Ile-Lys-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)O)N UIEZQYNXCYHMQS-BJDJZHNGSA-N 0.000 description 1
- PNTWNAXGBOZMBO-MNXVOIDGSA-N Ile-Lys-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N PNTWNAXGBOZMBO-MNXVOIDGSA-N 0.000 description 1
- ADDYYRVQQZFIMW-MNXVOIDGSA-N Ile-Lys-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N ADDYYRVQQZFIMW-MNXVOIDGSA-N 0.000 description 1
- GLYJPWIRLBAIJH-UHFFFAOYSA-N Ile-Lys-Pro Natural products CCC(C)C(N)C(=O)NC(CCCCN)C(=O)N1CCCC1C(O)=O GLYJPWIRLBAIJH-UHFFFAOYSA-N 0.000 description 1
- AKOYRLRUFBZOSP-BJDJZHNGSA-N Ile-Lys-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)O)N AKOYRLRUFBZOSP-BJDJZHNGSA-N 0.000 description 1
- CEPIAEUVRKGPGP-DSYPUSFNSA-N Ile-Lys-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)[C@@H](C)CC)C(O)=O)=CNC2=C1 CEPIAEUVRKGPGP-DSYPUSFNSA-N 0.000 description 1
- FFJQAEYLAQMGDL-MGHWNKPDSA-N Ile-Lys-Tyr Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 FFJQAEYLAQMGDL-MGHWNKPDSA-N 0.000 description 1
- WSSGUVAKYCQSCT-XUXIUFHCSA-N Ile-Met-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(=O)O)N WSSGUVAKYCQSCT-XUXIUFHCSA-N 0.000 description 1
- ZUPJCJINYQISSN-XUXIUFHCSA-N Ile-Met-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCCCN)C(=O)O)N ZUPJCJINYQISSN-XUXIUFHCSA-N 0.000 description 1
- FTUZWJVSNZMLPI-RVMXOQNASA-N Ile-Met-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N1CCC[C@@H]1C(=O)O)N FTUZWJVSNZMLPI-RVMXOQNASA-N 0.000 description 1
- UOPBQSJRBONRON-STECZYCISA-N Ile-Met-Tyr Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCSC)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 UOPBQSJRBONRON-STECZYCISA-N 0.000 description 1
- HQEPKOFULQTSFV-JURCDPSOSA-N Ile-Phe-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C)C(=O)O)N HQEPKOFULQTSFV-JURCDPSOSA-N 0.000 description 1
- OTSVBELRDMSPKY-PCBIJLKTSA-N Ile-Phe-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(=O)N)C(=O)O)N OTSVBELRDMSPKY-PCBIJLKTSA-N 0.000 description 1
- UAELWXJFLZBKQS-WHOFXGATSA-N Ile-Phe-Gly Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](Cc1ccccc1)C(=O)NCC(O)=O UAELWXJFLZBKQS-WHOFXGATSA-N 0.000 description 1
- FGBRXCZYVRFNKQ-MXAVVETBSA-N Ile-Phe-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)O)N FGBRXCZYVRFNKQ-MXAVVETBSA-N 0.000 description 1
- CIJLNXXMDUOFPH-HJWJTTGWSA-N Ile-Pro-Phe Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 CIJLNXXMDUOFPH-HJWJTTGWSA-N 0.000 description 1
- ZLFNNVATRMCAKN-ZKWXMUAHSA-N Ile-Ser-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)NCC(=O)O)N ZLFNNVATRMCAKN-ZKWXMUAHSA-N 0.000 description 1
- PELCGFMHLZXWBQ-BJDJZHNGSA-N Ile-Ser-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)O)N PELCGFMHLZXWBQ-BJDJZHNGSA-N 0.000 description 1
- VGSPNSSCMOHRRR-BJDJZHNGSA-N Ile-Ser-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)O)N VGSPNSSCMOHRRR-BJDJZHNGSA-N 0.000 description 1
- JNLSTRPWUXOORL-MMWGEVLESA-N Ile-Ser-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N1CCC[C@@H]1C(=O)O)N JNLSTRPWUXOORL-MMWGEVLESA-N 0.000 description 1
- PXKACEXYLPBMAD-JBDRJPRFSA-N Ile-Ser-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)O)N PXKACEXYLPBMAD-JBDRJPRFSA-N 0.000 description 1
- RQJUKVXWAKJDBW-SVSWQMSJSA-N Ile-Ser-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N RQJUKVXWAKJDBW-SVSWQMSJSA-N 0.000 description 1
- SAEWJTCJQVZQNZ-IUKAMOBKSA-N Ile-Thr-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N SAEWJTCJQVZQNZ-IUKAMOBKSA-N 0.000 description 1
- COWHUQXTSYTKQC-RWRJDSDZSA-N Ile-Thr-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N COWHUQXTSYTKQC-RWRJDSDZSA-N 0.000 description 1
- GMUYXHHJAGQHGB-TUBUOCAGSA-N Ile-Thr-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N GMUYXHHJAGQHGB-TUBUOCAGSA-N 0.000 description 1
- KBDIBHQICWDGDL-PPCPHDFISA-N Ile-Thr-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)O)N KBDIBHQICWDGDL-PPCPHDFISA-N 0.000 description 1
- ANTFEOSJMAUGIB-KNZXXDILSA-N Ile-Thr-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@@H]1C(=O)O)N ANTFEOSJMAUGIB-KNZXXDILSA-N 0.000 description 1
- NURNJECQNNCRBK-FLBSBUHZSA-N Ile-Thr-Thr Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O NURNJECQNNCRBK-FLBSBUHZSA-N 0.000 description 1
- BZUOLKFQVVBTJY-SLBDDTMCSA-N Ile-Trp-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CC(=O)N)C(=O)O)N BZUOLKFQVVBTJY-SLBDDTMCSA-N 0.000 description 1
- RTSQPLLOYSGMKM-DSYPUSFNSA-N Ile-Trp-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CC(C)C)C(=O)O)N RTSQPLLOYSGMKM-DSYPUSFNSA-N 0.000 description 1
- YBHKCXNNNVDYEB-SPOWBLRKSA-N Ile-Trp-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CO)C(=O)O)N YBHKCXNNNVDYEB-SPOWBLRKSA-N 0.000 description 1
- MGUTVMBNOMJLKC-VKOGCVSHSA-N Ile-Trp-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](C(C)C)C(=O)O)N MGUTVMBNOMJLKC-VKOGCVSHSA-N 0.000 description 1
- OMDWJWGZGMCQND-CFMVVWHZSA-N Ile-Tyr-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N OMDWJWGZGMCQND-CFMVVWHZSA-N 0.000 description 1
- PMAOIIWHZHAPBT-HJPIBITLSA-N Ile-Tyr-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CS)C(=O)O)N PMAOIIWHZHAPBT-HJPIBITLSA-N 0.000 description 1
- ZGKVPOSSTGHJAF-HJPIBITLSA-N Ile-Tyr-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CO)C(=O)O)N ZGKVPOSSTGHJAF-HJPIBITLSA-N 0.000 description 1
- YJRSIJZUIUANHO-NAKRPEOUSA-N Ile-Val-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(=O)O)N YJRSIJZUIUANHO-NAKRPEOUSA-N 0.000 description 1
- IPFKIGNDTUOFAF-CYDGBPFRSA-N Ile-Val-Arg Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N IPFKIGNDTUOFAF-CYDGBPFRSA-N 0.000 description 1
- AUIYHFRUOOKTGX-UKJIMTQDSA-N Ile-Val-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N AUIYHFRUOOKTGX-UKJIMTQDSA-N 0.000 description 1
- DLEBSGAVWRPTIX-PEDHHIEDSA-N Ile-Val-Ile Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)[C@@H](C)CC DLEBSGAVWRPTIX-PEDHHIEDSA-N 0.000 description 1
- UYODHPPSCXBNCS-XUXIUFHCSA-N Ile-Val-Leu Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC(C)C UYODHPPSCXBNCS-XUXIUFHCSA-N 0.000 description 1
- NJGXXYLPDMMFJB-XUXIUFHCSA-N Ile-Val-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCCN)C(=O)O)N NJGXXYLPDMMFJB-XUXIUFHCSA-N 0.000 description 1
- RQZFWBLDTBDEOF-RNJOBUHISA-N Ile-Val-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N RQZFWBLDTBDEOF-RNJOBUHISA-N 0.000 description 1
- JZBVBOKASHNXAD-NAKRPEOUSA-N Ile-Val-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(=O)O)N JZBVBOKASHNXAD-NAKRPEOUSA-N 0.000 description 1
- APQYGMBHIVXFML-OSUNSFLBSA-N Ile-Val-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N APQYGMBHIVXFML-OSUNSFLBSA-N 0.000 description 1
- SWNRZNLXMXRCJC-VKOGCVSHSA-N Ile-Val-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)[C@@H](C)CC)C(O)=O)=CNC2=C1 SWNRZNLXMXRCJC-VKOGCVSHSA-N 0.000 description 1
- 102000002227 Interferon Type I Human genes 0.000 description 1
- 108010014726 Interferon Type I Proteins 0.000 description 1
- 108010002352 Interleukin-1 Proteins 0.000 description 1
- 108010065805 Interleukin-12 Proteins 0.000 description 1
- 108010002350 Interleukin-2 Proteins 0.000 description 1
- OTAMFXXAGYBAQL-YXMSTPNBSA-N Kentsin Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCCN)C(O)=O OTAMFXXAGYBAQL-YXMSTPNBSA-N 0.000 description 1
- IBMVEYRWAWIOTN-UHFFFAOYSA-N L-Leucyl-L-Arginyl-L-Proline Natural products CC(C)CC(N)C(=O)NC(CCCN=C(N)N)C(=O)N1CCCC1C(O)=O IBMVEYRWAWIOTN-UHFFFAOYSA-N 0.000 description 1
- LZDNBBYBDGBADK-UHFFFAOYSA-N L-valyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)C(C)C)C(O)=O)=CNC2=C1 LZDNBBYBDGBADK-UHFFFAOYSA-N 0.000 description 1
- ZRLUISBDKUWAIZ-CIUDSAMLSA-N Leu-Ala-Asp Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC(O)=O ZRLUISBDKUWAIZ-CIUDSAMLSA-N 0.000 description 1
- KVRKAGGMEWNURO-CIUDSAMLSA-N Leu-Ala-Cys Chemical compound C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC(C)C)N KVRKAGGMEWNURO-CIUDSAMLSA-N 0.000 description 1
- SUPVSFFZWVOEOI-CQDKDKBSSA-N Leu-Ala-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 SUPVSFFZWVOEOI-CQDKDKBSSA-N 0.000 description 1
- GRZSCTXVCDUIPO-SRVKXCTJSA-N Leu-Arg-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(O)=O GRZSCTXVCDUIPO-SRVKXCTJSA-N 0.000 description 1
- HASRFYOMVPJRPU-SRVKXCTJSA-N Leu-Arg-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCC(O)=O)C(O)=O HASRFYOMVPJRPU-SRVKXCTJSA-N 0.000 description 1
- IBMVEYRWAWIOTN-RWMBFGLXSA-N Leu-Arg-Pro Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N1CCC[C@@H]1C(O)=O IBMVEYRWAWIOTN-RWMBFGLXSA-N 0.000 description 1
- CUXRXAIAVYLVFD-ULQDDVLXSA-N Leu-Arg-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 CUXRXAIAVYLVFD-ULQDDVLXSA-N 0.000 description 1
- DBVWMYGBVFCRBE-CIUDSAMLSA-N Leu-Asn-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O DBVWMYGBVFCRBE-CIUDSAMLSA-N 0.000 description 1
- BAJIJEGGUYXZGC-CIUDSAMLSA-N Leu-Asn-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CS)C(=O)O)N BAJIJEGGUYXZGC-CIUDSAMLSA-N 0.000 description 1
- RFUBXQQFJFGJFV-GUBZILKMSA-N Leu-Asn-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O RFUBXQQFJFGJFV-GUBZILKMSA-N 0.000 description 1
- KKXDHFKZWKLYGB-GUBZILKMSA-N Leu-Asn-Glu Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N KKXDHFKZWKLYGB-GUBZILKMSA-N 0.000 description 1
- OXKYZSRZKBTVEY-ZPFDUUQYSA-N Leu-Asn-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O OXKYZSRZKBTVEY-ZPFDUUQYSA-N 0.000 description 1
- JKGHDYGZRDWHGA-SRVKXCTJSA-N Leu-Asn-Leu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O JKGHDYGZRDWHGA-SRVKXCTJSA-N 0.000 description 1
- WGNOPSQMIQERPK-GARJFASQSA-N Leu-Asn-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N1CCC[C@@H]1C(=O)O)N WGNOPSQMIQERPK-GARJFASQSA-N 0.000 description 1
- WGNOPSQMIQERPK-UHFFFAOYSA-N Leu-Asn-Pro Natural products CC(C)CC(N)C(=O)NC(CC(=O)N)C(=O)N1CCCC1C(=O)O WGNOPSQMIQERPK-UHFFFAOYSA-N 0.000 description 1
- TWQIYNGNYNJUFM-NHCYSSNCSA-N Leu-Asn-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O TWQIYNGNYNJUFM-NHCYSSNCSA-N 0.000 description 1
- BPANDPNDMJHFEV-CIUDSAMLSA-N Leu-Asp-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O BPANDPNDMJHFEV-CIUDSAMLSA-N 0.000 description 1
- PJYSOYLLTJKZHC-GUBZILKMSA-N Leu-Asp-Gln Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCC(N)=O PJYSOYLLTJKZHC-GUBZILKMSA-N 0.000 description 1
- ILJREDZFPHTUIE-GUBZILKMSA-N Leu-Asp-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O ILJREDZFPHTUIE-GUBZILKMSA-N 0.000 description 1
- IIKJNQWOQIWWMR-CIUDSAMLSA-N Leu-Cys-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CC(C)C)N IIKJNQWOQIWWMR-CIUDSAMLSA-N 0.000 description 1
- NFHJQETXTSDZSI-DCAQKATOSA-N Leu-Cys-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O NFHJQETXTSDZSI-DCAQKATOSA-N 0.000 description 1
- DKEZVKFLETVJFY-CIUDSAMLSA-N Leu-Cys-Asn Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(=O)N)C(=O)O)N DKEZVKFLETVJFY-CIUDSAMLSA-N 0.000 description 1
- GZAUZBUKDXYPEH-CIUDSAMLSA-N Leu-Cys-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CS)C(=O)O)N GZAUZBUKDXYPEH-CIUDSAMLSA-N 0.000 description 1
- QKIBIXAQKAFZGL-GUBZILKMSA-N Leu-Cys-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCC(N)=O)C(O)=O QKIBIXAQKAFZGL-GUBZILKMSA-N 0.000 description 1
- KWURTLAFFDOTEQ-GUBZILKMSA-N Leu-Cys-Glu Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N KWURTLAFFDOTEQ-GUBZILKMSA-N 0.000 description 1
- NHHKSOGJYNQENP-SRVKXCTJSA-N Leu-Cys-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CCCCN)C(=O)O)N NHHKSOGJYNQENP-SRVKXCTJSA-N 0.000 description 1
- VFQOCUQGMUXTJR-DCAQKATOSA-N Leu-Cys-Met Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CCSC)C(=O)O)N VFQOCUQGMUXTJR-DCAQKATOSA-N 0.000 description 1
- YORLGJINWYYIMX-KKUMJFAQSA-N Leu-Cys-Phe Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O YORLGJINWYYIMX-KKUMJFAQSA-N 0.000 description 1
- HQPHMEPBNUHPKD-XIRDDKMYSA-N Leu-Cys-Trp Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N HQPHMEPBNUHPKD-XIRDDKMYSA-N 0.000 description 1
- PIHFVNPEAHFNLN-KKUMJFAQSA-N Leu-Cys-Tyr Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N PIHFVNPEAHFNLN-KKUMJFAQSA-N 0.000 description 1
- WCTCIIAGNMFYAO-DCAQKATOSA-N Leu-Cys-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CS)C(=O)N[C@@H](C(C)C)C(O)=O WCTCIIAGNMFYAO-DCAQKATOSA-N 0.000 description 1
- VPKIQULSKFVCSM-SRVKXCTJSA-N Leu-Gln-Arg Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O VPKIQULSKFVCSM-SRVKXCTJSA-N 0.000 description 1
- ZYLJULGXQDNXDK-GUBZILKMSA-N Leu-Gln-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O ZYLJULGXQDNXDK-GUBZILKMSA-N 0.000 description 1
- DXYBNWJZJVSZAE-GUBZILKMSA-N Leu-Gln-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CS)C(=O)O)N DXYBNWJZJVSZAE-GUBZILKMSA-N 0.000 description 1
- KAFOIVJDVSZUMD-DCAQKATOSA-N Leu-Gln-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O KAFOIVJDVSZUMD-DCAQKATOSA-N 0.000 description 1
- ZTLGVASZOIKNIX-DCAQKATOSA-N Leu-Gln-Glu Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N ZTLGVASZOIKNIX-DCAQKATOSA-N 0.000 description 1
- DPWGZWUMUUJQDT-IUCAKERBSA-N Leu-Gln-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O DPWGZWUMUUJQDT-IUCAKERBSA-N 0.000 description 1
- LOLUPZNNADDTAA-AVGNSLFASA-N Leu-Gln-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O LOLUPZNNADDTAA-AVGNSLFASA-N 0.000 description 1
- FQZPTCNSNPWHLJ-AVGNSLFASA-N Leu-Gln-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCCN)C(O)=O FQZPTCNSNPWHLJ-AVGNSLFASA-N 0.000 description 1
- YVKSMSDXKMSIRX-GUBZILKMSA-N Leu-Glu-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O YVKSMSDXKMSIRX-GUBZILKMSA-N 0.000 description 1
- WMTOVWLLDGQGCV-GUBZILKMSA-N Leu-Glu-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CS)C(=O)O)N WMTOVWLLDGQGCV-GUBZILKMSA-N 0.000 description 1
- KVMULWOHPPMHHE-DCAQKATOSA-N Leu-Glu-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O KVMULWOHPPMHHE-DCAQKATOSA-N 0.000 description 1
- NEEOBPIXKWSBRF-IUCAKERBSA-N Leu-Glu-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O NEEOBPIXKWSBRF-IUCAKERBSA-N 0.000 description 1
- PRZVBIAOPFGAQF-SRVKXCTJSA-N Leu-Glu-Met Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCSC)C(O)=O PRZVBIAOPFGAQF-SRVKXCTJSA-N 0.000 description 1
- ZFNLIDNJUWNIJL-WDCWCFNPSA-N Leu-Glu-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ZFNLIDNJUWNIJL-WDCWCFNPSA-N 0.000 description 1
- LLBQJYDYOLIQAI-JYJNAYRXSA-N Leu-Glu-Tyr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O LLBQJYDYOLIQAI-JYJNAYRXSA-N 0.000 description 1
- HVJVUYQWFYMGJS-GVXVVHGQSA-N Leu-Glu-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O HVJVUYQWFYMGJS-GVXVVHGQSA-N 0.000 description 1
- LAPSXOAUPNOINL-YUMQZZPRSA-N Leu-Gly-Asp Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC(O)=O LAPSXOAUPNOINL-YUMQZZPRSA-N 0.000 description 1
- QPXBPQUGXHURGP-UWVGGRQHSA-N Leu-Gly-Met Chemical compound CC(C)C[C@@H](C(=O)NCC(=O)N[C@@H](CCSC)C(=O)O)N QPXBPQUGXHURGP-UWVGGRQHSA-N 0.000 description 1
- UCDHVOALNXENLC-KBPBESRZSA-N Leu-Gly-Tyr Chemical compound CC(C)C[C@H]([NH3+])C(=O)NCC(=O)N[C@H](C([O-])=O)CC1=CC=C(O)C=C1 UCDHVOALNXENLC-KBPBESRZSA-N 0.000 description 1
- PBGDOSARRIJMEV-DLOVCJGASA-N Leu-His-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(O)=O PBGDOSARRIJMEV-DLOVCJGASA-N 0.000 description 1
- CFZZDVMBRYFFNU-QWRGUYRKSA-N Leu-His-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)NCC(O)=O CFZZDVMBRYFFNU-QWRGUYRKSA-N 0.000 description 1
- XQXGNBFMAXWIGI-MXAVVETBSA-N Leu-His-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CC(C)C)CC1=CN=CN1 XQXGNBFMAXWIGI-MXAVVETBSA-N 0.000 description 1
- CSFVADKICPDRRF-KKUMJFAQSA-N Leu-His-Leu Chemical compound CC(C)C[C@H]([NH3+])C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C([O-])=O)CC1=CN=CN1 CSFVADKICPDRRF-KKUMJFAQSA-N 0.000 description 1
- XBCWOTOCBXXJDG-BZSNNMDCSA-N Leu-His-Phe Chemical compound C([C@H](NC(=O)[C@@H](N)CC(C)C)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CN=CN1 XBCWOTOCBXXJDG-BZSNNMDCSA-N 0.000 description 1
- OYQUOLRTJHWVSQ-SRVKXCTJSA-N Leu-His-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(O)=O OYQUOLRTJHWVSQ-SRVKXCTJSA-N 0.000 description 1
- HMDDEJADNKQTBR-BZSNNMDCSA-N Leu-His-Tyr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O HMDDEJADNKQTBR-BZSNNMDCSA-N 0.000 description 1
- SGIIOQQGLUUMDQ-IHRRRGAJSA-N Leu-His-Val Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](C(C)C)C(=O)O)N SGIIOQQGLUUMDQ-IHRRRGAJSA-N 0.000 description 1
- KOSWSHVQIVTVQF-ZPFDUUQYSA-N Leu-Ile-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O KOSWSHVQIVTVQF-ZPFDUUQYSA-N 0.000 description 1
- AUBMZAMQCOYSIC-MNXVOIDGSA-N Leu-Ile-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(N)=O)C(O)=O AUBMZAMQCOYSIC-MNXVOIDGSA-N 0.000 description 1
- SEMUSFOBZGKBGW-YTFOTSKYSA-N Leu-Ile-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O SEMUSFOBZGKBGW-YTFOTSKYSA-N 0.000 description 1
- QLDHBYRUNQZIJQ-DKIMLUQUSA-N Leu-Ile-Phe Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O QLDHBYRUNQZIJQ-DKIMLUQUSA-N 0.000 description 1
- OMHLATXVNQSALM-FQUUOJAGSA-N Leu-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(C)C)N OMHLATXVNQSALM-FQUUOJAGSA-N 0.000 description 1
- HRTRLSRYZZKPCO-BJDJZHNGSA-N Leu-Ile-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O HRTRLSRYZZKPCO-BJDJZHNGSA-N 0.000 description 1
- IFMPDNRWZZEZSL-SRVKXCTJSA-N Leu-Leu-Cys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CS)C(O)=O IFMPDNRWZZEZSL-SRVKXCTJSA-N 0.000 description 1
- YOKVEHGYYQEQOP-QWRGUYRKSA-N Leu-Leu-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O YOKVEHGYYQEQOP-QWRGUYRKSA-N 0.000 description 1
- KYIIALJHAOIAHF-KKUMJFAQSA-N Leu-Leu-His Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 KYIIALJHAOIAHF-KKUMJFAQSA-N 0.000 description 1
- RXGLHDWAZQECBI-SRVKXCTJSA-N Leu-Leu-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O RXGLHDWAZQECBI-SRVKXCTJSA-N 0.000 description 1
- UCNNZELZXFXXJQ-BZSNNMDCSA-N Leu-Leu-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 UCNNZELZXFXXJQ-BZSNNMDCSA-N 0.000 description 1
- ZRHDPZAAWLXXIR-SRVKXCTJSA-N Leu-Lys-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O ZRHDPZAAWLXXIR-SRVKXCTJSA-N 0.000 description 1
- RZXLZBIUTDQHJQ-SRVKXCTJSA-N Leu-Lys-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O RZXLZBIUTDQHJQ-SRVKXCTJSA-N 0.000 description 1
- HVHRPWQEQHIQJF-AVGNSLFASA-N Leu-Lys-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O HVHRPWQEQHIQJF-AVGNSLFASA-N 0.000 description 1
- FKQPWMZLIIATBA-AJNGGQMLSA-N Leu-Lys-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O FKQPWMZLIIATBA-AJNGGQMLSA-N 0.000 description 1
- BGZCJDGBBUUBHA-KKUMJFAQSA-N Leu-Lys-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O BGZCJDGBBUUBHA-KKUMJFAQSA-N 0.000 description 1
- QNTJIDXQHWUBKC-BZSNNMDCSA-N Leu-Lys-Phe Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O QNTJIDXQHWUBKC-BZSNNMDCSA-N 0.000 description 1
- OVZLLFONXILPDZ-VOAKCMCISA-N Leu-Lys-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OVZLLFONXILPDZ-VOAKCMCISA-N 0.000 description 1
- CPONGMJGVIAWEH-DCAQKATOSA-N Leu-Met-Ala Chemical compound CSCC[C@H](NC(=O)[C@@H](N)CC(C)C)C(=O)N[C@@H](C)C(O)=O CPONGMJGVIAWEH-DCAQKATOSA-N 0.000 description 1
- WXZOHBVPVKABQN-DCAQKATOSA-N Leu-Met-Asp Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(=O)O)C(=O)O)N WXZOHBVPVKABQN-DCAQKATOSA-N 0.000 description 1
- GNRPTBRHRRZCMA-RWMBFGLXSA-N Leu-Met-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCSC)C(=O)N1CCC[C@@H]1C(=O)O)N GNRPTBRHRRZCMA-RWMBFGLXSA-N 0.000 description 1
- GSSMYQHXZNERFX-WDSOQIARSA-N Leu-Met-Trp Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N GSSMYQHXZNERFX-WDSOQIARSA-N 0.000 description 1
- ZDBMWELMUCLUPL-QEJZJMRPSA-N Leu-Phe-Ala Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C)C(O)=O)CC1=CC=CC=C1 ZDBMWELMUCLUPL-QEJZJMRPSA-N 0.000 description 1
- BIZNDKMFQHDOIE-KKUMJFAQSA-N Leu-Phe-Asn Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(N)=O)C(O)=O)CC1=CC=CC=C1 BIZNDKMFQHDOIE-KKUMJFAQSA-N 0.000 description 1
- ZAVCJRJOQKIOJW-KKUMJFAQSA-N Leu-Phe-Asp Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(O)=O)C(O)=O)CC1=CC=CC=C1 ZAVCJRJOQKIOJW-KKUMJFAQSA-N 0.000 description 1
- INCJJHQRZGQLFC-KBPBESRZSA-N Leu-Phe-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)NCC(O)=O INCJJHQRZGQLFC-KBPBESRZSA-N 0.000 description 1
- YWKNKRAKOCLOLH-OEAJRASXSA-N Leu-Phe-Thr Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H]([C@@H](C)O)C(O)=O)CC1=CC=CC=C1 YWKNKRAKOCLOLH-OEAJRASXSA-N 0.000 description 1
- MAXILRZVORNXBE-PMVMPFDFSA-N Leu-Phe-Trp Chemical compound C([C@H](NC(=O)[C@@H](N)CC(C)C)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(O)=O)C1=CC=CC=C1 MAXILRZVORNXBE-PMVMPFDFSA-N 0.000 description 1
- MVVSHHJKJRZVNY-ACRUOGEOSA-N Leu-Phe-Tyr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O MVVSHHJKJRZVNY-ACRUOGEOSA-N 0.000 description 1
- WMIOEVKKYIMVKI-DCAQKATOSA-N Leu-Pro-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O WMIOEVKKYIMVKI-DCAQKATOSA-N 0.000 description 1
- QMKFDEUJGYNFMC-AVGNSLFASA-N Leu-Pro-Arg Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCN=C(N)N)C(O)=O QMKFDEUJGYNFMC-AVGNSLFASA-N 0.000 description 1
- DPURXCQCHSQPAN-AVGNSLFASA-N Leu-Pro-Pro Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 DPURXCQCHSQPAN-AVGNSLFASA-N 0.000 description 1
- XGDCYUQSFDQISZ-BQBZGAKWSA-N Leu-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(O)=O XGDCYUQSFDQISZ-BQBZGAKWSA-N 0.000 description 1
- IRMLZWSRWSGTOP-CIUDSAMLSA-N Leu-Ser-Ala Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O IRMLZWSRWSGTOP-CIUDSAMLSA-N 0.000 description 1
- IZPVWNSAVUQBGP-CIUDSAMLSA-N Leu-Ser-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O IZPVWNSAVUQBGP-CIUDSAMLSA-N 0.000 description 1
- AKVBOOKXVAMKSS-GUBZILKMSA-N Leu-Ser-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(O)=O AKVBOOKXVAMKSS-GUBZILKMSA-N 0.000 description 1
- AMSSKPUHBUQBOQ-SRVKXCTJSA-N Leu-Ser-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)O)N AMSSKPUHBUQBOQ-SRVKXCTJSA-N 0.000 description 1
- GOFJOGXGMPHOGL-DCAQKATOSA-N Leu-Ser-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC(C)C GOFJOGXGMPHOGL-DCAQKATOSA-N 0.000 description 1
- SBANPBVRHYIMRR-GARJFASQSA-N Leu-Ser-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CO)C(=O)N1CCC[C@@H]1C(=O)O)N SBANPBVRHYIMRR-GARJFASQSA-N 0.000 description 1
- PPGBXYKMUMHFBF-KATARQTJSA-N Leu-Ser-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PPGBXYKMUMHFBF-KATARQTJSA-N 0.000 description 1
- HWMQRQIFVGEAPH-XIRDDKMYSA-N Leu-Ser-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC(C)C)C(O)=O)=CNC2=C1 HWMQRQIFVGEAPH-XIRDDKMYSA-N 0.000 description 1
- ZDJQVSIPFLMNOX-RHYQMDGZSA-N Leu-Thr-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N ZDJQVSIPFLMNOX-RHYQMDGZSA-N 0.000 description 1
- LINKCQUOMUDLKN-KATARQTJSA-N Leu-Thr-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC(C)C)N)O LINKCQUOMUDLKN-KATARQTJSA-N 0.000 description 1
- VDIARPPNADFEAV-WEDXCCLWSA-N Leu-Thr-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O VDIARPPNADFEAV-WEDXCCLWSA-N 0.000 description 1
- LJBVRCDPWOJOEK-PPCPHDFISA-N Leu-Thr-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O LJBVRCDPWOJOEK-PPCPHDFISA-N 0.000 description 1
- DAYQSYGBCUKVKT-VOAKCMCISA-N Leu-Thr-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(O)=O DAYQSYGBCUKVKT-VOAKCMCISA-N 0.000 description 1
- GZRABTMNWJXFMH-UVOCVTCTSA-N Leu-Thr-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GZRABTMNWJXFMH-UVOCVTCTSA-N 0.000 description 1
- AIQWYVFNBNNOLU-RHYQMDGZSA-N Leu-Thr-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O AIQWYVFNBNNOLU-RHYQMDGZSA-N 0.000 description 1
- CNWDWAMPKVYJJB-NUTKFTJISA-N Leu-Trp-Ala Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](N)CC(C)C)C(=O)N[C@@H](C)C(O)=O)=CNC2=C1 CNWDWAMPKVYJJB-NUTKFTJISA-N 0.000 description 1
- LFXSPAIBSZSTEM-PMVMPFDFSA-N Leu-Trp-Phe Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CC3=CC=CC=C3)C(=O)O)N LFXSPAIBSZSTEM-PMVMPFDFSA-N 0.000 description 1
- ONHCDMBHPQIPAI-YTQUADARSA-N Leu-Trp-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N3CCC[C@@H]3C(=O)O)N ONHCDMBHPQIPAI-YTQUADARSA-N 0.000 description 1
- RIHIGSWBLHSGLV-CQDKDKBSSA-N Leu-Tyr-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C)C(O)=O RIHIGSWBLHSGLV-CQDKDKBSSA-N 0.000 description 1
- UCRJTSIIAYHOHE-ULQDDVLXSA-N Leu-Tyr-Arg Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N UCRJTSIIAYHOHE-ULQDDVLXSA-N 0.000 description 1
- WUHBLPVELFTPQK-KKUMJFAQSA-N Leu-Tyr-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(O)=O WUHBLPVELFTPQK-KKUMJFAQSA-N 0.000 description 1
- ISSAURVGLGAPDK-KKUMJFAQSA-N Leu-Tyr-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O ISSAURVGLGAPDK-KKUMJFAQSA-N 0.000 description 1
- SXOFUVGLPHCPRQ-KKUMJFAQSA-N Leu-Tyr-Cys Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CS)C(O)=O SXOFUVGLPHCPRQ-KKUMJFAQSA-N 0.000 description 1
- WFCKERTZVCQXKH-KBPBESRZSA-N Leu-Tyr-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(O)=O WFCKERTZVCQXKH-KBPBESRZSA-N 0.000 description 1
- OZTZJMUZVAVJGY-BZSNNMDCSA-N Leu-Tyr-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N OZTZJMUZVAVJGY-BZSNNMDCSA-N 0.000 description 1
- BTEMNFBEAAOGBR-BZSNNMDCSA-N Leu-Tyr-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCCCN)C(=O)O)N BTEMNFBEAAOGBR-BZSNNMDCSA-N 0.000 description 1
- VUBIPAHVHMZHCM-KKUMJFAQSA-N Leu-Tyr-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CO)C(O)=O)CC1=CC=C(O)C=C1 VUBIPAHVHMZHCM-KKUMJFAQSA-N 0.000 description 1
- YIRIDPUGZKHMHT-ACRUOGEOSA-N Leu-Tyr-Tyr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O YIRIDPUGZKHMHT-ACRUOGEOSA-N 0.000 description 1
- CGHXMODRYJISSK-NHCYSSNCSA-N Leu-Val-Asp Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC(O)=O CGHXMODRYJISSK-NHCYSSNCSA-N 0.000 description 1
- LMDVGHQPPPLYAR-IHRRRGAJSA-N Leu-Val-His Chemical compound N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)O LMDVGHQPPPLYAR-IHRRRGAJSA-N 0.000 description 1
- FMFNIDICDKEMOE-XUXIUFHCSA-N Leu-Val-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O FMFNIDICDKEMOE-XUXIUFHCSA-N 0.000 description 1
- XOEDPXDZJHBQIX-ULQDDVLXSA-N Leu-Val-Phe Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 XOEDPXDZJHBQIX-ULQDDVLXSA-N 0.000 description 1
- 102100038007 Low affinity immunoglobulin epsilon Fc receptor Human genes 0.000 description 1
- XFIHDSBIPWEYJJ-YUMQZZPRSA-N Lys-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H](N)CCCCN XFIHDSBIPWEYJJ-YUMQZZPRSA-N 0.000 description 1
- BTSXLXFPMZXVPR-DLOVCJGASA-N Lys-Ala-His Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CCCCN)N BTSXLXFPMZXVPR-DLOVCJGASA-N 0.000 description 1
- YIBOAHAOAWACDK-QEJZJMRPSA-N Lys-Ala-Phe Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 YIBOAHAOAWACDK-QEJZJMRPSA-N 0.000 description 1
- VHNOAIFVYUQOOY-XUXIUFHCSA-N Lys-Arg-Ile Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O VHNOAIFVYUQOOY-XUXIUFHCSA-N 0.000 description 1
- WALVCOOOKULCQM-ULQDDVLXSA-N Lys-Arg-Phe Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O WALVCOOOKULCQM-ULQDDVLXSA-N 0.000 description 1
- NQCJGQHHYZNUDK-DCAQKATOSA-N Lys-Arg-Ser Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CO)C(O)=O)CCCN=C(N)N NQCJGQHHYZNUDK-DCAQKATOSA-N 0.000 description 1
- SWWCDAGDQHTKIE-RHYQMDGZSA-N Lys-Arg-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SWWCDAGDQHTKIE-RHYQMDGZSA-N 0.000 description 1
- GGAPIOORBXHMNY-ULQDDVLXSA-N Lys-Arg-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CCCCN)N)O GGAPIOORBXHMNY-ULQDDVLXSA-N 0.000 description 1
- NTSPQIONFJUMJV-AVGNSLFASA-N Lys-Arg-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(O)=O NTSPQIONFJUMJV-AVGNSLFASA-N 0.000 description 1
- ABHIXYDMILIUKV-CIUDSAMLSA-N Lys-Asn-Asn Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O ABHIXYDMILIUKV-CIUDSAMLSA-N 0.000 description 1
- 108010062166 Lys-Asn-Asp Proteins 0.000 description 1
- BYPMOIFBQPEWOH-CIUDSAMLSA-N Lys-Asn-Asp Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N BYPMOIFBQPEWOH-CIUDSAMLSA-N 0.000 description 1
- DEFGUIIUYAUEDU-ZPFDUUQYSA-N Lys-Asn-Ile Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O DEFGUIIUYAUEDU-ZPFDUUQYSA-N 0.000 description 1
- ZQCVMVCVPFYXHZ-SRVKXCTJSA-N Lys-Asn-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CCCCN ZQCVMVCVPFYXHZ-SRVKXCTJSA-N 0.000 description 1
- YVSHZSUKQHNDHD-KKUMJFAQSA-N Lys-Asn-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCCCN)N YVSHZSUKQHNDHD-KKUMJFAQSA-N 0.000 description 1
- DGWXCIORNLWGGG-CIUDSAMLSA-N Lys-Asn-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O DGWXCIORNLWGGG-CIUDSAMLSA-N 0.000 description 1
- SQXUUGUCGJSWCK-CIUDSAMLSA-N Lys-Asp-Cys Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CS)C(=O)O)N SQXUUGUCGJSWCK-CIUDSAMLSA-N 0.000 description 1
- KWUKZRFFKPLUPE-HJGDQZAQSA-N Lys-Asp-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KWUKZRFFKPLUPE-HJGDQZAQSA-N 0.000 description 1
- SVJRVFPSHPGWFF-DCAQKATOSA-N Lys-Cys-Arg Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O SVJRVFPSHPGWFF-DCAQKATOSA-N 0.000 description 1
- MLLKLNYPZRDIQG-GUBZILKMSA-N Lys-Cys-Gln Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N MLLKLNYPZRDIQG-GUBZILKMSA-N 0.000 description 1
- SSYOBDBNBQBSQE-SRVKXCTJSA-N Lys-Cys-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(O)=O SSYOBDBNBQBSQE-SRVKXCTJSA-N 0.000 description 1
- BYEBKXRNDLTGFW-CIUDSAMLSA-N Lys-Cys-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CS)C(=O)N[C@@H](CO)C(O)=O BYEBKXRNDLTGFW-CIUDSAMLSA-N 0.000 description 1
- DZQYZKPINJLLEN-KKUMJFAQSA-N Lys-Cys-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CCCCN)N)O DZQYZKPINJLLEN-KKUMJFAQSA-N 0.000 description 1
- WTZUSCUIVPVCRH-SRVKXCTJSA-N Lys-Gln-Arg Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N WTZUSCUIVPVCRH-SRVKXCTJSA-N 0.000 description 1
- GUYHHBZCBQZLFW-GUBZILKMSA-N Lys-Gln-Cys Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CS)C(=O)O)N GUYHHBZCBQZLFW-GUBZILKMSA-N 0.000 description 1
- MRWXLRGAFDOILG-DCAQKATOSA-N Lys-Gln-Gln Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O MRWXLRGAFDOILG-DCAQKATOSA-N 0.000 description 1
- VSRXPEHZMHSFKU-IUCAKERBSA-N Lys-Gln-Gly Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O VSRXPEHZMHSFKU-IUCAKERBSA-N 0.000 description 1
- NNCDAORZCMPZPX-GUBZILKMSA-N Lys-Gln-Ser Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CO)C(=O)O)N NNCDAORZCMPZPX-GUBZILKMSA-N 0.000 description 1
- HEWWNLVEWBJBKA-WDCWCFNPSA-N Lys-Gln-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CCCCN HEWWNLVEWBJBKA-WDCWCFNPSA-N 0.000 description 1
- IRRZDAIFYHNIIN-JYJNAYRXSA-N Lys-Gln-Tyr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O IRRZDAIFYHNIIN-JYJNAYRXSA-N 0.000 description 1
- PBIPLDMFHAICIP-DCAQKATOSA-N Lys-Glu-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O PBIPLDMFHAICIP-DCAQKATOSA-N 0.000 description 1
- DCRWPTBMWMGADO-AVGNSLFASA-N Lys-Glu-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O DCRWPTBMWMGADO-AVGNSLFASA-N 0.000 description 1
- IMAKMJCBYCSMHM-AVGNSLFASA-N Lys-Glu-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN IMAKMJCBYCSMHM-AVGNSLFASA-N 0.000 description 1
- VEGLGAOVLFODGC-GUBZILKMSA-N Lys-Glu-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O VEGLGAOVLFODGC-GUBZILKMSA-N 0.000 description 1
- ULUQBUKAPDUKOC-GVXVVHGQSA-N Lys-Glu-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O ULUQBUKAPDUKOC-GVXVVHGQSA-N 0.000 description 1
- QZONCCHVHCOBSK-YUMQZZPRSA-N Lys-Gly-Asn Chemical compound [H]N[C@@H](CCCCN)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O QZONCCHVHCOBSK-YUMQZZPRSA-N 0.000 description 1
- PBLLTSKBTAHDNA-KBPBESRZSA-N Lys-Gly-Phe Chemical compound [H]N[C@@H](CCCCN)C(=O)NCC(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O PBLLTSKBTAHDNA-KBPBESRZSA-N 0.000 description 1
- KZJQUYFDSCFSCO-DLOVCJGASA-N Lys-His-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CCCCN)N KZJQUYFDSCFSCO-DLOVCJGASA-N 0.000 description 1
- DAOSYIZXRCOKII-SRVKXCTJSA-N Lys-His-Asp Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(O)=O)C(O)=O DAOSYIZXRCOKII-SRVKXCTJSA-N 0.000 description 1
- GTAXSKOXPIISBW-AVGNSLFASA-N Lys-His-Gln Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CCCCN)N GTAXSKOXPIISBW-AVGNSLFASA-N 0.000 description 1
- WOEDRPCHKPSFDT-MXAVVETBSA-N Lys-His-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CCCCN)N WOEDRPCHKPSFDT-MXAVVETBSA-N 0.000 description 1
- ZMMDPRTXLAEMOD-BZSNNMDCSA-N Lys-His-Phe Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O ZMMDPRTXLAEMOD-BZSNNMDCSA-N 0.000 description 1
- CTBMEDOQJFGNMI-IHPCNDPISA-N Lys-His-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CC3=CN=CN3)NC(=O)[C@H](CCCCN)N CTBMEDOQJFGNMI-IHPCNDPISA-N 0.000 description 1
- OIYWBDBHEGAVST-BZSNNMDCSA-N Lys-His-Tyr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O OIYWBDBHEGAVST-BZSNNMDCSA-N 0.000 description 1
- XDPLZVNMYQOFQZ-BJDJZHNGSA-N Lys-Ile-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCCCN)N XDPLZVNMYQOFQZ-BJDJZHNGSA-N 0.000 description 1
- JYXBNQOKPRQNQS-YTFOTSKYSA-N Lys-Ile-Ile Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JYXBNQOKPRQNQS-YTFOTSKYSA-N 0.000 description 1
- NCZIQZYZPUPMKY-PPCPHDFISA-N Lys-Ile-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O NCZIQZYZPUPMKY-PPCPHDFISA-N 0.000 description 1
- XREQQOATSMMAJP-MGHWNKPDSA-N Lys-Ile-Tyr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O XREQQOATSMMAJP-MGHWNKPDSA-N 0.000 description 1
- OVAOHZIOUBEQCJ-IHRRRGAJSA-N Lys-Leu-Arg Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O OVAOHZIOUBEQCJ-IHRRRGAJSA-N 0.000 description 1
- ONPDTSFZAIWMDI-AVGNSLFASA-N Lys-Leu-Gln Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O ONPDTSFZAIWMDI-AVGNSLFASA-N 0.000 description 1
- VMTYLUGCXIEDMV-QWRGUYRKSA-N Lys-Leu-Gly Chemical compound OC(=O)CNC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CCCCN VMTYLUGCXIEDMV-QWRGUYRKSA-N 0.000 description 1
- WVJNGSFKBKOKRV-AJNGGQMLSA-N Lys-Leu-Ile Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WVJNGSFKBKOKRV-AJNGGQMLSA-N 0.000 description 1
- XIZQPFCRXLUNMK-BZSNNMDCSA-N Lys-Leu-Phe Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CCCCN)N XIZQPFCRXLUNMK-BZSNNMDCSA-N 0.000 description 1
- OIQSIMFSVLLWBX-VOAKCMCISA-N Lys-Leu-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OIQSIMFSVLLWBX-VOAKCMCISA-N 0.000 description 1
- XOQMURBBIXRRCR-SRVKXCTJSA-N Lys-Lys-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCCCN XOQMURBBIXRRCR-SRVKXCTJSA-N 0.000 description 1
- AHFOKDZWPPGJAZ-SRVKXCTJSA-N Lys-Lys-Cys Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CS)C(=O)O)N AHFOKDZWPPGJAZ-SRVKXCTJSA-N 0.000 description 1
- PYFNONMJYNJENN-AVGNSLFASA-N Lys-Lys-Gln Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N PYFNONMJYNJENN-AVGNSLFASA-N 0.000 description 1
- HVAUKHLDSDDROB-KKUMJFAQSA-N Lys-Lys-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O HVAUKHLDSDDROB-KKUMJFAQSA-N 0.000 description 1
- YDDDRTIPNTWGIG-SRVKXCTJSA-N Lys-Lys-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O YDDDRTIPNTWGIG-SRVKXCTJSA-N 0.000 description 1
- QQPSCXKFDSORFT-IHRRRGAJSA-N Lys-Lys-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCCCN QQPSCXKFDSORFT-IHRRRGAJSA-N 0.000 description 1
- VSTNAUBHKQPVJX-IHRRRGAJSA-N Lys-Met-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(O)=O VSTNAUBHKQPVJX-IHRRRGAJSA-N 0.000 description 1
- KFSALEZVQJYHCE-AVGNSLFASA-N Lys-Met-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](CCCCN)N KFSALEZVQJYHCE-AVGNSLFASA-N 0.000 description 1
- JPYPRVHMKRFTAT-KKUMJFAQSA-N Lys-Phe-Cys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCCCN)N JPYPRVHMKRFTAT-KKUMJFAQSA-N 0.000 description 1
- PIXVFCBYEGPZPA-JYJNAYRXSA-N Lys-Phe-Gln Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CCCCN)N PIXVFCBYEGPZPA-JYJNAYRXSA-N 0.000 description 1
- LNMKRJJLEFASGA-BZSNNMDCSA-N Lys-Phe-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O LNMKRJJLEFASGA-BZSNNMDCSA-N 0.000 description 1
- AEIIJFBQVGYVEV-YESZJQIVSA-N Lys-Phe-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)NC(=O)[C@H](CCCCN)N)C(=O)O AEIIJFBQVGYVEV-YESZJQIVSA-N 0.000 description 1
- UDXSLGLHFUBRRM-OEAJRASXSA-N Lys-Phe-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](CCCCN)N)O UDXSLGLHFUBRRM-OEAJRASXSA-N 0.000 description 1
- WLXGMVVHTIUPHE-ULQDDVLXSA-N Lys-Phe-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(O)=O WLXGMVVHTIUPHE-ULQDDVLXSA-N 0.000 description 1
- AFLBTVGQCQLOFJ-AVGNSLFASA-N Lys-Pro-Arg Chemical compound NCCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCN=C(N)N)C(O)=O AFLBTVGQCQLOFJ-AVGNSLFASA-N 0.000 description 1
- WGILOYIKJVQUPT-DCAQKATOSA-N Lys-Pro-Asp Chemical compound [H]N[C@@H](CCCCN)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O WGILOYIKJVQUPT-DCAQKATOSA-N 0.000 description 1
- SVSQSPICRKBMSZ-SRVKXCTJSA-N Lys-Pro-Gln Chemical compound [H]N[C@@H](CCCCN)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O SVSQSPICRKBMSZ-SRVKXCTJSA-N 0.000 description 1
- HYSVGEAWTGPMOA-IHRRRGAJSA-N Lys-Pro-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O HYSVGEAWTGPMOA-IHRRRGAJSA-N 0.000 description 1
- LUTDBHBIHHREDC-IHRRRGAJSA-N Lys-Pro-Lys Chemical compound NCCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(O)=O LUTDBHBIHHREDC-IHRRRGAJSA-N 0.000 description 1
- YTJFXEDRUOQGSP-DCAQKATOSA-N Lys-Pro-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O YTJFXEDRUOQGSP-DCAQKATOSA-N 0.000 description 1
- HKXSZKJMDBHOTG-CIUDSAMLSA-N Lys-Ser-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CCCCN HKXSZKJMDBHOTG-CIUDSAMLSA-N 0.000 description 1
- MGKFCQFVPKOWOL-CIUDSAMLSA-N Lys-Ser-Asp Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(=O)O)C(=O)O)N MGKFCQFVPKOWOL-CIUDSAMLSA-N 0.000 description 1
- SBQDRNOLGSYHQA-YUMQZZPRSA-N Lys-Ser-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)NCC(O)=O SBQDRNOLGSYHQA-YUMQZZPRSA-N 0.000 description 1
- LKDXINHHSWFFJC-SRVKXCTJSA-N Lys-Ser-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCCCN)N LKDXINHHSWFFJC-SRVKXCTJSA-N 0.000 description 1
- SQXZLVXQXWILKW-KKUMJFAQSA-N Lys-Ser-Phe Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O SQXZLVXQXWILKW-KKUMJFAQSA-N 0.000 description 1
- WZVSHTFTCYOFPL-GARJFASQSA-N Lys-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CCCCN)N)C(=O)O WZVSHTFTCYOFPL-GARJFASQSA-N 0.000 description 1
- DIBZLYZXTSVGLN-CIUDSAMLSA-N Lys-Ser-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O DIBZLYZXTSVGLN-CIUDSAMLSA-N 0.000 description 1
- MIFFFXHMAHFACR-KATARQTJSA-N Lys-Ser-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CCCCN MIFFFXHMAHFACR-KATARQTJSA-N 0.000 description 1
- UIJVKVHLCQSPOJ-XIRDDKMYSA-N Lys-Ser-Trp Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](Cc1c[nH]c2ccccc12)C(O)=O UIJVKVHLCQSPOJ-XIRDDKMYSA-N 0.000 description 1
- MEQLGHAMAUPOSJ-DCAQKATOSA-N Lys-Ser-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O MEQLGHAMAUPOSJ-DCAQKATOSA-N 0.000 description 1
- TVOOGUNBIWAURO-KATARQTJSA-N Lys-Thr-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCCCN)N)O TVOOGUNBIWAURO-KATARQTJSA-N 0.000 description 1
- UWHCKWNPWKTMBM-WDCWCFNPSA-N Lys-Thr-Gln Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(O)=O UWHCKWNPWKTMBM-WDCWCFNPSA-N 0.000 description 1
- QVTDVTONTRSQMF-WDCWCFNPSA-N Lys-Thr-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H]([C@H](O)C)NC(=O)[C@@H](N)CCCCN QVTDVTONTRSQMF-WDCWCFNPSA-N 0.000 description 1
- RPWTZTBIFGENIA-VOAKCMCISA-N Lys-Thr-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O RPWTZTBIFGENIA-VOAKCMCISA-N 0.000 description 1
- WAAZECNCPVGPIV-RHYQMDGZSA-N Lys-Thr-Met Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCSC)C(O)=O WAAZECNCPVGPIV-RHYQMDGZSA-N 0.000 description 1
- ZVXSESPJMKNIQA-YXMSTPNBSA-N Lys-Thr-Pro-Pro Chemical compound NCCCC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 ZVXSESPJMKNIQA-YXMSTPNBSA-N 0.000 description 1
- RMOKGALPSPOYKE-KATARQTJSA-N Lys-Thr-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O RMOKGALPSPOYKE-KATARQTJSA-N 0.000 description 1
- YFQSSOAGMZGXFT-MEYUZBJRSA-N Lys-Thr-Tyr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O YFQSSOAGMZGXFT-MEYUZBJRSA-N 0.000 description 1
- KTINOHQFVVCEGQ-XIRDDKMYSA-N Lys-Trp-Asp Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](Cc1c[nH]c2ccccc12)C(=O)N[C@@H](CC(O)=O)C(O)=O KTINOHQFVVCEGQ-XIRDDKMYSA-N 0.000 description 1
- CFOLERIRBUAYAD-HOCLYGCPSA-N Lys-Trp-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)NCC(O)=O CFOLERIRBUAYAD-HOCLYGCPSA-N 0.000 description 1
- KXYLFJIQDIMURW-IHPCNDPISA-N Lys-Trp-Leu Chemical compound C1=CC=C2C(C[C@@H](C(=O)N[C@@H](CC(C)C)C(O)=O)NC(=O)[C@@H](N)CCCCN)=CNC2=C1 KXYLFJIQDIMURW-IHPCNDPISA-N 0.000 description 1
- AWMMBHDKERMOID-YTQUADARSA-N Lys-Trp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CNC3=CC=CC=C32)NC(=O)[C@H](CCCCN)N)C(=O)O AWMMBHDKERMOID-YTQUADARSA-N 0.000 description 1
- PELXPRPDQRFBGQ-KKUMJFAQSA-N Lys-Tyr-Asn Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCCCN)N)O PELXPRPDQRFBGQ-KKUMJFAQSA-N 0.000 description 1
- XATKLFSXFINPSB-JYJNAYRXSA-N Lys-Tyr-Gln Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O XATKLFSXFINPSB-JYJNAYRXSA-N 0.000 description 1
- MIMXMVDLMDMOJD-BZSNNMDCSA-N Lys-Tyr-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(O)=O MIMXMVDLMDMOJD-BZSNNMDCSA-N 0.000 description 1
- LMMBAXJRYSXCOQ-ACRUOGEOSA-N Lys-Tyr-Phe Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](Cc1ccc(O)cc1)C(=O)N[C@@H](Cc1ccccc1)C(O)=O LMMBAXJRYSXCOQ-ACRUOGEOSA-N 0.000 description 1
- IEIHKHYMBIYQTH-YESZJQIVSA-N Lys-Tyr-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=C(C=C2)O)NC(=O)[C@H](CCCCN)N)C(=O)O IEIHKHYMBIYQTH-YESZJQIVSA-N 0.000 description 1
- PPNCMJARTHYNEC-MEYUZBJRSA-N Lys-Tyr-Thr Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H]([C@H](O)C)C(O)=O)CC1=CC=C(O)C=C1 PPNCMJARTHYNEC-MEYUZBJRSA-N 0.000 description 1
- FPQMQEOVSKMVMA-ACRUOGEOSA-N Lys-Tyr-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O)NC(=O)[C@H](CCCCN)N)O FPQMQEOVSKMVMA-ACRUOGEOSA-N 0.000 description 1
- OHXUUQDOBQKSNB-AVGNSLFASA-N Lys-Val-Arg Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O OHXUUQDOBQKSNB-AVGNSLFASA-N 0.000 description 1
- MDDUIRLQCYVRDO-NHCYSSNCSA-N Lys-Val-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCCN MDDUIRLQCYVRDO-NHCYSSNCSA-N 0.000 description 1
- QLFAPXUXEBAWEK-NHCYSSNCSA-N Lys-Val-Asp Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O QLFAPXUXEBAWEK-NHCYSSNCSA-N 0.000 description 1
- UGCIQUYEJIEHKX-GVXVVHGQSA-N Lys-Val-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O UGCIQUYEJIEHKX-GVXVVHGQSA-N 0.000 description 1
- TXTZMVNJIRZABH-ULQDDVLXSA-N Lys-Val-Phe Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 TXTZMVNJIRZABH-ULQDDVLXSA-N 0.000 description 1
- RIPJMCFGQHGHNP-RHYQMDGZSA-N Lys-Val-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CCCCN)N)O RIPJMCFGQHGHNP-RHYQMDGZSA-N 0.000 description 1
- 101710085938 Matrix protein Proteins 0.000 description 1
- 101710127721 Membrane protein Proteins 0.000 description 1
- VHGIWFGJIHTASW-FXQIFTODSA-N Met-Ala-Asp Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC(O)=O VHGIWFGJIHTASW-FXQIFTODSA-N 0.000 description 1
- QRHWTCJBCLGYRB-FXQIFTODSA-N Met-Ala-Cys Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CS)C(O)=O QRHWTCJBCLGYRB-FXQIFTODSA-N 0.000 description 1
- ONGCSGVHCSAATF-CIUDSAMLSA-N Met-Ala-Glu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCC(O)=O ONGCSGVHCSAATF-CIUDSAMLSA-N 0.000 description 1
- MUYQDMBLDFEVRJ-LSJOCFKGSA-N Met-Ala-His Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CNC=N1 MUYQDMBLDFEVRJ-LSJOCFKGSA-N 0.000 description 1
- QGQGAIBGTUJRBR-NAKRPEOUSA-N Met-Ala-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCSC QGQGAIBGTUJRBR-NAKRPEOUSA-N 0.000 description 1
- QEVRUYFHWJJUHZ-DCAQKATOSA-N Met-Ala-Leu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC(C)C QEVRUYFHWJJUHZ-DCAQKATOSA-N 0.000 description 1
- QAHFGYLFLVGBNW-DCAQKATOSA-N Met-Ala-Lys Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCCN QAHFGYLFLVGBNW-DCAQKATOSA-N 0.000 description 1
- BVXXDMUMHMXFER-BPNCWPANSA-N Met-Ala-Tyr Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O BVXXDMUMHMXFER-BPNCWPANSA-N 0.000 description 1
- DTICLBJHRYSJLH-GUBZILKMSA-N Met-Ala-Val Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O DTICLBJHRYSJLH-GUBZILKMSA-N 0.000 description 1
- AHZNUGRZHMZGFL-GUBZILKMSA-N Met-Arg-Ser Chemical compound CSCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CO)C(O)=O)CCCNC(N)=N AHZNUGRZHMZGFL-GUBZILKMSA-N 0.000 description 1
- DBOMZJOESVYERT-GUBZILKMSA-N Met-Asn-Met Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCSC)C(=O)O)N DBOMZJOESVYERT-GUBZILKMSA-N 0.000 description 1
- DZTDEZSHBVRUCQ-FXQIFTODSA-N Met-Asp-Cys Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CS)C(=O)O)N DZTDEZSHBVRUCQ-FXQIFTODSA-N 0.000 description 1
- FJVJLMZUIGMFFU-BQBZGAKWSA-N Met-Asp-Gly Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O FJVJLMZUIGMFFU-BQBZGAKWSA-N 0.000 description 1
- MCNGIXXCMJAURZ-VEVYYDQMSA-N Met-Asp-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCSC)N)O MCNGIXXCMJAURZ-VEVYYDQMSA-N 0.000 description 1
- TWTNGJMBFRTKEX-FXQIFTODSA-N Met-Cys-Cys Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@@H](CS)C(O)=O TWTNGJMBFRTKEX-FXQIFTODSA-N 0.000 description 1
- IZLCDZDNZFEDHB-DCAQKATOSA-N Met-Cys-Lys Chemical compound CSCC[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CCCCN)C(=O)O)N IZLCDZDNZFEDHB-DCAQKATOSA-N 0.000 description 1
- UJDMTKHGWSBHBX-IHRRRGAJSA-N Met-Cys-Phe Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 UJDMTKHGWSBHBX-IHRRRGAJSA-N 0.000 description 1
- JYCQGAGDJQYEDB-GUBZILKMSA-N Met-Gln-Gln Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O JYCQGAGDJQYEDB-GUBZILKMSA-N 0.000 description 1
- UOENBSHXYCHSAU-YUMQZZPRSA-N Met-Gln-Gly Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O UOENBSHXYCHSAU-YUMQZZPRSA-N 0.000 description 1
- AWOMRHGUWFBDNU-ZPFDUUQYSA-N Met-Gln-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCSC)N AWOMRHGUWFBDNU-ZPFDUUQYSA-N 0.000 description 1
- KLFPZIUIXZNEKY-DCAQKATOSA-N Met-Gln-Met Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCSC)C(O)=O KLFPZIUIXZNEKY-DCAQKATOSA-N 0.000 description 1
- RZJOHSFAEZBWLK-CIUDSAMLSA-N Met-Gln-Ser Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CO)C(=O)O)N RZJOHSFAEZBWLK-CIUDSAMLSA-N 0.000 description 1
- YORIKIDJCPKBON-YUMQZZPRSA-N Met-Glu-Gly Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O YORIKIDJCPKBON-YUMQZZPRSA-N 0.000 description 1
- CHQWUYSNAOABIP-ZPFDUUQYSA-N Met-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCSC)N CHQWUYSNAOABIP-ZPFDUUQYSA-N 0.000 description 1
- OGAZPKJHHZPYFK-GARJFASQSA-N Met-Glu-Pro Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N OGAZPKJHHZPYFK-GARJFASQSA-N 0.000 description 1
- BCRQJDMZQUHQSV-STQMWFEESA-N Met-Gly-Tyr Chemical compound [H]N[C@@H](CCSC)C(=O)NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O BCRQJDMZQUHQSV-STQMWFEESA-N 0.000 description 1
- OBCRZLRPJFNLAN-DCAQKATOSA-N Met-His-Asp Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(O)=O)C(O)=O OBCRZLRPJFNLAN-DCAQKATOSA-N 0.000 description 1
- CFRRIZLGFGJEDB-SRVKXCTJSA-N Met-His-Gln Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(N)=O)C(O)=O CFRRIZLGFGJEDB-SRVKXCTJSA-N 0.000 description 1
- TZHFJXDKXGZHEN-IHRRRGAJSA-N Met-His-Leu Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(O)=O TZHFJXDKXGZHEN-IHRRRGAJSA-N 0.000 description 1
- ABHVWYPPHDYFNY-WDSOQIARSA-N Met-His-Trp Chemical compound C([C@H](NC(=O)[C@@H](N)CCSC)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(O)=O)C1=CN=CN1 ABHVWYPPHDYFNY-WDSOQIARSA-N 0.000 description 1
- MVMNUCOHQGYYKB-PEDHHIEDSA-N Met-Ile-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)NC(=O)[C@H](CCSC)N MVMNUCOHQGYYKB-PEDHHIEDSA-N 0.000 description 1
- HWROAFGWPQUPTE-OSUNSFLBSA-N Met-Ile-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](CCSC)N HWROAFGWPQUPTE-OSUNSFLBSA-N 0.000 description 1
- AFFKUNVPPLQUGA-DCAQKATOSA-N Met-Leu-Ala Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O AFFKUNVPPLQUGA-DCAQKATOSA-N 0.000 description 1
- UROWNMBTQGGTHB-DCAQKATOSA-N Met-Leu-Asp Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O UROWNMBTQGGTHB-DCAQKATOSA-N 0.000 description 1
- RBGLBUDVQVPTEG-DCAQKATOSA-N Met-Leu-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCSC)N RBGLBUDVQVPTEG-DCAQKATOSA-N 0.000 description 1
- HZVXPUHLTZRQEL-UWVGGRQHSA-N Met-Leu-Gly Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O HZVXPUHLTZRQEL-UWVGGRQHSA-N 0.000 description 1
- CHDYFPCQVUOJEB-ULQDDVLXSA-N Met-Leu-Phe Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 CHDYFPCQVUOJEB-ULQDDVLXSA-N 0.000 description 1
- DBXMFHGGHMXYHY-DCAQKATOSA-N Met-Leu-Ser Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O DBXMFHGGHMXYHY-DCAQKATOSA-N 0.000 description 1
- XDGFFEZAZHRZFR-RHYQMDGZSA-N Met-Leu-Thr Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XDGFFEZAZHRZFR-RHYQMDGZSA-N 0.000 description 1
- HOZNVKDCKZPRER-XUXIUFHCSA-N Met-Lys-Ile Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O HOZNVKDCKZPRER-XUXIUFHCSA-N 0.000 description 1
- ZRACLHJYVRBJFC-ULQDDVLXSA-N Met-Lys-Phe Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 ZRACLHJYVRBJFC-ULQDDVLXSA-N 0.000 description 1
- LLKWSEXLNFBKIF-CYDGBPFRSA-N Met-Met-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CCSC)NC(=O)[C@@H](N)CCSC LLKWSEXLNFBKIF-CYDGBPFRSA-N 0.000 description 1
- XGIQKEAKUSPCBU-SRVKXCTJSA-N Met-Met-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](CCSC)N XGIQKEAKUSPCBU-SRVKXCTJSA-N 0.000 description 1
- MIAZEQZXAFTCCG-UBHSHLNASA-N Met-Phe-Ala Chemical compound CSCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C)C(O)=O)CC1=CC=CC=C1 MIAZEQZXAFTCCG-UBHSHLNASA-N 0.000 description 1
- IILAGWCGKJSBGB-IHRRRGAJSA-N Met-Phe-Asp Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(=O)O)C(=O)O)N IILAGWCGKJSBGB-IHRRRGAJSA-N 0.000 description 1
- SJLPOVNXMJFKHJ-ULQDDVLXSA-N Met-Phe-His Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N SJLPOVNXMJFKHJ-ULQDDVLXSA-N 0.000 description 1
- OIFHHODAXVWKJN-ULQDDVLXSA-N Met-Phe-Leu Chemical compound CSCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C(O)=O)CC1=CC=CC=C1 OIFHHODAXVWKJN-ULQDDVLXSA-N 0.000 description 1
- WNJXJJSGUXAIQU-UFYCRDLUSA-N Met-Phe-Phe Chemical compound C([C@H](NC(=O)[C@@H](N)CCSC)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 WNJXJJSGUXAIQU-UFYCRDLUSA-N 0.000 description 1
- HUURTRNKPBHHKZ-JYJNAYRXSA-N Met-Phe-Val Chemical compound CSCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C(C)C)C(O)=O)CC1=CC=CC=C1 HUURTRNKPBHHKZ-JYJNAYRXSA-N 0.000 description 1
- MQASRXPTQJJNFM-JYJNAYRXSA-N Met-Pro-Phe Chemical compound CSCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 MQASRXPTQJJNFM-JYJNAYRXSA-N 0.000 description 1
- CIDICGYKRUTYLE-FXQIFTODSA-N Met-Ser-Ala Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O CIDICGYKRUTYLE-FXQIFTODSA-N 0.000 description 1
- XPVCDCMPKCERFT-GUBZILKMSA-N Met-Ser-Arg Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O XPVCDCMPKCERFT-GUBZILKMSA-N 0.000 description 1
- RDLSEGZJMYGFNS-FXQIFTODSA-N Met-Ser-Asp Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O RDLSEGZJMYGFNS-FXQIFTODSA-N 0.000 description 1
- LXCSZPUQKMTXNW-BQBZGAKWSA-N Met-Ser-Gly Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O LXCSZPUQKMTXNW-BQBZGAKWSA-N 0.000 description 1
- HLZORBMOISUNIV-DCAQKATOSA-N Met-Ser-Leu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC(C)C HLZORBMOISUNIV-DCAQKATOSA-N 0.000 description 1
- FIZZULTXMVEIAA-IHRRRGAJSA-N Met-Ser-Phe Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 FIZZULTXMVEIAA-IHRRRGAJSA-N 0.000 description 1
- MIXPUVSPPOWTCR-FXQIFTODSA-N Met-Ser-Ser Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O MIXPUVSPPOWTCR-FXQIFTODSA-N 0.000 description 1
- GGXZOTSDJJTDGB-GUBZILKMSA-N Met-Ser-Val Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O GGXZOTSDJJTDGB-GUBZILKMSA-N 0.000 description 1
- CIIJWIAORKTXAH-FJXKBIBVSA-N Met-Thr-Gly Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O CIIJWIAORKTXAH-FJXKBIBVSA-N 0.000 description 1
- YIGCDRZMZNDENK-UNQGMJICSA-N Met-Thr-Phe Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O YIGCDRZMZNDENK-UNQGMJICSA-N 0.000 description 1
- NSMXRFMGZYTFEX-KJEVXHAQSA-N Met-Thr-Tyr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)[C@H](CCSC)N)O NSMXRFMGZYTFEX-KJEVXHAQSA-N 0.000 description 1
- WYNIRYZIFZGWQD-BPUTZDHNSA-N Met-Trp-Asn Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CC(=O)N)C(=O)O)N WYNIRYZIFZGWQD-BPUTZDHNSA-N 0.000 description 1
- CONKYWFMLIMRLU-BVSLBCMMSA-N Met-Trp-Tyr Chemical compound C([C@H](NC(=O)[C@H](CC=1C2=CC=CC=C2NC=1)NC(=O)[C@@H](N)CCSC)C(O)=O)C1=CC=C(O)C=C1 CONKYWFMLIMRLU-BVSLBCMMSA-N 0.000 description 1
- CULGJGUDIJATIP-STQMWFEESA-N Met-Tyr-Gly Chemical compound CSCC[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CC=C(O)C=C1 CULGJGUDIJATIP-STQMWFEESA-N 0.000 description 1
- LIIXIZKVWNYQHB-STECZYCISA-N Met-Tyr-Ile Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O LIIXIZKVWNYQHB-STECZYCISA-N 0.000 description 1
- ANCPZNHGZUCSSC-ULQDDVLXSA-N Met-Tyr-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CCSC)CC1=CC=C(O)C=C1 ANCPZNHGZUCSSC-ULQDDVLXSA-N 0.000 description 1
- PNHRPOWKRRJATF-IHRRRGAJSA-N Met-Tyr-Ser Chemical compound CSCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CO)C(O)=O)CC1=CC=C(O)C=C1 PNHRPOWKRRJATF-IHRRRGAJSA-N 0.000 description 1
- MUDYEFAKNSTFAI-JYJNAYRXSA-N Met-Tyr-Val Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(O)=O MUDYEFAKNSTFAI-JYJNAYRXSA-N 0.000 description 1
- YGNUDKAPJARTEM-GUBZILKMSA-N Met-Val-Ala Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O YGNUDKAPJARTEM-GUBZILKMSA-N 0.000 description 1
- FSTWDRPCQQUJIT-NHCYSSNCSA-N Met-Val-Glu Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CCSC)N FSTWDRPCQQUJIT-NHCYSSNCSA-N 0.000 description 1
- PVSPJQWHEIQTEH-JYJNAYRXSA-N Met-Val-Tyr Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 PVSPJQWHEIQTEH-JYJNAYRXSA-N 0.000 description 1
- 102100032965 Myomesin-2 Human genes 0.000 description 1
- 108010066427 N-valyltryptophan Proteins 0.000 description 1
- 108010065395 Neuropep-1 Proteins 0.000 description 1
- 239000005662 Paraffin oil Substances 0.000 description 1
- 208000037273 Pathologic Processes Diseases 0.000 description 1
- 235000019483 Peanut oil Nutrition 0.000 description 1
- BJEYSVHMGIJORT-NHCYSSNCSA-N Phe-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=CC=C1 BJEYSVHMGIJORT-NHCYSSNCSA-N 0.000 description 1
- LSXGADJXBDFXQU-DLOVCJGASA-N Phe-Ala-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=CC=C1 LSXGADJXBDFXQU-DLOVCJGASA-N 0.000 description 1
- CYZBFPYMSJGBRL-DRZSPHRISA-N Phe-Ala-Glu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O CYZBFPYMSJGBRL-DRZSPHRISA-N 0.000 description 1
- BBDSZDHUCPSYAC-QEJZJMRPSA-N Phe-Ala-Leu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O BBDSZDHUCPSYAC-QEJZJMRPSA-N 0.000 description 1
- JNRFYJZCMHHGMH-UBHSHLNASA-N Phe-Ala-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=CC=C1 JNRFYJZCMHHGMH-UBHSHLNASA-N 0.000 description 1
- UHRNIXJAGGLKHP-DLOVCJGASA-N Phe-Ala-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O UHRNIXJAGGLKHP-DLOVCJGASA-N 0.000 description 1
- BKWJQWJPZMUWEG-LFSVMHDDSA-N Phe-Ala-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=CC=C1 BKWJQWJPZMUWEG-LFSVMHDDSA-N 0.000 description 1
- SEPNOAFMZLLCEW-UBHSHLNASA-N Phe-Ala-Val Chemical compound N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)O SEPNOAFMZLLCEW-UBHSHLNASA-N 0.000 description 1
- AYPMIIKUMNADSU-IHRRRGAJSA-N Phe-Arg-Asn Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(O)=O AYPMIIKUMNADSU-IHRRRGAJSA-N 0.000 description 1
- CGOMLCQJEMWMCE-STQMWFEESA-N Phe-Arg-Gly Chemical compound NC(N)=NCCC[C@@H](C(=O)NCC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 CGOMLCQJEMWMCE-STQMWFEESA-N 0.000 description 1
- LJUUGSWZPQOJKD-JYJNAYRXSA-N Phe-Arg-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CCCNC(N)=N)NC(=O)[C@@H](N)Cc1ccccc1)C(O)=O LJUUGSWZPQOJKD-JYJNAYRXSA-N 0.000 description 1
- QCHNRQQVLJYDSI-DLOVCJGASA-N Phe-Asn-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 QCHNRQQVLJYDSI-DLOVCJGASA-N 0.000 description 1
- HCTXJGRYAACKOB-SRVKXCTJSA-N Phe-Asn-Asp Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N HCTXJGRYAACKOB-SRVKXCTJSA-N 0.000 description 1
- KIAWKQJTSGRCSA-AVGNSLFASA-N Phe-Asn-Glu Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N KIAWKQJTSGRCSA-AVGNSLFASA-N 0.000 description 1
- HHOOEUSPFGPZFP-QWRGUYRKSA-N Phe-Asn-Gly Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O HHOOEUSPFGPZFP-QWRGUYRKSA-N 0.000 description 1
- KAHUBGWSIQNZQQ-KKUMJFAQSA-N Phe-Asn-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 KAHUBGWSIQNZQQ-KKUMJFAQSA-N 0.000 description 1
- UEEVBGHEGJMDDV-AVGNSLFASA-N Phe-Asp-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 UEEVBGHEGJMDDV-AVGNSLFASA-N 0.000 description 1
- DDYIRGBOZVKRFR-AVGNSLFASA-N Phe-Asp-Glu Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N DDYIRGBOZVKRFR-AVGNSLFASA-N 0.000 description 1
- WIVCOAKLPICYGY-KKUMJFAQSA-N Phe-Asp-Lys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N WIVCOAKLPICYGY-KKUMJFAQSA-N 0.000 description 1
- IQXOZIDWLZYYAW-IHRRRGAJSA-N Phe-Asp-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N IQXOZIDWLZYYAW-IHRRRGAJSA-N 0.000 description 1
- OJUMUUXGSXUZJZ-SRVKXCTJSA-N Phe-Asp-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O OJUMUUXGSXUZJZ-SRVKXCTJSA-N 0.000 description 1
- CUMXHKAOHNWRFQ-BZSNNMDCSA-N Phe-Asp-Tyr Chemical compound C([C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=CC=C1 CUMXHKAOHNWRFQ-BZSNNMDCSA-N 0.000 description 1
- OMHMIXFFRPMYHB-SRVKXCTJSA-N Phe-Cys-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(=O)N)C(=O)O)N OMHMIXFFRPMYHB-SRVKXCTJSA-N 0.000 description 1
- CPTJPDZTFNKFOU-MXAVVETBSA-N Phe-Cys-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CC1=CC=CC=C1)N CPTJPDZTFNKFOU-MXAVVETBSA-N 0.000 description 1
- VJEZWOSKRCLHRP-MELADBBJSA-N Phe-Cys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CS)NC(=O)[C@H](CC2=CC=CC=C2)N)C(=O)O VJEZWOSKRCLHRP-MELADBBJSA-N 0.000 description 1
- WFDAEEUZPZSMOG-SRVKXCTJSA-N Phe-Cys-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CS)C(=O)N[C@@H](CO)C(O)=O WFDAEEUZPZSMOG-SRVKXCTJSA-N 0.000 description 1
- OWCLJDXHHZUNEL-IHRRRGAJSA-N Phe-Cys-Val Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CS)C(=O)N[C@@H](C(C)C)C(O)=O OWCLJDXHHZUNEL-IHRRRGAJSA-N 0.000 description 1
- UMKYAYXCMYYNHI-AVGNSLFASA-N Phe-Gln-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC(=O)N)C(=O)O)N UMKYAYXCMYYNHI-AVGNSLFASA-N 0.000 description 1
- IDUCUXTUHHIQIP-SOUVJXGZSA-N Phe-Gln-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC2=CC=CC=C2)N)C(=O)O IDUCUXTUHHIQIP-SOUVJXGZSA-N 0.000 description 1
- OPEVYHFJXLCCRT-AVGNSLFASA-N Phe-Gln-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O OPEVYHFJXLCCRT-AVGNSLFASA-N 0.000 description 1
- NKLDZIPTGKBDBB-HTUGSXCWSA-N Phe-Gln-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC1=CC=CC=C1)N)O NKLDZIPTGKBDBB-HTUGSXCWSA-N 0.000 description 1
- MFQXSDWKUXTOPZ-DZKIICNBSA-N Phe-Gln-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC1=CC=CC=C1)N MFQXSDWKUXTOPZ-DZKIICNBSA-N 0.000 description 1
- HOYQLNNGMHXZDW-KKUMJFAQSA-N Phe-Glu-Arg Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O HOYQLNNGMHXZDW-KKUMJFAQSA-N 0.000 description 1
- UEADQPLTYBWWTG-AVGNSLFASA-N Phe-Glu-Cys Chemical compound SC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 UEADQPLTYBWWTG-AVGNSLFASA-N 0.000 description 1
- AKJAKCBHLJGRBU-JYJNAYRXSA-N Phe-Glu-His Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N AKJAKCBHLJGRBU-JYJNAYRXSA-N 0.000 description 1
- KJJROSNFBRWPHS-JYJNAYRXSA-N Phe-Glu-Leu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O KJJROSNFBRWPHS-JYJNAYRXSA-N 0.000 description 1
- JWQWPTLEOFNCGX-AVGNSLFASA-N Phe-Glu-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 JWQWPTLEOFNCGX-AVGNSLFASA-N 0.000 description 1
- LWPMGKSZPKFKJD-DZKIICNBSA-N Phe-Glu-Val Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O LWPMGKSZPKFKJD-DZKIICNBSA-N 0.000 description 1
- APJPXSFJBMMOLW-KBPBESRZSA-N Phe-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC1=CC=CC=C1 APJPXSFJBMMOLW-KBPBESRZSA-N 0.000 description 1
- NPLGQVKZFGJWAI-QWHCGFSZSA-N Phe-Gly-Pro Chemical compound C1C[C@@H](N(C1)C(=O)CNC(=O)[C@H](CC2=CC=CC=C2)N)C(=O)O NPLGQVKZFGJWAI-QWHCGFSZSA-N 0.000 description 1
- BIYWZVCPZIFGPY-QWRGUYRKSA-N Phe-Gly-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)NCC(=O)N[C@@H](CO)C(O)=O BIYWZVCPZIFGPY-QWRGUYRKSA-N 0.000 description 1
- WFHRXJOZEXUKLV-IRXDYDNUSA-N Phe-Gly-Tyr Chemical compound C([C@H](N)C(=O)NCC(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=CC=C1 WFHRXJOZEXUKLV-IRXDYDNUSA-N 0.000 description 1
- PMKIMKUGCSVFSV-CQDKDKBSSA-N Phe-His-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CC2=CC=CC=C2)N PMKIMKUGCSVFSV-CQDKDKBSSA-N 0.000 description 1
- BVHFFNYBKRTSIU-MEYUZBJRSA-N Phe-His-Thr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O BVHFFNYBKRTSIU-MEYUZBJRSA-N 0.000 description 1
- SPXWRYVHOZVYBU-ULQDDVLXSA-N Phe-His-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CC2=CC=CC=C2)N SPXWRYVHOZVYBU-ULQDDVLXSA-N 0.000 description 1
- FXPZZKBHNOMLGA-HJWJTTGWSA-N Phe-Ile-Arg Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N FXPZZKBHNOMLGA-HJWJTTGWSA-N 0.000 description 1
- KRYSMKKRRRWOCZ-QEWYBTABSA-N Phe-Ile-Glu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(O)=O KRYSMKKRRRWOCZ-QEWYBTABSA-N 0.000 description 1
- DVOCGBNHAUHKHJ-DKIMLUQUSA-N Phe-Ile-Leu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O DVOCGBNHAUHKHJ-DKIMLUQUSA-N 0.000 description 1
- WEMYTDDMDBLPMI-DKIMLUQUSA-N Phe-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N WEMYTDDMDBLPMI-DKIMLUQUSA-N 0.000 description 1
- BYAIIACBWBOJCU-URLPEUOOSA-N Phe-Ile-Thr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O BYAIIACBWBOJCU-URLPEUOOSA-N 0.000 description 1
- JQLQUPIYYJXZLJ-ZEWNOJEFSA-N Phe-Ile-Tyr Chemical compound C([C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=CC=C1 JQLQUPIYYJXZLJ-ZEWNOJEFSA-N 0.000 description 1
- RORUIHAWOLADSH-HJWJTTGWSA-N Phe-Ile-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CC1=CC=CC=C1 RORUIHAWOLADSH-HJWJTTGWSA-N 0.000 description 1
- TXKWKTWYTIAZSV-KKUMJFAQSA-N Phe-Leu-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N TXKWKTWYTIAZSV-KKUMJFAQSA-N 0.000 description 1
- SMFGCTXUBWEPKM-KBPBESRZSA-N Phe-Leu-Gly Chemical compound OC(=O)CNC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 SMFGCTXUBWEPKM-KBPBESRZSA-N 0.000 description 1
- METZZBCMDXHFMK-BZSNNMDCSA-N Phe-Leu-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC2=CC=CC=C2)N METZZBCMDXHFMK-BZSNNMDCSA-N 0.000 description 1
- KZRQONDKKJCAOL-DKIMLUQUSA-N Phe-Leu-Ile Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KZRQONDKKJCAOL-DKIMLUQUSA-N 0.000 description 1
- YTILBRIUASDGBL-BZSNNMDCSA-N Phe-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 YTILBRIUASDGBL-BZSNNMDCSA-N 0.000 description 1
- KPEIBEPEUAZWNS-ULQDDVLXSA-N Phe-Leu-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 KPEIBEPEUAZWNS-ULQDDVLXSA-N 0.000 description 1
- OSBADCBXAMSPQD-YESZJQIVSA-N Phe-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CC=CC=C2)N OSBADCBXAMSPQD-YESZJQIVSA-N 0.000 description 1
- KNYPNEYICHHLQL-ACRUOGEOSA-N Phe-Leu-Tyr Chemical compound C([C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=CC=C1 KNYPNEYICHHLQL-ACRUOGEOSA-N 0.000 description 1
- INHMISZWLJZQGH-ULQDDVLXSA-N Phe-Leu-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 INHMISZWLJZQGH-ULQDDVLXSA-N 0.000 description 1
- DNAXXTQSTKOHFO-QEJZJMRPSA-N Phe-Lys-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC1=CC=CC=C1 DNAXXTQSTKOHFO-QEJZJMRPSA-N 0.000 description 1
- CJAHQEZWDZNSJO-KKUMJFAQSA-N Phe-Lys-Cys Chemical compound NCCCC[C@@H](C(=O)N[C@@H](CS)C(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 CJAHQEZWDZNSJO-KKUMJFAQSA-N 0.000 description 1
- WLYPRKLMRIYGPP-JYJNAYRXSA-N Phe-Lys-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC1=CC=CC=C1 WLYPRKLMRIYGPP-JYJNAYRXSA-N 0.000 description 1
- ZIQQNOXKEFDPBE-BZSNNMDCSA-N Phe-Lys-His Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N ZIQQNOXKEFDPBE-BZSNNMDCSA-N 0.000 description 1
- ZUQACJLOHYRVPJ-DKIMLUQUSA-N Phe-Lys-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC1=CC=CC=C1 ZUQACJLOHYRVPJ-DKIMLUQUSA-N 0.000 description 1
- PEFJUUYFEGBXFA-BZSNNMDCSA-N Phe-Lys-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC1=CC=CC=C1 PEFJUUYFEGBXFA-BZSNNMDCSA-N 0.000 description 1
- XZQYIJALMGEUJD-OEAJRASXSA-N Phe-Lys-Thr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XZQYIJALMGEUJD-OEAJRASXSA-N 0.000 description 1
- GPSMLZQVIIYLDK-ULQDDVLXSA-N Phe-Lys-Val Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O GPSMLZQVIIYLDK-ULQDDVLXSA-N 0.000 description 1
- OAOLATANIHTNCZ-IHRRRGAJSA-N Phe-Met-Asp Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N OAOLATANIHTNCZ-IHRRRGAJSA-N 0.000 description 1
- FQUUYTNBMIBOHS-IHRRRGAJSA-N Phe-Met-Ser Chemical compound CSCC[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N FQUUYTNBMIBOHS-IHRRRGAJSA-N 0.000 description 1
- OKQQWSNUSQURLI-JYJNAYRXSA-N Phe-Met-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](CC1=CC=CC=C1)N OKQQWSNUSQURLI-JYJNAYRXSA-N 0.000 description 1
- ROOQMPCUFLDOSB-FHWLQOOXSA-N Phe-Phe-Gln Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(=O)N[C@@H](CCC(N)=O)C(O)=O)C1=CC=CC=C1 ROOQMPCUFLDOSB-FHWLQOOXSA-N 0.000 description 1
- IWZRODDWOSIXPZ-IRXDYDNUSA-N Phe-Phe-Gly Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(=O)NCC(O)=O)C1=CC=CC=C1 IWZRODDWOSIXPZ-IRXDYDNUSA-N 0.000 description 1
- XUQNHDIMXVZVFN-UHFFFAOYSA-N Phe-Phe-Ile-Tyr Natural products C=1C=C(O)C=CC=1CC(C(O)=O)NC(=O)C(C(C)CC)NC(=O)C(NC(=O)C(N)CC=1C=CC=CC=1)CC1=CC=CC=C1 XUQNHDIMXVZVFN-UHFFFAOYSA-N 0.000 description 1
- GPLWGAYGROGDEN-BZSNNMDCSA-N Phe-Phe-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O GPLWGAYGROGDEN-BZSNNMDCSA-N 0.000 description 1
- GRVMHFCZUIYNKQ-UFYCRDLUSA-N Phe-Phe-Val Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(O)=O GRVMHFCZUIYNKQ-UFYCRDLUSA-N 0.000 description 1
- RVEVENLSADZUMS-IHRRRGAJSA-N Phe-Pro-Asn Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O RVEVENLSADZUMS-IHRRRGAJSA-N 0.000 description 1
- YVXPUUOTMVBKDO-IHRRRGAJSA-N Phe-Pro-Cys Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)N)C(=O)N[C@@H](CS)C(=O)O YVXPUUOTMVBKDO-IHRRRGAJSA-N 0.000 description 1
- CZQZSMJXFGGBHM-KKUMJFAQSA-N Phe-Pro-Gln Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O CZQZSMJXFGGBHM-KKUMJFAQSA-N 0.000 description 1
- QARPMYDMYVLFMW-KKUMJFAQSA-N Phe-Pro-Glu Chemical compound C([C@H](N)C(=O)N1[C@@H](CCC1)C(=O)N[C@@H](CCC(O)=O)C(O)=O)C1=CC=CC=C1 QARPMYDMYVLFMW-KKUMJFAQSA-N 0.000 description 1
- WWPAHTZOWURIMR-ULQDDVLXSA-N Phe-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CC1=CC=CC=C1 WWPAHTZOWURIMR-ULQDDVLXSA-N 0.000 description 1
- ZVRJWDUPIDMHDN-ULQDDVLXSA-N Phe-Pro-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CC1=CC=CC=C1 ZVRJWDUPIDMHDN-ULQDDVLXSA-N 0.000 description 1
- BSJCSHIAMSGQGN-BVSLBCMMSA-N Phe-Pro-Trp Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)N)C(=O)N[C@@H](CC3=CNC4=CC=CC=C43)C(=O)O BSJCSHIAMSGQGN-BVSLBCMMSA-N 0.000 description 1
- WEDZFLRYSIDIRX-IHRRRGAJSA-N Phe-Ser-Arg Chemical compound NC(=N)NCCC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CC=CC=C1 WEDZFLRYSIDIRX-IHRRRGAJSA-N 0.000 description 1
- BONHGTUEEPIMPM-AVGNSLFASA-N Phe-Ser-Glu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(O)=O BONHGTUEEPIMPM-AVGNSLFASA-N 0.000 description 1
- JXQVYPWVGUOIDV-MXAVVETBSA-N Phe-Ser-Ile Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JXQVYPWVGUOIDV-MXAVVETBSA-N 0.000 description 1
- IPFXYNKCXYGSSV-KKUMJFAQSA-N Phe-Ser-Lys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)O)N IPFXYNKCXYGSSV-KKUMJFAQSA-N 0.000 description 1
- GKRCCTYAGQPMMP-IHRRRGAJSA-N Phe-Ser-Met Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCSC)C(O)=O GKRCCTYAGQPMMP-IHRRRGAJSA-N 0.000 description 1
- MRWOVVNKSXXLRP-IHPCNDPISA-N Phe-Ser-Trp Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O MRWOVVNKSXXLRP-IHPCNDPISA-N 0.000 description 1
- XNMYNGDKJNOKHH-BZSNNMDCSA-N Phe-Ser-Tyr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O XNMYNGDKJNOKHH-BZSNNMDCSA-N 0.000 description 1
- MVIJMIZJPHQGEN-IHRRRGAJSA-N Phe-Ser-Val Chemical compound CC(C)[C@@H](C([O-])=O)NC(=O)[C@H](CO)NC(=O)[C@@H]([NH3+])CC1=CC=CC=C1 MVIJMIZJPHQGEN-IHRRRGAJSA-N 0.000 description 1
- LTAWNJXSRUCFAN-UNQGMJICSA-N Phe-Thr-Arg Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O LTAWNJXSRUCFAN-UNQGMJICSA-N 0.000 description 1
- FGWUALWGCZJQDJ-URLPEUOOSA-N Phe-Thr-Ile Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O FGWUALWGCZJQDJ-URLPEUOOSA-N 0.000 description 1
- YFXXRYFWJFQAFW-JHYOHUSXSA-N Phe-Thr-Thr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N)O YFXXRYFWJFQAFW-JHYOHUSXSA-N 0.000 description 1
- NJONQBYLTANINY-IHPCNDPISA-N Phe-Trp-Asn Chemical compound N[C@@H](Cc1ccccc1)C(=O)N[C@@H](Cc1c[nH]c2ccccc12)C(=O)N[C@@H](CC(N)=O)C(O)=O NJONQBYLTANINY-IHPCNDPISA-N 0.000 description 1
- NWVMQNAELALJFW-RNXOBYDBSA-N Phe-Trp-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 NWVMQNAELALJFW-RNXOBYDBSA-N 0.000 description 1
- QTDBZORPVYTRJU-KKXDTOCCSA-N Phe-Tyr-Ala Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C)C(O)=O QTDBZORPVYTRJU-KKXDTOCCSA-N 0.000 description 1
- QUUCAHIYARMNBL-FHWLQOOXSA-N Phe-Tyr-Gln Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N QUUCAHIYARMNBL-FHWLQOOXSA-N 0.000 description 1
- AGTHXWTYCLLYMC-FHWLQOOXSA-N Phe-Tyr-Glu Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)N[C@@H](CCC(O)=O)C(O)=O)C1=CC=CC=C1 AGTHXWTYCLLYMC-FHWLQOOXSA-N 0.000 description 1
- GCFNFKNPCMBHNT-IRXDYDNUSA-N Phe-Tyr-Gly Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)NCC(=O)O)N GCFNFKNPCMBHNT-IRXDYDNUSA-N 0.000 description 1
- MMPBPRXOFJNCCN-ZEWNOJEFSA-N Phe-Tyr-Ile Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O MMPBPRXOFJNCCN-ZEWNOJEFSA-N 0.000 description 1
- ZYNBEWGJFXTBDU-ACRUOGEOSA-N Phe-Tyr-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CC2=CC=CC=C2)N ZYNBEWGJFXTBDU-ACRUOGEOSA-N 0.000 description 1
- FRMKIPSIZSFTTE-HJOGWXRNSA-N Phe-Tyr-Phe Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O FRMKIPSIZSFTTE-HJOGWXRNSA-N 0.000 description 1
- APMXLWHMIVWLLR-BZSNNMDCSA-N Phe-Tyr-Ser Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)N[C@@H](CO)C(O)=O)C1=CC=CC=C1 APMXLWHMIVWLLR-BZSNNMDCSA-N 0.000 description 1
- ZOGICTVLQDWPER-UFYCRDLUSA-N Phe-Tyr-Val Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(O)=O ZOGICTVLQDWPER-UFYCRDLUSA-N 0.000 description 1
- KUSYCSMTTHSZOA-DZKIICNBSA-N Phe-Val-Gln Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N KUSYCSMTTHSZOA-DZKIICNBSA-N 0.000 description 1
- XALFIVXGQUEGKV-JSGCOSHPSA-N Phe-Val-Gly Chemical compound OC(=O)CNC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 XALFIVXGQUEGKV-JSGCOSHPSA-N 0.000 description 1
- BQMFWUKNOCJDNV-HJWJTTGWSA-N Phe-Val-Ile Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O BQMFWUKNOCJDNV-HJWJTTGWSA-N 0.000 description 1
- JTKGCYOOJLUETJ-ULQDDVLXSA-N Phe-Val-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 JTKGCYOOJLUETJ-ULQDDVLXSA-N 0.000 description 1
- RGMLUHANLDVMPB-ULQDDVLXSA-N Phe-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N RGMLUHANLDVMPB-ULQDDVLXSA-N 0.000 description 1
- VDTYRPWRWRCROL-UFYCRDLUSA-N Phe-Val-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 VDTYRPWRWRCROL-UFYCRDLUSA-N 0.000 description 1
- VIIRRNQMMIHYHQ-XHSDSOJGSA-N Phe-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CC=CC=C2)N VIIRRNQMMIHYHQ-XHSDSOJGSA-N 0.000 description 1
- GAMLAXHLYGLQBJ-UFYCRDLUSA-N Phe-Val-Tyr Chemical compound N[C@H](C(=O)N[C@H](C(=O)N[C@H](C(=O)O)CC1=CC=C(C=C1)O)C(C)C)CC1=CC=CC=C1 GAMLAXHLYGLQBJ-UFYCRDLUSA-N 0.000 description 1
- ISWSIDIOOBJBQZ-UHFFFAOYSA-N Phenol Chemical compound OC1=CC=CC=C1 ISWSIDIOOBJBQZ-UHFFFAOYSA-N 0.000 description 1
- 101100226896 Phomopsis amygdali PaMT gene Proteins 0.000 description 1
- DZZCICYRSZASNF-FXQIFTODSA-N Pro-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1 DZZCICYRSZASNF-FXQIFTODSA-N 0.000 description 1
- DBALDZKOTNSBFM-FXQIFTODSA-N Pro-Ala-Asn Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(O)=O DBALDZKOTNSBFM-FXQIFTODSA-N 0.000 description 1
- AJLVKXCNXIJHDV-CIUDSAMLSA-N Pro-Ala-Gln Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(O)=O AJLVKXCNXIJHDV-CIUDSAMLSA-N 0.000 description 1
- FCCBQBZXIAZNIG-LSJOCFKGSA-N Pro-Ala-His Chemical compound C[C@H](NC(=O)[C@@H]1CCCN1)C(=O)N[C@@H](Cc1cnc[nH]1)C(O)=O FCCBQBZXIAZNIG-LSJOCFKGSA-N 0.000 description 1
- FYQSMXKJYTZYRP-DCAQKATOSA-N Pro-Ala-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1 FYQSMXKJYTZYRP-DCAQKATOSA-N 0.000 description 1
- CGBYDGAJHSOGFQ-LPEHRKFASA-N Pro-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@@H]2CCCN2 CGBYDGAJHSOGFQ-LPEHRKFASA-N 0.000 description 1
- XQLBWXHVZVBNJM-FXQIFTODSA-N Pro-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1 XQLBWXHVZVBNJM-FXQIFTODSA-N 0.000 description 1
- OOLOTUZJUBOMAX-GUBZILKMSA-N Pro-Ala-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O OOLOTUZJUBOMAX-GUBZILKMSA-N 0.000 description 1
- NHDVNAKDACFHPX-GUBZILKMSA-N Pro-Arg-Ala Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(O)=O NHDVNAKDACFHPX-GUBZILKMSA-N 0.000 description 1
- LNLNHXIQPGKRJQ-SRVKXCTJSA-N Pro-Arg-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@@H]1CCCN1 LNLNHXIQPGKRJQ-SRVKXCTJSA-N 0.000 description 1
- OCSACVPBMIYNJE-GUBZILKMSA-N Pro-Arg-Asn Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(O)=O OCSACVPBMIYNJE-GUBZILKMSA-N 0.000 description 1
- BNBBNGZZKQUWCD-IUCAKERBSA-N Pro-Arg-Gly Chemical compound NC(N)=NCCC[C@@H](C(=O)NCC(O)=O)NC(=O)[C@@H]1CCCN1 BNBBNGZZKQUWCD-IUCAKERBSA-N 0.000 description 1
- GRIRJQGZZJVANI-CYDGBPFRSA-N Pro-Arg-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@@H]1CCCN1 GRIRJQGZZJVANI-CYDGBPFRSA-N 0.000 description 1
- VCYJKOLZYPYGJV-AVGNSLFASA-N Pro-Arg-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O VCYJKOLZYPYGJV-AVGNSLFASA-N 0.000 description 1
- XZGWNSIRZIUHHP-SRVKXCTJSA-N Pro-Arg-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@@H]1CCCN1 XZGWNSIRZIUHHP-SRVKXCTJSA-N 0.000 description 1
- ZSKJPKFTPQCPIH-RCWTZXSCSA-N Pro-Arg-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ZSKJPKFTPQCPIH-RCWTZXSCSA-N 0.000 description 1
- WWAQEUOYCYMGHB-FXQIFTODSA-N Pro-Asn-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H]1CCCN1 WWAQEUOYCYMGHB-FXQIFTODSA-N 0.000 description 1
- FUVBEZJCRMHWEM-FXQIFTODSA-N Pro-Asn-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O FUVBEZJCRMHWEM-FXQIFTODSA-N 0.000 description 1
- AHXPYZRZRMQOAU-QXEWZRGKSA-N Pro-Asn-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H]1CCCN1)C(O)=O AHXPYZRZRMQOAU-QXEWZRGKSA-N 0.000 description 1
- WPQKSRHDTMRSJM-CIUDSAMLSA-N Pro-Asp-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H]1CCCN1 WPQKSRHDTMRSJM-CIUDSAMLSA-N 0.000 description 1
- GDXZRWYXJSGWIV-GMOBBJLQSA-N Pro-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@@H]1CCCN1 GDXZRWYXJSGWIV-GMOBBJLQSA-N 0.000 description 1
- XKHCJJPNXFBADI-DCAQKATOSA-N Pro-Asp-Lys Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O XKHCJJPNXFBADI-DCAQKATOSA-N 0.000 description 1
- XUSDDSLCRPUKLP-QXEWZRGKSA-N Pro-Asp-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H]1CCCN1 XUSDDSLCRPUKLP-QXEWZRGKSA-N 0.000 description 1
- GQLOZEMWEBDEAY-NAKRPEOUSA-N Pro-Cys-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CS)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O GQLOZEMWEBDEAY-NAKRPEOUSA-N 0.000 description 1
- OZAPWFHRPINHND-GUBZILKMSA-N Pro-Cys-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CS)C(=O)N[C@@H](C(C)C)C(O)=O OZAPWFHRPINHND-GUBZILKMSA-N 0.000 description 1
- ODPIUQVTULPQEP-CIUDSAMLSA-N Pro-Gln-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@@H]1CCCN1 ODPIUQVTULPQEP-CIUDSAMLSA-N 0.000 description 1
- UPJGUQPLYWTISV-GUBZILKMSA-N Pro-Gln-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O UPJGUQPLYWTISV-GUBZILKMSA-N 0.000 description 1
- HJSCRFZVGXAGNG-SRVKXCTJSA-N Pro-Gln-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H]1CCCN1 HJSCRFZVGXAGNG-SRVKXCTJSA-N 0.000 description 1
- LANQLYHLMYDWJP-SRVKXCTJSA-N Pro-Gln-Lys Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O LANQLYHLMYDWJP-SRVKXCTJSA-N 0.000 description 1
- DIFXZGPHVCIVSQ-CIUDSAMLSA-N Pro-Gln-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O DIFXZGPHVCIVSQ-CIUDSAMLSA-N 0.000 description 1
- LHALYDBUDCWMDY-CIUDSAMLSA-N Pro-Glu-Ala Chemical compound C[C@H](NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H]1CCCN1)C(O)=O LHALYDBUDCWMDY-CIUDSAMLSA-N 0.000 description 1
- VDGTVWFMRXVQCT-GUBZILKMSA-N Pro-Glu-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H]1CCCN1 VDGTVWFMRXVQCT-GUBZILKMSA-N 0.000 description 1
- NMELOOXSGDRBRU-YUMQZZPRSA-N Pro-Glu-Gly Chemical compound OC(=O)CNC(=O)[C@H](CCC(=O)O)NC(=O)[C@@H]1CCCN1 NMELOOXSGDRBRU-YUMQZZPRSA-N 0.000 description 1
- NXEYSLRNNPWCRN-SRVKXCTJSA-N Pro-Glu-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O NXEYSLRNNPWCRN-SRVKXCTJSA-N 0.000 description 1
- UEHYFUCOGHWASA-HJGDQZAQSA-N Pro-Glu-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H]1CCCN1 UEHYFUCOGHWASA-HJGDQZAQSA-N 0.000 description 1
- UUHXBJHVTVGSKM-BQBZGAKWSA-N Pro-Gly-Asn Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O UUHXBJHVTVGSKM-BQBZGAKWSA-N 0.000 description 1
- QNZLIVROMORQFH-BQBZGAKWSA-N Pro-Gly-Cys Chemical compound C1C[C@H](NC1)C(=O)NCC(=O)N[C@@H](CS)C(=O)O QNZLIVROMORQFH-BQBZGAKWSA-N 0.000 description 1
- JMVQDLDPDBXAAX-YUMQZZPRSA-N Pro-Gly-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H]1CCCN1 JMVQDLDPDBXAAX-YUMQZZPRSA-N 0.000 description 1
- VYWNORHENYEQDW-YUMQZZPRSA-N Pro-Gly-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H]1CCCN1 VYWNORHENYEQDW-YUMQZZPRSA-N 0.000 description 1
- HAAQQNHQZBOWFO-LURJTMIESA-N Pro-Gly-Gly Chemical compound OC(=O)CNC(=O)CNC(=O)[C@@H]1CCCN1 HAAQQNHQZBOWFO-LURJTMIESA-N 0.000 description 1
- DXTOOBDIIAJZBJ-BQBZGAKWSA-N Pro-Gly-Ser Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](CO)C(O)=O DXTOOBDIIAJZBJ-BQBZGAKWSA-N 0.000 description 1
- AJCRQOHDLCBHFA-SRVKXCTJSA-N Pro-His-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(O)=O)C(O)=O AJCRQOHDLCBHFA-SRVKXCTJSA-N 0.000 description 1
- VWXGFAIZUQBBBG-UWVGGRQHSA-N Pro-His-Gly Chemical compound C([C@@H](C(=O)NCC(=O)[O-])NC(=O)[C@H]1[NH2+]CCC1)C1=CN=CN1 VWXGFAIZUQBBBG-UWVGGRQHSA-N 0.000 description 1
- XFFIGWGYMUFCCQ-ULQDDVLXSA-N Pro-His-Tyr Chemical compound C1=CC(O)=CC=C1C[C@@H](C([O-])=O)NC(=O)[C@@H](NC(=O)[C@H]1[NH2+]CCC1)CC1=CN=CN1 XFFIGWGYMUFCCQ-ULQDDVLXSA-N 0.000 description 1
- BWCZJGJKOFUUCN-ZPFDUUQYSA-N Pro-Ile-Gln Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(N)=O)C(O)=O BWCZJGJKOFUUCN-ZPFDUUQYSA-N 0.000 description 1
- LNOWDSPAYBWJOR-PEDHHIEDSA-N Pro-Ile-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O LNOWDSPAYBWJOR-PEDHHIEDSA-N 0.000 description 1
- YXHYJEPDKSYPSQ-AVGNSLFASA-N Pro-Leu-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H]1CCCN1 YXHYJEPDKSYPSQ-AVGNSLFASA-N 0.000 description 1
- RUDOLGWDSKQQFF-DCAQKATOSA-N Pro-Leu-Asn Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O RUDOLGWDSKQQFF-DCAQKATOSA-N 0.000 description 1
- GURGCNUWVSDYTP-SRVKXCTJSA-N Pro-Leu-Gln Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O GURGCNUWVSDYTP-SRVKXCTJSA-N 0.000 description 1
- HFNPOYOKIPGAEI-SRVKXCTJSA-N Pro-Leu-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H]1CCCN1 HFNPOYOKIPGAEI-SRVKXCTJSA-N 0.000 description 1
- MRYUJHGPZQNOAD-IHRRRGAJSA-N Pro-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@@H]1CCCN1 MRYUJHGPZQNOAD-IHRRRGAJSA-N 0.000 description 1
- DRKAXLDECUGLFE-ULQDDVLXSA-N Pro-Leu-Phe Chemical compound CC(C)C[C@H](NC(=O)[C@@H]1CCCN1)C(=O)N[C@@H](Cc1ccccc1)C(O)=O DRKAXLDECUGLFE-ULQDDVLXSA-N 0.000 description 1
- SRBFGSGDNNQABI-FHWLQOOXSA-N Pro-Leu-Trp Chemical compound N([C@@H](CC(C)C)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(O)=O)C(=O)[C@@H]1CCCN1 SRBFGSGDNNQABI-FHWLQOOXSA-N 0.000 description 1
- SUENWIFTSTWUKD-AVGNSLFASA-N Pro-Leu-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O SUENWIFTSTWUKD-AVGNSLFASA-N 0.000 description 1
- XQPHBAKJJJZOBX-SRVKXCTJSA-N Pro-Lys-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O XQPHBAKJJJZOBX-SRVKXCTJSA-N 0.000 description 1
- ABSSTGUCBCDKMU-UWVGGRQHSA-N Pro-Lys-Gly Chemical compound NCCCC[C@@H](C(=O)NCC(O)=O)NC(=O)[C@@H]1CCCN1 ABSSTGUCBCDKMU-UWVGGRQHSA-N 0.000 description 1
- DWGFLKQSGRUQTI-IHRRRGAJSA-N Pro-Lys-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H]1CCCN1 DWGFLKQSGRUQTI-IHRRRGAJSA-N 0.000 description 1
- ULWBBFKQBDNGOY-RWMBFGLXSA-N Pro-Lys-Pro Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CCCCN)C(=O)N2CCC[C@@H]2C(=O)O ULWBBFKQBDNGOY-RWMBFGLXSA-N 0.000 description 1
- WOIFYRZPIORBRY-AVGNSLFASA-N Pro-Lys-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O WOIFYRZPIORBRY-AVGNSLFASA-N 0.000 description 1
- ZUZINZIJHJFJRN-UBHSHLNASA-N Pro-Phe-Ala Chemical compound C([C@@H](C(=O)N[C@@H](C)C(O)=O)NC(=O)[C@H]1NCCC1)C1=CC=CC=C1 ZUZINZIJHJFJRN-UBHSHLNASA-N 0.000 description 1
- VGVCNKSUVSZEIE-IHRRRGAJSA-N Pro-Phe-Asn Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(N)=O)C(O)=O VGVCNKSUVSZEIE-IHRRRGAJSA-N 0.000 description 1
- MLKVIVZCFYRTIR-KKUMJFAQSA-N Pro-Phe-Gln Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O MLKVIVZCFYRTIR-KKUMJFAQSA-N 0.000 description 1
- JIWJRKNYLSHONY-KKUMJFAQSA-N Pro-Phe-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O JIWJRKNYLSHONY-KKUMJFAQSA-N 0.000 description 1
- GNADVDLLGVSXLS-ULQDDVLXSA-N Pro-Phe-His Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CNC=N1)C(O)=O GNADVDLLGVSXLS-ULQDDVLXSA-N 0.000 description 1
- WHNJMTHJGCEKGA-ULQDDVLXSA-N Pro-Phe-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O WHNJMTHJGCEKGA-ULQDDVLXSA-N 0.000 description 1
- BUEIYHBJHCDAMI-UFYCRDLUSA-N Pro-Phe-Phe Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O BUEIYHBJHCDAMI-UFYCRDLUSA-N 0.000 description 1
- ZVEQWRWMRFIVSD-HRCADAONSA-N Pro-Phe-Pro Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC2=CC=CC=C2)C(=O)N3CCC[C@@H]3C(=O)O ZVEQWRWMRFIVSD-HRCADAONSA-N 0.000 description 1
- SPLBRAKYXGOFSO-UNQGMJICSA-N Pro-Phe-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@@H]2CCCN2)O SPLBRAKYXGOFSO-UNQGMJICSA-N 0.000 description 1
- XYAFCOJKICBRDU-JYJNAYRXSA-N Pro-Phe-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(O)=O XYAFCOJKICBRDU-JYJNAYRXSA-N 0.000 description 1
- KDBHVPXBQADZKY-GUBZILKMSA-N Pro-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 KDBHVPXBQADZKY-GUBZILKMSA-N 0.000 description 1
- DWPXHLIBFQLKLK-CYDGBPFRSA-N Pro-Pro-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 DWPXHLIBFQLKLK-CYDGBPFRSA-N 0.000 description 1
- OWQXAJQZLWHPBH-FXQIFTODSA-N Pro-Ser-Asn Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O OWQXAJQZLWHPBH-FXQIFTODSA-N 0.000 description 1
- BGWKULMLUIUPKY-BQBZGAKWSA-N Pro-Ser-Gly Chemical compound OC(=O)CNC(=O)[C@H](CO)NC(=O)[C@@H]1CCCN1 BGWKULMLUIUPKY-BQBZGAKWSA-N 0.000 description 1
- ITUDDXVFGFEKPD-NAKRPEOUSA-N Pro-Ser-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O ITUDDXVFGFEKPD-NAKRPEOUSA-N 0.000 description 1
- QKDIHFHGHBYTKB-IHRRRGAJSA-N Pro-Ser-Phe Chemical compound N([C@@H](CO)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C(=O)[C@@H]1CCCN1 QKDIHFHGHBYTKB-IHRRRGAJSA-N 0.000 description 1
- PRKWBYCXBBSLSK-GUBZILKMSA-N Pro-Ser-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O PRKWBYCXBBSLSK-GUBZILKMSA-N 0.000 description 1
- PKHDJFHFMGQMPS-RCWTZXSCSA-N Pro-Thr-Arg Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O PKHDJFHFMGQMPS-RCWTZXSCSA-N 0.000 description 1
- HRIXMVRZRGFKNQ-HJGDQZAQSA-N Pro-Thr-Gln Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(O)=O HRIXMVRZRGFKNQ-HJGDQZAQSA-N 0.000 description 1
- IURWWZYKYPEANQ-HJGDQZAQSA-N Pro-Thr-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O IURWWZYKYPEANQ-HJGDQZAQSA-N 0.000 description 1
- DCHQYSOGURGJST-FJXKBIBVSA-N Pro-Thr-Gly Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O DCHQYSOGURGJST-FJXKBIBVSA-N 0.000 description 1
- AJJDPGVVNPUZCR-RHYQMDGZSA-N Pro-Thr-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@@H]1CCCN1)O AJJDPGVVNPUZCR-RHYQMDGZSA-N 0.000 description 1
- GXWRTSIVLSQACD-RCWTZXSCSA-N Pro-Thr-Met Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@@H]1CCCN1)O GXWRTSIVLSQACD-RCWTZXSCSA-N 0.000 description 1
- CNUIHOAISPKQPY-HSHDSVGOSA-N Pro-Thr-Trp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O CNUIHOAISPKQPY-HSHDSVGOSA-N 0.000 description 1
- VVAWNPIOYXAMAL-KJEVXHAQSA-N Pro-Thr-Tyr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O VVAWNPIOYXAMAL-KJEVXHAQSA-N 0.000 description 1
- CXGLFEOYCJFKPR-RCWTZXSCSA-N Pro-Thr-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O CXGLFEOYCJFKPR-RCWTZXSCSA-N 0.000 description 1
- VGFFUEVZKRNRHT-ULQDDVLXSA-N Pro-Trp-Glu Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC2=CNC3=CC=CC=C32)C(=O)N[C@@H](CCC(=O)O)C(=O)O VGFFUEVZKRNRHT-ULQDDVLXSA-N 0.000 description 1
- DIDLUFMLRUJLFB-FKBYEOEOSA-N Pro-Trp-Tyr Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC2=CNC3=CC=CC=C32)C(=O)N[C@@H](CC4=CC=C(C=C4)O)C(=O)O DIDLUFMLRUJLFB-FKBYEOEOSA-N 0.000 description 1
- ZYJMLBCDFPIGNL-JYJNAYRXSA-N Pro-Tyr-Arg Chemical compound NC(=N)NCCC[C@H](NC(=O)[C@H](Cc1ccc(O)cc1)NC(=O)[C@@H]1CCCN1)C(O)=O ZYJMLBCDFPIGNL-JYJNAYRXSA-N 0.000 description 1
- LZHHZYDPMZEMRX-STQMWFEESA-N Pro-Tyr-Gly Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(O)=O LZHHZYDPMZEMRX-STQMWFEESA-N 0.000 description 1
- BXHRXLMCYSZSIY-STECZYCISA-N Pro-Tyr-Ile Chemical compound CC[C@H](C)[C@H](NC(=O)[C@H](Cc1ccc(O)cc1)NC(=O)[C@@H]1CCCN1)C(O)=O BXHRXLMCYSZSIY-STECZYCISA-N 0.000 description 1
- QDDJNKWPTJHROJ-UFYCRDLUSA-N Pro-Tyr-Tyr Chemical compound C([C@@H](C(=O)O)NC(=O)[C@H](CC=1C=CC(O)=CC=1)NC(=O)[C@H]1NCCC1)C1=CC=C(O)C=C1 QDDJNKWPTJHROJ-UFYCRDLUSA-N 0.000 description 1
- FIDNSJUXESUDOV-JYJNAYRXSA-N Pro-Tyr-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(O)=O FIDNSJUXESUDOV-JYJNAYRXSA-N 0.000 description 1
- JXVXYRZQIUPYSA-NHCYSSNCSA-N Pro-Val-Gln Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O JXVXYRZQIUPYSA-NHCYSSNCSA-N 0.000 description 1
- KHRLUIPIMIQFGT-AVGNSLFASA-N Pro-Val-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O KHRLUIPIMIQFGT-AVGNSLFASA-N 0.000 description 1
- ZMLRZBWCXPQADC-TUAOUCFPSA-N Pro-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@@H]2CCCN2 ZMLRZBWCXPQADC-TUAOUCFPSA-N 0.000 description 1
- MTMJNKFZDQEVSY-BZSNNMDCSA-N Pro-Val-Trp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O MTMJNKFZDQEVSY-BZSNNMDCSA-N 0.000 description 1
- FHJQROWZEJFZPO-SRVKXCTJSA-N Pro-Val-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H]1CCCN1 FHJQROWZEJFZPO-SRVKXCTJSA-N 0.000 description 1
- CZPWVGJYEJSRLH-UHFFFAOYSA-N Pyrimidine Chemical compound C1=CN=CN=C1 CZPWVGJYEJSRLH-UHFFFAOYSA-N 0.000 description 1
- 108010079005 RDV peptide Proteins 0.000 description 1
- 230000006819 RNA synthesis Effects 0.000 description 1
- 108010025216 RVF peptide Proteins 0.000 description 1
- 108010083644 Ribonucleases Proteins 0.000 description 1
- 102000006382 Ribonucleases Human genes 0.000 description 1
- 108091007539 SARS-CoV-2 ORF1a Proteins 0.000 description 1
- FIXILCYTSAUERA-FXQIFTODSA-N Ser-Ala-Arg Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O FIXILCYTSAUERA-FXQIFTODSA-N 0.000 description 1
- LVVBAKCGXXUHFO-ZLUOBGJFSA-N Ser-Ala-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(O)=O LVVBAKCGXXUHFO-ZLUOBGJFSA-N 0.000 description 1
- MMGJPDWSIOAGTH-ACZMJKKPSA-N Ser-Ala-Gln Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(O)=O MMGJPDWSIOAGTH-ACZMJKKPSA-N 0.000 description 1
- BTKUIVBNGBFTTP-WHFBIAKZSA-N Ser-Ala-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)NCC(O)=O BTKUIVBNGBFTTP-WHFBIAKZSA-N 0.000 description 1
- HRNQLKCLPVKZNE-CIUDSAMLSA-N Ser-Ala-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O HRNQLKCLPVKZNE-CIUDSAMLSA-N 0.000 description 1
- WTUJZHKANPDPIN-CIUDSAMLSA-N Ser-Ala-Lys Chemical compound C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CO)N WTUJZHKANPDPIN-CIUDSAMLSA-N 0.000 description 1
- IDQFQFVEWMWRQQ-DLOVCJGASA-N Ser-Ala-Phe Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O IDQFQFVEWMWRQQ-DLOVCJGASA-N 0.000 description 1
- BRKHVZNDAOMAHX-BIIVOSGPSA-N Ser-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CO)N BRKHVZNDAOMAHX-BIIVOSGPSA-N 0.000 description 1
- PZZJMBYSYAKYPK-UWJYBYFXSA-N Ser-Ala-Tyr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O PZZJMBYSYAKYPK-UWJYBYFXSA-N 0.000 description 1
- FCRMLGJMPXCAHD-FXQIFTODSA-N Ser-Arg-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(O)=O FCRMLGJMPXCAHD-FXQIFTODSA-N 0.000 description 1
- KYKKKSWGEPFUMR-NAKRPEOUSA-N Ser-Arg-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KYKKKSWGEPFUMR-NAKRPEOUSA-N 0.000 description 1
- OOKCGAYXSNJBGQ-ZLUOBGJFSA-N Ser-Asn-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O OOKCGAYXSNJBGQ-ZLUOBGJFSA-N 0.000 description 1
- UCXDHBORXLVBNC-ZLUOBGJFSA-N Ser-Asn-Cys Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CS)C(O)=O UCXDHBORXLVBNC-ZLUOBGJFSA-N 0.000 description 1
- FIDMVVBUOCMMJG-CIUDSAMLSA-N Ser-Asn-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CO FIDMVVBUOCMMJG-CIUDSAMLSA-N 0.000 description 1
- KAAPNMOKUUPKOE-SRVKXCTJSA-N Ser-Asn-Phe Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 KAAPNMOKUUPKOE-SRVKXCTJSA-N 0.000 description 1
- CNIIKZQXBBQHCX-FXQIFTODSA-N Ser-Asp-Arg Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O CNIIKZQXBBQHCX-FXQIFTODSA-N 0.000 description 1
- MESDJCNHLZBMEP-ZLUOBGJFSA-N Ser-Asp-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O MESDJCNHLZBMEP-ZLUOBGJFSA-N 0.000 description 1
- FTVRVZNYIYWJGB-ACZMJKKPSA-N Ser-Asp-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O FTVRVZNYIYWJGB-ACZMJKKPSA-N 0.000 description 1
- BNFVPSRLHHPQKS-WHFBIAKZSA-N Ser-Asp-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O BNFVPSRLHHPQKS-WHFBIAKZSA-N 0.000 description 1
- OLIJLNWFEQEFDM-SRVKXCTJSA-N Ser-Asp-Phe Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 OLIJLNWFEQEFDM-SRVKXCTJSA-N 0.000 description 1
- GHPQVUYZQQGEDA-BIIVOSGPSA-N Ser-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)[C@H](CO)N)C(=O)O GHPQVUYZQQGEDA-BIIVOSGPSA-N 0.000 description 1
- BTPAWKABYQMKKN-LKXGYXEUSA-N Ser-Asp-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O BTPAWKABYQMKKN-LKXGYXEUSA-N 0.000 description 1
- HEQPKICPPDOSIN-SRVKXCTJSA-N Ser-Asp-Tyr Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 HEQPKICPPDOSIN-SRVKXCTJSA-N 0.000 description 1
- ZHYMUFQVKGJNRM-ZLUOBGJFSA-N Ser-Cys-Asn Chemical compound OC[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@H](C(O)=O)CC(N)=O ZHYMUFQVKGJNRM-ZLUOBGJFSA-N 0.000 description 1
- KNCJWSPMTFFJII-ZLUOBGJFSA-N Ser-Cys-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(O)=O)C(O)=O KNCJWSPMTFFJII-ZLUOBGJFSA-N 0.000 description 1
- RNFKSBPHLTZHLU-WHFBIAKZSA-N Ser-Cys-Gly Chemical compound C([C@@H](C(=O)N[C@@H](CS)C(=O)NCC(=O)O)N)O RNFKSBPHLTZHLU-WHFBIAKZSA-N 0.000 description 1
- INCNPLPRPOYTJI-JBDRJPRFSA-N Ser-Cys-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CO)N INCNPLPRPOYTJI-JBDRJPRFSA-N 0.000 description 1
- KMWFXJCGRXBQAC-CIUDSAMLSA-N Ser-Cys-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CO)N KMWFXJCGRXBQAC-CIUDSAMLSA-N 0.000 description 1
- XSYJDGIDKRNWFX-SRVKXCTJSA-N Ser-Cys-Phe Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O XSYJDGIDKRNWFX-SRVKXCTJSA-N 0.000 description 1
- MOVJSUIKUNCVMG-ZLUOBGJFSA-N Ser-Cys-Ser Chemical compound C([C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CO)C(=O)O)N)O MOVJSUIKUNCVMG-ZLUOBGJFSA-N 0.000 description 1
- CRZRTKAVUUGKEQ-ACZMJKKPSA-N Ser-Gln-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O CRZRTKAVUUGKEQ-ACZMJKKPSA-N 0.000 description 1
- ZOHGLPQGEHSLPD-FXQIFTODSA-N Ser-Gln-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O ZOHGLPQGEHSLPD-FXQIFTODSA-N 0.000 description 1
- BQWCDDAISCPDQV-XHNCKOQMSA-N Ser-Gln-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CO)N)C(=O)O BQWCDDAISCPDQV-XHNCKOQMSA-N 0.000 description 1
- FMDHKPRACUXATF-ACZMJKKPSA-N Ser-Gln-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O FMDHKPRACUXATF-ACZMJKKPSA-N 0.000 description 1
- KJMOINFQVCCSDX-XKBZYTNZSA-N Ser-Gln-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KJMOINFQVCCSDX-XKBZYTNZSA-N 0.000 description 1
- UICKAKRRRBTILH-GUBZILKMSA-N Ser-Glu-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CO)N UICKAKRRRBTILH-GUBZILKMSA-N 0.000 description 1
- BRGQQXQKPUCUJQ-KBIXCLLPSA-N Ser-Glu-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O BRGQQXQKPUCUJQ-KBIXCLLPSA-N 0.000 description 1
- GRSLLFZTTLBOQX-CIUDSAMLSA-N Ser-Glu-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CO)N GRSLLFZTTLBOQX-CIUDSAMLSA-N 0.000 description 1
- DSGYZICNAMEJOC-AVGNSLFASA-N Ser-Glu-Phe Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O DSGYZICNAMEJOC-AVGNSLFASA-N 0.000 description 1
- UFKPDBLKLOBMRH-XHNCKOQMSA-N Ser-Glu-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CO)N)C(=O)O UFKPDBLKLOBMRH-XHNCKOQMSA-N 0.000 description 1
- WBINSDOPZHQPPM-AVGNSLFASA-N Ser-Glu-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CO)N)O WBINSDOPZHQPPM-AVGNSLFASA-N 0.000 description 1
- UQFYNFTYDHUIMI-WHFBIAKZSA-N Ser-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)[C@@H](N)CO UQFYNFTYDHUIMI-WHFBIAKZSA-N 0.000 description 1
- IOVHBRCQOGWAQH-ZKWXMUAHSA-N Ser-Gly-Ile Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H]([C@@H](C)CC)C(O)=O IOVHBRCQOGWAQH-ZKWXMUAHSA-N 0.000 description 1
- GZFAWAQTEYDKII-YUMQZZPRSA-N Ser-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CO GZFAWAQTEYDKII-YUMQZZPRSA-N 0.000 description 1
- WSTIOCFMWXNOCX-YUMQZZPRSA-N Ser-Gly-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CO)N WSTIOCFMWXNOCX-YUMQZZPRSA-N 0.000 description 1
- UIGMAMGZOJVTDN-WHFBIAKZSA-N Ser-Gly-Ser Chemical compound OC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O UIGMAMGZOJVTDN-WHFBIAKZSA-N 0.000 description 1
- FYUIFUJFNCLUIX-XVYDVKMFSA-N Ser-His-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(O)=O FYUIFUJFNCLUIX-XVYDVKMFSA-N 0.000 description 1
- MOQDPPUMFSMYOM-KKUMJFAQSA-N Ser-His-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CC2=CN=CN2)NC(=O)[C@H](CO)N MOQDPPUMFSMYOM-KKUMJFAQSA-N 0.000 description 1
- WEQAYODCJHZSJZ-KKUMJFAQSA-N Ser-His-Tyr Chemical compound C([C@H](NC(=O)[C@H](CO)N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CN=CN1 WEQAYODCJHZSJZ-KKUMJFAQSA-N 0.000 description 1
- BKZYBLLIBOBOOW-GHCJXIJMSA-N Ser-Ile-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O BKZYBLLIBOBOOW-GHCJXIJMSA-N 0.000 description 1
- DJACUBDEDBZKLQ-KBIXCLLPSA-N Ser-Ile-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(O)=O DJACUBDEDBZKLQ-KBIXCLLPSA-N 0.000 description 1
- BEAFYHFQTOTVFS-VGDYDELISA-N Ser-Ile-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CO)N BEAFYHFQTOTVFS-VGDYDELISA-N 0.000 description 1
- HBTCFCHYALPXME-HTFCKZLJSA-N Ser-Ile-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O HBTCFCHYALPXME-HTFCKZLJSA-N 0.000 description 1
- YMDNFPNTIPQMJP-NAKRPEOUSA-N Ser-Ile-Met Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCSC)C(O)=O YMDNFPNTIPQMJP-NAKRPEOUSA-N 0.000 description 1
- LWMQRHDTXHQQOV-MXAVVETBSA-N Ser-Ile-Phe Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O LWMQRHDTXHQQOV-MXAVVETBSA-N 0.000 description 1
- ZOPISOXXPQNOCO-SVSWQMSJSA-N Ser-Ile-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](CO)N ZOPISOXXPQNOCO-SVSWQMSJSA-N 0.000 description 1
- DOSZISJPMCYEHT-NAKRPEOUSA-N Ser-Ile-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O DOSZISJPMCYEHT-NAKRPEOUSA-N 0.000 description 1
- FUMGHWDRRFCKEP-CIUDSAMLSA-N Ser-Leu-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O FUMGHWDRRFCKEP-CIUDSAMLSA-N 0.000 description 1
- IAORETPTUDBBGV-CIUDSAMLSA-N Ser-Leu-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CO)N IAORETPTUDBBGV-CIUDSAMLSA-N 0.000 description 1
- XNCUYZKGQOCOQH-YUMQZZPRSA-N Ser-Leu-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O XNCUYZKGQOCOQH-YUMQZZPRSA-N 0.000 description 1
- HEUVHBXOVZONPU-BJDJZHNGSA-N Ser-Leu-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O HEUVHBXOVZONPU-BJDJZHNGSA-N 0.000 description 1
- MUJQWSAWLLRJCE-KATARQTJSA-N Ser-Leu-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MUJQWSAWLLRJCE-KATARQTJSA-N 0.000 description 1
- GZSZPKSBVAOGIE-CIUDSAMLSA-N Ser-Lys-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O GZSZPKSBVAOGIE-CIUDSAMLSA-N 0.000 description 1
- HDBOEVPDIDDEPC-CIUDSAMLSA-N Ser-Lys-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O HDBOEVPDIDDEPC-CIUDSAMLSA-N 0.000 description 1
- BYCVMHKULKRVPV-GUBZILKMSA-N Ser-Lys-Gln Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(O)=O BYCVMHKULKRVPV-GUBZILKMSA-N 0.000 description 1
- OWCVUSJMEBGMOK-YUMQZZPRSA-N Ser-Lys-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O OWCVUSJMEBGMOK-YUMQZZPRSA-N 0.000 description 1
- SRKMDKACHDVPMD-SRVKXCTJSA-N Ser-Lys-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CO)N SRKMDKACHDVPMD-SRVKXCTJSA-N 0.000 description 1
- XUDRHBPSPAPDJP-SRVKXCTJSA-N Ser-Lys-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CO XUDRHBPSPAPDJP-SRVKXCTJSA-N 0.000 description 1
- WGDYNRCOQRERLZ-KKUMJFAQSA-N Ser-Lys-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CO)N WGDYNRCOQRERLZ-KKUMJFAQSA-N 0.000 description 1
- PTWIYDNFWPXQSD-GARJFASQSA-N Ser-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)[C@H](CO)N)C(=O)O PTWIYDNFWPXQSD-GARJFASQSA-N 0.000 description 1
- LRZLZIUXQBIWTB-KATARQTJSA-N Ser-Lys-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LRZLZIUXQBIWTB-KATARQTJSA-N 0.000 description 1
- JJUNLJTUIKFPRF-BPUTZDHNSA-N Ser-Met-Trp Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CO)N JJUNLJTUIKFPRF-BPUTZDHNSA-N 0.000 description 1
- JAWGSPUJAXYXJA-IHRRRGAJSA-N Ser-Phe-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](CO)N)CC1=CC=CC=C1 JAWGSPUJAXYXJA-IHRRRGAJSA-N 0.000 description 1
- GDUZTEQRAOXYJS-SRVKXCTJSA-N Ser-Phe-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CO)N GDUZTEQRAOXYJS-SRVKXCTJSA-N 0.000 description 1
- KZPRPBLHYMZIMH-MXAVVETBSA-N Ser-Phe-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KZPRPBLHYMZIMH-MXAVVETBSA-N 0.000 description 1
- UPLYXVPQLJVWMM-KKUMJFAQSA-N Ser-Phe-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O UPLYXVPQLJVWMM-KKUMJFAQSA-N 0.000 description 1
- FBLNYDYPCLFTSP-IXOXFDKPSA-N Ser-Phe-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O FBLNYDYPCLFTSP-IXOXFDKPSA-N 0.000 description 1
- PJIQEIFXZPCWOJ-FXQIFTODSA-N Ser-Pro-Asp Chemical compound [H]N[C@@H](CO)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O PJIQEIFXZPCWOJ-FXQIFTODSA-N 0.000 description 1
- GZGFSPWOMUKKCV-NAKRPEOUSA-N Ser-Pro-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CO GZGFSPWOMUKKCV-NAKRPEOUSA-N 0.000 description 1
- QUGRFWPMPVIAPW-IHRRRGAJSA-N Ser-Pro-Phe Chemical compound OC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 QUGRFWPMPVIAPW-IHRRRGAJSA-N 0.000 description 1
- FLONGDPORFIVQW-XGEHTFHBSA-N Ser-Pro-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CO FLONGDPORFIVQW-XGEHTFHBSA-N 0.000 description 1
- CKDXFSPMIDSMGV-GUBZILKMSA-N Ser-Pro-Val Chemical compound [H]N[C@@H](CO)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O CKDXFSPMIDSMGV-GUBZILKMSA-N 0.000 description 1
- WLJPJRGQRNCIQS-ZLUOBGJFSA-N Ser-Ser-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O WLJPJRGQRNCIQS-ZLUOBGJFSA-N 0.000 description 1
- NVNPWELENFJOHH-CIUDSAMLSA-N Ser-Ser-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CO)N NVNPWELENFJOHH-CIUDSAMLSA-N 0.000 description 1
- OZPDGESCTGGNAD-CIUDSAMLSA-N Ser-Ser-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CO OZPDGESCTGGNAD-CIUDSAMLSA-N 0.000 description 1
- XQJCEKXQUJQNNK-ZLUOBGJFSA-N Ser-Ser-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O XQJCEKXQUJQNNK-ZLUOBGJFSA-N 0.000 description 1
- JURQXQBJKUHGJS-UHFFFAOYSA-N Ser-Ser-Ser-Ser Chemical compound OCC(N)C(=O)NC(CO)C(=O)NC(CO)C(=O)NC(CO)C(O)=O JURQXQBJKUHGJS-UHFFFAOYSA-N 0.000 description 1
- XJDMUQCLVSCRSJ-VZFHVOOUSA-N Ser-Thr-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O XJDMUQCLVSCRSJ-VZFHVOOUSA-N 0.000 description 1
- RXUOAOOZIWABBW-XGEHTFHBSA-N Ser-Thr-Arg Chemical compound OC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N RXUOAOOZIWABBW-XGEHTFHBSA-N 0.000 description 1
- WUXCHQZLUHBSDJ-LKXGYXEUSA-N Ser-Thr-Asp Chemical compound OC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CC(O)=O)C(O)=O WUXCHQZLUHBSDJ-LKXGYXEUSA-N 0.000 description 1
- SZRNDHWMVSFPSP-XKBZYTNZSA-N Ser-Thr-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CO)N)O SZRNDHWMVSFPSP-XKBZYTNZSA-N 0.000 description 1
- KKKVOZNCLALMPV-XKBZYTNZSA-N Ser-Thr-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O KKKVOZNCLALMPV-XKBZYTNZSA-N 0.000 description 1
- DYEGLQRVMBWQLD-IXOXFDKPSA-N Ser-Thr-Phe Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CO)N)O DYEGLQRVMBWQLD-IXOXFDKPSA-N 0.000 description 1
- ZSDXEKUKQAKZFE-XAVMHZPKSA-N Ser-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CO)N)O ZSDXEKUKQAKZFE-XAVMHZPKSA-N 0.000 description 1
- VLMIUSLQONKLDV-HEIBUPTGSA-N Ser-Thr-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O VLMIUSLQONKLDV-HEIBUPTGSA-N 0.000 description 1
- OJFFAQFRCVPHNN-JYBASQMISA-N Ser-Thr-Trp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O OJFFAQFRCVPHNN-JYBASQMISA-N 0.000 description 1
- ZKOKTQPHFMRSJP-YJRXYDGGSA-N Ser-Thr-Tyr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O ZKOKTQPHFMRSJP-YJRXYDGGSA-N 0.000 description 1
- ATEQEHCGZKBEMU-GQGQLFGLSA-N Ser-Trp-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@H](CO)N ATEQEHCGZKBEMU-GQGQLFGLSA-N 0.000 description 1
- XPVIVVLLLOFBRH-XIRDDKMYSA-N Ser-Trp-Lys Chemical compound NCCCC[C@H](NC(=O)[C@H](Cc1c[nH]c2ccccc12)NC(=O)[C@@H](N)CO)C(O)=O XPVIVVLLLOFBRH-XIRDDKMYSA-N 0.000 description 1
- FRPNVPKQVFHSQY-BPUTZDHNSA-N Ser-Trp-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@H](CO)N FRPNVPKQVFHSQY-BPUTZDHNSA-N 0.000 description 1
- HAUVENOGHPECML-BPUTZDHNSA-N Ser-Trp-Val Chemical compound C1=CC=C2C(C[C@@H](C(=O)N[C@@H](C(C)C)C(O)=O)NC(=O)[C@@H](N)CO)=CNC2=C1 HAUVENOGHPECML-BPUTZDHNSA-N 0.000 description 1
- QYBRQMLZDDJBSW-AVGNSLFASA-N Ser-Tyr-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O QYBRQMLZDDJBSW-AVGNSLFASA-N 0.000 description 1
- FHXGMDRKJHKLKW-QWRGUYRKSA-N Ser-Tyr-Gly Chemical compound OC[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CC=C(O)C=C1 FHXGMDRKJHKLKW-QWRGUYRKSA-N 0.000 description 1
- VVKVHAOOUGNDPJ-SRVKXCTJSA-N Ser-Tyr-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(O)=O VVKVHAOOUGNDPJ-SRVKXCTJSA-N 0.000 description 1
- IAOHCSQDQDWRQU-GUBZILKMSA-N Ser-Val-Arg Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O IAOHCSQDQDWRQU-GUBZILKMSA-N 0.000 description 1
- PCMZJFMUYWIERL-ZKWXMUAHSA-N Ser-Val-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O PCMZJFMUYWIERL-ZKWXMUAHSA-N 0.000 description 1
- LLSLRQOEAFCZLW-NRPADANISA-N Ser-Val-Gln Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O LLSLRQOEAFCZLW-NRPADANISA-N 0.000 description 1
- BEBVVQPDSHHWQL-NRPADANISA-N Ser-Val-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O BEBVVQPDSHHWQL-NRPADANISA-N 0.000 description 1
- MFQMZDPAZRZAPV-NAKRPEOUSA-N Ser-Val-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CO)N MFQMZDPAZRZAPV-NAKRPEOUSA-N 0.000 description 1
- LGIMRDKGABDMBN-DCAQKATOSA-N Ser-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CO)N LGIMRDKGABDMBN-DCAQKATOSA-N 0.000 description 1
- HNDMFDBQXYZSRM-IHRRRGAJSA-N Ser-Val-Phe Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O HNDMFDBQXYZSRM-IHRRRGAJSA-N 0.000 description 1
- SIEBDTCABMZCLF-XGEHTFHBSA-N Ser-Val-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SIEBDTCABMZCLF-XGEHTFHBSA-N 0.000 description 1
- ODRUTDLAONAVDV-IHRRRGAJSA-N Ser-Val-Tyr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O ODRUTDLAONAVDV-IHRRRGAJSA-N 0.000 description 1
- 101000667982 Severe acute respiratory syndrome coronavirus 2 Envelope small membrane protein Proteins 0.000 description 1
- 101000953880 Severe acute respiratory syndrome coronavirus 2 Membrane protein Proteins 0.000 description 1
- 101001024637 Severe acute respiratory syndrome coronavirus 2 Nucleoprotein Proteins 0.000 description 1
- 101000979057 Severe acute respiratory syndrome coronavirus 2 ORF6 protein Proteins 0.000 description 1
- 101000970479 Severe acute respiratory syndrome coronavirus 2 ORF8 protein Proteins 0.000 description 1
- 101000629318 Severe acute respiratory syndrome coronavirus 2 Spike glycoprotein Proteins 0.000 description 1
- 101710198474 Spike protein Proteins 0.000 description 1
- 108091081024 Start codon Proteins 0.000 description 1
- 108010090804 Streptavidin Chemical group 0.000 description 1
- 108091027544 Subgenomic mRNA Proteins 0.000 description 1
- 210000001744 T-lymphocyte Anatomy 0.000 description 1
- 101710137500 T7 RNA polymerase Proteins 0.000 description 1
- BHEOSNUKNHRBNM-UHFFFAOYSA-N Tetramethylsqualene Natural products CC(=C)C(C)CCC(=C)C(C)CCC(C)=CCCC=C(C)CCC(C)C(=C)CCC(C)C(C)=C BHEOSNUKNHRBNM-UHFFFAOYSA-N 0.000 description 1
- IGROJMCBGRFRGI-YTLHQDLWSA-N Thr-Ala-Ala Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O IGROJMCBGRFRGI-YTLHQDLWSA-N 0.000 description 1
- MQCPGOZXFSYJPS-KZVJFYERSA-N Thr-Ala-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O MQCPGOZXFSYJPS-KZVJFYERSA-N 0.000 description 1
- ZUXQFMVPAYGPFJ-JXUBOQSCSA-N Thr-Ala-Lys Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCCN ZUXQFMVPAYGPFJ-JXUBOQSCSA-N 0.000 description 1
- CAGTXGDOIFXLPC-KZVJFYERSA-N Thr-Arg-Ala Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C)C(O)=O)CCCN=C(N)N CAGTXGDOIFXLPC-KZVJFYERSA-N 0.000 description 1
- JMZKMSTYXHFYAK-VEVYYDQMSA-N Thr-Arg-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC(=O)N)C(=O)O)N)O JMZKMSTYXHFYAK-VEVYYDQMSA-N 0.000 description 1
- GLQFKOVWXPPFTP-VEVYYDQMSA-N Thr-Arg-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O GLQFKOVWXPPFTP-VEVYYDQMSA-N 0.000 description 1
- NAXBBCLCEOTAIG-RHYQMDGZSA-N Thr-Arg-Lys Chemical compound NC(N)=NCCC[C@H](NC(=O)[C@@H](N)[C@H](O)C)C(=O)N[C@@H](CCCCN)C(O)=O NAXBBCLCEOTAIG-RHYQMDGZSA-N 0.000 description 1
- WFUAUEQXPVNAEF-ZJDVBMNYSA-N Thr-Arg-Thr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H]([C@@H](C)O)C(O)=O)CCCN=C(N)N WFUAUEQXPVNAEF-ZJDVBMNYSA-N 0.000 description 1
- JNQZPAWOPBZGIX-RCWTZXSCSA-N Thr-Arg-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)[C@@H](C)O)CCCN=C(N)N JNQZPAWOPBZGIX-RCWTZXSCSA-N 0.000 description 1
- VIBXMCZWVUOZLA-OLHMAJIHSA-N Thr-Asn-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)N)C(=O)O)N)O VIBXMCZWVUOZLA-OLHMAJIHSA-N 0.000 description 1
- VASYSJHSMSBTDU-LKXGYXEUSA-N Thr-Asn-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CS)C(=O)O)N)O VASYSJHSMSBTDU-LKXGYXEUSA-N 0.000 description 1
- CTONFVDJYCAMQM-IUKAMOBKSA-N Thr-Asn-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H]([C@@H](C)O)N CTONFVDJYCAMQM-IUKAMOBKSA-N 0.000 description 1
- SKHPKKYKDYULDH-HJGDQZAQSA-N Thr-Asn-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O SKHPKKYKDYULDH-HJGDQZAQSA-N 0.000 description 1
- QNJZOAHSYPXTAB-VEVYYDQMSA-N Thr-Asn-Met Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCSC)C(O)=O QNJZOAHSYPXTAB-VEVYYDQMSA-N 0.000 description 1
- JBHMLZSKIXMVFS-XVSYOHENSA-N Thr-Asn-Phe Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O JBHMLZSKIXMVFS-XVSYOHENSA-N 0.000 description 1
- PQLXHSACXPGWPD-GSSVUCPTSA-N Thr-Asn-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PQLXHSACXPGWPD-GSSVUCPTSA-N 0.000 description 1
- JVTHIXKSVYEWNI-JRQIVUDYSA-N Thr-Asn-Tyr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O JVTHIXKSVYEWNI-JRQIVUDYSA-N 0.000 description 1
- LMMDEZPNUTZJAY-GCJQMDKQSA-N Thr-Asp-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O LMMDEZPNUTZJAY-GCJQMDKQSA-N 0.000 description 1
- MFEBUIFJVPNZLO-OLHMAJIHSA-N Thr-Asp-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O MFEBUIFJVPNZLO-OLHMAJIHSA-N 0.000 description 1
- YBXMGKCLOPDEKA-NUMRIWBASA-N Thr-Asp-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O YBXMGKCLOPDEKA-NUMRIWBASA-N 0.000 description 1
- APIQKJYZDWVOCE-VEVYYDQMSA-N Thr-Asp-Met Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCSC)C(O)=O APIQKJYZDWVOCE-VEVYYDQMSA-N 0.000 description 1
- KRPKYGOFYUNIGM-XVSYOHENSA-N Thr-Asp-Phe Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N)O KRPKYGOFYUNIGM-XVSYOHENSA-N 0.000 description 1
- JXKMXEBNZCKSDY-JIOCBJNQSA-N Thr-Asp-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N)O JXKMXEBNZCKSDY-JIOCBJNQSA-N 0.000 description 1
- XDARBNMYXKUFOJ-GSSVUCPTSA-N Thr-Asp-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XDARBNMYXKUFOJ-GSSVUCPTSA-N 0.000 description 1
- DGOJNGCGEYOBKN-BWBBJGPYSA-N Thr-Cys-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CS)C(=O)O)N)O DGOJNGCGEYOBKN-BWBBJGPYSA-N 0.000 description 1
- LYGKYFKSZTUXGZ-ZDLURKLDSA-N Thr-Cys-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CS)C(=O)NCC(O)=O LYGKYFKSZTUXGZ-ZDLURKLDSA-N 0.000 description 1
- ZLNWJMRLHLGKFX-SVSWQMSJSA-N Thr-Cys-Ile Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CS)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O ZLNWJMRLHLGKFX-SVSWQMSJSA-N 0.000 description 1
- QILPDQCTQZDHFM-HJGDQZAQSA-N Thr-Gln-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O QILPDQCTQZDHFM-HJGDQZAQSA-N 0.000 description 1
- GCXFWAZRHBRYEM-NUMRIWBASA-N Thr-Gln-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC(=O)N)C(=O)O)N)O GCXFWAZRHBRYEM-NUMRIWBASA-N 0.000 description 1
- WLDUCKSCDRIVLJ-NUMRIWBASA-N Thr-Gln-Asp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N)O WLDUCKSCDRIVLJ-NUMRIWBASA-N 0.000 description 1
- ZQUKYJOKQBRBCS-GLLZPBPUSA-N Thr-Gln-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N)O ZQUKYJOKQBRBCS-GLLZPBPUSA-N 0.000 description 1
- VUVCRYXYUUPGSB-GLLZPBPUSA-N Thr-Gln-Glu Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N)O VUVCRYXYUUPGSB-GLLZPBPUSA-N 0.000 description 1
- UHBPFYOQQPFKQR-JHEQGTHGSA-N Thr-Gln-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O UHBPFYOQQPFKQR-JHEQGTHGSA-N 0.000 description 1
- RKDFEMGVMMYYNG-WDCWCFNPSA-N Thr-Gln-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O RKDFEMGVMMYYNG-WDCWCFNPSA-N 0.000 description 1
- LAFLAXHTDVNVEL-WDCWCFNPSA-N Thr-Gln-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N)O LAFLAXHTDVNVEL-WDCWCFNPSA-N 0.000 description 1
- XXNLGZRRSKPSGF-HTUGSXCWSA-N Thr-Gln-Phe Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N)O XXNLGZRRSKPSGF-HTUGSXCWSA-N 0.000 description 1
- KGKWKSSSQGGYAU-SUSMZKCASA-N Thr-Gln-Thr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N)O KGKWKSSSQGGYAU-SUSMZKCASA-N 0.000 description 1
- CQNFRKAKGDSJFR-NUMRIWBASA-N Thr-Glu-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N)O CQNFRKAKGDSJFR-NUMRIWBASA-N 0.000 description 1
- LGNBRHZANHMZHK-NUMRIWBASA-N Thr-Glu-Asp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N)O LGNBRHZANHMZHK-NUMRIWBASA-N 0.000 description 1
- SHOMROOOQBDGRL-JHEQGTHGSA-N Thr-Glu-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O SHOMROOOQBDGRL-JHEQGTHGSA-N 0.000 description 1
- LHEZGZQRLDBSRR-WDCWCFNPSA-N Thr-Glu-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O LHEZGZQRLDBSRR-WDCWCFNPSA-N 0.000 description 1
- VULNJDORNLBPNG-SWRJLBSHSA-N Thr-Glu-Trp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N)O VULNJDORNLBPNG-SWRJLBSHSA-N 0.000 description 1
- LKEKWDJCJSPXNI-IRIUXVKKSA-N Thr-Glu-Tyr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 LKEKWDJCJSPXNI-IRIUXVKKSA-N 0.000 description 1
- WYKJENSCCRJLRC-ZDLURKLDSA-N Thr-Gly-Cys Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)N[C@@H](CS)C(=O)O)N)O WYKJENSCCRJLRC-ZDLURKLDSA-N 0.000 description 1
- IMULJHHGAUZZFE-MBLNEYKQSA-N Thr-Gly-Ile Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H]([C@@H](C)CC)C(O)=O IMULJHHGAUZZFE-MBLNEYKQSA-N 0.000 description 1
- QQWNRERCGGZOKG-WEDXCCLWSA-N Thr-Gly-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O QQWNRERCGGZOKG-WEDXCCLWSA-N 0.000 description 1
- MSIYNSBKKVMGFO-BHNWBGBOSA-N Thr-Gly-Pro Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)N1CCC[C@@H]1C(=O)O)N)O MSIYNSBKKVMGFO-BHNWBGBOSA-N 0.000 description 1
- KBBRNEDOYWMIJP-KYNKHSRBSA-N Thr-Gly-Thr Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(=O)O)N)O KBBRNEDOYWMIJP-KYNKHSRBSA-N 0.000 description 1
- NQVDGKYAUHTCME-QTKMDUPCSA-N Thr-His-Arg Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N)O NQVDGKYAUHTCME-QTKMDUPCSA-N 0.000 description 1
- WPSDXXQRIVKBAY-NKIYYHGXSA-N Thr-His-Glu Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N)O WPSDXXQRIVKBAY-NKIYYHGXSA-N 0.000 description 1
- NCGUQWSJUKYCIT-SZZJOZGLSA-N Thr-His-Trp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O NCGUQWSJUKYCIT-SZZJOZGLSA-N 0.000 description 1
- XOWKUMFHEZLKLT-CIQUZCHMSA-N Thr-Ile-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O XOWKUMFHEZLKLT-CIQUZCHMSA-N 0.000 description 1
- WPAKPLPGQNUXGN-OSUNSFLBSA-N Thr-Ile-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O WPAKPLPGQNUXGN-OSUNSFLBSA-N 0.000 description 1
- PAXANSWUSVPFNK-IUKAMOBKSA-N Thr-Ile-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H]([C@@H](C)O)N PAXANSWUSVPFNK-IUKAMOBKSA-N 0.000 description 1
- GMXIJHCBTZDAPD-QPHKQPEJSA-N Thr-Ile-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)NC(=O)[C@H]([C@@H](C)O)N GMXIJHCBTZDAPD-QPHKQPEJSA-N 0.000 description 1
- AHOLTQCAVBSUDP-PPCPHDFISA-N Thr-Ile-Lys Chemical compound CC[C@H](C)[C@H](NC(=O)[C@@H](N)[C@@H](C)O)C(=O)N[C@@H](CCCCN)C(O)=O AHOLTQCAVBSUDP-PPCPHDFISA-N 0.000 description 1
- BVOVIGCHYNFJBZ-JXUBOQSCSA-N Thr-Leu-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O BVOVIGCHYNFJBZ-JXUBOQSCSA-N 0.000 description 1
- ODXKUIGEPAGKKV-KATARQTJSA-N Thr-Leu-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CS)C(=O)O)N)O ODXKUIGEPAGKKV-KATARQTJSA-N 0.000 description 1
- HOVLHEKTGVIKAP-WDCWCFNPSA-N Thr-Leu-Gln Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O HOVLHEKTGVIKAP-WDCWCFNPSA-N 0.000 description 1
- RFKVQLIXNVEOMB-WEDXCCLWSA-N Thr-Leu-Gly Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)NCC(=O)O)N)O RFKVQLIXNVEOMB-WEDXCCLWSA-N 0.000 description 1
- MECLEFZMPPOEAC-VOAKCMCISA-N Thr-Leu-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)O)N)O MECLEFZMPPOEAC-VOAKCMCISA-N 0.000 description 1
- PRNGXSILMXSWQQ-OEAJRASXSA-N Thr-Leu-Phe Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O PRNGXSILMXSWQQ-OEAJRASXSA-N 0.000 description 1
- VRUFCJZQDACGLH-UVOCVTCTSA-N Thr-Leu-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O VRUFCJZQDACGLH-UVOCVTCTSA-N 0.000 description 1
- BDGBHYCAZJPLHX-HJGDQZAQSA-N Thr-Lys-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O BDGBHYCAZJPLHX-HJGDQZAQSA-N 0.000 description 1
- CJXURNZYNHCYFD-WDCWCFNPSA-N Thr-Lys-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N)O CJXURNZYNHCYFD-WDCWCFNPSA-N 0.000 description 1
- MGJLBZFUXUGMML-VOAKCMCISA-N Thr-Lys-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)O)N)O MGJLBZFUXUGMML-VOAKCMCISA-N 0.000 description 1
- QNCFWHZVRNXAKW-OEAJRASXSA-N Thr-Lys-Phe Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O QNCFWHZVRNXAKW-OEAJRASXSA-N 0.000 description 1
- JWQNAFHCXKVZKZ-UVOCVTCTSA-N Thr-Lys-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O JWQNAFHCXKVZKZ-UVOCVTCTSA-N 0.000 description 1
- OWQKBXKXZFRRQL-XGEHTFHBSA-N Thr-Met-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CS)C(=O)O)N)O OWQKBXKXZFRRQL-XGEHTFHBSA-N 0.000 description 1
- XNTVWRJTUIOGQO-RHYQMDGZSA-N Thr-Met-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(O)=O XNTVWRJTUIOGQO-RHYQMDGZSA-N 0.000 description 1
- UXUAZXWKIGPUCH-RCWTZXSCSA-N Thr-Met-Met Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCSC)C(O)=O UXUAZXWKIGPUCH-RCWTZXSCSA-N 0.000 description 1
- CGCMNOIQVAXYMA-UNQGMJICSA-N Thr-Met-Phe Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O CGCMNOIQVAXYMA-UNQGMJICSA-N 0.000 description 1
- JMBRNXUOLJFURW-BEAPCOKYSA-N Thr-Phe-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N2CCC[C@@H]2C(=O)O)N)O JMBRNXUOLJFURW-BEAPCOKYSA-N 0.000 description 1
- NWECYMJLJGCBOD-UNQGMJICSA-N Thr-Phe-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(O)=O NWECYMJLJGCBOD-UNQGMJICSA-N 0.000 description 1
- MUAFDCVOHYAFNG-RCWTZXSCSA-N Thr-Pro-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(O)=O MUAFDCVOHYAFNG-RCWTZXSCSA-N 0.000 description 1
- XKWABWFMQXMUMT-HJGDQZAQSA-N Thr-Pro-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O XKWABWFMQXMUMT-HJGDQZAQSA-N 0.000 description 1
- JAJOFWABAUKAEJ-QTKMDUPCSA-N Thr-Pro-His Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N)O JAJOFWABAUKAEJ-QTKMDUPCSA-N 0.000 description 1
- VTMGKRABARCZAX-OSUNSFLBSA-N Thr-Pro-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)[C@@H](C)O VTMGKRABARCZAX-OSUNSFLBSA-N 0.000 description 1
- GVMXJJAJLIEASL-ZJDVBMNYSA-N Thr-Pro-Thr Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O GVMXJJAJLIEASL-ZJDVBMNYSA-N 0.000 description 1
- FWTFAZKJORVTIR-VZFHVOOUSA-N Thr-Ser-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O FWTFAZKJORVTIR-VZFHVOOUSA-N 0.000 description 1
- NBIIPOKZPUGATB-BWBBJGPYSA-N Thr-Ser-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)O)N)O NBIIPOKZPUGATB-BWBBJGPYSA-N 0.000 description 1
- AHERARIZBPOMNU-KATARQTJSA-N Thr-Ser-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O AHERARIZBPOMNU-KATARQTJSA-N 0.000 description 1
- WKGAAMOJPMBBMC-IXOXFDKPSA-N Thr-Ser-Phe Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O WKGAAMOJPMBBMC-IXOXFDKPSA-N 0.000 description 1
- VUXIQSUQQYNLJP-XAVMHZPKSA-N Thr-Ser-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CO)C(=O)N1CCC[C@@H]1C(=O)O)N)O VUXIQSUQQYNLJP-XAVMHZPKSA-N 0.000 description 1
- WPSKTVVMQCXPRO-BWBBJGPYSA-N Thr-Ser-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O WPSKTVVMQCXPRO-BWBBJGPYSA-N 0.000 description 1
- QYDKSNXSBXZPFK-ZJDVBMNYSA-N Thr-Thr-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O QYDKSNXSBXZPFK-ZJDVBMNYSA-N 0.000 description 1
- AAZOYLQUEQRUMZ-GSSVUCPTSA-N Thr-Thr-Asn Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(N)=O AAZOYLQUEQRUMZ-GSSVUCPTSA-N 0.000 description 1
- MFMGPEKYBXFIRF-SUSMZKCASA-N Thr-Thr-Gln Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(O)=O MFMGPEKYBXFIRF-SUSMZKCASA-N 0.000 description 1
- UQCNIMDPYICBTR-KYNKHSRBSA-N Thr-Thr-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O UQCNIMDPYICBTR-KYNKHSRBSA-N 0.000 description 1
- NHQVWACSJZJCGJ-FLBSBUHZSA-N Thr-Thr-Ile Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O NHQVWACSJZJCGJ-FLBSBUHZSA-N 0.000 description 1
- CSNBWOJOEOPYIJ-UVOCVTCTSA-N Thr-Thr-Lys Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(O)=O CSNBWOJOEOPYIJ-UVOCVTCTSA-N 0.000 description 1
- PJCYRZVSACOYSN-ZJDVBMNYSA-N Thr-Thr-Met Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCSC)C(O)=O PJCYRZVSACOYSN-ZJDVBMNYSA-N 0.000 description 1
- ZESGVALRVJIVLZ-VFCFLDTKSA-N Thr-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@@H]1C(=O)O)N)O ZESGVALRVJIVLZ-VFCFLDTKSA-N 0.000 description 1
- CSZFFQBUTMGHAH-UAXMHLISSA-N Thr-Thr-Trp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N)O CSZFFQBUTMGHAH-UAXMHLISSA-N 0.000 description 1
- NLWDSYKZUPRMBJ-IEGACIPQSA-N Thr-Trp-Leu Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CC(C)C)C(=O)O)N)O NLWDSYKZUPRMBJ-IEGACIPQSA-N 0.000 description 1
- LXXCHJKHJYRMIY-FQPOAREZSA-N Thr-Tyr-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C)C(O)=O LXXCHJKHJYRMIY-FQPOAREZSA-N 0.000 description 1
- NJGMALCNYAMYCB-JRQIVUDYSA-N Thr-Tyr-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(O)=O NJGMALCNYAMYCB-JRQIVUDYSA-N 0.000 description 1
- KAJRRNHOVMZYBL-IRIUXVKKSA-N Thr-Tyr-Gln Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O KAJRRNHOVMZYBL-IRIUXVKKSA-N 0.000 description 1
- REJRKTOJTCPDPO-IRIUXVKKSA-N Thr-Tyr-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O REJRKTOJTCPDPO-IRIUXVKKSA-N 0.000 description 1
- PELIQFPESHBTMA-WLTAIBSBSA-N Thr-Tyr-Gly Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CC=C(O)C=C1 PELIQFPESHBTMA-WLTAIBSBSA-N 0.000 description 1
- JAWUQFCGNVEDRN-MEYUZBJRSA-N Thr-Tyr-Leu Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC(C)C)C(=O)O)N)O JAWUQFCGNVEDRN-MEYUZBJRSA-N 0.000 description 1
- CJEHCEOXPLASCK-MEYUZBJRSA-N Thr-Tyr-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)[C@H](O)C)CC1=CC=C(O)C=C1 CJEHCEOXPLASCK-MEYUZBJRSA-N 0.000 description 1
- CYCGARJWIQWPQM-YJRXYDGGSA-N Thr-Tyr-Ser Chemical compound C[C@@H](O)[C@H]([NH3+])C(=O)N[C@H](C(=O)N[C@@H](CO)C([O-])=O)CC1=CC=C(O)C=C1 CYCGARJWIQWPQM-YJRXYDGGSA-N 0.000 description 1
- LVRFMARKDGGZMX-IZPVPAKOSA-N Thr-Tyr-Thr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H]([C@@H](C)O)C(O)=O)CC1=CC=C(O)C=C1 LVRFMARKDGGZMX-IZPVPAKOSA-N 0.000 description 1
- KVEWWQRTAVMOFT-KJEVXHAQSA-N Thr-Tyr-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(O)=O KVEWWQRTAVMOFT-KJEVXHAQSA-N 0.000 description 1
- BKIOKSLLAAZYTC-KKHAAJSZSA-N Thr-Val-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O BKIOKSLLAAZYTC-KKHAAJSZSA-N 0.000 description 1
- FYBFTPLPAXZBOY-KKHAAJSZSA-N Thr-Val-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O FYBFTPLPAXZBOY-KKHAAJSZSA-N 0.000 description 1
- AKHDFZHUPGVFEJ-YEPSODPASA-N Thr-Val-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O AKHDFZHUPGVFEJ-YEPSODPASA-N 0.000 description 1
- ILUOMMDDGREELW-OSUNSFLBSA-N Thr-Val-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)[C@@H](C)O ILUOMMDDGREELW-OSUNSFLBSA-N 0.000 description 1
- PWONLXBUSVIZPH-RHYQMDGZSA-N Thr-Val-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCCN)C(=O)O)N)O PWONLXBUSVIZPH-RHYQMDGZSA-N 0.000 description 1
- SBYQHZCMVSPQCS-RCWTZXSCSA-N Thr-Val-Met Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCSC)C(O)=O SBYQHZCMVSPQCS-RCWTZXSCSA-N 0.000 description 1
- MNYNCKZAEIAONY-XGEHTFHBSA-N Thr-Val-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O MNYNCKZAEIAONY-XGEHTFHBSA-N 0.000 description 1
- KZTLZZQTJMCGIP-ZJDVBMNYSA-N Thr-Val-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KZTLZZQTJMCGIP-ZJDVBMNYSA-N 0.000 description 1
- BTAJAOWZCWOHBU-HSHDSVGOSA-N Thr-Val-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)[C@@H](C)O)C(C)C)C(O)=O)=CNC2=C1 BTAJAOWZCWOHBU-HSHDSVGOSA-N 0.000 description 1
- BPGDJSUFQKWUBK-KJEVXHAQSA-N Thr-Val-Tyr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 BPGDJSUFQKWUBK-KJEVXHAQSA-N 0.000 description 1
- 108700009124 Transcription Initiation Site Proteins 0.000 description 1
- BRBCKMMXKONBAA-KWBADKCTSA-N Trp-Ala-Ala Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O)=CNC2=C1 BRBCKMMXKONBAA-KWBADKCTSA-N 0.000 description 1
- VZBWRZGNEPBRDE-HZUKXOBISA-N Trp-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)N VZBWRZGNEPBRDE-HZUKXOBISA-N 0.000 description 1
- ICNFHVUVCNWUAB-SZMVWBNQSA-N Trp-Arg-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N ICNFHVUVCNWUAB-SZMVWBNQSA-N 0.000 description 1
- IBBBOLAPFHRDHW-BPUTZDHNSA-N Trp-Asn-Arg Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N IBBBOLAPFHRDHW-BPUTZDHNSA-N 0.000 description 1
- GUWJWCHZNGDKBG-UBHSHLNASA-N Trp-Asn-Cys Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CS)C(=O)O)N GUWJWCHZNGDKBG-UBHSHLNASA-N 0.000 description 1
- IXEGQBJZDIRRIV-QEJZJMRPSA-N Trp-Asn-Glu Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O IXEGQBJZDIRRIV-QEJZJMRPSA-N 0.000 description 1
- ADBFWLXCCKIXBQ-XIRDDKMYSA-N Trp-Asn-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N ADBFWLXCCKIXBQ-XIRDDKMYSA-N 0.000 description 1
- UKINEYBQXPMOJO-UBHSHLNASA-N Trp-Asn-Ser Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CO)C(=O)O)N UKINEYBQXPMOJO-UBHSHLNASA-N 0.000 description 1
- VEYXZZGMIBKXCN-UBHSHLNASA-N Trp-Asp-Asp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N VEYXZZGMIBKXCN-UBHSHLNASA-N 0.000 description 1
- OFCKFBGRYHOKFP-IHPCNDPISA-N Trp-Asp-Tyr Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC3=CC=C(C=C3)O)C(=O)O)N OFCKFBGRYHOKFP-IHPCNDPISA-N 0.000 description 1
- HDQJVXVRGJUDML-UBHSHLNASA-N Trp-Cys-Asn Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(=O)N)C(=O)O)N HDQJVXVRGJUDML-UBHSHLNASA-N 0.000 description 1
- LJCLHMPCYYXVPR-VJBMBRPKSA-N Trp-Gln-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC3=CNC4=CC=CC=C43)C(=O)O)N LJCLHMPCYYXVPR-VJBMBRPKSA-N 0.000 description 1
- UDCHKDYNMRJYMI-QEJZJMRPSA-N Trp-Glu-Ser Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O UDCHKDYNMRJYMI-QEJZJMRPSA-N 0.000 description 1
- HNIWONZFMIPCCT-SIXJUCDHSA-N Trp-His-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)N HNIWONZFMIPCCT-SIXJUCDHSA-N 0.000 description 1
- KULBQAVOXHQLIY-HSCHXYMDSA-N Trp-Ile-Leu Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O)=CNC2=C1 KULBQAVOXHQLIY-HSCHXYMDSA-N 0.000 description 1
- OGZRZMJASKKMJZ-XIRDDKMYSA-N Trp-Leu-Asp Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N OGZRZMJASKKMJZ-XIRDDKMYSA-N 0.000 description 1
- CMXACOZDEJYZSK-XIRDDKMYSA-N Trp-Leu-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N CMXACOZDEJYZSK-XIRDDKMYSA-N 0.000 description 1
- UJRIVCPPPMYCNA-HOCLYGCPSA-N Trp-Leu-Gly Chemical compound CC(C)C[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N UJRIVCPPPMYCNA-HOCLYGCPSA-N 0.000 description 1
- WKCFCVBOFKEVKY-HSCHXYMDSA-N Trp-Leu-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N WKCFCVBOFKEVKY-HSCHXYMDSA-N 0.000 description 1
- CCZXBOFIBYQLEV-IHPCNDPISA-N Trp-Leu-Leu Chemical compound CC(C)C[C@H](NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)Cc1c[nH]c2ccccc12)C(O)=O CCZXBOFIBYQLEV-IHPCNDPISA-N 0.000 description 1
- UKWSFUSPGPBJGU-VFAJRCTISA-N Trp-Leu-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N)O UKWSFUSPGPBJGU-VFAJRCTISA-N 0.000 description 1
- WMBFONUKQXGLMU-WDSOQIARSA-N Trp-Leu-Val Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](C(C)C)C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N WMBFONUKQXGLMU-WDSOQIARSA-N 0.000 description 1
- VUMCLPHXCBIJJB-PMVMPFDFSA-N Trp-Phe-His Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)NC(=O)[C@H](CC3=CNC4=CC=CC=C43)N VUMCLPHXCBIJJB-PMVMPFDFSA-N 0.000 description 1
- PWPJLBWYRTVYQS-PMVMPFDFSA-N Trp-Phe-Leu Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O PWPJLBWYRTVYQS-PMVMPFDFSA-N 0.000 description 1
- JZSLIZLZGWOJBJ-PMVMPFDFSA-N Trp-Phe-Lys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)N JZSLIZLZGWOJBJ-PMVMPFDFSA-N 0.000 description 1
- UEFHVUQBYNRNQC-SFJXLCSZSA-N Trp-Phe-Thr Chemical compound C([C@@H](C(=O)N[C@@H]([C@H](O)C)C(O)=O)NC(=O)[C@@H](N)CC=1C2=CC=CC=C2NC=1)C1=CC=CC=C1 UEFHVUQBYNRNQC-SFJXLCSZSA-N 0.000 description 1
- UQHPXCFAHVTWFU-BVSLBCMMSA-N Trp-Phe-Val Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(O)=O UQHPXCFAHVTWFU-BVSLBCMMSA-N 0.000 description 1
- BIBZRFIKOLGWFQ-XIRDDKMYSA-N Trp-Pro-Gln Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CC2=CNC3=CC=CC=C32)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O BIBZRFIKOLGWFQ-XIRDDKMYSA-N 0.000 description 1
- XOLLWQIBBLBAHQ-WDSOQIARSA-N Trp-Pro-Leu Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O XOLLWQIBBLBAHQ-WDSOQIARSA-N 0.000 description 1
- JGLXHHQUSIULAK-OYDLWJJNSA-N Trp-Pro-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H]3CCCN3C(=O)[C@H](CC=3C4=CC=CC=C4NC=3)N)C(O)=O)=CNC2=C1 JGLXHHQUSIULAK-OYDLWJJNSA-N 0.000 description 1
- JEYRCNVVYHTZMY-SZMVWBNQSA-N Trp-Pro-Val Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O JEYRCNVVYHTZMY-SZMVWBNQSA-N 0.000 description 1
- KBKTUNYBNJWFRL-UBHSHLNASA-N Trp-Ser-Asn Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O)=CNC2=C1 KBKTUNYBNJWFRL-UBHSHLNASA-N 0.000 description 1
- VDCGPCSLAJAKBB-XIRDDKMYSA-N Trp-Ser-His Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC3=CN=CN3)C(=O)O)N VDCGPCSLAJAKBB-XIRDDKMYSA-N 0.000 description 1
- GBEAUNVBIMLWIB-IHPCNDPISA-N Trp-Ser-Phe Chemical compound C([C@H](NC(=O)[C@H](CO)NC(=O)[C@H](CC=1C2=CC=CC=C2NC=1)N)C(O)=O)C1=CC=CC=C1 GBEAUNVBIMLWIB-IHPCNDPISA-N 0.000 description 1
- HIZDHWHVOLUGOX-BPUTZDHNSA-N Trp-Ser-Val Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O HIZDHWHVOLUGOX-BPUTZDHNSA-N 0.000 description 1
- HHPSUFUXXBOFQY-AQZXSJQPSA-N Trp-Thr-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N)O HHPSUFUXXBOFQY-AQZXSJQPSA-N 0.000 description 1
- DTPWXZXGFAHEKL-NWLDYVSISA-N Trp-Thr-Glu Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O DTPWXZXGFAHEKL-NWLDYVSISA-N 0.000 description 1
- WBZOZLNLXVBCNW-LTHWPDAASA-N Trp-Thr-Ile Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H]([C@@H](C)CC)C(O)=O)[C@@H](C)O)=CNC2=C1 WBZOZLNLXVBCNW-LTHWPDAASA-N 0.000 description 1
- YXSSXUIBUJGHJY-SFJXLCSZSA-N Trp-Thr-Phe Chemical compound C([C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)CC=1C2=CC=CC=C2NC=1)[C@H](O)C)C(O)=O)C1=CC=CC=C1 YXSSXUIBUJGHJY-SFJXLCSZSA-N 0.000 description 1
- YCEHCFIOIYNQTR-NYVOZVTQSA-N Trp-Trp-Ser Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC3=CNC4=CC=CC=C43)C(=O)N[C@@H](CO)C(=O)O)N YCEHCFIOIYNQTR-NYVOZVTQSA-N 0.000 description 1
- FHHYVSCGOMPLLO-IHPCNDPISA-N Trp-Tyr-Asp Chemical compound C([C@H](NC(=O)[C@H](CC=1C2=CC=CC=C2NC=1)N)C(=O)N[C@@H](CC(O)=O)C(O)=O)C1=CC=C(O)C=C1 FHHYVSCGOMPLLO-IHPCNDPISA-N 0.000 description 1
- ZPZNQAZHMCLTOA-PXDAIIFMSA-N Trp-Tyr-Ile Chemical compound C([C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(O)=O)NC(=O)[C@@H](N)CC=1C2=CC=CC=C2NC=1)C1=CC=C(O)C=C1 ZPZNQAZHMCLTOA-PXDAIIFMSA-N 0.000 description 1
- LNGFWVPNKLWATF-ZVZYQTTQSA-N Trp-Val-Glu Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O LNGFWVPNKLWATF-ZVZYQTTQSA-N 0.000 description 1
- WGBFZZYIWFSYER-BVSLBCMMSA-N Trp-Val-Tyr Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)N WGBFZZYIWFSYER-BVSLBCMMSA-N 0.000 description 1
- BURPTJBFWIOHEY-UWJYBYFXSA-N Tyr-Ala-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 BURPTJBFWIOHEY-UWJYBYFXSA-N 0.000 description 1
- QJBWZNTWJSZUOY-UWJYBYFXSA-N Tyr-Ala-Cys Chemical compound C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N QJBWZNTWJSZUOY-UWJYBYFXSA-N 0.000 description 1
- XLMDWQNAOKLKCP-XDTLVQLUSA-N Tyr-Ala-Gln Chemical compound C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N XLMDWQNAOKLKCP-XDTLVQLUSA-N 0.000 description 1
- KSVMDJJCYKIXTK-IGNZVWTISA-N Tyr-Ala-Tyr Chemical compound C([C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=C(O)C=C1 KSVMDJJCYKIXTK-IGNZVWTISA-N 0.000 description 1
- DXYWRYQRKPIGGU-BPNCWPANSA-N Tyr-Ala-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 DXYWRYQRKPIGGU-BPNCWPANSA-N 0.000 description 1
- WTXQBCCKXIKKHB-JYJNAYRXSA-N Tyr-Arg-Arg Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O WTXQBCCKXIKKHB-JYJNAYRXSA-N 0.000 description 1
- SEFNTZYRPGBDCY-IHRRRGAJSA-N Tyr-Arg-Cys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CS)C(=O)O)N)O SEFNTZYRPGBDCY-IHRRRGAJSA-N 0.000 description 1
- HTHCZRWCFXMENJ-KKUMJFAQSA-N Tyr-Arg-Glu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O HTHCZRWCFXMENJ-KKUMJFAQSA-N 0.000 description 1
- WDIJBEWLXLQQKD-ULQDDVLXSA-N Tyr-Arg-His Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N)O WDIJBEWLXLQQKD-ULQDDVLXSA-N 0.000 description 1
- KDGFPPHLXCEQRN-STECZYCISA-N Tyr-Arg-Ile Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KDGFPPHLXCEQRN-STECZYCISA-N 0.000 description 1
- QYSBJAUCUKHSLU-JYJNAYRXSA-N Tyr-Arg-Val Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(O)=O QYSBJAUCUKHSLU-JYJNAYRXSA-N 0.000 description 1
- PEVVXUGSAKEPEN-AVGNSLFASA-N Tyr-Asn-Glu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O PEVVXUGSAKEPEN-AVGNSLFASA-N 0.000 description 1
- AYHSJESDFKREAR-KKUMJFAQSA-N Tyr-Asn-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 AYHSJESDFKREAR-KKUMJFAQSA-N 0.000 description 1
- NSTPFWRAIDTNGH-BZSNNMDCSA-N Tyr-Asn-Tyr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O NSTPFWRAIDTNGH-BZSNNMDCSA-N 0.000 description 1
- DANHCMVVXDXOHN-SRVKXCTJSA-N Tyr-Asp-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 DANHCMVVXDXOHN-SRVKXCTJSA-N 0.000 description 1
- IXTQGBGHWQEEDE-AVGNSLFASA-N Tyr-Asp-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 IXTQGBGHWQEEDE-AVGNSLFASA-N 0.000 description 1
- QNJYPWZACBACER-KKUMJFAQSA-N Tyr-Asp-His Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N)O QNJYPWZACBACER-KKUMJFAQSA-N 0.000 description 1
- JFDGVHXRCKEBAU-KKUMJFAQSA-N Tyr-Asp-Lys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N)O JFDGVHXRCKEBAU-KKUMJFAQSA-N 0.000 description 1
- VFJIWSJKZJTQII-SRVKXCTJSA-N Tyr-Asp-Ser Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O VFJIWSJKZJTQII-SRVKXCTJSA-N 0.000 description 1
- TZXFLDNBYYGLKA-BZSNNMDCSA-N Tyr-Asp-Tyr Chemical compound C([C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=C(O)C=C1 TZXFLDNBYYGLKA-BZSNNMDCSA-N 0.000 description 1
- UABYBEBXFFNCIR-YDHLFZDLSA-N Tyr-Asp-Val Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O UABYBEBXFFNCIR-YDHLFZDLSA-N 0.000 description 1
- CGDZGRLRXPNCOC-SRVKXCTJSA-N Tyr-Cys-Cys Chemical compound SC[C@@H](C(O)=O)NC(=O)[C@H](CS)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 CGDZGRLRXPNCOC-SRVKXCTJSA-N 0.000 description 1
- GHUNBABNQPIETG-MELADBBJSA-N Tyr-Cys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CS)NC(=O)[C@H](CC2=CC=C(C=C2)O)N)C(=O)O GHUNBABNQPIETG-MELADBBJSA-N 0.000 description 1
- ZAGPDPNPWYPEIR-SRVKXCTJSA-N Tyr-Cys-Ser Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CS)C(=O)N[C@@H](CO)C(O)=O ZAGPDPNPWYPEIR-SRVKXCTJSA-N 0.000 description 1
- QHEGAOPHISYNDF-XDTLVQLUSA-N Tyr-Gln-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC1=CC=C(C=C1)O)N QHEGAOPHISYNDF-XDTLVQLUSA-N 0.000 description 1
- CRHFOYCJGVJPLE-AVGNSLFASA-N Tyr-Gln-Asn Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC(=O)N)C(=O)O)N)O CRHFOYCJGVJPLE-AVGNSLFASA-N 0.000 description 1
- IYHNBRUWVBIVJR-IHRRRGAJSA-N Tyr-Gln-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 IYHNBRUWVBIVJR-IHRRRGAJSA-N 0.000 description 1
- RIJPHPUJRLEOAK-JYJNAYRXSA-N Tyr-Gln-His Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N)O RIJPHPUJRLEOAK-JYJNAYRXSA-N 0.000 description 1
- CKHQKYHIZCRTAP-SOUVJXGZSA-N Tyr-Gln-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC2=CC=C(C=C2)O)N)C(=O)O CKHQKYHIZCRTAP-SOUVJXGZSA-N 0.000 description 1
- MPKPIWFFDWVJGC-IRIUXVKKSA-N Tyr-Gln-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC1=CC=C(C=C1)O)N)O MPKPIWFFDWVJGC-IRIUXVKKSA-N 0.000 description 1
- PDKILSUYSUGCAO-JBACZVJFSA-N Tyr-Gln-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC3=CC=C(C=C3)O)N PDKILSUYSUGCAO-JBACZVJFSA-N 0.000 description 1
- NQJDICVXXIMMMB-XDTLVQLUSA-N Tyr-Glu-Ala Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O NQJDICVXXIMMMB-XDTLVQLUSA-N 0.000 description 1
- XQYHLZNPOTXRMQ-KKUMJFAQSA-N Tyr-Glu-Arg Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O XQYHLZNPOTXRMQ-KKUMJFAQSA-N 0.000 description 1
- HKYTWJOWZTWBQB-AVGNSLFASA-N Tyr-Glu-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 HKYTWJOWZTWBQB-AVGNSLFASA-N 0.000 description 1
- IWRMTNJCCMEBEX-AVGNSLFASA-N Tyr-Glu-Cys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CS)C(=O)O)N)O IWRMTNJCCMEBEX-AVGNSLFASA-N 0.000 description 1
- WAPFQMXRSDEGOE-IHRRRGAJSA-N Tyr-Glu-Gln Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O WAPFQMXRSDEGOE-IHRRRGAJSA-N 0.000 description 1
- HVHJYXDXRIWELT-RYUDHWBXSA-N Tyr-Glu-Gly Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O HVHJYXDXRIWELT-RYUDHWBXSA-N 0.000 description 1
- HDSKHCBAVVWPCQ-FHWLQOOXSA-N Tyr-Glu-Phe Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O HDSKHCBAVVWPCQ-FHWLQOOXSA-N 0.000 description 1
- LHTGRUZSZOIAKM-SOUVJXGZSA-N Tyr-Glu-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)N)C(=O)O LHTGRUZSZOIAKM-SOUVJXGZSA-N 0.000 description 1
- UNUZEBFXGWVAOP-DZKIICNBSA-N Tyr-Glu-Val Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O UNUZEBFXGWVAOP-DZKIICNBSA-N 0.000 description 1
- JWGXUKHIKXZWNG-RYUDHWBXSA-N Tyr-Gly-Gln Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)NCC(=O)N[C@@H](CCC(=O)N)C(=O)O)N)O JWGXUKHIKXZWNG-RYUDHWBXSA-N 0.000 description 1
- KCPFDGNYAMKZQP-KBPBESRZSA-N Tyr-Gly-Leu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O KCPFDGNYAMKZQP-KBPBESRZSA-N 0.000 description 1
- JKUZFODWJGEQAP-KBPBESRZSA-N Tyr-Gly-Lys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)NCC(=O)N[C@@H](CCCCN)C(=O)O)N)O JKUZFODWJGEQAP-KBPBESRZSA-N 0.000 description 1
- FBHBVXUBTYVCRU-BZSNNMDCSA-N Tyr-His-Leu Chemical compound C([C@@H](C(=O)N[C@@H](CC(C)C)C(O)=O)NC(=O)[C@@H](N)CC=1C=CC(O)=CC=1)C1=CN=CN1 FBHBVXUBTYVCRU-BZSNNMDCSA-N 0.000 description 1
- WVGKPKDWYQXWLU-BZSNNMDCSA-N Tyr-His-Lys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)N[C@@H](CCCCN)C(=O)O)N)O WVGKPKDWYQXWLU-BZSNNMDCSA-N 0.000 description 1
- ARSHSYUZHSIYKR-ACRUOGEOSA-N Tyr-His-Phe Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O ARSHSYUZHSIYKR-ACRUOGEOSA-N 0.000 description 1
- STTVVMWQKDOKAM-YESZJQIVSA-N Tyr-His-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CN=CN2)NC(=O)[C@H](CC3=CC=C(C=C3)O)N)C(=O)O STTVVMWQKDOKAM-YESZJQIVSA-N 0.000 description 1
- USYGMBIIUDLYHJ-GVARAGBVSA-N Tyr-Ile-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 USYGMBIIUDLYHJ-GVARAGBVSA-N 0.000 description 1
- PJWCWGXAVIVXQC-STECZYCISA-N Tyr-Ile-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 PJWCWGXAVIVXQC-STECZYCISA-N 0.000 description 1
- NXRGXTBPMOGFID-CFMVVWHZSA-N Tyr-Ile-Asn Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O NXRGXTBPMOGFID-CFMVVWHZSA-N 0.000 description 1
- WSFXJLFSJSXGMQ-MGHWNKPDSA-N Tyr-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N WSFXJLFSJSXGMQ-MGHWNKPDSA-N 0.000 description 1
- AVIQBBOOTZENLH-KKUMJFAQSA-N Tyr-Leu-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N AVIQBBOOTZENLH-KKUMJFAQSA-N 0.000 description 1
- QHLIUFUEUDFAOT-MGHWNKPDSA-N Tyr-Leu-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC1=CC=C(C=C1)O)N QHLIUFUEUDFAOT-MGHWNKPDSA-N 0.000 description 1
- WDGDKHLSDIOXQC-ACRUOGEOSA-N Tyr-Leu-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=C(O)C=C1 WDGDKHLSDIOXQC-ACRUOGEOSA-N 0.000 description 1
- NSGZILIDHCIZAM-KKUMJFAQSA-N Tyr-Leu-Ser Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N NSGZILIDHCIZAM-KKUMJFAQSA-N 0.000 description 1
- JAGGEZACYAAMIL-CQDKDKBSSA-N Tyr-Lys-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC1=CC=C(C=C1)O)N JAGGEZACYAAMIL-CQDKDKBSSA-N 0.000 description 1
- WOAQYWUEUYMVGK-ULQDDVLXSA-N Tyr-Lys-Arg Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O WOAQYWUEUYMVGK-ULQDDVLXSA-N 0.000 description 1
- GITNQBVCEQBDQC-KKUMJFAQSA-N Tyr-Lys-Asn Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O GITNQBVCEQBDQC-KKUMJFAQSA-N 0.000 description 1
- JLKVWTICWVWGSK-JYJNAYRXSA-N Tyr-Lys-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 JLKVWTICWVWGSK-JYJNAYRXSA-N 0.000 description 1
- MXFPBNFKVBHIRW-BZSNNMDCSA-N Tyr-Lys-His Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N)O MXFPBNFKVBHIRW-BZSNNMDCSA-N 0.000 description 1
- GYKDRHDMGQUZPU-MGHWNKPDSA-N Tyr-Lys-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC1=CC=C(C=C1)O)N GYKDRHDMGQUZPU-MGHWNKPDSA-N 0.000 description 1
- ZOBLBMGJKVJVEV-BZSNNMDCSA-N Tyr-Lys-Lys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)O)N)O ZOBLBMGJKVJVEV-BZSNNMDCSA-N 0.000 description 1
- SINRIKQYQJRGDQ-MEYUZBJRSA-N Tyr-Lys-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 SINRIKQYQJRGDQ-MEYUZBJRSA-N 0.000 description 1
- YSGAPESOXHFTQY-IHRRRGAJSA-N Tyr-Met-Asp Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N YSGAPESOXHFTQY-IHRRRGAJSA-N 0.000 description 1
- QPBJXNYYQTUTDD-KKUMJFAQSA-N Tyr-Met-Gln Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N QPBJXNYYQTUTDD-KKUMJFAQSA-N 0.000 description 1
- HNERGSKJJZQGEA-JYJNAYRXSA-N Tyr-Met-Met Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N HNERGSKJJZQGEA-JYJNAYRXSA-N 0.000 description 1
- WTTRJMAZPDHPGS-KKXDTOCCSA-N Tyr-Phe-Ala Chemical compound C[C@H](NC(=O)[C@H](Cc1ccccc1)NC(=O)[C@@H](N)Cc1ccc(O)cc1)C(O)=O WTTRJMAZPDHPGS-KKXDTOCCSA-N 0.000 description 1
- FDKDGFGTHGJKNV-FHWLQOOXSA-N Tyr-Phe-Gln Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)N FDKDGFGTHGJKNV-FHWLQOOXSA-N 0.000 description 1
- PSALWJCUIAQKFW-ACRUOGEOSA-N Tyr-Phe-Lys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)N PSALWJCUIAQKFW-ACRUOGEOSA-N 0.000 description 1
- FGVFBDZSGQTYQX-UFYCRDLUSA-N Tyr-Phe-Val Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(O)=O FGVFBDZSGQTYQX-UFYCRDLUSA-N 0.000 description 1
- XJPXTYLVMUZGNW-IHRRRGAJSA-N Tyr-Pro-Asp Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O XJPXTYLVMUZGNW-IHRRRGAJSA-N 0.000 description 1
- SOEGLGLDSUHWTI-STECZYCISA-N Tyr-Pro-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CC1=CC=C(O)C=C1 SOEGLGLDSUHWTI-STECZYCISA-N 0.000 description 1
- BIWVVOHTKDLRMP-ULQDDVLXSA-N Tyr-Pro-Leu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O BIWVVOHTKDLRMP-ULQDDVLXSA-N 0.000 description 1
- RWOKVQUCENPXGE-IHRRRGAJSA-N Tyr-Ser-Arg Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O RWOKVQUCENPXGE-IHRRRGAJSA-N 0.000 description 1
- SOAUMCDLIUGXJJ-SRVKXCTJSA-N Tyr-Ser-Asn Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O SOAUMCDLIUGXJJ-SRVKXCTJSA-N 0.000 description 1
- KWKJGBHDYJOVCR-SRVKXCTJSA-N Tyr-Ser-Cys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)O)N)O KWKJGBHDYJOVCR-SRVKXCTJSA-N 0.000 description 1
- IEWKKXZRJLTIOV-AVGNSLFASA-N Tyr-Ser-Gln Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(O)=O IEWKKXZRJLTIOV-AVGNSLFASA-N 0.000 description 1
- BCOBSVIZMQXKFY-KKUMJFAQSA-N Tyr-Ser-His Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N)O BCOBSVIZMQXKFY-KKUMJFAQSA-N 0.000 description 1
- SYFHQHYTNCQCCN-MELADBBJSA-N Tyr-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CC2=CC=C(C=C2)O)N)C(=O)O SYFHQHYTNCQCCN-MELADBBJSA-N 0.000 description 1
- UMSZZGTXGKHTFJ-SRVKXCTJSA-N Tyr-Ser-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 UMSZZGTXGKHTFJ-SRVKXCTJSA-N 0.000 description 1
- LVFZXRQQQDTBQH-IRIUXVKKSA-N Tyr-Thr-Glu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O LVFZXRQQQDTBQH-IRIUXVKKSA-N 0.000 description 1
- ZZDYJFVIKVSUFA-WLTAIBSBSA-N Tyr-Thr-Gly Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O ZZDYJFVIKVSUFA-WLTAIBSBSA-N 0.000 description 1
- PWKMJDQXKCENMF-MEYUZBJRSA-N Tyr-Thr-Leu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O PWKMJDQXKCENMF-MEYUZBJRSA-N 0.000 description 1
- WQOHKVRQDLNDIL-YJRXYDGGSA-N Tyr-Thr-Ser Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O WQOHKVRQDLNDIL-YJRXYDGGSA-N 0.000 description 1
- AKKYBQGHUAWPJR-MNSWYVGCSA-N Tyr-Thr-Trp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CC3=CC=C(C=C3)O)N)O AKKYBQGHUAWPJR-MNSWYVGCSA-N 0.000 description 1
- JHDZONWZTCKTJR-KJEVXHAQSA-N Tyr-Thr-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 JHDZONWZTCKTJR-KJEVXHAQSA-N 0.000 description 1
- NUQZCPSZHGIYTA-HKUYNNGSSA-N Tyr-Trp-Gly Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CC3=CC=C(C=C3)O)N NUQZCPSZHGIYTA-HKUYNNGSSA-N 0.000 description 1
- QRCBQDPRKMYTMB-IHPCNDPISA-N Tyr-Trp-Ser Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC3=CC=C(C=C3)O)N QRCBQDPRKMYTMB-IHPCNDPISA-N 0.000 description 1
- ANHVRCNNGJMJNG-BZSNNMDCSA-N Tyr-Tyr-Cys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)N[C@@H](CS)C(=O)O)N)O ANHVRCNNGJMJNG-BZSNNMDCSA-N 0.000 description 1
- DJSYPCWZPNHQQE-FHWLQOOXSA-N Tyr-Tyr-Gln Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)N[C@@H](CCC(N)=O)C(O)=O)C1=CC=C(O)C=C1 DJSYPCWZPNHQQE-FHWLQOOXSA-N 0.000 description 1
- AFWXOGHZEKARFH-ACRUOGEOSA-N Tyr-Tyr-His Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)N[C@@H](CC=1NC=NC=1)C(O)=O)C1=CC=C(O)C=C1 AFWXOGHZEKARFH-ACRUOGEOSA-N 0.000 description 1
- QVYFTFIBKCDHIE-ACRUOGEOSA-N Tyr-Tyr-Lys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)N[C@@H](CCCCN)C(=O)O)N)O QVYFTFIBKCDHIE-ACRUOGEOSA-N 0.000 description 1
- AGDDLOQMXUQPDY-BZSNNMDCSA-N Tyr-Tyr-Ser Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(O)=O AGDDLOQMXUQPDY-BZSNNMDCSA-N 0.000 description 1
- RGJZPXFZIUUQDN-BPNCWPANSA-N Tyr-Val-Ala Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O RGJZPXFZIUUQDN-BPNCWPANSA-N 0.000 description 1
- MJUTYRIMFIICKL-JYJNAYRXSA-N Tyr-Val-Arg Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O MJUTYRIMFIICKL-JYJNAYRXSA-N 0.000 description 1
- KLOZTPOXVVRVAQ-DZKIICNBSA-N Tyr-Val-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 KLOZTPOXVVRVAQ-DZKIICNBSA-N 0.000 description 1
- HZWPGKAKGYJWCI-ULQDDVLXSA-N Tyr-Val-Leu Chemical compound CC(C)C[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)Cc1ccc(O)cc1)C(C)C)C(O)=O HZWPGKAKGYJWCI-ULQDDVLXSA-N 0.000 description 1
- SMUWZUSWMWVOSL-JYJNAYRXSA-N Tyr-Val-Met Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N SMUWZUSWMWVOSL-JYJNAYRXSA-N 0.000 description 1
- NVJCMGGZHOJNBU-UFYCRDLUSA-N Tyr-Val-Phe Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)N NVJCMGGZHOJNBU-UFYCRDLUSA-N 0.000 description 1
- YKBUNNNRNZZUID-UFYCRDLUSA-N Tyr-Val-Tyr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O YKBUNNNRNZZUID-UFYCRDLUSA-N 0.000 description 1
- DDRBQONWVBDQOY-GUBZILKMSA-N Val-Ala-Arg Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O DDRBQONWVBDQOY-GUBZILKMSA-N 0.000 description 1
- UEOOXDLMQZBPFR-ZKWXMUAHSA-N Val-Ala-Asn Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](C(C)C)N UEOOXDLMQZBPFR-ZKWXMUAHSA-N 0.000 description 1
- FZSPNKUFROZBSG-ZKWXMUAHSA-N Val-Ala-Asp Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC(O)=O FZSPNKUFROZBSG-ZKWXMUAHSA-N 0.000 description 1
- YFOCMOVJBQDBCE-NRPADANISA-N Val-Ala-Glu Chemical compound C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N YFOCMOVJBQDBCE-NRPADANISA-N 0.000 description 1
- RUCNAYOMFXRIKJ-DCAQKATOSA-N Val-Ala-Lys Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCCN RUCNAYOMFXRIKJ-DCAQKATOSA-N 0.000 description 1
- SMKXLHVZIFKQRB-GUBZILKMSA-N Val-Ala-Met Chemical compound C[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](C(C)C)N SMKXLHVZIFKQRB-GUBZILKMSA-N 0.000 description 1
- AZSHAZJLOZQYAY-FXQIFTODSA-N Val-Ala-Ser Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O AZSHAZJLOZQYAY-FXQIFTODSA-N 0.000 description 1
- SLLKXDSRVAOREO-KZVJFYERSA-N Val-Ala-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](C)NC(=O)[C@H](C(C)C)N)O SLLKXDSRVAOREO-KZVJFYERSA-N 0.000 description 1
- NMANTMWGQZASQN-QXEWZRGKSA-N Val-Arg-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N NMANTMWGQZASQN-QXEWZRGKSA-N 0.000 description 1
- JIODCDXKCJRMEH-NHCYSSNCSA-N Val-Arg-Gln Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N JIODCDXKCJRMEH-NHCYSSNCSA-N 0.000 description 1
- KKHRWGYHBZORMQ-NHCYSSNCSA-N Val-Arg-Glu Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N KKHRWGYHBZORMQ-NHCYSSNCSA-N 0.000 description 1
- COYSIHFOCOMGCF-WPRPVWTQSA-N Val-Arg-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CCCN=C(N)N COYSIHFOCOMGCF-WPRPVWTQSA-N 0.000 description 1
- COYSIHFOCOMGCF-UHFFFAOYSA-N Val-Arg-Gly Natural products CC(C)C(N)C(=O)NC(C(=O)NCC(O)=O)CCCN=C(N)N COYSIHFOCOMGCF-UHFFFAOYSA-N 0.000 description 1
- JYVKKBDANPZIAW-AVGNSLFASA-N Val-Arg-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](C(C)C)N JYVKKBDANPZIAW-AVGNSLFASA-N 0.000 description 1
- GNWUWQAVVJQREM-NHCYSSNCSA-N Val-Asn-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N GNWUWQAVVJQREM-NHCYSSNCSA-N 0.000 description 1
- LIQJSDDOULTANC-QSFUFRPTSA-N Val-Asn-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](C(C)C)N LIQJSDDOULTANC-QSFUFRPTSA-N 0.000 description 1
- LNYOXPDEIZJDEI-NHCYSSNCSA-N Val-Asn-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](C(C)C)N LNYOXPDEIZJDEI-NHCYSSNCSA-N 0.000 description 1
- JLFKWDAZBRYCGX-ZKWXMUAHSA-N Val-Asn-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CO)C(=O)O)N JLFKWDAZBRYCGX-ZKWXMUAHSA-N 0.000 description 1
- NMPXRFYMZDIBRF-ZOBUZTSGSA-N Val-Asn-Trp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N NMPXRFYMZDIBRF-ZOBUZTSGSA-N 0.000 description 1
- DBOXBUDEAJVKRE-LSJOCFKGSA-N Val-Asn-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](C(C)C)C(=O)O)N DBOXBUDEAJVKRE-LSJOCFKGSA-N 0.000 description 1
- XQVRMLRMTAGSFJ-QXEWZRGKSA-N Val-Asp-Arg Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N XQVRMLRMTAGSFJ-QXEWZRGKSA-N 0.000 description 1
- CGGVNFJRZJUVAE-BYULHYEWSA-N Val-Asp-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N CGGVNFJRZJUVAE-BYULHYEWSA-N 0.000 description 1
- DDNIHOWRDOXXPF-NGZCFLSTSA-N Val-Asp-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N DDNIHOWRDOXXPF-NGZCFLSTSA-N 0.000 description 1
- SCBITHMBEJNRHC-LSJOCFKGSA-N Val-Asp-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](C(C)C)C(=O)O)N SCBITHMBEJNRHC-LSJOCFKGSA-N 0.000 description 1
- XIFAHCUNWWKUDE-DCAQKATOSA-N Val-Cys-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CCCCN)C(=O)O)N XIFAHCUNWWKUDE-DCAQKATOSA-N 0.000 description 1
- CFSSLXZJEMERJY-NRPADANISA-N Val-Gln-Ala Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O CFSSLXZJEMERJY-NRPADANISA-N 0.000 description 1
- XTAUQCGQFJQGEJ-NHCYSSNCSA-N Val-Gln-Arg Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N XTAUQCGQFJQGEJ-NHCYSSNCSA-N 0.000 description 1
- QHFQQRKNGCXTHL-AUTRQRHGSA-N Val-Gln-Glu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O QHFQQRKNGCXTHL-AUTRQRHGSA-N 0.000 description 1
- IWZYXFRGWKEKBJ-GVXVVHGQSA-N Val-Gln-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N IWZYXFRGWKEKBJ-GVXVVHGQSA-N 0.000 description 1
- CPTQYHDSVGVGDZ-UKJIMTQDSA-N Val-Gln-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](C(C)C)N CPTQYHDSVGVGDZ-UKJIMTQDSA-N 0.000 description 1
- VFOHXOLPLACADK-GVXVVHGQSA-N Val-Gln-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](C(C)C)N VFOHXOLPLACADK-GVXVVHGQSA-N 0.000 description 1
- ZEVNVXYRZRIRCH-GVXVVHGQSA-N Val-Gln-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N ZEVNVXYRZRIRCH-GVXVVHGQSA-N 0.000 description 1
- PWRITNSESKQTPW-NRPADANISA-N Val-Gln-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CO)C(=O)O)N PWRITNSESKQTPW-NRPADANISA-N 0.000 description 1
- UZDHNIJRRTUKKC-DLOVCJGASA-N Val-Gln-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](C(C)C)C(=O)O)N UZDHNIJRRTUKKC-DLOVCJGASA-N 0.000 description 1
- VLDMQVZZWDOKQF-AUTRQRHGSA-N Val-Glu-Gln Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N VLDMQVZZWDOKQF-AUTRQRHGSA-N 0.000 description 1
- SZTTYWIUCGSURQ-AUTRQRHGSA-N Val-Glu-Glu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O SZTTYWIUCGSURQ-AUTRQRHGSA-N 0.000 description 1
- YDPFWRVQHFWBKI-GVXVVHGQSA-N Val-Glu-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N YDPFWRVQHFWBKI-GVXVVHGQSA-N 0.000 description 1
- FOADDSDHGRFUOC-DZKIICNBSA-N Val-Glu-Phe Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N FOADDSDHGRFUOC-DZKIICNBSA-N 0.000 description 1
- CELJCNRXKZPTCX-XPUUQOCRSA-N Val-Gly-Ala Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@@H](C)C(O)=O CELJCNRXKZPTCX-XPUUQOCRSA-N 0.000 description 1
- NXRAUQGGHPCJIB-RCOVLWMOSA-N Val-Gly-Asn Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O NXRAUQGGHPCJIB-RCOVLWMOSA-N 0.000 description 1
- OXGVAUFVTOPFFA-XPUUQOCRSA-N Val-Gly-Cys Chemical compound CC(C)[C@@H](C(=O)NCC(=O)N[C@@H](CS)C(=O)O)N OXGVAUFVTOPFFA-XPUUQOCRSA-N 0.000 description 1
- BEGDZYNDCNEGJZ-XVKPBYJWSA-N Val-Gly-Gln Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(N)=O BEGDZYNDCNEGJZ-XVKPBYJWSA-N 0.000 description 1
- LAYSXAOGWHKNED-XPUUQOCRSA-N Val-Gly-Ser Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O LAYSXAOGWHKNED-XPUUQOCRSA-N 0.000 description 1
- KZKMBGXCNLPYKD-YEPSODPASA-N Val-Gly-Thr Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O KZKMBGXCNLPYKD-YEPSODPASA-N 0.000 description 1
- BVWPHWLFGRCECJ-JSGCOSHPSA-N Val-Gly-Tyr Chemical compound CC(C)[C@@H](C(=O)NCC(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N BVWPHWLFGRCECJ-JSGCOSHPSA-N 0.000 description 1
- XXROXFHCMVXETG-UWVGGRQHSA-N Val-Gly-Val Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O XXROXFHCMVXETG-UWVGGRQHSA-N 0.000 description 1
- KDKLLPMFFGYQJD-CYDGBPFRSA-N Val-Ile-Arg Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](C(C)C)N KDKLLPMFFGYQJD-CYDGBPFRSA-N 0.000 description 1
- WNZSAUMKZQXHNC-UKJIMTQDSA-N Val-Ile-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](C(C)C)N WNZSAUMKZQXHNC-UKJIMTQDSA-N 0.000 description 1
- VXDSPJJQUQDCKH-UKJIMTQDSA-N Val-Ile-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N VXDSPJJQUQDCKH-UKJIMTQDSA-N 0.000 description 1
- FTKXYXACXYOHND-XUXIUFHCSA-N Val-Ile-Leu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O FTKXYXACXYOHND-XUXIUFHCSA-N 0.000 description 1
- JZWZACGUZVCQPS-RNJOBUHISA-N Val-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](C(C)C)N JZWZACGUZVCQPS-RNJOBUHISA-N 0.000 description 1
- APQIVBCUIUDSMB-OSUNSFLBSA-N Val-Ile-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](C(C)C)N APQIVBCUIUDSMB-OSUNSFLBSA-N 0.000 description 1
- MYLNLEIZWHVENT-VKOGCVSHSA-N Val-Ile-Trp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](C(C)C)N MYLNLEIZWHVENT-VKOGCVSHSA-N 0.000 description 1
- DJQIUOKSNRBTSV-CYDGBPFRSA-N Val-Ile-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)O)NC(=O)[C@H](C(C)C)N DJQIUOKSNRBTSV-CYDGBPFRSA-N 0.000 description 1
- AEMPCGRFEZTWIF-IHRRRGAJSA-N Val-Leu-Lys Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(O)=O AEMPCGRFEZTWIF-IHRRRGAJSA-N 0.000 description 1
- ZZGPVSZDZQRJQY-ULQDDVLXSA-N Val-Leu-Phe Chemical compound CC(C)C[C@H](NC(=O)[C@@H](N)C(C)C)C(=O)N[C@@H](Cc1ccccc1)C(O)=O ZZGPVSZDZQRJQY-ULQDDVLXSA-N 0.000 description 1
- RFKJNTRMXGCKFE-FHWLQOOXSA-N Val-Leu-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)C(C)C)CC(C)C)C(O)=O)=CNC2=C1 RFKJNTRMXGCKFE-FHWLQOOXSA-N 0.000 description 1
- WDIWOIRFNMLNKO-ULQDDVLXSA-N Val-Leu-Tyr Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 WDIWOIRFNMLNKO-ULQDDVLXSA-N 0.000 description 1
- XXWBHOWRARMUOC-NHCYSSNCSA-N Val-Lys-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(=O)N)C(=O)O)N XXWBHOWRARMUOC-NHCYSSNCSA-N 0.000 description 1
- MLADEWAIYAPAAU-IHRRRGAJSA-N Val-Lys-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N MLADEWAIYAPAAU-IHRRRGAJSA-N 0.000 description 1
- XPKCFQZDQGVJCX-RHYQMDGZSA-N Val-Lys-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](C(C)C)N)O XPKCFQZDQGVJCX-RHYQMDGZSA-N 0.000 description 1
- PHZGFLFMGLXCFG-FHWLQOOXSA-N Val-Lys-Trp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N PHZGFLFMGLXCFG-FHWLQOOXSA-N 0.000 description 1
- SBJCTAZFSZXWSR-AVGNSLFASA-N Val-Met-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N SBJCTAZFSZXWSR-AVGNSLFASA-N 0.000 description 1
- RSGHLMMKXJGCMK-JYJNAYRXSA-N Val-Met-Phe Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N RSGHLMMKXJGCMK-JYJNAYRXSA-N 0.000 description 1
- YDVDTCJGBBJGRT-GUBZILKMSA-N Val-Met-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CO)C(=O)O)N YDVDTCJGBBJGRT-GUBZILKMSA-N 0.000 description 1
- QPPZEDOTPZOSEC-RCWTZXSCSA-N Val-Met-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](C(C)C)N)O QPPZEDOTPZOSEC-RCWTZXSCSA-N 0.000 description 1
- MJFSRZZJQWZHFQ-SRVKXCTJSA-N Val-Met-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C(C)C)C(=O)O)N MJFSRZZJQWZHFQ-SRVKXCTJSA-N 0.000 description 1
- LJSZPMSUYKKKCP-UBHSHLNASA-N Val-Phe-Ala Chemical compound CC(C)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C)C(O)=O)CC1=CC=CC=C1 LJSZPMSUYKKKCP-UBHSHLNASA-N 0.000 description 1
- UZFNHAXYMICTBU-DZKIICNBSA-N Val-Phe-Gln Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N UZFNHAXYMICTBU-DZKIICNBSA-N 0.000 description 1
- CKTMJBPRVQWPHU-JSGCOSHPSA-N Val-Phe-Gly Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)NCC(=O)O)N CKTMJBPRVQWPHU-JSGCOSHPSA-N 0.000 description 1
- KISFXYYRKKNLOP-IHRRRGAJSA-N Val-Phe-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)O)N KISFXYYRKKNLOP-IHRRRGAJSA-N 0.000 description 1
- YKNOJPJWNVHORX-UNQGMJICSA-N Val-Phe-Thr Chemical compound CC(C)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H]([C@@H](C)O)C(O)=O)CC1=CC=CC=C1 YKNOJPJWNVHORX-UNQGMJICSA-N 0.000 description 1
- MJOUSKQHAIARKI-JYJNAYRXSA-N Val-Phe-Val Chemical compound CC(C)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C(C)C)C(O)=O)CC1=CC=CC=C1 MJOUSKQHAIARKI-JYJNAYRXSA-N 0.000 description 1
- LGXUZJIQCGXKGZ-QXEWZRGKSA-N Val-Pro-Asn Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(=O)N)C(=O)O)N LGXUZJIQCGXKGZ-QXEWZRGKSA-N 0.000 description 1
- GQMNEJMFMCJJTD-NHCYSSNCSA-N Val-Pro-Gln Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O GQMNEJMFMCJJTD-NHCYSSNCSA-N 0.000 description 1
- RYQUMYBMOJYYDK-NHCYSSNCSA-N Val-Pro-Glu Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(=O)O)C(=O)O)N RYQUMYBMOJYYDK-NHCYSSNCSA-N 0.000 description 1
- VSCIANXXVZOYOC-AVGNSLFASA-N Val-Pro-His Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N VSCIANXXVZOYOC-AVGNSLFASA-N 0.000 description 1
- USLVEJAHTBLSIL-CYDGBPFRSA-N Val-Pro-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)C(C)C USLVEJAHTBLSIL-CYDGBPFRSA-N 0.000 description 1
- BGXVHVMJZCSOCA-AVGNSLFASA-N Val-Pro-Lys Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)O)N BGXVHVMJZCSOCA-AVGNSLFASA-N 0.000 description 1
- QIVPZSWBBHRNBA-JYJNAYRXSA-N Val-Pro-Phe Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](Cc1ccccc1)C(O)=O QIVPZSWBBHRNBA-JYJNAYRXSA-N 0.000 description 1
- MIKHIIQMRFYVOR-RCWTZXSCSA-N Val-Pro-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](C(C)C)N)O MIKHIIQMRFYVOR-RCWTZXSCSA-N 0.000 description 1
- DEGUERSKQBRZMZ-FXQIFTODSA-N Val-Ser-Ala Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O DEGUERSKQBRZMZ-FXQIFTODSA-N 0.000 description 1
- AJNUKMZFHXUBMK-GUBZILKMSA-N Val-Ser-Arg Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N AJNUKMZFHXUBMK-GUBZILKMSA-N 0.000 description 1
- JQTYTBPCSOAZHI-FXQIFTODSA-N Val-Ser-Cys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)O)N JQTYTBPCSOAZHI-FXQIFTODSA-N 0.000 description 1
- VIKZGAUAKQZDOF-NRPADANISA-N Val-Ser-Glu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O VIKZGAUAKQZDOF-NRPADANISA-N 0.000 description 1
- UGFMVXRXULGLNO-XPUUQOCRSA-N Val-Ser-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O UGFMVXRXULGLNO-XPUUQOCRSA-N 0.000 description 1
- KRAHMIJVUPUOTQ-DCAQKATOSA-N Val-Ser-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N KRAHMIJVUPUOTQ-DCAQKATOSA-N 0.000 description 1
- DLLRRUDLMSJTMB-GUBZILKMSA-N Val-Ser-Met Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCSC)C(=O)O)N DLLRRUDLMSJTMB-GUBZILKMSA-N 0.000 description 1
- HWNYVQMOLCYHEA-IHRRRGAJSA-N Val-Ser-Tyr Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N HWNYVQMOLCYHEA-IHRRRGAJSA-N 0.000 description 1
- UVHFONIHVHLDDQ-IFFSRLJSSA-N Val-Thr-Glu Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N)O UVHFONIHVHLDDQ-IFFSRLJSSA-N 0.000 description 1
- PDDJTOSAVNRJRH-UNQGMJICSA-N Val-Thr-Phe Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](C(C)C)N)O PDDJTOSAVNRJRH-UNQGMJICSA-N 0.000 description 1
- DVLWZWNAQUBZBC-ZNSHCXBVSA-N Val-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](C(C)C)N)O DVLWZWNAQUBZBC-ZNSHCXBVSA-N 0.000 description 1
- OFTXTCGQJXTNQS-XGEHTFHBSA-N Val-Thr-Ser Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](C(C)C)N)O OFTXTCGQJXTNQS-XGEHTFHBSA-N 0.000 description 1
- YLBNZCJFSVJDRJ-KJEVXHAQSA-N Val-Thr-Tyr Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](Cc1ccc(O)cc1)C(O)=O YLBNZCJFSVJDRJ-KJEVXHAQSA-N 0.000 description 1
- HTONZBWRYUKUKC-RCWTZXSCSA-N Val-Thr-Val Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O HTONZBWRYUKUKC-RCWTZXSCSA-N 0.000 description 1
- IRAUYEAFPFPVND-UVBJJODRSA-N Val-Trp-Ala Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](N)C(C)C)C(=O)N[C@@H](C)C(O)=O)=CNC2=C1 IRAUYEAFPFPVND-UVBJJODRSA-N 0.000 description 1
- OEVFFOBAXHBXKM-HSHDSVGOSA-N Val-Trp-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@H](C(C)C)N)O OEVFFOBAXHBXKM-HSHDSVGOSA-N 0.000 description 1
- VBTFUDNTMCHPII-UHFFFAOYSA-N Val-Trp-Tyr Natural products C=1NC2=CC=CC=C2C=1CC(NC(=O)C(N)C(C)C)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 VBTFUDNTMCHPII-UHFFFAOYSA-N 0.000 description 1
- MIAZWUMFUURQNP-YDHLFZDLSA-N Val-Tyr-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N MIAZWUMFUURQNP-YDHLFZDLSA-N 0.000 description 1
- CFIBZQOLUDURST-IHRRRGAJSA-N Val-Tyr-Cys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CS)C(=O)O)N CFIBZQOLUDURST-IHRRRGAJSA-N 0.000 description 1
- GUIYPEKUEMQBIK-JSGCOSHPSA-N Val-Tyr-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](Cc1ccc(O)cc1)C(=O)NCC(O)=O GUIYPEKUEMQBIK-JSGCOSHPSA-N 0.000 description 1
- GTACFKZDQFTVAI-STECZYCISA-N Val-Tyr-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C(C)C)CC1=CC=C(O)C=C1 GTACFKZDQFTVAI-STECZYCISA-N 0.000 description 1
- JXWGBRRVTRAZQA-ULQDDVLXSA-N Val-Tyr-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](C(C)C)N JXWGBRRVTRAZQA-ULQDDVLXSA-N 0.000 description 1
- RLVTVHSDKHBFQP-ULQDDVLXSA-N Val-Tyr-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C(C)C)CC1=CC=C(O)C=C1 RLVTVHSDKHBFQP-ULQDDVLXSA-N 0.000 description 1
- PGBMPFKFKXYROZ-UFYCRDLUSA-N Val-Tyr-Phe Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC2=CC=CC=C2)C(=O)O)N PGBMPFKFKXYROZ-UFYCRDLUSA-N 0.000 description 1
- BGTDGENDNWGMDQ-KJEVXHAQSA-N Val-Tyr-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](C(C)C)N)O BGTDGENDNWGMDQ-KJEVXHAQSA-N 0.000 description 1
- ZLNYBMWGPOKSLW-LSJOCFKGSA-N Val-Val-Asp Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O ZLNYBMWGPOKSLW-LSJOCFKGSA-N 0.000 description 1
- AEFJNECXZCODJM-UWVGGRQHSA-N Val-Val-Gly Chemical compound CC(C)[C@H]([NH3+])C(=O)N[C@@H](C(C)C)C(=O)NCC([O-])=O AEFJNECXZCODJM-UWVGGRQHSA-N 0.000 description 1
- JVGDAEKKZKKZFO-RCWTZXSCSA-N Val-Val-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](C(C)C)N)O JVGDAEKKZKKZFO-RCWTZXSCSA-N 0.000 description 1
- WHNSHJJNWNSTSU-BZSNNMDCSA-N Val-Val-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)C(C)C)C(O)=O)=CNC2=C1 WHNSHJJNWNSTSU-BZSNNMDCSA-N 0.000 description 1
- 108010031318 Vitronectin Proteins 0.000 description 1
- 239000004480 active ingredient Substances 0.000 description 1
- 230000006978 adaptation Effects 0.000 description 1
- 239000000654 additive Substances 0.000 description 1
- 108010047506 alanyl-glutaminyl-glycyl-valine Proteins 0.000 description 1
- 108010008685 alanyl-glutamyl-aspartic acid Proteins 0.000 description 1
- 108010076324 alanyl-glycyl-glycine Proteins 0.000 description 1
- 108010031014 alanyl-histidyl-leucyl-leucine Proteins 0.000 description 1
- 108010070783 alanyltyrosine Proteins 0.000 description 1
- WNROFYMDJYEPJX-UHFFFAOYSA-K aluminium hydroxide Chemical compound [OH-].[OH-].[OH-].[Al+3] WNROFYMDJYEPJX-UHFFFAOYSA-K 0.000 description 1
- ILRRQNADMUWWFW-UHFFFAOYSA-K aluminium phosphate Chemical compound O1[Al]2OP1(=O)O2 ILRRQNADMUWWFW-UHFFFAOYSA-K 0.000 description 1
- 235000011126 aluminium potassium sulphate Nutrition 0.000 description 1
- 150000001412 amines Chemical class 0.000 description 1
- 238000004458 analytical method Methods 0.000 description 1
- 230000003171 anti-complementary effect Effects 0.000 description 1
- 239000000427 antigen Substances 0.000 description 1
- 108091007433 antigens Proteins 0.000 description 1
- 102000036639 antigens Human genes 0.000 description 1
- 108010029539 arginyl-prolyl-proline Proteins 0.000 description 1
- 108010084758 arginyl-tyrosyl-aspartic acid Proteins 0.000 description 1
- 108010068380 arginylarginine Proteins 0.000 description 1
- 108010036533 arginylvaline Proteins 0.000 description 1
- 210000004436 artificial bacterial chromosome Anatomy 0.000 description 1
- 210000001106 artificial yeast chromosome Anatomy 0.000 description 1
- 108010021908 aspartyl-aspartyl-glutamyl-aspartic acid Proteins 0.000 description 1
- 230000009286 beneficial effect Effects 0.000 description 1
- 230000004071 biological effect Effects 0.000 description 1
- 229960002685 biotin Drugs 0.000 description 1
- 235000020958 biotin Nutrition 0.000 description 1
- 239000011616 biotin Substances 0.000 description 1
- 238000010504 bond cleavage reaction Methods 0.000 description 1
- AXCZMVOFGPJBDE-UHFFFAOYSA-L calcium dihydroxide Chemical compound [OH-].[OH-].[Ca+2] AXCZMVOFGPJBDE-UHFFFAOYSA-L 0.000 description 1
- 239000000920 calcium hydroxide Substances 0.000 description 1
- 229910001861 calcium hydroxide Inorganic materials 0.000 description 1
- 239000001506 calcium phosphate Substances 0.000 description 1
- 229910000389 calcium phosphate Inorganic materials 0.000 description 1
- 235000011010 calcium phosphates Nutrition 0.000 description 1
- 238000004113 cell culture Methods 0.000 description 1
- 210000000170 cell membrane Anatomy 0.000 description 1
- 230000001413 cellular effect Effects 0.000 description 1
- 239000013043 chemical agent Substances 0.000 description 1
- 125000003636 chemical group Chemical group 0.000 description 1
- 230000006328 chemical modification of amino acids Effects 0.000 description 1
- 230000003749 cleanliness Effects 0.000 description 1
- 230000000295 complement effect Effects 0.000 description 1
- 238000010924 continuous production Methods 0.000 description 1
- 239000012228 culture supernatant Substances 0.000 description 1
- 238000012258 culturing Methods 0.000 description 1
- XUJNEKJLAYXESH-UHFFFAOYSA-N cysteine Natural products SCC(N)C(O)=O XUJNEKJLAYXESH-UHFFFAOYSA-N 0.000 description 1
- 235000018417 cysteine Nutrition 0.000 description 1
- 230000006240 deamidation Effects 0.000 description 1
- 230000009615 deamination Effects 0.000 description 1
- 238000006481 deamination reaction Methods 0.000 description 1
- 239000000412 dendrimer Substances 0.000 description 1
- 229920000736 dendritic polymer Polymers 0.000 description 1
- 239000008121 dextrose Substances 0.000 description 1
- 238000011026 diafiltration Methods 0.000 description 1
- 230000006806 disease prevention Effects 0.000 description 1
- 230000005750 disease progression Effects 0.000 description 1
- 239000006185 dispersion Substances 0.000 description 1
- 239000012153 distilled water Substances 0.000 description 1
- PRAKJMSDJKAYCZ-UHFFFAOYSA-N dodecahydrosqualene Natural products CC(C)CCCC(C)CCCC(C)CCCCC(C)CCCC(C)CCCC(C)C PRAKJMSDJKAYCZ-UHFFFAOYSA-N 0.000 description 1
- 229940079593 drug Drugs 0.000 description 1
- 238000004520 electroporation Methods 0.000 description 1
- 230000009881 electrostatic interaction Effects 0.000 description 1
- 238000005516 engineering process Methods 0.000 description 1
- 230000002708 enhancing effect Effects 0.000 description 1
- 230000017188 evasion or tolerance of host immune response Effects 0.000 description 1
- 230000000763 evoking effect Effects 0.000 description 1
- 230000028023 exocytosis Effects 0.000 description 1
- 102000013165 exonuclease Human genes 0.000 description 1
- 108010068404 exorphin B4 Proteins 0.000 description 1
- 238000000605 extraction Methods 0.000 description 1
- 125000005313 fatty acid group Chemical group 0.000 description 1
- 238000001914 filtration Methods 0.000 description 1
- 230000005714 functional activity Effects 0.000 description 1
- 125000000524 functional group Chemical group 0.000 description 1
- 230000004927 fusion Effects 0.000 description 1
- 108010080575 glutamyl-aspartyl-alanine Proteins 0.000 description 1
- 108010042598 glutamyl-aspartyl-glycine Proteins 0.000 description 1
- 108010040856 glutamyl-cysteinyl-alanine Proteins 0.000 description 1
- 108010008237 glutamyl-valyl-glycine Proteins 0.000 description 1
- 108010067216 glycyl-glycyl-glycine Proteins 0.000 description 1
- 108010010096 glycyl-glycyl-tyrosine Proteins 0.000 description 1
- 108010033719 glycyl-histidyl-glycine Proteins 0.000 description 1
- 108010038983 glycyl-histidyl-lysine Proteins 0.000 description 1
- 108010028188 glycyl-histidyl-serine Proteins 0.000 description 1
- 108010077435 glycyl-phenylalanyl-glycine Proteins 0.000 description 1
- 102000048657 human ACE2 Human genes 0.000 description 1
- 229910052739 hydrogen Inorganic materials 0.000 description 1
- 239000001257 hydrogen Substances 0.000 description 1
- 210000000987 immune system Anatomy 0.000 description 1
- 238000002649 immunization Methods 0.000 description 1
- 230000003053 immunization Effects 0.000 description 1
- 230000001976 improved effect Effects 0.000 description 1
- 230000006872 improvement Effects 0.000 description 1
- 239000012535 impurity Substances 0.000 description 1
- 238000000338 in vitro Methods 0.000 description 1
- 238000011534 incubation Methods 0.000 description 1
- 230000001939 inductive effect Effects 0.000 description 1
- 230000002458 infectious effect Effects 0.000 description 1
- 230000000977 initiatory effect Effects 0.000 description 1
- 150000002484 inorganic compounds Chemical class 0.000 description 1
- 229910010272 inorganic material Inorganic materials 0.000 description 1
- 230000002452 interceptive effect Effects 0.000 description 1
- 238000007918 intramuscular administration Methods 0.000 description 1
- 230000006338 isoaspartate formation Effects 0.000 description 1
- 238000002955 isolation Methods 0.000 description 1
- 108010043612 kentsin Proteins 0.000 description 1
- 238000001638 lipofection Methods 0.000 description 1
- 230000007774 longterm Effects 0.000 description 1
- 108010045397 lysyl-tyrosyl-lysine Proteins 0.000 description 1
- 239000000463 material Substances 0.000 description 1
- 230000007246 mechanism Effects 0.000 description 1
- 108010016686 methionyl-alanyl-serine Proteins 0.000 description 1
- 108010063431 methionyl-aspartyl-glycine Proteins 0.000 description 1
- 108700023046 methionyl-leucyl-phenylalanine Proteins 0.000 description 1
- 108010034507 methionyltryptophan Proteins 0.000 description 1
- 230000011987 methylation Effects 0.000 description 1
- 238000007069 methylation reaction Methods 0.000 description 1
- 210000004877 mucosa Anatomy 0.000 description 1
- 239000002105 nanoparticle Substances 0.000 description 1
- 239000003921 oil Substances 0.000 description 1
- 235000019198 oils Nutrition 0.000 description 1
- 238000007911 parenteral administration Methods 0.000 description 1
- 230000001575 pathological effect Effects 0.000 description 1
- 230000009054 pathological process Effects 0.000 description 1
- 230000037361 pathway Effects 0.000 description 1
- 239000000312 peanut oil Substances 0.000 description 1
- 239000008188 pellet Substances 0.000 description 1
- 230000003285 pharmacodynamic effect Effects 0.000 description 1
- 108010074082 phenylalanyl-alanyl-lysine Proteins 0.000 description 1
- 108010084525 phenylalanyl-phenylalanyl-glycine Proteins 0.000 description 1
- 239000002504 physiological saline solution Substances 0.000 description 1
- 229920000575 polymersome Polymers 0.000 description 1
- 239000013641 positive control Substances 0.000 description 1
- 229940050271 potassium alum Drugs 0.000 description 1
- GRLPQNLYRHEGIJ-UHFFFAOYSA-J potassium aluminium sulfate Chemical compound [Al+3].[K+].[O-]S([O-])(=O)=O.[O-]S([O-])(=O)=O GRLPQNLYRHEGIJ-UHFFFAOYSA-J 0.000 description 1
- 230000002265 prevention Effects 0.000 description 1
- 238000004393 prognosis Methods 0.000 description 1
- 230000035755 proliferation Effects 0.000 description 1
- 230000002035 prolonged effect Effects 0.000 description 1
- 108010020755 prolyl-glycyl-glycine Proteins 0.000 description 1
- 108010025826 prolyl-leucyl-arginine Proteins 0.000 description 1
- 108010077112 prolyl-proline Proteins 0.000 description 1
- 230000001737 promoting effect Effects 0.000 description 1
- 125000006239 protecting group Chemical group 0.000 description 1
- 230000016434 protein splicing Effects 0.000 description 1
- 230000006340 racemization Effects 0.000 description 1
- 230000006798 recombination Effects 0.000 description 1
- 238000005215 recombination Methods 0.000 description 1
- 230000003362 replicative effect Effects 0.000 description 1
- 238000012552 review Methods 0.000 description 1
- 210000003705 ribosome Anatomy 0.000 description 1
- 238000007480 sanger sequencing Methods 0.000 description 1
- 229930182490 saponin Natural products 0.000 description 1
- 150000007949 saponins Chemical class 0.000 description 1
- 235000017709 saponins Nutrition 0.000 description 1
- 230000011218 segmentation Effects 0.000 description 1
- 238000012772 sequence design Methods 0.000 description 1
- 108010069117 seryl-lysyl-aspartic acid Proteins 0.000 description 1
- 239000013605 shuttle vector Substances 0.000 description 1
- 230000019491 signal transduction Effects 0.000 description 1
- 230000011664 signaling Effects 0.000 description 1
- 238000010532 solid phase synthesis reaction Methods 0.000 description 1
- 241000894007 species Species 0.000 description 1
- 108010005652 splenotritin Proteins 0.000 description 1
- 229940031439 squalene Drugs 0.000 description 1
- TUHBEKDERLKLEC-UHFFFAOYSA-N squalene Natural products CC(=CCCC(=CCCC(=CCCC=C(/C)CCC=C(/C)CC=C(C)C)C)C)C TUHBEKDERLKLEC-UHFFFAOYSA-N 0.000 description 1
- 230000000087 stabilizing effect Effects 0.000 description 1
- 238000011272 standard treatment Methods 0.000 description 1
- 210000000130 stem cell Anatomy 0.000 description 1
- 238000011146 sterile filtration Methods 0.000 description 1
- 238000007920 subcutaneous administration Methods 0.000 description 1
- 230000009469 supplementation Effects 0.000 description 1
- 230000001629 suppression Effects 0.000 description 1
- 208000024891 symptom Diseases 0.000 description 1
- 108010033670 threonyl-aspartyl-tyrosine Proteins 0.000 description 1
- 108010071097 threonyl-lysyl-proline Proteins 0.000 description 1
- 239000012096 transfection reagent Substances 0.000 description 1
- 238000000844 transformation Methods 0.000 description 1
- 238000011426 transformation method Methods 0.000 description 1
- 230000001052 transient effect Effects 0.000 description 1
- 230000010474 transient expression Effects 0.000 description 1
- QORWJWZARLRLPR-UHFFFAOYSA-H tricalcium bis(phosphate) Chemical compound [Ca+2].[Ca+2].[Ca+2].[O-]P([O-])([O-])=O.[O-]P([O-])([O-])=O QORWJWZARLRLPR-UHFFFAOYSA-H 0.000 description 1
- 108010080629 tryptophan-leucine Proteins 0.000 description 1
- 108010029384 tryptophyl-histidine Proteins 0.000 description 1
- 108010084932 tryptophyl-proline Proteins 0.000 description 1
- 108010038745 tryptophylglycine Proteins 0.000 description 1
- 108010005834 tyrosyl-alanyl-glycine Proteins 0.000 description 1
- 108010017949 tyrosyl-glycyl-glycine Proteins 0.000 description 1
- 108010071635 tyrosyl-prolyl-arginine Proteins 0.000 description 1
- 238000000108 ultra-filtration Methods 0.000 description 1
- 238000013060 ultrafiltration and diafiltration Methods 0.000 description 1
- 238000002255 vaccination Methods 0.000 description 1
- 239000013603 viral vector Substances 0.000 description 1
- XLYOFNOQVPJJNP-UHFFFAOYSA-N water Chemical compound O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 description 1
- 108010000998 wheylin-2 peptide Proteins 0.000 description 1
- 239000002023 wood Substances 0.000 description 1
Images
Classifications
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K14/00—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
- C07K14/005—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from viruses
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61K—PREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
- A61K39/00—Medicinal preparations containing antigens or antibodies
- A61K39/12—Viral antigens
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/63—Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61K—PREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
- A61K39/00—Medicinal preparations containing antigens or antibodies
- A61K2039/51—Medicinal preparations containing antigens or antibodies comprising whole cells, viruses or DNA/RNA
- A61K2039/53—DNA (RNA) vaccination
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N2770/00—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA ssRNA viruses positive-sense
- C12N2770/00011—Details
- C12N2770/20011—Coronaviridae
- C12N2770/20022—New viral proteins or individual genes, new structural or functional aspects of known viral proteins or genes
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N2770/00—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA ssRNA viruses positive-sense
- C12N2770/00011—Details
- C12N2770/20011—Coronaviridae
- C12N2770/20034—Use of virus or viral component as vaccine, e.g. live-attenuated or inactivated virus, VLP, viral protein
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N2770/00—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA ssRNA viruses positive-sense
- C12N2770/00011—Details
- C12N2770/20011—Coronaviridae
- C12N2770/20051—Methods of production or purification of viral material
Landscapes
- Health & Medical Sciences (AREA)
- Life Sciences & Earth Sciences (AREA)
- Chemical & Material Sciences (AREA)
- Organic Chemistry (AREA)
- Genetics & Genomics (AREA)
- General Health & Medical Sciences (AREA)
- Medicinal Chemistry (AREA)
- Virology (AREA)
- Biophysics (AREA)
- Molecular Biology (AREA)
- Biochemistry (AREA)
- Engineering & Computer Science (AREA)
- Gastroenterology & Hepatology (AREA)
- Microbiology (AREA)
- Proteomics, Peptides & Aminoacids (AREA)
- General Engineering & Computer Science (AREA)
- Biomedical Technology (AREA)
- Wood Science & Technology (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Zoology (AREA)
- Biotechnology (AREA)
- Mycology (AREA)
- Immunology (AREA)
- Pharmacology & Pharmacy (AREA)
- Epidemiology (AREA)
- Animal Behavior & Ethology (AREA)
- Public Health (AREA)
- Veterinary Medicine (AREA)
- Physics & Mathematics (AREA)
- Plant Pathology (AREA)
- Medicines Containing Antibodies Or Antigens For Use As Internal Diagnostic Agents (AREA)
- Peptides Or Proteins (AREA)
- Preparation Of Compounds By Using Micro-Organisms (AREA)
Abstract
본 발명에는 백신으로서 COVID-19 및 기타 바이러스 질환을 방지하는, 고도로 정제된 형태의 SARS-CoV-2 및 관련 코로나바이러스의 외피 단백질, 바이러스 외피 및 바이러스 외피의 단편을 생산하는 생명공학적 제조 공정에 사용될 수 있는 완전 합성 장쇄 핵산이 기재되어 있다.
Description
설명
본 발명은 독립항 1에 따른 완전 합성 장쇄 핵산에 관한 것이다. 본 발명은 추가로 이들 핵산 중 2개 이상을 포함하는 키트 및 핵산을 포함하는 적어도 하나의 플라스미드를 포함하는 생명공학적 생산 유닛에 관한 것이다. 본 발명은 추가로 상기 핵산을 사용하여 유전자 발현에 의해 수득 가능한 바이러스 외피, 바이러스 외피의 단편 및/또는 바이러스 외피 단백질에 관한 것이다. 또한, 본 발명은 핵산을 사용하여 유전자 발현에 의해 수득 가능한 생성물을 포함하는 백신, 특히 코로나바이러스 SARS-CoV-2에 대한 백신뿐만 아니라 백신 생산 방법에 관한 것이다.
백신의 신속한 개발과 가용성은 많은 바이러스와 박테리아를 퇴치하는 데 중요하다. 적합한 백신의 생산은 다단계의 복잡한 과정이며 종종 높은 투자에도 불구하고 항상 성공적인 것은 아니다. 전형적으로, 적합한 백신을 개발하려면 수년이 걸린다. 역학적 관점에서 볼 때 새로운 질환의 출현에 대하여, 가능하다고 하여도, 너무 늦게 반응하는 것만이 가능하기 때문에, 이러한 긴 개발 시간은 특히 새로 출현하는 병원체 또는 돌연변이 병원체와 관련하여 주요 문제이다. 대조적으로, 새롭거나 심하게 돌연변이된 병원체의 분석, 확인 및 추가 검출은 이제 몇 주 또는 심지어 며칠 이내에 가능하며, 이는 지난 세기에 비해 크게 개선된 것이다.
이러한 맥락에서, 바이러스는 다른 종으로부터 인간으로의 확산을 야기하는 높은 돌연변이율을 갖고 있기 때문에 특별한 관심을 갖고 있다. 이러한 바이러스의 급속한 확산은 현대 의학에 주요한 도전과제가 된다. 오늘날(2020) 새로 출현하는 바이러스의 검출/확인과 백신 개발 사이의 일반적인 시간은 전형적으로 몇 년이다. 몇몇 경우에는, 충분한 사전 지식이 있으면 몇 개월 이내에 실험용 백신이 제공될 수 있다. 그러나, 이 기간은 수천 또는 수백만 명의 사람들이 감염될 때까지의 전형적인 시간보다 훨씬 더 길다. 그러한 급속한 확산은 현대 사회의 높은 이동성의 직접적인 결과이기도 하다.
이상적으로, 새로운 바이러스를 확인한 직후에, 충분한 양과 최고 품질의 백신을 이용할 수 있을 것이며, 어떻게든 새로운 바이러스의 초기 발병 지역에 접근한 모든 사람들에 대한 전국적인 백신접종을 허용할 것이다. 또한, 그러한 백신에 이상적인 방법은 바이러스의 진화 및 적응에 반응할 수 있을 것이다. 그러한 이상적인 생산 가능성은 오늘날 당업자에게 유토피아적인 것으로 보인다.
특히 최근, 코로나 팬데믹으로 백신 생산에 적합한 도구 개발의 관련성이 크게 증가했다. 코로나바이러스 SARS-CoV-2에 대한 백신 개발이 팬데믹 및 관련 글로벌 위기를 장기적으로 억제하는 유일한 입증된 수단이라는 데는 이견이 없다.
이러한 배경에서, 본 발명의 과제는 코로나바이러스 SARS-CoV-2에 대한 백신의 대량 및 고품질 생산을 가능하게 하는 기기를 제공하는 것이다.
상기 문제는 청구항 1에 따른 완전 합성 장쇄 핵산에 의해 해결된다. 본 발명의 바람직한 구현예는 구현예 및 종속항에 반영된다.
따라서, 본 발명은 특히 하기 구현예에 관한 것이다:
1. 적어도 4,000개의 염기를 갖는 완전 합성 장쇄 핵산으로서,
임의의 배열로 4개의 서열 부분 A-D 중 적어도 2개를 포함하거나,
서열 부분 A-D에 따른 데옥시리보핵산 서열에 상응하는 리보핵산 서열을 포함하는 것을 특징으로 하는 핵산:
여기서,
i) 서열 부분 A는
a) 서열 번호 50에 정의된 서열 또는 서열 번호 50에 정의된 서열과 적어도 98.5% 서열 동일성을 갖는 서열; 또는
b) 서열 번호 3에 정의된 서열 또는 서열 번호 3에 정의된 서열과 적어도 90% 서열 동일성을 갖는 서열
을 포함하고;
ii) 서열 부분 B는
a) 서열 번호 48에 정의된 서열 또는 서열 번호 48에 정의된 서열과 적어도 98.3% 서열 동일성을 갖는 서열; 또는
b) 서열 번호 7에 정의된 서열 또는 서열 번호 7에 정의된 서열과 적어도 90% 서열 동일성을 갖는 서열
을 포함하고;
iii) 서열 부분 C는
a) 서열 번호 49에 정의된 서열 또는 서열 번호 49에 정의된 서열과 적어도 97.2% 서열 동일성을 갖는 서열; 또는
b) 서열 번호 11에 정의된 서열 또는 서열 번호 11에 정의된 서열과 적어도 90% 서열 동일성을 갖는 서열
을 포함하고;
iv) 서열 부분 D는 서열 번호 17에 정의된 서열 또는 서열 번호 17에 정의된 서열과 적어도 98.5% 서열 동일성을 갖는 서열을 포함한다.
2. 구현예 1에 있어서, 정의된 서열에서 적어도 8,000개의 염기, 바람직하게는 적어도 20,000개의 염기를 갖는 것을 특징으로 하는 것인 핵산.
3. 구현예 1 또는 2에 있어서, 핵산이 하기를 추가로 포함하는 것인 핵산:
a) 1.) 서열 번호 51에 의해 정의된 ORF1ab 서열 또는 서열 번호 51과 적어도 98.5% 서열 동일성을 갖는 서열; 또는
2.) i) 서열 번호 59에 의해 정의된 ORF1b 서열 또는 서열 번호 59와 적어도 98.5% 서열 동일성을 갖는 서열; 및
ii) 서열 번호 58에 의해 정의된 ORF1 서열 또는 서열 번호 58과 적어도 98.6% 서열 동일성을 갖는 서열;
b) 서열 번호 52에 의해 정의된 ORF3a 서열 또는 서열 번호 52와 적어도 99% 서열 동일성을 갖는 서열; 및
c) 서열 번호 54에 의해 정의된 ORF7a 서열 또는 서열 번호 54와 적어도 99.5% 서열 동일성을 갖는 서열.
4. 구현예 3에 있어서, 핵산이 하기를 추가로 포함하는 것인 핵산:
a) 서열 번호 53에 의해 정의된 ORF6 서열 또는 서열 번호 53과 적어도 94.1% 서열 동일성을 갖는 서열; 및/또는
b) 서열 번호 55에 의해 정의된 ORF8 서열 또는 서열 번호 55와 적어도 99% 서열 동일성을 갖는 서열.
5. 구현예 1 내지 4 중 어느 하나에 있어서, 서열 부분 A 내지 C가 서열 번호 19에 따른 서열 또는 상응하는 리보핵산 서열에 상응하는 것을 특징으로 하는 것인 핵산.
6. 구현예 1 내지 5 중 어느 하나에 있어서, 핵산이 임의의 배열로 4개의 서열 부분 A-D 중 적어도 3개 또는 서열 부분 A-D에 따른 데옥시리보핵산 서열에 상응하는 리보핵산 서열을 갖는 4개 서열 부분 중 적어도 3개를 포함하는 것을 특징으로 하는 것인 핵산.
7. 구현예 1 내지 6 중 어느 하나에 있어서, 핵산이 임의의 배열로 4개의 서열 부분 A-D 또는 서열 부분 A-D에 따른 데옥시리보핵산 서열에 상응하는 리보핵산 서열을 갖는 4개의 서열 부분을 포함하는 것을 특징으로 하는 것인 핵산.
8. 구현예 1 내지 7 중 어느 하나에 있어서, 핵산이
서열 번호 15,
서열 번호 28,
서열 번호 29 및
서열 번호 30
으로 이루어진 적어도 하나의 서열을 추가로 포함하거나,
서열 부분인 서열 번호 15, 서열 번호 28, 서열 번호 29 및 서열 번호 30에 따른 데옥시리보핵산 서열 중 하나 또는 상응하는 리보핵산 서열을 포함하는 것을 특징으로 하는 것인 핵산.
9. 구현예 1 내지 8 중 어느 하나에 있어서, 1,000,000개 염기의 최대 크기, 바람직하게는 200,000개 염기의 최대 크기를 갖는 것을 특징으로 하는 것인 핵산.
10. 구현예 1 내지 9 중 어느 하나에 따른 핵산을 포함하는 벡터.
11. 구현예 10에 있어서, 벡터가 서열 번호 46 및 서열 번호 47에 의해 정의된 서열을 포함하는 것인 벡터.
12. 구현예 10 또는 11에 있어서, 벡터가 플라스미드 벡터인 것인 벡터.
13. 구현예 1 내지 9 중 어느 하나에 따른 2개 이상의 핵산을 포함하는 키트.
14. 구현예 13에 있어서, 핵산이 적어도 하나의 플라스미드, 바람직하게는 2개 이상의 플라스미드에 존재하는 것인 키트.
15. 구현예 10 내지 12 중 어느 하나에 따른 적어도 하나의 벡터를 포함하는 생명공학적 생산 유닛.
16. 구현예 1 내지 9 중 어느 하나에 따른 적어도 하나의 핵산, 구현예 10 내지 12 중 어느 하나에 따른 벡터, 구현예 13 또는 14에 따른 키트, 또는 구현예 15에 따른 생명공학적 생산 유닛을 사용하여 유전자 발현에 의해 수득 가능한 바이러스 외피, 바이러스 외피의 단편 및/또는 바이러스 외피 단백질로서, 여기서, 바이러스 외피, 바이러스 외피의 단편 및/또는 바이러스 외피 단백질은 구현예 1 내지 9 중 어느 하나에 따른 적어도 하나의 핵산을 패키징하는 바이러스 외피, 바이러스 외피의 단편 및/또는 바이러스 외피 단백질.
17. 구현예 1 내지 9 중 어느 하나에 따른 적어도 하나의 핵산 및 생산 유기체에서 구현예 1 내지 9 중 어느 하나에 따른 적어도 하나의 핵산, 구현예 10 내지 12 중 어느 하나에 따른 벡터, 구현예 13 또는 14에 따른 키트를 사용하여 유전자 발현에 의해 수득 가능한 생성물을 포함하고, 특히 구현예 16에 따른 바이러스 외피, 바이러스 외피의 단편 및/또는 바이러스 외피 단백질을 포함하는 코로나바이러스 SARS-CoV-2에 대한 백신.
18. 구현예 17에 있어서, 단백질 성분 a, b1, b2, c1, c2, d1 또는 d2로 이루어진 군으로부터 선택된 적어도 2개의 분자적으로 정확하게 정의된 단백질 성분을 포함하고, 여기서,
(i) 단백질 성분은
a) SARS-CoV-2의 S 단백질과 유사한 서열 번호 14에 따른 서열 또는 서열 번호 14와 적어도 90% 서열 동일성을 갖는 서열; 또는
b) SARS-CoV-2의 S 단백질과 유사한 서열 번호 18에 따른 서열 또는 서열 번호 18과 적어도 90% 서열 동일성을 갖는 서열
을 포함하고;
(ii) 단백질 성분 b1은
a) SARS-CoV-2의 외피 단백질 E와 유사한 서열 번호 6에 따른 서열 또는 서열 번호 6과 적어도 90% 서열 동일성을 갖는 서열; 또는
b) SARS-CoV-2의 외피 단백질 E와 유사한 서열 번호 21에 따른 서열 또는 서열 번호 21과 적어도 90% 서열 동일성을 갖는 서열
을 포함하고;
단백질 성분 b2는 MHV59A의 외피 단백질 E 또는 등가 단백질과 유사한 서열 번호 8에 따른 서열 또는 서열 번호 8과 적어도 90% 서열 동일성을 갖는 서열을 포함하고;
(iii) 단백질 성분 c1은
a) SARS-CoV-2의 외피 단백질 M과 유사한 서열 번호 10에 따른 서열 또는 서열 번호 10과 적어도 90% 서열 동일성을 갖는 서열; 또는
b) SARS-CoV-2의 막 단백질 M과 유사한 서열 번호 22에 따른 서열 또는 서열 번호 22와 적어도 90% 서열 동일성을 갖는 서열
을 포함하고;
단백질 성분 c2는 MHV59A의 막 단백질 M 또는 등가 단백질과 유사한 서열 번호 12에 따른 서열 또는 서열 번호 12와 적어도 90% 서열 동일성을 갖는 서열을 포함하고;
(iv) 단백질 성분 d1은
a) SARS-CoV-2의 뉴클레오캡시드 인단백질 N과 유사한 서열 번호 2에 따른 서열 또는 서열 번호 2와 적어도 90% 서열 동일성을 갖는 서열; 또는
b) SARS-CoV-2의 뉴클레오캡시드 인단백질 N과 유사한 서열 번호 26에 따른 서열 또는 서열 번호 26과 적어도 90% 서열 동일성을 갖는 서열
을 포함하고;
단백질 성분 d2는 MHV59A의 뉴클레오캡시드 인단백질 N 또는 등가 단백질과 유사한 서열 번호 4에 따른 서열 또는 서열 번호 4와 적어도 90% 서열 동일성을 갖는 서열을 포함하는 것인 백신.
19. 하기의 연속 단계를 포함하는 코로나바이러스 SARS-CoV-2에 대한 백신의 생산 방법:
a) 구현예 1 내지 9 중 어느 하나에 따른 뉴클레오티드 산 서열을 생명공학적 생산 유닛, 특히 세포주에 도입하는 단계로서,
단백질 성분 a, b1, b2, c1, c2, d1 또는 d2로 이루어진 군으로부터 선택된 단백질 성분 중 적어도 2개를 코딩하는 핵산 기반 mRNA는 번역에 의해 제조되는 것인 단계;
b) 단계 a)에서 생명공학적 생산 유닛으로부터 단백질 성분을 수득하는 단계; 및
c) 수득된 단백질 성분을 정제하여 코로나바이러스 SARS-CoV-2에 대한 백신을 수득하는 단계.
20. 하기의 연속 단계를 포함하는 구현예 16에 따른 바이러스 외피, 바이러스 외피의 단편 및/또는 바이러스 외피 단백질을 포함하는 코로나바이러스 SARS-CoV-2에 대한 백신의 생산 방법:
a) 구현예 1 내지 9 중 어느 하나에 따른 뉴클레오티드 산 서열을 생명공학적 생산 유닛에 도입하는 단계로서, 생명공학적 생산 유닛은 단백질 성분 a, b1, c1 및 d1로 이루어진 군으로부터 선택된 단백질 성분 중 적어도 하나를 코딩하는 뉴클레오티드 산을 포함하는 것인 단계;
b) 단계 a)에서 생명공학적 생산 유닛으로부터 바이러스 외피의 단편 및/또는 바이러스 외피 단백질을 수득하는 단계; 및
c) 수득된 단백질 성분을 정제하여 구현예 16에 따른 바이러스 외피, 바이러스 외피의 단편 및/또는 바이러스 외피 단백질을 포함하는 코로나바이러스 SARS-CoV-2에 대한 백신을 수득하는 단계.
21. 하기의 연속 단계를 포함하는 코로나바이러스 SARS-CoV-2에 대한 백신의 생산 방법:
a) 구현예 10 내지 12 중 어느 하나에 따른 벡터를 증폭 생명공학적 생산 유닛에 도입하는 단계;
b) 증폭 생명공학적 생산 유닛에서 구현예 1 내지 9 중 어느 하나에 따른 뉴클레오티드 산을 증폭하는 단계;
c) 단계 b)에서 증폭된 뉴클레오티드 산을 수득하는 단계;
d) 구현예 19 또는 20에 따른 방법을 사용하여 코로나바이러스 SARS-CoV-2에 대한 백신을 수득하는 단계.
따라서, 본 발명은 적어도 4,000개의 염기를 갖는 완전 합성 장쇄 핵산에 관한 것으로, 핵산은 임의의 배열로 4개의 서열 부분 A-D 중 적어도 2개를 포함하거나, 여기서, i) 서열 부분 A는, a) 서열 번호 1에 정의된 서열 또는 서열 번호 1에 정의된 서열과 적어도 98.5% 서열 동일성을 갖는 서열; 또는 b) 서열 번호 3에 정의된 서열 또는 서열 번호 3에 정의된 서열과 적어도 90% 서열 동일성을 갖는 서열을 포함하고; ii) 서열 부분 B는, a) 서열 번호 5에 정의된 서열 또는 서열 번호 5에 정의된 서열과 적어도 98.3% 서열 동일성을 갖는 서열; 또는 b) 서열 번호 7에 정의된 서열 또는 서열 번호 7에 정의된 서열과 적어도 90% 서열 동일성을 갖는 서열을 포함하고; iii) 서열 부분 C는, a) 서열 번호 9에 정의된 서열 또는 서열 번호 9에 정의된 서열과 적어도 97.2% 서열 동일성을 갖는 서열; 또는 b) 서열 번호 11에 정의된 서열 또는 서열 번호 11에 정의된 서열과 적어도 90% 서열 동일성을 갖는 서열을 포함하고; iv) 서열 부분 D는, 서열 번호 13에 정의된 서열 또는 서열 번호 13에 정의된 서열과 적어도 98.5% 서열 동일성을 갖는 서열을 포함하거나; 서열 부분 A-D에 따른 데옥시리보핵산 서열에 상응하는 리보핵산 서열을 포함한다는 것을 특징으로 한다.
본 발명에 따른 핵산은 언급된 백신의 생산을 상당히 가속화할 수 있게 하고 바이러스 또는 변형, 특히 코로나바이러스 SARS-CoV-2에 매우 특이적인 잘 정의된 백신을 유도한다.
하기에 추가로 나타내는 바와 같이, 본 발명에 따른 핵산 서열에 포함된 서열 부분의 특정 서열 특징은 핵산이 완전히 합성적으로 생산되어 맞춤 제작될 수 있도록 한다. 따라서, 본 발명에 따른 핵산은 특정 구현예에서 RNA 대신 DNA일 뿐만 아니라, 자연 발생 서열과 대조적으로 화학적 합성에 의한 핵산의 완전한 합성 생산을 가능하게 하는 서열이 있다는 점에서 코로나바이러스에 자연적으로 존재하는 핵산과 상이하다.
궁극적으로, 본 발명에 따른 핵산은 따라서 분자 정밀도로 정의된 단백질 성분을 발현하는 것을 가능하게 한다. 이러한 단백질 성분이 백신으로서 투여되는 경우, 따라서 백신 접종자에게 최적의 예방접종(immunization)이 얻어질 수 있다. 동시에, 부정확하게 정의된 단백질 성분으로 매우 만연한 가능한 부작용의 위험이 크게 최소화된다. 또한, 단백질 발현에 사용되는 일반적인 발현 시스템을 사용하여 단백질 성분이 생산될 수 있다는 사실은 백신이 매우 신속하게 대량으로 이용 가능하게 될 수 있다는 것을 의미한다. 이것은 코로나바이러스 SARS-Cov-2와 같은 바이러스에 매우 중요한데, 상기 바이러스의 확산은 팬데믹의 비율을 가정했고 이에 따라 상기 바이러스의 억제는 광범위한 백신 투여를 필요로 한다.
다음 용어 및 개념은 본 발명의 맥락에서 사용될 것이다:
"핵산"이라는 용어는 DNA, RNA 및 이들의 임의의 변형을 지칭한다. 핵산은 단일 가닥 또는 이중 가닥일 수 있다. 변형은 핵산 리간드 염기 또는 핵산 리간드 전체에 대한 추가 전하, 분극성, 수소 결합, 정전기적 상호작용 및 유동성을 포함하는 다른 화학기를 제공하는 것들을 포함하지만, 이에 제한되지 않는다. 그러한 변형은 2'-위치당 변형, 5-위치 피리미딘 변형, 8-위치 퓨린 변형, 엑소사이클릭 아민에서의 변형, 4-티오우리딘의 치환, 5-브로모 또는 5-요오도-우라실의 치환; 골격 변형, 메틸화, 특이한 염기쌍 조합, 예컨대, 이소염기 이소시티딘 및 이소구아니딘을 포함하지만, 이에 제한되지 않는다. 변형은 또한 3' 및 5' 변형, 예컨대, 캡핑을 포함할 수 있다.
완전 합성. 화학적 관점에서, 핵산은 소위 염기로 불리는 반복 단위를 가진 매우 정교한 분자이다. 이러한 맥락에서 "완전 합성"이라는 용어는 본 발명에 따른 핵산이 화학 시약을 사용하는 일련의 화학 반응 단계에 의해 생성된다는 것을 의미한다. 효소와 같은 생화학적 보조제는 이미 더 긴 올리고머의 결합과 같은 개별 후속 생산 단계 동안에도 사용될 수 있다. 후자는 차례로 임의로 합성될 수 있다. 완전 합성 핵산은 화학적 생산 공정을 가능하게 하는 서열 특징을 가지며, 다음 서열 특징 중 하나 이상에서 자연 발생 핵산과 상이하다:
i) 하나 이상의 효소적 제한 부위, 특히, 당업자에게 공지된 IIS형 제한 엔도큐늘레아제에 대한 제한 부위의 부재;
ii) 상응하는 자연 발생 핵산과 비교하여, 완전 합성 핵산 내에 동일한 염기의 9개 초과의 연속적인 단위를 갖는 반복 핵산 서열의 부재 또는 감소된 발생;
iii) 상응하는 자연 발생 핵산과 비교하여, 12개 초과의 염기를 갖는 반복 염기쌍 서열의 부재 또는 감소된 발생;
iv) 상응하는 자연 발생 핵산에 비해, 그에 대한 역-상보성 서열로서 당업자에게 공지된 12개 초과의 염기 단위로 이루어진 간접적으로 반복되는 염기쌍 분절의 부재 또는 감소된 발생;
v) 상응하는 자연 발생 핵산에 비해, 당업자에게 공지된 중복 염기 단위(디뉴클레오티드 반복부)의 9회 초과의 연속적인 반복을 갖는 핵산 서열의 부재 또는 감소된 발생; 및
vi) 상응하는 자연 발생 핵산에 비해, 당업자에게 공지된 삼중 염기 단위(트리뉴클레오티드 반복부)의 5회 초과의 연속적인 반복을 갖는 핵산 서열의 부재 또는 감소된 발생.
일부 구현예에서, 완전 합성 핵산은 문헌(Venetz, J. E., et al., 2019, Proceedings of the National Academy of Sciences, 116(16), 8070-8079 및/또는 이의 SI 부록)에 기재된 방법에 따라 부분적으로 생성되고/되거나 서열 특징을 포함한다.
일부 구현예에서, 완전 합성 핵산은 화학적 생산 공정을 가능하게 하는 서열 특징을 포함하고, 전술한 서열 특징, 특히 전술한 서열 특징 i) - vi) 중 2개 이상에서 자연 발생 핵산과는 상이하다.
일부 구현예에서, 완전 합성 핵산은 화학적 생산 공정을 가능하게 하는 서열 특징을 포함하고, 전술한 서열 특징, 특히 전술한 서열 특징 i) - vi) 중 3개 이상에서 자연 발생 핵산과는 상이하다.
일부 구현예에서, 완전 합성 핵산은 화학적 생산 공정을 가능하게 하는 서열 특징을 포함하고, 전술한 서열 특징, 특히 전술한 서열 특징 i) - vi) 중 4개 이상에서 자연 발생 핵산과는 상이하다.
일부 구현예에서, 완전 합성 핵산은 화학적 생산 공정을 가능하게 하는 서열 특징을 포함하고, 전술한 서열 특징, 특히 전술한 서열 특징 i) - vi) 중 5개 이상에서 자연 발생 핵산과는 상이하다.
일부 구현예에서, 완전 합성 핵산은 화학적 생산 공정을 가능하게 하는 서열 특징을 포함하고, 전술한 서열 특징, 특히 전술한 서열 특징 i) - vi) 중 6개에서 자연 발생 핵산과는 상이하다. 장쇄 올리고뉴클레오티드는 짧은 단편으로 수년간 상업적으로 이용되어 왔으며, 전형적으로 60, 100 또는 200개 염기 조각을 생산한다. 엄청나게 더 긴 올리고뉴클레오티드는 오늘날 사용되는 합성이 합당한 양의 더 긴 핵산을 생산하기에는 오류율이 너무 높기 때문에 쉽게 이용할 수 없다. 따라서, 1,000개 미만의 염기를 가진 그러한 단편은 단쇄로 불리고, 1,000개 이상의 염기를 가진 핵산은 장쇄로 불린다. 1,000 내지 5,000개의 염기를 가진 장쇄 핵산은 오늘날 상당한 비용을 들여 생산될 수 있다(예를 들어, Twist Bioscience, Life-technologies라는 회사에 의함). 5,000개 초과의 염기를 가진 장쇄 핵산은 매우 복잡하지만 화학적으로 잘 정의된 분자이다. 각 분자는 위치, 유형 및 분자의 다른 부분과의 연결에 의해 고전적인 유기 화학의 관점에서 완전히 설명될 수 있다. 따라서, 2개의 동일한 장쇄 핵산은 이들의 크기 및 수만 내지 수백만 개의 원자를 포함하고 있다는 사실에도 불구하고, 모든 구성요소가 동일하고 동일하게 연결되어 있다는 점에서 동일하다.
말단기, 보호기의 임의의 잔기 또는 핵산의 합성으로부터의 기타 보조제에 대한 설명. 상기의 설명은 핵산의 염기 유형을 지칭한다. 당업자는 합성이 말단에서 절단되는 다양한 보조제에 의해 수행된다는 것을 알고 있다. 그러나, 때때로 그러한 기의 잔기가 남아 있거나 분자의 다른 부분이 합성 단계 전 또는 후에 유도체화된다. 그러한 기는 당업자에게 공지되어 있으며, 특히 폴리-A 테일, 변형된 DNA 염기, 고상(solid-phase) 합성으로부터의 절단 가능한 링커, 생화학적 기, 예컨대, 비오틴 또는 스트렙타비딘 등을 포함한다.
다른 가능한 변형 및 표준 방법에 사용되는 변형은 형광 마커에 관한 것이다. 이러한 변형 또는 이의 잔기는 상기의 설명에 영향을 미치지 않아야 하며, 위치 및 유형 염기당 이들의 위치에 있는 모든 n 개의 염기가 모든 염기에 대해 동일한 경우, 동일한 핵산의 군은 동일한 것으로 간주되어야 한다. 다시 말해서, 본 발명의 핵산은 또한, 본 발명에 의해 요구되는 염기 서열을 갖는 한, 상기 변형 또는 잔기를 갖는 핵산을 포함한다.
제1 양태에 따르면, 본 발명은 따라서 특정한 성질을 갖는 핵산에 관한 것이다. 이러한 특정한 성질은 염기 서열, 즉, 서열에 포함되며, 본 발명의 핵산이 특정한 성질을 갖는 경우에만 얻어진다. 이러한 성질은 특정한 분자의 직접적인 부분 또는 특정한 분자에 대한 화학적으로 포괄적인 전체 설명이다. 그러나, 단순함을 위해, 염기 서열은 본문의 해당 설명 내에 표시되어야 하며 항상 특정한 분자를 의미한다는 것을 분명히 해야 한다. 따라서, 염기 서열은 단지 실용적인 형태의 설명일 뿐이며, 분자 또는 그 IUPAC 명칭의 직접적인 표현보다 본 발명의 텍스트 표현에 명맥하게 더 적합하다.
본 발명의 분자는 화학에서 전형적으로 "R"로 약칭되고 이어서 "R"을 설명함으로써 더 자세히 설명될 수 있는, 하나 이상의 분자 부분을 가진 고전적인 화학적 제제의 군에 대한 설명과 유사한, 특정 서열의 존재를 통해 특정한 성질을 얻는다. 따라서, 본 발명에서, 유기 화학에서의 이러한 통상적인 절차와 유사하게, 본 발명의 장쇄 완전 합성 핵산의 특정한 성질을 담당하는 서열의 군이 기재되어 있다.
본 발명의 핵산은 이들이 외피 단백질 코로나바이러스의 4가지 유형의 단백질 중 적어도 2가지를 코딩하는 완전 합성 핵산을 포함한다는 사실을 특징으로 한다.
본원에 사용된 "코로나바이러스의 외피 단백질의 유형"이라는 용어는 코로나바이러스의 A 군, B 군, C 군 또는 D 군 단백질을 지칭한다. 본원에 사용된 용어 "A 군" 단백질은 코로나바이러스의 뉴클레오캡시드 단백질(N-유형)의 군을 지칭한다. 본원에 사용된 용어 "B 군"은 코로나바이러스의 외피 단백질(E-유형)의 군을 지칭한다. 본원에 사용된 용어 "C 군" 단백질은 코로나바이러스의 막 단백질(M-유형)을 지칭한다. 본원에 사용된 용어 "D 군" 단백질은 코로나바이러스의 글리코실화된 표면 단백질(S-유형)을 지칭한다.
일부 구현예에서, 본원에 기재된 핵산은 이들이 적어도 하나의 A 군 단백질 및 적어도 하나의 B 군 단백질을 코딩하는 핵산을 포함한다는 사실을 특징으로 한다. 일부 구현예에서, 본원에 기재된 핵산은 이들이 적어도 하나의 A 군 단백질 및 적어도 하나의 C 군 단백질을 코딩하는 핵산을 포함한다는 사실을 특징으로 한다. 일부 구현예에서, 본원에 기재된 핵산은 이들이 적어도 하나의 A 군 단백질 및 적어도 하나의 D 군 단백질을 코딩하는 핵산을 포함한다는 사실을 특징으로 한다. 일부 구현예에서, 본원에 기재된 핵산은 이들이 적어도 하나의 B 군 단백질 및 적어도 하나의 C 군 단백질을 코딩하는 핵산을 포함한다는 사실을 특징으로 한다. 일부 구현예에서, 본원에 기재된 핵산은 이들이 적어도 하나의 B 군 단백질 및 적어도 하나의 D 군 단백질을 코딩하는 핵산을 포함한다는 사실을 특징으로 한다. 일부 구현예에서, 본원에 기재된 핵산은 이들이 적어도 하나의 C 군 단백질 및 적어도 하나의 D 군 단백질을 코딩하는 핵산을 포함한다는 사실을 특징으로 한다.
일부 구현예에서, 본 발명의 핵산은 이들이 하기 사실을 특징으로 한다:
(a) 잘 정의된 서열에 4,000개 초과의 염기를 포함하고;
(b) 코로나바이러스의 4가지 유형의 외피 단백질을 코딩하는 4개의 서열 군 A-D에 할당된 특히 중요한 4개의 서열 중 적어도 2개를 포함하고, 여기서,
i) 제1 서열 A 군은 코로나바이러스의 뉴클레오캡시드 단백질 N의 외피 단백질을 코딩하고,
ii) 제2 서열 B 군은 코로나바이러스의 외피 단백질 E 유형의 외피 단백질을 코딩하고,
iii) 제3 서열 C 군은 코로나바이러스의 막 단백질 M 유형의 외피 단백질을 코딩하고,
iv) 제4 서열 D 군은 코로나바이러스의 글리코실화된 표면 단백질 S의 외피 단백질을 코딩한다.
본 설명에 개시된 서열 부분 A는 서열 번호 2 또는 서열 번호 4에 따른 상응하는 단백질 서열을 코딩하는 서열 번호 1 또는 서열 번호 3에 따른 서열을 포함한다. 일부 구현예에서, 서열 부분 A는 서열 번호 50에 의해 정의된 서열 또는 서열 번호 50과 적어도 98.5%, 적어도 98.6%, 적어도 98.7%, 적어도 98.8%, 적어도 98.9%, 적어도 99%, 적어도 99.1%, 적어도 99.2%, 적어도 99.3%, 적어도 99.4%, 적어도 99.5%, 적어도 99.6%, 적어도 99.7%, 적어도 99.8%, 또는 적어도 99.9% 서열 동일성을 갖는 서열을 포함한다.
일부 구현예에서, 서열 부분 A는 서열 번호 3과 적어도 90% 서열 동일성을 갖는 서열을 포함한다.
일부 구현예에서, 서열 부분 A는 서열 번호 2와 적어도 90%, 적어도 91%, 적어도 92%, 적어도 93%, 적어도 94%, 적어도 95%, 적어도 96%, 적어도 97%, 적어도 98%, 적어도 99%, 적어도 99.1%, 적어도 99.2%, 적어도 99.3%, 적어도 99.4%, 적어도 99.5%, 적어도 99.6%, 적어도 99.7%, 적어도 99.8%, 또는 적어도 99.9% 서열 동일성을 갖는 아미노산 서열을 코딩하는 서열을 포함한다.
일부 구현예에서, 서열 부분 A는 서열 번호 4와 적어도 90%, 적어도 91%, 적어도 92%, 적어도 93%, 적어도 94%, 적어도 95%, 적어도 96%, 적어도 97%, 적어도 98%, 적어도 99%, 적어도 99.1%, 적어도 99.2%, 적어도 99.3%, 적어도 99.4%, 적어도 99.5%, 적어도 99.6%, 적어도 99.7%, 적어도 99.8%, 또는 적어도 99.9% 서열 동일성을 갖는 아미노산 서열을 코딩하는 서열을 포함한다.
일부 구현예에서, 서열 부분 A는 서열 번호 2 및 서열 번호 4와 적어도 90%, 적어도 91%, 적어도 92%, 적어도 93%, 적어도 94%, 적어도 95%, 적어도 96%, 적어도 97%, 적어도 98%, 적어도 99%, 적어도 99.1%, 적어도 99.2%, 적어도 99.3%, 적어도 99.4%, 적어도 99.5%, 적어도 99.6%, 적어도 99.7%, 적어도 99.8%, 또는 적어도 99.9% 서열 동일성을 갖는 아미노산 서열을 코딩하는 서열을 포함한다.
본 설명에 개시된 서열 부분 B는 서열 번호 6 또는 서열 번호 8에 따른 상응하는 단백질 서열을 코딩하는 서열 번호 5 또는 서열 번호 7에 따른 서열을 포함한다. 일부 구현예에서, 서열 부분 B는 서열 번호 48에 의해 정의된 서열 또는 서열 번호 48과 적어도 98.3%, 적어도 98.6%, 적어도 99.1%, 또는 적어도 99.5% 서열 동일성을 갖는 서열을 포함한다.
일부 구현예에서, 서열 부분 B는 서열 번호 7과 적어도 90% 서열 동일성을 갖는 서열을 포함한다.
일부 구현예에서, 서열 부분 B는 서열 번호 6과 적어도 90%, 적어도 91%, 적어도 92%, 적어도 93%, 적어도 94%, 적어도 95%, 적어도 96%, 적어도 97%, 적어도 98%, 적어도 99%, 적어도 99.1%, 적어도 99.2%, 적어도 99.3%, 적어도 99.4%, 적어도 99.5%, 적어도 99.6%, 적어도 99.7%, 적어도 99.8%, 또는 적어도 99.9% 서열 동일성을 갖는 아미노산 서열을 코딩하는 서열을 포함한다.
일부 구현예에서, 서열 부분 B는 서열 번호 8과 적어도 90%, 적어도 91%, 적어도 92%, 적어도 93%, 적어도 94%, 적어도 95%, 적어도 96%, 적어도 97%, 적어도 98%, 적어도 99%, 적어도 99.1%, 적어도 99.2%, 적어도 99.3%, 적어도 99.4%, 적어도 99.5%, 적어도 99.6%, 적어도 99.7%, 적어도 99.8%, 또는 적어도 99.9% 서열 동일성을 갖는 아미노산 서열을 코딩하는 서열을 포함한다.
일부 구현예에서, 서열 부분 B는 서열 번호 6 및 서열 번호 8과 적어도 90%, 적어도 91%, 적어도 92%, 적어도 93%, 적어도 94%, 적어도 95%, 적어도 96%, 적어도 97%, 적어도 98%, 적어도 99%, 적어도 99.1%, 적어도 99.2%, 적어도 99.3%, 적어도 99.4%, 적어도 99.5%, 적어도 99.6%, 적어도 99.7%, 적어도 99.8%, 또는 적어도 99.9% 서열 동일성을 갖는 아미노산 서열을 코딩하는 서열을 포함한다.
본 설명에 개시된 서열 부분 C는 서열 번호 10 또는 서열 번호 12에 따른 상응하는 단백질 서열을 코딩하는 서열 번호 9 또는 서열 번호 11에 따른 서열을 포함한다. 일부 구현예에서, 서열 부분 C는 서열 번호 49에 의해 정의된 서열 또는 서열 번호 49와 적어도 97.2%, 적어도 97.4%, 적어도 97.6%, 적어도 97.8%, 적어도 98%, 적어도 98.2%, 적어도 98.4%, 적어도 98.6%, 적어도 98.8%, 적어도 99%, 적어도 99.2%, 적어도 99.4%, 적어도 99.6%, 적어도 99.8% 서열 동일성을 갖는 서열을 포함한다.
일부 구현예에서, 서열 부분 C는 서열 번호 11과 적어도 90% 서열 동일성을 갖는 서열을 포함한다.
일부 구현예에서, 서열 부분 B는 서열 번호 12와 적어도 90%, 적어도 91%, 적어도 92%, 적어도 93%, 적어도 94%, 적어도 95%, 적어도 96%, 적어도 97%, 적어도 98%, 적어도 99%, 적어도 99.1%, 적어도 99.2%, 적어도 99.3%, 적어도 99.4%, 적어도 99.5%, 적어도 99.6%, 적어도 99.7%, 적어도 99.8%, 또는 적어도 99.9% 서열 동일성을 갖는 아미노산 서열을 코딩하는 서열을 포함한다.
일부 구현예에서, 서열 부분 B는 서열 번호 10과 적어도 90%, 적어도 91%, 적어도 92%, 적어도 93%, 적어도 94%, 적어도 95%, 적어도 96%, 적어도 97%, 적어도 98%, 적어도 99%, 적어도 99.1%, 적어도 99.2%, 적어도 99.3%, 적어도 99.4%, 적어도 99.5%, 적어도 99.6%, 적어도 99.7%, 적어도 99.8%, 또는 적어도 99.9% 서열 동일성을 갖는 아미노산 서열을 코딩하는 서열을 포함한다.
일부 구현예에서, 서열 부분 B는 서열 번호 10 및 서열 번호 12와 적어도 90%, 적어도 91%, 적어도 92%, 적어도 93%, 적어도 94%, 적어도 95%, 적어도 96%, 적어도 97%, 적어도 98%, 적어도 99%, 적어도 99.1%, 적어도 99.2%, 적어도 99.3%, 적어도 99.4%, 적어도 99.5%, 적어도 99.6%, 적어도 99.7%, 적어도 99.8%, 또는 적어도 99.9% 서열 동일성을 갖는 아미노산 서열을 코딩하는 서열을 포함한다.
본 설명에 개시된 서열 부분 D는 서열 번호 14에 따른 상응하는 단백질 서열을 코딩하는 서열 번호 13에 따른 서열을 포함한다. 일부 구현예에서, 서열 부분 D는 서열 번호 17에 의해 정의된 서열 또는 서열 번호 17과 적어도 98.5%, 적어도 98.6%, 적어도 98.7%, 적어도 98.8%, 적어도 98.9%, 적어도 99%, 적어도 99.1%, 적어도 99.2%, 적어도 99.3%, 적어도 99.4%, 적어도 99.5%, 적어도 99.6%, 적어도 99.7%, 적어도 99.8%, 또는 적어도 99.9% 서열 동일성을 갖는 서열을 포함한다.
일부 구현예에서, 서열 부분 B는 서열 번호 14와 적어도 90%, 적어도 91%, 적어도 92%, 적어도 93%, 적어도 94%, 적어도 95%, 적어도 96%, 적어도 97%, 적어도 98%, 적어도 99%, 적어도 99.1%, 적어도 99.2%, 적어도 99.3%, 적어도 99.4%, 적어도 99.5%, 적어도 99.6%, 적어도 99.7%, 적어도 99.8%, 또는 적어도 99.9% 서열 동일성을 갖는 아미노산 서열을 코딩하는 서열을 포함한다.
참조 서열에 대한 "퍼센트(%) 서열 동일성"이라는 용어는, 필요한 경우, 최대 퍼센트 서열 동일성을 달성하기 위해 서열을 정렬하고 갭을 도입한 후 참조 서열의 뉴클레오티드 또는 아미노산 잔기와 동일한 후보 서열의 뉴클레오티드 또는 아미노산 잔기의 백분율로서 정의되며, 서열 동일성의 일부로서 어떠한 보존적 치환도 고려하지 않는다. 퍼센트 아미노산 서열 동일성을 결정하기 위한 정렬은, 예를 들어, BLAST, BLAST-2, ALIGN 또는 Megalign(DNASTAR) 소프트웨어와 같은 공개적으로 이용 가능한 컴퓨터 소프트웨어를 사용하여 당업계의 기술 범위 내에 있는 다양한 방식으로 달성될 수 있다. 당업자는 비교되는 서열의 전장에 걸쳐 최대 정렬을 달성하는 데 필요한 임의의 알고리즘을 포함하여 서열을 정렬하기 위한 적절한 매개변수를 결정할 수 있다.
일부 구현예에서, 본 발명의 뉴클레오티드 산 서열은 단백질 생성물의 성질을 변경하지 않거나 실질적으로 변경하지 않음으로써 (예를 들어, 뉴클레오티드 산 서열 또는 이의 산물의 생산 과정을 촉진하기 위해) 변경된다.
일부 구현예에서, 본 발명의 뉴클레오티드 산 서열의 변경은 하기 군으로부터 선택된 적어도 하나의 변경을 포함한다:
1) 단백질 생성물의 성질을 변경하지 않거나 실질적으로 변경하지 않음으로써 참조 서열에 대한 염기 치환 삽입 또는 결실;
2) 코돈을 아주 밀접한 버전으로 대체; 및
3) 번역 속도를 미세 조정하는 (대체) ORF, 예측된 유전자 내부 전사 시작 부위 및/또는 서열 모티프(예측된 또는 암호) (예를 들어, 리보솜 중단 모티프)와 같은 단백질 코딩 서열 내에 존재하는 가상의 유전 요소의 수 감소.
본 발명의 변경된 뉴클레오티드 산 서열의 유전자가 기능을 유지하는지 여부를 시험하면, 아미노산 코드를 넘어서는 추가 정보가 적절한 기능을 위해 필요한 유전자를 확인할 것이다.
일부 구현예에서, 본원에 기재된 뉴클레오티드 산 서열은 코딩된 단백질 생성물의 생물학적 기능을 개선하도록 변경된다.
그러한 생물학적 기능은 안정성 향상, 생산 촉진(예를 들어, 추가 복제 개시 서열의 삽입), 복제 제한을 포함하지만 이에 제한되지 않는다.
일부 구현예에서, 본원에 기재된 뉴클레오티드 산 서열은 유사한 구조를 갖지만, 돌연변이된 바이러스의 단백질의 기능과 같은 대체 생물학적 기능을 갖는 관심 있는 적어도 하나의 대체 단백질을 코딩하도록 변경된다.
당업자는 관심 있는 적어도 하나의 대체 단백질을 코딩하는 서열(예를 들어, 돌연변이된 바이러스의 뉴클레오티드 산 서열)을 분석하고 관련 변경(예를 들어, 돌연변이)을 본원에 기재된 가장 유사한 뉴클레오티드 산 서열로 구현함으로써, 그러한 변경된 뉴클레오티드 서열을 얻을 수 있다. 일부 구현예에서, 본원에 기재된 가장 유사한 뉴클레오티드 산 서열은 서열 번호 1, 서열 번호 3, 서열 번호 5, 서열 번호 7, 서열 번호 9, 서열 번호 11, 서열 번호 13 및/또는 서열 번호 17에 의해 정의된 서열이다.
일부 구현예에서, 본원에 기재된 가장 유사한 뉴클레오티드 산 서열은 서열 번호 1, 서열 번호 5, 서열 번호 9, 서열 번호 13에 의해 정의된 서열이다.
일부 구현예에서, 본원에 기재된 코로나바이러스는 SARS-CoV-2이다. 일부 구현예에서, 본원에 기재된 SARS-CoV-2는 Lineage B.1.1.207, Lineage B.1.1.7, Cluster 5, 501.V2 변이체, Lineage P.1, Lineage B.1.429/CAL.20C, 및 Lineage B.1.525의 군으로부터 선택된 SARS-CoV-2 변이체이다.
일부 구현예에서, 본원에 기재된 SARS-CoV-2는 19A, 20A, 20C, 20G, 20H, 20B, 20D, 20F, 20I 및 20E의 군으로부터 선택된 Nextstrain 계통군에 의해 기재된 SARS-CoV-2 변이체이다.
일부 구현예에서, 관심 있는 적어도 하나의 대체 단백질을 코딩하는 서열은 적어도 하나의 SARS-CoV-2 변이체에 대해 특징적인 단백질을 코딩하는 서열을 포함한다. 일부 구현예에서, 적어도 하나의 SARS-CoV-2 변이체에 대해 특징적인 단백질은 서열 번호 18, 서열 번호 21, 서열 번호 22 및/또는 서열 번호 26과 적어도 90%, 적어도 91%, 적어도 92%, 적어도 93%, 적어도 94%, 적어도 95%, 적어도 96%, 적어도 97%, 적어도 98%, 적어도 99%, 적어도 99.1%, 적어도 99.2%, 적어도 99.3%, 적어도 99.4%, 적어도 99.5%, 적어도 99.6%, 적어도 99.7%, 적어도 99.8%, 또는 적어도 99.9% 서열 동일성을 갖는 서열에 의해 코딩되는 단백질이다.
관련 변경의 이러한 구현은, 예를 들어, 적어도 하나의 염기의 삽입, 결실, 치환 및/또는 변형에 의해 달성될 수 있지만, 본원에 기재된 뉴클레오티드 산 서열의 백분율 이하일 수 있다.
일부 구현예에서, 본원에 기재된 가장 유사한 뉴클레오티드 산 서열은 서열 번호 1, 서열 번호 3, 서열 번호 5, 서열 번호 7, 서열 번호 9, 서열 번호 11, 서열 번호 13 및/또는 서열 번호 17에 의해 정의된 서열이다.
일부 구현예에서, 본원에 기재된 가장 유사한 뉴클레오티드 산 서열은 서열 번호 1, 서열 번호 5, 서열 번호 9 및/또는 서열 번호 13에 의해 정의된 적어도 하나의 서열이다.
일부 구현예에서, 삽입, 결실 또는 변형은 본원에 기재된 바와 같은 화학 시약을 사용하는 일련의 화학 반응 단계를 사용하여 본 발명의 핵산의 신규한 합성에 의해 달성될 수 있다.
변경된 서열은 서열 번호 1, 서열 번호 3, 서열 번호 5, 서열 번호 7, 서열 번호 9, 서열 번호 11, 서열 번호 13 및/또는 서열 번호 17에 의해 정의된 뉴클레오티드 산 서열보다 더 많거나 상이한 위치에서 변경된 서열의 화학적 생산 공정을 가능하게 하고/하거나 개선하는 서열 특징(예를 들어, 상기 기재된 서열 특징 i)-vi))을 포함할 수 있다.
IUPAC-분류 가능한 분자로의 이들의 가능한 변형은 당업자에게 공지되어 있다. 위에 정의된 바와 같은 데옥시리보핵산에 대한 대안으로서, 상응하는 리보핵산이 또한 존재할 수 있다. 다시 말해서, 서열 부분 A-D에 따른 데옥시리보핵산 서열에 추가하여, 본 발명에 따른 정의는 또한 상응하는 리보핵산 서열을 포함한다. 이들에서, 상응하는 리보핵산은 티민(T)이 우라실(U)로 대체된 위에 정의된 바와 같은 서열 부분을 갖는다.
MHV 및 SARS-CoV-2의 외피 단백질 E, M, N 및 S, 및 적용 가능하다면, MHV의 RNA-의존성 RNA 폴리머라제를 코딩하는 본 발명의 장쇄 핵산의 염기쌍 서열은 복잡한 발달의 결과를 나타내며, 제1 단계에서 유전자 코드의 중복성을 고려하여 상응하는 단백질의 천연 아미노산 서열부터 시작으로 계산함으로써 많은 수의 서열 변이체가 형성되었다.
특히, SARS-CoV-2의 단백질 E, M, N 및/또는 S를 코딩하는 본 발명의 장쇄 핵산의 염기쌍 서열은 복잡한 발달의 결과를 나타내며, 제1 단계에서 유전자 코드의 중복성을 고려하여 상응하는 단백질의 천연 아미노산 서열부터 시작으로 계산함으로써 많은 수의 서열 변이체가 형성되었다.
생성된 서열 트리로부터, 제2 단계에서, 각각의 코딩된 외피 단백질에 대한 염기쌍 서열은 첫째, 생물학적 기능의 측면에서 자연 서열과 가장 유사하고, 둘째, 화학적 생산 공정을 가능하게 하는 최적의 서열 특성을 또한 갖는 것으로 결정되었다.
또한, 서열은 야생형 바이러스의 구조적 단백질의 조합을 코딩한다. 이것은 T-세포 에피토프를 포함하여 면역계에 이용 가능한 광범위한 에피토프를 가능하게 한다(예를 들어, 문헌(Grifoni, A., et al., 2020, Cell, 181(7), 1489-1501) 참조). 이러한 광범위한 에피토프는 기존 면역이 있거나 없는 환자에서 광범위한 바이러스 변이체에 대한 면역을 가능하게 할 수 있다.
따라서, 본 발명은 본 발명의 핵산이 제한된 복제 능력을 갖지만, 원래의 바이러스와 유사한 항원 효과를 갖는 조합 바이러스-유사 단백질의 효율적인 생산을 가능하게 한다는 발견에 적어도 부분적으로 기반한다.
언급된 바와 같이, 본 발명에 따른 핵산은 적어도 4,000개의 염기 또는 염기쌍을 갖는다. 바람직하게는, 정의된 서열에서 적어도 8,000개의 염기, 특히 바람직하게는 적어도 20,000개의 염기를 갖는다. 또한, 핵산은 1,OOO,OOO개 염기의 최대 크기, 바람직하게는 200,000개 염기의 최대 크기를 갖는 것이 바람직하다.
큰 서열은 생산, 증폭 및/또는 발현하기 어려운 것으로 반복적으로 나타났지만, 다수의 염기는 원래 바이러스와 유사한 항원 효과를 갖는 바이러스-유사 단백질의 특정 조합을 일관되게 생산하는 데 유리하다.
본원에 제공된 수단 및 방법은 특정 길이 범위의 본 발명에 따른 핵산의 생산을 가능하게 한다(예를 들어, 실시예 1-3 참조).
따라서, 본 발명은 특정 길이 범위의 길이를 갖는 본 발명의 핵산이 제한된 복제 능력을 갖지만, 원래의 바이러스와 유사한 항원 효과를 갖는 조합 바이러스-유사 단백질의 효율적인 생산을 가능하게 한다는 발견에 적어도 부분적으로 기반한다.
본 발명에 따른 핵산은 단일 장쇄 핵산 또는 별도의 장쇄 핵산으로 분할된 형태로 존재할 수 있다.
일부 구현예에서, 본 발명에 따른 핵산은 단일 장쇄 핵산 또는 최대 4개의 별도의 장쇄 핵산으로 분할된 형태로 존재할 수 있다.
별도의 장쇄 핵산으로의 분리는 본 발명의 핵산의 증폭을 촉진할 수 있다(실시예 3).
추가의 바람직한 구현예에 따르면, 서열 부분 A-D는 서열 번호 16에 따라 배열된다.
또한, 서열 부분 D는 서열 번호 17로 이루어지고, 서열 번호 18에 따른 단백질 서열을 코딩하는 것이 바람직하다.
추가의 바람직한 구현예에 따르면, 서열 부분 A-C는 서열 번호 19에 따라 배열되고, 이에 의해 서열 부분 A는 서열 번호 26에 따른 단백질 서열을 코딩하고, 서열 부분 B는 서열 번호 21에 따른 단백질 서열을 코딩하고, 서열 부분 C는 서열 번호 22에 따른 단백질 서열을 코딩하고, 또한 서열 부분 A-C는 서열 번호 20, 서열 번호 22, 서열 번호 23, 서열 번호 24, 서열 번호 25 및 서열 번호 27을 코딩하는 서열로 확장될 수 있다.
일부 구현예에서, 본 발명은 본 발명에 따른 뉴클레오티드 산 서열에 관한 것이고, 여기서, 뉴클레오티드 산 서열은 서열 번호 19와 적어도 98.5%, 적어도 98.6%, 적어도 98.7%, 적어도 98.8%, 적어도 98.9%, 적어도 99%, 적어도 99.1%, 적어도 99.2%, 적어도 99.3%, 적어도 99.4%, 적어도 99.5%, 적어도 99.6%, 적어도 99.7%, 적어도 99.8%, 또는 적어도 99.9% 서열 동일성을 갖는 서열 또는 상응하는 리보핵산 서열에 의해 정의된다.
본 설명에 개시된 서열 부분 A-D를 코로나바이러스의 RNA-의존성 RNA 폴리머라제의 서열 번호 31 및 서열 번호 32에 따른 폴리단백질 서열을 코딩하는 서열 번호 15 또는 서열 번호 30에 따른 서열을 포함하는 서열 부분 E의 핵산 서열로 보충하는 것이 특히 바람직하다.
서열 번호 15 또는 서열 번호 30에 따른 서열은 본 발명에 따른 핵산의 성분을 나타낼 수 있고, 따라서 서열 부분 A-D의 2개 이상의 서열과 조합하여 동일한 분자에 존재할 수 있다. 독립 분자의 성분으로서 본 발명에 따른 핵산과 함께 키트에 존재하는 것도 생각할 수 있다. IUPAC-분류 가능한 분자로의 가능한 전달은 당업자에게 공지되어 있다.
서열 부분 E의 존재는 상응하는 단백질의 유전자 발현을 위해 RNA가 DNA 플라스미드 대신 생명공학적 생산 유닛에 도입되는 경우 관련이 있다. 이와 관련하여, 서열 부분 E가 서열 번호 33 또는 서열 번호 34에 따른 RNA 형태로 키트에 도입되어 키트에 존재하는 것도 생각할 수 있다. 이것은 아래의 특정 예의 맥락에서 더 설명될 것이다.
이러한 구체적인 서열은 첫째 자연 서열과의 이들의 유사성 또는 이들의 생물학적 기능과 관련하여, 그리고 둘째 화학적 생산 공정과 관련하여 특히 유리한 것으로 나타났다.
또 다른 바람직한 구현예에 따르면, 핵산은 임의의 배열로 4개의 서열 부분 A-D 중 적어도 3개를 포함한다. 이와 관련하여, 핵산이 임의의 배열로 4개의 서열 부분 A-D를 포함하는 것이 특히 바람직하다.
또한, 핵산은 하기의 군으로 이루어진 적어도 하나의 서열을 추가로 포함하는 것이 바람직하다:
서열 번호 15,
서열 번호 28,
서열 번호 29 및
서열 번호 30.
일부 구현예에서, 본 발명의 핵산은 서열 부분인 서열 번호 15, 서열 번호 28, 서열 번호 29 및 서열 번호 30에 따른 데옥시리보핵산 서열 중 하나 또는 상응하는 리보핵산 서열을 포함한다.
일부 구현예에서, 본 발명은 핵산이 서열 번호 28 또는 상응하는 리보핵산 서열을 포함하는 것을 특징으로 하는 본 발명에 따른 핵산에 관한 것이다.
일부 구현예에서, 본 발명은 핵산이 서열 번호 29 또는 상응하는 리보핵산 서열을 포함하는 것을 특징으로 하는 본 발명에 따른 핵산에 관한 것이다.
일부 구현예에서, 본 발명은 핵산이 서열 번호 28 및 서열 번호 29 또는 상응하는 리보핵산 서열을 포함하는 것으로 특징으로 하는 본 발명에 따른 핵산에 관한 것이다.
본 발명의 핵산은 표준 방법에 의해 세포주 또는 기타 생산 유기체에 혼입될 수 있고 바이러스의 단편 또는 전체 외피의 생산을 자극할 수 있는 특정한 성질을 갖는다. 이러한 목적을 위해 요구되는 표준 방법은 당업자에게 공지되어 있고 구체적인 예의 맥락에서 설명된다.
본 발명자들은 원래 바이러스의 효과적인 복제 가능성에 유용한 것으로 간주되었던 특정 ORF의 누락에도 불구하고, 바이러스 입자가 증폭되고 후속적으로 번역되고 성공적으로 어셈블리될 수 있다는 것을 발견하였다. 생성된 바이러스 입자는 여전히 세포를 감염시키고 비감염성 바이러스 단편의 생성을 유도할 수 있다.
본 발명자들은 SARS-CoV-2 바이러스 게놈(도 5 참조)의 ORF6 및 ORF8이 생략되거나 삭제될 수 있고, 바이러스 어셈블리가 가능한 상태로 유지됨을 발견하였다.
일부 구현예에서, 본 발명은 본 발명에 따른 핵산에 관한 것으로, 여기서, 핵산은 하기를 추가로 포함한다:
a)
1.) 서열 번호 51에 의해 정의된 ORF1ab 서열 또는 서열 번호 51과 적어도 98.5%, 적어도 98.6%, 적어도 98.7%, 적어도 98.8%, 적어도 98.9%, 적어도 99%, 적어도 99.1%, 적어도 99.2%, 적어도 99.3%, 적어도 99.4%, 적어도 99.5%, 적어도 99.6%, 적어도 99.7%, 적어도 99.8% 또는 적어도 99.9% 서열 동일성을 갖는 서열; 또는
2.)
i) 서열 번호 59에 의해 정의된 ORF1b 서열 또는 서열 번호 59와 적어도 98.5%, 적어도 98.6%, 적어도 98.7%, 적어도 98.8%, 적어도 98.9%, 적어도 99%, 적어도 99.1%, 적어도 99.2%, 적어도 99.3%, 적어도 99.4%, 적어도 99.5%, 적어도 99.6%, 적어도 99.7%, 적어도 99.8% 또는 적어도 99.9% 서열 동일성을 갖는 서열; 및
ii) 서열 번호 58에 의해 정의된 ORF1a 서열 또는 서열 번호 58과 적어도 98.6%, 적어도 98.7%, 적어도 98.8%, 적어도 98.9%, 적어도 99%, 적어도 99.1%, 적어도 99.2%, 적어도 99.3%, 적어도 99.4%, 적어도 99.5%, 적어도 99.6%, 적어도 99.7%, 적어도 99.8% 또는 적어도 99.9% 서열 동일성을 갖는 서열;
b) 서열 번호 52에 의해 정의된 ORF3a 서열 또는 서열 번호 52와 적어도 99%, 적어도 99.1%, 적어도 99.3%, 적어도 99.4%, 적어도 99.5%, 적어도 99.7%, 적어도 99.8%, 또는 적어도 99.9% 서열 동일성을 갖는 서열; 및
c) 서열 번호 54에 의해 정의된 ORF7a 서열 또는 서열 번호 54와 적어도 99.5% 서열 동일성을 갖는 서열.
일부 구현예에서, 본 발명은 본 발명에 따른 핵산에 관한 것으로, 여기서, 핵산은 하기를 추가로 포함한다:
a)
1.) 서열 번호 51에 의해 정의된 ORF1ab 서열 또는 서열 번호 51과 적어도 98.5%, 적어도 98.6%, 적어도 98.7%, 적어도 98.8%, 적어도 98.9%, 적어도 99%, 적어도 99.1%, 적어도 99.2%, 적어도 99.3%, 적어도 99.4%, 적어도 99.5%, 적어도 99.6%, 적어도 99.7%, 적어도 99.8% 또는 적어도 99.9% 서열 동일성을 갖는 서열; 또는
2.)
i) 서열 번호 59에 의해 정의된 ORF1b 서열 또는 서열 번호 59와 적어도 98.5%, 적어도 98.6%, 적어도 98.7%, 적어도 98.8%, 적어도 98.9%, 적어도 99%, 적어도 99.1%, 적어도 99.2%, 적어도 99.3%, 적어도 99.4%, 적어도 99.5%, 적어도 99.6%, 적어도 99.7%, 적어도 99.8% 또는 적어도 99.9% 서열 동일성을 갖는 서열; 및
ii) 서열 번호 58에 의해 정의된 ORF1a 서열 또는 서열 번호 58과 적어도 98.6%, 적어도 98.7%, 적어도 98.8%, 적어도 98.9%, 적어도 99%, 적어도 99.1%, 적어도 99.2%, 적어도 99.3%, 적어도 99.4%, 적어도 99.5%, 적어도 99.6%, 적어도 99.7%, 적어도 99.8%, 또는 적어도 99.9% 서열 동일성을 갖는 서열;
b) 서열 번호 52에 의해 정의된 ORF3a 서열 또는 서열 번호 52와 적어도 99%, 적어도 99.1%, 적어도 99.3%, 적어도 99.4%, 적어도 99.5%, 적어도 99.7%, 적어도 99.8%, 또는 적어도 99.9% 서열 동일성을 갖는 서열;
c) 서열 번호 54에 의해 정의된 ORF7a 서열 또는 서열 번호 54와 적어도 99.5% 서열 동일성을 갖는 서열; 및
d) 서열 번호 55에 의해 정의된 ORF8 서열 또는 서열 번호 55와 적어도 99%, 적어도 99.3% 또는 적어도 99.6% 서열 동일성을 갖는 서열.
일부 구현예에서, 본 발명은 본 발명에 따른 핵산에 관한 것으로, 여기서, 핵산은 하기를 추가로 포함한다:
a)
1.) 서열 번호 51에 의해 정의된 ORF1ab 서열 또는 서열 번호 51과 적어도 98.5%, 적어도 98.6%, 적어도 98.7%, 적어도 98.8%, 적어도 98.9%, 적어도 99%, 적어도 99.1%, 적어도 99.2%, 적어도 99.3%, 적어도 99.4%, 적어도 99.5%, 적어도 99.6%, 적어도 99.7%, 적어도 99.8% 또는 적어도 99.9% 서열 동일성을 갖는 서열; 또는
2.)
i) 서열 번호 59에 의해 정의된 ORF1b 서열 또는 서열 번호 59와 적어도 98.5%, 적어도 98.6%, 적어도 98.7%, 적어도 98.8%, 적어도 98.9%, 적어도 99%, 적어도 99.1%, 적어도 99.2%, 적어도 99.3%, 적어도 99.4%, 적어도 99.5%, 적어도 99.6%, 적어도 99.7%, 적어도 99.8% 또는 적어도 99.9% 서열 동일성을 갖는 서열; 및
ii) 서열 번호 58에 의해 정의된 ORF1a 서열 또는 서열 번호 58과 적어도 98.6%, 적어도 98.7%, 적어도 98.8%, 적어도 98.9%, 적어도 99%, 적어도 99.1%, 적어도 99.2%, 적어도 99.3%, 적어도 99.4%, 적어도 99.5%, 적어도 99.6%, 적어도 99.7%, 적어도 99.8%, 또는 적어도 99.9% 서열 동일성을 갖는 서열;
b) 서열 번호 52에 의해 정의된 ORF3a 서열 또는 서열 번호 52와 적어도 99%, 적어도 99.1%, 적어도 99.3%, 적어도 99.4%, 적어도 99.5%, 적어도 99.7%, 적어도 99.8% 또는 적어도 99.9% 서열 동일성을 갖는 서열;
c) 서열 번호 54에 의해 정의된 ORF7a 서열 또는 서열 번호 54와 적어도 99.5% 서열 동일성을 갖는 서열; 및
d) 서열 번호 53에 의해 정의된 ORF6 서열 또는 서열 번호 53과 적어도 94.1% 적어도 94.7%, 적어도 95.2%, 적어도 95.8%, 적어도 96.3%, 적어도 96.8%, 적어도 97.4%, 적어도 97.9%, 적어도 98.5%, 적어도 99%, 또는 적어도 99.6% 서열 동일성을 갖는 서열.
일부 구현예에서, 본 발명은 본 발명에 따른 핵산에 관한 것으로, 여기서, 핵산은 하기를 추가로 포함한다:
a)
1.) 서열 번호 51에 의해 정의된 ORF1ab 서열 또는 서열 번호 51과 적어도 98.5%, 적어도 98.6%, 적어도 98.7%, 적어도 98.8%, 적어도 98.9%, 적어도 99%, 적어도 99.1%, 적어도 99.2%, 적어도 99.3%, 적어도 99.4%, 적어도 99.5%, 적어도 99.6%, 적어도 99.7%, 적어도 99.8% 또는 적어도 99.9% 서열 동일성을 갖는 서열; 또는
2.)
i) 서열 번호 59에 의해 정의된 ORF1b 서열 또는 서열 번호 59와 적어도 98.5%, 적어도 98.6%, 적어도 98.7%, 적어도 98.8%, 적어도 98.9%, 적어도 99%, 적어도 99.1%, 적어도 99.2%, 적어도 99.3%, 적어도 99.4%, 적어도 99.5%, 적어도 99.6%, 적어도 99.7%, 적어도 99.8% 또는 적어도 99.9% 서열 동일성을 갖는 서열; 및
ii) 서열 번호 58에 의해 정의된 ORF1a 서열 또는 서열 번호 58과 적어도 98.6%, 적어도 98.7%, 적어도 98.8%, 적어도 98.9%, 적어도 99%, 적어도 99.1%, 적어도 99.2%, 적어도 99.3%, 적어도 99.4%, 적어도 99.5%, 적어도 99.6%, 적어도 99.7%, 적어도 99.8% 또는 적어도 99.9% 서열 동일성을 갖는 서열;
b) 서열 번호 52에 의해 정의된 ORF3a 서열 또는 서열 번호 52와 적어도 99%, 적어도 99.1%, 적어도 99.3%, 적어도 99.4%, 적어도 99.5%, 적어도 99.7%, 적어도 99.8% 또는 적어도 99.9% 서열 동일성을 갖는 서열;
c) 서열 번호 54에 의해 정의된 ORF7a 서열 또는 서열 번호 54와 적어도 99.5% 서열 동일성을 갖는 서열;
d) 서열 번호 53에 의해 정의된 ORF6 서열 또는 서열 번호 53과 적어도 94.1% 적어도 94.7%, 적어도 95.2%, 적어도 95.8%, 적어도 96.3%, 적어도 96.8%, 적어도 97.4%, 적어도 97.9%, 적어도 98.5%, 적어도 99% 또는 적어도 99.6% 서열 동일성을 갖는 서열; 및
e) 서열 번호 55에 의해 정의된 ORF8 서열 또는 서열 번호 55와 적어도 99%, 적어도 99.3% 또는 적어도 99.6% 서열 동일성을 갖는 서열.
일부 구현예에서, 본 발명은 본 발명에 따른 핵산에 관한 것으로, 여기서, 핵산은 3'UTR, 5'UTR, TRS-L, TRS-B: S, TRS-B: orf3a, TRS-B: E, TRS-B: M, TRS-B: orf6, TRS-B: orf7a, TRS-B: orf8 및/또는 TRS-B: N을 추가로 포함한다.
일부 구현예에서, 본 발명은 본 발명에 따른 핵산에 관한 것으로, 여기서, 핵산은 서열 번호 57에 의해 정의된 3'UTR 및/또는 서열 번호 56에 의해 정의된 5'UTR을 추가로 포함한다.
일부 구현예에서, 본 발명은 본 발명에 따른 핵산에 관한 것으로, 여기서, 핵산은 서열 ACGAAC에 의해 정의된 TRS-L, TRS-B: S, TRS-B: orf3a, TRS-B: E, TRS-B: M, TRS-B: orf6, TRS-B: orf7a, TRS-B: 01T8 및/또는 TRS-B: N을 추가로 포함한다.
일부 구현예에서, 핵산 서열은 서열 번호 41에 의해 정의된 서열 또는 서열 번호 41과 적어도 98.5%, 적어도 98.6%, 적어도 98.7%, 적어도 98.8%, 적어도 98.9%, 적어도 99%, 적어도 99.1%, 적어도 99.2%, 적어도 99.3%, 적어도 99.4%, 적어도 99.5%, 적어도 99.6%, 적어도 99.7%, 적어도 99.8% 또는 적어도 99.9% 서열 동일성을 갖는 서열을 포함한다.
일부 구현예에서, 핵산 서열은 서열 번호 42에 의해 정의된 서열 또는 서열 번호 42와 적어도 98.5%, 적어도 98.6%, 적어도 98.7%, 적어도 98.8%, 적어도 98.9%, 적어도 99%, 적어도 99.1%, 적어도 99.2%, 적어도 99.3%, 적어도 99.4%, 적어도 99.5%, 적어도 99.6%, 적어도 99.7%, 적어도 99.8% 또는 적어도 99.9% 서열 동일성을 갖는 서열을 포함한다.
일부 구현예에서, 핵산 서열은 서열 번호 43에 의해 정의된 서열 또는 서열 번호 43과 적어도 98.5%, 적어도 98.6%, 적어도 98.7%, 적어도 98.8%, 적어도 98.9%, 적어도 99%, 적어도 99.1%, 적어도 99.2%, 적어도 99.3%, 적어도 99.4%, 적어도 99.5%, 적어도 99.6%, 적어도 99.7%, 적어도 99.8% 또는 적어도 99.9% 서열 동일성을 갖는 서열을 포함한다.
일부 구현예에서, 핵산 서열은 서열 번호 44에 정의된 서열 또는 서열 번호 44와 적어도 98.5%, 적어도 98.6%, 적어도 98.7%, 적어도 98.8%, 적어도 98.9%, 적어도 99%, 적어도 99.1%, 적어도 99.2%, 적어도 99.3%, 적어도 99.4%, 적어도 99.5%, 적어도 99.6%, 적어도 99.7%, 적어도 99.8% 또는 적어도 99.9% 서열 동일성을 갖는 서열을 포함한다.
일부 구현예에서, 본원에 기재된 뉴클레오티드 산 서열은 상응하는 리보핵산 서열을 지칭한다.
SARS-CoV-2의 ORF6 및 ORF8은 I형 인터페론 신호전달 경로를 억제하므로(Li, J. Y., et al., 2020, Virus research, 286, 198074), 적절한 면역 반응을 방해한다. 따라서, 벡터에서 SARS-CoV-2의 ORF6 및/또는 ORF8 서열의 결실 또는 생략은 코딩된 바이러스 입자의 재현성을 제한할 뿐만 아니라, 이의 항원성을 증가시킨다.
따라서, 본 발명은 본 발명의 뉴클레오티드 산 서열이 놀라운 항원성 및 제한된 복제 능력을 갖는 바이러스 입자 또는 이의 일부를 코딩한다는 발견에 적어도 부분적으로 기반한다.
일부 구현예에서, 본 발명의 핵산 서열은 벡터 또는 벡터의 일부이다.
본원에 사용된 용어 "벡터"는 그 자체 및/또는 또 다른 핵산 분자를 세포 내로 전달 또는 수송할 수 있는 핵산 분자를 지칭한다. 전달된 핵산은 일반적으로 벡터 핵산 분자에 연결, 즉 이에 삽입된다. 벡터는 세포에서 자율 복제를 지시하는 서열을 포함할 수 있거나, 숙주 세포 DNA로의 통합을 허용하기에 충분한 서열을 포함할 수 있다. 일부 구현예에서, 본원에 기재된 벡터는 플라스미드(예를 들어, DNA 플라스미드 또는 RNA 플라스미드), 셔틀 벡터, 트랜스포존, 코스미드, 박테리아 인공 염색체 및 바이러스 벡터의 군으로부터 선택된 벡터이다.
특정 구현예에서, 본 발명은 본 발명에 따른 벡터에 관한 것으로, 여기서, 벡터는 서열 부분 B를 포함하지 않고 서열 부분 A의 조절은 적어도 하나의 부속 단백질을 포함하지 않는다.
특정 구현예에서, 본 발명은 본 발명에 따른 벡터에 관한 것으로, 여기서, 벡터는 플라스미드 벡터이다.
일부 구현예에서, 본원에 기재된 플라스미드 벡터는 복제의 기원을 결정하는 선택 마커와 서열을 갖는다. 일부 구현예에서, 본 발명은 본 발명에 따른 벡터에 관한 것으로, 여기서, 벡터는 서열 번호 46 및 서열 번호 47에 정의된 서열을 포함한다.
일부 구현예에서, 본 발명은 본 발명에 따른 벡터에 관한 것으로, 여기서, 벡터는 RNA-폴리머라제 프로모터를 코딩하는 적어도 하나의 서열, 및 음성 가닥 RNA의 합성을 가능하게 하고/하거나 양성 가닥 RNA 합성을 가능하게 하는 서열을 포함하는 적어도 하나의 비번역 영역을 포함한다.
일부 구현예에서, 본 발명은 본 발명에 따른 벡터에 관한 것으로, 여기서, 벡터는 T7 프로모터를 코딩하는 적어도 하나의 서열, 및 음성 가닥 RNA의 합성을 가능하게 하고/하거나 양성 가닥 RNA 합성을 가능하게 하는 서열을 포함하는 적어도 2개의 비번역 영역을 포함한다.
일부 구현예에서, 본 발명은 본 발명에 따른 벡터에 관한 것으로, 여기서, 벡터는 서열 번호 28에 의해 정의된 T7 프로모터를 코딩하는 적어도 하나의 서열, 및 서열 번호 56 및 57에 따른 서열을 포함하는 적어도 2개의 비번역 영역을 포함한다.
일부 구현예에서, 본 발명은 본 발명에 따른 벡터에 관한 것으로, 여기서, 벡터는 플라스미드 벡터이다.
일부 구현예에서, 본 발명은 본 발명에 따른 벡터에 관한 것으로, 여기서, 벡터는 서열 번호 45에 정의된 서열을 포함한다.
일부 구현예에서, 본원에 기재된 뉴클레오티드 산 서열은 서열 번호 45와 적어도 50%, 적어도 60%, 적어도 70%, 적어도 80%, 적어도 90%, 적어도 91%, 적어도 92%, 적어도 93%, 적어도 94%, 적어도 95%, 적어도 96%, 적어도 97%, 적어도 98% 또는 적어도 99% 서열 동일성을 갖는 서열을 포함한다.
일부 구현예에서, 본원에 기재된 벡터는, i) 서열 번호 45와 적어도 50%, 적어도 60%, 적어도 70%, 적어도 80%, 적어도 90%, 적어도 91%, 적어도 92%, 적어도 93%, 적어도 94%, 적어도 95%, 적어도 96%, 적어도 97%, 적어도 98% 또는 적어도 99% 서열 동일성을 갖고; ii) 서열 번호 47에 의해 정의된 선택 마커 및 서열 번호 46에 의해 정의된 복제 기원을 포함하는 서열을 포함한다.
일부 구현예에서, 본원에 기재된 벡터는 적어도 하나의 형질감염 증강제, 예를 들어, 올리고뉴클레오티드, 리포플렉스, 폴리머솜, 폴리플렉스, 덴드리머, 무기 나노입자 및 세포-투과 펩티드의 군으로부터 선택된 형질감염 증강제와 조합하여 사용된다.
본원에 기재된 벡터는 증폭 생명공학적 생산 유닛에서 본 발명의 핵산 서열의 효율적인 전달 및/또는 증폭을 위해 사용될 수 있다(실시예 3).
증폭 생명공학적 생산 유닛(예를 들어, 효모 세포)에서 증폭 생성물은 단리될 수 있으며 후속적으로 추가 생명공학적 생산 유닛(예를 들어, 인간 세포)에서 번역될 수 있다.
따라서, 본 발명은 본원에 기재된 벡터가 본원에 기재된 핵산의 효율적인 증폭 및 제한된 복제 능력을 갖지만, 높은 항원성을 갖는 조합 바이러스-유사 단백질의 효율적인 생산을 가능하게 한다는 발견에 적어도 부분적으로 기반한다. 본 발명의 핵산은 상기의 절차를 통해 단백질 및 기타 빌딩 블록을 포함하는 분산액을 생성한다.
원심분리 또는 크로마토그래피와 같은 당업자에게 공지된 적합한 분리 방법은 필요한 경우 사용된 생산 세포주 또는 기타 생산 보조제 또는 유기체의 잔류물로부터도 이러한 빌딩 블록을 분리하여 이들을 정제하는 데 사용될 수 있다.
일부 구현예에서, 본원에 기재된 빌딩 블록은 크로마토그래피, 침전, 초원심분리, 접선-유동 여과(tangential-flow filtration) 및 효소 분해의 군으로부터 선택된 적어도 하나의 분리 방법을 사용하여 정제된다.
이러한 임의로 정제된 바이러스 외피 또는 이의 단편은 백신의 기반을 나타내며, 이는 적용 유형에 따라 상이한 투여 형태로 전달된다.
전형적으로, 이 목적을 위해 애쥬번트, 저장 수명 개선을 위한 안정제, 염 및 완충제가 사용된다. 따라서, 백신은 본원에 기재된 장쇄의 완전 합성 핵산 생성물이다.
일부 구현예에서, 본 발명은 본 발명에 따른 적어도 하나의 핵산, 본 발명에 따른 벡터, 본 발명에 따른 키트 또는 본 발명에 따른 생명공학적 생산 유닛을 사용하여 유전자 발현에 의해 수득 가능한 바이러스 외피, 바이러스 외피의 단편 및/또는 바이러스 외피 단백질에 관한 것으로, 여기서, 바이러스 외피, 바이러스 외피의 단편 및/또는 바이러스 외피 단백질은 본 발명에 따른 적어도 하나의 핵산을 패키징한다.
일부 구현예에서, 본 발명은 본 발명에 따른 적어도 하나의 핵산, 본 발명에 따른 벡터, 본 발명에 따른 키트 또는 본 발명에 따른 생명공학적 생산 유닛을 사용하여 유전자 발현에 의해 수득 가능한 바이러스 외피에 관한 것이다.
본원에 사용된 용어 "바이러스 외피"는 뉴클레오티드 산 서열(예컨대, 본 발명의 뉴클레오티드 산 서열)에 대한 안정화 기능을 갖는 단백질 층과 같은 단백질 어셈블리를 지칭한다. 일부 구현예에서, 본원에 기재된 바이러스 외피는 본 발명의 뉴클레오티드 산 서열을 인간 세포로 동화(assimilation)시키는 것을 가능하게 한다. 일부 구현예에서, 본원에 기재된 바이러스 외피는 스파이크 단백질, 외피 단백질 및 막 단백질을 포함한다.
일부 구현예에서, 본 발명은 본 발명에 따른 적어도 하나의 핵산, 본 발명에 따른 벡터, 본 발명에 따른 키트 또는 본 발명에 따른 생명공학적 생산 유닛을 사용하여 유전자 발현에 의해 수득 가능한 바이러스 외피의 단편에 관한 것이다.
본원에 사용된 용어 "바이러스 외피의 단편"은 불완전한 바이러스 외피를 형성하는 적어도 2개의 어셈블리된 단백질을 지칭한다.
일부 구현예에서, 본 발명은 본 발명에 따른 적어도 하나의 핵산, 본 발명에 따른 벡터, 본 발명에 따른 키트 또는 본 발명에 따른 생명공학적 생산 유닛을 사용하여 유전자 발현에 의해 수득 가능한 바이러스 외피 단백질에 관한 것이다.
본원에 사용된 용어 "바이러스 외피 단백질"은 바이러스 외피의 일부를 형성할 수 있는 적어도 하나의 단백질을 지칭한다.
일부 구현예에서, 본 발명은 본 발명에 따른 적어도 하나의 핵산, 본 발명에 따른 벡터, 본 발명에 따른 키트 또는 본 발명에 따른 생명공학적 생산 유닛을 사용하여 유전자 발현에 의해 수득 가능한 바이러스 외피, 바이러스 외피의 단편 및/또는 바이러스 외피 단백질에 관한 것으로, 여기서, 바이러스 외피, 바이러스 외피의 단편 및/또는 바이러스 외피 단백질은 본 발명에 따른 적어도 하나의 핵산을 패키징한다.
본원에 사용된 용어 "패키지"는 적어도 부분적으로 둘러싸고/둘러싸거나 연결된 것을 의미한다. 일부 구현예에서, 바이러스 외피, 바이러스 외피의 단편 및/또는 바이러스 외피 단백질에 패키징된 본 발명의 핵산은 인간 세포로의 진입을 가능하게 한다.
본 발명의 핵산 및/또는 벡터의 생성물은, 생성물이 바이러스 외피, 바이러스 외피의 단편 및/또는 바이러스 외피 단백질로 구현되는 경우, 상응하는 기능성 바이러스와 특히 높은 항원 유사성을 나타낸다. 따라서, 유발/유도된 면역 반응은 기능성 바이러스와의 실제 접촉에 특히 유익한 면역 반응을 유도할 가능성이 높을 것이다.
바이러스 외피, 바이러스 외피의 단편 및/또는 바이러스 외피 단백질에 패키징된 뉴클레오티드 산은 대상체의 인간 세포로 전달되어 인간 세포에서 바이러스 단백질의 생산을 유도할 수 있다. 그 결과 제한된 복제 능력을 가진 항원 바이러스-유사 단백질의 노출이 연장되고 강화된다.
따라서, 본 발명은 본원에 기재된 벡터가 제한된 복제 능력을 갖지만, 원래 바이러스와 유사한 항원 효과를 갖는 조합 바이러스-유사 단백질의 효율적인 생산을 가능하게 한다는 발견에 적어도 부분적으로 기반한다.
일부 구현예에서, 본 발명은 치료에 사용하기 위한 본 발명의 벡터에 관한 것이다.
일부 구현예에서, 본 발명은 치료에 사용하기 위한 생명공학적 생산 유닛에 관한 것이다.
일부 구현예에서, 본 발명은 치료에 사용하기 위한 본 발명의 바이러스 외피, 바이러스 외피의 단편 및/또는 바이러스 외피 단백질에 관한 것이다.
본원에 사용된 용어 "치료" (및 "치료하다" 또는 "치료하는"과 같은 이의 문법적 변형)는 치료를 받는 개체의 자연적인 경과를 변경하려는 시도의 임상 개입을 지칭하며, 예방을 위해 또는 임상 병리학 과정 동안 수행될 수 있다. 치료의 바람직한 효과는 질환 발생 또는 재발 방지, 증상의 경감, 질환의 임의의 직접적인 또는 간접적인 병리학적 결과의 감소, 질환 진행 속도 감소, 질환 상태의 개선 또는 완화, 및 차도 또는 개선된 예후를 포함하지만, 이에 제한되지 않는다.
일부 구현예에서, 본 발명은 SARS-CoV-2 감염의 치료에 사용하기 위한 본 발명의 벡터, 생명공학적 생산 유닛, 바이러스 외피, 바이러스 외피의 단편 및/또는 바이러스 외피 단백질에 관한 것이다.
일부 구현예에서, 본 발명은 SARS-CoV-2 감염의 예방에 사용하기 위한 본 발명의 벡터, 생명공학적 생산 유닛, 바이러스 외피, 바이러스 외피의 단편 및/또는 바이러스 외피 단백질에 관한 것이다.
일부 구현예에서, 본 발명은 활성 SARS-CoV-2 감염의 치료에 사용하기 위한 본 발명의 벡터, 생명공학적 생산 유닛, 바이러스 외피, 바이러스 외피의 단편 및/또는 바이러스 외피 단백질에 관한 것이다.
일부 구현예에서, 본 발명은 본 발명에 따른 적어도 하나의 핵산 및 생산 유기체에서 본 발명에 따른 적어도 하나의 핵산을 사용하여 유전자 발현에 의해 수득 가능한 생성물을 포함하는 코로나바이러스 SARS-CoV-2에 대한 백신에 관한 것이다.
일부 구현예에서, 본 발명은 본 발명에 따른 적어도 하나의 핵산 및 생산 유기체에서 본 발명에 따른 벡터를 사용하여 수득 가능한 생성물을 포함하는 코로나바이러스 SARS-CoV-2에 대한 백신에 관한 것이다.
일부 구현예에서, 본 발명은 본 발명에 따른 적어도 하나의 핵산 및 생산 유기체에서 본 발명에 따른 키트를 사용하여 수득 가능한 생성물을 포함하는 코로나바이러스 SARS-CoV-2에 대한 백신에 관한 것이다.
일부 구현예에서, 본 발명은 본 발명에 따른 적어도 하나의 핵산 및 생산 유기체에서 본 발명에 따른 적어도 하나의 핵산, 본 발명에 따른 벡터, 본 발명에 따른 키트를 사용하여 유전자 발현에 의해 수득 가능한 생성물을 포함하고, 특히 본 발명에 따른 바이러스 외피, 바이러스 외피의 단편 및/또는 바이러스 외피 단백질을 포함하는 코로나바이러스 SARS-CoV-2에 대한 백신에 관한 것이다.
본원에 사용된 용어 "백신"은 숙주에서 면역 반응을 유도/유발할 수 있고 감염 및/또는 질환을 치료 및/또는 예방할 수 있는 임의의 제제 또는 조성물을 지칭한다. 따라서, 그러한 제제의 비제한적인 예는 단백질, 폴리펩티드, 단백질/폴리펩티드 단편, 면역원, 항원, 펩티드 에피토프, 에피토프, 단백질, 펩티드 또는 에피토프의 혼합물뿐만 아니라, 핵산, 유전자 및/또는 유전자의 일부(관심 있는 폴리펩티드 또는 단백질 또는 이의 단편을 코딩함)를 포함한다.
본원에 사용된 용어 "코로나바이러스 SARS-CoV-2에 대한"은 SARS-CoV-2 감염의 치료 및/또는 예방을 지칭한다.
코로나바이러스의 구조 단백질은 면역 반응을 유발하는 것으로 나타났다(예를 들어, 문헌(Li, J. Y., et al., 2020, Virus research, 286, 198074; Walls, A. C., et al., 2020, Cell, 181(2), 281-292.e6; Chen, Z, et al., 2004, Clinical chemistry, 50(6), 988-995; Peng, Y., et al., 2020, Nature immunology, 21(11), 1336-1345.) 참조). 제공된 수단 및 방법은 동등한 에피토프 및/또는 면역 회피 기전이 감소된 입자를 갖는 백신의 생산 및 투여에 의해 동등한 면역 반응을 유도/유발하는 것을 가능하게 한다. 일부 구현예에서, 백신은 대상체에서 제한된 복제 능력을 갖는 입자의 생성을 유도한다.
따라서, 이러한 백신은 종종 동물 혈청으로부터 유래되어 분자적으로 일관성이 없는 고전적인 백신과는 크게 상이하다. 동물 유기체로부터의 생산은 전통적으로 선택 방법이다. 그러나, 분자적으로 명확하지 않은 생성물은 생산 배치에서 생산 배치에 이르기까지 대량 품질 문제와 편차를 초래한다. 이것은 또한 승인 기간이 길고 종종 뒤늦게만 발견되는 부작용과 관련이 있다. 따라서, 분자적으로 정의된 생성물 조성물은, 본 발명에 따른 핵산을 사용하여 수득할 수 있기 때문에 유리하다.
또한, 본원에 기재된 백신은 명확하게 정의되어 있고 광범위한 항원성 에피토프를 제공한다. 이는 백신이 면역 반응을 향상시키는 애쥬번트에 대한 요구사항이 낮거나 전혀 없다는 이점을 초래한다. 면역 반응을 향상시키는 그러한 보조제는 전형적으로 일부 환자에서 알레르기 반응과 같은 부작용과 관련이 있다. 또한, 본원에 기재된 바와 같은 백신의 주요 활성 성분은 단백질 기반이므로, 다른 백신(예를 들어, RNA 백신)에 비해 열안정성이 더 높다. 따라서, 본 발명의 백신은 이의 안정성으로 인해 쉽게 운반 가능하고 보관 가능하다.
따라서, 본 발명은 본원에 기재된 바와 같은 백신이 코로나바이러스 SARS-CoV-2에 대해 특히 유용하다는 발견에 적어도 부분적으로 기반한다.
일부 구현예에서, 본 발명은 본 발명에 따른 2개 이상의 핵산을 포함하는 키트에 관한 것이다.
일부 구현예에서, 본 발명은 서열 번호 35, 서열 번호 36, 서열 번호 37 및 서열 번호 38의 군으로부터 선택된 적어도 2개의 핵산을 포함하는 키트에 관한 것이다.
이러한 벡터의 조합에서, 키트는 인체 바이러스 단백질의 생산을 가능하게 한다.
핵산에 추가하여, 본 발명은 또한 2개 이상의 핵산을 포함하는 키트에 관한 것으로, 여기서, 핵산은 선행하는 청구항 중 어느 한 항에 따른 데옥시리보핵산(DNA) 및/또는 상응하는 염기쌍 서열을 갖는 상응하는 리보핵산(RNA)이다. 다시 말해서, 상응하는 리보핵산은 티민(T)이 우라실(U)로 대체된 위에 정의된 바와 같은 서열 부분을 갖는다.
본원에 기재된 키트는 필요한 생명공학적 생산 유닛(들) 및 시약을 수집하여 제조될 수 있다. 키트에 포함된 핵산이 DNA 형태로 존재하는 경우, 이들은 적어도 하나의 플라스미드, 바람직하게는 2개 이상의 플라스미드에 존재하는 것이 더 바람직하다. 이는 또한 아래의 구체적인 예의 맥락에서 기재된 바와 같이, 핵산이 상응하는 생명공학적 생산 유닛으로 쉽게 도입되도록 한다.
본 발명의 특정 구현예에서, 본 발명의 키트(상황에 따라 제조될 것임) 또는 본 발명의 방법 및 용도는 사용 설명서(들)를 추가로 포함하거나 제공될 수 있다. 예를 들어, 사용 설명서(들)는 당업자가 본원에 제공된 진단 용도에서 본 발명에 따른 본 발명의 키트를 (어떻게) 사용하는지를 안내할 수 있다. 특히, 상기 사용 설명서(들)는 본원에 제공된 방법 또는 용도를 사용하거나 이를 적용하기 위한 지침을 포함할 수 있다.
따라서, 본 발명은 바이러스 입자 및/또는 이의 부분의 효율적이고 안전한 생산을 가능하게 한다는 발견에 적어도 부분적으로 기반한다.
따라서, 또 다른 양태에 따르면, 본 발명은 또한 위에 정의된 바와 같은 적어도 하나의 플라스미드, 특히 2개 이상의 플라스미드를 포함하는 생명공학적 생산 유닛에 관한 것이다. 본 발명의 이러한 추가 양태가 기반이 되는 생산 유닛은 일반적으로 기재된 목적을 위해 당업자에게 공지된 생산 유기체 또는 세포주이다.
또 다른 양태에 따르면, 본 발명은 또한 적합한 생산 유기체 또는 세포주에서 상응하는 장쇄의 완전 합성 핵산의 적용으로부터 생성된 생성물에 관한 것이다. 이러한 생성물은 종종 추가 당 또는 지방산 기가 있는 외피 단백질 부류에 속한다. 구체적으로, 이러한 추가 양태는 따라서 위에 정의된 바와 같이 핵산을 사용하거나 키트를 사용하여 유전자 발현에 의해 수득 가능한 바이러스 외피, 바이러스 외피의 단편 및/또는 바이러스 외피 단백질에 관한 것이다.
본원에서 중요한 것은 할당이 수학적으로 명확하다는 것이다: 핵산 i는 그것에 정확히 의존하는 생성물 i를 생성한다. 심지어 약간 상이한 핵산 j는 또한 그것에 정확하게 의존하는 또 다른 생성물 j를 생성한다. 생성물과 핵산 간의 둘의 관계는 명확하고 설명 가능하다. 생성물 k의 각 유형은 핵산 k에 할당될 수 있다. 따라서, 핵산과 생성물, 즉 바이러스 외피 또는 이의 단편 간의 직접적인 관계를 말하는 것이 정당하다.
개별적인 분리 가능한 특징에 대한 대안이 본원에서 "구현예"로 제시되는 경우라면, 그러한 대안들이 자유롭게 조합되어 본원에 개시된 본 발명의 별개의 구현예를 형성할 수 있는 것으로 이해된다.
바이러스 외피의 어셈블리는 유기체와 유형에 따라 상이한 속도로 수행되고 청결도가 다양하므로, 실제로 외피와 이의 단편이 항상 함께 발견된다는 점을 언급해야 한다. 그러나, 필요한 경우, 이들은 일반적인 방법으로 분리될 수 있다.
일부 구현예에서, 본원에 기재된 외피는 크로마토그래피, 침전, 초원심분리, 접선-유동 여과 및 효소 분해의 군으로부터 선택된 적어도 하나의 정제 방법을 사용하여 정제된다.
본 발명의 추가 양태에 따르면, 본 발명의 장쇄 핵산의 직접 생성물은 따라서 임의의 정제 단계 및 가능한 보조 수단에 의해 백신으로 전환된다. 구체적으로, 이러한 추가 양태는 따라서 특히 하나 이상의 전술한 단백질 성분 또는 이의 부분을 포함하는 생산 유기체에서 위에 정의된 바와 같은 적어도 하나의 핵산 또는 키트를 사용하여 유전자 발현에 의해 수득 가능한 생성물을 포함하는 백신에 관한 것이다.
이 백신은 전형적으로 전술한 첨가제와 전형적으로 작은 농도의 상기 기재된 바이러스 외피 및/또는 단편을 갖는 생리 식염수이다.
본원에 기재된 백신이 다른 백신보다 애쥬번트의 효과에 덜 의존적이지만, 백신은 여전히 백신의 효과를 향상시키기 위해 애쥬번트를 포함할 수 있다. 일부 구현예에서, 백신은 무기 화합물(예를 들어, 칼륨 명반, 수산화알루미늄, 인산알루미늄, 수산화인산칼슘), 오일(예를 들어, 파라핀 오일, 땅콩 오일), 박테리아 생성물, 사포닌, 사이토카인(예를 들어, IL-1, IL-2, IL-12) 및 스쿠알렌의 군으로부터 선택된 적어도 하나의 애쥬번트를 포함한다.
일부 구현예에서, 백신은 경구 투여, 직장 투여, 흡입, 비강 투여, 비경구 투여, 근육내 투여, 피하 투여 및 피내 투여의 군으로부터 선택된 적어도 하나의 투여 경로에 의해 투여된다.
전형적인 백신은 투여 형태에 따라 주사되거나 점막을 통해 적용될 수 있다.
전술한 바와 같이, 백신은 특히 코로나바이러스 SARS-CoV-2에 대한 백신이다. 구체적으로, 이는 단백질 성분 a, b1, b2, c1 또는 c2, d1 또는 d2로 이루어진 군으로부터 선택된 적어도 2개의 분자적으로 정확하게 정의된 단백질 성분을 포함하고, 이에 의해,
(i) 단백질 성분 a는 SARS-CoV-2의 S 단백질과 유사한 서열 번호 14 및 서열 번호 18에 의해 정의된 서열을 포함하고;
(ii) 단백질 성분 b1은 SARS-CoV-2의 외피 단백질 E 또는 등가 단백질과 유사한 서열 번호 6 및 서열 번호 21에 제시된 서열을 포함하고, 단백질 성분 b2는 MHV59A의 외피 단백질 E와 유사한 서열 번호 8에 따른 서열을 포함하고;
(iii) 단백질 성분 c1은 SARS-CoV-2의 막 단백질 M과 유사한 서열 번호 10 및 서열 번호 22에 따른 서열을 포함하고, 단백질 성분 c2는 MHV59A의 막 단백질 M 또는 등가 단백질과 유사한 서열 번호 12에 따른 서열을 포함하고;
(iv) 단백질 성분 d1은 SARS-CoV-2의 뉴클레오캡시드 인단백질 N 또는 등가 단백질과 유사한 서열 번호 2 및 서열 번호 26에 따른 서열을 포함하고, 단백질 성분 d2는 MHV59A의 뉴클레오캡시드 인단백질 N과 유사한 서열 번호 4에 따른 서열을 포함한다.
단백질 성분 a, b1, b2, c1, c2, d1 또는 d2는 상응하는 자연 발생 유사체와 유사하지만 동일하지 않으며, 이는 상응하는 천연 핵산의 서열과 서열이 상이한 합성 핵산으로부터 생성된다는 사실로부터 비롯된다는 점에 주목해야 한다.
본 설명에 개시된 단백질 성분 a는 서열 번호 14 및 서열 번호 18에 따른 서열을 포함한다. 일부 구현예에서, 단백질 성분은 서열 번호 14와 적어도 90%, 적어도 91%, 적어도 92%, 적어도 93%, 적어도 94%, 적어도 95%, 적어도 96%, 적어도 97%, 적어도 98%, 적어도 99%, 적어도 99.1%, 적어도 99.2%, 적어도 99.3%, 적어도 99.4%, 적어도 99.5%, 적어도 99.6%, 적어도 99.7%, 적어도 99.8% 또는 적어도 99.9% 서열 동일성을 갖는 서열을 포함한다.
일부 구현예에서, 단백질 성분 a는 서열 번호 18과 적어도 90%, 적어도 91%, 적어도 92%, 적어도 93%, 적어도 94%, 적어도 95%, 적어도 96%, 적어도 97%, 적어도 98%, 적어도 99%, 적어도 99.1%, 적어도 99.2%, 적어도 99.3%, 적어도 99.4%, 적어도 99.5%, 적어도 99.6%, 적어도 99.7%, 적어도 99.8% 또는 적어도 99.9% 서열 동일성을 갖는 서열을 포함한다.
본 설명에 개시된 단백질 성분 b1은 서열 번호 6 및 서열 번호 21에 따른 서열을 포함한다. 일부 구현예에서, 단백질 성분 b1은 서열 번호 6과 적어도 90%, 적어도 91%, 적어도 92%, 적어도 93%, 적어도 94%, 적어도 95%, 적어도 96%, 적어도 97%, 적어도 98%, 적어도 99%, 적어도 99.1%, 적어도 99.2%, 적어도 99.3%, 적어도 99.4%, 적어도 99.5%, 적어도 99.6%, 적어도 99.7%, 적어도 99.8% 또는 적어도 99.9% 서열 동일성을 갖는 서열을 포함한다.
일부 구현예에서, 단백질 성분 b1은 서열 번호 21과 적어도 90%, 적어도 91%, 적어도 92%, 적어도 93%, 적어도 94%, 적어도 95%, 적어도 96%, 적어도 97%, 적어도 98%, 적어도 99%, 적어도 99.1%, 적어도 99.2%, 적어도 99.3%, 적어도 99.4%, 적어도 99.5%, 적어도 99.6%, 적어도 99.7%, 적어도 99.8% 또는 적어도 99.9% 서열 동일성을 갖는 서열을 포함한다.
본 설명에 개시된 단백질 성분 b2는 서열 번호 8에 따른 서열을 포함한다. 일부 구현예에서, 단백질 성분 b2는 서열 번호 8과 적어도 90%, 적어도 91%, 적어도 92%, 적어도 93%, 적어도 94%, 적어도 95%, 적어도 96%, 적어도 97%, 적어도 98%, 적어도 99%, 적어도 99.1%, 적어도 99.2%, 적어도 99.3%, 적어도 99.4%, 적어도 99.5%, 적어도 99.6%, 적어도 99.7%, 적어도 99.8% 또는 적어도 99.9% 서열 동일성을 갖는 서열을 포함한다.
본 설명에 개시된 단백질 성분 c1은 서열 번호 10 및 서열 번호 22에 따른 서열을 포함한다. 일부 구현예에서, 단백질 성분 c1은 서열 번호 10과 적어도 90%, 적어도 91%, 적어도 92%, 적어도 93%, 적어도 94%, 적어도 95%, 적어도 96%, 적어도 97%, 적어도 98%, 적어도 99%, 적어도 99.1%, 적어도 99.2%, 적어도 99.3%, 적어도 99.4%, 적어도 99.5%, 적어도 99.6%, 적어도 99.7%, 적어도 99.8% 또는 적어도 99.9% 서열 동일성을 갖는 서열을 포함한다.
일부 구현예에서, 단백질 성분 c1은 서열 번호 22와 적어도 90%, 적어도 91%, 적어도 92%, 적어도 93%, 적어도 94%, 적어도 95%, 적어도 96%, 적어도 97%, 적어도 98%, 적어도 99%, 적어도 99.1%, 적어도 99.2%, 적어도 99.3%, 적어도 99.4%, 적어도 99.5%, 적어도 99.6%, 적어도 99.7%, 적어도 99.8% 또는 적어도 99.9% 서열 동일성을 갖는 서열을 포함한다.
본 설명에 개시된 단백질 성분 c2는 서열 번호 12에 따른 서열을 포함한다. 일부 구현예에서, 단백질 성분 c2는 서열 번호 12와 적어도 90%, 적어도 91%, 적어도 92%, 적어도 93%, 적어도 94%, 적어도 95%, 적어도 96%, 적어도 97%, 적어도 98%, 적어도 99%, 적어도 99.1%, 적어도 99.2%, 적어도 99.3%, 적어도 99.4%, 적어도 99.5%, 적어도 99.6%, 적어도 99.7%, 적어도 99.8% 또는 적어도 99.9% 서열 동일성을 갖는 서열을 포함한다.
본 설명에 개시된 단백질 성분 d1은 서열 번호 2 및 서열 번호 26에 따른 서열을 포함한다. 일부 구현예에서, 단백질 성분 d1은 서열 번호 2와 적어도 90%, 적어도 91%, 적어도 92%, 적어도 93%, 적어도 94%, 적어도 95%, 적어도 96%, 적어도 97%, 적어도 98%, 적어도 99%, 적어도 99.1%, 적어도 99.2%, 적어도 99.3%, 적어도 99.4%, 적어도 99.5%, 적어도 99.6%, 적어도 99.7%, 적어도 99.8% 또는 적어도 99.9% 서열 동일성을 갖는 서열을 포함한다.
일부 구현예에서, 단백질 성분 d1은 서열 번호 26과 적어도 90%, 적어도 91%, 적어도 92%, 적어도 93%, 적어도 94%, 적어도 95%, 적어도 96%, 적어도 97%, 적어도 98%, 적어도 99%, 적어도 99.1%, 적어도 99.2%, 적어도 99.3%, 적어도 99.4%, 적어도 99.5%, 적어도 99.6%, 적어도 99.7%, 적어도 99.8% 또는 적어도 99.9% 서열 동일성을 갖는 서열을 포함한다.
본 설명에 개시된 단백질 성분 d2는 서열 번호 4에 따른 서열을 포함한다. 일부 구현예에서, 단백질 성분 d2는 서열 번호 4와 적어도 90%, 적어도 91%, 적어도 92%, 적어도 93%, 적어도 94%, 적어도 95%, 적어도 96%, 적어도 97%, 적어도 98%, 적어도 99%, 적어도 99.1%, 적어도 99.2%, 적어도 99.3%, 적어도 99.4%, 적어도 99.5%, 적어도 99.6%, 적어도 99.7%, 적어도 99.8% 또는 적어도 99.9% 서열 동일성을 갖는 서열을 포함한다.
본원에 기재된 아미노산 서열에 대해 특정 % 서열 동일성을 갖는 단백질 성분은, 예를 들어, 적어도 하나의 아미노산을, 그러나 서열 번호 2, 서열 번호 4, 서열 번호 6, 서열 번호 8, 서열 번호 10, 서열 번호 12, 서열 번호 14, 서열 번호 18, 서열 번호 21, 서열 번호 22, 및/또는 서열 번호 26의 아미노산 서열에 대해 아미노산의 10% 이하, 9% 이하, 8% 이하, 7% 이하, 6% 이하, 5% 이하, 4% 이하, 3% 이하, 2% 이하, 1% 이하, 0.9% 이하, 0.8% 이하, 0.7% 이하, 0.6% 이하, 0.5% 이하, 0.4% 이하, 0.3% 이하, 0.2% 이하 또는 0.1% 이하를 삽입, 결실, 치환 및/또는 변형하여 수득될 수 있다. 그러한 삽입, 결실, 치환 및/또는 변형은 원하는 삽입, 결실, 치환 및/또는 변형을 코딩하는 본원에 기재된 상응하는 뉴클레오티드 산 서열(예를 들어, 본원에 기재된 단백질 성분의 돌연변이된 변이체를 코딩하는 SARS-CoV-2 변이체의 뉴클레오티드 산 서열)을 기반으로 달성될 수 있다.
삽입, 결실, 치환 및/또는 변형은 또한 번역 후 변형의 결과일 수 있다. 일부 구현예에서, 본원에 기재된 단백질 성분은 생산 과정을 개선하기 위해 번역 후 변형된다. 일부 구현예에서, 본원에 기재된 단백질 성분은 기재된 단백질 성분의 적어도 하나의 단백질 성질, 예컨대, 항원성, 단백질 안정성, 약동학, 약력학, 약물과의 상호작용 및 애쥬번트와의 상호작용의 군으로부터 선택된 단백질 성질을 개선하기 위해 번역 후 변형된다. 일부 구현예에서, 본원에 기재된 단백질 성분은 적어도 다른 단백질 또는 펩티드에 연결된 작용기의 추가, 아미노산의 화학적 변형(예를 들어, 시트룰린화, 탈아미노화, 탈아미드화, 제거), 이황화 브릿지, 시스테인 아미노산 연결, 펩티드 결합 절단, 이소아스파르테이트 형성, 라세미화 및 단백질 스플라이싱의 군으로부터 선택된 기술에 의해 번역 후 변형된다.
따라서, 본원에 기재된 아미노산 서열은 본원에 기재된 뉴클레오티드 산 서열과 비례하는 % 서열 동일성이 반드시 중복되는 것은 아니다. 일부 구현예에서, 본 발명의 아미노산 서열은 변경된 뉴클레오티드 산 서열이 본원에 기재된 뉴클레오티드 산 서열과 상이한 것보다 서열 번호 2, 서열 번호 4, 서열 번호 6, 서열 번호 8, 서열 번호 10, 서열 번호 12, 서열 번호 14, 서열 번호 18, 서열 번호 21, 서열 번호 22 및/또는 서열 번호 26에 기재된 서열과 적어도 10%, 적어도 9%, 적어도 8%, 적어도 7%, 적어도 6%, 적어도 5%, 적어도 4%, 적어도 3%, 적어도 2%, 적어도 1%, 적어도 0.9%, 적어도 0.8%, 적어도 0.7%, 적어도 0.6%, 적어도 0.5%, 적어도 0.4%, 적어도 0.3%, 적어도 0.2%, 적어도 0.1% 이상 상이하다.
일부 구현예에서, 본 발명은 단백질 성분 a, b1, b2, c1 또는 c2, d1 또는 d2로 이루어진 군으로부터 선택된 적어도 2개의 분자적으로 정확하게 정의된 단백질 성분을 포함하는 본 발명에 따른 백신에 관한 것으로, 여기서
(i) 단백질 성분 a는
a) SARS-CoV-2의 S 단백질과 유사한 서열 번호 14에 따른 서열 또는 서열 번호 14와 적어도 90%, 적어도 91%, 적어도 92%, 적어도 93%, 적어도 94%, 적어도 95%, 적어도 96%, 적어도 97%, 적어도 98%, 적어도 99%, 적어도 99.1%, 적어도 99.2%, 적어도 99.3%, 적어도 99.4%, 적어도 99.5%, 적어도 99.6%, 적어도 99.7%, 적어도 99.8% 또는 적어도 99.9% 서열 동일성을 갖는 서열; 또는
b) SARS-CoV-2의 S 단백질과 유사한 서열 번호 18에 따른 서열 또는 서열 번호 18과 적어도 90%, 적어도 91%, 적어도 92%, 적어도 93%, 적어도 94%, 적어도 95%, 적어도 96%, 적어도 97%, 적어도 98%, 적어도 99%, 적어도 99.1%, 적어도 99.2%, 적어도 99.3%, 적어도 99.4%, 적어도 99.5%, 적어도 99.6%, 적어도 99.7%, 적어도 99.8% 또는 적어도 99.9% 서열 동일성을 갖는 서열
을 포함하고;
(ii) 단백질 성분 b1은
a) SARS-CoV-2의 외피 단백질 E와 유사한 서열 번호 6에 따른 서열 또는 서열 번호 6과 적어도 90%, 적어도 91%, 적어도 92%, 적어도 93%, 적어도 94%, 적어도 95%, 적어도 96%, 적어도 97%, 적어도 98%, 적어도 99%, 적어도 99.1%, 적어도 99.2%, 적어도 99.3%, 적어도 99.4%, 적어도 99.5%, 적어도 99.6%, 적어도 99.7%, 적어도 99.8% 또는 적어도 99.9% 서열 동일성을 갖는 서열; 또는
b) SARS-CoV-2의 외피 단백질 E와 유사한 서열 번호 21에 따른 서열 또는 서열 번호 21과 적어도 90%, 적어도 91%, 적어도 92%, 적어도 93%, 적어도 94%, 적어도 95%, 적어도 96%, 적어도 97%, 적어도 98%, 적어도 99%, 적어도 99.1%, 적어도 99.2%, 적어도 99.3%, 적어도 99.4%, 적어도 99.5%, 적어도 99.6%, 적어도 99.7%, 적어도 99.8% 또는 적어도 99.9% 서열 동일성을 갖는 서열
을 포함하며;
단백질 성분 b2는 MHV59A의 외피 단백질 E 또는 등가 단백질과 유사한 서열 번호 8에 따른 서열 또는 서열 번호 8과 적어도 90%, 적어도 91%, 적어도 92%, 적어도 93%, 적어도 94%, 적어도 95%, 적어도 96%, 적어도 97%, 적어도 98%, 적어도 99%, 적어도 99.1%, 적어도 99.2%, 적어도 99.3%, 적어도 99.4%, 적어도 99.5%, 적어도 99.6%, 적어도 99.7%, 적어도 99.8% 또는 적어도 99.9% 서열 동일성을 갖는 서열을 포함하고;
(iii) 단백질 성분 c1은
a) SARS-CoV-2의 외피 단백질 E와 유사한 서열 번호 10에 따른 서열 또는 서열 번호 10과 적어도 90%, 적어도 91%, 적어도 92%, 적어도 93%, 적어도 94%, 적어도 95%, 적어도 96%, 적어도 97%, 적어도 98%, 적어도 99%, 적어도 99.1%, 적어도 99.2%, 적어도 99.3%, 적어도 99.4%, 적어도 99.5%, 적어도 99.6%, 적어도 99.7%, 적어도 99.8% 또는 적어도 99.9% 서열 동일성을 갖는 서열; 또는
b) SARS-CoV-2의 막 단백질 M과 유사한 서열 번호 22에 따른 서열 또는 서열 번호 22와 적어도 90%, 적어도 91%, 적어도 92%, 적어도 93%, 적어도 94%, 적어도 95%, 적어도 96%, 적어도 97%, 적어도 98%, 적어도 99%, 적어도 99.1%, 적어도 99.2%, 적어도 99.3%, 적어도 99.4%, 적어도 99.5%, 적어도 99.6%, 적어도 99.7%, 적어도 99.8% 또는 적어도 99.9% 서열 동일성을 갖는 서열
을 포함하고;
단백질 성분 c2는 MHV59A의 막 단백질 M 또는 등가 단백질과 유사한 서열 번호 12에 따른 서열 또는 서열 번호 12와 적어도 90%, 적어도 91%, 적어도 92%, 적어도 93%, 적어도 94%, 적어도 95%, 적어도 96%, 적어도 97%, 적어도 98%, 적어도 99%, 적어도 99.1%, 적어도 99.2%, 적어도 99.3%, 적어도 99.4%, 적어도 99.5%, 적어도 99.6%, 적어도 99.7%, 적어도 99.8% 또는 적어도 99.9% 서열 동일성을 갖는 서열을 포함하고;
(iv) 단백질 성분 d1은
a) SARS-CoV-2의 뉴클레오캡시드 인단백질 N과 유사한 서열 번호 2에 따른 서열 또는 서열 번호 2와 적어도 90%, 적어도 91%, 적어도 92%, 적어도 93%, 적어도 94%, 적어도 95%, 적어도 96%, 적어도 97%, 적어도 98%, 적어도 99%, 적어도 99.1%, 적어도 99.2%, 적어도 99.3%, 적어도 99.4%, 적어도 99.5%, 적어도 99.6%, 적어도 99.7%, 적어도 99.8% 또는 적어도 99.9% 서열 동일성을 갖는 서열; 또는
b) SARS-CoV-2의 뉴클레오캡시드 인단백질 N과 유사한 서열 번호 26에 따른 서열 또는 서열 번호 26과 적어도 90%, 적어도 91%, 적어도 92%, 적어도 93%, 적어도 94%, 적어도 95%, 적어도 96%, 적어도 97%, 적어도 98%, 적어도 99%, 적어도 99.1%, 적어도 99.2%, 적어도 99.3%, 적어도 99.4%, 적어도 99.5%, 적어도 99.6%, 적어도 99.7%, 적어도 99.8% 또는 적어도 99.9% 서열 동일성을 갖는 서열
을 포함하고;
단백질 성분 d2는 MHV59A의 뉴클레오캡시드 인단백질 N 또는 등가 단백질과 유사한 서열 번호 4에 따른 서열 또는 서열 번호 4와 적어도 90%, 적어도 91%, 적어도 92%, 적어도 93%, 적어도 94%, 적어도 95%, 적어도 96%, 적어도 97%, 적어도 98%, 적어도 99%, 적어도 99.1%, 적어도 99.2%, 적어도 99.3%, 적어도 99.4%, 적어도 99.5%, 적어도 99.6%, 적어도 99.7%, 적어도 99.8% 또는 적어도 99.9% 서열 동일성을 갖는 서열을 포함한다.
일부 구현예에서, 본 발명은 단백질 성분 a, b1, c1 및 d1로 이루어진 군으로부터 선택된 적어도 2개의 분자적으로 정확하게 정의된 단백질 성분을 포함하는 본 발명에 따른 백신에 관한 것이다.
일부 구현예에서, 본 발명은 단백질 성분 b1, c1 및 d1로 이루어진 군으로부터 선택된 적어도 2개의 분자적으로 정확하게 정의된 단백질 성분을 포함하는 본 발명에 따른 백신에 관한 것이다.
일부 구현예에서, 본 발명은 단백질 성분 a, c1 및 d1로 이루어진 군으로부터 선택된 적어도 2개의 분자적으로 정확하게 정의된 단백질 성분을 포함하는 본 발명에 따른 백신에 관한 것이다.
일부 구현예에서, 본 발명은 단백질 성분 a, b1 및 d1로 이루어진 군으로부터 선택된 적어도 2개의 분자적으로 정확하게 정의된 단백질 성분을 포함하는 본 발명에 따른 백신에 관한 것이다.
일부 구현예에서, 본 발명은 단백질 성분 a, b1 및 c1로 이루어진 군으로부터 선택된 적어도 2개의 분자적으로 정확하게 정의된 단백질 성분을 포함하는 본 발명에 따른 백신에 관한 것이다.
일부 구현예에서, 본 발명은 적어도 2개의 분자적으로 정확하게 정의된 단백질 성분 a 및 c1을 포함하는 본 발명에 따른 백신에 관한 것이다.
일부 구현예에서, 본 발명은 적어도 2개의 분자적으로 정확하게 정의된 단백질 성분 a 및 d1을 포함하는 본 발명에 따른 백신에 관한 것이다.
일부 구현예에서, 본 발명은 적어도 2개의 분자적으로 정확하게 정의된 단백질 성분 c1 및 d1을 포함하는 본 발명에 따른 백신에 관한 것이다.
일부 구현예에서, 본 발명은 적어도 2개의 분자적으로 정확하게 정의된 단백질 성분 a, 및 b1, c1 및 d1을 포함하는 본 발명에 따른 백신에 관한 것이다.
일부 구현예에서, 본 발명은 단백질 성분 a, b1, c1 및 d1로 이루어진 군으로부터 선택된 적어도 3개의 분자적으로 정확하게 정의된 단백질 성분을 포함하는 본 발명에 따른 백신에 관한 것이다.
일부 구현예에서, 본 발명은 단백질 성분 a, b1, c1 및 d1로 이루어진 군으로부터 선택된 3개의 분자적으로 정확하게 정의된 단백질 성분을 포함하는 본 발명에 따른 백신에 관한 것이다.
본원에 기재된 단백질 성분을 포함하는 본 발명에 따른 백신은 실질적이고 광범위한 면역 반응을 유발할 수 있다. 동시에, 백신은 대상체의 체내에서 복제되지 않는다는 점에서 복제 능력이 제한될 수 있다. 그러한 제한된 복제 능력, 예를 들어, 효율적인 복제를 위해 필요한 서열을 생략하거나 변경함으로써 달성될 수 있다.
따라서, 본 발명은 본원에 기재된 단백질 성분의 조합을 포함하는 백신이 항원 가능성을 크게 유지하면서 복제 능력에서 원하는 제한을 나타낼 수 있다는 발견에 적어도 부분적으로 기반한다.
또한, 본 발명은 핵산 기반 mRNA로부터 출발하여 형질감염에 의해 제1항 내지 제10항 중 어느 한 항에 따른 적어도 하나의 핵산을 생명공학적 생산 유닛, 특히 세포주에 도입하고, 단백질 성분 a, b1, b2, c1, c2, d1 또는 d2로 이루어진 군으로부터 선택된 단백질 성분 중 적어도 2개를 번역에 의해 제조하고, 이로부터 수득된 단백질 성분을 정제하는 연속 단계를 포함하는 백신 생산 방법에 관한 것이다.
일부 구현예에서, 본 발명은 하기의 연속 단계를 포함하는 본 발명에 따른 백신의 생산 방법에 관한 것이다:
a) 구현예 10 내지 14 중 어느 하나에 따른 벡터를 생명공학적 생산 유닛, 특히 세포주에 도입하는 단계로서,
여기서, 단백질 성분 a, b1, b2, c1, c2, d1 또는 d2로 이루어진 군으로부터 선택된 단백질 성분 중 적어도 2개를 코딩하는 핵산 기반 mRNA는 번역에 의해 제조되는 것인 단계;
b) 단계 a)에서 생명공학적 생산 유닛으로부터 단백질 성분을 수득하는 단계; 및
c) 수득된 단백질 성분을 정제하여 본 발명에 따른 백신을 수득하는 단계.
일부 구현예에서, 본 발명은 본 발명에 따른 적어도 하나의 벡터를 포함하는 생명공학적 생산 유닛에 관한 것이다.
용어 "생명공학적 생산 유닛" 및 "생산 유기체"는 본원에서 상호교환적으로 사용되며, 발현을 위해 본 발명의 핵산이 도입된 적어도 하나의 숙주 세포를 지칭하며, 그러한 세포의 자손, 유기체 및 그러한 세포 및/또는 그러한 세포의 자손을 포함하는 생명공학적 유닛을 포함한다. 숙주 세포는 계대 수에 관계없이 1차 형질전환된 세포 및 이로부터 유래된 자손을 포함하는 "형질전환체" 및 "형질전환된 세포"를 포함한다. 자손은 핵산 함량이 모세포와 완전히 동일하지 않을 수 있지만, 돌연변이를 포함할 수 있다. 원래 형질전환된 세포에서 스크리닝되거나 선택된 것과 동일한 기능 또는 생물학적 활성을 갖는 돌연변이 자손이 본원에 포함된다.
용어 "증폭 생명공학적 생산 유닛"은 큰 벡터(예를 들어, 4,000개 초과의 염기, 10,000개 초과의 염기, 35,000개 초과의 염기)의 증폭을 허용하는 임의의 생명공학적 생산 유닛을 지칭한다. 일부 구현예에서, 본원에 기재된 증폭 생명공학적 생산 유닛은 효모 세포를 포함한다.
특정 구현예에서, 숙주 세포는 줄기 세포이다. 다른 구현예에서, 숙주 세포는 분화된 세포이다.
본원에 기재된 생명공학적 생산 유닛은 SARS-CoV-2의 바이러스 진입을 허용하는 세포를 포함하여 생명공학적 생산 유닛의 세포 생성물이 생명공학적 생산 유닛의 추가 세포에 진입할 수 있는 경우 특히 유용하다. 생명공학적 생산의 세포에 대한 이러한 후속 감염은 벡터를 숙주 세포로 가져오는 과정을 촉진하고 가속화한다.
일부 구현예에서, 본원에 기재된 생명공학적 생산 유닛은 SARS-CoV-2의 바이러스 진입을 허용하는 세포를 포함한다. 일부 구현예에서, 본원에 기재된 생명공학적 생산 유닛은 인간 ACE2 수용체 또는 기능적 인간-유사 ACE2 수용체를 발현하는 세포를 포함한다. SARS-CoV-2의 바이러스 진입을 허용하는 인간-유사 ACE2 수용체는 당업자에게 공지되어 있다(예를 들어, 문헌(Damas, J., et al., 2020, Proceedings of the National Academy of Sciences, 117(36), 22311-22322) 참조).
일부에서, 본원에 기재된 생명공학적 생산 유닛은 HEK293, MDCK, 차이니즈 햄스터 난소(CHO), SF9, Vero, MRC 5, Per.C6, PMK 및 WI-38의 군으로부터 선택된 적어도 하나의 세포 유형을 포함한다.
일부 구현예에서, 본원에 기재된 생명공학적 생산 유닛은 적어도 부분적으로 인간인 세포 또는 적어도 부분적으로 인간 세포주의 세포를 포함한다.
일부 구현예에서, 본원에 기재된 생명공학적 생산 유닛은 그 안에 선택적으로 복제 가능한 본 발명의 뉴클레오티드 또는 본 발명의 벡터를 포함하는 바이러스 입자의 생산을 허용하는 세포를 포함하며, 그 세포는 생명공학적 생산 유닛의 세포에서는 완전히 복제될 수 있지만, 인체의 세포에서는 그렇지 않거나 실질적으로 복제되지 않는다. 이러한 선택적 복제 가능성은 바이러스 입자의 복제를 위한 상보적 단백질을 포함하는 세포에 의해 달성된다(예를 들어, 실시예 참조).
일부 구현예에서, 본원에 기재된 생명공학적 생산 유닛은 바이러스 복제를 위해 적어도 하나의 단백질을 발현할 수 있는 세포를 포함한다. 일부 구현예에서, 본원에 기재된 생명공학적 생산 유닛은 본 발명의 뉴클레오티드 산 서열 또는 본 발명의 벡터에 코딩되지 않은 바이러스 복제를 위한 적어도 하나의 단백질 성분을 발현할 수 있는 세포를 포함한다.
본 발명의 벡터에 의한 숙주 세포의 형질도입은 안정하거나 일시적인 형질도입에 의해 달성될 수 있다(예를 들어, 문헌(Stepanenko, A. A., and Heng, H. H., 2017, Mutation Research/Reviews in Mutation Research, 773, 91-103) 참조).
DNA가 제1 구현예에 따라 생산 유닛에 도입된다면, 이것은 일반적으로 이러한 목적에 적합한 플라스미드를 사용하여 수행된다.
대안적으로, DNA는 임의의 종류의 벡터에 의해 생명공학적 생산 유닛에 도입될 수 있다.
한편, RNA가 제2 구현예에 따라 도입된다면, 단백질 성분 a, b1, b2, c1, c2, d1 또는 d2를 코딩하는 서열에 추가하여 RNA-의존성 RNA 폴리머라제를 코딩하는 서열(서열 번호 30에 따름)이 도입된다. 이 서열은 템플릿으로서 존재하는 양성 RNA 가닥으로부터 음성 RNA 가닥을 먼저 형성한 다음, 이로부터 상응하는 메신저 RNA를 생성하는 것을 가능하게 한다.
절차의 이러한 제2 구현예의 맥락에서, 수득된 백신이 효소적 전사에 의해 수득 가능한 완전 합성 장쇄 리보핵산(서열 번호 33 또는 34에 따름)을 추가로 포함하는 것이 바람직하다.
절차의 이러한 제2 구현예의 맥락에서, 수득된 백신이 서열 번호 28에 따른 서열의 T7 전사를 통해 수득 가능한 완전 합성 장쇄 리보핵산(서열 번호 33 또는 34에 따름)을 추가로 포함하는 것이 또한 바람직하다.
"a," "an," 및 "the"는 본원에서 관사의 문법적 대상 중 하나 또는 하나 초과(즉, 적어도 하나, 또는 하나 이상)를 지칭하는 데 사용된다.
"또는"은 대안 중 하나, 둘 다 또는 이들의 임의의 조합을 의미하는 것으로 이해되어야 한다.
"및/또는"은 대안 중 하나 또는 둘 다를 의미하는 것으로 이해되어야 한다.
본 명세서 전반에 걸쳐, 문맥에 달리 요구하지 않는 한, "포함하다(comprise)", "포함하다(comprises)" 및 "포함하는(comprising)"이라는 단어는 언급된 단계 또는 요소 또는 단계 또는 요소의 군을 포함하지만 임의의 다른 단계 또는 요소 또는 단계 또는 요소의 군을 배제하지 않음을 의미하는 것으로 이해될 것이다.
"포함하다(include)" 및 "포함하다(comprise)"라는 용어는 동의어로 사용된다. "바람직하게는"은 다른 옵션을 배제하지 않는 일련의 옵션 중 하나의 옵션을 의미한다. "예를 들어"는 언급된 예로 제한되지 않는 하나의 예를 의미한다. "이루어진"이란 "이루어진"이라는 문구 뒤에 오는 모든 것을 포함하며 이에 제한되지 않는다.
본 명세서 전반에 걸쳐 "일 구현예(one embodiment)", "일 구현예(an embodiment)", "특정 구현예(particular embodiment)", "관련 구현예", "특정 구현예(certain embodiment)", "추가 구현예", "일부 구현예", "특정 실시예" 또는 "추가 구현예" 또는 이들의 조합에 대한 참조는 구현예와 관련하여 설명된 특정 특징, 구조 또는 특성이 본 발명의 적어도 일 구현예에 포함된다는 것을 의미한다. 따라서, 본 명세서 전반에 걸쳐 다양한 곳에서 전술한 문구의 출현은 반드시 모두 동일한 구현예를 지칭하는 것은 아니다. 또한, 특정 특징, 구조, 또는 특성은 하나 이상의 구현예에서 임의의 적합한 방식으로 조합될 수 있다. 또한, 일 구현예에서 특징의 긍정적인 언급은 특정 구현예에서 특징을 배제하기 위한 기초 역할을 하는 것으로 이해된다. 본 발명은 첨부된 도면과 함께 하기 설계 예에 의해 추가로 예시되며, 이는 청구범위에 기재된 본 발명의 범위를 제한하지 않는다.
도 1: SARS-CoV2의 뉴클레오캡시드 단백질(N) (서열 번호 35), 외피 단백질(E) (서열 번호 36), 막 단백질(M) (서열 번호 37) 및 스파이크 당단백질(S) (서열 번호 38)을 코딩하는 모노-시스트론 발현 플라스미드의 플라스미드 맵. 플라스미드 맵 내부의 숫자는 염기쌍에서의 DNA 좌표를 나타낸다. N, E, M 및 S의 단백질-코딩 서열은 화살표로 표시되며 서열 목록에 명시된 바와 같이 서열 번호 1, 2, 3 및 4 (N), 5, 6, 7 및 8 (E), 9, 10, 11 및 12 (M), 13 및 14 (S)의 DNA 및 단백질 서열을 나타낸다.
도 2: 모노-시스트론 발현 플라스미드 pcDNA34 syn N(서열 번호 35) (아래 도면)과 함께 실시예 2에 나타낸 바와 같이 세포주에서 백신 생산에 사용될 수 있는 폴리-시스트론 발현 작제물 COVAX191△N(서열 번호 33 및 39) (위 도면)의 게놈 맵. 숫자는 COVAX191△N에 대한 킬로베이스(K)의 DNA 좌표를 지칭하고 pcDNA34 syn N 작제물(서열 번호 35)에 대한 염기쌍 위치를 지칭한다. 폴리단백질 1a 및 1b, E, M S(위 도면) 및 뉴클레오캡시드 단백질 syn N(아래 도면)의 단백질-코딩 서열은 화살표로 표시된다.
도 3: 뉴클레오캡시드 단백질(N), 외피 단백질(E), 막 단백질(M) 및 스파이크 당단백질(S)에 대한 모노-시스트론, 플라스미드 기반 발현 작제물의 아가로스 겔 전기영동 크기 분리. 겔의 좌측은 뉴클레오캡시드 단백질(N), 외피 단백질(E) 및 막 단백질(M)에 대한 MHV A59(MHV) 유래 작제물을 나타낸다. 겔의 우측은 뉴클레오캡시드 단백질(N), 외피 단백질(E), 막 단백질(M) 및 스파이크 당단백질(S)에 대한 SARS-CoV2를 기반으로 하는 유래된 작제물을 나타낸다.
도 4: 원형 40,556 bp DNA 작제물 COVAX191△N(서열 번호 40) (위 도면) 및 38,383 bp DNA 작제물 COVAX191△N△HE(서열 번호 40) (아래 도면)의 상응하는 DNA 시퀀싱 커버 그래프가 있는 개략도. 화살표는 복제 폴리단백질 1A 및 1B(1A, 1B), 헤마글루티닌 에스테라제(HE), 스파이크 당단백질(S), 외피 단백질(E) 및 막 단백질(M)의 재코딩된 CDS에 대한 단백질-코딩 서열의 위치를 나타낸다. 단일 리튬 아세테이트 효모 형질전환을 사용하여 6개의 합성 DNA 블록으로부터 COVAX191△N 및 COVAX191△N△HE의 완전한 게놈을 어셈블리하고 영양요구성(auxotrophic) URA3 마커에 대해 선택하였다.
도 5: SARS-CoV-2 게놈 및 생성된 결실 변이체의 개략도.
표 S1: 에스. 세레비시아(S. cerevisiae) (효모)에서 COVAX191의 DNA 어셈블리 효율
도 2: 모노-시스트론 발현 플라스미드 pcDNA34 syn N(서열 번호 35) (아래 도면)과 함께 실시예 2에 나타낸 바와 같이 세포주에서 백신 생산에 사용될 수 있는 폴리-시스트론 발현 작제물 COVAX191△N(서열 번호 33 및 39) (위 도면)의 게놈 맵. 숫자는 COVAX191△N에 대한 킬로베이스(K)의 DNA 좌표를 지칭하고 pcDNA34 syn N 작제물(서열 번호 35)에 대한 염기쌍 위치를 지칭한다. 폴리단백질 1a 및 1b, E, M S(위 도면) 및 뉴클레오캡시드 단백질 syn N(아래 도면)의 단백질-코딩 서열은 화살표로 표시된다.
도 3: 뉴클레오캡시드 단백질(N), 외피 단백질(E), 막 단백질(M) 및 스파이크 당단백질(S)에 대한 모노-시스트론, 플라스미드 기반 발현 작제물의 아가로스 겔 전기영동 크기 분리. 겔의 좌측은 뉴클레오캡시드 단백질(N), 외피 단백질(E) 및 막 단백질(M)에 대한 MHV A59(MHV) 유래 작제물을 나타낸다. 겔의 우측은 뉴클레오캡시드 단백질(N), 외피 단백질(E), 막 단백질(M) 및 스파이크 당단백질(S)에 대한 SARS-CoV2를 기반으로 하는 유래된 작제물을 나타낸다.
도 4: 원형 40,556 bp DNA 작제물 COVAX191△N(서열 번호 40) (위 도면) 및 38,383 bp DNA 작제물 COVAX191△N△HE(서열 번호 40) (아래 도면)의 상응하는 DNA 시퀀싱 커버 그래프가 있는 개략도. 화살표는 복제 폴리단백질 1A 및 1B(1A, 1B), 헤마글루티닌 에스테라제(HE), 스파이크 당단백질(S), 외피 단백질(E) 및 막 단백질(M)의 재코딩된 CDS에 대한 단백질-코딩 서열의 위치를 나타낸다. 단일 리튬 아세테이트 효모 형질전환을 사용하여 6개의 합성 DNA 블록으로부터 COVAX191△N 및 COVAX191△N△HE의 완전한 게놈을 어셈블리하고 영양요구성(auxotrophic) URA3 마커에 대해 선택하였다.
도 5: SARS-CoV-2 게놈 및 생성된 결실 변이체의 개략도.
표 S1: 에스. 세레비시아(S. cerevisiae) (효모)에서 COVAX191의 DNA 어셈블리 효율
실시예
하기 실시예는 세포가 코로나 바이러스 외피 또는 이의 단편을 생성하도록 자극하기 위해 외피 단백질 E, M, N 및 S를 코딩하는 본 발명의 장쇄 핵산이 어떻게 생산되고 사용되는지를 설명한다.
생산을 위해, 본 발명에 따른 (디지털) 서열은 화학적 DNA 합성 과정에 의해 물리적으로 존재하는 상응하는 장쇄 완전 합성 핵산 분자로 전달된다.
실시예 1
제1 실시예에서, 외피 단백질 E, M, N 및 S를 코딩하는 생성된 장쇄 완전 합성 핵산은 모노-시스트론성인데, 즉, 이들은 별도의 프로모터(SV40, CMV, EF-1, 치킨 β 액틴 프로모터 또는 하이브리드 프로모터) 및 기타 임의의 번역 개시 신호(Kozak 공통 서열) 및 핵 mRNA 배출 신호(Chuck Wood 서열)의 제어 하에 진핵 세포용 발현 플라스미드로 생산된다. 서열 번호 35, 서열 번호 36, 서열 번호 37 및 서열 번호 38 및 도 1에 나타낸 서열은 그러한 발현 시스템의 예로서 작용할 것이다. 다른 발현 플라스미드, 상응하는 내성 유전자 및 프로모터를 갖는 다른 구현예가 가능하고 당업자에게 공지되어 있다.
생성된 4개의 발현 플라스미드는 에스케리키아 콜리(Escherichia coli)에서 증폭되고 표준 화학-물리적 절차에 의해 정제된 다음, 형질감염에 의해 진핵 세포주(HEK293, 차이니즈 햄스터 난소(CHO), SF9, Vero)에 도입된다. 형질감염은 인산칼슘, 리포펙션, 전기천공과 같은 표준 절차에 의해 수행된다.
형질감염 후, 형질감염된 플라스미드 DNA로부터 시작하는 세포는 외피 단백질 E, M, N 및 S가 번역에 의해 발현되는 메신저 RNA(mRNA)를 번역하기 시작한다. 이러한 단백질은 세포에서 자발적으로 어셈블리되어 코로나 바이러스 외피를 형성한 다음, 세포에 의해 엑소시토시스(exocytosis)에 의해 배양 배지로 방출되고 이때 이들은 5-7일 후에 축적된다.
외피 단백질, 바이러스 외피 및 이들의 단편의 정제에는 화학적-물리적 공정이 사용된다. 이를 위해, 원심분리에 의해 세포 배양 상층액을 세포로부터 분리한다. 후속 단계에서, 바이러스 외피는 크로마토그래피 컬럼 분리 방법에 의해 배양 배지의 불순물 및 기타 성분으로부터 추가로 정제된다. 이와 같이 얻어진 코로나바이러스 외피로 이루어진 순수한 형태의 물질은 백신의 기반을 이루며, 그 후 적용 유형에 따라 투여를 위해 다양한 형태로 전환된다. 전형적으로, 이 목적을 위해 애쥬번트, 저장 수명 개선을 위한 안정제, 염 및 완충제가 사용된다. 따라서, 백신은 본원에 기재된 장쇄의 완전 합성 핵산 생성물이다.
실시예 2
제2 실시예에서, 외피 단백질 E, M 및 S를 코딩하는 장쇄의 완전 합성 핵산은 RNA-의존성 RNA 폴리머라제를 코딩하는 완전 합성 핵산과 함께 발현된다. 서열 번호 39 및 서열 번호 40에 의해 밝혀지고 도 2에 나타낸 바와 같은 이러한 폴리-시스트론 발현 시스템에서, 외피 단백질 E, M 및 S는 RNA-의존성 RNA 폴리머라제를 포함하는 음성 RNA 가닥으로부터 직접 전사된다. 서열 군 A-D의 모든 부류의 외피 단백질이 RNA-의존성으로 발현되지 않는다면, 실시예 1에 기재된 바와 같은 추가 발현 플라스미드는 세포주에서 바이러스 외피의 생명공학적 생산을 위한 외피 단백질의 완전한 세트를 발현하는데 사용될 수 있다. 실시예 2에서, N 단백질을 코딩하는 발현 플라스미드가 이러한 목적을 위해 사용된다(서열 번호 35)(도 2 참조).
플라스미드의 정제, 장쇄 핵산의 형질감염뿐만 아니라, 바이러스 외피의 정제는 대체로 실시예 1에 기재된 공정 순서를 따른다. 그러나, 상기 공정은 서열 번호 39 및 서열 번호 40에 기재된 장쇄 핵산이 형질감염 전에 T7 RNA 폴리머라제에 의해 서열 번호 33 및 서열 번호 34에 따른 상응하는 RNA 형태로 형질전환되는 추가 단계를 포함한다. 이러한 양성 RNA 가닥은 세포주에서 RNA-의존성 RNA 폴리머라제의 생성을 유도하며, 이는 이로부터 음성 RNA 가닥을 생성한다. 이어서, 이러한 음성 RNA 가닥으로부터 메신저 RNA(mRNA)가 전사되고, 이는 바이러스 외피에서 외피 단백질의 생산 및 어셈블리를 유도한다.
이러한 방식으로 생산된 백신은 상응하는 데옥시리보핵산의 유전자 발현을 통해 수득된 외피 단백질에 추가하여, 서열 번호 39 및 서열 번호 40의 T7 전사를 통해 발현되는 완전 합성 장쇄 리보핵산을 함유한다는 점에서 제1 실시예 1에 기재된 백신과 상이하다.
제2 실시예는 N 단백질을 발현하는 헬퍼 세포주에서 스스로 증식하는 바이러스 외피를 생성한다는 점에서 제1 적용예에 비해 이점을 갖는다. 이것은 이와 같이 형성된 바이러스 외피가 RNA-의존성 RNA 폴리머라제와 외피 단백질 E, M, S를 코딩하는 양성 RNA 가닥을 추가로 포함하고 있기 때문에 가능하다. 이러한 바이러스 외피가 세포에 의해 흡수되면, 세포 자체가 자극되어 바이러스 외피를 생성한다. 세포가 N 단백질을 에피솜으로 발현하면, 백신 생산 세포주의 경우처럼 자가 복제 바이러스 외피가 형성된다. 이는 생산 공정을 단순화하고 값비싼 형질감염 시약 없이 수행될 수 있다. 표적 세포가 임의의 N-단백질을 발현하지 않는다면, 바이러스 외피도 이로부터 형성되지만, 이어서 이들은 패키징된 RNA 가닥이 없고 더 이상 자가 복제될 수 없다. 이러한 바이러스 외피는 실시예 1에 나타낸 제조 공정에 의해 생산된 바이러스 외피와 동일한 화학적/물리적 구조 및 동일한 항원성을 갖는다. 실시예 2는 추가의 헬퍼 세포주 및 생산 유기체에서 바이러스 외피, 단편 및 바이러스 외피 단백질의 생산뿐만 아니라, RNA 백신으로서의 직접 적용을 가능하게 한다.
방법:
박테리아 및 효모 균주의 배양
에스케리키아 콜리(이. 콜리) DH5알파는 37℃에서 Luria-Broth(LB)에서 배양되었다. 사카로미세스 세레비시아(Saccharomyces cerevisiae) VL6-48N(Kouprina et al. 2006 Methods in Mol. Biol. 349, 85-101)를 30℃에서 우라실이 없는 효모 펩톤-덱스트로스(YPD) 배지 또는 합성 드롭아웃(SD) 배지에서 배양하였다.
서열 설계 및 드 노보(de-novo) DNA 합성.
모노-시스트론 및 폴리-시스트론 발현 작제물에 대한 DNA 서열은 첨부된 서열 목록(서열 번호 1 내지 40)에 개시된 서열 부분으로부터 어셈블리하였다. 합성 제한은 동의어 코돈 교체 및 유전자간 서열 내에서 원하는 염기 치환의 적용에 의해 계산적으로 제거되었다. 최적의 역합성 어셈블리 경로를 정의하기 위해, 합성-최적화된 DNA 설계는 상업적 공급업체에 의해 저비용 합성에 적합한 더 작은 DNA 단편으로 계층적으로 분할되었다. 분할 전략은 4단계의 계층적 어셈블리 공정으로서 설계되었다. 1.4 kb(킬로베이스) 크기의 하위 블록을 5.4 kb 블록으로 어셈블리하고 16 kb 크기의 세그먼트로 추가로 어셈블리한 다음, 35 내지 40 kb의 최종 COVAX 작제물로 어셈블리하였다. 선형 DNA 어셈블리 부분은 말단에 상동성 중첩을 가지며 3' 프리픽스에서 5' 서픽스 서열이 중첩되어 어셈블리된 DNA 부분을 벡터에 통합하고 최종 COVAX DNA 설계의 계층적 어셈블리를 허용한다. DNA 어셈블리 부분은 서열이 검증된 클론 플라스미드 작제물 및 이중 가닥 선형 DNA로서 저비용 DNA 합성에 의해 상업적 공급업체로부터 입수했다.
모노-시스트론 발현 작제물의 생성:
SARS-2 CoV의 S-단백질, SARS-CoV-2 또는 MHV의 M-단백질, N-단백질 및 E-단백질의 완전한 단백질-코딩 서열을 포함하는 합성 핵산 서열은 서열-검증된 합성 DNA로부터 폴리머라제 증폭 기술(PCR)에 의해 증폭되었다. 개시 코돈 이전의 번역 개시 부위는 올리고뉴클레오티드 프라이머에 의해 도입되었다. PCR 생성물은 이들의 분자량에 따라 아가로스 겔 전기영동으로 분리한 다음, 뉴클레오스핀(nucleospin) 컬럼(NucleoSpin Gel 및 PCR Clean-up Kit, Macherey nail)으로 정제하였다. PCR 생성물은 Topo-TA 클로닝 키트(TOPO-TA 클로닝 키트, ThermoFisher)를 사용하여 pcDNA3.4 벡터에 클로닝되었다. 플라스미드의 분자량은 아가로스 겔 전기영동에 의해 결정되었고(도 3) DNA 서열은 Sanger 시퀀싱에 의해 확인되었다.
폴리-시스트론 COVAX DNA 작제물의 생성:
폴리-시스트론 COVAX DNA 작제물에 대한 DNA 어셈블리 부분은 IIS형 제한 효소(Bbsl, BspQl, Pacl 및 Pmel(New England Biolabs))를 사용한 제한 분해에 의해 플라스미드로부터 방출되었다. 등몰량의 DNA 삽입물(100 ng, 0.115 pmol) 및 선형화된 벡터 pXMCS2(100 ng, 0.038 pmol)를 T5 엑소뉴클레아제, 퓨전 폴리머라제 및 Taq DNA 리가아제와 함께 50℃에서 1시간 동안 인큐베이션하였다. 등온 어셈블리 후, 작제물을 E. coli DH5 알파 세포(BioRad MiniPulser)에 전기천공하였다. 세포를 LB 배지에서 1시간 동안 인큐베이션한 다음, LB 플레이트에 플레이팅하였다. 세그먼트 및 완전한 COVAX 작제물은 리튬 아세테이트 형질전환 방법(Gietz et al 2007, Nature Protocols, 2, 31-34)에 따라 플라스미드 pMR10Y(pMR10::CEN/ARS::URA3, Christen et al. 2015, ACS Synthetic Biology, 4, 927-934)를 사용하여 효모 재조합에 의해 블록으로부터 어셈블리되었다. Saccharomyces cerevisiae VL6-48N을 5 ㎖ YPD에서 밤새 성장시키고 50 ㎖ YPD에 1:20으로 희석하고 4시간 동안 인큐베이션하였다. 1,000 rcf에서 5분 동안 원심분리하여 세포를 수집하고, 25 ㎖의 증류수로 세척하고, 3,000 rcf에서 5분 동안 원심분리하였다. 펠렛을 1 ㎖ 리튬 아세테이트 혼합물(0.1 M 리튬 아세테이트, 0.01 M Tris-HCl, pH 7.5, 0.001 M EDTA, pH 8.0)에 용해시켰다. 다음으로, 100 ㎕ 단일 가닥 연어 정자 DNA(1% w/v 연어 정자 DNA(ssDNA), 0.01 M Tris-HCl, pH 7.5, 0.001 M EDTA, pH 8.0) 및 6 ㎖ PEG-믹스(40% w/v 폴리(에틸렌 글리콜) 3015-3685 g/mol, 0.01 M Tris-HCl, pH 7.5, 0.001 M EDTA, pH 8.0)를 첨가하였다. PEG 세포 믹스로부터 710 ㎕ 분취량을 100 ng의 분해된 DNA 블록 및 250 ng의 선형화된 pMR10Y 벡터(Pad, Pmel)와 조합하였다. 샘플을 30℃에서 30분 동안 인큐베이션하였다. 인큐베이션 후, 70 ㎕ 디메틸 설폭사이드(DMSO)를 첨가하고, 샘플을 42℃에서 15분 동안 열 충격하였다. 세포를 1,000 rcf에서 2분 동안 원심분리하여 수집한 다음, 우라실이 없는 SD 플레이트에 플레이팅하고, 콜로니가 보일 때까지 30℃에서 3일 동안 배양하였다(표 S1 참조).
COVAX DNA 작제물의 서열 검증.
어셈블리된 DNA 작제물의 서열 검증은 Nextera DNA Flex Library Prep-Kit를 사용하여 iSeq 기기(Illumina)에서 수행되었다. ura + 효모 형질전환체의 게놈 DNA는 제조업체에 의해 지정된 태깅 프로토콜에 따라 단편화되고 처리되었다. 서열은 리드(read) 서열로부터 새로 계산되었고 생성된 콘티그는 CLC Genomics Workbench 소프트웨어(Quiagen)를 사용하여 참조 서열과 비교되었다. COVAX191△N 및 COVAX191△HEN의 완전한 어셈블리는 완전히 닫힌 서열 커버리지 플롯으로 확인되었다(도 4).
실시예 3
각각 하나의 원형 서열(바이러스 서열, T7 프로모터 및 폴리A-신호뿐만 아니라 벡터, 모두 하나의 효모 인공 염색체 또는 "YAC"에 함께)을 함유하는 효모 클론을 성장시키고, 수확하고, 이의 YAC를 추출하였다. 이와 같이 얻은 YAC는 제한 효소 Eagl로 절단되어 폴리A-신호 직후에 선형화된 이중 가닥 DNA 분자가 생성되었다. 이러한 DNA 분자를 프로테이나제 K로 표준 처리한 후 트리졸(페놀/클로로포름) 추출로 RNase를 제거한 후, T7 폴리머라제를 사용한 시험관내 전사에 의해 백신 바이러스 게놈에 상응하는 단일 가닥 RNA를 수득하였다. 이와 같이 수득된 RNA를 적합한 세포주(HEK293T 또는 Vero 세포)에 형질감염시켰다. 양성 대조군의 경우, 전장 작제물 "GBsyn_V33" 변경되지 않은 HEK293 또는 Vero 세포는 RNA 게놈의 복제, 서브게놈 mRNA의 생성 및 따라서 바이러스 단백질로의 번역을 지원하였다. 이들은 양성 가닥 RNA 게놈 및 세포막으로부터의 구성요소와 함께, 자손 바이러스를 형성하였으며, 이 경우 야생형 천연 SARS-CoV-2 바이러스를 형성하였다. 결실 돌연변이체의 경우, 바이러스 게놈에서 결실된 유전자 또는 유전자들은 DNA 형태로 세포주에 형질감염되어 단백질 또는 단백질들의 일시적인 발현을 유도하여 자손 바이러스의 생성을 가능하게 하는 데 필요한 결손 인자를 제공한다. 대안적으로 (그리고 바람직하게), 선택 압력 하에서 이러한 세포의 배양은 단백질 또는 단백질들이 지속적으로 발현되는 세포 게놈으로 유전자 또는 유전자들의 안정적인 통합을 유도한다(발현을 통해 본 발명자들은 유전자로부터 mRNA의 생성과 단백질로의 후속 번역을 이해한다). 백신 바이러스 게놈에 없는 유전자로부터 만들어진 단백질을 일시적으로 또는 안정적으로 발현하는 그러한 세포는 구조 단백질의 전체 세트 및 하나 이상의 유전자가 결실된 백신 바이러스 게놈을 특징으로 하는 백신 바이러스의 연속 생산을 가능하게 한다. 이와 같이 얻어진 백신 바이러스는 정화(백신 바이러스로부터 세포 분리), 벤조아제에 의한 DNA 분해, 한외 여과/정용 여과("UF/DF") 및 최종적으로 멸균 여과(0.22 μm 여과)를 특징으로 하는 소위 다운스트림 처리(DSP) 공정에서 정제되었다.
SEQUENCE LISTING
<110> Swiss Rockets AG
<120> Fully synthetic, long-chain nucleic acid for vaccine production to protect against coronaviruses
<130> P6086PC00
<140> EP20020240.6
<141> 2020-05-20
<150> EP20020092.1
<151> 2020-03-03
<160> 59
<170> BiSSAP 1.3.6
<210> 1
<211> 1263
<212> DNA
<213> Artificial Sequence
<220>
<223> COVAX192_N
<400> 1
atggtgtctg ataatggacc tcaaaatcag cgaaatgcac ctcgcattac gtttggtgga 60
ccatcagatt caactggcag taaccagaat ggagaacgaa gtggtgcgcg atcaaaacaa 120
cgccgcccgc aaggtttacc caataatact gcgtcttggt tcaccgctct cactcaacat 180
ggcaaggaag atttaaaatt ccctcgagga caaggcgttc caattaacac caatagcagt 240
ccagatgacc aaattggcta ctaccgccgc gccacaagac gaattcgtgg tggtgatggt 300
aaaatgaaag atctcagtcc aagatggtat ttctactatc taggaactgg gccagaagct 360
ggacttcctt atggtgctaa caaagatggc atcatatggg ttgcaactga gggagccttg 420
aatacaccaa aagatcacat tggcaccaga aatcctgcta acaatgctgc aatcgtgcta 480
caacttcctc aaggaacaac attaccaaaa ggtttttacg cagaagggtc tagaggtgga 540
agtcaagcct cttctagatc atcatcacgt agtcgcaaca gttcaagaaa ttcaactcca 600
ggttcaagta gaggaacttc tcctgctaga atggctggaa atggaggtga tgctgctctt 660
gctttgttac tacttgacag attgaaccag cttgagagca aaatgtctgg taaaggccaa 720
caacaacaag gccaaactgt cactaagaaa tctgctgctg aggcttctaa gaagcctaga 780
caaaaacgta ctgccactaa agcatacaat gtaacacaag ctttcggcag acgtggtcca 840
gaacaaactc aaggaaattt tggggatcag gaactaatca gacaaggaac tgattacaaa 900
cattggccgc aaattgcaca atttgctcct tctgcttcag cgttctttgg aatgtcgaga 960
attggaatgg aagtcacacc ttcgggaaca tggttgacct atacaggtgc catcaaattg 1020
gatgacaaag atccaaattt caaagatcaa gtcattttgc tgaataagca tattgacgca 1080
tacaaaacat tcccaccaac agagcctaaa aaggacaaaa agaagaaggc tgatgaaact 1140
caagccttac cgcagagaca gaagaaacag caaactgtga ctcttcttcc tgctgcagat 1200
ttggatgatt tctccaaaca attgcaacaa tccatgagca gtgctgactc aactcaggcc 1260
taa 1263
<210> 2
<211> 420
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic_Nucleocapsid_Protein_Sars-CoV2
<400> 2
Met Val Ser Asp Asn Gly Pro Gln Asn Gln Arg Asn Ala Pro Arg Ile
1 5 10 15
Thr Phe Gly Gly Pro Ser Asp Ser Thr Gly Ser Asn Gln Asn Gly Glu
20 25 30
Arg Ser Gly Ala Arg Ser Lys Gln Arg Arg Pro Gln Gly Leu Pro Asn
35 40 45
Asn Thr Ala Ser Trp Phe Thr Ala Leu Thr Gln His Gly Lys Glu Asp
50 55 60
Leu Lys Phe Pro Arg Gly Gln Gly Val Pro Ile Asn Thr Asn Ser Ser
65 70 75 80
Pro Asp Asp Gln Ile Gly Tyr Tyr Arg Arg Ala Thr Arg Arg Ile Arg
85 90 95
Gly Gly Asp Gly Lys Met Lys Asp Leu Ser Pro Arg Trp Tyr Phe Tyr
100 105 110
Tyr Leu Gly Thr Gly Pro Glu Ala Gly Leu Pro Tyr Gly Ala Asn Lys
115 120 125
Asp Gly Ile Ile Trp Val Ala Thr Glu Gly Ala Leu Asn Thr Pro Lys
130 135 140
Asp His Ile Gly Thr Arg Asn Pro Ala Asn Asn Ala Ala Ile Val Leu
145 150 155 160
Gln Leu Pro Gln Gly Thr Thr Leu Pro Lys Gly Phe Tyr Ala Glu Gly
165 170 175
Ser Arg Gly Gly Ser Gln Ala Ser Ser Arg Ser Ser Ser Arg Ser Arg
180 185 190
Asn Ser Ser Arg Asn Ser Thr Pro Gly Ser Ser Arg Gly Thr Ser Pro
195 200 205
Ala Arg Met Ala Gly Asn Gly Gly Asp Ala Ala Leu Ala Leu Leu Leu
210 215 220
Leu Asp Arg Leu Asn Gln Leu Glu Ser Lys Met Ser Gly Lys Gly Gln
225 230 235 240
Gln Gln Gln Gly Gln Thr Val Thr Lys Lys Ser Ala Ala Glu Ala Ser
245 250 255
Lys Lys Pro Arg Gln Lys Arg Thr Ala Thr Lys Ala Tyr Asn Val Thr
260 265 270
Gln Ala Phe Gly Arg Arg Gly Pro Glu Gln Thr Gln Gly Asn Phe Gly
275 280 285
Asp Gln Glu Leu Ile Arg Gln Gly Thr Asp Tyr Lys His Trp Pro Gln
290 295 300
Ile Ala Gln Phe Ala Pro Ser Ala Ser Ala Phe Phe Gly Met Ser Arg
305 310 315 320
Ile Gly Met Glu Val Thr Pro Ser Gly Thr Trp Leu Thr Tyr Thr Gly
325 330 335
Ala Ile Lys Leu Asp Asp Lys Asp Pro Asn Phe Lys Asp Gln Val Ile
340 345 350
Leu Leu Asn Lys His Ile Asp Ala Tyr Lys Thr Phe Pro Pro Thr Glu
355 360 365
Pro Lys Lys Asp Lys Lys Lys Lys Ala Asp Glu Thr Gln Ala Leu Pro
370 375 380
Gln Arg Gln Lys Lys Gln Gln Thr Val Thr Leu Leu Pro Ala Ala Asp
385 390 395 400
Leu Asp Asp Phe Ser Lys Gln Leu Gln Gln Ser Met Ser Ser Ala Asp
405 410 415
Ser Thr Gln Ala
420
<210> 3
<211> 1368
<212> DNA
<213> Artificial Sequence
<220>
<223> COVAX191_N
<400> 3
atggtgtctt ttgttcctgg gcaagaaaat gccggtggca gaagctcctc tgtaaaccgc 60
gctggtaatg gaatcctcaa gaaaaccact tgggctgacc aaaccgagcg tggaccaaat 120
aatcaaaata gaggcagaag gaatcagcca aagcagactg caactactca acccaactcc 180
gggagtgtgg ttccccatta ctcctggttt tctggcatta cccagttcca aaagggaaag 240
gagtttcagt ttgcagaagg acaaggagtg cctattgcca atggaatccc cgcttcagag 300
caaaagggat attggtatag acacaaccgc cgttctttta aaacacctga tgggcagcag 360
aagcaattac tgcccagatg gtatttttac tatcttggca cagggcccca tgctggagcc 420
agttatggag acagcattga aggcgtcttt tgggttgcaa acagccaagc ggacaccaat 480
acccgctctg atattgtcga aagggaccca agcagtcatg aggctattcc tactaggttt 540
gcgcccggca cggtattgcc tcagggcttt tatgttgaag gctctggaag gtctgccccg 600
gccagccgat ctggttcgcg gtcacaatcc cgtgggccaa ataatcgcgc tagaagcagt 660
tccaaccagc gccagcctgc ctctactgta aaacctgata tggccgaaga aattgctgct 720
cttgttttgg ctaagctcgg taaagatgcc ggccagccca agcaagtaac gaagcaaagt 780
gccaaagaag tcaggcagaa aattttaaac aagcctcgcc aaaagaggac tccaaacaag 840
cagtgcccag tgcagcagtg ttttggaaag agaggcccca atcagaattt tggaggctct 900
gaaatgttaa aacttggaac tagtgatcca cagttcccca ttcttgcaga gttggctcca 960
acagttggtg ccttcttctt tggatctaaa ttagaattgg tcaaaaagaa ttctggtggt 1020
gctgatgaac ccaccaaaga tgtgtatgag ctgcaatatt caggtgcagt tagatttgat 1080
agtactctac ctggttttga gactatcatg aaagtgttga atgagaattt gaatgcctac 1140
cagaaggatg gtggtgcaga tgtggtgagc ccaaagcccc aaagaaaagg gcgtagacag 1200
gctcaggaaa agaaagatga agtagataat gtaagcgttg caaagcccaa aagctctgtg 1260
cagcgaaatg taagtagaga attaacccca gaggatagaa gtctgttggc tcagatcctt 1320
gatgatggcg tagtgccaga tgggttagaa gatgactcta atgtgtaa 1368
<210> 4
<211> 455
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic_Nucleocapsid_Protein_MHV
<400> 4
Met Val Ser Phe Val Pro Gly Gln Glu Asn Ala Gly Gly Arg Ser Ser
1 5 10 15
Ser Val Asn Arg Ala Gly Asn Gly Ile Leu Lys Lys Thr Thr Trp Ala
20 25 30
Asp Gln Thr Glu Arg Gly Pro Asn Asn Gln Asn Arg Gly Arg Arg Asn
35 40 45
Gln Pro Lys Gln Thr Ala Thr Thr Gln Pro Asn Ser Gly Ser Val Val
50 55 60
Pro His Tyr Ser Trp Phe Ser Gly Ile Thr Gln Phe Gln Lys Gly Lys
65 70 75 80
Glu Phe Gln Phe Ala Glu Gly Gln Gly Val Pro Ile Ala Asn Gly Ile
85 90 95
Pro Ala Ser Glu Gln Lys Gly Tyr Trp Tyr Arg His Asn Arg Arg Ser
100 105 110
Phe Lys Thr Pro Asp Gly Gln Gln Lys Gln Leu Leu Pro Arg Trp Tyr
115 120 125
Phe Tyr Tyr Leu Gly Thr Gly Pro His Ala Gly Ala Ser Tyr Gly Asp
130 135 140
Ser Ile Glu Gly Val Phe Trp Val Ala Asn Ser Gln Ala Asp Thr Asn
145 150 155 160
Thr Arg Ser Asp Ile Val Glu Arg Asp Pro Ser Ser His Glu Ala Ile
165 170 175
Pro Thr Arg Phe Ala Pro Gly Thr Val Leu Pro Gln Gly Phe Tyr Val
180 185 190
Glu Gly Ser Gly Arg Ser Ala Pro Ala Ser Arg Ser Gly Ser Arg Ser
195 200 205
Gln Ser Arg Gly Pro Asn Asn Arg Ala Arg Ser Ser Ser Asn Gln Arg
210 215 220
Gln Pro Ala Ser Thr Val Lys Pro Asp Met Ala Glu Glu Ile Ala Ala
225 230 235 240
Leu Val Leu Ala Lys Leu Gly Lys Asp Ala Gly Gln Pro Lys Gln Val
245 250 255
Thr Lys Gln Ser Ala Lys Glu Val Arg Gln Lys Ile Leu Asn Lys Pro
260 265 270
Arg Gln Lys Arg Thr Pro Asn Lys Gln Cys Pro Val Gln Gln Cys Phe
275 280 285
Gly Lys Arg Gly Pro Asn Gln Asn Phe Gly Gly Ser Glu Met Leu Lys
290 295 300
Leu Gly Thr Ser Asp Pro Gln Phe Pro Ile Leu Ala Glu Leu Ala Pro
305 310 315 320
Thr Val Gly Ala Phe Phe Phe Gly Ser Lys Leu Glu Leu Val Lys Lys
325 330 335
Asn Ser Gly Gly Ala Asp Glu Pro Thr Lys Asp Val Tyr Glu Leu Gln
340 345 350
Tyr Ser Gly Ala Val Arg Phe Asp Ser Thr Leu Pro Gly Phe Glu Thr
355 360 365
Ile Met Lys Val Leu Asn Glu Asn Leu Asn Ala Tyr Gln Lys Asp Gly
370 375 380
Gly Ala Asp Val Val Ser Pro Lys Pro Gln Arg Lys Gly Arg Arg Gln
385 390 395 400
Ala Gln Glu Lys Lys Asp Glu Val Asp Asn Val Ser Val Ala Lys Pro
405 410 415
Lys Ser Ser Val Gln Arg Asn Val Ser Arg Glu Leu Thr Pro Glu Asp
420 425 430
Arg Ser Leu Leu Ala Gln Ile Leu Asp Asp Gly Val Val Pro Asp Gly
435 440 445
Leu Glu Asp Asp Ser Asn Val
450 455
<210> 5
<211> 231
<212> DNA
<213> Artificial Sequence
<220>
<223> COVAX192_E
<400> 5
atggtgtact cattcgtttc ggaagagaca ggtacgttaa tagttaatag cgtacttctt 60
tttcttgctt tcgtggtatt cttgctagtt acactagcca ttcttactgc gcttcgattg 120
tgtgcgtact gttgcaatat tgttaacgtg agtcttgtaa aaccttcttt ttacgtttac 180
tctcgtgtta aaaatctgaa ttcttctcgg gttcctgatc ttctggtcta a 231
<210> 6
<211> 76
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic_Envelope_Protein_Sars-CoV2
<400> 6
Met Val Tyr Ser Phe Val Ser Glu Glu Thr Gly Thr Leu Ile Val Asn
1 5 10 15
Ser Val Leu Leu Phe Leu Ala Phe Val Val Phe Leu Leu Val Thr Leu
20 25 30
Ala Ile Leu Thr Ala Leu Arg Leu Cys Ala Tyr Cys Cys Asn Ile Val
35 40 45
Asn Val Ser Leu Val Lys Pro Ser Phe Tyr Val Tyr Ser Arg Val Lys
50 55 60
Asn Leu Asn Ser Ser Arg Val Pro Asp Leu Leu Val
65 70 75
<210> 7
<211> 255
<212> DNA
<213> Artificial Sequence
<220>
<223> COVAX191_E
<400> 7
atggtgttta atttattcct tacagacaca gtatggtatg tggggcagat tatttttata 60
ttcgcagtgt gtttgatggt caccataatt gtggttgcct tccttgcgtc tatcaaactt 120
tgtattcaac tttgcggttt atgtaatact ttggtgctgt ccccttctat ttatttgtat 180
gataggagta agcagcttta taagtactat aatgaagaaa tgagactgcc cctattagag 240
gtggatgata tctaa 255
<210> 8
<211> 84
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic_Envelope_Protein_MHV
<400> 8
Met Val Phe Asn Leu Phe Leu Thr Asp Thr Val Trp Tyr Val Gly Gln
1 5 10 15
Ile Ile Phe Ile Phe Ala Val Cys Leu Met Val Thr Ile Ile Val Val
20 25 30
Ala Phe Leu Ala Ser Ile Lys Leu Cys Ile Gln Leu Cys Gly Leu Cys
35 40 45
Asn Thr Leu Val Leu Ser Pro Ser Ile Tyr Leu Tyr Asp Arg Ser Lys
50 55 60
Gln Leu Tyr Lys Tyr Tyr Asn Glu Glu Met Arg Leu Pro Leu Leu Glu
65 70 75 80
Val Asp Asp Ile
<210> 9
<211> 672
<212> DNA
<213> Artificial Sequence
<220>
<223> COVAX192_M
<400> 9
atggtggcag attccaacgg tactattacc gttgaggagc tgaaaaagct ccttgaacaa 60
tggaacctag taataggttt cctattcctt acatggattt gcctgctgca atttgcctat 120
gccaacagga ataggttttt gtacatcatt aagttgattt tcctctggct gttatggcca 180
gtaactttag cttgttttgt gcttgctgct gtttacagaa taaattggat caccggtgga 240
attgctattg caatggcttg tcttgtagga ttgatgtggc taagctactt cattgcttct 300
ttcagactgt ttgcgcgtac gcgttccatg tggtcattca atccagaaac taacattctt 360
ctcaacgtgc cactccatgg aactattctg actagaccgc ttctagaaag tgaactcgta 420
atcggagctg ttatccttcg tggacatctt cgtattgctg gacatcatct aggacgctgt 480
gacatcaagg atctacctaa agaaatcact gttgctacat cacgaacgct ttcttattac 540
aaattgggag cttcacagcg tgtagcaggt gattcaggtt ttgctgcata tagtcgctac 600
aggattggca actataaatt aaacacagac cattccagta gcagtgacaa tattgctttg 660
cttgtacagt aa 672
<210> 10
<211> 223
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic_Membrane_Protein_Sars-CoV2
<400> 10
Met Val Ala Asp Ser Asn Gly Thr Ile Thr Val Glu Glu Leu Lys Lys
1 5 10 15
Leu Leu Glu Gln Trp Asn Leu Val Ile Gly Phe Leu Phe Leu Thr Trp
20 25 30
Ile Cys Leu Leu Gln Phe Ala Tyr Ala Asn Arg Asn Arg Phe Leu Tyr
35 40 45
Ile Ile Lys Leu Ile Phe Leu Trp Leu Leu Trp Pro Val Thr Leu Ala
50 55 60
Cys Phe Val Leu Ala Ala Val Tyr Arg Ile Asn Trp Ile Thr Gly Gly
65 70 75 80
Ile Ala Ile Ala Met Ala Cys Leu Val Gly Leu Met Trp Leu Ser Tyr
85 90 95
Phe Ile Ala Ser Phe Arg Leu Phe Ala Arg Thr Arg Ser Met Trp Ser
100 105 110
Phe Asn Pro Glu Thr Asn Ile Leu Leu Asn Val Pro Leu His Gly Thr
115 120 125
Ile Leu Thr Arg Pro Leu Leu Glu Ser Glu Leu Val Ile Gly Ala Val
130 135 140
Ile Leu Arg Gly His Leu Arg Ile Ala Gly His His Leu Gly Arg Cys
145 150 155 160
Asp Ile Lys Asp Leu Pro Lys Glu Ile Thr Val Ala Thr Ser Arg Thr
165 170 175
Leu Ser Tyr Tyr Lys Leu Gly Ala Ser Gln Arg Val Ala Gly Asp Ser
180 185 190
Gly Phe Ala Ala Tyr Ser Arg Tyr Arg Ile Gly Asn Tyr Lys Leu Asn
195 200 205
Thr Asp His Ser Ser Ser Ser Asp Asn Ile Ala Leu Leu Val Gln
210 215 220
<210> 11
<211> 690
<212> DNA
<213> Artificial Sequence
<220>
<223> COVAX191_M
<400> 11
atggtgagta gtactactca ggccccagag cccgtctatc aatggaccgc cgacgaggca 60
gttcaattcc ttaaggaatg gaacttctcg ttgggcatta tactactctt tattactatc 120
atactacagt tcggttacac gagccgtagc atgtttattt atgttgtgaa aatgataatc 180
ttgtggttaa tgtggccact gactattgtt ttgtgtattt tcaattgcgt gtatgcgcta 240
aataatgtgt atcttggatt ttctatagtg tttactatag tgtccattgt aatctggatc 300
atgtattttg tgaacagcat aaggttgttt atcaggactg gtagctggtg gagcttcaac 360
cccgaaacaa acaaccttat gtgtatagat atgaaaggta ccgtgtatgt tagacccatt 420
attgaggatt accatacact aacagccact attattcgtg gccacctcta catgcaaggt 480
gttaagctag gcaccggttt ctctttgtct gacttgcccg cttatgttac agttgctaag 540
gtgtcacacc tttgcactta taagcgcgca ttcttagaca aggtagacgg tgttagcggt 600
tttgctgttt atgtgaagtc caaggtcgga aattaccgac tgccctcaaa caaaccgagt 660
ggcgcggaca ccgcattgtt gagaacctaa 690
<210> 12
<211> 229
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic_Membrane_Protein_MHV
<400> 12
Met Val Ser Ser Thr Thr Gln Ala Pro Glu Pro Val Tyr Gln Trp Thr
1 5 10 15
Ala Asp Glu Ala Val Gln Phe Leu Lys Glu Trp Asn Phe Ser Leu Gly
20 25 30
Ile Ile Leu Leu Phe Ile Thr Ile Ile Leu Gln Phe Gly Tyr Thr Ser
35 40 45
Arg Ser Met Phe Ile Tyr Val Val Lys Met Ile Ile Leu Trp Leu Met
50 55 60
Trp Pro Leu Thr Ile Val Leu Cys Ile Phe Asn Cys Val Tyr Ala Leu
65 70 75 80
Asn Asn Val Tyr Leu Gly Phe Ser Ile Val Phe Thr Ile Val Ser Ile
85 90 95
Val Ile Trp Ile Met Tyr Phe Val Asn Ser Ile Arg Leu Phe Ile Arg
100 105 110
Thr Gly Ser Trp Trp Ser Phe Asn Pro Glu Thr Asn Asn Leu Met Cys
115 120 125
Ile Asp Met Lys Gly Thr Val Tyr Val Arg Pro Ile Ile Glu Asp Tyr
130 135 140
His Thr Leu Thr Ala Thr Ile Ile Arg Gly His Leu Tyr Met Gln Gly
145 150 155 160
Val Lys Leu Gly Thr Gly Phe Ser Leu Ser Asp Leu Pro Ala Tyr Val
165 170 175
Thr Val Ala Lys Val Ser His Leu Cys Thr Tyr Lys Arg Ala Phe Leu
180 185 190
Asp Lys Val Asp Gly Val Ser Gly Phe Ala Val Tyr Val Lys Ser Lys
195 200 205
Val Gly Asn Tyr Arg Leu Pro Ser Asn Lys Pro Ser Gly Ala Asp Thr
210 215 220
Ala Leu Leu Arg Thr
225
<210> 13
<211> 3885
<212> DNA
<213> Artificial Sequence
<220>
<223> SARS-CoV-2 S
<400> 13
atggtgtttg tttttcttgt tttattgcca ctagtctcta gtcagtgtgt taatcttaca 60
atggtgtttg tttttcttgt tttattgcca ctagtctcta gtcagtgtgt taatcttaca 120
accagaactc aattaccccc tgcatacact aattctttca cacgtggtgt ttattaccct 180
gacaaagttt tcagatcctc agttttacat tcaactcagg acttgttctt acctttcttt 240
tccaatgtta cttggttcca tgctatacat gtctctggga ccaatggtac taagaggttt 300
gataaccctg tcctaccatt taatgatggt gtttactttg cttccactga gaagtctaac 360
ataataagag gctggatttt tggtactact ttagattcga aaacccagtc cctacttatt 420
gttaataacg ctactaatgt tgttatcaaa gtctgtgaat ttcaattttg taacgatcca 480
tttttgggtg tttattacca caaaaacaac aaaagttgga tggaaagtga gttcagagtt 540
tattctagtg cgaataattg cacttttgaa tacgtctctc agccttttct tatggacctt 600
gaaggaaaac agggtaattt caaaaatctt agggaatttg tgttcaagaa tattgatggt 660
tacttcaaga tatactctaa gcacacgcct attaatttag tgcgtgatct ccctcagggt 720
ttttcggctt tagaaccatt ggtagatttg ccaataggta ttaacatcac taggtttcaa 780
actttacttg ctttacatag aagttattta actcctggtg attcttcttc aggttggaca 840
gctggtgctg cagcttatta tgtgggttat cttcaaccta ggacttttct actgaagtac 900
aatgaaaatg gaaccattac agatgctgta gactgtgcac ttgaccctct ctcagaaaca 960
aagtgtacgt tgaaatcctt cactgtagaa aaaggaatct atcaaacttc taactttaga 1020
gtccaaccaa cagaatctat tgttagattt cctaacatca caaacttgtg cccttttggt 1080
gaagttttta acgccaccag atttgcatct gtttatgctt ggaacaggaa gagaatcagc 1140
aactgtgttg ctgattattc tgtcctgtat aattccgcat cattttccac ttttaagtgt 1200
tatggagtgt ctcctactaa attaaatgat ctctgcttta ctaatgtcta tgcagattca 1260
tttgtaatta gaggtgatga agtcagacaa atcgctccag ggcaaactgg aaagattgct 1320
gattataact acaaattacc agatgatttt acaggctgcg ttatagcttg gaattctaac 1380
aatcttgatt ctaaggttgg tggtaattat aattacctgt acagattgtt taggaagtct 1440
aatctcaaac cttttgagag agatatttca actgaaatct atcaggccgg tagcacacct 1500
tgtaatggtg ttgaaggttt taattgttac tttcctctgc aatcatatgg tttccaaccc 1560
actaatggtg ttggttacca accatacaga gtagtagtac tttcttttga acttctacat 1620
gcaccagcaa ctgtttgtgg acctaaaaag tctactaatt tggttaagaa caagtgtgtc 1680
aatttcaact tcaatggttt aacaggcaca ggtgttctta ctgagtctaa caaaaagttt 1740
ctgcctttcc aacaatttgg cagagacatt gctgacacta ctgatgctgt tcgtgatcca 1800
caaacacttg agattcttga cattacacca tgttcttttg gtggtgtcag tgttataaca 1860
ccaggaacaa atacttctaa ccaggttgct gttctttatc aggatgttaa ctgcacagaa 1920
gtccctgttg ctattcatgc agatcaactt actcctactt ggcgtgttta ttctacaggt 1980
tctaatgttt ttcaaacacg tgcaggctgt ttaatagggg ctgaacatgt caacaactca 2040
tatgagtgtg acatacccat tggtgcaggt atatgcgcta gttatcagac tcagactaat 2100
tctcctcgga gagcaagaag tgtagctagt caatccatca ttgcctacac tatgtcactt 2160
ggtgcagaaa attcagttgc ttactctaat aactctattg ccatacccac aaattttact 2220
attagcgtta ccacagaaat tctaccagtg tctatgacca agacatcagt agattgtaca 2280
atgtacattt gtggtgattc aactgaatgc agcaatcttt tgttgcaata tggcagtttt 2340
tgtacacaat taaaccgtgc tttaactgga atagctgttg aacaagacaa aaacacccaa 2400
gaagtttttg cacaagtcaa acaaatttac aagacaccac caattaaaga ttttggcggt 2460
tttaatttta gccagatact gccagatcca tcaaaaccaa gcaagaggtc atttattgaa 2520
gatctactgt tcaacaaagt gacacttgca gatgctggct tcatcaaaca atatggtgat 2580
tgccttggtg atattgctgc tagagacctc atttgtgcac aaaagtttaa cggccttact 2640
gttttgccac ctttgctcac agatgaaatg attgctcaat acacttctgc actgttagca 2700
ggtacaatca cttctggttg gacttttggt gcaggtgctg cattacaaat accatttgct 2760
atgcaaatgg cttataggtt taatggtatt ggagttacac agaatgttct ctatgagaac 2820
caaaaattga ttgccaacca atttaatagt gctattggca aaattcaaga ctcactttct 2880
tccacagcaa gtgcacttgg aaaacttcaa gatgtggtca accaaaatgc acaagcttta 2940
aacacgcttg ttaaacaact tagctccaat tttggtgcaa tttcaagtgt tttaaacgac 3000
atcctttcac gtcttgacaa agttgaggct gaagtgcaaa ttgataggtt gatcacaggc 3060
agacttcaaa gtttgcagac atatgtgact caacaattaa ttagagctgc agaaatcaga 3120
gcttctgcta atcttgctgc tactaaaatg tcagagtgtg tacttggaca atcaaaaaga 3180
gttgactttt gcggaaaggg ctatcatctt atgtcatttc ctcagtcagc acctcatggt 3240
gtcgtctttt tgcatgtgac ttatgtccct gcacaagaaa agaacttcac aactgctcct 3300
gccatttgtc atgatggaaa agcacacttt cctcgtgaag gtgtctttgt ttcaaatggc 3360
acacactggt ttgtaacaca aaggaatttt tatgaaccac aaatcattac tacagacaac 3420
acatttgtgt ctggtaactg tgatgttgta ataggaattg tcaacaacac agtttatgat 3480
cctttgcaac ctgaattaga ctcattcaag gaggagcttg ataaatactt caagaaccat 3540
acctcaccag atgttgattt aggtgacatc tctggcatta atgcttcagt tgtaaacatt 3600
cagaaagaaa tcgaccgcct caatgaggtt gccaagaatt taaatgaatc tctcatcgat 3660
ctccaagaac ttggaaagta tgagcagtat ataaaatggc catggtacat ttggctaggt 3720
tttatagctg gcttgattgc catagtaatg gtgacaatta tgctttgctg tatgaccagt 3780
tgctgtagtt gtctcaaggg ctgttgttct tgtggatcct gctgcaaatt tgacgaggac 3840
gactctgagc cagtgctcaa aggagtcaaa ttacattaca cataa 3885
<210> 14
<211> 1274
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic_Spike_Protein_Sars-CoV2
<400> 14
Met Val Phe Val Phe Leu Val Leu Leu Pro Leu Val Ser Ser Gln Cys
1 5 10 15
Val Asn Leu Thr Thr Arg Thr Gln Leu Pro Pro Ala Tyr Thr Asn Ser
20 25 30
Phe Thr Arg Gly Val Tyr Tyr Pro Asp Lys Val Phe Arg Ser Ser Val
35 40 45
Leu His Ser Thr Gln Asp Leu Phe Leu Pro Phe Phe Ser Asn Val Thr
50 55 60
Trp Phe His Ala Ile His Val Ser Gly Thr Asn Gly Thr Lys Arg Phe
65 70 75 80
Asp Asn Pro Val Leu Pro Phe Asn Asp Gly Val Tyr Phe Ala Ser Thr
85 90 95
Glu Lys Ser Asn Ile Ile Arg Gly Trp Ile Phe Gly Thr Thr Leu Asp
100 105 110
Ser Lys Thr Gln Ser Leu Leu Ile Val Asn Asn Ala Thr Asn Val Val
115 120 125
Ile Lys Val Cys Glu Phe Gln Phe Cys Asn Asp Pro Phe Leu Gly Val
130 135 140
Tyr Tyr His Lys Asn Asn Lys Ser Trp Met Glu Ser Glu Phe Arg Val
145 150 155 160
Tyr Ser Ser Ala Asn Asn Cys Thr Phe Glu Tyr Val Ser Gln Pro Phe
165 170 175
Leu Met Asp Leu Glu Gly Lys Gln Gly Asn Phe Lys Asn Leu Arg Glu
180 185 190
Phe Val Phe Lys Asn Ile Asp Gly Tyr Phe Lys Ile Tyr Ser Lys His
195 200 205
Thr Pro Ile Asn Leu Val Arg Asp Leu Pro Gln Gly Phe Ser Ala Leu
210 215 220
Glu Pro Leu Val Asp Leu Pro Ile Gly Ile Asn Ile Thr Arg Phe Gln
225 230 235 240
Thr Leu Leu Ala Leu His Arg Ser Tyr Leu Thr Pro Gly Asp Ser Ser
245 250 255
Ser Gly Trp Thr Ala Gly Ala Ala Ala Tyr Tyr Val Gly Tyr Leu Gln
260 265 270
Pro Arg Thr Phe Leu Leu Lys Tyr Asn Glu Asn Gly Thr Ile Thr Asp
275 280 285
Ala Val Asp Cys Ala Leu Asp Pro Leu Ser Glu Thr Lys Cys Thr Leu
290 295 300
Lys Ser Phe Thr Val Glu Lys Gly Ile Tyr Gln Thr Ser Asn Phe Arg
305 310 315 320
Val Gln Pro Thr Glu Ser Ile Val Arg Phe Pro Asn Ile Thr Asn Leu
325 330 335
Cys Pro Phe Gly Glu Val Phe Asn Ala Thr Arg Phe Ala Ser Val Tyr
340 345 350
Ala Trp Asn Arg Lys Arg Ile Ser Asn Cys Val Ala Asp Tyr Ser Val
355 360 365
Leu Tyr Asn Ser Ala Ser Phe Ser Thr Phe Lys Cys Tyr Gly Val Ser
370 375 380
Pro Thr Lys Leu Asn Asp Leu Cys Phe Thr Asn Val Tyr Ala Asp Ser
385 390 395 400
Phe Val Ile Arg Gly Asp Glu Val Arg Gln Ile Ala Pro Gly Gln Thr
405 410 415
Gly Lys Ile Ala Asp Tyr Asn Tyr Lys Leu Pro Asp Asp Phe Thr Gly
420 425 430
Cys Val Ile Ala Trp Asn Ser Asn Asn Leu Asp Ser Lys Val Gly Gly
435 440 445
Asn Tyr Asn Tyr Leu Tyr Arg Leu Phe Arg Lys Ser Asn Leu Lys Pro
450 455 460
Phe Glu Arg Asp Ile Ser Thr Glu Ile Tyr Gln Ala Gly Ser Thr Pro
465 470 475 480
Cys Asn Gly Val Glu Gly Phe Asn Cys Tyr Phe Pro Leu Gln Ser Tyr
485 490 495
Gly Phe Gln Pro Thr Asn Gly Val Gly Tyr Gln Pro Tyr Arg Val Val
500 505 510
Val Leu Ser Phe Glu Leu Leu His Ala Pro Ala Thr Val Cys Gly Pro
515 520 525
Lys Lys Ser Thr Asn Leu Val Lys Asn Lys Cys Val Asn Phe Asn Phe
530 535 540
Asn Gly Leu Thr Gly Thr Gly Val Leu Thr Glu Ser Asn Lys Lys Phe
545 550 555 560
Leu Pro Phe Gln Gln Phe Gly Arg Asp Ile Ala Asp Thr Thr Asp Ala
565 570 575
Val Arg Asp Pro Gln Thr Leu Glu Ile Leu Asp Ile Thr Pro Cys Ser
580 585 590
Phe Gly Gly Val Ser Val Ile Thr Pro Gly Thr Asn Thr Ser Asn Gln
595 600 605
Val Ala Val Leu Tyr Gln Asp Val Asn Cys Thr Glu Val Pro Val Ala
610 615 620
Ile His Ala Asp Gln Leu Thr Pro Thr Trp Arg Val Tyr Ser Thr Gly
625 630 635 640
Ser Asn Val Phe Gln Thr Arg Ala Gly Cys Leu Ile Gly Ala Glu His
645 650 655
Val Asn Asn Ser Tyr Glu Cys Asp Ile Pro Ile Gly Ala Gly Ile Cys
660 665 670
Ala Ser Tyr Gln Thr Gln Thr Asn Ser Pro Arg Arg Ala Arg Ser Val
675 680 685
Ala Ser Gln Ser Ile Ile Ala Tyr Thr Met Ser Leu Gly Ala Glu Asn
690 695 700
Ser Val Ala Tyr Ser Asn Asn Ser Ile Ala Ile Pro Thr Asn Phe Thr
705 710 715 720
Ile Ser Val Thr Thr Glu Ile Leu Pro Val Ser Met Thr Lys Thr Ser
725 730 735
Val Asp Cys Thr Met Tyr Ile Cys Gly Asp Ser Thr Glu Cys Ser Asn
740 745 750
Leu Leu Leu Gln Tyr Gly Ser Phe Cys Thr Gln Leu Asn Arg Ala Leu
755 760 765
Thr Gly Ile Ala Val Glu Gln Asp Lys Asn Thr Gln Glu Val Phe Ala
770 775 780
Gln Val Lys Gln Ile Tyr Lys Thr Pro Pro Ile Lys Asp Phe Gly Gly
785 790 795 800
Phe Asn Phe Ser Gln Ile Leu Pro Asp Pro Ser Lys Pro Ser Lys Arg
805 810 815
Ser Phe Ile Glu Asp Leu Leu Phe Asn Lys Val Thr Leu Ala Asp Ala
820 825 830
Gly Phe Ile Lys Gln Tyr Gly Asp Cys Leu Gly Asp Ile Ala Ala Arg
835 840 845
Asp Leu Ile Cys Ala Gln Lys Phe Asn Gly Leu Thr Val Leu Pro Pro
850 855 860
Leu Leu Thr Asp Glu Met Ile Ala Gln Tyr Thr Ser Ala Leu Leu Ala
865 870 875 880
Gly Thr Ile Thr Ser Gly Trp Thr Phe Gly Ala Gly Ala Ala Leu Gln
885 890 895
Ile Pro Phe Ala Met Gln Met Ala Tyr Arg Phe Asn Gly Ile Gly Val
900 905 910
Thr Gln Asn Val Leu Tyr Glu Asn Gln Lys Leu Ile Ala Asn Gln Phe
915 920 925
Asn Ser Ala Ile Gly Lys Ile Gln Asp Ser Leu Ser Ser Thr Ala Ser
930 935 940
Ala Leu Gly Lys Leu Gln Asp Val Val Asn Gln Asn Ala Gln Ala Leu
945 950 955 960
Asn Thr Leu Val Lys Gln Leu Ser Ser Asn Phe Gly Ala Ile Ser Ser
965 970 975
Val Leu Asn Asp Ile Leu Ser Arg Leu Asp Lys Val Glu Ala Glu Val
980 985 990
Gln Ile Asp Arg Leu Ile Thr Gly Arg Leu Gln Ser Leu Gln Thr Tyr
995 1000 1005
Val Thr Gln Gln Leu Ile Arg Ala Ala Glu Ile Arg Ala Ser Ala Asn
1010 1015 1020
Leu Ala Ala Thr Lys Met Ser Glu Cys Val Leu Gly Gln Ser Lys Arg
1025 1030 1035 1040
Val Asp Phe Cys Gly Lys Gly Tyr His Leu Met Ser Phe Pro Gln Ser
1045 1050 1055
Ala Pro His Gly Val Val Phe Leu His Val Thr Tyr Val Pro Ala Gln
1060 1065 1070
Glu Lys Asn Phe Thr Thr Ala Pro Ala Ile Cys His Asp Gly Lys Ala
1075 1080 1085
His Phe Pro Arg Glu Gly Val Phe Val Ser Asn Gly Thr His Trp Phe
1090 1095 1100
Val Thr Gln Arg Asn Phe Tyr Glu Pro Gln Ile Ile Thr Thr Asp Asn
1105 1110 1115 1120
Thr Phe Val Ser Gly Asn Cys Asp Val Val Ile Gly Ile Val Asn Asn
1125 1130 1135
Thr Val Tyr Asp Pro Leu Gln Pro Glu Leu Asp Ser Phe Lys Glu Glu
1140 1145 1150
Leu Asp Lys Tyr Phe Lys Asn His Thr Ser Pro Asp Val Asp Leu Gly
1155 1160 1165
Asp Ile Ser Gly Ile Asn Ala Ser Val Val Asn Ile Gln Lys Glu Ile
1170 1175 1180
Asp Arg Leu Asn Glu Val Ala Lys Asn Leu Asn Glu Ser Leu Ile Asp
1185 1190 1195 1200
Leu Gln Glu Leu Gly Lys Tyr Glu Gln Tyr Ile Lys Trp Pro Trp Tyr
1205 1210 1215
Ile Trp Leu Gly Phe Ile Ala Gly Leu Ile Ala Ile Val Met Val Thr
1220 1225 1230
Ile Met Leu Cys Cys Met Thr Ser Cys Cys Ser Cys Leu Lys Gly Cys
1235 1240 1245
Cys Ser Cys Gly Ser Cys Cys Lys Phe Asp Glu Asp Asp Ser Glu Pro
1250 1255 1260
Val Leu Lys Gly Val Lys Leu His Tyr Thr
1265 1270
<210> 15
<211> 21746
<212> DNA
<213> Artificial Sequence
<220>
<223> COVAX_Syn_RepA56
<400> 15
gtataagagt gattggcgtc cgtacgtacc ctctcaactc taaaactctt gtagtttaaa 60
tctaatctaa actttataaa cggcacttcc tgcgtgtcca tgcccgcggg cctggtcttg 120
tcatagtgct gacatttgta gttccttgac tttcgttctc tgccagtgac gtgtccattc 180
ggcgccagca gcccacccat aggttgcata atggcaaaga tgggcaaata cggcctgggc 240
ttcaaatggg ccccagaatt tccatggatg cttccgaacg catcggagaa gttgggtaac 300
cctgagaggt cagaggagga tgggttttgc ccctctgctg cgcaagaacc gaaagttaaa 360
ggaaaaactt tggttaatca cgtgagggtg aattgtagcc ggcttccagc tttggaatgc 420
tgtgttcagt ctgccataat ccgtgatatt tttgtagatg aggatcccca gaaggtggag 480
gcctcaacta tgatggcatt gcagttcggt agtgccgtct tggttaagcc atccaagcgc 540
ttgtctattc aggcatggac taatttgggt gtgcttccca aaacagctgc catggggttg 600
ttcaagcgcg tctgcctgtg taacaccagg gagtgctctt gtgacgccca cgtggccttt 660
caccttttta cggtccaacc cgatggtgta tgcctgggta atggccgttt tataggctgg 720
ttcgttccag tcacagccat accggagtat gcgaagcagt ggttgcaacc ctggtccatc 780
cttcttcgta agggtggtaa caaagggtct gtgacatccg gccacttccg ccgcgctgtt 840
accatgcctg tgtatgactt taatgtagag gatgcttgtg aggaggttca tcttaacccg 900
aagggtaagt actcctgcaa ggcgtatgcc ctgctgaagg gctatcgcgg tgttaagccc 960
atcctgtttg tggaccagta tggttgcgac tatactggat gtctcgccaa gggtcttgag 1020
gactatggcg atctcacctt gagtgagatg aaggagttgt tccctgtgtg gcgtgactcc 1080
ttggatagtg aagtccttgt ggcttggcac gttgatcgag atcctcgggc tgctatgcgt 1140
ctgcagactc ttgctactgt acgttgcatt gattatgtgg gccaaccgac cgaggatgtg 1200
gtggatggag atgtggtagt gcgtgagcct gctcatcttc tcgcagccaa tgccattgtt 1260
aaaagactcc cccgtttggt ggagactatg ctgtatacgg attcgtccgt tacagaattc 1320
tgttataaaa ccaagctgtg tgaatgcggt tttatcacgc agtttggcta tgtggattgt 1380
tgtggtgaca cctgtgattt tcgtgggtgg gttgccggca atatgatgga tggctttcca 1440
tgtccagggt gtaccaaaaa ttatatgccc tgggaattgg aggcccagtc atcaggtgtt 1500
ataccagaag gaggtgttct attcactcag agcactgata cagtgaatcg tgagtccttt 1560
aagctctacg gtcatgctgt tgtgcctttt ggttctgctg tgtattggag cccttgccca 1620
ggtatgtggc ttccagtaat ttggtcgtcg gttaagtcat actctggttt gacttataca 1680
ggagtagttg gttgtaaggc aattgttcaa gagacagacg ctatatgtcg ttctctgtat 1740
atggattatg tccagcacaa gtgtggcaat ctcgagcaga gagctatcct tggattggac 1800
gatgtctatc atagacagtt gcttgtgaat aggggtgact atagtctcct ccttgagaat 1860
gtggatttgt ttgttaagcg gcgcgctgaa tttgcttgca aattcgccac ctgtggagat 1920
ggtcttgtac ccctcctact agatggttta gtgccccgca gttattattt gattaagagt 1980
ggtcaagctt tcacctctat gatggttaat tttagccatg aggtgactga catgtgtatg 2040
gacatggctt tattgttcat gcatgatgtt aaagtggcca ctaagtatgt taagaaggtt 2100
actggcaaac tggccgtgcg ctttaaagcg ttgggtgtag ccgttgtcag aaaaattact 2160
gaatggtttg atttagccgt ggacattgct gctagtgccg ctggatggct ttgctaccag 2220
ctggtaaatg gcttatttgc agtggccaat ggtgttataa cctttgtaca ggaggtgcct 2280
gagcttgtca agaattttgt tgacaagttc aaggcatttt tcaaggtttt gatcgactct 2340
atgtcggttt ctatcttgtc tggacttact gttgtcaaga ctgcctcaaa tagggtgtgt 2400
cttgctggca gtaaggttta tgaagttgtg cagaaatctt tgtctgcata tgttatgcct 2460
gtgggttgca gcgaagccac ttgtttggtg ggtgagattg aacctgcagt ttttgaagat 2520
gatgttgttg atgtggttaa agccccatta acatatcaag gctgttgtaa gccacccact 2580
tctttcgaga agatttgtat tgtggataaa ttgtatatgg ccaagtgtgg tgatcaattt 2640
taccctgtgg ttgttgataa cgacactgtt ggcgtgttag atcagtgctg gaggtttccc 2700
tgtgcgggca agaaagtcga gtttaacgac aagcccaaag tcaggaagat accctccacc 2760
cgtaagatta agatcacctt cgcactggat gcgacctttg atagtgttct ttcgaaggcg 2820
tgttcagagt ttgaagttga taaagatgtt acattggatg agctgcttga tgttgtgctt 2880
gacgcagttg agagtacgct cagcccttgt aaggagcatg atgtgatagg cacaaaagtt 2940
tgtgctttac ttgataggtt ggcaggagat tatgtctatc tttttgatga gggaggcgat 3000
gaagtgatcg ccccgaggat gtattgttcc ttttctgctc ctgatgacga ggactgcgtt 3060
gcagcggatg ttgtagatgc agatgaaaac caagatgatg atgccgagga ctcagcagtc 3120
cttgtcgctg atacccaaga agaggacggc gttgccaagg ggcaggttga ggcggattcg 3180
gaaatttgcg ttgcgcatac tggtagtcaa gaagaattgg ctgagcctga tgctgtcgga 3240
tctcaaactc ccatcgcctc tgctgaggaa accgaagtcg gagaggcaag cgacagggaa 3300
gggattgctg aggcgaaggc aactgtgtgt gctgatgctg tagatgcctg ccccgatcaa 3360
gtggaggcat ttgaaattga aaaggtcgag gactctatct tggatgagct tcaaactgaa 3420
cttaatgcgc cagcggacaa gacctatgag gatgtcttgg cattcgatgc cgtatgctca 3480
gaggcgttgt ctgcattcta tgctgtgccg agtgatgaga cgcactttaa agtgtgtgga 3540
ttctattcgc ctgctataga gcgcactaat tgttggctgc gttctacttt gatagtaatg 3600
cagagtctac ctttggaatt taaagacttg gagatgcaaa agctctggtt gtcttacaag 3660
gccggctatg accaatgctt tgtggacaaa ctagttaaga gcgtgcccaa gtctattatc 3720
cttccacaag gtggttatgt ggcagatttt gcctatttct ttctaagcca gtgtagcttt 3780
aaagcttatg ctaactggcg ttgtttagag tgtgacatgg agttaaagct tcaaggcttg 3840
gacgccatgt ttttctatgg ggacgttgtg tctcatatgt gcaagtgtgg taatagcatg 3900
accttgttgt ctgcagatat accctacact ttgcattttg gagtgcgaga tgataagttt 3960
tgcgcttttt acacgccaag aaaggtcttt agggctgctt gtgcggtaga tgttaatgat 4020
tgtcactcta tggctgtagt agagggcaag caaattgatg gtaaagtggt taccaaattt 4080
attggtgaca aatttgattt tatggtgggt tacgggatga catttagtat gtctcctttt 4140
gaactcgccc agttatatgg ttcatgtata acaccaaatg tttgttttgt taaaggagat 4200
gttataaagg ttgttcgctt agttaatgct gaagtcattg ttaaccctgc taatgggcgt 4260
atggctcatg gtgccggcgt cgccggcgcc atagctgaaa aggcgggcag tgcttttatt 4320
aaagaaacct ccgatatggt gaaggctcag ggcgtttgcc aggttggtga atgctatgaa 4380
tctgccggtg gtaagttatg taaaaaggtg cttaacattg tagggccaga tgcgcgaggg 4440
catggcaagc aatgctattc acttttagag cgtgcttatc agcatattaa taagtgtgac 4500
aatgttgtca ctactttaat ttcggctggt atatttagtg tgcctactga tgtctcccta 4560
acttacttac ttggtgtagt gacaaagaat gtcattcttg tcagtaacaa ccaggatgat 4620
tttgatgtga tagagaagtg tcaggtgacc tccgttgctg gtaccaaagc gctatcactt 4680
caattggcca aaaatttgtg ccgtgatgta aagtttgtga cgaatgcatg tagttcgctt 4740
tttagtgaat cttgctttgt ctcaagctat gatgtgttgc aggaagttga agcgctgcga 4800
catgatatac aattggatga tgatgctcgt gtctttgtgc aggctaatat ggactgtctg 4860
cccacagact ggcgtctcgt taacaaattt gatagtgttg atggtgttag aaccattaag 4920
tattttgaat gcccgggcgg gatttttgta tccagccagg gcaaaaagtt tggttatgtt 4980
cagaatggtt catttaagga ggcgagtgtt agccaaataa gggctttact cgctaataag 5040
gttgatgtct tgtgtactgt tgatggtgtt aacttccgct cctgctgcgt agcagagggt 5100
gaagtttttg gcaagacatt aggttcagtc ttttgtgatg gcataaatgt caccaaagtt 5160
aggtgtagtg ccatttacaa gggtaaggtt ttctttcagt acagtgattt gtccgaggca 5220
gatcttgtgg ctgttaaaga tgcctttggt tttgatgaac cacaactgct gaagtactac 5280
actatgcttg gcatgtgtaa gtggccagta gttgtttgtg gcaattattt tgctttcaag 5340
cagtcaaata ataattgcta catcaacgtg gcatgtttaa tgctgcaaca cttgagttta 5400
aagtttccta agtggcaatg gcaagaggct tggaacgagt tccgctctgg taaaccacta 5460
aggtttgtgt ccttggtatt agcaaagggc agctttaaat ttaatgaacc ttctgattct 5520
atcgatttta tgcgtgtggt gctacgtgaa gcagatttga gtggtgccac gtgcaatttg 5580
gaatttgttt gtaaatgtgg tgtgaagcaa gagcagcgca aaggtgttga cgctgttatg 5640
cattttggta cgttggataa aggtgatctt gtcaggggtt ataatatcgc atgtacgtgc 5700
ggtagtaaac ttgtgcattg cacccaattt aacgtaccat ttttaatttg ctccaacaca 5760
ccagagggta ggaaactgcc cgacgatgtt gttgcagcta atatttttac tggtggtagt 5820
gtgggccatt acacgcatgt gaaatgtaaa cccaagtacc agctttatga tgcttgtaat 5880
gttaataagg tttcggaggc taagggtaat tttaccgatt gcctctacct taaaaattta 5940
aagcaaacct tctcgtctgt gctgacgact ttttatttag atgacgtaaa gtgtgtggag 6000
tataagccag atttatcgca gtattactgt gagtctggta aatattatac aaaacccatt 6060
attaaggccc aatttagaac atttgagaag gttgatggtg tctataccaa ctttaaattg 6120
gtgggacata gtattgctga aaaactcaat gctaagctgg gatttgattg taattctccc 6180
tttgtggagt ataaaattac agagtggcca acagctactg gagatgtggt gttggctagt 6240
gatgatttgt atgtaagtcg gtacttaagc gggtgcatta cttttggtaa accggttgtc 6300
tggcttggcc atgaggaagc atcgctgaaa tctctcacat attttaatag acctagtgtc 6360
gtttgtgaaa ataaatttaa cgtgttgccc gttgatgtca gtgaacccac ggacaagggg 6420
cctgtgcctg ctgcagtcct tgttaccggc gtccctggag ctgatgcgtc agctggtgcc 6480
ggtattgcca aggagcaaaa agcctgtgct tctgctagtg tggaggatca ggttgttacg 6540
gaggttcgtc aagagccatc tgtttcagct gctgatgtca aagaggttaa attgaatggt 6600
gttaaaaagc ctgttaaggt ggaaggtagt gtggttgtta atgatcccac tagcgaaacc 6660
aaagttgtta aaagtttgtc tattgttgat gtctatgata tgttcctgac agggtgtaag 6720
tatgtggttt ggactgctaa tgagttgtct cgactagtaa attcaccgac tgttagggag 6780
tatgtgaagt ggggtatggg aaagattgta acacccgcta agttgttgtt gttaagagat 6840
gagaagcaag agttcgtagc gccaaaagta gtcaaggcga aagctattgc ctgctattgt 6900
gctgtgaagt ggtttctcct ctattgtttt agttggataa agtttaatac tgacaataag 6960
gttatataca ccacagaagt agcttcaaag cttactttca agttgtgctg tttggccttt 7020
aagaatgcct tacagacgtt taattggagc gttgtgtcta ggggcttttt cctagttgca 7080
acggtctttt tactctggtt taactttttg tatgctaatg ttattttgag tgacttctat 7140
ttgcctaata ttgggcctct ccctacgttt gtgggacaga tagttgcgtg gtttaagact 7200
acatttggtg tgtcaaccat ctgtgatttc taccaggtga cggatttggg ctatagaagt 7260
tcgttttgta atggaagtat ggtatgtgaa ctatgcttct caggttttga tatgctggac 7320
aactatgatg ctataaatgt tgttcaacac gttgtagata ggcgtttgtc ctttgactat 7380
attagcctat ttaaactggt agttgagctt gtaatcggct actctcttta tactgtgtgc 7440
ttctacccac tgtttgtcct tattggaatg cagttattga ccacatggtt gcctgaattc 7500
tttatgctgg agactatgca ttggagtgct cgtttgtttg tgtttgttgc caatatgctt 7560
ccagctttta cgttactgcg attttacatc gtggtgacag ctatgtataa ggtctattgt 7620
ctttgtagac atgttatgta tggatgtagt aagcctggtt gcttgttttg ttataagaga 7680
aaccgtagtg tccgtgttaa gtgtagcacc gttgttggtg gttcactacg ctattacgat 7740
gtaatggcta acggcggcac aggtttctgt acaaagcacc agtggaactg tcttaattgc 7800
aattcctgga aaccaggcaa tacattcata actcatgaag cagcggcgga cctctctaag 7860
gagttgaaac gccctgtgaa tccaacagat tctgcttatt actcggtcac agaggttaag 7920
caggttggtt gttccatgcg tttgttctac gagagagatg gacagcgtgt ttatgatgat 7980
gttaatgcta gtttgtttgt ggacatgaat ggtctgctgc attctaaagt taaaggtgtg 8040
cctgaaacgc atgttgtggt tgttgagaat gaagctgata aagctggttt tctcggcgcc 8100
gcagtgtttt atgcacaatc gctctacaga cctatgttga tggtggaaaa gaaattaata 8160
actaccgcca acactggttt gtctgttagt cgaactatgt ttgaccttta tgtagattca 8220
ttgctgaacg tcctcgacgt ggatcgcaag agtctaacaa gttttgtaaa tgctgcgcac 8280
aactctctaa aggagggtgt tcagcttgaa caagttatgg atacctttat tggctgtgcc 8340
cgacgtaagt gtgctataga ttctgatgtt gaaaccaagt ctattaccaa gtccgtcatg 8400
tcggcagtaa atgctggcgt tgattttacg gatgagagtt gtaataactt ggtgcctacc 8460
tatgttaaaa gtgacactat cgttgcagcc gatttgggtg ttcttattca gaataatgct 8520
aagcatgtac aggctaatgt tgctaaagcc gctaatgtgg cttgcatttg gtctgtggat 8580
gcttttaacc agctatctgc tgacttacag cataggctgc gaaaagcatg ttcaaaaact 8640
ggcttgaaga ttaagcttac ttataataag caggaggcaa atgttcctat tttaactaca 8700
ccgttctctc ttaaaggggg cgctgttttt agtagaatgt tacaatggtt gtttgttgct 8760
aatttgattt gtttcattgt gttgtgggcc cttatgccaa catatgcagt gcacaaatcg 8820
gatatgcagt tgcctttata tgccagtttt aaagttatag ataacggtgt gctaagggat 8880
gtgtctgtta ctgacgcatg cttcgcaaac aaatttaatc aattcgacca atggtatgag 8940
tctacttttg gtcttgctta ttaccgcaac tctaaggctt gtcctgttgt ggttgctgta 9000
atagatcaag acattggcca taccttattt aatgttccta ccacagtttt aagatatgga 9060
tttcatgtgt tgcattttat aacccatgca tttgctactg atagcgtgca gtgttacacg 9120
ccacatatgc aaatccccta tgataatttc tatgctagtg gttgcgtgtt gtcatccctc 9180
tgtactatgc ttgcgcatgc agatggaacc ccgcatcctt attgttatac agggggtgtt 9240
atgcataatg cctctctgta tagttctttg gctcctcatg tccgttataa cctggctagt 9300
tcaaatggtt atatacgttt tcccgaagtg gttagtgaag gcattgtgcg tgttgtgcgc 9360
actcgctcta tgacctactg cagggttggt ttatgtgagg aggccgagga gggtatctgc 9420
tttaatttta atcgttcatg ggtattgaac aacccgtatt atagggccat gcctggaact 9480
ttttgtggta ggaatgcttt tgatttaata catcaagttt taggaggatt agtgcggcct 9540
attgatttct ttgccttaac ggcgagttca gtggctggtg ctatccttgc aattattgtc 9600
gttttggctt tctattattt aatcaagctt aagcgtgcct ttggtgacta cactagtgtt 9660
gtggttatca atgtaattgt gtggtgtata aattttctga tgctttttgt gtttcaggtt 9720
tatcccacat tgtcttgttt atatgcttgt ttctacttct acaccacgct ttatttccct 9780
tcggagataa gtgttgttat gcatttgcaa tggcttgtca tgtatggtgc tattatgccc 9840
ttgtggtttt gcattattta cgtggcagtc gttgtttcaa accatgcatt gtggttgttc 9900
tcttactgcc gcaaaattgg taccgaggtt cgtagtgacg gcacatttga ggaaatggcc 9960
cttactacct ttatgattac taaagaatct tattgtaagt tgaaaaactc tgtttctgat 10020
gttgctttta acaggtactt gagtctttac aacaagtacc gttacttcag tggcaaaatg 10080
gatactgccg cttatagaga ggctgcctgt tcacaactgg caaaggcaat ggaaacattt 10140
aaccataata atggtaatga tgttctctat cagcctccaa ccgcctctgt tactacatca 10200
tttttacagt ctggtatagt gaagatggtg tcgcccacct ctaaagtgga gccttgtatt 10260
gttagtgtta cttatggtaa catgacactt aatgggttgt ggttggatga taaagtttat 10320
tgcccaagac atgttatctg ttcttcagct gacatgacag accctgatta tcctaatttg 10380
ctttgtagag tgacatcaag tgatttttgt gttatgtctg gtcgtatgag ccttactgta 10440
atgtcttatc aaatgcaggg ctgccaactt gttttgactg ttacactgca aaatcctaac 10500
acgcctaagt attccttcgg tgttgttaag cctggtgaga catttactgt actggctgca 10560
tacaatggca gacctcaagg agccttccat gttacgcttc gtagtagcca taccataaag 10620
ggctcctttc tatgtggatc ctgcggttct gtaggatatg ttttaactgg cgatagtgta 10680
cgatttgttt atatgcatca gctagagttg agtactggtt gtcataccgg tactgacttt 10740
agtgggaact tttatggtcc ctatagagat gcgcaagttg tacaattgcc tgttcaggat 10800
tatacgcaga ctgttaatgt tgtagcttgg ctttatgctg ctatttttaa cagatgcaac 10860
tggtttgtgc aaagtgatag ttgttccctg gaggagttta atgtttgggc tatgaccaat 10920
ggttttagct caatcaaagc cgatcttgtc ttggatgcgc ttgcttctat gacaggcgtt 10980
acagttgaac aggtgttggc cgctattaag aggctgcatt ctggattcca gggcaaacaa 11040
attttaggta gttgtgtgct tgaagatgag ctgacaccaa gtgatgttta tcaacaacta 11100
gctggtgtca agctacagtc aaagcgcaca agagttataa aaggtacatg ttgctggata 11160
ttggcttcaa cgtttttgtt ctgtagcatt atctcagcat ttgtaaaatg gactatgttt 11220
atgtatgtta ctacccatat gttgggagtg acattgtgtg cactttgttt tgtaagcttt 11280
gctatgttgt tgatcaagca taagcatttg tatttaacta tgtacatcat gcctgtgtta 11340
tgcacactgt tttacaccaa ctatttggtt gtgtacaaac agagttttag aggtctagct 11400
tatgcttggc tttcacactt tgtccctgct gtagattata catatatgga tgaagtttta 11460
tatggtgttg tgttgctagt agctatggtg tttgttacca tgcgtagcat aaaccacgac 11520
gtcttttcta ttatgttctt ggttggtaga cttgtcagcc tggtatccat gtggtatttt 11580
ggagccaatt tagaggaaga ggtactattg ttcctcacat ccctatttgg cacgtacaca 11640
tggactacta tgttgtcatt ggctaccgct aaggttattg ctaaatggtt ggctgtgaat 11700
gtcttgtact tcacagacgt accgcaaatt aaattagttc tgttgagcta cttgtgtatt 11760
ggttatgtgt gttgttgtta ttggggaatc ttgtcactcc ttaatagcat ttttaggatg 11820
ccattgggcg tctacaatta taaaatctcc gttcaggagt tacgttatat gaatgctaat 11880
ggcttgcgcc cacctagaaa tagttttgag gccctgatgc ttaattttaa gctgttggga 11940
attggtggtg tgccagtcat tgaagtatct caaattcaat caagattgac ggatgttaaa 12000
tgtgctaatg ttgtgttgct taattgcctc cagcacttgc atattgcatc taattctaag 12060
ttgtggcagt attgtagtac tttgcacaat gaaatactgg ctacatctga tttgagcgtg 12120
gccttcgata agttggctca actcttagtt gttttatttg ctaatccagc agcagtggat 12180
agcaagtgcc ttgcaagtat tgaagaagtg agcgatgatt acgttcgcga caatactgtc 12240
ttgcaagcct tacagagtga atttgttaat atggctagct tcgttgagta tgaacttgct 12300
aagaagaatc tagatgaggc taaggctagc ggctctgcca atcaacagca gattaagcag 12360
ctagagaagg cgtgtaatat tgctaagtca gcatatgagc gcgacagagc tgttgctcgt 12420
aagctggaac gtatggctga tttagctctt acaaacatgt ataaagaagc tagaattaat 12480
gataagaaga gtaaggtagt gtctgcattg caaaccatgc tctttagtat ggtgcgtaag 12540
ctagataacc aagctcttaa ttctatttta gacaacgcag ttaagggttg tgtacctttg 12600
aatgcaatac catcattgac ttcgaacact ctgactataa tagtgccaga taagcaggtt 12660
tttgatcagg ttgtggataa tgtgtatgtc acctatgctg ggaatgtatg gcatatacag 12720
tttattcaag atgctgatgg tgctgttaaa caattgaatg agatagatgt taattcaacc 12780
tggcctctag tcattgctgc aaataggcat aatgaagtgt ctactgttgt tttgcagaac 12840
aatgagttga tgcctcagaa gttgagaact caggttgtca atagtggctc agatatgaat 12900
tgtaatactc ctacccagtg ttactataat actactggca cgggtaagat tgtgtatgct 12960
atacttagtg actgtgacgg cctgaagtac actaagatag taaaagaaga tggaaattgt 13020
gttgttttgg aattggatcc tccctgtaag ttttctgttc aggatgtgaa gggccttaaa 13080
attaagtacc tttactttgt gaaggggtgt aatacactgg ctagaggctg ggttgtaggc 13140
accttatcct cgacagtgag attgcaggcg ggtacggcaa ctgagtatgc ctccaactct 13200
gcaatactgt cgctgtgtgc gttttctgta gatcctaaga aaacgtactt ggattatata 13260
aaacagggtg gagttcccgt tactaattgt gttaagatgt tatgtgacca tgctggcact 13320
ggtatggcca ttactattaa gccggaggca accactaatc aggattctta tggtggtgct 13380
tccgtttgta tatattgccg ctcgcgtgtt gaacatccag atgttgatgg attgtgcaaa 13440
ttacgcggca agtttgtcca agtgccctta ggcataaaag atcctgtgtc atatgtgttg 13500
acgcatgatg tttgtcaggt ttgtggcttt tggcgagatg gtagctgttc ctgtgtaggc 13560
acaggctccc agtttcagtc aaaagacacg aactttttaa acggattcgg ggtacaagtg 13620
taaatgcccg tcttgtaccc tgtgccagtg gcttggacac tgatgttcaa ttaagggcat 13680
ttgacatttg taatgctaat cgagctggca ttggtttgta ttataaagtg aattgctgcc 13740
gcttccagcg tgtagatgag gacggcaaca agttggataa gttctttgtt gttaaaagaa 13800
ctaatttaga agtgtataac aaggagaaag aatgctatga gttgacaaaa gaatgcggtg 13860
ttgtggctga acacgagttc ttcacatttg atgtggaggg aagtcgggta ccacacatag 13920
tccgtaaaga tctttcaaag tttactatgt tagatctttg ctatgcattg cgtcattttg 13980
accgcaatga ttgttcaact cttaaggaaa ttctccttac atatgctgag tgtgaagagt 14040
cctacttcca aaagaaggac tggtatgatt ttgttgagaa tcctgatata attaatgtgt 14100
acaagaagct tggtcctata tttaatagag ccctgcttaa cactgccaag tttgcagacg 14160
cattagtgga ggcaggctta gtaggtgttt taacacttga taatcaagat ttatatggtc 14220
aatggtatga ctttggagat tttgtcaaga cagtacctgg ttgtggtgtt gccgtggcag 14280
actcttatta ttcatatatg atgccaatgc tgactatgtg tcatgcgttg gatagtgagt 14340
tgtttgttaa tggtacttat agggagtttg accttgttca gtatgatttt actgatttca 14400
agctagagct gttcactaag tattttaagc attggagtat gacctaccac ccgaacacct 14460
gtgagtgcga ggatgacagg tgcattattc attgcgccaa ttttaatata cttttcagca 14520
tggtcttacc taagacctgt tttgggcctc ttgttaggca gatatttgtg gatggtgttc 14580
ctttcgttgt gtcgatcggt taccattata aagaattagg tgttgttatg aatatggatg 14640
tggatacaca tcgttatcgc ttgtctctta aggacttgct tttgtatgct gcagaccctg 14700
cccttcatgt ggcgtctgct agtgcactgc ttgatttgcg cacatgttgt tttagcgttg 14760
cagctattac aagtggcgta aaatttcaaa cagttaaacc tggaaatttt aatcaggatt 14820
tctacgagtt tattttgagt aaaggcctgc ttaaagaggg gagctccgtt gatttgaagc 14880
acttcttctt tacgcaggat ggtaatgctg ctattactga ttacaattac tacaagtata 14940
atctacccac catggtggat attaagcagt tgttgtttgt tttagaagtt gttaataagt 15000
acttcgagat ctatgagggt gggtgtatac ccgcaacaca ggtcattgtt aataattatg 15060
acaagagtgc tggctatcca tttaataaat ttggaaaggc caggctctat tatgaggcat 15120
tatcatttga ggagcaggat gaaatttatg cgtataccaa acgcaatgtc ctgccgaccc 15180
taactcaaat gaatcttaaa tatgctatta gtgctaagaa tagggcccgc accgttgctg 15240
gtgtctctat tctcagtact atgactggca gaatgtttca tcaaaagtgt ctaaagagta 15300
tagcagctac tcgcggtgtt cctgtagtta taggcaccac gaagttctat ggcggttggg 15360
atgatatgtt acgccgcctt attaaagatg ttgatagtcc tgtactcatg ggttgggact 15420
atcctaaatg tgatcgtgct atgccaaaca tactgcgtat tgttagtagt ttggtgctag 15480
cccgtaaaca tgattcgtgc tgttcgcata cggatagatt ctatcgtctt gcgaacgagt 15540
gcgcccaagt tttgagtgaa attgttatgt gtggtggttg ttattatgtt aaaccaggtg 15600
gcactagtag tggggatgca accactgctt ttgctaattc tgtgtttaac atttgtcaag 15660
ctgtttccgc caatgtatgc tcgcttatgg catgcaatgg acacaaaatt gaagatttga 15720
gtatacgcga gttacaaaag cgcctatact ctaatgtcta tcgtgcggac catgttgacc 15780
ccgcatttgt tagtgagtat tatgagtttt taaacaagca ttttagtatg atgattttga 15840
gtgatgatgg tgttgtgtgt tataattcag agtttgcgtc caagggttat attgctaata 15900
taagtgcctt tcaacaggta ttatattatc aaaacaacgt gtttatgtct gaggccaaat 15960
gttgggtaga aacagacatc gaaaagggac cgcatgaatt ttgttctcaa catacaatgc 16020
tagtcaagat ggatggtgat gaagtctacc ttccataccc tgatccttcg agaatcttag 16080
gagcaggctg ttttgttgat gatttactca agactgatag cgttctcttg atagagcgtt 16140
tcgtaagtct tgcaattgat gcttatcctt tagtatacca tgagaaccca gagtatcaaa 16200
atgtgttccg ggtatattta gaatacatca agaagctgta caatgatctc ggtaatcaga 16260
tcctggacag ctacagtgtt attttaagta cttgtgatgg tcaaaagttt actgacgaga 16320
cgttttacaa gaacatgtat ttaagaagtg cagtgctgca aagcgttggt gcctgcgttg 16380
tctgtagttc tcaaacatca ttacgttgtg gcagttgcat acgcaagcct ttgctgtgtt 16440
gcaaatgcgc ctatgatcat gttatgtcca ctgatcataa atatgtcctg agtgtgtcac 16500
catatgtgtg taattcaccg ggatgtgatg taaatgatgt taccaaattg tatttaggtg 16560
gtatgtcata ttattgtgag gaccataaac cacagtattc attcaaattg gtgatgaatg 16620
gtatggtttt tggtttatat aagcagtctt gtactggttc gccctacata gaggatttta 16680
ataaaatcgc tagttgcaaa tggacagaag tcgatgatta tgtgctagct aatgaatgca 16740
ccgaacgcct taaattgttt gccgcagaaa cgcagaaggc cacagaagag gcctttaagc 16800
aatgttatgc gtcagcaacg atccgtgaga tcgtgagcga tcgggagtta attttatctt 16860
gggaaattgg taaagtccgc ccgccactta ataaaaatta cgtgttcacc ggctaccatt 16920
ttactaataa tggtaagaca gttttaggtg agtatgtttt tgataagagt gagttgacta 16980
atggtgtgta ttatcgcgcc acaaccactt ataagttatc tgtaggtgat gtgttcattt 17040
taacatcaca cgcagtgtct agtttaagtg ctcctacatt agtaccgcag gagaattata 17100
ctagcattcg ttttgctagt gtttatagtg tgcctgagac gtttcagaat aatgtgccta 17160
attatcagca cattggaatg aagcgctatt gtactgtaca gggaccgcct ggtactggta 17220
agtcccatct agccattggg ctagctgttt attattgtac agcgcgcgtg gtgtataccg 17280
ctgctagcca tgctgcagtt gacgcgctgt gtgaaaaggc acataaattt ctcaacatca 17340
acgactgcac gcgtattgtt cctgcaaagg tgcgtgtaga ttgttatgat aaattcaagg 17400
tcaatgacac cactcgcaag tatgtgttta ctacaataaa tgcattacct gagttggtga 17460
ctgacattat tgtcgttgat gaagttagta tgcttaccaa ctatgagctg tctgttatta 17520
acagtcgtgt tagggctaag cattatgtgt atattggcga cccggcgcag ttacctgcac 17580
cacgtgtgct actgaataag ggaactctag aacctagata ttttaattcc gttaccaagc 17640
taatgtgttg tttgggtcca gatattttct tgggcacctg ttatagatgc cctaaggaga 17700
ttgtggatac ggtgtcagcc ttggtttata ataataagct gaaggctaaa aatgataata 17760
gctccatgtg ctttaaggtt tattataagg gccagactac acatgagagt tctagtgctg 17820
ttaatatgca gcaaatacat ttaatttcca agtttctgaa ggcaaacccc agttggagta 17880
acgccgtatt tattagtcct tataactcgc agaactatgt tgctaagaga gtcttgggat 17940
tacaaaccca gacagtagac tcagcgcagg gttctgaata tgattttgtt atctactcac 18000
agactgcgga aacagcgcat tctgtcaatg taaatagatt caatgttgct attacacgtg 18060
ctaagaaggg tattctctgt gtcatgagta gtatgcaatt atttgagtct cttaatttta 18120
ctacactgac gttggataag attaacaatc cacgattaca gtgtactaca aatttgttta 18180
aggattgtag caggagctat gtaggatatc acccagccca tgcaccatcc tttttggcag 18240
ttgatgacaa atataaggta ggcggtgatt tagccgtttg ccttaatgtt gctgattctg 18300
ctgtcactta ttcgcggctt atatcactca tgggattcaa gcttgacttg acccttgatg 18360
gttattgtaa gctgtttata actagagatg aagctatcaa acgtgttaga gcctgggttg 18420
gcttcgatgc agaaggtgcc catgcgatac gtgatagcat tgggacaaat ttcccattac 18480
aattaggctt ttcgactgga attgattttg ttgtcgaagc cactggaatg tttgctgaga 18540
gagatggtta tgtctttaaa aaggcagccg cacgagctcc tcctggcgaa caatttaaac 18600
accttatccc acttatgtca agagggcaga aatgggatgt ggttcgcatt agaatagtac 18660
aaatgttgtc agaccaccta gtggatttgg cagacagtgt tgtacttgtg acgtgggctg 18720
ccagctttga gctcacatgt ttgcgatatt tcgctaaagt tggaagagaa gttgtgtgta 18780
gtgtctgcac caagcgtgcg acatgtttta attctagaac tggatactat ggatgctggc 18840
gacatagtta ttcctgtgat tacctgtaca acccactaat agttgacatt caacagtggg 18900
gatatacagg atctttaact agcaatcatg atcctatttg cagcgtgcat aagggtgctc 18960
atgttgcatc atctgatgct atcatgaccc ggtgtctagc tgttcatgat tgcttttgta 19020
agtctgttaa ttggaattta gaatacccca ttatttcaaa tgaggtcagt gttaatacct 19080
cctgcaggtt attgcagcgc gtaatgttta gggctgcgat gctatgcaat aggtatgatg 19140
tgtgttatga cattggcaac cctaaaggtc ttgcctgtgt caaaggatat gattttaagt 19200
tctatgacgc ctcccctgtt gttaagtctg ttaaacagtt tgtttacaaa tacgaggcac 19260
ataaagatca atttttagat ggtttgtgta tgttttggaa ctgcaatgtg gataagtatc 19320
cagcgaatgc agttgtgtgt aggtttgaca cgcgtgtgtt gaacaaatta aatctccctg 19380
gctgtaatgg tggcagtttg tatgttaaca aacatgcatt ccacaccagt ccctttaccc 19440
gggctgcctt cgagaatttg aagcctatgc ctttctttta ttattcagat acgccctgtg 19500
tgtatatgga aggcatggaa tctaagcagg tcgattatgt cccattgaga agcgctacat 19560
gcatcacaag atgcaattta ggtggcgctg tttgtttaaa acatgctgag gagtatcgtg 19620
agtaccttga gtcttacaat acggcaacca cagcgggttt tactttttgg gtctataaga 19680
cttttgattt ttacaacctt tggaatactt ttactaggct ccaaagttta gaaaatgtag 19740
tgtataacct ggtcaacgct ggacactttg atggccgggc gggtgaactg ccttgtgctg 19800
ttataggtga gaaagtcatt gccaagattc aaaatgagga tgtcgtggtc tttaaaaata 19860
acacgccatt ccccactaat gtggctgtcg aattatttgc taagcgcagt attcggcccc 19920
accccgagct taagctcttt agaaatttga atattgacgt gtgctggagt cacgtccttt 19980
gggattatgc taaggatagt gtgttttgca gttcgacgta taaggtctgc aaatacacag 20040
atttacagtg cattgaaagc ttgaatgtac tttttgatgg tcgtgataat ggtgctcttg 20100
aagcttttaa gaagtgccgg aatggcgtct acattaacac gacaaaaatt aaaagtctgt 20160
cgatgattaa aggcccacaa cgtgccgatt tgaatggcgt agttgtggag aaagttggag 20220
attctgatgt ggaattttgg tttgctgtgc gtaaagacgg tgacgatgtt atcttcagcc 20280
gtacagggag ccttgaaccg agccattacc ggagcccaca aggtaatccg ggtggtaatc 20340
gcgtgggtga tctcagcggt aatgaagctc tagcgcgtgg cactatcttt actcaaagca 20400
gattattatc ttctttcaca cctcgatcag agatggagaa agattttatg gatttagatg 20460
atgatgtgtt cattgcaaaa tatagtttac aggactacgc gtttgaacac gttgtttatg 20520
gtagttttaa ccagaagatt attggaggtt tgcatttgct tattggctta gcccgtaggc 20580
agcaaaaatc caatctggta attcaagagt tcgtgacata cgactctagc attcattcgt 20640
actttatcac tgacgagaac agtggtagta gtaagagtgt gtgcactgtt attgatttat 20700
tgttagatga ttttgtggac attgtaaagt ccctgaatct aaagtgtgtg agtaaggttg 20760
ttaatgttaa tgtggatttt aaggacttcc agtttatgtt gtggtgcaat gaggagaagg 20820
tcatgacttt ctatcctcgt ttgcaggctg ctgctgactg gaaacctggt tatgttatgc 20880
ctgtcttata taagtatttg gaatcgcctc tggaaagagt aaacctctgg aattatggca 20940
agccgattac tttacctaca ggatgtatga tgaatgttgc taagtatact caattatgtc 21000
aatatttgag cactacaaca ttagcagttc cggctaatat gcgtgtctta caccttggtg 21060
ccggttcgga taagggtgtt gcccctgggt ctgcagttct taggcagtgg ctaccagcgg 21120
gaagtattct tgtagataat gatgtgaatc catttgtgag tgacagtgtc gcctcatatt 21180
atggaaattg tataacctta ccctttgatt gtcagtggga tctgataatt tctgatatgt 21240
acgaccctct tactaagaac attggggagt acaacgtgag taaagatgga ttctttactt 21300
acctctgtca tttaattcgt gacaagttgg ctctgggtgg cagtgttgcc ataaaaataa 21360
cagagttttc ttggaacgct gagttatata gtttaatggg gaagtttgcg ttctggacaa 21420
tcttttgcac caacgtaaac gcctcttcaa gtgaaggatt tttgattggc ataaattggt 21480
tgaataagac ccgtaccgaa attgacggta aaaccatgca tgccaattat ctgttttgga 21540
gaaatagtac aatgtggaat ggaggggctt acagtctctt tgacatgagt aagttccctt 21600
tgaaagcggc tggtacggct gttgttagcc ttaaaccaga ccaaataaat gacttagtcc 21660
tctccttgat tgagaagggc aagttattag tgcgtgatac acgcaaagaa gtttttgttg 21720
gcgatagcct agtaaatgtc aaataa 21746
<210> 16
<211> 9589
<212> DNA
<213> Artificial Sequence
<220>
<223> COVAX_SYNCoat56
<400> 16
atctatactt gtcgtggctg tgaaaatggc ctttgctgac aagcctaatc atttcataaa 60
ctttcccctg gcccaattta gtggctttat gggtaagtat ttaaagctac agtctcaact 120
tgtggaaatg ggtttagact gtaaattaca gaaggcacca catgttagta ttaccctgct 180
tgatattaaa gcagaccaat acaaacaggt ggaatttgca atacaagaaa taatagatga 240
tctggcggca tatgagggag atattgtctt tgacaaccct cacatgcttg gcagatgcct 300
tgttcttgat gttagaggat ttgaagagtt gcatgaagat attgttgaaa ttctccgcag 360
aaggggttgc acggcagatc aatccagaca ctggattccg cactgcactg tggcccaatt 420
tgacgaagaa agagaaacaa aaggaatgca attctatcat aaagaaccct tctacctcaa 480
gcataacaac ctattaacgg atgctgggct tgagctcgtg aagataggtt cttccaaaat 540
agatgggttt tattgtagtg aactgagtgt ttggtgtggt gagaggcttt gttataagcc 600
tccaacaccc aaattcagtg atatatttgg ctattgctgc atagataaaa tacgtggtga 660
tttagaaata ggcgacctgc cgcaggatga tgaggaagcg tgggccgagc taagttacca 720
ctatcaaaga aacacctact tcttcagaca tgtgcacgat aatagcatct attttcgtac 780
cgtgtgtaga atgaagggtt gtatgtgttg atttgttttt acactattag tgtaataagc 840
ttattatttt gttgaaaagg gcaggatgtg catagctatg gctcctcgca cactgctttt 900
gctgatttga tgtcagctgg tgtttgggtt caatgaacct cttaacatcg tttcacattt 960
aaatgatgac tggtttctat ttggtgacag tcggtccgac tgtacctatg tagaaaataa 1020
cggtcatcct aaattagatt ggcttgacct cgacccaaag ttgtgtaatt caggaaagat 1080
ttccgcaaag agtggtaact ctctctttag gagttttcac ttcactgatt tttacaatta 1140
tacgggtgag ggataccaaa ttgtatttta tgaaggagtt aattttagtc ccagccatgg 1200
ctttaaatgc ctggctcatg gagataataa aagatggatg ggcaataaag ctcgatttta 1260
tgcccgagtg tatgagaaga tggcccaata taggagccta tcgtttgtta atgtgtctta 1320
tgcctatgga ggtaatgcaa agcccgcctc catttgcaaa gacaatactt taacactcaa 1380
taaccccacc ttcatatcga aggagtctaa ttatgttgat tactactacg agagtgaggc 1440
taatttcaca ctagaaggtt gtgatgaatt tatagtaccg ctctgtggtt ttaatggcca 1500
ttccaagggc tcgtcgtcgg atgctgccaa taaatattat actgactctc agagttacta 1560
taatatggat attggtgtct tatatgggtt caattcgacc ttggatgttg gcaacactgc 1620
taaggatccg ggtcttgatc tcacttgtag gtatcttgca ttgactcctg gtaattataa 1680
ggctgtgtcc ttagaatatt tgttaagctt accctcaaag gctatttgcc tccataagac 1740
aaagcgcttt atgcctgtgc aggtagttga ctcaaggtgg agtagcatcc gccagtcaga 1800
caatatgacc gctgcagcct gtcagctgcc atattgtttc tttcgcaaca catctgcgaa 1860
ttatagtggt ggcacacatg atgcgcacca tggtgatttt catttcaggc agttattgtc 1920
tggtttgtta tataatgttt cctgtattgc ccagcagggt gcatttcttt ataataatgt 1980
gtcgtcctct tggccagcct atgggtacgg tcattgtcca acggcagcta acattggtta 2040
tatggcacct gtttgtatct atgaccctct cccggtcata ctgctaggtg tgttattggg 2100
tatagctgtg ttgactattg tgtttctgat gttttatttt atgacggata gcggtgttag 2160
attgcatgag gcataatcta aacatgctgt tcgtgtttat tctatttttg ccctcttgtt 2220
tagggtatat tggtgatttt agatgtatcc agcttgtgaa ttcaaacggt gctaatgtta 2280
gtgctccaag cattagcacc gagacggttg aagtttcaca aggcctgggg acatattatg 2340
tgttagatcg agtttattta aatgccacat tattgcttac tggttactac ccggtcgatg 2400
gttctaagtt tagaaacctc gctcttacgg gaactaactc agttagcttg tcgtggtttc 2460
aaccacccta tttaagtcag tttaatgatg gcatatttgc gaaggtgcag aaccttaaga 2520
caagtacgcc atcaggtgca actgcatatt ttcctactat agttataggt agtttgtttg 2580
gctatacttc ctataccgtt gtaatagagc catataatgg tgttataatg gcctcagtgt 2640
gccagtatac catttgtcag ttaccttaca ctgattgtaa gcctaacact aatggtaata 2700
aactgatagg gttttggcac acggatgtaa aacccccaat ttgtgtgtta aagcgaaatt 2760
tcacgcttaa tgttaatgct gatgcatttt attttcattt ctaccaacat ggtggtactt 2820
tttatgcgta ctatgcggat aaaccctccg ctactacgtt tttgtttagt gtatatatcg 2880
gcgatatttt aacacagtat tatgtgttac ctttcatctg caacccaaca gctggtagca 2940
cttttgctcc gcgctattgg gttacacctt tggttaagcg ccaatatttg tttaatttca 3000
accagaaggg tgtcattact agtgctgttg attgtgctag tagttatacc agtgaaataa 3060
aatgtaagac ccagagcatg ttacctagca ctggtgtcta tgagttatcc ggttatacgg 3120
tccaaccagt tggagttgta taccggcgtg ttgctaacct cccagcttgt aatatagagg 3180
agtggcttac tgctaggtca gtcccctccc ctctcaactg ggagcgtaag acttttcaga 3240
attgcaattt taacttaagc agcctgttac gttatgttca ggctgagagt ttgttttgta 3300
ataatatcga tgcttccaaa gtgtatggcc gctgctttgg tagtatttca gttgataagt 3360
ttgctgtacc ccgaagtagg caagttgatt tacagcttgg taactctgga tttctgcaga 3420
ctgctaatta taagattgat acagctgcca cttcgtgtca gctgcattac accttgccta 3480
agaataatgt caccataaac aaccataacc cctcgtcttg gaataggagg tatggcttta 3540
atgatgctgg cgtctttggc aaaaaccaac atgacgttgt ttacgctcag caatgtttta 3600
ctgtaagatc tagttattgc ccgtgtgctc aaccggacat agttagccct tgcactactc 3660
agactaagcc taagtctgct tttgttaatg tgggtgacca ttgtgaaggc ttaggtgttt 3720
tagaagataa ttgtggcaat gctgatccac ataagggttg tatctgtgcc aacaattcat 3780
ttattggatg gtcacatgat acctgccttg ttaatgatcg ctgccaaatt tttgctaata 3840
tattgctgaa tggcattaat agtggtacca catgttccac agatttgcag ttgcctaata 3900
ctgaagtggt tactggcatt tgtgtcaaat atgacctcta cggtattact ggacaaggtg 3960
tttttaaaga ggttaaggct gactattata atagctggca aacccttctg tatgatgtta 4020
atggtaattt gaatggtttt cgtgatctta ccactaacaa gacttatacg ataaggagct 4080
gttatagtgg ccgtgtttct gctgcatttc ataaagatgc acccgaaccg gctctgctct 4140
atcgtaatat aaattgtagc tatgttttta gcaataatat ctcccgtgag gagaacccac 4200
ttaattactt tgatagttat ctgggttgtg ttgttaatgc tgataaccgc acggatgagg 4260
cgcttcctaa ttgtgatctc cgtatgggtg ctggcttatg cgttgattat tcaaaatcac 4320
gcagggctca ccgatcagtt tctactggct atcggttaac tacatttgag ccatacactc 4380
cgatgttagt taatgatagt gtccaatccg ttgatggatt atatgagatg caaataccaa 4440
ccaattttac tattgggcac catgaggagt tcattcaaac tagatctcca aaggtgacta 4500
tagattgtgc tgcatttgtc tgtggtgata acactgcatg caggcagcag ttggttgagt 4560
atggctcttt ctgtgttaat gttaatgcca ttcttaatga ggttaataac ctcttggata 4620
atatgcaact acaagttgct agtgcattaa tgcagggtgt tactataagc tcgagactgc 4680
cagacggcat ctcaggccct atagatgaca ttaattttag tcctctactt ggatgcatag 4740
gttcaacatg tgccgaggac ggcaatggac ctagtgcaat ccgagggcgt tctgctatag 4800
aggatttgtt atttgacaag gtcaaattat ctgatgttgg ctttgtcgag gcttataata 4860
attgcaccgg tggtcaagaa gttcgtgacc tcctttgtgt acaatctttt aatggcatca 4920
aagtattacc tcctgtgttg tcagagagtc agatctctgg ctacacaacc ggtgctactg 4980
cggcagctat gttcccaccg tggtcagcag ctgccggtgt gccatttagt ttaagtgttc 5040
aatatagaat taatggttta ggtgtcacta tgaatgtgct tagtgagaac caaaagatga 5100
ttgctagtgc ttttaacaat gcgctgggtg ctatccagga tgggtttgat gcaaccaatt 5160
ctgctttagg taagatccag tccgttgtta atgcaaatgc tgaagcactc aataacttac 5220
taaatcaact ttctaacagg tttggtgcta ttagtgcttc tttacaagaa attctaactc 5280
ggcttgaggc tgtagaagca aaagcccaga tagatcgtct tattaatggc aggttaactg 5340
cacttaatgc gtatatatcc aagcaactta gtgatagtac gcttattaaa gttagtgctg 5400
ctcaggccat agaaaaggtc aatgagtgcg ttaagagcca aaccacgcgt attaatttct 5460
gtggcaatgg taatcatata ttatctcttg tccagaatgc gccttatggc ttatatttta 5520
tacacttcag ctatgtgcca atatccttta caaccgcaaa tgtgagtcct ggactttgca 5580
tttctggtga tagaggatta gcacctaaag ctggatattt tgttcaagat gatggagaat 5640
ggaagttcac aggcagttca tattactacc ctgaacccat tacagataaa aacagtgtca 5700
ttatgagtag ttgcgcagta aactacacaa aggcacctga agttttcttg aacacttcaa 5760
tacctaatcc acccgacttt aaggaggagt tagataaatg gtttaagaat cagacgtcta 5820
ttgcgcctga tttatctctc gatttcgaga agttaaatgt tactttgctg gacctgacgt 5880
atgagatgaa caggattcag gatgcaatta agaagttaaa tgagagctac atcaacctca 5940
aggaagttgg cacatatgaa atgtatgtga aatggccttg gtatgtttgg ttgctaattg 6000
gattagctgg tgtagctgtt tgtgtgttgt tattctttat atgttgctgc acaggttgtg 6060
gctcatgttg ttttaagaag tgtggaaatt gttgtgatga gtatggagga caccaggaca 6120
gtattgtgat acataatatt tcctctcatg aggattgact atcacagcct ctcctggaaa 6180
gacagaaaat ctaaacaatt tatagcattc tcattgctac ctggccccgt aagaggcagt 6240
catagctatg gccgtgttgg tcctaaggct acattggctg ctgtctttat tggtccattt 6300
attgtagcat gtatgctagg cattggccta gtttatttat tgcaattgca agttcaaatt 6360
tttcatgtta aggataccat acgtgtgact ggcaagccag ccactgtgtc ttatactaca 6420
agtacaccag taacaccgag cgcgacgacg ctcgatggta ctacgtatac tttaattaga 6480
cccactagct cttatacaag agtttatctt ggtactccaa gaggttttga ttatagtaca 6540
tttgggccta agaccctaga ttatgttact aatctaaacc tcatcttaat tctggtcgtc 6600
catatacttt taaggcattg tccaggcata tgaggccaac agccacatgg atttggcatg 6660
tgagtgatgc atggttacgc cgcacgcggg actttggtgt cattcgccta gaagattttt 6720
gttttcaatt taattatagc caaccccgag ttggttattg tagagttcct ttaaaggctt 6780
ggtgtagcaa ccagggtaaa tttgcagcgc agtttaccct aaaaagttgc gaaaaaccag 6840
gtcacgaaaa atttattact agcttcacgg cctacggcag aactgtccaa caggccgtta 6900
gcaagttagt agaagaagct gttgatttta ttctttttag ggccacgcag ctcgaaagaa 6960
atgtttaatt tattccttac agacacagta tggtatgtgg ggcagattat ttttatattc 7020
gcagtgtgtt tgatggtcac cataattgtg gttgccttcc ttgcgtctat caaactttgt 7080
attcaacttt gcggtttatg taatactttg gtgctgtccc cttctattta tttgtatgat 7140
aggagtaagc agctttataa gtactataat gaagaaatga gactgcccct attagaggtg 7200
gatgatatct aatccaaaca ttatgagtag tactactcag gccccagagc ccgtctatca 7260
atggaccgcc gacgaggcag ttcaattcct taaggaatgg aacttctcgt tgggcattat 7320
actactcttt attactatca tactacagtt cggttacacg agccgtagca tgtttattta 7380
tgttgtgaaa atgataatct tgtggttaat gtggccactg actattgttt tgtgtatttt 7440
caattgcgtg tatgcgctaa ataatgtgta tcttggattt tctatagtgt ttactatagt 7500
gtccattgta atctggatca tgtattttgt gaacagcata aggttgttta tcaggactgg 7560
tagctggtgg agcttcaacc ccgaaacaaa caaccttatg tgtatagata tgaaaggtac 7620
cgtgtatgtt agacccatta ttgaggatta ccatacacta acagccacta ttattcgtgg 7680
ccacctctac atgcaaggtg ttaagctagg caccggtttc tctttgtctg acttgcccgc 7740
ttatgttaca gttgctaagg tgtcacacct ttgcacttat aagcgcgcat tcttagacaa 7800
ggtagacggt gttagcggtt ttgctgttta tgtgaagtcc aaggtcggaa attaccgact 7860
gccctcaaac aaaccgagtg gcgcggacac cgcattgttg agaacctaat ctaaacttta 7920
aggatgtctt ttgttcctgg gcaagaaaat gccggtggca gaagctcctc tgtaaaccgc 7980
gctggtaatg gaatcctcaa gaaaaccact tgggctgacc aaaccgagcg tggaccaaat 8040
aatcaaaata gaggcagaag gaatcagcca aagcagactg caactactca acccaactcc 8100
gggagtgtgg ttccccatta ctcctggttt tctggcatta cccagttcca aaagggaaag 8160
gagtttcagt ttgcagaagg acaaggagtg cctattgcca atggaatccc cgcttcagag 8220
caaaagggat attggtatag acacaaccgc cgttctttta aaacacctga tgggcagcag 8280
aagcaattac tgcccagatg gtatttttac tatcttggca cagggcccca tgctggagcc 8340
agttatggag acagcattga aggcgtcttt tgggttgcaa acagccaagc ggacaccaat 8400
acccgctctg atattgtcga aagggaccca agcagtcatg aggctattcc tactaggttt 8460
gcgcccggca cggtattgcc tcagggcttt tatgttgaag gctctggaag gtctgccccg 8520
gccagccgat ctggttcgcg gtcacaatcc cgtgggccaa ataatcgcgc tagaagcagt 8580
tccaaccagc gccagcctgc ctctactgta aaacctgata tggccgaaga aattgctgct 8640
cttgttttgg ctaagctcgg taaagatgcc ggccagccca agcaagtaac gaagcaaagt 8700
gccaaagaag tcaggcagaa aattttaaac aagcctcgcc aaaagaggac tccaaacaag 8760
cagtgcccag tgcagcagtg ttttggaaag agaggcccca atcagaattt tggaggctct 8820
gaaatgttaa aacttggaac tagtgatcca cagttcccca ttcttgcaga gttggctcca 8880
acagttggtg ccttcttctt tggatctaaa ttagaattgg tcaaaaagaa ttctggtggt 8940
gctgatgaac ccaccaaaga tgtgtatgag ctgcaatatt caggtgcagt tagatttgat 9000
agtactctac ctggttttga gactatcatg aaagtgttga atgagaattt gaatgcctac 9060
cagaaggatg gtggtgcaga tgtggtgagc ccaaagcccc aaagaaaagg gcgtagacag 9120
gctcaggaaa agaaagatga agtagataat gtaagcgttg caaagcccaa aagctctgtg 9180
cagcgaaatg taagtagaga attaacccca gaggatagaa gtctgttggc tcagatcctt 9240
gatgatggcg tagtgccaga tgggttagaa gatgactcta atgtgtaaag agaatgaatc 9300
ctatgtcggc gctcggtggt aacccctcgc gagaaagtcg ggataggaca ctctctatca 9360
gaatggatgt cttgctgtca taacagatag agaaggttgt ggcagaccct gtatcaatta 9420
gttgaaagag attgcaaaat agagaatgtg tgagagaagt tagcaaggtc ctacgtctaa 9480
ccataagaac ggcgataggc gccccctggg aacagctcac atcagggtac tattcctgca 9540
atgccctagt aaatgaatga agttgatcat ggccaattgg aagaatcac 9589
<210> 17
<211> 3822
<212> DNA
<213> Artificial Sequence
<220>
<223> COVAX-S19-1
<400> 17
atgtttgttt ttcttgtttt attgccacta gtctctagtc agtgtgttaa tcttacaacc 60
agaactcaat taccccctgc atacactaat tctttcacac gtggtgttta ttaccctgac 120
aaagttttca gatcctcagt tttacattca actcaggact tgttcttacc tttcttttcc 180
aatgttactt ggttccatgc tatacatgtc tctgggacca atggtactaa gaggtttgat 240
aaccctgtcc taccatttaa tgatggtgtt tactttgctt ccactgagaa gtctaacata 300
ataagaggct ggatttttgg tactacttta gattcgaaaa cccagtccct acttattgtt 360
aataacgcta ctaatgttgt tatcaaagtc tgtgaatttc aattttgtaa cgatccattt 420
ttgggtgttt attaccacaa aaacaacaaa agttggatgg aaagtgagtt cagagtttat 480
tctagtgcga ataattgcac ttttgaatac gtctctcagc cttttcttat ggaccttgaa 540
ggaaaacagg gtaatttcaa aaatcttagg gaatttgtgt tcaagaatat tgatggttac 600
ttcaagatat actctaagca cacgcctatt aatttagtgc gtgatctccc tcagggtttt 660
tcggctttag aaccattggt agatttgcca ataggtatta acatcactag gtttcaaact 720
ttacttgctt tacatagaag ttatttaact cctggtgatt cttcttcagg ttggacagct 780
ggtgctgcag cttattatgt gggttatctt caacctagga cttttctact gaagtacaat 840
gaaaatggaa ccattacaga tgctgtagac tgtgcacttg accctctctc agaaacaaag 900
tgtacgttga aatccttcac tgtagaaaaa ggaatctatc aaacttctaa ctttagagtc 960
caaccaacag aatctattgt tagatttcct aacatcacaa acttgtgccc ttttggtgaa 1020
gtttttaacg ccaccagatt tgcatctgtt tatgcttgga acaggaagag aatcagcaac 1080
tgtgttgctg attattctgt cctgtataat tccgcatcat tttccacttt taagtgttat 1140
ggagtgtctc ctactaaatt aaatgatctc tgctttacta atgtctatgc agattcattt 1200
gtaattagag gtgatgaagt cagacaaatc gctccagggc aaactggaaa gattgctgat 1260
tataactaca aattaccaga tgattttaca ggctgcgtta tagcttggaa ttctaacaat 1320
cttgattcta aggttggtgg taattataat tacctgtaca gattgtttag gaagtctaat 1380
ctcaaacctt ttgagagaga tatttcaact gaaatctatc aggccggtag cacaccttgt 1440
aatggtgttg aaggttttaa ttgttacttt cctctgcaat catatggttt ccaacccact 1500
aatggtgttg gttaccaacc atacagagta gtagtacttt cttttgaact tctacatgca 1560
ccagcaactg tttgtggacc taaaaagtct actaatttgg ttaagaacaa gtgtgtcaat 1620
ttcaacttca atggtttaac aggcacaggt gttcttactg agtctaacaa aaagtttctg 1680
cctttccaac aatttggcag agacattgct gacactactg atgctgttcg tgatccacaa 1740
acacttgaga ttcttgacat tacaccatgt tcttttggtg gtgtcagtgt tataacacca 1800
ggaacaaata cttctaacca ggttgctgtt ctttatcagg atgttaactg cacagaagtc 1860
cctgttgcta ttcatgcaga tcaacttact cctacttggc gtgtttattc tacaggttct 1920
aatgtttttc aaacacgtgc aggctgttta ataggggctg aacatgtcaa caactcatat 1980
gagtgtgaca tacccattgg tgcaggtata tgcgctagtt atcagactca gactaattct 2040
cctcggagag caagaagtgt agctagtcaa tccatcattg cctacactat gtcacttggt 2100
gcagaaaatt cagttgctta ctctaataac tctattgcca tacccacaaa ttttactatt 2160
agcgttacca cagaaattct accagtgtct atgaccaaga catcagtaga ttgtacaatg 2220
tacatttgtg gtgattcaac tgaatgcagc aatcttttgt tgcaatatgg cagtttttgt 2280
acacaattaa accgtgcttt aactggaata gctgttgaac aagacaaaaa cacccaagaa 2340
gtttttgcac aagtcaaaca aatttacaag acaccaccaa ttaaagattt tggcggtttt 2400
aattttagcc agatactgcc agatccatca aaaccaagca agaggtcatt tattgaagat 2460
ctactgttca acaaagtgac acttgcagat gctggcttca tcaaacaata tggtgattgc 2520
cttggtgata ttgctgctag agacctcatt tgtgcacaaa agtttaacgg ccttactgtt 2580
ttgccacctt tgctcacaga tgaaatgatt gctcaataca cttctgcact gttagcaggt 2640
acaatcactt ctggttggac ttttggtgca ggtgctgcat tacaaatacc atttgctatg 2700
caaatggctt ataggtttaa tggtattgga gttacacaga atgttctcta tgagaaccaa 2760
aaattgattg ccaaccaatt taatagtgct attggcaaaa ttcaagactc actttcttcc 2820
acagcaagtg cacttggaaa acttcaagat gtggtcaacc aaaatgcaca agctttaaac 2880
acgcttgtta aacaacttag ctccaatttt ggtgcaattt caagtgtttt aaacgacatc 2940
ctttcacgtc ttgacaaagt tgaggctgaa gtgcaaattg ataggttgat cacaggcaga 3000
cttcaaagtt tgcagacata tgtgactcaa caattaatta gagctgcaga aatcagagct 3060
tctgctaatc ttgctgctac taaaatgtca gagtgtgtac ttggacaatc aaaaagagtt 3120
gacttttgcg gaaagggcta tcatcttatg tcatttcctc agtcagcacc tcatggtgtc 3180
gtctttttgc atgtgactta tgtccctgca caagaaaaga acttcacaac tgctcctgcc 3240
atttgtcatg atggaaaagc acactttcct cgtgaaggtg tctttgtttc aaatggcaca 3300
cactggtttg taacacaaag gaatttttat gaaccacaaa tcattactac agacaacaca 3360
tttgtgtctg gtaactgtga tgttgtaata ggaattgtca acaacacagt ttatgatcct 3420
ttgcaacctg aattagactc attcaaggag gagcttgata aatacttcaa gaaccatacc 3480
tcaccagatg ttgatttagg tgacatctct ggcattaatg cttcagttgt aaacattcag 3540
aaagaaatcg accgcctcaa tgaggttgcc aagaatttaa atgaatctct catcgatctc 3600
caagaacttg gaaagtatga gcagtatata aaatggccat ggtacatttg gctaggtttt 3660
atagctggct tgattgccat agtaatggtg acaattatgc tttgctgtat gaccagttgc 3720
tgtagttgtc tcaagggctg ttgttcttgt ggatcctgct gcaaatttga cgaggacgac 3780
tctgagccag tgctcaaagg agtcaaatta cattacacat aa 3822
<210> 18
<211> 1273
<212> PRT
<213> Artificial Sequence
<220>
<223> S-Protein_Sars-CoV2
<400> 18
Met Phe Val Phe Leu Val Leu Leu Pro Leu Val Ser Ser Gln Cys Val
1 5 10 15
Asn Leu Thr Thr Arg Thr Gln Leu Pro Pro Ala Tyr Thr Asn Ser Phe
20 25 30
Thr Arg Gly Val Tyr Tyr Pro Asp Lys Val Phe Arg Ser Ser Val Leu
35 40 45
His Ser Thr Gln Asp Leu Phe Leu Pro Phe Phe Ser Asn Val Thr Trp
50 55 60
Phe His Ala Ile His Val Ser Gly Thr Asn Gly Thr Lys Arg Phe Asp
65 70 75 80
Asn Pro Val Leu Pro Phe Asn Asp Gly Val Tyr Phe Ala Ser Thr Glu
85 90 95
Lys Ser Asn Ile Ile Arg Gly Trp Ile Phe Gly Thr Thr Leu Asp Ser
100 105 110
Lys Thr Gln Ser Leu Leu Ile Val Asn Asn Ala Thr Asn Val Val Ile
115 120 125
Lys Val Cys Glu Phe Gln Phe Cys Asn Asp Pro Phe Leu Gly Val Tyr
130 135 140
Tyr His Lys Asn Asn Lys Ser Trp Met Glu Ser Glu Phe Arg Val Tyr
145 150 155 160
Ser Ser Ala Asn Asn Cys Thr Phe Glu Tyr Val Ser Gln Pro Phe Leu
165 170 175
Met Asp Leu Glu Gly Lys Gln Gly Asn Phe Lys Asn Leu Arg Glu Phe
180 185 190
Val Phe Lys Asn Ile Asp Gly Tyr Phe Lys Ile Tyr Ser Lys His Thr
195 200 205
Pro Ile Asn Leu Val Arg Asp Leu Pro Gln Gly Phe Ser Ala Leu Glu
210 215 220
Pro Leu Val Asp Leu Pro Ile Gly Ile Asn Ile Thr Arg Phe Gln Thr
225 230 235 240
Leu Leu Ala Leu His Arg Ser Tyr Leu Thr Pro Gly Asp Ser Ser Ser
245 250 255
Gly Trp Thr Ala Gly Ala Ala Ala Tyr Tyr Val Gly Tyr Leu Gln Pro
260 265 270
Arg Thr Phe Leu Leu Lys Tyr Asn Glu Asn Gly Thr Ile Thr Asp Ala
275 280 285
Val Asp Cys Ala Leu Asp Pro Leu Ser Glu Thr Lys Cys Thr Leu Lys
290 295 300
Ser Phe Thr Val Glu Lys Gly Ile Tyr Gln Thr Ser Asn Phe Arg Val
305 310 315 320
Gln Pro Thr Glu Ser Ile Val Arg Phe Pro Asn Ile Thr Asn Leu Cys
325 330 335
Pro Phe Gly Glu Val Phe Asn Ala Thr Arg Phe Ala Ser Val Tyr Ala
340 345 350
Trp Asn Arg Lys Arg Ile Ser Asn Cys Val Ala Asp Tyr Ser Val Leu
355 360 365
Tyr Asn Ser Ala Ser Phe Ser Thr Phe Lys Cys Tyr Gly Val Ser Pro
370 375 380
Thr Lys Leu Asn Asp Leu Cys Phe Thr Asn Val Tyr Ala Asp Ser Phe
385 390 395 400
Val Ile Arg Gly Asp Glu Val Arg Gln Ile Ala Pro Gly Gln Thr Gly
405 410 415
Lys Ile Ala Asp Tyr Asn Tyr Lys Leu Pro Asp Asp Phe Thr Gly Cys
420 425 430
Val Ile Ala Trp Asn Ser Asn Asn Leu Asp Ser Lys Val Gly Gly Asn
435 440 445
Tyr Asn Tyr Leu Tyr Arg Leu Phe Arg Lys Ser Asn Leu Lys Pro Phe
450 455 460
Glu Arg Asp Ile Ser Thr Glu Ile Tyr Gln Ala Gly Ser Thr Pro Cys
465 470 475 480
Asn Gly Val Glu Gly Phe Asn Cys Tyr Phe Pro Leu Gln Ser Tyr Gly
485 490 495
Phe Gln Pro Thr Asn Gly Val Gly Tyr Gln Pro Tyr Arg Val Val Val
500 505 510
Leu Ser Phe Glu Leu Leu His Ala Pro Ala Thr Val Cys Gly Pro Lys
515 520 525
Lys Ser Thr Asn Leu Val Lys Asn Lys Cys Val Asn Phe Asn Phe Asn
530 535 540
Gly Leu Thr Gly Thr Gly Val Leu Thr Glu Ser Asn Lys Lys Phe Leu
545 550 555 560
Pro Phe Gln Gln Phe Gly Arg Asp Ile Ala Asp Thr Thr Asp Ala Val
565 570 575
Arg Asp Pro Gln Thr Leu Glu Ile Leu Asp Ile Thr Pro Cys Ser Phe
580 585 590
Gly Gly Val Ser Val Ile Thr Pro Gly Thr Asn Thr Ser Asn Gln Val
595 600 605
Ala Val Leu Tyr Gln Asp Val Asn Cys Thr Glu Val Pro Val Ala Ile
610 615 620
His Ala Asp Gln Leu Thr Pro Thr Trp Arg Val Tyr Ser Thr Gly Ser
625 630 635 640
Asn Val Phe Gln Thr Arg Ala Gly Cys Leu Ile Gly Ala Glu His Val
645 650 655
Asn Asn Ser Tyr Glu Cys Asp Ile Pro Ile Gly Ala Gly Ile Cys Ala
660 665 670
Ser Tyr Gln Thr Gln Thr Asn Ser Pro Arg Arg Ala Arg Ser Val Ala
675 680 685
Ser Gln Ser Ile Ile Ala Tyr Thr Met Ser Leu Gly Ala Glu Asn Ser
690 695 700
Val Ala Tyr Ser Asn Asn Ser Ile Ala Ile Pro Thr Asn Phe Thr Ile
705 710 715 720
Ser Val Thr Thr Glu Ile Leu Pro Val Ser Met Thr Lys Thr Ser Val
725 730 735
Asp Cys Thr Met Tyr Ile Cys Gly Asp Ser Thr Glu Cys Ser Asn Leu
740 745 750
Leu Leu Gln Tyr Gly Ser Phe Cys Thr Gln Leu Asn Arg Ala Leu Thr
755 760 765
Gly Ile Ala Val Glu Gln Asp Lys Asn Thr Gln Glu Val Phe Ala Gln
770 775 780
Val Lys Gln Ile Tyr Lys Thr Pro Pro Ile Lys Asp Phe Gly Gly Phe
785 790 795 800
Asn Phe Ser Gln Ile Leu Pro Asp Pro Ser Lys Pro Ser Lys Arg Ser
805 810 815
Phe Ile Glu Asp Leu Leu Phe Asn Lys Val Thr Leu Ala Asp Ala Gly
820 825 830
Phe Ile Lys Gln Tyr Gly Asp Cys Leu Gly Asp Ile Ala Ala Arg Asp
835 840 845
Leu Ile Cys Ala Gln Lys Phe Asn Gly Leu Thr Val Leu Pro Pro Leu
850 855 860
Leu Thr Asp Glu Met Ile Ala Gln Tyr Thr Ser Ala Leu Leu Ala Gly
865 870 875 880
Thr Ile Thr Ser Gly Trp Thr Phe Gly Ala Gly Ala Ala Leu Gln Ile
885 890 895
Pro Phe Ala Met Gln Met Ala Tyr Arg Phe Asn Gly Ile Gly Val Thr
900 905 910
Gln Asn Val Leu Tyr Glu Asn Gln Lys Leu Ile Ala Asn Gln Phe Asn
915 920 925
Ser Ala Ile Gly Lys Ile Gln Asp Ser Leu Ser Ser Thr Ala Ser Ala
930 935 940
Leu Gly Lys Leu Gln Asp Val Val Asn Gln Asn Ala Gln Ala Leu Asn
945 950 955 960
Thr Leu Val Lys Gln Leu Ser Ser Asn Phe Gly Ala Ile Ser Ser Val
965 970 975
Leu Asn Asp Ile Leu Ser Arg Leu Asp Lys Val Glu Ala Glu Val Gln
980 985 990
Ile Asp Arg Leu Ile Thr Gly Arg Leu Gln Ser Leu Gln Thr Tyr Val
995 1000 1005
Thr Gln Gln Leu Ile Arg Ala Ala Glu Ile Arg Ala Ser Ala Asn Leu
1010 1015 1020
Ala Ala Thr Lys Met Ser Glu Cys Val Leu Gly Gln Ser Lys Arg Val
1025 1030 1035 1040
Asp Phe Cys Gly Lys Gly Tyr His Leu Met Ser Phe Pro Gln Ser Ala
1045 1050 1055
Pro His Gly Val Val Phe Leu His Val Thr Tyr Val Pro Ala Gln Glu
1060 1065 1070
Lys Asn Phe Thr Thr Ala Pro Ala Ile Cys His Asp Gly Lys Ala His
1075 1080 1085
Phe Pro Arg Glu Gly Val Phe Val Ser Asn Gly Thr His Trp Phe Val
1090 1095 1100
Thr Gln Arg Asn Phe Tyr Glu Pro Gln Ile Ile Thr Thr Asp Asn Thr
1105 1110 1115 1120
Phe Val Ser Gly Asn Cys Asp Val Val Ile Gly Ile Val Asn Asn Thr
1125 1130 1135
Val Tyr Asp Pro Leu Gln Pro Glu Leu Asp Ser Phe Lys Glu Glu Leu
1140 1145 1150
Asp Lys Tyr Phe Lys Asn His Thr Ser Pro Asp Val Asp Leu Gly Asp
1155 1160 1165
Ile Ser Gly Ile Asn Ala Ser Val Val Asn Ile Gln Lys Glu Ile Asp
1170 1175 1180
Arg Leu Asn Glu Val Ala Lys Asn Leu Asn Glu Ser Leu Ile Asp Leu
1185 1190 1195 1200
Gln Glu Leu Gly Lys Tyr Glu Gln Tyr Ile Lys Trp Pro Trp Tyr Ile
1205 1210 1215
Trp Leu Gly Phe Ile Ala Gly Leu Ile Ala Ile Val Met Val Thr Ile
1220 1225 1230
Met Leu Cys Cys Met Thr Ser Cys Cys Ser Cys Leu Lys Gly Cys Cys
1235 1240 1245
Ser Cys Gly Ser Cys Cys Lys Phe Asp Glu Asp Asp Ser Glu Pro Val
1250 1255 1260
Leu Lys Gly Val Lys Leu His Tyr Thr
1265 1270
<210> 19
<211> 4486
<212> DNA
<213> Artificial Sequence
<220>
<223> COVAX-S19-2
<400> 19
acgaacttat ggatttgttt atgagaatct tcacaattgg aactgtaact ttgaagcaag 60
gtgaaatcaa ggatgctact ccttcagatt ttgttagagc tactgcaacg ataccgatac 120
aagcatcact tcctttcgga tggcttattg ttggcgttgc acttcttgct gtttttcaga 180
gcgcttccaa aatcataacc ctcaaaaaga gatggcaact agcactctcc aagggtgttc 240
actttgtttg caacttgctg ttgttgtttg taacagttta ctcacatctt ttgcttgttg 300
ctgctggcct tgaagcccct tttctctatc tttatgcttt agtctacttc ttgcagagta 360
taaactttgt acgcataata atgaggcttt ggctttgctg gaaatgccgt tccaaaaacc 420
cattacttta tgatgccaac tattttcttt gctggcatac taattgttac gactattgta 480
taccttacaa tagtgtaact tcttcaattg tcattacttc aggtgatggc acaacaagtc 540
ctatttctga acatgactac cagattggtg gttatactga aaaatgggaa tctggagtaa 600
aagactgtgt tgtattacac agttacttca cttcagacta ttaccagctg tactcaactc 660
aattgagtac agacactggt gttgaacatg ttaccttctt catctacaat aaaatcgttg 720
atgagcctga agaacatgtc caaattcaca caatcgacgg ttcatccgga gttgttaatc 780
cagtaatgga accaatttat gatgaaccga cgacgactac tagcgtgcct ttgtaagcac 840
aagctgatga gtacgaactt atgtactcat tcgtttcgga agagacaggt acgttaatag 900
ttaatagcgt acttcttttt cttgctttcg tggtattctt gctagttaca ctagccattc 960
ttactgcgct tcgattgtgt gcgtactgtt gcaatattgt taacgtgagt cttgtaaaac 1020
cttcttttta cgtttactct cgtgttaaaa atctgaattc ttctcgggtt cctgatcttc 1080
tggtctaaac gaactaaata ttatattagt ttttctgttt ggaactttaa ttttagccat 1140
ggcagattcc aacggtacta ttaccgttga ggagctgaaa aagctccttg aacaatggaa 1200
cctagtaata ggtttcctat tccttacatg gatttgcctg ctgcaatttg cctatgccaa 1260
caggaatagg tttttgtaca tcattaagtt gattttcctc tggctgttat ggccagtaac 1320
tttagcttgt tttgtgcttg ctgctgttta cagaataaat tggatcaccg gtggaattgc 1380
tattgcaatg gcttgtcttg taggattgat gtggctaagc tacttcattg cttctttcag 1440
actgtttgcg cgtacgcgtt ccatgtggtc attcaatcca gaaactaaca ttcttctcaa 1500
cgtgccactc catggaacta ttctgactag accgcttcta gaaagtgaac tcgtaatcgg 1560
agctgttatc cttcgtggac atcttcgtat tgctggacat catctaggac gctgtgacat 1620
caaggatcta cctaaagaaa tcactgttgc tacatcacga acgctttctt attacaaatt 1680
gggagcttca cagcgtgtag caggtgattc aggttttgct gcatatagtc gctacaggat 1740
tggcaactat aaattaaaca cagaccattc cagtagcagt gacaatattg ctttgcttgt 1800
acagtaagtg acaacagatg tttcatctcg ttgactttca ggttactata gcagagatat 1860
tactaatcat catgaggact tttaaagttt ccatttggaa tcttgattac atcataaacc 1920
tcataattaa gaacttaagc aagtcactaa ctgagaataa atattctcaa ctagacgagg 1980
agcagccaat ggagattgat taaacgaaca tgaaaattat tcttttcttg gcactgataa 2040
cactcgctac ttgtgagctt tatcactacc aagagtgtgt tagaggtaca acagtacttt 2100
taaaagaacc ttgctcgtcg ggaacatacg agggcaattc accatttcat cctctagctg 2160
ataacaaatt tgcactgact tgctttagca ctcaatttgc ttttgcttgt cctgacggcg 2220
taaaacacgt ctatcagtta cgtgccagat cagtttcacc taaactgttc atcagacaag 2280
aggaagttca agaactttac tctccaattt ttcttattgt tgcggcaata gtgtttataa 2340
cactttgctt cacactcaaa agaaagacag aatgattgaa ctttcattaa ttgacttcta 2400
tttgtgcttt ttagcctttc tgctattcct tgttttaatt atgcttatta tcttttggtt 2460
ctcacttgaa ctgcaagatc ataatgaaac ttgtcacgcc taaacgaaca tgaaatttct 2520
tgttttctta ggaatcatca caactgtagc tgcatttcac caagaatgta gtttacagtc 2580
atgtactcaa catcaaccat atgtagttga tgacccgtgt cctattcact tctattctaa 2640
atggtatatc agagtaggag ctagaaaatc agcaccttta attgaattgt gcgtggatga 2700
ggctggttct aaatcaccca ttcagtacat cgatatcggt aattatacag tttcctgttt 2760
accttttaca attaactgcc aggaacctaa attgggtagt cttgtagtgc gttgttcgtt 2820
ctacgaggac tttttagagt atcatgacgt tcgtgttgtt ttagatttca tctaaacgaa 2880
caaactaaaa tgtctgataa tggacctcaa aatcagcgaa atgcacctcg cattacgttt 2940
ggtggaccat cagattcaac tggcagtaac cagaatggag aacgaagtgg tgcgcgatca 3000
aaacaacgcc gcccgcaagg tttacccaat aatactgcgt cttggttcac cgctctcact 3060
caacatggca aggaagattt aaaattccct cgaggacaag gcgttccaat taacaccaat 3120
agcagtccag atgaccaaat tggctactac cgccgcgcca caagacgaat tcgtggtggt 3180
gatggtaaaa tgaaagatct cagtccaaga tggtatttct actatctagg aactgggcca 3240
gaagctggac ttccttatgg tgctaacaaa gatggcatca tatgggttgc aactgaggga 3300
gccttgaata caccaaaaga tcacattggc accagaaatc ctgctaacaa tgctgcaatc 3360
gtgctacaac ttcctcaagg aacaacatta ccaaaaggtt tttacgcaga agggtctaga 3420
ggtggaagtc aagcctcttc tagatcatca tcacgtagtc gcaacagttc aagaaattca 3480
actccaggtt caagtagagg aacttctcct gctagaatgg ctggaaatgg aggtgatgct 3540
gctcttgctt tgttactact tgacagattg aaccagcttg agagcaaaat gtctggtaaa 3600
ggccaacaac aacaaggcca aactgtcact aagaaatctg ctgctgaggc ttctaagaag 3660
cctagacaaa aacgtactgc cactaaagca tacaatgtaa cacaagcttt cggcagacgt 3720
ggtccagaac aaactcaagg aaattttggg gatcaggaac taatcagaca aggaactgat 3780
tacaaacatt ggccgcaaat tgcacaattt gctccttctg cttcagcgtt ctttggaatg 3840
tcgagaattg gaatggaagt cacaccttcg ggaacatggt tgacctatac aggtgccatc 3900
aaattggatg acaaagatcc aaatttcaaa gatcaagtca ttttgctgaa taagcatatt 3960
gacgcataca aaacattccc accaacagag cctaaaaagg acaaaaagaa gaaggctgat 4020
gaaactcaag ccttaccgca gagacagaag aaacagcaaa ctgtgactct tcttcctgct 4080
gcagatttgg atgatttctc caaacaattg caacaatcca tgagcagtgc tgactcaact 4140
caggcctaaa ctcatgcaga ccacacaagg cagatgggct atataaacgt tttcgctttt 4200
ccgtttacga tatatagtct actcttgtgc agaatgaatt ctcgtaacta catagcacaa 4260
gtagatgtag ttaactttaa tctcacatag caatctttaa tcagtgtgta acattaggga 4320
ggacttgaaa gagccaccac attttcaccg aggccacgcg gagtacgatc gagtgtacag 4380
tgaacaatgc tagggagagc tgcctatatg gatgagccct aatgtgtaaa attaatttta 4440
gtagtgctat ccccatgtga ttttaatagc ttcttaggag aatgac 4486
<210> 20
<211> 275
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic_ORF3a_Protein
<400> 20
Met Asp Leu Phe Met Arg Ile Phe Thr Ile Gly Thr Val Thr Leu Lys
1 5 10 15
Gln Gly Glu Ile Lys Asp Ala Thr Pro Ser Asp Phe Val Arg Ala Thr
20 25 30
Ala Thr Ile Pro Ile Gln Ala Ser Leu Pro Phe Gly Trp Leu Ile Val
35 40 45
Gly Val Ala Leu Leu Ala Val Phe Gln Ser Ala Ser Lys Ile Ile Thr
50 55 60
Leu Lys Lys Arg Trp Gln Leu Ala Leu Ser Lys Gly Val His Phe Val
65 70 75 80
Cys Asn Leu Leu Leu Leu Phe Val Thr Val Tyr Ser His Leu Leu Leu
85 90 95
Val Ala Ala Gly Leu Glu Ala Pro Phe Leu Tyr Leu Tyr Ala Leu Val
100 105 110
Tyr Phe Leu Gln Ser Ile Asn Phe Val Arg Ile Ile Met Arg Leu Trp
115 120 125
Leu Cys Trp Lys Cys Arg Ser Lys Asn Pro Leu Leu Tyr Asp Ala Asn
130 135 140
Tyr Phe Leu Cys Trp His Thr Asn Cys Tyr Asp Tyr Cys Ile Pro Tyr
145 150 155 160
Asn Ser Val Thr Ser Ser Ile Val Ile Thr Ser Gly Asp Gly Thr Thr
165 170 175
Ser Pro Ile Ser Glu His Asp Tyr Gln Ile Gly Gly Tyr Thr Glu Lys
180 185 190
Trp Glu Ser Gly Val Lys Asp Cys Val Val Leu His Ser Tyr Phe Thr
195 200 205
Ser Asp Tyr Tyr Gln Leu Tyr Ser Thr Gln Leu Ser Thr Asp Thr Gly
210 215 220
Val Glu His Val Thr Phe Phe Ile Tyr Asn Lys Ile Val Asp Glu Pro
225 230 235 240
Glu Glu His Val Gln Ile His Thr Ile Asp Gly Ser Ser Gly Val Val
245 250 255
Asn Pro Val Met Glu Pro Ile Tyr Asp Glu Pro Thr Thr Thr Thr Ser
260 265 270
Val Pro Leu
275
<210> 21
<211> 75
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic_Structural_Protein_E
<400> 21
Met Tyr Ser Phe Val Ser Glu Glu Thr Gly Thr Leu Ile Val Asn Ser
1 5 10 15
Val Leu Leu Phe Leu Ala Phe Val Val Phe Leu Leu Val Thr Leu Ala
20 25 30
Ile Leu Thr Ala Leu Arg Leu Cys Ala Tyr Cys Cys Asn Ile Val Asn
35 40 45
Val Ser Leu Val Lys Pro Ser Phe Tyr Val Tyr Ser Arg Val Lys Asn
50 55 60
Leu Asn Ser Ser Arg Val Pro Asp Leu Leu Val
65 70 75
<210> 22
<211> 222
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic_Membrane_Glycoprotein_M
<400> 22
Met Ala Asp Ser Asn Gly Thr Ile Thr Val Glu Glu Leu Lys Lys Leu
1 5 10 15
Leu Glu Gln Trp Asn Leu Val Ile Gly Phe Leu Phe Leu Thr Trp Ile
20 25 30
Cys Leu Leu Gln Phe Ala Tyr Ala Asn Arg Asn Arg Phe Leu Tyr Ile
35 40 45
Ile Lys Leu Ile Phe Leu Trp Leu Leu Trp Pro Val Thr Leu Ala Cys
50 55 60
Phe Val Leu Ala Ala Val Tyr Arg Ile Asn Trp Ile Thr Gly Gly Ile
65 70 75 80
Ala Ile Ala Met Ala Cys Leu Val Gly Leu Met Trp Leu Ser Tyr Phe
85 90 95
Ile Ala Ser Phe Arg Leu Phe Ala Arg Thr Arg Ser Met Trp Ser Phe
100 105 110
Asn Pro Glu Thr Asn Ile Leu Leu Asn Val Pro Leu His Gly Thr Ile
115 120 125
Leu Thr Arg Pro Leu Leu Glu Ser Glu Leu Val Ile Gly Ala Val Ile
130 135 140
Leu Arg Gly His Leu Arg Ile Ala Gly His His Leu Gly Arg Cys Asp
145 150 155 160
Ile Lys Asp Leu Pro Lys Glu Ile Thr Val Ala Thr Ser Arg Thr Leu
165 170 175
Ser Tyr Tyr Lys Leu Gly Ala Ser Gln Arg Val Ala Gly Asp Ser Gly
180 185 190
Phe Ala Ala Tyr Ser Arg Tyr Arg Ile Gly Asn Tyr Lys Leu Asn Thr
195 200 205
Asp His Ser Ser Ser Ser Asp Asn Ile Ala Leu Leu Val Gln
210 215 220
<210> 23
<211> 61
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic_ORF6_Protein
<400> 23
Met Phe His Leu Val Asp Phe Gln Val Thr Ile Ala Glu Ile Leu Leu
1 5 10 15
Ile Ile Met Arg Thr Phe Lys Val Ser Ile Trp Asn Leu Asp Tyr Ile
20 25 30
Ile Asn Leu Ile Ile Lys Asn Leu Ser Lys Ser Leu Thr Glu Asn Lys
35 40 45
Tyr Ser Gln Leu Asp Glu Glu Gln Pro Met Glu Ile Asp
50 55 60
<210> 24
<211> 121
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic_ORF7a_Protein
<400> 24
Met Lys Ile Ile Leu Phe Leu Ala Leu Ile Thr Leu Ala Thr Cys Glu
1 5 10 15
Leu Tyr His Tyr Gln Glu Cys Val Arg Gly Thr Thr Val Leu Leu Lys
20 25 30
Glu Pro Cys Ser Ser Gly Thr Tyr Glu Gly Asn Ser Pro Phe His Pro
35 40 45
Leu Ala Asp Asn Lys Phe Ala Leu Thr Cys Phe Ser Thr Gln Phe Ala
50 55 60
Phe Ala Cys Pro Asp Gly Val Lys His Val Tyr Gln Leu Arg Ala Arg
65 70 75 80
Ser Val Ser Pro Lys Leu Phe Ile Arg Gln Glu Glu Val Gln Glu Leu
85 90 95
Tyr Ser Pro Ile Phe Leu Ile Val Ala Ala Ile Val Phe Ile Thr Leu
100 105 110
Cys Phe Thr Leu Lys Arg Lys Thr Glu
115 120
<210> 25
<211> 121
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic_ORF8_Protein
<400> 25
Met Lys Phe Leu Val Phe Leu Gly Ile Ile Thr Thr Val Ala Ala Phe
1 5 10 15
His Gln Glu Cys Ser Leu Gln Ser Cys Thr Gln His Gln Pro Tyr Val
20 25 30
Val Asp Asp Pro Cys Pro Ile His Phe Tyr Ser Lys Trp Tyr Ile Arg
35 40 45
Val Gly Ala Arg Lys Ser Ala Pro Leu Ile Glu Leu Cys Val Asp Glu
50 55 60
Ala Gly Ser Lys Ser Pro Ile Gln Tyr Ile Asp Ile Gly Asn Tyr Thr
65 70 75 80
Val Ser Cys Leu Pro Phe Thr Ile Asn Cys Gln Glu Pro Lys Leu Gly
85 90 95
Ser Leu Val Val Arg Cys Ser Phe Tyr Glu Asp Phe Leu Glu Tyr His
100 105 110
Asp Val Arg Val Val Leu Asp Phe Ile
115 120
<210> 26
<211> 419
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic_Nulceocapsid_Phosphoprotein
<400> 26
Met Ser Asp Asn Gly Pro Gln Asn Gln Arg Asn Ala Pro Arg Ile Thr
1 5 10 15
Phe Gly Gly Pro Ser Asp Ser Thr Gly Ser Asn Gln Asn Gly Glu Arg
20 25 30
Ser Gly Ala Arg Ser Lys Gln Arg Arg Pro Gln Gly Leu Pro Asn Asn
35 40 45
Thr Ala Ser Trp Phe Thr Ala Leu Thr Gln His Gly Lys Glu Asp Leu
50 55 60
Lys Phe Pro Arg Gly Gln Gly Val Pro Ile Asn Thr Asn Ser Ser Pro
65 70 75 80
Asp Asp Gln Ile Gly Tyr Tyr Arg Arg Ala Thr Arg Arg Ile Arg Gly
85 90 95
Gly Asp Gly Lys Met Lys Asp Leu Ser Pro Arg Trp Tyr Phe Tyr Tyr
100 105 110
Leu Gly Thr Gly Pro Glu Ala Gly Leu Pro Tyr Gly Ala Asn Lys Asp
115 120 125
Gly Ile Ile Trp Val Ala Thr Glu Gly Ala Leu Asn Thr Pro Lys Asp
130 135 140
His Ile Gly Thr Arg Asn Pro Ala Asn Asn Ala Ala Ile Val Leu Gln
145 150 155 160
Leu Pro Gln Gly Thr Thr Leu Pro Lys Gly Phe Tyr Ala Glu Gly Ser
165 170 175
Arg Gly Gly Ser Gln Ala Ser Ser Arg Ser Ser Ser Arg Ser Arg Asn
180 185 190
Ser Ser Arg Asn Ser Thr Pro Gly Ser Ser Arg Gly Thr Ser Pro Ala
195 200 205
Arg Met Ala Gly Asn Gly Gly Asp Ala Ala Leu Ala Leu Leu Leu Leu
210 215 220
Asp Arg Leu Asn Gln Leu Glu Ser Lys Met Ser Gly Lys Gly Gln Gln
225 230 235 240
Gln Gln Gly Gln Thr Val Thr Lys Lys Ser Ala Ala Glu Ala Ser Lys
245 250 255
Lys Pro Arg Gln Lys Arg Thr Ala Thr Lys Ala Tyr Asn Val Thr Gln
260 265 270
Ala Phe Gly Arg Arg Gly Pro Glu Gln Thr Gln Gly Asn Phe Gly Asp
275 280 285
Gln Glu Leu Ile Arg Gln Gly Thr Asp Tyr Lys His Trp Pro Gln Ile
290 295 300
Ala Gln Phe Ala Pro Ser Ala Ser Ala Phe Phe Gly Met Ser Arg Ile
305 310 315 320
Gly Met Glu Val Thr Pro Ser Gly Thr Trp Leu Thr Tyr Thr Gly Ala
325 330 335
Ile Lys Leu Asp Asp Lys Asp Pro Asn Phe Lys Asp Gln Val Ile Leu
340 345 350
Leu Asn Lys His Ile Asp Ala Tyr Lys Thr Phe Pro Pro Thr Glu Pro
355 360 365
Lys Lys Asp Lys Lys Lys Lys Ala Asp Glu Thr Gln Ala Leu Pro Gln
370 375 380
Arg Gln Lys Lys Gln Gln Thr Val Thr Leu Leu Pro Ala Ala Asp Leu
385 390 395 400
Asp Asp Phe Ser Lys Gln Leu Gln Gln Ser Met Ser Ser Ala Asp Ser
405 410 415
Thr Gln Ala
<210> 27
<211> 38
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic_ORF10_Protein
<400> 27
Met Gly Tyr Ile Asn Val Phe Ala Phe Pro Phe Thr Ile Tyr Ser Leu
1 5 10 15
Leu Leu Cys Arg Met Asn Ser Arg Asn Tyr Ile Ala Gln Val Asp Val
20 25 30
Val Asn Phe Asn Leu Thr
35
<210> 28
<211> 18
<212> DNA
<213> Artificial Sequence
<220>
<223> T7_promotor
<400> 28
taatacgact cactatag 18
<210> 29
<211> 26
<212> DNA
<213> Artificial Sequence
<220>
<223> PolyA-Element
<400> 29
aaaaaaaaaa aaaaaaaaaa cggccg 26
<210> 30
<211> 21536
<212> DNA
<213> Artificial Sequence
<220>
<223> COVAX-Polyprotein encoding sequence
<400> 30
atggcaaaga tgggcaaata cggcctgggc ttcaaatggg ccccagaatt tccatggatg 60
cttccgaacg catcggagaa gttgggtaac cctgagaggt cagaggagga tgggttttgc 120
ccctctgctg cgcaagaacc gaaagttaaa ggaaaaactt tggttaatca cgtgagggtg 180
aattgtagcc ggcttccagc tttggaatgc tgtgttcagt ctgccataat ccgtgatatt 240
tttgtagatg aggatcccca gaaggtggag gcctcaacta tgatggcatt gcagttcggt 300
agtgccgtct tggttaagcc atccaagcgc ttgtctattc aggcatggac taatttgggt 360
gtgcttccca aaacagctgc catggggttg ttcaagcgcg tctgcctgtg taacaccagg 420
gagtgctctt gtgacgccca cgtggccttt caccttttta cggtccaacc cgatggtgta 480
tgcctgggta atggccgttt tataggctgg ttcgttccag tcacagccat accggagtat 540
gcgaagcagt ggttgcaacc ctggtccatc cttcttcgta agggtggtaa caaagggtct 600
gtgacatccg gccacttccg ccgcgctgtt accatgcctg tgtatgactt taatgtagag 660
gatgcttgtg aggaggttca tcttaacccg aagggtaagt actcctgcaa ggcgtatgcc 720
ctgctgaagg gctatcgcgg tgttaagccc atcctgtttg tggaccagta tggttgcgac 780
tatactggat gtctcgccaa gggtcttgag gactatggcg atctcacctt gagtgagatg 840
aaggagttgt tccctgtgtg gcgtgactcc ttggatagtg aagtccttgt ggcttggcac 900
gttgatcgag atcctcgggc tgctatgcgt ctgcagactc ttgctactgt acgttgcatt 960
gattatgtgg gccaaccgac cgaggatgtg gtggatggag atgtggtagt gcgtgagcct 1020
gctcatcttc tcgcagccaa tgccattgtt aaaagactcc cccgtttggt ggagactatg 1080
ctgtatacgg attcgtccgt tacagaattc tgttataaaa ccaagctgtg tgaatgcggt 1140
tttatcacgc agtttggcta tgtggattgt tgtggtgaca cctgtgattt tcgtgggtgg 1200
gttgccggca atatgatgga tggctttcca tgtccagggt gtaccaaaaa ttatatgccc 1260
tgggaattgg aggcccagtc atcaggtgtt ataccagaag gaggtgttct attcactcag 1320
agcactgata cagtgaatcg tgagtccttt aagctctacg gtcatgctgt tgtgcctttt 1380
ggttctgctg tgtattggag cccttgccca ggtatgtggc ttccagtaat ttggtcgtcg 1440
gttaagtcat actctggttt gacttataca ggagtagttg gttgtaaggc aattgttcaa 1500
gagacagacg ctatatgtcg ttctctgtat atggattatg tccagcacaa gtgtggcaat 1560
ctcgagcaga gagctatcct tggattggac gatgtctatc atagacagtt gcttgtgaat 1620
aggggtgact atagtctcct ccttgagaat gtggatttgt ttgttaagcg gcgcgctgaa 1680
tttgcttgca aattcgccac ctgtggagat ggtcttgtac ccctcctact agatggttta 1740
gtgccccgca gttattattt gattaagagt ggtcaagctt tcacctctat gatggttaat 1800
tttagccatg aggtgactga catgtgtatg gacatggctt tattgttcat gcatgatgtt 1860
aaagtggcca ctaagtatgt taagaaggtt actggcaaac tggccgtgcg ctttaaagcg 1920
ttgggtgtag ccgttgtcag aaaaattact gaatggtttg atttagccgt ggacattgct 1980
gctagtgccg ctggatggct ttgctaccag ctggtaaatg gcttatttgc agtggccaat 2040
ggtgttataa cctttgtaca ggaggtgcct gagcttgtca agaattttgt tgacaagttc 2100
aaggcatttt tcaaggtttt gatcgactct atgtcggttt ctatcttgtc tggacttact 2160
gttgtcaaga ctgcctcaaa tagggtgtgt cttgctggca gtaaggttta tgaagttgtg 2220
cagaaatctt tgtctgcata tgttatgcct gtgggttgca gcgaagccac ttgtttggtg 2280
ggtgagattg aacctgcagt ttttgaagat gatgttgttg atgtggttaa agccccatta 2340
acatatcaag gctgttgtaa gccacccact tctttcgaga agatttgtat tgtggataaa 2400
ttgtatatgg ccaagtgtgg tgatcaattt taccctgtgg ttgttgataa cgacactgtt 2460
ggcgtgttag atcagtgctg gaggtttccc tgtgcgggca agaaagtcga gtttaacgac 2520
aagcccaaag tcaggaagat accctccacc cgtaagatta agatcacctt cgcactggat 2580
gcgacctttg atagtgttct ttcgaaggcg tgttcagagt ttgaagttga taaagatgtt 2640
acattggatg agctgcttga tgttgtgctt gacgcagttg agagtacgct cagcccttgt 2700
aaggagcatg atgtgatagg cacaaaagtt tgtgctttac ttgataggtt ggcaggagat 2760
tatgtctatc tttttgatga gggaggcgat gaagtgatcg ccccgaggat gtattgttcc 2820
ttttctgctc ctgatgacga ggactgcgtt gcagcggatg ttgtagatgc agatgaaaac 2880
caagatgatg atgccgagga ctcagcagtc cttgtcgctg atacccaaga agaggacggc 2940
gttgccaagg ggcaggttga ggcggattcg gaaatttgcg ttgcgcatac tggtagtcaa 3000
gaagaattgg ctgagcctga tgctgtcgga tctcaaactc ccatcgcctc tgctgaggaa 3060
accgaagtcg gagaggcaag cgacagggaa gggattgctg aggcgaaggc aactgtgtgt 3120
gctgatgctg tagatgcctg ccccgatcaa gtggaggcat ttgaaattga aaaggtcgag 3180
gactctatct tggatgagct tcaaactgaa cttaatgcgc cagcggacaa gacctatgag 3240
gatgtcttgg cattcgatgc cgtatgctca gaggcgttgt ctgcattcta tgctgtgccg 3300
agtgatgaga cgcactttaa agtgtgtgga ttctattcgc ctgctataga gcgcactaat 3360
tgttggctgc gttctacttt gatagtaatg cagagtctac ctttggaatt taaagacttg 3420
gagatgcaaa agctctggtt gtcttacaag gccggctatg accaatgctt tgtggacaaa 3480
ctagttaaga gcgtgcccaa gtctattatc cttccacaag gtggttatgt ggcagatttt 3540
gcctatttct ttctaagcca gtgtagcttt aaagcttatg ctaactggcg ttgtttagag 3600
tgtgacatgg agttaaagct tcaaggcttg gacgccatgt ttttctatgg ggacgttgtg 3660
tctcatatgt gcaagtgtgg taatagcatg accttgttgt ctgcagatat accctacact 3720
ttgcattttg gagtgcgaga tgataagttt tgcgcttttt acacgccaag aaaggtcttt 3780
agggctgctt gtgcggtaga tgttaatgat tgtcactcta tggctgtagt agagggcaag 3840
caaattgatg gtaaagtggt taccaaattt attggtgaca aatttgattt tatggtgggt 3900
tacgggatga catttagtat gtctcctttt gaactcgccc agttatatgg ttcatgtata 3960
acaccaaatg tttgttttgt taaaggagat gttataaagg ttgttcgctt agttaatgct 4020
gaagtcattg ttaaccctgc taatgggcgt atggctcatg gtgccggcgt cgccggcgcc 4080
atagctgaaa aggcgggcag tgcttttatt aaagaaacct ccgatatggt gaaggctcag 4140
ggcgtttgcc aggttggtga atgctatgaa tctgccggtg gtaagttatg taaaaaggtg 4200
cttaacattg tagggccaga tgcgcgaggg catggcaagc aatgctattc acttttagag 4260
cgtgcttatc agcatattaa taagtgtgac aatgttgtca ctactttaat ttcggctggt 4320
atatttagtg tgcctactga tgtctcccta acttacttac ttggtgtagt gacaaagaat 4380
gtcattcttg tcagtaacaa ccaggatgat tttgatgtga tagagaagtg tcaggtgacc 4440
tccgttgctg gtaccaaagc gctatcactt caattggcca aaaatttgtg ccgtgatgta 4500
aagtttgtga cgaatgcatg tagttcgctt tttagtgaat cttgctttgt ctcaagctat 4560
gatgtgttgc aggaagttga agcgctgcga catgatatac aattggatga tgatgctcgt 4620
gtctttgtgc aggctaatat ggactgtctg cccacagact ggcgtctcgt taacaaattt 4680
gatagtgttg atggtgttag aaccattaag tattttgaat gcccgggcgg gatttttgta 4740
tccagccagg gcaaaaagtt tggttatgtt cagaatggtt catttaagga ggcgagtgtt 4800
agccaaataa gggctttact cgctaataag gttgatgtct tgtgtactgt tgatggtgtt 4860
aacttccgct cctgctgcgt agcagagggt gaagtttttg gcaagacatt aggttcagtc 4920
ttttgtgatg gcataaatgt caccaaagtt aggtgtagtg ccatttacaa gggtaaggtt 4980
ttctttcagt acagtgattt gtccgaggca gatcttgtgg ctgttaaaga tgcctttggt 5040
tttgatgaac cacaactgct gaagtactac actatgcttg gcatgtgtaa gtggccagta 5100
gttgtttgtg gcaattattt tgctttcaag cagtcaaata ataattgcta catcaacgtg 5160
gcatgtttaa tgctgcaaca cttgagttta aagtttccta agtggcaatg gcaagaggct 5220
tggaacgagt tccgctctgg taaaccacta aggtttgtgt ccttggtatt agcaaagggc 5280
agctttaaat ttaatgaacc ttctgattct atcgatttta tgcgtgtggt gctacgtgaa 5340
gcagatttga gtggtgccac gtgcaatttg gaatttgttt gtaaatgtgg tgtgaagcaa 5400
gagcagcgca aaggtgttga cgctgttatg cattttggta cgttggataa aggtgatctt 5460
gtcaggggtt ataatatcgc atgtacgtgc ggtagtaaac ttgtgcattg cacccaattt 5520
aacgtaccat ttttaatttg ctccaacaca ccagagggta ggaaactgcc cgacgatgtt 5580
gttgcagcta atatttttac tggtggtagt gtgggccatt acacgcatgt gaaatgtaaa 5640
cccaagtacc agctttatga tgcttgtaat gttaataagg tttcggaggc taagggtaat 5700
tttaccgatt gcctctacct taaaaattta aagcaaacct tctcgtctgt gctgacgact 5760
ttttatttag atgacgtaaa gtgtgtggag tataagccag atttatcgca gtattactgt 5820
gagtctggta aatattatac aaaacccatt attaaggccc aatttagaac atttgagaag 5880
gttgatggtg tctataccaa ctttaaattg gtgggacata gtattgctga aaaactcaat 5940
gctaagctgg gatttgattg taattctccc tttgtggagt ataaaattac agagtggcca 6000
acagctactg gagatgtggt gttggctagt gatgatttgt atgtaagtcg gtacttaagc 6060
gggtgcatta cttttggtaa accggttgtc tggcttggcc atgaggaagc atcgctgaaa 6120
tctctcacat attttaatag acctagtgtc gtttgtgaaa ataaatttaa cgtgttgccc 6180
gttgatgtca gtgaacccac ggacaagggg cctgtgcctg ctgcagtcct tgttaccggc 6240
gtccctggag ctgatgcgtc agctggtgcc ggtattgcca aggagcaaaa agcctgtgct 6300
tctgctagtg tggaggatca ggttgttacg gaggttcgtc aagagccatc tgtttcagct 6360
gctgatgtca aagaggttaa attgaatggt gttaaaaagc ctgttaaggt ggaaggtagt 6420
gtggttgtta atgatcccac tagcgaaacc aaagttgtta aaagtttgtc tattgttgat 6480
gtctatgata tgttcctgac agggtgtaag tatgtggttt ggactgctaa tgagttgtct 6540
cgactagtaa attcaccgac tgttagggag tatgtgaagt ggggtatggg aaagattgta 6600
acacccgcta agttgttgtt gttaagagat gagaagcaag agttcgtagc gccaaaagta 6660
gtcaaggcga aagctattgc ctgctattgt gctgtgaagt ggtttctcct ctattgtttt 6720
agttggataa agtttaatac tgacaataag gttatataca ccacagaagt agcttcaaag 6780
cttactttca agttgtgctg tttggccttt aagaatgcct tacagacgtt taattggagc 6840
gttgtgtcta ggggcttttt cctagttgca acggtctttt tactctggtt taactttttg 6900
tatgctaatg ttattttgag tgacttctat ttgcctaata ttgggcctct ccctacgttt 6960
gtgggacaga tagttgcgtg gtttaagact acatttggtg tgtcaaccat ctgtgatttc 7020
taccaggtga cggatttggg ctatagaagt tcgttttgta atggaagtat ggtatgtgaa 7080
ctatgcttct caggttttga tatgctggac aactatgatg ctataaatgt tgttcaacac 7140
gttgtagata ggcgtttgtc ctttgactat attagcctat ttaaactggt agttgagctt 7200
gtaatcggct actctcttta tactgtgtgc ttctacccac tgtttgtcct tattggaatg 7260
cagttattga ccacatggtt gcctgaattc tttatgctgg agactatgca ttggagtgct 7320
cgtttgtttg tgtttgttgc caatatgctt ccagctttta cgttactgcg attttacatc 7380
gtggtgacag ctatgtataa ggtctattgt ctttgtagac atgttatgta tggatgtagt 7440
aagcctggtt gcttgttttg ttataagaga aaccgtagtg tccgtgttaa gtgtagcacc 7500
gttgttggtg gttcactacg ctattacgat gtaatggcta acggcggcac aggtttctgt 7560
acaaagcacc agtggaactg tcttaattgc aattcctgga aaccaggcaa tacattcata 7620
actcatgaag cagcggcgga cctctctaag gagttgaaac gccctgtgaa tccaacagat 7680
tctgcttatt actcggtcac agaggttaag caggttggtt gttccatgcg tttgttctac 7740
gagagagatg gacagcgtgt ttatgatgat gttaatgcta gtttgtttgt ggacatgaat 7800
ggtctgctgc attctaaagt taaaggtgtg cctgaaacgc atgttgtggt tgttgagaat 7860
gaagctgata aagctggttt tctcggcgcc gcagtgtttt atgcacaatc gctctacaga 7920
cctatgttga tggtggaaaa gaaattaata actaccgcca acactggttt gtctgttagt 7980
cgaactatgt ttgaccttta tgtagattca ttgctgaacg tcctcgacgt ggatcgcaag 8040
agtctaacaa gttttgtaaa tgctgcgcac aactctctaa aggagggtgt tcagcttgaa 8100
caagttatgg atacctttat tggctgtgcc cgacgtaagt gtgctataga ttctgatgtt 8160
gaaaccaagt ctattaccaa gtccgtcatg tcggcagtaa atgctggcgt tgattttacg 8220
gatgagagtt gtaataactt ggtgcctacc tatgttaaaa gtgacactat cgttgcagcc 8280
gatttgggtg ttcttattca gaataatgct aagcatgtac aggctaatgt tgctaaagcc 8340
gctaatgtgg cttgcatttg gtctgtggat gcttttaacc agctatctgc tgacttacag 8400
cataggctgc gaaaagcatg ttcaaaaact ggcttgaaga ttaagcttac ttataataag 8460
caggaggcaa atgttcctat tttaactaca ccgttctctc ttaaaggggg cgctgttttt 8520
agtagaatgt tacaatggtt gtttgttgct aatttgattt gtttcattgt gttgtgggcc 8580
cttatgccaa catatgcagt gcacaaatcg gatatgcagt tgcctttata tgccagtttt 8640
aaagttatag ataacggtgt gctaagggat gtgtctgtta ctgacgcatg cttcgcaaac 8700
aaatttaatc aattcgacca atggtatgag tctacttttg gtcttgctta ttaccgcaac 8760
tctaaggctt gtcctgttgt ggttgctgta atagatcaag acattggcca taccttattt 8820
aatgttccta ccacagtttt aagatatgga tttcatgtgt tgcattttat aacccatgca 8880
tttgctactg atagcgtgca gtgttacacg ccacatatgc aaatccccta tgataatttc 8940
tatgctagtg gttgcgtgtt gtcatccctc tgtactatgc ttgcgcatgc agatggaacc 9000
ccgcatcctt attgttatac agggggtgtt atgcataatg cctctctgta tagttctttg 9060
gctcctcatg tccgttataa cctggctagt tcaaatggtt atatacgttt tcccgaagtg 9120
gttagtgaag gcattgtgcg tgttgtgcgc actcgctcta tgacctactg cagggttggt 9180
ttatgtgagg aggccgagga gggtatctgc tttaatttta atcgttcatg ggtattgaac 9240
aacccgtatt atagggccat gcctggaact ttttgtggta ggaatgcttt tgatttaata 9300
catcaagttt taggaggatt agtgcggcct attgatttct ttgccttaac ggcgagttca 9360
gtggctggtg ctatccttgc aattattgtc gttttggctt tctattattt aatcaagctt 9420
aagcgtgcct ttggtgacta cactagtgtt gtggttatca atgtaattgt gtggtgtata 9480
aattttctga tgctttttgt gtttcaggtt tatcccacat tgtcttgttt atatgcttgt 9540
ttctacttct acaccacgct ttatttccct tcggagataa gtgttgttat gcatttgcaa 9600
tggcttgtca tgtatggtgc tattatgccc ttgtggtttt gcattattta cgtggcagtc 9660
gttgtttcaa accatgcatt gtggttgttc tcttactgcc gcaaaattgg taccgaggtt 9720
cgtagtgacg gcacatttga ggaaatggcc cttactacct ttatgattac taaagaatct 9780
tattgtaagt tgaaaaactc tgtttctgat gttgctttta acaggtactt gagtctttac 9840
aacaagtacc gttacttcag tggcaaaatg gatactgccg cttatagaga ggctgcctgt 9900
tcacaactgg caaaggcaat ggaaacattt aaccataata atggtaatga tgttctctat 9960
cagcctccaa ccgcctctgt tactacatca tttttacagt ctggtatagt gaagatggtg 10020
tcgcccacct ctaaagtgga gccttgtatt gttagtgtta cttatggtaa catgacactt 10080
aatgggttgt ggttggatga taaagtttat tgcccaagac atgttatctg ttcttcagct 10140
gacatgacag accctgatta tcctaatttg ctttgtagag tgacatcaag tgatttttgt 10200
gttatgtctg gtcgtatgag ccttactgta atgtcttatc aaatgcaggg ctgccaactt 10260
gttttgactg ttacactgca aaatcctaac acgcctaagt attccttcgg tgttgttaag 10320
cctggtgaga catttactgt actggctgca tacaatggca gacctcaagg agccttccat 10380
gttacgcttc gtagtagcca taccataaag ggctcctttc tatgtggatc ctgcggttct 10440
gtaggatatg ttttaactgg cgatagtgta cgatttgttt atatgcatca gctagagttg 10500
agtactggtt gtcataccgg tactgacttt agtgggaact tttatggtcc ctatagagat 10560
gcgcaagttg tacaattgcc tgttcaggat tatacgcaga ctgttaatgt tgtagcttgg 10620
ctttatgctg ctatttttaa cagatgcaac tggtttgtgc aaagtgatag ttgttccctg 10680
gaggagttta atgtttgggc tatgaccaat ggttttagct caatcaaagc cgatcttgtc 10740
ttggatgcgc ttgcttctat gacaggcgtt acagttgaac aggtgttggc cgctattaag 10800
aggctgcatt ctggattcca gggcaaacaa attttaggta gttgtgtgct tgaagatgag 10860
ctgacaccaa gtgatgttta tcaacaacta gctggtgtca agctacagtc aaagcgcaca 10920
agagttataa aaggtacatg ttgctggata ttggcttcaa cgtttttgtt ctgtagcatt 10980
atctcagcat ttgtaaaatg gactatgttt atgtatgtta ctacccatat gttgggagtg 11040
acattgtgtg cactttgttt tgtaagcttt gctatgttgt tgatcaagca taagcatttg 11100
tatttaacta tgtacatcat gcctgtgtta tgcacactgt tttacaccaa ctatttggtt 11160
gtgtacaaac agagttttag aggtctagct tatgcttggc tttcacactt tgtccctgct 11220
gtagattata catatatgga tgaagtttta tatggtgttg tgttgctagt agctatggtg 11280
tttgttacca tgcgtagcat aaaccacgac gtcttttcta ttatgttctt ggttggtaga 11340
cttgtcagcc tggtatccat gtggtatttt ggagccaatt tagaggaaga ggtactattg 11400
ttcctcacat ccctatttgg cacgtacaca tggactacta tgttgtcatt ggctaccgct 11460
aaggttattg ctaaatggtt ggctgtgaat gtcttgtact tcacagacgt accgcaaatt 11520
aaattagttc tgttgagcta cttgtgtatt ggttatgtgt gttgttgtta ttggggaatc 11580
ttgtcactcc ttaatagcat ttttaggatg ccattgggcg tctacaatta taaaatctcc 11640
gttcaggagt tacgttatat gaatgctaat ggcttgcgcc cacctagaaa tagttttgag 11700
gccctgatgc ttaattttaa gctgttggga attggtggtg tgccagtcat tgaagtatct 11760
caaattcaat caagattgac ggatgttaaa tgtgctaatg ttgtgttgct taattgcctc 11820
cagcacttgc atattgcatc taattctaag ttgtggcagt attgtagtac tttgcacaat 11880
gaaatactgg ctacatctga tttgagcgtg gccttcgata agttggctca actcttagtt 11940
gttttatttg ctaatccagc agcagtggat agcaagtgcc ttgcaagtat tgaagaagtg 12000
agcgatgatt acgttcgcga caatactgtc ttgcaagcct tacagagtga atttgttaat 12060
atggctagct tcgttgagta tgaacttgct aagaagaatc tagatgaggc taaggctagc 12120
ggctctgcca atcaacagca gattaagcag ctagagaagg cgtgtaatat tgctaagtca 12180
gcatatgagc gcgacagagc tgttgctcgt aagctggaac gtatggctga tttagctctt 12240
acaaacatgt ataaagaagc tagaattaat gataagaaga gtaaggtagt gtctgcattg 12300
caaaccatgc tctttagtat ggtgcgtaag ctagataacc aagctcttaa ttctatttta 12360
gacaacgcag ttaagggttg tgtacctttg aatgcaatac catcattgac ttcgaacact 12420
ctgactataa tagtgccaga taagcaggtt tttgatcagg ttgtggataa tgtgtatgtc 12480
acctatgctg ggaatgtatg gcatatacag tttattcaag atgctgatgg tgctgttaaa 12540
caattgaatg agatagatgt taattcaacc tggcctctag tcattgctgc aaataggcat 12600
aatgaagtgt ctactgttgt tttgcagaac aatgagttga tgcctcagaa gttgagaact 12660
caggttgtca atagtggctc agatatgaat tgtaatactc ctacccagtg ttactataat 12720
actactggca cgggtaagat tgtgtatgct atacttagtg actgtgacgg cctgaagtac 12780
actaagatag taaaagaaga tggaaattgt gttgttttgg aattggatcc tccctgtaag 12840
ttttctgttc aggatgtgaa gggccttaaa attaagtacc tttactttgt gaaggggtgt 12900
aatacactgg ctagaggctg ggttgtaggc accttatcct cgacagtgag attgcaggcg 12960
ggtacggcaa ctgagtatgc ctccaactct gcaatactgt cgctgtgtgc gttttctgta 13020
gatcctaaga aaacgtactt ggattatata aaacagggtg gagttcccgt tactaattgt 13080
gttaagatgt tatgtgacca tgctggcact ggtatggcca ttactattaa gccggaggca 13140
accactaatc aggattctta tggtggtgct tccgtttgta tatattgccg ctcgcgtgtt 13200
gaacatccag atgttgatgg attgtgcaaa ttacgcggca agtttgtcca agtgccctta 13260
ggcataaaag atcctgtgtc atatgtgttg acgcatgatg tttgtcaggt ttgtggcttt 13320
tggcgagatg gtagctgttc ctgtgtaggc acaggctccc agtttcagtc aaaagacacg 13380
aactttttaa acggattcgg ggtacaagtg taaatgcccg tcttgtaccc tgtgccagtg 13440
gcttggacac tgatgttcaa ttaagggcat ttgacatttg taatgctaat cgagctggca 13500
ttggtttgta ttataaagtg aattgctgcc gcttccagcg tgtagatgag gacggcaaca 13560
agttggataa gttctttgtt gttaaaagaa ctaatttaga agtgtataac aaggagaaag 13620
aatgctatga gttgacaaaa gaatgcggtg ttgtggctga acacgagttc ttcacatttg 13680
atgtggaggg aagtcgggta ccacacatag tccgtaaaga tctttcaaag tttactatgt 13740
tagatctttg ctatgcattg cgtcattttg accgcaatga ttgttcaact cttaaggaaa 13800
ttctccttac atatgctgag tgtgaagagt cctacttcca aaagaaggac tggtatgatt 13860
ttgttgagaa tcctgatata attaatgtgt acaagaagct tggtcctata tttaatagag 13920
ccctgcttaa cactgccaag tttgcagacg cattagtgga ggcaggctta gtaggtgttt 13980
taacacttga taatcaagat ttatatggtc aatggtatga ctttggagat tttgtcaaga 14040
cagtacctgg ttgtggtgtt gccgtggcag actcttatta ttcatatatg atgccaatgc 14100
tgactatgtg tcatgcgttg gatagtgagt tgtttgttaa tggtacttat agggagtttg 14160
accttgttca gtatgatttt actgatttca agctagagct gttcactaag tattttaagc 14220
attggagtat gacctaccac ccgaacacct gtgagtgcga ggatgacagg tgcattattc 14280
attgcgccaa ttttaatata cttttcagca tggtcttacc taagacctgt tttgggcctc 14340
ttgttaggca gatatttgtg gatggtgttc ctttcgttgt gtcgatcggt taccattata 14400
aagaattagg tgttgttatg aatatggatg tggatacaca tcgttatcgc ttgtctctta 14460
aggacttgct tttgtatgct gcagaccctg cccttcatgt ggcgtctgct agtgcactgc 14520
ttgatttgcg cacatgttgt tttagcgttg cagctattac aagtggcgta aaatttcaaa 14580
cagttaaacc tggaaatttt aatcaggatt tctacgagtt tattttgagt aaaggcctgc 14640
ttaaagaggg gagctccgtt gatttgaagc acttcttctt tacgcaggat ggtaatgctg 14700
ctattactga ttacaattac tacaagtata atctacccac catggtggat attaagcagt 14760
tgttgtttgt tttagaagtt gttaataagt acttcgagat ctatgagggt gggtgtatac 14820
ccgcaacaca ggtcattgtt aataattatg acaagagtgc tggctatcca tttaataaat 14880
ttggaaaggc caggctctat tatgaggcat tatcatttga ggagcaggat gaaatttatg 14940
cgtataccaa acgcaatgtc ctgccgaccc taactcaaat gaatcttaaa tatgctatta 15000
gtgctaagaa tagggcccgc accgttgctg gtgtctctat tctcagtact atgactggca 15060
gaatgtttca tcaaaagtgt ctaaagagta tagcagctac tcgcggtgtt cctgtagtta 15120
taggcaccac gaagttctat ggcggttggg atgatatgtt acgccgcctt attaaagatg 15180
ttgatagtcc tgtactcatg ggttgggact atcctaaatg tgatcgtgct atgccaaaca 15240
tactgcgtat tgttagtagt ttggtgctag cccgtaaaca tgattcgtgc tgttcgcata 15300
cggatagatt ctatcgtctt gcgaacgagt gcgcccaagt tttgagtgaa attgttatgt 15360
gtggtggttg ttattatgtt aaaccaggtg gcactagtag tggggatgca accactgctt 15420
ttgctaattc tgtgtttaac atttgtcaag ctgtttccgc caatgtatgc tcgcttatgg 15480
catgcaatgg acacaaaatt gaagatttga gtatacgcga gttacaaaag cgcctatact 15540
ctaatgtcta tcgtgcggac catgttgacc ccgcatttgt tagtgagtat tatgagtttt 15600
taaacaagca ttttagtatg atgattttga gtgatgatgg tgttgtgtgt tataattcag 15660
agtttgcgtc caagggttat attgctaata taagtgcctt tcaacaggta ttatattatc 15720
aaaacaacgt gtttatgtct gaggccaaat gttgggtaga aacagacatc gaaaagggac 15780
cgcatgaatt ttgttctcaa catacaatgc tagtcaagat ggatggtgat gaagtctacc 15840
ttccataccc tgatccttcg agaatcttag gagcaggctg ttttgttgat gatttactca 15900
agactgatag cgttctcttg atagagcgtt tcgtaagtct tgcaattgat gcttatcctt 15960
tagtatacca tgagaaccca gagtatcaaa atgtgttccg ggtatattta gaatacatca 16020
agaagctgta caatgatctc ggtaatcaga tcctggacag ctacagtgtt attttaagta 16080
cttgtgatgg tcaaaagttt actgacgaga cgttttacaa gaacatgtat ttaagaagtg 16140
cagtgctgca aagcgttggt gcctgcgttg tctgtagttc tcaaacatca ttacgttgtg 16200
gcagttgcat acgcaagcct ttgctgtgtt gcaaatgcgc ctatgatcat gttatgtcca 16260
ctgatcataa atatgtcctg agtgtgtcac catatgtgtg taattcaccg ggatgtgatg 16320
taaatgatgt taccaaattg tatttaggtg gtatgtcata ttattgtgag gaccataaac 16380
cacagtattc attcaaattg gtgatgaatg gtatggtttt tggtttatat aagcagtctt 16440
gtactggttc gccctacata gaggatttta ataaaatcgc tagttgcaaa tggacagaag 16500
tcgatgatta tgtgctagct aatgaatgca ccgaacgcct taaattgttt gccgcagaaa 16560
cgcagaaggc cacagaagag gcctttaagc aatgttatgc gtcagcaacg atccgtgaga 16620
tcgtgagcga tcgggagtta attttatctt gggaaattgg taaagtccgc ccgccactta 16680
ataaaaatta cgtgttcacc ggctaccatt ttactaataa tggtaagaca gttttaggtg 16740
agtatgtttt tgataagagt gagttgacta atggtgtgta ttatcgcgcc acaaccactt 16800
ataagttatc tgtaggtgat gtgttcattt taacatcaca cgcagtgtct agtttaagtg 16860
ctcctacatt agtaccgcag gagaattata ctagcattcg ttttgctagt gtttatagtg 16920
tgcctgagac gtttcagaat aatgtgccta attatcagca cattggaatg aagcgctatt 16980
gtactgtaca gggaccgcct ggtactggta agtcccatct agccattggg ctagctgttt 17040
attattgtac agcgcgcgtg gtgtataccg ctgctagcca tgctgcagtt gacgcgctgt 17100
gtgaaaaggc acataaattt ctcaacatca acgactgcac gcgtattgtt cctgcaaagg 17160
tgcgtgtaga ttgttatgat aaattcaagg tcaatgacac cactcgcaag tatgtgttta 17220
ctacaataaa tgcattacct gagttggtga ctgacattat tgtcgttgat gaagttagta 17280
tgcttaccaa ctatgagctg tctgttatta acagtcgtgt tagggctaag cattatgtgt 17340
atattggcga cccggcgcag ttacctgcac cacgtgtgct actgaataag ggaactctag 17400
aacctagata ttttaattcc gttaccaagc taatgtgttg tttgggtcca gatattttct 17460
tgggcacctg ttatagatgc cctaaggaga ttgtggatac ggtgtcagcc ttggtttata 17520
ataataagct gaaggctaaa aatgataata gctccatgtg ctttaaggtt tattataagg 17580
gccagactac acatgagagt tctagtgctg ttaatatgca gcaaatacat ttaatttcca 17640
agtttctgaa ggcaaacccc agttggagta acgccgtatt tattagtcct tataactcgc 17700
agaactatgt tgctaagaga gtcttgggat tacaaaccca gacagtagac tcagcgcagg 17760
gttctgaata tgattttgtt atctactcac agactgcgga aacagcgcat tctgtcaatg 17820
taaatagatt caatgttgct attacacgtg ctaagaaggg tattctctgt gtcatgagta 17880
gtatgcaatt atttgagtct cttaatttta ctacactgac gttggataag attaacaatc 17940
cacgattaca gtgtactaca aatttgttta aggattgtag caggagctat gtaggatatc 18000
acccagccca tgcaccatcc tttttggcag ttgatgacaa atataaggta ggcggtgatt 18060
tagccgtttg ccttaatgtt gctgattctg ctgtcactta ttcgcggctt atatcactca 18120
tgggattcaa gcttgacttg acccttgatg gttattgtaa gctgtttata actagagatg 18180
aagctatcaa acgtgttaga gcctgggttg gcttcgatgc agaaggtgcc catgcgatac 18240
gtgatagcat tgggacaaat ttcccattac aattaggctt ttcgactgga attgattttg 18300
ttgtcgaagc cactggaatg tttgctgaga gagatggtta tgtctttaaa aaggcagccg 18360
cacgagctcc tcctggcgaa caatttaaac accttatccc acttatgtca agagggcaga 18420
aatgggatgt ggttcgcatt agaatagtac aaatgttgtc agaccaccta gtggatttgg 18480
cagacagtgt tgtacttgtg acgtgggctg ccagctttga gctcacatgt ttgcgatatt 18540
tcgctaaagt tggaagagaa gttgtgtgta gtgtctgcac caagcgtgcg acatgtttta 18600
attctagaac tggatactat ggatgctggc gacatagtta ttcctgtgat tacctgtaca 18660
acccactaat agttgacatt caacagtggg gatatacagg atctttaact agcaatcatg 18720
atcctatttg cagcgtgcat aagggtgctc atgttgcatc atctgatgct atcatgaccc 18780
ggtgtctagc tgttcatgat tgcttttgta agtctgttaa ttggaattta gaatacccca 18840
ttatttcaaa tgaggtcagt gttaatacct cctgcaggtt attgcagcgc gtaatgttta 18900
gggctgcgat gctatgcaat aggtatgatg tgtgttatga cattggcaac cctaaaggtc 18960
ttgcctgtgt caaaggatat gattttaagt tctatgacgc ctcccctgtt gttaagtctg 19020
ttaaacagtt tgtttacaaa tacgaggcac ataaagatca atttttagat ggtttgtgta 19080
tgttttggaa ctgcaatgtg gataagtatc cagcgaatgc agttgtgtgt aggtttgaca 19140
cgcgtgtgtt gaacaaatta aatctccctg gctgtaatgg tggcagtttg tatgttaaca 19200
aacatgcatt ccacaccagt ccctttaccc gggctgcctt cgagaatttg aagcctatgc 19260
ctttctttta ttattcagat acgccctgtg tgtatatgga aggcatggaa tctaagcagg 19320
tcgattatgt cccattgaga agcgctacat gcatcacaag atgcaattta ggtggcgctg 19380
tttgtttaaa acatgctgag gagtatcgtg agtaccttga gtcttacaat acggcaacca 19440
cagcgggttt tactttttgg gtctataaga cttttgattt ttacaacctt tggaatactt 19500
ttactaggct ccaaagttta gaaaatgtag tgtataacct ggtcaacgct ggacactttg 19560
atggccgggc gggtgaactg ccttgtgctg ttataggtga gaaagtcatt gccaagattc 19620
aaaatgagga tgtcgtggtc tttaaaaata acacgccatt ccccactaat gtggctgtcg 19680
aattatttgc taagcgcagt attcggcccc accccgagct taagctcttt agaaatttga 19740
atattgacgt gtgctggagt cacgtccttt gggattatgc taaggatagt gtgttttgca 19800
gttcgacgta taaggtctgc aaatacacag atttacagtg cattgaaagc ttgaatgtac 19860
tttttgatgg tcgtgataat ggtgctcttg aagcttttaa gaagtgccgg aatggcgtct 19920
acattaacac gacaaaaatt aaaagtctgt cgatgattaa aggcccacaa cgtgccgatt 19980
tgaatggcgt agttgtggag aaagttggag attctgatgt ggaattttgg tttgctgtgc 20040
gtaaagacgg tgacgatgtt atcttcagcc gtacagggag ccttgaaccg agccattacc 20100
ggagcccaca aggtaatccg ggtggtaatc gcgtgggtga tctcagcggt aatgaagctc 20160
tagcgcgtgg cactatcttt actcaaagca gattattatc ttctttcaca cctcgatcag 20220
agatggagaa agattttatg gatttagatg atgatgtgtt cattgcaaaa tatagtttac 20280
aggactacgc gtttgaacac gttgtttatg gtagttttaa ccagaagatt attggaggtt 20340
tgcatttgct tattggctta gcccgtaggc agcaaaaatc caatctggta attcaagagt 20400
tcgtgacata cgactctagc attcattcgt actttatcac tgacgagaac agtggtagta 20460
gtaagagtgt gtgcactgtt attgatttat tgttagatga ttttgtggac attgtaaagt 20520
ccctgaatct aaagtgtgtg agtaaggttg ttaatgttaa tgtggatttt aaggacttcc 20580
agtttatgtt gtggtgcaat gaggagaagg tcatgacttt ctatcctcgt ttgcaggctg 20640
ctgctgactg gaaacctggt tatgttatgc ctgtcttata taagtatttg gaatcgcctc 20700
tggaaagagt aaacctctgg aattatggca agccgattac tttacctaca ggatgtatga 20760
tgaatgttgc taagtatact caattatgtc aatatttgag cactacaaca ttagcagttc 20820
cggctaatat gcgtgtctta caccttggtg ccggttcgga taagggtgtt gcccctgggt 20880
ctgcagttct taggcagtgg ctaccagcgg gaagtattct tgtagataat gatgtgaatc 20940
catttgtgag tgacagtgtc gcctcatatt atggaaattg tataacctta ccctttgatt 21000
gtcagtggga tctgataatt tctgatatgt acgaccctct tactaagaac attggggagt 21060
acaacgtgag taaagatgga ttctttactt acctctgtca tttaattcgt gacaagttgg 21120
ctctgggtgg cagtgttgcc ataaaaataa cagagttttc ttggaacgct gagttatata 21180
gtttaatggg gaagtttgcg ttctggacaa tcttttgcac caacgtaaac gcctcttcaa 21240
gtgaaggatt tttgattggc ataaattggt tgaataagac ccgtaccgaa attgacggta 21300
aaaccatgca tgccaattat ctgttttgga gaaatagtac aatgtggaat ggaggggctt 21360
acagtctctt tgacatgagt aagttccctt tgaaagcggc tggtacggct gttgttagcc 21420
ttaaaccaga ccaaataaat gacttagtcc tctccttgat tgagaagggc aagttattag 21480
tgcgtgatac acgcaaagaa gtttttgttg gcgatagcct agtaaatgtc aaataa 21536
<210> 31
<211> 4470
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic_Replicative_Polyprotein_1a
<400> 31
Met Ala Lys Met Gly Lys Tyr Gly Leu Gly Phe Lys Trp Ala Pro Glu
1 5 10 15
Phe Pro Trp Met Leu Pro Asn Ala Ser Glu Lys Leu Gly Asn Pro Glu
20 25 30
Arg Ser Glu Glu Asp Gly Phe Cys Pro Ser Ala Ala Gln Glu Pro Lys
35 40 45
Val Lys Gly Lys Thr Leu Val Asn His Val Arg Val Asn Cys Ser Arg
50 55 60
Leu Pro Ala Leu Glu Cys Cys Val Gln Ser Ala Ile Ile Arg Asp Ile
65 70 75 80
Phe Val Asp Glu Asp Pro Gln Lys Val Glu Ala Ser Thr Met Met Ala
85 90 95
Leu Gln Phe Gly Ser Ala Val Leu Val Lys Pro Ser Lys Arg Leu Ser
100 105 110
Ile Gln Ala Trp Thr Asn Leu Gly Val Leu Pro Lys Thr Ala Ala Met
115 120 125
Gly Leu Phe Lys Arg Val Cys Leu Cys Asn Thr Arg Glu Cys Ser Cys
130 135 140
Asp Ala His Val Ala Phe His Leu Phe Thr Val Gln Pro Asp Gly Val
145 150 155 160
Cys Leu Gly Asn Gly Arg Phe Ile Gly Trp Phe Val Pro Val Thr Ala
165 170 175
Ile Pro Glu Tyr Ala Lys Gln Trp Leu Gln Pro Trp Ser Ile Leu Leu
180 185 190
Arg Lys Gly Gly Asn Lys Gly Ser Val Thr Ser Gly His Phe Arg Arg
195 200 205
Ala Val Thr Met Pro Val Tyr Asp Phe Asn Val Glu Asp Ala Cys Glu
210 215 220
Glu Val His Leu Asn Pro Lys Gly Lys Tyr Ser Cys Lys Ala Tyr Ala
225 230 235 240
Leu Leu Lys Gly Tyr Arg Gly Val Lys Pro Ile Leu Phe Val Asp Gln
245 250 255
Tyr Gly Cys Asp Tyr Thr Gly Cys Leu Ala Lys Gly Leu Glu Asp Tyr
260 265 270
Gly Asp Leu Thr Leu Ser Glu Met Lys Glu Leu Phe Pro Val Trp Arg
275 280 285
Asp Ser Leu Asp Ser Glu Val Leu Val Ala Trp His Val Asp Arg Asp
290 295 300
Pro Arg Ala Ala Met Arg Leu Gln Thr Leu Ala Thr Val Arg Cys Ile
305 310 315 320
Asp Tyr Val Gly Gln Pro Thr Glu Asp Val Val Asp Gly Asp Val Val
325 330 335
Val Arg Glu Pro Ala His Leu Leu Ala Ala Asn Ala Ile Val Lys Arg
340 345 350
Leu Pro Arg Leu Val Glu Thr Met Leu Tyr Thr Asp Ser Ser Val Thr
355 360 365
Glu Phe Cys Tyr Lys Thr Lys Leu Cys Glu Cys Gly Phe Ile Thr Gln
370 375 380
Phe Gly Tyr Val Asp Cys Cys Gly Asp Thr Cys Asp Phe Arg Gly Trp
385 390 395 400
Val Ala Gly Asn Met Met Asp Gly Phe Pro Cys Pro Gly Cys Thr Lys
405 410 415
Asn Tyr Met Pro Trp Glu Leu Glu Ala Gln Ser Ser Gly Val Ile Pro
420 425 430
Glu Gly Gly Val Leu Phe Thr Gln Ser Thr Asp Thr Val Asn Arg Glu
435 440 445
Ser Phe Lys Leu Tyr Gly His Ala Val Val Pro Phe Gly Ser Ala Val
450 455 460
Tyr Trp Ser Pro Cys Pro Gly Met Trp Leu Pro Val Ile Trp Ser Ser
465 470 475 480
Val Lys Ser Tyr Ser Gly Leu Thr Tyr Thr Gly Val Val Gly Cys Lys
485 490 495
Ala Ile Val Gln Glu Thr Asp Ala Ile Cys Arg Ser Leu Tyr Met Asp
500 505 510
Tyr Val Gln His Lys Cys Gly Asn Leu Glu Gln Arg Ala Ile Leu Gly
515 520 525
Leu Asp Asp Val Tyr His Arg Gln Leu Leu Val Asn Arg Gly Asp Tyr
530 535 540
Ser Leu Leu Leu Glu Asn Val Asp Leu Phe Val Lys Arg Arg Ala Glu
545 550 555 560
Phe Ala Cys Lys Phe Ala Thr Cys Gly Asp Gly Leu Val Pro Leu Leu
565 570 575
Leu Asp Gly Leu Val Pro Arg Ser Tyr Tyr Leu Ile Lys Ser Gly Gln
580 585 590
Ala Phe Thr Ser Met Met Val Asn Phe Ser His Glu Val Thr Asp Met
595 600 605
Cys Met Asp Met Ala Leu Leu Phe Met His Asp Val Lys Val Ala Thr
610 615 620
Lys Tyr Val Lys Lys Val Thr Gly Lys Leu Ala Val Arg Phe Lys Ala
625 630 635 640
Leu Gly Val Ala Val Val Arg Lys Ile Thr Glu Trp Phe Asp Leu Ala
645 650 655
Val Asp Ile Ala Ala Ser Ala Ala Gly Trp Leu Cys Tyr Gln Leu Val
660 665 670
Asn Gly Leu Phe Ala Val Ala Asn Gly Val Ile Thr Phe Val Gln Glu
675 680 685
Val Pro Glu Leu Val Lys Asn Phe Val Asp Lys Phe Lys Ala Phe Phe
690 695 700
Lys Val Leu Ile Asp Ser Met Ser Val Ser Ile Leu Ser Gly Leu Thr
705 710 715 720
Val Val Lys Thr Ala Ser Asn Arg Val Cys Leu Ala Gly Ser Lys Val
725 730 735
Tyr Glu Val Val Gln Lys Ser Leu Ser Ala Tyr Val Met Pro Val Gly
740 745 750
Cys Ser Glu Ala Thr Cys Leu Val Gly Glu Ile Glu Pro Ala Val Phe
755 760 765
Glu Asp Asp Val Val Asp Val Val Lys Ala Pro Leu Thr Tyr Gln Gly
770 775 780
Cys Cys Lys Pro Pro Thr Ser Phe Glu Lys Ile Cys Ile Val Asp Lys
785 790 795 800
Leu Tyr Met Ala Lys Cys Gly Asp Gln Phe Tyr Pro Val Val Val Asp
805 810 815
Asn Asp Thr Val Gly Val Leu Asp Gln Cys Trp Arg Phe Pro Cys Ala
820 825 830
Gly Lys Lys Val Glu Phe Asn Asp Lys Pro Lys Val Arg Lys Ile Pro
835 840 845
Ser Thr Arg Lys Ile Lys Ile Thr Phe Ala Leu Asp Ala Thr Phe Asp
850 855 860
Ser Val Leu Ser Lys Ala Cys Ser Glu Phe Glu Val Asp Lys Asp Val
865 870 875 880
Thr Leu Asp Glu Leu Leu Asp Val Val Leu Asp Ala Val Glu Ser Thr
885 890 895
Leu Ser Pro Cys Lys Glu His Asp Val Ile Gly Thr Lys Val Cys Ala
900 905 910
Leu Leu Asp Arg Leu Ala Gly Asp Tyr Val Tyr Leu Phe Asp Glu Gly
915 920 925
Gly Asp Glu Val Ile Ala Pro Arg Met Tyr Cys Ser Phe Ser Ala Pro
930 935 940
Asp Asp Glu Asp Cys Val Ala Ala Asp Val Val Asp Ala Asp Glu Asn
945 950 955 960
Gln Asp Asp Asp Ala Glu Asp Ser Ala Val Leu Val Ala Asp Thr Gln
965 970 975
Glu Glu Asp Gly Val Ala Lys Gly Gln Val Glu Ala Asp Ser Glu Ile
980 985 990
Cys Val Ala His Thr Gly Ser Gln Glu Glu Leu Ala Glu Pro Asp Ala
995 1000 1005
Val Gly Ser Gln Thr Pro Ile Ala Ser Ala Glu Glu Thr Glu Val Gly
1010 1015 1020
Glu Ala Ser Asp Arg Glu Gly Ile Ala Glu Ala Lys Ala Thr Val Cys
1025 1030 1035 1040
Ala Asp Ala Val Asp Ala Cys Pro Asp Gln Val Glu Ala Phe Glu Ile
1045 1050 1055
Glu Lys Val Glu Asp Ser Ile Leu Asp Glu Leu Gln Thr Glu Leu Asn
1060 1065 1070
Ala Pro Ala Asp Lys Thr Tyr Glu Asp Val Leu Ala Phe Asp Ala Val
1075 1080 1085
Cys Ser Glu Ala Leu Ser Ala Phe Tyr Ala Val Pro Ser Asp Glu Thr
1090 1095 1100
His Phe Lys Val Cys Gly Phe Tyr Ser Pro Ala Ile Glu Arg Thr Asn
1105 1110 1115 1120
Cys Trp Leu Arg Ser Thr Leu Ile Val Met Gln Ser Leu Pro Leu Glu
1125 1130 1135
Phe Lys Asp Leu Glu Met Gln Lys Leu Trp Leu Ser Tyr Lys Ala Gly
1140 1145 1150
Tyr Asp Gln Cys Phe Val Asp Lys Leu Val Lys Ser Val Pro Lys Ser
1155 1160 1165
Ile Ile Leu Pro Gln Gly Gly Tyr Val Ala Asp Phe Ala Tyr Phe Phe
1170 1175 1180
Leu Ser Gln Cys Ser Phe Lys Ala Tyr Ala Asn Trp Arg Cys Leu Glu
1185 1190 1195 1200
Cys Asp Met Glu Leu Lys Leu Gln Gly Leu Asp Ala Met Phe Phe Tyr
1205 1210 1215
Gly Asp Val Val Ser His Met Cys Lys Cys Gly Asn Ser Met Thr Leu
1220 1225 1230
Leu Ser Ala Asp Ile Pro Tyr Thr Leu His Phe Gly Val Arg Asp Asp
1235 1240 1245
Lys Phe Cys Ala Phe Tyr Thr Pro Arg Lys Val Phe Arg Ala Ala Cys
1250 1255 1260
Ala Val Asp Val Asn Asp Cys His Ser Met Ala Val Val Glu Gly Lys
1265 1270 1275 1280
Gln Ile Asp Gly Lys Val Val Thr Lys Phe Ile Gly Asp Lys Phe Asp
1285 1290 1295
Phe Met Val Gly Tyr Gly Met Thr Phe Ser Met Ser Pro Phe Glu Leu
1300 1305 1310
Ala Gln Leu Tyr Gly Ser Cys Ile Thr Pro Asn Val Cys Phe Val Lys
1315 1320 1325
Gly Asp Val Ile Lys Val Val Arg Leu Val Asn Ala Glu Val Ile Val
1330 1335 1340
Asn Pro Ala Asn Gly Arg Met Ala His Gly Ala Gly Val Ala Gly Ala
1345 1350 1355 1360
Ile Ala Glu Lys Ala Gly Ser Ala Phe Ile Lys Glu Thr Ser Asp Met
1365 1370 1375
Val Lys Ala Gln Gly Val Cys Gln Val Gly Glu Cys Tyr Glu Ser Ala
1380 1385 1390
Gly Gly Lys Leu Cys Lys Lys Val Leu Asn Ile Val Gly Pro Asp Ala
1395 1400 1405
Arg Gly His Gly Lys Gln Cys Tyr Ser Leu Leu Glu Arg Ala Tyr Gln
1410 1415 1420
His Ile Asn Lys Cys Asp Asn Val Val Thr Thr Leu Ile Ser Ala Gly
1425 1430 1435 1440
Ile Phe Ser Val Pro Thr Asp Val Ser Leu Thr Tyr Leu Leu Gly Val
1445 1450 1455
Val Thr Lys Asn Val Ile Leu Val Ser Asn Asn Gln Asp Asp Phe Asp
1460 1465 1470
Val Ile Glu Lys Cys Gln Val Thr Ser Val Ala Gly Thr Lys Ala Leu
1475 1480 1485
Ser Leu Gln Leu Ala Lys Asn Leu Cys Arg Asp Val Lys Phe Val Thr
1490 1495 1500
Asn Ala Cys Ser Ser Leu Phe Ser Glu Ser Cys Phe Val Ser Ser Tyr
1505 1510 1515 1520
Asp Val Leu Gln Glu Val Glu Ala Leu Arg His Asp Ile Gln Leu Asp
1525 1530 1535
Asp Asp Ala Arg Val Phe Val Gln Ala Asn Met Asp Cys Leu Pro Thr
1540 1545 1550
Asp Trp Arg Leu Val Asn Lys Phe Asp Ser Val Asp Gly Val Arg Thr
1555 1560 1565
Ile Lys Tyr Phe Glu Cys Pro Gly Gly Ile Phe Val Ser Ser Gln Gly
1570 1575 1580
Lys Lys Phe Gly Tyr Val Gln Asn Gly Ser Phe Lys Glu Ala Ser Val
1585 1590 1595 1600
Ser Gln Ile Arg Ala Leu Leu Ala Asn Lys Val Asp Val Leu Cys Thr
1605 1610 1615
Val Asp Gly Val Asn Phe Arg Ser Cys Cys Val Ala Glu Gly Glu Val
1620 1625 1630
Phe Gly Lys Thr Leu Gly Ser Val Phe Cys Asp Gly Ile Asn Val Thr
1635 1640 1645
Lys Val Arg Cys Ser Ala Ile Tyr Lys Gly Lys Val Phe Phe Gln Tyr
1650 1655 1660
Ser Asp Leu Ser Glu Ala Asp Leu Val Ala Val Lys Asp Ala Phe Gly
1665 1670 1675 1680
Phe Asp Glu Pro Gln Leu Leu Lys Tyr Tyr Thr Met Leu Gly Met Cys
1685 1690 1695
Lys Trp Pro Val Val Val Cys Gly Asn Tyr Phe Ala Phe Lys Gln Ser
1700 1705 1710
Asn Asn Asn Cys Tyr Ile Asn Val Ala Cys Leu Met Leu Gln His Leu
1715 1720 1725
Ser Leu Lys Phe Pro Lys Trp Gln Trp Gln Glu Ala Trp Asn Glu Phe
1730 1735 1740
Arg Ser Gly Lys Pro Leu Arg Phe Val Ser Leu Val Leu Ala Lys Gly
1745 1750 1755 1760
Ser Phe Lys Phe Asn Glu Pro Ser Asp Ser Ile Asp Phe Met Arg Val
1765 1770 1775
Val Leu Arg Glu Ala Asp Leu Ser Gly Ala Thr Cys Asn Leu Glu Phe
1780 1785 1790
Val Cys Lys Cys Gly Val Lys Gln Glu Gln Arg Lys Gly Val Asp Ala
1795 1800 1805
Val Met His Phe Gly Thr Leu Asp Lys Gly Asp Leu Val Arg Gly Tyr
1810 1815 1820
Asn Ile Ala Cys Thr Cys Gly Ser Lys Leu Val His Cys Thr Gln Phe
1825 1830 1835 1840
Asn Val Pro Phe Leu Ile Cys Ser Asn Thr Pro Glu Gly Arg Lys Leu
1845 1850 1855
Pro Asp Asp Val Val Ala Ala Asn Ile Phe Thr Gly Gly Ser Val Gly
1860 1865 1870
His Tyr Thr His Val Lys Cys Lys Pro Lys Tyr Gln Leu Tyr Asp Ala
1875 1880 1885
Cys Asn Val Asn Lys Val Ser Glu Ala Lys Gly Asn Phe Thr Asp Cys
1890 1895 1900
Leu Tyr Leu Lys Asn Leu Lys Gln Thr Phe Ser Ser Val Leu Thr Thr
1905 1910 1915 1920
Phe Tyr Leu Asp Asp Val Lys Cys Val Glu Tyr Lys Pro Asp Leu Ser
1925 1930 1935
Gln Tyr Tyr Cys Glu Ser Gly Lys Tyr Tyr Thr Lys Pro Ile Ile Lys
1940 1945 1950
Ala Gln Phe Arg Thr Phe Glu Lys Val Asp Gly Val Tyr Thr Asn Phe
1955 1960 1965
Lys Leu Val Gly His Ser Ile Ala Glu Lys Leu Asn Ala Lys Leu Gly
1970 1975 1980
Phe Asp Cys Asn Ser Pro Phe Val Glu Tyr Lys Ile Thr Glu Trp Pro
1985 1990 1995 2000
Thr Ala Thr Gly Asp Val Val Leu Ala Ser Asp Asp Leu Tyr Val Ser
2005 2010 2015
Arg Tyr Leu Ser Gly Cys Ile Thr Phe Gly Lys Pro Val Val Trp Leu
2020 2025 2030
Gly His Glu Glu Ala Ser Leu Lys Ser Leu Thr Tyr Phe Asn Arg Pro
2035 2040 2045
Ser Val Val Cys Glu Asn Lys Phe Asn Val Leu Pro Val Asp Val Ser
2050 2055 2060
Glu Pro Thr Asp Lys Gly Pro Val Pro Ala Ala Val Leu Val Thr Gly
2065 2070 2075 2080
Val Pro Gly Ala Asp Ala Ser Ala Gly Ala Gly Ile Ala Lys Glu Gln
2085 2090 2095
Lys Ala Cys Ala Ser Ala Ser Val Glu Asp Gln Val Val Thr Glu Val
2100 2105 2110
Arg Gln Glu Pro Ser Val Ser Ala Ala Asp Val Lys Glu Val Lys Leu
2115 2120 2125
Asn Gly Val Lys Lys Pro Val Lys Val Glu Gly Ser Val Val Val Asn
2130 2135 2140
Asp Pro Thr Ser Glu Thr Lys Val Val Lys Ser Leu Ser Ile Val Asp
2145 2150 2155 2160
Val Tyr Asp Met Phe Leu Thr Gly Cys Lys Tyr Val Val Trp Thr Ala
2165 2170 2175
Asn Glu Leu Ser Arg Leu Val Asn Ser Pro Thr Val Arg Glu Tyr Val
2180 2185 2190
Lys Trp Gly Met Gly Lys Ile Val Thr Pro Ala Lys Leu Leu Leu Leu
2195 2200 2205
Arg Asp Glu Lys Gln Glu Phe Val Ala Pro Lys Val Val Lys Ala Lys
2210 2215 2220
Ala Ile Ala Cys Tyr Cys Ala Val Lys Trp Phe Leu Leu Tyr Cys Phe
2225 2230 2235 2240
Ser Trp Ile Lys Phe Asn Thr Asp Asn Lys Val Ile Tyr Thr Thr Glu
2245 2250 2255
Val Ala Ser Lys Leu Thr Phe Lys Leu Cys Cys Leu Ala Phe Lys Asn
2260 2265 2270
Ala Leu Gln Thr Phe Asn Trp Ser Val Val Ser Arg Gly Phe Phe Leu
2275 2280 2285
Val Ala Thr Val Phe Leu Leu Trp Phe Asn Phe Leu Tyr Ala Asn Val
2290 2295 2300
Ile Leu Ser Asp Phe Tyr Leu Pro Asn Ile Gly Pro Leu Pro Thr Phe
2305 2310 2315 2320
Val Gly Gln Ile Val Ala Trp Phe Lys Thr Thr Phe Gly Val Ser Thr
2325 2330 2335
Ile Cys Asp Phe Tyr Gln Val Thr Asp Leu Gly Tyr Arg Ser Ser Phe
2340 2345 2350
Cys Asn Gly Ser Met Val Cys Glu Leu Cys Phe Ser Gly Phe Asp Met
2355 2360 2365
Leu Asp Asn Tyr Asp Ala Ile Asn Val Val Gln His Val Val Asp Arg
2370 2375 2380
Arg Leu Ser Phe Asp Tyr Ile Ser Leu Phe Lys Leu Val Val Glu Leu
2385 2390 2395 2400
Val Ile Gly Tyr Ser Leu Tyr Thr Val Cys Phe Tyr Pro Leu Phe Val
2405 2410 2415
Leu Ile Gly Met Gln Leu Leu Thr Thr Trp Leu Pro Glu Phe Phe Met
2420 2425 2430
Leu Glu Thr Met His Trp Ser Ala Arg Leu Phe Val Phe Val Ala Asn
2435 2440 2445
Met Leu Pro Ala Phe Thr Leu Leu Arg Phe Tyr Ile Val Val Thr Ala
2450 2455 2460
Met Tyr Lys Val Tyr Cys Leu Cys Arg His Val Met Tyr Gly Cys Ser
2465 2470 2475 2480
Lys Pro Gly Cys Leu Phe Cys Tyr Lys Arg Asn Arg Ser Val Arg Val
2485 2490 2495
Lys Cys Ser Thr Val Val Gly Gly Ser Leu Arg Tyr Tyr Asp Val Met
2500 2505 2510
Ala Asn Gly Gly Thr Gly Phe Cys Thr Lys His Gln Trp Asn Cys Leu
2515 2520 2525
Asn Cys Asn Ser Trp Lys Pro Gly Asn Thr Phe Ile Thr His Glu Ala
2530 2535 2540
Ala Ala Asp Leu Ser Lys Glu Leu Lys Arg Pro Val Asn Pro Thr Asp
2545 2550 2555 2560
Ser Ala Tyr Tyr Ser Val Thr Glu Val Lys Gln Val Gly Cys Ser Met
2565 2570 2575
Arg Leu Phe Tyr Glu Arg Asp Gly Gln Arg Val Tyr Asp Asp Val Asn
2580 2585 2590
Ala Ser Leu Phe Val Asp Met Asn Gly Leu Leu His Ser Lys Val Lys
2595 2600 2605
Gly Val Pro Glu Thr His Val Val Val Val Glu Asn Glu Ala Asp Lys
2610 2615 2620
Ala Gly Phe Leu Gly Ala Ala Val Phe Tyr Ala Gln Ser Leu Tyr Arg
2625 2630 2635 2640
Pro Met Leu Met Val Glu Lys Lys Leu Ile Thr Thr Ala Asn Thr Gly
2645 2650 2655
Leu Ser Val Ser Arg Thr Met Phe Asp Leu Tyr Val Asp Ser Leu Leu
2660 2665 2670
Asn Val Leu Asp Val Asp Arg Lys Ser Leu Thr Ser Phe Val Asn Ala
2675 2680 2685
Ala His Asn Ser Leu Lys Glu Gly Val Gln Leu Glu Gln Val Met Asp
2690 2695 2700
Thr Phe Ile Gly Cys Ala Arg Arg Lys Cys Ala Ile Asp Ser Asp Val
2705 2710 2715 2720
Glu Thr Lys Ser Ile Thr Lys Ser Val Met Ser Ala Val Asn Ala Gly
2725 2730 2735
Val Asp Phe Thr Asp Glu Ser Cys Asn Asn Leu Val Pro Thr Tyr Val
2740 2745 2750
Lys Ser Asp Thr Ile Val Ala Ala Asp Leu Gly Val Leu Ile Gln Asn
2755 2760 2765
Asn Ala Lys His Val Gln Ala Asn Val Ala Lys Ala Ala Asn Val Ala
2770 2775 2780
Cys Ile Trp Ser Val Asp Ala Phe Asn Gln Leu Ser Ala Asp Leu Gln
2785 2790 2795 2800
His Arg Leu Arg Lys Ala Cys Ser Lys Thr Gly Leu Lys Ile Lys Leu
2805 2810 2815
Thr Tyr Asn Lys Gln Glu Ala Asn Val Pro Ile Leu Thr Thr Pro Phe
2820 2825 2830
Ser Leu Lys Gly Gly Ala Val Phe Ser Arg Met Leu Gln Trp Leu Phe
2835 2840 2845
Val Ala Asn Leu Ile Cys Phe Ile Val Leu Trp Ala Leu Met Pro Thr
2850 2855 2860
Tyr Ala Val His Lys Ser Asp Met Gln Leu Pro Leu Tyr Ala Ser Phe
2865 2870 2875 2880
Lys Val Ile Asp Asn Gly Val Leu Arg Asp Val Ser Val Thr Asp Ala
2885 2890 2895
Cys Phe Ala Asn Lys Phe Asn Gln Phe Asp Gln Trp Tyr Glu Ser Thr
2900 2905 2910
Phe Gly Leu Ala Tyr Tyr Arg Asn Ser Lys Ala Cys Pro Val Val Val
2915 2920 2925
Ala Val Ile Asp Gln Asp Ile Gly His Thr Leu Phe Asn Val Pro Thr
2930 2935 2940
Thr Val Leu Arg Tyr Gly Phe His Val Leu His Phe Ile Thr His Ala
2945 2950 2955 2960
Phe Ala Thr Asp Ser Val Gln Cys Tyr Thr Pro His Met Gln Ile Pro
2965 2970 2975
Tyr Asp Asn Phe Tyr Ala Ser Gly Cys Val Leu Ser Ser Leu Cys Thr
2980 2985 2990
Met Leu Ala His Ala Asp Gly Thr Pro His Pro Tyr Cys Tyr Thr Gly
2995 3000 3005
Gly Val Met His Asn Ala Ser Leu Tyr Ser Ser Leu Ala Pro His Val
3010 3015 3020
Arg Tyr Asn Leu Ala Ser Ser Asn Gly Tyr Ile Arg Phe Pro Glu Val
3025 3030 3035 3040
Val Ser Glu Gly Ile Val Arg Val Val Arg Thr Arg Ser Met Thr Tyr
3045 3050 3055
Cys Arg Val Gly Leu Cys Glu Glu Ala Glu Glu Gly Ile Cys Phe Asn
3060 3065 3070
Phe Asn Arg Ser Trp Val Leu Asn Asn Pro Tyr Tyr Arg Ala Met Pro
3075 3080 3085
Gly Thr Phe Cys Gly Arg Asn Ala Phe Asp Leu Ile His Gln Val Leu
3090 3095 3100
Gly Gly Leu Val Arg Pro Ile Asp Phe Phe Ala Leu Thr Ala Ser Ser
3105 3110 3115 3120
Val Ala Gly Ala Ile Leu Ala Ile Ile Val Val Leu Ala Phe Tyr Tyr
3125 3130 3135
Leu Ile Lys Leu Lys Arg Ala Phe Gly Asp Tyr Thr Ser Val Val Val
3140 3145 3150
Ile Asn Val Ile Val Trp Cys Ile Asn Phe Leu Met Leu Phe Val Phe
3155 3160 3165
Gln Val Tyr Pro Thr Leu Ser Cys Leu Tyr Ala Cys Phe Tyr Phe Tyr
3170 3175 3180
Thr Thr Leu Tyr Phe Pro Ser Glu Ile Ser Val Val Met His Leu Gln
3185 3190 3195 3200
Trp Leu Val Met Tyr Gly Ala Ile Met Pro Leu Trp Phe Cys Ile Ile
3205 3210 3215
Tyr Val Ala Val Val Val Ser Asn His Ala Leu Trp Leu Phe Ser Tyr
3220 3225 3230
Cys Arg Lys Ile Gly Thr Glu Val Arg Ser Asp Gly Thr Phe Glu Glu
3235 3240 3245
Met Ala Leu Thr Thr Phe Met Ile Thr Lys Glu Ser Tyr Cys Lys Leu
3250 3255 3260
Lys Asn Ser Val Ser Asp Val Ala Phe Asn Arg Tyr Leu Ser Leu Tyr
3265 3270 3275 3280
Asn Lys Tyr Arg Tyr Phe Ser Gly Lys Met Asp Thr Ala Ala Tyr Arg
3285 3290 3295
Glu Ala Ala Cys Ser Gln Leu Ala Lys Ala Met Glu Thr Phe Asn His
3300 3305 3310
Asn Asn Gly Asn Asp Val Leu Tyr Gln Pro Pro Thr Ala Ser Val Thr
3315 3320 3325
Thr Ser Phe Leu Gln Ser Gly Ile Val Lys Met Val Ser Pro Thr Ser
3330 3335 3340
Lys Val Glu Pro Cys Ile Val Ser Val Thr Tyr Gly Asn Met Thr Leu
3345 3350 3355 3360
Asn Gly Leu Trp Leu Asp Asp Lys Val Tyr Cys Pro Arg His Val Ile
3365 3370 3375
Cys Ser Ser Ala Asp Met Thr Asp Pro Asp Tyr Pro Asn Leu Leu Cys
3380 3385 3390
Arg Val Thr Ser Ser Asp Phe Cys Val Met Ser Gly Arg Met Ser Leu
3395 3400 3405
Thr Val Met Ser Tyr Gln Met Gln Gly Cys Gln Leu Val Leu Thr Val
3410 3415 3420
Thr Leu Gln Asn Pro Asn Thr Pro Lys Tyr Ser Phe Gly Val Val Lys
3425 3430 3435 3440
Pro Gly Glu Thr Phe Thr Val Leu Ala Ala Tyr Asn Gly Arg Pro Gln
3445 3450 3455
Gly Ala Phe His Val Thr Leu Arg Ser Ser His Thr Ile Lys Gly Ser
3460 3465 3470
Phe Leu Cys Gly Ser Cys Gly Ser Val Gly Tyr Val Leu Thr Gly Asp
3475 3480 3485
Ser Val Arg Phe Val Tyr Met His Gln Leu Glu Leu Ser Thr Gly Cys
3490 3495 3500
His Thr Gly Thr Asp Phe Ser Gly Asn Phe Tyr Gly Pro Tyr Arg Asp
3505 3510 3515 3520
Ala Gln Val Val Gln Leu Pro Val Gln Asp Tyr Thr Gln Thr Val Asn
3525 3530 3535
Val Val Ala Trp Leu Tyr Ala Ala Ile Phe Asn Arg Cys Asn Trp Phe
3540 3545 3550
Val Gln Ser Asp Ser Cys Ser Leu Glu Glu Phe Asn Val Trp Ala Met
3555 3560 3565
Thr Asn Gly Phe Ser Ser Ile Lys Ala Asp Leu Val Leu Asp Ala Leu
3570 3575 3580
Ala Ser Met Thr Gly Val Thr Val Glu Gln Val Leu Ala Ala Ile Lys
3585 3590 3595 3600
Arg Leu His Ser Gly Phe Gln Gly Lys Gln Ile Leu Gly Ser Cys Val
3605 3610 3615
Leu Glu Asp Glu Leu Thr Pro Ser Asp Val Tyr Gln Gln Leu Ala Gly
3620 3625 3630
Val Lys Leu Gln Ser Lys Arg Thr Arg Val Ile Lys Gly Thr Cys Cys
3635 3640 3645
Trp Ile Leu Ala Ser Thr Phe Leu Phe Cys Ser Ile Ile Ser Ala Phe
3650 3655 3660
Val Lys Trp Thr Met Phe Met Tyr Val Thr Thr His Met Leu Gly Val
3665 3670 3675 3680
Thr Leu Cys Ala Leu Cys Phe Val Ser Phe Ala Met Leu Leu Ile Lys
3685 3690 3695
His Lys His Leu Tyr Leu Thr Met Tyr Ile Met Pro Val Leu Cys Thr
3700 3705 3710
Leu Phe Tyr Thr Asn Tyr Leu Val Val Tyr Lys Gln Ser Phe Arg Gly
3715 3720 3725
Leu Ala Tyr Ala Trp Leu Ser His Phe Val Pro Ala Val Asp Tyr Thr
3730 3735 3740
Tyr Met Asp Glu Val Leu Tyr Gly Val Val Leu Leu Val Ala Met Val
3745 3750 3755 3760
Phe Val Thr Met Arg Ser Ile Asn His Asp Val Phe Ser Ile Met Phe
3765 3770 3775
Leu Val Gly Arg Leu Val Ser Leu Val Ser Met Trp Tyr Phe Gly Ala
3780 3785 3790
Asn Leu Glu Glu Glu Val Leu Leu Phe Leu Thr Ser Leu Phe Gly Thr
3795 3800 3805
Tyr Thr Trp Thr Thr Met Leu Ser Leu Ala Thr Ala Lys Val Ile Ala
3810 3815 3820
Lys Trp Leu Ala Val Asn Val Leu Tyr Phe Thr Asp Val Pro Gln Ile
3825 3830 3835 3840
Lys Leu Val Leu Leu Ser Tyr Leu Cys Ile Gly Tyr Val Cys Cys Cys
3845 3850 3855
Tyr Trp Gly Ile Leu Ser Leu Leu Asn Ser Ile Phe Arg Met Pro Leu
3860 3865 3870
Gly Val Tyr Asn Tyr Lys Ile Ser Val Gln Glu Leu Arg Tyr Met Asn
3875 3880 3885
Ala Asn Gly Leu Arg Pro Pro Arg Asn Ser Phe Glu Ala Leu Met Leu
3890 3895 3900
Asn Phe Lys Leu Leu Gly Ile Gly Gly Val Pro Val Ile Glu Val Ser
3905 3910 3915 3920
Gln Ile Gln Ser Arg Leu Thr Asp Val Lys Cys Ala Asn Val Val Leu
3925 3930 3935
Leu Asn Cys Leu Gln His Leu His Ile Ala Ser Asn Ser Lys Leu Trp
3940 3945 3950
Gln Tyr Cys Ser Thr Leu His Asn Glu Ile Leu Ala Thr Ser Asp Leu
3955 3960 3965
Ser Val Ala Phe Asp Lys Leu Ala Gln Leu Leu Val Val Leu Phe Ala
3970 3975 3980
Asn Pro Ala Ala Val Asp Ser Lys Cys Leu Ala Ser Ile Glu Glu Val
3985 3990 3995 4000
Ser Asp Asp Tyr Val Arg Asp Asn Thr Val Leu Gln Ala Leu Gln Ser
4005 4010 4015
Glu Phe Val Asn Met Ala Ser Phe Val Glu Tyr Glu Leu Ala Lys Lys
4020 4025 4030
Asn Leu Asp Glu Ala Lys Ala Ser Gly Ser Ala Asn Gln Gln Gln Ile
4035 4040 4045
Lys Gln Leu Glu Lys Ala Cys Asn Ile Ala Lys Ser Ala Tyr Glu Arg
4050 4055 4060
Asp Arg Ala Val Ala Arg Lys Leu Glu Arg Met Ala Asp Leu Ala Leu
4065 4070 4075 4080
Thr Asn Met Tyr Lys Glu Ala Arg Ile Asn Asp Lys Lys Ser Lys Val
4085 4090 4095
Val Ser Ala Leu Gln Thr Met Leu Phe Ser Met Val Arg Lys Leu Asp
4100 4105 4110
Asn Gln Ala Leu Asn Ser Ile Leu Asp Asn Ala Val Lys Gly Cys Val
4115 4120 4125
Pro Leu Asn Ala Ile Pro Ser Leu Thr Ser Asn Thr Leu Thr Ile Ile
4130 4135 4140
Val Pro Asp Lys Gln Val Phe Asp Gln Val Val Asp Asn Val Tyr Val
4145 4150 4155 4160
Thr Tyr Ala Gly Asn Val Trp His Ile Gln Phe Ile Gln Asp Ala Asp
4165 4170 4175
Gly Ala Val Lys Gln Leu Asn Glu Ile Asp Val Asn Ser Thr Trp Pro
4180 4185 4190
Leu Val Ile Ala Ala Asn Arg His Asn Glu Val Ser Thr Val Val Leu
4195 4200 4205
Gln Asn Asn Glu Leu Met Pro Gln Lys Leu Arg Thr Gln Val Val Asn
4210 4215 4220
Ser Gly Ser Asp Met Asn Cys Asn Thr Pro Thr Gln Cys Tyr Tyr Asn
4225 4230 4235 4240
Thr Thr Gly Thr Gly Lys Ile Val Tyr Ala Ile Leu Ser Asp Cys Asp
4245 4250 4255
Gly Leu Lys Tyr Thr Lys Ile Val Lys Glu Asp Gly Asn Cys Val Val
4260 4265 4270
Leu Glu Leu Asp Pro Pro Cys Lys Phe Ser Val Gln Asp Val Lys Gly
4275 4280 4285
Leu Lys Ile Lys Tyr Leu Tyr Phe Val Lys Gly Cys Asn Thr Leu Ala
4290 4295 4300
Arg Gly Trp Val Val Gly Thr Leu Ser Ser Thr Val Arg Leu Gln Ala
4305 4310 4315 4320
Gly Thr Ala Thr Glu Tyr Ala Ser Asn Ser Ala Ile Leu Ser Leu Cys
4325 4330 4335
Ala Phe Ser Val Asp Pro Lys Lys Thr Tyr Leu Asp Tyr Ile Lys Gln
4340 4345 4350
Gly Gly Val Pro Val Thr Asn Cys Val Lys Met Leu Cys Asp His Ala
4355 4360 4365
Gly Thr Gly Met Ala Ile Thr Ile Lys Pro Glu Ala Thr Thr Asn Gln
4370 4375 4380
Asp Ser Tyr Gly Gly Ala Ser Val Cys Ile Tyr Cys Arg Ser Arg Val
4385 4390 4395 4400
Glu His Pro Asp Val Asp Gly Leu Cys Lys Leu Arg Gly Lys Phe Val
4405 4410 4415
Gln Val Pro Leu Gly Ile Lys Asp Pro Val Ser Tyr Val Leu Thr His
4420 4425 4430
Asp Val Cys Gln Val Cys Gly Phe Trp Arg Asp Gly Ser Cys Ser Cys
4435 4440 4445
Val Gly Thr Gly Ser Gln Phe Gln Ser Lys Asp Thr Asn Phe Leu Asn
4450 4455 4460
Gly Phe Gly Val Gln Val
4465 4470
<210> 32
<211> 2714
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic_Replicative_Polyprotein1ab
<400> 32
Arg Ile Arg Gly Thr Ser Val Asn Ala Arg Leu Val Pro Cys Ala Ser
1 5 10 15
Gly Leu Asp Thr Asp Val Gln Leu Arg Ala Phe Asp Ile Cys Asn Ala
20 25 30
Asn Arg Ala Gly Ile Gly Leu Tyr Tyr Lys Val Asn Cys Cys Arg Phe
35 40 45
Gln Arg Val Asp Glu Asp Gly Asn Lys Leu Asp Lys Phe Phe Val Val
50 55 60
Lys Arg Thr Asn Leu Glu Val Tyr Asn Lys Glu Lys Glu Cys Tyr Glu
65 70 75 80
Leu Thr Lys Glu Cys Gly Val Val Ala Glu His Glu Phe Phe Thr Phe
85 90 95
Asp Val Glu Gly Ser Arg Val Pro His Ile Val Arg Lys Asp Leu Ser
100 105 110
Lys Phe Thr Met Leu Asp Leu Cys Tyr Ala Leu Arg His Phe Asp Arg
115 120 125
Asn Asp Cys Ser Thr Leu Lys Glu Ile Leu Leu Thr Tyr Ala Glu Cys
130 135 140
Glu Glu Ser Tyr Phe Gln Lys Lys Asp Trp Tyr Asp Phe Val Glu Asn
145 150 155 160
Pro Asp Ile Ile Asn Val Tyr Lys Lys Leu Gly Pro Ile Phe Asn Arg
165 170 175
Ala Leu Leu Asn Thr Ala Lys Phe Ala Asp Ala Leu Val Glu Ala Gly
180 185 190
Leu Val Gly Val Leu Thr Leu Asp Asn Gln Asp Leu Tyr Gly Gln Trp
195 200 205
Tyr Asp Phe Gly Asp Phe Val Lys Thr Val Pro Gly Cys Gly Val Ala
210 215 220
Val Ala Asp Ser Tyr Tyr Ser Tyr Met Met Pro Met Leu Thr Met Cys
225 230 235 240
His Ala Leu Asp Ser Glu Leu Phe Val Asn Gly Thr Tyr Arg Glu Phe
245 250 255
Asp Leu Val Gln Tyr Asp Phe Thr Asp Phe Lys Leu Glu Leu Phe Thr
260 265 270
Lys Tyr Phe Lys His Trp Ser Met Thr Tyr His Pro Asn Thr Cys Glu
275 280 285
Cys Glu Asp Asp Arg Cys Ile Ile His Cys Ala Asn Phe Asn Ile Leu
290 295 300
Phe Ser Met Val Leu Pro Lys Thr Cys Phe Gly Pro Leu Val Arg Gln
305 310 315 320
Ile Phe Val Asp Gly Val Pro Phe Val Val Ser Ile Gly Tyr His Tyr
325 330 335
Lys Glu Leu Gly Val Val Met Asn Met Asp Val Asp Thr His Arg Tyr
340 345 350
Arg Leu Ser Leu Lys Asp Leu Leu Leu Tyr Ala Ala Asp Pro Ala Leu
355 360 365
His Val Ala Ser Ala Ser Ala Leu Leu Asp Leu Arg Thr Cys Cys Phe
370 375 380
Ser Val Ala Ala Ile Thr Ser Gly Val Lys Phe Gln Thr Val Lys Pro
385 390 395 400
Gly Asn Phe Asn Gln Asp Phe Tyr Glu Phe Ile Leu Ser Lys Gly Leu
405 410 415
Leu Lys Glu Gly Ser Ser Val Asp Leu Lys His Phe Phe Phe Thr Gln
420 425 430
Asp Gly Asn Ala Ala Ile Thr Asp Tyr Asn Tyr Tyr Lys Tyr Asn Leu
435 440 445
Pro Thr Met Val Asp Ile Lys Gln Leu Leu Phe Val Leu Glu Val Val
450 455 460
Asn Lys Tyr Phe Glu Ile Tyr Glu Gly Gly Cys Ile Pro Ala Thr Gln
465 470 475 480
Val Ile Val Asn Asn Tyr Asp Lys Ser Ala Gly Tyr Pro Phe Asn Lys
485 490 495
Phe Gly Lys Ala Arg Leu Tyr Tyr Glu Ala Leu Ser Phe Glu Glu Gln
500 505 510
Asp Glu Ile Tyr Ala Tyr Thr Lys Arg Asn Val Leu Pro Thr Leu Thr
515 520 525
Gln Met Asn Leu Lys Tyr Ala Ile Ser Ala Lys Asn Arg Ala Arg Thr
530 535 540
Val Ala Gly Val Ser Ile Leu Ser Thr Met Thr Gly Arg Met Phe His
545 550 555 560
Gln Lys Cys Leu Lys Ser Ile Ala Ala Thr Arg Gly Val Pro Val Val
565 570 575
Ile Gly Thr Thr Lys Phe Tyr Gly Gly Trp Asp Asp Met Leu Arg Arg
580 585 590
Leu Ile Lys Asp Val Asp Ser Pro Val Leu Met Gly Trp Asp Tyr Pro
595 600 605
Lys Cys Asp Arg Ala Met Pro Asn Ile Leu Arg Ile Val Ser Ser Leu
610 615 620
Val Leu Ala Arg Lys His Asp Ser Cys Cys Ser His Thr Asp Arg Phe
625 630 635 640
Tyr Arg Leu Ala Asn Glu Cys Ala Gln Val Leu Ser Glu Ile Val Met
645 650 655
Cys Gly Gly Cys Tyr Tyr Val Lys Pro Gly Gly Thr Ser Ser Gly Asp
660 665 670
Ala Thr Thr Ala Phe Ala Asn Ser Val Phe Asn Ile Cys Gln Ala Val
675 680 685
Ser Ala Asn Val Cys Ser Leu Met Ala Cys Asn Gly His Lys Ile Glu
690 695 700
Asp Leu Ser Ile Arg Glu Leu Gln Lys Arg Leu Tyr Ser Asn Val Tyr
705 710 715 720
Arg Ala Asp His Val Asp Pro Ala Phe Val Ser Glu Tyr Tyr Glu Phe
725 730 735
Leu Asn Lys His Phe Ser Met Met Ile Leu Ser Asp Asp Gly Val Val
740 745 750
Cys Tyr Asn Ser Glu Phe Ala Ser Lys Gly Tyr Ile Ala Asn Ile Ser
755 760 765
Ala Phe Gln Gln Val Leu Tyr Tyr Gln Asn Asn Val Phe Met Ser Glu
770 775 780
Ala Lys Cys Trp Val Glu Thr Asp Ile Glu Lys Gly Pro His Glu Phe
785 790 795 800
Cys Ser Gln His Thr Met Leu Val Lys Met Asp Gly Asp Glu Val Tyr
805 810 815
Leu Pro Tyr Pro Asp Pro Ser Arg Ile Leu Gly Ala Gly Cys Phe Val
820 825 830
Asp Asp Leu Leu Lys Thr Asp Ser Val Leu Leu Ile Glu Arg Phe Val
835 840 845
Ser Leu Ala Ile Asp Ala Tyr Pro Leu Val Tyr His Glu Asn Pro Glu
850 855 860
Tyr Gln Asn Val Phe Arg Val Tyr Leu Glu Tyr Ile Lys Lys Leu Tyr
865 870 875 880
Asn Asp Leu Gly Asn Gln Ile Leu Asp Ser Tyr Ser Val Ile Leu Ser
885 890 895
Thr Cys Asp Gly Gln Lys Phe Thr Asp Glu Thr Phe Tyr Lys Asn Met
900 905 910
Tyr Leu Arg Ser Ala Val Leu Gln Ser Val Gly Ala Cys Val Val Cys
915 920 925
Ser Ser Gln Thr Ser Leu Arg Cys Gly Ser Cys Ile Arg Lys Pro Leu
930 935 940
Leu Cys Cys Lys Cys Ala Tyr Asp His Val Met Ser Thr Asp His Lys
945 950 955 960
Tyr Val Leu Ser Val Ser Pro Tyr Val Cys Asn Ser Pro Gly Cys Asp
965 970 975
Val Asn Asp Val Thr Lys Leu Tyr Leu Gly Gly Met Ser Tyr Tyr Cys
980 985 990
Glu Asp His Lys Pro Gln Tyr Ser Phe Lys Leu Val Met Asn Gly Met
995 1000 1005
Val Phe Gly Leu Tyr Lys Gln Ser Cys Thr Gly Ser Pro Tyr Ile Glu
1010 1015 1020
Asp Phe Asn Lys Ile Ala Ser Cys Lys Trp Thr Glu Val Asp Asp Tyr
1025 1030 1035 1040
Val Leu Ala Asn Glu Cys Thr Glu Arg Leu Lys Leu Phe Ala Ala Glu
1045 1050 1055
Thr Gln Lys Ala Thr Glu Glu Ala Phe Lys Gln Cys Tyr Ala Ser Ala
1060 1065 1070
Thr Ile Arg Glu Ile Val Ser Asp Arg Glu Leu Ile Leu Ser Trp Glu
1075 1080 1085
Ile Gly Lys Val Arg Pro Pro Leu Asn Lys Asn Tyr Val Phe Thr Gly
1090 1095 1100
Tyr His Phe Thr Asn Asn Gly Lys Thr Val Leu Gly Glu Tyr Val Phe
1105 1110 1115 1120
Asp Lys Ser Glu Leu Thr Asn Gly Val Tyr Tyr Arg Ala Thr Thr Thr
1125 1130 1135
Tyr Lys Leu Ser Val Gly Asp Val Phe Ile Leu Thr Ser His Ala Val
1140 1145 1150
Ser Ser Leu Ser Ala Pro Thr Leu Val Pro Gln Glu Asn Tyr Thr Ser
1155 1160 1165
Ile Arg Phe Ala Ser Val Tyr Ser Val Pro Glu Thr Phe Gln Asn Asn
1170 1175 1180
Val Pro Asn Tyr Gln His Ile Gly Met Lys Arg Tyr Cys Thr Val Gln
1185 1190 1195 1200
Gly Pro Pro Gly Thr Gly Lys Ser His Leu Ala Ile Gly Leu Ala Val
1205 1210 1215
Tyr Tyr Cys Thr Ala Arg Val Val Tyr Thr Ala Ala Ser His Ala Ala
1220 1225 1230
Val Asp Ala Leu Cys Glu Lys Ala His Lys Phe Leu Asn Ile Asn Asp
1235 1240 1245
Cys Thr Arg Ile Val Pro Ala Lys Val Arg Val Asp Cys Tyr Asp Lys
1250 1255 1260
Phe Lys Val Asn Asp Thr Thr Arg Lys Tyr Val Phe Thr Thr Ile Asn
1265 1270 1275 1280
Ala Leu Pro Glu Leu Val Thr Asp Ile Ile Val Val Asp Glu Val Ser
1285 1290 1295
Met Leu Thr Asn Tyr Glu Leu Ser Val Ile Asn Ser Arg Val Arg Ala
1300 1305 1310
Lys His Tyr Val Tyr Ile Gly Asp Pro Ala Gln Leu Pro Ala Pro Arg
1315 1320 1325
Val Leu Leu Asn Lys Gly Thr Leu Glu Pro Arg Tyr Phe Asn Ser Val
1330 1335 1340
Thr Lys Leu Met Cys Cys Leu Gly Pro Asp Ile Phe Leu Gly Thr Cys
1345 1350 1355 1360
Tyr Arg Cys Pro Lys Glu Ile Val Asp Thr Val Ser Ala Leu Val Tyr
1365 1370 1375
Asn Asn Lys Leu Lys Ala Lys Asn Asp Asn Ser Ser Met Cys Phe Lys
1380 1385 1390
Val Tyr Tyr Lys Gly Gln Thr Thr His Glu Ser Ser Ser Ala Val Asn
1395 1400 1405
Met Gln Gln Ile His Leu Ile Ser Lys Phe Leu Lys Ala Asn Pro Ser
1410 1415 1420
Trp Ser Asn Ala Val Phe Ile Ser Pro Tyr Asn Ser Gln Asn Tyr Val
1425 1430 1435 1440
Ala Lys Arg Val Leu Gly Leu Gln Thr Gln Thr Val Asp Ser Ala Gln
1445 1450 1455
Gly Ser Glu Tyr Asp Phe Val Ile Tyr Ser Gln Thr Ala Glu Thr Ala
1460 1465 1470
His Ser Val Asn Val Asn Arg Phe Asn Val Ala Ile Thr Arg Ala Lys
1475 1480 1485
Lys Gly Ile Leu Cys Val Met Ser Ser Met Gln Leu Phe Glu Ser Leu
1490 1495 1500
Asn Phe Thr Thr Leu Thr Leu Asp Lys Ile Asn Asn Pro Arg Leu Gln
1505 1510 1515 1520
Cys Thr Thr Asn Leu Phe Lys Asp Cys Ser Arg Ser Tyr Val Gly Tyr
1525 1530 1535
His Pro Ala His Ala Pro Ser Phe Leu Ala Val Asp Asp Lys Tyr Lys
1540 1545 1550
Val Gly Gly Asp Leu Ala Val Cys Leu Asn Val Ala Asp Ser Ala Val
1555 1560 1565
Thr Tyr Ser Arg Leu Ile Ser Leu Met Gly Phe Lys Leu Asp Leu Thr
1570 1575 1580
Leu Asp Gly Tyr Cys Lys Leu Phe Ile Thr Arg Asp Glu Ala Ile Lys
1585 1590 1595 1600
Arg Val Arg Ala Trp Val Gly Phe Asp Ala Glu Gly Ala His Ala Ile
1605 1610 1615
Arg Asp Ser Ile Gly Thr Asn Phe Pro Leu Gln Leu Gly Phe Ser Thr
1620 1625 1630
Gly Ile Asp Phe Val Val Glu Ala Thr Gly Met Phe Ala Glu Arg Asp
1635 1640 1645
Gly Tyr Val Phe Lys Lys Ala Ala Ala Arg Ala Pro Pro Gly Glu Gln
1650 1655 1660
Phe Lys His Leu Ile Pro Leu Met Ser Arg Gly Gln Lys Trp Asp Val
1665 1670 1675 1680
Val Arg Ile Arg Ile Val Gln Met Leu Ser Asp His Leu Val Asp Leu
1685 1690 1695
Ala Asp Ser Val Val Leu Val Thr Trp Ala Ala Ser Phe Glu Leu Thr
1700 1705 1710
Cys Leu Arg Tyr Phe Ala Lys Val Gly Arg Glu Val Val Cys Ser Val
1715 1720 1725
Cys Thr Lys Arg Ala Thr Cys Phe Asn Ser Arg Thr Gly Tyr Tyr Gly
1730 1735 1740
Cys Trp Arg His Ser Tyr Ser Cys Asp Tyr Leu Tyr Asn Pro Leu Ile
1745 1750 1755 1760
Val Asp Ile Gln Gln Trp Gly Tyr Thr Gly Ser Leu Thr Ser Asn His
1765 1770 1775
Asp Pro Ile Cys Ser Val His Lys Gly Ala His Val Ala Ser Ser Asp
1780 1785 1790
Ala Ile Met Thr Arg Cys Leu Ala Val His Asp Cys Phe Cys Lys Ser
1795 1800 1805
Val Asn Trp Asn Leu Glu Tyr Pro Ile Ile Ser Asn Glu Val Ser Val
1810 1815 1820
Asn Thr Ser Cys Arg Leu Leu Gln Arg Val Met Phe Arg Ala Ala Met
1825 1830 1835 1840
Leu Cys Asn Arg Tyr Asp Val Cys Tyr Asp Ile Gly Asn Pro Lys Gly
1845 1850 1855
Leu Ala Cys Val Lys Gly Tyr Asp Phe Lys Phe Tyr Asp Ala Ser Pro
1860 1865 1870
Val Val Lys Ser Val Lys Gln Phe Val Tyr Lys Tyr Glu Ala His Lys
1875 1880 1885
Asp Gln Phe Leu Asp Gly Leu Cys Met Phe Trp Asn Cys Asn Val Asp
1890 1895 1900
Lys Tyr Pro Ala Asn Ala Val Val Cys Arg Phe Asp Thr Arg Val Leu
1905 1910 1915 1920
Asn Lys Leu Asn Leu Pro Gly Cys Asn Gly Gly Ser Leu Tyr Val Asn
1925 1930 1935
Lys His Ala Phe His Thr Ser Pro Phe Thr Arg Ala Ala Phe Glu Asn
1940 1945 1950
Leu Lys Pro Met Pro Phe Phe Tyr Tyr Ser Asp Thr Pro Cys Val Tyr
1955 1960 1965
Met Glu Gly Met Glu Ser Lys Gln Val Asp Tyr Val Pro Leu Arg Ser
1970 1975 1980
Ala Thr Cys Ile Thr Arg Cys Asn Leu Gly Gly Ala Val Cys Leu Lys
1985 1990 1995 2000
His Ala Glu Glu Tyr Arg Glu Tyr Leu Glu Ser Tyr Asn Thr Ala Thr
2005 2010 2015
Thr Ala Gly Phe Thr Phe Trp Val Tyr Lys Thr Phe Asp Phe Tyr Asn
2020 2025 2030
Leu Trp Asn Thr Phe Thr Arg Leu Gln Ser Leu Glu Asn Val Val Tyr
2035 2040 2045
Asn Leu Val Asn Ala Gly His Phe Asp Gly Arg Ala Gly Glu Leu Pro
2050 2055 2060
Cys Ala Val Ile Gly Glu Lys Val Ile Ala Lys Ile Gln Asn Glu Asp
2065 2070 2075 2080
Val Val Val Phe Lys Asn Asn Thr Pro Phe Pro Thr Asn Val Ala Val
2085 2090 2095
Glu Leu Phe Ala Lys Arg Ser Ile Arg Pro His Pro Glu Leu Lys Leu
2100 2105 2110
Phe Arg Asn Leu Asn Ile Asp Val Cys Trp Ser His Val Leu Trp Asp
2115 2120 2125
Tyr Ala Lys Asp Ser Val Phe Cys Ser Ser Thr Tyr Lys Val Cys Lys
2130 2135 2140
Tyr Thr Asp Leu Gln Cys Ile Glu Ser Leu Asn Val Leu Phe Asp Gly
2145 2150 2155 2160
Arg Asp Asn Gly Ala Leu Glu Ala Phe Lys Lys Cys Arg Asn Gly Val
2165 2170 2175
Tyr Ile Asn Thr Thr Lys Ile Lys Ser Leu Ser Met Ile Lys Gly Pro
2180 2185 2190
Gln Arg Ala Asp Leu Asn Gly Val Val Val Glu Lys Val Gly Asp Ser
2195 2200 2205
Asp Val Glu Phe Trp Phe Ala Val Arg Lys Asp Gly Asp Asp Val Ile
2210 2215 2220
Phe Ser Arg Thr Gly Ser Leu Glu Pro Ser His Tyr Arg Ser Pro Gln
2225 2230 2235 2240
Gly Asn Pro Gly Gly Asn Arg Val Gly Asp Leu Ser Gly Asn Glu Ala
2245 2250 2255
Leu Ala Arg Gly Thr Ile Phe Thr Gln Ser Arg Leu Leu Ser Ser Phe
2260 2265 2270
Thr Pro Arg Ser Glu Met Glu Lys Asp Phe Met Asp Leu Asp Asp Asp
2275 2280 2285
Val Phe Ile Ala Lys Tyr Ser Leu Gln Asp Tyr Ala Phe Glu His Val
2290 2295 2300
Val Tyr Gly Ser Phe Asn Gln Lys Ile Ile Gly Gly Leu His Leu Leu
2305 2310 2315 2320
Ile Gly Leu Ala Arg Arg Gln Gln Lys Ser Asn Leu Val Ile Gln Glu
2325 2330 2335
Phe Val Thr Tyr Asp Ser Ser Ile His Ser Tyr Phe Ile Thr Asp Glu
2340 2345 2350
Asn Ser Gly Ser Ser Lys Ser Val Cys Thr Val Ile Asp Leu Leu Leu
2355 2360 2365
Asp Asp Phe Val Asp Ile Val Lys Ser Leu Asn Leu Lys Cys Val Ser
2370 2375 2380
Lys Val Val Asn Val Asn Val Asp Phe Lys Asp Phe Gln Phe Met Leu
2385 2390 2395 2400
Trp Cys Asn Glu Glu Lys Val Met Thr Phe Tyr Pro Arg Leu Gln Ala
2405 2410 2415
Ala Ala Asp Trp Lys Pro Gly Tyr Val Met Pro Val Leu Tyr Lys Tyr
2420 2425 2430
Leu Glu Ser Pro Leu Glu Arg Val Asn Leu Trp Asn Tyr Gly Lys Pro
2435 2440 2445
Ile Thr Leu Pro Thr Gly Cys Met Met Asn Val Ala Lys Tyr Thr Gln
2450 2455 2460
Leu Cys Gln Tyr Leu Ser Thr Thr Thr Leu Ala Val Pro Ala Asn Met
2465 2470 2475 2480
Arg Val Leu His Leu Gly Ala Gly Ser Asp Lys Gly Val Ala Pro Gly
2485 2490 2495
Ser Ala Val Leu Arg Gln Trp Leu Pro Ala Gly Ser Ile Leu Val Asp
2500 2505 2510
Asn Asp Val Asn Pro Phe Val Ser Asp Ser Val Ala Ser Tyr Tyr Gly
2515 2520 2525
Asn Cys Ile Thr Leu Pro Phe Asp Cys Gln Trp Asp Leu Ile Ile Ser
2530 2535 2540
Asp Met Tyr Asp Pro Leu Thr Lys Asn Ile Gly Glu Tyr Asn Val Ser
2545 2550 2555 2560
Lys Asp Gly Phe Phe Thr Tyr Leu Cys His Leu Ile Arg Asp Lys Leu
2565 2570 2575
Ala Leu Gly Gly Ser Val Ala Ile Lys Ile Thr Glu Phe Ser Trp Asn
2580 2585 2590
Ala Glu Leu Tyr Ser Leu Met Gly Lys Phe Ala Phe Trp Thr Ile Phe
2595 2600 2605
Cys Thr Asn Val Asn Ala Ser Ser Ser Glu Gly Phe Leu Ile Gly Ile
2610 2615 2620
Asn Trp Leu Asn Lys Thr Arg Thr Glu Ile Asp Gly Lys Thr Met His
2625 2630 2635 2640
Ala Asn Tyr Leu Phe Trp Arg Asn Ser Thr Met Trp Asn Gly Gly Ala
2645 2650 2655
Tyr Ser Leu Phe Asp Met Ser Lys Phe Pro Leu Lys Ala Ala Gly Thr
2660 2665 2670
Ala Val Val Ser Leu Lys Pro Asp Gln Ile Asn Asp Leu Val Leu Ser
2675 2680 2685
Leu Ile Glu Lys Gly Lys Leu Leu Val Arg Asp Thr Arg Lys Glu Val
2690 2695 2700
Phe Val Gly Asp Ser Leu Val Asn Val Lys
2705 2710
<210> 33
<211> 29844
<212> DNA
<213> Artificial Sequence
<220>
<223> COVAX191_delta_N_RNA
<400> 33
gtataagagt gattggcgtc cgtacgtacc ctctcaactc taaaactctt gtagtttaaa 60
tctaatctaa actttataaa cggcacttcc tgcgtgtcca tgcccgcggg cctggtcttg 120
tcatagtgct gacatttgta gttccttgac tttcgttctc tgccagtgac gtgtccattc 180
ggcgccagca gcccacccat aggttgcata atggcaaaga tgggcaaata cggcctgggc 240
ttcaaatggg ccccagaatt tccatggatg cttccgaacg catcggagaa gttgggtaac 300
cctgagaggt cagaggagga tgggttttgc ccctctgctg cgcaagaacc gaaagttaaa 360
ggaaaaactt tggttaatca cgtgagggtg aattgtagcc ggcttccagc tttggaatgc 420
tgtgttcagt ctgccataat ccgtgatatt tttgtagatg aggatcccca gaaggtggag 480
gcctcaacta tgatggcatt gcagttcggt agtgccgtct tggttaagcc atccaagcgc 540
ttgtctattc aggcatggac taatttgggt gtgcttccca aaacagctgc catggggttg 600
ttcaagcgcg tctgcctgtg taacaccagg gagtgctctt gtgacgccca cgtggccttt 660
caccttttta cggtccaacc cgatggtgta tgcctgggta atggccgttt tataggctgg 720
ttcgttccag tcacagccat accggagtat gcgaagcagt ggttgcaacc ctggtccatc 780
cttcttcgta agggtggtaa caaagggtct gtgacatccg gccacttccg ccgcgctgtt 840
accatgcctg tgtatgactt taatgtagag gatgcttgtg aggaggttca tcttaacccg 900
aagggtaagt actcctgcaa ggcgtatgcc ctgctgaagg gctatcgcgg tgttaagccc 960
atcctgtttg tggaccagta tggttgcgac tatactggat gtctcgccaa gggtcttgag 1020
gactatggcg atctcacctt gagtgagatg aaggagttgt tccctgtgtg gcgtgactcc 1080
ttggatagtg aagtccttgt ggcttggcac gttgatcgag atcctcgggc tgctatgcgt 1140
ctgcagactc ttgctactgt acgttgcatt gattatgtgg gccaaccgac cgaggatgtg 1200
gtggatggag atgtggtagt gcgtgagcct gctcatcttc tcgcagccaa tgccattgtt 1260
aaaagactcc cccgtttggt ggagactatg ctgtatacgg attcgtccgt tacagaattc 1320
tgttataaaa ccaagctgtg tgaatgcggt tttatcacgc agtttggcta tgtggattgt 1380
tgtggtgaca cctgtgattt tcgtgggtgg gttgccggca atatgatgga tggctttcca 1440
tgtccagggt gtaccaaaaa ttatatgccc tgggaattgg aggcccagtc atcaggtgtt 1500
ataccagaag gaggtgttct attcactcag agcactgata cagtgaatcg tgagtccttt 1560
aagctctacg gtcatgctgt tgtgcctttt ggttctgctg tgtattggag cccttgccca 1620
ggtatgtggc ttccagtaat ttggtcgtcg gttaagtcat actctggttt gacttataca 1680
ggagtagttg gttgtaaggc aattgttcaa gagacagacg ctatatgtcg ttctctgtat 1740
atggattatg tccagcacaa gtgtggcaat ctcgagcaga gagctatcct tggattggac 1800
gatgtctatc atagacagtt gcttgtgaat aggggtgact atagtctcct ccttgagaat 1860
gtggatttgt ttgttaagcg gcgcgctgaa tttgcttgca aattcgccac ctgtggagat 1920
ggtcttgtac ccctcctact agatggttta gtgccccgca gttattattt gattaagagt 1980
ggtcaagctt tcacctctat gatggttaat tttagccatg aggtgactga catgtgtatg 2040
gacatggctt tattgttcat gcatgatgtt aaagtggcca ctaagtatgt taagaaggtt 2100
actggcaaac tggccgtgcg ctttaaagcg ttgggtgtag ccgttgtcag aaaaattact 2160
gaatggtttg atttagccgt ggacattgct gctagtgccg ctggatggct ttgctaccag 2220
ctggtaaatg gcttatttgc agtggccaat ggtgttataa cctttgtaca ggaggtgcct 2280
gagcttgtca agaattttgt tgacaagttc aaggcatttt tcaaggtttt gatcgactct 2340
atgtcggttt ctatcttgtc tggacttact gttgtcaaga ctgcctcaaa tagggtgtgt 2400
cttgctggca gtaaggttta tgaagttgtg cagaaatctt tgtctgcata tgttatgcct 2460
gtgggttgca gcgaagccac ttgtttggtg ggtgagattg aacctgcagt ttttgaagat 2520
gatgttgttg atgtggttaa agccccatta acatatcaag gctgttgtaa gccacccact 2580
tctttcgaga agatttgtat tgtggataaa ttgtatatgg ccaagtgtgg tgatcaattt 2640
taccctgtgg ttgttgataa cgacactgtt ggcgtgttag atcagtgctg gaggtttccc 2700
tgtgcgggca agaaagtcga gtttaacgac aagcccaaag tcaggaagat accctccacc 2760
cgtaagatta agatcacctt cgcactggat gcgacctttg atagtgttct ttcgaaggcg 2820
tgttcagagt ttgaagttga taaagatgtt acattggatg agctgcttga tgttgtgctt 2880
gacgcagttg agagtacgct cagcccttgt aaggagcatg atgtgatagg cacaaaagtt 2940
tgtgctttac ttgataggtt ggcaggagat tatgtctatc tttttgatga gggaggcgat 3000
gaagtgatcg ccccgaggat gtattgttcc ttttctgctc ctgatgacga ggactgcgtt 3060
gcagcggatg ttgtagatgc agatgaaaac caagatgatg atgccgagga ctcagcagtc 3120
cttgtcgctg atacccaaga agaggacggc gttgccaagg ggcaggttga ggcggattcg 3180
gaaatttgcg ttgcgcatac tggtagtcaa gaagaattgg ctgagcctga tgctgtcgga 3240
tctcaaactc ccatcgcctc tgctgaggaa accgaagtcg gagaggcaag cgacagggaa 3300
gggattgctg aggcgaaggc aactgtgtgt gctgatgctg tagatgcctg ccccgatcaa 3360
gtggaggcat ttgaaattga aaaggtcgag gactctatct tggatgagct tcaaactgaa 3420
cttaatgcgc cagcggacaa gacctatgag gatgtcttgg cattcgatgc cgtatgctca 3480
gaggcgttgt ctgcattcta tgctgtgccg agtgatgaga cgcactttaa agtgtgtgga 3540
ttctattcgc ctgctataga gcgcactaat tgttggctgc gttctacttt gatagtaatg 3600
cagagtctac ctttggaatt taaagacttg gagatgcaaa agctctggtt gtcttacaag 3660
gccggctatg accaatgctt tgtggacaaa ctagttaaga gcgtgcccaa gtctattatc 3720
cttccacaag gtggttatgt ggcagatttt gcctatttct ttctaagcca gtgtagcttt 3780
aaagcttatg ctaactggcg ttgtttagag tgtgacatgg agttaaagct tcaaggcttg 3840
gacgccatgt ttttctatgg ggacgttgtg tctcatatgt gcaagtgtgg taatagcatg 3900
accttgttgt ctgcagatat accctacact ttgcattttg gagtgcgaga tgataagttt 3960
tgcgcttttt acacgccaag aaaggtcttt agggctgctt gtgcggtaga tgttaatgat 4020
tgtcactcta tggctgtagt agagggcaag caaattgatg gtaaagtggt taccaaattt 4080
attggtgaca aatttgattt tatggtgggt tacgggatga catttagtat gtctcctttt 4140
gaactcgccc agttatatgg ttcatgtata acaccaaatg tttgttttgt taaaggagat 4200
gttataaagg ttgttcgctt agttaatgct gaagtcattg ttaaccctgc taatgggcgt 4260
atggctcatg gtgccggcgt cgccggcgcc atagctgaaa aggcgggcag tgcttttatt 4320
aaagaaacct ccgatatggt gaaggctcag ggcgtttgcc aggttggtga atgctatgaa 4380
tctgccggtg gtaagttatg taaaaaggtg cttaacattg tagggccaga tgcgcgaggg 4440
catggcaagc aatgctattc acttttagag cgtgcttatc agcatattaa taagtgtgac 4500
aatgttgtca ctactttaat ttcggctggt atatttagtg tgcctactga tgtctcccta 4560
acttacttac ttggtgtagt gacaaagaat gtcattcttg tcagtaacaa ccaggatgat 4620
tttgatgtga tagagaagtg tcaggtgacc tccgttgctg gtaccaaagc gctatcactt 4680
caattggcca aaaatttgtg ccgtgatgta aagtttgtga cgaatgcatg tagttcgctt 4740
tttagtgaat cttgctttgt ctcaagctat gatgtgttgc aggaagttga agcgctgcga 4800
catgatatac aattggatga tgatgctcgt gtctttgtgc aggctaatat ggactgtctg 4860
cccacagact ggcgtctcgt taacaaattt gatagtgttg atggtgttag aaccattaag 4920
tattttgaat gcccgggcgg gatttttgta tccagccagg gcaaaaagtt tggttatgtt 4980
cagaatggtt catttaagga ggcgagtgtt agccaaataa gggctttact cgctaataag 5040
gttgatgtct tgtgtactgt tgatggtgtt aacttccgct cctgctgcgt agcagagggt 5100
gaagtttttg gcaagacatt aggttcagtc ttttgtgatg gcataaatgt caccaaagtt 5160
aggtgtagtg ccatttacaa gggtaaggtt ttctttcagt acagtgattt gtccgaggca 5220
gatcttgtgg ctgttaaaga tgcctttggt tttgatgaac cacaactgct gaagtactac 5280
actatgcttg gcatgtgtaa gtggccagta gttgtttgtg gcaattattt tgctttcaag 5340
cagtcaaata ataattgcta catcaacgtg gcatgtttaa tgctgcaaca cttgagttta 5400
aagtttccta agtggcaatg gcaagaggct tggaacgagt tccgctctgg taaaccacta 5460
aggtttgtgt ccttggtatt agcaaagggc agctttaaat ttaatgaacc ttctgattct 5520
atcgatttta tgcgtgtggt gctacgtgaa gcagatttga gtggtgccac gtgcaatttg 5580
gaatttgttt gtaaatgtgg tgtgaagcaa gagcagcgca aaggtgttga cgctgttatg 5640
cattttggta cgttggataa aggtgatctt gtcaggggtt ataatatcgc atgtacgtgc 5700
ggtagtaaac ttgtgcattg cacccaattt aacgtaccat ttttaatttg ctccaacaca 5760
ccagagggta ggaaactgcc cgacgatgtt gttgcagcta atatttttac tggtggtagt 5820
gtgggccatt acacgcatgt gaaatgtaaa cccaagtacc agctttatga tgcttgtaat 5880
gttaataagg tttcggaggc taagggtaat tttaccgatt gcctctacct taaaaattta 5940
aagcaaacct tctcgtctgt gctgacgact ttttatttag atgacgtaaa gtgtgtggag 6000
tataagccag atttatcgca gtattactgt gagtctggta aatattatac aaaacccatt 6060
attaaggccc aatttagaac atttgagaag gttgatggtg tctataccaa ctttaaattg 6120
gtgggacata gtattgctga aaaactcaat gctaagctgg gatttgattg taattctccc 6180
tttgtggagt ataaaattac agagtggcca acagctactg gagatgtggt gttggctagt 6240
gatgatttgt atgtaagtcg gtacttaagc gggtgcatta cttttggtaa accggttgtc 6300
tggcttggcc atgaggaagc atcgctgaaa tctctcacat attttaatag acctagtgtc 6360
gtttgtgaaa ataaatttaa cgtgttgccc gttgatgtca gtgaacccac ggacaagggg 6420
cctgtgcctg ctgcagtcct tgttaccggc gtccctggag ctgatgcgtc agctggtgcc 6480
ggtattgcca aggagcaaaa agcctgtgct tctgctagtg tggaggatca ggttgttacg 6540
gaggttcgtc aagagccatc tgtttcagct gctgatgtca aagaggttaa attgaatggt 6600
gttaaaaagc ctgttaaggt ggaaggtagt gtggttgtta atgatcccac tagcgaaacc 6660
aaagttgtta aaagtttgtc tattgttgat gtctatgata tgttcctgac agggtgtaag 6720
tatgtggttt ggactgctaa tgagttgtct cgactagtaa attcaccgac tgttagggag 6780
tatgtgaagt ggggtatggg aaagattgta acacccgcta agttgttgtt gttaagagat 6840
gagaagcaag agttcgtagc gccaaaagta gtcaaggcga aagctattgc ctgctattgt 6900
gctgtgaagt ggtttctcct ctattgtttt agttggataa agtttaatac tgacaataag 6960
gttatataca ccacagaagt agcttcaaag cttactttca agttgtgctg tttggccttt 7020
aagaatgcct tacagacgtt taattggagc gttgtgtcta ggggcttttt cctagttgca 7080
acggtctttt tactctggtt taactttttg tatgctaatg ttattttgag tgacttctat 7140
ttgcctaata ttgggcctct ccctacgttt gtgggacaga tagttgcgtg gtttaagact 7200
acatttggtg tgtcaaccat ctgtgatttc taccaggtga cggatttggg ctatagaagt 7260
tcgttttgta atggaagtat ggtatgtgaa ctatgcttct caggttttga tatgctggac 7320
aactatgatg ctataaatgt tgttcaacac gttgtagata ggcgtttgtc ctttgactat 7380
attagcctat ttaaactggt agttgagctt gtaatcggct actctcttta tactgtgtgc 7440
ttctacccac tgtttgtcct tattggaatg cagttattga ccacatggtt gcctgaattc 7500
tttatgctgg agactatgca ttggagtgct cgtttgtttg tgtttgttgc caatatgctt 7560
ccagctttta cgttactgcg attttacatc gtggtgacag ctatgtataa ggtctattgt 7620
ctttgtagac atgttatgta tggatgtagt aagcctggtt gcttgttttg ttataagaga 7680
aaccgtagtg tccgtgttaa gtgtagcacc gttgttggtg gttcactacg ctattacgat 7740
gtaatggcta acggcggcac aggtttctgt acaaagcacc agtggaactg tcttaattgc 7800
aattcctgga aaccaggcaa tacattcata actcatgaag cagcggcgga cctctctaag 7860
gagttgaaac gccctgtgaa tccaacagat tctgcttatt actcggtcac agaggttaag 7920
caggttggtt gttccatgcg tttgttctac gagagagatg gacagcgtgt ttatgatgat 7980
gttaatgcta gtttgtttgt ggacatgaat ggtctgctgc attctaaagt taaaggtgtg 8040
cctgaaacgc atgttgtggt tgttgagaat gaagctgata aagctggttt tctcggcgcc 8100
gcagtgtttt atgcacaatc gctctacaga cctatgttga tggtggaaaa gaaattaata 8160
actaccgcca acactggttt gtctgttagt cgaactatgt ttgaccttta tgtagattca 8220
ttgctgaacg tcctcgacgt ggatcgcaag agtctaacaa gttttgtaaa tgctgcgcac 8280
aactctctaa aggagggtgt tcagcttgaa caagttatgg atacctttat tggctgtgcc 8340
cgacgtaagt gtgctataga ttctgatgtt gaaaccaagt ctattaccaa gtccgtcatg 8400
tcggcagtaa atgctggcgt tgattttacg gatgagagtt gtaataactt ggtgcctacc 8460
tatgttaaaa gtgacactat cgttgcagcc gatttgggtg ttcttattca gaataatgct 8520
aagcatgtac aggctaatgt tgctaaagcc gctaatgtgg cttgcatttg gtctgtggat 8580
gcttttaacc agctatctgc tgacttacag cataggctgc gaaaagcatg ttcaaaaact 8640
ggcttgaaga ttaagcttac ttataataag caggaggcaa atgttcctat tttaactaca 8700
ccgttctctc ttaaaggggg cgctgttttt agtagaatgt tacaatggtt gtttgttgct 8760
aatttgattt gtttcattgt gttgtgggcc cttatgccaa catatgcagt gcacaaatcg 8820
gatatgcagt tgcctttata tgccagtttt aaagttatag ataacggtgt gctaagggat 8880
gtgtctgtta ctgacgcatg cttcgcaaac aaatttaatc aattcgacca atggtatgag 8940
tctacttttg gtcttgctta ttaccgcaac tctaaggctt gtcctgttgt ggttgctgta 9000
atagatcaag acattggcca taccttattt aatgttccta ccacagtttt aagatatgga 9060
tttcatgtgt tgcattttat aacccatgca tttgctactg atagcgtgca gtgttacacg 9120
ccacatatgc aaatccccta tgataatttc tatgctagtg gttgcgtgtt gtcatccctc 9180
tgtactatgc ttgcgcatgc agatggaacc ccgcatcctt attgttatac agggggtgtt 9240
atgcataatg cctctctgta tagttctttg gctcctcatg tccgttataa cctggctagt 9300
tcaaatggtt atatacgttt tcccgaagtg gttagtgaag gcattgtgcg tgttgtgcgc 9360
actcgctcta tgacctactg cagggttggt ttatgtgagg aggccgagga gggtatctgc 9420
tttaatttta atcgttcatg ggtattgaac aacccgtatt atagggccat gcctggaact 9480
ttttgtggta ggaatgcttt tgatttaata catcaagttt taggaggatt agtgcggcct 9540
attgatttct ttgccttaac ggcgagttca gtggctggtg ctatccttgc aattattgtc 9600
gttttggctt tctattattt aatcaagctt aagcgtgcct ttggtgacta cactagtgtt 9660
gtggttatca atgtaattgt gtggtgtata aattttctga tgctttttgt gtttcaggtt 9720
tatcccacat tgtcttgttt atatgcttgt ttctacttct acaccacgct ttatttccct 9780
tcggagataa gtgttgttat gcatttgcaa tggcttgtca tgtatggtgc tattatgccc 9840
ttgtggtttt gcattattta cgtggcagtc gttgtttcaa accatgcatt gtggttgttc 9900
tcttactgcc gcaaaattgg taccgaggtt cgtagtgacg gcacatttga ggaaatggcc 9960
cttactacct ttatgattac taaagaatct tattgtaagt tgaaaaactc tgtttctgat 10020
gttgctttta acaggtactt gagtctttac aacaagtacc gttacttcag tggcaaaatg 10080
gatactgccg cttatagaga ggctgcctgt tcacaactgg caaaggcaat ggaaacattt 10140
aaccataata atggtaatga tgttctctat cagcctccaa ccgcctctgt tactacatca 10200
tttttacagt ctggtatagt gaagatggtg tcgcccacct ctaaagtgga gccttgtatt 10260
gttagtgtta cttatggtaa catgacactt aatgggttgt ggttggatga taaagtttat 10320
tgcccaagac atgttatctg ttcttcagct gacatgacag accctgatta tcctaatttg 10380
ctttgtagag tgacatcaag tgatttttgt gttatgtctg gtcgtatgag ccttactgta 10440
atgtcttatc aaatgcaggg ctgccaactt gttttgactg ttacactgca aaatcctaac 10500
acgcctaagt attccttcgg tgttgttaag cctggtgaga catttactgt actggctgca 10560
tacaatggca gacctcaagg agccttccat gttacgcttc gtagtagcca taccataaag 10620
ggctcctttc tatgtggatc ctgcggttct gtaggatatg ttttaactgg cgatagtgta 10680
cgatttgttt atatgcatca gctagagttg agtactggtt gtcataccgg tactgacttt 10740
agtgggaact tttatggtcc ctatagagat gcgcaagttg tacaattgcc tgttcaggat 10800
tatacgcaga ctgttaatgt tgtagcttgg ctttatgctg ctatttttaa cagatgcaac 10860
tggtttgtgc aaagtgatag ttgttccctg gaggagttta atgtttgggc tatgaccaat 10920
ggttttagct caatcaaagc cgatcttgtc ttggatgcgc ttgcttctat gacaggcgtt 10980
acagttgaac aggtgttggc cgctattaag aggctgcatt ctggattcca gggcaaacaa 11040
attttaggta gttgtgtgct tgaagatgag ctgacaccaa gtgatgttta tcaacaacta 11100
gctggtgtca agctacagtc aaagcgcaca agagttataa aaggtacatg ttgctggata 11160
ttggcttcaa cgtttttgtt ctgtagcatt atctcagcat ttgtaaaatg gactatgttt 11220
atgtatgtta ctacccatat gttgggagtg acattgtgtg cactttgttt tgtaagcttt 11280
gctatgttgt tgatcaagca taagcatttg tatttaacta tgtacatcat gcctgtgtta 11340
tgcacactgt tttacaccaa ctatttggtt gtgtacaaac agagttttag aggtctagct 11400
tatgcttggc tttcacactt tgtccctgct gtagattata catatatgga tgaagtttta 11460
tatggtgttg tgttgctagt agctatggtg tttgttacca tgcgtagcat aaaccacgac 11520
gtcttttcta ttatgttctt ggttggtaga cttgtcagcc tggtatccat gtggtatttt 11580
ggagccaatt tagaggaaga ggtactattg ttcctcacat ccctatttgg cacgtacaca 11640
tggactacta tgttgtcatt ggctaccgct aaggttattg ctaaatggtt ggctgtgaat 11700
gtcttgtact tcacagacgt accgcaaatt aaattagttc tgttgagcta cttgtgtatt 11760
ggttatgtgt gttgttgtta ttggggaatc ttgtcactcc ttaatagcat ttttaggatg 11820
ccattgggcg tctacaatta taaaatctcc gttcaggagt tacgttatat gaatgctaat 11880
ggcttgcgcc cacctagaaa tagttttgag gccctgatgc ttaattttaa gctgttggga 11940
attggtggtg tgccagtcat tgaagtatct caaattcaat caagattgac ggatgttaaa 12000
tgtgctaatg ttgtgttgct taattgcctc cagcacttgc atattgcatc taattctaag 12060
ttgtggcagt attgtagtac tttgcacaat gaaatactgg ctacatctga tttgagcgtg 12120
gccttcgata agttggctca actcttagtt gttttatttg ctaatccagc agcagtggat 12180
agcaagtgcc ttgcaagtat tgaagaagtg agcgatgatt acgttcgcga caatactgtc 12240
ttgcaagcct tacagagtga atttgttaat atggctagct tcgttgagta tgaacttgct 12300
aagaagaatc tagatgaggc taaggctagc ggctctgcca atcaacagca gattaagcag 12360
ctagagaagg cgtgtaatat tgctaagtca gcatatgagc gcgacagagc tgttgctcgt 12420
aagctggaac gtatggctga tttagctctt acaaacatgt ataaagaagc tagaattaat 12480
gataagaaga gtaaggtagt gtctgcattg caaaccatgc tctttagtat ggtgcgtaag 12540
ctagataacc aagctcttaa ttctatttta gacaacgcag ttaagggttg tgtacctttg 12600
aatgcaatac catcattgac ttcgaacact ctgactataa tagtgccaga taagcaggtt 12660
tttgatcagg ttgtggataa tgtgtatgtc acctatgctg ggaatgtatg gcatatacag 12720
tttattcaag atgctgatgg tgctgttaaa caattgaatg agatagatgt taattcaacc 12780
tggcctctag tcattgctgc aaataggcat aatgaagtgt ctactgttgt tttgcagaac 12840
aatgagttga tgcctcagaa gttgagaact caggttgtca atagtggctc agatatgaat 12900
tgtaatactc ctacccagtg ttactataat actactggca cgggtaagat tgtgtatgct 12960
atacttagtg actgtgacgg cctgaagtac actaagatag taaaagaaga tggaaattgt 13020
gttgttttgg aattggatcc tccctgtaag ttttctgttc aggatgtgaa gggccttaaa 13080
attaagtacc tttactttgt gaaggggtgt aatacactgg ctagaggctg ggttgtaggc 13140
accttatcct cgacagtgag attgcaggcg ggtacggcaa ctgagtatgc ctccaactct 13200
gcaatactgt cgctgtgtgc gttttctgta gatcctaaga aaacgtactt ggattatata 13260
aaacagggtg gagttcccgt tactaattgt gttaagatgt tatgtgacca tgctggcact 13320
ggtatggcca ttactattaa gccggaggca accactaatc aggattctta tggtggtgct 13380
tccgtttgta tatattgccg ctcgcgtgtt gaacatccag atgttgatgg attgtgcaaa 13440
ttacgcggca agtttgtcca agtgccctta ggcataaaag atcctgtgtc atatgtgttg 13500
acgcatgatg tttgtcaggt ttgtggcttt tggcgagatg gtagctgttc ctgtgtaggc 13560
acaggctccc agtttcagtc aaaagacacg aactttttaa acggattcgg ggtacaagtg 13620
taaatgcccg tcttgtaccc tgtgccagtg gcttggacac tgatgttcaa ttaagggcat 13680
ttgacatttg taatgctaat cgagctggca ttggtttgta ttataaagtg aattgctgcc 13740
gcttccagcg tgtagatgag gacggcaaca agttggataa gttctttgtt gttaaaagaa 13800
ctaatttaga agtgtataac aaggagaaag aatgctatga gttgacaaaa gaatgcggtg 13860
ttgtggctga acacgagttc ttcacatttg atgtggaggg aagtcgggta ccacacatag 13920
tccgtaaaga tctttcaaag tttactatgt tagatctttg ctatgcattg cgtcattttg 13980
accgcaatga ttgttcaact cttaaggaaa ttctccttac atatgctgag tgtgaagagt 14040
cctacttcca aaagaaggac tggtatgatt ttgttgagaa tcctgatata attaatgtgt 14100
acaagaagct tggtcctata tttaatagag ccctgcttaa cactgccaag tttgcagacg 14160
cattagtgga ggcaggctta gtaggtgttt taacacttga taatcaagat ttatatggtc 14220
aatggtatga ctttggagat tttgtcaaga cagtacctgg ttgtggtgtt gccgtggcag 14280
actcttatta ttcatatatg atgccaatgc tgactatgtg tcatgcgttg gatagtgagt 14340
tgtttgttaa tggtacttat agggagtttg accttgttca gtatgatttt actgatttca 14400
agctagagct gttcactaag tattttaagc attggagtat gacctaccac ccgaacacct 14460
gtgagtgcga ggatgacagg tgcattattc attgcgccaa ttttaatata cttttcagca 14520
tggtcttacc taagacctgt tttgggcctc ttgttaggca gatatttgtg gatggtgttc 14580
ctttcgttgt gtcgatcggt taccattata aagaattagg tgttgttatg aatatggatg 14640
tggatacaca tcgttatcgc ttgtctctta aggacttgct tttgtatgct gcagaccctg 14700
cccttcatgt ggcgtctgct agtgcactgc ttgatttgcg cacatgttgt tttagcgttg 14760
cagctattac aagtggcgta aaatttcaaa cagttaaacc tggaaatttt aatcaggatt 14820
tctacgagtt tattttgagt aaaggcctgc ttaaagaggg gagctccgtt gatttgaagc 14880
acttcttctt tacgcaggat ggtaatgctg ctattactga ttacaattac tacaagtata 14940
atctacccac catggtggat attaagcagt tgttgtttgt tttagaagtt gttaataagt 15000
acttcgagat ctatgagggt gggtgtatac ccgcaacaca ggtcattgtt aataattatg 15060
acaagagtgc tggctatcca tttaataaat ttggaaaggc caggctctat tatgaggcat 15120
tatcatttga ggagcaggat gaaatttatg cgtataccaa acgcaatgtc ctgccgaccc 15180
taactcaaat gaatcttaaa tatgctatta gtgctaagaa tagggcccgc accgttgctg 15240
gtgtctctat tctcagtact atgactggca gaatgtttca tcaaaagtgt ctaaagagta 15300
tagcagctac tcgcggtgtt cctgtagtta taggcaccac gaagttctat ggcggttggg 15360
atgatatgtt acgccgcctt attaaagatg ttgatagtcc tgtactcatg ggttgggact 15420
atcctaaatg tgatcgtgct atgccaaaca tactgcgtat tgttagtagt ttggtgctag 15480
cccgtaaaca tgattcgtgc tgttcgcata cggatagatt ctatcgtctt gcgaacgagt 15540
gcgcccaagt tttgagtgaa attgttatgt gtggtggttg ttattatgtt aaaccaggtg 15600
gcactagtag tggggatgca accactgctt ttgctaattc tgtgtttaac atttgtcaag 15660
ctgtttccgc caatgtatgc tcgcttatgg catgcaatgg acacaaaatt gaagatttga 15720
gtatacgcga gttacaaaag cgcctatact ctaatgtcta tcgtgcggac catgttgacc 15780
ccgcatttgt tagtgagtat tatgagtttt taaacaagca ttttagtatg atgattttga 15840
gtgatgatgg tgttgtgtgt tataattcag agtttgcgtc caagggttat attgctaata 15900
taagtgcctt tcaacaggta ttatattatc aaaacaacgt gtttatgtct gaggccaaat 15960
gttgggtaga aacagacatc gaaaagggac cgcatgaatt ttgttctcaa catacaatgc 16020
tagtcaagat ggatggtgat gaagtctacc ttccataccc tgatccttcg agaatcttag 16080
gagcaggctg ttttgttgat gatttactca agactgatag cgttctcttg atagagcgtt 16140
tcgtaagtct tgcaattgat gcttatcctt tagtatacca tgagaaccca gagtatcaaa 16200
atgtgttccg ggtatattta gaatacatca agaagctgta caatgatctc ggtaatcaga 16260
tcctggacag ctacagtgtt attttaagta cttgtgatgg tcaaaagttt actgacgaga 16320
cgttttacaa gaacatgtat ttaagaagtg cagtgctgca aagcgttggt gcctgcgttg 16380
tctgtagttc tcaaacatca ttacgttgtg gcagttgcat acgcaagcct ttgctgtgtt 16440
gcaaatgcgc ctatgatcat gttatgtcca ctgatcataa atatgtcctg agtgtgtcac 16500
catatgtgtg taattcaccg ggatgtgatg taaatgatgt taccaaattg tatttaggtg 16560
gtatgtcata ttattgtgag gaccataaac cacagtattc attcaaattg gtgatgaatg 16620
gtatggtttt tggtttatat aagcagtctt gtactggttc gccctacata gaggatttta 16680
ataaaatcgc tagttgcaaa tggacagaag tcgatgatta tgtgctagct aatgaatgca 16740
ccgaacgcct taaattgttt gccgcagaaa cgcagaaggc cacagaagag gcctttaagc 16800
aatgttatgc gtcagcaacg atccgtgaga tcgtgagcga tcgggagtta attttatctt 16860
gggaaattgg taaagtccgc ccgccactta ataaaaatta cgtgttcacc ggctaccatt 16920
ttactaataa tggtaagaca gttttaggtg agtatgtttt tgataagagt gagttgacta 16980
atggtgtgta ttatcgcgcc acaaccactt ataagttatc tgtaggtgat gtgttcattt 17040
taacatcaca cgcagtgtct agtttaagtg ctcctacatt agtaccgcag gagaattata 17100
ctagcattcg ttttgctagt gtttatagtg tgcctgagac gtttcagaat aatgtgccta 17160
attatcagca cattggaatg aagcgctatt gtactgtaca gggaccgcct ggtactggta 17220
agtcccatct agccattggg ctagctgttt attattgtac agcgcgcgtg gtgtataccg 17280
ctgctagcca tgctgcagtt gacgcgctgt gtgaaaaggc acataaattt ctcaacatca 17340
acgactgcac gcgtattgtt cctgcaaagg tgcgtgtaga ttgttatgat aaattcaagg 17400
tcaatgacac cactcgcaag tatgtgttta ctacaataaa tgcattacct gagttggtga 17460
ctgacattat tgtcgttgat gaagttagta tgcttaccaa ctatgagctg tctgttatta 17520
acagtcgtgt tagggctaag cattatgtgt atattggcga cccggcgcag ttacctgcac 17580
cacgtgtgct actgaataag ggaactctag aacctagata ttttaattcc gttaccaagc 17640
taatgtgttg tttgggtcca gatattttct tgggcacctg ttatagatgc cctaaggaga 17700
ttgtggatac ggtgtcagcc ttggtttata ataataagct gaaggctaaa aatgataata 17760
gctccatgtg ctttaaggtt tattataagg gccagactac acatgagagt tctagtgctg 17820
ttaatatgca gcaaatacat ttaatttcca agtttctgaa ggcaaacccc agttggagta 17880
acgccgtatt tattagtcct tataactcgc agaactatgt tgctaagaga gtcttgggat 17940
tacaaaccca gacagtagac tcagcgcagg gttctgaata tgattttgtt atctactcac 18000
agactgcgga aacagcgcat tctgtcaatg taaatagatt caatgttgct attacacgtg 18060
ctaagaaggg tattctctgt gtcatgagta gtatgcaatt atttgagtct cttaatttta 18120
ctacactgac gttggataag attaacaatc cacgattaca gtgtactaca aatttgttta 18180
aggattgtag caggagctat gtaggatatc acccagccca tgcaccatcc tttttggcag 18240
ttgatgacaa atataaggta ggcggtgatt tagccgtttg ccttaatgtt gctgattctg 18300
ctgtcactta ttcgcggctt atatcactca tgggattcaa gcttgacttg acccttgatg 18360
gttattgtaa gctgtttata actagagatg aagctatcaa acgtgttaga gcctgggttg 18420
gcttcgatgc agaaggtgcc catgcgatac gtgatagcat tgggacaaat ttcccattac 18480
aattaggctt ttcgactgga attgattttg ttgtcgaagc cactggaatg tttgctgaga 18540
gagatggtta tgtctttaaa aaggcagccg cacgagctcc tcctggcgaa caatttaaac 18600
accttatccc acttatgtca agagggcaga aatgggatgt ggttcgcatt agaatagtac 18660
aaatgttgtc agaccaccta gtggatttgg cagacagtgt tgtacttgtg acgtgggctg 18720
ccagctttga gctcacatgt ttgcgatatt tcgctaaagt tggaagagaa gttgtgtgta 18780
gtgtctgcac caagcgtgcg acatgtttta attctagaac tggatactat ggatgctggc 18840
gacatagtta ttcctgtgat tacctgtaca acccactaat agttgacatt caacagtggg 18900
gatatacagg atctttaact agcaatcatg atcctatttg cagcgtgcat aagggtgctc 18960
atgttgcatc atctgatgct atcatgaccc ggtgtctagc tgttcatgat tgcttttgta 19020
agtctgttaa ttggaattta gaatacccca ttatttcaaa tgaggtcagt gttaatacct 19080
cctgcaggtt attgcagcgc gtaatgttta gggctgcgat gctatgcaat aggtatgatg 19140
tgtgttatga cattggcaac cctaaaggtc ttgcctgtgt caaaggatat gattttaagt 19200
tctatgacgc ctcccctgtt gttaagtctg ttaaacagtt tgtttacaaa tacgaggcac 19260
ataaagatca atttttagat ggtttgtgta tgttttggaa ctgcaatgtg gataagtatc 19320
cagcgaatgc agttgtgtgt aggtttgaca cgcgtgtgtt gaacaaatta aatctccctg 19380
gctgtaatgg tggcagtttg tatgttaaca aacatgcatt ccacaccagt ccctttaccc 19440
gggctgcctt cgagaatttg aagcctatgc ctttctttta ttattcagat acgccctgtg 19500
tgtatatgga aggcatggaa tctaagcagg tcgattatgt cccattgaga agcgctacat 19560
gcatcacaag atgcaattta ggtggcgctg tttgtttaaa acatgctgag gagtatcgtg 19620
agtaccttga gtcttacaat acggcaacca cagcgggttt tactttttgg gtctataaga 19680
cttttgattt ttacaacctt tggaatactt ttactaggct ccaaagttta gaaaatgtag 19740
tgtataacct ggtcaacgct ggacactttg atggccgggc gggtgaactg ccttgtgctg 19800
ttataggtga gaaagtcatt gccaagattc aaaatgagga tgtcgtggtc tttaaaaata 19860
acacgccatt ccccactaat gtggctgtcg aattatttgc taagcgcagt attcggcccc 19920
accccgagct taagctcttt agaaatttga atattgacgt gtgctggagt cacgtccttt 19980
gggattatgc taaggatagt gtgttttgca gttcgacgta taaggtctgc aaatacacag 20040
atttacagtg cattgaaagc ttgaatgtac tttttgatgg tcgtgataat ggtgctcttg 20100
aagcttttaa gaagtgccgg aatggcgtct acattaacac gacaaaaatt aaaagtctgt 20160
cgatgattaa aggcccacaa cgtgccgatt tgaatggcgt agttgtggag aaagttggag 20220
attctgatgt ggaattttgg tttgctgtgc gtaaagacgg tgacgatgtt atcttcagcc 20280
gtacagggag ccttgaaccg agccattacc ggagcccaca aggtaatccg ggtggtaatc 20340
gcgtgggtga tctcagcggt aatgaagctc tagcgcgtgg cactatcttt actcaaagca 20400
gattattatc ttctttcaca cctcgatcag agatggagaa agattttatg gatttagatg 20460
atgatgtgtt cattgcaaaa tatagtttac aggactacgc gtttgaacac gttgtttatg 20520
gtagttttaa ccagaagatt attggaggtt tgcatttgct tattggctta gcccgtaggc 20580
agcaaaaatc caatctggta attcaagagt tcgtgacata cgactctagc attcattcgt 20640
actttatcac tgacgagaac agtggtagta gtaagagtgt gtgcactgtt attgatttat 20700
tgttagatga ttttgtggac attgtaaagt ccctgaatct aaagtgtgtg agtaaggttg 20760
ttaatgttaa tgtggatttt aaggacttcc agtttatgtt gtggtgcaat gaggagaagg 20820
tcatgacttt ctatcctcgt ttgcaggctg ctgctgactg gaaacctggt tatgttatgc 20880
ctgtcttata taagtatttg gaatcgcctc tggaaagagt aaacctctgg aattatggca 20940
agccgattac tttacctaca ggatgtatga tgaatgttgc taagtatact caattatgtc 21000
aatatttgag cactacaaca ttagcagttc cggctaatat gcgtgtctta caccttggtg 21060
ccggttcgga taagggtgtt gcccctgggt ctgcagttct taggcagtgg ctaccagcgg 21120
gaagtattct tgtagataat gatgtgaatc catttgtgag tgacagtgtc gcctcatatt 21180
atggaaattg tataacctta ccctttgatt gtcagtggga tctgataatt tctgatatgt 21240
acgaccctct tactaagaac attggggagt acaacgtgag taaagatgga ttctttactt 21300
acctctgtca tttaattcgt gacaagttgg ctctgggtgg cagtgttgcc ataaaaataa 21360
cagagttttc ttggaacgct gagttatata gtttaatggg gaagtttgcg ttctggacaa 21420
tcttttgcac caacgtaaac gcctcttcaa gtgaaggatt tttgattggc ataaattggt 21480
tgaataagac ccgtaccgaa attgacggta aaaccatgca tgccaattat ctgttttgga 21540
gaaatagtac aatgtggaat ggaggggctt acagtctctt tgacatgagt aagttccctt 21600
tgaaagcggc tggtacggct gttgttagcc ttaaaccaga ccaaataaat gacttagtcc 21660
tctccttgat tgagaagggc aagttattag tgcgtgatac acgcaaagaa gtttttgttg 21720
gcgatagcct agtaaatgtc aaataaatct atacttgtcg tggctgtgaa aatggccttt 21780
gctgacaagc ctaatcattt cataaacttt cccctggccc aatttagtgg ctttatgggt 21840
aagtatttaa agctacagtc tcaacttgtg gaaatgggtt tagactgtaa attacagaag 21900
gcaccacatg ttagtattac cctgcttgat attaaagcag accaatacaa acaggtggaa 21960
tttgcaatac aagaaataat agatgatctg gcggcatatg agggagatat tgtctttgac 22020
aaccctcaca tgcttggcag atgccttgtt cttgatgtta gaggatttga agagttgcat 22080
gaagatattg ttgaaattct ccgcagaagg ggttgcacgg cagatcaatc cagacactgg 22140
attccgcact gcactgtggc ccaatttgac gaagaaagag aaacaaaagg aatgcaattc 22200
tatcataaag aacccttcta cctcaagcat aacaacctat taacggatgc tgggcttgag 22260
ctcgtgaaga taggttcttc caaaatagat gggttttatt gtagtgaact gagtgtttgg 22320
tgtggtgaga ggctttgtta taagcctcca acacccaaat tcagtgatat atttggctat 22380
tgctgcatag ataaaatacg tggtgattta gaaataggcg acctgccgca ggatgatgag 22440
gaagcgtggg ccgagctaag ttaccactat caaagaaaca cctacttctt cagacatgtg 22500
cacgataata gcatctattt tcgtaccgtg tgtagaatga agggttgtat gtgttgattt 22560
gtttttacac tattagtgta ataagcttat tattttgttg aaaagggcag gatgtgcata 22620
gctatggctc ctcgcacact gcttttgctg atttgatgtc agctggtgtt tgggttcaat 22680
gaacctctta acatcgtttc acatttaaat gatgactggt ttctatttgg tgacagtcgg 22740
tccgactgta cctatgtaga aaataacggt catcctaaat tagattggct tgacctcgac 22800
ccaaagttgt gtaattcagg aaagatttcc gcaaagagtg gtaactctct ctttaggagt 22860
tttcacttca ctgattttta caattatacg ggtgagggat accaaattgt attttatgaa 22920
ggagttaatt ttagtcccag ccatggcttt aaatgcctgg ctcatggaga taataaaaga 22980
tggatgggca ataaagctcg attttatgcc cgagtgtatg agaagatggc ccaatatagg 23040
agcctatcgt ttgttaatgt gtcttatgcc tatggaggta atgcaaagcc cgcctccatt 23100
tgcaaagaca atactttaac actcaataac cccaccttca tatcgaagga gtctaattat 23160
gttgattact actacgagag tgaggctaat ttcacactag aaggttgtga tgaatttata 23220
gtaccgctct gtggttttaa tggccattcc aagggctcgt cgtcggatgc tgccaataaa 23280
tattatactg actctcagag ttactataat atggatattg gtgtcttata tgggttcaat 23340
tcgaccttgg atgttggcaa cactgctaag gatccgggtc ttgatctcac ttgtaggtat 23400
cttgcattga ctcctggtaa ttataaggct gtgtccttag aatatttgtt aagcttaccc 23460
tcaaaggcta tttgcctcca taagacaaag cgctttatgc ctgtgcaggt agttgactca 23520
aggtggagta gcatccgcca gtcagacaat atgaccgctg cagcctgtca gctgccatat 23580
tgtttctttc gcaacacatc tgcgaattat agtggtggca cacatgatgc gcaccatggt 23640
gattttcatt tcaggcagtt attgtctggt ttgttatata atgtttcctg tattgcccag 23700
cagggtgcat ttctttataa taatgtgtcg tcctcttggc cagcctatgg gtacggtcat 23760
tgtccaacgg cagctaacat tggttatatg gcacctgttt gtatctatga ccctctcccg 23820
gtcatactgc taggtgtgtt attgggtata gctgtgttga ctattgtgtt tctgatgttt 23880
tattttatga cggatagcgg tgttagattg catgaggcat aatctaaaca tgtttgtttt 23940
tcttgtttta ttgccactag tctctagtca gtgtgttaat cttacaacca gaactcaatt 24000
accccctgca tacactaatt ctttcacacg tggtgtttat taccctgaca aagttttcag 24060
atcctcagtt ttacattcaa ctcaggactt gttcttacct ttcttttcca atgttacttg 24120
gttccatgct atacatgtct ctgggaccaa tggtactaag aggtttgata accctgtcct 24180
accatttaat gatggtgttt actttgcttc cactgagaag tctaacataa taagaggctg 24240
gatttttggt actactttag attcgaaaac ccagtcccta cttattgtta ataacgctac 24300
taatgttgtt atcaaagtct gtgaatttca attttgtaac gatccatttt tgggtgttta 24360
ttaccacaaa aacaacaaaa gttggatgga aagtgagttc agagtttatt ctagtgcgaa 24420
taattgcact tttgaatacg tctctcagcc ttttcttatg gaccttgaag gaaaacaggg 24480
taatttcaaa aatcttaggg aatttgtgtt caagaatatt gatggttact tcaagatata 24540
ctctaagcac acgcctatta atttagtgcg tgatctccct cagggttttt cggctttaga 24600
accattggta gatttgccaa taggtattaa catcactagg tttcaaactt tacttgcttt 24660
acatagaagt tatttaactc ctggtgattc ttcttcaggt tggacagctg gtgctgcagc 24720
ttattatgtg ggttatcttc aacctaggac ttttctactg aagtacaatg aaaatggaac 24780
cattacagat gctgtagact gtgcacttga ccctctctca gaaacaaagt gtacgttgaa 24840
atccttcact gtagaaaaag gaatctatca aacttctaac tttagagtcc aaccaacaga 24900
atctattgtt agatttccta acatcacaaa cttgtgccct tttggtgaag tttttaacgc 24960
caccagattt gcatctgttt atgcttggaa caggaagaga atcagcaact gtgttgctga 25020
ttattctgtc ctgtataatt ccgcatcatt ttccactttt aagtgttatg gagtgtctcc 25080
tactaaatta aatgatctct gctttactaa tgtctatgca gattcatttg taattagagg 25140
tgatgaagtc agacaaatcg ctccagggca aactggaaag attgctgatt ataactacaa 25200
attaccagat gattttacag gctgcgttat agcttggaat tctaacaatc ttgattctaa 25260
ggttggtggt aattataatt acctgtacag attgtttagg aagtctaatc tcaaaccttt 25320
tgagagagat atttcaactg aaatctatca ggccggtagc acaccttgta atggtgttga 25380
aggttttaat tgttactttc ctctgcaatc atatggtttc caacccacta atggtgttgg 25440
ttaccaacca tacagagtag tagtactttc ttttgaactt ctacatgcac cagcaactgt 25500
ttgtggacct aaaaagtcta ctaatttggt taagaacaag tgtgtcaatt tcaacttcaa 25560
tggtttaaca ggcacaggtg ttcttactga gtctaacaaa aagtttctgc ctttccaaca 25620
atttggcaga gacattgctg acactactga tgctgttcgt gatccacaaa cacttgagat 25680
tcttgacatt acaccatgtt cttttggtgg tgtcagtgtt ataacaccag gaacaaatac 25740
ttctaaccag gttgctgttc tttatcagga tgttaactgc acagaagtcc ctgttgctat 25800
tcatgcagat caacttactc ctacttggcg tgtttattct acaggttcta atgtttttca 25860
aacacgtgca ggctgtttaa taggggctga acatgtcaac aactcatatg agtgtgacat 25920
acccattggt gcaggtatat gcgctagtta tcagactcag actaattctc ctcggagagc 25980
aagaagtgta gctagtcaat ccatcattgc ctacactatg tcacttggtg cagaaaattc 26040
agttgcttac tctaataact ctattgccat acccacaaat tttactatta gcgttaccac 26100
agaaattcta ccagtgtcta tgaccaagac atcagtagat tgtacaatgt acatttgtgg 26160
tgattcaact gaatgcagca atcttttgtt gcaatatggc agtttttgta cacaattaaa 26220
ccgtgcttta actggaatag ctgttgaaca agacaaaaac acccaagaag tttttgcaca 26280
agtcaaacaa atttacaaga caccaccaat taaagatttt ggcggtttta attttagcca 26340
gatactgcca gatccatcaa aaccaagcaa gaggtcattt attgaagatc tactgttcaa 26400
caaagtgaca cttgcagatg ctggcttcat caaacaatat ggtgattgcc ttggtgatat 26460
tgctgctaga gacctcattt gtgcacaaaa gtttaacggc cttactgttt tgccaccttt 26520
gctcacagat gaaatgattg ctcaatacac ttctgcactg ttagcaggta caatcacttc 26580
tggttggact tttggtgcag gtgctgcatt acaaatacca tttgctatgc aaatggctta 26640
taggtttaat ggtattggag ttacacagaa tgttctctat gagaaccaaa aattgattgc 26700
caaccaattt aatagtgcta ttggcaaaat tcaagactca ctttcttcca cagcaagtgc 26760
acttggaaaa cttcaagatg tggtcaacca aaatgcacaa gctttaaaca cgcttgttaa 26820
acaacttagc tccaattttg gtgcaatttc aagtgtttta aacgacatcc tttcacgtct 26880
tgacaaagtt gaggctgaag tgcaaattga taggttgatc acaggcagac ttcaaagttt 26940
gcagacatat gtgactcaac aattaattag agctgcagaa atcagagctt ctgctaatct 27000
tgctgctact aaaatgtcag agtgtgtact tggacaatca aaaagagttg acttttgcgg 27060
aaagggctat catcttatgt catttcctca gtcagcacct catggtgtcg tctttttgca 27120
tgtgacttat gtccctgcac aagaaaagaa cttcacaact gctcctgcca tttgtcatga 27180
tggaaaagca cactttcctc gtgaaggtgt ctttgtttca aatggcacac actggtttgt 27240
aacacaaagg aatttttatg aaccacaaat cattactaca gacaacacat ttgtgtctgg 27300
taactgtgat gttgtaatag gaattgtcaa caacacagtt tatgatcctt tgcaacctga 27360
attagactca ttcaaggagg agcttgataa atacttcaag aaccatacct caccagatgt 27420
tgatttaggt gacatctctg gcattaatgc ttcagttgta aacattcaga aagaaatcga 27480
ccgcctcaat gaggttgcca agaatttaaa tgaatctctc atcgatctcc aagaacttgg 27540
aaagtatgag cagtatataa aatggccatg gtacatttgg ctaggtttta tagctggctt 27600
gattgccata gtaatggtga caattatgct ttgctgtatg accagttgct gtagttgtct 27660
caagggctgt tgttcttgtg gatcctgctg caaatttgac gaggacgact ctgagccagt 27720
gctcaaagga gtcaaattac attacacata actatcacag cctctcctgg aaagacagaa 27780
aatctaaaca atttatagca ttctcattgc tacctggccc cgtaagaggc agtcatagct 27840
atggccgtgt tggtcctaag gctacattgg ctgctgtctt tattggtcca tttattgtag 27900
catgtatgct aggcattggc ctagtttatt tattgcaatt gcaagttcaa atttttcatg 27960
ttaaggatac catacgtgtg actggcaagc cagccactgt gtcttatact acaagtacac 28020
cagtaacacc gagcgcgacg acgctcgatg gtactacgta tactttaatt agacccacta 28080
gctcttatac aagagtttat cttggtactc caagaggttt tgattatagt acatttgggc 28140
ctaagaccct agattatgtt actaatctaa acctcatctt aattctggtc gtccatatac 28200
ttttaaggca ttgtccaggc atatgaggcc aacagccaca tggatttggc atgtgagtga 28260
tgcatggtta cgccgcacgc gggactttgg tgtcattcgc ctagaagatt tttgttttca 28320
atttaattat agccaacccc gagttggtta ttgtagagtt cctttaaagg cttggtgtag 28380
caaccagggt aaatttgcag cgcagtttac cctaaaaagt tgcgaaaaac caggtcacga 28440
aaaatttatt actagcttca cggcctacgg cagaactgtc caacaggccg ttagcaagtt 28500
agtagaagaa gctgttgatt ttattctttt tagggccacg cagctcgaaa gaaatgttta 28560
atttattcct tacagacaca gtatggtatg tggggcagat tatttttata ttcgcagtgt 28620
gtttgatggt caccataatt gtggttgcct tccttgcgtc tatcaaactt tgtattcaac 28680
tttgcggttt atgtaatact ttggtgctgt ccccttctat ttatttgtat gataggagta 28740
agcagcttta taagtactat aatgaagaaa tgagactgcc cctattagag gtggatgata 28800
tctaatccaa acattatgag tagtactact caggccccag agcccgtcta tcaatggacc 28860
gccgacgagg cagttcaatt ccttaaggaa tggaacttct cgttgggcat tatactactc 28920
tttattacta tcatactaca gttcggttac acgagccgta gcatgtttat ttatgttgtg 28980
aaaatgataa tcttgtggtt aatgtggcca ctgactattg ttttgtgtat tttcaattgc 29040
gtgtatgcgc taaataatgt gtatcttgga ttttctatag tgtttactat agtgtccatt 29100
gtaatctgga tcatgtattt tgtgaacagc ataaggttgt ttatcaggac tggtagctgg 29160
tggagcttca accccgaaac aaacaacctt atgtgtatag atatgaaagg taccgtgtat 29220
gttagaccca ttattgagga ttaccataca ctaacagcca ctattattcg tggccacctc 29280
tacatgcaag gtgttaagct aggcaccggt ttctctttgt ctgacttgcc cgcttatgtt 29340
acagttgcta aggtgtcaca cctttgcact tataagcgcg cattcttaga caaggtagac 29400
ggtgttagcg gttttgctgt ttatgtgaag tccaaggtcg gaaattaccg actgccctca 29460
aacaaaccga gtggcgcgga caccgcattg ttgagaacct aatctaaact ttaaggagag 29520
aatgaatcct atgtcggcgc tcggtggtaa cccctcgcga gaaagtcggg ataggacact 29580
ctctatcaga atggatgtct tgctgtcata acagatagag aaggttgtgg cagaccctgt 29640
atcaattagt tgaaagagat tgcaaaatag agaatgtgtg agagaagtta gcaaggtcct 29700
acgtctaacc ataagaacgg cgataggcgc cccctgggaa cagctcacat cagggtacta 29760
ttcctgcaat gccctagtaa atgaatgaag ttgatcatgg ccaattggaa gaatcacaaa 29820
aaaaaaaaaa aaaacggccg gttt 29844
<210> 34
<211> 27671
<212> DNA
<213> Artificial Sequence
<220>
<223> COVAX191_delta_HEN_RNA
<400> 34
gtataagagt gattggcgtc cgtacgtacc ctctcaactc taaaactctt gtagtttaaa 60
tctaatctaa actttataaa cggcacttcc tgcgtgtcca tgcccgcggg cctggtcttg 120
tcatagtgct gacatttgta gttccttgac tttcgttctc tgccagtgac gtgtccattc 180
ggcgccagca gcccacccat aggttgcata atggcaaaga tgggcaaata cggcctgggc 240
ttcaaatggg ccccagaatt tccatggatg cttccgaacg catcggagaa gttgggtaac 300
cctgagaggt cagaggagga tgggttttgc ccctctgctg cgcaagaacc gaaagttaaa 360
ggaaaaactt tggttaatca cgtgagggtg aattgtagcc ggcttccagc tttggaatgc 420
tgtgttcagt ctgccataat ccgtgatatt tttgtagatg aggatcccca gaaggtggag 480
gcctcaacta tgatggcatt gcagttcggt agtgccgtct tggttaagcc atccaagcgc 540
ttgtctattc aggcatggac taatttgggt gtgcttccca aaacagctgc catggggttg 600
ttcaagcgcg tctgcctgtg taacaccagg gagtgctctt gtgacgccca cgtggccttt 660
caccttttta cggtccaacc cgatggtgta tgcctgggta atggccgttt tataggctgg 720
ttcgttccag tcacagccat accggagtat gcgaagcagt ggttgcaacc ctggtccatc 780
cttcttcgta agggtggtaa caaagggtct gtgacatccg gccacttccg ccgcgctgtt 840
accatgcctg tgtatgactt taatgtagag gatgcttgtg aggaggttca tcttaacccg 900
aagggtaagt actcctgcaa ggcgtatgcc ctgctgaagg gctatcgcgg tgttaagccc 960
atcctgtttg tggaccagta tggttgcgac tatactggat gtctcgccaa gggtcttgag 1020
gactatggcg atctcacctt gagtgagatg aaggagttgt tccctgtgtg gcgtgactcc 1080
ttggatagtg aagtccttgt ggcttggcac gttgatcgag atcctcgggc tgctatgcgt 1140
ctgcagactc ttgctactgt acgttgcatt gattatgtgg gccaaccgac cgaggatgtg 1200
gtggatggag atgtggtagt gcgtgagcct gctcatcttc tcgcagccaa tgccattgtt 1260
aaaagactcc cccgtttggt ggagactatg ctgtatacgg attcgtccgt tacagaattc 1320
tgttataaaa ccaagctgtg tgaatgcggt tttatcacgc agtttggcta tgtggattgt 1380
tgtggtgaca cctgtgattt tcgtgggtgg gttgccggca atatgatgga tggctttcca 1440
tgtccagggt gtaccaaaaa ttatatgccc tgggaattgg aggcccagtc atcaggtgtt 1500
ataccagaag gaggtgttct attcactcag agcactgata cagtgaatcg tgagtccttt 1560
aagctctacg gtcatgctgt tgtgcctttt ggttctgctg tgtattggag cccttgccca 1620
ggtatgtggc ttccagtaat ttggtcgtcg gttaagtcat actctggttt gacttataca 1680
ggagtagttg gttgtaaggc aattgttcaa gagacagacg ctatatgtcg ttctctgtat 1740
atggattatg tccagcacaa gtgtggcaat ctcgagcaga gagctatcct tggattggac 1800
gatgtctatc atagacagtt gcttgtgaat aggggtgact atagtctcct ccttgagaat 1860
gtggatttgt ttgttaagcg gcgcgctgaa tttgcttgca aattcgccac ctgtggagat 1920
ggtcttgtac ccctcctact agatggttta gtgccccgca gttattattt gattaagagt 1980
ggtcaagctt tcacctctat gatggttaat tttagccatg aggtgactga catgtgtatg 2040
gacatggctt tattgttcat gcatgatgtt aaagtggcca ctaagtatgt taagaaggtt 2100
actggcaaac tggccgtgcg ctttaaagcg ttgggtgtag ccgttgtcag aaaaattact 2160
gaatggtttg atttagccgt ggacattgct gctagtgccg ctggatggct ttgctaccag 2220
ctggtaaatg gcttatttgc agtggccaat ggtgttataa cctttgtaca ggaggtgcct 2280
gagcttgtca agaattttgt tgacaagttc aaggcatttt tcaaggtttt gatcgactct 2340
atgtcggttt ctatcttgtc tggacttact gttgtcaaga ctgcctcaaa tagggtgtgt 2400
cttgctggca gtaaggttta tgaagttgtg cagaaatctt tgtctgcata tgttatgcct 2460
gtgggttgca gcgaagccac ttgtttggtg ggtgagattg aacctgcagt ttttgaagat 2520
gatgttgttg atgtggttaa agccccatta acatatcaag gctgttgtaa gccacccact 2580
tctttcgaga agatttgtat tgtggataaa ttgtatatgg ccaagtgtgg tgatcaattt 2640
taccctgtgg ttgttgataa cgacactgtt ggcgtgttag atcagtgctg gaggtttccc 2700
tgtgcgggca agaaagtcga gtttaacgac aagcccaaag tcaggaagat accctccacc 2760
cgtaagatta agatcacctt cgcactggat gcgacctttg atagtgttct ttcgaaggcg 2820
tgttcagagt ttgaagttga taaagatgtt acattggatg agctgcttga tgttgtgctt 2880
gacgcagttg agagtacgct cagcccttgt aaggagcatg atgtgatagg cacaaaagtt 2940
tgtgctttac ttgataggtt ggcaggagat tatgtctatc tttttgatga gggaggcgat 3000
gaagtgatcg ccccgaggat gtattgttcc ttttctgctc ctgatgacga ggactgcgtt 3060
gcagcggatg ttgtagatgc agatgaaaac caagatgatg atgccgagga ctcagcagtc 3120
cttgtcgctg atacccaaga agaggacggc gttgccaagg ggcaggttga ggcggattcg 3180
gaaatttgcg ttgcgcatac tggtagtcaa gaagaattgg ctgagcctga tgctgtcgga 3240
tctcaaactc ccatcgcctc tgctgaggaa accgaagtcg gagaggcaag cgacagggaa 3300
gggattgctg aggcgaaggc aactgtgtgt gctgatgctg tagatgcctg ccccgatcaa 3360
gtggaggcat ttgaaattga aaaggtcgag gactctatct tggatgagct tcaaactgaa 3420
cttaatgcgc cagcggacaa gacctatgag gatgtcttgg cattcgatgc cgtatgctca 3480
gaggcgttgt ctgcattcta tgctgtgccg agtgatgaga cgcactttaa agtgtgtgga 3540
ttctattcgc ctgctataga gcgcactaat tgttggctgc gttctacttt gatagtaatg 3600
cagagtctac ctttggaatt taaagacttg gagatgcaaa agctctggtt gtcttacaag 3660
gccggctatg accaatgctt tgtggacaaa ctagttaaga gcgtgcccaa gtctattatc 3720
cttccacaag gtggttatgt ggcagatttt gcctatttct ttctaagcca gtgtagcttt 3780
aaagcttatg ctaactggcg ttgtttagag tgtgacatgg agttaaagct tcaaggcttg 3840
gacgccatgt ttttctatgg ggacgttgtg tctcatatgt gcaagtgtgg taatagcatg 3900
accttgttgt ctgcagatat accctacact ttgcattttg gagtgcgaga tgataagttt 3960
tgcgcttttt acacgccaag aaaggtcttt agggctgctt gtgcggtaga tgttaatgat 4020
tgtcactcta tggctgtagt agagggcaag caaattgatg gtaaagtggt taccaaattt 4080
attggtgaca aatttgattt tatggtgggt tacgggatga catttagtat gtctcctttt 4140
gaactcgccc agttatatgg ttcatgtata acaccaaatg tttgttttgt taaaggagat 4200
gttataaagg ttgttcgctt agttaatgct gaagtcattg ttaaccctgc taatgggcgt 4260
atggctcatg gtgccggcgt cgccggcgcc atagctgaaa aggcgggcag tgcttttatt 4320
aaagaaacct ccgatatggt gaaggctcag ggcgtttgcc aggttggtga atgctatgaa 4380
tctgccggtg gtaagttatg taaaaaggtg cttaacattg tagggccaga tgcgcgaggg 4440
catggcaagc aatgctattc acttttagag cgtgcttatc agcatattaa taagtgtgac 4500
aatgttgtca ctactttaat ttcggctggt atatttagtg tgcctactga tgtctcccta 4560
acttacttac ttggtgtagt gacaaagaat gtcattcttg tcagtaacaa ccaggatgat 4620
tttgatgtga tagagaagtg tcaggtgacc tccgttgctg gtaccaaagc gctatcactt 4680
caattggcca aaaatttgtg ccgtgatgta aagtttgtga cgaatgcatg tagttcgctt 4740
tttagtgaat cttgctttgt ctcaagctat gatgtgttgc aggaagttga agcgctgcga 4800
catgatatac aattggatga tgatgctcgt gtctttgtgc aggctaatat ggactgtctg 4860
cccacagact ggcgtctcgt taacaaattt gatagtgttg atggtgttag aaccattaag 4920
tattttgaat gcccgggcgg gatttttgta tccagccagg gcaaaaagtt tggttatgtt 4980
cagaatggtt catttaagga ggcgagtgtt agccaaataa gggctttact cgctaataag 5040
gttgatgtct tgtgtactgt tgatggtgtt aacttccgct cctgctgcgt agcagagggt 5100
gaagtttttg gcaagacatt aggttcagtc ttttgtgatg gcataaatgt caccaaagtt 5160
aggtgtagtg ccatttacaa gggtaaggtt ttctttcagt acagtgattt gtccgaggca 5220
gatcttgtgg ctgttaaaga tgcctttggt tttgatgaac cacaactgct gaagtactac 5280
actatgcttg gcatgtgtaa gtggccagta gttgtttgtg gcaattattt tgctttcaag 5340
cagtcaaata ataattgcta catcaacgtg gcatgtttaa tgctgcaaca cttgagttta 5400
aagtttccta agtggcaatg gcaagaggct tggaacgagt tccgctctgg taaaccacta 5460
aggtttgtgt ccttggtatt agcaaagggc agctttaaat ttaatgaacc ttctgattct 5520
atcgatttta tgcgtgtggt gctacgtgaa gcagatttga gtggtgccac gtgcaatttg 5580
gaatttgttt gtaaatgtgg tgtgaagcaa gagcagcgca aaggtgttga cgctgttatg 5640
cattttggta cgttggataa aggtgatctt gtcaggggtt ataatatcgc atgtacgtgc 5700
ggtagtaaac ttgtgcattg cacccaattt aacgtaccat ttttaatttg ctccaacaca 5760
ccagagggta ggaaactgcc cgacgatgtt gttgcagcta atatttttac tggtggtagt 5820
gtgggccatt acacgcatgt gaaatgtaaa cccaagtacc agctttatga tgcttgtaat 5880
gttaataagg tttcggaggc taagggtaat tttaccgatt gcctctacct taaaaattta 5940
aagcaaacct tctcgtctgt gctgacgact ttttatttag atgacgtaaa gtgtgtggag 6000
tataagccag atttatcgca gtattactgt gagtctggta aatattatac aaaacccatt 6060
attaaggccc aatttagaac atttgagaag gttgatggtg tctataccaa ctttaaattg 6120
gtgggacata gtattgctga aaaactcaat gctaagctgg gatttgattg taattctccc 6180
tttgtggagt acaaaattac agagtggcca acagctactg gagatgtggt gttggctagt 6240
gatgatttgt atgtaagtcg gtacttaagc gggtgcatta cttttggtaa accggttgtc 6300
tggcttggcc atgaggaagc atcgctgaaa tctctcacat attttaatag acctagtgtc 6360
gtttgtgaaa ataaatttaa cgtgttgccc gttgatgtca gtgaacccac ggacaagggg 6420
cctgtgcctg ctgcagtcct tgttaccggc gtccctggag ctgatgcgtc agctggtgcc 6480
ggtattgcca aggagcaaaa agcctgtgct tctgctagtg tggaggatca ggttgttacg 6540
gaggttcgtc aagagccatc tgtttcagct gctgatgtca aagaggttaa attgaatggt 6600
gttaaaaagc ctgttaaggt ggaaggtagt gtggttgtta atgatcccac tagcgaaacc 6660
aaagttgtta aaagtttgtc tattgttgat gtctatgata tgttcctgac agggtgtaag 6720
tatgtggttt ggactgctaa tgagttgtct cgactagtaa attcaccgac tgttagggag 6780
tatgtgaagt ggggtatggg aaagattgta acacccgcta agttgttgtt gttaagagat 6840
gagaagcaag agttcgtagc gccaaaagta gtcaaggcga aagctattgc ctgctattgt 6900
gctgtgaagt ggtttctcct ctattgtttt agttggataa agtttaatac tgacaataag 6960
gttatataca ccacagaagt agcttcaaag cttactttca agttgtgctg tttggccttt 7020
aagaatgcct tacagacgtt taattggagc gttgtgtcta ggggcttttt cctagttgca 7080
acggtctttt tactctggtt taactttttg tatgctaatg ttattttgag tgacttctat 7140
ttgcctaata ttgggcctct ccctacgttt gtgggacaga tagttgcgtg gtttaagact 7200
acatttggtg tgtcaaccat ctgtgatttc taccaggtga cggatttggg ctatagaagt 7260
tcgttttgta atggaagtat ggtatgtgaa ctatgcttct caggttttga tatgctggac 7320
aactatgatg ctataaatgt tgttcaacac gttgtagata ggcgtttgtc ctttgactat 7380
attagcctat ttaaactggt agttgagctt gtaatcggct actctcttta tactgtgtgc 7440
ttctacccac tgtttgtcct tattggaatg cagttattga ccacatggtt gcctgaattc 7500
tttatgctgg agactatgca ttggagtgct cgtttgtttg tgtttgttgc caatatgctt 7560
ccagctttta cgttactgcg attttacatc gtggtgacag ctatgtataa ggtctattgt 7620
ctttgtagac atgttatgta tggatgtagt aagcctggtt gcttgttttg ttataagaga 7680
aaccgtagtg tccgtgttaa gtgtagcacc gttgttggtg gttcactacg ctattacgat 7740
gtaatggcta acggcggcac aggtttctgt acaaagcacc agtggaactg tcttaattgc 7800
aattcctgga aaccaggcaa tacattcata actcatgaag cagcggcgga cctctctaag 7860
gagttgaaac gccctgtgaa tccaacagat tctgcttatt actcggtcac agaggttaag 7920
caggttggtt gttccatgcg tttgttctac gagagagatg gacagcgtgt ttatgatgat 7980
gttaatgcta gtttgtttgt ggacatgaat ggtctgctgc attctaaagt taaaggtgtg 8040
cctgaaacgc atgttgtggt tgttgagaat gaagctgata aagctggttt tctcggcgcc 8100
gcagtgtttt atgcacaatc gctctacaga cctatgttga tggtggaaaa gaaattaata 8160
actaccgcca acactggttt gtctgttagt cgaactatgt ttgaccttta tgtagattca 8220
ttgctgaacg tcctcgacgt ggatcgcaag agtctaacaa gttttgtaaa tgctgcgcac 8280
aactctctaa aggagggtgt tcagcttgaa caagttatgg atacctttat tggctgtgcc 8340
cgacgtaagt gtgctataga ttctgatgtt gaaaccaagt ctattaccaa gtccgtcatg 8400
tcggcagtaa atgctggcgt tgattttacg gatgagagtt gtaataactt ggtgcctacc 8460
tatgttaaaa gtgacactat cgttgcagcc gatttgggtg ttcttattca gaataatgct 8520
aagcatgtac aggctaatgt tgctaaagcc gctaatgtgg cttgcatttg gtctgtggat 8580
gcttttaacc agctatctgc tgacttacag cataggctgc gaaaagcatg ttcaaaaact 8640
ggcttgaaga ttaagcttac ttataataag caggaggcaa atgttcctat tttaactaca 8700
ccgttctctc ttaaaggggg cgctgttttt agtagaatgt tacaatggtt gtttgttgct 8760
aatttgattt gtttcattgt gttgtgggcc cttatgccaa catatgcagt gcacaaatcg 8820
gatatgcagt tgcctttata tgccagtttt aaagttatag ataacggtgt gctaagggat 8880
gtgtctgtta ctgacgcatg cttcgcaaac aaatttaatc aattcgacca atggtatgag 8940
tctacttttg gtcttgctta ttaccgcaac tctaaggctt gtcctgttgt ggttgctgta 9000
atagatcaag acattggcca taccttattt aatgttccta ccacagtttt aagatatgga 9060
tttcatgtgt tgcattttat aacccatgca tttgctactg atagcgtgca gtgttacacg 9120
ccacatatgc aaatccccta tgataatttc tatgctagtg gttgcgtgtt gtcatccctc 9180
tgtactatgc ttgcgcatgc agatggaacc ccgcatcctt attgttatac agggggtgtt 9240
atgcataatg cctctctgta tagttctttg gctcctcatg tccgttataa cctggctagt 9300
tcaaatggtt atatacgttt tcccgaagtg gttagtgaag gcattgtgcg tgttgtgcgc 9360
actcgctcta tgacctactg cagggttggt ttatgtgagg aggccgagga gggtatctgc 9420
tttaatttta atcgttcatg ggtattgaac aacccgtatt atagggccat gcctggaact 9480
ttttgtggta ggaatgcttt tgatttaata catcaagttt taggaggatt agtgcggcct 9540
attgatttct ttgccttaac ggcgagttca gtggctggtg ctatccttgc aattattgtc 9600
gttttggctt tctattattt aatcaagctt aagcgtgcct ttggtgacta cactagtgtt 9660
gtggttatca atgtaattgt gtggtgtata aattttctga tgctttttgt gtttcaggtt 9720
tatcccacat tgtcttgttt atatgcttgt ttctacttct acaccacgct ttatttccct 9780
tcggagataa gtgttgttat gcatttgcaa tggcttgtca tgtatggtgc tattatgccc 9840
ttgtggtttt gcattattta cgtggcagtc gttgtttcaa accatgcatt gtggttgttc 9900
tcttactgcc gcaaaattgg taccgaggtt cgtagtgacg gcacatttga ggaaatggcc 9960
cttactacct ttatgattac taaagaatct tattgtaagt tgaaaaactc tgtttctgat 10020
gttgctttta acaggtactt gagtctttac aacaagtacc gttacttcag tggcaaaatg 10080
gatactgccg cttatagaga ggctgcctgt tcacaactgg caaaggcaat ggaaacattt 10140
aaccataata atggtaatga tgttctctat cagcctccaa ccgcctctgt tactacatca 10200
tttttacagt ctggtatagt gaagatggtg tcgcccacct ctaaagtgga gccttgtatt 10260
gttagtgtta cttatggtaa catgacactt aatgggttgt ggttggatga taaagtttat 10320
tgcccaagac atgttatctg ttcttcagct gacatgacag accctgatta tcctaatttg 10380
ctttgtagag tgacatcaag tgatttttgt gttatgtctg gtcgtatgag ccttactgta 10440
atgtcttatc aaatgcaggg ctgccaactt gttttgactg ttacactgca aaatcctaac 10500
acgcctaagt attccttcgg tgttgttaag cctggtgaga catttactgt actggctgca 10560
tacaatggca gacctcaagg agccttccat gttacgcttc gtagtagcca taccataaag 10620
ggctcctttc tatgtggatc ctgcggttct gtaggatatg ttttaactgg cgatagtgta 10680
cgatttgttt atatgcatca gctagagttg agtactggtt gtcataccgg tactgacttt 10740
agtgggaact tttatggtcc ctatagagat gcgcaagttg tacaattgcc tgttcaggat 10800
tatacgcaga ctgttaatgt tgtagcttgg ctttatgctg ctatttttaa cagatgcaac 10860
tggtttgtgc aaagtgatag ttgttccctg gaggagttta atgtttgggc tatgaccaat 10920
ggttttagct caatcaaagc cgatcttgtc ttggatgcgc ttgcttctat gacaggcgtt 10980
acagttgaac aggtgttggc cgctattaag aggctgcatt ctggattcca gggcaaacaa 11040
attttaggta gttgtgtgct tgaagatgag ctgacaccaa gtgatgttta tcaacaacta 11100
gctggtgtca agctacagtc aaagcgcaca agagttataa aaggtacatg ttgctggata 11160
ttggcttcaa cgtttttgtt ctgtagcatt atctcagcat ttgtaaaatg gactatgttt 11220
atgtatgtta ctacccatat gttgggagtg acattgtgtg cactttgttt tgtaagcttt 11280
gctatgttgt tgatcaagca taagcatttg tatttaacta tgtacatcat gcctgtgtta 11340
tgcacactgt tttacaccaa ctatttggtt gtgtacaaac agagttttag aggtctagct 11400
tatgcttggc tttcacactt tgtccctgct gtagattata catatatgga tgaagtttta 11460
tatggtgttg tgttgctagt agctatggtg tttgttacca tgcgtagcat aaaccacgac 11520
gtcttttcta ttatgttctt ggttggtaga cttgtcagcc tggtatccat gtggtatttt 11580
ggagccaatt tagaggaaga ggtactattg ttcctcacat ccctatttgg cacgtacaca 11640
tggactacta tgttgtcatt ggctaccgct aaggttattg ctaaatggtt ggctgtgaat 11700
gtcttgtact tcacagacgt accgcaaatt aaattagttc tgttgagcta cttgtgtatt 11760
ggttatgtgt gttgttgtta ttggggaatc ttgtcactcc ttaatagcat ttttaggatg 11820
ccattgggcg tctacaatta taaaatctcc gttcaggagt tacgttatat gaatgctaat 11880
ggcttgcgcc cacctagaaa tagttttgag gccctgatgc ttaattttaa gctgttggga 11940
attggtggtg tgccagtcat tgaagtatct caaattcaat caagattgac ggatgttaaa 12000
tgtgctaatg ttgtgttgct taattgcctc cagcacttgc atattgcatc taattctaag 12060
ttgtggcagt attgtagtac tttgcacaat gaaatactgg ctacatctga tttgagcgtg 12120
gccttcgata agttggctca actcttagtt gttttatttg ctaatccagc agcagtggat 12180
agcaagtgcc ttgcaagtat tgaagaagtg agcgatgatt acgttcgcga caatactgtc 12240
ttgcaagcct tacagagtga atttgttaat atggctagct tcgttgagta tgaacttgct 12300
aagaagaatc tagatgaggc taaggctagc ggctctgcca atcaacagca gattaagcag 12360
ctagagaagg cgtgtaatat tgctaagtca gcatatgagc gcgacagagc tgttgctcgt 12420
aagctggaac gtatggctga tttagctctt acaaacatgt ataaagaagc tagaattaat 12480
gataagaaga gtaaggtagt gtctgcattg caaaccatgc tctttagtat ggtgcgtaag 12540
ctagataacc aagctcttaa ttctatttta gacaacgcag ttaagggttg tgtacctttg 12600
aatgcaatac catcattgac ttcgaacact ctgactataa tagtgccaga taagcaggtt 12660
tttgatcagg ttgtggataa tgtgtatgtc acctatgctg ggaatgtatg gcatatacag 12720
tttattcaag atgctgatgg tgctgttaaa caattgaatg agatagatgt taattcaacc 12780
tggcctctag tcattgctgc aaataggcat aatgaagtgt ctactgttgt tttgcagaac 12840
aatgagttga tgcctcagaa gttgagaact caggttgtca atagtggctc agatatgaat 12900
tgtaatactc ctacccagtg ttactataat actactggca cgggtaagat tgtgtatgct 12960
atacttagtg actgtgacgg cctgaagtac actaagatag taaaagaaga tggaaattgt 13020
gttgttttgg aattggatcc tccctgtaag ttttctgttc aggatgtgaa gggccttaaa 13080
attaagtacc tttactttgt gaaggggtgt aatacactgg ctagaggctg ggttgtaggc 13140
accttatcct cgacagtgag attgcaggcg ggtacggcaa ctgagtatgc ctccaactct 13200
gcaatactgt cgctgtgtgc gttttctgta gatcctaaga aaacgtactt ggattatata 13260
aaacagggtg gagttcccgt tactaattgt gttaagatgt tatgtgacca tgctggcact 13320
ggtatggcca ttactattaa gccggaggca accactaatc aggattctta tggtggtgct 13380
tccgtttgta tatattgccg ctcgcgtgtt gaacatccag atgttgatgg attgtgcaaa 13440
ttacgcggca agtttgtcca agtgccctta ggcataaaag atcctgtgtc atatgtgttg 13500
acgcatgatg tttgtcaggt ttgtggcttt tggcgagatg gtagctgttc ctgtgtaggc 13560
acaggctccc agtttcagtc aaaagacacg aactttttaa acggattcgg ggtacaagtg 13620
taaatgcccg tcttgtaccc tgtgccagtg gcttggacac tgatgttcaa ttaagggcat 13680
ttgacatttg taatgctaat cgagctggca ttggtttgta ttataaagtg aattgctgcc 13740
gcttccagcg tgtagatgag gacggcaaca agttggataa gttctttgtt gttaaaagaa 13800
ctaatttaga agtgtataac aaggagaaag aatgctatga gttgacaaaa gaatgcggtg 13860
ttgtggctga acacgagttc ttcacatttg atgtggaggg aagtcgggta ccacacatag 13920
tccgtaaaga tctttcaaag tttactatgt tagatctttg ctatgcattg cgtcattttg 13980
accgcaatga ttgttcaact cttaaggaaa ttctccttac atatgctgag tgtgaagagt 14040
cctacttcca aaagaaggac tggtatgatt ttgttgagaa tcctgatata attaatgtgt 14100
acaagaagct tggtcctata tttaatagag ccctgcttaa cactgccaag tttgcagacg 14160
cattagtgga ggcaggctta gtaggtgttt taacacttga taatcaagat ttatatggtc 14220
aatggtatga ctttggagat tttgtcaaga cagtacctgg ttgtggtgtt gccgtggcag 14280
actcttatta ttcatatatg atgccaatgc tgactatgtg tcatgcgttg gatagtgagt 14340
tgtttgttaa tggtacttat agggagtttg accttgttca gtatgatttt actgatttca 14400
agctagagct gttcactaag tattttaagc attggagtat gacctaccac ccgaacacct 14460
gtgagtgcga ggatgacagg tgcattattc attgcgccaa ttttaatata cttttcagca 14520
tggtcttacc taagacctgt tttgggcctc ttgttaggca gatatttgtg gatggtgttc 14580
ctttcgttgt gtcgatcggt taccattata aagaattagg tgttgttatg aatatggatg 14640
tggatacaca tcgttatcgc ttgtctctta aggacttgct tttgtatgct gcagaccctg 14700
cccttcatgt ggcgtctgct agtgcactgc ttgatttgcg cacatgttgt tttagcgttg 14760
cagctattac aagtggcgta aaatttcaaa cagttaaacc tggaaatttt aatcaggatt 14820
tctacgagtt tattttgagt aaaggcctgc ttaaagaggg gagctccgtt gatttgaagc 14880
acttcttctt tacgcaggat ggtaatgctg ctattactga ttacaattac tacaagtata 14940
atctacccac catggtggat attaagcagt tgttgtttgt tttagaagtt gttaataagt 15000
acttcgagat ctatgagggt gggtgtatac ccgcaacaca ggtcattgtt aataattatg 15060
acaagagtgc tggctatcca tttaataaat ttggaaaggc caggctctat tatgaggcat 15120
tatcatttga ggagcaggat gaaatttatg cgtataccaa acgcaatgtc ctgccgaccc 15180
taactcaaat gaatcttaaa tatgctatta gtgctaagaa tagggcccgc accgttgctg 15240
gtgtctctat tctcagtact atgactggca gaatgtttca tcaaaagtgt ctaaagagta 15300
tagcagctac tcgcggtgtt cctgtagtta taggcaccac gaagttctat ggcggttggg 15360
atgatatgtt acgccgcctt attaaagatg ttgatagtcc tgtactcatg ggttgggact 15420
atcctaaatg tgatcgtgct atgccaaaca tactgcgtat tgttagtagt ttggtgctag 15480
cccgtaaaca tgattcgtgc tgttcgcata cggatagatt ctatcgtctt gcgaacgagt 15540
gcgcccaagt tttgagtgaa attgttatgt gtggtggttg ttattatgtt aaaccaggtg 15600
gcactagtag tggggatgca accactgctt ttgctaattc tgtgtttaac atttgtcaag 15660
ctgtttccgc caatgtatgc tcgcttatgg catgcaatgg acacaaaatt gaagatttga 15720
gtatacgcga gttacaaaag cgcctatact ctaatgtcta tcgtgcggac catgttgacc 15780
ccgcatttgt tagtgagtat tatgagtttt taaacaagca ttttagtatg atgattttga 15840
gtgatgatgg tgttgtgtgt tataattcag agtttgcgtc caagggttat attgctaata 15900
taagtgcctt tcaacaggta ttatattatc aaaacaacgt gtttatgtct gaggccaaat 15960
gttgggtaga aacagacatc gaaaagggac cgcatgaatt ttgttctcaa catacaatgc 16020
tagtcaagat ggatggtgat gaagtctacc ttccataccc tgatccttcg agaatcttag 16080
gagcaggctg ttttgttgat gatttactca agactgatag cgttctcttg atagagcgtt 16140
tcgtaagtct tgcaattgat gcttatcctt tagtatacca tgagaaccca gagtatcaaa 16200
atgtgttccg ggtatattta gaatacatca agaagctgta caatgatctc ggtaatcaga 16260
tcctggacag ctacagtgtt attttaagta cttgtgatgg tcaaaagttt actgacgaga 16320
cgttttacaa gaacatgtat ttaagaagtg cagtgctgca aagcgttggt gcctgcgttg 16380
tctgtagttc tcaaacatca ttacgttgtg gcagttgcat acgcaagcct ttgctgtgtt 16440
gcaaatgcgc ctatgatcat gttatgtcca ctgatcataa atatgtcctg agtgtgtcac 16500
catatgtgtg taattcaccg ggatgtgatg taaatgatgt taccaaattg tatttaggtg 16560
gtatgtcata ttattgtgag gaccataaac cacagtattc attcaaattg gtgatgaatg 16620
gtatggtttt tggtttatat aagcagtctt gtactggttc gccctacata gaggatttta 16680
ataaaatcgc tagttgcaaa tggacagaag tcgatgatta tgtgctagct aatgaatgca 16740
ccgaacgcct taaattgttt gccgcagaaa cgcagaaggc cacagaagag gcctttaagc 16800
aatgttatgc gtcagcaacg atccgtgaga tcgtgagcga tcgggagtta attttatctt 16860
gggaaattgg taaagtccgc ccgccactta ataaaaatta cgtgttcacc ggctaccatt 16920
ttactaataa tggtaagaca gttttaggtg agtatgtttt tgataagagt gagttgacta 16980
atggtgtgta ttatcgcgcc acaaccactt ataagttatc tgtaggtgat gtgttcattt 17040
taacatcaca cgcagtgtct agtttaagtg ctcctacatt agtaccgcag gagaattata 17100
ctagcattcg ttttgctagt gtttatagtg tgcctgagac gtttcagaat aatgtgccta 17160
attatcagca cattggaatg aagcgctatt gtactgtaca gggaccgcct ggtactggta 17220
agtcccatct agccattggg ctagctgttt attattgtac agcgcgcgtg gtgtataccg 17280
ctgctagcca tgctgcagtt gacgcgctgt gtgaaaaggc acataaattt ctcaacatca 17340
acgactgcac gcgtattgtt cctgcaaagg tgcgtgtaga ttgttatgat aaattcaagg 17400
tcaatgacac cactcgcaag tatgtgttta ctacaataaa tgcattacct gagttggtga 17460
ctgacattat tgtcgttgat gaagttagta tgcttaccaa ctatgagctg tctgttatta 17520
acagtcgtgt tagggctaag cattatgtgt atattggcga cccggcgcag ttacctgcac 17580
cacgtgtgct actgaataag ggaactctag aacctagata ttttaattcc gttaccaagc 17640
taatgtgttg tttgggtcca gatattttct tgggcacctg ttatagatgc cctaaggaga 17700
ttgtggatac ggtgtcagcc ttggtttata ataataagct gaaggctaaa aatgataata 17760
gctccatgtg ctttaaggtt tattataagg gccagactac acatgagagt tctagtgctg 17820
ttaatatgca gcaaatacat ttaatttcca agtttctgaa ggcaaacccc agttggagta 17880
acgccgtatt tattagtcct tataactcgc agaactatgt tgctaagaga gtcttgggat 17940
tacaaaccca gacagtagac tcagcgcagg gttctgaata tgattttgtt atctactcac 18000
agactgcgga aacagcgcat tctgtcaatg taaatagatt caatgttgct attacacgtg 18060
ctaagaaggg tattctctgt gtcatgagta gtatgcaatt atttgagtct cttaatttta 18120
ctacactgac gttggataag attaacaatc cacgattaca gtgtactaca aatttgttta 18180
aggattgtag caggagctat gtaggatatc acccagccca tgcaccatcc tttttggcag 18240
ttgatgacaa atataaggta ggcggtgatt tagccgtttg ccttaatgtt gctgattctg 18300
ctgtcactta ttcgcggctt atatcactca tgggattcaa gcttgacttg acccttgatg 18360
gttattgtaa gctgtttata actagagatg aagctatcaa acgtgttaga gcctgggttg 18420
gcttcgatgc agaaggtgcc catgcgatac gtgatagcat tgggacaaat ttcccattac 18480
aattaggctt ttcgactgga attgattttg ttgtcgaagc cactggaatg tttgctgaga 18540
gagatggtta tgtctttaaa aaggcagccg cacgagctcc tcctggcgaa caatttaaac 18600
accttatccc acttatgtca agagggcaga aatgggatgt ggttcgcatt agaatagtac 18660
aaatgttgtc agaccaccta gtggatttgg cagacagtgt tgtacttgtg acgtgggctg 18720
ccagctttga gctcacatgt ttgcgatatt tcgctaaagt tggaagagaa gttgtgtgta 18780
gtgtctgcac caagcgtgcg acatgtttta attctagaac tggatactat ggatgctggc 18840
gacatagtta ttcctgtgat tacctgtaca acccactaat agttgacatt caacagtggg 18900
gatatacagg atctttaact agcaatcatg atcctatttg cagcgtgcat aagggtgctc 18960
atgttgcatc atctgatgct atcatgaccc ggtgtctagc tgttcatgat tgcttttgta 19020
agtctgttaa ttggaattta gaatacccca ttatttcaaa tgaggtcagt gttaatacct 19080
cctgcaggtt attgcagcgc gtaatgttta gggctgcgat gctatgcaat aggtatgatg 19140
tgtgttatga cattggcaac cctaaaggtc ttgcctgtgt caaaggatat gattttaagt 19200
tctatgacgc ctcccctgtt gttaagtcgg tcaaacagtt tgtttacaaa tacgaggcac 19260
ataaagatca atttttagat ggtttgtgta tgttttggaa ctgcaatgtg gataagtatc 19320
cagcgaatgc agttgtgtgt aggtttgaca cgcgtgtgtt gaacaaatta aatctccctg 19380
gctgtaatgg tggcagtttg tatgttaaca aacatgcatt ccacaccagt ccctttaccc 19440
gggctgcctt cgagaatttg aagcctatgc ctttctttta ttattcagat acgccctgtg 19500
tgtatatgga aggcatggaa tctaagcagg tcgattatgt cccattgaga agcgctacat 19560
gcatcacaag atgcaattta ggtggcgctg tttgtttaaa acatgctgag gagtatcgtg 19620
agtaccttga gtcttacaat acggcaacca cagcgggttt tactttttgg gtctataaga 19680
cttttgattt ttacaacctt tggaatactt ttactaggct ccaaagttta gaaaatgtag 19740
tgtataacct ggtcaacgct ggacactttg atggccgggc gggtgaactg ccttgtgctg 19800
ttataggtga gaaagtcatt gccaagattc aaaatgagga tgtcgtggtc tttaaaaata 19860
acacgccatt ccccactaat gtggctgtcg aattatttgc taagcgcagt attcggcccc 19920
accccgagct taagctcttt agaaatttga atattgacgt gtgctggagt cacgtccttt 19980
gggattatgc taaggatagt gtgttttgca gttcgacgta taaggtctgc aaatacacag 20040
atttacagtg cattgaaagc ttgaatgtac tttttgatgg tcgtgataat ggtgctcttg 20100
aagcttttaa gaagtgccgg aatggcgtct acattaacac gacaaaaatt aaaagtctgt 20160
cgatgattaa aggcccacaa cgtgccgatt tgaatggcgt agttgtggag aaagttggag 20220
attctgatgt ggaattttgg tttgctgtgc gtaaagacgg tgacgatgtt atcttcagcc 20280
gtacagggag ccttgaaccg agccattacc ggagcccaca aggtaatccg ggtggtaatc 20340
gcgtgggtga tctcagcggt aatgaagctc tagcgcgtgg cactatcttt actcaaagca 20400
gattattatc ttctttcaca cctcgatcag agatggagaa agattttatg gatttagatg 20460
atgatgtgtt cattgcaaaa tatagtttac aggactacgc gtttgaacac gttgtttatg 20520
gtagttttaa ccagaagatt attggaggtt tgcatttgct tattggctta gcccgtaggc 20580
agcaaaaatc caatctggta attcaagagt tcgtgacata cgactctagc attcattcgt 20640
actttatcac tgacgagaac agtggtagta gtaagagtgt gtgcactgtt attgatttat 20700
tgttagatga ttttgtggac attgtaaagt ccctgaatct aaagtgtgtg agtaaggttg 20760
ttaatgttaa tgtggatttt aaggacttcc agtttatgtt gtggtgcaat gaggagaagg 20820
tcatgacttt ctatcctcgt ttgcaggctg ctgctgactg gaaacctggt tatgttatgc 20880
ctgtcttata taagtatttg gaatcgcctc tggaaagagt aaacctctgg aattatggca 20940
agccgattac tttacctaca ggatgtatga tgaatgttgc taagtatact caattatgtc 21000
aatatttgag cactacaaca ttagcagttc cggctaatat gcgtgtctta caccttggtg 21060
ccggttcgga taagggtgtt gcccctgggt ctgcagttct taggcagtgg ctaccagcgg 21120
gaagtattct tgtagataat gatgtgaatc catttgtgag tgacagtgtc gcctcatatt 21180
atggaaattg tataacctta ccctttgatt gtcagtggga tctgataatt tctgatatgt 21240
acgaccctct tactaagaac attggggagt acaacgtgag taaagatgga ttctttactt 21300
acctctgtca tttaattcgt gacaagttgg ctctgggtgg cagtgttgcc ataaaaataa 21360
cagagttttc ttggaacgct gagttatata gtttaatggg gaagtttgcg ttctggacaa 21420
tcttttgcac caacgtaaac gcctcttcaa gtgaaggatt tttgattggc ataaattggt 21480
tgaataagac ccgtaccgaa attgacggta aaaccatgca tgccaattat ctgttttgga 21540
gaaatagtac aatgtggaat ggaggggctt acagtctctt tgacatgagt aagttccctt 21600
tgaaagcggc tggtacggct gttgttagcc ttaaaccaga ccaaataaat gacttagtcc 21660
tctccttgat tgagaagggc aagttattag tgcgtgatac acgcaaagaa gtttttgttg 21720
gcgatagcct agtaaatgtc aaataaacga acaatgtttg tttttcttgt tttattgcca 21780
ctagtctcta gtcagtgtgt taatcttaca accagaactc aattaccccc tgcatacact 21840
aattctttca cacgtggtgt ttattaccct gacaaagttt tcagatcctc agttttacat 21900
tcaactcagg acttgttctt acctttcttt tccaatgtta cttggttcca tgctatacat 21960
gtctctggga ccaatggtac taagaggttt gataaccctg tcctaccatt taatgatggt 22020
gtttactttg cttccactga gaagtctaac ataataagag gctggatttt tggtactact 22080
ttagattcga aaacccagtc cctacttatt gttaataacg ctactaatgt tgttatcaaa 22140
gtctgtgaat ttcaattttg taacgatcca tttttgggtg tttattacca caaaaacaac 22200
aaaagttgga tggaaagtga gttcagagtt tattctagtg cgaataattg cacttttgaa 22260
tacgtctctc agccttttct tatggacctt gaaggaaaac agggtaattt caaaaatctt 22320
agggaatttg tgttcaagaa tattgatggt tacttcaaga tatactctaa gcacacgcct 22380
attaatttag tgcgtgatct ccctcagggt ttttcggctt tagaaccatt ggtagatttg 22440
ccaataggta ttaacatcac taggtttcaa actttacttg ctttacatag aagttattta 22500
actcctggtg attcttcttc aggttggaca gctggtgctg cagcttatta tgtgggttat 22560
cttcaaccta ggacttttct actgaagtac aatgaaaatg gaaccattac agatgctgta 22620
gactgtgcac ttgaccctct ctcagaaaca aagtgtacgt tgaaatcctt cactgtagaa 22680
aaaggaatct atcaaacttc taactttaga gtccaaccaa cagaatctat tgttagattt 22740
cctaacatca caaacttgtg cccttttggt gaagttttta acgccaccag atttgcatct 22800
gtttatgctt ggaacaggaa gagaatcagc aactgtgttg ctgattattc tgtcctgtat 22860
aattccgcat cattttccac ttttaagtgt tatggagtgt ctcctactaa attaaatgat 22920
ctctgcttta ctaatgtcta tgcagattca tttgtaatta gaggtgatga agtcagacaa 22980
atcgctccag ggcaaactgg aaagattgct gattataact acaaattacc agatgatttt 23040
acaggctgcg ttatagcttg gaattctaac aatcttgatt ctaaggttgg tggtaattat 23100
aattacctgt acagattgtt taggaagtct aatctcaaac cttttgagag agatatttca 23160
actgaaatct atcaggccgg tagcacacct tgtaatggtg ttgaaggttt taattgttac 23220
tttcctctgc aatcatatgg tttccaaccc actaatggtg ttggttacca accatacaga 23280
gtagtagtac tttcttttga acttctacat gcaccagcaa ctgtttgtgg acctaaaaag 23340
tctactaatt tggttaagaa caagtgtgtc aatttcaact tcaatggttt aacaggcaca 23400
ggtgttctta ctgagtctaa caaaaagttt ctgcctttcc aacaatttgg cagagacatt 23460
gctgacacta ctgatgctgt tcgtgatcca caaacacttg agattcttga cattacacca 23520
tgttcttttg gtggtgtcag tgttataaca ccaggaacaa atacttctaa ccaggttgct 23580
gttctttatc aggatgttaa ctgcacagaa gtccctgttg ctattcatgc agatcaactt 23640
actcctactt ggcgtgttta ttctacaggt tctaatgttt ttcaaacacg tgcaggctgt 23700
ttaatagggg ctgaacatgt caacaactca tatgagtgtg acatacccat tggtgcaggt 23760
atatgcgcta gttatcagac tcagactaat tctcctcgga gagcaagaag tgtagctagt 23820
caatccatca ttgcctacac tatgtcactt ggtgcagaaa attcagttgc ttactctaat 23880
aactctattg ccatacccac aaattttact attagcgtta ccacagaaat tctaccagtg 23940
tctatgacca agacatcagt agattgtaca atgtacattt gtggtgattc aactgaatgc 24000
agcaatcttt tgttgcaata tggcagtttt tgtacacaat taaaccgtgc tttaactgga 24060
atagctgttg aacaagacaa aaacacccaa gaagtttttg cacaagtcaa acaaatttac 24120
aagacaccac caattaaaga ttttggcggt tttaatttta gccagatact gccagatcca 24180
tcaaaaccaa gcaagaggtc atttattgaa gatctactgt tcaacaaagt gacacttgca 24240
gatgctggct tcatcaaaca atatggtgat tgccttggtg atattgctgc tagagacctc 24300
atttgtgcac aaaagtttaa cggccttact gttttgccac ctttgctcac agatgaaatg 24360
attgctcaat acacttctgc actgttagca ggtacaatca cttctggttg gacttttggt 24420
gcaggtgctg cattacaaat accatttgct atgcaaatgg cttataggtt taatggtatt 24480
ggagttacac agaatgttct ctatgagaac caaaaattga ttgccaacca atttaatagt 24540
gctattggca aaattcaaga ctcactttct tccacagcaa gtgcacttgg aaaacttcaa 24600
gatgtggtca accaaaatgc acaagcttta aacacgcttg ttaaacaact tagctccaat 24660
tttggtgcaa tttcaagtgt tttaaacgac atcctttcac gtcttgacaa agttgaggct 24720
gaagtgcaaa ttgataggtt gatcacaggc agacttcaaa gtttgcagac atatgtgact 24780
caacaattaa ttagagctgc agaaatcaga gcttctgcta atcttgctgc tactaaaatg 24840
tcagagtgtg tacttggaca atcaaaaaga gttgactttt gcggaaaggg ctatcatctt 24900
atgtcatttc ctcagtcagc acctcatggt gtcgtctttt tgcatgtgac ttatgtccct 24960
gcacaagaaa agaacttcac aactgctcct gccatttgtc atgatggaaa agcacacttt 25020
cctcgtgaag gtgtctttgt ttcaaatggc acacactggt ttgtaacaca aaggaatttt 25080
tatgaaccac aaatcattac tacagacaac acatttgtgt ctggtaactg tgatgttgta 25140
ataggaattg tcaacaacac agtttatgat cctttgcaac ctgaattaga ctcattcaag 25200
gaggagcttg ataaatactt caagaaccat acctcaccag atgttgattt aggtgacatc 25260
tctggcatta atgcttcagt tgtaaacatt cagaaagaaa tcgaccgcct caatgaggtt 25320
gccaagaatt taaatgaatc tctcatcgat ctccaagaac ttggaaagta tgagcagtat 25380
ataaaatggc catggtacat ttggctaggt tttatagctg gcttgattgc catagtaatg 25440
gtgacaatta tgctttgctg tatgaccagt tgctgtagtt gtctcaaggg ctgttgttct 25500
tgtggatcct gctgcaaatt tgacgaggac gactctgagc cagtgctcaa aggagtcaaa 25560
ttacattaca cataactatc acagcctctc ctggaaagac agaaaatcta aacaatttat 25620
agcattctca ttgctacctg gccccgtaag aggcagtcat agctatggcc gtgttggtcc 25680
taaggctaca ttggctgctg tctttattgg tccatttatt gtagcatgta tgctaggcat 25740
tggcctagtt tatttattgc aattgcaagt tcaaattttt catgttaagg ataccatacg 25800
tgtgactggc aagccagcca ctgtgtctta tactacaagt acaccagtaa caccgagcgc 25860
gacgacgctc gatggtacta cgtatacttt aattagaccc actagctctt atacaagagt 25920
ttatcttggt actccaagag gttttgatta tagtacattt gggcctaaga ccctagatta 25980
tgttactaat ctaaacctca tcttaattct ggtcgtccat atacttttaa ggcattgtcc 26040
aggcatatga ggccaacagc cacatggatt tggcatgtga gtgatgcatg gttacgccgc 26100
acgcgggact ttggtgtcat tcgcctagaa gatttttgtt ttcaatttaa ttatagccaa 26160
ccccgagttg gttattgtag agttccttta aaggcttggt gtagcaacca gggtaaattt 26220
gcagcgcagt ttaccctaaa aagttgcgaa aaaccaggtc acgaaaaatt tattactagc 26280
ttcacggcct acggcagaac tgtccaacag gccgttagca agttagtaga agaagctgtt 26340
gattttattc tttttagggc cacgcagctc gaaagaaatg tttaatttat tccttacaga 26400
cacagtatgg tatgtggggc agattatttt tatattcgca gtgtgtttga tggtcaccat 26460
aattgtggtt gccttccttg cgtctatcaa actttgtatt caactttgcg gtttatgtaa 26520
tactttggtg ctgtcccctt ctatttattt gtatgatagg agtaagcagc tttataagta 26580
ctataatgaa gaaatgagac tgcccctatt agaggtggat gatatctaat ccaaacatta 26640
tgagtagtac tactcaggcc ccagagcccg tctatcaatg gaccgccgac gaggcagttc 26700
aattccttaa ggaatggaac ttctcgttgg gcattatact actctttatt actatcatac 26760
tacagttcgg ttacacgagc cgtagcatgt ttatttatgt tgtgaaaatg ataatcttgt 26820
ggttaatgtg gccactgact attgttttgt gtattttcaa ttgcgtgtat gcgctaaata 26880
atgtgtatct tggattttct atagtgttta ctatagtgtc cattgtaatc tggatcatgt 26940
attttgtgaa cagcataagg ttgtttatca ggactggtag ctggtggagc ttcaaccccg 27000
aaacaaacaa ccttatgtgt atagatatga aaggtaccgt gtatgttaga cccattattg 27060
aggattacca tacactaaca gccactatta ttcgtggcca cctctacatg caaggtgtta 27120
agctaggcac cggtttctct ttgtctgact tgcccgctta tgttacagtt gctaaggtgt 27180
cacacctttg cacttataag cgcgcattct tagacaaggt agacggtgtt agcggttttg 27240
ctgtttatgt gaagtccaag gtcggaaatt accgactgcc ctcaaacaaa ccgagtggcg 27300
cggacaccgc attgttgaga acctaatcta aactttaagg agagaatgaa tcctatgtcg 27360
gcgctcggtg gtaacccctc gcgagaaagt cgggatagga cactctctat cagaatggat 27420
gtcttgctgt cataacagat agagaaggtt gtggcagacc ctgtatcaat tagttgaaag 27480
agattgcaaa atagagaatg tgtgagagaa gttagcaagg tcctacgtct aaccataaga 27540
acggcgatag gcgccccctg ggaacagctc acatcagggt actattcctg caatgcccta 27600
gtaaatgaat gaagttgatc atggccaatt ggaagaatca caaaaaaaaa aaaaaaaaaa 27660
acggccggtt t 27671
<210> 35
<211> 7341
<212> DNA
<213> Artificial Sequence
<220>
<223> pcDNA34_syn_N
<400> 35
agtacttaat acgactcact ataggctagc cgccaccatg gtgtctgata atggacctca 60
aaatcagcga aatgcacctc gcattacgtt tggtggacca tcagattcaa ctggcagtaa 120
ccagaatgga gaacgaagtg gtgcgcgatc aaaacaacgc cgcccgcaag gtttacccaa 180
taatactgcg tcttggttca ccgctctcac tcaacatggc aaggaagatt taaaattccc 240
tcgaggacaa ggcgttccaa ttaacaccaa tagcagtcca gatgaccaaa ttggctacta 300
ccgccgcgcc acaagacgaa ttcgtggtgg tgatggtaaa atgaaagatc tcagtccaag 360
atggtatttc tactatctag gaactgggcc agaagctgga cttccttatg gtgctaacaa 420
agatggcatc atatgggttg caactgaggg agccttgaat acaccaaaag atcacattgg 480
caccagaaat cctgctaaca atgctgcaat cgtgctacaa cttcctcaag gaacaacatt 540
accaaaaggt ttttacgcag aagggtctag aggtggaagt caagcctctt ctagatcatc 600
atcacgtagt cgcaacagtt caagaaattc aactccaggt tcaagtagag gaacttctcc 660
tgctagaatg gctggaaatg gaggtgatgc tgctcttgct ttgttactac ttgacagatt 720
gaaccagctt gagagcaaaa tgtctggtaa aggccaacaa caacaaggcc aaactgtcac 780
taagaaatct gctgctgagg cttctaagaa gcctagacaa aaacgtactg ccactaaagc 840
atacaatgta acacaagctt tcggcagacg tggtccagaa caaactcaag gaaattttgg 900
ggatcaggaa ctaatcagac aaggaactga ttacaaacat tggccgcaaa ttgcacaatt 960
tgctccttct gcttcagcgt tctttggaat gtcgagaatt ggaatggaag tcacaccttc 1020
gggaacatgg ttgacctata caggtgccat caaattggat gacaaagatc caaatttcaa 1080
agatcaagtc attttgctga ataagcatat tgacgcatac aaaacattcc caccaacaga 1140
gcctaaaaag gacaaaaaga agaaggctga tgaaactcaa gccttaccgc agagacagaa 1200
gaaacagcaa actgtgactc ttcttcctgc tgcagatttg gatgatttct ccaaacaatt 1260
gcaacaatcc atgagcagtg ctgactcaac tcaggcctaa gcggccgctt cgagcagaca 1320
tgataagata aagggttcga tccctaccgg ttagtaatga gtttgatatc tcgacaatca 1380
acctctggat tacaaaattt gtgaaagatt gactggtatt cttaactatg ttgctccttt 1440
tacgctatgt ggatacgctg ctttaatgcc tttgtatcat gctattgctt cccgtatggc 1500
tttcattttc tcctccttgt ataaatcctg gttgctgtct ctttatgagg agttgtggcc 1560
cgttgtcagg caacgtggcg tggtgtgcac tgtgtttgct gacgcaaccc ccactggttg 1620
gggcattgcc accacctgtc agctcctttc cgggactttc gctttccccc tccctattgc 1680
cacggcggaa ctcatcgccg cctgccttgc ccgctgctgg acaggggctc ggctgttggg 1740
cactgacaat tccgtggtgt tgtcggggaa gctgacgtcc tttccatggc tgctcgcctg 1800
tgttgccacc tggattctgc gcgggacgtc cttctgctac gtcccttcgg ccctcaatcc 1860
agcggacctt ccttcccgcg gcctgctgcc ggctctgcgg cctcttccgc gtcttcgcct 1920
tcgccctcag acgagtcgga tctccctttg ggccgcctcc ccgcctggaa acgggggagg 1980
ctaactgaaa cacggaagga gacaataccg gaaggaaccc gcgctatgac ggcaataaaa 2040
agacagaata aaacgcacgg gtgttgggtc gtttgttcat aaacgcgggg ttcggtccca 2100
gggctggcac tctgtcgata ccccaccgag accccattgg ggccaatacg cccgcgtttc 2160
ttccttttcc ccaccccacc ccccaagttc gggtgaaggc ccagggctcg cagccaacgt 2220
cggggcggca ggccctgcca tagcagatct gcgcagctgg ggctctaggg ggtatcccca 2280
cgcgccctgt agcggcgcat taagcgcggc gggtgtggtg gttacgcgca gcgtgaccgc 2340
tacacttgcc agcgccctag cgcccgctcc tttcgctttc ttcccttcct ttctcgccac 2400
gttcgccggc tttccccgtc aagctctaaa tcggggcatc cctttagggt tccgatttag 2460
tgctttacgg cacctcgacc ccaaaaaact tgattagggt gatggttcac gtagtgggcc 2520
atcgccctga tagacggttt ttcgcccttt gacgttggag tccacgttct ttaatagtgg 2580
actcttgttc caaactggaa caacactcaa ccctatctcg gtctattctt ttgatttata 2640
agggattttg gggatttcgg cctattggtt aaaaaatgag ctgatttaac aaaaatttaa 2700
cgcgaattaa ttctgtggaa tgtgtgtcag ttagggtgtg gaaagtcccc aggctcccca 2760
gcaggcagaa gtatgcaaag catgcatctc aattagtcag caaccaggtg tggaaagtcc 2820
ccaggctccc cagcaggcag aagtatgcaa agcatgcatc tcaattagtc agcaaccata 2880
gtcccgcccc taactccgcc catcccgccc ctaactccgc ccagttccgc ccattctccg 2940
ccccatggct gactaatttt ttttatttat gcagaggccg aggccgcctc tgcctctgag 3000
ctattccaga agtagtgagg aggctttttt ggaggcctag gcttttgcaa aaagctcccg 3060
ggagcttgta tatccatttt cggatctgat caagagacag gatgaggatc gtttcgcatg 3120
attgaacaag atggattgca cgcaggttct ccggccgctt gggtggagag gctattcggc 3180
tatgactggg cacaacagac aatcggctgc tctgatgccg ccgtgttccg gctgtcagcg 3240
caggggcgcc cggttctttt tgtcaagacc gacctgtccg gtgccctgaa tgaactgcag 3300
gacgaggcag cgcggctatc gtggctggcc acgacgggcg ttccttgcgc agctgtgctc 3360
gacgttgtca ctgaagcggg aagggactgg ctgctattgg gcgaagtgcc ggggcaggat 3420
ctcctgtcat ctcaccttgc tcctgccgag aaagtatcca tcatggctga tgcaatgcgg 3480
cggctgcata cgcttgatcc ggctacctgc ccattcgacc accaagcgaa acatcgcatc 3540
gagcgagcac gtactcggat ggaagccggt cttgtcgatc aggatgatct ggacgaagag 3600
catcaggggc tcgcgccagc cgaactgttc gccaggctca aggcgcgcat gcccgacggc 3660
gaggatctcg tcgtgaccca tggcgatgcc tgcttgccga atatcatggt ggaaaatggc 3720
cgcttttctg gattcatcga ctgtggccgg ctgggtgtgg cggaccgcta tcaggacata 3780
gcgttggcta cccgtgatat tgctgaagag cttggcggcg aatgggctga ccgcttcctc 3840
gtgctttacg gtatcgccgc tcccgattcg cagcgcatcg ccttctatcg ccttcttgac 3900
gagttcttct gagcgggact ctggggttcg cgaaatgacc gaccaagcga cgcccaacct 3960
gccatcacga gatttcgatt ccaccgccgc cttctatgaa aggttgggct tcggaatcgt 4020
tttccgggac gccggctgga tgatcctcca gcgcggggat ctcatgctgg agttcttcgc 4080
ccaccccaac ttgtttattg cagcttataa tggttacaaa taaagcaata gcatcacaaa 4140
tttcacaaat aaagcatttt tttcactgca ttctagttgt ggtttgtcca aactcatcaa 4200
tgtatcttat catgtctgta taccgtcgac ctctagctag agcttggcgt aatcatggtc 4260
atagctgttt cctgtgtgaa attgttatcc gctcacaatt ccacacaaca tacgagccgg 4320
aagcataaag tgtaaagcct ggggtgccta atgagtgagc taactcacat taattgcgtt 4380
gcgctcactg cccgctttcc agtcgggaaa cctgtcgtgc cagctgcatt aatgaatcgg 4440
ccaacgcgcg gggagaggcg gtttgcgtat tgggcgctct tccgcttcct cgctcactga 4500
ctcgctgcgc tcggtcgttc ggctgcggcg agcggtatca gctcactcaa aggcggtaat 4560
acggttatcc acagaatcag gggataacgc aggaaagaac atgtgagcaa aaggccagca 4620
aaaggccagg aaccgtaaaa aggccgcgtt gctggcgttt ttccataggc tccgcccccc 4680
tgacgagcat cacaaaaatc gacgctcaag tcagaggtgg cgaaacccga caggactata 4740
aagataccag gcgtttcccc ctggaagctc cctcgtgcgc tctcctgttc cgaccctgcc 4800
gcttaccgga tacctgtccg cctttctccc ttcgggaagc gtggcgcttt ctcaatgctc 4860
acgctgtagg tatctcagtt cggtgtaggt cgttcgctcc aagctgggct gtgtgcacga 4920
accccccgtt cagcccgacc gctgcgcctt atccggtaac tatcgtcttg agtccaaccc 4980
ggtaagacac gacttatcgc cactggcagc agccactggt aacaggatta gcagagcgag 5040
gtatgtaggc ggtgctacag agttcttgaa gtggtggcct aactacggct acactagaag 5100
gacagtattt ggtatctgcg ctctgctgaa gccagttacc ttcggaaaaa gagttggtag 5160
ctcttgatcc ggcaaacaaa ccaccgctgg tagcggtggt ttttttgttt gcaagcagca 5220
gattacgcgc agaaaaaaag gatctcaaga agatcctttg atcttttcta cggggtctga 5280
cgctcagtgg aacgaaaact cacgttaagg gattttggtc atgagattat caaaaaggat 5340
cttcacctag atccttttaa attaaaaatg aagttttaaa tcaatctaaa gtatatatga 5400
gtaaacttgg tctgacagtt accaatgctt aatcagtgag gcacctatct cagcgatctg 5460
tctatttcgt tcatccatag ttgcctgact ccccgtcgtg tagataacta cgatacggga 5520
gggcttacca tctggcccca gtgctgcaat gataccgcga gacccacgct caccggctcc 5580
agatttatca gcaataaacc agccagccgg aagggccgag cgcagaagtg gtcctgcaac 5640
tttatccgcc tccatccagt ctattaattg ttgccgggaa gctagagtaa gtagttcgcc 5700
agttaatagt ttgcgcaacg ttgttgccat tgctacaggc atcgtggtgt cacgctcgtc 5760
gtttggtatg gcttcattca gctccggttc ccaacgatca aggcgagtta catgatcccc 5820
catgttgtgc aaaaaagcgg ttagctcctt cggtcctccg atcgttgtca gaagtaagtt 5880
ggccgcagtg ttatcactca tggttatggc agcactgcat aattctctta ctgtcatgcc 5940
atccgtaaga tgcttttctg tgactggtga gtactcaacc aagtcattct gagaatagtg 6000
tatgcggcga ccgagttgct cttgcccggc gtcaatacgg gataataccg cgccacatag 6060
cagaacttta aaagtgctca tcattggaaa acgttcttcg gggcgaaaac tctcaaggat 6120
cttaccgctg ttgagatcca gttcgatgta acccactcgt gcacccaact gatcttcagc 6180
atcttttact ttcaccagcg tttctgggtg agcaaaaaca ggaaggcaaa atgccgcaaa 6240
aaagggaata agggcgacac ggaaatgttg aatactcata ctcttccttt ttcaatatta 6300
ttgaagcatt tatcagggtt attgtctcat gagcggatac atatttgaat gtatttagaa 6360
aaataaacaa ataggggttc cgcgcacatt tccccgaaaa gtgccacctg acgtcgacgg 6420
atcgggagat ctcccgatcc cctatggtcg actctcagta caatctgctc tgatgccgca 6480
tagttaagcc agtatctgct ccctgcttgt gtgttggagg tcgctgagta gtgcgcgagc 6540
aaaatttaag ctacaacaag gcaaggcttg accgacaatt gcatgaagaa tctgcttagg 6600
gttaggcgtt ttgcgctgct tcgcgatgta cgggccagat atacgcgttg acattgatta 6660
ttgactagtt attaatagta atcaattacg gggtcattag ttcatagccc atatatggag 6720
ttccgcgtta cataacttac ggtaaatggc ccgcctggct gaccgcccaa cgacccccgc 6780
ccattgacgt caataatgac gtatgttccc atagtaacgc caatagggac tttccattga 6840
cgtcaatggg tggagtattt acggtaaact gcccacttgg cagtacatca agtgtatcat 6900
atgccaagta cgccccctat tgacgtcaat gacggtaaat ggcccgcctg gcattatgcc 6960
cagtacatga ccttatggga ctttcctact tggcagtaca tctacgtatt agtcatcgct 7020
attaccatgg tgatgcggtt ttggcagtac atcaatgggc gtggatagcg gtttgactca 7080
cggggatttc caagtctcca ccccattgac gtcaatggga gtttgttttg gcaccaaaat 7140
caacgggact ttccaaaatg tcgtaacaac tccgccccat tgacgcaaat gggcggtagg 7200
cgtgtacggt gggaggtcta tataagcaga gctcgtttag tgaaccgtca gatcgcctgg 7260
agacgccatc cacgctgttt tgacctccat agaagacacc gggaccgatc cagcctccgg 7320
actctagagg atcgaaccct t 7341
<210> 36
<211> 6309
<212> DNA
<213> Artificial Sequence
<220>
<223> pcDNA34_syn_E
<400> 36
agtacttaat acgactcact ataggctagc cgccaccatg gtgtactcat tcgtttcgga 60
agagacaggt acgttaatag ttaatagcgt acttcttttt cttgctttcg tggtattctt 120
gctagttaca ctagccattc ttactgcgct tcgattgtgt gcgtactgtt gcaatattgt 180
taacgtgagt cttgtaaaac cttcttttta cgtttactct cgtgttaaaa atctgaattc 240
ttctcgggtt cctgatcttc tggtctaagc ggccgcttcg agcagacatg ataagataaa 300
gggttcgatc cctaccggtt agtaatgagt ttgatatctc gacaatcaac ctctggatta 360
caaaatttgt gaaagattga ctggtattct taactatgtt gctcctttta cgctatgtgg 420
atacgctgct ttaatgcctt tgtatcatgc tattgcttcc cgtatggctt tcattttctc 480
ctccttgtat aaatcctggt tgctgtctct ttatgaggag ttgtggcccg ttgtcaggca 540
acgtggcgtg gtgtgcactg tgtttgctga cgcaaccccc actggttggg gcattgccac 600
cacctgtcag ctcctttccg ggactttcgc tttccccctc cctattgcca cggcggaact 660
catcgccgcc tgccttgccc gctgctggac aggggctcgg ctgttgggca ctgacaattc 720
cgtggtgttg tcggggaagc tgacgtcctt tccatggctg ctcgcctgtg ttgccacctg 780
gattctgcgc gggacgtcct tctgctacgt cccttcggcc ctcaatccag cggaccttcc 840
ttcccgcggc ctgctgccgg ctctgcggcc tcttccgcgt cttcgccttc gccctcagac 900
gagtcggatc tccctttggg ccgcctcccc gcctggaaac gggggaggct aactgaaaca 960
cggaaggaga caataccgga aggaacccgc gctatgacgg caataaaaag acagaataaa 1020
acgcacgggt gttgggtcgt ttgttcataa acgcggggtt cggtcccagg gctggcactc 1080
tgtcgatacc ccaccgagac cccattgggg ccaatacgcc cgcgtttctt ccttttcccc 1140
accccacccc ccaagttcgg gtgaaggccc agggctcgca gccaacgtcg gggcggcagg 1200
ccctgccata gcagatctgc gcagctgggg ctctaggggg tatccccacg cgccctgtag 1260
cggcgcatta agcgcggcgg gtgtggtggt tacgcgcagc gtgaccgcta cacttgccag 1320
cgccctagcg cccgctcctt tcgctttctt cccttccttt ctcgccacgt tcgccggctt 1380
tccccgtcaa gctctaaatc ggggcatccc tttagggttc cgatttagtg ctttacggca 1440
cctcgacccc aaaaaacttg attagggtga tggttcacgt agtgggccat cgccctgata 1500
gacggttttt cgccctttga cgttggagtc cacgttcttt aatagtggac tcttgttcca 1560
aactggaaca acactcaacc ctatctcggt ctattctttt gatttataag ggattttggg 1620
gatttcggcc tattggttaa aaaatgagct gatttaacaa aaatttaacg cgaattaatt 1680
ctgtggaatg tgtgtcagtt agggtgtgga aagtccccag gctccccagc aggcagaagt 1740
atgcaaagca tgcatctcaa ttagtcagca accaggtgtg gaaagtcccc aggctcccca 1800
gcaggcagaa gtatgcaaag catgcatctc aattagtcag caaccatagt cccgccccta 1860
actccgccca tcccgcccct aactccgccc agttccgccc attctccgcc ccatggctga 1920
ctaatttttt ttatttatgc agaggccgag gccgcctctg cctctgagct attccagaag 1980
tagtgaggag gcttttttgg aggcctaggc ttttgcaaaa agctcccggg agcttgtata 2040
tccattttcg gatctgatca agagacagga tgaggatcgt ttcgcatgat tgaacaagat 2100
ggattgcacg caggttctcc ggccgcttgg gtggagaggc tattcggcta tgactgggca 2160
caacagacaa tcggctgctc tgatgccgcc gtgttccggc tgtcagcgca ggggcgcccg 2220
gttctttttg tcaagaccga cctgtccggt gccctgaatg aactgcagga cgaggcagcg 2280
cggctatcgt ggctggccac gacgggcgtt ccttgcgcag ctgtgctcga cgttgtcact 2340
gaagcgggaa gggactggct gctattgggc gaagtgccgg ggcaggatct cctgtcatct 2400
caccttgctc ctgccgagaa agtatccatc atggctgatg caatgcggcg gctgcatacg 2460
cttgatccgg ctacctgccc attcgaccac caagcgaaac atcgcatcga gcgagcacgt 2520
actcggatgg aagccggtct tgtcgatcag gatgatctgg acgaagagca tcaggggctc 2580
gcgccagccg aactgttcgc caggctcaag gcgcgcatgc ccgacggcga ggatctcgtc 2640
gtgacccatg gcgatgcctg cttgccgaat atcatggtgg aaaatggccg cttttctgga 2700
ttcatcgact gtggccggct gggtgtggcg gaccgctatc aggacatagc gttggctacc 2760
cgtgatattg ctgaagagct tggcggcgaa tgggctgacc gcttcctcgt gctttacggt 2820
atcgccgctc ccgattcgca gcgcatcgcc ttctatcgcc ttcttgacga gttcttctga 2880
gcgggactct ggggttcgcg aaatgaccga ccaagcgacg cccaacctgc catcacgaga 2940
tttcgattcc accgccgcct tctatgaaag gttgggcttc ggaatcgttt tccgggacgc 3000
cggctggatg atcctccagc gcggggatct catgctggag ttcttcgccc accccaactt 3060
gtttattgca gcttataatg gttacaaata aagcaatagc atcacaaatt tcacaaataa 3120
agcatttttt tcactgcatt ctagttgtgg tttgtccaaa ctcatcaatg tatcttatca 3180
tgtctgtata ccgtcgacct ctagctagag cttggcgtaa tcatggtcat agctgtttcc 3240
tgtgtgaaat tgttatccgc tcacaattcc acacaacata cgagccggaa gcataaagtg 3300
taaagcctgg ggtgcctaat gagtgagcta actcacatta attgcgttgc gctcactgcc 3360
cgctttccag tcgggaaacc tgtcgtgcca gctgcattaa tgaatcggcc aacgcgcggg 3420
gagaggcggt ttgcgtattg ggcgctcttc cgcttcctcg ctcactgact cgctgcgctc 3480
ggtcgttcgg ctgcggcgag cggtatcagc tcactcaaag gcggtaatac ggttatccac 3540
agaatcaggg gataacgcag gaaagaacat gtgagcaaaa ggccagcaaa aggccaggaa 3600
ccgtaaaaag gccgcgttgc tggcgttttt ccataggctc cgcccccctg acgagcatca 3660
caaaaatcga cgctcaagtc agaggtggcg aaacccgaca ggactataaa gataccaggc 3720
gtttccccct ggaagctccc tcgtgcgctc tcctgttccg accctgccgc ttaccggata 3780
cctgtccgcc tttctccctt cgggaagcgt ggcgctttct caatgctcac gctgtaggta 3840
tctcagttcg gtgtaggtcg ttcgctccaa gctgggctgt gtgcacgaac cccccgttca 3900
gcccgaccgc tgcgccttat ccggtaacta tcgtcttgag tccaacccgg taagacacga 3960
cttatcgcca ctggcagcag ccactggtaa caggattagc agagcgaggt atgtaggcgg 4020
tgctacagag ttcttgaagt ggtggcctaa ctacggctac actagaagga cagtatttgg 4080
tatctgcgct ctgctgaagc cagttacctt cggaaaaaga gttggtagct cttgatccgg 4140
caaacaaacc accgctggta gcggtggttt ttttgtttgc aagcagcaga ttacgcgcag 4200
aaaaaaagga tctcaagaag atcctttgat cttttctacg gggtctgacg ctcagtggaa 4260
cgaaaactca cgttaaggga ttttggtcat gagattatca aaaaggatct tcacctagat 4320
ccttttaaat taaaaatgaa gttttaaatc aatctaaagt atatatgagt aaacttggtc 4380
tgacagttac caatgcttaa tcagtgaggc acctatctca gcgatctgtc tatttcgttc 4440
atccatagtt gcctgactcc ccgtcgtgta gataactacg atacgggagg gcttaccatc 4500
tggccccagt gctgcaatga taccgcgaga cccacgctca ccggctccag atttatcagc 4560
aataaaccag ccagccggaa gggccgagcg cagaagtggt cctgcaactt tatccgcctc 4620
catccagtct attaattgtt gccgggaagc tagagtaagt agttcgccag ttaatagttt 4680
gcgcaacgtt gttgccattg ctacaggcat cgtggtgtca cgctcgtcgt ttggtatggc 4740
ttcattcagc tccggttccc aacgatcaag gcgagttaca tgatccccca tgttgtgcaa 4800
aaaagcggtt agctccttcg gtcctccgat cgttgtcaga agtaagttgg ccgcagtgtt 4860
atcactcatg gttatggcag cactgcataa ttctcttact gtcatgccat ccgtaagatg 4920
cttttctgtg actggtgagt actcaaccaa gtcattctga gaatagtgta tgcggcgacc 4980
gagttgctct tgcccggcgt caatacggga taataccgcg ccacatagca gaactttaaa 5040
agtgctcatc attggaaaac gttcttcggg gcgaaaactc tcaaggatct taccgctgtt 5100
gagatccagt tcgatgtaac ccactcgtgc acccaactga tcttcagcat cttttacttt 5160
caccagcgtt tctgggtgag caaaaacagg aaggcaaaat gccgcaaaaa agggaataag 5220
ggcgacacgg aaatgttgaa tactcatact cttccttttt caatattatt gaagcattta 5280
tcagggttat tgtctcatga gcggatacat atttgaatgt atttagaaaa ataaacaaat 5340
aggggttccg cgcacatttc cccgaaaagt gccacctgac gtcgacggat cgggagatct 5400
cccgatcccc tatggtcgac tctcagtaca atctgctctg atgccgcata gttaagccag 5460
tatctgctcc ctgcttgtgt gttggaggtc gctgagtagt gcgcgagcaa aatttaagct 5520
acaacaaggc aaggcttgac cgacaattgc atgaagaatc tgcttagggt taggcgtttt 5580
gcgctgcttc gcgatgtacg ggccagatat acgcgttgac attgattatt gactagttat 5640
taatagtaat caattacggg gtcattagtt catagcccat atatggagtt ccgcgttaca 5700
taacttacgg taaatggccc gcctggctga ccgcccaacg acccccgccc attgacgtca 5760
ataatgacgt atgttcccat agtaacgcca atagggactt tccattgacg tcaatgggtg 5820
gagtatttac ggtaaactgc ccacttggca gtacatcaag tgtatcatat gccaagtacg 5880
ccccctattg acgtcaatga cggtaaatgg cccgcctggc attatgccca gtacatgacc 5940
ttatgggact ttcctacttg gcagtacatc tacgtattag tcatcgctat taccatggtg 6000
atgcggtttt ggcagtacat caatgggcgt ggatagcggt ttgactcacg gggatttcca 6060
agtctccacc ccattgacgt caatgggagt ttgttttggc accaaaatca acgggacttt 6120
ccaaaatgtc gtaacaactc cgccccattg acgcaaatgg gcggtaggcg tgtacggtgg 6180
gaggtctata taagcagagc tcgtttagtg aaccgtcaga tcgcctggag acgccatcca 6240
cgctgttttg acctccatag aagacaccgg gaccgatcca gcctccggac tctagaggat 6300
cgaaccctt 6309
<210> 37
<211> 6750
<212> DNA
<213> Artificial Sequence
<220>
<223> pcDNA34_syn_M
<400> 37
agtacttaat acgactcact ataggctagc cgccaccatg gtggcagatt ccaacggtac 60
tattaccgtt gaggagctga aaaagctcct tgaacaatgg aacctagtaa taggtttcct 120
attccttaca tggatttgcc tgctgcaatt tgcctatgcc aacaggaata ggtttttgta 180
catcattaag ttgattttcc tctggctgtt atggccagta actttagctt gttttgtgct 240
tgctgctgtt tacagaataa attggatcac cggtggaatt gctattgcaa tggcttgtct 300
tgtaggattg atgtggctaa gctacttcat tgcttctttc agactgtttg cgcgtacgcg 360
ttccatgtgg tcattcaatc cagaaactaa cattcttctc aacgtgccac tccatggaac 420
tattctgact agaccgcttc tagaaagtga actcgtaatc ggagctgtta tccttcgtgg 480
acatcttcgt attgctggac atcatctagg acgctgtgac atcaaggatc tacctaaaga 540
aatcactgtt gctacatcac gaacgctttc ttattacaaa ttgggagctt cacagcgtgt 600
agcaggtgat tcaggttttg ctgcatatag tcgctacagg attggcaact ataaattaaa 660
cacagaccat tccagtagca gtgacaatat tgctttgctt gtacagtaag cggccgcttc 720
gagcagacat gataagataa agggttcgat ccctaccggt tagtaatgag tttgatatct 780
cgacaatcaa cctctggatt acaaaatttg tgaaagattg actggtattc ttaactatgt 840
tgctcctttt acgctatgtg gatacgctgc tttaatgcct ttgtatcatg ctattgcttc 900
ccgtatggct ttcattttct cctccttgta taaatcctgg ttgctgtctc tttatgagga 960
gttgtggccc gttgtcaggc aacgtggcgt ggtgtgcact gtgtttgctg acgcaacccc 1020
cactggttgg ggcattgcca ccacctgtca gctcctttcc gggactttcg ctttccccct 1080
ccctattgcc acggcggaac tcatcgccgc ctgccttgcc cgctgctgga caggggctcg 1140
gctgttgggc actgacaatt ccgtggtgtt gtcggggaag ctgacgtcct ttccatggct 1200
gctcgcctgt gttgccacct ggattctgcg cgggacgtcc ttctgctacg tcccttcggc 1260
cctcaatcca gcggaccttc cttcccgcgg cctgctgccg gctctgcggc ctcttccgcg 1320
tcttcgcctt cgccctcaga cgagtcggat ctccctttgg gccgcctccc cgcctggaaa 1380
cgggggaggc taactgaaac acggaaggag acaataccgg aaggaacccg cgctatgacg 1440
gcaataaaaa gacagaataa aacgcacggg tgttgggtcg tttgttcata aacgcggggt 1500
tcggtcccag ggctggcact ctgtcgatac cccaccgaga ccccattggg gccaatacgc 1560
ccgcgtttct tccttttccc caccccaccc cccaagttcg ggtgaaggcc cagggctcgc 1620
agccaacgtc ggggcggcag gccctgccat agcagatctg cgcagctggg gctctagggg 1680
gtatccccac gcgccctgta gcggcgcatt aagcgcggcg ggtgtggtgg ttacgcgcag 1740
cgtgaccgct acacttgcca gcgccctagc gcccgctcct ttcgctttct tcccttcctt 1800
tctcgccacg ttcgccggct ttccccgtca agctctaaat cggggcatcc ctttagggtt 1860
ccgatttagt gctttacggc acctcgaccc caaaaaactt gattagggtg atggttcacg 1920
tagtgggcca tcgccctgat agacggtttt tcgccctttg acgttggagt ccacgttctt 1980
taatagtgga ctcttgttcc aaactggaac aacactcaac cctatctcgg tctattcttt 2040
tgatttataa gggattttgg ggatttcggc ctattggtta aaaaatgagc tgatttaaca 2100
aaaatttaac gcgaattaat tctgtggaat gtgtgtcagt tagggtgtgg aaagtcccca 2160
ggctccccag caggcagaag tatgcaaagc atgcatctca attagtcagc aaccaggtgt 2220
ggaaagtccc caggctcccc agcaggcaga agtatgcaaa gcatgcatct caattagtca 2280
gcaaccatag tcccgcccct aactccgccc atcccgcccc taactccgcc cagttccgcc 2340
cattctccgc cccatggctg actaattttt tttatttatg cagaggccga ggccgcctct 2400
gcctctgagc tattccagaa gtagtgagga ggcttttttg gaggcctagg cttttgcaaa 2460
aagctcccgg gagcttgtat atccattttc ggatctgatc aagagacagg atgaggatcg 2520
tttcgcatga ttgaacaaga tggattgcac gcaggttctc cggccgcttg ggtggagagg 2580
ctattcggct atgactgggc acaacagaca atcggctgct ctgatgccgc cgtgttccgg 2640
ctgtcagcgc aggggcgccc ggttcttttt gtcaagaccg acctgtccgg tgccctgaat 2700
gaactgcagg acgaggcagc gcggctatcg tggctggcca cgacgggcgt tccttgcgca 2760
gctgtgctcg acgttgtcac tgaagcggga agggactggc tgctattggg cgaagtgccg 2820
gggcaggatc tcctgtcatc tcaccttgct cctgccgaga aagtatccat catggctgat 2880
gcaatgcggc ggctgcatac gcttgatccg gctacctgcc cattcgacca ccaagcgaaa 2940
catcgcatcg agcgagcacg tactcggatg gaagccggtc ttgtcgatca ggatgatctg 3000
gacgaagagc atcaggggct cgcgccagcc gaactgttcg ccaggctcaa ggcgcgcatg 3060
cccgacggcg aggatctcgt cgtgacccat ggcgatgcct gcttgccgaa tatcatggtg 3120
gaaaatggcc gcttttctgg attcatcgac tgtggccggc tgggtgtggc ggaccgctat 3180
caggacatag cgttggctac ccgtgatatt gctgaagagc ttggcggcga atgggctgac 3240
cgcttcctcg tgctttacgg tatcgccgct cccgattcgc agcgcatcgc cttctatcgc 3300
cttcttgacg agttcttctg agcgggactc tggggttcgc gaaatgaccg accaagcgac 3360
gcccaacctg ccatcacgag atttcgattc caccgccgcc ttctatgaaa ggttgggctt 3420
cggaatcgtt ttccgggacg ccggctggat gatcctccag cgcggggatc tcatgctgga 3480
gttcttcgcc caccccaact tgtttattgc agcttataat ggttacaaat aaagcaatag 3540
catcacaaat ttcacaaata aagcattttt ttcactgcat tctagttgtg gtttgtccaa 3600
actcatcaat gtatcttatc atgtctgtat accgtcgacc tctagctaga gcttggcgta 3660
atcatggtca tagctgtttc ctgtgtgaaa ttgttatccg ctcacaattc cacacaacat 3720
acgagccgga agcataaagt gtaaagcctg gggtgcctaa tgagtgagct aactcacatt 3780
aattgcgttg cgctcactgc ccgctttcca gtcgggaaac ctgtcgtgcc agctgcatta 3840
atgaatcggc caacgcgcgg ggagaggcgg tttgcgtatt gggcgctctt ccgcttcctc 3900
gctcactgac tcgctgcgct cggtcgttcg gctgcggcga gcggtatcag ctcactcaaa 3960
ggcggtaata cggttatcca cagaatcagg ggataacgca ggaaagaaca tgtgagcaaa 4020
aggccagcaa aaggccagga accgtaaaaa ggccgcgttg ctggcgtttt tccataggct 4080
ccgcccccct gacgagcatc acaaaaatcg acgctcaagt cagaggtggc gaaacccgac 4140
aggactataa agataccagg cgtttccccc tggaagctcc ctcgtgcgct ctcctgttcc 4200
gaccctgccg cttaccggat acctgtccgc ctttctccct tcgggaagcg tggcgctttc 4260
tcaatgctca cgctgtaggt atctcagttc ggtgtaggtc gttcgctcca agctgggctg 4320
tgtgcacgaa ccccccgttc agcccgaccg ctgcgcctta tccggtaact atcgtcttga 4380
gtccaacccg gtaagacacg acttatcgcc actggcagca gccactggta acaggattag 4440
cagagcgagg tatgtaggcg gtgctacaga gttcttgaag tggtggccta actacggcta 4500
cactagaagg acagtatttg gtatctgcgc tctgctgaag ccagttacct tcggaaaaag 4560
agttggtagc tcttgatccg gcaaacaaac caccgctggt agcggtggtt tttttgtttg 4620
caagcagcag attacgcgca gaaaaaaagg atctcaagaa gatcctttga tcttttctac 4680
ggggtctgac gctcagtgga acgaaaactc acgttaaggg attttggtca tgagattatc 4740
aaaaaggatc ttcacctaga tccttttaaa ttaaaaatga agttttaaat caatctaaag 4800
tatatatgag taaacttggt ctgacagtta ccaatgctta atcagtgagg cacctatctc 4860
agcgatctgt ctatttcgtt catccatagt tgcctgactc cccgtcgtgt agataactac 4920
gatacgggag ggcttaccat ctggccccag tgctgcaatg ataccgcgag acccacgctc 4980
accggctcca gatttatcag caataaacca gccagccgga agggccgagc gcagaagtgg 5040
tcctgcaact ttatccgcct ccatccagtc tattaattgt tgccgggaag ctagagtaag 5100
tagttcgcca gttaatagtt tgcgcaacgt tgttgccatt gctacaggca tcgtggtgtc 5160
acgctcgtcg tttggtatgg cttcattcag ctccggttcc caacgatcaa ggcgagttac 5220
atgatccccc atgttgtgca aaaaagcggt tagctccttc ggtcctccga tcgttgtcag 5280
aagtaagttg gccgcagtgt tatcactcat ggttatggca gcactgcata attctcttac 5340
tgtcatgcca tccgtaagat gcttttctgt gactggtgag tactcaacca agtcattctg 5400
agaatagtgt atgcggcgac cgagttgctc ttgcccggcg tcaatacggg ataataccgc 5460
gccacatagc agaactttaa aagtgctcat cattggaaaa cgttcttcgg ggcgaaaact 5520
ctcaaggatc ttaccgctgt tgagatccag ttcgatgtaa cccactcgtg cacccaactg 5580
atcttcagca tcttttactt tcaccagcgt ttctgggtga gcaaaaacag gaaggcaaaa 5640
tgccgcaaaa aagggaataa gggcgacacg gaaatgttga atactcatac tcttcctttt 5700
tcaatattat tgaagcattt atcagggtta ttgtctcatg agcggataca tatttgaatg 5760
tatttagaaa aataaacaaa taggggttcc gcgcacattt ccccgaaaag tgccacctga 5820
cgtcgacgga tcgggagatc tcccgatccc ctatggtcga ctctcagtac aatctgctct 5880
gatgccgcat agttaagcca gtatctgctc cctgcttgtg tgttggaggt cgctgagtag 5940
tgcgcgagca aaatttaagc tacaacaagg caaggcttga ccgacaattg catgaagaat 6000
ctgcttaggg ttaggcgttt tgcgctgctt cgcgatgtac gggccagata tacgcgttga 6060
cattgattat tgactagtta ttaatagtaa tcaattacgg ggtcattagt tcatagccca 6120
tatatggagt tccgcgttac ataacttacg gtaaatggcc cgcctggctg accgcccaac 6180
gacccccgcc cattgacgtc aataatgacg tatgttccca tagtaacgcc aatagggact 6240
ttccattgac gtcaatgggt ggagtattta cggtaaactg cccacttggc agtacatcaa 6300
gtgtatcata tgccaagtac gccccctatt gacgtcaatg acggtaaatg gcccgcctgg 6360
cattatgccc agtacatgac cttatgggac tttcctactt ggcagtacat ctacgtatta 6420
gtcatcgcta ttaccatggt gatgcggttt tggcagtaca tcaatgggcg tggatagcgg 6480
tttgactcac ggggatttcc aagtctccac cccattgacg tcaatgggag tttgttttgg 6540
caccaaaatc aacgggactt tccaaaatgt cgtaacaact ccgccccatt gacgcaaatg 6600
ggcggtaggc gtgtacggtg ggaggtctat ataagcagag ctcgtttagt gaaccgtcag 6660
atcgcctgga gacgccatcc acgctgtttt gacctccata gaagacaccg ggaccgatcc 6720
agcctccgga ctctagagga tcgaaccctt 6750
<210> 38
<211> 9905
<212> DNA
<213> Artificial Sequence
<220>
<223> pcDNA34_syn_S
<400> 38
agtacttaat acgactcact ataggctagc gccgccacca tggtgtttgt ttttcttgtt 60
ttattgccac tagtctctag tcagtgtgtt aatcttacaa ccagaactca attaccccct 120
gcatacacta attctttcac acgtggtgtt tattaccctg acaaagtttt cagatcctca 180
gttttacatt caactcagga cttgttctta cctttctttt ccaatgttac ttggttccat 240
gctatacatg tctctgggac caatggtact aagaggtttg ataaccctgt cctaccattt 300
aatgatggtg tttactttgc ttccactgag aagtctaaca taataagagg ctggattttt 360
ggtactactt tagattcgaa aacccagtcc ctacttattg ttaataacgc tactaatgtt 420
gttatcaaag tctgtgaatt tcaattttgt aacgatccat ttttgggtgt ttattaccac 480
aaaaacaaca aaagttggat ggaaagtgag ttcagagttt attctagtgc gaataattgc 540
acttttgaat acgtctctca gccttttctt atggaccttg aaggaaaaca gggtaatttc 600
aaaaatctta gggaatttgt gttcaagaat attgatggtt acttcaagat atactctaag 660
cacacgccta ttaatttagt gcgtgatctc cctcagggtt tttcggcttt agaaccattg 720
gtagatttgc caataggtat taacatcact aggtttcaaa ctttacttgc tttacataga 780
agttatttaa ctcctggtga ttcttcttca ggttggacag ctggtgctgc agcttattat 840
gtgggttatc ttcaacctag gacttttcta ctgaagtaca atgaaaatgg aaccattaca 900
gatgctgtag actgtgcact tgaccctctc tcagaaacaa agtgtacgtt gaaatccttc 960
actgtagaaa aaggaatcta tcaaacttct aactttagag tccaaccaac agaatctatt 1020
gttagatttc ctaacatcac aaacttgtgc ccttttggtg aagtttttaa cgccaccaga 1080
tttgcatctg tttatgcttg gaacaggaag agaatcagca actgtgttgc tgattattct 1140
gtcctgtata attccgcatc attttccact tttaagtgtt atggagtgtc tcctactaaa 1200
ttaaatgatc tctgctttac taatgtctat gcagattcat ttgtaattag aggtgatgaa 1260
gtcagacaaa tcgctccagg gcaaactgga aagattgctg attataacta caaattacca 1320
gatgatttta caggctgcgt tatagcttgg aattctaaca atcttgattc taaggttggt 1380
ggtaattata attacctgta cagattgttt aggaagtcta atctcaaacc ttttgagaga 1440
gatatttcaa ctgaaatcta tcaggccggt agcacacctt gtaatggtgt tgaaggtttt 1500
aattgttact ttcctctgca atcatatggt ttccaaccca ctaatggtgt tggttaccaa 1560
ccatacagag tagtagtact ttcttttgaa cttctacatg caccagcaac tgtttgtgga 1620
cctaaaaagt ctactaattt ggttaagaac aagtgtgtca atttcaactt caatggttta 1680
acaggcacag gtgttcttac tgagtctaac aaaaagtttc tgcctttcca acaatttggc 1740
agagacattg ctgacactac tgatgctgtt cgtgatccac aaacacttga gattcttgac 1800
attacaccat gttcttttgg tggtgtcagt gttataacac caggaacaaa tacttctaac 1860
caggttgctg ttctttatca ggatgttaac tgcacagaag tccctgttgc tattcatgca 1920
gatcaactta ctcctacttg gcgtgtttat tctacaggtt ctaatgtttt tcaaacacgt 1980
gcaggctgtt taataggggc tgaacatgtc aacaactcat atgagtgtga catacccatt 2040
ggtgcaggta tatgcgctag ttatcagact cagactaatt ctcctcggag agcaagaagt 2100
gtagctagtc aatccatcat tgcctacact atgtcacttg gtgcagaaaa ttcagttgct 2160
tactctaata actctattgc catacccaca aattttacta ttagcgttac cacagaaatt 2220
ctaccagtgt ctatgaccaa gacatcagta gattgtacaa tgtacatttg tggtgattca 2280
actgaatgca gcaatctttt gttgcaatat ggcagttttt gtacacaatt aaaccgtgct 2340
ttaactggaa tagctgttga acaagacaaa aacacccaag aagtttttgc acaagtcaaa 2400
caaatttaca agacaccacc aattaaagat tttggcggtt ttaattttag ccagatactg 2460
ccagatccat caaaaccaag caagaggtca tttattgaag atctactgtt caacaaagtg 2520
acacttgcag atgctggctt catcaaacaa tatggtgatt gccttggtga tattgctgct 2580
agagacctca tttgtgcaca aaagtttaac ggccttactg ttttgccacc tttgctcaca 2640
gatgaaatga ttgctcaata cacttctgca ctgttagcag gtacaatcac ttctggttgg 2700
acttttggtg caggtgctgc attacaaata ccatttgcta tgcaaatggc ttataggttt 2760
aatggtattg gagttacaca gaatgttctc tatgagaacc aaaaattgat tgccaaccaa 2820
tttaatagtg ctattggcaa aattcaagac tcactttctt ccacagcaag tgcacttgga 2880
aaacttcaag atgtggtcaa ccaaaatgca caagctttaa acacgcttgt taaacaactt 2940
agctccaatt ttggtgcaat ttcaagtgtt ttaaacgaca tcctttcacg tcttgacaaa 3000
gttgaggctg aagtgcaaat tgataggttg atcacaggca gacttcaaag tttgcagaca 3060
tatgtgactc aacaattaat tagagctgca gaaatcagag cttctgctaa tcttgctgct 3120
actaaaatgt cagagtgtgt acttggacaa tcaaaaagag ttgacttttg cggaaagggc 3180
tatcatctta tgtcatttcc tcagtcagca cctcatggtg tcgtcttttt gcatgtgact 3240
tatgtccctg cacaagaaaa gaacttcaca actgctcctg ccatttgtca tgatggaaaa 3300
gcacactttc ctcgtgaagg tgtctttgtt tcaaatggca cacactggtt tgtaacacaa 3360
aggaattttt atgaaccaca aatcattact acagacaaca catttgtgtc tggtaactgt 3420
gatgttgtaa taggaattgt caacaacaca gtttatgatc ctttgcaacc tgaattagac 3480
tcattcaagg aggagcttga taaatacttc aagaaccata cctcaccaga tgttgattta 3540
ggtgacatct ctggcattaa tgcttcagtt gtaaacattc agaaagaaat cgaccgcctc 3600
aatgaggttg ccaagaattt aaatgaatct ctcatcgatc tccaagaact tggaaagtat 3660
gagcagtata taaaatggcc atggtacatt tggctaggtt ttatagctgg cttgattgcc 3720
atagtaatgg tgacaattat gctttgctgt atgaccagtt gctgtagttg tctcaagggc 3780
tgttgttctt gtggatcctg ctgcaaattt gacgaggacg actctgagcc agtgctcaaa 3840
ggagtcaaat tacattacac ataagcggcc gcttcgagca gacatgataa gataaagggt 3900
tcgatcccta ccggttagta atgagtttga tatctcgaca atcaacctct ggattacaaa 3960
atttgtgaaa gattgactgg tattcttaac tatgttgctc cttttacgct atgtggatac 4020
gctgctttaa tgcctttgta tcatgctatt gcttcccgta tggctttcat tttctcctcc 4080
ttgtataaat cctggttgct gtctctttat gaggagttgt ggcccgttgt caggcaacgt 4140
ggcgtggtgt gcactgtgtt tgctgacgca acccccactg gttggggcat tgccaccacc 4200
tgtcagctcc tttccgggac tttcgctttc cccctcccta ttgccacggc ggaactcatc 4260
gccgcctgcc ttgcccgctg ctggacaggg gctcggctgt tgggcactga caattccgtg 4320
gtgttgtcgg ggaagctgac gtcctttcca tggctgctcg cctgtgttgc cacctggatt 4380
ctgcgcggga cgtccttctg ctacgtccct tcggccctca atccagcgga ccttccttcc 4440
cgcggcctgc tgccggctct gcggcctctt ccgcgtcttc gccttcgccc tcagacgagt 4500
cggatctccc tttgggccgc ctccccgcct ggaaacgggg gaggctaact gaaacacgga 4560
aggagacaat accggaagga acccgcgcta tgacggcaat aaaaagacag aataaaacgc 4620
acgggtgttg ggtcgtttgt tcataaacgc ggggttcggt cccagggctg gcactctgtc 4680
gataccccac cgagacccca ttggggccaa tacgcccgcg tttcttcctt ttccccaccc 4740
caccccccaa gttcgggtga aggcccaggg ctcgcagcca acgtcggggc ggcaggccct 4800
gccatagcag atctgcgcag ctggggctct agggggtatc cccacgcgcc ctgtagcggc 4860
gcattaagcg cggcgggtgt ggtggttacg cgcagcgtga ccgctacact tgccagcgcc 4920
ctagcgcccg ctcctttcgc tttcttccct tcctttctcg ccacgttcgc cggctttccc 4980
cgtcaagctc taaatcgggg catcccttta gggttccgat ttagtgcttt acggcacctc 5040
gaccccaaaa aacttgatta gggtgatggt tcacgtagtg ggccatcgcc ctgatagacg 5100
gtttttcgcc ctttgacgtt ggagtccacg ttctttaata gtggactctt gttccaaact 5160
ggaacaacac tcaaccctat ctcggtctat tcttttgatt tataagggat tttggggatt 5220
tcggcctatt ggttaaaaaa tgagctgatt taacaaaaat ttaacgcgaa ttaattctgt 5280
ggaatgtgtg tcagttaggg tgtggaaagt ccccaggctc cccagcaggc agaagtatgc 5340
aaagcatgca tctcaattag tcagcaacca ggtgtggaaa gtccccaggc tccccagcag 5400
gcagaagtat gcaaagcatg catctcaatt agtcagcaac catagtcccg cccctaactc 5460
cgcccatccc gcccctaact ccgcccagtt ccgcccattc tccgccccat ggctgactaa 5520
ttttttttat ttatgcagag gccgaggccg cctctgcctc tgagctattc cagaagtagt 5580
gaggaggctt ttttggaggc ctaggctttt gcaaaaagct cccgggagct tgtatatcca 5640
ttttcggatc tgatcaagag acaggatgag gatcgtttcg catgattgaa caagatggat 5700
tgcacgcagg ttctccggcc gcttgggtgg agaggctatt cggctatgac tgggcacaac 5760
agacaatcgg ctgctctgat gccgccgtgt tccggctgtc agcgcagggg cgcccggttc 5820
tttttgtcaa gaccgacctg tccggtgccc tgaatgaact gcaggacgag gcagcgcggc 5880
tatcgtggct ggccacgacg ggcgttcctt gcgcagctgt gctcgacgtt gtcactgaag 5940
cgggaaggga ctggctgcta ttgggcgaag tgccggggca ggatctcctg tcatctcacc 6000
ttgctcctgc cgagaaagta tccatcatgg ctgatgcaat gcggcggctg catacgcttg 6060
atccggctac ctgcccattc gaccaccaag cgaaacatcg catcgagcga gcacgtactc 6120
ggatggaagc cggtcttgtc gatcaggatg atctggacga agagcatcag gggctcgcgc 6180
cagccgaact gttcgccagg ctcaaggcgc gcatgcccga cggcgaggat ctcgtcgtga 6240
cccatggcga tgcctgcttg ccgaatatca tggtggaaaa tggccgcttt tctggattca 6300
tcgactgtgg ccggctgggt gtggcggacc gctatcagga catagcgttg gctacccgtg 6360
atattgctga agagcttggc ggcgaatggg ctgaccgctt cctcgtgctt tacggtatcg 6420
ccgctcccga ttcgcagcgc atcgccttct atcgccttct tgacgagttc ttctgagcgg 6480
gactctgggg ttcgcgaaat gaccgaccaa gcgacgccca acctgccatc acgagatttc 6540
gattccaccg ccgccttcta tgaaaggttg ggcttcggaa tcgttttccg ggacgccggc 6600
tggatgatcc tccagcgcgg ggatctcatg ctggagttct tcgcccaccc caacttgttt 6660
attgcagctt ataatggtta caaataaagc aatagcatca caaatttcac aaataaagca 6720
tttttttcac tgcattctag ttgtggtttg tccaaactca tcaatgtatc ttatcatgtc 6780
tgtataccgt cgacctctag ctagagcttg gcgtaatcat ggtcatagct gtttcctgtg 6840
tgaaattgtt atccgctcac aattccacac aacatacgag ccggaagcat aaagtgtaaa 6900
gcctggggtg cctaatgagt gagctaactc acattaattg cgttgcgctc actgcccgct 6960
ttccagtcgg gaaacctgtc gtgccagctg cattaatgaa tcggccaacg cgcggggaga 7020
ggcggtttgc gtattgggcg ctcttccgct tcctcgctca ctgactcgct gcgctcggtc 7080
gttcggctgc ggcgagcggt atcagctcac tcaaaggcgg taatacggtt atccacagaa 7140
tcaggggata acgcaggaaa gaacatgtga gcaaaaggcc agcaaaaggc caggaaccgt 7200
aaaaaggccg cgttgctggc gtttttccat aggctccgcc cccctgacga gcatcacaaa 7260
aatcgacgct caagtcagag gtggcgaaac ccgacaggac tataaagata ccaggcgttt 7320
ccccctggaa gctccctcgt gcgctctcct gttccgaccc tgccgcttac cggatacctg 7380
tccgcctttc tcccttcggg aagcgtggcg ctttctcaat gctcacgctg taggtatctc 7440
agttcggtgt aggtcgttcg ctccaagctg ggctgtgtgc acgaaccccc cgttcagccc 7500
gaccgctgcg ccttatccgg taactatcgt cttgagtcca acccggtaag acacgactta 7560
tcgccactgg cagcagccac tggtaacagg attagcagag cgaggtatgt aggcggtgct 7620
acagagttct tgaagtggtg gcctaactac ggctacacta gaaggacagt atttggtatc 7680
tgcgctctgc tgaagccagt taccttcgga aaaagagttg gtagctcttg atccggcaaa 7740
caaaccaccg ctggtagcgg tggttttttt gtttgcaagc agcagattac gcgcagaaaa 7800
aaaggatctc aagaagatcc tttgatcttt tctacggggt ctgacgctca gtggaacgaa 7860
aactcacgtt aagggatttt ggtcatgaga ttatcaaaaa ggatcttcac ctagatcctt 7920
ttaaattaaa aatgaagttt taaatcaatc taaagtatat atgagtaaac ttggtctgac 7980
agttaccaat gcttaatcag tgaggcacct atctcagcga tctgtctatt tcgttcatcc 8040
atagttgcct gactccccgt cgtgtagata actacgatac gggagggctt accatctggc 8100
cccagtgctg caatgatacc gcgagaccca cgctcaccgg ctccagattt atcagcaata 8160
aaccagccag ccggaagggc cgagcgcaga agtggtcctg caactttatc cgcctccatc 8220
cagtctatta attgttgccg ggaagctaga gtaagtagtt cgccagttaa tagtttgcgc 8280
aacgttgttg ccattgctac aggcatcgtg gtgtcacgct cgtcgtttgg tatggcttca 8340
ttcagctccg gttcccaacg atcaaggcga gttacatgat cccccatgtt gtgcaaaaaa 8400
gcggttagct ccttcggtcc tccgatcgtt gtcagaagta agttggccgc agtgttatca 8460
ctcatggtta tggcagcact gcataattct cttactgtca tgccatccgt aagatgcttt 8520
tctgtgactg gtgagtactc aaccaagtca ttctgagaat agtgtatgcg gcgaccgagt 8580
tgctcttgcc cggcgtcaat acgggataat accgcgccac atagcagaac tttaaaagtg 8640
ctcatcattg gaaaacgttc ttcggggcga aaactctcaa ggatcttacc gctgttgaga 8700
tccagttcga tgtaacccac tcgtgcaccc aactgatctt cagcatcttt tactttcacc 8760
agcgtttctg ggtgagcaaa aacaggaagg caaaatgccg caaaaaaggg aataagggcg 8820
acacggaaat gttgaatact catactcttc ctttttcaat attattgaag catttatcag 8880
ggttattgtc tcatgagcgg atacatattt gaatgtattt agaaaaataa acaaataggg 8940
gttccgcgca catttccccg aaaagtgcca cctgacgtcg acggatcggg agatctcccg 9000
atcccctatg gtcgactctc agtacaatct gctctgatgc cgcatagtta agccagtatc 9060
tgctccctgc ttgtgtgttg gaggtcgctg agtagtgcgc gagcaaaatt taagctacaa 9120
caaggcaagg cttgaccgac aattgcatga agaatctgct tagggttagg cgttttgcgc 9180
tgcttcgcga tgtacgggcc agatatacgc gttgacattg attattgact agttattaat 9240
agtaatcaat tacggggtca ttagttcata gcccatatat ggagttccgc gttacataac 9300
ttacggtaaa tggcccgcct ggctgaccgc ccaacgaccc ccgcccattg acgtcaataa 9360
tgacgtatgt tcccatagta acgccaatag ggactttcca ttgacgtcaa tgggtggagt 9420
atttacggta aactgcccac ttggcagtac atcaagtgta tcatatgcca agtacgcccc 9480
ctattgacgt caatgacggt aaatggcccg cctggcatta tgcccagtac atgaccttat 9540
gggactttcc tacttggcag tacatctacg tattagtcat cgctattacc atggtgatgc 9600
ggttttggca gtacatcaat gggcgtggat agcggtttga ctcacgggga tttccaagtc 9660
tccaccccat tgacgtcaat gggagtttgt tttggcacca aaatcaacgg gactttccaa 9720
aatgtcgtaa caactccgcc ccattgacgc aaatgggcgg taggcgtgta cggtgggagg 9780
tctatataag cagagctcgt ttagtgaacc gtcagatcgc ctggagacgc catccacgct 9840
gttttgacct ccatagaaga caccgggacc gatccagcct ccggactcta gaggatcgaa 9900
ccctt 9905
<210> 39
<211> 40556
<212> DNA
<213> Artificial Sequence
<220>
<223> pMR10Y_COVAX191_delN
<400> 39
atgaatcgga cgtttgaccg gaaggcatac aggcaagaac tgatcgacgc ggggttttcc 60
gccgaggatg ccgaaaccat cgcaagccgc accgtcatgc gtgcgccccg cgaaaccttc 120
cagtccgtcg gctcgatggt ccagcaagct acggccaaga tcgagcgcga cagcgtgcaa 180
ctggctcccc ctgccctgcc cgcgccatcg gccgccgtgg agcgttcgcg tcgtctcgaa 240
caggaggcgg caggtttggc gaagtcgatg accatcgaca cgcgaggaac tatgacgacc 300
aagaagcgaa aaaccgccgg cgaggacctg gcaaaacagg tcagcgaggc caagcaggcc 360
gcgttgctga aacacacgaa gcagcagatc aaggaaatgc agctttcctt gttcgatatt 420
gcgccgtggc cggacacgat gcgagcgatg ccaaacgaca cggcccgctc tgccctgttc 480
accacgcgca acaagaaaat cccgcgcgag gcgctgcaaa acaaggtcat tttccacgtc 540
aacaaggacg tgaagatcac ctacaccggc gtcgagctgc gggccgacga tgacgaactg 600
gtgtggcagc aggtgttgga gtacgcgaag cgcaccccta tcggcgagcc gatcaccttc 660
acgttctacg agctttgcca ggacctgggc tggtcgatca atggccggta ttacacgaag 720
gccgaggaat gcctgtcgcg cctacaggcg acggcgatgg gcttcacgtc cgaccgcgtt 780
gggcacctgg aatcggtgtc gctgctgcac cgcttccgcg tcctggaccg tggcaagaaa 840
acgtcccgtt gccaggtcct gatcgacgag gaaatcgtcg tgctgtttgc tggcgaccac 900
tacacgaaat tcatatggga gaagtaccgc aagctgtcgc cgacggcccg acggatgttc 960
gactatttca gctcgcaccg ggagccgtac ccgctcaagc tggaaacctt ccgcctcatg 1020
tgcggatcgg attccacccg cgtgaagaag tggcgcgagc aggtcggcga agcctgcgaa 1080
gagttgcgag gcagcggcct ggtggaacac gcctgggtca atgatgacct ggtgcattgc 1140
aaacgctagg gccttgtggg gtcagttccg gctgggggtt cagcagccac tcgatcgagg 1200
tcccaatacg caaaccgcct ctccccgcgc gttggccgat tcattaatgc agctggcacg 1260
acaggtttcc cgactggaaa gcgggcagtg agcgcaacgc aattaatgtg agttagctca 1320
ctcattaggc accccaggct ttacacttta tgcttccggc tcgtatgttg tgtggaattg 1380
tgagcggata acaatttcac acaggaaaca gctatgacca tgattacgcc aagcttccat 1440
gggatatcga gatctcctgc agagctctag agtcgagact agtctcgacg ggcccggtac 1500
cccctcgagg gggccgcact taagttacgc gtggatcgtg gagctttcgg gttttaacta 1560
taacggtcct aaggtagcga actcgggtct tgccttaatc ccaacaaccg gattatctac 1620
acggatttca atagctgata tagcgaatca ccgagattaa ttaataatac gactcactat 1680
agtataagag tgattggcgt ccgtacgtac cctctcaact ctaaaactct tgtagtttaa 1740
atctaatcta aactttataa acggcacttc ctgcgtgtcc atgcccgcgg gcctggtctt 1800
gtcatagtgc tgacatttgt agttccttga ctttcgttct ctgccagtga cgtgtccatt 1860
cggcgccagc agcccaccca taggttgcat aatggcaaag atgggcaaat acggcctggg 1920
cttcaaatgg gccccagaat ttccatggat gcttccgaac gcatcggaga agttgggtaa 1980
ccctgagagg tcagaggagg atgggttttg cccctctgct gcgcaagaac cgaaagttaa 2040
aggaaaaact ttggttaatc acgtgagggt gaattgtagc cggcttccag ctttggaatg 2100
ctgtgttcag tctgccataa tccgtgatat ttttgtagat gaggatcccc agaaggtgga 2160
ggcctcaact atgatggcat tgcagttcgg tagtgccgtc ttggttaagc catccaagcg 2220
cttgtctatt caggcatgga ctaatttggg tgtgcttccc aaaacagctg ccatggggtt 2280
gttcaagcgc gtctgcctgt gtaacaccag ggagtgctct tgtgacgccc acgtggcctt 2340
tcaccttttt acggtccaac ccgatggtgt atgcctgggt aatggccgtt ttataggctg 2400
gttcgttcca gtcacagcca taccggagta tgcgaagcag tggttgcaac cctggtccat 2460
ccttcttcgt aagggtggta acaaagggtc tgtgacatcc ggccacttcc gccgcgctgt 2520
taccatgcct gtgtatgact ttaatgtaga ggatgcttgt gaggaggttc atcttaaccc 2580
gaagggtaag tactcctgca aggcgtatgc cctgctgaag ggctatcgcg gtgttaagcc 2640
catcctgttt gtggaccagt atggttgcga ctatactgga tgtctcgcca agggtcttga 2700
ggactatggc gatctcacct tgagtgagat gaaggagttg ttccctgtgt ggcgtgactc 2760
cttggatagt gaagtccttg tggcttggca cgttgatcga gatcctcggg ctgctatgcg 2820
tctgcagact cttgctactg tacgttgcat tgattatgtg ggccaaccga ccgaggatgt 2880
ggtggatgga gatgtggtag tgcgtgagcc tgctcatctt ctcgcagcca atgccattgt 2940
taaaagactc ccccgtttgg tggagactat gctgtatacg gattcgtccg ttacagaatt 3000
ctgttataaa accaagctgt gtgaatgcgg ttttatcacg cagtttggct atgtggattg 3060
ttgtggtgac acctgtgatt ttcgtgggtg ggttgccggc aatatgatgg atggctttcc 3120
atgtccaggg tgtaccaaaa attatatgcc ctgggaattg gaggcccagt catcaggtgt 3180
tataccagaa ggaggtgttc tattcactca gagcactgat acagtgaatc gtgagtcctt 3240
taagctctac ggtcatgctg ttgtgccttt tggttctgct gtgtattgga gcccttgccc 3300
aggtatgtgg cttccagtaa tttggtcgtc ggttaagtca tactctggtt tgacttatac 3360
aggagtagtt ggttgtaagg caattgttca agagacagac gctatatgtc gttctctgta 3420
tatggattat gtccagcaca agtgtggcaa tctcgagcag agagctatcc ttggattgga 3480
cgatgtctat catagacagt tgcttgtgaa taggggtgac tatagtctcc tccttgagaa 3540
tgtggatttg tttgttaagc ggcgcgctga atttgcttgc aaattcgcca cctgtggaga 3600
tggtcttgta cccctcctac tagatggttt agtgccccgc agttattatt tgattaagag 3660
tggtcaagct ttcacctcta tgatggttaa ttttagccat gaggtgactg acatgtgtat 3720
ggacatggct ttattgttca tgcatgatgt taaagtggcc actaagtatg ttaagaaggt 3780
tactggcaaa ctggccgtgc gctttaaagc gttgggtgta gccgttgtca gaaaaattac 3840
tgaatggttt gatttagccg tggacattgc tgctagtgcc gctggatggc tttgctacca 3900
gctggtaaat ggcttatttg cagtggccaa tggtgttata acctttgtac aggaggtgcc 3960
tgagcttgtc aagaattttg ttgacaagtt caaggcattt ttcaaggttt tgatcgactc 4020
tatgtcggtt tctatcttgt ctggacttac tgttgtcaag actgcctcaa atagggtgtg 4080
tcttgctggc agtaaggttt atgaagttgt gcagaaatct ttgtctgcat atgttatgcc 4140
tgtgggttgc agcgaagcca cttgtttggt gggtgagatt gaacctgcag tttttgaaga 4200
tgatgttgtt gatgtggtta aagccccatt aacatatcaa ggctgttgta agccacccac 4260
ttctttcgag aagatttgta ttgtggataa attgtatatg gccaagtgtg gtgatcaatt 4320
ttaccctgtg gttgttgata acgacactgt tggcgtgtta gatcagtgct ggaggtttcc 4380
ctgtgcgggc aagaaagtcg agtttaacga caagcccaaa gtcaggaaga taccctccac 4440
ccgtaagatt aagatcacct tcgcactgga tgcgaccttt gatagtgttc tttcgaaggc 4500
gtgttcagag tttgaagttg ataaagatgt tacattggat gagctgcttg atgttgtgct 4560
tgacgcagtt gagagtacgc tcagcccttg taaggagcat gatgtgatag gcacaaaagt 4620
ttgtgcttta cttgataggt tggcaggaga ttatgtctat ctttttgatg agggaggcga 4680
tgaagtgatc gccccgagga tgtattgttc cttttctgct cctgatgacg aggactgcgt 4740
tgcagcggat gttgtagatg cagatgaaaa ccaagatgat gatgccgagg actcagcagt 4800
ccttgtcgct gatacccaag aagaggacgg cgttgccaag gggcaggttg aggcggattc 4860
ggaaatttgc gttgcgcata ctggtagtca agaagaattg gctgagcctg atgctgtcgg 4920
atctcaaact cccatcgcct ctgctgagga aaccgaagtc ggagaggcaa gcgacaggga 4980
agggattgct gaggcgaagg caactgtgtg tgctgatgct gtagatgcct gccccgatca 5040
agtggaggca tttgaaattg aaaaggtcga ggactctatc ttggatgagc ttcaaactga 5100
acttaatgcg ccagcggaca agacctatga ggatgtcttg gcattcgatg ccgtatgctc 5160
agaggcgttg tctgcattct atgctgtgcc gagtgatgag acgcacttta aagtgtgtgg 5220
attctattcg cctgctatag agcgcactaa ttgttggctg cgttctactt tgatagtaat 5280
gcagagtcta cctttggaat ttaaagactt ggagatgcaa aagctctggt tgtcttacaa 5340
ggccggctat gaccaatgct ttgtggacaa actagttaag agcgtgccca agtctattat 5400
ccttccacaa ggtggttatg tggcagattt tgcctatttc tttctaagcc agtgtagctt 5460
taaagcttat gctaactggc gttgtttaga gtgtgacatg gagttaaagc ttcaaggctt 5520
ggacgccatg tttttctatg gggacgttgt gtctcatatg tgcaagtgtg gtaatagcat 5580
gaccttgttg tctgcagata taccctacac tttgcatttt ggagtgcgag atgataagtt 5640
ttgcgctttt tacacgccaa gaaaggtctt tagggctgct tgtgcggtag atgttaatga 5700
ttgtcactct atggctgtag tagagggcaa gcaaattgat ggtaaagtgg ttaccaaatt 5760
tattggtgac aaatttgatt ttatggtggg ttacgggatg acatttagta tgtctccttt 5820
tgaactcgcc cagttatatg gttcatgtat aacaccaaat gtttgttttg ttaaaggaga 5880
tgttataaag gttgttcgct tagttaatgc tgaagtcatt gttaaccctg ctaatgggcg 5940
tatggctcat ggtgccggcg tcgccggcgc catagctgaa aaggcgggca gtgcttttat 6000
taaagaaacc tccgatatgg tgaaggctca gggcgtttgc caggttggtg aatgctatga 6060
atctgccggt ggtaagttat gtaaaaaggt gcttaacatt gtagggccag atgcgcgagg 6120
gcatggcaag caatgctatt cacttttaga gcgtgcttat cagcatatta ataagtgtga 6180
caatgttgtc actactttaa tttcggctgg tatatttagt gtgcctactg atgtctccct 6240
aacttactta cttggtgtag tgacaaagaa tgtcattctt gtcagtaaca accaggatga 6300
ttttgatgtg atagagaagt gtcaggtgac ctccgttgct ggtaccaaag cgctatcact 6360
tcaattggcc aaaaatttgt gccgtgatgt aaagtttgtg acgaatgcat gtagttcgct 6420
ttttagtgaa tcttgctttg tctcaagcta tgatgtgttg caggaagttg aagcgctgcg 6480
acatgatata caattggatg atgatgctcg tgtctttgtg caggctaata tggactgtct 6540
gcccacagac tggcgtctcg ttaacaaatt tgatagtgtt gatggtgtta gaaccattaa 6600
gtattttgaa tgcccgggcg ggatttttgt atccagccag ggcaaaaagt ttggttatgt 6660
tcagaatggt tcatttaagg aggcgagtgt tagccaaata agggctttac tcgctaataa 6720
ggttgatgtc ttgtgtactg ttgatggtgt taacttccgc tcctgctgcg tagcagaggg 6780
tgaagttttt ggcaagacat taggttcagt cttttgtgat ggcataaatg tcaccaaagt 6840
taggtgtagt gccatttaca agggtaaggt tttctttcag tacagtgatt tgtccgaggc 6900
agatcttgtg gctgttaaag atgcctttgg ttttgatgaa ccacaactgc tgaagtacta 6960
cactatgctt ggcatgtgta agtggccagt agttgtttgt ggcaattatt ttgctttcaa 7020
gcagtcaaat aataattgct acatcaacgt ggcatgttta atgctgcaac acttgagttt 7080
aaagtttcct aagtggcaat ggcaagaggc ttggaacgag ttccgctctg gtaaaccact 7140
aaggtttgtg tccttggtat tagcaaaggg cagctttaaa tttaatgaac cttctgattc 7200
tatcgatttt atgcgtgtgg tgctacgtga agcagatttg agtggtgcca cgtgcaattt 7260
ggaatttgtt tgtaaatgtg gtgtgaagca agagcagcgc aaaggtgttg acgctgttat 7320
gcattttggt acgttggata aaggtgatct tgtcaggggt tataatatcg catgtacgtg 7380
cggtagtaaa cttgtgcatt gcacccaatt taacgtacca tttttaattt gctccaacac 7440
accagagggt aggaaactgc ccgacgatgt tgttgcagct aatattttta ctggtggtag 7500
tgtgggccat tacacgcatg tgaaatgtaa acccaagtac cagctttatg atgcttgtaa 7560
tgttaataag gtttcggagg ctaagggtaa ttttaccgat tgcctctacc ttaaaaattt 7620
aaagcaaacc ttctcgtctg tgctgacgac tttttattta gatgacgtaa agtgtgtgga 7680
gtataagcca gatttatcgc agtattactg tgagtctggt aaatattata caaaacccat 7740
tattaaggcc caatttagaa catttgagaa ggttgatggt gtctatacca actttaaatt 7800
ggtgggacat agtattgctg aaaaactcaa tgctaagctg ggatttgatt gtaattctcc 7860
ctttgtggag tataaaatta cagagtggcc aacagctact ggagatgtgg tgttggctag 7920
tgatgatttg tatgtaagtc ggtacttaag cgggtgcatt acttttggta aaccggttgt 7980
ctggcttggc catgaggaag catcgctgaa atctctcaca tattttaata gacctagtgt 8040
cgtttgtgaa aataaattta acgtgttgcc cgttgatgtc agtgaaccca cggacaaggg 8100
gcctgtgcct gctgcagtcc ttgttaccgg cgtccctgga gctgatgcgt cagctggtgc 8160
cggtattgcc aaggagcaaa aagcctgtgc ttctgctagt gtggaggatc aggttgttac 8220
ggaggttcgt caagagccat ctgtttcagc tgctgatgtc aaagaggtta aattgaatgg 8280
tgttaaaaag cctgttaagg tggaaggtag tgtggttgtt aatgatccca ctagcgaaac 8340
caaagttgtt aaaagtttgt ctattgttga tgtctatgat atgttcctga cagggtgtaa 8400
gtatgtggtt tggactgcta atgagttgtc tcgactagta aattcaccga ctgttaggga 8460
gtatgtgaag tggggtatgg gaaagattgt aacacccgct aagttgttgt tgttaagaga 8520
tgagaagcaa gagttcgtag cgccaaaagt agtcaaggcg aaagctattg cctgctattg 8580
tgctgtgaag tggtttctcc tctattgttt tagttggata aagtttaata ctgacaataa 8640
ggttatatac accacagaag tagcttcaaa gcttactttc aagttgtgct gtttggcctt 8700
taagaatgcc ttacagacgt ttaattggag cgttgtgtct aggggctttt tcctagttgc 8760
aacggtcttt ttactctggt ttaacttttt gtatgctaat gttattttga gtgacttcta 8820
tttgcctaat attgggcctc tccctacgtt tgtgggacag atagttgcgt ggtttaagac 8880
tacatttggt gtgtcaacca tctgtgattt ctaccaggtg acggatttgg gctatagaag 8940
ttcgttttgt aatggaagta tggtatgtga actatgcttc tcaggttttg atatgctgga 9000
caactatgat gctataaatg ttgttcaaca cgttgtagat aggcgtttgt cctttgacta 9060
tattagccta tttaaactgg tagttgagct tgtaatcggc tactctcttt atactgtgtg 9120
cttctaccca ctgtttgtcc ttattggaat gcagttattg accacatggt tgcctgaatt 9180
ctttatgctg gagactatgc attggagtgc tcgtttgttt gtgtttgttg ccaatatgct 9240
tccagctttt acgttactgc gattttacat cgtggtgaca gctatgtata aggtctattg 9300
tctttgtaga catgttatgt atggatgtag taagcctggt tgcttgtttt gttataagag 9360
aaaccgtagt gtccgtgtta agtgtagcac cgttgttggt ggttcactac gctattacga 9420
tgtaatggct aacggcggca caggtttctg tacaaagcac cagtggaact gtcttaattg 9480
caattcctgg aaaccaggca atacattcat aactcatgaa gcagcggcgg acctctctaa 9540
ggagttgaaa cgccctgtga atccaacaga ttctgcttat tactcggtca cagaggttaa 9600
gcaggttggt tgttccatgc gtttgttcta cgagagagat ggacagcgtg tttatgatga 9660
tgttaatgct agtttgtttg tggacatgaa tggtctgctg cattctaaag ttaaaggtgt 9720
gcctgaaacg catgttgtgg ttgttgagaa tgaagctgat aaagctggtt ttctcggcgc 9780
cgcagtgttt tatgcacaat cgctctacag acctatgttg atggtggaaa agaaattaat 9840
aactaccgcc aacactggtt tgtctgttag tcgaactatg tttgaccttt atgtagattc 9900
attgctgaac gtcctcgacg tggatcgcaa gagtctaaca agttttgtaa atgctgcgca 9960
caactctcta aaggagggtg ttcagcttga acaagttatg gataccttta ttggctgtgc 10020
ccgacgtaag tgtgctatag attctgatgt tgaaaccaag tctattacca agtccgtcat 10080
gtcggcagta aatgctggcg ttgattttac ggatgagagt tgtaataact tggtgcctac 10140
ctatgttaaa agtgacacta tcgttgcagc cgatttgggt gttcttattc agaataatgc 10200
taagcatgta caggctaatg ttgctaaagc cgctaatgtg gcttgcattt ggtctgtgga 10260
tgcttttaac cagctatctg ctgacttaca gcataggctg cgaaaagcat gttcaaaaac 10320
tggcttgaag attaagctta cttataataa gcaggaggca aatgttccta ttttaactac 10380
accgttctct cttaaagggg gcgctgtttt tagtagaatg ttacaatggt tgtttgttgc 10440
taatttgatt tgtttcattg tgttgtgggc ccttatgcca acatatgcag tgcacaaatc 10500
ggatatgcag ttgcctttat atgccagttt taaagttata gataacggtg tgctaaggga 10560
tgtgtctgtt actgacgcat gcttcgcaaa caaatttaat caattcgacc aatggtatga 10620
gtctactttt ggtcttgctt attaccgcaa ctctaaggct tgtcctgttg tggttgctgt 10680
aatagatcaa gacattggcc ataccttatt taatgttcct accacagttt taagatatgg 10740
atttcatgtg ttgcatttta taacccatgc atttgctact gatagcgtgc agtgttacac 10800
gccacatatg caaatcccct atgataattt ctatgctagt ggttgcgtgt tgtcatccct 10860
ctgtactatg cttgcgcatg cagatggaac cccgcatcct tattgttata cagggggtgt 10920
tatgcataat gcctctctgt atagttcttt ggctcctcat gtccgttata acctggctag 10980
ttcaaatggt tatatacgtt ttcccgaagt ggttagtgaa ggcattgtgc gtgttgtgcg 11040
cactcgctct atgacctact gcagggttgg tttatgtgag gaggccgagg agggtatctg 11100
ctttaatttt aatcgttcat gggtattgaa caacccgtat tatagggcca tgcctggaac 11160
tttttgtggt aggaatgctt ttgatttaat acatcaagtt ttaggaggat tagtgcggcc 11220
tattgatttc tttgccttaa cggcgagttc agtggctggt gctatccttg caattattgt 11280
cgttttggct ttctattatt taatcaagct taagcgtgcc tttggtgact acactagtgt 11340
tgtggttatc aatgtaattg tgtggtgtat aaattttctg atgctttttg tgtttcaggt 11400
ttatcccaca ttgtcttgtt tatatgcttg tttctacttc tacaccacgc tttatttccc 11460
ttcggagata agtgttgtta tgcatttgca atggcttgtc atgtatggtg ctattatgcc 11520
cttgtggttt tgcattattt acgtggcagt cgttgtttca aaccatgcat tgtggttgtt 11580
ctcttactgc cgcaaaattg gtaccgaggt tcgtagtgac ggcacatttg aggaaatggc 11640
ccttactacc tttatgatta ctaaagaatc ttattgtaag ttgaaaaact ctgtttctga 11700
tgttgctttt aacaggtact tgagtcttta caacaagtac cgttacttca gtggcaaaat 11760
ggatactgcc gcttatagag aggctgcctg ttcacaactg gcaaaggcaa tggaaacatt 11820
taaccataat aatggtaatg atgttctcta tcagcctcca accgcctctg ttactacatc 11880
atttttacag tctggtatag tgaagatggt gtcgcccacc tctaaagtgg agccttgtat 11940
tgttagtgtt acttatggta acatgacact taatgggttg tggttggatg ataaagttta 12000
ttgcccaaga catgttatct gttcttcagc tgacatgaca gaccctgatt atcctaattt 12060
gctttgtaga gtgacatcaa gtgatttttg tgttatgtct ggtcgtatga gccttactgt 12120
aatgtcttat caaatgcagg gctgccaact tgttttgact gttacactgc aaaatcctaa 12180
cacgcctaag tattccttcg gtgttgttaa gcctggtgag acatttactg tactggctgc 12240
atacaatggc agacctcaag gagccttcca tgttacgctt cgtagtagcc ataccataaa 12300
gggctccttt ctatgtggat cctgcggttc tgtaggatat gttttaactg gcgatagtgt 12360
acgatttgtt tatatgcatc agctagagtt gagtactggt tgtcataccg gtactgactt 12420
tagtgggaac ttttatggtc cctatagaga tgcgcaagtt gtacaattgc ctgttcagga 12480
ttatacgcag actgttaatg ttgtagcttg gctttatgct gctattttta acagatgcaa 12540
ctggtttgtg caaagtgata gttgttccct ggaggagttt aatgtttggg ctatgaccaa 12600
tggttttagc tcaatcaaag ccgatcttgt cttggatgcg cttgcttcta tgacaggcgt 12660
tacagttgaa caggtgttgg ccgctattaa gaggctgcat tctggattcc agggcaaaca 12720
aattttaggt agttgtgtgc ttgaagatga gctgacacca agtgatgttt atcaacaact 12780
agctggtgtc aagctacagt caaagcgcac aagagttata aaaggtacat gttgctggat 12840
attggcttca acgtttttgt tctgtagcat tatctcagca tttgtaaaat ggactatgtt 12900
tatgtatgtt actacccata tgttgggagt gacattgtgt gcactttgtt ttgtaagctt 12960
tgctatgttg ttgatcaagc ataagcattt gtatttaact atgtacatca tgcctgtgtt 13020
atgcacactg ttttacacca actatttggt tgtgtacaaa cagagtttta gaggtctagc 13080
ttatgcttgg ctttcacact ttgtccctgc tgtagattat acatatatgg atgaagtttt 13140
atatggtgtt gtgttgctag tagctatggt gtttgttacc atgcgtagca taaaccacga 13200
cgtcttttct attatgttct tggttggtag acttgtcagc ctggtatcca tgtggtattt 13260
tggagccaat ttagaggaag aggtactatt gttcctcaca tccctatttg gcacgtacac 13320
atggactact atgttgtcat tggctaccgc taaggttatt gctaaatggt tggctgtgaa 13380
tgtcttgtac ttcacagacg taccgcaaat taaattagtt ctgttgagct acttgtgtat 13440
tggttatgtg tgttgttgtt attggggaat cttgtcactc cttaatagca tttttaggat 13500
gccattgggc gtctacaatt ataaaatctc cgttcaggag ttacgttata tgaatgctaa 13560
tggcttgcgc ccacctagaa atagttttga ggccctgatg cttaatttta agctgttggg 13620
aattggtggt gtgccagtca ttgaagtatc tcaaattcaa tcaagattga cggatgttaa 13680
atgtgctaat gttgtgttgc ttaattgcct ccagcacttg catattgcat ctaattctaa 13740
gttgtggcag tattgtagta ctttgcacaa tgaaatactg gctacatctg atttgagcgt 13800
ggccttcgat aagttggctc aactcttagt tgttttattt gctaatccag cagcagtgga 13860
tagcaagtgc cttgcaagta ttgaagaagt gagcgatgat tacgttcgcg acaatactgt 13920
cttgcaagcc ttacagagtg aatttgttaa tatggctagc ttcgttgagt atgaacttgc 13980
taagaagaat ctagatgagg ctaaggctag cggctctgcc aatcaacagc agattaagca 14040
gctagagaag gcgtgtaata ttgctaagtc agcatatgag cgcgacagag ctgttgctcg 14100
taagctggaa cgtatggctg atttagctct tacaaacatg tataaagaag ctagaattaa 14160
tgataagaag agtaaggtag tgtctgcatt gcaaaccatg ctctttagta tggtgcgtaa 14220
gctagataac caagctctta attctatttt agacaacgca gttaagggtt gtgtaccttt 14280
gaatgcaata ccatcattga cttcgaacac tctgactata atagtgccag ataagcaggt 14340
ttttgatcag gttgtggata atgtgtatgt cacctatgct gggaatgtat ggcatataca 14400
gtttattcaa gatgctgatg gtgctgttaa acaattgaat gagatagatg ttaattcaac 14460
ctggcctcta gtcattgctg caaataggca taatgaagtg tctactgttg ttttgcagaa 14520
caatgagttg atgcctcaga agttgagaac tcaggttgtc aatagtggct cagatatgaa 14580
ttgtaatact cctacccagt gttactataa tactactggc acgggtaaga ttgtgtatgc 14640
tatacttagt gactgtgacg gcctgaagta cactaagata gtaaaagaag atggaaattg 14700
tgttgttttg gaattggatc ctccctgtaa gttttctgtt caggatgtga agggccttaa 14760
aattaagtac ctttactttg tgaaggggtg taatacactg gctagaggct gggttgtagg 14820
caccttatcc tcgacagtga gattgcaggc gggtacggca actgagtatg cctccaactc 14880
tgcaatactg tcgctgtgtg cgttttctgt agatcctaag aaaacgtact tggattatat 14940
aaaacagggt ggagttcccg ttactaattg tgttaagatg ttatgtgacc atgctggcac 15000
tggtatggcc attactatta agccggaggc aaccactaat caggattctt atggtggtgc 15060
ttccgtttgt atatattgcc gctcgcgtgt tgaacatcca gatgttgatg gattgtgcaa 15120
attacgcggc aagtttgtcc aagtgccctt aggcataaaa gatcctgtgt catatgtgtt 15180
gacgcatgat gtttgtcagg tttgtggctt ttggcgagat ggtagctgtt cctgtgtagg 15240
cacaggctcc cagtttcagt caaaagacac gaacttttta aacggattcg gggtacaagt 15300
gtaaatgccc gtcttgtacc ctgtgccagt ggcttggaca ctgatgttca attaagggca 15360
tttgacattt gtaatgctaa tcgagctggc attggtttgt attataaagt gaattgctgc 15420
cgcttccagc gtgtagatga ggacggcaac aagttggata agttctttgt tgttaaaaga 15480
actaatttag aagtgtataa caaggagaaa gaatgctatg agttgacaaa agaatgcggt 15540
gttgtggctg aacacgagtt cttcacattt gatgtggagg gaagtcgggt accacacata 15600
gtccgtaaag atctttcaaa gtttactatg ttagatcttt gctatgcatt gcgtcatttt 15660
gaccgcaatg attgttcaac tcttaaggaa attctcctta catatgctga gtgtgaagag 15720
tcctacttcc aaaagaagga ctggtatgat tttgttgaga atcctgatat aattaatgtg 15780
tacaagaagc ttggtcctat atttaataga gccctgctta acactgccaa gtttgcagac 15840
gcattagtgg aggcaggctt agtaggtgtt ttaacacttg ataatcaaga tttatatggt 15900
caatggtatg actttggaga ttttgtcaag acagtacctg gttgtggtgt tgccgtggca 15960
gactcttatt attcatatat gatgccaatg ctgactatgt gtcatgcgtt ggatagtgag 16020
ttgtttgtta atggtactta tagggagttt gaccttgttc agtatgattt tactgatttc 16080
aagctagagc tgttcactaa gtattttaag cattggagta tgacctacca cccgaacacc 16140
tgtgagtgcg aggatgacag gtgcattatt cattgcgcca attttaatat acttttcagc 16200
atggtcttac ctaagacctg ttttgggcct cttgttaggc agatatttgt ggatggtgtt 16260
cctttcgttg tgtcgatcgg ttaccattat aaagaattag gtgttgttat gaatatggat 16320
gtggatacac atcgttatcg cttgtctctt aaggacttgc ttttgtatgc tgcagaccct 16380
gcccttcatg tggcgtctgc tagtgcactg cttgatttgc gcacatgttg ttttagcgtt 16440
gcagctatta caagtggcgt aaaatttcaa acagttaaac ctggaaattt taatcaggat 16500
ttctacgagt ttattttgag taaaggcctg cttaaagagg ggagctccgt tgatttgaag 16560
cacttcttct ttacgcagga tggtaatgct gctattactg attacaatta ctacaagtat 16620
aatctaccca ccatggtgga tattaagcag ttgttgtttg ttttagaagt tgttaataag 16680
tacttcgaga tctatgaggg tgggtgtata cccgcaacac aggtcattgt taataattat 16740
gacaagagtg ctggctatcc atttaataaa tttggaaagg ccaggctcta ttatgaggca 16800
ttatcatttg aggagcagga tgaaatttat gcgtatacca aacgcaatgt cctgccgacc 16860
ctaactcaaa tgaatcttaa atatgctatt agtgctaaga atagggcccg caccgttgct 16920
ggtgtctcta ttctcagtac tatgactggc agaatgtttc atcaaaagtg tctaaagagt 16980
atagcagcta ctcgcggtgt tcctgtagtt ataggcacca cgaagttcta tggcggttgg 17040
gatgatatgt tacgccgcct tattaaagat gttgatagtc ctgtactcat gggttgggac 17100
tatcctaaat gtgatcgtgc tatgccaaac atactgcgta ttgttagtag tttggtgcta 17160
gcccgtaaac atgattcgtg ctgttcgcat acggatagat tctatcgtct tgcgaacgag 17220
tgcgcccaag ttttgagtga aattgttatg tgtggtggtt gttattatgt taaaccaggt 17280
ggcactagta gtggggatgc aaccactgct tttgctaatt ctgtgtttaa catttgtcaa 17340
gctgtttccg ccaatgtatg ctcgcttatg gcatgcaatg gacacaaaat tgaagatttg 17400
agtatacgcg agttacaaaa gcgcctatac tctaatgtct atcgtgcgga ccatgttgac 17460
cccgcatttg ttagtgagta ttatgagttt ttaaacaagc attttagtat gatgattttg 17520
agtgatgatg gtgttgtgtg ttataattca gagtttgcgt ccaagggtta tattgctaat 17580
ataagtgcct ttcaacaggt attatattat caaaacaacg tgtttatgtc tgaggccaaa 17640
tgttgggtag aaacagacat cgaaaaggga ccgcatgaat tttgttctca acatacaatg 17700
ctagtcaaga tggatggtga tgaagtctac cttccatacc ctgatccttc gagaatctta 17760
ggagcaggct gttttgttga tgatttactc aagactgata gcgttctctt gatagagcgt 17820
ttcgtaagtc ttgcaattga tgcttatcct ttagtatacc atgagaaccc agagtatcaa 17880
aatgtgttcc gggtatattt agaatacatc aagaagctgt acaatgatct cggtaatcag 17940
atcctggaca gctacagtgt tattttaagt acttgtgatg gtcaaaagtt tactgacgag 18000
acgttttaca agaacatgta tttaagaagt gcagtgctgc aaagcgttgg tgcctgcgtt 18060
gtctgtagtt ctcaaacatc attacgttgt ggcagttgca tacgcaagcc tttgctgtgt 18120
tgcaaatgcg cctatgatca tgttatgtcc actgatcata aatatgtcct gagtgtgtca 18180
ccatatgtgt gtaattcacc gggatgtgat gtaaatgatg ttaccaaatt gtatttaggt 18240
ggtatgtcat attattgtga ggaccataaa ccacagtatt cattcaaatt ggtgatgaat 18300
ggtatggttt ttggtttata taagcagtct tgtactggtt cgccctacat agaggatttt 18360
aataaaatcg ctagttgcaa atggacagaa gtcgatgatt atgtgctagc taatgaatgc 18420
accgaacgcc ttaaattgtt tgccgcagaa acgcagaagg ccacagaaga ggcctttaag 18480
caatgttatg cgtcagcaac gatccgtgag atcgtgagcg atcgggagtt aattttatct 18540
tgggaaattg gtaaagtccg cccgccactt aataaaaatt acgtgttcac cggctaccat 18600
tttactaata atggtaagac agttttaggt gagtatgttt ttgataagag tgagttgact 18660
aatggtgtgt attatcgcgc cacaaccact tataagttat ctgtaggtga tgtgttcatt 18720
ttaacatcac acgcagtgtc tagtttaagt gctcctacat tagtaccgca ggagaattat 18780
actagcattc gttttgctag tgtttatagt gtgcctgaga cgtttcagaa taatgtgcct 18840
aattatcagc acattggaat gaagcgctat tgtactgtac agggaccgcc tggtactggt 18900
aagtcccatc tagccattgg gctagctgtt tattattgta cagcgcgcgt ggtgtatacc 18960
gctgctagcc atgctgcagt tgacgcgctg tgtgaaaagg cacataaatt tctcaacatc 19020
aacgactgca cgcgtattgt tcctgcaaag gtgcgtgtag attgttatga taaattcaag 19080
gtcaatgaca ccactcgcaa gtatgtgttt actacaataa atgcattacc tgagttggtg 19140
actgacatta ttgtcgttga tgaagttagt atgcttacca actatgagct gtctgttatt 19200
aacagtcgtg ttagggctaa gcattatgtg tatattggcg acccggcgca gttacctgca 19260
ccacgtgtgc tactgaataa gggaactcta gaacctagat attttaattc cgttaccaag 19320
ctaatgtgtt gtttgggtcc agatattttc ttgggcacct gttatagatg ccctaaggag 19380
attgtggata cggtgtcagc cttggtttat aataataagc tgaaggctaa aaatgataat 19440
agctccatgt gctttaaggt ttattataag ggccagacta cacatgagag ttctagtgct 19500
gttaatatgc agcaaataca tttaatttcc aagtttctga aggcaaaccc cagttggagt 19560
aacgccgtat ttattagtcc ttataactcg cagaactatg ttgctaagag agtcttggga 19620
ttacaaaccc agacagtaga ctcagcgcag ggttctgaat atgattttgt tatctactca 19680
cagactgcgg aaacagcgca ttctgtcaat gtaaatagat tcaatgttgc tattacacgt 19740
gctaagaagg gtattctctg tgtcatgagt agtatgcaat tatttgagtc tcttaatttt 19800
actacactga cgttggataa gattaacaat ccacgattac agtgtactac aaatttgttt 19860
aaggattgta gcaggagcta tgtaggatat cacccagccc atgcaccatc ctttttggca 19920
gttgatgaca aatataaggt aggcggtgat ttagccgttt gccttaatgt tgctgattct 19980
gctgtcactt attcgcggct tatatcactc atgggattca agcttgactt gacccttgat 20040
ggttattgta agctgtttat aactagagat gaagctatca aacgtgttag agcctgggtt 20100
ggcttcgatg cagaaggtgc ccatgcgata cgtgatagca ttgggacaaa tttcccatta 20160
caattaggct tttcgactgg aattgatttt gttgtcgaag ccactggaat gtttgctgag 20220
agagatggtt atgtctttaa aaaggcagcc gcacgagctc ctcctggcga acaatttaaa 20280
caccttatcc cacttatgtc aagagggcag aaatgggatg tggttcgcat tagaatagta 20340
caaatgttgt cagaccacct agtggatttg gcagacagtg ttgtacttgt gacgtgggct 20400
gccagctttg agctcacatg tttgcgatat ttcgctaaag ttggaagaga agttgtgtgt 20460
agtgtctgca ccaagcgtgc gacatgtttt aattctagaa ctggatacta tggatgctgg 20520
cgacatagtt attcctgtga ttacctgtac aacccactaa tagttgacat tcaacagtgg 20580
ggatatacag gatctttaac tagcaatcat gatcctattt gcagcgtgca taagggtgct 20640
catgttgcat catctgatgc tatcatgacc cggtgtctag ctgttcatga ttgcttttgt 20700
aagtctgtta attggaattt agaatacccc attatttcaa atgaggtcag tgttaatacc 20760
tcctgcaggt tattgcagcg cgtaatgttt agggctgcga tgctatgcaa taggtatgat 20820
gtgtgttatg acattggcaa ccctaaaggt cttgcctgtg tcaaaggata tgattttaag 20880
ttctatgacg cctcccctgt tgttaagtct gttaaacagt ttgtttacaa atacgaggca 20940
cataaagatc aatttttaga tggtttgtgt atgttttgga actgcaatgt ggataagtat 21000
ccagcgaatg cagttgtgtg taggtttgac acgcgtgtgt tgaacaaatt aaatctccct 21060
ggctgtaatg gtggcagttt gtatgttaac aaacatgcat tccacaccag tccctttacc 21120
cgggctgcct tcgagaattt gaagcctatg cctttctttt attattcaga tacgccctgt 21180
gtgtatatgg aaggcatgga atctaagcag gtcgattatg tcccattgag aagcgctaca 21240
tgcatcacaa gatgcaattt aggtggcgct gtttgtttaa aacatgctga ggagtatcgt 21300
gagtaccttg agtcttacaa tacggcaacc acagcgggtt ttactttttg ggtctataag 21360
acttttgatt tttacaacct ttggaatact tttactaggc tccaaagttt agaaaatgta 21420
gtgtataacc tggtcaacgc tggacacttt gatggccggg cgggtgaact gccttgtgct 21480
gttataggtg agaaagtcat tgccaagatt caaaatgagg atgtcgtggt ctttaaaaat 21540
aacacgccat tccccactaa tgtggctgtc gaattatttg ctaagcgcag tattcggccc 21600
caccccgagc ttaagctctt tagaaatttg aatattgacg tgtgctggag tcacgtcctt 21660
tgggattatg ctaaggatag tgtgttttgc agttcgacgt ataaggtctg caaatacaca 21720
gatttacagt gcattgaaag cttgaatgta ctttttgatg gtcgtgataa tggtgctctt 21780
gaagctttta agaagtgccg gaatggcgtc tacattaaca cgacaaaaat taaaagtctg 21840
tcgatgatta aaggcccaca acgtgccgat ttgaatggcg tagttgtgga gaaagttgga 21900
gattctgatg tggaattttg gtttgctgtg cgtaaagacg gtgacgatgt tatcttcagc 21960
cgtacaggga gccttgaacc gagccattac cggagcccac aaggtaatcc gggtggtaat 22020
cgcgtgggtg atctcagcgg taatgaagct ctagcgcgtg gcactatctt tactcaaagc 22080
agattattat cttctttcac acctcgatca gagatggaga aagattttat ggatttagat 22140
gatgatgtgt tcattgcaaa atatagttta caggactacg cgtttgaaca cgttgtttat 22200
ggtagtttta accagaagat tattggaggt ttgcatttgc ttattggctt agcccgtagg 22260
cagcaaaaat ccaatctggt aattcaagag ttcgtgacat acgactctag cattcattcg 22320
tactttatca ctgacgagaa cagtggtagt agtaagagtg tgtgcactgt tattgattta 22380
ttgttagatg attttgtgga cattgtaaag tccctgaatc taaagtgtgt gagtaaggtt 22440
gttaatgtta atgtggattt taaggacttc cagtttatgt tgtggtgcaa tgaggagaag 22500
gtcatgactt tctatcctcg tttgcaggct gctgctgact ggaaacctgg ttatgttatg 22560
cctgtcttat ataagtattt ggaatcgcct ctggaaagag taaacctctg gaattatggc 22620
aagccgatta ctttacctac aggatgtatg atgaatgttg ctaagtatac tcaattatgt 22680
caatatttga gcactacaac attagcagtt ccggctaata tgcgtgtctt acaccttggt 22740
gccggttcgg ataagggtgt tgcccctggg tctgcagttc ttaggcagtg gctaccagcg 22800
ggaagtattc ttgtagataa tgatgtgaat ccatttgtga gtgacagtgt cgcctcatat 22860
tatggaaatt gtataacctt accctttgat tgtcagtggg atctgataat ttctgatatg 22920
tacgaccctc ttactaagaa cattggggag tacaacgtga gtaaagatgg attctttact 22980
tacctctgtc atttaattcg tgacaagttg gctctgggtg gcagtgttgc cataaaaata 23040
acagagtttt cttggaacgc tgagttatat agtttaatgg ggaagtttgc gttctggaca 23100
atcttttgca ccaacgtaaa cgcctcttca agtgaaggat ttttgattgg cataaattgg 23160
ttgaataaga cccgtaccga aattgacggt aaaaccatgc atgccaatta tctgttttgg 23220
agaaatagta caatgtggaa tggaggggct tacagtctct ttgacatgag taagttccct 23280
ttgaaagcgg ctggtacggc tgttgttagc cttaaaccag accaaataaa tgacttagtc 23340
ctctccttga ttgagaaggg caagttatta gtgcgtgata cacgcaaaga agtttttgtt 23400
ggcgatagcc tagtaaatgt caaataaatc tatacttgtc gtggctgtga aaatggcctt 23460
tgctgacaag cctaatcatt tcataaactt tcccctggcc caatttagtg gctttatggg 23520
taagtattta aagctacagt ctcaacttgt ggaaatgggt ttagactgta aattacagaa 23580
ggcaccacat gttagtatta ccctgcttga tattaaagca gaccaataca aacaggtgga 23640
atttgcaata caagaaataa tagatgatct ggcggcatat gagggagata ttgtctttga 23700
caaccctcac atgcttggca gatgccttgt tcttgatgtt agaggatttg aagagttgca 23760
tgaagatatt gttgaaattc tccgcagaag gggttgcacg gcagatcaat ccagacactg 23820
gattccgcac tgcactgtgg cccaatttga cgaagaaaga gaaacaaaag gaatgcaatt 23880
ctatcataaa gaacccttct acctcaagca taacaaccta ttaacggatg ctgggcttga 23940
gctcgtgaag ataggttctt ccaaaataga tgggttttat tgtagtgaac tgagtgtttg 24000
gtgtggtgag aggctttgtt ataagcctcc aacacccaaa ttcagtgata tatttggcta 24060
ttgctgcata gataaaatac gtggtgattt agaaataggc gacctgccgc aggatgatga 24120
ggaagcgtgg gccgagctaa gttaccacta tcaaagaaac acctacttct tcagacatgt 24180
gcacgataat agcatctatt ttcgtaccgt gtgtagaatg aagggttgta tgtgttgatt 24240
tgtttttaca ctattagtgt aataagctta ttattttgtt gaaaagggca ggatgtgcat 24300
agctatggct cctcgcacac tgcttttgct gatttgatgt cagctggtgt ttgggttcaa 24360
tgaacctctt aacatcgttt cacatttaaa tgatgactgg tttctatttg gtgacagtcg 24420
gtccgactgt acctatgtag aaaataacgg tcatcctaaa ttagattggc ttgacctcga 24480
cccaaagttg tgtaattcag gaaagatttc cgcaaagagt ggtaactctc tctttaggag 24540
ttttcacttc actgattttt acaattatac gggtgaggga taccaaattg tattttatga 24600
aggagttaat tttagtccca gccatggctt taaatgcctg gctcatggag ataataaaag 24660
atggatgggc aataaagctc gattttatgc ccgagtgtat gagaagatgg cccaatatag 24720
gagcctatcg tttgttaatg tgtcttatgc ctatggaggt aatgcaaagc ccgcctccat 24780
ttgcaaagac aatactttaa cactcaataa ccccaccttc atatcgaagg agtctaatta 24840
tgttgattac tactacgaga gtgaggctaa tttcacacta gaaggttgtg atgaatttat 24900
agtaccgctc tgtggtttta atggccattc caagggctcg tcgtcggatg ctgccaataa 24960
atattatact gactctcaga gttactataa tatggatatt ggtgtcttat atgggttcaa 25020
ttcgaccttg gatgttggca acactgctaa ggatccgggt cttgatctca cttgtaggta 25080
tcttgcattg actcctggta attataaggc tgtgtcctta gaatatttgt taagcttacc 25140
ctcaaaggct atttgcctcc ataagacaaa gcgctttatg cctgtgcagg tagttgactc 25200
aaggtggagt agcatccgcc agtcagacaa tatgaccgct gcagcctgtc agctgccata 25260
ttgtttcttt cgcaacacat ctgcgaatta tagtggtggc acacatgatg cgcaccatgg 25320
tgattttcat ttcaggcagt tattgtctgg tttgttatat aatgtttcct gtattgccca 25380
gcagggtgca tttctttata ataatgtgtc gtcctcttgg ccagcctatg ggtacggtca 25440
ttgtccaacg gcagctaaca ttggttatat ggcacctgtt tgtatctatg accctctccc 25500
ggtcatactg ctaggtgtgt tattgggtat agctgtgttg actattgtgt ttctgatgtt 25560
ttattttatg acggatagcg gtgttagatt gcatgaggca taatctaaac atgtttgttt 25620
ttcttgtttt attgccacta gtctctagtc agtgtgttaa tcttacaacc agaactcaat 25680
taccccctgc atacactaat tctttcacac gtggtgttta ttaccctgac aaagttttca 25740
gatcctcagt tttacattca actcaggact tgttcttacc tttcttttcc aatgttactt 25800
ggttccatgc tatacatgtc tctgggacca atggtactaa gaggtttgat aaccctgtcc 25860
taccatttaa tgatggtgtt tactttgctt ccactgagaa gtctaacata ataagaggct 25920
ggatttttgg tactacttta gattcgaaaa cccagtccct acttattgtt aataacgcta 25980
ctaatgttgt tatcaaagtc tgtgaatttc aattttgtaa cgatccattt ttgggtgttt 26040
attaccacaa aaacaacaaa agttggatgg aaagtgagtt cagagtttat tctagtgcga 26100
ataattgcac ttttgaatac gtctctcagc cttttcttat ggaccttgaa ggaaaacagg 26160
gtaatttcaa aaatcttagg gaatttgtgt tcaagaatat tgatggttac ttcaagatat 26220
actctaagca cacgcctatt aatttagtgc gtgatctccc tcagggtttt tcggctttag 26280
aaccattggt agatttgcca ataggtatta acatcactag gtttcaaact ttacttgctt 26340
tacatagaag ttatttaact cctggtgatt cttcttcagg ttggacagct ggtgctgcag 26400
cttattatgt gggttatctt caacctagga cttttctact gaagtacaat gaaaatggaa 26460
ccattacaga tgctgtagac tgtgcacttg accctctctc agaaacaaag tgtacgttga 26520
aatccttcac tgtagaaaaa ggaatctatc aaacttctaa ctttagagtc caaccaacag 26580
aatctattgt tagatttcct aacatcacaa acttgtgccc ttttggtgaa gtttttaacg 26640
ccaccagatt tgcatctgtt tatgcttgga acaggaagag aatcagcaac tgtgttgctg 26700
attattctgt cctgtataat tccgcatcat tttccacttt taagtgttat ggagtgtctc 26760
ctactaaatt aaatgatctc tgctttacta atgtctatgc agattcattt gtaattagag 26820
gtgatgaagt cagacaaatc gctccagggc aaactggaaa gattgctgat tataactaca 26880
aattaccaga tgattttaca ggctgcgtta tagcttggaa ttctaacaat cttgattcta 26940
aggttggtgg taattataat tacctgtaca gattgtttag gaagtctaat ctcaaacctt 27000
ttgagagaga tatttcaact gaaatctatc aggccggtag cacaccttgt aatggtgttg 27060
aaggttttaa ttgttacttt cctctgcaat catatggttt ccaacccact aatggtgttg 27120
gttaccaacc atacagagta gtagtacttt cttttgaact tctacatgca ccagcaactg 27180
tttgtggacc taaaaagtct actaatttgg ttaagaacaa gtgtgtcaat ttcaacttca 27240
atggtttaac aggcacaggt gttcttactg agtctaacaa aaagtttctg cctttccaac 27300
aatttggcag agacattgct gacactactg atgctgttcg tgatccacaa acacttgaga 27360
ttcttgacat tacaccatgt tcttttggtg gtgtcagtgt tataacacca ggaacaaata 27420
cttctaacca ggttgctgtt ctttatcagg atgttaactg cacagaagtc cctgttgcta 27480
ttcatgcaga tcaacttact cctacttggc gtgtttattc tacaggttct aatgtttttc 27540
aaacacgtgc aggctgttta ataggggctg aacatgtcaa caactcatat gagtgtgaca 27600
tacccattgg tgcaggtata tgcgctagtt atcagactca gactaattct cctcggagag 27660
caagaagtgt agctagtcaa tccatcattg cctacactat gtcacttggt gcagaaaatt 27720
cagttgctta ctctaataac tctattgcca tacccacaaa ttttactatt agcgttacca 27780
cagaaattct accagtgtct atgaccaaga catcagtaga ttgtacaatg tacatttgtg 27840
gtgattcaac tgaatgcagc aatcttttgt tgcaatatgg cagtttttgt acacaattaa 27900
accgtgcttt aactggaata gctgttgaac aagacaaaaa cacccaagaa gtttttgcac 27960
aagtcaaaca aatttacaag acaccaccaa ttaaagattt tggcggtttt aattttagcc 28020
agatactgcc agatccatca aaaccaagca agaggtcatt tattgaagat ctactgttca 28080
acaaagtgac acttgcagat gctggcttca tcaaacaata tggtgattgc cttggtgata 28140
ttgctgctag agacctcatt tgtgcacaaa agtttaacgg ccttactgtt ttgccacctt 28200
tgctcacaga tgaaatgatt gctcaataca cttctgcact gttagcaggt acaatcactt 28260
ctggttggac ttttggtgca ggtgctgcat tacaaatacc atttgctatg caaatggctt 28320
ataggtttaa tggtattgga gttacacaga atgttctcta tgagaaccaa aaattgattg 28380
ccaaccaatt taatagtgct attggcaaaa ttcaagactc actttcttcc acagcaagtg 28440
cacttggaaa acttcaagat gtggtcaacc aaaatgcaca agctttaaac acgcttgtta 28500
aacaacttag ctccaatttt ggtgcaattt caagtgtttt aaacgacatc ctttcacgtc 28560
ttgacaaagt tgaggctgaa gtgcaaattg ataggttgat cacaggcaga cttcaaagtt 28620
tgcagacata tgtgactcaa caattaatta gagctgcaga aatcagagct tctgctaatc 28680
ttgctgctac taaaatgtca gagtgtgtac ttggacaatc aaaaagagtt gacttttgcg 28740
gaaagggcta tcatcttatg tcatttcctc agtcagcacc tcatggtgtc gtctttttgc 28800
atgtgactta tgtccctgca caagaaaaga acttcacaac tgctcctgcc atttgtcatg 28860
atggaaaagc acactttcct cgtgaaggtg tctttgtttc aaatggcaca cactggtttg 28920
taacacaaag gaatttttat gaaccacaaa tcattactac agacaacaca tttgtgtctg 28980
gtaactgtga tgttgtaata ggaattgtca acaacacagt ttatgatcct ttgcaacctg 29040
aattagactc attcaaggag gagcttgata aatacttcaa gaaccatacc tcaccagatg 29100
ttgatttagg tgacatctct ggcattaatg cttcagttgt aaacattcag aaagaaatcg 29160
accgcctcaa tgaggttgcc aagaatttaa atgaatctct catcgatctc caagaacttg 29220
gaaagtatga gcagtatata aaatggccat ggtacatttg gctaggtttt atagctggct 29280
tgattgccat agtaatggtg acaattatgc tttgctgtat gaccagttgc tgtagttgtc 29340
tcaagggctg ttgttcttgt ggatcctgct gcaaatttga cgaggacgac tctgagccag 29400
tgctcaaagg agtcaaatta cattacacat aactatcaca gcctctcctg gaaagacaga 29460
aaatctaaac aatttatagc attctcattg ctacctggcc ccgtaagagg cagtcatagc 29520
tatggccgtg ttggtcctaa ggctacattg gctgctgtct ttattggtcc atttattgta 29580
gcatgtatgc taggcattgg cctagtttat ttattgcaat tgcaagttca aatttttcat 29640
gttaaggata ccatacgtgt gactggcaag ccagccactg tgtcttatac tacaagtaca 29700
ccagtaacac cgagcgcgac gacgctcgat ggtactacgt atactttaat tagacccact 29760
agctcttata caagagttta tcttggtact ccaagaggtt ttgattatag tacatttggg 29820
cctaagaccc tagattatgt tactaatcta aacctcatct taattctggt cgtccatata 29880
cttttaaggc attgtccagg catatgaggc caacagccac atggatttgg catgtgagtg 29940
atgcatggtt acgccgcacg cgggactttg gtgtcattcg cctagaagat ttttgttttc 30000
aatttaatta tagccaaccc cgagttggtt attgtagagt tcctttaaag gcttggtgta 30060
gcaaccaggg taaatttgca gcgcagttta ccctaaaaag ttgcgaaaaa ccaggtcacg 30120
aaaaatttat tactagcttc acggcctacg gcagaactgt ccaacaggcc gttagcaagt 30180
tagtagaaga agctgttgat tttattcttt ttagggccac gcagctcgaa agaaatgttt 30240
aatttattcc ttacagacac agtatggtat gtggggcaga ttatttttat attcgcagtg 30300
tgtttgatgg tcaccataat tgtggttgcc ttccttgcgt ctatcaaact ttgtattcaa 30360
ctttgcggtt tatgtaatac tttggtgctg tccccttcta tttatttgta tgataggagt 30420
aagcagcttt ataagtacta taatgaagaa atgagactgc ccctattaga ggtggatgat 30480
atctaatcca aacattatga gtagtactac tcaggcccca gagcccgtct atcaatggac 30540
cgccgacgag gcagttcaat tccttaagga atggaacttc tcgttgggca ttatactact 30600
ctttattact atcatactac agttcggtta cacgagccgt agcatgttta tttatgttgt 30660
gaaaatgata atcttgtggt taatgtggcc actgactatt gttttgtgta ttttcaattg 30720
cgtgtatgcg ctaaataatg tgtatcttgg attttctata gtgtttacta tagtgtccat 30780
tgtaatctgg atcatgtatt ttgtgaacag cataaggttg tttatcagga ctggtagctg 30840
gtggagcttc aaccccgaaa caaacaacct tatgtgtata gatatgaaag gtaccgtgta 30900
tgttagaccc attattgagg attaccatac actaacagcc actattattc gtggccacct 30960
ctacatgcaa ggtgttaagc taggcaccgg tttctctttg tctgacttgc ccgcttatgt 31020
tacagttgct aaggtgtcac acctttgcac ttataagcgc gcattcttag acaaggtaga 31080
cggtgttagc ggttttgctg tttatgtgaa gtccaaggtc ggaaattacc gactgccctc 31140
aaacaaaccg agtggcgcgg acaccgcatt gttgagaacc taatctaaac tttaaggaga 31200
gaatgaatcc tatgtcggcg ctcggtggta acccctcgcg agaaagtcgg gataggacac 31260
tctctatcag aatggatgtc ttgctgtcat aacagataga gaaggttgtg gcagaccctg 31320
tatcaattag ttgaaagaga ttgcaaaata gagaatgtgt gagagaagtt agcaaggtcc 31380
tacgtctaac cataagaacg gcgataggcg ccccctggga acagctcaca tcagggtact 31440
attcctgcaa tgccctagta aatgaatgaa gttgatcatg gccaattgga agaatcacaa 31500
aaaaaaaaaa aaaaacggcc ggtttaaacg ctacagtcca agttccaagc gggatactag 31560
atgtataatg tccgccatgc agacgaaacc agtcggagat taccgagcat tctatcacgt 31620
cggcgaccaa tagtgagctt agggataaca gggtaataaa cgatccccgg gaattcactg 31680
gccgtcgttt tacaacgtcg tgactgggaa aaccctggcg ttacccaact taatcgcctt 31740
gcagcacatc cccctttcgc cagctggcgt aatagcgaag aggcccgcac cgatcgccct 31800
tcccaacagt tgcgcagcct gaatggcgaa tggcgataga tccggtggat gaccttttga 31860
atgaccttta atagattata ttactaatta attggggacc ctagaggtcc ccttttttat 31920
tttaaaaatt ttttcacaaa acggtttaca agcataaagc tcggacggat cttttccgct 31980
gcataaccct gcttcggggt cattatagcg attttttcgg tatatccatc ctttttcgca 32040
cgatatacag gattttgcca aagggttcgt gtagactttc cttggtgtat ccaacggcgt 32100
cagccgggca ggataggtga agtaggccca cccgcgagcg ggtgttcctt cttcactgtc 32160
ccttattcgc acctggcggt gctcaacggg aatcctgctc tgcgaggctg gccggctacc 32220
gccggcgtaa cagatgaggg caagcggatg gctgatgaaa ccaagccaac caggaagggc 32280
agcccaccta tcaaggtgtc gatgcagggg ggggggaaag ccacgttgtg tctcaaaatc 32340
tctgatgtta cattgcacaa gataaaaata tatcatcatg aacaataaaa ctgtctgctt 32400
acataaacag taatacaagg ggtgttatga gccatattca acgggaaacg tcttgctcaa 32460
ggccgcgatt aaattccaac atggatgctg atttatatgg gtataaatgg gctcgcgata 32520
atgtcgggca atcaggtgcg acaatctatc gattgtatgg gaagcccgat gcgccagagt 32580
tgtttctgaa acatggcaaa ggtagcgttg ccaatgatgt tacagatgag atggtcagac 32640
taaactggct gacggaattt atgcctcttc cgaccatcaa gcattttatc cgtactcctg 32700
atgatgcatg gttactcacc actgcgatcc ccggaaaaac agcattccag gtattagaag 32760
aatatcctga ttcaggtgaa aatattgttg atgcgctggc agtgttcctg cgccggttgc 32820
attcgattcc tgtttgtaat tgtcctttta acagcgatcg cgtatttcgt ctcgctcagg 32880
cgcaatcacg aatgaataac ggtttggttg atgcgagtga ttttgatgac gagcgtaatg 32940
gctggcctgt tgaacaagtc tggaaagaaa tgcataagtt tttgccattc tcaccggatt 33000
cagtcgtcac tcatggtgat ttctcacttg ataaccttat ttttgacgag gggaaattaa 33060
taggttgtat tgatgttgga cgagtcggaa tcgcagaccg ataccaggat cttgccatcc 33120
tatggaactg cctcggtgag ttttctcctt cattacagaa acggcttttt caaaaatatg 33180
gtattgataa tcctgatatg aataaattgc agtttcattt gatgctcgat gagtttttct 33240
aatcagaatt ggttaattgg ttgtaacact ggcagagcat tacgctgact tgacgggacg 33300
gcggctttgt tgaataaatc gaacttttgc tgagttgaag gatcagatca cgcatcttcc 33360
cgacaacgca gaccgttccg tggcaaagca aaagttcaaa atcaccaact ggtccaccta 33420
caacaaagct ctcatcaacc gtggctccct cactttctgg ctggatgatg gggcgattca 33480
ggcctggtat gagtcagcaa caccttcttc acgaggcaga cctcagacgg tatcggatcg 33540
atcccccgat gtgtagcagt ggcggaccat ataggcagat cagaaggcgc ggttctccta 33600
catgagcttt tcaattcaat tcatcatttt ttttttattc ttttttttga tttcggtttc 33660
cttgaaattt ttttgattcg gtaatctccg aacagaagga agaacgaagg aaggagcaca 33720
gacttagatt ggtatatata cgcatatgta gtgttgaaga aacatgaaat tgcccagtat 33780
tcttaaccca actgcacaga acaaaaacct gcaggaaacg aagataaatc atgtcgaaag 33840
ctacatataa ggaacgtgct gctactcatc ctagtcctgt tgctgccaag ctatttaata 33900
tcatgcacga aaagcaaaca aacttgtgtg cttcattgga tgttcgtacc accaaggaat 33960
tactggagtt agttgaagca ttaggtccca aaatttgttt actaaaaaca catgtggata 34020
tcttgactga tttttccatg gagggcacag ttaagccgct aaaggcatta tccgccaagt 34080
acaatttttt actcttcgaa gacagaaaat ttgctgacat tggtaataca gtcaaattgc 34140
agtactctgc gggtgtatac agaatagcag aatgggcaga cattacgaat gcacacggtg 34200
tggtgggccc aggtattgtt agcggtttga agcaggcggc agaagaagta acaaaggaac 34260
ctagaggcct tttgatgtta gcagaattgt catgcaaggg ctccctatct actggagaat 34320
atactaaggg tactgttgac attgcgaaga gcgacaaaga ttttgttatc ggctttattg 34380
ctcaaagaga catgggtgga agagatgaag gttacgattg gttgattatg acacccggtg 34440
tgggtttaga tgacaaggga gacgcattgg gtcaacagta tagaaccgtg gatgatgtgg 34500
tctctacagg atctgacatt attattgttg gaagaggact atttgcaaag ggaagggatg 34560
ctaaggtaga gggtgaacgt tacagaaaag caggctggga agcatatttg agaagatgcg 34620
gccagcaaaa ctaaaaaact gtattataag taaatgcatg tatactaaac tcacaaatta 34680
gagcttcaat ttaattatat cagttattac ccgggaatct cggtcgtaat gatttttata 34740
atgacgaaaa aaaaaaaatt ggaaagaaaa agctgggcgc gccggccggc ccttttcatc 34800
acgtgctata aaaataatta taatttaaat tttttaatat aaatatataa attaaaaata 34860
gaaagtaaaa aaagaaatta aagaaaaaat agtttttgtt ttccgaagat gtaaaagact 34920
ctagggggat cgccaacaaa tactaccttt tatcttgctc ttcctgctct caggtattaa 34980
tgccgaattg tttcatcttg tctgtgtaga agaccacaca cgaaaatcct gtgattttac 35040
attttactta tcgttaatcg aatgtatatc tatttaatct gcttttcttg tctaataaat 35100
atatatgtaa agtacgcttt ttgttgaaat tttttaaacc tttgtttatt tttttttttc 35160
ttcattccgt aactcttcta ccttctttat ttactttcta aaatccaaat acaaaacata 35220
aaaataaata aacacagagt aaattcccaa attattccat cattaaaaga tacgaggcgc 35280
gtgtaagtta caggcaagcg atcggccggc ccgggcattt aaatgcaggc cgcgtacgcg 35340
tcgacggtac cgaattcgct taaacgagct catgttcgcc ggtgaacgcg ttgaggaagc 35400
cgggcagtgc ctcggcaaaa tccttgcgtg tagacaagac atctgcgtag cagttgtcct 35460
caacaacgat gtcgaaatcc aaatcggagt gctcatcgag tcctccgtga acgtaagagc 35520
cgccgatcag aagagcgcgg aagcgaacat cggaagcgac cgcatcgcgg atgcggttca 35580
agaaagttgc atgagcttgt ggaagtgtgc tgagcataaa tgattctcct agctgttctt 35640
tgggtaagta cgccatcagg acgttgtgag tggcgcgatt tttagcggct gaaatcagcc 35700
cttgagcctg tcggcaagtc gcgtcatgag gtccatgcgc tcatgcagga tcgccacgac 35760
caacgcgggt tcgcccgcac gcggcaggca aaaaacgtag tggtgttcgc agcgggccat 35820
ccgcagcgcg ggaaagagtt cgctcatgtc cttaaacggg ccttcgccgg cggcaagcct 35880
ggctatgccc tgttccagct tagcgatata gcggcgcacc tgcgccgcgc cccactcccg 35940
gcgcgtgtag cggatgatgc cgcgtagatc ggcttcggcc tcagccgtga ggatgtaggc 36000
cgtcaagcgc gatccccgct gagttcttca tcaagaattt cgccgacgct cttggtggac 36060
accttgccgg caagcccatc gttgatgcgg ttccccagca tggttttcag ttcctgccat 36120
gcctgatcgg catcagcgtc accggggaac agacgttcga gggcgtattg cttaatggtc 36180
ttgccctgca aggcggccag ggctttcagg ctctggtgct gctggtccgt catgtcgatt 36240
gtcaggcggc tcattggata acctccataa aatacacgta accacattag cacatatgtg 36300
ggcgtgaggc tacagcgcga ggcgcattaa ggtcgggaaa atgcgctagg cgcatttaaa 36360
ttgcgtattg ctgtaatgcg ccatgccggc tagactaggc ccaaatgggt atacccaatt 36420
tgaccaaggg ggacgcgatg agggcggcca agcactaccg acaacttcta tccatcgact 36480
tcaacatcga ggcgctggcc ttcgtgcctg gacccgacgg cacacgcggc cggcgcatcc 36540
acgtcctggg gcgcgaggtc cgcgaccggc ccggcctggt cgagtacctt tcgccggcgt 36600
tcggctcgcg ggtggcgctg gacggctact gcaaggccaa tttcgatgca gtgctgcacc 36660
tggcgtaccc cgatcatcag caatggggcc acgcatgaag cgccgaagct acgccatgct 36720
gcgcgccgct gccgcgctgg ccgtcctggt cgttgcctcg ccggcatggg ccgagctgcg 36780
cggcgaggtc gtgcgcatca tcgacggcga caccatcgac gtgctggtag acaagcagcc 36840
ggtgcgcgtg cgcctggtgg acattgacgc gccggaaaag cggcaagcct tcggcgaacg 36900
tgcgcgccag gcgctggccg gcatggtgtt ccgccggcac gtcctggtcg acgagaagga 36960
caccgaccgt tacggccgca cgctgggcac cgtgtgggtc aacatggagc tggccagccg 37020
gccgccgcag ccgcgcaacg tcaacgccgc gatggttcac cagggcatgg cgtgggccta 37080
tcgcttccac ggccgcgcgg ccgaccctga aatgctgcgg ctcgaacagg aggcgcgagg 37140
caagcgcgtc ggcctctggt ccgatccgca cgccgtcgag ccgtggaaat ggcgacgcga 37200
gagcaacaac cggagggacg aaggttgaag gtcgcccgca tctacctgcg cgccagtacg 37260
gacgagcaga atcttgaacg ccaggagagc cttgtagcgg ccacgcgggc cgccgggtac 37320
tacgtcgccg gcatctaccg cgagaaggcg tccggcgcac gcgccgaccg gcccgagctg 37380
ctgcgcatga tcgcggacct gcaacctggt gaagtcgtcg ttgcggagaa gatcgaccgc 37440
atcagccgct tgccgttggc cgaggccgag cgcctggttg cgtcgatccg ggccaaaggg 37500
gccaagctgg ccgtgcctgg cgtggtggac ctgtcggagc tggccgccga ggcgaacgga 37560
gtggcgaaaa tcgttctgga atccgtccag gacatgcttt tgaagctcgc cttgcagatg 37620
gcccgcgacg actacgagga tcggcgcgag cgtcaacgtc agggtgtcca gttggcgaag 37680
gccgccggcc gctacaccgg ccgcaaacgt gacgccggca tgcacgaccg catcatcacg 37740
cttcgctccg gcggatcgag cattgccaag acggccaagc tggtcggatg cagcccgagc 37800
caggtcaaac gagtgtgggc ggcctggaac gcgcagcagc aaaaataaag ccgggcagtg 37860
cccggctttt ctcacctttt cgcgtcccgc agggccgctg cgagcgccct acctagatcc 37920
tcgctttccc cctcggtgta gtccggccag ggcacgaagg gcgcggatgc gaacctgttg 37980
agcaggtacg ccttcgggca gcggtagacc accggcgagt tcgccttttc atcccaccgg 38040
gccaggatca cgtccgcatc acagtgcatg tccttcacct ggtcgcggaa gaagccgaag 38100
gccaccatgc cgctatgttc gccgaggaac gccagttgct tcgcgctggc gatcgcgccg 38160
acgccgccgg ccaaaaccga cgccatcacc cagccgacga accagaagct ggcatgcttg 38220
cggttgacca ccgcacgcgc agccgcgacc aggacaacgg ccaagctgcc gaccagggcc 38280
atgacgaccg tgatccggcc gttgtggaaa gcgatgggct tgccagcgtc cgcttgcacg 38340
gcgtcgtaaa tgctggaccc gatgggcgcg cacatcagca cgacaggcag cagcaccagg 38400
aacatcgtcc gcgtccattg cgcgagtgcc ttgcggcgtt cgccggcggc aagcgcctcc 38460
atcatcggcg tgaagcccaa cagggccacc gcagccgcca agccggcaac gatgccgcag 38520
gcgattacat acatacatcc tccctaatgc gccttgcgca cggttgtagt cagagtccgc 38580
ggtggggcga taagctcatg accaaaatcc cttaacgtga gttttcgttc cactgagcgt 38640
cagaccccgt agaaaagatc aaaggatctt cttgagatcc tttttttctg cgggggatca 38700
ggaccgctgc cggagcgcaa cccactcact acagcagagc catgtagaca acatcccctc 38760
cccctttcca ccgcgtcaga cgcccgtagc agcccgctac gggctttttc atgccctgcc 38820
ctagcgtcca agcctcacgg ccgcgctcgg cctctctggc ggccttctgg cgctcctgct 38880
gcggcgtccg ctcgtgggcc gtggcgcggg tccgcgcgcc ggcctcgtgc gcctggcgct 38940
cgcgggcgag gtccagggcg gccgtcttca cgttctgcct tgcgcagatg agatagatcg 39000
atctagcgtg gactcaaggc tctcgcgaat ggctcgcgtt ggaaactttc attgacactt 39060
gaggggcacc gcagggaaat tctcgtcctt gcgagaaccg gctatgtcgt gctgcgcatc 39120
gagcctgcgc ccttggcttg tctcgcccct ctccgcgtcg ctacggggct tccagcgcct 39180
ttccgacgct caccgggctg gttgccctcg ccgctgggct ggcggccgtc tatggccctg 39240
caaacgcgcc agaaacgccg tcgaagccgt gtgcgagaca ccgcggccgc cggcgttgtg 39300
gatacctcgc ggaaaacttg gccctcactg acagatgagg ggcggacgtt gacacttgag 39360
gggccgactc acccggcgcg gcgttgacag atgaggggca ggctcgattt cggccggcga 39420
cgtggagctg gccagcctcg caaatcggcg aaaacgcctg attttacgcg agtttcccac 39480
agatgatgtg gacaagcctg gggataagtg ccctgcggta ttgacacttg aggggcgcga 39540
ctactgacag atgaggggcg cgatccttga cacttgaggg gcagagtgct gacagatgag 39600
gggcgcacct attgacattt gaggggctgt ccacaggcag aaaatccagc atttgcaagg 39660
gtttccgccc gtttttcggc caccgctaac ctgtctttta acctgctttt aaaccaatat 39720
ttataaacct tgtttttaac cagggctgcg ccctgtgcgc gtgaccgcgc acgccgaagg 39780
ggggtgcccc cccttctcga accctcccgg cccgctaacg cgggcctccc atccccccag 39840
gggctgcgcc cctcggccgc gaacggcctc accccaaaaa tggcagcgct ggcagtcctt 39900
gccattgccg ggatcggggc agtaacggga tgggcgatca gcccgagcgc gacgcccgga 39960
agcattgacg tgccgcaggt gctggcatcg acattcagcg accaggtgcc gggcagtgag 40020
ggcggcggcc tgggtggcgg cctgcccttc acttcggccg tcggggcatt cacggacttc 40080
atggcggggc cggcaatttt taccttgggc attcttggca tagtggtcgc gggtgccgtg 40140
ctcgtgttcg ggggtgaatt aattccccgg atcgatccgt cagcttcacg ctgccgcaag 40200
cactcagggc gcaagggctg ctaaaggaag cggaacacgt agaaagccag tccgcagaaa 40260
cggtgctgac cccggatgaa tgtcagctac tgggctatct ggacaaggga aaacgcaagc 40320
gcaaagagaa agcaggtagc ttgcagtggg cttacatggc gatagctaga ctgggcggtt 40380
ttatggacag caagcgaacc ggaattgcca gctggggcgc cctctggtaa ggttgggaag 40440
ccctgcaaag taaactggat ggctttcttg ccgccaagga tctgatggcg caggggatca 40500
agatcgacgg atcgatccgg ggaattaatt ccggggcaat cccgcaagga gggtga 40556
<210> 40
<211> 38383
<212> DNA
<213> Artificial Sequence
<220>
<223> pMR10Y_COVAX191_delHEN
<400> 40
atgaatcgga cgtttgaccg gaaggcatac aggcaagaac tgatcgacgc ggggttttcc 60
gccgaggatg ccgaaaccat cgcaagccgc accgtcatgc gtgcgccccg cgaaaccttc 120
cagtccgtcg gctcgatggt ccagcaagct acggccaaga tcgagcgcga cagcgtgcaa 180
ctggctcccc ctgccctgcc cgcgccatcg gccgccgtgg agcgttcgcg tcgtctcgaa 240
caggaggcgg caggtttggc gaagtcgatg accatcgaca cgcgaggaac tatgacgacc 300
aagaagcgaa aaaccgccgg cgaggacctg gcaaaacagg tcagcgaggc caagcaggcc 360
gcgttgctga aacacacgaa gcagcagatc aaggaaatgc agctttcctt gttcgatatt 420
gcgccgtggc cggacacgat gcgagcgatg ccaaacgaca cggcccgctc tgccctgttc 480
accacgcgca acaagaaaat cccgcgcgag gcgctgcaaa acaaggtcat tttccacgtc 540
aacaaggacg tgaagatcac ctacaccggc gtcgagctgc gggccgacga tgacgaactg 600
gtgtggcagc aggtgttgga gtacgcgaag cgcaccccta tcggcgagcc gatcaccttc 660
acgttctacg agctttgcca ggacctgggc tggtcgatca atggccggta ttacacgaag 720
gccgaggaat gcctgtcgcg cctacaggcg acggcgatgg gcttcacgtc cgaccgcgtt 780
gggcacctgg aatcggtgtc gctgctgcac cgcttccgcg tcctggaccg tggcaagaaa 840
acgtcccgtt gccaggtcct gatcgacgag gaaatcgtcg tgctgtttgc tggcgaccac 900
tacacgaaat tcatatggga gaagtaccgc aagctgtcgc cgacggcccg acggatgttc 960
gactatttca gctcgcaccg ggagccgtac ccgctcaagc tggaaacctt ccgcctcatg 1020
tgcggatcgg attccacccg cgtgaagaag tggcgcgagc aggtcggcga agcctgcgaa 1080
gagttgcgag gcagcggcct ggtggaacac gcctgggtca atgatgacct ggtgcattgc 1140
aaacgctagg gccttgtggg gtcagttccg gctgggggtt cagcagccac tcgatcgagg 1200
tcccaatacg caaaccgcct ctccccgcgc gttggccgat tcattaatgc agctggcacg 1260
acaggtttcc cgactggaaa gcgggcagtg agcgcaacgc aattaatgtg agttagctca 1320
ctcattaggc accccaggct ttacacttta tgcttccggc tcgtatgttg tgtggaattg 1380
tgagcggata acaatttcac acaggaaaca gctatgacca tgattacgcc aagcttccat 1440
gggatatcga gatctcctgc agagctctag agtcgagact agtctcgacg ggcccggtac 1500
cccctcgagg gggccgcact taagttacgc gtggatcgtg gagctttcgg gttttaacta 1560
taacggtcct aaggtagcga actcgggtct tgccttaatc ccaacaaccg gattatctac 1620
acggatttca atagctgata tagcgaatca ccgagattaa ttaataatac gactcactat 1680
agtataagag tgattggcgt ccgtacgtac cctctcaact ctaaaactct tgtagtttaa 1740
atctaatcta aactttataa acggcacttc ctgcgtgtcc atgcccgcgg gcctggtctt 1800
gtcatagtgc tgacatttgt agttccttga ctttcgttct ctgccagtga cgtgtccatt 1860
cggcgccagc agcccaccca taggttgcat aatggcaaag atgggcaaat acggcctggg 1920
cttcaaatgg gccccagaat ttccatggat gcttccgaac gcatcggaga agttgggtaa 1980
ccctgagagg tcagaggagg atgggttttg cccctctgct gcgcaagaac cgaaagttaa 2040
aggaaaaact ttggttaatc acgtgagggt gaattgtagc cggcttccag ctttggaatg 2100
ctgtgttcag tctgccataa tccgtgatat ttttgtagat gaggatcccc agaaggtgga 2160
ggcctcaact atgatggcat tgcagttcgg tagtgccgtc ttggttaagc catccaagcg 2220
cttgtctatt caggcatgga ctaatttggg tgtgcttccc aaaacagctg ccatggggtt 2280
gttcaagcgc gtctgcctgt gtaacaccag ggagtgctct tgtgacgccc acgtggcctt 2340
tcaccttttt acggtccaac ccgatggtgt atgcctgggt aatggccgtt ttataggctg 2400
gttcgttcca gtcacagcca taccggagta tgcgaagcag tggttgcaac cctggtccat 2460
ccttcttcgt aagggtggta acaaagggtc tgtgacatcc ggccacttcc gccgcgctgt 2520
taccatgcct gtgtatgact ttaatgtaga ggatgcttgt gaggaggttc atcttaaccc 2580
gaagggtaag tactcctgca aggcgtatgc cctgctgaag ggctatcgcg gtgttaagcc 2640
catcctgttt gtggaccagt atggttgcga ctatactgga tgtctcgcca agggtcttga 2700
ggactatggc gatctcacct tgagtgagat gaaggagttg ttccctgtgt ggcgtgactc 2760
cttggatagt gaagtccttg tggcttggca cgttgatcga gatcctcggg ctgctatgcg 2820
tctgcagact cttgctactg tacgttgcat tgattatgtg ggccaaccga ccgaggatgt 2880
ggtggatgga gatgtggtag tgcgtgagcc tgctcatctt ctcgcagcca atgccattgt 2940
taaaagactc ccccgtttgg tggagactat gctgtatacg gattcgtccg ttacagaatt 3000
ctgttataaa accaagctgt gtgaatgcgg ttttatcacg cagtttggct atgtggattg 3060
ttgtggtgac acctgtgatt ttcgtgggtg ggttgccggc aatatgatgg atggctttcc 3120
atgtccaggg tgtaccaaaa attatatgcc ctgggaattg gaggcccagt catcaggtgt 3180
tataccagaa ggaggtgttc tattcactca gagcactgat acagtgaatc gtgagtcctt 3240
taagctctac ggtcatgctg ttgtgccttt tggttctgct gtgtattgga gcccttgccc 3300
aggtatgtgg cttccagtaa tttggtcgtc ggttaagtca tactctggtt tgacttatac 3360
aggagtagtt ggttgtaagg caattgttca agagacagac gctatatgtc gttctctgta 3420
tatggattat gtccagcaca agtgtggcaa tctcgagcag agagctatcc ttggattgga 3480
cgatgtctat catagacagt tgcttgtgaa taggggtgac tatagtctcc tccttgagaa 3540
tgtggatttg tttgttaagc ggcgcgctga atttgcttgc aaattcgcca cctgtggaga 3600
tggtcttgta cccctcctac tagatggttt agtgccccgc agttattatt tgattaagag 3660
tggtcaagct ttcacctcta tgatggttaa ttttagccat gaggtgactg acatgtgtat 3720
ggacatggct ttattgttca tgcatgatgt taaagtggcc actaagtatg ttaagaaggt 3780
tactggcaaa ctggccgtgc gctttaaagc gttgggtgta gccgttgtca gaaaaattac 3840
tgaatggttt gatttagccg tggacattgc tgctagtgcc gctggatggc tttgctacca 3900
gctggtaaat ggcttatttg cagtggccaa tggtgttata acctttgtac aggaggtgcc 3960
tgagcttgtc aagaattttg ttgacaagtt caaggcattt ttcaaggttt tgatcgactc 4020
tatgtcggtt tctatcttgt ctggacttac tgttgtcaag actgcctcaa atagggtgtg 4080
tcttgctggc agtaaggttt atgaagttgt gcagaaatct ttgtctgcat atgttatgcc 4140
tgtgggttgc agcgaagcca cttgtttggt gggtgagatt gaacctgcag tttttgaaga 4200
tgatgttgtt gatgtggtta aagccccatt aacatatcaa ggctgttgta agccacccac 4260
ttctttcgag aagatttgta ttgtggataa attgtatatg gccaagtgtg gtgatcaatt 4320
ttaccctgtg gttgttgata acgacactgt tggcgtgtta gatcagtgct ggaggtttcc 4380
ctgtgcgggc aagaaagtcg agtttaacga caagcccaaa gtcaggaaga taccctccac 4440
ccgtaagatt aagatcacct tcgcactgga tgcgaccttt gatagtgttc tttcgaaggc 4500
gtgttcagag tttgaagttg ataaagatgt tacattggat gagctgcttg atgttgtgct 4560
tgacgcagtt gagagtacgc tcagcccttg taaggagcat gatgtgatag gcacaaaagt 4620
ttgtgcttta cttgataggt tggcaggaga ttatgtctat ctttttgatg agggaggcga 4680
tgaagtgatc gccccgagga tgtattgttc cttttctgct cctgatgacg aggactgcgt 4740
tgcagcggat gttgtagatg cagatgaaaa ccaagatgat gatgccgagg actcagcagt 4800
ccttgtcgct gatacccaag aagaggacgg cgttgccaag gggcaggttg aggcggattc 4860
ggaaatttgc gttgcgcata ctggtagtca agaagaattg gctgagcctg atgctgtcgg 4920
atctcaaact cccatcgcct ctgctgagga aaccgaagtc ggagaggcaa gcgacaggga 4980
agggattgct gaggcgaagg caactgtgtg tgctgatgct gtagatgcct gccccgatca 5040
agtggaggca tttgaaattg aaaaggtcga ggactctatc ttggatgagc ttcaaactga 5100
acttaatgcg ccagcggaca agacctatga ggatgtcttg gcattcgatg ccgtatgctc 5160
agaggcgttg tctgcattct atgctgtgcc gagtgatgag acgcacttta aagtgtgtgg 5220
attctattcg cctgctatag agcgcactaa ttgttggctg cgttctactt tgatagtaat 5280
gcagagtcta cctttggaat ttaaagactt ggagatgcaa aagctctggt tgtcttacaa 5340
ggccggctat gaccaatgct ttgtggacaa actagttaag agcgtgccca agtctattat 5400
ccttccacaa ggtggttatg tggcagattt tgcctatttc tttctaagcc agtgtagctt 5460
taaagcttat gctaactggc gttgtttaga gtgtgacatg gagttaaagc ttcaaggctt 5520
ggacgccatg tttttctatg gggacgttgt gtctcatatg tgcaagtgtg gtaatagcat 5580
gaccttgttg tctgcagata taccctacac tttgcatttt ggagtgcgag atgataagtt 5640
ttgcgctttt tacacgccaa gaaaggtctt tagggctgct tgtgcggtag atgttaatga 5700
ttgtcactct atggctgtag tagagggcaa gcaaattgat ggtaaagtgg ttaccaaatt 5760
tattggtgac aaatttgatt ttatggtggg ttacgggatg acatttagta tgtctccttt 5820
tgaactcgcc cagttatatg gttcatgtat aacaccaaat gtttgttttg ttaaaggaga 5880
tgttataaag gttgttcgct tagttaatgc tgaagtcatt gttaaccctg ctaatgggcg 5940
tatggctcat ggtgccggcg tcgccggcgc catagctgaa aaggcgggca gtgcttttat 6000
taaagaaacc tccgatatgg tgaaggctca gggcgtttgc caggttggtg aatgctatga 6060
atctgccggt ggtaagttat gtaaaaaggt gcttaacatt gtagggccag atgcgcgagg 6120
gcatggcaag caatgctatt cacttttaga gcgtgcttat cagcatatta ataagtgtga 6180
caatgttgtc actactttaa tttcggctgg tatatttagt gtgcctactg atgtctccct 6240
aacttactta cttggtgtag tgacaaagaa tgtcattctt gtcagtaaca accaggatga 6300
ttttgatgtg atagagaagt gtcaggtgac ctccgttgct ggtaccaaag cgctatcact 6360
tcaattggcc aaaaatttgt gccgtgatgt aaagtttgtg acgaatgcat gtagttcgct 6420
ttttagtgaa tcttgctttg tctcaagcta tgatgtgttg caggaagttg aagcgctgcg 6480
acatgatata caattggatg atgatgctcg tgtctttgtg caggctaata tggactgtct 6540
gcccacagac tggcgtctcg ttaacaaatt tgatagtgtt gatggtgtta gaaccattaa 6600
gtattttgaa tgcccgggcg ggatttttgt atccagccag ggcaaaaagt ttggttatgt 6660
tcagaatggt tcatttaagg aggcgagtgt tagccaaata agggctttac tcgctaataa 6720
ggttgatgtc ttgtgtactg ttgatggtgt taacttccgc tcctgctgcg tagcagaggg 6780
tgaagttttt ggcaagacat taggttcagt cttttgtgat ggcataaatg tcaccaaagt 6840
taggtgtagt gccatttaca agggtaaggt tttctttcag tacagtgatt tgtccgaggc 6900
agatcttgtg gctgttaaag atgcctttgg ttttgatgaa ccacaactgc tgaagtacta 6960
cactatgctt ggcatgtgta agtggccagt agttgtttgt ggcaattatt ttgctttcaa 7020
gcagtcaaat aataattgct acatcaacgt ggcatgttta atgctgcaac acttgagttt 7080
aaagtttcct aagtggcaat ggcaagaggc ttggaacgag ttccgctctg gtaaaccact 7140
aaggtttgtg tccttggtat tagcaaaggg cagctttaaa tttaatgaac cttctgattc 7200
tatcgatttt atgcgtgtgg tgctacgtga agcagatttg agtggtgcca cgtgcaattt 7260
ggaatttgtt tgtaaatgtg gtgtgaagca agagcagcgc aaaggtgttg acgctgttat 7320
gcattttggt acgttggata aaggtgatct tgtcaggggt tataatatcg catgtacgtg 7380
cggtagtaaa cttgtgcatt gcacccaatt taacgtacca tttttaattt gctccaacac 7440
accagagggt aggaaactgc ccgacgatgt tgttgcagct aatattttta ctggtggtag 7500
tgtgggccat tacacgcatg tgaaatgtaa acccaagtac cagctttatg atgcttgtaa 7560
tgttaataag gtttcggagg ctaagggtaa ttttaccgat tgcctctacc ttaaaaattt 7620
aaagcaaacc ttctcgtctg tgctgacgac tttttattta gatgacgtaa agtgtgtgga 7680
gtataagcca gatttatcgc agtattactg tgagtctggt aaatattata caaaacccat 7740
tattaaggcc caatttagaa catttgagaa ggttgatggt gtctatacca actttaaatt 7800
ggtgggacat agtattgctg aaaaactcaa tgctaagctg ggatttgatt gtaattctcc 7860
ctttgtggag tacaaaatta cagagtggcc aacagctact ggagatgtgg tgttggctag 7920
tgatgatttg tatgtaagtc ggtacttaag cgggtgcatt acttttggta aaccggttgt 7980
ctggcttggc catgaggaag catcgctgaa atctctcaca tattttaata gacctagtgt 8040
cgtttgtgaa aataaattta acgtgttgcc cgttgatgtc agtgaaccca cggacaaggg 8100
gcctgtgcct gctgcagtcc ttgttaccgg cgtccctgga gctgatgcgt cagctggtgc 8160
cggtattgcc aaggagcaaa aagcctgtgc ttctgctagt gtggaggatc aggttgttac 8220
ggaggttcgt caagagccat ctgtttcagc tgctgatgtc aaagaggtta aattgaatgg 8280
tgttaaaaag cctgttaagg tggaaggtag tgtggttgtt aatgatccca ctagcgaaac 8340
caaagttgtt aaaagtttgt ctattgttga tgtctatgat atgttcctga cagggtgtaa 8400
gtatgtggtt tggactgcta atgagttgtc tcgactagta aattcaccga ctgttaggga 8460
gtatgtgaag tggggtatgg gaaagattgt aacacccgct aagttgttgt tgttaagaga 8520
tgagaagcaa gagttcgtag cgccaaaagt agtcaaggcg aaagctattg cctgctattg 8580
tgctgtgaag tggtttctcc tctattgttt tagttggata aagtttaata ctgacaataa 8640
ggttatatac accacagaag tagcttcaaa gcttactttc aagttgtgct gtttggcctt 8700
taagaatgcc ttacagacgt ttaattggag cgttgtgtct aggggctttt tcctagttgc 8760
aacggtcttt ttactctggt ttaacttttt gtatgctaat gttattttga gtgacttcta 8820
tttgcctaat attgggcctc tccctacgtt tgtgggacag atagttgcgt ggtttaagac 8880
tacatttggt gtgtcaacca tctgtgattt ctaccaggtg acggatttgg gctatagaag 8940
ttcgttttgt aatggaagta tggtatgtga actatgcttc tcaggttttg atatgctgga 9000
caactatgat gctataaatg ttgttcaaca cgttgtagat aggcgtttgt cctttgacta 9060
tattagccta tttaaactgg tagttgagct tgtaatcggc tactctcttt atactgtgtg 9120
cttctaccca ctgtttgtcc ttattggaat gcagttattg accacatggt tgcctgaatt 9180
ctttatgctg gagactatgc attggagtgc tcgtttgttt gtgtttgttg ccaatatgct 9240
tccagctttt acgttactgc gattttacat cgtggtgaca gctatgtata aggtctattg 9300
tctttgtaga catgttatgt atggatgtag taagcctggt tgcttgtttt gttataagag 9360
aaaccgtagt gtccgtgtta agtgtagcac cgttgttggt ggttcactac gctattacga 9420
tgtaatggct aacggcggca caggtttctg tacaaagcac cagtggaact gtcttaattg 9480
caattcctgg aaaccaggca atacattcat aactcatgaa gcagcggcgg acctctctaa 9540
ggagttgaaa cgccctgtga atccaacaga ttctgcttat tactcggtca cagaggttaa 9600
gcaggttggt tgttccatgc gtttgttcta cgagagagat ggacagcgtg tttatgatga 9660
tgttaatgct agtttgtttg tggacatgaa tggtctgctg cattctaaag ttaaaggtgt 9720
gcctgaaacg catgttgtgg ttgttgagaa tgaagctgat aaagctggtt ttctcggcgc 9780
cgcagtgttt tatgcacaat cgctctacag acctatgttg atggtggaaa agaaattaat 9840
aactaccgcc aacactggtt tgtctgttag tcgaactatg tttgaccttt atgtagattc 9900
attgctgaac gtcctcgacg tggatcgcaa gagtctaaca agttttgtaa atgctgcgca 9960
caactctcta aaggagggtg ttcagcttga acaagttatg gataccttta ttggctgtgc 10020
ccgacgtaag tgtgctatag attctgatgt tgaaaccaag tctattacca agtccgtcat 10080
gtcggcagta aatgctggcg ttgattttac ggatgagagt tgtaataact tggtgcctac 10140
ctatgttaaa agtgacacta tcgttgcagc cgatttgggt gttcttattc agaataatgc 10200
taagcatgta caggctaatg ttgctaaagc cgctaatgtg gcttgcattt ggtctgtgga 10260
tgcttttaac cagctatctg ctgacttaca gcataggctg cgaaaagcat gttcaaaaac 10320
tggcttgaag attaagctta cttataataa gcaggaggca aatgttccta ttttaactac 10380
accgttctct cttaaagggg gcgctgtttt tagtagaatg ttacaatggt tgtttgttgc 10440
taatttgatt tgtttcattg tgttgtgggc ccttatgcca acatatgcag tgcacaaatc 10500
ggatatgcag ttgcctttat atgccagttt taaagttata gataacggtg tgctaaggga 10560
tgtgtctgtt actgacgcat gcttcgcaaa caaatttaat caattcgacc aatggtatga 10620
gtctactttt ggtcttgctt attaccgcaa ctctaaggct tgtcctgttg tggttgctgt 10680
aatagatcaa gacattggcc ataccttatt taatgttcct accacagttt taagatatgg 10740
atttcatgtg ttgcatttta taacccatgc atttgctact gatagcgtgc agtgttacac 10800
gccacatatg caaatcccct atgataattt ctatgctagt ggttgcgtgt tgtcatccct 10860
ctgtactatg cttgcgcatg cagatggaac cccgcatcct tattgttata cagggggtgt 10920
tatgcataat gcctctctgt atagttcttt ggctcctcat gtccgttata acctggctag 10980
ttcaaatggt tatatacgtt ttcccgaagt ggttagtgaa ggcattgtgc gtgttgtgcg 11040
cactcgctct atgacctact gcagggttgg tttatgtgag gaggccgagg agggtatctg 11100
ctttaatttt aatcgttcat gggtattgaa caacccgtat tatagggcca tgcctggaac 11160
tttttgtggt aggaatgctt ttgatttaat acatcaagtt ttaggaggat tagtgcggcc 11220
tattgatttc tttgccttaa cggcgagttc agtggctggt gctatccttg caattattgt 11280
cgttttggct ttctattatt taatcaagct taagcgtgcc tttggtgact acactagtgt 11340
tgtggttatc aatgtaattg tgtggtgtat aaattttctg atgctttttg tgtttcaggt 11400
ttatcccaca ttgtcttgtt tatatgcttg tttctacttc tacaccacgc tttatttccc 11460
ttcggagata agtgttgtta tgcatttgca atggcttgtc atgtatggtg ctattatgcc 11520
cttgtggttt tgcattattt acgtggcagt cgttgtttca aaccatgcat tgtggttgtt 11580
ctcttactgc cgcaaaattg gtaccgaggt tcgtagtgac ggcacatttg aggaaatggc 11640
ccttactacc tttatgatta ctaaagaatc ttattgtaag ttgaaaaact ctgtttctga 11700
tgttgctttt aacaggtact tgagtcttta caacaagtac cgttacttca gtggcaaaat 11760
ggatactgcc gcttatagag aggctgcctg ttcacaactg gcaaaggcaa tggaaacatt 11820
taaccataat aatggtaatg atgttctcta tcagcctcca accgcctctg ttactacatc 11880
atttttacag tctggtatag tgaagatggt gtcgcccacc tctaaagtgg agccttgtat 11940
tgttagtgtt acttatggta acatgacact taatgggttg tggttggatg ataaagttta 12000
ttgcccaaga catgttatct gttcttcagc tgacatgaca gaccctgatt atcctaattt 12060
gctttgtaga gtgacatcaa gtgatttttg tgttatgtct ggtcgtatga gccttactgt 12120
aatgtcttat caaatgcagg gctgccaact tgttttgact gttacactgc aaaatcctaa 12180
cacgcctaag tattccttcg gtgttgttaa gcctggtgag acatttactg tactggctgc 12240
atacaatggc agacctcaag gagccttcca tgttacgctt cgtagtagcc ataccataaa 12300
gggctccttt ctatgtggat cctgcggttc tgtaggatat gttttaactg gcgatagtgt 12360
acgatttgtt tatatgcatc agctagagtt gagtactggt tgtcataccg gtactgactt 12420
tagtgggaac ttttatggtc cctatagaga tgcgcaagtt gtacaattgc ctgttcagga 12480
ttatacgcag actgttaatg ttgtagcttg gctttatgct gctattttta acagatgcaa 12540
ctggtttgtg caaagtgata gttgttccct ggaggagttt aatgtttggg ctatgaccaa 12600
tggttttagc tcaatcaaag ccgatcttgt cttggatgcg cttgcttcta tgacaggcgt 12660
tacagttgaa caggtgttgg ccgctattaa gaggctgcat tctggattcc agggcaaaca 12720
aattttaggt agttgtgtgc ttgaagatga gctgacacca agtgatgttt atcaacaact 12780
agctggtgtc aagctacagt caaagcgcac aagagttata aaaggtacat gttgctggat 12840
attggcttca acgtttttgt tctgtagcat tatctcagca tttgtaaaat ggactatgtt 12900
tatgtatgtt actacccata tgttgggagt gacattgtgt gcactttgtt ttgtaagctt 12960
tgctatgttg ttgatcaagc ataagcattt gtatttaact atgtacatca tgcctgtgtt 13020
atgcacactg ttttacacca actatttggt tgtgtacaaa cagagtttta gaggtctagc 13080
ttatgcttgg ctttcacact ttgtccctgc tgtagattat acatatatgg atgaagtttt 13140
atatggtgtt gtgttgctag tagctatggt gtttgttacc atgcgtagca taaaccacga 13200
cgtcttttct attatgttct tggttggtag acttgtcagc ctggtatcca tgtggtattt 13260
tggagccaat ttagaggaag aggtactatt gttcctcaca tccctatttg gcacgtacac 13320
atggactact atgttgtcat tggctaccgc taaggttatt gctaaatggt tggctgtgaa 13380
tgtcttgtac ttcacagacg taccgcaaat taaattagtt ctgttgagct acttgtgtat 13440
tggttatgtg tgttgttgtt attggggaat cttgtcactc cttaatagca tttttaggat 13500
gccattgggc gtctacaatt ataaaatctc cgttcaggag ttacgttata tgaatgctaa 13560
tggcttgcgc ccacctagaa atagttttga ggccctgatg cttaatttta agctgttggg 13620
aattggtggt gtgccagtca ttgaagtatc tcaaattcaa tcaagattga cggatgttaa 13680
atgtgctaat gttgtgttgc ttaattgcct ccagcacttg catattgcat ctaattctaa 13740
gttgtggcag tattgtagta ctttgcacaa tgaaatactg gctacatctg atttgagcgt 13800
ggccttcgat aagttggctc aactcttagt tgttttattt gctaatccag cagcagtgga 13860
tagcaagtgc cttgcaagta ttgaagaagt gagcgatgat tacgttcgcg acaatactgt 13920
cttgcaagcc ttacagagtg aatttgttaa tatggctagc ttcgttgagt atgaacttgc 13980
taagaagaat ctagatgagg ctaaggctag cggctctgcc aatcaacagc agattaagca 14040
gctagagaag gcgtgtaata ttgctaagtc agcatatgag cgcgacagag ctgttgctcg 14100
taagctggaa cgtatggctg atttagctct tacaaacatg tataaagaag ctagaattaa 14160
tgataagaag agtaaggtag tgtctgcatt gcaaaccatg ctctttagta tggtgcgtaa 14220
gctagataac caagctctta attctatttt agacaacgca gttaagggtt gtgtaccttt 14280
gaatgcaata ccatcattga cttcgaacac tctgactata atagtgccag ataagcaggt 14340
ttttgatcag gttgtggata atgtgtatgt cacctatgct gggaatgtat ggcatataca 14400
gtttattcaa gatgctgatg gtgctgttaa acaattgaat gagatagatg ttaattcaac 14460
ctggcctcta gtcattgctg caaataggca taatgaagtg tctactgttg ttttgcagaa 14520
caatgagttg atgcctcaga agttgagaac tcaggttgtc aatagtggct cagatatgaa 14580
ttgtaatact cctacccagt gttactataa tactactggc acgggtaaga ttgtgtatgc 14640
tatacttagt gactgtgacg gcctgaagta cactaagata gtaaaagaag atggaaattg 14700
tgttgttttg gaattggatc ctccctgtaa gttttctgtt caggatgtga agggccttaa 14760
aattaagtac ctttactttg tgaaggggtg taatacactg gctagaggct gggttgtagg 14820
caccttatcc tcgacagtga gattgcaggc gggtacggca actgagtatg cctccaactc 14880
tgcaatactg tcgctgtgtg cgttttctgt agatcctaag aaaacgtact tggattatat 14940
aaaacagggt ggagttcccg ttactaattg tgttaagatg ttatgtgacc atgctggcac 15000
tggtatggcc attactatta agccggaggc aaccactaat caggattctt atggtggtgc 15060
ttccgtttgt atatattgcc gctcgcgtgt tgaacatcca gatgttgatg gattgtgcaa 15120
attacgcggc aagtttgtcc aagtgccctt aggcataaaa gatcctgtgt catatgtgtt 15180
gacgcatgat gtttgtcagg tttgtggctt ttggcgagat ggtagctgtt cctgtgtagg 15240
cacaggctcc cagtttcagt caaaagacac gaacttttta aacggattcg gggtacaagt 15300
gtaaatgccc gtcttgtacc ctgtgccagt ggcttggaca ctgatgttca attaagggca 15360
tttgacattt gtaatgctaa tcgagctggc attggtttgt attataaagt gaattgctgc 15420
cgcttccagc gtgtagatga ggacggcaac aagttggata agttctttgt tgttaaaaga 15480
actaatttag aagtgtataa caaggagaaa gaatgctatg agttgacaaa agaatgcggt 15540
gttgtggctg aacacgagtt cttcacattt gatgtggagg gaagtcgggt accacacata 15600
gtccgtaaag atctttcaaa gtttactatg ttagatcttt gctatgcatt gcgtcatttt 15660
gaccgcaatg attgttcaac tcttaaggaa attctcctta catatgctga gtgtgaagag 15720
tcctacttcc aaaagaagga ctggtatgat tttgttgaga atcctgatat aattaatgtg 15780
tacaagaagc ttggtcctat atttaataga gccctgctta acactgccaa gtttgcagac 15840
gcattagtgg aggcaggctt agtaggtgtt ttaacacttg ataatcaaga tttatatggt 15900
caatggtatg actttggaga ttttgtcaag acagtacctg gttgtggtgt tgccgtggca 15960
gactcttatt attcatatat gatgccaatg ctgactatgt gtcatgcgtt ggatagtgag 16020
ttgtttgtta atggtactta tagggagttt gaccttgttc agtatgattt tactgatttc 16080
aagctagagc tgttcactaa gtattttaag cattggagta tgacctacca cccgaacacc 16140
tgtgagtgcg aggatgacag gtgcattatt cattgcgcca attttaatat acttttcagc 16200
atggtcttac ctaagacctg ttttgggcct cttgttaggc agatatttgt ggatggtgtt 16260
cctttcgttg tgtcgatcgg ttaccattat aaagaattag gtgttgttat gaatatggat 16320
gtggatacac atcgttatcg cttgtctctt aaggacttgc ttttgtatgc tgcagaccct 16380
gcccttcatg tggcgtctgc tagtgcactg cttgatttgc gcacatgttg ttttagcgtt 16440
gcagctatta caagtggcgt aaaatttcaa acagttaaac ctggaaattt taatcaggat 16500
ttctacgagt ttattttgag taaaggcctg cttaaagagg ggagctccgt tgatttgaag 16560
cacttcttct ttacgcagga tggtaatgct gctattactg attacaatta ctacaagtat 16620
aatctaccca ccatggtgga tattaagcag ttgttgtttg ttttagaagt tgttaataag 16680
tacttcgaga tctatgaggg tgggtgtata cccgcaacac aggtcattgt taataattat 16740
gacaagagtg ctggctatcc atttaataaa tttggaaagg ccaggctcta ttatgaggca 16800
ttatcatttg aggagcagga tgaaatttat gcgtatacca aacgcaatgt cctgccgacc 16860
ctaactcaaa tgaatcttaa atatgctatt agtgctaaga atagggcccg caccgttgct 16920
ggtgtctcta ttctcagtac tatgactggc agaatgtttc atcaaaagtg tctaaagagt 16980
atagcagcta ctcgcggtgt tcctgtagtt ataggcacca cgaagttcta tggcggttgg 17040
gatgatatgt tacgccgcct tattaaagat gttgatagtc ctgtactcat gggttgggac 17100
tatcctaaat gtgatcgtgc tatgccaaac atactgcgta ttgttagtag tttggtgcta 17160
gcccgtaaac atgattcgtg ctgttcgcat acggatagat tctatcgtct tgcgaacgag 17220
tgcgcccaag ttttgagtga aattgttatg tgtggtggtt gttattatgt taaaccaggt 17280
ggcactagta gtggggatgc aaccactgct tttgctaatt ctgtgtttaa catttgtcaa 17340
gctgtttccg ccaatgtatg ctcgcttatg gcatgcaatg gacacaaaat tgaagatttg 17400
agtatacgcg agttacaaaa gcgcctatac tctaatgtct atcgtgcgga ccatgttgac 17460
cccgcatttg ttagtgagta ttatgagttt ttaaacaagc attttagtat gatgattttg 17520
agtgatgatg gtgttgtgtg ttataattca gagtttgcgt ccaagggtta tattgctaat 17580
ataagtgcct ttcaacaggt attatattat caaaacaacg tgtttatgtc tgaggccaaa 17640
tgttgggtag aaacagacat cgaaaaggga ccgcatgaat tttgttctca acatacaatg 17700
ctagtcaaga tggatggtga tgaagtctac cttccatacc ctgatccttc gagaatctta 17760
ggagcaggct gttttgttga tgatttactc aagactgata gcgttctctt gatagagcgt 17820
ttcgtaagtc ttgcaattga tgcttatcct ttagtatacc atgagaaccc agagtatcaa 17880
aatgtgttcc gggtatattt agaatacatc aagaagctgt acaatgatct cggtaatcag 17940
atcctggaca gctacagtgt tattttaagt acttgtgatg gtcaaaagtt tactgacgag 18000
acgttttaca agaacatgta tttaagaagt gcagtgctgc aaagcgttgg tgcctgcgtt 18060
gtctgtagtt ctcaaacatc attacgttgt ggcagttgca tacgcaagcc tttgctgtgt 18120
tgcaaatgcg cctatgatca tgttatgtcc actgatcata aatatgtcct gagtgtgtca 18180
ccatatgtgt gtaattcacc gggatgtgat gtaaatgatg ttaccaaatt gtatttaggt 18240
ggtatgtcat attattgtga ggaccataaa ccacagtatt cattcaaatt ggtgatgaat 18300
ggtatggttt ttggtttata taagcagtct tgtactggtt cgccctacat agaggatttt 18360
aataaaatcg ctagttgcaa atggacagaa gtcgatgatt atgtgctagc taatgaatgc 18420
accgaacgcc ttaaattgtt tgccgcagaa acgcagaagg ccacagaaga ggcctttaag 18480
caatgttatg cgtcagcaac gatccgtgag atcgtgagcg atcgggagtt aattttatct 18540
tgggaaattg gtaaagtccg cccgccactt aataaaaatt acgtgttcac cggctaccat 18600
tttactaata atggtaagac agttttaggt gagtatgttt ttgataagag tgagttgact 18660
aatggtgtgt attatcgcgc cacaaccact tataagttat ctgtaggtga tgtgttcatt 18720
ttaacatcac acgcagtgtc tagtttaagt gctcctacat tagtaccgca ggagaattat 18780
actagcattc gttttgctag tgtttatagt gtgcctgaga cgtttcagaa taatgtgcct 18840
aattatcagc acattggaat gaagcgctat tgtactgtac agggaccgcc tggtactggt 18900
aagtcccatc tagccattgg gctagctgtt tattattgta cagcgcgcgt ggtgtatacc 18960
gctgctagcc atgctgcagt tgacgcgctg tgtgaaaagg cacataaatt tctcaacatc 19020
aacgactgca cgcgtattgt tcctgcaaag gtgcgtgtag attgttatga taaattcaag 19080
gtcaatgaca ccactcgcaa gtatgtgttt actacaataa atgcattacc tgagttggtg 19140
actgacatta ttgtcgttga tgaagttagt atgcttacca actatgagct gtctgttatt 19200
aacagtcgtg ttagggctaa gcattatgtg tatattggcg acccggcgca gttacctgca 19260
ccacgtgtgc tactgaataa gggaactcta gaacctagat attttaattc cgttaccaag 19320
ctaatgtgtt gtttgggtcc agatattttc ttgggcacct gttatagatg ccctaaggag 19380
attgtggata cggtgtcagc cttggtttat aataataagc tgaaggctaa aaatgataat 19440
agctccatgt gctttaaggt ttattataag ggccagacta cacatgagag ttctagtgct 19500
gttaatatgc agcaaataca tttaatttcc aagtttctga aggcaaaccc cagttggagt 19560
aacgccgtat ttattagtcc ttataactcg cagaactatg ttgctaagag agtcttggga 19620
ttacaaaccc agacagtaga ctcagcgcag ggttctgaat atgattttgt tatctactca 19680
cagactgcgg aaacagcgca ttctgtcaat gtaaatagat tcaatgttgc tattacacgt 19740
gctaagaagg gtattctctg tgtcatgagt agtatgcaat tatttgagtc tcttaatttt 19800
actacactga cgttggataa gattaacaat ccacgattac agtgtactac aaatttgttt 19860
aaggattgta gcaggagcta tgtaggatat cacccagccc atgcaccatc ctttttggca 19920
gttgatgaca aatataaggt aggcggtgat ttagccgttt gccttaatgt tgctgattct 19980
gctgtcactt attcgcggct tatatcactc atgggattca agcttgactt gacccttgat 20040
ggttattgta agctgtttat aactagagat gaagctatca aacgtgttag agcctgggtt 20100
ggcttcgatg cagaaggtgc ccatgcgata cgtgatagca ttgggacaaa tttcccatta 20160
caattaggct tttcgactgg aattgatttt gttgtcgaag ccactggaat gtttgctgag 20220
agagatggtt atgtctttaa aaaggcagcc gcacgagctc ctcctggcga acaatttaaa 20280
caccttatcc cacttatgtc aagagggcag aaatgggatg tggttcgcat tagaatagta 20340
caaatgttgt cagaccacct agtggatttg gcagacagtg ttgtacttgt gacgtgggct 20400
gccagctttg agctcacatg tttgcgatat ttcgctaaag ttggaagaga agttgtgtgt 20460
agtgtctgca ccaagcgtgc gacatgtttt aattctagaa ctggatacta tggatgctgg 20520
cgacatagtt attcctgtga ttacctgtac aacccactaa tagttgacat tcaacagtgg 20580
ggatatacag gatctttaac tagcaatcat gatcctattt gcagcgtgca taagggtgct 20640
catgttgcat catctgatgc tatcatgacc cggtgtctag ctgttcatga ttgcttttgt 20700
aagtctgtta attggaattt agaatacccc attatttcaa atgaggtcag tgttaatacc 20760
tcctgcaggt tattgcagcg cgtaatgttt agggctgcga tgctatgcaa taggtatgat 20820
gtgtgttatg acattggcaa ccctaaaggt cttgcctgtg tcaaaggata tgattttaag 20880
ttctatgacg cctcccctgt tgttaagtcg gtcaaacagt ttgtttacaa atacgaggca 20940
cataaagatc aatttttaga tggtttgtgt atgttttgga actgcaatgt ggataagtat 21000
ccagcgaatg cagttgtgtg taggtttgac acgcgtgtgt tgaacaaatt aaatctccct 21060
ggctgtaatg gtggcagttt gtatgttaac aaacatgcat tccacaccag tccctttacc 21120
cgggctgcct tcgagaattt gaagcctatg cctttctttt attattcaga tacgccctgt 21180
gtgtatatgg aaggcatgga atctaagcag gtcgattatg tcccattgag aagcgctaca 21240
tgcatcacaa gatgcaattt aggtggcgct gtttgtttaa aacatgctga ggagtatcgt 21300
gagtaccttg agtcttacaa tacggcaacc acagcgggtt ttactttttg ggtctataag 21360
acttttgatt tttacaacct ttggaatact tttactaggc tccaaagttt agaaaatgta 21420
gtgtataacc tggtcaacgc tggacacttt gatggccggg cgggtgaact gccttgtgct 21480
gttataggtg agaaagtcat tgccaagatt caaaatgagg atgtcgtggt ctttaaaaat 21540
aacacgccat tccccactaa tgtggctgtc gaattatttg ctaagcgcag tattcggccc 21600
caccccgagc ttaagctctt tagaaatttg aatattgacg tgtgctggag tcacgtcctt 21660
tgggattatg ctaaggatag tgtgttttgc agttcgacgt ataaggtctg caaatacaca 21720
gatttacagt gcattgaaag cttgaatgta ctttttgatg gtcgtgataa tggtgctctt 21780
gaagctttta agaagtgccg gaatggcgtc tacattaaca cgacaaaaat taaaagtctg 21840
tcgatgatta aaggcccaca acgtgccgat ttgaatggcg tagttgtgga gaaagttgga 21900
gattctgatg tggaattttg gtttgctgtg cgtaaagacg gtgacgatgt tatcttcagc 21960
cgtacaggga gccttgaacc gagccattac cggagcccac aaggtaatcc gggtggtaat 22020
cgcgtgggtg atctcagcgg taatgaagct ctagcgcgtg gcactatctt tactcaaagc 22080
agattattat cttctttcac acctcgatca gagatggaga aagattttat ggatttagat 22140
gatgatgtgt tcattgcaaa atatagttta caggactacg cgtttgaaca cgttgtttat 22200
ggtagtttta accagaagat tattggaggt ttgcatttgc ttattggctt agcccgtagg 22260
cagcaaaaat ccaatctggt aattcaagag ttcgtgacat acgactctag cattcattcg 22320
tactttatca ctgacgagaa cagtggtagt agtaagagtg tgtgcactgt tattgattta 22380
ttgttagatg attttgtgga cattgtaaag tccctgaatc taaagtgtgt gagtaaggtt 22440
gttaatgtta atgtggattt taaggacttc cagtttatgt tgtggtgcaa tgaggagaag 22500
gtcatgactt tctatcctcg tttgcaggct gctgctgact ggaaacctgg ttatgttatg 22560
cctgtcttat ataagtattt ggaatcgcct ctggaaagag taaacctctg gaattatggc 22620
aagccgatta ctttacctac aggatgtatg atgaatgttg ctaagtatac tcaattatgt 22680
caatatttga gcactacaac attagcagtt ccggctaata tgcgtgtctt acaccttggt 22740
gccggttcgg ataagggtgt tgcccctggg tctgcagttc ttaggcagtg gctaccagcg 22800
ggaagtattc ttgtagataa tgatgtgaat ccatttgtga gtgacagtgt cgcctcatat 22860
tatggaaatt gtataacctt accctttgat tgtcagtggg atctgataat ttctgatatg 22920
tacgaccctc ttactaagaa cattggggag tacaacgtga gtaaagatgg attctttact 22980
tacctctgtc atttaattcg tgacaagttg gctctgggtg gcagtgttgc cataaaaata 23040
acagagtttt cttggaacgc tgagttatat agtttaatgg ggaagtttgc gttctggaca 23100
atcttttgca ccaacgtaaa cgcctcttca agtgaaggat ttttgattgg cataaattgg 23160
ttgaataaga cccgtaccga aattgacggt aaaaccatgc atgccaatta tctgttttgg 23220
agaaatagta caatgtggaa tggaggggct tacagtctct ttgacatgag taagttccct 23280
ttgaaagcgg ctggtacggc tgttgttagc cttaaaccag accaaataaa tgacttagtc 23340
ctctccttga ttgagaaggg caagttatta gtgcgtgata cacgcaaaga agtttttgtt 23400
ggcgatagcc tagtaaatgt caaataaacg aacaatgttt gtttttcttg ttttattgcc 23460
actagtctct agtcagtgtg ttaatcttac aaccagaact caattacccc ctgcatacac 23520
taattctttc acacgtggtg tttattaccc tgacaaagtt ttcagatcct cagttttaca 23580
ttcaactcag gacttgttct tacctttctt ttccaatgtt acttggttcc atgctataca 23640
tgtctctggg accaatggta ctaagaggtt tgataaccct gtcctaccat ttaatgatgg 23700
tgtttacttt gcttccactg agaagtctaa cataataaga ggctggattt ttggtactac 23760
tttagattcg aaaacccagt ccctacttat tgttaataac gctactaatg ttgttatcaa 23820
agtctgtgaa tttcaatttt gtaacgatcc atttttgggt gtttattacc acaaaaacaa 23880
caaaagttgg atggaaagtg agttcagagt ttattctagt gcgaataatt gcacttttga 23940
atacgtctct cagccttttc ttatggacct tgaaggaaaa cagggtaatt tcaaaaatct 24000
tagggaattt gtgttcaaga atattgatgg ttacttcaag atatactcta agcacacgcc 24060
tattaattta gtgcgtgatc tccctcaggg tttttcggct ttagaaccat tggtagattt 24120
gccaataggt attaacatca ctaggtttca aactttactt gctttacata gaagttattt 24180
aactcctggt gattcttctt caggttggac agctggtgct gcagcttatt atgtgggtta 24240
tcttcaacct aggacttttc tactgaagta caatgaaaat ggaaccatta cagatgctgt 24300
agactgtgca cttgaccctc tctcagaaac aaagtgtacg ttgaaatcct tcactgtaga 24360
aaaaggaatc tatcaaactt ctaactttag agtccaacca acagaatcta ttgttagatt 24420
tcctaacatc acaaacttgt gcccttttgg tgaagttttt aacgccacca gatttgcatc 24480
tgtttatgct tggaacagga agagaatcag caactgtgtt gctgattatt ctgtcctgta 24540
taattccgca tcattttcca cttttaagtg ttatggagtg tctcctacta aattaaatga 24600
tctctgcttt actaatgtct atgcagattc atttgtaatt agaggtgatg aagtcagaca 24660
aatcgctcca gggcaaactg gaaagattgc tgattataac tacaaattac cagatgattt 24720
tacaggctgc gttatagctt ggaattctaa caatcttgat tctaaggttg gtggtaatta 24780
taattacctg tacagattgt ttaggaagtc taatctcaaa ccttttgaga gagatatttc 24840
aactgaaatc tatcaggccg gtagcacacc ttgtaatggt gttgaaggtt ttaattgtta 24900
ctttcctctg caatcatatg gtttccaacc cactaatggt gttggttacc aaccatacag 24960
agtagtagta ctttcttttg aacttctaca tgcaccagca actgtttgtg gacctaaaaa 25020
gtctactaat ttggttaaga acaagtgtgt caatttcaac ttcaatggtt taacaggcac 25080
aggtgttctt actgagtcta acaaaaagtt tctgcctttc caacaatttg gcagagacat 25140
tgctgacact actgatgctg ttcgtgatcc acaaacactt gagattcttg acattacacc 25200
atgttctttt ggtggtgtca gtgttataac accaggaaca aatacttcta accaggttgc 25260
tgttctttat caggatgtta actgcacaga agtccctgtt gctattcatg cagatcaact 25320
tactcctact tggcgtgttt attctacagg ttctaatgtt tttcaaacac gtgcaggctg 25380
tttaataggg gctgaacatg tcaacaactc atatgagtgt gacataccca ttggtgcagg 25440
tatatgcgct agttatcaga ctcagactaa ttctcctcgg agagcaagaa gtgtagctag 25500
tcaatccatc attgcctaca ctatgtcact tggtgcagaa aattcagttg cttactctaa 25560
taactctatt gccataccca caaattttac tattagcgtt accacagaaa ttctaccagt 25620
gtctatgacc aagacatcag tagattgtac aatgtacatt tgtggtgatt caactgaatg 25680
cagcaatctt ttgttgcaat atggcagttt ttgtacacaa ttaaaccgtg ctttaactgg 25740
aatagctgtt gaacaagaca aaaacaccca agaagttttt gcacaagtca aacaaattta 25800
caagacacca ccaattaaag attttggcgg ttttaatttt agccagatac tgccagatcc 25860
atcaaaacca agcaagaggt catttattga agatctactg ttcaacaaag tgacacttgc 25920
agatgctggc ttcatcaaac aatatggtga ttgccttggt gatattgctg ctagagacct 25980
catttgtgca caaaagttta acggccttac tgttttgcca cctttgctca cagatgaaat 26040
gattgctcaa tacacttctg cactgttagc aggtacaatc acttctggtt ggacttttgg 26100
tgcaggtgct gcattacaaa taccatttgc tatgcaaatg gcttataggt ttaatggtat 26160
tggagttaca cagaatgttc tctatgagaa ccaaaaattg attgccaacc aatttaatag 26220
tgctattggc aaaattcaag actcactttc ttccacagca agtgcacttg gaaaacttca 26280
agatgtggtc aaccaaaatg cacaagcttt aaacacgctt gttaaacaac ttagctccaa 26340
ttttggtgca atttcaagtg ttttaaacga catcctttca cgtcttgaca aagttgaggc 26400
tgaagtgcaa attgataggt tgatcacagg cagacttcaa agtttgcaga catatgtgac 26460
tcaacaatta attagagctg cagaaatcag agcttctgct aatcttgctg ctactaaaat 26520
gtcagagtgt gtacttggac aatcaaaaag agttgacttt tgcggaaagg gctatcatct 26580
tatgtcattt cctcagtcag cacctcatgg tgtcgtcttt ttgcatgtga cttatgtccc 26640
tgcacaagaa aagaacttca caactgctcc tgccatttgt catgatggaa aagcacactt 26700
tcctcgtgaa ggtgtctttg tttcaaatgg cacacactgg tttgtaacac aaaggaattt 26760
ttatgaacca caaatcatta ctacagacaa cacatttgtg tctggtaact gtgatgttgt 26820
aataggaatt gtcaacaaca cagtttatga tcctttgcaa cctgaattag actcattcaa 26880
ggaggagctt gataaatact tcaagaacca tacctcacca gatgttgatt taggtgacat 26940
ctctggcatt aatgcttcag ttgtaaacat tcagaaagaa atcgaccgcc tcaatgaggt 27000
tgccaagaat ttaaatgaat ctctcatcga tctccaagaa cttggaaagt atgagcagta 27060
tataaaatgg ccatggtaca tttggctagg ttttatagct ggcttgattg ccatagtaat 27120
ggtgacaatt atgctttgct gtatgaccag ttgctgtagt tgtctcaagg gctgttgttc 27180
ttgtggatcc tgctgcaaat ttgacgagga cgactctgag ccagtgctca aaggagtcaa 27240
attacattac acataactat cacagcctct cctggaaaga cagaaaatct aaacaattta 27300
tagcattctc attgctacct ggccccgtaa gaggcagtca tagctatggc cgtgttggtc 27360
ctaaggctac attggctgct gtctttattg gtccatttat tgtagcatgt atgctaggca 27420
ttggcctagt ttatttattg caattgcaag ttcaaatttt tcatgttaag gataccatac 27480
gtgtgactgg caagccagcc actgtgtctt atactacaag tacaccagta acaccgagcg 27540
cgacgacgct cgatggtact acgtatactt taattagacc cactagctct tatacaagag 27600
tttatcttgg tactccaaga ggttttgatt atagtacatt tgggcctaag accctagatt 27660
atgttactaa tctaaacctc atcttaattc tggtcgtcca tatactttta aggcattgtc 27720
caggcatatg aggccaacag ccacatggat ttggcatgtg agtgatgcat ggttacgccg 27780
cacgcgggac tttggtgtca ttcgcctaga agatttttgt tttcaattta attatagcca 27840
accccgagtt ggttattgta gagttccttt aaaggcttgg tgtagcaacc agggtaaatt 27900
tgcagcgcag tttaccctaa aaagttgcga aaaaccaggt cacgaaaaat ttattactag 27960
cttcacggcc tacggcagaa ctgtccaaca ggccgttagc aagttagtag aagaagctgt 28020
tgattttatt ctttttaggg ccacgcagct cgaaagaaat gtttaattta ttccttacag 28080
acacagtatg gtatgtgggg cagattattt ttatattcgc agtgtgtttg atggtcacca 28140
taattgtggt tgccttcctt gcgtctatca aactttgtat tcaactttgc ggtttatgta 28200
atactttggt gctgtcccct tctatttatt tgtatgatag gagtaagcag ctttataagt 28260
actataatga agaaatgaga ctgcccctat tagaggtgga tgatatctaa tccaaacatt 28320
atgagtagta ctactcaggc cccagagccc gtctatcaat ggaccgccga cgaggcagtt 28380
caattcctta aggaatggaa cttctcgttg ggcattatac tactctttat tactatcata 28440
ctacagttcg gttacacgag ccgtagcatg tttatttatg ttgtgaaaat gataatcttg 28500
tggttaatgt ggccactgac tattgttttg tgtattttca attgcgtgta tgcgctaaat 28560
aatgtgtatc ttggattttc tatagtgttt actatagtgt ccattgtaat ctggatcatg 28620
tattttgtga acagcataag gttgtttatc aggactggta gctggtggag cttcaacccc 28680
gaaacaaaca accttatgtg tatagatatg aaaggtaccg tgtatgttag acccattatt 28740
gaggattacc atacactaac agccactatt attcgtggcc acctctacat gcaaggtgtt 28800
aagctaggca ccggtttctc tttgtctgac ttgcccgctt atgttacagt tgctaaggtg 28860
tcacaccttt gcacttataa gcgcgcattc ttagacaagg tagacggtgt tagcggtttt 28920
gctgtttatg tgaagtccaa ggtcggaaat taccgactgc cctcaaacaa accgagtggc 28980
gcggacaccg cattgttgag aacctaatct aaactttaag gagagaatga atcctatgtc 29040
ggcgctcggt ggtaacccct cgcgagaaag tcgggatagg acactctcta tcagaatgga 29100
tgtcttgctg tcataacaga tagagaaggt tgtggcagac cctgtatcaa ttagttgaaa 29160
gagattgcaa aatagagaat gtgtgagaga agttagcaag gtcctacgtc taaccataag 29220
aacggcgata ggcgccccct gggaacagct cacatcaggg tactattcct gcaatgccct 29280
agtaaatgaa tgaagttgat catggccaat tggaagaatc acaaaaaaaa aaaaaaaaaa 29340
aacggccggt ttaaacgcta cagtccaagt tccaagcggg atactagatg tataatgtcc 29400
gccatgcaga cgaaaccagt cggagattac cgagcattct atcacgtcgg cgaccaatag 29460
tgagcttagg gataacaggg taataaacga tccccgggaa ttcactggcc gtcgttttac 29520
aacgtcgtga ctgggaaaac cctggcgtta cccaacttaa tcgccttgca gcacatcccc 29580
ctttcgccag ctggcgtaat agcgaagagg cccgcaccga tcgcccttcc caacagttgc 29640
gcagcctgaa tggcgaatgg cgatagatcc ggtggatgac cttttgaatg acctttaata 29700
gattatatta ctaattaatt ggggacccta gaggtcccct tttttatttt aaaaattttt 29760
tcacaaaacg gtttacaagc ataaagctcg gacggatctt ttccgctgca taaccctgct 29820
tcggggtcat tatagcgatt ttttcggtat atccatcctt tttcgcacga tatacaggat 29880
tttgccaaag ggttcgtgta gactttcctt ggtgtatcca acggcgtcag ccgggcagga 29940
taggtgaagt aggcccaccc gcgagcgggt gttccttctt cactgtccct tattcgcacc 30000
tggcggtgct caacgggaat cctgctctgc gaggctggcc ggctaccgcc ggcgtaacag 30060
atgagggcaa gcggatggct gatgaaacca agccaaccag gaagggcagc ccacctatca 30120
aggtgtcgat gcaggggggg gggaaagcca cgttgtgtct caaaatctct gatgttacat 30180
tgcacaagat aaaaatatat catcatgaac aataaaactg tctgcttaca taaacagtaa 30240
tacaaggggt gttatgagcc atattcaacg ggaaacgtct tgctcaaggc cgcgattaaa 30300
ttccaacatg gatgctgatt tatatgggta taaatgggct cgcgataatg tcgggcaatc 30360
aggtgcgaca atctatcgat tgtatgggaa gcccgatgcg ccagagttgt ttctgaaaca 30420
tggcaaaggt agcgttgcca atgatgttac agatgagatg gtcagactaa actggctgac 30480
ggaatttatg cctcttccga ccatcaagca ttttatccgt actcctgatg atgcatggtt 30540
actcaccact gcgatccccg gaaaaacagc attccaggta ttagaagaat atcctgattc 30600
aggtgaaaat attgttgatg cgctggcagt gttcctgcgc cggttgcatt cgattcctgt 30660
ttgtaattgt ccttttaaca gcgatcgcgt atttcgtctc gctcaggcgc aatcacgaat 30720
gaataacggt ttggttgatg cgagtgattt tgatgacgag cgtaatggct ggcctgttga 30780
acaagtctgg aaagaaatgc ataagttttt gccattctca ccggattcag tcgtcactca 30840
tggtgatttc tcacttgata accttatttt tgacgagggg aaattaatag gttgtattga 30900
tgttggacga gtcggaatcg cagaccgata ccaggatctt gccatcctat ggaactgcct 30960
cggtgagttt tctccttcat tacagaaacg gctttttcaa aaatatggta ttgataatcc 31020
tgatatgaat aaattgcagt ttcatttgat gctcgatgag tttttctaat cagaattggt 31080
taattggttg taacactggc agagcattac gctgacttga cgggacggcg gctttgttga 31140
ataaatcgaa cttttgctga gttgaaggat cagatcacgc atcttcccga caacgcagac 31200
cgttccgtgg caaagcaaaa gttcaaaatc accaactggt ccacctacaa caaagctctc 31260
atcaaccgtg gctccctcac tttctggctg gatgatgggg cgattcaggc ctggtatgag 31320
tcagcaacac cttcttcacg aggcagacct cagacggtat cggatcgatc ccccgatgtg 31380
tagcagtggc ggaccatata ggcagatcag aaggcgcggt tctcctacat gagcttttca 31440
attcaattca tcattttttt tttattcttt tttttgattt cggtttcctt gaaatttttt 31500
tgattcggta atctccgaac agaaggaaga acgaaggaag gagcacagac ttagattggt 31560
atatatacgc atatgtagtg ttgaagaaac atgaaattgc ccagtattct taacccaact 31620
gcacagaaca aaaacctgca ggaaacgaag ataaatcatg tcgaaagcta catataagga 31680
acgtgctgct actcatccta gtcctgttgc tgccaagcta tttaatatca tgcacgaaaa 31740
gcaaacaaac ttgtgtgctt cattggatgt tcgtaccacc aaggaattac tggagttagt 31800
tgaagcatta ggtcccaaaa tttgtttact aaaaacacat gtggatatct tgactgattt 31860
ttccatggag ggcacagtta agccgctaaa ggcattatcc gccaagtaca attttttact 31920
cttcgaagac agaaaatttg ctgacattgg taatacagtc aaattgcagt actctgcggg 31980
tgtatacaga atagcagaat gggcagacat tacgaatgca cacggtgtgg tgggcccagg 32040
tattgttagc ggtttgaagc aggcggcaga agaagtaaca aaggaaccta gaggcctttt 32100
gatgttagca gaattgtcat gcaagggctc cctatctact ggagaatata ctaagggtac 32160
tgttgacatt gcgaagagcg acaaagattt tgttatcggc tttattgctc aaagagacat 32220
gggtggaaga gatgaaggtt acgattggtt gattatgaca cccggtgtgg gtttagatga 32280
caagggagac gcattgggtc aacagtatag aaccgtggat gatgtggtct ctacaggatc 32340
tgacattatt attgttggaa gaggactatt tgcaaaggga agggatgcta aggtagaggg 32400
tgaacgttac agaaaagcag gctgggaagc atatttgaga agatgcggcc agcaaaacta 32460
aaaaactgta ttataagtaa atgcatgtat actaaactca caaattagag cttcaattta 32520
attatatcag ttattacccg ggaatctcgg tcgtaatgat ttttataatg acgaaaaaaa 32580
aaaaattgga aagaaaaagc tgggcgcgcc ggccggccct tttcatcacg tgctataaaa 32640
ataattataa tttaaatttt ttaatataaa tatataaatt aaaaatagaa agtaaaaaaa 32700
gaaattaaag aaaaaatagt ttttgttttc cgaagatgta aaagactcta gggggatcgc 32760
caacaaatac taccttttat cttgctcttc ctgctctcag gtattaatgc cgaattgttt 32820
catcttgtct gtgtagaaga ccacacacga aaatcctgtg attttacatt ttacttatcg 32880
ttaatcgaat gtatatctat ttaatctgct tttcttgtct aataaatata tatgtaaagt 32940
acgctttttg ttgaaatttt ttaaaccttt gtttattttt ttttttcttc attccgtaac 33000
tcttctacct tctttattta ctttctaaaa tccaaataca aaacataaaa ataaataaac 33060
acagagtaaa ttcccaaatt attccatcat taaaagatac gaggcgcgtg taagttacag 33120
gcaagcgatc ggccggcccg ggcatttaaa tgcaggccgc gtacgcgtcg acggtaccga 33180
attcgcttaa acgagctcat gttcgccggt gaacgcgttg aggaagccgg gcagtgcctc 33240
ggcaaaatcc ttgcgtgtag acaagacatc tgcgtagcag ttgtcctcaa caacgatgtc 33300
gaaatccaaa tcggagtgct catcgagtcc tccgtgaacg taagagccgc cgatcagaag 33360
agcgcggaag cgaacatcgg aagcgaccgc atcgcggatg cggttcaaga aagttgcatg 33420
agcttgtgga agtgtgctga gcataaatga ttctcctagc tgttctttgg gtaagtacgc 33480
catcaggacg ttgtgagtgg cgcgattttt agcggctgaa atcagccctt gagcctgtcg 33540
gcaagtcgcg tcatgaggtc catgcgctca tgcaggatcg ccacgaccaa cgcgggttcg 33600
cccgcacgcg gcaggcaaaa aacgtagtgg tgttcgcagc gggccatccg cagcgcggga 33660
aagagttcgc tcatgtcctt aaacgggcct tcgccggcgg caagcctggc tatgccctgt 33720
tccagcttag cgatatagcg gcgcacctgc gccgcgcccc actcccggcg cgtgtagcgg 33780
atgatgccgc gtagatcggc ttcggcctca gccgtgagga tgtaggccgt caagcgcgat 33840
ccccgctgag ttcttcatca agaatttcgc cgacgctctt ggtggacacc ttgccggcaa 33900
gcccatcgtt gatgcggttc cccagcatgg ttttcagttc ctgccatgcc tgatcggcat 33960
cagcgtcacc ggggaacaga cgttcgaggg cgtattgctt aatggtcttg ccctgcaagg 34020
cggccagggc tttcaggctc tggtgctgct ggtccgtcat gtcgattgtc aggcggctca 34080
ttggataacc tccataaaat acacgtaacc acattagcac atatgtgggc gtgaggctac 34140
agcgcgaggc gcattaaggt cgggaaaatg cgctaggcgc atttaaattg cgtattgctg 34200
taatgcgcca tgccggctag actaggccca aatgggtata cccaatttga ccaaggggga 34260
cgcgatgagg gcggccaagc actaccgaca acttctatcc atcgacttca acatcgaggc 34320
gctggccttc gtgcctggac ccgacggcac acgcggccgg cgcatccacg tcctggggcg 34380
cgaggtccgc gaccggcccg gcctggtcga gtacctttcg ccggcgttcg gctcgcgggt 34440
ggcgctggac ggctactgca aggccaattt cgatgcagtg ctgcacctgg cgtaccccga 34500
tcatcagcaa tggggccacg catgaagcgc cgaagctacg ccatgctgcg cgccgctgcc 34560
gcgctggccg tcctggtcgt tgcctcgccg gcatgggccg agctgcgcgg cgaggtcgtg 34620
cgcatcatcg acggcgacac catcgacgtg ctggtagaca agcagccggt gcgcgtgcgc 34680
ctggtggaca ttgacgcgcc ggaaaagcgg caagccttcg gcgaacgtgc gcgccaggcg 34740
ctggccggca tggtgttccg ccggcacgtc ctggtcgacg agaaggacac cgaccgttac 34800
ggccgcacgc tgggcaccgt gtgggtcaac atggagctgg ccagccggcc gccgcagccg 34860
cgcaacgtca acgccgcgat ggttcaccag ggcatggcgt gggcctatcg cttccacggc 34920
cgcgcggccg accctgaaat gctgcggctc gaacaggagg cgcgaggcaa gcgcgtcggc 34980
ctctggtccg atccgcacgc cgtcgagccg tggaaatggc gacgcgagag caacaaccgg 35040
agggacgaag gttgaaggtc gcccgcatct acctgcgcgc cagtacggac gagcagaatc 35100
ttgaacgcca ggagagcctt gtagcggcca cgcgggccgc cgggtactac gtcgccggca 35160
tctaccgcga gaaggcgtcc ggcgcacgcg ccgaccggcc cgagctgctg cgcatgatcg 35220
cggacctgca acctggtgaa gtcgtcgttg cggagaagat cgaccgcatc agccgcttgc 35280
cgttggccga ggccgagcgc ctggttgcgt cgatccgggc caaaggggcc aagctggccg 35340
tgcctggcgt ggtggacctg tcggagctgg ccgccgaggc gaacggagtg gcgaaaatcg 35400
ttctggaatc cgtccaggac atgcttttga agctcgcctt gcagatggcc cgcgacgact 35460
acgaggatcg gcgcgagcgt caacgtcagg gtgtccagtt ggcgaaggcc gccggccgct 35520
acaccggccg caaacgtgac gccggcatgc acgaccgcat catcacgctt cgctccggcg 35580
gatcgagcat tgccaagacg gccaagctgg tcggatgcag cccgagccag gtcaaacgag 35640
tgtgggcggc ctggaacgcg cagcagcaaa aataaagccg ggcagtgccc ggcttttctc 35700
accttttcgc gtcccgcagg gccgctgcga gcgccctacc tagatcctcg ctttccccct 35760
cggtgtagtc cggccagggc acgaagggcg cggatgcgaa cctgttgagc aggtacgcct 35820
tcgggcagcg gtagaccacc ggcgagttcg ccttttcatc ccaccgggcc aggatcacgt 35880
ccgcatcgca gtgcatgtcc ttcacctggt cgcggaagaa gccgaaggcc accatgccgc 35940
tatgttcgcc gaggaacgcc agttgcttcg cgctggcgat cgcgccgacg ccgccggcca 36000
aaaccgacgc catcacccag ccgacgaacc agaagctggc atgcttgcgg ttgaccaccg 36060
cacgcgcagc cgcgaccagg acaacggcca agctgccgac cagggccatg acgaccgtga 36120
tccggccgtt gtggaaagcg atgggcttgc cagcgtccgc ttgcacggcg tcgtaaatgc 36180
tggacccgat gggcgcgcac atcagcacga caggcagcag caccaggaac atcgtccgcg 36240
tccattgcgc gagtgccttg cggcgttcgc cggcggcaag cgcctccatc atcggcgtga 36300
agcccaacag ggccaccgca gccgccaagc cggcaacgat gccgcaggcg attacataca 36360
tacatcctcc ctaatgcgcc ttgcgcacgg ttgtagtcag agtccgcggt ggggcgataa 36420
gctcatgacc aaaatccctt aacgtgagtt ttcgttccac tgagcgtcag accccgtaga 36480
aaagatcaaa ggatcttctt gagatccttt ttttctgcgg gggatcagga ccgctgccgg 36540
agcgcaaccc actcactaca gcagagccat gtagacaaca tcccctcccc ctttccaccg 36600
cgtcagacgc ccgtagcagc ccgctacggg ctttttcatg ccctgcccta gcgtccaagc 36660
ctcacggccg cgctcggcct ctctggcggc cttctggcgc tcctgctgcg gcgtccgctc 36720
gtgggccgtg gcgcgggtcc gcgcgccggc ctcgtgcgcc tggcgctcgc gggcgaggtc 36780
cagggcggcc gtcttcacgt tctgccttgc gcagatgaga tagatcgatc tagcgtggac 36840
tcaaggctct cgcgaatggc tcgcgttgga aactttcatt gacacttgag gggcaccgca 36900
gggaaattct cgtccttgcg agaaccggct atgtcgtgct gcgcatcgag cctgcgccct 36960
tggcttgtct cgcccctctc cgcgtcgcta cggggcttcc agcgcctttc cgacgctcac 37020
cgggctggtt gccctcgccg ctgggctggc ggccgtctat ggccctgcaa acgcgccaga 37080
aacgccgtcg aagccgtgtg cgagacaccg cggccgccgg cgttgtggat acctcgcgga 37140
aaacttggcc ctcactgaca gatgaggggc ggacgttgac acttgagggg ccgactcacc 37200
cggcgcggcg ttgacagatg aggggcaggc tcgatttcgg ccggcgacgt ggagctggcc 37260
agcctcgcaa atcggcgaaa acgcctgatt ttacgcgagt ttcccacaga tgatgtggac 37320
aagcctgggg ataagtgccc tgcggtattg acacttgagg ggcgcgacta ctgacagatg 37380
aggggcgcga tccttgacac ttgaggggca gagtgctgac agatgagggg cgcacctatt 37440
gacatttgag gggctgtcca caggcagaaa atccagcatt tgcaagggtt tccgcccgtt 37500
tttcggccac cgctaacctg tcttttaacc tgcttttaaa ccaatattta taaaccttgt 37560
ttttaaccag ggctgcgccc tgtgcgcgtg accgcgcacg ccgaaggggg gtgccccccc 37620
ttctcgaacc ctcccggccc gctaacgcgg gcctcccatc cccccagggg ctgcgcccct 37680
cggccgcgaa cggcctcacc ccaaaaatgg cagcgctggc agtccttgcc attgccggga 37740
tcggggcagt aacgggatgg gcgatcagcc cgagcgcgac gcccggaagc attgacgtgc 37800
cgcaggtgct ggcatcgaca ttcagcgacc aggtgccggg cagtgagggc ggcggcctgg 37860
gtggcggcct gcccttcact tcggccgtcg gggcattcac ggacttcatg gcggggccgg 37920
caatttttac cttgggcatt cttggcatag tggtcgcggg tgccgtgctc gtgttcgggg 37980
gtgaattaat tccccggatc gatccgtcag cttcacgctg ccgcaagcac tcagggcgca 38040
agggctgcta aaggaagcgg aacacgtaga aagccagtcc gcagaaacgg tgctgacccc 38100
ggatgaatgt cagctactgg gctatctgga caagggaaaa cgcaagcgca aagagaaagc 38160
aggtagcttg cagtgggctt acatggcgat agctagactg ggcggtttta tggacagcaa 38220
gcgaaccgga attgccagct ggggcgccct ctggtaaggt tgggaagccc tgcaaagtaa 38280
actggatggc tttcttgccg ccaaggatct gatggcgcag gggatcaaga tcgacggatc 38340
gatccgggga attaattccg gggcaatccc gcaaggaggg tga 38383
<210> 41
<211> 29494
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthesis optimized sequence E-protein and ORF6 double deletion
<400> 41
caaatataac gaaaggctca gtcgaaagac tgggcctttc gttttatctg ttgtttgtcg 60
taatacgact cactataggg attaaaggtt tataccttcc caggtaacaa accaaccaac 120
tttcgatctc ttgtagatct gttctctaaa cgaactttaa aatctgtgtg gctgtcactc 180
ggctgcatgc ttagtgcact cacgcagtat aattaataac taattactgt cgttgacagg 240
acacgagtaa ctcgtctatc ttctgcaggc tgcttacggt ttcgtccgtg ttgcagccga 300
tcatcagcac atctaggttt cgtccgggtg tgaccgaaag gtaagatgga gagccttgtc 360
cctggtttca acgagaaaac acacgtccaa ctcagtttgc ctgttttaca ggttcgcgac 420
gtgttagtac gtggttttgg agattcagtg gaagaagtct tatcagaggc acgtcaacat 480
cttaaagatg gcacttgtgg cttagtagaa gttgaaaaag gcgttttgcc tcaacttgaa 540
cagccctatg tgttcatcaa acgttctgat gctagaactg cacctcatgg tcatgttatg 600
gttgagctgg tagcagaatt agaaggtatt cagtacggtc gtagtggtga gacattaggt 660
gttttagttc ctcatgtggg cgaaatacca gtggcttacc gcaaagttct tcttagaaag 720
aacggtaata aaggagctgg tggccatagt tacggcgctg atttaaagtc atttgactta 780
ggcgacgagc ttggcactga tccttatgaa gatttccaag aaaactggaa cactaaacat 840
agcagtggtg ttacccgtga actcatgcgt gagttaaatg gaggtgcata cactcgctat 900
gtcgataaca acttctgtgg acctgatggt taccctcttg agtgcattaa agaccttcta 960
gcacgtgctg gtaaagcttc atgcactttg tccgaacaac tggactttat tgacactaag 1020
aggggtgtat actgctgccg tgaacatgag catgaaattg cttggtacac ggaacgttct 1080
gaaaagagct atgaattgca gacacctttt gaaattaaac tggcaaagaa atttgacacc 1140
ttcaatgggg aatgtccaaa ttttgtattt cccctcaatt ccataatcaa gactattcaa 1200
ccaagggttg aaaagaaaaa gcttgatggc tttatgggta gaattcgatc tgtctatcca 1260
gttgcgtcac caaatgaatg caaccaaatg tgcctttcaa ctctcatgaa gtgtgatcat 1320
tgtggtgaaa cttcatggca gacgggcgat tttgttaaag ccacttgcga attttgtggc 1380
actgagaatt tgactaaaga aggtgccact acttgtggtt acttacccca aaatgctgtt 1440
gttaaaattt actgtccagc atgtcacaat tcagaagtag gacctgagca tagtcttgcc 1500
gaataccata atgaatctgg cttgaaaacc attcttcgta agggtggtcg cactattgct 1560
tttggaggat gtgtgttctc ttatgttggt tgccataaca agtgtgctta ttgggttcca 1620
cgtgcttcag ctaacatagg ttgtaaccat acaggtgttg ttggagaagg ttccgaaggt 1680
cttaatgaca accttcttga aatactccaa aaagagaaag tcaacatcaa tattgttggt 1740
gactttaaac ttaatgaaga gatcgccatt attttggcat ctttttctgc ttccacaagt 1800
gcttttgtgg aaactgtgaa aggtttggat tataaagcat tcaaacagat tgttgaatcc 1860
tgtggtaatt ttaaggttac aaagggaaaa gctaaaaaag gtgcctggaa tattggtgaa 1920
cagaaatcaa tactgagtcc tctttatgca tttgcatcag aggctgctcg tgttgtacga 1980
tcaattttct cccgcactct tgaaactgct caaaattctg tgcgtgtttt acagaaggcc 2040
gctataacaa tactagatgg aatttcacag tattcactga gactcattga tgctatgatg 2100
ttcacatctg atttggctac taacaatcta gttgtaatgg cctacattac aggtggtgtt 2160
gttcagttga cttcgcagtg gctaactaac atctttggca ctgtttatga aaaactcaaa 2220
cccgtccttg attggcttga agagaagttt aaggaaggtg tagagtttct tagagacggt 2280
tgggagattg ttaaattcat ctcaacctgt gcttgtgaaa ttgtcggtgg acaaattgtc 2340
acctgtgcta aggaaattaa ggagagtgtt cagacattct ttaagcttgt aaacaagttt 2400
ttggctttgt gtgctgactc tatcattatt ggtggagcta aacttaaagc cttgaattta 2460
ggtgaaacat ttgtcacgca ctcaaaggga ttgtacagaa agtgtgttaa atccagagaa 2520
gaaactggcc tactcatgcc tctaaaagcc ccaaaagaaa ttatcttctt agagggagaa 2580
acacttccca cagaagtgtt aacagaggaa gttgtcttga aaactggtga tttacaacca 2640
ttagaacaac ctactagtga agctgttgaa gctccattgg ttggtacacc agtttgtatt 2700
aacgggctta tgttgctcga aatcaaagac acagaaaagt actgtgccct tgcacctaat 2760
atgatggtaa caaacaatac cttcacactc aaaggcggtg caccaacaaa ggttactttt 2820
ggtgatgaca ctgtgataga agtgcaaggt tacaagagtg tgaatatcac ttttgaactt 2880
gatgaaagga ttgataaagt acttaatgag aagtgctctg cctatacagt tgaactcggt 2940
acagaagtaa atgagttcgc ctgtgttgtg gcagatgctg tcataaaaac tttgcaacca 3000
gtatctgaat tacttacacc actgggcatt gatttagatg agtggagtat ggctacatac 3060
tacttatttg atgagtctgg tgagtttaaa ttggcttcac atatgtattg ttctttctac 3120
cctccagatg aggatgaaga agaaggtgat tgtgaagaag aagagtttga gccatcaact 3180
caatatgagt atggtactga agatgattac caaggtaaac ctttggaatt tggtgccact 3240
tctgctgctt tacaacctga agaagaacaa gaagaagatt ggttagatga tgatagtcaa 3300
caaactgttg gtcaacaaga cggcagtgag gacaatcaga caactactat tcaaacaatt 3360
gttgaggttc aacctcaatt agagatggaa cttacaccag ttgttcagac tattgaagtg 3420
aatagtttta gtggttatct taaacttact gacaatgtat acatcaagaa tgcagacatt 3480
gtggaagaag ctaaaaaggt aaaaccaaca gtggttgtta atgcagccaa tgtttacctt 3540
aaacatggag gaggtgttgc aggagcctta aataaggcta ctaacaatgc catgcaagtt 3600
gaatctgatg attacatagc tactaatgga ccacttaaag tgggtggtag ttgtgtttta 3660
agcggacaca atcttgctaa acactgttta catgttgtcg gcccaaatgt taacaaaggt 3720
gaagatattc aacttcttaa gagtgcttat gaaaatttta accagcacga agttctactt 3780
gcaccattat tatcagctgg tatttttggt gctgacccta tacattcttt aagagtttgt 3840
gtagatactg ttcgcacaaa tgtctactta gctgtctttg ataaaaatct ctatgacaaa 3900
cttgtttcaa gctttttgga aatgaagagt gaaaagcaag ttgaacaaaa gatcgctgag 3960
attcctaaag aggaagttaa gccatttata actgaaagta aaccttcagt tgaacagaga 4020
aaacaagatg ataagaagat caaagcttgt gttgaagaag ttacaacaac tctggaagaa 4080
actaagttcc tcacagaaaa cttgctcctt tatatcgaca ttaatggcaa tcttcatcca 4140
gattctgcca ctcttgttag tgacattgac atcactttct taaagaaaga tgctccatat 4200
atagtgggtg atgttgttca agagggtgtt ttaactgctg tggttatacc tactaaaaag 4260
gctggtggca ctactgaaat gctagcgaaa gctttgagaa aagtgccaac agacaattat 4320
ataaccactt acccgggtca gggtttaaat ggttacactg tagaggaggc aaagacagtg 4380
cttaaaaagt gtaaaagtgc cttttacatt ctaccatcta ttatctctaa tgagaagcaa 4440
gaaattcttg gaactgtttc ttggaatttg cgagaaatgc ttgcacatgc agaagaaaca 4500
cgcaaattaa tgcctgtctg tgtggaaact aaagccatag tttcaactat acagcgtaaa 4560
tataagggta tcaagataca agagggtgtg gttgattatg gtgctagatt ttacttttac 4620
accagtaaaa caactgtagc gtcacttatc aacacactta acgatctaaa tgaaactctt 4680
gttacaatgc cacttggcta tgtaacacat ggcttaaatt tggaagaagc tgctcggtat 4740
atgagatctc tcaaagtgcc agctacagtt tctgtttctt cacctgatgc tgttacagcg 4800
tataatggtt atcttacttc ttcttctaaa acacctgaag aacattttat tgaaaccatc 4860
tcacttgctg gttcctataa agattggtcc tattctggac aatctacaca actaggtata 4920
gaatttctta agagaggtga taaaagtgta tattacacgt ccaatcctac cacattccac 4980
ctagatggtg aagttatcac ctttgacaat cttaagacac ttctttcttt gagagaagtg 5040
aggactatta aggtgtttac aacagtagac aacattaacc tccacacgca agttgtggac 5100
atgtcaatga catatggaca acagtttggt ccaacttatt tggatggagc tgatgttact 5160
aagataaaac ctcataactc acatgaaggt aaaacatttt acgttttgcc taatgatgac 5220
actctacgtg ttgaggcttt tgagtactac cacacaactg atcctagttt tctgggtagg 5280
tacatgtcag cattaaatca cactaaaaag tggaaatacc cacaagttaa tggtttaact 5340
tcgattaaat gggcagataa caactgttat cttgccactg cattgttaac actccaacaa 5400
atagagttga agtttaatcc acctgctcta caagatgctt attacagagc aagggctggt 5460
gaagctgcta acttttgtgc acttatctta gcctactgta ataagacagt aggtgagtta 5520
ggtgatgtta gagaaacaat gagttacttg tttcaacatg ccaatttaga ttcttgcaaa 5580
agagtcttga acgtggtgtg taaaacttgt ggacaacagc agacaaccct taagggtgta 5640
gaagctgtta tgtacatggg cacactttct tatgaacaat tcaagaaagg tgttcagata 5700
ccttgtacgt gtggtaaaca agctacaaaa tatctagtac aacaggagtc accttttgtt 5760
atgatgtcag caccacctgc tcagtatgaa cttaagcatg gtacatttac ttgtgctagt 5820
gagtacactg gtaattacca gtgtggtcac tataagcata taacttctaa ggaaactttg 5880
tattgcatag acggtgcttt acttacaaag tcctcagaat acaaaggtcc tattacggat 5940
gttttctaca aagaaaacag ttacacaaca accataaaac cagttactta taagttggat 6000
ggtgttgttt gtacagaaat tgaccctaag ttggacaatt attataagaa ggacaactct 6060
tatttcacag agcaaccaat tgatcttgta ccaaaccaac catatccaaa cgcaagcttc 6120
gataatttta agttcgtatg cgataatatc aaatttgctg atgatctcaa ccagttaact 6180
ggttataaga aacctgcttc aagagagctt aaagttacat ttttccctga cttaaatggt 6240
gatgtggtgg ctattgatta taaacactac acaccctctt ttaagaaagg agctaaattg 6300
ttacataagc ctattgtttg gcatgttaac aatgcaacta ataaagccac gtataaacca 6360
aatacctggt gtatacgttg tctttggagc acaaaaccag ttgaaacatc aaattcgttt 6420
gatgtactga agtcagagga cgcgcaggga atggataatc ttgcatgtga agatctaaaa 6480
ccagtctctg aagaagtagt ggaaaatcct accatacaga aagacgttct tgagtgtaat 6540
gtgaaaacta ccgaagttgt aggagacatt atacttaaac cagcaaataa tagtttgaag 6600
atcacagaag aggttggcca cacagatcta atggctgctt atgtagacaa ttctagtctt 6660
actattaaga aacctaatga actctctaga gtattaggtt tgaaaaccct tgctactcat 6720
ggtttagctg ctgttaatag tgtcccttgg gatactatag ctaattatgc taagcctttt 6780
cttaacaaag ttgttagtac aactactaac atagttacac ggtgtcttaa tcgtgtttgt 6840
actaattata tgccttactt ctttacttta ttgctacaat tgtgtacttt tactagaagt 6900
acaaattcta gaatcaaggc atctatgccg actactatag caaagaatac tgttaagagt 6960
gtcggtaaat tttgtctaga ggcttcattt aattatctca agtcacctaa cttttctaag 7020
ctgataaaca ttatcatctg gtttttgcta ttaagtgttt gcctaggttc tttaatctac 7080
tcaaccgctg ctttaggtgt tttaatgtct aatttaggca tgccttctta ctgtactggt 7140
tacagagaag gctatttgaa ctctactaat gtcactattg caacctactg tactggatct 7200
ataccttgta gtgtttgtct tagtggttta gattctttag acacctatcc ttctcttgaa 7260
actatacaga ttaccatttc atctttcaaa tgggatttaa ctgcttttgg cttagttgca 7320
gagtggtttt tggcatatat tcttttcact aggtttttct atgtacttgg attggctgca 7380
atcatgcaat tgtttttcag ctattttgca gtccatttta ttagtaactc ttggcttatg 7440
tggcttataa ttaatcttgt gcagatggcc ccgatttcag ctatggttag aatgtacatc 7500
ttctttgcct cattttatta tgtgtggaaa agttatgtgc atgttgtaga cggttgtaat 7560
tcatcaactt gtatgatgtg ttacaaacgt aatagagcaa caagagtcga atgtacaact 7620
attgttaatg gtgttagaag gtccttttat gtctatgcta atggaggtaa aggcttttgc 7680
aaactacaca attggaattg tgttaattgt gatacattct gtgctggtag tacatttatt 7740
agtgatgaag ttgcgagaga cttgtcacta cagtttaaaa gaccaataaa tcctactgac 7800
caatcttctt acatcgttga tagtgttaca gtgaagaatg gttccatcca tctttacttt 7860
gataaagctg gtcaaaagac ttatgaaaga cattctctct ctcattttgt taacttagac 7920
aacctgagag ctaataacac taaaggttca ttgcctatta atgttatcgt tttcgacggt 7980
aaatcaaaat gtgaagaatc atctgcaaaa tcagcgtctg tttactacag tcagcttatg 8040
tgtcaaccta tactgttact agatcaggca ttagtgtctg atgttggtga tagtgcggaa 8100
gttgcagtta aaatgtttga tgcttacgtt aatacgtttt catcaacttt taacgtacca 8160
atggaaaaac tcaaaacact agttgcaact gcagaagctg aacttgcaaa gaatgtgtcc 8220
ttagacaatg tcttatctac gtttatttca gcagctcggc aagggtttgt tgattcagat 8280
gtagaaacta aagatgttgt tgaatgtctt aaattgtcac atcaatctga catagaagtt 8340
actggcgata gttgtaataa ctatatgctc acctataaca aagttgaaaa catgacaccc 8400
cgtgaccttg gtgcttgtat tgactgtagt gctagacata ttaatgcgca ggtagcaaaa 8460
agtcacaaca ttgctttgat atggaacgtt aaagatttca tgtcattgtc tgaacaacta 8520
cgaaaacaaa tacgtagtgc tgctaaaaag aataacttac ccttcaagtt gacatgtgca 8580
actactagac aagttgttaa tgttgtaaca acaaagatag cacttaaggg tggtaaaatt 8640
gtgaataact ggttgaagca gcttattaaa gttacacttg tgttcctttt tgttgctgct 8700
attttctatc tgataacacc tgttcatgtc atgtctaaac atactgactt ttcaagtgaa 8760
atcataggat acaaggctat tgatggtggt gtcactcgtg acatagcatc tacagatact 8820
tgttttgcta acaaacatgc tgattttgac acatggttta gccagcgtgg tggtagttat 8880
actaatgaca aagcttgccc attgattgct gcagtcataa caagagaagt gggttttgtc 8940
gttcctggtt tgcctggaac gatattacgc acaactaatg gtgacttttt gcatttctta 9000
cctagagttt ttagtgcagt tggtaacatc tgttacacac catcaaaact tatagagtac 9060
actgactttg caacatcagc ttgtgttttg gctgctgaat gtacaatttt taaagacgct 9120
tctggtaagc cagtaccata ttgttatgat accaatgtac tagaaggttc tgttgcttat 9180
gaaagtttac gccctgacac acgttatgtg ctcatggatg gctctattat tcaatttcct 9240
aacacctacc ttgaaggttc tgtaagagtg gtaacaactt ttgattctga gtactgtagg 9300
cacggcactt gtgaaagatc agaagctggt gtttgtgtat ctactagtgg tagatgggta 9360
cttaacaacg attattacag atctttacca ggagttttct gtggtgtaga tgctgtaaat 9420
ttgcttacta acatgtttac accactaatt caacctattg gtgctttgga catatcagca 9480
tctatagtag ctggtggtat tgtagctatc gtagtaacat gccttgccta ctattttatg 9540
aggtttagac gtgcttttgg tgaatacagt catgtagttg cctttaatac tctcctattc 9600
cttatgtcat tcactgtact ctgtttaaca ccagtttact cattcttacc tggtgtttat 9660
tctgttattt acctgtactt gacattttat ctgactaatg atgtttcttt tctcgcacat 9720
attcagtgga tggttatgtt cacaccttta gtacctttct ggataacaat tgcttacatc 9780
atttgtattt ccacaaagca tttctattgg ttctttagta attacctaaa gagacgtgta 9840
gtctttaatg gtgtttcctt tagtactttt gaagaagctg cgctgtgcac ctttttgtta 9900
aataaggaga tgtatctaaa gttgcgtagt gatgtgctat tacctcttac gcaatataat 9960
agatacttag ctctttataa caagtacaag tatttcagtg gagcaatgga tacaactagc 10020
tacagagaag ctgcttgttg tcatctcgca aaggctctca atgacttcag taactcaggt 10080
tctgatgttc tttaccaacc accacaaacc tctatcacct cagctgtttt gcagagtggt 10140
tttagaaaaa tggcattccc atctggtaaa gttgagggtt gtatggtaca agtaacttgt 10200
ggtacaacta cacttaacgg tctttggctt gatgacgtag tttactgtcc aagacatgtg 10260
atctgcacct ctgaagatat gcttaaccct aattatgaag atctactcat ccgtaagtct 10320
aatcataact tcttggtaca ggctggtaat gttcaactca gggttattgg acattctatg 10380
caaaattgtg tacttaagct taaggttgat acagccaatc ctaagacacc taagtataag 10440
tttgttcgca ttcaaccagg acagactttt tcagtgttag cttgttacaa tggttcacca 10500
tctggtgttt accaatgtgc tatgaggccc aatttcacta ttaagggttc attccttaat 10560
ggttcatgtg gtagtgttgg ttttaacata gattatgact gtgtctcttt ttgttacatg 10620
caccatatgg aattaccaac tggagttcat gctggcacag acttagaagg taacttttat 10680
ggaccttttg ttgacaggca aacagcacaa gcagctggta cagatacaac tattacagtt 10740
aatgttcttg cttggttgta cgctgctgtt ataaatggag acaggtggtt tctcaatcga 10800
tttaccacaa ctcttaatga ctttaacctt gtggctatga agtacaatta tgaacctcta 10860
acacaagacc atgttgacat actaggacct ctttctgctc aaactggaat tgccgtttta 10920
gatatgtgtg cttcattaaa agaacttctg caaaatggta tgaatggacg taccatattg 10980
ggtagtgctt tattagaaga tgagtttaca ccttttgatg ttgttagaca atgctcaggt 11040
gttactttcc aaagtgcagt gaaaagaaca atcaagggta cacaccactg gttgttactc 11100
acaattttga cttcactttt agttttagtc cagagtactc aatggtcttt gttctttttc 11160
ttctacgaaa atgccttttt accttttgct atgggtatta ttgctatgtc tgcttttgca 11220
atgatgtttg tcaaacataa gcatgcattt ctctgtttgt ttttgttacc ttctcttgcc 11280
actgtagctt actttaatat ggtctacatg cctgctagtt gggtgatgcg tattatgaca 11340
tggttggata tggttgatac tagtttgtct ggttttaagc taaaagactg tgttatgtat 11400
gcatcagctg tagtgttact aatccttatg acagcaagaa ctgtgtatga tgatggtgct 11460
aggagagtgt ggacacttat gaatgtcttg acactcgttt ataaagttta ctatggcaac 11520
gctttagatc aagccatttc catgtgggct cttataatct ctgttacttc taactactca 11580
ggtgtagtta caactgtcat gtttttggcc agaggtattg tttttatgtg tgttgagtat 11640
tgccctattt tcttcataac tggtaataca cttcagtgta taatgctagt ctattgtttc 11700
ttaggctatt tttgtacttg ttacttcggc ctcttttgtt tactcaaccg ctactttaga 11760
ctgactcttg gtgtttatga ttacttagtg tctacacagg agtttagata tatgaattca 11820
cagggactac tcccacccaa gaatagcata gatgccttca aactcaacat taaattgttg 11880
ggtgttggtg gcaaaccttg tatcaaagta gccactgtac agtctaaaat gtcagatgta 11940
aagtgcacat cagtagtctt actctcagtt ttgcaacaac tcagagtaga atcatcatct 12000
aaattgtggg ctcaatgtgt ccagttacac aatgacattc tcttagctaa agatactact 12060
gaagcctttg aaaaaatggt ttcactactt tctgttttgc tttccatgca gggtgctgta 12120
gacataaaca agctttgtga agaaatgctg gacaacaggg caaccttaca agctatagcc 12180
tcagagttta gttcccttcc atcatatgca gcttttgcta ctgctcaaga agcttatgag 12240
caggctgttg ctaatggtga ttctgaagtt gttcttaaaa agttgaagaa gtctttgaat 12300
gtggctaaat ctgaatttga ccgtgatgca gccatgcaac gtaagttgga aaagatggct 12360
gatcaagcta tgacccaaat gtataaacag gctagatctg aggacaagag ggcaaaagtt 12420
actagtgcta tgcagacaat gcttttcact atgcttagaa agttggataa tgatgcactc 12480
aacaacatta tcaacaatgc aagagatggt tgtgttccct tgaacataat acctcttaca 12540
acagcagcca aactaatggt tgtcatacca gactacaaca catataagaa tacgtgtgat 12600
ggtacaacat ttacttatgc atcagcattg tgggaaatcc aacaggttgt agatgcagat 12660
agtaaaattg ttcagcttag tgaaattagt atggacaatt cacctaattt agcatggcct 12720
cttattgtaa cagctttaag ggccaattct gctgtcaaat tacagaataa tgagcttagt 12780
cctgttgcac taagacaaat gtcttgtgct gccggtacta cacaaactgc ttgcactgat 12840
gacaatgcgt tagcttacta caacacaaca aagggaggta ggtttgtact tgcactgtta 12900
tccgatttac aggatttgaa atgggctaga ttccctaaga gtgatggaac tggtactatc 12960
tatacagaac tggaaccacc ttgtaggttt gttacagaca cacctaaagg tcctaaagtg 13020
aagtatcttt acttcatcaa aggattaaac aacctaaata gaggtatggt acttggtagt 13080
ttagctgcca cagtacgttt acaagctggt aatgcaacag aagttcctgc taattcaact 13140
gtactttctt tctgtgcttt tgctgtagat gctgctaaag cttacaaaga ttatctagct 13200
agtgggggac aaccaatcac taattgtgtt aagatgttgt gtacacacac tggtactggt 13260
caggcaataa cagttacacc ggaagccaat atggatcaag aatcctttgg tggtgcatcg 13320
tgttgtctgt actgccgttg tcatatagat catccaaatc ctaaaggatt ttgtgactta 13380
aaaggtaagt atgtacaaat acctacaact tgtgctaatg accctgtggg ttttacactt 13440
aaaaacacag tctgtaccgt ctgcggtatg tggaaaggtt atggttgtag ttgtgatcaa 13500
ctccgcgaac ccatgcttca gtcagctgat gcacaatcgt ttttaaacgg gtttgcggtg 13560
taagtgcagc ccgtcttaca ccgtgcggca caggcactag tactgatgtc gtatatagag 13620
cttttgacat ctacaatgat aaagtagctg gttttgctaa gttcctaaaa actaattgtt 13680
gtcgcttcca agaaaaggac gaagatgaca atctcattga ttcttacttt gtagttaaga 13740
gacacacttt ctctaactac caacatgaag aaacaattta caacctgctt aaggattgtc 13800
cagctgttgc taaacatgac ttctttaagt ttagaataga cggtgacatg gtaccacata 13860
tatcacgtca acgtcttact aaatacacaa tggcagacct cgtctatgct ttaaggcatt 13920
ttgatgaagg taattgtgac acattaaaag aaatacttgt cacatacaat tgttgtgatg 13980
atgactactt caataaaaag gactggtatg attttgtaga aaacccagat atattacgcg 14040
tatacgccaa cttaggtgaa cgtgtacgcc aagctttgtt aaaaacagta cagttctgtg 14100
atgccatgcg aaatgctggt attgttggtg tactgacatt agataatcaa gatctcaatg 14160
gtaactggta tgactttggt gatttcatac aaaccacgcc aggtagtgga gttcctgttg 14220
tagactctta ttattcattg ctcatgccta tattaacctt gaccagggct ttaactgcag 14280
agtcacatgt tgacactgac ttaacaaagc cttacattaa gtgggatttg ttaaaatacg 14340
acttcacgga agagaggtta aaactctttg accgttattt taaatactgg gatcagacat 14400
accacccaaa ttgtgttaac tgtttggatg acagatgcat tctgcattgt gcaaacttta 14460
atgttctgtt ctctacagtg ttcccaccta caagttttgg accactagtg agaaaaatat 14520
ttgttgatgg tgttccattt gtagtttcaa ctggatacca cttcagagag ctaggtgttg 14580
tacataatca ggatgtaaac ttacatagct ctagacttag ttttaaggaa ttacttgtgt 14640
atgctgctga tcctgctatg catgctgctt ctggtaatct attactagat aaacgcacta 14700
cgtgcttttc agtagctgca cttactaaca atgttgcttt tcaaactgtc aaacccggta 14760
attttaacaa ggacttctat gactttgctg tgtctaaggg tttctttaag gaaggaagtt 14820
ctgttgaatt aaaacacttc ttctttgctc aggatggtaa tgctgctatc agcgattatg 14880
actactatcg ttataatcta ccaacaatgt gtgatatcag acaactacta tttgtagttg 14940
aagttgttga taagtacttt gattgttacg atggtggctg tattaatgct aaccaagtca 15000
tcgtcaacaa cctagacaaa tcagctggtt ttccatttaa taaatggggt aaggctagac 15060
tttattatga ttccatgagt tatgaggatc aagatgcact tttcgcatat acaaaacgta 15120
atgtcatccc tactataact caaatgaacc ttaagtatgc cattagtgca aagaatagag 15180
ctcgcaccgt agctggtgtc tctatctgta gtactatgac caatagacag tttcatcaaa 15240
aattactcaa gtcaatagcc gccactagag gagctactgt agtaattgga acaagcaaat 15300
tctatggtgg ttggcacaac atgctcaaaa ctgtttatag tgatgtagaa aaccctcacc 15360
ttatgggttg ggattatcct aaatgtgata gagccatgcc taacatgctt agaattatgg 15420
cctcacttgt tcttgctcgc aaacatacaa cgtgttgtag cttgtcacac cgtttctata 15480
gattagctaa tgagtgtgct caagtattga gtgaaatggt catgtgtggc ggttcactat 15540
atgttaaacc aggtggaacc tcatcaggag atgccacaac tgcttatgct aatagtgtgt 15600
ttaacatttg tcaagctgtc acggccaatg ttaatgcact tttatctact gatggtaaca 15660
aaattgccga taagtatgtc cgcaatttac aacacagact ttatgagtgt ctctatagaa 15720
atagagatgt tgacacagac tttgtgaatg agttttacgc atatttgcgt aaacatttct 15780
caatgatgat actctctgac gatgctgttg tgtgtttcaa tagcacttat gcatctcaag 15840
gtctagtggc tagcataaag aactttaagt cagttcttta ctatcaaaac aacgttttta 15900
tgtctgaagc aaaatgttgg actgagactg accttactaa aggacctcat gaattttgct 15960
ctcaacatac aatgctagtt aaacagggtg atgattatgt gtaccttcct tacccagatc 16020
catcaagaat cctaggtgcc ggttgttttg tagatgatat cgtaaaaaca gatggtacac 16080
ttatgattga acggttcgtg tctttagcta tagatgctta cccacttact aaacatccta 16140
atcaggagta tgctgatgtc tttcatttgt acttacaata catacgtaag ctacatgatg 16200
agttaacagg acacatgtta gacatgtatt ctgttatgct tactaatgat aacacttcaa 16260
ggtattggga acctgagttt tatgaggcta tgtacacacc gcatacagtc ttacaagctg 16320
ttggtgcttg tgttctttgc aattcacaga cttcattaag atgtggtgct tgcatacgta 16380
gaccattctt atgttgtaaa tgctgttacg accatgtcat ctcaacatca cataaattag 16440
tcttgtctgt taatccgtat gtttgcaatg ctccaggttg tgatgtcaca gatgtgactc 16500
aactttactt aggaggtatg agctattact gtaagtcaca taaaccaccc attagttttc 16560
cattgtgtgc taatggacaa gtttttggtc tctacaagaa tacatgtgtt ggtagcgata 16620
atgttactga ctttaatgca attgcaacat gtgactggac aaatgctggt gattacattt 16680
tagctaacac ctgtactgaa agactcaagc tttttgcagc agaaacgctc aaagctactg 16740
aggagacatt taaactgtct tatggtattg ctactgtacg tgaagtgctg tctgacagag 16800
aattacatct ttcatgggaa gttggtaaac ctagaccacc acttaaccga aattatgtct 16860
ttactggtta tcgtgtaact aaaaacagta aagtgcaaat cggagagtac acctttgaaa 16920
aaggtgacta tggtgatgct gttgtttacc gaggtacaac aacttacaaa ctcaacgttg 16980
gtgattattt tgtgctgaca tcacatacag taatgccatt aagtgcacct acactagtgc 17040
cacaagagca ctatgttaga attactggct tatacccaac actcaatatc tcagatgagt 17100
tttctagcaa tgttgcaaat tatcaaaagg ttggtatgca aaagtattct acactccagg 17160
gaccacctgg tactggtaaa agtcattttg ctattggtct agctctctac tacccttctg 17220
ctcgcatagt atatacagct tgctctcatg cagctgttga tgcactatgt gagaaggcat 17280
taaaatattt gcccatagac aaatgtagta gaattatacc tgcacgtgct cgtgtagagt 17340
gttttgataa attcaaggtg aattcaacat tagaacagta tgtcttttgt actgtaaatg 17400
cattgcctga gacgacagca gatatagttg tctttgatga aatttcaatg gccacaaatt 17460
atgatttgag tgttgtcaat gccagattac gtgctaagca ctatgtgtac attggtgatc 17520
ctgctcaatt acctgcacca cgcacattac taactaaggg tacactagaa ccagaatatt 17580
tcaattcagt gtgtagactt atgaaaacta taggtccaga catgttcctc ggaacttgtc 17640
gtagatgtcc tgctgaaatt gttgacactg tgagtgcttt ggtttatgat aataagctta 17700
aggcacataa agacaaatca gctcaatgct ttaaaatgtt ctacaagggt gttatcacgc 17760
atgatgtttc atctgcaatt aacaggccac aaataggcgt ggtaagagaa ttccttacac 17820
gtaaccctgc ttggagaaaa gctgtcttta tttcacctta caattcccag aatgctgtag 17880
cctcaaagat tttgggacta ccaactcaaa ctgttgattc atcacagggc tcagaatatg 17940
actatgtcat attcactcaa accactgaaa cagctcactc ttgtaatgta aacagattca 18000
acgttgctat taccagagca aaagtaggca tactttgcat aatgtctgat agagaccttt 18060
atgacaagtt gcaatttaca agtcttgaaa ttccacgtag gaatgtggca actttacaag 18120
ctgaaaatgt aacaggactc tttaaagatt gtagtaaggt aatcactggg ttacatccta 18180
cacaggcacc tacacactta agtgttgata ctaaattcaa aactgaaggt ttatgtgttg 18240
acatacctgg catacctaag gacatgacct atagaagatt aatctctatg atgggtttca 18300
aaatgaatta ccaggttaat ggttacccta acatgtttat cacccgcgaa gaagctataa 18360
gacatgtacg tgcatggatt ggcttcgatg tcgaaggttg tcatgctact agagaagctg 18420
ttggtaccaa tttaccttta cagctaggtt tttctacagg tgttaaccta gttgctgtac 18480
ctacaggtta tgttgataca cctaataata cagatttttc cagagttagt gctaaaccac 18540
cgcctggaga tcaatttaaa cacctcatac cacttatgta caaaggactt ccttggaatg 18600
tagtgcgtat aaagattgtc caaatgttaa gtgacacact taaaaatctc tctgacagag 18660
tcgtatttgt cttatgggca catggctttg agttgacatc tatgaagtat tttgtgaaga 18720
tcggacctga gcgcacatgt tgtctatgtg atagacgtgc tacatgcttt tccactgctt 18780
cagacactta tgcctgttgg catcattcta ttggatttga ttacgtctat aatccgttta 18840
tgattgatgt tcaacaatgg ggttttacag gtaacctaca aagcaaccat gatctgtatt 18900
gtcaagtcca tggtaatgca catgtagcta gttgtgatgc aatcatgact aggtgtctag 18960
ctgtccacga gtgctttgtt aagcgtgttg actggactat tgaatatcct ataatcggtg 19020
atgaactgaa gattaatgcg gcttgtagaa aggttcaaca catggttgtt aaagctgcat 19080
tattagcaga caaattccca gttcttcacg acattggtaa ccctaaagct attaagtgtg 19140
tacctcaagc tgatgtagaa tggaagttct atgatgcaca gccttgtagt gacaaagctt 19200
acaaaataga agaactgttc tattcttatg ccacacattc tgacaaattc acagatggtg 19260
tatgcctatt ttggaattgc aatgtcgata gatatcctgc taattccatt gtttgtagat 19320
ttgacactag agtgctatct aaccttaact tgcctggttg tgatggtggc agtttgtatg 19380
taaataagca tgcattccac acaccagctt ttgataaaag tgcttttgtt aatctaaagc 19440
aacttccatt tttctattac tctgacagtc catgtgagtc tcatggaaaa caagtagtgt 19500
cagatataga ttatgtacca ctaaagtctg ctacgtgtat aacacgttgc aatttaggtg 19560
gtgctgtctg tagacatcat gctaatgagt acagattgta tctcgatgct tataacatga 19620
tgatctcagc tggctttagc ttgtgggttt acaaacaatt tgatacctat aacctctgga 19680
acacttttac aagacttcag agtttagaaa atgtggcttt taatgttgta aataagggac 19740
actttgatgg acaacagggt gaagtaccag tttctatcat taacaacact gtttacacaa 19800
aagttgatgg tgttgatgta gaattgtttg agaacaaaac cacattacct gttaatgtag 19860
catttgagct ttgggctaag cgcaacatta aaccagtacc agaggtgaaa atactcaata 19920
atttgggtgt ggacattgct gctaatactg tgatctggga ctacaaaaga gatgctccag 19980
cacatatatc tactattggt gtttgttcta tgactgacat agccaagaaa ccaactgaaa 20040
cgatttgtgc accactcact gtcttttttg atggtagagt tgatggtcaa gtagacttat 20100
ttagaaatgc ccgtaatggt gttcttatta cagaaggtag tgttaaaggt ttacaaccat 20160
ctgtaggtcc caaacaagct agtcttaatg gagtcacatt aattggagaa gccgtaaaaa 20220
cacagttcaa ttattacaag aaagtggatg gtgttgtcca acaattacct gaaacttact 20280
ttactcagag tagaaactta caggaattta agcccaggag tcaaatggaa attgatttct 20340
tagaacttgc tatggatgaa ttcattgaac ggtataaatt agaaggctat gccttcgaac 20400
atatcgttta tggagatttt agtcatagtc agttaggtgg tttacatcta ctgattggac 20460
tagctaaacg ttttaaggaa tcaccttttg aacttgaaga ttttattcct atggacagta 20520
cagttaaaaa ctacttcata acagatgcgc aaacaggttc atctaagtgt gtgtgttctg 20580
ttattgatct tttacttgat gacttcgttg aaataataaa gtcccaagat ttatctgtag 20640
tttctaaggt tgtcaaagtg actattgact atacagaaat ctcatttatg ctttggtgta 20700
aagatggcca tgtagaaaca ttttacccaa aattacaatc tagtcaagcg tggcaaccgg 20760
gtgttgctat gcctaatctt tacaaaatgc aaagaatgct attagaaaag tgtgaccttc 20820
aaaattatgg tgatagtgca acattaccta aaggcataat gatgaatgtc gcaaaatata 20880
ctcaactgtg tcaatattta aacacactga cattagctgt accctataat atgagagtta 20940
tccattttgg tgctggttct gataaaggag ttgcaccagg tacagctgtt ttaagacaat 21000
ggttgcctac aggtacgctg cttgtcgatt cagatcttaa tgactttgtc tctgatgcag 21060
attcaacttt gattggtgat tgtgcaactg tacatacagc taataaatgg gatctcatta 21120
ttagtgatat gtacgaccct aagactaaga atgtcacaaa agaaaacgac tctaaagagg 21180
gttttttcac ttacatttgt gggtttatac aacaaaagct agctcttgga ggttccgtgg 21240
ctataaagat aacagaacat tcttggaatg ctgatcttta taagctcatg ggacacttcg 21300
catggtggac agcctttgtt actaatgtga atgcgtcatc atctgaagca tttttaatcg 21360
gatgtaacta ccttggcaaa ccacgcgaac aaatagatgg ttatgtcatg catgcaaatt 21420
acatattttg gaggaataca aatccaattc agctttcttc ttattcttta ttcgacatga 21480
gtaaattccc ccttaaatta aggggtactg ctgttatgtc tttaaaagaa ggtcaaatca 21540
atgatatgat tctctctctt cttagtaaag gtagacttat aattagagaa aacaacagag 21600
ttgttatttc tagtgatgtt cttgttaaca actaaacgaa caatgtttgt ttttcttgtt 21660
ttattgccac tagtctctag tcagtgtgtt aatcttacaa ccagaactca attaccccct 21720
gcatacacta attctttcac acgtggtgtt tattaccctg acaaagtttt cagatcctca 21780
gttttacatt caactcagga cttgttctta cctttctttt ccaatgttac ttggttccat 21840
gctatacatg tctctgggac caatggtact aagaggtttg ataaccctgt cctaccattt 21900
aatgatggtg tttactttgc ttccactgag aagtctaaca taataagagg ctggattttt 21960
ggtactactt tagattcgaa aacccagtcc ctacttattg ttaataacgc tactaatgtt 22020
gttatcaaag tctgtgaatt tcaattttgt aacgatccat ttttgggtgt ttattaccac 22080
aaaaacaaca aaagttggat ggaaagtgag ttcagagttt attctagtgc gaataattgc 22140
acttttgaat acgtctctca gccttttctt atggaccttg aaggaaaaca gggtaatttc 22200
aaaaatctta gggaatttgt gttcaagaat attgatggtt acttcaagat atactctaag 22260
cacacgccta ttaatttagt gcgtgatctc cctcagggtt tttcggcttt agaaccattg 22320
gtagatttgc caataggtat taacatcact aggtttcaaa ctttacttgc tttacataga 22380
agttatttaa ctcctggtga ttcttcttca ggttggacag ctggtgctgc agcttattat 22440
gtgggttatc ttcaacctag gacttttcta ctgaagtaca atgaaaatgg aaccattaca 22500
gatgctgtag actgtgcact tgaccctctc tcagaaacaa agtgtacgtt gaaatccttc 22560
actgtagaaa aaggaatcta tcaaacttct aactttagag tccaaccaac agaatctatt 22620
gttagatttc ctaacatcac aaacttgtgc ccttttggtg aagtttttaa cgccaccaga 22680
tttgcatctg tttatgcttg gaacaggaag agaatcagca actgtgttgc tgattattct 22740
gtcctgtata attccgcatc attttccact tttaagtgtt atggagtgtc tcctactaaa 22800
ttaaatgatc tctgctttac taatgtctat gcagattcat ttgtaattag aggtgatgaa 22860
gtcagacaaa tcgctccagg gcaaactgga aagattgctg attataacta caaattacca 22920
gatgatttta caggctgcgt tatagcttgg aattctaaca atcttgattc taaggttggt 22980
ggtaattata attacctgta cagattgttt aggaagtcta atctcaaacc ttttgagaga 23040
gatatttcaa ctgaaatcta tcaggccggt agcacacctt gtaatggtgt tgaaggtttt 23100
aattgttact ttcctctgca atcatatggt ttccaaccca ctaatggtgt tggttaccaa 23160
ccatacagag tagtagtact ttcttttgaa cttctacatg caccagcaac tgtttgtgga 23220
cctaaaaagt ctactaattt ggttaagaac aagtgtgtca atttcaactt caatggttta 23280
acaggcacag gtgttcttac tgagtctaac aaaaagtttc tgcctttcca acaatttggc 23340
agagacattg ctgacactac tgatgctgtt cgtgatccac aaacacttga gattcttgac 23400
attacaccat gttcttttgg tggtgtcagt gttataacac caggaacaaa tacttctaac 23460
caggttgctg ttctttatca ggatgttaac tgcacagaag tccctgttgc tattcatgca 23520
gatcaactta ctcctacttg gcgtgtttat tctacaggtt ctaatgtttt tcaaacacgt 23580
gcaggctgtt taataggggc tgaacatgtc aacaactcat atgagtgtga catacccatt 23640
ggtgcaggta tatgcgctag ttatcagact cagactaatt ctcctcggag agcaagaagt 23700
gtagctagtc aatccatcat tgcctacact atgtcacttg gtgcagaaaa ttcagttgct 23760
tactctaata actctattgc catacccaca aattttacta ttagcgttac cacagaaatt 23820
ctaccagtgt ctatgaccaa gacatcagta gattgtacaa tgtacatttg tggtgattca 23880
actgaatgca gcaatctttt gttgcaatat ggcagttttt gtacacaatt aaaccgtgct 23940
ttaactggaa tagctgttga acaagacaaa aacacccaag aagtttttgc acaagtcaaa 24000
caaatttaca agacaccacc aattaaagat tttggcggtt ttaattttag ccagatactg 24060
ccagatccat caaaaccaag caagaggtca tttattgaag atctactgtt caacaaagtg 24120
acacttgcag atgctggctt catcaaacaa tatggtgatt gccttggtga tattgctgct 24180
agagacctca tttgtgcaca aaagtttaac ggccttactg ttttgccacc tttgctcaca 24240
gatgaaatga ttgctcaata cacttctgca ctgttagcag gtacaatcac ttctggttgg 24300
acttttggtg caggtgctgc attacaaata ccatttgcta tgcaaatggc ttataggttt 24360
aatggtattg gagttacaca gaatgttctc tatgagaacc aaaaattgat tgccaaccaa 24420
tttaatagtg ctattggcaa aattcaagac tcactttctt ccacagcaag tgcacttgga 24480
aaacttcaag atgtggtcaa ccaaaatgca caagctttaa acacgcttgt taaacaactt 24540
agctccaatt ttggtgcaat ttcaagtgtt ttaaacgaca tcctttcacg tcttgacaaa 24600
gttgaggctg aagtgcaaat tgataggttg atcacaggca gacttcaaag tttgcagaca 24660
tatgtgactc aacaattaat tagagctgca gaaatcagag cttctgctaa tcttgctgct 24720
actaaaatgt cagagtgtgt acttggacaa tcaaaaagag ttgacttttg cggaaagggc 24780
tatcatctta tgtcatttcc tcagtcagca cctcatggtg tcgtcttttt gcatgtgact 24840
tatgtccctg cacaagaaaa gaacttcaca actgctcctg ccatttgtca tgatggaaaa 24900
gcacactttc ctcgtgaagg tgtctttgtt tcaaatggca cacactggtt tgtaacacaa 24960
aggaattttt atgaaccaca aatcattact acagacaaca catttgtgtc tggtaactgt 25020
gatgttgtaa taggaattgt caacaacaca gtttatgatc ctttgcaacc tgaattagac 25080
tcattcaagg aggagcttga taaatacttc aagaaccata cctcaccaga tgttgattta 25140
ggtgacatct ctggcattaa tgcttcagtt gtaaacattc agaaagaaat cgaccgcctc 25200
aatgaggttg ccaagaattt aaatgaatct ctcatcgatc tccaagaact tggaaagtat 25260
gagcagtata taaaatggcc atggtacatt tggctaggtt ttatagctgg cttgattgcc 25320
atagtaatgg tgacaattat gctttgctgt atgaccagtt gctgtagttg tctcaagggc 25380
tgttgttctt gtggatcctg ctgcaaattt gacgaggacg actctgagcc agtgctcaaa 25440
ggagtcaaat tacattacac ataaacgaac ttatggattt gtttatgaga atcttcacaa 25500
ttggaactgt aactttgaag caaggtgaaa tcaaggatgc tactccttca gattttgtta 25560
gagctactgc aacgataccg atacaagcat cacttccttt cggatggctt attgttggcg 25620
ttgcacttct tgctgttttt cagagcgctt ccaaaatcat aaccctcaaa aagagatggc 25680
aactagcact ctccaagggt gttcactttg tttgcaactt gctgttgttg tttgtaacag 25740
tttactcaca tcttttgctt gttgctgctg gccttgaagc cccttttctc tatctttatg 25800
ctttagtcta cttcttgcag agtataaact ttgtacgcat aataatgagg ctttggcttt 25860
gctggaaatg ccgttccaaa aacccattac tttatgatgc caactatttt ctttgctggc 25920
atactaattg ttacgactat tgtatacctt acaatagtgt aacttcttca attgtcatta 25980
cttcaggtga tggcacaaca agtcctattt ctgaacatga ctaccagatt ggtggttata 26040
ctgaaaaatg ggaatctgga gtaaaagact gtgttgtatt acacagttac ttcacttcag 26100
actattacca gctgtactca actcaattga gtacagacac tggtgttgaa catgttacct 26160
tcttcatcta caataaaatc gttgatgagc ctgaagaaca tgtccaaatt cacacaatcg 26220
acgtttcatc cggagttgtt aatccagtaa tggaaccaat ttatgatgaa ccgacgacga 26280
ctactagcgt gcctttgtaa gcacaagctg atgagtacga acttatggca gattccaacg 26340
gtactattac cgttgaggag ctgaaaaagc tccttgaaca atggaaccta gtaataggtt 26400
tcctattcct tacatggatt tgcctgctgc aatttgccta tgccaacagg aataggtttt 26460
tgtacatcat taagttgatt ttcctctggc tgttatggcc agtaacttta gcttgttttg 26520
tgcttgctgc tgtttacaga ataaattgga tcaccggtgg aattgctatt gcaatggctt 26580
gtcttgtagg attgatgtgg ctaagctact tcattgcttc tttcagactg tttgcgcgta 26640
cgcgttccat gtggtcattc aatccagaaa ctaacattct tctcaacgtg ccactccatg 26700
gaactattct gactagaccg cttctagaaa gtgaactcgt aatcggagct gttatccttc 26760
gtggacatct tcgtattgct ggacatcatc taggacgctg tgacatcaag gatctaccta 26820
aagaaatcac tgttgctaca tcacgaacgc tttcttatta caaattggga gcttcacagc 26880
gtgtagcagg tgattcaggt tttgctgcat atagtcgcta caggattggc aactataaat 26940
taaacacaga ccattccagt agcagtgaca atattgcttt gcttgtacag taaacgaaca 27000
tgaaaattat tcttttcttg gcactgataa cactcgctac ttgtgagctt tatcactacc 27060
aagagtgtgt tagaggtaca acagtacttt taaaagaacc ttgctcgtcg ggaacatacg 27120
agggcaattc accatttcat cctctagctg ataacaaatt tgcactgact tgctttagca 27180
ctcaatttgc ttttgcttgt cctgacggcg taaaacacgt ctatcagtta cgtgccagat 27240
cagtttcacc taaactgttc atcagacaag aggaagttca agaactttac tctccaattt 27300
ttcttattgt tgcggcaata gtgtttataa cactttgctt cacactcaaa agaaagacag 27360
aatgattgaa ctttcattaa ttgacttcta tttgtgcttt ttagcctttc tgctattcct 27420
tgttttaatt atgcttatta tcttttggtt ctcacttgaa ctgcaagatc ataatgaaac 27480
ttgtcacgcc taaacgaaca tgaaatttct tgttttctta ggaatcatca caactgtagc 27540
tgcatttcac caagaatgta gtttacagtc atgtactcaa catcaaccat atgtagttga 27600
tgacccgtgt cctattcact tctattctaa atggtatatc agagtaggag ctagaaaatc 27660
agcaccttta attgaattgt gcgtggatga ggctggttct aaatcaccca ttcagtacat 27720
cgatatcggt aattatacag tttcctgttt accttttaca attaactgcc aggaacctaa 27780
attgggtagt cttgtagtgc gttgttcgtt ctacgaggac tttttagagt atcatgacgt 27840
tcgtgttgtt ttagatttca tctaaacgaa caaactaaaa tgtctgataa tggacctcaa 27900
aatcagcgaa atgcacctcg cattacgttt ggtggaccat cagattcaac tggcagtaac 27960
cagaatggag aacgaagtgg tgcgcgatca aaacaacgcc gcccgcaagg tttacccaat 28020
aatactgcgt cttggttcac cgctctcact caacatggca aggaagattt aaaattccct 28080
cgaggacaag gcgttccaat taacaccaat agcagtccag atgaccaaat tggctactac 28140
cgccgcgcca caagacgaat tcgtggtggt gatggtaaaa tgaaagatct cagtccaaga 28200
tggtatttct actatctagg aactgggcca gaagctggac ttccttatgg tgctaacaaa 28260
gatggcatca tatgggttgc aactgaggga gccttgaata caccaaaaga tcacattggc 28320
accagaaatc ctgctaacaa tgctgcaatc gtgctacaac ttcctcaagg aacaacatta 28380
ccaaaaggtt tttacgcaga agggtctaga ggtggaagtc aagcctcttc tagatcatca 28440
tcacgtagtc gcaacagttc aagaaattca actccaggtt caagtagagg aacttctcct 28500
gctagaatgg ctggaaatgg aggtgatgct gctcttgctt tgttactact tgacagattg 28560
aaccagcttg agagcaaaat gtctggtaaa ggccaacaac aacaaggcca aactgtcact 28620
aagaaatctg ctgctgaggc ttctaagaag cctagacaaa aacgtactgc cactaaagca 28680
tacaatgtaa cacaagcttt cggcagacgt ggtccagaac aaactcaagg aaattttggg 28740
gatcaggaac taatcagaca aggaactgat tacaaacatt ggccgcaaat tgcacaattt 28800
gctccttctg cttcagcgtt ctttggaatg tcgagaattg gaatggaagt cacaccttcg 28860
ggaacatggt tgacctatac aggtgccatc aaattggatg acaaagatcc aaatttcaaa 28920
gatcaagtca ttttgctgaa taagcatatt gacgcataca aaacattccc accaacagag 28980
cctaaaaagg acaaaaagaa gaaggctgat gaaactcaag ccttaccgca gagacagaag 29040
aaacagcaaa ctgtgactct tcttcctgct gcagatttgg atgatttctc caaacaattg 29100
caacaatcca tgagcagtgc tgactcaact caggcctaaa ctcatgcaga ccacacaagg 29160
cagatgggct atataaacgt tttcgctttt ccgtttacga tatatagtct actcttgtgc 29220
agaatgaatt ctcgtaacta catagcacaa gtagatgtag ttaactttaa tctcacatag 29280
caatctttaa tcagtgtgta acattaggga ggacttgaaa gagccaccac attttcaccg 29340
aggccacgcg gagtacgatc gagtgtacag tgaacaatgc tagggagagc tgcctatatg 29400
gaagagccct aatgtgtaaa attaatttta gtagtgctat ccccatgtga ttttaatagc 29460
ttcttaggag aatgacaaaa aaaaacaaaa aaaa 29494
<210> 42
<211> 29348
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthesis optimized sequence E-protein and ORF8 double deletion
<400> 42
caaatataac gaaaggctca gtcgaaagac tgggcctttc gttttatctg ttgtttgtcg 60
taatacgact cactataggg attaaaggtt tataccttcc caggtaacaa accaaccaac 120
tttcgatctc ttgtagatct gttctctaaa cgaactttaa aatctgtgtg gctgtcactc 180
ggctgcatgc ttagtgcact cacgcagtat aattaataac taattactgt cgttgacagg 240
acacgagtaa ctcgtctatc ttctgcaggc tgcttacggt ttcgtccgtg ttgcagccga 300
tcatcagcac atctaggttt cgtccgggtg tgaccgaaag gtaagatgga gagccttgtc 360
cctggtttca acgagaaaac acacgtccaa ctcagtttgc ctgttttaca ggttcgcgac 420
gtgttagtac gtggttttgg agattcagtg gaagaagtct tatcagaggc acgtcaacat 480
cttaaagatg gcacttgtgg cttagtagaa gttgaaaaag gcgttttgcc tcaacttgaa 540
cagccctatg tgttcatcaa acgttctgat gctagaactg cacctcatgg tcatgttatg 600
gttgagctgg tagcagaatt agaaggtatt cagtacggtc gtagtggtga gacattaggt 660
gttttagttc ctcatgtggg cgaaatacca gtggcttacc gcaaagttct tcttagaaag 720
aacggtaata aaggagctgg tggccatagt tacggcgctg atttaaagtc atttgactta 780
ggcgacgagc ttggcactga tccttatgaa gatttccaag aaaactggaa cactaaacat 840
agcagtggtg ttacccgtga actcatgcgt gagttaaatg gaggtgcata cactcgctat 900
gtcgataaca acttctgtgg acctgatggt taccctcttg agtgcattaa agaccttcta 960
gcacgtgctg gtaaagcttc atgcactttg tccgaacaac tggactttat tgacactaag 1020
aggggtgtat actgctgccg tgaacatgag catgaaattg cttggtacac ggaacgttct 1080
gaaaagagct atgaattgca gacacctttt gaaattaaac tggcaaagaa atttgacacc 1140
ttcaatgggg aatgtccaaa ttttgtattt cccctcaatt ccataatcaa gactattcaa 1200
ccaagggttg aaaagaaaaa gcttgatggc tttatgggta gaattcgatc tgtctatcca 1260
gttgcgtcac caaatgaatg caaccaaatg tgcctttcaa ctctcatgaa gtgtgatcat 1320
tgtggtgaaa cttcatggca gacgggcgat tttgttaaag ccacttgcga attttgtggc 1380
actgagaatt tgactaaaga aggtgccact acttgtggtt acttacccca aaatgctgtt 1440
gttaaaattt actgtccagc atgtcacaat tcagaagtag gacctgagca tagtcttgcc 1500
gaataccata atgaatctgg cttgaaaacc attcttcgta agggtggtcg cactattgct 1560
tttggaggat gtgtgttctc ttatgttggt tgccataaca agtgtgctta ttgggttcca 1620
cgtgcttcag ctaacatagg ttgtaaccat acaggtgttg ttggagaagg ttccgaaggt 1680
cttaatgaca accttcttga aatactccaa aaagagaaag tcaacatcaa tattgttggt 1740
gactttaaac ttaatgaaga gatcgccatt attttggcat ctttttctgc ttccacaagt 1800
gcttttgtgg aaactgtgaa aggtttggat tataaagcat tcaaacagat tgttgaatcc 1860
tgtggtaatt ttaaggttac aaagggaaaa gctaaaaaag gtgcctggaa tattggtgaa 1920
cagaaatcaa tactgagtcc tctttatgca tttgcatcag aggctgctcg tgttgtacga 1980
tcaattttct cccgcactct tgaaactgct caaaattctg tgcgtgtttt acagaaggcc 2040
gctataacaa tactagatgg aatttcacag tattcactga gactcattga tgctatgatg 2100
ttcacatctg atttggctac taacaatcta gttgtaatgg cctacattac aggtggtgtt 2160
gttcagttga cttcgcagtg gctaactaac atctttggca ctgtttatga aaaactcaaa 2220
cccgtccttg attggcttga agagaagttt aaggaaggtg tagagtttct tagagacggt 2280
tgggagattg ttaaattcat ctcaacctgt gcttgtgaaa ttgtcggtgg acaaattgtc 2340
acctgtgcta aggaaattaa ggagagtgtt cagacattct ttaagcttgt aaacaagttt 2400
ttggctttgt gtgctgactc tatcattatt ggtggagcta aacttaaagc cttgaattta 2460
ggtgaaacat ttgtcacgca ctcaaaggga ttgtacagaa agtgtgttaa atccagagaa 2520
gaaactggcc tactcatgcc tctaaaagcc ccaaaagaaa ttatcttctt agagggagaa 2580
acacttccca cagaagtgtt aacagaggaa gttgtcttga aaactggtga tttacaacca 2640
ttagaacaac ctactagtga agctgttgaa gctccattgg ttggtacacc agtttgtatt 2700
aacgggctta tgttgctcga aatcaaagac acagaaaagt actgtgccct tgcacctaat 2760
atgatggtaa caaacaatac cttcacactc aaaggcggtg caccaacaaa ggttactttt 2820
ggtgatgaca ctgtgataga agtgcaaggt tacaagagtg tgaatatcac ttttgaactt 2880
gatgaaagga ttgataaagt acttaatgag aagtgctctg cctatacagt tgaactcggt 2940
acagaagtaa atgagttcgc ctgtgttgtg gcagatgctg tcataaaaac tttgcaacca 3000
gtatctgaat tacttacacc actgggcatt gatttagatg agtggagtat ggctacatac 3060
tacttatttg atgagtctgg tgagtttaaa ttggcttcac atatgtattg ttctttctac 3120
cctccagatg aggatgaaga agaaggtgat tgtgaagaag aagagtttga gccatcaact 3180
caatatgagt atggtactga agatgattac caaggtaaac ctttggaatt tggtgccact 3240
tctgctgctt tacaacctga agaagaacaa gaagaagatt ggttagatga tgatagtcaa 3300
caaactgttg gtcaacaaga cggcagtgag gacaatcaga caactactat tcaaacaatt 3360
gttgaggttc aacctcaatt agagatggaa cttacaccag ttgttcagac tattgaagtg 3420
aatagtttta gtggttatct taaacttact gacaatgtat acatcaagaa tgcagacatt 3480
gtggaagaag ctaaaaaggt aaaaccaaca gtggttgtta atgcagccaa tgtttacctt 3540
aaacatggag gaggtgttgc aggagcctta aataaggcta ctaacaatgc catgcaagtt 3600
gaatctgatg attacatagc tactaatgga ccacttaaag tgggtggtag ttgtgtttta 3660
agcggacaca atcttgctaa acactgttta catgttgtcg gcccaaatgt taacaaaggt 3720
gaagatattc aacttcttaa gagtgcttat gaaaatttta accagcacga agttctactt 3780
gcaccattat tatcagctgg tatttttggt gctgacccta tacattcttt aagagtttgt 3840
gtagatactg ttcgcacaaa tgtctactta gctgtctttg ataaaaatct ctatgacaaa 3900
cttgtttcaa gctttttgga aatgaagagt gaaaagcaag ttgaacaaaa gatcgctgag 3960
attcctaaag aggaagttaa gccatttata actgaaagta aaccttcagt tgaacagaga 4020
aaacaagatg ataagaagat caaagcttgt gttgaagaag ttacaacaac tctggaagaa 4080
actaagttcc tcacagaaaa cttgctcctt tatatcgaca ttaatggcaa tcttcatcca 4140
gattctgcca ctcttgttag tgacattgac atcactttct taaagaaaga tgctccatat 4200
atagtgggtg atgttgttca agagggtgtt ttaactgctg tggttatacc tactaaaaag 4260
gctggtggca ctactgaaat gctagcgaaa gctttgagaa aagtgccaac agacaattat 4320
ataaccactt acccgggtca gggtttaaat ggttacactg tagaggaggc aaagacagtg 4380
cttaaaaagt gtaaaagtgc cttttacatt ctaccatcta ttatctctaa tgagaagcaa 4440
gaaattcttg gaactgtttc ttggaatttg cgagaaatgc ttgcacatgc agaagaaaca 4500
cgcaaattaa tgcctgtctg tgtggaaact aaagccatag tttcaactat acagcgtaaa 4560
tataagggta tcaagataca agagggtgtg gttgattatg gtgctagatt ttacttttac 4620
accagtaaaa caactgtagc gtcacttatc aacacactta acgatctaaa tgaaactctt 4680
gttacaatgc cacttggcta tgtaacacat ggcttaaatt tggaagaagc tgctcggtat 4740
atgagatctc tcaaagtgcc agctacagtt tctgtttctt cacctgatgc tgttacagcg 4800
tataatggtt atcttacttc ttcttctaaa acacctgaag aacattttat tgaaaccatc 4860
tcacttgctg gttcctataa agattggtcc tattctggac aatctacaca actaggtata 4920
gaatttctta agagaggtga taaaagtgta tattacacgt ccaatcctac cacattccac 4980
ctagatggtg aagttatcac ctttgacaat cttaagacac ttctttcttt gagagaagtg 5040
aggactatta aggtgtttac aacagtagac aacattaacc tccacacgca agttgtggac 5100
atgtcaatga catatggaca acagtttggt ccaacttatt tggatggagc tgatgttact 5160
aagataaaac ctcataactc acatgaaggt aaaacatttt acgttttgcc taatgatgac 5220
actctacgtg ttgaggcttt tgagtactac cacacaactg atcctagttt tctgggtagg 5280
tacatgtcag cattaaatca cactaaaaag tggaaatacc cacaagttaa tggtttaact 5340
tcgattaaat gggcagataa caactgttat cttgccactg cattgttaac actccaacaa 5400
atagagttga agtttaatcc acctgctcta caagatgctt attacagagc aagggctggt 5460
gaagctgcta acttttgtgc acttatctta gcctactgta ataagacagt aggtgagtta 5520
ggtgatgtta gagaaacaat gagttacttg tttcaacatg ccaatttaga ttcttgcaaa 5580
agagtcttga acgtggtgtg taaaacttgt ggacaacagc agacaaccct taagggtgta 5640
gaagctgtta tgtacatggg cacactttct tatgaacaat tcaagaaagg tgttcagata 5700
ccttgtacgt gtggtaaaca agctacaaaa tatctagtac aacaggagtc accttttgtt 5760
atgatgtcag caccacctgc tcagtatgaa cttaagcatg gtacatttac ttgtgctagt 5820
gagtacactg gtaattacca gtgtggtcac tataagcata taacttctaa ggaaactttg 5880
tattgcatag acggtgcttt acttacaaag tcctcagaat acaaaggtcc tattacggat 5940
gttttctaca aagaaaacag ttacacaaca accataaaac cagttactta taagttggat 6000
ggtgttgttt gtacagaaat tgaccctaag ttggacaatt attataagaa ggacaactct 6060
tatttcacag agcaaccaat tgatcttgta ccaaaccaac catatccaaa cgcaagcttc 6120
gataatttta agttcgtatg cgataatatc aaatttgctg atgatctcaa ccagttaact 6180
ggttataaga aacctgcttc aagagagctt aaagttacat ttttccctga cttaaatggt 6240
gatgtggtgg ctattgatta taaacactac acaccctctt ttaagaaagg agctaaattg 6300
ttacataagc ctattgtttg gcatgttaac aatgcaacta ataaagccac gtataaacca 6360
aatacctggt gtatacgttg tctttggagc acaaaaccag ttgaaacatc aaattcgttt 6420
gatgtactga agtcagagga cgcgcaggga atggataatc ttgcatgtga agatctaaaa 6480
ccagtctctg aagaagtagt ggaaaatcct accatacaga aagacgttct tgagtgtaat 6540
gtgaaaacta ccgaagttgt aggagacatt atacttaaac cagcaaataa tagtttgaag 6600
atcacagaag aggttggcca cacagatcta atggctgctt atgtagacaa ttctagtctt 6660
actattaaga aacctaatga actctctaga gtattaggtt tgaaaaccct tgctactcat 6720
ggtttagctg ctgttaatag tgtcccttgg gatactatag ctaattatgc taagcctttt 6780
cttaacaaag ttgttagtac aactactaac atagttacac ggtgtcttaa tcgtgtttgt 6840
actaattata tgccttactt ctttacttta ttgctacaat tgtgtacttt tactagaagt 6900
acaaattcta gaatcaaggc atctatgccg actactatag caaagaatac tgttaagagt 6960
gtcggtaaat tttgtctaga ggcttcattt aattatctca agtcacctaa cttttctaag 7020
ctgataaaca ttatcatctg gtttttgcta ttaagtgttt gcctaggttc tttaatctac 7080
tcaaccgctg ctttaggtgt tttaatgtct aatttaggca tgccttctta ctgtactggt 7140
tacagagaag gctatttgaa ctctactaat gtcactattg caacctactg tactggatct 7200
ataccttgta gtgtttgtct tagtggttta gattctttag acacctatcc ttctcttgaa 7260
actatacaga ttaccatttc atctttcaaa tgggatttaa ctgcttttgg cttagttgca 7320
gagtggtttt tggcatatat tcttttcact aggtttttct atgtacttgg attggctgca 7380
atcatgcaat tgtttttcag ctattttgca gtccatttta ttagtaactc ttggcttatg 7440
tggcttataa ttaatcttgt gcagatggcc ccgatttcag ctatggttag aatgtacatc 7500
ttctttgcct cattttatta tgtgtggaaa agttatgtgc atgttgtaga cggttgtaat 7560
tcatcaactt gtatgatgtg ttacaaacgt aatagagcaa caagagtcga atgtacaact 7620
attgttaatg gtgttagaag gtccttttat gtctatgcta atggaggtaa aggcttttgc 7680
aaactacaca attggaattg tgttaattgt gatacattct gtgctggtag tacatttatt 7740
agtgatgaag ttgcgagaga cttgtcacta cagtttaaaa gaccaataaa tcctactgac 7800
caatcttctt acatcgttga tagtgttaca gtgaagaatg gttccatcca tctttacttt 7860
gataaagctg gtcaaaagac ttatgaaaga cattctctct ctcattttgt taacttagac 7920
aacctgagag ctaataacac taaaggttca ttgcctatta atgttatcgt tttcgacggt 7980
aaatcaaaat gtgaagaatc atctgcaaaa tcagcgtctg tttactacag tcagcttatg 8040
tgtcaaccta tactgttact agatcaggca ttagtgtctg atgttggtga tagtgcggaa 8100
gttgcagtta aaatgtttga tgcttacgtt aatacgtttt catcaacttt taacgtacca 8160
atggaaaaac tcaaaacact agttgcaact gcagaagctg aacttgcaaa gaatgtgtcc 8220
ttagacaatg tcttatctac gtttatttca gcagctcggc aagggtttgt tgattcagat 8280
gtagaaacta aagatgttgt tgaatgtctt aaattgtcac atcaatctga catagaagtt 8340
actggcgata gttgtaataa ctatatgctc acctataaca aagttgaaaa catgacaccc 8400
cgtgaccttg gtgcttgtat tgactgtagt gctagacata ttaatgcgca ggtagcaaaa 8460
agtcacaaca ttgctttgat atggaacgtt aaagatttca tgtcattgtc tgaacaacta 8520
cgaaaacaaa tacgtagtgc tgctaaaaag aataacttac ccttcaagtt gacatgtgca 8580
actactagac aagttgttaa tgttgtaaca acaaagatag cacttaaggg tggtaaaatt 8640
gtgaataact ggttgaagca gcttattaaa gttacacttg tgttcctttt tgttgctgct 8700
attttctatc tgataacacc tgttcatgtc atgtctaaac atactgactt ttcaagtgaa 8760
atcataggat acaaggctat tgatggtggt gtcactcgtg acatagcatc tacagatact 8820
tgttttgcta acaaacatgc tgattttgac acatggttta gccagcgtgg tggtagttat 8880
actaatgaca aagcttgccc attgattgct gcagtcataa caagagaagt gggttttgtc 8940
gttcctggtt tgcctggaac gatattacgc acaactaatg gtgacttttt gcatttctta 9000
cctagagttt ttagtgcagt tggtaacatc tgttacacac catcaaaact tatagagtac 9060
actgactttg caacatcagc ttgtgttttg gctgctgaat gtacaatttt taaagacgct 9120
tctggtaagc cagtaccata ttgttatgat accaatgtac tagaaggttc tgttgcttat 9180
gaaagtttac gccctgacac acgttatgtg ctcatggatg gctctattat tcaatttcct 9240
aacacctacc ttgaaggttc tgtaagagtg gtaacaactt ttgattctga gtactgtagg 9300
cacggcactt gtgaaagatc agaagctggt gtttgtgtat ctactagtgg tagatgggta 9360
cttaacaacg attattacag atctttacca ggagttttct gtggtgtaga tgctgtaaat 9420
ttgcttacta acatgtttac accactaatt caacctattg gtgctttgga catatcagca 9480
tctatagtag ctggtggtat tgtagctatc gtagtaacat gccttgccta ctattttatg 9540
aggtttagac gtgcttttgg tgaatacagt catgtagttg cctttaatac tctcctattc 9600
cttatgtcat tcactgtact ctgtttaaca ccagtttact cattcttacc tggtgtttat 9660
tctgttattt acctgtactt gacattttat ctgactaatg atgtttcttt tctcgcacat 9720
attcagtgga tggttatgtt cacaccttta gtacctttct ggataacaat tgcttacatc 9780
atttgtattt ccacaaagca tttctattgg ttctttagta attacctaaa gagacgtgta 9840
gtctttaatg gtgtttcctt tagtactttt gaagaagctg cgctgtgcac ctttttgtta 9900
aataaggaga tgtatctaaa gttgcgtagt gatgtgctat tacctcttac gcaatataat 9960
agatacttag ctctttataa caagtacaag tatttcagtg gagcaatgga tacaactagc 10020
tacagagaag ctgcttgttg tcatctcgca aaggctctca atgacttcag taactcaggt 10080
tctgatgttc tttaccaacc accacaaacc tctatcacct cagctgtttt gcagagtggt 10140
tttagaaaaa tggcattccc atctggtaaa gttgagggtt gtatggtaca agtaacttgt 10200
ggtacaacta cacttaacgg tctttggctt gatgacgtag tttactgtcc aagacatgtg 10260
atctgcacct ctgaagatat gcttaaccct aattatgaag atctactcat ccgtaagtct 10320
aatcataact tcttggtaca ggctggtaat gttcaactca gggttattgg acattctatg 10380
caaaattgtg tacttaagct taaggttgat acagccaatc ctaagacacc taagtataag 10440
tttgttcgca ttcaaccagg acagactttt tcagtgttag cttgttacaa tggttcacca 10500
tctggtgttt accaatgtgc tatgaggccc aatttcacta ttaagggttc attccttaat 10560
ggttcatgtg gtagtgttgg ttttaacata gattatgact gtgtctcttt ttgttacatg 10620
caccatatgg aattaccaac tggagttcat gctggcacag acttagaagg taacttttat 10680
ggaccttttg ttgacaggca aacagcacaa gcagctggta cagatacaac tattacagtt 10740
aatgttcttg cttggttgta cgctgctgtt ataaatggag acaggtggtt tctcaatcga 10800
tttaccacaa ctcttaatga ctttaacctt gtggctatga agtacaatta tgaacctcta 10860
acacaagacc atgttgacat actaggacct ctttctgctc aaactggaat tgccgtttta 10920
gatatgtgtg cttcattaaa agaacttctg caaaatggta tgaatggacg taccatattg 10980
ggtagtgctt tattagaaga tgagtttaca ccttttgatg ttgttagaca atgctcaggt 11040
gttactttcc aaagtgcagt gaaaagaaca atcaagggta cacaccactg gttgttactc 11100
acaattttga cttcactttt agttttagtc cagagtactc aatggtcttt gttctttttc 11160
ttctacgaaa atgccttttt accttttgct atgggtatta ttgctatgtc tgcttttgca 11220
atgatgtttg tcaaacataa gcatgcattt ctctgtttgt ttttgttacc ttctcttgcc 11280
actgtagctt actttaatat ggtctacatg cctgctagtt gggtgatgcg tattatgaca 11340
tggttggata tggttgatac tagtttgtct ggttttaagc taaaagactg tgttatgtat 11400
gcatcagctg tagtgttact aatccttatg acagcaagaa ctgtgtatga tgatggtgct 11460
aggagagtgt ggacacttat gaatgtcttg acactcgttt ataaagttta ctatggcaac 11520
gctttagatc aagccatttc catgtgggct cttataatct ctgttacttc taactactca 11580
ggtgtagtta caactgtcat gtttttggcc agaggtattg tttttatgtg tgttgagtat 11640
tgccctattt tcttcataac tggtaataca cttcagtgta taatgctagt ctattgtttc 11700
ttaggctatt tttgtacttg ttacttcggc ctcttttgtt tactcaaccg ctactttaga 11760
ctgactcttg gtgtttatga ttacttagtg tctacacagg agtttagata tatgaattca 11820
cagggactac tcccacccaa gaatagcata gatgccttca aactcaacat taaattgttg 11880
ggtgttggtg gcaaaccttg tatcaaagta gccactgtac agtctaaaat gtcagatgta 11940
aagtgcacat cagtagtctt actctcagtt ttgcaacaac tcagagtaga atcatcatct 12000
aaattgtggg ctcaatgtgt ccagttacac aatgacattc tcttagctaa agatactact 12060
gaagcctttg aaaaaatggt ttcactactt tctgttttgc tttccatgca gggtgctgta 12120
gacataaaca agctttgtga agaaatgctg gacaacaggg caaccttaca agctatagcc 12180
tcagagttta gttcccttcc atcatatgca gcttttgcta ctgctcaaga agcttatgag 12240
caggctgttg ctaatggtga ttctgaagtt gttcttaaaa agttgaagaa gtctttgaat 12300
gtggctaaat ctgaatttga ccgtgatgca gccatgcaac gtaagttgga aaagatggct 12360
gatcaagcta tgacccaaat gtataaacag gctagatctg aggacaagag ggcaaaagtt 12420
actagtgcta tgcagacaat gcttttcact atgcttagaa agttggataa tgatgcactc 12480
aacaacatta tcaacaatgc aagagatggt tgtgttccct tgaacataat acctcttaca 12540
acagcagcca aactaatggt tgtcatacca gactacaaca catataagaa tacgtgtgat 12600
ggtacaacat ttacttatgc atcagcattg tgggaaatcc aacaggttgt agatgcagat 12660
agtaaaattg ttcagcttag tgaaattagt atggacaatt cacctaattt agcatggcct 12720
cttattgtaa cagctttaag ggccaattct gctgtcaaat tacagaataa tgagcttagt 12780
cctgttgcac taagacaaat gtcttgtgct gccggtacta cacaaactgc ttgcactgat 12840
gacaatgcgt tagcttacta caacacaaca aagggaggta ggtttgtact tgcactgtta 12900
tccgatttac aggatttgaa atgggctaga ttccctaaga gtgatggaac tggtactatc 12960
tatacagaac tggaaccacc ttgtaggttt gttacagaca cacctaaagg tcctaaagtg 13020
aagtatcttt acttcatcaa aggattaaac aacctaaata gaggtatggt acttggtagt 13080
ttagctgcca cagtacgttt acaagctggt aatgcaacag aagttcctgc taattcaact 13140
gtactttctt tctgtgcttt tgctgtagat gctgctaaag cttacaaaga ttatctagct 13200
agtgggggac aaccaatcac taattgtgtt aagatgttgt gtacacacac tggtactggt 13260
caggcaataa cagttacacc ggaagccaat atggatcaag aatcctttgg tggtgcatcg 13320
tgttgtctgt actgccgttg tcatatagat catccaaatc ctaaaggatt ttgtgactta 13380
aaaggtaagt atgtacaaat acctacaact tgtgctaatg accctgtggg ttttacactt 13440
aaaaacacag tctgtaccgt ctgcggtatg tggaaaggtt atggttgtag ttgtgatcaa 13500
ctccgcgaac ccatgcttca gtcagctgat gcacaatcgt ttttaaacgg gtttgcggtg 13560
taagtgcagc ccgtcttaca ccgtgcggca caggcactag tactgatgtc gtatatagag 13620
cttttgacat ctacaatgat aaagtagctg gttttgctaa gttcctaaaa actaattgtt 13680
gtcgcttcca agaaaaggac gaagatgaca atctcattga ttcttacttt gtagttaaga 13740
gacacacttt ctctaactac caacatgaag aaacaattta caacctgctt aaggattgtc 13800
cagctgttgc taaacatgac ttctttaagt ttagaataga cggtgacatg gtaccacata 13860
tatcacgtca acgtcttact aaatacacaa tggcagacct cgtctatgct ttaaggcatt 13920
ttgatgaagg taattgtgac acattaaaag aaatacttgt cacatacaat tgttgtgatg 13980
atgactactt caataaaaag gactggtatg attttgtaga aaacccagat atattacgcg 14040
tatacgccaa cttaggtgaa cgtgtacgcc aagctttgtt aaaaacagta cagttctgtg 14100
atgccatgcg aaatgctggt attgttggtg tactgacatt agataatcaa gatctcaatg 14160
gtaactggta tgactttggt gatttcatac aaaccacgcc aggtagtgga gttcctgttg 14220
tagactctta ttattcattg ctcatgccta tattaacctt gaccagggct ttaactgcag 14280
agtcacatgt tgacactgac ttaacaaagc cttacattaa gtgggatttg ttaaaatacg 14340
acttcacgga agagaggtta aaactctttg accgttattt taaatactgg gatcagacat 14400
accacccaaa ttgtgttaac tgtttggatg acagatgcat tctgcattgt gcaaacttta 14460
atgttctgtt ctctacagtg ttcccaccta caagttttgg accactagtg agaaaaatat 14520
ttgttgatgg tgttccattt gtagtttcaa ctggatacca cttcagagag ctaggtgttg 14580
tacataatca ggatgtaaac ttacatagct ctagacttag ttttaaggaa ttacttgtgt 14640
atgctgctga tcctgctatg catgctgctt ctggtaatct attactagat aaacgcacta 14700
cgtgcttttc agtagctgca cttactaaca atgttgcttt tcaaactgtc aaacccggta 14760
attttaacaa ggacttctat gactttgctg tgtctaaggg tttctttaag gaaggaagtt 14820
ctgttgaatt aaaacacttc ttctttgctc aggatggtaa tgctgctatc agcgattatg 14880
actactatcg ttataatcta ccaacaatgt gtgatatcag acaactacta tttgtagttg 14940
aagttgttga taagtacttt gattgttacg atggtggctg tattaatgct aaccaagtca 15000
tcgtcaacaa cctagacaaa tcagctggtt ttccatttaa taaatggggt aaggctagac 15060
tttattatga ttccatgagt tatgaggatc aagatgcact tttcgcatat acaaaacgta 15120
atgtcatccc tactataact caaatgaacc ttaagtatgc cattagtgca aagaatagag 15180
ctcgcaccgt agctggtgtc tctatctgta gtactatgac caatagacag tttcatcaaa 15240
aattactcaa gtcaatagcc gccactagag gagctactgt agtaattgga acaagcaaat 15300
tctatggtgg ttggcacaac atgctcaaaa ctgtttatag tgatgtagaa aaccctcacc 15360
ttatgggttg ggattatcct aaatgtgata gagccatgcc taacatgctt agaattatgg 15420
cctcacttgt tcttgctcgc aaacatacaa cgtgttgtag cttgtcacac cgtttctata 15480
gattagctaa tgagtgtgct caagtattga gtgaaatggt catgtgtggc ggttcactat 15540
atgttaaacc aggtggaacc tcatcaggag atgccacaac tgcttatgct aatagtgtgt 15600
ttaacatttg tcaagctgtc acggccaatg ttaatgcact tttatctact gatggtaaca 15660
aaattgccga taagtatgtc cgcaatttac aacacagact ttatgagtgt ctctatagaa 15720
atagagatgt tgacacagac tttgtgaatg agttttacgc atatttgcgt aaacatttct 15780
caatgatgat actctctgac gatgctgttg tgtgtttcaa tagcacttat gcatctcaag 15840
gtctagtggc tagcataaag aactttaagt cagttcttta ctatcaaaac aacgttttta 15900
tgtctgaagc aaaatgttgg actgagactg accttactaa aggacctcat gaattttgct 15960
ctcaacatac aatgctagtt aaacagggtg atgattatgt gtaccttcct tacccagatc 16020
catcaagaat cctaggtgcc ggttgttttg tagatgatat cgtaaaaaca gatggtacac 16080
ttatgattga acggttcgtg tctttagcta tagatgctta cccacttact aaacatccta 16140
atcaggagta tgctgatgtc tttcatttgt acttacaata catacgtaag ctacatgatg 16200
agttaacagg acacatgtta gacatgtatt ctgttatgct tactaatgat aacacttcaa 16260
ggtattggga acctgagttt tatgaggcta tgtacacacc gcatacagtc ttacaagctg 16320
ttggtgcttg tgttctttgc aattcacaga cttcattaag atgtggtgct tgcatacgta 16380
gaccattctt atgttgtaaa tgctgttacg accatgtcat ctcaacatca cataaattag 16440
tcttgtctgt taatccgtat gtttgcaatg ctccaggttg tgatgtcaca gatgtgactc 16500
aactttactt aggaggtatg agctattact gtaagtcaca taaaccaccc attagttttc 16560
cattgtgtgc taatggacaa gtttttggtc tctacaagaa tacatgtgtt ggtagcgata 16620
atgttactga ctttaatgca attgcaacat gtgactggac aaatgctggt gattacattt 16680
tagctaacac ctgtactgaa agactcaagc tttttgcagc agaaacgctc aaagctactg 16740
aggagacatt taaactgtct tatggtattg ctactgtacg tgaagtgctg tctgacagag 16800
aattacatct ttcatgggaa gttggtaaac ctagaccacc acttaaccga aattatgtct 16860
ttactggtta tcgtgtaact aaaaacagta aagtgcaaat cggagagtac acctttgaaa 16920
aaggtgacta tggtgatgct gttgtttacc gaggtacaac aacttacaaa ctcaacgttg 16980
gtgattattt tgtgctgaca tcacatacag taatgccatt aagtgcacct acactagtgc 17040
cacaagagca ctatgttaga attactggct tatacccaac actcaatatc tcagatgagt 17100
tttctagcaa tgttgcaaat tatcaaaagg ttggtatgca aaagtattct acactccagg 17160
gaccacctgg tactggtaaa agtcattttg ctattggtct agctctctac tacccttctg 17220
ctcgcatagt atatacagct tgctctcatg cagctgttga tgcactatgt gagaaggcat 17280
taaaatattt gcccatagac aaatgtagta gaattatacc tgcacgtgct cgtgtagagt 17340
gttttgataa attcaaggtg aattcaacat tagaacagta tgtcttttgt actgtaaatg 17400
cattgcctga gacgacagca gatatagttg tctttgatga aatttcaatg gccacaaatt 17460
atgatttgag tgttgtcaat gccagattac gtgctaagca ctatgtgtac attggtgatc 17520
ctgctcaatt acctgcacca cgcacattac taactaaggg tacactagaa ccagaatatt 17580
tcaattcagt gtgtagactt atgaaaacta taggtccaga catgttcctc ggaacttgtc 17640
gtagatgtcc tgctgaaatt gttgacactg tgagtgcttt ggtttatgat aataagctta 17700
aggcacataa agacaaatca gctcaatgct ttaaaatgtt ctacaagggt gttatcacgc 17760
atgatgtttc atctgcaatt aacaggccac aaataggcgt ggtaagagaa ttccttacac 17820
gtaaccctgc ttggagaaaa gctgtcttta tttcacctta caattcccag aatgctgtag 17880
cctcaaagat tttgggacta ccaactcaaa ctgttgattc atcacagggc tcagaatatg 17940
actatgtcat attcactcaa accactgaaa cagctcactc ttgtaatgta aacagattca 18000
acgttgctat taccagagca aaagtaggca tactttgcat aatgtctgat agagaccttt 18060
atgacaagtt gcaatttaca agtcttgaaa ttccacgtag gaatgtggca actttacaag 18120
ctgaaaatgt aacaggactc tttaaagatt gtagtaaggt aatcactggg ttacatccta 18180
cacaggcacc tacacactta agtgttgata ctaaattcaa aactgaaggt ttatgtgttg 18240
acatacctgg catacctaag gacatgacct atagaagatt aatctctatg atgggtttca 18300
aaatgaatta ccaggttaat ggttacccta acatgtttat cacccgcgaa gaagctataa 18360
gacatgtacg tgcatggatt ggcttcgatg tcgaaggttg tcatgctact agagaagctg 18420
ttggtaccaa tttaccttta cagctaggtt tttctacagg tgttaaccta gttgctgtac 18480
ctacaggtta tgttgataca cctaataata cagatttttc cagagttagt gctaaaccac 18540
cgcctggaga tcaatttaaa cacctcatac cacttatgta caaaggactt ccttggaatg 18600
tagtgcgtat aaagattgtc caaatgttaa gtgacacact taaaaatctc tctgacagag 18660
tcgtatttgt cttatgggca catggctttg agttgacatc tatgaagtat tttgtgaaga 18720
tcggacctga gcgcacatgt tgtctatgtg atagacgtgc tacatgcttt tccactgctt 18780
cagacactta tgcctgttgg catcattcta ttggatttga ttacgtctat aatccgttta 18840
tgattgatgt tcaacaatgg ggttttacag gtaacctaca aagcaaccat gatctgtatt 18900
gtcaagtcca tggtaatgca catgtagcta gttgtgatgc aatcatgact aggtgtctag 18960
ctgtccacga gtgctttgtt aagcgtgttg actggactat tgaatatcct ataatcggtg 19020
atgaactgaa gattaatgcg gcttgtagaa aggttcaaca catggttgtt aaagctgcat 19080
tattagcaga caaattccca gttcttcacg acattggtaa ccctaaagct attaagtgtg 19140
tacctcaagc tgatgtagaa tggaagttct atgatgcaca gccttgtagt gacaaagctt 19200
acaaaataga agaactgttc tattcttatg ccacacattc tgacaaattc acagatggtg 19260
tatgcctatt ttggaattgc aatgtcgata gatatcctgc taattccatt gtttgtagat 19320
ttgacactag agtgctatct aaccttaact tgcctggttg tgatggtggc agtttgtatg 19380
taaataagca tgcattccac acaccagctt ttgataaaag tgcttttgtt aatctaaagc 19440
aacttccatt tttctattac tctgacagtc catgtgagtc tcatggaaaa caagtagtgt 19500
cagatataga ttatgtacca ctaaagtctg ctacgtgtat aacacgttgc aatttaggtg 19560
gtgctgtctg tagacatcat gctaatgagt acagattgta tctcgatgct tataacatga 19620
tgatctcagc tggctttagc ttgtgggttt acaaacaatt tgatacctat aacctctgga 19680
acacttttac aagacttcag agtttagaaa atgtggcttt taatgttgta aataagggac 19740
actttgatgg acaacagggt gaagtaccag tttctatcat taacaacact gtttacacaa 19800
aagttgatgg tgttgatgta gaattgtttg agaacaaaac cacattacct gttaatgtag 19860
catttgagct ttgggctaag cgcaacatta aaccagtacc agaggtgaaa atactcaata 19920
atttgggtgt ggacattgct gctaatactg tgatctggga ctacaaaaga gatgctccag 19980
cacatatatc tactattggt gtttgttcta tgactgacat agccaagaaa ccaactgaaa 20040
cgatttgtgc accactcact gtcttttttg atggtagagt tgatggtcaa gtagacttat 20100
ttagaaatgc ccgtaatggt gttcttatta cagaaggtag tgttaaaggt ttacaaccat 20160
ctgtaggtcc caaacaagct agtcttaatg gagtcacatt aattggagaa gccgtaaaaa 20220
cacagttcaa ttattacaag aaagtggatg gtgttgtcca acaattacct gaaacttact 20280
ttactcagag tagaaactta caggaattta agcccaggag tcaaatggaa attgatttct 20340
tagaacttgc tatggatgaa ttcattgaac ggtataaatt agaaggctat gccttcgaac 20400
atatcgttta tggagatttt agtcatagtc agttaggtgg tttacatcta ctgattggac 20460
tagctaaacg ttttaaggaa tcaccttttg aacttgaaga ttttattcct atggacagta 20520
cagttaaaaa ctacttcata acagatgcgc aaacaggttc atctaagtgt gtgtgttctg 20580
ttattgatct tttacttgat gacttcgttg aaataataaa gtcccaagat ttatctgtag 20640
tttctaaggt tgtcaaagtg actattgact atacagaaat ctcatttatg ctttggtgta 20700
aagatggcca tgtagaaaca ttttacccaa aattacaatc tagtcaagcg tggcaaccgg 20760
gtgttgctat gcctaatctt tacaaaatgc aaagaatgct attagaaaag tgtgaccttc 20820
aaaattatgg tgatagtgca acattaccta aaggcataat gatgaatgtc gcaaaatata 20880
ctcaactgtg tcaatattta aacacactga cattagctgt accctataat atgagagtta 20940
tccattttgg tgctggttct gataaaggag ttgcaccagg tacagctgtt ttaagacaat 21000
ggttgcctac aggtacgctg cttgtcgatt cagatcttaa tgactttgtc tctgatgcag 21060
attcaacttt gattggtgat tgtgcaactg tacatacagc taataaatgg gatctcatta 21120
ttagtgatat gtacgaccct aagactaaga atgtcacaaa agaaaacgac tctaaagagg 21180
gttttttcac ttacatttgt gggtttatac aacaaaagct agctcttgga ggttccgtgg 21240
ctataaagat aacagaacat tcttggaatg ctgatcttta taagctcatg ggacacttcg 21300
catggtggac agcctttgtt actaatgtga atgcgtcatc atctgaagca tttttaatcg 21360
gatgtaacta ccttggcaaa ccacgcgaac aaatagatgg ttatgtcatg catgcaaatt 21420
acatattttg gaggaataca aatccaattc agctttcttc ttattcttta ttcgacatga 21480
gtaaattccc ccttaaatta aggggtactg ctgttatgtc tttaaaagaa ggtcaaatca 21540
atgatatgat tctctctctt cttagtaaag gtagacttat aattagagaa aacaacagag 21600
ttgttatttc tagtgatgtt cttgttaaca actaaacgaa caatgtttgt ttttcttgtt 21660
ttattgccac tagtctctag tcagtgtgtt aatcttacaa ccagaactca attaccccct 21720
gcatacacta attctttcac acgtggtgtt tattaccctg acaaagtttt cagatcctca 21780
gttttacatt caactcagga cttgttctta cctttctttt ccaatgttac ttggttccat 21840
gctatacatg tctctgggac caatggtact aagaggtttg ataaccctgt cctaccattt 21900
aatgatggtg tttactttgc ttccactgag aagtctaaca taataagagg ctggattttt 21960
ggtactactt tagattcgaa aacccagtcc ctacttattg ttaataacgc tactaatgtt 22020
gttatcaaag tctgtgaatt tcaattttgt aacgatccat ttttgggtgt ttattaccac 22080
aaaaacaaca aaagttggat ggaaagtgag ttcagagttt attctagtgc gaataattgc 22140
acttttgaat acgtctctca gccttttctt atggaccttg aaggaaaaca gggtaatttc 22200
aaaaatctta gggaatttgt gttcaagaat attgatggtt acttcaagat atactctaag 22260
cacacgccta ttaatttagt gcgtgatctc cctcagggtt tttcggcttt agaaccattg 22320
gtagatttgc caataggtat taacatcact aggtttcaaa ctttacttgc tttacataga 22380
agttatttaa ctcctggtga ttcttcttca ggttggacag ctggtgctgc agcttattat 22440
gtgggttatc ttcaacctag gacttttcta ctgaagtaca atgaaaatgg aaccattaca 22500
gatgctgtag actgtgcact tgaccctctc tcagaaacaa agtgtacgtt gaaatccttc 22560
actgtagaaa aaggaatcta tcaaacttct aactttagag tccaaccaac agaatctatt 22620
gttagatttc ctaacatcac aaacttgtgc ccttttggtg aagtttttaa cgccaccaga 22680
tttgcatctg tttatgcttg gaacaggaag agaatcagca actgtgttgc tgattattct 22740
gtcctgtata attccgcatc attttccact tttaagtgtt atggagtgtc tcctactaaa 22800
ttaaatgatc tctgctttac taatgtctat gcagattcat ttgtaattag aggtgatgaa 22860
gtcagacaaa tcgctccagg gcaaactgga aagattgctg attataacta caaattacca 22920
gatgatttta caggctgcgt tatagcttgg aattctaaca atcttgattc taaggttggt 22980
ggtaattata attacctgta cagattgttt aggaagtcta atctcaaacc ttttgagaga 23040
gatatttcaa ctgaaatcta tcaggccggt agcacacctt gtaatggtgt tgaaggtttt 23100
aattgttact ttcctctgca atcatatggt ttccaaccca ctaatggtgt tggttaccaa 23160
ccatacagag tagtagtact ttcttttgaa cttctacatg caccagcaac tgtttgtgga 23220
cctaaaaagt ctactaattt ggttaagaac aagtgtgtca atttcaactt caatggttta 23280
acaggcacag gtgttcttac tgagtctaac aaaaagtttc tgcctttcca acaatttggc 23340
agagacattg ctgacactac tgatgctgtt cgtgatccac aaacacttga gattcttgac 23400
attacaccat gttcttttgg tggtgtcagt gttataacac caggaacaaa tacttctaac 23460
caggttgctg ttctttatca ggatgttaac tgcacagaag tccctgttgc tattcatgca 23520
gatcaactta ctcctacttg gcgtgtttat tctacaggtt ctaatgtttt tcaaacacgt 23580
gcaggctgtt taataggggc tgaacatgtc aacaactcat atgagtgtga catacccatt 23640
ggtgcaggta tatgcgctag ttatcagact cagactaatt ctcctcggag agcaagaagt 23700
gtagctagtc aatccatcat tgcctacact atgtcacttg gtgcagaaaa ttcagttgct 23760
tactctaata actctattgc catacccaca aattttacta ttagcgttac cacagaaatt 23820
ctaccagtgt ctatgaccaa gacatcagta gattgtacaa tgtacatttg tggtgattca 23880
actgaatgca gcaatctttt gttgcaatat ggcagttttt gtacacaatt aaaccgtgct 23940
ttaactggaa tagctgttga acaagacaaa aacacccaag aagtttttgc acaagtcaaa 24000
caaatttaca agacaccacc aattaaagat tttggcggtt ttaattttag ccagatactg 24060
ccagatccat caaaaccaag caagaggtca tttattgaag atctactgtt caacaaagtg 24120
acacttgcag atgctggctt catcaaacaa tatggtgatt gccttggtga tattgctgct 24180
agagacctca tttgtgcaca aaagtttaac ggccttactg ttttgccacc tttgctcaca 24240
gatgaaatga ttgctcaata cacttctgca ctgttagcag gtacaatcac ttctggttgg 24300
acttttggtg caggtgctgc attacaaata ccatttgcta tgcaaatggc ttataggttt 24360
aatggtattg gagttacaca gaatgttctc tatgagaacc aaaaattgat tgccaaccaa 24420
tttaatagtg ctattggcaa aattcaagac tcactttctt ccacagcaag tgcacttgga 24480
aaacttcaag atgtggtcaa ccaaaatgca caagctttaa acacgcttgt taaacaactt 24540
agctccaatt ttggtgcaat ttcaagtgtt ttaaacgaca tcctttcacg tcttgacaaa 24600
gttgaggctg aagtgcaaat tgataggttg atcacaggca gacttcaaag tttgcagaca 24660
tatgtgactc aacaattaat tagagctgca gaaatcagag cttctgctaa tcttgctgct 24720
actaaaatgt cagagtgtgt acttggacaa tcaaaaagag ttgacttttg cggaaagggc 24780
tatcatctta tgtcatttcc tcagtcagca cctcatggtg tcgtcttttt gcatgtgact 24840
tatgtccctg cacaagaaaa gaacttcaca actgctcctg ccatttgtca tgatggaaaa 24900
gcacactttc ctcgtgaagg tgtctttgtt tcaaatggca cacactggtt tgtaacacaa 24960
aggaattttt atgaaccaca aatcattact acagacaaca catttgtgtc tggtaactgt 25020
gatgttgtaa taggaattgt caacaacaca gtttatgatc ctttgcaacc tgaattagac 25080
tcattcaagg aggagcttga taaatacttc aagaaccata cctcaccaga tgttgattta 25140
ggtgacatct ctggcattaa tgcttcagtt gtaaacattc agaaagaaat cgaccgcctc 25200
aatgaggttg ccaagaattt aaatgaatct ctcatcgatc tccaagaact tggaaagtat 25260
gagcagtata taaaatggcc atggtacatt tggctaggtt ttatagctgg cttgattgcc 25320
atagtaatgg tgacaattat gctttgctgt atgaccagtt gctgtagttg tctcaagggc 25380
tgttgttctt gtggatcctg ctgcaaattt gacgaggacg actctgagcc agtgctcaaa 25440
ggagtcaaat tacattacac ataaacgaac ttatggattt gtttatgaga atcttcacaa 25500
ttggaactgt aactttgaag caaggtgaaa tcaaggatgc tactccttca gattttgtta 25560
gagctactgc aacgataccg atacaagcat cacttccttt cggatggctt attgttggcg 25620
ttgcacttct tgctgttttt cagagcgctt ccaaaatcat aaccctcaaa aagagatggc 25680
aactagcact ctccaagggt gttcactttg tttgcaactt gctgttgttg tttgtaacag 25740
tttactcaca tcttttgctt gttgctgctg gccttgaagc cccttttctc tatctttatg 25800
ctttagtcta cttcttgcag agtataaact ttgtacgcat aataatgagg ctttggcttt 25860
gctggaaatg ccgttccaaa aacccattac tttatgatgc caactatttt ctttgctggc 25920
atactaattg ttacgactat tgtatacctt acaatagtgt aacttcttca attgtcatta 25980
cttcaggtga tggcacaaca agtcctattt ctgaacatga ctaccagatt ggtggttata 26040
ctgaaaaatg ggaatctgga gtaaaagact gtgttgtatt acacagttac ttcacttcag 26100
actattacca gctgtactca actcaattga gtacagacac tggtgttgaa catgttacct 26160
tcttcatcta caataaaatc gttgatgagc ctgaagaaca tgtccaaatt cacacaatcg 26220
acgtttcatc cggagttgtt aatccagtaa tggaaccaat ttatgatgaa ccgacgacga 26280
ctactagcgt gcctttgtaa gcacaagctg atgagtacga acttatggca gattccaacg 26340
gtactattac cgttgaggag ctgaaaaagc tccttgaaca atggaaccta gtaataggtt 26400
tcctattcct tacatggatt tgcctgctgc aatttgccta tgccaacagg aataggtttt 26460
tgtacatcat taagttgatt ttcctctggc tgttatggcc agtaacttta gcttgttttg 26520
tgcttgctgc tgtttacaga ataaattgga tcaccggtgg aattgctatt gcaatggctt 26580
gtcttgtagg attgatgtgg ctaagctact tcattgcttc tttcagactg tttgcgcgta 26640
cgcgttccat gtggtcattc aatccagaaa ctaacattct tctcaacgtg ccactccatg 26700
gaactattct gactagaccg cttctagaaa gtgaactcgt aatcggagct gttatccttc 26760
gtggacatct tcgtattgct ggacatcatc taggacgctg tgacatcaag gatctaccta 26820
aagaaatcac tgttgctaca tcacgaacgc tttcttatta caaattggga gcttcacagc 26880
gtgtagcagg tgattcaggt tttgctgcat atagtcgcta caggattggc aactataaat 26940
taaacacaga ccattccagt agcagtgaca atattgcttt gcttgtacag taagtgacaa 27000
cagatgtttc atctcgttga ctttcaggtt actatagcag agatattact aatcatcatg 27060
aggactttta aagtttccat ttggaatctt gattacatca taaacctcat aattaagaac 27120
ttaagcaagt cactaactga gaataaatat tctcaactag acgaggagca gccaatggag 27180
attgattaaa cgaacatgaa aattattctt ttcttggcac tgataacact cgctacttgt 27240
gagctttatc actaccaaga gtgtgttaga ggtacaacag tacttttaaa agaaccttgc 27300
tcgtcgggaa catacgaggg caattcacca tttcatcctc tagctgataa caaatttgca 27360
ctgacttgct ttagcactca atttgctttt gcttgtcctg acggcgtaaa acacgtctat 27420
cagttacgtg ccagatcagt ttcacctaaa ctgttcatca gacaagagga agttcaagaa 27480
ctttactctc caatttttct tattgttgcg gcaatagtgt ttataacact ttgcttcaca 27540
ctcaaaagaa agacagaatg attgaacttt cattaattga cttctatttg tgctttttag 27600
cctttctgct attccttgtt ttaattatgc ttattatctt ttggttctca cttgaactgc 27660
aagatcataa tgaaacttgt cacgcctaag acgttcgtgt tgttttagat ttcatctaaa 27720
cgaacaaact aaaatgtctg ataatggacc tcaaaatcag cgaaatgcac ctcgcattac 27780
gtttggtgga ccatcagatt caactggcag taaccagaat ggagaacgaa gtggtgcgcg 27840
atcaaaacaa cgccgcccgc aaggtttacc caataatact gcgtcttggt tcaccgctct 27900
cactcaacat ggcaaggaag atttaaaatt ccctcgagga caaggcgttc caattaacac 27960
caatagcagt ccagatgacc aaattggcta ctaccgccgc gccacaagac gaattcgtgg 28020
tggtgatggt aaaatgaaag atctcagtcc aagatggtat ttctactatc taggaactgg 28080
gccagaagct ggacttcctt atggtgctaa caaagatggc atcatatggg ttgcaactga 28140
gggagccttg aatacaccaa aagatcacat tggcaccaga aatcctgcta acaatgctgc 28200
aatcgtgcta caacttcctc aaggaacaac attaccaaaa ggtttttacg cagaagggtc 28260
tagaggtgga agtcaagcct cttctagatc atcatcacgt agtcgcaaca gttcaagaaa 28320
ttcaactcca ggttcaagta gaggaacttc tcctgctaga atggctggaa atggaggtga 28380
tgctgctctt gctttgttac tacttgacag attgaaccag cttgagagca aaatgtctgg 28440
taaaggccaa caacaacaag gccaaactgt cactaagaaa tctgctgctg aggcttctaa 28500
gaagcctaga caaaaacgta ctgccactaa agcatacaat gtaacacaag ctttcggcag 28560
acgtggtcca gaacaaactc aaggaaattt tggggatcag gaactaatca gacaaggaac 28620
tgattacaaa cattggccgc aaattgcaca atttgctcct tctgcttcag cgttctttgg 28680
aatgtcgaga attggaatgg aagtcacacc ttcgggaaca tggttgacct atacaggtgc 28740
catcaaattg gatgacaaag atccaaattt caaagatcaa gtcattttgc tgaataagca 28800
tattgacgca tacaaaacat tcccaccaac agagcctaaa aaggacaaaa agaagaaggc 28860
tgatgaaact caagccttac cgcagagaca gaagaaacag caaactgtga ctcttcttcc 28920
tgctgcagat ttggatgatt tctccaaaca attgcaacaa tccatgagca gtgctgactc 28980
aactcaggcc taaactcatg cagaccacac aaggcagatg ggctatataa acgttttcgc 29040
ttttccgttt acgatatata gtctactctt gtgcagaatg aattctcgta actacatagc 29100
acaagtagat gtagttaact ttaatctcac atagcaatct ttaatcagtg tgtaacatta 29160
gggaggactt gaaagagcca ccacattttc accgaggcca cgcggagtac gatcgagtgt 29220
acagtgaaca atgctaggga gagctgccta tatggaagag ccctaatgtg taaaattaat 29280
tttagtagtg ctatccccat gtgattttaa tagcttctta ggagaatgac aaaaaaaaac 29340
aaaaaaaa 29348
<210> 43
<211> 29152
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthesis optimized sequence E-protein ORF6, and ORF8 triple
deletion
<400> 43
caaatataac gaaaggctca gtcgaaagac tgggcctttc gttttatctg ttgtttgtcg 60
taatacgact cactataggg attaaaggtt tataccttcc caggtaacaa accaaccaac 120
tttcgatctc ttgtagatct gttctctaaa cgaactttaa aatctgtgtg gctgtcactc 180
ggctgcatgc ttagtgcact cacgcagtat aattaataac taattactgt cgttgacagg 240
acacgagtaa ctcgtctatc ttctgcaggc tgcttacggt ttcgtccgtg ttgcagccga 300
tcatcagcac atctaggttt cgtccgggtg tgaccgaaag gtaagatgga gagccttgtc 360
cctggtttca acgagaaaac acacgtccaa ctcagtttgc ctgttttaca ggttcgcgac 420
gtgttagtac gtggttttgg agattcagtg gaagaagtct tatcagaggc acgtcaacat 480
cttaaagatg gcacttgtgg cttagtagaa gttgaaaaag gcgttttgcc tcaacttgaa 540
cagccctatg tgttcatcaa acgttctgat gctagaactg cacctcatgg tcatgttatg 600
gttgagctgg tagcagaatt agaaggtatt cagtacggtc gtagtggtga gacattaggt 660
gttttagttc ctcatgtggg cgaaatacca gtggcttacc gcaaagttct tcttagaaag 720
aacggtaata aaggagctgg tggccatagt tacggcgctg atttaaagtc atttgactta 780
ggcgacgagc ttggcactga tccttatgaa gatttccaag aaaactggaa cactaaacat 840
agcagtggtg ttacccgtga actcatgcgt gagttaaatg gaggtgcata cactcgctat 900
gtcgataaca acttctgtgg acctgatggt taccctcttg agtgcattaa agaccttcta 960
gcacgtgctg gtaaagcttc atgcactttg tccgaacaac tggactttat tgacactaag 1020
aggggtgtat actgctgccg tgaacatgag catgaaattg cttggtacac ggaacgttct 1080
gaaaagagct atgaattgca gacacctttt gaaattaaac tggcaaagaa atttgacacc 1140
ttcaatgggg aatgtccaaa ttttgtattt cccctcaatt ccataatcaa gactattcaa 1200
ccaagggttg aaaagaaaaa gcttgatggc tttatgggta gaattcgatc tgtctatcca 1260
gttgcgtcac caaatgaatg caaccaaatg tgcctttcaa ctctcatgaa gtgtgatcat 1320
tgtggtgaaa cttcatggca gacgggcgat tttgttaaag ccacttgcga attttgtggc 1380
actgagaatt tgactaaaga aggtgccact acttgtggtt acttacccca aaatgctgtt 1440
gttaaaattt actgtccagc atgtcacaat tcagaagtag gacctgagca tagtcttgcc 1500
gaataccata atgaatctgg cttgaaaacc attcttcgta agggtggtcg cactattgct 1560
tttggaggat gtgtgttctc ttatgttggt tgccataaca agtgtgctta ttgggttcca 1620
cgtgcttcag ctaacatagg ttgtaaccat acaggtgttg ttggagaagg ttccgaaggt 1680
cttaatgaca accttcttga aatactccaa aaagagaaag tcaacatcaa tattgttggt 1740
gactttaaac ttaatgaaga gatcgccatt attttggcat ctttttctgc ttccacaagt 1800
gcttttgtgg aaactgtgaa aggtttggat tataaagcat tcaaacagat tgttgaatcc 1860
tgtggtaatt ttaaggttac aaagggaaaa gctaaaaaag gtgcctggaa tattggtgaa 1920
cagaaatcaa tactgagtcc tctttatgca tttgcatcag aggctgctcg tgttgtacga 1980
tcaattttct cccgcactct tgaaactgct caaaattctg tgcgtgtttt acagaaggcc 2040
gctataacaa tactagatgg aatttcacag tattcactga gactcattga tgctatgatg 2100
ttcacatctg atttggctac taacaatcta gttgtaatgg cctacattac aggtggtgtt 2160
gttcagttga cttcgcagtg gctaactaac atctttggca ctgtttatga aaaactcaaa 2220
cccgtccttg attggcttga agagaagttt aaggaaggtg tagagtttct tagagacggt 2280
tgggagattg ttaaattcat ctcaacctgt gcttgtgaaa ttgtcggtgg acaaattgtc 2340
acctgtgcta aggaaattaa ggagagtgtt cagacattct ttaagcttgt aaacaagttt 2400
ttggctttgt gtgctgactc tatcattatt ggtggagcta aacttaaagc cttgaattta 2460
ggtgaaacat ttgtcacgca ctcaaaggga ttgtacagaa agtgtgttaa atccagagaa 2520
gaaactggcc tactcatgcc tctaaaagcc ccaaaagaaa ttatcttctt agagggagaa 2580
acacttccca cagaagtgtt aacagaggaa gttgtcttga aaactggtga tttacaacca 2640
ttagaacaac ctactagtga agctgttgaa gctccattgg ttggtacacc agtttgtatt 2700
aacgggctta tgttgctcga aatcaaagac acagaaaagt actgtgccct tgcacctaat 2760
atgatggtaa caaacaatac cttcacactc aaaggcggtg caccaacaaa ggttactttt 2820
ggtgatgaca ctgtgataga agtgcaaggt tacaagagtg tgaatatcac ttttgaactt 2880
gatgaaagga ttgataaagt acttaatgag aagtgctctg cctatacagt tgaactcggt 2940
acagaagtaa atgagttcgc ctgtgttgtg gcagatgctg tcataaaaac tttgcaacca 3000
gtatctgaat tacttacacc actgggcatt gatttagatg agtggagtat ggctacatac 3060
tacttatttg atgagtctgg tgagtttaaa ttggcttcac atatgtattg ttctttctac 3120
cctccagatg aggatgaaga agaaggtgat tgtgaagaag aagagtttga gccatcaact 3180
caatatgagt atggtactga agatgattac caaggtaaac ctttggaatt tggtgccact 3240
tctgctgctt tacaacctga agaagaacaa gaagaagatt ggttagatga tgatagtcaa 3300
caaactgttg gtcaacaaga cggcagtgag gacaatcaga caactactat tcaaacaatt 3360
gttgaggttc aacctcaatt agagatggaa cttacaccag ttgttcagac tattgaagtg 3420
aatagtttta gtggttatct taaacttact gacaatgtat acatcaagaa tgcagacatt 3480
gtggaagaag ctaaaaaggt aaaaccaaca gtggttgtta atgcagccaa tgtttacctt 3540
aaacatggag gaggtgttgc aggagcctta aataaggcta ctaacaatgc catgcaagtt 3600
gaatctgatg attacatagc tactaatgga ccacttaaag tgggtggtag ttgtgtttta 3660
agcggacaca atcttgctaa acactgttta catgttgtcg gcccaaatgt taacaaaggt 3720
gaagatattc aacttcttaa gagtgcttat gaaaatttta accagcacga agttctactt 3780
gcaccattat tatcagctgg tatttttggt gctgacccta tacattcttt aagagtttgt 3840
gtagatactg ttcgcacaaa tgtctactta gctgtctttg ataaaaatct ctatgacaaa 3900
cttgtttcaa gctttttgga aatgaagagt gaaaagcaag ttgaacaaaa gatcgctgag 3960
attcctaaag aggaagttaa gccatttata actgaaagta aaccttcagt tgaacagaga 4020
aaacaagatg ataagaagat caaagcttgt gttgaagaag ttacaacaac tctggaagaa 4080
actaagttcc tcacagaaaa cttgctcctt tatatcgaca ttaatggcaa tcttcatcca 4140
gattctgcca ctcttgttag tgacattgac atcactttct taaagaaaga tgctccatat 4200
atagtgggtg atgttgttca agagggtgtt ttaactgctg tggttatacc tactaaaaag 4260
gctggtggca ctactgaaat gctagcgaaa gctttgagaa aagtgccaac agacaattat 4320
ataaccactt acccgggtca gggtttaaat ggttacactg tagaggaggc aaagacagtg 4380
cttaaaaagt gtaaaagtgc cttttacatt ctaccatcta ttatctctaa tgagaagcaa 4440
gaaattcttg gaactgtttc ttggaatttg cgagaaatgc ttgcacatgc agaagaaaca 4500
cgcaaattaa tgcctgtctg tgtggaaact aaagccatag tttcaactat acagcgtaaa 4560
tataagggta tcaagataca agagggtgtg gttgattatg gtgctagatt ttacttttac 4620
accagtaaaa caactgtagc gtcacttatc aacacactta acgatctaaa tgaaactctt 4680
gttacaatgc cacttggcta tgtaacacat ggcttaaatt tggaagaagc tgctcggtat 4740
atgagatctc tcaaagtgcc agctacagtt tctgtttctt cacctgatgc tgttacagcg 4800
tataatggtt atcttacttc ttcttctaaa acacctgaag aacattttat tgaaaccatc 4860
tcacttgctg gttcctataa agattggtcc tattctggac aatctacaca actaggtata 4920
gaatttctta agagaggtga taaaagtgta tattacacgt ccaatcctac cacattccac 4980
ctagatggtg aagttatcac ctttgacaat cttaagacac ttctttcttt gagagaagtg 5040
aggactatta aggtgtttac aacagtagac aacattaacc tccacacgca agttgtggac 5100
atgtcaatga catatggaca acagtttggt ccaacttatt tggatggagc tgatgttact 5160
aagataaaac ctcataactc acatgaaggt aaaacatttt acgttttgcc taatgatgac 5220
actctacgtg ttgaggcttt tgagtactac cacacaactg atcctagttt tctgggtagg 5280
tacatgtcag cattaaatca cactaaaaag tggaaatacc cacaagttaa tggtttaact 5340
tcgattaaat gggcagataa caactgttat cttgccactg cattgttaac actccaacaa 5400
atagagttga agtttaatcc acctgctcta caagatgctt attacagagc aagggctggt 5460
gaagctgcta acttttgtgc acttatctta gcctactgta ataagacagt aggtgagtta 5520
ggtgatgtta gagaaacaat gagttacttg tttcaacatg ccaatttaga ttcttgcaaa 5580
agagtcttga acgtggtgtg taaaacttgt ggacaacagc agacaaccct taagggtgta 5640
gaagctgtta tgtacatggg cacactttct tatgaacaat tcaagaaagg tgttcagata 5700
ccttgtacgt gtggtaaaca agctacaaaa tatctagtac aacaggagtc accttttgtt 5760
atgatgtcag caccacctgc tcagtatgaa cttaagcatg gtacatttac ttgtgctagt 5820
gagtacactg gtaattacca gtgtggtcac tataagcata taacttctaa ggaaactttg 5880
tattgcatag acggtgcttt acttacaaag tcctcagaat acaaaggtcc tattacggat 5940
gttttctaca aagaaaacag ttacacaaca accataaaac cagttactta taagttggat 6000
ggtgttgttt gtacagaaat tgaccctaag ttggacaatt attataagaa ggacaactct 6060
tatttcacag agcaaccaat tgatcttgta ccaaaccaac catatccaaa cgcaagcttc 6120
gataatttta agttcgtatg cgataatatc aaatttgctg atgatctcaa ccagttaact 6180
ggttataaga aacctgcttc aagagagctt aaagttacat ttttccctga cttaaatggt 6240
gatgtggtgg ctattgatta taaacactac acaccctctt ttaagaaagg agctaaattg 6300
ttacataagc ctattgtttg gcatgttaac aatgcaacta ataaagccac gtataaacca 6360
aatacctggt gtatacgttg tctttggagc acaaaaccag ttgaaacatc aaattcgttt 6420
gatgtactga agtcagagga cgcgcaggga atggataatc ttgcatgtga agatctaaaa 6480
ccagtctctg aagaagtagt ggaaaatcct accatacaga aagacgttct tgagtgtaat 6540
gtgaaaacta ccgaagttgt aggagacatt atacttaaac cagcaaataa tagtttgaag 6600
atcacagaag aggttggcca cacagatcta atggctgctt atgtagacaa ttctagtctt 6660
actattaaga aacctaatga actctctaga gtattaggtt tgaaaaccct tgctactcat 6720
ggtttagctg ctgttaatag tgtcccttgg gatactatag ctaattatgc taagcctttt 6780
cttaacaaag ttgttagtac aactactaac atagttacac ggtgtcttaa tcgtgtttgt 6840
actaattata tgccttactt ctttacttta ttgctacaat tgtgtacttt tactagaagt 6900
acaaattcta gaatcaaggc atctatgccg actactatag caaagaatac tgttaagagt 6960
gtcggtaaat tttgtctaga ggcttcattt aattatctca agtcacctaa cttttctaag 7020
ctgataaaca ttatcatctg gtttttgcta ttaagtgttt gcctaggttc tttaatctac 7080
tcaaccgctg ctttaggtgt tttaatgtct aatttaggca tgccttctta ctgtactggt 7140
tacagagaag gctatttgaa ctctactaat gtcactattg caacctactg tactggatct 7200
ataccttgta gtgtttgtct tagtggttta gattctttag acacctatcc ttctcttgaa 7260
actatacaga ttaccatttc atctttcaaa tgggatttaa ctgcttttgg cttagttgca 7320
gagtggtttt tggcatatat tcttttcact aggtttttct atgtacttgg attggctgca 7380
atcatgcaat tgtttttcag ctattttgca gtccatttta ttagtaactc ttggcttatg 7440
tggcttataa ttaatcttgt gcagatggcc ccgatttcag ctatggttag aatgtacatc 7500
ttctttgcct cattttatta tgtgtggaaa agttatgtgc atgttgtaga cggttgtaat 7560
tcatcaactt gtatgatgtg ttacaaacgt aatagagcaa caagagtcga atgtacaact 7620
attgttaatg gtgttagaag gtccttttat gtctatgcta atggaggtaa aggcttttgc 7680
aaactacaca attggaattg tgttaattgt gatacattct gtgctggtag tacatttatt 7740
agtgatgaag ttgcgagaga cttgtcacta cagtttaaaa gaccaataaa tcctactgac 7800
caatcttctt acatcgttga tagtgttaca gtgaagaatg gttccatcca tctttacttt 7860
gataaagctg gtcaaaagac ttatgaaaga cattctctct ctcattttgt taacttagac 7920
aacctgagag ctaataacac taaaggttca ttgcctatta atgttatcgt tttcgacggt 7980
aaatcaaaat gtgaagaatc atctgcaaaa tcagcgtctg tttactacag tcagcttatg 8040
tgtcaaccta tactgttact agatcaggca ttagtgtctg atgttggtga tagtgcggaa 8100
gttgcagtta aaatgtttga tgcttacgtt aatacgtttt catcaacttt taacgtacca 8160
atggaaaaac tcaaaacact agttgcaact gcagaagctg aacttgcaaa gaatgtgtcc 8220
ttagacaatg tcttatctac gtttatttca gcagctcggc aagggtttgt tgattcagat 8280
gtagaaacta aagatgttgt tgaatgtctt aaattgtcac atcaatctga catagaagtt 8340
actggcgata gttgtaataa ctatatgctc acctataaca aagttgaaaa catgacaccc 8400
cgtgaccttg gtgcttgtat tgactgtagt gctagacata ttaatgcgca ggtagcaaaa 8460
agtcacaaca ttgctttgat atggaacgtt aaagatttca tgtcattgtc tgaacaacta 8520
cgaaaacaaa tacgtagtgc tgctaaaaag aataacttac ccttcaagtt gacatgtgca 8580
actactagac aagttgttaa tgttgtaaca acaaagatag cacttaaggg tggtaaaatt 8640
gtgaataact ggttgaagca gcttattaaa gttacacttg tgttcctttt tgttgctgct 8700
attttctatc tgataacacc tgttcatgtc atgtctaaac atactgactt ttcaagtgaa 8760
atcataggat acaaggctat tgatggtggt gtcactcgtg acatagcatc tacagatact 8820
tgttttgcta acaaacatgc tgattttgac acatggttta gccagcgtgg tggtagttat 8880
actaatgaca aagcttgccc attgattgct gcagtcataa caagagaagt gggttttgtc 8940
gttcctggtt tgcctggaac gatattacgc acaactaatg gtgacttttt gcatttctta 9000
cctagagttt ttagtgcagt tggtaacatc tgttacacac catcaaaact tatagagtac 9060
actgactttg caacatcagc ttgtgttttg gctgctgaat gtacaatttt taaagacgct 9120
tctggtaagc cagtaccata ttgttatgat accaatgtac tagaaggttc tgttgcttat 9180
gaaagtttac gccctgacac acgttatgtg ctcatggatg gctctattat tcaatttcct 9240
aacacctacc ttgaaggttc tgtaagagtg gtaacaactt ttgattctga gtactgtagg 9300
cacggcactt gtgaaagatc agaagctggt gtttgtgtat ctactagtgg tagatgggta 9360
cttaacaacg attattacag atctttacca ggagttttct gtggtgtaga tgctgtaaat 9420
ttgcttacta acatgtttac accactaatt caacctattg gtgctttgga catatcagca 9480
tctatagtag ctggtggtat tgtagctatc gtagtaacat gccttgccta ctattttatg 9540
aggtttagac gtgcttttgg tgaatacagt catgtagttg cctttaatac tctcctattc 9600
cttatgtcat tcactgtact ctgtttaaca ccagtttact cattcttacc tggtgtttat 9660
tctgttattt acctgtactt gacattttat ctgactaatg atgtttcttt tctcgcacat 9720
attcagtgga tggttatgtt cacaccttta gtacctttct ggataacaat tgcttacatc 9780
atttgtattt ccacaaagca tttctattgg ttctttagta attacctaaa gagacgtgta 9840
gtctttaatg gtgtttcctt tagtactttt gaagaagctg cgctgtgcac ctttttgtta 9900
aataaggaga tgtatctaaa gttgcgtagt gatgtgctat tacctcttac gcaatataat 9960
agatacttag ctctttataa caagtacaag tatttcagtg gagcaatgga tacaactagc 10020
tacagagaag ctgcttgttg tcatctcgca aaggctctca atgacttcag taactcaggt 10080
tctgatgttc tttaccaacc accacaaacc tctatcacct cagctgtttt gcagagtggt 10140
tttagaaaaa tggcattccc atctggtaaa gttgagggtt gtatggtaca agtaacttgt 10200
ggtacaacta cacttaacgg tctttggctt gatgacgtag tttactgtcc aagacatgtg 10260
atctgcacct ctgaagatat gcttaaccct aattatgaag atctactcat ccgtaagtct 10320
aatcataact tcttggtaca ggctggtaat gttcaactca gggttattgg acattctatg 10380
caaaattgtg tacttaagct taaggttgat acagccaatc ctaagacacc taagtataag 10440
tttgttcgca ttcaaccagg acagactttt tcagtgttag cttgttacaa tggttcacca 10500
tctggtgttt accaatgtgc tatgaggccc aatttcacta ttaagggttc attccttaat 10560
ggttcatgtg gtagtgttgg ttttaacata gattatgact gtgtctcttt ttgttacatg 10620
caccatatgg aattaccaac tggagttcat gctggcacag acttagaagg taacttttat 10680
ggaccttttg ttgacaggca aacagcacaa gcagctggta cagatacaac tattacagtt 10740
aatgttcttg cttggttgta cgctgctgtt ataaatggag acaggtggtt tctcaatcga 10800
tttaccacaa ctcttaatga ctttaacctt gtggctatga agtacaatta tgaacctcta 10860
acacaagacc atgttgacat actaggacct ctttctgctc aaactggaat tgccgtttta 10920
gatatgtgtg cttcattaaa agaacttctg caaaatggta tgaatggacg taccatattg 10980
ggtagtgctt tattagaaga tgagtttaca ccttttgatg ttgttagaca atgctcaggt 11040
gttactttcc aaagtgcagt gaaaagaaca atcaagggta cacaccactg gttgttactc 11100
acaattttga cttcactttt agttttagtc cagagtactc aatggtcttt gttctttttc 11160
ttctacgaaa atgccttttt accttttgct atgggtatta ttgctatgtc tgcttttgca 11220
atgatgtttg tcaaacataa gcatgcattt ctctgtttgt ttttgttacc ttctcttgcc 11280
actgtagctt actttaatat ggtctacatg cctgctagtt gggtgatgcg tattatgaca 11340
tggttggata tggttgatac tagtttgtct ggttttaagc taaaagactg tgttatgtat 11400
gcatcagctg tagtgttact aatccttatg acagcaagaa ctgtgtatga tgatggtgct 11460
aggagagtgt ggacacttat gaatgtcttg acactcgttt ataaagttta ctatggcaac 11520
gctttagatc aagccatttc catgtgggct cttataatct ctgttacttc taactactca 11580
ggtgtagtta caactgtcat gtttttggcc agaggtattg tttttatgtg tgttgagtat 11640
tgccctattt tcttcataac tggtaataca cttcagtgta taatgctagt ctattgtttc 11700
ttaggctatt tttgtacttg ttacttcggc ctcttttgtt tactcaaccg ctactttaga 11760
ctgactcttg gtgtttatga ttacttagtg tctacacagg agtttagata tatgaattca 11820
cagggactac tcccacccaa gaatagcata gatgccttca aactcaacat taaattgttg 11880
ggtgttggtg gcaaaccttg tatcaaagta gccactgtac agtctaaaat gtcagatgta 11940
aagtgcacat cagtagtctt actctcagtt ttgcaacaac tcagagtaga atcatcatct 12000
aaattgtggg ctcaatgtgt ccagttacac aatgacattc tcttagctaa agatactact 12060
gaagcctttg aaaaaatggt ttcactactt tctgttttgc tttccatgca gggtgctgta 12120
gacataaaca agctttgtga agaaatgctg gacaacaggg caaccttaca agctatagcc 12180
tcagagttta gttcccttcc atcatatgca gcttttgcta ctgctcaaga agcttatgag 12240
caggctgttg ctaatggtga ttctgaagtt gttcttaaaa agttgaagaa gtctttgaat 12300
gtggctaaat ctgaatttga ccgtgatgca gccatgcaac gtaagttgga aaagatggct 12360
gatcaagcta tgacccaaat gtataaacag gctagatctg aggacaagag ggcaaaagtt 12420
actagtgcta tgcagacaat gcttttcact atgcttagaa agttggataa tgatgcactc 12480
aacaacatta tcaacaatgc aagagatggt tgtgttccct tgaacataat acctcttaca 12540
acagcagcca aactaatggt tgtcatacca gactacaaca catataagaa tacgtgtgat 12600
ggtacaacat ttacttatgc atcagcattg tgggaaatcc aacaggttgt agatgcagat 12660
agtaaaattg ttcagcttag tgaaattagt atggacaatt cacctaattt agcatggcct 12720
cttattgtaa cagctttaag ggccaattct gctgtcaaat tacagaataa tgagcttagt 12780
cctgttgcac taagacaaat gtcttgtgct gccggtacta cacaaactgc ttgcactgat 12840
gacaatgcgt tagcttacta caacacaaca aagggaggta ggtttgtact tgcactgtta 12900
tccgatttac aggatttgaa atgggctaga ttccctaaga gtgatggaac tggtactatc 12960
tatacagaac tggaaccacc ttgtaggttt gttacagaca cacctaaagg tcctaaagtg 13020
aagtatcttt acttcatcaa aggattaaac aacctaaata gaggtatggt acttggtagt 13080
ttagctgcca cagtacgttt acaagctggt aatgcaacag aagttcctgc taattcaact 13140
gtactttctt tctgtgcttt tgctgtagat gctgctaaag cttacaaaga ttatctagct 13200
agtgggggac aaccaatcac taattgtgtt aagatgttgt gtacacacac tggtactggt 13260
caggcaataa cagttacacc ggaagccaat atggatcaag aatcctttgg tggtgcatcg 13320
tgttgtctgt actgccgttg tcatatagat catccaaatc ctaaaggatt ttgtgactta 13380
aaaggtaagt atgtacaaat acctacaact tgtgctaatg accctgtggg ttttacactt 13440
aaaaacacag tctgtaccgt ctgcggtatg tggaaaggtt atggttgtag ttgtgatcaa 13500
ctccgcgaac ccatgcttca gtcagctgat gcacaatcgt ttttaaacgg gtttgcggtg 13560
taagtgcagc ccgtcttaca ccgtgcggca caggcactag tactgatgtc gtatatagag 13620
cttttgacat ctacaatgat aaagtagctg gttttgctaa gttcctaaaa actaattgtt 13680
gtcgcttcca agaaaaggac gaagatgaca atctcattga ttcttacttt gtagttaaga 13740
gacacacttt ctctaactac caacatgaag aaacaattta caacctgctt aaggattgtc 13800
cagctgttgc taaacatgac ttctttaagt ttagaataga cggtgacatg gtaccacata 13860
tatcacgtca acgtcttact aaatacacaa tggcagacct cgtctatgct ttaaggcatt 13920
ttgatgaagg taattgtgac acattaaaag aaatacttgt cacatacaat tgttgtgatg 13980
atgactactt caataaaaag gactggtatg attttgtaga aaacccagat atattacgcg 14040
tatacgccaa cttaggtgaa cgtgtacgcc aagctttgtt aaaaacagta cagttctgtg 14100
atgccatgcg aaatgctggt attgttggtg tactgacatt agataatcaa gatctcaatg 14160
gtaactggta tgactttggt gatttcatac aaaccacgcc aggtagtgga gttcctgttg 14220
tagactctta ttattcattg ctcatgccta tattaacctt gaccagggct ttaactgcag 14280
agtcacatgt tgacactgac ttaacaaagc cttacattaa gtgggatttg ttaaaatacg 14340
acttcacgga agagaggtta aaactctttg accgttattt taaatactgg gatcagacat 14400
accacccaaa ttgtgttaac tgtttggatg acagatgcat tctgcattgt gcaaacttta 14460
atgttctgtt ctctacagtg ttcccaccta caagttttgg accactagtg agaaaaatat 14520
ttgttgatgg tgttccattt gtagtttcaa ctggatacca cttcagagag ctaggtgttg 14580
tacataatca ggatgtaaac ttacatagct ctagacttag ttttaaggaa ttacttgtgt 14640
atgctgctga tcctgctatg catgctgctt ctggtaatct attactagat aaacgcacta 14700
cgtgcttttc agtagctgca cttactaaca atgttgcttt tcaaactgtc aaacccggta 14760
attttaacaa ggacttctat gactttgctg tgtctaaggg tttctttaag gaaggaagtt 14820
ctgttgaatt aaaacacttc ttctttgctc aggatggtaa tgctgctatc agcgattatg 14880
actactatcg ttataatcta ccaacaatgt gtgatatcag acaactacta tttgtagttg 14940
aagttgttga taagtacttt gattgttacg atggtggctg tattaatgct aaccaagtca 15000
tcgtcaacaa cctagacaaa tcagctggtt ttccatttaa taaatggggt aaggctagac 15060
tttattatga ttccatgagt tatgaggatc aagatgcact tttcgcatat acaaaacgta 15120
atgtcatccc tactataact caaatgaacc ttaagtatgc cattagtgca aagaatagag 15180
ctcgcaccgt agctggtgtc tctatctgta gtactatgac caatagacag tttcatcaaa 15240
aattactcaa gtcaatagcc gccactagag gagctactgt agtaattgga acaagcaaat 15300
tctatggtgg ttggcacaac atgctcaaaa ctgtttatag tgatgtagaa aaccctcacc 15360
ttatgggttg ggattatcct aaatgtgata gagccatgcc taacatgctt agaattatgg 15420
cctcacttgt tcttgctcgc aaacatacaa cgtgttgtag cttgtcacac cgtttctata 15480
gattagctaa tgagtgtgct caagtattga gtgaaatggt catgtgtggc ggttcactat 15540
atgttaaacc aggtggaacc tcatcaggag atgccacaac tgcttatgct aatagtgtgt 15600
ttaacatttg tcaagctgtc acggccaatg ttaatgcact tttatctact gatggtaaca 15660
aaattgccga taagtatgtc cgcaatttac aacacagact ttatgagtgt ctctatagaa 15720
atagagatgt tgacacagac tttgtgaatg agttttacgc atatttgcgt aaacatttct 15780
caatgatgat actctctgac gatgctgttg tgtgtttcaa tagcacttat gcatctcaag 15840
gtctagtggc tagcataaag aactttaagt cagttcttta ctatcaaaac aacgttttta 15900
tgtctgaagc aaaatgttgg actgagactg accttactaa aggacctcat gaattttgct 15960
ctcaacatac aatgctagtt aaacagggtg atgattatgt gtaccttcct tacccagatc 16020
catcaagaat cctaggtgcc ggttgttttg tagatgatat cgtaaaaaca gatggtacac 16080
ttatgattga acggttcgtg tctttagcta tagatgctta cccacttact aaacatccta 16140
atcaggagta tgctgatgtc tttcatttgt acttacaata catacgtaag ctacatgatg 16200
agttaacagg acacatgtta gacatgtatt ctgttatgct tactaatgat aacacttcaa 16260
ggtattggga acctgagttt tatgaggcta tgtacacacc gcatacagtc ttacaagctg 16320
ttggtgcttg tgttctttgc aattcacaga cttcattaag atgtggtgct tgcatacgta 16380
gaccattctt atgttgtaaa tgctgttacg accatgtcat ctcaacatca cataaattag 16440
tcttgtctgt taatccgtat gtttgcaatg ctccaggttg tgatgtcaca gatgtgactc 16500
aactttactt aggaggtatg agctattact gtaagtcaca taaaccaccc attagttttc 16560
cattgtgtgc taatggacaa gtttttggtc tctacaagaa tacatgtgtt ggtagcgata 16620
atgttactga ctttaatgca attgcaacat gtgactggac aaatgctggt gattacattt 16680
tagctaacac ctgtactgaa agactcaagc tttttgcagc agaaacgctc aaagctactg 16740
aggagacatt taaactgtct tatggtattg ctactgtacg tgaagtgctg tctgacagag 16800
aattacatct ttcatgggaa gttggtaaac ctagaccacc acttaaccga aattatgtct 16860
ttactggtta tcgtgtaact aaaaacagta aagtgcaaat cggagagtac acctttgaaa 16920
aaggtgacta tggtgatgct gttgtttacc gaggtacaac aacttacaaa ctcaacgttg 16980
gtgattattt tgtgctgaca tcacatacag taatgccatt aagtgcacct acactagtgc 17040
cacaagagca ctatgttaga attactggct tatacccaac actcaatatc tcagatgagt 17100
tttctagcaa tgttgcaaat tatcaaaagg ttggtatgca aaagtattct acactccagg 17160
gaccacctgg tactggtaaa agtcattttg ctattggtct agctctctac tacccttctg 17220
ctcgcatagt atatacagct tgctctcatg cagctgttga tgcactatgt gagaaggcat 17280
taaaatattt gcccatagac aaatgtagta gaattatacc tgcacgtgct cgtgtagagt 17340
gttttgataa attcaaggtg aattcaacat tagaacagta tgtcttttgt actgtaaatg 17400
cattgcctga gacgacagca gatatagttg tctttgatga aatttcaatg gccacaaatt 17460
atgatttgag tgttgtcaat gccagattac gtgctaagca ctatgtgtac attggtgatc 17520
ctgctcaatt acctgcacca cgcacattac taactaaggg tacactagaa ccagaatatt 17580
tcaattcagt gtgtagactt atgaaaacta taggtccaga catgttcctc ggaacttgtc 17640
gtagatgtcc tgctgaaatt gttgacactg tgagtgcttt ggtttatgat aataagctta 17700
aggcacataa agacaaatca gctcaatgct ttaaaatgtt ctacaagggt gttatcacgc 17760
atgatgtttc atctgcaatt aacaggccac aaataggcgt ggtaagagaa ttccttacac 17820
gtaaccctgc ttggagaaaa gctgtcttta tttcacctta caattcccag aatgctgtag 17880
cctcaaagat tttgggacta ccaactcaaa ctgttgattc atcacagggc tcagaatatg 17940
actatgtcat attcactcaa accactgaaa cagctcactc ttgtaatgta aacagattca 18000
acgttgctat taccagagca aaagtaggca tactttgcat aatgtctgat agagaccttt 18060
atgacaagtt gcaatttaca agtcttgaaa ttccacgtag gaatgtggca actttacaag 18120
ctgaaaatgt aacaggactc tttaaagatt gtagtaaggt aatcactggg ttacatccta 18180
cacaggcacc tacacactta agtgttgata ctaaattcaa aactgaaggt ttatgtgttg 18240
acatacctgg catacctaag gacatgacct atagaagatt aatctctatg atgggtttca 18300
aaatgaatta ccaggttaat ggttacccta acatgtttat cacccgcgaa gaagctataa 18360
gacatgtacg tgcatggatt ggcttcgatg tcgaaggttg tcatgctact agagaagctg 18420
ttggtaccaa tttaccttta cagctaggtt tttctacagg tgttaaccta gttgctgtac 18480
ctacaggtta tgttgataca cctaataata cagatttttc cagagttagt gctaaaccac 18540
cgcctggaga tcaatttaaa cacctcatac cacttatgta caaaggactt ccttggaatg 18600
tagtgcgtat aaagattgtc caaatgttaa gtgacacact taaaaatctc tctgacagag 18660
tcgtatttgt cttatgggca catggctttg agttgacatc tatgaagtat tttgtgaaga 18720
tcggacctga gcgcacatgt tgtctatgtg atagacgtgc tacatgcttt tccactgctt 18780
cagacactta tgcctgttgg catcattcta ttggatttga ttacgtctat aatccgttta 18840
tgattgatgt tcaacaatgg ggttttacag gtaacctaca aagcaaccat gatctgtatt 18900
gtcaagtcca tggtaatgca catgtagcta gttgtgatgc aatcatgact aggtgtctag 18960
ctgtccacga gtgctttgtt aagcgtgttg actggactat tgaatatcct ataatcggtg 19020
atgaactgaa gattaatgcg gcttgtagaa aggttcaaca catggttgtt aaagctgcat 19080
tattagcaga caaattccca gttcttcacg acattggtaa ccctaaagct attaagtgtg 19140
tacctcaagc tgatgtagaa tggaagttct atgatgcaca gccttgtagt gacaaagctt 19200
acaaaataga agaactgttc tattcttatg ccacacattc tgacaaattc acagatggtg 19260
tatgcctatt ttggaattgc aatgtcgata gatatcctgc taattccatt gtttgtagat 19320
ttgacactag agtgctatct aaccttaact tgcctggttg tgatggtggc agtttgtatg 19380
taaataagca tgcattccac acaccagctt ttgataaaag tgcttttgtt aatctaaagc 19440
aacttccatt tttctattac tctgacagtc catgtgagtc tcatggaaaa caagtagtgt 19500
cagatataga ttatgtacca ctaaagtctg ctacgtgtat aacacgttgc aatttaggtg 19560
gtgctgtctg tagacatcat gctaatgagt acagattgta tctcgatgct tataacatga 19620
tgatctcagc tggctttagc ttgtgggttt acaaacaatt tgatacctat aacctctgga 19680
acacttttac aagacttcag agtttagaaa atgtggcttt taatgttgta aataagggac 19740
actttgatgg acaacagggt gaagtaccag tttctatcat taacaacact gtttacacaa 19800
aagttgatgg tgttgatgta gaattgtttg agaacaaaac cacattacct gttaatgtag 19860
catttgagct ttgggctaag cgcaacatta aaccagtacc agaggtgaaa atactcaata 19920
atttgggtgt ggacattgct gctaatactg tgatctggga ctacaaaaga gatgctccag 19980
cacatatatc tactattggt gtttgttcta tgactgacat agccaagaaa ccaactgaaa 20040
cgatttgtgc accactcact gtcttttttg atggtagagt tgatggtcaa gtagacttat 20100
ttagaaatgc ccgtaatggt gttcttatta cagaaggtag tgttaaaggt ttacaaccat 20160
ctgtaggtcc caaacaagct agtcttaatg gagtcacatt aattggagaa gccgtaaaaa 20220
cacagttcaa ttattacaag aaagtggatg gtgttgtcca acaattacct gaaacttact 20280
ttactcagag tagaaactta caggaattta agcccaggag tcaaatggaa attgatttct 20340
tagaacttgc tatggatgaa ttcattgaac ggtataaatt agaaggctat gccttcgaac 20400
atatcgttta tggagatttt agtcatagtc agttaggtgg tttacatcta ctgattggac 20460
tagctaaacg ttttaaggaa tcaccttttg aacttgaaga ttttattcct atggacagta 20520
cagttaaaaa ctacttcata acagatgcgc aaacaggttc atctaagtgt gtgtgttctg 20580
ttattgatct tttacttgat gacttcgttg aaataataaa gtcccaagat ttatctgtag 20640
tttctaaggt tgtcaaagtg actattgact atacagaaat ctcatttatg ctttggtgta 20700
aagatggcca tgtagaaaca ttttacccaa aattacaatc tagtcaagcg tggcaaccgg 20760
gtgttgctat gcctaatctt tacaaaatgc aaagaatgct attagaaaag tgtgaccttc 20820
aaaattatgg tgatagtgca acattaccta aaggcataat gatgaatgtc gcaaaatata 20880
ctcaactgtg tcaatattta aacacactga cattagctgt accctataat atgagagtta 20940
tccattttgg tgctggttct gataaaggag ttgcaccagg tacagctgtt ttaagacaat 21000
ggttgcctac aggtacgctg cttgtcgatt cagatcttaa tgactttgtc tctgatgcag 21060
attcaacttt gattggtgat tgtgcaactg tacatacagc taataaatgg gatctcatta 21120
ttagtgatat gtacgaccct aagactaaga atgtcacaaa agaaaacgac tctaaagagg 21180
gttttttcac ttacatttgt gggtttatac aacaaaagct agctcttgga ggttccgtgg 21240
ctataaagat aacagaacat tcttggaatg ctgatcttta taagctcatg ggacacttcg 21300
catggtggac agcctttgtt actaatgtga atgcgtcatc atctgaagca tttttaatcg 21360
gatgtaacta ccttggcaaa ccacgcgaac aaatagatgg ttatgtcatg catgcaaatt 21420
acatattttg gaggaataca aatccaattc agctttcttc ttattcttta ttcgacatga 21480
gtaaattccc ccttaaatta aggggtactg ctgttatgtc tttaaaagaa ggtcaaatca 21540
atgatatgat tctctctctt cttagtaaag gtagacttat aattagagaa aacaacagag 21600
ttgttatttc tagtgatgtt cttgttaaca actaaacgaa caatgtttgt ttttcttgtt 21660
ttattgccac tagtctctag tcagtgtgtt aatcttacaa ccagaactca attaccccct 21720
gcatacacta attctttcac acgtggtgtt tattaccctg acaaagtttt cagatcctca 21780
gttttacatt caactcagga cttgttctta cctttctttt ccaatgttac ttggttccat 21840
gctatacatg tctctgggac caatggtact aagaggtttg ataaccctgt cctaccattt 21900
aatgatggtg tttactttgc ttccactgag aagtctaaca taataagagg ctggattttt 21960
ggtactactt tagattcgaa aacccagtcc ctacttattg ttaataacgc tactaatgtt 22020
gttatcaaag tctgtgaatt tcaattttgt aacgatccat ttttgggtgt ttattaccac 22080
aaaaacaaca aaagttggat ggaaagtgag ttcagagttt attctagtgc gaataattgc 22140
acttttgaat acgtctctca gccttttctt atggaccttg aaggaaaaca gggtaatttc 22200
aaaaatctta gggaatttgt gttcaagaat attgatggtt acttcaagat atactctaag 22260
cacacgccta ttaatttagt gcgtgatctc cctcagggtt tttcggcttt agaaccattg 22320
gtagatttgc caataggtat taacatcact aggtttcaaa ctttacttgc tttacataga 22380
agttatttaa ctcctggtga ttcttcttca ggttggacag ctggtgctgc agcttattat 22440
gtgggttatc ttcaacctag gacttttcta ctgaagtaca atgaaaatgg aaccattaca 22500
gatgctgtag actgtgcact tgaccctctc tcagaaacaa agtgtacgtt gaaatccttc 22560
actgtagaaa aaggaatcta tcaaacttct aactttagag tccaaccaac agaatctatt 22620
gttagatttc ctaacatcac aaacttgtgc ccttttggtg aagtttttaa cgccaccaga 22680
tttgcatctg tttatgcttg gaacaggaag agaatcagca actgtgttgc tgattattct 22740
gtcctgtata attccgcatc attttccact tttaagtgtt atggagtgtc tcctactaaa 22800
ttaaatgatc tctgctttac taatgtctat gcagattcat ttgtaattag aggtgatgaa 22860
gtcagacaaa tcgctccagg gcaaactgga aagattgctg attataacta caaattacca 22920
gatgatttta caggctgcgt tatagcttgg aattctaaca atcttgattc taaggttggt 22980
ggtaattata attacctgta cagattgttt aggaagtcta atctcaaacc ttttgagaga 23040
gatatttcaa ctgaaatcta tcaggccggt agcacacctt gtaatggtgt tgaaggtttt 23100
aattgttact ttcctctgca atcatatggt ttccaaccca ctaatggtgt tggttaccaa 23160
ccatacagag tagtagtact ttcttttgaa cttctacatg caccagcaac tgtttgtgga 23220
cctaaaaagt ctactaattt ggttaagaac aagtgtgtca atttcaactt caatggttta 23280
acaggcacag gtgttcttac tgagtctaac aaaaagtttc tgcctttcca acaatttggc 23340
agagacattg ctgacactac tgatgctgtt cgtgatccac aaacacttga gattcttgac 23400
attacaccat gttcttttgg tggtgtcagt gttataacac caggaacaaa tacttctaac 23460
caggttgctg ttctttatca ggatgttaac tgcacagaag tccctgttgc tattcatgca 23520
gatcaactta ctcctacttg gcgtgtttat tctacaggtt ctaatgtttt tcaaacacgt 23580
gcaggctgtt taataggggc tgaacatgtc aacaactcat atgagtgtga catacccatt 23640
ggtgcaggta tatgcgctag ttatcagact cagactaatt ctcctcggag agcaagaagt 23700
gtagctagtc aatccatcat tgcctacact atgtcacttg gtgcagaaaa ttcagttgct 23760
tactctaata actctattgc catacccaca aattttacta ttagcgttac cacagaaatt 23820
ctaccagtgt ctatgaccaa gacatcagta gattgtacaa tgtacatttg tggtgattca 23880
actgaatgca gcaatctttt gttgcaatat ggcagttttt gtacacaatt aaaccgtgct 23940
ttaactggaa tagctgttga acaagacaaa aacacccaag aagtttttgc acaagtcaaa 24000
caaatttaca agacaccacc aattaaagat tttggcggtt ttaattttag ccagatactg 24060
ccagatccat caaaaccaag caagaggtca tttattgaag atctactgtt caacaaagtg 24120
acacttgcag atgctggctt catcaaacaa tatggtgatt gccttggtga tattgctgct 24180
agagacctca tttgtgcaca aaagtttaac ggccttactg ttttgccacc tttgctcaca 24240
gatgaaatga ttgctcaata cacttctgca ctgttagcag gtacaatcac ttctggttgg 24300
acttttggtg caggtgctgc attacaaata ccatttgcta tgcaaatggc ttataggttt 24360
aatggtattg gagttacaca gaatgttctc tatgagaacc aaaaattgat tgccaaccaa 24420
tttaatagtg ctattggcaa aattcaagac tcactttctt ccacagcaag tgcacttgga 24480
aaacttcaag atgtggtcaa ccaaaatgca caagctttaa acacgcttgt taaacaactt 24540
agctccaatt ttggtgcaat ttcaagtgtt ttaaacgaca tcctttcacg tcttgacaaa 24600
gttgaggctg aagtgcaaat tgataggttg atcacaggca gacttcaaag tttgcagaca 24660
tatgtgactc aacaattaat tagagctgca gaaatcagag cttctgctaa tcttgctgct 24720
actaaaatgt cagagtgtgt acttggacaa tcaaaaagag ttgacttttg cggaaagggc 24780
tatcatctta tgtcatttcc tcagtcagca cctcatggtg tcgtcttttt gcatgtgact 24840
tatgtccctg cacaagaaaa gaacttcaca actgctcctg ccatttgtca tgatggaaaa 24900
gcacactttc ctcgtgaagg tgtctttgtt tcaaatggca cacactggtt tgtaacacaa 24960
aggaattttt atgaaccaca aatcattact acagacaaca catttgtgtc tggtaactgt 25020
gatgttgtaa taggaattgt caacaacaca gtttatgatc ctttgcaacc tgaattagac 25080
tcattcaagg aggagcttga taaatacttc aagaaccata cctcaccaga tgttgattta 25140
ggtgacatct ctggcattaa tgcttcagtt gtaaacattc agaaagaaat cgaccgcctc 25200
aatgaggttg ccaagaattt aaatgaatct ctcatcgatc tccaagaact tggaaagtat 25260
gagcagtata taaaatggcc atggtacatt tggctaggtt ttatagctgg cttgattgcc 25320
atagtaatgg tgacaattat gctttgctgt atgaccagtt gctgtagttg tctcaagggc 25380
tgttgttctt gtggatcctg ctgcaaattt gacgaggacg actctgagcc agtgctcaaa 25440
ggagtcaaat tacattacac ataaacgaac ttatggattt gtttatgaga atcttcacaa 25500
ttggaactgt aactttgaag caaggtgaaa tcaaggatgc tactccttca gattttgtta 25560
gagctactgc aacgataccg atacaagcat cacttccttt cggatggctt attgttggcg 25620
ttgcacttct tgctgttttt cagagcgctt ccaaaatcat aaccctcaaa aagagatggc 25680
aactagcact ctccaagggt gttcactttg tttgcaactt gctgttgttg tttgtaacag 25740
tttactcaca tcttttgctt gttgctgctg gccttgaagc cccttttctc tatctttatg 25800
ctttagtcta cttcttgcag agtataaact ttgtacgcat aataatgagg ctttggcttt 25860
gctggaaatg ccgttccaaa aacccattac tttatgatgc caactatttt ctttgctggc 25920
atactaattg ttacgactat tgtatacctt acaatagtgt aacttcttca attgtcatta 25980
cttcaggtga tggcacaaca agtcctattt ctgaacatga ctaccagatt ggtggttata 26040
ctgaaaaatg ggaatctgga gtaaaagact gtgttgtatt acacagttac ttcacttcag 26100
actattacca gctgtactca actcaattga gtacagacac tggtgttgaa catgttacct 26160
tcttcatcta caataaaatc gttgatgagc ctgaagaaca tgtccaaatt cacacaatcg 26220
acgtttcatc cggagttgtt aatccagtaa tggaaccaat ttatgatgaa ccgacgacga 26280
ctactagcgt gcctttgtaa gcacaagctg atgagtacga acttatggca gattccaacg 26340
gtactattac cgttgaggag ctgaaaaagc tccttgaaca atggaaccta gtaataggtt 26400
tcctattcct tacatggatt tgcctgctgc aatttgccta tgccaacagg aataggtttt 26460
tgtacatcat taagttgatt ttcctctggc tgttatggcc agtaacttta gcttgttttg 26520
tgcttgctgc tgtttacaga ataaattgga tcaccggtgg aattgctatt gcaatggctt 26580
gtcttgtagg attgatgtgg ctaagctact tcattgcttc tttcagactg tttgcgcgta 26640
cgcgttccat gtggtcattc aatccagaaa ctaacattct tctcaacgtg ccactccatg 26700
gaactattct gactagaccg cttctagaaa gtgaactcgt aatcggagct gttatccttc 26760
gtggacatct tcgtattgct ggacatcatc taggacgctg tgacatcaag gatctaccta 26820
aagaaatcac tgttgctaca tcacgaacgc tttcttatta caaattggga gcttcacagc 26880
gtgtagcagg tgattcaggt tttgctgcat atagtcgcta caggattggc aactataaat 26940
taaacacaga ccattccagt agcagtgaca atattgcttt gcttgtacag taaacgaaca 27000
tgaaaattat tcttttcttg gcactgataa cactcgctac ttgtgagctt tatcactacc 27060
aagagtgtgt tagaggtaca acagtacttt taaaagaacc ttgctcgtcg ggaacatacg 27120
agggcaattc accatttcat cctctagctg ataacaaatt tgcactgact tgctttagca 27180
ctcaatttgc ttttgcttgt cctgacggcg taaaacacgt ctatcagtta cgtgccagat 27240
cagtttcacc taaactgttc atcagacaag aggaagttca agaactttac tctccaattt 27300
ttcttattgt tgcggcaata gtgtttataa cactttgctt cacactcaaa agaaagacag 27360
aatgattgaa ctttcattaa ttgacttcta tttgtgcttt ttagcctttc tgctattcct 27420
tgttttaatt atgcttatta tcttttggtt ctcacttgaa ctgcaagatc ataatgaaac 27480
ttgtcacgcc taagacgttc gtgttgtttt agatttcatc taaacgaaca aactaaaatg 27540
tctgataatg gacctcaaaa tcagcgaaat gcacctcgca ttacgtttgg tggaccatca 27600
gattcaactg gcagtaacca gaatggagaa cgaagtggtg cgcgatcaaa acaacgccgc 27660
ccgcaaggtt tacccaataa tactgcgtct tggttcaccg ctctcactca acatggcaag 27720
gaagatttaa aattccctcg aggacaaggc gttccaatta acaccaatag cagtccagat 27780
gaccaaattg gctactaccg ccgcgccaca agacgaattc gtggtggtga tggtaaaatg 27840
aaagatctca gtccaagatg gtatttctac tatctaggaa ctgggccaga agctggactt 27900
ccttatggtg ctaacaaaga tggcatcata tgggttgcaa ctgagggagc cttgaataca 27960
ccaaaagatc acattggcac cagaaatcct gctaacaatg ctgcaatcgt gctacaactt 28020
cctcaaggaa caacattacc aaaaggtttt tacgcagaag ggtctagagg tggaagtcaa 28080
gcctcttcta gatcatcatc acgtagtcgc aacagttcaa gaaattcaac tccaggttca 28140
agtagaggaa cttctcctgc tagaatggct ggaaatggag gtgatgctgc tcttgctttg 28200
ttactacttg acagattgaa ccagcttgag agcaaaatgt ctggtaaagg ccaacaacaa 28260
caaggccaaa ctgtcactaa gaaatctgct gctgaggctt ctaagaagcc tagacaaaaa 28320
cgtactgcca ctaaagcata caatgtaaca caagctttcg gcagacgtgg tccagaacaa 28380
actcaaggaa attttgggga tcaggaacta atcagacaag gaactgatta caaacattgg 28440
ccgcaaattg cacaatttgc tccttctgct tcagcgttct ttggaatgtc gagaattgga 28500
atggaagtca caccttcggg aacatggttg acctatacag gtgccatcaa attggatgac 28560
aaagatccaa atttcaaaga tcaagtcatt ttgctgaata agcatattga cgcatacaaa 28620
acattcccac caacagagcc taaaaaggac aaaaagaaga aggctgatga aactcaagcc 28680
ttaccgcaga gacagaagaa acagcaaact gtgactcttc ttcctgctgc agatttggat 28740
gatttctcca aacaattgca acaatccatg agcagtgctg actcaactca ggcctaaact 28800
catgcagacc acacaaggca gatgggctat ataaacgttt tcgcttttcc gtttacgata 28860
tatagtctac tcttgtgcag aatgaattct cgtaactaca tagcacaagt agatgtagtt 28920
aactttaatc tcacatagca atctttaatc agtgtgtaac attagggagg acttgaaaga 28980
gccaccacat tttcaccgag gccacgcgga gtacgatcga gtgtacagtg aacaatgcta 29040
gggagagctg cctatatgga agagccctaa tgtgtaaaat taattttagt agtgctatcc 29100
ccatgtgatt ttaatagctt cttaggagaa tgacaaaaaa aaacaaaaaa aa 29152
<210> 44
<211> 29968
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthesis optimized
<400> 44
caaatataac gaaaggctca gtcgaaagac tgggcctttc gttttatctg ttgtttgtcg 60
taatacgact cactataggg attaaaggtt tataccttcc caggtaacaa accaaccaac 120
tttcgatctc ttgtagatct gttctctaaa cgaactttaa aatctgtgtg gctgtcactc 180
ggctgcatgc ttagtgcact cacgcagtat aattaataac taattactgt cgttgacagg 240
acacgagtaa ctcgtctatc ttctgcaggc tgcttacggt ttcgtccgtg ttgcagccga 300
tcatcagcac atctaggttt cgtccgggtg tgaccgaaag gtaagatgga gagccttgtc 360
cctggtttca acgagaaaac acacgtccaa ctcagtttgc ctgttttaca ggttcgcgac 420
gtgttagtac gtggttttgg agattcagtg gaagaagtct tatcagaggc acgtcaacat 480
cttaaagatg gcacttgtgg cttagtagaa gttgaaaaag gcgttttgcc tcaacttgaa 540
cagccctatg tgttcatcaa acgttctgat gctagaactg cacctcatgg tcatgttatg 600
gttgagctgg tagcagaatt agaaggtatt cagtacggtc gtagtggtga gacattaggt 660
gttttagttc ctcatgtggg cgaaatacca gtggcttacc gcaaagttct tcttagaaag 720
aacggtaata aaggagctgg tggccatagt tacggcgctg atttaaagtc atttgactta 780
ggcgacgagc ttggcactga tccttatgaa gatttccaag aaaactggaa cactaaacat 840
agcagtggtg ttacccgtga actcatgcgt gagttaaatg gaggtgcata cactcgctat 900
gtcgataaca acttctgtgg acctgatggt taccctcttg agtgcattaa agaccttcta 960
gcacgtgctg gtaaagcttc atgcactttg tccgaacaac tggactttat tgacactaag 1020
aggggtgtat actgctgccg tgaacatgag catgaaattg cttggtacac ggaacgttct 1080
gaaaagagct atgaattgca gacacctttt gaaattaaac tggcaaagaa atttgacacc 1140
ttcaatgggg aatgtccaaa ttttgtattt cccctcaatt ccataatcaa gactattcaa 1200
ccaagggttg aaaagaaaaa gcttgatggc tttatgggta gaattcgatc tgtctatcca 1260
gttgcgtcac caaatgaatg caaccaaatg tgcctttcaa ctctcatgaa gtgtgatcat 1320
tgtggtgaaa cttcatggca gacgggcgat tttgttaaag ccacttgcga attttgtggc 1380
actgagaatt tgactaaaga aggtgccact acttgtggtt acttacccca aaatgctgtt 1440
gttaaaattt actgtccagc atgtcacaat tcagaagtag gacctgagca tagtcttgcc 1500
gaataccata atgaatctgg cttgaaaacc attcttcgta agggtggtcg cactattgct 1560
tttggaggat gtgtgttctc ttatgttggt tgccataaca agtgtgctta ttgggttcca 1620
cgtgcttcag ctaacatagg ttgtaaccat acaggtgttg ttggagaagg ttccgaaggt 1680
cttaatgaca accttcttga aatactccaa aaagagaaag tcaacatcaa tattgttggt 1740
gactttaaac ttaatgaaga gatcgccatt attttggcat ctttttctgc ttccacaagt 1800
gcttttgtgg aaactgtgaa aggtttggat tataaagcat tcaaacagat tgttgaatcc 1860
tgtggtaatt ttaaggttac aaagggaaaa gctaaaaaag gtgcctggaa tattggtgaa 1920
cagaaatcaa tactgagtcc tctttatgca tttgcatcag aggctgctcg tgttgtacga 1980
tcaattttct cccgcactct tgaaactgct caaaattctg tgcgtgtttt acagaaggcc 2040
gctataacaa tactagatgg aatttcacag tattcactga gactcattga tgctatgatg 2100
ttcacatctg atttggctac taacaatcta gttgtaatgg cctacattac aggtggtgtt 2160
gttcagttga cttcgcagtg gctaactaac atctttggca ctgtttatga aaaactcaaa 2220
cccgtccttg attggcttga agagaagttt aaggaaggtg tagagtttct tagagacggt 2280
tgggagattg ttaaattcat ctcaacctgt gcttgtgaaa ttgtcggtgg acaaattgtc 2340
acctgtgcta aggaaattaa ggagagtgtt cagacattct ttaagcttgt aaacaagttt 2400
ttggctttgt gtgctgactc tatcattatt ggtggagcta aacttaaagc cttgaattta 2460
ggtgaaacat ttgtcacgca ctcaaaggga ttgtacagaa agtgtgttaa atccagagaa 2520
gaaactggcc tactcatgcc tctaaaagcc ccaaaagaaa ttatcttctt agagggagaa 2580
acacttccca cagaagtgtt aacagaggaa gttgtcttga aaactggtga tttacaacca 2640
ttagaacaac ctactagtga agctgttgaa gctccattgg ttggtacacc agtttgtatt 2700
aacgggctta tgttgctcga aatcaaagac acagaaaagt actgtgccct tgcacctaat 2760
atgatggtaa caaacaatac cttcacactc aaaggcggtg caccaacaaa ggttactttt 2820
ggtgatgaca ctgtgataga agtgcaaggt tacaagagtg tgaatatcac ttttgaactt 2880
gatgaaagga ttgataaagt acttaatgag aagtgctctg cctatacagt tgaactcggt 2940
acagaagtaa atgagttcgc ctgtgttgtg gcagatgctg tcataaaaac tttgcaacca 3000
gtatctgaat tacttacacc actgggcatt gatttagatg agtggagtat ggctacatac 3060
tacttatttg atgagtctgg tgagtttaaa ttggcttcac atatgtattg ttctttctac 3120
cctccagatg aggatgaaga agaaggtgat tgtgaagaag aagagtttga gccatcaact 3180
caatatgagt atggtactga agatgattac caaggtaaac ctttggaatt tggtgccact 3240
tctgctgctt tacaacctga agaagaacaa gaagaagatt ggttagatga tgatagtcaa 3300
caaactgttg gtcaacaaga cggcagtgag gacaatcaga caactactat tcaaacaatt 3360
gttgaggttc aacctcaatt agagatggaa cttacaccag ttgttcagac tattgaagtg 3420
aatagtttta gtggttatct taaacttact gacaatgtat acatcaagaa tgcagacatt 3480
gtggaagaag ctaaaaaggt aaaaccaaca gtggttgtta atgcagccaa tgtttacctt 3540
aaacatggag gaggtgttgc aggagcctta aataaggcta ctaacaatgc catgcaagtt 3600
gaatctgatg attacatagc tactaatgga ccacttaaag tgggtggtag ttgtgtttta 3660
agcggacaca atcttgctaa acactgttta catgttgtcg gcccaaatgt taacaaaggt 3720
gaagatattc aacttcttaa gagtgcttat gaaaatttta accagcacga agttctactt 3780
gcaccattat tatcagctgg tatttttggt gctgacccta tacattcttt aagagtttgt 3840
gtagatactg ttcgcacaaa tgtctactta gctgtctttg ataaaaatct ctatgacaaa 3900
cttgtttcaa gctttttgga aatgaagagt gaaaagcaag ttgaacaaaa gatcgctgag 3960
attcctaaag aggaagttaa gccatttata actgaaagta aaccttcagt tgaacagaga 4020
aaacaagatg ataagaagat caaagcttgt gttgaagaag ttacaacaac tctggaagaa 4080
actaagttcc tcacagaaaa cttgctcctt tatatcgaca ttaatggcaa tcttcatcca 4140
gattctgcca ctcttgttag tgacattgac atcactttct taaagaaaga tgctccatat 4200
atagtgggtg atgttgttca agagggtgtt ttaactgctg tggttatacc tactaaaaag 4260
gctggtggca ctactgaaat gctagcgaaa gctttgagaa aagtgccaac agacaattat 4320
ataaccactt acccgggtca gggtttaaat ggttacactg tagaggaggc aaagacagtg 4380
cttaaaaagt gtaaaagtgc cttttacatt ctaccatcta ttatctctaa tgagaagcaa 4440
gaaattcttg gaactgtttc ttggaatttg cgagaaatgc ttgcacatgc agaagaaaca 4500
cgcaaattaa tgcctgtctg tgtggaaact aaagccatag tttcaactat acagcgtaaa 4560
tataagggta tcaagataca agagggtgtg gttgattatg gtgctagatt ttacttttac 4620
accagtaaaa caactgtagc gtcacttatc aacacactta acgatctaaa tgaaactctt 4680
gttacaatgc cacttggcta tgtaacacat ggcttaaatt tggaagaagc tgctcggtat 4740
atgagatctc tcaaagtgcc agctacagtt tctgtttctt cacctgatgc tgttacagcg 4800
tataatggtt atcttacttc ttcttctaaa acacctgaag aacattttat tgaaaccatc 4860
tcacttgctg gttcctataa agattggtcc tattctggac aatctacaca actaggtata 4920
gaatttctta agagaggtga taaaagtgta tattacacgt ccaatcctac cacattccac 4980
ctagatggtg aagttatcac ctttgacaat cttaagacac ttctttcttt gagagaagtg 5040
aggactatta aggtgtttac aacagtagac aacattaacc tccacacgca agttgtggac 5100
atgtcaatga catatggaca acagtttggt ccaacttatt tggatggagc tgatgttact 5160
aagataaaac ctcataactc acatgaaggt aaaacatttt acgttttgcc taatgatgac 5220
actctacgtg ttgaggcttt tgagtactac cacacaactg atcctagttt tctgggtagg 5280
tacatgtcag cattaaatca cactaaaaag tggaaatacc cacaagttaa tggtttaact 5340
tcgattaaat gggcagataa caactgttat cttgccactg cattgttaac actccaacaa 5400
atagagttga agtttaatcc acctgctcta caagatgctt attacagagc aagggctggt 5460
gaagctgcta acttttgtgc acttatctta gcctactgta ataagacagt aggtgagtta 5520
ggtgatgtta gagaaacaat gagttacttg tttcaacatg ccaatttaga ttcttgcaaa 5580
agagtcttga acgtggtgtg taaaacttgt ggacaacagc agacaaccct taagggtgta 5640
gaagctgtta tgtacatggg cacactttct tatgaacaat tcaagaaagg tgttcagata 5700
ccttgtacgt gtggtaaaca agctacaaaa tatctagtac aacaggagtc accttttgtt 5760
atgatgtcag caccacctgc tcagtatgaa cttaagcatg gtacatttac ttgtgctagt 5820
gagtacactg gtaattacca gtgtggtcac tataagcata taacttctaa ggaaactttg 5880
tattgcatag acggtgcttt acttacaaag tcctcagaat acaaaggtcc tattacggat 5940
gttttctaca aagaaaacag ttacacaaca accataaaac cagttactta taagttggat 6000
ggtgttgttt gtacagaaat tgaccctaag ttggacaatt attataagaa ggacaactct 6060
tatttcacag agcaaccaat tgatcttgta ccaaaccaac catatccaaa cgcaagcttc 6120
gataatttta agttcgtatg cgataatatc aaatttgctg atgatctcaa ccagttaact 6180
ggttataaga aacctgcttc aagagagctt aaagttacat ttttccctga cttaaatggt 6240
gatgtggtgg ctattgatta taaacactac acaccctctt ttaagaaagg agctaaattg 6300
ttacataagc ctattgtttg gcatgttaac aatgcaacta ataaagccac gtataaacca 6360
aatacctggt gtatacgttg tctttggagc acaaaaccag ttgaaacatc aaattcgttt 6420
gatgtactga agtcagagga cgcgcaggga atggataatc ttgcatgtga agatctaaaa 6480
ccagtctctg aagaagtagt ggaaaatcct accatacaga aagacgttct tgagtgtaat 6540
gtgaaaacta ccgaagttgt aggagacatt atacttaaac cagcaaataa tagtttgaag 6600
atcacagaag aggttggcca cacagatcta atggctgctt atgtagacaa ttctagtctt 6660
actattaaga aacctaatga actctctaga gtattaggtt tgaaaaccct tgctactcat 6720
ggtttagctg ctgttaatag tgtcccttgg gatactatag ctaattatgc taagcctttt 6780
cttaacaaag ttgttagtac aactactaac atagttacac ggtgtcttaa tcgtgtttgt 6840
actaattata tgccttactt ctttacttta ttgctacaat tgtgtacttt tactagaagt 6900
acaaattcta gaatcaaggc atctatgccg actactatag caaagaatac tgttaagagt 6960
gtcggtaaat tttgtctaga ggcttcattt aattatctca agtcacctaa cttttctaag 7020
ctgataaaca ttatcatctg gtttttgcta ttaagtgttt gcctaggttc tttaatctac 7080
tcaaccgctg ctttaggtgt tttaatgtct aatttaggca tgccttctta ctgtactggt 7140
tacagagaag gctatttgaa ctctactaat gtcactattg caacctactg tactggatct 7200
ataccttgta gtgtttgtct tagtggttta gattctttag acacctatcc ttctcttgaa 7260
actatacaga ttaccatttc atctttcaaa tgggatttaa ctgcttttgg cttagttgca 7320
gagtggtttt tggcatatat tcttttcact aggtttttct atgtacttgg attggctgca 7380
atcatgcaat tgtttttcag ctattttgca gtccatttta ttagtaactc ttggcttatg 7440
tggcttataa ttaatcttgt gcagatggcc ccgatttcag ctatggttag aatgtacatc 7500
ttctttgcct cattttatta tgtgtggaaa agttatgtgc atgttgtaga cggttgtaat 7560
tcatcaactt gtatgatgtg ttacaaacgt aatagagcaa caagagtcga atgtacaact 7620
attgttaatg gtgttagaag gtccttttat gtctatgcta atggaggtaa aggcttttgc 7680
aaactacaca attggaattg tgttaattgt gatacattct gtgctggtag tacatttatt 7740
agtgatgaag ttgcgagaga cttgtcacta cagtttaaaa gaccaataaa tcctactgac 7800
caatcttctt acatcgttga tagtgttaca gtgaagaatg gttccatcca tctttacttt 7860
gataaagctg gtcaaaagac ttatgaaaga cattctctct ctcattttgt taacttagac 7920
aacctgagag ctaataacac taaaggttca ttgcctatta atgttatcgt tttcgacggt 7980
aaatcaaaat gtgaagaatc atctgcaaaa tcagcgtctg tttactacag tcagcttatg 8040
tgtcaaccta tactgttact agatcaggca ttagtgtctg atgttggtga tagtgcggaa 8100
gttgcagtta aaatgtttga tgcttacgtt aatacgtttt catcaacttt taacgtacca 8160
atggaaaaac tcaaaacact agttgcaact gcagaagctg aacttgcaaa gaatgtgtcc 8220
ttagacaatg tcttatctac gtttatttca gcagctcggc aagggtttgt tgattcagat 8280
gtagaaacta aagatgttgt tgaatgtctt aaattgtcac atcaatctga catagaagtt 8340
actggcgata gttgtaataa ctatatgctc acctataaca aagttgaaaa catgacaccc 8400
cgtgaccttg gtgcttgtat tgactgtagt gctagacata ttaatgcgca ggtagcaaaa 8460
agtcacaaca ttgctttgat atggaacgtt aaagatttca tgtcattgtc tgaacaacta 8520
cgaaaacaaa tacgtagtgc tgctaaaaag aataacttac ccttcaagtt gacatgtgca 8580
actactagac aagttgttaa tgttgtaaca acaaagatag cacttaaggg tggtaaaatt 8640
gtgaataact ggttgaagca gcttattaaa gttacacttg tgttcctttt tgttgctgct 8700
attttctatc tgataacacc tgttcatgtc atgtctaaac atactgactt ttcaagtgaa 8760
atcataggat acaaggctat tgatggtggt gtcactcgtg acatagcatc tacagatact 8820
tgttttgcta acaaacatgc tgattttgac acatggttta gccagcgtgg tggtagttat 8880
actaatgaca aagcttgccc attgattgct gcagtcataa caagagaagt gggttttgtc 8940
gttcctggtt tgcctggaac gatattacgc acaactaatg gtgacttttt gcatttctta 9000
cctagagttt ttagtgcagt tggtaacatc tgttacacac catcaaaact tatagagtac 9060
actgactttg caacatcagc ttgtgttttg gctgctgaat gtacaatttt taaagacgct 9120
tctggtaagc cagtaccata ttgttatgat accaatgtac tagaaggttc tgttgcttat 9180
gaaagtttac gccctgacac acgttatgtg ctcatggatg gctctattat tcaatttcct 9240
aacacctacc ttgaaggttc tgtaagagtg gtaacaactt ttgattctga gtactgtagg 9300
cacggcactt gtgaaagatc agaagctggt gtttgtgtat ctactagtgg tagatgggta 9360
cttaacaacg attattacag atctttacca ggagttttct gtggtgtaga tgctgtaaat 9420
ttgcttacta acatgtttac accactaatt caacctattg gtgctttgga catatcagca 9480
tctatagtag ctggtggtat tgtagctatc gtagtaacat gccttgccta ctattttatg 9540
aggtttagac gtgcttttgg tgaatacagt catgtagttg cctttaatac tctcctattc 9600
cttatgtcat tcactgtact ctgtttaaca ccagtttact cattcttacc tggtgtttat 9660
tctgttattt acctgtactt gacattttat ctgactaatg atgtttcttt tctcgcacat 9720
attcagtgga tggttatgtt cacaccttta gtacctttct ggataacaat tgcttacatc 9780
atttgtattt ccacaaagca tttctattgg ttctttagta attacctaaa gagacgtgta 9840
gtctttaatg gtgtttcctt tagtactttt gaagaagctg cgctgtgcac ctttttgtta 9900
aataaggaga tgtatctaaa gttgcgtagt gatgtgctat tacctcttac gcaatataat 9960
agatacttag ctctttataa caagtacaag tatttcagtg gagcaatgga tacaactagc 10020
tacagagaag ctgcttgttg tcatctcgca aaggctctca atgacttcag taactcaggt 10080
tctgatgttc tttaccaacc accacaaacc tctatcacct cagctgtttt gcagagtggt 10140
tttagaaaaa tggcattccc atctggtaaa gttgagggtt gtatggtaca agtaacttgt 10200
ggtacaacta cacttaacgg tctttggctt gatgacgtag tttactgtcc aagacatgtg 10260
atctgcacct ctgaagatat gcttaaccct aattatgaag atctactcat ccgtaagtct 10320
aatcataact tcttggtaca ggctggtaat gttcaactca gggttattgg acattctatg 10380
caaaattgtg tacttaagct taaggttgat acagccaatc ctaagacacc taagtataag 10440
tttgttcgca ttcaaccagg acagactttt tcagtgttag cttgttacaa tggttcacca 10500
tctggtgttt accaatgtgc tatgaggccc aatttcacta ttaagggttc attccttaat 10560
ggttcatgtg gtagtgttgg ttttaacata gattatgact gtgtctcttt ttgttacatg 10620
caccatatgg aattaccaac tggagttcat gctggcacag acttagaagg taacttttat 10680
ggaccttttg ttgacaggca aacagcacaa gcagctggta cagatacaac tattacagtt 10740
aatgttcttg cttggttgta cgctgctgtt ataaatggag acaggtggtt tctcaatcga 10800
tttaccacaa ctcttaatga ctttaacctt gtggctatga agtacaatta tgaacctcta 10860
acacaagacc atgttgacat actaggacct ctttctgctc aaactggaat tgccgtttta 10920
gatatgtgtg cttcattaaa agaacttctg caaaatggta tgaatggacg taccatattg 10980
ggtagtgctt tattagaaga tgagtttaca ccttttgatg ttgttagaca atgctcaggt 11040
gttactttcc aaagtgcagt gaaaagaaca atcaagggta cacaccactg gttgttactc 11100
acaattttga cttcactttt agttttagtc cagagtactc aatggtcttt gttctttttc 11160
ttctacgaaa atgccttttt accttttgct atgggtatta ttgctatgtc tgcttttgca 11220
atgatgtttg tcaaacataa gcatgcattt ctctgtttgt ttttgttacc ttctcttgcc 11280
actgtagctt actttaatat ggtctacatg cctgctagtt gggtgatgcg tattatgaca 11340
tggttggata tggttgatac tagtttgtct ggttttaagc taaaagactg tgttatgtat 11400
gcatcagctg tagtgttact aatccttatg acagcaagaa ctgtgtatga tgatggtgct 11460
aggagagtgt ggacacttat gaatgtcttg acactcgttt ataaagttta ctatggcaac 11520
gctttagatc aagccatttc catgtgggct cttataatct ctgttacttc taactactca 11580
ggtgtagtta caactgtcat gtttttggcc agaggtattg tttttatgtg tgttgagtat 11640
tgccctattt tcttcataac tggtaataca cttcagtgta taatgctagt ctattgtttc 11700
ttaggctatt tttgtacttg ttacttcggc ctcttttgtt tactcaaccg ctactttaga 11760
ctgactcttg gtgtttatga ttacttagtg tctacacagg agtttagata tatgaattca 11820
cagggactac tcccacccaa gaatagcata gatgccttca aactcaacat taaattgttg 11880
ggtgttggtg gcaaaccttg tatcaaagta gccactgtac agtctaaaat gtcagatgta 11940
aagtgcacat cagtagtctt actctcagtt ttgcaacaac tcagagtaga atcatcatct 12000
aaattgtggg ctcaatgtgt ccagttacac aatgacattc tcttagctaa agatactact 12060
gaagcctttg aaaaaatggt ttcactactt tctgttttgc tttccatgca gggtgctgta 12120
gacataaaca agctttgtga agaaatgctg gacaacaggg caaccttaca agctatagcc 12180
tcagagttta gttcccttcc atcatatgca gcttttgcta ctgctcaaga agcttatgag 12240
caggctgttg ctaatggtga ttctgaagtt gttcttaaaa agttgaagaa gtctttgaat 12300
gtggctaaat ctgaatttga ccgtgatgca gccatgcaac gtaagttgga aaagatggct 12360
gatcaagcta tgacccaaat gtataaacag gctagatctg aggacaagag ggcaaaagtt 12420
actagtgcta tgcagacaat gcttttcact atgcttagaa agttggataa tgatgcactc 12480
aacaacatta tcaacaatgc aagagatggt tgtgttccct tgaacataat acctcttaca 12540
acagcagcca aactaatggt tgtcatacca gactacaaca catataagaa tacgtgtgat 12600
ggtacaacat ttacttatgc atcagcattg tgggaaatcc aacaggttgt agatgcagat 12660
agtaaaattg ttcagcttag tgaaattagt atggacaatt cacctaattt agcatggcct 12720
cttattgtaa cagctttaag ggccaattct gctgtcaaat tacagaataa tgagcttagt 12780
cctgttgcac taagacaaat gtcttgtgct gccggtacta cacaaactgc ttgcactgat 12840
gacaatgcgt tagcttacta caacacaaca aagggaggta ggtttgtact tgcactgtta 12900
tccgatttac aggatttgaa atgggctaga ttccctaaga gtgatggaac tggtactatc 12960
tatacagaac tggaaccacc ttgtaggttt gttacagaca cacctaaagg tcctaaagtg 13020
aagtatcttt acttcatcaa aggattaaac aacctaaata gaggtatggt acttggtagt 13080
ttagctgcca cagtacgttt acaagctggt aatgcaacag aagttcctgc taattcaact 13140
gtactttctt tctgtgcttt tgctgtagat gctgctaaag cttacaaaga ttatctagct 13200
agtgggggac aaccaatcac taattgtgtt aagatgttgt gtacacacac tggtactggt 13260
caggcaataa cagttacacc ggaagccaat atggatcaag aatcctttgg tggtgcatcg 13320
tgttgtctgt actgccgttg tcatatagat catccaaatc ctaaaggatt ttgtgactta 13380
aaaggtaagt atgtacaaat acctacaact tgtgctaatg accctgtggg ttttacactt 13440
aaaaacacag tctgtaccgt ctgcggtatg tggaaaggtt atggttgtag ttgtgatcaa 13500
ctccgcgaac ccatgcttca gtcagctgat gcacaatcgt ttttaaacgg gtttgcggtg 13560
taagtgcagc ccgtcttaca ccgtgcggca caggcactag tactgatgtc gtatatagag 13620
cttttgacat ctacaatgat aaagtagctg gttttgctaa gttcctaaaa actaattgtt 13680
gtcgcttcca agaaaaggac gaagatgaca atctcattga ttcttacttt gtagttaaga 13740
gacacacttt ctctaactac caacatgaag aaacaattta caacctgctt aaggattgtc 13800
cagctgttgc taaacatgac ttctttaagt ttagaataga cggtgacatg gtaccacata 13860
tatcacgtca acgtcttact aaatacacaa tggcagacct cgtctatgct ttaaggcatt 13920
ttgatgaagg taattgtgac acattaaaag aaatacttgt cacatacaat tgttgtgatg 13980
atgactactt caataaaaag gactggtatg attttgtaga aaacccagat atattacgcg 14040
tatacgccaa cttaggtgaa cgtgtacgcc aagctttgtt aaaaacagta cagttctgtg 14100
atgccatgcg aaatgctggt attgttggtg tactgacatt agataatcaa gatctcaatg 14160
gtaactggta tgactttggt gatttcatac aaaccacgcc aggtagtgga gttcctgttg 14220
tagactctta ttattcattg ctcatgccta tattaacctt gaccagggct ttaactgcag 14280
agtcacatgt tgacactgac ttaacaaagc cttacattaa gtgggatttg ttaaaatacg 14340
acttcacgga agagaggtta aaactctttg accgttattt taaatactgg gatcagacat 14400
accacccaaa ttgtgttaac tgtttggatg acagatgcat tctgcattgt gcaaacttta 14460
atgttctgtt ctctacagtg ttcccaccta caagttttgg accactagtg agaaaaatat 14520
ttgttgatgg tgttccattt gtagtttcaa ctggatacca cttcagagag ctaggtgttg 14580
tacataatca ggatgtaaac ttacatagct ctagacttag ttttaaggaa ttacttgtgt 14640
atgctgctga tcctgctatg catgctgctt ctggtaatct attactagat aaacgcacta 14700
cgtgcttttc agtagctgca cttactaaca atgttgcttt tcaaactgtc aaacccggta 14760
attttaacaa ggacttctat gactttgctg tgtctaaggg tttctttaag gaaggaagtt 14820
ctgttgaatt aaaacacttc ttctttgctc aggatggtaa tgctgctatc agcgattatg 14880
actactatcg ttataatcta ccaacaatgt gtgatatcag acaactacta tttgtagttg 14940
aagttgttga taagtacttt gattgttacg atggtggctg tattaatgct aaccaagtca 15000
tcgtcaacaa cctagacaaa tcagctggtt ttccatttaa taaatggggt aaggctagac 15060
tttattatga ttccatgagt tatgaggatc aagatgcact tttcgcatat acaaaacgta 15120
atgtcatccc tactataact caaatgaacc ttaagtatgc cattagtgca aagaatagag 15180
ctcgcaccgt agctggtgtc tctatctgta gtactatgac caatagacag tttcatcaaa 15240
aattactcaa gtcaatagcc gccactagag gagctactgt agtaattgga acaagcaaat 15300
tctatggtgg ttggcacaac atgctcaaaa ctgtttatag tgatgtagaa aaccctcacc 15360
ttatgggttg ggattatcct aaatgtgata gagccatgcc taacatgctt agaattatgg 15420
cctcacttgt tcttgctcgc aaacatacaa cgtgttgtag cttgtcacac cgtttctata 15480
gattagctaa tgagtgtgct caagtattga gtgaaatggt catgtgtggc ggttcactat 15540
atgttaaacc aggtggaacc tcatcaggag atgccacaac tgcttatgct aatagtgtgt 15600
ttaacatttg tcaagctgtc acggccaatg ttaatgcact tttatctact gatggtaaca 15660
aaattgccga taagtatgtc cgcaatttac aacacagact ttatgagtgt ctctatagaa 15720
atagagatgt tgacacagac tttgtgaatg agttttacgc atatttgcgt aaacatttct 15780
caatgatgat actctctgac gatgctgttg tgtgtttcaa tagcacttat gcatctcaag 15840
gtctagtggc tagcataaag aactttaagt cagttcttta ctatcaaaac aacgttttta 15900
tgtctgaagc aaaatgttgg actgagactg accttactaa aggacctcat gaattttgct 15960
ctcaacatac aatgctagtt aaacagggtg atgattatgt gtaccttcct tacccagatc 16020
catcaagaat cctaggtgcc ggttgttttg tagatgatat cgtaaaaaca gatggtacac 16080
ttatgattga acggttcgtg tctttagcta tagatgctta cccacttact aaacatccta 16140
atcaggagta tgctgatgtc tttcatttgt acttacaata catacgtaag ctacatgatg 16200
agttaacagg acacatgtta gacatgtatt ctgttatgct tactaatgat aacacttcaa 16260
ggtattggga acctgagttt tatgaggcta tgtacacacc gcatacagtc ttacaagctg 16320
ttggtgcttg tgttctttgc aattcacaga cttcattaag atgtggtgct tgcatacgta 16380
gaccattctt atgttgtaaa tgctgttacg accatgtcat ctcaacatca cataaattag 16440
tcttgtctgt taatccgtat gtttgcaatg ctccaggttg tgatgtcaca gatgtgactc 16500
aactttactt aggaggtatg agctattact gtaagtcaca taaaccaccc attagttttc 16560
cattgtgtgc taatggacaa gtttttggtc tctacaagaa tacatgtgtt ggtagcgata 16620
atgttactga ctttaatgca attgcaacat gtgactggac aaatgctggt gattacattt 16680
tagctaacac ctgtactgaa agactcaagc tttttgcagc agaaacgctc aaagctactg 16740
aggagacatt taaactgtct tatggtattg ctactgtacg tgaagtgctg tctgacagag 16800
aattacatct ttcatgggaa gttggtaaac ctagaccacc acttaaccga aattatgtct 16860
ttactggtta tcgtgtaact aaaaacagta aagtgcaaat cggagagtac acctttgaaa 16920
aaggtgacta tggtgatgct gttgtttacc gaggtacaac aacttacaaa ctcaacgttg 16980
gtgattattt tgtgctgaca tcacatacag taatgccatt aagtgcacct acactagtgc 17040
cacaagagca ctatgttaga attactggct tatacccaac actcaatatc tcagatgagt 17100
tttctagcaa tgttgcaaat tatcaaaagg ttggtatgca aaagtattct acactccagg 17160
gaccacctgg tactggtaaa agtcattttg ctattggtct agctctctac tacccttctg 17220
ctcgcatagt atatacagct tgctctcatg cagctgttga tgcactatgt gagaaggcat 17280
taaaatattt gcccatagac aaatgtagta gaattatacc tgcacgtgct cgtgtagagt 17340
gttttgataa attcaaggtg aattcaacat tagaacagta tgtcttttgt actgtaaatg 17400
cattgcctga gacgacagca gatatagttg tctttgatga aatttcaatg gccacaaatt 17460
atgatttgag tgttgtcaat gccagattac gtgctaagca ctatgtgtac attggtgatc 17520
ctgctcaatt acctgcacca cgcacattac taactaaggg tacactagaa ccagaatatt 17580
tcaattcagt gtgtagactt atgaaaacta taggtccaga catgttcctc ggaacttgtc 17640
gtagatgtcc tgctgaaatt gttgacactg tgagtgcttt ggtttatgat aataagctta 17700
aggcacataa agacaaatca gctcaatgct ttaaaatgtt ctacaagggt gttatcacgc 17760
atgatgtttc atctgcaatt aacaggccac aaataggcgt ggtaagagaa ttccttacac 17820
gtaaccctgc ttggagaaaa gctgtcttta tttcacctta caattcccag aatgctgtag 17880
cctcaaagat tttgggacta ccaactcaaa ctgttgattc atcacagggc tcagaatatg 17940
actatgtcat attcactcaa accactgaaa cagctcactc ttgtaatgta aacagattca 18000
acgttgctat taccagagca aaagtaggca tactttgcat aatgtctgat agagaccttt 18060
atgacaagtt gcaatttaca agtcttgaaa ttccacgtag gaatgtggca actttacaag 18120
ctgaaaatgt aacaggactc tttaaagatt gtagtaaggt aatcactggg ttacatccta 18180
cacaggcacc tacacactta agtgttgata ctaaattcaa aactgaaggt ttatgtgttg 18240
acatacctgg catacctaag gacatgacct atagaagatt aatctctatg atgggtttca 18300
aaatgaatta ccaggttaat ggttacccta acatgtttat cacccgcgaa gaagctataa 18360
gacatgtacg tgcatggatt ggcttcgatg tcgaaggttg tcatgctact agagaagctg 18420
ttggtaccaa tttaccttta cagctaggtt tttctacagg tgttaaccta gttgctgtac 18480
ctacaggtta tgttgataca cctaataata cagatttttc cagagttagt gctaaaccac 18540
cgcctggaga tcaatttaaa cacctcatac cacttatgta caaaggactt ccttggaatg 18600
tagtgcgtat aaagattgtc caaatgttaa gtgacacact taaaaatctc tctgacagag 18660
tcgtatttgt cttatgggca catggctttg agttgacatc tatgaagtat tttgtgaaga 18720
tcggacctga gcgcacatgt tgtctatgtg atagacgtgc tacatgcttt tccactgctt 18780
cagacactta tgcctgttgg catcattcta ttggatttga ttacgtctat aatccgttta 18840
tgattgatgt tcaacaatgg ggttttacag gtaacctaca aagcaaccat gatctgtatt 18900
gtcaagtcca tggtaatgca catgtagcta gttgtgatgc aatcatgact aggtgtctag 18960
ctgtccacga gtgctttgtt aagcgtgttg actggactat tgaatatcct ataatcggtg 19020
atgaactgaa gattaatgcg gcttgtagaa aggttcaaca catggttgtt aaagctgcat 19080
tattagcaga caaattccca gttcttcacg acattggtaa ccctaaagct attaagtgtg 19140
tacctcaagc tgatgtagaa tggaagttct atgatgcaca gccttgtagt gacaaagctt 19200
acaaaataga agaactgttc tattcttatg ccacacattc tgacaaattc acagatggtg 19260
tatgcctatt ttggaattgc aatgtcgata gatatcctgc taattccatt gtttgtagat 19320
ttgacactag agtgctatct aaccttaact tgcctggttg tgatggtggc agtttgtatg 19380
taaataagca tgcattccac acaccagctt ttgataaaag tgcttttgtt aatctaaagc 19440
aacttccatt tttctattac tctgacagtc catgtgagtc tcatggaaaa caagtagtgt 19500
cagatataga ttatgtacca ctaaagtctg ctacgtgtat aacacgttgc aatttaggtg 19560
gtgctgtctg tagacatcat gctaatgagt acagattgta tctcgatgct tataacatga 19620
tgatctcagc tggctttagc ttgtgggttt acaaacaatt tgatacctat aacctctgga 19680
acacttttac aagacttcag agtttagaaa atgtggcttt taatgttgta aataagggac 19740
actttgatgg acaacagggt gaagtaccag tttctatcat taacaacact gtttacacaa 19800
aagttgatgg tgttgatgta gaattgtttg agaacaaaac cacattacct gttaatgtag 19860
catttgagct ttgggctaag cgcaacatta aaccagtacc agaggtgaaa atactcaata 19920
atttgggtgt ggacattgct gctaatactg tgatctggga ctacaaaaga gatgctccag 19980
cacatatatc tactattggt gtttgttcta tgactgacat agccaagaaa ccaactgaaa 20040
cgatttgtgc accactcact gtcttttttg atggtagagt tgatggtcaa gtagacttat 20100
ttagaaatgc ccgtaatggt gttcttatta cagaaggtag tgttaaaggt ttacaaccat 20160
ctgtaggtcc caaacaagct agtcttaatg gagtcacatt aattggagaa gccgtaaaaa 20220
cacagttcaa ttattacaag aaagtggatg gtgttgtcca acaattacct gaaacttact 20280
ttactcagag tagaaactta caggaattta agcccaggag tcaaatggaa attgatttct 20340
tagaacttgc tatggatgaa ttcattgaac ggtataaatt agaaggctat gccttcgaac 20400
atatcgttta tggagatttt agtcatagtc agttaggtgg tttacatcta ctgattggac 20460
tagctaaacg ttttaaggaa tcaccttttg aacttgaaga ttttattcct atggacagta 20520
cagttaaaaa ctacttcata acagatgcgc aaacaggttc atctaagtgt gtgtgttctg 20580
ttattgatct tttacttgat gacttcgttg aaataataaa gtcccaagat ttatctgtag 20640
tttctaaggt tgtcaaagtg actattgact atacagaaat ctcatttatg ctttggtgta 20700
aagatggcca tgtagaaaca ttttacccaa aattacaatc tagtcaagcg tggcaaccgg 20760
gtgttgctat gcctaatctt tacaaaatgc aaagaatgct attagaaaag tgtgaccttc 20820
aaaattatgg tgatagtgca acattaccta aaggcataat gatgaatgtc gcaaaatata 20880
ctcaactgtg tcaatattta aacacactga cattagctgt accctataat atgagagtta 20940
tccattttgg tgctggttct gataaaggag ttgcaccagg tacagctgtt ttaagacaat 21000
ggttgcctac aggtacgctg cttgtcgatt cagatcttaa tgactttgtc tctgatgcag 21060
attcaacttt gattggtgat tgtgcaactg tacatacagc taataaatgg gatctcatta 21120
ttagtgatat gtacgaccct aagactaaga atgtcacaaa agaaaacgac tctaaagagg 21180
gttttttcac ttacatttgt gggtttatac aacaaaagct agctcttgga ggttccgtgg 21240
ctataaagat aacagaacat tcttggaatg ctgatcttta taagctcatg ggacacttcg 21300
catggtggac agcctttgtt actaatgtga atgcgtcatc atctgaagca tttttaatcg 21360
gatgtaacta ccttggcaaa ccacgcgaac aaatagatgg ttatgtcatg catgcaaatt 21420
acatattttg gaggaataca aatccaattc agctttcttc ttattcttta ttcgacatga 21480
gtaaattccc ccttaaatta aggggtactg ctgttatgtc tttaaaagaa ggtcaaatca 21540
atgatatgat tctctctctt cttagtaaag gtagacttat aattagagaa aacaacagag 21600
ttgttatttc tagtgatgtt cttgttaaca actaaacgaa caatgtttgt ttttcttgtt 21660
ttattgccac tagtctctag tcagtgtgtt aatcttacaa ccagaactca attaccccct 21720
gcatacacta attctttcac acgtggtgtt tattaccctg acaaagtttt cagatcctca 21780
gttttacatt caactcagga cttgttctta cctttctttt ccaatgttac ttggttccat 21840
gctatacatg tctctgggac caatggtact aagaggtttg ataaccctgt cctaccattt 21900
aatgatggtg tttactttgc ttccactgag aagtctaaca taataagagg ctggattttt 21960
ggtactactt tagattcgaa aacccagtcc ctacttattg ttaataacgc tactaatgtt 22020
gttatcaaag tctgtgaatt tcaattttgt aacgatccat ttttgggtgt ttattaccac 22080
aaaaacaaca aaagttggat ggaaagtgag ttcagagttt attctagtgc gaataattgc 22140
acttttgaat acgtctctca gccttttctt atggaccttg aaggaaaaca gggtaatttc 22200
aaaaatctta gggaatttgt gttcaagaat attgatggtt acttcaagat atactctaag 22260
cacacgccta ttaatttagt gcgtgatctc cctcagggtt tttcggcttt agaaccattg 22320
gtagatttgc caataggtat taacatcact aggtttcaaa ctttacttgc tttacataga 22380
agttatttaa ctcctggtga ttcttcttca ggttggacag ctggtgctgc agcttattat 22440
gtgggttatc ttcaacctag gacttttcta ctgaagtaca atgaaaatgg aaccattaca 22500
gatgctgtag actgtgcact tgaccctctc tcagaaacaa agtgtacgtt gaaatccttc 22560
actgtagaaa aaggaatcta tcaaacttct aactttagag tccaaccaac agaatctatt 22620
gttagatttc ctaacatcac aaacttgtgc ccttttggtg aagtttttaa cgccaccaga 22680
tttgcatctg tttatgcttg gaacaggaag agaatcagca actgtgttgc tgattattct 22740
gtcctgtata attccgcatc attttccact tttaagtgtt atggagtgtc tcctactaaa 22800
ttaaatgatc tctgctttac taatgtctat gcagattcat ttgtaattag aggtgatgaa 22860
gtcagacaaa tcgctccagg gcaaactgga aagattgctg attataacta caaattacca 22920
gatgatttta caggctgcgt tatagcttgg aattctaaca atcttgattc taaggttggt 22980
ggtaattata attacctgta cagattgttt aggaagtcta atctcaaacc ttttgagaga 23040
gatatttcaa ctgaaatcta tcaggccggt agcacacctt gtaatggtgt tgaaggtttt 23100
aattgttact ttcctctgca atcatatggt ttccaaccca ctaatggtgt tggttaccaa 23160
ccatacagag tagtagtact ttcttttgaa cttctacatg caccagcaac tgtttgtgga 23220
cctaaaaagt ctactaattt ggttaagaac aagtgtgtca atttcaactt caatggttta 23280
acaggcacag gtgttcttac tgagtctaac aaaaagtttc tgcctttcca acaatttggc 23340
agagacattg ctgacactac tgatgctgtt cgtgatccac aaacacttga gattcttgac 23400
attacaccat gttcttttgg tggtgtcagt gttataacac caggaacaaa tacttctaac 23460
caggttgctg ttctttatca ggatgttaac tgcacagaag tccctgttgc tattcatgca 23520
gatcaactta ctcctacttg gcgtgtttat tctacaggtt ctaatgtttt tcaaacacgt 23580
gcaggctgtt taataggggc tgaacatgtc aacaactcat atgagtgtga catacccatt 23640
ggtgcaggta tatgcgctag ttatcagact cagactaatt ctcctcggag agcaagaagt 23700
gtagctagtc aatccatcat tgcctacact atgtcacttg gtgcagaaaa ttcagttgct 23760
tactctaata actctattgc catacccaca aattttacta ttagcgttac cacagaaatt 23820
ctaccagtgt ctatgaccaa gacatcagta gattgtacaa tgtacatttg tggtgattca 23880
actgaatgca gcaatctttt gttgcaatat ggcagttttt gtacacaatt aaaccgtgct 23940
ttaactggaa tagctgttga acaagacaaa aacacccaag aagtttttgc acaagtcaaa 24000
caaatttaca agacaccacc aattaaagat tttggcggtt ttaattttag ccagatactg 24060
ccagatccat caaaaccaag caagaggtca tttattgaag atctactgtt caacaaagtg 24120
acacttgcag atgctggctt catcaaacaa tatggtgatt gccttggtga tattgctgct 24180
agagacctca tttgtgcaca aaagtttaac ggccttactg ttttgccacc tttgctcaca 24240
gatgaaatga ttgctcaata cacttctgca ctgttagcag gtacaatcac ttctggttgg 24300
acttttggtg caggtgctgc attacaaata ccatttgcta tgcaaatggc ttataggttt 24360
aatggtattg gagttacaca gaatgttctc tatgagaacc aaaaattgat tgccaaccaa 24420
tttaatagtg ctattggcaa aattcaagac tcactttctt ccacagcaag tgcacttgga 24480
aaacttcaag atgtggtcaa ccaaaatgca caagctttaa acacgcttgt taaacaactt 24540
agctccaatt ttggtgcaat ttcaagtgtt ttaaacgaca tcctttcacg tcttgacaaa 24600
gttgaggctg aagtgcaaat tgataggttg atcacaggca gacttcaaag tttgcagaca 24660
tatgtgactc aacaattaat tagagctgca gaaatcagag cttctgctaa tcttgctgct 24720
actaaaatgt cagagtgtgt acttggacaa tcaaaaagag ttgacttttg cggaaagggc 24780
tatcatctta tgtcatttcc tcagtcagca cctcatggtg tcgtcttttt gcatgtgact 24840
tatgtccctg cacaagaaaa gaacttcaca actgctcctg ccatttgtca tgatggaaaa 24900
gcacactttc ctcgtgaagg tgtctttgtt tcaaatggca cacactggtt tgtaacacaa 24960
aggaattttt atgaaccaca aatcattact acagacaaca catttgtgtc tggtaactgt 25020
gatgttgtaa taggaattgt caacaacaca gtttatgatc ctttgcaacc tgaattagac 25080
tcattcaagg aggagcttga taaatacttc aagaaccata cctcaccaga tgttgattta 25140
ggtgacatct ctggcattaa tgcttcagtt gtaaacattc agaaagaaat cgaccgcctc 25200
aatgaggttg ccaagaattt aaatgaatct ctcatcgatc tccaagaact tggaaagtat 25260
gagcagtata taaaatggcc atggtacatt tggctaggtt ttatagctgg cttgattgcc 25320
atagtaatgg tgacaattat gctttgctgt atgaccagtt gctgtagttg tctcaagggc 25380
tgttgttctt gtggatcctg ctgcaaattt gacgaggacg actctgagcc agtgctcaaa 25440
ggagtcaaat tacattacac ataaacgaac ttatggattt gtttatgaga atcttcacaa 25500
ttggaactgt aactttgaag caaggtgaaa tcaaggatgc tactccttca gattttgtta 25560
gagctactgc aacgataccg atacaagcat cacttccttt cggatggctt attgttggcg 25620
ttgcacttct tgctgttttt cagagcgctt ccaaaatcat aaccctcaaa aagagatggc 25680
aactagcact ctccaagggt gttcactttg tttgcaactt gctgttgttg tttgtaacag 25740
tttactcaca tcttttgctt gttgctgctg gccttgaagc cccttttctc tatctttatg 25800
ctttagtcta cttcttgcag agtataaact ttgtacgcat aataatgagg ctttggcttt 25860
gctggaaatg ccgttccaaa aacccattac tttatgatgc caactatttt ctttgctggc 25920
atactaattg ttacgactat tgtatacctt acaatagtgt aacttcttca attgtcatta 25980
cttcaggtga tggcacaaca agtcctattt ctgaacatga ctaccagatt ggtggttata 26040
ctgaaaaatg ggaatctgga gtaaaagact gtgttgtatt acacagttac ttcacttcag 26100
actattacca gctgtactca actcaattga gtacagacac tggtgttgaa catgttacct 26160
tcttcatcta caataaaatc gttgatgagc ctgaagaaca tgtccaaatt cacacaatcg 26220
acgtttcatc cggagttgtt aatccagtaa tggaaccaat ttatgatgaa ccgacgacga 26280
ctactagcgt gcctttgtaa gcacaagctg atgagtacga acttatgtac tcattcgttt 26340
cggaagagac aggtacgtta atagttaata gcgtacttct ttttcttgct ttcgtggtat 26400
tcttgctagt tacactagcc attcttactg cgcttcgatt gtgtgcgtac tgttgcaata 26460
ttgttaacgt gagtcttgta aaaccttctt tttacgttta ctctcgtgtt aaaaatctga 26520
attcttctcg ggttcctgat cttctggtct aaacgaacta aatattatat tagtttttct 26580
gtttggaact ttaattttag ccatggcaga ttccaacggt actattaccg ttgaggagct 26640
gaaaaagctc cttgaacaat ggaacctagt aataggtttc ctattcctta catggatttg 26700
cctgctgcaa tttgcctatg ccaacaggaa taggtttttg tacatcatta agttgatttt 26760
cctctggctg ttatggccag taactttagc ttgttttgtg cttgctgctg tttacagaat 26820
aaattggatc accggtggaa ttgctattgc aatggcttgt cttgtaggat tgatgtggct 26880
aagctacttc attgcttctt tcagactgtt tgcgcgtacg cgttccatgt ggtcattcaa 26940
tccagaaact aacattcttc tcaacgtgcc actccatgga actattctga ctagaccgct 27000
tctagaaagt gaactcgtaa tcggagctgt tatccttcgt ggacatcttc gtattgctgg 27060
acatcatcta ggacgctgtg acatcaagga tctacctaaa gaaatcactg ttgctacatc 27120
acgaacgctt tcttattaca aattgggagc ttcacagcgt gtagcaggtg attcaggttt 27180
tgctgcatat agtcgctaca ggattggcaa ctataaatta aacacagacc attccagtag 27240
cagtgacaat attgctttgc ttgtacagta agtgacaaca gatgtttcat ctcgttgact 27300
ttcaggttac tatagcagag atattactaa tcatcatgag gacttttaaa gtttccattt 27360
ggaatcttga ttacatcata aacctcataa ttaagaactt aagcaagtca ctaactgaga 27420
ataaatattc tcaactagac gaggagcagc caatggagat tgattaaacg aacatgaaaa 27480
ttattctttt cttggcactg ataacactcg ctacttgtga gctttatcac taccaagagt 27540
gtgttagagg tacaacagta cttttaaaag aaccttgctc gtcgggaaca tacgagggca 27600
attcaccatt tcatcctcta gctgataaca aatttgcact gacttgcttt agcactcaat 27660
ttgcttttgc ttgtcctgac ggcgtaaaac acgtctatca gttacgtgcc agatcagttt 27720
cacctaaact gttcatcaga caagaggaag ttcaagaact ttactctcca atttttctta 27780
ttgttgcggc aatagtgttt ataacacttt gcttcacact caaaagaaag acagaatgat 27840
tgaactttca ttaattgact tctatttgtg ctttttagcc tttctgctat tccttgtttt 27900
aattatgctt attatctttt ggttctcact tgaactgcaa gatcataatg aaacttgtca 27960
cgcctaaacg aacatgaaat ttcttgtttt cttaggaatc atcacaactg tagctgcatt 28020
tcaccaagaa tgtagtttac agtcatgtac tcaacatcaa ccatatgtag ttgatgaccc 28080
gtgtcctatt cacttctatt ctaaatggta tatcagagta ggagctagaa aatcagcacc 28140
tttaattgaa ttgtgcgtgg atgaggctgg ttctaaatca cccattcagt acatcgatat 28200
cggtaattat acagtttcct gtttaccttt tacaattaac tgccaggaac ctaaattggg 28260
tagtcttgta gtgcgttgtt cgttctacga ggacttttta gagtatcatg acgttcgtgt 28320
tgttttagat ttcatctaaa cgaacaaact aaaatgtctg ataatggacc tcaaaatcag 28380
cgaaatgcac ctcgcattac gtttggtgga ccatcagatt caactggcag taaccagaat 28440
ggagaacgaa gtggtgcgcg atcaaaacaa cgccgcccgc aaggtttacc caataatact 28500
gcgtcttggt tcaccgctct cactcaacat ggcaaggaag atttaaaatt ccctcgagga 28560
caaggcgttc caattaacac caatagcagt ccagatgacc aaattggcta ctaccgccgc 28620
gccacaagac gaattcgtgg tggtgatggt aaaatgaaag atctcagtcc aagatggtat 28680
ttctactatc taggaactgg gccagaagct ggacttcctt atggtgctaa caaagatggc 28740
atcatatggg ttgcaactga gggagccttg aatacaccaa aagatcacat tggcaccaga 28800
aatcctgcta acaatgctgc aatcgtgcta caacttcctc aaggaacaac attaccaaaa 28860
ggtttttacg cagaagggtc tagaggtgga agtcaagcct cttctagatc atcatcacgt 28920
agtcgcaaca gttcaagaaa ttcaactcca ggttcaagta gaggaacttc tcctgctaga 28980
atggctggaa atggaggtga tgctgctctt gctttgttac tacttgacag attgaaccag 29040
cttgagagca aaatgtctgg taaaggccaa caacaacaag gccaaactgt cactaagaaa 29100
tctgctgctg aggcttctaa gaagcctaga caaaaacgta ctgccactaa agcatacaat 29160
gtaacacaag ctttcggcag acgtggtcca gaacaaactc aaggaaattt tggggatcag 29220
gaactaatca gacaaggaac tgattacaaa cattggccgc aaattgcaca atttgctcct 29280
tctgcttcag cgttctttgg aatgtcgaga attggaatgg aagtcacacc ttcgggaaca 29340
tggttgacct atacaggtgc catcaaattg gatgacaaag atccaaattt caaagatcaa 29400
gtcattttgc tgaataagca tattgacgca tacaaaacat tcccaccaac agagcctaaa 29460
aaggacaaaa agaagaaggc tgatgaaact caagccttac cgcagagaca gaagaaacag 29520
caaactgtga ctcttcttcc tgctgcagat ttggatgatt tctccaaaca attgcaacaa 29580
tccatgagca gtgctgactc aactcaggcc taaactcatg cagaccacac aaggcagatg 29640
ggctatataa acgttttcgc ttttccgttt acgatatata gtctactctt gtgcagaatg 29700
aattctcgta actacatagc acaagtagat gtagttaact ttaatctcac atagcaatct 29760
ttaatcagtg tgtaacatta gggaggactt gaaagagcca ccacattttc accgaggcca 29820
cgcggagtac gatcgagtgt acagtgaaca atgctaggga gagctgccta tatggaagag 29880
ccctaatgtg taaaattaat tttagtagtg ctatccccat gtgattttaa tagcttctta 29940
ggagaatgac aaaaaaaaac aaaaaaaa 29968
<210> 45
<211> 10827
<212> DNA
<213> Artificial Sequence
<220>
<223> vector
<400> 45
cggccgtaag atacattgat gagtttggac aaaccacaac tagaatgcag tgaaaaaaat 60
gctttatttg tgaaatttgt gatgctatag ctttatttgt aaccattata agctgcaata 120
aacaagttgt ttaaaccacg tgatgaccat acacctcggg atactagatg tataatgtcc 180
gccatgcaga cgaaaccagt cggagattac cgagcattct atcacgtcgg cgaccaatag 240
tgagcttagg gataacaggg taataaacga tccccgggaa ttcactggcc gtcgttttac 300
aacgtcgtga ctgggaaaac cctggcgtta cccaacttaa tcgccttgca gcacatcccc 360
ctttcgccag ctggcgtaat agcgaagagg cccgcaccga tcgcccttcc caacagttgc 420
gcagcctgaa tggcgaatgg cgatagatcc ggtggatgac cttttgaatg acctttaata 480
gattatatta ctaattaatt ggggacccta gaggtcccct tttttatttt aaaaattttt 540
tcacaaaacg gtttacaagc ataaagctcg gacggatctt ttccgctgca taaccctgct 600
tcggggtcat tatagcgatt ttttcggtat atccatcctt tttcgcacga tatacaggat 660
tttgccaaag ggttcgtgta gactttcctt ggtgtatcca acggcgtcag ccgggcagga 720
taggtgaagt aggcccaccc gcgagcgggt gttccttctt cactgtccct tattcgcacc 780
tggcggtgct caacgggaat cctgctctgc gaggctggcc ggctaccgcc ggcgtaacag 840
atgagggcaa gcggatggct gatgaaacca agccaaccag gaagggcagc ccacctatca 900
aggtgtcgat gcaggggggg gggaaagcca cgttgtgtct caaaatctct gatgttacat 960
tgcacaagat aaaaatatat catcatgaac aataaaactg tctgcttaca taaacagtaa 1020
tacaaggggt gttatgagcc atattcaacg ggaaacgtct tgctcaaggc cgcgattaaa 1080
ttccaacatg gatgctgatt tatatgggta taaatgggct cgcgataatg tcgggcaatc 1140
aggtgcgaca atctatcgat tgtatgggaa gcccgatgcg ccagagttgt ttctgaaaca 1200
tggcaaaggt agcgttgcca atgatgttac agatgagatg gtcagactaa actggctgac 1260
ggaatttatg cctcttccga ccatcaagca ttttatccgt actcctgatg atgcatggtt 1320
actcaccact gcgatccccg gaaaaacagc attccaggta ttagaagaat atcctgattc 1380
aggtgaaaat attgttgatg cgctggcagt gttcctgcgc cggttgcatt cgattcctgt 1440
ttgtaattgt ccttttaaca gcgatcgcgt atttcgtctc gctcaggcgc aatcacgaat 1500
gaataacggt ttggttgatg cgagtgattt tgatgacgag cgtaatggct ggcctgttga 1560
acaagtctgg aaagaaatgc ataagttttt gccattctca ccggattcag tcgtcactca 1620
tggtgatttc tcacttgata accttatttt tgacgagggg aaattaatag gttgtattga 1680
tgttggacga gtcggaatcg cagaccgata ccaggatctt gccatcctat ggaactgcct 1740
cggtgagttt tctccttcat tacagaaacg gctttttcaa aaatatggta ttgataatcc 1800
tgatatgaat aaattgcagt ttcatttgat gctcgatgag tttttctaat cagaattggt 1860
taattggttg taacactggc agagcattac gctgacttga cgggacggcg gctttgttga 1920
ataaatcgaa cttttgctga gttgaaggat cagatcacgc atcttcccga caacgcagac 1980
cgttccgtgg caaagcaaaa gttcaaaatc accaactggt ccacctacaa caaagctctc 2040
atcaaccgtg gctccctcac tttctggctg gatgatgggg cgattcaggc ctggtatgag 2100
tcagcaacac cttcttcacg aggcagacct cagacggtat cggatcgatc ccccgatgtg 2160
tagcagtggc ggaccatata ggcagatcag aaggcgcggt tctcctacat gagcttttca 2220
attcaattca tcattttttt tttattcttt tttttgattt cggtttcctt gaaatttttt 2280
tgattcggta atctccgaac agaaggaaga acgaaggaag gagcacagac ttagattggt 2340
atatatacgc atatgtagtg ttgaagaaac atgaaattgc ccagtattct taacccaact 2400
gcacagaaca aaaacctgca ggaaacgaag ataaatcatg tcgaaagcta catataagga 2460
acgtgctgct actcatccta gtcctgttgc tgccaagcta tttaatatca tgcacgaaaa 2520
gcaaacaaac ttgtgtgctt cattggatgt tcgtaccacc aaggaattac tggagttagt 2580
tgaagcatta ggtcccaaaa tttgtttact aaaaacacat gtggatatct tgactgattt 2640
ttccatggag ggcacagtta agccgctaaa ggcattatcc gccaagtaca attttttact 2700
cttcgaagac agaaaatttg ctgacattgg taatacagtc aaattgcagt actctgcggg 2760
tgtatacaga atagcagaat gggcagacat tacgaatgca cacggtgtgg tgggcccagg 2820
tattgttagc ggtttgaagc aggcggcaga agaagtaaca aaggaaccta gaggcctttt 2880
gatgttagca gaattgtcat gcaagggctc cctatctact ggagaatata ctaagggtac 2940
tgttgacatt gcgaagagcg acaaagattt tgttatcggc tttattgctc aaagagacat 3000
gggtggaaga gatgaaggtt acgattggtt gattatgaca cccggtgtgg gtttagatga 3060
caagggagac gcattgggtc aacagtatag aaccgtggat gatgtggtct ctacaggatc 3120
tgacattatt attgttggaa gaggactatt tgcaaaggga agggatgcta aggtagaggg 3180
tgaacgttac agaaaagcag gctgggaagc atatttgaga agatgcggcc agcaaaacta 3240
aaaaactgta ttataagtaa atgcatgtat actaaactca caaattagag cttcaattta 3300
attatatcag ttattacccg ggaatctcgg tcgtaatgat ttttataatg acgaaaaaaa 3360
aaaaattgga aagaaaaagc tgggcgcgcc ggccggccct tttcatcacg tgctataaaa 3420
ataattataa tttaaatttt ttaatataaa tatataaatt aaaaatagaa agtaaaaaaa 3480
gaaattaaag aaaaaatagt ttttgttttc cgaagatgta aaagactcta gggggatcgc 3540
caacaaatac taccttttat cttgctcttc ctgctctcag gtattaatgc cgaattgttt 3600
catcttgtct gtgtagaaga ccacacacga aaatcctgtg attttacatt ttacttatcg 3660
ttaatcgaat gtatatctat ttaatctgct tttcttgtct aataaatata tatgtaaagt 3720
acgctttttg ttgaaatttt ttaaaccttt gtttattttt ttttttcttc attccgtaac 3780
tcttctacct tctttattta ctttctaaaa tccaaataca aaacataaaa ataaataaac 3840
acagagtaaa ttcccaaatt attccatcat taaaagatac gaggcgcgtg taagttacag 3900
gcaagcgatc ggccggcccg ggcatttaaa tgcaggccgc gtacgcgtcg acggtaccga 3960
attcgcttaa acgagctcat gttcgccggt gaacgcgttg aggaagccgg gcagtgcctc 4020
ggcaaaatcc ttgcgtgtag acaagacatc tgcgtagcag ttgtcctcaa caacgatgtc 4080
gaaatccaaa tcggagtgct catcgagtcc tccgtgaacg taagagccgc cgatcagaag 4140
agcgcggaag cgaacatcgg aagcgaccgc atcgcggatg cggttcaaga aagttgcatg 4200
agcttgtgga agtgtgctga gcataaatga ttctcctagc tgttctttgg gtaagtacgc 4260
catcaggacg ttgtgagtgg cgcgattttt agcggctgaa atcagccctt gagcctgtcg 4320
gcaagtcgcg tcatgaggtc catgcgctca tgcaggatcg ccacgaccaa cgcgggttcg 4380
cccgcacgcg gcaggcaaaa aacgtagtgg tgttcgcagc gggccatccg cagcgcggga 4440
aagagttcgc tcatgtcctt aaacgggcct tcgccggcgg caagcctggc tatgccctgt 4500
tccagcttag cgatatagcg gcgcacctgc gccgcgcccc actcccggcg cgtgtagcgg 4560
atgatgccgc gtagatcggc ttcggcctca gccgtgagga tgtaggccgt caagcgcgat 4620
ccccgctgag ttcttcatca agaatttcgc cgacgctctt ggtggacacc ttgccggcaa 4680
gcccatcgtt gatgcggttc cccagcatgg ttttcagttc ctgccatgcc tgatcggcat 4740
cagcgtcacc ggggaacaga cgttcgaggg cgtattgctt aatggtcttg ccctgcaagg 4800
cggccagggc tttcaggctc tggtgctgct ggtccgtcat gtcgattgtc aggcggctca 4860
ttggataacc tccataaaat acacgtaacc acattagcac atatgtgggc gtgaggctac 4920
agcgcgaggc gcattaaggt cgggaaaatg cgctaggcgc atttaaattg cgtattgctg 4980
taatgcgcca tgccggctag actaggccca aatgggtata cccaatttga ccaaggggga 5040
cgcgatgagg gcggccaagc actaccgaca acttctatcc atcgacttca acatcgaggc 5100
gctggccttc gtgcctggac ccgacggcac acgcggccgg cgcatccacg tcctggggcg 5160
cgaggtccgc gaccggcccg gcctggtcga gtacctttcg ccggcgttcg gctcgcgggt 5220
ggcgctggac ggctactgca aggccaattt cgatgcagtg ctgcacctgg cgtaccccga 5280
tcatcagcaa tggggccacg catgaagcgc cgaagctacg ccatgctgcg cgccgctgcc 5340
gcgctggccg tcctggtcgt tgcctcgccg gcatgggccg agctgcgcgg cgaggtcgtg 5400
cgcatcatcg acggcgacac catcgacgtg ctggtagaca agcagccggt gcgcgtgcgc 5460
ctggtggaca ttgacgcgcc ggaaaagcgg caagccttcg gcgaacgtgc gcgccaggcg 5520
ctggccggca tggtgttccg ccggcacgtc ctggtcgacg agaaggacac cgaccgttac 5580
ggccgcacgc tgggcaccgt gtgggtcaac atggagctgg ccagccggcc gccgcagccg 5640
cgcaacgtca acgccgcgat ggttcaccag ggcatggcgt gggcctatcg cttccacggc 5700
cgcgcggccg accctgaaat gctgcggctc gaacaggagg cgcgaggcaa gcgcgtcggc 5760
ctctggtccg atccgcacgc cgtcgagccg tggaaatggc gacgcgagag caacaaccgg 5820
agggacgaag gttgaaggtc gcccgcatct acctgcgcgc cagtacggac gagcagaatc 5880
ttgaacgcca ggagagcctt gtagcggcca cgcgggccgc cgggtactac gtcgccggca 5940
tctaccgcga gaaggcgtcc ggcgcacgcg ccgaccggcc cgagctgctg cgcatgatcg 6000
cggacctgca acctggtgaa gtcgtcgttg cggagaagat cgaccgcatc agccgcttgc 6060
cgttggccga ggccgagcgc ctggttgcgt cgatccgggc caaaggggcc aagctggccg 6120
tgcctggcgt ggtggacctg tcggagctgg ccgccgaggc gaacggagtg gcgaaaatcg 6180
ttctggaatc cgtccaggac atgcttttga agctcgcctt gcagatggcc cgcgacgact 6240
acgaggatcg gcgcgagcgt caacgtcagg gtgtccagtt ggcgaaggcc gccggccgct 6300
acaccggccg caaacgtgac gccggcatgc acgaccgcat catcacgctt cgctccggcg 6360
gatcgagcat tgccaagacg gccaagctgg tcggatgcag cccgagccag gtcaaacgag 6420
tgtgggcggc ctggaacgcg cagcagcaaa aataaagccg ggcagtgccc ggcttttctc 6480
accttttcgc gtcccgcagg gccgctgcga gcgccctacc tagatcctcg ctttccccct 6540
cggtgtagtc cggccagggc acgaagggcg cggatgcgaa cctgttgagc aggtacgcct 6600
tcgggcagcg gtagaccacc ggcgagttcg ccttttcatc ccaccgggcc aggatcacgt 6660
ccgcatcaca gtgcatgtcc ttcacctggt cgcggaagaa gccgaaggcc accatgccgc 6720
tatgttcgcc gaggaacgcc agttgcttcg cgctggcgat cgcgccgacg ccgccggcca 6780
aaaccgacgc catcacccag ccgacgaacc agaagctggc atgcttgcgg ttgaccaccg 6840
cacgcgcagc cgcgaccagg acaacggcca agctgccgac cagggccatg acgaccgtga 6900
tccggccgtt gtggaaagcg atgggcttgc cagcgtccgc ttgcacggcg tcgtaaatgc 6960
tggacccgat gggcgcgcac atcagcacga caggcagcag caccaggaac atcgtccgcg 7020
tccattgcgc gagtgccttg cggcgttcgc cggcggcaag cgcctccatc atcggcgtga 7080
agcccaacag ggccaccgca gccgccaagc cggcaacgat gccgcaggcg attacataca 7140
tacatcctcc ctaatgcgcc ttgcgcacgg ttgtagtcag agtccgcggt ggggcgataa 7200
gctcatgacc aaaatccctt aacgtgagtt ttcgttccac tgagcgtcag accccgtaga 7260
aaagatcaaa ggatcttctt gagatccttt ttttctgcgg gggatcagga ccgctgccgg 7320
agcgcaaccc actcactaca gcagagccat gtagacaaca tcccctcccc ctttccaccg 7380
cgtcagacgc ccgtagcagc ccgctacggg ctttttcatg ccctgcccta gcgtccaagc 7440
ctcacggccg cgctcggcct ctctggcggc cttctggcgc tcctgctgcg gcgtccgctc 7500
gtgggccgtg gcgcgggtcc gcgcgccggc ctcgtgcgcc tggcgctcgc gggcgaggtc 7560
cagggcggcc gtcttcacgt tctgccttgc gcagatgaga tagatcgatc tagcgtggac 7620
tcaaggctct cgcgaatggc tcgcgttgga aactttcatt gacacttgag gggcaccgca 7680
gggaaattct cgtccttgcg agaaccggct atgtcgtgct gcgcatcgag cctgcgccct 7740
tggcttgtct cgcccctctc cgcgtcgcta cggggcttcc agcgcctttc cgacgctcac 7800
cgggctggtt gccctcgccg ctgggctggc ggccgtctat ggccctgcaa acgcgccaga 7860
aacgccgtcg aagccgtgtg cgagacaccg cggccgccgg cgttgtggat acctcgcgga 7920
aaacttggcc ctcactgaca gatgaggggc ggacgttgac acttgagggg ccgactcacc 7980
cggcgcggcg ttgacagatg aggggcaggc tcgatttcgg ccggcgacgt ggagctggcc 8040
agcctcgcaa atcggcgaaa acgcctgatt ttacgcgagt ttcccacaga tgatgtggac 8100
aagcctgggg ataagtgccc tgcggtattg acacttgagg ggcgcgacta ctgacagatg 8160
aggggcgcga tccttgacac ttgaggggca gagtgctgac agatgagggg cgcacctatt 8220
gacatttgag gggctgtcca caggcagaaa atccagcatt tgcaagggtt tccgcccgtt 8280
tttcggccac cgctaacctg tcttttaacc tgcttttaaa ccaatattta taaaccttgt 8340
ttttaaccag ggctgcgccc tgtgcgcgtg accgcgcacg ccgaaggggg gtgccccccc 8400
ttctcgaacc ctcccggccc gctaacgcgg gcctcccatc cccccagggg ctgcgcccct 8460
cggccgcgaa cggcctcacc ccaaaaatgg cagcgctggc agtccttgcc attgccggga 8520
tcggggcagt aacgggatgg gcgatcagcc cgagcgcgac gcccggaagc attgacgtgc 8580
cgcaggtgct ggcatcgaca ttcagcgacc aggtgccggg cagtgagggc ggcggcctgg 8640
gtggcggcct gcccttcact tcggccgtcg gggcattcac ggacttcatg gcggggccgg 8700
caatttttac cttgggcatt cttggcatag tggtcgcggg tgccgtgctc gtgttcgggg 8760
gtgaattaat tccccggatc gatccgtcag cttcacgctg ccgcaagcac tcagggcgca 8820
agggctgcta aaggaagcgg aacacgtaga aagccagtcc gcagaaacgg tgctgacccc 8880
ggatgaatgt cagctactgg gctatctgga caagggaaaa cgcaagcgca aagagaaagc 8940
aggtagcttg cagtgggctt acatggcgat agctagactg ggcggtttta tggacagcaa 9000
gcgaaccgga attgccagct ggggcgccct ctggtaaggt tgggaagccc tgcaaagtaa 9060
actggatggc tttcttgccg ccaaggatct gatggcgcag gggatcaaga tcgacggatc 9120
gatccgggga attaattccg gggcaatccc gcaaggaggg tgaatgaatc ggacgtttga 9180
ccggaaggca tacaggcaag aactgatcga cgcggggttt tccgccgagg atgccgaaac 9240
catcgcaagc cgcaccgtca tgcgtgcgcc ccgcgaaacc ttccagtccg tcggctcgat 9300
ggtccagcaa gctacggcca agatcgagcg cgacagcgtg caactggctc cccctgccct 9360
gcccgcgcca tcggccgccg tggagcgttc gcgtcgtctc gaacaggagg cggcaggttt 9420
ggcgaagtcg atgaccatcg acacgcgagg aactatgacg accaagaagc gaaaaaccgc 9480
cggcgaggac ctggcaaaac aggtcagcga ggccaagcag gccgcgttgc tgaaacacac 9540
gaagcagcag atcaaggaaa tgcagctttc cttgttcgat attgcgccgt ggccggacac 9600
gatgcgagcg atgccaaacg acacggcccg ctctgccctg ttcaccacgc gcaacaagaa 9660
aatcccgcgc gaggcgctgc aaaacaaggt cattttccac gtcaacaagg acgtgaagat 9720
cacctacacc ggcgtcgagc tgcgggccga cgatgacgaa ctggtgtggc agcaggtgtt 9780
ggagtacgcg aagcgcaccc ctatcggcga gccgatcacc ttcacgttct acgagctttg 9840
ccaggacctg ggctggtcga tcaatggccg gtattacacg aaggccgagg aatgcctgtc 9900
gcgcctacag gcgacggcga tgggcttcac gtccgaccgc gttgggcacc tggaatcggt 9960
gtcgctgctg caccgcttcc gcgtcctgga ccgtggcaag aaaacgtccc gttgccaggt 10020
cctgatcgac gaggaaatcg tcgtgctgtt tgctggcgac cactacacga aattcatatg 10080
ggagaagtac cgcaagctgt cgccgacggc ccgacggatg ttcgactatt tcagctcgca 10140
ccgggagccg tacccgctca agctggaaac cttccgcctc atgtgcggat cggattccac 10200
ccgcgtgaag aagtggcgcg agcaggtcgg cgaagcctgc gaagagttgc gaggcagcgg 10260
cctggtggaa cacgcctggg tcaatgatga cctggtgcat tgcaaacgct agggccttgt 10320
ggggtcagtt ccggctgggg gttcagcagc cactcgatcg aggtcccaat acgcaaaccg 10380
cctctccccg cgcgttggcc gattcattaa tgcagctggc acgacaggtt tcccgactgg 10440
aaagcgggca gtgagcgcaa cgcaattaat gtgagttagc tcactcatta ggcaccccag 10500
gctttacact ttatgcttcc ggctcgtatg ttgtgtggaa ttgtgagcgg ataacaattt 10560
cacacaggaa acagctatga ccatgattac gccaagcttc catgggatat cgagatctcc 10620
tgcagagctc tagagtcgag actagtctcg acgggcccgg taccccctcg agggggccgc 10680
acttaagtta cgcgtggatc gtggagcttt cgggttttaa ctataacggt cctaaggtag 10740
cgaactcggg tcttgcctta atcccaacaa ccggattatc tacacggatt tcaatagctg 10800
atatagcgaa tcaccgagat taattaa 10827
<210> 46
<211> 506
<212> DNA
<213> Artificial Sequence
<220>
<223> origin of replication
<400> 46
atcacgtgct ataaaaataa ttataattta aattttttaa tataaatata taaattaaaa 60
atagaaagta aaaaaagaaa ttaaagaaaa aatagttttt gttttccgaa gatgtaaaag 120
actctagggg gatcgccaac aaatactacc ttttatcttg ctcttcctgc tctcaggtat 180
taatgccgaa ttgtttcatc ttgtctgtgt agaagaccac acacgaaaat cctgtgattt 240
tacattttac ttatcgttaa tcgaatgtat atctatttaa tctgcttttc ttgtctaata 300
aatatatatg taaagtacgc tttttgttga aattttttaa acctttgttt attttttttt 360
ttcttcattc cgtaactctt ctaccttctt tatttacttt ctaaaatcca aatacaaaac 420
ataaaaataa ataaacacag agtaaattcc caaattattc catcattaaa agatacgagg 480
cgcgtgtaag ttacaggcaa gcgatc 506
<210> 47
<211> 1020
<212> DNA
<213> Artificial Sequence
<220>
<223> selectionmarker
<400> 47
ttcaattcat catttttttt ttattctttt ttttgatttc ggtttccttg aaattttttt 60
gattcggtaa tctccgaaca gaaggaagaa cgaaggaagg agcacagact tagattggta 120
tatatacgca tatgtagtgt tgaagaaaca tgaaattgcc cagtattctt aacccaactg 180
cacagaacaa aaacctgcag gaaacgaaga taaatcatgt cgaaagctac atataaggaa 240
cgtgctgcta ctcatcctag tcctgttgct gccaagctat ttaatatcat gcacgaaaag 300
caaacaaact tgtgtgcttc attggatgtt cgtaccacca aggaattact ggagttagtt 360
gaagcattag gtcccaaaat ttgtttacta aaaacacatg tggatatctt gactgatttt 420
tccatggagg gcacagttaa gccgctaaag gcattatccg ccaagtacaa ttttttactc 480
ttcgaagaca gaaaatttgc tgacattggt aatacagtca aattgcagta ctctgcgggt 540
gtatacagaa tagcagaatg ggcagacatt acgaatgcac acggtgtggt gggcccaggt 600
attgttagcg gtttgaagca ggcggcagaa gaagtaacaa aggaacctag aggccttttg 660
atgttagcag aattgtcatg caagggctcc ctatctactg gagaatatac taagggtact 720
gttgacattg cgaagagcga caaagatttt gttatcggct ttattgctca aagagacatg 780
ggtggaagag atgaaggtta cgattggttg attatgacac ccggtgtggg tttagatgac 840
aagggagacg cattgggtca acagtataga accgtggatg atgtggtctc tacaggatct 900
gacattatta ttgttggaag aggactattt gcaaagggaa gggatgctaa ggtagagggt 960
gaacgttaca gaaaagcagg ctgggaagca tatttgagaa gatgcggcca gcaaaactaa 1020
<210> 48
<211> 228
<212> DNA
<213> Artificial Sequence
<220>
<223> SARS-CoV-2 E
<400> 48
atgtactcat tcgtttcgga agagacaggt acgttaatag ttaatagcgt acttcttttt 60
cttgctttcg tggtattctt gctagttaca ctagccattc ttactgcgct tcgattgtgt 120
gcgtactgtt gcaatattgt taacgtgagt cttgtaaaac cttcttttta cgtttactct 180
cgtgttaaaa atctgaattc ttctcgggtt cctgatcttc tggtctaa 228
<210> 49
<211> 669
<212> DNA
<213> Artificial Sequence
<220>
<223> SARS-CoV-2 M
<400> 49
atggcagatt ccaacggtac tattaccgtt gaggagctga aaaagctcct tgaacaatgg 60
aacctagtaa taggtttcct attccttaca tggatttgcc tgctgcaatt tgcctatgcc 120
aacaggaata ggtttttgta catcattaag ttgattttcc tctggctgtt atggccagta 180
actttagctt gttttgtgct tgctgctgtt tacagaataa attggatcac cggtggaatt 240
gctattgcaa tggcttgtct tgtaggattg atgtggctaa gctacttcat tgcttctttc 300
agactgtttg cgcgtacgcg ttccatgtgg tcattcaatc cagaaactaa cattcttctc 360
aacgtgccac tccatggaac tattctgact agaccgcttc tagaaagtga actcgtaatc 420
ggagctgtta tccttcgtgg acatcttcgt attgctggac atcatctagg acgctgtgac 480
atcaaggatc tacctaaaga aatcactgtt gctacatcac gaacgctttc ttattacaaa 540
ttgggagctt cacagcgtgt agcaggtgat tcaggttttg ctgcatatag tcgctacagg 600
attggcaact ataaattaaa cacagaccat tccagtagca gtgacaatat tgctttgctt 660
gtacagtaa 669
<210> 50
<211> 1260
<212> DNA
<213> Artificial Sequence
<220>
<223> SARS-CoV-2 N
<400> 50
atgtctgata atggacctca aaatcagcga aatgcacctc gcattacgtt tggtggacca 60
tcagattcaa ctggcagtaa ccagaatgga gaacgaagtg gtgcgcgatc aaaacaacgc 120
cgcccgcaag gtttacccaa taatactgcg tcttggttca ccgctctcac tcaacatggc 180
aaggaagatt taaaattccc tcgaggacaa ggcgttccaa ttaacaccaa tagcagtcca 240
gatgaccaaa ttggctacta ccgccgcgcc acaagacgaa ttcgtggtgg tgatggtaaa 300
atgaaagatc tcagtccaag atggtatttc tactatctag gaactgggcc agaagctgga 360
cttccttatg gtgctaacaa agatggcatc atatgggttg caactgaggg agccttgaat 420
acaccaaaag atcacattgg caccagaaat cctgctaaca atgctgcaat cgtgctacaa 480
cttcctcaag gaacaacatt accaaaaggt ttttacgcag aagggtctag aggtggaagt 540
caagcctctt ctagatcatc atcacgtagt cgcaacagtt caagaaattc aactccaggt 600
tcaagtagag gaacttctcc tgctagaatg gctggaaatg gaggtgatgc tgctcttgct 660
ttgttactac ttgacagatt gaaccagctt gagagcaaaa tgtctggtaa aggccaacaa 720
caacaaggcc aaactgtcac taagaaatct gctgctgagg cttctaagaa gcctagacaa 780
aaacgtactg ccactaaagc atacaatgta acacaagctt tcggcagacg tggtccagaa 840
caaactcaag gaaattttgg ggatcaggaa ctaatcagac aaggaactga ttacaaacat 900
tggccgcaaa ttgcacaatt tgctccttct gcttcagcgt tctttggaat gtcgagaatt 960
ggaatggaag tcacaccttc gggaacatgg ttgacctata caggtgccat caaattggat 1020
gacaaagatc caaatttcaa agatcaagtc attttgctga ataagcatat tgacgcatac 1080
aaaacattcc caccaacaga gcctaaaaag gacaaaaaga agaaggctga tgaaactcaa 1140
gccttaccgc agagacagaa gaaacagcaa actgtgactc ttcttcctgc tgcagatttg 1200
gatgatttct ccaaacaatt gcaacaatcc atgagcagtg ctgactcaac tcaggcctaa 1260
<210> 51
<211> 21290
<212> DNA
<213> Artificial Sequence
<220>
<223> SARS-CoV-2 ORF1ab
<400> 51
atggagagcc ttgtccctgg tttcaacgag aaaacacacg tccaactcag tttgcctgtt 60
ttacaggttc gcgacgtgtt agtacgtggt tttggagatt cagtggaaga agtcttatca 120
gaggcacgtc aacatcttaa agatggcact tgtggcttag tagaagttga aaaaggcgtt 180
ttgcctcaac ttgaacagcc ctatgtgttc atcaaacgtt ctgatgctag aactgcacct 240
catggtcatg ttatggttga gctggtagca gaattagaag gtattcagta cggtcgtagt 300
ggtgagacat taggtgtttt agttcctcat gtgggcgaaa taccagtggc ttaccgcaaa 360
gttcttctta gaaagaacgg taataaagga gctggtggcc atagttacgg cgctgattta 420
aagtcatttg acttaggcga cgagcttggc actgatcctt atgaagattt ccaagaaaac 480
tggaacacta aacatagcag tggtgttacc cgtgaactca tgcgtgagtt aaatggaggt 540
gcatacactc gctatgtcga taacaacttc tgtggacctg atggttaccc tcttgagtgc 600
attaaagacc ttctagcacg tgctggtaaa gcttcatgca ctttgtccga acaactggac 660
tttattgaca ctaagagggg tgtatactgc tgccgtgaac atgagcatga aattgcttgg 720
tacacggaac gttctgaaaa gagctatgaa ttgcagacac cttttgaaat taaactggca 780
aagaaatttg acaccttcaa tggggaatgt ccaaattttg tatttcccct caattccata 840
atcaagacta ttcaaccaag ggttgaaaag aaaaagcttg atggctttat gggtagaatt 900
cgatctgtct atccagttgc gtcaccaaat gaatgcaacc aaatgtgcct ttcaactctc 960
atgaagtgtg atcattgtgg tgaaacttca tggcagacgg gcgattttgt taaagccact 1020
tgcgaatttt gtggcactga gaatttgact aaagaaggtg ccactacttg tggttactta 1080
ccccaaaatg ctgttgttaa aatttactgt ccagcatgtc acaattcaga agtaggacct 1140
gagcatagtc ttgccgaata ccataatgaa tctggcttga aaaccattct tcgtaagggt 1200
ggtcgcacta ttgcttttgg aggatgtgtg ttctcttatg ttggttgcca taacaagtgt 1260
gcttattggg ttccacgtgc ttcagctaac ataggttgta accatacagg tgttgttgga 1320
gaaggttccg aaggtcttaa tgacaacctt cttgaaatac tccaaaaaga gaaagtcaac 1380
atcaatattg ttggtgactt taaacttaat gaagagatcg ccattatttt ggcatctttt 1440
tctgcttcca caagtgcttt tgtggaaact gtgaaaggtt tggattataa agcattcaaa 1500
cagattgttg aatcctgtgg taattttaag gttacaaagg gaaaagctaa aaaaggtgcc 1560
tggaatattg gtgaacagaa atcaatactg agtcctcttt atgcatttgc atcagaggct 1620
gctcgtgttg tacgatcaat tttctcccgc actcttgaaa ctgctcaaaa ttctgtgcgt 1680
gttttacaga aggccgctat aacaatacta gatggaattt cacagtattc actgagactc 1740
attgatgcta tgatgttcac atctgatttg gctactaaca atctagttgt aatggcctac 1800
attacaggtg gtgttgttca gttgacttcg cagtggctaa ctaacatctt tggcactgtt 1860
tatgaaaaac tcaaacccgt ccttgattgg cttgaagaga agtttaagga aggtgtagag 1920
tttcttagag acggttggga gattgttaaa ttcatctcaa cctgtgcttg tgaaattgtc 1980
ggtggacaaa ttgtcacctg tgctaaggaa attaaggaga gtgttcagac attctttaag 2040
cttgtaaaca agtttttggc tttgtgtgct gactctatca ttattggtgg agctaaactt 2100
aaagccttga atttaggtga aacatttgtc acgcactcaa agggattgta cagaaagtgt 2160
gttaaatcca gagaagaaac tggcctactc atgcctctaa aagccccaaa agaaattatc 2220
ttcttagagg gagaaacact tcccacagaa gtgttaacag aggaagttgt cttgaaaact 2280
ggtgatttac aaccattaga acaacctact agtgaagctg ttgaagctcc attggttggt 2340
acaccagttt gtattaacgg gcttatgttg ctcgaaatca aagacacaga aaagtactgt 2400
gcccttgcac ctaatatgat ggtaacaaac aataccttca cactcaaagg cggtgcacca 2460
acaaaggtta cttttggtga tgacactgtg atagaagtgc aaggttacaa gagtgtgaat 2520
atcacttttg aacttgatga aaggattgat aaagtactta atgagaagtg ctctgcctat 2580
acagttgaac tcggtacaga agtaaatgag ttcgcctgtg ttgtggcaga tgctgtcata 2640
aaaactttgc aaccagtatc tgaattactt acaccactgg gcattgattt agatgagtgg 2700
agtatggcta catactactt atttgatgag tctggtgagt ttaaattggc ttcacatatg 2760
tattgttctt tctaccctcc agatgaggat gaagaagaag gtgattgtga agaagaagag 2820
tttgagccat caactcaata tgagtatggt actgaagatg attaccaagg taaacctttg 2880
gaatttggtg ccacttctgc tgctttacaa cctgaagaag aacaagaaga agattggtta 2940
gatgatgata gtcaacaaac tgttggtcaa caagacggca gtgaggacaa tcagacaact 3000
actattcaaa caattgttga ggttcaacct caattagaga tggaacttac accagttgtt 3060
cagactattg aagtgaatag ttttagtggt tatcttaaac ttactgacaa tgtatacatc 3120
aagaatgcag acattgtgga agaagctaaa aaggtaaaac caacagtggt tgttaatgca 3180
gccaatgttt accttaaaca tggaggaggt gttgcaggag ccttaaataa ggctactaac 3240
aatgccatgc aagttgaatc tgatgattac atagctacta atggaccact taaagtgggt 3300
ggtagttgtg ttttaagcgg acacaatctt gctaaacact gtttacatgt tgtcggccca 3360
aatgttaaca aaggtgaaga tattcaactt cttaagagtg cttatgaaaa ttttaaccag 3420
cacgaagttc tacttgcacc attattatca gctggtattt ttggtgctga ccctatacat 3480
tctttaagag tttgtgtaga tactgttcgc acaaatgtct acttagctgt ctttgataaa 3540
aatctctatg acaaacttgt ttcaagcttt ttggaaatga agagtgaaaa gcaagttgaa 3600
caaaagatcg ctgagattcc taaagaggaa gttaagccat ttataactga aagtaaacct 3660
tcagttgaac agagaaaaca agatgataag aagatcaaag cttgtgttga agaagttaca 3720
acaactctgg aagaaactaa gttcctcaca gaaaacttgc tcctttatat cgacattaat 3780
ggcaatcttc atccagattc tgccactctt gttagtgaca ttgacatcac tttcttaaag 3840
aaagatgctc catatatagt gggtgatgtt gttcaagagg gtgttttaac tgctgtggtt 3900
atacctacta aaaaggctgg tggcactact gaaatgctag cgaaagcttt gagaaaagtg 3960
ccaacagaca attatataac cacttacccg ggtcagggtt taaatggtta cactgtagag 4020
gaggcaaaga cagtgcttaa aaagtgtaaa agtgcctttt acattctacc atctattatc 4080
tctaatgaga agcaagaaat tcttggaact gtttcttgga atttgcgaga aatgcttgca 4140
catgcagaag aaacacgcaa attaatgcct gtctgtgtgg aaactaaagc catagtttca 4200
actatacagc gtaaatataa gggtatcaag atacaagagg gtgtggttga ttatggtgct 4260
agattttact tttacaccag taaaacaact gtagcgtcac ttatcaacac acttaacgat 4320
ctaaatgaaa ctcttgttac aatgccactt ggctatgtaa cacatggctt aaatttggaa 4380
gaagctgctc ggtatatgag atctctcaaa gtgccagcta cagtttctgt ttcttcacct 4440
gatgctgtta cagcgtataa tggttatctt acttcttctt ctaaaacacc tgaagaacat 4500
tttattgaaa ccatctcact tgctggttcc tataaagatt ggtcctattc tggacaatct 4560
acacaactag gtatagaatt tcttaagaga ggtgataaaa gtgtatatta cacgtccaat 4620
cctaccacat tccacctaga tggtgaagtt atcacctttg acaatcttaa gacacttctt 4680
tctttgagag aagtgaggac tattaaggtg tttacaacag tagacaacat taacctccac 4740
acgcaagttg tggacatgtc aatgacatat ggacaacagt ttggtccaac ttatttggat 4800
ggagctgatg ttactaagat aaaacctcat aactcacatg aaggtaaaac attttacgtt 4860
ttgcctaatg atgacactct acgtgttgag gcttttgagt actaccacac aactgatcct 4920
agttttctgg gtaggtacat gtcagcatta aatcacacta aaaagtggaa atacccacaa 4980
gttaatggtt taacttcgat taaatgggca gataacaact gttatcttgc cactgcattg 5040
ttaacactcc aacaaataga gttgaagttt aatccacctg ctctacaaga tgcttattac 5100
agagcaaggg ctggtgaagc tgctaacttt tgtgcactta tcttagccta ctgtaataag 5160
acagtaggtg agttaggtga tgttagagaa acaatgagtt acttgtttca acatgccaat 5220
ttagattctt gcaaaagagt cttgaacgtg gtgtgtaaaa cttgtggaca acagcagaca 5280
acccttaagg gtgtagaagc tgttatgtac atgggcacac tttcttatga acaattcaag 5340
aaaggtgttc agataccttg tacgtgtggt aaacaagcta caaaatatct agtacaacag 5400
gagtcacctt ttgttatgat gtcagcacca cctgctcagt atgaacttaa gcatggtaca 5460
tttacttgtg ctagtgagta cactggtaat taccagtgtg gtcactataa gcatataact 5520
tctaaggaaa ctttgtattg catagacggt gctttactta caaagtcctc agaatacaaa 5580
ggtcctatta cggatgtttt ctacaaagaa aacagttaca caacaaccat aaaaccagtt 5640
acttataagt tggatggtgt tgtttgtaca gaaattgacc ctaagttgga caattattat 5700
aagaaggaca actcttattt cacagagcaa ccaattgatc ttgtaccaaa ccaaccatat 5760
ccaaacgcaa gcttcgataa ttttaagttc gtatgcgata atatcaaatt tgctgatgat 5820
ctcaaccagt taactggtta taagaaacct gcttcaagag agcttaaagt tacatttttc 5880
cctgacttaa atggtgatgt ggtggctatt gattataaac actacacacc ctcttttaag 5940
aaaggagcta aattgttaca taagcctatt gtttggcatg ttaacaatgc aactaataaa 6000
gccacgtata aaccaaatac ctggtgtata cgttgtcttt ggagcacaaa accagttgaa 6060
acatcaaatt cgtttgatgt actgaagtca gaggacgcgc agggaatgga taatcttgca 6120
tgtgaagatc taaaaccagt ctctgaagaa gtagtggaaa atcctaccat acagaaagac 6180
gttcttgagt gtaatgtgaa aactaccgaa gttgtaggag acattatact taaaccagca 6240
aataatagtt tgaagatcac agaagaggtt ggccacacag atctaatggc tgcttatgta 6300
gacaattcta gtcttactat taagaaacct aatgaactct ctagagtatt aggtttgaaa 6360
acccttgcta ctcatggttt agctgctgtt aatagtgtcc cttgggatac tatagctaat 6420
tatgctaagc cttttcttaa caaagttgtt agtacaacta ctaacatagt tacacggtgt 6480
cttaatcgtg tttgtactaa ttatatgcct tacttcttta ctttattgct acaattgtgt 6540
acttttacta gaagtacaaa ttctagaatc aaggcatcta tgccgactac tatagcaaag 6600
aatactgtta agagtgtcgg taaattttgt ctagaggctt catttaatta tctcaagtca 6660
cctaactttt ctaagctgat aaacattatc atctggtttt tgctattaag tgtttgccta 6720
ggttctttaa tctactcaac cgctgcttta ggtgttttaa tgtctaattt aggcatgcct 6780
tcttactgta ctggttacag agaaggctat ttgaactcta ctaatgtcac tattgcaacc 6840
tactgtactg gatctatacc ttgtagtgtt tgtcttagtg gtttagattc tttagacacc 6900
tatccttctc ttgaaactat acagattacc atttcatctt tcaaatggga tttaactgct 6960
tttggcttag ttgcagagtg gtttttggca tatattcttt tcactaggtt tttctatgta 7020
cttggattgg ctgcaatcat gcaattgttt ttcagctatt ttgcagtcca ttttattagt 7080
aactcttggc ttatgtggct tataattaat cttgtgcaga tggccccgat ttcagctatg 7140
gttagaatgt acatcttctt tgcctcattt tattatgtgt ggaaaagtta tgtgcatgtt 7200
gtagacggtt gtaattcatc aacttgtatg atgtgttaca aacgtaatag agcaacaaga 7260
gtcgaatgta caactattgt taatggtgtt agaaggtcct tttatgtcta tgctaatgga 7320
ggtaaaggct tttgcaaact acacaattgg aattgtgtta attgtgatac attctgtgct 7380
ggtagtacat ttattagtga tgaagttgcg agagacttgt cactacagtt taaaagacca 7440
ataaatccta ctgaccaatc ttcttacatc gttgatagtg ttacagtgaa gaatggttcc 7500
atccatcttt actttgataa agctggtcaa aagacttatg aaagacattc tctctctcat 7560
tttgttaact tagacaacct gagagctaat aacactaaag gttcattgcc tattaatgtt 7620
atcgttttcg acggtaaatc aaaatgtgaa gaatcatctg caaaatcagc gtctgtttac 7680
tacagtcagc ttatgtgtca acctatactg ttactagatc aggcattagt gtctgatgtt 7740
ggtgatagtg cggaagttgc agttaaaatg tttgatgctt acgttaatac gttttcatca 7800
acttttaacg taccaatgga aaaactcaaa acactagttg caactgcaga agctgaactt 7860
gcaaagaatg tgtccttaga caatgtctta tctacgttta tttcagcagc tcggcaaggg 7920
tttgttgatt cagatgtaga aactaaagat gttgttgaat gtcttaaatt gtcacatcaa 7980
tctgacatag aagttactgg cgatagttgt aataactata tgctcaccta taacaaagtt 8040
gaaaacatga caccccgtga ccttggtgct tgtattgact gtagtgctag acatattaat 8100
gcgcaggtag caaaaagtca caacattgct ttgatatgga acgttaaaga tttcatgtca 8160
ttgtctgaac aactacgaaa acaaatacgt agtgctgcta aaaagaataa cttacccttc 8220
aagttgacat gtgcaactac tagacaagtt gttaatgttg taacaacaaa gatagcactt 8280
aagggtggta aaattgtgaa taactggttg aagcagctta ttaaagttac acttgtgttc 8340
ctttttgttg ctgctatttt ctatctgata acacctgttc atgtcatgtc taaacatact 8400
gacttttcaa gtgaaatcat aggatacaag gctattgatg gtggtgtcac tcgtgacata 8460
gcatctacag atacttgttt tgctaacaaa catgctgatt ttgacacatg gtttagccag 8520
cgtggtggta gttatactaa tgacaaagct tgcccattga ttgctgcagt cataacaaga 8580
gaagtgggtt ttgtcgttcc tggtttgcct ggaacgatat tacgcacaac taatggtgac 8640
tttttgcatt tcttacctag agtttttagt gcagttggta acatctgtta cacaccatca 8700
aaacttatag agtacactga ctttgcaaca tcagcttgtg ttttggctgc tgaatgtaca 8760
atttttaaag acgcttctgg taagccagta ccatattgtt atgataccaa tgtactagaa 8820
ggttctgttg cttatgaaag tttacgccct gacacacgtt atgtgctcat ggatggctct 8880
attattcaat ttcctaacac ctaccttgaa ggttctgtaa gagtggtaac aacttttgat 8940
tctgagtact gtaggcacgg cacttgtgaa agatcagaag ctggtgtttg tgtatctact 9000
agtggtagat gggtacttaa caacgattat tacagatctt taccaggagt tttctgtggt 9060
gtagatgctg taaatttgct tactaacatg tttacaccac taattcaacc tattggtgct 9120
ttggacatat cagcatctat agtagctggt ggtattgtag ctatcgtagt aacatgcctt 9180
gcctactatt ttatgaggtt tagacgtgct tttggtgaat acagtcatgt agttgccttt 9240
aatactctcc tattccttat gtcattcact gtactctgtt taacaccagt ttactcattc 9300
ttacctggtg tttattctgt tatttacctg tacttgacat tttatctgac taatgatgtt 9360
tcttttctcg cacatattca gtggatggtt atgttcacac ctttagtacc tttctggata 9420
acaattgctt acatcatttg tatttccaca aagcatttct attggttctt tagtaattac 9480
ctaaagagac gtgtagtctt taatggtgtt tcctttagta cttttgaaga agctgcgctg 9540
tgcacctttt tgttaaataa ggagatgtat ctaaagttgc gtagtgatgt gctattacct 9600
cttacgcaat ataatagata cttagctctt tataacaagt acaagtattt cagtggagca 9660
atggatacaa ctagctacag agaagctgct tgttgtcatc tcgcaaaggc tctcaatgac 9720
ttcagtaact caggttctga tgttctttac caaccaccac aaacctctat cacctcagct 9780
gttttgcaga gtggttttag aaaaatggca ttcccatctg gtaaagttga gggttgtatg 9840
gtacaagtaa cttgtggtac aactacactt aacggtcttt ggcttgatga cgtagtttac 9900
tgtccaagac atgtgatctg cacctctgaa gatatgctta accctaatta tgaagatcta 9960
ctcatccgta agtctaatca taacttcttg gtacaggctg gtaatgttca actcagggtt 10020
attggacatt ctatgcaaaa ttgtgtactt aagcttaagg ttgatacagc caatcctaag 10080
acacctaagt ataagtttgt tcgcattcaa ccaggacaga ctttttcagt gttagcttgt 10140
tacaatggtt caccatctgg tgtttaccaa tgtgctatga ggcccaattt cactattaag 10200
ggttcattcc ttaatggttc atgtggtagt gttggtttta acatagatta tgactgtgtc 10260
tctttttgtt acatgcacca tatggaatta ccaactggag ttcatgctgg cacagactta 10320
gaaggtaact tttatggacc ttttgttgac aggcaaacag cacaagcagc tggtacagat 10380
acaactatta cagttaatgt tcttgcttgg ttgtacgctg ctgttataaa tggagacagg 10440
tggtttctca atcgatttac cacaactctt aatgacttta accttgtggc tatgaagtac 10500
aattatgaac ctctaacaca agaccatgtt gacatactag gacctctttc tgctcaaact 10560
ggaattgccg ttttagatat gtgtgcttca ttaaaagaac ttctgcaaaa tggtatgaat 10620
ggacgtacca tattgggtag tgctttatta gaagatgagt ttacaccttt tgatgttgtt 10680
agacaatgct caggtgttac tttccaaagt gcagtgaaaa gaacaatcaa gggtacacac 10740
cactggttgt tactcacaat tttgacttca cttttagttt tagtccagag tactcaatgg 10800
tctttgttct ttttcttcta cgaaaatgcc tttttacctt ttgctatggg tattattgct 10860
atgtctgctt ttgcaatgat gtttgtcaaa cataagcatg catttctctg tttgtttttg 10920
ttaccttctc ttgccactgt agcttacttt aatatggtct acatgcctgc tagttgggtg 10980
atgcgtatta tgacatggtt ggatatggtt gatactagtt tgtctggttt taagctaaaa 11040
gactgtgtta tgtatgcatc agctgtagtg ttactaatcc ttatgacagc aagaactgtg 11100
tatgatgatg gtgctaggag agtgtggaca cttatgaatg tcttgacact cgtttataaa 11160
gtttactatg gcaacgcttt agatcaagcc atttccatgt gggctcttat aatctctgtt 11220
acttctaact actcaggtgt agttacaact gtcatgtttt tggccagagg tattgttttt 11280
atgtgtgttg agtattgccc tattttcttc ataactggta atacacttca gtgtataatg 11340
ctagtctatt gtttcttagg ctatttttgt acttgttact tcggcctctt ttgtttactc 11400
aaccgctact ttagactgac tcttggtgtt tatgattact tagtgtctac acaggagttt 11460
agatatatga attcacaggg actactccca cccaagaata gcatagatgc cttcaaactc 11520
aacattaaat tgttgggtgt tggtggcaaa ccttgtatca aagtagccac tgtacagtct 11580
aaaatgtcag atgtaaagtg cacatcagta gtcttactct cagttttgca acaactcaga 11640
gtagaatcat catctaaatt gtgggctcaa tgtgtccagt tacacaatga cattctctta 11700
gctaaagata ctactgaagc ctttgaaaaa atggtttcac tactttctgt tttgctttcc 11760
atgcagggtg ctgtagacat aaacaagctt tgtgaagaaa tgctggacaa cagggcaacc 11820
ttacaagcta tagcctcaga gtttagttcc cttccatcat atgcagcttt tgctactgct 11880
caagaagctt atgagcaggc tgttgctaat ggtgattctg aagttgttct taaaaagttg 11940
aagaagtctt tgaatgtggc taaatctgaa tttgaccgtg atgcagccat gcaacgtaag 12000
ttggaaaaga tggctgatca agctatgacc caaatgtata aacaggctag atctgaggac 12060
aagagggcaa aagttactag tgctatgcag acaatgcttt tcactatgct tagaaagttg 12120
gataatgatg cactcaacaa cattatcaac aatgcaagag atggttgtgt tcccttgaac 12180
ataatacctc ttacaacagc agccaaacta atggttgtca taccagacta caacacatat 12240
aagaatacgt gtgatggtac aacatttact tatgcatcag cattgtggga aatccaacag 12300
gttgtagatg cagatagtaa aattgttcag cttagtgaaa ttagtatgga caattcacct 12360
aatttagcat ggcctcttat tgtaacagct ttaagggcca attctgctgt caaattacag 12420
aataatgagc ttagtcctgt tgcactaaga caaatgtctt gtgctgccgg tactacacaa 12480
actgcttgca ctgatgacaa tgcgttagct tactacaaca caacaaaggg aggtaggttt 12540
gtacttgcac tgttatccga tttacaggat ttgaaatggg ctagattccc taagagtgat 12600
ggaactggta ctatctatac agaactggaa ccaccttgta ggtttgttac agacacacct 12660
aaaggtccta aagtgaagta tctttacttc atcaaaggat taaacaacct aaatagaggt 12720
atggtacttg gtagtttagc tgccacagta cgtttacaag ctggtaatgc aacagaagtt 12780
cctgctaatt caactgtact ttctttctgt gcttttgctg tagatgctgc taaagcttac 12840
aaagattatc tagctagtgg gggacaacca atcactaatt gtgttaagat gttgtgtaca 12900
cacactggta ctggtcaggc aataacagtt acaccggaag ccaatatgga tcaagaatcc 12960
tttggtggtg catcgtgttg tctgtactgc cgttgtcata tagatcatcc aaatcctaaa 13020
ggattttgtg acttaaaagg taagtatgta caaataccta caacttgtgc taatgaccct 13080
gtgggtttta cacttaaaaa cacagtctgt accgtctgcg gtatgtggaa aggttatggt 13140
tgtagttgtg atcaactccg cgaacccatg cttcagtcag ctgatgcaca atcgttttta 13200
aacgggtttg cggtgtaagt gcagcccgtc ttacaccgtg cggcacaggc actagtactg 13260
atgtcgtata tagagctttt gacatctaca atgataaagt agctggtttt gctaagttcc 13320
taaaaactaa ttgttgtcgc ttccaagaaa aggacgaaga tgacaatctc attgattctt 13380
actttgtagt taagagacac actttctcta actaccaaca tgaagaaaca atttacaacc 13440
tgcttaagga ttgtccagct gttgctaaac atgacttctt taagtttaga atagacggtg 13500
acatggtacc acatatatca cgtcaacgtc ttactaaata cacaatggca gacctcgtct 13560
atgctttaag gcattttgat gaaggtaatt gtgacacatt aaaagaaata cttgtcacat 13620
acaattgttg tgatgatgac tacttcaata aaaaggactg gtatgatttt gtagaaaacc 13680
cagatatatt acgcgtatac gccaacttag gtgaacgtgt acgccaagct ttgttaaaaa 13740
cagtacagtt ctgtgatgcc atgcgaaatg ctggtattgt tggtgtactg acattagata 13800
atcaagatct caatggtaac tggtatgact ttggtgattt catacaaacc acgccaggta 13860
gtggagttcc tgttgtagac tcttattatt cattgctcat gcctatatta accttgacca 13920
gggctttaac tgcagagtca catgttgaca ctgacttaac aaagccttac attaagtggg 13980
atttgttaaa atacgacttc acggaagaga ggttaaaact ctttgaccgt tattttaaat 14040
actgggatca gacataccac ccaaattgtg ttaactgttt ggatgacaga tgcattctgc 14100
attgtgcaaa ctttaatgtt ctgttctcta cagtgttccc acctacaagt tttggaccac 14160
tagtgagaaa aatatttgtt gatggtgttc catttgtagt ttcaactgga taccacttca 14220
gagagctagg tgttgtacat aatcaggatg taaacttaca tagctctaga cttagtttta 14280
aggaattact tgtgtatgct gctgatcctg ctatgcatgc tgcttctggt aatctattac 14340
tagataaacg cactacgtgc ttttcagtag ctgcacttac taacaatgtt gcttttcaaa 14400
ctgtcaaacc cggtaatttt aacaaggact tctatgactt tgctgtgtct aagggtttct 14460
ttaaggaagg aagttctgtt gaattaaaac acttcttctt tgctcaggat ggtaatgctg 14520
ctatcagcga ttatgactac tatcgttata atctaccaac aatgtgtgat atcagacaac 14580
tactatttgt agttgaagtt gttgataagt actttgattg ttacgatggt ggctgtatta 14640
atgctaacca agtcatcgtc aacaacctag acaaatcagc tggttttcca tttaataaat 14700
ggggtaaggc tagactttat tatgattcca tgagttatga ggatcaagat gcacttttcg 14760
catatacaaa acgtaatgtc atccctacta taactcaaat gaaccttaag tatgccatta 14820
gtgcaaagaa tagagctcgc accgtagctg gtgtctctat ctgtagtact atgaccaata 14880
gacagtttca tcaaaaatta ctcaagtcaa tagccgccac tagaggagct actgtagtaa 14940
ttggaacaag caaattctat ggtggttggc acaacatgct caaaactgtt tatagtgatg 15000
tagaaaaccc tcaccttatg ggttgggatt atcctaaatg tgatagagcc atgcctaaca 15060
tgcttagaat tatggcctca cttgttcttg ctcgcaaaca tacaacgtgt tgtagcttgt 15120
cacaccgttt ctatagatta gctaatgagt gtgctcaagt attgagtgaa atggtcatgt 15180
gtggcggttc actatatgtt aaaccaggtg gaacctcatc aggagatgcc acaactgctt 15240
atgctaatag tgtgtttaac atttgtcaag ctgtcacggc caatgttaat gcacttttat 15300
ctactgatgg taacaaaatt gccgataagt atgtccgcaa tttacaacac agactttatg 15360
agtgtctcta tagaaataga gatgttgaca cagactttgt gaatgagttt tacgcatatt 15420
tgcgtaaaca tttctcaatg atgatactct ctgacgatgc tgttgtgtgt ttcaatagca 15480
cttatgcatc tcaaggtcta gtggctagca taaagaactt taagtcagtt ctttactatc 15540
aaaacaacgt ttttatgtct gaagcaaaat gttggactga gactgacctt actaaaggac 15600
ctcatgaatt ttgctctcaa catacaatgc tagttaaaca gggtgatgat tatgtgtacc 15660
ttccttaccc agatccatca agaatcctag gtgccggttg ttttgtagat gatatcgtaa 15720
aaacagatgg tacacttatg attgaacggt tcgtgtcttt agctatagat gcttacccac 15780
ttactaaaca tcctaatcag gagtatgctg atgtctttca tttgtactta caatacatac 15840
gtaagctaca tgatgagtta acaggacaca tgttagacat gtattctgtt atgcttacta 15900
atgataacac ttcaaggtat tgggaacctg agttttatga ggctatgtac acaccgcata 15960
cagtcttaca agctgttggt gcttgtgttc tttgcaattc acagacttca ttaagatgtg 16020
gtgcttgcat acgtagacca ttcttatgtt gtaaatgctg ttacgaccat gtcatctcaa 16080
catcacataa attagtcttg tctgttaatc cgtatgtttg caatgctcca ggttgtgatg 16140
tcacagatgt gactcaactt tacttaggag gtatgagcta ttactgtaag tcacataaac 16200
cacccattag ttttccattg tgtgctaatg gacaagtttt tggtctctac aagaatacat 16260
gtgttggtag cgataatgtt actgacttta atgcaattgc aacatgtgac tggacaaatg 16320
ctggtgatta cattttagct aacacctgta ctgaaagact caagcttttt gcagcagaaa 16380
cgctcaaagc tactgaggag acatttaaac tgtcttatgg tattgctact gtacgtgaag 16440
tgctgtctga cagagaatta catctttcat gggaagttgg taaacctaga ccaccactta 16500
accgaaatta tgtctttact ggttatcgtg taactaaaaa cagtaaagtg caaatcggag 16560
agtacacctt tgaaaaaggt gactatggtg atgctgttgt ttaccgaggt acaacaactt 16620
acaaactcaa cgttggtgat tattttgtgc tgacatcaca tacagtaatg ccattaagtg 16680
cacctacact agtgccacaa gagcactatg ttagaattac tggcttatac ccaacactca 16740
atatctcaga tgagttttct agcaatgttg caaattatca aaaggttggt atgcaaaagt 16800
attctacact ccagggacca cctggtactg gtaaaagtca ttttgctatt ggtctagctc 16860
tctactaccc ttctgctcgc atagtatata cagcttgctc tcatgcagct gttgatgcac 16920
tatgtgagaa ggcattaaaa tatttgccca tagacaaatg tagtagaatt atacctgcac 16980
gtgctcgtgt agagtgtttt gataaattca aggtgaattc aacattagaa cagtatgtct 17040
tttgtactgt aaatgcattg cctgagacga cagcagatat agttgtcttt gatgaaattt 17100
caatggccac aaattatgat ttgagtgttg tcaatgccag attacgtgct aagcactatg 17160
tgtacattgg tgatcctgct caattacctg caccacgcac attactaact aagggtacac 17220
tagaaccaga atatttcaat tcagtgtgta gacttatgaa aactataggt ccagacatgt 17280
tcctcggaac ttgtcgtaga tgtcctgctg aaattgttga cactgtgagt gctttggttt 17340
atgataataa gcttaaggca cataaagaca aatcagctca atgctttaaa atgttctaca 17400
agggtgttat cacgcatgat gtttcatctg caattaacag gccacaaata ggcgtggtaa 17460
gagaattcct tacacgtaac cctgcttgga gaaaagctgt ctttatttca ccttacaatt 17520
cccagaatgc tgtagcctca aagattttgg gactaccaac tcaaactgtt gattcatcac 17580
agggctcaga atatgactat gtcatattca ctcaaaccac tgaaacagct cactcttgta 17640
atgtaaacag attcaacgtt gctattacca gagcaaaagt aggcatactt tgcataatgt 17700
ctgatagaga cctttatgac aagttgcaat ttacaagtct tgaaattcca cgtaggaatg 17760
tggcaacttt acaagctgaa aatgtaacag gactctttaa agattgtagt aaggtaatca 17820
ctgggttaca tcctacacag gcacctacac acttaagtgt tgatactaaa ttcaaaactg 17880
aaggtttatg tgttgacata cctggcatac ctaaggacat gacctataga agattaatct 17940
ctatgatggg tttcaaaatg aattaccagg ttaatggtta ccctaacatg tttatcaccc 18000
gcgaagaagc tataagacat gtacgtgcat ggattggctt cgatgtcgaa ggttgtcatg 18060
ctactagaga agctgttggt accaatttac ctttacagct aggtttttct acaggtgtta 18120
acctagttgc tgtacctaca ggttatgttg atacacctaa taatacagat ttttccagag 18180
ttagtgctaa accaccgcct ggagatcaat ttaaacacct cataccactt atgtacaaag 18240
gacttccttg gaatgtagtg cgtataaaga ttgtccaaat gttaagtgac acacttaaaa 18300
atctctctga cagagtcgta tttgtcttat gggcacatgg ctttgagttg acatctatga 18360
agtattttgt gaagatcgga cctgagcgca catgttgtct atgtgataga cgtgctacat 18420
gcttttccac tgcttcagac acttatgcct gttggcatca ttctattgga tttgattacg 18480
tctataatcc gtttatgatt gatgttcaac aatggggttt tacaggtaac ctacaaagca 18540
accatgatct gtattgtcaa gtccatggta atgcacatgt agctagttgt gatgcaatca 18600
tgactaggtg tctagctgtc cacgagtgct ttgttaagcg tgttgactgg actattgaat 18660
atcctataat cggtgatgaa ctgaagatta atgcggcttg tagaaaggtt caacacatgg 18720
ttgttaaagc tgcattatta gcagacaaat tcccagttct tcacgacatt ggtaacccta 18780
aagctattaa gtgtgtacct caagctgatg tagaatggaa gttctatgat gcacagcctt 18840
gtagtgacaa agcttacaaa atagaagaac tgttctattc ttatgccaca cattctgaca 18900
aattcacaga tggtgtatgc ctattttgga attgcaatgt cgatagatat cctgctaatt 18960
ccattgtttg tagatttgac actagagtgc tatctaacct taacttgcct ggttgtgatg 19020
gtggcagttt gtatgtaaat aagcatgcat tccacacacc agcttttgat aaaagtgctt 19080
ttgttaatct aaagcaactt ccatttttct attactctga cagtccatgt gagtctcatg 19140
gaaaacaagt agtgtcagat atagattatg taccactaaa gtctgctacg tgtataacac 19200
gttgcaattt aggtggtgct gtctgtagac atcatgctaa tgagtacaga ttgtatctcg 19260
atgcttataa catgatgatc tcagctggct ttagcttgtg ggtttacaaa caatttgata 19320
cctataacct ctggaacact tttacaagac ttcagagttt agaaaatgtg gcttttaatg 19380
ttgtaaataa gggacacttt gatggacaac agggtgaagt accagtttct atcattaaca 19440
acactgttta cacaaaagtt gatggtgttg atgtagaatt gtttgagaac aaaaccacat 19500
tacctgttaa tgtagcattt gagctttggg ctaagcgcaa cattaaacca gtaccagagg 19560
tgaaaatact caataatttg ggtgtggaca ttgctgctaa tactgtgatc tgggactaca 19620
aaagagatgc tccagcacat atatctacta ttggtgtttg ttctatgact gacatagcca 19680
agaaaccaac tgaaacgatt tgtgcaccac tcactgtctt ttttgatggt agagttgatg 19740
gtcaagtaga cttatttaga aatgcccgta atggtgttct tattacagaa ggtagtgtta 19800
aaggtttaca accatctgta ggtcccaaac aagctagtct taatggagtc acattaattg 19860
gagaagccgt aaaaacacag ttcaattatt acaagaaagt ggatggtgtt gtccaacaat 19920
tacctgaaac ttactttact cagagtagaa acttacagga atttaagccc aggagtcaaa 19980
tggaaattga tttcttagaa cttgctatgg atgaattcat tgaacggtat aaattagaag 20040
gctatgcctt cgaacatatc gtttatggag attttagtca tagtcagtta ggtggtttac 20100
atctactgat tggactagct aaacgtttta aggaatcacc ttttgaactt gaagatttta 20160
ttcctatgga cagtacagtt aaaaactact tcataacaga tgcgcaaaca ggttcatcta 20220
agtgtgtgtg ttctgttatt gatcttttac ttgatgactt cgttgaaata ataaagtccc 20280
aagatttatc tgtagtttct aaggttgtca aagtgactat tgactataca gaaatctcat 20340
ttatgctttg gtgtaaagat ggccatgtag aaacatttta cccaaaatta caatctagtc 20400
aagcgtggca accgggtgtt gctatgccta atctttacaa aatgcaaaga atgctattag 20460
aaaagtgtga ccttcaaaat tatggtgata gtgcaacatt acctaaaggc ataatgatga 20520
atgtcgcaaa atatactcaa ctgtgtcaat atttaaacac actgacatta gctgtaccct 20580
ataatatgag agttatccat tttggtgctg gttctgataa aggagttgca ccaggtacag 20640
ctgttttaag acaatggttg cctacaggta cgctgcttgt cgattcagat cttaatgact 20700
ttgtctctga tgcagattca actttgattg gtgattgtgc aactgtacat acagctaata 20760
aatgggatct cattattagt gatatgtacg accctaagac taagaatgtc acaaaagaaa 20820
acgactctaa agagggtttt ttcacttaca tttgtgggtt tatacaacaa aagctagctc 20880
ttggaggttc cgtggctata aagataacag aacattcttg gaatgctgat ctttataagc 20940
tcatgggaca cttcgcatgg tggacagcct ttgttactaa tgtgaatgcg tcatcatctg 21000
aagcattttt aatcggatgt aactaccttg gcaaaccacg cgaacaaata gatggttatg 21060
tcatgcatgc aaattacata ttttggagga atacaaatcc aattcagctt tcttcttatt 21120
ctttattcga catgagtaaa ttccccctta aattaagggg tactgctgtt atgtctttaa 21180
aagaaggtca aatcaatgat atgattctct ctcttcttag taaaggtaga cttataatta 21240
gagaaaacaa cagagttgtt atttctagtg atgttcttgt taacaactaa 21290
<210> 52
<211> 828
<212> DNA
<213> Artificial Sequence
<220>
<223> SARS-CoV-2 ORF3a
<400> 52
atggatttgt ttatgagaat cttcacaatt ggaactgtaa ctttgaagca aggtgaaatc 60
aaggatgcta ctccttcaga ttttgttaga gctactgcaa cgataccgat acaagcatca 120
cttcctttcg gatggcttat tgttggcgtt gcacttcttg ctgtttttca gagcgcttcc 180
aaaatcataa ccctcaaaaa gagatggcaa ctagcactct ccaagggtgt tcactttgtt 240
tgcaacttgc tgttgttgtt tgtaacagtt tactcacatc ttttgcttgt tgctgctggc 300
cttgaagccc cttttctcta tctttatgct ttagtctact tcttgcagag tataaacttt 360
gtacgcataa taatgaggct ttggctttgc tggaaatgcc gttccaaaaa cccattactt 420
tatgatgcca actattttct ttgctggcat actaattgtt acgactattg tataccttac 480
aatagtgtaa cttcttcaat tgtcattact tcaggtgatg gcacaacaag tcctatttct 540
gaacatgact accagattgg tggttatact gaaaaatggg aatctggagt aaaagactgt 600
gttgtattac acagttactt cacttcagac tattaccagc tgtactcaac tcaattgagt 660
acagacactg gtgttgaaca tgttaccttc ttcatctaca ataaaatcgt tgatgagcct 720
gaagaacatg tccaaattca cacaatcgac gtttcatccg gagttgttaa tccagtaatg 780
gaaccaattt atgatgaacc gacgacgact actagcgtgc ctttgtaa 828
<210> 53
<211> 186
<212> DNA
<213> Artificial Sequence
<220>
<223> SARS-CoV-2 ORF6
<400> 53
atgtttcatc tcgttgactt tcaggttact atagcagaga tattactaat catcatgagg 60
acttttaaag tttccatttg gaatcttgat tacatcataa acctcataat taagaactta 120
agcaagtcac taactgagaa taaatattct caactagacg aggagcagcc aatggagatt 180
gattaa 186
<210> 54
<211> 366
<212> DNA
<213> Artificial Sequence
<220>
<223> SARS-CoV-2 ORF7a
<400> 54
atgaaaatta ttcttttctt ggcactgata acactcgcta cttgtgagct ttatcactac 60
caagagtgtg ttagaggtac aacagtactt ttaaaagaac cttgctcgtc gggaacatac 120
gagggcaatt caccatttca tcctctagct gataacaaat ttgcactgac ttgctttagc 180
actcaatttg cttttgcttg tcctgacggc gtaaaacacg tctatcagtt acgtgccaga 240
tcagtttcac ctaaactgtt catcagacaa gaggaagttc aagaacttta ctctccaatt 300
tttcttattg ttgcggcaat agtgtttata acactttgct tcacactcaa aagaaagaca 360
gaatga 366
<210> 55
<211> 366
<212> DNA
<213> Artificial Sequence
<220>
<223> SARS-CoV-2 ORF8
<400> 55
atgaaatttc ttgttttctt aggaatcatc acaactgtag ctgcatttca ccaagaatgt 60
agtttacagt catgtactca acatcaacca tatgtagttg atgacccgtg tcctattcac 120
ttctattcta aatggtatat cagagtagga gctagaaaat cagcaccttt aattgaattg 180
tgcgtggatg aggctggttc taaatcaccc attcagtaca tcgatatcgg taattataca 240
gtttcctgtt taccttttac aattaactgc caggaaccta aattgggtag tcttgtagtg 300
cgttgttcgt tctacgagga ctttttagag tatcatgacg ttcgtgttgt tttagatttc 360
atctaa 366
<210> 56
<211> 265
<212> DNA
<213> Artificial Sequence
<220>
<223> SARS-CoV-2 5'UTR
<400> 56
attaaaggtt tataccttcc caggtaacaa accaaccaac tttcgatctc ttgtagatct 60
gttctctaaa cgaactttaa aatctgtgtg gctgtcactc ggctgcatgc ttagtgcact 120
cacgcagtat aattaataac taattactgt cgttgacagg acacgagtaa ctcgtctatc 180
ttctgcaggc tgcttacggt ttcgtccgtg ttgcagccga tcatcagcac atctaggttt 240
cgtccgggtg tgaccgaaag gtaag 265
<210> 57
<211> 206
<212> DNA
<213> Artificial Sequence
<220>
<223> SARS-CoV-2 3'UTR
<400> 57
caatctttaa tcagtgtgta acattaggga ggacttgaaa gagccaccac attttcaccg 60
aggccacgcg gagtacgatc gagtgtacag tgaacaatgc tagggagagc tgcctatatg 120
gaagagccct aatgtgtaaa attaatttta gtagtgctat ccccatgtga ttttaatagc 180
ttcttaggag aatgacaaaa aaaaac 206
<210> 58
<211> 13203
<212> DNA
<213> Artificial Sequence
<220>
<223> SARS-CoV-2 orf1a
<400> 58
atggagagcc ttgtccctgg tttcaacgag aaaacacacg tccaactcag tttgcctgtt 60
ttacaggttc gcgacgtgtt agtacgtggt tttggagatt cagtggaaga agtcttatca 120
gaggcacgtc aacatcttaa agatggcact tgtggcttag tagaagttga aaaaggcgtt 180
ttgcctcaac ttgaacagcc ctatgtgttc atcaaacgtt ctgatgctag aactgcacct 240
catggtcatg ttatggttga gctggtagca gaattagaag gtattcagta cggtcgtagt 300
ggtgagacat taggtgtttt agttcctcat gtgggcgaaa taccagtggc ttaccgcaaa 360
gttcttctta gaaagaacgg taataaagga gctggtggcc atagttacgg cgctgattta 420
aagtcatttg acttaggcga cgagcttggc actgatcctt atgaagattt ccaagaaaac 480
tggaacacta aacatagcag tggtgttacc cgtgaactca tgcgtgagtt aaatggaggt 540
gcatacactc gctatgtcga taacaacttc tgtggacctg atggttaccc tcttgagtgc 600
attaaagacc ttctagcacg tgctggtaaa gcttcatgca ctttgtccga acaactggac 660
tttattgaca ctaagagggg tgtatactgc tgccgtgaac atgagcatga aattgcttgg 720
tacacggaac gttctgaaaa gagctatgaa ttgcagacac cttttgaaat taaactggca 780
aagaaatttg acaccttcaa tggggaatgt ccaaattttg tatttcccct caattccata 840
atcaagacta ttcaaccaag ggttgaaaag aaaaagcttg atggctttat gggtagaatt 900
cgatctgtct atccagttgc gtcaccaaat gaatgcaacc aaatgtgcct ttcaactctc 960
atgaagtgtg atcattgtgg tgaaacttca tggcagacgg gcgattttgt taaagccact 1020
tgcgaatttt gtggcactga gaatttgact aaagaaggtg ccactacttg tggttactta 1080
ccccaaaatg ctgttgttaa aatttactgt ccagcatgtc acaattcaga agtaggacct 1140
gagcatagtc ttgccgaata ccataatgaa tctggcttga aaaccattct tcgtaagggt 1200
ggtcgcacta ttgcttttgg aggatgtgtg ttctcttatg ttggttgcca taacaagtgt 1260
gcttattggg ttccacgtgc ttcagctaac ataggttgta accatacagg tgttgttgga 1320
gaaggttccg aaggtcttaa tgacaacctt cttgaaatac tccaaaaaga gaaagtcaac 1380
atcaatattg ttggtgactt taaacttaat gaagagatcg ccattatttt ggcatctttt 1440
tctgcttcca caagtgcttt tgtggaaact gtgaaaggtt tggattataa agcattcaaa 1500
cagattgttg aatcctgtgg taattttaag gttacaaagg gaaaagctaa aaaaggtgcc 1560
tggaatattg gtgaacagaa atcaatactg agtcctcttt atgcatttgc atcagaggct 1620
gctcgtgttg tacgatcaat tttctcccgc actcttgaaa ctgctcaaaa ttctgtgcgt 1680
gttttacaga aggccgctat aacaatacta gatggaattt cacagtattc actgagactc 1740
attgatgcta tgatgttcac atctgatttg gctactaaca atctagttgt aatggcctac 1800
attacaggtg gtgttgttca gttgacttcg cagtggctaa ctaacatctt tggcactgtt 1860
tatgaaaaac tcaaacccgt ccttgattgg cttgaagaga agtttaagga aggtgtagag 1920
tttcttagag acggttggga gattgttaaa ttcatctcaa cctgtgcttg tgaaattgtc 1980
ggtggacaaa ttgtcacctg tgctaaggaa attaaggaga gtgttcagac attctttaag 2040
cttgtaaaca agtttttggc tttgtgtgct gactctatca ttattggtgg agctaaactt 2100
aaagccttga atttaggtga aacatttgtc acgcactcaa agggattgta cagaaagtgt 2160
gttaaatcca gagaagaaac tggcctactc atgcctctaa aagccccaaa agaaattatc 2220
ttcttagagg gagaaacact tcccacagaa gtgttaacag aggaagttgt cttgaaaact 2280
ggtgatttac aaccattaga acaacctact agtgaagctg ttgaagctcc attggttggt 2340
acaccagttt gtattaacgg gcttatgttg ctcgaaatca aagacacaga aaagtactgt 2400
gcccttgcac ctaatatgat ggtaacaaac aataccttca cactcaaagg cggtgcacca 2460
acaaaggtta cttttggtga tgacactgtg atagaagtgc aaggttacaa gagtgtgaat 2520
atcacttttg aacttgatga aaggattgat aaagtactta atgagaagtg ctctgcctat 2580
acagttgaac tcggtacaga agtaaatgag ttcgcctgtg ttgtggcaga tgctgtcata 2640
aaaactttgc aaccagtatc tgaattactt acaccactgg gcattgattt agatgagtgg 2700
agtatggcta catactactt atttgatgag tctggtgagt ttaaattggc ttcacatatg 2760
tattgttctt tctaccctcc agatgaggat gaagaagaag gtgattgtga agaagaagag 2820
tttgagccat caactcaata tgagtatggt actgaagatg attaccaagg taaacctttg 2880
gaatttggtg ccacttctgc tgctttacaa cctgaagaag aacaagaaga agattggtta 2940
gatgatgata gtcaacaaac tgttggtcaa caagacggca gtgaggacaa tcagacaact 3000
actattcaaa caattgttga ggttcaacct caattagaga tggaacttac accagttgtt 3060
cagactattg aagtgaatag ttttagtggt tatcttaaac ttactgacaa tgtatacatc 3120
aagaatgcag acattgtgga agaagctaaa aaggtaaaac caacagtggt tgttaatgca 3180
gccaatgttt accttaaaca tggaggaggt gttgcaggag ccttaaataa ggctactaac 3240
aatgccatgc aagttgaatc tgatgattac atagctacta atggaccact taaagtgggt 3300
ggtagttgtg ttttaagcgg acacaatctt gctaaacact gtttacatgt tgtcggccca 3360
aatgttaaca aaggtgaaga tattcaactt cttaagagtg cttatgaaaa ttttaaccag 3420
cacgaagttc tacttgcacc attattatca gctggtattt ttggtgctga ccctatacat 3480
tctttaagag tttgtgtaga tactgttcgc acaaatgtct acttagctgt ctttgataaa 3540
aatctctatg acaaacttgt ttcaagcttt ttggaaatga agagtgaaaa gcaagttgaa 3600
caaaagatcg ctgagattcc taaagaggaa gttaagccat ttataactga aagtaaacct 3660
tcagttgaac agagaaaaca agatgataag aagatcaaag cttgtgttga agaagttaca 3720
acaactctgg aagaaactaa gttcctcaca gaaaacttgc tcctttatat cgacattaat 3780
ggcaatcttc atccagattc tgccactctt gttagtgaca ttgacatcac tttcttaaag 3840
aaagatgctc catatatagt gggtgatgtt gttcaagagg gtgttttaac tgctgtggtt 3900
atacctacta aaaaggctgg tggcactact gaaatgctag cgaaagcttt gagaaaagtg 3960
ccaacagaca attatataac cacttacccg ggtcagggtt taaatggtta cactgtagag 4020
gaggcaaaga cagtgcttaa aaagtgtaaa agtgcctttt acattctacc atctattatc 4080
tctaatgaga agcaagaaat tcttggaact gtttcttgga atttgcgaga aatgcttgca 4140
catgcagaag aaacacgcaa attaatgcct gtctgtgtgg aaactaaagc catagtttca 4200
actatacagc gtaaatataa gggtatcaag atacaagagg gtgtggttga ttatggtgct 4260
agattttact tttacaccag taaaacaact gtagcgtcac ttatcaacac acttaacgat 4320
ctaaatgaaa ctcttgttac aatgccactt ggctatgtaa cacatggctt aaatttggaa 4380
gaagctgctc ggtatatgag atctctcaaa gtgccagcta cagtttctgt ttcttcacct 4440
gatgctgtta cagcgtataa tggttatctt acttcttctt ctaaaacacc tgaagaacat 4500
tttattgaaa ccatctcact tgctggttcc tataaagatt ggtcctattc tggacaatct 4560
acacaactag gtatagaatt tcttaagaga ggtgataaaa gtgtatatta cacgtccaat 4620
cctaccacat tccacctaga tggtgaagtt atcacctttg acaatcttaa gacacttctt 4680
tctttgagag aagtgaggac tattaaggtg tttacaacag tagacaacat taacctccac 4740
acgcaagttg tggacatgtc aatgacatat ggacaacagt ttggtccaac ttatttggat 4800
ggagctgatg ttactaagat aaaacctcat aactcacatg aaggtaaaac attttacgtt 4860
ttgcctaatg atgacactct acgtgttgag gcttttgagt actaccacac aactgatcct 4920
agttttctgg gtaggtacat gtcagcatta aatcacacta aaaagtggaa atacccacaa 4980
gttaatggtt taacttcgat taaatgggca gataacaact gttatcttgc cactgcattg 5040
ttaacactcc aacaaataga gttgaagttt aatccacctg ctctacaaga tgcttattac 5100
agagcaaggg ctggtgaagc tgctaacttt tgtgcactta tcttagccta ctgtaataag 5160
acagtaggtg agttaggtga tgttagagaa acaatgagtt acttgtttca acatgccaat 5220
ttagattctt gcaaaagagt cttgaacgtg gtgtgtaaaa cttgtggaca acagcagaca 5280
acccttaagg gtgtagaagc tgttatgtac atgggcacac tttcttatga acaattcaag 5340
aaaggtgttc agataccttg tacgtgtggt aaacaagcta caaaatatct agtacaacag 5400
gagtcacctt ttgttatgat gtcagcacca cctgctcagt atgaacttaa gcatggtaca 5460
tttacttgtg ctagtgagta cactggtaat taccagtgtg gtcactataa gcatataact 5520
tctaaggaaa ctttgtattg catagacggt gctttactta caaagtcctc agaatacaaa 5580
ggtcctatta cggatgtttt ctacaaagaa aacagttaca caacaaccat aaaaccagtt 5640
acttataagt tggatggtgt tgtttgtaca gaaattgacc ctaagttgga caattattat 5700
aagaaggaca actcttattt cacagagcaa ccaattgatc ttgtaccaaa ccaaccatat 5760
ccaaacgcaa gcttcgataa ttttaagttc gtatgcgata atatcaaatt tgctgatgat 5820
ctcaaccagt taactggtta taagaaacct gcttcaagag agcttaaagt tacatttttc 5880
cctgacttaa atggtgatgt ggtggctatt gattataaac actacacacc ctcttttaag 5940
aaaggagcta aattgttaca taagcctatt gtttggcatg ttaacaatgc aactaataaa 6000
gccacgtata aaccaaatac ctggtgtata cgttgtcttt ggagcacaaa accagttgaa 6060
acatcaaatt cgtttgatgt actgaagtca gaggacgcgc agggaatgga taatcttgca 6120
tgtgaagatc taaaaccagt ctctgaagaa gtagtggaaa atcctaccat acagaaagac 6180
gttcttgagt gtaatgtgaa aactaccgaa gttgtaggag acattatact taaaccagca 6240
aataatagtt tgaagatcac agaagaggtt ggccacacag atctaatggc tgcttatgta 6300
gacaattcta gtcttactat taagaaacct aatgaactct ctagagtatt aggtttgaaa 6360
acccttgcta ctcatggttt agctgctgtt aatagtgtcc cttgggatac tatagctaat 6420
tatgctaagc cttttcttaa caaagttgtt agtacaacta ctaacatagt tacacggtgt 6480
cttaatcgtg tttgtactaa ttatatgcct tacttcttta ctttattgct acaattgtgt 6540
acttttacta gaagtacaaa ttctagaatc aaggcatcta tgccgactac tatagcaaag 6600
aatactgtta agagtgtcgg taaattttgt ctagaggctt catttaatta tctcaagtca 6660
cctaactttt ctaagctgat aaacattatc atctggtttt tgctattaag tgtttgccta 6720
ggttctttaa tctactcaac cgctgcttta ggtgttttaa tgtctaattt aggcatgcct 6780
tcttactgta ctggttacag agaaggctat ttgaactcta ctaatgtcac tattgcaacc 6840
tactgtactg gatctatacc ttgtagtgtt tgtcttagtg gtttagattc tttagacacc 6900
tatccttctc ttgaaactat acagattacc atttcatctt tcaaatggga tttaactgct 6960
tttggcttag ttgcagagtg gtttttggca tatattcttt tcactaggtt tttctatgta 7020
cttggattgg ctgcaatcat gcaattgttt ttcagctatt ttgcagtcca ttttattagt 7080
aactcttggc ttatgtggct tataattaat cttgtgcaga tggccccgat ttcagctatg 7140
gttagaatgt acatcttctt tgcctcattt tattatgtgt ggaaaagtta tgtgcatgtt 7200
gtagacggtt gtaattcatc aacttgtatg atgtgttaca aacgtaatag agcaacaaga 7260
gtcgaatgta caactattgt taatggtgtt agaaggtcct tttatgtcta tgctaatgga 7320
ggtaaaggct tttgcaaact acacaattgg aattgtgtta attgtgatac attctgtgct 7380
ggtagtacat ttattagtga tgaagttgcg agagacttgt cactacagtt taaaagacca 7440
ataaatccta ctgaccaatc ttcttacatc gttgatagtg ttacagtgaa gaatggttcc 7500
atccatcttt actttgataa agctggtcaa aagacttatg aaagacattc tctctctcat 7560
tttgttaact tagacaacct gagagctaat aacactaaag gttcattgcc tattaatgtt 7620
atcgttttcg acggtaaatc aaaatgtgaa gaatcatctg caaaatcagc gtctgtttac 7680
tacagtcagc ttatgtgtca acctatactg ttactagatc aggcattagt gtctgatgtt 7740
ggtgatagtg cggaagttgc agttaaaatg tttgatgctt acgttaatac gttttcatca 7800
acttttaacg taccaatgga aaaactcaaa acactagttg caactgcaga agctgaactt 7860
gcaaagaatg tgtccttaga caatgtctta tctacgttta tttcagcagc tcggcaaggg 7920
tttgttgatt cagatgtaga aactaaagat gttgttgaat gtcttaaatt gtcacatcaa 7980
tctgacatag aagttactgg cgatagttgt aataactata tgctcaccta taacaaagtt 8040
gaaaacatga caccccgtga ccttggtgct tgtattgact gtagtgctag acatattaat 8100
gcgcaggtag caaaaagtca caacattgct ttgatatgga acgttaaaga tttcatgtca 8160
ttgtctgaac aactacgaaa acaaatacgt agtgctgcta aaaagaataa cttacccttc 8220
aagttgacat gtgcaactac tagacaagtt gttaatgttg taacaacaaa gatagcactt 8280
aagggtggta aaattgtgaa taactggttg aagcagctta ttaaagttac acttgtgttc 8340
ctttttgttg ctgctatttt ctatctgata acacctgttc atgtcatgtc taaacatact 8400
gacttttcaa gtgaaatcat aggatacaag gctattgatg gtggtgtcac tcgtgacata 8460
gcatctacag atacttgttt tgctaacaaa catgctgatt ttgacacatg gtttagccag 8520
cgtggtggta gttatactaa tgacaaagct tgcccattga ttgctgcagt cataacaaga 8580
gaagtgggtt ttgtcgttcc tggtttgcct ggaacgatat tacgcacaac taatggtgac 8640
tttttgcatt tcttacctag agtttttagt gcagttggta acatctgtta cacaccatca 8700
aaacttatag agtacactga ctttgcaaca tcagcttgtg ttttggctgc tgaatgtaca 8760
atttttaaag acgcttctgg taagccagta ccatattgtt atgataccaa tgtactagaa 8820
ggttctgttg cttatgaaag tttacgccct gacacacgtt atgtgctcat ggatggctct 8880
attattcaat ttcctaacac ctaccttgaa ggttctgtaa gagtggtaac aacttttgat 8940
tctgagtact gtaggcacgg cacttgtgaa agatcagaag ctggtgtttg tgtatctact 9000
agtggtagat gggtacttaa caacgattat tacagatctt taccaggagt tttctgtggt 9060
gtagatgctg taaatttgct tactaacatg tttacaccac taattcaacc tattggtgct 9120
ttggacatat cagcatctat agtagctggt ggtattgtag ctatcgtagt aacatgcctt 9180
gcctactatt ttatgaggtt tagacgtgct tttggtgaat acagtcatgt agttgccttt 9240
aatactctcc tattccttat gtcattcact gtactctgtt taacaccagt ttactcattc 9300
ttacctggtg tttattctgt tatttacctg tacttgacat tttatctgac taatgatgtt 9360
tcttttctcg cacatattca gtggatggtt atgttcacac ctttagtacc tttctggata 9420
acaattgctt acatcatttg tatttccaca aagcatttct attggttctt tagtaattac 9480
ctaaagagac gtgtagtctt taatggtgtt tcctttagta cttttgaaga agctgcgctg 9540
tgcacctttt tgttaaataa ggagatgtat ctaaagttgc gtagtgatgt gctattacct 9600
cttacgcaat ataatagata cttagctctt tataacaagt acaagtattt cagtggagca 9660
atggatacaa ctagctacag agaagctgct tgttgtcatc tcgcaaaggc tctcaatgac 9720
ttcagtaact caggttctga tgttctttac caaccaccac aaacctctat cacctcagct 9780
gttttgcaga gtggttttag aaaaatggca ttcccatctg gtaaagttga gggttgtatg 9840
gtacaagtaa cttgtggtac aactacactt aacggtcttt ggcttgatga cgtagtttac 9900
tgtccaagac atgtgatctg cacctctgaa gatatgctta accctaatta tgaagatcta 9960
ctcatccgta agtctaatca taacttcttg gtacaggctg gtaatgttca actcagggtt 10020
attggacatt ctatgcaaaa ttgtgtactt aagcttaagg ttgatacagc caatcctaag 10080
acacctaagt ataagtttgt tcgcattcaa ccaggacaga ctttttcagt gttagcttgt 10140
tacaatggtt caccatctgg tgtttaccaa tgtgctatga ggcccaattt cactattaag 10200
ggttcattcc ttaatggttc atgtggtagt gttggtttta acatagatta tgactgtgtc 10260
tctttttgtt acatgcacca tatggaatta ccaactggag ttcatgctgg cacagactta 10320
gaaggtaact tttatggacc ttttgttgac aggcaaacag cacaagcagc tggtacagat 10380
acaactatta cagttaatgt tcttgcttgg ttgtacgctg ctgttataaa tggagacagg 10440
tggtttctca atcgatttac cacaactctt aatgacttta accttgtggc tatgaagtac 10500
aattatgaac ctctaacaca agaccatgtt gacatactag gacctctttc tgctcaaact 10560
ggaattgccg ttttagatat gtgtgcttca ttaaaagaac ttctgcaaaa tggtatgaat 10620
ggacgtacca tattgggtag tgctttatta gaagatgagt ttacaccttt tgatgttgtt 10680
agacaatgct caggtgttac tttccaaagt gcagtgaaaa gaacaatcaa gggtacacac 10740
cactggttgt tactcacaat tttgacttca cttttagttt tagtccagag tactcaatgg 10800
tctttgttct ttttcttcta cgaaaatgcc tttttacctt ttgctatggg tattattgct 10860
atgtctgctt ttgcaatgat gtttgtcaaa cataagcatg catttctctg tttgtttttg 10920
ttaccttctc ttgccactgt agcttacttt aatatggtct acatgcctgc tagttgggtg 10980
atgcgtatta tgacatggtt ggatatggtt gatactagtt tgtctggttt taagctaaaa 11040
gactgtgtta tgtatgcatc agctgtagtg ttactaatcc ttatgacagc aagaactgtg 11100
tatgatgatg gtgctaggag agtgtggaca cttatgaatg tcttgacact cgtttataaa 11160
gtttactatg gcaacgcttt agatcaagcc atttccatgt gggctcttat aatctctgtt 11220
acttctaact actcaggtgt agttacaact gtcatgtttt tggccagagg tattgttttt 11280
atgtgtgttg agtattgccc tattttcttc ataactggta atacacttca gtgtataatg 11340
ctagtctatt gtttcttagg ctatttttgt acttgttact tcggcctctt ttgtttactc 11400
aaccgctact ttagactgac tcttggtgtt tatgattact tagtgtctac acaggagttt 11460
agatatatga attcacaggg actactccca cccaagaata gcatagatgc cttcaaactc 11520
aacattaaat tgttgggtgt tggtggcaaa ccttgtatca aagtagccac tgtacagtct 11580
aaaatgtcag atgtaaagtg cacatcagta gtcttactct cagttttgca acaactcaga 11640
gtagaatcat catctaaatt gtgggctcaa tgtgtccagt tacacaatga cattctctta 11700
gctaaagata ctactgaagc ctttgaaaaa atggtttcac tactttctgt tttgctttcc 11760
atgcagggtg ctgtagacat aaacaagctt tgtgaagaaa tgctggacaa cagggcaacc 11820
ttacaagcta tagcctcaga gtttagttcc cttccatcat atgcagcttt tgctactgct 11880
caagaagctt atgagcaggc tgttgctaat ggtgattctg aagttgttct taaaaagttg 11940
aagaagtctt tgaatgtggc taaatctgaa tttgaccgtg atgcagccat gcaacgtaag 12000
ttggaaaaga tggctgatca agctatgacc caaatgtata aacaggctag atctgaggac 12060
aagagggcaa aagttactag tgctatgcag acaatgcttt tcactatgct tagaaagttg 12120
gataatgatg cactcaacaa cattatcaac aatgcaagag atggttgtgt tcccttgaac 12180
ataatacctc ttacaacagc agccaaacta atggttgtca taccagacta caacacatat 12240
aagaatacgt gtgatggtac aacatttact tatgcatcag cattgtggga aatccaacag 12300
gttgtagatg cagatagtaa aattgttcag cttagtgaaa ttagtatgga caattcacct 12360
aatttagcat ggcctcttat tgtaacagct ttaagggcca attctgctgt caaattacag 12420
aataatgagc ttagtcctgt tgcactaaga caaatgtctt gtgctgccgg tactacacaa 12480
actgcttgca ctgatgacaa tgcgttagct tactacaaca caacaaaggg aggtaggttt 12540
gtacttgcac tgttatccga tttacaggat ttgaaatggg ctagattccc taagagtgat 12600
ggaactggta ctatctatac agaactggaa ccaccttgta ggtttgttac agacacacct 12660
aaaggtccta aagtgaagta tctttacttc atcaaaggat taaacaacct aaatagaggt 12720
atggtacttg gtagtttagc tgccacagta cgtttacaag ctggtaatgc aacagaagtt 12780
cctgctaatt caactgtact ttctttctgt gcttttgctg tagatgctgc taaagcttac 12840
aaagattatc tagctagtgg gggacaacca atcactaatt gtgttaagat gttgtgtaca 12900
cacactggta ctggtcaggc aataacagtt acaccggaag ccaatatgga tcaagaatcc 12960
tttggtggtg catcgtgttg tctgtactgc cgttgtcata tagatcatcc aaatcctaaa 13020
ggattttgtg acttaaaagg taagtatgta caaataccta caacttgtgc taatgaccct 13080
gtgggtttta cacttaaaaa cacagtctgt accgtctgcg gtatgtggaa aggttatggt 13140
tgtagttgtg atcaactccg cgaacccatg cttcagtcag ctgatgcaca atcgttttta 13200
aac 13203
<210> 59
<211> 8088
<212> DNA
<213> Artificial Sequence
<220>
<223> SARS-CoV-2 orf1b
<400> 59
cgggtttgcg gtgtaagtgc agcccgtctt acaccgtgcg gcacaggcac tagtactgat 60
gtcgtatata gagcttttga catctacaat gataaagtag ctggttttgc taagttccta 120
aaaactaatt gttgtcgctt ccaagaaaag gacgaagatg acaatctcat tgattcttac 180
tttgtagtta agagacacac tttctctaac taccaacatg aagaaacaat ttacaacctg 240
cttaaggatt gtccagctgt tgctaaacat gacttcttta agtttagaat agacggtgac 300
atggtaccac atatatcacg tcaacgtctt actaaataca caatggcaga cctcgtctat 360
gctttaaggc attttgatga aggtaattgt gacacattaa aagaaatact tgtcacatac 420
aattgttgtg atgatgacta cttcaataaa aaggactggt atgattttgt agaaaaccca 480
gatatattac gcgtatacgc caacttaggt gaacgtgtac gccaagcttt gttaaaaaca 540
gtacagttct gtgatgccat gcgaaatgct ggtattgttg gtgtactgac attagataat 600
caagatctca atggtaactg gtatgacttt ggtgatttca tacaaaccac gccaggtagt 660
ggagttcctg ttgtagactc ttattattca ttgctcatgc ctatattaac cttgaccagg 720
gctttaactg cagagtcaca tgttgacact gacttaacaa agccttacat taagtgggat 780
ttgttaaaat acgacttcac ggaagagagg ttaaaactct ttgaccgtta ttttaaatac 840
tgggatcaga cataccaccc aaattgtgtt aactgtttgg atgacagatg cattctgcat 900
tgtgcaaact ttaatgttct gttctctaca gtgttcccac ctacaagttt tggaccacta 960
gtgagaaaaa tatttgttga tggtgttcca tttgtagttt caactggata ccacttcaga 1020
gagctaggtg ttgtacataa tcaggatgta aacttacata gctctagact tagttttaag 1080
gaattacttg tgtatgctgc tgatcctgct atgcatgctg cttctggtaa tctattacta 1140
gataaacgca ctacgtgctt ttcagtagct gcacttacta acaatgttgc ttttcaaact 1200
gtcaaacccg gtaattttaa caaggacttc tatgactttg ctgtgtctaa gggtttcttt 1260
aaggaaggaa gttctgttga attaaaacac ttcttctttg ctcaggatgg taatgctgct 1320
atcagcgatt atgactacta tcgttataat ctaccaacaa tgtgtgatat cagacaacta 1380
ctatttgtag ttgaagttgt tgataagtac tttgattgtt acgatggtgg ctgtattaat 1440
gctaaccaag tcatcgtcaa caacctagac aaatcagctg gttttccatt taataaatgg 1500
ggtaaggcta gactttatta tgattccatg agttatgagg atcaagatgc acttttcgca 1560
tatacaaaac gtaatgtcat ccctactata actcaaatga accttaagta tgccattagt 1620
gcaaagaata gagctcgcac cgtagctggt gtctctatct gtagtactat gaccaataga 1680
cagtttcatc aaaaattact caagtcaata gccgccacta gaggagctac tgtagtaatt 1740
ggaacaagca aattctatgg tggttggcac aacatgctca aaactgttta tagtgatgta 1800
gaaaaccctc accttatggg ttgggattat cctaaatgtg atagagccat gcctaacatg 1860
cttagaatta tggcctcact tgttcttgct cgcaaacata caacgtgttg tagcttgtca 1920
caccgtttct atagattagc taatgagtgt gctcaagtat tgagtgaaat ggtcatgtgt 1980
ggcggttcac tatatgttaa accaggtgga acctcatcag gagatgccac aactgcttat 2040
gctaatagtg tgtttaacat ttgtcaagct gtcacggcca atgttaatgc acttttatct 2100
actgatggta acaaaattgc cgataagtat gtccgcaatt tacaacacag actttatgag 2160
tgtctctata gaaatagaga tgttgacaca gactttgtga atgagtttta cgcatatttg 2220
cgtaaacatt tctcaatgat gatactctct gacgatgctg ttgtgtgttt caatagcact 2280
tatgcatctc aaggtctagt ggctagcata aagaacttta agtcagttct ttactatcaa 2340
aacaacgttt ttatgtctga agcaaaatgt tggactgaga ctgaccttac taaaggacct 2400
catgaatttt gctctcaaca tacaatgcta gttaaacagg gtgatgatta tgtgtacctt 2460
ccttacccag atccatcaag aatcctaggt gccggttgtt ttgtagatga tatcgtaaaa 2520
acagatggta cacttatgat tgaacggttc gtgtctttag ctatagatgc ttacccactt 2580
actaaacatc ctaatcagga gtatgctgat gtctttcatt tgtacttaca atacatacgt 2640
aagctacatg atgagttaac aggacacatg ttagacatgt attctgttat gcttactaat 2700
gataacactt caaggtattg ggaacctgag ttttatgagg ctatgtacac accgcataca 2760
gtcttacaag ctgttggtgc ttgtgttctt tgcaattcac agacttcatt aagatgtggt 2820
gcttgcatac gtagaccatt cttatgttgt aaatgctgtt acgaccatgt catctcaaca 2880
tcacataaat tagtcttgtc tgttaatccg tatgtttgca atgctccagg ttgtgatgtc 2940
acagatgtga ctcaacttta cttaggaggt atgagctatt actgtaagtc acataaacca 3000
cccattagtt ttccattgtg tgctaatgga caagtttttg gtctctacaa gaatacatgt 3060
gttggtagcg ataatgttac tgactttaat gcaattgcaa catgtgactg gacaaatgct 3120
ggtgattaca ttttagctaa cacctgtact gaaagactca agctttttgc agcagaaacg 3180
ctcaaagcta ctgaggagac atttaaactg tcttatggta ttgctactgt acgtgaagtg 3240
ctgtctgaca gagaattaca tctttcatgg gaagttggta aacctagacc accacttaac 3300
cgaaattatg tctttactgg ttatcgtgta actaaaaaca gtaaagtgca aatcggagag 3360
tacacctttg aaaaaggtga ctatggtgat gctgttgttt accgaggtac aacaacttac 3420
aaactcaacg ttggtgatta ttttgtgctg acatcacata cagtaatgcc attaagtgca 3480
cctacactag tgccacaaga gcactatgtt agaattactg gcttataccc aacactcaat 3540
atctcagatg agttttctag caatgttgca aattatcaaa aggttggtat gcaaaagtat 3600
tctacactcc agggaccacc tggtactggt aaaagtcatt ttgctattgg tctagctctc 3660
tactaccctt ctgctcgcat agtatataca gcttgctctc atgcagctgt tgatgcacta 3720
tgtgagaagg cattaaaata tttgcccata gacaaatgta gtagaattat acctgcacgt 3780
gctcgtgtag agtgttttga taaattcaag gtgaattcaa cattagaaca gtatgtcttt 3840
tgtactgtaa atgcattgcc tgagacgaca gcagatatag ttgtctttga tgaaatttca 3900
atggccacaa attatgattt gagtgttgtc aatgccagat tacgtgctaa gcactatgtg 3960
tacattggtg atcctgctca attacctgca ccacgcacat tactaactaa gggtacacta 4020
gaaccagaat atttcaattc agtgtgtaga cttatgaaaa ctataggtcc agacatgttc 4080
ctcggaactt gtcgtagatg tcctgctgaa attgttgaca ctgtgagtgc tttggtttat 4140
gataataagc ttaaggcaca taaagacaaa tcagctcaat gctttaaaat gttctacaag 4200
ggtgttatca cgcatgatgt ttcatctgca attaacaggc cacaaatagg cgtggtaaga 4260
gaattcctta cacgtaaccc tgcttggaga aaagctgtct ttatttcacc ttacaattcc 4320
cagaatgctg tagcctcaaa gattttggga ctaccaactc aaactgttga ttcatcacag 4380
ggctcagaat atgactatgt catattcact caaaccactg aaacagctca ctcttgtaat 4440
gtaaacagat tcaacgttgc tattaccaga gcaaaagtag gcatactttg cataatgtct 4500
gatagagacc tttatgacaa gttgcaattt acaagtcttg aaattccacg taggaatgtg 4560
gcaactttac aagctgaaaa tgtaacagga ctctttaaag attgtagtaa ggtaatcact 4620
gggttacatc ctacacaggc acctacacac ttaagtgttg atactaaatt caaaactgaa 4680
ggtttatgtg ttgacatacc tggcatacct aaggacatga cctatagaag attaatctct 4740
atgatgggtt tcaaaatgaa ttaccaggtt aatggttacc ctaacatgtt tatcacccgc 4800
gaagaagcta taagacatgt acgtgcatgg attggcttcg atgtcgaagg ttgtcatgct 4860
actagagaag ctgttggtac caatttacct ttacagctag gtttttctac aggtgttaac 4920
ctagttgctg tacctacagg ttatgttgat acacctaata atacagattt ttccagagtt 4980
agtgctaaac caccgcctgg agatcaattt aaacacctca taccacttat gtacaaagga 5040
cttccttgga atgtagtgcg tataaagatt gtccaaatgt taagtgacac acttaaaaat 5100
ctctctgaca gagtcgtatt tgtcttatgg gcacatggct ttgagttgac atctatgaag 5160
tattttgtga agatcggacc tgagcgcaca tgttgtctat gtgatagacg tgctacatgc 5220
ttttccactg cttcagacac ttatgcctgt tggcatcatt ctattggatt tgattacgtc 5280
tataatccgt ttatgattga tgttcaacaa tggggtttta caggtaacct acaaagcaac 5340
catgatctgt attgtcaagt ccatggtaat gcacatgtag ctagttgtga tgcaatcatg 5400
actaggtgtc tagctgtcca cgagtgcttt gttaagcgtg ttgactggac tattgaatat 5460
cctataatcg gtgatgaact gaagattaat gcggcttgta gaaaggttca acacatggtt 5520
gttaaagctg cattattagc agacaaattc ccagttcttc acgacattgg taaccctaaa 5580
gctattaagt gtgtacctca agctgatgta gaatggaagt tctatgatgc acagccttgt 5640
agtgacaaag cttacaaaat agaagaactg ttctattctt atgccacaca ttctgacaaa 5700
ttcacagatg gtgtatgcct attttggaat tgcaatgtcg atagatatcc tgctaattcc 5760
attgtttgta gatttgacac tagagtgcta tctaacctta acttgcctgg ttgtgatggt 5820
ggcagtttgt atgtaaataa gcatgcattc cacacaccag cttttgataa aagtgctttt 5880
gttaatctaa agcaacttcc atttttctat tactctgaca gtccatgtga gtctcatgga 5940
aaacaagtag tgtcagatat agattatgta ccactaaagt ctgctacgtg tataacacgt 6000
tgcaatttag gtggtgctgt ctgtagacat catgctaatg agtacagatt gtatctcgat 6060
gcttataaca tgatgatctc agctggcttt agcttgtggg tttacaaaca atttgatacc 6120
tataacctct ggaacacttt tacaagactt cagagtttag aaaatgtggc ttttaatgtt 6180
gtaaataagg gacactttga tggacaacag ggtgaagtac cagtttctat cattaacaac 6240
actgtttaca caaaagttga tggtgttgat gtagaattgt ttgagaacaa aaccacatta 6300
cctgttaatg tagcatttga gctttgggct aagcgcaaca ttaaaccagt accagaggtg 6360
aaaatactca ataatttggg tgtggacatt gctgctaata ctgtgatctg ggactacaaa 6420
agagatgctc cagcacatat atctactatt ggtgtttgtt ctatgactga catagccaag 6480
aaaccaactg aaacgatttg tgcaccactc actgtctttt ttgatggtag agttgatggt 6540
caagtagact tatttagaaa tgcccgtaat ggtgttctta ttacagaagg tagtgttaaa 6600
ggtttacaac catctgtagg tcccaaacaa gctagtctta atggagtcac attaattgga 6660
gaagccgtaa aaacacagtt caattattac aagaaagtgg atggtgttgt ccaacaatta 6720
cctgaaactt actttactca gagtagaaac ttacaggaat ttaagcccag gagtcaaatg 6780
gaaattgatt tcttagaact tgctatggat gaattcattg aacggtataa attagaaggc 6840
tatgccttcg aacatatcgt ttatggagat tttagtcata gtcagttagg tggtttacat 6900
ctactgattg gactagctaa acgttttaag gaatcacctt ttgaacttga agattttatt 6960
cctatggaca gtacagttaa aaactacttc ataacagatg cgcaaacagg ttcatctaag 7020
tgtgtgtgtt ctgttattga tcttttactt gatgacttcg ttgaaataat aaagtcccaa 7080
gatttatctg tagtttctaa ggttgtcaaa gtgactattg actatacaga aatctcattt 7140
atgctttggt gtaaagatgg ccatgtagaa acattttacc caaaattaca atctagtcaa 7200
gcgtggcaac cgggtgttgc tatgcctaat ctttacaaaa tgcaaagaat gctattagaa 7260
aagtgtgacc ttcaaaatta tggtgatagt gcaacattac ctaaaggcat aatgatgaat 7320
gtcgcaaaat atactcaact gtgtcaatat ttaaacacac tgacattagc tgtaccctat 7380
aatatgagag ttatccattt tggtgctggt tctgataaag gagttgcacc aggtacagct 7440
gttttaagac aatggttgcc tacaggtacg ctgcttgtcg attcagatct taatgacttt 7500
gtctctgatg cagattcaac tttgattggt gattgtgcaa ctgtacatac agctaataaa 7560
tgggatctca ttattagtga tatgtacgac cctaagacta agaatgtcac aaaagaaaac 7620
gactctaaag agggtttttt cacttacatt tgtgggttta tacaacaaaa gctagctctt 7680
ggaggttccg tggctataaa gataacagaa cattcttgga atgctgatct ttataagctc 7740
atgggacact tcgcatggtg gacagccttt gttactaatg tgaatgcgtc atcatctgaa 7800
gcatttttaa tcggatgtaa ctaccttggc aaaccacgcg aacaaataga tggttatgtc 7860
atgcatgcaa attacatatt ttggaggaat acaaatccaa ttcagctttc ttcttattct 7920
ttattcgaca tgagtaaatt cccccttaaa ttaaggggta ctgctgttat gtctttaaaa 7980
gaaggtcaaa tcaatgatat gattctctct cttcttagta aaggtagact tataattaga 8040
gaaaacaaca gagttgttat ttctagtgat gttcttgtta acaactaa 8088
Claims (21)
- 적어도 4,000개의 염기를 갖는 완전 합성 장쇄 핵산으로서,
임의의 배열로 4개의 서열 부분 A-D 중 적어도 2개를 포함하거나, 또는
서열 부분 A-D에 따른 데옥시리보핵산 서열에 상응하는 리보핵산 서열을 포함하는 것을 특징으로 하는 핵산:
여기서,
i) 서열 부분 A는
a) 서열 번호 50에 정의된 서열 또는 서열 번호 50에 정의된 서열과 적어도 98.5% 서열 동일성을 갖는 서열; 또는
b) 서열 번호 3에 정의된 서열
을 포함하고;
ii) 서열 부분 B는
a) 서열 번호 48에 정의된 서열 또는 서열 번호 48에 정의된 서열과 적어도 98.3% 서열 동일성을 갖는 서열; 또는
b) 서열 번호 7에 정의된 서열
을 포함하고;
iii) 서열 부분 C는
a) 서열 번호 49에 정의된 서열 또는 서열 번호 49에 정의된 서열과 적어도 97.2% 서열 동일성을 갖는 서열; 또는
b) 서열 번호 11에 정의된 서열
을 포함하고;
iv) 서열 부분 D는 서열 번호 17에 정의된 서열 또는 서열 번호 17에 정의된 서열과 적어도 98.5% 서열 동일성을 갖는 서열을 포함한다. - 제1항에 있어서, 정의된 서열에서 적어도 8,000개의 염기, 바람직하게는 적어도 20,000개의 염기를 갖는 것을 특징으로 하는 핵산.
- 제1항 또는 제2항에 있어서, 하기를 추가로 포함하는 핵산:
a) 1.) 서열 번호 51에 의해 정의된 ORF1ab 서열 또는 서열 번호 51과 적어도 98.5% 서열 동일성을 갖는 서열; 또는
2.) i) 서열 번호 59에 의해 정의된 ORF1b 서열 또는 서열 번호 59와 적어도 98.5% 서열 동일성을 갖는 서열; 및
ii) 서열 번호 58에 의해 정의된 ORF1 서열 또는 서열 번호 58과 적어도 98.6% 서열 동일성을 갖는 서열;
b) 서열 번호 52에 의해 정의된 ORF3a 서열 또는 서열 번호 52와 적어도 99% 서열 동일성을 갖는 서열; 및
c) 서열 번호 54에 의해 정의된 ORF7a 서열 또는 서열 번호 54와 적어도 99.5% 서열 동일성을 갖는 서열. - 제3항에 있어서, 하기를 추가로 포함하는 핵산:
a) 서열 번호 53에 의해 정의된 ORF6 서열 또는 서열 번호 53과 적어도 94.1% 서열 동일성을 갖는 서열; 및/또는
b) 서열 번호 55에 의해 정의된 ORF8 서열 또는 서열 번호 55와 적어도 99% 서열 동일성을 갖는 서열. - 제1항 내지 제4항 중 어느 한 항에 있어서, 서열 부분 A 내지 C가 서열 번호 19에 따른 서열 또는 상응하는 리보핵산 서열에 상응하는 것을 특징으로 하는 핵산.
- 제1항 내지 제5항 중 어느 한 항에 있어서, 핵산이 임의의 배열로 4개의 서열 부분 A-D 중 적어도 3개 또는 서열 부분 A-D에 따른 데옥시리보핵산 서열에 상응하는 리보핵산 서열을 갖는 4개 서열 부분 중 적어도 3개를 포함하는 것을 특징으로 하는 핵산.
- 제1항 내지 제6항 중 어느 한 항에 있어서, 핵산이 임의의 배열로 4개의 서열 부분 A-D 또는 서열 부분 A-D에 따른 데옥시리보핵산 서열에 상응하는 리보핵산 서열을 갖는 4개의 서열 부분을 포함하는 것을 특징으로 하는 핵산.
- 제1항 내지 제7항 중 어느 한 항에 있어서,
서열 번호 15
서열 번호 28
서열 번호 29 및
서열 번호 30
으로 이루어진 적어도 하나의 서열을 추가로 포함하거나,
서열 부분인 서열 번호 15, 서열 번호 28, 서열 번호 29 및 서열 번호 30에 따른 데옥시리보핵산 서열 중 하나 또는 상응하는 리보핵산 서열을 포함하는 것을 특징으로 하는 핵산. - 제1항 내지 제8항 중 어느 한 항에 있어서, 1,000,000개 염기의 최대 크기, 바람직하게는 200,000개 염기의 최대 크기를 갖는 것을 특징으로 하는 핵산.
- 제1항 내지 제9항 중 어느 한 항에 따른 핵산을 포함하는 벡터.
- 제10항에 있어서, 서열 번호 46 및 서열 번호 47에 의해 정의된 서열을 포함하는 벡터.
- 제10항 또는 제11항에 있어서, 플라스미드 벡터인 벡터.
- 제1항 내지 제9항 중 어느 한 항에 따른 2개 이상의 핵산을 포함하는 키트.
- 제13항에 있어서, 핵산이 적어도 하나의 플라스미드, 바람직하게는 2개 이상의 플라스미드에 존재하는 것인 키트.
- 제10항 내지 제12항 중 어느 한 항에 따른 적어도 하나의 벡터를 포함하는 생명공학적 생산 유닛.
- 제1항 내지 제9항 중 어느 한 항에 따른 적어도 하나의 핵산, 제10항 내지 제12항 중 어느 한 항에 따른 벡터, 제13항 또는 제14항에 따른 키트, 또는 제15항에 따른 생명공학적 생산 유닛을 사용하여 유전자 발현에 의해 수득 가능한 바이러스 외피, 바이러스 외피의 단편 및/또는 바이러스 외피 단백질로서, 제1항 내지 제9항 중 어느 한 항에 따른 적어도 하나의 핵산을 패키징하는 바이러스 외피, 바이러스 외피의 단편 및/또는 바이러스 외피 단백질.
- 제1항 내지 제9항 중 어느 한 항에 따른 적어도 하나의 핵산 및 생산 유기체에서 제1항 내지 제9항 중 어느 한 항에 따른 적어도 하나의 핵산, 제10항 내지 제12항 중 어느 한 항에 따른 벡터, 제13항 또는 제14항에 따른 키트를 사용하여 유전자 발현에 의해 수득 가능한 생성물을 포함하고, 특히 제16항에 따른 바이러스 외피, 바이러스 외피의 단편 및/또는 바이러스 외피 단백질을 포함하는 코로나바이러스 SARS-CoV-2에 대한 백신.
- 제17항에 있어서, 단백질 성분 a, b1, b2, c1, c2, d1 또는 d2로 이루어진 군으로부터 선택되는 적어도 2개의 분자적으로 정확하게 정의된 단백질 성분을 포함하고, 여기서,
(i) 단백질 성분은
a) SARS-CoV-2의 S 단백질과 유사한 서열 번호 14에 따른 서열 또는 서열 번호 14와 적어도 90% 서열 동일성을 갖는 서열; 또는
b) SARS-CoV-2의 S 단백질과 유사한 서열 번호 18에 따른 서열 또는 서열 번호 18과 적어도 90% 서열 동일성을 갖는 서열
을 포함하고;
(ii) 단백질 성분 b1은
a) SARS-CoV-2의 외피 단백질 E와 유사한 서열 번호 6에 따른 서열 또는 서열 번호 6과 적어도 90% 서열 동일성을 갖는 서열; 또는
b) SARS-CoV-2의 외피 단백질 E와 유사한 서열 번호 21에 따른 서열 또는 서열 번호 21과 적어도 90% 서열 동일성을 갖는 서열
을 포함하고;
단백질 성분 b2는 MHV59A의 외피 단백질 E 또는 등가 단백질과 유사한 서열 번호 8에 따른 서열 또는 서열 번호 8과 적어도 90% 서열 동일성을 갖는 서열을 포함하고;
(iii) 단백질 성분 c1은
a) SARS-CoV-2의 외피 단백질 M과 유사한 서열 번호 10에 따른 서열 또는 서열 번호 10과 적어도 90% 서열 동일성을 갖는 서열; 또는
b) SARS-CoV-2의 막 단백질 M과 유사한 서열 번호 22에 따른 서열 또는 서열 번호 22와 적어도 90% 서열 동일성을 갖는 서열
을 포함하고;
단백질 성분 c2는 MHV59A의 막 단백질 M 또는 등가 단백질과 유사한 서열 번호 12에 따른 서열 또는 서열 번호 12와 적어도 90% 서열 동일성을 갖는 서열을 포함하고;
(iv) 단백질 성분 d1은
a) SARS-CoV-2의 뉴클레오캡시드 인단백질 N과 유사한 서열 번호 2에 따른 서열 또는 서열 번호 2와 적어도 90% 서열 동일성을 갖는 서열; 또는
b) SARS-CoV-2의 뉴클레오캡시드 인단백질 N과 유사한 서열 번호 26에 따른 서열 또는 서열 번호 26과 적어도 90% 서열 동일성을 갖는 서열
을 포함하고;
단백질 성분 d2는 MHV59A의 뉴클레오캡시드 인단백질 N 또는 등가 단백질과 유사한 서열 번호 4에 따른 서열 또는 서열 번호 4와 적어도 90% 서열 동일성을 갖는 서열을 포함하는 것인 백신. - 하기의 연속 단계를 포함하는 코로나바이러스 SARS-CoV-2에 대한 백신의 생산 방법:
a) 제1항 내지 제9항 중 어느 한 항에 따른 뉴클레오티드 산 서열을 생명공학적 생산 유닛, 특히 세포주에 도입하는 단계로서,
단백질 성분 a, b1, b2, c1, c2, d1 또는 d2로 이루어진 군으로부터 선택된 단백질 성분 중 적어도 2개를 코딩하는 핵산 기반 mRNA는 번역에 의해 제조되는 것인 단계;
b) 단계 a)에서 생명공학적 생산 유닛으로부터 단백질 성분을 수득하는 단계; 및
c) 수득된 단백질 성분을 정제하여 코로나바이러스 SARS-CoV-2에 대한 백신을 수득하는 단계. - 하기의 연속 단계를 포함하는 제16항에 따른 바이러스 외피, 바이러스 외피의 단편 및/또는 바이러스 외피 단백질을 포함하는 코로나바이러스 SARS-CoV-2에 대한 백신의 생산 방법:
a) 제1항 내지 제9항 중 어느 한 항에 따른 뉴클레오티드 산 서열을 생명공학적 생산 유닛에 도입하는 단계로서, 생명공학적 생산 유닛은 단백질 성분 a, b1, c1 및 d1로 이루어진 군으로부터 선택된 단백질 성분 중 적어도 하나를 코딩하는 뉴클레오티드 산을 포함하는 것인 단계;
b) 단계 a)에서 생명공학적 생산 유닛으로부터 바이러스 외피의 단편 및/또는 바이러스 외피 단백질을 수득하는 단계; 및
c) 수득된 단백질 성분을 정제하여 제16항에 따른 바이러스 외피, 바이러스 외피의 단편 및/또는 바이러스 외피 단백질을 포함하는 코로나바이러스 SARS-CoV-2에 대한 백신을 수득하는 단계. - 하기의 연속 단계를 포함하는 코로나바이러스 SARS-CoV-2에 대한 백신의 생산 방법:
a) 제10항 내지 제12항 중 어느 한 항에 따른 벡터를 증폭 생명공학적 생산 유닛에 도입하는 단계;
b) 증폭 생명공학적 생산 유닛에서 제1항 내지 제9항 중 어느 한 항에 따른 뉴클레오티드 산을 증폭하는 단계;
c) 단계 b)에서 증폭된 뉴클레오티드 산을 수득하는 단계;
d) 제19항 또는 제20항에 따른 방법을 사용하여 코로나바이러스 SARS-CoV-2에 대한 백신을 수득하는 단계.
Applications Claiming Priority (5)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
EP20020092.1 | 2020-03-03 | ||
EP20020092 | 2020-03-03 | ||
EP20020240 | 2020-05-20 | ||
EP20020240.6 | 2020-05-20 | ||
PCT/EP2021/055401 WO2021175960A1 (en) | 2020-03-03 | 2021-03-03 | Fully synthetic, long-chain nucleic acid for vaccine production to protect against coronaviruses |
Publications (1)
Publication Number | Publication Date |
---|---|
KR20220150323A true KR20220150323A (ko) | 2022-11-10 |
Family
ID=83450836
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
KR1020227033430A KR20220150323A (ko) | 2020-03-03 | 2021-03-03 | 코로나바이러스를 방지하는 백신 생산을 위한 완전 합성 장쇄 핵산 |
Country Status (9)
Country | Link |
---|---|
EP (1) | EP4114452A1 (ko) |
JP (1) | JP2023517540A (ko) |
KR (1) | KR20220150323A (ko) |
CN (1) | CN115768470A (ko) |
AU (1) | AU2021231238A1 (ko) |
BR (1) | BR112022017733A2 (ko) |
CA (1) | CA3170281A1 (ko) |
IL (1) | IL296147A (ko) |
MX (1) | MX2022010928A (ko) |
-
2021
- 2021-03-03 AU AU2021231238A patent/AU2021231238A1/en active Pending
- 2021-03-03 IL IL296147A patent/IL296147A/en unknown
- 2021-03-03 EP EP21709001.8A patent/EP4114452A1/en active Pending
- 2021-03-03 CA CA3170281A patent/CA3170281A1/en active Pending
- 2021-03-03 KR KR1020227033430A patent/KR20220150323A/ko active Search and Examination
- 2021-03-03 CN CN202180032734.6A patent/CN115768470A/zh active Pending
- 2021-03-03 MX MX2022010928A patent/MX2022010928A/es unknown
- 2021-03-03 JP JP2022553056A patent/JP2023517540A/ja active Pending
- 2021-03-03 BR BR112022017733A patent/BR112022017733A2/pt unknown
Also Published As
Publication number | Publication date |
---|---|
EP4114452A1 (en) | 2023-01-11 |
MX2022010928A (es) | 2022-10-27 |
IL296147A (en) | 2022-11-01 |
JP2023517540A (ja) | 2023-04-26 |
AU2021231238A1 (en) | 2022-10-06 |
BR112022017733A2 (pt) | 2022-11-29 |
CA3170281A1 (en) | 2021-09-10 |
CN115768470A (zh) | 2023-03-07 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN111295449B (zh) | 腺病毒载体及其用途 | |
AU2013221187B9 (en) | Virus like particle composition | |
AU2022200903B2 (en) | Engineered Cascade components and Cascade complexes | |
DK2753355T3 (en) | ONCOLYTIC HERP SIMPLEX VIRUSES AND THERAPEUTIC APPLICATIONS THEREOF | |
US20030119104A1 (en) | Chromosome-based platforms | |
KR20220141332A (ko) | 홍역-벡터화된 covid-19 면역원성 조성물 및 백신 | |
KR20160029124A (ko) | Pd-1 항원 또는 pd-1 리간드 항원을 포함하는 바이러스 유사 입자 | |
US20040197910A1 (en) | Gene regulation in transgenic animals using a transposon-based vector | |
KR20180081527A (ko) | 클로스트리듐 박테리아의 형질전환을 위한 유전자 도구 | |
DK2623594T3 (da) | Antistof mod human prostaglandin-E2-receptor EP4 | |
CN113396222A (zh) | 腺相关病毒(aav)生产细胞系和相关方法 | |
KR20210144861A (ko) | 아마이엘로이스로부터의 트랜스포사제를 이용한 핵산 작제물의 진핵세포 게놈으로의 전위 | |
WO2005081716A2 (en) | DNA VACCINES TARGETING ANTIGENS OF THE SEVERE ACUTE RESPIRATORY SYNDROME CORONAVIRUS (SARS-CoV) | |
CN112877292A (zh) | 产生人抗体的细胞 | |
US20240207318A1 (en) | Chimeric costimulatory receptors, chemokine receptors, and the use of same in cellular immunotherapies | |
KR20230031929A (ko) | 고릴라 아데노바이러스 핵산 서열 및 아미노산 서열, 이들을 함유하는 벡터, 및 이의 용도 | |
CN110305902B (zh) | 一种在工具细胞中激活hSyn启动子的方法及其应用 | |
US20210130818A1 (en) | Compositions and Methods for Enhancement of Homology-Directed Repair Mediated Precise Gene Editing by Programming DNA Repair with a Single RNA-Guided Endonuclease | |
KR20240029020A (ko) | Dna 변형을 위한 crispr-트랜스포손 시스템 | |
KR20240022571A (ko) | Rna-가이드된 이펙터 동원을 위한 시스템, 방법 및 성분 | |
KR20220150323A (ko) | 코로나바이러스를 방지하는 백신 생산을 위한 완전 합성 장쇄 핵산 | |
KR20230153437A (ko) | 코로나바이러스를 방지하는 백신 생산을 위한 완전 합성 장쇄 핵산 | |
KR20240021906A (ko) | 발현 벡터, 박테리아 서열-무함유 벡터, 및 이를 제조하고 사용하는 방법 | |
CN117881788A (zh) | 表达载体、无细菌序列载体及其制备和使用方法 | |
KR20230117327A (ko) | 가용성 알칼리성 포스파타제 작제물 및 가용성 알칼리성 포스파타제 작제물을 인코딩하는 폴리뉴클레오티드를 포함하는 발현 벡터 |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
A201 | Request for examination |