KR20070024649A - 신규한 배열을 가진 hcv rna - Google Patents
신규한 배열을 가진 hcv rna Download PDFInfo
- Publication number
- KR20070024649A KR20070024649A KR1020067027313A KR20067027313A KR20070024649A KR 20070024649 A KR20070024649 A KR 20070024649A KR 1020067027313 A KR1020067027313 A KR 1020067027313A KR 20067027313 A KR20067027313 A KR 20067027313A KR 20070024649 A KR20070024649 A KR 20070024649A
- Authority
- KR
- South Korea
- Prior art keywords
- leu
- ala
- gly
- val
- arg
- Prior art date
Links
- 108090000623 proteins and genes Proteins 0.000 claims abstract description 178
- 241000711549 Hepacivirus C Species 0.000 claims abstract description 83
- 101710118188 DNA-binding protein HU-alpha Proteins 0.000 claims abstract description 28
- 101710144128 Non-structural protein 2 Proteins 0.000 claims abstract description 28
- 101710199667 Nuclear export protein Proteins 0.000 claims abstract description 28
- 101710132601 Capsid protein Proteins 0.000 claims abstract description 18
- 238000013519 translation Methods 0.000 claims abstract description 13
- 101710125507 Integrase/recombinase Proteins 0.000 claims abstract description 12
- 101710185720 Putative ethidium bromide resistance protein Proteins 0.000 claims abstract description 11
- 238000000034 method Methods 0.000 claims description 49
- 102000004169 proteins and genes Human genes 0.000 claims description 45
- 238000012217 deletion Methods 0.000 claims description 40
- 230000037430 deletion Effects 0.000 claims description 40
- 239000003814 drug Substances 0.000 claims description 29
- 108010076039 Polyproteins Proteins 0.000 claims description 25
- 229940079593 drug Drugs 0.000 claims description 25
- 239000006260 foam Substances 0.000 claims description 24
- 150000007523 nucleic acids Chemical group 0.000 claims description 22
- 241000700605 Viruses Species 0.000 claims description 16
- 101710172711 Structural protein Proteins 0.000 claims description 15
- 239000002245 particle Substances 0.000 claims description 13
- 230000003362 replicative effect Effects 0.000 claims description 13
- 108091028043 Nucleic acid sequence Proteins 0.000 claims description 12
- 239000003550 marker Substances 0.000 claims description 11
- 239000013598 vector Substances 0.000 claims description 11
- 238000012216 screening Methods 0.000 claims description 8
- 108091036066 Three prime untranslated region Proteins 0.000 claims description 7
- 238000002405 diagnostic procedure Methods 0.000 claims description 3
- 238000011156 evaluation Methods 0.000 claims description 3
- 241000282326 Felis catus Species 0.000 description 91
- 108091032973 (ribonucleotides)n+m Proteins 0.000 description 90
- 238000003752 polymerase chain reaction Methods 0.000 description 86
- 108010050848 glycylleucine Proteins 0.000 description 66
- 108010061238 threonyl-glycine Proteins 0.000 description 64
- 210000004027 cell Anatomy 0.000 description 53
- VPZXBVLAVMBEQI-UHFFFAOYSA-N glycyl-DL-alpha-alanine Natural products OC(=O)C(C)NC(=O)CN VPZXBVLAVMBEQI-UHFFFAOYSA-N 0.000 description 51
- AJHCSUXXECOXOY-UHFFFAOYSA-N N-glycyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)CN)C(O)=O)=CNC2=C1 AJHCSUXXECOXOY-UHFFFAOYSA-N 0.000 description 48
- 239000002299 complementary DNA Substances 0.000 description 47
- XKUKSGPZAADMRA-UHFFFAOYSA-N glycyl-glycyl-glycine Chemical compound NCC(=O)NCC(=O)NCC(O)=O XKUKSGPZAADMRA-UHFFFAOYSA-N 0.000 description 44
- GJHWILMUOANXTG-WPRPVWTQSA-N Gly-Val-Arg Chemical compound [H]NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O GJHWILMUOANXTG-WPRPVWTQSA-N 0.000 description 42
- BNBBNGZZKQUWCD-IUCAKERBSA-N Pro-Arg-Gly Chemical compound NC(N)=NCCC[C@@H](C(=O)NCC(O)=O)NC(=O)[C@@H]1CCCN1 BNBBNGZZKQUWCD-IUCAKERBSA-N 0.000 description 37
- 108020004414 DNA Proteins 0.000 description 36
- IASNWHAGGYTEKX-IUCAKERBSA-N Arg-Arg-Gly Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)NCC(O)=O IASNWHAGGYTEKX-IUCAKERBSA-N 0.000 description 35
- 108010062796 arginyllysine Proteins 0.000 description 35
- 239000012634 fragment Substances 0.000 description 35
- 208000006454 hepatitis Diseases 0.000 description 35
- KZNQNBZMBZJQJO-UHFFFAOYSA-N N-glycyl-L-proline Natural products NCC(=O)N1CCCC1C(O)=O KZNQNBZMBZJQJO-UHFFFAOYSA-N 0.000 description 32
- 108010047495 alanylglycine Proteins 0.000 description 32
- 108010078326 glycyl-glycyl-valine Proteins 0.000 description 32
- ZKEHTYWGPMMGBC-XUXIUFHCSA-N Ala-Leu-Leu-Ser Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O ZKEHTYWGPMMGBC-XUXIUFHCSA-N 0.000 description 31
- 108010064997 VPY tripeptide Proteins 0.000 description 31
- 108010057821 leucylproline Proteins 0.000 description 31
- 210000004185 liver Anatomy 0.000 description 30
- 108010029020 prolylglycine Proteins 0.000 description 30
- 108010079364 N-glycylalanine Proteins 0.000 description 29
- 239000000047 product Substances 0.000 description 29
- XZKJEOMFLDVXJG-KATARQTJSA-N Cys-Leu-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CS)N)O XZKJEOMFLDVXJG-KATARQTJSA-N 0.000 description 27
- FBNPMTNBFFAMMH-UHFFFAOYSA-N Leu-Val-Arg Natural products CC(C)CC(N)C(=O)NC(C(C)C)C(=O)NC(C(O)=O)CCCN=C(N)N FBNPMTNBFFAMMH-UHFFFAOYSA-N 0.000 description 27
- LRQXRHGQEVWGPV-NHCYSSNCSA-N Gly-Leu-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)CN LRQXRHGQEVWGPV-NHCYSSNCSA-N 0.000 description 26
- SBANPBVRHYIMRR-UHFFFAOYSA-N Leu-Ser-Pro Natural products CC(C)CC(N)C(=O)NC(CO)C(=O)N1CCCC1C(O)=O SBANPBVRHYIMRR-UHFFFAOYSA-N 0.000 description 26
- 231100000283 hepatitis Toxicity 0.000 description 26
- 108010017391 lysylvaline Proteins 0.000 description 26
- QPRQGENIBFLVEB-BJDJZHNGSA-N Leu-Ala-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O QPRQGENIBFLVEB-BJDJZHNGSA-N 0.000 description 25
- TZCGZYWNIDZZMR-UHFFFAOYSA-N Ile-Arg-Ala Natural products CCC(C)C(N)C(=O)NC(C(=O)NC(C)C(O)=O)CCCN=C(N)N TZCGZYWNIDZZMR-UHFFFAOYSA-N 0.000 description 24
- NOXKHHXSHQFSGJ-FQPOAREZSA-N Tyr-Ala-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 NOXKHHXSHQFSGJ-FQPOAREZSA-N 0.000 description 23
- 108010093581 aspartyl-proline Proteins 0.000 description 23
- 108010073969 valyllysine Proteins 0.000 description 23
- RXGLHDWAZQECBI-SRVKXCTJSA-N Leu-Leu-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O RXGLHDWAZQECBI-SRVKXCTJSA-N 0.000 description 22
- VRUFCJZQDACGLH-UVOCVTCTSA-N Thr-Leu-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O VRUFCJZQDACGLH-UVOCVTCTSA-N 0.000 description 22
- 238000006243 chemical reaction Methods 0.000 description 22
- JYPCXBJRLBHWME-UHFFFAOYSA-N glycyl-L-prolyl-L-arginine Natural products NCC(=O)N1CCCC1C(=O)NC(CCCN=C(N)N)C(O)=O JYPCXBJRLBHWME-UHFFFAOYSA-N 0.000 description 22
- XBGGUPMXALFZOT-UHFFFAOYSA-N glycyl-L-tyrosine hemihydrate Natural products NCC(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 XBGGUPMXALFZOT-UHFFFAOYSA-N 0.000 description 22
- 108010010147 glycylglutamine Proteins 0.000 description 21
- 108010037850 glycylvaline Proteins 0.000 description 21
- 239000000523 sample Substances 0.000 description 21
- XVZCXCTYGHPNEM-IHRRRGAJSA-N (2s)-1-[(2s)-2-[[(2s)-2-amino-4-methylpentanoyl]amino]-4-methylpentanoyl]pyrrolidine-2-carboxylic acid Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(O)=O XVZCXCTYGHPNEM-IHRRRGAJSA-N 0.000 description 20
- OVVUNXXROOFSIM-SDDRHHMPSA-N Arg-Arg-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CCCN=C(N)N)N)C(=O)O OVVUNXXROOFSIM-SDDRHHMPSA-N 0.000 description 20
- VRTWYUYCJGNFES-CIUDSAMLSA-N Arg-Ser-Gln Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(O)=O VRTWYUYCJGNFES-CIUDSAMLSA-N 0.000 description 20
- AWXDRZJQCVHCIT-DCAQKATOSA-N Asn-Pro-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CC(N)=O AWXDRZJQCVHCIT-DCAQKATOSA-N 0.000 description 20
- IXFVOPOHSRKJNG-LAEOZQHASA-N Gln-Asp-Val Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O IXFVOPOHSRKJNG-LAEOZQHASA-N 0.000 description 20
- FFVXLVGUJBCKRX-UKJIMTQDSA-N Gln-Ile-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)O)NC(=O)[C@H](CCC(=O)N)N FFVXLVGUJBCKRX-UKJIMTQDSA-N 0.000 description 20
- 108010065920 Insulin Lispro Proteins 0.000 description 20
- LYERIXUFCYVFFX-GVXVVHGQSA-N Val-Leu-Glu Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N LYERIXUFCYVFFX-GVXVVHGQSA-N 0.000 description 20
- 108010025801 glycyl-prolyl-arginine Proteins 0.000 description 20
- 108010020755 prolyl-glycyl-glycine Proteins 0.000 description 20
- MUXONAMCEUBVGA-DCAQKATOSA-N Arg-Arg-Gln Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCC(N)=O)C(O)=O MUXONAMCEUBVGA-DCAQKATOSA-N 0.000 description 19
- PBCHMHROGNUXMK-DLOVCJGASA-N Leu-Ala-His Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 PBCHMHROGNUXMK-DLOVCJGASA-N 0.000 description 19
- KLSUAWUZBMAZCL-RHYQMDGZSA-N Leu-Thr-Pro Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@H]1C(O)=O KLSUAWUZBMAZCL-RHYQMDGZSA-N 0.000 description 19
- AAKRWBIIGKPOKQ-ONGXEEELSA-N Leu-Val-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O AAKRWBIIGKPOKQ-ONGXEEELSA-N 0.000 description 19
- VCYJKOLZYPYGJV-AVGNSLFASA-N Pro-Arg-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O VCYJKOLZYPYGJV-AVGNSLFASA-N 0.000 description 19
- QNZLIVROMORQFH-BQBZGAKWSA-N Pro-Gly-Cys Chemical compound C1C[C@H](NC1)C(=O)NCC(=O)N[C@@H](CS)C(=O)O QNZLIVROMORQFH-BQBZGAKWSA-N 0.000 description 19
- JCLAFVNDBJMLBC-JBDRJPRFSA-N Ser-Ser-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JCLAFVNDBJMLBC-JBDRJPRFSA-N 0.000 description 19
- XHWCDRUPDNSDAZ-XKBZYTNZSA-N Thr-Ser-Glu Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N)O XHWCDRUPDNSDAZ-XKBZYTNZSA-N 0.000 description 19
- 108010016616 cysteinylglycine Proteins 0.000 description 19
- 108010049041 glutamylalanine Proteins 0.000 description 19
- 108010084389 glycyltryptophan Proteins 0.000 description 19
- 108010090894 prolylleucine Proteins 0.000 description 19
- MEFILNJXAVSUTO-JXUBOQSCSA-N Ala-Leu-Thr Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MEFILNJXAVSUTO-JXUBOQSCSA-N 0.000 description 18
- JVWPPCWUDRJGAE-YUMQZZPRSA-N Gly-Asn-Leu Chemical compound [H]NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O JVWPPCWUDRJGAE-YUMQZZPRSA-N 0.000 description 18
- AEIIJFBQVGYVEV-YESZJQIVSA-N Lys-Phe-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)NC(=O)[C@H](CCCCN)N)C(=O)O AEIIJFBQVGYVEV-YESZJQIVSA-N 0.000 description 18
- LSIWVWRUTKPXDS-DCAQKATOSA-N Pro-Gln-Arg Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O LSIWVWRUTKPXDS-DCAQKATOSA-N 0.000 description 18
- CPRLKHJUFAXVTD-ULQDDVLXSA-N Pro-Leu-Tyr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O CPRLKHJUFAXVTD-ULQDDVLXSA-N 0.000 description 18
- QUBVFEANYYWBTM-VEVYYDQMSA-N Pro-Thr-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(O)=O QUBVFEANYYWBTM-VEVYYDQMSA-N 0.000 description 18
- FCRMLGJMPXCAHD-FXQIFTODSA-N Ser-Arg-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(O)=O FCRMLGJMPXCAHD-FXQIFTODSA-N 0.000 description 18
- MQUZANJDFOQOBX-SRVKXCTJSA-N Ser-Phe-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O MQUZANJDFOQOBX-SRVKXCTJSA-N 0.000 description 18
- BCAVNDNYOGTQMQ-AAEUAGOBSA-N Ser-Trp-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)NCC(O)=O BCAVNDNYOGTQMQ-AAEUAGOBSA-N 0.000 description 18
- LKUDRJSNRWVGMS-QSFUFRPTSA-N Val-Ile-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N LKUDRJSNRWVGMS-QSFUFRPTSA-N 0.000 description 18
- 238000000338 in vitro Methods 0.000 description 18
- GXCSUJQOECMKPV-CIUDSAMLSA-N Arg-Ala-Gln Chemical compound C[C@H](NC(=O)[C@@H](N)CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(O)=O GXCSUJQOECMKPV-CIUDSAMLSA-N 0.000 description 17
- OLPPXYMMIARYAL-QMMMGPOBSA-N Gly-Gly-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)CNC(=O)CN OLPPXYMMIARYAL-QMMMGPOBSA-N 0.000 description 17
- FOKISINOENBSDM-WLTAIBSBSA-N Gly-Thr-Tyr Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O FOKISINOENBSDM-WLTAIBSBSA-N 0.000 description 17
- APFJUBGRZGMQFF-QWRGUYRKSA-N Leu-Gly-Lys Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCCN APFJUBGRZGMQFF-QWRGUYRKSA-N 0.000 description 17
- DYJTXTCEXMCPBF-UFYCRDLUSA-N Pro-Tyr-Phe Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)N[C@@H](CC3=CC=CC=C3)C(=O)O DYJTXTCEXMCPBF-UFYCRDLUSA-N 0.000 description 17
- 210000004369 blood Anatomy 0.000 description 17
- 239000008280 blood Substances 0.000 description 17
- 108010053725 prolylvaline Proteins 0.000 description 17
- 230000010076 replication Effects 0.000 description 17
- PCIFXPRIFWKWLK-YUMQZZPRSA-N Ala-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@H](C)N PCIFXPRIFWKWLK-YUMQZZPRSA-N 0.000 description 16
- HULHGJZIZXCPLD-FXQIFTODSA-N Arg-Ala-Cys Chemical compound C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N HULHGJZIZXCPLD-FXQIFTODSA-N 0.000 description 16
- NZAFOTBEULLEQB-WDSKDSINSA-N Gly-Asn-Glu Chemical compound C(CC(=O)O)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)CN NZAFOTBEULLEQB-WDSKDSINSA-N 0.000 description 16
- CCBIBMKQNXHNIN-ZETCQYMHSA-N Gly-Leu-Gly Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O CCBIBMKQNXHNIN-ZETCQYMHSA-N 0.000 description 16
- IITVUURPOYGCTD-NAKRPEOUSA-N Ile-Pro-Ala Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O IITVUURPOYGCTD-NAKRPEOUSA-N 0.000 description 16
- AVEGDIAXTDVBJS-XUXIUFHCSA-N Leu-Ile-Arg Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O AVEGDIAXTDVBJS-XUXIUFHCSA-N 0.000 description 16
- YKBSXQFZWFXFIB-VOAKCMCISA-N Lys-Thr-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CCCCN)C(O)=O YKBSXQFZWFXFIB-VOAKCMCISA-N 0.000 description 16
- AUEJLPRZGVVDNU-UHFFFAOYSA-N N-L-tyrosyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CC1=CC=C(O)C=C1 AUEJLPRZGVVDNU-UHFFFAOYSA-N 0.000 description 16
- NRCJWSGXMAPYQX-LPEHRKFASA-N Ser-Arg-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CO)N)C(=O)O NRCJWSGXMAPYQX-LPEHRKFASA-N 0.000 description 16
- CXUFDWZBHKUGKK-CABZTGNLSA-N Trp-Ala-Gly Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O)=CNC2=C1 CXUFDWZBHKUGKK-CABZTGNLSA-N 0.000 description 16
- FRMFMFNMGQGMNB-BVSLBCMMSA-N Tyr-Pro-Trp Chemical compound C([C@H](N)C(=O)N1[C@@H](CCC1)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(O)=O)C1=CC=C(O)C=C1 FRMFMFNMGQGMNB-BVSLBCMMSA-N 0.000 description 16
- 108010045350 alanyl-tyrosyl-alanine Proteins 0.000 description 16
- 238000010804 cDNA synthesis Methods 0.000 description 16
- 208000014018 liver neoplasm Diseases 0.000 description 16
- YJHKTAMKPGFJCT-NRPADANISA-N Ala-Val-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O YJHKTAMKPGFJCT-NRPADANISA-N 0.000 description 15
- QGXQHJQPAPMACW-PPCPHDFISA-N Ile-Thr-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)O)N QGXQHJQPAPMACW-PPCPHDFISA-N 0.000 description 15
- 241000880493 Leptailurus serval Species 0.000 description 15
- MUJQWSAWLLRJCE-KATARQTJSA-N Ser-Leu-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MUJQWSAWLLRJCE-KATARQTJSA-N 0.000 description 15
- COYSIHFOCOMGCF-UHFFFAOYSA-N Val-Arg-Gly Natural products CC(C)C(N)C(=O)NC(C(=O)NCC(O)=O)CCCN=C(N)N COYSIHFOCOMGCF-UHFFFAOYSA-N 0.000 description 15
- 150000001413 amino acids Chemical class 0.000 description 15
- 108010063718 gamma-glutamylaspartic acid Proteins 0.000 description 15
- 201000007270 liver cancer Diseases 0.000 description 15
- 238000011282 treatment Methods 0.000 description 15
- 108010078580 tyrosylleucine Proteins 0.000 description 15
- ZTKHZAXGTFXUDD-VEVYYDQMSA-N Arg-Asn-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ZTKHZAXGTFXUDD-VEVYYDQMSA-N 0.000 description 14
- CLODWIOAKCSBAN-BQBZGAKWSA-N Gly-Arg-Asp Chemical compound NC(N)=NCCC[C@H](NC(=O)CN)C(=O)N[C@@H](CC(O)=O)C(O)=O CLODWIOAKCSBAN-BQBZGAKWSA-N 0.000 description 14
- YOKVEHGYYQEQOP-QWRGUYRKSA-N Leu-Leu-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O YOKVEHGYYQEQOP-QWRGUYRKSA-N 0.000 description 14
- NJMXCOOEFLMZSR-AVGNSLFASA-N Leu-Met-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C(C)C)C(O)=O NJMXCOOEFLMZSR-AVGNSLFASA-N 0.000 description 14
- USBFEVBHEQBWDD-AVGNSLFASA-N Met-Leu-Val Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O USBFEVBHEQBWDD-AVGNSLFASA-N 0.000 description 14
- 101710144111 Non-structural protein 3 Proteins 0.000 description 14
- CGSOWZUPLOKYOR-AVGNSLFASA-N Pro-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 CGSOWZUPLOKYOR-AVGNSLFASA-N 0.000 description 14
- DWUIECHTAMYEFL-XVYDVKMFSA-N Ser-Ala-His Chemical compound OC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 DWUIECHTAMYEFL-XVYDVKMFSA-N 0.000 description 14
- UGFMVXRXULGLNO-XPUUQOCRSA-N Val-Ser-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O UGFMVXRXULGLNO-XPUUQOCRSA-N 0.000 description 14
- LLJLBRRXKZTTRD-GUBZILKMSA-N Val-Val-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(=O)O)N LLJLBRRXKZTTRD-GUBZILKMSA-N 0.000 description 14
- 108010062266 glycyl-glycyl-argininal Proteins 0.000 description 14
- 208000015181 infectious disease Diseases 0.000 description 14
- 108010025826 prolyl-leucyl-arginine Proteins 0.000 description 14
- IZVICCORZOSGPT-JSGCOSHPSA-N Gly-Val-Tyr Chemical compound [H]NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O IZVICCORZOSGPT-JSGCOSHPSA-N 0.000 description 13
- LVWIJITYHRZHBO-IXOXFDKPSA-N His-Leu-Thr Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LVWIJITYHRZHBO-IXOXFDKPSA-N 0.000 description 13
- VQPPIMUZCZCOIL-GUBZILKMSA-N Leu-Gln-Ala Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O VQPPIMUZCZCOIL-GUBZILKMSA-N 0.000 description 13
- XVZCXCTYGHPNEM-UHFFFAOYSA-N Leu-Leu-Pro Natural products CC(C)CC(N)C(=O)NC(CC(C)C)C(=O)N1CCCC1C(O)=O XVZCXCTYGHPNEM-UHFFFAOYSA-N 0.000 description 13
- YBAFDPFAUTYYRW-UHFFFAOYSA-N N-L-alpha-glutamyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CCC(O)=O YBAFDPFAUTYYRW-UHFFFAOYSA-N 0.000 description 13
- UUYCNAXCCDNULB-QXEWZRGKSA-N Val-Arg-Asn Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC(N)=O)C(O)=O UUYCNAXCCDNULB-QXEWZRGKSA-N 0.000 description 13
- VTIAEOKFUJJBTC-YDHLFZDLSA-N Val-Tyr-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N VTIAEOKFUJJBTC-YDHLFZDLSA-N 0.000 description 13
- 108010078114 alanyl-tryptophyl-alanine Proteins 0.000 description 13
- 108010069926 arginyl-glycyl-serine Proteins 0.000 description 13
- 239000013612 plasmid Substances 0.000 description 13
- 108010004914 prolylarginine Proteins 0.000 description 13
- 108010026333 seryl-proline Proteins 0.000 description 13
- JNNVNVRBYUJYGS-CIUDSAMLSA-N Asp-Leu-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O JNNVNVRBYUJYGS-CIUDSAMLSA-N 0.000 description 12
- 206010008909 Chronic Hepatitis Diseases 0.000 description 12
- 108091026898 Leader sequence (mRNA) Proteins 0.000 description 12
- YFBBUHJJUXXZOF-UWVGGRQHSA-N Leu-Gly-Pro Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N1CCC[C@H]1C(O)=O YFBBUHJJUXXZOF-UWVGGRQHSA-N 0.000 description 12
- SITLTJHOQZFJGG-UHFFFAOYSA-N N-L-alpha-glutamyl-L-valine Natural products CC(C)C(C(O)=O)NC(=O)C(N)CCC(O)=O SITLTJHOQZFJGG-UHFFFAOYSA-N 0.000 description 12
- XDMMOISUAHXXFD-SRVKXCTJSA-N Phe-Ser-Asp Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O XDMMOISUAHXXFD-SRVKXCTJSA-N 0.000 description 12
- PZZJMBYSYAKYPK-UWJYBYFXSA-N Ser-Ala-Tyr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O PZZJMBYSYAKYPK-UWJYBYFXSA-N 0.000 description 12
- 108010087924 alanylproline Proteins 0.000 description 12
- 108010013835 arginine glutamate Proteins 0.000 description 12
- 108010025306 histidylleucine Proteins 0.000 description 12
- 108010060857 isoleucyl-valyl-tyrosine Proteins 0.000 description 12
- 210000001519 tissue Anatomy 0.000 description 12
- XLYOFNOQVPJJNP-UHFFFAOYSA-N water Chemical compound O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 description 12
- 108020003589 5' Untranslated Regions Proteins 0.000 description 11
- WSOKZUVWBXVJHX-CIUDSAMLSA-N Asp-Arg-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O WSOKZUVWBXVJHX-CIUDSAMLSA-N 0.000 description 11
- PXXGVUVQWQGGIG-YUMQZZPRSA-N Glu-Gly-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCN=C(N)N PXXGVUVQWQGGIG-YUMQZZPRSA-N 0.000 description 11
- CCQOOWAONKGYKQ-BYPYZUCNSA-N Gly-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)CN CCQOOWAONKGYKQ-BYPYZUCNSA-N 0.000 description 11
- YNIMVVJTPWCUJH-KBPBESRZSA-N Gly-His-Tyr Chemical compound [H]NCC(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O YNIMVVJTPWCUJH-KBPBESRZSA-N 0.000 description 11
- NSTUFLGQJCOCDL-UWVGGRQHSA-N Gly-Leu-Arg Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N NSTUFLGQJCOCDL-UWVGGRQHSA-N 0.000 description 11
- XLXPYSDGMXTTNQ-UHFFFAOYSA-N Ile-Phe-Leu Natural products CCC(C)C(N)C(=O)NC(C(=O)NC(CC(C)C)C(O)=O)CC1=CC=CC=C1 XLXPYSDGMXTTNQ-UHFFFAOYSA-N 0.000 description 11
- FKVNLUZHSFCNGY-RVMXOQNASA-N Pro-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@@H]2CCCN2 FKVNLUZHSFCNGY-RVMXOQNASA-N 0.000 description 11
- BVOVIGCHYNFJBZ-JXUBOQSCSA-N Thr-Leu-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O BVOVIGCHYNFJBZ-JXUBOQSCSA-N 0.000 description 11
- XEVHXNLPUBVQEX-DVJZZOLTSA-N Thr-Trp-Gly Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)NCC(=O)O)N)O XEVHXNLPUBVQEX-DVJZZOLTSA-N 0.000 description 11
- PFMSJVIPEZMKSC-DZKIICNBSA-N Val-Tyr-Glu Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N PFMSJVIPEZMKSC-DZKIICNBSA-N 0.000 description 11
- 238000001514 detection method Methods 0.000 description 11
- 108010009297 diglycyl-histidine Proteins 0.000 description 11
- 108020001507 fusion proteins Proteins 0.000 description 11
- 102000037865 fusion proteins Human genes 0.000 description 11
- 108010036413 histidylglycine Proteins 0.000 description 11
- 108010084572 phenylalanyl-valine Proteins 0.000 description 11
- 230000035755 proliferation Effects 0.000 description 11
- 210000002966 serum Anatomy 0.000 description 11
- 108010084932 tryptophyl-proline Proteins 0.000 description 11
- 108010015385 valyl-prolyl-proline Proteins 0.000 description 11
- ODWSTKXGQGYHSH-FXQIFTODSA-N Ala-Arg-Ala Chemical compound C[C@H](N)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(O)=O ODWSTKXGQGYHSH-FXQIFTODSA-N 0.000 description 10
- IFTVANMRTIHKML-WDSKDSINSA-N Ala-Gln-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O IFTVANMRTIHKML-WDSKDSINSA-N 0.000 description 10
- OYJCVIGKMXUVKB-GARJFASQSA-N Ala-Leu-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N OYJCVIGKMXUVKB-GARJFASQSA-N 0.000 description 10
- VRTOMXFZHGWHIJ-KZVJFYERSA-N Ala-Thr-Arg Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O VRTOMXFZHGWHIJ-KZVJFYERSA-N 0.000 description 10
- OHYQKYUTLIPFOX-ZPFDUUQYSA-N Arg-Glu-Ile Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O OHYQKYUTLIPFOX-ZPFDUUQYSA-N 0.000 description 10
- FTMRPIVPSDVGCC-GUBZILKMSA-N Arg-Val-Cys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N FTMRPIVPSDVGCC-GUBZILKMSA-N 0.000 description 10
- SUMJNGAMIQSNGX-TUAOUCFPSA-N Arg-Val-Pro Chemical compound CC(C)[C@H](NC(=O)[C@@H](N)CCCNC(N)=N)C(=O)N1CCC[C@@H]1C(O)=O SUMJNGAMIQSNGX-TUAOUCFPSA-N 0.000 description 10
- GMRGSBAMMMVDGG-GUBZILKMSA-N Asn-Arg-Arg Chemical compound C(C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N)CN=C(N)N GMRGSBAMMMVDGG-GUBZILKMSA-N 0.000 description 10
- NURJSGZGBVJFAD-ZLUOBGJFSA-N Asp-Cys-Ser Chemical compound C([C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CO)C(=O)O)N)C(=O)O NURJSGZGBVJFAD-ZLUOBGJFSA-N 0.000 description 10
- WSGVTKZFVJSJOG-RCOVLWMOSA-N Asp-Gly-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O WSGVTKZFVJSJOG-RCOVLWMOSA-N 0.000 description 10
- PYTZFYUXZZHOAD-WHFBIAKZSA-N Gly-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)CN PYTZFYUXZZHOAD-WHFBIAKZSA-N 0.000 description 10
- QXPRJQPCFXMCIY-NKWVEPMBSA-N Gly-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)CN QXPRJQPCFXMCIY-NKWVEPMBSA-N 0.000 description 10
- KOYUSMBPJOVSOO-XEGUGMAKSA-N Gly-Tyr-Ile Chemical compound [H]NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KOYUSMBPJOVSOO-XEGUGMAKSA-N 0.000 description 10
- JBCLFWXMTIKCCB-UHFFFAOYSA-N H-Gly-Phe-OH Natural products NCC(=O)NC(C(O)=O)CC1=CC=CC=C1 JBCLFWXMTIKCCB-UHFFFAOYSA-N 0.000 description 10
- NLZVTPYXYXMCIP-XUXIUFHCSA-N Ile-Pro-Lys Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(O)=O NLZVTPYXYXMCIP-XUXIUFHCSA-N 0.000 description 10
- FBNPMTNBFFAMMH-AVGNSLFASA-N Leu-Val-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N FBNPMTNBFFAMMH-AVGNSLFASA-N 0.000 description 10
- QEVRUYFHWJJUHZ-DCAQKATOSA-N Met-Ala-Leu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC(C)C QEVRUYFHWJJUHZ-DCAQKATOSA-N 0.000 description 10
- WEDDFMCSUNNZJR-WDSKDSINSA-N Met-Ser Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CO)C(O)=O WEDDFMCSUNNZJR-WDSKDSINSA-N 0.000 description 10
- DBMLDOWSVHMQQN-XGEHTFHBSA-N Met-Ser-Thr Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O DBMLDOWSVHMQQN-XGEHTFHBSA-N 0.000 description 10
- BQVUABVGYYSDCJ-UHFFFAOYSA-N Nalpha-L-Leucyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)CC(C)C)C(O)=O)=CNC2=C1 BQVUABVGYYSDCJ-UHFFFAOYSA-N 0.000 description 10
- YXHYJEPDKSYPSQ-AVGNSLFASA-N Pro-Leu-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H]1CCCN1 YXHYJEPDKSYPSQ-AVGNSLFASA-N 0.000 description 10
- OQSGBXGNAFQGGS-CYDGBPFRSA-N Pro-Val-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O OQSGBXGNAFQGGS-CYDGBPFRSA-N 0.000 description 10
- TWLMXDWFVNEFFK-FJXKBIBVSA-N Thr-Arg-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O TWLMXDWFVNEFFK-FJXKBIBVSA-N 0.000 description 10
- NAXBBCLCEOTAIG-RHYQMDGZSA-N Thr-Arg-Lys Chemical compound NC(N)=NCCC[C@H](NC(=O)[C@@H](N)[C@H](O)C)C(=O)N[C@@H](CCCCN)C(O)=O NAXBBCLCEOTAIG-RHYQMDGZSA-N 0.000 description 10
- CCZXBOFIBYQLEV-IHPCNDPISA-N Trp-Leu-Leu Chemical compound CC(C)C[C@H](NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)Cc1c[nH]c2ccccc12)C(O)=O CCZXBOFIBYQLEV-IHPCNDPISA-N 0.000 description 10
- PGBJAZDAEWPDAA-NHCYSSNCSA-N Val-Gln-Met Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCSC)C(=O)O)N PGBJAZDAEWPDAA-NHCYSSNCSA-N 0.000 description 10
- PQSNETRGCRUOGP-KKHAAJSZSA-N Val-Thr-Asn Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(N)=O PQSNETRGCRUOGP-KKHAAJSZSA-N 0.000 description 10
- 108010044940 alanylglutamine Proteins 0.000 description 10
- 108010070944 alanylhistidine Proteins 0.000 description 10
- 108010042598 glutamyl-aspartyl-glycine Proteins 0.000 description 10
- 108020004707 nucleic acids Proteins 0.000 description 10
- 102000039446 nucleic acids Human genes 0.000 description 10
- 108010015796 prolylisoleucine Proteins 0.000 description 10
- 239000000243 solution Substances 0.000 description 10
- 230000003612 virological effect Effects 0.000 description 10
- ZCPBEAHAVUJKAE-UHTWSYAYSA-N (2s)-2-[[(2s)-2-[[(2r)-2-[(2-aminoacetyl)amino]-3-phenylpropanoyl]amino]propanoyl]amino]butanedioic acid Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@H](NC(=O)CN)CC1=CC=CC=C1 ZCPBEAHAVUJKAE-UHTWSYAYSA-N 0.000 description 9
- DKJPOZOEBONHFS-ZLUOBGJFSA-N Ala-Ala-Asp Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC(O)=O DKJPOZOEBONHFS-ZLUOBGJFSA-N 0.000 description 9
- FVNAUOZKIPAYNA-BPNCWPANSA-N Ala-Met-Tyr Chemical compound CSCC[C@H](NC(=O)[C@H](C)N)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 FVNAUOZKIPAYNA-BPNCWPANSA-N 0.000 description 9
- JGDGLDNAQJJGJI-AVGNSLFASA-N Arg-Arg-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CCCN=C(N)N)N JGDGLDNAQJJGJI-AVGNSLFASA-N 0.000 description 9
- ASQYTJJWAMDISW-BPUTZDHNSA-N Arg-Asp-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCCN=C(N)N)N ASQYTJJWAMDISW-BPUTZDHNSA-N 0.000 description 9
- QNMKWNONJGKJJC-NHCYSSNCSA-N Asp-Leu-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O QNMKWNONJGKJJC-NHCYSSNCSA-N 0.000 description 9
- PQHYZJPCYRDYNE-QWRGUYRKSA-N Cys-Gly-Phe Chemical compound [H]N[C@@H](CS)C(=O)NCC(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O PQHYZJPCYRDYNE-QWRGUYRKSA-N 0.000 description 9
- MHYHLWUGWUBUHF-GUBZILKMSA-N Cys-Val-Arg Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](CS)N MHYHLWUGWUBUHF-GUBZILKMSA-N 0.000 description 9
- QQAYIVHVRFJICE-AEJSXWLSSA-N Cys-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CS)N QQAYIVHVRFJICE-AEJSXWLSSA-N 0.000 description 9
- FQCILXROGNOZON-YUMQZZPRSA-N Gln-Pro-Gly Chemical compound NC(=O)CC[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O FQCILXROGNOZON-YUMQZZPRSA-N 0.000 description 9
- UGVQELHRNUDMAA-BYPYZUCNSA-N Gly-Ala-Gly Chemical compound [NH3+]CC(=O)N[C@@H](C)C(=O)NCC([O-])=O UGVQELHRNUDMAA-BYPYZUCNSA-N 0.000 description 9
- PDAWDNVHMUKWJR-ZETCQYMHSA-N Gly-Gly-His Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CC1=CNC=N1 PDAWDNVHMUKWJR-ZETCQYMHSA-N 0.000 description 9
- BAYQNCWLXIDLHX-ONGXEEELSA-N Gly-Val-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)CN BAYQNCWLXIDLHX-ONGXEEELSA-N 0.000 description 9
- UPJODPVSKKWGDQ-KLHWPWHYSA-N His-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CN=CN2)N)O UPJODPVSKKWGDQ-KLHWPWHYSA-N 0.000 description 9
- XLXPYSDGMXTTNQ-DKIMLUQUSA-N Ile-Phe-Leu Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](Cc1ccccc1)C(=O)N[C@@H](CC(C)C)C(O)=O XLXPYSDGMXTTNQ-DKIMLUQUSA-N 0.000 description 9
- 102000014150 Interferons Human genes 0.000 description 9
- 108010050904 Interferons Proteins 0.000 description 9
- PWWVAXIEGOYWEE-UHFFFAOYSA-N Isophenergan Chemical compound C1=CC=C2N(CC(C)N(C)C)C3=CC=CC=C3SC2=C1 PWWVAXIEGOYWEE-UHFFFAOYSA-N 0.000 description 9
- IASQBRJGRVXNJI-YUMQZZPRSA-N Leu-Cys-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CS)C(=O)NCC(O)=O IASQBRJGRVXNJI-YUMQZZPRSA-N 0.000 description 9
- YRAWWKUTNBILNT-FXQIFTODSA-N Met-Ala-Ala Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O YRAWWKUTNBILNT-FXQIFTODSA-N 0.000 description 9
- WUGMRIBZSVSJNP-UHFFFAOYSA-N N-L-alanyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)C)C(O)=O)=CNC2=C1 WUGMRIBZSVSJNP-UHFFFAOYSA-N 0.000 description 9
- 108010002311 N-glycylglutamic acid Proteins 0.000 description 9
- NPLGQVKZFGJWAI-QWHCGFSZSA-N Phe-Gly-Pro Chemical compound C1C[C@@H](N(C1)C(=O)CNC(=O)[C@H](CC2=CC=CC=C2)N)C(=O)O NPLGQVKZFGJWAI-QWHCGFSZSA-N 0.000 description 9
- YTILBRIUASDGBL-BZSNNMDCSA-N Phe-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 YTILBRIUASDGBL-BZSNNMDCSA-N 0.000 description 9
- LNLNHXIQPGKRJQ-SRVKXCTJSA-N Pro-Arg-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@@H]1CCCN1 LNLNHXIQPGKRJQ-SRVKXCTJSA-N 0.000 description 9
- XYSXOCIWCPFOCG-IHRRRGAJSA-N Pro-Leu-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O XYSXOCIWCPFOCG-IHRRRGAJSA-N 0.000 description 9
- GZNYIXWOIUFLGO-ZJDVBMNYSA-N Pro-Thr-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GZNYIXWOIUFLGO-ZJDVBMNYSA-N 0.000 description 9
- FHJQROWZEJFZPO-SRVKXCTJSA-N Pro-Val-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H]1CCCN1 FHJQROWZEJFZPO-SRVKXCTJSA-N 0.000 description 9
- OHOVFPKXPZODHS-SJWGOKEGSA-N Tyr-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)N OHOVFPKXPZODHS-SJWGOKEGSA-N 0.000 description 9
- FGVFBDZSGQTYQX-UFYCRDLUSA-N Tyr-Phe-Val Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(O)=O FGVFBDZSGQTYQX-UFYCRDLUSA-N 0.000 description 9
- DJEVQCWNMQOABE-RCOVLWMOSA-N Val-Gly-Asp Chemical compound CC(C)[C@@H](C(=O)NCC(=O)N[C@@H](CC(=O)O)C(=O)O)N DJEVQCWNMQOABE-RCOVLWMOSA-N 0.000 description 9
- JPBGMZDTPVGGMQ-ULQDDVLXSA-N Val-Tyr-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N JPBGMZDTPVGGMQ-ULQDDVLXSA-N 0.000 description 9
- 108010036951 achatin I Proteins 0.000 description 9
- 108010076324 alanyl-glycyl-glycine Proteins 0.000 description 9
- 229940079322 interferon Drugs 0.000 description 9
- 108010073472 leucyl-prolyl-proline Proteins 0.000 description 9
- 238000012317 liver biopsy Methods 0.000 description 9
- 108010051242 phenylalanylserine Proteins 0.000 description 9
- 108700042769 prolyl-leucyl-glycine Proteins 0.000 description 9
- UWQJHXKARZWDIJ-ZLUOBGJFSA-N Ala-Ala-Cys Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CS)C(O)=O UWQJHXKARZWDIJ-ZLUOBGJFSA-N 0.000 description 8
- DVWVZSJAYIJZFI-FXQIFTODSA-N Ala-Arg-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(O)=O DVWVZSJAYIJZFI-FXQIFTODSA-N 0.000 description 8
- MQIGTEQXYCRLGK-BQBZGAKWSA-N Ala-Gly-Pro Chemical compound C[C@H](N)C(=O)NCC(=O)N1CCC[C@H]1C(O)=O MQIGTEQXYCRLGK-BQBZGAKWSA-N 0.000 description 8
- ADSGHMXEAZJJNF-DCAQKATOSA-N Ala-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](C)N ADSGHMXEAZJJNF-DCAQKATOSA-N 0.000 description 8
- OLVCTPPSXNRGKV-GUBZILKMSA-N Ala-Pro-Pro Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 OLVCTPPSXNRGKV-GUBZILKMSA-N 0.000 description 8
- YFWTXMRJJDNTLM-LSJOCFKGSA-N Arg-Ala-His Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N YFWTXMRJJDNTLM-LSJOCFKGSA-N 0.000 description 8
- NHSDEZURHWEZPN-SXTJYALSSA-N Asp-Ile-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)NC(=O)[C@H](CC(=O)O)N NHSDEZURHWEZPN-SXTJYALSSA-N 0.000 description 8
- KFAFUJMGHVVYRC-DCAQKATOSA-N Asp-Leu-Met Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(O)=O KFAFUJMGHVVYRC-DCAQKATOSA-N 0.000 description 8
- QUQHPUMRFGFINP-BPUTZDHNSA-N Cys-Trp-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@H](CS)N QUQHPUMRFGFINP-BPUTZDHNSA-N 0.000 description 8
- 206010059866 Drug resistance Diseases 0.000 description 8
- WCORRBXVISTKQL-WHFBIAKZSA-N Gly-Ser-Ser Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O WCORRBXVISTKQL-WHFBIAKZSA-N 0.000 description 8
- 208000005176 Hepatitis C Diseases 0.000 description 8
- HPCFRQWLTRDGHT-AJNGGQMLSA-N Ile-Leu-Leu Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O HPCFRQWLTRDGHT-AJNGGQMLSA-N 0.000 description 8
- ZRLUISBDKUWAIZ-CIUDSAMLSA-N Leu-Ala-Asp Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC(O)=O ZRLUISBDKUWAIZ-CIUDSAMLSA-N 0.000 description 8
- WSGXUIQTEZDVHJ-GARJFASQSA-N Leu-Ala-Pro Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N1CCC[C@@H]1C(O)=O WSGXUIQTEZDVHJ-GARJFASQSA-N 0.000 description 8
- VWHGTYCRDRBSFI-ZETCQYMHSA-N Leu-Gly-Gly Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)NCC(O)=O VWHGTYCRDRBSFI-ZETCQYMHSA-N 0.000 description 8
- UBZGNBKMIJHOHL-BZSNNMDCSA-N Leu-Leu-Phe Chemical compound CC(C)C[C@H]([NH3+])C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C([O-])=O)CC1=CC=CC=C1 UBZGNBKMIJHOHL-BZSNNMDCSA-N 0.000 description 8
- FLNPJLDPGMLWAU-UWVGGRQHSA-N Leu-Met-Gly Chemical compound OC(=O)CNC(=O)[C@H](CCSC)NC(=O)[C@@H](N)CC(C)C FLNPJLDPGMLWAU-UWVGGRQHSA-N 0.000 description 8
- GLUYKHMBGKQBHE-JYJNAYRXSA-N Phe-Val-Arg Chemical compound NC(=N)NCCC[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 GLUYKHMBGKQBHE-JYJNAYRXSA-N 0.000 description 8
- XJROSHJRQTXWAE-XGEHTFHBSA-N Pro-Cys-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CS)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XJROSHJRQTXWAE-XGEHTFHBSA-N 0.000 description 8
- HNDMFDBQXYZSRM-IHRRRGAJSA-N Ser-Val-Phe Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O HNDMFDBQXYZSRM-IHRRRGAJSA-N 0.000 description 8
- UUSQVWOVUYMLJA-PPCPHDFISA-N Thr-Lys-Ile Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O UUSQVWOVUYMLJA-PPCPHDFISA-N 0.000 description 8
- CDRYEAWHKJSGAF-BPNCWPANSA-N Tyr-Ala-Met Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C)C(=O)N[C@@H](CCSC)C(O)=O CDRYEAWHKJSGAF-BPNCWPANSA-N 0.000 description 8
- TZVUSFMQWPWHON-NHCYSSNCSA-N Val-Asp-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](C(C)C)N TZVUSFMQWPWHON-NHCYSSNCSA-N 0.000 description 8
- JZWZACGUZVCQPS-RNJOBUHISA-N Val-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](C(C)C)N JZWZACGUZVCQPS-RNJOBUHISA-N 0.000 description 8
- QWCZXKIFPWPQHR-JYJNAYRXSA-N Val-Pro-Tyr Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 QWCZXKIFPWPQHR-JYJNAYRXSA-N 0.000 description 8
- 108010086434 alanyl-seryl-glycine Proteins 0.000 description 8
- 108010092854 aspartyllysine Proteins 0.000 description 8
- 230000002440 hepatic effect Effects 0.000 description 8
- 108010031424 isoleucyl-prolyl-proline Proteins 0.000 description 8
- 239000002609 medium Substances 0.000 description 8
- 108010005942 methionylglycine Proteins 0.000 description 8
- 238000003757 reverse transcription PCR Methods 0.000 description 8
- 108010029384 tryptophyl-histidine Proteins 0.000 description 8
- 108010027345 wheylin-1 peptide Proteins 0.000 description 8
- 108020005345 3' Untranslated Regions Proteins 0.000 description 7
- IMIZPWSVYADSCN-UHFFFAOYSA-N 4-methyl-2-[[4-methyl-2-[[4-methyl-2-(pyrrolidine-2-carbonylamino)pentanoyl]amino]pentanoyl]amino]pentanoic acid Chemical compound CC(C)CC(C(O)=O)NC(=O)C(CC(C)C)NC(=O)C(CC(C)C)NC(=O)C1CCCN1 IMIZPWSVYADSCN-UHFFFAOYSA-N 0.000 description 7
- QDRGPQWIVZNJQD-CIUDSAMLSA-N Ala-Arg-Gln Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(O)=O QDRGPQWIVZNJQD-CIUDSAMLSA-N 0.000 description 7
- BTYTYHBSJKQBQA-GCJQMDKQSA-N Ala-Asp-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](C)N)O BTYTYHBSJKQBQA-GCJQMDKQSA-N 0.000 description 7
- CZPAHAKGPDUIPJ-CIUDSAMLSA-N Ala-Gln-Pro Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N1CCC[C@H]1C(O)=O CZPAHAKGPDUIPJ-CIUDSAMLSA-N 0.000 description 7
- CFPQUJZTLUQUTJ-HTFCKZLJSA-N Ala-Ile-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@H](C)N CFPQUJZTLUQUTJ-HTFCKZLJSA-N 0.000 description 7
- CCDFBRZVTDDJNM-GUBZILKMSA-N Ala-Leu-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O CCDFBRZVTDDJNM-GUBZILKMSA-N 0.000 description 7
- QKHWNPQNOHEFST-VZFHVOOUSA-N Ala-Thr-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](C)N)O QKHWNPQNOHEFST-VZFHVOOUSA-N 0.000 description 7
- HCBKAOZYACJUEF-XQXXSGGOSA-N Ala-Thr-Gln Chemical compound N[C@@H](C)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CCC(N)=O)C(=O)O HCBKAOZYACJUEF-XQXXSGGOSA-N 0.000 description 7
- ZVWXMTTZJKBJCI-BHDSKKPTSA-N Ala-Trp-Ala Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](N)C)C(=O)N[C@@H](C)C(O)=O)=CNC2=C1 ZVWXMTTZJKBJCI-BHDSKKPTSA-N 0.000 description 7
- XPSGESXVBSQZPL-SRVKXCTJSA-N Arg-Arg-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O XPSGESXVBSQZPL-SRVKXCTJSA-N 0.000 description 7
- OOWSBIOUKIUWLO-RCOVLWMOSA-N Asn-Gly-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O OOWSBIOUKIUWLO-RCOVLWMOSA-N 0.000 description 7
- XLDMSQYOYXINSZ-QXEWZRGKSA-N Asn-Val-Arg Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N XLDMSQYOYXINSZ-QXEWZRGKSA-N 0.000 description 7
- MYOHQBFRJQFIDZ-KKUMJFAQSA-N Asp-Leu-Tyr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O MYOHQBFRJQFIDZ-KKUMJFAQSA-N 0.000 description 7
- XOASPVGNFAMYBD-WFBYXXMGSA-N Asp-Trp-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](C)C(O)=O XOASPVGNFAMYBD-WFBYXXMGSA-N 0.000 description 7
- RRIJEABIXPKSGP-FXQIFTODSA-N Cys-Ala-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CS RRIJEABIXPKSGP-FXQIFTODSA-N 0.000 description 7
- DZLQXIFVQFTFJY-BYPYZUCNSA-N Cys-Gly-Gly Chemical compound SC[C@H](N)C(=O)NCC(=O)NCC(O)=O DZLQXIFVQFTFJY-BYPYZUCNSA-N 0.000 description 7
- FTTZLFIEUQHLHH-BWBBJGPYSA-N Cys-Thr-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CS)N)O FTTZLFIEUQHLHH-BWBBJGPYSA-N 0.000 description 7
- HQRHFUYMGCHHJS-LURJTMIESA-N Gly-Gly-Arg Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CCCN=C(N)N HQRHFUYMGCHHJS-LURJTMIESA-N 0.000 description 7
- SWQALSGKVLYKDT-UHFFFAOYSA-N Gly-Ile-Ala Natural products NCC(=O)NC(C(C)CC)C(=O)NC(C)C(O)=O SWQALSGKVLYKDT-UHFFFAOYSA-N 0.000 description 7
- YDIDLLVFCYSXNY-RCOVLWMOSA-N Gly-Val-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)CN YDIDLLVFCYSXNY-RCOVLWMOSA-N 0.000 description 7
- GBMSSORHVHAYLU-QTKMDUPCSA-N His-Val-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CC1=CN=CN1)N)O GBMSSORHVHAYLU-QTKMDUPCSA-N 0.000 description 7
- AZEYWPUCOYXFOE-CYDGBPFRSA-N Ile-Arg-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](C(C)C)C(=O)O)N AZEYWPUCOYXFOE-CYDGBPFRSA-N 0.000 description 7
- TWPSALMCEHCIOY-YTFOTSKYSA-N Ile-Ile-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(=O)O)N TWPSALMCEHCIOY-YTFOTSKYSA-N 0.000 description 7
- AXNGDPAKKCEKGY-QPHKQPEJSA-N Ile-Ile-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N AXNGDPAKKCEKGY-QPHKQPEJSA-N 0.000 description 7
- CNMOKANDJMLAIF-CIQUZCHMSA-N Ile-Thr-Ala Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O CNMOKANDJMLAIF-CIQUZCHMSA-N 0.000 description 7
- KWHFUMYCSPJCFQ-NGTWOADLSA-N Ile-Thr-Trp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N KWHFUMYCSPJCFQ-NGTWOADLSA-N 0.000 description 7
- LZDNBBYBDGBADK-UHFFFAOYSA-N L-valyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)C(C)C)C(O)=O)=CNC2=C1 LZDNBBYBDGBADK-UHFFFAOYSA-N 0.000 description 7
- IEWBEPKLKUXQBU-VOAKCMCISA-N Leu-Leu-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O IEWBEPKLKUXQBU-VOAKCMCISA-N 0.000 description 7
- FDBTVENULFNTAL-XQQFMLRXSA-N Leu-Val-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N FDBTVENULFNTAL-XQQFMLRXSA-N 0.000 description 7
- YKIRNDPUWONXQN-GUBZILKMSA-N Lys-Asn-Gln Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N YKIRNDPUWONXQN-GUBZILKMSA-N 0.000 description 7
- RFQATBGBLDAKGI-VHSXEESVSA-N Lys-Gly-Pro Chemical compound C1C[C@@H](N(C1)C(=O)CNC(=O)[C@H](CCCCN)N)C(=O)O RFQATBGBLDAKGI-VHSXEESVSA-N 0.000 description 7
- JYXBNQOKPRQNQS-YTFOTSKYSA-N Lys-Ile-Ile Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JYXBNQOKPRQNQS-YTFOTSKYSA-N 0.000 description 7
- XGZDDOKIHSYHTO-SZMVWBNQSA-N Lys-Trp-Glu Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](N)CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O)=CNC2=C1 XGZDDOKIHSYHTO-SZMVWBNQSA-N 0.000 description 7
- DRRXXZBXDMLGFC-IHRRRGAJSA-N Lys-Val-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCCN DRRXXZBXDMLGFC-IHRRRGAJSA-N 0.000 description 7
- JPCHYAUKOUGOIB-HJGDQZAQSA-N Met-Glu-Thr Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O JPCHYAUKOUGOIB-HJGDQZAQSA-N 0.000 description 7
- SODXFJOPSCXOHE-IHRRRGAJSA-N Met-Leu-Leu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O SODXFJOPSCXOHE-IHRRRGAJSA-N 0.000 description 7
- JHVNNUIQXOGAHI-KJEVXHAQSA-N Met-Tyr-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CCSC)N)O JHVNNUIQXOGAHI-KJEVXHAQSA-N 0.000 description 7
- XZFYRXDAULDNFX-UHFFFAOYSA-N N-L-cysteinyl-L-phenylalanine Natural products SCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 XZFYRXDAULDNFX-UHFFFAOYSA-N 0.000 description 7
- 229930193140 Neomycin Natural products 0.000 description 7
- XQLBWXHVZVBNJM-FXQIFTODSA-N Pro-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1 XQLBWXHVZVBNJM-FXQIFTODSA-N 0.000 description 7
- NHDVNAKDACFHPX-GUBZILKMSA-N Pro-Arg-Ala Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(O)=O NHDVNAKDACFHPX-GUBZILKMSA-N 0.000 description 7
- SUENWIFTSTWUKD-AVGNSLFASA-N Pro-Leu-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O SUENWIFTSTWUKD-AVGNSLFASA-N 0.000 description 7
- FIODMZKLZFLYQP-GUBZILKMSA-N Pro-Val-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O FIODMZKLZFLYQP-GUBZILKMSA-N 0.000 description 7
- JNQZPAWOPBZGIX-RCWTZXSCSA-N Thr-Arg-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)[C@@H](C)O)CCCN=C(N)N JNQZPAWOPBZGIX-RCWTZXSCSA-N 0.000 description 7
- IEWKKXZRJLTIOV-AVGNSLFASA-N Tyr-Ser-Gln Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(O)=O IEWKKXZRJLTIOV-AVGNSLFASA-N 0.000 description 7
- UZDHNIJRRTUKKC-DLOVCJGASA-N Val-Gln-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](C(C)C)C(=O)O)N UZDHNIJRRTUKKC-DLOVCJGASA-N 0.000 description 7
- XBJKAZATRJBDCU-GUBZILKMSA-N Val-Pro-Ala Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O XBJKAZATRJBDCU-GUBZILKMSA-N 0.000 description 7
- MIKHIIQMRFYVOR-RCWTZXSCSA-N Val-Pro-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](C(C)C)N)O MIKHIIQMRFYVOR-RCWTZXSCSA-N 0.000 description 7
- QSPOLEBZTMESFY-SRVKXCTJSA-N Val-Pro-Val Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O QSPOLEBZTMESFY-SRVKXCTJSA-N 0.000 description 7
- UJMCYJKPDFQLHX-XGEHTFHBSA-N Val-Ser-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](C(C)C)N)O UJMCYJKPDFQLHX-XGEHTFHBSA-N 0.000 description 7
- 108010024078 alanyl-glycyl-serine Proteins 0.000 description 7
- 238000004458 analytical method Methods 0.000 description 7
- 108010009111 arginyl-glycyl-glutamic acid Proteins 0.000 description 7
- 108010060035 arginylproline Proteins 0.000 description 7
- 238000005119 centrifugation Methods 0.000 description 7
- 238000010367 cloning Methods 0.000 description 7
- 238000011161 development Methods 0.000 description 7
- 108010020688 glycylhistidine Proteins 0.000 description 7
- 108010085325 histidylproline Proteins 0.000 description 7
- 229960004927 neomycin Drugs 0.000 description 7
- 108010073025 phenylalanylphenylalanine Proteins 0.000 description 7
- 239000006228 supernatant Substances 0.000 description 7
- YYSWCHMLFJLLBJ-ZLUOBGJFSA-N Ala-Ala-Ser Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O YYSWCHMLFJLLBJ-ZLUOBGJFSA-N 0.000 description 6
- KRHRBKYBJXMYBB-WHFBIAKZSA-N Ala-Cys-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CS)C(=O)NCC(O)=O KRHRBKYBJXMYBB-WHFBIAKZSA-N 0.000 description 6
- GRIFPSOFWFIICX-GOPGUHFVSA-N Ala-His-Trp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O GRIFPSOFWFIICX-GOPGUHFVSA-N 0.000 description 6
- SUMYEVXWCAYLLJ-GUBZILKMSA-N Ala-Leu-Gln Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O SUMYEVXWCAYLLJ-GUBZILKMSA-N 0.000 description 6
- MUGAESARFRGOTQ-IGNZVWTISA-N Ala-Tyr-Tyr Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O)N MUGAESARFRGOTQ-IGNZVWTISA-N 0.000 description 6
- OLDOLPWZEMHNIA-PJODQICGSA-N Arg-Ala-Trp Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O OLDOLPWZEMHNIA-PJODQICGSA-N 0.000 description 6
- UISQLSIBJKEJSS-GUBZILKMSA-N Arg-Arg-Ser Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CO)C(O)=O UISQLSIBJKEJSS-GUBZILKMSA-N 0.000 description 6
- OTCJMMRQBVDQRK-DCAQKATOSA-N Arg-Asp-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O OTCJMMRQBVDQRK-DCAQKATOSA-N 0.000 description 6
- GNYUVVJYGJFKHN-RVMXOQNASA-N Arg-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N GNYUVVJYGJFKHN-RVMXOQNASA-N 0.000 description 6
- UBEKKPOFLCVTEZ-UHFFFAOYSA-N Arg-Lys-Val-Ser Chemical compound OCC(C(O)=O)NC(=O)C(C(C)C)NC(=O)C(CCCCN)NC(=O)C(N)CCCN=C(N)N UBEKKPOFLCVTEZ-UHFFFAOYSA-N 0.000 description 6
- NCFJQJRLQJEECD-NHCYSSNCSA-N Asn-Leu-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O NCFJQJRLQJEECD-NHCYSSNCSA-N 0.000 description 6
- MLJZMGIXXMTEPO-UBHSHLNASA-N Asn-Trp-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CO)C(O)=O MLJZMGIXXMTEPO-UBHSHLNASA-N 0.000 description 6
- XBQSLMACWDXWLJ-GHCJXIJMSA-N Asp-Ala-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O XBQSLMACWDXWLJ-GHCJXIJMSA-N 0.000 description 6
- IAZDPXIOMUYVGZ-UHFFFAOYSA-N Dimethylsulphoxide Chemical compound CS(C)=O IAZDPXIOMUYVGZ-UHFFFAOYSA-N 0.000 description 6
- MLILEEIVMRUYBX-NHCYSSNCSA-N Glu-Val-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O MLILEEIVMRUYBX-NHCYSSNCSA-N 0.000 description 6
- PUUYVMYCMIWHFE-BQBZGAKWSA-N Gly-Ala-Arg Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N PUUYVMYCMIWHFE-BQBZGAKWSA-N 0.000 description 6
- GNBMOZPQUXTCRW-STQMWFEESA-N Gly-Asn-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CC(N)=O)NC(=O)CN)C(O)=O)=CNC2=C1 GNBMOZPQUXTCRW-STQMWFEESA-N 0.000 description 6
- TVDHVLGFJSHPAX-UWVGGRQHSA-N Gly-His-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CN=CN1 TVDHVLGFJSHPAX-UWVGGRQHSA-N 0.000 description 6
- IMRNSEPSPFQNHF-STQMWFEESA-N Gly-Ser-Trp Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CNC2=CC=CC=C12)C(=O)O IMRNSEPSPFQNHF-STQMWFEESA-N 0.000 description 6
- PYNPBMCLAKTHJL-SRVKXCTJSA-N His-Pro-Glu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O PYNPBMCLAKTHJL-SRVKXCTJSA-N 0.000 description 6
- CDGLBYSAZFIIJO-RCOVLWMOSA-N Ile-Gly-Gly Chemical compound CC[C@H](C)[C@H]([NH3+])C(=O)NCC(=O)NCC([O-])=O CDGLBYSAZFIIJO-RCOVLWMOSA-N 0.000 description 6
- XDVKZSJODLMNLJ-GGQYPGDFSA-N Ile-Trp-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CC=3C4=CC=CC=C4NC=3)NC(=O)[C@@H](N)[C@@H](C)CC)C(O)=O)=CNC2=C1 XDVKZSJODLMNLJ-GGQYPGDFSA-N 0.000 description 6
- 102100034343 Integrase Human genes 0.000 description 6
- FADYJNXDPBKVCA-UHFFFAOYSA-N L-Phenylalanyl-L-lysin Natural products NCCCCC(C(O)=O)NC(=O)C(N)CC1=CC=CC=C1 FADYJNXDPBKVCA-UHFFFAOYSA-N 0.000 description 6
- SUPVSFFZWVOEOI-UHFFFAOYSA-N Leu-Ala-Tyr Natural products CC(C)CC(N)C(=O)NC(C)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 SUPVSFFZWVOEOI-UHFFFAOYSA-N 0.000 description 6
- KUEVMUXNILMJTK-JYJNAYRXSA-N Leu-Gln-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 KUEVMUXNILMJTK-JYJNAYRXSA-N 0.000 description 6
- YWKNKRAKOCLOLH-OEAJRASXSA-N Leu-Phe-Thr Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H]([C@@H](C)O)C(O)=O)CC1=CC=CC=C1 YWKNKRAKOCLOLH-OEAJRASXSA-N 0.000 description 6
- JDBQSGMJBMPNFT-AVGNSLFASA-N Leu-Pro-Val Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O JDBQSGMJBMPNFT-AVGNSLFASA-N 0.000 description 6
- WFCKERTZVCQXKH-KBPBESRZSA-N Leu-Tyr-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(O)=O WFCKERTZVCQXKH-KBPBESRZSA-N 0.000 description 6
- PWPBGAJJYJJVPI-PJODQICGSA-N Met-Ala-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](C)NC(=O)[C@@H](N)CCSC)C(O)=O)=CNC2=C1 PWPBGAJJYJJVPI-PJODQICGSA-N 0.000 description 6
- FTQOFRPGLYXRFM-CYDGBPFRSA-N Met-Ile-Met Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CCSC)N FTQOFRPGLYXRFM-CYDGBPFRSA-N 0.000 description 6
- VWWGEKCAPBMIFE-SRVKXCTJSA-N Met-Met-Met Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCSC)C(O)=O VWWGEKCAPBMIFE-SRVKXCTJSA-N 0.000 description 6
- BYAIIACBWBOJCU-URLPEUOOSA-N Phe-Ile-Thr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O BYAIIACBWBOJCU-URLPEUOOSA-N 0.000 description 6
- QSWKNJAPHQDAAS-MELADBBJSA-N Phe-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CC2=CC=CC=C2)N)C(=O)O QSWKNJAPHQDAAS-MELADBBJSA-N 0.000 description 6
- LHALYDBUDCWMDY-CIUDSAMLSA-N Pro-Glu-Ala Chemical compound C[C@H](NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H]1CCCN1)C(O)=O LHALYDBUDCWMDY-CIUDSAMLSA-N 0.000 description 6
- NMELOOXSGDRBRU-YUMQZZPRSA-N Pro-Glu-Gly Chemical compound OC(=O)CNC(=O)[C@H](CCC(=O)O)NC(=O)[C@@H]1CCCN1 NMELOOXSGDRBRU-YUMQZZPRSA-N 0.000 description 6
- HATVCTYBNCNMAA-AVGNSLFASA-N Pro-Leu-Met Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(O)=O HATVCTYBNCNMAA-AVGNSLFASA-N 0.000 description 6
- KIDXAAQVMNLJFQ-KZVJFYERSA-N Pro-Thr-Ala Chemical compound C[C@@H](O)[C@H](NC(=O)[C@@H]1CCCN1)C(=O)N[C@@H](C)C(O)=O KIDXAAQVMNLJFQ-KZVJFYERSA-N 0.000 description 6
- 108010092799 RNA-directed DNA polymerase Proteins 0.000 description 6
- RNFKSBPHLTZHLU-WHFBIAKZSA-N Ser-Cys-Gly Chemical compound C([C@@H](C(=O)N[C@@H](CS)C(=O)NCC(=O)O)N)O RNFKSBPHLTZHLU-WHFBIAKZSA-N 0.000 description 6
- GZFAWAQTEYDKII-YUMQZZPRSA-N Ser-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CO GZFAWAQTEYDKII-YUMQZZPRSA-N 0.000 description 6
- OLKICIBQRVSQMA-SRVKXCTJSA-N Ser-Ser-Tyr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O OLKICIBQRVSQMA-SRVKXCTJSA-N 0.000 description 6
- FAPWRFPIFSIZLT-UHFFFAOYSA-M Sodium chloride Chemical compound [Na+].[Cl-] FAPWRFPIFSIZLT-UHFFFAOYSA-M 0.000 description 6
- 108091027544 Subgenomic mRNA Proteins 0.000 description 6
- QWMPARMKIDVBLV-VZFHVOOUSA-N Thr-Cys-Ala Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@@H](C)C(O)=O QWMPARMKIDVBLV-VZFHVOOUSA-N 0.000 description 6
- AXWBYOVVDRBOGU-SIUGBPQLSA-N Tyr-Ile-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N AXWBYOVVDRBOGU-SIUGBPQLSA-N 0.000 description 6
- KLOZTPOXVVRVAQ-DZKIICNBSA-N Tyr-Val-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 KLOZTPOXVVRVAQ-DZKIICNBSA-N 0.000 description 6
- ASQFIHTXXMFENG-XPUUQOCRSA-N Val-Ala-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O ASQFIHTXXMFENG-XPUUQOCRSA-N 0.000 description 6
- IRLYZKKNBFPQBW-XGEHTFHBSA-N Val-Cys-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](C(C)C)N)O IRLYZKKNBFPQBW-XGEHTFHBSA-N 0.000 description 6
- SYSWVVCYSXBVJG-RHYQMDGZSA-N Val-Leu-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](C(C)C)N)O SYSWVVCYSXBVJG-RHYQMDGZSA-N 0.000 description 6
- KOSRFJWDECSPRO-UHFFFAOYSA-N alpha-L-glutamyl-L-glutamic acid Natural products OC(=O)CCC(N)C(=O)NC(CCC(O)=O)C(O)=O KOSRFJWDECSPRO-UHFFFAOYSA-N 0.000 description 6
- 108010060199 cysteinylproline Proteins 0.000 description 6
- 238000001962 electrophoresis Methods 0.000 description 6
- 108010051307 glycyl-glycyl-proline Proteins 0.000 description 6
- 238000001727 in vivo Methods 0.000 description 6
- 108010064486 phenylalanyl-leucyl-valine Proteins 0.000 description 6
- 208000024891 symptom Diseases 0.000 description 6
- 229940126585 therapeutic drug Drugs 0.000 description 6
- 108010036387 trimethionine Proteins 0.000 description 6
- PKOHVHWNGUHYRE-ZFWWWQNUSA-N (2s)-1-[2-[[(2s)-2-amino-3-(1h-indol-3-yl)propanoyl]amino]acetyl]pyrrolidine-2-carboxylic acid Chemical compound O=C([C@H](CC=1C2=CC=CC=C2NC=1)N)NCC(=O)N1CCC[C@H]1C(O)=O PKOHVHWNGUHYRE-ZFWWWQNUSA-N 0.000 description 5
- QKNYBSVHEMOAJP-UHFFFAOYSA-N 2-amino-2-(hydroxymethyl)propane-1,3-diol;hydron;chloride Chemical compound Cl.OCC(N)(CO)CO QKNYBSVHEMOAJP-UHFFFAOYSA-N 0.000 description 5
- BUDNAJYVCUHLSV-ZLUOBGJFSA-N Ala-Asp-Ser Chemical compound C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O BUDNAJYVCUHLSV-ZLUOBGJFSA-N 0.000 description 5
- CXZFXHGJJPVUJE-CIUDSAMLSA-N Ala-Cys-Leu Chemical compound C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(=O)O)N CXZFXHGJJPVUJE-CIUDSAMLSA-N 0.000 description 5
- XYKDZXKKYOOTGC-FXQIFTODSA-N Ala-Cys-Met Chemical compound C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CCSC)C(=O)O)N XYKDZXKKYOOTGC-FXQIFTODSA-N 0.000 description 5
- FUSPCLTUKXQREV-ACZMJKKPSA-N Ala-Glu-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O FUSPCLTUKXQREV-ACZMJKKPSA-N 0.000 description 5
- SMCGQGDVTPFXKB-XPUUQOCRSA-N Ala-Gly-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@H](C)N SMCGQGDVTPFXKB-XPUUQOCRSA-N 0.000 description 5
- JDIQCVUDDFENPU-ZKWXMUAHSA-N Ala-His-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CNC=N1 JDIQCVUDDFENPU-ZKWXMUAHSA-N 0.000 description 5
- OKEWAFFWMHBGPT-XPUUQOCRSA-N Ala-His-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CN=CN1 OKEWAFFWMHBGPT-XPUUQOCRSA-N 0.000 description 5
- KMGOBAQSCKTBGD-DLOVCJGASA-N Ala-His-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](C)N)CC1=CN=CN1 KMGOBAQSCKTBGD-DLOVCJGASA-N 0.000 description 5
- QDGMZAOSMNGBLP-MRFFXTKBSA-N Ala-Trp-Tyr Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CC3=CC=C(C=C3)O)C(=O)O)N QDGMZAOSMNGBLP-MRFFXTKBSA-N 0.000 description 5
- CYXCAHZVPFREJD-LURJTMIESA-N Arg-Gly-Gly Chemical compound NC(=N)NCCC[C@H](N)C(=O)NCC(=O)NCC(O)=O CYXCAHZVPFREJD-LURJTMIESA-N 0.000 description 5
- DIIGDGJKTMLQQW-IHRRRGAJSA-N Arg-Lys-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CCCN=C(N)N)N DIIGDGJKTMLQQW-IHRRRGAJSA-N 0.000 description 5
- ASQKVGRCKOFKIU-KZVJFYERSA-N Arg-Thr-Ala Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N)O ASQKVGRCKOFKIU-KZVJFYERSA-N 0.000 description 5
- JZLFYAAGGYMRIK-BYULHYEWSA-N Asn-Val-Asp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O JZLFYAAGGYMRIK-BYULHYEWSA-N 0.000 description 5
- PQKSVQSMTHPRIB-ZKWXMUAHSA-N Asn-Val-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O PQKSVQSMTHPRIB-ZKWXMUAHSA-N 0.000 description 5
- FAEIQWHBRBWUBN-FXQIFTODSA-N Asp-Arg-Ser Chemical compound C(C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC(=O)O)N)CN=C(N)N FAEIQWHBRBWUBN-FXQIFTODSA-N 0.000 description 5
- TVYMKYUSZSVOAG-ZLUOBGJFSA-N Cys-Ala-Ala Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O TVYMKYUSZSVOAG-ZLUOBGJFSA-N 0.000 description 5
- XTHUKRLJRUVVBF-WHFBIAKZSA-N Cys-Gly-Ser Chemical compound SC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O XTHUKRLJRUVVBF-WHFBIAKZSA-N 0.000 description 5
- DQUWSUWXPWGTQT-DCAQKATOSA-N Cys-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CS DQUWSUWXPWGTQT-DCAQKATOSA-N 0.000 description 5
- HJXSYJVCMUOUNY-SRVKXCTJSA-N Cys-Ser-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CS)N HJXSYJVCMUOUNY-SRVKXCTJSA-N 0.000 description 5
- KARBMKZDLYMMOW-JYBASQMISA-N Cys-Trp-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@H](CS)N)O KARBMKZDLYMMOW-JYBASQMISA-N 0.000 description 5
- FNXOZWPPOJRBRE-XGEHTFHBSA-N Cys-Val-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CS)N)O FNXOZWPPOJRBRE-XGEHTFHBSA-N 0.000 description 5
- KCXVZYZYPLLWCC-UHFFFAOYSA-N EDTA Chemical compound OC(=O)CN(CC(O)=O)CCN(CC(O)=O)CC(O)=O KCXVZYZYPLLWCC-UHFFFAOYSA-N 0.000 description 5
- 241001200922 Gagata Species 0.000 description 5
- RZSLYUUFFVHFRQ-FXQIFTODSA-N Gln-Ala-Glu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O RZSLYUUFFVHFRQ-FXQIFTODSA-N 0.000 description 5
- FBEJIDRSQCGFJI-GUBZILKMSA-N Glu-Leu-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O FBEJIDRSQCGFJI-GUBZILKMSA-N 0.000 description 5
- ALOBJFDJTMQQPW-ONGXEEELSA-N Gly-His-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)CN ALOBJFDJTMQQPW-ONGXEEELSA-N 0.000 description 5
- LHYJCVCQPWRMKZ-WEDXCCLWSA-N Gly-Leu-Thr Chemical compound [H]NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LHYJCVCQPWRMKZ-WEDXCCLWSA-N 0.000 description 5
- GLACUWHUYFBSPJ-FJXKBIBVSA-N Gly-Pro-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)CN GLACUWHUYFBSPJ-FJXKBIBVSA-N 0.000 description 5
- PYFHPYDQHCEVIT-KBPBESRZSA-N Gly-Trp-Gln Chemical compound [H]NCC(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCC(N)=O)C(O)=O PYFHPYDQHCEVIT-KBPBESRZSA-N 0.000 description 5
- SYOJVRNQCXYEOV-XVKPBYJWSA-N Gly-Val-Glu Chemical compound [H]NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O SYOJVRNQCXYEOV-XVKPBYJWSA-N 0.000 description 5
- SYMSVYVUSPSAAO-IHRRRGAJSA-N His-Arg-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O SYMSVYVUSPSAAO-IHRRRGAJSA-N 0.000 description 5
- VYUXYMRNGALHEA-DLOVCJGASA-N His-Leu-Ala Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O VYUXYMRNGALHEA-DLOVCJGASA-N 0.000 description 5
- UROVZOUMHNXPLZ-AVGNSLFASA-N His-Leu-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CN=CN1 UROVZOUMHNXPLZ-AVGNSLFASA-N 0.000 description 5
- PMMMQRVUMVURGJ-XUXIUFHCSA-N Ile-Leu-Pro Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(O)=O PMMMQRVUMVURGJ-XUXIUFHCSA-N 0.000 description 5
- NAFIFZNBSPWYOO-RWRJDSDZSA-N Ile-Thr-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N NAFIFZNBSPWYOO-RWRJDSDZSA-N 0.000 description 5
- DGTOKVBDZXJHNZ-WZLNRYEVSA-N Ile-Thr-Tyr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N DGTOKVBDZXJHNZ-WZLNRYEVSA-N 0.000 description 5
- BCISUQVFDGYZBO-QSFUFRPTSA-N Ile-Val-Asp Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC(O)=O BCISUQVFDGYZBO-QSFUFRPTSA-N 0.000 description 5
- QLDHBYRUNQZIJQ-DKIMLUQUSA-N Leu-Ile-Phe Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O QLDHBYRUNQZIJQ-DKIMLUQUSA-N 0.000 description 5
- DSFYPIUSAMSERP-IHRRRGAJSA-N Leu-Leu-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N DSFYPIUSAMSERP-IHRRRGAJSA-N 0.000 description 5
- LVTJJOJKDCVZGP-QWRGUYRKSA-N Leu-Lys-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O LVTJJOJKDCVZGP-QWRGUYRKSA-N 0.000 description 5
- KIZIOFNVSOSKJI-CIUDSAMLSA-N Leu-Ser-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)O)N KIZIOFNVSOSKJI-CIUDSAMLSA-N 0.000 description 5
- PPGBXYKMUMHFBF-KATARQTJSA-N Leu-Ser-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PPGBXYKMUMHFBF-KATARQTJSA-N 0.000 description 5
- VDIARPPNADFEAV-WEDXCCLWSA-N Leu-Thr-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O VDIARPPNADFEAV-WEDXCCLWSA-N 0.000 description 5
- WDTLNWHPIPCMMP-AVGNSLFASA-N Met-Arg-Leu Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O WDTLNWHPIPCMMP-AVGNSLFASA-N 0.000 description 5
- PESQCPHRXOFIPX-UHFFFAOYSA-N N-L-methionyl-L-tyrosine Natural products CSCCC(N)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 PESQCPHRXOFIPX-UHFFFAOYSA-N 0.000 description 5
- 108091005804 Peptidases Proteins 0.000 description 5
- APKRGYLBSCWJJP-FXQIFTODSA-N Pro-Ala-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(O)=O APKRGYLBSCWJJP-FXQIFTODSA-N 0.000 description 5
- IFMDQWDAJUMMJC-DCAQKATOSA-N Pro-Ala-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O IFMDQWDAJUMMJC-DCAQKATOSA-N 0.000 description 5
- ICTZKEXYDDZZFP-SRVKXCTJSA-N Pro-Arg-Pro Chemical compound N([C@@H](CCCN=C(N)N)C(=O)N1[C@@H](CCC1)C(O)=O)C(=O)[C@@H]1CCCN1 ICTZKEXYDDZZFP-SRVKXCTJSA-N 0.000 description 5
- 108010079005 RDV peptide Proteins 0.000 description 5
- SVWQEIRZHHNBIO-WHFBIAKZSA-N Ser-Gly-Cys Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CS)C(O)=O SVWQEIRZHHNBIO-WHFBIAKZSA-N 0.000 description 5
- UPLYXVPQLJVWMM-KKUMJFAQSA-N Ser-Phe-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O UPLYXVPQLJVWMM-KKUMJFAQSA-N 0.000 description 5
- PURRNJBBXDDWLX-ZDLURKLDSA-N Ser-Thr-Gly Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CO)N)O PURRNJBBXDDWLX-ZDLURKLDSA-N 0.000 description 5
- DGDCHPCRMWEOJR-FQPOAREZSA-N Thr-Ala-Tyr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 DGDCHPCRMWEOJR-FQPOAREZSA-N 0.000 description 5
- KWQBJOUOSNJDRR-XAVMHZPKSA-N Thr-Cys-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CS)C(=O)N1CCC[C@@H]1C(=O)O)N)O KWQBJOUOSNJDRR-XAVMHZPKSA-N 0.000 description 5
- ZTPXSEUVYNNZRB-CDMKHQONSA-N Thr-Gly-Phe Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O ZTPXSEUVYNNZRB-CDMKHQONSA-N 0.000 description 5
- YOOAQCZYZHGUAZ-KATARQTJSA-N Thr-Leu-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O YOOAQCZYZHGUAZ-KATARQTJSA-N 0.000 description 5
- KVEWWQRTAVMOFT-KJEVXHAQSA-N Thr-Tyr-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(O)=O KVEWWQRTAVMOFT-KJEVXHAQSA-N 0.000 description 5
- YVXIAOOYAKBAAI-SZMVWBNQSA-N Trp-Leu-Gln Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O)=CNC2=C1 YVXIAOOYAKBAAI-SZMVWBNQSA-N 0.000 description 5
- ZPZNQAZHMCLTOA-PXDAIIFMSA-N Trp-Tyr-Ile Chemical compound C([C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(O)=O)NC(=O)[C@@H](N)CC=1C2=CC=CC=C2NC=1)C1=CC=C(O)C=C1 ZPZNQAZHMCLTOA-PXDAIIFMSA-N 0.000 description 5
- PAPWZOJOLKZEFR-AVGNSLFASA-N Val-Arg-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCCN)C(=O)O)N PAPWZOJOLKZEFR-AVGNSLFASA-N 0.000 description 5
- QGFPYRPIUXBYGR-YDHLFZDLSA-N Val-Asn-Phe Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N QGFPYRPIUXBYGR-YDHLFZDLSA-N 0.000 description 5
- AAOPYWQQBXHINJ-DZKIICNBSA-N Val-Gln-Tyr Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N AAOPYWQQBXHINJ-DZKIICNBSA-N 0.000 description 5
- VVZDBPBZHLQPPB-XVKPBYJWSA-N Val-Glu-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O VVZDBPBZHLQPPB-XVKPBYJWSA-N 0.000 description 5
- PMDOQZFYGWZSTK-LSJOCFKGSA-N Val-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)C(C)C PMDOQZFYGWZSTK-LSJOCFKGSA-N 0.000 description 5
- LAYSXAOGWHKNED-XPUUQOCRSA-N Val-Gly-Ser Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O LAYSXAOGWHKNED-XPUUQOCRSA-N 0.000 description 5
- DOFAQXCYFQKSHT-SRVKXCTJSA-N Val-Pro-Pro Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 DOFAQXCYFQKSHT-SRVKXCTJSA-N 0.000 description 5
- WFTKOJGOOUJLJV-VKOGCVSHSA-N Val-Trp-Ile Chemical compound C1=CC=C2C(C[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C([O-])=O)NC(=O)[C@@H]([NH3+])C(C)C)=CNC2=C1 WFTKOJGOOUJLJV-VKOGCVSHSA-N 0.000 description 5
- AEFJNECXZCODJM-UWVGGRQHSA-N Val-Val-Gly Chemical compound CC(C)[C@H]([NH3+])C(=O)N[C@@H](C(C)C)C(=O)NCC([O-])=O AEFJNECXZCODJM-UWVGGRQHSA-N 0.000 description 5
- 108010068380 arginylarginine Proteins 0.000 description 5
- 238000003491 array Methods 0.000 description 5
- 108010047857 aspartylglycine Proteins 0.000 description 5
- 238000003776 cleavage reaction Methods 0.000 description 5
- 108010027668 glycyl-alanyl-valine Proteins 0.000 description 5
- 108010082286 glycyl-seryl-alanine Proteins 0.000 description 5
- 108010089804 glycyl-threonine Proteins 0.000 description 5
- 208000010710 hepatitis C virus infection Diseases 0.000 description 5
- 108010034529 leucyl-lysine Proteins 0.000 description 5
- 108010003700 lysyl aspartic acid Proteins 0.000 description 5
- 108010025488 pinealon Proteins 0.000 description 5
- 238000000746 purification Methods 0.000 description 5
- 230000007017 scission Effects 0.000 description 5
- 239000008223 sterile water Substances 0.000 description 5
- 108010038745 tryptophylglycine Proteins 0.000 description 5
- 108010020532 tyrosyl-proline Proteins 0.000 description 5
- 108010003137 tyrosyltyrosine Proteins 0.000 description 5
- UGLPMYSCWHTZQU-AUTRQRHGSA-N Ala-Ala-Tyr Chemical compound C[C@H]([NH3+])C(=O)N[C@@H](C)C(=O)N[C@H](C([O-])=O)CC1=CC=C(O)C=C1 UGLPMYSCWHTZQU-AUTRQRHGSA-N 0.000 description 4
- IMMKUCQIKKXKNP-DCAQKATOSA-N Ala-Arg-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](C)N)CCCN=C(N)N IMMKUCQIKKXKNP-DCAQKATOSA-N 0.000 description 4
- ZVFVBBGVOILKPO-WHFBIAKZSA-N Ala-Gly-Ala Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@@H](C)C(O)=O ZVFVBBGVOILKPO-WHFBIAKZSA-N 0.000 description 4
- VGPWRRFOPXVGOH-BYPYZUCNSA-N Ala-Gly-Gly Chemical compound C[C@H](N)C(=O)NCC(=O)NCC(O)=O VGPWRRFOPXVGOH-BYPYZUCNSA-N 0.000 description 4
- NJWJSLCQEDMGNC-MBLNEYKQSA-N Ala-His-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](C)N)O NJWJSLCQEDMGNC-MBLNEYKQSA-N 0.000 description 4
- NOGFDULFCFXBHB-CIUDSAMLSA-N Ala-Leu-Cys Chemical compound C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CS)C(=O)O)N NOGFDULFCFXBHB-CIUDSAMLSA-N 0.000 description 4
- VHVVPYOJIIQCKS-QEJZJMRPSA-N Ala-Leu-Phe Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 VHVVPYOJIIQCKS-QEJZJMRPSA-N 0.000 description 4
- QUIGLPSHIFPEOV-CIUDSAMLSA-N Ala-Lys-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O QUIGLPSHIFPEOV-CIUDSAMLSA-N 0.000 description 4
- ZBLQIYPCUWZSRZ-QEJZJMRPSA-N Ala-Phe-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](C)N)CC1=CC=CC=C1 ZBLQIYPCUWZSRZ-QEJZJMRPSA-N 0.000 description 4
- DXTYEWAQOXYRHZ-KKXDTOCCSA-N Ala-Phe-Tyr Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O)N DXTYEWAQOXYRHZ-KKXDTOCCSA-N 0.000 description 4
- WQLDNOCHHRISMS-NAKRPEOUSA-N Ala-Pro-Ile Chemical compound [H]N[C@@H](C)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WQLDNOCHHRISMS-NAKRPEOUSA-N 0.000 description 4
- ARHJJAAWNWOACN-FXQIFTODSA-N Ala-Ser-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O ARHJJAAWNWOACN-FXQIFTODSA-N 0.000 description 4
- BOKLLPVAQDSLHC-FXQIFTODSA-N Ala-Val-Cys Chemical compound C[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CS)C(=O)O)N BOKLLPVAQDSLHC-FXQIFTODSA-N 0.000 description 4
- RWCLSUOSKWTXLA-FXQIFTODSA-N Arg-Asp-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O RWCLSUOSKWTXLA-FXQIFTODSA-N 0.000 description 4
- YUGFLWBWAJFGKY-BQBZGAKWSA-N Arg-Cys-Gly Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CS)C(=O)NCC(O)=O YUGFLWBWAJFGKY-BQBZGAKWSA-N 0.000 description 4
- 108010010777 Arg-Gly-Asp-Gly Proteins 0.000 description 4
- AZHXYLJRGVMQKW-UMPQAUOISA-N Arg-Trp-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@H](CCCN=C(N)N)N)O AZHXYLJRGVMQKW-UMPQAUOISA-N 0.000 description 4
- PJOPLXOCKACMLK-KKUMJFAQSA-N Arg-Tyr-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O PJOPLXOCKACMLK-KKUMJFAQSA-N 0.000 description 4
- YNSCBOUZTAGIGO-ZLUOBGJFSA-N Asn-Asn-Cys Chemical compound C([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CS)C(=O)O)N)C(=O)N YNSCBOUZTAGIGO-ZLUOBGJFSA-N 0.000 description 4
- VYLVOMUVLMGCRF-ZLUOBGJFSA-N Asn-Asp-Ser Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O VYLVOMUVLMGCRF-ZLUOBGJFSA-N 0.000 description 4
- VJTWLBMESLDOMK-WDSKDSINSA-N Asn-Gln-Gly Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O VJTWLBMESLDOMK-WDSKDSINSA-N 0.000 description 4
- SEKBHZJLARBNPB-GHCJXIJMSA-N Asn-Ile-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O SEKBHZJLARBNPB-GHCJXIJMSA-N 0.000 description 4
- ZRAOLTNMSCSCLN-ZLUOBGJFSA-N Asp-Cys-Asn Chemical compound C([C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(=O)N)C(=O)O)N)C(=O)O ZRAOLTNMSCSCLN-ZLUOBGJFSA-N 0.000 description 4
- 208000006154 Chronic hepatitis C Diseases 0.000 description 4
- NQSUTVRXXBGVDQ-LKXGYXEUSA-N Cys-Asn-Thr Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O NQSUTVRXXBGVDQ-LKXGYXEUSA-N 0.000 description 4
- ZWNFOZNJYNDNGM-UBHSHLNASA-N Cys-Asn-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CS)N ZWNFOZNJYNDNGM-UBHSHLNASA-N 0.000 description 4
- GGRDJANMZPGMNS-CIUDSAMLSA-N Cys-Ser-Leu Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O GGRDJANMZPGMNS-CIUDSAMLSA-N 0.000 description 4
- HHWQMFIGMMOVFK-WDSKDSINSA-N Gln-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(N)=O HHWQMFIGMMOVFK-WDSKDSINSA-N 0.000 description 4
- JSYULGSPLTZDHM-NRPADANISA-N Gln-Ala-Val Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O JSYULGSPLTZDHM-NRPADANISA-N 0.000 description 4
- PRBLYKYHAJEABA-SRVKXCTJSA-N Gln-Arg-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O PRBLYKYHAJEABA-SRVKXCTJSA-N 0.000 description 4
- NXPXQIZKDOXIHH-JSGCOSHPSA-N Gln-Gly-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CCC(=O)N)N NXPXQIZKDOXIHH-JSGCOSHPSA-N 0.000 description 4
- FTIJVMLAGRAYMJ-MNXVOIDGSA-N Gln-Ile-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCC(N)=O FTIJVMLAGRAYMJ-MNXVOIDGSA-N 0.000 description 4
- ININBLZFFVOQIO-JHEQGTHGSA-N Gln-Thr-Gly Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CCC(=O)N)N)O ININBLZFFVOQIO-JHEQGTHGSA-N 0.000 description 4
- OACPJRQRAHMQEQ-NHCYSSNCSA-N Gln-Val-Arg Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O OACPJRQRAHMQEQ-NHCYSSNCSA-N 0.000 description 4
- GLWXKFRTOHKGIT-ACZMJKKPSA-N Glu-Asn-Asn Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O GLWXKFRTOHKGIT-ACZMJKKPSA-N 0.000 description 4
- OGNJZUXUTPQVBR-BQBZGAKWSA-N Glu-Gly-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O OGNJZUXUTPQVBR-BQBZGAKWSA-N 0.000 description 4
- ZSWGJYOZWBHROQ-RWRJDSDZSA-N Glu-Ile-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ZSWGJYOZWBHROQ-RWRJDSDZSA-N 0.000 description 4
- GJBUAAAIZSRCDC-GVXVVHGQSA-N Glu-Leu-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O GJBUAAAIZSRCDC-GVXVVHGQSA-N 0.000 description 4
- JBRBACJPBZNFMF-YUMQZZPRSA-N Gly-Ala-Lys Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCCN JBRBACJPBZNFMF-YUMQZZPRSA-N 0.000 description 4
- CEXINUGNTZFNRY-BYPYZUCNSA-N Gly-Cys-Gly Chemical compound [NH3+]CC(=O)N[C@@H](CS)C(=O)NCC([O-])=O CEXINUGNTZFNRY-BYPYZUCNSA-N 0.000 description 4
- BUEFQXUHTUZXHR-LURJTMIESA-N Gly-Gly-Pro zwitterion Chemical compound NCC(=O)NCC(=O)N1CCC[C@H]1C(O)=O BUEFQXUHTUZXHR-LURJTMIESA-N 0.000 description 4
- XVYKMNXXJXQKME-XEGUGMAKSA-N Gly-Ile-Tyr Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 XVYKMNXXJXQKME-XEGUGMAKSA-N 0.000 description 4
- HFPVRZWORNJRRC-UWVGGRQHSA-N Gly-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)CN HFPVRZWORNJRRC-UWVGGRQHSA-N 0.000 description 4
- DNVDEMWIYLVIQU-RCOVLWMOSA-N Gly-Val-Asp Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O DNVDEMWIYLVIQU-RCOVLWMOSA-N 0.000 description 4
- BNMRSWQOHIQTFL-JSGCOSHPSA-N Gly-Val-Phe Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 BNMRSWQOHIQTFL-JSGCOSHPSA-N 0.000 description 4
- RVKIPWVMZANZLI-UHFFFAOYSA-N H-Lys-Trp-OH Natural products C1=CC=C2C(CC(NC(=O)C(N)CCCCN)C(O)=O)=CNC2=C1 RVKIPWVMZANZLI-UHFFFAOYSA-N 0.000 description 4
- 206010019755 Hepatitis chronic active Diseases 0.000 description 4
- KZTLOHBDLMIFSH-XVYDVKMFSA-N His-Ala-Asp Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(O)=O KZTLOHBDLMIFSH-XVYDVKMFSA-N 0.000 description 4
- PGTISAJTWZPFGN-PEXQALLHSA-N His-Gly-Ile Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)NCC(=O)N[C@@H]([C@@H](C)CC)C(O)=O PGTISAJTWZPFGN-PEXQALLHSA-N 0.000 description 4
- VJJSDSNFXCWCEJ-DJFWLOJKSA-N His-Ile-Asn Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O VJJSDSNFXCWCEJ-DJFWLOJKSA-N 0.000 description 4
- DLTCGJZBNFOWFL-LKTVYLICSA-N His-Tyr-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CC2=CN=CN2)N DLTCGJZBNFOWFL-LKTVYLICSA-N 0.000 description 4
- XGBVLRJLHUVCNK-DCAQKATOSA-N His-Val-Ser Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O XGBVLRJLHUVCNK-DCAQKATOSA-N 0.000 description 4
- AQCUAZTZSPQJFF-ZKWXMUAHSA-N Ile-Ala-Gly Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O AQCUAZTZSPQJFF-ZKWXMUAHSA-N 0.000 description 4
- GVKKVHNRTUFCCE-BJDJZHNGSA-N Ile-Leu-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)O)N GVKKVHNRTUFCCE-BJDJZHNGSA-N 0.000 description 4
- PZWBBXHHUSIGKH-OSUNSFLBSA-N Ile-Thr-Arg Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N PZWBBXHHUSIGKH-OSUNSFLBSA-N 0.000 description 4
- SITWEMZOJNKJCH-UHFFFAOYSA-N L-alanine-L-arginine Natural products CC(N)C(=O)NC(C(O)=O)CCCNC(N)=N SITWEMZOJNKJCH-UHFFFAOYSA-N 0.000 description 4
- XBBKIIGCUMBKCO-JXUBOQSCSA-N Leu-Ala-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XBBKIIGCUMBKCO-JXUBOQSCSA-N 0.000 description 4
- REPPKAMYTOJTFC-DCAQKATOSA-N Leu-Arg-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O REPPKAMYTOJTFC-DCAQKATOSA-N 0.000 description 4
- WUFYAPWIHCUMLL-CIUDSAMLSA-N Leu-Asn-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(O)=O WUFYAPWIHCUMLL-CIUDSAMLSA-N 0.000 description 4
- TWQIYNGNYNJUFM-NHCYSSNCSA-N Leu-Asn-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O TWQIYNGNYNJUFM-NHCYSSNCSA-N 0.000 description 4
- QJUWBDPGGYVRHY-YUMQZZPRSA-N Leu-Gly-Cys Chemical compound CC(C)C[C@@H](C(=O)NCC(=O)N[C@@H](CS)C(=O)O)N QJUWBDPGGYVRHY-YUMQZZPRSA-N 0.000 description 4
- BKTXKJMNTSMJDQ-AVGNSLFASA-N Leu-His-Gln Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N BKTXKJMNTSMJDQ-AVGNSLFASA-N 0.000 description 4
- DBSLVQBXKVKDKJ-BJDJZHNGSA-N Leu-Ile-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O DBSLVQBXKVKDKJ-BJDJZHNGSA-N 0.000 description 4
- PKKMDPNFGULLNQ-AVGNSLFASA-N Leu-Met-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O PKKMDPNFGULLNQ-AVGNSLFASA-N 0.000 description 4
- SBANPBVRHYIMRR-GARJFASQSA-N Leu-Ser-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CO)C(=O)N1CCC[C@@H]1C(=O)O)N SBANPBVRHYIMRR-GARJFASQSA-N 0.000 description 4
- URJUVJDTPXCQFL-IHPCNDPISA-N Leu-Trp-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CC3=CN=CN3)C(=O)O)N URJUVJDTPXCQFL-IHPCNDPISA-N 0.000 description 4
- OZTZJMUZVAVJGY-BZSNNMDCSA-N Leu-Tyr-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N OZTZJMUZVAVJGY-BZSNNMDCSA-N 0.000 description 4
- XOEDPXDZJHBQIX-ULQDDVLXSA-N Leu-Val-Phe Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 XOEDPXDZJHBQIX-ULQDDVLXSA-N 0.000 description 4
- VKVDRTGWLVZJOM-DCAQKATOSA-N Leu-Val-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O VKVDRTGWLVZJOM-DCAQKATOSA-N 0.000 description 4
- GQZMPWBZQALKJO-UWVGGRQHSA-N Lys-Gly-Arg Chemical compound [H]N[C@@H](CCCCN)C(=O)NCC(=O)N[C@@H](CCCNC(N)=N)C(O)=O GQZMPWBZQALKJO-UWVGGRQHSA-N 0.000 description 4
- ZXFRGTAIIZHNHG-AJNGGQMLSA-N Lys-Ile-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)O)NC(=O)[C@H](CCCCN)N ZXFRGTAIIZHNHG-AJNGGQMLSA-N 0.000 description 4
- RPWTZTBIFGENIA-VOAKCMCISA-N Lys-Thr-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O RPWTZTBIFGENIA-VOAKCMCISA-N 0.000 description 4
- TWRXJAOTZQYOKJ-UHFFFAOYSA-L Magnesium chloride Chemical compound [Mg+2].[Cl-].[Cl-] TWRXJAOTZQYOKJ-UHFFFAOYSA-L 0.000 description 4
- WXHHTBVYQOSYSL-FXQIFTODSA-N Met-Ala-Ser Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O WXHHTBVYQOSYSL-FXQIFTODSA-N 0.000 description 4
- WUYLWZRHRLLEGB-AVGNSLFASA-N Met-Met-Leu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(O)=O WUYLWZRHRLLEGB-AVGNSLFASA-N 0.000 description 4
- NDJSSFWDYDUQID-YTWAJWBKSA-N Met-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCSC)N)O NDJSSFWDYDUQID-YTWAJWBKSA-N 0.000 description 4
- NBEFNGUZUOUGFG-KKUMJFAQSA-N Met-Tyr-Gln Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N NBEFNGUZUOUGFG-KKUMJFAQSA-N 0.000 description 4
- MUDYEFAKNSTFAI-JYJNAYRXSA-N Met-Tyr-Val Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(O)=O MUDYEFAKNSTFAI-JYJNAYRXSA-N 0.000 description 4
- QAVZUKIPOMBLMC-AVGNSLFASA-N Met-Val-Leu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC(C)C QAVZUKIPOMBLMC-AVGNSLFASA-N 0.000 description 4
- 241000713869 Moloney murine leukemia virus Species 0.000 description 4
- 108010087066 N2-tryptophyllysine Proteins 0.000 description 4
- 101800001014 Non-structural protein 5A Proteins 0.000 description 4
- UEHNWRNADDPYNK-DLOVCJGASA-N Phe-Cys-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CC1=CC=CC=C1)N UEHNWRNADDPYNK-DLOVCJGASA-N 0.000 description 4
- ZLGQEBCCANLYRA-RYUDHWBXSA-N Phe-Gly-Glu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O ZLGQEBCCANLYRA-RYUDHWBXSA-N 0.000 description 4
- MJQFZGOIVBDIMZ-WHOFXGATSA-N Phe-Ile-Gly Chemical compound N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(=O)O MJQFZGOIVBDIMZ-WHOFXGATSA-N 0.000 description 4
- ZUQACJLOHYRVPJ-DKIMLUQUSA-N Phe-Lys-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC1=CC=CC=C1 ZUQACJLOHYRVPJ-DKIMLUQUSA-N 0.000 description 4
- YFXXRYFWJFQAFW-JHYOHUSXSA-N Phe-Thr-Thr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N)O YFXXRYFWJFQAFW-JHYOHUSXSA-N 0.000 description 4
- WCUXLLCKKVVCTQ-UHFFFAOYSA-M Potassium chloride Chemical compound [Cl-].[K+] WCUXLLCKKVVCTQ-UHFFFAOYSA-M 0.000 description 4
- HQVPQXMCQKXARZ-FXQIFTODSA-N Pro-Cys-Ser Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CS)C(=O)N[C@@H](CO)C(=O)O HQVPQXMCQKXARZ-FXQIFTODSA-N 0.000 description 4
- PULPZRAHVFBVTO-DCAQKATOSA-N Pro-Glu-Arg Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O PULPZRAHVFBVTO-DCAQKATOSA-N 0.000 description 4
- NXEYSLRNNPWCRN-SRVKXCTJSA-N Pro-Glu-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O NXEYSLRNNPWCRN-SRVKXCTJSA-N 0.000 description 4
- XFFIGWGYMUFCCQ-ULQDDVLXSA-N Pro-His-Tyr Chemical compound C1=CC(O)=CC=C1C[C@@H](C([O-])=O)NC(=O)[C@@H](NC(=O)[C@H]1[NH2+]CCC1)CC1=CN=CN1 XFFIGWGYMUFCCQ-ULQDDVLXSA-N 0.000 description 4
- HWLKHNDRXWTFTN-GUBZILKMSA-N Pro-Pro-Cys Chemical compound C1C[C@H](NC1)C(=O)N2CCC[C@H]2C(=O)N[C@@H](CS)C(=O)O HWLKHNDRXWTFTN-GUBZILKMSA-N 0.000 description 4
- ZYJMLBCDFPIGNL-JYJNAYRXSA-N Pro-Tyr-Arg Chemical compound NC(=N)NCCC[C@H](NC(=O)[C@H](Cc1ccc(O)cc1)NC(=O)[C@@H]1CCCN1)C(O)=O ZYJMLBCDFPIGNL-JYJNAYRXSA-N 0.000 description 4
- BKOKTRCZXRIQPX-ZLUOBGJFSA-N Ser-Ala-Cys Chemical compound C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CO)N BKOKTRCZXRIQPX-ZLUOBGJFSA-N 0.000 description 4
- HBZBPFLJNDXRAY-FXQIFTODSA-N Ser-Ala-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O HBZBPFLJNDXRAY-FXQIFTODSA-N 0.000 description 4
- KDGARKCAKHBEDB-NKWVEPMBSA-N Ser-Gly-Pro Chemical compound C1C[C@@H](N(C1)C(=O)CNC(=O)[C@H](CO)N)C(=O)O KDGARKCAKHBEDB-NKWVEPMBSA-N 0.000 description 4
- ASGYVPAVFNDZMA-GUBZILKMSA-N Ser-Met-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](CO)N ASGYVPAVFNDZMA-GUBZILKMSA-N 0.000 description 4
- BDMWLJLPPUCLNV-XGEHTFHBSA-N Ser-Thr-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O BDMWLJLPPUCLNV-XGEHTFHBSA-N 0.000 description 4
- IGROJMCBGRFRGI-YTLHQDLWSA-N Thr-Ala-Ala Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O IGROJMCBGRFRGI-YTLHQDLWSA-N 0.000 description 4
- NLJKZUGAIIRWJN-LKXGYXEUSA-N Thr-Asp-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CS)C(=O)O)N)O NLJKZUGAIIRWJN-LKXGYXEUSA-N 0.000 description 4
- DSLHSTIUAPKERR-XGEHTFHBSA-N Thr-Cys-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CS)C(=O)N[C@@H](C(C)C)C(O)=O DSLHSTIUAPKERR-XGEHTFHBSA-N 0.000 description 4
- MPUMPERGHHJGRP-WEDXCCLWSA-N Thr-Gly-Lys Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)N[C@@H](CCCCN)C(=O)O)N)O MPUMPERGHHJGRP-WEDXCCLWSA-N 0.000 description 4
- UYTYTDMCDBPDSC-URLPEUOOSA-N Thr-Ile-Phe Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H]([C@@H](C)O)N UYTYTDMCDBPDSC-URLPEUOOSA-N 0.000 description 4
- JWQNAFHCXKVZKZ-UVOCVTCTSA-N Thr-Lys-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O JWQNAFHCXKVZKZ-UVOCVTCTSA-N 0.000 description 4
- KERCOYANYUPLHJ-XGEHTFHBSA-N Thr-Pro-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O KERCOYANYUPLHJ-XGEHTFHBSA-N 0.000 description 4
- YRJOLUDFVAUXLI-GSSVUCPTSA-N Thr-Thr-Asp Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(O)=O YRJOLUDFVAUXLI-GSSVUCPTSA-N 0.000 description 4
- LVRFMARKDGGZMX-IZPVPAKOSA-N Thr-Tyr-Thr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H]([C@@H](C)O)C(O)=O)CC1=CC=C(O)C=C1 LVRFMARKDGGZMX-IZPVPAKOSA-N 0.000 description 4
- QGVBFDIREUUSHX-IFFSRLJSSA-N Thr-Val-Gln Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O QGVBFDIREUUSHX-IFFSRLJSSA-N 0.000 description 4
- 108090000631 Trypsin Proteins 0.000 description 4
- 102000004142 Trypsin Human genes 0.000 description 4
- ADBDQGBDNUTRDB-ULQDDVLXSA-N Tyr-Arg-Leu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O ADBDQGBDNUTRDB-ULQDDVLXSA-N 0.000 description 4
- SZEIFUXUTBBQFQ-STQMWFEESA-N Tyr-Pro-Gly Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O SZEIFUXUTBBQFQ-STQMWFEESA-N 0.000 description 4
- QVYFTFIBKCDHIE-ACRUOGEOSA-N Tyr-Tyr-Lys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)N[C@@H](CCCCN)C(=O)O)N)O QVYFTFIBKCDHIE-ACRUOGEOSA-N 0.000 description 4
- NWEGIYMHTZXVBP-JSGCOSHPSA-N Tyr-Val-Gly Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O NWEGIYMHTZXVBP-JSGCOSHPSA-N 0.000 description 4
- HZWPGKAKGYJWCI-ULQDDVLXSA-N Tyr-Val-Leu Chemical compound CC(C)C[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)Cc1ccc(O)cc1)C(C)C)C(O)=O HZWPGKAKGYJWCI-ULQDDVLXSA-N 0.000 description 4
- COYSIHFOCOMGCF-WPRPVWTQSA-N Val-Arg-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CCCN=C(N)N COYSIHFOCOMGCF-WPRPVWTQSA-N 0.000 description 4
- VUTHNLMCXKLLFI-LAEOZQHASA-N Val-Asp-Gln Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N VUTHNLMCXKLLFI-LAEOZQHASA-N 0.000 description 4
- ZSZFTYVFQLUWBF-QXEWZRGKSA-N Val-Asp-Met Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCSC)C(=O)O)N ZSZFTYVFQLUWBF-QXEWZRGKSA-N 0.000 description 4
- HHSILIQTHXABKM-YDHLFZDLSA-N Val-Asp-Phe Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](Cc1ccccc1)C(O)=O HHSILIQTHXABKM-YDHLFZDLSA-N 0.000 description 4
- COSLEEOIYRPTHD-YDHLFZDLSA-N Val-Asp-Tyr Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 COSLEEOIYRPTHD-YDHLFZDLSA-N 0.000 description 4
- NXRAUQGGHPCJIB-RCOVLWMOSA-N Val-Gly-Asn Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O NXRAUQGGHPCJIB-RCOVLWMOSA-N 0.000 description 4
- HQYVQDRYODWONX-DCAQKATOSA-N Val-His-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CO)C(=O)O)N HQYVQDRYODWONX-DCAQKATOSA-N 0.000 description 4
- FEXILLGKGGTLRI-NHCYSSNCSA-N Val-Leu-Asn Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](C(C)C)N FEXILLGKGGTLRI-NHCYSSNCSA-N 0.000 description 4
- HGJRMXOWUWVUOA-GVXVVHGQSA-N Val-Leu-Gln Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](C(C)C)N HGJRMXOWUWVUOA-GVXVVHGQSA-N 0.000 description 4
- ZRSZTKTVPNSUNA-IHRRRGAJSA-N Val-Lys-Leu Chemical compound CC(C)C[C@H](NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)C(C)C)C(O)=O ZRSZTKTVPNSUNA-IHRRRGAJSA-N 0.000 description 4
- YLRAFVVWZRSZQC-DZKIICNBSA-N Val-Phe-Glu Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N YLRAFVVWZRSZQC-DZKIICNBSA-N 0.000 description 4
- HJSLDXZAZGFPDK-ULQDDVLXSA-N Val-Phe-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](C(C)C)N HJSLDXZAZGFPDK-ULQDDVLXSA-N 0.000 description 4
- MHHAWNPHDLCPLF-ULQDDVLXSA-N Val-Phe-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C(C)C)CC1=CC=CC=C1 MHHAWNPHDLCPLF-ULQDDVLXSA-N 0.000 description 4
- KISFXYYRKKNLOP-IHRRRGAJSA-N Val-Phe-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)O)N KISFXYYRKKNLOP-IHRRRGAJSA-N 0.000 description 4
- YKNOJPJWNVHORX-UNQGMJICSA-N Val-Phe-Thr Chemical compound CC(C)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H]([C@@H](C)O)C(O)=O)CC1=CC=CC=C1 YKNOJPJWNVHORX-UNQGMJICSA-N 0.000 description 4
- HWNYVQMOLCYHEA-IHRRRGAJSA-N Val-Ser-Tyr Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N HWNYVQMOLCYHEA-IHRRRGAJSA-N 0.000 description 4
- YQYFYUSYEDNLSD-YEPSODPASA-N Val-Thr-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O YQYFYUSYEDNLSD-YEPSODPASA-N 0.000 description 4
- GVNLOVJNNDZUHS-RHYQMDGZSA-N Val-Thr-Lys Chemical compound [H]N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(O)=O GVNLOVJNNDZUHS-RHYQMDGZSA-N 0.000 description 4
- 239000011543 agarose gel Substances 0.000 description 4
- 108010031014 alanyl-histidyl-leucyl-leucine Proteins 0.000 description 4
- 108010089975 arginyl-glycyl-aspartyl-serine Proteins 0.000 description 4
- 108010069205 aspartyl-phenylalanine Proteins 0.000 description 4
- 239000012503 blood component Substances 0.000 description 4
- 208000019425 cirrhosis of liver Diseases 0.000 description 4
- 238000003745 diagnosis Methods 0.000 description 4
- 230000000694 effects Effects 0.000 description 4
- 108010078144 glutaminyl-glycine Proteins 0.000 description 4
- 108010055341 glutamyl-glutamic acid Proteins 0.000 description 4
- 108010023364 glycyl-histidyl-arginine Proteins 0.000 description 4
- 108010050475 glycyl-leucyl-tyrosine Proteins 0.000 description 4
- 108010059898 glycyl-tyrosyl-lysine Proteins 0.000 description 4
- 108010015792 glycyllysine Proteins 0.000 description 4
- 108010077515 glycylproline Proteins 0.000 description 4
- 108010087823 glycyltyrosine Proteins 0.000 description 4
- 238000002955 isolation Methods 0.000 description 4
- 108010054155 lysyllysine Proteins 0.000 description 4
- 239000012528 membrane Substances 0.000 description 4
- 108010016686 methionyl-alanyl-serine Proteins 0.000 description 4
- 230000035772 mutation Effects 0.000 description 4
- 239000008188 pellet Substances 0.000 description 4
- 108010012581 phenylalanylglutamate Proteins 0.000 description 4
- 108090000765 processed proteins & peptides Proteins 0.000 description 4
- 108010079317 prolyl-tyrosine Proteins 0.000 description 4
- 238000011002 quantification Methods 0.000 description 4
- 108010048397 seryl-lysyl-leucine Proteins 0.000 description 4
- 108010071207 serylmethionine Proteins 0.000 description 4
- LWIHDJKSTIGBAC-UHFFFAOYSA-K tripotassium phosphate Chemical compound [K+].[K+].[K+].[O-]P([O-])([O-])=O LWIHDJKSTIGBAC-UHFFFAOYSA-K 0.000 description 4
- 239000012588 trypsin Substances 0.000 description 4
- 108010080629 tryptophan-leucine Proteins 0.000 description 4
- 108010045269 tryptophyltryptophan Proteins 0.000 description 4
- 108010051110 tyrosyl-lysine Proteins 0.000 description 4
- HHGYNJRJIINWAK-FXQIFTODSA-N Ala-Ala-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N HHGYNJRJIINWAK-FXQIFTODSA-N 0.000 description 3
- YLTKNGYYPIWKHZ-ACZMJKKPSA-N Ala-Ala-Glu Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCC(O)=O YLTKNGYYPIWKHZ-ACZMJKKPSA-N 0.000 description 3
- LGQPPBQRUBVTIF-JBDRJPRFSA-N Ala-Ala-Ile Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O LGQPPBQRUBVTIF-JBDRJPRFSA-N 0.000 description 3
- FJVAQLJNTSUQPY-CIUDSAMLSA-N Ala-Ala-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCCN FJVAQLJNTSUQPY-CIUDSAMLSA-N 0.000 description 3
- JBVSSSZFNTXJDX-YTLHQDLWSA-N Ala-Ala-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@H](C)N JBVSSSZFNTXJDX-YTLHQDLWSA-N 0.000 description 3
- TTXMOJWKNRJWQJ-FXQIFTODSA-N Ala-Arg-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CCCN=C(N)N TTXMOJWKNRJWQJ-FXQIFTODSA-N 0.000 description 3
- ZIWWTZWAKYBUOB-CIUDSAMLSA-N Ala-Asp-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O ZIWWTZWAKYBUOB-CIUDSAMLSA-N 0.000 description 3
- HXNNRBHASOSVPG-GUBZILKMSA-N Ala-Glu-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O HXNNRBHASOSVPG-GUBZILKMSA-N 0.000 description 3
- OINVDEKBKBCPLX-JXUBOQSCSA-N Ala-Lys-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OINVDEKBKBCPLX-JXUBOQSCSA-N 0.000 description 3
- BDQNLQSWRAPHGU-DLOVCJGASA-N Ala-Phe-Cys Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CS)C(=O)O)N BDQNLQSWRAPHGU-DLOVCJGASA-N 0.000 description 3
- IHMCQESUJVZTKW-UBHSHLNASA-N Ala-Phe-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](C)N)CC1=CC=CC=C1 IHMCQESUJVZTKW-UBHSHLNASA-N 0.000 description 3
- XWFWAXPOLRTDFZ-FXQIFTODSA-N Ala-Pro-Ser Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O XWFWAXPOLRTDFZ-FXQIFTODSA-N 0.000 description 3
- HOVPGJUNRLMIOZ-CIUDSAMLSA-N Ala-Ser-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@H](C)N HOVPGJUNRLMIOZ-CIUDSAMLSA-N 0.000 description 3
- BVLPIIBTWIYOML-ZKWXMUAHSA-N Ala-Val-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O BVLPIIBTWIYOML-ZKWXMUAHSA-N 0.000 description 3
- VHAQSYHSDKERBS-XPUUQOCRSA-N Ala-Val-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O VHAQSYHSDKERBS-XPUUQOCRSA-N 0.000 description 3
- VKKYFICVTYKFIO-CIUDSAMLSA-N Arg-Ala-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCN=C(N)N VKKYFICVTYKFIO-CIUDSAMLSA-N 0.000 description 3
- JTWOBPNAVBESFW-FXQIFTODSA-N Arg-Cys-Asp Chemical compound C(C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(=O)O)C(=O)O)N)CN=C(N)N JTWOBPNAVBESFW-FXQIFTODSA-N 0.000 description 3
- AUFHLLPVPSMEOG-YUMQZZPRSA-N Arg-Gly-Glu Chemical compound NC(N)=NCCC[C@H](N)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O AUFHLLPVPSMEOG-YUMQZZPRSA-N 0.000 description 3
- YBIAYFFIVAZXPK-AVGNSLFASA-N Arg-His-Arg Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O YBIAYFFIVAZXPK-AVGNSLFASA-N 0.000 description 3
- WMEVEPXNCMKNGH-IHRRRGAJSA-N Arg-Leu-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N WMEVEPXNCMKNGH-IHRRRGAJSA-N 0.000 description 3
- PAPSMOYMQDWIOR-AVGNSLFASA-N Arg-Lys-Val Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O PAPSMOYMQDWIOR-AVGNSLFASA-N 0.000 description 3
- DNLQVHBBMPZUGJ-BQBZGAKWSA-N Arg-Ser-Gly Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O DNLQVHBBMPZUGJ-BQBZGAKWSA-N 0.000 description 3
- GMCOADLDNLGOFE-ZLUOBGJFSA-N Asn-Asp-Cys Chemical compound C([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CS)C(=O)O)N)C(=O)N GMCOADLDNLGOFE-ZLUOBGJFSA-N 0.000 description 3
- JXMREEPBRANWBY-VEVYYDQMSA-N Asn-Thr-Arg Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O JXMREEPBRANWBY-VEVYYDQMSA-N 0.000 description 3
- FMNBYVSGRCXWEK-FOHZUACHSA-N Asn-Thr-Gly Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O FMNBYVSGRCXWEK-FOHZUACHSA-N 0.000 description 3
- ULZOQOKFYMXHPZ-AQZXSJQPSA-N Asn-Trp-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ULZOQOKFYMXHPZ-AQZXSJQPSA-N 0.000 description 3
- WEDGJJRCJNHYSF-SRVKXCTJSA-N Asp-Cys-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CC(=O)O)N WEDGJJRCJNHYSF-SRVKXCTJSA-N 0.000 description 3
- MJJIHRWNWSQTOI-VEVYYDQMSA-N Asp-Thr-Arg Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O MJJIHRWNWSQTOI-VEVYYDQMSA-N 0.000 description 3
- VHUKCUHLFMRHOD-MELADBBJSA-N Asp-Tyr-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=C(C=C2)O)NC(=O)[C@H](CC(=O)O)N)C(=O)O VHUKCUHLFMRHOD-MELADBBJSA-N 0.000 description 3
- XMKXONRMGJXCJV-LAEOZQHASA-N Asp-Val-Glu Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O XMKXONRMGJXCJV-LAEOZQHASA-N 0.000 description 3
- QOJJMJKTMKNFEF-ZKWXMUAHSA-N Asp-Val-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC(O)=O QOJJMJKTMKNFEF-ZKWXMUAHSA-N 0.000 description 3
- QLCPDGRAEJSYQM-LPEHRKFASA-N Cys-Arg-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CS)N)C(=O)O QLCPDGRAEJSYQM-LPEHRKFASA-N 0.000 description 3
- UPURLDIGQGTUPJ-ZKWXMUAHSA-N Cys-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CS)N UPURLDIGQGTUPJ-ZKWXMUAHSA-N 0.000 description 3
- TXGDWPBLUFQODU-XGEHTFHBSA-N Cys-Pro-Thr Chemical compound [H]N[C@@H](CS)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O TXGDWPBLUFQODU-XGEHTFHBSA-N 0.000 description 3
- NXQCSPVUPLUTJH-WHFBIAKZSA-N Cys-Ser-Gly Chemical compound SC[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O NXQCSPVUPLUTJH-WHFBIAKZSA-N 0.000 description 3
- FANFRJOFTYCNRG-JYBASQMISA-N Cys-Thr-Trp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CS)N)O FANFRJOFTYCNRG-JYBASQMISA-N 0.000 description 3
- 108010014303 DNA-directed DNA polymerase Proteins 0.000 description 3
- 102000016928 DNA-directed DNA polymerase Human genes 0.000 description 3
- 241000710188 Encephalomyocarditis virus Species 0.000 description 3
- 206010016654 Fibrosis Diseases 0.000 description 3
- OYTPNWYZORARHL-XHNCKOQMSA-N Gln-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)N)N OYTPNWYZORARHL-XHNCKOQMSA-N 0.000 description 3
- XOKGKOQWADCLFQ-GARJFASQSA-N Gln-Arg-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CCC(=O)N)N)C(=O)O XOKGKOQWADCLFQ-GARJFASQSA-N 0.000 description 3
- FGYPOQPQTUNESW-IUCAKERBSA-N Gln-Gly-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CCC(=O)N)N FGYPOQPQTUNESW-IUCAKERBSA-N 0.000 description 3
- RGAOLBZBLOJUTP-GRLWGSQLSA-N Gln-Ile-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)NC(=O)[C@H](CCC(=O)N)N RGAOLBZBLOJUTP-GRLWGSQLSA-N 0.000 description 3
- DYVMTEWCGAVKSE-HJGDQZAQSA-N Gln-Thr-Arg Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](CCC(=O)N)N)O DYVMTEWCGAVKSE-HJGDQZAQSA-N 0.000 description 3
- KHHDJQRWIFHXHS-NRPADANISA-N Gln-Val-Cys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCC(=O)N)N KHHDJQRWIFHXHS-NRPADANISA-N 0.000 description 3
- RUFHOVYUYSNDNY-ACZMJKKPSA-N Glu-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(O)=O RUFHOVYUYSNDNY-ACZMJKKPSA-N 0.000 description 3
- QPRZKNOOOBWXSU-CIUDSAMLSA-N Glu-Asp-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N QPRZKNOOOBWXSU-CIUDSAMLSA-N 0.000 description 3
- UHVIQGKBMXEVGN-WDSKDSINSA-N Glu-Gly-Asn Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O UHVIQGKBMXEVGN-WDSKDSINSA-N 0.000 description 3
- NNQDRRUXFJYCCJ-NHCYSSNCSA-N Glu-Pro-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O NNQDRRUXFJYCCJ-NHCYSSNCSA-N 0.000 description 3
- BDISFWMLMNBTGP-NUMRIWBASA-N Glu-Thr-Asp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(O)=O BDISFWMLMNBTGP-NUMRIWBASA-N 0.000 description 3
- ZNOHKCPYDAYYDA-BPUTZDHNSA-N Glu-Trp-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCC(N)=O)C(O)=O ZNOHKCPYDAYYDA-BPUTZDHNSA-N 0.000 description 3
- RJIVPOXLQFJRTG-LURJTMIESA-N Gly-Arg-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)CN)CCCN=C(N)N RJIVPOXLQFJRTG-LURJTMIESA-N 0.000 description 3
- HDNXXTBKOJKWNN-WDSKDSINSA-N Gly-Glu-Asn Chemical compound NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O HDNXXTBKOJKWNN-WDSKDSINSA-N 0.000 description 3
- FQKKPCWTZZEDIC-XPUUQOCRSA-N Gly-His-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H](NC(=O)CN)CC1=CN=CN1 FQKKPCWTZZEDIC-XPUUQOCRSA-N 0.000 description 3
- SWQALSGKVLYKDT-ZKWXMUAHSA-N Gly-Ile-Ala Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O SWQALSGKVLYKDT-ZKWXMUAHSA-N 0.000 description 3
- UHPAZODVFFYEEL-QWRGUYRKSA-N Gly-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)CN UHPAZODVFFYEEL-QWRGUYRKSA-N 0.000 description 3
- ISSDODCYBOWWIP-GJZGRUSLSA-N Gly-Pro-Trp Chemical compound [H]NCC(=O)N1CCC[C@H]1C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O ISSDODCYBOWWIP-GJZGRUSLSA-N 0.000 description 3
- BMWFDYIYBAFROD-WPRPVWTQSA-N Gly-Pro-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)CN BMWFDYIYBAFROD-WPRPVWTQSA-N 0.000 description 3
- ZLCLYFGMKFCDCN-XPUUQOCRSA-N Gly-Ser-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CO)NC(=O)CN)C(O)=O ZLCLYFGMKFCDCN-XPUUQOCRSA-N 0.000 description 3
- LYZYGGWCBLBDMC-QWHCGFSZSA-N Gly-Tyr-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=C(C=C2)O)NC(=O)CN)C(=O)O LYZYGGWCBLBDMC-QWHCGFSZSA-N 0.000 description 3
- RYAOJUMWLWUGNW-QMMMGPOBSA-N Gly-Val-Gly Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O RYAOJUMWLWUGNW-QMMMGPOBSA-N 0.000 description 3
- PEDCQBHIVMGVHV-UHFFFAOYSA-N Glycerine Chemical compound OCC(O)CO PEDCQBHIVMGVHV-UHFFFAOYSA-N 0.000 description 3
- DCRODRAURLJOFY-XPUUQOCRSA-N His-Ala-Gly Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(=O)NCC(O)=O DCRODRAURLJOFY-XPUUQOCRSA-N 0.000 description 3
- VYMGAXSNYUFVCK-GUBZILKMSA-N His-Gln-Asn Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC(=O)N)C(=O)O)N VYMGAXSNYUFVCK-GUBZILKMSA-N 0.000 description 3
- ZRSJXIKQXUGKRB-TUBUOCAGSA-N His-Ile-Thr Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ZRSJXIKQXUGKRB-TUBUOCAGSA-N 0.000 description 3
- JMSONHOUHFDOJH-GUBZILKMSA-N His-Ser-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CN=CN1 JMSONHOUHFDOJH-GUBZILKMSA-N 0.000 description 3
- BCSGDNGNHKBRRJ-ULQDDVLXSA-N His-Tyr-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CC2=CN=CN2)N BCSGDNGNHKBRRJ-ULQDDVLXSA-N 0.000 description 3
- 241000282412 Homo Species 0.000 description 3
- QICVAHODWHIWIS-HTFCKZLJSA-N Ile-Ala-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N QICVAHODWHIWIS-HTFCKZLJSA-N 0.000 description 3
- QSPLUJGYOPZINY-ZPFDUUQYSA-N Ile-Asp-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N QSPLUJGYOPZINY-ZPFDUUQYSA-N 0.000 description 3
- KUHFPGIVBOCRMV-MNXVOIDGSA-N Ile-Gln-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC(C)C)C(=O)O)N KUHFPGIVBOCRMV-MNXVOIDGSA-N 0.000 description 3
- NYEYYMLUABXDMC-NHCYSSNCSA-N Ile-Gly-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N[C@@H](CC(C)C)C(=O)O)N NYEYYMLUABXDMC-NHCYSSNCSA-N 0.000 description 3
- CMNMPCTVCWWYHY-MXAVVETBSA-N Ile-His-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CC(C)C)C(=O)O)N CMNMPCTVCWWYHY-MXAVVETBSA-N 0.000 description 3
- KLBVGHCGHUNHEA-BJDJZHNGSA-N Ile-Leu-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(=O)O)N KLBVGHCGHUNHEA-BJDJZHNGSA-N 0.000 description 3
- FZWVCYCYWCLQDH-NHCYSSNCSA-N Ile-Leu-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)NCC(=O)O)N FZWVCYCYWCLQDH-NHCYSSNCSA-N 0.000 description 3
- PARSHQDZROHERM-NHCYSSNCSA-N Ile-Lys-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)NCC(=O)O)N PARSHQDZROHERM-NHCYSSNCSA-N 0.000 description 3
- SAVXZJYTTQQQDD-QEWYBTABSA-N Ile-Phe-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N SAVXZJYTTQQQDD-QEWYBTABSA-N 0.000 description 3
- LRAUKBMYHHNADU-DKIMLUQUSA-N Ile-Phe-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)[C@@H](C)CC)CC1=CC=CC=C1 LRAUKBMYHHNADU-DKIMLUQUSA-N 0.000 description 3
- FQYQMFCIJNWDQZ-CYDGBPFRSA-N Ile-Pro-Pro Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 FQYQMFCIJNWDQZ-CYDGBPFRSA-N 0.000 description 3
- ZLFNNVATRMCAKN-ZKWXMUAHSA-N Ile-Ser-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)NCC(=O)O)N ZLFNNVATRMCAKN-ZKWXMUAHSA-N 0.000 description 3
- HXIDVIFHRYRXLZ-NAKRPEOUSA-N Ile-Ser-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)O)N HXIDVIFHRYRXLZ-NAKRPEOUSA-N 0.000 description 3
- HQLSBZFLOUHQJK-STECZYCISA-N Ile-Tyr-Arg Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N HQLSBZFLOUHQJK-STECZYCISA-N 0.000 description 3
- SENJXOPIZNYLHU-UHFFFAOYSA-N L-leucyl-L-arginine Natural products CC(C)CC(N)C(=O)NC(C(O)=O)CCCN=C(N)N SENJXOPIZNYLHU-UHFFFAOYSA-N 0.000 description 3
- LJHGALIOHLRRQN-DCAQKATOSA-N Leu-Ala-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N LJHGALIOHLRRQN-DCAQKATOSA-N 0.000 description 3
- WNGVUZWBXZKQES-YUMQZZPRSA-N Leu-Ala-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O WNGVUZWBXZKQES-YUMQZZPRSA-N 0.000 description 3
- DQPQTXMIRBUWKO-DCAQKATOSA-N Leu-Ala-Met Chemical compound C[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CC(C)C)N DQPQTXMIRBUWKO-DCAQKATOSA-N 0.000 description 3
- BQSLGJHIAGOZCD-CIUDSAMLSA-N Leu-Ala-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O BQSLGJHIAGOZCD-CIUDSAMLSA-N 0.000 description 3
- BAJIJEGGUYXZGC-CIUDSAMLSA-N Leu-Asn-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CS)C(=O)O)N BAJIJEGGUYXZGC-CIUDSAMLSA-N 0.000 description 3
- DLCOFDAHNMMQPP-SRVKXCTJSA-N Leu-Asp-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O DLCOFDAHNMMQPP-SRVKXCTJSA-N 0.000 description 3
- HFBCHNRFRYLZNV-GUBZILKMSA-N Leu-Glu-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O HFBCHNRFRYLZNV-GUBZILKMSA-N 0.000 description 3
- FMEICTQWUKNAGC-YUMQZZPRSA-N Leu-Gly-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O FMEICTQWUKNAGC-YUMQZZPRSA-N 0.000 description 3
- CCQLQKZTXZBXTN-NHCYSSNCSA-N Leu-Gly-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)NCC(=O)N[C@@H]([C@@H](C)CC)C(O)=O CCQLQKZTXZBXTN-NHCYSSNCSA-N 0.000 description 3
- HYIFFZAQXPUEAU-QWRGUYRKSA-N Leu-Gly-Leu Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC(C)C HYIFFZAQXPUEAU-QWRGUYRKSA-N 0.000 description 3
- LIINDKYIGYTDLG-PPCPHDFISA-N Leu-Ile-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LIINDKYIGYTDLG-PPCPHDFISA-N 0.000 description 3
- TVEOVCYCYGKVPP-HSCHXYMDSA-N Leu-Ile-Trp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CC(C)C)N TVEOVCYCYGKVPP-HSCHXYMDSA-N 0.000 description 3
- IAJFFZORSWOZPQ-SRVKXCTJSA-N Leu-Leu-Asn Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O IAJFFZORSWOZPQ-SRVKXCTJSA-N 0.000 description 3
- LXKNSJLSGPNHSK-KKUMJFAQSA-N Leu-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)O)N LXKNSJLSGPNHSK-KKUMJFAQSA-N 0.000 description 3
- ZRHDPZAAWLXXIR-SRVKXCTJSA-N Leu-Lys-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O ZRHDPZAAWLXXIR-SRVKXCTJSA-N 0.000 description 3
- KQFZKDITNUEVFJ-JYJNAYRXSA-N Leu-Phe-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CC(C)C)CC1=CC=CC=C1 KQFZKDITNUEVFJ-JYJNAYRXSA-N 0.000 description 3
- DRWMRVFCKKXHCH-BZSNNMDCSA-N Leu-Phe-Leu Chemical compound CC(C)C[C@H]([NH3+])C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C([O-])=O)CC1=CC=CC=C1 DRWMRVFCKKXHCH-BZSNNMDCSA-N 0.000 description 3
- WMIOEVKKYIMVKI-DCAQKATOSA-N Leu-Pro-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O WMIOEVKKYIMVKI-DCAQKATOSA-N 0.000 description 3
- UCBPDSYUVAAHCD-UWVGGRQHSA-N Leu-Pro-Gly Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O UCBPDSYUVAAHCD-UWVGGRQHSA-N 0.000 description 3
- IDGZVZJLYFTXSL-DCAQKATOSA-N Leu-Ser-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCN=C(N)N IDGZVZJLYFTXSL-DCAQKATOSA-N 0.000 description 3
- IWMJFLJQHIDZQW-KKUMJFAQSA-N Leu-Ser-Phe Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 IWMJFLJQHIDZQW-KKUMJFAQSA-N 0.000 description 3
- XZNJZXJZBMBGGS-NHCYSSNCSA-N Leu-Val-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O XZNJZXJZBMBGGS-NHCYSSNCSA-N 0.000 description 3
- QESXLSQLQHHTIX-RHYQMDGZSA-N Leu-Val-Thr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O QESXLSQLQHHTIX-RHYQMDGZSA-N 0.000 description 3
- KCXUCYYZNZFGLL-SRVKXCTJSA-N Lys-Ala-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O KCXUCYYZNZFGLL-SRVKXCTJSA-N 0.000 description 3
- ISHNZELVUVPCHY-ZETCQYMHSA-N Lys-Gly-Gly Chemical compound NCCCC[C@H](N)C(=O)NCC(=O)NCC(O)=O ISHNZELVUVPCHY-ZETCQYMHSA-N 0.000 description 3
- NKKFVJRLCCUJNA-QWRGUYRKSA-N Lys-Gly-Lys Chemical compound NCCCC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCCN NKKFVJRLCCUJNA-QWRGUYRKSA-N 0.000 description 3
- MYZMQWHPDAYKIE-SRVKXCTJSA-N Lys-Leu-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O MYZMQWHPDAYKIE-SRVKXCTJSA-N 0.000 description 3
- ODTZHNZPINULEU-KKUMJFAQSA-N Lys-Phe-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCCCN)N ODTZHNZPINULEU-KKUMJFAQSA-N 0.000 description 3
- PDIDTSZKKFEDMB-UWVGGRQHSA-N Lys-Pro-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O PDIDTSZKKFEDMB-UWVGGRQHSA-N 0.000 description 3
- TVOOGUNBIWAURO-KATARQTJSA-N Lys-Thr-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCCCN)N)O TVOOGUNBIWAURO-KATARQTJSA-N 0.000 description 3
- RPWQJSBMXJSCPD-XUXIUFHCSA-N Lys-Val-Ile Chemical compound CC[C@H](C)[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)CCCCN)C(C)C)C(O)=O RPWQJSBMXJSCPD-XUXIUFHCSA-N 0.000 description 3
- 108010066427 N-valyltryptophan Proteins 0.000 description 3
- 101100068676 Neurospora crassa (strain ATCC 24698 / 74-OR23-1A / CBS 708.71 / DSM 1257 / FGSC 987) gln-1 gene Proteins 0.000 description 3
- 101800001020 Non-structural protein 4A Proteins 0.000 description 3
- 101800001019 Non-structural protein 4B Proteins 0.000 description 3
- 241000282579 Pan Species 0.000 description 3
- DPUOLKQSMYLRDR-UBHSHLNASA-N Phe-Arg-Ala Chemical compound NC(N)=NCCC[C@@H](C(=O)N[C@@H](C)C(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 DPUOLKQSMYLRDR-UBHSHLNASA-N 0.000 description 3
- WFDAEEUZPZSMOG-SRVKXCTJSA-N Phe-Cys-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CS)C(=O)N[C@@H](CO)C(O)=O WFDAEEUZPZSMOG-SRVKXCTJSA-N 0.000 description 3
- VZFPYFRVHMSSNA-JURCDPSOSA-N Phe-Ile-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CC1=CC=CC=C1 VZFPYFRVHMSSNA-JURCDPSOSA-N 0.000 description 3
- KXUZHWXENMYOHC-QEJZJMRPSA-N Phe-Leu-Ala Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O KXUZHWXENMYOHC-QEJZJMRPSA-N 0.000 description 3
- LUGOKRWYNMDGTD-FXQIFTODSA-N Pro-Cys-Asn Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(=O)N)C(=O)O LUGOKRWYNMDGTD-FXQIFTODSA-N 0.000 description 3
- DXTOOBDIIAJZBJ-BQBZGAKWSA-N Pro-Gly-Ser Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](CO)C(O)=O DXTOOBDIIAJZBJ-BQBZGAKWSA-N 0.000 description 3
- KLSOMAFWRISSNI-OSUNSFLBSA-N Pro-Ile-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H]1CCCN1 KLSOMAFWRISSNI-OSUNSFLBSA-N 0.000 description 3
- FMLRRBDLBJLJIK-DCAQKATOSA-N Pro-Leu-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H]1CCCN1 FMLRRBDLBJLJIK-DCAQKATOSA-N 0.000 description 3
- ULWBBFKQBDNGOY-RWMBFGLXSA-N Pro-Lys-Pro Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CCCCN)C(=O)N2CCC[C@@H]2C(=O)O ULWBBFKQBDNGOY-RWMBFGLXSA-N 0.000 description 3
- 239000004365 Protease Substances 0.000 description 3
- 239000012083 RIPA buffer Substances 0.000 description 3
- 108091034057 RNA (poly(A)) Proteins 0.000 description 3
- 101800001554 RNA-directed RNA polymerase Proteins 0.000 description 3
- 102100037486 Reverse transcriptase/ribonuclease H Human genes 0.000 description 3
- RDFQNDHEHVSONI-ZLUOBGJFSA-N Ser-Asn-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O RDFQNDHEHVSONI-ZLUOBGJFSA-N 0.000 description 3
- IXUGADGDCQDLSA-FXQIFTODSA-N Ser-Gln-Gln Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CO)N IXUGADGDCQDLSA-FXQIFTODSA-N 0.000 description 3
- YMTLKLXDFCSCNX-BYPYZUCNSA-N Ser-Gly-Gly Chemical compound OC[C@H](N)C(=O)NCC(=O)NCC(O)=O YMTLKLXDFCSCNX-BYPYZUCNSA-N 0.000 description 3
- VMLONWHIORGALA-SRVKXCTJSA-N Ser-Leu-Leu Chemical compound CC(C)C[C@@H](C([O-])=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H]([NH3+])CO VMLONWHIORGALA-SRVKXCTJSA-N 0.000 description 3
- XUDRHBPSPAPDJP-SRVKXCTJSA-N Ser-Lys-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CO XUDRHBPSPAPDJP-SRVKXCTJSA-N 0.000 description 3
- LRZLZIUXQBIWTB-KATARQTJSA-N Ser-Lys-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LRZLZIUXQBIWTB-KATARQTJSA-N 0.000 description 3
- FKYWFUYPVKLJLP-DCAQKATOSA-N Ser-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CO FKYWFUYPVKLJLP-DCAQKATOSA-N 0.000 description 3
- SRSPTFBENMJHMR-WHFBIAKZSA-N Ser-Ser-Gly Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O SRSPTFBENMJHMR-WHFBIAKZSA-N 0.000 description 3
- VLMIUSLQONKLDV-HEIBUPTGSA-N Ser-Thr-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O VLMIUSLQONKLDV-HEIBUPTGSA-N 0.000 description 3
- LSHUNRICNSEEAN-BPUTZDHNSA-N Ser-Val-Trp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CO)N LSHUNRICNSEEAN-BPUTZDHNSA-N 0.000 description 3
- 239000006180 TBST buffer Substances 0.000 description 3
- 101150025711 TF gene Proteins 0.000 description 3
- MQCPGOZXFSYJPS-KZVJFYERSA-N Thr-Ala-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O MQCPGOZXFSYJPS-KZVJFYERSA-N 0.000 description 3
- LYGKYFKSZTUXGZ-ZDLURKLDSA-N Thr-Cys-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CS)C(=O)NCC(O)=O LYGKYFKSZTUXGZ-ZDLURKLDSA-N 0.000 description 3
- KCRQEJSKXAIULJ-FJXKBIBVSA-N Thr-Gly-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CCCNC(N)=N)C(O)=O KCRQEJSKXAIULJ-FJXKBIBVSA-N 0.000 description 3
- QQWNRERCGGZOKG-WEDXCCLWSA-N Thr-Gly-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O QQWNRERCGGZOKG-WEDXCCLWSA-N 0.000 description 3
- GVMXJJAJLIEASL-ZJDVBMNYSA-N Thr-Pro-Thr Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O GVMXJJAJLIEASL-ZJDVBMNYSA-N 0.000 description 3
- BBPCSGKKPJUYRB-UVOCVTCTSA-N Thr-Thr-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O BBPCSGKKPJUYRB-UVOCVTCTSA-N 0.000 description 3
- LECUEEHKUFYOOV-ZJDVBMNYSA-N Thr-Thr-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)O)NC(=O)[C@@H](N)[C@@H](C)O LECUEEHKUFYOOV-ZJDVBMNYSA-N 0.000 description 3
- BRBCKMMXKONBAA-KWBADKCTSA-N Trp-Ala-Ala Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O)=CNC2=C1 BRBCKMMXKONBAA-KWBADKCTSA-N 0.000 description 3
- OAZLRFLMQASGNW-PMVMPFDFSA-N Trp-His-Tyr Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC3=CN=CN3)C(=O)N[C@@H](CC4=CC=C(C=C4)O)C(=O)O)N OAZLRFLMQASGNW-PMVMPFDFSA-N 0.000 description 3
- RCMHSGRBJCMFLR-BPUTZDHNSA-N Trp-Met-Asn Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(N)=O)C(O)=O)=CNC2=C1 RCMHSGRBJCMFLR-BPUTZDHNSA-N 0.000 description 3
- ACGIVBXINJFALS-HKUYNNGSSA-N Trp-Phe-Gly Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)N ACGIVBXINJFALS-HKUYNNGSSA-N 0.000 description 3
- SEXRBCGSZRCIPE-LYSGOOTNSA-N Trp-Thr-Gly Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N)O SEXRBCGSZRCIPE-LYSGOOTNSA-N 0.000 description 3
- XGEUYEOEZYFHRL-KKXDTOCCSA-N Tyr-Ala-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=C(O)C=C1 XGEUYEOEZYFHRL-KKXDTOCCSA-N 0.000 description 3
- QNJYPWZACBACER-KKUMJFAQSA-N Tyr-Asp-His Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N)O QNJYPWZACBACER-KKUMJFAQSA-N 0.000 description 3
- BVDHHLMIZFCAAU-BZSNNMDCSA-N Tyr-Cys-Phe Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O BVDHHLMIZFCAAU-BZSNNMDCSA-N 0.000 description 3
- YWXMGBUGMLJMIP-IHPCNDPISA-N Tyr-Cys-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CC3=CC=C(C=C3)O)N YWXMGBUGMLJMIP-IHPCNDPISA-N 0.000 description 3
- DJIJBQYBDKGDIS-JYJNAYRXSA-N Tyr-Val-Val Chemical compound CC(C)[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)Cc1ccc(O)cc1)C(C)C)C(O)=O DJIJBQYBDKGDIS-JYJNAYRXSA-N 0.000 description 3
- XLDYBRXERHITNH-QSFUFRPTSA-N Val-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)C(C)C XLDYBRXERHITNH-QSFUFRPTSA-N 0.000 description 3
- CELJCNRXKZPTCX-XPUUQOCRSA-N Val-Gly-Ala Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@@H](C)C(O)=O CELJCNRXKZPTCX-XPUUQOCRSA-N 0.000 description 3
- URIRWLJVWHYLET-ONGXEEELSA-N Val-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)C(C)C URIRWLJVWHYLET-ONGXEEELSA-N 0.000 description 3
- KZKMBGXCNLPYKD-YEPSODPASA-N Val-Gly-Thr Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O KZKMBGXCNLPYKD-YEPSODPASA-N 0.000 description 3
- BCBFMJYTNKDALA-UFYCRDLUSA-N Val-Phe-Phe Chemical compound N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O BCBFMJYTNKDALA-UFYCRDLUSA-N 0.000 description 3
- UQMPYVLTQCGRSK-IFFSRLJSSA-N Val-Thr-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](C(C)C)N)O UQMPYVLTQCGRSK-IFFSRLJSSA-N 0.000 description 3
- ZHWZDZFWBXWPDW-GUBZILKMSA-N Val-Val-Cys Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CS)C(O)=O ZHWZDZFWBXWPDW-GUBZILKMSA-N 0.000 description 3
- 108010005233 alanylglutamic acid Proteins 0.000 description 3
- 230000003321 amplification Effects 0.000 description 3
- 238000000137 annealing Methods 0.000 description 3
- 230000000692 anti-sense effect Effects 0.000 description 3
- 239000000427 antigen Substances 0.000 description 3
- 108091007433 antigens Proteins 0.000 description 3
- 102000036639 antigens Human genes 0.000 description 3
- 108010029539 arginyl-prolyl-proline Proteins 0.000 description 3
- 108010068265 aspartyltyrosine Proteins 0.000 description 3
- 239000000872 buffer Substances 0.000 description 3
- 210000004899 c-terminal region Anatomy 0.000 description 3
- 239000003153 chemical reaction reagent Substances 0.000 description 3
- 238000010276 construction Methods 0.000 description 3
- 238000004520 electroporation Methods 0.000 description 3
- 239000000499 gel Substances 0.000 description 3
- 210000003494 hepatocyte Anatomy 0.000 description 3
- 108010018006 histidylserine Proteins 0.000 description 3
- 230000006872 improvement Effects 0.000 description 3
- 230000002401 inhibitory effect Effects 0.000 description 3
- 108010044374 isoleucyl-tyrosine Proteins 0.000 description 3
- 108010000761 leucylarginine Proteins 0.000 description 3
- 210000005228 liver tissue Anatomy 0.000 description 3
- 210000004962 mammalian cell Anatomy 0.000 description 3
- 238000004519 manufacturing process Methods 0.000 description 3
- 230000007246 mechanism Effects 0.000 description 3
- 238000003199 nucleic acid amplification method Methods 0.000 description 3
- 230000007170 pathology Effects 0.000 description 3
- 108010073101 phenylalanylleucine Proteins 0.000 description 3
- 108010031719 prolyl-serine Proteins 0.000 description 3
- 108010070643 prolylglutamic acid Proteins 0.000 description 3
- 235000019419 proteases Nutrition 0.000 description 3
- 239000011780 sodium chloride Substances 0.000 description 3
- 108010009962 valyltyrosine Proteins 0.000 description 3
- 108700026220 vif Genes Proteins 0.000 description 3
- 238000005406 washing Methods 0.000 description 3
- BRPMXFSTKXXNHF-IUCAKERBSA-N (2s)-1-[2-[[(2s)-pyrrolidine-2-carbonyl]amino]acetyl]pyrrolidine-2-carboxylic acid Chemical compound OC(=O)[C@@H]1CCCN1C(=O)CNC(=O)[C@H]1NCCC1 BRPMXFSTKXXNHF-IUCAKERBSA-N 0.000 description 2
- JKMHFZQWWAIEOD-UHFFFAOYSA-N 2-[4-(2-hydroxyethyl)piperazin-1-yl]ethanesulfonic acid Chemical compound OCC[NH+]1CCN(CCS([O-])(=O)=O)CC1 JKMHFZQWWAIEOD-UHFFFAOYSA-N 0.000 description 2
- QTBSBXVTEAMEQO-UHFFFAOYSA-N Acetic acid Chemical compound CC(O)=O QTBSBXVTEAMEQO-UHFFFAOYSA-N 0.000 description 2
- SBGXWWCLHIOABR-UHFFFAOYSA-N Ala Ala Gly Ala Chemical compound CC(N)C(=O)NC(C)C(=O)NCC(=O)NC(C)C(O)=O SBGXWWCLHIOABR-UHFFFAOYSA-N 0.000 description 2
- BUANFPRKJKJSRR-ACZMJKKPSA-N Ala-Ala-Gln Chemical compound C[C@H]([NH3+])C(=O)N[C@@H](C)C(=O)N[C@H](C([O-])=O)CCC(N)=O BUANFPRKJKJSRR-ACZMJKKPSA-N 0.000 description 2
- WQVFQXXBNHHPLX-ZKWXMUAHSA-N Ala-Ala-His Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](Cc1cnc[nH]1)C(O)=O WQVFQXXBNHHPLX-ZKWXMUAHSA-N 0.000 description 2
- SVBXIUDNTRTKHE-CIUDSAMLSA-N Ala-Arg-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O SVBXIUDNTRTKHE-CIUDSAMLSA-N 0.000 description 2
- JBGSZRYCXBPWGX-BQBZGAKWSA-N Ala-Arg-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)[C@@H](N)C)CCCN=C(N)N JBGSZRYCXBPWGX-BQBZGAKWSA-N 0.000 description 2
- NXSFUECZFORGOG-CIUDSAMLSA-N Ala-Asn-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O NXSFUECZFORGOG-CIUDSAMLSA-N 0.000 description 2
- YSMPVONNIWLJML-FXQIFTODSA-N Ala-Asp-Pro Chemical compound C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(O)=O YSMPVONNIWLJML-FXQIFTODSA-N 0.000 description 2
- LGFCAXJBAZESCF-ACZMJKKPSA-N Ala-Gln-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O LGFCAXJBAZESCF-ACZMJKKPSA-N 0.000 description 2
- WKOBSJOZRJJVRZ-FXQIFTODSA-N Ala-Glu-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O WKOBSJOZRJJVRZ-FXQIFTODSA-N 0.000 description 2
- XYTNPQNAZREREP-XQXXSGGOSA-N Ala-Glu-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XYTNPQNAZREREP-XQXXSGGOSA-N 0.000 description 2
- WGDNWOMKBUXFHR-BQBZGAKWSA-N Ala-Gly-Arg Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCN=C(N)N WGDNWOMKBUXFHR-BQBZGAKWSA-N 0.000 description 2
- WMYJZJRILUVVRG-WDSKDSINSA-N Ala-Gly-Gln Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(N)=O WMYJZJRILUVVRG-WDSKDSINSA-N 0.000 description 2
- NBTGEURICRTMGL-WHFBIAKZSA-N Ala-Gly-Ser Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O NBTGEURICRTMGL-WHFBIAKZSA-N 0.000 description 2
- OBVSBEYOMDWLRJ-BFHQHQDPSA-N Ala-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@H](C)N OBVSBEYOMDWLRJ-BFHQHQDPSA-N 0.000 description 2
- HJGZVLLLBJLXFC-LSJOCFKGSA-N Ala-His-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C(C)C)C(O)=O HJGZVLLLBJLXFC-LSJOCFKGSA-N 0.000 description 2
- TZDNWXDLYFIFPT-BJDJZHNGSA-N Ala-Ile-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O TZDNWXDLYFIFPT-BJDJZHNGSA-N 0.000 description 2
- QPBSRMDNJOTFAL-AICCOOGYSA-N Ala-Leu-Leu-Thr Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O QPBSRMDNJOTFAL-AICCOOGYSA-N 0.000 description 2
- SOBIAADAMRHGKH-CIUDSAMLSA-N Ala-Leu-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O SOBIAADAMRHGKH-CIUDSAMLSA-N 0.000 description 2
- XHNLCGXYBXNRIS-BJDJZHNGSA-N Ala-Lys-Ile Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O XHNLCGXYBXNRIS-BJDJZHNGSA-N 0.000 description 2
- PMQXMXAASGFUDX-SRVKXCTJSA-N Ala-Lys-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](C)N)CCCCN PMQXMXAASGFUDX-SRVKXCTJSA-N 0.000 description 2
- NINQYGGNRIBFSC-CIUDSAMLSA-N Ala-Lys-Ser Chemical compound NCCCC[C@H](NC(=O)[C@@H](N)C)C(=O)N[C@@H](CO)C(O)=O NINQYGGNRIBFSC-CIUDSAMLSA-N 0.000 description 2
- 108010011667 Ala-Phe-Ala Proteins 0.000 description 2
- MAZZQZWCCYJQGZ-GUBZILKMSA-N Ala-Pro-Arg Chemical compound [H]N[C@@H](C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(O)=O MAZZQZWCCYJQGZ-GUBZILKMSA-N 0.000 description 2
- FFZJHQODAYHGPO-KZVJFYERSA-N Ala-Pro-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](C)N FFZJHQODAYHGPO-KZVJFYERSA-N 0.000 description 2
- YHBDGLZYNIARKJ-GUBZILKMSA-N Ala-Pro-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](C)N YHBDGLZYNIARKJ-GUBZILKMSA-N 0.000 description 2
- DCVYRWFAMZFSDA-ZLUOBGJFSA-N Ala-Ser-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O DCVYRWFAMZFSDA-ZLUOBGJFSA-N 0.000 description 2
- RTZCUEHYUQZIDE-WHFBIAKZSA-N Ala-Ser-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O RTZCUEHYUQZIDE-WHFBIAKZSA-N 0.000 description 2
- NHWYNIZWLJYZAG-XVYDVKMFSA-N Ala-Ser-His Chemical compound C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N NHWYNIZWLJYZAG-XVYDVKMFSA-N 0.000 description 2
- DYXOFPBJBAHWFY-JBDRJPRFSA-N Ala-Ser-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@H](C)N DYXOFPBJBAHWFY-JBDRJPRFSA-N 0.000 description 2
- IETUUAHKCHOQHP-KZVJFYERSA-N Ala-Thr-Val Chemical compound CC(C)[C@H](NC(=O)[C@@H](NC(=O)[C@H](C)N)[C@@H](C)O)C(O)=O IETUUAHKCHOQHP-KZVJFYERSA-N 0.000 description 2
- BHFOJPDOQPWJRN-XDTLVQLUSA-N Ala-Tyr-Gln Chemical compound C[C@H](N)C(=O)N[C@@H](Cc1ccc(O)cc1)C(=O)N[C@@H](CCC(N)=O)C(O)=O BHFOJPDOQPWJRN-XDTLVQLUSA-N 0.000 description 2
- BGGAIXWIZCIFSG-XDTLVQLUSA-N Ala-Tyr-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O BGGAIXWIZCIFSG-XDTLVQLUSA-N 0.000 description 2
- ZCUFMRIQCPNOHZ-NRPADANISA-N Ala-Val-Gln Chemical compound C[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N ZCUFMRIQCPNOHZ-NRPADANISA-N 0.000 description 2
- CLOMBHBBUKAUBP-LSJOCFKGSA-N Ala-Val-His Chemical compound C[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N CLOMBHBBUKAUBP-LSJOCFKGSA-N 0.000 description 2
- VBFJESQBIWCWRL-DCAQKATOSA-N Arg-Ala-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCNC(N)=N VBFJESQBIWCWRL-DCAQKATOSA-N 0.000 description 2
- VWVPYNGMOCSSGK-GUBZILKMSA-N Arg-Arg-Asn Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(O)=O VWVPYNGMOCSSGK-GUBZILKMSA-N 0.000 description 2
- BVBKBQRPOJFCQM-DCAQKATOSA-N Arg-Asn-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O BVBKBQRPOJFCQM-DCAQKATOSA-N 0.000 description 2
- XVLLUZMFSAYKJV-GUBZILKMSA-N Arg-Asp-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O XVLLUZMFSAYKJV-GUBZILKMSA-N 0.000 description 2
- VXXHDZKEQNGXNU-QXEWZRGKSA-N Arg-Asp-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CCCN=C(N)N VXXHDZKEQNGXNU-QXEWZRGKSA-N 0.000 description 2
- XTGGTAWGUFXJSV-NAKRPEOUSA-N Arg-Cys-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CCCN=C(N)N)N XTGGTAWGUFXJSV-NAKRPEOUSA-N 0.000 description 2
- VSPLYCLMFAUZRF-GUBZILKMSA-N Arg-Cys-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CCCN=C(N)N)N VSPLYCLMFAUZRF-GUBZILKMSA-N 0.000 description 2
- HQIZDMIGUJOSNI-IUCAKERBSA-N Arg-Gly-Arg Chemical compound N[C@@H](CCCNC(N)=N)C(=O)NCC(=O)N[C@@H](CCCNC(N)=N)C(O)=O HQIZDMIGUJOSNI-IUCAKERBSA-N 0.000 description 2
- RFXXUWGNVRJTNQ-QXEWZRGKSA-N Arg-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CCCN=C(N)N)N RFXXUWGNVRJTNQ-QXEWZRGKSA-N 0.000 description 2
- OQCWXQJLCDPRHV-UWVGGRQHSA-N Arg-Gly-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O OQCWXQJLCDPRHV-UWVGGRQHSA-N 0.000 description 2
- WVNFNPGXYADPPO-BQBZGAKWSA-N Arg-Gly-Ser Chemical compound NC(N)=NCCC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O WVNFNPGXYADPPO-BQBZGAKWSA-N 0.000 description 2
- NKNILFJYKKHBKE-WPRPVWTQSA-N Arg-Gly-Val Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O NKNILFJYKKHBKE-WPRPVWTQSA-N 0.000 description 2
- ITHMWNNUDPJJER-ULQDDVLXSA-N Arg-His-Tyr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O ITHMWNNUDPJJER-ULQDDVLXSA-N 0.000 description 2
- DGFXIWKPTDKBLF-AVGNSLFASA-N Arg-His-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CCCN=C(N)N)N DGFXIWKPTDKBLF-AVGNSLFASA-N 0.000 description 2
- JEXPNDORFYHJTM-IHRRRGAJSA-N Arg-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CCCN=C(N)N JEXPNDORFYHJTM-IHRRRGAJSA-N 0.000 description 2
- COXMUHNBYCVVRG-DCAQKATOSA-N Arg-Leu-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O COXMUHNBYCVVRG-DCAQKATOSA-N 0.000 description 2
- YVTHEZNOKSAWRW-DCAQKATOSA-N Arg-Lys-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O YVTHEZNOKSAWRW-DCAQKATOSA-N 0.000 description 2
- BNYNOWJESJJIOI-XUXIUFHCSA-N Arg-Lys-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CCCN=C(N)N)N BNYNOWJESJJIOI-XUXIUFHCSA-N 0.000 description 2
- YTMKMRSYXHBGER-IHRRRGAJSA-N Arg-Phe-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N YTMKMRSYXHBGER-IHRRRGAJSA-N 0.000 description 2
- WKPXXXUSUHAXDE-SRVKXCTJSA-N Arg-Pro-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCN=C(N)N)C(O)=O WKPXXXUSUHAXDE-SRVKXCTJSA-N 0.000 description 2
- VENMDXUVHSKEIN-GUBZILKMSA-N Arg-Ser-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O VENMDXUVHSKEIN-GUBZILKMSA-N 0.000 description 2
- ICRHGPYYXMWHIE-LPEHRKFASA-N Arg-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CCCN=C(N)N)N)C(=O)O ICRHGPYYXMWHIE-LPEHRKFASA-N 0.000 description 2
- LRPZJPMQGKGHSG-XGEHTFHBSA-N Arg-Ser-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCCN=C(N)N)N)O LRPZJPMQGKGHSG-XGEHTFHBSA-N 0.000 description 2
- AUZAXCPWMDBWEE-HJGDQZAQSA-N Arg-Thr-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O AUZAXCPWMDBWEE-HJGDQZAQSA-N 0.000 description 2
- VYZBPPBKFCHCIS-WPRPVWTQSA-N Arg-Val-Gly Chemical compound OC(=O)CNC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCN=C(N)N VYZBPPBKFCHCIS-WPRPVWTQSA-N 0.000 description 2
- QLSRIZIDQXDQHK-RCWTZXSCSA-N Arg-Val-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O QLSRIZIDQXDQHK-RCWTZXSCSA-N 0.000 description 2
- UTSMXMABBPFVJP-SZMVWBNQSA-N Arg-Val-Trp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N UTSMXMABBPFVJP-SZMVWBNQSA-N 0.000 description 2
- ANAHQDPQQBDOBM-UHFFFAOYSA-N Arg-Val-Tyr Natural products CC(C)C(NC(=O)C(N)CCNC(=N)N)C(=O)NC(Cc1ccc(O)cc1)C(=O)O ANAHQDPQQBDOBM-UHFFFAOYSA-N 0.000 description 2
- PFOYSEIHFVKHNF-FXQIFTODSA-N Asn-Ala-Arg Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O PFOYSEIHFVKHNF-FXQIFTODSA-N 0.000 description 2
- XWGJDUSDTRPQRK-ZLUOBGJFSA-N Asn-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC(N)=O XWGJDUSDTRPQRK-ZLUOBGJFSA-N 0.000 description 2
- SPIPSJXLZVTXJL-ZLUOBGJFSA-N Asn-Cys-Ser Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@@H](CO)C(O)=O SPIPSJXLZVTXJL-ZLUOBGJFSA-N 0.000 description 2
- SRUUBQBAVNQZGJ-LAEOZQHASA-N Asn-Gln-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC(=O)N)N SRUUBQBAVNQZGJ-LAEOZQHASA-N 0.000 description 2
- GFFRWIJAFFMQGM-NUMRIWBASA-N Asn-Glu-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GFFRWIJAFFMQGM-NUMRIWBASA-N 0.000 description 2
- NVWJMQNYLYWVNQ-BYULHYEWSA-N Asn-Ile-Gly Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O NVWJMQNYLYWVNQ-BYULHYEWSA-N 0.000 description 2
- SPCONPVIDFMDJI-QSFUFRPTSA-N Asn-Ile-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O SPCONPVIDFMDJI-QSFUFRPTSA-N 0.000 description 2
- GLWFAWNYGWBMOC-SRVKXCTJSA-N Asn-Leu-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O GLWFAWNYGWBMOC-SRVKXCTJSA-N 0.000 description 2
- ZJIFRAPZHAGLGR-MELADBBJSA-N Asn-Phe-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)NC(=O)[C@H](CC(=O)N)N)C(=O)O ZJIFRAPZHAGLGR-MELADBBJSA-N 0.000 description 2
- QYRMBFWDSFGSFC-OLHMAJIHSA-N Asn-Thr-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N)O QYRMBFWDSFGSFC-OLHMAJIHSA-N 0.000 description 2
- JBDLMLZNDRLDIX-HJGDQZAQSA-N Asn-Thr-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O JBDLMLZNDRLDIX-HJGDQZAQSA-N 0.000 description 2
- BCADFFUQHIMQAA-KKHAAJSZSA-N Asn-Thr-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O BCADFFUQHIMQAA-KKHAAJSZSA-N 0.000 description 2
- CPYHLXSGDBDULY-IHPCNDPISA-N Asn-Trp-Phe Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O CPYHLXSGDBDULY-IHPCNDPISA-N 0.000 description 2
- JPPLRQVZMZFOSX-UWJYBYFXSA-N Asn-Tyr-Ala Chemical compound NC(=O)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C)C(O)=O)CC1=CC=C(O)C=C1 JPPLRQVZMZFOSX-UWJYBYFXSA-N 0.000 description 2
- KRXIWXCXOARFNT-ZLUOBGJFSA-N Asp-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC(O)=O KRXIWXCXOARFNT-ZLUOBGJFSA-N 0.000 description 2
- PBVLJOIPOGUQQP-CIUDSAMLSA-N Asp-Ala-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O PBVLJOIPOGUQQP-CIUDSAMLSA-N 0.000 description 2
- KNMRXHIAVXHCLW-ZLUOBGJFSA-N Asp-Asn-Ser Chemical compound C([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CO)C(=O)O)N)C(=O)O KNMRXHIAVXHCLW-ZLUOBGJFSA-N 0.000 description 2
- AKPLMZMNJGNUKT-ZLUOBGJFSA-N Asp-Asp-Cys Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CS)C(O)=O AKPLMZMNJGNUKT-ZLUOBGJFSA-N 0.000 description 2
- CELPEWWLSXMVPH-CIUDSAMLSA-N Asp-Asp-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC(O)=O CELPEWWLSXMVPH-CIUDSAMLSA-N 0.000 description 2
- NYQHSUGFEWDWPD-ACZMJKKPSA-N Asp-Gln-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC(=O)O)N NYQHSUGFEWDWPD-ACZMJKKPSA-N 0.000 description 2
- RSMIHCFQDCVVBR-CIUDSAMLSA-N Asp-Gln-Arg Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(O)=O)CCCNC(N)=N RSMIHCFQDCVVBR-CIUDSAMLSA-N 0.000 description 2
- VHQOCWWKXIOAQI-WDSKDSINSA-N Asp-Gln-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O VHQOCWWKXIOAQI-WDSKDSINSA-N 0.000 description 2
- YNCHFVRXEQFPBY-BQBZGAKWSA-N Asp-Gly-Arg Chemical compound OC(=O)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCN=C(N)N YNCHFVRXEQFPBY-BQBZGAKWSA-N 0.000 description 2
- HAFCJCDJGIOYPW-WDSKDSINSA-N Asp-Gly-Gln Chemical compound OC(=O)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(N)=O HAFCJCDJGIOYPW-WDSKDSINSA-N 0.000 description 2
- ZSVJVIOVABDTTL-YUMQZZPRSA-N Asp-Gly-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CC(=O)O)N ZSVJVIOVABDTTL-YUMQZZPRSA-N 0.000 description 2
- SVABRQFIHCSNCI-FOHZUACHSA-N Asp-Gly-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O SVABRQFIHCSNCI-FOHZUACHSA-N 0.000 description 2
- SPWXXPFDTMYTRI-IUKAMOBKSA-N Asp-Ile-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SPWXXPFDTMYTRI-IUKAMOBKSA-N 0.000 description 2
- DWOGMPWRQQWPPF-GUBZILKMSA-N Asp-Leu-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O DWOGMPWRQQWPPF-GUBZILKMSA-N 0.000 description 2
- UJGRZQYSNYTCAX-SRVKXCTJSA-N Asp-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC(O)=O UJGRZQYSNYTCAX-SRVKXCTJSA-N 0.000 description 2
- JXGJJQJHXHXJQF-CIUDSAMLSA-N Asp-Met-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCC(O)=O)C(O)=O JXGJJQJHXHXJQF-CIUDSAMLSA-N 0.000 description 2
- ZKAOJVJQGVUIIU-GUBZILKMSA-N Asp-Pro-Arg Chemical compound OC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(O)=O ZKAOJVJQGVUIIU-GUBZILKMSA-N 0.000 description 2
- KESWRFKUZRUTAH-FXQIFTODSA-N Asp-Pro-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O KESWRFKUZRUTAH-FXQIFTODSA-N 0.000 description 2
- ZBYLEBZCVKLPCY-FXQIFTODSA-N Asp-Ser-Arg Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O ZBYLEBZCVKLPCY-FXQIFTODSA-N 0.000 description 2
- YIDFBWRHIYOYAA-LKXGYXEUSA-N Asp-Ser-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O YIDFBWRHIYOYAA-LKXGYXEUSA-N 0.000 description 2
- JSHWXQIZOCVWIA-ZKWXMUAHSA-N Asp-Ser-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O JSHWXQIZOCVWIA-ZKWXMUAHSA-N 0.000 description 2
- MNQMTYSEKZHIDF-GCJQMDKQSA-N Asp-Thr-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O MNQMTYSEKZHIDF-GCJQMDKQSA-N 0.000 description 2
- JSNWZMFSLIWAHS-HJGDQZAQSA-N Asp-Thr-Leu Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)O)NC(=O)[C@H](CC(=O)O)N)O JSNWZMFSLIWAHS-HJGDQZAQSA-N 0.000 description 2
- RSMZEHCMIOKNMW-GSSVUCPTSA-N Asp-Thr-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O RSMZEHCMIOKNMW-GSSVUCPTSA-N 0.000 description 2
- ZVYYMCXVPZEAPU-CWRNSKLLSA-N Asp-Trp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CNC3=CC=CC=C32)NC(=O)[C@H](CC(=O)O)N)C(=O)O ZVYYMCXVPZEAPU-CWRNSKLLSA-N 0.000 description 2
- MFDPBZAFCRKYEY-LAEOZQHASA-N Asp-Val-Gln Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O MFDPBZAFCRKYEY-LAEOZQHASA-N 0.000 description 2
- ZUNMTUPRQMWMHX-LSJOCFKGSA-N Asp-Val-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C(C)C)C(O)=O ZUNMTUPRQMWMHX-LSJOCFKGSA-N 0.000 description 2
- 108091003079 Bovine Serum Albumin Proteins 0.000 description 2
- UXVMQQNJUSDDNG-UHFFFAOYSA-L Calcium chloride Chemical compound [Cl-].[Cl-].[Ca+2] UXVMQQNJUSDDNG-UHFFFAOYSA-L 0.000 description 2
- CURLTUGMZLYLDI-UHFFFAOYSA-N Carbon dioxide Chemical compound O=C=O CURLTUGMZLYLDI-UHFFFAOYSA-N 0.000 description 2
- HEDRZPFGACZZDS-UHFFFAOYSA-N Chloroform Chemical compound ClC(Cl)Cl HEDRZPFGACZZDS-UHFFFAOYSA-N 0.000 description 2
- 108091035707 Consensus sequence Proteins 0.000 description 2
- AMRLSQGGERHDHJ-FXQIFTODSA-N Cys-Ala-Arg Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O AMRLSQGGERHDHJ-FXQIFTODSA-N 0.000 description 2
- BYALSSDCQYHKMY-XGEHTFHBSA-N Cys-Arg-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CS)N)O BYALSSDCQYHKMY-XGEHTFHBSA-N 0.000 description 2
- CFQVGYWKSLKWFX-KBIXCLLPSA-N Cys-Glu-Ile Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O CFQVGYWKSLKWFX-KBIXCLLPSA-N 0.000 description 2
- BSFFNUBDVYTDMV-WHFBIAKZSA-N Cys-Gly-Asn Chemical compound [H]N[C@@H](CS)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O BSFFNUBDVYTDMV-WHFBIAKZSA-N 0.000 description 2
- SKSJPIBFNFPTJB-NKWVEPMBSA-N Cys-Gly-Pro Chemical compound C1C[C@@H](N(C1)C(=O)CNC(=O)[C@H](CS)N)C(=O)O SKSJPIBFNFPTJB-NKWVEPMBSA-N 0.000 description 2
- CUXIOFHFFXNUGG-HTFCKZLJSA-N Cys-Ile-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)NC(=O)[C@H](CS)N CUXIOFHFFXNUGG-HTFCKZLJSA-N 0.000 description 2
- XXDATQFUGMAJRV-XIRDDKMYSA-N Cys-Leu-Trp Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O XXDATQFUGMAJRV-XIRDDKMYSA-N 0.000 description 2
- HSAWNMMTZCLTPY-DCAQKATOSA-N Cys-Met-Leu Chemical compound SC[C@H](N)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(O)=O HSAWNMMTZCLTPY-DCAQKATOSA-N 0.000 description 2
- QQOWCDCBFFBRQH-IXOXFDKPSA-N Cys-Phe-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](CS)N)O QQOWCDCBFFBRQH-IXOXFDKPSA-N 0.000 description 2
- RAGIABZNLPZBGS-FXQIFTODSA-N Cys-Pro-Cys Chemical compound N[C@@H](CS)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CS)C(O)=O RAGIABZNLPZBGS-FXQIFTODSA-N 0.000 description 2
- KVCJEMHFLGVINV-ZLUOBGJFSA-N Cys-Ser-Asn Chemical compound SC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC(N)=O KVCJEMHFLGVINV-ZLUOBGJFSA-N 0.000 description 2
- SRZZZTMJARUVPI-JBDRJPRFSA-N Cys-Ser-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CS)N SRZZZTMJARUVPI-JBDRJPRFSA-N 0.000 description 2
- 102000012410 DNA Ligases Human genes 0.000 description 2
- 108010061982 DNA Ligases Proteins 0.000 description 2
- LFQSCWFLJHTTHZ-UHFFFAOYSA-N Ethanol Chemical compound CCO LFQSCWFLJHTTHZ-UHFFFAOYSA-N 0.000 description 2
- LKUWAWGNJYJODH-KBIXCLLPSA-N Gln-Ala-Ile Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O LKUWAWGNJYJODH-KBIXCLLPSA-N 0.000 description 2
- RGXXLQWXBFNXTG-CIUDSAMLSA-N Gln-Arg-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(O)=O RGXXLQWXBFNXTG-CIUDSAMLSA-N 0.000 description 2
- KYFSMWLWHYZRNW-ACZMJKKPSA-N Gln-Asp-Cys Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CS)C(=O)O)N KYFSMWLWHYZRNW-ACZMJKKPSA-N 0.000 description 2
- XEYMBRRKIFYQMF-GUBZILKMSA-N Gln-Asp-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O XEYMBRRKIFYQMF-GUBZILKMSA-N 0.000 description 2
- XQEAVUJIRZRLQQ-SZMVWBNQSA-N Gln-His-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CC3=CN=CN3)NC(=O)[C@H](CCC(=O)N)N XQEAVUJIRZRLQQ-SZMVWBNQSA-N 0.000 description 2
- HYPVLWGNBIYTNA-GUBZILKMSA-N Gln-Leu-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O HYPVLWGNBIYTNA-GUBZILKMSA-N 0.000 description 2
- LURQDGKYBFWWJA-MNXVOIDGSA-N Gln-Lys-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CCC(=O)N)N LURQDGKYBFWWJA-MNXVOIDGSA-N 0.000 description 2
- KLKYKPXITJBSNI-CIUDSAMLSA-N Gln-Met-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C)C(O)=O KLKYKPXITJBSNI-CIUDSAMLSA-N 0.000 description 2
- FTMLQFPULNGION-ZVZYQTTQSA-N Gln-Val-Trp Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O FTMLQFPULNGION-ZVZYQTTQSA-N 0.000 description 2
- OGMQXTXGLDNBSS-FXQIFTODSA-N Glu-Ala-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(O)=O OGMQXTXGLDNBSS-FXQIFTODSA-N 0.000 description 2
- IYAUFWMUCGBFMQ-CIUDSAMLSA-N Glu-Arg-Cys Chemical compound C(C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCC(=O)O)N)CN=C(N)N IYAUFWMUCGBFMQ-CIUDSAMLSA-N 0.000 description 2
- NLKVNZUFDPWPNL-YUMQZZPRSA-N Glu-Arg-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O NLKVNZUFDPWPNL-YUMQZZPRSA-N 0.000 description 2
- CGOHAEBMDSEKFB-FXQIFTODSA-N Glu-Glu-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O CGOHAEBMDSEKFB-FXQIFTODSA-N 0.000 description 2
- NUSWUSKZRCGFEX-FXQIFTODSA-N Glu-Glu-Cys Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CS)C(O)=O NUSWUSKZRCGFEX-FXQIFTODSA-N 0.000 description 2
- IQACOVZVOMVILH-FXQIFTODSA-N Glu-Glu-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O IQACOVZVOMVILH-FXQIFTODSA-N 0.000 description 2
- QJCKNLPMTPXXEM-AUTRQRHGSA-N Glu-Glu-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCC(O)=O QJCKNLPMTPXXEM-AUTRQRHGSA-N 0.000 description 2
- BRKUZSLQMPNVFN-SRVKXCTJSA-N Glu-His-Arg Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O BRKUZSLQMPNVFN-SRVKXCTJSA-N 0.000 description 2
- VMKCPNBBPGGQBJ-GUBZILKMSA-N Glu-Leu-Asn Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCC(=O)O)N VMKCPNBBPGGQBJ-GUBZILKMSA-N 0.000 description 2
- OCJRHJZKGGSPRW-IUCAKERBSA-N Glu-Lys-Gly Chemical compound NCCCC[C@@H](C(=O)NCC(O)=O)NC(=O)[C@@H](N)CCC(O)=O OCJRHJZKGGSPRW-IUCAKERBSA-N 0.000 description 2
- JYXKPJVDCAWMDG-ZPFDUUQYSA-N Glu-Pro-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CCC(=O)O)N JYXKPJVDCAWMDG-ZPFDUUQYSA-N 0.000 description 2
- ALMBZBOCGSVSAI-ACZMJKKPSA-N Glu-Ser-Asn Chemical compound C(CC(=O)O)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(=O)N)C(=O)O)N ALMBZBOCGSVSAI-ACZMJKKPSA-N 0.000 description 2
- MRWYPDWDZSLWJM-ACZMJKKPSA-N Glu-Ser-Asp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O MRWYPDWDZSLWJM-ACZMJKKPSA-N 0.000 description 2
- YQAQQKPWFOBSMU-WDCWCFNPSA-N Glu-Thr-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O YQAQQKPWFOBSMU-WDCWCFNPSA-N 0.000 description 2
- DLISPGXMKZTWQG-IFFSRLJSSA-N Glu-Thr-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O DLISPGXMKZTWQG-IFFSRLJSSA-N 0.000 description 2
- HBMRTXJZQDVRFT-DZKIICNBSA-N Glu-Tyr-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(O)=O HBMRTXJZQDVRFT-DZKIICNBSA-N 0.000 description 2
- WGYHAAXZWPEBDQ-IFFSRLJSSA-N Glu-Val-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O WGYHAAXZWPEBDQ-IFFSRLJSSA-N 0.000 description 2
- QSDKBRMVXSWAQE-BFHQHQDPSA-N Gly-Ala-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)CN QSDKBRMVXSWAQE-BFHQHQDPSA-N 0.000 description 2
- QIZJOTQTCAGKPU-KWQFWETISA-N Gly-Ala-Tyr Chemical compound [NH3+]CC(=O)N[C@@H](C)C(=O)N[C@H](C([O-])=O)CC1=CC=C(O)C=C1 QIZJOTQTCAGKPU-KWQFWETISA-N 0.000 description 2
- JRDYDYXZKFNNRQ-XPUUQOCRSA-N Gly-Ala-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)CN JRDYDYXZKFNNRQ-XPUUQOCRSA-N 0.000 description 2
- BGVYNAQWHSTTSP-BYULHYEWSA-N Gly-Asn-Ile Chemical compound [H]NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O BGVYNAQWHSTTSP-BYULHYEWSA-N 0.000 description 2
- QSTLUOIOYLYLLF-WDSKDSINSA-N Gly-Asp-Glu Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O QSTLUOIOYLYLLF-WDSKDSINSA-N 0.000 description 2
- XBWMTPAIUQIWKA-BYULHYEWSA-N Gly-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)CN XBWMTPAIUQIWKA-BYULHYEWSA-N 0.000 description 2
- VNBNZUAPOYGRDB-ZDLURKLDSA-N Gly-Cys-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)CN)O VNBNZUAPOYGRDB-ZDLURKLDSA-N 0.000 description 2
- QPTNELDXWKRIFX-YFKPBYRVSA-N Gly-Gly-Gln Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CCC(N)=O QPTNELDXWKRIFX-YFKPBYRVSA-N 0.000 description 2
- UQJNXZSSGQIPIQ-FBCQKBJTSA-N Gly-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)CN UQJNXZSSGQIPIQ-FBCQKBJTSA-N 0.000 description 2
- NNCSJUBVFBDDLC-YUMQZZPRSA-N Gly-Leu-Ser Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O NNCSJUBVFBDDLC-YUMQZZPRSA-N 0.000 description 2
- MIIVFRCYJABHTQ-ONGXEEELSA-N Gly-Leu-Val Chemical compound [H]NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O MIIVFRCYJABHTQ-ONGXEEELSA-N 0.000 description 2
- VBOBNHSVQKKTOT-YUMQZZPRSA-N Gly-Lys-Ala Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O VBOBNHSVQKKTOT-YUMQZZPRSA-N 0.000 description 2
- WMGHDYWNHNLGBV-ONGXEEELSA-N Gly-Phe-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H](NC(=O)CN)CC1=CC=CC=C1 WMGHDYWNHNLGBV-ONGXEEELSA-N 0.000 description 2
- JYPCXBJRLBHWME-IUCAKERBSA-N Gly-Pro-Arg Chemical compound NCC(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(O)=O JYPCXBJRLBHWME-IUCAKERBSA-N 0.000 description 2
- OOCFXNOVSLSHAB-IUCAKERBSA-N Gly-Pro-Pro Chemical compound NCC(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 OOCFXNOVSLSHAB-IUCAKERBSA-N 0.000 description 2
- YOBGUCWZPXJHTN-BQBZGAKWSA-N Gly-Ser-Arg Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCN=C(N)N YOBGUCWZPXJHTN-BQBZGAKWSA-N 0.000 description 2
- WNGHUXFWEWTKAO-YUMQZZPRSA-N Gly-Ser-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)CN WNGHUXFWEWTKAO-YUMQZZPRSA-N 0.000 description 2
- ABPRMMYHROQBLY-NKWVEPMBSA-N Gly-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)CN)C(=O)O ABPRMMYHROQBLY-NKWVEPMBSA-N 0.000 description 2
- PASHZZBXZYEXFE-LSDHHAIUSA-N Gly-Trp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CNC3=CC=CC=C32)NC(=O)CN)C(=O)O PASHZZBXZYEXFE-LSDHHAIUSA-N 0.000 description 2
- HQSKKSLNLSTONK-JTQLQIEISA-N Gly-Tyr-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)CN)CC1=CC=C(O)C=C1 HQSKKSLNLSTONK-JTQLQIEISA-N 0.000 description 2
- COZMNNJEGNPDED-HOCLYGCPSA-N Gly-Val-Trp Chemical compound [H]NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O COZMNNJEGNPDED-HOCLYGCPSA-N 0.000 description 2
- 239000007995 HEPES buffer Substances 0.000 description 2
- LVXFNTIIGOQBMD-SRVKXCTJSA-N His-Leu-Ser Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O LVXFNTIIGOQBMD-SRVKXCTJSA-N 0.000 description 2
- MDOBWSFNSNPENN-PMVVWTBXSA-N His-Thr-Gly Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O MDOBWSFNSNPENN-PMVVWTBXSA-N 0.000 description 2
- WSXNWASHQNSMRX-GVXVVHGQSA-N His-Val-Gln Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N WSXNWASHQNSMRX-GVXVVHGQSA-N 0.000 description 2
- 101000600434 Homo sapiens Putative uncharacterized protein encoded by MIR7-3HG Proteins 0.000 description 2
- UMYZBHKAVTXWIW-GMOBBJLQSA-N Ile-Asp-Arg Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N UMYZBHKAVTXWIW-GMOBBJLQSA-N 0.000 description 2
- OVPYIUNCVSOVNF-ZPFDUUQYSA-N Ile-Gln-Pro Natural products CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N1CCC[C@H]1C(O)=O OVPYIUNCVSOVNF-ZPFDUUQYSA-N 0.000 description 2
- LGMUPVWZEYYUMU-YVNDNENWSA-N Ile-Glu-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N LGMUPVWZEYYUMU-YVNDNENWSA-N 0.000 description 2
- DMSVBUWGDLYNLC-IAVJCBSLSA-N Ile-Ile-Phe Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 DMSVBUWGDLYNLC-IAVJCBSLSA-N 0.000 description 2
- TWYOYAKMLHWMOJ-ZPFDUUQYSA-N Ile-Leu-Asn Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O TWYOYAKMLHWMOJ-ZPFDUUQYSA-N 0.000 description 2
- IIWQTXMUALXGOV-PCBIJLKTSA-N Ile-Phe-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(=O)O)C(=O)O)N IIWQTXMUALXGOV-PCBIJLKTSA-N 0.000 description 2
- FGBRXCZYVRFNKQ-MXAVVETBSA-N Ile-Phe-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)O)N FGBRXCZYVRFNKQ-MXAVVETBSA-N 0.000 description 2
- IVXJIMGDOYRLQU-XUXIUFHCSA-N Ile-Pro-Leu Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O IVXJIMGDOYRLQU-XUXIUFHCSA-N 0.000 description 2
- CIJLNXXMDUOFPH-HJWJTTGWSA-N Ile-Pro-Phe Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 CIJLNXXMDUOFPH-HJWJTTGWSA-N 0.000 description 2
- QHUREMVLLMNUAX-OSUNSFLBSA-N Ile-Thr-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)O)N QHUREMVLLMNUAX-OSUNSFLBSA-N 0.000 description 2
- WRDTXMBPHMBGIB-STECZYCISA-N Ile-Tyr-Val Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C(C)C)C(O)=O)CC1=CC=C(O)C=C1 WRDTXMBPHMBGIB-STECZYCISA-N 0.000 description 2
- KXUKTDGKLAOCQK-LSJOCFKGSA-N Ile-Val-Gly Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O KXUKTDGKLAOCQK-LSJOCFKGSA-N 0.000 description 2
- ZSESFIFAYQEKRD-CYDGBPFRSA-N Ile-Val-Met Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCSC)C(=O)O)N ZSESFIFAYQEKRD-CYDGBPFRSA-N 0.000 description 2
- YHFPHRUWZMEOIX-CYDGBPFRSA-N Ile-Val-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C(C)C)C(=O)O)N YHFPHRUWZMEOIX-CYDGBPFRSA-N 0.000 description 2
- KFZMGEQAYNKOFK-UHFFFAOYSA-N Isopropanol Chemical compound CC(C)O KFZMGEQAYNKOFK-UHFFFAOYSA-N 0.000 description 2
- UGTHTQWIQKEDEH-BQBZGAKWSA-N L-alanyl-L-prolylglycine zwitterion Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O UGTHTQWIQKEDEH-BQBZGAKWSA-N 0.000 description 2
- KVRKAGGMEWNURO-CIUDSAMLSA-N Leu-Ala-Cys Chemical compound C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC(C)C)N KVRKAGGMEWNURO-CIUDSAMLSA-N 0.000 description 2
- MJOZZTKJZQFKDK-GUBZILKMSA-N Leu-Ala-Gln Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCC(N)=O MJOZZTKJZQFKDK-GUBZILKMSA-N 0.000 description 2
- CQQGCWPXDHTTNF-GUBZILKMSA-N Leu-Ala-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCC(O)=O CQQGCWPXDHTTNF-GUBZILKMSA-N 0.000 description 2
- XYUBOFCTGPZFSA-WDSOQIARSA-N Leu-Arg-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@@H](N)CC(C)C)C(O)=O)=CNC2=C1 XYUBOFCTGPZFSA-WDSOQIARSA-N 0.000 description 2
- DUBAVOVZNZKEQQ-AVGNSLFASA-N Leu-Arg-Val Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C(C)C)C(O)=O)CCCN=C(N)N DUBAVOVZNZKEQQ-AVGNSLFASA-N 0.000 description 2
- JKGHDYGZRDWHGA-SRVKXCTJSA-N Leu-Asn-Leu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O JKGHDYGZRDWHGA-SRVKXCTJSA-N 0.000 description 2
- WGNOPSQMIQERPK-UHFFFAOYSA-N Leu-Asn-Pro Natural products CC(C)CC(N)C(=O)NC(CC(=O)N)C(=O)N1CCCC1C(=O)O WGNOPSQMIQERPK-UHFFFAOYSA-N 0.000 description 2
- CIVKXGPFXDIQBV-WDCWCFNPSA-N Leu-Gln-Thr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O CIVKXGPFXDIQBV-WDCWCFNPSA-N 0.000 description 2
- VGPCJSXPPOQPBK-YUMQZZPRSA-N Leu-Gly-Ser Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O VGPCJSXPPOQPBK-YUMQZZPRSA-N 0.000 description 2
- POZULHZYLPGXMR-ONGXEEELSA-N Leu-Gly-Val Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O POZULHZYLPGXMR-ONGXEEELSA-N 0.000 description 2
- JFSGIJSCJFQGSZ-MXAVVETBSA-N Leu-Ile-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC(C)C)N JFSGIJSCJFQGSZ-MXAVVETBSA-N 0.000 description 2
- HRTRLSRYZZKPCO-BJDJZHNGSA-N Leu-Ile-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O HRTRLSRYZZKPCO-BJDJZHNGSA-N 0.000 description 2
- NHRINZSPIUXYQZ-DCAQKATOSA-N Leu-Met-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CS)C(=O)O)N NHRINZSPIUXYQZ-DCAQKATOSA-N 0.000 description 2
- FZMNAYBEFGZEIF-AVGNSLFASA-N Leu-Met-Met Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCSC)C(=O)O)N FZMNAYBEFGZEIF-AVGNSLFASA-N 0.000 description 2
- MJTOYIHCKVQICL-ULQDDVLXSA-N Leu-Met-Phe Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N MJTOYIHCKVQICL-ULQDDVLXSA-N 0.000 description 2
- BIZNDKMFQHDOIE-KKUMJFAQSA-N Leu-Phe-Asn Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(N)=O)C(O)=O)CC1=CC=CC=C1 BIZNDKMFQHDOIE-KKUMJFAQSA-N 0.000 description 2
- PTRKPHUGYULXPU-KKUMJFAQSA-N Leu-Phe-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O PTRKPHUGYULXPU-KKUMJFAQSA-N 0.000 description 2
- QMKFDEUJGYNFMC-AVGNSLFASA-N Leu-Pro-Arg Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCN=C(N)N)C(O)=O QMKFDEUJGYNFMC-AVGNSLFASA-N 0.000 description 2
- UCXQIIIFOOGYEM-ULQDDVLXSA-N Leu-Pro-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 UCXQIIIFOOGYEM-ULQDDVLXSA-N 0.000 description 2
- KZZCOWMDDXDKSS-CIUDSAMLSA-N Leu-Ser-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O KZZCOWMDDXDKSS-CIUDSAMLSA-N 0.000 description 2
- XOWMDXHFSBCAKQ-SRVKXCTJSA-N Leu-Ser-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC(C)C XOWMDXHFSBCAKQ-SRVKXCTJSA-N 0.000 description 2
- LJBVRCDPWOJOEK-PPCPHDFISA-N Leu-Thr-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O LJBVRCDPWOJOEK-PPCPHDFISA-N 0.000 description 2
- GZRABTMNWJXFMH-UVOCVTCTSA-N Leu-Thr-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GZRABTMNWJXFMH-UVOCVTCTSA-N 0.000 description 2
- AIQWYVFNBNNOLU-RHYQMDGZSA-N Leu-Thr-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O AIQWYVFNBNNOLU-RHYQMDGZSA-N 0.000 description 2
- VHTIZYYHIUHMCA-JYJNAYRXSA-N Leu-Tyr-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O VHTIZYYHIUHMCA-JYJNAYRXSA-N 0.000 description 2
- YIBOAHAOAWACDK-QEJZJMRPSA-N Lys-Ala-Phe Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 YIBOAHAOAWACDK-QEJZJMRPSA-N 0.000 description 2
- RLZDUFRBMQNYIJ-YUMQZZPRSA-N Lys-Cys-Gly Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CS)C(=O)NCC(=O)O)N RLZDUFRBMQNYIJ-YUMQZZPRSA-N 0.000 description 2
- MXMDJEJWERYPMO-XUXIUFHCSA-N Lys-Ile-Arg Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O MXMDJEJWERYPMO-XUXIUFHCSA-N 0.000 description 2
- OIQSIMFSVLLWBX-VOAKCMCISA-N Lys-Leu-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OIQSIMFSVLLWBX-VOAKCMCISA-N 0.000 description 2
- LJADEBULDNKJNK-IHRRRGAJSA-N Lys-Leu-Val Chemical compound CC(C)C[C@H](NC(=O)[C@@H](N)CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O LJADEBULDNKJNK-IHRRRGAJSA-N 0.000 description 2
- LUTDBHBIHHREDC-IHRRRGAJSA-N Lys-Pro-Lys Chemical compound NCCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(O)=O LUTDBHBIHHREDC-IHRRRGAJSA-N 0.000 description 2
- ZUGVARDEGWMMLK-SRVKXCTJSA-N Lys-Ser-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCCN ZUGVARDEGWMMLK-SRVKXCTJSA-N 0.000 description 2
- CUHGAUZONORRIC-HJGDQZAQSA-N Lys-Thr-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCCCN)N)O CUHGAUZONORRIC-HJGDQZAQSA-N 0.000 description 2
- BDFHWFUAQLIMJO-KXNHARMFSA-N Lys-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCCN)N)O BDFHWFUAQLIMJO-KXNHARMFSA-N 0.000 description 2
- OHXUUQDOBQKSNB-AVGNSLFASA-N Lys-Val-Arg Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O OHXUUQDOBQKSNB-AVGNSLFASA-N 0.000 description 2
- BXNZDLVLGYYFIB-FXQIFTODSA-N Met-Asn-Cys Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CS)C(=O)O)N BXNZDLVLGYYFIB-FXQIFTODSA-N 0.000 description 2
- HDNOQCZWJGGHSS-VEVYYDQMSA-N Met-Asn-Thr Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O HDNOQCZWJGGHSS-VEVYYDQMSA-N 0.000 description 2
- FBQMBZLJHOQAIH-GUBZILKMSA-N Met-Asp-Met Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCSC)C(O)=O FBQMBZLJHOQAIH-GUBZILKMSA-N 0.000 description 2
- YKWHHKDMBZBMLG-GUBZILKMSA-N Met-Cys-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CCSC)N YKWHHKDMBZBMLG-GUBZILKMSA-N 0.000 description 2
- SXWQMBGNFXAGAT-FJXKBIBVSA-N Met-Gly-Thr Chemical compound CSCC[C@H](N)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O SXWQMBGNFXAGAT-FJXKBIBVSA-N 0.000 description 2
- BCRQJDMZQUHQSV-STQMWFEESA-N Met-Gly-Tyr Chemical compound [H]N[C@@H](CCSC)C(=O)NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O BCRQJDMZQUHQSV-STQMWFEESA-N 0.000 description 2
- SCKPOOMCTFEVTN-QTKMDUPCSA-N Met-His-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CCSC)N)O SCKPOOMCTFEVTN-QTKMDUPCSA-N 0.000 description 2
- HSJIGJRZYUADSS-IHRRRGAJSA-N Met-Lys-Leu Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O HSJIGJRZYUADSS-IHRRRGAJSA-N 0.000 description 2
- 102000035195 Peptidases Human genes 0.000 description 2
- YYRCPTVAPLQRNC-ULQDDVLXSA-N Phe-Arg-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCNC(N)=N)NC(=O)[C@@H](N)CC1=CC=CC=C1 YYRCPTVAPLQRNC-ULQDDVLXSA-N 0.000 description 2
- LJUUGSWZPQOJKD-JYJNAYRXSA-N Phe-Arg-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CCCNC(N)=N)NC(=O)[C@@H](N)Cc1ccccc1)C(O)=O LJUUGSWZPQOJKD-JYJNAYRXSA-N 0.000 description 2
- IUVYJBMTHARMIP-PCBIJLKTSA-N Phe-Asp-Ile Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O IUVYJBMTHARMIP-PCBIJLKTSA-N 0.000 description 2
- JJHVFCUWLSKADD-ONGXEEELSA-N Phe-Gly-Ala Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)NCC(=O)N[C@@H](C)C(O)=O JJHVFCUWLSKADD-ONGXEEELSA-N 0.000 description 2
- BPCLGWHVPVTTFM-QWRGUYRKSA-N Phe-Ser-Gly Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)NCC(O)=O BPCLGWHVPVTTFM-QWRGUYRKSA-N 0.000 description 2
- CXMSESHALPOLRE-MEYUZBJRSA-N Phe-Thr-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC2=CC=CC=C2)N)O CXMSESHALPOLRE-MEYUZBJRSA-N 0.000 description 2
- SHUFSZDAIPLZLF-BEAPCOKYSA-N Phe-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CC=CC=C2)N)O SHUFSZDAIPLZLF-BEAPCOKYSA-N 0.000 description 2
- BQMFWUKNOCJDNV-HJWJTTGWSA-N Phe-Val-Ile Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O BQMFWUKNOCJDNV-HJWJTTGWSA-N 0.000 description 2
- DZZCICYRSZASNF-FXQIFTODSA-N Pro-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1 DZZCICYRSZASNF-FXQIFTODSA-N 0.000 description 2
- FZHBZMDRDASUHN-NAKRPEOUSA-N Pro-Ala-Ile Chemical compound CC[C@H](C)[C@H](NC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1)C(O)=O FZHBZMDRDASUHN-NAKRPEOUSA-N 0.000 description 2
- CGBYDGAJHSOGFQ-LPEHRKFASA-N Pro-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@@H]2CCCN2 CGBYDGAJHSOGFQ-LPEHRKFASA-N 0.000 description 2
- OLHDPZMYUSBGDE-GUBZILKMSA-N Pro-Arg-Cys Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CS)C(=O)O OLHDPZMYUSBGDE-GUBZILKMSA-N 0.000 description 2
- QBFONMUYNSNKIX-AVGNSLFASA-N Pro-Arg-His Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O QBFONMUYNSNKIX-AVGNSLFASA-N 0.000 description 2
- SWXSLPHTJVAWDF-VEVYYDQMSA-N Pro-Asn-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SWXSLPHTJVAWDF-VEVYYDQMSA-N 0.000 description 2
- FKKHDBFNOLCYQM-FXQIFTODSA-N Pro-Cys-Ala Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CS)C(=O)N[C@@H](C)C(O)=O FKKHDBFNOLCYQM-FXQIFTODSA-N 0.000 description 2
- SZZBUDVXWZZPDH-BQBZGAKWSA-N Pro-Cys-Gly Chemical compound OC(=O)CNC(=O)[C@H](CS)NC(=O)[C@@H]1CCCN1 SZZBUDVXWZZPDH-BQBZGAKWSA-N 0.000 description 2
- ZPPVJIJMIKTERM-YUMQZZPRSA-N Pro-Gln-Gly Chemical compound OC(=O)CNC(=O)[C@H](CCC(=O)N)NC(=O)[C@@H]1CCCN1 ZPPVJIJMIKTERM-YUMQZZPRSA-N 0.000 description 2
- LANQLYHLMYDWJP-SRVKXCTJSA-N Pro-Gln-Lys Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O LANQLYHLMYDWJP-SRVKXCTJSA-N 0.000 description 2
- ULIWFCCJIOEHMU-BQBZGAKWSA-N Pro-Gly-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H]1CCCN1 ULIWFCCJIOEHMU-BQBZGAKWSA-N 0.000 description 2
- BBFRBZYKHIKFBX-GMOBBJLQSA-N Pro-Ile-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@@H]1CCCN1 BBFRBZYKHIKFBX-GMOBBJLQSA-N 0.000 description 2
- FXGIMYRVJJEIIM-UWVGGRQHSA-N Pro-Leu-Gly Chemical compound OC(=O)CNC(=O)[C@H](CC(C)C)NC(=O)[C@@H]1CCCN1 FXGIMYRVJJEIIM-UWVGGRQHSA-N 0.000 description 2
- ABSSTGUCBCDKMU-UWVGGRQHSA-N Pro-Lys-Gly Chemical compound NCCCC[C@@H](C(=O)NCC(O)=O)NC(=O)[C@@H]1CCCN1 ABSSTGUCBCDKMU-UWVGGRQHSA-N 0.000 description 2
- HBBBLSVBQGZKOZ-GUBZILKMSA-N Pro-Met-Ala Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C)C(O)=O HBBBLSVBQGZKOZ-GUBZILKMSA-N 0.000 description 2
- KDBHVPXBQADZKY-GUBZILKMSA-N Pro-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 KDBHVPXBQADZKY-GUBZILKMSA-N 0.000 description 2
- LEIKGVHQTKHOLM-IUCAKERBSA-N Pro-Pro-Gly Chemical compound OC(=O)CNC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 LEIKGVHQTKHOLM-IUCAKERBSA-N 0.000 description 2
- SBVPYBFMIGDIDX-SRVKXCTJSA-N Pro-Pro-Pro Chemical compound OC(=O)[C@@H]1CCCN1C(=O)[C@H]1N(C(=O)[C@H]2NCCC2)CCC1 SBVPYBFMIGDIDX-SRVKXCTJSA-N 0.000 description 2
- BGWKULMLUIUPKY-BQBZGAKWSA-N Pro-Ser-Gly Chemical compound OC(=O)CNC(=O)[C@H](CO)NC(=O)[C@@H]1CCCN1 BGWKULMLUIUPKY-BQBZGAKWSA-N 0.000 description 2
- LNICFEXCAHIJOR-DCAQKATOSA-N Pro-Ser-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O LNICFEXCAHIJOR-DCAQKATOSA-N 0.000 description 2
- XSXABUHLKPUVLX-JYJNAYRXSA-N Pro-Ser-Trp Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC2=CNC3=CC=CC=C32)C(=O)O XSXABUHLKPUVLX-JYJNAYRXSA-N 0.000 description 2
- PRKWBYCXBBSLSK-GUBZILKMSA-N Pro-Ser-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O PRKWBYCXBBSLSK-GUBZILKMSA-N 0.000 description 2
- FDMCIBSQRKFSTJ-RHYQMDGZSA-N Pro-Thr-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O FDMCIBSQRKFSTJ-RHYQMDGZSA-N 0.000 description 2
- VVAWNPIOYXAMAL-KJEVXHAQSA-N Pro-Thr-Tyr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O VVAWNPIOYXAMAL-KJEVXHAQSA-N 0.000 description 2
- BVTYXOFTHDXSNI-IHRRRGAJSA-N Pro-Tyr-Cys Chemical compound C([C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H]1NCCC1)C1=CC=C(O)C=C1 BVTYXOFTHDXSNI-IHRRRGAJSA-N 0.000 description 2
- STGVYUTZKGPRCI-GUBZILKMSA-N Pro-Val-Cys Chemical compound SC[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H]1CCCN1 STGVYUTZKGPRCI-GUBZILKMSA-N 0.000 description 2
- IMNVAOPEMFDAQD-NHCYSSNCSA-N Pro-Val-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O IMNVAOPEMFDAQD-NHCYSSNCSA-N 0.000 description 2
- 229940124158 Protease/peptidase inhibitor Drugs 0.000 description 2
- 102100037401 Putative uncharacterized protein encoded by MIR7-3HG Human genes 0.000 description 2
- 239000013614 RNA sample Substances 0.000 description 2
- ZUGXSSFMTXKHJS-ZLUOBGJFSA-N Ser-Ala-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O ZUGXSSFMTXKHJS-ZLUOBGJFSA-N 0.000 description 2
- WTWGOQRNRFHFQD-JBDRJPRFSA-N Ser-Ala-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WTWGOQRNRFHFQD-JBDRJPRFSA-N 0.000 description 2
- IDQFQFVEWMWRQQ-DLOVCJGASA-N Ser-Ala-Phe Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O IDQFQFVEWMWRQQ-DLOVCJGASA-N 0.000 description 2
- BRKHVZNDAOMAHX-BIIVOSGPSA-N Ser-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CO)N BRKHVZNDAOMAHX-BIIVOSGPSA-N 0.000 description 2
- YQHZVYJAGWMHES-ZLUOBGJFSA-N Ser-Ala-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O YQHZVYJAGWMHES-ZLUOBGJFSA-N 0.000 description 2
- BGOWRLSWJCVYAQ-CIUDSAMLSA-N Ser-Asp-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O BGOWRLSWJCVYAQ-CIUDSAMLSA-N 0.000 description 2
- CDVFZMOFNJPUDD-ACZMJKKPSA-N Ser-Gln-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O CDVFZMOFNJPUDD-ACZMJKKPSA-N 0.000 description 2
- OJPHFSOMBZKQKQ-GUBZILKMSA-N Ser-Gln-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CO OJPHFSOMBZKQKQ-GUBZILKMSA-N 0.000 description 2
- LALNXSXEYFUUDD-GUBZILKMSA-N Ser-Glu-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O LALNXSXEYFUUDD-GUBZILKMSA-N 0.000 description 2
- OHKFXGKHSJKKAL-NRPADANISA-N Ser-Glu-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O OHKFXGKHSJKKAL-NRPADANISA-N 0.000 description 2
- MUARUIBTKQJKFY-WHFBIAKZSA-N Ser-Gly-Asp Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O MUARUIBTKQJKFY-WHFBIAKZSA-N 0.000 description 2
- WSTIOCFMWXNOCX-YUMQZZPRSA-N Ser-Gly-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CO)N WSTIOCFMWXNOCX-YUMQZZPRSA-N 0.000 description 2
- BEAFYHFQTOTVFS-VGDYDELISA-N Ser-Ile-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CO)N BEAFYHFQTOTVFS-VGDYDELISA-N 0.000 description 2
- LWMQRHDTXHQQOV-MXAVVETBSA-N Ser-Ile-Phe Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O LWMQRHDTXHQQOV-MXAVVETBSA-N 0.000 description 2
- CRJZZXMAADSBBQ-SRVKXCTJSA-N Ser-Lys-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CO CRJZZXMAADSBBQ-SRVKXCTJSA-N 0.000 description 2
- XNXRTQZTFVMJIJ-DCAQKATOSA-N Ser-Met-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(O)=O XNXRTQZTFVMJIJ-DCAQKATOSA-N 0.000 description 2
- JAWGSPUJAXYXJA-IHRRRGAJSA-N Ser-Phe-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](CO)N)CC1=CC=CC=C1 JAWGSPUJAXYXJA-IHRRRGAJSA-N 0.000 description 2
- XKFJENWJGHMDLI-QWRGUYRKSA-N Ser-Phe-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)NCC(O)=O XKFJENWJGHMDLI-QWRGUYRKSA-N 0.000 description 2
- RWDVVSKYZBNDCO-MELADBBJSA-N Ser-Phe-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)NC(=O)[C@H](CO)N)C(=O)O RWDVVSKYZBNDCO-MELADBBJSA-N 0.000 description 2
- RHAPJNVNWDBFQI-BQBZGAKWSA-N Ser-Pro-Gly Chemical compound OC[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O RHAPJNVNWDBFQI-BQBZGAKWSA-N 0.000 description 2
- HHJFMHQYEAAOBM-ZLUOBGJFSA-N Ser-Ser-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O HHJFMHQYEAAOBM-ZLUOBGJFSA-N 0.000 description 2
- KQNDIKOYWZTZIX-FXQIFTODSA-N Ser-Ser-Arg Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCNC(N)=N KQNDIKOYWZTZIX-FXQIFTODSA-N 0.000 description 2
- XQJCEKXQUJQNNK-ZLUOBGJFSA-N Ser-Ser-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O XQJCEKXQUJQNNK-ZLUOBGJFSA-N 0.000 description 2
- VGQVAVQWKJLIRM-FXQIFTODSA-N Ser-Ser-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O VGQVAVQWKJLIRM-FXQIFTODSA-N 0.000 description 2
- NADLKBTYNKUJEP-KATARQTJSA-N Ser-Thr-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O NADLKBTYNKUJEP-KATARQTJSA-N 0.000 description 2
- OJFFAQFRCVPHNN-JYBASQMISA-N Ser-Thr-Trp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O OJFFAQFRCVPHNN-JYBASQMISA-N 0.000 description 2
- FVFUOQIYDPAIJR-XIRDDKMYSA-N Ser-Trp-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@H](CO)N FVFUOQIYDPAIJR-XIRDDKMYSA-N 0.000 description 2
- VVKVHAOOUGNDPJ-SRVKXCTJSA-N Ser-Tyr-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(O)=O VVKVHAOOUGNDPJ-SRVKXCTJSA-N 0.000 description 2
- MFQMZDPAZRZAPV-NAKRPEOUSA-N Ser-Val-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CO)N MFQMZDPAZRZAPV-NAKRPEOUSA-N 0.000 description 2
- KEGBFULVYKYJRD-LFSVMHDDSA-N Thr-Ala-Phe Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 KEGBFULVYKYJRD-LFSVMHDDSA-N 0.000 description 2
- CAJFZCICSVBOJK-SHGPDSBTSA-N Thr-Ala-Thr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O CAJFZCICSVBOJK-SHGPDSBTSA-N 0.000 description 2
- XSLXHSYIVPGEER-KZVJFYERSA-N Thr-Ala-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O XSLXHSYIVPGEER-KZVJFYERSA-N 0.000 description 2
- CAGTXGDOIFXLPC-KZVJFYERSA-N Thr-Arg-Ala Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C)C(O)=O)CCCN=C(N)N CAGTXGDOIFXLPC-KZVJFYERSA-N 0.000 description 2
- VXMHQKHDKCATDV-VEVYYDQMSA-N Thr-Asp-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O VXMHQKHDKCATDV-VEVYYDQMSA-N 0.000 description 2
- LIXBDERDAGNVAV-XKBZYTNZSA-N Thr-Gln-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O LIXBDERDAGNVAV-XKBZYTNZSA-N 0.000 description 2
- VGYBYGQXZJDZJU-XQXXSGGOSA-N Thr-Glu-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O VGYBYGQXZJDZJU-XQXXSGGOSA-N 0.000 description 2
- VULNJDORNLBPNG-SWRJLBSHSA-N Thr-Glu-Trp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N)O VULNJDORNLBPNG-SWRJLBSHSA-N 0.000 description 2
- SLUWOCTZVGMURC-BFHQHQDPSA-N Thr-Gly-Ala Chemical compound C[C@@H](O)[C@H](N)C(=O)NCC(=O)N[C@@H](C)C(O)=O SLUWOCTZVGMURC-BFHQHQDPSA-N 0.000 description 2
- XFTYVCHLARBHBQ-FOHZUACHSA-N Thr-Gly-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O XFTYVCHLARBHBQ-FOHZUACHSA-N 0.000 description 2
- DJDSEDOKJTZBAR-ZDLURKLDSA-N Thr-Gly-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O DJDSEDOKJTZBAR-ZDLURKLDSA-N 0.000 description 2
- GXUWHVZYDAHFSV-FLBSBUHZSA-N Thr-Ile-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GXUWHVZYDAHFSV-FLBSBUHZSA-N 0.000 description 2
- KZSYAEWQMJEGRZ-RHYQMDGZSA-N Thr-Leu-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O KZSYAEWQMJEGRZ-RHYQMDGZSA-N 0.000 description 2
- SPVHQURZJCUDQC-VOAKCMCISA-N Thr-Lys-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O SPVHQURZJCUDQC-VOAKCMCISA-N 0.000 description 2
- WTMPKZWHRCMMMT-KZVJFYERSA-N Thr-Pro-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O WTMPKZWHRCMMMT-KZVJFYERSA-N 0.000 description 2
- MXDOAJQRJBMGMO-FJXKBIBVSA-N Thr-Pro-Gly Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O MXDOAJQRJBMGMO-FJXKBIBVSA-N 0.000 description 2
- VTMGKRABARCZAX-OSUNSFLBSA-N Thr-Pro-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)[C@@H](C)O VTMGKRABARCZAX-OSUNSFLBSA-N 0.000 description 2
- DEGCBBCMYWNJNA-RHYQMDGZSA-N Thr-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)[C@@H](C)O DEGCBBCMYWNJNA-RHYQMDGZSA-N 0.000 description 2
- MROIJTGJGIDEEJ-RCWTZXSCSA-N Thr-Pro-Pro Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 MROIJTGJGIDEEJ-RCWTZXSCSA-N 0.000 description 2
- AHERARIZBPOMNU-KATARQTJSA-N Thr-Ser-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O AHERARIZBPOMNU-KATARQTJSA-N 0.000 description 2
- VBMOVTMNHWPZJR-SUSMZKCASA-N Thr-Thr-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O VBMOVTMNHWPZJR-SUSMZKCASA-N 0.000 description 2
- UQCNIMDPYICBTR-KYNKHSRBSA-N Thr-Thr-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O UQCNIMDPYICBTR-KYNKHSRBSA-N 0.000 description 2
- FBQHKSPOIAFUEI-OWLDWWDNSA-N Thr-Trp-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](C)C(O)=O FBQHKSPOIAFUEI-OWLDWWDNSA-N 0.000 description 2
- BKIOKSLLAAZYTC-KKHAAJSZSA-N Thr-Val-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O BKIOKSLLAAZYTC-KKHAAJSZSA-N 0.000 description 2
- AKHDFZHUPGVFEJ-YEPSODPASA-N Thr-Val-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O AKHDFZHUPGVFEJ-YEPSODPASA-N 0.000 description 2
- BKVICMPZWRNWOC-RHYQMDGZSA-N Thr-Val-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)[C@@H](C)O BKVICMPZWRNWOC-RHYQMDGZSA-N 0.000 description 2
- QNXZCKMXHPULME-ZNSHCXBVSA-N Thr-Val-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N)O QNXZCKMXHPULME-ZNSHCXBVSA-N 0.000 description 2
- QAXCHNZDPLSFPC-PJODQICGSA-N Trp-Ala-Arg Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O)=CNC2=C1 QAXCHNZDPLSFPC-PJODQICGSA-N 0.000 description 2
- PXYJUECTGMGIDT-WDSOQIARSA-N Trp-Arg-Leu Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC(C)C)C(O)=O)=CNC2=C1 PXYJUECTGMGIDT-WDSOQIARSA-N 0.000 description 2
- MDDYTWOFHZFABW-SZMVWBNQSA-N Trp-Gln-Leu Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O)=CNC2=C1 MDDYTWOFHZFABW-SZMVWBNQSA-N 0.000 description 2
- DZIKVMCFXIIETR-JSGCOSHPSA-N Trp-Gly-Glu Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O DZIKVMCFXIIETR-JSGCOSHPSA-N 0.000 description 2
- WLBZWXXGSOLJBA-HOCLYGCPSA-N Trp-Gly-Lys Chemical compound C1=CC=C2C(C[C@H](N)C(=O)NCC(=O)N[C@@H](CCCCN)C(O)=O)=CNC2=C1 WLBZWXXGSOLJBA-HOCLYGCPSA-N 0.000 description 2
- UKWSFUSPGPBJGU-VFAJRCTISA-N Trp-Leu-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N)O UKWSFUSPGPBJGU-VFAJRCTISA-N 0.000 description 2
- XOLLWQIBBLBAHQ-WDSOQIARSA-N Trp-Pro-Leu Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O XOLLWQIBBLBAHQ-WDSOQIARSA-N 0.000 description 2
- OOEUVMFKKZYSRX-LEWSCRJBSA-N Tyr-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)N OOEUVMFKKZYSRX-LEWSCRJBSA-N 0.000 description 2
- AKXBNSZMYAOGLS-STQMWFEESA-N Tyr-Arg-Gly Chemical compound NC(N)=NCCC[C@@H](C(=O)NCC(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 AKXBNSZMYAOGLS-STQMWFEESA-N 0.000 description 2
- CRWOSTCODDFEKZ-HRCADAONSA-N Tyr-Arg-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC2=CC=C(C=C2)O)N)C(=O)O CRWOSTCODDFEKZ-HRCADAONSA-N 0.000 description 2
- CYDVHRFXDMDMGX-KKUMJFAQSA-N Tyr-Asn-His Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N)O CYDVHRFXDMDMGX-KKUMJFAQSA-N 0.000 description 2
- CTDPLKMBVALCGN-JSGCOSHPSA-N Tyr-Gly-Val Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O CTDPLKMBVALCGN-JSGCOSHPSA-N 0.000 description 2
- OLYXUGBVBGSZDN-ACRUOGEOSA-N Tyr-Leu-Tyr Chemical compound C([C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=C(O)C=C1 OLYXUGBVBGSZDN-ACRUOGEOSA-N 0.000 description 2
- CDKZJGMPZHPAJC-ULQDDVLXSA-N Tyr-Leu-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 CDKZJGMPZHPAJC-ULQDDVLXSA-N 0.000 description 2
- MXFPBNFKVBHIRW-BZSNNMDCSA-N Tyr-Lys-His Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N)O MXFPBNFKVBHIRW-BZSNNMDCSA-N 0.000 description 2
- VYTUETMEZZLJFU-IHRRRGAJSA-N Tyr-Pro-Cys Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CC2=CC=C(C=C2)O)N)C(=O)N[C@@H](CS)C(=O)O VYTUETMEZZLJFU-IHRRRGAJSA-N 0.000 description 2
- ZPFLBLFITJCBTP-QWRGUYRKSA-N Tyr-Ser-Gly Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(=O)NCC(O)=O ZPFLBLFITJCBTP-QWRGUYRKSA-N 0.000 description 2
- SYFHQHYTNCQCCN-MELADBBJSA-N Tyr-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CC2=CC=C(C=C2)O)N)C(=O)O SYFHQHYTNCQCCN-MELADBBJSA-N 0.000 description 2
- PLVVHGFEMSDRET-IHPCNDPISA-N Tyr-Ser-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CC3=CC=C(C=C3)O)N PLVVHGFEMSDRET-IHPCNDPISA-N 0.000 description 2
- TYFLVOUZHQUBGM-IHRRRGAJSA-N Tyr-Ser-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 TYFLVOUZHQUBGM-IHRRRGAJSA-N 0.000 description 2
- BIVIUZRBCAUNPW-JRQIVUDYSA-N Tyr-Thr-Asn Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(O)=O BIVIUZRBCAUNPW-JRQIVUDYSA-N 0.000 description 2
- CLEGSEJVGBYZBJ-MEYUZBJRSA-N Tyr-Thr-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H]([C@H](O)C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 CLEGSEJVGBYZBJ-MEYUZBJRSA-N 0.000 description 2
- AKKYBQGHUAWPJR-MNSWYVGCSA-N Tyr-Thr-Trp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CC3=CC=C(C=C3)O)N)O AKKYBQGHUAWPJR-MNSWYVGCSA-N 0.000 description 2
- OBKOPLHSRDATFO-XHSDSOJGSA-N Tyr-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)N OBKOPLHSRDATFO-XHSDSOJGSA-N 0.000 description 2
- IZFVRRYRMQFVGX-NRPADANISA-N Val-Ala-Gln Chemical compound C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](C(C)C)N IZFVRRYRMQFVGX-NRPADANISA-N 0.000 description 2
- REJBPZVUHYNMEN-LSJOCFKGSA-N Val-Ala-His Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](C(C)C)N REJBPZVUHYNMEN-LSJOCFKGSA-N 0.000 description 2
- RUCNAYOMFXRIKJ-DCAQKATOSA-N Val-Ala-Lys Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCCN RUCNAYOMFXRIKJ-DCAQKATOSA-N 0.000 description 2
- SLLKXDSRVAOREO-KZVJFYERSA-N Val-Ala-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](C)NC(=O)[C@H](C(C)C)N)O SLLKXDSRVAOREO-KZVJFYERSA-N 0.000 description 2
- XPYNXORPPVTVQK-SRVKXCTJSA-N Val-Arg-Met Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCSC)C(=O)O)N XPYNXORPPVTVQK-SRVKXCTJSA-N 0.000 description 2
- UDNYEPLJTRDMEJ-RCOVLWMOSA-N Val-Asn-Gly Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)NCC(=O)O)N UDNYEPLJTRDMEJ-RCOVLWMOSA-N 0.000 description 2
- PVPAOIGJYHVWBT-KKHAAJSZSA-N Val-Asn-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](C(C)C)N)O PVPAOIGJYHVWBT-KKHAAJSZSA-N 0.000 description 2
- CWSIBTLMMQLPPZ-FXQIFTODSA-N Val-Cys-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](C(C)C)N CWSIBTLMMQLPPZ-FXQIFTODSA-N 0.000 description 2
- WBUOKGBHGDPYMH-GUBZILKMSA-N Val-Cys-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@H](CS)NC(=O)[C@@H](N)C(C)C WBUOKGBHGDPYMH-GUBZILKMSA-N 0.000 description 2
- DBMMKEHYWIZTPN-JYJNAYRXSA-N Val-Cys-Trp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N DBMMKEHYWIZTPN-JYJNAYRXSA-N 0.000 description 2
- WDIGUPHXPBMODF-UMNHJUIQSA-N Val-Glu-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N WDIGUPHXPBMODF-UMNHJUIQSA-N 0.000 description 2
- OQWNEUXPKHIEJO-NRPADANISA-N Val-Glu-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CO)C(=O)O)N OQWNEUXPKHIEJO-NRPADANISA-N 0.000 description 2
- JTWIMNMUYLQNPI-WPRPVWTQSA-N Val-Gly-Arg Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCNC(N)=N JTWIMNMUYLQNPI-WPRPVWTQSA-N 0.000 description 2
- PIFJAFRUVWZRKR-QMMMGPOBSA-N Val-Gly-Gly Chemical compound CC(C)[C@H]([NH3+])C(=O)NCC(=O)NCC([O-])=O PIFJAFRUVWZRKR-QMMMGPOBSA-N 0.000 description 2
- JVYIGCARISMLMV-HOCLYGCPSA-N Val-Gly-Trp Chemical compound CC(C)[C@@H](C(=O)NCC(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N JVYIGCARISMLMV-HOCLYGCPSA-N 0.000 description 2
- YTUABZMPYKCWCQ-XQQFMLRXSA-N Val-His-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N2CCC[C@@H]2C(=O)O)N YTUABZMPYKCWCQ-XQQFMLRXSA-N 0.000 description 2
- PYPZMFDMCCWNST-NAKRPEOUSA-N Val-Ile-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](C(C)C)N PYPZMFDMCCWNST-NAKRPEOUSA-N 0.000 description 2
- UKEVLVBHRKWECS-LSJOCFKGSA-N Val-Ile-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](C(C)C)N UKEVLVBHRKWECS-LSJOCFKGSA-N 0.000 description 2
- PYXQBKJPHNCTNW-CYDGBPFRSA-N Val-Ile-Met Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](C(C)C)N PYXQBKJPHNCTNW-CYDGBPFRSA-N 0.000 description 2
- DAVNYIUELQBTAP-XUXIUFHCSA-N Val-Leu-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](C(C)C)N DAVNYIUELQBTAP-XUXIUFHCSA-N 0.000 description 2
- JKHXYJKMNSSFFL-IUCAKERBSA-N Val-Lys Chemical compound CC(C)[C@H](N)C(=O)N[C@H](C(O)=O)CCCCN JKHXYJKMNSSFFL-IUCAKERBSA-N 0.000 description 2
- SVFRYKBZHUGKLP-QXEWZRGKSA-N Val-Met-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(=O)N)C(=O)O)N SVFRYKBZHUGKLP-QXEWZRGKSA-N 0.000 description 2
- JVGHIFMSFBZDHH-WPRPVWTQSA-N Val-Met-Gly Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)NCC(=O)O)N JVGHIFMSFBZDHH-WPRPVWTQSA-N 0.000 description 2
- UZFNHAXYMICTBU-DZKIICNBSA-N Val-Phe-Gln Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N UZFNHAXYMICTBU-DZKIICNBSA-N 0.000 description 2
- RYHUIHUOYRNNIE-NRPADANISA-N Val-Ser-Gln Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N RYHUIHUOYRNNIE-NRPADANISA-N 0.000 description 2
- MNSSBIHFEUUXNW-RCWTZXSCSA-N Val-Thr-Arg Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N MNSSBIHFEUUXNW-RCWTZXSCSA-N 0.000 description 2
- KJFBXCFOPAKPTM-BZSNNMDCSA-N Val-Trp-Val Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](N)C(C)C)C(=O)N[C@@H](C(C)C)C(O)=O)=CNC2=C1 KJFBXCFOPAKPTM-BZSNNMDCSA-N 0.000 description 2
- MIAZWUMFUURQNP-YDHLFZDLSA-N Val-Tyr-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N MIAZWUMFUURQNP-YDHLFZDLSA-N 0.000 description 2
- CFIBZQOLUDURST-IHRRRGAJSA-N Val-Tyr-Cys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CS)C(=O)O)N CFIBZQOLUDURST-IHRRRGAJSA-N 0.000 description 2
- SSKKGOWRPNIVDW-AVGNSLFASA-N Val-Val-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N SSKKGOWRPNIVDW-AVGNSLFASA-N 0.000 description 2
- WBPFYNYTYASCQP-CYDGBPFRSA-N Val-Val-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](C(C)C)N WBPFYNYTYASCQP-CYDGBPFRSA-N 0.000 description 2
- YKZVPMUGEJXEOR-JYJNAYRXSA-N Val-Val-Tyr Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N YKZVPMUGEJXEOR-JYJNAYRXSA-N 0.000 description 2
- 235000010724 Wisteria floribunda Nutrition 0.000 description 2
- 238000000246 agarose gel electrophoresis Methods 0.000 description 2
- 108010069020 alanyl-prolyl-glycine Proteins 0.000 description 2
- 108010041407 alanylaspartic acid Proteins 0.000 description 2
- 108010070783 alanyltyrosine Proteins 0.000 description 2
- 108010050025 alpha-glutamyltryptophan Proteins 0.000 description 2
- 229960000723 ampicillin Drugs 0.000 description 2
- AVKUERGKIZMTKX-NJBDSQKTSA-N ampicillin Chemical compound C1([C@@H](N)C(=O)N[C@H]2[C@H]3SC([C@@H](N3C2=O)C(O)=O)(C)C)=CC=CC=C1 AVKUERGKIZMTKX-NJBDSQKTSA-N 0.000 description 2
- 108010089442 arginyl-leucyl-alanyl-arginine Proteins 0.000 description 2
- 108010010430 asparagine-proline-alanine Proteins 0.000 description 2
- 108010038633 aspartylglutamate Proteins 0.000 description 2
- 238000001574 biopsy Methods 0.000 description 2
- 239000001110 calcium chloride Substances 0.000 description 2
- 229910001628 calcium chloride Inorganic materials 0.000 description 2
- 239000000969 carrier Substances 0.000 description 2
- 230000001684 chronic effect Effects 0.000 description 2
- 229910052956 cinnabar Inorganic materials 0.000 description 2
- 230000007882 cirrhosis Effects 0.000 description 2
- 238000007796 conventional method Methods 0.000 description 2
- 210000004748 cultured cell Anatomy 0.000 description 2
- 108010069495 cysteinyltyrosine Proteins 0.000 description 2
- 238000004925 denaturation Methods 0.000 description 2
- 230000036425 denaturation Effects 0.000 description 2
- 229940009976 deoxycholate Drugs 0.000 description 2
- KXGVEGMKQFWNSR-LLQZFEROSA-N deoxycholic acid Chemical compound C([C@H]1CC2)[C@H](O)CC[C@]1(C)[C@@H]1[C@@H]2[C@@H]2CC[C@H]([C@@H](CCC(O)=O)C)[C@@]2(C)[C@@H](O)C1 KXGVEGMKQFWNSR-LLQZFEROSA-N 0.000 description 2
- FFYPMLJYZAEMQB-UHFFFAOYSA-N diethyl pyrocarbonate Chemical compound CCOC(=O)OC(=O)OCC FFYPMLJYZAEMQB-UHFFFAOYSA-N 0.000 description 2
- FSXRLASFHBWESK-UHFFFAOYSA-N dipeptide phenylalanyl-tyrosine Natural products C=1C=C(O)C=CC=1CC(C(O)=O)NC(=O)C(N)CC1=CC=CC=C1 FSXRLASFHBWESK-UHFFFAOYSA-N 0.000 description 2
- 108010054812 diprotin A Proteins 0.000 description 2
- DEFVIWRASFVYLL-UHFFFAOYSA-N ethylene glycol bis(2-aminoethyl)tetraacetic acid Chemical compound OC(=O)CN(CC(O)=O)CCOCCOCCN(CC(O)=O)CC(O)=O DEFVIWRASFVYLL-UHFFFAOYSA-N 0.000 description 2
- 238000000605 extraction Methods 0.000 description 2
- 239000012091 fetal bovine serum Substances 0.000 description 2
- 108010080575 glutamyl-aspartyl-alanine Proteins 0.000 description 2
- 108010057083 glutamyl-aspartyl-leucine Proteins 0.000 description 2
- 108010000434 glycyl-alanyl-leucine Proteins 0.000 description 2
- 108010045126 glycyl-tyrosyl-glycine Proteins 0.000 description 2
- 108010040030 histidinoalanine Proteins 0.000 description 2
- 230000003834 intracellular effect Effects 0.000 description 2
- 108010053037 kyotorphin Proteins 0.000 description 2
- 108010077158 leucinyl-arginyl-tryptophan Proteins 0.000 description 2
- 108010044311 leucyl-glycyl-glycine Proteins 0.000 description 2
- 108010090333 leucyl-lysyl-proline Proteins 0.000 description 2
- 108010064235 lysylglycine Proteins 0.000 description 2
- 239000011777 magnesium Substances 0.000 description 2
- 229910001629 magnesium chloride Inorganic materials 0.000 description 2
- 238000005259 measurement Methods 0.000 description 2
- 108010056582 methionylglutamic acid Proteins 0.000 description 2
- 239000000203 mixture Substances 0.000 description 2
- 239000002773 nucleotide Substances 0.000 description 2
- 125000003729 nucleotide group Chemical group 0.000 description 2
- 239000000137 peptide hydrolase inhibitor Substances 0.000 description 2
- 108010024654 phenylalanyl-prolyl-alanine Proteins 0.000 description 2
- 108010083476 phenylalanyltryptophan Proteins 0.000 description 2
- 229920001184 polypeptide Polymers 0.000 description 2
- 239000001103 potassium chloride Substances 0.000 description 2
- 235000011164 potassium chloride Nutrition 0.000 description 2
- 229910000160 potassium phosphate Inorganic materials 0.000 description 2
- 235000011009 potassium phosphates Nutrition 0.000 description 2
- 102000004196 processed proteins & peptides Human genes 0.000 description 2
- 230000002062 proliferating effect Effects 0.000 description 2
- 108010077112 prolyl-proline Proteins 0.000 description 2
- 108010093296 prolyl-prolyl-alanine Proteins 0.000 description 2
- 108010087846 prolyl-prolyl-glycine Proteins 0.000 description 2
- 235000019833 protease Nutrition 0.000 description 2
- 108091008146 restriction endonucleases Proteins 0.000 description 2
- 230000000717 retained effect Effects 0.000 description 2
- 238000010839 reverse transcription Methods 0.000 description 2
- 239000003161 ribonuclease inhibitor Substances 0.000 description 2
- 230000035945 sensitivity Effects 0.000 description 2
- 238000000926 separation method Methods 0.000 description 2
- UCSJYZPVAKXKNQ-HZYVHMACSA-N streptomycin Chemical compound CN[C@H]1[C@H](O)[C@@H](O)[C@H](CO)O[C@H]1O[C@@H]1[C@](C=O)(O)[C@H](C)O[C@H]1O[C@@H]1[C@@H](NC(N)=N)[C@H](O)[C@@H](NC(N)=N)[C@H](O)[C@H]1O UCSJYZPVAKXKNQ-HZYVHMACSA-N 0.000 description 2
- 230000008685 targeting Effects 0.000 description 2
- 230000001225 therapeutic effect Effects 0.000 description 2
- 108010071097 threonyl-lysyl-proline Proteins 0.000 description 2
- 238000001890 transfection Methods 0.000 description 2
- 108010087967 type I signal peptidase Proteins 0.000 description 2
- 108010071635 tyrosyl-prolyl-arginine Proteins 0.000 description 2
- GJLXVWOMRRWCIB-MERZOTPQSA-N (2S)-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-acetamido-5-(diaminomethylideneamino)pentanoyl]amino]-3-(4-hydroxyphenyl)propanoyl]amino]-3-(4-hydroxyphenyl)propanoyl]amino]-5-(diaminomethylideneamino)pentanoyl]amino]-3-(1H-indol-3-yl)propanoyl]amino]-6-aminohexanoyl]amino]-6-aminohexanoyl]amino]-6-aminohexanoyl]amino]-6-aminohexanoyl]amino]-6-aminohexanoyl]amino]-6-aminohexanoyl]amino]-6-aminohexanamide Chemical compound C([C@H](NC(=O)[C@H](CCCN=C(N)N)NC(=O)C)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(N)=O)C1=CC=C(O)C=C1 GJLXVWOMRRWCIB-MERZOTPQSA-N 0.000 description 1
- JNTMAZFVYNDPLB-PEDHHIEDSA-N (2S,3S)-2-[[[(2S)-1-[(2S,3S)-2-amino-3-methyl-1-oxopentyl]-2-pyrrolidinyl]-oxomethyl]amino]-3-methylpentanoic acid Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JNTMAZFVYNDPLB-PEDHHIEDSA-N 0.000 description 1
- KIUKXJAPPMFGSW-DNGZLQJQSA-N (2S,3S,4S,5R,6R)-6-[(2S,3R,4R,5S,6R)-3-Acetamido-2-[(2S,3S,4R,5R,6R)-6-[(2R,3R,4R,5S,6R)-3-acetamido-2,5-dihydroxy-6-(hydroxymethyl)oxan-4-yl]oxy-2-carboxy-4,5-dihydroxyoxan-3-yl]oxy-5-hydroxy-6-(hydroxymethyl)oxan-4-yl]oxy-3,4,5-trihydroxyoxane-2-carboxylic acid Chemical compound CC(=O)N[C@H]1[C@H](O)O[C@H](CO)[C@@H](O)[C@@H]1O[C@H]1[C@H](O)[C@@H](O)[C@H](O[C@H]2[C@@H]([C@@H](O[C@H]3[C@@H]([C@@H](O)[C@H](O)[C@H](O3)C(O)=O)O)[C@H](O)[C@@H](CO)O2)NC(C)=O)[C@@H](C(O)=O)O1 KIUKXJAPPMFGSW-DNGZLQJQSA-N 0.000 description 1
- SCAKQYSGEIHPLV-IUCAKERBSA-N (4S)-4-[(2-aminoacetyl)amino]-5-[(2S)-2-(carboxymethylcarbamoyl)pyrrolidin-1-yl]-5-oxopentanoic acid Chemical compound NCC(=O)N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O SCAKQYSGEIHPLV-IUCAKERBSA-N 0.000 description 1
- JILCEWWZTBBOFS-UHFFFAOYSA-N 4-(methylamino)antipyrine Chemical compound O=C1C(NC)=C(C)N(C)N1C1=CC=CC=C1 JILCEWWZTBBOFS-UHFFFAOYSA-N 0.000 description 1
- OPIFSICVWOWJMJ-AEOCFKNESA-N 5-bromo-4-chloro-3-indolyl beta-D-galactoside Chemical compound O[C@@H]1[C@@H](O)[C@@H](O)[C@@H](CO)O[C@H]1OC1=CNC2=CC=C(Br)C(Cl)=C12 OPIFSICVWOWJMJ-AEOCFKNESA-N 0.000 description 1
- 229920001817 Agar Polymers 0.000 description 1
- CXRCVCURMBFFOL-FXQIFTODSA-N Ala-Ala-Pro Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N1CCC[C@H]1C(O)=O CXRCVCURMBFFOL-FXQIFTODSA-N 0.000 description 1
- UCIYCBSJBQGDGM-LPEHRKFASA-N Ala-Arg-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N1CCC[C@@H]1C(=O)O)N UCIYCBSJBQGDGM-LPEHRKFASA-N 0.000 description 1
- PJNSIUPOXFBHDM-GUBZILKMSA-N Ala-Arg-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(O)=O PJNSIUPOXFBHDM-GUBZILKMSA-N 0.000 description 1
- KIUYPHAMDKDICO-WHFBIAKZSA-N Ala-Asp-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O KIUYPHAMDKDICO-WHFBIAKZSA-N 0.000 description 1
- MKZCBYZBCINNJN-DLOVCJGASA-N Ala-Asp-Phe Chemical compound C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 MKZCBYZBCINNJN-DLOVCJGASA-N 0.000 description 1
- IKKVASZHTMKJIR-ZKWXMUAHSA-N Ala-Asp-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O IKKVASZHTMKJIR-ZKWXMUAHSA-N 0.000 description 1
- YEELWQSXYBJVSV-UWJYBYFXSA-N Ala-Cys-Tyr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O YEELWQSXYBJVSV-UWJYBYFXSA-N 0.000 description 1
- CSAHOYQKNHGDHX-ACZMJKKPSA-N Ala-Gln-Asn Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O CSAHOYQKNHGDHX-ACZMJKKPSA-N 0.000 description 1
- CRWFEKLFPVRPBV-CIUDSAMLSA-N Ala-Gln-Met Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCSC)C(O)=O CRWFEKLFPVRPBV-CIUDSAMLSA-N 0.000 description 1
- GGNHBHYDMUDXQB-KBIXCLLPSA-N Ala-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@H](C)N GGNHBHYDMUDXQB-KBIXCLLPSA-N 0.000 description 1
- VBRDBGCROKWTPV-XHNCKOQMSA-N Ala-Glu-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N VBRDBGCROKWTPV-XHNCKOQMSA-N 0.000 description 1
- LJFNNUBZSZCZFN-WHFBIAKZSA-N Ala-Gly-Cys Chemical compound N[C@@H](C)C(=O)NCC(=O)N[C@@H](CS)C(=O)O LJFNNUBZSZCZFN-WHFBIAKZSA-N 0.000 description 1
- BEMGNWZECGIJOI-WDSKDSINSA-N Ala-Gly-Glu Chemical compound [H]N[C@@H](C)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O BEMGNWZECGIJOI-WDSKDSINSA-N 0.000 description 1
- NIZKGBJVCMRDKO-KWQFWETISA-N Ala-Gly-Tyr Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 NIZKGBJVCMRDKO-KWQFWETISA-N 0.000 description 1
- PNALXAODQKTNLV-JBDRJPRFSA-N Ala-Ile-Ala Chemical compound C[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O PNALXAODQKTNLV-JBDRJPRFSA-N 0.000 description 1
- RZZMZYZXNJRPOJ-BJDJZHNGSA-N Ala-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](C)N RZZMZYZXNJRPOJ-BJDJZHNGSA-N 0.000 description 1
- QCTFKEJEIMPOLW-JURCDPSOSA-N Ala-Ile-Phe Chemical compound C[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 QCTFKEJEIMPOLW-JURCDPSOSA-N 0.000 description 1
- OKIKVSXTXVVFDV-MMWGEVLESA-N Ala-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](C)N OKIKVSXTXVVFDV-MMWGEVLESA-N 0.000 description 1
- MNZHHDPWDWQJCQ-YUMQZZPRSA-N Ala-Leu-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O MNZHHDPWDWQJCQ-YUMQZZPRSA-N 0.000 description 1
- LDLSENBXQNDTPB-DCAQKATOSA-N Ala-Lys-Arg Chemical compound NCCCC[C@H](NC(=O)[C@@H](N)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N LDLSENBXQNDTPB-DCAQKATOSA-N 0.000 description 1
- SDZRIBWEVVRDQI-CIUDSAMLSA-N Ala-Lys-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O SDZRIBWEVVRDQI-CIUDSAMLSA-N 0.000 description 1
- GFEDXKNBZMPEDM-KZVJFYERSA-N Ala-Met-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GFEDXKNBZMPEDM-KZVJFYERSA-N 0.000 description 1
- XRUJOVRWNMBAAA-NHCYSSNCSA-N Ala-Phe-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=CC=C1 XRUJOVRWNMBAAA-NHCYSSNCSA-N 0.000 description 1
- BFMIRJBURUXDRG-DLOVCJGASA-N Ala-Phe-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=CC=C1 BFMIRJBURUXDRG-DLOVCJGASA-N 0.000 description 1
- RUXQNKVQSKOOBS-JURCDPSOSA-N Ala-Phe-Ile Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O RUXQNKVQSKOOBS-JURCDPSOSA-N 0.000 description 1
- WEZNQZHACPSMEF-QEJZJMRPSA-N Ala-Phe-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=CC=C1 WEZNQZHACPSMEF-QEJZJMRPSA-N 0.000 description 1
- YCRAFFCYWOUEOF-DLOVCJGASA-N Ala-Phe-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=CC=C1 YCRAFFCYWOUEOF-DLOVCJGASA-N 0.000 description 1
- CYBJZLQSUJEMAS-LFSVMHDDSA-N Ala-Phe-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](C)N)O CYBJZLQSUJEMAS-LFSVMHDDSA-N 0.000 description 1
- IORKCNUBHNIMKY-CIUDSAMLSA-N Ala-Pro-Glu Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O IORKCNUBHNIMKY-CIUDSAMLSA-N 0.000 description 1
- BHTBAVZSZCQZPT-GUBZILKMSA-N Ala-Pro-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](C)N BHTBAVZSZCQZPT-GUBZILKMSA-N 0.000 description 1
- VJVQKGYHIZPSNS-FXQIFTODSA-N Ala-Ser-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCN=C(N)N VJVQKGYHIZPSNS-FXQIFTODSA-N 0.000 description 1
- AUFACLFHBAGZEN-ZLUOBGJFSA-N Ala-Ser-Cys Chemical compound N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)O AUFACLFHBAGZEN-ZLUOBGJFSA-N 0.000 description 1
- YYAVDNKUWLAFCV-ACZMJKKPSA-N Ala-Ser-Gln Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(O)=O YYAVDNKUWLAFCV-ACZMJKKPSA-N 0.000 description 1
- NCQMBSJGJMYKCK-ZLUOBGJFSA-N Ala-Ser-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O NCQMBSJGJMYKCK-ZLUOBGJFSA-N 0.000 description 1
- OEVCHROQUIVQFZ-YTLHQDLWSA-N Ala-Thr-Ala Chemical compound C[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](C)C(O)=O OEVCHROQUIVQFZ-YTLHQDLWSA-N 0.000 description 1
- SAHQGRZIQVEJPF-JXUBOQSCSA-N Ala-Thr-Lys Chemical compound C[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@H](C(O)=O)CCCCN SAHQGRZIQVEJPF-JXUBOQSCSA-N 0.000 description 1
- QOIGKCBMXUCDQU-KDXUFGMBSA-N Ala-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](C)N)O QOIGKCBMXUCDQU-KDXUFGMBSA-N 0.000 description 1
- KUFVXLQLDHJVOG-SHGPDSBTSA-N Ala-Thr-Thr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](C)N)O KUFVXLQLDHJVOG-SHGPDSBTSA-N 0.000 description 1
- PXAFZDXYEIIUTF-LKTVYLICSA-N Ala-Trp-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCC(O)=O)C(O)=O PXAFZDXYEIIUTF-LKTVYLICSA-N 0.000 description 1
- PHQXWZGXKAFWAZ-ZLIFDBKOSA-N Ala-Trp-Lys Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](N)C)C(=O)N[C@@H](CCCCN)C(O)=O)=CNC2=C1 PHQXWZGXKAFWAZ-ZLIFDBKOSA-N 0.000 description 1
- KLKARCOHVHLAJP-UWJYBYFXSA-N Ala-Tyr-Cys Chemical compound C[C@H](N)C(=O)N[C@@H](Cc1ccc(O)cc1)C(=O)N[C@@H](CS)C(O)=O KLKARCOHVHLAJP-UWJYBYFXSA-N 0.000 description 1
- ZJLORAAXDAJLDC-CQDKDKBSSA-N Ala-Tyr-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(O)=O ZJLORAAXDAJLDC-CQDKDKBSSA-N 0.000 description 1
- JPOQZCHGOTWRTM-FQPOAREZSA-N Ala-Tyr-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O JPOQZCHGOTWRTM-FQPOAREZSA-N 0.000 description 1
- XSLGWYYNOSUMRM-ZKWXMUAHSA-N Ala-Val-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O XSLGWYYNOSUMRM-ZKWXMUAHSA-N 0.000 description 1
- XCIGOVDXZULBBV-DCAQKATOSA-N Ala-Val-Lys Chemical compound CC(C)[C@H](NC(=O)[C@H](C)N)C(=O)N[C@@H](CCCCN)C(O)=O XCIGOVDXZULBBV-DCAQKATOSA-N 0.000 description 1
- DHONNEYAZPNGSG-UBHSHLNASA-N Ala-Val-Phe Chemical compound C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 DHONNEYAZPNGSG-UBHSHLNASA-N 0.000 description 1
- REWSWYIDQIELBE-FXQIFTODSA-N Ala-Val-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O REWSWYIDQIELBE-FXQIFTODSA-N 0.000 description 1
- OOBVTWHLKYJFJH-FXQIFTODSA-N Arg-Ala-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O OOBVTWHLKYJFJH-FXQIFTODSA-N 0.000 description 1
- SGYSTDWPNPKJPP-GUBZILKMSA-N Arg-Ala-Arg Chemical compound NC(=N)NCCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O SGYSTDWPNPKJPP-GUBZILKMSA-N 0.000 description 1
- YYOVLDPHIJAOSY-DCAQKATOSA-N Arg-Ala-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCN=C(N)N YYOVLDPHIJAOSY-DCAQKATOSA-N 0.000 description 1
- GIVATXIGCXFQQA-FXQIFTODSA-N Arg-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCN=C(N)N GIVATXIGCXFQQA-FXQIFTODSA-N 0.000 description 1
- KGSJCPBERYUXCN-BPNCWPANSA-N Arg-Ala-Tyr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O KGSJCPBERYUXCN-BPNCWPANSA-N 0.000 description 1
- OMLWNBVRVJYMBQ-YUMQZZPRSA-N Arg-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O OMLWNBVRVJYMBQ-YUMQZZPRSA-N 0.000 description 1
- KJGNDQCYBNBXDA-GUBZILKMSA-N Arg-Arg-Cys Chemical compound C(C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CS)C(=O)O)N)CN=C(N)N KJGNDQCYBNBXDA-GUBZILKMSA-N 0.000 description 1
- SQKPKIJVWHAWNF-DCAQKATOSA-N Arg-Asp-Lys Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(O)=O SQKPKIJVWHAWNF-DCAQKATOSA-N 0.000 description 1
- MFAMTAVAFBPXDC-LPEHRKFASA-N Arg-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCCN=C(N)N)N)C(=O)O MFAMTAVAFBPXDC-LPEHRKFASA-N 0.000 description 1
- HJAICMSAKODKRF-GUBZILKMSA-N Arg-Cys-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O HJAICMSAKODKRF-GUBZILKMSA-N 0.000 description 1
- IGULQRCJLQQPSM-DCAQKATOSA-N Arg-Cys-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(O)=O IGULQRCJLQQPSM-DCAQKATOSA-N 0.000 description 1
- YHSNASXGBPAHRL-BPUTZDHNSA-N Arg-Cys-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CCCN=C(N)N)N YHSNASXGBPAHRL-BPUTZDHNSA-N 0.000 description 1
- FEZJJKXNPSEYEV-CIUDSAMLSA-N Arg-Gln-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O FEZJJKXNPSEYEV-CIUDSAMLSA-N 0.000 description 1
- ZEAYJGRKRUBDOB-GARJFASQSA-N Arg-Gln-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCCN=C(N)N)N)C(=O)O ZEAYJGRKRUBDOB-GARJFASQSA-N 0.000 description 1
- BEXGZLUHRXTZCC-CIUDSAMLSA-N Arg-Gln-Ser Chemical compound C(C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CO)C(=O)O)N)CN=C(N)N BEXGZLUHRXTZCC-CIUDSAMLSA-N 0.000 description 1
- PBSOQGZLPFVXPU-YUMQZZPRSA-N Arg-Glu-Gly Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O PBSOQGZLPFVXPU-YUMQZZPRSA-N 0.000 description 1
- ZZZWQALDSQQBEW-STQMWFEESA-N Arg-Gly-Tyr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O ZZZWQALDSQQBEW-STQMWFEESA-N 0.000 description 1
- SLNCSSWAIDUUGF-LSJOCFKGSA-N Arg-His-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(O)=O SLNCSSWAIDUUGF-LSJOCFKGSA-N 0.000 description 1
- CVKOQHYVDVYJSI-QTKMDUPCSA-N Arg-His-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CCCN=C(N)N)N)O CVKOQHYVDVYJSI-QTKMDUPCSA-N 0.000 description 1
- UBCPNBUIQNMDNH-NAKRPEOUSA-N Arg-Ile-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O UBCPNBUIQNMDNH-NAKRPEOUSA-N 0.000 description 1
- LVMUGODRNHFGRA-AVGNSLFASA-N Arg-Leu-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O LVMUGODRNHFGRA-AVGNSLFASA-N 0.000 description 1
- DNUKXVMPARLPFN-XUXIUFHCSA-N Arg-Leu-Ile Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O DNUKXVMPARLPFN-XUXIUFHCSA-N 0.000 description 1
- FSNVAJOPUDVQAR-AVGNSLFASA-N Arg-Lys-Arg Chemical compound NC(=N)NCCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O FSNVAJOPUDVQAR-AVGNSLFASA-N 0.000 description 1
- XUGATJVGQUGQKY-ULQDDVLXSA-N Arg-Lys-Phe Chemical compound NC(=N)NCCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 XUGATJVGQUGQKY-ULQDDVLXSA-N 0.000 description 1
- GRRXPUAICOGISM-RWMBFGLXSA-N Arg-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)[C@H](CCCN=C(N)N)N)C(=O)O GRRXPUAICOGISM-RWMBFGLXSA-N 0.000 description 1
- JBIRFLWXWDSDTR-CYDGBPFRSA-N Arg-Met-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](CCCN=C(N)N)N JBIRFLWXWDSDTR-CYDGBPFRSA-N 0.000 description 1
- BSGSDLYGGHGMND-IHRRRGAJSA-N Arg-Phe-Cys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N BSGSDLYGGHGMND-IHRRRGAJSA-N 0.000 description 1
- SLQQPJBDBVPVQV-JYJNAYRXSA-N Arg-Phe-Val Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(O)=O SLQQPJBDBVPVQV-JYJNAYRXSA-N 0.000 description 1
- OVQJAKFLFTZDNC-GUBZILKMSA-N Arg-Pro-Asp Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O OVQJAKFLFTZDNC-GUBZILKMSA-N 0.000 description 1
- ATABBWFGOHKROJ-GUBZILKMSA-N Arg-Pro-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O ATABBWFGOHKROJ-GUBZILKMSA-N 0.000 description 1
- KMFPQTITXUKJOV-DCAQKATOSA-N Arg-Ser-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O KMFPQTITXUKJOV-DCAQKATOSA-N 0.000 description 1
- OQPAZKMGCWPERI-GUBZILKMSA-N Arg-Ser-Val Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O OQPAZKMGCWPERI-GUBZILKMSA-N 0.000 description 1
- HRCIIMCTUIAKQB-XGEHTFHBSA-N Arg-Thr-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N)O HRCIIMCTUIAKQB-XGEHTFHBSA-N 0.000 description 1
- LYJXHXGPWDTLKW-HJGDQZAQSA-N Arg-Thr-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N)O LYJXHXGPWDTLKW-HJGDQZAQSA-N 0.000 description 1
- UZSQXCMNUPKLCC-FJXKBIBVSA-N Arg-Thr-Gly Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O UZSQXCMNUPKLCC-FJXKBIBVSA-N 0.000 description 1
- YNSUUAOAFCVINY-OSUNSFLBSA-N Arg-Thr-Ile Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O YNSUUAOAFCVINY-OSUNSFLBSA-N 0.000 description 1
- LOVIQNMIPQVIGT-BVSLBCMMSA-N Arg-Trp-Phe Chemical compound C([C@H](NC(=O)[C@H](CC=1C2=CC=CC=C2NC=1)NC(=O)[C@H](CCCN=C(N)N)N)C(O)=O)C1=CC=CC=C1 LOVIQNMIPQVIGT-BVSLBCMMSA-N 0.000 description 1
- UVTGNSWSRSCPLP-UHFFFAOYSA-N Arg-Tyr Natural products NC(CCNC(=N)N)C(=O)NC(Cc1ccc(O)cc1)C(=O)O UVTGNSWSRSCPLP-UHFFFAOYSA-N 0.000 description 1
- NVPHRWNWTKYIST-BPNCWPANSA-N Arg-Tyr-Ala Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C)C(O)=O)CC1=CC=C(O)C=C1 NVPHRWNWTKYIST-BPNCWPANSA-N 0.000 description 1
- QHUOOCKNNURZSL-IHRRRGAJSA-N Arg-Tyr-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(O)=O QHUOOCKNNURZSL-IHRRRGAJSA-N 0.000 description 1
- ULBHWNVWSCJLCO-NHCYSSNCSA-N Arg-Val-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCN=C(N)N ULBHWNVWSCJLCO-NHCYSSNCSA-N 0.000 description 1
- WHLDJYNHXOMGMU-JYJNAYRXSA-N Arg-Val-Tyr Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 WHLDJYNHXOMGMU-JYJNAYRXSA-N 0.000 description 1
- AKEBUSZTMQLNIX-UWJYBYFXSA-N Asn-Ala-Tyr Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)[C@H](CC(=O)N)N AKEBUSZTMQLNIX-UWJYBYFXSA-N 0.000 description 1
- KXFCBAHYSLJCCY-ZLUOBGJFSA-N Asn-Asn-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O KXFCBAHYSLJCCY-ZLUOBGJFSA-N 0.000 description 1
- VKCOHFFSTKCXEQ-OLHMAJIHSA-N Asn-Asn-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O VKCOHFFSTKCXEQ-OLHMAJIHSA-N 0.000 description 1
- XQQVCUIBGYFKDC-OLHMAJIHSA-N Asn-Asp-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XQQVCUIBGYFKDC-OLHMAJIHSA-N 0.000 description 1
- RRVBEKYEFMCDIF-WHFBIAKZSA-N Asn-Cys-Gly Chemical compound C([C@@H](C(=O)N[C@@H](CS)C(=O)NCC(=O)O)N)C(=O)N RRVBEKYEFMCDIF-WHFBIAKZSA-N 0.000 description 1
- DMLSCRJBWUEALP-LAEOZQHASA-N Asn-Glu-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O DMLSCRJBWUEALP-LAEOZQHASA-N 0.000 description 1
- IICZCLFBILYRCU-WHFBIAKZSA-N Asn-Gly-Asp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O IICZCLFBILYRCU-WHFBIAKZSA-N 0.000 description 1
- FTCGGKNCJZOPNB-WHFBIAKZSA-N Asn-Gly-Ser Chemical compound NC(=O)C[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O FTCGGKNCJZOPNB-WHFBIAKZSA-N 0.000 description 1
- YGHCVNQOZZMHRZ-DJFWLOJKSA-N Asn-His-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CC(=O)N)N YGHCVNQOZZMHRZ-DJFWLOJKSA-N 0.000 description 1
- XVBDDUPJVQXDSI-PEFMBERDSA-N Asn-Ile-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CC(=O)N)N XVBDDUPJVQXDSI-PEFMBERDSA-N 0.000 description 1
- ACKNRKFVYUVWAC-ZPFDUUQYSA-N Asn-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)N)N ACKNRKFVYUVWAC-ZPFDUUQYSA-N 0.000 description 1
- IBLAOXSULLECQZ-IUKAMOBKSA-N Asn-Ile-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CC(N)=O IBLAOXSULLECQZ-IUKAMOBKSA-N 0.000 description 1
- NLRJGXZWTKXRHP-DCAQKATOSA-N Asn-Leu-Arg Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O NLRJGXZWTKXRHP-DCAQKATOSA-N 0.000 description 1
- WIDVAWAQBRAKTI-YUMQZZPRSA-N Asn-Leu-Gly Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O WIDVAWAQBRAKTI-YUMQZZPRSA-N 0.000 description 1
- FHETWELNCBMRMG-HJGDQZAQSA-N Asn-Leu-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O FHETWELNCBMRMG-HJGDQZAQSA-N 0.000 description 1
- MVXJBVVLACEGCG-PCBIJLKTSA-N Asn-Phe-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O MVXJBVVLACEGCG-PCBIJLKTSA-N 0.000 description 1
- OOXUBGLNDRGOKT-FXQIFTODSA-N Asn-Ser-Arg Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O OOXUBGLNDRGOKT-FXQIFTODSA-N 0.000 description 1
- MKJBPDLENBUHQU-CIUDSAMLSA-N Asn-Ser-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O MKJBPDLENBUHQU-CIUDSAMLSA-N 0.000 description 1
- UGXYFDQFLVCDFC-CIUDSAMLSA-N Asn-Ser-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC(N)=O UGXYFDQFLVCDFC-CIUDSAMLSA-N 0.000 description 1
- SNYCNNPOFYBCEK-ZLUOBGJFSA-N Asn-Ser-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O SNYCNNPOFYBCEK-ZLUOBGJFSA-N 0.000 description 1
- XHTUGJCAEYOZOR-UBHSHLNASA-N Asn-Ser-Trp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O XHTUGJCAEYOZOR-UBHSHLNASA-N 0.000 description 1
- GOPFMQJUQDLUFW-LKXGYXEUSA-N Asn-Thr-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC(=O)N)N)O GOPFMQJUQDLUFW-LKXGYXEUSA-N 0.000 description 1
- WUQXMTITJLFXAU-JIOCBJNQSA-N Asn-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)N)N)O WUQXMTITJLFXAU-JIOCBJNQSA-N 0.000 description 1
- IPPFAOCLQSGHJV-WFBYXXMGSA-N Asn-Trp-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](C)C(O)=O IPPFAOCLQSGHJV-WFBYXXMGSA-N 0.000 description 1
- DXHINQUXBZNUCF-MELADBBJSA-N Asn-Tyr-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=C(C=C2)O)NC(=O)[C@H](CC(=O)N)N)C(=O)O DXHINQUXBZNUCF-MELADBBJSA-N 0.000 description 1
- DPWDPEVGACCWTC-SRVKXCTJSA-N Asn-Tyr-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(O)=O DPWDPEVGACCWTC-SRVKXCTJSA-N 0.000 description 1
- HPNDBHLITCHRSO-WHFBIAKZSA-N Asp-Ala-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(=O)NCC(O)=O HPNDBHLITCHRSO-WHFBIAKZSA-N 0.000 description 1
- NJIKKGUVGUBICV-ZLUOBGJFSA-N Asp-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC(O)=O NJIKKGUVGUBICV-ZLUOBGJFSA-N 0.000 description 1
- IXIWEFWRKIUMQX-DCAQKATOSA-N Asp-Arg-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@@H](N)CC(O)=O IXIWEFWRKIUMQX-DCAQKATOSA-N 0.000 description 1
- BUVNWKQBMZLCDW-UGYAYLCHSA-N Asp-Asn-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O BUVNWKQBMZLCDW-UGYAYLCHSA-N 0.000 description 1
- ZCKYZTGLXIEOKS-CIUDSAMLSA-N Asp-Asp-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC(=O)O)N ZCKYZTGLXIEOKS-CIUDSAMLSA-N 0.000 description 1
- APYNREQHZOGYHV-ACZMJKKPSA-N Asp-Cys-Gln Chemical compound C(CC(=O)N)[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CC(=O)O)N APYNREQHZOGYHV-ACZMJKKPSA-N 0.000 description 1
- DZQKLNLLWFQONU-LKXGYXEUSA-N Asp-Cys-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CC(=O)O)N)O DZQKLNLLWFQONU-LKXGYXEUSA-N 0.000 description 1
- BKXPJCBEHWFSTF-ACZMJKKPSA-N Asp-Gln-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O BKXPJCBEHWFSTF-ACZMJKKPSA-N 0.000 description 1
- RATOMFTUDRYMKX-ACZMJKKPSA-N Asp-Glu-Cys Chemical compound C(CC(=O)O)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC(=O)O)N RATOMFTUDRYMKX-ACZMJKKPSA-N 0.000 description 1
- PDECQIHABNQRHN-GUBZILKMSA-N Asp-Glu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC(O)=O PDECQIHABNQRHN-GUBZILKMSA-N 0.000 description 1
- LTXGDRFJRZSZAV-CIUDSAMLSA-N Asp-Glu-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC(=O)O)N LTXGDRFJRZSZAV-CIUDSAMLSA-N 0.000 description 1
- WBDWQKRLTVCDSY-WHFBIAKZSA-N Asp-Gly-Asp Chemical compound OC(=O)C[C@H](N)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O WBDWQKRLTVCDSY-WHFBIAKZSA-N 0.000 description 1
- OMMIEVATLAGRCK-BYPYZUCNSA-N Asp-Gly-Gly Chemical compound OC(=O)C[C@H](N)C(=O)NCC(=O)NCC(O)=O OMMIEVATLAGRCK-BYPYZUCNSA-N 0.000 description 1
- RQYMKRMRZWJGHC-BQBZGAKWSA-N Asp-Gly-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CC(=O)O)N RQYMKRMRZWJGHC-BQBZGAKWSA-N 0.000 description 1
- WSXDIZFNQYTUJB-SRVKXCTJSA-N Asp-His-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(O)=O WSXDIZFNQYTUJB-SRVKXCTJSA-N 0.000 description 1
- RTXQQDVBACBSCW-CFMVVWHZSA-N Asp-Ile-Tyr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O RTXQQDVBACBSCW-CFMVVWHZSA-N 0.000 description 1
- PAYPSKIBMDHZPI-CIUDSAMLSA-N Asp-Leu-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O PAYPSKIBMDHZPI-CIUDSAMLSA-N 0.000 description 1
- AITKTFCQOBRJTG-CIUDSAMLSA-N Asp-Leu-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC(=O)O)N AITKTFCQOBRJTG-CIUDSAMLSA-N 0.000 description 1
- AYFVRYXNDHBECD-YUMQZZPRSA-N Asp-Leu-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O AYFVRYXNDHBECD-YUMQZZPRSA-N 0.000 description 1
- IVPNEDNYYYFAGI-GARJFASQSA-N Asp-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)O)N IVPNEDNYYYFAGI-GARJFASQSA-N 0.000 description 1
- UMHUHHJMEXNSIV-CIUDSAMLSA-N Asp-Leu-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC(O)=O UMHUHHJMEXNSIV-CIUDSAMLSA-N 0.000 description 1
- ORRJQLIATJDMQM-HJGDQZAQSA-N Asp-Leu-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC(O)=O ORRJQLIATJDMQM-HJGDQZAQSA-N 0.000 description 1
- VSMYBNPOHYAXSD-GUBZILKMSA-N Asp-Lys-Glu Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O VSMYBNPOHYAXSD-GUBZILKMSA-N 0.000 description 1
- RRUWMFBLFLUZSI-LPEHRKFASA-N Asp-Met-Pro Chemical compound CSCC[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)O)N RRUWMFBLFLUZSI-LPEHRKFASA-N 0.000 description 1
- WOPJVEMFXYHZEE-SRVKXCTJSA-N Asp-Phe-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O WOPJVEMFXYHZEE-SRVKXCTJSA-N 0.000 description 1
- KRQFMDNIUOVRIF-KKUMJFAQSA-N Asp-Phe-His Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)NC(=O)[C@H](CC(=O)O)N KRQFMDNIUOVRIF-KKUMJFAQSA-N 0.000 description 1
- RPUYTJJZXQBWDT-SRVKXCTJSA-N Asp-Phe-Ser Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC(=O)O)N RPUYTJJZXQBWDT-SRVKXCTJSA-N 0.000 description 1
- KGHLGJAXYSVNJP-WHFBIAKZSA-N Asp-Ser-Gly Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O KGHLGJAXYSVNJP-WHFBIAKZSA-N 0.000 description 1
- QSFHZPQUAAQHAQ-CIUDSAMLSA-N Asp-Ser-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O QSFHZPQUAAQHAQ-CIUDSAMLSA-N 0.000 description 1
- MGSVBZIBCCKGCY-ZLUOBGJFSA-N Asp-Ser-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O MGSVBZIBCCKGCY-ZLUOBGJFSA-N 0.000 description 1
- GWWSUMLEWKQHLR-NUMRIWBASA-N Asp-Thr-Glu Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CC(=O)O)N)O GWWSUMLEWKQHLR-NUMRIWBASA-N 0.000 description 1
- NJLLRXWFPQQPHV-SRVKXCTJSA-N Asp-Tyr-Asn Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(O)=O NJLLRXWFPQQPHV-SRVKXCTJSA-N 0.000 description 1
- XWKBWZXGNXTDKY-ZKWXMUAHSA-N Asp-Val-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC(O)=O XWKBWZXGNXTDKY-ZKWXMUAHSA-N 0.000 description 1
- UXRVDHVARNBOIO-QSFUFRPTSA-N Asp-Val-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CC(=O)O)N UXRVDHVARNBOIO-QSFUFRPTSA-N 0.000 description 1
- GIKOVDMXBAFXDF-NHCYSSNCSA-N Asp-Val-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O GIKOVDMXBAFXDF-NHCYSSNCSA-N 0.000 description 1
- GZYDPEJSZYZWEF-MXAVVETBSA-N Asp-Val-Val-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC(O)=O GZYDPEJSZYZWEF-MXAVVETBSA-N 0.000 description 1
- 101100315624 Caenorhabditis elegans tyr-1 gene Proteins 0.000 description 1
- 241000282472 Canis lupus familiaris Species 0.000 description 1
- 241000282693 Cercopithecidae Species 0.000 description 1
- 102000004266 Collagen Type IV Human genes 0.000 description 1
- 108010042086 Collagen Type IV Proteins 0.000 description 1
- 208000035473 Communicable disease Diseases 0.000 description 1
- UISYPAHPLXGLNH-ACZMJKKPSA-N Cys-Asn-Gln Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O UISYPAHPLXGLNH-ACZMJKKPSA-N 0.000 description 1
- GSNRZJNHMVMOFV-ACZMJKKPSA-N Cys-Asp-Glu Chemical compound C(CC(=O)O)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CS)N GSNRZJNHMVMOFV-ACZMJKKPSA-N 0.000 description 1
- GGIHYKLJUIZYGH-ZLUOBGJFSA-N Cys-Cys-Asp Chemical compound C([C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CS)N)C(=O)O GGIHYKLJUIZYGH-ZLUOBGJFSA-N 0.000 description 1
- ATPDEYTYWVMINF-ZLUOBGJFSA-N Cys-Cys-Ser Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CS)C(=O)N[C@@H](CO)C(O)=O ATPDEYTYWVMINF-ZLUOBGJFSA-N 0.000 description 1
- SFRQEQGPRTVDPO-NRPADANISA-N Cys-Gln-Val Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O SFRQEQGPRTVDPO-NRPADANISA-N 0.000 description 1
- BCSYBBMFGLHCOA-ACZMJKKPSA-N Cys-Glu-Cys Chemical compound SC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CS)C(O)=O BCSYBBMFGLHCOA-ACZMJKKPSA-N 0.000 description 1
- SBORMUFGKSCGEN-XHNCKOQMSA-N Cys-Glu-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CS)N)C(=O)O SBORMUFGKSCGEN-XHNCKOQMSA-N 0.000 description 1
- CVLIHKBUPSFRQP-WHFBIAKZSA-N Cys-Gly-Ala Chemical compound [H]N[C@@H](CS)C(=O)NCC(=O)N[C@@H](C)C(O)=O CVLIHKBUPSFRQP-WHFBIAKZSA-N 0.000 description 1
- GUKYYUFHWYRMEU-WHFBIAKZSA-N Cys-Gly-Asp Chemical compound [H]N[C@@H](CS)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O GUKYYUFHWYRMEU-WHFBIAKZSA-N 0.000 description 1
- DZSICRGTVPDCRN-YUMQZZPRSA-N Cys-Gly-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CS)N DZSICRGTVPDCRN-YUMQZZPRSA-N 0.000 description 1
- YKKHFPGOZXQAGK-QWRGUYRKSA-N Cys-Gly-Tyr Chemical compound SC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 YKKHFPGOZXQAGK-QWRGUYRKSA-N 0.000 description 1
- WAJDEKCJRKGRPG-CIUDSAMLSA-N Cys-His-Ser Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CS)N WAJDEKCJRKGRPG-CIUDSAMLSA-N 0.000 description 1
- ZLHPWFSAUJEEAN-KBIXCLLPSA-N Cys-Ile-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CS)N ZLHPWFSAUJEEAN-KBIXCLLPSA-N 0.000 description 1
- ABLJDBFJPUWQQB-DCAQKATOSA-N Cys-Leu-Arg Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](CS)N ABLJDBFJPUWQQB-DCAQKATOSA-N 0.000 description 1
- WVLZTXGTNGHPBO-SRVKXCTJSA-N Cys-Leu-Leu Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O WVLZTXGTNGHPBO-SRVKXCTJSA-N 0.000 description 1
- VTBGVPWSWJBERH-DCAQKATOSA-N Cys-Leu-Met Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CS)N VTBGVPWSWJBERH-DCAQKATOSA-N 0.000 description 1
- IDFVDSBJNMPBSX-SRVKXCTJSA-N Cys-Lys-Leu Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O IDFVDSBJNMPBSX-SRVKXCTJSA-N 0.000 description 1
- XMVZMBGFIOQONW-GARJFASQSA-N Cys-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)[C@H](CS)N)C(=O)O XMVZMBGFIOQONW-GARJFASQSA-N 0.000 description 1
- LHJDLVVQRJIURS-SRVKXCTJSA-N Cys-Phe-Asp Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CS)N LHJDLVVQRJIURS-SRVKXCTJSA-N 0.000 description 1
- SWJYSDXMTPMBHO-FXQIFTODSA-N Cys-Pro-Ser Chemical compound [H]N[C@@H](CS)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O SWJYSDXMTPMBHO-FXQIFTODSA-N 0.000 description 1
- YXQDRIRSAHTJKM-IMJSIDKUSA-N Cys-Ser Chemical compound SC[C@H](N)C(=O)N[C@@H](CO)C(O)=O YXQDRIRSAHTJKM-IMJSIDKUSA-N 0.000 description 1
- GFAPBMCRSMSGDZ-XGEHTFHBSA-N Cys-Thr-Met Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CS)N)O GFAPBMCRSMSGDZ-XGEHTFHBSA-N 0.000 description 1
- UGPCUUWZXRMCIJ-KKUMJFAQSA-N Cys-Tyr-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CS)N UGPCUUWZXRMCIJ-KKUMJFAQSA-N 0.000 description 1
- JRZMCSIUYGSJKP-ZKWXMUAHSA-N Cys-Val-Asn Chemical compound SC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O JRZMCSIUYGSJKP-ZKWXMUAHSA-N 0.000 description 1
- 239000006144 Dulbecco’s modified Eagle's medium Substances 0.000 description 1
- 241000588724 Escherichia coli Species 0.000 description 1
- KWUSGAIFNHQCBY-DCAQKATOSA-N Gln-Arg-Arg Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O KWUSGAIFNHQCBY-DCAQKATOSA-N 0.000 description 1
- WOACHWLUOFZLGJ-GUBZILKMSA-N Gln-Arg-Gln Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(O)=O WOACHWLUOFZLGJ-GUBZILKMSA-N 0.000 description 1
- PGPJSRSLQNXBDT-YUMQZZPRSA-N Gln-Arg-Gly Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O PGPJSRSLQNXBDT-YUMQZZPRSA-N 0.000 description 1
- LJEPDHWNQXPXMM-NHCYSSNCSA-N Gln-Arg-Val Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(O)=O LJEPDHWNQXPXMM-NHCYSSNCSA-N 0.000 description 1
- CYTSBCIIEHUPDU-ACZMJKKPSA-N Gln-Asp-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O CYTSBCIIEHUPDU-ACZMJKKPSA-N 0.000 description 1
- SXIJQMBEVYWAQT-GUBZILKMSA-N Gln-Asp-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCC(=O)N)N SXIJQMBEVYWAQT-GUBZILKMSA-N 0.000 description 1
- GNDJOCGXGLNCKY-ACZMJKKPSA-N Gln-Cys-Cys Chemical compound N[C@@H](CCC(N)=O)C(=O)N[C@@H](CS)C(=O)N[C@@H](CS)C(O)=O GNDJOCGXGLNCKY-ACZMJKKPSA-N 0.000 description 1
- XKBASPWPBXNVLQ-WDSKDSINSA-N Gln-Gly-Asn Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O XKBASPWPBXNVLQ-WDSKDSINSA-N 0.000 description 1
- ZNTDJIMJKNNSLR-RWRJDSDZSA-N Gln-Ile-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](CCC(=O)N)N ZNTDJIMJKNNSLR-RWRJDSDZSA-N 0.000 description 1
- JKGHMESJHRTHIC-SIUGBPQLSA-N Gln-Ile-Tyr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)[C@H](CCC(=O)N)N JKGHMESJHRTHIC-SIUGBPQLSA-N 0.000 description 1
- CAXXTYYGFYTBPV-IUCAKERBSA-N Gln-Leu-Gly Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O CAXXTYYGFYTBPV-IUCAKERBSA-N 0.000 description 1
- XFAUJGNLHIGXET-AVGNSLFASA-N Gln-Leu-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O XFAUJGNLHIGXET-AVGNSLFASA-N 0.000 description 1
- ZBKUIQNCRIYVGH-SDDRHHMPSA-N Gln-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)N)N ZBKUIQNCRIYVGH-SDDRHHMPSA-N 0.000 description 1
- YPMDZWPZFOZYFG-GUBZILKMSA-N Gln-Leu-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O YPMDZWPZFOZYFG-GUBZILKMSA-N 0.000 description 1
- JRHPEMVLTRADLJ-AVGNSLFASA-N Gln-Lys-Lys Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)N)N JRHPEMVLTRADLJ-AVGNSLFASA-N 0.000 description 1
- CULXMOZETKLBDI-XIRDDKMYSA-N Gln-Met-Trp Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CCC(=O)N)N CULXMOZETKLBDI-XIRDDKMYSA-N 0.000 description 1
- HMIXCETWRYDVMO-GUBZILKMSA-N Gln-Pro-Glu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O HMIXCETWRYDVMO-GUBZILKMSA-N 0.000 description 1
- BYKZWDGMJLNFJY-XKBZYTNZSA-N Gln-Ser-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCC(=O)N)N)O BYKZWDGMJLNFJY-XKBZYTNZSA-N 0.000 description 1
- PAOHIZNRJNIXQY-XQXXSGGOSA-N Gln-Thr-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O PAOHIZNRJNIXQY-XQXXSGGOSA-N 0.000 description 1
- OUBUHIODTNUUTC-WDCWCFNPSA-N Gln-Thr-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)N)N)O OUBUHIODTNUUTC-WDCWCFNPSA-N 0.000 description 1
- HLRLXVPRJJITSK-IFFSRLJSSA-N Gln-Thr-Val Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O HLRLXVPRJJITSK-IFFSRLJSSA-N 0.000 description 1
- JKDBRTNMYXYLHO-JYJNAYRXSA-N Gln-Tyr-Leu Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C(O)=O)CC1=CC=C(O)C=C1 JKDBRTNMYXYLHO-JYJNAYRXSA-N 0.000 description 1
- BBFCMGBMYIAGRS-AUTRQRHGSA-N Gln-Val-Glu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O BBFCMGBMYIAGRS-AUTRQRHGSA-N 0.000 description 1
- QGWXAMDECCKGRU-XVKPBYJWSA-N Gln-Val-Gly Chemical compound CC(C)[C@H](NC(=O)[C@@H](N)CCC(N)=O)C(=O)NCC(O)=O QGWXAMDECCKGRU-XVKPBYJWSA-N 0.000 description 1
- VEYGCDYMOXHJLS-GVXVVHGQSA-N Gln-Val-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O VEYGCDYMOXHJLS-GVXVVHGQSA-N 0.000 description 1
- SOEXCCGNHQBFPV-DLOVCJGASA-N Gln-Val-Val Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C(C)C)C(O)=O SOEXCCGNHQBFPV-DLOVCJGASA-N 0.000 description 1
- SZXSSXUNOALWCH-ACZMJKKPSA-N Glu-Ala-Asn Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(O)=O SZXSSXUNOALWCH-ACZMJKKPSA-N 0.000 description 1
- UTKICHUQEQBDGC-ACZMJKKPSA-N Glu-Ala-Cys Chemical compound C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCC(=O)O)N UTKICHUQEQBDGC-ACZMJKKPSA-N 0.000 description 1
- BPDVTFBJZNBHEU-HGNGGELXSA-N Glu-Ala-His Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 BPDVTFBJZNBHEU-HGNGGELXSA-N 0.000 description 1
- HUWSBFYAGXCXKC-CIUDSAMLSA-N Glu-Ala-Met Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCSC)C(O)=O HUWSBFYAGXCXKC-CIUDSAMLSA-N 0.000 description 1
- ATRHMOJQJWPVBQ-DRZSPHRISA-N Glu-Ala-Phe Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O ATRHMOJQJWPVBQ-DRZSPHRISA-N 0.000 description 1
- CGYDXNKRIMJMLV-GUBZILKMSA-N Glu-Arg-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O CGYDXNKRIMJMLV-GUBZILKMSA-N 0.000 description 1
- KKCUFHUTMKQQCF-SRVKXCTJSA-N Glu-Arg-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O KKCUFHUTMKQQCF-SRVKXCTJSA-N 0.000 description 1
- RJONUNZIMUXUOI-GUBZILKMSA-N Glu-Asn-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCC(=O)O)N RJONUNZIMUXUOI-GUBZILKMSA-N 0.000 description 1
- RDPOETHPAQEGDP-ACZMJKKPSA-N Glu-Asp-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O RDPOETHPAQEGDP-ACZMJKKPSA-N 0.000 description 1
- NADWTMLCUDMDQI-ACZMJKKPSA-N Glu-Asp-Cys Chemical compound C(CC(=O)O)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CS)C(=O)O)N NADWTMLCUDMDQI-ACZMJKKPSA-N 0.000 description 1
- XXCDTYBVGMPIOA-FXQIFTODSA-N Glu-Asp-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O XXCDTYBVGMPIOA-FXQIFTODSA-N 0.000 description 1
- ZXLZWUQBRYGDNS-CIUDSAMLSA-N Glu-Cys-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CCC(=O)O)N ZXLZWUQBRYGDNS-CIUDSAMLSA-N 0.000 description 1
- KIMXNQXJJWWVIN-AVGNSLFASA-N Glu-Cys-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CCC(=O)O)N)O KIMXNQXJJWWVIN-AVGNSLFASA-N 0.000 description 1
- PVBBEKPHARMPHX-DCAQKATOSA-N Glu-Gln-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CCC(O)=O PVBBEKPHARMPHX-DCAQKATOSA-N 0.000 description 1
- RFDHKPSHTXZKLL-IHRRRGAJSA-N Glu-Gln-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCC(=O)O)N RFDHKPSHTXZKLL-IHRRRGAJSA-N 0.000 description 1
- AIGROOHQXCACHL-WDSKDSINSA-N Glu-Gly-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H](C)C(O)=O AIGROOHQXCACHL-WDSKDSINSA-N 0.000 description 1
- HILMIYALTUQTRC-XVKPBYJWSA-N Glu-Gly-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O HILMIYALTUQTRC-XVKPBYJWSA-N 0.000 description 1
- CXRWMMRLEMVSEH-PEFMBERDSA-N Glu-Ile-Asn Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O CXRWMMRLEMVSEH-PEFMBERDSA-N 0.000 description 1
- QXDXIXFSFHUYAX-MNXVOIDGSA-N Glu-Ile-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCC(O)=O QXDXIXFSFHUYAX-MNXVOIDGSA-N 0.000 description 1
- GXMXPCXXKVWOSM-KQXIARHKSA-N Glu-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)O)N GXMXPCXXKVWOSM-KQXIARHKSA-N 0.000 description 1
- HVYWQYLBVXMXSV-GUBZILKMSA-N Glu-Leu-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O HVYWQYLBVXMXSV-GUBZILKMSA-N 0.000 description 1
- PJBVXVBTTFZPHJ-GUBZILKMSA-N Glu-Leu-Asp Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CCC(=O)O)N PJBVXVBTTFZPHJ-GUBZILKMSA-N 0.000 description 1
- VGBSZQSKQRMLHD-MNXVOIDGSA-N Glu-Leu-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O VGBSZQSKQRMLHD-MNXVOIDGSA-N 0.000 description 1
- NJCALAAIGREHDR-WDCWCFNPSA-N Glu-Leu-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O NJCALAAIGREHDR-WDCWCFNPSA-N 0.000 description 1
- ZGEJRLJEAMPEDV-SRVKXCTJSA-N Glu-Lys-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CCC(=O)O)N ZGEJRLJEAMPEDV-SRVKXCTJSA-N 0.000 description 1
- SOEPMWQCTJITPZ-SRVKXCTJSA-N Glu-Met-Lys Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)O)N SOEPMWQCTJITPZ-SRVKXCTJSA-N 0.000 description 1
- QNJNPKSWAHPYGI-JYJNAYRXSA-N Glu-Phe-Leu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C(O)=O)CC1=CC=CC=C1 QNJNPKSWAHPYGI-JYJNAYRXSA-N 0.000 description 1
- YUXIEONARHPUTK-JBACZVJFSA-N Glu-Phe-Trp Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CNC3=CC=CC=C32)C(=O)O)NC(=O)[C@H](CCC(=O)O)N YUXIEONARHPUTK-JBACZVJFSA-N 0.000 description 1
- UDEPRBFQTWGLCW-CIUDSAMLSA-N Glu-Pro-Asp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O UDEPRBFQTWGLCW-CIUDSAMLSA-N 0.000 description 1
- SYWCGQOIIARSIX-SRVKXCTJSA-N Glu-Pro-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O SYWCGQOIIARSIX-SRVKXCTJSA-N 0.000 description 1
- BPLNJYHNAJVLRT-ACZMJKKPSA-N Glu-Ser-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O BPLNJYHNAJVLRT-ACZMJKKPSA-N 0.000 description 1
- GMVCSRBOSIUTFC-FXQIFTODSA-N Glu-Ser-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(O)=O GMVCSRBOSIUTFC-FXQIFTODSA-N 0.000 description 1
- SYAYROHMAIHWFB-KBIXCLLPSA-N Glu-Ser-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O SYAYROHMAIHWFB-KBIXCLLPSA-N 0.000 description 1
- VNCNWQPIQYAMAK-ACZMJKKPSA-N Glu-Ser-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O VNCNWQPIQYAMAK-ACZMJKKPSA-N 0.000 description 1
- JWNZHMSRZXXGTM-XKBZYTNZSA-N Glu-Ser-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O JWNZHMSRZXXGTM-XKBZYTNZSA-N 0.000 description 1
- WXONSNSSBYQGNN-AVGNSLFASA-N Glu-Ser-Tyr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O WXONSNSSBYQGNN-AVGNSLFASA-N 0.000 description 1
- HZISRJBYZAODRV-XQXXSGGOSA-N Glu-Thr-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O HZISRJBYZAODRV-XQXXSGGOSA-N 0.000 description 1
- CAQXJMUDOLSBPF-SUSMZKCASA-N Glu-Thr-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O CAQXJMUDOLSBPF-SUSMZKCASA-N 0.000 description 1
- KIEICAOUSNYOLM-NRPADANISA-N Glu-Val-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O KIEICAOUSNYOLM-NRPADANISA-N 0.000 description 1
- FGGKGJHCVMYGCD-UKJIMTQDSA-N Glu-Val-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O FGGKGJHCVMYGCD-UKJIMTQDSA-N 0.000 description 1
- VIPDPMHGICREIS-GVXVVHGQSA-N Glu-Val-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O VIPDPMHGICREIS-GVXVVHGQSA-N 0.000 description 1
- FVGOGEGGQLNZGH-DZKIICNBSA-N Glu-Val-Phe Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 FVGOGEGGQLNZGH-DZKIICNBSA-N 0.000 description 1
- SOYWRINXUSUWEQ-DLOVCJGASA-N Glu-Val-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCC(O)=O SOYWRINXUSUWEQ-DLOVCJGASA-N 0.000 description 1
- RLFSBAPJTYKSLG-WHFBIAKZSA-N Gly-Ala-Asp Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(O)=O RLFSBAPJTYKSLG-WHFBIAKZSA-N 0.000 description 1
- GQGAFTPXAPKSCF-WHFBIAKZSA-N Gly-Ala-Cys Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@@H](CS)C(=O)O GQGAFTPXAPKSCF-WHFBIAKZSA-N 0.000 description 1
- GZUKEVBTYNNUQF-WDSKDSINSA-N Gly-Ala-Gln Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(O)=O GZUKEVBTYNNUQF-WDSKDSINSA-N 0.000 description 1
- LJPIRKICOISLKN-WHFBIAKZSA-N Gly-Ala-Ser Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O LJPIRKICOISLKN-WHFBIAKZSA-N 0.000 description 1
- JXYMPBCYRKWJEE-BQBZGAKWSA-N Gly-Arg-Ala Chemical compound [H]NCC(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(O)=O JXYMPBCYRKWJEE-BQBZGAKWSA-N 0.000 description 1
- JPXNYFOHTHSREU-UWVGGRQHSA-N Gly-Arg-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)CN JPXNYFOHTHSREU-UWVGGRQHSA-N 0.000 description 1
- KFMBRBPXHVMDFN-UWVGGRQHSA-N Gly-Arg-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCNC(N)=N KFMBRBPXHVMDFN-UWVGGRQHSA-N 0.000 description 1
- DTPOVRRYXPJJAZ-FJXKBIBVSA-N Gly-Arg-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCN=C(N)N DTPOVRRYXPJJAZ-FJXKBIBVSA-N 0.000 description 1
- DUYYPIRFTLOAJQ-YUMQZZPRSA-N Gly-Asn-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)CN DUYYPIRFTLOAJQ-YUMQZZPRSA-N 0.000 description 1
- XRTDOIOIBMAXCT-NKWVEPMBSA-N Gly-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)CN)C(=O)O XRTDOIOIBMAXCT-NKWVEPMBSA-N 0.000 description 1
- XQHSBNVACKQWAV-WHFBIAKZSA-N Gly-Asp-Asn Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O XQHSBNVACKQWAV-WHFBIAKZSA-N 0.000 description 1
- FUTAPPOITCCWTH-WHFBIAKZSA-N Gly-Asp-Asp Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O FUTAPPOITCCWTH-WHFBIAKZSA-N 0.000 description 1
- RPLLQZBOVIVGMX-QWRGUYRKSA-N Gly-Asp-Phe Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O RPLLQZBOVIVGMX-QWRGUYRKSA-N 0.000 description 1
- LCNXZQROPKFGQK-WHFBIAKZSA-N Gly-Asp-Ser Chemical compound NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O LCNXZQROPKFGQK-WHFBIAKZSA-N 0.000 description 1
- TZOVVRJYUDETQG-RCOVLWMOSA-N Gly-Asp-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)CN TZOVVRJYUDETQG-RCOVLWMOSA-N 0.000 description 1
- GVVKYKCOFMMTKZ-WHFBIAKZSA-N Gly-Cys-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CS)NC(=O)CN GVVKYKCOFMMTKZ-WHFBIAKZSA-N 0.000 description 1
- LEGMTEAZGRRIMY-ZKWXMUAHSA-N Gly-Cys-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)CN LEGMTEAZGRRIMY-ZKWXMUAHSA-N 0.000 description 1
- UEGIPZAXNBYCCP-NKWVEPMBSA-N Gly-Cys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CS)NC(=O)CN)C(=O)O UEGIPZAXNBYCCP-NKWVEPMBSA-N 0.000 description 1
- QCTLGOYODITHPQ-WHFBIAKZSA-N Gly-Cys-Ser Chemical compound [H]NCC(=O)N[C@@H](CS)C(=O)N[C@@H](CO)C(O)=O QCTLGOYODITHPQ-WHFBIAKZSA-N 0.000 description 1
- DTRUBYPMMVPQPD-YUMQZZPRSA-N Gly-Gln-Arg Chemical compound [H]NCC(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O DTRUBYPMMVPQPD-YUMQZZPRSA-N 0.000 description 1
- BULIVUZUDBHKKZ-WDSKDSINSA-N Gly-Gln-Asn Chemical compound NCC(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O BULIVUZUDBHKKZ-WDSKDSINSA-N 0.000 description 1
- DHDOADIPGZTAHT-YUMQZZPRSA-N Gly-Glu-Arg Chemical compound NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N DHDOADIPGZTAHT-YUMQZZPRSA-N 0.000 description 1
- FIQQRCFQXGLOSZ-WDSKDSINSA-N Gly-Glu-Asp Chemical compound [H]NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O FIQQRCFQXGLOSZ-WDSKDSINSA-N 0.000 description 1
- XTQFHTHIAKKCTM-YFKPBYRVSA-N Gly-Glu-Gly Chemical compound NCC(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O XTQFHTHIAKKCTM-YFKPBYRVSA-N 0.000 description 1
- JSNNHGHYGYMVCK-XVKPBYJWSA-N Gly-Glu-Val Chemical compound [H]NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O JSNNHGHYGYMVCK-XVKPBYJWSA-N 0.000 description 1
- UFPXDFOYHVEIPI-BYPYZUCNSA-N Gly-Gly-Asp Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CC(O)=O UFPXDFOYHVEIPI-BYPYZUCNSA-N 0.000 description 1
- VIIBEIQMLJEUJG-LAEOZQHASA-N Gly-Ile-Gln Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(N)=O)C(O)=O VIIBEIQMLJEUJG-LAEOZQHASA-N 0.000 description 1
- FCKPEGOCSVZPNC-WHOFXGATSA-N Gly-Ile-Phe Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 FCKPEGOCSVZPNC-WHOFXGATSA-N 0.000 description 1
- SCWYHUQOOFRVHP-MBLNEYKQSA-N Gly-Ile-Thr Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SCWYHUQOOFRVHP-MBLNEYKQSA-N 0.000 description 1
- IUZGUFAJDBHQQV-YUMQZZPRSA-N Gly-Leu-Asn Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O IUZGUFAJDBHQQV-YUMQZZPRSA-N 0.000 description 1
- UUYBFNKHOCJCHT-VHSXEESVSA-N Gly-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)CN UUYBFNKHOCJCHT-VHSXEESVSA-N 0.000 description 1
- CLNSYANKYVMZNM-UWVGGRQHSA-N Gly-Lys-Arg Chemical compound NCCCC[C@H](NC(=O)CN)C(=O)N[C@H](C(O)=O)CCCN=C(N)N CLNSYANKYVMZNM-UWVGGRQHSA-N 0.000 description 1
- MHZXESQPPXOING-KBPBESRZSA-N Gly-Lys-Phe Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O MHZXESQPPXOING-KBPBESRZSA-N 0.000 description 1
- WDEHMRNSGHVNOH-VHSXEESVSA-N Gly-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)CN)C(=O)O WDEHMRNSGHVNOH-VHSXEESVSA-N 0.000 description 1
- NTBOEZICHOSJEE-YUMQZZPRSA-N Gly-Lys-Ser Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O NTBOEZICHOSJEE-YUMQZZPRSA-N 0.000 description 1
- CVFOYJJOZYYEPE-KBPBESRZSA-N Gly-Lys-Tyr Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O CVFOYJJOZYYEPE-KBPBESRZSA-N 0.000 description 1
- DBJYVKDPGIFXFO-BQBZGAKWSA-N Gly-Met-Ala Chemical compound [H]NCC(=O)N[C@@H](CCSC)C(=O)N[C@@H](C)C(O)=O DBJYVKDPGIFXFO-BQBZGAKWSA-N 0.000 description 1
- BBTCXWTXOXUNFX-IUCAKERBSA-N Gly-Met-Arg Chemical compound CSCC[C@H](NC(=O)CN)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O BBTCXWTXOXUNFX-IUCAKERBSA-N 0.000 description 1
- IFHJOBKVXBESRE-YUMQZZPRSA-N Gly-Met-Gln Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)CN IFHJOBKVXBESRE-YUMQZZPRSA-N 0.000 description 1
- RUDRIZRGOLQSMX-IUCAKERBSA-N Gly-Met-Met Chemical compound [H]NCC(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCSC)C(O)=O RUDRIZRGOLQSMX-IUCAKERBSA-N 0.000 description 1
- QGDOOCIPHSSADO-STQMWFEESA-N Gly-Met-Phe Chemical compound [H]NCC(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O QGDOOCIPHSSADO-STQMWFEESA-N 0.000 description 1
- YYXJFBMCOUSYSF-RYUDHWBXSA-N Gly-Phe-Gln Chemical compound [H]NCC(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O YYXJFBMCOUSYSF-RYUDHWBXSA-N 0.000 description 1
- QSQXZZCGPXQBPP-BQBZGAKWSA-N Gly-Pro-Cys Chemical compound C1C[C@H](N(C1)C(=O)CN)C(=O)N[C@@H](CS)C(=O)O QSQXZZCGPXQBPP-BQBZGAKWSA-N 0.000 description 1
- NSVOVKWEKGEOQB-LURJTMIESA-N Gly-Pro-Gly Chemical compound NCC(=O)N1CCC[C@H]1C(=O)NCC(O)=O NSVOVKWEKGEOQB-LURJTMIESA-N 0.000 description 1
- SSFWXSNOKDZNHY-QXEWZRGKSA-N Gly-Pro-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)CN SSFWXSNOKDZNHY-QXEWZRGKSA-N 0.000 description 1
- GAAHQHNCMIAYEX-UWVGGRQHSA-N Gly-Pro-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)CN GAAHQHNCMIAYEX-UWVGGRQHSA-N 0.000 description 1
- IRJWAYCXIYUHQE-WHFBIAKZSA-N Gly-Ser-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CO)NC(=O)CN IRJWAYCXIYUHQE-WHFBIAKZSA-N 0.000 description 1
- CSMYMGFCEJWALV-WDSKDSINSA-N Gly-Ser-Gln Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(N)=O CSMYMGFCEJWALV-WDSKDSINSA-N 0.000 description 1
- SOEGEPHNZOISMT-BYPYZUCNSA-N Gly-Ser-Gly Chemical compound NCC(=O)N[C@@H](CO)C(=O)NCC(O)=O SOEGEPHNZOISMT-BYPYZUCNSA-N 0.000 description 1
- VNNRLUNBJSWZPF-ZKWXMUAHSA-N Gly-Ser-Ile Chemical compound [H]NCC(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O VNNRLUNBJSWZPF-ZKWXMUAHSA-N 0.000 description 1
- POJJAZJHBGXEGM-YUMQZZPRSA-N Gly-Ser-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)CN POJJAZJHBGXEGM-YUMQZZPRSA-N 0.000 description 1
- YABRDIBSPZONIY-BQBZGAKWSA-N Gly-Ser-Met Chemical compound [H]NCC(=O)N[C@@H](CO)C(=O)N[C@@H](CCSC)C(O)=O YABRDIBSPZONIY-BQBZGAKWSA-N 0.000 description 1
- CQMFNTVQVLQRLT-JHEQGTHGSA-N Gly-Thr-Gln Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(O)=O CQMFNTVQVLQRLT-JHEQGTHGSA-N 0.000 description 1
- LLWQVJNHMYBLLK-CDMKHQONSA-N Gly-Thr-Phe Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O LLWQVJNHMYBLLK-CDMKHQONSA-N 0.000 description 1
- FXTUGWXZTFMTIV-GJZGRUSLSA-N Gly-Trp-Arg Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)CN FXTUGWXZTFMTIV-GJZGRUSLSA-N 0.000 description 1
- MREVELMMFOLESM-HOCLYGCPSA-N Gly-Trp-Val Chemical compound [H]NCC(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](C(C)C)C(O)=O MREVELMMFOLESM-HOCLYGCPSA-N 0.000 description 1
- PNUFMLXHOLFRLD-KBPBESRZSA-N Gly-Tyr-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CC=C(O)C=C1 PNUFMLXHOLFRLD-KBPBESRZSA-N 0.000 description 1
- DNAZKGFYFRGZIH-QWRGUYRKSA-N Gly-Tyr-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CC=C(O)C=C1 DNAZKGFYFRGZIH-QWRGUYRKSA-N 0.000 description 1
- GBYYQVBXFVDJPJ-WLTAIBSBSA-N Gly-Tyr-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)CN)O GBYYQVBXFVDJPJ-WLTAIBSBSA-N 0.000 description 1
- DKJWUIYLMLUBDX-XPUUQOCRSA-N Gly-Val-Cys Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CS)C(=O)O DKJWUIYLMLUBDX-XPUUQOCRSA-N 0.000 description 1
- 206010073069 Hepatic cancer Diseases 0.000 description 1
- 108700039791 Hepatitis C virus nucleocapsid Proteins 0.000 description 1
- 206010019799 Hepatitis viral Diseases 0.000 description 1
- VSLXGYMEHVAJBH-DLOVCJGASA-N His-Ala-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O VSLXGYMEHVAJBH-DLOVCJGASA-N 0.000 description 1
- HXKZJLWGSWQKEA-LSJOCFKGSA-N His-Ala-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CN=CN1 HXKZJLWGSWQKEA-LSJOCFKGSA-N 0.000 description 1
- TVQGUFGDVODUIF-LSJOCFKGSA-N His-Arg-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC1=CN=CN1)N TVQGUFGDVODUIF-LSJOCFKGSA-N 0.000 description 1
- VIVSWEBJUHXCDS-DCAQKATOSA-N His-Asn-Met Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCSC)C(O)=O VIVSWEBJUHXCDS-DCAQKATOSA-N 0.000 description 1
- VOEGKUNRHYKYSU-XVYDVKMFSA-N His-Asp-Ala Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O VOEGKUNRHYKYSU-XVYDVKMFSA-N 0.000 description 1
- IMPKSPYRPUXYAP-SZMVWBNQSA-N His-Gln-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC3=CN=CN3)N IMPKSPYRPUXYAP-SZMVWBNQSA-N 0.000 description 1
- ZUPVLBAXUUGKKN-VHSXEESVSA-N His-Gly-Pro Chemical compound C1C[C@@H](N(C1)C(=O)CNC(=O)[C@H](CC2=CN=CN2)N)C(=O)O ZUPVLBAXUUGKKN-VHSXEESVSA-N 0.000 description 1
- BDFCIKANUNMFGB-PMVVWTBXSA-N His-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC1=CN=CN1 BDFCIKANUNMFGB-PMVVWTBXSA-N 0.000 description 1
- NTXIJPDAHXSHNL-ONGXEEELSA-N His-Gly-Val Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O NTXIJPDAHXSHNL-ONGXEEELSA-N 0.000 description 1
- CSTNMMIHMYJGFR-IHRRRGAJSA-N His-His-Arg Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1NC=NC=1)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O)C1=CN=CN1 CSTNMMIHMYJGFR-IHRRRGAJSA-N 0.000 description 1
- CTJHHEQNUNIYNN-SRVKXCTJSA-N His-His-Asn Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(N)=O)C(O)=O CTJHHEQNUNIYNN-SRVKXCTJSA-N 0.000 description 1
- JJHWJUYYTWYXPL-PYJNHQTQSA-N His-Ile-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CC1=CN=CN1 JJHWJUYYTWYXPL-PYJNHQTQSA-N 0.000 description 1
- LBQAHBIVXQSBIR-HVTMNAMFSA-N His-Ile-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N LBQAHBIVXQSBIR-HVTMNAMFSA-N 0.000 description 1
- BXOLYFJYQQRQDJ-MXAVVETBSA-N His-Leu-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC1=CN=CN1)N BXOLYFJYQQRQDJ-MXAVVETBSA-N 0.000 description 1
- YAALVYQFVJNXIV-KKUMJFAQSA-N His-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CN=CN1 YAALVYQFVJNXIV-KKUMJFAQSA-N 0.000 description 1
- CTEMYIWDSVICKS-WDSOQIARSA-N His-Met-Trp Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CC3=CN=CN3)N CTEMYIWDSVICKS-WDSOQIARSA-N 0.000 description 1
- SGLXGEDPYJPGIQ-ACRUOGEOSA-N His-Phe-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CC=CC=C2)C(=O)O)NC(=O)[C@H](CC3=CN=CN3)N SGLXGEDPYJPGIQ-ACRUOGEOSA-N 0.000 description 1
- GNBHSMFBUNEWCJ-DCAQKATOSA-N His-Pro-Asn Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O GNBHSMFBUNEWCJ-DCAQKATOSA-N 0.000 description 1
- WCHONUZTYDQMBY-PYJNHQTQSA-N His-Pro-Ile Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WCHONUZTYDQMBY-PYJNHQTQSA-N 0.000 description 1
- KAXZXLSXFWSNNZ-XVYDVKMFSA-N His-Ser-Ala Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O KAXZXLSXFWSNNZ-XVYDVKMFSA-N 0.000 description 1
- PZAJPILZRFPYJJ-SRVKXCTJSA-N His-Ser-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O PZAJPILZRFPYJJ-SRVKXCTJSA-N 0.000 description 1
- UOYGZBIPZYKGSH-SRVKXCTJSA-N His-Ser-Lys Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)O)N UOYGZBIPZYKGSH-SRVKXCTJSA-N 0.000 description 1
- ILUVWFTXAUYOBW-CUJWVEQBSA-N His-Ser-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CC1=CN=CN1)N)O ILUVWFTXAUYOBW-CUJWVEQBSA-N 0.000 description 1
- NBWATNYAUVSAEQ-ZEILLAHLSA-N His-Thr-Thr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N)O NBWATNYAUVSAEQ-ZEILLAHLSA-N 0.000 description 1
- FRDFAWHTPDKRHG-ULQDDVLXSA-N His-Tyr-Arg Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O)C1=CN=CN1 FRDFAWHTPDKRHG-ULQDDVLXSA-N 0.000 description 1
- RNVUQLOKVIPNEM-BZSNNMDCSA-N His-Tyr-Lys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC2=CN=CN2)N)O RNVUQLOKVIPNEM-BZSNNMDCSA-N 0.000 description 1
- GYXDQXPCPASCNR-NHCYSSNCSA-N His-Val-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N GYXDQXPCPASCNR-NHCYSSNCSA-N 0.000 description 1
- KDDKJKKQODQQBR-NHCYSSNCSA-N His-Val-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N KDDKJKKQODQQBR-NHCYSSNCSA-N 0.000 description 1
- CGAMSLMBYJHMDY-ONGXEEELSA-N His-Val-Gly Chemical compound CC(C)[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CC1=CN=CN1)N CGAMSLMBYJHMDY-ONGXEEELSA-N 0.000 description 1
- 241001272567 Hominoidea Species 0.000 description 1
- RWIKBYVJQAJYDP-BJDJZHNGSA-N Ile-Ala-Lys Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCCN RWIKBYVJQAJYDP-BJDJZHNGSA-N 0.000 description 1
- DPTBVFUDCPINIP-JURCDPSOSA-N Ile-Ala-Phe Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 DPTBVFUDCPINIP-JURCDPSOSA-N 0.000 description 1
- HDOYNXLPTRQLAD-JBDRJPRFSA-N Ile-Ala-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(=O)O)N HDOYNXLPTRQLAD-JBDRJPRFSA-N 0.000 description 1
- HLYBGMZJVDHJEO-CYDGBPFRSA-N Ile-Arg-Arg Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N HLYBGMZJVDHJEO-CYDGBPFRSA-N 0.000 description 1
- UNDGQKWQNSTPPW-CYDGBPFRSA-N Ile-Arg-Met Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCSC)C(=O)O)N UNDGQKWQNSTPPW-CYDGBPFRSA-N 0.000 description 1
- NULSANWBUWLTKN-NAKRPEOUSA-N Ile-Arg-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CO)C(=O)O)N NULSANWBUWLTKN-NAKRPEOUSA-N 0.000 description 1
- QTUSJASXLGLJSR-OSUNSFLBSA-N Ile-Arg-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N QTUSJASXLGLJSR-OSUNSFLBSA-N 0.000 description 1
- QADCTXFNLZBZAB-GHCJXIJMSA-N Ile-Asn-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](C)C(=O)O)N QADCTXFNLZBZAB-GHCJXIJMSA-N 0.000 description 1
- YKRIXHPEIZUDDY-GMOBBJLQSA-N Ile-Asn-Arg Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N YKRIXHPEIZUDDY-GMOBBJLQSA-N 0.000 description 1
- SCHZQZPYHBWYEQ-PEFMBERDSA-N Ile-Asn-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N SCHZQZPYHBWYEQ-PEFMBERDSA-N 0.000 description 1
- HDODQNPMSHDXJT-GHCJXIJMSA-N Ile-Asn-Ser Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O HDODQNPMSHDXJT-GHCJXIJMSA-N 0.000 description 1
- XLDYDEDTGMHUCZ-GHCJXIJMSA-N Ile-Asp-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CS)C(=O)O)N XLDYDEDTGMHUCZ-GHCJXIJMSA-N 0.000 description 1
- GYAFMRQGWHXMII-IUKAMOBKSA-N Ile-Asp-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N GYAFMRQGWHXMII-IUKAMOBKSA-N 0.000 description 1
- VQUCKIAECLVLAD-SVSWQMSJSA-N Ile-Cys-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N VQUCKIAECLVLAD-SVSWQMSJSA-N 0.000 description 1
- WNQKUUQIVDDAFA-ZPFDUUQYSA-N Ile-Gln-Met Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCSC)C(=O)O)N WNQKUUQIVDDAFA-ZPFDUUQYSA-N 0.000 description 1
- HTDRTKMNJRRYOJ-SIUGBPQLSA-N Ile-Gln-Tyr Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 HTDRTKMNJRRYOJ-SIUGBPQLSA-N 0.000 description 1
- JDAWAWXGAUZPNJ-ZPFDUUQYSA-N Ile-Glu-Arg Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N JDAWAWXGAUZPNJ-ZPFDUUQYSA-N 0.000 description 1
- XLCZWMJPVGRWHJ-KQXIARHKSA-N Ile-Glu-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N XLCZWMJPVGRWHJ-KQXIARHKSA-N 0.000 description 1
- WUKLZPHVWAMZQV-UKJIMTQDSA-N Ile-Glu-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](C(C)C)C(=O)O)N WUKLZPHVWAMZQV-UKJIMTQDSA-N 0.000 description 1
- DFFTXLCCDFYRKD-MBLNEYKQSA-N Ile-Gly-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(=O)O)N DFFTXLCCDFYRKD-MBLNEYKQSA-N 0.000 description 1
- RIVKTKFVWXRNSJ-GRLWGSQLSA-N Ile-Ile-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N RIVKTKFVWXRNSJ-GRLWGSQLSA-N 0.000 description 1
- MTONDYJJCIBZTK-PEDHHIEDSA-N Ile-Ile-Met Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCSC)C(=O)O)N MTONDYJJCIBZTK-PEDHHIEDSA-N 0.000 description 1
- NUKXXNFEUZGPRO-BJDJZHNGSA-N Ile-Leu-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CS)C(=O)O)N NUKXXNFEUZGPRO-BJDJZHNGSA-N 0.000 description 1
- YSGBJIQXTIVBHZ-AJNGGQMLSA-N Ile-Lys-Leu Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O YSGBJIQXTIVBHZ-AJNGGQMLSA-N 0.000 description 1
- AKOYRLRUFBZOSP-BJDJZHNGSA-N Ile-Lys-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)O)N AKOYRLRUFBZOSP-BJDJZHNGSA-N 0.000 description 1
- MASWXTFJVNRZPT-NAKRPEOUSA-N Ile-Met-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C)C(=O)O)N MASWXTFJVNRZPT-NAKRPEOUSA-N 0.000 description 1
- IALVDKNUFSTICJ-GMOBBJLQSA-N Ile-Met-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(=O)O)C(=O)O)N IALVDKNUFSTICJ-GMOBBJLQSA-N 0.000 description 1
- NNVXABCGXOLIEB-PYJNHQTQSA-N Ile-Met-His Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCSC)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 NNVXABCGXOLIEB-PYJNHQTQSA-N 0.000 description 1
- NPAYJTAXWXJKLO-NAKRPEOUSA-N Ile-Met-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CO)C(=O)O)N NPAYJTAXWXJKLO-NAKRPEOUSA-N 0.000 description 1
- MLSUZXHSNRBDCI-CYDGBPFRSA-N Ile-Pro-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)O)N MLSUZXHSNRBDCI-CYDGBPFRSA-N 0.000 description 1
- JNLSTRPWUXOORL-MMWGEVLESA-N Ile-Ser-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N1CCC[C@@H]1C(=O)O)N JNLSTRPWUXOORL-MMWGEVLESA-N 0.000 description 1
- WLRJHVNFGAOYPS-HJPIBITLSA-N Ile-Ser-Tyr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N WLRJHVNFGAOYPS-HJPIBITLSA-N 0.000 description 1
- RKQAYOWLSFLJEE-SVSWQMSJSA-N Ile-Thr-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CS)C(=O)O)N RKQAYOWLSFLJEE-SVSWQMSJSA-N 0.000 description 1
- WCNWGAUZWWSYDG-SVSWQMSJSA-N Ile-Thr-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)O)N WCNWGAUZWWSYDG-SVSWQMSJSA-N 0.000 description 1
- GNXGAVNTVNOCLL-SIUGBPQLSA-N Ile-Tyr-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N GNXGAVNTVNOCLL-SIUGBPQLSA-N 0.000 description 1
- PRTZQMBYUZFSFA-XEGUGMAKSA-N Ile-Tyr-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)NCC(=O)O)N PRTZQMBYUZFSFA-XEGUGMAKSA-N 0.000 description 1
- GVEODXUBBFDBPW-MGHWNKPDSA-N Ile-Tyr-Leu Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C(O)=O)CC1=CC=C(O)C=C1 GVEODXUBBFDBPW-MGHWNKPDSA-N 0.000 description 1
- QSXSHZIRKTUXNG-STECZYCISA-N Ile-Val-Tyr Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 QSXSHZIRKTUXNG-STECZYCISA-N 0.000 description 1
- RCFDOSNHHZGBOY-UHFFFAOYSA-N L-isoleucyl-L-alanine Natural products CCC(C)C(N)C(=O)NC(C)C(O)=O RCFDOSNHHZGBOY-UHFFFAOYSA-N 0.000 description 1
- RNKSNIBMTUYWSH-YFKPBYRVSA-N L-prolylglycine Chemical compound [O-]C(=O)CNC(=O)[C@@H]1CCC[NH2+]1 RNKSNIBMTUYWSH-YFKPBYRVSA-N 0.000 description 1
- GRZSCTXVCDUIPO-SRVKXCTJSA-N Leu-Arg-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(O)=O GRZSCTXVCDUIPO-SRVKXCTJSA-N 0.000 description 1
- HASRFYOMVPJRPU-SRVKXCTJSA-N Leu-Arg-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCC(O)=O)C(O)=O HASRFYOMVPJRPU-SRVKXCTJSA-N 0.000 description 1
- FJUKMPUELVROGK-IHRRRGAJSA-N Leu-Arg-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N FJUKMPUELVROGK-IHRRRGAJSA-N 0.000 description 1
- YOZCKMXHBYKOMQ-IHRRRGAJSA-N Leu-Arg-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCCN)C(=O)O)N YOZCKMXHBYKOMQ-IHRRRGAJSA-N 0.000 description 1
- DBVWMYGBVFCRBE-CIUDSAMLSA-N Leu-Asn-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O DBVWMYGBVFCRBE-CIUDSAMLSA-N 0.000 description 1
- FIJMQLGQLBLBOL-HJGDQZAQSA-N Leu-Asn-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O FIJMQLGQLBLBOL-HJGDQZAQSA-N 0.000 description 1
- DLFAACQHIRSQGG-CIUDSAMLSA-N Leu-Asp-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O DLFAACQHIRSQGG-CIUDSAMLSA-N 0.000 description 1
- FGNQZXKVAZIMCI-CIUDSAMLSA-N Leu-Asp-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CS)C(=O)O)N FGNQZXKVAZIMCI-CIUDSAMLSA-N 0.000 description 1
- PJYSOYLLTJKZHC-GUBZILKMSA-N Leu-Asp-Gln Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCC(N)=O PJYSOYLLTJKZHC-GUBZILKMSA-N 0.000 description 1
- ULXYQAJWJGLCNR-YUMQZZPRSA-N Leu-Asp-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O ULXYQAJWJGLCNR-YUMQZZPRSA-N 0.000 description 1
- MMEDVBWCMGRKKC-GARJFASQSA-N Leu-Asp-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N MMEDVBWCMGRKKC-GARJFASQSA-N 0.000 description 1
- QCSFMCFHVGTLFF-NHCYSSNCSA-N Leu-Asp-Val Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O QCSFMCFHVGTLFF-NHCYSSNCSA-N 0.000 description 1
- ZYLJULGXQDNXDK-GUBZILKMSA-N Leu-Gln-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O ZYLJULGXQDNXDK-GUBZILKMSA-N 0.000 description 1
- QDSKNVXKLPQNOJ-GVXVVHGQSA-N Leu-Gln-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O QDSKNVXKLPQNOJ-GVXVVHGQSA-N 0.000 description 1
- YVKSMSDXKMSIRX-GUBZILKMSA-N Leu-Glu-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O YVKSMSDXKMSIRX-GUBZILKMSA-N 0.000 description 1
- NEEOBPIXKWSBRF-IUCAKERBSA-N Leu-Glu-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O NEEOBPIXKWSBRF-IUCAKERBSA-N 0.000 description 1
- LAGPXKYZCCTSGQ-JYJNAYRXSA-N Leu-Glu-Phe Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O LAGPXKYZCCTSGQ-JYJNAYRXSA-N 0.000 description 1
- OGUUKPXUTHOIAV-SDDRHHMPSA-N Leu-Glu-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N OGUUKPXUTHOIAV-SDDRHHMPSA-N 0.000 description 1
- OXRLYTYUXAQTHP-YUMQZZPRSA-N Leu-Gly-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)NCC(=O)N[C@@H](C)C(O)=O OXRLYTYUXAQTHP-YUMQZZPRSA-N 0.000 description 1
- PBGDOSARRIJMEV-DLOVCJGASA-N Leu-His-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(O)=O PBGDOSARRIJMEV-DLOVCJGASA-N 0.000 description 1
- VZBIUJURDLFFOE-IHRRRGAJSA-N Leu-His-Arg Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O VZBIUJURDLFFOE-IHRRRGAJSA-N 0.000 description 1
- CFZZDVMBRYFFNU-QWRGUYRKSA-N Leu-His-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)NCC(O)=O CFZZDVMBRYFFNU-QWRGUYRKSA-N 0.000 description 1
- OYQUOLRTJHWVSQ-SRVKXCTJSA-N Leu-His-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(O)=O OYQUOLRTJHWVSQ-SRVKXCTJSA-N 0.000 description 1
- HGFGEMSVBMCFKK-MNXVOIDGSA-N Leu-Ile-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(O)=O HGFGEMSVBMCFKK-MNXVOIDGSA-N 0.000 description 1
- JKSIBWITFMQTOA-XUXIUFHCSA-N Leu-Ile-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O JKSIBWITFMQTOA-XUXIUFHCSA-N 0.000 description 1
- IFMPDNRWZZEZSL-SRVKXCTJSA-N Leu-Leu-Cys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CS)C(O)=O IFMPDNRWZZEZSL-SRVKXCTJSA-N 0.000 description 1
- JNDYEOUZBLOVOF-AVGNSLFASA-N Leu-Leu-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O JNDYEOUZBLOVOF-AVGNSLFASA-N 0.000 description 1
- QNBVTHNJGCOVFA-AVGNSLFASA-N Leu-Leu-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCC(O)=O QNBVTHNJGCOVFA-AVGNSLFASA-N 0.000 description 1
- FAELBUXXFQLUAX-AJNGGQMLSA-N Leu-Leu-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC(C)C FAELBUXXFQLUAX-AJNGGQMLSA-N 0.000 description 1
- FOBUGKUBUJOWAD-IHPCNDPISA-N Leu-Leu-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC(C)C)C(O)=O)=CNC2=C1 FOBUGKUBUJOWAD-IHPCNDPISA-N 0.000 description 1
- HVHRPWQEQHIQJF-AVGNSLFASA-N Leu-Lys-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O HVHRPWQEQHIQJF-AVGNSLFASA-N 0.000 description 1
- BGZCJDGBBUUBHA-KKUMJFAQSA-N Leu-Lys-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O BGZCJDGBBUUBHA-KKUMJFAQSA-N 0.000 description 1
- RTIRBWJPYJYTLO-MELADBBJSA-N Leu-Lys-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N1CCC[C@@H]1C(=O)O)N RTIRBWJPYJYTLO-MELADBBJSA-N 0.000 description 1
- CPONGMJGVIAWEH-DCAQKATOSA-N Leu-Met-Ala Chemical compound CSCC[C@H](NC(=O)[C@@H](N)CC(C)C)C(=O)N[C@@H](C)C(O)=O CPONGMJGVIAWEH-DCAQKATOSA-N 0.000 description 1
- JVTYXRRFZCEPPK-RHYQMDGZSA-N Leu-Met-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](CC(C)C)N)O JVTYXRRFZCEPPK-RHYQMDGZSA-N 0.000 description 1
- RRVCZCNFXIFGRA-DCAQKATOSA-N Leu-Pro-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O RRVCZCNFXIFGRA-DCAQKATOSA-N 0.000 description 1
- HGUUMQWGYCVPKG-DCAQKATOSA-N Leu-Pro-Cys Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CS)C(=O)O)N HGUUMQWGYCVPKG-DCAQKATOSA-N 0.000 description 1
- XWEVVRRSIOBJOO-SRVKXCTJSA-N Leu-Pro-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O XWEVVRRSIOBJOO-SRVKXCTJSA-N 0.000 description 1
- DPURXCQCHSQPAN-AVGNSLFASA-N Leu-Pro-Pro Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 DPURXCQCHSQPAN-AVGNSLFASA-N 0.000 description 1
- IZPVWNSAVUQBGP-CIUDSAMLSA-N Leu-Ser-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O IZPVWNSAVUQBGP-CIUDSAMLSA-N 0.000 description 1
- RGUXWMDNCPMQFB-YUMQZZPRSA-N Leu-Ser-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O RGUXWMDNCPMQFB-YUMQZZPRSA-N 0.000 description 1
- BRTVHXHCUSXYRI-CIUDSAMLSA-N Leu-Ser-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O BRTVHXHCUSXYRI-CIUDSAMLSA-N 0.000 description 1
- AEDWWMMHUGYIFD-HJGDQZAQSA-N Leu-Thr-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(O)=O AEDWWMMHUGYIFD-HJGDQZAQSA-N 0.000 description 1
- LFSQWRSVPNKJGP-WDCWCFNPSA-N Leu-Thr-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCC(O)=O LFSQWRSVPNKJGP-WDCWCFNPSA-N 0.000 description 1
- IDGRADDMTTWOQC-WDSOQIARSA-N Leu-Trp-Arg Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O IDGRADDMTTWOQC-WDSOQIARSA-N 0.000 description 1
- UCRJTSIIAYHOHE-ULQDDVLXSA-N Leu-Tyr-Arg Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N UCRJTSIIAYHOHE-ULQDDVLXSA-N 0.000 description 1
- ISSAURVGLGAPDK-KKUMJFAQSA-N Leu-Tyr-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O ISSAURVGLGAPDK-KKUMJFAQSA-N 0.000 description 1
- ARNIBBOXIAWUOP-MGHWNKPDSA-N Leu-Tyr-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O ARNIBBOXIAWUOP-MGHWNKPDSA-N 0.000 description 1
- JGKHAFUAPZCCDU-BZSNNMDCSA-N Leu-Tyr-Leu Chemical compound CC(C)C[C@H]([NH3+])C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C([O-])=O)CC1=CC=C(O)C=C1 JGKHAFUAPZCCDU-BZSNNMDCSA-N 0.000 description 1
- 102000003960 Ligases Human genes 0.000 description 1
- 108090000364 Ligases Proteins 0.000 description 1
- 239000000232 Lipid Bilayer Substances 0.000 description 1
- 239000012097 Lipofectamine 2000 Substances 0.000 description 1
- FZIJIFCXUCZHOL-CIUDSAMLSA-N Lys-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCCN FZIJIFCXUCZHOL-CIUDSAMLSA-N 0.000 description 1
- BTSXLXFPMZXVPR-DLOVCJGASA-N Lys-Ala-His Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CCCCN)N BTSXLXFPMZXVPR-DLOVCJGASA-N 0.000 description 1
- NFLFJGGKOHYZJF-BJDJZHNGSA-N Lys-Ala-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCCN NFLFJGGKOHYZJF-BJDJZHNGSA-N 0.000 description 1
- WSXTWLJHTLRFLW-SRVKXCTJSA-N Lys-Ala-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCCN)C(O)=O WSXTWLJHTLRFLW-SRVKXCTJSA-N 0.000 description 1
- UWKNTTJNVSYXPC-CIUDSAMLSA-N Lys-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCCN UWKNTTJNVSYXPC-CIUDSAMLSA-N 0.000 description 1
- KNKHAVVBVXKOGX-JXUBOQSCSA-N Lys-Ala-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KNKHAVVBVXKOGX-JXUBOQSCSA-N 0.000 description 1
- YNNPKXBBRZVIRX-IHRRRGAJSA-N Lys-Arg-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O YNNPKXBBRZVIRX-IHRRRGAJSA-N 0.000 description 1
- SWWCDAGDQHTKIE-RHYQMDGZSA-N Lys-Arg-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SWWCDAGDQHTKIE-RHYQMDGZSA-N 0.000 description 1
- NCTDKZKNBDZDOL-GARJFASQSA-N Lys-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCCCN)N)C(=O)O NCTDKZKNBDZDOL-GARJFASQSA-N 0.000 description 1
- HKCCVDWHHTVVPN-CIUDSAMLSA-N Lys-Asp-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O HKCCVDWHHTVVPN-CIUDSAMLSA-N 0.000 description 1
- IWWMPCPLFXFBAF-SRVKXCTJSA-N Lys-Asp-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O IWWMPCPLFXFBAF-SRVKXCTJSA-N 0.000 description 1
- GKFNXYMAMKJSKD-NHCYSSNCSA-N Lys-Asp-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O GKFNXYMAMKJSKD-NHCYSSNCSA-N 0.000 description 1
- RDIILCRAWOSDOQ-CIUDSAMLSA-N Lys-Cys-Asp Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(=O)O)C(=O)O)N RDIILCRAWOSDOQ-CIUDSAMLSA-N 0.000 description 1
- SSYOBDBNBQBSQE-SRVKXCTJSA-N Lys-Cys-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(O)=O SSYOBDBNBQBSQE-SRVKXCTJSA-N 0.000 description 1
- KSFQPRLZAUXXPT-GARJFASQSA-N Lys-Cys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CS)NC(=O)[C@H](CCCCN)N)C(=O)O KSFQPRLZAUXXPT-GARJFASQSA-N 0.000 description 1
- QQUJSUFWEDZQQY-AVGNSLFASA-N Lys-Gln-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(O)=O)CCCCN QQUJSUFWEDZQQY-AVGNSLFASA-N 0.000 description 1
- VEGLGAOVLFODGC-GUBZILKMSA-N Lys-Glu-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O VEGLGAOVLFODGC-GUBZILKMSA-N 0.000 description 1
- FHIAJWBDZVHLAH-YUMQZZPRSA-N Lys-Gly-Ser Chemical compound NCCCC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O FHIAJWBDZVHLAH-YUMQZZPRSA-N 0.000 description 1
- HAUUXTXKJNVIFY-ONGXEEELSA-N Lys-Gly-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O HAUUXTXKJNVIFY-ONGXEEELSA-N 0.000 description 1
- YXTKSLRSRXKXNV-IHRRRGAJSA-N Lys-His-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CCCCN)N YXTKSLRSRXKXNV-IHRRRGAJSA-N 0.000 description 1
- CBNMHRCLYBJIIZ-XUXIUFHCSA-N Lys-Ile-Met Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CCCCN)N CBNMHRCLYBJIIZ-XUXIUFHCSA-N 0.000 description 1
- VMTYLUGCXIEDMV-QWRGUYRKSA-N Lys-Leu-Gly Chemical compound OC(=O)CNC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CCCCN VMTYLUGCXIEDMV-QWRGUYRKSA-N 0.000 description 1
- AIRZWUMAHCDDHR-KKUMJFAQSA-N Lys-Leu-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O AIRZWUMAHCDDHR-KKUMJFAQSA-N 0.000 description 1
- YPLVCBKEPJPBDQ-MELADBBJSA-N Lys-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCCN)N YPLVCBKEPJPBDQ-MELADBBJSA-N 0.000 description 1
- WRODMZBHNNPRLN-SRVKXCTJSA-N Lys-Leu-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O WRODMZBHNNPRLN-SRVKXCTJSA-N 0.000 description 1
- AHFOKDZWPPGJAZ-SRVKXCTJSA-N Lys-Lys-Cys Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CS)C(=O)O)N AHFOKDZWPPGJAZ-SRVKXCTJSA-N 0.000 description 1
- QQPSCXKFDSORFT-IHRRRGAJSA-N Lys-Lys-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCCCN QQPSCXKFDSORFT-IHRRRGAJSA-N 0.000 description 1
- URGPVYGVWLIRGT-DCAQKATOSA-N Lys-Met-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C)C(O)=O URGPVYGVWLIRGT-DCAQKATOSA-N 0.000 description 1
- MSSJJDVQTFTLIF-KBPBESRZSA-N Lys-Phe-Gly Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](Cc1ccccc1)C(=O)NCC(O)=O MSSJJDVQTFTLIF-KBPBESRZSA-N 0.000 description 1
- QBHGXFQJFPWJIH-XUXIUFHCSA-N Lys-Pro-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CCCCN QBHGXFQJFPWJIH-XUXIUFHCSA-N 0.000 description 1
- HYSVGEAWTGPMOA-IHRRRGAJSA-N Lys-Pro-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O HYSVGEAWTGPMOA-IHRRRGAJSA-N 0.000 description 1
- LOGFVTREOLYCPF-RHYQMDGZSA-N Lys-Pro-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CCCCN LOGFVTREOLYCPF-RHYQMDGZSA-N 0.000 description 1
- JMNRXRPBHFGXQX-GUBZILKMSA-N Lys-Ser-Glu Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O JMNRXRPBHFGXQX-GUBZILKMSA-N 0.000 description 1
- MIFFFXHMAHFACR-KATARQTJSA-N Lys-Ser-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CCCCN MIFFFXHMAHFACR-KATARQTJSA-N 0.000 description 1
- YCJCEMKOZOYBEF-OEAJRASXSA-N Lys-Thr-Phe Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O YCJCEMKOZOYBEF-OEAJRASXSA-N 0.000 description 1
- ZVXSESPJMKNIQA-YXMSTPNBSA-N Lys-Thr-Pro-Pro Chemical compound NCCCC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 ZVXSESPJMKNIQA-YXMSTPNBSA-N 0.000 description 1
- ZNAPAUSAUBHENO-IHPCNDPISA-N Lys-Trp-His Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC3=CN=CN3)C(=O)O)NC(=O)[C@H](CCCCN)N ZNAPAUSAUBHENO-IHPCNDPISA-N 0.000 description 1
- OZVXDDFYCQOPFD-XQQFMLRXSA-N Lys-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCCN)N OZVXDDFYCQOPFD-XQQFMLRXSA-N 0.000 description 1
- WYEXWKAWMNJKPN-UBHSHLNASA-N Met-Ala-Phe Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CCSC)N WYEXWKAWMNJKPN-UBHSHLNASA-N 0.000 description 1
- HUKLXYYPZWPXCC-KZVJFYERSA-N Met-Ala-Thr Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O HUKLXYYPZWPXCC-KZVJFYERSA-N 0.000 description 1
- ZEDVFJPQNNBMST-CYDGBPFRSA-N Met-Arg-Ile Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O ZEDVFJPQNNBMST-CYDGBPFRSA-N 0.000 description 1
- SBSIKVMCCJUCBZ-GUBZILKMSA-N Met-Asn-Arg Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CCCNC(N)=N SBSIKVMCCJUCBZ-GUBZILKMSA-N 0.000 description 1
- CAODKDAPYGUMLK-FXQIFTODSA-N Met-Asn-Ser Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O CAODKDAPYGUMLK-FXQIFTODSA-N 0.000 description 1
- TZLYIHDABYBOCJ-FXQIFTODSA-N Met-Asp-Ser Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O TZLYIHDABYBOCJ-FXQIFTODSA-N 0.000 description 1
- LQMHZERGCQJKAH-STQMWFEESA-N Met-Gly-Phe Chemical compound CSCC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 LQMHZERGCQJKAH-STQMWFEESA-N 0.000 description 1
- WPTDJKDGICUFCP-XUXIUFHCSA-N Met-Ile-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)O)NC(=O)[C@H](CCSC)N WPTDJKDGICUFCP-XUXIUFHCSA-N 0.000 description 1
- ZIIMORLEZLVRIP-SRVKXCTJSA-N Met-Leu-Gln Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O ZIIMORLEZLVRIP-SRVKXCTJSA-N 0.000 description 1
- UNPGTBHYKJOCCZ-DCAQKATOSA-N Met-Lys-Ala Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O UNPGTBHYKJOCCZ-DCAQKATOSA-N 0.000 description 1
- IILAGWCGKJSBGB-IHRRRGAJSA-N Met-Phe-Asp Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(=O)O)C(=O)O)N IILAGWCGKJSBGB-IHRRRGAJSA-N 0.000 description 1
- XIGAHPDZLAYQOS-SRVKXCTJSA-N Met-Pro-Pro Chemical compound CSCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 XIGAHPDZLAYQOS-SRVKXCTJSA-N 0.000 description 1
- NHXXGBXJTLRGJI-GUBZILKMSA-N Met-Pro-Ser Chemical compound [H]N[C@@H](CCSC)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O NHXXGBXJTLRGJI-GUBZILKMSA-N 0.000 description 1
- CIDICGYKRUTYLE-FXQIFTODSA-N Met-Ser-Ala Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O CIDICGYKRUTYLE-FXQIFTODSA-N 0.000 description 1
- DSZFTPCSFVWMKP-DCAQKATOSA-N Met-Ser-Lys Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCCN DSZFTPCSFVWMKP-DCAQKATOSA-N 0.000 description 1
- SOAYQFDWEIWPPR-IHRRRGAJSA-N Met-Ser-Tyr Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O SOAYQFDWEIWPPR-IHRRRGAJSA-N 0.000 description 1
- CIIJWIAORKTXAH-FJXKBIBVSA-N Met-Thr-Gly Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O CIIJWIAORKTXAH-FJXKBIBVSA-N 0.000 description 1
- KZKVVWBOGDKHKE-QTKMDUPCSA-N Met-Thr-His Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC1=CNC=N1 KZKVVWBOGDKHKE-QTKMDUPCSA-N 0.000 description 1
- QYIGOFGUOVTAHK-ZJDVBMNYSA-N Met-Thr-Thr Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O QYIGOFGUOVTAHK-ZJDVBMNYSA-N 0.000 description 1
- SGWDZVVIRDOXSG-BPUTZDHNSA-N Met-Trp-Cys Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](N)CCSC)C(=O)N[C@@H](CS)C(O)=O)=CNC2=C1 SGWDZVVIRDOXSG-BPUTZDHNSA-N 0.000 description 1
- YDKYJRZWRJTILC-WDSOQIARSA-N Met-Trp-Lys Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](N)CCSC)C(=O)N[C@@H](CCCCN)C(O)=O)=CNC2=C1 YDKYJRZWRJTILC-WDSOQIARSA-N 0.000 description 1
- HOTNHEUETJELDL-BPNCWPANSA-N Met-Tyr-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CCSC)N HOTNHEUETJELDL-BPNCWPANSA-N 0.000 description 1
- FSTWDRPCQQUJIT-NHCYSSNCSA-N Met-Val-Glu Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CCSC)N FSTWDRPCQQUJIT-NHCYSSNCSA-N 0.000 description 1
- PVSPJQWHEIQTEH-JYJNAYRXSA-N Met-Val-Tyr Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 PVSPJQWHEIQTEH-JYJNAYRXSA-N 0.000 description 1
- 201000002481 Myositis Diseases 0.000 description 1
- WYBVBIHNJWOLCJ-UHFFFAOYSA-N N-L-arginyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CCCN=C(N)N WYBVBIHNJWOLCJ-UHFFFAOYSA-N 0.000 description 1
- 101800000135 N-terminal protein Proteins 0.000 description 1
- 101800000511 Non-structural protein 2 Proteins 0.000 description 1
- 238000000636 Northern blotting Methods 0.000 description 1
- 108700026244 Open Reading Frames Proteins 0.000 description 1
- 241000283973 Oryctolagus cuniculus Species 0.000 description 1
- 101800001452 P1 proteinase Proteins 0.000 description 1
- 229910019142 PO4 Inorganic materials 0.000 description 1
- 239000002033 PVDF binder Substances 0.000 description 1
- 241000282577 Pan troglodytes Species 0.000 description 1
- 229930182555 Penicillin Natural products 0.000 description 1
- JGSARLDLIJGVTE-MBNYWOFBSA-N Penicillin G Chemical compound N([C@H]1[C@H]2SC([C@@H](N2C1=O)C(O)=O)(C)C)C(=O)CC1=CC=CC=C1 JGSARLDLIJGVTE-MBNYWOFBSA-N 0.000 description 1
- KIEPQOIQHFKQLK-PCBIJLKTSA-N Phe-Asn-Ile Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KIEPQOIQHFKQLK-PCBIJLKTSA-N 0.000 description 1
- XMPUYNHKEPFERE-IHRRRGAJSA-N Phe-Asp-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 XMPUYNHKEPFERE-IHRRRGAJSA-N 0.000 description 1
- MQVFHOPCKNTHGT-MELADBBJSA-N Phe-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC2=CC=CC=C2)N)C(=O)O MQVFHOPCKNTHGT-MELADBBJSA-N 0.000 description 1
- QEPZQAPZKIPVDV-KKUMJFAQSA-N Phe-Cys-His Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N QEPZQAPZKIPVDV-KKUMJFAQSA-N 0.000 description 1
- CPTJPDZTFNKFOU-MXAVVETBSA-N Phe-Cys-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CC1=CC=CC=C1)N CPTJPDZTFNKFOU-MXAVVETBSA-N 0.000 description 1
- ABQFNJAFONNUTH-FHWLQOOXSA-N Phe-Gln-Tyr Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O)N ABQFNJAFONNUTH-FHWLQOOXSA-N 0.000 description 1
- MFQXSDWKUXTOPZ-DZKIICNBSA-N Phe-Gln-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC1=CC=CC=C1)N MFQXSDWKUXTOPZ-DZKIICNBSA-N 0.000 description 1
- YCCUXNNKXDGMAM-KKUMJFAQSA-N Phe-Leu-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O YCCUXNNKXDGMAM-KKUMJFAQSA-N 0.000 description 1
- MJAYDXWQQUOURZ-JYJNAYRXSA-N Phe-Lys-Gln Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(O)=O MJAYDXWQQUOURZ-JYJNAYRXSA-N 0.000 description 1
- XZQYIJALMGEUJD-OEAJRASXSA-N Phe-Lys-Thr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XZQYIJALMGEUJD-OEAJRASXSA-N 0.000 description 1
- RYAUPBMDRMJVRM-BVSLBCMMSA-N Phe-Met-Trp Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CC3=CC=CC=C3)N RYAUPBMDRMJVRM-BVSLBCMMSA-N 0.000 description 1
- GPLWGAYGROGDEN-BZSNNMDCSA-N Phe-Phe-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O GPLWGAYGROGDEN-BZSNNMDCSA-N 0.000 description 1
- MGLBSROLWAWCKN-FCLVOEFKSA-N Phe-Phe-Thr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MGLBSROLWAWCKN-FCLVOEFKSA-N 0.000 description 1
- AAERWTUHZKLDLC-IHRRRGAJSA-N Phe-Pro-Asp Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O AAERWTUHZKLDLC-IHRRRGAJSA-N 0.000 description 1
- FZBGMXYQPACKNC-HJWJTTGWSA-N Phe-Pro-Ile Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(O)=O FZBGMXYQPACKNC-HJWJTTGWSA-N 0.000 description 1
- ZJPGOXWRFNKIQL-JYJNAYRXSA-N Phe-Pro-Pro Chemical compound C([C@H](N)C(=O)N1[C@@H](CCC1)C(=O)N1[C@@H](CCC1)C(O)=O)C1=CC=CC=C1 ZJPGOXWRFNKIQL-JYJNAYRXSA-N 0.000 description 1
- IIEOLPMQYRBZCN-SRVKXCTJSA-N Phe-Ser-Cys Chemical compound N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)O IIEOLPMQYRBZCN-SRVKXCTJSA-N 0.000 description 1
- XNMYNGDKJNOKHH-BZSNNMDCSA-N Phe-Ser-Tyr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O XNMYNGDKJNOKHH-BZSNNMDCSA-N 0.000 description 1
- GMWNQSGWWGKTSF-LFSVMHDDSA-N Phe-Thr-Ala Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O GMWNQSGWWGKTSF-LFSVMHDDSA-N 0.000 description 1
- RAGOJJCBGXARPO-XVSYOHENSA-N Phe-Thr-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H]([C@H](O)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 RAGOJJCBGXARPO-XVSYOHENSA-N 0.000 description 1
- XNQMZHLAYFWSGJ-HTUGSXCWSA-N Phe-Thr-Glu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O XNQMZHLAYFWSGJ-HTUGSXCWSA-N 0.000 description 1
- BSKMOCNNLNDIMU-CDMKHQONSA-N Phe-Thr-Gly Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O BSKMOCNNLNDIMU-CDMKHQONSA-N 0.000 description 1
- FGWUALWGCZJQDJ-URLPEUOOSA-N Phe-Thr-Ile Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O FGWUALWGCZJQDJ-URLPEUOOSA-N 0.000 description 1
- BPIFSOUEUYDJRM-DCPHZVHLSA-N Phe-Trp-Ala Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(=O)N[C@@H](C)C(O)=O)C1=CC=CC=C1 BPIFSOUEUYDJRM-DCPHZVHLSA-N 0.000 description 1
- GCFNFKNPCMBHNT-IRXDYDNUSA-N Phe-Tyr-Gly Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)NCC(=O)O)N GCFNFKNPCMBHNT-IRXDYDNUSA-N 0.000 description 1
- CDHURCQGUDNBMA-UBHSHLNASA-N Phe-Val-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 CDHURCQGUDNBMA-UBHSHLNASA-N 0.000 description 1
- VIIRRNQMMIHYHQ-XHSDSOJGSA-N Phe-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CC=CC=C2)N VIIRRNQMMIHYHQ-XHSDSOJGSA-N 0.000 description 1
- 229920001213 Polysorbate 20 Polymers 0.000 description 1
- VXCHGLYSIOOZIS-GUBZILKMSA-N Pro-Ala-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1 VXCHGLYSIOOZIS-GUBZILKMSA-N 0.000 description 1
- IWNOFCGBMSFTBC-CIUDSAMLSA-N Pro-Ala-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O IWNOFCGBMSFTBC-CIUDSAMLSA-N 0.000 description 1
- NUZHSNLQJDYSRW-BZSNNMDCSA-N Pro-Arg-Trp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O NUZHSNLQJDYSRW-BZSNNMDCSA-N 0.000 description 1
- XWYXZPHPYKRYPA-GMOBBJLQSA-N Pro-Asn-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O XWYXZPHPYKRYPA-GMOBBJLQSA-N 0.000 description 1
- MLQVJYMFASXBGZ-IHRRRGAJSA-N Pro-Asn-Tyr Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O MLQVJYMFASXBGZ-IHRRRGAJSA-N 0.000 description 1
- KPDRZQUWJKTMBP-DCAQKATOSA-N Pro-Asp-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@@H]1CCCN1 KPDRZQUWJKTMBP-DCAQKATOSA-N 0.000 description 1
- DEDANIDYQAPTFI-IHRRRGAJSA-N Pro-Asp-Tyr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O DEDANIDYQAPTFI-IHRRRGAJSA-N 0.000 description 1
- OZAPWFHRPINHND-GUBZILKMSA-N Pro-Cys-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CS)C(=O)N[C@@H](C(C)C)C(O)=O OZAPWFHRPINHND-GUBZILKMSA-N 0.000 description 1
- JFNPBBOGGNMSRX-CIUDSAMLSA-N Pro-Gln-Ala Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O JFNPBBOGGNMSRX-CIUDSAMLSA-N 0.000 description 1
- PZSCUPVOJGKHEP-CIUDSAMLSA-N Pro-Gln-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O PZSCUPVOJGKHEP-CIUDSAMLSA-N 0.000 description 1
- WGAQWMRJUFQXMF-ZPFDUUQYSA-N Pro-Gln-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WGAQWMRJUFQXMF-ZPFDUUQYSA-N 0.000 description 1
- SKICPQLTOXGWGO-GARJFASQSA-N Pro-Gln-Pro Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CCC(=O)N)C(=O)N2CCC[C@@H]2C(=O)O SKICPQLTOXGWGO-GARJFASQSA-N 0.000 description 1
- XZONQWUEBAFQPO-HJGDQZAQSA-N Pro-Gln-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XZONQWUEBAFQPO-HJGDQZAQSA-N 0.000 description 1
- WFHYFCWBLSKEMS-KKUMJFAQSA-N Pro-Glu-Phe Chemical compound N([C@@H](CCC(=O)O)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C(=O)[C@@H]1CCCN1 WFHYFCWBLSKEMS-KKUMJFAQSA-N 0.000 description 1
- LGSANCBHSMDFDY-GARJFASQSA-N Pro-Glu-Pro Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CCC(=O)O)C(=O)N2CCC[C@@H]2C(=O)O LGSANCBHSMDFDY-GARJFASQSA-N 0.000 description 1
- LXVLKXPFIDDHJG-CIUDSAMLSA-N Pro-Glu-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O LXVLKXPFIDDHJG-CIUDSAMLSA-N 0.000 description 1
- QGOZJLYCGRYYRW-KKUMJFAQSA-N Pro-Glu-Tyr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O QGOZJLYCGRYYRW-KKUMJFAQSA-N 0.000 description 1
- CLNJSLSHKJECME-BQBZGAKWSA-N Pro-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)[C@@H]1CCCN1 CLNJSLSHKJECME-BQBZGAKWSA-N 0.000 description 1
- DMKWYMWNEKIPFC-IUCAKERBSA-N Pro-Gly-Arg Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](CCCNC(N)=N)C(O)=O DMKWYMWNEKIPFC-IUCAKERBSA-N 0.000 description 1
- UUHXBJHVTVGSKM-BQBZGAKWSA-N Pro-Gly-Asn Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O UUHXBJHVTVGSKM-BQBZGAKWSA-N 0.000 description 1
- VYWNORHENYEQDW-YUMQZZPRSA-N Pro-Gly-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H]1CCCN1 VYWNORHENYEQDW-YUMQZZPRSA-N 0.000 description 1
- FKLSMYYLJHYPHH-UWVGGRQHSA-N Pro-Gly-Leu Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O FKLSMYYLJHYPHH-UWVGGRQHSA-N 0.000 description 1
- HAEGAELAYWSUNC-WPRPVWTQSA-N Pro-Gly-Val Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O HAEGAELAYWSUNC-WPRPVWTQSA-N 0.000 description 1
- YTWNSIDWAFSEEI-RWMBFGLXSA-N Pro-His-Pro Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC2=CN=CN2)C(=O)N3CCC[C@@H]3C(=O)O YTWNSIDWAFSEEI-RWMBFGLXSA-N 0.000 description 1
- FKYKZHOKDOPHSA-DCAQKATOSA-N Pro-Leu-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O FKYKZHOKDOPHSA-DCAQKATOSA-N 0.000 description 1
- VTFXTWDFPTWNJY-RHYQMDGZSA-N Pro-Leu-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O VTFXTWDFPTWNJY-RHYQMDGZSA-N 0.000 description 1
- WIPAMEKBSHNFQE-IUCAKERBSA-N Pro-Met-Gly Chemical compound CSCC[C@@H](C(=O)NCC(=O)O)NC(=O)[C@@H]1CCCN1 WIPAMEKBSHNFQE-IUCAKERBSA-N 0.000 description 1
- BUEIYHBJHCDAMI-UFYCRDLUSA-N Pro-Phe-Phe Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O BUEIYHBJHCDAMI-UFYCRDLUSA-N 0.000 description 1
- HOTVCUAVDQHUDB-UFYCRDLUSA-N Pro-Phe-Tyr Chemical compound C([C@@H](C(=O)O)NC(=O)[C@H](CC=1C=CC=CC=1)NC(=O)[C@H]1NCCC1)C1=CC=C(O)C=C1 HOTVCUAVDQHUDB-UFYCRDLUSA-N 0.000 description 1
- SVXXJYJCRNKDDE-AVGNSLFASA-N Pro-Pro-His Chemical compound C([C@@H](C(=O)O)NC(=O)[C@H]1N(CCC1)C(=O)[C@H]1NCCC1)C1=CN=CN1 SVXXJYJCRNKDDE-AVGNSLFASA-N 0.000 description 1
- DWPXHLIBFQLKLK-CYDGBPFRSA-N Pro-Pro-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 DWPXHLIBFQLKLK-CYDGBPFRSA-N 0.000 description 1
- FDMKYQQYJKYCLV-GUBZILKMSA-N Pro-Pro-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 FDMKYQQYJKYCLV-GUBZILKMSA-N 0.000 description 1
- KBUAPZAZPWNYSW-SRVKXCTJSA-N Pro-Pro-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 KBUAPZAZPWNYSW-SRVKXCTJSA-N 0.000 description 1
- DCHQYSOGURGJST-FJXKBIBVSA-N Pro-Thr-Gly Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O DCHQYSOGURGJST-FJXKBIBVSA-N 0.000 description 1
- GBUNEGKQPSAMNK-QTKMDUPCSA-N Pro-Thr-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@@H]2CCCN2)O GBUNEGKQPSAMNK-QTKMDUPCSA-N 0.000 description 1
- JDJMFMVVJHLWDP-UNQGMJICSA-N Pro-Thr-Phe Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O JDJMFMVVJHLWDP-UNQGMJICSA-N 0.000 description 1
- RMJZWERKFFNNNS-XGEHTFHBSA-N Pro-Thr-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O RMJZWERKFFNNNS-XGEHTFHBSA-N 0.000 description 1
- CXGLFEOYCJFKPR-RCWTZXSCSA-N Pro-Thr-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O CXGLFEOYCJFKPR-RCWTZXSCSA-N 0.000 description 1
- FYXCBXDAMPEHIQ-FHWLQOOXSA-N Pro-Trp-Lys Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC2=CNC3=CC=CC=C32)C(=O)N[C@@H](CCCCN)C(=O)O FYXCBXDAMPEHIQ-FHWLQOOXSA-N 0.000 description 1
- ZAUHSLVPDLNTRZ-QXEWZRGKSA-N Pro-Val-Asn Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O ZAUHSLVPDLNTRZ-QXEWZRGKSA-N 0.000 description 1
- IWUCXVSUMQZMFG-AFCXAGJDSA-N Ribavirin Chemical compound N1=C(C(=O)N)N=CN1[C@H]1[C@H](O)[C@H](O)[C@@H](CO)O1 IWUCXVSUMQZMFG-AFCXAGJDSA-N 0.000 description 1
- 102000006382 Ribonucleases Human genes 0.000 description 1
- 108010083644 Ribonucleases Proteins 0.000 description 1
- 239000012722 SDS sample buffer Substances 0.000 description 1
- CGNLCCVKSWNSDG-UHFFFAOYSA-N SYBR Green I Chemical compound CN(C)CCCN(CCC)C1=CC(C=C2N(C3=CC=CC=C3S2)C)=C2C=CC=CC2=[N+]1C1=CC=CC=C1 CGNLCCVKSWNSDG-UHFFFAOYSA-N 0.000 description 1
- 240000004808 Saccharomyces cerevisiae Species 0.000 description 1
- FIXILCYTSAUERA-FXQIFTODSA-N Ser-Ala-Arg Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O FIXILCYTSAUERA-FXQIFTODSA-N 0.000 description 1
- BTKUIVBNGBFTTP-WHFBIAKZSA-N Ser-Ala-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)NCC(O)=O BTKUIVBNGBFTTP-WHFBIAKZSA-N 0.000 description 1
- IYCBDVBJWDXQRR-FXQIFTODSA-N Ser-Ala-Met Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CCSC)C(O)=O IYCBDVBJWDXQRR-FXQIFTODSA-N 0.000 description 1
- GXXTUIUYTWGPMV-FXQIFTODSA-N Ser-Arg-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(O)=O GXXTUIUYTWGPMV-FXQIFTODSA-N 0.000 description 1
- JJKSSJVYOVRJMZ-FXQIFTODSA-N Ser-Arg-Cys Chemical compound C(C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CO)N)CN=C(N)N JJKSSJVYOVRJMZ-FXQIFTODSA-N 0.000 description 1
- HQTKVSCNCDLXSX-BQBZGAKWSA-N Ser-Arg-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O HQTKVSCNCDLXSX-BQBZGAKWSA-N 0.000 description 1
- QFBNNYNWKYKVJO-DCAQKATOSA-N Ser-Arg-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CO)CCCN=C(N)N QFBNNYNWKYKVJO-DCAQKATOSA-N 0.000 description 1
- QVOGDCQNGLBNCR-FXQIFTODSA-N Ser-Arg-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O QVOGDCQNGLBNCR-FXQIFTODSA-N 0.000 description 1
- OOKCGAYXSNJBGQ-ZLUOBGJFSA-N Ser-Asn-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O OOKCGAYXSNJBGQ-ZLUOBGJFSA-N 0.000 description 1
- FIDMVVBUOCMMJG-CIUDSAMLSA-N Ser-Asn-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CO FIDMVVBUOCMMJG-CIUDSAMLSA-N 0.000 description 1
- UGJRQLURDVGULT-LKXGYXEUSA-N Ser-Asn-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O UGJRQLURDVGULT-LKXGYXEUSA-N 0.000 description 1
- OHKLFYXEOGGGCK-ZLUOBGJFSA-N Ser-Asp-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O OHKLFYXEOGGGCK-ZLUOBGJFSA-N 0.000 description 1
- BNFVPSRLHHPQKS-WHFBIAKZSA-N Ser-Asp-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O BNFVPSRLHHPQKS-WHFBIAKZSA-N 0.000 description 1
- QPFJSHSJFIYDJZ-GHCJXIJMSA-N Ser-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CO QPFJSHSJFIYDJZ-GHCJXIJMSA-N 0.000 description 1
- SWSRFJZZMNLMLY-ZKWXMUAHSA-N Ser-Asp-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O SWSRFJZZMNLMLY-ZKWXMUAHSA-N 0.000 description 1
- TUYBIWUZWJUZDD-ACZMJKKPSA-N Ser-Cys-Gln Chemical compound OC[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@H](C(O)=O)CCC(N)=O TUYBIWUZWJUZDD-ACZMJKKPSA-N 0.000 description 1
- MOVJSUIKUNCVMG-ZLUOBGJFSA-N Ser-Cys-Ser Chemical compound C([C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CO)C(=O)O)N)O MOVJSUIKUNCVMG-ZLUOBGJFSA-N 0.000 description 1
- XWCYBVBLJRWOFR-WDSKDSINSA-N Ser-Gln-Gly Chemical compound OC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O XWCYBVBLJRWOFR-WDSKDSINSA-N 0.000 description 1
- BQWCDDAISCPDQV-XHNCKOQMSA-N Ser-Gln-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CO)N)C(=O)O BQWCDDAISCPDQV-XHNCKOQMSA-N 0.000 description 1
- KJMOINFQVCCSDX-XKBZYTNZSA-N Ser-Gln-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KJMOINFQVCCSDX-XKBZYTNZSA-N 0.000 description 1
- PVDTYLHUWAEYGY-CIUDSAMLSA-N Ser-Glu-Arg Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O PVDTYLHUWAEYGY-CIUDSAMLSA-N 0.000 description 1
- UOLGINIHBRIECN-FXQIFTODSA-N Ser-Glu-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O UOLGINIHBRIECN-FXQIFTODSA-N 0.000 description 1
- BPMRXBZYPGYPJN-WHFBIAKZSA-N Ser-Gly-Asn Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O BPMRXBZYPGYPJN-WHFBIAKZSA-N 0.000 description 1
- UGHCUDLCCVVIJR-VGDYDELISA-N Ser-His-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CO)N UGHCUDLCCVVIJR-VGDYDELISA-N 0.000 description 1
- IFPBAGJBHSNYPR-ZKWXMUAHSA-N Ser-Ile-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O IFPBAGJBHSNYPR-ZKWXMUAHSA-N 0.000 description 1
- RIAKPZVSNBBNRE-BJDJZHNGSA-N Ser-Ile-Leu Chemical compound OC[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O RIAKPZVSNBBNRE-BJDJZHNGSA-N 0.000 description 1
- NLOAIFSWUUFQFR-CIUDSAMLSA-N Ser-Leu-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O NLOAIFSWUUFQFR-CIUDSAMLSA-N 0.000 description 1
- ZIFYDQAFEMIZII-GUBZILKMSA-N Ser-Leu-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O ZIFYDQAFEMIZII-GUBZILKMSA-N 0.000 description 1
- IUXGJEIKJBYKOO-SRVKXCTJSA-N Ser-Leu-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CO)N IUXGJEIKJBYKOO-SRVKXCTJSA-N 0.000 description 1
- UBRMZSHOOIVJPW-SRVKXCTJSA-N Ser-Leu-Lys Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(O)=O UBRMZSHOOIVJPW-SRVKXCTJSA-N 0.000 description 1
- GZSZPKSBVAOGIE-CIUDSAMLSA-N Ser-Lys-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O GZSZPKSBVAOGIE-CIUDSAMLSA-N 0.000 description 1
- OWCVUSJMEBGMOK-YUMQZZPRSA-N Ser-Lys-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O OWCVUSJMEBGMOK-YUMQZZPRSA-N 0.000 description 1
- LPSKHZWBQONOQJ-XIRDDKMYSA-N Ser-Lys-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CO)N LPSKHZWBQONOQJ-XIRDDKMYSA-N 0.000 description 1
- VIIJCAQMJBHSJH-FXQIFTODSA-N Ser-Met-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CO)C(O)=O VIIJCAQMJBHSJH-FXQIFTODSA-N 0.000 description 1
- UGTZYIPOBYXWRW-SRVKXCTJSA-N Ser-Phe-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O UGTZYIPOBYXWRW-SRVKXCTJSA-N 0.000 description 1
- ADJDNJCSPNFFPI-FXQIFTODSA-N Ser-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CO ADJDNJCSPNFFPI-FXQIFTODSA-N 0.000 description 1
- PJIQEIFXZPCWOJ-FXQIFTODSA-N Ser-Pro-Asp Chemical compound [H]N[C@@H](CO)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O PJIQEIFXZPCWOJ-FXQIFTODSA-N 0.000 description 1
- WNDUPCKKKGSKIQ-CIUDSAMLSA-N Ser-Pro-Gln Chemical compound OC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O WNDUPCKKKGSKIQ-CIUDSAMLSA-N 0.000 description 1
- DINQYZRMXGWWTG-GUBZILKMSA-N Ser-Pro-Pro Chemical compound OC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 DINQYZRMXGWWTG-GUBZILKMSA-N 0.000 description 1
- CKDXFSPMIDSMGV-GUBZILKMSA-N Ser-Pro-Val Chemical compound [H]N[C@@H](CO)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O CKDXFSPMIDSMGV-GUBZILKMSA-N 0.000 description 1
- WLJPJRGQRNCIQS-ZLUOBGJFSA-N Ser-Ser-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O WLJPJRGQRNCIQS-ZLUOBGJFSA-N 0.000 description 1
- PPCZVWHJWJFTFN-ZLUOBGJFSA-N Ser-Ser-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O PPCZVWHJWJFTFN-ZLUOBGJFSA-N 0.000 description 1
- GYDFRTRSSXOZCR-ACZMJKKPSA-N Ser-Ser-Glu Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O GYDFRTRSSXOZCR-ACZMJKKPSA-N 0.000 description 1
- AABIBDJHSKIMJK-FXQIFTODSA-N Ser-Ser-Met Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCSC)C(O)=O AABIBDJHSKIMJK-FXQIFTODSA-N 0.000 description 1
- CUXJENOFJXOSOZ-BIIVOSGPSA-N Ser-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CO)N)C(=O)O CUXJENOFJXOSOZ-BIIVOSGPSA-N 0.000 description 1
- DKGRNFUXVTYRAS-UBHSHLNASA-N Ser-Ser-Trp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O DKGRNFUXVTYRAS-UBHSHLNASA-N 0.000 description 1
- XJDMUQCLVSCRSJ-VZFHVOOUSA-N Ser-Thr-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O XJDMUQCLVSCRSJ-VZFHVOOUSA-N 0.000 description 1
- KKKVOZNCLALMPV-XKBZYTNZSA-N Ser-Thr-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O KKKVOZNCLALMPV-XKBZYTNZSA-N 0.000 description 1
- ZSDXEKUKQAKZFE-XAVMHZPKSA-N Ser-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CO)N)O ZSDXEKUKQAKZFE-XAVMHZPKSA-N 0.000 description 1
- ZKOKTQPHFMRSJP-YJRXYDGGSA-N Ser-Thr-Tyr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O ZKOKTQPHFMRSJP-YJRXYDGGSA-N 0.000 description 1
- UQGAAZXSCGWMFU-UBHSHLNASA-N Ser-Trp-Asp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CO)N UQGAAZXSCGWMFU-UBHSHLNASA-N 0.000 description 1
- ZWSZBWAFDZRBNM-UBHSHLNASA-N Ser-Trp-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CO)C(O)=O ZWSZBWAFDZRBNM-UBHSHLNASA-N 0.000 description 1
- GSCVDSBEYVGMJQ-SRVKXCTJSA-N Ser-Tyr-Asp Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CO)N)O GSCVDSBEYVGMJQ-SRVKXCTJSA-N 0.000 description 1
- PLQWGQUNUPMNOD-KKUMJFAQSA-N Ser-Tyr-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(O)=O PLQWGQUNUPMNOD-KKUMJFAQSA-N 0.000 description 1
- PMTWIUBUQRGCSB-FXQIFTODSA-N Ser-Val-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O PMTWIUBUQRGCSB-FXQIFTODSA-N 0.000 description 1
- PCMZJFMUYWIERL-ZKWXMUAHSA-N Ser-Val-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O PCMZJFMUYWIERL-ZKWXMUAHSA-N 0.000 description 1
- BEBVVQPDSHHWQL-NRPADANISA-N Ser-Val-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O BEBVVQPDSHHWQL-NRPADANISA-N 0.000 description 1
- ANOQEBQWIAYIMV-AEJSXWLSSA-N Ser-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CO)N ANOQEBQWIAYIMV-AEJSXWLSSA-N 0.000 description 1
- SIEBDTCABMZCLF-XGEHTFHBSA-N Ser-Val-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SIEBDTCABMZCLF-XGEHTFHBSA-N 0.000 description 1
- 239000008049 TAE buffer Substances 0.000 description 1
- FQPQPTHMHZKGFM-XQXXSGGOSA-N Thr-Ala-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O FQPQPTHMHZKGFM-XQXXSGGOSA-N 0.000 description 1
- ZUXQFMVPAYGPFJ-JXUBOQSCSA-N Thr-Ala-Lys Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCCN ZUXQFMVPAYGPFJ-JXUBOQSCSA-N 0.000 description 1
- GLQFKOVWXPPFTP-VEVYYDQMSA-N Thr-Arg-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O GLQFKOVWXPPFTP-VEVYYDQMSA-N 0.000 description 1
- JMQUAZXYFAEOIH-XGEHTFHBSA-N Thr-Arg-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CS)C(=O)O)N)O JMQUAZXYFAEOIH-XGEHTFHBSA-N 0.000 description 1
- PKXHGEXFMIZSER-QTKMDUPCSA-N Thr-Arg-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N)O PKXHGEXFMIZSER-QTKMDUPCSA-N 0.000 description 1
- GZYNMZQXFRWDFH-YTWAJWBKSA-N Thr-Arg-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N1CCC[C@@H]1C(=O)O)N)O GZYNMZQXFRWDFH-YTWAJWBKSA-N 0.000 description 1
- UNURFMVMXLENAZ-KJEVXHAQSA-N Thr-Arg-Tyr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O UNURFMVMXLENAZ-KJEVXHAQSA-N 0.000 description 1
- YLXAMFZYJTZXFH-OLHMAJIHSA-N Thr-Asn-Asp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N)O YLXAMFZYJTZXFH-OLHMAJIHSA-N 0.000 description 1
- SKHPKKYKDYULDH-HJGDQZAQSA-N Thr-Asn-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O SKHPKKYKDYULDH-HJGDQZAQSA-N 0.000 description 1
- PZVGOVRNGKEFCB-KKHAAJSZSA-N Thr-Asn-Val Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](C(C)C)C(=O)O)N)O PZVGOVRNGKEFCB-KKHAAJSZSA-N 0.000 description 1
- OHAJHDJOCKKJLV-LKXGYXEUSA-N Thr-Asp-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O OHAJHDJOCKKJLV-LKXGYXEUSA-N 0.000 description 1
- NRBUKAHTWRCUEQ-XGEHTFHBSA-N Thr-Cys-Met Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCSC)C(O)=O NRBUKAHTWRCUEQ-XGEHTFHBSA-N 0.000 description 1
- VEWZSFGRQDUAJM-YJRXYDGGSA-N Thr-Cys-Tyr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N)O VEWZSFGRQDUAJM-YJRXYDGGSA-N 0.000 description 1
- VUVCRYXYUUPGSB-GLLZPBPUSA-N Thr-Gln-Glu Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N)O VUVCRYXYUUPGSB-GLLZPBPUSA-N 0.000 description 1
- GARULAKWZGFIKC-RWRJDSDZSA-N Thr-Gln-Ile Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O GARULAKWZGFIKC-RWRJDSDZSA-N 0.000 description 1
- KGKWKSSSQGGYAU-SUSMZKCASA-N Thr-Gln-Thr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N)O KGKWKSSSQGGYAU-SUSMZKCASA-N 0.000 description 1
- FHDLKMFZKRUQCE-HJGDQZAQSA-N Thr-Glu-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O FHDLKMFZKRUQCE-HJGDQZAQSA-N 0.000 description 1
- LGNBRHZANHMZHK-NUMRIWBASA-N Thr-Glu-Asp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N)O LGNBRHZANHMZHK-NUMRIWBASA-N 0.000 description 1
- XOTBWOCSLMBGMF-SUSMZKCASA-N Thr-Glu-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XOTBWOCSLMBGMF-SUSMZKCASA-N 0.000 description 1
- AQAMPXBRJJWPNI-JHEQGTHGSA-N Thr-Gly-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O AQAMPXBRJJWPNI-JHEQGTHGSA-N 0.000 description 1
- YZUWGFXVVZQJEI-PMVVWTBXSA-N Thr-Gly-His Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N)O YZUWGFXVVZQJEI-PMVVWTBXSA-N 0.000 description 1
- UBDDORVPVLEECX-FJXKBIBVSA-N Thr-Gly-Met Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CCSC)C(O)=O UBDDORVPVLEECX-FJXKBIBVSA-N 0.000 description 1
- XSTGOZBBXFKGHA-YJRXYDGGSA-N Thr-His-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N)O XSTGOZBBXFKGHA-YJRXYDGGSA-N 0.000 description 1
- WBCCCPZIJIJTSD-TUBUOCAGSA-N Thr-His-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H]([C@@H](C)O)N WBCCCPZIJIJTSD-TUBUOCAGSA-N 0.000 description 1
- YUOCMLNTUZAGNF-KLHWPWHYSA-N Thr-His-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N2CCC[C@@H]2C(=O)O)N)O YUOCMLNTUZAGNF-KLHWPWHYSA-N 0.000 description 1
- YDWLCDQXLCILCZ-BWAGICSOSA-N Thr-His-Tyr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O YDWLCDQXLCILCZ-BWAGICSOSA-N 0.000 description 1
- CRZNCABIJLRFKZ-IUKAMOBKSA-N Thr-Ile-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H]([C@@H](C)O)N CRZNCABIJLRFKZ-IUKAMOBKSA-N 0.000 description 1
- ADPHPKGWVDHWML-PPCPHDFISA-N Thr-Ile-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)O)NC(=O)[C@H]([C@@H](C)O)N ADPHPKGWVDHWML-PPCPHDFISA-N 0.000 description 1
- LCCSEJSPBWKBNT-OSUNSFLBSA-N Thr-Ile-Met Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H]([C@@H](C)O)N LCCSEJSPBWKBNT-OSUNSFLBSA-N 0.000 description 1
- XIULAFZYEKSGAJ-IXOXFDKPSA-N Thr-Leu-His Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CNC=N1 XIULAFZYEKSGAJ-IXOXFDKPSA-N 0.000 description 1
- FLPZMPOZGYPBEN-PPCPHDFISA-N Thr-Leu-Ile Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O FLPZMPOZGYPBEN-PPCPHDFISA-N 0.000 description 1
- MEJHFIOYJHTWMK-VOAKCMCISA-N Thr-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)[C@@H](C)O MEJHFIOYJHTWMK-VOAKCMCISA-N 0.000 description 1
- NCXVJIQMWSGRHY-KXNHARMFSA-N Thr-Leu-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N)O NCXVJIQMWSGRHY-KXNHARMFSA-N 0.000 description 1
- ISLDRLHVPXABBC-IEGACIPQSA-N Thr-Leu-Trp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O ISLDRLHVPXABBC-IEGACIPQSA-N 0.000 description 1
- KRDSCBLRHORMRK-JXUBOQSCSA-N Thr-Lys-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O KRDSCBLRHORMRK-JXUBOQSCSA-N 0.000 description 1
- CJXURNZYNHCYFD-WDCWCFNPSA-N Thr-Lys-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N)O CJXURNZYNHCYFD-WDCWCFNPSA-N 0.000 description 1
- KKPOGALELPLJTL-MEYUZBJRSA-N Thr-Lys-Tyr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 KKPOGALELPLJTL-MEYUZBJRSA-N 0.000 description 1
- DXPURPNJDFCKKO-RHYQMDGZSA-N Thr-Lys-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)[C@@H](C)O)C(O)=O DXPURPNJDFCKKO-RHYQMDGZSA-N 0.000 description 1
- OHDXOXIZXSFCDN-RCWTZXSCSA-N Thr-Met-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O OHDXOXIZXSFCDN-RCWTZXSCSA-N 0.000 description 1
- PZSDPRBZINDEJV-HTUGSXCWSA-N Thr-Phe-Gln Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O PZSDPRBZINDEJV-HTUGSXCWSA-N 0.000 description 1
- BIBYEFRASCNLAA-CDMKHQONSA-N Thr-Phe-Gly Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CC=CC=C1 BIBYEFRASCNLAA-CDMKHQONSA-N 0.000 description 1
- DNCUODYZAMHLCV-XGEHTFHBSA-N Thr-Pro-Cys Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CS)C(=O)O)N)O DNCUODYZAMHLCV-XGEHTFHBSA-N 0.000 description 1
- PRTHQBSMXILLPC-XGEHTFHBSA-N Thr-Ser-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O PRTHQBSMXILLPC-XGEHTFHBSA-N 0.000 description 1
- NBIIPOKZPUGATB-BWBBJGPYSA-N Thr-Ser-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)O)N)O NBIIPOKZPUGATB-BWBBJGPYSA-N 0.000 description 1
- SGAOHNPSEPVAFP-ZDLURKLDSA-N Thr-Ser-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)NCC(O)=O SGAOHNPSEPVAFP-ZDLURKLDSA-N 0.000 description 1
- VUXIQSUQQYNLJP-XAVMHZPKSA-N Thr-Ser-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CO)C(=O)N1CCC[C@@H]1C(=O)O)N)O VUXIQSUQQYNLJP-XAVMHZPKSA-N 0.000 description 1
- WPSKTVVMQCXPRO-BWBBJGPYSA-N Thr-Ser-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O WPSKTVVMQCXPRO-BWBBJGPYSA-N 0.000 description 1
- CSNBWOJOEOPYIJ-UVOCVTCTSA-N Thr-Thr-Lys Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(O)=O CSNBWOJOEOPYIJ-UVOCVTCTSA-N 0.000 description 1
- PJCYRZVSACOYSN-ZJDVBMNYSA-N Thr-Thr-Met Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCSC)C(O)=O PJCYRZVSACOYSN-ZJDVBMNYSA-N 0.000 description 1
- ZMYCLHFLHRVOEA-HEIBUPTGSA-N Thr-Thr-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O ZMYCLHFLHRVOEA-HEIBUPTGSA-N 0.000 description 1
- VEENWOSZGWWKHW-SZZJOZGLSA-N Thr-Trp-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CC3=CN=CN3)C(=O)O)N)O VEENWOSZGWWKHW-SZZJOZGLSA-N 0.000 description 1
- PELIQFPESHBTMA-WLTAIBSBSA-N Thr-Tyr-Gly Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CC=C(O)C=C1 PELIQFPESHBTMA-WLTAIBSBSA-N 0.000 description 1
- CYCGARJWIQWPQM-YJRXYDGGSA-N Thr-Tyr-Ser Chemical compound C[C@@H](O)[C@H]([NH3+])C(=O)N[C@H](C(=O)N[C@@H](CO)C([O-])=O)CC1=CC=C(O)C=C1 CYCGARJWIQWPQM-YJRXYDGGSA-N 0.000 description 1
- PWONLXBUSVIZPH-RHYQMDGZSA-N Thr-Val-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCCN)C(=O)O)N)O PWONLXBUSVIZPH-RHYQMDGZSA-N 0.000 description 1
- MNYNCKZAEIAONY-XGEHTFHBSA-N Thr-Val-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O MNYNCKZAEIAONY-XGEHTFHBSA-N 0.000 description 1
- VYVBSMCZNHOZGD-RCWTZXSCSA-N Thr-Val-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C(C)C)C(O)=O VYVBSMCZNHOZGD-RCWTZXSCSA-N 0.000 description 1
- 208000007536 Thrombosis Diseases 0.000 description 1
- 102000004357 Transferases Human genes 0.000 description 1
- 108090000992 Transferases Proteins 0.000 description 1
- HYVLNORXQGKONN-NUTKFTJISA-N Trp-Ala-Lys Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCCN)C(O)=O)=CNC2=C1 HYVLNORXQGKONN-NUTKFTJISA-N 0.000 description 1
- UTQBQJNSNXJNIH-IHPCNDPISA-N Trp-Asn-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)N UTQBQJNSNXJNIH-IHPCNDPISA-N 0.000 description 1
- VKMOGXREKGVZAF-QEJZJMRPSA-N Trp-Asp-Gln Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N VKMOGXREKGVZAF-QEJZJMRPSA-N 0.000 description 1
- DEZKIRSBKKXUEV-NYVOZVTQSA-N Trp-Asp-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC3=CNC4=CC=CC=C43)C(=O)O)N DEZKIRSBKKXUEV-NYVOZVTQSA-N 0.000 description 1
- CZSMNLQMRWPGQF-XEGUGMAKSA-N Trp-Gln-Ala Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O CZSMNLQMRWPGQF-XEGUGMAKSA-N 0.000 description 1
- VTHNLRXALGUDBS-BPUTZDHNSA-N Trp-Gln-Glu Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N VTHNLRXALGUDBS-BPUTZDHNSA-N 0.000 description 1
- VMBBTANKMSRJSS-JSGCOSHPSA-N Trp-Glu-Gly Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O VMBBTANKMSRJSS-JSGCOSHPSA-N 0.000 description 1
- SNJAPSVIPKUMCK-NWLDYVSISA-N Trp-Glu-Thr Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SNJAPSVIPKUMCK-NWLDYVSISA-N 0.000 description 1
- AZBIIKDSDLVJAK-VHWLVUOQSA-N Trp-Ile-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N AZBIIKDSDLVJAK-VHWLVUOQSA-N 0.000 description 1
- BONYBFXWMXBAND-GQGQLFGLSA-N Trp-Ile-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N BONYBFXWMXBAND-GQGQLFGLSA-N 0.000 description 1
- KRCPXGSWDOGHAM-XIRDDKMYSA-N Trp-Lys-Asp Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O KRCPXGSWDOGHAM-XIRDDKMYSA-N 0.000 description 1
- PWPJLBWYRTVYQS-PMVMPFDFSA-N Trp-Phe-Leu Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O PWPJLBWYRTVYQS-PMVMPFDFSA-N 0.000 description 1
- UQHPXCFAHVTWFU-BVSLBCMMSA-N Trp-Phe-Val Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(O)=O UQHPXCFAHVTWFU-BVSLBCMMSA-N 0.000 description 1
- STKZKWFOKOCSLW-UMPQAUOISA-N Trp-Thr-Val Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C(C)C)C(O)=O)[C@@H](C)O)=CNC2=C1 STKZKWFOKOCSLW-UMPQAUOISA-N 0.000 description 1
- WVAKXMOGMWLWHK-VJBMBRPKSA-N Trp-Trp-Gln Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC3=CNC4=CC=CC=C43)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N WVAKXMOGMWLWHK-VJBMBRPKSA-N 0.000 description 1
- CRCHQCUINSOGFD-JBACZVJFSA-N Trp-Tyr-Glu Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC3=CC=C(C=C3)O)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N CRCHQCUINSOGFD-JBACZVJFSA-N 0.000 description 1
- UGFOSENEZHEQKX-PJODQICGSA-N Trp-Val-Ala Chemical compound CC(C)[C@H](NC(=O)[C@@H](N)Cc1c[nH]c2ccccc12)C(=O)N[C@@H](C)C(O)=O UGFOSENEZHEQKX-PJODQICGSA-N 0.000 description 1
- VTFWAGGJDRSQFG-MELADBBJSA-N Tyr-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC2=CC=C(C=C2)O)N)C(=O)O VTFWAGGJDRSQFG-MELADBBJSA-N 0.000 description 1
- GAYLGYUVTDMLKC-UWJYBYFXSA-N Tyr-Asp-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 GAYLGYUVTDMLKC-UWJYBYFXSA-N 0.000 description 1
- NLMXVDDEQFKQQU-CFMVVWHZSA-N Tyr-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 NLMXVDDEQFKQQU-CFMVVWHZSA-N 0.000 description 1
- RCLOWEZASFJFEX-KKUMJFAQSA-N Tyr-Asp-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 RCLOWEZASFJFEX-KKUMJFAQSA-N 0.000 description 1
- CNLKDWSAORJEMW-KWQFWETISA-N Tyr-Gly-Ala Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(=O)N[C@@H](C)C(O)=O CNLKDWSAORJEMW-KWQFWETISA-N 0.000 description 1
- MVYRJYISVJWKSX-KBPBESRZSA-N Tyr-His-Gly Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)NCC(=O)O)N)O MVYRJYISVJWKSX-KBPBESRZSA-N 0.000 description 1
- JHORGUYURUBVOM-KKUMJFAQSA-N Tyr-His-Ser Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(O)=O JHORGUYURUBVOM-KKUMJFAQSA-N 0.000 description 1
- HHFMNAVFGBYSAT-IGISWZIWSA-N Tyr-Ile-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N HHFMNAVFGBYSAT-IGISWZIWSA-N 0.000 description 1
- WSFXJLFSJSXGMQ-MGHWNKPDSA-N Tyr-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N WSFXJLFSJSXGMQ-MGHWNKPDSA-N 0.000 description 1
- GULIUBBXCYPDJU-CQDKDKBSSA-N Tyr-Leu-Ala Chemical compound [O-]C(=O)[C@H](C)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H]([NH3+])CC1=CC=C(O)C=C1 GULIUBBXCYPDJU-CQDKDKBSSA-N 0.000 description 1
- KHCSOLAHNLOXJR-BZSNNMDCSA-N Tyr-Leu-Leu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O KHCSOLAHNLOXJR-BZSNNMDCSA-N 0.000 description 1
- WDGDKHLSDIOXQC-ACRUOGEOSA-N Tyr-Leu-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=C(O)C=C1 WDGDKHLSDIOXQC-ACRUOGEOSA-N 0.000 description 1
- DMWNPLOERDAHSY-MEYUZBJRSA-N Tyr-Leu-Thr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O DMWNPLOERDAHSY-MEYUZBJRSA-N 0.000 description 1
- VTCKHZJKWQENKX-KBPBESRZSA-N Tyr-Lys-Gly Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O VTCKHZJKWQENKX-KBPBESRZSA-N 0.000 description 1
- PMHLLBKTDHQMCY-ULQDDVLXSA-N Tyr-Lys-Val Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O PMHLLBKTDHQMCY-ULQDDVLXSA-N 0.000 description 1
- OGPKMBOPMDTEDM-IHRRRGAJSA-N Tyr-Met-Ser Chemical compound CSCC[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N OGPKMBOPMDTEDM-IHRRRGAJSA-N 0.000 description 1
- AUZADXNWQMBZOO-JYJNAYRXSA-N Tyr-Pro-Arg Chemical compound C([C@H](N)C(=O)N1[C@@H](CCC1)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O)C1=CC=C(O)C=C1 AUZADXNWQMBZOO-JYJNAYRXSA-N 0.000 description 1
- VPEFOFYNHBWFNQ-UFYCRDLUSA-N Tyr-Pro-Tyr Chemical compound C([C@H](N)C(=O)N1[C@@H](CCC1)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=C(O)C=C1 VPEFOFYNHBWFNQ-UFYCRDLUSA-N 0.000 description 1
- QPOUERMDWKKZEG-HJPIBITLSA-N Tyr-Ser-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 QPOUERMDWKKZEG-HJPIBITLSA-N 0.000 description 1
- ZZDYJFVIKVSUFA-WLTAIBSBSA-N Tyr-Thr-Gly Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O ZZDYJFVIKVSUFA-WLTAIBSBSA-N 0.000 description 1
- KLQPIEVIKOQRAW-IZPVPAKOSA-N Tyr-Thr-Thr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N)O KLQPIEVIKOQRAW-IZPVPAKOSA-N 0.000 description 1
- ABSXSJZNRAQDDI-KJEVXHAQSA-N Tyr-Val-Thr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ABSXSJZNRAQDDI-KJEVXHAQSA-N 0.000 description 1
- AZSHAZJLOZQYAY-FXQIFTODSA-N Val-Ala-Ser Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O AZSHAZJLOZQYAY-FXQIFTODSA-N 0.000 description 1
- JYVKKBDANPZIAW-AVGNSLFASA-N Val-Arg-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](C(C)C)N JYVKKBDANPZIAW-AVGNSLFASA-N 0.000 description 1
- DNOOLPROHJWCSQ-RCWTZXSCSA-N Val-Arg-Thr Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O DNOOLPROHJWCSQ-RCWTZXSCSA-N 0.000 description 1
- QPZMOUMNTGTEFR-ZKWXMUAHSA-N Val-Asn-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](C(C)C)N QPZMOUMNTGTEFR-ZKWXMUAHSA-N 0.000 description 1
- GNWUWQAVVJQREM-NHCYSSNCSA-N Val-Asn-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N GNWUWQAVVJQREM-NHCYSSNCSA-N 0.000 description 1
- DDNIHOWRDOXXPF-NGZCFLSTSA-N Val-Asp-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N DDNIHOWRDOXXPF-NGZCFLSTSA-N 0.000 description 1
- VXCAZHCVDBQMTP-NRPADANISA-N Val-Cys-Gln Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N VXCAZHCVDBQMTP-NRPADANISA-N 0.000 description 1
- KOPBYUSPXBQIHD-NRPADANISA-N Val-Cys-Glu Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N KOPBYUSPXBQIHD-NRPADANISA-N 0.000 description 1
- OUUBKKIJQIAPRI-LAEOZQHASA-N Val-Gln-Asn Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O OUUBKKIJQIAPRI-LAEOZQHASA-N 0.000 description 1
- OXVPMZVGCAPFIG-BQFCYCMXSA-N Val-Gln-Trp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N OXVPMZVGCAPFIG-BQFCYCMXSA-N 0.000 description 1
- AHHJARQXFFGOKF-NRPADANISA-N Val-Glu-Cys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CS)C(=O)O)N AHHJARQXFFGOKF-NRPADANISA-N 0.000 description 1
- FOADDSDHGRFUOC-DZKIICNBSA-N Val-Glu-Phe Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N FOADDSDHGRFUOC-DZKIICNBSA-N 0.000 description 1
- OXGVAUFVTOPFFA-XPUUQOCRSA-N Val-Gly-Cys Chemical compound CC(C)[C@@H](C(=O)NCC(=O)N[C@@H](CS)C(=O)O)N OXGVAUFVTOPFFA-XPUUQOCRSA-N 0.000 description 1
- YTPLVNUZZOBFFC-SCZZXKLOSA-N Val-Gly-Pro Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N1CCC[C@@H]1C(O)=O YTPLVNUZZOBFFC-SCZZXKLOSA-N 0.000 description 1
- XXROXFHCMVXETG-UWVGGRQHSA-N Val-Gly-Val Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O XXROXFHCMVXETG-UWVGGRQHSA-N 0.000 description 1
- PTFPUAXGIKTVNN-ONGXEEELSA-N Val-His-Gly Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)NCC(=O)O)N PTFPUAXGIKTVNN-ONGXEEELSA-N 0.000 description 1
- VHRLUTIMTDOVCG-PEDHHIEDSA-N Val-Ile-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)NC(=O)[C@H](C(C)C)N VHRLUTIMTDOVCG-PEDHHIEDSA-N 0.000 description 1
- BMOFUVHDBROBSE-DCAQKATOSA-N Val-Leu-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](C(C)C)N BMOFUVHDBROBSE-DCAQKATOSA-N 0.000 description 1
- UMPVMAYCLYMYGA-ONGXEEELSA-N Val-Leu-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O UMPVMAYCLYMYGA-ONGXEEELSA-N 0.000 description 1
- XXWBHOWRARMUOC-NHCYSSNCSA-N Val-Lys-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(=O)N)C(=O)O)N XXWBHOWRARMUOC-NHCYSSNCSA-N 0.000 description 1
- XPKCFQZDQGVJCX-RHYQMDGZSA-N Val-Lys-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](C(C)C)N)O XPKCFQZDQGVJCX-RHYQMDGZSA-N 0.000 description 1
- VCIYTVOBLZHFSC-XHSDSOJGSA-N Val-Phe-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N2CCC[C@@H]2C(=O)O)N VCIYTVOBLZHFSC-XHSDSOJGSA-N 0.000 description 1
- GQMNEJMFMCJJTD-NHCYSSNCSA-N Val-Pro-Gln Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O GQMNEJMFMCJJTD-NHCYSSNCSA-N 0.000 description 1
- QIVPZSWBBHRNBA-JYJNAYRXSA-N Val-Pro-Phe Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](Cc1ccccc1)C(O)=O QIVPZSWBBHRNBA-JYJNAYRXSA-N 0.000 description 1
- AJNUKMZFHXUBMK-GUBZILKMSA-N Val-Ser-Arg Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N AJNUKMZFHXUBMK-GUBZILKMSA-N 0.000 description 1
- PZTZYZUTCPZWJH-FXQIFTODSA-N Val-Ser-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)O)N PZTZYZUTCPZWJH-FXQIFTODSA-N 0.000 description 1
- NZYNRRGJJVSSTJ-GUBZILKMSA-N Val-Ser-Val Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O NZYNRRGJJVSSTJ-GUBZILKMSA-N 0.000 description 1
- UVHFONIHVHLDDQ-IFFSRLJSSA-N Val-Thr-Glu Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N)O UVHFONIHVHLDDQ-IFFSRLJSSA-N 0.000 description 1
- LCHZBEUVGAVMKS-RHYQMDGZSA-N Val-Thr-Leu Chemical compound CC(C)C[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)C(C)C)[C@@H](C)O)C(O)=O LCHZBEUVGAVMKS-RHYQMDGZSA-N 0.000 description 1
- PDDJTOSAVNRJRH-UNQGMJICSA-N Val-Thr-Phe Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](C(C)C)N)O PDDJTOSAVNRJRH-UNQGMJICSA-N 0.000 description 1
- DVLWZWNAQUBZBC-ZNSHCXBVSA-N Val-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](C(C)C)N)O DVLWZWNAQUBZBC-ZNSHCXBVSA-N 0.000 description 1
- HTONZBWRYUKUKC-RCWTZXSCSA-N Val-Thr-Val Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O HTONZBWRYUKUKC-RCWTZXSCSA-N 0.000 description 1
- IRAUYEAFPFPVND-UVBJJODRSA-N Val-Trp-Ala Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](N)C(C)C)C(=O)N[C@@H](C)C(O)=O)=CNC2=C1 IRAUYEAFPFPVND-UVBJJODRSA-N 0.000 description 1
- HVRRJRMULCPNRO-BZSNNMDCSA-N Val-Trp-Arg Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](N)C(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O)=CNC2=C1 HVRRJRMULCPNRO-BZSNNMDCSA-N 0.000 description 1
- SUGRIIAOLCDLBD-ZOBUZTSGSA-N Val-Trp-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CC(=O)O)C(=O)O)N SUGRIIAOLCDLBD-ZOBUZTSGSA-N 0.000 description 1
- QTXGUIMEHKCPBH-FHWLQOOXSA-N Val-Trp-Lys Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](N)C(C)C)C(=O)N[C@@H](CCCCN)C(O)=O)=CNC2=C1 QTXGUIMEHKCPBH-FHWLQOOXSA-N 0.000 description 1
- AYHNXCJKBLYVOA-KSZLIROESA-N Val-Trp-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N3CCC[C@@H]3C(=O)O)N AYHNXCJKBLYVOA-KSZLIROESA-N 0.000 description 1
- OWFGFHQMSBTKLX-UFYCRDLUSA-N Val-Tyr-Tyr Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O)N OWFGFHQMSBTKLX-UFYCRDLUSA-N 0.000 description 1
- JVGDAEKKZKKZFO-RCWTZXSCSA-N Val-Val-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](C(C)C)N)O JVGDAEKKZKKZFO-RCWTZXSCSA-N 0.000 description 1
- 108700005077 Viral Genes Proteins 0.000 description 1
- 108020000999 Viral RNA Proteins 0.000 description 1
- 108010087302 Viral Structural Proteins Proteins 0.000 description 1
- 238000009557 abdominal ultrasonography Methods 0.000 description 1
- 229960000583 acetic acid Drugs 0.000 description 1
- HGEVZDLYZYVYHD-UHFFFAOYSA-N acetic acid;2-amino-2-(hydroxymethyl)propane-1,3-diol;2-[2-[bis(carboxymethyl)amino]ethyl-(carboxymethyl)amino]acetic acid Chemical compound CC(O)=O.OCC(N)(CO)CO.OC(=O)CN(CC(O)=O)CCN(CC(O)=O)CC(O)=O HGEVZDLYZYVYHD-UHFFFAOYSA-N 0.000 description 1
- 230000003044 adaptive effect Effects 0.000 description 1
- 239000008272 agar Substances 0.000 description 1
- 238000010171 animal model Methods 0.000 description 1
- 108010036533 arginylvaline Proteins 0.000 description 1
- 108010077245 asparaginyl-proline Proteins 0.000 description 1
- 108010040443 aspartyl-aspartic acid Proteins 0.000 description 1
- 238000003556 assay Methods 0.000 description 1
- 230000015572 biosynthetic process Effects 0.000 description 1
- 239000002981 blocking agent Substances 0.000 description 1
- 230000000903 blocking effect Effects 0.000 description 1
- 210000001772 blood platelet Anatomy 0.000 description 1
- 210000004556 brain Anatomy 0.000 description 1
- 210000000234 capsid Anatomy 0.000 description 1
- 239000001569 carbon dioxide Substances 0.000 description 1
- 229910002092 carbon dioxide Inorganic materials 0.000 description 1
- 239000013599 cloning vector Substances 0.000 description 1
- 238000002648 combination therapy Methods 0.000 description 1
- 239000000306 component Substances 0.000 description 1
- 238000001816 cooling Methods 0.000 description 1
- 108010004073 cysteinylcysteine Proteins 0.000 description 1
- RGWHQCVHVJXOKC-SHYZEUOFSA-J dCTP(4-) Chemical compound O=C1N=C(N)C=CN1[C@@H]1O[C@H](COP([O-])(=O)OP([O-])(=O)OP([O-])([O-])=O)[C@@H](O)C1 RGWHQCVHVJXOKC-SHYZEUOFSA-J 0.000 description 1
- 239000000032 diagnostic agent Substances 0.000 description 1
- 229940039227 diagnostic agent Drugs 0.000 description 1
- ZFTFAPZRGNKQPU-UHFFFAOYSA-N dicarbonic acid Chemical compound OC(=O)OC(O)=O ZFTFAPZRGNKQPU-UHFFFAOYSA-N 0.000 description 1
- 125000004177 diethyl group Chemical group [H]C([H])([H])C([H])([H])* 0.000 description 1
- 239000003085 diluting agent Substances 0.000 description 1
- 201000010099 disease Diseases 0.000 description 1
- 231100000676 disease causative agent Toxicity 0.000 description 1
- 208000037265 diseases, disorders, signs and symptoms Diseases 0.000 description 1
- 238000007877 drug screening Methods 0.000 description 1
- 239000012149 elution buffer Substances 0.000 description 1
- 238000005516 engineering process Methods 0.000 description 1
- 238000011841 epidemiological investigation Methods 0.000 description 1
- ZMMJGEGLRURXTF-UHFFFAOYSA-N ethidium bromide Chemical compound [Br-].C12=CC(N)=CC=C2C2=CC=C(N)C=C2[N+](CC)=C1C1=CC=CC=C1 ZMMJGEGLRURXTF-UHFFFAOYSA-N 0.000 description 1
- 229960005542 ethidium bromide Drugs 0.000 description 1
- 230000005713 exacerbation Effects 0.000 description 1
- 230000004761 fibrosis Effects 0.000 description 1
- 239000012362 glacial acetic acid Substances 0.000 description 1
- 108010085059 glutamyl-arginyl-proline Proteins 0.000 description 1
- 108010073628 glutamyl-valyl-phenylalanine Proteins 0.000 description 1
- 108010079547 glutamylmethionine Proteins 0.000 description 1
- STKYPAFSDFAEPH-LURJTMIESA-N glycylvaline Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)CN STKYPAFSDFAEPH-LURJTMIESA-N 0.000 description 1
- 108010028295 histidylhistidine Proteins 0.000 description 1
- 108010092114 histidylphenylalanine Proteins 0.000 description 1
- 229920002674 hyaluronan Polymers 0.000 description 1
- 229960003160 hyaluronic acid Drugs 0.000 description 1
- 238000011534 incubation Methods 0.000 description 1
- 108010027338 isoleucylcysteine Proteins 0.000 description 1
- BPHPUYQFMNQIOC-NXRLNHOXSA-N isopropyl beta-D-thiogalactopyranoside Chemical compound CC(C)S[C@@H]1O[C@H](CO)[C@H](O)[C@H](O)[C@H]1O BPHPUYQFMNQIOC-NXRLNHOXSA-N 0.000 description 1
- 210000003734 kidney Anatomy 0.000 description 1
- 239000007788 liquid Substances 0.000 description 1
- 208000019423 liver disease Diseases 0.000 description 1
- 239000012160 loading buffer Substances 0.000 description 1
- 230000007774 longterm Effects 0.000 description 1
- 108010038320 lysylphenylalanine Proteins 0.000 description 1
- 230000001404 mediated effect Effects 0.000 description 1
- 108020004999 messenger RNA Proteins 0.000 description 1
- 229910052751 metal Inorganic materials 0.000 description 1
- 239000002184 metal Substances 0.000 description 1
- 108010085203 methionylmethionine Proteins 0.000 description 1
- 108010034507 methionyltryptophan Proteins 0.000 description 1
- 239000008267 milk Substances 0.000 description 1
- 235000013336 milk Nutrition 0.000 description 1
- 210000004080 milk Anatomy 0.000 description 1
- 239000011259 mixed solution Substances 0.000 description 1
- 238000012986 modification Methods 0.000 description 1
- 230000004048 modification Effects 0.000 description 1
- 238000012544 monitoring process Methods 0.000 description 1
- 238000002703 mutagenesis Methods 0.000 description 1
- 231100000350 mutagenesis Toxicity 0.000 description 1
- 230000008506 pathogenesis Effects 0.000 description 1
- 239000013610 patient sample Substances 0.000 description 1
- 230000035515 penetration Effects 0.000 description 1
- 229940049954 penicillin Drugs 0.000 description 1
- 125000001151 peptidyl group Chemical group 0.000 description 1
- 235000020030 perry Nutrition 0.000 description 1
- 239000010452 phosphate Substances 0.000 description 1
- 229920002401 polyacrylamide Polymers 0.000 description 1
- 239000000256 polyoxyethylene sorbitan monolaurate Substances 0.000 description 1
- 235000010486 polyoxyethylene sorbitan monolaurate Nutrition 0.000 description 1
- 229920002981 polyvinylidene fluoride Polymers 0.000 description 1
- 238000002360 preparation method Methods 0.000 description 1
- 230000008569 process Effects 0.000 description 1
- 238000004393 prognosis Methods 0.000 description 1
- 230000002035 prolonged effect Effects 0.000 description 1
- 108010020432 prolyl-prolylisoleucine Proteins 0.000 description 1
- 230000000644 propagated effect Effects 0.000 description 1
- 238000004445 quantitative analysis Methods 0.000 description 1
- 238000003753 real-time PCR Methods 0.000 description 1
- 230000006798 recombination Effects 0.000 description 1
- 239000012925 reference material Substances 0.000 description 1
- 238000011160 research Methods 0.000 description 1
- 230000002441 reversible effect Effects 0.000 description 1
- 229960000329 ribavirin Drugs 0.000 description 1
- HZCAHMRRMINHDJ-DBRKOABJSA-N ribavirin Natural products O[C@@H]1[C@H](O)[C@@H](CO)O[C@H]1N1N=CN=C1 HZCAHMRRMINHDJ-DBRKOABJSA-N 0.000 description 1
- 238000012163 sequencing technique Methods 0.000 description 1
- 238000011895 specific detection Methods 0.000 description 1
- 210000000130 stem cell Anatomy 0.000 description 1
- 229960005322 streptomycin Drugs 0.000 description 1
- 239000000126 substance Substances 0.000 description 1
- 239000000758 substrate Substances 0.000 description 1
- 238000001356 surgical procedure Methods 0.000 description 1
- 238000003786 synthesis reaction Methods 0.000 description 1
- 108091035539 telomere Proteins 0.000 description 1
- 102000055501 telomere Human genes 0.000 description 1
- 238000002560 therapeutic procedure Methods 0.000 description 1
- 238000012546 transfer Methods 0.000 description 1
- 108010035534 tyrosyl-leucyl-alanine Proteins 0.000 description 1
- 238000011144 upstream manufacturing Methods 0.000 description 1
- 230000029812 viral genome replication Effects 0.000 description 1
- 201000001862 viral hepatitis Diseases 0.000 description 1
Images
Classifications
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/11—DNA or RNA fragments; Modified forms thereof; Non-coding nucleic acids having a biological activity
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K14/00—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
- C07K14/005—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from viruses
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K16/00—Immunoglobulins [IGs], e.g. monoclonal or polyclonal antibodies
- C07K16/08—Immunoglobulins [IGs], e.g. monoclonal or polyclonal antibodies against material from viruses
- C07K16/10—Immunoglobulins [IGs], e.g. monoclonal or polyclonal antibodies against material from viruses from RNA viruses
- C07K16/1081—Togaviridae, e.g. flavivirus, rubella virus, hog cholera virus
- C07K16/109—Hepatitis C virus; Hepatitis G virus
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/63—Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
- C12N15/70—Vectors or expression systems specially adapted for E. coli
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N2770/00—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA ssRNA viruses positive-sense
- C12N2770/00011—Details
- C12N2770/24011—Flaviviridae
- C12N2770/24211—Hepacivirus, e.g. hepatitis C virus, hepatitis G virus
- C12N2770/24222—New viral proteins or individual genes, new structural or functional aspects of known viral proteins or genes
Landscapes
- Health & Medical Sciences (AREA)
- Life Sciences & Earth Sciences (AREA)
- Chemical & Material Sciences (AREA)
- Genetics & Genomics (AREA)
- Organic Chemistry (AREA)
- Engineering & Computer Science (AREA)
- Molecular Biology (AREA)
- Biomedical Technology (AREA)
- Biochemistry (AREA)
- Biophysics (AREA)
- General Health & Medical Sciences (AREA)
- Bioinformatics & Cheminformatics (AREA)
- General Engineering & Computer Science (AREA)
- Zoology (AREA)
- Wood Science & Technology (AREA)
- Virology (AREA)
- Biotechnology (AREA)
- Medicinal Chemistry (AREA)
- Proteomics, Peptides & Aminoacids (AREA)
- Plant Pathology (AREA)
- Microbiology (AREA)
- Physics & Mathematics (AREA)
- Immunology (AREA)
- Communicable Diseases (AREA)
- Gastroenterology & Hepatology (AREA)
- Measuring Or Testing Involving Enzymes Or Micro-Organisms (AREA)
- Micro-Organisms Or Cultivation Processes Thereof (AREA)
- Peptides Or Proteins (AREA)
- Medicines That Contain Protein Lipid Enzymes And Other Medicines (AREA)
Abstract
C형 간염 바이러스의 코어 단백으로부터 NS2 단백을 코드하는 유전자의 일부의 영역을 번역틀을 유지하면서 결실된 트런케이트 폼 C형 간염 바이러스 유전자.
특히, 상기 유전자의 일부의 영역이 적어도 E1 단백 및 E2 단백을 코드하는 영역에 존재하는 유전자.
C형 간염 바이러스 유전자
Description
본 발명은 C형 만성 간염 바이러스 (이하,「HCV」라고 약기한다)의 게놈에 관한 것이다.
HCV는 C형 만성 간염의 원인 인자로서, WHO의 통계에 의하면 세계에서 1.7억명의 감염자가 있는 것으로 추정되고 있다. HCV는 다른 바이러스성 간염과 달리, 감염 초기에는 비교적 경미한 증상을 일으킬 뿐이지만, 감염자는 높은 빈도로 만성화되고, 일정 기간의 무증후기를 거친 후, 만성 간염이 발병한다. 또한, 감염이 장기화함에 따라, 간경변으로 병상이 점차 악화되고, 높은 빈도로 간암에 이른다. 간암의 95%는 간염 바이러스와 관여되어 있고, 그 대부분 (80%)이 HCV의 감염에 의한 것으로 여겨지고 있다.
HCV는 혈액, 혈액 성분이나, 저빈도이지만 체액 성분을 매개하여 감염된다. HCV의 검사법이 헌혈시의 스크리닝에 도입됨으로써, 선진국에서는 신규한 수혈성 C형 간염의 발증은 거의 없어졌다. 또한, 의료 기술의 진보에 따라서, 의료 과실에 의한 전파도 억제되어 있고, 일본에서는 신규 환자의 발생은 거의 억제되어 있다. 그러나, 역학 조사에서 일본에 있어서의 HCV 캐리어는 170만명 이상으로 추정되고 있는데, 그 대부분은 40대 이상으로, 감염이 장기화하고 있는 것으로 나타나고 있 다. 따라서 향후 간암의 증례 수의 증가가 크게 염려된다.
간암의 위험은 간 섬유화 상태와 크게 관련이 있으며, 섬유화가 진행될수록 간암의 발생율이 높아진다. 간 섬유화 상태는 Type IV 콜라겐, 히알론산의 혈중 농도의 측정, 혈중 혈소판수, 화상 진단 (복부 초음파 검사) 등을 조합함으로써 일반적으로 실시되고 있으나, 확정 진단을 위하여는 간생검(肝生檢)을 한다. 그러나, 간생검은 환자에게 다대한 부담을 강요하는 것이 큰 문제이어서, 보다 환자 본위의 진단 방법이 요구되고 있다.
HCV는 약 9,600 염기의 플러스 사슬 RNA를 게놈으로 하는 프라비바이러스과, 프라비바이러스속으로 분류되는 바이러스이며, 혈액이나 혈액 성분을 매개하여 감염되고, 간장에서 증식하는 것으로 생각되고 있다. 유전자 배열의 해석 결과, 적어도 6 종류의 유전자형이 존재하고 있는 것으로 추정되고 있다. 약 9,600 염기의 게놈은 감염 후, 숙주 세포 내에서 mRNA로서 기능하여, 약 3,000 아미노산 길이의 하나의 폴리프로테인이 합성되고, 숙주인 시그널 펩티다제, 시그널 펩티딜펩티다제 및 HCV 게놈이 코드하는 프로테아제에 의하여 절단된다. 그 결과, 코어 (core), E1, E2, p7, NS2, NS3, NS4A, NS4B , NS5A, NS5B의 10 종류의 단백질이 산생된다. 이 번역틀 (오픈 리딩 프레임) 이외에, 5' 말단, 3' 말단에 비번역 영역 (UTR)이 존재하여, 번역 조절, 게놈의 복제 조절 기능을 담당하고 있다.
이 중에서, 코어, E1 및 E2는 바이러스를 구성하는 구조 단백질로서, 바이러스 게놈은 코어 단백질에 의하여 패키지되어 캡시드를 형성하고, 지질 이중막에 앵커된 E1, E2 단백질에 의하여 둘러싸여 바이러스 입자 (빌리언)를 형성하는 것으로 생각되고 있다. p7에 대하여는 그 기능은 분명하지 않지만, 바이러스의 증식에 필수인 것이 보고되어 있다. NS2는 메탈프로테아제이며, 그 자신의 절단에 필요하지만, 그 이외의 기능에 대하여는 알려져 있지 않다. NS3 내지 NS5B는 복합체를 형성하고, 숙주 단백질과 함께, RNA 복제 장치를 형성하고, 게놈 RNA의 복제를 실시하는 것으로 생각되고 있다.
HCV의 감염을 진단하는 방법으로는 환자 생체 성분 (혈액, 혈청, 혈전 등)에 포함되는 HCV가 만들어 내는 단백질에 대한 항체를 검출하는 방법이 널리 이용되고 있으나, 항체의 유무만으로는 HCV가 활동하고 있는지 여부를 판별할 수 없기 때문에, 혈중 바이러스량을 측정하는 것이 진단에는 중요하다. 혈중 바이러스의 검출 또는 측정에는 HCV의 바이러스 게놈을 검출 또는 측정하는 방법, 또는 HCV가 산생하는 코어 단백질을 측정 또는 검출하는 방법이 이용된다.
C형 만성 간염의 치료에는 인터페론이 널리 사용되고 있다. 최근 약제의 개량이나, 리바비린과의 병용 요법 등 투여 방법의 개선 등에 의하여, HCV가 체내로부터 구제되어 완치하는 비율도 서서히 늘고 있다. 그러나, 아직도 완치율은 5할 정도이다. 또한, 인터페론 투여는 중대한 부작용을 일으키는 경우가 있고, 고령자 등에게는 사용할 수할 수 없는 증례가 많이 있어서, 더 효과적인 치료법, 약제의 개발이 요구되고 있다.
인터페론 치료와 HCV와의 관련성에 관하여는, 혈중 바이러스량이 많을수록 인터페론 치료에 대한 저항성이 높은 경향이 인정되고 있다. 또한, 유전자형에 따라서 인터페론에의 감수성이 다른 경우가 있고, 특히 유전자형 2는 비교적 치료에 높은 감수성을 나타낸다. 그러나, 혈중 바이러스량과 간염의 악화도와의 사이, 그리고 유전자형과 간염의 경중(severity) 사이에 명확한 관련은 없다. 즉, 간 질환의 경중을 나타내는 바이러스 마커는 없다.
HCV는 사람에게는 혈액 또는 혈액 성분을 매개하여 감염된다. 사람 이외의 생물에서는 유인원 (침팬지)에 감염되고, 감염에 의하여 간염을 발병시키고, 만성 간염에 이르는 경우도 있다. 다른 것보다 사육이 용이한 실험동물에서, HCV에 높은 비율로 감염되는 것은 알려져 있지 않다.
한편, HCV는 생체 외(in vitro)에서 사람이나 원숭이 유래의 세포에 감염시키고, 증식시키는 것이 보고되어 있다. 그러나, 감염 효율, 증식 효율은 모두 낮다. 그 때문에 시험관 내에서 HCV를 감염, 증식시키는 것은 매우 곤란하다.
최근, 생체 외에서 합성한 HCV 게놈의 일부 (서브게노믹 RNA)와 약제 내성 마커를 가지는 RNA를, 사람 간암 유래의 수립 세포에 도입하고, 약제 내성을 지표로 세포를 선택함으로써, 매우 저빈도이지만, 세포 내에서 자율적으로 RNA가 복제ㅎ하고 있는 세포를 단리할 수 있다는 것을 알게 되었다. 많은 연구실에서 재현하고, 비교적 용이하게 유지를 할 수 있기 때문에, 연구에 널리 이용되게 되었다. 자주 이용되는 레프리콘은 HCV의 구조 단백질 부분을 인공적으로 결실시킨 것으로, 비구조 영역이 만들어 내는 단백질로 이루어지는 복제 장치가 HCV 본래의 플러스 사슬 RNA의 세포 내에서의 복제에 필요 충분한 정보를 가지는 것을 시사하고 있다.
HCV가 간장에서 증식하고 있는 것은, 간장에 포함되는 HCV가 산생하는 단백질을 검출하는 것이나, 간장에 포함되는 RNA를 검출함으로써 나타나고 있다. 또한, 인터페론 치료에 의하여 HCV를 일시적, 또는 항구적으로 구제하면, 그 후의 간염의 증상이 완화되는 것으로부터도, 간장에서의 병태의 발현에 HCV가 관여하고 있는 것은 분명하다. 그러나, HCV가 간장 중에서 어떠한 양태를 취하고 있는 지에 대하여는 해명되어 있지 않은 점이 많다.
그 때문에 HCV의 감염에 의하여 간염이 발병하고, 장기 만성화에 의하여 병상이 점차 악화되어, 최종적으로 간암에 이르는 HCV의 병태 발현 및 진전의 메카니즘에 대하여 해명되어 있지 않다. HCV의 각 단백질을 유전자 재조합에 의하여 배양 세포 중에서 발현시키고, 발현 세포 상태를 해석함으로써, 병태 발현·악화의 메카니즘에 대하여 가설을 세우는 연구는 활발하게 진행되고 있으나, 그것을 증명하기 위한 적절한 HCV 감염·증식 모델이 존재하지 않기 때문에, 가설을 검증되어 있지 않다.
특허 문헌 1: 일본 공개 특허 공보 2001-17187
비특허 문헌 1: Science 제277권, p570-, 1997
비특허 문헌 2: Journal of Virology, 제76권, p4008-4021, 2002
비특허 문헌 3: Science 제285권, p110-, 1999
비특허 문헌 4: Science 제290권, p1972-, 2000
비특허 문헌 5: Hepatology, 제29권, p223-229, 1999
비특허 문헌 6: Journal of Viro1ogy, 제77권, p2134-2146, 2003
비특허 문헌 7: Res. Virol., 제144권, p275-279, 1993
비특허 문헌 8: Journal of Virology, 제75권, p4614-4624, 2001
비특허 문헌 9: PNAS, 제29권, 14416-14421, 2002
비특허 문헌 10: Current Opinion in Infectious Disease, 제14권,
743-747
비특허 문헌 11: Jounal of Vira1 Hepatitis, 제6권, p35-47, 1999
비특허 문헌 12: Clinica1 Chemistry, 제43권, p1507-1522, 1 997
비특허 문헌 13: Journal of General Virology, 제81권, p1631-1648, 2000
HCV는 간장 중에서 복제하고, 증식하는 것으로 생각되고 있지만, 간장 중에서의 양태에 대한 해석이 이루어지지 않았다. 그 때문에 간장 중에서 어떻게 HCV가 복제하고 있는 지에 대한 정보는 거의 없고, HCV에 대한 치료약의 표적으로서 어느 분자가 적절한 것인 지에 대한 정보도 없다. 따라서, HCV의 치료법은 시행착오를 ㄱ겪지 않을 수 없다. 치료약을 효율적으로 개발하려면 환자 간장에서 활발하게 복제되고 있는 HCV 게놈 RNA를 특정할 필요가 있다. 이에, 본 발명은 간장에서 복제, 증식되고 있는 HCV 게놈 RNA 관한 것으로, 치료약의 적절한 표적이 되는 HCV 게놈 RNA를 제공하는 것을 목적으로 한다.
HCV의 증식에 의하여 간염이 중증화하고 있는 데도 불구하고, 간염 상태를 나타내는 바이러스 마커가 존재하지 않는다. 간염 상태를 나타내는 적절한 바이러스 마커가 요구되고 있다. 본 발명은 간염의 증상, 상태를 나타내는 바이러스 마커를 제공하는 것을 목적으로 한다.
HCV는 취급이 용이한 감염 동물 및 높은 비율로 감염되고, 고효율로 복제되는 생체 외 배양계가 존재하지 않기 때문에, 효율적인 약제의 스크리닝을 실시하는 것이 곤란하다. 이들은 HCV 특이적인 치료약의 개발을 곤란하게 만드는 원인의 하나이다.
예를 들면, 널리 이용되고 있는 인터페론 치료는 환자를 피검체로 하여 직접 치료법이 개발, 개량되어 오고 있어서 환자에게 큰 부담을 강요해왔다. 그 때문에, 환자의 입장을 고려한 의약품의 개발 방법이 필요하다. 이 문제를 해결하기 위하여 유효한 치료약과 치료법의 개발에 널리 이용할 수 있는 HCV 감염, 증식 모델이 필요하다. 본 발명의 목적의 하나는 치료약, 치료법의 개발, 스크리닝 등에 이용하는 HCV 감염, 증식 모델을 제공하는 것이다.
이와 같은 증식 모델로서 서브게노믹 레프리콘을 사용한 계(系)가 제공되었다. 즉, HCV의 서브 게놈과 적당한 약제 내성 마커를 코드하는 적절한 구조를 가지는 RNA를 사람 간암 유래 세포에 도입하고, 약제 내성으로 선택함으로써, 세포 내에서 자율적으로 증식하는 HCV 서브게노믹 RNA 레프리콘을 얻을 수 있다. 이 레프리콘과, 레프리콘을 유지하고 있는 세포를 사용함으로써, HCV에 대한 약제의 스크리닝이 시작되었다. 그러나, 이 방법에는 큰 문제가 있다.
그 하나는 사용하고 있는 HCV의 게놈은 본래의 HCV 게놈 RNA와는 다른 구조로 이루어져 있다는 점이다. 즉, 서브 게놈은 HCV 게놈의 비구조 단백질로 이루어지는 정보를 가질 뿐이다. 이 점을 개량하기 위하여, HCV 단백질의 모든 번역 영역과 약제 내성 마커로 이루어지는 레프리콘이 수립되었다. 이 레프리콘을 유지하고 있는 세포에서는 HCV 게놈에 코드되어 있는 모든 단백질이 산생될 것이고, HCV 입자가 세포 외로 방출되는 것이 기대되었다. 그러나 세포 외로 HCV 입자는 방출되지 않았다. 즉, 이와 같은 개량이 이루어졌다고 하여도 HCV의 증식계로서는 불완전한 것이었다.
또한, 상기 비구조 영역으로 이루어지는 서브 게놈의 레프리콘은 HCV의 5' 비번역 영역의 IRES (intenal ribosome entry site)와 코어의 일부 유전자의 하류에 네오마이신 내성 유전자를 가지고, 그 하류에 뇌김근염 바이러스(encephalomyocarditis virus (EMCV))의 IRES (intenal ribosome entry site)와 HCV의 비구조 단백을 코드하는 영역과 3' 비번역 영역을 가지는 구조이다. 또한, 개량된 HCV의 모든 번역 영역을 가지는 레프리콘도 서브 게놈의 레프리콘의 비구조 단백을 코드하는 유전자 영역을 HCV의 모든 번역 영역을 코드하는 영역과 교체한 것으로 것으로, 기본적인 구조는 같다.
이들 레프리콘은 본래의 HCV의 구조와 달리, HCV의 IRES와 EMCV의 IRES의 2개의 IRES를 가지고 있는데, 생체 내에 있어서의 HCV의 게놈의 구조와는 다른 것이다. 이 구조의 차이가, 세포외로의 HCV의 입자의 방출이 없는 원인의 하나일 가능성도 생각할 수 있다. 또한, 이와 같은 구조의 레프리콘은 생체 내에서는 복제되고 있는 바이러스 게놈과 다른 기구로 복제되고 있는 것도 생각할 수 있다.
이들로부터, HCV의 레프리콘 시스템은 본래의 생체 내의 HCV 게놈 구조를 가지는 RNA를 이용한 레프리콘 시스템인 것이 좋다. 본 발명은 간장에서 실제로 복제하고 있는 배열 및 구조로 이루어지는 레프리콘을 제공하는 것을 목적으로 한다.
더 큰 문제는 적응 변이라고 불리는 서브게노믹 레프리콘의 실험계 특유의 변이의 존재이다. 생체 외에서 기능하는 HCV 레프리콘을 회수하여 해석하면, NS3, NS5A 등에 본래 존재하지 않았던 복수의 변이가 발생하고 있다. 이 변이는 높은 효율로 복제하는 데 있어서 중요하고, 복제의 효율을 좌우한다. 그러나, 복제에 적절한 변이는 HCV의 증식에 있어서 발생하여서는 아니되는 변이인 것이 알려져 있다. 생체 외에서 합성한 HCV 게놈을 직접 침팬지의 간장에 접종함으로써 HCV에 감염시킨다.
이 생체 외에서 합성한 HCV RNA에, 서브게노믹 레프리콘으로 중요한 변이를 도입하면, 원래의 배열이 가지고 있던 침팬지에의 감염성이 소실되었다. 또한, 침팬지에 감염성을 나타내는 배열을 가지는 시험관 내의 세포에서 복제, 증폭하는 배열도 없다. 따라서, 생체 외에서 복제하게 된 배열은 생체 내에서 복제, 증식하는 기능을 잃었기 때문에, 본래의 HCV가 가지는 RNA 복제, 증식능을 유지하고 있지 않다. 생체에서도, 생체 외에서도 복제, 증식하는 배열일 필요가 있다. 즉, 본 발명의 제공하는 HCV 게놈 RNA 배열을 가지는 레프리콘은 생체 외에서도 시험관 내에서도 복제하는 것이다.
본 발명자들은 환자 간장 중의 HCV 게놈 RNA에 대한 cDNA를 단리하여, 그 전체 구조를 결정하였다. 결정한 배열을 해석하였더니, 단리된 cDNA는 지금까지 보고된 HCV 게놈 RNA의 구조와는 완전히 다른 배열을 가지는 것이 판명되었다.
이 새로운 구조의 게놈 RNA는 이미 보고되고 있는 HCV 게놈 RNA의 구조 단백질 영역이 일부, 또는 전부 결실되어 있는 것이 특징이며, 또한 결실된 부분의 전후의 배열은 원래의 HCV 게놈의 번역틀 (reading frame)을 유지한 채로 결합하고 있고, 이 HCV 게놈은 신규한 하나의 폴리펩티드를 코드하고 있는 것으로 판명되었다. 하나의 폴리펩티드는 HCV 본래의 구조 단백질의 일부와 비구조 단백질의 전부를 발현할 수 있다.
이 게놈에 코드되어 있는 HCV 폴리프로테인은 세포내 시그널 펩티다제에 의한 절단 부위, 자신의 프로테아제에 의한 절단 배열을 유지하고 있어, 프로세싱을 받는 것으로 추정된다. 사실, 배열 번호 1에 나타내는 신규한 HCV 게놈 cDNA를 포유류 세포에서 발현시켜, 산물(産物)을 해석하였더니, 코어 단백질은 정상적으로 프로세스되고, E1와 NS2 단백질은 융합 단백질로서 발현하고, NS3와는 프로세스되며, NS3는 본래의 크기의 분자로 프로세스되었다.
이 HCV 게놈 RNA를 트런케이트 폼 (TF)이라고 부른다. 이에 대하여, 이미 보고되어 있는 구조 유전자를 모두 포함하는 HCV RNA를 풀 렝쓰 폼(full length form) (FLF)이라고 부른다. 복수의 만성 C형 간염 환자의 간생검, 혈청을 해석하였다. 복수의 환자로부터 공통되는 특징을 가진 TF HCV 게놈이 검출되었다. 그 특징은 구조 단백질의 배열의 일부 또는 전부가 결실되어 있지만, NS2의 후반 이후의 배열을 유지하고 있고, 남아 있는 배열은 FLF로 추정되는 번역틀을 유지하는 형태로 발현할 수 있는 형태로 결실되어 있다.
즉, 본원 발명은
(1) C형 간염 바이러스 유전자에 있어서, 구조 단백 코드 영역의 일부분을 유지하고, 1개의 번역틀을 유지하면서, 코어 단백으로부터 NS2 단백을 코드하는 영역의 일부분이 결실되어 있는 트런케이트 폼 C형 간염 바이러스 유전자,
(2) C형 간염 바이러스 유전자에 있어서, 구조 단백 코드 영역의 일부분을 유지하고, 1개의 번역틀을 유지하면서, 적어도 E1 단백과 E2 단백의 이음부의 아미노산 배열을 코드하는 영역이 결실되어 있는 (1)의 트런케이트 폼 C형 간염 바이러스 유전자,
(3) C형 간염 바이러스 유전자에 있어서, E1 단백 코드 영역의 일부분, E2 단백 코드 영역, P7 단백 코드 영역 및 NS2 단백 코드 영역의 일부분이 번역틀을 유지하면서 결실되어 있는 트런케이트 폼 C형 간염 바이러스 유전자,
(4) 5'UTR 및 3'UTR를 가진, 상기 (1) 내지 (3)의 어느 하나에 기재된 유전자,
(5) 5'UTR, NS3로부터 하류의 단백질 코드 영역 및 3'UTR를 가지는 상기 (1) 내지 (3)의 어느 하나에 기재된 유전자를 제공한다.
C형 간염 바이러스 게놈은 RNA이지만, 본원 발명의 C형 간염 바이러스 유전자는 RNA도, DNA도 모두 이용 가능하다.
바꾸어 말하면, 본 발명은 (a) C형 간염 바이러스 유전자에 있어서, E1 단백 코드 영역의 일부분, E2 단백 코드 영역, P7단백 코드 영역 및 NS2 단백 코드 영역의 일부분이 번역틀을 유지하면서 결실되어 있는 트런케이트 폼 C형 간염 바이러스 유전자를 제공한다.
또한, 5' 비번역 영역으로부터 구조 단백인 코어 단백을 코드하는 영역의 전부 또는 일부 및 비구조 단백인 NS2의 후반 두 부분의 막 관통 영역을 코드하는 영역으로부터 3' 비번역 영역의 전부 또는 일부를 가진 상기 (a)에 기재된 트런케이트 폼 C형 간염 바이러스 유전자를 제공한다.
또한, C형 간염 바이러스 유전자의 핵산 배열의 1번에서 914번의 전부 또는 일부의 배열 및 3001번 이후의 전부 또는 일부의 배열을 가진 상기 (a)에 기재된 트런케이트 폼 C형 간염 바이러스 유전자를 제공한다.
한편, 서브게노믹 레프리콘의 해석으로부터 RNA 복제에 필요한 정보는 NS3 이후의 비구조 단백질의 영역에 코드되어 있는 것으로 나타나고 있다. 따라서, TF 게놈은 RNA 복제에 필요한 비구조 단백질의 정보를 모두 유지하고 있다. 또한, 일부의 간생검 샘플에서는 TF 게놈이 우선적으로 검출되므로, TF 게놈은 간장에서 자율적으로 복제하고 있는 것을 알 수 있다. 추가로, 생체 외에서 TF를 포함하는 RNA를 세포에 도입하면, 세포 내에서 복제한다. 즉, TF HCV 게놈 RNA는 레프리콘으로서 기능한다. 이들 결과로부터, 본 발명에 의하여 제공되는 HCV 게놈 RNA는 간장에서 복제하고, 또한 생체 외에서도 복제하는 것을 알게 되었다.
따라서, 본원 발명은 또한,
(6) 세포 중에서 자율적으로 복제하는 C형 간염 바이러스의 코어 단백으로부터 NS2 단백을 코드하는 유전자의 일부의 영역을 번역틀을 유지한 채로, 결실된 트런케이트 폼 C형 간염 바이러스 유전자 또는 결실된 영역이 적어도 E1 단백 및 E2 단백을 코드하는 영역에 존재하는 트런케이트 폼 C형 간염 바이러스 유전자의 레프리콘 유전자를 제공한다.
본 발명은, 또한,
(7) 상기 (3)의 레프리콘 유전자에 선택 마커 유전자가 결합된 레프리콘 유전자를 제공한다.
또는
(8) 상기 레프리콘 유전자가 복제하는 세포를 제공한다.
TF는 간장에서 FLF보다 유리하게 증식하고, 해당 환자가 중증의 간염 증상을 발현하고 있다. TF의 출현은 간염 증상에 강하게 관계되어 있고, TF의 증식을 억제 또는 저해하고, 구제하는 것은 유효한 치료 방법이 된다. 즉, 효과적으로 C형 만성 간염의 증상을 억제, 완화하는 약제는 TF를 표적으로 하여 개발하는 것이 좋다. 따라서, TF를 이용한 레프리콘을 이용한 의약품의 스크리닝은 간염의 중증화를 억제, 완화 또는 구제하기 위한 의약품 개발에 적절한 것은 분명하다.
따라서, 본원 발명은 또한,
(9) 트런케이트 폼 유전자가 복제하는 세포를 이용한 약제의 스크리닝 방법, 약효 평가 방법 및 약효를 평가함으로써 약제를 제조하는 방법을 제공한다.
(10) 트런케이트 폼 유전자를 넣은 벡터를 유지하여, 그 단백질을 발현하고 있는 세포를 제공한다.
(11) 트런케이트 폼 유전자를 포함하는 레프리콘이 복제하고 있는 세포, 또는 세포가 산생하는 단백질을 이용한 HCV의 진단 방법도 제공한다.
이 TF 게놈은 혈액 중에도 검출할 수 있다. 즉, 이 HCV 게놈 RNA는 간장 중에서 복제되어 혈액 중에 VLP로서 존재한다. 그러므로, TF 게놈의 검출에는 혈액, 또는 혈액 성분을 샘플로서 사용할 수 있다.
TF가 검출되는 환자에게 있어서, TF가 간장 RNA 중에서 우선형인 예가 관찰된다. 이것은 TF가 FLF보다 간장 내에서의 RNA 복제에 적절한 배열인 것을 나타내고 있다. 또한, 다수의 환자의 간생검, 혈청을 분석하여, TF가 간장에서 우선형인 환자는 증증의 간염을 발병하고 있는 예가 많고, 많게는 인터페론 치료의 예후도 불량한 것으로 판명되었다. 이것으로부터 TF형이 간장에서 우선인 경우는 간염이 중증화하고 있고, 또 인터페론 치료 효과도 한정적이다. 즉, TF를 검출, 또는 정량하는 것은 간염 상태를 나타내는 지표가 된다. 이것으로부터 TF를 검출, 정량하는 방법은 간염 상태를 진단하는 방법으로서 적용할 수 있고, 신규한 바이러스 마커로서 이용할 수 있다.
따라서, 본원 발명은, 또한,
(12) 검체 중의 트런케이트 폼 유전자를 검출하는 방법을 제공한다. 또한, 이 방법은 모든 유전자의 결실을 검출하는 방법을 사용하여, C형 간염 바이러스의 트런케이트 폼 유전자를 검출하는 방법을 포함한다.
또한 일례로서 C형 간염 바이러스 유전자의 핵산 배열의 1번에서 914번 및 3001번 이후의 배열로부터 설계한 프라이머를 사용하여 PCR을 실시하고, 트런케이트 폼 유전자를 증폭함으로써 트런케이트 폼 유전자를 검출 또는 정량하는 방법을 제공한다.
(13) 트런케이트 폼 C형 간염 바이러스 유전자 및 풀 렝쓰 폼 C형 간염 바이러스 유전자를 혼합한 검체에 있어서, 그 존재비를 정량하는 방법도 제공한다.
(14) 트런케이트 폼 유전자 및 풀 렝쓰 폼 유전자의 공통 영역의 유전자의 정량과 트런케이트 폼 유전자의 결실된 영역의 유전자의 정량에 의하여 존재비를 정량하는 방법도 제공한다.
또한, 혈중의 TF 게놈은 C형 간염 바이러스 입자 또는 C형 간염 바이러스 유사 입자로서 존재하고 있는 것으로 생각된다. 어떠한 구조체를 형성하고 있지 않으면, TF 게놈은 RNA이기 때문에, 신속하게 혈액 중의 RNase에 의하여 분해되어 검출할 수 없기 때문이다.
따라서, 본원 발명은,
(15) 트런케이트 폼 유전자를 유지하는 C형 간염 바이러스 입자 또는 바이러스 유사 입자를 제공한다.
TF 게놈은 구조 단백을 코드하는 영역이 결실되어 있다. 그 때문에, 결실된 영역에 코드되어 있던 펩티드를 결여한 신규한 HCV 폴리프로테인을 산생할 수 있다. 전술한 바와 같이 배열 번호 1의 HCV 게놈 cDNA를 포유류 세포에서 발현시켰을 경우, E1과 NS2의 융합 단백이 산생되었다.
그 때문에, 본원 발명은, 또한
(16) 트런케이트 폼 유전자로부터 산생되는 C형 간염 바이러스의 폴리프로테인 및 폴리프로테인으로부터 프로세스된 단백을 제공한다. 또한,
(17) 이 단백질을 특이적으로 인식하는 항체를 제공한다.
신규한 HCV 게놈 RNA (TF)를 사용함으로써, 생체 내에서 복제하고 있는 RNA 레프리콘을 생체 외에서 복제하는 것이 가능하게 된다. 이 레프리콘을 사용함으로써, 환자 간장 내에서 우선적으로 복제하고, 간염을 중증화시키는 감염 세포 모델을 구축할 수 있다. 이 모델을 이용함으로써, 간염의 중증화를 억제, 저해하는 의약품의 개발, 스크리닝을 실시할 수 있다.
TF를 검출 또는 정량하는 것은 간염 상태를 파악하는 데 효과적이며, 간염 상태를 나타내는 바이러스 마커로서 유효하다. 또한, 이 마커를 이용한 진단법에 의하여, 환자의 간염 상태, 약제의 효과에 대한 모니터링, 약제에 대한 저항성을 진단할 수 있다.
도 1은 간장 조직의 HCV 게놈 RNA로부터 RT-PCR에 의하여 검출된 TF 게놈의 대표적인 자기 영동상을 나타낸다. 프라이머 세트(1)는 cDNA 합성 프라이머가 3481R, 1st PCR 프라이머가 HClongAl과 3481R, 2nd PCR 프라이머가 85F와 3297R, 프라이머 세트 2는 cDNA 합성 프라이머가 3945R, 1st PCR 프라이머가 831S와 3945R, 2nd PCR 프라이머가 841S와 3759R, 프라이머 세트 3은 cDNA 합성 프라이머가 3945R, 1st PCR 프라이머가 813S와 3174AS, 2nd PCR 프라이머가 841S와 3111AS를 나타낸다. 화살표는 TF 게놈의 PCR 산물을 나타내고 있다.
도 2는 TF 게놈만이 검출된 검체의 TF 게놈과 D89815의 게놈과의 구조의 차이를 나타낸다. D89815의 각 단백을 코드하고 있는 핵산 배열은 코어가 341-914, E1이 915-1454, E2가 1455-2079, P7가 2580-2778, NS2가 2779-3419에 존재한다. 각 검체로부터 취득한 TF 게놈을 5'UTR은 실선으로, 단백을 코드하고 있는 부분은 회색의 막대로 나타내고 있다. 결실 부분은 각각의 막대를 선으로 묶어 나타낸다. 각 검체의 번호는 D89815에 대응하는 핵산의 위치를 나타낸다.
도 3은 TF 게놈과 FLF 게놈의 양쪽 모두가 검출된 검체의, TF 게놈과 D89815 의 게놈과의 구조의 차이를 나타낸다. 도 2와 마찬가지로, 각 검체로부터 취득한 TF 게놈을 5 ' UTR은 실선으로, 단백을 코드하고 있는 부분은 회색의 막대로 나타내고 있다. 결실 부분은 각각의 막대를 선으로 묶어 나타낸다. 각 검체의 번호는 D89815에 대응하는 핵산 배열의 위치를 나타내고 있다.
도 4는 실시예 6에 있어서의, 항원량의 경시적 변화를 나타내는 그래프이다.
도 5는 실시예 7에 대해 얻은 결실을 가진 유전자의 결실 전후의 배열을 나타낸다.
발명을 실시하기
위한 최선의 상태
이하에 본원 발명의 최선의 상태를 설명하지만, 본원 발명은 이에 한정되는 것은 아니다.
본 발명은 C형 간염 바이러스의 신규한 구조의 유전자에 관한 것이다. 정상적인 FLF의 C형 간염 바이러스의 유전자는 5'UTR와 그것에 이어서, 바이러스의 구조 단백질인 코어 단백, E1 단백, E2 단백 및 P7 단백 및 비구조 단백질 NS2 단백, NS3 단백, NS4A 단백, NS4B 단백, NS5A 단백, NS5B 단백을 코드하는 영역 및 3'UTR로 이루어져 있다.
본원 발명의 C형 간염 바이러스 유전자는 구조 단백질인 코어 단백, E1 단백, E2 단백 또는 P7 단백 각각의 전부 또는 일부 및/또는 비구조 단백질인 NS2의 일부를 코드하고 있는 유전자가 결실되어 있는 TF 게놈이다. 이 TF 게놈은 FLF 게놈의 번역틀을 유지한 채로, 인플레임으로 결실이 일어나고 있고, 결실 부분 이외의 영역에서는 정상적인 HCV의 폴리프로테인을 산생할 수 있다. 즉, 본원 발명은 C 형 간염 바이러스의 폴리프로테인을 코드하는 유전자의 일부 영역이 인플레임으로 결실된 C형 간염 바이러스 유전자에 관한 것이며, 그 TF 게놈의 구조에 특징이 있다.
이 HCV의 TF 게놈은 5'UTR를 유지하고 있고, 적어도 HCV의 FLF 게놈의 3001번째 이후의 유전자를 유지하고 있는 것이 많은 것으로 생각된다. 3001번째 이후의 유전자에게는 NS2 영역의 C말측의 두 부분의 막 관통 영역이 존재하고 있고, NS3 이후의 HCV의 단백질을 이 TF 게놈으로부터 정상적으로 세포 중에서 산생할 수 있다. (이하의 핵산 배열의 위치를 나타내는 번호는 전장형(全長型)의 배열인 GeneBank ACCession No. D89815의 HCV의 배열에 상당하는 위치의 번호를 기재한다.)
또한, 이 TF 게놈은 코어 단백, E1 단백, E2 단백의 일부 또는 전부를 코드하는 영역이 결실되어 있는 것에 특징이 있다. 특히 모든 TF 게놈은 도 2와 도 3에 나타낸 바와 같이, E1 및 E2 영역을 코드하는 유전자를 완전한 형태로 유지하지 않는 것에 특징이 있다. 바꾸어 말하면, E1 단백 및 E2 단백의 일부 또는 전부를 코드하는 유전자가 결실되어 있는 것에 특징이 있는 유전자이다. 현재 취득되어 있는 TF 게놈에서는 E1를 코드하는 1200위로부터 E2를 코드하는 1998위를 유지하고 있는 TF 게놈은 확인되어 있지 않다. 각 TF 유전자의 연속되는 결실은 적어도 63 염기이고, 최대 2043 염기이다. 또한, 1개의 TF 유전자의 결실의 합계는 1449 내지 2067 염기이다.
가장 전형적인 TF 게놈의 구조는 E1 단백으로부터 NS2 단백을 코드하는 약 2 kb의 유전자가 결실되어 있는 구조이다 (도 2).
또한, 상기 결실을 가지고, 또한 5'UTR, NS3로부터 하류의 단백질을 코드하는 유전자 및 3'UTR를 유지하고 있는 유전자가 좋다.
이 TF 유전자는 그 결실 부분의 구조에 특징이 있다. 그 때문에 핵산 배열은 HCV의 유전자 배열이면 모든 것을 포함한다. 본원 발명에서는 유전자형 1 이외에, 유전자형 2의 TF 유전자도 확인하고 있으나, 그 이외의 유전자형 3에서 6의 배열의 TF유전자도 포함된다.
이 TF 게놈은 C형 만성 활동성 간염 환자의 간장 중에서 자율적으로 복제하고 있다. 또한, 환자 조직 중에서 TF 게놈만이 PCR로 증폭되는 검체가 있다. 이것으로부터, TF 게놈은 FLF 게놈보다 간 조직 중에서 우선적으로 복제하고 있는 것으로 생각된다.
그 때문에, 이 TF 게놈을 간장 세포 또는 간장 유래 세포에서 복제 가능한 레프리콘으로서 이용할 수 있다. TF 게놈 그 자체가 자율 복제 능력이 있기 때문에, 그 구조 자체로 레프리콘으로서 기능한다. 또한, 레프리콘을 선택하기 위하여, 약제 내성 유전자를 결합시켜, 약제에 의하여 선택하는 것도 가능하다. 약제 내성 유전자로서는 네오마이신 등을 사용할 수 있다. 레프리콘으로서 이용 가능한 TF 게놈은 전술한 구조 영역을 결실힌 TF 게놈의 모든 것이 포함된다. 또한, 그 이외에도 간장에서 복제하고 있는 HCV 유전자의 인프레임의 결실체인 TF 게놈이라면, 어떠한 구조라도 이용 가능하다.
본원 발명의 가장 바람직한 레프리콘은 HCV의 5' 비번역 영역의 IRES, 코어 영역, E1 영역의 일부 및 NS2의 일부로부터 하류의 비구조 단백을 코드하는 영역과 3' UTR를 가지는 레프리콘이며, 생체 내에서 복제하고 있는 TF 게놈과 같은 구조의 레프리콘이다. 또한, 나아가 레프리콘의 일부, 예를 들면 코어 영역에 네오마이신 내성 유전자를 넣은 것도 레프리콘으로서 유용하다.
또한, 본원 발명은 이 레프리콘이 복제하는 세포를 제공한다. TF 게놈의 구조를 가지는 HCV-RNA는 실제로 간장 세포 중에서 복제되고 있고, 이 구조를 이용한 TF 게놈 레프리콘의 복제계는 간장에서의 바이러스 복제계를 반영하고 있는 것으로 생각된다. 레프리콘이 복제하는 세포는 간장 유래의 계대 세포라도 좋고, 초대 간세포이어도 좋다. 또한, 본원 발명의 레프리콘이 복제하는 세포이면, 간장 유래가 아닌 세포라도 이용 가능하다. 레프리콘이 복제하는 세포는 항상적으로 레프리콘이 복제하는 세포도 포함된다. 또한, 일과성으로 레프리콘이 복제하는 세포도 포함된다.
TF 게놈의 레프리콘의 복제계는 HCV의 감염에 대한 약제의 스크리닝에 유용하다. 이것은 이 TF 게놈이 실제로 간장 조직 중에서 복제되고 있기 때문이다. 이 레프리콘이 복제하고 있는 세포의 레프리콘 복제계를 사용하여, HCV의 증식을 억제하는 약제의 스크리닝할 수 있다. 레프리콘 복제 세포를 이용하여 약제의 약효를 평가하는 방법도 포함된다. 이 스크리닝계 등은 실제로 간장에서 복제되고 있는 TF 게놈의 레프리콘을 타겟으로 하고 있기 때문에, 더 효과적인 약제를 스크리닝할 수 있을 것으로 기대된다. 스크리닝된 약제의 효과를 평가하는 방법으로서도 유효하다. 이 경우, 약제의 평가를 이 방법으로 실시하는 것이 중요하고, 약제의 관리에 필수라면, 약제를 제조하는 방법으로도 이용할 수 있다.
또한, 본원 발명은 TF 게놈을 검출하는 방법을 제공한다. TF 게놈은 결실 부분을 가지고 있다. 이 결실 부분이 있는 유전자를 검출하는 방법이면, 어떠한 방법으로도 이용할 수 있다. 예를 들면, 결실 영역으로부터 3'측의 외측에 프라이머를 설정하고, RNA보다 cDNA를 합성한다. 그 cDNA로부터 결실 영역을 삽입하도록 5'와 3'의 외측에 프라이머를 설정하고 PCR을 실시함으로써, 결실이 있는 FLF 게놈보다 짧은 유전자를 검출할 수 있다. PCR을 실시한 후에, 노던 블로팅에 의하여 짧은 TF 게놈을 검출하는 것도 가능하다.
이 TF 게놈을 검출하는 방법은 상기에 기재한 방법 이외에 유전자의 결실을 검출하는 모든 방법이 포함된다.
또 전장(全長)의 FLF 게놈 및 TF 게놈의 양을 정량하고, 그 후 FLF 게놈을 정량한 값을 뺌으로써, TF 게놈의 양을 구하는 것도 가능하다.
FLF 게놈 및 TF 게놈의 결실이 없는 공통 영역에 PCR의 프라이머를 설정하고 RT-PCR에 의하여 양쪽 모두의 유전자의 양을 측정한다. 다음에, TF 게놈의 결실 영역에 PCR의 프라이머를 설정하고, 유전자의 양을 측정함으로써 FLF 게놈의 양만을 측정할 수 있다. 또한, FLF 게놈 및 TF 게놈의 양과 FLF 게놈의 양을 비교함으로써, TF 게놈의 양을 측정할 수 있다.
FLF 게놈 및 TF 게놈의 양을 측정하는 프라이머는 FLF 게놈과 TF 게놈이 중복되는 영역이면 어떠한 영역에 프라이머를 설정하여도 된다. 예를 들면 5'UTR나 NS3 이후의 비구조 단백을 코드하는 영역 또는 3'UTR에 프라이머를 설정할 수 있 다. 또한, FLF 게놈의 양만을 측정하는 프라이머는 TF 게놈이 결실되어 있는 영역이면 어떠한 영역이어도 좋지만, 예를 들면 구조 단백을 코드하고 있는 영역인, 코어, E1, E2의 유전자 영역이나, P7 또는 NS2 결실 영역에 설정할 수 있다. 전형적인 TF 게놈은 핵산의 배열 번호 1189에서 2922의 유전자가 결실되어 있기 때문에, 이 영역에 프라이머를 설정함으로써, FLF 게놈만을 검출할 수 있다. 더 좋기로는, 본원 발명의 실시예에서 얻은 모든 TF 게놈은 1200에서 1998이 결실되어 있기 때문에, 이 영역에 프라이머를 설정함으로써, FLF 게놈만을 검출하는 것이 가능하다.
HCV-RNA 유전자의 정량은 RT-PCR법에 의하여 실시할 수 있다. 예를 들면, Competitive-RT-PCR이나 리얼타임 PCR법 등을 사용할 수 있다.
본원 발명은 TF 게놈으로부터 산생되는 폴리프로테인에 관한 발명을 제공한다. TF 게놈은 인플레임으로 결실이 발생하고, 폴리프로테인을 코드하고 있다. 이 폴리프로테인은 C형 간염의 폴리프로테인과 비교하면 결실 영역에 코드된 펩티드를 가지지 않는 폴리프로테인이다. 또한, 결실 부분의 상류의 N말측의 단백과 하류의 C말측의 단백이 융합된 FLF 게놈으로부터 산생되는 정상적인 폴리프로테인과는 다른 신규한 단백질이다.
본원 발명은, 또한 상기 폴리프로테인으로부터 프로세스된 단백질을 제공한다. 상기 폴리프로테인은 펩티다제로 절단되지만, 프로세스에 의하여 E1와 NS2의 융합 단백, E1와 E2의 융합 단백, 코어와 E2의 융합 단백 등의 신규한 단백이 산생된다. 상기 폴리프로테인 또는 프로세스된 단백질은 FLF 게놈에서는 산생되지 않기 때문에, 이 단백을 검출함으로써, TF 게놈의 검출과 동등한 효과를 얻을 수 있다.
본 발명은 TF 게놈, 또는 TF 게놈으로부터 산생된 폴리프로테인을 검출 또는 정량함으로써, 간염의 증상을 진단하는 진단법을 제공한다.
본원 발명은 또한 TF 게놈으로부터 산생된 폴리프로테인 또는 융합 단백에 대한 특이적인 항체를 제공한다. 이들 항체는 폴리프로테인 또는 융합 단백의 검출에 유용하다.
또한, 본원발명은 TF 게놈을 포함하는 바이러스와 유사한 구조를 가지는 입자를 제공한다. TF 게놈은 간세포 이외의 혈액으로부터 검출된다. 혈액 중에서 TF 게놈은 코어의 단백질과 관련하여 존재하고 있다. 이 바이러스 입자 또는 바이러스와 유사한 입자는 진단, 치료 등에 이용할 수 있다.
실시예
1. 단축형 배열의 단리와 해석
환자 간장 절편 BP207 (0.5 mm × 1 mm)을 100 ㎕의 RIPA 완충액 (20 mM Tris-HCl [pH 7.5], 150 mM NaCl, 1% NP40, 0.1% deoxycholate, complete protease inhibitor cocktail [Roche diagnostics corporation] 중에서 파쇄하고, 10 krpm, 5분간의 원심 후 상청을 회수하였다. 이 추출액으로부터 High Pure Viral Nucleic Acid Kit (Roche diagnostics corporation)를 사용하여 제조업체가 권장하는 방법에 따라 핵산을 정제하였다. 정제한 핵산에 HC9405R-1b 프라이머를 가하고 MMLV reverse transcriptase (Invitrogen)를 사용하여 제조업체가 권장하는 조건으로, 42℃, 1 시간 역전사 반응을 실시하게 하고 cDNA를 얻었다.
이 반응액에 RNaseH (Invitrogen)를 가하고, 37℃에서, 30 분간 반응시켜, RNA를 분해시켰다. 이 반응액의 일부를 이용하여 HC-LongAl 프라이머와 T7-HC9313R 프라이머의 존재하에서, KlenTaq LA DNA polymerase (Clontech, BD bioscience)를 사용하여 94℃, 20초, 68℃, 9분간으로 이루어지는 30회의 서멀 사이클 반응에 의한 포리메라제 체인 리액션 (PCR)을 실시함으로써, HCV 게놈 RNA의 cDNA (HCV cDNA)의 증폭을 실시하였다. 이 반응액의 일부를 사용하고, 또한 HC85F와 HC9302R 프라이머의 존재 하에서 PCR을 실시하고, HCV cDNA를 증폭하였다.
증폭된 단편은 0.7% 아가로스 겔 전기 영동에 의하여 분리하고, 아가로스 겔로부터 QIAquick 9eI purification kit (QIAGEN)를 사용하여 제조업체가 권장하는 방법으로 DNA 단편을 회수하였다. 회수한 HCV cDNA 단편은 제조업체가 권장하는 방법에 따라서 pGEM-T easy 벡터 (Promega)와 연결 반응시키고, DH5α주를 형질 전환시켰다. 암피실린 내성으로, IPTG와 X-gal을 가한 한천 배지 상에서의 평판 배양으로 백색 콜로니를 형성하는 형질 전환체를 선택하고, 암피실린을 10O ㎍/㎖가 되도록 가한 2YT 배지에서 배양하였다. 배양한 균체로부터 Wizard Plus SV Miniprep DNA Purification System를 사용하여 plasmid를 정제하였다.
정제한 플라스미드에 넣은 HCV cDNA의 배열은 벡터 및 HCV cDNA에 적합하는 적절하게 준비한 프라이머를 이용하여 CEQ DTCS Quick Start Kit (벡터맨 코르타ㅅ사)에 의하여, 제조업체가 권장하는 방법에 따라 반응을 실시하고, CEQ2000 XL DNA analysis svstem (Software version 4.0.0, 벡터맨 코르타사)에 의하여 해석하였다. 얻은 데이터를 기초로, Sequencher (Version 4.1.2, Gene Codes Corporation)를 사용하여 배열 데이터의 통합, 해석을 실시하고, HCV cDNA의 염기 배열을 결정 하였다. 클론 LV207-0193-1, LV207-0193-6, LV207-0193-15의 3 종류의 HCV cDNA의 배열을 결정하였다.
단리된 배열을, 공표되어 있는 HCV cDNA의 배열 (D89851)과 비교하면, PCR에 사용한 프라이머의 영역인 제85위로부터 제9302위까지의 배열을 포함하지만, 어느 클론에서나 제1189로부터 제3000위에 상당하는 배열이 결여되어 있었다. 예상되는 아미노산 배열을 구하면, LV207-0193-1의 배열은 아미노산 길이의 하나의 아미노산을 코드하지만, E1의 도중부터 NS2의 도중까지가 번역틀이 어긋나지 않고 (in frame) 연결되는 형태로 결실되어 있었다. 구한 배열을 기초로, 공통되는 배열, 컨센서스 배열을 MacVector (version 7)를 사용하여 결정하고, 얻은 HCV cDNA 클론의 적절한 단편을 조합함으로써, 컨센서스 배열을 가지는 HCV cDNA 단편을 작성하였다.
구체적으로는 먼저 클론(1)의 단편에, Cla_s, Cla_as 프라이머를 이용하여, Quick Mutagenesis Kit (Staratagene)를 사용하여 제조업체가 권장하는 방법에 따라, 배열 번호 1의 제709위에 위치하는 ClaI 사이트를 변이 도입하였다. LV207-0193-1에서 제709위의 ClaI 사이트로부터, 제1063위의 AfeI까지의 단편, 제4169위의 HpaI 사이트로부터 제5569위의 SacI 사이트까지 및 제6687위의 SfiI 사이트로부터 제7123위의 BglII 사이트까지 있는 단편을 단리하여 구축에 사용하였다. 또한, LV207-0193-6에서는 제1063위의 AfeI 사이트로부터 제1265위의 BsiWI 사이트까지 있는 단편을 단리하여 사용하였다.
LV207-0193-15에서는 제1265위의 BsiWI 사이트로부터 제4169위의 HpaI 사이 트까지의 단편, 제5569위의 SaCl 사이트로부터 제6687위의 SfiI 사이트까지 있는 단편 및 제7123위의 BglII 사이트로부터 제7386위의 HindIII 사이트까지 있는 단편을 단리하여, 구축에 사용하였다. 이들 단편을 조합하고, 또한 아래와 같이 환자 조직으로부터 단리된 3'UTR을 포함하는 제7383위의 HindIII와 제7786위의 XbaI의 단편을 조합하여 크로닝 벡터 pBIuescriptSKII(-) (Stratagene)의 ClaI 사이트로부터 XbaI 사이트에 넣음으로써, 배열 번호 1의 제709위로부터 제7786위까지의 배열을 가지는 pLVC_ClaXba 7.2K를 구축하였다. 또한, 제한 효소, T4 DNA ligase는 뉴 잉글랜드 바이오랩사, 타카라슈조사, 도요보사, 니폰 진사로부터 구입한 것을 사용하였다.
한편 HCV cDNA의 말단은 아래와 같이 단리하였다. 전술한 RNaseH로 처리한 cDNA 반응액의 일부를, HCLongH1 및 HC705R와 JumpStart RedTaq DNA polymerase (Sigma)를 사용하여 94℃, 20초, 55℃, 30 초, 72℃, 1분으로 이루어지는 서멀 사이클을 35회 반복하는 PCR에 의하여, 이미 보고되어 있는 HCV cDNA의 제1위에서 709위에 상당하는 단편을 증폭시켰다. 단편의 클로닝, 해석은 통상의 방법에 따라 실시하였다. 그 결과, 배열 번호 1의 제1위로부터 제709위까지를 포함하는 HCV cDNA, pLV207-0007을 얻었다. 이것을 주형에 T7-H1V2 프라이머와 CoreCla_as프라이머를 이용한 PCR에 의하여 약 0.7 kb의 단편을 증폭하고, pGEM-T Easy에 클로닝함으로써, pT7_LV207_0007을 얻었다.
pcDNA3.1(+) (Invitrogen사)의 NotI와 XbaI 사이트 간에 pT7_LV207_0007을 NotI와 ClaI에 의하여 절단함으로써 얻을 수 있는 약 0.7 kb의 단편과 pLVC_ClaXba 7.2K를 ClaI와 XbaI로 절단함으로써 얻을 수 있는 약 7.2 kb의 단편을, 연결 삽입함으로써, 배열 번호 1의 제1위로부터 제7786위까지의 배열을 가지는 HCV cDNA의 삽입된 플라스미드 pcD-LV207TF를 얻었다.
환자 조직으로부터의 3'
UTR
유전자의 분리
만성 간염 환자의 조직으로부터 상기와 동일한 방법으로 RNA를 회수하였다. RNA 2.5 ㎕에 프라이머 8913F를 5 pmole (0.5 ㎕) 가하고 70℃에서 3분간 유지하고, 얼음 중에서 급냉하였다. 이것에 5xFirst-Strand Buffer 2 ㎕, 0.1 M DTT 1 ㎕, 20 mM dNTP 0.5 ㎕, RNase Inhibitor (TAKARA) 20 units, MMLV 리버스 트랜스클립타제 0.5 ㎕를 가하고 전체 양으로 10 ㎕가 되도록 디에틸피로카보네이트 처리한 멸균수를 가하였다. 이 혼합액을 42℃에서, 60분간 반응시켰다. RNA를 파괴하기 위하여, RNaseH (TAKARA, 60U/ ㎕)을 12 U 가하고 37℃에서 30분간 유지하고, 그 후 72℃에서, 3분 실활시켜, cDNA로서 사용하였다.
이 cDNA, 2 ㎕를 프라이머 8913 F 및 RP2를 사용하여 상기와 같은 방법으로 PCR을 실시하였다. 이 PCR의 산물의 일부를 사용하여 8939F와 R1의 프라이머로 두번째의 PCR을 실시하고, 약 600 base의 PCR 산물을 얻었다. 이 PCR 산물을 pGEM-T Easy 벡터로 클로닝하고, 핵산 배열을 상기와 같은 방법으로 결정하였다. 이하에 클로닝 및 유전자의 구축에 사용한 프라이머 배열의 일부를 나타낸다.
1b160Bam: 5'-cgcggatcct tagtcctcca gaacccggac ac-3' (배열 번호:49)
chiba-as: 5'-tgcacggtct acgagacct-3' (배열 번호: 50)
chiba-s: 5'-tagtggtctg cggaaccggt-3' (배열 번호: 51)
core_cla_as: 5'-gccgcatgta agggtatcga tgacc-3' (배열 번호: 52)
core_cla_s: 5'-ggtcatcgat acccttacat gcggc-3' (배열 번호 53)
eco_npt_as: 5'-gcgaattctt atcagaagaa ctcgtcaaga ag-3' (배열 번호: 54)
HClb9405R: 5'-gcctattggc ctggagtgtt tagctc-3' (배열 번호 55)
HC85F: 5'-atggcgttag tatgagtgtc gtgcagcct-3' (배열 번호 56)
HC705R: 5'-agccgcatgtaagggtatcgatgac-3' (배열 번호: 57)
HCl986S: 5'-tggttcggct gyacatggat gaa-3' (배열 번호: 58)
HC2199AS: 5'-ggrtagtgcc aragcctgta tgggta-3 (배열 번호: 59)
HC9302R: 5'-tcgggcacga gacaggctgt gatatatgtc t-3' (배열 번호: 60)
HClongAl: 5'-atcgtcttca cgcagaaagc gtctagccat-3' (배열 번호: 61)
HClongH1: 5'-gccagccccc tgatgggggc gacactccac c-3' (배열 번호: 62)
Nde_core9_as: 5'-aatcatatgt ctttgaggtt taggatttgt-3' (배열 번호: 63)
Nde_npt_s: 5'-gacatatgat tgaacaagat ggattgcac-3' (배열 번호: 64)
SbfHl: 5'-gtcctgcagg ccagccccct gatgggggcg aca-3' (배열 번호: 65)
SbfNpt-R: 5'-gacctgcagg ttatcagaag aactcgtcaa gaag-3' (배열 번호: 66)
T7_HlV2 : 5'-gccttaatta atacgactca ctataggcca gccccctgat gggggcgaca-3' (배열 번호: 67)
T7_HClongH1:5'-tctagtcgac ggccagtgaa ttgtaatacg actcactata gggcggccag ccccctgatgggggcgacac tccacc-3' (배열 번호: 68)
T7_HC9313b: 5'-tctagtcgac ggccagtgaa ttgtaatacg actcactcta gggcggcggg gtcgggcwcg ngacabgctg tga-3' (배열 번호: 83)
실시예 2.
환자로부터의
TF
HCV
게놈 RNA에 대한 cDNA의 단리와 해석
다음에, 만성 활동성 간염의 간장의 간생검의 23 검체 및 간암 환자의 수술시의 조직 검체인 BP1, BP2, BP3의 3 검체로부터 TF의 HCV-RNA의 검출을 시도하였다.
RNA의 추출
RNA용 추출 시약 ISOGEN (니폰 진사)의「미량 시료로부터의 RNA의 단리」의 프로토콜에 준하여, RNA를 추출하였다. 약 0.5 mm×1 mm의 크기의 환자 간장 절편에 0.8 ㎖의 ISOGEN를 가하고 1 ㎖의 칩의 끝으로 조직편을 풀어헤쳐, 피펫팅에 의하여 분쇄하였다. 5 분간 실온에서 방치하고, 0.2 ㎖의 클로로포름을 첨가한 후, 30초 심하게 교반하고, 4℃에서 5 분간 두었다. 냉각 미량 원심기 12000 g으로 4℃, 15분 원심하고, 수상(水相)을 회수하였다.
약 5 ㎍의 효모 tRNA를 첨가하고, 0.8 ㎖의 이소프로판올을 첨가하고 4℃에서 30분 또는 하루 낮밤 방치하였다. 12000 g으로 4℃, 15분간 원심하고, 상청을 버리고 펠렛이 된 RNA를 회수하였다. 그 펠렛에 1 ㎖의 70% 에탄올을 가하고 심하게 교반하여 12000 g으로 4℃에서 15 분간 원심하였다. 상청을 버리고, 동일한 조작을 2회 반복하고, 펠렛의 세정을 실시하였다. 마지막 세정 후에 펠렛를 10분간 풍건하고, 디에틸피로카보네이트 처리한 멸균수 50 ㎕에 용해하고, RNA 샘플로 하였다. 샘플은 사용할 때까지 -80℃로 보존하였다.
cDNA 합성
간생검 조직으로부터 추출한 RNA 샘플로부터 cDNA의 합성을 실시한 BP207의 조직으로부터 취득한 TF 게놈의 결실 영역이 HCV의 유전자의 핵산 번호의 1189에서 3000이므로 3000으로부터 3' 비번역 영역측에 cDNA 합성의 프라이머를 수 개 설정하고, cDNA 합성을 실시하였다. 리버스 트랜스크립타제는 GIBCO-BRL사의 MMLV 리버스 트랜스크립타제를 사용하고, 첨부한 프로토콜에 따라서 합성하였다.
RNA 2.5 ㎕에 프라이머를 5 pmole (0.5 ㎕) 가하고, 70℃에서 3 분간 유지한 후, 얼음 중에서 급냉하였다. 이것에 5xFirst-Strand Buffer 2 ㎕, O.1 M DTT 1 ㎕, 20 mM dNTP 0.5 ㎕, RNase Inhibitor (TAKARA) 20 units, MMLV 리버스 트랜스크립타제 0.5 ㎕을 가하고, 전체 양이 10 ㎕이 되도록 디에틸피로카보네이트 처리한 멸균수를 가하였다. 이 혼합액을 42℃에서 60분간 반응시켰다. RNA를 파괴하기 위하여, RNaseH (TAKARA, 60 U/㎕)를 12 U 가하고, 37℃에서 30분간 유지하고, 그 후 72℃에서 3분간 실활시켜, cDNA로서 사용하였다.
또한, cDNA 합성용의 프라이머는,
5035R: 5'-AGGCCTGTGA AGACGCTCTC CCAGAACT-3' (배열 번호: 69)
HC3297R: 5'-GGTGATGAC CTTGGTCTCC AT-3' (배열 번호: 70)
HC3481R : 5'-GCTTAGAGGC TAGTGATGAT GCAACCAAGT AC-3' (배열 번호: 71)
HC3945R: 5'-GGCGACCGCA TAGTAGTTTC CATA-3' (배열 번호: 72)
의 어느 하나를 사용하였다. 어느 프라이머를 사용하였는지는 표 1에 기재한다.
포리메라제 체인 리액션 ( PCR )에 의한 증폭
cDNA를 사용하여 5' UTR으로부터 NS3 코드하는 영역에 걸쳐서 몇개의 프라이마의 조합으로 PCR을 실시하였다. PCR는 TaKaRa LA Taq (TAKAR A)를 사용하여 첨부한 프로토콜에 준하여 반응시켰다. cDNA 2 ㎕, 10xLA PCR buffer II (Mg2 + free) 2.5 ㎕, 25 mM MgCl2 2.5 ㎕, 2.5 mM dNTP 2.5 mM, 센스 프라이머 10 pmole, 안티센스 프라이머 10 pmole, TaKaRa LA Taq 0.25 ㎕ (5 unit/㎕)를 가하고 이것에 멸균수를 가하고 25 ㎕으로 하여 PCR을 실시하였다. 반응은 마스터 사이클러 그레디언트(에펜도르프 야트론사)을 사용하였다. 반응 프로파일은 94℃에서 2분간 가열한 후, 변성 94℃ 20초, 어닐링 62℃ 30초, 연장 68℃ 3분으로 10 사이클, 그 후 변성 94℃ 20초, 어닐링 58℃ 30초, 연장 68℃ 3분으로 25 사이클로 실시하였다.
이 1회째의 PCR (1st PCR) 산물 2.5 ㎕을 사용하여 두번째의 PCR (2nd PCR)을 실시하였다. 제1 PCR 산물 2.5 ㎕, 10xLA PCR buffer II (Mg2 + free) 2.5 ㎕, 25 mM MgCl2 2.5 ㎕, 2.5 mM dNTP 2.5 mM, 센스 프라이머 10 pmole, 안티센스 프라이머 10 pmole, TaKaRa LA Taq 0.25 ㎕ (5 unit/㎕)를 가하고 이것에 멸균수를 가하여 25 ㎕으로 하고, 제1 PCR과 동일한 반응 프로파일로 PCR을 실시하였다.
프라이머는,
HC85F : 5'-ATGGCGTTAG TATGAGTGTC GTGCAGCCT-3' (배열 번호: 56)
HC813S: 5'-CTGGAGGACG GCGTGAACTA TGCAACAGGG AA-3' (배열 번호: 73)
HC841S: 5'-GGAACTTGCC CGGTTGCTCT TTCTCTATCT TC-3' (배열 번호: 74)
HC3206R : 5'-TGGGGCAAGA TGGTTATAAA C-3' (배열 번호: 75)
HC3174AS: 5'-GGGGTAAGAT GGTTATAAAC GTACGTACCT G-3' (배열번호: 76)
HC3111AS : 5'-ATAATGACCC CCGGCGACTT TCCGCACTAA C-3' (배열 번호: 77)
HC3297R : 5'-GGTGATGAC CTTGGTCTCC AT-3' (배열 번호: 70)
HC3481R : 5'-GCTTAGAGGC TAGTGATGAT GCAACCAAGT AC-3' (배열번호: 71)
HC3759R : 5'-TGACATCAGC ATGTCTCGTG ACCA-3' (배열 번호: 78)
HCLONGAl : 5'-ATCGTCTTCA CGCAGAAAGC GTCTAGCCAT-3' (배열 번호: 61)
HC3945R: 5'-GGCGACCGCA TAGTAGTTTC CATA-3' (배열 번호: 72)
의 어느 프라이머를 조합하여 사용하였다. 사용한 1st 및 2nd PCR의 프라이머의 조합은 표 1에 기재하였다.
PCR
산물의 해석
반응 종료 후의 PCR 산물을 전기 영동으로 해석하였다. 1% 아가로스 겔을 사용하여 1xTAE 버퍼 (Tris-HCl 40 mM, 빙초산 40 mM, EDTA 1mM)으로 서브마린형 전기 영동조로 영동하였다. PCR 산물에 겔 로딩 버퍼를 가하고 전기 영동 후, 에티듐브로마이드로 염색하여, 자외선 하에서 PCR 산물을 관찰하였다.
4 검체의 RNA로부터 3조(組)의 프라이머로 HCV의 유전자를 증폭시킨 전기 영동의 결과를 도 1에 나타낸다. 2nd PCR의 프라이머가 HC85F와 HC3297R의 조합에서는 약 3.1 kb, 841S와 3759R의 조합에서는 약 2.9 kb, 841S와 3111AS의 조합에서는 2.2 kb의 FLF의 HCV-RNA가 검출될 것이다. 번호 1의 검체 (BP274)는 어느 프라이머 의 조합에서도 FLF의 길이의 PCR 산물만이 검출되었다. 번호 2의 검체 (BP295) 는 HC85F와 HC3297R의 조합에서는 약 3.1 kb의 FLF와 2 kb의 TF, 841S와 3759R의 조합에서는 약 2.9 kb의 FLF, 841S와 3111AS의 조합에서는 2.2 kb의 FLF가 검출되었다.
번호 3의 검체 (BP325)는 HC85F와 HC3297R의 조합에서는 약 3.1 kb의 FLF, 841S와 3759R의 조합에서는 약 2.9 kb의 FLF, 841S와 311lAS의 조합에서는 2.2 kb의 FLF와 30O bp의 TF가 검출되었다. 번호 4의 검체 (BP373)는 HC85F와 HC3297R의 조합에서는 약 1.2 kb의 TF, 841S와 3759R의 조합에서는 약 1 kb의 TF, 841S와 3111 AS의 조합에서는 PCR 산물은 검출되지 않았다. 검체와 프라이머의 조합으로 달라지는데, FLF만이 검출되는 검체, FLF와 TF가 검출되는 검체, TF만이 검출되는 검체가 있었다. 26 검체에 대하여, 각각의 cDNA 합성 프라이머, 1st PCR, 2nd PCR의 프라이머의 조합에서 얻은 PCR 산물을 표 1에 나타낸다.
배열의 결정
얻은 FLF 및 TF이라고 생각되는 PCR 산물을 플라스미드에 넣어, 배열을 결정하였다.
PCR 산물을, 전기 영동한 아가로스 겔로부터 잘라내어, QIAquick PCR Purification Kit (QIAGEN)에 의하여 정제하고 40 ㎕의 멸균수에 추출하였다. DNA 5 ㎕으로 pGEM-T Easy Vector (Promega사) 0.5 ㎕에 10 x T4 ligase buffer 1 ㎕, T4 DNA Ligase 1 ㎕, 멸균수 2.5 ㎕을 가하고, 16℃에서 1 시간 반응시키고, DNA와 벡터를 결합시켰다. 이노우에 등의 방법 (Gene, vol 96, 1990, pp 23-28)그리고 작성한 대장균 DH5α의 컴피턴트 셀에 DNA를 가하고 정법에 따라서 형질 전환시킨다.
출현한 콜로니를 2xYT 배지에서 하루 낮밤 배양하고, Wizard Plus SV Minipreps DNA Purification System (Promega사)로 미니프리하고, 플라스미드 DNA를 회수하였다. PCR 산물이 들어간 플라스미드 DNA를 BECKMAN COULTER사의 CEQ 2000XL DNA analysis system으로 해석하고 배열을 결정하였다. CEQ2000 Dye Terminator Cycle Sequencing with Quick Start Kit를 사용하여 첨부한 프로토콜에 따라 반응시키고 해석하였다. 시퀀스 프라이머는 pGEM-T Easy의 프라이머 및 HCV의 배열에 적합한 것을 적절하게 선택하였다. 배열의 해석은 MacVector (Accelrys사) 및 Sequencher (Gene Codes사로 해석하였다.
검출된 TF 게놈의 핵산 배열 및 그로부터 추정되는 아미노산 배열을 배열표에 나타낸다. 검체 BP 203은 배열 번호 9 내지 12에, BP204는 배열 번호 13과 14에, BP208는 배열 번호 15와 16에, BP295는 배열 번호 17과 18에, BP325는 배열 번호 19와 20에, BP368는 배열 번호 21과 22에, BP373는 배열 번호 23 내지 28에, BP1는 배열 번호 29 내지 34에, BP2는 배열 번호 35 내지 38에 각각 기재하였다. 배열의 결정에 의하여, 어느 하나의 프라이머의 조합으로 FLF가 검출된 것, TF가 검출된 것을 정리하여 표 2에 나타내었다.
TF만이 검출된 검체가 6 검체, FLF와 TF가 검출된 검체가 5 검체, FLF만이 검출된 검체가 6 검체, 나머지의 검체는 어느 것도 검출되지 않았다. 두 가지 모두 검출된 검체는 대부분이 유전자형 2이며, 사용한 cDNA 합성 프라이머, 1st PCR, 2nd PCR 의 프라이머는 유전자형 1의 배열을 기초로 합성하였기 때문에, 유전자형 2의 HCV-RNA의 배열과는 일치하지 않는 부분이 있고, PCR 산물을 얻을 수 없는 검체가 많았던 것으로 생각된다.
TF
게놈의 해석
얻은 HCV-RNA의 TF의 배열을 No.D89815의 HCV의 배열의 구조와 비교 검토하였다. 도 2는 TF만이 검출된 BP207, BP368, BP373, BP1, BP2 및 BP203의 구조를 나타내고 있다. 결실 부분은 약 2 kb이지만, BP207와 완전히 동일한 영역이 결실되어 있는 것은 없고, 각각의 환자에게서 결실 영역이 차이가 났다. BP203는 유전자형 2의 검체에서 유일하게 TF가 검출된 검체이지만, 동일한 결실이 발견되었고, 결실 부분은 핵산 번호의 988에서 2988위에 존재하고 있었다.
이들 HCV--RNA에 공통되어 있는 것은 BP207와 같이 인프레임으로 결실이 일어나고 있고, 결실 부분을 제외하고는 HCV의 폴리프로테인을 합성하고 있는 것으로 생각된다. 특히 코어 단백을 코드하는 유전자를 정상적인 형태로 유지하고 있고, 코어 단백도 정상적으로 합성될 수 있는 것, NS2의 C말단측의 두 부분의 막 관통 영역이 결실된 것은 없고, NS3 이후의 단백질도 정상적으로 발현할 수 있는 것이 시사된다. 또한 E1와 NS2의 단백을 코드하는 영역이 인프레임으로 결합하고 있기 때문에, E1와 NS2의 융합 단백이 산생되고 있는 것으로 생각된다.
다음에, TF와 FLF의 양쪽 모두가 검출된 검체에 대하여 TF의 배열의 구조를 비교하였다 (도 3). BP204, BP325, BP295, BP288의 배열을 확인하였다. TF만이 검출된 검체와 달리, 코어 단백을 코드하는 영역에 해당하는 유전자가 취득되어 있는 BP204, BP295에서는 코어 영역의 일부 또는 전부가 결실되어 있었다. 또한, 결실 영역의 후반 부분도 NS2를 유지하고, E2영역의 일부를 결실되어 있는 것이 있었다.
다만, 검체의 HCV-RNA도 BP207와 마찬가지로 인프레임으로 결실이 일어나고 있어서, 결실 부분을 제외하고는 HCV의 폴리프로테인을 합성하고 있는 것으로 생각된다.
TF
와
FLF
의
HCV
-RNA의 배열의 검토
만성 활동성 간염 환자에게서 TF와 FLF의 HCV-RNA가 취득된 검체에 대하여, 그 중복 부분의 핵산 배열로 추정되는 아미노산 배열을 비교하였다. TF 길이의 PCR 산물을 동일한 방법으로 벡터에 클로닝하고, 핵산 배열을 결정하였다. BP204, BP325, BP208의 FLF의 배열은 각각 배열 번호 39와 40, 41과 42, 43과 44에 나타낸다. 핵산, 아미노산의 배열을 비교하면 BP325는 핵산에서 96.7%, 아미노산에서 97.5%, BP288는 핵산에서 97.3%, 아미노산에서 97.7%인 것으로 보아 동일한 유사종(quasispecies)에 속하는 바이러스라고 생각되었다. 한편, BP204는 핵산에서 82.6%, 아미노산에서 83.6%로 TF 와 FLF의 배열은 괴리되어 있었다.
혈청중으로부터의
TF
-
HCV
-
RNA
의 분리
간조직으로부터 TF-HCV-RNA가 검출된 검체에 대하여, 혈청으로부터 TF-HCV-RNA의 분리를 시도하였다. 간생검과 동시에 채혈된 혈청으로부터 High Pure Viral RNA Kit (Roche사)를 이용하여 추출하였다. 혈청 200 ㎕으로부터 첨부한 프로토콜에 따라서, 용출 완충액 50 ㎕에 추출하였다. 여기서 간생검의 검체인 BP368에 대응하는 혈청은 이하 S368라고 기재한다.
RNA 2.5 ㎕를 사용하여, 간생검의 RNA로부터 cDNA를 합성하고, PCR을 실시한 방법과 같은 방법으로, PCR 산물을 취득하였다. S368에 대하여 cDNA 합성을 3297R 프라이머, 제1 PCR을 HCLONGAl과 3297R, 2ndPCR을 85F와 3174AS로 실시하였을 경우에 1.2 kb의 TF의 PCR 산물이라고 생각되는 것을 취득할 수 있었다 (표 4).
다음에, 5'UTR 영역에서의 HCV-RNA의 정량으로 비교적 RNA량이 많았던 S20 4, S207, S368에 대하여, 그 밖의 프라이머의 조합으로 cDNA 합성으로부터 PCR을 실시하였다. 그 결과, S207와 S368는 TF의 길이의 PCR 산물이, S204에서는 FLF의 길이의 PCR 산물이 취득되었다 (표 5). 이 PCR 산물을 마찬가지로 pGEM-T Easy vector에 넣고, 배열을 결정하였다. S207, S368가 결정된 배열은 각각 배열 번호 45과 46, 배열 번호 47과 48에 나타낸다. BP207와 S207 및 BP368와 S368의 중복 부분에 대하여 핵산 및 아미노산의 배열 상동성을 비교하면, BP207와 S207에서 각각 99.4%과 99.3%, P368와 S368에서 98.8%과 97.3%이어서, 동일한 유사종에 속하는 바이러스라고 생각된다 (표 6). 이것은 간장 중에서 복제된 TF-HCV-RNA가 어떤 시스템으로 혈청 중에 방출되고 있는 것이다.
실시예
3.
HCV
RNA
레프리콘의
작성
pBluescriptIISK(+)의 XhoI와 XbaI 사이트 간에, XhoX-Xba-s 올리고머와 XhoX-Xba-as 올리고머를 어닐링시켜 얻는 링커 단편을 삽입함으로써, pBSIISK (+)△XX를 구축하였다. 또한, pLV207-0007을 Sbf-H1 프라이머와 Cla_as 프라이머를 사용한 PCR에 제공함으로써, 약 0.7 kb의 단편을 증폭시키고, 이것을 pGEM-T Easy에 클로닝함으로써, pLVC-0007 Sbf를 얻었다. pBSIISK(+) △XX의 NotI와 XbaI의 사이트 간에, pLVC-0007Sbf를 NotI와 ClaI로 절단함으로써 얻을 수 있는 약 0.7 kb의 단편과 pLVC_ClaXba 7.2 K를 ClaI와 XbaI로 절단함으로써 얻는 약 7.2 kb의 단편을 연결 삽입함으로써 pSbf-LV207TF를 얻었다.
한편, HCV 항체 양성의 혈청, G14로부터 정제한 RNA에, T7-HC9313b 프라이머를 가하고 SuperscriptII reverse transcriptase (Invitrogen)에 의하여, 제조업체가 권장하는 방법으로 cDNA를 합성하였다. 이 cDNA 반응액의 일부를 이용하여 T7-HClongH1 프라이머와 1b160Bam 프라이머의 존재하의 EX-Taq DNA polymerase (타카라슈조사)를 사용한 PCR (1 사이클이 95℃, 30초, 55℃, 1분, 74℃, 1분으로 이루어지는 반응을 35 사이클)에 의하여, HCV cDNA를 증폭하였다. 아가로스 겔 전기 영동으로 증폭된 단편을 분리하고, QIAquick gel kit를 사용하여 아가로스 겔로부터 DNA를 정제하였다. 정제한 단편은 pT7-blue T (Novagen)에 클로닝하고, Applied Biosystems DNA sequencer 377A를 사용하여 제조업체가 권장하는 시약, 조건으로 배열을 결정하였다.
다음에, HCV cDNA를 T7-H1V2 프라이머와 nde_core9_as 프라이머를 사용하여 PCR로 증폭하였다. 한편 pcDNA 3.1 (+)을 템플릿에 nde_npt_S와 EcoNpt_as 프라이머로 PCR을 실시하고, 네오마이신 내성 유전자 단편을 증폭하였다. 단편을 NdeI의 제한 효소 부위에서 연결시킨 것을 pBluescriptIISK(-)에 클로닝함으로써, HCV cDNA의 5'UTR와 core의 최초의 9 아미노산과 네오마이신 내성 유전자 (네오마이신 인산 전이 효소, NPT-II)와의 융합 단편을 구축하였다. 이 단편을 가지는 플라스미드를 또한 T7-H1V2 프라이머와 Sbf_Npt_R 프라이머를 사용한 PCR을 실시하고, 5'말단에 PacI의 3'말단에 SbfI의 사이트를 가지는 단편을 조제하였다. 이것을 pGEM-T Easy로 클로닝함으로써 pG14UTRcNE0를 얻었다.
pG14UTRcNE0로부터 NotI와 SbfI로 절단함으로써 얻을 수 있는 약 1.2 kb의 단편을, pSbf-LV207TF의 NotI와 SbfI 사이트 사이에 삽입함으로써, pLV207TFRepG14를 얻었다. 이 플라스미드 DNA를 XbaI 와 NotI로 절단한 것을 주형으로, Megascript T7 kit (Ambion)를 사용하여 RNA를 합성하였다. 제조업체가 권장하는 방법으로 RNA를 정제하였다.
사람 간암세포 (Huh7, JCRB0403)는 Dulbecco's modified Eagle medium (D-MEM, IWAKI)에 10% 소 태아 혈청 (FBS), 페니실린과 스트렙토마이신을 각각 50 U/㎖, 50 ug/㎖이 되도록 가한 것을 배양액으로 하여 5% 이산화탄소를 부가하고, 37℃에서 배양을 실시하였다. 컨플루언트가 되기 전의 세포를 트립신, EDTA 처리에 의하여 배양접시로부터 박리시키고, 혈청 첨가 배지에 재현탁함으로써 트립신을 비활성화한다. PBS로 2회 세정 후, 1.25% DMSO를 첨가한 Cytomix (120 mM Potassium chloride, 10 mM Potassium phosphate, 5 mM Magnesium chloride, 25 mM HEPES, 0.15 mM Calcium chloride, 2 mM EGTA, pH 7.6)에 재현탁하고, 갭 0.4 cm의 일렉트로포레이션 큐벳에 옮긴다.
적당량의 RNA를 세포에 가한 후, 5분간 얼음 위에서 충분히 냉각한다. 일렉트로포레이터 (Bio-Rad사)로, 960 uF, 270 V로 펄스를 가한다. 곧 바로 8 ㎖의 배지에 재현탁하여 일부를 플레이트에 뿌린다. 일정 시간 배양한 후, 세포를 0.1 % EDTA, PBS로 박리시키고, 원심 분리에 의하여 침전시켜, 회수하였다. 회수한 세포로부터, Isogen (니폰 진사)을 사용하고, 제조업체가 권장하는 조건에 따라 RNA를 회수하였다. 회수한 RNA에 포함되는 HCV RNA량은 정량적 RT-PCR법으로 해석하였다.
마이너스 사슬의 정량 방법
HCV RNA의 복제가 일어나고 있는 지는 세포 중에 HCV RNA의 5' UTR 영역의 마이너스 사슬을 검출할 수 있는지 아닌지로 조사하였다. 마이너스 사슬의 특이적인 정량법은 일본 공개 특허 공보 평08-187097호에 기재된 마이너스 사슬 RNA의 특이적 검출법과 동일한 방법으로 실시하였다.
pLV207TFRepG14를 주형으로 생체 외에서 합성한 RNA를 일렉트로포레이션으로 도입한 세포로부터 유의한 양의 마이너스 사슬을 검출할 수 있고, HCV RNA를 복제할 수 있는 것이 확인되었다.
실시예
4. 전체 RNA 중의 단축형 RNA
량비와
HCV
관련 질환과의 상관
환자 검체로부터 회수한 RNA에 포함되는 HCV RNA의 정량은 이하와 같이 실시하였다. 5' UTR를 표적으로 한 RNA의 정량을 실시하는 경우에는 Chiba-S , Chiba-AS 프라이머를 사용하였다. QauntiTect SYBR Green RT-PCR Kit (QIAGEN)를 사용하여 제조업체가 권장하는 조건으로 반응액을 조제하고, LightCycler Capillary (Roche diagnostics사)로 옮기고, LightCycler (Roche diagnostics사)에 세트하고, 반응시켜 PCR 산물을 경시적으로 모니터하였다. 적절하게 희석한 생체 외에서 합성한 HCV 5' UTR를 포함하는 기지(旣知)의 농도의 RNA를 표준 물질로서 사용하였다. LightCycler software (V3.5.3)를 이용하고 해석을 실시하였다.
RNA의 정량을 E2의 영역을 표적으로 실시하는 경우에는 HCl986S 프라이머와 HC2199-as 프라이머를 사용하였다. 0neStep RT-PCR kit (QIAGEN)를 사용하여, 제조업체가 권장하는 반응 조건으로 역전사 반응이 일어나게 하였다. 이 반응액을, 등량의 동일한 프라이머를 포함하는 LightCycler-FastStart DNA Master SYBR Green I kit (Roche diagnostics사)로 조제한 반응액을 가하고, LightCycler Capillary (Roche diagnostics사)로 옮기고, LightCycler (Roche diagnostics사)에 세트하였다. PCR 반응이 일어나게 하여 산물을 경시적으로 모니터하였다. 적절하게 희석한 생체 외에서 합성한 HCV 5'UTR를 포함하는 기존 농도의 RNA를 표준 물질로서 사용하여 Light Cycler software를 이용하여 정량치를 구하였다.
실시예
5. 단축형 배열을
코드하는
cDNA의 포유류 세포에서의 발현과
HCV
단백질의 해석
pcDNA 3.1의 NotI, XbaI에 pLV207TFRepG14를 NotI, XbaI로 절단하여 얻는 약 8.5 kb의 단편을 삽입함으로써, pcD-LVTRG를 얻었다. 이 플라스미드 DNA를 사람 신장 유래 배양 세포, 293 TRex에 Lipofectamine 2000 (Invitrogen사)를 사용하여 트랜스펙트하였다. DNA 트랜스펙트 후 4 시간째에 배지를 교환하고, 또한 세포를 18 시간 배양하였다. 세포를 플레이트로부터 회수하고, 원심 분리에 의하여 침전, 회수하였다. 세포를 RIPA 완충액 중에서 피펫팅함으로써 파쇄하고, 원심 분리법으로 상청을 회수하였다. 회수한 상청에 1/3 양의 3 x SDS sample buffer (187.5 mM Tris-HCl [pH 6.8], 6 % SDS, 125 mM DTT, 30% 글리세롤)를 가하고, 95℃, 5분간 열처리하였다.
폴리아크릴아미드 겔 (다이이치가가쿠야쿠힌사)에 적용하여, 제조업체가 권장하는 방법으로 전기 영동을 실시하였다. 영동 종료후, PVDF막 (Millipore사)에 Semi-Dry blotter (잘트리우스사)를 사용하여 통상적인 방법에 따라 전사하였다. 전사한 막은 TTBS (20 mM Tris-HCl [pH 7.5], 150 mM NaCl, 0.1 % Tween 20)에서 세정한 후, 10배 희석한 블로킹제 (Milk Diluent / Blocking Solution, Kirkegaad & Perry Laboratories사)중에서 실온, 2 시간 반응시켰다. 여기에 최종농도 0.3 ug/ ㎖가 되도록 희석한 일차 항체를 가하고 실온, 1.5 시간, 진탕하면서 반응시켰다. 반응액을 버리고 TTBS로 3회 세정 후, 40,000배로 희석한 HRP 표지 항체를 가하고, 실온, 1 시간, 진탕하면서 반응시켰다.
반응액을 버리고 TTBS에서 3회 세정 후, SuperSignal West Pico Chemiluminescent Substrate (Pierce사)를 사용하여 실온, 5분간 반응시켰다. 발생한 화학 발광은 LAS1000 (Fuji Film사)을 사용하여 검출하거나, 시그널이 불충분한 경우에는 BioMax Film (Kodak사)를 노광시킴으로써 검출하였다. pLV207TRG의 HCV cDNA에 의하여, 성숙형의 코어 항원 일치하는 위치에, NS3 항원이 성숙형 NS3 항원의 분자량에 일치하는 위치에, 각각 항코어 모노클로날 항체, 항NS3 토끼 항혈청에 의하여 검출되었다.
또한, 항E1 모노클로날 항체와 반응시키면, 분자량 약 35 kd의 정상적인 것에 가까운 분자량의 것이 검출되지만, EndoH 처리 후에 분석하여 반응시킨 경우에는 분자량 24 kd로 변화하였다. 이 분자량은 LV 207의 HCV cDNA가 예상되는 아미노산 배열의 E1와 NS2의 융합 단백질의 아미노산 배열로부터 산출되는 분자량과 거의 일치한다. 이것으로부터 E1와 NS2는 융합 단백질로서 존재하고, 당사슬 수식을 받는 것을 알게 되었다. 이들로부터 LV207의 HCV cDNA에 의하여 코드되는 HCV 폴리프로테인은 FLF형의 폴리프로테인과 동일한 절단 부위에서 절단되고 있는 것을 알게 되었다.
실시예
6.
HCV
RNA
레프리콘의
작성과 세포에서의 복제
약제 내성 마커인 네오마이신 내성 유전자를 가지지 않는 TF 타입의 레프리콘을 작성하였다. 실시예 3에서 구축한 pSbf-LV207TF의 플라스미드를 NotI와 ClaI로 절단하여 약 0.7 kb의 단편을 얻었다. 이 단편을 실시예 3에서 구축한 플라스미드 pLV207TFRepG14의 NotI와 ClaI 사이트 사이에 삽입함으로써, 플라스미드 pLV207TF를 구축하였다.
이 플라스미드 DNA를 NotI와 XbaI로 절단한 것을 주형으로 하고, Megascript T7 Kit (Ambion사)을 사용하여 RNA를 합성하였다. 이 RNA를 제조업체가 권장하는 방법으로 정제하고, 세포에의 트랜스펙션에 사용하였다.
2일간 배양을 실시한, 사람 간암 세포 Huh7에, 정제한 RNA를 일렉트로포레이션에 의하여 트랜스펙트하였다. 세포를 트립신, EDTA 처리하여 박리시키고, 혈청 첨가 배지에 재현탁하여, 트립신을 비활성화한다. PBS로 2회 세정 후, 1.25 % DMSO를 첨가한 Cytomix (120 mM Potassium chloride, 10 mM Potassium phosphate, 5 mM Magnesium chloride, 25 mM HEPES, 0.15 mM Calcium chloride, 2 mM EGTA, pH 7.6)에 재현탁하고, 약 4×106의 세포를 갭 0.4 cm의 일렉트로포레이션 큐벳에 옮겼다.
10 ㎍의 RNA를 큐벳에 가하고 5 분간 얼음 위에서 충분히 냉각한다. 일렉트로포레이터 (Bio-Rad)로 960 uF, 270 V로 펄스를 가한다. 즉시 8 ㎖의 배지에 재현탁하여, 12 웰의 플레이트 (직경 22.1 mm)에 뿌렸다. 4 시간, 24 시간, 48 시간, 72 시간 및 96 시간 세포를 0.1 % EDTA-PBS로 박리하여 원심 분리로 회수하였다. 세포 펠렛를 50 ㎕ RIPA 완충액 (20 mM Tris-HCl (pH 7.5), 150 mM NaCl, 1 mM EDTA, 1% NP40, 0.1 % Deoxycholate, 0.1 % SDS, complete protease inhibitor cocktail (Roche diagnostics사)에 용해하고, 10 krpm으로 5분간 원심에 의하여 상청을 회수하였다. 상청 10 ㎕를 HCV 코어 항원의 킷 (후지 레비오, 루미펄스사)을 사용하여 측정하였다.
도 4에 나타내는 바와 같이, 코어 항원의 측정값은 24 시간까지는 검출 한계 이하이지만, 48 시간부터 상승하고, 96 시간 후에도 증가하고 있었다. 이것은 본원 발명의 TF 타입의 레프리콘이 세포 중에서 복제되어, 코어 단백질을 복제하고 있는 것을 나타내고 있다. 간장에서 복제하고 있는 구조와 같은 TF 게놈이 생체 외에서 복제 가능한 것을 나타낸 것이다.
실시예
7. 유전자형 2의
HCV
검체로부터의
트런케이트
폼 유전자의 취득
실시예 2에서 만성 간염 환자의 생검법 검체로부터 RT-PCR법에 의하여, 트런케이트 폼 유전자를 검출하였지만, 유전자형 2의 검체로부터는 BP203의 검체를 제외하고, 트런케이트 폼 유전자는 검출할 수 없었다. 이 원인은 검출에 사용한 프라이머의 배열이 유전자형 1의 배열에 기초하여 디자인되었기 때문이라고 생각된다.
그 때문에, 유전자형 2의 배열에 기초하여 프라이머를 디자인하고, 그 프라이머에 의하여, 유전자형 2의 만성 간염 환자의 생검 검체로부터, 트런케이트 폼 유전자의 검출을 시도하였다.
사용하는 프라이머 이외에는 실시예 2의 방법에 따라, 표 1에 나타낸 유전자형 2에 대하여, cDNA 합성, PCR, PCR 프래그먼트의 클로닝, 염기 배열의 결정을 실시하였다. cDNA 합성 및 PCR의 프라이머의 조합은 다음에 나타내는 2조(組)의 조합으로 실시하였다. 프라이머 세트 A 는 cDNA 합성이 2a_HC3293R, 1st PCR이 2a_807S 및 2 a_3216R, 2nd PCR이 2a_HC835S 및 2a_HC3203R로, 프라이머 세트 B는 cDNA 합성이 2a_HC3156R, 1st PCR가 2a_807S 및 2a_3144R, 2nd PCR가 2a_HC835S 및 2a_HC3108R이다. 각각의 프라이머의 배열을 아래와 같이 나타낸다.
2a_HC3293R: TCTCCATTGGGCTGAACACCACAGGCTCCAC (배열 번호 84)
2a_HC3216R: GGGGAGAGGTGGTCATAGATGTAAGTGCCGG (배열 번호 85)
2a_HC3203R: CATAGATGTAAGTGccGGTCCACCTGCCTA (배열 번호 86)
2a_HC3144R: CTCCTGCGAGGTGTCTCACCAGGGTACACA (배열 번호 87)
2a_HC3108R: AGCAGAGCGTGAGCTCTGACGAAGTATGG (배열 번호 88)
2a_HC835S: GGAATCTACCCGGTTGCTCTTTTTCTATCTTC (배열 번호 89)
2a_HC807S: CTGGAAGACGGGATAAATTATGCAACAGGGAA (배열 번호 90)
표 6에 나타내는 바와 같이, 프라이머 세트 A에서 9 검체 중 6 검체에 있어서, 프라이머 세트 B에서 9 검체 중 2 검체에 있어서, 결실을 가지는 유전자가 검출되었다. 유전자의 배열을 결정하였더니, BP203는 987-2999nt, BP235는 1060-2945, BP297는 1024-2966에 약 2 kb의 결실을 가지고, 인프레임으로 결합하고 있는 전형적인 트런케이트 폼 유전자이었다. 결실 부분의 배열을 도 5에 나타낸다.
실시예
8. 만성 활동성 간염 환자 및 간암 환자의 간장 조직으로부터의 트런케이트 폼 유전자의
검출율에
대하여
실시예 2 및 실시예 7의 결과를 정리하면, BP207를 포함하는 24 검체의 만성 활동성 간염 환자의 간장 조직 중 BP207, BP203, BP325, BP297, BP368, BP373의 6검체로부터, E1로부터 NS2에 걸친 약 2 kb의 유전자가 인프레임으로 결실되어 있는 전형적인 트런케이트 폼 유전자가 검출되었다. 한편, 3 검체의 간암 조직에서는 BP1와 BP2로부터 동일한 전형적인 트런케이트 폼 유전자가 검출되었다. 각각의 검출율은 만성 간염 환자에서 25% (6/24), 간암 환자에서 66.6% (2/3)이고, 만성 간염으로부터 간암으로 진행됨에 따라, 검출율이 높아졌다.
또한, 무증후성 캐리어의 혈장으로부터의 20 검체로부터, 트런케이트 폼 유전자의 검출을 시도하였지만, 전형적인 트런케이트 폼 유전자는 검출할 수 없었다. 이 결과는 만성 간염, 간경변, 간암과 C형 간염 바이러스의 감염에 의한 병상의 진행과 트런케이트 폼 유전자의 존재가 어떠한 관련이 있는 것을 나타낸다. 즉, 트런케이트 폼 유전자의 검출은 병상의 진행의 예측 인자로서 유용한 것으로 생각된다.
실시예
9.
극증
간염
환자로 부터의
트런케이트
폼 유전자의 취득
극증 간염 환자의 혈청으로부터 트런케이트 폼 유전자의 취득을 시도하였다. 환자 혈청으로부터 RNA의 추출 시약인 ISOGEN-LS (니폰 진사)를 사용하여 첨부한 지시서에 따라서, RNA를 추출하였다.
이 RNA로부터 실시예 1에서 BP207로부터 유전자를 취득한 것과 동일한 순서로, 제85위에서 9302위까지의, HCV유전자를 취득하였다. pGEM-T easy 벡터에 클로닝한 11 클론의 배열을 결정하였더니, 10 클론은 922위에서 1062위, 1096위에서 1131위 및 1209위에서 2997위까지가 결실된 전형적인 트런케이트 폼 유전자이었다. 또한, 나머지 1 클론은 1096위 내지 1131위의 결실은 없었지만, 922위 내지 1062위 및 1209위 내지 2997위가 결실된 전형적인 트런케이트 폼 유전자이었다.
또한, 이 환자의 RNA로부터 실시예 1에 기재되어 있는 방법으로 5' 비번역 영역과 3' 비번역 영역의 cDNA의 취득을 실시하였다.
다음에, 5' 비번역 영역의 말단의 배열을 결정하기 위하여, 5' RACE법에 의하여 말단 배열의 취득을 시도하였다. 환자 RNA로부터 5'RACE System for Rapid Amplification of cDNA Ends, Version 2.0 (Invitrogen사)의 킷을 사용하여 첨부한 지시서에 따라서, HCV의 5' 비번역 영역의 말단을 취득하였다. cDNA 합성을 위한 안티센스 프라이머는 Chiba-as를 사용하였다. SuperScript II Reverse Transcriptase로 cDNA를 합성하고, S.N.A.P. column으로 정제후, cDNA에 TdT-tailing 반응을 실시하여 dCTP를 부가하였다. 이 cDNA를 킷에 첨부한 5' RACE Abridged Anchor 프라이머 및 KY78 프라이머: 5'-CTCGCAAGCACCCTATCAGCCAGT-3' (배열 번호: 91)로, LA Taq (TAKARA사)를 사용하여, 1st PCR을 실시하였다. 이 PCR 산물의 일부를 주형으로, 킷에 첨부한 UAP 프라이머와 KM2 프라이머: 5'-AGGCATTGAGCGGGTTTATC-3' (배열 번호: 92)로, LA Taq (TAKARA)를 사용하여 2nd PCR을 실시하고, PCR 산물을 얻었다. 이 PCR 산물을 pGEM-T easy 벡터로 클로닝하고, 배열을 결정하였다. 그 결과, 이 트런케이트 폼 유전자는 통상의 풀 렝쓰의 HCV의 유전자와 같이, 제1위의 배열로부터 5' 비번역 영역을 가지고 있었다.
또는 3' 비번역 영역의 말단의 배열을 결정하기 위하여, 3' RACE법에 의하여 말단의 배열의 취득을 시도하였다. 우선 환자의 RNA에 Poly(A)Tailing Kit (Ambion, Inc)을 사용하여, 첨부한 지시서에 따라서, Poly(A)를 부가하였다. 이 Poly(A)의 부가된 RNA로부터, dT-Adp 프라이머 5' -CTAGACTCGAGTCGACATCGTTTTTTTTTTTTTTTTTT-3' (배열 번호: 93)를 사용하여 실시예 1에 기재된 cDNA 합성 순서와 같이 cDNA를 합성하였다. 이 cDNA를 주형으로서 3 UTR-lF프라이머: 5'-ATCTTAGCCCTAGTCACGGC-3' (배열 번호: 94) 및 Adp 프라이머: 5' -CTAGACTCGAGTCGACATCG-3' (배열 번호: 95)로 1st PCR을, XR58F : 5'-CT AGCTGTGAAAGGTCCGTGAGccGCATGA-3' (배열 번호: 96) 및 Adp 프라이머 (배열 번호: 95)로 2nd PCR을 LA Taq (TAKARA)를 사용하여 실시하였다. 이 PCR 산물을 pGEM-T easy 벡터로 클로닝하고, 배열을 결정하였다. 그 결과, 이 트런케이트 폼 유전자의 3' 말단은 통상의 전장(full-length) HCV의 유전자와 동일한 3'말단을 가지고 있었다.
본 발명의 활용예로서 레프리콘 복제계는 C형 간염 바이러스의 치료약의 개발을 위한 약제 스크리닝에 이용할 수 있다. 이 계는 치료약의 약효 평가, 제조에도 이용 가능하다. 또한 TF 게놈의 검출계는 C형 간염의 병태 마커로서도 이용 가능하고, 진단약으로서 유용하다.
<110> Advanced Life Science Institute, Inc.
<120> HCV RNA HAVING NOVEL SEQUENCE
<130> CH-067087
<150> JP 2004-188543
<151> 2004-06-25
<150> JP 2004-190144
<151> 2004-06-28
<150> JP 2004-277677
<151> 2004-09-24
<160> 96
<170> PatentIn version 3.1
<210> 1
<211> 7785
<212> DNA
<213> Hepatitis C virus
<220>
<221> 5'UTR
<222> (1)..(341)
<220>
<221> gene
<222> (342)..(7559)
<220>
<221> CDS
<222> (342)..(950)
<223> core protein
<220>
<221> CDS
<222> (951)..(1607)
<223> truncated E1-NS2 fusion protein
<220>
<221> CDS
<222> (1608)..(3500)
<223> NS3 protein
<220>
<221> CDS
<222> (3501)..(3662)
<223> NS4A protein
<220>
<221> CDS
<222> (3663)..(4442)
<223> NS4B protein
<220>
<221> CDS
<222> (4443)..(5786)
<223> NS5A protein
<220>
<221> CDS
<222> (5787)..(7559)
<223> NS5B protein
<220>
<221> 3'UTR
<222> (7560)..(7775)
<400> 1
gccagccccc tgatgggggc gacactccac catagatcac tcccctgtga ggaactactg 60
tcttcacgca gaaagcgtct agccatggcg ttagtatgag tgtcgtgcag cctccaggac 120
cccccctccc gggagagcca tagtggtctg cggaaccggt gagtacaccg gaattgccag 180
gacgaccggg tcctttcttg gatcaacccg ctcaatgcct ggagatttgg gcgtgccccc 240
gcgagactgc tagccgagta gtgttgggtc gcgaaaggcc ttgtggtact gcctgatagg 300
gtgcttgcga gtgccccggg aggtctcgta gaccgtgcac c atg agc acg 350
Met Ser Thr
1
aat cct aaa cct caa aga aaa acc aaa cct aac acc aac cgc cgc cca 398
Asn Pro Lys Pro Gln Arg Lys Thr Lys Pro Asn Thr Asn Arg Arg Pro
5 10 15
cag gac gtc aag ttc ccg ggc ggt ggt cag atc gtt ggt gga gtt tac 446
Gln Asp Val Lys Phe Pro Gly Gly Gly Gln Ile Val Gly Gly Val Tyr
20 25 30 35
ctg ttg ccg cgc agg ggc ccc cgg ttg ggt gtg cgc gcg act agg aag 494
Leu Leu Pro Arg Arg Gly Pro Arg Leu Gly Val Arg Ala Thr Arg Lys
40 45 50
act tcc gag cgg tcg caa cct cgt gga agg cga caa cct atc ccc aag 542
Thr Ser Glu Arg Ser Gln Pro Arg Gly Arg Arg Gln Pro Ile Pro Lys
55 60 65
gct cgc cgg ccc gag ggc agg gcc tgg gct cag ccc ggg tac ccc tgg 590
Ala Arg Arg Pro Glu Gly Arg Ala Trp Ala Gln Pro Gly Tyr Pro Trp
70 75 80
ccc ctc tat ggc aat gag ggc tta ggg tgg gca gga tgg ctc ctg tca 638
Pro Leu Tyr Gly Asn Glu Gly Leu Gly Trp Ala Gly Trp Leu Leu Ser
85 90 95
ccc cgc ggc tct cgg cct agt tgg ggc ccc acg gac ccc cgg cgt agg 686
Pro Arg Gly Ser Arg Pro Ser Trp Gly Pro Thr Asp Pro Arg Arg Arg
100 105 110 115
tcg cgt aac ttg ggt aag gtc atc gat acc ctc aca tgc ggc ttc gcc 734
Ser Arg Asn Leu Gly Lys Val Ile Asp Thr Leu Thr Cys Gly Phe Ala
120 125 130
gac ctc atg ggg tac att ccg ctc gtc ggt gcc ccc cta ggg ggc gct 782
Asp Leu Met Gly Tyr Ile Pro Leu Val Gly Ala Pro Leu Gly Gly Ala
135 140 145
gcc agg gcc cta gca cat ggt gtc cgg gtt ctg gag gac ggc gtg aac 830
Ala Arg Ala Leu Ala His Gly Val Arg Val Leu Glu Asp Gly Val Asn
150 155 160
tac gca aca ggg aat ttg ccc ggt tgc tct ttc tct atc ttc ctc ttg 878
Tyr Ala Thr Gly Asn Leu Pro Gly Cys Ser Phe Ser Ile Phe Leu Leu
165 170 175
gct ctg ctg tcc tgt ctg acc atc cca gct tcc gct tat gaa gtg cgc 926
Ala Leu Leu Ser Cys Leu Thr Ile Pro Ala Ser Ala Tyr Glu Val Arg
180 185 190 195
aac gtg tcc gga ata tac cat gtc acg aac gac tgc tcc aac tca agc 974
Asn Val Ser Gly Ile Tyr His Val Thr Asn Asp Cys Ser Asn Ser Ser
200 1 5
att gtg tat gag gca gcg gac gtg atc atg cat acc ccc ggg tgc gtg 1022
Ile Val Tyr Glu Ala Ala Asp Val Ile Met His Thr Pro Gly Cys Val
10 15 20
ccc tgt gtt cgg gag ggt aac gcc tcc cgc tgt tgg gca gcg ctc act 1070
Pro Cys Val Arg Glu Gly Asn Ala Ser Arg Cys Trp Ala Ala Leu Thr
25 30 35 40
ccc acg ctc gcg gtc ggg aat gcc agc gtc ccc act aag gca ata cgg 1118
Pro Thr Leu Ala Val Gly Asn Ala Ser Val Pro Thr Lys Ala Ile Arg
45 50 55
cgc cac gtc gat ctg ctt gtt ggg acg gct gct ttc tgc tcc gcc atg 1166
Arg His Val Asp Leu Leu Val Gly Thr Ala Ala Phe Cys Ser Ala Met
60 65 70
tac gtg ggg gat ctc tgc gga tac atc acc aaa ctc ctg ctc gcc aca 1214
Tyr Val Gly Asp Leu Cys Gly Tyr Ile Thr Lys Leu Leu Leu Ala Thr
75 80 85
ctc ggt ctg ctc atg gtg ctc cag gct gcc ata gct agg gtg ccg tac 1262
Leu Gly Leu Leu Met Val Leu Gln Ala Ala Ile Ala Arg Val Pro Tyr
90 95 100
ttc gta cgc act cag ggg ctc att cgt gtg tgt atg tta gtg cgg aaa 1310
Phe Val Arg Thr Gln Gly Leu Ile Arg Val Cys Met Leu Val Arg Lys
105 110 115 120
gtc gcc ggg ggt cac tat gcc cag atg gcc ttc atc aag ctg gcc gca 1358
Val Ala Gly Gly His Tyr Ala Gln Met Ala Phe Ile Lys Leu Ala Ala
125 130 135
ctg aca ggt aca tac gtt tat gac cat ctt act cca ctg cga gat tgg 1406
Leu Thr Gly Thr Tyr Val Tyr Asp His Leu Thr Pro Leu Arg Asp Trp
140 145 150
gcc cat gcg ggc ctg cga gac ctt gcg gtg gca gtg gag ccc gtc atc 1454
Ala His Ala Gly Leu Arg Asp Leu Ala Val Ala Val Glu Pro Val Ile
155 160 165
ttc tct gac atg gag acc aag atc atc acc tgg gga gca gac acc gcg 1502
Phe Ser Asp Met Glu Thr Lys Ile Ile Thr Trp Gly Ala Asp Thr Ala
170 175 180
gcg tgt ggg gat att att ttg ggt ctg ccc gtc tcc gcc cga agg ggg 1550
Ala Cys Gly Asp Ile Ile Leu Gly Leu Pro Val Ser Ala Arg Arg Gly
185 190 195 200
agg gag ata ctt ctg ggg ccg gcc gat agt ctt gag ggg cgg ggg tgg 1598
Arg Glu Ile Leu Leu Gly Pro Ala Asp Ser Leu Glu Gly Arg Gly Trp
205 210 215
cga ctc ctt gcg ccc atc acg gct tat tct caa cag acg cgg 1640
Arg Leu Leu Ala Pro Ile Thr Ala Tyr Ser Gln Gln Thr Arg
1 5 10
ggt tta ctc ggc tgc atc atc act agt ctc acg ggc cgg gac aag aac 1688
Gly Leu Leu Gly Cys Ile Ile Thr Ser Leu Thr Gly Arg Asp Lys Asn
15 20 25
cag gtc gag ggg gag gtt caa gtg gtt tcg acc gcg aca caa tcc ttc 1736
Gln Val Glu Gly Glu Val Gln Val Val Ser Thr Ala Thr Gln Ser Phe
30 35 40
ctg gcg acc tgt gtc aac ggc gtg tgt tgg act gtc tat cat ggt gcc 1784
Leu Ala Thr Cys Val Asn Gly Val Cys Trp Thr Val Tyr His Gly Ala
45 50 55
ggc tca aaa acc cta gcc ggc cca aaa ggg ccg att atc caa atg tat 1832
Gly Ser Lys Thr Leu Ala Gly Pro Lys Gly Pro Ile Ile Gln Met Tyr
60 65 70 75
acc aat gta gac cag gac ctt gtt ggc tgg caa gcg ccc ccc ggg gcg 1880
Thr Asn Val Asp Gln Asp Leu Val Gly Trp Gln Ala Pro Pro Gly Ala
80 85 90
cgt tcc ttg aca cca tgc acc tgc ggc agc tcg gac ctt tac ctg gtt 1928
Arg Ser Leu Thr Pro Cys Thr Cys Gly Ser Ser Asp Leu Tyr Leu Val
95 100 105
acg aga cat gct gac gtc att ccg gtg cgc cgg cga ggt gac ggt agg 1976
Thr Arg His Ala Asp Val Ile Pro Val Arg Arg Arg Gly Asp Gly Arg
110 115 120
ggg agc cta ctc tcc ccc aaa ccc atc tcc tac ttg aaa ggc tct tcg 2024
Gly Ser Leu Leu Ser Pro Lys Pro Ile Ser Tyr Leu Lys Gly Ser Ser
125 130 135
ggt ggt ccg ctg ctc tgc cct tcg ggg cac gct gtg ggc atc ttt cgg 2072
Gly Gly Pro Leu Leu Cys Pro Ser Gly His Ala Val Gly Ile Phe Arg
140 145 150 155
gct gct gtg tgc acc cgg ggg att gcg aag gct gtg gac ttt gta ccc 2120
Ala Ala Val Cys Thr Arg Gly Ile Ala Lys Ala Val Asp Phe Val Pro
160 165 170
gtt gag tgt atg gaa act act atg cgg tct ccg gtc ttc aca gac aac 2168
Val Glu Cys Met Glu Thr Thr Met Arg Ser Pro Val Phe Thr Asp Asn
175 180 185
tcg tcc ccc ccg acc gta ccg cag aca ttc caa gtg gcc cat cta cac 2216
Ser Ser Pro Pro Thr Val Pro Gln Thr Phe Gln Val Ala His Leu His
190 195 200
gct ccc act ggc agc ggc aaa agc acc aaa gta ccg gct gca tat gcg 2264
Ala Pro Thr Gly Ser Gly Lys Ser Thr Lys Val Pro Ala Ala Tyr Ala
205 210 215
gcc caa ggg tat aag gta ctc gtc ctg aac ccg tcc gtt gcc gcc acc 2312
Ala Gln Gly Tyr Lys Val Leu Val Leu Asn Pro Ser Val Ala Ala Thr
220 225 230 235
ctg agt ttt ggg gcg tat atg tcc aag gca cat ggt gtc gac cct aac 2360
Leu Ser Phe Gly Ala Tyr Met Ser Lys Ala His Gly Val Asp Pro Asn
240 245 250
atc aga act ggg atg agg acc atc acc aca ggc gct ccc atc acg tac 2408
Ile Arg Thr Gly Met Arg Thr Ile Thr Thr Gly Ala Pro Ile Thr Tyr
255 260 265
tcc acc tat ggc aag ttc ctt gcc gac ggt ggt tgt tcc ggg ggc gcc 2456
Ser Thr Tyr Gly Lys Phe Leu Ala Asp Gly Gly Cys Ser Gly Gly Ala
270 275 280
tat gac atc ata tta tgt gat gag tgc cac tca act gac tca act act 2504
Tyr Asp Ile Ile Leu Cys Asp Glu Cys His Ser Thr Asp Ser Thr Thr
285 290 295
gtt tta ggc atc ggc aca gtt ctg gac caa gcg gag acg gct gga gcg 2552
Val Leu Gly Ile Gly Thr Val Leu Asp Gln Ala Glu Thr Ala Gly Ala
300 305 310 315
cga ctc gtc gtg ctc gcc acc gct acg cct cca gga tcg gtc acc gtg 2600
Arg Leu Val Val Leu Ala Thr Ala Thr Pro Pro Gly Ser Val Thr Val
320 325 330
cca cac ccc aat atc gag gaa gtg gct ctg tcc aac act gga gag atc 2648
Pro His Pro Asn Ile Glu Glu Val Ala Leu Ser Asn Thr Gly Glu Ile
335 340 345
ccc ttc tat ggc aaa gcc atc cct atc gag gtc atc aag ggg gga agg 2696
Pro Phe Tyr Gly Lys Ala Ile Pro Ile Glu Val Ile Lys Gly Gly Arg
350 355 360
cat ctc att ttc tgt cat tcc aag aag aaa tgc gac gag ctt gct gca 2744
His Leu Ile Phe Cys His Ser Lys Lys Lys Cys Asp Glu Leu Ala Ala
365 370 375
aag ttg tca ggt ctc gga ctc aat gct gta gtg tat tac cgg ggc ctt 2792
Lys Leu Ser Gly Leu Gly Leu Asn Ala Val Val Tyr Tyr Arg Gly Leu
380 385 390 395
gac gtg tcc gtc ata cct acc agc gga gac gtc gtt gtc gtg gca aca 2840
Asp Val Ser Val Ile Pro Thr Ser Gly Asp Val Val Val Val Ala Thr
400 405 410
gac gct cta atg acg ggc tat acc ggt gac ttt gac tca gtg atc gac 2888
Asp Ala Leu Met Thr Gly Tyr Thr Gly Asp Phe Asp Ser Val Ile Asp
415 420 425
tgt aat aca tgt gtc act cag aca gtc gac ttc agc ttg gat cct acc 2936
Cys Asn Thr Cys Val Thr Gln Thr Val Asp Phe Ser Leu Asp Pro Thr
430 435 440
ttc acc att gac acg acg acc gta ccc caa gac gcg gta tca cgc tcg 2984
Phe Thr Ile Asp Thr Thr Thr Val Pro Gln Asp Ala Val Ser Arg Ser
445 450 455
cag cgg cga ggt agg act ggc agg ggt agg gga ggc atc tac agg ttt 3032
Gln Arg Arg Gly Arg Thr Gly Arg Gly Arg Gly Gly Ile Tyr Arg Phe
460 465 470 475
gtg act cca gga gaa cgg ccc tcg ggc atg ttc gat tct tcg gtc ttg 3080
Val Thr Pro Gly Glu Arg Pro Ser Gly Met Phe Asp Ser Ser Val Leu
480 485 490
tgt gag tgt tat gac gcg ggc tgt gct tgg tat gag ctc acg ccc gcc 3128
Cys Glu Cys Tyr Asp Ala Gly Cys Ala Trp Tyr Glu Leu Thr Pro Ala
495 500 505
gaa acc acg gtt agg ttg cgg gct tac ctt aat aca cca ggg ttg ccc 3176
Glu Thr Thr Val Arg Leu Arg Ala Tyr Leu Asn Thr Pro Gly Leu Pro
510 515 520
gtc tgt cag gac cac ctg gag ttc tgg gag ggt gtc ttc aca ggc ctc 3224
Val Cys Gln Asp His Leu Glu Phe Trp Glu Gly Val Phe Thr Gly Leu
525 530 535
acc cac ata gaa gct cat ctc ttg tcc cag act aag gat gca gga gac 3272
Thr His Ile Glu Ala His Leu Leu Ser Gln Thr Lys Asp Ala Gly Asp
540 545 550 555
aat tac ccc tac ctg gta gcg tac caa gcc acg gtg tgc gcc agg gct 3320
Asn Tyr Pro Tyr Leu Val Ala Tyr Gln Ala Thr Val Cys Ala Arg Ala
560 565 570
cag gcc cca cct ccg tct tgg gat caa atg tgg aag tgt ctc atg cgg 3368
Gln Ala Pro Pro Pro Ser Trp Asp Gln Met Trp Lys Cys Leu Met Arg
575 580 585
ctt aaa cct acg ctg cac ggg cca aca ccc ctg ctg tat agg cta gga 3416
Leu Lys Pro Thr Leu His Gly Pro Thr Pro Leu Leu Tyr Arg Leu Gly
590 595 600
gcc gtc cag aat gag gtc acc ctt aca cac ccc ata acc aaa tac atc 3464
Ala Val Gln Asn Glu Val Thr Leu Thr His Pro Ile Thr Lys Tyr Ile
605 610 615
atc aca tgc atg tca gct gac ctg gag gtt gtc act agc acc tgg 3509
Ile Thr Cys Met Ser Ala Asp Leu Glu Val Val Thr Ser Thr Trp
620 625 630 1
gtg cta gta ggc gga gtc ctt gca gct ttg gcc gca tac tgc ctg aca 3557
Val Leu Val Gly Gly Val Leu Ala Ala Leu Ala Ala Tyr Cys Leu Thr
5 10 15
aca ggc agt gtg gtc att gtg ggc agg atc atc ttg tcc ggg aag ccg 3605
Thr Gly Ser Val Val Ile Val Gly Arg Ile Ile Leu Ser Gly Lys Pro
20 25 30 35
gct gtc atc ccc gac agg gaa gtc ctc tac cag gcg ttc gat gaa atg 3653
Ala Val Ile Pro Asp Arg Glu Val Leu Tyr Gln Ala Phe Asp Glu Met
40 45 50
gag gag tgt gcc tca cac ctc cct tac atc gaa cag gga atg 3695
Glu Glu Cys Ala Ser His Leu Pro Tyr Ile Glu Gln Gly Met
1 5 10
cag ctc gcc gag caa ttc aag cag aaa gcg ctc ggg ctg cta caa acg 3743
Gln Leu Ala Glu Gln Phe Lys Gln Lys Ala Leu Gly Leu Leu Gln Thr
15 20 25
gcc act aag caa gcg gag gct gct gct ccc atg gtg gag tcc aaa tgg 3791
Ala Thr Lys Gln Ala Glu Ala Ala Ala Pro Met Val Glu Ser Lys Trp
30 35 40
cac gcc ctt gag gct ttc tgg gcg aag cac atg tgg aac ttc atc agc 3839
His Ala Leu Glu Ala Phe Trp Ala Lys His Met Trp Asn Phe Ile Ser
45 50 55
ggg ata cag tac tta gca ggc ttg tcc act ctg cct ggg aac ccc gca 3887
Gly Ile Gln Tyr Leu Ala Gly Leu Ser Thr Leu Pro Gly Asn Pro Ala
60 65 70 75
ata gca tca ctg atg gca ttc aca gcc tct gtc acc agc ccg ctt acc 3935
Ile Ala Ser Leu Met Ala Phe Thr Ala Ser Val Thr Ser Pro Leu Thr
80 85 90
acc cag agc acc ctc ttg ttt aac atc ttg ggg gga tgg gtg gct gcc 3983
Thr Gln Ser Thr Leu Leu Phe Asn Ile Leu Gly Gly Trp Val Ala Ala
95 100 105
caa ctc gct ccc ccc ggt gct gct tcg gct ttt gtg ggc gcc gga att 4031
Gln Leu Ala Pro Pro Gly Ala Ala Ser Ala Phe Val Gly Ala Gly Ile
110 115 120
gcc ggc gcg gcc gta ggc agc ata ggc ctt ggg aag gtg ctt gtg gac 4079
Ala Gly Ala Ala Val Gly Ser Ile Gly Leu Gly Lys Val Leu Val Asp
125 130 135
att ctg gct gga tat ggg gca ggg gtg gca ggc gca ctc gtg gct ttt 4127
Ile Leu Ala Gly Tyr Gly Ala Gly Val Ala Gly Ala Leu Val Ala Phe
140 145 150 155
aag atc atg agc ggc gat atg ccc tcc acc gag gac ctg gtt aac ttg 4175
Lys Ile Met Ser Gly Asp Met Pro Ser Thr Glu Asp Leu Val Asn Leu
160 165 170
ctt cct gcc atc ctc tct cct ggt gcc ctg gtc gtc ggg gtc gtg tgc 4223
Leu Pro Ala Ile Leu Ser Pro Gly Ala Leu Val Val Gly Val Val Cys
175 180 185
gca gca ata ctg cgt cgg cac gtg ggc ccg gga gag ggg gct gtg cag 4271
Ala Ala Ile Leu Arg Arg His Val Gly Pro Gly Glu Gly Ala Val Gln
190 195 200
tgg atg aac cgg ctg ata gcg ttc gct tcc cgg ggt aac cac atc tcc 4319
Trp Met Asn Arg Leu Ile Ala Phe Ala Ser Arg Gly Asn His Ile Ser
205 210 215
ccc acg cac tat gtg cct gag agc gac gcc gca gcg cgt gtt acc cag 4367
Pro Thr His Tyr Val Pro Glu Ser Asp Ala Ala Ala Arg Val Thr Gln
220 225 230 235
att ctt tcc aac ctt acc atc act cag ctg ctg aag agg ctt cac caa 4415
Ile Leu Ser Asn Leu Thr Ile Thr Gln Leu Leu Lys Arg Leu His Gln
240 245 250
tgg atc aat gag gac tgc tcc acg cca tgc tcc ggc tcg tgg 4457
Trp Ile Asn Glu Asp Cys Ser Thr Pro Cys Ser Gly Ser Trp
255 260 1 5
ctt agg gat gtt tgg gac tgg ata tgc acg gtg ttg gct gac ttc aag 4505
Leu Arg Asp Val Trp Asp Trp Ile Cys Thr Val Leu Ala Asp Phe Lys
10 15 20
acc tgg ctc cag tcc aag ctc ctg ccg cgg ttg ccg gga gtc cct ttc 4553
Thr Trp Leu Gln Ser Lys Leu Leu Pro Arg Leu Pro Gly Val Pro Phe
25 30 35
ttc tca tgc caa cgc ggg tac aag gga gtt tgg cgg ggg gat ggc atg 4601
Phe Ser Cys Gln Arg Gly Tyr Lys Gly Val Trp Arg Gly Asp Gly Met
40 45 50
atg cat acc acc tgc cca tgt gga gca caa atc acc gga cat gtc aaa 4649
Met His Thr Thr Cys Pro Cys Gly Ala Gln Ile Thr Gly His Val Lys
55 60 65
aat ggt tcc atg agg atc gct ggg cct aga acc tgc agc aac acg tgg 4697
Asn Gly Ser Met Arg Ile Ala Gly Pro Arg Thr Cys Ser Asn Thr Trp
70 75 80 85
cat ggg acg ttc ccc atc aac gca tac acc acg ggc ccc tgc aca ccc 4745
His Gly Thr Phe Pro Ile Asn Ala Tyr Thr Thr Gly Pro Cys Thr Pro
90 95 100
tcc ccg gcg ccc aac tat tcc aag gcg cta tgg cgg gtg gct gct gag 4793
Ser Pro Ala Pro Asn Tyr Ser Lys Ala Leu Trp Arg Val Ala Ala Glu
105 110 115
gag tac gtg gaa gtt acg cga gtg gga gac ttc cac tac gtg acg ggc 4841
Glu Tyr Val Glu Val Thr Arg Val Gly Asp Phe His Tyr Val Thr Gly
120 125 130
atg acc act gac aac ata aaa tgc cca tgc cag gtt ccg gcc ccc gaa 4889
Met Thr Thr Asp Asn Ile Lys Cys Pro Cys Gln Val Pro Ala Pro Glu
135 140 145
ttc ttc aca gaa ctg gat gga gtg cgg ttg cac agg tac gct ccg gtg 4937
Phe Phe Thr Glu Leu Asp Gly Val Arg Leu His Arg Tyr Ala Pro Val
150 155 160 165
tgc aaa ccc ctc cta cgg gag gag gtt tta ttc cag gtt ggg tgc aac 4985
Cys Lys Pro Leu Leu Arg Glu Glu Val Leu Phe Gln Val Gly Cys Asn
170 175 180
caa tac ctg gtc ggg tca cag ctt cca tgc gag ccc gaa ccg gac gta 5033
Gln Tyr Leu Val Gly Ser Gln Leu Pro Cys Glu Pro Glu Pro Asp Val
185 190 195
gca gtg ctc act tcc atg ctt gcc gac ccc tcc cac att aca gca gag 5081
Ala Val Leu Thr Ser Met Leu Ala Asp Pro Ser His Ile Thr Ala Glu
200 205 210
aca gct aag cgt agg ttg gcc agg ggg tct ccc ccc tcc ttg gcc agc 5129
Thr Ala Lys Arg Arg Leu Ala Arg Gly Ser Pro Pro Ser Leu Ala Ser
215 220 225
tcg tca gct agc cag ttg tct gca cct tct ttg aag gcg aca tgc aat 5177
Ser Ser Ala Ser Gln Leu Ser Ala Pro Ser Leu Lys Ala Thr Cys Asn
230 235 240 245
acc cat cac cgc tcc ccg gac ctt gac ctc atc gag gcc aac ctc ctg 5225
Thr His His Arg Ser Pro Asp Leu Asp Leu Ile Glu Ala Asn Leu Leu
250 255 260
tgg tgg cag gag aag ggt gga aac atc acc cgt gtg gag tca gag aac 5273
Trp Trp Gln Glu Lys Gly Gly Asn Ile Thr Arg Val Glu Ser Glu Asn
265 270 275
aag gtg ata atc atg gac tct ttc gat ccg ctt cga gcg gag gag gat 5321
Lys Val Ile Ile Met Asp Ser Phe Asp Pro Leu Arg Ala Glu Glu Asp
280 285 290
gag agg gaa ata tct gtt gcg gcg gag atc ctg cgg caa tcc agg aaa 5369
Glu Arg Glu Ile Ser Val Ala Ala Glu Ile Leu Arg Gln Ser Arg Lys
295 300 305
ttc ccc cca gcg ttg ccc gta tgg gca cgc ccg gat tat aac cct cca 5417
Phe Pro Pro Ala Leu Pro Val Trp Ala Arg Pro Asp Tyr Asn Pro Pro
310 315 320 325
cta cta gag ccc tgg aag gac ccg gac tat gtc cct ccg gtg gta cat 5465
Leu Leu Glu Pro Trp Lys Asp Pro Asp Tyr Val Pro Pro Val Val His
330 335 340
ggg tgc ccg ctg ccg cct gcc aag act cct cca ata cca cct cca cgg 5513
Gly Cys Pro Leu Pro Pro Ala Lys Thr Pro Pro Ile Pro Pro Pro Arg
345 350 355
agg aaa agg acg gtt gtc ctg aca gag tcc acc gtg tct tct gtt ctg 5561
Arg Lys Arg Thr Val Val Leu Thr Glu Ser Thr Val Ser Ser Val Leu
360 365 370
gcg gag ctc act act aag acc ttc ggc agc tcc gaa tcg tcg gcc gct 5609
Ala Glu Leu Thr Thr Lys Thr Phe Gly Ser Ser Glu Ser Ser Ala Ala
375 380 385
gat agc ggc atg gcg acc gcc cct cct gac cag gcc tcc ggc gac ggc 5657
Asp Ser Gly Met Ala Thr Ala Pro Pro Asp Gln Ala Ser Gly Asp Gly
390 395 400 405
gac aaa gag tcc gac gtt gag tcg tac tcc tcc atg ccc ccc ctt gag 5705
Asp Lys Glu Ser Asp Val Glu Ser Tyr Ser Ser Met Pro Pro Leu Glu
410 415 420
gga gag ccg ggg gac ccc gat ctc agc gac ggg tct tgg tct acc gtg 5753
Gly Glu Pro Gly Asp Pro Asp Leu Ser Asp Gly Ser Trp Ser Thr Val
425 430 435
agc gag gag gct ggt gag gac gtc gtc tgc tgt tca atg tcc tat aca 5801
Ser Glu Glu Ala Gly Glu Asp Val Val Cys Cys Ser Met Ser Tyr Thr
440 445 1 5
tgg aca ggc gcc ttg atc aca cca tgc gct gcg gag gaa agc aag ctg 5849
Trp Thr Gly Ala Leu Ile Thr Pro Cys Ala Ala Glu Glu Ser Lys Leu
10 15 20
ccc atc aac gcg ttg agc aac tct ttg ctg cgt cat cac aac atg gtc 5897
Pro Ile Asn Ala Leu Ser Asn Ser Leu Leu Arg His His Asn Met Val
25 30 35
tat gcc aca aca tct cgc agc gca agc cag cgg cag aag aag gtc acc 5945
Tyr Ala Thr Thr Ser Arg Ser Ala Ser Gln Arg Gln Lys Lys Val Thr
40 45 50
ttt gac aga ctg cag gtc ctg gat gat cac tac cgg gac gtg ctt aag 5993
Phe Asp Arg Leu Gln Val Leu Asp Asp His Tyr Arg Asp Val Leu Lys
55 60 65
gag atg aag gcg aag gcg tcc aca gtt aag gct aaa ctt ctc tct gta 6041
Glu Met Lys Ala Lys Ala Ser Thr Val Lys Ala Lys Leu Leu Ser Val
70 75 80 85
gaa gaa gcc tgc aag ctg acg ccc cca cat tcg gcc aaa tct aag ttt 6089
Glu Glu Ala Cys Lys Leu Thr Pro Pro His Ser Ala Lys Ser Lys Phe
90 95 100
ggt tat ggg gca aag gac gtc cgg aac cta tcc agc agg gcc gtt aac 6137
Gly Tyr Gly Ala Lys Asp Val Arg Asn Leu Ser Ser Arg Ala Val Asn
105 110 115
cac att cgc tcc gtg tgg aag gac ttg ctg gaa gac act gaa aca cca 6185
His Ile Arg Ser Val Trp Lys Asp Leu Leu Glu Asp Thr Glu Thr Pro
120 125 130
att gac acc acc atc atg gca aaa agt gag gtt ttc tgc atc caa cca 6233
Ile Asp Thr Thr Ile Met Ala Lys Ser Glu Val Phe Cys Ile Gln Pro
135 140 145
gag aaa gga ggc cgc aag cca gct cgc ctt atc gtg ttc cca gac ctg 6281
Glu Lys Gly Gly Arg Lys Pro Ala Arg Leu Ile Val Phe Pro Asp Leu
150 155 160 165
gga gtc cgt gta tgc gag aaa atg gcc ctc tac gac gtg gtc tcc acc 6329
Gly Val Arg Val Cys Glu Lys Met Ala Leu Tyr Asp Val Val Ser Thr
170 175 180
ctt cct cag gcc gtg atg ggc tcc tca tat gga ttc caa tac tct cct 6377
Leu Pro Gln Ala Val Met Gly Ser Ser Tyr Gly Phe Gln Tyr Ser Pro
185 190 195
ggg cag cga gtc gag ttc ctg gta aat gcc tgg aaa tca aag aaa aac 6425
Gly Gln Arg Val Glu Phe Leu Val Asn Ala Trp Lys Ser Lys Lys Asn
200 205 210
ccc atg ggc ttc tca tat gac act cgc tgt ttc gac tca acg gtc act 6473
Pro Met Gly Phe Ser Tyr Asp Thr Arg Cys Phe Asp Ser Thr Val Thr
215 220 225
gag agt gac atc cgc gtt gag gag tca atc tac caa tgt tgt gac ttg 6521
Glu Ser Asp Ile Arg Val Glu Glu Ser Ile Tyr Gln Cys Cys Asp Leu
230 235 240 245
gcc ccc gaa gcc aga cag gcc ata aag tcg ctc aca gag cgg ctc tat 6569
Ala Pro Glu Ala Arg Gln Ala Ile Lys Ser Leu Thr Glu Arg Leu Tyr
250 255 260
atc ggg ggt ccc ctg act aat tca aaa ggg caa aac tgc ggt tat cgc 6617
Ile Gly Gly Pro Leu Thr Asn Ser Lys Gly Gln Asn Cys Gly Tyr Arg
265 270 275
cgg tgt cgc gcc agc ggc gtg ctg acg act agc tgc ggt aat acc ctc 6665
Arg Cys Arg Ala Ser Gly Val Leu Thr Thr Ser Cys Gly Asn Thr Leu
280 285 290
aca tgt tac ttg aag gcc gct gcg gcc tgt cga gct gcg aag ctc cag 6713
Thr Cys Tyr Leu Lys Ala Ala Ala Ala Cys Arg Ala Ala Lys Leu Gln
295 300 305
gac tgc acg atg ctc gtg aac gga gac gac cta gtc gtt atc tgt gag 6761
Asp Cys Thr Met Leu Val Asn Gly Asp Asp Leu Val Val Ile Cys Glu
310 315 320 325
agt gcg gga acc caa gag gat gcg gcg aac cta cga gtc ttc acg gag 6809
Ser Ala Gly Thr Gln Glu Asp Ala Ala Asn Leu Arg Val Phe Thr Glu
330 335 340
gct atg act agg tac tct gct ccc cca ggg gac tcg cct caa cca gaa 6857
Ala Met Thr Arg Tyr Ser Ala Pro Pro Gly Asp Ser Pro Gln Pro Glu
345 350 355
tac gac ttg gag ttg ata aca tct tgc tcc tcc aat gtg tcg gtc gcg 6905
Tyr Asp Leu Glu Leu Ile Thr Ser Cys Ser Ser Asn Val Ser Val Ala
360 365 370
cac gat gcg tct ggc aag agg gtg tac tac ctc act cgt gac ccc acc 6953
His Asp Ala Ser Gly Lys Arg Val Tyr Tyr Leu Thr Arg Asp Pro Thr
375 380 385
acc ccc ctt gca cgg gct gcg tgg gag aca gct aga cac act cca gtc 7001
Thr Pro Leu Ala Arg Ala Ala Trp Glu Thr Ala Arg His Thr Pro Val
390 395 400 405
aac tcc tgg cta ggc aat atc atc atg tat gcg ccc acc tta tgg gca 7049
Asn Ser Trp Leu Gly Asn Ile Ile Met Tyr Ala Pro Thr Leu Trp Ala
410 415 420
agg atg att ctg atg acc cac ttc ttc tcc atc ctt cta gct cag gaa 7097
Arg Met Ile Leu Met Thr His Phe Phe Ser Ile Leu Leu Ala Gln Glu
425 430 435
caa ctt gga aaa gcc ctg gat tgc cag atc tat ggg gcc tgt tac tcc 7145
Gln Leu Gly Lys Ala Leu Asp Cys Gln Ile Tyr Gly Ala Cys Tyr Ser
440 445 450
att gag cca ctt gat cta cct cag atc att gaa cga ctc cac ggt ctt 7193
Ile Glu Pro Leu Asp Leu Pro Gln Ile Ile Glu Arg Leu His Gly Leu
455 460 465
agc gca ttt tca ctc cat agt tac tct cca ggt gag atc aat agg gtg 7241
Ser Ala Phe Ser Leu His Ser Tyr Ser Pro Gly Glu Ile Asn Arg Val
470 475 480 485
gct tca tgc ctc agg aaa ctt ggg gta cca ccc ttg cga gtc tgg aga 7289
Ala Ser Cys Leu Arg Lys Leu Gly Val Pro Pro Leu Arg Val Trp Arg
490 495 500
cat cgg gcc aga agt gtc cgc gct aag cta ctg tcc cag ggg ggg agg 7337
His Arg Ala Arg Ser Val Arg Ala Lys Leu Leu Ser Gln Gly Gly Arg
505 510 515
gcc gcc act tgt ggc aag tac ctc ttc aac tgg gca gta aag acc aag 7385
Ala Ala Thr Cys Gly Lys Tyr Leu Phe Asn Trp Ala Val Lys Thr Lys
520 525 530
ctt aaa ctc act cca atc ccg gct gcg tcc cag ttg gat tta tcc agc 7433
Leu Lys Leu Thr Pro Ile Pro Ala Ala Ser Gln Leu Asp Leu Ser Ser
535 540 545
tgg ttc gtt gct ggt tac agc ggg gga gac ata tat cac agc ctg tct 7481
Trp Phe Val Ala Gly Tyr Ser Gly Gly Asp Ile Tyr His Ser Leu Ser
550 555 560 565
cgt gcc cga ccc cgc tgg ttc atg tgg tgc cta ctc cta ctt tct gta 7529
Arg Ala Arg Pro Arg Trp Phe Met Trp Cys Leu Leu Leu Leu Ser Val
570 575 580
ggg gta ggc atc tat cta ctc ccc aac cga t gaacggggag ctaaacactc 7580
Gly Val Gly Ile Tyr Leu Leu Pro Asn Arg
585 590
caggccaata ggccatcctg tttttttttt cttttttttt tttccttttt tttttttttt 7640
tttttttttt cctttttttt ttttttcttt tttccttttc tttcctttgg tggctccatc 7700
ttagccctag tcacggctag ctgtgaaagg tccgtgagcc gcttgactgc agagagtgct 7760
gatactggcc tctctgcaga tcatg 7785
<210> 2
<211> 203
<212> PRT
<213> Hepatitis C virus
<400> 2
Met Ser Thr Asn Pro Lys Pro Gln Arg Lys Thr Lys Pro Asn Thr Asn
1 5 10 15
Arg Arg Pro Gln Asp Val Lys Phe Pro Gly Gly Gly Gln Ile Val Gly
20 25 30
Gly Val Tyr Leu Leu Pro Arg Arg Gly Pro Arg Leu Gly Val Arg Ala
35 40 45
Thr Arg Lys Thr Ser Glu Arg Ser Gln Pro Arg Gly Arg Arg Gln Pro
50 55 60
Ile Pro Lys Ala Arg Arg Pro Glu Gly Arg Ala Trp Ala Gln Pro Gly
65 70 75 80
Tyr Pro Trp Pro Leu Tyr Gly Asn Glu Gly Leu Gly Trp Ala Gly Trp
85 90 95
Leu Leu Ser Pro Arg Gly Ser Arg Pro Ser Trp Gly Pro Thr Asp Pro
100 105 110
Arg Arg Arg Ser Arg Asn Leu Gly Lys Val Ile Asp Thr Leu Thr Cys
115 120 125
Gly Phe Ala Asp Leu Met Gly Tyr Ile Pro Leu Val Gly Ala Pro Leu
130 135 140
Gly Gly Ala Ala Arg Ala Leu Ala His Gly Val Arg Val Leu Glu Asp
145 150 155 160
Gly Val Asn Tyr Ala Thr Gly Asn Leu Pro Gly Cys Ser Phe Ser Ile
165 170 175
Phe Leu Leu Ala Leu Leu Ser Cys Leu Thr Ile Pro Ala Ser Ala Tyr
180 185 190
Glu Val Arg Asn Val Ser Gly Ile Tyr His Val
195 200
<210> 3
<211> 219
<212> PRT
<213> Hepatitis C virus
<400> 3
Thr Asn Asp Cys Ser Asn Ser Ser Ile Val Tyr Glu Ala Ala Asp Val
1 5 10 15
Ile Met His Thr Pro Gly Cys Val Pro Cys Val Arg Glu Gly Asn Ala
20 25 30
Ser Arg Cys Trp Ala Ala Leu Thr Pro Thr Leu Ala Val Gly Asn Ala
35 40 45
Ser Val Pro Thr Lys Ala Ile Arg Arg His Val Asp Leu Leu Val Gly
50 55 60
Thr Ala Ala Phe Cys Ser Ala Met Tyr Val Gly Asp Leu Cys Gly Tyr
65 70 75 80
Ile Thr Lys Leu Leu Leu Ala Thr Leu Gly Leu Leu Met Val Leu Gln
85 90 95
Ala Ala Ile Ala Arg Val Pro Tyr Phe Val Arg Thr Gln Gly Leu Ile
100 105 110
Arg Val Cys Met Leu Val Arg Lys Val Ala Gly Gly His Tyr Ala Gln
115 120 125
Met Ala Phe Ile Lys Leu Ala Ala Leu Thr Gly Thr Tyr Val Tyr Asp
130 135 140
His Leu Thr Pro Leu Arg Asp Trp Ala His Ala Gly Leu Arg Asp Leu
145 150 155 160
Ala Val Ala Val Glu Pro Val Ile Phe Ser Asp Met Glu Thr Lys Ile
165 170 175
Ile Thr Trp Gly Ala Asp Thr Ala Ala Cys Gly Asp Ile Ile Leu Gly
180 185 190
Leu Pro Val Ser Ala Arg Arg Gly Arg Glu Ile Leu Leu Gly Pro Ala
195 200 205
Asp Ser Leu Glu Gly Arg Gly Trp Arg Leu Leu
210 215
<210> 4
<211> 631
<212> PRT
<213> Hepatitis C virus
<400> 4
Ala Pro Ile Thr Ala Tyr Ser Gln Gln Thr Arg Gly Leu Leu Gly Cys
1 5 10 15
Ile Ile Thr Ser Leu Thr Gly Arg Asp Lys Asn Gln Val Glu Gly Glu
20 25 30
Val Gln Val Val Ser Thr Ala Thr Gln Ser Phe Leu Ala Thr Cys Val
35 40 45
Asn Gly Val Cys Trp Thr Val Tyr His Gly Ala Gly Ser Lys Thr Leu
50 55 60
Ala Gly Pro Lys Gly Pro Ile Ile Gln Met Tyr Thr Asn Val Asp Gln
65 70 75 80
Asp Leu Val Gly Trp Gln Ala Pro Pro Gly Ala Arg Ser Leu Thr Pro
85 90 95
Cys Thr Cys Gly Ser Ser Asp Leu Tyr Leu Val Thr Arg His Ala Asp
100 105 110
Val Ile Pro Val Arg Arg Arg Gly Asp Gly Arg Gly Ser Leu Leu Ser
115 120 125
Pro Lys Pro Ile Ser Tyr Leu Lys Gly Ser Ser Gly Gly Pro Leu Leu
130 135 140
Cys Pro Ser Gly His Ala Val Gly Ile Phe Arg Ala Ala Val Cys Thr
145 150 155 160
Arg Gly Ile Ala Lys Ala Val Asp Phe Val Pro Val Glu Cys Met Glu
165 170 175
Thr Thr Met Arg Ser Pro Val Phe Thr Asp Asn Ser Ser Pro Pro Thr
180 185 190
Val Pro Gln Thr Phe Gln Val Ala His Leu His Ala Pro Thr Gly Ser
195 200 205
Gly Lys Ser Thr Lys Val Pro Ala Ala Tyr Ala Ala Gln Gly Tyr Lys
210 215 220
Val Leu Val Leu Asn Pro Ser Val Ala Ala Thr Leu Ser Phe Gly Ala
225 230 235 240
Tyr Met Ser Lys Ala His Gly Val Asp Pro Asn Ile Arg Thr Gly Met
245 250 255
Arg Thr Ile Thr Thr Gly Ala Pro Ile Thr Tyr Ser Thr Tyr Gly Lys
260 265 270
Phe Leu Ala Asp Gly Gly Cys Ser Gly Gly Ala Tyr Asp Ile Ile Leu
275 280 285
Cys Asp Glu Cys His Ser Thr Asp Ser Thr Thr Val Leu Gly Ile Gly
290 295 300
Thr Val Leu Asp Gln Ala Glu Thr Ala Gly Ala Arg Leu Val Val Leu
305 310 315 320
Ala Thr Ala Thr Pro Pro Gly Ser Val Thr Val Pro His Pro Asn Ile
325 330 335
Glu Glu Val Ala Leu Ser Asn Thr Gly Glu Ile Pro Phe Tyr Gly Lys
340 345 350
Ala Ile Pro Ile Glu Val Ile Lys Gly Gly Arg His Leu Ile Phe Cys
355 360 365
His Ser Lys Lys Lys Cys Asp Glu Leu Ala Ala Lys Leu Ser Gly Leu
370 375 380
Gly Leu Asn Ala Val Val Tyr Tyr Arg Gly Leu Asp Val Ser Val Ile
385 390 395 400
Pro Thr Ser Gly Asp Val Val Val Val Ala Thr Asp Ala Leu Met Thr
405 410 415
Gly Tyr Thr Gly Asp Phe Asp Ser Val Ile Asp Cys Asn Thr Cys Val
420 425 430
Thr Gln Thr Val Asp Phe Ser Leu Asp Pro Thr Phe Thr Ile Asp Thr
435 440 445
Thr Thr Val Pro Gln Asp Ala Val Ser Arg Ser Gln Arg Arg Gly Arg
450 455 460
Thr Gly Arg Gly Arg Gly Gly Ile Tyr Arg Phe Val Thr Pro Gly Glu
465 470 475 480
Arg Pro Ser Gly Met Phe Asp Ser Ser Val Leu Cys Glu Cys Tyr Asp
485 490 495
Ala Gly Cys Ala Trp Tyr Glu Leu Thr Pro Ala Glu Thr Thr Val Arg
500 505 510
Leu Arg Ala Tyr Leu Asn Thr Pro Gly Leu Pro Val Cys Gln Asp His
515 520 525
Leu Glu Phe Trp Glu Gly Val Phe Thr Gly Leu Thr His Ile Glu Ala
530 535 540
His Leu Leu Ser Gln Thr Lys Asp Ala Gly Asp Asn Tyr Pro Tyr Leu
545 550 555 560
Val Ala Tyr Gln Ala Thr Val Cys Ala Arg Ala Gln Ala Pro Pro Pro
565 570 575
Ser Trp Asp Gln Met Trp Lys Cys Leu Met Arg Leu Lys Pro Thr Leu
580 585 590
His Gly Pro Thr Pro Leu Leu Tyr Arg Leu Gly Ala Val Gln Asn Glu
595 600 605
Val Thr Leu Thr His Pro Ile Thr Lys Tyr Ile Ile Thr Cys Met Ser
610 615 620
Ala Asp Leu Glu Val Val Thr
625 630
<210> 5
<211> 54
<212> PRT
<213> Hepatitis C virus
<400> 5
Ser Thr Trp Val Leu Val Gly Gly Val Leu Ala Ala Leu Ala Ala Tyr
1 5 10 15
Cys Leu Thr Thr Gly Ser Val Val Ile Val Gly Arg Ile Ile Leu Ser
20 25 30
Gly Lys Pro Ala Val Ile Pro Asp Arg Glu Val Leu Tyr Gln Ala Phe
35 40 45
Asp Glu Met Glu Glu Cys
50
<210> 6
<211> 260
<212> PRT
<213> Hepatitis C virus
<400> 6
Ala Ser His Leu Pro Tyr Ile Glu Gln Gly Met Gln Leu Ala Glu Gln
1 5 10 15
Phe Lys Gln Lys Ala Leu Gly Leu Leu Gln Thr Ala Thr Lys Gln Ala
20 25 30
Glu Ala Ala Ala Pro Met Val Glu Ser Lys Trp His Ala Leu Glu Ala
35 40 45
Phe Trp Ala Lys His Met Trp Asn Phe Ile Ser Gly Ile Gln Tyr Leu
50 55 60
Ala Gly Leu Ser Thr Leu Pro Gly Asn Pro Ala Ile Ala Ser Leu Met
65 70 75 80
Ala Phe Thr Ala Ser Val Thr Ser Pro Leu Thr Thr Gln Ser Thr Leu
85 90 95
Leu Phe Asn Ile Leu Gly Gly Trp Val Ala Ala Gln Leu Ala Pro Pro
100 105 110
Gly Ala Ala Ser Ala Phe Val Gly Ala Gly Ile Ala Gly Ala Ala Val
115 120 125
Gly Ser Ile Gly Leu Gly Lys Val Leu Val Asp Ile Leu Ala Gly Tyr
130 135 140
Gly Ala Gly Val Ala Gly Ala Leu Val Ala Phe Lys Ile Met Ser Gly
145 150 155 160
Asp Met Pro Ser Thr Glu Asp Leu Val Asn Leu Leu Pro Ala Ile Leu
165 170 175
Ser Pro Gly Ala Leu Val Val Gly Val Val Cys Ala Ala Ile Leu Arg
180 185 190
Arg His Val Gly Pro Gly Glu Gly Ala Val Gln Trp Met Asn Arg Leu
195 200 205
Ile Ala Phe Ala Ser Arg Gly Asn His Ile Ser Pro Thr His Tyr Val
210 215 220
Pro Glu Ser Asp Ala Ala Ala Arg Val Thr Gln Ile Leu Ser Asn Leu
225 230 235 240
Thr Ile Thr Gln Leu Leu Lys Arg Leu His Gln Trp Ile Asn Glu Asp
245 250 255
Cys Ser Thr Pro
260
<210> 7
<211> 448
<212> PRT
<213> Hepatitis C virus
<400> 7
Cys Ser Gly Ser Trp Leu Arg Asp Val Trp Asp Trp Ile Cys Thr Val
1 5 10 15
Leu Ala Asp Phe Lys Thr Trp Leu Gln Ser Lys Leu Leu Pro Arg Leu
20 25 30
Pro Gly Val Pro Phe Phe Ser Cys Gln Arg Gly Tyr Lys Gly Val Trp
35 40 45
Arg Gly Asp Gly Met Met His Thr Thr Cys Pro Cys Gly Ala Gln Ile
50 55 60
Thr Gly His Val Lys Asn Gly Ser Met Arg Ile Ala Gly Pro Arg Thr
65 70 75 80
Cys Ser Asn Thr Trp His Gly Thr Phe Pro Ile Asn Ala Tyr Thr Thr
85 90 95
Gly Pro Cys Thr Pro Ser Pro Ala Pro Asn Tyr Ser Lys Ala Leu Trp
100 105 110
Arg Val Ala Ala Glu Glu Tyr Val Glu Val Thr Arg Val Gly Asp Phe
115 120 125
His Tyr Val Thr Gly Met Thr Thr Asp Asn Ile Lys Cys Pro Cys Gln
130 135 140
Val Pro Ala Pro Glu Phe Phe Thr Glu Leu Asp Gly Val Arg Leu His
145 150 155 160
Arg Tyr Ala Pro Val Cys Lys Pro Leu Leu Arg Glu Glu Val Leu Phe
165 170 175
Gln Val Gly Cys Asn Gln Tyr Leu Val Gly Ser Gln Leu Pro Cys Glu
180 185 190
Pro Glu Pro Asp Val Ala Val Leu Thr Ser Met Leu Ala Asp Pro Ser
195 200 205
His Ile Thr Ala Glu Thr Ala Lys Arg Arg Leu Ala Arg Gly Ser Pro
210 215 220
Pro Ser Leu Ala Ser Ser Ser Ala Ser Gln Leu Ser Ala Pro Ser Leu
225 230 235 240
Lys Ala Thr Cys Asn Thr His His Arg Ser Pro Asp Leu Asp Leu Ile
245 250 255
Glu Ala Asn Leu Leu Trp Trp Gln Glu Lys Gly Gly Asn Ile Thr Arg
260 265 270
Val Glu Ser Glu Asn Lys Val Ile Ile Met Asp Ser Phe Asp Pro Leu
275 280 285
Arg Ala Glu Glu Asp Glu Arg Glu Ile Ser Val Ala Ala Glu Ile Leu
290 295 300
Arg Gln Ser Arg Lys Phe Pro Pro Ala Leu Pro Val Trp Ala Arg Pro
305 310 315 320
Asp Tyr Asn Pro Pro Leu Leu Glu Pro Trp Lys Asp Pro Asp Tyr Val
325 330 335
Pro Pro Val Val His Gly Cys Pro Leu Pro Pro Ala Lys Thr Pro Pro
340 345 350
Ile Pro Pro Pro Arg Arg Lys Arg Thr Val Val Leu Thr Glu Ser Thr
355 360 365
Val Ser Ser Val Leu Ala Glu Leu Thr Thr Lys Thr Phe Gly Ser Ser
370 375 380
Glu Ser Ser Ala Ala Asp Ser Gly Met Ala Thr Ala Pro Pro Asp Gln
385 390 395 400
Ala Ser Gly Asp Gly Asp Lys Glu Ser Asp Val Glu Ser Tyr Ser Ser
405 410 415
Met Pro Pro Leu Glu Gly Glu Pro Gly Asp Pro Asp Leu Ser Asp Gly
420 425 430
Ser Trp Ser Thr Val Ser Glu Glu Ala Gly Glu Asp Val Val Cys Cys
435 440 445
<210> 8
<211> 591
<212> PRT
<213> Hepatitis C virus
<400> 8
Ser Met Ser Tyr Thr Trp Thr Gly Ala Leu Ile Thr Pro Cys Ala Ala
1 5 10 15
Glu Glu Ser Lys Leu Pro Ile Asn Ala Leu Ser Asn Ser Leu Leu Arg
20 25 30
His His Asn Met Val Tyr Ala Thr Thr Ser Arg Ser Ala Ser Gln Arg
35 40 45
Gln Lys Lys Val Thr Phe Asp Arg Leu Gln Val Leu Asp Asp His Tyr
50 55 60
Arg Asp Val Leu Lys Glu Met Lys Ala Lys Ala Ser Thr Val Lys Ala
65 70 75 80
Lys Leu Leu Ser Val Glu Glu Ala Cys Lys Leu Thr Pro Pro His Ser
85 90 95
Ala Lys Ser Lys Phe Gly Tyr Gly Ala Lys Asp Val Arg Asn Leu Ser
100 105 110
Ser Arg Ala Val Asn His Ile Arg Ser Val Trp Lys Asp Leu Leu Glu
115 120 125
Asp Thr Glu Thr Pro Ile Asp Thr Thr Ile Met Ala Lys Ser Glu Val
130 135 140
Phe Cys Ile Gln Pro Glu Lys Gly Gly Arg Lys Pro Ala Arg Leu Ile
145 150 155 160
Val Phe Pro Asp Leu Gly Val Arg Val Cys Glu Lys Met Ala Leu Tyr
165 170 175
Asp Val Val Ser Thr Leu Pro Gln Ala Val Met Gly Ser Ser Tyr Gly
180 185 190
Phe Gln Tyr Ser Pro Gly Gln Arg Val Glu Phe Leu Val Asn Ala Trp
195 200 205
Lys Ser Lys Lys Asn Pro Met Gly Phe Ser Tyr Asp Thr Arg Cys Phe
210 215 220
Asp Ser Thr Val Thr Glu Ser Asp Ile Arg Val Glu Glu Ser Ile Tyr
225 230 235 240
Gln Cys Cys Asp Leu Ala Pro Glu Ala Arg Gln Ala Ile Lys Ser Leu
245 250 255
Thr Glu Arg Leu Tyr Ile Gly Gly Pro Leu Thr Asn Ser Lys Gly Gln
260 265 270
Asn Cys Gly Tyr Arg Arg Cys Arg Ala Ser Gly Val Leu Thr Thr Ser
275 280 285
Cys Gly Asn Thr Leu Thr Cys Tyr Leu Lys Ala Ala Ala Ala Cys Arg
290 295 300
Ala Ala Lys Leu Gln Asp Cys Thr Met Leu Val Asn Gly Asp Asp Leu
305 310 315 320
Val Val Ile Cys Glu Ser Ala Gly Thr Gln Glu Asp Ala Ala Asn Leu
325 330 335
Arg Val Phe Thr Glu Ala Met Thr Arg Tyr Ser Ala Pro Pro Gly Asp
340 345 350
Ser Pro Gln Pro Glu Tyr Asp Leu Glu Leu Ile Thr Ser Cys Ser Ser
355 360 365
Asn Val Ser Val Ala His Asp Ala Ser Gly Lys Arg Val Tyr Tyr Leu
370 375 380
Thr Arg Asp Pro Thr Thr Pro Leu Ala Arg Ala Ala Trp Glu Thr Ala
385 390 395 400
Arg His Thr Pro Val Asn Ser Trp Leu Gly Asn Ile Ile Met Tyr Ala
405 410 415
Pro Thr Leu Trp Ala Arg Met Ile Leu Met Thr His Phe Phe Ser Ile
420 425 430
Leu Leu Ala Gln Glu Gln Leu Gly Lys Ala Leu Asp Cys Gln Ile Tyr
435 440 445
Gly Ala Cys Tyr Ser Ile Glu Pro Leu Asp Leu Pro Gln Ile Ile Glu
450 455 460
Arg Leu His Gly Leu Ser Ala Phe Ser Leu His Ser Tyr Ser Pro Gly
465 470 475 480
Glu Ile Asn Arg Val Ala Ser Cys Leu Arg Lys Leu Gly Val Pro Pro
485 490 495
Leu Arg Val Trp Arg His Arg Ala Arg Ser Val Arg Ala Lys Leu Leu
500 505 510
Ser Gln Gly Gly Arg Ala Ala Thr Cys Gly Lys Tyr Leu Phe Asn Trp
515 520 525
Ala Val Lys Thr Lys Leu Lys Leu Thr Pro Ile Pro Ala Ala Ser Gln
530 535 540
Leu Asp Leu Ser Ser Trp Phe Val Ala Gly Tyr Ser Gly Gly Asp Ile
545 550 555 560
Tyr His Ser Leu Ser Arg Ala Arg Pro Arg Trp Phe Met Trp Cys Leu
565 570 575
Leu Leu Leu Ser Val Gly Val Gly Ile Tyr Leu Leu Pro Asn Arg
580 585 590
<210> 9
<211> 1060
<212> DNA
<213> Hepatitis C virus
<220>
<221> CDS
<222> (229)..(1059)
<400> 9
ccaggccccc ccctcccggg agagccatag tggtctgcgg aaccggtgag tacaccggaa 60
ttaccggaaa gactgggtcc tttcttggat aaacccactc tatgtccggt catttgggcg 120
tgcccccgca agactgctag ccgagtagcg ttggggtgcg aagggccttg tggtactgcc 180
tgatagggtg cttgcgagtg ccccgggagg tctcgtagac cgtgcatc atg agc aca 237
Met Ser Thr
1
aat cct aaa cct caa aga aaa acc aaa aga agc aca aac cgc cgc cca 285
Asn Pro Lys Pro Gln Arg Lys Thr Lys Arg Ser Thr Asn Arg Arg Pro
5 10 15
cag gac gtc aag ttc ccg ggt ggc ggt cag atc gtt ggc gga gtt tac 333
Gln Asp Val Lys Phe Pro Gly Gly Gly Gln Ile Val Gly Gly Val Tyr
20 25 30 35
ttg ctg ccg cgc agg ggc ccc agg ttg ggt gtg cgc gcg aca agg aag 381
Leu Leu Pro Arg Arg Gly Pro Arg Leu Gly Val Arg Ala Thr Arg Lys
40 45 50
act tcc gag cga tcc cag ccg cgt ggg aga cgc cag ccc atc ccg aaa 429
Thr Ser Glu Arg Ser Gln Pro Arg Gly Arg Arg Gln Pro Ile Pro Lys
55 60 65
gat cgg cgc tcc acc ggc aag tcc tgg gga aag cca gga tat cct tgg 477
Asp Arg Arg Ser Thr Gly Lys Ser Trp Gly Lys Pro Gly Tyr Pro Trp
70 75 80
ccc ctg tat gga aac gag ggt tgc ggc tgg gca ggt tgg ctc ctg tcc 525
Pro Leu Tyr Gly Asn Glu Gly Cys Gly Trp Ala Gly Trp Leu Leu Ser
85 90 95
ccc cgc ggg tct cgt cct act tgg ggc ccc acc gac ccc cgg cac aga 573
Pro Arg Gly Ser Arg Pro Thr Trp Gly Pro Thr Asp Pro Arg His Arg
100 105 110 115
tca cgc aat tgg ggt aaa gtc atc gat acc ctt acg tgt ggt ttt gcc 621
Ser Arg Asn Trp Gly Lys Val Ile Asp Thr Leu Thr Cys Gly Phe Ala
120 125 130
gac ctc atg ggg tac atc cct gtc att ggc gcc ccg gtc gga ggc gtt 669
Asp Leu Met Gly Tyr Ile Pro Val Ile Gly Ala Pro Val Gly Gly Val
135 140 145
gcc aga gcc cta gcg cac ggt gtt agg gtc ctg gaa gac ggg gtg aat 717
Ala Arg Ala Leu Ala His Gly Val Arg Val Leu Glu Asp Gly Val Asn
150 155 160
tac gca aca ggg aat cta ccc ggt tgc tct ttt tct atc ttc ttg ctt 765
Tyr Ala Thr Gly Asn Leu Pro Gly Cys Ser Phe Ser Ile Phe Leu Leu
165 170 175
gcc ctt ctg tcg tgc gtc aca gtg cca gtg tct gca gtg gag gtc agg 813
Ala Leu Leu Ser Cys Val Thr Val Pro Val Ser Ala Val Glu Val Arg
180 185 190 195
aac att agt tct agc tac tat gcc act aac gat tgc tcg gac aac agc 861
Asn Ile Ser Ser Ser Tyr Tyr Ala Thr Asn Asp Cys Ser Asp Asn Ser
200 205 210
atc acc tgg cag cgc ctt gtg ttt gaa gtc aca aaa tgg ttg tta gca 909
Ile Thr Trp Gln Arg Leu Val Phe Glu Val Thr Lys Trp Leu Leu Ala
215 220 225
atc ctg ggg tct gcc cac ctc ctt aaa gcg tcc ctg cta cgg gtg cca 957
Ile Leu Gly Ser Ala His Leu Leu Lys Ala Ser Leu Leu Arg Val Pro
230 235 240
tac ttt gtg agg gct cac gct ctg cta cgg gtg tgt acc ctg gtg agg 1005
Tyr Phe Val Arg Ala His Ala Leu Leu Arg Val Cys Thr Leu Val Arg
245 250 255
cac ctt gca gga gct aag tac atc cag atg ctg ttg atc act gtg ggc 1053
His Leu Ala Gly Ala Lys Tyr Ile Gln Met Leu Leu Ile Thr Val Gly
260 265 270 275
agg cgg a 1060
Arg Arg
<210> 10
<211> 277
<212> PRT
<213> Hepatitis C virus
<400> 10
Met Ser Thr Asn Pro Lys Pro Gln Arg Lys Thr Lys Arg Ser Thr Asn
1 5 10 15
Arg Arg Pro Gln Asp Val Lys Phe Pro Gly Gly Gly Gln Ile Val Gly
20 25 30
Gly Val Tyr Leu Leu Pro Arg Arg Gly Pro Arg Leu Gly Val Arg Ala
35 40 45
Thr Arg Lys Thr Ser Glu Arg Ser Gln Pro Arg Gly Arg Arg Gln Pro
50 55 60
Ile Pro Lys Asp Arg Arg Ser Thr Gly Lys Ser Trp Gly Lys Pro Gly
65 70 75 80
Tyr Pro Trp Pro Leu Tyr Gly Asn Glu Gly Cys Gly Trp Ala Gly Trp
85 90 95
Leu Leu Ser Pro Arg Gly Ser Arg Pro Thr Trp Gly Pro Thr Asp Pro
100 105 110
Arg His Arg Ser Arg Asn Trp Gly Lys Val Ile Asp Thr Leu Thr Cys
115 120 125
Gly Phe Ala Asp Leu Met Gly Tyr Ile Pro Val Ile Gly Ala Pro Val
130 135 140
Gly Gly Val Ala Arg Ala Leu Ala His Gly Val Arg Val Leu Glu Asp
145 150 155 160
Gly Val Asn Tyr Ala Thr Gly Asn Leu Pro Gly Cys Ser Phe Ser Ile
165 170 175
Phe Leu Leu Ala Leu Leu Ser Cys Val Thr Val Pro Val Ser Ala Val
180 185 190
Glu Val Arg Asn Ile Ser Ser Ser Tyr Tyr Ala Thr Asn Asp Cys Ser
195 200 205
Asp Asn Ser Ile Thr Trp Gln Arg Leu Val Phe Glu Val Thr Lys Trp
210 215 220
Leu Leu Ala Ile Leu Gly Ser Ala His Leu Leu Lys Ala Ser Leu Leu
225 230 235 240
Arg Val Pro Tyr Phe Val Arg Ala His Ala Leu Leu Arg Val Cys Thr
245 250 255
Leu Val Arg His Leu Ala Gly Ala Lys Tyr Ile Gln Met Leu Leu Ile
260 265 270
Thr Val Gly Arg Arg
275
<210> 11
<211> 1161
<212> DNA
<213> Hepatitis C virus
<220>
<221> CDS
<222> (229)..(1161)
<400> 11
ccaggccccc ccctcccggg agagccatag tggtctgcgg aaccggtgag tacaccggaa 60
ttaccggaaa gtctgggtcc tttcttggat aaacccactc tatgtccggt catttgggcg 120
tgcccccgca agactgctag ccgagtagcg ttgggttgcg aaaggccttg tggtactgcc 180
tgatagggtg cttgcgagtg ccccgggagg tctcgtagac cgtgcatc atg agc aca 237
Met Ser Thr
1
aat cct aaa cct caa aga aaa acc aaa aga aac aca aac cgc cgc cca 285
Asn Pro Lys Pro Gln Arg Lys Thr Lys Arg Asn Thr Asn Arg Arg Pro
5 10 15
cag gac gtc aag ttc ccg ggt ggc ggt cag atc gtt ggc gga gtt tac 333
Gln Asp Val Lys Phe Pro Gly Gly Gly Gln Ile Val Gly Gly Val Tyr
20 25 30 35
ttg ctg ccg cgc agg ggc ccc agg ttg ggt gtg cgc gcg aca agg aag 381
Leu Leu Pro Arg Arg Gly Pro Arg Leu Gly Val Arg Ala Thr Arg Lys
40 45 50
act tcc gag cga tcc cag ccg cgt ggg aga cgc cag ccc atc ccg aaa 429
Thr Ser Glu Arg Ser Gln Pro Arg Gly Arg Arg Gln Pro Ile Pro Lys
55 60 65
gat cgg cgc tcc acc ggc aag tcc tgg gga aag cca gga tat cct tgg 477
Asp Arg Arg Ser Thr Gly Lys Ser Trp Gly Lys Pro Gly Tyr Pro Trp
70 75 80
ccc ctg tac gga aac gag ggt tgc ggc tgg gca ggt tgg ctc ctg tcc 525
Pro Leu Tyr Gly Asn Glu Gly Cys Gly Trp Ala Gly Trp Leu Leu Ser
85 90 95
ccc cgc ggg tct cgt cct act tgg ggc ccc acc gac ccc cgg cac aga 573
Pro Arg Gly Ser Arg Pro Thr Trp Gly Pro Thr Asp Pro Arg His Arg
100 105 110 115
tca cgc aat ttg ggt aaa gtc atc gat acc ctt acg tgt ggt ttt gcc 621
Ser Arg Asn Leu Gly Lys Val Ile Asp Thr Leu Thr Cys Gly Phe Ala
120 125 130
gac ctc atg ggg tac atc cct gtc att ggc gcc ccg gtc gga ggc gtt 669
Asp Leu Met Gly Tyr Ile Pro Val Ile Gly Ala Pro Val Gly Gly Val
135 140 145
gcc aga gcc cta gcg cac ggt gtt agg gtc ctg gaa gac ggg gtg aat 717
Ala Arg Ala Leu Ala His Gly Val Arg Val Leu Glu Asp Gly Val Asn
150 155 160
tac gca aca ggg aat cta ccc ggt tgc tct ttt tct atc ttc ttg ctt 765
Tyr Ala Thr Gly Asn Leu Pro Gly Cys Ser Phe Ser Ile Phe Leu Leu
165 170 175
gcc ctt ctg tcg tgc gtc aca gtg cca gtg tct gca gtg gag gtc agg 813
Ala Leu Leu Ser Cys Val Thr Val Pro Val Ser Ala Val Glu Val Arg
180 185 190 195
aac att agt tct agc tac tat gcc act gac gat tgc tcg aac aac agc 861
Asn Ile Ser Ser Ser Tyr Tyr Ala Thr Asp Asp Cys Ser Asn Asn Ser
200 205 210
atc acc tgg cag cgc ctt gtg ttt gaa gtc aca aaa tgg ctg tta gca 909
Ile Thr Trp Gln Arg Leu Val Phe Glu Val Thr Lys Trp Leu Leu Ala
215 220 225
atc ctg ggg tct gcc cac ctc ctt aaa gcg tcc ctg cta cgg gtg cca 957
Ile Leu Gly Ser Ala His Leu Leu Lys Ala Ser Leu Leu Arg Val Pro
230 235 240
tac ttt gtg agg gct cac gct ctg cta cgg gtg tgt acc ctg gtg agg 1005
Tyr Phe Val Arg Ala His Ala Leu Leu Arg Val Cys Thr Leu Val Arg
245 250 255
cac ctt gca gga gct aag tac atc cag atg ctg ttg atc act gta ggc 1053
His Leu Ala Gly Ala Lys Tyr Ile Gln Met Leu Leu Ile Thr Val Gly
260 265 270 275
agg tgg acc ggc act tac atc tat gtc cac ctc tcc ccc tta tca act 1101
Arg Trp Thr Gly Thr Tyr Ile Tyr Val His Leu Ser Pro Leu Ser Thr
280 285 290
tgg gca gct cag ggt ttg cgg gac ctg gcg gtc gcc gtg gag cct gtg 1149
Trp Ala Ala Gln Gly Leu Arg Asp Leu Ala Val Ala Val Glu Pro Val
295 300 305
gtg ttc agc cca 1161
Val Phe Ser Pro
310
<210> 12
<211> 311
<212> PRT
<213> Hepatitis C virus
<400> 12
Met Ser Thr Asn Pro Lys Pro Gln Arg Lys Thr Lys Arg Asn Thr Asn
1 5 10 15
Arg Arg Pro Gln Asp Val Lys Phe Pro Gly Gly Gly Gln Ile Val Gly
20 25 30
Gly Val Tyr Leu Leu Pro Arg Arg Gly Pro Arg Leu Gly Val Arg Ala
35 40 45
Thr Arg Lys Thr Ser Glu Arg Ser Gln Pro Arg Gly Arg Arg Gln Pro
50 55 60
Ile Pro Lys Asp Arg Arg Ser Thr Gly Lys Ser Trp Gly Lys Pro Gly
65 70 75 80
Tyr Pro Trp Pro Leu Tyr Gly Asn Glu Gly Cys Gly Trp Ala Gly Trp
85 90 95
Leu Leu Ser Pro Arg Gly Ser Arg Pro Thr Trp Gly Pro Thr Asp Pro
100 105 110
Arg His Arg Ser Arg Asn Leu Gly Lys Val Ile Asp Thr Leu Thr Cys
115 120 125
Gly Phe Ala Asp Leu Met Gly Tyr Ile Pro Val Ile Gly Ala Pro Val
130 135 140
Gly Gly Val Ala Arg Ala Leu Ala His Gly Val Arg Val Leu Glu Asp
145 150 155 160
Gly Val Asn Tyr Ala Thr Gly Asn Leu Pro Gly Cys Ser Phe Ser Ile
165 170 175
Phe Leu Leu Ala Leu Leu Ser Cys Val Thr Val Pro Val Ser Ala Val
180 185 190
Glu Val Arg Asn Ile Ser Ser Ser Tyr Tyr Ala Thr Asp Asp Cys Ser
195 200 205
Asn Asn Ser Ile Thr Trp Gln Arg Leu Val Phe Glu Val Thr Lys Trp
210 215 220
Leu Leu Ala Ile Leu Gly Ser Ala His Leu Leu Lys Ala Ser Leu Leu
225 230 235 240
Arg Val Pro Tyr Phe Val Arg Ala His Ala Leu Leu Arg Val Cys Thr
245 250 255
Leu Val Arg His Leu Ala Gly Ala Lys Tyr Ile Gln Met Leu Leu Ile
260 265 270
Thr Val Gly Arg Trp Thr Gly Thr Tyr Ile Tyr Val His Leu Ser Pro
275 280 285
Leu Ser Thr Trp Ala Ala Gln Gly Leu Arg Asp Leu Ala Val Ala Val
290 295 300
Glu Pro Val Val Phe Ser Pro
305 310
<210> 13
<211> 1095
<212> DNA
<213> Hepatitis C virus
<220>
<221> CDS
<222> (229)..(1095)
<400> 13
ccaggtcccc ccctcccggg agagccatag tggtctgcgg aaccggtgag tacaccggaa 60
ttgccaggac gaccgggtcc tttcttggat caacccgctc aatgcctgga gatttgggcg 120
tgcccccgcg agactgctag ccgagtagtg ttgggtcgcg aaaggccttg tggtactgcc 180
tgatagggtg cttgcgagtg ccccgggagg tctcgtagac cgtgcatc atg agc aca 237
Met Ser Thr
1
aat cct aaa cct caa aga aaa acc aaa cgt aac acc aac cgc cgc cca 285
Asn Pro Lys Pro Gln Arg Lys Thr Lys Arg Asn Thr Asn Arg Arg Pro
5 10 15
cag gac gtc aag ttc ccg ggc ggt ggt cag atc gtt ggt gga gtt tac 333
Gln Asp Val Lys Phe Pro Gly Gly Gly Gln Ile Val Gly Gly Val Tyr
20 25 30 35
ctg ttg ccg cgc agg ggc ccc agg ttg ggt gtg cgc gcg act agg aag 381
Leu Leu Pro Arg Arg Gly Pro Arg Leu Gly Val Arg Ala Thr Arg Lys
40 45 50
act tcc gag cgg tcg caa cct cgt gga agg cga caa cct atc ccc aag 429
Thr Ser Glu Arg Ser Gln Pro Arg Gly Arg Arg Gln Pro Ile Pro Lys
55 60 65
gct cgc cag ccc gag ggt agg gcc tgg gct cag ccc ggg tac cct cgg 477
Ala Arg Gln Pro Glu Gly Arg Ala Trp Ala Gln Pro Gly Tyr Pro Arg
70 75 80
cct agt tgg ggc ccc acg gac ccc cgg cgt agg tcg cgt aat ttg ggt 525
Pro Ser Trp Gly Pro Thr Asp Pro Arg Arg Arg Ser Arg Asn Leu Gly
85 90 95
aag gtc atc gat acc ctt aca tgc ggc ttc gcc gac ctc atg ggg tac 573
Lys Val Ile Asp Thr Leu Thr Cys Gly Phe Ala Asp Leu Met Gly Tyr
100 105 110 115
atc ccg ctc gtc ggc gcc ccc cta ggg ggc gct gcc agg gcc ttg gcg 621
Ile Pro Leu Val Gly Ala Pro Leu Gly Gly Ala Ala Arg Ala Leu Ala
120 125 130
cat ggc gtc cgg gtt ctg gag gac ggc gtg aac tat gca aca ggg aac 669
His Gly Val Arg Val Leu Glu Asp Gly Val Asn Tyr Ala Thr Gly Asn
135 140 145
ctt ccc ggt tgc tct ttc tct atc ttc ctc ttg gct ttg ctg tcc tgt 717
Leu Pro Gly Cys Ser Phe Ser Ile Phe Leu Leu Ala Leu Leu Ser Cys
150 155 160
ttg acc att cca gcc tcc gcc cat gtc ccc cct ctc aac gtc cgg gga 765
Leu Thr Ile Pro Ala Ser Ala His Val Pro Pro Leu Asn Val Arg Gly
165 170 175
ggc cgc gac gcc atc atc ctt ctc aca tgt gcg gtc cac tca gag cta 813
Gly Arg Asp Ala Ile Ile Leu Leu Thr Cys Ala Val His Ser Glu Leu
180 185 190 195
gtt ttt aaa atc acc aaa atc ctg ctt gca ata ctt ggt ccg ctc atg 861
Val Phe Lys Ile Thr Lys Ile Leu Leu Ala Ile Leu Gly Pro Leu Met
200 205 210
gtg ctc cag gct ggt ctc att agg gtg ccg tac ttc gtg cgc gcc caa 909
Val Leu Gln Ala Gly Leu Ile Arg Val Pro Tyr Phe Val Arg Ala Gln
215 220 225
ggg ctt atc cgt gca tgc atg ttg gtg cgg aag atc gct ggg ggt cat 957
Gly Leu Ile Arg Ala Cys Met Leu Val Arg Lys Ile Ala Gly Gly His
230 235 240
tat gtc caa atg gct ctc gtg aag ctg gcc gca ctg acg ggc acg tac 1005
Tyr Val Gln Met Ala Leu Val Lys Leu Ala Ala Leu Thr Gly Thr Tyr
245 250 255
gtc tat gac cat ctt act cca ctg cgg gac tgg gcc cac acg ggc ctg 1053
Val Tyr Asp His Leu Thr Pro Leu Arg Asp Trp Ala His Thr Gly Leu
260 265 270 275
cga gac ctc gcg gtg gcg gtc gag ccc gtc gtc ttc tct gac 1095
Arg Asp Leu Ala Val Ala Val Glu Pro Val Val Phe Ser Asp
280 285
<210> 14
<211> 289
<212> PRT
<213> Hepatitis C virus
<400> 14
Met Ser Thr Asn Pro Lys Pro Gln Arg Lys Thr Lys Arg Asn Thr Asn
1 5 10 15
Arg Arg Pro Gln Asp Val Lys Phe Pro Gly Gly Gly Gln Ile Val Gly
20 25 30
Gly Val Tyr Leu Leu Pro Arg Arg Gly Pro Arg Leu Gly Val Arg Ala
35 40 45
Thr Arg Lys Thr Ser Glu Arg Ser Gln Pro Arg Gly Arg Arg Gln Pro
50 55 60
Ile Pro Lys Ala Arg Gln Pro Glu Gly Arg Ala Trp Ala Gln Pro Gly
65 70 75 80
Tyr Pro Arg Pro Ser Trp Gly Pro Thr Asp Pro Arg Arg Arg Ser Arg
85 90 95
Asn Leu Gly Lys Val Ile Asp Thr Leu Thr Cys Gly Phe Ala Asp Leu
100 105 110
Met Gly Tyr Ile Pro Leu Val Gly Ala Pro Leu Gly Gly Ala Ala Arg
115 120 125
Ala Leu Ala His Gly Val Arg Val Leu Glu Asp Gly Val Asn Tyr Ala
130 135 140
Thr Gly Asn Leu Pro Gly Cys Ser Phe Ser Ile Phe Leu Leu Ala Leu
145 150 155 160
Leu Ser Cys Leu Thr Ile Pro Ala Ser Ala His Val Pro Pro Leu Asn
165 170 175
Val Arg Gly Gly Arg Asp Ala Ile Ile Leu Leu Thr Cys Ala Val His
180 185 190
Ser Glu Leu Val Phe Lys Ile Thr Lys Ile Leu Leu Ala Ile Leu Gly
195 200 205
Pro Leu Met Val Leu Gln Ala Gly Leu Ile Arg Val Pro Tyr Phe Val
210 215 220
Arg Ala Gln Gly Leu Ile Arg Ala Cys Met Leu Val Arg Lys Ile Ala
225 230 235 240
Gly Gly His Tyr Val Gln Met Ala Leu Val Lys Leu Ala Ala Leu Thr
245 250 255
Gly Thr Tyr Val Tyr Asp His Leu Thr Pro Leu Arg Asp Trp Ala His
260 265 270
Thr Gly Leu Arg Asp Leu Ala Val Ala Val Glu Pro Val Val Phe Ser
275 280 285
Asp
<210> 15
<211> 788
<212> DNA
<213> Hepatitis C virus
<220>
<221> CDS
<222> (1)..(786)
<400> 15
ctc ttg gct ctg ctg tct tgt ctg acc atc cta gct tcc gcc tat gaa 48
Leu Leu Ala Leu Leu Ser Cys Leu Thr Ile Leu Ala Ser Ala Tyr Glu
1 5 10 15
gtg cgc aac gtg tcc ggg ttg tac cat gtc acg aac gac tgc tcc aac 96
Val Arg Asn Val Ser Gly Leu Tyr His Val Thr Asn Asp Cys Ser Asn
20 25 30
tca agt att gtg tat gag gca gcg gac atg atc atg cat acc ccc ggg 144
Ser Ser Ile Val Tyr Glu Ala Ala Asp Met Ile Met His Thr Pro Gly
35 40 45
tgc gtg ccc tgc gtc cgg gag aac aac cgc tct cgc tgc tgg gta gcg 192
Cys Val Pro Cys Val Arg Glu Asn Asn Arg Ser Arg Cys Trp Val Ala
50 55 60
ctc acc cct acg ctc gcg gcc aga aac agc agc atc ccc act gcg aca 240
Leu Thr Pro Thr Leu Ala Ala Arg Asn Ser Ser Ile Pro Thr Ala Thr
65 70 75 80
ata cga cgc cat gtc gat ttg ctc gtt ggg gca gcc gct ctc tgc tcc 288
Ile Arg Arg His Val Asp Leu Leu Val Gly Ala Ala Ala Leu Cys Ser
85 90 95
gcc atg tat gtg ggg gat ctc tgc gga tct gtc ttc ctc gtg ttc ttc 336
Ala Met Tyr Val Gly Asp Leu Cys Gly Ser Val Phe Leu Val Phe Phe
100 105 110
tgt gct gcc tgg tat atc aag ggt aag ctg gtc ccc ggg gcg gca tat 384
Cys Ala Ala Trp Tyr Ile Lys Gly Lys Leu Val Pro Gly Ala Ala Tyr
115 120 125
gct ttt tat agc gta tgg ccg ctg ctc ctg ctc ttg ctg gcg cta cca 432
Ala Phe Tyr Ser Val Trp Pro Leu Leu Leu Leu Leu Leu Ala Leu Pro
130 135 140
cca cga gcg tac gct atg gac cgg gag atg gct gca tca tgt gga ggc 480
Pro Arg Ala Tyr Ala Met Asp Arg Glu Met Ala Ala Ser Cys Gly Gly
145 150 155 160
ggg gtc ttc ata ggt cta ata atc ttg act ttg tca ccg cac tat aaa 528
Gly Val Phe Ile Gly Leu Ile Ile Leu Thr Leu Ser Pro His Tyr Lys
165 170 175
gca ttc ctc gct agg ctt ata tgg tgg tta caa tat ttt atc acc agg 576
Ala Phe Leu Ala Arg Leu Ile Trp Trp Leu Gln Tyr Phe Ile Thr Arg
180 185 190
acc gag gcg cac ttg caa gtg tgg atc ccc cct ctc aac gtt cgg ggg 624
Thr Glu Ala His Leu Gln Val Trp Ile Pro Pro Leu Asn Val Arg Gly
195 200 205
ggc cgt gat gcc atc atc ctc ctc atg tgc gtg gtc cat cca gag cta 672
Gly Arg Asp Ala Ile Ile Leu Leu Met Cys Val Val His Pro Glu Leu
210 215 220
att ttt gaa atc acc aag atc ttg ctc gcc ata ctg ggt ccg ccc atg 720
Ile Phe Glu Ile Thr Lys Ile Leu Leu Ala Ile Leu Gly Pro Pro Met
225 230 235 240
gtg ctc cag gcc ggc ctg att agg gtg ccg tac ttc gtg cgc gct caa 768
Val Leu Gln Ala Gly Leu Ile Arg Val Pro Tyr Phe Val Arg Ala Gln
245 250 255
ggg ctc att cgt gca tgc at 788
Gly Leu Ile Arg Ala Cys
260
<210> 16
<211> 262
<212> PRT
<213> Hepatitis C virus
<400> 16
Leu Leu Ala Leu Leu Ser Cys Leu Thr Ile Leu Ala Ser Ala Tyr Glu
1 5 10 15
Val Arg Asn Val Ser Gly Leu Tyr His Val Thr Asn Asp Cys Ser Asn
20 25 30
Ser Ser Ile Val Tyr Glu Ala Ala Asp Met Ile Met His Thr Pro Gly
35 40 45
Cys Val Pro Cys Val Arg Glu Asn Asn Arg Ser Arg Cys Trp Val Ala
50 55 60
Leu Thr Pro Thr Leu Ala Ala Arg Asn Ser Ser Ile Pro Thr Ala Thr
65 70 75 80
Ile Arg Arg His Val Asp Leu Leu Val Gly Ala Ala Ala Leu Cys Ser
85 90 95
Ala Met Tyr Val Gly Asp Leu Cys Gly Ser Val Phe Leu Val Phe Phe
100 105 110
Cys Ala Ala Trp Tyr Ile Lys Gly Lys Leu Val Pro Gly Ala Ala Tyr
115 120 125
Ala Phe Tyr Ser Val Trp Pro Leu Leu Leu Leu Leu Leu Ala Leu Pro
130 135 140
Pro Arg Ala Tyr Ala Met Asp Arg Glu Met Ala Ala Ser Cys Gly Gly
145 150 155 160
Gly Val Phe Ile Gly Leu Ile Ile Leu Thr Leu Ser Pro His Tyr Lys
165 170 175
Ala Phe Leu Ala Arg Leu Ile Trp Trp Leu Gln Tyr Phe Ile Thr Arg
180 185 190
Thr Glu Ala His Leu Gln Val Trp Ile Pro Pro Leu Asn Val Arg Gly
195 200 205
Gly Arg Asp Ala Ile Ile Leu Leu Met Cys Val Val His Pro Glu Leu
210 215 220
Ile Phe Glu Ile Thr Lys Ile Leu Leu Ala Ile Leu Gly Pro Pro Met
225 230 235 240
Val Leu Gln Ala Gly Leu Ile Arg Val Pro Tyr Phe Val Arg Ala Gln
245 250 255
Gly Leu Ile Arg Ala Cys
260
<210> 17
<211> 1504
<212> DNA
<213> Hepatitis C virus
<220>
<221> CDS
<222> (233)..(1504)
<400> 17
ccaggtcccc ccctcccggg agagccatag tggtctgcgg aaccggtgag tacaccggaa 60
ttgccaggac gaccgggtcc tttcttggat taacccgctc aatgcctgga gatttgggcg 120
tgcccccgcg agactgctag ccgagtagtg ttgggtcgcg aaaggccttg tggtactgcc 180
tgatagggtg cttgcgagtg ccccgggagg tctcgtagac cgtgcaccat gg 232
atg aat acc acc ggg ttc acc aag acg tgc ggg ggc ccc ccg tgt aac 280
Met Asn Thr Thr Gly Phe Thr Lys Thr Cys Gly Gly Pro Pro Cys Asn
1 5 10 15
atc ggg ggg gtc ggc aat aac acc ctg acc tgt ccc acg gac tgc ttc 328
Ile Gly Gly Val Gly Asn Asn Thr Leu Thr Cys Pro Thr Asp Cys Phe
20 25 30
cgg aag cac ccc gag gct acg tac acg cga tgc ggt tcg ggg cct tgg 376
Arg Lys His Pro Glu Ala Thr Tyr Thr Arg Cys Gly Ser Gly Pro Trp
35 40 45
ttg aca cct agg tgc atg gtt gat tac cca tac agg ctt tgg cac tac 424
Leu Thr Pro Arg Cys Met Val Asp Tyr Pro Tyr Arg Leu Trp His Tyr
50 55 60
ccc tgc act gtc aac ttt acc atc ttc aag gtt agg atg tac gtg ggg 472
Pro Cys Thr Val Asn Phe Thr Ile Phe Lys Val Arg Met Tyr Val Gly
65 70 75 80
ggc gtg gag cac agg ctt agt gct gca tgc aac tgg act cga gga gag 520
Gly Val Glu His Arg Leu Ser Ala Ala Cys Asn Trp Thr Arg Gly Glu
85 90 95
cgt tgt gac ttg gag gac agg gat aga tca gag ctc agt ccg cta ttg 568
Arg Cys Asp Leu Glu Asp Arg Asp Arg Ser Glu Leu Ser Pro Leu Leu
100 105 110
cta tcc acg aca gag tgg caa ata ctg ccc tgc tcc ttc acc acc cta 616
Leu Ser Thr Thr Glu Trp Gln Ile Leu Pro Cys Ser Phe Thr Thr Leu
115 120 125
ccg gct ctg tcc act ggt tta atc cat ctc cat cag aac atc gtg gac 664
Pro Ala Leu Ser Thr Gly Leu Ile His Leu His Gln Asn Ile Val Asp
130 135 140
gtg caa tac ctg tac ggt gta ggg tcg gcg gtt gtc tcc ttt gta atc 712
Val Gln Tyr Leu Tyr Gly Val Gly Ser Ala Val Val Ser Phe Val Ile
145 150 155 160
aag tgg gag tat gtc gtg ctg ctt ttc ctt ctc ctg gcg gac gcg cgt 760
Lys Trp Glu Tyr Val Val Leu Leu Phe Leu Leu Leu Ala Asp Ala Arg
165 170 175
gtc tgt gcc tgc ttg tgg atg atg ctg cta ata gcc cag gct gag gcc 808
Val Cys Ala Cys Leu Trp Met Met Leu Leu Ile Ala Gln Ala Glu Ala
180 185 190
gcc tta gag aac ctg gtg gtc ctc aat gcg gcg tcc ata gtc gga acg 856
Ala Leu Glu Asn Leu Val Val Leu Asn Ala Ala Ser Ile Val Gly Thr
195 200 205
cat ggc att ctc tcc ctc ctt gta ttc ttc tgt gcc gcc tgg tac atc 904
His Gly Ile Leu Ser Leu Leu Val Phe Phe Cys Ala Ala Trp Tyr Ile
210 215 220
aag ggc agg ctg gtc cct ggg gcg gca tat gtt ctt tat ggt gta tgg 952
Lys Gly Arg Leu Val Pro Gly Ala Ala Tyr Val Leu Tyr Gly Val Trp
225 230 235 240
ccg ctg ctc cgg ctc ctg ctg gcg tta cca caa cga gct tac gcc atg 1000
Pro Leu Leu Arg Leu Leu Leu Ala Leu Pro Gln Arg Ala Tyr Ala Met
245 250 255
gac cgg gag atg gct gca tca tgc gga ggc gcg gtt ttt ata ggc ttg 1048
Asp Arg Glu Met Ala Ala Ser Cys Gly Gly Ala Val Phe Ile Gly Leu
260 265 270
gca ctc ttg acc ttg tca cca tac tac aaa gtg ttc ctc gct agg ctc 1096
Ala Leu Leu Thr Leu Ser Pro Tyr Tyr Lys Val Phe Leu Ala Arg Leu
275 280 285
ata tgg tgg tta caa tac ttt atc act aga gcc gag gcg cac ttg caa 1144
Ile Trp Trp Leu Gln Tyr Phe Ile Thr Arg Ala Glu Ala His Leu Gln
290 295 300
gtg tgg gtc ccc ccc ctt aac gct cgg gga ggc cgc gat gcc atc atc 1192
Val Trp Val Pro Pro Leu Asn Ala Arg Gly Gly Arg Asp Ala Ile Ile
305 310 315 320
ctt ctc aca tgt gca gtc cat cca gag cta atc ttt gac atc acc aaa 1240
Leu Leu Thr Cys Ala Val His Pro Glu Leu Ile Phe Asp Ile Thr Lys
325 330 335
atc ctg ctc gcc ata ttt ggt cca ctc atg gtc ctc cag gct agt ata 1288
Ile Leu Leu Ala Ile Phe Gly Pro Leu Met Val Leu Gln Ala Ser Ile
340 345 350
act gca gtg ccg tac ttt gtg cgc gct caa ggg ctc att cgt gca tgc 1336
Thr Ala Val Pro Tyr Phe Val Arg Ala Gln Gly Leu Ile Arg Ala Cys
355 360 365
atg ttg gtg cgg aaa gtt gct ggg ggc cat tat gtc caa atg gcc ttc 1384
Met Leu Val Arg Lys Val Ala Gly Gly His Tyr Val Gln Met Ala Phe
370 375 380
atg aag ctg gca gca ctg aca ggt acg tac gtt tac gac cat ctt act 1432
Met Lys Leu Ala Ala Leu Thr Gly Thr Tyr Val Tyr Asp His Leu Thr
385 390 395 400
ccg ctg cgg gac tgg gcc cac gcg ggc cta cgg gac ctt gcg gtg gca 1480
Pro Leu Arg Asp Trp Ala His Ala Gly Leu Arg Asp Leu Ala Val Ala
405 410 415
gta gag ccc gtt gtc ttc tct gac 1504
Val Glu Pro Val Val Phe Ser Asp
420
<210> 18
<211> 424
<212> PRT
<213> Hepatitis C virus
<400> 18
Met Asn Thr Thr Gly Phe Thr Lys Thr Cys Gly Gly Pro Pro Cys Asn
1 5 10 15
Ile Gly Gly Val Gly Asn Asn Thr Leu Thr Cys Pro Thr Asp Cys Phe
20 25 30
Arg Lys His Pro Glu Ala Thr Tyr Thr Arg Cys Gly Ser Gly Pro Trp
35 40 45
Leu Thr Pro Arg Cys Met Val Asp Tyr Pro Tyr Arg Leu Trp His Tyr
50 55 60
Pro Cys Thr Val Asn Phe Thr Ile Phe Lys Val Arg Met Tyr Val Gly
65 70 75 80
Gly Val Glu His Arg Leu Ser Ala Ala Cys Asn Trp Thr Arg Gly Glu
85 90 95
Arg Cys Asp Leu Glu Asp Arg Asp Arg Ser Glu Leu Ser Pro Leu Leu
100 105 110
Leu Ser Thr Thr Glu Trp Gln Ile Leu Pro Cys Ser Phe Thr Thr Leu
115 120 125
Pro Ala Leu Ser Thr Gly Leu Ile His Leu His Gln Asn Ile Val Asp
130 135 140
Val Gln Tyr Leu Tyr Gly Val Gly Ser Ala Val Val Ser Phe Val Ile
145 150 155 160
Lys Trp Glu Tyr Val Val Leu Leu Phe Leu Leu Leu Ala Asp Ala Arg
165 170 175
Val Cys Ala Cys Leu Trp Met Met Leu Leu Ile Ala Gln Ala Glu Ala
180 185 190
Ala Leu Glu Asn Leu Val Val Leu Asn Ala Ala Ser Ile Val Gly Thr
195 200 205
His Gly Ile Leu Ser Leu Leu Val Phe Phe Cys Ala Ala Trp Tyr Ile
210 215 220
Lys Gly Arg Leu Val Pro Gly Ala Ala Tyr Val Leu Tyr Gly Val Trp
225 230 235 240
Pro Leu Leu Arg Leu Leu Leu Ala Leu Pro Gln Arg Ala Tyr Ala Met
245 250 255
Asp Arg Glu Met Ala Ala Ser Cys Gly Gly Ala Val Phe Ile Gly Leu
260 265 270
Ala Leu Leu Thr Leu Ser Pro Tyr Tyr Lys Val Phe Leu Ala Arg Leu
275 280 285
Ile Trp Trp Leu Gln Tyr Phe Ile Thr Arg Ala Glu Ala His Leu Gln
290 295 300
Val Trp Val Pro Pro Leu Asn Ala Arg Gly Gly Arg Asp Ala Ile Ile
305 310 315 320
Leu Leu Thr Cys Ala Val His Pro Glu Leu Ile Phe Asp Ile Thr Lys
325 330 335
Ile Leu Leu Ala Ile Phe Gly Pro Leu Met Val Leu Gln Ala Ser Ile
340 345 350
Thr Ala Val Pro Tyr Phe Val Arg Ala Gln Gly Leu Ile Arg Ala Cys
355 360 365
Met Leu Val Arg Lys Val Ala Gly Gly His Tyr Val Gln Met Ala Phe
370 375 380
Met Lys Leu Ala Ala Leu Thr Gly Thr Tyr Val Tyr Asp His Leu Thr
385 390 395 400
Pro Leu Arg Asp Trp Ala His Ala Gly Leu Arg Asp Leu Ala Val Ala
405 410 415
Val Glu Pro Val Val Phe Ser Asp
420
<210> 19
<211> 245
<212> DNA
<213> Hepatitis C virus
<220>
<221> CDS
<222> (1)..(243)
<400> 19
ctc ttg gct ttg ttg tcc tgt ttg acc gtc ccg act tcc gct tat gaa 48
Leu Leu Ala Leu Leu Ser Cys Leu Thr Val Pro Thr Ser Ala Tyr Glu
1 5 10 15
gtg cgc aac gtg tcc ggg atg tac caa gtc acg aac gac tgc tcc aac 96
Val Arg Asn Val Ser Gly Met Tyr Gln Val Thr Asn Asp Cys Ser Asn
20 25 30
tca agc att gtg tat gag gca gcg ggc gcg atc atc ttt gag att acc 144
Ser Ser Ile Val Tyr Glu Ala Ala Gly Ala Ile Ile Phe Glu Ile Thr
35 40 45
aaa atc ttg ctc gcc atg ctt ggt ccg ctc atg atg ctc cag gct ggc 192
Lys Ile Leu Leu Ala Met Leu Gly Pro Leu Met Met Leu Gln Ala Gly
50 55 60
cta att aga gtg ccg tac ttc gtg cgc gct caa ggg ctc att cgt gcg 240
Leu Ile Arg Val Pro Tyr Phe Val Arg Ala Gln Gly Leu Ile Arg Ala
65 70 75 80
tgc tc 245
Cys
<210> 20
<211> 81
<212> PRT
<213> Hepatitis C virus
<400> 20
Leu Leu Ala Leu Leu Ser Cys Leu Thr Val Pro Thr Ser Ala Tyr Glu
1 5 10 15
Val Arg Asn Val Ser Gly Met Tyr Gln Val Thr Asn Asp Cys Ser Asn
20 25 30
Ser Ser Ile Val Tyr Glu Ala Ala Gly Ala Ile Ile Phe Glu Ile Thr
35 40 45
Lys Ile Leu Leu Ala Met Leu Gly Pro Leu Met Met Leu Gln Ala Gly
50 55 60
Leu Ile Arg Val Pro Tyr Phe Val Arg Ala Gln Gly Leu Ile Arg Ala
65 70 75 80
Cys
<210> 21
<211> 1119
<212> DNA
<213> Hepatitis C virus
<220>
<221> CDS
<222> (229)..(1119)
<400> 21
ccaggacccc ccctcccggg agagccatag tggtctgcgg aaccggtgag tacaccggaa 60
ttgccaggac gaccgggtcc tttcttggat taacccgctc aatgcccgga gatttgggcg 120
tgcccccgca agactgctag ccgagtagtg ttgggtcgcg aagggccttg tggtactgcc 180
tgatagggtg cttgcgagtg ccccgggagg tctcgtagac cgtgcacc atg agc acg 237
Met Ser Thr
1
aat cct aaa ccc caa aga aaa acc aac cga aac acc aac cgc cgt cca 285
Asn Pro Lys Pro Gln Arg Lys Thr Asn Arg Asn Thr Asn Arg Arg Pro
5 10 15
cag gac gtt aag ttc ccg ggc ggt ggt cag atc gtc ggt gga gtt tac 333
Gln Asp Val Lys Phe Pro Gly Gly Gly Gln Ile Val Gly Gly Val Tyr
20 25 30 35
ctg ttg ccg cgc agg ggc ccc agg ttg ggt gtg cgc gcg act agg aag 381
Leu Leu Pro Arg Arg Gly Pro Arg Leu Gly Val Arg Ala Thr Arg Lys
40 45 50
act tcc gag cgg tcg caa cct cgt gga agg cga caa cct atc ccc aag 429
Thr Ser Glu Arg Ser Gln Pro Arg Gly Arg Arg Gln Pro Ile Pro Lys
55 60 65
gtt cgc cgg ccc gag ggc agg acc tgg gct cag ccc ggg tat cct tgg 477
Val Arg Arg Pro Glu Gly Arg Thr Trp Ala Gln Pro Gly Tyr Pro Trp
70 75 80
ccc ctc tat ggc aat gag ggc ttg ggg tgg gca gga tgg ctc ctg tca 525
Pro Leu Tyr Gly Asn Glu Gly Leu Gly Trp Ala Gly Trp Leu Leu Ser
85 90 95
ccc cgt ggc tac cgg cct agt tgg ggc ccc acg gac ccc cgg cgt agg 573
Pro Arg Gly Tyr Arg Pro Ser Trp Gly Pro Thr Asp Pro Arg Arg Arg
100 105 110 115
tcg cgt aat ttg ggt aag gtc atc gat acc ctc aca tgc ggc ttc gcc 621
Ser Arg Asn Leu Gly Lys Val Ile Asp Thr Leu Thr Cys Gly Phe Ala
120 125 130
gac ctc atg ggg tat att ccg ctt gtc ggc gcc cct tta gga ggc gct 669
Asp Leu Met Gly Tyr Ile Pro Leu Val Gly Ala Pro Leu Gly Gly Ala
135 140 145
gcc agg gcc ctg gca cat ggt gtc cgg gtt ctg gag gac ggc gtg aat 717
Ala Arg Ala Leu Ala His Gly Val Arg Val Leu Glu Asp Gly Val Asn
150 155 160
tat gca aca ggg aat ttg cct ggt tgc tct ttc tct atc ttc ctc ttg 765
Tyr Ala Thr Gly Asn Leu Pro Gly Cys Ser Phe Ser Ile Phe Leu Leu
165 170 175
gct ctg ctg tcc tgt ttt acc acc cca gct tcc gct tat gga gtg cgc 813
Ala Leu Leu Ser Cys Phe Thr Thr Pro Ala Ser Ala Tyr Gly Val Arg
180 185 190 195
acg tgc gcg gtc cat cca gag cca atc ttt gac atc acc aac ctc ctg 861
Thr Cys Ala Val His Pro Glu Pro Ile Phe Asp Ile Thr Asn Leu Leu
200 205 210
ctc gcc ata ctc ggc ccg ctc atg gtg ctc cag gct ggc ata act aga 909
Leu Ala Ile Leu Gly Pro Leu Met Val Leu Gln Ala Gly Ile Thr Arg
215 220 225
gtg ccg tac ttc gta cgc gct cag ggg ctc att cgt gca tgc atg tta 957
Val Pro Tyr Phe Val Arg Ala Gln Gly Leu Ile Arg Ala Cys Met Leu
230 235 240
gtg agg aaa gcg cct ggg ggt cat tat gtc caa atg gcc ctc atg agg 1005
Val Arg Lys Ala Pro Gly Gly His Tyr Val Gln Met Ala Leu Met Arg
245 250 255
ctg gcc gcg ctg aca ggt acg tac gtg tat gac cat ctc gcc cca ttg 1053
Leu Ala Ala Leu Thr Gly Thr Tyr Val Tyr Asp His Leu Ala Pro Leu
260 265 270 275
cag cat tgg gcc cac gcg ggc cta cga gac ctt gcg gtg gca gta gaa 1101
Gln His Trp Ala His Ala Gly Leu Arg Asp Leu Ala Val Ala Val Glu
280 285 290
ccc gtc atc ttc tct gac 1119
Pro Val Ile Phe Ser Asp
295
<210> 22
<211> 297
<212> PRT
<213> Hepatitis C virus
<400> 22
Met Ser Thr Asn Pro Lys Pro Gln Arg Lys Thr Asn Arg Asn Thr Asn
1 5 10 15
Arg Arg Pro Gln Asp Val Lys Phe Pro Gly Gly Gly Gln Ile Val Gly
20 25 30
Gly Val Tyr Leu Leu Pro Arg Arg Gly Pro Arg Leu Gly Val Arg Ala
35 40 45
Thr Arg Lys Thr Ser Glu Arg Ser Gln Pro Arg Gly Arg Arg Gln Pro
50 55 60
Ile Pro Lys Val Arg Arg Pro Glu Gly Arg Thr Trp Ala Gln Pro Gly
65 70 75 80
Tyr Pro Trp Pro Leu Tyr Gly Asn Glu Gly Leu Gly Trp Ala Gly Trp
85 90 95
Leu Leu Ser Pro Arg Gly Tyr Arg Pro Ser Trp Gly Pro Thr Asp Pro
100 105 110
Arg Arg Arg Ser Arg Asn Leu Gly Lys Val Ile Asp Thr Leu Thr Cys
115 120 125
Gly Phe Ala Asp Leu Met Gly Tyr Ile Pro Leu Val Gly Ala Pro Leu
130 135 140
Gly Gly Ala Ala Arg Ala Leu Ala His Gly Val Arg Val Leu Glu Asp
145 150 155 160
Gly Val Asn Tyr Ala Thr Gly Asn Leu Pro Gly Cys Ser Phe Ser Ile
165 170 175
Phe Leu Leu Ala Leu Leu Ser Cys Phe Thr Thr Pro Ala Ser Ala Tyr
180 185 190
Gly Val Arg Thr Cys Ala Val His Pro Glu Pro Ile Phe Asp Ile Thr
195 200 205
Asn Leu Leu Leu Ala Ile Leu Gly Pro Leu Met Val Leu Gln Ala Gly
210 215 220
Ile Thr Arg Val Pro Tyr Phe Val Arg Ala Gln Gly Leu Ile Arg Ala
225 230 235 240
Cys Met Leu Val Arg Lys Ala Pro Gly Gly His Tyr Val Gln Met Ala
245 250 255
Leu Met Arg Leu Ala Ala Leu Thr Gly Thr Tyr Val Tyr Asp His Leu
260 265 270
Ala Pro Leu Gln His Trp Ala His Ala Gly Leu Arg Asp Leu Ala Val
275 280 285
Ala Val Glu Pro Val Ile Phe Ser Asp
290 295
<210> 23
<211> 1143
<212> DNA
<213> Hepatitis C virus
<220>
<221> CDS
<222> (229)..(1143)
<400> 23
ccaggtcccc ccctcccggg agagccatag tggtctgcgg aaccggtgag tacaccggaa 60
ttgccaggac gaccgggtcc tttcttggat caacccgctc aatgcctgga gatttgggcg 120
tgcccccgcg agactactag ccgagtagtg ttgggtcgcg aaaggccttg tggtactgcc 180
tgatagggtg cttgcgagtg ccccgggagg tctcgtagac cgtgcatc atg agc aca 237
Met Ser Thr
1
aat cct aaa cct caa aga aaa acc aaa cgt aac acc aac cgc cgc cca 285
Asn Pro Lys Pro Gln Arg Lys Thr Lys Arg Asn Thr Asn Arg Arg Pro
5 10 15
cag gac gtc aag ttc ccg ggc ggt ggc cag atc gtt ggt gga gtt tac 333
Gln Asp Val Lys Phe Pro Gly Gly Gly Gln Ile Val Gly Gly Val Tyr
20 25 30 35
ctg ttg ccg cgc agg ggc ccc agg ttg ggt gtg cgc gcg act agg aag 381
Leu Leu Pro Arg Arg Gly Pro Arg Leu Gly Val Arg Ala Thr Arg Lys
40 45 50
act tcc gag cgg tcg caa cct cgt gga agg cga caa cct atc ccc aag 429
Thr Ser Glu Arg Ser Gln Pro Arg Gly Arg Arg Gln Pro Ile Pro Lys
55 60 65
gct cgc cga ccc gag ggc agg gcc tgg gca cag ccc ggg tac cct tgg 477
Ala Arg Arg Pro Glu Gly Arg Ala Trp Ala Gln Pro Gly Tyr Pro Trp
70 75 80
cct ctc tat ggc aat gag ggc ctg ggg tgg gct gga tgg ctc ctg tca 525
Pro Leu Tyr Gly Asn Glu Gly Leu Gly Trp Ala Gly Trp Leu Leu Ser
85 90 95
ccc cgc ggc tcc cgg cct agt tgg ggc ccc acg gac ccc cgg cgt agg 573
Pro Arg Gly Ser Arg Pro Ser Trp Gly Pro Thr Asp Pro Arg Arg Arg
100 105 110 115
tcg cgc aat ttg ggt aag gtc atc gat acc ctc act tgc ggc ttc gcc 621
Ser Arg Asn Leu Gly Lys Val Ile Asp Thr Leu Thr Cys Gly Phe Ala
120 125 130
gac ctc atg ggg tac att ccg ctc gtc ggc gcc ccc cta gga ggt gct 669
Asp Leu Met Gly Tyr Ile Pro Leu Val Gly Ala Pro Leu Gly Gly Ala
135 140 145
gcc agg gcc ctg gcg cat ggc gtc cgg gtt ctg gaa gac ggc gtg aac 717
Ala Arg Ala Leu Ala His Gly Val Arg Val Leu Glu Asp Gly Val Asn
150 155 160
tac gca aca ggg aat ttg ccc ggt tgc tct ttc tct atc ttc ctc ttg 765
Tyr Ala Thr Gly Asn Leu Pro Gly Cys Ser Phe Ser Ile Phe Leu Leu
165 170 175
gct tta ctg tcc tgt ttg acc atc cca gct tcc gct cat caa gtg cgc 813
Ala Leu Leu Ser Cys Leu Thr Ile Pro Ala Ser Ala His Gln Val Arg
180 185 190 195
aac gtg tcc ggg gtg tac cat gtc acg aac aac tgc tcc aac tca aga 861
Asn Val Ser Gly Val Tyr His Val Thr Asn Asn Cys Ser Asn Ser Arg
200 205 210
att gtg gac atc acc aag att ttg ctc gcc ata ttt ggc ccg ctc atg 909
Ile Val Asp Ile Thr Lys Ile Leu Leu Ala Ile Phe Gly Pro Leu Met
215 220 225
gcg ctc cag gct ggt tta act aga gtg ccg tac ttt gta cgc gct cat 957
Ala Leu Gln Ala Gly Leu Thr Arg Val Pro Tyr Phe Val Arg Ala His
230 235 240
ggg ctc atc cgt gtg tgc atg ttg gtg cgg aaa gtc tct ggg ggt cat 1005
Gly Leu Ile Arg Val Cys Met Leu Val Arg Lys Val Ser Gly Gly His
245 250 255
tac gtc cag atg gct ctc atg agg ctg gcc gca ctg acg ggc acg tac 1053
Tyr Val Gln Met Ala Leu Met Arg Leu Ala Ala Leu Thr Gly Thr Tyr
260 265 270 275
gtc tat aac cat ctt act ccg ctg cgg gac tgg gcc cac gtg ggc ctg 1101
Val Tyr Asn His Leu Thr Pro Leu Arg Asp Trp Ala His Val Gly Leu
280 285 290
cga gac ctt gca gtg gca gtt gag cct gtc atc ttc tct gac 1143
Arg Asp Leu Ala Val Ala Val Glu Pro Val Ile Phe Ser Asp
295 300 305
<210> 24
<211> 305
<212> PRT
<213> Hepatitis C virus
<400> 24
Met Ser Thr Asn Pro Lys Pro Gln Arg Lys Thr Lys Arg Asn Thr Asn
1 5 10 15
Arg Arg Pro Gln Asp Val Lys Phe Pro Gly Gly Gly Gln Ile Val Gly
20 25 30
Gly Val Tyr Leu Leu Pro Arg Arg Gly Pro Arg Leu Gly Val Arg Ala
35 40 45
Thr Arg Lys Thr Ser Glu Arg Ser Gln Pro Arg Gly Arg Arg Gln Pro
50 55 60
Ile Pro Lys Ala Arg Arg Pro Glu Gly Arg Ala Trp Ala Gln Pro Gly
65 70 75 80
Tyr Pro Trp Pro Leu Tyr Gly Asn Glu Gly Leu Gly Trp Ala Gly Trp
85 90 95
Leu Leu Ser Pro Arg Gly Ser Arg Pro Ser Trp Gly Pro Thr Asp Pro
100 105 110
Arg Arg Arg Ser Arg Asn Leu Gly Lys Val Ile Asp Thr Leu Thr Cys
115 120 125
Gly Phe Ala Asp Leu Met Gly Tyr Ile Pro Leu Val Gly Ala Pro Leu
130 135 140
Gly Gly Ala Ala Arg Ala Leu Ala His Gly Val Arg Val Leu Glu Asp
145 150 155 160
Gly Val Asn Tyr Ala Thr Gly Asn Leu Pro Gly Cys Ser Phe Ser Ile
165 170 175
Phe Leu Leu Ala Leu Leu Ser Cys Leu Thr Ile Pro Ala Ser Ala His
180 185 190
Gln Val Arg Asn Val Ser Gly Val Tyr His Val Thr Asn Asn Cys Ser
195 200 205
Asn Ser Arg Ile Val Asp Ile Thr Lys Ile Leu Leu Ala Ile Phe Gly
210 215 220
Pro Leu Met Ala Leu Gln Ala Gly Leu Thr Arg Val Pro Tyr Phe Val
225 230 235 240
Arg Ala His Gly Leu Ile Arg Val Cys Met Leu Val Arg Lys Val Ser
245 250 255
Gly Gly His Tyr Val Gln Met Ala Leu Met Arg Leu Ala Ala Leu Thr
260 265 270
Gly Thr Tyr Val Tyr Asn His Leu Thr Pro Leu Arg Asp Trp Ala His
275 280 285
Val Gly Leu Arg Asp Leu Ala Val Ala Val Glu Pro Val Ile Phe Ser
290 295 300
Asp
305
<210> 25
<211> 844
<212> DNA
<213> Hepatitis C virus
<220>
<221> CDS
<222> (1)..(843)
<400> 25
ctt ttg act tta ctg tcc tgt ttg acc atc cca gct tcc gct cat caa 48
Leu Leu Thr Leu Leu Ser Cys Leu Thr Ile Pro Ala Ser Ala His Gln
1 5 10 15
gtg cgc aac gtg tcc ggg gtg tac cat gtc atg aac aac tgc tcc aac 96
Val Arg Asn Val Ser Gly Val Tyr His Val Met Asn Asn Cys Ser Asn
20 25 30
tca agt att gtg gac atc acc aag att ttg ctc gcc ata ttt ggc ccg 144
Ser Ser Ile Val Asp Ile Thr Lys Ile Leu Leu Ala Ile Phe Gly Pro
35 40 45
ctc atg gtg ctc cag gct ggt tta act aga gtg ccg tac ttc gta cgc 192
Leu Met Val Leu Gln Ala Gly Leu Thr Arg Val Pro Tyr Phe Val Arg
50 55 60
gct cat ggg ctc atc cgt gtg tgc atg ttg gtg cgg aaa gtc tct ggg 240
Ala His Gly Leu Ile Arg Val Cys Met Leu Val Arg Lys Val Ser Gly
65 70 75 80
ggt cat tac gtc cag atg gct ctc atg agg ctg gcc gca ctg acg ggt 288
Gly His Tyr Val Gln Met Ala Leu Met Arg Leu Ala Ala Leu Thr Gly
85 90 95
acg tac gtc tat aac cat ctt act ccg ctg cgg gac tgg ggc cac gcg 336
Thr Tyr Val Tyr Asn His Leu Thr Pro Leu Arg Asp Trp Gly His Ala
100 105 110
ggc ctg cga gac ctt gca gtg gca gtt gag cct gtc atc ttc tct gac 384
Gly Leu Arg Asp Leu Ala Val Ala Val Glu Pro Val Ile Phe Ser Asp
115 120 125
atg gag acc aag atc atc acc tgg ggg gcg gac acc gcg gcg tgc ggg 432
Met Glu Thr Lys Ile Ile Thr Trp Gly Ala Asp Thr Ala Ala Cys Gly
130 135 140
gac atc atc tca ggt cta ccc gtc tcc gcc cga agg ggg agg gag ata 480
Asp Ile Ile Ser Gly Leu Pro Val Ser Ala Arg Arg Gly Arg Glu Ile
145 150 155 160
tta ctg gga ccg gcc gac agt ttt gga gag cga ggg tgg cga ctc ctt 528
Leu Leu Gly Pro Ala Asp Ser Phe Gly Glu Arg Gly Trp Arg Leu Leu
165 170 175
gcg cct att acg gcc tac tcc caa caa acc cgg ggc ctg ctt ggc tgc 576
Ala Pro Ile Thr Ala Tyr Ser Gln Gln Thr Arg Gly Leu Leu Gly Cys
180 185 190
atc atc act agc ctt aca ggt cgg gac aag aac cag gtt gag ggg gag 624
Ile Ile Thr Ser Leu Thr Gly Arg Asp Lys Asn Gln Val Glu Gly Glu
195 200 205
gtt cag gtg gtt tcc acc gca acg caa tct ttc ccg gcg acc tgc gtc 672
Val Gln Val Val Ser Thr Ala Thr Gln Ser Phe Pro Ala Thr Cys Val
210 215 220
aac ggc gta cgt tgg act gtc tac cat ggt gcc ggc tca aag acc cta 720
Asn Gly Val Arg Trp Thr Val Tyr His Gly Ala Gly Ser Lys Thr Leu
225 230 235 240
gcc ggc cca aag ggc cca gtc acc cag atg tac acc aat gta gac cgg 768
Ala Gly Pro Lys Gly Pro Val Thr Gln Met Tyr Thr Asn Val Asp Arg
245 250 255
gac ctc gtt ggt tgg ccg gcg ccc tct ggg gcg cgc tcc ttg aca cca 816
Asp Leu Val Gly Trp Pro Ala Pro Ser Gly Ala Arg Ser Leu Thr Pro
260 265 270
tgc acc tgt ggc agc tca gac ctc tac c 844
Cys Thr Cys Gly Ser Ser Asp Leu Tyr
275 280
<210> 26
<211> 281
<212> PRT
<213> Hepatitis C virus
<400> 26
Leu Leu Thr Leu Leu Ser Cys Leu Thr Ile Pro Ala Ser Ala His Gln
1 5 10 15
Val Arg Asn Val Ser Gly Val Tyr His Val Met Asn Asn Cys Ser Asn
20 25 30
Ser Ser Ile Val Asp Ile Thr Lys Ile Leu Leu Ala Ile Phe Gly Pro
35 40 45
Leu Met Val Leu Gln Ala Gly Leu Thr Arg Val Pro Tyr Phe Val Arg
50 55 60
Ala His Gly Leu Ile Arg Val Cys Met Leu Val Arg Lys Val Ser Gly
65 70 75 80
Gly His Tyr Val Gln Met Ala Leu Met Arg Leu Ala Ala Leu Thr Gly
85 90 95
Thr Tyr Val Tyr Asn His Leu Thr Pro Leu Arg Asp Trp Gly His Ala
100 105 110
Gly Leu Arg Asp Leu Ala Val Ala Val Glu Pro Val Ile Phe Ser Asp
115 120 125
Met Glu Thr Lys Ile Ile Thr Trp Gly Ala Asp Thr Ala Ala Cys Gly
130 135 140
Asp Ile Ile Ser Gly Leu Pro Val Ser Ala Arg Arg Gly Arg Glu Ile
145 150 155 160
Leu Leu Gly Pro Ala Asp Ser Phe Gly Glu Arg Gly Trp Arg Leu Leu
165 170 175
Ala Pro Ile Thr Ala Tyr Ser Gln Gln Thr Arg Gly Leu Leu Gly Cys
180 185 190
Ile Ile Thr Ser Leu Thr Gly Arg Asp Lys Asn Gln Val Glu Gly Glu
195 200 205
Val Gln Val Val Ser Thr Ala Thr Gln Ser Phe Pro Ala Thr Cys Val
210 215 220
Asn Gly Val Arg Trp Thr Val Tyr His Gly Ala Gly Ser Lys Thr Leu
225 230 235 240
Ala Gly Pro Lys Gly Pro Val Thr Gln Met Tyr Thr Asn Val Asp Arg
245 250 255
Asp Leu Val Gly Trp Pro Ala Pro Ser Gly Ala Arg Ser Leu Thr Pro
260 265 270
Cys Thr Cys Gly Ser Ser Asp Leu Tyr
275 280
<210> 27
<211> 1042
<212> DNA
<213> Hepatitis C virus
<220>
<221> CDS
<222> (229)..(1041)
<400> 27
ccaggtcccc ccctcccggg agagccatag tggtctgcgg aaccggtgag tacaccggaa 60
ttgccaggac gaccgggtcc tttcttggat caacccgctc aatgcctgga gatttgggcg 120
tgcccccgcg agactactag ccgagtagtg ttgggtcgcg aaaggccttg tggtactgcc 180
tgatagggtg cttgcgagtg ccccgggagg tctcgtagac cgtgcatc atg agc aca 237
Met Ser Thr
1
aat cct aaa cct caa aga aaa acc aaa cgt aac acc aac cgc cgc cca 285
Asn Pro Lys Pro Gln Arg Lys Thr Lys Arg Asn Thr Asn Arg Arg Pro
5 10 15
cag gac gtc aag ttc ccg ggc ggt ggc cag atc gtt ggt gga gtt tac 333
Gln Asp Val Lys Phe Pro Gly Gly Gly Gln Ile Val Gly Gly Val Tyr
20 25 30 35
ctg ttg ccg cgc agg ggc ccc agg ttg ggt gtg cgc gcg act agg aag 381
Leu Leu Pro Arg Arg Gly Pro Arg Leu Gly Val Arg Ala Thr Arg Lys
40 45 50
act tcc gag cgg tcg caa cct cgt gga agg cga caa cct atc ccc aag 429
Thr Ser Glu Arg Ser Gln Pro Arg Gly Arg Arg Gln Pro Ile Pro Lys
55 60 65
gct cgc cga ccc gag ggc agg gcc tgg gca cag ccc ggg tac cct tgg 477
Ala Arg Arg Pro Glu Gly Arg Ala Trp Ala Gln Pro Gly Tyr Pro Trp
70 75 80
cct ctc tat ggc aat gag ggc ctg ggg tgg gca gga tgg ttc ctg tca 525
Pro Leu Tyr Gly Asn Glu Gly Leu Gly Trp Ala Gly Trp Phe Leu Ser
85 90 95
ccc cgc ggc tcc cgg cct agt tgg ggc ccc acg gac ccc cgg cgt agg 573
Pro Arg Gly Ser Arg Pro Ser Trp Gly Pro Thr Asp Pro Arg Arg Arg
100 105 110 115
tcg cgc aat ttg ggt aag gtc atc gat acc ctc act tgc ggc ttc gcc 621
Ser Arg Asn Leu Gly Lys Val Ile Asp Thr Leu Thr Cys Gly Phe Ala
120 125 130
gac ctc atg ggg tac att ccg ctc gtc ggc gcc ccc tta gga ggt gct 669
Asp Leu Met Gly Tyr Ile Pro Leu Val Gly Ala Pro Leu Gly Gly Ala
135 140 145
gcc agg gcc ctg gcg cat ggc gtc cgg gtt ctg gaa gac ggc gtg gac 717
Ala Arg Ala Leu Ala His Gly Val Arg Val Leu Glu Asp Gly Val Asp
150 155 160
tac gca aca ggg aat ttg ccc ggt tgc tct ttc tct atc ttc ctc ttg 765
Tyr Ala Thr Gly Asn Leu Pro Gly Cys Ser Phe Ser Ile Phe Leu Leu
165 170 175
gct tta ctg tcc tgt ttg acc atc cca gct tcc gct cat caa gtg cgc 813
Ala Leu Leu Ser Cys Leu Thr Ile Pro Ala Ser Ala His Gln Val Arg
180 185 190 195
gac gtg tcc ggg gtg tac cat gtc acg aac aac tgc tcc aac tca agt 861
Asp Val Ser Gly Val Tyr His Val Thr Asn Asn Cys Ser Asn Ser Ser
200 205 210
att gtg gtc atc acc aag att ctg ctc gcc ata ttt ggc ccg ctc atg 909
Ile Val Val Ile Thr Lys Ile Leu Leu Ala Ile Phe Gly Pro Leu Met
215 220 225
gcg ctc cag gct ggt tta act aga gtg ccg tac ttc gta cgc gct cat 957
Ala Leu Gln Ala Gly Leu Thr Arg Val Pro Tyr Phe Val Arg Ala His
230 235 240
ggg ctc atc cgt gta tgc atg ttg gtg cgg aaa gtc tct ggg ggt cat 1005
Gly Leu Ile Arg Val Cys Met Leu Val Arg Lys Val Ser Gly Gly His
245 250 255
tac gtc cag atg gct ctc atg agg ctg gcc gca ctg a 1042
Tyr Val Gln Met Ala Leu Met Arg Leu Ala Ala Leu
260 265 270
<210> 28
<211> 271
<212> PRT
<213> Hepatitis C virus
<400> 28
Met Ser Thr Asn Pro Lys Pro Gln Arg Lys Thr Lys Arg Asn Thr Asn
1 5 10 15
Arg Arg Pro Gln Asp Val Lys Phe Pro Gly Gly Gly Gln Ile Val Gly
20 25 30
Gly Val Tyr Leu Leu Pro Arg Arg Gly Pro Arg Leu Gly Val Arg Ala
35 40 45
Thr Arg Lys Thr Ser Glu Arg Ser Gln Pro Arg Gly Arg Arg Gln Pro
50 55 60
Ile Pro Lys Ala Arg Arg Pro Glu Gly Arg Ala Trp Ala Gln Pro Gly
65 70 75 80
Tyr Pro Trp Pro Leu Tyr Gly Asn Glu Gly Leu Gly Trp Ala Gly Trp
85 90 95
Phe Leu Ser Pro Arg Gly Ser Arg Pro Ser Trp Gly Pro Thr Asp Pro
100 105 110
Arg Arg Arg Ser Arg Asn Leu Gly Lys Val Ile Asp Thr Leu Thr Cys
115 120 125
Gly Phe Ala Asp Leu Met Gly Tyr Ile Pro Leu Val Gly Ala Pro Leu
130 135 140
Gly Gly Ala Ala Arg Ala Leu Ala His Gly Val Arg Val Leu Glu Asp
145 150 155 160
Gly Val Asp Tyr Ala Thr Gly Asn Leu Pro Gly Cys Ser Phe Ser Ile
165 170 175
Phe Leu Leu Ala Leu Leu Ser Cys Leu Thr Ile Pro Ala Ser Ala His
180 185 190
Gln Val Arg Asp Val Ser Gly Val Tyr His Val Thr Asn Asn Cys Ser
195 200 205
Asn Ser Ser Ile Val Val Ile Thr Lys Ile Leu Leu Ala Ile Phe Gly
210 215 220
Pro Leu Met Ala Leu Gln Ala Gly Leu Thr Arg Val Pro Tyr Phe Val
225 230 235 240
Arg Ala His Gly Leu Ile Arg Val Cys Met Leu Val Arg Lys Val Ser
245 250 255
Gly Gly His Tyr Val Gln Met Ala Leu Met Arg Leu Ala Ala Leu
260 265 270
<210> 29
<211> 232
<212> DNA
<213> Hepatitis C virus
<220>
<221> CDS
<222> (1)..(231)
<400> 29
ctc ttg gct ttg ctg tcc tgt ttg acc att cca gcc tcc gcc cat gtc 48
Leu Leu Ala Leu Leu Ser Cys Leu Thr Ile Pro Ala Ser Ala His Val
1 5 10 15
ccc cct ctc aac gtc cgg gga ggc cgc gac gcc atc atc ctt ctc aca 96
Pro Pro Leu Asn Val Arg Gly Gly Arg Asp Ala Ile Ile Leu Leu Thr
20 25 30
tgt gcg gtc cac tca gag cta gtt ttt aaa atc acc aaa atc ttg ctt 144
Cys Ala Val His Ser Glu Leu Val Phe Lys Ile Thr Lys Ile Leu Leu
35 40 45
gca ata ctt ggt ccg ctc atg gtg ctc cag gct ggt ctc att agg gtg 192
Ala Ile Leu Gly Pro Leu Met Val Leu Gln Ala Gly Leu Ile Arg Val
50 55 60
ccg tac ttc gtg cgc gcc caa ggg ctt atc cgt gca tgc a 232
Pro Tyr Phe Val Arg Ala Gln Gly Leu Ile Arg Ala Cys
65 70 75
<210> 30
<211> 77
<212> PRT
<213> Hepatitis C virus
<400> 30
Leu Leu Ala Leu Leu Ser Cys Leu Thr Ile Pro Ala Ser Ala His Val
1 5 10 15
Pro Pro Leu Asn Val Arg Gly Gly Arg Asp Ala Ile Ile Leu Leu Thr
20 25 30
Cys Ala Val His Ser Glu Leu Val Phe Lys Ile Thr Lys Ile Leu Leu
35 40 45
Ala Ile Leu Gly Pro Leu Met Val Leu Gln Ala Gly Leu Ile Arg Val
50 55 60
Pro Tyr Phe Val Arg Ala Gln Gly Leu Ile Arg Ala Cys
65 70 75
<210> 31
<211> 1823
<212> DNA
<213> Hepatitis C virus
<220>
<221> CDS
<222> (228)..(1823)
<400> 31
ccaggtcccc cctcccggga gagccatagt ggtctgcgga accggtgagt acaccggaat 60
tgccaggacg accgggtcct ttcttggatc aacccgctca atgcctggag atttgggcgt 120
gcccccgcga gactgctagc cgagtagtgt tgggtcgcga aaggccttgt ggtactgcct 180
gatagggtgc ttgcgagtgc cccgggaggt ctcgtagacc gtgcatc atg agc aca 236
Met Ser Thr
1
aat cct aaa cct caa aga aaa acc aaa cgt aac acc aac cgc cgc cca 284
Asn Pro Lys Pro Gln Arg Lys Thr Lys Arg Asn Thr Asn Arg Arg Pro
5 10 15
cag gac gtc aag ttc ccg ggc ggt ggt cag atc gtt ggt gga gtt tac 332
Gln Asp Val Lys Phe Pro Gly Gly Gly Gln Ile Val Gly Gly Val Tyr
20 25 30 35
ctg ttg ccg cgc agg ggc ccc agg ttg ggt gtg cgc gcg act agg aag 380
Leu Leu Pro Arg Arg Gly Pro Arg Leu Gly Val Arg Ala Thr Arg Lys
40 45 50
act tcc gag cgg tcg caa cct cgt gga agg cga caa cct atc ccc aag 428
Thr Ser Glu Arg Ser Gln Pro Arg Gly Arg Arg Gln Pro Ile Pro Lys
55 60 65
gct cgc cag ccc gag ggt agg gcc tgg gct cag ccc ggg tac cct tgg 476
Ala Arg Gln Pro Glu Gly Arg Ala Trp Ala Gln Pro Gly Tyr Pro Trp
70 75 80
ccc ctc tac ggc aat gag ggc ctg ggg tgg gca gga tgg ctc ctg tca 524
Pro Leu Tyr Gly Asn Glu Gly Leu Gly Trp Ala Gly Trp Leu Leu Ser
85 90 95
ccc cgc ggc tct cgg cct agt tgg ggc ccc aca gac ccc cgg cgt agg 572
Pro Arg Gly Ser Arg Pro Ser Trp Gly Pro Thr Asp Pro Arg Arg Arg
100 105 110 115
tcg cgt aat ttg ggt aag gtc atc gat acc ctt aca tgc ggc ttc gcc 620
Ser Arg Asn Leu Gly Lys Val Ile Asp Thr Leu Thr Cys Gly Phe Ala
120 125 130
gac ctc acg ggg tac atc ccg ctc gtc ggc gcc ccc cta ggg ggc gct 668
Asp Leu Thr Gly Tyr Ile Pro Leu Val Gly Ala Pro Leu Gly Gly Ala
135 140 145
gcc agg gcc ttg gcg cat ggc gtc cgg gtt ctg gag gac ggc gtg aac 716
Ala Arg Ala Leu Ala His Gly Val Arg Val Leu Glu Asp Gly Val Asn
150 155 160
tat gca aca ggg aac ctt ccc ggt tgc tct ttc tct atc ttc ctc ttg 764
Tyr Ala Thr Gly Asn Leu Pro Gly Cys Ser Phe Ser Ile Phe Leu Leu
165 170 175
gct ttg ctg tcc tgt ttg acc att cca gcc tcc gcc cat gtc ccc cct 812
Ala Leu Leu Ser Cys Leu Thr Ile Pro Ala Ser Ala His Val Pro Pro
180 185 190 195
ctc aac gtc cgg gga ggc cgc gac gcc atc atc ctt ctc aca tgt gcg 860
Leu Asn Val Arg Gly Gly Arg Asp Ala Ile Ile Leu Leu Thr Cys Ala
200 205 210
gtc cac tca gag cta gtt ttt aaa atc acc aaa atc ttg ctt gca ata 908
Val His Ser Glu Leu Val Phe Lys Ile Thr Lys Ile Leu Leu Ala Ile
215 220 225
ctt ggt ccg ctc atg gtg ctc cag gct ggt ctc gtt agg gtg ccg tac 956
Leu Gly Pro Leu Met Val Leu Gln Ala Gly Leu Val Arg Val Pro Tyr
230 235 240
ttc gtg cgc gcc caa ggg ctt atc cgt gca tgc atg ttg gtg cgg aag 1004
Phe Val Arg Ala Gln Gly Leu Ile Arg Ala Cys Met Leu Val Arg Lys
245 250 255
atc gct ggg ggt cat tat gtc caa atg gct ttc gtg aag ctg gcc gca 1052
Ile Ala Gly Gly His Tyr Val Gln Met Ala Phe Val Lys Leu Ala Ala
260 265 270 275
ctg acg ggc acg tac gtc tat gac cat ctt act cca ctg cgg gac tgg 1100
Leu Thr Gly Thr Tyr Val Tyr Asp His Leu Thr Pro Leu Arg Asp Trp
280 285 290
gcc cac acg ggc ctg cga gac ctc gcg gtg gcg gtc gag ccc gtc gtc 1148
Ala His Thr Gly Leu Arg Asp Leu Ala Val Ala Val Glu Pro Val Val
295 300 305
ttc tct gac atg ggg acc aag atc atc acc tgg ggg gcg gac acc gcg 1196
Phe Ser Asp Met Gly Thr Lys Ile Ile Thr Trp Gly Ala Asp Thr Ala
310 315 320
gcg tgc ggg gac atc atc tcg ggt ctg ccc gtc tcc gct cgg agg ggg 1244
Ala Cys Gly Asp Ile Ile Ser Gly Leu Pro Val Ser Ala Arg Arg Gly
325 330 335
agg gag ata ctc ctg gga ctg gcc gat agt ttc gga gag cag gga tgg 1292
Arg Glu Ile Leu Leu Gly Leu Ala Asp Ser Phe Gly Glu Gln Gly Trp
340 345 350 355
cga ctc ctt gcg cct atc acg gcc tac tcc caa cag acg cgg ggt tta 1340
Arg Leu Leu Ala Pro Ile Thr Ala Tyr Ser Gln Gln Thr Arg Gly Leu
360 365 370
ctt ggc tgc atc atc act agc ctc aca ggc cgg gac aag aac cag gtc 1388
Leu Gly Cys Ile Ile Thr Ser Leu Thr Gly Arg Asp Lys Asn Gln Val
375 380 385
gag ggg gaa gtc cag gtg gtt tcc acc gca acg cag tct ttc ctc gcg 1436
Glu Gly Glu Val Gln Val Val Ser Thr Ala Thr Gln Ser Phe Leu Ala
390 395 400
aca tgt gta aat ggt gtg tgt tgg act gtc tac cat ggt gcc ggc tca 1484
Thr Cys Val Asn Gly Val Cys Trp Thr Val Tyr His Gly Ala Gly Ser
405 410 415
aag acc tta gcc ggc cct aag ggt ccg atc act caa atg tac acc aat 1532
Lys Thr Leu Ala Gly Pro Lys Gly Pro Ile Thr Gln Met Tyr Thr Asn
420 425 430 435
gtg gac cag gac ctc gtt ggc tgg cag gcg ccc cct ggg gcg cgt tcc 1580
Val Asp Gln Asp Leu Val Gly Trp Gln Ala Pro Pro Gly Ala Arg Ser
440 445 450
atg aca cca tgc acc tgc ggc agc tcg gac ctc tac ctg gtc acg aga 1628
Met Thr Pro Cys Thr Cys Gly Ser Ser Asp Leu Tyr Leu Val Thr Arg
455 460 465
cat gcc gat gtc att ccg gtg cgt cgg cgg ggc gac agc aga ggg agc 1676
His Ala Asp Val Ile Pro Val Arg Arg Arg Gly Asp Ser Arg Gly Ser
470 475 480
cta ctc tcc ccc agg cct gtg tcc tat ttg aag ggc tcc tcg ggt ggt 1724
Leu Leu Ser Pro Arg Pro Val Ser Tyr Leu Lys Gly Ser Ser Gly Gly
485 490 495
cca ctg ctc tgc ccc ttg ggg cac gtc gtg ggc atc ttc cgg gct gct 1772
Pro Leu Leu Cys Pro Leu Gly His Val Val Gly Ile Phe Arg Ala Ala
500 505 510 515
gtg tgc acc cgg ggg gtt gcg aag gcg gtg gac ttt gta ccc gtt gag 1820
Val Cys Thr Arg Gly Val Ala Lys Ala Val Asp Phe Val Pro Val Glu
520 525 530
tct 1823
Ser
<210> 32
<211> 532
<212> PRT
<213> Hepatitis C virus
<400> 32
Met Ser Thr Asn Pro Lys Pro Gln Arg Lys Thr Lys Arg Asn Thr Asn
1 5 10 15
Arg Arg Pro Gln Asp Val Lys Phe Pro Gly Gly Gly Gln Ile Val Gly
20 25 30
Gly Val Tyr Leu Leu Pro Arg Arg Gly Pro Arg Leu Gly Val Arg Ala
35 40 45
Thr Arg Lys Thr Ser Glu Arg Ser Gln Pro Arg Gly Arg Arg Gln Pro
50 55 60
Ile Pro Lys Ala Arg Gln Pro Glu Gly Arg Ala Trp Ala Gln Pro Gly
65 70 75 80
Tyr Pro Trp Pro Leu Tyr Gly Asn Glu Gly Leu Gly Trp Ala Gly Trp
85 90 95
Leu Leu Ser Pro Arg Gly Ser Arg Pro Ser Trp Gly Pro Thr Asp Pro
100 105 110
Arg Arg Arg Ser Arg Asn Leu Gly Lys Val Ile Asp Thr Leu Thr Cys
115 120 125
Gly Phe Ala Asp Leu Thr Gly Tyr Ile Pro Leu Val Gly Ala Pro Leu
130 135 140
Gly Gly Ala Ala Arg Ala Leu Ala His Gly Val Arg Val Leu Glu Asp
145 150 155 160
Gly Val Asn Tyr Ala Thr Gly Asn Leu Pro Gly Cys Ser Phe Ser Ile
165 170 175
Phe Leu Leu Ala Leu Leu Ser Cys Leu Thr Ile Pro Ala Ser Ala His
180 185 190
Val Pro Pro Leu Asn Val Arg Gly Gly Arg Asp Ala Ile Ile Leu Leu
195 200 205
Thr Cys Ala Val His Ser Glu Leu Val Phe Lys Ile Thr Lys Ile Leu
210 215 220
Leu Ala Ile Leu Gly Pro Leu Met Val Leu Gln Ala Gly Leu Val Arg
225 230 235 240
Val Pro Tyr Phe Val Arg Ala Gln Gly Leu Ile Arg Ala Cys Met Leu
245 250 255
Val Arg Lys Ile Ala Gly Gly His Tyr Val Gln Met Ala Phe Val Lys
260 265 270
Leu Ala Ala Leu Thr Gly Thr Tyr Val Tyr Asp His Leu Thr Pro Leu
275 280 285
Arg Asp Trp Ala His Thr Gly Leu Arg Asp Leu Ala Val Ala Val Glu
290 295 300
Pro Val Val Phe Ser Asp Met Gly Thr Lys Ile Ile Thr Trp Gly Ala
305 310 315 320
Asp Thr Ala Ala Cys Gly Asp Ile Ile Ser Gly Leu Pro Val Ser Ala
325 330 335
Arg Arg Gly Arg Glu Ile Leu Leu Gly Leu Ala Asp Ser Phe Gly Glu
340 345 350
Gln Gly Trp Arg Leu Leu Ala Pro Ile Thr Ala Tyr Ser Gln Gln Thr
355 360 365
Arg Gly Leu Leu Gly Cys Ile Ile Thr Ser Leu Thr Gly Arg Asp Lys
370 375 380
Asn Gln Val Glu Gly Glu Val Gln Val Val Ser Thr Ala Thr Gln Ser
385 390 395 400
Phe Leu Ala Thr Cys Val Asn Gly Val Cys Trp Thr Val Tyr His Gly
405 410 415
Ala Gly Ser Lys Thr Leu Ala Gly Pro Lys Gly Pro Ile Thr Gln Met
420 425 430
Tyr Thr Asn Val Asp Gln Asp Leu Val Gly Trp Gln Ala Pro Pro Gly
435 440 445
Ala Arg Ser Met Thr Pro Cys Thr Cys Gly Ser Ser Asp Leu Tyr Leu
450 455 460
Val Thr Arg His Ala Asp Val Ile Pro Val Arg Arg Arg Gly Asp Ser
465 470 475 480
Arg Gly Ser Leu Leu Ser Pro Arg Pro Val Ser Tyr Leu Lys Gly Ser
485 490 495
Ser Gly Gly Pro Leu Leu Cys Pro Leu Gly His Val Val Gly Ile Phe
500 505 510
Arg Ala Ala Val Cys Thr Arg Gly Val Ala Lys Ala Val Asp Phe Val
515 520 525
Pro Val Glu Ser
530
<210> 33
<211> 1824
<212> DNA
<213> Hepatitis C virus
<220>
<221> CDS
<222> (229)..(1824)
<400> 33
ccaggtcccc ccctcccggg agagccatag tggtctgcgg aaccggtgag tacaccggaa 60
ttgccaggac gaccgggtcc tttcttggat caacccgctc aatgcctgga gatttgggcg 120
tgcccccgcg aggctgctag ccgagtagtg ttgggtcgcg aaaggccttg tggtactgcc 180
tgatagggtg cttgcgagtg ccccgggagg tctcgtagac cgtgcatc atg agc aca 237
Met Ser Thr
1
aat cct aaa cct caa aga aaa acc aaa cgt aac acc aac cgc cgc cca 285
Asn Pro Lys Pro Gln Arg Lys Thr Lys Arg Asn Thr Asn Arg Arg Pro
5 10 15
cag gac gtc aat ttc ccg ggc ggt ggt cag atc gtt ggt gga gtt tac 333
Gln Asp Val Asn Phe Pro Gly Gly Gly Gln Ile Val Gly Gly Val Tyr
20 25 30 35
ctg ttg ccg cgc agg ggc ccc agg ttg ggt gtg cgc gcg act agg aag 381
Leu Leu Pro Arg Arg Gly Pro Arg Leu Gly Val Arg Ala Thr Arg Lys
40 45 50
act tcc gag cgg tcg caa cct cgt gga agg cga caa cct atc ccc aag 429
Thr Ser Glu Arg Ser Gln Pro Arg Gly Arg Arg Gln Pro Ile Pro Lys
55 60 65
gct cgc cag ccc gag ggt agg gcc tgg gct cag ccc ggg tac cct tgg 477
Ala Arg Gln Pro Glu Gly Arg Ala Trp Ala Gln Pro Gly Tyr Pro Trp
70 75 80
ccc ctc tac ggc aat gag ggc ctg ggg tgg aca gga tgg ctc ctg tca 525
Pro Leu Tyr Gly Asn Glu Gly Leu Gly Trp Thr Gly Trp Leu Leu Ser
85 90 95
ccc cgc ggc tct cgg cct agt tgg ggc ccc acg gac ccc cgg cgt agg 573
Pro Arg Gly Ser Arg Pro Ser Trp Gly Pro Thr Asp Pro Arg Arg Arg
100 105 110 115
tcg cgt aat ttg ggt aag gtc atc gat acc ctt aca tgc ggc ttc gcc 621
Ser Arg Asn Leu Gly Lys Val Ile Asp Thr Leu Thr Cys Gly Phe Ala
120 125 130
gac ctc atg ggg tac atc ccg ctc gtc ggc gcc ccc cta ggg ggc gct 669
Asp Leu Met Gly Tyr Ile Pro Leu Val Gly Ala Pro Leu Gly Gly Ala
135 140 145
gcc agg gcc ttg gcg cat ggc gtc cgg gtt ctg gag gac ggc gtg aac 717
Ala Arg Ala Leu Ala His Gly Val Arg Val Leu Glu Asp Gly Val Asn
150 155 160
tat gca aca ggg aac ctt ccc ggt tgc tct ttc tct atc ttc ctc ttg 765
Tyr Ala Thr Gly Asn Leu Pro Gly Cys Ser Phe Ser Ile Phe Leu Leu
165 170 175
gct ttg ctg tcc tgt ttg acc att cca gcc tcc gcc cat gtc ccc cct 813
Ala Leu Leu Ser Cys Leu Thr Ile Pro Ala Ser Ala His Val Pro Pro
180 185 190 195
ctc aac gtc cgg gga ggc cgc gac gcc atc atc ctt ctc aca tgt gcg 861
Leu Asn Val Arg Gly Gly Arg Asp Ala Ile Ile Leu Leu Thr Cys Ala
200 205 210
gtc cac tca gag cta gtt ttt aaa atc acc aaa atc ttg ctt gca ata 909
Val His Ser Glu Leu Val Phe Lys Ile Thr Lys Ile Leu Leu Ala Ile
215 220 225
ctt ggt ccg ctc atg gtg ctc cag gct ggt ctc att agg gtg ccg tac 957
Leu Gly Pro Leu Met Val Leu Gln Ala Gly Leu Ile Arg Val Pro Tyr
230 235 240
ttc gtg cgc gcc caa ggg ctt atc cgt gca tgc atg ttg gtg cgg aag 1005
Phe Val Arg Ala Gln Gly Leu Ile Arg Ala Cys Met Leu Val Arg Lys
245 250 255
atc gct ggg ggt cat tat gtc caa atg gct ttc gtg aag ctg gcc gca 1053
Ile Ala Gly Gly His Tyr Val Gln Met Ala Phe Val Lys Leu Ala Ala
260 265 270 275
ctg acg ggc acg tac gtc tat gac cat ctt act cca ctg cgg gac tgg 1101
Leu Thr Gly Thr Tyr Val Tyr Asp His Leu Thr Pro Leu Arg Asp Trp
280 285 290
gcc cac acg ggc ctg cga gac ctc gcg gtg gcg gtc gag ccc gtc gtc 1149
Ala His Thr Gly Leu Arg Asp Leu Ala Val Ala Val Glu Pro Val Val
295 300 305
ttc tct gac atg gag acc aag atc atc acc tgg ggg gcg gac acc gcg 1197
Phe Ser Asp Met Glu Thr Lys Ile Ile Thr Trp Gly Ala Asp Thr Ala
310 315 320
gcg tgc ggg gac atc atc tcg ggt ctg ccc gtc tcc gct cgg agg ggg 1245
Ala Cys Gly Asp Ile Ile Ser Gly Leu Pro Val Ser Ala Arg Arg Gly
325 330 335
agg gag ata ctc ctg gga cgg gcc gat agt ttc gga gag cag gga tgg 1293
Arg Glu Ile Leu Leu Gly Arg Ala Asp Ser Phe Gly Glu Gln Gly Trp
340 345 350 355
cga ctc ctt gcg cct atc acg gcc tac tcc caa cag acg cgg ggt tta 1341
Arg Leu Leu Ala Pro Ile Thr Ala Tyr Ser Gln Gln Thr Arg Gly Leu
360 365 370
ctt ggc tgc atc atc act agc ctc aca ggc cgg gac aag aac cag gtc 1389
Leu Gly Cys Ile Ile Thr Ser Leu Thr Gly Arg Asp Lys Asn Gln Val
375 380 385
gag ggg gaa gtc cag gtg gtt tcc acc gca acg cag tct ttc ctc gcg 1437
Glu Gly Glu Val Gln Val Val Ser Thr Ala Thr Gln Ser Phe Leu Ala
390 395 400
aca tgt gta aat ggt gtg tgt tgg act gtc tac cat ggt gcc ggc tca 1485
Thr Cys Val Asn Gly Val Cys Trp Thr Val Tyr His Gly Ala Gly Ser
405 410 415
aag acc tta gcc ggc cct aag ggt ccg atc act caa atg tac acc aat 1533
Lys Thr Leu Ala Gly Pro Lys Gly Pro Ile Thr Gln Met Tyr Thr Asn
420 425 430 435
gtg gac cag gac ctc gtt ggt tgg cag gcg ccc cct ggg gcg cgt tcc 1581
Val Asp Gln Asp Leu Val Gly Trp Gln Ala Pro Pro Gly Ala Arg Ser
440 445 450
atg aca cca tgc acc tgc ggc agc tcg gac ctc tac ctg gtc acg aga 1629
Met Thr Pro Cys Thr Cys Gly Ser Ser Asp Leu Tyr Leu Val Thr Arg
455 460 465
cat gcc gat gtc att ccg gtg cgt cgg cgg ggc gac agc aga ggg agc 1677
His Ala Asp Val Ile Pro Val Arg Arg Arg Gly Asp Ser Arg Gly Ser
470 475 480
cta ctc tcc ccc agg cct gtg tcc tat ttg aag ggc tcc tcg ggt ggt 1725
Leu Leu Ser Pro Arg Pro Val Ser Tyr Leu Lys Gly Ser Ser Gly Gly
485 490 495
cca ctg ctc tgc ccc ttg ggg cac gtc gtg ggc atc ttc cgg gct gct 1773
Pro Leu Leu Cys Pro Leu Gly His Val Val Gly Ile Phe Arg Ala Ala
500 505 510 515
gtg tgc acc cgg ggg gtt gcg aag gcg gtg gac ttt gta ccc gtt gag 1821
Val Cys Thr Arg Gly Val Ala Lys Ala Val Asp Phe Val Pro Val Glu
520 525 530
tct 1824
Ser
<210> 34
<211> 532
<212> PRT
<213> Hepatitis C virus
<400> 34
Met Ser Thr Asn Pro Lys Pro Gln Arg Lys Thr Lys Arg Asn Thr Asn
1 5 10 15
Arg Arg Pro Gln Asp Val Asn Phe Pro Gly Gly Gly Gln Ile Val Gly
20 25 30
Gly Val Tyr Leu Leu Pro Arg Arg Gly Pro Arg Leu Gly Val Arg Ala
35 40 45
Thr Arg Lys Thr Ser Glu Arg Ser Gln Pro Arg Gly Arg Arg Gln Pro
50 55 60
Ile Pro Lys Ala Arg Gln Pro Glu Gly Arg Ala Trp Ala Gln Pro Gly
65 70 75 80
Tyr Pro Trp Pro Leu Tyr Gly Asn Glu Gly Leu Gly Trp Thr Gly Trp
85 90 95
Leu Leu Ser Pro Arg Gly Ser Arg Pro Ser Trp Gly Pro Thr Asp Pro
100 105 110
Arg Arg Arg Ser Arg Asn Leu Gly Lys Val Ile Asp Thr Leu Thr Cys
115 120 125
Gly Phe Ala Asp Leu Met Gly Tyr Ile Pro Leu Val Gly Ala Pro Leu
130 135 140
Gly Gly Ala Ala Arg Ala Leu Ala His Gly Val Arg Val Leu Glu Asp
145 150 155 160
Gly Val Asn Tyr Ala Thr Gly Asn Leu Pro Gly Cys Ser Phe Ser Ile
165 170 175
Phe Leu Leu Ala Leu Leu Ser Cys Leu Thr Ile Pro Ala Ser Ala His
180 185 190
Val Pro Pro Leu Asn Val Arg Gly Gly Arg Asp Ala Ile Ile Leu Leu
195 200 205
Thr Cys Ala Val His Ser Glu Leu Val Phe Lys Ile Thr Lys Ile Leu
210 215 220
Leu Ala Ile Leu Gly Pro Leu Met Val Leu Gln Ala Gly Leu Ile Arg
225 230 235 240
Val Pro Tyr Phe Val Arg Ala Gln Gly Leu Ile Arg Ala Cys Met Leu
245 250 255
Val Arg Lys Ile Ala Gly Gly His Tyr Val Gln Met Ala Phe Val Lys
260 265 270
Leu Ala Ala Leu Thr Gly Thr Tyr Val Tyr Asp His Leu Thr Pro Leu
275 280 285
Arg Asp Trp Ala His Thr Gly Leu Arg Asp Leu Ala Val Ala Val Glu
290 295 300
Pro Val Val Phe Ser Asp Met Glu Thr Lys Ile Ile Thr Trp Gly Ala
305 310 315 320
Asp Thr Ala Ala Cys Gly Asp Ile Ile Ser Gly Leu Pro Val Ser Ala
325 330 335
Arg Arg Gly Arg Glu Ile Leu Leu Gly Arg Ala Asp Ser Phe Gly Glu
340 345 350
Gln Gly Trp Arg Leu Leu Ala Pro Ile Thr Ala Tyr Ser Gln Gln Thr
355 360 365
Arg Gly Leu Leu Gly Cys Ile Ile Thr Ser Leu Thr Gly Arg Asp Lys
370 375 380
Asn Gln Val Glu Gly Glu Val Gln Val Val Ser Thr Ala Thr Gln Ser
385 390 395 400
Phe Leu Ala Thr Cys Val Asn Gly Val Cys Trp Thr Val Tyr His Gly
405 410 415
Ala Gly Ser Lys Thr Leu Ala Gly Pro Lys Gly Pro Ile Thr Gln Met
420 425 430
Tyr Thr Asn Val Asp Gln Asp Leu Val Gly Trp Gln Ala Pro Pro Gly
435 440 445
Ala Arg Ser Met Thr Pro Cys Thr Cys Gly Ser Ser Asp Leu Tyr Leu
450 455 460
Val Thr Arg His Ala Asp Val Ile Pro Val Arg Arg Arg Gly Asp Ser
465 470 475 480
Arg Gly Ser Leu Leu Ser Pro Arg Pro Val Ser Tyr Leu Lys Gly Ser
485 490 495
Ser Gly Gly Pro Leu Leu Cys Pro Leu Gly His Val Val Gly Ile Phe
500 505 510
Arg Ala Ala Val Cys Thr Arg Gly Val Ala Lys Ala Val Asp Phe Val
515 520 525
Pro Val Glu Ser
530
<210> 35
<211> 1115
<212> DNA
<213> Hepatitis C virus
<220>
<221> CDS
<222> (229)..(1113)
<400> 35
ccaggacccc ccctcccggg agagccatag tggtctgcgg aaccggtgag tacaccggaa 60
ttgccaggac gaccgggtcc tttcttggat caacccgctc aatgcctgga gatttgggcg 120
tgcccccgcg agactgctag ccgagtagtg ttgggtcgcg aaaggccttg tggtactgcc 180
tgatagggtg cttgcgagtg ccccgggagg tctcgtagac cgtgcacc atg agc acg 237
Met Ser Thr
1
aat cct aaa cct caa aaa aaa ccc aaa tgt aac acc aac cgc cgc cca 285
Asn Pro Lys Pro Gln Lys Lys Pro Lys Cys Asn Thr Asn Arg Arg Pro
5 10 15
cag gac gtc aag ttc ccg ggc ggt ggt cag atc gtt ggt gga gtt tac 333
Gln Asp Val Lys Phe Pro Gly Gly Gly Gln Ile Val Gly Gly Val Tyr
20 25 30 35
ctg ttg ccg cgc agg ggc ccc agg ttg ggt gtg cgc gcg act agg aag 381
Leu Leu Pro Arg Arg Gly Pro Arg Leu Gly Val Arg Ala Thr Arg Lys
40 45 50
act tcc gag cgg tcg caa cct cgt gga agg cga caa cct atc ccc aag 429
Thr Ser Glu Arg Ser Gln Pro Arg Gly Arg Arg Gln Pro Ile Pro Lys
55 60 65
gct cgc cgg ccc gag ggt agg gcc tgg gct cag ccc ggg tac cct tgg 477
Ala Arg Arg Pro Glu Gly Arg Ala Trp Ala Gln Pro Gly Tyr Pro Trp
70 75 80
ccc ctc tat ggc gat gag ggc cta ggg tgg gca gga tgg ctc ctg tca 525
Pro Leu Tyr Gly Asp Glu Gly Leu Gly Trp Ala Gly Trp Leu Leu Ser
85 90 95
ccc cgc ggc tcc cgg cct agt tgg ggc ccc act gac ccc cgg cgt agg 573
Pro Arg Gly Ser Arg Pro Ser Trp Gly Pro Thr Asp Pro Arg Arg Arg
100 105 110 115
tcg cgt aat ctg ggt aag gtc atc gat acc ctc aca tgc ggc ttc gcc 621
Ser Arg Asn Leu Gly Lys Val Ile Asp Thr Leu Thr Cys Gly Phe Ala
120 125 130
gac ctc atg ggg tac att ccg ctc gtc ggc gcc ccc tta gga ggc gtt 669
Asp Leu Met Gly Tyr Ile Pro Leu Val Gly Ala Pro Leu Gly Gly Val
135 140 145
gcc agg gcc ctg gcg cat ggc gtc cgg gtt ctg gaa gac agc gtg aac 717
Ala Arg Ala Leu Ala His Gly Val Arg Val Leu Glu Asp Ser Val Asn
150 155 160
tac gca aca ggg aat ctg ccc ggt tgc tct ttc tct atc ttc ctc tta 765
Tyr Ala Thr Gly Asn Leu Pro Gly Cys Ser Phe Ser Ile Phe Leu Leu
165 170 175
gct ttg ctg tcc tgc ttg act gtc ccg gct tcc gct tgc aaa act ccc 813
Ala Leu Leu Ser Cys Leu Thr Val Pro Ala Ser Ala Cys Lys Thr Pro
180 185 190 195
acg ctc gcg gcc agg gag cta aac ctt gga atc gcc aca atc ttg ctc 861
Thr Leu Ala Ala Arg Glu Leu Asn Leu Gly Ile Ala Thr Ile Leu Leu
200 205 210
gcc ata ttt ggt ccg ctc gtg gcg ctc cag act ggc cta ttt agg gtg 909
Ala Ile Phe Gly Pro Leu Val Ala Leu Gln Thr Gly Leu Phe Arg Val
215 220 225
ccg tac ttc gtg cgc gcc caa ggg ctc atc cgt gcg tgc atg ttg gtg 957
Pro Tyr Phe Val Arg Ala Gln Gly Leu Ile Arg Ala Cys Met Leu Val
230 235 240
cgg aaa gtc tct ggg ggt cat cat gtc caa atg gct ctt gtg agg cta 1005
Arg Lys Val Ser Gly Gly His His Val Gln Met Ala Leu Val Arg Leu
245 250 255
gct gct cta acg ggc acg tac gtt tat gac cat ctt act ccg ctg cgg 1053
Ala Ala Leu Thr Gly Thr Tyr Val Tyr Asp His Leu Thr Pro Leu Arg
260 265 270 275
gac tgg ccc acg cgg gcc tgc gag atc ttg cgg tgg ctg ttg agc ccg 1101
Asp Trp Pro Thr Arg Ala Cys Glu Ile Leu Arg Trp Leu Leu Ser Pro
280 285 290
tca tct tct ctg ac 1115
Ser Ser Ser Leu
295
<210> 36
<211> 295
<212> PRT
<213> Hepatitis C virus
<400> 36
Met Ser Thr Asn Pro Lys Pro Gln Lys Lys Pro Lys Cys Asn Thr Asn
1 5 10 15
Arg Arg Pro Gln Asp Val Lys Phe Pro Gly Gly Gly Gln Ile Val Gly
20 25 30
Gly Val Tyr Leu Leu Pro Arg Arg Gly Pro Arg Leu Gly Val Arg Ala
35 40 45
Thr Arg Lys Thr Ser Glu Arg Ser Gln Pro Arg Gly Arg Arg Gln Pro
50 55 60
Ile Pro Lys Ala Arg Arg Pro Glu Gly Arg Ala Trp Ala Gln Pro Gly
65 70 75 80
Tyr Pro Trp Pro Leu Tyr Gly Asp Glu Gly Leu Gly Trp Ala Gly Trp
85 90 95
Leu Leu Ser Pro Arg Gly Ser Arg Pro Ser Trp Gly Pro Thr Asp Pro
100 105 110
Arg Arg Arg Ser Arg Asn Leu Gly Lys Val Ile Asp Thr Leu Thr Cys
115 120 125
Gly Phe Ala Asp Leu Met Gly Tyr Ile Pro Leu Val Gly Ala Pro Leu
130 135 140
Gly Gly Val Ala Arg Ala Leu Ala His Gly Val Arg Val Leu Glu Asp
145 150 155 160
Ser Val Asn Tyr Ala Thr Gly Asn Leu Pro Gly Cys Ser Phe Ser Ile
165 170 175
Phe Leu Leu Ala Leu Leu Ser Cys Leu Thr Val Pro Ala Ser Ala Cys
180 185 190
Lys Thr Pro Thr Leu Ala Ala Arg Glu Leu Asn Leu Gly Ile Ala Thr
195 200 205
Ile Leu Leu Ala Ile Phe Gly Pro Leu Val Ala Leu Gln Thr Gly Leu
210 215 220
Phe Arg Val Pro Tyr Phe Val Arg Ala Gln Gly Leu Ile Arg Ala Cys
225 230 235 240
Met Leu Val Arg Lys Val Ser Gly Gly His His Val Gln Met Ala Leu
245 250 255
Val Arg Leu Ala Ala Leu Thr Gly Thr Tyr Val Tyr Asp His Leu Thr
260 265 270
Pro Leu Arg Asp Trp Pro Thr Arg Ala Cys Glu Ile Leu Arg Trp Leu
275 280 285
Leu Ser Pro Ser Ser Ser Leu
290 295
<210> 37
<211> 817
<212> DNA
<213> Hepatitis C virus
<220>
<221> CDS
<222> (1)..(816)
<400> 37
ctc tta gct ttg ctg tcc tgc ttg act gtc cca gct tcc gct tgc gaa 48
Leu Leu Ala Leu Leu Ser Cys Leu Thr Val Pro Ala Ser Ala Cys Glu
1 5 10 15
act ccc acg ctc gcg gcc agg gag cta aac ctt gga atc gcc aaa atc 96
Thr Pro Thr Leu Ala Ala Arg Glu Leu Asn Leu Gly Ile Ala Lys Ile
20 25 30
ttg ctc gcc ata ttt ggt ccg ctc atg gtg ctc cag act ggc cta att 144
Leu Leu Ala Ile Phe Gly Pro Leu Met Val Leu Gln Thr Gly Leu Ile
35 40 45
agg gtg ccg tac ttc gtg cgc gcc cag ggg ctc atc cgt gcg tgc atg 192
Arg Val Pro Tyr Phe Val Arg Ala Gln Gly Leu Ile Arg Ala Cys Met
50 55 60
ttg gtg cgg aaa gtc tct ggg ggt cat tat gtc caa atg gct ctt gtg 240
Leu Val Arg Lys Val Ser Gly Gly His Tyr Val Gln Met Ala Leu Val
65 70 75 80
agg cta gct gcg cta acg ggc acg tac gtt tat gac cat ctt act ccg 288
Arg Leu Ala Ala Leu Thr Gly Thr Tyr Val Tyr Asp His Leu Thr Pro
85 90 95
ctg cgg gac tgg gcc cac gcg ggc ctg cga gat ctc gcg gtg gca gtt 336
Leu Arg Asp Trp Ala His Ala Gly Leu Arg Asp Leu Ala Val Ala Val
100 105 110
gag ccc gtc atc ttc tct gac atg gag acc aag atc atc acc tgg gag 384
Glu Pro Val Ile Phe Ser Asp Met Glu Thr Lys Ile Ile Thr Trp Glu
115 120 125
gca gac acc gcg gcg tgc ggg gac atc atc tcg ggc cta ccc gtc tcc 432
Ala Asp Thr Ala Ala Cys Gly Asp Ile Ile Ser Gly Leu Pro Val Ser
130 135 140
gcc cga agg ggg agg gag ata ctt ttg ggg ccg gcc gat agt ttt aga 480
Ala Arg Arg Gly Arg Glu Ile Leu Leu Gly Pro Ala Asp Ser Phe Arg
145 150 155 160
gat cag ggg tgg caa ctc ctt gcg ccc atc acg gcc tac tcc caa cag 528
Asp Gln Gly Trp Gln Leu Leu Ala Pro Ile Thr Ala Tyr Ser Gln Gln
165 170 175
acg cgg ggc cta ctt ggc tgc atc atc act agc ctc aca ggc cgg gac 576
Thr Arg Gly Leu Leu Gly Cys Ile Ile Thr Ser Leu Thr Gly Arg Asp
180 185 190
aag aac cag gtc gag gga gag gct cag gtg gtt tcc acc gca aca caa 624
Lys Asn Gln Val Glu Gly Glu Ala Gln Val Val Ser Thr Ala Thr Gln
195 200 205
tcc ttc ctg gcg acc tgt gtt aat ggc gtg tgt tgg acc gcc tac cgt 672
Ser Phe Leu Ala Thr Cys Val Asn Gly Val Cys Trp Thr Ala Tyr Arg
210 215 220
ggc gcc ggt gca aag acc cta gcc ggc cca aag ggt cca atc acc caa 720
Gly Ala Gly Ala Lys Thr Leu Ala Gly Pro Lys Gly Pro Ile Thr Gln
225 230 235 240
atg tat acc aat gta gac cag gac ctc gtc ggt tgg cag gcg ccc tcc 768
Met Tyr Thr Asn Val Asp Gln Asp Leu Val Gly Trp Gln Ala Pro Ser
245 250 255
ggg tcg cgt tcc tta acg cca tgc acc tgc ggt agc tcg gac ctt tac 816
Gly Ser Arg Ser Leu Thr Pro Cys Thr Cys Gly Ser Ser Asp Leu Tyr
260 265 270
t 817
<210> 38
<211> 272
<212> PRT
<213> Hepatitis C virus
<400> 38
Leu Leu Ala Leu Leu Ser Cys Leu Thr Val Pro Ala Ser Ala Cys Glu
1 5 10 15
Thr Pro Thr Leu Ala Ala Arg Glu Leu Asn Leu Gly Ile Ala Lys Ile
20 25 30
Leu Leu Ala Ile Phe Gly Pro Leu Met Val Leu Gln Thr Gly Leu Ile
35 40 45
Arg Val Pro Tyr Phe Val Arg Ala Gln Gly Leu Ile Arg Ala Cys Met
50 55 60
Leu Val Arg Lys Val Ser Gly Gly His Tyr Val Gln Met Ala Leu Val
65 70 75 80
Arg Leu Ala Ala Leu Thr Gly Thr Tyr Val Tyr Asp His Leu Thr Pro
85 90 95
Leu Arg Asp Trp Ala His Ala Gly Leu Arg Asp Leu Ala Val Ala Val
100 105 110
Glu Pro Val Ile Phe Ser Asp Met Glu Thr Lys Ile Ile Thr Trp Glu
115 120 125
Ala Asp Thr Ala Ala Cys Gly Asp Ile Ile Ser Gly Leu Pro Val Ser
130 135 140
Ala Arg Arg Gly Arg Glu Ile Leu Leu Gly Pro Ala Asp Ser Phe Arg
145 150 155 160
Asp Gln Gly Trp Gln Leu Leu Ala Pro Ile Thr Ala Tyr Ser Gln Gln
165 170 175
Thr Arg Gly Leu Leu Gly Cys Ile Ile Thr Ser Leu Thr Gly Arg Asp
180 185 190
Lys Asn Gln Val Glu Gly Glu Ala Gln Val Val Ser Thr Ala Thr Gln
195 200 205
Ser Phe Leu Ala Thr Cys Val Asn Gly Val Cys Trp Thr Ala Tyr Arg
210 215 220
Gly Ala Gly Ala Lys Thr Leu Ala Gly Pro Lys Gly Pro Ile Thr Gln
225 230 235 240
Met Tyr Thr Asn Val Asp Gln Asp Leu Val Gly Trp Gln Ala Pro Ser
245 250 255
Gly Ser Arg Ser Leu Thr Pro Cys Thr Cys Gly Ser Ser Asp Leu Tyr
260 265 270
<210> 39
<211> 2302
<212> DNA
<213> Hepatitis C virus
<220>
<221> CDS
<222> (1)..(2301)
<400> 39
cct ttg gct ctg cta tcc tgt ctg act gtc ccg gct tcc gct tat gag 48
Pro Leu Ala Leu Leu Ser Cys Leu Thr Val Pro Ala Ser Ala Tyr Glu
1 5 10 15
gtg cgc aac ttc tcc ggg ata tac cgt gtc acg aac gac tgc tcc aac 96
Val Arg Asn Phe Ser Gly Ile Tyr Arg Val Thr Asn Asp Cys Ser Asn
20 25 30
tca agc att gtg tat gag gca gcg gac atg atc atg cat act ccc ggg 144
Ser Ser Ile Val Tyr Glu Ala Ala Asp Met Ile Met His Thr Pro Gly
35 40 45
tgt gtg ccc tgc gtt cgg gag ggt aac tcc tcc cgt tgc tgg gta gcg 192
Cys Val Pro Cys Val Arg Glu Gly Asn Ser Ser Arg Cys Trp Val Ala
50 55 60
ctc act ccc acg cta gcg gcc agg aat atc agc gtc ccc act acg aca 240
Leu Thr Pro Thr Leu Ala Ala Arg Asn Ile Ser Val Pro Thr Thr Thr
65 70 75 80
ata cga cgc aat gtc gac ttg ctc gtt ggg gcg gct gct ttc tgc tcc 288
Ile Arg Arg Asn Val Asp Leu Leu Val Gly Ala Ala Ala Phe Cys Ser
85 90 95
gcc atg tac gtg gga gac ctc tgc gga tct gtt ttc ctc gtc tcc cag 336
Ala Met Tyr Val Gly Asp Leu Cys Gly Ser Val Phe Leu Val Ser Gln
100 105 110
ctt ttc acc ttc tcg cct cgc cgg cat gag aca gta cag gac tgc aat 384
Leu Phe Thr Phe Ser Pro Arg Arg His Glu Thr Val Gln Asp Cys Asn
115 120 125
tgt tca atc tat tcc ggc cat gtg tca ggt cac cgt atg gct tgg gat 432
Cys Ser Ile Tyr Ser Gly His Val Ser Gly His Arg Met Ala Trp Asp
130 135 140
atg atg atg aac tgg tca cct aca gca acc cta gta gtg tca cag tta 480
Met Met Met Asn Trp Ser Pro Thr Ala Thr Leu Val Val Ser Gln Leu
145 150 155 160
ctc cgg atc cca caa gcc gtc gtg gac atg gtg gcg ggg gct cac tgg 528
Leu Arg Ile Pro Gln Ala Val Val Asp Met Val Ala Gly Ala His Trp
165 170 175
gga gtc cta gcg ggc ctt gcc tac tat ccc atg gcg gga aac tgg gct 576
Gly Val Leu Ala Gly Leu Ala Tyr Tyr Pro Met Ala Gly Asn Trp Ala
180 185 190
aag gtg tta att gta ttg ctg ctc ttc gcc ggc gtt gac ggg cag acc 624
Lys Val Leu Ile Val Leu Leu Leu Phe Ala Gly Val Asp Gly Gln Thr
195 200 205
cgc gtg aca ggg ggg gcg gca gct cac acc gcc cgt ggg ctc act tcc 672
Arg Val Thr Gly Gly Ala Ala Ala His Thr Ala Arg Gly Leu Thr Ser
210 215 220
atc ctt cca cct ggg ccg tct cag aac atc cag ctt gta aac acc aat 720
Ile Leu Pro Pro Gly Pro Ser Gln Asn Ile Gln Leu Val Asn Thr Asn
225 230 235 240
ggc agc tgg cac atc aac agg act gct ctg aac tgc aat gac agc ctc 768
Gly Ser Trp His Ile Asn Arg Thr Ala Leu Asn Cys Asn Asp Ser Leu
245 250 255
cag act ggg ttt ctt gcc gcg ctg ttc ttc aca cac aag ttc aac gcg 816
Gln Thr Gly Phe Leu Ala Ala Leu Phe Phe Thr His Lys Phe Asn Ala
260 265 270
tct gga tgc cca gaa cgc atg gcc agc tgc cgt acc att gac aag ttc 864
Ser Gly Cys Pro Glu Arg Met Ala Ser Cys Arg Thr Ile Asp Lys Phe
275 280 285
aat caa ggg tgg ggt ccc atc acc tat gat ggg cat ggc ggc cag gac 912
Asn Gln Gly Trp Gly Pro Ile Thr Tyr Asp Gly His Gly Gly Gln Asp
290 295 300
cag agg cct tat tgc tgg cac tac gcg cct aag ccg tgc ggt atc gta 960
Gln Arg Pro Tyr Cys Trp His Tyr Ala Pro Lys Pro Cys Gly Ile Val
305 310 315 320
ccc gcg tcg cag gtg tgt ggt cca gtg tat tgt ttc acc cca agc cca 1008
Pro Ala Ser Gln Val Cys Gly Pro Val Tyr Cys Phe Thr Pro Ser Pro
325 330 335
gtt gtg gtg ggg acg acc gat cgc tcc ggt gtc cct acg tat agc tgg 1056
Val Val Val Gly Thr Thr Asp Arg Ser Gly Val Pro Thr Tyr Ser Trp
340 345 350
ggg gag aat gag aca gac gtg ctg ctt ctt aac aac acg cgg ccg ccg 1104
Gly Glu Asn Glu Thr Asp Val Leu Leu Leu Asn Asn Thr Arg Pro Pro
355 360 365
cta ggc aac tgg ttc ggc tgt aca tgg atg aat tgc act ggg ttc acc 1152
Leu Gly Asn Trp Phe Gly Cys Thr Trp Met Asn Cys Thr Gly Phe Thr
370 375 380
aag acg tgc ggg ggc ccc ccg tgt aat atc ggg gga gtc ggc aac aac 1200
Lys Thr Cys Gly Gly Pro Pro Cys Asn Ile Gly Gly Val Gly Asn Asn
385 390 395 400
acc ttg acc tgc ccc acg gat tgc ttc cgg aag cac ccc gaa gcc act 1248
Thr Leu Thr Cys Pro Thr Asp Cys Phe Arg Lys His Pro Glu Ala Thr
405 410 415
tac acc aaa tgt ggt tcg ggg cct tgg ttg aca ccc aga tgc ata gtt 1296
Tyr Thr Lys Cys Gly Ser Gly Pro Trp Leu Thr Pro Arg Cys Ile Val
420 425 430
gac tac cca tac agg ctc tgg cac tac ccc tgc act gtc aac ttc acc 1344
Asp Tyr Pro Tyr Arg Leu Trp His Tyr Pro Cys Thr Val Asn Phe Thr
435 440 445
atc ttc aag gtt agg atg tat gtg ggg ggc gtg gag cac agg ctc aat 1392
Ile Phe Lys Val Arg Met Tyr Val Gly Gly Val Glu His Arg Leu Asn
450 455 460
gct gca tgc aat tgg acc cga ggg gag cgt tgt ggg ttg gag gac agg 1440
Ala Ala Cys Asn Trp Thr Arg Gly Glu Arg Cys Gly Leu Glu Asp Arg
465 470 475 480
gat aga tcg gag ctc agc ccg ctg ctg cta tct aca aca gag tgg cag 1488
Asp Arg Ser Glu Leu Ser Pro Leu Leu Leu Ser Thr Thr Glu Trp Gln
485 490 495
ata ctg cct tgc tcc ttc acc aca cta ccg gct ctg tcc act ggt tta 1536
Ile Leu Pro Cys Ser Phe Thr Thr Leu Pro Ala Leu Ser Thr Gly Leu
500 505 510
atc cat ctt cac cag aac atc gtg gac gtg caa tat ctg tac ggc ata 1584
Ile His Leu His Gln Asn Ile Val Asp Val Gln Tyr Leu Tyr Gly Ile
515 520 525
ggg tcg gtg gtt gtc tcc tct gca atc aag tgg gag tat gtc gtg ttg 1632
Gly Ser Val Val Val Ser Ser Ala Ile Lys Trp Glu Tyr Val Val Leu
530 535 540
ctc ttc ctt ctc ctg gcg gac gca cgc gtc tgt gcc tgc ttg tgg atg 1680
Leu Phe Leu Leu Leu Ala Asp Ala Arg Val Cys Ala Cys Leu Trp Met
545 550 555 560
atg cta ctg gta gcc cag gcc gag gct gct tta gag aac cta gtg gtt 1728
Met Leu Leu Val Ala Gln Ala Glu Ala Ala Leu Glu Asn Leu Val Val
565 570 575
ctc aac gcg gca tcc gtg gct ggg acg cac ggc att atc ccc ttc ctt 1776
Leu Asn Ala Ala Ser Val Ala Gly Thr His Gly Ile Ile Pro Phe Leu
580 585 590
gtg ttc ttc tgt gcc gcc tgg tac atc aaa ggc agg ctc gtc cct gca 1824
Val Phe Phe Cys Ala Ala Trp Tyr Ile Lys Gly Arg Leu Val Pro Ala
595 600 605
gcg gca tat gct ttc tat ggc gta tgg ccg ctg ctc ctg ctc ctg ctg 1872
Ala Ala Tyr Ala Phe Tyr Gly Val Trp Pro Leu Leu Leu Leu Leu Leu
610 615 620
gcg tta cca cca cga gct tac gcc atg gac cgg gag atg gct gca tcg 1920
Ala Leu Pro Pro Arg Ala Tyr Ala Met Asp Arg Glu Met Ala Ala Ser
625 630 635 640
tgt gga ggc ggg gtt ttt gta ggt ctg gca ttc ttg acc ttg tca cca 1968
Cys Gly Gly Gly Val Phe Val Gly Leu Ala Phe Leu Thr Leu Ser Pro
645 650 655
tac tac aag gtg ttc ctc gct aag ctc ata tgg tgg tta caa tat ttt 2016
Tyr Tyr Lys Val Phe Leu Ala Lys Leu Ile Trp Trp Leu Gln Tyr Phe
660 665 670
atc acc aga gcc gag gcg cac ttg caa gtg tgg atc ccc ccc ctc aac 2064
Ile Thr Arg Ala Glu Ala His Leu Gln Val Trp Ile Pro Pro Leu Asn
675 680 685
gtt cgg gga ggc cgt gat gcc atc atc ctc ctc gca tgc gca gtc cac 2112
Val Arg Gly Gly Arg Asp Ala Ile Ile Leu Leu Ala Cys Ala Val His
690 695 700
ccg gag cta atc ttt gac atc acc aaa ctt ctg ctc gcc ata ctc ggc 2160
Pro Glu Leu Ile Phe Asp Ile Thr Lys Leu Leu Leu Ala Ile Leu Gly
705 710 715 720
ccg ctc atg gtg ttc cag gcc agc ata acc cga gtg ccg tac ttt gtg 2208
Pro Leu Met Val Phe Gln Ala Ser Ile Thr Arg Val Pro Tyr Phe Val
725 730 735
cgc gct caa ggg ctc att cgt gca tgc atg tta gtg cgg aaa gcc gct 2256
Arg Ala Gln Gly Leu Ile Arg Ala Cys Met Leu Val Arg Lys Ala Ala
740 745 750
ggg ggt cat tat atc caa atg gcc ctc gtg aaa ctg gcc gcg ctg 2301
Gly Gly His Tyr Ile Gln Met Ala Leu Val Lys Leu Ala Ala Leu
755 760 765
a 2302
<210> 40
<211> 767
<212> PRT
<213> Hepatitis C virus
<400> 40
Pro Leu Ala Leu Leu Ser Cys Leu Thr Val Pro Ala Ser Ala Tyr Glu
1 5 10 15
Val Arg Asn Phe Ser Gly Ile Tyr Arg Val Thr Asn Asp Cys Ser Asn
20 25 30
Ser Ser Ile Val Tyr Glu Ala Ala Asp Met Ile Met His Thr Pro Gly
35 40 45
Cys Val Pro Cys Val Arg Glu Gly Asn Ser Ser Arg Cys Trp Val Ala
50 55 60
Leu Thr Pro Thr Leu Ala Ala Arg Asn Ile Ser Val Pro Thr Thr Thr
65 70 75 80
Ile Arg Arg Asn Val Asp Leu Leu Val Gly Ala Ala Ala Phe Cys Ser
85 90 95
Ala Met Tyr Val Gly Asp Leu Cys Gly Ser Val Phe Leu Val Ser Gln
100 105 110
Leu Phe Thr Phe Ser Pro Arg Arg His Glu Thr Val Gln Asp Cys Asn
115 120 125
Cys Ser Ile Tyr Ser Gly His Val Ser Gly His Arg Met Ala Trp Asp
130 135 140
Met Met Met Asn Trp Ser Pro Thr Ala Thr Leu Val Val Ser Gln Leu
145 150 155 160
Leu Arg Ile Pro Gln Ala Val Val Asp Met Val Ala Gly Ala His Trp
165 170 175
Gly Val Leu Ala Gly Leu Ala Tyr Tyr Pro Met Ala Gly Asn Trp Ala
180 185 190
Lys Val Leu Ile Val Leu Leu Leu Phe Ala Gly Val Asp Gly Gln Thr
195 200 205
Arg Val Thr Gly Gly Ala Ala Ala His Thr Ala Arg Gly Leu Thr Ser
210 215 220
Ile Leu Pro Pro Gly Pro Ser Gln Asn Ile Gln Leu Val Asn Thr Asn
225 230 235 240
Gly Ser Trp His Ile Asn Arg Thr Ala Leu Asn Cys Asn Asp Ser Leu
245 250 255
Gln Thr Gly Phe Leu Ala Ala Leu Phe Phe Thr His Lys Phe Asn Ala
260 265 270
Ser Gly Cys Pro Glu Arg Met Ala Ser Cys Arg Thr Ile Asp Lys Phe
275 280 285
Asn Gln Gly Trp Gly Pro Ile Thr Tyr Asp Gly His Gly Gly Gln Asp
290 295 300
Gln Arg Pro Tyr Cys Trp His Tyr Ala Pro Lys Pro Cys Gly Ile Val
305 310 315 320
Pro Ala Ser Gln Val Cys Gly Pro Val Tyr Cys Phe Thr Pro Ser Pro
325 330 335
Val Val Val Gly Thr Thr Asp Arg Ser Gly Val Pro Thr Tyr Ser Trp
340 345 350
Gly Glu Asn Glu Thr Asp Val Leu Leu Leu Asn Asn Thr Arg Pro Pro
355 360 365
Leu Gly Asn Trp Phe Gly Cys Thr Trp Met Asn Cys Thr Gly Phe Thr
370 375 380
Lys Thr Cys Gly Gly Pro Pro Cys Asn Ile Gly Gly Val Gly Asn Asn
385 390 395 400
Thr Leu Thr Cys Pro Thr Asp Cys Phe Arg Lys His Pro Glu Ala Thr
405 410 415
Tyr Thr Lys Cys Gly Ser Gly Pro Trp Leu Thr Pro Arg Cys Ile Val
420 425 430
Asp Tyr Pro Tyr Arg Leu Trp His Tyr Pro Cys Thr Val Asn Phe Thr
435 440 445
Ile Phe Lys Val Arg Met Tyr Val Gly Gly Val Glu His Arg Leu Asn
450 455 460
Ala Ala Cys Asn Trp Thr Arg Gly Glu Arg Cys Gly Leu Glu Asp Arg
465 470 475 480
Asp Arg Ser Glu Leu Ser Pro Leu Leu Leu Ser Thr Thr Glu Trp Gln
485 490 495
Ile Leu Pro Cys Ser Phe Thr Thr Leu Pro Ala Leu Ser Thr Gly Leu
500 505 510
Ile His Leu His Gln Asn Ile Val Asp Val Gln Tyr Leu Tyr Gly Ile
515 520 525
Gly Ser Val Val Val Ser Ser Ala Ile Lys Trp Glu Tyr Val Val Leu
530 535 540
Leu Phe Leu Leu Leu Ala Asp Ala Arg Val Cys Ala Cys Leu Trp Met
545 550 555 560
Met Leu Leu Val Ala Gln Ala Glu Ala Ala Leu Glu Asn Leu Val Val
565 570 575
Leu Asn Ala Ala Ser Val Ala Gly Thr His Gly Ile Ile Pro Phe Leu
580 585 590
Val Phe Phe Cys Ala Ala Trp Tyr Ile Lys Gly Arg Leu Val Pro Ala
595 600 605
Ala Ala Tyr Ala Phe Tyr Gly Val Trp Pro Leu Leu Leu Leu Leu Leu
610 615 620
Ala Leu Pro Pro Arg Ala Tyr Ala Met Asp Arg Glu Met Ala Ala Ser
625 630 635 640
Cys Gly Gly Gly Val Phe Val Gly Leu Ala Phe Leu Thr Leu Ser Pro
645 650 655
Tyr Tyr Lys Val Phe Leu Ala Lys Leu Ile Trp Trp Leu Gln Tyr Phe
660 665 670
Ile Thr Arg Ala Glu Ala His Leu Gln Val Trp Ile Pro Pro Leu Asn
675 680 685
Val Arg Gly Gly Arg Asp Ala Ile Ile Leu Leu Ala Cys Ala Val His
690 695 700
Pro Glu Leu Ile Phe Asp Ile Thr Lys Leu Leu Leu Ala Ile Leu Gly
705 710 715 720
Pro Leu Met Val Phe Gln Ala Ser Ile Thr Arg Val Pro Tyr Phe Val
725 730 735
Arg Ala Gln Gly Leu Ile Arg Ala Cys Met Leu Val Arg Lys Ala Ala
740 745 750
Gly Gly His Tyr Ile Gln Met Ala Leu Val Lys Leu Ala Ala Leu
755 760 765
<210> 41
<211> 2240
<212> DNA
<213> Hepatitis C virus
<220>
<221> CDS
<222> (1)..(2238)
<400> 41
ctc ttg gct ctg ctg tcc tgt ctg act atc cca gct tcc gcc tat gag 48
Leu Leu Ala Leu Leu Ser Cys Leu Thr Ile Pro Ala Ser Ala Tyr Glu
1 5 10 15
gtg cgc aac gtg tcc ggg ttg tac cat gtc acg aac gac tgc tcc aac 96
Val Arg Asn Val Ser Gly Leu Tyr His Val Thr Asn Asp Cys Ser Asn
20 25 30
tca agt att gtg tat gag gca gcg gac atg atc atg cat acc ccc ggg 144
Ser Ser Ile Val Tyr Glu Ala Ala Asp Met Ile Met His Thr Pro Gly
35 40 45
tgc gtg ccc tgc gtc cgg gag aac aac cgc cct cgc tgc tgg gta gcg 192
Cys Val Pro Cys Val Arg Glu Asn Asn Arg Pro Arg Cys Trp Val Ala
50 55 60
ctc act ccc acg ctc gcg gcc aga aac agc agc atc ccc act gcg aca 240
Leu Thr Pro Thr Leu Ala Ala Arg Asn Ser Ser Ile Pro Thr Ala Thr
65 70 75 80
ata cga cgc cat gtc gat ttg ctc gtt ggg gca gcc gct ctc tgc tcc 288
Ile Arg Arg His Val Asp Leu Leu Val Gly Ala Ala Ala Leu Cys Ser
85 90 95
gcc atg tat gtg ggg gat ctt tgc gga tct gtc ttc ctc gtc tcc cag 336
Ala Met Tyr Val Gly Asp Leu Cys Gly Ser Val Phe Leu Val Ser Gln
100 105 110
ctg ttc acc ttc tcg cct cgc cgg tat gag acg gta caa gac tgc aat 384
Leu Phe Thr Phe Ser Pro Arg Arg Tyr Glu Thr Val Gln Asp Cys Asn
115 120 125
tgc tca ctc tat cct ggc cac gta aca ggt cac cgc atg gcc tgg gat 432
Cys Ser Leu Tyr Pro Gly His Val Thr Gly His Arg Met Ala Trp Asp
130 135 140
atg atg atg aac tgg tcg cct aca aca gcc cta gtg gta tcg cag ata 480
Met Met Met Asn Trp Ser Pro Thr Thr Ala Leu Val Val Ser Gln Ile
145 150 155 160
ctg cgg atc cca caa gcc gtc atg gac atg gtg acg ggg gcc cac tgg 528
Leu Arg Ile Pro Gln Ala Val Met Asp Met Val Thr Gly Ala His Trp
165 170 175
gga gtc ctg gcg ggc ctc gcc tac tat tcc atg gtg gga aac tgg gct 576
Gly Val Leu Ala Gly Leu Ala Tyr Tyr Ser Met Val Gly Asn Trp Ala
180 185 190
aag gtc ctg att gtg ttg tta ctc ttt gcc ggc gtt gat ggg acc acc 624
Lys Val Leu Ile Val Leu Leu Leu Phe Ala Gly Val Asp Gly Thr Thr
195 200 205
cac ata acg ggg ggg act gca ggc caa act gcc ttt agc ctc aca agt 672
His Ile Thr Gly Gly Thr Ala Gly Gln Thr Ala Phe Ser Leu Thr Ser
210 215 220
ctc ctc gca tct ggg ccg act cag aag atc caa att ata aac act aac 720
Leu Leu Ala Ser Gly Pro Thr Gln Lys Ile Gln Ile Ile Asn Thr Asn
225 230 235 240
ggc agc tgg cac atc aac aga act gcc ttg agt tgt aac gac tcc ctt 768
Gly Ser Trp His Ile Asn Arg Thr Ala Leu Ser Cys Asn Asp Ser Leu
245 250 255
cag act ggg ttc att gcc gcg ctg ttc tac aag cac agg ttc aac tcg 816
Gln Thr Gly Phe Ile Ala Ala Leu Phe Tyr Lys His Arg Phe Asn Ser
260 265 270
tcc gga tgc cca gag cgc atg gcc agc tgc cgc ccc atc gac agg ttt 864
Ser Gly Cys Pro Glu Arg Met Ala Ser Cys Arg Pro Ile Asp Arg Phe
275 280 285
aat cag ggg tgg ggt ccc att act tat gat gat aag ctt ccc gtc tca 912
Asn Gln Gly Trp Gly Pro Ile Thr Tyr Asp Asp Lys Leu Pro Val Ser
290 295 300
gac cag agg cct tac tgc agg cac tac gcg cct cgg ccg tgc ggt atc 960
Asp Gln Arg Pro Tyr Cys Arg His Tyr Ala Pro Arg Pro Cys Gly Ile
305 310 315 320
gtg ccc gcg tcg gag gtg tgt ggt ccg gtg tat tgc ttc acc cca agc 1008
Val Pro Ala Ser Glu Val Cys Gly Pro Val Tyr Cys Phe Thr Pro Ser
325 330 335
cct gtt gtg gtg ggg acg acc gat cgc ttc ggc gct ccc acg tat aac 1056
Pro Val Val Val Gly Thr Thr Asp Arg Phe Gly Ala Pro Thr Tyr Asn
340 345 350
tgg ggg gag aat gag acg gac gtg cta atc ctc aac aac acg cgg ccg 1104
Trp Gly Glu Asn Glu Thr Asp Val Leu Ile Leu Asn Asn Thr Arg Pro
355 360 365
ccg caa ggc aac tgg ttt ggc tgt aca tgg atg aat aac acc ggg ttc 1152
Pro Gln Gly Asn Trp Phe Gly Cys Thr Trp Met Asn Asn Thr Gly Phe
370 375 380
acc aag acg tgc gga ggc cct ccg tgt aac atc ggg ggg gtc ggc aat 1200
Thr Lys Thr Cys Gly Gly Pro Pro Cys Asn Ile Gly Gly Val Gly Asn
385 390 395 400
gag acc ttg acc tgc cct acg gat tgc ttc cgg aag cac ccc gaa gcc 1248
Glu Thr Leu Thr Cys Pro Thr Asp Cys Phe Arg Lys His Pro Glu Ala
405 410 415
acg tac acc aaa tgc ggc tcg ggg cct tgg ttg aca cct agg tgt atg 1296
Thr Tyr Thr Lys Cys Gly Ser Gly Pro Trp Leu Thr Pro Arg Cys Met
420 425 430
gtt gat tac cca tac aga ctt tgg cac tac ccc tgc act gta aac ttt 1344
Val Asp Tyr Pro Tyr Arg Leu Trp His Tyr Pro Cys Thr Val Asn Phe
435 440 445
act atc ttc aag atc agg atg tac gtg ggg ggt gta gag cac agg ttc 1392
Thr Ile Phe Lys Ile Arg Met Tyr Val Gly Gly Val Glu His Arg Phe
450 455 460
aca gcc gcg tgc aat tgg gcc cga gga gag cgc tgt gac gta gag gac 1440
Thr Ala Ala Cys Asn Trp Ala Arg Gly Glu Arg Cys Asp Val Glu Asp
465 470 475 480
agg gat aga gca gag ctc agc ccg cta cta ctg tct aca act gag tgg 1488
Arg Asp Arg Ala Glu Leu Ser Pro Leu Leu Leu Ser Thr Thr Glu Trp
485 490 495
cag ata ctg ccc tgt tcc ttt acc acc cta cca gct ctg tcc acc gga 1536
Gln Ile Leu Pro Cys Ser Phe Thr Thr Leu Pro Ala Leu Ser Thr Gly
500 505 510
tcg atc cac ctc cat cag aac acc gtg gac gtg caa tac ctg tac ggt 1584
Ser Ile His Leu His Gln Asn Thr Val Asp Val Gln Tyr Leu Tyr Gly
515 520 525
gta ggg tca gcg gtt gtt tcc atc gcg atc aaa tgg gag tat gtc ctg 1632
Val Gly Ser Ala Val Val Ser Ile Ala Ile Lys Trp Glu Tyr Val Leu
530 535 540
ctg ctt ttc ctt ctc ttg gcg gac gca cgc gtc tgc gcc tgt ttg tgg 1680
Leu Leu Phe Leu Leu Leu Ala Asp Ala Arg Val Cys Ala Cys Leu Trp
545 550 555 560
atg atg ctg ctg ata gcc cag gct gag gcc gct ttg gag aac ctg gtg 1728
Met Met Leu Leu Ile Ala Gln Ala Glu Ala Ala Leu Glu Asn Leu Val
565 570 575
atc ctc aat gcg gcg tcc gta gct gga gcg cac ggc att ctt tcc ttc 1776
Ile Leu Asn Ala Ala Ser Val Ala Gly Ala His Gly Ile Leu Ser Phe
580 585 590
ctc atg ttc ttc tgt gct gcc tgg tat atc aag ggt aag ctg gtc ccc 1824
Leu Met Phe Phe Cys Ala Ala Trp Tyr Ile Lys Gly Lys Leu Val Pro
595 600 605
ggg gcg gca tat gct ttc tat agc gta tgg ccg ctg ctc ctg ctc ctg 1872
Gly Ala Ala Tyr Ala Phe Tyr Ser Val Trp Pro Leu Leu Leu Leu Leu
610 615 620
ctg gcg cta cca cca cga gcg tac gct atg gac cgg gag atg gct gca 1920
Leu Ala Leu Pro Pro Arg Ala Tyr Ala Met Asp Arg Glu Met Ala Ala
625 630 635 640
tca tgt ggg ggt ggg gtc ttc ata ggt ttg gta gtc ttg act ttg tca 1968
Ser Cys Gly Gly Gly Val Phe Ile Gly Leu Val Val Leu Thr Leu Ser
645 650 655
ccg cac tat aaa gca ttc ctc gct agg ctt ata tgg tgg tta caa tat 2016
Pro His Tyr Lys Ala Phe Leu Ala Arg Leu Ile Trp Trp Leu Gln Tyr
660 665 670
ttt atc acc agg acc gag gcg cac ttg caa gtg tgg atc ccc ccc ctc 2064
Phe Ile Thr Arg Thr Glu Ala His Leu Gln Val Trp Ile Pro Pro Leu
675 680 685
aac gtt cgg ggg ggc cgt gat gcc atc atc ctc ctc atg tgc gtg gtc 2112
Asn Val Arg Gly Gly Arg Asp Ala Ile Ile Leu Leu Met Cys Val Val
690 695 700
cat cca gag cta att ttt gaa atc acc aag atc ttg ctc gcc ata ctg 2160
His Pro Glu Leu Ile Phe Glu Ile Thr Lys Ile Leu Leu Ala Ile Leu
705 710 715 720
ggt ccg ccc atg gtg ctc cag gcc ggc ctg att agg gtg ccg tac ttc 2208
Gly Pro Pro Met Val Leu Gln Ala Gly Leu Ile Arg Val Pro Tyr Phe
725 730 735
gtg cgc gct caa ggg ctc att cgt gcg tgc at 2240
Val Arg Ala Gln Gly Leu Ile Arg Ala Cys
740 745
<210> 42
<211> 746
<212> PRT
<213> Hepatitis C virus
<400> 42
Leu Leu Ala Leu Leu Ser Cys Leu Thr Ile Pro Ala Ser Ala Tyr Glu
1 5 10 15
Val Arg Asn Val Ser Gly Leu Tyr His Val Thr Asn Asp Cys Ser Asn
20 25 30
Ser Ser Ile Val Tyr Glu Ala Ala Asp Met Ile Met His Thr Pro Gly
35 40 45
Cys Val Pro Cys Val Arg Glu Asn Asn Arg Pro Arg Cys Trp Val Ala
50 55 60
Leu Thr Pro Thr Leu Ala Ala Arg Asn Ser Ser Ile Pro Thr Ala Thr
65 70 75 80
Ile Arg Arg His Val Asp Leu Leu Val Gly Ala Ala Ala Leu Cys Ser
85 90 95
Ala Met Tyr Val Gly Asp Leu Cys Gly Ser Val Phe Leu Val Ser Gln
100 105 110
Leu Phe Thr Phe Ser Pro Arg Arg Tyr Glu Thr Val Gln Asp Cys Asn
115 120 125
Cys Ser Leu Tyr Pro Gly His Val Thr Gly His Arg Met Ala Trp Asp
130 135 140
Met Met Met Asn Trp Ser Pro Thr Thr Ala Leu Val Val Ser Gln Ile
145 150 155 160
Leu Arg Ile Pro Gln Ala Val Met Asp Met Val Thr Gly Ala His Trp
165 170 175
Gly Val Leu Ala Gly Leu Ala Tyr Tyr Ser Met Val Gly Asn Trp Ala
180 185 190
Lys Val Leu Ile Val Leu Leu Leu Phe Ala Gly Val Asp Gly Thr Thr
195 200 205
His Ile Thr Gly Gly Thr Ala Gly Gln Thr Ala Phe Ser Leu Thr Ser
210 215 220
Leu Leu Ala Ser Gly Pro Thr Gln Lys Ile Gln Ile Ile Asn Thr Asn
225 230 235 240
Gly Ser Trp His Ile Asn Arg Thr Ala Leu Ser Cys Asn Asp Ser Leu
245 250 255
Gln Thr Gly Phe Ile Ala Ala Leu Phe Tyr Lys His Arg Phe Asn Ser
260 265 270
Ser Gly Cys Pro Glu Arg Met Ala Ser Cys Arg Pro Ile Asp Arg Phe
275 280 285
Asn Gln Gly Trp Gly Pro Ile Thr Tyr Asp Asp Lys Leu Pro Val Ser
290 295 300
Asp Gln Arg Pro Tyr Cys Arg His Tyr Ala Pro Arg Pro Cys Gly Ile
305 310 315 320
Val Pro Ala Ser Glu Val Cys Gly Pro Val Tyr Cys Phe Thr Pro Ser
325 330 335
Pro Val Val Val Gly Thr Thr Asp Arg Phe Gly Ala Pro Thr Tyr Asn
340 345 350
Trp Gly Glu Asn Glu Thr Asp Val Leu Ile Leu Asn Asn Thr Arg Pro
355 360 365
Pro Gln Gly Asn Trp Phe Gly Cys Thr Trp Met Asn Asn Thr Gly Phe
370 375 380
Thr Lys Thr Cys Gly Gly Pro Pro Cys Asn Ile Gly Gly Val Gly Asn
385 390 395 400
Glu Thr Leu Thr Cys Pro Thr Asp Cys Phe Arg Lys His Pro Glu Ala
405 410 415
Thr Tyr Thr Lys Cys Gly Ser Gly Pro Trp Leu Thr Pro Arg Cys Met
420 425 430
Val Asp Tyr Pro Tyr Arg Leu Trp His Tyr Pro Cys Thr Val Asn Phe
435 440 445
Thr Ile Phe Lys Ile Arg Met Tyr Val Gly Gly Val Glu His Arg Phe
450 455 460
Thr Ala Ala Cys Asn Trp Ala Arg Gly Glu Arg Cys Asp Val Glu Asp
465 470 475 480
Arg Asp Arg Ala Glu Leu Ser Pro Leu Leu Leu Ser Thr Thr Glu Trp
485 490 495
Gln Ile Leu Pro Cys Ser Phe Thr Thr Leu Pro Ala Leu Ser Thr Gly
500 505 510
Ser Ile His Leu His Gln Asn Thr Val Asp Val Gln Tyr Leu Tyr Gly
515 520 525
Val Gly Ser Ala Val Val Ser Ile Ala Ile Lys Trp Glu Tyr Val Leu
530 535 540
Leu Leu Phe Leu Leu Leu Ala Asp Ala Arg Val Cys Ala Cys Leu Trp
545 550 555 560
Met Met Leu Leu Ile Ala Gln Ala Glu Ala Ala Leu Glu Asn Leu Val
565 570 575
Ile Leu Asn Ala Ala Ser Val Ala Gly Ala His Gly Ile Leu Ser Phe
580 585 590
Leu Met Phe Phe Cys Ala Ala Trp Tyr Ile Lys Gly Lys Leu Val Pro
595 600 605
Gly Ala Ala Tyr Ala Phe Tyr Ser Val Trp Pro Leu Leu Leu Leu Leu
610 615 620
Leu Ala Leu Pro Pro Arg Ala Tyr Ala Met Asp Arg Glu Met Ala Ala
625 630 635 640
Ser Cys Gly Gly Gly Val Phe Ile Gly Leu Val Val Leu Thr Leu Ser
645 650 655
Pro His Tyr Lys Ala Phe Leu Ala Arg Leu Ile Trp Trp Leu Gln Tyr
660 665 670
Phe Ile Thr Arg Thr Glu Ala His Leu Gln Val Trp Ile Pro Pro Leu
675 680 685
Asn Val Arg Gly Gly Arg Asp Ala Ile Ile Leu Leu Met Cys Val Val
690 695 700
His Pro Glu Leu Ile Phe Glu Ile Thr Lys Ile Leu Leu Ala Ile Leu
705 710 715 720
Gly Pro Pro Met Val Leu Gln Ala Gly Leu Ile Arg Val Pro Tyr Phe
725 730 735
Val Arg Ala Gln Gly Leu Ile Arg Ala Cys
740 745
<210> 43
<211> 2237
<212> DNA
<213> Hepatitis C virus
<220>
<221> CDS
<222> (1)..(2226)
<400> 43
ctc ttg gct ttg ttg tcc tgt ttg acc gtc cca act tcc gct tat gaa 48
Leu Leu Ala Leu Leu Ser Cys Leu Thr Val Pro Thr Ser Ala Tyr Glu
1 5 10 15
gtg cgc aac gtg tcc ggg atg tac caa gtc acg aac gac tgc tcc aac 96
Val Arg Asn Val Ser Gly Met Tyr Gln Val Thr Asn Asp Cys Ser Asn
20 25 30
tca agc att gtg tat gag gca gcg gac gtg atc atg cac acc ccc ggg 144
Ser Ser Ile Val Tyr Glu Ala Ala Asp Val Ile Met His Thr Pro Gly
35 40 45
tgc gtg ccc tgt gtc cgg gag agc aac ctc tcc cgc tgc tgg gtt gcg 192
Cys Val Pro Cys Val Arg Glu Ser Asn Leu Ser Arg Cys Trp Val Ala
50 55 60
ctc acg ccc acg ctc gcg gcc agg aac agc agt atc ccc act acg aca 240
Leu Thr Pro Thr Leu Ala Ala Arg Asn Ser Ser Ile Pro Thr Thr Thr
65 70 75 80
ata cga cgt cat gtc gat ttg ctc gtt ggg gca tct gcc ttc tgc tcc 288
Ile Arg Arg His Val Asp Leu Leu Val Gly Ala Ser Ala Phe Cys Ser
85 90 95
gct atg tac gtg ggg gat ctt tgc gga tct gtc ttc ctc atc tcc cag 336
Ala Met Tyr Val Gly Asp Leu Cys Gly Ser Val Phe Leu Ile Ser Gln
100 105 110
ctg ttc acc ttc tca cct cgc cgg tac gag acg gtg caa gac tgc aat 384
Leu Phe Thr Phe Ser Pro Arg Arg Tyr Glu Thr Val Gln Asp Cys Asn
115 120 125
tgc tca ctc tat ccc ggc cac gta tca ggt cat cgc atg gct tgg gat 432
Cys Ser Leu Tyr Pro Gly His Val Ser Gly His Arg Met Ala Trp Asp
130 135 140
atg atg atg aac tgg tcg cct aca aca gcc tta gtg gta tcg cag tta 480
Met Met Met Asn Trp Ser Pro Thr Thr Ala Leu Val Val Ser Gln Leu
145 150 155 160
ctc cgg atc cca caa gcc atc gtg gac atg gtg aca ggg gcc cac tgg 528
Leu Arg Ile Pro Gln Ala Ile Val Asp Met Val Thr Gly Ala His Trp
165 170 175
ggg gtc ttg gcg ggt ctc gcc tat tac tcc atg gtg ggg aac tgg gct 576
Gly Val Leu Ala Gly Leu Ala Tyr Tyr Ser Met Val Gly Asn Trp Ala
180 185 190
aag gtc ttg att gtg atg cta ctc ttt tcc ggc gtt gac ggc gcc act 624
Lys Val Leu Ile Val Met Leu Leu Phe Ser Gly Val Asp Gly Ala Thr
195 200 205
cgc ctg tca ggg ggg gcg gca ggt cgt gat acc cgc ggc ttc gcg gcc 672
Arg Leu Ser Gly Gly Ala Ala Gly Arg Asp Thr Arg Gly Phe Ala Ala
210 215 220
ctc ttc cag cca ggg tca gct cag aac atc cag ctt ata aac tcc aac 720
Leu Phe Gln Pro Gly Ser Ala Gln Asn Ile Gln Leu Ile Asn Ser Asn
225 230 235 240
ggc agc tgg cac gtc aac agg aca gcc ctg aat tgc aat gac acc ctc 768
Gly Ser Trp His Val Asn Arg Thr Ala Leu Asn Cys Asn Asp Thr Leu
245 250 255
cac act ggg ttc att gcc ggg ctg ctc tac aca acc aaa ttc aac tcg 816
His Thr Gly Phe Ile Ala Gly Leu Leu Tyr Thr Thr Lys Phe Asn Ser
260 265 270
tcc ggg tgc cca ggg cgc ctg gcc agc tgc cgc ccc att gac aag ttc 864
Ser Gly Cys Pro Gly Arg Leu Ala Ser Cys Arg Pro Ile Asp Lys Phe
275 280 285
gcc cag ggg tgg ggt ccc atc act tat gct gag cca gga gcc tcg gac 912
Ala Gln Gly Trp Gly Pro Ile Thr Tyr Ala Glu Pro Gly Ala Ser Asp
290 295 300
cag agg ccc tat tgc tgg cac tac gcg cct cgg ccg tgc ggt att gta 960
Gln Arg Pro Tyr Cys Trp His Tyr Ala Pro Arg Pro Cys Gly Ile Val
305 310 315 320
ccc gcg tcg cag gtg tgt ggt cca gta tat tgc ttc acc cca agc ccc 1008
Pro Ala Ser Gln Val Cys Gly Pro Val Tyr Cys Phe Thr Pro Ser Pro
325 330 335
gtc gtg gtg ggt acg acc gat cgc tcc ggt gcc ccc acg tat acc tgg 1056
Val Val Val Gly Thr Thr Asp Arg Ser Gly Ala Pro Thr Tyr Thr Trp
340 345 350
ggg gag aat gag acg gac gtg cta ctt ctc aac aac aca cgg ccg ccg 1104
Gly Glu Asn Glu Thr Asp Val Leu Leu Leu Asn Asn Thr Arg Pro Pro
355 360 365
caa ggc aac tgg ttc ggc tgt aca tgg atg aat agc acc ggg ttc acc 1152
Gln Gly Asn Trp Phe Gly Cys Thr Trp Met Asn Ser Thr Gly Phe Thr
370 375 380
aag acg tgt ggg gcc cct ccg tgc aac atc ggg ggg agc ggc aac aac 1200
Lys Thr Cys Gly Ala Pro Pro Cys Asn Ile Gly Gly Ser Gly Asn Asn
385 390 395 400
acc ttg atc tgc cct acg gat tgc ttc cgg aag cac ccc gag gcc act 1248
Thr Leu Ile Cys Pro Thr Asp Cys Phe Arg Lys His Pro Glu Ala Thr
405 410 415
tac atc aaa tgc ggc tcg ggg ccg tgg ttg aca cct agg tgt cta gtt 1296
Tyr Ile Lys Cys Gly Ser Gly Pro Trp Leu Thr Pro Arg Cys Leu Val
420 425 430
gat tac cca tac agg ctt tgg cac tac ccc tgc acc gtc aac ttt acc 1344
Asp Tyr Pro Tyr Arg Leu Trp His Tyr Pro Cys Thr Val Asn Phe Thr
435 440 445
atc ttc aag atc agg atg tat gtg ggg ggc gtg gag cac aga ctc act 1392
Ile Phe Lys Ile Arg Met Tyr Val Gly Gly Val Glu His Arg Leu Thr
450 455 460
gcc gca tgc aat tgg act cga gga gag cgt tgc gat ttg gag gac agg 1440
Ala Ala Cys Asn Trp Thr Arg Gly Glu Arg Cys Asp Leu Glu Asp Arg
465 470 475 480
gat aga tcg gaa ctt agc cca ctg tta ctc tct aca acg gag tgg cag 1488
Asp Arg Ser Glu Leu Ser Pro Leu Leu Leu Ser Thr Thr Glu Trp Gln
485 490 495
ata ctg ccc tgt tcc ttc acc acc cta ccg gct ttg tcc act ggt ttg 1536
Ile Leu Pro Cys Ser Phe Thr Thr Leu Pro Ala Leu Ser Thr Gly Leu
500 505 510
att cat ctc cat cag aac att gtg gac gta caa tac ctg tac ggt gta 1584
Ile His Leu His Gln Asn Ile Val Asp Val Gln Tyr Leu Tyr Gly Val
515 520 525
ggg tca gcg gtt gtc tcc att gcg atc aaa tgg gag tac gtc gtg ctg 1632
Gly Ser Ala Val Val Ser Ile Ala Ile Lys Trp Glu Tyr Val Val Leu
530 535 540
ctc ttt ctc ctc ctg gcg gac gca cgc ttc tgc gcc tgc ttg tgg atg 1680
Leu Phe Leu Leu Leu Ala Asp Ala Arg Phe Cys Ala Cys Leu Trp Met
545 550 555 560
atg ctg ctg ata gcc cag gct gag gcc gcc tta gag aac ctg gtg atc 1728
Met Leu Leu Ile Ala Gln Ala Glu Ala Ala Leu Glu Asn Leu Val Ile
565 570 575
ctc aat gca gcg tcc gtg gcc gga gcc cgt ggc att ctc tcc ttc ctt 1776
Leu Asn Ala Ala Ser Val Ala Gly Ala Arg Gly Ile Leu Ser Phe Leu
580 585 590
gtg ttc ttc tgt gct gcc tgg tac atc aag ggc aaa ctg gtc cct ggg 1824
Val Phe Phe Cys Ala Ala Trp Tyr Ile Lys Gly Lys Leu Val Pro Gly
595 600 605
gcg gca tat gcc ctc tac ggt gta tgg ccg ctg ctt ctg ctc ctg ctg 1872
Ala Ala Tyr Ala Leu Tyr Gly Val Trp Pro Leu Leu Leu Leu Leu Leu
610 615 620
gcg tta cca cca cga gca tac gcc ttt gac cgg gaa atg gct gca tcg 1920
Ala Leu Pro Pro Arg Ala Tyr Ala Phe Asp Arg Glu Met Ala Ala Ser
625 630 635 640
tgc gga ggc gcg gtt ttc ata ggt ctg atg ctt ctg acc ttg tca cca 1968
Cys Gly Gly Ala Val Phe Ile Gly Leu Met Leu Leu Thr Leu Ser Pro
645 650 655
cac tat aag gca ctc ctc gcc agg ctt ata tgg tgg tta caa tat ttt 2016
His Tyr Lys Ala Leu Leu Ala Arg Leu Ile Trp Trp Leu Gln Tyr Phe
660 665 670
atc acc agg gcc gag gcg cac ttg caa gtg tgg atc ccc ccc ctt aac 2064
Ile Thr Arg Ala Glu Ala His Leu Gln Val Trp Ile Pro Pro Leu Asn
675 680 685
gtt cgg ggg ggc cgc gat gcc att atc ctc ctc aca tgt gcg gtc cat 2112
Val Arg Gly Gly Arg Asp Ala Ile Ile Leu Leu Thr Cys Ala Val His
690 695 700
tca gag cta att ttt gaa atc acc aaa atc ttg ctc gcc atg ctt ggt 2160
Ser Glu Leu Ile Phe Glu Ile Thr Lys Ile Leu Leu Ala Met Leu Gly
705 710 715 720
ccg ctc atg atg ctc cag gct ggc cta att aga gtg ccg tac ttt gtg 2208
Pro Leu Met Met Leu Gln Ala Gly Leu Ile Arg Val Pro Tyr Phe Val
725 730 735
cgc gct caa ggg ctt att cgtg cgtgctt 2237
Arg Ala Gln Gly Leu Ile
740
<210> 44
<211> 742
<212> PRT
<213> Hepatitis C virus
<400> 44
Leu Leu Ala Leu Leu Ser Cys Leu Thr Val Pro Thr Ser Ala Tyr Glu
1 5 10 15
Val Arg Asn Val Ser Gly Met Tyr Gln Val Thr Asn Asp Cys Ser Asn
20 25 30
Ser Ser Ile Val Tyr Glu Ala Ala Asp Val Ile Met His Thr Pro Gly
35 40 45
Cys Val Pro Cys Val Arg Glu Ser Asn Leu Ser Arg Cys Trp Val Ala
50 55 60
Leu Thr Pro Thr Leu Ala Ala Arg Asn Ser Ser Ile Pro Thr Thr Thr
65 70 75 80
Ile Arg Arg His Val Asp Leu Leu Val Gly Ala Ser Ala Phe Cys Ser
85 90 95
Ala Met Tyr Val Gly Asp Leu Cys Gly Ser Val Phe Leu Ile Ser Gln
100 105 110
Leu Phe Thr Phe Ser Pro Arg Arg Tyr Glu Thr Val Gln Asp Cys Asn
115 120 125
Cys Ser Leu Tyr Pro Gly His Val Ser Gly His Arg Met Ala Trp Asp
130 135 140
Met Met Met Asn Trp Ser Pro Thr Thr Ala Leu Val Val Ser Gln Leu
145 150 155 160
Leu Arg Ile Pro Gln Ala Ile Val Asp Met Val Thr Gly Ala His Trp
165 170 175
Gly Val Leu Ala Gly Leu Ala Tyr Tyr Ser Met Val Gly Asn Trp Ala
180 185 190
Lys Val Leu Ile Val Met Leu Leu Phe Ser Gly Val Asp Gly Ala Thr
195 200 205
Arg Leu Ser Gly Gly Ala Ala Gly Arg Asp Thr Arg Gly Phe Ala Ala
210 215 220
Leu Phe Gln Pro Gly Ser Ala Gln Asn Ile Gln Leu Ile Asn Ser Asn
225 230 235 240
Gly Ser Trp His Val Asn Arg Thr Ala Leu Asn Cys Asn Asp Thr Leu
245 250 255
His Thr Gly Phe Ile Ala Gly Leu Leu Tyr Thr Thr Lys Phe Asn Ser
260 265 270
Ser Gly Cys Pro Gly Arg Leu Ala Ser Cys Arg Pro Ile Asp Lys Phe
275 280 285
Ala Gln Gly Trp Gly Pro Ile Thr Tyr Ala Glu Pro Gly Ala Ser Asp
290 295 300
Gln Arg Pro Tyr Cys Trp His Tyr Ala Pro Arg Pro Cys Gly Ile Val
305 310 315 320
Pro Ala Ser Gln Val Cys Gly Pro Val Tyr Cys Phe Thr Pro Ser Pro
325 330 335
Val Val Val Gly Thr Thr Asp Arg Ser Gly Ala Pro Thr Tyr Thr Trp
340 345 350
Gly Glu Asn Glu Thr Asp Val Leu Leu Leu Asn Asn Thr Arg Pro Pro
355 360 365
Gln Gly Asn Trp Phe Gly Cys Thr Trp Met Asn Ser Thr Gly Phe Thr
370 375 380
Lys Thr Cys Gly Ala Pro Pro Cys Asn Ile Gly Gly Ser Gly Asn Asn
385 390 395 400
Thr Leu Ile Cys Pro Thr Asp Cys Phe Arg Lys His Pro Glu Ala Thr
405 410 415
Tyr Ile Lys Cys Gly Ser Gly Pro Trp Leu Thr Pro Arg Cys Leu Val
420 425 430
Asp Tyr Pro Tyr Arg Leu Trp His Tyr Pro Cys Thr Val Asn Phe Thr
435 440 445
Ile Phe Lys Ile Arg Met Tyr Val Gly Gly Val Glu His Arg Leu Thr
450 455 460
Ala Ala Cys Asn Trp Thr Arg Gly Glu Arg Cys Asp Leu Glu Asp Arg
465 470 475 480
Asp Arg Ser Glu Leu Ser Pro Leu Leu Leu Ser Thr Thr Glu Trp Gln
485 490 495
Ile Leu Pro Cys Ser Phe Thr Thr Leu Pro Ala Leu Ser Thr Gly Leu
500 505 510
Ile His Leu His Gln Asn Ile Val Asp Val Gln Tyr Leu Tyr Gly Val
515 520 525
Gly Ser Ala Val Val Ser Ile Ala Ile Lys Trp Glu Tyr Val Val Leu
530 535 540
Leu Phe Leu Leu Leu Ala Asp Ala Arg Phe Cys Ala Cys Leu Trp Met
545 550 555 560
Met Leu Leu Ile Ala Gln Ala Glu Ala Ala Leu Glu Asn Leu Val Ile
565 570 575
Leu Asn Ala Ala Ser Val Ala Gly Ala Arg Gly Ile Leu Ser Phe Leu
580 585 590
Val Phe Phe Cys Ala Ala Trp Tyr Ile Lys Gly Lys Leu Val Pro Gly
595 600 605
Ala Ala Tyr Ala Leu Tyr Gly Val Trp Pro Leu Leu Leu Leu Leu Leu
610 615 620
Ala Leu Pro Pro Arg Ala Tyr Ala Phe Asp Arg Glu Met Ala Ala Ser
625 630 635 640
Cys Gly Gly Ala Val Phe Ile Gly Leu Met Leu Leu Thr Leu Ser Pro
645 650 655
His Tyr Lys Ala Leu Leu Ala Arg Leu Ile Trp Trp Leu Gln Tyr Phe
660 665 670
Ile Thr Arg Ala Glu Ala His Leu Gln Val Trp Ile Pro Pro Leu Asn
675 680 685
Val Arg Gly Gly Arg Asp Ala Ile Ile Leu Leu Thr Cys Ala Val His
690 695 700
Ser Glu Leu Ile Phe Glu Ile Thr Lys Ile Leu Leu Ala Met Leu Gly
705 710 715 720
Pro Leu Met Met Leu Gln Ala Gly Leu Ile Arg Val Pro Tyr Phe Val
725 730 735
Arg Ala Gln Gly Leu Ile
740
<210> 45
<211> 490
<212> DNA
<213> Hepatitis C virus
<220>
<221> CDS
<222> (1)..(489)
<400> 45
ctc ttg gct ctg ctg tcc tgt ctg acc atc cca gct tcc gct tat gaa 48
Leu Leu Ala Leu Leu Ser Cys Leu Thr Ile Pro Ala Ser Ala Tyr Glu
1 5 10 15
gtg cgc aac gtg tcc gga ata tac cat gtc acg aac gac tgc tcc aac 96
Val Arg Asn Val Ser Gly Ile Tyr His Val Thr Asn Asp Cys Ser Asn
20 25 30
tca agc att gtg tat gag gca gcg gac gtg atc atg cat acc ccc ggg 144
Ser Ser Ile Val Tyr Glu Ala Ala Asp Val Ile Met His Thr Pro Gly
35 40 45
tgc gtg ccc tgt gtt cgg gag ggt aac gcc tcc cgt tgt tgg gca gcg 192
Cys Val Pro Cys Val Arg Glu Gly Asn Ala Ser Arg Cys Trp Ala Ala
50 55 60
ctc act ccc acg ctc gcg gtc ggg aat gcc agc gtc ccc act aag gca 240
Leu Thr Pro Thr Leu Ala Val Gly Asn Ala Ser Val Pro Thr Lys Ala
65 70 75 80
ata cgg cgc cac gtc gat ctg ctt gtt ggg acg gct gct ttc tgc tcc 288
Ile Arg Arg His Val Asp Leu Leu Val Gly Thr Ala Ala Phe Cys Ser
85 90 95
gcc atg tac gtg ggg gat ctc tgc gga tac atc gcc aaa ctc ctg ctc 336
Ala Met Tyr Val Gly Asp Leu Cys Gly Tyr Ile Ala Lys Leu Leu Leu
100 105 110
gcc aca ctc ggt ctg ctc atg gtg ctc cag gct gcc ata gct aga gtg 384
Ala Thr Leu Gly Leu Leu Met Val Leu Gln Ala Ala Ile Ala Arg Val
115 120 125
ccg tac ttc gta cgc act cag ggg ctc att cgt gtg tgt atg tta gtg 432
Pro Tyr Phe Val Arg Thr Gln Gly Leu Ile Arg Val Cys Met Leu Val
130 135 140
cgg aaa gtc gcc ggg ggt cac tat gcc cag atg gcc ttc atc aag ctg 480
Arg Lys Val Ala Gly Gly His Tyr Ala Gln Met Ala Phe Ile Lys Leu
145 150 155 160
gcc gca ctg a 490
Ala Ala Leu
<210> 46
<211> 163
<212> PRT
<213> Hepatitis C virus
<400> 46
Leu Leu Ala Leu Leu Ser Cys Leu Thr Ile Pro Ala Ser Ala Tyr Glu
1 5 10 15
Val Arg Asn Val Ser Gly Ile Tyr His Val Thr Asn Asp Cys Ser Asn
20 25 30
Ser Ser Ile Val Tyr Glu Ala Ala Asp Val Ile Met His Thr Pro Gly
35 40 45
Cys Val Pro Cys Val Arg Glu Gly Asn Ala Ser Arg Cys Trp Ala Ala
50 55 60
Leu Thr Pro Thr Leu Ala Val Gly Asn Ala Ser Val Pro Thr Lys Ala
65 70 75 80
Ile Arg Arg His Val Asp Leu Leu Val Gly Thr Ala Ala Phe Cys Ser
85 90 95
Ala Met Tyr Val Gly Asp Leu Cys Gly Tyr Ile Ala Lys Leu Leu Leu
100 105 110
Ala Thr Leu Gly Leu Leu Met Val Leu Gln Ala Ala Ile Ala Arg Val
115 120 125
Pro Tyr Phe Val Arg Thr Gln Gly Leu Ile Arg Val Cys Met Leu Val
130 135 140
Arg Lys Val Ala Gly Gly His Tyr Ala Gln Met Ala Phe Ile Lys Leu
145 150 155 160
Ala Ala Leu
<210> 47
<211> 1018
<212> DNA
<213> Hepatitis C virus
<220>
<221> CDS
<222> (229)..(1017)
<400> 47
ccaggacccc ccctcccggg agagccatag tggtctgcgg aaccggtgag tacaccggaa 60
ttgccaggac gaccgggtcc tttcttggat taacccgctc aatgcctgga gatttgggcg 120
tgcccccgcg agactgctag ccgagtagtg ttgggtcgcg aaaggccttg tggtactgcc 180
tgatagggtg cttgcgagtg ccccgggagg tctcgtagac cgtgcacc atg agc acg 237
Met Ser Thr
1
aat cct aaa ccc caa aga aaa acc aac cga aac acc aac cgc cgt cca 285
Asn Pro Lys Pro Gln Arg Lys Thr Asn Arg Asn Thr Asn Arg Arg Pro
5 10 15
cag gac gtt aag ttc ccg ggc ggt ggt cag atc gtc ggt gga gtt tac 333
Gln Asp Val Lys Phe Pro Gly Gly Gly Gln Ile Val Gly Gly Val Tyr
20 25 30 35
ctg ttg ccg cgc agg ggc ccc agg ttg ggt gtg cgc gcg act agg cag 381
Leu Leu Pro Arg Arg Gly Pro Arg Leu Gly Val Arg Ala Thr Arg Gln
40 45 50
act tcc gag cgg tcg cag cct cgt gga agg cga caa cct atc ccc aag 429
Thr Ser Glu Arg Ser Gln Pro Arg Gly Arg Arg Gln Pro Ile Pro Lys
55 60 65
gtt cgc cgg ccc gag ggc aga acc tgg gct cag ccc ggg tat cct tgg 477
Val Arg Arg Pro Glu Gly Arg Thr Trp Ala Gln Pro Gly Tyr Pro Trp
70 75 80
ccc ctc tat ggc aat gag ggc ttg ggg tgg gca gga tgg ctc ctg tca 525
Pro Leu Tyr Gly Asn Glu Gly Leu Gly Trp Ala Gly Trp Leu Leu Ser
85 90 95
ccc cgt ggc tcc cgg cct agt tgg ggc ccc acg gac ccc cgg cgt agg 573
Pro Arg Gly Ser Arg Pro Ser Trp Gly Pro Thr Asp Pro Arg Arg Arg
100 105 110 115
tcg cgt aat ttg ggt aag gtc atc gat acc ctc aca tgc ggc ttc gcc 621
Ser Arg Asn Leu Gly Lys Val Ile Asp Thr Leu Thr Cys Gly Phe Ala
120 125 130
gac ctc atg ggg tat att ccg ctt gtc ggc gcc cct tta gga ggc gct 669
Asp Leu Met Gly Tyr Ile Pro Leu Val Gly Ala Pro Leu Gly Gly Ala
135 140 145
gcc agg gcc ctg gca cat ggt gtc cgg gtt ctg gag gac ggc gtg aat 717
Ala Arg Ala Leu Ala His Gly Val Arg Val Leu Glu Asp Gly Val Asn
150 155 160
tct gca aca ggg aat ttg cct ggt tgc tct ttc tct atc ttc ctc ttg 765
Ser Ala Thr Gly Asn Leu Pro Gly Cys Ser Phe Ser Ile Phe Leu Leu
165 170 175
gct ctg ctg tcc tgt ttg acc atc cca gct tcc gct tat gaa gtg cgc 813
Ala Leu Leu Ser Cys Leu Thr Ile Pro Ala Ser Ala Tyr Glu Val Arg
180 185 190 195
acg tgc gcg gtc cat cca gag cca atc ttt gac atc acc aac ctc ctg 861
Thr Cys Ala Val His Pro Glu Pro Ile Phe Asp Ile Thr Asn Leu Leu
200 205 210
ctc gcc ata ctc ggc ccg ctc atg gtg ctc cag gct ggc ata act aga 909
Leu Ala Ile Leu Gly Pro Leu Met Val Leu Gln Ala Gly Ile Thr Arg
215 220 225
gtg ccg tac ttc gta cgc gct caa ggg ctc att cgt gca tgc atg tta 957
Val Pro Tyr Phe Val Arg Ala Gln Gly Leu Ile Arg Ala Cys Met Leu
230 235 240
gtg cgg aaa acg cct ggg ggt cat tat gtc caa atg gcc ctc atg agg 1005
Val Arg Lys Thr Pro Gly Gly His Tyr Val Gln Met Ala Leu Met Arg
245 250 255
ctg gcc gca ctg a 1018
Leu Ala Ala Leu
260
<210> 48
<211> 263
<212> PRT
<213> Hepatitis C virus
<400> 48
Met Ser Thr Asn Pro Lys Pro Gln Arg Lys Thr Asn Arg Asn Thr Asn
1 5 10 15
Arg Arg Pro Gln Asp Val Lys Phe Pro Gly Gly Gly Gln Ile Val Gly
20 25 30
Gly Val Tyr Leu Leu Pro Arg Arg Gly Pro Arg Leu Gly Val Arg Ala
35 40 45
Thr Arg Gln Thr Ser Glu Arg Ser Gln Pro Arg Gly Arg Arg Gln Pro
50 55 60
Ile Pro Lys Val Arg Arg Pro Glu Gly Arg Thr Trp Ala Gln Pro Gly
65 70 75 80
Tyr Pro Trp Pro Leu Tyr Gly Asn Glu Gly Leu Gly Trp Ala Gly Trp
85 90 95
Leu Leu Ser Pro Arg Gly Ser Arg Pro Ser Trp Gly Pro Thr Asp Pro
100 105 110
Arg Arg Arg Ser Arg Asn Leu Gly Lys Val Ile Asp Thr Leu Thr Cys
115 120 125
Gly Phe Ala Asp Leu Met Gly Tyr Ile Pro Leu Val Gly Ala Pro Leu
130 135 140
Gly Gly Ala Ala Arg Ala Leu Ala His Gly Val Arg Val Leu Glu Asp
145 150 155 160
Gly Val Asn Ser Ala Thr Gly Asn Leu Pro Gly Cys Ser Phe Ser Ile
165 170 175
Phe Leu Leu Ala Leu Leu Ser Cys Leu Thr Ile Pro Ala Ser Ala Tyr
180 185 190
Glu Val Arg Thr Cys Ala Val His Pro Glu Pro Ile Phe Asp Ile Thr
195 200 205
Asn Leu Leu Leu Ala Ile Leu Gly Pro Leu Met Val Leu Gln Ala Gly
210 215 220
Ile Thr Arg Val Pro Tyr Phe Val Arg Ala Gln Gly Leu Ile Arg Ala
225 230 235 240
Cys Met Leu Val Arg Lys Thr Pro Gly Gly His Tyr Val Gln Met Ala
245 250 255
Leu Met Arg Leu Ala Ala Leu
260
<210> 49
<211> 32
<212> DNA
<213> synthetic
<400> 49
cgcggatcct tagtcctcca gaacccggac ac 32
<210> 50
<211> 19
<212> DNA
<213> synthetic
<400> 50
tgcacggtct acgagacct 19
<210> 51
<211> 20
<212> DNA
<213> synthetic
<400> 51
tagtggtctg cggaaccggt 20
<210> 52
<211> 25
<212> DNA
<213> synthetic
<400> 52
gccgcatgta agggtatcga tgacc 25
<210> 53
<211> 25
<212> DNA
<213> synthetic
<400> 53
ggtcatcgat acccttacat gcggc 25
<210> 54
<211> 32
<212> DNA
<213> synthetic
<400> 54
gcgaattctt atcagaagaa ctcgtcaaga ag 32
<210> 55
<211> 26
<212> DNA
<213> synthetic
<400> 55
gcctattggc ctggagtgtt tagctc 26
<210> 56
<211> 29
<212> DNA
<213> synthetic
<400> 56
atggcgttag tatgagtgtc gtgcagcct 29
<210> 57
<211> 25
<212> DNA
<213> synthetic
<400> 57
agccgcatgt aagggtatcg atgac 25
<210> 58
<211> 23
<212> DNA
<213> synthetic
<400> 58
tggttcggct gyacatggat gaa 23
<210> 59
<211> 26
<212> DNA
<213> synthetic
<400> 59
ggrtagtgcc aragcctgta tgggta 26
<210> 60
<211> 31
<212> DNA
<213> synthetic
<400> 60
tcgggcacga gacaggctgt gatatatgtc t 31
<210> 61
<211> 30
<212> DNA
<213> synthetic
<400> 61
atcgtcttca cgcagaaagc gtctagccat 30
<210> 62
<211> 31
<212> DNA
<213> synthetic
<400> 62
gccagccccc tgatgggggc gacactccac c 31
<210> 63
<211> 30
<212> DNA
<213> synthetic
<400> 63
aatcatatgt ctttgaggtt taggatttgt 30
<210> 64
<211> 29
<212> DNA
<213> synthetic
<400> 64
gacatatgat tgaacaagat ggattgcac 29
<210> 65
<211> 33
<212> DNA
<213> synthetic
<400> 65
gtcctgcagg ccagccccct gatgggggcg aca 33
<210> 66
<211> 34
<212> DNA
<213> synthetic
<400> 66
gacctgcagg ttatcagaag aactcgtcaa gaag 34
<210> 67
<211> 50
<212> DNA
<213> synthetic
<400> 67
gccttaatta atacgactca ctataggcca gccccctgat gggggcgaca 50
<210> 68
<211> 76
<212> DNA
<213> Synthetic
<400> 68
tctagtcgac ggccagtgaa ttgtaatacg actcactata gggcggccag ccccctgatg 60
ggggcgacac tccacc 76
<210> 69
<211> 28
<212> DNA
<213> synthetic DNA
<400> 69
aggcctgtga agacgctctc ccagaact 28
<210> 70
<211> 21
<212> DNA
<213> synthetic DNA
<400> 70
ggtgatgacc ttggtctcca t 21
<210> 71
<211> 32
<212> DNA
<213> synthetic DNA
<400> 71
gcttagaggc tagtgatgat gcaaccaagt ac 32
<210> 72
<211> 24
<212> DNA
<213> synthetic DNA
<400> 72
ggcgaccgca tagtagtttc cata 24
<210> 73
<211> 32
<212> DNA
<213> synthetic DNA
<400> 73
ctggaggacg gcgtgaacta tgcaacaggg aa 32
<210> 74
<211> 32
<212> DNA
<213> synthetic DNA
<400> 74
ggaacttgcc cggttgctct ttctctatct tc 32
<210> 75
<211> 21
<212> DNA
<213> synthetic DNA
<400> 75
tggggcaaga tggttataaa c 21
<210> 76
<211> 31
<212> DNA
<213> synthetic DNA
<400> 76
ggggtaagat ggttataaac gtacgtacct g 31
<210> 77
<211> 31
<212> DNA
<213> synthetic DNA
<400> 77
ataatgaccc ccggcgactt tccgcactaa c 31
<210> 78
<211> 24
<212> DNA
<213> synthetic DNA
<400> 78
tgacatcagc atgtctcgtg acca 24
<210> 79
<211> 26
<212> DNA
<213> synthetic DNA
<400> 79
cttgaaaaag ccctggattg tcagat 26
<210> 80
<211> 20
<212> DNA
<213> synthetic DNA
<400> 80
acatgatctg cagagaggcc 20
<210> 81
<211> 26
<212> DNA
<213> synthetic DNA
<400> 81
ctacggggcc tgttactcca ttgaac 26
<210> 82
<211> 35
<212> DNA
<213> synthetic DNA
<400> 82
acatgatctg cagagaggcc agtatcagca ctctc 35
<210> 83
<211> 73
<212> DNA
<213> synthetic
<400> 83
tctagtcgac ggccagtgaa ttgtaatacg actcactcta gggcggcggg gtcgggcwcg 60
ngacabgctg tga 73
<210> 84
<211> 31
<212> DNA
<213> synthetic DNA
<400> 84
tctccattgg gctgaacacc acaggctcca c 31
<210> 85
<211> 31
<212> DNA
<213> synthetic DNA
<400> 85
ggggagaggt ggtcatagat gtaagtgccg g 31
<210> 86
<211> 30
<212> DNA
<213> synthetic DNA
<400> 86
catagatgta agtgccggtc cacctgccta 30
<210> 87
<211> 30
<212> DNA
<213> synthetic DNA
<400> 87
ctcctgcgag gtgtctcacc agggtacaca 30
<210> 88
<211> 29
<212> DNA
<213> synthetic DNA
<400> 88
agcagagcgt gagctctgac gaagtatgg 29
<210> 89
<211> 32
<212> DNA
<213> synthetic DNA
<400> 89
ggaatctacc cggttgctct ttttctatct tc 32
<210> 90
<211> 32
<212> DNA
<213> synthetic DNA
<400> 90
ctggaagacg ggataaatta tgcaacaggg aa 32
<210> 91
<211> 24
<212> DNA
<213> synthetic DNA
<400> 91
ctcgcaagca ccctatcagc cagt 24
<210> 92
<211> 20
<212> DNA
<213> synthetic DNA
<400> 92
aggcattgag cgggtttatc 20
<210> 93
<211> 38
<212> DNA
<213> synthetic DNA
<400> 93
ctagactcga gtcgacatcg tttttttttt tttttttt 38
<210> 94
<211> 20
<212> DNA
<213> synthetic DNA
<400> 94
atcttagccc tagtcacggc 20
<210> 95
<211> 20
<212> DNA
<213> synthetic DNA
<400> 95
ctagactcga gtcgacatcg 20
<210> 96
<211> 30
<212> DNA
<213> synthetic DNA
<400> 96
ctagctgtga aaggtccgtg agccgcatga 30
Claims (15)
- C형 간염 바이러스 유전자에 있어서, E1 단백 코드 영역의 일부분, E2 단백 코드 영역, P7 단백 코드 영역 및 NS2 단백 코드 영역의 일부분이 번역틀을 유지하면서 결실되어 있는 트런케이트 폼 C형 간염 바이러스 유전자.
- 제1항에 있어서, 5' 비번역 영역으로부터 구조 단백인 코어 단백을 코드하는 영역의 전부 또는 일부 및 비구조 단백인 NS2의 후반 2부분의 막 관통 영역을 코드하는 영역으로부터 3' 비번역 영역의 전부 또는 일부를 가지는 트런케이트 폼 C형 간염 바이러스 유전자.
- 제1항에 있어서, C형 간염 바이러스 유전자의 핵산 배열의 1번에서 914번의 전부 또는 일부의 배열 및 3001번 이후의 전부 또는 일부의 배열이 있는 트런케이트 폼 C형 간염 바이러스 유전자.
- 세포 중에서 자율적으로 복제하는 제1항 내지 제3항의 어느 하나의 항에 기재된 레프리콘 유전자.
- 제4항에 있어서, 선택 마커 유전자가 결합되어 있는 레프리콘 유전자.
- 제4항 또는 제5항에 기재된 레프리콘이 복제하는 세포.
- 제6항에 기재된 세포를 사용한 약제의 스크리닝 방법 또는 약효 평가 방법.
- 제1항 내지 제3항의 어느 한 항에 기재된 유전자를 넣은 벡터를 유지하고, 단백질을 발현하고 있는 세포.
- 제6항 또는 제8항에 기재된 레프리콘이 복제하고 있는 세포, 또는 세포가 산생하는 단백질을 사용하는 HCV의 진단 방법.
- 유전자의 결실을 검출하는 방법을 사용하여, C형 간염 바이러스의 트런케이트 폼 유전자를 검출하는 방법.
- C형 간염 바이러스 유전자의 핵산 배열의 1번 내지 914번 및 3001번 이후의 배열로부터 설계한 프라이머를 사용하여 PCR을 실시하고, 트런케이트 폼 유전자를 증폭함으로써 트런케이트 폼 유전자를 검출 또는 정량하는 방법.
- 트런케이트 폼 유전자 및 풀 렝쓰 폼 유전자의 공통 영역의 유전자의 측정과 트런케이트 폼 유전자의 결실된 영역의 유전자의 측정에 의하여 존재비를 정량하는 방법.
- 제1항 내지 제3항 중 어느 하나에 기재된 유전자를 유지하는 C형 간염 바이러스 입자 또는 바이러스 유사 입자.
- 제1항 내지 제3항 중 어느 하나에 기재된 유전자로부터 산생되는 C형 간염 바이러스의 폴리프로테인 및 폴리프로테인으로부터 프로세스된 단백.
- 제14항에 기재된 단백질을 특이적으로 인식하는 항체.
Applications Claiming Priority (6)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
JPJP-P-2004-00188543 | 2004-06-25 | ||
JP2004188543 | 2004-06-25 | ||
JP2004190144 | 2004-06-28 | ||
JPJP-P-2004-00190144 | 2004-06-28 | ||
JPJP-P-2004-00277677 | 2004-09-24 | ||
JP2004277677 | 2004-09-24 |
Publications (2)
Publication Number | Publication Date |
---|---|
KR20070024649A true KR20070024649A (ko) | 2007-03-02 |
KR100894150B1 KR100894150B1 (ko) | 2009-04-22 |
Family
ID=35781924
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
KR1020067027313A KR100894150B1 (ko) | 2004-06-25 | 2005-06-24 | 신규한 배열을 가진 hcv rna |
Country Status (5)
Country | Link |
---|---|
US (1) | US20090170063A1 (ko) |
EP (1) | EP1783218A4 (ko) |
JP (1) | JP5072361B2 (ko) |
KR (1) | KR100894150B1 (ko) |
WO (1) | WO2006001517A1 (ko) |
Families Citing this family (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20110136678A1 (en) * | 2006-10-20 | 2011-06-09 | Erwin Sablon | Methodology for analysis of sequence variations within the hcv ns3/4a genomic region |
KR101444694B1 (ko) * | 2009-05-25 | 2014-10-01 | 에스케이이노베이션 주식회사 | 연성금속박적층체 및 이의 제조방법 |
Family Cites Families (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5874565A (en) * | 1995-08-29 | 1999-02-23 | Washington University | Nucleic acids comprising a highly conserved novel 3 terminal sequence element of the hepatitis C virus |
ATE437951T1 (de) * | 1997-05-06 | 2009-08-15 | Novartis Vaccines & Diagnostic | Intrazelluläre herstellung von verkürztem hepatitis c-polypeptid e2 |
AU4335099A (en) * | 1998-06-18 | 2000-01-05 | Government Of The United States Of America, As Represented By The Secretary Of The Department Of Health And Human Services, The | Surface targeted expression of a modified hepatitis c virus envelope protein |
ES2373642T3 (es) * | 2000-05-23 | 2012-02-07 | Washington University | Variantes de vhc. |
-
2005
- 2005-06-24 US US11/630,374 patent/US20090170063A1/en not_active Abandoned
- 2005-06-24 KR KR1020067027313A patent/KR100894150B1/ko not_active IP Right Cessation
- 2005-06-24 WO PCT/JP2005/012162 patent/WO2006001517A1/ja active Application Filing
- 2005-06-24 JP JP2006528839A patent/JP5072361B2/ja not_active Expired - Fee Related
- 2005-06-24 EP EP05755873A patent/EP1783218A4/en not_active Withdrawn
Also Published As
Publication number | Publication date |
---|---|
KR100894150B1 (ko) | 2009-04-22 |
US20090170063A1 (en) | 2009-07-02 |
EP1783218A4 (en) | 2009-02-18 |
EP1783218A1 (en) | 2007-05-09 |
JP5072361B2 (ja) | 2012-11-14 |
WO2006001517A1 (ja) | 2006-01-05 |
JPWO2006001517A1 (ja) | 2008-04-17 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
EP1801209B1 (en) | Modified human hepatitis c virus genomic rna having autonomous replicative competence | |
US8754061B2 (en) | Nucleic acid construct containing a nucleic acid derived from the genome of hepatitis C virus (HCV) of genotype 2a, and a cell having such nucleic acid construct introduced therein | |
JP2013198486A (ja) | Hcv遺伝子 | |
JP5693957B2 (ja) | Hcv/gbv−bキメラウイルス | |
KR100894150B1 (ko) | 신규한 배열을 가진 hcv rna | |
EP1666598B1 (en) | Nucleic acid and gene originating in novel hcv strain and replicon-replicating cell using the gene | |
US20020160936A1 (en) | Hcv e2 protein binding agents for treatment of hepatitis c virus infection | |
US7790448B2 (en) | Nucleic acid and gene derived from novel HCV strain and replicon-replicating cell using said gene | |
Oniangue-Ndza | Development and characterization of subgenomic and full-length genome replicons based on the sequence of HCV AD78 strain | |
CN1973038A (zh) | 具有新的序列的丙型肝炎病毒rna |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
A201 | Request for examination | ||
E902 | Notification of reason for refusal | ||
E902 | Notification of reason for refusal | ||
E701 | Decision to grant or registration of patent right | ||
GRNT | Written decision to grant | ||
FPAY | Annual fee payment |
Payment date: 20130321 Year of fee payment: 5 |
|
FPAY | Annual fee payment |
Payment date: 20140319 Year of fee payment: 6 |
|
FPAY | Annual fee payment |
Payment date: 20160401 Year of fee payment: 8 |
|
LAPS | Lapse due to unpaid annual fee |