CA2251745A1 - Non-a-non-e hepatitis virus having a translatable core region, reagents and methods for their use - Google Patents
Non-a-non-e hepatitis virus having a translatable core region, reagents and methods for their use Download PDFInfo
- Publication number
- CA2251745A1 CA2251745A1 CA002251745A CA2251745A CA2251745A1 CA 2251745 A1 CA2251745 A1 CA 2251745A1 CA 002251745 A CA002251745 A CA 002251745A CA 2251745 A CA2251745 A CA 2251745A CA 2251745 A1 CA2251745 A1 CA 2251745A1
- Authority
- CA
- Canada
- Prior art keywords
- sequence
- ac1s
- gbv
- seq
- peptide
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Abandoned
Links
- 241000700605 Viruses Species 0.000 title claims abstract description 23
- 208000006454 hepatitis Diseases 0.000 title claims abstract description 23
- 231100000283 hepatitis Toxicity 0.000 title claims abstract description 19
- 238000000034 method Methods 0.000 title claims abstract description 11
- 239000003153 chemical reaction reagent Substances 0.000 title claims abstract description 9
- 108090000765 processed proteins & peptides Proteins 0.000 claims abstract description 51
- 102000004196 processed proteins & peptides Human genes 0.000 claims abstract description 28
- VKCYHJWLYTUGCC-UHFFFAOYSA-N nonan-2-one Chemical compound CCCCCCCC(C)=O VKCYHJWLYTUGCC-UHFFFAOYSA-N 0.000 claims description 34
- 235000001014 amino acid Nutrition 0.000 claims description 18
- 150000001413 amino acids Chemical class 0.000 claims description 18
- 108090000623 proteins and genes Proteins 0.000 claims description 15
- 102000004169 proteins and genes Human genes 0.000 claims description 15
- 235000018102 proteins Nutrition 0.000 claims description 14
- 125000003275 alpha amino acid group Chemical group 0.000 claims description 10
- 239000000203 mixture Substances 0.000 claims description 10
- 239000007790 solid phase Substances 0.000 claims description 9
- 230000015572 biosynthetic process Effects 0.000 claims description 5
- 125000000539 amino acid group Chemical group 0.000 claims description 2
- 238000009007 Diagnostic Kit Methods 0.000 claims 2
- 108091033319 polynucleotide Proteins 0.000 abstract description 2
- 102000040430 polynucleotide Human genes 0.000 abstract description 2
- 239000002157 polynucleotide Substances 0.000 abstract description 2
- 230000014616 translation Effects 0.000 description 35
- 238000013519 translation Methods 0.000 description 34
- ASKSDLRLUXEOOR-SUFRFZPQSA-N gbv-c Chemical compound C1=CC=C2C(C[C@@H](C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC=3C4=CC=CC=C4NC=3)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(O)=O)NC(=O)[C@@H](N)CC(C)C)=CNC2=C1 ASKSDLRLUXEOOR-SUFRFZPQSA-N 0.000 description 27
- 241000531123 GB virus C Species 0.000 description 26
- 108020004705 Codon Proteins 0.000 description 20
- 239000002773 nucleotide Substances 0.000 description 20
- 125000003729 nucleotide group Chemical group 0.000 description 20
- 150000007523 nucleic acids Chemical class 0.000 description 19
- 238000012217 deletion Methods 0.000 description 18
- 230000037430 deletion Effects 0.000 description 18
- 108020004707 nucleic acids Proteins 0.000 description 14
- 102000039446 nucleic acids Human genes 0.000 description 14
- 238000011144 upstream manufacturing Methods 0.000 description 13
- 108091081024 Start codon Proteins 0.000 description 12
- 238000006243 chemical reaction Methods 0.000 description 12
- 239000013612 plasmid Substances 0.000 description 12
- 108091032973 (ribonucleotides)n+m Proteins 0.000 description 10
- 241000711549 Hepacivirus C Species 0.000 description 10
- 108020004999 messenger RNA Proteins 0.000 description 10
- 101710132601 Capsid protein Proteins 0.000 description 9
- 238000012286 ELISA Assay Methods 0.000 description 8
- 241000725303 Human immunodeficiency virus Species 0.000 description 8
- 229920001213 Polysorbate 20 Polymers 0.000 description 8
- 238000003556 assay Methods 0.000 description 8
- 235000010486 polyoxyethylene sorbitan monolaurate Nutrition 0.000 description 8
- 239000000256 polyoxyethylene sorbitan monolaurate Substances 0.000 description 8
- 210000002966 serum Anatomy 0.000 description 8
- 108020003589 5' Untranslated Regions Proteins 0.000 description 7
- 238000000338 in vitro Methods 0.000 description 7
- KLBVGHCGHUNHEA-BJDJZHNGSA-N Ile-Leu-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(=O)O)N KLBVGHCGHUNHEA-BJDJZHNGSA-N 0.000 description 6
- 238000004458 analytical method Methods 0.000 description 6
- 230000001684 chronic effect Effects 0.000 description 6
- 230000000694 effects Effects 0.000 description 6
- 230000009257 reactivity Effects 0.000 description 6
- 230000001154 acute effect Effects 0.000 description 5
- 238000010276 construction Methods 0.000 description 5
- 125000001360 methionine group Chemical group N[C@@H](CCSC)C(=O)* 0.000 description 5
- 238000012163 sequencing technique Methods 0.000 description 5
- 108091035707 Consensus sequence Proteins 0.000 description 4
- 108020005350 Initiator Codon Proteins 0.000 description 4
- 101710144111 Non-structural protein 3 Proteins 0.000 description 4
- 108010076504 Protein Sorting Signals Proteins 0.000 description 4
- 230000000692 anti-sense effect Effects 0.000 description 4
- 238000002474 experimental method Methods 0.000 description 4
- 239000000499 gel Substances 0.000 description 4
- 239000003999 initiator Substances 0.000 description 4
- 102200028316 rs3211098 Human genes 0.000 description 4
- 238000002415 sodium dodecyl sulfate polyacrylamide gel electrophoresis Methods 0.000 description 4
- 238000003786 synthesis reaction Methods 0.000 description 4
- 230000014621 translational initiation Effects 0.000 description 4
- 230000003612 virological effect Effects 0.000 description 4
- 241000283707 Capra Species 0.000 description 3
- HLZORBMOISUNIV-DCAQKATOSA-N Met-Ser-Leu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC(C)C HLZORBMOISUNIV-DCAQKATOSA-N 0.000 description 3
- 108091028043 Nucleic acid sequence Proteins 0.000 description 3
- 108090001074 Nucleocapsid Proteins Proteins 0.000 description 3
- 108010076039 Polyproteins Proteins 0.000 description 3
- 102000036639 antigens Human genes 0.000 description 3
- 229940079593 drug Drugs 0.000 description 3
- 239000003814 drug Substances 0.000 description 3
- 230000000977 initiatory effect Effects 0.000 description 3
- 230000037431 insertion Effects 0.000 description 3
- 238000003780 insertion Methods 0.000 description 3
- 235000006109 methionine Nutrition 0.000 description 3
- 230000007170 pathology Effects 0.000 description 3
- 229920001184 polypeptide Polymers 0.000 description 3
- 238000010839 reverse transcription Methods 0.000 description 3
- 238000001179 sorption measurement Methods 0.000 description 3
- WZRJTRPJURQBRM-UHFFFAOYSA-N 4-amino-n-(5-methyl-1,2-oxazol-3-yl)benzenesulfonamide;5-[(3,4,5-trimethoxyphenyl)methyl]pyrimidine-2,4-diamine Chemical compound O1C(C)=CC(NS(=O)(=O)C=2C=CC(N)=CC=2)=N1.COC1=C(OC)C(OC)=CC(CC=2C(=NC(N)=NC=2)N)=C1 WZRJTRPJURQBRM-UHFFFAOYSA-N 0.000 description 2
- 208000030507 AIDS Diseases 0.000 description 2
- 241001502050 Acis Species 0.000 description 2
- 206010059193 Acute hepatitis B Diseases 0.000 description 2
- UZSQXCMNUPKLCC-FJXKBIBVSA-N Arg-Thr-Gly Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O UZSQXCMNUPKLCC-FJXKBIBVSA-N 0.000 description 2
- ISVACHFCVRKIDG-SRVKXCTJSA-N Arg-Val-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O ISVACHFCVRKIDG-SRVKXCTJSA-N 0.000 description 2
- VDCIPFYVCICPEC-FXQIFTODSA-N Asn-Arg-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(O)=O VDCIPFYVCICPEC-FXQIFTODSA-N 0.000 description 2
- 108090000565 Capsid Proteins Proteins 0.000 description 2
- BVKZGUZCCUSVTD-UHFFFAOYSA-L Carbonate Chemical compound [O-]C([O-])=O BVKZGUZCCUSVTD-UHFFFAOYSA-L 0.000 description 2
- 102100023321 Ceruloplasmin Human genes 0.000 description 2
- 206010008909 Chronic Hepatitis Diseases 0.000 description 2
- KCXVZYZYPLLWCC-UHFFFAOYSA-N EDTA Chemical compound OC(=O)CN(CC(O)=O)CCN(CC(O)=O)CC(O)=O KCXVZYZYPLLWCC-UHFFFAOYSA-N 0.000 description 2
- 208000005176 Hepatitis C Diseases 0.000 description 2
- SITWEMZOJNKJCH-UHFFFAOYSA-N L-alanine-L-arginine Natural products CC(N)C(=O)NC(C(O)=O)CCCNC(N)=N SITWEMZOJNKJCH-UHFFFAOYSA-N 0.000 description 2
- QXOHLNCNYLGICT-YFKPBYRVSA-N Met-Gly Chemical compound CSCC[C@H](N)C(=O)NCC(O)=O QXOHLNCNYLGICT-YFKPBYRVSA-N 0.000 description 2
- 108010002311 N-glycylglutamic acid Proteins 0.000 description 2
- 241000283973 Oryctolagus cuniculus Species 0.000 description 2
- NQRYJNQNLNOLGT-UHFFFAOYSA-N Piperidine Chemical compound C1CCNCC1 NQRYJNQNLNOLGT-UHFFFAOYSA-N 0.000 description 2
- FKLSMYYLJHYPHH-UWVGGRQHSA-N Pro-Gly-Leu Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O FKLSMYYLJHYPHH-UWVGGRQHSA-N 0.000 description 2
- ULWBBFKQBDNGOY-RWMBFGLXSA-N Pro-Lys-Pro Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CCCCN)C(=O)N2CCC[C@@H]2C(=O)O ULWBBFKQBDNGOY-RWMBFGLXSA-N 0.000 description 2
- MTCFGRXMJLQNBG-UHFFFAOYSA-N Serine Natural products OCC(N)C(O)=O MTCFGRXMJLQNBG-UHFFFAOYSA-N 0.000 description 2
- QAOWNCQODCNURD-UHFFFAOYSA-N Sulfuric acid Chemical compound OS(O)(=O)=O QAOWNCQODCNURD-UHFFFAOYSA-N 0.000 description 2
- XGJLNBNZNMVJRS-NRPADANISA-N Val-Glu-Ala Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O XGJLNBNZNMVJRS-NRPADANISA-N 0.000 description 2
- NLNCNKIVJPEFBC-DLOVCJGASA-N Val-Val-Glu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCC(O)=O NLNCNKIVJPEFBC-DLOVCJGASA-N 0.000 description 2
- 239000002253 acid Substances 0.000 description 2
- 231100000354 acute hepatitis Toxicity 0.000 description 2
- 208000037628 acute hepatitis B virus infection Diseases 0.000 description 2
- 108010076324 alanyl-glycyl-glycine Proteins 0.000 description 2
- 238000000627 alternating current impedance spectroscopy Methods 0.000 description 2
- 239000000427 antigen Substances 0.000 description 2
- 108091007433 antigens Proteins 0.000 description 2
- 239000000872 buffer Substances 0.000 description 2
- 239000002299 complementary DNA Substances 0.000 description 2
- 230000006835 compression Effects 0.000 description 2
- 238000007906 compression Methods 0.000 description 2
- 235000018417 cysteine Nutrition 0.000 description 2
- XUJNEKJLAYXESH-UHFFFAOYSA-N cysteine Natural products SCC(N)C(O)=O XUJNEKJLAYXESH-UHFFFAOYSA-N 0.000 description 2
- JYPCXBJRLBHWME-UHFFFAOYSA-N glycyl-L-prolyl-L-arginine Natural products NCC(=O)N1CCCC1C(=O)NC(CCCN=C(N)N)C(O)=O JYPCXBJRLBHWME-UHFFFAOYSA-N 0.000 description 2
- 108010025801 glycyl-prolyl-arginine Proteins 0.000 description 2
- 108010050848 glycylleucine Proteins 0.000 description 2
- 108010037850 glycylvaline Proteins 0.000 description 2
- 208000002672 hepatitis B Diseases 0.000 description 2
- 108010003700 lysyl aspartic acid Proteins 0.000 description 2
- 108010005942 methionylglycine Proteins 0.000 description 2
- 230000035772 mutation Effects 0.000 description 2
- 239000013642 negative control Substances 0.000 description 2
- 238000010647 peptide synthesis reaction Methods 0.000 description 2
- 239000011541 reaction mixture Substances 0.000 description 2
- 230000001105 regulatory effect Effects 0.000 description 2
- 210000001995 reticulocyte Anatomy 0.000 description 2
- 238000003757 reverse transcription PCR Methods 0.000 description 2
- 102220215119 rs1060503548 Human genes 0.000 description 2
- 229920006395 saturated elastomer Polymers 0.000 description 2
- 238000012216 screening Methods 0.000 description 2
- 238000002864 sequence alignment Methods 0.000 description 2
- 239000000758 substrate Substances 0.000 description 2
- 235000011149 sulphuric acid Nutrition 0.000 description 2
- 239000001117 sulphuric acid Substances 0.000 description 2
- 238000013518 transcription Methods 0.000 description 2
- 230000035897 transcription Effects 0.000 description 2
- GPRLSGONYQIRFK-MNYXATJNSA-N triton Chemical compound [3H+] GPRLSGONYQIRFK-MNYXATJNSA-N 0.000 description 2
- 239000013598 vector Substances 0.000 description 2
- MTCFGRXMJLQNBG-REOHCLBHSA-N (2S)-2-Amino-3-hydroxypropansäure Chemical compound OC[C@H](N)C(O)=O MTCFGRXMJLQNBG-REOHCLBHSA-N 0.000 description 1
- JNTMAZFVYNDPLB-PEDHHIEDSA-N (2S,3S)-2-[[[(2S)-1-[(2S,3S)-2-amino-3-methyl-1-oxopentyl]-2-pyrrolidinyl]-oxomethyl]amino]-3-methylpentanoic acid Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JNTMAZFVYNDPLB-PEDHHIEDSA-N 0.000 description 1
- PKOHVHWNGUHYRE-ZFWWWQNUSA-N (2s)-1-[2-[[(2s)-2-amino-3-(1h-indol-3-yl)propanoyl]amino]acetyl]pyrrolidine-2-carboxylic acid Chemical compound O=C([C@H](CC=1C2=CC=CC=C2NC=1)N)NCC(=O)N1CCC[C@H]1C(O)=O PKOHVHWNGUHYRE-ZFWWWQNUSA-N 0.000 description 1
- COEXAQSTZUWMRI-STQMWFEESA-N (2s)-1-[2-[[(2s)-2-amino-3-(4-hydroxyphenyl)propanoyl]amino]acetyl]pyrrolidine-2-carboxylic acid Chemical compound C([C@H](N)C(=O)NCC(=O)N1[C@@H](CCC1)C(O)=O)C1=CC=C(O)C=C1 COEXAQSTZUWMRI-STQMWFEESA-N 0.000 description 1
- AUXMWYRZQPIXCC-KNIFDHDWSA-N (2s)-2-amino-4-methylpentanoic acid;(2s)-2-aminopropanoic acid Chemical compound C[C@H](N)C(O)=O.CC(C)C[C@H](N)C(O)=O AUXMWYRZQPIXCC-KNIFDHDWSA-N 0.000 description 1
- OWQPOVKKUWUEKE-UHFFFAOYSA-N 1,2,3-benzotriazine Chemical class N1=NN=CC2=CC=CC=C21 OWQPOVKKUWUEKE-UHFFFAOYSA-N 0.000 description 1
- YEDUAINPPJYDJZ-UHFFFAOYSA-N 2-hydroxybenzothiazole Chemical compound C1=CC=C2SC(O)=NC2=C1 YEDUAINPPJYDJZ-UHFFFAOYSA-N 0.000 description 1
- 101800003921 7 kDa product Proteins 0.000 description 1
- WEVYAHXRMPXWCK-UHFFFAOYSA-N Acetonitrile Chemical compound CC#N WEVYAHXRMPXWCK-UHFFFAOYSA-N 0.000 description 1
- 206010065051 Acute hepatitis C Diseases 0.000 description 1
- BYXHQQCXAJARLQ-ZLUOBGJFSA-N Ala-Ala-Ala Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O BYXHQQCXAJARLQ-ZLUOBGJFSA-N 0.000 description 1
- UWQJHXKARZWDIJ-ZLUOBGJFSA-N Ala-Ala-Cys Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CS)C(O)=O UWQJHXKARZWDIJ-ZLUOBGJFSA-N 0.000 description 1
- IXTPACPAXIOCRG-ACZMJKKPSA-N Ala-Glu-Cys Chemical compound C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CS)C(=O)O)N IXTPACPAXIOCRG-ACZMJKKPSA-N 0.000 description 1
- WMYJZJRILUVVRG-WDSKDSINSA-N Ala-Gly-Gln Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(N)=O WMYJZJRILUVVRG-WDSKDSINSA-N 0.000 description 1
- VGPWRRFOPXVGOH-BYPYZUCNSA-N Ala-Gly-Gly Chemical compound C[C@H](N)C(=O)NCC(=O)NCC(O)=O VGPWRRFOPXVGOH-BYPYZUCNSA-N 0.000 description 1
- YNOCMHZSWJMGBB-GCJQMDKQSA-N Ala-Thr-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(O)=O YNOCMHZSWJMGBB-GCJQMDKQSA-N 0.000 description 1
- QOIGKCBMXUCDQU-KDXUFGMBSA-N Ala-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](C)N)O QOIGKCBMXUCDQU-KDXUFGMBSA-N 0.000 description 1
- MUGAESARFRGOTQ-IGNZVWTISA-N Ala-Tyr-Tyr Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O)N MUGAESARFRGOTQ-IGNZVWTISA-N 0.000 description 1
- KWKQGHSSNHPGOW-BQBZGAKWSA-N Arg-Ala-Gly Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(=O)NCC(O)=O KWKQGHSSNHPGOW-BQBZGAKWSA-N 0.000 description 1
- OMLWNBVRVJYMBQ-YUMQZZPRSA-N Arg-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O OMLWNBVRVJYMBQ-YUMQZZPRSA-N 0.000 description 1
- OVVUNXXROOFSIM-SDDRHHMPSA-N Arg-Arg-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CCCN=C(N)N)N)C(=O)O OVVUNXXROOFSIM-SDDRHHMPSA-N 0.000 description 1
- HQIZDMIGUJOSNI-IUCAKERBSA-N Arg-Gly-Arg Chemical compound N[C@@H](CCCNC(N)=N)C(=O)NCC(=O)N[C@@H](CCCNC(N)=N)C(O)=O HQIZDMIGUJOSNI-IUCAKERBSA-N 0.000 description 1
- PPPXVIBMLFWNSK-BQBZGAKWSA-N Arg-Gly-Cys Chemical compound C(C[C@@H](C(=O)NCC(=O)N[C@@H](CS)C(=O)O)N)CN=C(N)N PPPXVIBMLFWNSK-BQBZGAKWSA-N 0.000 description 1
- PHHRSPBBQUFULD-UWVGGRQHSA-N Arg-Gly-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CCCN=C(N)N)N PHHRSPBBQUFULD-UWVGGRQHSA-N 0.000 description 1
- WVNFNPGXYADPPO-BQBZGAKWSA-N Arg-Gly-Ser Chemical compound NC(N)=NCCC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O WVNFNPGXYADPPO-BQBZGAKWSA-N 0.000 description 1
- NKNILFJYKKHBKE-WPRPVWTQSA-N Arg-Gly-Val Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O NKNILFJYKKHBKE-WPRPVWTQSA-N 0.000 description 1
- NVCIXQYNWYTLDO-IHRRRGAJSA-N Arg-His-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CCCN=C(N)N)N NVCIXQYNWYTLDO-IHRRRGAJSA-N 0.000 description 1
- QCTOLCVIGRLMQS-HRCADAONSA-N Arg-Tyr-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=C(C=C2)O)NC(=O)[C@H](CCCN=C(N)N)N)C(=O)O QCTOLCVIGRLMQS-HRCADAONSA-N 0.000 description 1
- CPTXATAOUQJQRO-GUBZILKMSA-N Arg-Val-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O CPTXATAOUQJQRO-GUBZILKMSA-N 0.000 description 1
- 235000011330 Armoracia rusticana Nutrition 0.000 description 1
- 240000003291 Armoracia rusticana Species 0.000 description 1
- CMLGVVWQQHUXOZ-GHCJXIJMSA-N Asn-Ala-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O CMLGVVWQQHUXOZ-GHCJXIJMSA-N 0.000 description 1
- NPDLYUOYAGBHFB-WDSKDSINSA-N Asn-Arg Chemical compound NC(=O)C[C@H](N)C(=O)N[C@H](C(O)=O)CCCN=C(N)N NPDLYUOYAGBHFB-WDSKDSINSA-N 0.000 description 1
- RSMIHCFQDCVVBR-CIUDSAMLSA-N Asp-Gln-Arg Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(O)=O)CCCNC(N)=N RSMIHCFQDCVVBR-CIUDSAMLSA-N 0.000 description 1
- WBDWQKRLTVCDSY-WHFBIAKZSA-N Asp-Gly-Asp Chemical compound OC(=O)C[C@H](N)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O WBDWQKRLTVCDSY-WHFBIAKZSA-N 0.000 description 1
- OAMLVOVXNKILLQ-BQBZGAKWSA-N Asp-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](N)CC(O)=O OAMLVOVXNKILLQ-BQBZGAKWSA-N 0.000 description 1
- MGSVBZIBCCKGCY-ZLUOBGJFSA-N Asp-Ser-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O MGSVBZIBCCKGCY-ZLUOBGJFSA-N 0.000 description 1
- LTARLVHGOGBRHN-AAEUAGOBSA-N Asp-Trp-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)NCC(O)=O LTARLVHGOGBRHN-AAEUAGOBSA-N 0.000 description 1
- XWKBWZXGNXTDKY-ZKWXMUAHSA-N Asp-Val-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC(O)=O XWKBWZXGNXTDKY-ZKWXMUAHSA-N 0.000 description 1
- 101100194816 Caenorhabditis elegans rig-3 gene Proteins 0.000 description 1
- 208000000419 Chronic Hepatitis B Diseases 0.000 description 1
- 208000006154 Chronic hepatitis C Diseases 0.000 description 1
- ZSRSLWKGWFFVCM-WDSKDSINSA-N Cys-Pro Chemical compound SC[C@H](N)C(=O)N1CCC[C@H]1C(O)=O ZSRSLWKGWFFVCM-WDSKDSINSA-N 0.000 description 1
- SWJYSDXMTPMBHO-FXQIFTODSA-N Cys-Pro-Ser Chemical compound [H]N[C@@H](CS)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O SWJYSDXMTPMBHO-FXQIFTODSA-N 0.000 description 1
- NXQCSPVUPLUTJH-WHFBIAKZSA-N Cys-Ser-Gly Chemical compound SC[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O NXQCSPVUPLUTJH-WHFBIAKZSA-N 0.000 description 1
- 108020004414 DNA Proteins 0.000 description 1
- 208000001490 Dengue Diseases 0.000 description 1
- 206010012310 Dengue fever Diseases 0.000 description 1
- 241000710781 Flaviviridae Species 0.000 description 1
- 241001669573 Galeorhinus galeus Species 0.000 description 1
- ICDIMQAMJGDHSE-GUBZILKMSA-N Gln-His-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(O)=O ICDIMQAMJGDHSE-GUBZILKMSA-N 0.000 description 1
- JZDHUJAFXGNDSB-WHFBIAKZSA-N Glu-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(O)=O JZDHUJAFXGNDSB-WHFBIAKZSA-N 0.000 description 1
- KKCUFHUTMKQQCF-SRVKXCTJSA-N Glu-Arg-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O KKCUFHUTMKQQCF-SRVKXCTJSA-N 0.000 description 1
- KEBACWCLVOXFNC-DCAQKATOSA-N Glu-Arg-Met Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCSC)C(O)=O KEBACWCLVOXFNC-DCAQKATOSA-N 0.000 description 1
- WATXSTJXNBOHKD-LAEOZQHASA-N Glu-Asp-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O WATXSTJXNBOHKD-LAEOZQHASA-N 0.000 description 1
- DTLLNDVORUEOTM-WDCWCFNPSA-N Glu-Thr-Lys Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(O)=O DTLLNDVORUEOTM-WDCWCFNPSA-N 0.000 description 1
- VSVZIEVNUYDAFR-YUMQZZPRSA-N Gly-Ala-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)CN VSVZIEVNUYDAFR-YUMQZZPRSA-N 0.000 description 1
- RJIVPOXLQFJRTG-LURJTMIESA-N Gly-Arg-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)CN)CCCN=C(N)N RJIVPOXLQFJRTG-LURJTMIESA-N 0.000 description 1
- DTPOVRRYXPJJAZ-FJXKBIBVSA-N Gly-Arg-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCN=C(N)N DTPOVRRYXPJJAZ-FJXKBIBVSA-N 0.000 description 1
- HAXARWKYFIIHKD-ZKWXMUAHSA-N Gly-Ile-Ser Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O HAXARWKYFIIHKD-ZKWXMUAHSA-N 0.000 description 1
- DKEXFJVMVGETOO-LURJTMIESA-N Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CN DKEXFJVMVGETOO-LURJTMIESA-N 0.000 description 1
- VBOBNHSVQKKTOT-YUMQZZPRSA-N Gly-Lys-Ala Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O VBOBNHSVQKKTOT-YUMQZZPRSA-N 0.000 description 1
- VLIJYPMATZSOLL-YUMQZZPRSA-N Gly-Lys-Cys Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)CN VLIJYPMATZSOLL-YUMQZZPRSA-N 0.000 description 1
- NGRPGJGKJMUGDM-XVKPBYJWSA-N Gly-Val-Gln Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O NGRPGJGKJMUGDM-XVKPBYJWSA-N 0.000 description 1
- SYOJVRNQCXYEOV-XVKPBYJWSA-N Gly-Val-Glu Chemical compound [H]NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O SYOJVRNQCXYEOV-XVKPBYJWSA-N 0.000 description 1
- MUGLKCQHTUFLGF-WPRPVWTQSA-N Gly-Val-Met Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)CN MUGLKCQHTUFLGF-WPRPVWTQSA-N 0.000 description 1
- KSOBNUBCYHGUKH-UWVGGRQHSA-N Gly-Val-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)CN KSOBNUBCYHGUKH-UWVGGRQHSA-N 0.000 description 1
- 102000003886 Glycoproteins Human genes 0.000 description 1
- 108090000288 Glycoproteins Proteins 0.000 description 1
- 208000005331 Hepatitis D Diseases 0.000 description 1
- 241001529916 Hepatitis GB virus B Species 0.000 description 1
- 208000037262 Hepatitis delta Diseases 0.000 description 1
- FDQYIRHBVVUTJF-ZETCQYMHSA-N His-Gly-Gly Chemical compound [O-]C(=O)CNC(=O)CNC(=O)[C@@H]([NH3+])CC1=CN=CN1 FDQYIRHBVVUTJF-ZETCQYMHSA-N 0.000 description 1
- UOYGZBIPZYKGSH-SRVKXCTJSA-N His-Ser-Lys Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)O)N UOYGZBIPZYKGSH-SRVKXCTJSA-N 0.000 description 1
- 108010001336 Horseradish Peroxidase Proteins 0.000 description 1
- CSQNHSGHAPRGPQ-YTFOTSKYSA-N Ile-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCCN)C(=O)O)N CSQNHSGHAPRGPQ-YTFOTSKYSA-N 0.000 description 1
- JWBXCSQZLLIOCI-GUBZILKMSA-N Ile-Leu Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@H](C(O)=O)CC(C)C JWBXCSQZLLIOCI-GUBZILKMSA-N 0.000 description 1
- RCMNUBZKIIJCOI-ZPFDUUQYSA-N Ile-Met-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N RCMNUBZKIIJCOI-ZPFDUUQYSA-N 0.000 description 1
- IVXJIMGDOYRLQU-XUXIUFHCSA-N Ile-Pro-Leu Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O IVXJIMGDOYRLQU-XUXIUFHCSA-N 0.000 description 1
- PELCGFMHLZXWBQ-BJDJZHNGSA-N Ile-Ser-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)O)N PELCGFMHLZXWBQ-BJDJZHNGSA-N 0.000 description 1
- 241000976924 Inca Species 0.000 description 1
- DEFJQIDDEAULHB-IMJSIDKUSA-N L-alanyl-L-alanine Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(O)=O DEFJQIDDEAULHB-IMJSIDKUSA-N 0.000 description 1
- SENJXOPIZNYLHU-UHFFFAOYSA-N L-leucyl-L-arginine Natural products CC(C)CC(N)C(=O)NC(C(O)=O)CCCN=C(N)N SENJXOPIZNYLHU-UHFFFAOYSA-N 0.000 description 1
- KFKWRHQBZQICHA-STQMWFEESA-N L-leucyl-L-phenylalanine Natural products CC(C)C[C@H](N)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 KFKWRHQBZQICHA-STQMWFEESA-N 0.000 description 1
- FFEARJCKVFRZRR-BYPYZUCNSA-N L-methionine Chemical compound CSCC[C@H](N)C(O)=O FFEARJCKVFRZRR-BYPYZUCNSA-N 0.000 description 1
- AYFVYJQAPQTCCC-GBXIJSLDSA-N L-threonine Chemical compound C[C@@H](O)[C@H](N)C(O)=O AYFVYJQAPQTCCC-GBXIJSLDSA-N 0.000 description 1
- TYYLDKGBCJGJGW-UHFFFAOYSA-N L-tryptophan-L-tyrosine Natural products C=1NC2=CC=CC=C2C=1CC(N)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 TYYLDKGBCJGJGW-UHFFFAOYSA-N 0.000 description 1
- QCSFMCFHVGTLFF-NHCYSSNCSA-N Leu-Asp-Val Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O QCSFMCFHVGTLFF-NHCYSSNCSA-N 0.000 description 1
- YOKVEHGYYQEQOP-QWRGUYRKSA-N Leu-Leu-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O YOKVEHGYYQEQOP-QWRGUYRKSA-N 0.000 description 1
- DNDWZFHLZVYOGF-KKUMJFAQSA-N Leu-Leu-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O DNDWZFHLZVYOGF-KKUMJFAQSA-N 0.000 description 1
- UBZGNBKMIJHOHL-BZSNNMDCSA-N Leu-Leu-Phe Chemical compound CC(C)C[C@H]([NH3+])C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C([O-])=O)CC1=CC=CC=C1 UBZGNBKMIJHOHL-BZSNNMDCSA-N 0.000 description 1
- IEWBEPKLKUXQBU-VOAKCMCISA-N Leu-Leu-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O IEWBEPKLKUXQBU-VOAKCMCISA-N 0.000 description 1
- UCNNZELZXFXXJQ-BZSNNMDCSA-N Leu-Leu-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 UCNNZELZXFXXJQ-BZSNNMDCSA-N 0.000 description 1
- AEDWWMMHUGYIFD-HJGDQZAQSA-N Leu-Thr-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(O)=O AEDWWMMHUGYIFD-HJGDQZAQSA-N 0.000 description 1
- HIIZIQUUHIXUJY-GUBZILKMSA-N Lys-Asp-Gln Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O HIIZIQUUHIXUJY-GUBZILKMSA-N 0.000 description 1
- NDSNUWJPZKTFAR-DCAQKATOSA-N Lys-Cys-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@H](CS)NC(=O)[C@@H](N)CCCCN NDSNUWJPZKTFAR-DCAQKATOSA-N 0.000 description 1
- 101710151833 Movement protein TGBp3 Proteins 0.000 description 1
- SITLTJHOQZFJGG-UHFFFAOYSA-N N-L-alpha-glutamyl-L-valine Natural products CC(C)C(C(O)=O)NC(=O)C(N)CCC(O)=O SITLTJHOQZFJGG-UHFFFAOYSA-N 0.000 description 1
- 108010079364 N-glycylalanine Proteins 0.000 description 1
- 108091092724 Noncoding DNA Proteins 0.000 description 1
- 108700026244 Open Reading Frames Proteins 0.000 description 1
- 238000012408 PCR amplification Methods 0.000 description 1
- 238000010222 PCR analysis Methods 0.000 description 1
- 241000682990 Pegivirus A Species 0.000 description 1
- CDNPIRSCAFMMBE-SRVKXCTJSA-N Phe-Asn-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O CDNPIRSCAFMMBE-SRVKXCTJSA-N 0.000 description 1
- YTILBRIUASDGBL-BZSNNMDCSA-N Phe-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 YTILBRIUASDGBL-BZSNNMDCSA-N 0.000 description 1
- AFNJAQVMTIQTCB-DLOVCJGASA-N Phe-Ser-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CC=CC=C1 AFNJAQVMTIQTCB-DLOVCJGASA-N 0.000 description 1
- LNLNHXIQPGKRJQ-SRVKXCTJSA-N Pro-Arg-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@@H]1CCCN1 LNLNHXIQPGKRJQ-SRVKXCTJSA-N 0.000 description 1
- GRIRJQGZZJVANI-CYDGBPFRSA-N Pro-Arg-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@@H]1CCCN1 GRIRJQGZZJVANI-CYDGBPFRSA-N 0.000 description 1
- KPDRZQUWJKTMBP-DCAQKATOSA-N Pro-Asp-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@@H]1CCCN1 KPDRZQUWJKTMBP-DCAQKATOSA-N 0.000 description 1
- DXTOOBDIIAJZBJ-BQBZGAKWSA-N Pro-Gly-Ser Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](CO)C(O)=O DXTOOBDIIAJZBJ-BQBZGAKWSA-N 0.000 description 1
- QEWBZBLXDKIQPS-STQMWFEESA-N Pro-Gly-Tyr Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O QEWBZBLXDKIQPS-STQMWFEESA-N 0.000 description 1
- HAEGAELAYWSUNC-WPRPVWTQSA-N Pro-Gly-Val Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O HAEGAELAYWSUNC-WPRPVWTQSA-N 0.000 description 1
- QGLFRQCECIWXFA-RCWTZXSCSA-N Pro-Met-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@@H]1CCCN1)O QGLFRQCECIWXFA-RCWTZXSCSA-N 0.000 description 1
- HOTVCUAVDQHUDB-UFYCRDLUSA-N Pro-Phe-Tyr Chemical compound C([C@@H](C(=O)O)NC(=O)[C@H](CC=1C=CC=CC=1)NC(=O)[C@H]1NCCC1)C1=CC=C(O)C=C1 HOTVCUAVDQHUDB-UFYCRDLUSA-N 0.000 description 1
- JLMZKEQFMVORMA-SRVKXCTJSA-N Pro-Pro-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 JLMZKEQFMVORMA-SRVKXCTJSA-N 0.000 description 1
- PKHDJFHFMGQMPS-RCWTZXSCSA-N Pro-Thr-Arg Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O PKHDJFHFMGQMPS-RCWTZXSCSA-N 0.000 description 1
- CHYAYDLYYIJCKY-OSUNSFLBSA-N Pro-Thr-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O CHYAYDLYYIJCKY-OSUNSFLBSA-N 0.000 description 1
- QKWYXRPICJEQAJ-KJEVXHAQSA-N Pro-Tyr-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@@H]2CCCN2)O QKWYXRPICJEQAJ-KJEVXHAQSA-N 0.000 description 1
- MTMJNKFZDQEVSY-BZSNNMDCSA-N Pro-Val-Trp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O MTMJNKFZDQEVSY-BZSNNMDCSA-N 0.000 description 1
- 101710195279 Putative capsid protein Proteins 0.000 description 1
- 101710185720 Putative ethidium bromide resistance protein Proteins 0.000 description 1
- 108010079005 RDV peptide Proteins 0.000 description 1
- 241000288961 Saguinus imperator Species 0.000 description 1
- SSJMZMUVNKEENT-IMJSIDKUSA-N Ser-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H](N)CO SSJMZMUVNKEENT-IMJSIDKUSA-N 0.000 description 1
- HQTKVSCNCDLXSX-BQBZGAKWSA-N Ser-Arg-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O HQTKVSCNCDLXSX-BQBZGAKWSA-N 0.000 description 1
- QYSFWUIXDFJUDW-DCAQKATOSA-N Ser-Leu-Arg Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O QYSFWUIXDFJUDW-DCAQKATOSA-N 0.000 description 1
- ZGFRMNZZTOVBOU-CIUDSAMLSA-N Ser-Met-Gln Chemical compound N[C@@H](CO)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCC(N)=O)C(=O)O ZGFRMNZZTOVBOU-CIUDSAMLSA-N 0.000 description 1
- NUEHQDHDLDXCRU-GUBZILKMSA-N Ser-Pro-Arg Chemical compound OC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCN=C(N)N)C(O)=O NUEHQDHDLDXCRU-GUBZILKMSA-N 0.000 description 1
- HHJFMHQYEAAOBM-ZLUOBGJFSA-N Ser-Ser-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O HHJFMHQYEAAOBM-ZLUOBGJFSA-N 0.000 description 1
- PURRNJBBXDDWLX-ZDLURKLDSA-N Ser-Thr-Gly Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CO)N)O PURRNJBBXDDWLX-ZDLURKLDSA-N 0.000 description 1
- 241000872201 Sorindeia Species 0.000 description 1
- 101710137500 T7 RNA polymerase Proteins 0.000 description 1
- DFTCYYILCSQGIZ-GCJQMDKQSA-N Thr-Ala-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(O)=O DFTCYYILCSQGIZ-GCJQMDKQSA-N 0.000 description 1
- NLJKZUGAIIRWJN-LKXGYXEUSA-N Thr-Asp-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CS)C(=O)O)N)O NLJKZUGAIIRWJN-LKXGYXEUSA-N 0.000 description 1
- XFTYVCHLARBHBQ-FOHZUACHSA-N Thr-Gly-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O XFTYVCHLARBHBQ-FOHZUACHSA-N 0.000 description 1
- LUMXICQAOKVQOB-YWIQKCBGSA-N Thr-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H](N)[C@@H](C)O LUMXICQAOKVQOB-YWIQKCBGSA-N 0.000 description 1
- LCCSEJSPBWKBNT-OSUNSFLBSA-N Thr-Ile-Met Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H]([C@@H](C)O)N LCCSEJSPBWKBNT-OSUNSFLBSA-N 0.000 description 1
- DSGIVWSDDRDJIO-ZXXMMSQZSA-N Thr-Thr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O DSGIVWSDDRDJIO-ZXXMMSQZSA-N 0.000 description 1
- YRJOLUDFVAUXLI-GSSVUCPTSA-N Thr-Thr-Asp Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(O)=O YRJOLUDFVAUXLI-GSSVUCPTSA-N 0.000 description 1
- ZESGVALRVJIVLZ-VFCFLDTKSA-N Thr-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@@H]1C(=O)O)N)O ZESGVALRVJIVLZ-VFCFLDTKSA-N 0.000 description 1
- PELIQFPESHBTMA-WLTAIBSBSA-N Thr-Tyr-Gly Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CC=C(O)C=C1 PELIQFPESHBTMA-WLTAIBSBSA-N 0.000 description 1
- AYFVYJQAPQTCCC-UHFFFAOYSA-N Threonine Natural products CC(O)C(N)C(O)=O AYFVYJQAPQTCCC-UHFFFAOYSA-N 0.000 description 1
- 239000004473 Threonine Substances 0.000 description 1
- MWUYSCVVPVITMW-IGNZVWTISA-N Tyr-Tyr-Ala Chemical compound C([C@@H](C(=O)N[C@@H](C)C(O)=O)NC(=O)[C@@H](N)CC=1C=CC(O)=CC=1)C1=CC=C(O)C=C1 MWUYSCVVPVITMW-IGNZVWTISA-N 0.000 description 1
- BQASAMYRHNCKQE-IHRRRGAJSA-N Tyr-Val-Cys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N BQASAMYRHNCKQE-IHRRRGAJSA-N 0.000 description 1
- SLLKXDSRVAOREO-KZVJFYERSA-N Val-Ala-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](C)NC(=O)[C@H](C(C)C)N)O SLLKXDSRVAOREO-KZVJFYERSA-N 0.000 description 1
- BMGOFDMKDVVGJG-NHCYSSNCSA-N Val-Asp-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N BMGOFDMKDVVGJG-NHCYSSNCSA-N 0.000 description 1
- YQMILNREHKTFBS-IHRRRGAJSA-N Val-Phe-Cys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CS)C(=O)O)N YQMILNREHKTFBS-IHRRRGAJSA-N 0.000 description 1
- XBJKAZATRJBDCU-GUBZILKMSA-N Val-Pro-Ala Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O XBJKAZATRJBDCU-GUBZILKMSA-N 0.000 description 1
- UEXPMFIAZZHEAD-HSHDSVGOSA-N Val-Thr-Trp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](C(C)C)N)O UEXPMFIAZZHEAD-HSHDSVGOSA-N 0.000 description 1
- 108700010756 Viral Polyproteins Proteins 0.000 description 1
- 208000037621 acute hepatitis C virus infection Diseases 0.000 description 1
- 108010056243 alanylalanine Proteins 0.000 description 1
- 230000003321 amplification Effects 0.000 description 1
- 230000000890 antigenic effect Effects 0.000 description 1
- 108010069926 arginyl-glycyl-serine Proteins 0.000 description 1
- 108010068380 arginylarginine Proteins 0.000 description 1
- 108010060035 arginylproline Proteins 0.000 description 1
- 108010092854 aspartyllysine Proteins 0.000 description 1
- 210000004369 blood Anatomy 0.000 description 1
- 239000008280 blood Substances 0.000 description 1
- 210000004899 c-terminal region Anatomy 0.000 description 1
- 238000012512 characterization method Methods 0.000 description 1
- 239000003795 chemical substances by application Substances 0.000 description 1
- 238000003776 cleavage reaction Methods 0.000 description 1
- 238000010367 cloning Methods 0.000 description 1
- 238000005859 coupling reaction Methods 0.000 description 1
- 150000001944 cysteine derivatives Chemical group 0.000 description 1
- 108010016616 cysteinylglycine Proteins 0.000 description 1
- 108010060199 cysteinylproline Proteins 0.000 description 1
- 230000003247 decreasing effect Effects 0.000 description 1
- 208000025729 dengue disease Diseases 0.000 description 1
- 238000013461 design Methods 0.000 description 1
- 238000003745 diagnosis Methods 0.000 description 1
- 230000029087 digestion Effects 0.000 description 1
- 108010054812 diprotin A Proteins 0.000 description 1
- 230000037433 frameshift Effects 0.000 description 1
- VOWCXSYDBWCCCF-QIBOXNKSSA-N gbv-b Chemical compound C([C@H](NC(=O)[C@H](CCC(N)=O)NC(=O)[C@H](CC=1C2=CC=CC=C2NC=1)NC(=O)[C@@H](N)[C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC=1C=CC=CC=1)C(=O)N[C@@H](CC=1C=CC=CC=1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC=1C=CC=CC=1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O)C1=CC=C(O)C=C1 VOWCXSYDBWCCCF-QIBOXNKSSA-N 0.000 description 1
- 108010049041 glutamylalanine Proteins 0.000 description 1
- VPZXBVLAVMBEQI-UHFFFAOYSA-N glycyl-DL-alpha-alanine Natural products OC(=O)C(C)NC(=O)CN VPZXBVLAVMBEQI-UHFFFAOYSA-N 0.000 description 1
- 108010000434 glycyl-alanyl-leucine Proteins 0.000 description 1
- STKYPAFSDFAEPH-LURJTMIESA-N glycylvaline Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)CN STKYPAFSDFAEPH-LURJTMIESA-N 0.000 description 1
- 208000005252 hepatitis A Diseases 0.000 description 1
- 208000010710 hepatitis C virus infection Diseases 0.000 description 1
- 208000029570 hepatitis D virus infection Diseases 0.000 description 1
- 206010073071 hepatocellular carcinoma Diseases 0.000 description 1
- 231100000844 hepatocellular carcinoma Toxicity 0.000 description 1
- 238000009396 hybridization Methods 0.000 description 1
- NPZTUJOABDZTLV-UHFFFAOYSA-N hydroxybenzotriazole Substances O=C1C=CC=C2NNN=C12 NPZTUJOABDZTLV-UHFFFAOYSA-N 0.000 description 1
- 238000011534 incubation Methods 0.000 description 1
- 208000015181 infectious disease Diseases 0.000 description 1
- 150000002500 ions Chemical class 0.000 description 1
- 108010083708 leucyl-aspartyl-valine Proteins 0.000 description 1
- 108010049589 leucyl-leucyl-leucine Proteins 0.000 description 1
- 108010057821 leucylproline Proteins 0.000 description 1
- 208000019423 liver disease Diseases 0.000 description 1
- 239000012160 loading buffer Substances 0.000 description 1
- 239000006166 lysate Substances 0.000 description 1
- 238000013507 mapping Methods 0.000 description 1
- PSGAAPLEWMOORI-PEINSRQWSA-N medroxyprogesterone acetate Chemical compound C([C@@]12C)CC(=O)C=C1[C@@H](C)C[C@@H]1[C@@H]2CC[C@]2(C)[C@@](OC(C)=O)(C(C)=O)CC[C@H]21 PSGAAPLEWMOORI-PEINSRQWSA-N 0.000 description 1
- 229930182817 methionine Natural products 0.000 description 1
- 238000012544 monitoring process Methods 0.000 description 1
- 238000003199 nucleic acid amplification method Methods 0.000 description 1
- 230000003287 optical effect Effects 0.000 description 1
- 125000000538 pentafluorophenyl group Chemical group FC1=C(F)C(F)=C(*)C(F)=C1F 0.000 description 1
- 108010051242 phenylalanylserine Proteins 0.000 description 1
- 229920002401 polyacrylamide Polymers 0.000 description 1
- 238000003752 polymerase chain reaction Methods 0.000 description 1
- 239000002243 precursor Substances 0.000 description 1
- 108010077112 prolyl-proline Proteins 0.000 description 1
- 108010087846 prolyl-prolyl-glycine Proteins 0.000 description 1
- 108010079317 prolyl-tyrosine Proteins 0.000 description 1
- 108010029020 prolylglycine Proteins 0.000 description 1
- 238000001243 protein synthesis Methods 0.000 description 1
- 238000009877 rendering Methods 0.000 description 1
- 230000010076 replication Effects 0.000 description 1
- 230000003362 replicative effect Effects 0.000 description 1
- 239000011347 resin Substances 0.000 description 1
- 229920005989 resin Polymers 0.000 description 1
- 102200091804 rs104894738 Human genes 0.000 description 1
- 230000007017 scission Effects 0.000 description 1
- 108010026333 seryl-proline Proteins 0.000 description 1
- 238000006467 substitution reaction Methods 0.000 description 1
- 230000005026 transcription initiation Effects 0.000 description 1
- 108010044292 tryptophyltyrosine Proteins 0.000 description 1
- 108010020532 tyrosyl-proline Proteins 0.000 description 1
- 238000005406 washing Methods 0.000 description 1
- 239000002023 wood Substances 0.000 description 1
Classifications
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K14/00—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
- C07K14/005—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from viruses
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N2770/00—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA ssRNA viruses positive-sense
- C12N2770/00011—Details
- C12N2770/24011—Flaviviridae
- C12N2770/24211—Hepacivirus, e.g. hepatitis C virus, hepatitis G virus
- C12N2770/24222—New viral proteins or individual genes, new structural or functional aspects of known viral proteins or genes
Landscapes
- Chemical & Material Sciences (AREA)
- Organic Chemistry (AREA)
- Health & Medical Sciences (AREA)
- Life Sciences & Earth Sciences (AREA)
- Biophysics (AREA)
- Biochemistry (AREA)
- Gastroenterology & Hepatology (AREA)
- General Health & Medical Sciences (AREA)
- Genetics & Genomics (AREA)
- Medicinal Chemistry (AREA)
- Molecular Biology (AREA)
- Proteomics, Peptides & Aminoacids (AREA)
- Virology (AREA)
- Peptides Or Proteins (AREA)
Abstract
It is described a non-A-non-E hepatitis virus (GBV hepatitis virus) having a tra nslatable functional core region, polynucleotides, peptides, antibodies, diagnostics and reagents thereof and methods for their use .
Description
NON-A-NON- E HEPATITIS VIRUS HAVING A TRANSLATABLE CORE REGION, REAGENTS AND METHODS FOR THEIR USE
The invention concerns nonA-nonE hepatitis virus (GBV hepatitis virus) having a translatable 5' core region, polynucleotides, peptides, antibodies, diagnostics and reagents thereof and methods for their use.
Simons J.N. et al. (Nature Medicine, 1, 564, l995) revealed a viral genome, Flavivirus-like, named GBV-C, in a small percentage of nonA-nonE hepatitis affected patient sera. The genome has some homology with genomes from two virus, GBV-A a GBV-B, identified in tamarins (Simons J.N. et al., Proc. Natl. Acad. Sci. USA, 92, 340l, I995). Said virus represent a phylogenetic separate group, not comprising hepatitis C virus (HCV).
PCT patent application N. W095/21922, in the name of Abbott Lab., shows a partial nucleotide sequence of a GHV-C isolate. A method to reveal antibodies against GBV
is claimed (claim 44, p. 6l8); however no enabling disclosure for such method able to detect with a statistic significative percentage positive samples from nonA-nonE patients is provided. PCT patent application N.
W095/21922 does not disclose aminoacid sequences to be effectively used to design diagnostic peptides. As a matter of fact, Figure 39 shows a panel of proteins, having an around 30d aminoacid minimal lenght. Only 2 out 512 sera from drug-addicted subjects result to be positive for these proteins in an ELISA assay. Moreover at p. 168, table 23, ELISA assays on nonA-nonE patients result to be positive in very few samples, thus rendering the assay not workable: 0/89 Japan subjects; 0/67 Greece subjects; 6/72 USA (set M) subjects; 1/64 USA (set T) subjects; 3/62 USA (set 1/3) subjects; 18/32 Egypt subjects. At p. 169, table 24, only 21 out 303 nonA-nonE
patients are positive with an ELISA assay using peptides sequence. It is evident that said assay may not be used neither for a screening of healthy subjects, nor for diagnosis.
PCT patent application N. W095/32291, in the name of Genelabs, identifies by ~epitope mapping different antigenic regions of the GBV polyprotein. However no peptides or a combination thereof for a diagnostic assay are disclosed.
EP patent application N. 73660l, in the name of Abbott Lab., discloses 88 nucleotides upstream of the sequence shown in the PCT patent application N.
W095/21922. However no evidence is provided for an open reading frame in this region able to code for a core-like protein.
The nucleocapsid/El junction of the viral isolate was identified by Leary, T. P., et al. 1996. J. Med.
Virol. 48:60-67. It is predicted to occur approximately 46 aminoacids downstream of the first methionine residue of the ORF, while Linnen, J. et al. 1996. Science 27 1:505-508 have also identified an isolate in which the putative capsid protein contains 83 aminoacid residues.
The apparent main difference among these two isolates is one base deletion causing different initiation sites.
Thus, in comparison with other Flaviviridae, in which the lenght of the capsid protein varies from 114 aminoacids in dengue fever virus to 175 residues in HCV, the GBV-C
nucleocapsid appears too short to be functional.
The presence of core-like proteins in GBV-C has also been challenged by recent translation studies demonstrating that the site of translation initiation of the viral polyprotein is immediately upstream of the putative signal sequence for the El glycoprotein (Simons, J. N., et al. 1996. J. Virol. 70:6126-6135).
Although it was originally suggested that the apparent lack of a functional core protein could be ascribed to artefacts of the sequencing process, due to compressions caused by the high GC contents (teary et al.
supra), it is now clear that these sequences represent the true status of GBV-C in the numerous isolates that have been characterized (Simons et al. 1996, supra).
It has been proposed, therefore, that the NH,-terminal end of the GBV-C polyprotein is a truncated derivative of an ancestral capsid protein and that GBV-C
requires a helper virus to replicate (Heringlake, S . , H.
L. Tillmann, and M. P. Manns. 1996. J. Hepatol. 25:239-247). Alternatively, since a limited number of virus clones from each isolate has been examined so far, it is possible that, after infection, a large population of single base deletion mutants arise and these mutants would have the highest probability of being examined.
The authors report the characterization of a GBV-C
isolate, obtained from a HIV and HCV positive patient, in which translation can initiate at an AUG codon located 273 nucleotides upstream of the previously identified initiator site (Simons et al. 1996, supra), leading to a potentially functional nucleocapsid core protein of 107 aminoacids. This phenotype remained unvaried over the one year monitoring of the patient, suggesting therefore that stable replication competent GBV-C isolates may exist in nature.
The authors of the instant invention have also identified NS3 peptides of nonAnonE associated virus and has set up an assay to detect antibodies able to recognize epitopes comprised in said peptides. Said antibodies are detected in chronic and acute hepatitis affected subjects, classified as nonA-nonE for the lacking ef known virus related markers, in a high percentage of studied subjects (around 300). Surprisingly the assay is able to detect said antibodies also in subjects which are affected by other pathologies, as hepatitis A, hepatitis B, hepatitis C, hepatitis Delta, ~.I~J~, ~j~~i" v~~,,-dStr:..~.L~e:-~ S.", ~"'~~..~'y dU.~CI:TL~,'_:i.~
i.'_~S~dSeS
(LES, s'~NA+) r Show~rig 3 ~''iatwCrn~:'z1p ~r=G:_ v:E''.' ;;~~
The assay is per formed preferentia? 1;: t~~ react' ng a treated bzolcgzca~. samp~.e, as serum, ~i.t=~. Je~.ected peptides to ge4 an eiFectve and d'agnos=is method.
Antigen~anti.bodyes complexes are revealed in an effective ar.d reprcducir' a way by means of one of assays ~.-~cwr to the expert .in the ficid, as ELIcA, RIB:, EIA, i:nmunoblottin, atc.
1G y.~_~_ the foliow;;ny "~L~I vir~:s" means v' wa associated t4 non.~~or.E ~:epa=i t::s; "izu~zunoloc~.cail~r r~omolcgcus epi tope" means an ept =ope ::,at% irlg an a:~r.inca:.~.d sequence w~ir_h d'_~fers 'nom th~ reFe=r'inCj v'~?ltd~L~2 Cv~ ~:Tln or m:J~~?
y am~.ncac~.ds, but whic:~ i.s able to recc~nrze anc ~oi:.c. ~o antibodies for the ~e''errng epitone, with 5u:os;.ar~~-ial' y the sa~:.e binui::g a'fi :i ty.
It is a n object of the ~.;~~:er_ticr_ a ~oeptic'e ab'e ~to s: eCif::cally bir:C tc no:~..~r_onE hepat_tis visas a~~~~~oa,zas, having a =equer_ce cf a= least ~C,.
a~ inoaci s, said non.Anc.~.~ hepat. ti s virus 'oei ng characterized by hcs7~.Il~~ a trar.slatabie =~nc~i~.na_ core-;ikA ~rorswn 'onger then ~~ arninr_acid res~d~~;es a:~d by CC="lpriSl::~ ~.-~ d~.'Ll._"lG3Cid Sac~u,eIlCE'.~.~ ~~aS~
hczrc'_cgeus to the secrlence of SEQ I~ ;do.
The invention concerns nonA-nonE hepatitis virus (GBV hepatitis virus) having a translatable 5' core region, polynucleotides, peptides, antibodies, diagnostics and reagents thereof and methods for their use.
Simons J.N. et al. (Nature Medicine, 1, 564, l995) revealed a viral genome, Flavivirus-like, named GBV-C, in a small percentage of nonA-nonE hepatitis affected patient sera. The genome has some homology with genomes from two virus, GBV-A a GBV-B, identified in tamarins (Simons J.N. et al., Proc. Natl. Acad. Sci. USA, 92, 340l, I995). Said virus represent a phylogenetic separate group, not comprising hepatitis C virus (HCV).
PCT patent application N. W095/21922, in the name of Abbott Lab., shows a partial nucleotide sequence of a GHV-C isolate. A method to reveal antibodies against GBV
is claimed (claim 44, p. 6l8); however no enabling disclosure for such method able to detect with a statistic significative percentage positive samples from nonA-nonE patients is provided. PCT patent application N.
W095/21922 does not disclose aminoacid sequences to be effectively used to design diagnostic peptides. As a matter of fact, Figure 39 shows a panel of proteins, having an around 30d aminoacid minimal lenght. Only 2 out 512 sera from drug-addicted subjects result to be positive for these proteins in an ELISA assay. Moreover at p. 168, table 23, ELISA assays on nonA-nonE patients result to be positive in very few samples, thus rendering the assay not workable: 0/89 Japan subjects; 0/67 Greece subjects; 6/72 USA (set M) subjects; 1/64 USA (set T) subjects; 3/62 USA (set 1/3) subjects; 18/32 Egypt subjects. At p. 169, table 24, only 21 out 303 nonA-nonE
patients are positive with an ELISA assay using peptides sequence. It is evident that said assay may not be used neither for a screening of healthy subjects, nor for diagnosis.
PCT patent application N. W095/32291, in the name of Genelabs, identifies by ~epitope mapping different antigenic regions of the GBV polyprotein. However no peptides or a combination thereof for a diagnostic assay are disclosed.
EP patent application N. 73660l, in the name of Abbott Lab., discloses 88 nucleotides upstream of the sequence shown in the PCT patent application N.
W095/21922. However no evidence is provided for an open reading frame in this region able to code for a core-like protein.
The nucleocapsid/El junction of the viral isolate was identified by Leary, T. P., et al. 1996. J. Med.
Virol. 48:60-67. It is predicted to occur approximately 46 aminoacids downstream of the first methionine residue of the ORF, while Linnen, J. et al. 1996. Science 27 1:505-508 have also identified an isolate in which the putative capsid protein contains 83 aminoacid residues.
The apparent main difference among these two isolates is one base deletion causing different initiation sites.
Thus, in comparison with other Flaviviridae, in which the lenght of the capsid protein varies from 114 aminoacids in dengue fever virus to 175 residues in HCV, the GBV-C
nucleocapsid appears too short to be functional.
The presence of core-like proteins in GBV-C has also been challenged by recent translation studies demonstrating that the site of translation initiation of the viral polyprotein is immediately upstream of the putative signal sequence for the El glycoprotein (Simons, J. N., et al. 1996. J. Virol. 70:6126-6135).
Although it was originally suggested that the apparent lack of a functional core protein could be ascribed to artefacts of the sequencing process, due to compressions caused by the high GC contents (teary et al.
supra), it is now clear that these sequences represent the true status of GBV-C in the numerous isolates that have been characterized (Simons et al. 1996, supra).
It has been proposed, therefore, that the NH,-terminal end of the GBV-C polyprotein is a truncated derivative of an ancestral capsid protein and that GBV-C
requires a helper virus to replicate (Heringlake, S . , H.
L. Tillmann, and M. P. Manns. 1996. J. Hepatol. 25:239-247). Alternatively, since a limited number of virus clones from each isolate has been examined so far, it is possible that, after infection, a large population of single base deletion mutants arise and these mutants would have the highest probability of being examined.
The authors report the characterization of a GBV-C
isolate, obtained from a HIV and HCV positive patient, in which translation can initiate at an AUG codon located 273 nucleotides upstream of the previously identified initiator site (Simons et al. 1996, supra), leading to a potentially functional nucleocapsid core protein of 107 aminoacids. This phenotype remained unvaried over the one year monitoring of the patient, suggesting therefore that stable replication competent GBV-C isolates may exist in nature.
The authors of the instant invention have also identified NS3 peptides of nonAnonE associated virus and has set up an assay to detect antibodies able to recognize epitopes comprised in said peptides. Said antibodies are detected in chronic and acute hepatitis affected subjects, classified as nonA-nonE for the lacking ef known virus related markers, in a high percentage of studied subjects (around 300). Surprisingly the assay is able to detect said antibodies also in subjects which are affected by other pathologies, as hepatitis A, hepatitis B, hepatitis C, hepatitis Delta, ~.I~J~, ~j~~i" v~~,,-dStr:..~.L~e:-~ S.", ~"'~~..~'y dU.~CI:TL~,'_:i.~
i.'_~S~dSeS
(LES, s'~NA+) r Show~rig 3 ~''iatwCrn~:'z1p ~r=G:_ v:E''.' ;;~~
The assay is per formed preferentia? 1;: t~~ react' ng a treated bzolcgzca~. samp~.e, as serum, ~i.t=~. Je~.ected peptides to ge4 an eiFectve and d'agnos=is method.
Antigen~anti.bodyes complexes are revealed in an effective ar.d reprcducir' a way by means of one of assays ~.-~cwr to the expert .in the ficid, as ELIcA, RIB:, EIA, i:nmunoblottin, atc.
1G y.~_~_ the foliow;;ny "~L~I vir~:s" means v' wa associated t4 non.~~or.E ~:epa=i t::s; "izu~zunoloc~.cail~r r~omolcgcus epi tope" means an ept =ope ::,at% irlg an a:~r.inca:.~.d sequence w~ir_h d'_~fers 'nom th~ reFe=r'inCj v'~?ltd~L~2 Cv~ ~:Tln or m:J~~?
y am~.ncac~.ds, but whic:~ i.s able to recc~nrze anc ~oi:.c. ~o antibodies for the ~e''errng epitone, with 5u:os;.ar~~-ial' y the sa~:.e binui::g a'fi :i ty.
It is a n object of the ~.;~~:er_ticr_ a ~oeptic'e ab'e ~to s: eCif::cally bir:C tc no:~..~r_onE hepat_tis visas a~~~~~oa,zas, having a =equer_ce cf a= least ~C,.
a~ inoaci s, said non.Anc.~.~ hepat. ti s virus 'oei ng characterized by hcs7~.Il~~ a trar.slatabie =~nc~i~.na_ core-;ikA ~rorswn 'onger then ~~ arninr_acid res~d~~;es a:~d by CC="lpriSl::~ ~.-~ d~.'Ll._"lG3Cid Sac~u,eIlCE'.~.~ ~~aS~
hczrc'_cgeus to the secrlence of SEQ I~ ;do.
4:
25 '~SLrIPa.R'!PFULLTPT'IGFP,~PS, or an i.;unu:-:olcgically ho:noloc~ous variant cr the peptidA. Prefer aiy the peptide ~.as a 5equer~ce ccr~rrised in SEQ Iu Nc.4.
It is a Further object of tha i::vention a ccnposi~zor_ ccmpris~:~lQ at _east one o the peptides of the i nTrention. Prezerably said cor_p,csi ticn is adhered Jn a SO1 ~ a ptla$e .
A.ccarding tc a preferred. embcdi.-~,A-:t of the in~.Antion said c:ornpcsition comprises a'~ least tt~rc pepy~.~~ the irler.tion.
~.-'1"s ~ a.
AMEN~rp SH~.ET
r~ ~n" ~r,~. ,.: ~~ 7r, i ~Ln~ .~.; . ,nm _fty.. .,.~r_.'lrJ'', t7 ~,..~
_ . _. ~. , , _ . -.Z; r~. _ _ ~~ prr~-r nr~.
_ _..~.., ...,_.............,..~.~.._~.__ ... _...._....,a - ' ~ ~ : . _ ,.~
:j.. . I~: _ . ~ \ . ) ; ~
_ 5 _ F~azther object cf the invention is a diagnostic ki: for nonRror.~, hepatitis carnprising as specific reagent at least one peptide o~ Lha invention.
Preferably said kit comprises at least twe oei t~.des of t'.:e invention.
Furtrer object o~ ~.he i nventicr is a rr.ethod to detect ::cn.~, ron~ hepa=iris virus antibodies in a sa.~.-~ple c..,rnprisi:~g the following steps:
- inc~,:,ating in conditions allow':.nc t_:e for~!a~ion cf a.. artigen-aztibcdy co~:p~.e:c said sample with at least or.e peptide ;:f the inve:aion;
reveali-g saia cemp~.ex.
Tie izvsnti=n is new desl~r:.bed i n a non- i imi c'_ng .way by reference tc the following figures wherein:
r - riguro 1 renYeser_ts ~he ~r~.i:.eacic sect:enea o_ GDv N53 region dod of used peptides, with their dEnomination:
A~nrnlnr~l ~:~rrT
IJILI'1'Jv. ' ~~. ~ I _ ~mr ~~n~ "~I, ~:on~~ ,,.,;~_.~.,.: o .-m;i n , i '.. . .. . , ~ ' ~ i f i , ,n _ . ~ ~ . a I r"1 ~1 ~ . .
- . . I , . , '. I y. , , ~ ' I , , i . v a 4 W~ 97/39l29 PCT/IT96/00248 - figure 2 represents a bar graphic of p3.1-p3.9 peptide reactivity in ELISA assay with sera from subjects affected by different pathologies;
- figure 3a a 3b represent aligned aminoacid sequences of GBV core region; first three from top: from literature, others: isolated from the authors; "-" means an identity with the U36380 sequence; "_" means an identity with the U45966 sequence;
- figure 4a represents aligned aminoacid sequences of a segment of GBV core region, from literature: the first from the top from U45966 of Fig. 3a, others are isolated from the authors; at the bottom the consensus sequence is shown (SEQ ID No. 3);
- figure 4b represents a bar graphic of consensus peptide of Fig. 4a reactivity in ELISA assay with sera from subjects affected by different pathologies;
- figure 5 represents the nucleotide sequence alignment of 10 GBV-C isolates obtained from 10 distinct GBV-C positive patients, compared with the sequences of the published isolates deposited in GenBank [GBV-C
isolate (teary et al., supra), PNF2161 isolate (Linnen et al. supra) named GBV1 in the figure, and R10291 isolate named GBV2 in the figure]. The sequences were analyzed with the CLUSTAL alignment program. The alignment includes the nucleotide sequence between AC1S and AS-CWP
primers (shadowed in grey), and comprises a partial 5'UTR, the putative core region and a partial E1 region (corresponding to nt I12-733 of GBV-C, nt I29-752 of GHV1 and nt 56-680 of GBV2). The nucleotides are numbered only for isolate 35 (identified as the isolate with the longer ORF) and nucleotide +1 identifies~the nucleotide A of its first putative start codon in frame with the long ORF.
Note that in the other isolates nucleotide insertions or deletions are present. Gaps, introduced in the sequences to preserve alignment, are represented by dashes. The sequences between nucleotides 316 and 446, that are conserved, are not shown for reason of space. The nucleotides conserved among a11 the sequences are pointed out by an asterisk. The ATG codons in frame with the long ORF of each isolate are boxed;
- figure 6 represents the aminoacid sequence alignment of the isolates described in Fig. 5 and deduced from their nucleotide sequences. The aminoacid sequence of isolate 35 is identified as 35/1, while the sequences deduced from the two additional isolates derived from distinct serum samples collected during the follow-up of this patient are pointed out as 35/2 and 3S/3. The sequences were analyzed with the CLUSTAL alignment program. The sequence of the longer published ORF (GBV2) is considered as the consensus sequence. Dashes denote identity with the consensus sequence, whereas individual variations are shown by the single aminoacid code. The methionines in-frame with the long ORF are boxed and pointed out with an arrow for each isolate. The areas shadowed in grey mean the regions of high variability detected in our isolates;
- figure 7 represents a SDS-gel pattern of [3~S]cysteine-labelled in vitro translation products of the constructs, containing the AC1S/AS-CWP amplified region from the pointed out isolates. Positions of molecular weight standards (in kilodaltons) are pointed out. The last lane contains the negative control (no RNA). The IVTT of pCR35/AC1S construct was performed with and without capped mRNA;
- figure 8 represents the construction of the deletion mutants of pCR35/AC1S, pCRl9/AC1S, pCR3064/AC1S
and pCR2095/AC1S vectors. In the figure, the strategy for the construction of the pCR35/AC1S derivatives is described. The location of the T7 promoter (in black), the Bam H1 cleavage site and the position of the primers (AC1S and AS-CWP) used for the construction of the parent plasmid are shown. The numeration of the nucleotides is relative to the first putative start codon of the sequence of isolate 35. The partial 5' non coding region is white, the partial ORF is shadowed in grey and the position of AUG codons are pointed out with the notation Met. The positions of the sense primers (primers 1, 2, 3, 4 and 5) used with the antisense AS-CWP primer for the construction of the deletion mutants are also shown. The same strategy was followed for obtaining the pCRl9/AC1S, pCR3064/AC1S and pCR2095/AC1S derivatives. The length of the partial 5'UTR, of the partial ORF and the position of the AUG codons of these isolates are shown in Fig. 5 and 6.
- figure 9 represents a SDS-gel pattern of ~35S~cysteine-labelled in vitro translation products gLenerated by the O2, 43, 44 and 05 deletion mutants of pCR35/AC1 (A), pCRl9/ACIS (B) and pCR2095/AC1S (C) plasmids, obtained as described in Fig. 8. Positions of molecular weight markers (in kilodaltons) are pointed out. In panel (A)-the products of capped and uncapped mRNAs derived from the pCR35/AC1S undeleted plasmid are shown. A11 the other mRNAs used in these experiments were capped transcripts. In the same panel the first lane represents the product of translation of an unrelated mRNA (UNR). In panel (B) the first lane represents the product of translation of the pCR35/~, mutant, that was used as control for the size of the product of isolate 19. In panel (C) the first lane contains the negative control (no RNA) .
Analysis of GBV protein sequences Protein sequences have been computer analyzed by means of ANTIGEN program 6.85 (PCGene). ANTIGEN utilizes Hopp and Wood algorithm to predict protein potentially hydrophilic regions.
Peptide synthesis Peptides have been synthesized with an automated synthesizer MilliGen 9050 (Septrim). A11 of aminoacids were ~noc a-amino protected and pre-activated at the carboxy terminus pentafluorophenyl esters; only serine and threonine were activated~as benzotriazine esters.
A11 of coupling reactions were performed with HOBT
0.33M/DMF; washings were made with DMF and the a-amino group was released with 20o piperidine in DMF. Peptides were released with from resin by treatment with a mixture of TFA and scavengers which differ according to the composition.
Cases sera from 52 healthy donors controlled to be HBV-, HCV-and HIV-;
sera from 27 acute nonAnonE hepatitis patients;
sera from 16 chronic nonAnonE hepatitis patients;
sera from 61 acute hepatitis B patients;
sera from 14 chronic hepatitis B patients;
sera from 15 acute hepatitis C patients;
sera from 15 chronic hepatitis C patients;
sera from 29 drug addicted HIV-, HCV+ patients;
sera from 31 HIV+ patients;
sera from 33 HIV+ piastrinopenic patients;
sera from 30 AIDS patients;
sera from 39 LES patients.
ELISA assay The solid phase was prepared onto polivinylchloride dishes (N unc), by using a peptide or a mixture thereof.
Peptides were dissolved separately in a mixture of 500 CH3CN/H20 and diluted to a single final concentration of 10 ug/ml (total peptide concentration of 20 ug/ml) in carbonate buffer 50 mM, pH 9.6. 200 ul were added to each well and incubated for 2 hrs at 37C. When a single peptide was used, the final concentration was 20 ~g/ml.
Following to peptide adsorption, solid phases were saturated for 1 hr at 37~C with 300 ul/well of a BSA 2' solution in Tris-HC1 0.1M pH 7.5.
200 ul/well of 1:21 diluted sera in a solution of 5 PBS containing 0.3 g/1 EDTA,. 2 g/1 BSA, 2 ml/1 Triton x100 and 2 ml/1 Tween 20 were incubated 2 hrs at 37~C.
Following to 4 washes with PBS/Tween 20 0.1;, dishes were treated with 100 ul/well of horseradish peroxidase conjugated anti human Ig (Amersham). 1:30.000 diluted in 10 a PBS solution containing 0.04o Tween 20 and 20o non immune goat serum.
Following to 4 washes with PBS/Tween 20 O.lo, l00 ul/well of chromogen/substrate were added (Sorin Biomedica Diagnostics, DEIA KIT). The colorimetric reaction was stopped after 30 min. by adding 200 ul/well of sulphuric acid 1N. The optical density was read at 450 nm by referring to 630 nm.
CORE REGION
Reverse transcription (RT)-PCR
Analysis was performed with serum samples from 10 distinct subjects resulted positive for the presence of GBV-C RNA with the RT-PCR and hybridization assay as previously described (Fiordalisi, G., et al. J. Infect.
Dis. 174:l81-183). Four of them (patients identified with the codes 35, 19, 47 and MG) were HIV+/HCV+ patients with chronic liver disease; DM and 1159 were non A-E acute hepatitis patients; BZ was a patient with hepatocellular carcinoma; 3Q64 was a patient with fulminant non A-E
hepatitis; patient 2095 was an HCV+ subject with chronic hepatitis; and finally DS39 was a healthy blood donor.
Total RNA was extracted from 100 ul of serum samples of these subjects with TriPure reagent (Boehringer Mannheim Germany) and subjected to RT with random hexanucleotides using Superscript II (Life Tecnologies, Gaithersbul-g, MD). The obtained cDNA was used as template for PCR
reaction with degenerated primers, designed in our laboratory and derived from the sequences of GBVC and GBV
isolates deposited in Genbank database (Leary et al., Linnen et al. supra). The sequences of these primers, located in the 5'-UTR and putative structural regions, were:
AC1S sense primer:
5'AGG GTT SGW WGG TSG TAA ATC C 3' (SEQ ID No. S) AS-CWP antisense primer:
5'CGC CTG RTA NAR NGG CCA RCA 3' (SEQ ID No.6) wherein S = C or G; W = A or T; R = A or G; N = A or C or G or T.
Each of the 45 cycles of PCR included 1 min at 94~C, 1 min at 52~C and 1 min at 72~C, with an elongation time of 10 min in the last cycle. The sequence of the PCR
product obtained with these primers corresponds to nt 129 to 752 of GenBank accession no. U44402, the GBV-C isolate with the longest core-like protein reported to date (Linnen et al. supra).
Cloning and sequencing The amplification products obtained with AC1S and ASCWP primers were inserted into pCR2.1 vector (Invitrogen, San Diego, CA) and the sequences of the cloned inserts were obtained using the A.L.F. DNA
sequencer (Pharmacia Biotech, Uppsala, Sweden). The sequenced plasmids were named pCR X/AC1S, where X
corresponded to the isolate derived from the 10 patients considered in this study. Nucleotide and aminoacid sequences were analyzed with the CLUSTAL alignment program included in PC/GENE software package (Intelligenetics Inc., Mountain Vlow, CA).
Construction of deletion mutants' Mutants with deletions of different portions of the 5' region of pCR35/AC1S, pCRl9/AC1S, pCR3064/AC1S and pCR2095/AC1S constructs were derived from these plasmids by PCR amplification using synthetic primers. The antisense primer was the AS-CWP primer described above, while the sense primers, designed in our laboratory and derived from the sequences of our GBV-C isolates, mapped just upstream of the first 5 putative start codons on the basis of the sequence of isolate 35 (Fig. 5 and 6), as 5 described in Fig. 8.
Primer 1: nt -43/-27 5'ACGTAAGCTT A GGT GTW GGY CCT ACC G 3' (SEQ ID. No.7) Primer 2: nt +48/+64:
5'ACGTAAGCTT G TAC GGY CCA CGT CGC C 3' (SEQ ID. No.8);
Primer 3: nt +99/+117:
5'ACGTAAGCTT G CGA GTT GRC AAG GAC CAG 3'(SEQ ID. No.9);
Primer 4: nt +167/+Z83:
5'ACGTAAGCTT GR RVC GGG AAA TGC ATG 3'(SEQ ID. No.lO);
Primer 5: nt +239/+255:
15 5'ACGTAACCTT TG AGG GCG GGT GGC ATT 3'(SEQ ID. No.l1), wherein underlined nucleotides correspond to specific sequences of GBV-C, W = A or T; Y = C or T; R = A or G; V
- C or A or G. The position of the primers corresponded to the nucleotide numbers obtained considering as +1 the 20 nucleotide A of the first ATG in the ORF of isolate 35 (see Fig. 5) . The products of PCR with 1, 2, 3, 4 and 5 sense primers and AS-CWP antisense primer were cloned in pCR2.1 and named respectively pCRX/01, pCRX/02, pCRX/~3, pCRX/~4 and pCRX/05, where X corresponded to the 25 identification code of the isolate (35, 19, 3064 and 2095y .
In vitro transcription and translation (IVTT) Plasmids were linearized by digestion with BamHl and transcribed in vitro with T7 RNA polymerase (mCAP mRNA
30 Capping kit, Stratagene, La Jolla, CA) according to the manufacturer's instructions. In most cases, a m'GpppG cap structure was incorporated at the 5' end of the RNA
transcripts. In vitro translation reactions were performed with a rabbit reticulocyte lysate (RRL) system 35 (Amersham, Buckinghamshire, England), according to the manufacturer's instructions. Reactions (25 ul) contained 20 U of rRNasin (Promega, Madison,WI), 20 uCi of ['SS]
cysteine (l.000 Ci/mmol, Amersham, England) and 1.5 ~g of in vitro transcribed RNA. After incubation at 30~C for 1 hr, 2 ul aliquots were denatured for 5 min at 100~C in an equal volume of 2 X sodium dodecyl sulphate polyacrylamide gel electrophoresis (SDS-PAGE) loading buffer and analyzed on 18o SDS-polyacrylamide gels.
Protein gels were dried and exposed to X-ray film.
IO RESULTS
The analysed peptides are comprised in the NS3 region, from aa. 1346 to aa. 1550 (SEQ TD No.l), and are shown in Figure 1.
All of peptides were adsorbed to the solid phase at different concentrations and confronted with a panel of sera from nonAnonE affected patients and healthy donors.
ELISA assays were performed with different concentrations and adsorption conditions. Results show that peptides of Table 1 react with sera with different binding affinities. The reactivity may be increased by previous adsorption to the solid phase of a mixture of different peptides.
The G20T peptide has a decreased activity given to a cysteine dimer at position 1467. The cysteine to serine (G20T(C->S)) substitution solves this stability problem.
In order to get good reactivities solid phases should be prepared either with a mixture of l0ug/ml S16V
and 10 ug/ml G20T(C->S) or with E21A peptide at 20 ug/ml.
To determine the cut-off reactivities of 200 sera from healthy donors were tested and the following formula was applied:
OD cut-off = average OD healthy donors + 8 s.d. (standard deviation) The value was for both of solid phases of 0.450 ODqsonm. A11 of samples with an OD>0.450 were considered to be positive. Results from sera screening were reported in Table 1.
Table 1 S16V+G20T(C-S) E21A
n. patientsa patients patients+ + + +
DONORS
HEP. acute 27 8 32 7 28 nAnE chronic 16 8 32 7 28 B acute 61 30 46, 17 26, 6 HEP. B 14 1 3,1 0 0 chronic C acute 15 B 19,5 3 7,3 C 15 9 22,5 5 12,5 chronic TD HIV-,HCV+ 29 17 32,7 19 36,5 HIV+ 31 HIV+ 33 piastrinopenic I,ES 3 9 Results obtained with p3.1-p3.9 peptides of Figure 1 are shown in Figure 2.
Data show trhat some of analyzed peptides are able to specifically detect antibodies against epitopes of NS3 region of GBV virus.
CORE REGION
Identification of specific epitopes of "core" region Aminoacid sequences (as in figure 3a a 3b) show methionines which may be aligned with either the first or the second methionine already described. The 35HIV clone sequencing revealed a new aminoacid sequence, at the N
terminus region, starting with a methionine residue.
By clone sequencing aminoacid sequences of GBV virus N-terminus were revealed. The "consensus" sequence is shown in Figure 4a.
Figure 4b shows the reactivity obtained by assaying different sera with the peptide having the consensus sequence of Fig. 4a.
Peptide synthesis 5 Peptides were synthesized with an automated synthesizer MilliGen 9050 (Septrim) as shown for peptide for NS3 region.
Cases sera from 20 healthy donors;
10 sera from 23 acute nonAnonE hepatitis patients;
sera from 10 chronic nonAnonE hepatitis patients;
sera from 22 acute hepatitis B patients;
sera from 33 drug addicted HIV-, HCV+ patients;
sera from 7 HIV+ patients;
15 sera from 17 HIV+ piastrinopenic patients;
sera from 47 Community Aquired HCV+ patients;
sera from 11 AIDS patients.
ELISA assay Solid phase was prepared on polivinylchloride dishes (Nunc) . l00 ul of a 20 ug/ml solution of G11C, D23C-VA, M37C-LVA, D13G-TG and M27C-CTG peptides, in carbonate buffer 50 mM, pH 9.6 were added to each well and incubated for 2 hrs at 37~C.
Further to peptide adsorbtion solid phases were saturated for 1 hr at 37~C with 300 ul/well of a BSA 20 solution in Tris-HC1 0.1M pH 7.5.
100 ul of 1:21 diluted serum in PBS containing 0.3 g/1 EDTA, 2 g/1 BSA, 2 m1/1 Triton x100 and 2 m1/1 Tween 20 were incubated into wells for 2 hrs at 37~C.
Following to 4 washes with PBS/Tween 20 0.1%, 100 ul/well of horseradish peroxidas'e conjugated goat anti human Ig (Sorin Biomedica), diluted l:100 in a solution of PBS containing 0.04% Tween 20 and 200 of preimmune goat serum. were added to each well.
Following to 4 washes of PBS/Tween 20 0 . 1 o BSA, 100 ul of chromogen and substrate were added to each well (Sorin DEIA KIT). The colorimetric reaction was blocked after 30 min with 200 ul/well of sulphuric acid 1N. The OD reading was performed at 450 nm by referrng to OD at 630 nm.
Results show the presence of antibodies against peptides of the "core" region of GBV virus.
Sequence varialbility of the 5' terminal region of different GBV-C isolates The GBV-C isolates so far analyzed have revealed putative truncated core proteins of about 83 and 46 amino acids in length (Linnen et al. supra, Simons et al. 1995 supra). These differences are due to nucleotide deletions or insertions that introduce in the sequences a frameshift causing the generation of distinct in-frame starting codons. It is conceivable, therefore, that the intrinsic genomic variability of this virus may introduce, in some particular isolates, additional mutations, insertions or deletions that could generate potentially functional core proteins.
In order to evaluate this possibility, we analyzed the nucleotide sequences, and their deduced translation products. On the basis of this analysis, we found 4 different types of sequences. The sequences of isolates 19, DM, DS39, BZ, 1l59 and 47 encoded for a putative core protein of approximately 83 aminoacids in length, corresponding to the published sequence of R10291 isolate (GenBank accession no. U45966) (Linnen et al. supra) (GBV2 in Fig. 5 and 6). The most 5' and unique inframe AUG codon of isolates 3064 and MG, on the other hand, could initiate the translation of a putative core protein of 46 aminoacids in length similar to the one potentially encoded by the published sequences of GBV-C isolate (GenBank accession no. U36380) (teary et aI. supra) and PNF2161 isolate (Linnen et al. supra) (GenBank accession no. U44402) (GBV-C and GBV1 in the Fig. 5 and 6). The sequence of isolate 2095 was more peculiar since, due to one base deletion (Fig. 5), it contained only one potential in-frame initiator AUG codon, downstream of the putative start codon of GBV-C and HGVI and located 9 nucleotides only upstream of the sequence that specifies the putative E1 signal sequence (Leary ET al. supra). The mostly unexpected finding, however, was that the sequence derived from the isolate 35 (35/1 in Fig. 5 and 6) could potentially encode for a putative core protein of approximately 106 aminoacids in length. This particular feature was due to a point mutation that introduces an additional in-frame AUG codon 69 nucleotides upstream of the putative initiation codon of the HGV 2 reference sequence.
In order to exclude the possibility of errors (intrinsic to the polymerase chain reactions) or of sequence misinterpretations (possibly due to the frequent sequence compressions), we confirmed our results by amplifying and sequencing several clones of each isolate (data not shown). Moreover, in the case of patient 35, we cloned and sequenced RT-PCR products derived from 2 additional sequential serum samples collected during the one year follow-up of this patient (sequences 35/2 and 35/3 in Fig. 6). While this analysis confirmed the presence of the upstream AUG codon in a11 the samples of patient 35, it also established that this particular isolate was stable over the time and that it was not dominantly replaced by a population of mutants., Thus, our data provide the first evidence for the existence of additional GBV-C sequences, potentially leading to a core protein longer and not described in the previously cDNA clones derived from the numerous viremic patients analyzed so far (Leary et al., Simons et al.
1995, 1996, supra) .
Finally, our results establish that the putative core proteins of our isolates are characterized by discrete regions of high variability, that are separated by highly conserved motifs (Fig. 6).
Translation of isolates with distinct putative initiator codons 5 The deduced translation products derived from the nucleotide sequences of our isolates demonstrate that these GBV-C sequences are very heterogeneous with respect to the potential translation initiation site. Thus, in order to understand the potential replicative competence 10 of these viral isolates, it was important to assess whether initiation at any of these sites can indeed occurr. This issue is particularly relevant since, recently, Simons et al. (1996 supra) have provided evidence suggesting that, in their GBV-C isolates, 15 initiation of translation occurred only immediately upstream of the putative El signal sequence and that the upstream in frame AUG seemed to be unable to direct the synthesis of GBV-C polyprotein.
To determine the effect of the upstream in frame AUG
20 codons on translation of our viral isolates, mRNAs with capped methylated (m'GpppG) 5' end, synthesized from linearized pCRX/AC1S constructs, were examined for mRNA
activity in a cell-free translation system prepared from rabbit reticulocytes. For this purpose we considered 25 isolates with distinct putative initiator codons.
Isolate 35 had 5 in-frame AUG codons in the putative core region (Fig. 5 and 6) and therefore the pCR35/AC1S
construct (covering the partial 5'UTR, the putative core and a portion of the E1 protein) can potentially mediate 30 the synthesis of different transcription products with expected sizes of about 17, 14,11, 10 or 7 kDa. Figure 7 shows that efficient translation occurred and a polypeptide with the apparent size of 17 kDa, compatible with the activity of the first in-frame AUG, was observed 35 in a SDS-PAGE analysis. It is important to note that efficient translation was also obtained with IVTT
reactions programmed with uncapped RNA derived from pCR35/AC1S. Thus, these data strongly suggest that, at least in this construct, the first available in-frame AUG
from the 5' end of the RNA can efficiently act as the initiator codon. Furthermore, since the cap stmcture was not determinant for efficient translation, the data also suggest that, as recently proposed by Simons et al.
(Simons et al. 1996, supra), GBV-C RNA may not utilize the classical ribosome scanning mechanism.
IO IVTT reaction mixtures were also programmed with pCRDS39/AC1S, pCRDM/AC1S, pCRl9/AC1S, pCR3064/AC1S, pCRMG/AC1S and pCR2095/AC1S constructs.
Isolates DS39, DM and 19 had the first in-frame AUG
corresponding to the second in-frame AUG of isolate 35 (Fig. 5 and 6). Moreover, isolate DS39 had also two additional putative initiatior codons so that the pCRDS39/AC1S construct could potentially direct the synthesis of three possible proteins of 14, 10 and 7 kDa.
The pCRDM/ACIS construct, on the other hand, can potentially initiate translation at 4 different AUG
codons, producing polypeptides of I4, II, 10 and 7 kDa.
Two of these plasmids, pCRDS39/AC1S and pCRl9/AC1S, directed the translation of products of about 14 kDa as detected by SDS-PAGE (Fig. 7). The size of these proteins is compatible with the activity of the AUG sequence just downstream of the start codon of the pCR35/AC1S
construct, suggesting that, also in these cases, translation was initiated at the first available in-frame AUG of the ORF. Since pCRl9/AC1S contained only one AUG
inframe with the long ORF, and since on the basis of the nucleotide sequence we could also exclude the possibility that the expressed proteins represented unrelated products of translation directed by AUG codon located in the other two frames, we conclude that the observed band is the correct product of translation.
In contrast, no bands were detected in the reactions programmed with the pCRDM/AC1S plasmid. Since this construct contained the same potential in-frame initiator AUG codon than pCR39/AC1S and pCRl9/AC1S, the results 5 rise the possibility that Sequences upstream of the potential initiator codon may influence the translation either positively or negatively. This possibility is also reinforced by the fact that the plasmids derived from isolates 3064 and MG (pCR3064/AC1S and pCRMG/AC1S), that 10 could potentially direct the synthesis of a band of 10 kDa, did not produce detectable quantities of specific proteins, in accordance to the data of Simons et al.
(Simons et al. 1996 supra). Negative results were also obtained with the pCR2095/AC1S construct, derived from 15 isolate 2095, that could potentially produce a protein of about 7 kDa.
Analysis of deletion mutants To examine in details the activity of the different potential in-frame initiator AUG codons, to verify the 20 possibility of a regulatory role of the upstream sequences and to unequivocally establish the size of the obtained translation products, cell-free protein synthesis experiments were performed with capped mRNA
transcribed by truncated forms of the 35, 19, 3064 and 2095 constructs described in Fig. 8. Figure 9A shows the results obtained when IVTT reaction mixtures were programmed with the 35 deletion mutant plasmids. In these experiments, translation of the mRNA derived from pCR35/AC1S yielded the major precursor band of 17 kDa as well as 3 additional faint bands of 14, 11 and 7 kDa, probably corresponding to translation products that utilize the second, third and the fifth in-frame AUG as initiator codons. A very similar pattern was obtained with the pCR35/~1 construct. In vitro translation of pCR35/02, pCR35/03, pCR35/~4 and pCR35/05 transcripts yielded bands of progressively lower molecular weights of about 14, 11 and 7 kDa, implying therefore that, in this isolate, a11 the in-frame AUG codons, with the exception of the fourth AUG, might serve as potential site of translation initiation. In the lane containing the translation products of pCR35/02 mutant we found two bands of 14 and 11 kDa corresponding to translation products that utilized both the second and the third available AUG initiation codons.
As expected, the analysis of the deletion mutants of pCRl9/AC1S, that contains only one in-frame AUG, gave different results (Fig. 9B). The 14 kDa bands obtained with reactions programmed with pCRl9/AC1S, pCRl9/~1 and pCRl9/~2 demonstrate that the first and unique in-frame AUG is the site of translation initiation and that., at least for this mutant, deletion of 5' UTR does not seem to significantly influence the level of translation.
A quite unexpected finding of these experiments was that the reactions programmed with some of the deletion mutants of pCRl9/AC1S construct, that lacked the AUG
codon immediately upstream of the putative E1 signal sequence, yielded a band with the same apparent molecular mass of the 7 kDa protein detected in the reactions programmed with pCR35/44 and pCR35/~5. Although we can not exclude that this band may have resulted from partially degraded RNA, it is interesting that in this template the AUG triplet correspcnding to the fifth in-frame start codon of isolate 35 is replaced by an ACG
codon, that has been found to act as initiation codon in other mRNAs (Boeck, R., and D. Kolakofsky. l994. EMBO J.
13:3608-3617), including a transcript derived from a GBV-C mutated sequence (Simons et al. 1996 supra).
Reactions programmed with the deletion mutants derived from pCR3064/AC1S construct provided no detectable bands (data not shown). Since in this constructs the only available initiation codon is the AUG
corrisponding to the fourth start codon of the isolate 35, these results confirmed the data obtained with the mutants of isolate 35 and with the pCR3064/AC1S and pCRMG/AC1S plasmids, suggesting that this particular codon, as already reported (Simons et al. 1996 supra), is silent, probably due to the composition of the flanking sequences.
Finally, evidence for a regulatory role of the partial 5'UTR were obtained with the deleted 2095 plasmids (Fig. 9C). In this isolate the only available site of transcription initiation corresponds to the fifth AUG of isolate 35, that is however silent in construct pCR2095/AC1S. The expected 7 kDa product was surprisingly detected as a faint band with pCR2095/01, pCR2095/02 and pCR2095/~3 and the intensity of this band increased progressively by programming the reactions with pCR2095/04 and pCR2095/05. These data clearly mean that, in this particular template, the translation of the polypeptide is strongly influenced by the partial 5'UTR
present in the undeleted construct that, in this case, seems to inhibit the translation process.
Finally, we must stress that we have verified that none of the bands obtained with the deletion mutants can be accounted for by unrelated products of translation encoded by the other two possible reading frames.
o . ~ CA 02251745 1999-04-27 - 2 3 -~ ;
SEQUENCE LISTING
(1) GENERAL INFORMATION:
(i) APPLICANT: WABCO BV
(ii) TITLE OF INVENTION: Non-A non-E Hepatitis virus having a translatable core region, reagents and methods for their use (iii) NUMBER OF SEQUENCES: 11 (iv) CORRESPONDENCE ADDRESS:
(A) ADDRESSEE: Dennison Associates (B) STREET: 133 Richmond Street West, Suite 301 (C) CITY: Toronto (D) PROVINCE: Ontario (E) COUNTRY: Canada (F) POSTAL CODE: M5H 2L7 (v) COMPUTER READABLE FORM:
(A) MEDIUM TYPE: Floppy disk (B) COMPUTER: IBM PC compatible (C) OPERATING SYSTEM: PC-DOS/MS-DOS
(D) SOFTWARE: PatentIn Release #1.0, Version #1.30(EPO) (vi) CURRENT APPLICATION DATA:
(A) APPLICATION NUMBER: 2,251,745 (B) FILING DATE: 09-DEC-1996 (C) CLASSIFICATION:
(vii) ATTORNEY/AGENT INFORMATION:
(A) NAME: Dennison Associates (B) REFERENCE/DOCKET NUMBER: JJ-10288CA
(viii) TELECOMMUNICATION INFORMATION:
(A) TELEPHONE: 416-368-8313 (B) TELEFAX: 416-368-1645 (2) INFORMATION
FOR
SEQ
ID
NO:
1:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 205 amino acids (B) TYPE: amino acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE: peptide (xi) SEQUENCE DESCRIPTION: SEQ ID N0: 1:
Arg Val Arg Asp Val Ala Arg Gly Cys Gly Val Gln Leu Leu Tyr Val Ala Thr Ala Thr Pro Pro Gly Ser Pro Met Thr Gln His Ser Ile Pro Ile Glu Thr Lys Leu Asp Val Gly Glu I1e Pro Phe Tyr His Gly Gly Ile Pro Leu Glu Arg Met Arg Thr Gly Arg His Leu Val Phe Cys His Ser Lys Ala Glu Cys Glu Arg Leu Ala Gly Gln Phe Ser Ala Arg Gly Val Asn Ala Ile Ala Tyr Tyr Arg Gly Lys Asp Ser Ser Ile Ile Lys Asp Gly Asp Leu Val AlaThrAsp AlaLeu SerThrGly Tyr Val Cys Thr Gly Asn Phe Ser ThrAspCys GlyLeu ValValGlu Glu Asp Val Val Val Glu Val Leu ProThrIle ThrIle SerLeuArg Thr Thr Asp Val Pro Ala Ser Glu SerMetGln ArgArg GlyArgThr Gly Ala Leu Arg Gly Arg Ser Arg TyrTyrAla GlyVal GlyLysAla Pro Gly Tyr A1a Gly Val Val Ser ProValTrp SerAla ValGluAla Gly Arg Gly Val Thr Trp Tyr Met ProAspLeu ThrAla Asn Gly Glu (2) INFO RMATION FOR ID NO: :
3 (i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 7 aminoacids (B) TYPE: o acid amin (C) STRANDEDNESS: le sing (D) TOPOLOGY:linear (ii) MOLECULE TYPE:peptide (xi)SEQUENCE DESCRIPTION: 2:
SEQ ID NO:
Met Ser Leu Asn Arg Ala TyrPro ProGlyLeu ThrThr ProThr Arg Tyr Gly Pro Arg Arg Pro MetSer LeuLeuThr AsnArg PheAsn Ser Arg Arg Val Asp Lys Asp TrpGly ProGlyVal MetGly LysAsp Gln Pro Lys Pro Cys Pro Ser ArgThr GlyLysCys MetGly ProPro Arg Ser Ser Ala Ala Ala Cys ArgGly SerProArg IleLeu ArgVal Ser Arg Ala Gly Gly Ile Ser ProTyr ThrIleMet GluAla LeuLeu Leu Phe Leu Leu Gly Val Glu GlyAla IleLeu Ala (2) INFORMATION
FOR SEQ
ID N0:
3:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 84 amino acids (B) TYPE: amino acid (C) STRANDEDNESS:
single (D) TOPOLOGY: linear (ii)MOLECULE TYPE: peptide (xi)SEQUENCE DESCRIPTION: 3:
SEQ ID NO:
Met Ser Leu Leu Thr Asn PheAsn ArgArgVaI AspLys AspGln Arg Trp Gly Pro Gly Val Met LysAsp ProLysPro CysPro SerArg Gly Arg Thr Gly Lys Cys Met ProPro SerSerA1a AlaAla CysSer Gly Arg Gly Ser Pro Arg Ile Leu Arg Val Arg Ala Gly Gly Ile Ser Leu Pro Tyr Thr Ile Met Glu Ala Leu Leu Phe Leu Leu Gly Val Glu Ala Gly A1a Ile Leu (2) INFORMATION FOR SEQ ID N0: 4:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 23 amino acids 1 5 (B) TYPE: amino acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE: peptide (xi) SEQUENCE DESCRIPTION: SEQ ID N0: 4:
Met Ser Leu Asn Arg Ala Arg Tyr Pro Pro Gly Leu Thr Thr Pro Thr Tyr Gly Pro Arg Arg Pro Ser (2) INFORMATION FOR SEQ ID NO: 5:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 22 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE: other nucleic acid (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5:
(2) INFORMATION FOR SEQ ID NO:
6:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 21 base pairs (B) TYPE; nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE: other nucleicacid (xi) SEQUENCE DESCRIPTION: SEQ NO: 6:
ID
1 (2) INFORMATION FOR SEQ ID NO:
5 7:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 27 base pairs (B} TYPE: nucleic acid 2 (C) STRANDEDNESS; single (D) TOPOLOGY: linear (ii) MOLECULE TYPE: other nucleicacid 2 (xi) SEQUENCE DESCRIPTION: SEQ N0: 7:
(2) INFORMATION FOR SEQ ID NO:
8:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 27 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single 3 (D) TOPOLOGY: linear (ii) MOLECULE TYPE: other nucleicacid (xi) SEQUENCE DESCRIPTION: SEQ NO: 8:
ID
(2) INFORMATION FOR SEQ ID NO:
9:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 29 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE: other nucleic acid (xi) SEQUENCE DESCRIPTION: SEQ N0: 9:
ID
1 (2) INFORMATION FOR SEQ ID N0:
5 10:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 27 base pairs (B) TYPE: nucleic acid 2 (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE: other nucleic acid 2 (xi) SEQUENCE DESCRIPTION: SEQ NO: 10:
(2) INFORMATION FOR SEQ ID NO:
11:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 27 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single 3 (D) TOPOLOGY: linear (ii) MOLECULE TYPE: other nucleic acid (xi) SEQUENCE DESCRIPTION: SEQ NO: 11:
ID
25 '~SLrIPa.R'!PFULLTPT'IGFP,~PS, or an i.;unu:-:olcgically ho:noloc~ous variant cr the peptidA. Prefer aiy the peptide ~.as a 5equer~ce ccr~rrised in SEQ Iu Nc.4.
It is a Further object of tha i::vention a ccnposi~zor_ ccmpris~:~lQ at _east one o the peptides of the i nTrention. Prezerably said cor_p,csi ticn is adhered Jn a SO1 ~ a ptla$e .
A.ccarding tc a preferred. embcdi.-~,A-:t of the in~.Antion said c:ornpcsition comprises a'~ least tt~rc pepy~.~~ the irler.tion.
~.-'1"s ~ a.
AMEN~rp SH~.ET
r~ ~n" ~r,~. ,.: ~~ 7r, i ~Ln~ .~.; . ,nm _fty.. .,.~r_.'lrJ'', t7 ~,..~
_ . _. ~. , , _ . -.Z; r~. _ _ ~~ prr~-r nr~.
_ _..~.., ...,_.............,..~.~.._~.__ ... _...._....,a - ' ~ ~ : . _ ,.~
:j.. . I~: _ . ~ \ . ) ; ~
_ 5 _ F~azther object cf the invention is a diagnostic ki: for nonRror.~, hepatitis carnprising as specific reagent at least one peptide o~ Lha invention.
Preferably said kit comprises at least twe oei t~.des of t'.:e invention.
Furtrer object o~ ~.he i nventicr is a rr.ethod to detect ::cn.~, ron~ hepa=iris virus antibodies in a sa.~.-~ple c..,rnprisi:~g the following steps:
- inc~,:,ating in conditions allow':.nc t_:e for~!a~ion cf a.. artigen-aztibcdy co~:p~.e:c said sample with at least or.e peptide ;:f the inve:aion;
reveali-g saia cemp~.ex.
Tie izvsnti=n is new desl~r:.bed i n a non- i imi c'_ng .way by reference tc the following figures wherein:
r - riguro 1 renYeser_ts ~he ~r~.i:.eacic sect:enea o_ GDv N53 region dod of used peptides, with their dEnomination:
A~nrnlnr~l ~:~rrT
IJILI'1'Jv. ' ~~. ~ I _ ~mr ~~n~ "~I, ~:on~~ ,,.,;~_.~.,.: o .-m;i n , i '.. . .. . , ~ ' ~ i f i , ,n _ . ~ ~ . a I r"1 ~1 ~ . .
- . . I , . , '. I y. , , ~ ' I , , i . v a 4 W~ 97/39l29 PCT/IT96/00248 - figure 2 represents a bar graphic of p3.1-p3.9 peptide reactivity in ELISA assay with sera from subjects affected by different pathologies;
- figure 3a a 3b represent aligned aminoacid sequences of GBV core region; first three from top: from literature, others: isolated from the authors; "-" means an identity with the U36380 sequence; "_" means an identity with the U45966 sequence;
- figure 4a represents aligned aminoacid sequences of a segment of GBV core region, from literature: the first from the top from U45966 of Fig. 3a, others are isolated from the authors; at the bottom the consensus sequence is shown (SEQ ID No. 3);
- figure 4b represents a bar graphic of consensus peptide of Fig. 4a reactivity in ELISA assay with sera from subjects affected by different pathologies;
- figure 5 represents the nucleotide sequence alignment of 10 GBV-C isolates obtained from 10 distinct GBV-C positive patients, compared with the sequences of the published isolates deposited in GenBank [GBV-C
isolate (teary et al., supra), PNF2161 isolate (Linnen et al. supra) named GBV1 in the figure, and R10291 isolate named GBV2 in the figure]. The sequences were analyzed with the CLUSTAL alignment program. The alignment includes the nucleotide sequence between AC1S and AS-CWP
primers (shadowed in grey), and comprises a partial 5'UTR, the putative core region and a partial E1 region (corresponding to nt I12-733 of GBV-C, nt I29-752 of GHV1 and nt 56-680 of GBV2). The nucleotides are numbered only for isolate 35 (identified as the isolate with the longer ORF) and nucleotide +1 identifies~the nucleotide A of its first putative start codon in frame with the long ORF.
Note that in the other isolates nucleotide insertions or deletions are present. Gaps, introduced in the sequences to preserve alignment, are represented by dashes. The sequences between nucleotides 316 and 446, that are conserved, are not shown for reason of space. The nucleotides conserved among a11 the sequences are pointed out by an asterisk. The ATG codons in frame with the long ORF of each isolate are boxed;
- figure 6 represents the aminoacid sequence alignment of the isolates described in Fig. 5 and deduced from their nucleotide sequences. The aminoacid sequence of isolate 35 is identified as 35/1, while the sequences deduced from the two additional isolates derived from distinct serum samples collected during the follow-up of this patient are pointed out as 35/2 and 3S/3. The sequences were analyzed with the CLUSTAL alignment program. The sequence of the longer published ORF (GBV2) is considered as the consensus sequence. Dashes denote identity with the consensus sequence, whereas individual variations are shown by the single aminoacid code. The methionines in-frame with the long ORF are boxed and pointed out with an arrow for each isolate. The areas shadowed in grey mean the regions of high variability detected in our isolates;
- figure 7 represents a SDS-gel pattern of [3~S]cysteine-labelled in vitro translation products of the constructs, containing the AC1S/AS-CWP amplified region from the pointed out isolates. Positions of molecular weight standards (in kilodaltons) are pointed out. The last lane contains the negative control (no RNA). The IVTT of pCR35/AC1S construct was performed with and without capped mRNA;
- figure 8 represents the construction of the deletion mutants of pCR35/AC1S, pCRl9/AC1S, pCR3064/AC1S
and pCR2095/AC1S vectors. In the figure, the strategy for the construction of the pCR35/AC1S derivatives is described. The location of the T7 promoter (in black), the Bam H1 cleavage site and the position of the primers (AC1S and AS-CWP) used for the construction of the parent plasmid are shown. The numeration of the nucleotides is relative to the first putative start codon of the sequence of isolate 35. The partial 5' non coding region is white, the partial ORF is shadowed in grey and the position of AUG codons are pointed out with the notation Met. The positions of the sense primers (primers 1, 2, 3, 4 and 5) used with the antisense AS-CWP primer for the construction of the deletion mutants are also shown. The same strategy was followed for obtaining the pCRl9/AC1S, pCR3064/AC1S and pCR2095/AC1S derivatives. The length of the partial 5'UTR, of the partial ORF and the position of the AUG codons of these isolates are shown in Fig. 5 and 6.
- figure 9 represents a SDS-gel pattern of ~35S~cysteine-labelled in vitro translation products gLenerated by the O2, 43, 44 and 05 deletion mutants of pCR35/AC1 (A), pCRl9/ACIS (B) and pCR2095/AC1S (C) plasmids, obtained as described in Fig. 8. Positions of molecular weight markers (in kilodaltons) are pointed out. In panel (A)-the products of capped and uncapped mRNAs derived from the pCR35/AC1S undeleted plasmid are shown. A11 the other mRNAs used in these experiments were capped transcripts. In the same panel the first lane represents the product of translation of an unrelated mRNA (UNR). In panel (B) the first lane represents the product of translation of the pCR35/~, mutant, that was used as control for the size of the product of isolate 19. In panel (C) the first lane contains the negative control (no RNA) .
Analysis of GBV protein sequences Protein sequences have been computer analyzed by means of ANTIGEN program 6.85 (PCGene). ANTIGEN utilizes Hopp and Wood algorithm to predict protein potentially hydrophilic regions.
Peptide synthesis Peptides have been synthesized with an automated synthesizer MilliGen 9050 (Septrim). A11 of aminoacids were ~noc a-amino protected and pre-activated at the carboxy terminus pentafluorophenyl esters; only serine and threonine were activated~as benzotriazine esters.
A11 of coupling reactions were performed with HOBT
0.33M/DMF; washings were made with DMF and the a-amino group was released with 20o piperidine in DMF. Peptides were released with from resin by treatment with a mixture of TFA and scavengers which differ according to the composition.
Cases sera from 52 healthy donors controlled to be HBV-, HCV-and HIV-;
sera from 27 acute nonAnonE hepatitis patients;
sera from 16 chronic nonAnonE hepatitis patients;
sera from 61 acute hepatitis B patients;
sera from 14 chronic hepatitis B patients;
sera from 15 acute hepatitis C patients;
sera from 15 chronic hepatitis C patients;
sera from 29 drug addicted HIV-, HCV+ patients;
sera from 31 HIV+ patients;
sera from 33 HIV+ piastrinopenic patients;
sera from 30 AIDS patients;
sera from 39 LES patients.
ELISA assay The solid phase was prepared onto polivinylchloride dishes (N unc), by using a peptide or a mixture thereof.
Peptides were dissolved separately in a mixture of 500 CH3CN/H20 and diluted to a single final concentration of 10 ug/ml (total peptide concentration of 20 ug/ml) in carbonate buffer 50 mM, pH 9.6. 200 ul were added to each well and incubated for 2 hrs at 37C. When a single peptide was used, the final concentration was 20 ~g/ml.
Following to peptide adsorption, solid phases were saturated for 1 hr at 37~C with 300 ul/well of a BSA 2' solution in Tris-HC1 0.1M pH 7.5.
200 ul/well of 1:21 diluted sera in a solution of 5 PBS containing 0.3 g/1 EDTA,. 2 g/1 BSA, 2 ml/1 Triton x100 and 2 ml/1 Tween 20 were incubated 2 hrs at 37~C.
Following to 4 washes with PBS/Tween 20 0.1;, dishes were treated with 100 ul/well of horseradish peroxidase conjugated anti human Ig (Amersham). 1:30.000 diluted in 10 a PBS solution containing 0.04o Tween 20 and 20o non immune goat serum.
Following to 4 washes with PBS/Tween 20 O.lo, l00 ul/well of chromogen/substrate were added (Sorin Biomedica Diagnostics, DEIA KIT). The colorimetric reaction was stopped after 30 min. by adding 200 ul/well of sulphuric acid 1N. The optical density was read at 450 nm by referring to 630 nm.
CORE REGION
Reverse transcription (RT)-PCR
Analysis was performed with serum samples from 10 distinct subjects resulted positive for the presence of GBV-C RNA with the RT-PCR and hybridization assay as previously described (Fiordalisi, G., et al. J. Infect.
Dis. 174:l81-183). Four of them (patients identified with the codes 35, 19, 47 and MG) were HIV+/HCV+ patients with chronic liver disease; DM and 1159 were non A-E acute hepatitis patients; BZ was a patient with hepatocellular carcinoma; 3Q64 was a patient with fulminant non A-E
hepatitis; patient 2095 was an HCV+ subject with chronic hepatitis; and finally DS39 was a healthy blood donor.
Total RNA was extracted from 100 ul of serum samples of these subjects with TriPure reagent (Boehringer Mannheim Germany) and subjected to RT with random hexanucleotides using Superscript II (Life Tecnologies, Gaithersbul-g, MD). The obtained cDNA was used as template for PCR
reaction with degenerated primers, designed in our laboratory and derived from the sequences of GBVC and GBV
isolates deposited in Genbank database (Leary et al., Linnen et al. supra). The sequences of these primers, located in the 5'-UTR and putative structural regions, were:
AC1S sense primer:
5'AGG GTT SGW WGG TSG TAA ATC C 3' (SEQ ID No. S) AS-CWP antisense primer:
5'CGC CTG RTA NAR NGG CCA RCA 3' (SEQ ID No.6) wherein S = C or G; W = A or T; R = A or G; N = A or C or G or T.
Each of the 45 cycles of PCR included 1 min at 94~C, 1 min at 52~C and 1 min at 72~C, with an elongation time of 10 min in the last cycle. The sequence of the PCR
product obtained with these primers corresponds to nt 129 to 752 of GenBank accession no. U44402, the GBV-C isolate with the longest core-like protein reported to date (Linnen et al. supra).
Cloning and sequencing The amplification products obtained with AC1S and ASCWP primers were inserted into pCR2.1 vector (Invitrogen, San Diego, CA) and the sequences of the cloned inserts were obtained using the A.L.F. DNA
sequencer (Pharmacia Biotech, Uppsala, Sweden). The sequenced plasmids were named pCR X/AC1S, where X
corresponded to the isolate derived from the 10 patients considered in this study. Nucleotide and aminoacid sequences were analyzed with the CLUSTAL alignment program included in PC/GENE software package (Intelligenetics Inc., Mountain Vlow, CA).
Construction of deletion mutants' Mutants with deletions of different portions of the 5' region of pCR35/AC1S, pCRl9/AC1S, pCR3064/AC1S and pCR2095/AC1S constructs were derived from these plasmids by PCR amplification using synthetic primers. The antisense primer was the AS-CWP primer described above, while the sense primers, designed in our laboratory and derived from the sequences of our GBV-C isolates, mapped just upstream of the first 5 putative start codons on the basis of the sequence of isolate 35 (Fig. 5 and 6), as 5 described in Fig. 8.
Primer 1: nt -43/-27 5'ACGTAAGCTT A GGT GTW GGY CCT ACC G 3' (SEQ ID. No.7) Primer 2: nt +48/+64:
5'ACGTAAGCTT G TAC GGY CCA CGT CGC C 3' (SEQ ID. No.8);
Primer 3: nt +99/+117:
5'ACGTAAGCTT G CGA GTT GRC AAG GAC CAG 3'(SEQ ID. No.9);
Primer 4: nt +167/+Z83:
5'ACGTAAGCTT GR RVC GGG AAA TGC ATG 3'(SEQ ID. No.lO);
Primer 5: nt +239/+255:
15 5'ACGTAACCTT TG AGG GCG GGT GGC ATT 3'(SEQ ID. No.l1), wherein underlined nucleotides correspond to specific sequences of GBV-C, W = A or T; Y = C or T; R = A or G; V
- C or A or G. The position of the primers corresponded to the nucleotide numbers obtained considering as +1 the 20 nucleotide A of the first ATG in the ORF of isolate 35 (see Fig. 5) . The products of PCR with 1, 2, 3, 4 and 5 sense primers and AS-CWP antisense primer were cloned in pCR2.1 and named respectively pCRX/01, pCRX/02, pCRX/~3, pCRX/~4 and pCRX/05, where X corresponded to the 25 identification code of the isolate (35, 19, 3064 and 2095y .
In vitro transcription and translation (IVTT) Plasmids were linearized by digestion with BamHl and transcribed in vitro with T7 RNA polymerase (mCAP mRNA
30 Capping kit, Stratagene, La Jolla, CA) according to the manufacturer's instructions. In most cases, a m'GpppG cap structure was incorporated at the 5' end of the RNA
transcripts. In vitro translation reactions were performed with a rabbit reticulocyte lysate (RRL) system 35 (Amersham, Buckinghamshire, England), according to the manufacturer's instructions. Reactions (25 ul) contained 20 U of rRNasin (Promega, Madison,WI), 20 uCi of ['SS]
cysteine (l.000 Ci/mmol, Amersham, England) and 1.5 ~g of in vitro transcribed RNA. After incubation at 30~C for 1 hr, 2 ul aliquots were denatured for 5 min at 100~C in an equal volume of 2 X sodium dodecyl sulphate polyacrylamide gel electrophoresis (SDS-PAGE) loading buffer and analyzed on 18o SDS-polyacrylamide gels.
Protein gels were dried and exposed to X-ray film.
IO RESULTS
The analysed peptides are comprised in the NS3 region, from aa. 1346 to aa. 1550 (SEQ TD No.l), and are shown in Figure 1.
All of peptides were adsorbed to the solid phase at different concentrations and confronted with a panel of sera from nonAnonE affected patients and healthy donors.
ELISA assays were performed with different concentrations and adsorption conditions. Results show that peptides of Table 1 react with sera with different binding affinities. The reactivity may be increased by previous adsorption to the solid phase of a mixture of different peptides.
The G20T peptide has a decreased activity given to a cysteine dimer at position 1467. The cysteine to serine (G20T(C->S)) substitution solves this stability problem.
In order to get good reactivities solid phases should be prepared either with a mixture of l0ug/ml S16V
and 10 ug/ml G20T(C->S) or with E21A peptide at 20 ug/ml.
To determine the cut-off reactivities of 200 sera from healthy donors were tested and the following formula was applied:
OD cut-off = average OD healthy donors + 8 s.d. (standard deviation) The value was for both of solid phases of 0.450 ODqsonm. A11 of samples with an OD>0.450 were considered to be positive. Results from sera screening were reported in Table 1.
Table 1 S16V+G20T(C-S) E21A
n. patientsa patients patients+ + + +
DONORS
HEP. acute 27 8 32 7 28 nAnE chronic 16 8 32 7 28 B acute 61 30 46, 17 26, 6 HEP. B 14 1 3,1 0 0 chronic C acute 15 B 19,5 3 7,3 C 15 9 22,5 5 12,5 chronic TD HIV-,HCV+ 29 17 32,7 19 36,5 HIV+ 31 HIV+ 33 piastrinopenic I,ES 3 9 Results obtained with p3.1-p3.9 peptides of Figure 1 are shown in Figure 2.
Data show trhat some of analyzed peptides are able to specifically detect antibodies against epitopes of NS3 region of GBV virus.
CORE REGION
Identification of specific epitopes of "core" region Aminoacid sequences (as in figure 3a a 3b) show methionines which may be aligned with either the first or the second methionine already described. The 35HIV clone sequencing revealed a new aminoacid sequence, at the N
terminus region, starting with a methionine residue.
By clone sequencing aminoacid sequences of GBV virus N-terminus were revealed. The "consensus" sequence is shown in Figure 4a.
Figure 4b shows the reactivity obtained by assaying different sera with the peptide having the consensus sequence of Fig. 4a.
Peptide synthesis 5 Peptides were synthesized with an automated synthesizer MilliGen 9050 (Septrim) as shown for peptide for NS3 region.
Cases sera from 20 healthy donors;
10 sera from 23 acute nonAnonE hepatitis patients;
sera from 10 chronic nonAnonE hepatitis patients;
sera from 22 acute hepatitis B patients;
sera from 33 drug addicted HIV-, HCV+ patients;
sera from 7 HIV+ patients;
15 sera from 17 HIV+ piastrinopenic patients;
sera from 47 Community Aquired HCV+ patients;
sera from 11 AIDS patients.
ELISA assay Solid phase was prepared on polivinylchloride dishes (Nunc) . l00 ul of a 20 ug/ml solution of G11C, D23C-VA, M37C-LVA, D13G-TG and M27C-CTG peptides, in carbonate buffer 50 mM, pH 9.6 were added to each well and incubated for 2 hrs at 37~C.
Further to peptide adsorbtion solid phases were saturated for 1 hr at 37~C with 300 ul/well of a BSA 20 solution in Tris-HC1 0.1M pH 7.5.
100 ul of 1:21 diluted serum in PBS containing 0.3 g/1 EDTA, 2 g/1 BSA, 2 m1/1 Triton x100 and 2 m1/1 Tween 20 were incubated into wells for 2 hrs at 37~C.
Following to 4 washes with PBS/Tween 20 0.1%, 100 ul/well of horseradish peroxidas'e conjugated goat anti human Ig (Sorin Biomedica), diluted l:100 in a solution of PBS containing 0.04% Tween 20 and 200 of preimmune goat serum. were added to each well.
Following to 4 washes of PBS/Tween 20 0 . 1 o BSA, 100 ul of chromogen and substrate were added to each well (Sorin DEIA KIT). The colorimetric reaction was blocked after 30 min with 200 ul/well of sulphuric acid 1N. The OD reading was performed at 450 nm by referrng to OD at 630 nm.
Results show the presence of antibodies against peptides of the "core" region of GBV virus.
Sequence varialbility of the 5' terminal region of different GBV-C isolates The GBV-C isolates so far analyzed have revealed putative truncated core proteins of about 83 and 46 amino acids in length (Linnen et al. supra, Simons et al. 1995 supra). These differences are due to nucleotide deletions or insertions that introduce in the sequences a frameshift causing the generation of distinct in-frame starting codons. It is conceivable, therefore, that the intrinsic genomic variability of this virus may introduce, in some particular isolates, additional mutations, insertions or deletions that could generate potentially functional core proteins.
In order to evaluate this possibility, we analyzed the nucleotide sequences, and their deduced translation products. On the basis of this analysis, we found 4 different types of sequences. The sequences of isolates 19, DM, DS39, BZ, 1l59 and 47 encoded for a putative core protein of approximately 83 aminoacids in length, corresponding to the published sequence of R10291 isolate (GenBank accession no. U45966) (Linnen et al. supra) (GBV2 in Fig. 5 and 6). The most 5' and unique inframe AUG codon of isolates 3064 and MG, on the other hand, could initiate the translation of a putative core protein of 46 aminoacids in length similar to the one potentially encoded by the published sequences of GBV-C isolate (GenBank accession no. U36380) (teary et aI. supra) and PNF2161 isolate (Linnen et al. supra) (GenBank accession no. U44402) (GBV-C and GBV1 in the Fig. 5 and 6). The sequence of isolate 2095 was more peculiar since, due to one base deletion (Fig. 5), it contained only one potential in-frame initiator AUG codon, downstream of the putative start codon of GBV-C and HGVI and located 9 nucleotides only upstream of the sequence that specifies the putative E1 signal sequence (Leary ET al. supra). The mostly unexpected finding, however, was that the sequence derived from the isolate 35 (35/1 in Fig. 5 and 6) could potentially encode for a putative core protein of approximately 106 aminoacids in length. This particular feature was due to a point mutation that introduces an additional in-frame AUG codon 69 nucleotides upstream of the putative initiation codon of the HGV 2 reference sequence.
In order to exclude the possibility of errors (intrinsic to the polymerase chain reactions) or of sequence misinterpretations (possibly due to the frequent sequence compressions), we confirmed our results by amplifying and sequencing several clones of each isolate (data not shown). Moreover, in the case of patient 35, we cloned and sequenced RT-PCR products derived from 2 additional sequential serum samples collected during the one year follow-up of this patient (sequences 35/2 and 35/3 in Fig. 6). While this analysis confirmed the presence of the upstream AUG codon in a11 the samples of patient 35, it also established that this particular isolate was stable over the time and that it was not dominantly replaced by a population of mutants., Thus, our data provide the first evidence for the existence of additional GBV-C sequences, potentially leading to a core protein longer and not described in the previously cDNA clones derived from the numerous viremic patients analyzed so far (Leary et al., Simons et al.
1995, 1996, supra) .
Finally, our results establish that the putative core proteins of our isolates are characterized by discrete regions of high variability, that are separated by highly conserved motifs (Fig. 6).
Translation of isolates with distinct putative initiator codons 5 The deduced translation products derived from the nucleotide sequences of our isolates demonstrate that these GBV-C sequences are very heterogeneous with respect to the potential translation initiation site. Thus, in order to understand the potential replicative competence 10 of these viral isolates, it was important to assess whether initiation at any of these sites can indeed occurr. This issue is particularly relevant since, recently, Simons et al. (1996 supra) have provided evidence suggesting that, in their GBV-C isolates, 15 initiation of translation occurred only immediately upstream of the putative El signal sequence and that the upstream in frame AUG seemed to be unable to direct the synthesis of GBV-C polyprotein.
To determine the effect of the upstream in frame AUG
20 codons on translation of our viral isolates, mRNAs with capped methylated (m'GpppG) 5' end, synthesized from linearized pCRX/AC1S constructs, were examined for mRNA
activity in a cell-free translation system prepared from rabbit reticulocytes. For this purpose we considered 25 isolates with distinct putative initiator codons.
Isolate 35 had 5 in-frame AUG codons in the putative core region (Fig. 5 and 6) and therefore the pCR35/AC1S
construct (covering the partial 5'UTR, the putative core and a portion of the E1 protein) can potentially mediate 30 the synthesis of different transcription products with expected sizes of about 17, 14,11, 10 or 7 kDa. Figure 7 shows that efficient translation occurred and a polypeptide with the apparent size of 17 kDa, compatible with the activity of the first in-frame AUG, was observed 35 in a SDS-PAGE analysis. It is important to note that efficient translation was also obtained with IVTT
reactions programmed with uncapped RNA derived from pCR35/AC1S. Thus, these data strongly suggest that, at least in this construct, the first available in-frame AUG
from the 5' end of the RNA can efficiently act as the initiator codon. Furthermore, since the cap stmcture was not determinant for efficient translation, the data also suggest that, as recently proposed by Simons et al.
(Simons et al. 1996, supra), GBV-C RNA may not utilize the classical ribosome scanning mechanism.
IO IVTT reaction mixtures were also programmed with pCRDS39/AC1S, pCRDM/AC1S, pCRl9/AC1S, pCR3064/AC1S, pCRMG/AC1S and pCR2095/AC1S constructs.
Isolates DS39, DM and 19 had the first in-frame AUG
corresponding to the second in-frame AUG of isolate 35 (Fig. 5 and 6). Moreover, isolate DS39 had also two additional putative initiatior codons so that the pCRDS39/AC1S construct could potentially direct the synthesis of three possible proteins of 14, 10 and 7 kDa.
The pCRDM/ACIS construct, on the other hand, can potentially initiate translation at 4 different AUG
codons, producing polypeptides of I4, II, 10 and 7 kDa.
Two of these plasmids, pCRDS39/AC1S and pCRl9/AC1S, directed the translation of products of about 14 kDa as detected by SDS-PAGE (Fig. 7). The size of these proteins is compatible with the activity of the AUG sequence just downstream of the start codon of the pCR35/AC1S
construct, suggesting that, also in these cases, translation was initiated at the first available in-frame AUG of the ORF. Since pCRl9/AC1S contained only one AUG
inframe with the long ORF, and since on the basis of the nucleotide sequence we could also exclude the possibility that the expressed proteins represented unrelated products of translation directed by AUG codon located in the other two frames, we conclude that the observed band is the correct product of translation.
In contrast, no bands were detected in the reactions programmed with the pCRDM/AC1S plasmid. Since this construct contained the same potential in-frame initiator AUG codon than pCR39/AC1S and pCRl9/AC1S, the results 5 rise the possibility that Sequences upstream of the potential initiator codon may influence the translation either positively or negatively. This possibility is also reinforced by the fact that the plasmids derived from isolates 3064 and MG (pCR3064/AC1S and pCRMG/AC1S), that 10 could potentially direct the synthesis of a band of 10 kDa, did not produce detectable quantities of specific proteins, in accordance to the data of Simons et al.
(Simons et al. 1996 supra). Negative results were also obtained with the pCR2095/AC1S construct, derived from 15 isolate 2095, that could potentially produce a protein of about 7 kDa.
Analysis of deletion mutants To examine in details the activity of the different potential in-frame initiator AUG codons, to verify the 20 possibility of a regulatory role of the upstream sequences and to unequivocally establish the size of the obtained translation products, cell-free protein synthesis experiments were performed with capped mRNA
transcribed by truncated forms of the 35, 19, 3064 and 2095 constructs described in Fig. 8. Figure 9A shows the results obtained when IVTT reaction mixtures were programmed with the 35 deletion mutant plasmids. In these experiments, translation of the mRNA derived from pCR35/AC1S yielded the major precursor band of 17 kDa as well as 3 additional faint bands of 14, 11 and 7 kDa, probably corresponding to translation products that utilize the second, third and the fifth in-frame AUG as initiator codons. A very similar pattern was obtained with the pCR35/~1 construct. In vitro translation of pCR35/02, pCR35/03, pCR35/~4 and pCR35/05 transcripts yielded bands of progressively lower molecular weights of about 14, 11 and 7 kDa, implying therefore that, in this isolate, a11 the in-frame AUG codons, with the exception of the fourth AUG, might serve as potential site of translation initiation. In the lane containing the translation products of pCR35/02 mutant we found two bands of 14 and 11 kDa corresponding to translation products that utilized both the second and the third available AUG initiation codons.
As expected, the analysis of the deletion mutants of pCRl9/AC1S, that contains only one in-frame AUG, gave different results (Fig. 9B). The 14 kDa bands obtained with reactions programmed with pCRl9/AC1S, pCRl9/~1 and pCRl9/~2 demonstrate that the first and unique in-frame AUG is the site of translation initiation and that., at least for this mutant, deletion of 5' UTR does not seem to significantly influence the level of translation.
A quite unexpected finding of these experiments was that the reactions programmed with some of the deletion mutants of pCRl9/AC1S construct, that lacked the AUG
codon immediately upstream of the putative E1 signal sequence, yielded a band with the same apparent molecular mass of the 7 kDa protein detected in the reactions programmed with pCR35/44 and pCR35/~5. Although we can not exclude that this band may have resulted from partially degraded RNA, it is interesting that in this template the AUG triplet correspcnding to the fifth in-frame start codon of isolate 35 is replaced by an ACG
codon, that has been found to act as initiation codon in other mRNAs (Boeck, R., and D. Kolakofsky. l994. EMBO J.
13:3608-3617), including a transcript derived from a GBV-C mutated sequence (Simons et al. 1996 supra).
Reactions programmed with the deletion mutants derived from pCR3064/AC1S construct provided no detectable bands (data not shown). Since in this constructs the only available initiation codon is the AUG
corrisponding to the fourth start codon of the isolate 35, these results confirmed the data obtained with the mutants of isolate 35 and with the pCR3064/AC1S and pCRMG/AC1S plasmids, suggesting that this particular codon, as already reported (Simons et al. 1996 supra), is silent, probably due to the composition of the flanking sequences.
Finally, evidence for a regulatory role of the partial 5'UTR were obtained with the deleted 2095 plasmids (Fig. 9C). In this isolate the only available site of transcription initiation corresponds to the fifth AUG of isolate 35, that is however silent in construct pCR2095/AC1S. The expected 7 kDa product was surprisingly detected as a faint band with pCR2095/01, pCR2095/02 and pCR2095/~3 and the intensity of this band increased progressively by programming the reactions with pCR2095/04 and pCR2095/05. These data clearly mean that, in this particular template, the translation of the polypeptide is strongly influenced by the partial 5'UTR
present in the undeleted construct that, in this case, seems to inhibit the translation process.
Finally, we must stress that we have verified that none of the bands obtained with the deletion mutants can be accounted for by unrelated products of translation encoded by the other two possible reading frames.
o . ~ CA 02251745 1999-04-27 - 2 3 -~ ;
SEQUENCE LISTING
(1) GENERAL INFORMATION:
(i) APPLICANT: WABCO BV
(ii) TITLE OF INVENTION: Non-A non-E Hepatitis virus having a translatable core region, reagents and methods for their use (iii) NUMBER OF SEQUENCES: 11 (iv) CORRESPONDENCE ADDRESS:
(A) ADDRESSEE: Dennison Associates (B) STREET: 133 Richmond Street West, Suite 301 (C) CITY: Toronto (D) PROVINCE: Ontario (E) COUNTRY: Canada (F) POSTAL CODE: M5H 2L7 (v) COMPUTER READABLE FORM:
(A) MEDIUM TYPE: Floppy disk (B) COMPUTER: IBM PC compatible (C) OPERATING SYSTEM: PC-DOS/MS-DOS
(D) SOFTWARE: PatentIn Release #1.0, Version #1.30(EPO) (vi) CURRENT APPLICATION DATA:
(A) APPLICATION NUMBER: 2,251,745 (B) FILING DATE: 09-DEC-1996 (C) CLASSIFICATION:
(vii) ATTORNEY/AGENT INFORMATION:
(A) NAME: Dennison Associates (B) REFERENCE/DOCKET NUMBER: JJ-10288CA
(viii) TELECOMMUNICATION INFORMATION:
(A) TELEPHONE: 416-368-8313 (B) TELEFAX: 416-368-1645 (2) INFORMATION
FOR
SEQ
ID
NO:
1:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 205 amino acids (B) TYPE: amino acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE: peptide (xi) SEQUENCE DESCRIPTION: SEQ ID N0: 1:
Arg Val Arg Asp Val Ala Arg Gly Cys Gly Val Gln Leu Leu Tyr Val Ala Thr Ala Thr Pro Pro Gly Ser Pro Met Thr Gln His Ser Ile Pro Ile Glu Thr Lys Leu Asp Val Gly Glu I1e Pro Phe Tyr His Gly Gly Ile Pro Leu Glu Arg Met Arg Thr Gly Arg His Leu Val Phe Cys His Ser Lys Ala Glu Cys Glu Arg Leu Ala Gly Gln Phe Ser Ala Arg Gly Val Asn Ala Ile Ala Tyr Tyr Arg Gly Lys Asp Ser Ser Ile Ile Lys Asp Gly Asp Leu Val AlaThrAsp AlaLeu SerThrGly Tyr Val Cys Thr Gly Asn Phe Ser ThrAspCys GlyLeu ValValGlu Glu Asp Val Val Val Glu Val Leu ProThrIle ThrIle SerLeuArg Thr Thr Asp Val Pro Ala Ser Glu SerMetGln ArgArg GlyArgThr Gly Ala Leu Arg Gly Arg Ser Arg TyrTyrAla GlyVal GlyLysAla Pro Gly Tyr A1a Gly Val Val Ser ProValTrp SerAla ValGluAla Gly Arg Gly Val Thr Trp Tyr Met ProAspLeu ThrAla Asn Gly Glu (2) INFO RMATION FOR ID NO: :
3 (i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 7 aminoacids (B) TYPE: o acid amin (C) STRANDEDNESS: le sing (D) TOPOLOGY:linear (ii) MOLECULE TYPE:peptide (xi)SEQUENCE DESCRIPTION: 2:
SEQ ID NO:
Met Ser Leu Asn Arg Ala TyrPro ProGlyLeu ThrThr ProThr Arg Tyr Gly Pro Arg Arg Pro MetSer LeuLeuThr AsnArg PheAsn Ser Arg Arg Val Asp Lys Asp TrpGly ProGlyVal MetGly LysAsp Gln Pro Lys Pro Cys Pro Ser ArgThr GlyLysCys MetGly ProPro Arg Ser Ser Ala Ala Ala Cys ArgGly SerProArg IleLeu ArgVal Ser Arg Ala Gly Gly Ile Ser ProTyr ThrIleMet GluAla LeuLeu Leu Phe Leu Leu Gly Val Glu GlyAla IleLeu Ala (2) INFORMATION
FOR SEQ
ID N0:
3:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 84 amino acids (B) TYPE: amino acid (C) STRANDEDNESS:
single (D) TOPOLOGY: linear (ii)MOLECULE TYPE: peptide (xi)SEQUENCE DESCRIPTION: 3:
SEQ ID NO:
Met Ser Leu Leu Thr Asn PheAsn ArgArgVaI AspLys AspGln Arg Trp Gly Pro Gly Val Met LysAsp ProLysPro CysPro SerArg Gly Arg Thr Gly Lys Cys Met ProPro SerSerA1a AlaAla CysSer Gly Arg Gly Ser Pro Arg Ile Leu Arg Val Arg Ala Gly Gly Ile Ser Leu Pro Tyr Thr Ile Met Glu Ala Leu Leu Phe Leu Leu Gly Val Glu Ala Gly A1a Ile Leu (2) INFORMATION FOR SEQ ID N0: 4:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 23 amino acids 1 5 (B) TYPE: amino acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE: peptide (xi) SEQUENCE DESCRIPTION: SEQ ID N0: 4:
Met Ser Leu Asn Arg Ala Arg Tyr Pro Pro Gly Leu Thr Thr Pro Thr Tyr Gly Pro Arg Arg Pro Ser (2) INFORMATION FOR SEQ ID NO: 5:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 22 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE: other nucleic acid (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5:
(2) INFORMATION FOR SEQ ID NO:
6:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 21 base pairs (B) TYPE; nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE: other nucleicacid (xi) SEQUENCE DESCRIPTION: SEQ NO: 6:
ID
1 (2) INFORMATION FOR SEQ ID NO:
5 7:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 27 base pairs (B} TYPE: nucleic acid 2 (C) STRANDEDNESS; single (D) TOPOLOGY: linear (ii) MOLECULE TYPE: other nucleicacid 2 (xi) SEQUENCE DESCRIPTION: SEQ N0: 7:
(2) INFORMATION FOR SEQ ID NO:
8:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 27 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single 3 (D) TOPOLOGY: linear (ii) MOLECULE TYPE: other nucleicacid (xi) SEQUENCE DESCRIPTION: SEQ NO: 8:
ID
(2) INFORMATION FOR SEQ ID NO:
9:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 29 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE: other nucleic acid (xi) SEQUENCE DESCRIPTION: SEQ N0: 9:
ID
1 (2) INFORMATION FOR SEQ ID N0:
5 10:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 27 base pairs (B) TYPE: nucleic acid 2 (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE: other nucleic acid 2 (xi) SEQUENCE DESCRIPTION: SEQ NO: 10:
(2) INFORMATION FOR SEQ ID NO:
11:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 27 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single 3 (D) TOPOLOGY: linear (ii) MOLECULE TYPE: other nucleic acid (xi) SEQUENCE DESCRIPTION: SEQ NO: 11:
ID
Claims (8)
1. Peptide able to specifically bind to nonAnone hepatitis virus antibodies having a sequence of at least 10 aminoacids, said nonAnone hepatitis virus being characterized by having a translatable functional core-like protein longer than 84 aminoacid residues and by comprising an aminoacid sequence at least 60% homologous to the sequence of SEQ ID NO. 4:
MSLNRARYPPGLTTPTYGPRRPS; or an immunologically homologous variant of the peptide.
MSLNRARYPPGLTTPTYGPRRPS; or an immunologically homologous variant of the peptide.
2. Peptide according to claim 1 having a sequence comprised in SEQ ID No. 4.
3. Composition comprising at least one of the peptides according to claim 1 or 2.
4. Composition according to claim 3 which is adhered on a solid phase.
5. Composition according to claim 3 or 4 comprising at least two peptides according to any of claim 1 or 2.
6. Diagnostic kit for nonAnone hepatitis comprising as specific reagent at least one peptide according to any of claim 1 or 2.
7. Diagnostic kit for nonAnone hepatitis comprising as specific reagent at least two peptides according to any of claim 1 or 2.
8. Method to detect nonA nonE hepatitis virus antibodies in a sample comprising the following steps:
- incubating in conditions allowing the formation of an antigen-antibody complex said sample with at least one peptide according to any of claims 1 or 2;
revealing said complex.
- incubating in conditions allowing the formation of an antigen-antibody complex said sample with at least one peptide according to any of claims 1 or 2;
revealing said complex.
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
ITRM96A000245 | 1996-04-17 | ||
IT96RM000245A IT1284630B1 (en) | 1996-04-17 | 1996-04-17 | SPECIFIC EPITOPES OF VIRUSES ASSOCIATED WITH NON"A"-NON"E HEPATITIS, COMPOSITIONS AND METHOD FOR DETECTING ANTIBODY OF THESE EPITOPES. |
Publications (1)
Publication Number | Publication Date |
---|---|
CA2251745A1 true CA2251745A1 (en) | 1997-10-23 |
Family
ID=11404119
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CA002251745A Abandoned CA2251745A1 (en) | 1996-04-17 | 1996-12-09 | Non-a-non-e hepatitis virus having a translatable core region, reagents and methods for their use |
Country Status (5)
Country | Link |
---|---|
EP (1) | EP0914445A1 (en) |
AU (1) | AU1109297A (en) |
CA (1) | CA2251745A1 (en) |
IT (1) | IT1284630B1 (en) |
WO (1) | WO1997039129A1 (en) |
Families Citing this family (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
IT1297096B1 (en) * | 1997-12-03 | 1999-08-03 | Sorin Diagnostics S R L | METHOD FOR THE DETECTION OF SINGLE HELIX DNA ANALYTES |
Family Cites Families (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
AU2594195A (en) * | 1994-05-20 | 1995-12-18 | Genelabs Technologies, Inc. | Detection of viral antigens coded by reverse-reading frames |
PT763114E (en) * | 1994-05-20 | 2002-05-31 | Genelabs Tech Inc | HEPATITIS G VIRUS AND MOLECULAR CLONING |
-
1996
- 1996-04-17 IT IT96RM000245A patent/IT1284630B1/en active IP Right Grant
- 1996-12-09 WO PCT/IT1996/000248 patent/WO1997039129A1/en not_active Application Discontinuation
- 1996-12-09 CA CA002251745A patent/CA2251745A1/en not_active Abandoned
- 1996-12-09 EP EP96941817A patent/EP0914445A1/en not_active Withdrawn
- 1996-12-09 AU AU11092/97A patent/AU1109297A/en not_active Abandoned
Also Published As
Publication number | Publication date |
---|---|
EP0914445A1 (en) | 1999-05-12 |
AU1109297A (en) | 1997-11-07 |
WO1997039129A1 (en) | 1997-10-23 |
ITRM960245A1 (en) | 1997-10-17 |
IT1284630B1 (en) | 1998-05-21 |
ITRM960245A0 (en) | 1996-04-17 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
EP0984068B1 (en) | Sequences of hepatitis c virus genotypes and their use as therapeutic and diagnostic agents | |
EP0998567B1 (en) | Cloned genomes of infectious hepatitis c viruses and uses thereof | |
AU691259B2 (en) | Method for reproducing in vitro the proteolytic activity of the NS3 protease of hepatitis C virus (HCV) | |
EP0779924B1 (en) | Nucleotide and amino acid sequences of the envelope 1 and core genes of hepatitis c virus | |
US7855052B2 (en) | Sequences of hepatitis C virus genotypes and their use as therapeutic and diagnostic agents | |
EP0763114A2 (en) | Hepatitis g virus and molecular cloning thereof | |
Pethel et al. | Mutational analysis of the octapeptide sequence motif at the NS1-NS2A cleavage junction of dengue type 4 virus | |
WO1995032292A2 (en) | Detection of viral antigens coded by reverse-reading frames | |
EP1185665B1 (en) | INFECTIOUS cDNA CLONE OF GB VIRUS B AND USES THEREOF | |
EP0518313A2 (en) | Gene of hepatitis C virus or fragment thereof, polypeptide encoded by the same | |
CA2251745A1 (en) | Non-a-non-e hepatitis virus having a translatable core region, reagents and methods for their use | |
WO1993002103A1 (en) | Epitopes of the env protein of the hepatitis c virus | |
Chang et al. | Antigenic heterogeneity of the hepatitis C virus NS4 protein as modeled with synthetic peptides | |
US7838002B2 (en) | HCV core+1 protein, methods for diagnosis of HCV infections, prophylaxis, and for screening of anti-HCV agents | |
Simmons | Site-Directed Mutagenesis of Southern Bean Mosaic Virus (Cowpea Strain): in Search of a Viral Protease | |
MXPA99006629A (en) | Epitopes of the protein pre-m/m of the dengue virus, synthetic peptides |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
FZDE | Discontinued |