US20050130135A1 - Hepatitis a virus nucleotide sequences, recombinant proteins and uses thereof - Google Patents
Hepatitis a virus nucleotide sequences, recombinant proteins and uses thereof Download PDFInfo
- Publication number
- US20050130135A1 US20050130135A1 US11/051,613 US5161305A US2005130135A1 US 20050130135 A1 US20050130135 A1 US 20050130135A1 US 5161305 A US5161305 A US 5161305A US 2005130135 A1 US2005130135 A1 US 2005130135A1
- Authority
- US
- United States
- Prior art keywords
- sequence
- seq
- ser
- leu
- gly
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Abandoned
Links
- 241000709721 Hepatovirus A Species 0.000 title claims abstract description 138
- 102000007056 Recombinant Fusion Proteins Human genes 0.000 title description 31
- 108010008281 Recombinant Fusion Proteins Proteins 0.000 title description 31
- 108091028043 Nucleic acid sequence Proteins 0.000 title description 20
- 239000000427 antigen Substances 0.000 claims abstract description 90
- 108091007433 antigens Proteins 0.000 claims abstract description 82
- 102000036639 antigens Human genes 0.000 claims abstract description 82
- 239000000523 sample Substances 0.000 claims abstract description 80
- 238000000034 method Methods 0.000 claims description 88
- 239000012634 fragment Substances 0.000 claims description 81
- 150000001413 amino acids Chemical class 0.000 claims description 52
- 230000002163 immunogen Effects 0.000 claims description 44
- 239000007787 solid Substances 0.000 claims description 29
- 239000012472 biological sample Substances 0.000 claims description 21
- 230000015572 biosynthetic process Effects 0.000 claims description 14
- 208000015181 infectious disease Diseases 0.000 claims description 10
- 238000012360 testing method Methods 0.000 claims description 10
- 210000002966 serum Anatomy 0.000 claims description 8
- 210000002381 plasma Anatomy 0.000 claims description 6
- 210000004369 blood Anatomy 0.000 claims description 5
- 239000008280 blood Substances 0.000 claims description 5
- 239000013615 primer Substances 0.000 abstract description 58
- 150000007523 nucleic acids Chemical class 0.000 abstract description 57
- 102000039446 nucleic acids Human genes 0.000 abstract description 46
- 108020004707 nucleic acids Proteins 0.000 abstract description 46
- 238000003556 assay Methods 0.000 abstract description 33
- 238000001514 detection method Methods 0.000 abstract description 17
- 239000002987 primer (paints) Substances 0.000 abstract description 14
- 239000002243 precursor Substances 0.000 abstract description 10
- 108090000565 Capsid Proteins Proteins 0.000 abstract description 8
- 102100023321 Ceruloplasmin Human genes 0.000 abstract description 8
- 210000000234 capsid Anatomy 0.000 abstract description 5
- 238000003018 immunoassay Methods 0.000 abstract description 5
- 108020004414 DNA Proteins 0.000 description 98
- 239000002773 nucleotide Substances 0.000 description 61
- 125000003729 nucleotide group Chemical group 0.000 description 60
- 108090000765 processed proteins & peptides Proteins 0.000 description 55
- 102000004196 processed proteins & peptides Human genes 0.000 description 49
- 238000003752 polymerase chain reaction Methods 0.000 description 46
- 229920001184 polypeptide Polymers 0.000 description 45
- 102000040430 polynucleotide Human genes 0.000 description 44
- 108091033319 polynucleotide Proteins 0.000 description 44
- 239000002157 polynucleotide Substances 0.000 description 44
- 108090000623 proteins and genes Proteins 0.000 description 43
- 108091032973 (ribonucleotides)n+m Proteins 0.000 description 37
- 235000018102 proteins Nutrition 0.000 description 37
- 102000004169 proteins and genes Human genes 0.000 description 37
- WHUUTDBJXJRKMK-UHFFFAOYSA-N Glutamic acid Natural products OC(=O)C(N)CCC(O)=O WHUUTDBJXJRKMK-UHFFFAOYSA-N 0.000 description 29
- 230000000295 complement effect Effects 0.000 description 28
- 108091034117 Oligonucleotide Proteins 0.000 description 26
- 235000001014 amino acid Nutrition 0.000 description 25
- 229940024606 amino acid Drugs 0.000 description 24
- 125000003275 alpha amino acid group Chemical group 0.000 description 21
- 238000009396 hybridization Methods 0.000 description 21
- 230000027455 binding Effects 0.000 description 20
- 238000006243 chemical reaction Methods 0.000 description 19
- 230000003321 amplification Effects 0.000 description 18
- 238000003199 nucleic acid amplification method Methods 0.000 description 18
- RXNXLAHQOVLMIE-UHFFFAOYSA-N phenyl 10-methylacridin-10-ium-9-carboxylate Chemical compound C12=CC=CC=C2[N+](C)=C2C=CC=CC2=C1C(=O)OC1=CC=CC=C1 RXNXLAHQOVLMIE-UHFFFAOYSA-N 0.000 description 18
- 108090000626 DNA-directed RNA polymerases Proteins 0.000 description 17
- 102000004163 DNA-directed RNA polymerases Human genes 0.000 description 17
- 210000004027 cell Anatomy 0.000 description 17
- 102000004190 Enzymes Human genes 0.000 description 16
- 108090000790 Enzymes Proteins 0.000 description 16
- 239000002585 base Substances 0.000 description 16
- 229940088598 enzyme Drugs 0.000 description 16
- 239000000203 mixture Substances 0.000 description 16
- 238000013518 transcription Methods 0.000 description 16
- 230000035897 transcription Effects 0.000 description 16
- 239000013598 vector Substances 0.000 description 16
- KDXKERNSBIXSRK-UHFFFAOYSA-N Lysine Natural products NCCCCC(N)C(O)=O KDXKERNSBIXSRK-UHFFFAOYSA-N 0.000 description 15
- 108020005187 Oligonucleotide Probes Proteins 0.000 description 15
- 108010050848 glycylleucine Proteins 0.000 description 15
- 239000002751 oligonucleotide probe Substances 0.000 description 15
- 239000013612 plasmid Substances 0.000 description 15
- 108010014303 DNA-directed DNA polymerase Proteins 0.000 description 14
- 102000016928 DNA-directed DNA polymerase Human genes 0.000 description 14
- 108010092799 RNA-directed DNA polymerase Proteins 0.000 description 14
- 238000010367 cloning Methods 0.000 description 14
- -1 aromatic amino acids Chemical class 0.000 description 13
- 239000003153 chemical reaction reagent Substances 0.000 description 13
- 230000000694 effects Effects 0.000 description 13
- 102000053602 DNA Human genes 0.000 description 12
- 102100031780 Endonuclease Human genes 0.000 description 12
- 230000004048 modification Effects 0.000 description 12
- 238000012986 modification Methods 0.000 description 12
- 240000004808 Saccharomyces cerevisiae Species 0.000 description 10
- 235000014680 Saccharomyces cerevisiae Nutrition 0.000 description 10
- 239000002299 complementary DNA Substances 0.000 description 10
- 108010078144 glutaminyl-glycine Proteins 0.000 description 10
- 230000003053 immunization Effects 0.000 description 10
- 238000003786 synthesis reaction Methods 0.000 description 10
- 241001465754 Metazoa Species 0.000 description 9
- 241000700605 Viruses Species 0.000 description 9
- JLCPHMBAVCMARE-UHFFFAOYSA-N [3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-hydroxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methyl [5-(6-aminopurin-9-yl)-2-(hydroxymethyl)oxolan-3-yl] hydrogen phosphate Polymers Cc1cn(C2CC(OP(O)(=O)OCC3OC(CC3OP(O)(=O)OCC3OC(CC3O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c3nc(N)[nH]c4=O)C(COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3CO)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cc(C)c(=O)[nH]c3=O)n3cc(C)c(=O)[nH]c3=O)n3ccc(N)nc3=O)n3cc(C)c(=O)[nH]c3=O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)O2)c(=O)[nH]c1=O JLCPHMBAVCMARE-UHFFFAOYSA-N 0.000 description 9
- 239000002253 acid Substances 0.000 description 9
- 239000000975 dye Substances 0.000 description 9
- 108091008146 restriction endonucleases Proteins 0.000 description 9
- 108010061238 threonyl-glycine Proteins 0.000 description 9
- WHUUTDBJXJRKMK-VKHMYHEASA-N L-glutamic acid Chemical compound OC(=O)[C@@H](N)CCC(O)=O WHUUTDBJXJRKMK-VKHMYHEASA-N 0.000 description 8
- 150000007513 acids Chemical class 0.000 description 8
- 229920000642 polymer Polymers 0.000 description 8
- 108010029020 prolylglycine Proteins 0.000 description 8
- 108020004635 Complementary DNA Proteins 0.000 description 7
- 241000588724 Escherichia coli Species 0.000 description 7
- ROHFNLRQFUQHCH-YFKPBYRVSA-N L-leucine Chemical compound CC(C)C[C@H](N)C(O)=O ROHFNLRQFUQHCH-YFKPBYRVSA-N 0.000 description 7
- 101710163270 Nuclease Proteins 0.000 description 7
- 230000000536 complexating effect Effects 0.000 description 7
- 230000001419 dependent effect Effects 0.000 description 7
- 230000014509 gene expression Effects 0.000 description 7
- VPZXBVLAVMBEQI-UHFFFAOYSA-N glycyl-DL-alpha-alanine Natural products OC(=O)C(C)NC(=O)CN VPZXBVLAVMBEQI-UHFFFAOYSA-N 0.000 description 7
- 208000005252 hepatitis A Diseases 0.000 description 7
- 210000004408 hybridoma Anatomy 0.000 description 7
- 238000000746 purification Methods 0.000 description 7
- 239000000243 solution Substances 0.000 description 7
- 238000006467 substitution reaction Methods 0.000 description 7
- 230000003612 virological effect Effects 0.000 description 7
- QOJJMJKTMKNFEF-ZKWXMUAHSA-N Asp-Val-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC(O)=O QOJJMJKTMKNFEF-ZKWXMUAHSA-N 0.000 description 6
- RKXVTTIQNKPCHU-KKHAAJSZSA-N Asp-Val-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC(O)=O RKXVTTIQNKPCHU-KKHAAJSZSA-N 0.000 description 6
- 108010017826 DNA Polymerase I Proteins 0.000 description 6
- 102000004594 DNA Polymerase I Human genes 0.000 description 6
- 241000880493 Leptailurus serval Species 0.000 description 6
- ROHFNLRQFUQHCH-UHFFFAOYSA-N Leucine Natural products CC(C)CC(N)C(O)=O ROHFNLRQFUQHCH-UHFFFAOYSA-N 0.000 description 6
- SITLTJHOQZFJGG-UHFFFAOYSA-N N-L-alpha-glutamyl-L-valine Natural products CC(C)C(C(O)=O)NC(=O)C(N)CCC(O)=O SITLTJHOQZFJGG-UHFFFAOYSA-N 0.000 description 6
- XZFYRXDAULDNFX-UHFFFAOYSA-N N-L-cysteinyl-L-phenylalanine Natural products SCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 XZFYRXDAULDNFX-UHFFFAOYSA-N 0.000 description 6
- KZNQNBZMBZJQJO-UHFFFAOYSA-N N-glycyl-L-proline Natural products NCC(=O)N1CCCC1C(O)=O KZNQNBZMBZJQJO-UHFFFAOYSA-N 0.000 description 6
- 108700026244 Open Reading Frames Proteins 0.000 description 6
- 102000019197 Superoxide Dismutase Human genes 0.000 description 6
- 108010012715 Superoxide dismutase Proteins 0.000 description 6
- 239000002671 adjuvant Substances 0.000 description 6
- 108010044940 alanylglutamine Proteins 0.000 description 6
- AVKUERGKIZMTKX-NJBDSQKTSA-N ampicillin Chemical compound C1([C@@H](N)C(=O)N[C@H]2[C@H]3SC([C@@H](N3C2=O)C(O)=O)(C)C)=CC=CC=C1 AVKUERGKIZMTKX-NJBDSQKTSA-N 0.000 description 6
- 229960000723 ampicillin Drugs 0.000 description 6
- 238000004458 analytical method Methods 0.000 description 6
- 239000003795 chemical substances by application Substances 0.000 description 6
- 238000002405 diagnostic procedure Methods 0.000 description 6
- 238000002474 experimental method Methods 0.000 description 6
- 108010055341 glutamyl-glutamic acid Proteins 0.000 description 6
- 108010037850 glycylvaline Proteins 0.000 description 6
- 238000002649 immunization Methods 0.000 description 6
- 238000000338 in vitro Methods 0.000 description 6
- 238000004519 manufacturing process Methods 0.000 description 6
- 239000011541 reaction mixture Substances 0.000 description 6
- 239000000126 substance Substances 0.000 description 6
- 210000001519 tissue Anatomy 0.000 description 6
- 101710132601 Capsid protein Proteins 0.000 description 5
- 101710197658 Capsid protein VP1 Proteins 0.000 description 5
- 108091026890 Coding region Proteins 0.000 description 5
- 230000006820 DNA synthesis Effects 0.000 description 5
- UHPAZODVFFYEEL-QWRGUYRKSA-N Gly-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)CN UHPAZODVFFYEEL-QWRGUYRKSA-N 0.000 description 5
- DHMQDGOQFOQNFH-UHFFFAOYSA-N Glycine Chemical compound NCC(O)=O DHMQDGOQFOQNFH-UHFFFAOYSA-N 0.000 description 5
- 108010021625 Immunoglobulin Fragments Proteins 0.000 description 5
- 102000008394 Immunoglobulin Fragments Human genes 0.000 description 5
- OZVXDDFYCQOPFD-XQQFMLRXSA-N Lys-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCCN)N OZVXDDFYCQOPFD-XQQFMLRXSA-N 0.000 description 5
- 101710118046 RNA-directed RNA polymerase Proteins 0.000 description 5
- BTKUIVBNGBFTTP-WHFBIAKZSA-N Ser-Ala-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)NCC(O)=O BTKUIVBNGBFTTP-WHFBIAKZSA-N 0.000 description 5
- KZSNJWFQEVHDMF-UHFFFAOYSA-N Valine Chemical compound CC(C)C(N)C(O)=O KZSNJWFQEVHDMF-UHFFFAOYSA-N 0.000 description 5
- 101710108545 Viral protein 1 Proteins 0.000 description 5
- DZBUGLKDJFMEHC-UHFFFAOYSA-N acridine Chemical compound C1=CC=CC2=CC3=CC=CC=C3N=C21 DZBUGLKDJFMEHC-UHFFFAOYSA-N 0.000 description 5
- 238000007792 addition Methods 0.000 description 5
- 239000011543 agarose gel Substances 0.000 description 5
- KOSRFJWDECSPRO-UHFFFAOYSA-N alpha-L-glutamyl-L-glutamic acid Natural products OC(=O)CCC(N)C(=O)NC(CCC(O)=O)C(O)=O KOSRFJWDECSPRO-UHFFFAOYSA-N 0.000 description 5
- 230000000890 antigenic effect Effects 0.000 description 5
- 108010062796 arginyllysine Proteins 0.000 description 5
- 108010093581 aspartyl-proline Proteins 0.000 description 5
- 210000003719 b-lymphocyte Anatomy 0.000 description 5
- 238000010804 cDNA synthesis Methods 0.000 description 5
- 108010016616 cysteinylglycine Proteins 0.000 description 5
- 238000009585 enzyme analysis Methods 0.000 description 5
- 230000006870 function Effects 0.000 description 5
- 239000000499 gel Substances 0.000 description 5
- 108010040030 histidinoalanine Proteins 0.000 description 5
- 108010092114 histidylphenylalanine Proteins 0.000 description 5
- 230000007062 hydrolysis Effects 0.000 description 5
- 238000006460 hydrolysis reaction Methods 0.000 description 5
- 230000005847 immunogenicity Effects 0.000 description 5
- 108010083708 leucyl-aspartyl-valine Proteins 0.000 description 5
- 108010034529 leucyl-lysine Proteins 0.000 description 5
- 229910052751 metal Inorganic materials 0.000 description 5
- 239000002184 metal Substances 0.000 description 5
- 239000002245 particle Substances 0.000 description 5
- 238000002360 preparation method Methods 0.000 description 5
- 108010079317 prolyl-tyrosine Proteins 0.000 description 5
- 108010053725 prolylvaline Proteins 0.000 description 5
- WXERCAHAIKMTKX-ZLUOBGJFSA-N Ala-Asp-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O WXERCAHAIKMTKX-ZLUOBGJFSA-N 0.000 description 4
- RYQSYXFGFOTJDJ-RHYQMDGZSA-N Arg-Thr-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O RYQSYXFGFOTJDJ-RHYQMDGZSA-N 0.000 description 4
- HZPSDHRYYIORKR-WHFBIAKZSA-N Asn-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H](N)CC(N)=O HZPSDHRYYIORKR-WHFBIAKZSA-N 0.000 description 4
- BZMWJLLUAKSIMH-FXQIFTODSA-N Asn-Glu-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O BZMWJLLUAKSIMH-FXQIFTODSA-N 0.000 description 4
- RAUPFUCUDBQYHE-AVGNSLFASA-N Asn-Phe-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O RAUPFUCUDBQYHE-AVGNSLFASA-N 0.000 description 4
- XOQYDFCQPWAMSA-KKHAAJSZSA-N Asn-Val-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XOQYDFCQPWAMSA-KKHAAJSZSA-N 0.000 description 4
- POTCZYQVVNXUIG-BQBZGAKWSA-N Asp-Gly-Pro Chemical compound OC(=O)C[C@H](N)C(=O)NCC(=O)N1CCC[C@H]1C(O)=O POTCZYQVVNXUIG-BQBZGAKWSA-N 0.000 description 4
- AYFVRYXNDHBECD-YUMQZZPRSA-N Asp-Leu-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O AYFVRYXNDHBECD-YUMQZZPRSA-N 0.000 description 4
- 241000713838 Avian myeloblastosis virus Species 0.000 description 4
- 241001227713 Chiron Species 0.000 description 4
- KZZYVYWSXMFYEC-DCAQKATOSA-N Cys-Val-Leu Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O KZZYVYWSXMFYEC-DCAQKATOSA-N 0.000 description 4
- 239000003155 DNA primer Substances 0.000 description 4
- BWGNESOTFCXPMA-UHFFFAOYSA-N Dihydrogen disulfide Chemical compound SS BWGNESOTFCXPMA-UHFFFAOYSA-N 0.000 description 4
- HPCOBEHVEHWREJ-DCAQKATOSA-N Gln-Lys-Glu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O HPCOBEHVEHWREJ-DCAQKATOSA-N 0.000 description 4
- XQHSBNVACKQWAV-WHFBIAKZSA-N Gly-Asp-Asn Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O XQHSBNVACKQWAV-WHFBIAKZSA-N 0.000 description 4
- SWQALSGKVLYKDT-ZKWXMUAHSA-N Gly-Ile-Ala Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O SWQALSGKVLYKDT-ZKWXMUAHSA-N 0.000 description 4
- SWQALSGKVLYKDT-UHFFFAOYSA-N Gly-Ile-Ala Natural products NCC(=O)NC(C(C)CC)C(=O)NC(C)C(O)=O SWQALSGKVLYKDT-UHFFFAOYSA-N 0.000 description 4
- AAHSHTLISQUZJL-QSFUFRPTSA-N Gly-Ile-Ile Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O AAHSHTLISQUZJL-QSFUFRPTSA-N 0.000 description 4
- BMWFDYIYBAFROD-WPRPVWTQSA-N Gly-Pro-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)CN BMWFDYIYBAFROD-WPRPVWTQSA-N 0.000 description 4
- ZVXMEWXHFBYJPI-LSJOCFKGSA-N Gly-Val-Ile Chemical compound [H]NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O ZVXMEWXHFBYJPI-LSJOCFKGSA-N 0.000 description 4
- JBCLFWXMTIKCCB-UHFFFAOYSA-N H-Gly-Phe-OH Natural products NCC(=O)NC(C(O)=O)CC1=CC=CC=C1 JBCLFWXMTIKCCB-UHFFFAOYSA-N 0.000 description 4
- WEIYKCOEVBUJQC-JYJNAYRXSA-N His-Glu-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC2=CN=CN2)N WEIYKCOEVBUJQC-JYJNAYRXSA-N 0.000 description 4
- RAVLQPXCMRCLKT-KBPBESRZSA-N His-Gly-Phe Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)NCC(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O RAVLQPXCMRCLKT-KBPBESRZSA-N 0.000 description 4
- CGAMSLMBYJHMDY-ONGXEEELSA-N His-Val-Gly Chemical compound CC(C)[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CC1=CN=CN1)N CGAMSLMBYJHMDY-ONGXEEELSA-N 0.000 description 4
- KIMHKBDJQQYLHU-PEFMBERDSA-N Ile-Glu-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N KIMHKBDJQQYLHU-PEFMBERDSA-N 0.000 description 4
- YNMQUIVKEFRCPH-QSFUFRPTSA-N Ile-Ile-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(=O)O)N YNMQUIVKEFRCPH-QSFUFRPTSA-N 0.000 description 4
- PARSHQDZROHERM-NHCYSSNCSA-N Ile-Lys-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)NCC(=O)O)N PARSHQDZROHERM-NHCYSSNCSA-N 0.000 description 4
- CKLJMWTZIZZHCS-REOHCLBHSA-N L-aspartic acid Chemical compound OC(=O)[C@@H](N)CC(O)=O CKLJMWTZIZZHCS-REOHCLBHSA-N 0.000 description 4
- ZDXPYRJPNDTMRX-VKHMYHEASA-N L-glutamine Chemical compound OC(=O)[C@@H](N)CCC(N)=O ZDXPYRJPNDTMRX-VKHMYHEASA-N 0.000 description 4
- AGPKZVBTJJNPAG-WHFBIAKZSA-N L-isoleucine Chemical compound CC[C@H](C)[C@H](N)C(O)=O AGPKZVBTJJNPAG-WHFBIAKZSA-N 0.000 description 4
- COLNVLDHVKWLRT-QMMMGPOBSA-N L-phenylalanine Chemical compound OC(=O)[C@@H](N)CC1=CC=CC=C1 COLNVLDHVKWLRT-QMMMGPOBSA-N 0.000 description 4
- OUYCCCASQSFEME-QMMMGPOBSA-N L-tyrosine Chemical compound OC(=O)[C@@H](N)CC1=CC=C(O)C=C1 OUYCCCASQSFEME-QMMMGPOBSA-N 0.000 description 4
- KVRKAGGMEWNURO-CIUDSAMLSA-N Leu-Ala-Cys Chemical compound C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC(C)C)N KVRKAGGMEWNURO-CIUDSAMLSA-N 0.000 description 4
- CCQLQKZTXZBXTN-NHCYSSNCSA-N Leu-Gly-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)NCC(=O)N[C@@H]([C@@H](C)CC)C(O)=O CCQLQKZTXZBXTN-NHCYSSNCSA-N 0.000 description 4
- IDGZVZJLYFTXSL-DCAQKATOSA-N Leu-Ser-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCN=C(N)N IDGZVZJLYFTXSL-DCAQKATOSA-N 0.000 description 4
- LFSQWRSVPNKJGP-WDCWCFNPSA-N Leu-Thr-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCC(O)=O LFSQWRSVPNKJGP-WDCWCFNPSA-N 0.000 description 4
- IRNSXVOWSXSULE-DCAQKATOSA-N Lys-Ala-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCCN IRNSXVOWSXSULE-DCAQKATOSA-N 0.000 description 4
- OVIVOCSURJYCTM-GUBZILKMSA-N Lys-Asp-Glu Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCC(O)=O OVIVOCSURJYCTM-GUBZILKMSA-N 0.000 description 4
- AAORVPFVUIHEAB-YUMQZZPRSA-N Lys-Asp-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O AAORVPFVUIHEAB-YUMQZZPRSA-N 0.000 description 4
- ISHNZELVUVPCHY-ZETCQYMHSA-N Lys-Gly-Gly Chemical compound NCCCC[C@H](N)C(=O)NCC(=O)NCC(O)=O ISHNZELVUVPCHY-ZETCQYMHSA-N 0.000 description 4
- VLMNBMFYRMGEMB-QWRGUYRKSA-N Lys-His-Gly Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CNC=N1 VLMNBMFYRMGEMB-QWRGUYRKSA-N 0.000 description 4
- HUKLXYYPZWPXCC-KZVJFYERSA-N Met-Ala-Thr Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O HUKLXYYPZWPXCC-KZVJFYERSA-N 0.000 description 4
- 241000699666 Mus <mouse, genus> Species 0.000 description 4
- YBAFDPFAUTYYRW-UHFFFAOYSA-N N-L-alpha-glutamyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CCC(O)=O YBAFDPFAUTYYRW-UHFFFAOYSA-N 0.000 description 4
- 108010079364 N-glycylalanine Proteins 0.000 description 4
- 101100068676 Neurospora crassa (strain ATCC 24698 / 74-OR23-1A / CBS 708.71 / DSM 1257 / FGSC 987) gln-1 gene Proteins 0.000 description 4
- WGXOKDLDIWSOCV-MELADBBJSA-N Phe-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC2=CC=CC=C2)N)C(=O)O WGXOKDLDIWSOCV-MELADBBJSA-N 0.000 description 4
- 239000004365 Protease Substances 0.000 description 4
- 241000700159 Rattus Species 0.000 description 4
- MUARUIBTKQJKFY-WHFBIAKZSA-N Ser-Gly-Asp Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O MUARUIBTKQJKFY-WHFBIAKZSA-N 0.000 description 4
- PCJLFYBAQZQOFE-KATARQTJSA-N Ser-Thr-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CO)N)O PCJLFYBAQZQOFE-KATARQTJSA-N 0.000 description 4
- MFQMZDPAZRZAPV-NAKRPEOUSA-N Ser-Val-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CO)N MFQMZDPAZRZAPV-NAKRPEOUSA-N 0.000 description 4
- FAPWRFPIFSIZLT-UHFFFAOYSA-M Sodium chloride Chemical compound [Na+].[Cl-] FAPWRFPIFSIZLT-UHFFFAOYSA-M 0.000 description 4
- TYVAWPFQYFPSBR-BFHQHQDPSA-N Thr-Ala-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)NCC(O)=O TYVAWPFQYFPSBR-BFHQHQDPSA-N 0.000 description 4
- IQFYYKKMVGJFEH-XLPZGREQSA-N Thymidine Chemical compound O=C1NC(=O)C(C)=CN1[C@@H]1O[C@H](CO)[C@@H](O)C1 IQFYYKKMVGJFEH-XLPZGREQSA-N 0.000 description 4
- SCQBNMKLZVCXNX-ZFWWWQNUSA-N Trp-Arg-Gly Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)NCC(=O)O)N SCQBNMKLZVCXNX-ZFWWWQNUSA-N 0.000 description 4
- NXQAOORHSYJRGH-AAEUAGOBSA-N Trp-Gly-Ser Chemical compound C1=CC=C2C(C[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O)=CNC2=C1 NXQAOORHSYJRGH-AAEUAGOBSA-N 0.000 description 4
- SSKKGOWRPNIVDW-AVGNSLFASA-N Val-Val-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N SSKKGOWRPNIVDW-AVGNSLFASA-N 0.000 description 4
- 108010024078 alanyl-glycyl-serine Proteins 0.000 description 4
- 108010040443 aspartyl-aspartic acid Proteins 0.000 description 4
- 108010068265 aspartyltyrosine Proteins 0.000 description 4
- 230000001580 bacterial effect Effects 0.000 description 4
- 239000000969 carrier Substances 0.000 description 4
- 150000001875 compounds Chemical class 0.000 description 4
- 108010069495 cysteinyltyrosine Proteins 0.000 description 4
- 238000004925 denaturation Methods 0.000 description 4
- 230000036425 denaturation Effects 0.000 description 4
- 238000013461 design Methods 0.000 description 4
- 230000001747 exhibiting effect Effects 0.000 description 4
- 108010081551 glycylphenylalanine Proteins 0.000 description 4
- FDGQSTZJBFJUBT-UHFFFAOYSA-N hypoxanthine Chemical compound O=C1NC=NC2=C1NC=N2 FDGQSTZJBFJUBT-UHFFFAOYSA-N 0.000 description 4
- 230000001900 immune effect Effects 0.000 description 4
- 238000011534 incubation Methods 0.000 description 4
- 108010064235 lysylglycine Proteins 0.000 description 4
- 108010038320 lysylphenylalanine Proteins 0.000 description 4
- 229920002521 macromolecule Polymers 0.000 description 4
- 230000008569 process Effects 0.000 description 4
- 230000010076 replication Effects 0.000 description 4
- 238000011160 research Methods 0.000 description 4
- 238000012163 sequencing technique Methods 0.000 description 4
- 108010048818 seryl-histidine Proteins 0.000 description 4
- 108010026333 seryl-proline Proteins 0.000 description 4
- 239000011780 sodium chloride Substances 0.000 description 4
- 210000004989 spleen cell Anatomy 0.000 description 4
- 239000000758 substrate Substances 0.000 description 4
- 230000002194 synthesizing effect Effects 0.000 description 4
- 239000001226 triphosphate Substances 0.000 description 4
- 235000011178 triphosphate Nutrition 0.000 description 4
- 229960005486 vaccine Drugs 0.000 description 4
- 108010073969 valyllysine Proteins 0.000 description 4
- BZTDTCNHAFUJOG-UHFFFAOYSA-N 6-carboxyfluorescein Chemical compound C12=CC=C(O)C=C2OC2=CC(O)=CC=C2C11OC(=O)C2=CC=C(C(=O)O)C=C21 BZTDTCNHAFUJOG-UHFFFAOYSA-N 0.000 description 3
- XSTZMVAYYCJTNR-DCAQKATOSA-N Ala-Met-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(O)=O XSTZMVAYYCJTNR-DCAQKATOSA-N 0.000 description 3
- DPLFNLDACGGBAK-KKUMJFAQSA-N Arg-Phe-Glu Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N DPLFNLDACGGBAK-KKUMJFAQSA-N 0.000 description 3
- REQUGIWGOGSOEZ-ZLUOBGJFSA-N Asn-Ser-Asn Chemical compound C([C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(=O)N)C(=O)O)N)C(=O)N REQUGIWGOGSOEZ-ZLUOBGJFSA-N 0.000 description 3
- NZWDWXSWUQCNMG-GARJFASQSA-N Asp-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)[C@H](CC(=O)O)N)C(=O)O NZWDWXSWUQCNMG-GARJFASQSA-N 0.000 description 3
- GGRSYTUJHAZTFN-IHRRRGAJSA-N Asp-Pro-Tyr Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CC(=O)O)N)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O GGRSYTUJHAZTFN-IHRRRGAJSA-N 0.000 description 3
- PDIYGFYAMZZFCW-JIOCBJNQSA-N Asp-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)O)N)O PDIYGFYAMZZFCW-JIOCBJNQSA-N 0.000 description 3
- 108090001008 Avidin Proteins 0.000 description 3
- 241000283707 Capra Species 0.000 description 3
- 241000701832 Enterobacteria phage T3 Species 0.000 description 3
- ATTWDCRXQNKRII-GUBZILKMSA-N Gln-Lys-Cys Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCC(=O)N)N ATTWDCRXQNKRII-GUBZILKMSA-N 0.000 description 3
- JTWZNMUVQWWGOX-SOUVJXGZSA-N Gln-Tyr-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=C(C=C2)O)NC(=O)[C@H](CCC(=O)N)N)C(=O)O JTWZNMUVQWWGOX-SOUVJXGZSA-N 0.000 description 3
- PBFGQTGPSKWHJA-QEJZJMRPSA-N Glu-Asp-Trp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O PBFGQTGPSKWHJA-QEJZJMRPSA-N 0.000 description 3
- NJCALAAIGREHDR-WDCWCFNPSA-N Glu-Leu-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O NJCALAAIGREHDR-WDCWCFNPSA-N 0.000 description 3
- MFNUFCFRAZPJFW-JYJNAYRXSA-N Glu-Lys-Phe Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 MFNUFCFRAZPJFW-JYJNAYRXSA-N 0.000 description 3
- CUYLIWAAAYJKJH-RYUDHWBXSA-N Gly-Glu-Tyr Chemical compound NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 CUYLIWAAAYJKJH-RYUDHWBXSA-N 0.000 description 3
- XMPXVJIDADUOQB-RCOVLWMOSA-N Gly-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C([O-])=O)NC(=O)CNC(=O)C[NH3+] XMPXVJIDADUOQB-RCOVLWMOSA-N 0.000 description 3
- SCWYHUQOOFRVHP-MBLNEYKQSA-N Gly-Ile-Thr Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SCWYHUQOOFRVHP-MBLNEYKQSA-N 0.000 description 3
- 239000004471 Glycine Substances 0.000 description 3
- MBSSHYPAEHPSGY-LSJOCFKGSA-N His-Ala-Met Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(=O)N[C@@H](CCSC)C(O)=O MBSSHYPAEHPSGY-LSJOCFKGSA-N 0.000 description 3
- NWGXCPUKPVISSJ-AVGNSLFASA-N His-Gln-Lys Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N NWGXCPUKPVISSJ-AVGNSLFASA-N 0.000 description 3
- RGPWUJOMKFYFSR-QWRGUYRKSA-N His-Gly-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O RGPWUJOMKFYFSR-QWRGUYRKSA-N 0.000 description 3
- JSQIXEHORHLQEE-MEYUZBJRSA-N His-Phe-Thr Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O JSQIXEHORHLQEE-MEYUZBJRSA-N 0.000 description 3
- KAXZXLSXFWSNNZ-XVYDVKMFSA-N His-Ser-Ala Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O KAXZXLSXFWSNNZ-XVYDVKMFSA-N 0.000 description 3
- ZHHLTWUOWXHVQJ-YUMQZZPRSA-N His-Ser-Gly Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CO)C(=O)NCC(=O)O)N ZHHLTWUOWXHVQJ-YUMQZZPRSA-N 0.000 description 3
- DFFTXLCCDFYRKD-MBLNEYKQSA-N Ile-Gly-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(=O)O)N DFFTXLCCDFYRKD-MBLNEYKQSA-N 0.000 description 3
- MLSUZXHSNRBDCI-CYDGBPFRSA-N Ile-Pro-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)O)N MLSUZXHSNRBDCI-CYDGBPFRSA-N 0.000 description 3
- CNMOKANDJMLAIF-CIQUZCHMSA-N Ile-Thr-Ala Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O CNMOKANDJMLAIF-CIQUZCHMSA-N 0.000 description 3
- YBKKLDBBPFIXBQ-MBLNEYKQSA-N Ile-Thr-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(=O)O)N YBKKLDBBPFIXBQ-MBLNEYKQSA-N 0.000 description 3
- 108060003951 Immunoglobulin Proteins 0.000 description 3
- XUJNEKJLAYXESH-REOHCLBHSA-N L-Cysteine Chemical compound SC[C@H](N)C(O)=O XUJNEKJLAYXESH-REOHCLBHSA-N 0.000 description 3
- FADYJNXDPBKVCA-UHFFFAOYSA-N L-Phenylalanyl-L-lysin Natural products NCCCCC(C(O)=O)NC(=O)C(N)CC1=CC=CC=C1 FADYJNXDPBKVCA-UHFFFAOYSA-N 0.000 description 3
- DCXYFEDJOCDNAF-REOHCLBHSA-N L-asparagine Chemical compound OC(=O)[C@@H](N)CC(N)=O DCXYFEDJOCDNAF-REOHCLBHSA-N 0.000 description 3
- HNDVDQJCIGZPNO-YFKPBYRVSA-N L-histidine Chemical compound OC(=O)[C@@H](N)CC1=CN=CN1 HNDVDQJCIGZPNO-YFKPBYRVSA-N 0.000 description 3
- SENJXOPIZNYLHU-UHFFFAOYSA-N L-leucyl-L-arginine Natural products CC(C)CC(N)C(=O)NC(C(O)=O)CCCN=C(N)N SENJXOPIZNYLHU-UHFFFAOYSA-N 0.000 description 3
- FFEARJCKVFRZRR-BYPYZUCNSA-N L-methionine Chemical compound CSCC[C@H](N)C(O)=O FFEARJCKVFRZRR-BYPYZUCNSA-N 0.000 description 3
- QIVBCDIJIAJPQS-VIFPVBQESA-N L-tryptophane Chemical compound C1=CC=C2C(C[C@H](N)C(O)=O)=CNC2=C1 QIVBCDIJIAJPQS-VIFPVBQESA-N 0.000 description 3
- KZSNJWFQEVHDMF-BYPYZUCNSA-N L-valine Chemical compound CC(C)[C@H](N)C(O)=O KZSNJWFQEVHDMF-BYPYZUCNSA-N 0.000 description 3
- PJYSOYLLTJKZHC-GUBZILKMSA-N Leu-Asp-Gln Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCC(N)=O PJYSOYLLTJKZHC-GUBZILKMSA-N 0.000 description 3
- WQWSMEOYXJTFRU-GUBZILKMSA-N Leu-Glu-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O WQWSMEOYXJTFRU-GUBZILKMSA-N 0.000 description 3
- ILDSIMPXNFWKLH-KATARQTJSA-N Leu-Thr-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O ILDSIMPXNFWKLH-KATARQTJSA-N 0.000 description 3
- 239000006137 Luria-Bertani broth Substances 0.000 description 3
- NCZIQZYZPUPMKY-PPCPHDFISA-N Lys-Ile-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O NCZIQZYZPUPMKY-PPCPHDFISA-N 0.000 description 3
- MUXNCRWTWBMNHX-SRVKXCTJSA-N Lys-Leu-Asp Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O MUXNCRWTWBMNHX-SRVKXCTJSA-N 0.000 description 3
- 239000004472 Lysine Substances 0.000 description 3
- BLIPQDLSCFGUFA-GUBZILKMSA-N Met-Arg-Asn Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(O)=O BLIPQDLSCFGUFA-GUBZILKMSA-N 0.000 description 3
- VYDLZDRMOFYOGV-TUAOUCFPSA-N Met-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCSC)N VYDLZDRMOFYOGV-TUAOUCFPSA-N 0.000 description 3
- WUGMRIBZSVSJNP-UHFFFAOYSA-N N-L-alanyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)C)C(O)=O)=CNC2=C1 WUGMRIBZSVSJNP-UHFFFAOYSA-N 0.000 description 3
- 101710189818 Non-structural protein 2a Proteins 0.000 description 3
- 241000283973 Oryctolagus cuniculus Species 0.000 description 3
- 241001494479 Pecora Species 0.000 description 3
- 108091005804 Peptidases Proteins 0.000 description 3
- 108091093037 Peptide nucleic acid Proteins 0.000 description 3
- RAGOJJCBGXARPO-XVSYOHENSA-N Phe-Thr-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H]([C@H](O)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 RAGOJJCBGXARPO-XVSYOHENSA-N 0.000 description 3
- FRMKIPSIZSFTTE-HJOGWXRNSA-N Phe-Tyr-Phe Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O FRMKIPSIZSFTTE-HJOGWXRNSA-N 0.000 description 3
- 239000004793 Polystyrene Substances 0.000 description 3
- 102100037486 Reverse transcriptase/ribonuclease H Human genes 0.000 description 3
- GXXTUIUYTWGPMV-FXQIFTODSA-N Ser-Arg-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(O)=O GXXTUIUYTWGPMV-FXQIFTODSA-N 0.000 description 3
- ICHZYBVODUVUKN-SRVKXCTJSA-N Ser-Asn-Tyr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O ICHZYBVODUVUKN-SRVKXCTJSA-N 0.000 description 3
- UGHCUDLCCVVIJR-VGDYDELISA-N Ser-His-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CO)N UGHCUDLCCVVIJR-VGDYDELISA-N 0.000 description 3
- NADLKBTYNKUJEP-KATARQTJSA-N Ser-Thr-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O NADLKBTYNKUJEP-KATARQTJSA-N 0.000 description 3
- UKKROEYWYIHWBD-ZKWXMUAHSA-N Ser-Val-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O UKKROEYWYIHWBD-ZKWXMUAHSA-N 0.000 description 3
- MTCFGRXMJLQNBG-UHFFFAOYSA-N Serine Natural products OCC(N)C(O)=O MTCFGRXMJLQNBG-UHFFFAOYSA-N 0.000 description 3
- FQPQPTHMHZKGFM-XQXXSGGOSA-N Thr-Ala-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O FQPQPTHMHZKGFM-XQXXSGGOSA-N 0.000 description 3
- SAKLWFSRZTZQAJ-GQGQLFGLSA-N Trp-Ile-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N SAKLWFSRZTZQAJ-GQGQLFGLSA-N 0.000 description 3
- QIVBCDIJIAJPQS-UHFFFAOYSA-N Tryptophan Natural products C1=CC=C2C(CC(N)C(O)=O)=CNC2=C1 QIVBCDIJIAJPQS-UHFFFAOYSA-N 0.000 description 3
- CKKFTIQYURNSEI-IHRRRGAJSA-N Tyr-Asn-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 CKKFTIQYURNSEI-IHRRRGAJSA-N 0.000 description 3
- CWVHKVVKAQIJKY-ACRUOGEOSA-N Tyr-Lys-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC2=CC=C(C=C2)O)N CWVHKVVKAQIJKY-ACRUOGEOSA-N 0.000 description 3
- QHONGSVIVOFKAC-ULQDDVLXSA-N Tyr-Pro-His Chemical compound N[C@@H](Cc1ccc(O)cc1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](Cc1cnc[nH]1)C(O)=O QHONGSVIVOFKAC-ULQDDVLXSA-N 0.000 description 3
- LNYOXPDEIZJDEI-NHCYSSNCSA-N Val-Asn-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](C(C)C)N LNYOXPDEIZJDEI-NHCYSSNCSA-N 0.000 description 3
- OVLIFGQSBSNGHY-KKHAAJSZSA-N Val-Asp-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](C(C)C)N)O OVLIFGQSBSNGHY-KKHAAJSZSA-N 0.000 description 3
- CPTQYHDSVGVGDZ-UKJIMTQDSA-N Val-Gln-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](C(C)C)N CPTQYHDSVGVGDZ-UKJIMTQDSA-N 0.000 description 3
- LAYSXAOGWHKNED-XPUUQOCRSA-N Val-Gly-Ser Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O LAYSXAOGWHKNED-XPUUQOCRSA-N 0.000 description 3
- OJOMXGVLFKYDKP-QXEWZRGKSA-N Val-Met-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(=O)O)C(=O)O)N OJOMXGVLFKYDKP-QXEWZRGKSA-N 0.000 description 3
- YQYFYUSYEDNLSD-YEPSODPASA-N Val-Thr-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O YQYFYUSYEDNLSD-YEPSODPASA-N 0.000 description 3
- 239000012491 analyte Substances 0.000 description 3
- 230000000692 anti-sense effect Effects 0.000 description 3
- 108010072041 arginyl-glycyl-aspartic acid Proteins 0.000 description 3
- 108010092854 aspartyllysine Proteins 0.000 description 3
- 239000000872 buffer Substances 0.000 description 3
- 229920002678 cellulose Polymers 0.000 description 3
- 239000001913 cellulose Substances 0.000 description 3
- 230000002950 deficient Effects 0.000 description 3
- 239000005547 deoxyribonucleotide Substances 0.000 description 3
- 238000011161 development Methods 0.000 description 3
- GNBHRKFJIUUOQI-UHFFFAOYSA-N fluorescein Chemical compound O1C(=O)C2=CC=CC=C2C21C1=CC=C(O)C=C1OC1=CC(O)=CC=C21 GNBHRKFJIUUOQI-UHFFFAOYSA-N 0.000 description 3
- 239000007850 fluorescent dye Substances 0.000 description 3
- 235000013922 glutamic acid Nutrition 0.000 description 3
- 239000004220 glutamic acid Substances 0.000 description 3
- ZDXPYRJPNDTMRX-UHFFFAOYSA-N glutamine Natural products OC(=O)C(N)CCC(N)=O ZDXPYRJPNDTMRX-UHFFFAOYSA-N 0.000 description 3
- 235000004554 glutamine Nutrition 0.000 description 3
- 108010008237 glutamyl-valyl-glycine Proteins 0.000 description 3
- 108010090037 glycyl-alanyl-isoleucine Proteins 0.000 description 3
- 108010010147 glycylglutamine Proteins 0.000 description 3
- 108010036413 histidylglycine Proteins 0.000 description 3
- 230000028993 immune response Effects 0.000 description 3
- 102000018358 immunoglobulin Human genes 0.000 description 3
- 238000001727 in vivo Methods 0.000 description 3
- 230000000977 initiatory effect Effects 0.000 description 3
- 229960000310 isoleucine Drugs 0.000 description 3
- AGPKZVBTJJNPAG-UHFFFAOYSA-N isoleucine Natural products CCC(C)C(N)C(O)=O AGPKZVBTJJNPAG-UHFFFAOYSA-N 0.000 description 3
- 108010090333 leucyl-lysyl-proline Proteins 0.000 description 3
- 108010047926 leucyl-lysyl-tyrosine Proteins 0.000 description 3
- 108010073472 leucyl-prolyl-proline Proteins 0.000 description 3
- 108010000761 leucylarginine Proteins 0.000 description 3
- 108010057821 leucylproline Proteins 0.000 description 3
- 239000011159 matrix material Substances 0.000 description 3
- 150000002739 metals Chemical class 0.000 description 3
- MYWUZJCMWCOHBA-VIFPVBQESA-N methamphetamine Chemical compound CN[C@@H](C)CC1=CC=CC=C1 MYWUZJCMWCOHBA-VIFPVBQESA-N 0.000 description 3
- 229930182817 methionine Natural products 0.000 description 3
- 108010085203 methionylmethionine Proteins 0.000 description 3
- 238000006384 oligomerization reaction Methods 0.000 description 3
- COLNVLDHVKWLRT-UHFFFAOYSA-N phenylalanine Natural products OC(=O)C(N)CC1=CC=CC=C1 COLNVLDHVKWLRT-UHFFFAOYSA-N 0.000 description 3
- 108010089198 phenylalanyl-prolyl-arginine Proteins 0.000 description 3
- 229920002223 polystyrene Polymers 0.000 description 3
- 235000019419 proteases Nutrition 0.000 description 3
- 238000010791 quenching Methods 0.000 description 3
- 230000002285 radioactive effect Effects 0.000 description 3
- 238000010839 reverse transcription Methods 0.000 description 3
- 150000003839 salts Chemical class 0.000 description 3
- 238000012216 screening Methods 0.000 description 3
- 230000003248 secreting effect Effects 0.000 description 3
- 230000009870 specific binding Effects 0.000 description 3
- 238000010561 standard procedure Methods 0.000 description 3
- 238000013519 translation Methods 0.000 description 3
- 230000014616 translation Effects 0.000 description 3
- OUYCCCASQSFEME-UHFFFAOYSA-N tyrosine Natural products OC(=O)C(N)CC1=CC=C(O)C=C1 OUYCCCASQSFEME-UHFFFAOYSA-N 0.000 description 3
- 239000004474 valine Substances 0.000 description 3
- 210000002845 virion Anatomy 0.000 description 3
- YBJHBAHKTGYVGT-ZKWXMUAHSA-N (+)-Biotin Chemical compound N1C(=O)N[C@@H]2[C@H](CCCCC(=O)O)SC[C@@H]21 YBJHBAHKTGYVGT-ZKWXMUAHSA-N 0.000 description 2
- QMXCRMQIVATQMR-UHFFFAOYSA-N (2,5-dioxopyrrolidin-1-yl) 3-pyridin-2-ylsulfanylpropanoate Chemical compound O=C1CCC(=O)N1OC(=O)CCSC1=CC=CC=N1 QMXCRMQIVATQMR-UHFFFAOYSA-N 0.000 description 2
- XUVKSPPGPPFPQN-UHFFFAOYSA-N 10-Methyl-9(10H)-acridone Chemical compound C1=CC=C2N(C)C3=CC=CC=C3C(=O)C2=C1 XUVKSPPGPPFPQN-UHFFFAOYSA-N 0.000 description 2
- 108010037497 3'-nucleotidase Proteins 0.000 description 2
- TVZGACDUOSZQKY-LBPRGKRZSA-N 4-aminofolic acid Chemical compound C1=NC2=NC(N)=NC(N)=C2N=C1CNC1=CC=C(C(=O)N[C@@H](CCC(O)=O)C(O)=O)C=C1 TVZGACDUOSZQKY-LBPRGKRZSA-N 0.000 description 2
- WQZIDRAQTRIQDX-UHFFFAOYSA-N 6-carboxy-x-rhodamine Chemical compound OC(=O)C1=CC=C(C([O-])=O)C=C1C(C1=CC=2CCCN3CCCC(C=23)=C1O1)=C2C1=C(CCC1)C3=[N+]1CCCC3=C2 WQZIDRAQTRIQDX-UHFFFAOYSA-N 0.000 description 2
- BUANFPRKJKJSRR-ACZMJKKPSA-N Ala-Ala-Gln Chemical compound C[C@H]([NH3+])C(=O)N[C@@H](C)C(=O)N[C@H](C([O-])=O)CCC(N)=O BUANFPRKJKJSRR-ACZMJKKPSA-N 0.000 description 2
- RLMISHABBKUNFO-WHFBIAKZSA-N Ala-Ala-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O RLMISHABBKUNFO-WHFBIAKZSA-N 0.000 description 2
- PBAMJJXWDQXOJA-FXQIFTODSA-N Ala-Asp-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N PBAMJJXWDQXOJA-FXQIFTODSA-N 0.000 description 2
- GWFSQQNGMPGBEF-GHCJXIJMSA-N Ala-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](C)N GWFSQQNGMPGBEF-GHCJXIJMSA-N 0.000 description 2
- MPLOSMWGDNJSEV-WHFBIAKZSA-N Ala-Gly-Asp Chemical compound [H]N[C@@H](C)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O MPLOSMWGDNJSEV-WHFBIAKZSA-N 0.000 description 2
- DCUCOIYYUBILPS-GUBZILKMSA-N Ala-Leu-Asp-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O DCUCOIYYUBILPS-GUBZILKMSA-N 0.000 description 2
- VCSABYLVNWQYQE-UHFFFAOYSA-N Ala-Lys-Lys Natural products NCCCCC(NC(=O)C(N)C)C(=O)NC(CCCCN)C(O)=O VCSABYLVNWQYQE-UHFFFAOYSA-N 0.000 description 2
- ADSGHMXEAZJJNF-DCAQKATOSA-N Ala-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](C)N ADSGHMXEAZJJNF-DCAQKATOSA-N 0.000 description 2
- YNOCMHZSWJMGBB-GCJQMDKQSA-N Ala-Thr-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(O)=O YNOCMHZSWJMGBB-GCJQMDKQSA-N 0.000 description 2
- ZCUFMRIQCPNOHZ-NRPADANISA-N Ala-Val-Gln Chemical compound C[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N ZCUFMRIQCPNOHZ-NRPADANISA-N 0.000 description 2
- 102000002260 Alkaline Phosphatase Human genes 0.000 description 2
- 108020004774 Alkaline Phosphatase Proteins 0.000 description 2
- VBFJESQBIWCWRL-DCAQKATOSA-N Arg-Ala-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCNC(N)=N VBFJESQBIWCWRL-DCAQKATOSA-N 0.000 description 2
- NABSCJGZKWSNHX-RCWTZXSCSA-N Arg-Arg-Thr Chemical compound NC(N)=NCCC[C@@H](C(=O)N[C@@H]([C@H](O)C)C(O)=O)NC(=O)[C@@H](N)CCCN=C(N)N NABSCJGZKWSNHX-RCWTZXSCSA-N 0.000 description 2
- UBCPNBUIQNMDNH-NAKRPEOUSA-N Arg-Ile-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O UBCPNBUIQNMDNH-NAKRPEOUSA-N 0.000 description 2
- OTZMRMHZCMZOJZ-SRVKXCTJSA-N Arg-Leu-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O OTZMRMHZCMZOJZ-SRVKXCTJSA-N 0.000 description 2
- UZGFHWIJWPUPOH-IHRRRGAJSA-N Arg-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N UZGFHWIJWPUPOH-IHRRRGAJSA-N 0.000 description 2
- NMRHDSAOIURTNT-RWMBFGLXSA-N Arg-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N NMRHDSAOIURTNT-RWMBFGLXSA-N 0.000 description 2
- GSUFZRURORXYTM-STQMWFEESA-N Arg-Phe-Gly Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CC=CC=C1 GSUFZRURORXYTM-STQMWFEESA-N 0.000 description 2
- ASQKVGRCKOFKIU-KZVJFYERSA-N Arg-Thr-Ala Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N)O ASQKVGRCKOFKIU-KZVJFYERSA-N 0.000 description 2
- PSUXEQYPYZLNER-QXEWZRGKSA-N Arg-Val-Asn Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O PSUXEQYPYZLNER-QXEWZRGKSA-N 0.000 description 2
- SUMJNGAMIQSNGX-TUAOUCFPSA-N Arg-Val-Pro Chemical compound CC(C)[C@H](NC(=O)[C@@H](N)CCCNC(N)=N)C(=O)N1CCC[C@@H]1C(O)=O SUMJNGAMIQSNGX-TUAOUCFPSA-N 0.000 description 2
- CPTXATAOUQJQRO-GUBZILKMSA-N Arg-Val-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O CPTXATAOUQJQRO-GUBZILKMSA-N 0.000 description 2
- 239000004475 Arginine Substances 0.000 description 2
- 206010003445 Ascites Diseases 0.000 description 2
- XWGJDUSDTRPQRK-ZLUOBGJFSA-N Asn-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC(N)=O XWGJDUSDTRPQRK-ZLUOBGJFSA-N 0.000 description 2
- QHBMKQWOIYJYMI-BYULHYEWSA-N Asn-Asn-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O QHBMKQWOIYJYMI-BYULHYEWSA-N 0.000 description 2
- YRTOMUMWSTUQAX-FXQIFTODSA-N Asn-Pro-Asp Chemical compound NC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O YRTOMUMWSTUQAX-FXQIFTODSA-N 0.000 description 2
- VCJCPARXDBEGNE-GUBZILKMSA-N Asn-Pro-Pro Chemical compound NC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 VCJCPARXDBEGNE-GUBZILKMSA-N 0.000 description 2
- CBWCQCANJSGUOH-ZKWXMUAHSA-N Asn-Val-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O CBWCQCANJSGUOH-ZKWXMUAHSA-N 0.000 description 2
- GHWWTICYPDKPTE-NGZCFLSTSA-N Asn-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)N)N GHWWTICYPDKPTE-NGZCFLSTSA-N 0.000 description 2
- LKIYSIYBKYLKPU-BIIVOSGPSA-N Asp-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC(=O)O)N)C(=O)O LKIYSIYBKYLKPU-BIIVOSGPSA-N 0.000 description 2
- DXQOQMCLWWADMU-ACZMJKKPSA-N Asp-Gln-Ser Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O DXQOQMCLWWADMU-ACZMJKKPSA-N 0.000 description 2
- TVIZQBFURPLQDV-DJFWLOJKSA-N Asp-His-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CC(=O)O)N TVIZQBFURPLQDV-DJFWLOJKSA-N 0.000 description 2
- LNENWJXDHCFVOF-DCAQKATOSA-N Asp-His-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CC(=O)O)N LNENWJXDHCFVOF-DCAQKATOSA-N 0.000 description 2
- QNMKWNONJGKJJC-NHCYSSNCSA-N Asp-Leu-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O QNMKWNONJGKJJC-NHCYSSNCSA-N 0.000 description 2
- FAUPLTGRUBTXNU-FXQIFTODSA-N Asp-Pro-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O FAUPLTGRUBTXNU-FXQIFTODSA-N 0.000 description 2
- DINOVZWPTMGSRF-QXEWZRGKSA-N Asp-Pro-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O DINOVZWPTMGSRF-QXEWZRGKSA-N 0.000 description 2
- DCXYFEDJOCDNAF-UHFFFAOYSA-N Asparagine Natural products OC(=O)C(N)CC(N)=O DCXYFEDJOCDNAF-UHFFFAOYSA-N 0.000 description 2
- 102000019260 B-Cell Antigen Receptors Human genes 0.000 description 2
- 108010012919 B-Cell Antigen Receptors Proteins 0.000 description 2
- DWRXFEITVBNRMK-UHFFFAOYSA-N Beta-D-1-Arabinofuranosylthymine Natural products O=C1NC(=O)C(C)=CN1C1C(O)C(O)C(CO)O1 DWRXFEITVBNRMK-UHFFFAOYSA-N 0.000 description 2
- NLCZGISONIGRQP-DCAQKATOSA-N Cys-Arg-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CS)N NLCZGISONIGRQP-DCAQKATOSA-N 0.000 description 2
- 108010042407 Endonucleases Proteins 0.000 description 2
- 108700039691 Genetic Promoter Regions Proteins 0.000 description 2
- XXLBHPPXDUWYAG-XQXXSGGOSA-N Gln-Ala-Thr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XXLBHPPXDUWYAG-XQXXSGGOSA-N 0.000 description 2
- ROHVCXBMIAAASL-HJGDQZAQSA-N Gln-Met-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](CCC(=O)N)N)O ROHVCXBMIAAASL-HJGDQZAQSA-N 0.000 description 2
- GHAXJVNBAKGWEJ-AVGNSLFASA-N Gln-Ser-Tyr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O GHAXJVNBAKGWEJ-AVGNSLFASA-N 0.000 description 2
- QGWXAMDECCKGRU-XVKPBYJWSA-N Gln-Val-Gly Chemical compound CC(C)[C@H](NC(=O)[C@@H](N)CCC(N)=O)C(=O)NCC(O)=O QGWXAMDECCKGRU-XVKPBYJWSA-N 0.000 description 2
- PCBBLFVHTYNQGG-LAEOZQHASA-N Glu-Asn-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCC(=O)O)N PCBBLFVHTYNQGG-LAEOZQHASA-N 0.000 description 2
- GYCPQVFKCPPRQB-GUBZILKMSA-N Glu-Gln-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCC(=O)O)N GYCPQVFKCPPRQB-GUBZILKMSA-N 0.000 description 2
- RFDHKPSHTXZKLL-IHRRRGAJSA-N Glu-Gln-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCC(=O)O)N RFDHKPSHTXZKLL-IHRRRGAJSA-N 0.000 description 2
- NKLRYVLERDYDBI-FXQIFTODSA-N Glu-Glu-Asp Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O NKLRYVLERDYDBI-FXQIFTODSA-N 0.000 description 2
- MUSGDMDGNGXULI-DCAQKATOSA-N Glu-Glu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCC(O)=O MUSGDMDGNGXULI-DCAQKATOSA-N 0.000 description 2
- WVYJNPCWJYBHJG-YVNDNENWSA-N Glu-Ile-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(N)=O)C(O)=O WVYJNPCWJYBHJG-YVNDNENWSA-N 0.000 description 2
- VMKCPNBBPGGQBJ-GUBZILKMSA-N Glu-Leu-Asn Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCC(=O)O)N VMKCPNBBPGGQBJ-GUBZILKMSA-N 0.000 description 2
- VGBSZQSKQRMLHD-MNXVOIDGSA-N Glu-Leu-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O VGBSZQSKQRMLHD-MNXVOIDGSA-N 0.000 description 2
- IVGJYOOGJLFKQE-AVGNSLFASA-N Glu-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)O)N IVGJYOOGJLFKQE-AVGNSLFASA-N 0.000 description 2
- BCYGDJXHAGZNPQ-DCAQKATOSA-N Glu-Lys-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O BCYGDJXHAGZNPQ-DCAQKATOSA-N 0.000 description 2
- GUOWMVFLAJNPDY-CIUDSAMLSA-N Glu-Ser-Met Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCSC)C(O)=O GUOWMVFLAJNPDY-CIUDSAMLSA-N 0.000 description 2
- UUTGYDAKPISJAO-JYJNAYRXSA-N Glu-Tyr-Leu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C(O)=O)CC1=CC=C(O)C=C1 UUTGYDAKPISJAO-JYJNAYRXSA-N 0.000 description 2
- KIEICAOUSNYOLM-NRPADANISA-N Glu-Val-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O KIEICAOUSNYOLM-NRPADANISA-N 0.000 description 2
- VSVZIEVNUYDAFR-YUMQZZPRSA-N Gly-Ala-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)CN VSVZIEVNUYDAFR-YUMQZZPRSA-N 0.000 description 2
- XPJBQTCXPJNIFE-ZETCQYMHSA-N Gly-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)CN XPJBQTCXPJNIFE-ZETCQYMHSA-N 0.000 description 2
- KAJAOGBVWCYGHZ-JTQLQIEISA-N Gly-Gly-Phe Chemical compound [NH3+]CC(=O)NCC(=O)N[C@H](C([O-])=O)CC1=CC=CC=C1 KAJAOGBVWCYGHZ-JTQLQIEISA-N 0.000 description 2
- FCKPEGOCSVZPNC-WHOFXGATSA-N Gly-Ile-Phe Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 FCKPEGOCSVZPNC-WHOFXGATSA-N 0.000 description 2
- VBOBNHSVQKKTOT-YUMQZZPRSA-N Gly-Lys-Ala Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O VBOBNHSVQKKTOT-YUMQZZPRSA-N 0.000 description 2
- PCPOYRCAHPJXII-UWVGGRQHSA-N Gly-Lys-Met Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCSC)C(O)=O PCPOYRCAHPJXII-UWVGGRQHSA-N 0.000 description 2
- NGRPGJGKJMUGDM-XVKPBYJWSA-N Gly-Val-Gln Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O NGRPGJGKJMUGDM-XVKPBYJWSA-N 0.000 description 2
- RVKIPWVMZANZLI-UHFFFAOYSA-N H-Lys-Trp-OH Natural products C1=CC=C2C(CC(NC(=O)C(N)CCCCN)C(O)=O)=CNC2=C1 RVKIPWVMZANZLI-UHFFFAOYSA-N 0.000 description 2
- 208000032843 Hemorrhage Diseases 0.000 description 2
- DEMIXZCKUXVEBO-BWAGICSOSA-N His-Thr-Tyr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)[C@H](CC2=CN=CN2)N)O DEMIXZCKUXVEBO-BWAGICSOSA-N 0.000 description 2
- MHAJPDPJQMAIIY-UHFFFAOYSA-N Hydrogen peroxide Chemical compound OO MHAJPDPJQMAIIY-UHFFFAOYSA-N 0.000 description 2
- UGQMRVRMYYASKQ-UHFFFAOYSA-N Hypoxanthine nucleoside Natural products OC1C(O)C(CO)OC1N1C(NC=NC2=O)=C2N=C1 UGQMRVRMYYASKQ-UHFFFAOYSA-N 0.000 description 2
- LQSBBHNVAVNZSX-GHCJXIJMSA-N Ile-Ala-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](CC(=O)N)C(=O)O)N LQSBBHNVAVNZSX-GHCJXIJMSA-N 0.000 description 2
- HDOYNXLPTRQLAD-JBDRJPRFSA-N Ile-Ala-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(=O)O)N HDOYNXLPTRQLAD-JBDRJPRFSA-N 0.000 description 2
- LOXMWQOKYBGCHF-JBDRJPRFSA-N Ile-Cys-Ala Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@@H](C)C(O)=O LOXMWQOKYBGCHF-JBDRJPRFSA-N 0.000 description 2
- DMZOUKXXHJQPTL-GRLWGSQLSA-N Ile-Gln-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N DMZOUKXXHJQPTL-GRLWGSQLSA-N 0.000 description 2
- PDTMWFVVNZYWTR-NHCYSSNCSA-N Ile-Gly-Lys Chemical compound CC[C@H](C)[C@H](N)C(=O)NCC(=O)N[C@@H](CCCCN)C(O)=O PDTMWFVVNZYWTR-NHCYSSNCSA-N 0.000 description 2
- BBQABUDWDUKJMB-LZXPERKUSA-N Ile-Ile-Ile Chemical compound CC[C@H](C)[C@H]([NH3+])C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C([O-])=O BBQABUDWDUKJMB-LZXPERKUSA-N 0.000 description 2
- NUKXXNFEUZGPRO-BJDJZHNGSA-N Ile-Leu-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CS)C(=O)O)N NUKXXNFEUZGPRO-BJDJZHNGSA-N 0.000 description 2
- UDBPXJNOEWDBDF-XUXIUFHCSA-N Ile-Lys-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(=O)O)N UDBPXJNOEWDBDF-XUXIUFHCSA-N 0.000 description 2
- NURNJECQNNCRBK-FLBSBUHZSA-N Ile-Thr-Thr Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O NURNJECQNNCRBK-FLBSBUHZSA-N 0.000 description 2
- SITWEMZOJNKJCH-UHFFFAOYSA-N L-alanine-L-arginine Natural products CC(N)C(=O)NC(C(O)=O)CCCNC(N)=N SITWEMZOJNKJCH-UHFFFAOYSA-N 0.000 description 2
- RCFDOSNHHZGBOY-UHFFFAOYSA-N L-isoleucyl-L-alanine Natural products CCC(C)C(N)C(=O)NC(C)C(O)=O RCFDOSNHHZGBOY-UHFFFAOYSA-N 0.000 description 2
- LJHGALIOHLRRQN-DCAQKATOSA-N Leu-Ala-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N LJHGALIOHLRRQN-DCAQKATOSA-N 0.000 description 2
- KWTVLKBOQATPHJ-SRVKXCTJSA-N Leu-Ala-Lys Chemical compound C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(C)C)N KWTVLKBOQATPHJ-SRVKXCTJSA-N 0.000 description 2
- BAJIJEGGUYXZGC-CIUDSAMLSA-N Leu-Asn-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CS)C(=O)O)N BAJIJEGGUYXZGC-CIUDSAMLSA-N 0.000 description 2
- WGNOPSQMIQERPK-UHFFFAOYSA-N Leu-Asn-Pro Natural products CC(C)CC(N)C(=O)NC(CC(=O)N)C(=O)N1CCCC1C(=O)O WGNOPSQMIQERPK-UHFFFAOYSA-N 0.000 description 2
- TWQIYNGNYNJUFM-NHCYSSNCSA-N Leu-Asn-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O TWQIYNGNYNJUFM-NHCYSSNCSA-N 0.000 description 2
- QCSFMCFHVGTLFF-NHCYSSNCSA-N Leu-Asp-Val Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O QCSFMCFHVGTLFF-NHCYSSNCSA-N 0.000 description 2
- WMTOVWLLDGQGCV-GUBZILKMSA-N Leu-Glu-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CS)C(=O)O)N WMTOVWLLDGQGCV-GUBZILKMSA-N 0.000 description 2
- OXRLYTYUXAQTHP-YUMQZZPRSA-N Leu-Gly-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)NCC(=O)N[C@@H](C)C(O)=O OXRLYTYUXAQTHP-YUMQZZPRSA-N 0.000 description 2
- LAPSXOAUPNOINL-YUMQZZPRSA-N Leu-Gly-Asp Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC(O)=O LAPSXOAUPNOINL-YUMQZZPRSA-N 0.000 description 2
- CFZZDVMBRYFFNU-QWRGUYRKSA-N Leu-His-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)NCC(O)=O CFZZDVMBRYFFNU-QWRGUYRKSA-N 0.000 description 2
- ONPJGOIVICHWBW-BZSNNMDCSA-N Leu-Lys-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 ONPJGOIVICHWBW-BZSNNMDCSA-N 0.000 description 2
- KTOIECMYZZGVSI-BZSNNMDCSA-N Leu-Phe-His Chemical compound C([C@H](NC(=O)[C@@H](N)CC(C)C)C(=O)N[C@@H](CC=1NC=NC=1)C(O)=O)C1=CC=CC=C1 KTOIECMYZZGVSI-BZSNNMDCSA-N 0.000 description 2
- PTRKPHUGYULXPU-KKUMJFAQSA-N Leu-Phe-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O PTRKPHUGYULXPU-KKUMJFAQSA-N 0.000 description 2
- MVHXGBZUJLWZOH-BJDJZHNGSA-N Leu-Ser-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O MVHXGBZUJLWZOH-BJDJZHNGSA-N 0.000 description 2
- XOWMDXHFSBCAKQ-SRVKXCTJSA-N Leu-Ser-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC(C)C XOWMDXHFSBCAKQ-SRVKXCTJSA-N 0.000 description 2
- BRTVHXHCUSXYRI-CIUDSAMLSA-N Leu-Ser-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O BRTVHXHCUSXYRI-CIUDSAMLSA-N 0.000 description 2
- SVBJIZVVYJYGLA-DCAQKATOSA-N Leu-Ser-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O SVBJIZVVYJYGLA-DCAQKATOSA-N 0.000 description 2
- LJBVRCDPWOJOEK-PPCPHDFISA-N Leu-Thr-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O LJBVRCDPWOJOEK-PPCPHDFISA-N 0.000 description 2
- GZRABTMNWJXFMH-UVOCVTCTSA-N Leu-Thr-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GZRABTMNWJXFMH-UVOCVTCTSA-N 0.000 description 2
- AIQWYVFNBNNOLU-RHYQMDGZSA-N Leu-Thr-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O AIQWYVFNBNNOLU-RHYQMDGZSA-N 0.000 description 2
- WUHBLPVELFTPQK-KKUMJFAQSA-N Leu-Tyr-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(O)=O WUHBLPVELFTPQK-KKUMJFAQSA-N 0.000 description 2
- YEIYAQQKADPIBJ-GARJFASQSA-N Lys-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCCCN)N)C(=O)O YEIYAQQKADPIBJ-GARJFASQSA-N 0.000 description 2
- DCRWPTBMWMGADO-AVGNSLFASA-N Lys-Glu-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O DCRWPTBMWMGADO-AVGNSLFASA-N 0.000 description 2
- ODUQLUADRKMHOZ-JYJNAYRXSA-N Lys-Glu-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCCCN)N)O ODUQLUADRKMHOZ-JYJNAYRXSA-N 0.000 description 2
- WAIHHELKYSFIQN-XUXIUFHCSA-N Lys-Ile-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O WAIHHELKYSFIQN-XUXIUFHCSA-N 0.000 description 2
- AIRZWUMAHCDDHR-KKUMJFAQSA-N Lys-Leu-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O AIRZWUMAHCDDHR-KKUMJFAQSA-N 0.000 description 2
- PLDJDCJLRCYPJB-VOAKCMCISA-N Lys-Lys-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PLDJDCJLRCYPJB-VOAKCMCISA-N 0.000 description 2
- HKXSZKJMDBHOTG-CIUDSAMLSA-N Lys-Ser-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CCCCN HKXSZKJMDBHOTG-CIUDSAMLSA-N 0.000 description 2
- PLOUVAYOMTYJRG-JXUBOQSCSA-N Lys-Thr-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O PLOUVAYOMTYJRG-JXUBOQSCSA-N 0.000 description 2
- GIKFNMZSGYAPEJ-HJGDQZAQSA-N Lys-Thr-Asp Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(O)=O GIKFNMZSGYAPEJ-HJGDQZAQSA-N 0.000 description 2
- PWPBGAJJYJJVPI-PJODQICGSA-N Met-Ala-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](C)NC(=O)[C@@H](N)CCSC)C(O)=O)=CNC2=C1 PWPBGAJJYJJVPI-PJODQICGSA-N 0.000 description 2
- CWFYZYQMUDWGTI-GUBZILKMSA-N Met-Arg-Asp Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O CWFYZYQMUDWGTI-GUBZILKMSA-N 0.000 description 2
- DBOMZJOESVYERT-GUBZILKMSA-N Met-Asn-Met Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCSC)C(=O)O)N DBOMZJOESVYERT-GUBZILKMSA-N 0.000 description 2
- FYRUJIJAUPHUNB-IUCAKERBSA-N Met-Gly-Arg Chemical compound CSCC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCNC(N)=N FYRUJIJAUPHUNB-IUCAKERBSA-N 0.000 description 2
- HZVXPUHLTZRQEL-UWVGGRQHSA-N Met-Leu-Gly Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O HZVXPUHLTZRQEL-UWVGGRQHSA-N 0.000 description 2
- 241000713869 Moloney murine leukemia virus Species 0.000 description 2
- 241000699670 Mus sp. Species 0.000 description 2
- XMBSYZWANAQXEV-UHFFFAOYSA-N N-alpha-L-glutamyl-L-phenylalanine Natural products OC(=O)CCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 XMBSYZWANAQXEV-UHFFFAOYSA-N 0.000 description 2
- 108010087066 N2-tryptophyllysine Proteins 0.000 description 2
- 101710188639 Non-structural protein 3d Proteins 0.000 description 2
- 108010038807 Oligopeptides Proteins 0.000 description 2
- 102000015636 Oligopeptides Human genes 0.000 description 2
- 238000012408 PCR amplification Methods 0.000 description 2
- HPECNYCQLSVCHH-BZSNNMDCSA-N Phe-Cys-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC2=CC=CC=C2)C(=O)O)N HPECNYCQLSVCHH-BZSNNMDCSA-N 0.000 description 2
- KAGCQPSEVAETCA-JYJNAYRXSA-N Phe-Gln-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC1=CC=CC=C1)N KAGCQPSEVAETCA-JYJNAYRXSA-N 0.000 description 2
- MFQXSDWKUXTOPZ-DZKIICNBSA-N Phe-Gln-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC1=CC=CC=C1)N MFQXSDWKUXTOPZ-DZKIICNBSA-N 0.000 description 2
- HGNGAMWHGGANAU-WHOFXGATSA-N Phe-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC1=CC=CC=C1 HGNGAMWHGGANAU-WHOFXGATSA-N 0.000 description 2
- KZRQONDKKJCAOL-DKIMLUQUSA-N Phe-Leu-Ile Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KZRQONDKKJCAOL-DKIMLUQUSA-N 0.000 description 2
- ROOQMPCUFLDOSB-FHWLQOOXSA-N Phe-Phe-Gln Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(=O)N[C@@H](CCC(N)=O)C(O)=O)C1=CC=CC=C1 ROOQMPCUFLDOSB-FHWLQOOXSA-N 0.000 description 2
- CKJACGQPCPMWIT-UFYCRDLUSA-N Phe-Pro-Phe Chemical compound C([C@H](N)C(=O)N1[C@@H](CCC1)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 CKJACGQPCPMWIT-UFYCRDLUSA-N 0.000 description 2
- UNBFGVQVQGXXCK-KKUMJFAQSA-N Phe-Ser-Leu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O UNBFGVQVQGXXCK-KKUMJFAQSA-N 0.000 description 2
- VIIRRNQMMIHYHQ-XHSDSOJGSA-N Phe-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CC=CC=C2)N VIIRRNQMMIHYHQ-XHSDSOJGSA-N 0.000 description 2
- 206010035226 Plasma cell myeloma Diseases 0.000 description 2
- 241000276498 Pollachius virens Species 0.000 description 2
- 239000002202 Polyethylene glycol Substances 0.000 description 2
- FKKHDBFNOLCYQM-FXQIFTODSA-N Pro-Cys-Ala Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CS)C(=O)N[C@@H](C)C(O)=O FKKHDBFNOLCYQM-FXQIFTODSA-N 0.000 description 2
- UEHYFUCOGHWASA-HJGDQZAQSA-N Pro-Glu-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H]1CCCN1 UEHYFUCOGHWASA-HJGDQZAQSA-N 0.000 description 2
- VYWNORHENYEQDW-YUMQZZPRSA-N Pro-Gly-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H]1CCCN1 VYWNORHENYEQDW-YUMQZZPRSA-N 0.000 description 2
- CLJLVCYFABNTHP-DCAQKATOSA-N Pro-Leu-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O CLJLVCYFABNTHP-DCAQKATOSA-N 0.000 description 2
- MRYUJHGPZQNOAD-IHRRRGAJSA-N Pro-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@@H]1CCCN1 MRYUJHGPZQNOAD-IHRRRGAJSA-N 0.000 description 2
- FKYKZHOKDOPHSA-DCAQKATOSA-N Pro-Leu-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O FKYKZHOKDOPHSA-DCAQKATOSA-N 0.000 description 2
- CPRLKHJUFAXVTD-ULQDDVLXSA-N Pro-Leu-Tyr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O CPRLKHJUFAXVTD-ULQDDVLXSA-N 0.000 description 2
- KWMZPPWYBVZIER-XGEHTFHBSA-N Pro-Ser-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KWMZPPWYBVZIER-XGEHTFHBSA-N 0.000 description 2
- HOJUNFDJDAPVBI-BZSNNMDCSA-N Pro-Trp-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@@H]3CCCN3 HOJUNFDJDAPVBI-BZSNNMDCSA-N 0.000 description 2
- FUOGXAQMNJMBFG-WPRPVWTQSA-N Pro-Val-Gly Chemical compound OC(=O)CNC(=O)[C@H](C(C)C)NC(=O)[C@@H]1CCCN1 FUOGXAQMNJMBFG-WPRPVWTQSA-N 0.000 description 2
- ONIBWKKTOPOVIA-UHFFFAOYSA-N Proline Natural products OC(=O)C1CCCN1 ONIBWKKTOPOVIA-UHFFFAOYSA-N 0.000 description 2
- 108091036333 Rapid DNA Proteins 0.000 description 2
- 108091028664 Ribonucleotide Proteins 0.000 description 2
- 238000012300 Sequence Analysis Methods 0.000 description 2
- WTWGOQRNRFHFQD-JBDRJPRFSA-N Ser-Ala-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WTWGOQRNRFHFQD-JBDRJPRFSA-N 0.000 description 2
- PZZJMBYSYAKYPK-UWJYBYFXSA-N Ser-Ala-Tyr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O PZZJMBYSYAKYPK-UWJYBYFXSA-N 0.000 description 2
- VQBLHWSPVYYZTB-DCAQKATOSA-N Ser-Arg-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CO)N VQBLHWSPVYYZTB-DCAQKATOSA-N 0.000 description 2
- OOKCGAYXSNJBGQ-ZLUOBGJFSA-N Ser-Asn-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O OOKCGAYXSNJBGQ-ZLUOBGJFSA-N 0.000 description 2
- ZXLUWXWISXIFIX-ACZMJKKPSA-N Ser-Asn-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O ZXLUWXWISXIFIX-ACZMJKKPSA-N 0.000 description 2
- WKLJLEXEENIYQE-SRVKXCTJSA-N Ser-Cys-Tyr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O WKLJLEXEENIYQE-SRVKXCTJSA-N 0.000 description 2
- ZFVFHHZBCVNLGD-GUBZILKMSA-N Ser-His-Gln Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(N)=O)C(O)=O ZFVFHHZBCVNLGD-GUBZILKMSA-N 0.000 description 2
- MOQDPPUMFSMYOM-KKUMJFAQSA-N Ser-His-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CC2=CN=CN2)NC(=O)[C@H](CO)N MOQDPPUMFSMYOM-KKUMJFAQSA-N 0.000 description 2
- DLPXTCTVNDTYGJ-JBDRJPRFSA-N Ser-Ile-Cys Chemical compound OC[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CS)C(O)=O DLPXTCTVNDTYGJ-JBDRJPRFSA-N 0.000 description 2
- CJINPXGSKSZQNE-KBIXCLLPSA-N Ser-Ile-Gln Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(N)=O)C(O)=O CJINPXGSKSZQNE-KBIXCLLPSA-N 0.000 description 2
- BYCVMHKULKRVPV-GUBZILKMSA-N Ser-Lys-Gln Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(O)=O BYCVMHKULKRVPV-GUBZILKMSA-N 0.000 description 2
- NQZFFLBPNDLTPO-DLOVCJGASA-N Ser-Phe-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](CO)N NQZFFLBPNDLTPO-DLOVCJGASA-N 0.000 description 2
- VLMIUSLQONKLDV-HEIBUPTGSA-N Ser-Thr-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O VLMIUSLQONKLDV-HEIBUPTGSA-N 0.000 description 2
- JZRYFUGREMECBH-XPUUQOCRSA-N Ser-Val-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O JZRYFUGREMECBH-XPUUQOCRSA-N 0.000 description 2
- SYCFMSYTIFXWAJ-DCAQKATOSA-N Ser-Val-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CO)N SYCFMSYTIFXWAJ-DCAQKATOSA-N 0.000 description 2
- SIEBDTCABMZCLF-XGEHTFHBSA-N Ser-Val-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SIEBDTCABMZCLF-XGEHTFHBSA-N 0.000 description 2
- 108020004682 Single-Stranded DNA Proteins 0.000 description 2
- OUUQCZGPVNCOIJ-UHFFFAOYSA-M Superoxide Chemical compound [O-][O] OUUQCZGPVNCOIJ-UHFFFAOYSA-M 0.000 description 2
- 108010006785 Taq Polymerase Proteins 0.000 description 2
- MQUZMZBFKCHVOB-HJGDQZAQSA-N Thr-Gln-Met Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCSC)C(O)=O MQUZMZBFKCHVOB-HJGDQZAQSA-N 0.000 description 2
- GKWNLDNXMMLRMC-GLLZPBPUSA-N Thr-Glu-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N)O GKWNLDNXMMLRMC-GLLZPBPUSA-N 0.000 description 2
- XTCNBOBTROGWMW-RWRJDSDZSA-N Thr-Ile-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H]([C@@H](C)O)N XTCNBOBTROGWMW-RWRJDSDZSA-N 0.000 description 2
- KKPOGALELPLJTL-MEYUZBJRSA-N Thr-Lys-Tyr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 KKPOGALELPLJTL-MEYUZBJRSA-N 0.000 description 2
- BIBYEFRASCNLAA-CDMKHQONSA-N Thr-Phe-Gly Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CC=CC=C1 BIBYEFRASCNLAA-CDMKHQONSA-N 0.000 description 2
- JMBRNXUOLJFURW-BEAPCOKYSA-N Thr-Phe-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N2CCC[C@@H]2C(=O)O)N)O JMBRNXUOLJFURW-BEAPCOKYSA-N 0.000 description 2
- MXNAOGFNFNKUPD-JHYOHUSXSA-N Thr-Phe-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MXNAOGFNFNKUPD-JHYOHUSXSA-N 0.000 description 2
- OLFOOYQTTQSSRK-UNQGMJICSA-N Thr-Pro-Phe Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 OLFOOYQTTQSSRK-UNQGMJICSA-N 0.000 description 2
- NQQMWWVVGIXUOX-SVSWQMSJSA-N Thr-Ser-Ile Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O NQQMWWVVGIXUOX-SVSWQMSJSA-N 0.000 description 2
- IEZVHOULSUULHD-XGEHTFHBSA-N Thr-Ser-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O IEZVHOULSUULHD-XGEHTFHBSA-N 0.000 description 2
- LECUEEHKUFYOOV-ZJDVBMNYSA-N Thr-Thr-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)O)NC(=O)[C@@H](N)[C@@H](C)O LECUEEHKUFYOOV-ZJDVBMNYSA-N 0.000 description 2
- AYFVYJQAPQTCCC-UHFFFAOYSA-N Threonine Natural products CC(O)C(N)C(O)=O AYFVYJQAPQTCCC-UHFFFAOYSA-N 0.000 description 2
- 239000004473 Threonine Substances 0.000 description 2
- BOMYCJXTWRMKJA-RNXOBYDBSA-N Trp-Phe-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CC=CC=C2)C(=O)O)NC(=O)[C@H](CC3=CNC4=CC=CC=C43)N BOMYCJXTWRMKJA-RNXOBYDBSA-N 0.000 description 2
- ZJPSMXCFEKMZFE-IHPCNDPISA-N Trp-Tyr-Ser Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(O)=O ZJPSMXCFEKMZFE-IHPCNDPISA-N 0.000 description 2
- DXYWRYQRKPIGGU-BPNCWPANSA-N Tyr-Ala-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 DXYWRYQRKPIGGU-BPNCWPANSA-N 0.000 description 2
- AKXBNSZMYAOGLS-STQMWFEESA-N Tyr-Arg-Gly Chemical compound NC(N)=NCCC[C@@H](C(=O)NCC(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 AKXBNSZMYAOGLS-STQMWFEESA-N 0.000 description 2
- CYDVHRFXDMDMGX-KKUMJFAQSA-N Tyr-Asn-His Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N)O CYDVHRFXDMDMGX-KKUMJFAQSA-N 0.000 description 2
- ARSHSYUZHSIYKR-ACRUOGEOSA-N Tyr-His-Phe Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O ARSHSYUZHSIYKR-ACRUOGEOSA-N 0.000 description 2
- JLKVWTICWVWGSK-JYJNAYRXSA-N Tyr-Lys-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 JLKVWTICWVWGSK-JYJNAYRXSA-N 0.000 description 2
- VBFVQTPETKJCQW-RPTUDFQQSA-N Tyr-Phe-Thr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O VBFVQTPETKJCQW-RPTUDFQQSA-N 0.000 description 2
- ITDWWLTTWRRLCC-KJEVXHAQSA-N Tyr-Thr-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@H]([C@H](O)C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 ITDWWLTTWRRLCC-KJEVXHAQSA-N 0.000 description 2
- ISAKRJDGNUQOIC-UHFFFAOYSA-N Uracil Chemical compound O=C1C=CNC(=O)N1 ISAKRJDGNUQOIC-UHFFFAOYSA-N 0.000 description 2
- PFNZJEPSCBAVGX-CYDGBPFRSA-N Val-Arg-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](C(C)C)N PFNZJEPSCBAVGX-CYDGBPFRSA-N 0.000 description 2
- IVXJODPZRWHCCR-JYJNAYRXSA-N Val-Arg-Phe Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N IVXJODPZRWHCCR-JYJNAYRXSA-N 0.000 description 2
- UDLYXGYWTVOIKU-QXEWZRGKSA-N Val-Asn-Arg Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N UDLYXGYWTVOIKU-QXEWZRGKSA-N 0.000 description 2
- QHDXUYOYTPWCSK-RCOVLWMOSA-N Val-Asp-Gly Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)NCC(=O)O)N QHDXUYOYTPWCSK-RCOVLWMOSA-N 0.000 description 2
- CELJCNRXKZPTCX-XPUUQOCRSA-N Val-Gly-Ala Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@@H](C)C(O)=O CELJCNRXKZPTCX-XPUUQOCRSA-N 0.000 description 2
- DJEVQCWNMQOABE-RCOVLWMOSA-N Val-Gly-Asp Chemical compound CC(C)[C@@H](C(=O)NCC(=O)N[C@@H](CC(=O)O)C(=O)O)N DJEVQCWNMQOABE-RCOVLWMOSA-N 0.000 description 2
- SYOMXKPPFZRELL-ONGXEEELSA-N Val-Gly-Lys Chemical compound CC(C)[C@@H](C(=O)NCC(=O)N[C@@H](CCCCN)C(=O)O)N SYOMXKPPFZRELL-ONGXEEELSA-N 0.000 description 2
- ZHQWPWQNVRCXAX-XQQFMLRXSA-N Val-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](C(C)C)N ZHQWPWQNVRCXAX-XQQFMLRXSA-N 0.000 description 2
- UJMCYJKPDFQLHX-XGEHTFHBSA-N Val-Ser-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](C(C)C)N)O UJMCYJKPDFQLHX-XGEHTFHBSA-N 0.000 description 2
- SVLAAUGFIHSJPK-JYJNAYRXSA-N Val-Trp-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CO)C(=O)O)N SVLAAUGFIHSJPK-JYJNAYRXSA-N 0.000 description 2
- CFIBZQOLUDURST-IHRRRGAJSA-N Val-Tyr-Cys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CS)C(=O)O)N CFIBZQOLUDURST-IHRRRGAJSA-N 0.000 description 2
- JXWGBRRVTRAZQA-ULQDDVLXSA-N Val-Tyr-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](C(C)C)N JXWGBRRVTRAZQA-ULQDDVLXSA-N 0.000 description 2
- 108010067390 Viral Proteins Proteins 0.000 description 2
- OIRDTQYFTABQOQ-KQYNXXCUSA-N adenosine Chemical compound C1=NC=2C(N)=NC=NC=2N1[C@@H]1O[C@H](CO)[C@@H](O)[C@H]1O OIRDTQYFTABQOQ-KQYNXXCUSA-N 0.000 description 2
- 230000001464 adherent effect Effects 0.000 description 2
- 108010005233 alanylglutamic acid Proteins 0.000 description 2
- 108010070944 alanylhistidine Proteins 0.000 description 2
- PYMYPHUHKUWMLA-LMVFSUKVSA-N aldehydo-D-ribose Chemical compound OC[C@@H](O)[C@@H](O)[C@@H](O)C=O PYMYPHUHKUWMLA-LMVFSUKVSA-N 0.000 description 2
- 239000003513 alkali Substances 0.000 description 2
- 150000001408 amides Chemical class 0.000 description 2
- 125000000539 amino acid group Chemical group 0.000 description 2
- 229960003896 aminopterin Drugs 0.000 description 2
- 238000000137 annealing Methods 0.000 description 2
- 238000013459 approach Methods 0.000 description 2
- ODKSFYDXXFIFQN-UHFFFAOYSA-N arginine Natural products OC(=O)C(N)CCCNC(N)=N ODKSFYDXXFIFQN-UHFFFAOYSA-N 0.000 description 2
- 235000009582 asparagine Nutrition 0.000 description 2
- 229960001230 asparagine Drugs 0.000 description 2
- 229940009098 aspartate Drugs 0.000 description 2
- 239000011324 bead Substances 0.000 description 2
- IQFYYKKMVGJFEH-UHFFFAOYSA-N beta-L-thymidine Natural products O=C1NC(=O)C(C)=CN1C1OC(CO)C(O)C1 IQFYYKKMVGJFEH-UHFFFAOYSA-N 0.000 description 2
- 230000005540 biological transmission Effects 0.000 description 2
- 229960002685 biotin Drugs 0.000 description 2
- 239000011616 biotin Substances 0.000 description 2
- 230000000740 bleeding effect Effects 0.000 description 2
- 230000000903 blocking effect Effects 0.000 description 2
- 238000004113 cell culture Methods 0.000 description 2
- 239000006285 cell suspension Substances 0.000 description 2
- 230000008859 change Effects 0.000 description 2
- 239000002738 chelating agent Substances 0.000 description 2
- 210000000349 chromosome Anatomy 0.000 description 2
- 229920001577 copolymer Polymers 0.000 description 2
- 230000008878 coupling Effects 0.000 description 2
- 238000010168 coupling process Methods 0.000 description 2
- 238000005859 coupling reaction Methods 0.000 description 2
- XUJNEKJLAYXESH-UHFFFAOYSA-N cysteine Natural products SCC(N)C(O)=O XUJNEKJLAYXESH-UHFFFAOYSA-N 0.000 description 2
- 235000018417 cysteine Nutrition 0.000 description 2
- 125000000151 cysteine group Chemical group N[C@@H](CS)C(=O)* 0.000 description 2
- 238000004163 cytometry Methods 0.000 description 2
- 238000006731 degradation reaction Methods 0.000 description 2
- 238000012217 deletion Methods 0.000 description 2
- 230000037430 deletion Effects 0.000 description 2
- 230000029087 digestion Effects 0.000 description 2
- 238000010790 dilution Methods 0.000 description 2
- 239000012895 dilution Substances 0.000 description 2
- 238000010494 dissociation reaction Methods 0.000 description 2
- 230000005593 dissociations Effects 0.000 description 2
- 238000001962 electrophoresis Methods 0.000 description 2
- 230000001804 emulsifying effect Effects 0.000 description 2
- 239000000839 emulsion Substances 0.000 description 2
- 238000005516 engineering process Methods 0.000 description 2
- 230000002255 enzymatic effect Effects 0.000 description 2
- 239000013604 expression vector Substances 0.000 description 2
- 239000012530 fluid Substances 0.000 description 2
- 238000009472 formulation Methods 0.000 description 2
- 102000037865 fusion proteins Human genes 0.000 description 2
- 108020001507 fusion proteins Proteins 0.000 description 2
- 238000001502 gel electrophoresis Methods 0.000 description 2
- 229930195712 glutamate Natural products 0.000 description 2
- 108010013768 glutamyl-aspartyl-proline Proteins 0.000 description 2
- 150000004676 glycans Chemical class 0.000 description 2
- 108010000434 glycyl-alanyl-leucine Proteins 0.000 description 2
- 108010026364 glycyl-glycyl-leucine Proteins 0.000 description 2
- 108010077515 glycylproline Proteins 0.000 description 2
- 238000010438 heat treatment Methods 0.000 description 2
- HNDVDQJCIGZPNO-UHFFFAOYSA-N histidine Natural products OC(=O)C(N)CC1=CN=CN1 HNDVDQJCIGZPNO-UHFFFAOYSA-N 0.000 description 2
- 108010018006 histidylserine Proteins 0.000 description 2
- 239000012510 hollow fiber Substances 0.000 description 2
- 229920001519 homopolymer Polymers 0.000 description 2
- 229910052739 hydrogen Inorganic materials 0.000 description 2
- 239000001257 hydrogen Substances 0.000 description 2
- 239000007924 injection Substances 0.000 description 2
- 238000002347 injection Methods 0.000 description 2
- 150000002632 lipids Chemical class 0.000 description 2
- 210000001165 lymph node Anatomy 0.000 description 2
- 108010003700 lysyl aspartic acid Proteins 0.000 description 2
- 108010009298 lysylglutamic acid Proteins 0.000 description 2
- 108010017391 lysylvaline Proteins 0.000 description 2
- 239000000463 material Substances 0.000 description 2
- 230000001404 mediated effect Effects 0.000 description 2
- 239000002609 medium Substances 0.000 description 2
- 108020004999 messenger RNA Proteins 0.000 description 2
- 229910021645 metal ion Inorganic materials 0.000 description 2
- 230000011987 methylation Effects 0.000 description 2
- 238000007069 methylation reaction Methods 0.000 description 2
- 238000002156 mixing Methods 0.000 description 2
- 238000010369 molecular cloning Methods 0.000 description 2
- 239000003068 molecular probe Substances 0.000 description 2
- 201000000050 myeloid neoplasm Diseases 0.000 description 2
- 238000007899 nucleic acid hybridization Methods 0.000 description 2
- 210000000056 organ Anatomy 0.000 description 2
- 108010024607 phenylalanylalanine Proteins 0.000 description 2
- 108010083476 phenylalanyltryptophan Proteins 0.000 description 2
- 150000004713 phosphodiesters Chemical class 0.000 description 2
- 150000008298 phosphoramidates Chemical class 0.000 description 2
- 229920001223 polyethylene glycol Polymers 0.000 description 2
- 238000006116 polymerization reaction Methods 0.000 description 2
- 230000000379 polymerizing effect Effects 0.000 description 2
- 229920001282 polysaccharide Polymers 0.000 description 2
- 239000005017 polysaccharide Substances 0.000 description 2
- 230000037452 priming Effects 0.000 description 2
- 108010031719 prolyl-serine Proteins 0.000 description 2
- 108010004914 prolylarginine Proteins 0.000 description 2
- 108010015796 prolylisoleucine Proteins 0.000 description 2
- ZCCUUQDIBDJBTK-UHFFFAOYSA-N psoralen Chemical compound C1=C2OC(=O)C=CC2=CC2=C1OC=C2 ZCCUUQDIBDJBTK-UHFFFAOYSA-N 0.000 description 2
- 230000009257 reactivity Effects 0.000 description 2
- 230000001105 regulatory effect Effects 0.000 description 2
- 230000002441 reversible effect Effects 0.000 description 2
- 239000002336 ribonucleotide Substances 0.000 description 2
- 230000028327 secretion Effects 0.000 description 2
- 239000006152 selective media Substances 0.000 description 2
- 238000000926 separation method Methods 0.000 description 2
- 108010071207 serylmethionine Proteins 0.000 description 2
- ATHGHQPFGPMSJY-UHFFFAOYSA-N spermidine Chemical compound NCCCCNCCCN ATHGHQPFGPMSJY-UHFFFAOYSA-N 0.000 description 2
- 210000000952 spleen Anatomy 0.000 description 2
- KZNICNPSHKQLFF-UHFFFAOYSA-N succinimide Chemical compound O=C1CCC(=O)N1 KZNICNPSHKQLFF-UHFFFAOYSA-N 0.000 description 2
- JJAHTWIKCUJRDK-UHFFFAOYSA-N succinimidyl 4-(N-maleimidomethyl)cyclohexane-1-carboxylate Chemical compound C1CC(CN2C(C=CC2=O)=O)CCC1C(=O)ON1C(=O)CCC1=O JJAHTWIKCUJRDK-UHFFFAOYSA-N 0.000 description 2
- 239000000725 suspension Substances 0.000 description 2
- ABZLKHKQJHEPAX-UHFFFAOYSA-N tetramethylrhodamine Chemical compound C=12C=CC(N(C)C)=CC2=[O+]C2=CC(N(C)C)=CC=C2C=1C1=CC=CC=C1C([O-])=O ABZLKHKQJHEPAX-UHFFFAOYSA-N 0.000 description 2
- 229940104230 thymidine Drugs 0.000 description 2
- 230000005030 transcription termination Effects 0.000 description 2
- 230000009466 transformation Effects 0.000 description 2
- 238000011282 treatment Methods 0.000 description 2
- 241001515965 unidentified phage Species 0.000 description 2
- IBIDRSSEHFLGSD-UHFFFAOYSA-N valinyl-arginine Natural products CC(C)C(N)C(=O)NC(C(O)=O)CCCN=C(N)N IBIDRSSEHFLGSD-UHFFFAOYSA-N 0.000 description 2
- YUXKOWPNKJSTPQ-AXWWPMSFSA-N (2s,3r)-2-amino-3-hydroxybutanoic acid;(2s)-2-amino-3-hydroxypropanoic acid Chemical compound OC[C@H](N)C(O)=O.C[C@@H](O)[C@H](N)C(O)=O YUXKOWPNKJSTPQ-AXWWPMSFSA-N 0.000 description 1
- 102000040650 (ribonucleotides)n+m Human genes 0.000 description 1
- SLLFVLKNXABYGI-UHFFFAOYSA-N 1,2,3-benzoxadiazole Chemical class C1=CC=C2ON=NC2=C1 SLLFVLKNXABYGI-UHFFFAOYSA-N 0.000 description 1
- QKNYBSVHEMOAJP-UHFFFAOYSA-N 2-amino-2-(hydroxymethyl)propane-1,3-diol;hydron;chloride Chemical compound Cl.OCC(N)(CO)CO QKNYBSVHEMOAJP-UHFFFAOYSA-N 0.000 description 1
- ASJSAQIRZKANQN-CRCLSJGQSA-N 2-deoxy-D-ribose Chemical compound OC[C@@H](O)[C@@H](O)CC=O ASJSAQIRZKANQN-CRCLSJGQSA-N 0.000 description 1
- 108020005345 3' Untranslated Regions Proteins 0.000 description 1
- VXGRJERITKFWPL-UHFFFAOYSA-N 4',5'-Dihydropsoralen Natural products C1=C2OC(=O)C=CC2=CC2=C1OCC2 VXGRJERITKFWPL-UHFFFAOYSA-N 0.000 description 1
- LQILVUYCDHSGEU-UHFFFAOYSA-N 4-[(2,5-dioxopyrrol-1-yl)methyl]cyclohexane-1-carboxylic acid Chemical compound C1CC(C(=O)O)CCC1CN1C(=O)C=CC1=O LQILVUYCDHSGEU-UHFFFAOYSA-N 0.000 description 1
- 108020003589 5' Untranslated Regions Proteins 0.000 description 1
- OPIFSICVWOWJMJ-AEOCFKNESA-N 5-bromo-4-chloro-3-indolyl beta-D-galactoside Chemical compound O[C@@H]1[C@@H](O)[C@@H](O)[C@@H](CO)O[C@H]1OC1=CNC2=CC=C(Br)C(Cl)=C12 OPIFSICVWOWJMJ-AEOCFKNESA-N 0.000 description 1
- WOJKKJKETHYEAC-UHFFFAOYSA-N 6-Maleimidocaproic acid Chemical compound OC(=O)CCCCCN1C(=O)C=CC1=O WOJKKJKETHYEAC-UHFFFAOYSA-N 0.000 description 1
- VWOLRKMFAJUZGM-UHFFFAOYSA-N 6-carboxyrhodamine 6G Chemical compound [Cl-].C=12C=C(C)C(NCC)=CC2=[O+]C=2C=C(NCC)C(C)=CC=2C=1C1=CC(C(O)=O)=CC=C1C(=O)OCC VWOLRKMFAJUZGM-UHFFFAOYSA-N 0.000 description 1
- NLSUMBWPPJUVST-UHFFFAOYSA-N 9-isothiocyanatoacridine Chemical compound C1=CC=C2C(N=C=S)=C(C=CC=C3)C3=NC2=C1 NLSUMBWPPJUVST-UHFFFAOYSA-N 0.000 description 1
- 101150022075 ADR1 gene Proteins 0.000 description 1
- HRPVXLWXLXDGHG-UHFFFAOYSA-N Acrylamide Chemical compound NC(=O)C=C HRPVXLWXLXDGHG-UHFFFAOYSA-N 0.000 description 1
- 229920000936 Agarose Polymers 0.000 description 1
- AAQGRPOPTAUUBM-ZLUOBGJFSA-N Ala-Ala-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(O)=O AAQGRPOPTAUUBM-ZLUOBGJFSA-N 0.000 description 1
- XQJAFSDFQZPYCU-UWJYBYFXSA-N Ala-Asn-Tyr Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N XQJAFSDFQZPYCU-UWJYBYFXSA-N 0.000 description 1
- BTYTYHBSJKQBQA-GCJQMDKQSA-N Ala-Asp-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](C)N)O BTYTYHBSJKQBQA-GCJQMDKQSA-N 0.000 description 1
- CXQODNIBUNQWAS-CIUDSAMLSA-N Ala-Gln-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N CXQODNIBUNQWAS-CIUDSAMLSA-N 0.000 description 1
- WGDNWOMKBUXFHR-BQBZGAKWSA-N Ala-Gly-Arg Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCN=C(N)N WGDNWOMKBUXFHR-BQBZGAKWSA-N 0.000 description 1
- VGPWRRFOPXVGOH-BYPYZUCNSA-N Ala-Gly-Gly Chemical compound C[C@H](N)C(=O)NCC(=O)NCC(O)=O VGPWRRFOPXVGOH-BYPYZUCNSA-N 0.000 description 1
- SMCGQGDVTPFXKB-XPUUQOCRSA-N Ala-Gly-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@H](C)N SMCGQGDVTPFXKB-XPUUQOCRSA-N 0.000 description 1
- DVJSJDDYCYSMFR-ZKWXMUAHSA-N Ala-Ile-Gly Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O DVJSJDDYCYSMFR-ZKWXMUAHSA-N 0.000 description 1
- YHKANGMVQWRMAP-DCAQKATOSA-N Ala-Leu-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N YHKANGMVQWRMAP-DCAQKATOSA-N 0.000 description 1
- XHNLCGXYBXNRIS-BJDJZHNGSA-N Ala-Lys-Ile Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O XHNLCGXYBXNRIS-BJDJZHNGSA-N 0.000 description 1
- PMQXMXAASGFUDX-SRVKXCTJSA-N Ala-Lys-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](C)N)CCCCN PMQXMXAASGFUDX-SRVKXCTJSA-N 0.000 description 1
- XUCHENWTTBFODJ-FXQIFTODSA-N Ala-Met-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C)C(O)=O XUCHENWTTBFODJ-FXQIFTODSA-N 0.000 description 1
- RNHKOQHGYMTHFR-UBHSHLNASA-N Ala-Phe-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](C)N)CC1=CC=CC=C1 RNHKOQHGYMTHFR-UBHSHLNASA-N 0.000 description 1
- YHBDGLZYNIARKJ-GUBZILKMSA-N Ala-Pro-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](C)N YHBDGLZYNIARKJ-GUBZILKMSA-N 0.000 description 1
- HOVPGJUNRLMIOZ-CIUDSAMLSA-N Ala-Ser-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@H](C)N HOVPGJUNRLMIOZ-CIUDSAMLSA-N 0.000 description 1
- SYIFFFHSXBNPMC-UWJYBYFXSA-N Ala-Ser-Tyr Chemical compound C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N SYIFFFHSXBNPMC-UWJYBYFXSA-N 0.000 description 1
- JPOQZCHGOTWRTM-FQPOAREZSA-N Ala-Tyr-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O JPOQZCHGOTWRTM-FQPOAREZSA-N 0.000 description 1
- VHAQSYHSDKERBS-XPUUQOCRSA-N Ala-Val-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O VHAQSYHSDKERBS-XPUUQOCRSA-N 0.000 description 1
- REWSWYIDQIELBE-FXQIFTODSA-N Ala-Val-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O REWSWYIDQIELBE-FXQIFTODSA-N 0.000 description 1
- 108010088751 Albumins Proteins 0.000 description 1
- 102000009027 Albumins Human genes 0.000 description 1
- 102100034044 All-trans-retinol dehydrogenase [NAD(+)] ADH1B Human genes 0.000 description 1
- 101710193111 All-trans-retinol dehydrogenase [NAD(+)] ADH4 Proteins 0.000 description 1
- DBKNLHKEVPZVQC-LPEHRKFASA-N Arg-Ala-Pro Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](C)C(=O)N1CCC[C@@H]1C(O)=O DBKNLHKEVPZVQC-LPEHRKFASA-N 0.000 description 1
- JSHVMZANPXCDTL-GMOBBJLQSA-N Arg-Asp-Ile Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JSHVMZANPXCDTL-GMOBBJLQSA-N 0.000 description 1
- VXXHDZKEQNGXNU-QXEWZRGKSA-N Arg-Asp-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CCCN=C(N)N VXXHDZKEQNGXNU-QXEWZRGKSA-N 0.000 description 1
- PHHRSPBBQUFULD-UWVGGRQHSA-N Arg-Gly-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CCCN=C(N)N)N PHHRSPBBQUFULD-UWVGGRQHSA-N 0.000 description 1
- ZATRYQNPUHGXCU-DTWKUNHWSA-N Arg-Gly-Pro Chemical compound C1C[C@@H](N(C1)C(=O)CNC(=O)[C@H](CCCN=C(N)N)N)C(=O)O ZATRYQNPUHGXCU-DTWKUNHWSA-N 0.000 description 1
- NKNILFJYKKHBKE-WPRPVWTQSA-N Arg-Gly-Val Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O NKNILFJYKKHBKE-WPRPVWTQSA-N 0.000 description 1
- OKKMBOSPBDASEP-CYDGBPFRSA-N Arg-Ile-Met Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCSC)C(O)=O OKKMBOSPBDASEP-CYDGBPFRSA-N 0.000 description 1
- BNYNOWJESJJIOI-XUXIUFHCSA-N Arg-Lys-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CCCN=C(N)N)N BNYNOWJESJJIOI-XUXIUFHCSA-N 0.000 description 1
- MTYLORHAQXVQOW-AVGNSLFASA-N Arg-Lys-Met Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCSC)C(O)=O MTYLORHAQXVQOW-AVGNSLFASA-N 0.000 description 1
- YLVGUOGAFAJMKP-JYJNAYRXSA-N Arg-Met-Tyr Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N YLVGUOGAFAJMKP-JYJNAYRXSA-N 0.000 description 1
- INXWADWANGLMPJ-JYJNAYRXSA-N Arg-Phe-Arg Chemical compound NC(=N)NCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CCCNC(N)=N)C(O)=O)CC1=CC=CC=C1 INXWADWANGLMPJ-JYJNAYRXSA-N 0.000 description 1
- UIUXXFIKWQVMEX-UFYCRDLUSA-N Arg-Phe-Tyr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O UIUXXFIKWQVMEX-UFYCRDLUSA-N 0.000 description 1
- BSYKSCBTTQKOJG-GUBZILKMSA-N Arg-Pro-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O BSYKSCBTTQKOJG-GUBZILKMSA-N 0.000 description 1
- AMIQZQAAYGYKOP-FXQIFTODSA-N Arg-Ser-Asn Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O AMIQZQAAYGYKOP-FXQIFTODSA-N 0.000 description 1
- OQPAZKMGCWPERI-GUBZILKMSA-N Arg-Ser-Val Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O OQPAZKMGCWPERI-GUBZILKMSA-N 0.000 description 1
- UZSQXCMNUPKLCC-FJXKBIBVSA-N Arg-Thr-Gly Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O UZSQXCMNUPKLCC-FJXKBIBVSA-N 0.000 description 1
- XRLOBFSLPCHYLQ-ULQDDVLXSA-N Arg-Tyr-His Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N)O XRLOBFSLPCHYLQ-ULQDDVLXSA-N 0.000 description 1
- PDQBXRSOSCTGKY-ACZMJKKPSA-N Asn-Ala-Gln Chemical compound C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N PDQBXRSOSCTGKY-ACZMJKKPSA-N 0.000 description 1
- CMLGVVWQQHUXOZ-GHCJXIJMSA-N Asn-Ala-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O CMLGVVWQQHUXOZ-GHCJXIJMSA-N 0.000 description 1
- XYOVHPDDWCEUDY-CIUDSAMLSA-N Asn-Ala-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O XYOVHPDDWCEUDY-CIUDSAMLSA-N 0.000 description 1
- XSGBIBGAMKTHMY-WHFBIAKZSA-N Asn-Asp-Gly Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O XSGBIBGAMKTHMY-WHFBIAKZSA-N 0.000 description 1
- HLTLEIXYIJDFOY-ZLUOBGJFSA-N Asn-Cys-Asn Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(N)=O)C(O)=O HLTLEIXYIJDFOY-ZLUOBGJFSA-N 0.000 description 1
- KUYKVGODHGHFDI-ACZMJKKPSA-N Asn-Gln-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O KUYKVGODHGHFDI-ACZMJKKPSA-N 0.000 description 1
- DMLSCRJBWUEALP-LAEOZQHASA-N Asn-Glu-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O DMLSCRJBWUEALP-LAEOZQHASA-N 0.000 description 1
- QUAWOKPCAKCHQL-SRVKXCTJSA-N Asn-His-Lys Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)N)N QUAWOKPCAKCHQL-SRVKXCTJSA-N 0.000 description 1
- PHJPKNUWWHRAOC-PEFMBERDSA-N Asn-Ile-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N PHJPKNUWWHRAOC-PEFMBERDSA-N 0.000 description 1
- HDHZCEDPLTVHFZ-GUBZILKMSA-N Asn-Leu-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O HDHZCEDPLTVHFZ-GUBZILKMSA-N 0.000 description 1
- TZFQICWZWFNIKU-KKUMJFAQSA-N Asn-Leu-Tyr Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 TZFQICWZWFNIKU-KKUMJFAQSA-N 0.000 description 1
- NTWOPSIUJBMNRI-KKUMJFAQSA-N Asn-Lys-Tyr Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 NTWOPSIUJBMNRI-KKUMJFAQSA-N 0.000 description 1
- KEUNWIXNKVWCFL-FXQIFTODSA-N Asn-Met-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CO)C(O)=O KEUNWIXNKVWCFL-FXQIFTODSA-N 0.000 description 1
- ZJIFRAPZHAGLGR-MELADBBJSA-N Asn-Phe-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)NC(=O)[C@H](CC(=O)N)N)C(=O)O ZJIFRAPZHAGLGR-MELADBBJSA-N 0.000 description 1
- GKKUBLFXKRDMFC-BQBZGAKWSA-N Asn-Pro-Gly Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O GKKUBLFXKRDMFC-BQBZGAKWSA-N 0.000 description 1
- IDUUACUJKUXKKD-VEVYYDQMSA-N Asn-Pro-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O IDUUACUJKUXKKD-VEVYYDQMSA-N 0.000 description 1
- NPZJLGMWMDNQDD-GHCJXIJMSA-N Asn-Ser-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O NPZJLGMWMDNQDD-GHCJXIJMSA-N 0.000 description 1
- JXMREEPBRANWBY-VEVYYDQMSA-N Asn-Thr-Arg Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O JXMREEPBRANWBY-VEVYYDQMSA-N 0.000 description 1
- QYRMBFWDSFGSFC-OLHMAJIHSA-N Asn-Thr-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N)O QYRMBFWDSFGSFC-OLHMAJIHSA-N 0.000 description 1
- BCADFFUQHIMQAA-KKHAAJSZSA-N Asn-Thr-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O BCADFFUQHIMQAA-KKHAAJSZSA-N 0.000 description 1
- CBHVAFXKOYAHOY-NHCYSSNCSA-N Asn-Val-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O CBHVAFXKOYAHOY-NHCYSSNCSA-N 0.000 description 1
- HBUJSDCLZCXXCW-YDHLFZDLSA-N Asn-Val-Tyr Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 HBUJSDCLZCXXCW-YDHLFZDLSA-N 0.000 description 1
- WQAOZCVOOYUWKG-LSJOCFKGSA-N Asn-Val-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)O)NC(=O)[C@H](CC(=O)N)N WQAOZCVOOYUWKG-LSJOCFKGSA-N 0.000 description 1
- GBAWQWASNGUNQF-ZLUOBGJFSA-N Asp-Ala-Cys Chemical compound C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC(=O)O)N GBAWQWASNGUNQF-ZLUOBGJFSA-N 0.000 description 1
- NJIKKGUVGUBICV-ZLUOBGJFSA-N Asp-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC(O)=O NJIKKGUVGUBICV-ZLUOBGJFSA-N 0.000 description 1
- QOVWVLLHMMCFFY-ZLUOBGJFSA-N Asp-Asp-Asn Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O QOVWVLLHMMCFFY-ZLUOBGJFSA-N 0.000 description 1
- CELPEWWLSXMVPH-CIUDSAMLSA-N Asp-Asp-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC(O)=O CELPEWWLSXMVPH-CIUDSAMLSA-N 0.000 description 1
- KGAJCJXBEWLQDZ-UBHSHLNASA-N Asp-Asp-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC(=O)O)N KGAJCJXBEWLQDZ-UBHSHLNASA-N 0.000 description 1
- PXLNPFOJZQMXAT-BYULHYEWSA-N Asp-Asp-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC(O)=O PXLNPFOJZQMXAT-BYULHYEWSA-N 0.000 description 1
- NYQHSUGFEWDWPD-ACZMJKKPSA-N Asp-Gln-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC(=O)O)N NYQHSUGFEWDWPD-ACZMJKKPSA-N 0.000 description 1
- SMZCLQGDQMGESY-ACZMJKKPSA-N Asp-Gln-Cys Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC(=O)O)N SMZCLQGDQMGESY-ACZMJKKPSA-N 0.000 description 1
- XJQRWGXKUSDEFI-ACZMJKKPSA-N Asp-Glu-Asn Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O XJQRWGXKUSDEFI-ACZMJKKPSA-N 0.000 description 1
- KHBLRHKVXICFMY-GUBZILKMSA-N Asp-Glu-Lys Chemical compound N[C@@H](CC(=O)O)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O KHBLRHKVXICFMY-GUBZILKMSA-N 0.000 description 1
- OVPHVTCDVYYTHN-AVGNSLFASA-N Asp-Glu-Phe Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 OVPHVTCDVYYTHN-AVGNSLFASA-N 0.000 description 1
- RQYMKRMRZWJGHC-BQBZGAKWSA-N Asp-Gly-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CC(=O)O)N RQYMKRMRZWJGHC-BQBZGAKWSA-N 0.000 description 1
- AITKTFCQOBRJTG-CIUDSAMLSA-N Asp-Leu-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC(=O)O)N AITKTFCQOBRJTG-CIUDSAMLSA-N 0.000 description 1
- RQHLMGCXCZUOGT-ZPFDUUQYSA-N Asp-Leu-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O RQHLMGCXCZUOGT-ZPFDUUQYSA-N 0.000 description 1
- XWSIYTYNLKCLJB-CIUDSAMLSA-N Asp-Lys-Asn Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O XWSIYTYNLKCLJB-CIUDSAMLSA-N 0.000 description 1
- DONWIPDSZZJHHK-HJGDQZAQSA-N Asp-Lys-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC(=O)O)N)O DONWIPDSZZJHHK-HJGDQZAQSA-N 0.000 description 1
- YRZIYQGXTSBRLT-AVGNSLFASA-N Asp-Phe-Gln Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O YRZIYQGXTSBRLT-AVGNSLFASA-N 0.000 description 1
- RPUYTJJZXQBWDT-SRVKXCTJSA-N Asp-Phe-Ser Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC(=O)O)N RPUYTJJZXQBWDT-SRVKXCTJSA-N 0.000 description 1
- ZKAOJVJQGVUIIU-GUBZILKMSA-N Asp-Pro-Arg Chemical compound OC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(O)=O ZKAOJVJQGVUIIU-GUBZILKMSA-N 0.000 description 1
- YIDFBWRHIYOYAA-LKXGYXEUSA-N Asp-Ser-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O YIDFBWRHIYOYAA-LKXGYXEUSA-N 0.000 description 1
- MJJIHRWNWSQTOI-VEVYYDQMSA-N Asp-Thr-Arg Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O MJJIHRWNWSQTOI-VEVYYDQMSA-N 0.000 description 1
- YUELDQUPTAYEGM-XIRDDKMYSA-N Asp-Trp-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@H](CC(=O)O)N YUELDQUPTAYEGM-XIRDDKMYSA-N 0.000 description 1
- RCGVPVZHKAXDPA-NYVOZVTQSA-N Asp-Trp-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC3=CNC4=CC=CC=C43)C(=O)O)NC(=O)[C@H](CC(=O)O)N RCGVPVZHKAXDPA-NYVOZVTQSA-N 0.000 description 1
- BPAUXFVCSYQDQX-JRQIVUDYSA-N Asp-Tyr-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CC(=O)O)N)O BPAUXFVCSYQDQX-JRQIVUDYSA-N 0.000 description 1
- XWKPSMRPIKKDDU-RCOVLWMOSA-N Asp-Val-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O XWKPSMRPIKKDDU-RCOVLWMOSA-N 0.000 description 1
- ZUNMTUPRQMWMHX-LSJOCFKGSA-N Asp-Val-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C(C)C)C(O)=O ZUNMTUPRQMWMHX-LSJOCFKGSA-N 0.000 description 1
- 241000193830 Bacillus <bacterium> Species 0.000 description 1
- ZOXJGFHDIHLPTG-UHFFFAOYSA-N Boron Chemical compound [B] ZOXJGFHDIHLPTG-UHFFFAOYSA-N 0.000 description 1
- NLZUEZXRPGMBCV-UHFFFAOYSA-N Butylhydroxytoluene Chemical compound CC1=CC(C(C)(C)C)=C(O)C(C(C)(C)C)=C1 NLZUEZXRPGMBCV-UHFFFAOYSA-N 0.000 description 1
- 229930182476 C-glycoside Natural products 0.000 description 1
- 150000000700 C-glycosides Chemical class 0.000 description 1
- QCMYYKRYFNMIEC-UHFFFAOYSA-N COP(O)=O Chemical class COP(O)=O QCMYYKRYFNMIEC-UHFFFAOYSA-N 0.000 description 1
- KXDHJXZQYSOELW-UHFFFAOYSA-M Carbamate Chemical compound NC([O-])=O KXDHJXZQYSOELW-UHFFFAOYSA-M 0.000 description 1
- 206010008631 Cholera Diseases 0.000 description 1
- 108091035707 Consensus sequence Proteins 0.000 description 1
- GHUVBPIYQYXXEF-SRVKXCTJSA-N Cys-Cys-Tyr Chemical compound SC[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 GHUVBPIYQYXXEF-SRVKXCTJSA-N 0.000 description 1
- XTHUKRLJRUVVBF-WHFBIAKZSA-N Cys-Gly-Ser Chemical compound SC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O XTHUKRLJRUVVBF-WHFBIAKZSA-N 0.000 description 1
- RESAHOSBQHMOKH-KKUMJFAQSA-N Cys-Phe-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](CS)N RESAHOSBQHMOKH-KKUMJFAQSA-N 0.000 description 1
- 102000012410 DNA Ligases Human genes 0.000 description 1
- 108010061982 DNA Ligases Proteins 0.000 description 1
- 101710158312 DNA-binding protein HU-beta Proteins 0.000 description 1
- 241000238557 Decapoda Species 0.000 description 1
- 108090000204 Dipeptidase 1 Proteins 0.000 description 1
- KCXVZYZYPLLWCC-UHFFFAOYSA-N EDTA Chemical compound OC(=O)CN(CC(O)=O)CCN(CC(O)=O)CC(O)=O KCXVZYZYPLLWCC-UHFFFAOYSA-N 0.000 description 1
- 238000002965 ELISA Methods 0.000 description 1
- 102000004533 Endonucleases Human genes 0.000 description 1
- 241001131785 Escherichia coli HB101 Species 0.000 description 1
- 108060002716 Exonuclease Proteins 0.000 description 1
- MLZRSFQRBDNJON-GUBZILKMSA-N Gln-Ala-Lys Chemical compound C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)N)N MLZRSFQRBDNJON-GUBZILKMSA-N 0.000 description 1
- MFLMFRZBAJSGHK-ACZMJKKPSA-N Gln-Cys-Ser Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CO)C(=O)O)N MFLMFRZBAJSGHK-ACZMJKKPSA-N 0.000 description 1
- AJDMYLOISOCHHC-YVNDNENWSA-N Gln-Gln-Ile Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O AJDMYLOISOCHHC-YVNDNENWSA-N 0.000 description 1
- SNLOOPZHAQDMJG-CIUDSAMLSA-N Gln-Glu-Glu Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O SNLOOPZHAQDMJG-CIUDSAMLSA-N 0.000 description 1
- PXAFHUATEHLECW-GUBZILKMSA-N Gln-Glu-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCC(=O)N)N PXAFHUATEHLECW-GUBZILKMSA-N 0.000 description 1
- JEFZIKRIDLHOIF-BYPYZUCNSA-N Gln-Gly Chemical compound NC(=O)CC[C@H](N)C(=O)NCC(O)=O JEFZIKRIDLHOIF-BYPYZUCNSA-N 0.000 description 1
- JXBZEDIQFFCHPZ-PEFMBERDSA-N Gln-Ile-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CCC(=O)N)N JXBZEDIQFFCHPZ-PEFMBERDSA-N 0.000 description 1
- IOFDDSNZJDIGPB-GVXVVHGQSA-N Gln-Leu-Val Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O IOFDDSNZJDIGPB-GVXVVHGQSA-N 0.000 description 1
- DOMHVQBSRJNNKD-ZPFDUUQYSA-N Gln-Met-Ile Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O DOMHVQBSRJNNKD-ZPFDUUQYSA-N 0.000 description 1
- DFRYZTUPVZNRLG-KKUMJFAQSA-N Gln-Met-Phe Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CCC(=O)N)N DFRYZTUPVZNRLG-KKUMJFAQSA-N 0.000 description 1
- JNVGVECJCOZHCN-DRZSPHRISA-N Gln-Phe-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C)C(O)=O JNVGVECJCOZHCN-DRZSPHRISA-N 0.000 description 1
- OZEQPCDLCDRCGY-SOUVJXGZSA-N Gln-Phe-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)NC(=O)[C@H](CCC(=O)N)N)C(=O)O OZEQPCDLCDRCGY-SOUVJXGZSA-N 0.000 description 1
- MFHVAWMMKZBSRQ-ACZMJKKPSA-N Gln-Ser-Cys Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)O)N MFHVAWMMKZBSRQ-ACZMJKKPSA-N 0.000 description 1
- JILRMFFFCHUUTJ-ACZMJKKPSA-N Gln-Ser-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O JILRMFFFCHUUTJ-ACZMJKKPSA-N 0.000 description 1
- OEIDWQHTRYEYGG-QEJZJMRPSA-N Gln-Trp-Asp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CCC(=O)N)N OEIDWQHTRYEYGG-QEJZJMRPSA-N 0.000 description 1
- NKSGKPWXSWBRRX-ACZMJKKPSA-N Glu-Asn-Cys Chemical compound C(CC(=O)O)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CS)C(=O)O)N NKSGKPWXSWBRRX-ACZMJKKPSA-N 0.000 description 1
- RDPOETHPAQEGDP-ACZMJKKPSA-N Glu-Asp-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O RDPOETHPAQEGDP-ACZMJKKPSA-N 0.000 description 1
- FLQAKQOBSPFGKG-CIUDSAMLSA-N Glu-Cys-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@H](C(O)=O)CCCN=C(N)N FLQAKQOBSPFGKG-CIUDSAMLSA-N 0.000 description 1
- ALCAUWPAMLVUDB-FXQIFTODSA-N Glu-Gln-Asn Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O ALCAUWPAMLVUDB-FXQIFTODSA-N 0.000 description 1
- BUZMZDDKFCSKOT-CIUDSAMLSA-N Glu-Glu-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O BUZMZDDKFCSKOT-CIUDSAMLSA-N 0.000 description 1
- AUTNXSQEVVHSJK-YVNDNENWSA-N Glu-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCC(O)=O AUTNXSQEVVHSJK-YVNDNENWSA-N 0.000 description 1
- OPAINBJQDQTGJY-JGVFFNPUSA-N Glu-Gly-Pro Chemical compound C1C[C@@H](N(C1)C(=O)CNC(=O)[C@H](CCC(=O)O)N)C(=O)O OPAINBJQDQTGJY-JGVFFNPUSA-N 0.000 description 1
- ZWABFSSWTSAMQN-KBIXCLLPSA-N Glu-Ile-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O ZWABFSSWTSAMQN-KBIXCLLPSA-N 0.000 description 1
- LGYCLOCORAEQSZ-PEFMBERDSA-N Glu-Ile-Asp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O LGYCLOCORAEQSZ-PEFMBERDSA-N 0.000 description 1
- XTZDZAXYPDISRR-MNXVOIDGSA-N Glu-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)O)N XTZDZAXYPDISRR-MNXVOIDGSA-N 0.000 description 1
- GRHXUHCFENOCOS-ZPFDUUQYSA-N Glu-Ile-Met Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CCC(=O)O)N GRHXUHCFENOCOS-ZPFDUUQYSA-N 0.000 description 1
- VSRCAOIHMGCIJK-SRVKXCTJSA-N Glu-Leu-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O VSRCAOIHMGCIJK-SRVKXCTJSA-N 0.000 description 1
- WNRZUESNGGDCJX-JYJNAYRXSA-N Glu-Leu-Phe Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O WNRZUESNGGDCJX-JYJNAYRXSA-N 0.000 description 1
- SWRVAQHFBRZVNX-GUBZILKMSA-N Glu-Lys-Asn Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O SWRVAQHFBRZVNX-GUBZILKMSA-N 0.000 description 1
- CUPSDFQZTVVTSK-GUBZILKMSA-N Glu-Lys-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCC(O)=O CUPSDFQZTVVTSK-GUBZILKMSA-N 0.000 description 1
- QDMVXRNLOPTPIE-WDCWCFNPSA-N Glu-Lys-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O QDMVXRNLOPTPIE-WDCWCFNPSA-N 0.000 description 1
- OFIHURVSQXAZIR-SZMVWBNQSA-N Glu-Lys-Trp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O OFIHURVSQXAZIR-SZMVWBNQSA-N 0.000 description 1
- SUIAHERNFYRBDZ-GVXVVHGQSA-N Glu-Lys-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O SUIAHERNFYRBDZ-GVXVVHGQSA-N 0.000 description 1
- ZWMYUDZLXAQHCK-CIUDSAMLSA-N Glu-Met-Asp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(O)=O)C(O)=O ZWMYUDZLXAQHCK-CIUDSAMLSA-N 0.000 description 1
- SOEPMWQCTJITPZ-SRVKXCTJSA-N Glu-Met-Lys Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)O)N SOEPMWQCTJITPZ-SRVKXCTJSA-N 0.000 description 1
- KJBGAZSLZAQDPV-KKUMJFAQSA-N Glu-Phe-Arg Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](CCC(=O)O)N KJBGAZSLZAQDPV-KKUMJFAQSA-N 0.000 description 1
- JYXKPJVDCAWMDG-ZPFDUUQYSA-N Glu-Pro-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CCC(=O)O)N JYXKPJVDCAWMDG-ZPFDUUQYSA-N 0.000 description 1
- DAHLWSFUXOHMIA-FXQIFTODSA-N Glu-Ser-Gln Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(O)=O DAHLWSFUXOHMIA-FXQIFTODSA-N 0.000 description 1
- CQGBSALYGOXQPE-HTUGSXCWSA-N Glu-Thr-Phe Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CCC(=O)O)N)O CQGBSALYGOXQPE-HTUGSXCWSA-N 0.000 description 1
- BKMOHWJHXQLFEX-IRIUXVKKSA-N Glu-Tyr-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CCC(=O)O)N)O BKMOHWJHXQLFEX-IRIUXVKKSA-N 0.000 description 1
- WQZGKKKJIJFFOK-GASJEMHNSA-N Glucose Natural products OC[C@H]1OC(O)[C@H](O)[C@@H](O)[C@@H]1O WQZGKKKJIJFFOK-GASJEMHNSA-N 0.000 description 1
- NEDQVOQDDBCRGG-UHFFFAOYSA-N Gly Gly Thr Tyr Chemical compound NCC(=O)NCC(=O)NC(C(O)C)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 NEDQVOQDDBCRGG-UHFFFAOYSA-N 0.000 description 1
- YMUFWNJHVPQNQD-ZKWXMUAHSA-N Gly-Ala-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)CN YMUFWNJHVPQNQD-ZKWXMUAHSA-N 0.000 description 1
- QSDKBRMVXSWAQE-BFHQHQDPSA-N Gly-Ala-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)CN QSDKBRMVXSWAQE-BFHQHQDPSA-N 0.000 description 1
- JRDYDYXZKFNNRQ-XPUUQOCRSA-N Gly-Ala-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)CN JRDYDYXZKFNNRQ-XPUUQOCRSA-N 0.000 description 1
- GWCRIHNSVMOBEQ-BQBZGAKWSA-N Gly-Arg-Ser Chemical compound [H]NCC(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O GWCRIHNSVMOBEQ-BQBZGAKWSA-N 0.000 description 1
- NZAFOTBEULLEQB-WDSKDSINSA-N Gly-Asn-Glu Chemical compound C(CC(=O)O)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)CN NZAFOTBEULLEQB-WDSKDSINSA-N 0.000 description 1
- FUTAPPOITCCWTH-WHFBIAKZSA-N Gly-Asp-Asp Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O FUTAPPOITCCWTH-WHFBIAKZSA-N 0.000 description 1
- XEJTYSCIXKYSHR-WDSKDSINSA-N Gly-Asp-Gln Chemical compound C(CC(=O)N)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)CN XEJTYSCIXKYSHR-WDSKDSINSA-N 0.000 description 1
- FZQLXNIMCPJVJE-YUMQZZPRSA-N Gly-Asp-Leu Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O FZQLXNIMCPJVJE-YUMQZZPRSA-N 0.000 description 1
- BEQGFMIBZFNROK-JGVFFNPUSA-N Gly-Glu-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)O)NC(=O)CN)C(=O)O BEQGFMIBZFNROK-JGVFFNPUSA-N 0.000 description 1
- QSVCIFZPGLOZGH-WDSKDSINSA-N Gly-Glu-Ser Chemical compound NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O QSVCIFZPGLOZGH-WDSKDSINSA-N 0.000 description 1
- HKSNHPVETYYJBK-LAEOZQHASA-N Gly-Ile-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)CN HKSNHPVETYYJBK-LAEOZQHASA-N 0.000 description 1
- LRQXRHGQEVWGPV-NHCYSSNCSA-N Gly-Leu-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)CN LRQXRHGQEVWGPV-NHCYSSNCSA-N 0.000 description 1
- TVUWMSBGMVAHSJ-KBPBESRZSA-N Gly-Leu-Phe Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 TVUWMSBGMVAHSJ-KBPBESRZSA-N 0.000 description 1
- UUYBFNKHOCJCHT-VHSXEESVSA-N Gly-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)CN UUYBFNKHOCJCHT-VHSXEESVSA-N 0.000 description 1
- FHQRLHFYVZAQHU-IUCAKERBSA-N Gly-Lys-Gln Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(O)=O FHQRLHFYVZAQHU-IUCAKERBSA-N 0.000 description 1
- FXGRXIATVXUAHO-WEDXCCLWSA-N Gly-Lys-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCCN FXGRXIATVXUAHO-WEDXCCLWSA-N 0.000 description 1
- YYXJFBMCOUSYSF-RYUDHWBXSA-N Gly-Phe-Gln Chemical compound [H]NCC(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O YYXJFBMCOUSYSF-RYUDHWBXSA-N 0.000 description 1
- POJJAZJHBGXEGM-YUMQZZPRSA-N Gly-Ser-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)CN POJJAZJHBGXEGM-YUMQZZPRSA-N 0.000 description 1
- FFALDIDGPLUDKV-ZDLURKLDSA-N Gly-Thr-Ser Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O FFALDIDGPLUDKV-ZDLURKLDSA-N 0.000 description 1
- RYAOJUMWLWUGNW-QMMMGPOBSA-N Gly-Val-Gly Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O RYAOJUMWLWUGNW-QMMMGPOBSA-N 0.000 description 1
- BAYQNCWLXIDLHX-ONGXEEELSA-N Gly-Val-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)CN BAYQNCWLXIDLHX-ONGXEEELSA-N 0.000 description 1
- FNXSYBOHALPRHV-ONGXEEELSA-N Gly-Val-Lys Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCCN FNXSYBOHALPRHV-ONGXEEELSA-N 0.000 description 1
- AFMOTCMSEBITOE-YEPSODPASA-N Gly-Val-Thr Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O AFMOTCMSEBITOE-YEPSODPASA-N 0.000 description 1
- 102100031181 Glyceraldehyde-3-phosphate dehydrogenase Human genes 0.000 description 1
- 101150026303 HEX1 gene Proteins 0.000 description 1
- 241000709715 Hepatovirus Species 0.000 description 1
- NNBWMLHQXBTIIT-HVTMNAMFSA-N His-Gln-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC1=CN=CN1)N NNBWMLHQXBTIIT-HVTMNAMFSA-N 0.000 description 1
- UMBKDWGQESDCTO-KKUMJFAQSA-N His-Lys-Lys Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(O)=O UMBKDWGQESDCTO-KKUMJFAQSA-N 0.000 description 1
- FJCGVRRVBKYYOU-DCAQKATOSA-N His-Met-Ser Chemical compound CSCC[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N FJCGVRRVBKYYOU-DCAQKATOSA-N 0.000 description 1
- BSVLMPMIXPQNKC-KBPBESRZSA-N His-Phe-Gly Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)NCC(O)=O BSVLMPMIXPQNKC-KBPBESRZSA-N 0.000 description 1
- ZFDKSLBEWYCOCS-BZSNNMDCSA-N His-Phe-Lys Chemical compound C([C@@H](C(=O)N[C@@H](CCCCN)C(O)=O)NC(=O)[C@@H](N)CC=1NC=NC=1)C1=CC=CC=C1 ZFDKSLBEWYCOCS-BZSNNMDCSA-N 0.000 description 1
- LNVILFYCPVOHPV-IHPCNDPISA-N His-Trp-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC(C)C)C(O)=O LNVILFYCPVOHPV-IHPCNDPISA-N 0.000 description 1
- 241000702617 Human parvovirus B19 Species 0.000 description 1
- NKVZTQVGUNLLQW-JBDRJPRFSA-N Ile-Ala-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(=O)O)N NKVZTQVGUNLLQW-JBDRJPRFSA-N 0.000 description 1
- FVEWRQXNISSYFO-ZPFDUUQYSA-N Ile-Arg-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N FVEWRQXNISSYFO-ZPFDUUQYSA-N 0.000 description 1
- WECYRWOMWSCWNX-XUXIUFHCSA-N Ile-Arg-Leu Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC(C)C)C(O)=O WECYRWOMWSCWNX-XUXIUFHCSA-N 0.000 description 1
- XENGULNPUDGALZ-ZPFDUUQYSA-N Ile-Asn-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(C)C)C(=O)O)N XENGULNPUDGALZ-ZPFDUUQYSA-N 0.000 description 1
- IPYVXYDYLHVWHU-GMOBBJLQSA-N Ile-Asn-Met Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCSC)C(=O)O)N IPYVXYDYLHVWHU-GMOBBJLQSA-N 0.000 description 1
- QSPLUJGYOPZINY-ZPFDUUQYSA-N Ile-Asp-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N QSPLUJGYOPZINY-ZPFDUUQYSA-N 0.000 description 1
- JXMSHKFPDIUYGS-SIUGBPQLSA-N Ile-Glu-Tyr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N JXMSHKFPDIUYGS-SIUGBPQLSA-N 0.000 description 1
- AMSYMDIIIRJRKZ-HJPIBITLSA-N Ile-His-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N AMSYMDIIIRJRKZ-HJPIBITLSA-N 0.000 description 1
- APDIECQNNDGFPD-PYJNHQTQSA-N Ile-His-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](C(C)C)C(=O)O)N APDIECQNNDGFPD-PYJNHQTQSA-N 0.000 description 1
- SVBAHOMTJRFSIC-SXTJYALSSA-N Ile-Ile-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(=O)N)C(=O)O)N SVBAHOMTJRFSIC-SXTJYALSSA-N 0.000 description 1
- AXNGDPAKKCEKGY-QPHKQPEJSA-N Ile-Ile-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N AXNGDPAKKCEKGY-QPHKQPEJSA-N 0.000 description 1
- IOVUXUSIGXCREV-DKIMLUQUSA-N Ile-Leu-Phe Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 IOVUXUSIGXCREV-DKIMLUQUSA-N 0.000 description 1
- ADDYYRVQQZFIMW-MNXVOIDGSA-N Ile-Lys-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N ADDYYRVQQZFIMW-MNXVOIDGSA-N 0.000 description 1
- ZUPJCJINYQISSN-XUXIUFHCSA-N Ile-Met-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCCCN)C(=O)O)N ZUPJCJINYQISSN-XUXIUFHCSA-N 0.000 description 1
- JZNVOBUNTWNZPW-GHCJXIJMSA-N Ile-Ser-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(=O)O)C(=O)O)N JZNVOBUNTWNZPW-GHCJXIJMSA-N 0.000 description 1
- PELCGFMHLZXWBQ-BJDJZHNGSA-N Ile-Ser-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)O)N PELCGFMHLZXWBQ-BJDJZHNGSA-N 0.000 description 1
- KBDIBHQICWDGDL-PPCPHDFISA-N Ile-Thr-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)O)N KBDIBHQICWDGDL-PPCPHDFISA-N 0.000 description 1
- RTSQPLLOYSGMKM-DSYPUSFNSA-N Ile-Trp-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CC(C)C)C(=O)O)N RTSQPLLOYSGMKM-DSYPUSFNSA-N 0.000 description 1
- YBHKCXNNNVDYEB-SPOWBLRKSA-N Ile-Trp-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CO)C(=O)O)N YBHKCXNNNVDYEB-SPOWBLRKSA-N 0.000 description 1
- ZUWSVOYKBCHLRR-MGHWNKPDSA-N Ile-Tyr-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCCCN)C(=O)O)N ZUWSVOYKBCHLRR-MGHWNKPDSA-N 0.000 description 1
- JZBVBOKASHNXAD-NAKRPEOUSA-N Ile-Val-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(=O)O)N JZBVBOKASHNXAD-NAKRPEOUSA-N 0.000 description 1
- 108010054477 Immunoglobulin Fab Fragments Proteins 0.000 description 1
- 102000001706 Immunoglobulin Fab Fragments Human genes 0.000 description 1
- 108010067060 Immunoglobulin Variable Region Proteins 0.000 description 1
- 102000017727 Immunoglobulin Variable Region Human genes 0.000 description 1
- 101710128560 Initiator protein NS1 Proteins 0.000 description 1
- 108091092195 Intron Proteins 0.000 description 1
- 206010023126 Jaundice Diseases 0.000 description 1
- HGCNKOLVKRAVHD-UHFFFAOYSA-N L-Met-L-Phe Natural products CSCCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 HGCNKOLVKRAVHD-UHFFFAOYSA-N 0.000 description 1
- ONIBWKKTOPOVIA-BYPYZUCNSA-N L-Proline Chemical compound OC(=O)[C@@H]1CCCN1 ONIBWKKTOPOVIA-BYPYZUCNSA-N 0.000 description 1
- QNAYBMKLOCPYGJ-REOHCLBHSA-N L-alanine Chemical compound C[C@H](N)C(O)=O QNAYBMKLOCPYGJ-REOHCLBHSA-N 0.000 description 1
- ODKSFYDXXFIFQN-BYPYZUCNSA-P L-argininium(2+) Chemical compound NC(=[NH2+])NCCC[C@H]([NH3+])C(O)=O ODKSFYDXXFIFQN-BYPYZUCNSA-P 0.000 description 1
- KFKWRHQBZQICHA-STQMWFEESA-N L-leucyl-L-phenylalanine Natural products CC(C)C[C@H](N)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 KFKWRHQBZQICHA-STQMWFEESA-N 0.000 description 1
- TYYLDKGBCJGJGW-UHFFFAOYSA-N L-tryptophan-L-tyrosine Natural products C=1NC2=CC=CC=C2C=1CC(N)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 TYYLDKGBCJGJGW-UHFFFAOYSA-N 0.000 description 1
- LZDNBBYBDGBADK-UHFFFAOYSA-N L-valyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)C(C)C)C(O)=O)=CNC2=C1 LZDNBBYBDGBADK-UHFFFAOYSA-N 0.000 description 1
- BQSLGJHIAGOZCD-CIUDSAMLSA-N Leu-Ala-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O BQSLGJHIAGOZCD-CIUDSAMLSA-N 0.000 description 1
- XBBKIIGCUMBKCO-JXUBOQSCSA-N Leu-Ala-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XBBKIIGCUMBKCO-JXUBOQSCSA-N 0.000 description 1
- XYUBOFCTGPZFSA-WDSOQIARSA-N Leu-Arg-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@@H](N)CC(C)C)C(O)=O)=CNC2=C1 XYUBOFCTGPZFSA-WDSOQIARSA-N 0.000 description 1
- IGUOAYLTQJLPPD-DCAQKATOSA-N Leu-Asn-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N IGUOAYLTQJLPPD-DCAQKATOSA-N 0.000 description 1
- DLCOFDAHNMMQPP-SRVKXCTJSA-N Leu-Asp-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O DLCOFDAHNMMQPP-SRVKXCTJSA-N 0.000 description 1
- FOEHRHOBWFQSNW-KATARQTJSA-N Leu-Cys-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CC(C)C)N)O FOEHRHOBWFQSNW-KATARQTJSA-N 0.000 description 1
- WIDZHJTYKYBLSR-DCAQKATOSA-N Leu-Glu-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O WIDZHJTYKYBLSR-DCAQKATOSA-N 0.000 description 1
- QVFGXCVIXXBFHO-AVGNSLFASA-N Leu-Glu-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O QVFGXCVIXXBFHO-AVGNSLFASA-N 0.000 description 1
- PRZVBIAOPFGAQF-SRVKXCTJSA-N Leu-Glu-Met Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCSC)C(O)=O PRZVBIAOPFGAQF-SRVKXCTJSA-N 0.000 description 1
- HVJVUYQWFYMGJS-GVXVVHGQSA-N Leu-Glu-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O HVJVUYQWFYMGJS-GVXVVHGQSA-N 0.000 description 1
- VWHGTYCRDRBSFI-ZETCQYMHSA-N Leu-Gly-Gly Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)NCC(O)=O VWHGTYCRDRBSFI-ZETCQYMHSA-N 0.000 description 1
- POZULHZYLPGXMR-ONGXEEELSA-N Leu-Gly-Val Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O POZULHZYLPGXMR-ONGXEEELSA-N 0.000 description 1
- AVEGDIAXTDVBJS-XUXIUFHCSA-N Leu-Ile-Arg Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O AVEGDIAXTDVBJS-XUXIUFHCSA-N 0.000 description 1
- HRTRLSRYZZKPCO-BJDJZHNGSA-N Leu-Ile-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O HRTRLSRYZZKPCO-BJDJZHNGSA-N 0.000 description 1
- JKSIBWITFMQTOA-XUXIUFHCSA-N Leu-Ile-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O JKSIBWITFMQTOA-XUXIUFHCSA-N 0.000 description 1
- IAJFFZORSWOZPQ-SRVKXCTJSA-N Leu-Leu-Asn Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O IAJFFZORSWOZPQ-SRVKXCTJSA-N 0.000 description 1
- ZGUMORRUBUCXEH-AVGNSLFASA-N Leu-Lys-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(O)=O ZGUMORRUBUCXEH-AVGNSLFASA-N 0.000 description 1
- RTIRBWJPYJYTLO-MELADBBJSA-N Leu-Lys-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N1CCC[C@@H]1C(=O)O)N RTIRBWJPYJYTLO-MELADBBJSA-N 0.000 description 1
- OVZLLFONXILPDZ-VOAKCMCISA-N Leu-Lys-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OVZLLFONXILPDZ-VOAKCMCISA-N 0.000 description 1
- CPONGMJGVIAWEH-DCAQKATOSA-N Leu-Met-Ala Chemical compound CSCC[C@H](NC(=O)[C@@H](N)CC(C)C)C(=O)N[C@@H](C)C(O)=O CPONGMJGVIAWEH-DCAQKATOSA-N 0.000 description 1
- YESNGRDJQWDYLH-KKUMJFAQSA-N Leu-Phe-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CS)C(=O)O)N YESNGRDJQWDYLH-KKUMJFAQSA-N 0.000 description 1
- KQFZKDITNUEVFJ-JYJNAYRXSA-N Leu-Phe-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CC(C)C)CC1=CC=CC=C1 KQFZKDITNUEVFJ-JYJNAYRXSA-N 0.000 description 1
- DPURXCQCHSQPAN-AVGNSLFASA-N Leu-Pro-Pro Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 DPURXCQCHSQPAN-AVGNSLFASA-N 0.000 description 1
- JLYUZRKPDKHUTC-WDSOQIARSA-N Leu-Pro-Trp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O JLYUZRKPDKHUTC-WDSOQIARSA-N 0.000 description 1
- IRMLZWSRWSGTOP-CIUDSAMLSA-N Leu-Ser-Ala Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O IRMLZWSRWSGTOP-CIUDSAMLSA-N 0.000 description 1
- RGUXWMDNCPMQFB-YUMQZZPRSA-N Leu-Ser-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O RGUXWMDNCPMQFB-YUMQZZPRSA-N 0.000 description 1
- PPGBXYKMUMHFBF-KATARQTJSA-N Leu-Ser-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PPGBXYKMUMHFBF-KATARQTJSA-N 0.000 description 1
- VDIARPPNADFEAV-WEDXCCLWSA-N Leu-Thr-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O VDIARPPNADFEAV-WEDXCCLWSA-N 0.000 description 1
- AAKRWBIIGKPOKQ-ONGXEEELSA-N Leu-Val-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O AAKRWBIIGKPOKQ-ONGXEEELSA-N 0.000 description 1
- XOEDPXDZJHBQIX-ULQDDVLXSA-N Leu-Val-Phe Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 XOEDPXDZJHBQIX-ULQDDVLXSA-N 0.000 description 1
- 102000003960 Ligases Human genes 0.000 description 1
- 108090000364 Ligases Proteins 0.000 description 1
- MPOHDJKRBLVGCT-CIUDSAMLSA-N Lys-Ala-Asn Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCCCN)N MPOHDJKRBLVGCT-CIUDSAMLSA-N 0.000 description 1
- NQCJGQHHYZNUDK-DCAQKATOSA-N Lys-Arg-Ser Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CO)C(O)=O)CCCN=C(N)N NQCJGQHHYZNUDK-DCAQKATOSA-N 0.000 description 1
- QUYCUALODHJQLK-CIUDSAMLSA-N Lys-Asp-Asp Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O QUYCUALODHJQLK-CIUDSAMLSA-N 0.000 description 1
- QIJVAFLRMVBHMU-KKUMJFAQSA-N Lys-Asp-Phe Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O QIJVAFLRMVBHMU-KKUMJFAQSA-N 0.000 description 1
- PBIPLDMFHAICIP-DCAQKATOSA-N Lys-Glu-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O PBIPLDMFHAICIP-DCAQKATOSA-N 0.000 description 1
- VQXAVLQBQJMENB-SRVKXCTJSA-N Lys-Glu-Met Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCSC)C(O)=O VQXAVLQBQJMENB-SRVKXCTJSA-N 0.000 description 1
- LCMWVZLBCUVDAZ-IUCAKERBSA-N Lys-Gly-Glu Chemical compound [NH3+]CCCC[C@H]([NH3+])C(=O)NCC(=O)N[C@H](C([O-])=O)CCC([O-])=O LCMWVZLBCUVDAZ-IUCAKERBSA-N 0.000 description 1
- OWRUUFUVXFREBD-KKUMJFAQSA-N Lys-His-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(O)=O OWRUUFUVXFREBD-KKUMJFAQSA-N 0.000 description 1
- KYNNSEJZFVCDIV-ZPFDUUQYSA-N Lys-Ile-Asn Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O KYNNSEJZFVCDIV-ZPFDUUQYSA-N 0.000 description 1
- YUAXTFMFMOIMAM-QWRGUYRKSA-N Lys-Lys-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O YUAXTFMFMOIMAM-QWRGUYRKSA-N 0.000 description 1
- GAHJXEMYXKLZRQ-AJNGGQMLSA-N Lys-Lys-Ile Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O GAHJXEMYXKLZRQ-AJNGGQMLSA-N 0.000 description 1
- HVAUKHLDSDDROB-KKUMJFAQSA-N Lys-Lys-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O HVAUKHLDSDDROB-KKUMJFAQSA-N 0.000 description 1
- JYVCOTWSRGFABJ-DCAQKATOSA-N Lys-Met-Ser Chemical compound CSCC[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CCCCN)N JYVCOTWSRGFABJ-DCAQKATOSA-N 0.000 description 1
- IPTUBUUIFRZMJK-ACRUOGEOSA-N Lys-Phe-Phe Chemical compound C([C@H](NC(=O)[C@@H](N)CCCCN)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 IPTUBUUIFRZMJK-ACRUOGEOSA-N 0.000 description 1
- UDXSLGLHFUBRRM-OEAJRASXSA-N Lys-Phe-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](CCCCN)N)O UDXSLGLHFUBRRM-OEAJRASXSA-N 0.000 description 1
- MGKFCQFVPKOWOL-CIUDSAMLSA-N Lys-Ser-Asp Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(=O)O)C(=O)O)N MGKFCQFVPKOWOL-CIUDSAMLSA-N 0.000 description 1
- WZVSHTFTCYOFPL-GARJFASQSA-N Lys-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CCCCN)N)C(=O)O WZVSHTFTCYOFPL-GARJFASQSA-N 0.000 description 1
- TVHCDSBMFQYPNA-RHYQMDGZSA-N Lys-Thr-Arg Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O TVHCDSBMFQYPNA-RHYQMDGZSA-N 0.000 description 1
- UWHCKWNPWKTMBM-WDCWCFNPSA-N Lys-Thr-Gln Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(O)=O UWHCKWNPWKTMBM-WDCWCFNPSA-N 0.000 description 1
- RPWTZTBIFGENIA-VOAKCMCISA-N Lys-Thr-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O RPWTZTBIFGENIA-VOAKCMCISA-N 0.000 description 1
- UGCIQUYEJIEHKX-GVXVVHGQSA-N Lys-Val-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O UGCIQUYEJIEHKX-GVXVVHGQSA-N 0.000 description 1
- FYYHWMGAXLPEAU-UHFFFAOYSA-N Magnesium Chemical compound [Mg] FYYHWMGAXLPEAU-UHFFFAOYSA-N 0.000 description 1
- QGQGAIBGTUJRBR-NAKRPEOUSA-N Met-Ala-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCSC QGQGAIBGTUJRBR-NAKRPEOUSA-N 0.000 description 1
- VTKPSXWRUGCOAC-GUBZILKMSA-N Met-Ala-Met Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCSC VTKPSXWRUGCOAC-GUBZILKMSA-N 0.000 description 1
- DTICLBJHRYSJLH-GUBZILKMSA-N Met-Ala-Val Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O DTICLBJHRYSJLH-GUBZILKMSA-N 0.000 description 1
- XMMWDTUFTZMQFD-GMOBBJLQSA-N Met-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CCSC XMMWDTUFTZMQFD-GMOBBJLQSA-N 0.000 description 1
- GPAHWYRSHCKICP-GUBZILKMSA-N Met-Glu-Glu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O GPAHWYRSHCKICP-GUBZILKMSA-N 0.000 description 1
- AEQVPPGEJJBFEE-CYDGBPFRSA-N Met-Ile-Arg Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O AEQVPPGEJJBFEE-CYDGBPFRSA-N 0.000 description 1
- DJBCKVNHEIJLQA-GMOBBJLQSA-N Met-Ile-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCSC)N DJBCKVNHEIJLQA-GMOBBJLQSA-N 0.000 description 1
- UROWNMBTQGGTHB-DCAQKATOSA-N Met-Leu-Asp Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O UROWNMBTQGGTHB-DCAQKATOSA-N 0.000 description 1
- RMLLCGYYVZKKRT-CIUDSAMLSA-N Met-Ser-Glu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O RMLLCGYYVZKKRT-CIUDSAMLSA-N 0.000 description 1
- QQPMHUCGDRJFQK-RHYQMDGZSA-N Met-Thr-Leu Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(C)C QQPMHUCGDRJFQK-RHYQMDGZSA-N 0.000 description 1
- 108060004795 Methyltransferase Proteins 0.000 description 1
- 101710081079 Minor spike protein H Proteins 0.000 description 1
- 241001529936 Murinae Species 0.000 description 1
- NQTADLQHYWFPDB-UHFFFAOYSA-N N-Hydroxysuccinimide Chemical group ON1C(=O)CCC1=O NQTADLQHYWFPDB-UHFFFAOYSA-N 0.000 description 1
- 108010002311 N-glycylglutamic acid Proteins 0.000 description 1
- 125000000729 N-terminal amino-acid group Chemical group 0.000 description 1
- 108010066427 N-valyltryptophan Proteins 0.000 description 1
- 206010028813 Nausea Diseases 0.000 description 1
- 239000000020 Nitrocellulose Substances 0.000 description 1
- 101710144127 Non-structural protein 1 Proteins 0.000 description 1
- 101710188643 Non-structural protein 3c Proteins 0.000 description 1
- 101800000440 Non-structural protein NS3A Proteins 0.000 description 1
- 108020004711 Nucleic Acid Probes Proteins 0.000 description 1
- 108091005461 Nucleic proteins Proteins 0.000 description 1
- 239000004677 Nylon Substances 0.000 description 1
- 108010058846 Ovalbumin Proteins 0.000 description 1
- 108090000526 Papain Proteins 0.000 description 1
- 102000057297 Pepsin A Human genes 0.000 description 1
- 108090000284 Pepsin A Proteins 0.000 description 1
- 108010043958 Peptoids Proteins 0.000 description 1
- QMMRHASQEVCJGR-UBHSHLNASA-N Phe-Ala-Pro Chemical compound C([C@H](N)C(=O)N[C@@H](C)C(=O)N1[C@@H](CCC1)C(O)=O)C1=CC=CC=C1 QMMRHASQEVCJGR-UBHSHLNASA-N 0.000 description 1
- YYRCPTVAPLQRNC-ULQDDVLXSA-N Phe-Arg-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCNC(N)=N)NC(=O)[C@@H](N)CC1=CC=CC=C1 YYRCPTVAPLQRNC-ULQDDVLXSA-N 0.000 description 1
- QCHNRQQVLJYDSI-DLOVCJGASA-N Phe-Asn-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 QCHNRQQVLJYDSI-DLOVCJGASA-N 0.000 description 1
- MECSIDWUTYRHRJ-KKUMJFAQSA-N Phe-Asn-Leu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O MECSIDWUTYRHRJ-KKUMJFAQSA-N 0.000 description 1
- FSPGBMWPNMRWDB-AVGNSLFASA-N Phe-Cys-Gln Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N FSPGBMWPNMRWDB-AVGNSLFASA-N 0.000 description 1
- UMKYAYXCMYYNHI-AVGNSLFASA-N Phe-Gln-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC(=O)N)C(=O)O)N UMKYAYXCMYYNHI-AVGNSLFASA-N 0.000 description 1
- OPEVYHFJXLCCRT-AVGNSLFASA-N Phe-Gln-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O OPEVYHFJXLCCRT-AVGNSLFASA-N 0.000 description 1
- CDQCFGOQNYOICK-IHRRRGAJSA-N Phe-Glu-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 CDQCFGOQNYOICK-IHRRRGAJSA-N 0.000 description 1
- APJPXSFJBMMOLW-KBPBESRZSA-N Phe-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC1=CC=CC=C1 APJPXSFJBMMOLW-KBPBESRZSA-N 0.000 description 1
- ISYSEOWLRQKQEQ-JYJNAYRXSA-N Phe-His-Glu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(O)=O)C(O)=O ISYSEOWLRQKQEQ-JYJNAYRXSA-N 0.000 description 1
- VADLTGVIOIOKGM-BZSNNMDCSA-N Phe-His-Leu Chemical compound C([C@@H](C(=O)N[C@@H](CC(C)C)C(O)=O)NC(=O)[C@@H](N)CC=1C=CC=CC=1)C1=CN=CN1 VADLTGVIOIOKGM-BZSNNMDCSA-N 0.000 description 1
- BVHFFNYBKRTSIU-MEYUZBJRSA-N Phe-His-Thr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O BVHFFNYBKRTSIU-MEYUZBJRSA-N 0.000 description 1
- RGZYXNFHYRFNNS-MXAVVETBSA-N Phe-Ile-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N RGZYXNFHYRFNNS-MXAVVETBSA-N 0.000 description 1
- JQLQUPIYYJXZLJ-ZEWNOJEFSA-N Phe-Ile-Tyr Chemical compound C([C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=CC=C1 JQLQUPIYYJXZLJ-ZEWNOJEFSA-N 0.000 description 1
- YKUGPVXSDOOANW-KKUMJFAQSA-N Phe-Leu-Asp Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O YKUGPVXSDOOANW-KKUMJFAQSA-N 0.000 description 1
- YTILBRIUASDGBL-BZSNNMDCSA-N Phe-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 YTILBRIUASDGBL-BZSNNMDCSA-N 0.000 description 1
- OWSLLRKCHLTUND-BZSNNMDCSA-N Phe-Phe-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CC=CC=C2)C(=O)N[C@@H](CC(=O)N)C(=O)O)N OWSLLRKCHLTUND-BZSNNMDCSA-N 0.000 description 1
- JLLJTMHNXQTMCK-UBHSHLNASA-N Phe-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CC1=CC=CC=C1 JLLJTMHNXQTMCK-UBHSHLNASA-N 0.000 description 1
- NJJBATPLUQHRBM-IHRRRGAJSA-N Phe-Pro-Ser Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)N)C(=O)N[C@@H](CO)C(=O)O NJJBATPLUQHRBM-IHRRRGAJSA-N 0.000 description 1
- XOHJOMKCRLHGCY-UNQGMJICSA-N Phe-Pro-Thr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O XOHJOMKCRLHGCY-UNQGMJICSA-N 0.000 description 1
- ODGNUUUDJONJSC-UFYCRDLUSA-N Phe-Pro-Tyr Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)N)C(=O)N[C@@H](CC3=CC=C(C=C3)O)C(=O)O ODGNUUUDJONJSC-UFYCRDLUSA-N 0.000 description 1
- WEDZFLRYSIDIRX-IHRRRGAJSA-N Phe-Ser-Arg Chemical compound NC(=N)NCCC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CC=CC=C1 WEDZFLRYSIDIRX-IHRRRGAJSA-N 0.000 description 1
- IIEOLPMQYRBZCN-SRVKXCTJSA-N Phe-Ser-Cys Chemical compound N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)O IIEOLPMQYRBZCN-SRVKXCTJSA-N 0.000 description 1
- MRWOVVNKSXXLRP-IHPCNDPISA-N Phe-Ser-Trp Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O MRWOVVNKSXXLRP-IHPCNDPISA-N 0.000 description 1
- BPIMVBKDLSBKIJ-FCLVOEFKSA-N Phe-Thr-Phe Chemical compound C([C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 BPIMVBKDLSBKIJ-FCLVOEFKSA-N 0.000 description 1
- SHUFSZDAIPLZLF-BEAPCOKYSA-N Phe-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CC=CC=C2)N)O SHUFSZDAIPLZLF-BEAPCOKYSA-N 0.000 description 1
- GNRMAQSIROFNMI-IXOXFDKPSA-N Phe-Thr-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O GNRMAQSIROFNMI-IXOXFDKPSA-N 0.000 description 1
- KIQUCMUULDXTAZ-HJOGWXRNSA-N Phe-Tyr-Tyr Chemical compound N[C@@H](Cc1ccccc1)C(=O)N[C@@H](Cc1ccc(O)cc1)C(=O)N[C@@H](Cc1ccc(O)cc1)C(O)=O KIQUCMUULDXTAZ-HJOGWXRNSA-N 0.000 description 1
- GAMLAXHLYGLQBJ-UFYCRDLUSA-N Phe-Val-Tyr Chemical compound N[C@H](C(=O)N[C@H](C(=O)N[C@H](C(=O)O)CC1=CC=C(C=C1)O)C(C)C)CC1=CC=CC=C1 GAMLAXHLYGLQBJ-UFYCRDLUSA-N 0.000 description 1
- 108010010677 Phosphodiesterase I Proteins 0.000 description 1
- 108090000608 Phosphoric Monoester Hydrolases Proteins 0.000 description 1
- 102000004160 Phosphoric Monoester Hydrolases Human genes 0.000 description 1
- 241000709664 Picornaviridae Species 0.000 description 1
- 239000004952 Polyamide Substances 0.000 description 1
- 108010020346 Polyglutamic Acid Proteins 0.000 description 1
- 229920000954 Polyglycolide Polymers 0.000 description 1
- 108010039918 Polylysine Proteins 0.000 description 1
- 108010076039 Polyproteins Proteins 0.000 description 1
- FZHBZMDRDASUHN-NAKRPEOUSA-N Pro-Ala-Ile Chemical compound CC[C@H](C)[C@H](NC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1)C(O)=O FZHBZMDRDASUHN-NAKRPEOUSA-N 0.000 description 1
- NHDVNAKDACFHPX-GUBZILKMSA-N Pro-Arg-Ala Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(O)=O NHDVNAKDACFHPX-GUBZILKMSA-N 0.000 description 1
- QSKCKTUQPICLSO-AVGNSLFASA-N Pro-Arg-Lys Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCCN)C(=O)O QSKCKTUQPICLSO-AVGNSLFASA-N 0.000 description 1
- XKHCJJPNXFBADI-DCAQKATOSA-N Pro-Asp-Lys Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O XKHCJJPNXFBADI-DCAQKATOSA-N 0.000 description 1
- XJROSHJRQTXWAE-XGEHTFHBSA-N Pro-Cys-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CS)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XJROSHJRQTXWAE-XGEHTFHBSA-N 0.000 description 1
- UAYHMOIGIQZLFR-NHCYSSNCSA-N Pro-Gln-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O UAYHMOIGIQZLFR-NHCYSSNCSA-N 0.000 description 1
- MGDFPGCFVJFITQ-CIUDSAMLSA-N Pro-Glu-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O MGDFPGCFVJFITQ-CIUDSAMLSA-N 0.000 description 1
- NXEYSLRNNPWCRN-SRVKXCTJSA-N Pro-Glu-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O NXEYSLRNNPWCRN-SRVKXCTJSA-N 0.000 description 1
- FEPSEIDIPBMIOS-QXEWZRGKSA-N Pro-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H]1CCCN1 FEPSEIDIPBMIOS-QXEWZRGKSA-N 0.000 description 1
- FEVDNIBDCRKMER-IUCAKERBSA-N Pro-Gly-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)CNC(=O)[C@@H]1CCCN1 FEVDNIBDCRKMER-IUCAKERBSA-N 0.000 description 1
- SMFQZMGHCODUPQ-ULQDDVLXSA-N Pro-Lys-Phe Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O SMFQZMGHCODUPQ-ULQDDVLXSA-N 0.000 description 1
- HBBBLSVBQGZKOZ-GUBZILKMSA-N Pro-Met-Ala Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C)C(O)=O HBBBLSVBQGZKOZ-GUBZILKMSA-N 0.000 description 1
- MLKVIVZCFYRTIR-KKUMJFAQSA-N Pro-Phe-Gln Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O MLKVIVZCFYRTIR-KKUMJFAQSA-N 0.000 description 1
- DSGSTPRKNYHGCL-JYJNAYRXSA-N Pro-Phe-Met Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCSC)C(O)=O DSGSTPRKNYHGCL-JYJNAYRXSA-N 0.000 description 1
- GFHXZNVJIKMAGO-IHRRRGAJSA-N Pro-Phe-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O GFHXZNVJIKMAGO-IHRRRGAJSA-N 0.000 description 1
- OWQXAJQZLWHPBH-FXQIFTODSA-N Pro-Ser-Asn Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O OWQXAJQZLWHPBH-FXQIFTODSA-N 0.000 description 1
- FDMCIBSQRKFSTJ-RHYQMDGZSA-N Pro-Thr-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O FDMCIBSQRKFSTJ-RHYQMDGZSA-N 0.000 description 1
- KHRLUIPIMIQFGT-AVGNSLFASA-N Pro-Val-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O KHRLUIPIMIQFGT-AVGNSLFASA-N 0.000 description 1
- 108010076504 Protein Sorting Signals Proteins 0.000 description 1
- KDCGOANMDULRCW-UHFFFAOYSA-N Purine Natural products N1=CNC2=NC=NC2=C1 KDCGOANMDULRCW-UHFFFAOYSA-N 0.000 description 1
- 206010037660 Pyrexia Diseases 0.000 description 1
- CZPWVGJYEJSRLH-UHFFFAOYSA-N Pyrimidine Chemical compound C1=CN=CN=C1 CZPWVGJYEJSRLH-UHFFFAOYSA-N 0.000 description 1
- 108010079005 RDV peptide Proteins 0.000 description 1
- 108091034057 RNA (poly(A)) Proteins 0.000 description 1
- 230000006819 RNA synthesis Effects 0.000 description 1
- 230000004570 RNA-binding Effects 0.000 description 1
- 108020004511 Recombinant DNA Proteins 0.000 description 1
- 108091081062 Repeated sequence (DNA) Proteins 0.000 description 1
- 241000702670 Rotavirus Species 0.000 description 1
- 108700031314 Rotavirus VP6 Proteins 0.000 description 1
- 241001468001 Salmonella virus SP6 Species 0.000 description 1
- 108091081021 Sense strand Proteins 0.000 description 1
- HRNQLKCLPVKZNE-CIUDSAMLSA-N Ser-Ala-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O HRNQLKCLPVKZNE-CIUDSAMLSA-N 0.000 description 1
- HQTKVSCNCDLXSX-BQBZGAKWSA-N Ser-Arg-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O HQTKVSCNCDLXSX-BQBZGAKWSA-N 0.000 description 1
- KYKKKSWGEPFUMR-NAKRPEOUSA-N Ser-Arg-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KYKKKSWGEPFUMR-NAKRPEOUSA-N 0.000 description 1
- XVAUJOAYHWWNQF-ZLUOBGJFSA-N Ser-Asn-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(O)=O XVAUJOAYHWWNQF-ZLUOBGJFSA-N 0.000 description 1
- VGNYHOBZJKWRGI-CIUDSAMLSA-N Ser-Asn-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CO VGNYHOBZJKWRGI-CIUDSAMLSA-N 0.000 description 1
- DKKGAAJTDKHWOD-BIIVOSGPSA-N Ser-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)[C@H](CO)N)C(=O)O DKKGAAJTDKHWOD-BIIVOSGPSA-N 0.000 description 1
- FTVRVZNYIYWJGB-ACZMJKKPSA-N Ser-Asp-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O FTVRVZNYIYWJGB-ACZMJKKPSA-N 0.000 description 1
- MMAPOBOTRUVNKJ-ZLUOBGJFSA-N Ser-Asp-Ser Chemical compound C([C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CO)N)C(=O)O MMAPOBOTRUVNKJ-ZLUOBGJFSA-N 0.000 description 1
- UOLGINIHBRIECN-FXQIFTODSA-N Ser-Glu-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O UOLGINIHBRIECN-FXQIFTODSA-N 0.000 description 1
- LALNXSXEYFUUDD-GUBZILKMSA-N Ser-Glu-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O LALNXSXEYFUUDD-GUBZILKMSA-N 0.000 description 1
- DSGYZICNAMEJOC-AVGNSLFASA-N Ser-Glu-Phe Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O DSGYZICNAMEJOC-AVGNSLFASA-N 0.000 description 1
- YMTLKLXDFCSCNX-BYPYZUCNSA-N Ser-Gly-Gly Chemical compound OC[C@H](N)C(=O)NCC(=O)NCC(O)=O YMTLKLXDFCSCNX-BYPYZUCNSA-N 0.000 description 1
- UIGMAMGZOJVTDN-WHFBIAKZSA-N Ser-Gly-Ser Chemical compound OC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O UIGMAMGZOJVTDN-WHFBIAKZSA-N 0.000 description 1
- XXXAXOWMBOKTRN-XPUUQOCRSA-N Ser-Gly-Val Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O XXXAXOWMBOKTRN-XPUUQOCRSA-N 0.000 description 1
- ZUDXUJSYCCNZQJ-DCAQKATOSA-N Ser-His-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CO)N ZUDXUJSYCCNZQJ-DCAQKATOSA-N 0.000 description 1
- BKZYBLLIBOBOOW-GHCJXIJMSA-N Ser-Ile-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O BKZYBLLIBOBOOW-GHCJXIJMSA-N 0.000 description 1
- VZQRNAYURWAEFE-KKUMJFAQSA-N Ser-Leu-Phe Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 VZQRNAYURWAEFE-KKUMJFAQSA-N 0.000 description 1
- PPNPDKGQRFSCAC-CIUDSAMLSA-N Ser-Lys-Asp Chemical compound NCCCC[C@H](NC(=O)[C@@H](N)CO)C(=O)N[C@@H](CC(O)=O)C(O)=O PPNPDKGQRFSCAC-CIUDSAMLSA-N 0.000 description 1
- OWCVUSJMEBGMOK-YUMQZZPRSA-N Ser-Lys-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O OWCVUSJMEBGMOK-YUMQZZPRSA-N 0.000 description 1
- LRWBCWGEUCKDTN-BJDJZHNGSA-N Ser-Lys-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O LRWBCWGEUCKDTN-BJDJZHNGSA-N 0.000 description 1
- XUDRHBPSPAPDJP-SRVKXCTJSA-N Ser-Lys-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CO XUDRHBPSPAPDJP-SRVKXCTJSA-N 0.000 description 1
- QSHKTZVJGDVFEW-GUBZILKMSA-N Ser-Met-Met Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CO)N QSHKTZVJGDVFEW-GUBZILKMSA-N 0.000 description 1
- RHAPJNVNWDBFQI-BQBZGAKWSA-N Ser-Pro-Gly Chemical compound OC[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O RHAPJNVNWDBFQI-BQBZGAKWSA-N 0.000 description 1
- PYTKULIABVRXSC-BWBBJGPYSA-N Ser-Ser-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PYTKULIABVRXSC-BWBBJGPYSA-N 0.000 description 1
- KKKVOZNCLALMPV-XKBZYTNZSA-N Ser-Thr-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O KKKVOZNCLALMPV-XKBZYTNZSA-N 0.000 description 1
- WMZVVNLPHFSUPA-BPUTZDHNSA-N Ser-Trp-Arg Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CO)N)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O)=CNC2=C1 WMZVVNLPHFSUPA-BPUTZDHNSA-N 0.000 description 1
- FHXGMDRKJHKLKW-QWRGUYRKSA-N Ser-Tyr-Gly Chemical compound OC[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CC=C(O)C=C1 FHXGMDRKJHKLKW-QWRGUYRKSA-N 0.000 description 1
- LSHUNRICNSEEAN-BPUTZDHNSA-N Ser-Val-Trp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CO)N LSHUNRICNSEEAN-BPUTZDHNSA-N 0.000 description 1
- 108010003723 Single-Domain Antibodies Proteins 0.000 description 1
- 238000002105 Southern blotting Methods 0.000 description 1
- 108010090804 Streptavidin Proteins 0.000 description 1
- 101710172711 Structural protein Proteins 0.000 description 1
- 101710137500 T7 RNA polymerase Proteins 0.000 description 1
- 206010043376 Tetanus Diseases 0.000 description 1
- 241000205180 Thermococcus litoralis Species 0.000 description 1
- 241000589500 Thermus aquaticus Species 0.000 description 1
- 241000589499 Thermus thermophilus Species 0.000 description 1
- RYYWUUFWQRZTIU-UHFFFAOYSA-N Thiophosphoric acid Chemical class OP(O)(S)=O RYYWUUFWQRZTIU-UHFFFAOYSA-N 0.000 description 1
- PXQUBKWZENPDGE-CIQUZCHMSA-N Thr-Ala-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](C)NC(=O)[C@H]([C@@H](C)O)N PXQUBKWZENPDGE-CIQUZCHMSA-N 0.000 description 1
- BSNZTJXVDOINSR-JXUBOQSCSA-N Thr-Ala-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O BSNZTJXVDOINSR-JXUBOQSCSA-N 0.000 description 1
- LVHHEVGYAZGXDE-KDXUFGMBSA-N Thr-Ala-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C)C(=O)N1CCC[C@@H]1C(=O)O)N)O LVHHEVGYAZGXDE-KDXUFGMBSA-N 0.000 description 1
- CAJFZCICSVBOJK-SHGPDSBTSA-N Thr-Ala-Thr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O CAJFZCICSVBOJK-SHGPDSBTSA-N 0.000 description 1
- TWLMXDWFVNEFFK-FJXKBIBVSA-N Thr-Arg-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O TWLMXDWFVNEFFK-FJXKBIBVSA-N 0.000 description 1
- NLSNVZAREYQMGR-HJGDQZAQSA-N Thr-Asp-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O NLSNVZAREYQMGR-HJGDQZAQSA-N 0.000 description 1
- KWQBJOUOSNJDRR-XAVMHZPKSA-N Thr-Cys-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CS)C(=O)N1CCC[C@@H]1C(=O)O)N)O KWQBJOUOSNJDRR-XAVMHZPKSA-N 0.000 description 1
- GUZGCDIZVGODML-NKIYYHGXSA-N Thr-Gln-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N)O GUZGCDIZVGODML-NKIYYHGXSA-N 0.000 description 1
- UDQBCBUXAQIZAK-GLLZPBPUSA-N Thr-Glu-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O UDQBCBUXAQIZAK-GLLZPBPUSA-N 0.000 description 1
- BIENEHRYNODTLP-HJGDQZAQSA-N Thr-Glu-Met Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCSC)C(=O)O)N)O BIENEHRYNODTLP-HJGDQZAQSA-N 0.000 description 1
- SLUWOCTZVGMURC-BFHQHQDPSA-N Thr-Gly-Ala Chemical compound C[C@@H](O)[C@H](N)C(=O)NCC(=O)N[C@@H](C)C(O)=O SLUWOCTZVGMURC-BFHQHQDPSA-N 0.000 description 1
- XPNSAQMEAVSQRD-FBCQKBJTSA-N Thr-Gly-Gly Chemical compound C[C@@H](O)[C@H](N)C(=O)NCC(=O)NCC(O)=O XPNSAQMEAVSQRD-FBCQKBJTSA-N 0.000 description 1
- GMXIJHCBTZDAPD-QPHKQPEJSA-N Thr-Ile-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)NC(=O)[C@H]([C@@H](C)O)N GMXIJHCBTZDAPD-QPHKQPEJSA-N 0.000 description 1
- TZJSEJOXAIWOST-RHYQMDGZSA-N Thr-Lys-Arg Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CCCN=C(N)N TZJSEJOXAIWOST-RHYQMDGZSA-N 0.000 description 1
- BDENGIGFTNYZSJ-RCWTZXSCSA-N Thr-Pro-Met Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCSC)C(O)=O BDENGIGFTNYZSJ-RCWTZXSCSA-N 0.000 description 1
- KERCOYANYUPLHJ-XGEHTFHBSA-N Thr-Pro-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O KERCOYANYUPLHJ-XGEHTFHBSA-N 0.000 description 1
- IVDFVBVIVLJJHR-LKXGYXEUSA-N Thr-Ser-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O IVDFVBVIVLJJHR-LKXGYXEUSA-N 0.000 description 1
- VBMOVTMNHWPZJR-SUSMZKCASA-N Thr-Thr-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O VBMOVTMNHWPZJR-SUSMZKCASA-N 0.000 description 1
- VGNLMPBYWWNQFS-ZEILLAHLSA-N Thr-Thr-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N)O VGNLMPBYWWNQFS-ZEILLAHLSA-N 0.000 description 1
- NHQVWACSJZJCGJ-FLBSBUHZSA-N Thr-Thr-Ile Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O NHQVWACSJZJCGJ-FLBSBUHZSA-N 0.000 description 1
- PJCYRZVSACOYSN-ZJDVBMNYSA-N Thr-Thr-Met Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCSC)C(O)=O PJCYRZVSACOYSN-ZJDVBMNYSA-N 0.000 description 1
- ZEJBJDHSQPOVJV-UAXMHLISSA-N Thr-Trp-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ZEJBJDHSQPOVJV-UAXMHLISSA-N 0.000 description 1
- LXXCHJKHJYRMIY-FQPOAREZSA-N Thr-Tyr-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C)C(O)=O LXXCHJKHJYRMIY-FQPOAREZSA-N 0.000 description 1
- KVEWWQRTAVMOFT-KJEVXHAQSA-N Thr-Tyr-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(O)=O KVEWWQRTAVMOFT-KJEVXHAQSA-N 0.000 description 1
- 108010034949 Thyroglobulin Proteins 0.000 description 1
- 102000009843 Thyroglobulin Human genes 0.000 description 1
- RTAQQCXQSZGOHL-UHFFFAOYSA-N Titanium Chemical compound [Ti] RTAQQCXQSZGOHL-UHFFFAOYSA-N 0.000 description 1
- 108700009124 Transcription Initiation Site Proteins 0.000 description 1
- 108091023040 Transcription factor Proteins 0.000 description 1
- 102000040945 Transcription factor Human genes 0.000 description 1
- 108091061763 Triple-stranded DNA Proteins 0.000 description 1
- XGFGVFMXDXALEV-XIRDDKMYSA-N Trp-Leu-Asn Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N XGFGVFMXDXALEV-XIRDDKMYSA-N 0.000 description 1
- UIRPULWLRODAEQ-QEJZJMRPSA-N Trp-Ser-Glu Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(O)=O)=CNC2=C1 UIRPULWLRODAEQ-QEJZJMRPSA-N 0.000 description 1
- WSMVEHPVOYXPAQ-XIRDDKMYSA-N Trp-Ser-Lys Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)O)N WSMVEHPVOYXPAQ-XIRDDKMYSA-N 0.000 description 1
- LNGFWVPNKLWATF-ZVZYQTTQSA-N Trp-Val-Glu Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O LNGFWVPNKLWATF-ZVZYQTTQSA-N 0.000 description 1
- QYSBJAUCUKHSLU-JYJNAYRXSA-N Tyr-Arg-Val Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(O)=O QYSBJAUCUKHSLU-JYJNAYRXSA-N 0.000 description 1
- WZQZUVWEPMGIMM-JYJNAYRXSA-N Tyr-Gln-Lys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N)O WZQZUVWEPMGIMM-JYJNAYRXSA-N 0.000 description 1
- FMOSEWZYZPMJAL-KKUMJFAQSA-N Tyr-Glu-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N FMOSEWZYZPMJAL-KKUMJFAQSA-N 0.000 description 1
- HDSKHCBAVVWPCQ-FHWLQOOXSA-N Tyr-Glu-Phe Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O HDSKHCBAVVWPCQ-FHWLQOOXSA-N 0.000 description 1
- NSGZILIDHCIZAM-KKUMJFAQSA-N Tyr-Leu-Ser Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N NSGZILIDHCIZAM-KKUMJFAQSA-N 0.000 description 1
- OLYXUGBVBGSZDN-ACRUOGEOSA-N Tyr-Leu-Tyr Chemical compound C([C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=C(O)C=C1 OLYXUGBVBGSZDN-ACRUOGEOSA-N 0.000 description 1
- SINRIKQYQJRGDQ-MEYUZBJRSA-N Tyr-Lys-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 SINRIKQYQJRGDQ-MEYUZBJRSA-N 0.000 description 1
- WURLIFOWSMBUAR-SLFFLAALSA-N Tyr-Phe-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)NC(=O)[C@H](CC3=CC=C(C=C3)O)N)C(=O)O WURLIFOWSMBUAR-SLFFLAALSA-N 0.000 description 1
- QPOUERMDWKKZEG-HJPIBITLSA-N Tyr-Ser-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 QPOUERMDWKKZEG-HJPIBITLSA-N 0.000 description 1
- MQGGXGKQSVEQHR-KKUMJFAQSA-N Tyr-Ser-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 MQGGXGKQSVEQHR-KKUMJFAQSA-N 0.000 description 1
- LVFZXRQQQDTBQH-IRIUXVKKSA-N Tyr-Thr-Glu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O LVFZXRQQQDTBQH-IRIUXVKKSA-N 0.000 description 1
- CLEGSEJVGBYZBJ-MEYUZBJRSA-N Tyr-Thr-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H]([C@H](O)C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 CLEGSEJVGBYZBJ-MEYUZBJRSA-N 0.000 description 1
- QRCBQDPRKMYTMB-IHPCNDPISA-N Tyr-Trp-Ser Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC3=CC=C(C=C3)O)N QRCBQDPRKMYTMB-IHPCNDPISA-N 0.000 description 1
- NVJCMGGZHOJNBU-UFYCRDLUSA-N Tyr-Val-Phe Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)N NVJCMGGZHOJNBU-UFYCRDLUSA-N 0.000 description 1
- 108091023045 Untranslated Region Proteins 0.000 description 1
- 101150024766 VP1 gene Proteins 0.000 description 1
- 101150093578 VP2 gene Proteins 0.000 description 1
- 101150036700 VP3 gene Proteins 0.000 description 1
- PAPWZOJOLKZEFR-AVGNSLFASA-N Val-Arg-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCCN)C(=O)O)N PAPWZOJOLKZEFR-AVGNSLFASA-N 0.000 description 1
- CWOSXNKDOACNJN-BZSNNMDCSA-N Val-Arg-Trp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N CWOSXNKDOACNJN-BZSNNMDCSA-N 0.000 description 1
- QGFPYRPIUXBYGR-YDHLFZDLSA-N Val-Asn-Phe Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N QGFPYRPIUXBYGR-YDHLFZDLSA-N 0.000 description 1
- XQVRMLRMTAGSFJ-QXEWZRGKSA-N Val-Asp-Arg Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N XQVRMLRMTAGSFJ-QXEWZRGKSA-N 0.000 description 1
- TZVUSFMQWPWHON-NHCYSSNCSA-N Val-Asp-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](C(C)C)N TZVUSFMQWPWHON-NHCYSSNCSA-N 0.000 description 1
- QHFQQRKNGCXTHL-AUTRQRHGSA-N Val-Gln-Glu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O QHFQQRKNGCXTHL-AUTRQRHGSA-N 0.000 description 1
- JXGWQYWDUOWQHA-DZKIICNBSA-N Val-Gln-Phe Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N JXGWQYWDUOWQHA-DZKIICNBSA-N 0.000 description 1
- PWRITNSESKQTPW-NRPADANISA-N Val-Gln-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CO)C(=O)O)N PWRITNSESKQTPW-NRPADANISA-N 0.000 description 1
- GBESYURLQOYWLU-LAEOZQHASA-N Val-Glu-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N GBESYURLQOYWLU-LAEOZQHASA-N 0.000 description 1
- SZTTYWIUCGSURQ-AUTRQRHGSA-N Val-Glu-Glu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O SZTTYWIUCGSURQ-AUTRQRHGSA-N 0.000 description 1
- PMDOQZFYGWZSTK-LSJOCFKGSA-N Val-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)C(C)C PMDOQZFYGWZSTK-LSJOCFKGSA-N 0.000 description 1
- URIRWLJVWHYLET-ONGXEEELSA-N Val-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)C(C)C URIRWLJVWHYLET-ONGXEEELSA-N 0.000 description 1
- YTUABZMPYKCWCQ-XQQFMLRXSA-N Val-His-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N2CCC[C@@H]2C(=O)O)N YTUABZMPYKCWCQ-XQQFMLRXSA-N 0.000 description 1
- WNZSAUMKZQXHNC-UKJIMTQDSA-N Val-Ile-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](C(C)C)N WNZSAUMKZQXHNC-UKJIMTQDSA-N 0.000 description 1
- KISFXYYRKKNLOP-IHRRRGAJSA-N Val-Phe-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)O)N KISFXYYRKKNLOP-IHRRRGAJSA-N 0.000 description 1
- YKNOJPJWNVHORX-UNQGMJICSA-N Val-Phe-Thr Chemical compound CC(C)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H]([C@@H](C)O)C(O)=O)CC1=CC=CC=C1 YKNOJPJWNVHORX-UNQGMJICSA-N 0.000 description 1
- AIWLHFZYOUUJGB-UFYCRDLUSA-N Val-Phe-Tyr Chemical compound C([C@H](NC(=O)[C@@H](N)C(C)C)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=CC=C1 AIWLHFZYOUUJGB-UFYCRDLUSA-N 0.000 description 1
- ZXYPHBKIZLAQTL-QXEWZRGKSA-N Val-Pro-Asp Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(=O)O)C(=O)O)N ZXYPHBKIZLAQTL-QXEWZRGKSA-N 0.000 description 1
- GQMNEJMFMCJJTD-NHCYSSNCSA-N Val-Pro-Gln Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O GQMNEJMFMCJJTD-NHCYSSNCSA-N 0.000 description 1
- SSYBNWFXCFNRFN-GUBZILKMSA-N Val-Pro-Ser Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O SSYBNWFXCFNRFN-GUBZILKMSA-N 0.000 description 1
- UGFMVXRXULGLNO-XPUUQOCRSA-N Val-Ser-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O UGFMVXRXULGLNO-XPUUQOCRSA-N 0.000 description 1
- PGQUDQYHWICSAB-NAKRPEOUSA-N Val-Ser-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](C(C)C)N PGQUDQYHWICSAB-NAKRPEOUSA-N 0.000 description 1
- VHIZXDZMTDVFGX-DCAQKATOSA-N Val-Ser-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](C(C)C)N VHIZXDZMTDVFGX-DCAQKATOSA-N 0.000 description 1
- PZTZYZUTCPZWJH-FXQIFTODSA-N Val-Ser-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)O)N PZTZYZUTCPZWJH-FXQIFTODSA-N 0.000 description 1
- UFCHCOKFAGOQSF-BQFCYCMXSA-N Val-Trp-Glu Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N UFCHCOKFAGOQSF-BQFCYCMXSA-N 0.000 description 1
- AOILQMZPNLUXCM-AVGNSLFASA-N Val-Val-Lys Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCCN AOILQMZPNLUXCM-AVGNSLFASA-N 0.000 description 1
- LLJLBRRXKZTTRD-GUBZILKMSA-N Val-Val-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(=O)O)N LLJLBRRXKZTTRD-GUBZILKMSA-N 0.000 description 1
- 108020000999 Viral RNA Proteins 0.000 description 1
- IXKSXJFAGXLQOQ-XISFHERQSA-N WHWLQLKPGQPMY Chemical compound C([C@@H](C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)NC(=O)[C@@H](N)CC=1C2=CC=CC=C2NC=1)C1=CNC=N1 IXKSXJFAGXLQOQ-XISFHERQSA-N 0.000 description 1
- 206010000059 abdominal discomfort Diseases 0.000 description 1
- 230000021736 acetylation Effects 0.000 description 1
- 238000006640 acetylation reaction Methods 0.000 description 1
- DPKHZNPWBDQZCN-UHFFFAOYSA-N acridine orange free base Chemical compound C1=CC(N(C)C)=CC2=NC3=CC(N(C)C)=CC=C3C=C21 DPKHZNPWBDQZCN-UHFFFAOYSA-N 0.000 description 1
- 150000001251 acridines Chemical class 0.000 description 1
- 230000004913 activation Effects 0.000 description 1
- 235000004279 alanine Nutrition 0.000 description 1
- 108010076324 alanyl-glycyl-glycine Proteins 0.000 description 1
- 108010069020 alanyl-prolyl-glycine Proteins 0.000 description 1
- 108010047495 alanylglycine Proteins 0.000 description 1
- 239000012670 alkaline solution Substances 0.000 description 1
- 239000002168 alkylating agent Substances 0.000 description 1
- 230000006229 amino acid addition Effects 0.000 description 1
- 125000003277 amino group Chemical group 0.000 description 1
- 208000022531 anorexia Diseases 0.000 description 1
- 230000000840 anti-viral effect Effects 0.000 description 1
- 239000003443 antiviral agent Substances 0.000 description 1
- 229940121357 antivirals Drugs 0.000 description 1
- PYMYPHUHKUWMLA-UHFFFAOYSA-N arabinose Natural products OCC(O)C(O)C(O)C=O PYMYPHUHKUWMLA-UHFFFAOYSA-N 0.000 description 1
- 108010091092 arginyl-glycyl-proline Proteins 0.000 description 1
- 108010068380 arginylarginine Proteins 0.000 description 1
- 108010060035 arginylproline Proteins 0.000 description 1
- 108010077245 asparaginyl-proline Proteins 0.000 description 1
- 235000003704 aspartic acid Nutrition 0.000 description 1
- 108010038633 aspartylglutamate Proteins 0.000 description 1
- 238000002820 assay format Methods 0.000 description 1
- SRBFZHDQGSBBOR-UHFFFAOYSA-N beta-D-Pyranose-Lyxose Natural products OC1COC(O)C(O)C1O SRBFZHDQGSBBOR-UHFFFAOYSA-N 0.000 description 1
- OQFSQFPPLPISGP-UHFFFAOYSA-N beta-carboxyaspartic acid Natural products OC(=O)C(N)C(C(O)=O)C(O)=O OQFSQFPPLPISGP-UHFFFAOYSA-N 0.000 description 1
- 230000001588 bifunctional effect Effects 0.000 description 1
- 210000000941 bile Anatomy 0.000 description 1
- 102000023732 binding proteins Human genes 0.000 description 1
- 108091008324 binding proteins Proteins 0.000 description 1
- 230000004071 biological effect Effects 0.000 description 1
- 238000001574 biopsy Methods 0.000 description 1
- 235000020958 biotin Nutrition 0.000 description 1
- 230000006287 biotinylation Effects 0.000 description 1
- 238000007413 biotinylation Methods 0.000 description 1
- 239000010836 blood and blood product Substances 0.000 description 1
- 210000000601 blood cell Anatomy 0.000 description 1
- 239000003914 blood derivative Substances 0.000 description 1
- 229940125691 blood product Drugs 0.000 description 1
- 210000001124 body fluid Anatomy 0.000 description 1
- 210000001185 bone marrow Anatomy 0.000 description 1
- 229910052796 boron Inorganic materials 0.000 description 1
- KDPAWGWELVVRCH-UHFFFAOYSA-N bromoacetic acid Chemical compound OC(=O)CBr KDPAWGWELVVRCH-UHFFFAOYSA-N 0.000 description 1
- 244000309466 calf Species 0.000 description 1
- 150000004657 carbamic acid derivatives Chemical class 0.000 description 1
- 239000000298 carbocyanine Substances 0.000 description 1
- 150000001720 carbohydrates Chemical class 0.000 description 1
- 235000014633 carbohydrates Nutrition 0.000 description 1
- 125000003178 carboxy group Chemical group [H]OC(*)=O 0.000 description 1
- 230000015556 catabolic process Effects 0.000 description 1
- 238000006555 catalytic reaction Methods 0.000 description 1
- 230000010261 cell growth Effects 0.000 description 1
- 230000001413 cellular effect Effects 0.000 description 1
- 210000003850 cellular structure Anatomy 0.000 description 1
- 239000013599 cloning vector Substances 0.000 description 1
- 238000004590 computer program Methods 0.000 description 1
- 239000012141 concentrate Substances 0.000 description 1
- 239000003636 conditioned culture medium Substances 0.000 description 1
- 239000000470 constituent Substances 0.000 description 1
- 239000005289 controlled pore glass Substances 0.000 description 1
- 238000007796 conventional method Methods 0.000 description 1
- 238000001816 cooling Methods 0.000 description 1
- 239000007822 coupling agent Substances 0.000 description 1
- 108010060199 cysteinylproline Proteins 0.000 description 1
- 230000000120 cytopathologic effect Effects 0.000 description 1
- SUYVUBYJARFZHO-RRKCRQDMSA-N dATP Chemical compound C1=NC=2C(N)=NC=NC=2N1[C@H]1C[C@H](O)[C@@H](COP(O)(=O)OP(O)(=O)OP(O)(O)=O)O1 SUYVUBYJARFZHO-RRKCRQDMSA-N 0.000 description 1
- SUYVUBYJARFZHO-UHFFFAOYSA-N dATP Natural products C1=NC=2C(N)=NC=NC=2N1C1CC(O)C(COP(O)(=O)OP(O)(=O)OP(O)(O)=O)O1 SUYVUBYJARFZHO-UHFFFAOYSA-N 0.000 description 1
- RGWHQCVHVJXOKC-SHYZEUOFSA-J dCTP(4-) Chemical compound O=C1N=C(N)C=CN1[C@@H]1O[C@H](COP([O-])(=O)OP([O-])(=O)OP([O-])([O-])=O)[C@@H](O)C1 RGWHQCVHVJXOKC-SHYZEUOFSA-J 0.000 description 1
- HAAZLUGHYHWQIW-KVQBGUIXSA-N dGTP Chemical compound C1=NC=2C(=O)NC(N)=NC=2N1[C@H]1C[C@H](O)[C@@H](COP(O)(=O)OP(O)(=O)OP(O)(O)=O)O1 HAAZLUGHYHWQIW-KVQBGUIXSA-N 0.000 description 1
- NHVNXKFIZYSCEB-XLPZGREQSA-N dTTP Chemical compound O=C1NC(=O)C(C)=CN1[C@@H]1O[C@H](COP(O)(=O)OP(O)(=O)OP(O)(O)=O)[C@@H](O)C1 NHVNXKFIZYSCEB-XLPZGREQSA-N 0.000 description 1
- 206010061428 decreased appetite Diseases 0.000 description 1
- 125000002637 deoxyribonucleotide group Chemical group 0.000 description 1
- 239000003599 detergent Substances 0.000 description 1
- 239000000539 dimer Substances 0.000 description 1
- 206010013023 diphtheria Diseases 0.000 description 1
- 201000010099 disease Diseases 0.000 description 1
- 208000037265 diseases, disorders, signs and symptoms Diseases 0.000 description 1
- NAGJZTKCGNOGPW-UHFFFAOYSA-N dithiophosphoric acid Chemical class OP(O)(S)=S NAGJZTKCGNOGPW-UHFFFAOYSA-N 0.000 description 1
- VHJLVAABSRFDPM-QWWZWVQMSA-N dithiothreitol Chemical compound SC[C@@H](O)[C@H](O)CS VHJLVAABSRFDPM-QWWZWVQMSA-N 0.000 description 1
- 239000012154 double-distilled water Substances 0.000 description 1
- 239000003623 enhancer Substances 0.000 description 1
- 239000002532 enzyme inhibitor Substances 0.000 description 1
- 108700020302 erbB-2 Genes Proteins 0.000 description 1
- 238000010931 ester hydrolysis Methods 0.000 description 1
- 150000002148 esters Chemical class 0.000 description 1
- ZMMJGEGLRURXTF-UHFFFAOYSA-N ethidium bromide Chemical compound [Br-].C12=CC(N)=CC=C2C2=CC=C(N)C=C2[N+](CC)=C1C1=CC=CC=C1 ZMMJGEGLRURXTF-UHFFFAOYSA-N 0.000 description 1
- 229960005542 ethidium bromide Drugs 0.000 description 1
- VYXSBFYARXAAKO-UHFFFAOYSA-N ethyl 2-[3-(ethylamino)-6-ethylimino-2,7-dimethylxanthen-9-yl]benzoate;hydron;chloride Chemical compound [Cl-].C1=2C=C(C)C(NCC)=CC=2OC2=CC(=[NH+]CC)C(C)=CC2=C1C1=CC=CC=C1C(=O)OCC VYXSBFYARXAAKO-UHFFFAOYSA-N 0.000 description 1
- 102000013165 exonuclease Human genes 0.000 description 1
- 239000013613 expression plasmid Substances 0.000 description 1
- 238000000605 extraction Methods 0.000 description 1
- 210000003608 fece Anatomy 0.000 description 1
- 235000013305 food Nutrition 0.000 description 1
- 230000037406 food intake Effects 0.000 description 1
- 230000004927 fusion Effects 0.000 description 1
- 230000002068 genetic effect Effects 0.000 description 1
- 239000011521 glass Substances 0.000 description 1
- 239000008103 glucose Substances 0.000 description 1
- 108010080575 glutamyl-aspartyl-alanine Proteins 0.000 description 1
- 108020004445 glyceraldehyde-3-phosphate dehydrogenase Proteins 0.000 description 1
- 230000013595 glycosylation Effects 0.000 description 1
- 238000006206 glycosylation reaction Methods 0.000 description 1
- 108010027668 glycyl-alanyl-valine Proteins 0.000 description 1
- XKUKSGPZAADMRA-UHFFFAOYSA-N glycyl-glycyl-glycine Natural products NCC(=O)NCC(=O)NCC(O)=O XKUKSGPZAADMRA-UHFFFAOYSA-N 0.000 description 1
- 108010066198 glycyl-leucyl-phenylalanine Proteins 0.000 description 1
- 108010089804 glycyl-threonine Proteins 0.000 description 1
- 108010015792 glycyllysine Proteins 0.000 description 1
- 230000005283 ground state Effects 0.000 description 1
- 239000001963 growth medium Substances 0.000 description 1
- 208000002672 hepatitis B Diseases 0.000 description 1
- 239000000833 heterodimer Substances 0.000 description 1
- 229940088597 hormone Drugs 0.000 description 1
- 239000005556 hormone Substances 0.000 description 1
- 210000000987 immune system Anatomy 0.000 description 1
- 229940072221 immunoglobulins Drugs 0.000 description 1
- 238000009169 immunotherapy Methods 0.000 description 1
- 238000012606 in vitro cell culture Methods 0.000 description 1
- 230000000415 inactivating effect Effects 0.000 description 1
- 230000002779 inactivation Effects 0.000 description 1
- 238000010348 incorporation Methods 0.000 description 1
- 230000001939 inductive effect Effects 0.000 description 1
- 230000002458 infectious effect Effects 0.000 description 1
- 238000003780 insertion Methods 0.000 description 1
- 230000037431 insertion Effects 0.000 description 1
- 230000000968 intestinal effect Effects 0.000 description 1
- 210000000936 intestine Anatomy 0.000 description 1
- JDNTWHVOXJZDSN-UHFFFAOYSA-N iodoacetic acid Chemical compound OC(=O)CI JDNTWHVOXJZDSN-UHFFFAOYSA-N 0.000 description 1
- 238000002955 isolation Methods 0.000 description 1
- 108010044374 isoleucyl-tyrosine Proteins 0.000 description 1
- 108010060857 isoleucyl-valyl-tyrosine Proteins 0.000 description 1
- 108010027338 isoleucylcysteine Proteins 0.000 description 1
- BPHPUYQFMNQIOC-NXRLNHOXSA-N isopropyl beta-D-thiogalactopyranoside Chemical compound CC(C)S[C@@H]1O[C@H](CO)[C@H](O)[C@H](O)[C@H]1O BPHPUYQFMNQIOC-NXRLNHOXSA-N 0.000 description 1
- 150000002540 isothiocyanates Chemical group 0.000 description 1
- 230000000155 isotopic effect Effects 0.000 description 1
- 108010045069 keyhole-limpet hemocyanin Proteins 0.000 description 1
- 108010053037 kyotorphin Proteins 0.000 description 1
- 238000002372 labelling Methods 0.000 description 1
- 239000004816 latex Substances 0.000 description 1
- 229920000126 latex Polymers 0.000 description 1
- 108010077158 leucinyl-arginyl-tryptophan Proteins 0.000 description 1
- 108010044311 leucyl-glycyl-glycine Proteins 0.000 description 1
- 108010044056 leucyl-phenylalanine Proteins 0.000 description 1
- 239000003446 ligand Substances 0.000 description 1
- 238000007834 ligase chain reaction Methods 0.000 description 1
- 239000002502 liposome Substances 0.000 description 1
- 210000004880 lymph fluid Anatomy 0.000 description 1
- 210000004698 lymphocyte Anatomy 0.000 description 1
- 125000003588 lysine group Chemical group [H]N([H])C([H])([H])C([H])([H])C([H])([H])C([H])([H])C([H])(N([H])[H])C(*)=O 0.000 description 1
- 108010054155 lysyllysine Proteins 0.000 description 1
- 239000011777 magnesium Substances 0.000 description 1
- 229910052749 magnesium Inorganic materials 0.000 description 1
- 206010025482 malaise Diseases 0.000 description 1
- 238000004949 mass spectrometry Methods 0.000 description 1
- 230000007246 mechanism Effects 0.000 description 1
- 238000002844 melting Methods 0.000 description 1
- 230000008018 melting Effects 0.000 description 1
- 239000012528 membrane Substances 0.000 description 1
- 210000004779 membrane envelope Anatomy 0.000 description 1
- 239000002207 metabolite Substances 0.000 description 1
- 235000013336 milk Nutrition 0.000 description 1
- 210000004080 milk Anatomy 0.000 description 1
- 239000008267 milk Substances 0.000 description 1
- 238000012544 monitoring process Methods 0.000 description 1
- 230000035772 mutation Effects 0.000 description 1
- 230000008693 nausea Effects 0.000 description 1
- 238000006386 neutralization reaction Methods 0.000 description 1
- 230000003472 neutralizing effect Effects 0.000 description 1
- 229920001220 nitrocellulos Polymers 0.000 description 1
- 239000002853 nucleic acid probe Substances 0.000 description 1
- 229920001778 nylon Polymers 0.000 description 1
- 229940124276 oligodeoxyribonucleotide Drugs 0.000 description 1
- 238000002515 oligonucleotide synthesis Methods 0.000 description 1
- 238000011275 oncology therapy Methods 0.000 description 1
- 230000003287 optical effect Effects 0.000 description 1
- 239000003960 organic solvent Substances 0.000 description 1
- 229940092253 ovalbumin Drugs 0.000 description 1
- 230000001590 oxidative effect Effects 0.000 description 1
- 229940055729 papain Drugs 0.000 description 1
- 235000019834 papain Nutrition 0.000 description 1
- 229940111202 pepsin Drugs 0.000 description 1
- 239000000546 pharmaceutical excipient Substances 0.000 description 1
- 108010024654 phenylalanyl-prolyl-alanine Proteins 0.000 description 1
- 108010084572 phenylalanyl-valine Proteins 0.000 description 1
- 108010012581 phenylalanylglutamate Proteins 0.000 description 1
- 108010073101 phenylalanylleucine Proteins 0.000 description 1
- 108010051242 phenylalanylserine Proteins 0.000 description 1
- 150000008300 phosphoramidites Chemical class 0.000 description 1
- 230000026731 phosphorylation Effects 0.000 description 1
- 238000006366 phosphorylation reaction Methods 0.000 description 1
- 108010025488 pinealon Proteins 0.000 description 1
- 229920000729 poly(L-lysine) polymer Polymers 0.000 description 1
- 229920000747 poly(lactic acid) Polymers 0.000 description 1
- 229920002401 polyacrylamide Polymers 0.000 description 1
- 230000008488 polyadenylation Effects 0.000 description 1
- 229920002647 polyamide Polymers 0.000 description 1
- 229920002643 polyglutamic acid Polymers 0.000 description 1
- 229920000656 polylysine Polymers 0.000 description 1
- 125000002924 primary amino group Chemical group [H]N([H])* 0.000 description 1
- 108010070643 prolylglutamic acid Proteins 0.000 description 1
- 230000001737 promoting effect Effects 0.000 description 1
- 125000006239 protecting group Chemical group 0.000 description 1
- 239000012460 protein solution Substances 0.000 description 1
- 230000006337 proteolytic cleavage Effects 0.000 description 1
- 230000005180 public health Effects 0.000 description 1
- IGFXRKMLLMBKSA-UHFFFAOYSA-N purine Chemical compound N1=C[N]C2=NC=NC2=C1 IGFXRKMLLMBKSA-UHFFFAOYSA-N 0.000 description 1
- 150000003220 pyrenes Chemical class 0.000 description 1
- 238000010188 recombinant method Methods 0.000 description 1
- 230000003252 repetitive effect Effects 0.000 description 1
- 230000000241 respiratory effect Effects 0.000 description 1
- 230000004044 response Effects 0.000 description 1
- 238000012552 review Methods 0.000 description 1
- 239000001022 rhodamine dye Substances 0.000 description 1
- 125000002652 ribonucleotide group Chemical group 0.000 description 1
- 210000003296 saliva Anatomy 0.000 description 1
- 125000003607 serino group Chemical group [H]N([H])[C@]([H])(C(=O)[*])C(O[H])([H])[H] 0.000 description 1
- 108010069117 seryl-lysyl-aspartic acid Proteins 0.000 description 1
- 108010048397 seryl-lysyl-leucine Proteins 0.000 description 1
- 238000002741 site-directed mutagenesis Methods 0.000 description 1
- 159000000000 sodium salts Chemical class 0.000 description 1
- 238000010532 solid phase synthesis reaction Methods 0.000 description 1
- 241000894007 species Species 0.000 description 1
- 229940063673 spermidine Drugs 0.000 description 1
- 108010005652 splenotritin Proteins 0.000 description 1
- 238000012289 standard assay Methods 0.000 description 1
- 150000001629 stilbenes Chemical class 0.000 description 1
- 235000021286 stilbenes Nutrition 0.000 description 1
- 229960002317 succinimide Drugs 0.000 description 1
- 235000000346 sugar Nutrition 0.000 description 1
- 150000008163 sugars Chemical class 0.000 description 1
- 210000001138 tear Anatomy 0.000 description 1
- 229960000814 tetanus toxoid Drugs 0.000 description 1
- 238000005382 thermal cycling Methods 0.000 description 1
- 150000003568 thioethers Chemical class 0.000 description 1
- 125000003396 thiol group Chemical group [H]S* 0.000 description 1
- 150000003573 thiols Chemical class 0.000 description 1
- 229960002175 thyroglobulin Drugs 0.000 description 1
- 239000003053 toxin Substances 0.000 description 1
- 231100000765 toxin Toxicity 0.000 description 1
- 108700012359 toxins Proteins 0.000 description 1
- 230000001960 triggered effect Effects 0.000 description 1
- 108010080629 tryptophan-leucine Proteins 0.000 description 1
- 108010044292 tryptophyltyrosine Proteins 0.000 description 1
- 108010051110 tyrosyl-lysine Proteins 0.000 description 1
- 108010077037 tyrosyl-tyrosyl-phenylalanine Proteins 0.000 description 1
- 108010003137 tyrosyltyrosine Proteins 0.000 description 1
- 230000009452 underexpressoin Effects 0.000 description 1
- 238000011144 upstream manufacturing Methods 0.000 description 1
- 229940035893 uracil Drugs 0.000 description 1
- 210000002700 urine Anatomy 0.000 description 1
- 230000029812 viral genome replication Effects 0.000 description 1
- 239000011534 wash buffer Substances 0.000 description 1
- XLYOFNOQVPJJNP-UHFFFAOYSA-N water Substances O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 description 1
- 108010027345 wheylin-1 peptide Proteins 0.000 description 1
- 108010000998 wheylin-2 peptide Proteins 0.000 description 1
Images
Classifications
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K14/00—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
- C07K14/005—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from viruses
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61P—SPECIFIC THERAPEUTIC ACTIVITY OF CHEMICAL COMPOUNDS OR MEDICINAL PREPARATIONS
- A61P31/00—Antiinfectives, i.e. antibiotics, antiseptics, chemotherapeutics
- A61P31/12—Antivirals
- A61P31/14—Antivirals for RNA viruses
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Q—MEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
- C12Q1/00—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
- C12Q1/70—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving virus or bacteriophage
- C12Q1/701—Specific hybridization probes
- C12Q1/706—Specific hybridization probes for hepatitis
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61K—PREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
- A61K39/00—Medicinal preparations containing antigens or antibodies
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N2770/00—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA ssRNA viruses positive-sense
- C12N2770/00011—Details
- C12N2770/32011—Picornaviridae
- C12N2770/32411—Hepatovirus, i.e. hepatitis A virus
- C12N2770/32422—New viral proteins or individual genes, new structural or functional aspects of known viral proteins or genes
-
- Y—GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y02—TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
- Y02A—TECHNOLOGIES FOR ADAPTATION TO CLIMATE CHANGE
- Y02A50/00—TECHNOLOGIES FOR ADAPTATION TO CLIMATE CHANGE in human health protection, e.g. against extreme weather
- Y02A50/30—Against vector-borne diseases, e.g. mosquito-borne, fly-borne, tick-borne or waterborne diseases whose impact is exacerbated by climate change
Definitions
- the present invention pertains generally to viral diagnostics.
- the invention relates to nucleic acid and antibody-based assays for accurately diagnosing hepatitis A virus infection.
- Hepatitis A is an enterically transmitted disease that causes fever, malaise, anorexia, nausea, abdominal discomfort and jaundice.
- the etiologic agent of hepatitis A is a small, nonenveloped, spherical virus classified in the genus Hepatovirus of the Picornaviridae family.
- the HAV genome consists of a single-strand, linear, 7.5 kb RNA molecule encoding a polyprotein precursor that is processed to yield the structural proteins and enzymatic activities required for viral replication. HAV grows poorly in cell culture, is not cytopathic, and produces low yields of virus.
- HAV RNA extracted from virions is infectious in cell culture (Locarnini et al., J. Virol. 37:216-225, 1981 and Siegl et al., J. Gen. Virol. 57:331-341, 1981), direct manipulation of the viral genome becomes difficult because of its RNA composition.
- HAV encodes four capsid proteins (A, B, C and D) which contain the major antigenic domains recognized by antibodies of infected individuals.
- capsid proteins In addition to the capsid proteins, antigenic domains have been reported in nonstructural proteins such as 2A and the viral encoded protease. Another important HAV antigenic domain has been described in the junction between the capsid precursor P1 and 2A.
- HAV is normally acquired by fecal-oral route, by either person-to-person contact or ingestion of contaminated food or water.
- HAV transmission by pooled plasma products.
- the absence of a lipid envelope makes HAV very resistant to physicochemical inactivation, and the virus can withstand conventional heat treatment of blood products.
- HAV, as well as Parvovirus B19 have been transmitted through the administration of pooled plasma derivatives.
- sensitive and specific diagnostic assays to identify HAV antigens and/or antibodies in infected individuals as well as nucleic acid-based tests to detect viremic samples to exclude them from transfusion represents an important public health challenge.
- the present invention is based on the development of a sensitive, reliable nucleic acid-based diagnostic test for the detection of hepatitis A virus (HAV) in biological samples from potentially infected individuals.
- HAV hepatitis A virus
- the techniques described herein utilize extracted sample RNA as a template for amplification of HAV genomic sequence using transcription-mediated amplification (TMA), as well as in a 5′ nuclease assay, such as the TaqManTM technique.
- TMA transcription-mediated amplification
- the methods allow for the detection of HAV in viremic samples. Accordingly, infected samples can be identified and excluded from transfusion, as well as from the preparation of blood derivatives.
- the invention is directed to an isolated polynucleotide comprising (a) a nucleotide sequence comprising any one of the nucleotide sequences depicted in SEQ ID NOs: 1-39; (b) an isolated polynucleotide encoding a polypeptide comprising any one of SEQ ID NOs: 40-48; (c) a sequence complementary to any one of the sequences of (a) or (b); or (d) a fragment of any of the sequences in (a) or (b) wherein the fragment is at least 10 nucleotides.
- the invention is directed to an oligonucleotide primer consisting of a promoter region recognized by a DNA-dependent RNA polymerase operably linked to a HAV-specific complexing sequence of about 10 to about 75 nucleotides.
- the promoter region is the T7 promoter and said polymerase is T7 RNA polymerase.
- the HAV-specific sequence may be from the HAV genome, such as a nucleotide sequence comprising any one of the nucleotide sequences depicted in SEQ ID NOs: 1-39.
- the invention is directed an oligonucleotide primer consisting of a T7 promoter operably linked to a HAV-specific complexing sequence of about 10 to about 75 nucleotides, wherein the HAV-specific complexing sequence is derived from any one of the polynucleotide sequences of SEQ ID NOs: 1-39.
- the invention is directed to an oligonucleotide probe comprising a HAV-specific hybridizing sequence of about 10 to about 50 nucleotides linked to an acridinium ester label.
- the HAV-specific hybridizing sequence is a polynucleotide sequence derived from any one of the polynucleotide sequences of SEQ ID NOs: 1-39.
- the invention is directed to a vaccine composition
- a vaccine composition comprising an isolated immunogenic Hepatitis A virus (HAV) polypeptide, and a pharmaceutically acceptable excipient, wherein the HAV polypeptide is a polypeptide with at least 80% sequence identity to any one of the sequences of SEQ ID NOs: 40-48, or an immunogenic fragment thereof comprising at least 10 amino acids.
- HAV Hepatitis A virus
- the invention is directed to a diagnostic test kit comprising one or more oligonucleotide primers described herein, and instructions for conducting the diagnostic test.
- the test kit further comprises an oligonucleotide probe comprising a HAV hybridizing sequence of about 10 to about 50 nucleotides linked to an acridinium ester label.
- the invention is directed to an immunoassay for detecting antibodies that bind to a hepatitis A virus polypeptide comprising: providing an antigen comprising a sequence having at least 80% sequence identity to any one of the sequences of SEQ ID NOs: 40-48, or fragment thereof; incubating the antigen with a biological sample under conditions that allow for formation of an antibody-antigen complex; and detecting any antibody-antigen complexes comprised of said antigen.
- the antigen may be immobilized on a solid support, and may be at least 10 amino acids.
- the biological sample can be bodily fluid, tissue, or organ, such as human blood or a fraction thereof.
- the invention is directed to a method for detecting Hepatitis A virus (HAV) infection in a biological sample, the method comprising (a) isolating nucleic acid from a biological sample suspected of containing Hepatitis A virus (HAV) RNA, wherein said nucleic acid comprises a target sequence, (b) reacting the HAV nucleic acid with a detectably labeled probe sufficiently complementary to and capable of hybridizing with the target sequence, wherein the probe is derived from any one of SEQ ID NOs: 1-39, and further wherein said reacting is done under conditions that provide for the formation of a probe/target sequence complex, and (c) detecting the presence or absence of label as an indication of the presence or absence of the target sequence.
- HAV Hepatitis A virus
- FIG. 1 illustrates the nucleotide sequence of the 243 base pair VP3/VP1 HAV fragment determined for thirteen Indonesian (IND) (SEQ ID NOs: 1-13) and fourteen Chilean (SCL) (SEQ ID NOs: 14-27) isolates.
- IND Indonesian
- SCL Chilean
- FIG. 2 illustrates the nucleotide sequence (SEQ ID NO: 28) of a 2,950 bp KpnI/SphI insert encoding the HAV P1/2A precursor.
- FIG. 3 illustrates the nucleotide sequence (SEQ ID NO: 29) of a 6,696 bp KpnI/SphI insert encoding the full length HAV open reading frame.
- FIG. 4 illustrates the nucleotide sequence (SEQ ID NO: 30) of a 6,757 bp KpnI/SphI insert encoding the full length HAV open reading frame plus additional 3′ untranslated sequences.
- FIG. 5 illustrates the nucleotide (SEQ ID NO: 31) and corresponding amino acid sequence (SEQ ID NO: 40) of the recombinant protein of 94 kDa consisting of the capsid precursor P1 (methionine 1 - glutamic acid 791 ) plus the 45 amino terminal residues of the nonstructural protein 2A (serine 792 through glutamine 836 ).
- FIG. 6 illustrates the nucleotide (SEQ ID NO: 32) and corresponding amino acid sequence (SEQ ID NO: 41) of the recombinant protein of 115.5 kDa consisting of precursor P1 (Met 1 -Glu 791 ) fused with the nonstructural protein 2A (Ser 792 -Gln 980 ).
- FIG. 7 illustrates the nucleotide (SEQ ID NO: 33) and corresponding amino acid sequence (SEQ ID NO: 42) of the recombinant protein of 25 kDa (Asp 24 -Gln 245 ) representing HAV capsid protein 1B (VP2 gene product).
- FIG. 8 illustrates the nucleotide (SEQ ID NO: 34) and corresponding amino acid sequence (SEQ ID NO: 43) of the recombinant protein of 28 kDa (Met 246 -Gln 491 ) representing HAV capsid protein 1C (VP3 gene product).
- FIG. 9 illustrates the nucleotide (SEQ ID NO: 35) and corresponding amino acid sequence (SEQ ID NO: 44) of the recombinant protein of 33.3 kDa (Val 492 -Glu 791 ) representing HAV capsid protein 1D (VP1 gene product).
- FIG. 10 illustrates the nucleotide (SEQ ID NO: 36) and corresponding amino acid sequence (SEQ ID NO: 45) of the recombinant protein of 38.8 kDa consisting of human superoxide dismutase (153 aminoacids) fused with the HAV nonstructural protein 2A (Ser 792 -Gln 980 ).
- FIG. 11 illustrates the nucleotide (SEQ ID NO: 37) and corresponding amino acid sequence (SEQ ID NO: 46) of the recombinant protein of 24.9 kDa consisting of human superoxide dismutase (153 amino acids) fused with the HAV nonstructural protein 3A (Gly 1423 -Glu 1496 ).
- FIG. 12 illustrates the nucleotide (SEQ ID NO: 38) and corresponding amino acid sequence (SEQ ID NO: 47) of the recombinant protein of 41 kDa consisting of human superoxide dismutase (153 amino acids) fused with the HAV nonstructural protein 3C (Protease: Ser 1520 -Gln 1678 ).
- FIG. 13 illustrates the nucleotide (SEQ ID NO: 39) and corresponding amino acid sequence (SEQ ID NO: 48) of the recombinant protein of human superoxide dismutase (153 amino acids) fused with the HAV nonstructural protein 3D (RNA dependent RNA polymerase: Arg 1739 -Ser 2227 ).
- polypeptide and “protein” refer to a polymer of amino acid residues and are not limited to a minimum length of the product. Thus, peptides, oligopeptides, dimers, multimers, and the like, are included within the definition. Both full-length proteins and fragments thereof are encompassed by the definition.
- the terms also include postexpression modifications of the polypeptide, for example, glycosylation, acetylation, phosphorylation and the like.
- a “polypeptide” refers to a protein which includes modifications, such as deletions, additions and substitutions (generally conservative in nature), to the native sequence, so long as the protein maintains the desired activity. These modifications may be deliberate, as through site-directed mutagenesis, or may be accidental, such as through mutations of hosts which produce the proteins or errors due to PCR amplification.
- analog and mutant refer to biologically active derivatives of the reference molecule, or fragments of such derivatives, that retain desired activity, such as immunoreactivity in diagnostic assays.
- analog refers to compounds having a native polypeptide sequence and structure with one or more amino acid additions, substitutions (generally conservative in nature) and/or deletions, relative to the native molecule, so long as the modifications do not destroy immunogenic activity.
- mutein refers to peptides having one or more peptide mimics (“peptoids”), such as those described in International Publication No. WO 91/04282.
- the analog or mutein has at least the same immunoactivity as the native molecule. Methods for making polypeptide analogs and muteins are known in the art and are described further below.
- amino acids are generally divided into four families: (1) acidic—aspartate and glutamate; (2) basic—lysine, arginine, histidine; (3) non-polar—alanine, valine, leucine, isoleucine, proline, phenylalanine, methionine, tryptophan; and (4) uncharged polar—glycine, asparagine, glutamine, cysteine, serine threonine, tyrosine. Phenylalanine, tryptophan, and tyrosine are sometimes classified as aromatic amino acids.
- the polypeptide of interest may include up to about 5-10 conservative or non-conservative amino acid substitutions, or even up to about 15-25 conservative or non-conservative amino acid substitutions, or any integer between 5-25, so long as the desired function of the molecule remains intact.
- One of skill in the art may readily determine regions of the molecule of interest that can tolerate change by reference to Hopp/Woods and Kyte-Doolittle plots, well known in the art.
- isolated is meant, when referring to a polypeptide, that the indicated molecule is separate and discrete from the whole organism with which the molecule is found in nature or is present in the substantial absence of other biological macro-molecules of the same type.
- isolated with respect to a polynucleotide is a nucleic acid molecule devoid, in whole or part, of sequences normally associated with it in nature; or a sequence, as it exists in nature, but having heterologous sequences in association therewith; or a molecule disassociated from the chromosome.
- a polynucleotide “derived from” or “specific for” a designated sequence refers to a polynucleotide sequence which comprises a contiguous sequence of approximately at least about 6 nucleotides, preferably at least about 8 nucleotides, more preferably at least about 10-12 nucleotides, and even more preferably at least about 15-20 nucleotides corresponding, i.e., identical or complementary to, a region of the designated nucleotide sequence.
- the derived polynucleotide will not necessarily be derived physically from the nucleotide sequence of interest, but may be generated in any manner, including, but not limited to, chemical synthesis, replication, reverse transcription or transcription, which is based on the information provided by the sequence of bases in the region(s) from which the polynucleotide is derived. As such, it may represent either a sense or an antisense orientation of the original polynucleotide.
- “Homology” refers to the percent similarity between two polynucleotide or two polypeptide moieties.
- Two DNA, or two polypeptide sequences are “substantially homologous” to each other when the sequences exhibit at least about 50%, preferably at least about 75%, more preferably at least about 80%-85%, preferably at least about 90%, and most preferably at least about 95%-98% sequence similarity over a defined length of the molecules.
- substantially homologous also refers to sequences showing complete identity to the specified DNA or polypeptide sequence.
- identity refers to an exact nucleotide-to-nucleotide or amino acid-to-amino acid correspondence of two polynucleotides or polypeptide sequences, respectively. Percent identity can be determined by a direct comparison of the sequence information between two molecules by aligning the sequences, counting the exact number of matches between the two aligned sequences, dividing by the length of the shorter sequence, and multiplying the result by 100.
- percent homology of a particular nucleotide sequence to a reference sequence can be determined using the homology algorithm of Smith and Waterman with a default scoring table and a gap penalty of six nucleotide positions.
- Another method of establishing percent homology in the context of the present invention is to use the MPSRCH package of programs copyrighted by the University of Edinburgh, developed by John F. Collins and Shane S. Sturrok, and distributed by IntelliGenetics, Inc. (Mountain View, Calif.). From this suite of packages the Smith-Waterman algorithm can be employed where default parameters are used for the scoring table (for example, gap open penalty of 12, gap extension penalty of one, and a gap of six). From the data generated the “Match” value reflects “sequence homology.” Other suitable programs for calculating the percent identity or similarity between sequences are generally known in the art, for example, another alignment program is BLAST, used with default parameters.
- homology can be determined by hybridization of polynucleotides under conditions which form stable duplexes between homologous regions, followed by digestion with single-stranded-specific nuclease(s), and size determination of the digested fragments.
- DNA sequences that are substantially homologous can be identified in a Southern hybridization experiment under, for example, stringent conditions, as defined for that particular system. Defining appropriate hybridization conditions is within the skill of the art. See, e.g., Sambrook et al., supra; DNA Cloning , supra; Nucleic Acid Hybridization , supra.
- “Operably linked” refers to an arrangement of elements wherein the components so described are configured so as to perform their desired function.
- a given promoter operably linked to a nucleic acid sequence is capable of effecting the transcription, and in the case of a coding sequence, the expression of the coding sequence when the proper transcription factors, etc., are present.
- the promoter need not be contiguous with the nucleic acid sequence, so long as it functions to direct the transcription and/or expression thereof.
- intervening untranslated yet transcribed sequences can be present between the promoter sequence and the coding sequence, as can transcribed introns, and the promoter sequence can still be considered “operably linked” to the coding sequence.
- “Recombinant” as used herein to describe a nucleic acid molecule means a polynucleotide of genomic, cDNA, viral, semisynthetic, or synthetic origin which, by virtue of its origin or manipulation is not associated with all or a portion of the polynucleotide with which it is associated in nature.
- the term “recombinant” as used with respect to a protein or polypeptide means a polypeptide produced by expression of a recombinant polynucleotide.
- the gene of interest is cloned and then expressed in transformed organisms, as described further below. The host organism expresses the foreign gene to produce the protein under expression conditions.
- control element refers to a polynucleotide sequence which aids in the transcription and/or translation of a nucleotide sequence to which it is linked.
- the term includes promoters, transcription termination sequences, upstream regulatory domains, polyadenylation signals, untranslated regions, including 5′-UTRs and 3′-UTRs and when appropriate, leader sequences and enhancers, which collectively provide for the transcription and translation of a coding sequence in a host cell.
- a “promoter” as used herein is a regulatory region capable of binding a polymerase and initiating transcription of a downstream (3′ direction) nucleotide sequence operably linked thereto.
- a promoter sequence includes the minimum number of bases or elements necessary to initiate transcription of a sequence of interest at levels detectable above background.
- a transcription initiation site within the promoter sequence is a transcription initiation site, as well as protein binding domains (consensus sequences) responsible for the binding of RNA or DNA polymerase.
- promoter may be a nucleic acid sequence that is recognized by a DNA-dependent RNA polymerase (“transcriptase”) as a signal to bind to the nucleic acid and begin the transcription of RNA at a specific site.
- transcriptases For binding, such transcriptases generally require DNA which is double-stranded in the portion comprising the promoter sequence and its complement; the template portion (sequence to be transcribed) need not be double-stranded.
- Individual DNA-dependent RNA polymerases recognize a variety of different promoter sequences which can vary markedly in their efficiency in promoting transcription. When an RNA polymerase binds to a promoter sequence to initiate transcription, that promoter sequence is not part of the sequence transcribed. Thus, the RNA transcripts produced thereby will not include that sequence.
- a control sequence “directs the transcription” of a nucleotide sequence when RNA or DNA polymerase will bind the promoter sequence and transcribe the adjacent sequence.
- a “DNA-dependent DNA polymerase” is an enzyme that synthesizes a complementary DNA copy from a DNA template. Examples are DNA polymerase I from E. coli and bacteriophage T7 DNA polymerase. All known DNA-dependent DNA polymerases require a complementary primer to initiate synthesis. Under suitable conditions, a DNA-dependent DNA polymerase may synthesize a complementary DNA copy from an RNA template.
- a “DNA-dependent RNA polymerase” or a “transcriptase” is an enzyme that synthesizes multiple RNA copies from a double-stranded or partially-double stranded DNA molecule having a (usually double-stranded) promoter sequence.
- the RNA molecules (“transcripts”) are synthesized in the 5′ to 3′ direction beginning at a specific position just downstream of the promoter. Examples of transcriptases are the DNA-dependent RNA polymerase from E. coli and bacteriophages T7, T3, and SP6.
- RNA-dependent DNA polymerase or “reverse transcriptase” is an enzyme that synthesizes a complementary DNA copy from an RNA template. All known reverse transcriptases also have the ability to make a complementary DNA copy from a DNA template; thus, they are both RNA- and DNA-dependent DNA polymerases. A primer is required to initiate synthesis with both RNA and DNA templates.
- RNAse H is an enzyme that degrades the RNA portion of an RNA:DNA duplex. These enzymes may be endonucleases or exonucleases. Most reverse transcriptase enzymes normally contain an RNAse H activity in addition to their polymerase activity. However, other sources of the RNAse H are available without an associated polymerase activity. The degradation may result in separation of RNA from a RNA:DNA complex. Alternatively, the RNAse H may simply cut the RNA at various locations such that portions of the RNA melt off or permit enzymes to unwind portions of the RNA.
- polynucleotide oligonucleotide
- nucleic acid oligonucleotide
- nucleic acid molecule a polymeric form of nucleotides of any length, either ribonucleotides or deoxyribonucleotides. This term refers only to the primary structure of the molecule. Thus, the term includes triple-, double- and single-stranded DNA, as well as triple-, double- and single-stranded RNA. It also includes modifications, such as by methylation and/or by capping, and unmodified forms of the polynucleotide.
- polynucleotide examples include polydeoxyribonucleotides (containing 2-deoxy-D-ribose), polyribonucleotides (containing D-ribose), any other type of polynucleotide which is an N- or C-glycoside of a purine or pyrimidine base, and other polymers containing nonnucleotidic backbones, for example, polyamide (e.g., peptide nucleic acids (PNAs)) and polymorpholino (commercially available from the Anti-Virals, Inc., Corvallis, Oreg., as Neugene) polymers, and other synthetic sequence-specific nucleic acid polymers providing that the polymers contain nucleobases in a configuration which allows for base pairing and base stacking, such as is found in DNA and RNA.
- PNAs peptide nucleic acids
- these terms include, for example, 3′-deoxy-2′,5′-DNA, oligodeoxyribonucleotide N3′ P5′ phosphoramidates, 2′-O-alkyl-substituted RNA, double- and single-stranded DNA, as well as double- and single-stranded RNA, DNA:RNA hybrids, and hybrids between PNAs and DNA or RNA, and also include known types of modifications, for example, labels which are known in the art, methylation, “caps,” substitution of one or more of the naturally occurring nucleotides with an analog, internucleotide modifications such as, for example, those with uncharged linkages (e.g., methyl phosphonates, phosphotriesters, phosphoramidates, carbamates, etc.), with negatively charged linkages (e.g., phosphorothioates, phosphorodithioates, etc.), and with positively charged linkages (e.g., aminoalklyphosphoramidates
- target nucleic acid region denotes a nucleic acid molecule with a “target sequence” to be amplified.
- the target nucleic acid may be either single-stranded or double-stranded and may include other sequences besides the target sequence, which may not be amplified.
- target sequence refers to the particular nucleotide sequence of the target nucleic acid which is to be amplified.
- the target sequence may include a probe-hybridizing region contained within the target molecule with which a probe will form a stable hybrid under desired conditions.
- target sequence may also include the complexing sequences to which the oligonucleotide primers complex and be extended using the target sequence as a template.
- target sequence also refers to the sequence complementary to the “target sequence” as present in the target nucleic acid. If the “target nucleic acid” is originally double-stranded, the term “target sequence” refers to both the plus (+) and minus ( ⁇ ) strands.
- primer refers to an oligonucleotide which acts to initiate synthesis of a complementary DNA strand when placed under conditions in which synthesis of a primer extension product is induced, i.e., in the presence of nucleotides and a polymerization-inducing agent such as a DNA or RNA polymerase and at suitable temperature, pH, metal concentration, and salt concentration.
- the primer is preferably single-stranded for maximum efficiency in amplification, but may alternatively be double-stranded. If double-stranded, the primer is first treated to separate its strands before being used to prepare extension products.
- a “primer” is complementary to a template, and complexes by hydrogen bonding or hybridization with the template to give a primer/template complex for initiation of synthesis by a polymerase, which is extended by the addition of covalently bonded bases linked at its 3′ end complementary to the template in the process of DNA synthesis.
- probe or “oligonucleotide probe” refers to a structure comprised of a polynucleotide, as defined above, that contains a nucleic acid sequence complementary to a nucleic acid sequence present in the target nucleic acid analyte.
- the polynucleotide regions of probes may be composed of DNA, and/or RNA, and/or synthetic nucleotide analogs.
- the probe When an “oligonucleotide probe” is to be used in a 5′ nuclease assay, such as the TaqManTM technique, the probe will contain at least one fluorescer and at least one quencher which is digested by the 5′ endonuclease activity of a polymerase used in the reaction in order to detect any amplified target oligonucleotide sequences.
- the oligonucleotide probe will have a sufficient number of phosphodiester linkages adjacent to its 5′ end so that the 5′ to 3′ nuclease activity employed can efficiently degrade the bound probe to separate the fluorescers and quenchers.
- an oligonucleotide probe When an oligonucleotide probe is used in the TMA technique, it will be suitably labeled, as described below.
- hybridizing sequences need not have perfect complementarity to provide stable hybrids. In many situations, stable hybrids will form where fewer than about 10% of the bases are mismatches, ignoring loops of four or more nucleotides. Accordingly, as used herein the term “complementary” refers to an oligonucleotide that forms a stable duplex with its “complement” under assay conditions, generally where there is about 90% or greater homology.
- hybridize and “hybridization” refer to the formation of complexes between nucleotide sequences which are sufficiently complementary to form complexes via Watson-Crick base pairing.
- target template
- such complexes (or hybrids) are sufficiently stable to serve the priming function required by, e.g., the DNA polymerase to initiate DNA synthesis.
- Stringent hybridization conditions will typically include salt concentrations of less than about 1M, more usually less than about 500 mM and preferably less than about 200 mM.
- Hybridization temperatures can be as low as 5° C., but are typically greater than 22° C., more typically greater than about 30° C., and preferably in excess of about 37° C. Longer fragments may require higher hybridization temperatures for specific hybridization.
- Other factors may affect the stringency of hybridization, including base composition and length of the complementary strands, presence of organic solvents and extent of base mismatching, and the combination of parameters used is more important than the absolute measure of any one alone.
- hybridization conditions which may be controlled include buffer type and concentration, solution pH, presence and concentration of blocking reagents to decrease background binding such as repeat sequences or blocking protein solutions, detergent type(s) and concentrations, molecules such as polymers which increase the relative concentration of the polynucleotides, metal ion(s) and their concentration(s), chelator(s) and their concentrations, and other conditions known in the art.
- Less stringent, and/or more physiological, hybridization conditions are used where a labeled polynucleotide amplification product cycles on and off a substrate linked to a complementary probe polynucleotide during a real-time assay which is monitored during PCR amplification such as a molecular beacon assay.
- Such less stringent hybridization conditions can also comprise solution conditions effective for other aspects of the method, for example reverse transcription or PCR.
- binding pair refers to first and second molecules that specifically bind to each other, such as complementary polynucleotide pairs capable of forming nucleic acid duplexes. “Specific binding” of the first member of the binding pair to the second member of the binding pair in a sample is evidenced by the binding of the first member to the second member, or vice versa, with greater affinity and specificity than to other components in the sample. The binding between the members of the binding pair is typically noncovalent. Unless the context clearly indicates otherwise, the terms “affinity molecule” and “target analyte” are used herein to refer to first and second members of a binding pair, respectively.
- selective bind is meant that the molecule binds preferentially to the target of interest or binds with greater affinity to the target than to other molecules.
- a DNA molecule will bind to a substantially complementary sequence and not to unrelated sequences.
- the “melting temperature” or “Tm” of double-stranded DNA is defined as the temperature at which half of the helical structure of DNA is lost due to heating or other dissociation of the hydrogen bonding between base pairs, for example, by acid or alkali treatment, or the like.
- the T m of a DNA molecule depends on its length and on its base composition. DNA molecules rich in GC base pairs have a higher T m than those having an abundance of AT base pairs. Separated complementary strands of DNA spontaneously reassociate or anneal to form duplex DNA when the temperature is lowered below the T m . The highest rate of nucleic acid hybridization occurs approximately 25° C. below the T m .
- a “biological sample” refers to a sample of tissue or fluid isolated from a subject, that commonly includes antibodies produced by the subject. Typical samples that include such antibodies are known in the art and include but not limited to, blood, plasma, serum, fecal matter, urine, bone marrow, bile, spinal fluid, lymph fluid, samples of the skin, secretions of the skin, respiratory, intestinal, and genitourinary tracts, tears, saliva, milk, blood cells, organs, biopsies and also samples of in vitro cell culture constituents including but not limited to conditioned media resulting from the growth of cells and tissues in culture medium, e.g., recombinant cells, and cell components.
- label and “detectable label” refer to a molecule capable of detection, including, but not limited to, radioactive isotopes, fluorescers, chemiluminescers, chromophores, enzymes, enzyme substrates, enzyme cofactors, enzyme inhibitors, chromophores, dyes, metal ions, metal sols, ligands (e.g., biotin, avidin, strepavidin or haptens) and the like.
- fluorescer refers to a substance or a portion thereof which is capable of exhibiting fluorescence in the detectable range.
- an “antigen” includes any substance that may be specifically bound by an antibody molecule.
- the term “antigen” encompasses biologic molecules including, but not limited to, simple intermediary metabolites, sugars, lipids, autoacids, and hormones, as well as macromolecules such as complex carbohydrates, phopholipids, nucleic acids and proteins.
- an “immunogen” is a macromolecular antigen that is capable of initiating lymphocyte activation resulting in an antigen-specific immune response.
- An immunogen therefore includes any molecule which contains one or more epitopes that will stimulate a host's immune system to initiate a secretory, humoral and/or cellular antigen-specific response.
- antibody encompasses polyclonal and monoclonal antibody preparations, as well as preparations including hybrid antibodies, altered antibodies, chimeric antibodies and, humanized antibodies, as well as: hybrid (chimeric) antibody molecules (see, for example, Winter et al (1991) Nature 349:293-299; and U.S. Pat. No.
- F(ab′)2 and F(ab) fragments Fv molecules (noncovalent heterodimers, see, for example, Inbar et al (1972) Proc Natl Acad Sci USA 69:2659-2662; and Ehrlich et al (1980) Biochem 19:4091-4096); single-chain Fv molecules (sFv) (see, e.g., Huston et al (1988) Proc Natl Acad Sci USA 85:5879-5883); dimeric and trimeric antibody fragment constructs; minibodies (see, e.g., Pack et al. (1992) Biochem 31:1579-1584; Cumber et al.
- the term “monoclonal antibody” refers to an antibody composition having a homogeneous antibody population. The term is not limited regarding the species or source of the antibody, nor is it intended to be limited by the manner in which it is made. The term encompasses whole immunoglobulins.
- Polyclonal antibodies are generated by immunizing a suitable animal, such as a mouse, rat, rabbit, sheep or goat, with an antigen of interest.
- the antigen can be linked to a carrier prior to immunization.
- Suitable carriers are typically large, slowly metabolized macromolecules such as proteins, polysaccharides, polylactic acids, polyglycolic acids, polymeric amino acids, amino acid copolymers, lipid aggregates (such as oil droplets or liposomes), and inactive virus particles.
- lipid aggregates such as oil droplets or liposomes
- the antigen may be conjugated to a bacterial toxoid, such as toxoid from diphtheria, tetanus, cholera, etc., in order to enhance the immunogenicity thereof
- Immunization is generally performed by mixing or emulsifying the antigen in saline, preferably in an adjuvant such as Freund's complete adjuvant (“FCA”), and injecting the mixture or emulsion parenterally (generally subcutaneously or intramuscularly). The animal is generally boosted 2-6 weeks later with one or more injections of the antigen in saline, preferably using Freund's incomplete adjuvant (“FIA”). Antibodies may also be generated by in vitro immunization, using methods known in the art. Polyclonal antisera is then obtained from the immunized animal.
- FCA Freund's complete adjuvant
- Monoclonal antibodies are generally prepared using the method of Kohler and Milstein, Nature (1975) 256:495-497, or a modification thereof.
- a mouse or rat is immunized as described above.
- the spleen (and optionally several large lymph nodes) is removed and dissociated into single cells.
- the spleen cells may be screened (after removal of non-specifically adherent cells) by applying a cell suspension to a plate or well coated with the antigen.
- B-cells, expressing membrane-bound immunoglobulin specific for the antigen will bind to the plate, and are not rinsed away with the rest of the suspension.
- Resulting B-cells, or all dissociated spleen cells are then induced to fuse with myeloma cells to form hybridomas, and are cultured in a selective medium (e.g., hypoxanthine, aminopterin, thymidine medium, “HAT”).
- a selective medium e.g., hypoxanthine, aminopterin, thymidine medium, “HAT”.
- the resulting hybridomas are plated by limiting dilution, and are assayed for the production of antibodies which bind specifically to the immunizing antigen (and which do not bind to unrelated antigens).
- the selected monoclonal antibody-secreting hybridomas are then cultured either in vitro (e.g., in tissue culture bottles or hollow fiber reactors), or in vivo (e.g., as ascites in mice).
- the present invention is based on the discovery of novel diagnostic methods for accurately detecting HAV infection in a biological sample.
- the methods rely on sensitive nucleic acid-based detection techniques that allow identification of HAV target nucleic acid sequences in samples containing small amounts of virus.
- the inventors herein have characterized regions within the HAV genome which are desirable targets for diagnostic tests. Primers and probes derived from these regions are extremely useful for detection of HAV infection in biological samples.
- HAV primers and probes described above are used in nucleic acid-based assays for the detection of HAV infection in biological samples.
- primers and probes for use in these assays are preferably derived from the nucleotide sequences depicted in FIGS. 1-13 herein.
- Particularly preferred primers and probes for use with the present assays are designed from HAV genome to allow detection of HAV infection caused by a variety of isolates.
- capsid proteins, nonstructural proteins, protease and the junction between the capsid precursor P1 and 2A are readily obtained from additional isolates using portions of the HAV sequence found within these particular regions as primers in PCR reactions such as those described herein.
- Another method of obtaining nucleotide sequences with the desired sequences is by annealing complementary sets of overlapping synthetic oligonucleotides produced in a conventional, automated polynucleotide synthesizer, followed by ligation with an appropriate DNA ligase and amplification of the ligated nucleotide sequence via PCR. See, e.g., Jayaraman et al. (1991) Proc. Natl. Acad. Sci. USA 88:4084-4088.
- sequences Once the sequences have been prepared or isolated, they can be cloned into any suitable vector or replicon.
- suitable vectors include, but are not limited to, plasmids, phages, transposons, cosmids, chromosomes or viruses which are capable of replication when associated with the proper control elements.
- Recombinant clones are readily identified by restriction enzyme analysis and polyacryamide or agarose gel electrophresis, using techniques well known in the art, and described in the examples below.
- Polynucleotides of the invention encoding the HAV proteins are useful for designing hybridization probes for isolating and identifying cDNA clones and genomic clones encoding HAV proteins or allelic forms thereof. Such hybridization techniques are known to those of skill in the art. Sequence of polynucleotides that encode HAV proteins are also useful for designing primers for polymerase chain reaction (PCR). Also encompassed by the present invention, are single stranded polynucleotides, hereinafter referred to as antisense polynucleotides, having sequences which are complementary to the RNA sequences which encode the HAV proteins.
- Primers and probes for use in the assays herein are derived from these sequences and are readily synthesized by standard techniques, e.g., solid phase synthesis via phosphoramidite chemistry, as disclosed in U.S. Pat. Nos. 4,458,066 and 4,415,732, incorporated herein by reference; Beaucage et al. (1992) Tetrahedron 48:2223-2311; and Applied Biosystems User Bulletin No. 13 (1 Apr. 1987).
- Other chemical synthesis methods include, for example, the phosphotriester method described by Narang et al., Meth. Enzymol. ( 1979) 68:90 and the phosphodiester method disclosed by Brown et al., Meth. Enzymol. ( 1979) 68:109.
- Poly(A) or poly(C), or other non-complementary nucleotide extensions may be incorporated into probes using these same methods. Hexaethylene oxide extensions may be coupled to probes by methods known in the art. Cload et al. (1991) J. Am. Chem. Soc. 113:6324-6326; U.S. Pat. No. 4,914,210 to Levenson et al.; Durand et al. (1990) Nucleic Acids Res. 18:6353-6359; and Horn et al. (1986) Tet. Lett. 27:4705-4708.
- the primer sequences are in the range of between 10-100 nucleotides in length, such as 15-60, 20-40 and so on, more typically in the range of between 20-40 nucleotides long, and any length between the stated ranges.
- the typical probe is in the range of between 10-100 nucleotides long, such as 10-50, 15-40, 18-30, and so on, and any length between the stated ranges.
- one aspect of the invention encompasses oligonucleotides that are used as primers and probes in polymerase chain reaction (PCR) technologies to amplify transcripts of the genes which encode HAV proteins or portions of such transcripts.
- the primers have a G+C content of 40% or greater.
- Such oligonucleotides are at least 80% complementary with a sequence of SEQ ID NOs: 1-39.
- the primers and probes are at least 85% complementarity, 90% complementary, 95% complementary or more preferably 98% or 99% complementary with the sense strand or its corresponding antisense strand of SEQ ID NOs: 1-39.
- the probes may be coupled to labels for detection.
- labels for detection.
- derivatizing oligonucleotides with reactive functionalities which permit the addition of a label.
- biotinylating probes so that radioactive, fluorescent, chemiluminescent, enzymatic, or electron dense labels can be attached via avidin. See, e.g., Broken et al. (1978) Nucl. Acids Res. 5:363-384 which discloses the use of ferritin-avidin-biotin labels; and Chollet et al. (1985) Nucl. Acids Res.
- probes may be fluorescently labeled by linking a fluorescent molecule to the non-ligating terminus of the probe.
- Guidance for selecting appropriate fluorescent labels can be found in Smith et al., Meth. Enzymol . (1987) 155:260-301; Karger et al., Nucl. Acids Res . (1991) 19:4955-4962; Haugland (1989) Handbook of Fluorescent Probes and Research Chemicals (Molecular Probes, Inc., Eugene, Oreg.).
- Preferred fluorescent labels include fluorescein and derivatives thereof, such as disclosed in U.S. Pat. No. 4,318,846 and Lee et al., Cytometry (1989) 10:151-164.
- Dyes for use in the present invention include 3-pheniyl-7-isocyanatocoumarini, acridines, such as 9-isothiocyanatoacridine and acridine orange, pyrenes, benzoxadiazoles, and stilbenes.
- Additional dyes include 3-( ⁇ -carboxypentyl)-3′-ethyl-5,5′-dimethyloxa-carbocyanine (CYA); 6-carboxy fluorescein (FAM); 5,6-carboxyrhodamine-1 10 (R110); 6-carboxyrhodamine-6G (R6G); N′,N′,N′,N′-tetramethyl-6-carboxyrhodamine (TAMRA); 6-carboxy-X-rhodamine (ROX); 2′,4′,5′,7′,-tetrachloro-4-7-dichlorofluorescein (TET); 2′,7′-dimethoxy-4′,5′-6 carboxyrhodamine (JOE); 6-carboxy-2′,4,4′,5′,7,7′-hexachlorofluorescein (HEX); ALEXA; Cy3 and Cy5.
- fluorescent labels include fluorescein and derivatives thereof, such as disclosed in U.S. Pat. No. 4,318,846 and Lee et al., Cytometry (1989) 10:151-164, and 6-FAM, JOE, TAMRA, ROX, HEX-1, HEX-2, ZOE, TET-1 or NAN-2, and the like.
- probes can be labeled with an acridinium ester (AE) using the techniques described below.
- Current technologies allow the AE label to be placed at any location within the probe. See, e.g., Nelson et al. (1995) “Detection of Acridinium Esters by Chemiluminescence” in Nonisotopic Probing, Blotting and Sequencing , Kricka L. J. (ed) Academic Press, San Diego, Calif.; Nelson et al. (1994) “Application of the Hybridization Protection Assay (HPA) to PCR” in The Polymerase Chain Reaction , Mullis et al. (eds.) Birkhauser, Boston, Mass.; Weeks et al., Clin.
- HPA Hybridization Protection Assay
- An AE molecule can be directly attached to the probe using non-nucleotide-based linker arm chemistry that allows placement of the label at any location within the probe. See, e.g., U.S. Pat. Nos. 5,585,481 and 5,185,439.
- PCR polymerase chain reaction
- PCR is a technique for amplifying a desired target nucleic acid sequence contained in a nucleic acid molecule or mixture of molecules.
- a pair of primers is employed in excess to hybridize to the complementary strands of the target nucleic acid.
- the primers are each extended by a polymerase using the target nucleic acid as a template.
- the extension products become target sequences themselves after dissociation from the original target strand.
- New primers are then hybridized and extended by a polymerase, and the cycle is repeated to geometrically increase the number of target sequence molecules.
- PCR method for amplifying target nucleic acid sequences in a sample is well known in the art and has been described in, e.g., Innis et al. (eds.) PCR Protocols (Academic Press, NY 1990); Taylor (1991) Polymerase chain reaction: basic principles and automation , in PCR: A Practical Approach , McPherson et al. (eds.) IRL Press, Oxford; Saiki et al. (1986) Nature 324:163; as well as in U.S. Pat. Nos. 4,683,195, 4,683,202 and 4,889,818, all incorporated herein by reference in their entireties.
- PCR uses relatively short oligonucleotide primers which flank the target nucleotide sequence to be amplified, oriented such that their 3′ ends face each other, each primer extending toward the other.
- the polynucleotide sample is extracted and denatured, preferably by heat, and hybridized with first and second primers which are present in molar excess.
- Polymerization is catalyzed in the presence of the four deoxyribonucleotide triphosphates (dNTPs—dATP, dGTP, dCTP and dTTP) using a primer- and template-dependent polynucleotide polymerizing agent, such as any enzyme capable of producing primer extension products, for example, E.
- dNTPs deoxyribonucleotide triphosphates
- thermostable DNA polymerases isolated from Thermus aquaticus (Taq), available from a variety of sources (for example, Perkin Elmer), Thermus thermophilus (United States Biochemicals), Bacillus stereothermophilus (Bio-Rad), or Thermococcus litoralis (“Vent” polymerase, New England Biolabs). This results in two “long products” which contain the respective primers at their 5′ ends covalently linked to the newly synthesized complements of the original strands.
- the reaction mixture is then returned to polymerizing conditions, e.g., by lowering the temperature, inactivating a denaturing agent, or adding more polymerase, and a second cycle is initiated.
- the second cycle provides the two original strands, the two long products from the first cycle, two new long products replicated from the original strands, and two “short products” replicated from the long products.
- the short products have the sequence of the target sequence with a primer at each end.
- an additional two long products are produced, and a number of short products equal to the number of long and short products remaining at the end of the previous cycle.
- the number of short products containing the target sequence grow exponentially with each cycle.
- PCR is carried out with a commercially available thermal cycler, e.g., Perkin Elmer.
- RNAs may be amplified by reverse transcribing the mRNA into cDNA, and then performing PCR (RT-PCR), as described above. Alternatively, a single enzyme may be used for both steps as described in U.S. Pat. No. 5,322,770. mRNA may also be reverse transcribed into cDNA, followed by asymmetric gap ligase chain reaction (RT-AGLCR) as described by Marshall et al. (1994) PCR Meth. App. 4:80-84.
- RT-PCR reverse transcribing the mRNA into cDNA
- RT-AGLCR reverse transcribed into cDNA
- RT-AGLCR asymmetric gap ligase chain reaction
- the fluorogenic 5′ nuclease assay known as the TaqManTM assay (Perkin-Elmer), is a powerful and versatile PCR-based detection system for nucleic acid targets.
- primers and probes derived from regions of the HAV genome described herein can be used in TaqManTM analyses to detect the presence of infection in a biological sample. Analysis is performed in conjunction with thermal cycling by monitoring the generation of fluorescence signals.
- the assay system dispenses with the need for gel electrophoretic analysis, and has the capability to generate quantitative data allowing the determination of target copy numbers.
- the fluorogenic 5′ nuclease assay is conveniently performed using, for example, AmpliTaq GoldTM DNA polymerase, which has endogenous 5′ nuclease activity, to digest an internal oligonucleotide probe labeled with both a fluorescent reporter dye and a quencher (see, Holland et al. (1991) Proc. Natl. Acad.Sci. USA 88:7276-7280; and Lee et al. (1993) Nucl. Acids Res. 21:3761-3766). Assay results are detected by measuring changes in fluorescence that occur during the amplification cycle as the fluorescent probe is digested, uncoupling the dye and quencher labels and causing an increase in the fluorescent signal that is proportional to the amplification of target DNA.
- the amplification products can be detected in solution or using solid supports.
- the TaqManTM probe is designed to hybridize to a target sequence within the desired PCR product.
- the 5′ end of the TaqManTM probe contains a fluorescent reporter dye.
- the 3′ end of the probe is blocked to prevent probe extension and contains a dye that will quench the fluorescence of the 5′ fluorophore.
- the 5′ fluorescent label is cleaved off if a polymerase with 5′ exonuclease activity is present in the reaction. Excision of the 5′ fluorophore results in an increase in fluorescence which can be detected.
- the oligonucleotide probe is constructed such that the probe exists in at least one single-stranded conformation when unhybridized where the quencher molecule is near enough to the reporter molecule to quench the fluorescence of the reporter molecule.
- the oligonucleotide probe also exists in at least one conformation when hybridized to a target polynucleotide such that the quencher molecule is not positioned close enough to the reporter molecule to quench the fluorescence of the reporter molecule.
- the probe can be designed such that the quencher molecule quenches the reporter molecule when the probe is not hybridized, the probe can be designed such that the reporter molecule exhibits limited fluorescence unless the probe is either hybridized or digested.
- the present invention relates to methods for amplifying a target HAV nucleotide sequence using a nucleic acid polymerase having 5′ to 3′ nuclease activity, one or more primers capable of hybridizing to the target HAV sequence, and an oligonucleotide probe capable of hybridizing to the target HAV sequence 3′ relative to the primer.
- the polymerase digests the oligonucleotide probe when it is hybridized to the target sequence, thereby separating the reporter molecule from the quencher molecule.
- the fluorescence of the reporter molecule is monitored, with fluorescence corresponding to the occurrence of nucleic acid amplification.
- the reporter molecule is preferably a fluorescein dye and the quencher molecule is preferably a rhodamine dye.
- the probe sequences are selected such that they have a lower melt temperature than the primer sequences. Hence, the primer sequences are generally longer than the probe sequences. Typically, the primer sequences are in the range of between 10-75 nucleotides long, more typically in the range of 20-45. The typical probe is in the range of between 10-50 nucleotides long, more typically 15-40 nucleotides in length.
- the oligonucleotide probe may be attached to the solid support in a variety of manners.
- the probe may be attached to the solid support by attachment of the 3′ or 5′ terminal nucleotide of the probe to the solid support.
- the probe is attached to the solid support by a linker which serves to distance the probe from the solid support.
- the linker is usually at least 15-30 atoms in length, more preferably at least 15-50 atoms in length. The required length of the linker will depend on the particular solid support used. For example, a six atom linker is generally sufficient when high cross-linked polystyrene is used as the solid support.
- linkers are known in the art which may be used to attach the oligonucleotide probe to the solid support.
- the linker may be formed of any compound which does not significantly interfere with the hybridization of the target sequence to the probe attached to the solid support.
- the linker may be formed of a homopolymeric oligonucleotide which can be readily added on to the linker by automated synthesis.
- polymers such as functionalized polyethylene glycol can be used as the linker. Such polymers are preferred over homopolymeric oligonucleotides because they do not significantly interfere with the hybridization of probe to the target oligonucleotide. Polyethylene glycol is particularly preferred.
- linkages between the solid support, the linker and the probe are preferably not cleaved during removal of base protecting groups under basic conditions at high temperature.
- preferred linkages include carbamate and amide linkages.
- Examples of preferred types of solid supports for immobilization of the oligonucleotide probe include controlled pore glass, glass plates, polystyrene, avidin-coated polystyrene beads, cellulose, nylon, acrylamide gel and activated dextran.
- TMA transcription-mediated amplification
- TMA provides a method of identifying target nucleic acid sequences present in very small amounts in a biological sample. Such sequences may be difficult or impossible to detect using direct assay methods.
- TMA is an isothemal, autocatalytic nucleic acid target amplification system that can provide more than a billion RNA copies of a target sequence.
- the assay can be done qualitatively, to accurately detect the presence or absence of the target sequence in a biological sample.
- the assay can also provide a quantitative measure of the amount of target sequence over a concentration range of several orders of magnitude.
- TMA provides a method for autocatalytically synthesizing multiple copies of a target nucleic acid sequence without repetitive manipulation of reaction conditions such as temperature, ionic strength and pH.
- TMA includes the following steps: (a) isolating nucleic acid, including RNA, from the biological sample of interest suspected of being infected with HAV; and (b) combining into a reaction mixture (i) the isolated nucleic acid, (ii) first and second oligonucleotide primers, the first primer having a complexing sequence sufficiently complementary to the 3′ terminal portion of an RNA target sequence, if present (for example the (+) strand), to complex therewith, and the second primer having a complexing sequence sufficiently complementary to the 3′ terminal portion of the target sequence of its complement (for example, the ( ⁇ ) strand) to complex therewith, wherein the first oligonucleotide further comprises a sequence 5′ to the complexing sequence which includes a promoter, (iii) a reverse transcriptase or RNA and DNA dependent DNA polymerases, (iv) an enzyme activity which selectively degrades the RNA strand of an RNA-DNA complex (such as an RNAse H) and (
- the components of the reaction mixture may be combined stepwise or at once.
- the reaction mixture is incubated under conditions whereby an oligonucleotide/target sequence is formed, including DNA priming and nucleic acid synthesizing conditions (including ribonucleotide triphosphates and deoxyribonucleotide triphosphates) for a period of time sufficient to provide multiple copies of the target sequence.
- the reaction advantageously takes place under conditions suitable for maintaining the stability of reaction components such as the component enzymes and without requiring modification or manipulation of reaction conditions during the course of the amplification reaction. Accordingly, the reaction may take place under conditions that are substantially isothermal and include substantially constant ionic strength and pH.
- the reaction conveniently does not require a denaturation step to separate the RNA-DNA complex produced by the first DNA extension reaction.
- Suitable DNA polymerases include reverse transcriptases, such as avian myeloblastosis virus (AMV) reverse transcriptase (available from, e.g., Seikagaku America, Inc.) and Moloney murine leukemia virus (MMLV) reverse transcriptase (available from, e.g., Bethesda Research Laboratories).
- AMV avian myeloblastosis virus
- MMLV Moloney murine leukemia virus
- Promoters or promoter sequences suitable for incorporation in the primers are nucleic acid sequences (either naturally occurring, produced synthetically or a product of a restriction digest) that are specifically recognized by an RNA polymerase that recognizes and binds to that sequence and initiates the process of transcription whereby RNA transcripts are produced.
- the sequence may optionally include nucleotide bases extending beyond the actual recognition site for the RNA polymerase which may impart added stability or susceptibility to degradation processes or increased transcription efficiency.
- useful promoters include those which are recognized by certain bacteriophage polymerases such as those from bacteriophage T3, T7 or SP6, or a promoter from E. coli . These RNA polymerases are readily available from commercial sources, such as New England Biolabs and Epicentre.
- RNAse H activity
- AMV reverse transcriptase RNAse H activity
- exogenous RNAse H such as E. coli RNAse H
- RNAse H is readily available from, e.g., Bethesda Research Laboratories.
- RNA transcripts produced by these methods may serve as templates to produce additional copies of the target sequence through the above-described mechanisms.
- the system is autocatalytic and amplification occurs autocatalytically without the need for repeatedly modifying or changing reaction conditions such as temperature, pH, ionic strength or the like.
- Detection may be done using a wide variety of methods, including direct sequencing, hybridization with sequence-specific oligomers, gel electrophoresis and mass spectrometry. these methods can use heterogeneous or homogeneous formats, isotopic or nonisotopic labels, as well as no labels at all.
- One preferable method of detection is the use of target sequence-specific oligonucleotide probes, derived from the sequences described in FIGS. 1-13 and fragments thereof.
- the probes may be used in hybridization protection assays (HPA).
- HPA hybridization protection assays
- the probes are conveniently labeled with acridinium ester (AE), a highly chemiluminescent molecule.
- AE acridinium ester
- HPA Hybridization Protection Assay
- Chemiluminescence is triggered by reaction with alkaline hydrogen peroxide which yields an excited N-methyl acridone that subsequently collapses to ground state with the emission of a photon. Additionally, AE causes ester hydrolysis which yields the nonchemiluminescent-methyl acridinium carboxylic acid.
- HPA generally consists of the following steps: (a) the AE-labeled probe is hybridized with the target nucleic acid in solution for about 15 to about 30 minutes. A mild alkaline solution is then added and AE coupled to the unhybridized probe is hydrolyzed. This reaction takes approximately 5 to 10 minutes. The remaining hybrid-associated AE is detected as a measure of the amount of target present. This step takes approximately 2 to 5 seconds.
- the differential hydrolysis step is conducted at the same temperature as the hybridization step, typically at 50 to 70° C. Alternatively, a second differential hydrolysis step may be conducted at room temperature.
- HPA is described in detail in, e.g., U.S. Pat. Nos. 6,004,745; 5,948,899; and 5,283,174, the disclosures of which are incorporated by reference herein in their entireties.
- TMA is described in detail in, e.g., U.S. Pat. No.5,399,491, the disclosure of which is incorporated herein by reference in its entirety.
- an isolated nucleic acid sample suspected of containing a HAV target sequence, is mixed with a buffer concentrate containing the buffer, salts, magnesium, nucleotide triphosphates, primers, dithiothreitol, and spermidine.
- the reaction is optionally incubated at about 100° C. for approximately two minutes to denature any secondary structure.
- reverse transcriptase, RNA polymerase, and RNAse H are added and the mixture is incubated for two to four hours at 37° C.
- the reaction can then be assayed by denaturing the product, adding a probe solution, incubating 20 minutes at 60° C., adding a solution to selectively hydrolyze the unhybridized probe, incubating the reaction six minutes at 60° C., and measuring the remaining chemiluminescence in a luminometer.
- the above-described assay reagents can be provided in kits, with suitable instructions and other necessary reagents, in order to conduct the assays as described above.
- the kit will normally contain in separate containers the combination of primers and probes (either already bound to a solid matrix or separate with reagents for binding them to the matrix), control formulations (positive and/or negative), labeled reagents when the assay format requires same and signal generating reagents (e.g., enzyme substrate) if the label does not generate a signal directly.
- kit can also contain, depending on the particular assay used, other packaged reagents and materials (i.e. wash buffers and the like). Standard assays, such as those described above, can be conducted using these kits.
- Recombinant or synthetic HAV polypeptides can be used as diagnostics, or those which produce an immunological response, such as those that give rise to neutralizing antibodies, may be formulated into vaccines. Antibodies raised against these polypeptides can also be used as diagnostics, or for passive immunotherapy. In addition, antibodies to these polypeptides are useful for isolating and identifying HAV particles.
- the HAV antigens may also be isolated from HAV virions. The virions may be grown in HAV infected cells in tissue culture, or in an infected host.
- the antibodies may be polyclonal or monoclonal, may be a human antibody, or may be a hybrid or chimeric antibody, such as a humanized antibody, an altered antibody, F(ab′) 2 fragments, F(ab) fragments, Fv fragments, a single-domain antibody, a dimeric or trimeric antibody fragment construct, a minibody, or functional fragments thereof which bind to the analyte of interest.
- Antibodies are produced using techniques well known to those of skill in the art and disclosed in, for example, U.S. Pat. Nos. 4,011,308; 4,722,890; 4,016,043; 3,876,504; 3,770,380; and 4,372,745.
- polyclonal antibodies are generated by immunizing a suitable animal, such as a mouse, rat, rabbit, sheep or goat, with an antigen of interest.
- the antigen can be linked to a carrier prior to immunization.
- a carrier are well known to those of ordinary skill in the art.
- Immunization is generally performed by mixing or emulsifying the antigen in saline, preferably in an adjuvant such as Freund's complete adjuvant, and injecting the mixture or emulsion parenterally (generally subcutaneously or intramuscularly). The animal is generally boosted 2-6 weeks later with one or more injections of the antigen in saline, preferably using Freund's incomplete adjuvant.
- Antibodies may also be generated by in vitro immunization, using methods known in the art. Polyclonal antiserum is then obtained from the immunized animal.
- Monoclonal antibodies are generally prepared using the method of Kohler and Milstein (1975) Nature 256:495-497, or a modification thereof.
- a mouse or rat is immunized as described above.
- the spleen (and optionally several large lymph nodes) is removed and dissociated into single cells.
- the spleen cells may be screened (after removal of nonspecifically adherent cells) by applying a cell suspension to a plate or well coated with the antigen.
- B-cells, expressing membrane-bound immunoglobulin specific for the antigen will bind to the plate, and are not rinsed away with the rest of the suspension.
- Resulting B-cells, or all dissociated spleen cells are then induced to fuse with myeloma cells to form hybridomas, and are cultured in a selective medium (e.g., hypoxanthine, aminopterin, thymidine medium, “HAT”).
- the resulting hybridomas are plated by limiting dilution, and are assayed for the production of antibodies which bind specifically to the immunizing antigen (and which do not bind to unrelated antigens).
- the selected monoclonal antibody-secreting hybridomas are then cultured either in vitro (e.g., in tissue culture bottles or hollow fiber reactors), or in vivo (e.g., as ascites in mice).
- Human monoclonal antibodies are obtained by using human rather than murine hybridomas. See, e.g., Cote, et al. Monclonal Antibodies and Cancer Therapy , Alan R. Liss, 1985, p. 77
- Monoclonal antibodies or portions thereof may be identified by first screening a B-cell cDNA library for DNA molecules that encode antibodies that specifically bind to p185, according to the method generally set forth by Huse et al. (1989) Science 246:1275-1281. The DNA molecule may then be cloned and amplified to obtain sequences that encode the antibody (or binding domain) of the desired specificity.
- antibody fragments which retain the ability to recognize the molecule of interest, will also find use in the subject invention.
- a number of antibody fragments are known in the art which comprise antigen-binding sites capable of exhibiting immunological binding properties of an intact antibody molecule.
- functional antibody fragments can be produced by cleaving a constant region, not responsible for antigen binding, from the antibody molecule, using e.g., pepsin, to produce F(ab′) 2 fragments. These fragments will contain two antigen binding sites, but lack a portion of the constant region from each of the heavy chains.
- Fab fragments comprising a single antigen binding site, can be produced, e.g., by digestion of polyclonal or monoclonal antibodies with papain.
- Functional fragments including only the variable regions of the heavy and light chains, can also be produced, using standard techniques such as recombinant production or preferential proteolytic cleavage of immunoglobulin molecules. These fragments are known as F v . See, e.g., Inbar et al. (1972) Proc. Nat. Acad. Sci. USA 69:2659-2662; Hochman et al. (1976) Biochem 15:2706-2710; and Ehrlich et al. (1980) Biochem 19:4091-4096.
- a single-chain Fv (“sFv” or “scFv”) polypeptide is a covalently linked V H -V L heterodimer which is expressed from a gene fusion including V H - and V L -encoding genes linked by a peptide-encoding linker.
- a number of methods have been described to discern and develop chemical structures (linkers) for converting the naturally aggregated, but chemically separated, light and heavy polypeptide chains from an antibody V region into an sFv molecule which will fold into a three dimensional structure substantially similar to the structure of an antigen-binding site. See, e.g., U.S. Pat.
- the sFv molecules may be produced using methods described in the art. See, e.g., Huston et al. (1988) Proc. Nat. Acad. Sci. USA 85:5879-5883; U.S. Pat. Nos. 5,091,513, 5,132,405 and 4,946,778.
- Design criteria include determining the appropriate length to span the distance between the C-terminus of one chain and the N-terminus of the other, wherein the linker is generally formed from small hydrophilic amino acid residues that do not tend to coil or form secondary structures. Such methods have been described in the art. See, e.g., U.S. Pat. Nos.
- Suitable linkers generally comprise polypeptide chains of alternating sets of glycine and serine residues, and may include glutamic acid and lysine residues inserted to enhance solubility.
- Minibodies are sFv polypeptide chains which include oligomerization domains at their C-termini, separated from the sFv by a hinge region. Pack et al. (1992) Biochem 31:1579-1584.
- the oligomerization domain comprises self-associating ⁇ -helices, e.g., leucine zippers, that can be further stabilized by additional disulfide bonds.
- the oligomerization domain is designed to be compatible with vectorial folding across a membrane, a process thought to facilitate in vivo folding of the polypeptide into a functional binding protein.
- minibodies are produced using recombinant methods well known in the art. See, e.g., Pack et al. (1992) Biochem 31:1579-1584; Cumber et al. (1992) J Immunology 149B:120-126.
- polypeptides of the present invention may comprise a substantially complete viral domain, in many applications all that is required is that the polypeptide comprise an antigenic or immunogenic region of the virus.
- the polypeptides of SEQ ID Nos: 40-48 are used to elicit an immunological response.
- an immunological region of a polypeptide is generally relatively small—typically 8 to 10 amino acids or less in length. Fragments of as few as 5 amino acids may characterize an antigenic region. These segments may correspond to regions encoding for capsid proteins, nonstructural proteins, and the junction between the capsid precursor P1 and 2A.
- DNAs encoding short segments of these polypeptides can be expressed recombinantly either as fusion proteins, or as isolated polypeptides.
- short amino acid sequences can be conveniently obtained by chemical synthesis.
- the polypeptide may be linked to a suitable carrier.
- a suitable carrier A number of techniques for obtaining such linkage are known in the art,. including the formation of disulfide linkages using N-succinimidyl-3-(2-pyridyl-thio)propionate (SPDP) and succinimidyl 4-(N-maleimido-methyl)cyclohexane-1-carboxylate (SMCC) obtained from Pierce Company, Rockford, Ill., (if the peptide lacks a sulfhydryl group, this can be provided by addition of a cysteine residue).
- SPDP N-succinimidyl-3-(2-pyridyl-thio)propionate
- SMCC succinimidyl 4-(N-maleimido-methyl)cyclohexane-1-carboxylate
- reagents create a disulfide linkage between themselves and peptide cysteine residues on one protein and an amide linkage through the epsilon-amino on a lysine, or other free amino group in the other.
- a variety of such disulfide/amide-forming agents are known. See, for example, Immun. Rev. (1982) 62:185.
- Other bifunctional coupling agents form a thioether rather than a disulfide linkage.
- thio-ether-forming agents are commercially available and include reactive esters of 6-maleimidocaproic acid, 2-bromoacetic acid, 2-iodoacetic acid, 4-(N-maleimido-methyl)cyclohexane-1-carboxylic acid, and the like.
- the carboxyl groups can be activated by combining them with succinimide or 1-hydroxyl-2-nitro-4-sulfonic acid, sodium salt.
- Additional methods of coupling antigens employs the rotavirus/“binding peptide” system described in EPO Pub. No. 259,149, the disclosure of which is incorporated herein by reference. The foregoing list is not meant to be exhaustive, and modifications of the named compounds can clearly be used.
- Suitable carriers are typically large, slowly metabolized macromolecules such as proteins; polysaccharides, such as latex functionalized SepharoseTM, agarose, cellulose, cellulose beads and the like; polymeric amino acids, such as polyglutamic acid, polylysine, and the like; amino acid copolymers; and inactive virus particles.
- Especially useful protein substrates are serum albumins, keyhole limpet hemocyanin, immunoglobulin molecules, thyroglobulin, ovalbumin, tetanus toxoid, and other proteins well known to those skilled in the art.
- polypeptides comprising truncated HAV amino acid sequences encoding at least one viral epitope are useful immunological reagents.
- polypeptides comprising such truncated sequences can be used as reagents in an immunoassay.
- These polypeptides also are candidate subunit antigens in compositions for antiserum production or vaccines. While these truncated sequences can be produced by various known treatments of native viral protein, it is generally preferred to make synthetic or recombinant polypeptides comprising an HAV sequence.
- Polypeptides comprising these truncated HAV sequences can be made up entirely of HAV sequences (one or more epitopes, either contiguous or noncontiguous), or HAV sequences and heterologous sequences in a fusion protein.
- Useful heterologous sequences include sequences that provide for secretion from a recombinant host, enhance the immunological reactivity of the HAV epitope(s), or facilitate the coupling of the polypeptide to an immunoassay support or a vaccine carrier. See, e.g., EPO Pub. No. 116,201; U.S. Pat. No. 4,722,840; EPO Pub. No. 259,149; U.S. Pat. No. 4,629,783, the disclosures of which are incorporated herein by reference.
- the size of polypeptides comprising the truncated HAV sequences can vary widely, the minimum size being a sequence of sufficient size to provide an HAV epitope, while the maximum size is not critical. In some applications, the maximum size usually is not substantially greater than that required to provide the desired HAV epitopes and function(s) of the heterologous sequence, if any.
- the truncated HAV amino acid sequence will range from about 5 to about 100 amino acids in length. More typically, however, the HAV sequence will be a maximum of about 50 amino acids in length, preferably a maximum of about 30 amino acids. It is usually desirable to select HAV sequences of at least about 10, 12 or 15 amino acids, up to a maximum of about 20 or 25 amino acids.
- the truncated HAV amino acid sequence are selected from SEQ ID NOs: 40-48.
- the polynucleotides or the truncated amino acid sequences have at least about 50% homology to the polynucleotides of SEQ ID NOs: 40-48, preferably about 80% homology to the polynucleotides of SEQ ID NOs: 40-48, more preferably about 90%, 95%, or 99% homology to the polynucleotides of SEQ ID NOs: 40-48.
- Truncated HAV amino acid sequences comprising epitopes can be identified in a number of ways.
- the entire viral protein sequence can be screened by preparing a series of short peptides that together span the entire protein sequence.
- 100-mer polypeptides it would be routine to test each polypeptide for the presence of epitope(s) showing a desired reactivity, and then testing progressively smaller and overlapping fragments from an identified 100-mer to map the epitope of interest. Screening such peptides in an immunoassay is within the skill of the art.
- the immunogenicity of the HAV sequences may also be enhanced by preparing the sequences fused to or assembled with particle-forming proteins such as, for example, hepatitis B surface antigen or rotavirus VP6 antigen. Constructs wherein the HAV epitope is linked directly to the particle-forming protein coding sequences produce hybrids which are immunogenic with respect to the HAV epitope.
- all of the vectors prepared include epitopes specific to HAV, having various degrees of immunogenicity, such as, for example, the pre-S peptide.
- particles constructed from particle forming protein which include HAV sequences are immunogenic with respect to HAV and particle-form protein.
- enzymes were purchased from commercial sources, and used according to the manufacturers' directions. Nitrocellulose filters and the like were also purchased from commercial sources.
- RNA samples that had previously tested positive for HAV by IgM anti-HAV ELISA [ETI-HA-IgMK PLUS; DiaSorin, Inc; Saluggia (VC), Italy] were used to isolate RNA for subsequent experiments. Samples were stored at ⁇ 80° C. until used. RNA was extracted from 0.14 mL of serum using the QIAamp Viral Mini Spin Kit (QIAGEN, Valencia, Calif.) following the manufacturer's specifications.
- the RT-PCR was performed using the Titan One Tube RT-PCR Kit (Roche, Mannheim, Germany) to amplify a 243 bp fragment in the VP3/VP1 region.
- the 243 bp fragment corresponds to nucleotide positions 2172-2415 of the HAV genome as reported by Cohen et al. (1987) J. Virol. 61: 50-59.
- PCR products were electrophoresed on 4-20% polyacrylamide gels, stained with ethidium bromide and visualized under an UV source. Purification of amplified fragments was carried out using the QiaQuick PCR purification kit (QIAGEN, Valencia, Calif.).
- the PCR fragments were cloned into TOPO-TA vectors (Invitrogen, Carlsbad, Calif.). Cloning into these vectors is highly facilitated when the amplified DNA contains a single deoxyadenosine (A) at its 3′ end. Accordingly, a catalytic reaction to add the 3′ (A) overhead was used.
- the reaction mix contained 1.25 mM of dATP, 0.5 units of Taq polymerase (Perkin Elmer, Boston, Mass.) and proceeded at 72 C for 15 min.
- PCR fragments were cloned into the pCR2.1-TOPO vector using the Invitrogen's TA cloning kit (TOPO TM TA Cloning R Kit with One Shot TOP10 Electrocompetent Cells) following the manufacturer's specifications. Bacterial cells were incubated at 37° C. on Luria Broth plates containing ampicillin at 100 ⁇ g/mL, 0.66 mM IPTG and 0.033% X-Gal. A number of white colonies were inoculated in 4 mL of Luria-Broth ampicillin (100 ⁇ g/ml) and incubated overnight at 37° C. with shaking.
- Cloning of HAV P 1/2A precursor and full length open reading frame nucleotide fragments of interest include PCR of fragments of interest from the Chiron plasmid pHAVFL 18.3 #2 already containing a full length ORF of HAV and cloning those fragments of interest into the pGEM-4z vector (Promega, Madison Wis.).
- the pGEM vector has both an SP6 and T7 promoter to facilitate in vitro RNA synthesis of cloned products.
- the pGEM-4z vector was made by restriction digest of the plasmid using KpnI and SphI restriction enzymes (Roche Applied Science, Indianapolis, Ind.) followed by a phosphatase reaction using shrimp alkaline phosphatase (Roche Applied Science). The vector was then electrophoresed on an agarose gel and purified using the Promega Wizard PCR Purification kit (Promega).
- Primers were designed to flank the regions of interest and included the KpnI and SphI restriction sites to facilitate cloning. Primers were ordered from an in-house DNA synthesis facility. PCR reactions using pHAVFL 18.3 #2 as template were done using the Roche Expand High Fidelity PCR System following the manufacturer's recommendations. The PCR products were electrophoresed on an agarose gel and purified using the Promega Wizard PCR Purification kit (Promega).
- PCR products were ligated into the pGEM-4z vector using Roche Rapid DNA Ligation kit (Roche Applied Science) and transformed into HB101 competent cells. Bacterial cells were incubated at 37° C. on Luria Broth plates containing ampicillin at 100 ⁇ g/mL overnight. Three mL of overnight cultures were used to prepare plasmid DNA using the QIAprep Miniprep kit (QIAGEN, Valencia Calif.). Recombinant clones were identified by restriction enzyme analysis with KpnI and SphI (Roche Applied Science) and gel electrophoresed.
- Cloning of HAV full length open reading frame plus additional 3′ untranslated sequences include insertion of the HAV fragment from KpnI-DrdI from the above described HAV full length cloned fragment and a synthetic DNA region from DrdI-SphI into the pGEM-4z KpnI-SphI vector described above.
- Restriction enzyme digest was done on the pGEM-4z full length HAV construct described above to isolate a fragment using KpnI and DrdI enzymes (Roche Applied Science). The digest was electrophoresed and purified using the Promega Wizard PCR Purification kit (Promega). Synthetic DNA oligos were designed and ordered from an in-house DNA synthesis facility. The synthetic DrdI-SphI region was annealed from separate oligos and kinased according to standard molecular biology protocol. The two separate fragments were then ligated into the pGEM-4z vector using Roche Rapid DNA Ligation kit (Roche Applied Science) and transformed into HB101 competent cells. Bacterial cells were incubated at 37° C.
- plasmids from recombinant clones were prepared using Qiagen Maxi Plasmid kit (Qiagen) and the DNA suspended in ddH 2 O at 0.2 mg/mL. Nucleotide sequence determination of the HAV fragments was performed using an Applied BioSystems Model 373 or Model 377 DNA Sequencer system. The nucleotide sequence of the HAV inserts cloned in the pGEM-4z vector is shown in FIGS. 2-4 .
- Fragments encoding for P1, P1-2A, 1B, 1C, 1D, 2A and 3A were amplified using the DNA of a recombinant plasmid obtained in Chiron Corporation which contains the full-length HAV coding reading frame cloned in pUC 18.
- PCR primers were designed to PCR out the P1, P 1-2A, 1B, 1C, 1D, 2A and 3A regions of HAV. To facilitate the cloning of these regions into Chiron yeast expression vectors the NcoI, XhoI, and SalI restriction sites were introduced in the primers as required.
- PCR primers were synthesized in the DNA synthesis facility of Chiron Corporation. Synthetic oligonucleotides were purified, suspended in 300 ul of dH 2 O and their optical densities at 260 nm determined. The reaction mix contained 0.25 ng of template, 100 pmol of each primer, 10 ul of 1.25 mM of each dNTP and 1 unit of Taq polymerase (Vendor) in a final volume of 50 uL. Amplification conditions were 94° C. for 1 min., 50° C. for 2 min. and 68° C. for 4 min. for 35 cycles. A 7-min. post incubation at 75° C. was added to ensure the full extension of fragments.
- Plasmid pBS24.1 that was engineered to contain the yeast hybrid promoter ADH2/GAPDH (Cousens et al. (1987) Gene 61, 265-275) and an XhoI restriction site, was used for heterologous expression of the HAV recombinant proteins.
- This yeast expression vector contains 2 ⁇ g sequences and inverted repeats (IR) for autonomous replication in yeast, the ⁇ -factor terminator to ensure transcription termination, and the yeast leu2-d and URA3 for selection.
- the Co1E1 origin of replication and the ⁇ -lactamase gene are also present for propagation and selection in E. coli (Pichuantes et al.
- Plasmid pBS24.1 was digested with BamHI/SalI or XhoI/SalI and dephosphorylated with 10 units of calf intestine alkaline phosphatase (Boheringer Manheim, Indianapolis, Ind.) under the conditions recommended by the vendor.
- HAV nucleotide sequences coding for HAV 2A and 3A were fused to DNA sequences coding for the human superoxide dismutase (SOD) prior to the cloning.
- SOD superoxide dismutase
- the digested and purified HAV recombinant fragments were ligated with digested pBS24.1 using the Roche Rapid Ligation kit and protocol.
- the ligation mix was then used to transform Escherichia coli HB101 competent cells and transformants were selected in Luria-Broth plates containing ampicillin at 100 ⁇ g/mL after an overnight incubation at 37° C.
- Plasmid DNA was prepared using 1.5 mL of cultures and the QIAprep Miniprep kit (QIAGEN). Recombinant clones were identified by analytical restriction enzyme analysis with BamHI-SalI. Large-scale preparations of recombinant plasmids were made to perform sequencing to confirm the nucleotide sequence of the cloned HAV fragments.
- Yeast expression plasmids exhibiting the expected sequence for HAV P1, P1-2A, 1B, 1C, 1D, SOD-2A and SOD-3A were used in experiments of yeast transformation as follows. Competent Saccharomyces cerevisiae AD3 cells [Mat a, trp1+, ura3-52, prb1-1122, pep4-3, prc1-407, [cir 0 ],::pDM15(pGAP/ADR1::G418 R )], leu2( ⁇ AD)] were transformed with plasmid DNAs encoding for NS1, VP1 or VP2.
- yeast recombinants were achieved by two rounds of uracil-deficient plates followed by one round of leucine-deficient plates after incubation at 30° C. for 48-72 hours. Cultures were then grown in leucine-deficient media and then in YEP supplemented with 2% glucose (Pichuantes et al. (1989) Proteins: Struct. Funct. Genet. 6: 324-337) for 48 h before checking expression of the recombinant proteins.
- the nucleotide (SEQ ID NOs: 31-39) and corresponding amino acid sequences for the various proteins (SEQ ID NOs: 40-48) are shown in FIGS. 5-13 .
- the amino acid sequence of the polypeptides was deduced from the nucleotide sequences.
- the nucleotide sequences and the amino acid sequences were compared to the wild-type HAV nucleotide and protein sequences reported by Cohen et al (1987) J. Virol. 61:50-59.
- the polynucleotide sequences from the Indonesian samples have a 93.8-96.7% homology, while the polynucleotide sequences from the Chilean samples have a 90.5-94.7% homology.
- the amino acid sequences from the Indonesian and the Chilean samples have a homology of 98.8-100% and 97.5-98.8%, respectively.
Landscapes
- Health & Medical Sciences (AREA)
- Chemical & Material Sciences (AREA)
- Life Sciences & Earth Sciences (AREA)
- Organic Chemistry (AREA)
- Virology (AREA)
- Proteomics, Peptides & Aminoacids (AREA)
- General Health & Medical Sciences (AREA)
- Molecular Biology (AREA)
- Wood Science & Technology (AREA)
- Immunology (AREA)
- Engineering & Computer Science (AREA)
- Biochemistry (AREA)
- Genetics & Genomics (AREA)
- Biophysics (AREA)
- Zoology (AREA)
- Medicinal Chemistry (AREA)
- Communicable Diseases (AREA)
- General Engineering & Computer Science (AREA)
- Analytical Chemistry (AREA)
- Biotechnology (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Gastroenterology & Hepatology (AREA)
- Microbiology (AREA)
- Physics & Mathematics (AREA)
- General Chemical & Material Sciences (AREA)
- Animal Behavior & Ethology (AREA)
- Oncology (AREA)
- Public Health (AREA)
- Nuclear Medicine, Radiotherapy & Molecular Imaging (AREA)
- Pharmacology & Pharmacy (AREA)
- Veterinary Medicine (AREA)
- Chemical Kinetics & Catalysis (AREA)
- Measuring Or Testing Involving Enzymes Or Micro-Organisms (AREA)
- Peptides Or Proteins (AREA)
- Medicines Containing Antibodies Or Antigens For Use As Internal Diagnostic Agents (AREA)
- Investigating Or Analysing Materials By The Use Of Chemical Reactions (AREA)
Abstract
Hepatitis A virus primers and probes derived from the capsid proteins and junction between the capsid precursor P1 and 2A of the HAV genome are disclosed. Also disclosed are nucleic acid-based assays using the primers and probes, antigen detection of HAV, and immunoassay for detecting the antibodies that bind to HAV.
Description
- This application is related to provisional patent applications Ser. No. 60/328,933 filed Oct. 12, 2001, from which priority is claimed under 35 USC §119(e)(1) and which application is incorporated herein by reference in its entireties.
- The present invention pertains generally to viral diagnostics. In particular, the invention relates to nucleic acid and antibody-based assays for accurately diagnosing hepatitis A virus infection.
- Hepatitis A is an enterically transmitted disease that causes fever, malaise, anorexia, nausea, abdominal discomfort and jaundice. The etiologic agent of hepatitis A, the hepatitis A virus, is a small, nonenveloped, spherical virus classified in the genus Hepatovirus of the Picornaviridae family. The HAV genome consists of a single-strand, linear, 7.5 kb RNA molecule encoding a polyprotein precursor that is processed to yield the structural proteins and enzymatic activities required for viral replication. HAV grows poorly in cell culture, is not cytopathic, and produces low yields of virus. Although HAV RNA extracted from virions is infectious in cell culture (Locarnini et al., J. Virol. 37:216-225, 1981 and Siegl et al., J. Gen. Virol. 57:331-341, 1981), direct manipulation of the viral genome becomes difficult because of its RNA composition.
- HAV encodes four capsid proteins (A, B, C and D) which contain the major antigenic domains recognized by antibodies of infected individuals. In addition to the capsid proteins, antigenic domains have been reported in nonstructural proteins such as 2A and the viral encoded protease. Another important HAV antigenic domain has been described in the junction between the capsid precursor P1 and 2A.
- HAV is normally acquired by fecal-oral route, by either person-to-person contact or ingestion of contaminated food or water. However, there is the potential for HAV transmission by pooled plasma products. The absence of a lipid envelope makes HAV very resistant to physicochemical inactivation, and the virus can withstand conventional heat treatment of blood products. Thus, HAV, as well as Parvovirus B19, have been transmitted through the administration of pooled plasma derivatives. The development of sensitive and specific diagnostic assays to identify HAV antigens and/or antibodies in infected individuals as well as nucleic acid-based tests to detect viremic samples to exclude them from transfusion represents an important public health challenge.
- Therefore, there remains a need for the development of reliable diagnostic tests to detect hepatitis A virus in viremic samples, in order to prevent transmission of the virus through blood and plasma derivatives or by close personal contact.
- The present invention is based on the development of a sensitive, reliable nucleic acid-based diagnostic test for the detection of hepatitis A virus (HAV) in biological samples from potentially infected individuals. The techniques described herein utilize extracted sample RNA as a template for amplification of HAV genomic sequence using transcription-mediated amplification (TMA), as well as in a 5′ nuclease assay, such as the TaqMan™ technique. The methods allow for the detection of HAV in viremic samples. Accordingly, infected samples can be identified and excluded from transfusion, as well as from the preparation of blood derivatives.
- In one aspect, the invention is directed to an isolated polynucleotide comprising (a) a nucleotide sequence comprising any one of the nucleotide sequences depicted in SEQ ID NOs: 1-39; (b) an isolated polynucleotide encoding a polypeptide comprising any one of SEQ ID NOs: 40-48; (c) a sequence complementary to any one of the sequences of (a) or (b); or (d) a fragment of any of the sequences in (a) or (b) wherein the fragment is at least 10 nucleotides.
- In another embodiment, the invention is directed to an oligonucleotide primer consisting of a promoter region recognized by a DNA-dependent RNA polymerase operably linked to a HAV-specific complexing sequence of about 10 to about 75 nucleotides. In certain embodiments, the promoter region is the T7 promoter and said polymerase is T7 RNA polymerase. Additionally, the HAV-specific sequence may be from the HAV genome, such as a nucleotide sequence comprising any one of the nucleotide sequences depicted in SEQ ID NOs: 1-39.
- In yet further embodiments, the invention is directed an oligonucleotide primer consisting of a T7 promoter operably linked to a HAV-specific complexing sequence of about 10 to about 75 nucleotides, wherein the HAV-specific complexing sequence is derived from any one of the polynucleotide sequences of SEQ ID NOs: 1-39.
- In another embodiment, the invention is directed to an oligonucleotide probe comprising a HAV-specific hybridizing sequence of about 10 to about 50 nucleotides linked to an acridinium ester label. In certain embodiments, the HAV-specific hybridizing sequence is a polynucleotide sequence derived from any one of the polynucleotide sequences of SEQ ID NOs: 1-39.
- In another embodiment, the invention is directed to a vaccine composition comprising an isolated immunogenic Hepatitis A virus (HAV) polypeptide, and a pharmaceutically acceptable excipient, wherein the HAV polypeptide is a polypeptide with at least 80% sequence identity to any one of the sequences of SEQ ID NOs: 40-48, or an immunogenic fragment thereof comprising at least 10 amino acids.
- In yet an additional embodiment, the invention is directed to a diagnostic test kit comprising one or more oligonucleotide primers described herein, and instructions for conducting the diagnostic test. In certain embodiments, the test kit further comprises an oligonucleotide probe comprising a HAV hybridizing sequence of about 10 to about 50 nucleotides linked to an acridinium ester label.
- In another embodiment, the invention is directed to an immunoassay for detecting antibodies that bind to a hepatitis A virus polypeptide comprising: providing an antigen comprising a sequence having at least 80% sequence identity to any one of the sequences of SEQ ID NOs: 40-48, or fragment thereof; incubating the antigen with a biological sample under conditions that allow for formation of an antibody-antigen complex; and detecting any antibody-antigen complexes comprised of said antigen. The antigen may be immobilized on a solid support, and may be at least 10 amino acids. In addition, the biological sample can be bodily fluid, tissue, or organ, such as human blood or a fraction thereof.
- In yet another embodiment, the invention is directed to a method for detecting Hepatitis A virus (HAV) infection in a biological sample, the method comprising (a) isolating nucleic acid from a biological sample suspected of containing Hepatitis A virus (HAV) RNA, wherein said nucleic acid comprises a target sequence, (b) reacting the HAV nucleic acid with a detectably labeled probe sufficiently complementary to and capable of hybridizing with the target sequence, wherein the probe is derived from any one of SEQ ID NOs: 1-39, and further wherein said reacting is done under conditions that provide for the formation of a probe/target sequence complex, and (c) detecting the presence or absence of label as an indication of the presence or absence of the target sequence.
- These and other aspects of the present invention will become evident upon reference to the following detailed description and attached drawings. In addition, various references are set forth herein which describe in more detail certain procedures or compositions, and are therefore incorporated by reference in their entirety.
-
FIG. 1 illustrates the nucleotide sequence of the 243 base pair VP3/VP1 HAV fragment determined for thirteen Indonesian (IND) (SEQ ID NOs: 1-13) and fourteen Chilean (SCL) (SEQ ID NOs: 14-27) isolates. -
FIG. 2 illustrates the nucleotide sequence (SEQ ID NO: 28) of a 2,950 bp KpnI/SphI insert encoding the HAV P1/2A precursor. -
FIG. 3 illustrates the nucleotide sequence (SEQ ID NO: 29) of a 6,696 bp KpnI/SphI insert encoding the full length HAV open reading frame. -
FIG. 4 illustrates the nucleotide sequence (SEQ ID NO: 30) of a 6,757 bp KpnI/SphI insert encoding the full length HAV open reading frame plus additional 3′ untranslated sequences. -
FIG. 5 illustrates the nucleotide (SEQ ID NO: 31) and corresponding amino acid sequence (SEQ ID NO: 40) of the recombinant protein of 94 kDa consisting of the capsid precursor P1 (methionine1- glutamic acid791) plus the 45 amino terminal residues of the nonstructural protein 2A (serine792 through glutamine836). -
FIG. 6 illustrates the nucleotide (SEQ ID NO: 32) and corresponding amino acid sequence (SEQ ID NO: 41) of the recombinant protein of 115.5 kDa consisting of precursor P1 (Met1-Glu791) fused with the nonstructural protein 2A (Ser792-Gln980). -
FIG. 7 illustrates the nucleotide (SEQ ID NO: 33) and corresponding amino acid sequence (SEQ ID NO: 42) of the recombinant protein of 25 kDa (Asp24-Gln245) representing HAV capsid protein 1B (VP2 gene product). -
FIG. 8 illustrates the nucleotide (SEQ ID NO: 34) and corresponding amino acid sequence (SEQ ID NO: 43) of the recombinant protein of 28 kDa (Met246-Gln491) representing HAV capsid protein 1C (VP3 gene product). -
FIG. 9 illustrates the nucleotide (SEQ ID NO: 35) and corresponding amino acid sequence (SEQ ID NO: 44) of the recombinant protein of 33.3 kDa (Val492-Glu791) representing HAV capsid protein 1D (VP1 gene product). -
FIG. 10 illustrates the nucleotide (SEQ ID NO: 36) and corresponding amino acid sequence (SEQ ID NO: 45) of the recombinant protein of 38.8 kDa consisting of human superoxide dismutase (153 aminoacids) fused with the HAV nonstructural protein 2A (Ser792-Gln980). -
FIG. 11 illustrates the nucleotide (SEQ ID NO: 37) and corresponding amino acid sequence (SEQ ID NO: 46) of the recombinant protein of 24.9 kDa consisting of human superoxide dismutase (153 amino acids) fused with the HAV nonstructural protein 3A (Gly1423-Glu1496). -
FIG. 12 illustrates the nucleotide (SEQ ID NO: 38) and corresponding amino acid sequence (SEQ ID NO: 47) of the recombinant protein of 41 kDa consisting of human superoxide dismutase (153 amino acids) fused with the HAV nonstructural protein 3C (Protease: Ser1520-Gln1678). -
FIG. 13 illustrates the nucleotide (SEQ ID NO: 39) and corresponding amino acid sequence (SEQ ID NO: 48) of the recombinant protein of human superoxide dismutase (153 amino acids) fused with the HAV nonstructural protein 3D (RNA dependent RNA polymerase: Arg1739-Ser2227). - The practice of the present invention will employ, unless otherwise indicated, conventional methods of chemistry, biochemistry, recombinant DNA techniques and virology, within the skill of the art. Such techniques are explained fully in the literature. See, e.g., Fundamental Virology, 2nd Edition, vol. I & II (B. N. Fields and D. M. Knipe, eds.); A. L. Lehninger, Biochemistry (Worth Publishers, Inc., current addition); Sambrook, et al., Molecular Cloning: A Laboratory Manual (2nd Edition, 1989); Methods In Enzymology (S. Colowick and N. Kaplan eds., Academic Press, Inc.); Oligonucleotide Synthesis (N. Gait, ed., 1984); A Practical Guide to Molecular Cloning (1984).
- All publications, patents and patent applications cited herein, whether supra or infra, are hereby incorporated by reference in their entirety.
- It must be noted that, as used in this specification and the appended claims, the singular forms “a”, “an” and “the” include plural referents unless the content clearly dictates otherwise. Thus, for example, reference to “an antigen” includes a mixture of two or more antigens, and the like.
- The following amino acid abbreviations are used throughout the text:
-
- Alanine: Ala (A) Arginine: Arg (R)
- Asparagine: Asn (N) Aspartic acid: Asp (D)
- Cysteine: Cys (C) Glutamine: Gln (Q)
- Glutamic acid: Glu (E) Glycine: Gly (G)
- Histidine: His (H) Isoleucine: Ile (I)
- Leucine: Leu (L) Lysine: Lys (K)
- Methionine: Met (M) Phenylalanine: Phe (F)
- Proline: Pro (P) Serine: Ser (S)
- Threonine: Thr (T) Tryptophan: Trp (W)
- Tyrosine: Tyr (Y) Valine: Val (V)
- I. Definitions
- In describing the present invention, the following terms will be employed, and are intended to be defined as indicated below.
- The terms “polypeptide” and “protein” refer to a polymer of amino acid residues and are not limited to a minimum length of the product. Thus, peptides, oligopeptides, dimers, multimers, and the like, are included within the definition. Both full-length proteins and fragments thereof are encompassed by the definition. The terms also include postexpression modifications of the polypeptide, for example, glycosylation, acetylation, phosphorylation and the like. Furthermore, for purposes of the present invention, a “polypeptide” refers to a protein which includes modifications, such as deletions, additions and substitutions (generally conservative in nature), to the native sequence, so long as the protein maintains the desired activity. These modifications may be deliberate, as through site-directed mutagenesis, or may be accidental, such as through mutations of hosts which produce the proteins or errors due to PCR amplification.
- The terms “analog” and “mutein” refer to biologically active derivatives of the reference molecule, or fragments of such derivatives, that retain desired activity, such as immunoreactivity in diagnostic assays. In general, the term “analog” refers to compounds having a native polypeptide sequence and structure with one or more amino acid additions, substitutions (generally conservative in nature) and/or deletions, relative to the native molecule, so long as the modifications do not destroy immunogenic activity. The term “mutein” refers to peptides having one or more peptide mimics (“peptoids”), such as those described in International Publication No. WO 91/04282. Preferably, the analog or mutein has at least the same immunoactivity as the native molecule. Methods for making polypeptide analogs and muteins are known in the art and are described further below.
- Particularly preferred analogs include substitutions that are conservative in nature, i.e., those substitutions that take place within a family of amino acids that are related in their side chains. Specifically, amino acids are generally divided into four families: (1) acidic—aspartate and glutamate; (2) basic—lysine, arginine, histidine; (3) non-polar—alanine, valine, leucine, isoleucine, proline, phenylalanine, methionine, tryptophan; and (4) uncharged polar—glycine, asparagine, glutamine, cysteine, serine threonine, tyrosine. Phenylalanine, tryptophan, and tyrosine are sometimes classified as aromatic amino acids. For example, it is reasonably predictable that an isolated replacement of leucine with isoleucine or valine, an aspartate with a glutamate, a threonine with a serine, or a similar conservative replacement of an amino acid with a structurally related amino acid, will not have a major effect on the biological activity. For example, the polypeptide of interest may include up to about 5-10 conservative or non-conservative amino acid substitutions, or even up to about 15-25 conservative or non-conservative amino acid substitutions, or any integer between 5-25, so long as the desired function of the molecule remains intact. One of skill in the art may readily determine regions of the molecule of interest that can tolerate change by reference to Hopp/Woods and Kyte-Doolittle plots, well known in the art.
- By “isolated” is meant, when referring to a polypeptide, that the indicated molecule is separate and discrete from the whole organism with which the molecule is found in nature or is present in the substantial absence of other biological macro-molecules of the same type. The term “isolated” with respect to a polynucleotide is a nucleic acid molecule devoid, in whole or part, of sequences normally associated with it in nature; or a sequence, as it exists in nature, but having heterologous sequences in association therewith; or a molecule disassociated from the chromosome.
- A polynucleotide “derived from” or “specific for” a designated sequence refers to a polynucleotide sequence which comprises a contiguous sequence of approximately at least about 6 nucleotides, preferably at least about 8 nucleotides, more preferably at least about 10-12 nucleotides, and even more preferably at least about 15-20 nucleotides corresponding, i.e., identical or complementary to, a region of the designated nucleotide sequence. The derived polynucleotide will not necessarily be derived physically from the nucleotide sequence of interest, but may be generated in any manner, including, but not limited to, chemical synthesis, replication, reverse transcription or transcription, which is based on the information provided by the sequence of bases in the region(s) from which the polynucleotide is derived. As such, it may represent either a sense or an antisense orientation of the original polynucleotide.
- “Homology” refers to the percent similarity between two polynucleotide or two polypeptide moieties. Two DNA, or two polypeptide sequences are “substantially homologous” to each other when the sequences exhibit at least about 50%, preferably at least about 75%, more preferably at least about 80%-85%, preferably at least about 90%, and most preferably at least about 95%-98% sequence similarity over a defined length of the molecules. As used herein, substantially homologous also refers to sequences showing complete identity to the specified DNA or polypeptide sequence.
- In general, “identity” refers to an exact nucleotide-to-nucleotide or amino acid-to-amino acid correspondence of two polynucleotides or polypeptide sequences, respectively. Percent identity can be determined by a direct comparison of the sequence information between two molecules by aligning the sequences, counting the exact number of matches between the two aligned sequences, dividing by the length of the shorter sequence, and multiplying the result by 100.
- Readily available computer programs can be used to aid in the analysis of homology and identity, such as ALIGN, Dayhoff, M. O. in Atlas of Protein Sequence and Structure M. O. Dayhoff ed., 5 Suppl. 3:353-358, National biomedical Research Foundation, Washington, D.C., which adapts the local homology algorithm of Smith and Waterman Advances in Appl. Math. 2:482-489, 1981 for peptide analysis. Programs for determining nucleotide sequence homology are available in the Wisconsin Sequence Analysis Package, Version 8 (available from Genetics Computer Group, Madison, Wis.) for example, the BESTFIT, FASTA and GAP programs, which also rely on the Smith and Waterman algorithm. These programs are readily utilized with the default parameters recommended by the manufacturer and described in the Wisconsin Sequence Analysis Package referred to above. For example, percent homology of a particular nucleotide sequence to a reference sequence can be determined using the homology algorithm of Smith and Waterman with a default scoring table and a gap penalty of six nucleotide positions.
- Another method of establishing percent homology in the context of the present invention is to use the MPSRCH package of programs copyrighted by the University of Edinburgh, developed by John F. Collins and Shane S. Sturrok, and distributed by IntelliGenetics, Inc. (Mountain View, Calif.). From this suite of packages the Smith-Waterman algorithm can be employed where default parameters are used for the scoring table (for example, gap open penalty of 12, gap extension penalty of one, and a gap of six). From the data generated the “Match” value reflects “sequence homology.” Other suitable programs for calculating the percent identity or similarity between sequences are generally known in the art, for example, another alignment program is BLAST, used with default parameters. For example, BLASTN and BLASTP can be used using the following default parameters: genetic code=standard; filter=none; strand=both; cutoff=60; expect=10; Matrix=BLOSUM62; Descriptions=50 sequences; sort by=HIGH SCORE; Databases=non-redundant, GenBank+EMBL+DDBJ+PDB+GenBank CDS translations+Swiss protein+Spupdate+PIR. Details of these programs can be found at the following internet address: http://www.ncbi.nlm.gov/cgi-bin/BLAST.
- Alternatively, homology can be determined by hybridization of polynucleotides under conditions which form stable duplexes between homologous regions, followed by digestion with single-stranded-specific nuclease(s), and size determination of the digested fragments. DNA sequences that are substantially homologous can be identified in a Southern hybridization experiment under, for example, stringent conditions, as defined for that particular system. Defining appropriate hybridization conditions is within the skill of the art. See, e.g., Sambrook et al., supra; DNA Cloning, supra; Nucleic Acid Hybridization, supra.
- “Operably linked” refers to an arrangement of elements wherein the components so described are configured so as to perform their desired function. Thus, a given promoter operably linked to a nucleic acid sequence is capable of effecting the transcription, and in the case of a coding sequence, the expression of the coding sequence when the proper transcription factors, etc., are present. The promoter need not be contiguous with the nucleic acid sequence, so long as it functions to direct the transcription and/or expression thereof. Thus, for example, intervening untranslated yet transcribed sequences can be present between the promoter sequence and the coding sequence, as can transcribed introns, and the promoter sequence can still be considered “operably linked” to the coding sequence.
- “Recombinant” as used herein to describe a nucleic acid molecule means a polynucleotide of genomic, cDNA, viral, semisynthetic, or synthetic origin which, by virtue of its origin or manipulation is not associated with all or a portion of the polynucleotide with which it is associated in nature. The term “recombinant” as used with respect to a protein or polypeptide means a polypeptide produced by expression of a recombinant polynucleotide. In general, the gene of interest is cloned and then expressed in transformed organisms, as described further below. The host organism expresses the foreign gene to produce the protein under expression conditions.
- A “control element” refers to a polynucleotide sequence which aids in the transcription and/or translation of a nucleotide sequence to which it is linked. The term includes promoters, transcription termination sequences, upstream regulatory domains, polyadenylation signals, untranslated regions, including 5′-UTRs and 3′-UTRs and when appropriate, leader sequences and enhancers, which collectively provide for the transcription and translation of a coding sequence in a host cell.
- A “promoter” as used herein is a regulatory region capable of binding a polymerase and initiating transcription of a downstream (3′ direction) nucleotide sequence operably linked thereto. For purposes of the present invention, a promoter sequence includes the minimum number of bases or elements necessary to initiate transcription of a sequence of interest at levels detectable above background. Within the promoter sequence is a transcription initiation site, as well as protein binding domains (consensus sequences) responsible for the binding of RNA or DNA polymerase. For example, promoter may be a nucleic acid sequence that is recognized by a DNA-dependent RNA polymerase (“transcriptase”) as a signal to bind to the nucleic acid and begin the transcription of RNA at a specific site. For binding, such transcriptases generally require DNA which is double-stranded in the portion comprising the promoter sequence and its complement; the template portion (sequence to be transcribed) need not be double-stranded. Individual DNA-dependent RNA polymerases recognize a variety of different promoter sequences which can vary markedly in their efficiency in promoting transcription. When an RNA polymerase binds to a promoter sequence to initiate transcription, that promoter sequence is not part of the sequence transcribed. Thus, the RNA transcripts produced thereby will not include that sequence.
- A control sequence “directs the transcription” of a nucleotide sequence when RNA or DNA polymerase will bind the promoter sequence and transcribe the adjacent sequence.
- A “DNA-dependent DNA polymerase” is an enzyme that synthesizes a complementary DNA copy from a DNA template. Examples are DNA polymerase I from E. coli and bacteriophage T7 DNA polymerase. All known DNA-dependent DNA polymerases require a complementary primer to initiate synthesis. Under suitable conditions, a DNA-dependent DNA polymerase may synthesize a complementary DNA copy from an RNA template.
- A “DNA-dependent RNA polymerase” or a “transcriptase” is an enzyme that synthesizes multiple RNA copies from a double-stranded or partially-double stranded DNA molecule having a (usually double-stranded) promoter sequence. The RNA molecules (“transcripts”) are synthesized in the 5′ to 3′ direction beginning at a specific position just downstream of the promoter. Examples of transcriptases are the DNA-dependent RNA polymerase from E. coli and bacteriophages T7, T3, and SP6.
- An “RNA-dependent DNA polymerase” or “reverse transcriptase” is an enzyme that synthesizes a complementary DNA copy from an RNA template. All known reverse transcriptases also have the ability to make a complementary DNA copy from a DNA template; thus, they are both RNA- and DNA-dependent DNA polymerases. A primer is required to initiate synthesis with both RNA and DNA templates.
- “RNAse H” is an enzyme that degrades the RNA portion of an RNA:DNA duplex. These enzymes may be endonucleases or exonucleases. Most reverse transcriptase enzymes normally contain an RNAse H activity in addition to their polymerase activity. However, other sources of the RNAse H are available without an associated polymerase activity. The degradation may result in separation of RNA from a RNA:DNA complex. Alternatively, the RNAse H may simply cut the RNA at various locations such that portions of the RNA melt off or permit enzymes to unwind portions of the RNA.
- The terms “polynucleotide,” “oligonucleotide,” “nucleic acid” and “nucleic acid molecule” are used herein to include a polymeric form of nucleotides of any length, either ribonucleotides or deoxyribonucleotides. This term refers only to the primary structure of the molecule. Thus, the term includes triple-, double- and single-stranded DNA, as well as triple-, double- and single-stranded RNA. It also includes modifications, such as by methylation and/or by capping, and unmodified forms of the polynucleotide. More particularly, the terms “polynucleotide,” “oligonucleotide,” “nucleic acid” and “nucleic acid molecule” include polydeoxyribonucleotides (containing 2-deoxy-D-ribose), polyribonucleotides (containing D-ribose), any other type of polynucleotide which is an N- or C-glycoside of a purine or pyrimidine base, and other polymers containing nonnucleotidic backbones, for example, polyamide (e.g., peptide nucleic acids (PNAs)) and polymorpholino (commercially available from the Anti-Virals, Inc., Corvallis, Oreg., as Neugene) polymers, and other synthetic sequence-specific nucleic acid polymers providing that the polymers contain nucleobases in a configuration which allows for base pairing and base stacking, such as is found in DNA and RNA. There is no intended distinction in length between the terms “polynucleotide,” “oligonucleotide,” “nucleic acid” and “nucleic acid molecule,” and these terms will be used interchangeably. These terms refer only to the primary structure of the molecule. Thus, these terms include, for example, 3′-deoxy-2′,5′-DNA, oligodeoxyribonucleotide N3′ P5′ phosphoramidates, 2′-O-alkyl-substituted RNA, double- and single-stranded DNA, as well as double- and single-stranded RNA, DNA:RNA hybrids, and hybrids between PNAs and DNA or RNA, and also include known types of modifications, for example, labels which are known in the art, methylation, “caps,” substitution of one or more of the naturally occurring nucleotides with an analog, internucleotide modifications such as, for example, those with uncharged linkages (e.g., methyl phosphonates, phosphotriesters, phosphoramidates, carbamates, etc.), with negatively charged linkages (e.g., phosphorothioates, phosphorodithioates, etc.), and with positively charged linkages (e.g., aminoalklyphosphoramidates, aminoalkylphosphotriesters), those containing pendant moieties, such as, for example, proteins (including nucleases, toxins, antibodies, signal peptides, poly-L-lysine, etc.), those with intercalators (e.g., acridine, psoralen, etc.), those containing chelators (e.g., metals, radioactive metals, boron, oxidative metals, etc.), those containing alkylators, those with modified linkages (e.g., alpha anomeric nucleic acids, etc.), as well as unmodified forms of the polynucleotide or oligonucleotide. In particular, DNA is deoxyribonucleic acid.
- As used herein, the term “target nucleic acid region” or “target nucleic acid” denotes a nucleic acid molecule with a “target sequence” to be amplified. The target nucleic acid may be either single-stranded or double-stranded and may include other sequences besides the target sequence, which may not be amplified. The term “target sequence” refers to the particular nucleotide sequence of the target nucleic acid which is to be amplified. The target sequence may include a probe-hybridizing region contained within the target molecule with which a probe will form a stable hybrid under desired conditions. The “target sequence” may also include the complexing sequences to which the oligonucleotide primers complex and be extended using the target sequence as a template. Where the target nucleic acid is originally single-stranded, the term “target sequence” also refers to the sequence complementary to the “target sequence” as present in the target nucleic acid. If the “target nucleic acid” is originally double-stranded, the term “target sequence” refers to both the plus (+) and minus (−) strands.
- The term “primer” or “oligonucleotide primer” as used herein, refers to an oligonucleotide which acts to initiate synthesis of a complementary DNA strand when placed under conditions in which synthesis of a primer extension product is induced, i.e., in the presence of nucleotides and a polymerization-inducing agent such as a DNA or RNA polymerase and at suitable temperature, pH, metal concentration, and salt concentration. The primer is preferably single-stranded for maximum efficiency in amplification, but may alternatively be double-stranded. If double-stranded, the primer is first treated to separate its strands before being used to prepare extension products. This denaturation step is typically effected by heat, but may alternatively be carried out using alkali, followed by neutralization. Thus, a “primer” is complementary to a template, and complexes by hydrogen bonding or hybridization with the template to give a primer/template complex for initiation of synthesis by a polymerase, which is extended by the addition of covalently bonded bases linked at its 3′ end complementary to the template in the process of DNA synthesis.
- As used herein, the term “probe” or “oligonucleotide probe” refers to a structure comprised of a polynucleotide, as defined above, that contains a nucleic acid sequence complementary to a nucleic acid sequence present in the target nucleic acid analyte. The polynucleotide regions of probes may be composed of DNA, and/or RNA, and/or synthetic nucleotide analogs. When an “oligonucleotide probe” is to be used in a 5′ nuclease assay, such as the TaqMan™ technique, the probe will contain at least one fluorescer and at least one quencher which is digested by the 5′ endonuclease activity of a polymerase used in the reaction in order to detect any amplified target oligonucleotide sequences. In this context, the oligonucleotide probe will have a sufficient number of phosphodiester linkages adjacent to its 5′ end so that the 5′ to 3′ nuclease activity employed can efficiently degrade the bound probe to separate the fluorescers and quenchers. When an oligonucleotide probe is used in the TMA technique, it will be suitably labeled, as described below.
- It will be appreciated that the hybridizing sequences need not have perfect complementarity to provide stable hybrids. In many situations, stable hybrids will form where fewer than about 10% of the bases are mismatches, ignoring loops of four or more nucleotides. Accordingly, as used herein the term “complementary” refers to an oligonucleotide that forms a stable duplex with its “complement” under assay conditions, generally where there is about 90% or greater homology.
- The terms “hybridize” and “hybridization” refer to the formation of complexes between nucleotide sequences which are sufficiently complementary to form complexes via Watson-Crick base pairing. Where a primer “hybridizes” with target (template), such complexes (or hybrids) are sufficiently stable to serve the priming function required by, e.g., the DNA polymerase to initiate DNA synthesis.
- Stringent hybridization conditions will typically include salt concentrations of less than about 1M, more usually less than about 500 mM and preferably less than about 200 mM. Hybridization temperatures can be as low as 5° C., but are typically greater than 22° C., more typically greater than about 30° C., and preferably in excess of about 37° C. Longer fragments may require higher hybridization temperatures for specific hybridization. Other factors may affect the stringency of hybridization, including base composition and length of the complementary strands, presence of organic solvents and extent of base mismatching, and the combination of parameters used is more important than the absolute measure of any one alone. Other hybridization conditions which may be controlled include buffer type and concentration, solution pH, presence and concentration of blocking reagents to decrease background binding such as repeat sequences or blocking protein solutions, detergent type(s) and concentrations, molecules such as polymers which increase the relative concentration of the polynucleotides, metal ion(s) and their concentration(s), chelator(s) and their concentrations, and other conditions known in the art. Less stringent, and/or more physiological, hybridization conditions are used where a labeled polynucleotide amplification product cycles on and off a substrate linked to a complementary probe polynucleotide during a real-time assay which is monitored during PCR amplification such as a molecular beacon assay. Such less stringent hybridization conditions can also comprise solution conditions effective for other aspects of the method, for example reverse transcription or PCR.
- As used herein, the term “binding pair” refers to first and second molecules that specifically bind to each other, such as complementary polynucleotide pairs capable of forming nucleic acid duplexes. “Specific binding” of the first member of the binding pair to the second member of the binding pair in a sample is evidenced by the binding of the first member to the second member, or vice versa, with greater affinity and specificity than to other components in the sample. The binding between the members of the binding pair is typically noncovalent. Unless the context clearly indicates otherwise, the terms “affinity molecule” and “target analyte” are used herein to refer to first and second members of a binding pair, respectively.
- The terms “specific-binding molecule” and “affinity molecule” are used interchangeably herein and refer to a molecule that will selectively bind, through chemical or physical means to a detectable substance present in a sample. By “selectively bind” is meant that the molecule binds preferentially to the target of interest or binds with greater affinity to the target than to other molecules. For example, a DNA molecule will bind to a substantially complementary sequence and not to unrelated sequences.
- The “melting temperature” or “Tm” of double-stranded DNA is defined as the temperature at which half of the helical structure of DNA is lost due to heating or other dissociation of the hydrogen bonding between base pairs, for example, by acid or alkali treatment, or the like. The Tm of a DNA molecule depends on its length and on its base composition. DNA molecules rich in GC base pairs have a higher Tm than those having an abundance of AT base pairs. Separated complementary strands of DNA spontaneously reassociate or anneal to form duplex DNA when the temperature is lowered below the Tm. The highest rate of nucleic acid hybridization occurs approximately 25° C. below the Tm. The Tm may be estimated using the following relationship: Tm=69.3+0.41(GC) % (Marmur et al. (1962) J. Mol. Biol. 5:109-118).
- As used herein, a “biological sample” refers to a sample of tissue or fluid isolated from a subject, that commonly includes antibodies produced by the subject. Typical samples that include such antibodies are known in the art and include but not limited to, blood, plasma, serum, fecal matter, urine, bone marrow, bile, spinal fluid, lymph fluid, samples of the skin, secretions of the skin, respiratory, intestinal, and genitourinary tracts, tears, saliva, milk, blood cells, organs, biopsies and also samples of in vitro cell culture constituents including but not limited to conditioned media resulting from the growth of cells and tissues in culture medium, e.g., recombinant cells, and cell components.
- As used herein, the terms “label” and “detectable label” refer to a molecule capable of detection, including, but not limited to, radioactive isotopes, fluorescers, chemiluminescers, chromophores, enzymes, enzyme substrates, enzyme cofactors, enzyme inhibitors, chromophores, dyes, metal ions, metal sols, ligands (e.g., biotin, avidin, strepavidin or haptens) and the like. The term “fluorescer” refers to a substance or a portion thereof which is capable of exhibiting fluorescence in the detectable range.
- An “antigen” includes any substance that may be specifically bound by an antibody molecule. Thus, the term “antigen” encompasses biologic molecules including, but not limited to, simple intermediary metabolites, sugars, lipids, autoacids, and hormones, as well as macromolecules such as complex carbohydrates, phopholipids, nucleic acids and proteins.
- An “immunogen” is a macromolecular antigen that is capable of initiating lymphocyte activation resulting in an antigen-specific immune response. An immunogen therefore includes any molecule which contains one or more epitopes that will stimulate a host's immune system to initiate a secretory, humoral and/or cellular antigen-specific response.
- The term “antibody” encompasses polyclonal and monoclonal antibody preparations, as well as preparations including hybrid antibodies, altered antibodies, chimeric antibodies and, humanized antibodies, as well as: hybrid (chimeric) antibody molecules (see, for example, Winter et al (1991) Nature 349:293-299; and U.S. Pat. No. 4,816,567); F(ab′)2 and F(ab) fragments; Fv molecules (noncovalent heterodimers, see, for example, Inbar et al (1972) Proc Natl Acad Sci USA 69:2659-2662; and Ehrlich et al (1980) Biochem 19:4091-4096); single-chain Fv molecules (sFv) (see, e.g., Huston et al (1988) Proc Natl Acad Sci USA 85:5879-5883); dimeric and trimeric antibody fragment constructs; minibodies (see, e.g., Pack et al. (1992) Biochem 31:1579-1584; Cumber et al. (1992) J Immunology 149B: 120-126); humanized antibody molecules (see, e.g., Riechmann et al (1988) Nature 332:323-327; Verhoeyan et al. (1988) Science 239:1534-1536; and U.K. Patent Publication No. GB 2,276,169, published 21 September 1994); and, any functional fragments obtained from such molecules, wherein such fragments retain specific-binding properties of the parent antibody molecule.
- As used herein, the term “monoclonal antibody” refers to an antibody composition having a homogeneous antibody population. The term is not limited regarding the species or source of the antibody, nor is it intended to be limited by the manner in which it is made. The term encompasses whole immunoglobulins.
- Methods of making polyclonal and monoclonal antibodies are known in the art. Polyclonal antibodies are generated by immunizing a suitable animal, such as a mouse, rat, rabbit, sheep or goat, with an antigen of interest. In order to enhance immunogenicity, the antigen can be linked to a carrier prior to immunization. Suitable carriers are typically large, slowly metabolized macromolecules such as proteins, polysaccharides, polylactic acids, polyglycolic acids, polymeric amino acids, amino acid copolymers, lipid aggregates (such as oil droplets or liposomes), and inactive virus particles. Such carriers are well known to those of ordinary skill in the art. Furthermore, the antigen may be conjugated to a bacterial toxoid, such as toxoid from diphtheria, tetanus, cholera, etc., in order to enhance the immunogenicity thereof
- Rabbits, sheep and goats are preferred for the preparation of polyclonal sera when large volumes of sera are desired. These animals are good design choices also because of the availability of labeled anti-rabbit, anti-sheep and anti-goat antibodies. Immunization is generally performed by mixing or emulsifying the antigen in saline, preferably in an adjuvant such as Freund's complete adjuvant (“FCA”), and injecting the mixture or emulsion parenterally (generally subcutaneously or intramuscularly). The animal is generally boosted 2-6 weeks later with one or more injections of the antigen in saline, preferably using Freund's incomplete adjuvant (“FIA”). Antibodies may also be generated by in vitro immunization, using methods known in the art. Polyclonal antisera is then obtained from the immunized animal.
- Monoclonal antibodies are generally prepared using the method of Kohler and Milstein, Nature (1975) 256:495-497, or a modification thereof. Typically, a mouse or rat is immunized as described above. However, rather than bleeding the animal to extract serum, the spleen (and optionally several large lymph nodes) is removed and dissociated into single cells. If desired, the spleen cells may be screened (after removal of non-specifically adherent cells) by applying a cell suspension to a plate or well coated with the antigen. B-cells, expressing membrane-bound immunoglobulin specific for the antigen, will bind to the plate, and are not rinsed away with the rest of the suspension. Resulting B-cells, or all dissociated spleen cells, are then induced to fuse with myeloma cells to form hybridomas, and are cultured in a selective medium (e.g., hypoxanthine, aminopterin, thymidine medium, “HAT”). The resulting hybridomas are plated by limiting dilution, and are assayed for the production of antibodies which bind specifically to the immunizing antigen (and which do not bind to unrelated antigens). The selected monoclonal antibody-secreting hybridomas are then cultured either in vitro (e.g., in tissue culture bottles or hollow fiber reactors), or in vivo (e.g., as ascites in mice).
- II. Modes of Carrying Out the Invention
- Before describing the present invention in detail, it is to be understood that this invention is not limited to particular formulations or process parameters as such may, of course, vary. It is also to be understood that the terminology used herein is for the purpose of describing particular embodiments of the invention only, and is not intended to be limiting.
- Although a number of compositions and methods similar or equivalent to those described herein can be used in the practice of the present invention, the preferred materials and methods are described herein.
- As noted above, the present invention is based on the discovery of novel diagnostic methods for accurately detecting HAV infection in a biological sample. The methods rely on sensitive nucleic acid-based detection techniques that allow identification of HAV target nucleic acid sequences in samples containing small amounts of virus.
- In particular, the inventors herein have characterized regions within the HAV genome which are desirable targets for diagnostic tests. Primers and probes derived from these regions are extremely useful for detection of HAV infection in biological samples.
- HAV primers and probes described above are used in nucleic acid-based assays for the detection of HAV infection in biological samples. In particular, primers and probes for use in these assays are preferably derived from the nucleotide sequences depicted in
FIGS. 1-13 herein. - Particularly preferred primers and probes for use with the present assays are designed from HAV genome to allow detection of HAV infection caused by a variety of isolates.
- The four capsid proteins, nonstructural proteins, protease and the junction between the capsid precursor P1 and 2A are readily obtained from additional isolates using portions of the HAV sequence found within these particular regions as primers in PCR reactions such as those described herein. Another method of obtaining nucleotide sequences with the desired sequences is by annealing complementary sets of overlapping synthetic oligonucleotides produced in a conventional, automated polynucleotide synthesizer, followed by ligation with an appropriate DNA ligase and amplification of the ligated nucleotide sequence via PCR. See, e.g., Jayaraman et al. (1991) Proc. Natl. Acad. Sci. USA 88:4084-4088. Once the sequences have been prepared or isolated, they can be cloned into any suitable vector or replicon. Numerous cloning vectors are known to those of skill in the art, and the selection of an appropriate cloning vector is a matter of choice. Suitable vectors include, but are not limited to, plasmids, phages, transposons, cosmids, chromosomes or viruses which are capable of replication when associated with the proper control elements.
- Recombinant clones are readily identified by restriction enzyme analysis and polyacryamide or agarose gel electrophresis, using techniques well known in the art, and described in the examples below.
- Polynucleotides of the invention encoding the HAV proteins are useful for designing hybridization probes for isolating and identifying cDNA clones and genomic clones encoding HAV proteins or allelic forms thereof. Such hybridization techniques are known to those of skill in the art. Sequence of polynucleotides that encode HAV proteins are also useful for designing primers for polymerase chain reaction (PCR). Also encompassed by the present invention, are single stranded polynucleotides, hereinafter referred to as antisense polynucleotides, having sequences which are complementary to the RNA sequences which encode the HAV proteins.
- Primers and probes for use in the assays herein are derived from these sequences and are readily synthesized by standard techniques, e.g., solid phase synthesis via phosphoramidite chemistry, as disclosed in U.S. Pat. Nos. 4,458,066 and 4,415,732, incorporated herein by reference; Beaucage et al. (1992) Tetrahedron 48:2223-2311; and Applied Biosystems User Bulletin No. 13 (1 Apr. 1987). Other chemical synthesis methods include, for example, the phosphotriester method described by Narang et al., Meth. Enzymol. (1979) 68:90 and the phosphodiester method disclosed by Brown et al., Meth. Enzymol. (1979) 68:109. Poly(A) or poly(C), or other non-complementary nucleotide extensions may be incorporated into probes using these same methods. Hexaethylene oxide extensions may be coupled to probes by methods known in the art. Cload et al. (1991) J. Am. Chem. Soc. 113:6324-6326; U.S. Pat. No. 4,914,210 to Levenson et al.; Durand et al. (1990) Nucleic Acids Res. 18:6353-6359; and Horn et al. (1986) Tet. Lett. 27:4705-4708. Typically, the primer sequences are in the range of between 10-100 nucleotides in length, such as 15-60, 20-40 and so on, more typically in the range of between 20-40 nucleotides long, and any length between the stated ranges. The typical probe is in the range of between 10-100 nucleotides long, such as 10-50, 15-40, 18-30, and so on, and any length between the stated ranges.
- Thus, one aspect of the invention encompasses oligonucleotides that are used as primers and probes in polymerase chain reaction (PCR) technologies to amplify transcripts of the genes which encode HAV proteins or portions of such transcripts. Preferably, the primers have a G+C content of 40% or greater. Such oligonucleotides are at least 80% complementary with a sequence of SEQ ID NOs: 1-39. Preferably, the primers and probes are at least 85% complementarity, 90% complementary, 95% complementary or more preferably 98% or 99% complementary with the sense strand or its corresponding antisense strand of SEQ ID NOs: 1-39.
- Moreover, the probes may be coupled to labels for detection. There are several means known for derivatizing oligonucleotides with reactive functionalities which permit the addition of a label. For example, several approaches are available for biotinylating probes so that radioactive, fluorescent, chemiluminescent, enzymatic, or electron dense labels can be attached via avidin. See, e.g., Broken et al. (1978) Nucl. Acids Res. 5:363-384 which discloses the use of ferritin-avidin-biotin labels; and Chollet et al. (1985) Nucl. Acids Res. 13:1529-1541 which discloses biotinylation of the 5′ termini of oligonucleotides via an aminoalkylphosphoramide linker arm. Several methods are also available for synthesizing amino-derivatized oligonucleotides which are readily labeled by fluorescent or other types of compounds derivatized by amino-reactive groups, such as isothiocyanate, N-hydroxysuccinimide, or the like, see, e.g., Connolly (1987) Nucl. Acids Res. 15:3131-3139, Gibson et al. (1987) Nucl. Acids Res. 15:6455-6467 and U.S. Pat. No. 4,605,735 to Miyoshi et al. Methods are also available for synthesizing sulfhydryl-derivatized oligonucleotides which can be reacted with thiol-specific labels, see, e.g., U.S. Pat. No. 4,757,141 to Fung et al., Connolly et al. (1985) Nucl. Acids Res. 13:4485-4502 and Spoat et al. (1987) Nucl. Acids Res. 15:4837-4848. A comprehensive review of methodologies for labeling DNA fragments is provided in Matthews et al., Anal. Biochem. (1988) 169:1-25.
- For example, probes may be fluorescently labeled by linking a fluorescent molecule to the non-ligating terminus of the probe. Guidance for selecting appropriate fluorescent labels can be found in Smith et al., Meth. Enzymol. (1987) 155:260-301; Karger et al., Nucl. Acids Res. (1991) 19:4955-4962; Haugland (1989) Handbook of Fluorescent Probes and Research Chemicals (Molecular Probes, Inc., Eugene, Oreg.). Preferred fluorescent labels include fluorescein and derivatives thereof, such as disclosed in U.S. Pat. No. 4,318,846 and Lee et al., Cytometry (1989) 10:151-164. Dyes for use in the present invention include 3-pheniyl-7-isocyanatocoumarini, acridines, such as 9-isothiocyanatoacridine and acridine orange, pyrenes, benzoxadiazoles, and stilbenes. Additional dyes include 3-(ε-carboxypentyl)-3′-ethyl-5,5′-dimethyloxa-carbocyanine (CYA); 6-carboxy fluorescein (FAM); 5,6-carboxyrhodamine-1 10 (R110); 6-carboxyrhodamine-6G (R6G); N′,N′,N′,N′-tetramethyl-6-carboxyrhodamine (TAMRA); 6-carboxy-X-rhodamine (ROX); 2′,4′,5′,7′,-tetrachloro-4-7-dichlorofluorescein (TET); 2′,7′-dimethoxy-4′,5′-6 carboxyrhodamine (JOE); 6-carboxy-2′,4,4′,5′,7,7′-hexachlorofluorescein (HEX); ALEXA; Cy3 and Cy5. These dyes are commercially available from various suppliers such as Applied Biosystems Division of Perkin Elmer Corporation (Foster City, Calif.), and Molecular Probes, Inc. (Eugene, Oreg.). Preferred fluorescent labels include fluorescein and derivatives thereof, such as disclosed in U.S. Pat. No. 4,318,846 and Lee et al., Cytometry (1989) 10:151-164, and 6-FAM, JOE, TAMRA, ROX, HEX-1, HEX-2, ZOE, TET-1 or NAN-2, and the like.
- Additionally, probes can be labeled with an acridinium ester (AE) using the techniques described below. Current technologies allow the AE label to be placed at any location within the probe. See, e.g., Nelson et al. (1995) “Detection of Acridinium Esters by Chemiluminescence” in Nonisotopic Probing, Blotting and Sequencing, Kricka L. J. (ed) Academic Press, San Diego, Calif.; Nelson et al. (1994) “Application of the Hybridization Protection Assay (HPA) to PCR” in The Polymerase Chain Reaction, Mullis et al. (eds.) Birkhauser, Boston, Mass.; Weeks et al., Clin. Chem. (1983) 29:1474-1479; Berry et al., Clin. Chem. (1988) 34:2087-2090. An AE molecule can be directly attached to the probe using non-nucleotide-based linker arm chemistry that allows placement of the label at any location within the probe. See, e.g., U.S. Pat. Nos. 5,585,481 and 5,185,439.
- The primers and probes described above may be used in polymerase chain reaction (PCR)-based techniques to detect HAV infection in biological samples. PCR is a technique for amplifying a desired target nucleic acid sequence contained in a nucleic acid molecule or mixture of molecules. In PCR, a pair of primers is employed in excess to hybridize to the complementary strands of the target nucleic acid. The primers are each extended by a polymerase using the target nucleic acid as a template. The extension products become target sequences themselves after dissociation from the original target strand. New primers are then hybridized and extended by a polymerase, and the cycle is repeated to geometrically increase the number of target sequence molecules. The PCR method for amplifying target nucleic acid sequences in a sample is well known in the art and has been described in, e.g., Innis et al. (eds.) PCR Protocols (Academic Press, NY 1990); Taylor (1991) Polymerase chain reaction: basic principles and automation, in PCR: A Practical Approach, McPherson et al. (eds.) IRL Press, Oxford; Saiki et al. (1986) Nature 324:163; as well as in U.S. Pat. Nos. 4,683,195, 4,683,202 and 4,889,818, all incorporated herein by reference in their entireties.
- In particular, PCR uses relatively short oligonucleotide primers which flank the target nucleotide sequence to be amplified, oriented such that their 3′ ends face each other, each primer extending toward the other. The polynucleotide sample is extracted and denatured, preferably by heat, and hybridized with first and second primers which are present in molar excess. Polymerization is catalyzed in the presence of the four deoxyribonucleotide triphosphates (dNTPs—dATP, dGTP, dCTP and dTTP) using a primer- and template-dependent polynucleotide polymerizing agent, such as any enzyme capable of producing primer extension products, for example, E. coli DNA polymerase I, Klenow fragment of DNA polymerase I, T4 DNA polymerase, thermostable DNA polymerases isolated from Thermus aquaticus (Taq), available from a variety of sources (for example, Perkin Elmer), Thermus thermophilus (United States Biochemicals), Bacillus stereothermophilus (Bio-Rad), or Thermococcus litoralis (“Vent” polymerase, New England Biolabs). This results in two “long products” which contain the respective primers at their 5′ ends covalently linked to the newly synthesized complements of the original strands. The reaction mixture is then returned to polymerizing conditions, e.g., by lowering the temperature, inactivating a denaturing agent, or adding more polymerase, and a second cycle is initiated. The second cycle provides the two original strands, the two long products from the first cycle, two new long products replicated from the original strands, and two “short products” replicated from the long products. The short products have the sequence of the target sequence with a primer at each end. On each additional cycle, an additional two long products are produced, and a number of short products equal to the number of long and short products remaining at the end of the previous cycle. Thus, the number of short products containing the target sequence grow exponentially with each cycle. Preferably, PCR is carried out with a commercially available thermal cycler, e.g., Perkin Elmer.
- RNAs may be amplified by reverse transcribing the mRNA into cDNA, and then performing PCR (RT-PCR), as described above. Alternatively, a single enzyme may be used for both steps as described in U.S. Pat. No. 5,322,770. mRNA may also be reverse transcribed into cDNA, followed by asymmetric gap ligase chain reaction (RT-AGLCR) as described by Marshall et al. (1994) PCR Meth. App. 4:80-84.
- The fluorogenic 5′ nuclease assay, known as the TaqMan™ assay (Perkin-Elmer), is a powerful and versatile PCR-based detection system for nucleic acid targets. Hence, primers and probes derived from regions of the HAV genome described herein can be used in TaqMan™ analyses to detect the presence of infection in a biological sample. Analysis is performed in conjunction with thermal cycling by monitoring the generation of fluorescence signals. The assay system dispenses with the need for gel electrophoretic analysis, and has the capability to generate quantitative data allowing the determination of target copy numbers.
- The fluorogenic 5′ nuclease assay is conveniently performed using, for example, AmpliTaq Gold™ DNA polymerase, which has endogenous 5′ nuclease activity, to digest an internal oligonucleotide probe labeled with both a fluorescent reporter dye and a quencher (see, Holland et al. (1991) Proc. Natl. Acad.Sci. USA 88:7276-7280; and Lee et al. (1993) Nucl. Acids Res. 21:3761-3766). Assay results are detected by measuring changes in fluorescence that occur during the amplification cycle as the fluorescent probe is digested, uncoupling the dye and quencher labels and causing an increase in the fluorescent signal that is proportional to the amplification of target DNA.
- The amplification products can be detected in solution or using solid supports. In this method, the TaqMan™ probe is designed to hybridize to a target sequence within the desired PCR product. The 5′ end of the TaqMan™ probe contains a fluorescent reporter dye. The 3′ end of the probe is blocked to prevent probe extension and contains a dye that will quench the fluorescence of the 5′ fluorophore. During subsequent amplification, the 5′ fluorescent label is cleaved off if a polymerase with 5′ exonuclease activity is present in the reaction. Excision of the 5′ fluorophore results in an increase in fluorescence which can be detected.
- In particular, the oligonucleotide probe is constructed such that the probe exists in at least one single-stranded conformation when unhybridized where the quencher molecule is near enough to the reporter molecule to quench the fluorescence of the reporter molecule. The oligonucleotide probe also exists in at least one conformation when hybridized to a target polynucleotide such that the quencher molecule is not positioned close enough to the reporter molecule to quench the fluorescence of the reporter molecule. By adopting these hybridized and unhybridized conformations, the reporter molecule and quencher molecule on the probe exhibit different fluorescence signal intensities when the probe is hybridized and unhybridized. As a result, it is possible to determine whether the probe is hybridized or unhybridized based on a change in the fluorescence intensity of the reporter molecule, the quencher molecule, or a combination thereof. In addition, because the probe can be designed such that the quencher molecule quenches the reporter molecule when the probe is not hybridized, the probe can be designed such that the reporter molecule exhibits limited fluorescence unless the probe is either hybridized or digested.
- Accordingly, the present invention relates to methods for amplifying a target HAV nucleotide sequence using a nucleic acid polymerase having 5′ to 3′ nuclease activity, one or more primers capable of hybridizing to the target HAV sequence, and an oligonucleotide probe capable of hybridizing to the
target HAV sequence 3′ relative to the primer. During amplification, the polymerase digests the oligonucleotide probe when it is hybridized to the target sequence, thereby separating the reporter molecule from the quencher molecule. As the amplification is conducted, the fluorescence of the reporter molecule is monitored, with fluorescence corresponding to the occurrence of nucleic acid amplification. The reporter molecule is preferably a fluorescein dye and the quencher molecule is preferably a rhodamine dye. - While the length of the primers and probes can vary, the probe sequences are selected such that they have a lower melt temperature than the primer sequences. Hence, the primer sequences are generally longer than the probe sequences. Typically, the primer sequences are in the range of between 10-75 nucleotides long, more typically in the range of 20-45. The typical probe is in the range of between 10-50 nucleotides long, more typically 15-40 nucleotides in length.
- If a solid support is used, the oligonucleotide probe may be attached to the solid support in a variety of manners. For example, the probe may be attached to the solid support by attachment of the 3′ or 5′ terminal nucleotide of the probe to the solid support. More preferably, the probe is attached to the solid support by a linker which serves to distance the probe from the solid support. The linker is usually at least 15-30 atoms in length, more preferably at least 15-50 atoms in length. The required length of the linker will depend on the particular solid support used. For example, a six atom linker is generally sufficient when high cross-linked polystyrene is used as the solid support.
- A wide variety of linkers are known in the art which may be used to attach the oligonucleotide probe to the solid support. The linker may be formed of any compound which does not significantly interfere with the hybridization of the target sequence to the probe attached to the solid support. The linker may be formed of a homopolymeric oligonucleotide which can be readily added on to the linker by automated synthesis. Alternatively, polymers such as functionalized polyethylene glycol can be used as the linker. Such polymers are preferred over homopolymeric oligonucleotides because they do not significantly interfere with the hybridization of probe to the target oligonucleotide. Polyethylene glycol is particularly preferred.
- The linkages between the solid support, the linker and the probe are preferably not cleaved during removal of base protecting groups under basic conditions at high temperature. Examples of preferred linkages include carbamate and amide linkages.
- Examples of preferred types of solid supports for immobilization of the oligonucleotide probe include controlled pore glass, glass plates, polystyrene, avidin-coated polystyrene beads, cellulose, nylon, acrylamide gel and activated dextran.
- For a detailed description of the TaqMan™ assay, reagents and conditions for use therein, see, e.g., Holland et al. (1991) Proc. Natl. Acad. Sci, U.S.A. 88:7276-7280; U.S. Pat. Nos. 5,538,848, 5,723,591, and 5,876,930, all incorporated herein by reference in their entireties.
- The HAV sequences described herein may also be used as a basis for transcription-mediated amplification (TMA) assays. TMA provides a method of identifying target nucleic acid sequences present in very small amounts in a biological sample. Such sequences may be difficult or impossible to detect using direct assay methods. In particular, TMA is an isothemal, autocatalytic nucleic acid target amplification system that can provide more than a billion RNA copies of a target sequence. The assay can be done qualitatively, to accurately detect the presence or absence of the target sequence in a biological sample. The assay can also provide a quantitative measure of the amount of target sequence over a concentration range of several orders of magnitude. TMA provides a method for autocatalytically synthesizing multiple copies of a target nucleic acid sequence without repetitive manipulation of reaction conditions such as temperature, ionic strength and pH.
- Generally, TMA includes the following steps: (a) isolating nucleic acid, including RNA, from the biological sample of interest suspected of being infected with HAV; and (b) combining into a reaction mixture (i) the isolated nucleic acid, (ii) first and second oligonucleotide primers, the first primer having a complexing sequence sufficiently complementary to the 3′ terminal portion of an RNA target sequence, if present (for example the (+) strand), to complex therewith, and the second primer having a complexing sequence sufficiently complementary to the 3′ terminal portion of the target sequence of its complement (for example, the (−) strand) to complex therewith, wherein the first oligonucleotide further comprises a sequence 5′ to the complexing sequence which includes a promoter, (iii) a reverse transcriptase or RNA and DNA dependent DNA polymerases, (iv) an enzyme activity which selectively degrades the RNA strand of an RNA-DNA complex (such as an RNAse H) and (v) an RNA polymerase which recognizes the promoter.
- The components of the reaction mixture may be combined stepwise or at once. The reaction mixture is incubated under conditions whereby an oligonucleotide/target sequence is formed, including DNA priming and nucleic acid synthesizing conditions (including ribonucleotide triphosphates and deoxyribonucleotide triphosphates) for a period of time sufficient to provide multiple copies of the target sequence. The reaction advantageously takes place under conditions suitable for maintaining the stability of reaction components such as the component enzymes and without requiring modification or manipulation of reaction conditions during the course of the amplification reaction. Accordingly, the reaction may take place under conditions that are substantially isothermal and include substantially constant ionic strength and pH. The reaction conveniently does not require a denaturation step to separate the RNA-DNA complex produced by the first DNA extension reaction.
- Suitable DNA polymerases include reverse transcriptases, such as avian myeloblastosis virus (AMV) reverse transcriptase (available from, e.g., Seikagaku America, Inc.) and Moloney murine leukemia virus (MMLV) reverse transcriptase (available from, e.g., Bethesda Research Laboratories).
- Promoters or promoter sequences suitable for incorporation in the primers are nucleic acid sequences (either naturally occurring, produced synthetically or a product of a restriction digest) that are specifically recognized by an RNA polymerase that recognizes and binds to that sequence and initiates the process of transcription whereby RNA transcripts are produced. The sequence may optionally include nucleotide bases extending beyond the actual recognition site for the RNA polymerase which may impart added stability or susceptibility to degradation processes or increased transcription efficiency. Examples of useful promoters include those which are recognized by certain bacteriophage polymerases such as those from bacteriophage T3, T7 or SP6, or a promoter from E. coli. These RNA polymerases are readily available from commercial sources, such as New England Biolabs and Epicentre.
- Some of the reverse transcriptases suitable for use in the methods herein have an RNAse H activity, such as AMV reverse transcriptase. It may, however, be preferable to add exogenous RNAse H, such as E. coli RNAse H, even when AMV reverse transcriptase is used. RNAse H is readily available from, e.g., Bethesda Research Laboratories.
- The RNA transcripts produced by these methods may serve as templates to produce additional copies of the target sequence through the above-described mechanisms. The system is autocatalytic and amplification occurs autocatalytically without the need for repeatedly modifying or changing reaction conditions such as temperature, pH, ionic strength or the like.
- Detection may be done using a wide variety of methods, including direct sequencing, hybridization with sequence-specific oligomers, gel electrophoresis and mass spectrometry. these methods can use heterogeneous or homogeneous formats, isotopic or nonisotopic labels, as well as no labels at all.
- One preferable method of detection is the use of target sequence-specific oligonucleotide probes, derived from the sequences described in
FIGS. 1-13 and fragments thereof. The probes may be used in hybridization protection assays (HPA). In this embodiment, the probes are conveniently labeled with acridinium ester (AE), a highly chemiluminescent molecule. See, e.g., Nelson et al. (1995) “Detection of Acridinium Esters by Chemiluminescence” in Nonisotopic Probing, Blotting and Sequencing, Kricka L. J. (ed) Academic Press, San Diego, Calif.; Nelson et al. (1994) “Application of the Hybridization Protection Assay (HPA) to PCR” in The Polymerase Chain Reaction, Mullis et al. (eds.) Birkhauser, Boston, Mass.; Weeks et al. (1983) Clin. Chem. 29:1474-1479; Berry et al. (1988) Clin. Chem. 34:2087-2090. One AE molecule is directly attached to the probe using a non-nucleotide-based linker arm chemistry that allows placement of the label at any location within the probe. See, e.g., U.S. Pat. Nos. 5,585,481 and 5,185,439. Chemiluminescence is triggered by reaction with alkaline hydrogen peroxide which yields an excited N-methyl acridone that subsequently collapses to ground state with the emission of a photon. Additionally, AE causes ester hydrolysis which yields the nonchemiluminescent-methyl acridinium carboxylic acid. - When the AE molecule is covalently attached to a nucleic acid probe, hydrolysis is rapid under mildly alkaline conditions. When the AE-labeled probe is exactly complementary to the target nucleic acid, the rate of AE hydrolysis is greatly reduced. Thus, hybridized and unhybridized AE-labeled probe can be detected directly in solution, without the need for physical separation.
- HPA generally consists of the following steps: (a) the AE-labeled probe is hybridized with the target nucleic acid in solution for about 15 to about 30 minutes. A mild alkaline solution is then added and AE coupled to the unhybridized probe is hydrolyzed. This reaction takes approximately 5 to 10 minutes. The remaining hybrid-associated AE is detected as a measure of the amount of target present. This step takes approximately 2 to 5 seconds. Preferably, the differential hydrolysis step is conducted at the same temperature as the hybridization step, typically at 50 to 70° C. Alternatively, a second differential hydrolysis step may be conducted at room temperature. This allows elevated pHs to be used, for example in the range of 10-11, which yields larger differences in the rate of hydrolysis between hybridized and unhybridized AE-labeled probe. HPA is described in detail in, e.g., U.S. Pat. Nos. 6,004,745; 5,948,899; and 5,283,174, the disclosures of which are incorporated by reference herein in their entireties.
- TMA is described in detail in, e.g., U.S. Pat. No.5,399,491, the disclosure of which is incorporated herein by reference in its entirety. In one example of a typical assay, an isolated nucleic acid sample, suspected of containing a HAV target sequence, is mixed with a buffer concentrate containing the buffer, salts, magnesium, nucleotide triphosphates, primers, dithiothreitol, and spermidine. The reaction is optionally incubated at about 100° C. for approximately two minutes to denature any secondary structure. After cooling to room temperature, reverse transcriptase, RNA polymerase, and RNAse H are added and the mixture is incubated for two to four hours at 37° C. The reaction can then be assayed by denaturing the product, adding a probe solution, incubating 20 minutes at 60° C., adding a solution to selectively hydrolyze the unhybridized probe, incubating the reaction six minutes at 60° C., and measuring the remaining chemiluminescence in a luminometer.
- As is readily apparent, design of the assays described herein are subject to a great deal of variation, and many formats are known in the art. The above descriptions are merely provided as guidance and one of skill in the art can readily modify the described protocols, using techniques well known in the art.
- The above-described assay reagents, including the primers, probes, solid support with bound probes, as well as other detection reagents, can be provided in kits, with suitable instructions and other necessary reagents, in order to conduct the assays as described above. The kit will normally contain in separate containers the combination of primers and probes (either already bound to a solid matrix or separate with reagents for binding them to the matrix), control formulations (positive and/or negative), labeled reagents when the assay format requires same and signal generating reagents (e.g., enzyme substrate) if the label does not generate a signal directly. Instructions (e.g., written, tape, VCR, CD-ROM, etc.) for carrying out the assay usually will be included in the kit. The kit can also contain, depending on the particular assay used, other packaged reagents and materials (i.e. wash buffers and the like). Standard assays, such as those described above, can be conducted using these kits.
- Recombinant or synthetic HAV polypeptides can be used as diagnostics, or those which produce an immunological response, such as those that give rise to neutralizing antibodies, may be formulated into vaccines. Antibodies raised against these polypeptides can also be used as diagnostics, or for passive immunotherapy. In addition, antibodies to these polypeptides are useful for isolating and identifying HAV particles. The HAV antigens may also be isolated from HAV virions. The virions may be grown in HAV infected cells in tissue culture, or in an infected host.
- Particularly, the antibodies may be polyclonal or monoclonal, may be a human antibody, or may be a hybrid or chimeric antibody, such as a humanized antibody, an altered antibody, F(ab′)2 fragments, F(ab) fragments, Fv fragments, a single-domain antibody, a dimeric or trimeric antibody fragment construct, a minibody, or functional fragments thereof which bind to the analyte of interest. Antibodies are produced using techniques well known to those of skill in the art and disclosed in, for example, U.S. Pat. Nos. 4,011,308; 4,722,890; 4,016,043; 3,876,504; 3,770,380; and 4,372,745.
- For example, polyclonal antibodies are generated by immunizing a suitable animal, such as a mouse, rat, rabbit, sheep or goat, with an antigen of interest. In order to enhance immunogenicity, the antigen can be linked to a carrier prior to immunization. Such carriers are well known to those of ordinary skill in the art. Immunization is generally performed by mixing or emulsifying the antigen in saline, preferably in an adjuvant such as Freund's complete adjuvant, and injecting the mixture or emulsion parenterally (generally subcutaneously or intramuscularly). The animal is generally boosted 2-6 weeks later with one or more injections of the antigen in saline, preferably using Freund's incomplete adjuvant. Antibodies may also be generated by in vitro immunization, using methods known in the art. Polyclonal antiserum is then obtained from the immunized animal.
- Monoclonal antibodies are generally prepared using the method of Kohler and Milstein (1975) Nature 256:495-497, or a modification thereof. Typically, a mouse or rat is immunized as described above. However, rather than bleeding the animal to extract serum, the spleen (and optionally several large lymph nodes) is removed and dissociated into single cells. If desired, the spleen cells may be screened (after removal of nonspecifically adherent cells) by applying a cell suspension to a plate or well coated with the antigen. B-cells, expressing membrane-bound immunoglobulin specific for the antigen, will bind to the plate, and are not rinsed away with the rest of the suspension. Resulting B-cells, or all dissociated spleen cells, are then induced to fuse with myeloma cells to form hybridomas, and are cultured in a selective medium (e.g., hypoxanthine, aminopterin, thymidine medium, “HAT”). The resulting hybridomas are plated by limiting dilution, and are assayed for the production of antibodies which bind specifically to the immunizing antigen (and which do not bind to unrelated antigens). The selected monoclonal antibody-secreting hybridomas are then cultured either in vitro (e.g., in tissue culture bottles or hollow fiber reactors), or in vivo (e.g., as ascites in mice). Human monoclonal antibodies are obtained by using human rather than murine hybridomas. See, e.g., Cote, et al. Monclonal Antibodies and Cancer Therapy, Alan R. Liss, 1985, p. 77
- Monoclonal antibodies or portions thereof may be identified by first screening a B-cell cDNA library for DNA molecules that encode antibodies that specifically bind to p185, according to the method generally set forth by Huse et al. (1989) Science 246:1275-1281. The DNA molecule may then be cloned and amplified to obtain sequences that encode the antibody (or binding domain) of the desired specificity.
- As explained above, antibody fragments which retain the ability to recognize the molecule of interest, will also find use in the subject invention. A number of antibody fragments are known in the art which comprise antigen-binding sites capable of exhibiting immunological binding properties of an intact antibody molecule. For example, functional antibody fragments can be produced by cleaving a constant region, not responsible for antigen binding, from the antibody molecule, using e.g., pepsin, to produce F(ab′)2 fragments. These fragments will contain two antigen binding sites, but lack a portion of the constant region from each of the heavy chains. Similarly, if desired, Fab fragments, comprising a single antigen binding site, can be produced, e.g., by digestion of polyclonal or monoclonal antibodies with papain. Functional fragments, including only the variable regions of the heavy and light chains, can also be produced, using standard techniques such as recombinant production or preferential proteolytic cleavage of immunoglobulin molecules. These fragments are known as Fv. See, e.g., Inbar et al. (1972) Proc. Nat. Acad. Sci. USA 69:2659-2662; Hochman et al. (1976) Biochem 15:2706-2710; and Ehrlich et al. (1980) Biochem 19:4091-4096.
- A single-chain Fv (“sFv” or “scFv”) polypeptide is a covalently linked VH-VL heterodimer which is expressed from a gene fusion including VH- and VL-encoding genes linked by a peptide-encoding linker. Huston et al. (1988) Proc. Nat. Acad. Sci. USA 85:5879-5883. A number of methods have been described to discern and develop chemical structures (linkers) for converting the naturally aggregated, but chemically separated, light and heavy polypeptide chains from an antibody V region into an sFv molecule which will fold into a three dimensional structure substantially similar to the structure of an antigen-binding site. See, e.g., U.S. Pat. Nos. 5,091,513, 5,132,405 and 4,946,778. The sFv molecules may be produced using methods described in the art. See, e.g., Huston et al. (1988) Proc. Nat. Acad. Sci. USA 85:5879-5883; U.S. Pat. Nos. 5,091,513, 5,132,405 and 4,946,778. Design criteria include determining the appropriate length to span the distance between the C-terminus of one chain and the N-terminus of the other, wherein the linker is generally formed from small hydrophilic amino acid residues that do not tend to coil or form secondary structures. Such methods have been described in the art. See, e.g., U.S. Pat. Nos. 5,091,513, 5,132,405 and 4,946,778. Suitable linkers generally comprise polypeptide chains of alternating sets of glycine and serine residues, and may include glutamic acid and lysine residues inserted to enhance solubility.
- “Mini-antibodies” or “minibodies” will also find use with the present invention. Minibodies are sFv polypeptide chains which include oligomerization domains at their C-termini, separated from the sFv by a hinge region. Pack et al. (1992) Biochem 31:1579-1584. The oligomerization domain comprises self-associating α-helices, e.g., leucine zippers, that can be further stabilized by additional disulfide bonds. The oligomerization domain is designed to be compatible with vectorial folding across a membrane, a process thought to facilitate in vivo folding of the polypeptide into a functional binding protein. Generally, minibodies are produced using recombinant methods well known in the art. See, e.g., Pack et al. (1992) Biochem 31:1579-1584; Cumber et al. (1992) J Immunology 149B:120-126.
- While the polypeptides of the present invention may comprise a substantially complete viral domain, in many applications all that is required is that the polypeptide comprise an antigenic or immunogenic region of the virus. Thus, in one aspect of the invention, the polypeptides of SEQ ID Nos: 40-48 are used to elicit an immunological response. In another aspect of the invention, an immunological region of a polypeptide is generally relatively small—typically 8 to 10 amino acids or less in length. Fragments of as few as 5 amino acids may characterize an antigenic region. These segments may correspond to regions encoding for capsid proteins, nonstructural proteins, and the junction between the capsid precursor P1 and 2A. Accordingly, using the cDNAs of these regions as a basis, DNAs encoding short segments of these polypeptides can be expressed recombinantly either as fusion proteins, or as isolated polypeptides. In addition, short amino acid sequences can be conveniently obtained by chemical synthesis.
- In instances wherein the synthesized polypeptide is correctly configured so as to provide the correct epitope, but is too small to be immunogenic, the polypeptide may be linked to a suitable carrier. A number of techniques for obtaining such linkage are known in the art,. including the formation of disulfide linkages using N-succinimidyl-3-(2-pyridyl-thio)propionate (SPDP) and succinimidyl 4-(N-maleimido-methyl)cyclohexane-1-carboxylate (SMCC) obtained from Pierce Company, Rockford, Ill., (if the peptide lacks a sulfhydryl group, this can be provided by addition of a cysteine residue). These reagents create a disulfide linkage between themselves and peptide cysteine residues on one protein and an amide linkage through the epsilon-amino on a lysine, or other free amino group in the other. A variety of such disulfide/amide-forming agents are known. See, for example, Immun. Rev. (1982) 62:185. Other bifunctional coupling agents form a thioether rather than a disulfide linkage. Many of these thio-ether-forming agents are commercially available and include reactive esters of 6-maleimidocaproic acid, 2-bromoacetic acid, 2-iodoacetic acid, 4-(N-maleimido-methyl)cyclohexane-1-carboxylic acid, and the like. The carboxyl groups can be activated by combining them with succinimide or 1-hydroxyl-2-nitro-4-sulfonic acid, sodium salt. Additional methods of coupling antigens employs the rotavirus/“binding peptide” system described in EPO Pub. No. 259,149, the disclosure of which is incorporated herein by reference. The foregoing list is not meant to be exhaustive, and modifications of the named compounds can clearly be used.
- Any carrier may be used which does not itself induce the production of antibodies harmful to the host. Suitable carriers are typically large, slowly metabolized macromolecules such as proteins; polysaccharides, such as latex functionalized Sepharose™, agarose, cellulose, cellulose beads and the like; polymeric amino acids, such as polyglutamic acid, polylysine, and the like; amino acid copolymers; and inactive virus particles. Especially useful protein substrates are serum albumins, keyhole limpet hemocyanin, immunoglobulin molecules, thyroglobulin, ovalbumin, tetanus toxoid, and other proteins well known to those skilled in the art.
- In addition to the polypeptides comprising SEQ ID NOs: 40-48, polypeptides comprising truncated HAV amino acid sequences encoding at least one viral epitope are useful immunological reagents. For example, polypeptides comprising such truncated sequences can be used as reagents in an immunoassay. These polypeptides also are candidate subunit antigens in compositions for antiserum production or vaccines. While these truncated sequences can be produced by various known treatments of native viral protein, it is generally preferred to make synthetic or recombinant polypeptides comprising an HAV sequence. Polypeptides comprising these truncated HAV sequences can be made up entirely of HAV sequences (one or more epitopes, either contiguous or noncontiguous), or HAV sequences and heterologous sequences in a fusion protein. Useful heterologous sequences include sequences that provide for secretion from a recombinant host, enhance the immunological reactivity of the HAV epitope(s), or facilitate the coupling of the polypeptide to an immunoassay support or a vaccine carrier. See, e.g., EPO Pub. No. 116,201; U.S. Pat. No. 4,722,840; EPO Pub. No. 259,149; U.S. Pat. No. 4,629,783, the disclosures of which are incorporated herein by reference.
- The size of polypeptides comprising the truncated HAV sequences can vary widely, the minimum size being a sequence of sufficient size to provide an HAV epitope, while the maximum size is not critical. In some applications, the maximum size usually is not substantially greater than that required to provide the desired HAV epitopes and function(s) of the heterologous sequence, if any. Typically, the truncated HAV amino acid sequence will range from about 5 to about 100 amino acids in length. More typically, however, the HAV sequence will be a maximum of about 50 amino acids in length, preferably a maximum of about 30 amino acids. It is usually desirable to select HAV sequences of at least about 10, 12 or 15 amino acids, up to a maximum of about 20 or 25 amino acids. In another aspect, the truncated HAV amino acid sequence are selected from SEQ ID NOs: 40-48. In yet another aspect of the invention, the polynucleotides or the truncated amino acid sequences have at least about 50% homology to the polynucleotides of SEQ ID NOs: 40-48, preferably about 80% homology to the polynucleotides of SEQ ID NOs: 40-48, more preferably about 90%, 95%, or 99% homology to the polynucleotides of SEQ ID NOs: 40-48.
- Truncated HAV amino acid sequences comprising epitopes can be identified in a number of ways. For example, the entire viral protein sequence can be screened by preparing a series of short peptides that together span the entire protein sequence. By starting with, for example, 100-mer polypeptides, it would be routine to test each polypeptide for the presence of epitope(s) showing a desired reactivity, and then testing progressively smaller and overlapping fragments from an identified 100-mer to map the epitope of interest. Screening such peptides in an immunoassay is within the skill of the art. It is also known to carry out a computer analysis of a protein sequence to identify potential epitopes, and then prepare oligopeptides comprising the identified regions for screening. It is appreciated by those of skill in the art that such computer analysis of antigenicity does not always identify an epitope that actually exists, and can also incorrectly identify a region of the protein as containing an epitope.
- The immunogenicity of the HAV sequences may also be enhanced by preparing the sequences fused to or assembled with particle-forming proteins such as, for example, hepatitis B surface antigen or rotavirus VP6 antigen. Constructs wherein the HAV epitope is linked directly to the particle-forming protein coding sequences produce hybrids which are immunogenic with respect to the HAV epitope. In addition, all of the vectors prepared include epitopes specific to HAV, having various degrees of immunogenicity, such as, for example, the pre-S peptide. Thus, particles constructed from particle forming protein which include HAV sequences are immunogenic with respect to HAV and particle-form protein.
- III. Experimental
- Below are examples of specific embodiments for carrying out the present invention. The examples are offered for illustrative purposes only, and are not intended to limit the scope of the present invention in any way.
- Efforts have been made to ensure accuracy with respect to numbers used (e.g., amounts, temperatures, etc.), but some experimental error and deviation should, of course, be allowed for.
- In the following examples, enzymes were purchased from commercial sources, and used according to the manufacturers' directions. Nitrocellulose filters and the like were also purchased from commercial sources.
- In the isolation of DNA fragments, except where noted, all DNA manipulations were done according to standard procedures. See, Sambrook et al., supra. Restriction enzymes, T4DNA ligase, E. coli, DNA polymerase I, Klenow fragment, and other biological reagents can be purchased from commercial suppliers and used according to the manufacturers' directions. Double stranded DNA fragments were separated on agarose gels.
- Human serum samples that had previously tested positive for HAV by IgM anti-HAV ELISA [ETI-HA-IgMK PLUS; DiaSorin, Inc; Saluggia (VC), Italy] were used to isolate RNA for subsequent experiments. Samples were stored at −80° C. until used. RNA was extracted from 0.14 mL of serum using the QIAamp Viral Mini Spin Kit (QIAGEN, Valencia, Calif.) following the manufacturer's specifications.
- The RT-PCR was performed using the Titan One Tube RT-PCR Kit (Roche, Mannheim, Germany) to amplify a 243 bp fragment in the VP3/VP1 region. The 243 bp fragment corresponds to nucleotide positions 2172-2415 of the HAV genome as reported by Cohen et al. (1987) J. Virol. 61: 50-59.
- Experiments were performed using the primers shown in Table 1 and the procedures described below.
TABLE 1 Primers used in the “RT-PCR” Experiments SEQ PCR Genomic ID Primer Sequence product region NO: SN2172 GCTCCTCTTTATCATGCTATGGAT 243 bp VP3/VP1 49 SN2415 GAGGAAATGTCTCAGGTACTITGT 243 bp VP3/ VP1 50
For this experiment, the “RT-PCR” was performed in a final volume of 50 μL using 10 μL of extracted HAV RNA following the manufacturer's specifications. The amplification profile involved reverse transcription at 50° C. for 30 min., template denaturation at 94° C. for 2 min., denaturation at 94° C. for 30 sec., primer annealing at 55 ° C. for 30 sec. and elongation at 68° C. for 45 sec. for 40 cycles. A final 10 min. incubation at 68° C. to ensure the full extension of fragments followed the 40 PCR cycles. - PCR products were electrophoresed on 4-20% polyacrylamide gels, stained with ethidium bromide and visualized under an UV source. Purification of amplified fragments was carried out using the QiaQuick PCR purification kit (QIAGEN, Valencia, Calif.).
- The PCR fragments were cloned into TOPO-TA vectors (Invitrogen, Carlsbad, Calif.). Cloning into these vectors is highly facilitated when the amplified DNA contains a single deoxyadenosine (A) at its 3′ end. Accordingly, a catalytic reaction to add the 3′ (A) overhead was used. The reaction mix contained 1.25 mM of dATP, 0.5 units of Taq polymerase (Perkin Elmer, Boston, Mass.) and proceeded at 72 C for 15 min.
- PCR fragments were cloned into the pCR2.1-TOPO vector using the Invitrogen's TA cloning kit (TOPOTM TA CloningR Kit with One Shot TOP10 Electrocompetent Cells) following the manufacturer's specifications. Bacterial cells were incubated at 37° C. on Luria Broth plates containing ampicillin at 100 μg/mL, 0.66 mM IPTG and 0.033% X-Gal. A number of white colonies were inoculated in 4 mL of Luria-Broth ampicillin (100 μg/ml) and incubated overnight at 37° C. with shaking. Three mL of the overnight cultures were used to prepare plasmid DNA using the QIAprep Miniprep kit (QIAGEN). Recombinant clones were identified by restriction enzyme analysis with EcoRI (New England and Biolabs) and 4-20% polyacryamide electrophoresis as described above.
- In order to determine the DNA sequences of the clones, large amounts of plasmids from recombinant clones were prepared as above and the DNA suspended in TE (10 mM Tris-HCl, pH 8.0, 1 mM EDTA) at 0.2 mg/mL. Nucleotide sequence determination of the Hepatitis A fragments was performed using an Applied Biosystems Model 373 or Model 377 DNA Sequencer system (Foster City, Calif.). The nucleotide sequence of the 243 bp VP3/VP1 fragment determined for 13 Indonesian (IND) (SEQ ID NOs: 1-13) and 14 Chilean (SCL) (SEQ ID NOs: 14-27) HAV isolates is shown in
FIG. 1 . - Cloning of
HAV P 1/2A precursor and full length open reading frame nucleotide fragments of interest include PCR of fragments of interest from the Chiron plasmid pHAVFL 18.3 #2 already containing a full length ORF of HAV and cloning those fragments of interest into the pGEM-4z vector (Promega, Madison Wis.). The pGEM vector has both an SP6 and T7 promoter to facilitate in vitro RNA synthesis of cloned products. The pGEM-4z vector was made by restriction digest of the plasmid using KpnI and SphI restriction enzymes (Roche Applied Science, Indianapolis, Ind.) followed by a phosphatase reaction using shrimp alkaline phosphatase (Roche Applied Science). The vector was then electrophoresed on an agarose gel and purified using the Promega Wizard PCR Purification kit (Promega). - Primers were designed to flank the regions of interest and included the KpnI and SphI restriction sites to facilitate cloning. Primers were ordered from an in-house DNA synthesis facility. PCR reactions using pHAVFL 18.3 #2 as template were done using the Roche Expand High Fidelity PCR System following the manufacturer's recommendations. The PCR products were electrophoresed on an agarose gel and purified using the Promega Wizard PCR Purification kit (Promega).
- The PCR products were ligated into the pGEM-4z vector using Roche Rapid DNA Ligation kit (Roche Applied Science) and transformed into HB101 competent cells. Bacterial cells were incubated at 37° C. on Luria Broth plates containing ampicillin at 100 μg/mL overnight. Three mL of overnight cultures were used to prepare plasmid DNA using the QIAprep Miniprep kit (QIAGEN, Valencia Calif.). Recombinant clones were identified by restriction enzyme analysis with KpnI and SphI (Roche Applied Science) and gel electrophoresed.
- Cloning of HAV full length open reading frame plus additional 3′ untranslated sequences include insertion of the HAV fragment from KpnI-DrdI from the above described HAV full length cloned fragment and a synthetic DNA region from DrdI-SphI into the pGEM-4z KpnI-SphI vector described above.
- Restriction enzyme digest was done on the pGEM-4z full length HAV construct described above to isolate a fragment using KpnI and DrdI enzymes (Roche Applied Science). The digest was electrophoresed and purified using the Promega Wizard PCR Purification kit (Promega). Synthetic DNA oligos were designed and ordered from an in-house DNA synthesis facility. The synthetic DrdI-SphI region was annealed from separate oligos and kinased according to standard molecular biology protocol. The two separate fragments were then ligated into the pGEM-4z vector using Roche Rapid DNA Ligation kit (Roche Applied Science) and transformed into HB101 competent cells. Bacterial cells were incubated at 37° C. on Luria Broth plates containing ampicillin at 100 μg/mL overnight. Three mL of the overnight cultures were used to prepare plasmid DNA using the QIAprep Miniprep kit (Qiagen, Valencia Calif.). Recombinant clones were identified by restriction enzyme analysis with KpnI and SphI (Roche Applied Science) and gel electrophoresed.
- Large amounts of plasmids from recombinant clones were prepared using Qiagen Maxi Plasmid kit (Qiagen) and the DNA suspended in ddH2O at 0.2 mg/mL. Nucleotide sequence determination of the HAV fragments was performed using an Applied BioSystems Model 373 or Model 377 DNA Sequencer system. The nucleotide sequence of the HAV inserts cloned in the pGEM-4z vector is shown in
FIGS. 2-4 . - Fragments encoding for P1, P1-2A, 1B, 1C, 1D, 2A and 3A were amplified using the DNA of a recombinant plasmid obtained in Chiron Corporation which contains the full-length HAV coding reading frame cloned in pUC 18. PCR primers were designed to PCR out the P1, P 1-2A, 1B, 1C, 1D, 2A and 3A regions of HAV. To facilitate the cloning of these regions into Chiron yeast expression vectors the NcoI, XhoI, and SalI restriction sites were introduced in the primers as required.
- PCR primers were synthesized in the DNA synthesis facility of Chiron Corporation. Synthetic oligonucleotides were purified, suspended in 300 ul of dH2O and their optical densities at 260 nm determined. The reaction mix contained 0.25 ng of template, 100 pmol of each primer, 10 ul of 1.25 mM of each dNTP and 1 unit of Taq polymerase (Vendor) in a final volume of 50 uL. Amplification conditions were 94° C. for 1 min., 50° C. for 2 min. and 68° C. for 4 min. for 35 cycles. A 7-min. post incubation at 75° C. was added to ensure the full extension of fragments. Aliquots of 5 μL were used to check PCR synthesis by electrophoresis on 1% agarose gels. The entire PCR product was then electrophoresed and fragments exhibiting the expected sizes were purified from the gels using the PCR Purification kit (Promega) following the vendor's recommendations. Approximately 0.8 μg of purified PCR DNA was digested with the appropriate restriction enzymes (Roche) for 3h at 37° C. and the products were further purified using the Promega PCR Purification kit.
- Plasmid pBS24.1, that was engineered to contain the yeast hybrid promoter ADH2/GAPDH (Cousens et al. (1987)
Gene 61, 265-275) and an XhoI restriction site, was used for heterologous expression of the HAV recombinant proteins. This yeast expression vector contains 2 μg sequences and inverted repeats (IR) for autonomous replication in yeast, the α-factor terminator to ensure transcription termination, and the yeast leu2-d and URA3 for selection. The Co1E1 origin of replication and the β-lactamase gene are also present for propagation and selection in E. coli(Pichuantes et al. (1996) “Expression of Heterologous Gene Products in Yeast” in Protein Engineering A Guide to Design and Production, Chapter 5. J. L. Cleland and C. Craik, eds., Wiley-Liss, Inc., New York, N.Y. pp 129-161). Plasmid pBS24.1 was digested with BamHI/SalI or XhoI/SalI and dephosphorylated with 10 units of calf intestine alkaline phosphatase (Boheringer Manheim, Indianapolis, Ind.) under the conditions recommended by the vendor. The HAV nucleotide sequences coding for HAV 2A and 3A were fused to DNA sequences coding for the human superoxide dismutase (SOD) prior to the cloning. The digested and purified HAV recombinant fragments were ligated with digested pBS24.1 using the Roche Rapid Ligation kit and protocol. The ligation mix was then used to transform Escherichia coli HB101 competent cells and transformants were selected in Luria-Broth plates containing ampicillin at 100 μg/mL after an overnight incubation at 37° C. Several colonies of each transformation were picked and inoculated in 3 mL of Luria-Broth with ampicillin at 100 μg/mL and incubated at 37° C. with shaking overnight. Plasmid DNA was prepared using 1.5 mL of cultures and the QIAprep Miniprep kit (QIAGEN). Recombinant clones were identified by analytical restriction enzyme analysis with BamHI-SalI. Large-scale preparations of recombinant plasmids were made to perform sequencing to confirm the nucleotide sequence of the cloned HAV fragments. Yeast expression plasmids exhibiting the expected sequence for HAV P1, P1-2A, 1B, 1C, 1D, SOD-2A and SOD-3A were used in experiments of yeast transformation as follows. Competent Saccharomyces cerevisiae AD3 cells [Mat a, trp1+, ura3-52, prb1-1122, pep4-3, prc1-407, [cir0],::pDM15(pGAP/ADR1::G418R)], leu2(ΔAD)] were transformed with plasmid DNAs encoding for NS1, VP1 or VP2. Selection of yeast recombinants was achieved by two rounds of uracil-deficient plates followed by one round of leucine-deficient plates after incubation at 30° C. for 48-72 hours. Cultures were then grown in leucine-deficient media and then in YEP supplemented with 2% glucose (Pichuantes et al. (1989) Proteins: Struct. Funct. Genet. 6: 324-337) for 48 h before checking expression of the recombinant proteins. - The nucleotide (SEQ ID NOs: 31-39) and corresponding amino acid sequences for the various proteins (SEQ ID NOs: 40-48) are shown in
FIGS. 5-13 . The amino acid sequence of the polypeptides was deduced from the nucleotide sequences. The nucleotide sequences and the amino acid sequences were compared to the wild-type HAV nucleotide and protein sequences reported by Cohen et al (1987) J. Virol. 61:50-59. The polynucleotide sequences from the Indonesian samples have a 93.8-96.7% homology, while the polynucleotide sequences from the Chilean samples have a 90.5-94.7% homology. The amino acid sequences from the Indonesian and the Chilean samples have a homology of 98.8-100% and 97.5-98.8%, respectively. - Accordingly, novel hepatitis A virus sequences and detection assays using these sequences have been disclosed. From the foregoing, it will be appreciated that, although specific embodiments of the invention have been described herein for purposes of illustration, various modifications may be made without deviating from the spirit and scope thereof.
# SEQUENCE LIS #TING <160> NUMBER OF SEQ ID NOS: 50 <210> SEQ ID NO 1 <211> LENGTH: 245 <212> TYPE: DNA <213> ORGANISM: Artificial Sequence <220> FEATURE: <223> OTHER INFORMATION: Description of Artificial #Sequence: IND-1-2 <400> SEQUENCE: 1 tgctcctctt tatcatgcta tggatgtcac cacacaggtt ggagatgatt cc #ggaggttt 60 ttcaacgaca gtttctacag agcagaatgt tccagatccc caagttggta ta #acaactat 120 gaaggattta aaaggaaaag ccaatagagg gaaaatggat gtttcaggag ta #caagcacc 180 tgtgggagct attacaacaa ttgaggatcc agttttagca aagaaagtac ct #gagacatt 240 tcctg # # # 245 <210> SEQ ID NO 2 <211> LENGTH: 245 <212> TYPE: DNA <213> ORGANISM: Artificial Sequence <220> FEATURE: <223> OTHER INFORMATION: Description of Artificial #Sequence: IND-2-2 <400> SEQUENCE: 2 gctcctcttt atcatgctat ggatgtcacc acacaggttg gagatgattc cg #gaggtttt 60 tcaacgacag tttctacaga gcagaatgtt cctgatcccc aagttggcat aa #caaccatg 120 agggacttaa aagggaaagc caataggggg aagatggatg tttcaggagt gc #aagcacct 180 gtgggagcta ttacaacaat tgaggatcca gttttagcaa agaaagtacc tg #agacattt 240 cctga # # # 245 <210> SEQ ID NO 3 <211> LENGTH: 245 <212> TYPE: DNA <213> ORGANISM: Artificial Sequence <220> FEATURE: <223> OTHER INFORMATION: Description of Artificial #Sequence: IND-2-4 <400> SEQUENCE: 3 tgctcctctt tatcatgcta tggatgtcac cacacaggtt ggagatgatt cc #ggaggttt 60 ttcaacgaca gtttctacag agcagaatgt tccagatccc caagttggta ta #acaaccat 120 gagggattta aaaggaaaag ccaatagagg gaaaatggat gtttcaggag ta #caagcacc 180 tgtgggagct attacaacaa ttgaggatcc agttttagca aagaaagtac ct #gagacatt 240 tcctg # # # 245 <210> SEQ ID NO 4 <211> LENGTH: 245 <212> TYPE: DNA <213> ORGANISM: Artificial Sequence <220> FEATURE: <223> OTHER INFORMATION: Description of Artificial #Sequence: IND-3-2 <400> SEQUENCE: 4 tgctcctctt tatcatgcta tggatgtcac cacacaggtt ggagatgatt cc #ggaggttt 60 ttcaacaaca gtttctacag agcagaatgt tcctgatccc caagttggca ta #acaaccat 120 gagggattta aaagggaaag ctaatagggg aaagatggat gtgtcaggag tg #caagcacc 180 tgtgggagcc atcacaacaa ttgaggatcc agttttagca aagaaagtac ct #gagacatt 240 tcctg # # # 245 <210> SEQ ID NO 5 <211> LENGTH: 245 <212> TYPE: DNA <213> ORGANISM: Artificial Sequence <220> FEATURE: <223> OTHER INFORMATION: Description of Artificial #Sequence: IND-4-5 <400> SEQUENCE: 5 gctcctcttt atcatgctat ggatgtcacc acacaggttg gagatgattc cg #gaggtttt 60 tcaacgacag tttctacaga gcagaatgtt ccagatcccc aagttggtat aa #caactatg 120 aaggatttaa aaggaaaagc caatagaggg aaaatggatg tttcaggagt ac #aagcacct 180 gtgggagcta tcacaacaat tgaggatcca gttttagcaa agaaagtacc tg #agacattt 240 cctga # # # 245 <210> SEQ ID NO 6 <211> LENGTH: 245 <212> TYPE: DNA <213> ORGANISM: Artificial Sequence <220> FEATURE: <223> OTHER INFORMATION: Description of Artificial #Sequence: IND-6-4 <400> SEQUENCE: 6 tgctcctctt tatcatgcta tggatgtcac cacacaggtt ggagatgatt cc #ggaggttt 60 ttcaacgaca gtttctacag agcagaatgt tccagatccc caagttggta ta #acaactat 120 gaaggattta aaaggaaaag ccaatagagg gaaaatggat gtttcaggag ta #caagcacc 180 tgtgggagct attacaacag ttgaggatcc agttttagca aagaaagtac ct #gagacatt 240 tcctg # # # 245 <210> SEQ ID NO 7 <211> LENGTH: 245 <212> TYPE: DNA <213> ORGANISM: Artificial Sequence <220> FEATURE: <223> OTHER INFORMATION: Description of Artificial #Sequence: IND-7-1 <400> SEQUENCE: 7 gctcctcttt atcatgctat ggatgtcacc acacaggttg gagatgattc cg #gaggtttt 60 tcaacgacag tttctacaga gcagaatgtt ccagatcccc aagttggtat aa #caactatg 120 aaggatttaa aaggaaaagc caatagaggg aaaatggatg tttcaggagt ac #aagcacct 180 gtgggagcta ttacaacagt tgaggatcca gttttagcaa agaaagtacc tg #agacattt 240 cctga # # # 245 <210> SEQ ID NO 8 <211> LENGTH: 245 <212> TYPE: DNA <213> ORGANISM: Artificial Sequence <220> FEATURE: <223> OTHER INFORMATION: Description of Artificial #Sequence: IND-8-2 <400> SEQUENCE: 8 gctcctcttt atcatgctat ggatgtcacc acacaggttg gagatgattc cg #gaggtttt 60 tcaacgacag tttctacaga gcagaatgtt ccagatcccc aagttggtat aa #caactatg 120 aaggatttaa aaggaaaagc caatagaggg aaaatggatg tttcaggagt ac #aagcacct 180 gtgggagcta ttacaacagt tgaggatcca gttttagcaa agaaagtacc tg #agacattt 240 cctga # # # 245 <210> SEQ ID NO 9 <211> LENGTH: 245 <212> TYPE: DNA <213> ORGANISM: Artificial Sequence <220> FEATURE: <223> OTHER INFORMATION: Description of Artificial #Sequence: IND-9-1 <400> SEQUENCE: 9 tgctcctctt tatcatgcta tggatgtcac cacacaggtt ggagatgatt cc #ggaggttt 60 ttcaacgaca gtttctacag agcagaatgt tccagatccc caagttggta ta #acaactat 120 gaaggattta aaaggaaaag ccaatagagg gaaaatggat gtttcaggag ta #caagcacc 180 tgtgggagct attacaacag ttgaggatcc agttttagca aagaaagtac ct #gagacatt 240 tcctg # # # 245 <210> SEQ ID NO 10 <211> LENGTH: 245 <212> TYPE: DNA <213> ORGANISM: Artificial Sequence <220> FEATURE: <223> OTHER INFORMATION: Description of Artificial #Sequence: IND-10-5 <400> SEQUENCE: 10 tgctcctctt tatcatgcta tggatgtcac cacacaggtt ggagatgatt cc #ggaggttt 60 ttcaacgaca gtttctacag agcagaatgt tcctggtccc caagttggca ta #acaaccat 120 gagggactta aaagggaaag ccaatagggg gaagatggat gtttcaggag tg #caagcacc 180 tgtgggagct attacaacaa ttgaggatcc agttttagca aagaaagtac ct #gagacatt 240 tcctg # # # 245 <210> SEQ ID NO 11 <211> LENGTH: 245 <212> TYPE: DNA <213> ORGANISM: Artificial Sequence <220> FEATURE: <223> OTHER INFORMATION: Description of Artificial #Sequence: IND-11-5 <400> SEQUENCE: 11 tgctcctctt tatcatgcta tggatgtcac cacacaggtt ggagatgatt cc #ggaggttt 60 ttcaacgaca gtttctacag agcagaatgt tccagatccc caagttggta ta #acaactat 120 gaaggattta aaaggaaaag ccaatagagg gaaaatggat gtttcaggag ta #caagcacc 180 tgtgggagct attacaacaa ttgaggatcc agttttagca aagaaagtac ct #gagacatt 240 tcctg # # # 245 <210> SEQ ID NO 12 <211> LENGTH: 245 <212> TYPE: DNA <213> ORGANISM: Artificial Sequence <220> FEATURE: <223> OTHER INFORMATION: Description of Artificial #Sequence: IND-12-1 <400> SEQUENCE: 12 gctcctcttt atcatgctat ggatgttact acacaggttg gagatgattc ag #gaggtttc 60 tcaacaacag tttccacaga gcagaatgtt cctgatcccc aagttgggat aa #caaccatg 120 agggatttaa aaggggaagc caatagggga aagatggatg tttcaggagt gc #aagcacct 180 gtgggagcta tcacaacaat tgaggatcca gttttagcaa agaaagtacc tg #agacattt 240 cctga # # # 245 <210> SEQ ID NO 13 <211> LENGTH: 244 <212> TYPE: DNA <213> ORGANISM: Artificial Sequence <220> FEATURE: <223> OTHER INFORMATION: Description of Artificial #Sequence: IND-12-2 <400> SEQUENCE: 13 ctcctcttta tcatgctatg gatgttacca cacaggttgg agatgattca gg #aggttttt 60 caacaacagt ttctacagag cagaatgttc ctgatcccca agttggcata ac #aaccatga 120 gggacttaaa agggaaagcc aataggggga agatggatgt ttcaggagtg ca #agcacctg 180 tgggagctat tacaacaatt gaggatccag ttttagcaaa gaaagtacct ga #gacatttc 240 ctga # # # 244 <210> SEQ ID NO 14 <211> LENGTH: 245 <212> TYPE: DNA <213> ORGANISM: Artificial Sequence <220> FEATURE: <223> OTHER INFORMATION: Description of Artificial #Sequence: SCL2-10 <400> SEQUENCE: 14 tgctcctctt tatcatgcta tggatgtcac cacacaggtt ggagatgatt cc #gggggttt 60 ttcaacgaca gtttctacag agcagaatgt tccagatccc caagttggta ta #acaactat 120 gaaggattta aaaggaaaag ccaatagagg gaaaatggat gtttcaggag ta #caagcacc 180 tgtgggagct attacaacag ttgaggatcc agttttagca aagaaagtac ct #gagacatt 240 tcctg # # # 245 <210> SEQ ID NO 15 <211> LENGTH: 245 <212> TYPE: DNA <213> ORGANISM: Artificial Sequence <220> FEATURE: <223> OTHER INFORMATION: Description of Artificial #Sequence: SCL3-10 <400> SEQUENCE: 15 gctcctcttt atcatgctat ggatgtcacc acacaggttg gagatgattc cg #gaggtttt 60 tcaacgacag tttctacaga gcagaatgtt ccagatcccc aagttggtat aa #caactatg 120 aaggatttaa aaggaaaagc caatagaggg aaaatggatg tttcaggagt ac #aagcacct 180 gtgggagcta ttacaacagt tgaggatcca gttttagcaa agaaagtacc tg #agacattt 240 cctga # # # 245 <210> SEQ ID NO 16 <211> LENGTH: 245 <212> TYPE: DNA <213> ORGANISM: Artificial Sequence <220> FEATURE: <223> OTHER INFORMATION: Description of Artificial #Sequence: SCL4-3 <400> SEQUENCE: 16 gctcctcttt atcatgctat ggatgttacc acacaggttg gagacgattc ag #gaggtttt 60 tcaacaacag tttctactga gcagaatgtt cctgatcccc aagttggtat aa #caaccatg 120 agggacctaa aagggaaagc caatagaggg aagatggatg tttcaggagt ac #aagcacct 180 gtgggagcta ttacaacaat tgaggatcca gtcttggcaa agaaagtacc tg #agacattt 240 cctga # # # 245 <210> SEQ ID NO 17 <211> LENGTH: 245 <212> TYPE: DNA <213> ORGANISM: Artificial Sequence <220> FEATURE: <223> OTHER INFORMATION: Description of Artificial #Sequence: SCL7-6 <400> SEQUENCE: 17 gctcctcttt atcatgctat ggatgtcacc acacaggttg gagatgattc cg #gaggtttt 60 tcaacgacag tttctacaga gcagaatgtt ccagatcccc aagttggtat aa #caactatg 120 aaggatttaa aaggaaaagc caatagaggg aaaatggatg tttcaggagt ac #aagcacct 180 gtgggagcta ttacaacagt tgaggatcca gttttagcaa agaaagtacc tg #agacattt 240 cctga # # # 245 <210> SEQ ID NO 18 <211> LENGTH: 245 <212> TYPE: DNA <213> ORGANISM: Artificial Sequence <220> FEATURE: <223> OTHER INFORMATION: Description of Artificial #Sequence: SCL8-2 <400> SEQUENCE: 18 gctcctcttt atcatgctat ggatgtcacc acacaggttg gagatgattc ag #gaggtttt 60 tcaacaacag tttctacaga acagaatgtt cctgatcccc aggttggcat aa #caactatg 120 agggatctaa aagggaaggc caatagtgga aagatggatg tttcaggagt gc #aagcacct 180 gtgggggcta ttacaacaat tgaggatcca gttttagcaa agaaagtacc tg #agacattt 240 cctga # # # 245 <210> SEQ ID NO 19 <211> LENGTH: 245 <212> TYPE: DNA <213> ORGANISM: Artificial Sequence <220> FEATURE: <223> OTHER INFORMATION: Description of Artificial #Sequence: SCL8-5 <400> SEQUENCE: 19 gctcctcttt atcatgctat ggatgtcacc acacaggttg gagatgattc ag #gaggtttt 60 tcaacaacag tttctacaga gcagaatgtt cctgatcccc aggttggcat aa #caactatg 120 agggatctaa aagggaaggc caatagtgga aagatggatg tttcaggagt gc #aagcacct 180 gtgggggcta ttacaacaat tgaggatcca gttttagcaa agaaagtacc tg #agacattt 240 cctga # # # 245 <210> SEQ ID NO 20 <211> LENGTH: 245 <212> TYPE: DNA <213> ORGANISM: Artificial Sequence <220> FEATURE: <223> OTHER INFORMATION: Description of Artificial #Sequence: SCL9-4 <400> SEQUENCE: 20 gctcctcttt atcatgctat ggatgttacc acacaggttg gagatgattc ag #gaggtttt 60 tcaacaacag tttctacaga acagaatgtt cctgatcccc aggttggcat aa #caactatg 120 agggatctaa aagggaaggc caatagtgga aagatggatg tttcaggagt gc #aagcacct 180 gtgggggcta ttacaacaat tgaggatcca gttttagcaa agaaagtacc tg #agacattt 240 cctga # # # 245 <210> SEQ ID NO 21 <211> LENGTH: 245 <212> TYPE: DNA <213> ORGANISM: Artificial Sequence <220> FEATURE: <223> OTHER INFORMATION: Description of Artificial #Sequence: SCL10-1 <400> SEQUENCE: 21 gctcctcttt atcatgctat ggatgttacc acacaggttg gagatgattc ag #gaggtttt 60 tcaacaacag tttctacaga acagaatgtt cctgatcccc aggttggcat aa #caactatg 120 agggatctaa aagggaaggc caatagtgga aagatggatg tttcaggagt gc #aagcacct 180 gtgggggcta ttacaacaat tgaggatcca gttttagcaa agaaagtacc tg #agacattt 240 cctga # # # 245 <210> SEQ ID NO 22 <211> LENGTH: 245 <212> TYPE: DNA <213> ORGANISM: Artificial Sequence <220> FEATURE: <223> OTHER INFORMATION: Description of Artificial #Sequence: SCL11-5 <400> SEQUENCE: 22 gctcctcttt atcatgctat ggatgttacc acacaggttg gagatgattc ag #gaggtttt 60 tcaacaacag tttctacaga acagaatgtt cctgatcccc aggttggcat aa #caactatg 120 agggatctaa aagggaaggc caatagtgga aagatggatg tttcaggagt gc #aagcacct 180 gtgggggcta ttacaacaat tgaggatcca gttttagcaa agaaagtacc tg #agacattt 240 cctga # # # 245 <210> SEQ ID NO 23 <211> LENGTH: 245 <212> TYPE: DNA <213> ORGANISM: Artificial Sequence <220> FEATURE: <223> OTHER INFORMATION: Description of Artificial #Sequence: SCL12-6 <400> SEQUENCE: 23 gctcctcttt atcatgctat ggatgtcacc acacaggttg gagatgattc cg #gaggtttt 60 tcaacgacag tttctacaga gcagaatgtt ccagatcccc aagttggtat aa #caactatg 120 aaggatttaa aaggaaaagc caatagaggg aaaatggatg tttcaggagt ac #aagcacct 180 gtgggagcta ttacaacagt tgaggatcca gttttagcaa agaaagtacc tg #agacattt 240 cctga # # # 245 <210> SEQ ID NO 24 <211> LENGTH: 245 <212> TYPE: DNA <213> ORGANISM: Artificial Sequence <220> FEATURE: <223> OTHER INFORMATION: Description of Artificial #Sequence: SCL14-3 <400> SEQUENCE: 24 gctcctcttt atcatgctat ggatgttacc acacaggttg gagacgattc ag #gaggtttt 60 tcaacaacag tttctacaga gcagaatgtt cctgatcccc aagttggtat aa #caaccatg 120 agggacctaa aagggaaagc caatagaggg aagatggatg tttcaggagt ac #aagcacct 180 gtgggagcta ttacaacaat tgaggatcca gtcttggcaa agaaagtacc tg #agacattt 240 cctga # # # 245 <210> SEQ ID NO 25 <211> LENGTH: 245 <212> TYPE: DNA <213> ORGANISM: Artificial Sequence <220> FEATURE: <223> OTHER INFORMATION: Description of Artificial #Sequence: SCL15-1 <400> SEQUENCE: 25 gctcctcttt atcatgctat ggatgttacc acacaggttg gagacgattc ag #gaggtttt 60 tcaacaacag tttctacaga gcagaatgtt cctgatcccc aagttggtat aa #caaccatg 120 agggacctaa aagggaaagc caatagaggg aagatggatg tttcaggagt ac #aagcacct 180 gtgggagcta ttacaacaat tgaggatcca gttttggcaa agaaagtacc tg #agacattt 240 cctga # # # 245 <210> SEQ ID NO 26 <211> LENGTH: 245 <212> TYPE: DNA <213> ORGANISM: Artificial Sequence <220> FEATURE: <223> OTHER INFORMATION: Description of Artificial #Sequence: SCL15-2 <400> SEQUENCE: 26 gctcctcttt atcatgctat ggatgttacc acacaggttg gagacgattc ag #gaggtttt 60 tcaacaacag tttctacaga gcagaatgtt cctgatcccc aagttggtat aa #caaccatg 120 agggacctaa aagggaaagc caatagaggg aagatggatg tttcaggagt ac #aagcacct 180 gtgggagcta ttacaacaat tgaggatcca gtcttggcaa agaaagtacc tg #agacattt 240 cctga # # # 245 <210> SEQ ID NO 27 <211> LENGTH: 245 <212> TYPE: DNA <213> ORGANISM: Artificial Sequence <220> FEATURE: <223> OTHER INFORMATION: Description of Artificial #Sequence: SCL16-8 <400> SEQUENCE: 27 gctcctcttt atcatgctat ggatgttacc acacaggttg gagatgattc ag #gaggtttt 60 tcaacaacag tttctacaga acagaatgtt cctgatcccc aggttggcat aa #caactatg 120 agggatctaa aagggaaggc caatagtgga aagatggatg tttcaggagt gc #aagcacct 180 gtgggggcta ttacaacaat tgaggatcca gttttagcaa agaaagtacc tg #agacattt 240 cctga # # # 245 <210> SEQ ID NO 28 <211> LENGTH: 2950 <212> TYPE: DNA <213> ORGANISM: Artificial Sequence <220> FEATURE: <223> OTHER INFORMATION: Description of Artificial #Sequence: HAV P1/2A precursor <400> SEQUENCE: 28 ggtaccatga atatgtccaa acaaggaatt ttccagactg ttgggagtgg cc #ttgaccac 60 atcctgtctt tggcagatat tgaggaagag caaatgattc agtccgttga ta #ggactgca 120 gtgactggag cttcttactt cacttctgtg gaccaatctt cagttcatac tg #ctgaggtt 180 ggctcacatc aaattgaacc tttgaaaacc tctgttgata aacctggttc ta #agaaaact 240 cagggggaaa agtttttcct gattcattct gctgattggc tcactacaca tg #ctctcttt 300 catgaagttg caaaattgga tgtggtgaaa ctactgtata atgagcagtt tg #ccgtccaa 360 ggtttgttga gataccatac atatgcaaga tttggcattg agattcaagt tc #agataaat 420 cccacaccct ttcagcaagg aggactaatt tgtgccatgg ttcctggtga cc #aaagttat 480 ggttcaatag catccttgac tgtttatcct catggtctgt taaattgcaa ta #tcaacaat 540 gtagttagaa taaaggttcc atttatttat actagaggtg cttatcattt ta #aagatcca 600 cagtacccag tttgggaatt gacaatcaga gtttggtcag agttgaatat tg #gaacagga 660 acttcagctt acacttcact caatgtttta gctaggttta cagatttgga gt #tgcatgga 720 ttaactcctc tttctacaca gatgatgaga aatgaattta gggtcagtac ta #ctgaaaat 780 gttgtaaatt tgtcaaatta tgaagatgca agggcaaaaa tgtcttttgc tt #tggatcag 840 gaagattgga agtctgatcc ttcccaaggt ggtggaatta aaattactca tt #ttactacc 900 tggacatcca ttccaacctt agctgctcag tttccattta atgcttcaga tt #cagttgga 960 caacaaatta aagttattcc agtggaccca tactttttcc aaatgacaaa ca #ctaatcct 1020 gatcaaaaat gtataactgc cttggcctct atttgtcaga tgttctgctt tt #ggagggga 1080 gatcttgttt ttgattttca ggtttttcca accaaatatc attcaggtag ac #tgttgttt 1140 tgttttgttc ctgggaatga gttaatagat gttactggaa ttacattaaa ac #aggcaact 1200 actgctcctt gtgcagtgat ggacattaca ggagtgcagt caaccttgag at #ttcgtgtt 1260 ccttggattt ctgatacacc ttatcgagtg aataggtaca cgaagtcagc ac #atcaaaaa 1320 ggtgagtaca ctgccattgg gaagcttatt gtgtattgtt ataacagact ga #cttctcct 1380 tctaatgttg cttctcatgt tagagttaat gtttatcttt cagcaattaa tt #tggaatgt 1440 tttgctcctc tttaccatgc tatggatgtt actacacagg ttggagatga tt #caggaggt 1500 ttctcaacaa cagtttctac agagcagaat gttcctgatc cccaagttgg ga #taacaacc 1560 atgagggatt taaaaggaaa agccaatagg ggaaagatgg atgtttcagg ag #tgcaagca 1620 cctgtgggag ctatcacaac aattgaagat ccagttttag caaagaaagt ac #ctgagaca 1680 tttcctgaat tgaagcctgg agagtccaga catacatcag atcacatgtc ta #tttataaa 1740 ttcatgggaa ggtctcattt tttgtgcact tttactttca attcaaataa ta #aagagtac 1800 acatttccaa taaccctgtc ttcgacttct aatcctcctc atggtttacc at #caacatta 1860 aggtggttct tcaatttgtt tcagttgtat agaggaccat tggatttaac aa #ttataatc 1920 acaggagcca ctgatgtgga tggtatggcc tggtttactc cagtgggcct tg #ctgtcgac 1980 accccttggg tggaaaagga gtcagctttg tctattgatt ataaaactgc cc #ttggagct 2040 gttagattta atacaagaag aacaggaaac attcaaatta gattgccgtg gt #attcttat 2100 ttgtatgccg tgtctggagc actggatggc ttgggggata agacagattc ta #catttgga 2160 ttggtttcta ttcagattgc aaattacaat cattctgatg aatatttgtc ct #tcagttgt 2220 tatttgtctg tcacagagca atcagagttc tattttccta gagctccatt aa #attcaaat 2280 gctatgttgt ccactgaatc catgatgagt agaattgcag ctggagactt gg #agtcatca 2340 gtggatgatc ccagatcaga ggaggataga agatttgaga gtcatataga at #gtaggaaa 2400 ccatacaaag aattgagact ggaggttggg aaacaaagac tcaaatatgc tc #aggaagag 2460 ttatcaaatg aagtgcttcc acctcctagg aaaatgaagg ggttattttc ac #aagctaaa 2520 atttctcttt tttatactga ggagcatgaa ataatgaagt tttcttggag ag #gagtgact 2580 gctgatacta gggctttgag aagatttgga ttctctctgg ctgctggtag aa #gtgtgtgg 2640 actcttgaaa tggatgctgg agttcttact ggaagattga tcagattgaa tg #atgagaaa 2700 tggacagaaa tgaaggatga taagattgtt tcattaattg aaaagttcac aa #gcaataaa 2760 tattggtcta aagtgaattc tccacatgga atgttggatc ttgaagaaat gc #tgccaatt 2820 ctaagatttt ccaaatatgt ctgagacaga tttgtgtttc ctgttacatt gg #ctaaatcc 2880 aaagaaaatc aatttagcag atagaatgct tggattgtct ggagtgcagg aa #attaagga 2940 acaggcatgc # # # 2950 <210> SEQ ID NO 29 <211> LENGTH: 6696 <212> TYPE: DNA <213> ORGANISM: Artificial Sequence <220> FEATURE: <223> OTHER INFORMATION: Description of Artificial #Sequence: HAV open reading frame <400> SEQUENCE: 29 ggtaccatga atatgtccaa acaaggaatt ttccagactg ttgggagtgg cc #ttgaccac 60 atcctgtctt tggcagatat tgaggaagag caaatgattc agtccgttga ta #ggactgca 120 gtgactggag cttcttactt cacttctgtg gaccaatctt cagttcatac tg #ctgaggtt 180 ggctcacatc aaattgaacc tttgaaaacc tctgttgata aacctggttc ta #agaaaact 240 cagggggaaa agtttttcct gattcattct gctgattggc tcactacaca tg #ctctcttt 300 catgaagttg caaaattgga tgtggtgaaa ctactgtata atgagcagtt tg #ccgtccaa 360 ggtttgttga gataccatac atatgcaaga tttggcattg agattcaagt tc #agataaat 420 cccacaccct ttcagcaagg aggactaatt tgtgccatgg ttcctggtga cc #aaagttat 480 ggttcaatag catccttgac tgtttatcct catggtctgt taaattgcaa ta #tcaacaat 540 gtagttagaa taaaggttcc atttatttat actagaggtg cttatcattt ta #aagatcca 600 cagtacccag tttgggaatt gacaatcaga gtttggtcag agttgaatat tg #gaacagga 660 acttcagctt acacttcact caatgtttta gctaggttta cagatttgga gt #tgcatgga 720 ttaactcctc tttctacaca gatgatgaga aatgaattta gggtcagtac ta #ctgaaaat 780 gttgtaaatt tgtcaaatta tgaagatgca agggcaaaaa tgtcttttgc tt #tggatcag 840 gaagattgga agtctgatcc ttcccaaggt ggtggaatta aaattactca tt #ttactacc 900 tggacatcca ttccaacctt agctgctcag tttccattta atgcttcaga tt #cagttgga 960 caacaaatta aagttattcc agtggaccca tactttttcc aaatgacaaa ca #ctaatcct 1020 gatcaaaaat gtataactgc cttggcctct atttgtcaga tgttctgctt tt #ggagggga 1080 gatcttgttt ttgattttca ggtttttcca accaaatatc attcaggtag ac #tgttgttt 1140 tgttttgttc ctgggaatga gttaatagat gttactggaa ttacattaaa ac #aggcaact 1200 actgctcctt gtgcagtgat ggacattaca ggagtgcagt caaccttgag at #ttcgtgtt 1260 ccttggattt ctgatacacc ttatcgagtg aataggtaca cgaagtcagc ac #atcaaaaa 1320 ggtgagtaca ctgccattgg gaagcttatt gtgtattgtt ataacagact ga #cttctcct 1380 tctaatgttg cctctcatgt tagagttaat gtttatcttt cagcaattaa tt #tggaatgt 1440 tttgctcctc tttaccatgc tatggatgtt actacacagg ttggagatga tt #caggaggt 1500 ttctcaacaa cagtttctac agagcagaat gttcctgatc cccaagttgg ga #taacaacc 1560 atgagggatt taaaaggaaa agccaatagg ggaaagatgg atgtttcagg ag #tgcaagca 1620 cctcgtggga gctatcagca acaattgaac gatccagttt tagcaaagaa ag #tacctgag 1680 acatttcctg aattgaagcc tggagagtcc agacatacat cagatcacat gt #ctatttat 1740 aaattcatgg gaaggtctca ttttttgtgc acttttactt tcaattcaaa ta #ataaagag 1800 tacacatttc caataaccct gtcttcgact tctaatcctc ctcatggttt ac #catcaaca 1860 ttaaggtggt tcttcaattt gtttcagttg tatagaggac cattggattt aa #caattata 1920 atcacaggag ccactgatgt ggatggtatg gcctggttta ctccagtggg cc #ttgctgtc 1980 gacccttggg tggaaaagga gtcagctttg tctattgatt ataaaactgc cc #ttggagct 2040 gttagattta atacaagaag aacaggaaac attcaaatta gattgccgtg gt #attcttat 2100 ttgtatgccg tgtctggagc actggatggc ttgggggata agacagattc ta #catttgga 2160 ttgtttctat tcgagattgc aaattacaat cattctgatg aatatttgtc ct #tcagttgt 2220 tatttgtctg tcacagagca atcagagttc tattttccta gagctccatt aa #attcaaat 2280 gctatgttgt ccactgaatc catgatgagt agaattgcag ctggagactt gg #agtcatca 2340 gtggatgatc ccagatcaga ggaggataga agatttgaga gtcatataga at #gtaggaaa 2400 ccatacaaag aattgagact ggaggttggg aaacaaagac tcaaatatgc tc #aggaagag 2460 ttatcaaatg aagtgcttcc acctcctagg aaaatgaagg ggttattttc ac #aagctaaa 2520 atttctcttt tttatactga ggagcatgaa ataatgaagt tttcttggag ag #gagtgact 2580 gctgatacta gggctttgag aagatttgga ttctctctgg ctgctggtag aa #gtgtgtgg 2640 actcttgaaa tggatgctgg agttcttact ggaagattga tcagattgaa tg #atgagaaa 2700 tggacagaaa tgaaggatga taagattgtt tcattaattg aaaagttcac aa #gcaataaa 2760 tattggtcta aagtgaattt tccacatgga atgttggatc ttgaagaaat tg #ctgccaat 2820 tctaaggatt ttccaaatat gtctgagaca gatttgtgtt tcctgttaca tt #ggctaaat 2880 ccaaagaaaa tcaatttagc agatagaatg cttggattgt ctggagtgca gg #aaattaag 2940 gaacagggtg ttggactgat agcagagtgt agaactttct tggattctat tg #ctgggact 3000 ttgaaatcta tgatgtttgg gtttcatcat tctgtgactg ttgaaattat aa #atactgtg 3060 ctttgttttg ttaagagtgg aatcctgctt tatgtcatac aacaattgaa cc #aagatgaa 3120 cactctcaca taattggttt gttgagagtt atgaattatg cagatattgg ct #gttcagtt 3180 atttcatgtg gtaaagtttt ttccaaaatg ttagaaacag tttttaattg gc #aaatggat 3240 tctagaatga tggagctgag gactcagagc ttctctaatt ggttaagaga ta #tttgttca 3300 ggaattacta tttttaaaag ttttaaggat gccatatatt ggttatatac aa #aattgaag 3360 gatttttatg aagtaaatta tggcaagaaa aaggatattc ttaatattct ca #aagataat 3420 cagcaaaaaa tagaaaaagc cattgaagaa gcagacaatt tttgcatttt gc #aaattcaa 3480 gatgtagaga aatttgatca gtatcagaaa ggggttgatt taatacaaaa gc #tgagaact 3540 gtccattcaa tggcgcaagt tgaccccaat ttgggggttc atttgtcacc tc #tcagagat 3600 tgcatagcaa gagtccacca aaagctcaag aatcttggat ctataaatca gg #ccatggta 3660 acaagatgtg agccagttgt ttgctatttg tatggcaaaa gagggggagg ga #aaagcttg 3720 acttcaattg cattggcaac caaaatttgt aaacactatg gtgttgaacc tg #agaaaaat 3780 atttacacca aacctgtggc ctcagattat tgggatggat atagtggaca at #tagtttgc 3840 attattgatg atattggcca aaacacaaca gatgaagatt ggtcagattt tt #gtcaatta 3900 gtgtcaggat gcccaatgag attgaatatg gcttctctag aggagaaggg ca #gacatttt 3960 tcctctcctt ttataatagc aacttcaaat tggtcaaatc caagtccaaa aa #cagtttat 4020 gttaaggaag caattgatcg taggcttcat tttaaggttg aagttaaacc tg #cttcattt 4080 tttaaaaatc ctcacaatga tatgttgaat gttaatttgg ccaaaacaaa tg #atgcaatt 4140 aaggacatgt cttgtgttga tttaataatg gatggacaca atatttcatt ga #tggattta 4200 cttagttcct tagtgatgac agttgaaatt aggaaacaga atatgagtga at #tcatggag 4260 ttgtggtctc agggaatttc agatgatgac aatgatagtg cagtggctga gt #ttttccag 4320 tcttttccat ctggtgaacc atcaaattgg aagttatcta gttttttcca at #ctgtcact 4380 aatcacaagt gggttgctgt gggagctgca gttggcattc ttggagtgct tg #tgggagga 4440 tggtttgtgt ataagcattt ttcccgcaaa gaggaagaac caattccagc tg #aaggggtt 4500 tatcatggcg tgactaagcc caaacaagtg attaaattgg atgcagatcc ag #tagagtcc 4560 cagtcaactc tagaaatagc aggattagtt aggaaaaatc tggttcagtt tg #gagttggt 4620 gagaaaaatg gatgtgtgag atgggtcatg aatgccttag gagtgaagga tg #attggttg 4680 ttagtacctt ctcatgctta taaatttgaa aaggattatg aaatgatgga gt #tttacttc 4740 aatagaggtg gaacttacta ttcaatttca gctggtaatg ttgttattca at #ctttagat 4800 gtgggatttc aagatgttgt tttaatgaag gtttctacaa ttcccaagtt ta #gagatatt 4860 actcaacact ttattaagaa aggagatgtg cctagagcct taaatcgctt gg #caacatta 4920 gtgacaaccg ttaatggaac tcctatgtta atttctgagg gaccattaaa ga #tggaagaa 4980 aaagccactt atgttcataa gaagaatgat ggtactacag ttgatttgac tg #tagatcag 5040 gcatggagag gaaaaggtga aggtcttcct ggaatgtgtg gtggggccct ag #tgtcatca 5100 aatcagtcca tacagaatgc aattttgggt attcatgttg ctggaggaaa tt #caattctt 5160 gtggcaaagc tggttactca agaaatgttt caaaacattg ataagaaaat tg #aaagtcag 5220 agaataatga aagtggaatt tactcaatgt tcaatgaatg tagtctccaa aa #cgcttttt 5280 agaaagagtc ccattcatca ccacattgat agaaccatga ttaattttcc tg #cagctatg 5340 cctttctcta aagctgaaat tgatccaatg gctatgatgt tgtccaaata tt #cattacct 5400 attgtggagg aaccagagga ttacaaggaa gcttcagttt tttatcaaaa ca #aaatagta 5460 ggcaagactc agctagttga tgacttttta gatcttgata tggctattac ag #gggctcca 5520 ggcattgatg ctatcaatat ggattcatct cctgggtttc cttatgttca ag #aaaaattg 5580 accaaaagag atttaatttg gttggatgaa aatggtttgc tgttaggagt tc #acccaaga 5640 ttggcccaga gaattttatt taatactgtc atgatggaaa attgttctga ct #tagatgtt 5700 gtttttacaa cttgtccaaa agatgaattg agaccattag aaaaagtttt gg #aatcaaaa 5760 acaagagcca ttgatgcttg tcctttggat tatacaattc tatgtcgaat gt #attggggt 5820 ccagctatca gttatttcca tttgaatcca gggtttcaca caggtgttgc ta #ttggcata 5880 gatcctgata gacagtggga tgaattattt aaaacaatga taagatttgg ag #atgttggt 5940 cttgatttag atttctctgc ttttgatgcc agtcttagtc catttatgat ta #gggaagca 6000 ggtagaatca tgagtgaatt atctggaaca ccatctcatt ttggaacagc tc #ttatcaat 6060 actatcattt attctaaaca tctgctgtac aactgttgtt atcatgtttg tg #gttcaatg 6120 ccttctgggt ctccttgcac agctttgttg aattcaatta ttaataatat ta #atctgtat 6180 tatgtgtttt ctaaaatatt tggaaagtct ccagttttct tttgtcaagc tt #tgaggatc 6240 ctttgttacg gagatgatgt tttgatagtt ttttccagag atgttcaaat tg #acaatctt 6300 gacttgattg gacagaaaat tgtagatgag ttcaaaaaac ttggcatgac ag #ccacctca 6360 gctgataaaa atgtgcctca actgaagcca gtttcagaat tgacttttct ca #aaagatct 6420 ttcaatttgg tggaggatag aattagacct gcaatttcag aaaagacaat tt #ggtctttg 6480 atggcttggc agagaagtaa cgctgagttt gagcggaatt tagaaaatgc tc #agtggttt 6540 gcttttatgc atggctatga gttctatcag aaattttatt attttgttca gt #cctgtttg 6600 gagaaagaga tgatagaata tagacttaaa tcttatgatt ggtggagaat ga #gattttat 6660 gaccagtgtt tcatttgtga cctttcatga gcatgc # # 6696 <210> SEQ ID NO 30 <211> LENGTH: 6757 <212> TYPE: DNA <213> ORGANISM: Artificial Sequence <220> FEATURE: <223> OTHER INFORMATION: Description of Artificial #Sequence: HAV open reading frame plus addit #ional 3′ untranslated sequences <400> SEQUENCE: 30 ggtaccatga atatgtccaa acaaggaatt ttccagactg ttgggagtgg cc #ttgaccac 60 atcctgtctt tggcagatat tgaggaagag caaatgattc agtccgttga ta #ggactgca 120 gtgactggag cttcttactt cacttctgtg gaccaatctt cagttcatac tg #ctgaggtt 180 ggctcacatc aaattgaacc tttgaaaacc tctgttgata aacctggttc ta #agaaaact 240 cagggggaaa agtttttcct gattcattct gctgattggc tcactacaca tg #ctctcttt 300 catgaagttg caaaattgga tgtggtgaaa ctactgtata atgagcagtt tg #ccgtccaa 360 ggtttgttga gataccatac atatgcaaga tttggcattg agattcaagt tc #agataaat 420 cccacaccct ttcagcaagg aggactaatt tgtgccatgg ttcctggtga cc #aaagttat 480 ggttcaatag catccttgac tgtttatcct catggtctgt taaattgcaa ta #tcaacaat 540 gtagttagaa taaaggttcc atttatttat actagaggtg cttatcattt ta #aagatcca 600 cagtacccag tttgggaatt gacaatcaga gtttggtcag agttgaatat tg #gaacagga 660 acttcagctt acacttcact caatgtttta gctaggttta cagatttgga gt #tgcatgga 720 ttaactcctc tttctacaca gatgatgaga aatgaattta gggtcagtac ta #ctgaaaat 780 gttgtaaatt tgtcaaatta tgaagatgca agggcaaaaa tgtcttttgc tt #tggatcag 840 gaagattgga agtctgatcc ttcccaaggt ggtggaatta aaattactca tt #ttactacc 900 tggacatcca ttccaacctt agctgctcag tttccattta atgcttcaga tt #cagttgga 960 caacaaatta aagttattcc agtggaccca tactttttcc aaatgacaaa ca #ctaatcct 1020 gatcaaaaat gtataactgc cttggcctct atttgtcaga tgttctgctt tt #ggagggga 1080 gatcttgttt ttgattttca ggtttttcca accaaatatc attcaggtag ac #tgttgttt 1140 tgttttgttc ctgggaatga gttaatagat gttactggaa ttacattaaa ac #aggcaact 1200 actgctcctt gtgcagtgat ggacattaca ggagtgcagt caaccttgag at #ttcgtgtt 1260 ccttggattt ctgatacacc ttatcgagtg aataggtaca cgaagtcagc ac #atcaaaaa 1320 ggtgagtaca ctgccattgg gaagcttatt gtgtattgtt ataacagact ga #cttctcct 1380 tctaatgttg cctctcatgt tagagttaat gtttatcttt cagcaattaa tt #tggaatgt 1440 tttgctcctc tttaccatgc tatggatgtt actacacagg ttggagatga tt #caggaggt 1500 ttctcaacaa cagtttctac agagcagaat gttcctgatc cccaagttgg ga #taacaacc 1560 atgagggatt taaaaggaaa agccaatagg ggaaagatgg atgtttcagg ag #tgcaagca 1620 cctcgtggga gctatcagca acaattgaac gatccagttt tagcaaagaa ag #tacctgag 1680 acatttcctg aattgaagcc tggagagtcc agacatacat cagatcacat gt #ctatttat 1740 aaattcatgg gaaggtctca ttttttgtgc acttttactt tcaattcaaa ta #ataaagag 1800 tacacatttc caataaccct gtcttcgact tctaatcctc ctcatggttt ac #catcaaca 1860 ttaaggtggt tcttcaattt gtttcagttg tatagaggac cattggattt aa #caattata 1920 atcacaggag ccactgatgt ggatggtatg gcctggttta ctccagtggg cc #ttgctgtc 1980 gacccttggg tggaaaagga gtcagctttg tctattgatt ataaaactgc cc #ttggagct 2040 gttagattta atacaagaag aacaggaaac attcaaatta gattgccgtg gt #attcttat 2100 ttgtatgccg tgtctggagc actggatggc ttgggggata agacagattc ta #catttgga 2160 ttgtttctat tcgagattgc aaattacaat cattctgatg aatatttgtc ct #tcagttgt 2220 tatttgtctg tcacagagca atcagagttc tattttccta gagctccatt aa #attcaaat 2280 gctatgttgt ccactgaatc catgatgagt agaattgcag ctggagactt gg #agtcatca 2340 gtggatgatc ccagatcaga ggaggataga agatttgaga gtcatataga at #gtaggaaa 2400 ccatacaaag aattgagact ggaggttggg aaacaaagac tcaaatatgc tc #aggaagag 2460 ttatcaaatg aagtgcttcc acctcctagg aaaatgaagg ggttattttc ac #aagctaaa 2520 atttctcttt tttatactga ggagcatgaa ataatgaagt tttcttggag ag #gagtgact 2580 gctgatacta gggctttgag aagatttgga ttctctctgg ctgctggtag aa #gtgtgtgg 2640 actcttgaaa tggatgctgg agttcttact ggaagattga tcagattgaa tg #atgagaaa 2700 tggacagaaa tgaaggatga taagattgtt tcattaattg aaaagttcac aa #gcaataaa 2760 tattggtcta aagtgaattt tccacatgga atgttggatc ttgaagaaat tg #ctgccaat 2820 tctaaggatt ttccaaatat gtctgagaca gatttgtgtt tcctgttaca tt #ggctaaat 2880 ccaaagaaaa tcaatttagc agatagaatg cttggattgt ctggagtgca gg #aaattaag 2940 gaacagggtg ttggactgat agcagagtgt agaactttct tggattctat tg #ctgggact 3000 ttgaaatcta tgatgtttgg gtttcatcat tctgtgactg ttgaaattat aa #atactgtg 3060 ctttgttttg ttaagagtgg aatcctgctt tatgtcatac aacaattgaa cc #aagatgaa 3120 cactctcaca taattggttt gttgagagtt atgaattatg cagatattgg ct #gttcagtt 3180 atttcatgtg gtaaagtttt ttccaaaatg ttagaaacag tttttaattg gc #aaatggat 3240 tctagaatga tggagctgag gactcagagc ttctctaatt ggttaagaga ta #tttgttca 3300 ggaattacta tttttaaaag ttttaaggat gccatatatt ggttatatac aa #aattgaag 3360 gatttttatg aagtaaatta tggcaagaaa aaggatattc ttaatattct ca #aagataat 3420 cagcaaaaaa tagaaaaagc cattgaagaa gcagacaatt tttgcatttt gc #aaattcaa 3480 gatgtagaga aatttgatca gtatcagaaa ggggttgatt taatacaaaa gc #tgagaact 3540 gtccattcaa tggcgcaagt tgaccccaat ttgggggttc atttgtcacc tc #tcagagat 3600 tgcatagcaa gagtccacca aaagctcaag aatcttggat ctataaatca gg #ccatggta 3660 acaagatgtg agccagttgt ttgctatttg tatggcaaaa gagggggagg ga #aaagcttg 3720 acttcaattg cattggcaac caaaatttgt aaacactatg gtgttgaacc tg #agaaaaat 3780 atttacacca aacctgtggc ctcagattat tgggatggat atagtggaca at #tagtttgc 3840 attattgatg atattggcca aaacacaaca gatgaagatt ggtcagattt tt #gtcaatta 3900 gtgtcaggat gcccaatgag attgaatatg gcttctctag aggagaaggg ca #gacatttt 3960 tcctctcctt ttataatagc aacttcaaat tggtcaaatc caagtccaaa aa #cagtttat 4020 gttaaggaag caattgatcg taggcttcat tttaaggttg aagttaaacc tg #cttcattt 4080 tttaaaaatc ctcacaatga tatgttgaat gttaatttgg ccaaaacaaa tg #atgcaatt 4140 aaggacatgt cttgtgttga tttaataatg gatggacaca atatttcatt ga #tggattta 4200 cttagttcct tagtgatgac agttgaaatt aggaaacaga atatgagtga at #tcatggag 4260 ttgtggtctc agggaatttc agatgatgac aatgatagtg cagtggctga gt #ttttccag 4320 tcttttccat ctggtgaacc atcaaattgg aagttatcta gttttttcca at #ctgtcact 4380 aatcacaagt gggttgctgt gggagctgca gttggcattc ttggagtgct tg #tgggagga 4440 tggtttgtgt ataagcattt ttcccgcaaa gaggaagaac caattccagc tg #aaggggtt 4500 tatcatggcg tgactaagcc caaacaagtg attaaattgg atgcagatcc ag #tagagtcc 4560 cagtcaactc tagaaatagc aggattagtt aggaaaaatc tggttcagtt tg #gagttggt 4620 gagaaaaatg gatgtgtgag atgggtcatg aatgccttag gagtgaagga tg #attggttg 4680 ttagtacctt ctcatgctta taaatttgaa aaggattatg aaatgatgga gt #tttacttc 4740 aatagaggtg gaacttacta ttcaatttca gctggtaatg ttgttattca at #ctttagat 4800 gtgggatttc aagatgttgt tttaatgaag gtttctacaa ttcccaagtt ta #gagatatt 4860 actcaacact ttattaagaa aggagatgtg cctagagcct taaatcgctt gg #caacatta 4920 gtgacaaccg ttaatggaac tcctatgtta atttctgagg gaccattaaa ga #tggaagaa 4980 aaagccactt atgttcataa gaagaatgat ggtactacag ttgatttgac tg #tagatcag 5040 gcatggagag gaaaaggtga aggtcttcct ggaatgtgtg gtggggccct ag #tgtcatca 5100 aatcagtcca tacagaatgc aattttgggt attcatgttg ctggaggaaa tt #caattctt 5160 gtggcaaagc tggttactca agaaatgttt caaaacattg ataagaaaat tg #aaagtcag 5220 agaataatga aagtggaatt tactcaatgt tcaatgaatg tagtctccaa aa #cgcttttt 5280 agaaagagtc ccattcatca ccacattgat agaaccatga ttaattttcc tg #cagctatg 5340 cctttctcta aagctgaaat tgatccaatg gctatgatgt tgtccaaata tt #cattacct 5400 attgtggagg aaccagagga ttacaaggaa gcttcagttt tttatcaaaa ca #aaatagta 5460 ggcaagactc agctagttga tgacttttta gatcttgata tggctattac ag #gggctcca 5520 ggcattgatg ctatcaatat ggattcatct cctgggtttc cttatgttca ag #aaaaattg 5580 accaaaagag atttaatttg gttggatgaa aatggtttgc tgttaggagt tc #acccaaga 5640 ttggcccaga gaattttatt taatactgtc atgatggaaa attgttctga ct #tagatgtt 5700 gtttttacaa cttgtccaaa agatgaattg agaccattag aaaaagtttt gg #aatcaaaa 5760 acaagagcca ttgatgcttg tcctttggat tatacaattc tatgtcgaat gt #attggggt 5820 ccagctatca gttatttcca tttgaatcca gggtttcaca caggtgttgc ta #ttggcata 5880 gatcctgata gacagtggga tgaattattt aaaacaatga taagatttgg ag #atgttggt 5940 cttgatttag atttctctgc ttttgatgcc agtcttagtc catttatgat ta #gggaagca 6000 ggtagaatca tgagtgaatt atctggaaca ccatctcatt ttggaacagc tc #ttatcaat 6060 actatcattt attctaaaca tctgctgtac aactgttgtt atcatgtttg tg #gttcaatg 6120 ccttctgggt ctccttgcac agctttgttg aattcaatta ttaataatat ta #atctgtat 6180 tatgtgtttt ctaaaatatt tggaaagtct ccagttttct tttgtcaagc tt #tgaggatc 6240 ctttgttacg gagatgatgt tttgatagtt ttttccagag atgttcaaat tg #acaatctt 6300 gacttgattg gacagaaaat tgtagatgag ttcaaaaaac ttggcatgac ag #ccacctca 6360 gctgataaaa atgtgcctca actgaagcca gtttcagaat tgacttttct ca #aaagatct 6420 ttcaatttgg tggaggatag aattagacct gcaatttcag aaaagacaat tt #ggtctttg 6480 atggcttggc agagaagtaa cgctgagttt gagcagaatt tagaaaatgc tc #agtggttt 6540 gcttttatgc atggctatga gttctatcag aaattttatt attttgttca gt #cctgtttg 6600 gagaaagaga tgatagaata tagacttaaa tcttatgatt ggtggagaat ga #gattttat 6660 gaccagtgtt tcatttgtga cctttcatga tttgtttaaa caaattttct ta #ctctttct 6720 gaggtttgtt tatttctttt gtccgctaac tgcatgc # # 6757 <210> SEQ ID NO 31 <211> LENGTH: 2508 <212> TYPE: DNA <213> ORGANISM: Artificial Sequence <220> FEATURE: <223> OTHER INFORMATION: Description of Artificial #Sequence: recombinant protein of 94 kDa <400> SEQUENCE: 31 atgaatatgt ccaaacaagg aattttccgg actgttggga gtggccttga cc #acatcctg 60 tctttggcag atattgagga agagcaaatg attcagtccg ttgataggac tg #cagtgact 120 ggagcttctt acttcacttc tgtggaccaa tcttcagttc atactgctga gg #ttggctca 180 catcaaattg aacctttgaa aacctctgtt gataaacctg gttctaagaa aa #ctcagggg 240 gaaaagtttt tcctgattca ttctgctgat tggctcacta cacatgctct ct #ttcatgaa 300 gttgcaaaat tggatgtggt gaaactactg tataatgagc agtttgccgt cc #aaggtttg 360 ttgagatacc atacatatgc aagatttggc attgagattc aagttcagat aa #atcccaca 420 ccctttcagc aaggaggact aatttgtgcc atggttcctg gtgaccaaag tt #atggttca 480 atagcatcct tgactgttta tcctcatggt ctgttaaatt gcaatatcaa ca #atgtagtt 540 agaataaagg ttccatttat ttatactaga ggtgcttatc attttaaaga tc #cacagtac 600 ccagtttggg aattgacaat cagagtttgg tcagagttga atattggaac ag #gaacttca 660 gcttacactt cactcaatgt tttagctagg tttacagatt tggagttgca tg #gattaact 720 cctctttcta cacagatgat gagaaatgaa tttagggtca gtactactga aa #atgttgta 780 aatttgtcaa attatgaaga tgcaagggca aaaatgtctt ttgctttgga tc #aggaagat 840 tggaagtctg atccttccca aggtggtgga attaaaatta ctcattttac ta #cctggaca 900 tccattccaa ccttagctgc tcagtttcca tttaatgctt cagattcagt tg #gacaacaa 960 attaaagtta ttccagtgga cccatacttt ttccaaatga caaacactaa tc #ctgatcaa 1020 aaatgtataa ctgccttggc ctctatttgt cagatgttct gcttttggag gg #gagatctt 1080 gtttttgatt ttcaggtttt tccaaccaaa tatcattcag gtagactgtt gt #tttgtttt 1140 gttcctggga atgagttaat agatgttact ggaattacat taaaacaggc aa #ctactgct 1200 ccttgtgcag tgatggacat tacaggagtg cagtcaacct tgagatttcg tg #ttccttgg 1260 atttctgata caccttatcg agtgaatagg tacacgaagt cagcacatca aa #aaggtgag 1320 tacactgcca ttgggaagct tattgtgtat tgttataaca gactgacttc tc #cttctaat 1380 gttgcctctc atgttagagt taatgtttat ctttcagcaa ttaatttgga at #gttttgct 1440 cctctttacc atgctatgga tgttactaca caggttggag atgattcagg ag #gtttctca 1500 acaacagttt ctacagagca gaatgttcct gatccccaag ttgggataac aa #ccatgagg 1560 gattcaaaag gaaaagccaa taggggaaag atggatgttt caggagtgca ag #cacctgtg 1620 ggagctatca caacaattga agatccagtt ttagcaaaga aagtacctga ga #catttcct 1680 gaattgaagc ctggagagtc cagacataca tcagatcaca tgtctattta ta #aattcatg 1740 ggaaggtctc attttttgtg cacttttact ttcaattcaa ataataaaga gt #acacattt 1800 ccaataaccc tgtcttcgac ttctaatcct cctcatggtt taccatcaac at #taaggtgg 1860 ttcttcaatt tgtttcagtt gtatagagga ccattggatt taacaattat aa #tcacagga 1920 gccactgatg tggatggtat ggcctggttt actccagtgg gccttgctgt cg #acacccct 1980 tgggtggaaa aggagtcagc tttgtctatt gattataaaa ctgcccttgg ag #ctgttaga 2040 tttaatacaa gaagaacagg aatcatccaa attagattgc cgtggtattc tt #atttgtat 2100 gccgtgtctg gagcactgga tggcttgggg gataagacag attctacatt tg #gattggtt 2160 tctattcaga ttgcaaatta caatcattct gatgaatatt tgtccttcag tt #gttatttg 2220 tctgtcacag agcaatcaga gttctatttt cctagagctc cattaaattc aa #atgctatg 2280 ttgtccactg aatccatgat gagtagaatt gcagctggag acttggagtc at #cagtggat 2340 gatcccagat cagaggagga tagaagattt gagagtcata tagaatgtag ga #aaccatac 2400 aaagaattga gactggaggt tgggaaacaa agactcaaat atgctcagga ag #agttatca 2460 aatgaagtgc ttccacctcc taggaaaatc aaggggttat tttcacaa # 2508 <210> SEQ ID NO 32 <211> LENGTH: 2940 <212> TYPE: DNA <213> ORGANISM: Artificial Sequence <220> FEATURE: <223> OTHER INFORMATION: Description of Artificial #Sequence: recombinant protein of 115.5 kDa <400> SEQUENCE: 32 atgaatatgt ccaaacaagg aattttccag actgttggga gtggccttga cc #acatcctg 60 tctttggcag atattgagga agagcaaatg attcagtccg ttgataggac tg #cagtgact 120 ggagcttctt acttcacttc tgtggaccaa tcttcagttc atactgctga gg #ttggctca 180 catcaaattg aacctttgaa aacctctgtt gataaacctg gttctaagaa aa #ctcagggg 240 gaaaagtttt tcctgattca ttctgctgat tggctcacta cacatgctct ct #ttcatgaa 300 gttgcaaaat tggatgtggt gaaactactg tataatgagc agtttgccgt cc #aaggtttg 360 ttgagatacc atacatatgc aagatttggc attgagattc aagttcagat aa #atcccaca 420 ccctttcagc aaggaggact aatttgtgcc atggttcctg gtgaccaaag tt #atggttca 480 atagcatcct tgactgttta tcctcatggt ctgttaaatt gcaatatcaa ca #atgtagtt 540 agaataaagg ttccatttat ttatactaga ggtgcttatc attttaaaga tc #cacagtac 600 ccagtttggg aattgacaat cagagtttgg tcagagttga atattggaac ag #gaacttca 660 gcttacactt cactcaatgt tttagctagg tttacagatt tggagttgca tg #gattaact 720 cctctttcta cacagatgat gagaaatgaa tttagggtca gtactactga aa #atgttgta 780 aatttgtcaa attatgaaga tgcaagggca aaaatgtctt ttgctttgga tc #aggaagat 840 tggaagtctg atccttccca aggtggtgga attaaaatta ctcattttac ta #cctggaca 900 tccattccaa ccttagctgc tcagtttcca tttaatgctt cagattcagt tg #gacaacaa 960 attaaagtta ttccagtgga cccatacttt ttccaaatga caaacactaa tc #ctgatcaa 1020 aaatgtataa ctgccttggc ctctatttgt cagatgttct gcttttggag gg #gagatctt 1080 gtttttgatt ttcaggtttt tccaaccaaa tatcattcag gtagactgtt gt #tttgtttt 1140 gttcctggga atgagttaat agatgttact ggaattacat taaaacaggc aa #ctactgct 1200 ccttgtgcag tgatggacat tacaggagtg cagtcaacct tgagatttcg tg #ttccttgg 1260 atttctgata caccttatcg agtgaatagg tacacgaagt cagcacatca aa #aaggtgag 1320 tacactgcca ttgggaagct tattgtgtat tgttataaca gactgacttc tc #cttctaat 1380 gttgcctctc atgttagagt taatgtttat ctttcagcaa ttaatttgga at #gttttgct 1440 cctctttacc atgctatgga tgttactaca caggttggag atgattcagg ag #gtttctca 1500 acaacagttt ctacagagca gaatgttcct gatccccaag ttgggataac aa #ccatgagg 1560 gatttaaaag gaaaagccaa taggggaaag atggatgttt caggagtgca ag #cacctgtg 1620 ggagctatca caacaattga agatccagtt ttagcaaaga aagtacctga ga #catttcct 1680 gaattgaagc ctggagagtc cagacataca tcagatcaca tgtctattta ta #aattcatg 1740 ggaaggtctc attttttgtg cacttttact ttcaattcaa ataataaaga gt #acacattt 1800 ccaataaccc tgtcttcgac ttctaatcct cctcatggtt taccatcaac at #taaggtgg 1860 ttcttcaatt tgtttcagtt gtatagagga ccattggatt taacaattat aa #tcacagga 1920 gccactgatg tggatggtat ggcctggttt actccagtgg gccttgctgt cg #acacccct 1980 tgggtggaaa aggagtcagc tttgtctatt gattataaaa ctgcccttgg ag #ctgttaga 2040 tttaatacaa gaagaacagg aaacattcaa attagattgc cgtggtattc tt #atttgtat 2100 gccgtgtctg gagcactgga tggcttgggg gataagacag attctacatt tg #gattggtt 2160 tctattcaga ttgcaaatta caatcattct gatgaatatt tgtccttcag tt #gttatttg 2220 tctgtcacag agcaatcaga gttctatttt cctagagctc cattaaattc aa #atgctatg 2280 ttgtccactg aatccatgat gagtagaatt gcagctggag acttggagtc at #cagtggat 2340 gatcccagat cagaggagga tagaagattt gagagtcata tagaatgtag ga #aaccatac 2400 aaagaattga gactggaggt tgggaaacaa agactcaaat atgctcagga ag #agttatca 2460 aatgaagtgc ttccacctcc taggaaaatg aaaggcctat tttcacaagc ta #aaatttct 2520 cttttttata ctgaggagca tgaaataatg aagttttctt ggagaggagt ga #ctgctgat 2580 actagggctt tgagaagatt tggattctct ctggctgctg gtagaagtgt gt #ggactctt 2640 gaaatggatg ctggagttct tactggagga ttgatcagat tgaatgatga ga #aatggaca 2700 gaaatgaagg atgataagat tgtttcatta attgaaaagt tcacaagcaa ta #aatattgg 2760 tctaaagtga attttccgca tgcaatgttg gatcttgaag aaattgctgc ca #attcgaag 2820 gattttccaa atatgtctga gacagatttg tgtttcctgt tacattggct aa #atccaaag 2880 aaaatcaatt tagcagatag aatgcttgga ttgtctggag tgcaggaaat ta #aggaacag 2940 <210> SEQ ID NO 33 <211> LENGTH: 669 <212> TYPE: DNA <213> ORGANISM: Artificial Sequence <220> FEATURE: <223> OTHER INFORMATION: Description of Artificial #Sequence: recombinant protein of 25 kDa <400> SEQUENCE: 33 atggatattg aggaagagca aatgattcag tccgttgata ggactgcagt ga #ctggagct 60 tcttacttca cttctgtgga ccaatcttca gttcatactg ctgaggttgg ct #cacatcaa 120 attgaacctt tgaaaacctc tgttgataaa cctggttcta agaaaactca gg #gggaaaag 180 tttttcctga ttcattctgc tgattggctc actacacatg ctctctttca tg #aagttgca 240 aaattggatg tggtgaaact actgtataat gagcagtttg ccgtccaagg tt #tgttgaga 300 taccatacat atgcaagatt tggcattgag attcaagttc agataaatcc ca #cacccttt 360 cagcaaggag gactaatttg tgccatggtt cctggtgacc aaagttatgg tt #caatagca 420 tccttgactg tttatcctca tggtctgtta aattgcaata tcaacaatgt ag #ttagaata 480 aaggttccat ttatttatac tagaggtgct tatcatttta aagatccaca gt #acccagtt 540 tgggaattga caatcagagt ttggtcagag ttgaatattg gaacaggaac tt #cagcttac 600 acttcactca atgttttagc taggtttaca gatttggagt tgcatggatt aa #ctcctctt 660 tctacacag # # # 669 <210> SEQ ID NO 34 <211> LENGTH: 744 <212> TYPE: DNA <213> ORGANISM: Artificial Sequence <220> FEATURE: <223> OTHER INFORMATION: Description of Artificial #Sequence: recombinant protein of 28 kDa <400> SEQUENCE: 34 atggctatga tgagaaatga atttagggtc agtactactg aaaatgttgt aa #atttgtca 60 aattatgaag atgcaagggc aaaaatgtct tttgctttgg atcaggaaga tt #ggaagtct 120 gatccttccc aaggtggtgg aattaaaatt actcatttta ctacctggac at #ccattcca 180 accttagctg ctcagtttcc atttaatgct tcagattcag ttggacaaca aa #ttaaagtt 240 attccagtgg acccatactt tttccaaatg acaaacacta atcctgatca aa #aatgtata 300 actgccttgg cctctatttg tcagatgttc tgcttttgga ggggagatct tg #tttttgat 360 tttcaggttt ttccaaccaa atatcattca ggtagactgt tgttttgttt tg #ttcctggg 420 aatgagttaa tagatgttac tggaattaca ttaaaacagg caactactgc tc #cttgtgca 480 gtgatggaca ttacaggagt gcagtcaacc ttgagatttc gtgttccttg ga #tttctgat 540 acaccttatc gagtgaatag gtacacgaag tcagcacatc aaaaaggtga gt #acactgcc 600 attgggaagc ttattgtgta ttgttataac agactgactt ctccttctaa tg #ttgcctct 660 catgttagag ttaatgttta tctttcagca attaatttgg aatgttttgc tc #ctctttac 720 catgctatgg atgttactac acag # # 744 <210> SEQ ID NO 35 <211> LENGTH: 906 <212> TYPE: DNA <213> ORGANISM: Artificial Sequence <220> FEATURE: <223> OTHER INFORMATION: Description of Artificial #Sequence: recombinant protein of 33.3 kDa <400> SEQUENCE: 35 atggctgttg gagatgattc aggaggtttc tcaacaacag tttctacaga gc #agaatgtt 60 cctgatcccc aagttgggat aacaaccatg agggattcaa aaggaaaagc ca #atagggga 120 aagatggatg tttcaggagt gcaagcacct gtgggagcta tcacaacaat tg #aagatcca 180 gttttagcaa agaaagtacc tgagacattt cctgaattga agcctggaga gt #ccagacat 240 acatcagatc acatgtctat ttataaattc atgggaaggt ctcatttttt gt #gcactttt 300 actttcaatt caaataataa agagtacaca tttccaataa ccctgtcttc ga #cttctaat 360 cctcctcatg gtttaccatc aacattaagg tggttcttca atttgtttca gt #tgtataga 420 ggaccattgg atttaacaat tataatcaca ggagccactg atgtggatgg ta #tggcctgg 480 tttactccag tgggccttgc tgtcgacacc ccttgggtgg aaaaggagtc ag #ctttgtct 540 attgattata aaactgccct tggagctgtt agatttaata caagaagaac ag #gaaacatc 600 caaattagat tgccgtggta ttcttatttg tatgccgtgt ctggagcact gg #atggcttg 660 gggggtaaga cagattctac atttggattg gtttctattc agattgcaaa tt #acaatcat 720 tctgatgaat atttgtcctt cagttgttat ttgtctgtca cagagcaatc ag #agttctat 780 tttcctagag ctccattaaa ttcaaatgct atgttgtcca ctgaatccat ga #tgagtaga 840 attgcagctg gagacttgga gtcatcagtg gatgatccca gatcagagga gg #atagaaga 900 tttgag # # # 906 <210> SEQ ID NO 36 <211> LENGTH: 1056 <212> TYPE: DNA <213> ORGANISM: Artificial Sequence <220> FEATURE: <223> OTHER INFORMATION: Description of Artificial #Sequence: recombinant protein of 38.8 kDa <400> SEQUENCE: 36 atggctacaa aggctgtttg tgttttgaag ggtgacggcc cagttcaagg ta #ttattaac 60 ttcgagcaga aggaaagtaa tggaccagtg aaggtgtggg gaagcattaa ag #gactgact 120 gaaggcctgc atggattcca tgttcatgag tttggagata atacagcagg ct #gtaccagt 180 gcaggtcctc actttaatcc tctatccaga aaacacggtg ggccaaagga tg #aagagagg 240 catgttggag acttgggcaa tgtgactgct gacaaagatg gtgtggccga tg #tgtctatt 300 gaagattctg tgatctcact ctcaggagac cattgcatca ttggccgcac ac #tggtggtc 360 catgaaaaag cagatgactt gggcaaaggt ggaaatgaag aaagtacaaa ga #caggaaac 420 gctggaagtc gtttggcttg tggtgtaatt gggatcgccc agaatttggg aa #ttcagatc 480 tctcgagcta gtcatataga atgtaggaaa ccatacaaag aattgagact gg #aggttggg 540 aaacaaagac tcaaatatgc tcaggaagag ttatcaaatg aagtgcttcc ac #ctcctagg 600 aaaatgaagg ggttattttc acaagctaaa atttctcttt tttatactga gg #agcatgaa 660 ataatgaagt tttcttggag aggagtgact gctgatacta gggctttgag aa #gatttgga 720 ttctctctgg ctgctggtag aagtgtgtgg actcttgaaa tggatgctgg ag #ttcttact 780 ggaggattga tcagattgaa tgatgagaaa tggacagaaa tgaaggatga ta #agattgtt 840 tcattaattg aaaagttcac aagcaataaa tattggtcta aagtgaattt tc #cgcatgca 900 atgttggatc ttgaagaaat tgctgccaat tcgaaggatt ttccaaatat gt #ctgagaca 960 gatttgtgtt tcctgttaca ttggctaaat ccaaagaaaa tcaatttagc ag #atagaatg 1020 cttggattgt ctggagtgca ggaaattaag gaacag # # 1056 <210> SEQ ID NO 37 <211> LENGTH: 708 <212> TYPE: DNA <213> ORGANISM: Artificial Sequence <220> FEATURE: <223> OTHER INFORMATION: Description of Artificial #Sequence: recombinant protein of 24.9 kDa <400> SEQUENCE: 37 atggctacaa aggctgtttg tgttttgaag ggtgacggcc cagttcaagg ta #ttattaac 60 ttcgagcaga aggaaagtaa tggaccagtg aaggtgtggg gaagcattaa ag #gactgact 120 gaaggcctgc atggattcca tgttcatgag tttggagata atacagcagg ct #gtaccagt 180 gcaggtcctc actttaatcc tctatccaga aaacacggtg ggccaaagga tg #aagagagg 240 catgttggag acttgggcaa tgtgactgct gacaaagatg gtgtggccga tg #tgtctatt 300 gaagattctg tgatctcact ctcaggagac cattgcatca ttggccgcac ac #tggtggtc 360 catgaaaaag cagatgactt gggcaaaggt ggaaatgaag aaagtacaaa ga #caggaaac 420 gctggaagtc gtttggcttg tggtgtaatt gggatcgccc agaatttggg aa #ttcagatc 480 tctcgaggaa tttcagatga tgacaatgat agtgcaatgg ctgagttttt cc #agtctttt 540 ccatctggtg aaccatcaaa ttccaagtta tctagttttt tccaatctgt ca #ctaatcac 600 aagtgggttg ctgtgggagc tgcagttggc attcttggag tgcttgtggg ag #gatggttt 660 gtgtataagc atttttcccg caaagaggaa gaaccaattc cagctgaa # 708 <210> SEQ ID NO 38 <211> LENGTH: 1148 <212> TYPE: DNA <213> ORGANISM: Artificial Sequence <220> FEATURE: <223> OTHER INFORMATION: Description of Artificial #Sequence: recombinant protein of 41 kDa <400> SEQUENCE: 38 ccatggctac aaaggctgtt tgtgttttga agggtgacgg cccagttcaa gg #tattatta 60 acttcgagca gaaggaaagt aatggaccag tgaaggtgtg gggaagcatt aa #aggactga 120 ctgaaggcct gcatggattc catgttcatg agtttggaga taatacagca gg #ctgtacca 180 gtgcaggtcc tcactttaat cctctatcca gaaaacacgg tgggccaaag ga #tgaagaga 240 ggcatgttgg agacttgggc aatgtgactg ctgacaaaga tggtgtggcc ga #tgtgtcta 300 ttgaagattc tgtgatctca ctctcaggag accattgcat cattggccgc ac #actggtgg 360 tccatgaaaa agcagatgac ttgggcaaag gtggaaatga agaaagtaca aa #gacaggaa 420 acgctggaag tcgtttggct tgtggtgtaa ttgggatcgc ccagaatttg gg #aattcaga 480 tctctcgagc atcaactcta gaaatagcag gattagttag gaaaaatctg gt #tcagtttg 540 gagttggtga gaaaaatgga tgtgtgagat gggtcatgaa tgccttagga gt #gaaggatg 600 attggttgtt agtaccttct catgcttata aatttgaaaa ggattatgaa at #gatggagt 660 tttacttcaa tagaggtgga acttactatt caatttcagc tggtaatgtt gt #tattcaat 720 ctttagatgt gggatttcaa gatgttgttt taatgaaggt tcctacaatt cc #caagttta 780 gagatattac tcaacacttt attaagaaag gagatgtgcc tagagcctta aa #tcgcttgg 840 caacattagt gacaaccgtt aatggaactc ctatgttaat ttctgaggga cc #attaaaga 900 tggaagaaaa agccacttat gttcataaga agaatgatgg tactacagtt ga #tttgactg 960 tagatcaggc atggagagga aaaggtgaag gtcttcctgg aatgtgtggt gg #ggccctag 1020 tgtcatcaaa tcagtccata cagaatgcaa ttttgggtat tcatgttgct gg #aggaaatt 1080 caattcttgt ggcaaagctg gttactcaag aaatgtttca aaacattgat aa #gaaaattg 1140 aaagtcag # # # 1148 <210> SEQ ID NO 39 <211> LENGTH: 1956 <212> TYPE: DNA <213> ORGANISM: Artificial Sequence <220> FEATURE: <223> OTHER INFORMATION: Description of Artificial #Sequence: recombinant protein of human superoxide dismutas #e fused with the HAV nonstructural protein 3D <400> SEQUENCE: 39 atggctacaa aggctgtttg tgttttgaag ggtgacggcc cagttcaagg ta #ttattaac 60 ttcgagcaga aggaaagtaa tggaccagtg aaggtgtggg gaagcattaa ag #gactgact 120 gaaggcctgc atggattcca tgttcatgag tttggagata atacagcagg ct #gtaccagt 180 gcaggtcctc actttaatcc tctatccaga aaacacggtg ggccaaagga tg #aagagagg 240 catgttggag acttgggcaa tgtgactgct gacaaagatg gtgtggccga tg #tgtctatt 300 gaagattctg tgatctcact ctcaggagac cattgcatca ttggccgcac ac #tggtggtc 360 catgaaaaag cagatgactt gggcaaaggt ggaaatgaag aaagtacaaa ga #caggaaac 420 gctggaagtc gtttggcttg tggtgtaatt gggatcgccc agaatttggg aa #ttcagatc 480 tctcgagcaa gaataatgaa agtggaattt actcaatgtt caatgaatgt ag #tctccaaa 540 acgcttttta gaaagagtcc cattcatcac cacattgata aaaccatgat ta #attttcct 600 gcagctatgc ctttctctaa agctgaaatt gatccaatgg ctatgacgtt gt #ccaaatat 660 tcattaccta ttgtggagga accagaggat tacaaggaag cttcagtttt tt #atcaaaac 720 aaaatagtag gcaagactca gctagttgat gactttttag atcttgatat gg #ctattaca 780 ggggctccag gcattgatgc tatcaatatg gattcatctc ctgggtttcc tt #atgttcaa 840 gaaaaattga ccaaaagaga tttaatttgg ttggatgaaa atggtttgct gt #taggagtt 900 cacccaagat tggcccagag aattttattt aatactgtca tgatggaaaa tt #gttctgac 960 ttagatgttg tttttacaac ttgtccaaaa gatgaattga gaccattaga ga #aagttttg 1020 gaatcaaaaa caagagccat tgatgcttgt cctttggatt atacaattct at #gtcgaatg 1080 tattggggtc cagctatcag ttatttccat ttgaatccag ggtttcacac ag #gtgttgct 1140 attggcatag atcctgataa acagtgggat gaattattta aaacaatgat aa #gatttgga 1200 gatgttggtc ttgatttaga tttctctgct tttgatgcca gtcttagtcc at #ttatgatt 1260 agggaagcag gtagaatcat gagtgaatta tctggaacac catctcattt tg #gaacagct 1320 cttatcaata ctatcattta ttctaaacat ctgctgtaca actgttgtta tc #atgtttgt 1380 ggttcaatgc cttctgggtc tccttgcaca gctttgttga attcaattat ta #ataatatt 1440 aatctgtatt atgtgttttc taaaatattt ggaaagtctc cagttttctt tt #gtcaagct 1500 ttgaggatcc tttgttacgg agatgatgtt ttgatagttt tttccagaga tg #ttcaaatt 1560 gacaatcttg acttgattgg acagaaaatt gtagatgagt tcaaaaaact tg #gcatgaca 1620 gccacctcag ctgataaaaa tgtgcctcaa ctgaagccag tttcagaatt ga #cttttctc 1680 aaaagatctt tcaatttggt ggaggataga attagacctg caatttcaga aa #agacaatt 1740 tggtctttga tggcttggca gagaagtaac gctgagtttg agcagaattt ag #aaaatgct 1800 cagtggtttg cttttatgca tggctatgag ttctatcaga aattttatta tt #ttgttcag 1860 tcctgtttgg agaaagagat gatagaatat agacttaaat cttatgattg gt #ggagaatg 1920 agattttatg accagtgttt catttgtgac ctttca # # 1956 <210> SEQ ID NO 40 <211> LENGTH: 836 <212> TYPE: PRT <213> ORGANISM: Artificial Sequence <220> FEATURE: <223> OTHER INFORMATION: Description of Artificial #Sequence: recombinant protein of 94 kDa <400> SEQUENCE: 40 Met Asn Met Ser Lys Gln Gly Ile Phe Arg Th #r Val Gly Ser Gly Leu 1 5 # 10 # 15 Asp His Ile Leu Ser Leu Ala Asp Ile Glu Gl #u Glu Gln Met Ile Gln 20 # 25 # 30 Ser Val Asp Arg Thr Ala Val Thr Gly Ala Se #r Tyr Phe Thr Ser Val 35 # 40 # 45 Asp Gln Ser Ser Val His Thr Ala Glu Val Gl #y Ser His Gln Ile Glu 50 # 55 # 60 Pro Leu Lys Thr Ser Val Asp Lys Pro Gly Se #r Lys Lys Thr Gln Gly 65 # 70 # 75 # 80 Glu Lys Phe Phe Leu Ile His Ser Ala Asp Tr #p Leu Thr Thr His Ala 85 # 90 # 95 Leu Phe His Glu Val Ala Lys Leu Asp Val Va #l Lys Leu Leu Tyr Asn 100 # 105 # 110 Glu Gln Phe Ala Val Gln Gly Leu Leu Arg Ty #r His Thr Tyr Ala Arg 115 # 120 # 125 Phe Gly Ile Glu Ile Gln Val Gln Ile Asn Pr #o Thr Pro Phe Gln Gln 130 # 135 # 140 Gly Gly Leu Ile Cys Ala Met Val Pro Gly As #p Gln Ser Tyr Gly Ser 145 1 #50 1 #55 1 #60 Ile Ala Ser Leu Thr Val Tyr Pro His Gly Le #u Leu Asn Cys Asn Ile 165 # 170 # 175 Asn Asn Val Val Arg Ile Lys Val Pro Phe Il #e Tyr Thr Arg Gly Ala 180 # 185 # 190 Tyr His Phe Lys Asp Pro Gln Tyr Pro Val Tr #p Glu Leu Thr Ile Arg 195 # 200 # 205 Val Trp Ser Glu Leu Asn Ile Gly Thr Gly Th #r Ser Ala Tyr Thr Ser 210 # 215 # 220 Leu Asn Val Leu Ala Arg Phe Thr Asp Leu Gl #u Leu His Gly Leu Thr 225 2 #30 2 #35 2 #40 Pro Leu Ser Thr Gln Met Met Arg Asn Glu Ph #e Arg Val Ser Thr Thr 245 # 250 # 255 Glu Asn Val Val Asn Leu Ser Asn Tyr Glu As #p Ala Arg Ala Lys Met 260 # 265 # 270 Ser Phe Ala Leu Asp Gln Glu Asp Trp Lys Se #r Asp Pro Ser Gln Gly 275 # 280 # 285 Gly Gly Ile Lys Ile Thr His Phe Thr Thr Tr #p Thr Ser Ile Pro Thr 290 # 295 # 300 Leu Ala Ala Gln Phe Pro Phe Asn Ala Ser As #p Ser Val Gly Gln Gln 305 3 #10 3 #15 3 #20 Ile Lys Val Ile Pro Val Asp Pro Tyr Phe Ph #e Gln Met Thr Asn Thr 325 # 330 # 335 Asn Pro Asp Gln Lys Cys Ile Thr Ala Leu Al #a Ser Ile Cys Gln Met 340 # 345 # 350 Phe Cys Phe Trp Arg Gly Asp Leu Val Phe As #p Phe Gln Val Phe Pro 355 # 360 # 365 Thr Lys Tyr His Ser Gly Arg Leu Leu Phe Cy #s Phe Val Pro Gly Asn 370 # 375 # 380 Glu Leu Ile Asp Val Thr Gly Ile Thr Leu Ly #s Gln Ala Thr Thr Ala 385 3 #90 3 #95 4 #00 Pro Cys Ala Val Met Asp Ile Thr Gly Val Gl #n Ser Thr Leu Arg Phe 405 # 410 # 415 Arg Val Pro Trp Ile Ser Asp Thr Pro Tyr Ar #g Val Asn Arg Tyr Thr 420 # 425 # 430 Lys Ser Ala His Gln Lys Gly Glu Tyr Thr Al #a Ile Gly Lys Leu Ile 435 # 440 # 445 Val Tyr Cys Tyr Asn Arg Leu Thr Ser Pro Se #r Asn Val Ala Ser His 450 # 455 # 460 Val Arg Val Asn Val Tyr Leu Ser Ala Ile As #n Leu Glu Cys Phe Ala 465 4 #70 4 #75 4 #80 Pro Leu Tyr His Ala Met Asp Val Thr Thr Gl #n Val Gly Asp Asp Ser 485 # 490 # 495 Gly Gly Phe Ser Thr Thr Val Ser Thr Glu Gl #n Asn Val Pro Asp Pro 500 # 505 # 510 Gln Val Gly Ile Thr Thr Met Arg Asp Ser Ly #s Gly Lys Ala Asn Arg 515 # 520 # 525 Gly Lys Met Asp Val Ser Gly Val Gln Ala Pr #o Val Gly Ala Ile Thr 530 # 535 # 540 Thr Ile Glu Asp Pro Val Leu Ala Lys Lys Va #l Pro Glu Thr Phe Pro 545 5 #50 5 #55 5 #60 Glu Leu Lys Pro Gly Glu Ser Arg His Thr Se #r Asp His Met Ser Ile 565 # 570 # 575 Tyr Lys Phe Met Gly Arg Ser His Phe Leu Cy #s Thr Phe Thr Phe Asn 580 # 585 # 590 Ser Asn Asn Lys Glu Tyr Thr Phe Pro Ile Th #r Leu Ser Ser Thr Ser 595 # 600 # 605 Asn Pro Pro His Gly Leu Pro Ser Thr Leu Ar #g Trp Phe Phe Asn Leu 610 # 615 # 620 Phe Gln Leu Tyr Arg Gly Pro Leu Asp Leu Th #r Ile Ile Ile Thr Gly 625 6 #30 6 #35 6 #40 Ala Thr Asp Val Asp Gly Met Ala Trp Phe Th #r Pro Val Gly Leu Ala 645 # 650 # 655 Val Asp Thr Pro Trp Val Glu Lys Glu Ser Al #a Leu Ser Ile Asp Tyr 660 # 665 # 670 Lys Thr Ala Leu Gly Ala Val Arg Phe Asn Th #r Arg Arg Thr Gly Ile 675 # 680 # 685 Ile Gln Ile Arg Leu Pro Trp Tyr Ser Tyr Le #u Tyr Ala Val Ser Gly 690 # 695 # 700 Ala Leu Asp Gly Leu Gly Asp Lys Thr Asp Se #r Thr Phe Gly Leu Val 705 7 #10 7 #15 7 #20 Ser Ile Gln Ile Ala Asn Tyr Asn His Ser As #p Glu Tyr Leu Ser Phe 725 # 730 # 735 Ser Cys Tyr Leu Ser Val Thr Glu Gln Ser Gl #u Phe Tyr Phe Pro Arg 740 # 745 # 750 Ala Pro Leu Asn Ser Asn Ala Met Leu Ser Th #r Glu Ser Met Met Ser 755 # 760 # 765 Arg Ile Ala Ala Gly Asp Leu Glu Ser Ser Va #l Asp Asp Pro Arg Ser 770 # 775 # 780 Glu Glu Asp Arg Arg Phe Glu Ser His Ile Gl #u Cys Arg Lys Pro Tyr 785 7 #90 7 #95 8 #00 Lys Glu Leu Arg Leu Glu Val Gly Lys Gln Ar #g Leu Lys Tyr Ala Gln 805 # 810 # 815 Glu Glu Leu Ser Asn Glu Val Leu Pro Pro Pr #o Arg Lys Ile Lys Gly 820 # 825 # 830 Leu Phe Ser Gln 835 <210> SEQ ID NO 41 <211> LENGTH: 980 <212> TYPE: PRT <213> ORGANISM: Artificial Sequence <220> FEATURE: <223> OTHER INFORMATION: Description of Artificial #Sequence: recombinant protein of 115.5 kDa <400> SEQUENCE: 41 Met Asn Met Ser Lys Gln Gly Ile Phe Gln Th #r Val Gly Ser Gly Leu 1 5 # 10 # 15 Asp His Ile Leu Ser Leu Ala Asp Ile Glu Gl #u Glu Gln Met Ile Gln 20 # 25 # 30 Ser Val Asp Arg Thr Ala Val Thr Gly Ala Se #r Tyr Phe Thr Ser Val 35 # 40 # 45 Asp Gln Ser Ser Val His Thr Ala Glu Val Gl #y Ser His Gln Ile Glu 50 # 55 # 60 Pro Leu Lys Thr Ser Val Asp Lys Pro Gly Se #r Lys Lys Thr Gln Gly 65 # 70 # 75 # 80 Glu Lys Phe Phe Leu Ile His Ser Ala Asp Tr #p Leu Thr Thr His Ala 85 # 90 # 95 Leu Phe His Glu Val Ala Lys Leu Asp Val Va #l Lys Leu Leu Tyr Asn 100 # 105 # 110 Glu Gln Phe Ala Val Gln Gly Leu Leu Arg Ty #r His Thr Tyr Ala Arg 115 # 120 # 125 Phe Gly Ile Glu Ile Gln Val Gln Ile Asn Pr #o Thr Pro Phe Gln Gln 130 # 135 # 140 Gly Gly Leu Ile Cys Ala Met Val Pro Gly As #p Gln Ser Tyr Gly Ser 145 1 #50 1 #55 1 #60 Ile Ala Ser Leu Thr Val Tyr Pro His Gly Le #u Leu Asn Cys Asn Ile 165 # 170 # 175 Asn Asn Val Val Arg Ile Lys Val Pro Phe Il #e Tyr Thr Arg Gly Ala 180 # 185 # 190 Tyr His Phe Lys Asp Pro Gln Tyr Pro Val Tr #p Glu Leu Thr Ile Arg 195 # 200 # 205 Val Trp Ser Glu Leu Asn Ile Gly Thr Gly Th #r Ser Ala Tyr Thr Ser 210 # 215 # 220 Leu Asn Val Leu Ala Arg Phe Thr Asp Leu Gl #u Leu His Gly Leu Thr 225 2 #30 2 #35 2 #40 Pro Leu Ser Thr Gln Met Met Arg Asn Glu Ph #e Arg Val Ser Thr Thr 245 # 250 # 255 Glu Asn Val Val Asn Leu Ser Asn Tyr Glu As #p Ala Arg Ala Lys Met 260 # 265 # 270 Ser Phe Ala Leu Asp Gln Glu Asp Trp Lys Se #r Asp Pro Ser Gln Gly 275 # 280 # 285 Gly Gly Ile Lys Ile Thr His Phe Thr Thr Tr #p Thr Ser Ile Pro Thr 290 # 295 # 300 Leu Ala Ala Gln Phe Pro Phe Asn Ala Ser As #p Ser Val Gly Gln Gln 305 3 #10 3 #15 3 #20 Ile Lys Val Ile Pro Val Asp Pro Tyr Phe Ph #e Gln Met Thr Asn Thr 325 # 330 # 335 Asn Pro Asp Gln Lys Cys Ile Thr Ala Leu Al #a Ser Ile Cys Gln Met 340 # 345 # 350 Phe Cys Phe Trp Arg Gly Asp Leu Val Phe As #p Phe Gln Val Phe Pro 355 # 360 # 365 Thr Lys Tyr His Ser Gly Arg Leu Leu Phe Cy #s Phe Val Pro Gly Asn 370 # 375 # 380 Glu Leu Ile Asp Val Thr Gly Ile Thr Leu Ly #s Gln Ala Thr Thr Ala 385 3 #90 3 #95 4 #00 Pro Cys Ala Val Met Asp Ile Thr Gly Val Gl #n Ser Thr Leu Arg Phe 405 # 410 # 415 Arg Val Pro Trp Ile Ser Asp Thr Pro Tyr Ar #g Val Asn Arg Tyr Thr 420 # 425 # 430 Lys Ser Ala His Gln Lys Gly Glu Tyr Thr Al #a Ile Gly Lys Leu Ile 435 # 440 # 445 Val Tyr Cys Tyr Asn Arg Leu Thr Ser Pro Se #r Asn Val Ala Ser His 450 # 455 # 460 Val Arg Val Asn Val Tyr Leu Ser Ala Ile As #n Leu Glu Cys Phe Ala 465 4 #70 4 #75 4 #80 Pro Leu Tyr His Ala Met Asp Val Thr Thr Gl #n Val Gly Asp Asp Ser 485 # 490 # 495 Gly Gly Phe Ser Thr Thr Val Ser Thr Glu Gl #n Asn Val Pro Asp Pro 500 # 505 # 510 Gln Val Gly Ile Thr Thr Met Arg Asp Leu Ly #s Gly Lys Ala Asn Arg 515 # 520 # 525 Gly Lys Met Asp Val Ser Gly Val Gln Ala Pr #o Val Gly Ala Ile Thr 530 # 535 # 540 Thr Ile Glu Asp Pro Val Leu Ala Lys Lys Va #l Pro Glu Thr Phe Pro 545 5 #50 5 #55 5 #60 Glu Leu Lys Pro Gly Glu Ser Arg His Thr Se #r Asp His Met Ser Ile 565 # 570 # 575 Tyr Lys Phe Met Gly Arg Ser His Phe Leu Cy #s Thr Phe Thr Phe Asn 580 # 585 # 590 Ser Asn Asn Lys Glu Tyr Thr Phe Pro Ile Th #r Leu Ser Ser Thr Ser 595 # 600 # 605 Asn Pro Pro His Gly Leu Pro Ser Thr Leu Ar #g Trp Phe Phe Asn Leu 610 # 615 # 620 Phe Gln Leu Tyr Arg Gly Pro Leu Asp Leu Th #r Ile Ile Ile Thr Gly 625 6 #30 6 #35 6 #40 Ala Thr Asp Val Asp Gly Met Ala Trp Phe Th #r Pro Val Gly Leu Ala 645 # 650 # 655 Val Asp Thr Pro Trp Val Glu Lys Glu Ser Al #a Leu Ser Ile Asp Tyr 660 # 665 # 670 Lys Thr Ala Leu Gly Ala Val Arg Phe Asn Th #r Arg Arg Thr Gly Asn 675 # 680 # 685 Ile Gln Ile Arg Leu Pro Trp Tyr Ser Tyr Le #u Tyr Ala Val Ser Gly 690 # 695 # 700 Ala Leu Asp Gly Leu Gly Asp Lys Thr Asp Se #r Thr Phe Gly Leu Val 705 7 #10 7 #15 7 #20 Ser Ile Gln Ile Ala Asn Tyr Asn His Ser As #p Glu Tyr Leu Ser Phe 725 # 730 # 735 Ser Cys Tyr Leu Ser Val Thr Glu Gln Ser Gl #u Phe Tyr Phe Pro Arg 740 # 745 # 750 Ala Pro Leu Asn Ser Asn Ala Met Leu Ser Th #r Glu Ser Met Met Ser 755 # 760 # 765 Arg Ile Ala Ala Gly Asp Leu Glu Ser Ser Va #l Asp Asp Pro Arg Ser 770 # 775 # 780 Glu Glu Asp Arg Arg Phe Glu Ser His Ile Gl #u Cys Arg Lys Pro Tyr 785 7 #90 7 #95 8 #00 Lys Glu Leu Arg Leu Glu Val Gly Lys Gln Ar #g Leu Lys Tyr Ala Gln 805 # 810 # 815 Glu Glu Leu Ser Asn Glu Val Leu Pro Pro Pr #o Arg Lys Met Lys Gly 820 # 825 # 830 Leu Phe Ser Gln Ala Lys Ile Ser Leu Phe Ty #r Thr Glu Glu His Glu 835 # 840 # 845 Ile Met Lys Phe Ser Trp Arg Gly Val Thr Al #a Asp Thr Arg Ala Leu 850 # 855 # 860 Arg Arg Phe Gly Phe Ser Leu Ala Ala Gly Ar #g Ser Val Trp Thr Leu 865 8 #70 8 #75 8 #80 Glu Met Asp Ala Gly Val Leu Thr Gly Gly Le #u Ile Arg Leu Asn Asp 885 # 890 # 895 Glu Lys Trp Thr Glu Met Lys Asp Asp Lys Il #e Val Ser Leu Ile Glu 900 # 905 # 910 Lys Phe Thr Ser Asn Lys Tyr Trp Ser Lys Va #l Asn Phe Pro His Ala 915 # 920 # 925 Met Leu Asp Leu Glu Glu Ile Ala Ala Asn Se #r Lys Asp Phe Pro Asn 930 # 935 # 940 Met Ser Glu Thr Asp Leu Cys Phe Leu Leu Hi #s Trp Leu Asn Pro Lys 945 9 #50 9 #55 9 #60 Lys Ile Asn Leu Ala Asp Arg Met Leu Gly Le #u Ser Gly Val Gln Glu 965 # 970 # 975 Ile Lys Glu Gln 980 <210> SEQ ID NO 42 <211> LENGTH: 223 <212> TYPE: PRT <213> ORGANISM: Artificial Sequence <220> FEATURE: <223> OTHER INFORMATION: Description of Artificial #Sequence: recombinant protein of 25 kDa <400> SEQUENCE: 42 Met Asp Ile Glu Glu Glu Gln Met Ile Gln Se #r Val Asp Arg Thr Ala 1 5 # 10 # 15 Val Thr Gly Ala Ser Tyr Phe Thr Ser Val As #p Gln Ser Ser Val His 20 # 25 # 30 Thr Ala Glu Val Gly Ser His Gln Ile Glu Pr #o Leu Lys Thr Ser Val 35 # 40 # 45 Asp Lys Pro Gly Ser Lys Lys Thr Gln Gly Gl #u Lys Phe Phe Leu Ile 50 # 55 # 60 His Ser Ala Asp Trp Leu Thr Thr His Ala Le #u Phe His Glu Val Ala 65 # 70 # 75 # 80 Lys Leu Asp Val Val Lys Leu Leu Tyr Asn Gl #u Gln Phe Ala Val Gln 85 # 90 # 95 Gly Leu Leu Arg Tyr His Thr Tyr Ala Arg Ph #e Gly Ile Glu Ile Gln 100 # 105 # 110 Val Gln Ile Asn Pro Thr Pro Phe Gln Gln Gl #y Gly Leu Ile Cys Ala 115 # 120 # 125 Met Val Pro Gly Asp Gln Ser Tyr Gly Ser Il #e Ala Ser Leu Thr Val 130 # 135 # 140 Tyr Pro His Gly Leu Leu Asn Cys Asn Ile As #n Asn Val Val Arg Ile 145 1 #50 1 #55 1 #60 Lys Val Pro Phe Ile Tyr Thr Arg Gly Ala Ty #r His Phe Lys Asp Pro 165 # 170 # 175 Gln Tyr Pro Val Trp Glu Leu Thr Ile Arg Va #l Trp Ser Glu Leu Asn 180 # 185 # 190 Ile Gly Thr Gly Thr Ser Ala Tyr Thr Ser Le #u Asn Val Leu Ala Arg 195 # 200 # 205 Phe Thr Asp Leu Glu Leu His Gly Leu Thr Pr #o Leu Ser Thr Gln 210 # 215 # 220 <210> SEQ ID NO 43 <211> LENGTH: 248 <212> TYPE: PRT <213> ORGANISM: Artificial Sequence <220> FEATURE: <223> OTHER INFORMATION: Description of Artificial #Sequence: recombinant protein of 28 kDa <400> SEQUENCE: 43 Met Ala Met Met Arg Asn Glu Phe Arg Val Se #r Thr Thr Glu Asn Val 1 5 # 10 # 15 Val Asn Leu Ser Asn Tyr Glu Asp Ala Arg Al #a Lys Met Ser Phe Ala 20 # 25 # 30 Leu Asp Gln Glu Asp Trp Lys Ser Asp Pro Se #r Gln Gly Gly Gly Ile 35 # 40 # 45 Lys Ile Thr His Phe Thr Thr Trp Thr Ser Il #e Pro Thr Leu Ala Ala 50 # 55 # 60 Gln Phe Pro Phe Asn Ala Ser Asp Ser Val Gl #y Gln Gln Ile Lys Val 65 # 70 # 75 # 80 Ile Pro Val Asp Pro Tyr Phe Phe Gln Met Th #r Asn Thr Asn Pro Asp 85 # 90 # 95 Gln Lys Cys Ile Thr Ala Leu Ala Ser Ile Cy #s Gln Met Phe Cys Phe 100 # 105 # 110 Trp Arg Gly Asp Leu Val Phe Asp Phe Gln Va #l Phe Pro Thr Lys Tyr 115 # 120 # 125 His Ser Gly Arg Leu Leu Phe Cys Phe Val Pr #o Gly Asn Glu Leu Ile 130 # 135 # 140 Asp Val Thr Gly Ile Thr Leu Lys Gln Ala Th #r Thr Ala Pro Cys Ala 145 1 #50 1 #55 1 #60 Val Met Asp Ile Thr Gly Val Gln Ser Thr Le #u Arg Phe Arg Val Pro 165 # 170 # 175 Trp Ile Ser Asp Thr Pro Tyr Arg Val Asn Ar #g Tyr Thr Lys Ser Ala 180 # 185 # 190 His Gln Lys Gly Glu Tyr Thr Ala Ile Gly Ly #s Leu Ile Val Tyr Cys 195 # 200 # 205 Tyr Asn Arg Leu Thr Ser Pro Ser Asn Val Al #a Ser His Val Arg Val 210 # 215 # 220 Asn Val Tyr Leu Ser Ala Ile Asn Leu Glu Cy #s Phe Ala Pro Leu Tyr 225 2 #30 2 #35 2 #40 His Ala Met Asp Val Thr Thr Gln 245 <210> SEQ ID NO 44 <211> LENGTH: 302 <212> TYPE: PRT <213> ORGANISM: Artificial Sequence <220> FEATURE: <223> OTHER INFORMATION: Description of Artificial #Sequence: recombinant protein of 33.3 kDa <400> SEQUENCE: 44 Met Ala Val Gly Asp Asp Ser Gly Gly Phe Se #r Thr Thr Val Ser Thr 1 5 # 10 # 15 Glu Gln Asn Val Pro Asp Pro Gln Val Gly Il #e Thr Thr Met Arg Asp 20 # 25 # 30 Ser Lys Gly Lys Ala Asn Arg Gly Lys Met As #p Val Ser Gly Val Gln 35 # 40 # 45 Ala Pro Val Gly Ala Ile Thr Thr Ile Glu As #p Pro Val Leu Ala Lys 50 # 55 # 60 Lys Val Pro Glu Thr Phe Pro Glu Leu Lys Pr #o Gly Glu Ser Arg His 65 # 70 # 75 # 80 Thr Ser Asp His Met Ser Ile Tyr Lys Phe Me #t Gly Arg Ser His Phe 85 # 90 # 95 Leu Cys Thr Phe Thr Phe Asn Ser Asn Asn Ly #s Glu Tyr Thr Phe Pro 100 # 105 # 110 Ile Thr Leu Ser Ser Thr Ser Asn Pro Pro Hi #s Gly Leu Pro Ser Thr 115 # 120 # 125 Leu Arg Trp Phe Phe Asn Leu Phe Gln Leu Ty #r Arg Gly Pro Leu Asp 130 # 135 # 140 Leu Thr Ile Ile Ile Thr Gly Ala Thr Asp Va #l Asp Gly Met Ala Trp 145 1 #50 1 #55 1 #60 Phe Thr Pro Val Gly Leu Ala Val Asp Thr Pr #o Trp Val Glu Lys Glu 165 # 170 # 175 Ser Ala Leu Ser Ile Asp Tyr Lys Thr Ala Le #u Gly Ala Val Arg Phe 180 # 185 # 190 Asn Thr Arg Arg Thr Gly Asn Ile Gln Ile Ar #g Leu Pro Trp Tyr Ser 195 # 200 # 205 Tyr Leu Tyr Ala Val Ser Gly Ala Leu Asp Gl #y Leu Gly Gly Lys Thr 210 # 215 # 220 Asp Ser Thr Phe Gly Leu Val Ser Ile Gln Il #e Ala Asn Tyr Asn His 225 2 #30 2 #35 2 #40 Ser Asp Glu Tyr Leu Ser Phe Ser Cys Tyr Le #u Ser Val Thr Glu Gln 245 # 250 # 255 Ser Glu Phe Tyr Phe Pro Arg Ala Pro Leu As #n Ser Asn Ala Met Leu 260 # 265 # 270 Ser Thr Glu Ser Met Met Ser Arg Ile Ala Al #a Gly Asp Leu Glu Ser 275 # 280 # 285 Ser Val Asp Asp Pro Arg Ser Glu Glu Asp Ar #g Arg Phe Glu 290 # 295 # 300 <210> SEQ ID NO 45 <211> LENGTH: 352 <212> TYPE: PRT <213> ORGANISM: Artificial Sequence <220> FEATURE: <223> OTHER INFORMATION: Description of Artificial #Sequence: recombinant protein of 38.8 kDa <400> SEQUENCE: 45 Met Ala Thr Lys Ala Val Cys Val Leu Lys Gl #y Asp Gly Pro Val Gln 1 5 # 10 # 15 Gly Ile Ile Asn Phe Glu Gln Lys Glu Ser As #n Gly Pro Val Lys Val 20 # 25 # 30 Trp Gly Ser Ile Lys Gly Leu Thr Glu Gly Le #u His Gly Phe His Val 35 # 40 # 45 His Glu Phe Gly Asp Asn Thr Ala Gly Cys Th #r Ser Ala Gly Pro His 50 # 55 # 60 Phe Asn Pro Leu Ser Arg Lys His Gly Gly Pr #o Lys Asp Glu Glu Arg 65 # 70 # 75 # 80 His Val Gly Asp Leu Gly Asn Val Thr Ala As #p Lys Asp Gly Val Ala 85 # 90 # 95 Asp Val Ser Ile Glu Asp Ser Val Ile Ser Le #u Ser Gly Asp His Cys 100 # 105 # 110 Ile Ile Gly Arg Thr Leu Val Val His Glu Ly #s Ala Asp Asp Leu Gly 115 # 120 # 125 Lys Gly Gly Asn Glu Glu Ser Thr Lys Thr Gl #y Asn Ala Gly Ser Arg 130 # 135 # 140 Leu Ala Cys Gly Val Ile Gly Ile Ala Gln As #n Leu Gly Ile Gln Ile 145 1 #50 1 #55 1 #60 Ser Arg Ala Ser His Ile Glu Cys Arg Lys Pr #o Tyr Lys Glu Leu Arg 165 # 170 # 175 Leu Glu Val Gly Lys Gln Arg Leu Lys Tyr Al #a Gln Glu Glu Leu Ser 180 # 185 # 190 Asn Glu Val Leu Pro Pro Pro Arg Lys Met Ly #s Gly Leu Phe Ser Gln 195 # 200 # 205 Ala Lys Ile Ser Leu Phe Tyr Thr Glu Glu Hi #s Glu Ile Met Lys Phe 210 # 215 # 220 Ser Trp Arg Gly Val Thr Ala Asp Thr Arg Al #a Leu Arg Arg Phe Gly 225 2 #30 2 #35 2 #40 Phe Ser Leu Ala Ala Gly Arg Ser Val Trp Th #r Leu Glu Met Asp Ala 245 # 250 # 255 Gly Val Leu Thr Gly Gly Leu Ile Arg Leu As #n Asp Glu Lys Trp Thr 260 # 265 # 270 Glu Met Lys Asp Asp Lys Ile Val Ser Leu Il #e Glu Lys Phe Thr Ser 275 # 280 # 285 Asn Lys Tyr Trp Ser Lys Val Asn Phe Pro Hi #s Ala Met Leu Asp Leu 290 # 295 # 300 Glu Glu Ile Ala Ala Asn Ser Lys Asp Phe Pr #o Asn Met Ser Glu Thr 305 3 #10 3 #15 3 #20 Asp Leu Cys Phe Leu Leu His Trp Leu Asn Pr #o Lys Lys Ile Asn Leu 325 # 330 # 335 Ala Asp Arg Met Leu Gly Leu Ser Gly Val Gl #n Glu Ile Lys Glu Gln 340 # 345 # 350 <210> SEQ ID NO 46 <211> LENGTH: 236 <212> TYPE: PRT <213> ORGANISM: Artificial Sequence <220> FEATURE: <223> OTHER INFORMATION: Description of Artificial #Sequence: recombinant protein of 24.9 kDa <400> SEQUENCE: 46 Met Ala Thr Lys Ala Val Cys Val Leu Lys Gl #y Asp Gly Pro Val Gln 1 5 # 10 # 15 Gly Ile Ile Asn Phe Glu Gln Lys Glu Ser As #n Gly Pro Val Lys Val 20 # 25 # 30 Trp Gly Ser Ile Lys Gly Leu Thr Glu Gly Le #u His Gly Phe His Val 35 # 40 # 45 His Glu Phe Gly Asp Asn Thr Ala Gly Cys Th #r Ser Ala Gly Pro His 50 # 55 # 60 Phe Asn Pro Leu Ser Arg Lys His Gly Gly Pr #o Lys Asp Glu Glu Arg 65 # 70 # 75 # 80 His Val Gly Asp Leu Gly Asn Val Thr Ala As #p Lys Asp Gly Val Ala 85 # 90 # 95 Asp Val Ser Ile Glu Asp Ser Val Ile Ser Le #u Ser Gly Asp His Cys 100 # 105 # 110 Ile Ile Gly Arg Thr Leu Val Val His Glu Ly #s Ala Asp Asp Leu Gly 115 # 120 # 125 Lys Gly Gly Asn Glu Glu Ser Thr Lys Thr Gl #y Asn Ala Gly Ser Arg 130 # 135 # 140 Leu Ala Cys Gly Val Ile Gly Ile Ala Gln As #n Leu Gly Ile Gln Ile 145 1 #50 1 #55 1 #60 Ser Arg Gly Ile Ser Asp Asp Asp Asn Asp Se #r Ala Met Ala Glu Phe 165 # 170 # 175 Phe Gln Ser Phe Pro Ser Gly Glu Pro Ser As #n Ser Lys Leu Ser Ser 180 # 185 # 190 Phe Phe Gln Ser Val Thr Asn His Lys Trp Va #l Ala Val Gly Ala Ala 195 # 200 # 205 Val Gly Ile Leu Gly Val Leu Val Gly Gly Tr #p Phe Val Tyr Lys His 210 # 215 # 220 Phe Ser Arg Lys Glu Glu Glu Pro Ile Pro Al #a Glu 225 2 #30 2 #35 <210> SEQ ID NO 47 <211> LENGTH: 382 <212> TYPE: PRT <213> ORGANISM: Artificial Sequence <220> FEATURE: <223> OTHER INFORMATION: Description of Artificial #Sequence: recombinant protein of 41 kDa <400> SEQUENCE: 47 Met Ala Thr Lys Ala Val Cys Val Leu Lys Gl #y Asp Gly Pro Val Gln 1 5 # 10 # 15 Gly Ile Ile Asn Phe Glu Gln Lys Glu Ser As #n Gly Pro Val Lys Val 20 # 25 # 30 Trp Gly Ser Ile Lys Gly Leu Thr Glu Gly Le #u His Gly Phe His Val 35 # 40 # 45 His Glu Phe Gly Asp Asn Thr Ala Gly Cys Th #r Ser Ala Gly Pro His 50 # 55 # 60 Phe Asn Pro Leu Ser Arg Lys His Gly Gly Pr #o Lys Asp Glu Glu Arg 65 # 70 # 75 # 80 His Val Gly Asp Leu Gly Asn Val Thr Ala As #p Lys Asp Gly Val Ala 85 # 90 # 95 Asp Val Ser Ile Glu Asp Ser Val Ile Ser Le #u Ser Gly Asp His Cys 100 # 105 # 110 Ile Ile Gly Arg Thr Leu Val Val His Glu Ly #s Ala Asp Asp Leu Gly 115 # 120 # 125 Lys Gly Gly Asn Glu Glu Ser Thr Lys Thr Gl #y Asn Ala Gly Ser Arg 130 # 135 # 140 Leu Ala Cys Gly Val Ile Gly Ile Ala Gln As #n Leu Gly Ile Gln Ile 145 1 #50 1 #55 1 #60 Ser Arg Ala Ser Thr Leu Glu Ile Ala Gly Le #u Val Arg Lys Asn Leu 165 # 170 # 175 Val Gln Phe Gly Val Gly Glu Lys Asn Gly Cy #s Val Arg Trp Val Met 180 # 185 # 190 Asn Ala Leu Gly Val Lys Asp Asp Trp Leu Le #u Val Pro Ser His Ala 195 # 200 # 205 Tyr Lys Phe Glu Lys Asp Tyr Glu Met Met Gl #u Phe Tyr Phe Asn Arg 210 # 215 # 220 Gly Gly Thr Tyr Tyr Ser Ile Ser Ala Gly As #n Val Val Ile Gln Ser 225 2 #30 2 #35 2 #40 Leu Asp Val Gly Phe Gln Asp Val Val Leu Me #t Lys Val Pro Thr Ile 245 # 250 # 255 Pro Lys Phe Arg Asp Ile Thr Gln His Phe Il #e Lys Lys Gly Asp Val 260 # 265 # 270 Pro Arg Ala Leu Asn Arg Leu Ala Thr Leu Va #l Thr Thr Val Asn Gly 275 # 280 # 285 Thr Pro Met Leu Ile Ser Glu Gly Pro Leu Ly #s Met Glu Glu Lys Ala 290 # 295 # 300 Thr Tyr Val His Lys Lys Asn Asp Gly Thr Th #r Val Asp Leu Thr Val 305 3 #10 3 #15 3 #20 Asp Gln Ala Trp Arg Gly Lys Gly Glu Gly Le #u Pro Gly Met Cys Gly 325 # 330 # 335 Gly Ala Leu Val Ser Ser Asn Gln Ser Ile Gl #n Asn Ala Ile Leu Gly 340 # 345 # 350 Ile His Val Ala Gly Gly Asn Ser Ile Leu Va #l Ala Lys Leu Val Thr 355 # 360 # 365 Gln Glu Met Phe Gln Asn Ile Asp Lys Lys Il #e Glu Ser Gln 370 # 375 # 380 <210> SEQ ID NO 48 <211> LENGTH: 652 <212> TYPE: PRT <213> ORGANISM: Artificial Sequence <220> FEATURE: <223> OTHER INFORMATION: Description of Artificial #Sequence: recombinant protein of human superoxide dismutas #e fused with the HAV nonstructural protein <400> SEQUENCE: 48 Met Ala Thr Lys Ala Val Cys Val Leu Lys Gl #y Asp Gly Pro Val Gln 1 5 # 10 # 15 Gly Ile Ile Asn Phe Glu Gln Lys Glu Ser As #n Gly Pro Val Lys Val 20 # 25 # 30 Trp Gly Ser Ile Lys Gly Leu Thr Glu Gly Le #u His Gly Phe His Val 35 # 40 # 45 His Glu Phe Gly Asp Asn Thr Ala Gly Cys Th #r Ser Ala Gly Pro His 50 # 55 # 60 Phe Asn Pro Leu Ser Arg Lys His Gly Gly Pr #o Lys Asp Glu Glu Arg 65 # 70 # 75 # 80 His Val Gly Asp Leu Gly Asn Val Thr Ala As #p Lys Asp Gly Val Ala 85 # 90 # 95 Asp Val Ser Ile Glu Asp Ser Val Ile Ser Le #u Ser Gly Asp His Cys 100 # 105 # 110 Ile Ile Gly Arg Thr Leu Val Val His Glu Ly #s Ala Asp Asp Leu Gly 115 # 120 # 125 Lys Gly Gly Asn Glu Glu Ser Thr Lys Thr Gl #y Asn Ala Gly Ser Arg 130 # 135 # 140 Leu Ala Cys Gly Val Ile Gly Ile Ala Gln As #n Leu Gly Ile Gln Ile 145 1 #50 1 #55 1 #60 Ser Arg Ala Arg Ile Met Lys Val Glu Phe Th #r Gln Cys Ser Met Asn 165 # 170 # 175 Val Val Ser Lys Thr Leu Phe Arg Lys Ser Pr #o Ile His His His Ile 180 # 185 # 190 Asp Lys Thr Met Ile Asn Phe Pro Ala Ala Me #t Pro Phe Ser Lys Ala 195 # 200 # 205 Glu Ile Asp Pro Met Ala Met Thr Leu Ser Ly #s Tyr Ser Leu Pro Ile 210 # 215 # 220 Val Glu Glu Pro Glu Asp Tyr Lys Glu Ala Se #r Val Phe Tyr Gln Asn 225 2 #30 2 #35 2 #40 Lys Ile Val Gly Lys Thr Gln Leu Val Asp As #p Phe Leu Asp Leu Asp 245 # 250 # 255 Met Ala Ile Thr Gly Ala Pro Gly Ile Asp Al #a Ile Asn Met Asp Ser 260 # 265 # 270 Ser Pro Gly Phe Pro Tyr Val Gln Glu Lys Le #u Thr Lys Arg Asp Leu 275 # 280 # 285 Ile Trp Leu Asp Glu Asn Gly Leu Leu Leu Gl #y Val His Pro Arg Leu 290 # 295 # 300 Ala Gln Arg Ile Leu Phe Asn Thr Val Met Me #t Glu Asn Cys Ser Asp 305 3 #10 3 #15 3 #20 Leu Asp Val Val Phe Thr Thr Cys Pro Lys As #p Glu Leu Arg Pro Leu 325 # 330 # 335 Glu Lys Val Leu Glu Ser Lys Thr Arg Ala Il #e Asp Ala Cys Pro Leu 340 # 345 # 350 Asp Tyr Thr Ile Leu Cys Arg Met Tyr Trp Gl #y Pro Ala Ile Ser Tyr 355 # 360 # 365 Phe His Leu Asn Pro Gly Phe His Thr Gly Va #l Ala Ile Gly Ile Asp 370 # 375 # 380 Pro Asp Lys Gln Trp Asp Glu Leu Phe Lys Th #r Met Ile Arg Phe Gly 385 3 #90 3 #95 4 #00 Asp Val Gly Leu Asp Leu Asp Phe Ser Ala Ph #e Asp Ala Ser Leu Ser 405 # 410 # 415 Pro Phe Met Ile Arg Glu Ala Gly Arg Ile Me #t Ser Glu Leu Ser Gly 420 # 425 # 430 Thr Pro Ser His Phe Gly Thr Ala Leu Ile As #n Thr Ile Ile Tyr Ser 435 # 440 # 445 Lys His Leu Leu Tyr Asn Cys Cys Tyr His Va #l Cys Gly Ser Met Pro 450 # 455 # 460 Ser Gly Ser Pro Cys Thr Ala Leu Leu Asn Se #r Ile Ile Asn Asn Ile 465 4 #70 4 #75 4 #80 Asn Leu Tyr Tyr Val Phe Ser Lys Ile Phe Gl #y Lys Ser Pro Val Phe 485 # 490 # 495 Phe Cys Gln Ala Leu Arg Ile Leu Cys Tyr Gl #y Asp Asp Val Leu Ile 500 # 505 # 510 Val Phe Ser Arg Asp Val Gln Ile Asp Asn Le #u Asp Leu Ile Gly Gln 515 # 520 # 525 Lys Ile Val Asp Glu Phe Lys Lys Leu Gly Me #t Thr Ala Thr Ser Ala 530 # 535 # 540 Asp Lys Asn Val Pro Gln Leu Lys Pro Val Se #r Glu Leu Thr Phe Leu 545 5 #50 5 #55 5 #60 Lys Arg Ser Phe Asn Leu Val Glu Asp Arg Il #e Arg Pro Ala Ile Ser 565 # 570 # 575 Glu Lys Thr Ile Trp Ser Leu Met Ala Trp Gl #n Arg Ser Asn Ala Glu 580 # 585 # 590 Phe Glu Gln Asn Leu Glu Asn Ala Gln Trp Ph #e Ala Phe Met His Gly 595 # 600 # 605 Tyr Glu Phe Tyr Gln Lys Phe Tyr Tyr Phe Va #l Gln Ser Cys Leu Glu 610 # 615 # 620 Lys Glu Met Ile Glu Tyr Arg Leu Lys Ser Ty #r Asp Trp Trp Arg Met 625 6 #30 6 #35 6 #40 Arg Phe Tyr Asp Gln Cys Phe Ile Cys Asp Le #u Ser 645 # 650 <210> SEQ ID NO 49 <211> LENGTH: 24 <212> TYPE: DNA <213> ORGANISM: Artificial Sequence <220> FEATURE: <223> OTHER INFORMATION: Description of Artificial #Sequence: primer SN2172 <400> SEQUENCE: 49 gctcctcttt atcatgctat ggat # # 24 <210> SEQ ID NO 50 <211> LENGTH: 24 <212> TYPE: DNA <213> ORGANISM: Artificial Sequence <220> FEATURE: <223> OTHER INFORMATION: Description of Artificial #Sequence: primer SN2415 <400> SEQUENCE: 50 caggaaatgt ctcaggtact ttct # # 24
Claims (38)
1. A method for detecting antibodies in a biological sample that bind to a hepatitis A virus (HAV) antigen, said method comprising:
(a) providing an antigen comprising a sequence having at least 80% sequence identity to any one of the sequences of SEQ ID NOS:40-48, or an immunogenic fragment of any one of the sequences of SEQ ID NOS:40-48 comprising at least 10 amino acids;
(b) incubating said antigen with said biological sample under conditions that allow for formation of an antibody-antigen complex if HAV antibodies are present in said biological sample; and
(c) detecting the presence or absence of said antibody-antigen complexes.
2. The method of claim 1 , wherein said antigen comprises the sequence of SEQ ID NO:40, or an immunogenic fragment thereof comprising at least 10 amino acids.
3. The method of claim 1 , wherein said antigen comprises the sequence of SEQ ID NO:41, or an immunogenic fragment thereof comprising at least 10 amino acids.
4. The method of claim 1 , wherein said antigen comprises the sequence of SEQ ID NO:42, or an immunogenic fragment thereof comprising at least 10 amino acids.
5. The method of claim 1 , wherein said antigen comprises the sequence of SEQ ID NO:43, or an immunogenic fragment thereof comprising at least 10 amino acids.
6. The method of claim 1 , wherein said antigen comprises the sequence of SEQ ID NO:44, or an immunogenic fragment thereof comprising at least 10 amino acids.
7. The method of claim 1 , wherein said antigen comprises the sequence of SEQ ID NO:45, or an immunogenic fragment thereof comprising at least 10 amino acids.
8. The method of claim 1 , wherein said antigen comprises the sequence of SEQ ID NO:46, or an immunogenic fragment thereof comprising at least 10 amino acids.
9. The method of claim 1 , wherein said antigen comprises the sequence of SEQ ID NO:47, or an immunogenic fragment thereof comprising at least 10 amino acids.
10. The method of claim 1 , wherein said antigen comprises the sequence of SEQ ID NO:48, or an immunogenic fragment thereof comprising at least 10 amino acids.
11. The method of claim 1 , wherein said biological sample is human blood, plasma or serum.
12. The method of claim 1 , wherein said antigen in step (a) is immobilized on a solid support.
13. The method of claim 1 , wherein said antigen comprises a sequence having at least 90% sequence identity to any one of the sequences of SEQ ID NOS :40-48, or an immunogenic fragment thereof.
14. The method of claim 1 , wherein said antigen comprises a sequence having at least 95% sequence identity to any one of the sequences of SEQ ID NOS:40-48, or an immunogenic fragment thereof.
15. The method of claim 1 , wherein said antigen comprises a sequence having at least 99% sequence identity to any one of the sequences of SEQ ID NOS:40-48, or an immunogenic fragment thereof.
16. A method for detecting antibodies in a human blood, plasma or serum sample that bind to a hepatitis A virus (HAV) antigen, said method comprising:
(a) providing an antigen immobilized on a solid support, said antigen comprising any one of the sequences of SEQ ID NOS:40-48, or an immunogenic fragment of any one of the sequences of SEQ ID NOS:40-48 comprising at least 10 amino acids;
(b) incubating said antigen with said sample under conditions that allow for formation of an antibody-antigen complex if HAV antibodies are present in said sample; and
(c) detecting the presence or absence of said antibody-antigen complexes.
17. The method of claim 16 , wherein said antigen comprises the sequence of SEQ ID NO:40, or an immunogenic fragment thereof comprising at least 10 amino acids.
18. The method of claim 16 , wherein said antigen comprises the sequence of SEQ ID NO:41, or an immunogenic fragment thereof comprising at least 10 amino acids.
19. The method of claim 16 , wherein said antigen comprises the sequence of SEQ ID NO:42, or an immunogenic fragment thereof comprising at least 10 amino acids.
20. The method of claim 16 , wherein said antigen comprises the sequence of SEQ ID NO:43, or an immunogenic fragment thereof comprising at least 10 amino acids.
21. The method of claim 16 , wherein said antigen comprises the sequence of SEQ ID NO:44, or an immunogenic fragment thereof comprising at least 10 amino acids.
22. The method of claim 16 , wherein said antigen comprises the sequence of SEQ ID NO:45, or an immunogenic fragment thereof comprising at least 10 amino acids.
23. The method of claim 16 , wherein said antigen comprises the sequence of SEQ ID NO:46, or an immunogenic fragment thereof comprising at least 10 amino acids.
24. The method of claim 16 , wherein said antigen comprises the sequence of SEQ ID NO:47, or an immunogenic fragment thereof comprising at least 10 amino acids.
25. The method of claim 16 , wherein said antigen comprises the sequence of SEQ ID NO:48, or an immunogenic fragment thereof comprising at least 10 amino acids.
26. A solid support comprising an immobilized antigen, wherein said antigen comprises a sequence having at least 80% sequence identity to any one of the sequences of SEQ ID NOS:40-48, or an immunogenic fragment of any one of the sequences of SEQ ID NOS:40-48 comprising at least 10 amino acids.
27. The solid support of claim 26 , wherein said antigen comprises the sequence of SEQ ID NO:40, or an immunogenic fragment thereof comprising at least 10 amino acids.
28. The solid support of claim 26 , wherein said antigen comprises the sequence of SEQ ID NO:41, or an immunogenic fragment thereof comprising at least 10 amino acids.
29. The solid support of claim 26 , wherein said antigen comprises the sequence of SEQ ID NO:42, or an immunogenic fragment thereof comprising at least 10 amino acids.
30. The solid support of claim 26 , wherein said antigen comprises the sequence of SEQ ID NO:43, or an immunogenic fragment thereof comprising at least 10 amino acids.
31. The solid support of claim 26 , wherein said antigen comprises the sequence of SEQ ID NO:44, or an immunogenic fragment thereof comprising at least 10 amino acids.
32. The solid support of claim 26 , wherein said antigen comprises the sequence of SEQ ID NO:45, or an immunogenic fragment thereof comprising at least 10 amino acids.
33. The solid support of claim 26 , wherein said antigen comprises the sequence of SEQ ID NO:46, or an immunogenic fragment thereof comprising at least 10 amino acids.
34. The solid support of claim 26 , wherein said antigen comprises the sequence of SEQ ID NO:47, or an immunogenic fragment thereof comprising at least 10 amino acids.
35. The solid support of claim 26 , wherein said antigen comprises the sequence of SEQ ID NO:48, or an immunogenic fragment thereof comprising at least 10 amino acids.
36. An immunodiagnostic test kit for detecting hepatitis A virus (HAV) infection, said test kit comprising:
(a) an antigen comprising a sequence having at least 80% sequence identity to any one of the sequences of SEQ ID NOS:40-48, or an immunogenic fragment of any one of the sequences of SEQ ID NOS:40-48 comprising at least 10 amino acids; and
(b) instructions for conducting the immunodiagnostic test.
37. An immunodiagnostic test kit for detecting hepatitis A virus (HAV) infection, said test kit comprising:
(a) the solid support of claim 26; and
(b) instructions for conducting the immunodiagnostic test.
38. A method for detecting hepatitis A virus (HAV) in a biological sample, said method comprising:
(a) providing an antibody against an antigen that comprises a sequence having at least 80% sequence identity to any one of the sequences of SEQ ID NOS:40-48, or an immunogenic fragment of any one of the sequences of SEQ ID NOS:40-48 comprising at least 10 amino acids;
(b) incubating said antibody with said biological sample under conditions that allow for formation of an antibody-antigen complex if HAV is present in said biological sample; and
(c) detecting the presence or absence of said antibody-antigen complexes.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US11/051,613 US20050130135A1 (en) | 2001-10-12 | 2005-02-04 | Hepatitis a virus nucleotide sequences, recombinant proteins and uses thereof |
Applications Claiming Priority (3)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US32893301P | 2001-10-12 | 2001-10-12 | |
US10/272,459 US6942965B2 (en) | 2001-10-12 | 2002-10-15 | Hepatitis A virus nucleotide sequences, recombinant proteins and uses thereof |
US11/051,613 US20050130135A1 (en) | 2001-10-12 | 2005-02-04 | Hepatitis a virus nucleotide sequences, recombinant proteins and uses thereof |
Related Parent Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
US10/272,459 Continuation US6942965B2 (en) | 2001-10-12 | 2002-10-15 | Hepatitis A virus nucleotide sequences, recombinant proteins and uses thereof |
Publications (1)
Publication Number | Publication Date |
---|---|
US20050130135A1 true US20050130135A1 (en) | 2005-06-16 |
Family
ID=23283101
Family Applications (2)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
US10/272,459 Expired - Lifetime US6942965B2 (en) | 2001-10-12 | 2002-10-15 | Hepatitis A virus nucleotide sequences, recombinant proteins and uses thereof |
US11/051,613 Abandoned US20050130135A1 (en) | 2001-10-12 | 2005-02-04 | Hepatitis a virus nucleotide sequences, recombinant proteins and uses thereof |
Family Applications Before (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
US10/272,459 Expired - Lifetime US6942965B2 (en) | 2001-10-12 | 2002-10-15 | Hepatitis A virus nucleotide sequences, recombinant proteins and uses thereof |
Country Status (4)
Country | Link |
---|---|
US (2) | US6942965B2 (en) |
EP (1) | EP1521849A4 (en) |
JP (1) | JP2005528882A (en) |
WO (1) | WO2003030624A2 (en) |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20100325719A1 (en) * | 2009-06-19 | 2010-12-23 | Craig Stephen Etchegoyen | System and Method for Redundancy in a Communication Network |
Families Citing this family (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
ES2392445T3 (en) * | 2004-07-13 | 2012-12-10 | Gen-Probe Incorporated | Compositions and methods for the detection of hepatitis A virus nucleic acid |
JP5215865B2 (en) * | 2005-11-22 | 2013-06-19 | ノバルティス ヴァクシンズ アンド ダイアグノスティクス インコーポレイテッド | Norovirus and sapovirus antigens |
ES2945966T3 (en) * | 2011-07-15 | 2023-07-11 | Gen Probe Inc | Compositions and method for detecting human parvovirus nucleic acid and hepatitis A virus nucleic acid |
EP2643461B1 (en) * | 2011-09-07 | 2016-06-15 | Grifols Therapeutics Inc. | Methods, compositions, and kits for determining hepatitis a virus |
US8628916B2 (en) * | 2012-03-15 | 2014-01-14 | Industry Foundation Of Chonnam National University | Method for isolating hepatitis A virus or spring viremia of carp virus |
ES2712664T3 (en) * | 2015-07-23 | 2019-05-14 | Grifols Sa | Procedures for purification of a virus produced in vitro and virus elimination assay |
Citations (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5516630A (en) * | 1983-09-30 | 1996-05-14 | The United States Of America As Represented By The Department Of Health And Human Services | Methods of detecting hepatitis A virus |
Family Cites Families (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO1985001517A1 (en) * | 1983-09-30 | 1985-04-11 | Massachusetts Institute Of Technology | PRODUCTION OF cDNA REPRESENTING HEPATITIS A VIRAL SEQUENCES |
EP0154587B1 (en) * | 1984-03-02 | 1991-05-15 | Merck & Co. Inc. | Fragments of dna which encode peptides capable of inducing in vivo the synthesis of anti-hepatitis a virus antibodies |
DE19752898A1 (en) * | 1997-11-28 | 1999-08-05 | Centeon Pharma Gmbh | Method for the detection of high concentrations of four in blood plasma and / or blood serum by means of the polymerase chain reaction |
-
2002
- 2002-10-15 JP JP2003533673A patent/JP2005528882A/en active Pending
- 2002-10-15 US US10/272,459 patent/US6942965B2/en not_active Expired - Lifetime
- 2002-10-15 WO PCT/US2002/033121 patent/WO2003030624A2/en not_active Application Discontinuation
- 2002-10-15 EP EP02789210A patent/EP1521849A4/en not_active Withdrawn
-
2005
- 2005-02-04 US US11/051,613 patent/US20050130135A1/en not_active Abandoned
Patent Citations (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5516630A (en) * | 1983-09-30 | 1996-05-14 | The United States Of America As Represented By The Department Of Health And Human Services | Methods of detecting hepatitis A virus |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20100325719A1 (en) * | 2009-06-19 | 2010-12-23 | Craig Stephen Etchegoyen | System and Method for Redundancy in a Communication Network |
Also Published As
Publication number | Publication date |
---|---|
EP1521849A4 (en) | 2006-05-17 |
WO2003030624A2 (en) | 2003-04-17 |
JP2005528882A (en) | 2005-09-29 |
US6942965B2 (en) | 2005-09-13 |
EP1521849A2 (en) | 2005-04-13 |
WO2003030624A3 (en) | 2005-02-10 |
US20030124517A1 (en) | 2003-07-03 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
KR0185373B1 (en) | Nanbv diagnostics and vaccines | |
JP2532805B2 (en) | NANBV diagnostic agents and vaccines | |
CA2065287C (en) | New hcv isolates | |
RU2301263C2 (en) | Method for detecting human parvovirus b19 in biological sample (variants) and set for its realization | |
IE83377B1 (en) | New HCV isolate J-1 | |
US20050130135A1 (en) | Hepatitis a virus nucleotide sequences, recombinant proteins and uses thereof | |
JP2007513604A (en) | New enteroviruses, vaccines, medicines and diagnostic kits | |
DK175975B1 (en) | NANBV diagnostics and vaccines | |
AU2002353819A1 (en) | Hepatitis a virus nucleotide sequences, recombinant proteins and uses thereof | |
WO2022199526A1 (en) | Hepatitis e virus-like particles and uses thereof | |
DK176280B1 (en) | NANBV diagnostics and vaccines |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
AS | Assignment |
Owner name: CHIRON CORPORATION, CALIFORNIA Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNOR:PICHUANTES, SERGIO;REEL/FRAME:016696/0468 Effective date: 20050826 |
|
STCB | Information on status: application discontinuation |
Free format text: ABANDONED -- FAILURE TO RESPOND TO AN OFFICE ACTION |