KR20060041179A - Hiv-1 envelope glycoproteins having unusual disulfide structure - Google Patents
Hiv-1 envelope glycoproteins having unusual disulfide structure Download PDFInfo
- Publication number
- KR20060041179A KR20060041179A KR1020057023830A KR20057023830A KR20060041179A KR 20060041179 A KR20060041179 A KR 20060041179A KR 1020057023830 A KR1020057023830 A KR 1020057023830A KR 20057023830 A KR20057023830 A KR 20057023830A KR 20060041179 A KR20060041179 A KR 20060041179A
- Authority
- KR
- South Korea
- Prior art keywords
- asn
- thr
- sequence
- ile
- val
- Prior art date
Links
- 125000002228 disulfide group Chemical group 0.000 title abstract description 20
- 101710121417 Envelope glycoprotein Proteins 0.000 title abstract description 4
- 102100021696 Syncytin-1 Human genes 0.000 title abstract 2
- 108090000765 processed proteins & peptides Proteins 0.000 claims abstract description 292
- 102000004196 processed proteins & peptides Human genes 0.000 claims abstract description 281
- 229920001184 polypeptide Polymers 0.000 claims abstract description 274
- 102000040430 polynucleotide Human genes 0.000 claims abstract description 168
- 108091033319 polynucleotide Proteins 0.000 claims abstract description 168
- 239000002157 polynucleotide Substances 0.000 claims abstract description 168
- 239000000203 mixture Substances 0.000 claims abstract description 120
- 238000000034 method Methods 0.000 claims abstract description 102
- 230000002163 immunogen Effects 0.000 claims abstract description 89
- 239000013598 vector Substances 0.000 claims abstract description 65
- 230000014509 gene expression Effects 0.000 claims abstract description 38
- 238000002405 diagnostic procedure Methods 0.000 claims abstract description 9
- 210000004027 cell Anatomy 0.000 claims description 139
- 125000003275 alpha amino acid group Chemical group 0.000 claims description 71
- 125000000151 cysteine group Chemical group N[C@@H](CS)C(=O)* 0.000 claims description 64
- 235000018417 cysteine Nutrition 0.000 claims description 60
- 239000000523 sample Substances 0.000 claims description 46
- 150000007523 nucleic acids Chemical class 0.000 claims description 40
- 238000003556 assay Methods 0.000 claims description 38
- 102000039446 nucleic acids Human genes 0.000 claims description 37
- 108020004707 nucleic acids Proteins 0.000 claims description 37
- 239000002773 nucleotide Substances 0.000 claims description 36
- 125000003729 nucleotide group Chemical group 0.000 claims description 36
- 101800001690 Transmembrane protein gp41 Proteins 0.000 claims description 29
- 108010076504 Protein Sorting Signals Proteins 0.000 claims description 28
- 238000009396 hybridization Methods 0.000 claims description 28
- 230000003321 amplification Effects 0.000 claims description 27
- 238000003199 nucleic acid amplification method Methods 0.000 claims description 27
- 230000027455 binding Effects 0.000 claims description 26
- XUJNEKJLAYXESH-UHFFFAOYSA-N cysteine Natural products SCC(N)C(O)=O XUJNEKJLAYXESH-UHFFFAOYSA-N 0.000 claims description 25
- 108020004705 Codon Proteins 0.000 claims description 21
- 241001465754 Metazoa Species 0.000 claims description 19
- 108091034117 Oligonucleotide Proteins 0.000 claims description 19
- 230000003053 immunization Effects 0.000 claims description 18
- 125000001360 methionine group Chemical group N[C@@H](CCSC)C(=O)* 0.000 claims description 17
- 230000004927 fusion Effects 0.000 claims description 16
- JLCPHMBAVCMARE-UHFFFAOYSA-N [3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-hydroxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methyl [5-(6-aminopurin-9-yl)-2-(hydroxymethyl)oxolan-3-yl] hydrogen phosphate Polymers Cc1cn(C2CC(OP(O)(=O)OCC3OC(CC3OP(O)(=O)OCC3OC(CC3O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c3nc(N)[nH]c4=O)C(COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3CO)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cc(C)c(=O)[nH]c3=O)n3cc(C)c(=O)[nH]c3=O)n3ccc(N)nc3=O)n3cc(C)c(=O)[nH]c3=O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)O2)c(=O)[nH]c1=O JLCPHMBAVCMARE-UHFFFAOYSA-N 0.000 claims description 15
- 238000002649 immunization Methods 0.000 claims description 15
- 239000013604 expression vector Substances 0.000 claims description 14
- 229930182817 methionine Natural products 0.000 claims description 13
- 230000009870 specific binding Effects 0.000 claims description 13
- 108010078791 Carrier Proteins Proteins 0.000 claims description 12
- 239000002671 adjuvant Substances 0.000 claims description 12
- 238000001727 in vivo Methods 0.000 claims description 12
- 210000004962 mammalian cell Anatomy 0.000 claims description 12
- 102000014914 Carrier Proteins Human genes 0.000 claims description 11
- 108020004711 Nucleic Acid Probes Proteins 0.000 claims description 11
- 239000012472 biological sample Substances 0.000 claims description 11
- 230000003308 immunostimulating effect Effects 0.000 claims description 11
- 239000002853 nucleic acid probe Substances 0.000 claims description 11
- 241000894007 species Species 0.000 claims description 11
- 241000700584 Simplexvirus Species 0.000 claims description 10
- 239000003814 drug Substances 0.000 claims description 10
- 102000003978 Tissue Plasminogen Activator Human genes 0.000 claims description 9
- 108090000373 Tissue Plasminogen Activator Proteins 0.000 claims description 9
- 239000003795 chemical substances by application Substances 0.000 claims description 9
- 229940079593 drug Drugs 0.000 claims description 9
- 229960000187 tissue plasminogen activator Drugs 0.000 claims description 9
- 239000013603 viral vector Substances 0.000 claims description 9
- FWMNVWWHGCHHJJ-SKKKGAJSSA-N 4-amino-1-[(2r)-6-amino-2-[[(2r)-2-[[(2r)-2-[[(2r)-2-amino-3-phenylpropanoyl]amino]-3-phenylpropanoyl]amino]-4-methylpentanoyl]amino]hexanoyl]piperidine-4-carboxylic acid Chemical group C([C@H](C(=O)N[C@H](CC(C)C)C(=O)N[C@H](CCCCN)C(=O)N1CCC(N)(CC1)C(O)=O)NC(=O)[C@H](N)CC=1C=CC=CC=1)C1=CC=CC=C1 FWMNVWWHGCHHJJ-SKKKGAJSSA-N 0.000 claims description 8
- 238000004519 manufacturing process Methods 0.000 claims description 8
- 239000007790 solid phase Substances 0.000 claims description 8
- 239000003937 drug carrier Substances 0.000 claims description 6
- 230000001580 bacterial effect Effects 0.000 claims description 4
- 239000000833 heterodimer Substances 0.000 claims description 4
- 239000000710 homodimer Substances 0.000 claims description 4
- 102100022717 Atypical chemokine receptor 1 Human genes 0.000 claims description 3
- 101000678879 Homo sapiens Atypical chemokine receptor 1 Proteins 0.000 claims description 3
- 239000013078 crystal Substances 0.000 claims 1
- 238000003745 diagnosis Methods 0.000 claims 1
- 230000028993 immune response Effects 0.000 abstract description 32
- 241000713772 Human immunodeficiency virus 1 Species 0.000 description 75
- 108020004414 DNA Proteins 0.000 description 59
- 241000700605 Viruses Species 0.000 description 59
- 108090000623 proteins and genes Proteins 0.000 description 49
- 208000031886 HIV Infections Diseases 0.000 description 43
- XUJNEKJLAYXESH-REOHCLBHSA-N L-Cysteine Chemical compound SC[C@H](N)C(O)=O XUJNEKJLAYXESH-REOHCLBHSA-N 0.000 description 40
- 235000001014 amino acid Nutrition 0.000 description 40
- 241000725303 Human immunodeficiency virus Species 0.000 description 38
- 229940024606 amino acid Drugs 0.000 description 37
- 150000001413 amino acids Chemical class 0.000 description 34
- KDXKERNSBIXSRK-UHFFFAOYSA-N Lysine Natural products NCCCCC(N)C(O)=O KDXKERNSBIXSRK-UHFFFAOYSA-N 0.000 description 33
- 229960005486 vaccine Drugs 0.000 description 33
- BQVUABVGYYSDCJ-UHFFFAOYSA-N Nalpha-L-Leucyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)CC(C)C)C(O)=O)=CNC2=C1 BQVUABVGYYSDCJ-UHFFFAOYSA-N 0.000 description 29
- 108091007433 antigens Proteins 0.000 description 28
- 235000018102 proteins Nutrition 0.000 description 28
- 102000004169 proteins and genes Human genes 0.000 description 28
- 102000036639 antigens Human genes 0.000 description 27
- 239000000427 antigen Substances 0.000 description 25
- 150000001945 cysteines Chemical class 0.000 description 25
- 108010073969 valyllysine Proteins 0.000 description 25
- 239000012634 fragment Substances 0.000 description 24
- 239000000306 component Substances 0.000 description 23
- 238000011081 inoculation Methods 0.000 description 23
- 108010015796 prolylisoleucine Proteins 0.000 description 21
- 230000003612 virological effect Effects 0.000 description 21
- 108010087924 alanylproline Proteins 0.000 description 20
- WHUUTDBJXJRKMK-UHFFFAOYSA-N Glutamic acid Natural products OC(=O)C(N)CCC(O)=O WHUUTDBJXJRKMK-UHFFFAOYSA-N 0.000 description 19
- 230000002159 abnormal effect Effects 0.000 description 19
- 108010090333 leucyl-lysyl-proline Proteins 0.000 description 19
- 108010061238 threonyl-glycine Proteins 0.000 description 19
- 108010027345 wheylin-1 peptide Proteins 0.000 description 19
- PELCGFMHLZXWBQ-BJDJZHNGSA-N Ile-Ser-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)O)N PELCGFMHLZXWBQ-BJDJZHNGSA-N 0.000 description 18
- XKUKSGPZAADMRA-UHFFFAOYSA-N glycyl-glycyl-glycine Natural products NCC(=O)NCC(=O)NCC(O)=O XKUKSGPZAADMRA-UHFFFAOYSA-N 0.000 description 18
- 108010034507 methionyltryptophan Proteins 0.000 description 18
- MIPWEZAIMPYQST-FXQIFTODSA-N Ala-Cys-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CS)C(=O)N[C@@H](C(C)C)C(O)=O MIPWEZAIMPYQST-FXQIFTODSA-N 0.000 description 17
- AAWLEICNDUHIJM-MBLNEYKQSA-N Ala-Thr-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](C)N)O AAWLEICNDUHIJM-MBLNEYKQSA-N 0.000 description 17
- JXKMXEBNZCKSDY-JIOCBJNQSA-N Thr-Asp-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N)O JXKMXEBNZCKSDY-JIOCBJNQSA-N 0.000 description 17
- PRNGXSILMXSWQQ-OEAJRASXSA-N Thr-Leu-Phe Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O PRNGXSILMXSWQQ-OEAJRASXSA-N 0.000 description 17
- UJMCYJKPDFQLHX-XGEHTFHBSA-N Val-Ser-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](C(C)C)N)O UJMCYJKPDFQLHX-XGEHTFHBSA-N 0.000 description 17
- VVIZITNVZUAEMI-DLOVCJGASA-N Val-Val-Gln Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCC(N)=O VVIZITNVZUAEMI-DLOVCJGASA-N 0.000 description 17
- 108010040030 histidinoalanine Proteins 0.000 description 17
- VKCOHFFSTKCXEQ-OLHMAJIHSA-N Asn-Asn-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O VKCOHFFSTKCXEQ-OLHMAJIHSA-N 0.000 description 16
- QXNGSPZMGFEZNO-QRTARXTBSA-N Asn-Val-Trp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O QXNGSPZMGFEZNO-QRTARXTBSA-N 0.000 description 16
- YGHSQRJSHKYUJY-SCZZXKLOSA-N Gly-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)CN YGHSQRJSHKYUJY-SCZZXKLOSA-N 0.000 description 16
- WCTCIIAGNMFYAO-DCAQKATOSA-N Leu-Cys-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CS)C(=O)N[C@@H](C(C)C)C(O)=O WCTCIIAGNMFYAO-DCAQKATOSA-N 0.000 description 16
- UBRMZSHOOIVJPW-SRVKXCTJSA-N Ser-Leu-Lys Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(O)=O UBRMZSHOOIVJPW-SRVKXCTJSA-N 0.000 description 16
- VKMOGXREKGVZAF-QEJZJMRPSA-N Trp-Asp-Gln Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N VKMOGXREKGVZAF-QEJZJMRPSA-N 0.000 description 16
- RWTFCAMQLFNPTK-UMPQAUOISA-N Trp-Val-Thr Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O)=CNC2=C1 RWTFCAMQLFNPTK-UMPQAUOISA-N 0.000 description 16
- UVHFONIHVHLDDQ-IFFSRLJSSA-N Val-Thr-Glu Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N)O UVHFONIHVHLDDQ-IFFSRLJSSA-N 0.000 description 16
- OWFGFHQMSBTKLX-UFYCRDLUSA-N Val-Tyr-Tyr Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O)N OWFGFHQMSBTKLX-UFYCRDLUSA-N 0.000 description 16
- 230000000295 complement effect Effects 0.000 description 16
- 108010053725 prolylvaline Proteins 0.000 description 16
- OGUPCHKBOKJFMA-SRVKXCTJSA-N Arg-Glu-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCCN=C(N)N OGUPCHKBOKJFMA-SRVKXCTJSA-N 0.000 description 15
- ACRYGQFHAQHDSF-ZLUOBGJFSA-N Asn-Asn-Asn Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O ACRYGQFHAQHDSF-ZLUOBGJFSA-N 0.000 description 15
- VPPXTHJNTYDNFJ-CIUDSAMLSA-N Asp-Ala-Lys Chemical compound C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)O)N VPPXTHJNTYDNFJ-CIUDSAMLSA-N 0.000 description 15
- PRXCTTWKGJAPMT-ZLUOBGJFSA-N Cys-Ala-Ser Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O PRXCTTWKGJAPMT-ZLUOBGJFSA-N 0.000 description 15
- SDTPKSOWFXBACN-GUBZILKMSA-N His-Glu-Asp Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O SDTPKSOWFXBACN-GUBZILKMSA-N 0.000 description 15
- -1 amino acid analogs Chemical class 0.000 description 15
- 239000012528 membrane Substances 0.000 description 15
- 108700042769 prolyl-leucyl-glycine Proteins 0.000 description 15
- XMHFCUKJRCQXGI-CIUDSAMLSA-N Asn-Pro-Gln Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CC(=O)N)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O XMHFCUKJRCQXGI-CIUDSAMLSA-N 0.000 description 14
- GYCPQVFKCPPRQB-GUBZILKMSA-N Glu-Gln-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCC(=O)O)N GYCPQVFKCPPRQB-GUBZILKMSA-N 0.000 description 14
- OZAPWFHRPINHND-GUBZILKMSA-N Pro-Cys-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CS)C(=O)N[C@@H](C(C)C)C(O)=O OZAPWFHRPINHND-GUBZILKMSA-N 0.000 description 14
- IMDMLDSVUSMAEJ-HJGDQZAQSA-N Thr-Leu-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O IMDMLDSVUSMAEJ-HJGDQZAQSA-N 0.000 description 14
- YPBYQWFZAAQMGW-XIRDDKMYSA-N Trp-Lys-Asn Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(=O)N)C(=O)O)N YPBYQWFZAAQMGW-XIRDDKMYSA-N 0.000 description 14
- 108010005233 alanylglutamic acid Proteins 0.000 description 14
- 108010089804 glycyl-threonine Proteins 0.000 description 14
- 108010051110 tyrosyl-lysine Proteins 0.000 description 14
- PBFXCUOEGVJTMV-QXEWZRGKSA-N Asn-Met-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C(C)C)C(O)=O PBFXCUOEGVJTMV-QXEWZRGKSA-N 0.000 description 13
- OIQSIMFSVLLWBX-VOAKCMCISA-N Lys-Leu-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OIQSIMFSVLLWBX-VOAKCMCISA-N 0.000 description 13
- HJXOFWKCWLHYIJ-SZMVWBNQSA-N Trp-Lys-Glu Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O HJXOFWKCWLHYIJ-SZMVWBNQSA-N 0.000 description 13
- 238000001514 detection method Methods 0.000 description 13
- 208000015181 infectious disease Diseases 0.000 description 13
- 239000002245 particle Substances 0.000 description 13
- 238000003752 polymerase chain reaction Methods 0.000 description 13
- KUFVXLQLDHJVOG-SHGPDSBTSA-N Ala-Thr-Thr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](C)N)O KUFVXLQLDHJVOG-SHGPDSBTSA-N 0.000 description 12
- IBLAOXSULLECQZ-IUKAMOBKSA-N Asn-Ile-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CC(N)=O IBLAOXSULLECQZ-IUKAMOBKSA-N 0.000 description 12
- HNXWVVHIGTZTBO-LKXGYXEUSA-N Asn-Ser-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC(N)=O HNXWVVHIGTZTBO-LKXGYXEUSA-N 0.000 description 12
- NQSUTVRXXBGVDQ-LKXGYXEUSA-N Cys-Asn-Thr Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O NQSUTVRXXBGVDQ-LKXGYXEUSA-N 0.000 description 12
- ZALGPUWUVHOGAE-GVXVVHGQSA-N Glu-Val-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CCC(=O)O)N ZALGPUWUVHOGAE-GVXVVHGQSA-N 0.000 description 12
- DMHGKBGOUAJRHU-UHFFFAOYSA-N Ile-Arg-Pro Natural products CCC(C)C(N)C(=O)NC(CCCN=C(N)N)C(=O)N1CCCC1C(O)=O DMHGKBGOUAJRHU-UHFFFAOYSA-N 0.000 description 12
- YVKSMSDXKMSIRX-GUBZILKMSA-N Leu-Glu-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O YVKSMSDXKMSIRX-GUBZILKMSA-N 0.000 description 12
- LNIIRLODKOWQIY-IHRRRGAJSA-N Phe-Asn-Met Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCSC)C(O)=O LNIIRLODKOWQIY-IHRRRGAJSA-N 0.000 description 12
- FSXRLASFHBWESK-UHFFFAOYSA-N dipeptide phenylalanyl-tyrosine Natural products C=1C=C(O)C=CC=1CC(C(O)=O)NC(=O)C(N)CC1=CC=CC=C1 FSXRLASFHBWESK-UHFFFAOYSA-N 0.000 description 12
- 108010008237 glutamyl-valyl-glycine Proteins 0.000 description 12
- 238000003018 immunoassay Methods 0.000 description 12
- 239000002502 liposome Substances 0.000 description 12
- AYOAHKWVQLNPDM-HJGDQZAQSA-N Asn-Lys-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O AYOAHKWVQLNPDM-HJGDQZAQSA-N 0.000 description 11
- AMGQTNHANMRPOE-LKXGYXEUSA-N Asn-Thr-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O AMGQTNHANMRPOE-LKXGYXEUSA-N 0.000 description 11
- UFPXDFOYHVEIPI-BYPYZUCNSA-N Gly-Gly-Asp Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CC(O)=O UFPXDFOYHVEIPI-BYPYZUCNSA-N 0.000 description 11
- 241000880493 Leptailurus serval Species 0.000 description 11
- 102100036011 T-cell surface glycoprotein CD4 Human genes 0.000 description 11
- 108010067216 glycyl-glycyl-glycine Proteins 0.000 description 11
- 238000002372 labelling Methods 0.000 description 11
- 238000006467 substitution reaction Methods 0.000 description 11
- KMSHNDWHPWXPEC-BQBZGAKWSA-N Arg-Asp-Gly Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O KMSHNDWHPWXPEC-BQBZGAKWSA-N 0.000 description 10
- RAQMSGVCGSJKCL-FOHZUACHSA-N Asn-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC(N)=O RAQMSGVCGSJKCL-FOHZUACHSA-N 0.000 description 10
- 108010041986 DNA Vaccines Proteins 0.000 description 10
- SWQALSGKVLYKDT-UHFFFAOYSA-N Gly-Ile-Ala Natural products NCC(=O)NC(C(C)CC)C(=O)NC(C)C(O)=O SWQALSGKVLYKDT-UHFFFAOYSA-N 0.000 description 10
- WNGHUXFWEWTKAO-YUMQZZPRSA-N Gly-Ser-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)CN WNGHUXFWEWTKAO-YUMQZZPRSA-N 0.000 description 10
- IAJFFZORSWOZPQ-SRVKXCTJSA-N Leu-Leu-Asn Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O IAJFFZORSWOZPQ-SRVKXCTJSA-N 0.000 description 10
- YBAFDPFAUTYYRW-UHFFFAOYSA-N N-L-alpha-glutamyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CCC(O)=O YBAFDPFAUTYYRW-UHFFFAOYSA-N 0.000 description 10
- GDUZTEQRAOXYJS-SRVKXCTJSA-N Ser-Phe-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CO)N GDUZTEQRAOXYJS-SRVKXCTJSA-N 0.000 description 10
- 108010034529 leucyl-lysine Proteins 0.000 description 10
- 150000002632 lipids Chemical class 0.000 description 10
- 108010017391 lysylvaline Proteins 0.000 description 10
- 229940021995 DNA vaccine Drugs 0.000 description 9
- JYXKPJVDCAWMDG-ZPFDUUQYSA-N Glu-Pro-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CCC(=O)O)N JYXKPJVDCAWMDG-ZPFDUUQYSA-N 0.000 description 9
- SYWCGQOIIARSIX-SRVKXCTJSA-N Glu-Pro-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O SYWCGQOIIARSIX-SRVKXCTJSA-N 0.000 description 9
- PNTWNAXGBOZMBO-MNXVOIDGSA-N Ile-Lys-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N PNTWNAXGBOZMBO-MNXVOIDGSA-N 0.000 description 9
- 102100034349 Integrase Human genes 0.000 description 9
- TWJDQTTXXZDJKV-BPUTZDHNSA-N Trp-Arg-Ser Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O TWJDQTTXXZDJKV-BPUTZDHNSA-N 0.000 description 9
- MNMYOSZWCKYEDI-JRQIVUDYSA-N Tyr-Asp-Thr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MNMYOSZWCKYEDI-JRQIVUDYSA-N 0.000 description 9
- 125000000539 amino acid group Chemical group 0.000 description 9
- 108010060035 arginylproline Proteins 0.000 description 9
- 238000000338 in vitro Methods 0.000 description 9
- 108010078274 isoleucylvaline Proteins 0.000 description 9
- 108010003700 lysyl aspartic acid Proteins 0.000 description 9
- 239000000463 material Substances 0.000 description 9
- 238000003786 synthesis reaction Methods 0.000 description 9
- 238000012360 testing method Methods 0.000 description 9
- 238000013518 transcription Methods 0.000 description 9
- 230000035897 transcription Effects 0.000 description 9
- PQWTZSNVWSOFFK-FXQIFTODSA-N Arg-Asp-Asn Chemical compound C(C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N)CN=C(N)N PQWTZSNVWSOFFK-FXQIFTODSA-N 0.000 description 8
- JIVJQYNNAYFXDG-LKXGYXEUSA-N Cys-Thr-Asn Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(O)=O JIVJQYNNAYFXDG-LKXGYXEUSA-N 0.000 description 8
- IOUQWHIEQYQVFD-JYJNAYRXSA-N Glu-Leu-Tyr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O IOUQWHIEQYQVFD-JYJNAYRXSA-N 0.000 description 8
- SOYWRINXUSUWEQ-DLOVCJGASA-N Glu-Val-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCC(O)=O SOYWRINXUSUWEQ-DLOVCJGASA-N 0.000 description 8
- SWQALSGKVLYKDT-ZKWXMUAHSA-N Gly-Ile-Ala Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O SWQALSGKVLYKDT-ZKWXMUAHSA-N 0.000 description 8
- JBCLFWXMTIKCCB-UHFFFAOYSA-N H-Gly-Phe-OH Natural products NCC(=O)NC(C(O)=O)CC1=CC=CC=C1 JBCLFWXMTIKCCB-UHFFFAOYSA-N 0.000 description 8
- PGTISAJTWZPFGN-PEXQALLHSA-N His-Gly-Ile Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)NCC(=O)N[C@@H]([C@@H](C)CC)C(O)=O PGTISAJTWZPFGN-PEXQALLHSA-N 0.000 description 8
- NULSANWBUWLTKN-NAKRPEOUSA-N Ile-Arg-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CO)C(=O)O)N NULSANWBUWLTKN-NAKRPEOUSA-N 0.000 description 8
- RSDHVTMRXSABSV-GHCJXIJMSA-N Ile-Asn-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CS)C(=O)O)N RSDHVTMRXSABSV-GHCJXIJMSA-N 0.000 description 8
- YBKKLDBBPFIXBQ-MBLNEYKQSA-N Ile-Thr-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(=O)O)N YBKKLDBBPFIXBQ-MBLNEYKQSA-N 0.000 description 8
- ZDXPYRJPNDTMRX-VKHMYHEASA-N L-glutamine Chemical compound OC(=O)[C@@H](N)CCC(N)=O ZDXPYRJPNDTMRX-VKHMYHEASA-N 0.000 description 8
- COLNVLDHVKWLRT-QMMMGPOBSA-N L-phenylalanine Chemical compound OC(=O)[C@@H](N)CC1=CC=CC=C1 COLNVLDHVKWLRT-QMMMGPOBSA-N 0.000 description 8
- OIARJGNVARWKFP-YUMQZZPRSA-N Leu-Asn-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O OIARJGNVARWKFP-YUMQZZPRSA-N 0.000 description 8
- IEWBEPKLKUXQBU-VOAKCMCISA-N Leu-Leu-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O IEWBEPKLKUXQBU-VOAKCMCISA-N 0.000 description 8
- SITLTJHOQZFJGG-UHFFFAOYSA-N N-L-alpha-glutamyl-L-valine Natural products CC(C)C(C(O)=O)NC(=O)C(N)CCC(O)=O SITLTJHOQZFJGG-UHFFFAOYSA-N 0.000 description 8
- 108010002311 N-glycylglutamic acid Proteins 0.000 description 8
- DFEVBOYEUQJGER-JURCDPSOSA-N Phe-Ala-Ile Chemical compound N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O DFEVBOYEUQJGER-JURCDPSOSA-N 0.000 description 8
- CDNPIRSCAFMMBE-SRVKXCTJSA-N Phe-Asn-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O CDNPIRSCAFMMBE-SRVKXCTJSA-N 0.000 description 8
- KIZQGKLMXKGDIV-BQBZGAKWSA-N Pro-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1 KIZQGKLMXKGDIV-BQBZGAKWSA-N 0.000 description 8
- WLJPJRGQRNCIQS-ZLUOBGJFSA-N Ser-Ser-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O WLJPJRGQRNCIQS-ZLUOBGJFSA-N 0.000 description 8
- GZYNMZQXFRWDFH-YTWAJWBKSA-N Thr-Arg-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N1CCC[C@@H]1C(=O)O)N)O GZYNMZQXFRWDFH-YTWAJWBKSA-N 0.000 description 8
- LMSBRIVOCYOKMU-NRPADANISA-N Val-Gln-Cys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CS)C(=O)O)N LMSBRIVOCYOKMU-NRPADANISA-N 0.000 description 8
- SYOMXKPPFZRELL-ONGXEEELSA-N Val-Gly-Lys Chemical compound CC(C)[C@@H](C(=O)NCC(=O)N[C@@H](CCCCN)C(=O)O)N SYOMXKPPFZRELL-ONGXEEELSA-N 0.000 description 8
- 108010093581 aspartyl-proline Proteins 0.000 description 8
- 230000015572 biosynthetic process Effects 0.000 description 8
- 238000004113 cell culture Methods 0.000 description 8
- 239000000412 dendrimer Substances 0.000 description 8
- 229920000736 dendritic polymer Polymers 0.000 description 8
- 239000001963 growth medium Substances 0.000 description 8
- 238000000746 purification Methods 0.000 description 8
- 230000001105 regulatory effect Effects 0.000 description 8
- 210000001519 tissue Anatomy 0.000 description 8
- JNTMAZFVYNDPLB-PEDHHIEDSA-N (2S,3S)-2-[[[(2S)-1-[(2S,3S)-2-amino-3-methyl-1-oxopentyl]-2-pyrrolidinyl]-oxomethyl]amino]-3-methylpentanoic acid Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JNTMAZFVYNDPLB-PEDHHIEDSA-N 0.000 description 7
- XQQVCUIBGYFKDC-OLHMAJIHSA-N Asn-Asp-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XQQVCUIBGYFKDC-OLHMAJIHSA-N 0.000 description 7
- XOQYDFCQPWAMSA-KKHAAJSZSA-N Asn-Val-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XOQYDFCQPWAMSA-KKHAAJSZSA-N 0.000 description 7
- 102100035875 C-C chemokine receptor type 5 Human genes 0.000 description 7
- 101710149870 C-C chemokine receptor type 5 Proteins 0.000 description 7
- DZLQXIFVQFTFJY-BYPYZUCNSA-N Cys-Gly-Gly Chemical compound SC[C@H](N)C(=O)NCC(=O)NCC(O)=O DZLQXIFVQFTFJY-BYPYZUCNSA-N 0.000 description 7
- HJXSYJVCMUOUNY-SRVKXCTJSA-N Cys-Ser-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CS)N HJXSYJVCMUOUNY-SRVKXCTJSA-N 0.000 description 7
- ZLFRUAFDAIFNHN-LKXGYXEUSA-N Cys-Thr-Asp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CS)N)O ZLFRUAFDAIFNHN-LKXGYXEUSA-N 0.000 description 7
- 101710091045 Envelope protein Proteins 0.000 description 7
- XFAUJGNLHIGXET-AVGNSLFASA-N Gln-Leu-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O XFAUJGNLHIGXET-AVGNSLFASA-N 0.000 description 7
- DCGXHWINSHEPIR-SRVKXCTJSA-N Leu-Lys-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CS)C(=O)O)N DCGXHWINSHEPIR-SRVKXCTJSA-N 0.000 description 7
- BIZNDKMFQHDOIE-KKUMJFAQSA-N Leu-Phe-Asn Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(N)=O)C(O)=O)CC1=CC=CC=C1 BIZNDKMFQHDOIE-KKUMJFAQSA-N 0.000 description 7
- KZNQNBZMBZJQJO-UHFFFAOYSA-N N-glycyl-L-proline Natural products NCC(=O)N1CCCC1C(O)=O KZNQNBZMBZJQJO-UHFFFAOYSA-N 0.000 description 7
- 108091028043 Nucleic acid sequence Proteins 0.000 description 7
- CSDMCMITJLKBAH-SOUVJXGZSA-N Phe-Glu-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC2=CC=CC=C2)N)C(=O)O CSDMCMITJLKBAH-SOUVJXGZSA-N 0.000 description 7
- DSXPMZMSJHOKKK-HJOGWXRNSA-N Phe-Phe-Tyr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O DSXPMZMSJHOKKK-HJOGWXRNSA-N 0.000 description 7
- FJLODLCIOJUDRG-PYJNHQTQSA-N Pro-Ile-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@@H]2CCCN2 FJLODLCIOJUDRG-PYJNHQTQSA-N 0.000 description 7
- 101710188315 Protein X Proteins 0.000 description 7
- ZUGXSSFMTXKHJS-ZLUOBGJFSA-N Ser-Ala-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O ZUGXSSFMTXKHJS-ZLUOBGJFSA-N 0.000 description 7
- SQBLRDDJTUJDMV-ACZMJKKPSA-N Ser-Glu-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O SQBLRDDJTUJDMV-ACZMJKKPSA-N 0.000 description 7
- BDMWLJLPPUCLNV-XGEHTFHBSA-N Ser-Thr-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O BDMWLJLPPUCLNV-XGEHTFHBSA-N 0.000 description 7
- NAXBBCLCEOTAIG-RHYQMDGZSA-N Thr-Arg-Lys Chemical compound NC(N)=NCCC[C@H](NC(=O)[C@@H](N)[C@H](O)C)C(=O)N[C@@H](CCCCN)C(O)=O NAXBBCLCEOTAIG-RHYQMDGZSA-N 0.000 description 7
- VIBXMCZWVUOZLA-OLHMAJIHSA-N Thr-Asn-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)N)C(=O)O)N)O VIBXMCZWVUOZLA-OLHMAJIHSA-N 0.000 description 7
- RKDFEMGVMMYYNG-WDCWCFNPSA-N Thr-Gln-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O RKDFEMGVMMYYNG-WDCWCFNPSA-N 0.000 description 7
- FDALPRWYVKJCLL-PMVVWTBXSA-N Thr-His-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)NCC(O)=O FDALPRWYVKJCLL-PMVVWTBXSA-N 0.000 description 7
- STUAPCLEDMKXKL-LKXGYXEUSA-N Thr-Ser-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O STUAPCLEDMKXKL-LKXGYXEUSA-N 0.000 description 7
- 108010047495 alanylglycine Proteins 0.000 description 7
- 108010013835 arginine glutamate Proteins 0.000 description 7
- 108010092854 aspartyllysine Proteins 0.000 description 7
- 239000000872 buffer Substances 0.000 description 7
- 235000014633 carbohydrates Nutrition 0.000 description 7
- 230000000694 effects Effects 0.000 description 7
- 238000009472 formulation Methods 0.000 description 7
- 108010063718 gamma-glutamylaspartic acid Proteins 0.000 description 7
- 108010032388 glycyl-prolyl-glycyl-arginyl-alanyl-phenylanine Proteins 0.000 description 7
- 230000005847 immunogenicity Effects 0.000 description 7
- 108010064235 lysylglycine Proteins 0.000 description 7
- 238000002703 mutagenesis Methods 0.000 description 7
- 231100000350 mutagenesis Toxicity 0.000 description 7
- 238000006386 neutralization reaction Methods 0.000 description 7
- 210000002381 plasma Anatomy 0.000 description 7
- 239000007787 solid Substances 0.000 description 7
- 239000000243 solution Substances 0.000 description 7
- 108091032973 (ribonucleotides)n+m Proteins 0.000 description 6
- VIGKUFXFTPWYER-BIIVOSGPSA-N Ala-Cys-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CS)C(=O)N1CCC[C@@H]1C(=O)O)N VIGKUFXFTPWYER-BIIVOSGPSA-N 0.000 description 6
- WKOBSJOZRJJVRZ-FXQIFTODSA-N Ala-Glu-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O WKOBSJOZRJJVRZ-FXQIFTODSA-N 0.000 description 6
- QHASENCZLDHBGX-ONGXEEELSA-N Ala-Gly-Phe Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 QHASENCZLDHBGX-ONGXEEELSA-N 0.000 description 6
- ANGAOPNEPIDLPO-XVYDVKMFSA-N Ala-His-Cys Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CS)C(=O)O)N ANGAOPNEPIDLPO-XVYDVKMFSA-N 0.000 description 6
- VHVVPYOJIIQCKS-QEJZJMRPSA-N Ala-Leu-Phe Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 VHVVPYOJIIQCKS-QEJZJMRPSA-N 0.000 description 6
- FVNAUOZKIPAYNA-BPNCWPANSA-N Ala-Met-Tyr Chemical compound CSCC[C@H](NC(=O)[C@H](C)N)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 FVNAUOZKIPAYNA-BPNCWPANSA-N 0.000 description 6
- OLVCTPPSXNRGKV-GUBZILKMSA-N Ala-Pro-Pro Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 OLVCTPPSXNRGKV-GUBZILKMSA-N 0.000 description 6
- OMMIEVATLAGRCK-BYPYZUCNSA-N Asp-Gly-Gly Chemical compound OC(=O)C[C@H](N)C(=O)NCC(=O)NCC(O)=O OMMIEVATLAGRCK-BYPYZUCNSA-N 0.000 description 6
- 102100031650 C-X-C chemokine receptor type 4 Human genes 0.000 description 6
- 241000701022 Cytomegalovirus Species 0.000 description 6
- COYGBRTZEVWZBW-XKBZYTNZSA-N Gln-Cys-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CS)NC(=O)[C@@H](N)CCC(N)=O COYGBRTZEVWZBW-XKBZYTNZSA-N 0.000 description 6
- XJKAKYXMFHUIHT-AUTRQRHGSA-N Gln-Glu-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCC(=O)N)N XJKAKYXMFHUIHT-AUTRQRHGSA-N 0.000 description 6
- FFVXLVGUJBCKRX-UKJIMTQDSA-N Gln-Ile-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)O)NC(=O)[C@H](CCC(=O)N)N FFVXLVGUJBCKRX-UKJIMTQDSA-N 0.000 description 6
- VZRAXPGTUNDIDK-GUBZILKMSA-N Gln-Leu-Asn Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCC(=O)N)N VZRAXPGTUNDIDK-GUBZILKMSA-N 0.000 description 6
- PVBBEKPHARMPHX-DCAQKATOSA-N Glu-Gln-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CCC(O)=O PVBBEKPHARMPHX-DCAQKATOSA-N 0.000 description 6
- XTZDZAXYPDISRR-MNXVOIDGSA-N Glu-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)O)N XTZDZAXYPDISRR-MNXVOIDGSA-N 0.000 description 6
- YTRBQAQSUDSIQE-FHWLQOOXSA-N Glu-Phe-Phe Chemical compound C([C@H](NC(=O)[C@H](CCC(O)=O)N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 YTRBQAQSUDSIQE-FHWLQOOXSA-N 0.000 description 6
- XBWMTPAIUQIWKA-BYULHYEWSA-N Gly-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)CN XBWMTPAIUQIWKA-BYULHYEWSA-N 0.000 description 6
- ZRZILYKEJBMFHY-BQBZGAKWSA-N Gly-Asp-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)CN ZRZILYKEJBMFHY-BQBZGAKWSA-N 0.000 description 6
- DHMQDGOQFOQNFH-UHFFFAOYSA-N Glycine Chemical compound NCC(O)=O DHMQDGOQFOQNFH-UHFFFAOYSA-N 0.000 description 6
- 101000922348 Homo sapiens C-X-C chemokine receptor type 4 Proteins 0.000 description 6
- ASCFJMSGKUIRDU-ZPFDUUQYSA-N Ile-Arg-Gln Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(O)=O ASCFJMSGKUIRDU-ZPFDUUQYSA-N 0.000 description 6
- DMHGKBGOUAJRHU-RVMXOQNASA-N Ile-Arg-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N1CCC[C@@H]1C(=O)O)N DMHGKBGOUAJRHU-RVMXOQNASA-N 0.000 description 6
- KFVUBLZRFSVDGO-BYULHYEWSA-N Ile-Gly-Asp Chemical compound CC[C@H](C)[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC(O)=O KFVUBLZRFSVDGO-BYULHYEWSA-N 0.000 description 6
- FBGXMKUWQFPHFB-JBDRJPRFSA-N Ile-Ser-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)O)N FBGXMKUWQFPHFB-JBDRJPRFSA-N 0.000 description 6
- WHUUTDBJXJRKMK-VKHMYHEASA-N L-glutamic acid Chemical compound OC(=O)[C@@H](N)CCC(O)=O WHUUTDBJXJRKMK-VKHMYHEASA-N 0.000 description 6
- ZDJQVSIPFLMNOX-RHYQMDGZSA-N Leu-Thr-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N ZDJQVSIPFLMNOX-RHYQMDGZSA-N 0.000 description 6
- BTEMNFBEAAOGBR-BZSNNMDCSA-N Leu-Tyr-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCCCN)C(=O)O)N BTEMNFBEAAOGBR-BZSNNMDCSA-N 0.000 description 6
- XTONYTDATVADQH-CIUDSAMLSA-N Lys-Cys-Asn Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(N)=O)C(O)=O XTONYTDATVADQH-CIUDSAMLSA-N 0.000 description 6
- DLCAXBGXGOVUCD-PPCPHDFISA-N Lys-Thr-Ile Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O DLCAXBGXGOVUCD-PPCPHDFISA-N 0.000 description 6
- IKXQOBUBZSOWDY-AVGNSLFASA-N Lys-Val-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)O)NC(=O)[C@H](CCCCN)N IKXQOBUBZSOWDY-AVGNSLFASA-N 0.000 description 6
- 241000699670 Mus sp. Species 0.000 description 6
- HHOOEUSPFGPZFP-QWRGUYRKSA-N Phe-Asn-Gly Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O HHOOEUSPFGPZFP-QWRGUYRKSA-N 0.000 description 6
- KIEPQOIQHFKQLK-PCBIJLKTSA-N Phe-Asn-Ile Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KIEPQOIQHFKQLK-PCBIJLKTSA-N 0.000 description 6
- 239000002202 Polyethylene glycol Substances 0.000 description 6
- WVOXLKUUVCCCSU-ZPFDUUQYSA-N Pro-Glu-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WVOXLKUUVCCCSU-ZPFDUUQYSA-N 0.000 description 6
- HAAQQNHQZBOWFO-LURJTMIESA-N Pro-Gly-Gly Chemical compound OC(=O)CNC(=O)CNC(=O)[C@@H]1CCCN1 HAAQQNHQZBOWFO-LURJTMIESA-N 0.000 description 6
- FUMGHWDRRFCKEP-CIUDSAMLSA-N Ser-Leu-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O FUMGHWDRRFCKEP-CIUDSAMLSA-N 0.000 description 6
- OJFFAQFRCVPHNN-JYBASQMISA-N Ser-Thr-Trp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O OJFFAQFRCVPHNN-JYBASQMISA-N 0.000 description 6
- MFQMZDPAZRZAPV-NAKRPEOUSA-N Ser-Val-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CO)N MFQMZDPAZRZAPV-NAKRPEOUSA-N 0.000 description 6
- COYHRQWNJDJCNA-NUJDXYNKSA-N Thr-Thr-Thr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O COYHRQWNJDJCNA-NUJDXYNKSA-N 0.000 description 6
- YEGMNOHLZNGOCG-UBHSHLNASA-N Trp-Asn-Asn Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O YEGMNOHLZNGOCG-UBHSHLNASA-N 0.000 description 6
- VTHNLRXALGUDBS-BPUTZDHNSA-N Trp-Gln-Glu Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N VTHNLRXALGUDBS-BPUTZDHNSA-N 0.000 description 6
- OOEUVMFKKZYSRX-LEWSCRJBSA-N Tyr-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)N OOEUVMFKKZYSRX-LEWSCRJBSA-N 0.000 description 6
- PMHLLBKTDHQMCY-ULQDDVLXSA-N Tyr-Lys-Val Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O PMHLLBKTDHQMCY-ULQDDVLXSA-N 0.000 description 6
- AOILQMZPNLUXCM-AVGNSLFASA-N Val-Val-Lys Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCCN AOILQMZPNLUXCM-AVGNSLFASA-N 0.000 description 6
- 230000000890 antigenic effect Effects 0.000 description 6
- 210000004369 blood Anatomy 0.000 description 6
- 239000008280 blood Substances 0.000 description 6
- 150000001720 carbohydrates Chemical class 0.000 description 6
- 150000001875 compounds Chemical class 0.000 description 6
- 230000006870 function Effects 0.000 description 6
- 238000001415 gene therapy Methods 0.000 description 6
- 108010083708 leucyl-aspartyl-valine Proteins 0.000 description 6
- 108010057821 leucylproline Proteins 0.000 description 6
- 108010009298 lysylglutamic acid Proteins 0.000 description 6
- 108020004999 messenger RNA Proteins 0.000 description 6
- 229920001223 polyethylene glycol Polymers 0.000 description 6
- 102000005962 receptors Human genes 0.000 description 6
- 108020003175 receptors Proteins 0.000 description 6
- 210000003705 ribosome Anatomy 0.000 description 6
- 125000003396 thiol group Chemical group [H]S* 0.000 description 6
- 108010033670 threonyl-aspartyl-tyrosine Proteins 0.000 description 6
- 241000701161 unidentified adenovirus Species 0.000 description 6
- 238000002255 vaccination Methods 0.000 description 6
- 210000002845 virion Anatomy 0.000 description 6
- 208000030507 AIDS Diseases 0.000 description 5
- GORKKVHIBWAQHM-GCJQMDKQSA-N Ala-Asn-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GORKKVHIBWAQHM-GCJQMDKQSA-N 0.000 description 5
- IPZQNYYAYVRKKK-FXQIFTODSA-N Ala-Pro-Ala Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O IPZQNYYAYVRKKK-FXQIFTODSA-N 0.000 description 5
- SQKPKIJVWHAWNF-DCAQKATOSA-N Arg-Asp-Lys Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(O)=O SQKPKIJVWHAWNF-DCAQKATOSA-N 0.000 description 5
- LKDHUGLXOHYINY-XUXIUFHCSA-N Arg-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N LKDHUGLXOHYINY-XUXIUFHCSA-N 0.000 description 5
- VUGWHBXPMAHEGZ-SRVKXCTJSA-N Arg-Pro-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CCCN=C(N)N VUGWHBXPMAHEGZ-SRVKXCTJSA-N 0.000 description 5
- RRVBEKYEFMCDIF-WHFBIAKZSA-N Asn-Cys-Gly Chemical compound C([C@@H](C(=O)N[C@@H](CS)C(=O)NCC(=O)O)N)C(=O)N RRVBEKYEFMCDIF-WHFBIAKZSA-N 0.000 description 5
- SPIPSJXLZVTXJL-ZLUOBGJFSA-N Asn-Cys-Ser Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@@H](CO)C(O)=O SPIPSJXLZVTXJL-ZLUOBGJFSA-N 0.000 description 5
- LANZYLJEHLBUPR-BPUTZDHNSA-N Asn-Met-Trp Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CC(=O)N)N LANZYLJEHLBUPR-BPUTZDHNSA-N 0.000 description 5
- JXMREEPBRANWBY-VEVYYDQMSA-N Asn-Thr-Arg Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O JXMREEPBRANWBY-VEVYYDQMSA-N 0.000 description 5
- JBDLMLZNDRLDIX-HJGDQZAQSA-N Asn-Thr-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O JBDLMLZNDRLDIX-HJGDQZAQSA-N 0.000 description 5
- UXHYOWXTJLBEPG-GSSVUCPTSA-N Asn-Thr-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O UXHYOWXTJLBEPG-GSSVUCPTSA-N 0.000 description 5
- UQBGYPFHWFZMCD-ZLUOBGJFSA-N Asp-Asn-Asn Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O UQBGYPFHWFZMCD-ZLUOBGJFSA-N 0.000 description 5
- ICTXFVKYAGQURS-UBHSHLNASA-N Asp-Asn-Trp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O ICTXFVKYAGQURS-UBHSHLNASA-N 0.000 description 5
- GBSUGIXJAAKZOW-GMOBBJLQSA-N Asp-Ile-Arg Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O GBSUGIXJAAKZOW-GMOBBJLQSA-N 0.000 description 5
- 108091026890 Coding region Proteins 0.000 description 5
- 108020004635 Complementary DNA Proteins 0.000 description 5
- DCXGXDGGXVZVMY-GHCJXIJMSA-N Cys-Asn-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CS DCXGXDGGXVZVMY-GHCJXIJMSA-N 0.000 description 5
- HMWBPUDETPKSSS-DCAQKATOSA-N Cys-Pro-Lys Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CS)N)C(=O)N[C@@H](CCCCN)C(=O)O HMWBPUDETPKSSS-DCAQKATOSA-N 0.000 description 5
- YWEHYKGJWHPGPY-XGEHTFHBSA-N Cys-Thr-Arg Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](CS)N)O YWEHYKGJWHPGPY-XGEHTFHBSA-N 0.000 description 5
- DTCCMDYODDPHBG-ACZMJKKPSA-N Gln-Ala-Cys Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CS)C(O)=O DTCCMDYODDPHBG-ACZMJKKPSA-N 0.000 description 5
- INGJLBQKTRJLFO-UKJIMTQDSA-N Glu-Ile-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCC(O)=O INGJLBQKTRJLFO-UKJIMTQDSA-N 0.000 description 5
- CIMULJZTTOBOPN-WHFBIAKZSA-N Gly-Asn-Asn Chemical compound NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O CIMULJZTTOBOPN-WHFBIAKZSA-N 0.000 description 5
- LXXLEUBUOMCAMR-NKWVEPMBSA-N Gly-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)CN)C(=O)O LXXLEUBUOMCAMR-NKWVEPMBSA-N 0.000 description 5
- LHRXAHLCRMQBGJ-RYUDHWBXSA-N Gly-Glu-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)CN LHRXAHLCRMQBGJ-RYUDHWBXSA-N 0.000 description 5
- VBOBNHSVQKKTOT-YUMQZZPRSA-N Gly-Lys-Ala Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O VBOBNHSVQKKTOT-YUMQZZPRSA-N 0.000 description 5
- 108090000288 Glycoproteins Proteins 0.000 description 5
- 102000003886 Glycoproteins Human genes 0.000 description 5
- RBOOOLVEKJHUNA-CIUDSAMLSA-N His-Cys-Asn Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(N)=O)C(O)=O RBOOOLVEKJHUNA-CIUDSAMLSA-N 0.000 description 5
- YKUAGFAXQRYUQW-KKUMJFAQSA-N His-Tyr-Cys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC2=CN=CN2)N)O YKUAGFAXQRYUQW-KKUMJFAQSA-N 0.000 description 5
- 241000700588 Human alphaherpesvirus 1 Species 0.000 description 5
- GQKSJYINYYWPMR-NGZCFLSTSA-N Ile-Gly-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N1CCC[C@@H]1C(=O)O)N GQKSJYINYYWPMR-NGZCFLSTSA-N 0.000 description 5
- SVBAHOMTJRFSIC-SXTJYALSSA-N Ile-Ile-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(=O)N)C(=O)O)N SVBAHOMTJRFSIC-SXTJYALSSA-N 0.000 description 5
- QZZIBQZLWBOOJH-PEDHHIEDSA-N Ile-Ile-Val Chemical compound N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(=O)O QZZIBQZLWBOOJH-PEDHHIEDSA-N 0.000 description 5
- GLYJPWIRLBAIJH-UHFFFAOYSA-N Ile-Lys-Pro Natural products CCC(C)C(N)C(=O)NC(CCCCN)C(=O)N1CCCC1C(O)=O GLYJPWIRLBAIJH-UHFFFAOYSA-N 0.000 description 5
- NAFIFZNBSPWYOO-RWRJDSDZSA-N Ile-Thr-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N NAFIFZNBSPWYOO-RWRJDSDZSA-N 0.000 description 5
- KBDIBHQICWDGDL-PPCPHDFISA-N Ile-Thr-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)O)N KBDIBHQICWDGDL-PPCPHDFISA-N 0.000 description 5
- AUIYHFRUOOKTGX-UKJIMTQDSA-N Ile-Val-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N AUIYHFRUOOKTGX-UKJIMTQDSA-N 0.000 description 5
- 108700005091 Immunoglobulin Genes Proteins 0.000 description 5
- PMGDADKJMCOXHX-UHFFFAOYSA-N L-Arginyl-L-glutamin-acetat Natural products NC(=N)NCCCC(N)C(=O)NC(CCC(N)=O)C(O)=O PMGDADKJMCOXHX-UHFFFAOYSA-N 0.000 description 5
- DCXYFEDJOCDNAF-REOHCLBHSA-N L-asparagine Chemical compound OC(=O)[C@@H](N)CC(N)=O DCXYFEDJOCDNAF-REOHCLBHSA-N 0.000 description 5
- HRTRLSRYZZKPCO-BJDJZHNGSA-N Leu-Ile-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O HRTRLSRYZZKPCO-BJDJZHNGSA-N 0.000 description 5
- MVVSHHJKJRZVNY-ACRUOGEOSA-N Leu-Phe-Tyr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O MVVSHHJKJRZVNY-ACRUOGEOSA-N 0.000 description 5
- HGUUMQWGYCVPKG-DCAQKATOSA-N Leu-Pro-Cys Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CS)C(=O)O)N HGUUMQWGYCVPKG-DCAQKATOSA-N 0.000 description 5
- WQWZXKWOEVSGQM-DCAQKATOSA-N Lys-Ala-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCCN WQWZXKWOEVSGQM-DCAQKATOSA-N 0.000 description 5
- 108010062166 Lys-Asn-Asp Proteins 0.000 description 5
- YVMQJGWLHRWMDF-MNXVOIDGSA-N Lys-Gln-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCCCN)N YVMQJGWLHRWMDF-MNXVOIDGSA-N 0.000 description 5
- JOSAKOKSPXROGQ-BJDJZHNGSA-N Lys-Ser-Ile Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JOSAKOKSPXROGQ-BJDJZHNGSA-N 0.000 description 5
- WINFHLHJTRGLCV-BZSNNMDCSA-N Lys-Tyr-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CCCCN)C(O)=O)CC1=CC=C(O)C=C1 WINFHLHJTRGLCV-BZSNNMDCSA-N 0.000 description 5
- HOTNHEUETJELDL-BPNCWPANSA-N Met-Tyr-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CCSC)N HOTNHEUETJELDL-BPNCWPANSA-N 0.000 description 5
- PESQCPHRXOFIPX-UHFFFAOYSA-N N-L-methionyl-L-tyrosine Natural products CSCCC(N)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 PESQCPHRXOFIPX-UHFFFAOYSA-N 0.000 description 5
- XMBSYZWANAQXEV-UHFFFAOYSA-N N-alpha-L-glutamyl-L-phenylalanine Natural products OC(=O)CCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 XMBSYZWANAQXEV-UHFFFAOYSA-N 0.000 description 5
- BRDYYVQTEJVRQT-HRCADAONSA-N Phe-Arg-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC2=CC=CC=C2)N)C(=O)O BRDYYVQTEJVRQT-HRCADAONSA-N 0.000 description 5
- AIZVVCMAFRREQS-GUBZILKMSA-N Pro-Cys-Arg Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CS)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O AIZVVCMAFRREQS-GUBZILKMSA-N 0.000 description 5
- FXGIMYRVJJEIIM-UWVGGRQHSA-N Pro-Leu-Gly Chemical compound OC(=O)CNC(=O)[C@H](CC(C)C)NC(=O)[C@@H]1CCCN1 FXGIMYRVJJEIIM-UWVGGRQHSA-N 0.000 description 5
- DWPXHLIBFQLKLK-CYDGBPFRSA-N Pro-Pro-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 DWPXHLIBFQLKLK-CYDGBPFRSA-N 0.000 description 5
- AJJDPGVVNPUZCR-RHYQMDGZSA-N Pro-Thr-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@@H]1CCCN1)O AJJDPGVVNPUZCR-RHYQMDGZSA-N 0.000 description 5
- YMEXHZTVKDAKIY-GHCJXIJMSA-N Ser-Asn-Ile Chemical compound CC[C@H](C)[C@H](NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CO)C(O)=O YMEXHZTVKDAKIY-GHCJXIJMSA-N 0.000 description 5
- BEAFYHFQTOTVFS-VGDYDELISA-N Ser-Ile-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CO)N BEAFYHFQTOTVFS-VGDYDELISA-N 0.000 description 5
- GMXIJHCBTZDAPD-QPHKQPEJSA-N Thr-Ile-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)NC(=O)[C@H]([C@@H](C)O)N GMXIJHCBTZDAPD-QPHKQPEJSA-N 0.000 description 5
- IEZVHOULSUULHD-XGEHTFHBSA-N Thr-Ser-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O IEZVHOULSUULHD-XGEHTFHBSA-N 0.000 description 5
- BJJRNAVDQGREGC-HOUAVDHOSA-N Thr-Trp-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CC(=O)N)C(=O)O)N)O BJJRNAVDQGREGC-HOUAVDHOSA-N 0.000 description 5
- FMXFHNSFABRVFZ-BZSNNMDCSA-N Tyr-Lys-Leu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O FMXFHNSFABRVFZ-BZSNNMDCSA-N 0.000 description 5
- KDKLLPMFFGYQJD-CYDGBPFRSA-N Val-Ile-Arg Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](C(C)C)N KDKLLPMFFGYQJD-CYDGBPFRSA-N 0.000 description 5
- BZMIYHIJVVJPCK-QSFUFRPTSA-N Val-Ile-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](C(C)C)N BZMIYHIJVVJPCK-QSFUFRPTSA-N 0.000 description 5
- QRVPEKJBBRYISE-XUXIUFHCSA-N Val-Lys-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](C(C)C)N QRVPEKJBBRYISE-XUXIUFHCSA-N 0.000 description 5
- KZSNJWFQEVHDMF-UHFFFAOYSA-N Valine Chemical compound CC(C)C(N)C(O)=O KZSNJWFQEVHDMF-UHFFFAOYSA-N 0.000 description 5
- 108010008355 arginyl-glutamine Proteins 0.000 description 5
- 108010062796 arginyllysine Proteins 0.000 description 5
- 238000010804 cDNA synthesis Methods 0.000 description 5
- 239000013592 cell lysate Substances 0.000 description 5
- 238000006243 chemical reaction Methods 0.000 description 5
- HVYWMOMLDIMFJA-DPAQBDIFSA-N cholesterol group Chemical group [C@@H]1(CC[C@H]2[C@@H]3CC=C4C[C@@H](O)CC[C@]4(C)[C@H]3CC[C@]12C)[C@H](C)CCCC(C)C HVYWMOMLDIMFJA-DPAQBDIFSA-N 0.000 description 5
- 239000002299 complementary DNA Substances 0.000 description 5
- 230000001186 cumulative effect Effects 0.000 description 5
- 108010060199 cysteinylproline Proteins 0.000 description 5
- 108010077515 glycylproline Proteins 0.000 description 5
- 230000001939 inductive effect Effects 0.000 description 5
- 239000007924 injection Substances 0.000 description 5
- 238000002347 injection Methods 0.000 description 5
- 238000007899 nucleic acid hybridization Methods 0.000 description 5
- 108010012581 phenylalanylglutamate Proteins 0.000 description 5
- 108010073025 phenylalanylphenylalanine Proteins 0.000 description 5
- 108010051242 phenylalanylserine Proteins 0.000 description 5
- 239000013612 plasmid Substances 0.000 description 5
- 229920000642 polymer Polymers 0.000 description 5
- 230000008569 process Effects 0.000 description 5
- FSYKKLYZXJSNPZ-UHFFFAOYSA-N sarcosine Chemical compound C[NH2+]CC([O-])=O FSYKKLYZXJSNPZ-UHFFFAOYSA-N 0.000 description 5
- 238000013268 sustained release Methods 0.000 description 5
- 239000012730 sustained-release form Substances 0.000 description 5
- YBJHBAHKTGYVGT-ZKWXMUAHSA-N (+)-Biotin Chemical compound N1C(=O)N[C@@H]2[C@H](CCCCC(=O)O)SC[C@@H]21 YBJHBAHKTGYVGT-ZKWXMUAHSA-N 0.000 description 4
- TZDNWXDLYFIFPT-BJDJZHNGSA-N Ala-Ile-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O TZDNWXDLYFIFPT-BJDJZHNGSA-N 0.000 description 4
- LDLSENBXQNDTPB-DCAQKATOSA-N Ala-Lys-Arg Chemical compound NCCCC[C@H](NC(=O)[C@@H](N)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N LDLSENBXQNDTPB-DCAQKATOSA-N 0.000 description 4
- OINVDEKBKBCPLX-JXUBOQSCSA-N Ala-Lys-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OINVDEKBKBCPLX-JXUBOQSCSA-N 0.000 description 4
- FFZJHQODAYHGPO-KZVJFYERSA-N Ala-Pro-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](C)N FFZJHQODAYHGPO-KZVJFYERSA-N 0.000 description 4
- HGKHPCFTRQDHCU-IUCAKERBSA-N Arg-Pro-Gly Chemical compound NC(N)=NCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O HGKHPCFTRQDHCU-IUCAKERBSA-N 0.000 description 4
- ISJWBVIYRBAXEB-CIUDSAMLSA-N Arg-Ser-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(O)=O ISJWBVIYRBAXEB-CIUDSAMLSA-N 0.000 description 4
- SLKLLQWZQHXYSV-CIUDSAMLSA-N Asn-Ala-Lys Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCCN)C(O)=O SLKLLQWZQHXYSV-CIUDSAMLSA-N 0.000 description 4
- SEKBHZJLARBNPB-GHCJXIJMSA-N Asn-Ile-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O SEKBHZJLARBNPB-GHCJXIJMSA-N 0.000 description 4
- NHSDEZURHWEZPN-SXTJYALSSA-N Asp-Ile-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)NC(=O)[C@H](CC(=O)O)N NHSDEZURHWEZPN-SXTJYALSSA-N 0.000 description 4
- KYQNAIMCTRZLNP-QSFUFRPTSA-N Asp-Ile-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O KYQNAIMCTRZLNP-QSFUFRPTSA-N 0.000 description 4
- AHWRSSLYSGLBGD-CIUDSAMLSA-N Asp-Pro-Glu Chemical compound OC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O AHWRSSLYSGLBGD-CIUDSAMLSA-N 0.000 description 4
- ZUNMTUPRQMWMHX-LSJOCFKGSA-N Asp-Val-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C(C)C)C(O)=O ZUNMTUPRQMWMHX-LSJOCFKGSA-N 0.000 description 4
- 108091003079 Bovine Serum Albumin Proteins 0.000 description 4
- 102000009410 Chemokine receptor Human genes 0.000 description 4
- 108050000299 Chemokine receptor Proteins 0.000 description 4
- LHLSSZYQFUNWRZ-NAKRPEOUSA-N Cys-Arg-Ile Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O LHLSSZYQFUNWRZ-NAKRPEOUSA-N 0.000 description 4
- LHMSYHSAAJOEBL-CIUDSAMLSA-N Cys-Lys-Asn Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O LHMSYHSAAJOEBL-CIUDSAMLSA-N 0.000 description 4
- UKHNKRGNFKSHCG-CUJWVEQBSA-N Cys-Thr-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CS)N)O UKHNKRGNFKSHCG-CUJWVEQBSA-N 0.000 description 4
- UWZLBXOBVKRUFE-HGNGGELXSA-N Gln-Ala-His Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CCC(=O)N)N UWZLBXOBVKRUFE-HGNGGELXSA-N 0.000 description 4
- BUZMZDDKFCSKOT-CIUDSAMLSA-N Glu-Glu-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O BUZMZDDKFCSKOT-CIUDSAMLSA-N 0.000 description 4
- QJCKNLPMTPXXEM-AUTRQRHGSA-N Glu-Glu-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCC(O)=O QJCKNLPMTPXXEM-AUTRQRHGSA-N 0.000 description 4
- WTMZXOPHTIVFCP-QEWYBTABSA-N Glu-Ile-Phe Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 WTMZXOPHTIVFCP-QEWYBTABSA-N 0.000 description 4
- CQGBSALYGOXQPE-HTUGSXCWSA-N Glu-Thr-Phe Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CCC(=O)O)N)O CQGBSALYGOXQPE-HTUGSXCWSA-N 0.000 description 4
- HVKAAUOFFTUSAA-XDTLVQLUSA-N Glu-Tyr-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C)C(O)=O HVKAAUOFFTUSAA-XDTLVQLUSA-N 0.000 description 4
- GDOZQTNZPCUARW-YFKPBYRVSA-N Gly-Gly-Glu Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CCC(O)=O GDOZQTNZPCUARW-YFKPBYRVSA-N 0.000 description 4
- UHPAZODVFFYEEL-QWRGUYRKSA-N Gly-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)CN UHPAZODVFFYEEL-QWRGUYRKSA-N 0.000 description 4
- WMGHDYWNHNLGBV-ONGXEEELSA-N Gly-Phe-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H](NC(=O)CN)CC1=CC=CC=C1 WMGHDYWNHNLGBV-ONGXEEELSA-N 0.000 description 4
- QSQXZZCGPXQBPP-BQBZGAKWSA-N Gly-Pro-Cys Chemical compound C1C[C@H](N(C1)C(=O)CN)C(=O)N[C@@H](CS)C(=O)O QSQXZZCGPXQBPP-BQBZGAKWSA-N 0.000 description 4
- NSVOVKWEKGEOQB-LURJTMIESA-N Gly-Pro-Gly Chemical compound NCC(=O)N1CCC[C@H]1C(=O)NCC(O)=O NSVOVKWEKGEOQB-LURJTMIESA-N 0.000 description 4
- JQFILXICXLDTRR-FBCQKBJTSA-N Gly-Thr-Gly Chemical compound NCC(=O)N[C@@H]([C@H](O)C)C(=O)NCC(O)=O JQFILXICXLDTRR-FBCQKBJTSA-N 0.000 description 4
- 108010043121 Green Fluorescent Proteins Proteins 0.000 description 4
- 102000004144 Green Fluorescent Proteins Human genes 0.000 description 4
- 241000701044 Human gammaherpesvirus 4 Species 0.000 description 4
- YNMQUIVKEFRCPH-QSFUFRPTSA-N Ile-Ile-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(=O)O)N YNMQUIVKEFRCPH-QSFUFRPTSA-N 0.000 description 4
- TVYWVSJGSHQWMT-AJNGGQMLSA-N Ile-Leu-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)O)N TVYWVSJGSHQWMT-AJNGGQMLSA-N 0.000 description 4
- 108010021625 Immunoglobulin Fragments Proteins 0.000 description 4
- HVHRPWQEQHIQJF-AVGNSLFASA-N Leu-Lys-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O HVHRPWQEQHIQJF-AVGNSLFASA-N 0.000 description 4
- WSXTWLJHTLRFLW-SRVKXCTJSA-N Lys-Ala-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCCN)C(O)=O WSXTWLJHTLRFLW-SRVKXCTJSA-N 0.000 description 4
- CLBGMWIYPYAZPR-AVGNSLFASA-N Lys-Arg-Arg Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O CLBGMWIYPYAZPR-AVGNSLFASA-N 0.000 description 4
- ODTZHNZPINULEU-KKUMJFAQSA-N Lys-Phe-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCCCN)N ODTZHNZPINULEU-KKUMJFAQSA-N 0.000 description 4
- GILLQRYAWOMHED-DCAQKATOSA-N Lys-Val-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCCN GILLQRYAWOMHED-DCAQKATOSA-N 0.000 description 4
- RIPJMCFGQHGHNP-RHYQMDGZSA-N Lys-Val-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CCCCN)N)O RIPJMCFGQHGHNP-RHYQMDGZSA-N 0.000 description 4
- KSPIYJQBLVDRRI-UHFFFAOYSA-N N-methylisoleucine Chemical compound CCC(C)C(NC)C(O)=O KSPIYJQBLVDRRI-UHFFFAOYSA-N 0.000 description 4
- 206010028980 Neoplasm Diseases 0.000 description 4
- UUWCIPUVJJIEEP-SRVKXCTJSA-N Phe-Asn-Cys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CS)C(=O)O)N UUWCIPUVJJIEEP-SRVKXCTJSA-N 0.000 description 4
- MRNRMSDVVSKPGM-AVGNSLFASA-N Phe-Asn-Gln Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O MRNRMSDVVSKPGM-AVGNSLFASA-N 0.000 description 4
- UEHNWRNADDPYNK-DLOVCJGASA-N Phe-Cys-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CC1=CC=CC=C1)N UEHNWRNADDPYNK-DLOVCJGASA-N 0.000 description 4
- JHSRGEODDALISP-XVSYOHENSA-N Phe-Thr-Asn Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(O)=O JHSRGEODDALISP-XVSYOHENSA-N 0.000 description 4
- QTDBZORPVYTRJU-KKXDTOCCSA-N Phe-Tyr-Ala Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C)C(O)=O QTDBZORPVYTRJU-KKXDTOCCSA-N 0.000 description 4
- CVAUVSOFHJKCHN-BZSNNMDCSA-N Phe-Tyr-Cys Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)N[C@@H](CS)C(O)=O)C1=CC=CC=C1 CVAUVSOFHJKCHN-BZSNNMDCSA-N 0.000 description 4
- FKVNLUZHSFCNGY-RVMXOQNASA-N Pro-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@@H]2CCCN2 FKVNLUZHSFCNGY-RVMXOQNASA-N 0.000 description 4
- FHJQROWZEJFZPO-SRVKXCTJSA-N Pro-Val-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H]1CCCN1 FHJQROWZEJFZPO-SRVKXCTJSA-N 0.000 description 4
- LALNXSXEYFUUDD-GUBZILKMSA-N Ser-Glu-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O LALNXSXEYFUUDD-GUBZILKMSA-N 0.000 description 4
- YMTLKLXDFCSCNX-BYPYZUCNSA-N Ser-Gly-Gly Chemical compound OC[C@H](N)C(=O)NCC(=O)NCC(O)=O YMTLKLXDFCSCNX-BYPYZUCNSA-N 0.000 description 4
- SRSPTFBENMJHMR-WHFBIAKZSA-N Ser-Ser-Gly Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O SRSPTFBENMJHMR-WHFBIAKZSA-N 0.000 description 4
- SZRNDHWMVSFPSP-XKBZYTNZSA-N Ser-Thr-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CO)N)O SZRNDHWMVSFPSP-XKBZYTNZSA-N 0.000 description 4
- KKKVOZNCLALMPV-XKBZYTNZSA-N Ser-Thr-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O KKKVOZNCLALMPV-XKBZYTNZSA-N 0.000 description 4
- PQLXHSACXPGWPD-GSSVUCPTSA-N Thr-Asn-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PQLXHSACXPGWPD-GSSVUCPTSA-N 0.000 description 4
- OYTNZCBFDXGQGE-XQXXSGGOSA-N Thr-Gln-Ala Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](C)C(=O)O)N)O OYTNZCBFDXGQGE-XQXXSGGOSA-N 0.000 description 4
- QQWNRERCGGZOKG-WEDXCCLWSA-N Thr-Gly-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O QQWNRERCGGZOKG-WEDXCCLWSA-N 0.000 description 4
- MECLEFZMPPOEAC-VOAKCMCISA-N Thr-Leu-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)O)N)O MECLEFZMPPOEAC-VOAKCMCISA-N 0.000 description 4
- KRDSCBLRHORMRK-JXUBOQSCSA-N Thr-Lys-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O KRDSCBLRHORMRK-JXUBOQSCSA-N 0.000 description 4
- WRQLCVIALDUQEQ-UNQGMJICSA-N Thr-Phe-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O WRQLCVIALDUQEQ-UNQGMJICSA-N 0.000 description 4
- QGVBFDIREUUSHX-IFFSRLJSSA-N Thr-Val-Gln Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O QGVBFDIREUUSHX-IFFSRLJSSA-N 0.000 description 4
- ADBDQGBDNUTRDB-ULQDDVLXSA-N Tyr-Arg-Leu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O ADBDQGBDNUTRDB-ULQDDVLXSA-N 0.000 description 4
- SMLCYZYQFRTLCO-UWJYBYFXSA-N Tyr-Cys-Ala Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CS)C(=O)N[C@@H](C)C(O)=O SMLCYZYQFRTLCO-UWJYBYFXSA-N 0.000 description 4
- CNNVVEPJTFOGHI-ACRUOGEOSA-N Tyr-Lys-Tyr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O CNNVVEPJTFOGHI-ACRUOGEOSA-N 0.000 description 4
- ZLFHAAGHGQBQQN-GUBZILKMSA-N Val-Ala-Pro Natural products CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N1CCC[C@H]1C(O)=O ZLFHAAGHGQBQQN-GUBZILKMSA-N 0.000 description 4
- NWDOPHYLSORNEX-QXEWZRGKSA-N Val-Asn-Met Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCSC)C(=O)O)N NWDOPHYLSORNEX-QXEWZRGKSA-N 0.000 description 4
- CPTQYHDSVGVGDZ-UKJIMTQDSA-N Val-Gln-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](C(C)C)N CPTQYHDSVGVGDZ-UKJIMTQDSA-N 0.000 description 4
- APQIVBCUIUDSMB-OSUNSFLBSA-N Val-Ile-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](C(C)C)N APQIVBCUIUDSMB-OSUNSFLBSA-N 0.000 description 4
- YMTOEGGOCHVGEH-IHRRRGAJSA-N Val-Lys-Lys Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(O)=O YMTOEGGOCHVGEH-IHRRRGAJSA-N 0.000 description 4
- QZKVWWIUSQGWMY-IHRRRGAJSA-N Val-Ser-Phe Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 QZKVWWIUSQGWMY-IHRRRGAJSA-N 0.000 description 4
- LLJLBRRXKZTTRD-GUBZILKMSA-N Val-Val-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(=O)O)N LLJLBRRXKZTTRD-GUBZILKMSA-N 0.000 description 4
- 108010070944 alanylhistidine Proteins 0.000 description 4
- KOSRFJWDECSPRO-UHFFFAOYSA-N alpha-L-glutamyl-L-glutamic acid Natural products OC(=O)CCC(N)C(=O)NC(CCC(O)=O)C(O)=O KOSRFJWDECSPRO-UHFFFAOYSA-N 0.000 description 4
- 230000000692 anti-sense effect Effects 0.000 description 4
- 239000007864 aqueous solution Substances 0.000 description 4
- 230000008901 benefit Effects 0.000 description 4
- 230000004071 biological effect Effects 0.000 description 4
- 229940098773 bovine serum albumin Drugs 0.000 description 4
- 238000010367 cloning Methods 0.000 description 4
- 108010016616 cysteinylglycine Proteins 0.000 description 4
- 230000002950 deficient Effects 0.000 description 4
- 238000013461 design Methods 0.000 description 4
- 239000000975 dye Substances 0.000 description 4
- 108010055341 glutamyl-glutamic acid Proteins 0.000 description 4
- 108010015792 glycyllysine Proteins 0.000 description 4
- 239000005090 green fluorescent protein Substances 0.000 description 4
- 108010036413 histidylglycine Proteins 0.000 description 4
- 108010018006 histidylserine Proteins 0.000 description 4
- 239000007927 intramuscular injection Substances 0.000 description 4
- 108010045069 keyhole-limpet hemocyanin Proteins 0.000 description 4
- 108010053037 kyotorphin Proteins 0.000 description 4
- 238000007834 ligase chain reaction Methods 0.000 description 4
- 125000005647 linker group Chemical group 0.000 description 4
- 108010054155 lysyllysine Proteins 0.000 description 4
- 108010038320 lysylphenylalanine Proteins 0.000 description 4
- 229920002521 macromolecule Polymers 0.000 description 4
- 230000001404 mediated effect Effects 0.000 description 4
- 239000000178 monomer Substances 0.000 description 4
- 230000003287 optical effect Effects 0.000 description 4
- 238000002360 preparation method Methods 0.000 description 4
- 230000002265 prevention Effects 0.000 description 4
- 239000000047 product Substances 0.000 description 4
- 230000010076 replication Effects 0.000 description 4
- 229920005989 resin Polymers 0.000 description 4
- 239000011347 resin Substances 0.000 description 4
- 238000003757 reverse transcription PCR Methods 0.000 description 4
- 238000001890 transfection Methods 0.000 description 4
- 230000014616 translation Effects 0.000 description 4
- 241001430294 unidentified retrovirus Species 0.000 description 4
- PECYZEOJVXMISF-UHFFFAOYSA-N 3-aminoalanine Chemical compound [NH3+]CC(N)C([O-])=O PECYZEOJVXMISF-UHFFFAOYSA-N 0.000 description 3
- DXTYEWAQOXYRHZ-KKXDTOCCSA-N Ala-Phe-Tyr Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O)N DXTYEWAQOXYRHZ-KKXDTOCCSA-N 0.000 description 3
- WNHNMKOFKCHKKD-BFHQHQDPSA-N Ala-Thr-Gly Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O WNHNMKOFKCHKKD-BFHQHQDPSA-N 0.000 description 3
- FEZJJKXNPSEYEV-CIUDSAMLSA-N Arg-Gln-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O FEZJJKXNPSEYEV-CIUDSAMLSA-N 0.000 description 3
- ATABBWFGOHKROJ-GUBZILKMSA-N Arg-Pro-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O ATABBWFGOHKROJ-GUBZILKMSA-N 0.000 description 3
- UBKOVSLDWIHYSY-ACZMJKKPSA-N Asn-Glu-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O UBKOVSLDWIHYSY-ACZMJKKPSA-N 0.000 description 3
- GFFRWIJAFFMQGM-NUMRIWBASA-N Asn-Glu-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GFFRWIJAFFMQGM-NUMRIWBASA-N 0.000 description 3
- QGABLMITFKUQDF-DCAQKATOSA-N Asn-Met-Lys Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)N)N QGABLMITFKUQDF-DCAQKATOSA-N 0.000 description 3
- BKFXFUPYETWGGA-XVSYOHENSA-N Asn-Phe-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O BKFXFUPYETWGGA-XVSYOHENSA-N 0.000 description 3
- SNYCNNPOFYBCEK-ZLUOBGJFSA-N Asn-Ser-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O SNYCNNPOFYBCEK-ZLUOBGJFSA-N 0.000 description 3
- QYRMBFWDSFGSFC-OLHMAJIHSA-N Asn-Thr-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N)O QYRMBFWDSFGSFC-OLHMAJIHSA-N 0.000 description 3
- FLJVGAFLZVBBNG-BPUTZDHNSA-N Asn-Trp-Arg Chemical compound N[C@@H](CC(=O)N)C(=O)N[C@@H](Cc1c[nH]c2ccccc12)C(=O)N[C@@H](CCCNC(=N)N)C(=O)O FLJVGAFLZVBBNG-BPUTZDHNSA-N 0.000 description 3
- PQKSVQSMTHPRIB-ZKWXMUAHSA-N Asn-Val-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O PQKSVQSMTHPRIB-ZKWXMUAHSA-N 0.000 description 3
- SPKRHJOVRVDJGG-CIUDSAMLSA-N Asp-Gln-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC(=O)O)N SPKRHJOVRVDJGG-CIUDSAMLSA-N 0.000 description 3
- NVFSJIXJZCDICF-SRVKXCTJSA-N Asp-Lys-Lys Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)O)N NVFSJIXJZCDICF-SRVKXCTJSA-N 0.000 description 3
- MYLZFUMPZCPJCJ-NHCYSSNCSA-N Asp-Lys-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O MYLZFUMPZCPJCJ-NHCYSSNCSA-N 0.000 description 3
- DJCAHYVLMSRBFR-QXEWZRGKSA-N Asp-Met-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CCSC)NC(=O)[C@@H](N)CC(O)=O DJCAHYVLMSRBFR-QXEWZRGKSA-N 0.000 description 3
- RSMZEHCMIOKNMW-GSSVUCPTSA-N Asp-Thr-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O RSMZEHCMIOKNMW-GSSVUCPTSA-N 0.000 description 3
- 108700010070 Codon Usage Proteins 0.000 description 3
- YNJBLTDKTMKEET-ZLUOBGJFSA-N Cys-Ser-Ser Chemical compound SC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O YNJBLTDKTMKEET-ZLUOBGJFSA-N 0.000 description 3
- 102000004127 Cytokines Human genes 0.000 description 3
- 108090000695 Cytokines Proteins 0.000 description 3
- 241000702421 Dependoparvovirus Species 0.000 description 3
- BWGNESOTFCXPMA-UHFFFAOYSA-N Dihydrogen disulfide Chemical compound SS BWGNESOTFCXPMA-UHFFFAOYSA-N 0.000 description 3
- 238000002965 ELISA Methods 0.000 description 3
- 241000196324 Embryophyta Species 0.000 description 3
- 102000004190 Enzymes Human genes 0.000 description 3
- 108090000790 Enzymes Proteins 0.000 description 3
- 241000588724 Escherichia coli Species 0.000 description 3
- RGAOLBZBLOJUTP-GRLWGSQLSA-N Gln-Ile-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)NC(=O)[C@H](CCC(=O)N)N RGAOLBZBLOJUTP-GRLWGSQLSA-N 0.000 description 3
- AQPZYBSRDRZBAG-AVGNSLFASA-N Gln-Phe-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCC(=O)N)N AQPZYBSRDRZBAG-AVGNSLFASA-N 0.000 description 3
- BPDVTFBJZNBHEU-HGNGGELXSA-N Glu-Ala-His Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 BPDVTFBJZNBHEU-HGNGGELXSA-N 0.000 description 3
- RFDHKPSHTXZKLL-IHRRRGAJSA-N Glu-Gln-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCC(=O)O)N RFDHKPSHTXZKLL-IHRRRGAJSA-N 0.000 description 3
- UHVIQGKBMXEVGN-WDSKDSINSA-N Glu-Gly-Asn Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O UHVIQGKBMXEVGN-WDSKDSINSA-N 0.000 description 3
- NJCALAAIGREHDR-WDCWCFNPSA-N Glu-Leu-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O NJCALAAIGREHDR-WDCWCFNPSA-N 0.000 description 3
- ILWHFUZZCFYSKT-AVGNSLFASA-N Glu-Lys-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O ILWHFUZZCFYSKT-AVGNSLFASA-N 0.000 description 3
- SYAYROHMAIHWFB-KBIXCLLPSA-N Glu-Ser-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O SYAYROHMAIHWFB-KBIXCLLPSA-N 0.000 description 3
- LZEUDRYSAZAJIO-AUTRQRHGSA-N Glu-Val-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O LZEUDRYSAZAJIO-AUTRQRHGSA-N 0.000 description 3
- HQTDNEZTGZUWSY-XVKPBYJWSA-N Glu-Val-Gly Chemical compound CC(C)[C@H](NC(=O)[C@@H](N)CCC(O)=O)C(=O)NCC(O)=O HQTDNEZTGZUWSY-XVKPBYJWSA-N 0.000 description 3
- LXXANCRPFBSSKS-IUCAKERBSA-N Gly-Gln-Leu Chemical compound [H]NCC(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O LXXANCRPFBSSKS-IUCAKERBSA-N 0.000 description 3
- STVHDEHTKFXBJQ-LAEOZQHASA-N Gly-Glu-Ile Chemical compound [H]NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O STVHDEHTKFXBJQ-LAEOZQHASA-N 0.000 description 3
- KMSGYZQRXPUKGI-BYPYZUCNSA-N Gly-Gly-Asn Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CC(N)=O KMSGYZQRXPUKGI-BYPYZUCNSA-N 0.000 description 3
- SXJHOPPTOJACOA-QXEWZRGKSA-N Gly-Ile-Arg Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CCCN=C(N)N SXJHOPPTOJACOA-QXEWZRGKSA-N 0.000 description 3
- IALQAMYQJBZNSK-WHFBIAKZSA-N Gly-Ser-Asn Chemical compound [H]NCC(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O IALQAMYQJBZNSK-WHFBIAKZSA-N 0.000 description 3
- DBUNZBWUWCIELX-JHEQGTHGSA-N Gly-Thr-Glu Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O DBUNZBWUWCIELX-JHEQGTHGSA-N 0.000 description 3
- RVKIPWVMZANZLI-UHFFFAOYSA-N H-Lys-Trp-OH Natural products C1=CC=C2C(CC(NC(=O)C(N)CCCCN)C(O)=O)=CNC2=C1 RVKIPWVMZANZLI-UHFFFAOYSA-N 0.000 description 3
- DXUJSRIVSWEOAG-NAKRPEOUSA-N Ile-Arg-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CS)C(=O)O)N DXUJSRIVSWEOAG-NAKRPEOUSA-N 0.000 description 3
- IPYVXYDYLHVWHU-GMOBBJLQSA-N Ile-Asn-Met Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCSC)C(=O)O)N IPYVXYDYLHVWHU-GMOBBJLQSA-N 0.000 description 3
- KOPIAUWNLKKELG-SIGLWIIPSA-N Ile-His-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N KOPIAUWNLKKELG-SIGLWIIPSA-N 0.000 description 3
- VNDQNDYEPSXHLU-JUKXBJQTSA-N Ile-His-Tyr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O)N VNDQNDYEPSXHLU-JUKXBJQTSA-N 0.000 description 3
- RFMDODRWJZHZCR-BJDJZHNGSA-N Ile-Lys-Cys Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CS)C(O)=O RFMDODRWJZHZCR-BJDJZHNGSA-N 0.000 description 3
- NURNJECQNNCRBK-FLBSBUHZSA-N Ile-Thr-Thr Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O NURNJECQNNCRBK-FLBSBUHZSA-N 0.000 description 3
- ZSESFIFAYQEKRD-CYDGBPFRSA-N Ile-Val-Met Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCSC)C(=O)O)N ZSESFIFAYQEKRD-CYDGBPFRSA-N 0.000 description 3
- 108060003951 Immunoglobulin Proteins 0.000 description 3
- 102000008394 Immunoglobulin Fragments Human genes 0.000 description 3
- LZDNBBYBDGBADK-UHFFFAOYSA-N L-valyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)C(C)C)C(O)=O)=CNC2=C1 LZDNBBYBDGBADK-UHFFFAOYSA-N 0.000 description 3
- CQQGCWPXDHTTNF-GUBZILKMSA-N Leu-Ala-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCC(O)=O CQQGCWPXDHTTNF-GUBZILKMSA-N 0.000 description 3
- HASRFYOMVPJRPU-SRVKXCTJSA-N Leu-Arg-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCC(O)=O)C(O)=O HASRFYOMVPJRPU-SRVKXCTJSA-N 0.000 description 3
- WXUOJXIGOPMDJM-SRVKXCTJSA-N Leu-Lys-Asn Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O WXUOJXIGOPMDJM-SRVKXCTJSA-N 0.000 description 3
- ZGUMORRUBUCXEH-AVGNSLFASA-N Leu-Lys-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(O)=O ZGUMORRUBUCXEH-AVGNSLFASA-N 0.000 description 3
- 108060001084 Luciferase Proteins 0.000 description 3
- 239000005089 Luciferase Substances 0.000 description 3
- WLCYCADOWRMSAJ-CIUDSAMLSA-N Lys-Asn-Cys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CS)C(O)=O WLCYCADOWRMSAJ-CIUDSAMLSA-N 0.000 description 3
- KYNNSEJZFVCDIV-ZPFDUUQYSA-N Lys-Ile-Asn Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O KYNNSEJZFVCDIV-ZPFDUUQYSA-N 0.000 description 3
- OVAOHZIOUBEQCJ-IHRRRGAJSA-N Lys-Leu-Arg Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O OVAOHZIOUBEQCJ-IHRRRGAJSA-N 0.000 description 3
- 102000018697 Membrane Proteins Human genes 0.000 description 3
- 108010052285 Membrane Proteins Proteins 0.000 description 3
- JCMMNFZUKMMECJ-DCAQKATOSA-N Met-Lys-Asn Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O JCMMNFZUKMMECJ-DCAQKATOSA-N 0.000 description 3
- 241000699666 Mus <mouse, genus> Species 0.000 description 3
- WUGMRIBZSVSJNP-UHFFFAOYSA-N N-L-alanyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)C)C(O)=O)=CNC2=C1 WUGMRIBZSVSJNP-UHFFFAOYSA-N 0.000 description 3
- 108010066427 N-valyltryptophan Proteins 0.000 description 3
- 108010087066 N2-tryptophyllysine Proteins 0.000 description 3
- 108010092030 NAKTIIVQL nanopeptide Proteins 0.000 description 3
- 241000283973 Oryctolagus cuniculus Species 0.000 description 3
- QCHNRQQVLJYDSI-DLOVCJGASA-N Phe-Asn-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 QCHNRQQVLJYDSI-DLOVCJGASA-N 0.000 description 3
- WWAQEUOYCYMGHB-FXQIFTODSA-N Pro-Asn-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H]1CCCN1 WWAQEUOYCYMGHB-FXQIFTODSA-N 0.000 description 3
- AQGUSRZKDZYGGV-GMOBBJLQSA-N Pro-Ile-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O AQGUSRZKDZYGGV-GMOBBJLQSA-N 0.000 description 3
- PKHDJFHFMGQMPS-RCWTZXSCSA-N Pro-Thr-Arg Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O PKHDJFHFMGQMPS-RCWTZXSCSA-N 0.000 description 3
- JPIDMRXXNMIVKY-VZFHVOOUSA-N Ser-Ala-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O JPIDMRXXNMIVKY-VZFHVOOUSA-N 0.000 description 3
- PURRNJBBXDDWLX-ZDLURKLDSA-N Ser-Thr-Gly Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CO)N)O PURRNJBBXDDWLX-ZDLURKLDSA-N 0.000 description 3
- XUIMIQQOPSSXEZ-UHFFFAOYSA-N Silicon Chemical compound [Si] XUIMIQQOPSSXEZ-UHFFFAOYSA-N 0.000 description 3
- CTONFVDJYCAMQM-IUKAMOBKSA-N Thr-Asn-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H]([C@@H](C)O)N CTONFVDJYCAMQM-IUKAMOBKSA-N 0.000 description 3
- PZVGOVRNGKEFCB-KKHAAJSZSA-N Thr-Asn-Val Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](C(C)C)C(=O)O)N)O PZVGOVRNGKEFCB-KKHAAJSZSA-N 0.000 description 3
- MFEBUIFJVPNZLO-OLHMAJIHSA-N Thr-Asp-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O MFEBUIFJVPNZLO-OLHMAJIHSA-N 0.000 description 3
- AQAMPXBRJJWPNI-JHEQGTHGSA-N Thr-Gly-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O AQAMPXBRJJWPNI-JHEQGTHGSA-N 0.000 description 3
- MSIYNSBKKVMGFO-BHNWBGBOSA-N Thr-Gly-Pro Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)N1CCC[C@@H]1C(=O)O)N)O MSIYNSBKKVMGFO-BHNWBGBOSA-N 0.000 description 3
- GXUWHVZYDAHFSV-FLBSBUHZSA-N Thr-Ile-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GXUWHVZYDAHFSV-FLBSBUHZSA-N 0.000 description 3
- NCXVJIQMWSGRHY-KXNHARMFSA-N Thr-Leu-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N)O NCXVJIQMWSGRHY-KXNHARMFSA-N 0.000 description 3
- SPVHQURZJCUDQC-VOAKCMCISA-N Thr-Lys-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O SPVHQURZJCUDQC-VOAKCMCISA-N 0.000 description 3
- JWQNAFHCXKVZKZ-UVOCVTCTSA-N Thr-Lys-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O JWQNAFHCXKVZKZ-UVOCVTCTSA-N 0.000 description 3
- NQQMWWVVGIXUOX-SVSWQMSJSA-N Thr-Ser-Ile Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O NQQMWWVVGIXUOX-SVSWQMSJSA-N 0.000 description 3
- AAZOYLQUEQRUMZ-GSSVUCPTSA-N Thr-Thr-Asn Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(N)=O AAZOYLQUEQRUMZ-GSSVUCPTSA-N 0.000 description 3
- UQCNIMDPYICBTR-KYNKHSRBSA-N Thr-Thr-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O UQCNIMDPYICBTR-KYNKHSRBSA-N 0.000 description 3
- NHQVWACSJZJCGJ-FLBSBUHZSA-N Thr-Thr-Ile Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O NHQVWACSJZJCGJ-FLBSBUHZSA-N 0.000 description 3
- BBPCSGKKPJUYRB-UVOCVTCTSA-N Thr-Thr-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O BBPCSGKKPJUYRB-UVOCVTCTSA-N 0.000 description 3
- ZMYCLHFLHRVOEA-HEIBUPTGSA-N Thr-Thr-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O ZMYCLHFLHRVOEA-HEIBUPTGSA-N 0.000 description 3
- PTAWAMWPRFTACW-SZMVWBNQSA-N Trp-Gln-Lys Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N PTAWAMWPRFTACW-SZMVWBNQSA-N 0.000 description 3
- NOXKHHXSHQFSGJ-FQPOAREZSA-N Tyr-Ala-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 NOXKHHXSHQFSGJ-FQPOAREZSA-N 0.000 description 3
- XBWKCYFGRXKWGO-SRVKXCTJSA-N Tyr-Cys-Asn Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(N)=O)C(O)=O XBWKCYFGRXKWGO-SRVKXCTJSA-N 0.000 description 3
- XKDOQXAXKFQWQJ-SRVKXCTJSA-N Tyr-Cys-Asp Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(=O)O)C(=O)O)N)O XKDOQXAXKFQWQJ-SRVKXCTJSA-N 0.000 description 3
- KLQPIEVIKOQRAW-IZPVPAKOSA-N Tyr-Thr-Thr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N)O KLQPIEVIKOQRAW-IZPVPAKOSA-N 0.000 description 3
- HNWQUBBOBKSFQV-AVGNSLFASA-N Val-Arg-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N HNWQUBBOBKSFQV-AVGNSLFASA-N 0.000 description 3
- VFOHXOLPLACADK-GVXVVHGQSA-N Val-Gln-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](C(C)C)N VFOHXOLPLACADK-GVXVVHGQSA-N 0.000 description 3
- SBJCTAZFSZXWSR-AVGNSLFASA-N Val-Met-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N SBJCTAZFSZXWSR-AVGNSLFASA-N 0.000 description 3
- NZGOVKLVQNOEKP-YDHLFZDLSA-N Val-Phe-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(=O)N)C(=O)O)N NZGOVKLVQNOEKP-YDHLFZDLSA-N 0.000 description 3
- 108020000999 Viral RNA Proteins 0.000 description 3
- QWCKQJZIFLGMSD-UHFFFAOYSA-N alpha-aminobutyric acid Chemical compound CCC(N)C(O)=O QWCKQJZIFLGMSD-UHFFFAOYSA-N 0.000 description 3
- 108010009111 arginyl-glycyl-glutamic acid Proteins 0.000 description 3
- 238000003491 array Methods 0.000 description 3
- 108010040443 aspartyl-aspartic acid Proteins 0.000 description 3
- 108010047857 aspartylglycine Proteins 0.000 description 3
- 230000005540 biological transmission Effects 0.000 description 3
- 239000001506 calcium phosphate Substances 0.000 description 3
- 229910000389 calcium phosphate Inorganic materials 0.000 description 3
- 235000011010 calcium phosphates Nutrition 0.000 description 3
- 201000011510 cancer Diseases 0.000 description 3
- 125000002091 cationic group Chemical group 0.000 description 3
- 239000003153 chemical reaction reagent Substances 0.000 description 3
- 210000001151 cytotoxic T lymphocyte Anatomy 0.000 description 3
- 238000012217 deletion Methods 0.000 description 3
- 230000037430 deletion Effects 0.000 description 3
- 210000004443 dendritic cell Anatomy 0.000 description 3
- 201000010099 disease Diseases 0.000 description 3
- 208000037265 diseases, disorders, signs and symptoms Diseases 0.000 description 3
- 210000002472 endoplasmic reticulum Anatomy 0.000 description 3
- 229940088598 enzyme Drugs 0.000 description 3
- 238000002474 experimental method Methods 0.000 description 3
- 238000004108 freeze drying Methods 0.000 description 3
- 239000000499 gel Substances 0.000 description 3
- 239000011521 glass Substances 0.000 description 3
- 108010078144 glutaminyl-glycine Proteins 0.000 description 3
- PCHJSUWPFVWCPO-UHFFFAOYSA-N gold Chemical compound [Au] PCHJSUWPFVWCPO-UHFFFAOYSA-N 0.000 description 3
- 230000001900 immune effect Effects 0.000 description 3
- 102000018358 immunoglobulin Human genes 0.000 description 3
- 230000000977 initiatory effect Effects 0.000 description 3
- 238000011005 laboratory method Methods 0.000 description 3
- 239000003446 ligand Substances 0.000 description 3
- 239000011159 matrix material Substances 0.000 description 3
- 229910052751 metal Inorganic materials 0.000 description 3
- 239000002184 metal Substances 0.000 description 3
- 108010005942 methionylglycine Proteins 0.000 description 3
- 210000003205 muscle Anatomy 0.000 description 3
- 230000035772 mutation Effects 0.000 description 3
- 235000015097 nutrients Nutrition 0.000 description 3
- 230000000750 progressive effect Effects 0.000 description 3
- 210000001236 prokaryotic cell Anatomy 0.000 description 3
- 238000011160 research Methods 0.000 description 3
- 230000004044 response Effects 0.000 description 3
- 230000001177 retroviral effect Effects 0.000 description 3
- 230000002441 reversible effect Effects 0.000 description 3
- 230000028327 secretion Effects 0.000 description 3
- 230000035945 sensitivity Effects 0.000 description 3
- 210000002966 serum Anatomy 0.000 description 3
- 229910052710 silicon Inorganic materials 0.000 description 3
- 239000010703 silicon Substances 0.000 description 3
- 238000002741 site-directed mutagenesis Methods 0.000 description 3
- 108010005652 splenotritin Proteins 0.000 description 3
- 238000010561 standard procedure Methods 0.000 description 3
- 239000000126 substance Substances 0.000 description 3
- 230000008685 targeting Effects 0.000 description 3
- 230000001225 therapeutic effect Effects 0.000 description 3
- QORWJWZARLRLPR-UHFFFAOYSA-H tricalcium bis(phosphate) Chemical compound [Ca+2].[Ca+2].[Ca+2].[O-]P([O-])([O-])=O.[O-]P([O-])([O-])=O QORWJWZARLRLPR-UHFFFAOYSA-H 0.000 description 3
- CWFMWBHMIMNZLN-NAKRPEOUSA-N (2s)-1-[(2s)-2-[[(2s,3s)-2-amino-3-methylpentanoyl]amino]propanoyl]pyrrolidine-2-carboxylic acid Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C)C(=O)N1CCC[C@H]1C(O)=O CWFMWBHMIMNZLN-NAKRPEOUSA-N 0.000 description 2
- BVAUMRCGVHUWOZ-ZETCQYMHSA-N (2s)-2-(cyclohexylazaniumyl)propanoate Chemical compound OC(=O)[C@H](C)NC1CCCCC1 BVAUMRCGVHUWOZ-ZETCQYMHSA-N 0.000 description 2
- FUOOLUPWFVMBKG-UHFFFAOYSA-N 2-Aminoisobutyric acid Chemical compound CC(C)(N)C(O)=O FUOOLUPWFVMBKG-UHFFFAOYSA-N 0.000 description 2
- OYIFNHCXNCRBQI-UHFFFAOYSA-N 2-aminoadipic acid Chemical compound OC(=O)C(N)CCCC(O)=O OYIFNHCXNCRBQI-UHFFFAOYSA-N 0.000 description 2
- RDFMDVXONNIGBC-UHFFFAOYSA-N 2-aminoheptanoic acid Chemical compound CCCCCC(N)C(O)=O RDFMDVXONNIGBC-UHFFFAOYSA-N 0.000 description 2
- BCHZICNRHXRCHY-UHFFFAOYSA-N 2h-oxazine Chemical compound N1OC=CC=C1 BCHZICNRHXRCHY-UHFFFAOYSA-N 0.000 description 2
- GOLORTLGFDVFDW-UHFFFAOYSA-N 3-(1h-benzimidazol-2-yl)-7-(diethylamino)chromen-2-one Chemical compound C1=CC=C2NC(C3=CC4=CC=C(C=C4OC3=O)N(CC)CC)=NC2=C1 GOLORTLGFDVFDW-UHFFFAOYSA-N 0.000 description 2
- KDCGOANMDULRCW-UHFFFAOYSA-N 7H-purine Chemical compound N1=CNC2=NC=NC2=C1 KDCGOANMDULRCW-UHFFFAOYSA-N 0.000 description 2
- UCIYCBSJBQGDGM-LPEHRKFASA-N Ala-Arg-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N1CCC[C@@H]1C(=O)O)N UCIYCBSJBQGDGM-LPEHRKFASA-N 0.000 description 2
- RZZMZYZXNJRPOJ-BJDJZHNGSA-N Ala-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](C)N RZZMZYZXNJRPOJ-BJDJZHNGSA-N 0.000 description 2
- PHQXWZGXKAFWAZ-ZLIFDBKOSA-N Ala-Trp-Lys Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](N)C)C(=O)N[C@@H](CCCCN)C(O)=O)=CNC2=C1 PHQXWZGXKAFWAZ-ZLIFDBKOSA-N 0.000 description 2
- ZCUFMRIQCPNOHZ-NRPADANISA-N Ala-Val-Gln Chemical compound C[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N ZCUFMRIQCPNOHZ-NRPADANISA-N 0.000 description 2
- 241000024188 Andala Species 0.000 description 2
- SBVJJNJLFWSJOV-UBHSHLNASA-N Arg-Ala-Phe Chemical compound NC(=N)NCCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 SBVJJNJLFWSJOV-UBHSHLNASA-N 0.000 description 2
- GIVATXIGCXFQQA-FXQIFTODSA-N Arg-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCN=C(N)N GIVATXIGCXFQQA-FXQIFTODSA-N 0.000 description 2
- BHSYMWWMVRPCPA-CYDGBPFRSA-N Arg-Arg-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@@H](N)CCCN=C(N)N BHSYMWWMVRPCPA-CYDGBPFRSA-N 0.000 description 2
- ITVINTQUZMQWJR-QXEWZRGKSA-N Arg-Asn-Val Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O ITVINTQUZMQWJR-QXEWZRGKSA-N 0.000 description 2
- XVLLUZMFSAYKJV-GUBZILKMSA-N Arg-Asp-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O XVLLUZMFSAYKJV-GUBZILKMSA-N 0.000 description 2
- MZRBYBIQTIKERR-GUBZILKMSA-N Arg-Glu-Gln Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O MZRBYBIQTIKERR-GUBZILKMSA-N 0.000 description 2
- YNSGXDWWPCGGQS-YUMQZZPRSA-N Arg-Gly-Gln Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)NCC(=O)N[C@@H](CCC(N)=O)C(O)=O YNSGXDWWPCGGQS-YUMQZZPRSA-N 0.000 description 2
- AUFHLLPVPSMEOG-YUMQZZPRSA-N Arg-Gly-Glu Chemical compound NC(N)=NCCC[C@H](N)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O AUFHLLPVPSMEOG-YUMQZZPRSA-N 0.000 description 2
- AGVNTAUPLWIQEN-ZPFDUUQYSA-N Arg-Ile-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N AGVNTAUPLWIQEN-ZPFDUUQYSA-N 0.000 description 2
- YKZJPIPFKGYHKY-DCAQKATOSA-N Arg-Leu-Asp Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O YKZJPIPFKGYHKY-DCAQKATOSA-N 0.000 description 2
- YVTHEZNOKSAWRW-DCAQKATOSA-N Arg-Lys-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O YVTHEZNOKSAWRW-DCAQKATOSA-N 0.000 description 2
- NGTYEHIRESTSRX-UWVGGRQHSA-N Arg-Lys-Gly Chemical compound NCCCC[C@@H](C(=O)NCC(O)=O)NC(=O)[C@@H](N)CCCN=C(N)N NGTYEHIRESTSRX-UWVGGRQHSA-N 0.000 description 2
- CIWBSHSKHKDKBQ-JLAZNSOCSA-N Ascorbic acid Chemical compound OC[C@H](O)[C@H]1OC(=O)C(O)=C1O CIWBSHSKHKDKBQ-JLAZNSOCSA-N 0.000 description 2
- NVGWESORMHFISY-SRVKXCTJSA-N Asn-Asn-Phe Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O NVGWESORMHFISY-SRVKXCTJSA-N 0.000 description 2
- QISZHYWZHJRDAO-CIUDSAMLSA-N Asn-Asp-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC(=O)N)N QISZHYWZHJRDAO-CIUDSAMLSA-N 0.000 description 2
- QRHYAUYXBVVDSB-LKXGYXEUSA-N Asn-Cys-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CS)C(=O)N[C@@H]([C@@H](C)O)C(O)=O QRHYAUYXBVVDSB-LKXGYXEUSA-N 0.000 description 2
- CTQIOCMSIJATNX-WHFBIAKZSA-N Asn-Gly-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](C)C(O)=O CTQIOCMSIJATNX-WHFBIAKZSA-N 0.000 description 2
- OLISTMZJGQUOGS-GMOBBJLQSA-N Asn-Ile-Arg Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N OLISTMZJGQUOGS-GMOBBJLQSA-N 0.000 description 2
- KHCNTVRVAYCPQE-CIUDSAMLSA-N Asn-Lys-Asn Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O KHCNTVRVAYCPQE-CIUDSAMLSA-N 0.000 description 2
- KDFQZBWWPYQBEN-ZLUOBGJFSA-N Asp-Ala-Asn Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)O)N KDFQZBWWPYQBEN-ZLUOBGJFSA-N 0.000 description 2
- YNQIDCRRTWGHJD-ZLUOBGJFSA-N Asp-Asn-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CC(O)=O YNQIDCRRTWGHJD-ZLUOBGJFSA-N 0.000 description 2
- JDHOJQJMWBKHDB-CIUDSAMLSA-N Asp-Asn-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC(=O)O)N JDHOJQJMWBKHDB-CIUDSAMLSA-N 0.000 description 2
- QOVWVLLHMMCFFY-ZLUOBGJFSA-N Asp-Asp-Asn Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O QOVWVLLHMMCFFY-ZLUOBGJFSA-N 0.000 description 2
- YTXCCDCOHIYQFC-GUBZILKMSA-N Asp-Met-Arg Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O YTXCCDCOHIYQFC-GUBZILKMSA-N 0.000 description 2
- MGSVBZIBCCKGCY-ZLUOBGJFSA-N Asp-Ser-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O MGSVBZIBCCKGCY-ZLUOBGJFSA-N 0.000 description 2
- GWWSUMLEWKQHLR-NUMRIWBASA-N Asp-Thr-Glu Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CC(=O)O)N)O GWWSUMLEWKQHLR-NUMRIWBASA-N 0.000 description 2
- KBJVTFWQWXCYCQ-IUKAMOBKSA-N Asp-Thr-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KBJVTFWQWXCYCQ-IUKAMOBKSA-N 0.000 description 2
- 108010041397 CD4 Antigens Proteins 0.000 description 2
- 241000282693 Cercopithecidae Species 0.000 description 2
- 241000557626 Corvus corax Species 0.000 description 2
- IQXSTXKVEMRMMB-XAVMHZPKSA-N Cys-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CS)N)O IQXSTXKVEMRMMB-XAVMHZPKSA-N 0.000 description 2
- 102000016607 Diphtheria Toxin Human genes 0.000 description 2
- 108010053187 Diphtheria Toxin Proteins 0.000 description 2
- KCXVZYZYPLLWCC-UHFFFAOYSA-N EDTA Chemical compound OC(=O)CN(CC(O)=O)CCN(CC(O)=O)CC(O)=O KCXVZYZYPLLWCC-UHFFFAOYSA-N 0.000 description 2
- 108060002716 Exonuclease Proteins 0.000 description 2
- 108010010803 Gelatin Proteins 0.000 description 2
- PCKOTDPDHIBGRW-CIUDSAMLSA-N Gln-Cys-Arg Chemical compound C(C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CCC(=O)N)N)CN=C(N)N PCKOTDPDHIBGRW-CIUDSAMLSA-N 0.000 description 2
- SHAUZYVSXAMYAZ-JYJNAYRXSA-N Gln-Leu-Phe Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CCC(=O)N)N SHAUZYVSXAMYAZ-JYJNAYRXSA-N 0.000 description 2
- QBEWLBKBGXVVPD-RYUDHWBXSA-N Gln-Phe-Gly Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CCC(=O)N)N QBEWLBKBGXVVPD-RYUDHWBXSA-N 0.000 description 2
- LPIKVBWNNVFHCQ-GUBZILKMSA-N Gln-Ser-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O LPIKVBWNNVFHCQ-GUBZILKMSA-N 0.000 description 2
- JILRMFFFCHUUTJ-ACZMJKKPSA-N Gln-Ser-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O JILRMFFFCHUUTJ-ACZMJKKPSA-N 0.000 description 2
- SGVGIVDZLSHSEN-RYUDHWBXSA-N Gln-Tyr-Gly Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(O)=O SGVGIVDZLSHSEN-RYUDHWBXSA-N 0.000 description 2
- SBYVDRJAXWSXQL-AVGNSLFASA-N Glu-Asn-Phe Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O SBYVDRJAXWSXQL-AVGNSLFASA-N 0.000 description 2
- VGUYMZGLJUJRBV-YVNDNENWSA-N Glu-Ile-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(O)=O VGUYMZGLJUJRBV-YVNDNENWSA-N 0.000 description 2
- ZCOJVESMNGBGLF-GRLWGSQLSA-N Glu-Ile-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O ZCOJVESMNGBGLF-GRLWGSQLSA-N 0.000 description 2
- CQAHWYDHKUWYIX-YUMQZZPRSA-N Glu-Pro-Gly Chemical compound OC(=O)CC[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O CQAHWYDHKUWYIX-YUMQZZPRSA-N 0.000 description 2
- ALMBZBOCGSVSAI-ACZMJKKPSA-N Glu-Ser-Asn Chemical compound C(CC(=O)O)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(=O)N)C(=O)O)N ALMBZBOCGSVSAI-ACZMJKKPSA-N 0.000 description 2
- DMYACXMQUABZIQ-NRPADANISA-N Glu-Ser-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O DMYACXMQUABZIQ-NRPADANISA-N 0.000 description 2
- MLILEEIVMRUYBX-NHCYSSNCSA-N Glu-Val-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O MLILEEIVMRUYBX-NHCYSSNCSA-N 0.000 description 2
- VIPDPMHGICREIS-GVXVVHGQSA-N Glu-Val-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O VIPDPMHGICREIS-GVXVVHGQSA-N 0.000 description 2
- FVGOGEGGQLNZGH-DZKIICNBSA-N Glu-Val-Phe Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 FVGOGEGGQLNZGH-DZKIICNBSA-N 0.000 description 2
- BGVYNAQWHSTTSP-BYULHYEWSA-N Gly-Asn-Ile Chemical compound [H]NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O BGVYNAQWHSTTSP-BYULHYEWSA-N 0.000 description 2
- FMVLWTYYODVFRG-BQBZGAKWSA-N Gly-Asn-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)CN FMVLWTYYODVFRG-BQBZGAKWSA-N 0.000 description 2
- PMNHJLASAAWELO-FOHZUACHSA-N Gly-Asp-Thr Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PMNHJLASAAWELO-FOHZUACHSA-N 0.000 description 2
- BPQYBFAXRGMGGY-LAEOZQHASA-N Gly-Gln-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)CN BPQYBFAXRGMGGY-LAEOZQHASA-N 0.000 description 2
- BIRKKBCSAIHDDF-WDSKDSINSA-N Gly-Glu-Cys Chemical compound NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CS)C(O)=O BIRKKBCSAIHDDF-WDSKDSINSA-N 0.000 description 2
- ITZOBNKQDZEOCE-NHCYSSNCSA-N Gly-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)CN ITZOBNKQDZEOCE-NHCYSSNCSA-N 0.000 description 2
- LLZXNUUIBOALNY-QWRGUYRKSA-N Gly-Leu-Lys Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCCN LLZXNUUIBOALNY-QWRGUYRKSA-N 0.000 description 2
- BXICSAQLIHFDDL-YUMQZZPRSA-N Gly-Lys-Asn Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O BXICSAQLIHFDDL-YUMQZZPRSA-N 0.000 description 2
- HUFUVTYGPOUCBN-MBLNEYKQSA-N Gly-Thr-Ile Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O HUFUVTYGPOUCBN-MBLNEYKQSA-N 0.000 description 2
- 229940033330 HIV vaccine Drugs 0.000 description 2
- 108700010908 HIV-1 proteins Proteins 0.000 description 2
- 101710154606 Hemagglutinin Proteins 0.000 description 2
- 241000238631 Hexapoda Species 0.000 description 2
- VSLXGYMEHVAJBH-DLOVCJGASA-N His-Ala-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O VSLXGYMEHVAJBH-DLOVCJGASA-N 0.000 description 2
- MDBYBTWRMOAJAY-NHCYSSNCSA-N His-Asn-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC1=CN=CN1)N MDBYBTWRMOAJAY-NHCYSSNCSA-N 0.000 description 2
- LBCAQRFTWMMWRR-CIUDSAMLSA-N His-Cys-Ser Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CS)C(=O)N[C@@H](CO)C(O)=O LBCAQRFTWMMWRR-CIUDSAMLSA-N 0.000 description 2
- 108010093488 His-His-His-His-His-His Proteins 0.000 description 2
- JENKOCSDMSVWPY-SRVKXCTJSA-N His-Leu-Asn Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O JENKOCSDMSVWPY-SRVKXCTJSA-N 0.000 description 2
- IAYPZSHNZQHQNO-KKUMJFAQSA-N His-Ser-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CC2=CN=CN2)N IAYPZSHNZQHQNO-KKUMJFAQSA-N 0.000 description 2
- ALPXXNRQBMRCPZ-MEYUZBJRSA-N His-Thr-Phe Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O ALPXXNRQBMRCPZ-MEYUZBJRSA-N 0.000 description 2
- SACHLUOUHCVIKI-GMOBBJLQSA-N Ile-Arg-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N SACHLUOUHCVIKI-GMOBBJLQSA-N 0.000 description 2
- ZZHGKECPZXPXJF-PCBIJLKTSA-N Ile-Asn-Phe Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 ZZHGKECPZXPXJF-PCBIJLKTSA-N 0.000 description 2
- NBJAAWYRLGCJOF-UGYAYLCHSA-N Ile-Asp-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N NBJAAWYRLGCJOF-UGYAYLCHSA-N 0.000 description 2
- CYHJCEKUMCNDFG-LAEOZQHASA-N Ile-Gln-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)NCC(=O)O)N CYHJCEKUMCNDFG-LAEOZQHASA-N 0.000 description 2
- OVPYIUNCVSOVNF-ZPFDUUQYSA-N Ile-Gln-Pro Natural products CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N1CCC[C@H]1C(O)=O OVPYIUNCVSOVNF-ZPFDUUQYSA-N 0.000 description 2
- VUEXLJFLDONGKQ-PYJNHQTQSA-N Ile-His-Met Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CCSC)C(=O)O)N VUEXLJFLDONGKQ-PYJNHQTQSA-N 0.000 description 2
- TWPSALMCEHCIOY-YTFOTSKYSA-N Ile-Ile-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(=O)O)N TWPSALMCEHCIOY-YTFOTSKYSA-N 0.000 description 2
- PARSHQDZROHERM-NHCYSSNCSA-N Ile-Lys-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)NCC(=O)O)N PARSHQDZROHERM-NHCYSSNCSA-N 0.000 description 2
- GLYJPWIRLBAIJH-FQUUOJAGSA-N Ile-Lys-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N1CCC[C@@H]1C(=O)O)N GLYJPWIRLBAIJH-FQUUOJAGSA-N 0.000 description 2
- VGSPNSSCMOHRRR-BJDJZHNGSA-N Ile-Ser-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)O)N VGSPNSSCMOHRRR-BJDJZHNGSA-N 0.000 description 2
- NJGXXYLPDMMFJB-XUXIUFHCSA-N Ile-Val-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCCN)C(=O)O)N NJGXXYLPDMMFJB-XUXIUFHCSA-N 0.000 description 2
- WIYDLTIBHZSPKY-HJWJTTGWSA-N Ile-Val-Phe Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 WIYDLTIBHZSPKY-HJWJTTGWSA-N 0.000 description 2
- RQZFWBLDTBDEOF-RNJOBUHISA-N Ile-Val-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N RQZFWBLDTBDEOF-RNJOBUHISA-N 0.000 description 2
- 206010061598 Immunodeficiency Diseases 0.000 description 2
- 208000029462 Immunodeficiency disease Diseases 0.000 description 2
- 102000009786 Immunoglobulin Constant Regions Human genes 0.000 description 2
- 108010009817 Immunoglobulin Constant Regions Proteins 0.000 description 2
- JUQLUIFNNFIIKC-YFKPBYRVSA-N L-2-aminopimelic acid Chemical compound OC(=O)[C@@H](N)CCCCC(O)=O JUQLUIFNNFIIKC-YFKPBYRVSA-N 0.000 description 2
- FADYJNXDPBKVCA-UHFFFAOYSA-N L-Phenylalanyl-L-lysin Natural products NCCCCC(C(O)=O)NC(=O)C(N)CC1=CC=CC=C1 FADYJNXDPBKVCA-UHFFFAOYSA-N 0.000 description 2
- RHGKLRLOHDJJDR-BYPYZUCNSA-N L-citrulline Chemical compound NC(=O)NCCC[C@H]([NH3+])C([O-])=O RHGKLRLOHDJJDR-BYPYZUCNSA-N 0.000 description 2
- ROHFNLRQFUQHCH-YFKPBYRVSA-N L-leucine Chemical compound CC(C)C[C@H](N)C(O)=O ROHFNLRQFUQHCH-YFKPBYRVSA-N 0.000 description 2
- KFKWRHQBZQICHA-STQMWFEESA-N L-leucyl-L-phenylalanine Natural products CC(C)C[C@H](N)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 KFKWRHQBZQICHA-STQMWFEESA-N 0.000 description 2
- QEFRNWWLZKMPFJ-ZXPFJRLXSA-N L-methionine (R)-S-oxide Chemical compound C[S@@](=O)CC[C@H]([NH3+])C([O-])=O QEFRNWWLZKMPFJ-ZXPFJRLXSA-N 0.000 description 2
- QEFRNWWLZKMPFJ-UHFFFAOYSA-N L-methionine sulphoxide Natural products CS(=O)CCC(N)C(O)=O QEFRNWWLZKMPFJ-UHFFFAOYSA-N 0.000 description 2
- SNDPXSYFESPGGJ-UHFFFAOYSA-N L-norVal-OH Natural products CCCC(N)C(O)=O SNDPXSYFESPGGJ-UHFFFAOYSA-N 0.000 description 2
- LRQKBLKVPFOOQJ-YFKPBYRVSA-N L-norleucine Chemical compound CCCC[C@H]([NH3+])C([O-])=O LRQKBLKVPFOOQJ-YFKPBYRVSA-N 0.000 description 2
- UCOCBWDBHCUPQP-DCAQKATOSA-N Leu-Arg-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O UCOCBWDBHCUPQP-DCAQKATOSA-N 0.000 description 2
- DBVWMYGBVFCRBE-CIUDSAMLSA-N Leu-Asn-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O DBVWMYGBVFCRBE-CIUDSAMLSA-N 0.000 description 2
- BAJIJEGGUYXZGC-CIUDSAMLSA-N Leu-Asn-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CS)C(=O)O)N BAJIJEGGUYXZGC-CIUDSAMLSA-N 0.000 description 2
- KKXDHFKZWKLYGB-GUBZILKMSA-N Leu-Asn-Glu Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N KKXDHFKZWKLYGB-GUBZILKMSA-N 0.000 description 2
- QCSFMCFHVGTLFF-NHCYSSNCSA-N Leu-Asp-Val Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O QCSFMCFHVGTLFF-NHCYSSNCSA-N 0.000 description 2
- KAFOIVJDVSZUMD-UHFFFAOYSA-N Leu-Gln-Gln Natural products CC(C)CC(N)C(=O)NC(CCC(N)=O)C(=O)NC(CCC(N)=O)C(O)=O KAFOIVJDVSZUMD-UHFFFAOYSA-N 0.000 description 2
- ZTLGVASZOIKNIX-DCAQKATOSA-N Leu-Gln-Glu Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N ZTLGVASZOIKNIX-DCAQKATOSA-N 0.000 description 2
- HPBCTWSUJOGJSH-MNXVOIDGSA-N Leu-Glu-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O HPBCTWSUJOGJSH-MNXVOIDGSA-N 0.000 description 2
- POZULHZYLPGXMR-ONGXEEELSA-N Leu-Gly-Val Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O POZULHZYLPGXMR-ONGXEEELSA-N 0.000 description 2
- KPYAOIVPJKPIOU-KKUMJFAQSA-N Leu-Lys-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(O)=O KPYAOIVPJKPIOU-KKUMJFAQSA-N 0.000 description 2
- LSLUTXRANSUGFY-XIRDDKMYSA-N Leu-Trp-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC(O)=O)C(O)=O LSLUTXRANSUGFY-XIRDDKMYSA-N 0.000 description 2
- HQBOMRTVKVKFMN-WDSOQIARSA-N Leu-Trp-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](C(C)C)C(O)=O HQBOMRTVKVKFMN-WDSOQIARSA-N 0.000 description 2
- NPBGTPKLVJEOBE-IUCAKERBSA-N Lys-Arg Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(O)=O)CCCNC(N)=N NPBGTPKLVJEOBE-IUCAKERBSA-N 0.000 description 2
- ZQCVMVCVPFYXHZ-SRVKXCTJSA-N Lys-Asn-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CCCCN ZQCVMVCVPFYXHZ-SRVKXCTJSA-N 0.000 description 2
- KPJJOZUXFOLGMQ-CIUDSAMLSA-N Lys-Asp-Asn Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N KPJJOZUXFOLGMQ-CIUDSAMLSA-N 0.000 description 2
- LMVOVCYVZBBWQB-SRVKXCTJSA-N Lys-Asp-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN LMVOVCYVZBBWQB-SRVKXCTJSA-N 0.000 description 2
- MWVUEPNEPWMFBD-SRVKXCTJSA-N Lys-Cys-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@H](C(O)=O)CCCCN MWVUEPNEPWMFBD-SRVKXCTJSA-N 0.000 description 2
- OJDFAABAHBPVTH-MNXVOIDGSA-N Lys-Ile-Gln Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(N)=O)C(O)=O OJDFAABAHBPVTH-MNXVOIDGSA-N 0.000 description 2
- QBEPTBMRQALPEV-MNXVOIDGSA-N Lys-Ile-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCCCN QBEPTBMRQALPEV-MNXVOIDGSA-N 0.000 description 2
- WAIHHELKYSFIQN-XUXIUFHCSA-N Lys-Ile-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O WAIHHELKYSFIQN-XUXIUFHCSA-N 0.000 description 2
- ATNKHRAIZCMCCN-BZSNNMDCSA-N Lys-Lys-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CCCCN)N ATNKHRAIZCMCCN-BZSNNMDCSA-N 0.000 description 2
- JCVOHUKUYSYBAD-DCAQKATOSA-N Lys-Pro-Cys Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CCCCN)N)C(=O)N[C@@H](CS)C(=O)O JCVOHUKUYSYBAD-DCAQKATOSA-N 0.000 description 2
- YSPZCHGIWAQVKQ-AVGNSLFASA-N Lys-Pro-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CCCCN YSPZCHGIWAQVKQ-AVGNSLFASA-N 0.000 description 2
- CUHGAUZONORRIC-HJGDQZAQSA-N Lys-Thr-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCCCN)N)O CUHGAUZONORRIC-HJGDQZAQSA-N 0.000 description 2
- 241000124008 Mammalia Species 0.000 description 2
- RKIIYGUHIQJCBW-SRVKXCTJSA-N Met-His-Glu Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(O)=O)C(O)=O RKIIYGUHIQJCBW-SRVKXCTJSA-N 0.000 description 2
- BKIFWLQFOOKUCA-DCAQKATOSA-N Met-His-Ser Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CO)C(=O)O)N BKIFWLQFOOKUCA-DCAQKATOSA-N 0.000 description 2
- SCIFESDRCALIIM-UHFFFAOYSA-N N-Me-Phenylalanine Natural products CNC(C(O)=O)CC1=CC=CC=C1 SCIFESDRCALIIM-UHFFFAOYSA-N 0.000 description 2
- OLNLSTNFRUFTLM-UHFFFAOYSA-N N-ethylasparagine Chemical compound CCNC(C(O)=O)CC(N)=O OLNLSTNFRUFTLM-UHFFFAOYSA-N 0.000 description 2
- YPIGGYHFMKJNKV-UHFFFAOYSA-N N-ethylglycine Chemical compound CC[NH2+]CC([O-])=O YPIGGYHFMKJNKV-UHFFFAOYSA-N 0.000 description 2
- 108010065338 N-ethylglycine Proteins 0.000 description 2
- 230000004988 N-glycosylation Effects 0.000 description 2
- SCIFESDRCALIIM-VIFPVBQESA-N N-methyl-L-phenylalanine Chemical compound C[NH2+][C@H](C([O-])=O)CC1=CC=CC=C1 SCIFESDRCALIIM-VIFPVBQESA-N 0.000 description 2
- 239000004677 Nylon Substances 0.000 description 2
- 101710093908 Outer capsid protein VP4 Proteins 0.000 description 2
- 101710135467 Outer capsid protein sigma-1 Proteins 0.000 description 2
- 241001631646 Papillomaviridae Species 0.000 description 2
- 102000035195 Peptidases Human genes 0.000 description 2
- 108091005804 Peptidases Proteins 0.000 description 2
- 108091093037 Peptide nucleic acid Proteins 0.000 description 2
- WPTYDQPGBMDUBI-QWRGUYRKSA-N Phe-Gly-Asn Chemical compound N[C@@H](Cc1ccccc1)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O WPTYDQPGBMDUBI-QWRGUYRKSA-N 0.000 description 2
- YMIZSYUAZJSOFL-SRVKXCTJSA-N Phe-Ser-Asn Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O YMIZSYUAZJSOFL-SRVKXCTJSA-N 0.000 description 2
- 241000276498 Pollachius virens Species 0.000 description 2
- 239000004952 Polyamide Substances 0.000 description 2
- 239000004743 Polypropylene Substances 0.000 description 2
- 239000004793 Polystyrene Substances 0.000 description 2
- 241000288906 Primates Species 0.000 description 2
- ORPZXBQTEHINPB-SRVKXCTJSA-N Pro-Arg-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CCCNC(N)=N)NC(=O)[C@@H]1CCCN1)C(O)=O ORPZXBQTEHINPB-SRVKXCTJSA-N 0.000 description 2
- TXPUNZXZDVJUJQ-LPEHRKFASA-N Pro-Asn-Pro Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC(=O)N)C(=O)N2CCC[C@@H]2C(=O)O TXPUNZXZDVJUJQ-LPEHRKFASA-N 0.000 description 2
- OLTFZQIYCNOBLI-DCAQKATOSA-N Pro-Cys-Lys Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCCCN)C(=O)O OLTFZQIYCNOBLI-DCAQKATOSA-N 0.000 description 2
- SOACYAXADBWDDT-CYDGBPFRSA-N Pro-Ile-Arg Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O SOACYAXADBWDDT-CYDGBPFRSA-N 0.000 description 2
- BWCZJGJKOFUUCN-ZPFDUUQYSA-N Pro-Ile-Gln Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(N)=O)C(O)=O BWCZJGJKOFUUCN-ZPFDUUQYSA-N 0.000 description 2
- LXLFEIHKWGHJJB-XUXIUFHCSA-N Pro-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@@H]1CCCN1 LXLFEIHKWGHJJB-XUXIUFHCSA-N 0.000 description 2
- AUQGUYPHJSMAKI-CYDGBPFRSA-N Pro-Ile-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H]1CCCN1 AUQGUYPHJSMAKI-CYDGBPFRSA-N 0.000 description 2
- NFLNBHLMLYALOO-DCAQKATOSA-N Pro-Leu-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@@H]1CCCN1 NFLNBHLMLYALOO-DCAQKATOSA-N 0.000 description 2
- VWHJZETTZDAGOM-XUXIUFHCSA-N Pro-Lys-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O VWHJZETTZDAGOM-XUXIUFHCSA-N 0.000 description 2
- 101710176177 Protein A56 Proteins 0.000 description 2
- 238000012228 RNA interference-mediated gene silencing Methods 0.000 description 2
- 108020004511 Recombinant DNA Proteins 0.000 description 2
- 241000283984 Rodentia Species 0.000 description 2
- 108010077895 Sarcosine Proteins 0.000 description 2
- WTUJZHKANPDPIN-CIUDSAMLSA-N Ser-Ala-Lys Chemical compound C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CO)N WTUJZHKANPDPIN-CIUDSAMLSA-N 0.000 description 2
- OOKCGAYXSNJBGQ-ZLUOBGJFSA-N Ser-Asn-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O OOKCGAYXSNJBGQ-ZLUOBGJFSA-N 0.000 description 2
- UBRXAVQWXOWRSJ-ZLUOBGJFSA-N Ser-Asn-Asp Chemical compound C([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CO)N)C(=O)N UBRXAVQWXOWRSJ-ZLUOBGJFSA-N 0.000 description 2
- ZHYMUFQVKGJNRM-ZLUOBGJFSA-N Ser-Cys-Asn Chemical compound OC[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@H](C(O)=O)CC(N)=O ZHYMUFQVKGJNRM-ZLUOBGJFSA-N 0.000 description 2
- IOVHBRCQOGWAQH-ZKWXMUAHSA-N Ser-Gly-Ile Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H]([C@@H](C)CC)C(O)=O IOVHBRCQOGWAQH-ZKWXMUAHSA-N 0.000 description 2
- SFTZWNJFZYOLBD-ZDLURKLDSA-N Ser-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CO SFTZWNJFZYOLBD-ZDLURKLDSA-N 0.000 description 2
- JIPVNVNKXJLFJF-BJDJZHNGSA-N Ser-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CO)N JIPVNVNKXJLFJF-BJDJZHNGSA-N 0.000 description 2
- BUYHXYIUQUBEQP-AVGNSLFASA-N Ser-Phe-Glu Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CO)N BUYHXYIUQUBEQP-AVGNSLFASA-N 0.000 description 2
- XJDMUQCLVSCRSJ-VZFHVOOUSA-N Ser-Thr-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O XJDMUQCLVSCRSJ-VZFHVOOUSA-N 0.000 description 2
- SQHKXWODKJDZRC-LKXGYXEUSA-N Ser-Thr-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(O)=O SQHKXWODKJDZRC-LKXGYXEUSA-N 0.000 description 2
- FGBLCMLXHRPVOF-IHRRRGAJSA-N Ser-Tyr-Arg Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O FGBLCMLXHRPVOF-IHRRRGAJSA-N 0.000 description 2
- PCMZJFMUYWIERL-ZKWXMUAHSA-N Ser-Val-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O PCMZJFMUYWIERL-ZKWXMUAHSA-N 0.000 description 2
- BEBVVQPDSHHWQL-NRPADANISA-N Ser-Val-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O BEBVVQPDSHHWQL-NRPADANISA-N 0.000 description 2
- SIEBDTCABMZCLF-XGEHTFHBSA-N Ser-Val-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SIEBDTCABMZCLF-XGEHTFHBSA-N 0.000 description 2
- 230000005867 T cell response Effects 0.000 description 2
- 210000001744 T-lymphocyte Anatomy 0.000 description 2
- RYYWUUFWQRZTIU-UHFFFAOYSA-N Thiophosphoric acid Chemical class OP(O)(S)=O RYYWUUFWQRZTIU-UHFFFAOYSA-N 0.000 description 2
- GLQFKOVWXPPFTP-VEVYYDQMSA-N Thr-Arg-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O GLQFKOVWXPPFTP-VEVYYDQMSA-N 0.000 description 2
- QGXCWPNQVCYJEL-NUMRIWBASA-N Thr-Asn-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O QGXCWPNQVCYJEL-NUMRIWBASA-N 0.000 description 2
- OJRNZRROAIAHDL-LKXGYXEUSA-N Thr-Asn-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O OJRNZRROAIAHDL-LKXGYXEUSA-N 0.000 description 2
- DCLBXIWHLVEPMQ-JRQIVUDYSA-N Thr-Asp-Tyr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 DCLBXIWHLVEPMQ-JRQIVUDYSA-N 0.000 description 2
- SHOMROOOQBDGRL-JHEQGTHGSA-N Thr-Glu-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O SHOMROOOQBDGRL-JHEQGTHGSA-N 0.000 description 2
- JMGJDTNUMAZNLX-RWRJDSDZSA-N Thr-Glu-Ile Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JMGJDTNUMAZNLX-RWRJDSDZSA-N 0.000 description 2
- BNGDYRRHRGOPHX-IFFSRLJSSA-N Thr-Glu-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)[C@@H](C)O)C(O)=O BNGDYRRHRGOPHX-IFFSRLJSSA-N 0.000 description 2
- IMULJHHGAUZZFE-MBLNEYKQSA-N Thr-Gly-Ile Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H]([C@@H](C)CC)C(O)=O IMULJHHGAUZZFE-MBLNEYKQSA-N 0.000 description 2
- MPUMPERGHHJGRP-WEDXCCLWSA-N Thr-Gly-Lys Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)N[C@@H](CCCCN)C(=O)O)N)O MPUMPERGHHJGRP-WEDXCCLWSA-N 0.000 description 2
- FQPDRTDDEZXCEC-SVSWQMSJSA-N Thr-Ile-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O FQPDRTDDEZXCEC-SVSWQMSJSA-N 0.000 description 2
- XYFISNXATOERFZ-OSUNSFLBSA-N Thr-Ile-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)O)NC(=O)[C@H]([C@@H](C)O)N XYFISNXATOERFZ-OSUNSFLBSA-N 0.000 description 2
- HOVLHEKTGVIKAP-WDCWCFNPSA-N Thr-Leu-Gln Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O HOVLHEKTGVIKAP-WDCWCFNPSA-N 0.000 description 2
- VRUFCJZQDACGLH-UVOCVTCTSA-N Thr-Leu-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O VRUFCJZQDACGLH-UVOCVTCTSA-N 0.000 description 2
- HUPLKEHTTQBXSC-YJRXYDGGSA-N Thr-Ser-Tyr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 HUPLKEHTTQBXSC-YJRXYDGGSA-N 0.000 description 2
- MFMGPEKYBXFIRF-SUSMZKCASA-N Thr-Thr-Gln Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(O)=O MFMGPEKYBXFIRF-SUSMZKCASA-N 0.000 description 2
- BPGDJSUFQKWUBK-KJEVXHAQSA-N Thr-Val-Tyr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 BPGDJSUFQKWUBK-KJEVXHAQSA-N 0.000 description 2
- HYNAKPYFEYJMAS-XIRDDKMYSA-N Trp-Arg-Glu Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O HYNAKPYFEYJMAS-XIRDDKMYSA-N 0.000 description 2
- KRCPXGSWDOGHAM-XIRDDKMYSA-N Trp-Lys-Asp Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O KRCPXGSWDOGHAM-XIRDDKMYSA-N 0.000 description 2
- KOVPHHXMHLFWPL-BPUTZDHNSA-N Trp-Pro-Asn Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CC2=CNC3=CC=CC=C32)N)C(=O)N[C@@H](CC(=O)N)C(=O)O KOVPHHXMHLFWPL-BPUTZDHNSA-N 0.000 description 2
- TVOGEPLDNYTAHD-CQDKDKBSSA-N Tyr-Ala-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 TVOGEPLDNYTAHD-CQDKDKBSSA-N 0.000 description 2
- AYHSJESDFKREAR-KKUMJFAQSA-N Tyr-Asn-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 AYHSJESDFKREAR-KKUMJFAQSA-N 0.000 description 2
- ZNFPUOSTMUMUDR-JRQIVUDYSA-N Tyr-Asn-Thr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ZNFPUOSTMUMUDR-JRQIVUDYSA-N 0.000 description 2
- JFDGVHXRCKEBAU-KKUMJFAQSA-N Tyr-Asp-Lys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N)O JFDGVHXRCKEBAU-KKUMJFAQSA-N 0.000 description 2
- UMXSDHPSMROQRB-YJRXYDGGSA-N Tyr-Cys-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CC1=CC=C(C=C1)O)N)O UMXSDHPSMROQRB-YJRXYDGGSA-N 0.000 description 2
- CTDPLKMBVALCGN-JSGCOSHPSA-N Tyr-Gly-Val Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O CTDPLKMBVALCGN-JSGCOSHPSA-N 0.000 description 2
- OKDNSNWJEXAMSU-IRXDYDNUSA-N Tyr-Phe-Gly Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(=O)NCC(O)=O)C1=CC=C(O)C=C1 OKDNSNWJEXAMSU-IRXDYDNUSA-N 0.000 description 2
- PWKMJDQXKCENMF-MEYUZBJRSA-N Tyr-Thr-Leu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O PWKMJDQXKCENMF-MEYUZBJRSA-N 0.000 description 2
- NUQZCPSZHGIYTA-HKUYNNGSSA-N Tyr-Trp-Gly Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CC3=CC=C(C=C3)O)N NUQZCPSZHGIYTA-HKUYNNGSSA-N 0.000 description 2
- XSQUKJJJFZCRTK-UHFFFAOYSA-N Urea Chemical compound NC(N)=O XSQUKJJJFZCRTK-UHFFFAOYSA-N 0.000 description 2
- 241000700618 Vaccinia virus Species 0.000 description 2
- ZLFHAAGHGQBQQN-AEJSXWLSSA-N Val-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](C(C)C)N ZLFHAAGHGQBQQN-AEJSXWLSSA-N 0.000 description 2
- OGNMURQZFMHFFD-NHCYSSNCSA-N Val-Asn-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N OGNMURQZFMHFFD-NHCYSSNCSA-N 0.000 description 2
- ZEVNVXYRZRIRCH-GVXVVHGQSA-N Val-Gln-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N ZEVNVXYRZRIRCH-GVXVVHGQSA-N 0.000 description 2
- VLDMQVZZWDOKQF-AUTRQRHGSA-N Val-Glu-Gln Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N VLDMQVZZWDOKQF-AUTRQRHGSA-N 0.000 description 2
- VHRLUTIMTDOVCG-PEDHHIEDSA-N Val-Ile-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)NC(=O)[C@H](C(C)C)N VHRLUTIMTDOVCG-PEDHHIEDSA-N 0.000 description 2
- OTJMMKPMLUNTQT-AVGNSLFASA-N Val-Leu-Arg Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](C(C)C)N OTJMMKPMLUNTQT-AVGNSLFASA-N 0.000 description 2
- ZRSZTKTVPNSUNA-IHRRRGAJSA-N Val-Lys-Leu Chemical compound CC(C)C[C@H](NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)C(C)C)C(O)=O ZRSZTKTVPNSUNA-IHRRRGAJSA-N 0.000 description 2
- USLVEJAHTBLSIL-CYDGBPFRSA-N Val-Pro-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)C(C)C USLVEJAHTBLSIL-CYDGBPFRSA-N 0.000 description 2
- LCHZBEUVGAVMKS-RHYQMDGZSA-N Val-Thr-Leu Chemical compound CC(C)C[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)C(C)C)[C@@H](C)O)C(O)=O LCHZBEUVGAVMKS-RHYQMDGZSA-N 0.000 description 2
- JAIZPWVHPQRYOU-ZJDVBMNYSA-N Val-Thr-Thr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](C(C)C)N)O JAIZPWVHPQRYOU-ZJDVBMNYSA-N 0.000 description 2
- JSOXWWFKRJKTMT-WOPDTQHZSA-N Val-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N JSOXWWFKRJKTMT-WOPDTQHZSA-N 0.000 description 2
- 108010003533 Viral Envelope Proteins Proteins 0.000 description 2
- 108010067390 Viral Proteins Proteins 0.000 description 2
- 239000002253 acid Substances 0.000 description 2
- 150000007513 acids Chemical class 0.000 description 2
- 238000007792 addition Methods 0.000 description 2
- 108010008685 alanyl-glutamyl-aspartic acid Proteins 0.000 description 2
- 108010076324 alanyl-glycyl-glycine Proteins 0.000 description 2
- 125000000217 alkyl group Chemical group 0.000 description 2
- 125000003277 amino group Chemical group 0.000 description 2
- 238000013459 approach Methods 0.000 description 2
- 108010068380 arginylarginine Proteins 0.000 description 2
- 210000004436 artificial bacterial chromosome Anatomy 0.000 description 2
- 210000001106 artificial yeast chromosome Anatomy 0.000 description 2
- 108010077245 asparaginyl-proline Proteins 0.000 description 2
- 108010038633 aspartylglutamate Proteins 0.000 description 2
- 230000002238 attenuated effect Effects 0.000 description 2
- 239000011324 bead Substances 0.000 description 2
- 238000005452 bending Methods 0.000 description 2
- 229960002685 biotin Drugs 0.000 description 2
- 235000020958 biotin Nutrition 0.000 description 2
- 239000011616 biotin Substances 0.000 description 2
- 108010006025 bovine growth hormone Proteins 0.000 description 2
- 210000004899 c-terminal region Anatomy 0.000 description 2
- 125000003178 carboxy group Chemical group [H]OC(*)=O 0.000 description 2
- 238000012219 cassette mutagenesis Methods 0.000 description 2
- 230000015556 catabolic process Effects 0.000 description 2
- 210000000170 cell membrane Anatomy 0.000 description 2
- 210000002421 cell wall Anatomy 0.000 description 2
- 230000001413 cellular effect Effects 0.000 description 2
- 239000003638 chemical reducing agent Substances 0.000 description 2
- 235000012000 cholesterol Nutrition 0.000 description 2
- 239000002131 composite material Substances 0.000 description 2
- 238000007796 conventional method Methods 0.000 description 2
- 229920001577 copolymer Polymers 0.000 description 2
- 238000012258 culturing Methods 0.000 description 2
- 210000000805 cytoplasm Anatomy 0.000 description 2
- OPTASPLRGRRNAP-UHFFFAOYSA-N cytosine Chemical compound NC=1C=CNC(=O)N=1 OPTASPLRGRRNAP-UHFFFAOYSA-N 0.000 description 2
- 238000006731 degradation reaction Methods 0.000 description 2
- 230000001419 dependent effect Effects 0.000 description 2
- 238000010790 dilution Methods 0.000 description 2
- 239000012895 dilution Substances 0.000 description 2
- 239000000539 dimer Substances 0.000 description 2
- 108010054812 diprotin A Proteins 0.000 description 2
- 150000002019 disulfides Chemical class 0.000 description 2
- PMMYEEVYMWASQN-UHFFFAOYSA-N dl-hydroxyproline Natural products OC1C[NH2+]C(C([O-])=O)C1 PMMYEEVYMWASQN-UHFFFAOYSA-N 0.000 description 2
- 238000004520 electroporation Methods 0.000 description 2
- 239000000839 emulsion Substances 0.000 description 2
- 230000002708 enhancing effect Effects 0.000 description 2
- 210000003527 eukaryotic cell Anatomy 0.000 description 2
- 102000013165 exonuclease Human genes 0.000 description 2
- 238000001914 filtration Methods 0.000 description 2
- GNBHRKFJIUUOQI-UHFFFAOYSA-N fluorescein Chemical compound O1C(=O)C2=CC=CC=C2C21C1=CC=C(O)C=C1OC1=CC(O)=CC=C21 GNBHRKFJIUUOQI-UHFFFAOYSA-N 0.000 description 2
- 238000001917 fluorescence detection Methods 0.000 description 2
- 239000007850 fluorescent dye Substances 0.000 description 2
- 108020001507 fusion proteins Proteins 0.000 description 2
- 102000037865 fusion proteins Human genes 0.000 description 2
- 239000008273 gelatin Substances 0.000 description 2
- 229920000159 gelatin Polymers 0.000 description 2
- 235000019322 gelatine Nutrition 0.000 description 2
- 235000011852 gelatine desserts Nutrition 0.000 description 2
- 230000009368 gene silencing by RNA Effects 0.000 description 2
- 238000007429 general method Methods 0.000 description 2
- 230000002068 genetic effect Effects 0.000 description 2
- 108010073628 glutamyl-valyl-phenylalanine Proteins 0.000 description 2
- RWSXRVCMGQZWBV-WDSKDSINSA-N glutathione Chemical compound OC(=O)[C@@H](N)CCC(=O)N[C@@H](CS)C(=O)NCC(O)=O RWSXRVCMGQZWBV-WDSKDSINSA-N 0.000 description 2
- 230000013595 glycosylation Effects 0.000 description 2
- 238000006206 glycosylation reaction Methods 0.000 description 2
- 108010050848 glycylleucine Proteins 0.000 description 2
- 239000010931 gold Substances 0.000 description 2
- 229910052737 gold Inorganic materials 0.000 description 2
- 239000003102 growth factor Substances 0.000 description 2
- UYTPUPDQBNUYGX-UHFFFAOYSA-N guanine Chemical compound O=C1NC(N)=NC2=C1N=CN2 UYTPUPDQBNUYGX-UHFFFAOYSA-N 0.000 description 2
- 239000000185 hemagglutinin Substances 0.000 description 2
- 108010025306 histidylleucine Proteins 0.000 description 2
- 108010092114 histidylphenylalanine Proteins 0.000 description 2
- 239000005556 hormone Substances 0.000 description 2
- 229940088597 hormone Drugs 0.000 description 2
- 210000005260 human cell Anatomy 0.000 description 2
- 210000004408 hybridoma Anatomy 0.000 description 2
- 230000002209 hydrophobic effect Effects 0.000 description 2
- 210000000987 immune system Anatomy 0.000 description 2
- 230000007813 immunodeficiency Effects 0.000 description 2
- 230000016784 immunoglobulin production Effects 0.000 description 2
- 229940072221 immunoglobulins Drugs 0.000 description 2
- 230000006872 improvement Effects 0.000 description 2
- 239000012678 infectious agent Substances 0.000 description 2
- 239000003999 initiator Substances 0.000 description 2
- 230000003834 intracellular effect Effects 0.000 description 2
- 238000007918 intramuscular administration Methods 0.000 description 2
- 238000010255 intramuscular injection Methods 0.000 description 2
- 108010044374 isoleucyl-tyrosine Proteins 0.000 description 2
- 210000003292 kidney cell Anatomy 0.000 description 2
- 108010044056 leucyl-phenylalanine Proteins 0.000 description 2
- 210000000265 leukocyte Anatomy 0.000 description 2
- 210000002540 macrophage Anatomy 0.000 description 2
- 108010056582 methionylglutamic acid Proteins 0.000 description 2
- 230000004048 modification Effects 0.000 description 2
- 238000012986 modification Methods 0.000 description 2
- 238000010369 molecular cloning Methods 0.000 description 2
- 210000000663 muscle cell Anatomy 0.000 description 2
- 230000003472 neutralizing effect Effects 0.000 description 2
- 230000036963 noncompetitive effect Effects 0.000 description 2
- 231100000252 nontoxic Toxicity 0.000 description 2
- 230000003000 nontoxic effect Effects 0.000 description 2
- 229920001778 nylon Polymers 0.000 description 2
- 210000001672 ovary Anatomy 0.000 description 2
- 239000000123 paper Substances 0.000 description 2
- 230000037361 pathway Effects 0.000 description 2
- 108010018625 phenylalanylarginine Proteins 0.000 description 2
- WTJKGGKOPKCXLL-RRHRGVEJSA-N phosphatidylcholine Chemical compound CCCCCCCCCCCCCCCC(=O)OC[C@H](COP([O-])(=O)OCC[N+](C)(C)C)OC(=O)CCCCCCCC=CCCCCCCCC WTJKGGKOPKCXLL-RRHRGVEJSA-N 0.000 description 2
- 150000004713 phosphodiesters Chemical class 0.000 description 2
- 229920003023 plastic Polymers 0.000 description 2
- 239000004033 plastic Substances 0.000 description 2
- 229920001983 poloxamer Polymers 0.000 description 2
- 229920002647 polyamide Polymers 0.000 description 2
- 229920001155 polypropylene Polymers 0.000 description 2
- 229920002223 polystyrene Polymers 0.000 description 2
- 239000011148 porous material Substances 0.000 description 2
- 230000004481 post-translational protein modification Effects 0.000 description 2
- 239000002243 precursor Substances 0.000 description 2
- 238000012545 processing Methods 0.000 description 2
- 108010020755 prolyl-glycyl-glycine Proteins 0.000 description 2
- 230000001737 promoting effect Effects 0.000 description 2
- 235000019833 protease Nutrition 0.000 description 2
- 230000001681 protective effect Effects 0.000 description 2
- 230000017854 proteolysis Effects 0.000 description 2
- 238000003753 real-time PCR Methods 0.000 description 2
- 238000003259 recombinant expression Methods 0.000 description 2
- 238000010188 recombinant method Methods 0.000 description 2
- 230000009467 reduction Effects 0.000 description 2
- 230000002829 reductive effect Effects 0.000 description 2
- 238000010839 reverse transcription Methods 0.000 description 2
- PYWVYCXTNDRMGF-UHFFFAOYSA-N rhodamine B Chemical compound [Cl-].C=12C=CC(=[N+](CC)CC)C=C2OC2=CC(N(CC)CC)=CC=C2C=1C1=CC=CC=C1C(O)=O PYWVYCXTNDRMGF-UHFFFAOYSA-N 0.000 description 2
- 230000003248 secreting effect Effects 0.000 description 2
- 238000010532 solid phase synthesis reaction Methods 0.000 description 2
- 238000011146 sterile filtration Methods 0.000 description 2
- 230000001954 sterilising effect Effects 0.000 description 2
- 238000004659 sterilization and disinfection Methods 0.000 description 2
- 230000004936 stimulating effect Effects 0.000 description 2
- 239000011550 stock solution Substances 0.000 description 2
- 229940031626 subunit vaccine Drugs 0.000 description 2
- 239000004094 surface-active agent Substances 0.000 description 2
- MPLHNVLQVRSVEE-UHFFFAOYSA-N texas red Chemical compound [O-]S(=O)(=O)C1=CC(S(Cl)(=O)=O)=CC=C1C(C1=CC=2CCCN3CCCC(C=23)=C1O1)=C2C1=C(CCC1)C3=[N+]1CCCC3=C2 MPLHNVLQVRSVEE-UHFFFAOYSA-N 0.000 description 2
- 229940021747 therapeutic vaccine Drugs 0.000 description 2
- 108010072986 threonyl-seryl-lysine Proteins 0.000 description 2
- 230000002103 transcriptional effect Effects 0.000 description 2
- 230000009466 transformation Effects 0.000 description 2
- 238000013519 translation Methods 0.000 description 2
- 230000005945 translocation Effects 0.000 description 2
- 230000032258 transport Effects 0.000 description 2
- 239000013638 trimer Substances 0.000 description 2
- 241000712461 unidentified influenza virus Species 0.000 description 2
- IBIDRSSEHFLGSD-UHFFFAOYSA-N valinyl-arginine Natural products CC(C)C(N)C(=O)NC(C(O)=O)CCCN=C(N)N IBIDRSSEHFLGSD-UHFFFAOYSA-N 0.000 description 2
- 238000011179 visual inspection Methods 0.000 description 2
- 238000001262 western blot Methods 0.000 description 2
- 210000005253 yeast cell Anatomy 0.000 description 2
- GJLXVWOMRRWCIB-MERZOTPQSA-N (2S)-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-acetamido-5-(diaminomethylideneamino)pentanoyl]amino]-3-(4-hydroxyphenyl)propanoyl]amino]-3-(4-hydroxyphenyl)propanoyl]amino]-5-(diaminomethylideneamino)pentanoyl]amino]-3-(1H-indol-3-yl)propanoyl]amino]-6-aminohexanoyl]amino]-6-aminohexanoyl]amino]-6-aminohexanoyl]amino]-6-aminohexanoyl]amino]-6-aminohexanoyl]amino]-6-aminohexanoyl]amino]-6-aminohexanamide Chemical compound C([C@H](NC(=O)[C@H](CCCN=C(N)N)NC(=O)C)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(N)=O)C1=CC=C(O)C=C1 GJLXVWOMRRWCIB-MERZOTPQSA-N 0.000 description 1
- GIDNRVZDUMNLRK-GFCCVEGCSA-N (2S)-2-amino-2,3-dimethyl-3-phenylbutanoic acid Chemical compound CC([C@](N)(C(=O)O)C)(C1=CC=CC=C1)C GIDNRVZDUMNLRK-GFCCVEGCSA-N 0.000 description 1
- PKOHVHWNGUHYRE-ZFWWWQNUSA-N (2s)-1-[2-[[(2s)-2-amino-3-(1h-indol-3-yl)propanoyl]amino]acetyl]pyrrolidine-2-carboxylic acid Chemical compound O=C([C@H](CC=1C2=CC=CC=C2NC=1)N)NCC(=O)N1CCC[C@H]1C(O)=O PKOHVHWNGUHYRE-ZFWWWQNUSA-N 0.000 description 1
- JJQSPVIEQMFVNS-YMAIGWNHSA-N (2s)-2-[[(2s)-5-amino-2-[[(2s)-2-[[(2s,3s)-2-[[(2s,3s)-2-[[(2s,3r)-2-[[(2s)-6-amino-2-[[(2s)-2-[[(2s)-2,4-diamino-4-oxobutanoyl]amino]propanoyl]amino]hexanoyl]amino]-3-hydroxybutanoyl]amino]-3-methylpentanoyl]amino]-3-methylpentanoyl]amino]-3-methylbutano Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O JJQSPVIEQMFVNS-YMAIGWNHSA-N 0.000 description 1
- AUXMWYRZQPIXCC-KNIFDHDWSA-N (2s)-2-amino-4-methylpentanoic acid;(2s)-2-aminopropanoic acid Chemical compound C[C@H](N)C(O)=O.CC(C)C[C@H](N)C(O)=O AUXMWYRZQPIXCC-KNIFDHDWSA-N 0.000 description 1
- ASWBNKHCZGQVJV-UHFFFAOYSA-N (3-hexadecanoyloxy-2-hydroxypropyl) 2-(trimethylazaniumyl)ethyl phosphate Chemical compound CCCCCCCCCCCCCCCC(=O)OCC(O)COP([O-])(=O)OCC[N+](C)(C)C ASWBNKHCZGQVJV-UHFFFAOYSA-N 0.000 description 1
- 102000040650 (ribonucleotides)n+m Human genes 0.000 description 1
- LDGWQMRUWMSZIU-LQDDAWAPSA-M 2,3-bis[(z)-octadec-9-enoxy]propyl-trimethylazanium;chloride Chemical compound [Cl-].CCCCCCCC\C=C/CCCCCCCCOCC(C[N+](C)(C)C)OCCCCCCCC\C=C/CCCCCCCC LDGWQMRUWMSZIU-LQDDAWAPSA-M 0.000 description 1
- MHKBMNACOMRIAW-UHFFFAOYSA-N 2,3-dinitrophenol Chemical class OC1=CC=CC([N+]([O-])=O)=C1[N+]([O-])=O MHKBMNACOMRIAW-UHFFFAOYSA-N 0.000 description 1
- KDZIGQIDPXKMBA-UHFFFAOYSA-N 2-[[2-[[2-[(2-amino-3-methylbutanoyl)amino]acetyl]amino]-3-hydroxypropanoyl]amino]pentanedioic acid Chemical compound CC(C)C(N)C(=O)NCC(=O)NC(CO)C(=O)NC(C(O)=O)CCC(O)=O KDZIGQIDPXKMBA-UHFFFAOYSA-N 0.000 description 1
- BFSVOASYOCHEOV-UHFFFAOYSA-N 2-diethylaminoethanol Chemical compound CCN(CC)CCO BFSVOASYOCHEOV-UHFFFAOYSA-N 0.000 description 1
- YSMODUONRAFBET-UHFFFAOYSA-N 5-hydroxylysine Chemical group NCC(O)CCC(N)C(O)=O YSMODUONRAFBET-UHFFFAOYSA-N 0.000 description 1
- 241000251468 Actinopterygii Species 0.000 description 1
- 229920000936 Agarose Polymers 0.000 description 1
- YLTKNGYYPIWKHZ-ACZMJKKPSA-N Ala-Ala-Glu Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCC(O)=O YLTKNGYYPIWKHZ-ACZMJKKPSA-N 0.000 description 1
- SDMAQFGBPOJFOM-GUBZILKMSA-N Ala-Arg-Arg Chemical compound NC(=N)NCCC[C@H](NC(=O)[C@@H](N)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O SDMAQFGBPOJFOM-GUBZILKMSA-N 0.000 description 1
- DVWVZSJAYIJZFI-FXQIFTODSA-N Ala-Arg-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(O)=O DVWVZSJAYIJZFI-FXQIFTODSA-N 0.000 description 1
- XQGIRPGAVLFKBJ-CIUDSAMLSA-N Ala-Asn-Lys Chemical compound N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)O XQGIRPGAVLFKBJ-CIUDSAMLSA-N 0.000 description 1
- SHYYAQLDNVHPFT-DLOVCJGASA-N Ala-Asn-Phe Chemical compound C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 SHYYAQLDNVHPFT-DLOVCJGASA-N 0.000 description 1
- FXKNPWNXPQZLES-ZLUOBGJFSA-N Ala-Asn-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O FXKNPWNXPQZLES-ZLUOBGJFSA-N 0.000 description 1
- MKZCBYZBCINNJN-DLOVCJGASA-N Ala-Asp-Phe Chemical compound C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 MKZCBYZBCINNJN-DLOVCJGASA-N 0.000 description 1
- NFDVJAKFMXHJEQ-HERUPUMHSA-N Ala-Asp-Trp Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N NFDVJAKFMXHJEQ-HERUPUMHSA-N 0.000 description 1
- KXEVYGKATAMXJJ-ACZMJKKPSA-N Ala-Glu-Asp Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O KXEVYGKATAMXJJ-ACZMJKKPSA-N 0.000 description 1
- HMRWQTHUDVXMGH-GUBZILKMSA-N Ala-Glu-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN HMRWQTHUDVXMGH-GUBZILKMSA-N 0.000 description 1
- OMMDTNGURYRDAC-NRPADANISA-N Ala-Glu-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O OMMDTNGURYRDAC-NRPADANISA-N 0.000 description 1
- NHLAEBFGWPXFGI-WHFBIAKZSA-N Ala-Gly-Asn Chemical compound C[C@@H](C(=O)NCC(=O)N[C@@H](CC(=O)N)C(=O)O)N NHLAEBFGWPXFGI-WHFBIAKZSA-N 0.000 description 1
- BEMGNWZECGIJOI-WDSKDSINSA-N Ala-Gly-Glu Chemical compound [H]N[C@@H](C)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O BEMGNWZECGIJOI-WDSKDSINSA-N 0.000 description 1
- VGPWRRFOPXVGOH-BYPYZUCNSA-N Ala-Gly-Gly Chemical compound C[C@H](N)C(=O)NCC(=O)NCC(O)=O VGPWRRFOPXVGOH-BYPYZUCNSA-N 0.000 description 1
- CFPQUJZTLUQUTJ-HTFCKZLJSA-N Ala-Ile-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@H](C)N CFPQUJZTLUQUTJ-HTFCKZLJSA-N 0.000 description 1
- QCTFKEJEIMPOLW-JURCDPSOSA-N Ala-Ile-Phe Chemical compound C[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 QCTFKEJEIMPOLW-JURCDPSOSA-N 0.000 description 1
- QQACQIHVWCVBBR-GVARAGBVSA-N Ala-Ile-Tyr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O QQACQIHVWCVBBR-GVARAGBVSA-N 0.000 description 1
- OYJCVIGKMXUVKB-GARJFASQSA-N Ala-Leu-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N OYJCVIGKMXUVKB-GARJFASQSA-N 0.000 description 1
- JWUZOJXDJDEQEM-ZLIFDBKOSA-N Ala-Lys-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)C)C(O)=O)=CNC2=C1 JWUZOJXDJDEQEM-ZLIFDBKOSA-N 0.000 description 1
- SGFBVLBKDSXGAP-GKCIPKSASA-N Ala-Phe-Trp Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC2=CNC3=CC=CC=C32)C(=O)O)N SGFBVLBKDSXGAP-GKCIPKSASA-N 0.000 description 1
- LSMDIAAALJJLRO-XQXXSGGOSA-N Ala-Thr-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O LSMDIAAALJJLRO-XQXXSGGOSA-N 0.000 description 1
- YCTIYBUTCKNOTI-UWJYBYFXSA-N Ala-Tyr-Asp Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N YCTIYBUTCKNOTI-UWJYBYFXSA-N 0.000 description 1
- XAXMJQUMRJAFCH-CQDKDKBSSA-N Ala-Tyr-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=C(O)C=C1 XAXMJQUMRJAFCH-CQDKDKBSSA-N 0.000 description 1
- YJHKTAMKPGFJCT-NRPADANISA-N Ala-Val-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O YJHKTAMKPGFJCT-NRPADANISA-N 0.000 description 1
- 102000002260 Alkaline Phosphatase Human genes 0.000 description 1
- 108020004774 Alkaline Phosphatase Proteins 0.000 description 1
- 241000710929 Alphavirus Species 0.000 description 1
- 244000105975 Antidesma platyphyllum Species 0.000 description 1
- SGYSTDWPNPKJPP-GUBZILKMSA-N Arg-Ala-Arg Chemical compound NC(=N)NCCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O SGYSTDWPNPKJPP-GUBZILKMSA-N 0.000 description 1
- YFWTXMRJJDNTLM-LSJOCFKGSA-N Arg-Ala-His Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N YFWTXMRJJDNTLM-LSJOCFKGSA-N 0.000 description 1
- VYSRNGOMGHOJCK-GUBZILKMSA-N Arg-Ala-Met Chemical compound C[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N VYSRNGOMGHOJCK-GUBZILKMSA-N 0.000 description 1
- IASNWHAGGYTEKX-IUCAKERBSA-N Arg-Arg-Gly Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)NCC(O)=O IASNWHAGGYTEKX-IUCAKERBSA-N 0.000 description 1
- WOPFJPHVBWKZJH-SRVKXCTJSA-N Arg-Arg-Val Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(O)=O WOPFJPHVBWKZJH-SRVKXCTJSA-N 0.000 description 1
- USNSOPDIZILSJP-FXQIFTODSA-N Arg-Asn-Asn Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O USNSOPDIZILSJP-FXQIFTODSA-N 0.000 description 1
- NONSEUUPKITYQT-BQBZGAKWSA-N Arg-Asn-Gly Chemical compound C(C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)NCC(=O)O)N)CN=C(N)N NONSEUUPKITYQT-BQBZGAKWSA-N 0.000 description 1
- QPOARHANPULOTM-GMOBBJLQSA-N Arg-Asn-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCCN=C(N)N)N QPOARHANPULOTM-GMOBBJLQSA-N 0.000 description 1
- ZTKHZAXGTFXUDD-VEVYYDQMSA-N Arg-Asn-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ZTKHZAXGTFXUDD-VEVYYDQMSA-N 0.000 description 1
- JVMKBJNSRZWDBO-FXQIFTODSA-N Arg-Cys-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CS)C(=O)N[C@@H](CO)C(O)=O JVMKBJNSRZWDBO-FXQIFTODSA-N 0.000 description 1
- PTVGLOCPAVYPFG-CIUDSAMLSA-N Arg-Gln-Asp Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O PTVGLOCPAVYPFG-CIUDSAMLSA-N 0.000 description 1
- JCAISGGAOQXEHJ-ZPFDUUQYSA-N Arg-Gln-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCCN=C(N)N)N JCAISGGAOQXEHJ-ZPFDUUQYSA-N 0.000 description 1
- OBFTYSPXDRROQO-SRVKXCTJSA-N Arg-Gln-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CCCN=C(N)N OBFTYSPXDRROQO-SRVKXCTJSA-N 0.000 description 1
- QAODJPUKWNNNRP-DCAQKATOSA-N Arg-Glu-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O QAODJPUKWNNNRP-DCAQKATOSA-N 0.000 description 1
- JAYIQMNQDMOBFY-KKUMJFAQSA-N Arg-Glu-Tyr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O JAYIQMNQDMOBFY-KKUMJFAQSA-N 0.000 description 1
- NVUIWHJLPSZZQC-CYDGBPFRSA-N Arg-Ile-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O NVUIWHJLPSZZQC-CYDGBPFRSA-N 0.000 description 1
- FFEUXEAKYRCACT-PEDHHIEDSA-N Arg-Ile-Ile Chemical compound CC[C@H](C)[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)CCCNC(N)=N)[C@@H](C)CC)C(O)=O FFEUXEAKYRCACT-PEDHHIEDSA-N 0.000 description 1
- CVXXSWQORBZAAA-SRVKXCTJSA-N Arg-Lys-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCCN=C(N)N CVXXSWQORBZAAA-SRVKXCTJSA-N 0.000 description 1
- NPAVRDPEFVKELR-DCAQKATOSA-N Arg-Lys-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O NPAVRDPEFVKELR-DCAQKATOSA-N 0.000 description 1
- YTMKMRSYXHBGER-IHRRRGAJSA-N Arg-Phe-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N YTMKMRSYXHBGER-IHRRRGAJSA-N 0.000 description 1
- GSUFZRURORXYTM-STQMWFEESA-N Arg-Phe-Gly Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CC=CC=C1 GSUFZRURORXYTM-STQMWFEESA-N 0.000 description 1
- BSYKSCBTTQKOJG-GUBZILKMSA-N Arg-Pro-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O BSYKSCBTTQKOJG-GUBZILKMSA-N 0.000 description 1
- HNJNAMGZQZPSRE-GUBZILKMSA-N Arg-Pro-Asn Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O HNJNAMGZQZPSRE-GUBZILKMSA-N 0.000 description 1
- DNBMCNQKNOKOSD-DCAQKATOSA-N Arg-Pro-Gln Chemical compound NC(N)=NCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O DNBMCNQKNOKOSD-DCAQKATOSA-N 0.000 description 1
- UULLJGQFCDXVTQ-CYDGBPFRSA-N Arg-Pro-Ile Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(O)=O UULLJGQFCDXVTQ-CYDGBPFRSA-N 0.000 description 1
- NGYHSXDNNOFHNE-AVGNSLFASA-N Arg-Pro-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O NGYHSXDNNOFHNE-AVGNSLFASA-N 0.000 description 1
- AMIQZQAAYGYKOP-FXQIFTODSA-N Arg-Ser-Asn Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O AMIQZQAAYGYKOP-FXQIFTODSA-N 0.000 description 1
- ADPACBMPYWJJCE-FXQIFTODSA-N Arg-Ser-Asp Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O ADPACBMPYWJJCE-FXQIFTODSA-N 0.000 description 1
- JPAWCMXVNZPJLO-IHRRRGAJSA-N Arg-Ser-Phe Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O JPAWCMXVNZPJLO-IHRRRGAJSA-N 0.000 description 1
- AIFHRTPABBBHKU-RCWTZXSCSA-N Arg-Thr-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O AIFHRTPABBBHKU-RCWTZXSCSA-N 0.000 description 1
- WCZXPVPHUMYLMS-VEVYYDQMSA-N Arg-Thr-Asp Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(O)=O WCZXPVPHUMYLMS-VEVYYDQMSA-N 0.000 description 1
- WTFIFQWLQXZLIZ-UMPQAUOISA-N Arg-Thr-Trp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N)O WTFIFQWLQXZLIZ-UMPQAUOISA-N 0.000 description 1
- ZUVMUOOHJYNJPP-XIRDDKMYSA-N Arg-Trp-Gln Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCC(N)=O)C(O)=O ZUVMUOOHJYNJPP-XIRDDKMYSA-N 0.000 description 1
- LLQIAIUAKGNOSE-NHCYSSNCSA-N Arg-Val-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCN=C(N)N LLQIAIUAKGNOSE-NHCYSSNCSA-N 0.000 description 1
- QLSRIZIDQXDQHK-RCWTZXSCSA-N Arg-Val-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O QLSRIZIDQXDQHK-RCWTZXSCSA-N 0.000 description 1
- 239000004475 Arginine Substances 0.000 description 1
- 240000003291 Armoracia rusticana Species 0.000 description 1
- 235000011330 Armoracia rusticana Nutrition 0.000 description 1
- YNDLOUMBVDVALC-ZLUOBGJFSA-N Asn-Ala-Ala Chemical compound C[C@@H](C(=O)N[C@@H](C)C(=O)O)NC(=O)[C@H](CC(=O)N)N YNDLOUMBVDVALC-ZLUOBGJFSA-N 0.000 description 1
- PDQBXRSOSCTGKY-ACZMJKKPSA-N Asn-Ala-Gln Chemical compound C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N PDQBXRSOSCTGKY-ACZMJKKPSA-N 0.000 description 1
- LEFKSBYHUGUWLP-ACZMJKKPSA-N Asn-Ala-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O LEFKSBYHUGUWLP-ACZMJKKPSA-N 0.000 description 1
- XWGJDUSDTRPQRK-ZLUOBGJFSA-N Asn-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC(N)=O XWGJDUSDTRPQRK-ZLUOBGJFSA-N 0.000 description 1
- QEYJFBMTSMLPKZ-ZKWXMUAHSA-N Asn-Ala-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O QEYJFBMTSMLPKZ-ZKWXMUAHSA-N 0.000 description 1
- PTNFNTOBUDWHNZ-GUBZILKMSA-N Asn-Arg-Met Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCSC)C(O)=O PTNFNTOBUDWHNZ-GUBZILKMSA-N 0.000 description 1
- POOCJCRBHHMAOS-FXQIFTODSA-N Asn-Arg-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O POOCJCRBHHMAOS-FXQIFTODSA-N 0.000 description 1
- GOVUDFOGXOONFT-VEVYYDQMSA-N Asn-Arg-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GOVUDFOGXOONFT-VEVYYDQMSA-N 0.000 description 1
- CQMQJWRCRQSBAF-BPUTZDHNSA-N Asn-Arg-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC(=O)N)N CQMQJWRCRQSBAF-BPUTZDHNSA-N 0.000 description 1
- ZZXMOQIUIJJOKZ-ZLUOBGJFSA-N Asn-Asn-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CC(N)=O ZZXMOQIUIJJOKZ-ZLUOBGJFSA-N 0.000 description 1
- KSBHCUSPLWRVEK-ZLUOBGJFSA-N Asn-Asn-Asp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O KSBHCUSPLWRVEK-ZLUOBGJFSA-N 0.000 description 1
- LJUOLNXOWSWGKF-ACZMJKKPSA-N Asn-Asn-Glu Chemical compound C(CC(=O)O)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC(=O)N)N LJUOLNXOWSWGKF-ACZMJKKPSA-N 0.000 description 1
- PCKRJVZAQZWNKM-WHFBIAKZSA-N Asn-Asn-Gly Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O PCKRJVZAQZWNKM-WHFBIAKZSA-N 0.000 description 1
- IOTKDTZEEBZNCM-UGYAYLCHSA-N Asn-Asn-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O IOTKDTZEEBZNCM-UGYAYLCHSA-N 0.000 description 1
- APHUDFFMXFYRKP-CIUDSAMLSA-N Asn-Asn-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC(=O)N)N APHUDFFMXFYRKP-CIUDSAMLSA-N 0.000 description 1
- DXZNJWFECGJCQR-FXQIFTODSA-N Asn-Asn-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC(=O)N)N DXZNJWFECGJCQR-FXQIFTODSA-N 0.000 description 1
- KXFCBAHYSLJCCY-ZLUOBGJFSA-N Asn-Asn-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O KXFCBAHYSLJCCY-ZLUOBGJFSA-N 0.000 description 1
- QHBMKQWOIYJYMI-BYULHYEWSA-N Asn-Asn-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O QHBMKQWOIYJYMI-BYULHYEWSA-N 0.000 description 1
- WVCJSDCHTUTONA-FXQIFTODSA-N Asn-Asp-Arg Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O WVCJSDCHTUTONA-FXQIFTODSA-N 0.000 description 1
- BGINHSZTXRJIPP-FXQIFTODSA-N Asn-Asp-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC(=O)N)N BGINHSZTXRJIPP-FXQIFTODSA-N 0.000 description 1
- HLTLEIXYIJDFOY-ZLUOBGJFSA-N Asn-Cys-Asn Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(N)=O)C(O)=O HLTLEIXYIJDFOY-ZLUOBGJFSA-N 0.000 description 1
- KUYKVGODHGHFDI-ACZMJKKPSA-N Asn-Gln-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O KUYKVGODHGHFDI-ACZMJKKPSA-N 0.000 description 1
- OLVIPTLKNSAYRJ-YUMQZZPRSA-N Asn-Gly-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CC(=O)N)N OLVIPTLKNSAYRJ-YUMQZZPRSA-N 0.000 description 1
- FTCGGKNCJZOPNB-WHFBIAKZSA-N Asn-Gly-Ser Chemical compound NC(=O)C[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O FTCGGKNCJZOPNB-WHFBIAKZSA-N 0.000 description 1
- AITGTTNYKAWKDR-CIUDSAMLSA-N Asn-His-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(O)=O AITGTTNYKAWKDR-CIUDSAMLSA-N 0.000 description 1
- NVWJMQNYLYWVNQ-BYULHYEWSA-N Asn-Ile-Gly Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O NVWJMQNYLYWVNQ-BYULHYEWSA-N 0.000 description 1
- GOKCTAJWRPSCHP-VHWLVUOQSA-N Asn-Ile-Trp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CC(=O)N)N GOKCTAJWRPSCHP-VHWLVUOQSA-N 0.000 description 1
- SPCONPVIDFMDJI-QSFUFRPTSA-N Asn-Ile-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O SPCONPVIDFMDJI-QSFUFRPTSA-N 0.000 description 1
- HDHZCEDPLTVHFZ-GUBZILKMSA-N Asn-Leu-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O HDHZCEDPLTVHFZ-GUBZILKMSA-N 0.000 description 1
- WIDVAWAQBRAKTI-YUMQZZPRSA-N Asn-Leu-Gly Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O WIDVAWAQBRAKTI-YUMQZZPRSA-N 0.000 description 1
- DJIMLSXHXKWADV-CIUDSAMLSA-N Asn-Leu-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC(N)=O DJIMLSXHXKWADV-CIUDSAMLSA-N 0.000 description 1
- NYGILGUOUOXGMJ-YUMQZZPRSA-N Asn-Lys-Gly Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O NYGILGUOUOXGMJ-YUMQZZPRSA-N 0.000 description 1
- RTFWCVDISAMGEQ-SRVKXCTJSA-N Asn-Phe-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N RTFWCVDISAMGEQ-SRVKXCTJSA-N 0.000 description 1
- QXOPPIDJKPEKCW-GUBZILKMSA-N Asn-Pro-Arg Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CC(=O)N)N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O QXOPPIDJKPEKCW-GUBZILKMSA-N 0.000 description 1
- VCJCPARXDBEGNE-GUBZILKMSA-N Asn-Pro-Pro Chemical compound NC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 VCJCPARXDBEGNE-GUBZILKMSA-N 0.000 description 1
- XTMZYFMTYJNABC-ZLUOBGJFSA-N Asn-Ser-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CC(=O)N)N XTMZYFMTYJNABC-ZLUOBGJFSA-N 0.000 description 1
- MYTHOBCLNIOFBL-SRVKXCTJSA-N Asn-Ser-Tyr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O MYTHOBCLNIOFBL-SRVKXCTJSA-N 0.000 description 1
- QUMKPKWYDVMGNT-NUMRIWBASA-N Asn-Thr-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N)O QUMKPKWYDVMGNT-NUMRIWBASA-N 0.000 description 1
- PUUPMDXIHCOPJU-HJGDQZAQSA-N Asn-Thr-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)N)N)O PUUPMDXIHCOPJU-HJGDQZAQSA-N 0.000 description 1
- BIGRHVNFFJTHEB-UBHSHLNASA-N Asn-Trp-Asp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC(O)=O)C(O)=O BIGRHVNFFJTHEB-UBHSHLNASA-N 0.000 description 1
- UPAGTDJAORYMEC-VHWLVUOQSA-N Asn-Trp-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@H](CC(=O)N)N UPAGTDJAORYMEC-VHWLVUOQSA-N 0.000 description 1
- QIRJQYQOIKBPBZ-IHRRRGAJSA-N Asn-Tyr-Arg Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O QIRJQYQOIKBPBZ-IHRRRGAJSA-N 0.000 description 1
- MJIJBEYEHBKTIM-BYULHYEWSA-N Asn-Val-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N MJIJBEYEHBKTIM-BYULHYEWSA-N 0.000 description 1
- LMIWYCWRJVMAIQ-NHCYSSNCSA-N Asn-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)N)N LMIWYCWRJVMAIQ-NHCYSSNCSA-N 0.000 description 1
- GWTLRDMPMJCNMH-WHFBIAKZSA-N Asp-Asn-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O GWTLRDMPMJCNMH-WHFBIAKZSA-N 0.000 description 1
- HOQGTAIGQSDCHR-SRVKXCTJSA-N Asp-Asn-Phe Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O HOQGTAIGQSDCHR-SRVKXCTJSA-N 0.000 description 1
- XACXDSRQIXRMNS-OLHMAJIHSA-N Asp-Asn-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC(=O)O)N)O XACXDSRQIXRMNS-OLHMAJIHSA-N 0.000 description 1
- RDRMWJBLOSRRAW-BYULHYEWSA-N Asp-Asn-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O RDRMWJBLOSRRAW-BYULHYEWSA-N 0.000 description 1
- NAPNAGZWHQHZLG-ZLUOBGJFSA-N Asp-Asp-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC(=O)O)N NAPNAGZWHQHZLG-ZLUOBGJFSA-N 0.000 description 1
- WCFCYFDBMNFSPA-ACZMJKKPSA-N Asp-Asp-Glu Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCC(O)=O WCFCYFDBMNFSPA-ACZMJKKPSA-N 0.000 description 1
- FANQWNCPNFEPGZ-WHFBIAKZSA-N Asp-Asp-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O FANQWNCPNFEPGZ-WHFBIAKZSA-N 0.000 description 1
- WLKVEEODTPQPLI-ACZMJKKPSA-N Asp-Gln-Asn Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O WLKVEEODTPQPLI-ACZMJKKPSA-N 0.000 description 1
- XDGBFDYXZCMYEX-NUMRIWBASA-N Asp-Glu-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC(=O)O)N)O XDGBFDYXZCMYEX-NUMRIWBASA-N 0.000 description 1
- YDJVIBMKAMQPPP-LAEOZQHASA-N Asp-Glu-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O YDJVIBMKAMQPPP-LAEOZQHASA-N 0.000 description 1
- PZXPWHFYZXTFBI-YUMQZZPRSA-N Asp-Gly-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC(O)=O PZXPWHFYZXTFBI-YUMQZZPRSA-N 0.000 description 1
- KTTCQQNRRLCIBC-GHCJXIJMSA-N Asp-Ile-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O KTTCQQNRRLCIBC-GHCJXIJMSA-N 0.000 description 1
- CYCKJEFVFNRWEZ-UGYAYLCHSA-N Asp-Ile-Asn Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O CYCKJEFVFNRWEZ-UGYAYLCHSA-N 0.000 description 1
- KQBVNNAPIURMPD-PEFMBERDSA-N Asp-Ile-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(O)=O KQBVNNAPIURMPD-PEFMBERDSA-N 0.000 description 1
- DWOGMPWRQQWPPF-GUBZILKMSA-N Asp-Leu-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O DWOGMPWRQQWPPF-GUBZILKMSA-N 0.000 description 1
- QNMKWNONJGKJJC-NHCYSSNCSA-N Asp-Leu-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O QNMKWNONJGKJJC-NHCYSSNCSA-N 0.000 description 1
- XWSIYTYNLKCLJB-CIUDSAMLSA-N Asp-Lys-Asn Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O XWSIYTYNLKCLJB-CIUDSAMLSA-N 0.000 description 1
- HJCGDIGVVWETRO-ZPFDUUQYSA-N Asp-Lys-Ile Chemical compound CC[C@H](C)[C@H](NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC(O)=O)C(O)=O HJCGDIGVVWETRO-ZPFDUUQYSA-N 0.000 description 1
- GKWFMNNNYZHJHV-SRVKXCTJSA-N Asp-Lys-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC(O)=O GKWFMNNNYZHJHV-SRVKXCTJSA-N 0.000 description 1
- YWLDTBBUHZJQHW-KKUMJFAQSA-N Asp-Lys-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC(=O)O)N YWLDTBBUHZJQHW-KKUMJFAQSA-N 0.000 description 1
- ZXRQJQCXPSMNMR-XIRDDKMYSA-N Asp-Lys-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC(=O)O)N ZXRQJQCXPSMNMR-XIRDDKMYSA-N 0.000 description 1
- SARSTIZOZFBDOM-FXQIFTODSA-N Asp-Met-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C)C(O)=O SARSTIZOZFBDOM-FXQIFTODSA-N 0.000 description 1
- BPTFNDRZKBFMTH-DCAQKATOSA-N Asp-Met-Lys Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)O)N BPTFNDRZKBFMTH-DCAQKATOSA-N 0.000 description 1
- IDDMGSKZQDEDGA-SRVKXCTJSA-N Asp-Phe-Asn Chemical compound OC(=O)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(N)=O)C(O)=O)CC1=CC=CC=C1 IDDMGSKZQDEDGA-SRVKXCTJSA-N 0.000 description 1
- JUWISGAGWSDGDH-KKUMJFAQSA-N Asp-Phe-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CC(O)=O)CC1=CC=CC=C1 JUWISGAGWSDGDH-KKUMJFAQSA-N 0.000 description 1
- XXAMCEGRCZQGEM-ZLUOBGJFSA-N Asp-Ser-Asn Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O XXAMCEGRCZQGEM-ZLUOBGJFSA-N 0.000 description 1
- YIDFBWRHIYOYAA-LKXGYXEUSA-N Asp-Ser-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O YIDFBWRHIYOYAA-LKXGYXEUSA-N 0.000 description 1
- JSNWZMFSLIWAHS-HJGDQZAQSA-N Asp-Thr-Leu Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)O)NC(=O)[C@H](CC(=O)O)N)O JSNWZMFSLIWAHS-HJGDQZAQSA-N 0.000 description 1
- GCACQYDBDHRVGE-LKXGYXEUSA-N Asp-Thr-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H]([C@H](O)C)NC(=O)[C@@H](N)CC(O)=O GCACQYDBDHRVGE-LKXGYXEUSA-N 0.000 description 1
- KCOPOPKJRHVGPE-AQZXSJQPSA-N Asp-Thr-Trp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O KCOPOPKJRHVGPE-AQZXSJQPSA-N 0.000 description 1
- ITGFVUYOLWBPQW-KKHAAJSZSA-N Asp-Thr-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O ITGFVUYOLWBPQW-KKHAAJSZSA-N 0.000 description 1
- QOJJMJKTMKNFEF-ZKWXMUAHSA-N Asp-Val-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC(O)=O QOJJMJKTMKNFEF-ZKWXMUAHSA-N 0.000 description 1
- DCXYFEDJOCDNAF-UHFFFAOYSA-N Asparagine Natural products OC(=O)C(N)CC(N)=O DCXYFEDJOCDNAF-UHFFFAOYSA-N 0.000 description 1
- 108090001008 Avidin Proteins 0.000 description 1
- 241000193830 Bacillus <bacterium> Species 0.000 description 1
- 241000894006 Bacteria Species 0.000 description 1
- QCMYYKRYFNMIEC-UHFFFAOYSA-N COP(O)=O Chemical class COP(O)=O QCMYYKRYFNMIEC-UHFFFAOYSA-N 0.000 description 1
- 241000283707 Capra Species 0.000 description 1
- 239000004215 Carbon black (E152) Substances 0.000 description 1
- 102000019034 Chemokines Human genes 0.000 description 1
- 108010012236 Chemokines Proteins 0.000 description 1
- 241000284156 Clerodendrum quadriloculare Species 0.000 description 1
- 108091033380 Coding strand Proteins 0.000 description 1
- 208000035473 Communicable disease Diseases 0.000 description 1
- 241000699802 Cricetulus griseus Species 0.000 description 1
- 239000004971 Cross linker Substances 0.000 description 1
- PKNIZMPLMSKROD-BIIVOSGPSA-N Cys-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CS)N PKNIZMPLMSKROD-BIIVOSGPSA-N 0.000 description 1
- PRVVCRZLTJNPCS-FXQIFTODSA-N Cys-Arg-Asn Chemical compound C(C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CS)N)CN=C(N)N PRVVCRZLTJNPCS-FXQIFTODSA-N 0.000 description 1
- GMXSSZUVDNPRMA-FXQIFTODSA-N Cys-Arg-Asp Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O GMXSSZUVDNPRMA-FXQIFTODSA-N 0.000 description 1
- CEZSLNCYQUFOSL-BQBZGAKWSA-N Cys-Arg-Gly Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O CEZSLNCYQUFOSL-BQBZGAKWSA-N 0.000 description 1
- DEVDFMRWZASYOF-ZLUOBGJFSA-N Cys-Asn-Asp Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O DEVDFMRWZASYOF-ZLUOBGJFSA-N 0.000 description 1
- CPTUXCUWQIBZIF-ZLUOBGJFSA-N Cys-Asn-Ser Chemical compound SC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O CPTUXCUWQIBZIF-ZLUOBGJFSA-N 0.000 description 1
- YXPNKXFOBHRUBL-BJDJZHNGSA-N Cys-Lys-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CS)N YXPNKXFOBHRUBL-BJDJZHNGSA-N 0.000 description 1
- NIXHTNJAGGFBAW-CIUDSAMLSA-N Cys-Lys-Ser Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CS)N NIXHTNJAGGFBAW-CIUDSAMLSA-N 0.000 description 1
- MXZYQNJCBVJHSR-KATARQTJSA-N Cys-Lys-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CS)N)O MXZYQNJCBVJHSR-KATARQTJSA-N 0.000 description 1
- LHJDLVVQRJIURS-SRVKXCTJSA-N Cys-Phe-Asp Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CS)N LHJDLVVQRJIURS-SRVKXCTJSA-N 0.000 description 1
- KVCJEMHFLGVINV-ZLUOBGJFSA-N Cys-Ser-Asn Chemical compound SC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC(N)=O KVCJEMHFLGVINV-ZLUOBGJFSA-N 0.000 description 1
- WTXCNOPZMQRTNN-BWBBJGPYSA-N Cys-Thr-Ser Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CS)N)O WTXCNOPZMQRTNN-BWBBJGPYSA-N 0.000 description 1
- FBPFZTCFMRRESA-FSIIMWSLSA-N D-Glucitol Natural products OC[C@H](O)[C@H](O)[C@@H](O)[C@H](O)CO FBPFZTCFMRRESA-FSIIMWSLSA-N 0.000 description 1
- FBPFZTCFMRRESA-KVTDHHQDSA-N D-Mannitol Chemical compound OC[C@@H](O)[C@@H](O)[C@H](O)[C@H](O)CO FBPFZTCFMRRESA-KVTDHHQDSA-N 0.000 description 1
- 150000008574 D-amino acids Chemical class 0.000 description 1
- FBPFZTCFMRRESA-JGWLITMVSA-N D-glucitol Chemical compound OC[C@H](O)[C@@H](O)[C@H](O)[C@H](O)CO FBPFZTCFMRRESA-JGWLITMVSA-N 0.000 description 1
- WQZGKKKJIJFFOK-QTVWNMPRSA-N D-mannopyranose Chemical compound OC[C@H]1OC(O)[C@@H](O)[C@@H](O)[C@@H]1O WQZGKKKJIJFFOK-QTVWNMPRSA-N 0.000 description 1
- 102000053602 DNA Human genes 0.000 description 1
- 241000450599 DNA viruses Species 0.000 description 1
- 108010014303 DNA-directed DNA polymerase Proteins 0.000 description 1
- 102000016928 DNA-directed DNA polymerase Human genes 0.000 description 1
- 229920002307 Dextran Polymers 0.000 description 1
- 239000004375 Dextrin Substances 0.000 description 1
- 229920001353 Dextrin Polymers 0.000 description 1
- 206010061818 Disease progression Diseases 0.000 description 1
- LMPBBFWHCRURJD-LAEOZQHASA-N Gln-Asn-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCC(=O)N)N LMPBBFWHCRURJD-LAEOZQHASA-N 0.000 description 1
- ULXXDWZMMSQBDC-ACZMJKKPSA-N Gln-Asp-Asp Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N ULXXDWZMMSQBDC-ACZMJKKPSA-N 0.000 description 1
- VNCLJDOTEPPBBD-GUBZILKMSA-N Gln-Cys-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CCC(=O)N)N VNCLJDOTEPPBBD-GUBZILKMSA-N 0.000 description 1
- UFNSPPFJOHNXRE-AUTRQRHGSA-N Gln-Gln-Val Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O UFNSPPFJOHNXRE-AUTRQRHGSA-N 0.000 description 1
- CGVWDTRDPLOMHZ-FXQIFTODSA-N Gln-Glu-Asp Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O CGVWDTRDPLOMHZ-FXQIFTODSA-N 0.000 description 1
- ZQPOVSJFBBETHQ-CIUDSAMLSA-N Gln-Glu-Gln Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O ZQPOVSJFBBETHQ-CIUDSAMLSA-N 0.000 description 1
- GXMBDEGTXHQBAO-NKIYYHGXSA-N Gln-His-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CCC(=O)N)N)O GXMBDEGTXHQBAO-NKIYYHGXSA-N 0.000 description 1
- HDUDGCZEOZEFOA-KBIXCLLPSA-N Gln-Ile-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)O)NC(=O)[C@H](CCC(=O)N)N HDUDGCZEOZEFOA-KBIXCLLPSA-N 0.000 description 1
- TWTWUBHEWQPMQW-ZPFDUUQYSA-N Gln-Ile-Arg Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O TWTWUBHEWQPMQW-ZPFDUUQYSA-N 0.000 description 1
- KHGGWBRVRPHFMH-PEFMBERDSA-N Gln-Ile-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCC(=O)N)N KHGGWBRVRPHFMH-PEFMBERDSA-N 0.000 description 1
- HXOLDXKNWKLDMM-YVNDNENWSA-N Gln-Ile-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CCC(=O)N)N HXOLDXKNWKLDMM-YVNDNENWSA-N 0.000 description 1
- YRWWJCDWLVXTHN-LAEOZQHASA-N Gln-Ile-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CCC(=O)N)N YRWWJCDWLVXTHN-LAEOZQHASA-N 0.000 description 1
- ITZWDGBYBPUZRG-KBIXCLLPSA-N Gln-Ile-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O ITZWDGBYBPUZRG-KBIXCLLPSA-N 0.000 description 1
- IULKWYSYZSURJK-AVGNSLFASA-N Gln-Leu-Lys Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(O)=O IULKWYSYZSURJK-AVGNSLFASA-N 0.000 description 1
- QDXMSSWCEVYOLZ-SZMVWBNQSA-N Gln-Leu-Trp Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CCC(=O)N)N QDXMSSWCEVYOLZ-SZMVWBNQSA-N 0.000 description 1
- JNENSVNAUWONEZ-GUBZILKMSA-N Gln-Lys-Asn Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O JNENSVNAUWONEZ-GUBZILKMSA-N 0.000 description 1
- HPCOBEHVEHWREJ-DCAQKATOSA-N Gln-Lys-Glu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O HPCOBEHVEHWREJ-DCAQKATOSA-N 0.000 description 1
- JUUNNOLZGVYCJT-JYJNAYRXSA-N Gln-Phe-Lys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)N)N JUUNNOLZGVYCJT-JYJNAYRXSA-N 0.000 description 1
- OZEQPCDLCDRCGY-SOUVJXGZSA-N Gln-Phe-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)NC(=O)[C@H](CCC(=O)N)N)C(=O)O OZEQPCDLCDRCGY-SOUVJXGZSA-N 0.000 description 1
- XQDGOJPVMSWZSO-SRVKXCTJSA-N Gln-Pro-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CCC(=O)N)N XQDGOJPVMSWZSO-SRVKXCTJSA-N 0.000 description 1
- WLRYGVYQFXRJDA-DCAQKATOSA-N Gln-Pro-Pro Chemical compound NC(=O)CC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 WLRYGVYQFXRJDA-DCAQKATOSA-N 0.000 description 1
- VNTGPISAOMAXRK-CIUDSAMLSA-N Gln-Pro-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O VNTGPISAOMAXRK-CIUDSAMLSA-N 0.000 description 1
- BYKZWDGMJLNFJY-XKBZYTNZSA-N Gln-Ser-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCC(=O)N)N)O BYKZWDGMJLNFJY-XKBZYTNZSA-N 0.000 description 1
- RNPGPFAVRLERPP-QEJZJMRPSA-N Gln-Trp-Asn Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC(N)=O)C(O)=O RNPGPFAVRLERPP-QEJZJMRPSA-N 0.000 description 1
- JJKKWYQVHRUSDG-GUBZILKMSA-N Glu-Ala-Lys Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCCN)C(O)=O JJKKWYQVHRUSDG-GUBZILKMSA-N 0.000 description 1
- FYBSCGZLICNOBA-XQXXSGGOSA-N Glu-Ala-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O FYBSCGZLICNOBA-XQXXSGGOSA-N 0.000 description 1
- RSUVOPBMWMTVDI-XEGUGMAKSA-N Glu-Ala-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)CCC(O)=O)C)C(O)=O)=CNC2=C1 RSUVOPBMWMTVDI-XEGUGMAKSA-N 0.000 description 1
- NCWOMXABNYEPLY-NRPADANISA-N Glu-Ala-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O NCWOMXABNYEPLY-NRPADANISA-N 0.000 description 1
- WOMUDRVDJMHTCV-DCAQKATOSA-N Glu-Arg-Arg Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O WOMUDRVDJMHTCV-DCAQKATOSA-N 0.000 description 1
- PBEQPAZRHDVJQI-SRVKXCTJSA-N Glu-Arg-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CCC(=O)O)N PBEQPAZRHDVJQI-SRVKXCTJSA-N 0.000 description 1
- KEBACWCLVOXFNC-DCAQKATOSA-N Glu-Arg-Met Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCSC)C(O)=O KEBACWCLVOXFNC-DCAQKATOSA-N 0.000 description 1
- YKLNMGJYMNPBCP-ACZMJKKPSA-N Glu-Asn-Asp Chemical compound C(CC(=O)O)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N YKLNMGJYMNPBCP-ACZMJKKPSA-N 0.000 description 1
- CKRUHITYRFNUKW-WDSKDSINSA-N Glu-Asn-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O CKRUHITYRFNUKW-WDSKDSINSA-N 0.000 description 1
- AFODTOLGSZQDSL-PEFMBERDSA-N Glu-Asn-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCC(=O)O)N AFODTOLGSZQDSL-PEFMBERDSA-N 0.000 description 1
- ZOXBSICWUDAOHX-GUBZILKMSA-N Glu-Asn-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CCC(O)=O ZOXBSICWUDAOHX-GUBZILKMSA-N 0.000 description 1
- RDDSZZJOKDVPAE-ACZMJKKPSA-N Glu-Asn-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O RDDSZZJOKDVPAE-ACZMJKKPSA-N 0.000 description 1
- XXCDTYBVGMPIOA-FXQIFTODSA-N Glu-Asp-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O XXCDTYBVGMPIOA-FXQIFTODSA-N 0.000 description 1
- JVSBYEDSSRZQGV-GUBZILKMSA-N Glu-Asp-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CCC(O)=O JVSBYEDSSRZQGV-GUBZILKMSA-N 0.000 description 1
- HJIFPJUEOGZWRI-GUBZILKMSA-N Glu-Asp-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCC(=O)O)N HJIFPJUEOGZWRI-GUBZILKMSA-N 0.000 description 1
- XHUCVVHRLNPZSZ-CIUDSAMLSA-N Glu-Gln-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O XHUCVVHRLNPZSZ-CIUDSAMLSA-N 0.000 description 1
- QQLBPVKLJBAXBS-FXQIFTODSA-N Glu-Glu-Asn Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O QQLBPVKLJBAXBS-FXQIFTODSA-N 0.000 description 1
- NKLRYVLERDYDBI-FXQIFTODSA-N Glu-Glu-Asp Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O NKLRYVLERDYDBI-FXQIFTODSA-N 0.000 description 1
- NUSWUSKZRCGFEX-FXQIFTODSA-N Glu-Glu-Cys Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CS)C(O)=O NUSWUSKZRCGFEX-FXQIFTODSA-N 0.000 description 1
- AUTNXSQEVVHSJK-YVNDNENWSA-N Glu-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCC(O)=O AUTNXSQEVVHSJK-YVNDNENWSA-N 0.000 description 1
- LGYZYFFDELZWRS-DCAQKATOSA-N Glu-Glu-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCC(O)=O LGYZYFFDELZWRS-DCAQKATOSA-N 0.000 description 1
- KASDBWKLWJKTLJ-GUBZILKMSA-N Glu-Glu-Met Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCSC)C(O)=O KASDBWKLWJKTLJ-GUBZILKMSA-N 0.000 description 1
- IQACOVZVOMVILH-FXQIFTODSA-N Glu-Glu-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O IQACOVZVOMVILH-FXQIFTODSA-N 0.000 description 1
- LRPXYSGPOBVBEH-IUCAKERBSA-N Glu-Gly-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O LRPXYSGPOBVBEH-IUCAKERBSA-N 0.000 description 1
- KRGZZKWSBGPLKL-IUCAKERBSA-N Glu-Gly-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CCC(=O)O)N KRGZZKWSBGPLKL-IUCAKERBSA-N 0.000 description 1
- VXQOONWNIWFOCS-HGNGGELXSA-N Glu-His-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CCC(=O)O)N VXQOONWNIWFOCS-HGNGGELXSA-N 0.000 description 1
- ITBHUUMCJJQUSC-LAEOZQHASA-N Glu-Ile-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O ITBHUUMCJJQUSC-LAEOZQHASA-N 0.000 description 1
- ZSWGJYOZWBHROQ-RWRJDSDZSA-N Glu-Ile-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ZSWGJYOZWBHROQ-RWRJDSDZSA-N 0.000 description 1
- YGLCLCMAYUYZSG-AVGNSLFASA-N Glu-Lys-His Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 YGLCLCMAYUYZSG-AVGNSLFASA-N 0.000 description 1
- ZGEJRLJEAMPEDV-SRVKXCTJSA-N Glu-Lys-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CCC(=O)O)N ZGEJRLJEAMPEDV-SRVKXCTJSA-N 0.000 description 1
- RBXSZQRSEGYDFG-GUBZILKMSA-N Glu-Lys-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O RBXSZQRSEGYDFG-GUBZILKMSA-N 0.000 description 1
- QDMVXRNLOPTPIE-WDCWCFNPSA-N Glu-Lys-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O QDMVXRNLOPTPIE-WDCWCFNPSA-N 0.000 description 1
- AOCARQDSFTWWFT-DCAQKATOSA-N Glu-Met-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O AOCARQDSFTWWFT-DCAQKATOSA-N 0.000 description 1
- XNOWYPDMSLSRKP-GUBZILKMSA-N Glu-Met-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCC(N)=O)C(O)=O XNOWYPDMSLSRKP-GUBZILKMSA-N 0.000 description 1
- SOEPMWQCTJITPZ-SRVKXCTJSA-N Glu-Met-Lys Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)O)N SOEPMWQCTJITPZ-SRVKXCTJSA-N 0.000 description 1
- NNQDRRUXFJYCCJ-NHCYSSNCSA-N Glu-Pro-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O NNQDRRUXFJYCCJ-NHCYSSNCSA-N 0.000 description 1
- BPLNJYHNAJVLRT-ACZMJKKPSA-N Glu-Ser-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O BPLNJYHNAJVLRT-ACZMJKKPSA-N 0.000 description 1
- JWNZHMSRZXXGTM-XKBZYTNZSA-N Glu-Ser-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O JWNZHMSRZXXGTM-XKBZYTNZSA-N 0.000 description 1
- JVYNYWXHZWVJEF-NUMRIWBASA-N Glu-Thr-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCC(=O)O)N)O JVYNYWXHZWVJEF-NUMRIWBASA-N 0.000 description 1
- RGJKYNUINKGPJN-RWRJDSDZSA-N Glu-Thr-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H]([C@@H](C)O)NC(=O)[C@H](CCC(=O)O)N RGJKYNUINKGPJN-RWRJDSDZSA-N 0.000 description 1
- DLISPGXMKZTWQG-IFFSRLJSSA-N Glu-Thr-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O DLISPGXMKZTWQG-IFFSRLJSSA-N 0.000 description 1
- RZMXBFUSQNLEQF-QEJZJMRPSA-N Glu-Trp-Asn Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCC(=O)O)N RZMXBFUSQNLEQF-QEJZJMRPSA-N 0.000 description 1
- DXMOIVCNJIJQSC-QEJZJMRPSA-N Glu-Trp-Ser Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CCC(=O)O)N DXMOIVCNJIJQSC-QEJZJMRPSA-N 0.000 description 1
- FGGKGJHCVMYGCD-UKJIMTQDSA-N Glu-Val-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O FGGKGJHCVMYGCD-UKJIMTQDSA-N 0.000 description 1
- WQZGKKKJIJFFOK-GASJEMHNSA-N Glucose Natural products OC[C@H]1OC(O)[C@H](O)[C@@H](O)[C@@H]1O WQZGKKKJIJFFOK-GASJEMHNSA-N 0.000 description 1
- 108010024636 Glutathione Proteins 0.000 description 1
- BRFJMRSRMOMIMU-WHFBIAKZSA-N Gly-Ala-Asn Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(O)=O BRFJMRSRMOMIMU-WHFBIAKZSA-N 0.000 description 1
- JXYMPBCYRKWJEE-BQBZGAKWSA-N Gly-Arg-Ala Chemical compound [H]NCC(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(O)=O JXYMPBCYRKWJEE-BQBZGAKWSA-N 0.000 description 1
- CLODWIOAKCSBAN-BQBZGAKWSA-N Gly-Arg-Asp Chemical compound NC(N)=NCCC[C@H](NC(=O)CN)C(=O)N[C@@H](CC(O)=O)C(O)=O CLODWIOAKCSBAN-BQBZGAKWSA-N 0.000 description 1
- NZAFOTBEULLEQB-WDSKDSINSA-N Gly-Asn-Glu Chemical compound C(CC(=O)O)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)CN NZAFOTBEULLEQB-WDSKDSINSA-N 0.000 description 1
- GGEJHJIXRBTJPD-BYPYZUCNSA-N Gly-Asn-Gly Chemical compound NCC(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O GGEJHJIXRBTJPD-BYPYZUCNSA-N 0.000 description 1
- XCLCVBYNGXEVDU-WHFBIAKZSA-N Gly-Asn-Ser Chemical compound NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O XCLCVBYNGXEVDU-WHFBIAKZSA-N 0.000 description 1
- FMNHBTKMRFVGRO-FOHZUACHSA-N Gly-Asn-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)CN FMNHBTKMRFVGRO-FOHZUACHSA-N 0.000 description 1
- GRIRDMVMJJDZKV-RCOVLWMOSA-N Gly-Asn-Val Chemical compound [H]NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O GRIRDMVMJJDZKV-RCOVLWMOSA-N 0.000 description 1
- IWAXHBCACVWNHT-BQBZGAKWSA-N Gly-Asp-Arg Chemical compound NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N IWAXHBCACVWNHT-BQBZGAKWSA-N 0.000 description 1
- MHHUEAIBJZWDBH-YUMQZZPRSA-N Gly-Asp-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)CN MHHUEAIBJZWDBH-YUMQZZPRSA-N 0.000 description 1
- HDNXXTBKOJKWNN-WDSKDSINSA-N Gly-Glu-Asn Chemical compound NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O HDNXXTBKOJKWNN-WDSKDSINSA-N 0.000 description 1
- SOEATRRYCIPEHA-BQBZGAKWSA-N Gly-Glu-Glu Chemical compound [H]NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O SOEATRRYCIPEHA-BQBZGAKWSA-N 0.000 description 1
- YYPFZVIXAVDHIK-IUCAKERBSA-N Gly-Glu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)CN YYPFZVIXAVDHIK-IUCAKERBSA-N 0.000 description 1
- NTOWAXLMQFKJPT-YUMQZZPRSA-N Gly-Glu-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)CN NTOWAXLMQFKJPT-YUMQZZPRSA-N 0.000 description 1
- QSVCIFZPGLOZGH-WDSKDSINSA-N Gly-Glu-Ser Chemical compound NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O QSVCIFZPGLOZGH-WDSKDSINSA-N 0.000 description 1
- JSNNHGHYGYMVCK-XVKPBYJWSA-N Gly-Glu-Val Chemical compound [H]NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O JSNNHGHYGYMVCK-XVKPBYJWSA-N 0.000 description 1
- XPJBQTCXPJNIFE-ZETCQYMHSA-N Gly-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)CN XPJBQTCXPJNIFE-ZETCQYMHSA-N 0.000 description 1
- QPCVIQJVRGXUSA-LURJTMIESA-N Gly-Gly-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)CNC(=O)CN QPCVIQJVRGXUSA-LURJTMIESA-N 0.000 description 1
- YWAQATDNEKZFFK-BYPYZUCNSA-N Gly-Gly-Ser Chemical compound NCC(=O)NCC(=O)N[C@@H](CO)C(O)=O YWAQATDNEKZFFK-BYPYZUCNSA-N 0.000 description 1
- UQJNXZSSGQIPIQ-FBCQKBJTSA-N Gly-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)CN UQJNXZSSGQIPIQ-FBCQKBJTSA-N 0.000 description 1
- VAXIVIPMCTYSHI-YUMQZZPRSA-N Gly-His-Asp Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)CN VAXIVIPMCTYSHI-YUMQZZPRSA-N 0.000 description 1
- DGKBSGNCMCLDSL-BYULHYEWSA-N Gly-Ile-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)CN DGKBSGNCMCLDSL-BYULHYEWSA-N 0.000 description 1
- DENRBIYENOKSEX-PEXQALLHSA-N Gly-Ile-His Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 DENRBIYENOKSEX-PEXQALLHSA-N 0.000 description 1
- SCWYHUQOOFRVHP-MBLNEYKQSA-N Gly-Ile-Thr Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SCWYHUQOOFRVHP-MBLNEYKQSA-N 0.000 description 1
- LRQXRHGQEVWGPV-NHCYSSNCSA-N Gly-Leu-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)CN LRQXRHGQEVWGPV-NHCYSSNCSA-N 0.000 description 1
- IKAIKUBBJHFNBZ-LURJTMIESA-N Gly-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)CN IKAIKUBBJHFNBZ-LURJTMIESA-N 0.000 description 1
- GMTXWRIDLGTVFC-IUCAKERBSA-N Gly-Lys-Glu Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O GMTXWRIDLGTVFC-IUCAKERBSA-N 0.000 description 1
- PTIIBFKSLCYQBO-NHCYSSNCSA-N Gly-Lys-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)CN PTIIBFKSLCYQBO-NHCYSSNCSA-N 0.000 description 1
- ZWRDOVYMQAAISL-UWVGGRQHSA-N Gly-Met-Lys Chemical compound CSCC[C@H](NC(=O)CN)C(=O)N[C@H](C(O)=O)CCCCN ZWRDOVYMQAAISL-UWVGGRQHSA-N 0.000 description 1
- SCJJPCQUJYPHRZ-BQBZGAKWSA-N Gly-Pro-Asn Chemical compound NCC(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O SCJJPCQUJYPHRZ-BQBZGAKWSA-N 0.000 description 1
- SSFWXSNOKDZNHY-QXEWZRGKSA-N Gly-Pro-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)CN SSFWXSNOKDZNHY-QXEWZRGKSA-N 0.000 description 1
- HFPVRZWORNJRRC-UWVGGRQHSA-N Gly-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)CN HFPVRZWORNJRRC-UWVGGRQHSA-N 0.000 description 1
- IRJWAYCXIYUHQE-WHFBIAKZSA-N Gly-Ser-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CO)NC(=O)CN IRJWAYCXIYUHQE-WHFBIAKZSA-N 0.000 description 1
- FGPLUIQCSKGLTI-WDSKDSINSA-N Gly-Ser-Glu Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O FGPLUIQCSKGLTI-WDSKDSINSA-N 0.000 description 1
- XHVONGZZVUUORG-WEDXCCLWSA-N Gly-Thr-Lys Chemical compound NCC(=O)N[C@@H]([C@H](O)C)C(=O)N[C@H](C(O)=O)CCCCN XHVONGZZVUUORG-WEDXCCLWSA-N 0.000 description 1
- UIQGJYUEQDOODF-KWQFWETISA-N Gly-Tyr-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H](NC(=O)CN)CC1=CC=C(O)C=C1 UIQGJYUEQDOODF-KWQFWETISA-N 0.000 description 1
- GWCJMBNBFYBQCV-XPUUQOCRSA-N Gly-Val-Ala Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O GWCJMBNBFYBQCV-XPUUQOCRSA-N 0.000 description 1
- YDIDLLVFCYSXNY-RCOVLWMOSA-N Gly-Val-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)CN YDIDLLVFCYSXNY-RCOVLWMOSA-N 0.000 description 1
- ZVXMEWXHFBYJPI-LSJOCFKGSA-N Gly-Val-Ile Chemical compound [H]NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O ZVXMEWXHFBYJPI-LSJOCFKGSA-N 0.000 description 1
- FNXSYBOHALPRHV-ONGXEEELSA-N Gly-Val-Lys Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCCN FNXSYBOHALPRHV-ONGXEEELSA-N 0.000 description 1
- JZNWSCPGTDBMEW-UHFFFAOYSA-N Glycerophosphorylethanolamin Natural products NCCOP(O)(=O)OCC(O)CO JZNWSCPGTDBMEW-UHFFFAOYSA-N 0.000 description 1
- 239000004471 Glycine Substances 0.000 description 1
- 229920002683 Glycosaminoglycan Polymers 0.000 description 1
- 208000037357 HIV infectious disease Diseases 0.000 description 1
- 229940033332 HIV-1 vaccine Drugs 0.000 description 1
- YXBRCTXAEYSCHS-XVYDVKMFSA-N His-Ala-Cys Chemical compound C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N YXBRCTXAEYSCHS-XVYDVKMFSA-N 0.000 description 1
- CIWILNZNBPIHEU-DCAQKATOSA-N His-Arg-Asn Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(O)=O CIWILNZNBPIHEU-DCAQKATOSA-N 0.000 description 1
- MWAJSVTZZOUOBU-IHRRRGAJSA-N His-Arg-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@@H](N)CC1=CN=CN1 MWAJSVTZZOUOBU-IHRRRGAJSA-N 0.000 description 1
- HRGGKHFHRSFSDE-CIUDSAMLSA-N His-Asn-Ser Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CO)C(=O)O)N HRGGKHFHRSFSDE-CIUDSAMLSA-N 0.000 description 1
- UJWYPUUXIAKEES-CUJWVEQBSA-N His-Cys-Thr Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CS)C(=O)N[C@@H]([C@@H](C)O)C(O)=O UJWYPUUXIAKEES-CUJWVEQBSA-N 0.000 description 1
- JGFWUKYIQAEYAH-DCAQKATOSA-N His-Ser-Val Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O JGFWUKYIQAEYAH-DCAQKATOSA-N 0.000 description 1
- FONIDUOGWNWEAX-XIRDDKMYSA-N His-Trp-Asn Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC(N)=O)C(O)=O FONIDUOGWNWEAX-XIRDDKMYSA-N 0.000 description 1
- HIJIJPFILYPTFR-ACRUOGEOSA-N His-Tyr-Tyr Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O HIJIJPFILYPTFR-ACRUOGEOSA-N 0.000 description 1
- 108010048209 Human Immunodeficiency Virus Proteins Proteins 0.000 description 1
- PMMYEEVYMWASQN-DMTCNVIQSA-N Hydroxyproline Chemical compound O[C@H]1CN[C@H](C(O)=O)C1 PMMYEEVYMWASQN-DMTCNVIQSA-N 0.000 description 1
- LQSBBHNVAVNZSX-GHCJXIJMSA-N Ile-Ala-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](CC(=O)N)C(=O)O)N LQSBBHNVAVNZSX-GHCJXIJMSA-N 0.000 description 1
- WUEIUSDAECDLQO-NAKRPEOUSA-N Ile-Ala-Met Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](CCSC)C(=O)O)N WUEIUSDAECDLQO-NAKRPEOUSA-N 0.000 description 1
- QTUSJASXLGLJSR-OSUNSFLBSA-N Ile-Arg-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N QTUSJASXLGLJSR-OSUNSFLBSA-N 0.000 description 1
- QADCTXFNLZBZAB-GHCJXIJMSA-N Ile-Asn-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](C)C(=O)O)N QADCTXFNLZBZAB-GHCJXIJMSA-N 0.000 description 1
- YKRIXHPEIZUDDY-GMOBBJLQSA-N Ile-Asn-Arg Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N YKRIXHPEIZUDDY-GMOBBJLQSA-N 0.000 description 1
- QYZYJFXHXYUZMZ-UGYAYLCHSA-N Ile-Asn-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)N)C(=O)O)N QYZYJFXHXYUZMZ-UGYAYLCHSA-N 0.000 description 1
- SCHZQZPYHBWYEQ-PEFMBERDSA-N Ile-Asn-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N SCHZQZPYHBWYEQ-PEFMBERDSA-N 0.000 description 1
- UAVQIQOOBXFKRC-BYULHYEWSA-N Ile-Asn-Gly Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O UAVQIQOOBXFKRC-BYULHYEWSA-N 0.000 description 1
- YPQDTQJBOFOTJQ-SXTJYALSSA-N Ile-Asn-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N YPQDTQJBOFOTJQ-SXTJYALSSA-N 0.000 description 1
- HDODQNPMSHDXJT-GHCJXIJMSA-N Ile-Asn-Ser Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O HDODQNPMSHDXJT-GHCJXIJMSA-N 0.000 description 1
- IDAHFEPYTJJZFD-PEFMBERDSA-N Ile-Asp-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N IDAHFEPYTJJZFD-PEFMBERDSA-N 0.000 description 1
- KUHFPGIVBOCRMV-MNXVOIDGSA-N Ile-Gln-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC(C)C)C(=O)O)N KUHFPGIVBOCRMV-MNXVOIDGSA-N 0.000 description 1
- UBHUJPVCJHPSEU-GRLWGSQLSA-N Ile-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N UBHUJPVCJHPSEU-GRLWGSQLSA-N 0.000 description 1
- XLCZWMJPVGRWHJ-KQXIARHKSA-N Ile-Glu-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N XLCZWMJPVGRWHJ-KQXIARHKSA-N 0.000 description 1
- PNDMHTTXXPUQJH-RWRJDSDZSA-N Ile-Glu-Thr Chemical compound N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H]([C@H](O)C)C(=O)O PNDMHTTXXPUQJH-RWRJDSDZSA-N 0.000 description 1
- SLQVFYWBGNNOTK-BYULHYEWSA-N Ile-Gly-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N[C@@H](CC(=O)N)C(=O)O)N SLQVFYWBGNNOTK-BYULHYEWSA-N 0.000 description 1
- OEQKGSPBDVKYOC-ZKWXMUAHSA-N Ile-Gly-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N[C@@H](CS)C(=O)O)N OEQKGSPBDVKYOC-ZKWXMUAHSA-N 0.000 description 1
- LPFBXFILACZHIB-LAEOZQHASA-N Ile-Gly-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N[C@@H](CCC(=O)O)C(=O)O)N LPFBXFILACZHIB-LAEOZQHASA-N 0.000 description 1
- YKLOMBNBQUTJDT-HVTMNAMFSA-N Ile-His-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N YKLOMBNBQUTJDT-HVTMNAMFSA-N 0.000 description 1
- CCYGNFBYUNHFSC-MGHWNKPDSA-N Ile-His-Phe Chemical compound [H]N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O CCYGNFBYUNHFSC-MGHWNKPDSA-N 0.000 description 1
- PFPUFNLHBXKPHY-HTFCKZLJSA-N Ile-Ile-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(=O)O)N PFPUFNLHBXKPHY-HTFCKZLJSA-N 0.000 description 1
- KLBVGHCGHUNHEA-BJDJZHNGSA-N Ile-Leu-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(=O)O)N KLBVGHCGHUNHEA-BJDJZHNGSA-N 0.000 description 1
- HPCFRQWLTRDGHT-AJNGGQMLSA-N Ile-Leu-Leu Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O HPCFRQWLTRDGHT-AJNGGQMLSA-N 0.000 description 1
- IOVUXUSIGXCREV-DKIMLUQUSA-N Ile-Leu-Phe Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 IOVUXUSIGXCREV-DKIMLUQUSA-N 0.000 description 1
- PMMMQRVUMVURGJ-XUXIUFHCSA-N Ile-Leu-Pro Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(O)=O PMMMQRVUMVURGJ-XUXIUFHCSA-N 0.000 description 1
- RMNMUUCYTMLWNA-ZPFDUUQYSA-N Ile-Lys-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(=O)O)C(=O)O)N RMNMUUCYTMLWNA-ZPFDUUQYSA-N 0.000 description 1
- YSGBJIQXTIVBHZ-AJNGGQMLSA-N Ile-Lys-Leu Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O YSGBJIQXTIVBHZ-AJNGGQMLSA-N 0.000 description 1
- CEPIAEUVRKGPGP-DSYPUSFNSA-N Ile-Lys-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)[C@@H](C)CC)C(O)=O)=CNC2=C1 CEPIAEUVRKGPGP-DSYPUSFNSA-N 0.000 description 1
- RVNOXPZHMUWCLW-GMOBBJLQSA-N Ile-Met-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(=O)N)C(=O)O)N RVNOXPZHMUWCLW-GMOBBJLQSA-N 0.000 description 1
- NNVXABCGXOLIEB-PYJNHQTQSA-N Ile-Met-His Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCSC)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 NNVXABCGXOLIEB-PYJNHQTQSA-N 0.000 description 1
- UYNXBNHVWFNVIN-HJWJTTGWSA-N Ile-Phe-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)[C@@H](C)CC)CC1=CC=CC=C1 UYNXBNHVWFNVIN-HJWJTTGWSA-N 0.000 description 1
- SAVXZJYTTQQQDD-QEWYBTABSA-N Ile-Phe-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N SAVXZJYTTQQQDD-QEWYBTABSA-N 0.000 description 1
- YKZAMJXNJUWFIK-JBDRJPRFSA-N Ile-Ser-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(=O)O)N YKZAMJXNJUWFIK-JBDRJPRFSA-N 0.000 description 1
- HXIDVIFHRYRXLZ-NAKRPEOUSA-N Ile-Ser-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)O)N HXIDVIFHRYRXLZ-NAKRPEOUSA-N 0.000 description 1
- CNMOKANDJMLAIF-CIQUZCHMSA-N Ile-Thr-Ala Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O CNMOKANDJMLAIF-CIQUZCHMSA-N 0.000 description 1
- YCKPUHHMCFSUMD-IUKAMOBKSA-N Ile-Thr-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N YCKPUHHMCFSUMD-IUKAMOBKSA-N 0.000 description 1
- RKQAYOWLSFLJEE-SVSWQMSJSA-N Ile-Thr-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CS)C(=O)O)N RKQAYOWLSFLJEE-SVSWQMSJSA-N 0.000 description 1
- DGTOKVBDZXJHNZ-WZLNRYEVSA-N Ile-Thr-Tyr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N DGTOKVBDZXJHNZ-WZLNRYEVSA-N 0.000 description 1
- JCGMFFQQHJQASB-PYJNHQTQSA-N Ile-Val-His Chemical compound N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)O JCGMFFQQHJQASB-PYJNHQTQSA-N 0.000 description 1
- DLEBSGAVWRPTIX-PEDHHIEDSA-N Ile-Val-Ile Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)[C@@H](C)CC DLEBSGAVWRPTIX-PEDHHIEDSA-N 0.000 description 1
- UYODHPPSCXBNCS-XUXIUFHCSA-N Ile-Val-Leu Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC(C)C UYODHPPSCXBNCS-XUXIUFHCSA-N 0.000 description 1
- YHFPHRUWZMEOIX-CYDGBPFRSA-N Ile-Val-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C(C)C)C(=O)O)N YHFPHRUWZMEOIX-CYDGBPFRSA-N 0.000 description 1
- DGAQECJNVWCQMB-PUAWFVPOSA-M Ilexoside XXIX Chemical compound C[C@@H]1CC[C@@]2(CC[C@@]3(C(=CC[C@H]4[C@]3(CC[C@@H]5[C@@]4(CC[C@@H](C5(C)C)OS(=O)(=O)[O-])C)C)[C@@H]2[C@]1(C)O)C)C(=O)O[C@H]6[C@@H]([C@H]([C@@H]([C@H](O6)CO)O)O)O.[Na+] DGAQECJNVWCQMB-PUAWFVPOSA-M 0.000 description 1
- 108010067060 Immunoglobulin Variable Region Proteins 0.000 description 1
- 229930010555 Inosine Natural products 0.000 description 1
- UGQMRVRMYYASKQ-KQYNXXCUSA-N Inosine Chemical compound O[C@@H]1[C@H](O)[C@@H](CO)O[C@H]1N1C2=NC=NC(O)=C2N=C1 UGQMRVRMYYASKQ-KQYNXXCUSA-N 0.000 description 1
- 108010065920 Insulin Lispro Proteins 0.000 description 1
- 102100034353 Integrase Human genes 0.000 description 1
- 102000014150 Interferons Human genes 0.000 description 1
- 108010050904 Interferons Proteins 0.000 description 1
- 102000015696 Interleukins Human genes 0.000 description 1
- 108010063738 Interleukins Proteins 0.000 description 1
- 241000235058 Komagataella pastoris Species 0.000 description 1
- SNDPXSYFESPGGJ-BYPYZUCNSA-N L-2-aminopentanoic acid Chemical compound CCC[C@H](N)C(O)=O SNDPXSYFESPGGJ-BYPYZUCNSA-N 0.000 description 1
- QUOGESRFPZDMMT-UHFFFAOYSA-N L-Homoarginine Natural products OC(=O)C(N)CCCCNC(N)=N QUOGESRFPZDMMT-UHFFFAOYSA-N 0.000 description 1
- HGCNKOLVKRAVHD-UHFFFAOYSA-N L-Met-L-Phe Natural products CSCCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 HGCNKOLVKRAVHD-UHFFFAOYSA-N 0.000 description 1
- AHLPHDHHMVZTML-BYPYZUCNSA-N L-Ornithine Chemical compound NCCC[C@H](N)C(O)=O AHLPHDHHMVZTML-BYPYZUCNSA-N 0.000 description 1
- ONIBWKKTOPOVIA-BYPYZUCNSA-N L-Proline Chemical compound OC(=O)[C@@H]1CCCN1 ONIBWKKTOPOVIA-BYPYZUCNSA-N 0.000 description 1
- SITWEMZOJNKJCH-UHFFFAOYSA-N L-alanine-L-arginine Natural products CC(N)C(=O)NC(C(O)=O)CCCNC(N)=N SITWEMZOJNKJCH-UHFFFAOYSA-N 0.000 description 1
- AGPKZVBTJJNPAG-UHNVWZDZSA-N L-allo-Isoleucine Chemical compound CC[C@@H](C)[C@H](N)C(O)=O AGPKZVBTJJNPAG-UHNVWZDZSA-N 0.000 description 1
- 150000008575 L-amino acids Chemical class 0.000 description 1
- ODKSFYDXXFIFQN-BYPYZUCNSA-P L-argininium(2+) Chemical compound NC(=[NH2+])NCCC[C@H]([NH3+])C(O)=O ODKSFYDXXFIFQN-BYPYZUCNSA-P 0.000 description 1
- QUOGESRFPZDMMT-YFKPBYRVSA-N L-homoarginine Chemical compound OC(=O)[C@@H](N)CCCCNC(N)=N QUOGESRFPZDMMT-YFKPBYRVSA-N 0.000 description 1
- RCFDOSNHHZGBOY-UHFFFAOYSA-N L-isoleucyl-L-alanine Natural products CCC(C)C(N)C(=O)NC(C)C(O)=O RCFDOSNHHZGBOY-UHFFFAOYSA-N 0.000 description 1
- SENJXOPIZNYLHU-UHFFFAOYSA-N L-leucyl-L-arginine Natural products CC(C)CC(N)C(=O)NC(C(O)=O)CCCN=C(N)N SENJXOPIZNYLHU-UHFFFAOYSA-N 0.000 description 1
- AYFVYJQAPQTCCC-GBXIJSLDSA-N L-threonine Chemical compound C[C@@H](O)[C@H](N)C(O)=O AYFVYJQAPQTCCC-GBXIJSLDSA-N 0.000 description 1
- TYYLDKGBCJGJGW-UHFFFAOYSA-N L-tryptophan-L-tyrosine Natural products C=1NC2=CC=CC=C2C=1CC(N)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 TYYLDKGBCJGJGW-UHFFFAOYSA-N 0.000 description 1
- 241000713666 Lentivirus Species 0.000 description 1
- HBJZFCIVFIBNSV-DCAQKATOSA-N Leu-Arg-Asn Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC(N)=O)C(O)=O HBJZFCIVFIBNSV-DCAQKATOSA-N 0.000 description 1
- VCSBGUACOYUIGD-CIUDSAMLSA-N Leu-Asn-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O VCSBGUACOYUIGD-CIUDSAMLSA-N 0.000 description 1
- OXKYZSRZKBTVEY-ZPFDUUQYSA-N Leu-Asn-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O OXKYZSRZKBTVEY-ZPFDUUQYSA-N 0.000 description 1
- POJPZSMTTMLSTG-SRVKXCTJSA-N Leu-Asn-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N POJPZSMTTMLSTG-SRVKXCTJSA-N 0.000 description 1
- OGCQGUIWMSBHRZ-CIUDSAMLSA-N Leu-Asn-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O OGCQGUIWMSBHRZ-CIUDSAMLSA-N 0.000 description 1
- ZURHXHNAEJJRNU-CIUDSAMLSA-N Leu-Asp-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O ZURHXHNAEJJRNU-CIUDSAMLSA-N 0.000 description 1
- KTFHTMHHKXUYPW-ZPFDUUQYSA-N Leu-Asp-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KTFHTMHHKXUYPW-ZPFDUUQYSA-N 0.000 description 1
- DLCOFDAHNMMQPP-SRVKXCTJSA-N Leu-Asp-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O DLCOFDAHNMMQPP-SRVKXCTJSA-N 0.000 description 1
- DXYBNWJZJVSZAE-GUBZILKMSA-N Leu-Gln-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CS)C(=O)O)N DXYBNWJZJVSZAE-GUBZILKMSA-N 0.000 description 1
- WIDZHJTYKYBLSR-DCAQKATOSA-N Leu-Glu-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O WIDZHJTYKYBLSR-DCAQKATOSA-N 0.000 description 1
- WQWSMEOYXJTFRU-GUBZILKMSA-N Leu-Glu-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O WQWSMEOYXJTFRU-GUBZILKMSA-N 0.000 description 1
- FMEICTQWUKNAGC-YUMQZZPRSA-N Leu-Gly-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O FMEICTQWUKNAGC-YUMQZZPRSA-N 0.000 description 1
- CCQLQKZTXZBXTN-NHCYSSNCSA-N Leu-Gly-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)NCC(=O)N[C@@H]([C@@H](C)CC)C(O)=O CCQLQKZTXZBXTN-NHCYSSNCSA-N 0.000 description 1
- AVEGDIAXTDVBJS-XUXIUFHCSA-N Leu-Ile-Arg Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O AVEGDIAXTDVBJS-XUXIUFHCSA-N 0.000 description 1
- USLNHQZCDQJBOV-ZPFDUUQYSA-N Leu-Ile-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O USLNHQZCDQJBOV-ZPFDUUQYSA-N 0.000 description 1
- AUBMZAMQCOYSIC-MNXVOIDGSA-N Leu-Ile-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(N)=O)C(O)=O AUBMZAMQCOYSIC-MNXVOIDGSA-N 0.000 description 1
- JFSGIJSCJFQGSZ-MXAVVETBSA-N Leu-Ile-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC(C)C)N JFSGIJSCJFQGSZ-MXAVVETBSA-N 0.000 description 1
- SEMUSFOBZGKBGW-YTFOTSKYSA-N Leu-Ile-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O SEMUSFOBZGKBGW-YTFOTSKYSA-N 0.000 description 1
- KUIDCYNIEJBZBU-AJNGGQMLSA-N Leu-Ile-Leu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O KUIDCYNIEJBZBU-AJNGGQMLSA-N 0.000 description 1
- LIINDKYIGYTDLG-PPCPHDFISA-N Leu-Ile-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LIINDKYIGYTDLG-PPCPHDFISA-N 0.000 description 1
- LXKNSJLSGPNHSK-KKUMJFAQSA-N Leu-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)O)N LXKNSJLSGPNHSK-KKUMJFAQSA-N 0.000 description 1
- RZXLZBIUTDQHJQ-SRVKXCTJSA-N Leu-Lys-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O RZXLZBIUTDQHJQ-SRVKXCTJSA-N 0.000 description 1
- PTRKPHUGYULXPU-KKUMJFAQSA-N Leu-Phe-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O PTRKPHUGYULXPU-KKUMJFAQSA-N 0.000 description 1
- IDGZVZJLYFTXSL-DCAQKATOSA-N Leu-Ser-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCN=C(N)N IDGZVZJLYFTXSL-DCAQKATOSA-N 0.000 description 1
- RGUXWMDNCPMQFB-YUMQZZPRSA-N Leu-Ser-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O RGUXWMDNCPMQFB-YUMQZZPRSA-N 0.000 description 1
- AMSSKPUHBUQBOQ-SRVKXCTJSA-N Leu-Ser-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)O)N AMSSKPUHBUQBOQ-SRVKXCTJSA-N 0.000 description 1
- PPGBXYKMUMHFBF-KATARQTJSA-N Leu-Ser-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PPGBXYKMUMHFBF-KATARQTJSA-N 0.000 description 1
- AEDWWMMHUGYIFD-HJGDQZAQSA-N Leu-Thr-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(O)=O AEDWWMMHUGYIFD-HJGDQZAQSA-N 0.000 description 1
- QWWPYKKLXWOITQ-VOAKCMCISA-N Leu-Thr-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(C)C QWWPYKKLXWOITQ-VOAKCMCISA-N 0.000 description 1
- SNOUHRPNNCAOPI-SZMVWBNQSA-N Leu-Trp-Gln Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N SNOUHRPNNCAOPI-SZMVWBNQSA-N 0.000 description 1
- FBNPMTNBFFAMMH-AVGNSLFASA-N Leu-Val-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N FBNPMTNBFFAMMH-AVGNSLFASA-N 0.000 description 1
- FBNPMTNBFFAMMH-UHFFFAOYSA-N Leu-Val-Arg Natural products CC(C)CC(N)C(=O)NC(C(C)C)C(=O)NC(C(O)=O)CCCN=C(N)N FBNPMTNBFFAMMH-UHFFFAOYSA-N 0.000 description 1
- XZNJZXJZBMBGGS-NHCYSSNCSA-N Leu-Val-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O XZNJZXJZBMBGGS-NHCYSSNCSA-N 0.000 description 1
- MVJRBCJCRYGCKV-GVXVVHGQSA-N Leu-Val-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O MVJRBCJCRYGCKV-GVXVVHGQSA-N 0.000 description 1
- 108090001030 Lipoproteins Proteins 0.000 description 1
- 102000004895 Lipoproteins Human genes 0.000 description 1
- JCFYLFOCALSNLQ-GUBZILKMSA-N Lys-Ala-Gln Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(O)=O JCFYLFOCALSNLQ-GUBZILKMSA-N 0.000 description 1
- BTSXLXFPMZXVPR-DLOVCJGASA-N Lys-Ala-His Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CCCCN)N BTSXLXFPMZXVPR-DLOVCJGASA-N 0.000 description 1
- NFLFJGGKOHYZJF-BJDJZHNGSA-N Lys-Ala-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCCN NFLFJGGKOHYZJF-BJDJZHNGSA-N 0.000 description 1
- YRWCPXOFBKTCFY-NUTKFTJISA-N Lys-Ala-Trp Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CCCCN)N YRWCPXOFBKTCFY-NUTKFTJISA-N 0.000 description 1
- VHNOAIFVYUQOOY-XUXIUFHCSA-N Lys-Arg-Ile Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O VHNOAIFVYUQOOY-XUXIUFHCSA-N 0.000 description 1
- SJNZALDHDUYDBU-IHRRRGAJSA-N Lys-Arg-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCCN)C(O)=O SJNZALDHDUYDBU-IHRRRGAJSA-N 0.000 description 1
- DGAAQRAUOFHBFJ-CIUDSAMLSA-N Lys-Asn-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(O)=O DGAAQRAUOFHBFJ-CIUDSAMLSA-N 0.000 description 1
- LZWNAOIMTLNMDW-NHCYSSNCSA-N Lys-Asn-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCCCN)N LZWNAOIMTLNMDW-NHCYSSNCSA-N 0.000 description 1
- WGCKDDHUFPQSMZ-ZPFDUUQYSA-N Lys-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CCCCN WGCKDDHUFPQSMZ-ZPFDUUQYSA-N 0.000 description 1
- PHHYNOUOUWYQRO-XIRDDKMYSA-N Lys-Asp-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCCCN)N PHHYNOUOUWYQRO-XIRDDKMYSA-N 0.000 description 1
- SFQPJNQDUUYCLA-BJDJZHNGSA-N Lys-Cys-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CCCCN)N SFQPJNQDUUYCLA-BJDJZHNGSA-N 0.000 description 1
- BYEBKXRNDLTGFW-CIUDSAMLSA-N Lys-Cys-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CS)C(=O)N[C@@H](CO)C(O)=O BYEBKXRNDLTGFW-CIUDSAMLSA-N 0.000 description 1
- DFXQCCBKGUNYGG-GUBZILKMSA-N Lys-Gln-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CCCCN DFXQCCBKGUNYGG-GUBZILKMSA-N 0.000 description 1
- NDORZBUHCOJQDO-GVXVVHGQSA-N Lys-Gln-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O NDORZBUHCOJQDO-GVXVVHGQSA-N 0.000 description 1
- ZXEUFAVXODIPHC-GUBZILKMSA-N Lys-Glu-Asn Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O ZXEUFAVXODIPHC-GUBZILKMSA-N 0.000 description 1
- GRADYHMSAUIKPS-DCAQKATOSA-N Lys-Glu-Gln Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O GRADYHMSAUIKPS-DCAQKATOSA-N 0.000 description 1
- IMAKMJCBYCSMHM-AVGNSLFASA-N Lys-Glu-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN IMAKMJCBYCSMHM-AVGNSLFASA-N 0.000 description 1
- DUTMKEAPLLUGNO-JYJNAYRXSA-N Lys-Glu-Phe Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O DUTMKEAPLLUGNO-JYJNAYRXSA-N 0.000 description 1
- VEGLGAOVLFODGC-GUBZILKMSA-N Lys-Glu-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O VEGLGAOVLFODGC-GUBZILKMSA-N 0.000 description 1
- ODUQLUADRKMHOZ-JYJNAYRXSA-N Lys-Glu-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCCCN)N)O ODUQLUADRKMHOZ-JYJNAYRXSA-N 0.000 description 1
- XNKDCYABMBBEKN-IUCAKERBSA-N Lys-Gly-Gln Chemical compound NCCCC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(N)=O XNKDCYABMBBEKN-IUCAKERBSA-N 0.000 description 1
- NKKFVJRLCCUJNA-QWRGUYRKSA-N Lys-Gly-Lys Chemical compound NCCCC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCCN NKKFVJRLCCUJNA-QWRGUYRKSA-N 0.000 description 1
- RFQATBGBLDAKGI-VHSXEESVSA-N Lys-Gly-Pro Chemical compound C1C[C@@H](N(C1)C(=O)CNC(=O)[C@H](CCCCN)N)C(=O)O RFQATBGBLDAKGI-VHSXEESVSA-N 0.000 description 1
- CANPXOLVTMKURR-WEDXCCLWSA-N Lys-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CCCCN CANPXOLVTMKURR-WEDXCCLWSA-N 0.000 description 1
- PRSBSVAVOQOAMI-BJDJZHNGSA-N Lys-Ile-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCCCN PRSBSVAVOQOAMI-BJDJZHNGSA-N 0.000 description 1
- MUXNCRWTWBMNHX-SRVKXCTJSA-N Lys-Leu-Asp Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O MUXNCRWTWBMNHX-SRVKXCTJSA-N 0.000 description 1
- XIZQPFCRXLUNMK-BZSNNMDCSA-N Lys-Leu-Phe Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CCCCN)N XIZQPFCRXLUNMK-BZSNNMDCSA-N 0.000 description 1
- AHFOKDZWPPGJAZ-SRVKXCTJSA-N Lys-Lys-Cys Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CS)C(=O)O)N AHFOKDZWPPGJAZ-SRVKXCTJSA-N 0.000 description 1
- PYFNONMJYNJENN-AVGNSLFASA-N Lys-Lys-Gln Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N PYFNONMJYNJENN-AVGNSLFASA-N 0.000 description 1
- UQRZFMQQXXJTTF-AVGNSLFASA-N Lys-Lys-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O UQRZFMQQXXJTTF-AVGNSLFASA-N 0.000 description 1
- GAHJXEMYXKLZRQ-AJNGGQMLSA-N Lys-Lys-Ile Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O GAHJXEMYXKLZRQ-AJNGGQMLSA-N 0.000 description 1
- HVAUKHLDSDDROB-KKUMJFAQSA-N Lys-Lys-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O HVAUKHLDSDDROB-KKUMJFAQSA-N 0.000 description 1
- MTBLFIQZECOEBY-IHRRRGAJSA-N Lys-Met-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCCCN)C(O)=O MTBLFIQZECOEBY-IHRRRGAJSA-N 0.000 description 1
- TWPCWKVOZDUYAA-KKUMJFAQSA-N Lys-Phe-Asp Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O TWPCWKVOZDUYAA-KKUMJFAQSA-N 0.000 description 1
- AZOFEHCPMBRNFD-BZSNNMDCSA-N Lys-Phe-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CCCCN)C(O)=O)CC1=CC=CC=C1 AZOFEHCPMBRNFD-BZSNNMDCSA-N 0.000 description 1
- YTJFXEDRUOQGSP-DCAQKATOSA-N Lys-Pro-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O YTJFXEDRUOQGSP-DCAQKATOSA-N 0.000 description 1
- JMNRXRPBHFGXQX-GUBZILKMSA-N Lys-Ser-Glu Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O JMNRXRPBHFGXQX-GUBZILKMSA-N 0.000 description 1
- IOQWIOPSKJOEKI-SRVKXCTJSA-N Lys-Ser-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O IOQWIOPSKJOEKI-SRVKXCTJSA-N 0.000 description 1
- UIJVKVHLCQSPOJ-XIRDDKMYSA-N Lys-Ser-Trp Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](Cc1c[nH]c2ccccc12)C(O)=O UIJVKVHLCQSPOJ-XIRDDKMYSA-N 0.000 description 1
- UWHCKWNPWKTMBM-WDCWCFNPSA-N Lys-Thr-Gln Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(O)=O UWHCKWNPWKTMBM-WDCWCFNPSA-N 0.000 description 1
- YCJCEMKOZOYBEF-OEAJRASXSA-N Lys-Thr-Phe Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O YCJCEMKOZOYBEF-OEAJRASXSA-N 0.000 description 1
- RMOKGALPSPOYKE-KATARQTJSA-N Lys-Thr-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O RMOKGALPSPOYKE-KATARQTJSA-N 0.000 description 1
- CAVRAQIDHUPECU-UVOCVTCTSA-N Lys-Thr-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O CAVRAQIDHUPECU-UVOCVTCTSA-N 0.000 description 1
- ZVZRQKJOQQAFCF-ULQDDVLXSA-N Lys-Tyr-Arg Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O ZVZRQKJOQQAFCF-ULQDDVLXSA-N 0.000 description 1
- OHXUUQDOBQKSNB-AVGNSLFASA-N Lys-Val-Arg Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O OHXUUQDOBQKSNB-AVGNSLFASA-N 0.000 description 1
- MDDUIRLQCYVRDO-NHCYSSNCSA-N Lys-Val-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCCN MDDUIRLQCYVRDO-NHCYSSNCSA-N 0.000 description 1
- RPWQJSBMXJSCPD-XUXIUFHCSA-N Lys-Val-Ile Chemical compound CC[C@H](C)[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)CCCCN)C(C)C)C(O)=O RPWQJSBMXJSCPD-XUXIUFHCSA-N 0.000 description 1
- NYTDJEZBAAFLLG-IHRRRGAJSA-N Lys-Val-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCCN)C(O)=O NYTDJEZBAAFLLG-IHRRRGAJSA-N 0.000 description 1
- TXTZMVNJIRZABH-ULQDDVLXSA-N Lys-Val-Phe Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 TXTZMVNJIRZABH-ULQDDVLXSA-N 0.000 description 1
- 239000004472 Lysine Substances 0.000 description 1
- 229930195725 Mannitol Natural products 0.000 description 1
- CWFYZYQMUDWGTI-GUBZILKMSA-N Met-Arg-Asp Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O CWFYZYQMUDWGTI-GUBZILKMSA-N 0.000 description 1
- CTVJSFRHUOSCQQ-DCAQKATOSA-N Met-Arg-Glu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O CTVJSFRHUOSCQQ-DCAQKATOSA-N 0.000 description 1
- NCVJJAJVWILAGI-SRVKXCTJSA-N Met-Gln-Lys Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N NCVJJAJVWILAGI-SRVKXCTJSA-N 0.000 description 1
- MVBZBRKNZVJEKK-DTWKUNHWSA-N Met-Gly-Pro Chemical compound CSCC[C@@H](C(=O)NCC(=O)N1CCC[C@@H]1C(=O)O)N MVBZBRKNZVJEKK-DTWKUNHWSA-N 0.000 description 1
- RATXDYWHIYNZLE-DCAQKATOSA-N Met-Lys-Cys Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CS)C(=O)O)N RATXDYWHIYNZLE-DCAQKATOSA-N 0.000 description 1
- JKXVPNCSAMWUEJ-GUBZILKMSA-N Met-Met-Asp Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(O)=O)C(O)=O JKXVPNCSAMWUEJ-GUBZILKMSA-N 0.000 description 1
- RIIFMEBFDDXGCV-VEVYYDQMSA-N Met-Thr-Asn Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(N)=O RIIFMEBFDDXGCV-VEVYYDQMSA-N 0.000 description 1
- RKRFGIBULDYDPF-XIRDDKMYSA-N Met-Trp-Gln Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N RKRFGIBULDYDPF-XIRDDKMYSA-N 0.000 description 1
- YDKYJRZWRJTILC-WDSOQIARSA-N Met-Trp-Lys Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](N)CCSC)C(=O)N[C@@H](CCCCN)C(O)=O)=CNC2=C1 YDKYJRZWRJTILC-WDSOQIARSA-N 0.000 description 1
- JACMWNXOOUYXCD-JYJNAYRXSA-N Met-Val-Phe Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 JACMWNXOOUYXCD-JYJNAYRXSA-N 0.000 description 1
- 241001529936 Murinae Species 0.000 description 1
- 241000714177 Murine leukemia virus Species 0.000 description 1
- NSTPXGARCQOSAU-VIFPVBQESA-N N-formyl-L-phenylalanine Chemical compound O=CN[C@H](C(=O)O)CC1=CC=CC=C1 NSTPXGARCQOSAU-VIFPVBQESA-N 0.000 description 1
- 108010079364 N-glycylalanine Proteins 0.000 description 1
- RHGKLRLOHDJJDR-UHFFFAOYSA-N Ndelta-carbamoyl-DL-ornithine Natural products OC(=O)C(N)CCCNC(N)=O RHGKLRLOHDJJDR-UHFFFAOYSA-N 0.000 description 1
- 239000000020 Nitrocellulose Substances 0.000 description 1
- 102000007999 Nuclear Proteins Human genes 0.000 description 1
- 108010089610 Nuclear Proteins Proteins 0.000 description 1
- AHLPHDHHMVZTML-UHFFFAOYSA-N Orn-delta-NH2 Natural products NCCCC(N)C(O)=O AHLPHDHHMVZTML-UHFFFAOYSA-N 0.000 description 1
- UTJLXEIPEHZYQJ-UHFFFAOYSA-N Ornithine Natural products OC(=O)C(C)CCCN UTJLXEIPEHZYQJ-UHFFFAOYSA-N 0.000 description 1
- 102000016979 Other receptors Human genes 0.000 description 1
- 101710160107 Outer membrane protein A Proteins 0.000 description 1
- 241001494479 Pecora Species 0.000 description 1
- 102000057297 Pepsin A Human genes 0.000 description 1
- 108090000284 Pepsin A Proteins 0.000 description 1
- BKWJQWJPZMUWEG-LFSVMHDDSA-N Phe-Ala-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=CC=C1 BKWJQWJPZMUWEG-LFSVMHDDSA-N 0.000 description 1
- AGYXCMYVTBYGCT-ULQDDVLXSA-N Phe-Arg-Leu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O AGYXCMYVTBYGCT-ULQDDVLXSA-N 0.000 description 1
- JIYJYFIXQTYDNF-YDHLFZDLSA-N Phe-Asn-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC1=CC=CC=C1)N JIYJYFIXQTYDNF-YDHLFZDLSA-N 0.000 description 1
- WMGVYPPIMZPWPN-SRVKXCTJSA-N Phe-Asp-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N WMGVYPPIMZPWPN-SRVKXCTJSA-N 0.000 description 1
- CSYVXYQDIVCQNU-QWRGUYRKSA-N Phe-Asp-Gly Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O CSYVXYQDIVCQNU-QWRGUYRKSA-N 0.000 description 1
- IQXOZIDWLZYYAW-IHRRRGAJSA-N Phe-Asp-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N IQXOZIDWLZYYAW-IHRRRGAJSA-N 0.000 description 1
- FMMIYCMOVGXZIP-AVGNSLFASA-N Phe-Glu-Asn Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O FMMIYCMOVGXZIP-AVGNSLFASA-N 0.000 description 1
- WEMYTDDMDBLPMI-DKIMLUQUSA-N Phe-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N WEMYTDDMDBLPMI-DKIMLUQUSA-N 0.000 description 1
- YTILBRIUASDGBL-BZSNNMDCSA-N Phe-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 YTILBRIUASDGBL-BZSNNMDCSA-N 0.000 description 1
- RMKGXGPQIPLTFC-KKUMJFAQSA-N Phe-Lys-Asn Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O RMKGXGPQIPLTFC-KKUMJFAQSA-N 0.000 description 1
- BNRFQGLWLQESBG-YESZJQIVSA-N Phe-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)[C@H](CC2=CC=CC=C2)N)C(=O)O BNRFQGLWLQESBG-YESZJQIVSA-N 0.000 description 1
- UXQFHEKRGHYJRA-STQMWFEESA-N Phe-Met-Gly Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCSC)C(=O)NCC(O)=O UXQFHEKRGHYJRA-STQMWFEESA-N 0.000 description 1
- HBXAOEBRGLCLIW-AVGNSLFASA-N Phe-Ser-Gln Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N HBXAOEBRGLCLIW-AVGNSLFASA-N 0.000 description 1
- RAGOJJCBGXARPO-XVSYOHENSA-N Phe-Thr-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H]([C@H](O)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 RAGOJJCBGXARPO-XVSYOHENSA-N 0.000 description 1
- VFDRDMOMHBJGKD-UFYCRDLUSA-N Phe-Tyr-Arg Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N VFDRDMOMHBJGKD-UFYCRDLUSA-N 0.000 description 1
- DBNGDEAQXGFGRA-ACRUOGEOSA-N Phe-Tyr-Lys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)N[C@@H](CCCCN)C(=O)O)N DBNGDEAQXGFGRA-ACRUOGEOSA-N 0.000 description 1
- BQMFWUKNOCJDNV-HJWJTTGWSA-N Phe-Val-Ile Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O BQMFWUKNOCJDNV-HJWJTTGWSA-N 0.000 description 1
- 239000004698 Polyethylene Substances 0.000 description 1
- 229920002873 Polyethylenimine Polymers 0.000 description 1
- QSKCKTUQPICLSO-AVGNSLFASA-N Pro-Arg-Lys Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCCN)C(=O)O QSKCKTUQPICLSO-AVGNSLFASA-N 0.000 description 1
- CLNJSLSHKJECME-BQBZGAKWSA-N Pro-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)[C@@H]1CCCN1 CLNJSLSHKJECME-BQBZGAKWSA-N 0.000 description 1
- DMKWYMWNEKIPFC-IUCAKERBSA-N Pro-Gly-Arg Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](CCCNC(N)=N)C(O)=O DMKWYMWNEKIPFC-IUCAKERBSA-N 0.000 description 1
- DXTOOBDIIAJZBJ-BQBZGAKWSA-N Pro-Gly-Ser Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](CO)C(O)=O DXTOOBDIIAJZBJ-BQBZGAKWSA-N 0.000 description 1
- FDINZVJXLPILKV-DCAQKATOSA-N Pro-His-Asn Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(N)=O)C(O)=O FDINZVJXLPILKV-DCAQKATOSA-N 0.000 description 1
- XYHMFGGWNOFUOU-QXEWZRGKSA-N Pro-Ile-Gly Chemical compound OC(=O)CNC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H]1CCCN1 XYHMFGGWNOFUOU-QXEWZRGKSA-N 0.000 description 1
- DRKAXLDECUGLFE-ULQDDVLXSA-N Pro-Leu-Phe Chemical compound CC(C)C[C@H](NC(=O)[C@@H]1CCCN1)C(=O)N[C@@H](Cc1ccccc1)C(O)=O DRKAXLDECUGLFE-ULQDDVLXSA-N 0.000 description 1
- WFIVLLFYUZZWOD-RHYQMDGZSA-N Pro-Lys-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O WFIVLLFYUZZWOD-RHYQMDGZSA-N 0.000 description 1
- WOIFYRZPIORBRY-AVGNSLFASA-N Pro-Lys-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O WOIFYRZPIORBRY-AVGNSLFASA-N 0.000 description 1
- WIPAMEKBSHNFQE-IUCAKERBSA-N Pro-Met-Gly Chemical compound CSCC[C@@H](C(=O)NCC(=O)O)NC(=O)[C@@H]1CCCN1 WIPAMEKBSHNFQE-IUCAKERBSA-N 0.000 description 1
- SVXXJYJCRNKDDE-AVGNSLFASA-N Pro-Pro-His Chemical compound C([C@@H](C(=O)O)NC(=O)[C@H]1N(CCC1)C(=O)[C@H]1NCCC1)C1=CN=CN1 SVXXJYJCRNKDDE-AVGNSLFASA-N 0.000 description 1
- MKGIILKDUGDRRO-FXQIFTODSA-N Pro-Ser-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H]1CCCN1 MKGIILKDUGDRRO-FXQIFTODSA-N 0.000 description 1
- WVXQQUWOKUZIEG-VEVYYDQMSA-N Pro-Thr-Asn Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(O)=O WVXQQUWOKUZIEG-VEVYYDQMSA-N 0.000 description 1
- QUBVFEANYYWBTM-VEVYYDQMSA-N Pro-Thr-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(O)=O QUBVFEANYYWBTM-VEVYYDQMSA-N 0.000 description 1
- DCHQYSOGURGJST-FJXKBIBVSA-N Pro-Thr-Gly Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O DCHQYSOGURGJST-FJXKBIBVSA-N 0.000 description 1
- FIDNSJUXESUDOV-JYJNAYRXSA-N Pro-Tyr-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(O)=O FIDNSJUXESUDOV-JYJNAYRXSA-N 0.000 description 1
- FUOGXAQMNJMBFG-WPRPVWTQSA-N Pro-Val-Gly Chemical compound OC(=O)CNC(=O)[C@H](C(C)C)NC(=O)[C@@H]1CCCN1 FUOGXAQMNJMBFG-WPRPVWTQSA-N 0.000 description 1
- ONIBWKKTOPOVIA-UHFFFAOYSA-N Proline Natural products OC(=O)C1CCCN1 ONIBWKKTOPOVIA-UHFFFAOYSA-N 0.000 description 1
- 241000589516 Pseudomonas Species 0.000 description 1
- 108010066717 Q beta Replicase Proteins 0.000 description 1
- 238000012181 QIAquick gel extraction kit Methods 0.000 description 1
- 108010003201 RGH 0205 Proteins 0.000 description 1
- 241000700159 Rattus Species 0.000 description 1
- 108010008281 Recombinant Fusion Proteins Proteins 0.000 description 1
- 102000007056 Recombinant Fusion Proteins Human genes 0.000 description 1
- 206010057190 Respiratory tract infections Diseases 0.000 description 1
- 108091028664 Ribonucleotide Proteins 0.000 description 1
- 101150071661 SLC25A20 gene Proteins 0.000 description 1
- 240000004808 Saccharomyces cerevisiae Species 0.000 description 1
- MWMKFWJYRRGXOR-ZLUOBGJFSA-N Ser-Ala-Asn Chemical compound N[C@H](C(=O)N[C@H](C(=O)N[C@H](C(=O)O)CC(N)=O)C)CO MWMKFWJYRRGXOR-ZLUOBGJFSA-N 0.000 description 1
- BTKUIVBNGBFTTP-WHFBIAKZSA-N Ser-Ala-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)NCC(O)=O BTKUIVBNGBFTTP-WHFBIAKZSA-N 0.000 description 1
- WTWGOQRNRFHFQD-JBDRJPRFSA-N Ser-Ala-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WTWGOQRNRFHFQD-JBDRJPRFSA-N 0.000 description 1
- HRNQLKCLPVKZNE-CIUDSAMLSA-N Ser-Ala-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O HRNQLKCLPVKZNE-CIUDSAMLSA-N 0.000 description 1
- HBZBPFLJNDXRAY-FXQIFTODSA-N Ser-Ala-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O HBZBPFLJNDXRAY-FXQIFTODSA-N 0.000 description 1
- GXXTUIUYTWGPMV-FXQIFTODSA-N Ser-Arg-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(O)=O GXXTUIUYTWGPMV-FXQIFTODSA-N 0.000 description 1
- QFBNNYNWKYKVJO-DCAQKATOSA-N Ser-Arg-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CO)CCCN=C(N)N QFBNNYNWKYKVJO-DCAQKATOSA-N 0.000 description 1
- VAUMZJHYZQXZBQ-WHFBIAKZSA-N Ser-Asn-Gly Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O VAUMZJHYZQXZBQ-WHFBIAKZSA-N 0.000 description 1
- RDFQNDHEHVSONI-ZLUOBGJFSA-N Ser-Asn-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O RDFQNDHEHVSONI-ZLUOBGJFSA-N 0.000 description 1
- UGJRQLURDVGULT-LKXGYXEUSA-N Ser-Asn-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O UGJRQLURDVGULT-LKXGYXEUSA-N 0.000 description 1
- CTLVSHXLRVEILB-UBHSHLNASA-N Ser-Asn-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CO)N CTLVSHXLRVEILB-UBHSHLNASA-N 0.000 description 1
- KNZQGAUEYZJUSQ-ZLUOBGJFSA-N Ser-Asp-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CO)N KNZQGAUEYZJUSQ-ZLUOBGJFSA-N 0.000 description 1
- OHKLFYXEOGGGCK-ZLUOBGJFSA-N Ser-Asp-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O OHKLFYXEOGGGCK-ZLUOBGJFSA-N 0.000 description 1
- NJSPTZXVPZDRCU-UBHSHLNASA-N Ser-Asp-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CO)N NJSPTZXVPZDRCU-UBHSHLNASA-N 0.000 description 1
- MPPHJZYXDVDGOF-BWBBJGPYSA-N Ser-Cys-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CS)NC(=O)[C@@H](N)CO MPPHJZYXDVDGOF-BWBBJGPYSA-N 0.000 description 1
- UOLGINIHBRIECN-FXQIFTODSA-N Ser-Glu-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O UOLGINIHBRIECN-FXQIFTODSA-N 0.000 description 1
- UQFYNFTYDHUIMI-WHFBIAKZSA-N Ser-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)[C@@H](N)CO UQFYNFTYDHUIMI-WHFBIAKZSA-N 0.000 description 1
- MUARUIBTKQJKFY-WHFBIAKZSA-N Ser-Gly-Asp Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O MUARUIBTKQJKFY-WHFBIAKZSA-N 0.000 description 1
- SNVIOQXAHVORQM-WDSKDSINSA-N Ser-Gly-Gln Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CCC(N)=O)C(O)=O SNVIOQXAHVORQM-WDSKDSINSA-N 0.000 description 1
- GZFAWAQTEYDKII-YUMQZZPRSA-N Ser-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CO GZFAWAQTEYDKII-YUMQZZPRSA-N 0.000 description 1
- CAOYHZOWXFFAIR-CIUDSAMLSA-N Ser-His-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(O)=O CAOYHZOWXFFAIR-CIUDSAMLSA-N 0.000 description 1
- IFPBAGJBHSNYPR-ZKWXMUAHSA-N Ser-Ile-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O IFPBAGJBHSNYPR-ZKWXMUAHSA-N 0.000 description 1
- ZOPISOXXPQNOCO-SVSWQMSJSA-N Ser-Ile-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](CO)N ZOPISOXXPQNOCO-SVSWQMSJSA-N 0.000 description 1
- KCNSGAMPBPYUAI-CIUDSAMLSA-N Ser-Leu-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O KCNSGAMPBPYUAI-CIUDSAMLSA-N 0.000 description 1
- HDBOEVPDIDDEPC-CIUDSAMLSA-N Ser-Lys-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O HDBOEVPDIDDEPC-CIUDSAMLSA-N 0.000 description 1
- GVMUJUPXFQFBBZ-GUBZILKMSA-N Ser-Lys-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O GVMUJUPXFQFBBZ-GUBZILKMSA-N 0.000 description 1
- XUDRHBPSPAPDJP-SRVKXCTJSA-N Ser-Lys-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CO XUDRHBPSPAPDJP-SRVKXCTJSA-N 0.000 description 1
- FPCGZYMRFFIYIH-CIUDSAMLSA-N Ser-Lys-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O FPCGZYMRFFIYIH-CIUDSAMLSA-N 0.000 description 1
- LRZLZIUXQBIWTB-KATARQTJSA-N Ser-Lys-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LRZLZIUXQBIWTB-KATARQTJSA-N 0.000 description 1
- UGTZYIPOBYXWRW-SRVKXCTJSA-N Ser-Phe-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O UGTZYIPOBYXWRW-SRVKXCTJSA-N 0.000 description 1
- ZKBKUWQVDWWSRI-BZSNNMDCSA-N Ser-Phe-Tyr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O ZKBKUWQVDWWSRI-BZSNNMDCSA-N 0.000 description 1
- QMCDMHWAKMUGJE-IHRRRGAJSA-N Ser-Phe-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(O)=O QMCDMHWAKMUGJE-IHRRRGAJSA-N 0.000 description 1
- NMZXJDSKEGFDLJ-DCAQKATOSA-N Ser-Pro-Lys Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CO)N)C(=O)N[C@@H](CCCCN)C(=O)O NMZXJDSKEGFDLJ-DCAQKATOSA-N 0.000 description 1
- PPCZVWHJWJFTFN-ZLUOBGJFSA-N Ser-Ser-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O PPCZVWHJWJFTFN-ZLUOBGJFSA-N 0.000 description 1
- FZXOPYUEQGDGMS-ACZMJKKPSA-N Ser-Ser-Gln Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(O)=O FZXOPYUEQGDGMS-ACZMJKKPSA-N 0.000 description 1
- GYDFRTRSSXOZCR-ACZMJKKPSA-N Ser-Ser-Glu Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O GYDFRTRSSXOZCR-ACZMJKKPSA-N 0.000 description 1
- CUXJENOFJXOSOZ-BIIVOSGPSA-N Ser-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CO)N)C(=O)O CUXJENOFJXOSOZ-BIIVOSGPSA-N 0.000 description 1
- XQJCEKXQUJQNNK-ZLUOBGJFSA-N Ser-Ser-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O XQJCEKXQUJQNNK-ZLUOBGJFSA-N 0.000 description 1
- DKGRNFUXVTYRAS-UBHSHLNASA-N Ser-Ser-Trp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O DKGRNFUXVTYRAS-UBHSHLNASA-N 0.000 description 1
- OLKICIBQRVSQMA-SRVKXCTJSA-N Ser-Ser-Tyr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O OLKICIBQRVSQMA-SRVKXCTJSA-N 0.000 description 1
- LDEBVRIURYMKQS-WISUUJSJSA-N Ser-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@@H](N)CO LDEBVRIURYMKQS-WISUUJSJSA-N 0.000 description 1
- WUXCHQZLUHBSDJ-LKXGYXEUSA-N Ser-Thr-Asp Chemical compound OC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CC(O)=O)C(O)=O WUXCHQZLUHBSDJ-LKXGYXEUSA-N 0.000 description 1
- QNBVFKZSSRYNFX-CUJWVEQBSA-N Ser-Thr-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CO)N)O QNBVFKZSSRYNFX-CUJWVEQBSA-N 0.000 description 1
- NADLKBTYNKUJEP-KATARQTJSA-N Ser-Thr-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O NADLKBTYNKUJEP-KATARQTJSA-N 0.000 description 1
- PCJLFYBAQZQOFE-KATARQTJSA-N Ser-Thr-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CO)N)O PCJLFYBAQZQOFE-KATARQTJSA-N 0.000 description 1
- ZSDXEKUKQAKZFE-XAVMHZPKSA-N Ser-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CO)N)O ZSDXEKUKQAKZFE-XAVMHZPKSA-N 0.000 description 1
- SNXUIBACCONSOH-BWBBJGPYSA-N Ser-Thr-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CO)C(O)=O SNXUIBACCONSOH-BWBBJGPYSA-N 0.000 description 1
- YXEYTHXDRDAIOJ-CWRNSKLLSA-N Ser-Trp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CNC3=CC=CC=C32)NC(=O)[C@H](CO)N)C(=O)O YXEYTHXDRDAIOJ-CWRNSKLLSA-N 0.000 description 1
- PMTWIUBUQRGCSB-FXQIFTODSA-N Ser-Val-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O PMTWIUBUQRGCSB-FXQIFTODSA-N 0.000 description 1
- JGUWRQWULDWNCM-FXQIFTODSA-N Ser-Val-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O JGUWRQWULDWNCM-FXQIFTODSA-N 0.000 description 1
- 108010071390 Serum Albumin Proteins 0.000 description 1
- 102000007562 Serum Albumin Human genes 0.000 description 1
- 108020004682 Single-Stranded DNA Proteins 0.000 description 1
- 108010090804 Streptavidin Proteins 0.000 description 1
- 108700005078 Synthetic Genes Proteins 0.000 description 1
- 230000029662 T-helper 1 type immune response Effects 0.000 description 1
- 108010006785 Taq Polymerase Proteins 0.000 description 1
- 108010055044 Tetanus Toxin Proteins 0.000 description 1
- 241000906446 Theraps Species 0.000 description 1
- TYVAWPFQYFPSBR-BFHQHQDPSA-N Thr-Ala-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)NCC(O)=O TYVAWPFQYFPSBR-BFHQHQDPSA-N 0.000 description 1
- BSNZTJXVDOINSR-JXUBOQSCSA-N Thr-Ala-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O BSNZTJXVDOINSR-JXUBOQSCSA-N 0.000 description 1
- XSLXHSYIVPGEER-KZVJFYERSA-N Thr-Ala-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O XSLXHSYIVPGEER-KZVJFYERSA-N 0.000 description 1
- XYEXCEPTALHNEV-RCWTZXSCSA-N Thr-Arg-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O XYEXCEPTALHNEV-RCWTZXSCSA-N 0.000 description 1
- CEXFELBFVHLYDZ-XGEHTFHBSA-N Thr-Arg-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O CEXFELBFVHLYDZ-XGEHTFHBSA-N 0.000 description 1
- WFUAUEQXPVNAEF-ZJDVBMNYSA-N Thr-Arg-Thr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H]([C@@H](C)O)C(O)=O)CCCN=C(N)N WFUAUEQXPVNAEF-ZJDVBMNYSA-N 0.000 description 1
- IRKWVRSEQFTGGV-VEVYYDQMSA-N Thr-Asn-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O IRKWVRSEQFTGGV-VEVYYDQMSA-N 0.000 description 1
- YLXAMFZYJTZXFH-OLHMAJIHSA-N Thr-Asn-Asp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N)O YLXAMFZYJTZXFH-OLHMAJIHSA-N 0.000 description 1
- QNJZOAHSYPXTAB-VEVYYDQMSA-N Thr-Asn-Met Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCSC)C(O)=O QNJZOAHSYPXTAB-VEVYYDQMSA-N 0.000 description 1
- JVTHIXKSVYEWNI-JRQIVUDYSA-N Thr-Asn-Tyr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O JVTHIXKSVYEWNI-JRQIVUDYSA-N 0.000 description 1
- UZJDBCHMIQXLOQ-HEIBUPTGSA-N Thr-Cys-Thr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N)O UZJDBCHMIQXLOQ-HEIBUPTGSA-N 0.000 description 1
- QILPDQCTQZDHFM-HJGDQZAQSA-N Thr-Gln-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O QILPDQCTQZDHFM-HJGDQZAQSA-N 0.000 description 1
- WLDUCKSCDRIVLJ-NUMRIWBASA-N Thr-Gln-Asp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N)O WLDUCKSCDRIVLJ-NUMRIWBASA-N 0.000 description 1
- VUVCRYXYUUPGSB-GLLZPBPUSA-N Thr-Gln-Glu Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N)O VUVCRYXYUUPGSB-GLLZPBPUSA-N 0.000 description 1
- LIXBDERDAGNVAV-XKBZYTNZSA-N Thr-Gln-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O LIXBDERDAGNVAV-XKBZYTNZSA-N 0.000 description 1
- DXNUZQGVOMCGNS-SWRJLBSHSA-N Thr-Gln-Trp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N)O DXNUZQGVOMCGNS-SWRJLBSHSA-N 0.000 description 1
- DKDHTRVDOUZZTP-IFFSRLJSSA-N Thr-Gln-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)[C@@H](C)O)C(O)=O DKDHTRVDOUZZTP-IFFSRLJSSA-N 0.000 description 1
- VGYBYGQXZJDZJU-XQXXSGGOSA-N Thr-Glu-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O VGYBYGQXZJDZJU-XQXXSGGOSA-N 0.000 description 1
- LGNBRHZANHMZHK-NUMRIWBASA-N Thr-Glu-Asp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N)O LGNBRHZANHMZHK-NUMRIWBASA-N 0.000 description 1
- VULNJDORNLBPNG-SWRJLBSHSA-N Thr-Glu-Trp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N)O VULNJDORNLBPNG-SWRJLBSHSA-N 0.000 description 1
- NIEWSKWFURSECR-FOHZUACHSA-N Thr-Gly-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O NIEWSKWFURSECR-FOHZUACHSA-N 0.000 description 1
- XPNSAQMEAVSQRD-FBCQKBJTSA-N Thr-Gly-Gly Chemical compound C[C@@H](O)[C@H](N)C(=O)NCC(=O)NCC(O)=O XPNSAQMEAVSQRD-FBCQKBJTSA-N 0.000 description 1
- YZUWGFXVVZQJEI-PMVVWTBXSA-N Thr-Gly-His Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N)O YZUWGFXVVZQJEI-PMVVWTBXSA-N 0.000 description 1
- DJDSEDOKJTZBAR-ZDLURKLDSA-N Thr-Gly-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O DJDSEDOKJTZBAR-ZDLURKLDSA-N 0.000 description 1
- JKGGPMOUIAAJAA-YEPSODPASA-N Thr-Gly-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O JKGGPMOUIAAJAA-YEPSODPASA-N 0.000 description 1
- SIMKLINEDYOTKL-MBLNEYKQSA-N Thr-His-Ala Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](C)C(=O)O)N)O SIMKLINEDYOTKL-MBLNEYKQSA-N 0.000 description 1
- KRGDDWVBBDLPSJ-CUJWVEQBSA-N Thr-His-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(O)=O KRGDDWVBBDLPSJ-CUJWVEQBSA-N 0.000 description 1
- PAXANSWUSVPFNK-IUKAMOBKSA-N Thr-Ile-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H]([C@@H](C)O)N PAXANSWUSVPFNK-IUKAMOBKSA-N 0.000 description 1
- CRZNCABIJLRFKZ-IUKAMOBKSA-N Thr-Ile-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H]([C@@H](C)O)N CRZNCABIJLRFKZ-IUKAMOBKSA-N 0.000 description 1
- ADPHPKGWVDHWML-PPCPHDFISA-N Thr-Ile-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)O)NC(=O)[C@H]([C@@H](C)O)N ADPHPKGWVDHWML-PPCPHDFISA-N 0.000 description 1
- AHOLTQCAVBSUDP-PPCPHDFISA-N Thr-Ile-Lys Chemical compound CC[C@H](C)[C@H](NC(=O)[C@@H](N)[C@@H](C)O)C(=O)N[C@@H](CCCCN)C(O)=O AHOLTQCAVBSUDP-PPCPHDFISA-N 0.000 description 1
- BVOVIGCHYNFJBZ-JXUBOQSCSA-N Thr-Leu-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O BVOVIGCHYNFJBZ-JXUBOQSCSA-N 0.000 description 1
- RFKVQLIXNVEOMB-WEDXCCLWSA-N Thr-Leu-Gly Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)NCC(=O)O)N)O RFKVQLIXNVEOMB-WEDXCCLWSA-N 0.000 description 1
- FLPZMPOZGYPBEN-PPCPHDFISA-N Thr-Leu-Ile Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O FLPZMPOZGYPBEN-PPCPHDFISA-N 0.000 description 1
- BDGBHYCAZJPLHX-HJGDQZAQSA-N Thr-Lys-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O BDGBHYCAZJPLHX-HJGDQZAQSA-N 0.000 description 1
- ZSPQUTWLWGWTPS-HJGDQZAQSA-N Thr-Lys-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O ZSPQUTWLWGWTPS-HJGDQZAQSA-N 0.000 description 1
- UUSQVWOVUYMLJA-PPCPHDFISA-N Thr-Lys-Ile Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O UUSQVWOVUYMLJA-PPCPHDFISA-N 0.000 description 1
- MCDVZTRGHNXTGK-HJGDQZAQSA-N Thr-Met-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCC(O)=O)C(O)=O MCDVZTRGHNXTGK-HJGDQZAQSA-N 0.000 description 1
- XNTVWRJTUIOGQO-RHYQMDGZSA-N Thr-Met-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(O)=O XNTVWRJTUIOGQO-RHYQMDGZSA-N 0.000 description 1
- GUHLYMZJVXUIPO-RCWTZXSCSA-N Thr-Met-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C(C)C)C(O)=O GUHLYMZJVXUIPO-RCWTZXSCSA-N 0.000 description 1
- KZURUCDWKDEAFZ-XVSYOHENSA-N Thr-Phe-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(=O)N)C(=O)O)N)O KZURUCDWKDEAFZ-XVSYOHENSA-N 0.000 description 1
- NZRUWPIYECBYRK-HTUGSXCWSA-N Thr-Phe-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O NZRUWPIYECBYRK-HTUGSXCWSA-N 0.000 description 1
- GVMXJJAJLIEASL-ZJDVBMNYSA-N Thr-Pro-Thr Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O GVMXJJAJLIEASL-ZJDVBMNYSA-N 0.000 description 1
- DOBIBIXIHJKVJF-XKBZYTNZSA-N Thr-Ser-Gln Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(N)=O DOBIBIXIHJKVJF-XKBZYTNZSA-N 0.000 description 1
- XHWCDRUPDNSDAZ-XKBZYTNZSA-N Thr-Ser-Glu Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N)O XHWCDRUPDNSDAZ-XKBZYTNZSA-N 0.000 description 1
- WPSKTVVMQCXPRO-BWBBJGPYSA-N Thr-Ser-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O WPSKTVVMQCXPRO-BWBBJGPYSA-N 0.000 description 1
- RVMNUBQWPVOUKH-HEIBUPTGSA-N Thr-Ser-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O RVMNUBQWPVOUKH-HEIBUPTGSA-N 0.000 description 1
- NDZYTIMDOZMECO-SHGPDSBTSA-N Thr-Thr-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O NDZYTIMDOZMECO-SHGPDSBTSA-N 0.000 description 1
- VBMOVTMNHWPZJR-SUSMZKCASA-N Thr-Thr-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O VBMOVTMNHWPZJR-SUSMZKCASA-N 0.000 description 1
- PJCYRZVSACOYSN-ZJDVBMNYSA-N Thr-Thr-Met Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCSC)C(O)=O PJCYRZVSACOYSN-ZJDVBMNYSA-N 0.000 description 1
- ZESGVALRVJIVLZ-VFCFLDTKSA-N Thr-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@@H]1C(=O)O)N)O ZESGVALRVJIVLZ-VFCFLDTKSA-N 0.000 description 1
- SOUPNXUJAJENFU-SWRJLBSHSA-N Thr-Trp-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N)O SOUPNXUJAJENFU-SWRJLBSHSA-N 0.000 description 1
- XGUAUKUYQHBUNY-SWRJLBSHSA-N Thr-Trp-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCC(O)=O)C(O)=O XGUAUKUYQHBUNY-SWRJLBSHSA-N 0.000 description 1
- MYNYCUXMIIWUNW-IEGACIPQSA-N Thr-Trp-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CCCCN)C(=O)O)N)O MYNYCUXMIIWUNW-IEGACIPQSA-N 0.000 description 1
- ZEJBJDHSQPOVJV-UAXMHLISSA-N Thr-Trp-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ZEJBJDHSQPOVJV-UAXMHLISSA-N 0.000 description 1
- BEZTUFWTPVOROW-KJEVXHAQSA-N Thr-Tyr-Arg Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N)O BEZTUFWTPVOROW-KJEVXHAQSA-N 0.000 description 1
- MNYNCKZAEIAONY-XGEHTFHBSA-N Thr-Val-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O MNYNCKZAEIAONY-XGEHTFHBSA-N 0.000 description 1
- 108700019146 Transgenes Proteins 0.000 description 1
- PEYSVKMXSLPQRU-FJHTZYQYSA-N Trp-Ala-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](C)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N)O PEYSVKMXSLPQRU-FJHTZYQYSA-N 0.000 description 1
- PNKDNKGMEHJTJQ-BPUTZDHNSA-N Trp-Arg-Asn Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC(=O)N)C(=O)O)N PNKDNKGMEHJTJQ-BPUTZDHNSA-N 0.000 description 1
- TZNNEYFZZAHLBL-BPUTZDHNSA-N Trp-Arg-Asp Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O TZNNEYFZZAHLBL-BPUTZDHNSA-N 0.000 description 1
- IXEGQBJZDIRRIV-QEJZJMRPSA-N Trp-Asn-Glu Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O IXEGQBJZDIRRIV-QEJZJMRPSA-N 0.000 description 1
- MHNHRNHJMXAVHZ-AAEUAGOBSA-N Trp-Asn-Gly Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)NCC(=O)O)N MHNHRNHJMXAVHZ-AAEUAGOBSA-N 0.000 description 1
- UKINEYBQXPMOJO-UBHSHLNASA-N Trp-Asn-Ser Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CO)C(=O)O)N UKINEYBQXPMOJO-UBHSHLNASA-N 0.000 description 1
- FKAPNDWDLDWZNF-QEJZJMRPSA-N Trp-Asp-Glu Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N FKAPNDWDLDWZNF-QEJZJMRPSA-N 0.000 description 1
- LTLBNCDNXQCOLB-UBHSHLNASA-N Trp-Asp-Ser Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O)=CNC2=C1 LTLBNCDNXQCOLB-UBHSHLNASA-N 0.000 description 1
- WPSYJHFHZYJXMW-JSGCOSHPSA-N Trp-Gln-Gly Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O WPSYJHFHZYJXMW-JSGCOSHPSA-N 0.000 description 1
- OENGVSDBQHHGBU-QEJZJMRPSA-N Trp-Glu-Asn Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O OENGVSDBQHHGBU-QEJZJMRPSA-N 0.000 description 1
- WMIUTJPFHMMUGY-ZFWWWQNUSA-N Trp-Pro-Gly Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CC2=CNC3=CC=CC=C32)N)C(=O)NCC(=O)O WMIUTJPFHMMUGY-ZFWWWQNUSA-N 0.000 description 1
- QHWMVGCEQAPQDK-UMPQAUOISA-N Trp-Thr-Arg Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N)O QHWMVGCEQAPQDK-UMPQAUOISA-N 0.000 description 1
- RQKMZXSRILVOQZ-GMVOTWDCSA-N Trp-Tyr-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)N RQKMZXSRILVOQZ-GMVOTWDCSA-N 0.000 description 1
- ICPRIGUXAFULPH-ILWGZMRPSA-N Trp-Tyr-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=C(C=C2)O)NC(=O)[C@H](CC3=CNC4=CC=CC=C43)N)C(=O)O ICPRIGUXAFULPH-ILWGZMRPSA-N 0.000 description 1
- YTHWAWACWGWBLE-MNSWYVGCSA-N Trp-Tyr-Thr Chemical compound C([C@@H](C(=O)N[C@@H]([C@H](O)C)C(O)=O)NC(=O)[C@@H](N)CC=1C2=CC=CC=C2NC=1)C1=CC=C(O)C=C1 YTHWAWACWGWBLE-MNSWYVGCSA-N 0.000 description 1
- XGEUYEOEZYFHRL-KKXDTOCCSA-N Tyr-Ala-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=C(O)C=C1 XGEUYEOEZYFHRL-KKXDTOCCSA-N 0.000 description 1
- HSVPZJLMPLMPOX-BPNCWPANSA-N Tyr-Arg-Ala Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(O)=O HSVPZJLMPLMPOX-BPNCWPANSA-N 0.000 description 1
- JBBYKPZAPOLCPK-JYJNAYRXSA-N Tyr-Arg-Met Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCSC)C(O)=O JBBYKPZAPOLCPK-JYJNAYRXSA-N 0.000 description 1
- OEVJGIHPQOXYFE-SRVKXCTJSA-N Tyr-Asn-Asp Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O OEVJGIHPQOXYFE-SRVKXCTJSA-N 0.000 description 1
- GAYLGYUVTDMLKC-UWJYBYFXSA-N Tyr-Asp-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 GAYLGYUVTDMLKC-UWJYBYFXSA-N 0.000 description 1
- WJKJJGXZRHDNTN-UWVGGRQHSA-N Tyr-Cys Chemical compound SC[C@@H](C(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 WJKJJGXZRHDNTN-UWVGGRQHSA-N 0.000 description 1
- LOOCQRRBKZTPKO-AVGNSLFASA-N Tyr-Glu-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 LOOCQRRBKZTPKO-AVGNSLFASA-N 0.000 description 1
- KOVXHANYYYMBRF-IRIUXVKKSA-N Tyr-Glu-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N)O KOVXHANYYYMBRF-IRIUXVKKSA-N 0.000 description 1
- QAYSODICXVZUIA-WLTAIBSBSA-N Tyr-Gly-Thr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O QAYSODICXVZUIA-WLTAIBSBSA-N 0.000 description 1
- OHOVFPKXPZODHS-SJWGOKEGSA-N Tyr-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)N OHOVFPKXPZODHS-SJWGOKEGSA-N 0.000 description 1
- GITNQBVCEQBDQC-KKUMJFAQSA-N Tyr-Lys-Asn Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O GITNQBVCEQBDQC-KKUMJFAQSA-N 0.000 description 1
- SINRIKQYQJRGDQ-MEYUZBJRSA-N Tyr-Lys-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 SINRIKQYQJRGDQ-MEYUZBJRSA-N 0.000 description 1
- OFHKXNKJXURPSY-ULQDDVLXSA-N Tyr-Met-Leu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(O)=O OFHKXNKJXURPSY-ULQDDVLXSA-N 0.000 description 1
- BGFCXQXETBDEHP-BZSNNMDCSA-N Tyr-Phe-Asn Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(N)=O)C(O)=O BGFCXQXETBDEHP-BZSNNMDCSA-N 0.000 description 1
- XGZBEGGGAUQBMB-KJEVXHAQSA-N Tyr-Pro-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CC2=CC=C(C=C2)O)N)O XGZBEGGGAUQBMB-KJEVXHAQSA-N 0.000 description 1
- GQVZBMROTPEPIF-SRVKXCTJSA-N Tyr-Ser-Asp Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O GQVZBMROTPEPIF-SRVKXCTJSA-N 0.000 description 1
- ITDWWLTTWRRLCC-KJEVXHAQSA-N Tyr-Thr-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@H]([C@H](O)C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 ITDWWLTTWRRLCC-KJEVXHAQSA-N 0.000 description 1
- NZBSVMQZQMEUHI-WZLNRYEVSA-N Tyr-Thr-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H]([C@@H](C)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N NZBSVMQZQMEUHI-WZLNRYEVSA-N 0.000 description 1
- WYOBRXPIZVKNMF-IRXDYDNUSA-N Tyr-Tyr-Gly Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)NCC(O)=O)C1=CC=C(O)C=C1 WYOBRXPIZVKNMF-IRXDYDNUSA-N 0.000 description 1
- RMRFSFXLFWWAJZ-HJOGWXRNSA-N Tyr-Tyr-Tyr Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=C(O)C=C1 RMRFSFXLFWWAJZ-HJOGWXRNSA-N 0.000 description 1
- GOPQNCQSXBJAII-ULQDDVLXSA-N Tyr-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N GOPQNCQSXBJAII-ULQDDVLXSA-N 0.000 description 1
- ABSXSJZNRAQDDI-KJEVXHAQSA-N Tyr-Val-Thr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ABSXSJZNRAQDDI-KJEVXHAQSA-N 0.000 description 1
- 244000000188 Vaccinium ovalifolium Species 0.000 description 1
- RUCNAYOMFXRIKJ-DCAQKATOSA-N Val-Ala-Lys Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCCN RUCNAYOMFXRIKJ-DCAQKATOSA-N 0.000 description 1
- PFNZJEPSCBAVGX-CYDGBPFRSA-N Val-Arg-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](C(C)C)N PFNZJEPSCBAVGX-CYDGBPFRSA-N 0.000 description 1
- PAPWZOJOLKZEFR-AVGNSLFASA-N Val-Arg-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCCN)C(=O)O)N PAPWZOJOLKZEFR-AVGNSLFASA-N 0.000 description 1
- DCOOGDCRFXXQNW-ZKWXMUAHSA-N Val-Asn-Cys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CS)C(=O)O)N DCOOGDCRFXXQNW-ZKWXMUAHSA-N 0.000 description 1
- LIQJSDDOULTANC-QSFUFRPTSA-N Val-Asn-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](C(C)C)N LIQJSDDOULTANC-QSFUFRPTSA-N 0.000 description 1
- JLFKWDAZBRYCGX-ZKWXMUAHSA-N Val-Asn-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CO)C(=O)O)N JLFKWDAZBRYCGX-ZKWXMUAHSA-N 0.000 description 1
- SCBITHMBEJNRHC-LSJOCFKGSA-N Val-Asp-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](C(C)C)C(=O)O)N SCBITHMBEJNRHC-LSJOCFKGSA-N 0.000 description 1
- XXDVDTMEVBYRPK-XPUUQOCRSA-N Val-Gln Chemical compound CC(C)[C@H](N)C(=O)N[C@H](C(O)=O)CCC(N)=O XXDVDTMEVBYRPK-XPUUQOCRSA-N 0.000 description 1
- XTAUQCGQFJQGEJ-NHCYSSNCSA-N Val-Gln-Arg Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N XTAUQCGQFJQGEJ-NHCYSSNCSA-N 0.000 description 1
- YCMXFKWYJFZFKS-LAEOZQHASA-N Val-Gln-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N YCMXFKWYJFZFKS-LAEOZQHASA-N 0.000 description 1
- PGBJAZDAEWPDAA-NHCYSSNCSA-N Val-Gln-Met Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCSC)C(=O)O)N PGBJAZDAEWPDAA-NHCYSSNCSA-N 0.000 description 1
- VCAWFLIWYNMHQP-UKJIMTQDSA-N Val-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](C(C)C)N VCAWFLIWYNMHQP-UKJIMTQDSA-N 0.000 description 1
- ZXAGTABZUOMUDO-GVXVVHGQSA-N Val-Glu-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N ZXAGTABZUOMUDO-GVXVVHGQSA-N 0.000 description 1
- RKIGNDAHUOOIMJ-BQFCYCMXSA-N Val-Glu-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)C(C)C)C(O)=O)=CNC2=C1 RKIGNDAHUOOIMJ-BQFCYCMXSA-N 0.000 description 1
- BEGDZYNDCNEGJZ-XVKPBYJWSA-N Val-Gly-Gln Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(N)=O BEGDZYNDCNEGJZ-XVKPBYJWSA-N 0.000 description 1
- PIFJAFRUVWZRKR-QMMMGPOBSA-N Val-Gly-Gly Chemical compound CC(C)[C@H]([NH3+])C(=O)NCC(=O)NCC([O-])=O PIFJAFRUVWZRKR-QMMMGPOBSA-N 0.000 description 1
- FXVDGDZRYLFQKY-WPRPVWTQSA-N Val-Gly-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)C(C)C FXVDGDZRYLFQKY-WPRPVWTQSA-N 0.000 description 1
- JVYIGCARISMLMV-HOCLYGCPSA-N Val-Gly-Trp Chemical compound CC(C)[C@@H](C(=O)NCC(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N JVYIGCARISMLMV-HOCLYGCPSA-N 0.000 description 1
- UKEVLVBHRKWECS-LSJOCFKGSA-N Val-Ile-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](C(C)C)N UKEVLVBHRKWECS-LSJOCFKGSA-N 0.000 description 1
- SDUBQHUJJWQTEU-XUXIUFHCSA-N Val-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](C(C)C)N SDUBQHUJJWQTEU-XUXIUFHCSA-N 0.000 description 1
- OVBMCNDKCWAXMZ-NAKRPEOUSA-N Val-Ile-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](C(C)C)N OVBMCNDKCWAXMZ-NAKRPEOUSA-N 0.000 description 1
- UMPVMAYCLYMYGA-ONGXEEELSA-N Val-Leu-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O UMPVMAYCLYMYGA-ONGXEEELSA-N 0.000 description 1
- XXWBHOWRARMUOC-NHCYSSNCSA-N Val-Lys-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(=O)N)C(=O)O)N XXWBHOWRARMUOC-NHCYSSNCSA-N 0.000 description 1
- YDVDTCJGBBJGRT-GUBZILKMSA-N Val-Met-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CO)C(=O)O)N YDVDTCJGBBJGRT-GUBZILKMSA-N 0.000 description 1
- WMRWZYSRQUORHJ-YDHLFZDLSA-N Val-Phe-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(=O)O)C(=O)O)N WMRWZYSRQUORHJ-YDHLFZDLSA-N 0.000 description 1
- QIVPZSWBBHRNBA-JYJNAYRXSA-N Val-Pro-Phe Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](Cc1ccccc1)C(O)=O QIVPZSWBBHRNBA-JYJNAYRXSA-N 0.000 description 1
- QSPOLEBZTMESFY-SRVKXCTJSA-N Val-Pro-Val Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O QSPOLEBZTMESFY-SRVKXCTJSA-N 0.000 description 1
- VIKZGAUAKQZDOF-NRPADANISA-N Val-Ser-Glu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O VIKZGAUAKQZDOF-NRPADANISA-N 0.000 description 1
- QTPQHINADBYBNA-DCAQKATOSA-N Val-Ser-Lys Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCCN QTPQHINADBYBNA-DCAQKATOSA-N 0.000 description 1
- PQSNETRGCRUOGP-KKHAAJSZSA-N Val-Thr-Asn Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(N)=O PQSNETRGCRUOGP-KKHAAJSZSA-N 0.000 description 1
- TVGWMCTYUFBXAP-QTKMDUPCSA-N Val-Thr-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](C(C)C)N)O TVGWMCTYUFBXAP-QTKMDUPCSA-N 0.000 description 1
- GVNLOVJNNDZUHS-RHYQMDGZSA-N Val-Thr-Lys Chemical compound [H]N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(O)=O GVNLOVJNNDZUHS-RHYQMDGZSA-N 0.000 description 1
- HTONZBWRYUKUKC-RCWTZXSCSA-N Val-Thr-Val Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O HTONZBWRYUKUKC-RCWTZXSCSA-N 0.000 description 1
- HVRRJRMULCPNRO-BZSNNMDCSA-N Val-Trp-Arg Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](N)C(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O)=CNC2=C1 HVRRJRMULCPNRO-BZSNNMDCSA-N 0.000 description 1
- QTXGUIMEHKCPBH-FHWLQOOXSA-N Val-Trp-Lys Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](N)C(C)C)C(=O)N[C@@H](CCCCN)C(O)=O)=CNC2=C1 QTXGUIMEHKCPBH-FHWLQOOXSA-N 0.000 description 1
- RTJPAGFXOWEBAI-SRVKXCTJSA-N Val-Val-Arg Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N RTJPAGFXOWEBAI-SRVKXCTJSA-N 0.000 description 1
- JVGDAEKKZKKZFO-RCWTZXSCSA-N Val-Val-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](C(C)C)N)O JVGDAEKKZKKZFO-RCWTZXSCSA-N 0.000 description 1
- 241000251539 Vertebrata <Metazoa> Species 0.000 description 1
- 108010093857 Viral Hemagglutinins Proteins 0.000 description 1
- 230000010530 Virus Neutralization Effects 0.000 description 1
- 238000009825 accumulation Methods 0.000 description 1
- 230000002378 acidificating effect Effects 0.000 description 1
- 230000009471 action Effects 0.000 description 1
- 239000000443 aerosol Substances 0.000 description 1
- 238000001261 affinity purification Methods 0.000 description 1
- 108010086434 alanyl-seryl-glycine Proteins 0.000 description 1
- 108010078114 alanyl-tryptophyl-alanine Proteins 0.000 description 1
- 108010044940 alanylglutamine Proteins 0.000 description 1
- 108010011559 alanylphenylalanine Proteins 0.000 description 1
- 150000001299 aldehydes Chemical class 0.000 description 1
- 238000011166 aliquoting Methods 0.000 description 1
- 150000001371 alpha-amino acids Chemical class 0.000 description 1
- 235000008206 alpha-amino acids Nutrition 0.000 description 1
- 229940037003 alum Drugs 0.000 description 1
- WNROFYMDJYEPJX-UHFFFAOYSA-K aluminium hydroxide Chemical compound [OH-].[OH-].[OH-].[Al+3] WNROFYMDJYEPJX-UHFFFAOYSA-K 0.000 description 1
- PNEYBMLMFCGWSK-UHFFFAOYSA-N aluminium oxide Inorganic materials [O-2].[O-2].[O-2].[Al+3].[Al+3] PNEYBMLMFCGWSK-UHFFFAOYSA-N 0.000 description 1
- 238000004458 analytical method Methods 0.000 description 1
- 239000003945 anionic surfactant Substances 0.000 description 1
- 230000005875 antibody response Effects 0.000 description 1
- 230000030741 antigen processing and presentation Effects 0.000 description 1
- 210000000612 antigen-presenting cell Anatomy 0.000 description 1
- 239000003963 antioxidant agent Substances 0.000 description 1
- 235000006708 antioxidants Nutrition 0.000 description 1
- ODKSFYDXXFIFQN-UHFFFAOYSA-N arginine Natural products OC(=O)C(N)CCCNC(N)=N ODKSFYDXXFIFQN-UHFFFAOYSA-N 0.000 description 1
- 108010001271 arginyl-glutamyl-arginine Proteins 0.000 description 1
- 108010052670 arginyl-glutamyl-glutamic acid Proteins 0.000 description 1
- 108010018691 arginyl-threonyl-arginine Proteins 0.000 description 1
- 108010059459 arginyl-threonyl-phenylalanine Proteins 0.000 description 1
- 125000003118 aryl group Chemical group 0.000 description 1
- 229960005070 ascorbic acid Drugs 0.000 description 1
- 235000010323 ascorbic acid Nutrition 0.000 description 1
- 239000011668 ascorbic acid Substances 0.000 description 1
- 235000009582 asparagine Nutrition 0.000 description 1
- 229960001230 asparagine Drugs 0.000 description 1
- 108010068265 aspartyltyrosine Proteins 0.000 description 1
- 238000002820 assay format Methods 0.000 description 1
- 230000003190 augmentative effect Effects 0.000 description 1
- 210000003719 b-lymphocyte Anatomy 0.000 description 1
- 238000002869 basic local alignment search tool Methods 0.000 description 1
- WQZGKKKJIJFFOK-VFUOTHLCSA-N beta-D-glucose Chemical compound OC[C@H]1O[C@@H](O)[C@H](O)[C@@H](O)[C@@H]1O WQZGKKKJIJFFOK-VFUOTHLCSA-N 0.000 description 1
- SQVRNKJHWKZAKO-UHFFFAOYSA-N beta-N-Acetyl-D-neuraminic acid Natural products CC(=O)NC1C(O)CC(O)(C(O)=O)OC1C(O)C(O)CO SQVRNKJHWKZAKO-UHFFFAOYSA-N 0.000 description 1
- 230000001588 bifunctional effect Effects 0.000 description 1
- 230000033228 biological regulation Effects 0.000 description 1
- 229940088623 biologically active substance Drugs 0.000 description 1
- 210000004556 brain Anatomy 0.000 description 1
- 210000000481 breast Anatomy 0.000 description 1
- 239000008366 buffered solution Substances 0.000 description 1
- DQXBYHZEEUGOBF-UHFFFAOYSA-N but-3-enoic acid;ethene Chemical compound C=C.OC(=O)CC=C DQXBYHZEEUGOBF-UHFFFAOYSA-N 0.000 description 1
- 101150102633 cact gene Proteins 0.000 description 1
- 101150039352 can gene Proteins 0.000 description 1
- 239000004202 carbamide Substances 0.000 description 1
- 150000001735 carboxylic acids Chemical class 0.000 description 1
- 239000000969 carrier Substances 0.000 description 1
- 230000007910 cell fusion Effects 0.000 description 1
- 230000010261 cell growth Effects 0.000 description 1
- 230000036755 cellular response Effects 0.000 description 1
- 239000001913 cellulose Substances 0.000 description 1
- 229920002678 cellulose Polymers 0.000 description 1
- 229920002301 cellulose acetate Polymers 0.000 description 1
- 239000000919 ceramic Substances 0.000 description 1
- 230000008859 change Effects 0.000 description 1
- 239000002738 chelating agent Substances 0.000 description 1
- 125000003636 chemical group Chemical group 0.000 description 1
- 230000035605 chemotaxis Effects 0.000 description 1
- 210000004978 chinese hamster ovary cell Anatomy 0.000 description 1
- 229960002173 citrulline Drugs 0.000 description 1
- 235000013477 citrulline Nutrition 0.000 description 1
- 238000012875 competitive assay Methods 0.000 description 1
- 230000002860 competitive effect Effects 0.000 description 1
- 238000009833 condensation Methods 0.000 description 1
- 230000005494 condensation Effects 0.000 description 1
- 230000001143 conditioned effect Effects 0.000 description 1
- 230000003750 conditioning effect Effects 0.000 description 1
- 230000021615 conjugation Effects 0.000 description 1
- 125000004093 cyano group Chemical group *C#N 0.000 description 1
- 125000000753 cycloalkyl group Chemical group 0.000 description 1
- 125000001995 cyclobutyl group Chemical group [H]C1([H])C([H])([H])C([H])(*)C1([H])[H] 0.000 description 1
- 238000004163 cytometry Methods 0.000 description 1
- 229940104302 cytosine Drugs 0.000 description 1
- 230000007123 defense Effects 0.000 description 1
- 230000007812 deficiency Effects 0.000 description 1
- 239000003398 denaturant Substances 0.000 description 1
- 238000004925 denaturation Methods 0.000 description 1
- 230000036425 denaturation Effects 0.000 description 1
- 239000005547 deoxyribonucleotide Substances 0.000 description 1
- 125000002637 deoxyribonucleotide group Chemical group 0.000 description 1
- 238000001212 derivatisation Methods 0.000 description 1
- 235000019425 dextrin Nutrition 0.000 description 1
- 238000000502 dialysis Methods 0.000 description 1
- MGHPNCMVUAKAIE-UHFFFAOYSA-N diphenylmethanamine Chemical compound C=1C=CC=CC=1C(N)C1=CC=CC=C1 MGHPNCMVUAKAIE-UHFFFAOYSA-N 0.000 description 1
- 150000002016 disaccharides Chemical class 0.000 description 1
- 230000005750 disease progression Effects 0.000 description 1
- 238000004043 dyeing Methods 0.000 description 1
- 239000002158 endotoxin Substances 0.000 description 1
- 238000005516 engineering process Methods 0.000 description 1
- 239000003623 enhancer Substances 0.000 description 1
- 108010078428 env Gene Products Proteins 0.000 description 1
- 125000000816 ethylene group Chemical group [H]C([H])([*:1])C([H])([H])[*:2] 0.000 description 1
- 239000005038 ethylene vinyl acetate Substances 0.000 description 1
- 238000011156 evaluation Methods 0.000 description 1
- 238000001704 evaporation Methods 0.000 description 1
- 230000008020 evaporation Effects 0.000 description 1
- 239000012530 fluid Substances 0.000 description 1
- 238000000799 fluorescence microscopy Methods 0.000 description 1
- 125000000524 functional group Chemical group 0.000 description 1
- 230000002538 fungal effect Effects 0.000 description 1
- 238000001502 gel electrophoresis Methods 0.000 description 1
- 238000010353 genetic engineering Methods 0.000 description 1
- 239000008103 glucose Substances 0.000 description 1
- 229960002989 glutamic acid Drugs 0.000 description 1
- ZDXPYRJPNDTMRX-UHFFFAOYSA-N glutamine Natural products OC(=O)C(N)CCC(N)=O ZDXPYRJPNDTMRX-UHFFFAOYSA-N 0.000 description 1
- 108010049041 glutamylalanine Proteins 0.000 description 1
- 108010079547 glutamylmethionine Proteins 0.000 description 1
- 229960003180 glutathione Drugs 0.000 description 1
- VPZXBVLAVMBEQI-UHFFFAOYSA-N glycyl-DL-alpha-alanine Natural products OC(=O)C(C)NC(=O)CN VPZXBVLAVMBEQI-UHFFFAOYSA-N 0.000 description 1
- JYPCXBJRLBHWME-UHFFFAOYSA-N glycyl-L-prolyl-L-arginine Natural products NCC(=O)N1CCCC1C(=O)NC(CCCN=C(N)N)C(O)=O JYPCXBJRLBHWME-UHFFFAOYSA-N 0.000 description 1
- 108010026364 glycyl-glycyl-leucine Proteins 0.000 description 1
- 108010025801 glycyl-prolyl-arginine Proteins 0.000 description 1
- 108010082286 glycyl-seryl-alanine Proteins 0.000 description 1
- 108010048994 glycyl-tyrosyl-alanine Proteins 0.000 description 1
- 108010037850 glycylvaline Proteins 0.000 description 1
- 210000002288 golgi apparatus Anatomy 0.000 description 1
- 229960000789 guanidine hydrochloride Drugs 0.000 description 1
- PJJJBBJSCAKJQF-UHFFFAOYSA-N guanidinium chloride Chemical compound [Cl-].NC(N)=[NH2+] PJJJBBJSCAKJQF-UHFFFAOYSA-N 0.000 description 1
- 229940093915 gynecological organic acid Drugs 0.000 description 1
- 235000009424 haa Nutrition 0.000 description 1
- 125000001475 halogen functional group Chemical group 0.000 description 1
- 230000036541 health Effects 0.000 description 1
- 210000002443 helper t lymphocyte Anatomy 0.000 description 1
- 208000002672 hepatitis B Diseases 0.000 description 1
- 125000005842 heteroatom Chemical group 0.000 description 1
- 125000000623 heterocyclic group Chemical group 0.000 description 1
- 208000033519 human immunodeficiency virus infectious disease Diseases 0.000 description 1
- 230000008348 humoral response Effects 0.000 description 1
- 229930195733 hydrocarbon Natural products 0.000 description 1
- 150000002430 hydrocarbons Chemical class 0.000 description 1
- 239000000017 hydrogel Substances 0.000 description 1
- 239000001257 hydrogen Substances 0.000 description 1
- 229910052739 hydrogen Inorganic materials 0.000 description 1
- 230000007062 hydrolysis Effects 0.000 description 1
- 238000006460 hydrolysis reaction Methods 0.000 description 1
- 229920001477 hydrophilic polymer Polymers 0.000 description 1
- 229920001600 hydrophobic polymer Polymers 0.000 description 1
- 125000002887 hydroxy group Chemical group [H]O* 0.000 description 1
- 125000004029 hydroxymethyl group Chemical group [H]OC([H])([H])* 0.000 description 1
- 229960002591 hydroxyproline Drugs 0.000 description 1
- 230000003100 immobilizing effect Effects 0.000 description 1
- 230000000984 immunochemical effect Effects 0.000 description 1
- 230000006054 immunological memory Effects 0.000 description 1
- 238000010348 incorporation Methods 0.000 description 1
- 239000000411 inducer Substances 0.000 description 1
- 230000006698 induction Effects 0.000 description 1
- 230000002458 infectious effect Effects 0.000 description 1
- 108700010900 influenza virus proteins Proteins 0.000 description 1
- 230000004941 influx Effects 0.000 description 1
- 239000004615 ingredient Substances 0.000 description 1
- 230000005764 inhibitory process Effects 0.000 description 1
- 229910052500 inorganic mineral Inorganic materials 0.000 description 1
- 229920000592 inorganic polymer Polymers 0.000 description 1
- 229960003786 inosine Drugs 0.000 description 1
- 238000003780 insertion Methods 0.000 description 1
- 230000037431 insertion Effects 0.000 description 1
- 230000003993 interaction Effects 0.000 description 1
- 229940079322 interferon Drugs 0.000 description 1
- 238000007912 intraperitoneal administration Methods 0.000 description 1
- 238000001990 intravenous administration Methods 0.000 description 1
- 238000010253 intravenous injection Methods 0.000 description 1
- 230000001788 irregular Effects 0.000 description 1
- 238000002955 isolation Methods 0.000 description 1
- 239000004816 latex Substances 0.000 description 1
- 229920000126 latex Polymers 0.000 description 1
- 108010044311 leucyl-glycyl-glycine Proteins 0.000 description 1
- 108010047926 leucyl-lysyl-tyrosine Proteins 0.000 description 1
- 108010000761 leucylarginine Proteins 0.000 description 1
- 208000032839 leukemia Diseases 0.000 description 1
- 238000001638 lipofection Methods 0.000 description 1
- 229920006008 lipopolysaccharide Polymers 0.000 description 1
- 239000012669 liquid formulation Substances 0.000 description 1
- 210000005229 liver cell Anatomy 0.000 description 1
- 210000001165 lymph node Anatomy 0.000 description 1
- 210000004698 lymphocyte Anatomy 0.000 description 1
- 108010044348 lysyl-glutamyl-aspartic acid Proteins 0.000 description 1
- 108010057952 lysyl-phenylalanyl-lysine Proteins 0.000 description 1
- 239000000594 mannitol Substances 0.000 description 1
- 235000010355 mannitol Nutrition 0.000 description 1
- 230000007246 mechanism Effects 0.000 description 1
- 239000002609 medium Substances 0.000 description 1
- 238000002844 melting Methods 0.000 description 1
- 230000008018 melting Effects 0.000 description 1
- 229910052752 metalloid Inorganic materials 0.000 description 1
- 150000002738 metalloids Chemical class 0.000 description 1
- 150000002739 metals Chemical class 0.000 description 1
- 108010085203 methionylmethionine Proteins 0.000 description 1
- 108010068488 methionylphenylalanine Proteins 0.000 description 1
- 238000002493 microarray Methods 0.000 description 1
- 239000003094 microcapsule Substances 0.000 description 1
- 238000000520 microinjection Methods 0.000 description 1
- 239000011859 microparticle Substances 0.000 description 1
- 239000004005 microsphere Substances 0.000 description 1
- 230000003278 mimic effect Effects 0.000 description 1
- 239000011707 mineral Substances 0.000 description 1
- 230000000116 mitigating effect Effects 0.000 description 1
- 238000002156 mixing Methods 0.000 description 1
- 239000003068 molecular probe Substances 0.000 description 1
- 238000012544 monitoring process Methods 0.000 description 1
- 210000001616 monocyte Anatomy 0.000 description 1
- 150000002772 monosaccharides Chemical class 0.000 description 1
- 125000004573 morpholin-4-yl group Chemical group N1(CCOCC1)* 0.000 description 1
- 238000002887 multiple sequence alignment Methods 0.000 description 1
- 210000004165 myocardium Anatomy 0.000 description 1
- 239000013642 negative control Substances 0.000 description 1
- 230000007935 neutral effect Effects 0.000 description 1
- 229920001220 nitrocellulos Polymers 0.000 description 1
- 229910052757 nitrogen Inorganic materials 0.000 description 1
- 230000009871 nonspecific binding Effects 0.000 description 1
- 238000003499 nucleic acid array Methods 0.000 description 1
- 239000003921 oil Substances 0.000 description 1
- 238000002966 oligonucleotide array Methods 0.000 description 1
- 238000002515 oligonucleotide synthesis Methods 0.000 description 1
- 238000011369 optimal treatment Methods 0.000 description 1
- 238000005457 optimization Methods 0.000 description 1
- 150000007524 organic acids Chemical class 0.000 description 1
- 235000005985 organic acids Nutrition 0.000 description 1
- 229920000620 organic polymer Polymers 0.000 description 1
- 229960003104 ornithine Drugs 0.000 description 1
- 230000036961 partial effect Effects 0.000 description 1
- 229940111202 pepsin Drugs 0.000 description 1
- 230000000737 periodic effect Effects 0.000 description 1
- 210000005259 peripheral blood Anatomy 0.000 description 1
- 239000011886 peripheral blood Substances 0.000 description 1
- 210000001322 periplasm Anatomy 0.000 description 1
- 102000013415 peroxidase activity proteins Human genes 0.000 description 1
- 108040007629 peroxidase activity proteins Proteins 0.000 description 1
- 239000008194 pharmaceutical composition Substances 0.000 description 1
- 239000012071 phase Substances 0.000 description 1
- COLNVLDHVKWLRT-UHFFFAOYSA-N phenylalanine Natural products OC(=O)C(N)CC1=CC=CC=C1 COLNVLDHVKWLRT-UHFFFAOYSA-N 0.000 description 1
- 239000008363 phosphate buffer Substances 0.000 description 1
- 239000007981 phosphate-citrate buffer Substances 0.000 description 1
- 150000008104 phosphatidylethanolamines Chemical class 0.000 description 1
- 150000003904 phospholipids Chemical class 0.000 description 1
- 229920001200 poly(ethylene-vinyl acetate) Polymers 0.000 description 1
- 229920000747 poly(lactic acid) Polymers 0.000 description 1
- 229920002401 polyacrylamide Polymers 0.000 description 1
- 230000008488 polyadenylation Effects 0.000 description 1
- 229920001515 polyalkylene glycol Polymers 0.000 description 1
- 229920000447 polyanionic polymer Polymers 0.000 description 1
- 229920000728 polyester Polymers 0.000 description 1
- 229920000573 polyethylene Polymers 0.000 description 1
- 229920005862 polyol Polymers 0.000 description 1
- 150000003077 polyols Chemical class 0.000 description 1
- 229920006324 polyoxymethylene Polymers 0.000 description 1
- 229920001296 polysiloxane Polymers 0.000 description 1
- 229920000136 polysorbate Polymers 0.000 description 1
- 229920000036 polyvinylpyrrolidone Polymers 0.000 description 1
- 239000001267 polyvinylpyrrolidone Substances 0.000 description 1
- 235000013855 polyvinylpyrrolidone Nutrition 0.000 description 1
- 239000013641 positive control Substances 0.000 description 1
- 230000003389 potentiating effect Effects 0.000 description 1
- 238000001556 precipitation Methods 0.000 description 1
- 239000013615 primer Substances 0.000 description 1
- 239000002987 primer (paints) Substances 0.000 description 1
- 230000001566 pro-viral effect Effects 0.000 description 1
- 108010020432 prolyl-prolylisoleucine Proteins 0.000 description 1
- 108010031719 prolyl-serine Proteins 0.000 description 1
- 108010070643 prolylglutamic acid Proteins 0.000 description 1
- 108010029020 prolylglycine Proteins 0.000 description 1
- 108010090894 prolylleucine Proteins 0.000 description 1
- 230000000644 propagated effect Effects 0.000 description 1
- 230000000069 prophylactic effect Effects 0.000 description 1
- 125000001436 propyl group Chemical group [H]C([*])([H])C([H])([H])C([H])([H])[H] 0.000 description 1
- 238000001243 protein synthesis Methods 0.000 description 1
- 210000001938 protoplast Anatomy 0.000 description 1
- 239000010453 quartz Substances 0.000 description 1
- 230000009257 reactivity Effects 0.000 description 1
- 108010043277 recombinant soluble CD4 Proteins 0.000 description 1
- 230000006798 recombination Effects 0.000 description 1
- 238000005215 recombination Methods 0.000 description 1
- 230000003362 replicative effect Effects 0.000 description 1
- 108091008146 restriction endonucleases Proteins 0.000 description 1
- 238000012552 review Methods 0.000 description 1
- 239000002336 ribonucleotide Substances 0.000 description 1
- 125000002652 ribonucleotide group Chemical group 0.000 description 1
- 238000005096 rolling process Methods 0.000 description 1
- 150000003839 salts Chemical class 0.000 description 1
- 229930182490 saponin Natural products 0.000 description 1
- 235000017709 saponins Nutrition 0.000 description 1
- 150000007949 saponins Chemical class 0.000 description 1
- 229940043230 sarcosine Drugs 0.000 description 1
- 238000012216 screening Methods 0.000 description 1
- 238000004062 sedimentation Methods 0.000 description 1
- 230000011218 segmentation Effects 0.000 description 1
- 239000004065 semiconductor Substances 0.000 description 1
- SQVRNKJHWKZAKO-OQPLDHBCSA-N sialic acid Chemical compound CC(=O)N[C@@H]1[C@@H](O)C[C@@](O)(C(O)=O)OC1[C@H](O)[C@H](O)CO SQVRNKJHWKZAKO-OQPLDHBCSA-N 0.000 description 1
- 150000004760 silicates Chemical class 0.000 description 1
- VYPSYNLAJGMNEJ-UHFFFAOYSA-N silicon dioxide Inorganic materials O=[Si]=O VYPSYNLAJGMNEJ-UHFFFAOYSA-N 0.000 description 1
- 239000002356 single layer Substances 0.000 description 1
- 210000002027 skeletal muscle Anatomy 0.000 description 1
- 239000011734 sodium Substances 0.000 description 1
- 229910052708 sodium Inorganic materials 0.000 description 1
- 239000000600 sorbitol Substances 0.000 description 1
- 230000007480 spreading Effects 0.000 description 1
- 238000003892 spreading Methods 0.000 description 1
- 239000007858 starting material Substances 0.000 description 1
- 238000007619 statistical method Methods 0.000 description 1
- 238000007920 subcutaneous administration Methods 0.000 description 1
- 238000010254 subcutaneous injection Methods 0.000 description 1
- 239000007929 subcutaneous injection Substances 0.000 description 1
- 150000005846 sugar alcohols Chemical class 0.000 description 1
- 239000006228 supernatant Substances 0.000 description 1
- 239000013589 supplement Substances 0.000 description 1
- 208000010648 susceptibility to HIV infection Diseases 0.000 description 1
- 230000002459 sustained effect Effects 0.000 description 1
- 230000002194 synthesizing effect Effects 0.000 description 1
- 229940118376 tetanus toxin Drugs 0.000 description 1
- YSMODUONRAFBET-WHFBIAKZSA-N threo-5-hydroxy-L-lysine Chemical compound NC[C@@H](O)CC[C@H](N)C(O)=O YSMODUONRAFBET-WHFBIAKZSA-N 0.000 description 1
- 108010031491 threonyl-lysyl-glutamic acid Proteins 0.000 description 1
- FGMPLJWBKKVCDB-UHFFFAOYSA-N trans-L-hydroxy-proline Natural products ON1CCCC1C(O)=O FGMPLJWBKKVCDB-UHFFFAOYSA-N 0.000 description 1
- 230000005030 transcription termination Effects 0.000 description 1
- 238000010361 transduction Methods 0.000 description 1
- 230000026683 transduction Effects 0.000 description 1
- 238000011426 transformation method Methods 0.000 description 1
- 230000009261 transgenic effect Effects 0.000 description 1
- 238000011282 treatment Methods 0.000 description 1
- 108010038745 tryptophylglycine Proteins 0.000 description 1
- 108010044292 tryptophyltyrosine Proteins 0.000 description 1
- 230000029069 type 2 immune response Effects 0.000 description 1
- 108010087967 type I signal peptidase Proteins 0.000 description 1
- 241001529453 unidentified herpesvirus Species 0.000 description 1
- 108010043941 valyl-glutamyl-isoleucyl-asparaginyl-cysteinyl-threonyl-arginine Proteins 0.000 description 1
- 108010009962 valyltyrosine Proteins 0.000 description 1
- 239000003981 vehicle Substances 0.000 description 1
- 230000029812 viral genome replication Effects 0.000 description 1
- 230000009385 viral infection Effects 0.000 description 1
- 238000012800 visualization Methods 0.000 description 1
- 238000005406 washing Methods 0.000 description 1
- XLYOFNOQVPJJNP-UHFFFAOYSA-N water Substances O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 description 1
- DGVVWUTYPXICAM-UHFFFAOYSA-N β‐Mercaptoethanol Chemical compound OCCS DGVVWUTYPXICAM-UHFFFAOYSA-N 0.000 description 1
Images
Classifications
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K14/00—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
- C07K14/005—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from viruses
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61K—PREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
- A61K39/00—Medicinal preparations containing antigens or antibodies
- A61K39/12—Viral antigens
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61K—PREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
- A61K39/00—Medicinal preparations containing antigens or antibodies
- A61K39/12—Viral antigens
- A61K39/21—Retroviridae, e.g. equine infectious anemia virus
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Q—MEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
- C12Q1/00—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
- C12Q1/70—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving virus or bacteriophage
-
- G—PHYSICS
- G01—MEASURING; TESTING
- G01N—INVESTIGATING OR ANALYSING MATERIALS BY DETERMINING THEIR CHEMICAL OR PHYSICAL PROPERTIES
- G01N33/00—Investigating or analysing materials by specific methods not covered by groups G01N1/00 - G01N31/00
- G01N33/48—Biological material, e.g. blood, urine; Haemocytometers
- G01N33/50—Chemical analysis of biological material, e.g. blood, urine; Testing involving biospecific ligand binding methods; Immunological testing
- G01N33/53—Immunoassay; Biospecific binding assay; Materials therefor
- G01N33/569—Immunoassay; Biospecific binding assay; Materials therefor for microorganisms, e.g. protozoa, bacteria, viruses
- G01N33/56983—Viruses
- G01N33/56988—HIV or HTLV
-
- G—PHYSICS
- G01—MEASURING; TESTING
- G01N—INVESTIGATING OR ANALYSING MATERIALS BY DETERMINING THEIR CHEMICAL OR PHYSICAL PROPERTIES
- G01N33/00—Investigating or analysing materials by specific methods not covered by groups G01N1/00 - G01N31/00
- G01N33/48—Biological material, e.g. blood, urine; Haemocytometers
- G01N33/50—Chemical analysis of biological material, e.g. blood, urine; Testing involving biospecific ligand binding methods; Immunological testing
- G01N33/68—Chemical analysis of biological material, e.g. blood, urine; Testing involving biospecific ligand binding methods; Immunological testing involving proteins, peptides or amino acids
- G01N33/6854—Immunoglobulins
-
- G—PHYSICS
- G01—MEASURING; TESTING
- G01N—INVESTIGATING OR ANALYSING MATERIALS BY DETERMINING THEIR CHEMICAL OR PHYSICAL PROPERTIES
- G01N33/00—Investigating or analysing materials by specific methods not covered by groups G01N1/00 - G01N31/00
- G01N33/48—Biological material, e.g. blood, urine; Haemocytometers
- G01N33/50—Chemical analysis of biological material, e.g. blood, urine; Testing involving biospecific ligand binding methods; Immunological testing
- G01N33/68—Chemical analysis of biological material, e.g. blood, urine; Testing involving biospecific ligand binding methods; Immunological testing involving proteins, peptides or amino acids
- G01N33/6893—Chemical analysis of biological material, e.g. blood, urine; Testing involving biospecific ligand binding methods; Immunological testing involving proteins, peptides or amino acids related to diseases not provided for elsewhere
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61K—PREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
- A61K39/00—Medicinal preparations containing antigens or antibodies
- A61K2039/51—Medicinal preparations containing antigens or antibodies comprising whole cells, viruses or DNA/RNA
- A61K2039/53—DNA (RNA) vaccination
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61K—PREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
- A61K39/00—Medicinal preparations containing antigens or antibodies
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K2319/00—Fusion polypeptide
- C07K2319/01—Fusion polypeptide containing a localisation/targetting motif
- C07K2319/02—Fusion polypeptide containing a localisation/targetting motif containing a signal sequence
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K2319/00—Fusion polypeptide
- C07K2319/40—Fusion polypeptide containing a tag for immunodetection, or an epitope for immunisation
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K2319/00—Fusion polypeptide
- C07K2319/40—Fusion polypeptide containing a tag for immunodetection, or an epitope for immunisation
- C07K2319/43—Fusion polypeptide containing a tag for immunodetection, or an epitope for immunisation containing a FLAG-tag
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N2740/00—Reverse transcribing RNA viruses
- C12N2740/00011—Details
- C12N2740/10011—Retroviridae
- C12N2740/15011—Lentivirus, not HIV, e.g. FIV, SIV
- C12N2740/15022—New viral proteins or individual genes, new structural or functional aspects of known viral proteins or genes
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N2740/00—Reverse transcribing RNA viruses
- C12N2740/00011—Details
- C12N2740/10011—Retroviridae
- C12N2740/16011—Human Immunodeficiency Virus, HIV
- C12N2740/16111—Human Immunodeficiency Virus, HIV concerning HIV env
- C12N2740/16122—New viral proteins or individual genes, new structural or functional aspects of known viral proteins or genes
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N2740/00—Reverse transcribing RNA viruses
- C12N2740/00011—Details
- C12N2740/10011—Retroviridae
- C12N2740/16011—Human Immunodeficiency Virus, HIV
- C12N2740/16111—Human Immunodeficiency Virus, HIV concerning HIV env
- C12N2740/16134—Use of virus or viral component as vaccine, e.g. live-attenuated or inactivated virus, VLP, viral protein
Landscapes
- Health & Medical Sciences (AREA)
- Life Sciences & Earth Sciences (AREA)
- Chemical & Material Sciences (AREA)
- Immunology (AREA)
- Engineering & Computer Science (AREA)
- Molecular Biology (AREA)
- Hematology (AREA)
- Virology (AREA)
- General Health & Medical Sciences (AREA)
- Medicinal Chemistry (AREA)
- Microbiology (AREA)
- Biomedical Technology (AREA)
- Urology & Nephrology (AREA)
- Biochemistry (AREA)
- Proteomics, Peptides & Aminoacids (AREA)
- Physics & Mathematics (AREA)
- Analytical Chemistry (AREA)
- Organic Chemistry (AREA)
- Biotechnology (AREA)
- Pathology (AREA)
- Food Science & Technology (AREA)
- Cell Biology (AREA)
- General Physics & Mathematics (AREA)
- Mycology (AREA)
- Pharmacology & Pharmacy (AREA)
- Epidemiology (AREA)
- Animal Behavior & Ethology (AREA)
- Public Health (AREA)
- Veterinary Medicine (AREA)
- Biophysics (AREA)
- Communicable Diseases (AREA)
- Genetics & Genomics (AREA)
- Tropical Medicine & Parasitology (AREA)
- Gastroenterology & Hepatology (AREA)
- AIDS & HIV (AREA)
- Wood Science & Technology (AREA)
- Zoology (AREA)
- Bioinformatics & Cheminformatics (AREA)
- General Engineering & Computer Science (AREA)
- Peptides Or Proteins (AREA)
Abstract
Description
본 발명은 전반적으로 인간 면역결핍 바이러스 유형 1(HIV-1) 외피 당단백질인 gp120 및/또는 gp160에 대한 면역 반응을 유도시키거나 측정하는데 유용한 면역원성 조성물 분야에 관한 것이다. 특히, 본 발명은 gp120 서열을 포함하는 폴리펩티드 및/또는 이러한 폴리펩티드를 엔코딩하는 폴리누클레오티드 (RNA 또는 DNA)를 포함하는 면역원성 조성물에 관한 것이며, 이러한 조성물은 HIV-1 감염 및/또는 질병(AIDS)의 예방 또는 치료에 유용한 면역 반응을 유도시킨다.The present invention relates generally to the field of immunogenic compositions useful for inducing or measuring immune responses against human immunodeficiency virus type 1 (HIV-1) envelope glycoproteins gp120 and / or gp160. In particular, the present invention relates to immunogenic compositions comprising polypeptides comprising the gp120 sequence and / or polynucleotides (RNA or DNA) encoding such polypeptides, wherein such compositions comprise HIV-1 infection and / or disease (AIDS). Induces an immune response useful for the prevention or treatment of
후천성 면역결핍 증후군 (AIDS)는 인간 면역결핍 바이러스 (HIV)로서 확인된 레트로바이러스에 의해 유발된다. 항체 또는 세포성 반응의 유도를 기초로 하여 방어 면역 반응을 유도하는 백신을 개발하려는 상당한 연구가 있었다. 최근의 연구는 안전상의 이유로 약독되거나 사멸된 바이러스 보다는 HIV 단백질을 백신에서 면역원으로서 사용하는 서브유닛(subunit) 백신을 이용해 왔다. 서브유닛 백신은 일반적으로 비리온 및 바이러스 감염된 세포의 표면상에 디스플레이되는 HIV 외피 단백질의 일부인 gp120을 포함한다. 이와 관련하여, gp120은 유리된 비리온 또는 세포 대 세포 융합에 의해 HIV-1 감염을 매개한다. 둘 모두의 경우에, gp120은 두 개의 세포성 수용체에 결합함으로써 HIV-1 감염을 개시하며, 하나의 수용체는 CD4이고, 나머지 하나의 수용체는 케모카인 수용체 (전형적으로 CCR5 또는 CXCR4)이다. CD4 및 케모카인 수용체에 대한 독특한 고친화성 결합 부위는 gp120 분자의 표면상에 위치해 있다.Acquired immunodeficiency syndrome (AIDS) is caused by a retrovirus identified as human immunodeficiency virus (HIV). There has been considerable research to develop vaccines that induce a protective immune response based on the induction of antibodies or cellular responses. Recent studies have used subunit vaccines that use HIV proteins as immunogens in vaccines rather than attenuated or killed viruses for safety reasons. Subunit vaccines generally include gp120, which is part of an HIV envelope protein displayed on the surface of virion and virus infected cells. In this regard, gp120 mediates HIV-1 infection by free virion or cell-to-cell fusion. In both cases, gp120 initiates HIV-1 infection by binding to two cellular receptors, one receptor is CD4 and the other receptor is a chemokine receptor (typically CCR5 or CXCR4). Unique high affinity binding sites for the CD4 and chemokine receptors are located on the surface of the gp120 molecule.
HIV 외피 단백질은 널리 공지되어 있고, 다수의 HIV 균주로부터의 HIV 외피에 대한 아미노산 및 누클레오티드 서열이 알려져 있다. gp120 분자는 분자의 겉보기 분자량을 120,000 달톤으로 증가시키기 위해 N-결합된 글리코실화에 의해 광범위하게 변형되는 60,000 달톤의 폴리펩티드 코어로 구성되어 있다. 성숙한 HIV-1 외피 단백질인 gp120 및 gp41은 둘 모두 공통의 전구체인 gp160으로부터 유래된다. gp160 전구체는 막-결합된 리보솜상에서 단백질이 합성되도록 유도하고 소포체의 내강(lumen)내로의 초기(nascent) 폴리펩티드의 전위(translocation) (분비 경로)를 보장하는 아미노-말단 시그널 서열을 함유한다. 소포체에서, 시그널 서열은 시그널 펩티다아제에 의해 제거되고, 단백질은 협조번역(cotranslational) 작용으로 "단순한(simple)" 고-만노오스 (high-mannose) 유형의 N-결합된 탄수화물을 획득한다. 그 후, 탄수화물 함유 단백질은 골지체로 수송되고, 여기서 많은 고-만노오스 탄수화물은 "복잡한(complex)" 시알산 함유 탄수화물로 전환된다. 또한, gp160은 보존된 당단백질 프로세싱 부위에서 푸린 또는 푸린 유사 펩티다아제에 의한 단백분해에 의해 gp120/gp41 복합체로 전환된다. 그 후, gp120/gp41 복합체는 세포 표면으로 배출되며, 여기서 이러한 복합체는 삼합체 구조물을 형성하는 것으로 여겨진다. 세포막 또는 비리온막에서, gp120은 비공유 상호작용에 의해 gp41과 결합된 표재성 막 단백질로서 존재한다. 대조적으로, gp41은 카르복실 말단 근처에 위치한 소수성 트랜스멤브레인(transmembrane) 도메인에 의해 막 이중층내에 앵커링(anchoring)된 내재성 막 단백질이다.HIV envelope proteins are well known and amino acid and nucleotide sequences for HIV envelopes from a number of HIV strains are known. The gp120 molecule consists of a 60,000 Dalton polypeptide core that is extensively modified by N-linked glycosylation to increase the apparent molecular weight of the molecule to 120,000 Daltons. The mature HIV-1 envelope proteins gp120 and gp41 are both derived from the common precursor gp160. The gp160 precursor contains an amino-terminal signal sequence that induces protein synthesis on membrane-bound ribosomes and ensures the translocation (secretory pathway) of the nascent polypeptide into the lumen of the endoplasmic reticulum. In the endoplasmic reticulum, the signal sequence is removed by signal peptidase, and the protein obtains a "simple" high-mannose type of N-linked carbohydrates by cotranslational action. The carbohydrate-containing protein is then transported to the Golgi apparatus, where many high-mannose carbohydrates are converted to "complex" sialic acid-containing carbohydrates. In addition, gp160 is converted to the gp120 / gp41 complex by proteolysis with purine or purine-like peptidase at the conserved glycoprotein processing site. The gp120 / gp41 complex then exits the cell surface, where it is believed to form a trimer structure. In cell membranes or virion membranes, gp120 exists as superficial membrane protein bound to gp41 by non-covalent interactions. In contrast, gp41 is an intrinsic membrane protein anchored in the membrane bilayer by a hydrophobic transmembrane domain located near the carboxyl terminus.
gp120의 아미노산 서열은 5개의 과가변성 도메인 (V1-V5)이 산재되어 있는 5개의 비교적 보존된 도메인 (C1-C5)을 함유한다. gp120 주서열 중의 18개 시스테인 잔기의 위치 및 gp120 서열 중의 약 24개의 N-결합된 글리코실화 부위의 위치는 대부분의 gp120 서열에서 보존된다. 과가변성 도메인은 광범위한 아미노산 치환, 삽입 및 결실을 함유한다. 이들 도메인에서의 서열 변이는 다양한 바이러스 단리물로부터의 gp120 분자들 사이의 30% 이하의 전체 서열 가변성을 설명해준다. 이러한 변이에도 불구하고, 모든 gp120 서열은 gp41과 상호작용하고 CD4 및 케모카인 (CCR5 및 CXCR4) 수용체에 결합하여 바이러스 및 숙주 세포막의 융합을 유도할 수 있는 능력을 보유한다.The amino acid sequence of gp120 contains five relatively conserved domains (C1-C5) interspersed with five hypervariable domains (V1-V5). The position of 18 cysteine residues in the gp120 sequence and the position of about 24 N-linked glycosylation sites in the gp120 sequence are conserved in most gp120 sequences. Hypervariable domains contain a wide range of amino acid substitutions, insertions and deletions. Sequence variations in these domains account for up to 30% overall sequence variability between gp120 molecules from various viral isolates. Despite this variation, all gp120 sequences retain the ability to interact with gp41 and bind to CD4 and chemokine (CCR5 and CXCR4) receptors to induce fusion of viral and host cell membranes.
발명의 개요Summary of the Invention
본 발명은 단리된 폴리펩티드 또는 단리된 폴리누클레오티드 및 약제학적으로 허용되는 담체를 포함하는 면역원성 조성물을 제공한다. 단리된 폴리펩티드는 제 1 gp120 아미노산 서열을 포함하거나 단리된 폴리누클레오티드는 이러한 제 1 gp120 아미노산 서열을 엔코딩하며, 제 1 gp120 서열이 gp120의 V2, V3 및 C4 도메인을 적어도 포함하며, HIV gp120의 HXB-2 균주로부터의 gp120의 N-말단 메티오닌으로부터 넘버링하여 (i) 제 1 gp120 서열은 54번, 74번, 119번, 126번, 131번, 157번, 196번, 205번, 218번, 228번, 239번, 247번, 296번, 331번, 378번, 385번, 418번 및 445번 중 하나 이상의 위치에서 하나 이상의 시스테인 잔기를 결여하고/하거나, (ii) 제 1 gp120 서열은 24번, 29번, 34번, 54번, 74번, 119번, 126번, 131번, 157번, 196번, 205번, 218번, 228번, 239번, 247번, 296번, 331번, 378번, 385번, 418번, 445번, 493번, 495번, 499번 내지 501번, 503번 내지 508번 및 510번 이외의 위치에서 하나 이상의 추가의 시스테인 잔기를 포함한다. 그러나, 제 1 gp120 서열은 V1 도메인에서 하나 이상의 추가의 시스테인을 지닌 서브타입 G gp120 서열 또는 V4 도메인에서 하나 이상의 추가의 시스테인을 지닌 서브타입 E gp120 서열이 아니다. 바람직한 구체예에서, 제 1 gp120 서열은 V1 도메인을 추가로 포함한다.The present invention provides an immunogenic composition comprising an isolated polypeptide or isolated polynucleotide and a pharmaceutically acceptable carrier. The isolated polypeptide comprises a first gp120 amino acid sequence or an isolated polynucleotide encodes this first gp120 amino acid sequence, wherein the first gp120 sequence comprises at least the V2, V3, and C4 domains of gp120, and the HXB- of HIV gp120 Numbered from the N-terminal methionine of gp120 from two strains, (i) the first gp120 sequence was 54, 74, 119, 126, 131, 157, 196, 205, 218, 228 , At least one cysteine residue at one or more of
한 가지 구체예에서, 제 1 gp120 서열은 54번, 74번, 119번, 126번, 157번, 205번, 218번, 228번, 239번, 247번, 331번, 378번 또는 385번 중 하나 이상의 위치에서 하나 이상의 시스테인 잔기를 결여한다.In one embodiment, the first gp120 sequence is selected from 54, 74, 119, 126, 157, 205, 218, 228, 239, 247, 331, 378, or 385. It lacks one or more cysteine residues in one or more positions.
또 다른 구체예에서, 제 1 gp120 서열은 54번, 74번, 119번, 126번, 131번, 157번, 196번, 205번, 218번, 228번, 239번, 247번, 296번, 331번, 378번, 385번, 418번 또는 445번 이외의 위치에서 하나 이상의 추가의 시스테인 잔기를 포함하며, 단, 하나 이상의 추가의 시스테인 잔기는 gp120의 V1 도메인에 존재하지 않는다.In another embodiment, the first gp120 sequence is 54, 74, 119, 126, 131, 157, 196, 205, 218, 228, 239, 247, 296, One or more additional cysteine residues at positions other than 331, 378, 385, 418, or 445, provided that one or more additional cysteine residues are not present in the V1 domain of gp120.
제 1 gp120 서열은 천연 gp120 서열, 바람직하게는 HIV의 1차 단리물로부터의 gp120 서열을 포함한다. 바람직한 구체예에서, 제 1 gp120 서열은 SEQ ID NO: 2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 32, 34, 36, 38, 40, 42, 44, 46, 48, 50, 52, 54, 56, 58, 60, 62, 64, 66, 68, 70, 72, 74, 76, 78, 80, 82, 84, 86, 88, 90, 92, 94, 96, 98, 100, 102, 104, 106, 108, 110, 112, 114, 116, 118, 120, 122, 124, 126, 128, 130, 132, 134 및 136로 구성된 군으로부터 선택된 gp120의 V1, V2, V3 및 C4 도메인과 각각 약 99% 이상의 동일성을 지닌다. 더욱 바람직한 구체예에서, 제 1 gp120 서열은 SEQ ID NO: 2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 32, 34, 36, 38, 40, 42, 44, 46, 48, 50, 52, 54, 56, 58, 60, 62, 64, 66, 68, 70, 72, 74, 76, 78, 80, 82, 84, 86, 88, 90, 92, 94, 96, 98, 100, 102, 104, 106, 108, 110, 112, 114, 116, 118, 120, 122, 124, 126, 128, 130, 132, 134 및 136로 구성된 군으로부터 선택된 gp120의 V1, V2, V3 및 C4 도메인을 적어도 포함한다.The first gp120 sequence comprises a native gp120 sequence, preferably the gp120 sequence from the primary isolate of HIV. In a preferred embodiment, the first gp120 sequence comprises SEQ ID NOs: 2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 32, 34, 36, 38, 40, 42, 44, 46, 48, 50, 52, 54, 56, 58, 60, 62, 64, 66, 68, 70, 72, 74, 76, 78, 80, 82, 84, 86, 88, 90, 92, 94, 96, 98, 100, 102, 104, 106, 108, 110, 112, 114, 116, 118, 120, 122, 124, 126, 128, 130, 132, 134 and 136 Each have at least about 99% identity with the V1, V2, V3, and C4 domains of gp120 selected from the group consisting of: In a more preferred embodiment, the first gp120 sequence has SEQ ID NO: 2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 32, 34, 36 , 38, 40, 42, 44, 46, 48, 50, 52, 54, 56, 58, 60, 62, 64, 66, 68, 70, 72, 74, 76, 78, 80, 82, 84, 86 , 88, 90, 92, 94, 96, 98, 100, 102, 104, 106, 108, 110, 112, 114, 116, 118, 120, 122, 124, 126, 128, 130, 132, 134 and 136 At least V1, V2, V3 and C4 domains of gp120 selected from the group consisting of:
바람직한 구체예에서, 제 1 gp120 서열은 홀수의 시스테인을 포함한다. 이러한 구체예의 변형례에서, 면역원성 조성물은 제 1 gp120 서열을 포함하는 폴리펩티드를 포함하며, 제 1 gp120 서열 중의 시스테인은, 바람직하게는 이황화물 결합에 의해, 또 다른 폴리펩티드와 공유 결합된다. 다른 폴리펩티드는 제 2 gp120 서열을 포함할 수 있다. 제 2 gp120 서열이 제 1 gp120 서열과 동일한 경우, 제 1 및 제 2 gp120 서열은 동종이합체(homodimer)를 형성한다. 제 2 gp120 서열이 제 1 gp120 서열과 상이한 경우, 제 1 및 제 2 gp120 서열은 이종이합체(heterodimer)를 형성한다. 다른 폴리펩티드는 추가로 또는 대안적으로 gp41 아미노산 서열을 포함할 수 있다.In a preferred embodiment, the first gp120 sequence comprises odd cysteines. In a variation of this embodiment, the immunogenic composition comprises a polypeptide comprising a first gp120 sequence, wherein the cysteine in the first gp120 sequence is covalently linked to another polypeptide, preferably by disulfide bonds. The other polypeptide may comprise a second gp120 sequence. If the second gp120 sequence is identical to the first gp120 sequence, the first and second gp120 sequences form a homodimer. If the second gp120 sequence is different from the first gp120 sequence, the first and second gp120 sequences form a heterodimer. Other polypeptides may additionally or alternatively comprise gp41 amino acid sequences.
또한, 본 발명은 제 1 gp120 서열을 포함하는 폴리펩티드를 포함하는 면역원성 조성물을 제공하며, 여기서 gp120 서열 중의 시스테인은 세포-특이적 결합 부 분, 약물, 면역자극 올리누클레오티드 (예를 들어, CpG) 및 면역원성 담체 단백질로 구성된 군으로부터 선택된 작용제와 공유 결합되어 있다.The present invention also provides an immunogenic composition comprising a polypeptide comprising a first gp120 sequence, wherein the cysteine in the gp120 sequence is cell-specific binding moiety, drug, immunostimulatory oligonucleotide (eg, CpG) And an agent selected from the group consisting of immunogenic carrier proteins.
특정 구체예에서, 면역원성 조성물은 제 1 gp120 서열을 포함하는 융합 폴리펩티드를 포함하는 폴리펩티드 또는 이러한 융합 폴리펩티드를 엔코딩하는 폴리누클레오티드를 포함한다.In certain embodiments, an immunogenic composition comprises a polypeptide comprising a fusion polypeptide comprising a first gp120 sequence or a polynucleotide encoding such a fusion polypeptide.
면역원성 조성물은 요망되는 경우 애쥬번트를 추가로 포함할 수 있다.The immunogenic composition may further comprise an adjuvant if desired.
또한, 본 발명은 제 1 gp120 아미노산 서열을 포함하는 단리된 폴리펩티드를 제공하며, 제 1 gp120 서열이 gp120의 V2, V3 및 C4 도메인을 적어도 포함하며, HIV gp120의 HXB-2 균주로부터의 gp120의 N-말단 메티오닌으로부터 넘버링하여 (a) 제 1 gp120 서열은 54번, 74번, 119번, 126번, 157번, 205번, 218번, 228번, 239번, 247번, 331번, 378번 또는 385번 중 하나 이상의 위치에서 하나 이상의 시스테인 잔기를 결여하고/하거나, (b) 제 1 gp120 서열은 54번, 74번, 119번, 126번, 131번, 157번, 196번, 205번, 218번, 228번, 239번, 247번, 296번, 331번, 378번, 385번, 418번 또는 445번 이외의 위치에서 하나 이상의 추가의 시스테인 잔기를 포함하며, 단, 하나 이상의 추가의 시스테인 잔기는 gp120의 V1 도메인에 존재하지 않는다. 그러나, 제 1 gp120 서열은 V4 도메인에서 하나 이상의 추가의 시스테인을 지닌 서브타입 E gp120 서열이 아니다. 바람직한 구체예에서, 제 1 gp120 서열은 V1 도메인을 추가로 포함한다.The present invention also provides an isolated polypeptide comprising a first gp120 amino acid sequence, wherein the first gp120 sequence comprises at least the V2, V3 and C4 domains of gp120, and the N of gp120 from the HXB-2 strain of HIV gp120. Numbering from terminal methionine (a) the first gp120 sequence is 54, 74, 119, 126, 157, 205, 218, 228, 239, 247, 331, 378 or Lack one or more cysteine residues at one or more positions of 385, and / or (b) the first gp120 sequence is 54, 74, 119, 126, 131, 157, 196, 205, 218 At least one additional cysteine residue at a position other than 228, 239, 247, 296, 331, 378, 385, 418 or 445, provided that at least one additional cysteine residue Is not present in the V1 domain of gp120. However, the first gp120 sequence is not a subtype E gp120 sequence with one or more additional cysteines in the V4 domain. In a preferred embodiment, the first gp120 sequence further comprises a V1 domain.
하나의 구체예에서, 단리된 폴리펩티드는 54번, 74번, 119번, 126번, 157번, 205번, 218번, 228번, 239번, 247번, 331번, 378번 또는 385번 중 하나 이상의 위 치에서 하나 이상의 시스테인 잔기가 결여된 제 1 gp120 서열을 포함한다. 또 다른 구체예에서, 단리된 폴리펩티드는 54번, 74번, 119번, 126번, 131번, 157번, 196번, 205번, 218번, 228번, 239번, 247번, 296번, 331번, 378번, 385번, 418번 또는 445번 이외의 위치에서 하나 이상의 추가의 시스테인 잔기를 포함하는 제 1 gp120 서열을 포함하며, 단, 하나 이상의 추가의 시스테인 잔기는 gp120의 V1 도메인에 존재하지 않는다.In one embodiment, the isolated polypeptide is one of 54, 74, 119, 126, 157, 205, 218, 228, 239, 247, 331, 378 or 385. At least one position comprising a first gp120 sequence lacking at least one cysteine residue. In another embodiment, the isolated polypeptide is 54, 74, 119, 126, 131, 157, 196, 205, 218, 228, 239, 247, 296, 331 A first gp120 sequence comprising one or more additional cysteine residues at positions other than
바람직한 구체예에서, 단리된 폴리펩티드는 SEQ ID NO: 2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 32, 34, 36, 38, 40, 42, 44, 46, 48, 50, 52, 54, 56, 58, 60, 62, 64, 66, 68, 70, 72, 74, 76, 78, 80, 82, 84, 86, 88, 90, 92, 94, 96, 98, 100, 102, 104, 106, 108, 110, 112, 114, 116, 118, 120, 122, 124, 126, 128, 130, 132, 134 및 136으로 구성된 군으로부터 선택된 gp120의 V1, V2, V3 및 C4 도메인과 각각 약 99% 이상의 동일성을 지니는 제 1 gp120 아미노산 서열을 포함한다. 더욱 바람직한 구체예에서, 제 1 gp120 서열은 SEQ ID NO: 2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 32, 34, 36, 38, 40, 42, 44, 46, 48, 50, 52, 54, 56, 58, 60, 62, 64, 66, 68, 70, 72, 74, 76, 78, 80, 82, 84, 86, 88, 90, 92, 94, 96, 98, 100, 102, 104, 106, 108, 110, 112, 114, 116, 118, 120, 122, 124, 126, 128, 130, 132, 134 및 136으로 구성된 군으로부터 선택된 gp120의 V1, V2, V3 및 C4 도메인을 적어도 포함한다.In a preferred embodiment, the isolated polypeptide is SEQ ID NO: 2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 32, 34, 36, 38 , 40, 42, 44, 46, 48, 50, 52, 54, 56, 58, 60, 62, 64, 66, 68, 70, 72, 74, 76, 78, 80, 82, 84, 86, 88 , 90, 92, 94, 96, 98, 100, 102, 104, 106, 108, 110, 112, 114, 116, 118, 120, 122, 124, 126, 128, 130, 132, 134 and 136 A first gp120 amino acid sequence having at least about 99% identity with each of the V1, V2, V3, and C4 domains of gp120 selected from the group. In a more preferred embodiment, the first gp120 sequence has SEQ ID NO: 2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 32, 34, 36 , 38, 40, 42, 44, 46, 48, 50, 52, 54, 56, 58, 60, 62, 64, 66, 68, 70, 72, 74, 76, 78, 80, 82, 84, 86 , 88, 90, 92, 94, 96, 98, 100, 102, 104, 106, 108, 110, 112, 114, 116, 118, 120, 122, 124, 126, 128, 130, 132, 134 and 136 At least V1, V2, V3 and C4 domains of gp120 selected from the group consisting of:
바람직한 구체예에서, 단리된 폴리펩티드는 홀수의 시스테인을 지닌 제 1 gp120 서열을 포함한다. 이러한 구체예의 변형례에서, 제 1 gp120 서열 중의 시스 테인은, 바람직하게는 이황화물 결합을 통해, 또 다른 폴리펩티드와 공유 결합되어 있다. 다른 폴리펩티드는 제 2 gp120 서열을 포함할 수 있다. 제 2 gp120 서열이 제 1 gp120 서열과 동일한 경우, 제 1 및 제 2 gp120 서열은 동종이합체를 형성한다. 제 2 gp120 서열이 제 1 gp120 서열과 상이한 경우, 제 1 및 제 2 gp120 서열은 이종이합체를 형성한다. 다른 폴리펩티드는 추가로 또는 대안적으로 gp41 아미노산 서열을 포함할 수 있다.In a preferred embodiment, the isolated polypeptide comprises a first gp120 sequence with an odd cysteine. In a variant of this embodiment, the cysteine in the first gp120 sequence is covalently linked with another polypeptide, preferably via disulfide bonds. The other polypeptide may comprise a second gp120 sequence. If the second gp120 sequence is identical to the first gp120 sequence, the first and second gp120 sequences form a homodimer. If the second gp120 sequence is different from the first gp120 sequence, the first and second gp120 sequences form a heterodimer. Other polypeptides may additionally or alternatively comprise gp41 amino acid sequences.
또한, 본 발명은 gp120 서열 중의 시스테인이 세포-특이적 결합 부분, 약물, 면역자극 올리고누클레오티드 및 면역원성 담체 단백질로 구성된 군으로부터 선택된 작용제와 공유 결합되어 있는 단리된 폴리펩티드를 제공한다.The invention also provides an isolated polypeptide wherein the cysteine in the gp120 sequence is covalently linked with an agent selected from the group consisting of cell-specific binding moieties, drugs, immunostimulatory oligonucleotides and immunogenic carrier proteins.
특정 구체예에서, 단리된 폴리펩티드는 제 1 gp120 서열을 포함하는 융합 폴리펩티드를 포함한다. 융합 폴리펩티드는 이종 시그널 서열, 예를 들어 단순 포진 바이러스 당단백질 D (gD-1) 시그널 서열 또는 인간 조직 플라스미노겐 활성인자 시그널 서열을 포함할 수 있다. 융합 폴리펩티드는 추가로 또는 대안적으로 에피토프 태그(tag)를 포함할 수 있다.In certain embodiments, isolated polypeptides comprise fusion polypeptides comprising a first gp120 sequence. The fusion polypeptide may comprise a heterologous signal sequence, such as a herpes simplex virus glycoprotein D (gD-1) signal sequence or human tissue plasminogen activator signal sequence. The fusion polypeptide may additionally or alternatively comprise an epitope tag.
또한, 본 발명은 본 발명의 단리된 폴리펩티드 중 어느 하나를 엔코딩하는 단리된 폴리누클레오티드를 제공한다. 엔코딩된 폴리펩티드의 발현에 사용하고자 하는 경우, 폴리누클레오티드는 바람직하게는 특정 종의 숙주 세포에서 발현되도록 코돈 최적화된다. 바람직한 구체예에서, 폴리누클레오티드는 SEQ ID NO: 2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 32, 34, 36, 38, 40, 42, 44, 46, 48, 50, 52, 54, 56, 58, 60, 62, 64, 66, 68, 70, 72, 74, 76, 78, 80, 82, 84, 86, 88, 90, 92, 94, 96, 98, 100, 102, 104, 106, 108, 110, 112, 114, 116, 118, 120, 122, 124, 126, 128, 130, 132, 134 및 136으로 구성된 군으로부터 선택된 gp120의 V1, V2, V3 및 C4 도메인과 각각 약 99% 이상의 동일성을 지니는 gp120 서열을 엔코딩한다. 더욱 바람직한 구체예에서, 폴리누클레오티드는 SEQ ID NO: 1, 3, 5, 7, 9, 11, 13, 15, 17, 19, 21, 23, 25, 27, 29, 31, 33, 35, 37, 39, 41, 43, 45, 47, 49, 51, 53, 55, 57, 59, 61, 63, 65, 67, 69, 71, 73, 75, 77, 79, 81, 83, 85, 87, 89, 91, 93, 95, 97, 99, 101, 103, 105, 107, 109, 111, 113, 115, 117, 119, 121, 123, 125, 127, 129, 131, 133, 135 또는 이들의 서브시퀀스(subsequence)로 구성된 군으로부터 선택된 gp120 누클레오티드 서열을 포함하며, 여기서 서브시퀀스는 gp120의 V1, V2, V3 및 C4를 적어도 엔코딩한다.The invention also provides an isolated polynucleotide encoding any of the isolated polypeptides of the invention. When intended for use in the expression of an encoded polypeptide, the polynucleotide is preferably codon optimized for expression in a host cell of a particular species. In a preferred embodiment, the polynucleotides comprise SEQ ID NOs: 2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 32, 34, 36, 38, 40, 42, 44, 46, 48, 50, 52, 54, 56, 58, 60, 62, 64, 66, 68, 70, 72, 74, 76, 78, 80, 82, 84, 86, 88, Group consisting of 90, 92, 94, 96, 98, 100, 102, 104, 106, 108, 110, 112, 114, 116, 118, 120, 122, 124, 126, 128, 130, 132, 134 and 136 Encode a gp120 sequence having at least about 99% identity with each of the V1, V2, V3, and C4 domains of gp120 selected from. In a more preferred embodiment, the polynucleotide is SEQ ID NO: 1, 3, 5, 7, 9, 11, 13, 15, 17, 19, 21, 23, 25, 27, 29, 31, 33, 35, 37 , 39, 41, 43, 45, 47, 49, 51, 53, 55, 57, 59, 61, 63, 65, 67, 69, 71, 73, 75, 77, 79, 81, 83, 85, 87 , 89, 91, 93, 95, 97, 99, 101, 103, 105, 107, 109, 111, 113, 115, 117, 119, 121, 123, 125, 127, 129, 131, 133, 135 or these And a gp120 nucleotide sequence selected from the group consisting of subsequences of wherein the subsequences at least encode V1, V2, V3, and C4 of gp120.
본 발명의 그 밖의 양태는 본 발명의 폴리누클레오티드 중 어느 하나를 포함하는 벡터 및 이러한 벡터를 포함하는 숙주 세포를 포함한다. 바람직한 숙주 세포는 포유동물 및 세균 세포를 포함한다.Other aspects of the invention include vectors comprising any of the polynucleotides of the invention and host cells comprising such vectors. Preferred host cells include mammalian and bacterial cells.
또한, 본 발명은 제 1 gp120 아미노산 서열을 포함하는 폴리펩티드를 제조하는 방법을 제공한다. 한 가지 구체예에서, 이러한 방법은 (a) 본 발명의 벡터를 세포내로 도입시키고, (b) 폴리펩티드를 발현시키는 것을 포함한다. 세포는 생체내 또는 배양액내에 존재할 수 있다. 배양액내에 존재하는 경우, 이러한 방법은 폴리펩티드를 배양액으로부터 회수하는 것을 추가로 포함할 수 있다. 또 다른 구체예에서, 폴리펩티드는 (a) 발현 벡터를 포함하는 본 발명의 숙주 세포를 폴리펩티드의 발현에 적합한 조건하에서 배양하고, (b) 배양액으로부터 폴리펩티드를 회 수함으로써 제조된다.The present invention also provides a method of preparing a polypeptide comprising the first gp120 amino acid sequence. In one embodiment, such methods comprise (a) introducing a vector of the invention into a cell and (b) expressing a polypeptide. The cells can be in vivo or in culture. When present in a culture, the method may further comprise recovering the polypeptide from the culture. In another embodiment, the polypeptide is prepared by (a) culturing a host cell of the invention comprising an expression vector under conditions suitable for expression of the polypeptide, and (b) recovering the polypeptide from the culture.
본 발명의 또 다른 양태는 동물을 제 1 gp120 서열을 포함하는 폴리펩티드로 면역시키는 방법이다. 이러한 방법은 본 발명의 면역원성 조성물을 동물에게 투여하는 것을 포함한다.Another aspect of the invention is a method of immunizing an animal with a polypeptide comprising a first gp120 sequence. Such methods include administering an immunogenic composition of the invention to an animal.
또한, 본 발명은 피검체로부터의 생물학적 샘플을 (a) gp120 아미노산 서열을 포함하는 단리된 폴리펩티드와 접촉시키고, (b) 샘플이 단리된 폴리펩티드에 특이적으로 결합하는 항체를 함유하는 지의 여부를 결정하는 것을 포함하는 진단 방법을 제공한다. gp120 서열은 gp120의 V2, V3 및 C4 도메인을 적어도 포함하고, HIV gp120의 HXB-2 균주로부터의 gp120의 N-말단 메티오닌으로부터 넘버링하여 (i) 54번, 74번, 119번, 126번, 131번, 157번, 196번, 205번, 218번, 228번, 239번, 247번, 296번, 331번, 378번, 385번, 418번 및 445번 중 하나 이상의 위치에서 하나 이상의 시스테인 잔기를 결여하고/하거나, (ii) 24번, 29번, 34번, 54번, 74번, 119번, 126번, 131번, 157번, 196번, 205번, 218번, 228번, 239번, 247번, 296번, 331번, 378번, 385번, 418번, 445번, 493번, 495번, 499번 내지 501번, 503번 내지 508번 및 510번 이외의 위치에서 하나 이상의 추가의 시스테인 잔기를 포함한다. 그러나, gp120 서열은 V1 도메인에서 하나 이상의 추가의 시스테인을 지닌 서브타입 G gp120 서열 또는 V4 도메인에서 하나 이상의 추가의 시스테인을 지닌 서브타입 E gp120 서열이 아니다. 바람직한 구체예에서, gp120 서열은 V1 도메인을 추가로 포함한다.The invention also determines whether a biological sample from a subject is contacted with (a) an isolated polypeptide comprising the gp120 amino acid sequence, and (b) the sample contains an antibody that specifically binds to the isolated polypeptide. It provides a diagnostic method comprising the. The gp120 sequence comprises at least the V2, V3 and C4 domains of gp120 and is numbered from the N-terminal methionine of gp120 from the HXB-2 strain of HIV gp120 (i) 54, 74, 119, 126, 131 At least one cysteine residue at one or more of 157, 196, 205, 218, 228, 239, 247, 296, 331, 378, 385, 418 and 445 Or / or (ii) 24, 29, 34, 54, 74, 119, 126, 131, 157, 196, 205, 218, 228, 239, One or more additional cysteines at positions other than 247, 296, 331, 378, 385, 418, 445, 493, 495, 499 to 501, 503 to 508 and 510 It includes residues. However, the gp120 sequence is not a subtype G gp120 sequence with one or more additional cysteines in the V1 domain or a subtype E gp120 sequence with one or more additional cysteines in the V4 domain. In a preferred embodiment, the gp120 sequence further comprises a V1 domain.
또 다른 구체예에서, 본 발명은 피검체로부터의 생물학적 샘플이 비정상적인 시스테인 개수를 지닌 gp120 아미노산 서열을 포함하는 폴리펩티드 또는 이러한 gp120 아미노산 서열을 엔코딩하는 폴리누클레오티드를 포함하는 지의 여부를 결정하기 위해 피검체로부터의 생물학적 샘플을 검정하는 것을 포함하는 진단 방법을 제공한다. 더욱 상세하게는, 샘플이 HIV gp120의 HXB-2 균주로부터의 gp120의 N-말단 메티오닌으로부터 넘버링하여 (a) 54번, 74번, 119번, 126번, 131번, 157번, 196번, 205번, 218번, 228번, 239번, 247번, 296번, 331번, 378번, 385번, 418번 및 445번 중 하나 이상의 위치에서 하나 이상의 시스테인 잔기를 결여하고/하거나, (b) 24번, 29번, 34번, 54번, 74번, 119번, 126번, 131번, 157번, 196번, 205번, 218번, 228번, 239번, 247번, 296번, 331번, 378번, 385번, 418번, 445번, 493번, 495번, 499번 내지 501번, 503번 내지 508번 및 510번 이외의 위치에서 하나 이상의 추가의 시스테인 잔기를 포함하는 gp120 아미노산 서열에 대해 검정된다. 바람직하게는, gp120 서열은 V1 도메인에서 하나 이상의 추가의 시스테인을 지닌 서브타입 G gp120 서열 또는 V4 도메인에서 하나 이상의 추가의 시스테인을 지닌 서브타입 E gp120 서열이 아니다.In another embodiment, the present invention provides a method for determining whether a biological sample from a subject comprises a polypeptide comprising a gp120 amino acid sequence having an abnormal cysteine number or a polynucleotide encoding such a gp120 amino acid sequence. It provides a diagnostic method comprising assaying a biological sample of. More specifically, the samples were numbered from the N-terminal methionine of gp120 from the HXB-2 strain of HIV gp120 (a) 54, 74, 119, 126, 131, 157, 196, 205 Lack one or more cysteine residues at one or more of
이러한 구체예의 하나의 변형례에서, 검정은 샘플을 결합에 적합한 조건하에서 gp120 서열과 특이적으로 결합하는 항체와 접촉시키는 것을 포함하는 면역검정이다.In one variation of this embodiment, the assay is an immunoassay comprising contacting a sample with an antibody that specifically binds to the gp120 sequence under conditions suitable for binding.
이러한 구체예의 또 다른 변형례에서, 검정은 폴리누클레오티드 기초 검정이며, 여기서 샘플 폴리누클레오티드는 하이브리드화에 적합한 조건하에서 gp120 서열을 엔코딩하는 누클레오티드 서열에 특이적으로 하이브리드화되는 핵산 서열과 접촉된다. 바람직한 구체예에서, 핵산 분자는 한 쌍의 증폭 프라이머 중 하나 일 수 있으며, 이 경우, 검정은 샘플 폴리누클레오티드를 증폭에 적합한 조건하에서 둘 모두의 증폭 프라이머와 접촉시킨 후, 증폭 생성물이 생성되는 지의 여부를 결정함으로써 수행된다. 또 다른 바람직한 구체예에서, 핵산 분자는 고체상에 부착된 핵산 프로브이며, 검정은 샘플 폴리누클레오티드가 핵산 프로브에 특이적으로 하이브리드화되는 지의 여부를 결정함으로써 수행된다.In another variation of this embodiment, the assay is a polynucleotide based assay wherein the sample polynucleotide is contacted with a nucleic acid sequence that specifically hybridizes to a nucleotide sequence that encodes a gp120 sequence under conditions suitable for hybridization. In a preferred embodiment, the nucleic acid molecule can be one of a pair of amplification primers, in which case the assay contacts the sample polynucleotide with both amplification primers under conditions suitable for amplification, and then whether the amplification product is produced. Is determined by determining In another preferred embodiment, the nucleic acid molecule is a nucleic acid probe attached to a solid phase and the assay is performed by determining whether the sample polynucleotide is specifically hybridized to the nucleic acid probe.
첨부된 도면은 HIV-1 gp120 아미노산 서열의 개략도를 도시한다. 모든 도면에 있어서, 각각의 아미노산 잔기 위치에 대한 넘버링은 HIV-1 균주 HXB2를 기준으로 한다. 선은 변이된 실제 아미노산 잔기를 나타내며, 아미노산 잔기는 표준 1분자 약어를 사용하여 명명되어 있다. 모든 변이는 하나의 잔기에서 또 다른 잔기로의 치환을 나타내도록 하기 형식으로 표시된다: (HXB2의) Z에서 X -> Y, 여기서 X는 치환되는 아미노산이고, Y는 치환 (즉, 새로운) 아미노산이다. 5개의 가변 영역 및 5개의 보존 영역은 각각 V1-V5 및 C1-C5로서 나타나 있다.The attached figure shows a schematic of the HIV-1 gp120 amino acid sequence. In all figures, the numbering for each amino acid residue position is based on the HIV-1 strain HXB2. Lines represent actual amino acid residues that are mutated, and amino acid residues are named using standard single molecule abbreviations. All variants are represented in the following format to indicate a substitution from one residue to another: Z to X-> Y (of HXB2), where X is the amino acid that is substituted, and Y is the substituted (ie new) amino acid. to be. Five variable regions and five conserved regions are shown as V1-V5 and C1-C5, respectively.
도 1은 샘플 U-101c1에서 Cys 잔기가 Arg 잔기로 변화되는 치환을 지니는 54번 및 126번 위치를 도시한다. 전체 시스테인 보체(complement)는 16개 잔기이다.1 depicts positions 54 and 126 with substitutions where the Cys residue is changed to an Arg residue in sample U-101c1. The total cysteine complement is 16 residues.
도 2는 샘플 U-178c13에서 Cys 잔기가 각각 Arg 및 Tyr로 변화되는 치환을 지니는 위치 131번 및 228번 위치를 도시한다. 전체 시스테인 보체는 16개 잔기이다.FIG. 2 shows
도 3은 17개 잔기의 전체 시스테인 보체를 지닌 HIV-1 샘플을 도시한다. 샘 플명은 각각의 변이 뒤에 괄호안에 나타나 있다.3 shows an HIV-1 sample with total cysteine complement of 17 residues. The sample name is shown in parentheses after each variation.
도 4는 19개 잔기의 전체 시스테인 보체를 지닌 HIV-1 샘플을 도시한다. 샘플명은 각각의 변이 뒤에 괄호안에 나타나 있다.4 shows an HIV-1 sample with total cysteine complement of 19 residues. Sample names are shown in parentheses after each variation.
도 5A 내지 도 5X는 20개 잔기의 전체 시스테인 보체를 지닌 HIV-1 샘플을 도시한다. 샘플명은 각각의 도면 아래에 나타나 있다. 이러한 복합 도면에서, 단지 추가의 Cys 잔기를 지닌 영역이 도시되어 있다. 패널 A 내지 S에서, 루프내의 잔기의 개수 (Cys 잔기를 제외)는 패널의 우측 하단부에 도시되어 있다. U-374c1 및 U-234c10을 도시하는 패널 T는 추가의 시스테인을 기초로 하는 한 세트의 잠재적인 2차 구조를 나타낸다. 패널 U는 독특한 Cys 쌍이룸(pairing)을 기초로 하는 U-374c1 및 U-208_4cl에 대한 다양한 입체형태를 도시한다. V2 (U-033c1) (패널 V), V4 (U-210c2) (패널 W), 및 V5 (U-062_2c5) (패널 X) 내의 추가의 Cys 잔기가 또한 도시되어 있다.5A-5X show HIV-1 samples with total cysteine complement of 20 residues. Sample names are shown below each figure. In this composite drawing, only regions with additional Cys residues are shown. In panels A through S, the number of residues in the loop (except Cys residues) is shown in the lower right of the panel. Panel T, showing U-374c1 and U-234c10, shows a set of potential secondary structures based on additional cysteines. Panel U shows various conformations for U-374c1 and U-208_4cl based on unique Cys pairings. Additional Cys residues in V2 (U-033c1) (Panel V), V4 (U-210c2) (Panel W), and V5 (U-062_2c5) (Panel X) are also shown.
본 발명은 HIV gp120으로부터의 서열을 포함하는 단리된 폴리펩티드를 제공한다. 이러한 gp120 서열은 비정상적 이황화물 구조를 지닌다. 상세하게는, 본 발명의 gp120 서열은 정상적인 18개 시스테인 잔기 보다 많거나 적은 수의 시스테인 잔기를 함유한다. 본원에 기술된 gp120 서열은 북미, 푸에르토리코 및 네덜란드 전역에 걸쳐 있는 61개의 임상 연구소에서 수행된 HIV 백신의 대규모 임상 실험에서 수득되었다. 이러한 연구 과정 동안, 350건의 최근의 HIV 신규 감염으로부터 gp120 엔코딩 cDNA를 클로닝하고, 시퀀싱하였다. 이러한 연구에서, 혈장 샘플을 감염된 지 6개월내에 취하고, 혈장 바이러스 RNA를 역전사시키고, 중합효소 연쇄 반응 (PCR)에 의해 증폭시키고, 시퀀싱하였다. 의외로, 수득된 gp120 서열의 약 20%가 비정상적 이황화물 구조를 지녔다. 따라서, 이러한 구조적 특징은 감염 후 초기에 광범위한 변이가 숙주에서 일어날 수 있기 전에 상당한 수의 성공적인 바이러스에서 발견된다. 따라서, 비정상적 이황화물 구조를 지닌 바이러스는 신규 감염과 관련된 "전파 표현형(transmission phenotype)" 또는 북미에서 유행하는 HIV-1의 주된 신규 변이체를 나타낼 수 있다. 어느 경우든지, HIV-1 감염을 예방하도록 설계되는 백신은 비정상적인 이황화물 구조를 지닌 gp120을 함유하는 바이러스에 대해 유도되어야 한다.The present invention provides an isolated polypeptide comprising a sequence from HIV gp120. This gp120 sequence has an abnormal disulfide structure. Specifically, the gp120 sequence of the present invention contains more or fewer cysteine residues than normal 18 cysteine residues. The gp120 sequences described herein were obtained in large scale clinical trials of HIV vaccines conducted at 61 clinical laboratories throughout North America, Puerto Rico and the Netherlands. During this course of study, gp120 encoding cDNA was cloned and sequenced from 350 recent HIV new infections. In this study, plasma samples were taken within 6 months of infection, plasma viral RNA was reverse transcribed, amplified by polymerase chain reaction (PCR) and sequenced. Surprisingly, about 20% of the obtained gp120 sequence had an abnormal disulfide structure. Thus, these structural features are found in a significant number of successful viruses early in the day before extensive variation can occur in the host. Thus, viruses with abnormal disulfide structures may represent the "transmission phenotype" associated with new infections or the major new variant of HIV-1 prevalent in North America. In either case, vaccines designed to prevent HIV-1 infection should be directed against viruses containing gp120 with abnormal disulfide structures.
홀수의 시스테인을 함유하는 천연 단백질은 비정상적인데, 이는 유리된 설프히드릴기가 다른 유리된 설프히드릴기와 고도로 반응성이어서 분자내 및 분자간 이황화물 결합을 형성하는 경향이 있기 때문이다. 홀수의 시스테인을 함유하는 본 발명의 gp120 서열이 관심의 대상인데, 이는 쌍을 이루지 않은 시스테인 잔기가 여분의 분자간 및 분자내 이황화물 결합을 형성할 수 있으며, 분자 다양성을 발생시키도록 바이러스에 의해 이용되는 종래에 공지된 바 없는 메커니즘을 나타낼 수 있다. 원칙적으로, 이러한 다양성은 두 개의 gp120 분자 (각각 쌍을 이루지 않는 시스테인을 지님) 사이의 공유 결합을 매개하거나, gp41 내의 시스테인 잔기와의 공유 결합을 매개하거나, 또 다른 바이러스 또는 세포성 단백질과의 공유 결합을 매개함으로써 분자내 또는 분자간 이황화물 구조를 변화시킴으로써 발생될 수 있다.Natural proteins containing an odd number of cysteines are abnormal because free sulfhydryl groups are highly reactive with other free sulfhydryl groups and tend to form intramolecular and intermolecular disulfide bonds. Of interest are the gp120 sequences of the invention, which contain an odd number of cysteines, which are used by the virus to create unpaired cysteine residues that can form extra intermolecular and intramolecular disulfide bonds and generate molecular diversity. It may represent a mechanism that is not known in the art. In principle, this diversity mediates covalent bonds between two gp120 molecules (each with unpaired cysteines), covalent bonds with cysteine residues in gp41, or covalent with another viral or cellular protein. It can occur by changing the intramolecular or intermolecular disulfide structure by mediating the bond.
더욱이, 백신 개발자는 비리온 및 바이러스 감염된 세포의 표면상에 존재하는 gp120/gp41 복합체의 구조를 보다 정확하게 복제하는 신규 항원을 생성하는 데에 쌍을 이루지 않은 시스테인 잔기를 함유하는 HIV-1 외피 단백질을 사용할 수 있다. 예를 들어, HIV-1 비리온 및 바이러스 감염된 세포의 표면상에 존재하는 바이러스 스파이크(spike)는 단량체 gp120/gp41 복합체의 비공유 올리고머 (통상적으로 삼합체)인 것으로 여겨진다. 이러한 복합체는 취성이고 분해되므로, 단량체 또는 올리고머 gp120/gp41 복합체를 정제하는 것은 가능하지 않았다. 그러나, 올리고머 gp120/gp41 복합체를 제조하는 방법이 개발될 수 있는 경우, 우수한 HIV-1 백신이 개발될 수 있는 것으로 믿어진다. 실제로, 몇몇 연구자들은 쌍을 이루지 않은 시스테인 잔기를 gp120 및 gp41의 구조내로 공학처리하여 안정하고 공유 결합된 gp120/gp41 복합체를 생성시키기 위해 시험관내 변이유발을 이용하였다. 공유 결합된 복합체가 수득된다고 하더라도, 생성된 구조는 천연 구조가 아니므로 비리온 또는 바이러스 감염된 세포에 존재하는 것과 같은 gp120/gp41 복합체의 구조를 정확하게 복제하지 못했다. 본원에 기술된 다수의 분자는 천연에 존재하는 것이고 이황화물 안정화된 gp120/gp41 복합체의 제조를 위해 종래에 사용된 것들과 거리가 먼 규정된 영역에 쌍을 이루지 않는 시스테인을 지니고 있으므로, 이들 분자는 gp120의 면역원성을 향상시키기 위해 다양한 이황화물 결합된 구조를 유전 공학처리하기 위한 특별한 기회를 제공한다. 예시적인 구조는 동종이합체, gp120/gp41 이종이합체, 다른 HIV-1 단백질에 공유 결합된 gp120, 비-HIV-1 단백질에 결합된 gp120 또는 비-단백질 화학적 조성물에 공유 결합된 gp120을 포함한다. 이러한 구조는 단량체로 달성할 수 있는 면역원성 이상으로 면역원성을 향상시킬 수 있다. 이종이합체 gp120 복합체는 면역 반응의 폭을 확장시키는 데에 유용하다. 예를 들어, 두 개의 상이한 HIV-1 서브타입으로부터의 gp120 서열을 함유하는 이종이합체는 각각의 HIV-1 서브타입에 대한 면역 반응을 유도시키기 위해 사용될 수 있다. 다른 구체예에서, 유리된 설프히드릴기를 함유하는 gp120 서열은, 예를 들어 gp120 서열을 특정 세포 유형 (예를 들어, 수지상 세포 또는 대식구)에 표적화시키거나 면역원성을 향상시키기 위해 (예를 들어, gp120 서열을 고도로 면역원성인 담체 단백질, 예를 들어 디프테리아 독소, 키홀 림펫 헤모시아닌 (keyhole limpet hemocyanin), 티로글로불린 또는 우혈청 알부민에 결합시킴으로써), 다른 부분에 결합될 수 있다. gp120은 높은 친화성으로 세포 표면 항원 CD4에 결합하므로, 유리된 설프히드릴기를 함유하는 gp120 서열은 CD4 함유 세포로의 표적화를 위해 약물에 결합될 수 있다.Moreover, vaccine developers have developed HIV-1 envelope proteins containing unpaired cysteine residues to generate new antigens that more accurately replicate the structure of the gp120 / gp41 complex present on the surface of virion and virus infected cells. Can be used. For example, viral spikes present on the surface of HIV-1 virions and virus infected cells are believed to be non-covalent oligomers (typically trimers) of the monomeric gp120 / gp41 complex. Since these complexes are brittle and degrade, it was not possible to purify the monomer or oligomeric gp120 / gp41 complexes. However, it is believed that good HIV-1 vaccines can be developed if methods for preparing oligomeric gp120 / gp41 complexes can be developed. Indeed, some researchers have used in vitro mutagenesis to engineer unpaired cysteine residues into the structures of gp120 and gp41 to produce stable and covalently linked gp120 / gp41 complexes. Although covalently bound complexes are obtained, the resulting structures are not native structures and thus did not accurately replicate the structure of the gp120 / gp41 complexes such as those present in virion or virus infected cells. Many of the molecules described herein are naturally present and have cysteines that do not pair in defined regions far from those used conventionally for the production of disulfide stabilized gp120 / gp41 complexes. It offers a special opportunity for genetic engineering of various disulfide bound structures to enhance the immunogenicity of gp120. Exemplary structures include homodimers, gp120 / gp41 heterodimers, gp120 covalently linked to other HIV-1 proteins, gp120 linked to non-HIV-1 proteins, or gp120 covalently bound to non-protein chemical compositions. This structure can enhance immunogenicity beyond the immunogenicity achievable with monomers. The heterodimeric gp120 complex is useful for expanding the breadth of the immune response. For example, heterodimers containing gp120 sequences from two different HIV-1 subtypes can be used to induce an immune response against each HIV-1 subtype. In other embodiments, the gp120 sequence containing the free sulfhydryl group can be used, for example, to target (eg, enhance) immunogenicity or to target the gp120 sequence to a particular cell type (eg, dendritic cells or macrophages). by binding the gp120 sequence to a highly immunogenic carrier protein such as diphtheria toxin, keyhole limpet hemocyanin, tyroglobulin or bovine serum albumin). Since gp120 binds to cell surface antigen CD4 with high affinity, gp120 sequences containing free sulfhydryl groups can be bound to the drug for targeting to CD4-containing cells.
정의Justice
본원 명세서에 사용된 용어는 별다른 언급이 없는 한 하기 제시된 바와 같이 정의된다.The terminology used herein is defined as set forth below unless otherwise noted.
전장 gp120 아미노산 서열은, 서열이 대부분의 전장 gp120 서열에 존재하는 18개를 초과하는 시스테인 잔기를 함유하는 경우, 하나 이상의 "추가 시스테인"을 지니는 것으로 언급된다. 또한, gp120 단편은, 이것이 유래되는 전장 gp120 서열이 정상적인 18개 시스테인 잔기 보다 많은 개수의 시스테인 잔기를 함유하고 여분의 시스테인(들) 중 하나 이상이 단편에 존재하는 경우, "추가의 시스테인"을 지니는 것으로 언급된다.The full length gp120 amino acid sequence is said to have one or more "additional cysteines" when the sequence contains more than 18 cysteine residues present in most full length gp120 sequences. In addition, a gp120 fragment has a "additional cysteine" if the full-length gp120 sequence from which it contains contains more cysteine residues than normal 18 cysteine residues and one or more of the extra cysteine (s) is present in the fragment. It is mentioned.
"면역원성 조성물"이란 용어는 면역 반응을 유도할 수 있는 임의의 조성물을 의미한다.The term "immunogenic composition" means any composition capable of inducing an immune response.
본원에 사용된 "백신"이란 용어는 감염 물질에 의한 감염의 위험을 감소시키거나 감염을 예방하는 면역원성 조성물 ("예방용 백신") 또는 기존의 감염을 임의의 정도로 개선시키는 면역원성 조성물 ("치료용 백신")을 의미한다. 백신이 감염 물질에 의한 후속 공격으로부터 생물체를 방어하는 경우, 이러한 백신은 "방어용"으로 언급된다.As used herein, the term "vaccine" refers to an immunogenic composition ("prevention vaccine") that reduces the risk of or prevents infection by an infectious agent or an immunogenic composition that improves to any extent an existing infection (" Therapeutic vaccine "). If the vaccine protects the organism from subsequent attack by the infectious agent, such vaccine is referred to as "defense".
본원에 사용된 "DNA 백신"이란 용어는 항원을 엔코딩하는 폴리누클레오티드를 하나 이상 함유하는 백신을 의미하며, 이러한 폴리누클레오티드가 생물체에 투여되면 엔코딩된 항원이 발현된 후, 이러한 항원에 대한 면역 반응이 일어난다.As used herein, the term "DNA vaccine" refers to a vaccine containing one or more polynucleotides encoding an antigen, wherein when such polynucleotides are administered to an organism, the immune response to these antigens is expressed after the encoded antigen is expressed. Happens.
본원에 사용된 "바이러스 유래 백신"이란 용어는 바이러스 입자, 바이러스 유사 입자 (VLP), 바이러스 입자 또는 VLP의 일부 및/또는 표면상에서 항원을 디스플레이하는 바이러스 감염된 세포를 함유하는 백신을 의미하며, 이러한 입자 또는 세포가 생물체에 투여되면 디스플레이된 항원에 대한 면역 반응이 유도된다. "바이러스 유래된 백신"이란 용어는 두 개 이상의 상이한 공급원으로부터의 성분을 함유하는 키메라 바이러스 입자 또는 VLP를 포함한다.The term "virus derived vaccine" as used herein refers to a virus containing a virus particle, a virus like particle (VLP), a virus particle or a virus infected cell displaying an antigen on a portion and / or surface of the VLP, such a particle. Or when cells are administered to an organism, an immune response against the displayed antigen is induced. The term "virus derived vaccine" includes chimeric virus particles or VLPs containing components from two or more different sources.
"폴리펩티드" 및 "단백질"이란 용어는 아미노산의 중합체를 의미하는 것으로 상호교환적으로 사용되며, 달리 특정되지 않는 경우 천연 아미노산과 유사한 방식으로 기능할 수 있는 비정형(atypical) 아미노산을 포함한다.The terms "polypeptide" and "protein" are used interchangeably to mean polymers of amino acids and include atypical amino acids that can function in a similar manner to natural amino acids unless otherwise specified.
폴리펩티드 또는 폴리누클레오티드와 관련하여 사용된 "단리된"이란 용어는 폴리펩티드 또는 폴리누클레오티드와 함께 전형적으로 존재하는 하나 이상의 다른 성분과 분리된 폴리펩티드 또는 폴리누클레오티드를 의미한다. 따라서, 천연 폴리펩티드는 이것이 이러한 폴리펩티드 또는 폴리누클레오티드와 함께 천연적으로 존재하는 하나 이상의 다른 성분으로부터 정제된 경우에 단리된 것이다. 재조합 폴리펩티드 또는 폴리누클레오티드는 이것이 이러한 폴리펩티드 또는 폴리누클레오티드가 생성된 경우에 존재하는 하나 이상의 다른 성분으로부터 정제된 경우에 단리된 것이다.The term "isolated" as used in the context of a polypeptide or polynucleotide means a polypeptide or polynucleotide that is separated from one or more other components typically present with the polypeptide or polynucleotide. Thus, a natural polypeptide is isolated when it is purified from one or more other components that exist naturally with such polypeptide or polynucleotide. A recombinant polypeptide or polynucleotide is isolated when it is purified from one or more other components present when such polypeptide or polynucleotide is produced.
"아미노산" 또는 "아미노산 잔기"란 용어는 별다른 언급이 없는 한 천연 L-아미노산 또는 잔기를 포함한다. 통상적으로 사용되는 아미노산에 대한 1문자 및 3문자 약어가 본원에 사용된다 (Lehninger, A. L. (1975) Biochemistry, 2d ed., pp. 71-92, Worth Publishers, N. Y.). "아미노산" 및 "아미노산 잔기"란 용어는 D-아미노산 뿐만 아니라 화학적으로 개질된 아미노산, 예를 들어 아미노산 유사체, 단백질내로 보통 혼입되지 않는 천연 아미노산, 및 아미노산의 특징적인 특성을 지닌 화학 합성된 화합물 (집합적으로, "비정형" 아미노산)을 포함한다. 예를 들어, 천연 Phe 또는 Pro와 동일한 펩티드 화합물의 입체형태 제한을 가능케 하는 페닐알라민 또는 프롤린의 유사체 또는 미메틱(mimetic)이 "아미노산"의 정의에 포함된다.The term "amino acid" or "amino acid residue" includes natural L-amino acids or residues unless otherwise indicated. One- and three-letter abbreviations for commonly used amino acids are used herein (Lehninger, A. L. (1975) Biochemistry, 2d ed., Pp. 71-92, Worth Publishers, N. Y.). The terms "amino acid" and "amino acid residue" refer to D-amino acids as well as chemically modified amino acids, such as amino acid analogs, natural amino acids not normally incorporated into proteins, and chemically synthesized compounds having the characteristic properties of amino acids ( Collectively, “atypical” amino acids). For example, analogs or mimetics of phenylalamine or proline that allow conformational restriction of the same peptide compound as native Phe or Pro are included in the definition of “amino acid”.
예시적인 비정형 아미노산은 예를 들어 국제 공개 번호 WO 90/01940에 기재된 아미노산 뿐만 아니라 Glu 및 Asp에 대해 치환될 수 있는 2-아미노 아디프산 (Aad); Glu 및 Asp에 대한 2-아미노피멜산 (Apm); Met, Leu 및 그 밖의 지방족 아미노산에 대한 2-아미노부티르산 (Abu); Met, Leu 및 그 밖의 지방족 아미노산에 대한 2-아미노헵탄산 (Ahe); Gly에 대한 2-아미노이소부티르산 (Aib); Val, Leu 및 Ile에 대한 시클로헥실알라닌 (Cha); Arg 및 Lys에 대한 호모아르기닌 (Har); Lys, Arg 및 His에 대한 2,3-디아미노프로피온산 (Dpr); Gly, Pro 및 Ala에 대한 N-에틸글리신 (EtGly); Asn 및 Gln에 대한 N-에틸아스파라긴 (EtAsn); Lys에 대한 히드록실리신 (Hyl); Lys에 대한 알로히드록실리신 (Ahyl); Pro, Ser 및 Thr에 대한 3- (및 4-)히드록시프롤린 (3Hyp, 4Hyp); Ile, Leu 및 Val에 대한 알로-이소루신 (Aile); Gly, Pro 및 Ala에 대한 N-메틸글리신 (MeGly, 사르코신); Ile에 대한 N-메틸이소루신 (MeIle); Met 및 그 밖의 지방족 아미노산에 대한 노르발린 (Nva); Met 및 그 밖의 지방족 아미노산에 대한 노르루신 (Nle); Lys, Arg 및 His에 대한 오르니틴 (Orn); Thr, Asn 및 Gln에 대한 시트룰린 (Cit) 및 메티오닌 설폭시드 (MSO); Phe에 대한 N-메틸페닐알라닌 (MePhe), 트리메틸페닐알라닌, 할로 (F, Cl, Br 및 I) 페닐알라닌 및 트리플루오릴페닐알라닌을 포함한다.Exemplary atypical amino acids include, for example, 2-amino adipic acid (Aad) which may be substituted for the amino acids described in International Publication No. WO 90/01940 as well as Glu and Asp; 2-aminopimelic acid (Apm) for Glu and Asp; 2-aminobutyric acid (Abu) for Met, Leu and other aliphatic amino acids; 2-aminoheptanoic acid (Ahe) for Met, Leu and other aliphatic amino acids; 2-aminoisobutyric acid (Aib) for Gly; Cyclohexylalanine (Cha) for Val, Leu and Ile; Homoarginine (Har) for Arg and Lys; 2,3-diaminopropionic acid (Dpr) for Lys, Arg and His; N-ethylglycine (EtGly) for Gly, Pro and Ala; N-ethylasparagine (EtAsn) for Asn and Gln; Hydroxylicin (Hyl) for Lys; Allohydroxylysine (Ahyl) for Lys; 3- (and 4-) hydroxyproline (3Hyp, 4Hyp) for Pro, Ser and Thr; Allo-isoleucine (Aile) for Ile, Leu and Val; N-methylglycine (MeGly, Sarcosine) for Gly, Pro and Ala; N-methylisoleucine (MeIle) for Ile; Norvaline (Nva) for Met and other aliphatic amino acids; Norleucine (Nle) for Met and other aliphatic amino acids; Ornithine (Orn) for Lys, Arg and His; Citrulline (Cit) and methionine sulfoxide (MSO) for Thr, Asn and Gln; N-methylphenylalanine (MePhe), trimethylphenylalanine, halo (F, Cl, Br and I) phenylalanine and trifluorylphenylalanine for Phe.
폴리펩티드에 관해 사용된 "전장"이란 용어는 성숙한 야생형 폴리펩티드와 동일한 길이를 지닌 폴리펩티드를 의미한다.As used herein, the term "full length" refers to a polypeptide having the same length as a mature wild type polypeptide.
폴리펩티드 또는 폴리누클레오티드에 관해 사용된 "단편"이란 용어는 거대 분자의 일부를 나타낸다. 따라서, 폴리펩티드 단편은 거대 분자의 N-말단 부분, C-말단 부분 또는 둘 모두를 결여한다. 또한, 폴리펩티드 단편은 본원에서 "펩티드"로서 언급된다. 폴리누클레오티드의 단편은 거대 분자의 5' 부분, 3' 부분 또는 둘 모두를 결여할 수 있다. 또한, 폴리누클레오티드 단편은 본원에서 "올리고누클레오티드"로서 언급된다. 올리고누클레오티드는 비교적 짧은 핵산 분자, 일반적으로 200개 보다 짧은 누클레오티드, 더욱 특히 100개 보다 짧은 누클레오티드, 가장 특히 50개 보다 짧은 누클레오티드이다. 전형적으로, 올리고누클레오티드는 단일 가닥 DNA 분자이다.The term "fragment" as used with respect to a polypeptide or polynucleotide refers to a part of a macromolecule. Thus, polypeptide fragments lack the N-terminal portion, C-terminal portion, or both of the macromolecules. Polypeptide fragments are also referred to herein as "peptides". Fragments of polynucleotides may lack the 5 'portion, 3' portion or both of the macromolecules. Polynucleotide fragments are also referred to herein as "oligonucleotides". Oligonucleotides are relatively short nucleic acid molecules, generally shorter than 200 nucleotides, more particularly shorter than 100 nucleotides, most particularly shorter than 50 nucleotides. Typically, oligonucleotides are single stranded DNA molecules.
아미노산 또는 누클레오티드 서열의 "서브시퀀스(subsequence)"는 거대 서열의 일부이다.The "subsequence" of an amino acid or nucleotide sequence is part of a large sequence.
두 개 이상의 아미노산 또는 누클레오티드 서열과 관련된 "동일한" 또는 "동일성 퍼센트"란 용어는 하기 서열 비교 알고리듬 중 하나를 사용하여 또는 시각적 조사에 의해 측정하여 최대 일치성에 대해 비교되고 정렬되는 경우에 동일하거나, 동일한 아미노산 잔기 또는 누클레오티드의 특정 비율을 지니는 둘 이상의 서열 또는 서브시퀀스를 의미한다.The terms "identical" or "percent identity" associated with two or more amino acid or nucleotide sequences are the same or the same when compared and aligned for maximal identity, as determined by visual inspection or using one of the following sequence comparison algorithms. By two or more sequences or subsequences having a specific proportion of amino acid residues or nucleotides.
서열 비교를 위해, 전형적으로 하나의 서열은 시험 서열이 비교되는 기준 서열로서 기능한다. 서열 비교 알고리듬을 사용하는 경우, 시험 및 기준 서열이 컴퓨터로 입력되고, 필요한 경우 서브시퀀스 좌표가 지정되고, 서열 알고리듬 프로그램 파라미터가 지정된다. 그 후, 서열 비교 알고리듬은 지정된 프로그램 파라미터를 기초로 하여 기준 서열에 대한 시험 서열(들)의 서열 동일성 퍼센트를 계산한다.For sequence comparison, typically one sequence functions as a reference sequence to which test sequences are compared. When using a sequence comparison algorithm, test and reference sequences are entered into the computer, subsequence coordinates are designated if necessary, and sequence algorithm program parameters are specified. The sequence comparison algorithm then calculates the percent sequence identity of the test sequence (s) relative to the reference sequence based on the designated program parameters.
비교를 위한 서열의 최적 정렬은, 예를 들어 스미스(Smith) & 워터만(Waterman)의 국소 상동성 알고리듬 (Adv. Appl. Math. 2:482 (1981))에 의해, 니들만(Needleman) & 운스히(Wunsch)의 상동성 정렬 알고리듬 (J. Mol. Biol. 48:443 (1970))에 의해, 피어슨(Pearson) & 립만(Lipman)의 유사성 방법에 대한 검색 ((1988) Proc. Natl. Acad. Sci. USA 85:2444)에 의해, 이들 알고리듬의 컴퓨터화된 실행 (GAP, BESTFIT, FASTA, and TFASTA in the Wisconsin Genetics Software Package, Genetic Computer Group, 575 Science Dr., Madison, WI)에 의해, 또는 시각적 조사 (개괄적으로 상기 오수벨(Ausubel) 등의 문헌을 참조하라)에 의해 수행될 수 있다.The optimal alignment of the sequences for comparison is, for example, by Smith & Waterman's local homology algorithm (Adv. Appl. Math. 2: 482 (1981)). A search for similarity methods of Pearson & Lipman (Wunsch's homology alignment algorithm (J. Mol. Biol. 48: 443 (1970)) ((1988) Proc. Natl. By Acad. Sci. USA 85: 2444) by computerized implementation of these algorithms (GAP, BESTFIT, FASTA, and TFASTA in the Wisconsin Genetics Software Package, Genetic Computer Group, 575 Science Dr., Madison, WI). Or visual inspection (see Ausubel et al., Supra).
유용한 알고리듬의 한 가지 예는 PILEUP이다. PILEUP은 관계 및 서열 동일성 퍼센트를 나타내기 위해 점진적 쌍(pairwise) 정렬을 사용하여 관련 서열의 그룹으로부터 다수의 서열 정렬을 생성시킨다. 이는 또한 정렬을 생성시키는데 사용되는 클러스터링 관계를 보여주는 트리(tree) 또는 계통수(dendrogram)를 플로팅한다. PILEUP은 펭(Feng) & 둘리틀(Doolittle)의 점진적 정렬 방법 ((1987) J. Mol. Evol. 35:351-360))의 단순화를 이용한다. 사용된 방법은 히긴스(Higgins) & 샤프(Sharp)에 의해 기재된 방법 ((1989) CABIOS 5:151-153)과 유사하다. 프로그램은 각각 5,000개 누클레오티드 또는 아미노산의 최대 길이를 지닌 300개 이하의 서열을 정렬할 수 있다. 다중 정렬 절차는 두 개의 가장 유사한 서열의 쌍 정렬로 시작하여, 두 개의 정렬된 서열의 클러스터를 생성시킨다. 그 후, 이러한 클러스터는 다음의 가장 관련된 서열 또는 정렬된 서열들의 클러스터에 대해 정렬된다. 서열들의 두 개의 클러스터는 두 개의 개별적 서열의 쌍 정렬의 단순한 신장에 의해 정렬된다. 최종 정렬은 일련의 점진적 쌍 정렬에 의해 달성된다. 프로그램은 특정 서열 및 서열 비교의 영역을 위한 이들의 아미노산 또는 누클레오티드 좌표를 지정함에 의해 및 프로그램 파라미터를 지정함에 의해 실행된다. 예를 들어, 기준 서열은 하기 파라미터를 이용하여 서열 동일성 퍼센트 관계를 측정하기 위해 다른 시험 서열과 비교될 수 있다: 디폴트 갭 중량 (3.00), 디폴트 갭 길이 중량 (0.10) 및 칭량된 말단 갭.One example of a useful algorithm is PILEUP. PILEUP generates multiple sequence alignments from groups of related sequences using progressive pairwise alignments to indicate percent relationship and sequence identity. It also plots a tree or dendrogram showing the clustering relationships used to generate the alignment. PILEUP takes advantage of the simplification of Feng & Doolittle's progressive alignment method ((1987) J. Mol. Evol. 35: 351-360). The method used is similar to the method described by Higgins & Sharp ((1989) CABIOS 5: 151-153). The program can align up to 300 sequences, each with a maximum length of 5,000 nucleotides or amino acids. The multiple alignment procedure begins with a pair alignment of the two most similar sequences, resulting in a cluster of two aligned sequences. This cluster is then aligned with respect to the next most relevant sequence or cluster of aligned sequences. Two clusters of sequences are aligned by simple extension of the pair alignment of two individual sequences. Final alignment is achieved by a series of progressive pair alignments. The program is run by specifying their amino acid or nucleotide coordinates for a particular sequence and region of sequence comparison and by specifying program parameters. For example, reference sequences can be compared with other test sequences to determine percent sequence identity relationships using the following parameters: default gap weight (3.00), default gap length weight (0.10) and weighed terminal gap.
서열 동일성 퍼센트 및 서열 유사성 퍼센트를 측정하는데 적합한 알고리듬의 또 다른 예는 알트슐(Altschul) 등에 의해 기술된 ((1990) J. Mol. Biol. 215:403-410) BLAST 알고리듬 (Basic Local Alignment Search Tool)이다. BLAST 분석을 수행하기 위한 소프트웨어는 국립 생물공학 정보 센터 (http://www.ncbi.nlm.nih.gov/)를 통해 공개적으로 입수할 수 있다. 이러한 알고리듬은 데이터베이스 서열 중의 동일한 길이의 워드(word)와 정렬되는 경우에 일부 포지티브값의 한계 스코어 T와 매치(match)되거나 이를 만족시키는 질문(query) 서열 중의 길이 W의 짧은 워드를 확인함으로써 먼저 높은 스코어링 서열 쌍 (HSP)을 확인하는 것을 포함한다. T는 이웃하는 워드 스코어 한계로서 언급된다 (Altschul et al, supra). 이러한 초기의 이웃하는 워드 히트(hit)는 이들을 함유하는 보다 긴 HSP를 발견하기 위한 검색을 개시하는데 있어서 시드(seed)로서 작용한다. 그 후, 워드 히트는 누적 정렬 스코어가 증가될 수 있는 한 각각의 서열을 따라 양 방향으로 신장된다. 누적 스코어는 누클레오티드 서열에 대해 파라미터 M (한 쌍의 매칭 잔기에 대한 보상 스코어; 항상 0 보다 큼) 및 N (미스매칭 잔기에 대한 패널티 스코어; 항상 0 보다 작음)을 사용하여 계산된다. 아미노산 서열의 경우, 누적 스코어를 계산하기 위해 스코어링 매트릭스가 사용된다. 각각의 방향으로의 워드 히트의 신장은, 누적 정렬 스코어가 이의 최대 수득값으로부터 양 X 만큼 적어지거나; 누적 스코어가 하나 이상의 네거티브-스코어링 잔기 정렬의 누적으로 인해 0 이하가 되거나; 어느 하나의 서열의 말단에 이르는 경우에 정지된다. BLAST 알고리듬 파라미터 W, T 및 X는 정렬의 민감성 및 속도를 결정한다. BLASTN 프로그램 (누클레오티드 서열의 경우)은 디폴트로서 11의 워드 길이 (W), 10의 기대값 (E), M=5, N=-4, 및 양쪽 가닥의 비교를 사용한다. 아미노산 서열의 경우, BLASTP 프로그램은 디폴트로서 3의 워드 길이 (W), 10의 기대값 (E), 및 BLOSUM62 스코어링 매트릭스를 사용한다 (참조: Henikoff & Henikoff (1989) Proc. Natl. Acad. Sci. USA 89:10915)를 사용한다.Another example of a suitable algorithm for measuring percent sequence identity and percent sequence similarity is the BLAST algorithm (Basic Local Alignment Search Tool (1990) J. Mol. Biol. 215: 403-410) described by Altschul et al. )to be. Software for performing BLAST analyzes is publicly available through the National Biotechnology Information Center (http://www.ncbi.nlm.nih.gov/). This algorithm is first determined by identifying short words of length W in the query sequence that match or satisfy the marginal score T of some positive values when aligned with words of equal length in the database sequence. Identifying a scoring sequence pair (HSP). T is referred to as the neighboring word score limit (Altschul et al, supra). This initial neighboring word hit acts as a seed in initiating a search to find longer HSPs containing them. The word hits are then stretched in both directions along each sequence as long as the cumulative alignment score can be increased. Cumulative scores are calculated using parameters M (compensation score for a pair of matching residues; always greater than 0) and N (penalty score for mismatching residues; always less than 0) for the nucleotide sequence. For amino acid sequences, a scoring matrix is used to calculate the cumulative score. The elongation of the word hit in each direction is such that the cumulative alignment score is reduced by an amount X from its maximum obtained value; The cumulative score is less than or equal to zero due to the accumulation of one or more negative-scoring residue alignments; It stops when it reaches the end of either sequence. The BLAST algorithm parameters W, T, and X determine the sensitivity and speed of the alignment. The BLASTN program (for nucleotide sequences) uses a comparison of 11 strands of word length (W), 10 expected values (E), M = 5, N = -4, and both strands by default. For amino acid sequences, the BLASTP program uses, by default, a word length of 3 (W), an expected value of 10 (E), and a BLOSUM62 scoring matrix (see Henikoff & Henikoff (1989) Proc. Natl. Acad. Sci. USA 89: 10915).
서열 동일성 퍼센트를 계산하는 것 이외에, BLAST 알고리듬은 또한 두 개 서열 사이의 유사성의 통계적 분석을 수행한다 (참조: Karlin & Altschul 1993) Proc. Natl. Acad. Sci. USA, 90: 5873-5787). BLAST 알고리듬에 의해 제공되는 유사성의 한 가지 측정치는 최소 합계 확률 (P(N))이며, 이는 두 개의 누클레오티드 또는 아미노산 서열 사이의 매치가 우연히 일어날 확률의 표시를 제공한다. 예를 들어, 핵산은 기준 핵산에 대한 시험 핵산의 비교에서 최소 합계 확률이 약 0.1 미만, 더욱 바람직하게는 약 0.01 미만, 가장 바람직하게는 약 0.001 미만인 경우에 기준 서열과 유사한 것으로 간주된다.In addition to calculating percent sequence identity, the BLAST algorithm also performs statistical analysis of the similarity between the two sequences (Karlin & Altschul 1993) Proc. Natl. Acad. Sci. USA, 90: 5873-5787). One measure of similarity provided by the BLAST algorithm is the minimum sum probability (P (N)), which provides an indication of the probability that a match between two nucleotide or amino acid sequences will occur by chance. For example, a nucleic acid is considered analogous to a reference sequence when the minimum sum probability is less than about 0.1, more preferably less than about 0.01, and most preferably less than about 0.001 in comparison of the test nucleic acid to the reference nucleic acid.
둘 이상의 폴리펩티드 중의 잔기는 이들이 상동 (즉, 1차, 2차 또는 3차 구조에서 유사한 위치를 차지함)이거나 유사 (즉, 동일하거나 유사한 기능적 능력을 지님)한 경우에 "상응(correspond)"하는 것으로 언급된다. 당 분야에 널리 공지된 바와 같이, 상동성 잔기는 상기 기술된 최대 일치에 대해 폴리펩티드 서열을 정렬함으로써 결정될 수 있다.Residues in two or more polypeptides are said to be "correspond" when they are homologous (ie, occupy similar positions in the primary, secondary, or tertiary structure) or similar (ie, have the same or similar functional capacity). Is mentioned. As is well known in the art, homologous residues can be determined by aligning the polypeptide sequence for the maximum match described above.
본원에 기술된 gp120 아미노산 서열 중의 위치는 위치 번호에 의해 확인되는데, 여기서 번호는 HIV의 HXB-2 균주로부터의 gp120 (SEQ ID NO:138)의 N-말단 메티오닌으로부터 넘버링된다.The position in the gp120 amino acid sequence described herein is identified by position number, where the number is numbered from the N-terminal methionine of gp120 (SEQ ID NO: 138) from the HXB-2 strain of HIV.
gp120 아미노산 서열은 또 다른 아미노산 잔기가 특정 위치에 통상적으로 존재하는 시스테인 대신 치환된 경우 또는 시스테인이 결실된 경우에 그러한 위치에서 시스테인을 결여하는 것으로 언급된다.The gp120 amino acid sequence is said to lack cysteine at that position if another amino acid residue is substituted for cysteine that is normally present at that position or if the cysteine is deleted.
폴리펩티드와 관련하여 사용된 "야생형"이란 용어는 분자의 공급원과 무관하게 천연 생물체로부터의 폴리펩티드 중에 존재하는 아미노산 서열을 지닌 임의의 폴리펩티드를 의미하는데, 즉, "야생형"이란 용어는 분자가 천연원으로부터 정제되었는 지; 재조합 기술을 사용하여 발현된 후 정제되었는 지; 또는 합성되었는 지의 여부와 무관한 서열 특징을 의미한다.The term "wild type" as used in reference to a polypeptide refers to any polypeptide having an amino acid sequence present in a polypeptide from a natural organism, regardless of the source of the molecule, ie the term "wild type" means that the molecule is from a natural source. Purified; Purified after expression using recombinant technology; Or a sequence feature independent of whether it was synthesized.
"아미노산 서열 변형체"란 용어는 아미노산의 첨가, 결실 또는 치환에 의해 야생형 아미노산 서열과 상이해진 아미노산 서열을 지닌 폴리펩티드를 의미한다.The term "amino acid sequence variant" refers to a polypeptide having an amino acid sequence that differs from the wild-type amino acid sequence by addition, deletion or substitution of amino acids.
본원에 사용된 "보존적 아미노산 치환"이란 용어는 아미노산이 기능적으로 균등한 아미노산으로 대체됨을 의미한다. 기능적으로 균등한 아미노산은 일반적으로 이들이 대체하는 아미노산과 크기 및/또는 특성 (예를 들어, 전하 또는 소수성)이 유사하다. 유사한 특성을 지닌 아미노산은 다음과 같이 분류될 수 있다:As used herein, the term "conservative amino acid substitutions" means that amino acids are replaced by functionally equivalent amino acids. Functionally equivalent amino acids are generally similar in size and / or properties (eg, charge or hydrophobicity) to the amino acids they replace. Amino acids with similar properties can be classified as follows:
(1) 소수성: His, Trp, Tyr, Phe, Met, Leu, Ile, Val, Ala;(1) hydrophobic: His, Trp, Tyr, Phe, Met, Leu, Ile, Val, Ala;
(2) 중성 소수성: Cys, Ser, Thr;(2) neutral hydrophobicity: Cys, Ser, Thr;
(3) 극성: Ser, Thr, Asn, Gln;(3) polarity: Ser, Thr, Asn, Gln;
(4) 산성/음 하전: Asp, Glu;(4) acidic / negative charge: Asp, Glu;
(5) 하전: Asp, Glu, Arg, Lys, His;(5) charges: Asp, Glu, Arg, Lys, His;
(6) 염기성/양 하전: Arg, Lys, His;(6) basic / positive charges: Arg, Lys, His;
(7) 염기성: Asn, Gln, His, Lys, Arg;(7) basic: Asn, Gln, His, Lys, Arg;
(8) 사슬 배향에 영향을 주는 잔기: Gly, Pro; 및(8) residues that affect chain orientation: Gly, Pro; And
(9) 방향족: Trp, Tyr, Phe, His.(9) aromatic: Trp, Tyr, Phe, His.
하기 표는 예시적이고 바람직한 보존적 아미노산 치환을 나타낸다.The table below shows exemplary and preferred conservative amino acid substitutions.
"시그널 서열"은 막 결합된 단백질 및 분비된 단백질상에서 발견되는 아미노산 서열로서, 단백질의 합성을 막 결합된 리보솜으로 유도하여 초기 펩티드 사슬의 소포체의 내강내로의 전위를 매개하는 아미노산 서열이며, 여기서 유리된 리보솜상의 세포질에서 합성되는 단백질에 대해 이용가능하지 않은 다양한 번역후 수식 (예를 들어, 글리코실화)이 이용가능하다. 재조합 발현에서 사용되는 바와 같이, 폴리펩티드는 정제의 용이함을 위해 폴리펩티드를 배양 배지내로 발현시키는 세포로부터 분비된다. 시그널 서열은 이것이 재조합 발현 또는 분비를 촉진시키도록 융합되는 폴리펩티드 이외의 폴리펩티드로부터 유래되는 경우 "이종성"인 것으로 언급된다.A "signal sequence" is an amino acid sequence found on membrane-bound and secreted proteins that is an amino acid sequence that directs the synthesis of proteins into membrane-bound ribosomes and mediates the translocation of the endoplasmic reticulum of the initial peptide chain into the lumen, where the free Various post-translational modifications (eg glycosylation) are available that are not available for the protein synthesized in the cytoplasm on the ribosome. As used in recombinant expression, the polypeptide is secreted from cells expressing the polypeptide into the culture medium for ease of purification. A signal sequence is said to be "heterologous" when it is derived from a polypeptide other than the polypeptide that is fused to promote recombinant expression or secretion.
"에피토프 태그"는 항체에 대한 에피토프를 규정하는 아미노산 서열이다. 에피토프 태그는 정제 또는 검출을 촉진시키기 위해 폴리펩티드 또는 펩티드내로 공학처리될 수 있다.An "epitope tag" is an amino acid sequence that defines an epitope for an antibody. Epitope tags can be engineered into polypeptides or peptides to facilitate purification or detection.
본원에 사용된 "융합 폴리펩티드"는 융합되는 폴리펩티드에 정상적으로는 존재하지 않는 아미노산 서열에 결합된 하나의 폴리펩티드로부터의 아미노산 서열을 포함하는 폴리펩티드이다. 융합되는 폴리펩티드에 정상적으로는 존재하지 않는 아미노산 서열은 상이한 폴리펩티드로부터의 아미노산 서열 (예를 들어, 이종성 시그널 서열) 또는 인공 서열일 수 있다. 일반적으로, 융합 폴리펩티드는 다른 폴리펩티드 서열을 엔코딩하는 누클레오티드 서열에 프레임으로 융합된 하나의 폴리펩티드 서열을 엔코딩하는 누클레오티드 서열을 함유하는 구성물을 사용하는 재조합 기술을 이용하여 발현된다.As used herein, a "fusion polypeptide" is a polypeptide comprising an amino acid sequence from one polypeptide that is linked to an amino acid sequence that is not normally present in the polypeptide to be fused. Amino acid sequences that are not normally present in the polypeptide to be fused may be amino acid sequences (eg, heterologous signal sequences) or artificial sequences from different polypeptides. In general, fusion polypeptides are expressed using recombinant techniques using constructs that contain a nucleotide sequence that encodes one polypeptide sequence fused in frame to a nucleotide sequence that encodes another polypeptide sequence.
폴리펩티드 또는 폴리펩티드 단편과 관련하여 사용되는 "유도체"란 용어는 아미노산 서열 변형체 뿐만 아니라 하나 이상의 화학적 그룹의 첨가, 결실 또는 치환에 의해 야생형 아미노산 서열과 상이해진 임의의 다른 분자를 포함한다. "유도체"는 야생형 폴리펩티드 또는 폴리펩티드 단편의 하나 이상의 생물학적 또는 면역학적 특성, 예를 들어 수용체에 특이적으로 결합하는 생물학적 특성 및 항체에 특이적으로 결합하는 면역학적 특성을 보유한다.The term "derivative" as used in the context of a polypeptide or polypeptide fragment includes amino acid sequence variants as well as any other molecule that differs from the wild type amino acid sequence by addition, deletion or substitution of one or more chemical groups. A "derivative" retains one or more biological or immunological properties of a wild type polypeptide or polypeptide fragment, eg, a biological property that specifically binds to a receptor and an immunological property that specifically binds to an antibody.
"세포 특이적 결합 부분"은 하나 또는 다수의 특정 세포 유형에서 발견되는 결합 파트너에 특이적으로 결합하는 부분을 의미한다. 세포 특이적 결합 부분은, 예를 들어 요망되는 세포 유형으로 폴리펩티드 또는 폴리누클레오티드를 유도시키기 위해, 폴리펩티드 또는 폴리누클레오티드에 결합될 수 있다."Cell-specific binding moiety" refers to a moiety that specifically binds to a binding partner found in one or many specific cell types. The cell specific binding moiety can be linked to the polypeptide or polynucleotide, for example, to drive the polypeptide or polynucleotide to the desired cell type.
"면역원성 담체 단백질"은 그 자체가 현저한 면역 반응을 유도시키지 않는 항원 (즉, "합텐")에 결합된 폴리펩티드이다. 생성된 컨쥬게이트는 합텐에 대한 면역 반응을 일으킬 수 있다.An "immunogenic carrier protein" is a polypeptide that binds to an antigen (ie "hapten") that does not itself elicit a significant immune response. The resulting conjugate can elicit an immune response against hapten.
"특이적 결합"이란 용어는 특정 부위에서 결합 파트너가 또 다른 파트너 (예를 들어, 두 개의 폴리펩티드, 폴리펩티드와 핵산 분자, 또는 두 개의 핵산 분자)에 우선적으로 결합하는 것으로 정의된다. "특이적으로 결합하는"이라는 용어는 표적 분자/서열에 대한 결합 우선성 (예를 들어, 친화성)이 비특이적 표적 분자 (예를 들어, 특이적으로 인식되는 부위(들)을 결여하는 무작위 생성된 분자)의 2배 이상, 더욱 바람직하게는 5배 이상, 가장 바람직하게는 10배 또는 20배 이상임을 나타낸다.The term “specific binding” is defined as the binding partner preferentially binding to another partner (eg, two polypeptides, a polypeptide and a nucleic acid molecule, or two nucleic acid molecules) at a particular site. The term “specifically binding” refers to random generation where binding priority (eg, affinity) for a target molecule / sequence lacks a nonspecific target molecule (eg, a specifically recognized site (s). More than 5 times, more preferably 5 times, most preferably 10 times or 20 times.
본원에 사용된 "항체"란 용어는 면역글로불린 유전자 또는 면역글로불린 유전자의 단편에 의해 실질적으로 엔코딩된 하나 이상의 폴리펩티드로 구성된 단백질을 의미한다. 인식된 면역글로불린 유전자는 카파, 람다, 알파, 감마, 델타, 엡실론 및 뮤 불변 영역 유전자 뿐만 아니라 다양한 면역글로불린 가변 영역 유전자를 포함한다. 경쇄는 카파 또는 람다로서 분류된다. 중쇄는 감마, 뮤, 알파, 델타 또는 엡실론으로서 분류되며, 이는 차례로 IgG, IgM, IgA, IgD 및 IgE를 각각 형성한다.As used herein, the term "antibody" refers to a protein consisting of one or more polypeptides substantially encoded by an immunoglobulin gene or fragment of an immunoglobulin gene. Recognized immunoglobulin genes include kappa, lambda, alpha, gamma, delta, epsilon and mu constant region genes as well as various immunoglobulin variable region genes. Light chains are classified as kappa or lambda. Heavy chains are classified as gamma, mu, alpha, delta or epsilon, which in turn form IgG, IgM, IgA, IgD and IgE, respectively.
전형적인 면역글로불린 (항체) 구조 단위는 사합체를 포함하는 것으로 공지되어 있다. 각각의 사합체는 각각 하나의 "가벼운" (약 25kD) 사슬 및 하나의 "무거운" 사슬 (약 50 내지 70kD)을 지니는 두 개의 동일한 쌍의 폴리펩티드 사슬로 구성된다. 각각의 사슬의 N-말단은 항원 인식을 주로 담당하는 100개 내지 110개 이상의 아미노산의 가변 영역을 규정한다. "가변 경쇄 (VL)" 및 "가변 중쇄 (VH)"는 각각 이러한 경쇄 및 중쇄를 의미한다.Typical immunoglobulin (antibody) structural units are known to include tetramers. Each tetramer consists of two identical pairs of polypeptide chains, each with one "light" (about 25 kD) chain and one "heavy" chain (about 50-70 kD). The N-terminus of each chain defines the variable region of 100 to 110 or more amino acids primarily responsible for antigen recognition. "Variable light chain (VL)" and "variable heavy chain (VH)" refer to such light and heavy chains, respectively.
항체는 온전한 면역글로불린 또는 다양한 펩티다아제에 의한 분해에 의해 생성된 다수의 잘 특성화된 단편으로 존재한다. 따라서, 예를 들어 펩신은 힌지 영역내의 이황화물 결합 아래에서 항체를 분해하여, Fab의 이합체인 F(ab)'2을 생성시키며, 이는 그 자체로 이황화물 결합에 의해 VH-CH1에 결합되는 경쇄이다. F(ab)'2는 온화한 조건하에서 힌지 영역내의 이황화물 결합을 파괴하도록 환원됨으로써 (Fab')2 이합체를 Fab' 단량체로 전환시킬 수 있다. Fab' 단량체는 본질적으로 힌지 영역의 일부를 지닌 Fab 이다 (다른 항체 단편의 보다 상세한 설명은 하기 문헌을 참조하라: Fundamental Immunology, W.E. Paul, ed., Raven Press, N.Y. (1993)). 다양한 항체 단편이 온전한 항체의 분해와 관련하여 규정되어 있지만, 당업자는 이러한 Fab' 단편이 화학적으로 또는 재조합 DNA 방법을 이용함으로써 새로 합성될 수 있음을 인식할 것이다. 따라서, 본원에 사용된 항체란 용어는 또한 전체 항체의 변형에 의해 생성되거나 재조합 DNA 방법을 사용하여 새로 합성된 항체 단편을 포함한다. 바람직한 항체는, 가변 중쇄 및 가변 경쇄가 (직접 또는 펩티드 링커를 통해) 함께 결합되어 연속 폴리펩티드를 형성하는 단일 사슬 항체 (단일 폴리펩티드 사슬로 존재하는 항체), 더욱 바람직하게는 단일 사슬 Fv 항체 (sFv 또는 scFv)를 포함한다. 단일 사슬 Fv 항체는 공유 결합된 VH-VL 이종이합체이며, 이는 직접 결합되거나 펩티드 엔코딩 링커에 의해 결합되는 VH- 및 VL-엔코딩 서열을 포함하는 핵산으로부터 발현될 수 있다 (Huston, et al. (1988) Proc. Nat. Acad. Sci. USA, 85: 5879-5883). VH 및 VL은 연결되어 단일 폴리펩티드 사슬을 형성하지만, VH 및 VL 도메인은 비공유 결합된다. scFv 항체 및 천연적으로 응집되지만 화학적으로 분리된 항체 V 영역으로부터의 경쇄 및 중쇄 폴리펩티드를 항원 결합 부위의 구조와 실질적으로 유사한 3차원 구조로 폴딩되는 분자로 전환시키는 다수의 다른 구조가 당업자에게 공지되어 있다 (참조: 미국 특허 제 5,091,513호, 5,132,405호 및 제 4,956,778호).Antibodies exist in a number of well characterized fragments produced by degradation by intact immunoglobulins or various peptidase. Thus, for example, pepsin cleaves antibodies under disulfide bonds in the hinge region, producing F (ab) '2, a dimer of Fab, which is itself a light chain that is bound to VH-CH1 by disulfide bonds. to be. F (ab) '2 can convert (Fab') 2 dimers to Fab 'monomers by reducing them to disrupt disulfide bonds in the hinge region under mild conditions. Fab 'monomers are essentially Fabs with a portion of the hinge region (for more detailed descriptions of other antibody fragments, see Fundamental Immunology, W.E. Paul, ed., Raven Press, N.Y. (1993)). Although various antibody fragments are defined in terms of the degradation of intact antibodies, one of ordinary skill in the art will recognize that such Fab ′ fragments can be newly synthesized chemically or by using recombinant DNA methods. Thus, the term antibody as used herein also encompasses antibody fragments produced by modification of the entire antibody or newly synthesized using recombinant DNA methods. Preferred antibodies are single chain antibodies (antibodies present as a single polypeptide chain) in which the variable heavy and variable light chains are joined together (directly or via a peptide linker) to form a continuous polypeptide, more preferably a single chain Fv antibody (sFv or scFv). Single chain Fv antibodies are covalently linked VH-VL heterodimers, which can be expressed from nucleic acids comprising VH- and VL-encoding sequences that are directly bound or bound by peptide encoding linkers (Huston, et al. (1988). Proc. Nat. Acad. Sci. USA, 85: 5879-5883). VH and VL are joined to form a single polypeptide chain, while VH and VL domains are non-covalently linked. Numerous other structures are known to those of skill in the art to convert light and heavy chain polypeptides from scFv antibodies and naturally aggregated but chemically isolated antibody V regions into molecules that are folded into a three-dimensional structure substantially similar to that of the antigen binding site. (See US Pat. Nos. 5,091,513, 5,132,405 and 4,956,778).
"유효량" 및 "충분한 양"이란 표현은 의도된 생물학적 활성을 생성시키는 생물학적 활성 물질의 양을 의미한다.The expression “effective amount” and “sufficient amount” means the amount of biologically active substance that produces the intended biological activity.
"약제학적으로 허용되는"이란 용어는 작용제의 약제학적 사용을 가능하게 하기에 세포 및/또는 피검체에 충분히 무독성인 임의의 작용제를 의미한다.The term "pharmaceutically acceptable" means any agent that is sufficiently nontoxic to cells and / or subjects to enable pharmaceutical use of the agent.
"애쥬번트"란 용어는 항원에 대한 면역 반응을 향상시키는 화합물 또는 혼합물을 의미한다.The term "adjuvant" refers to a compound or mixture that enhances an immune response to an antigen.
HIV의 "1차 단리물"은 세포 배양물의 계대없이 HIV로 감염된 개체로부터 수득된 HIV 단리물이다.A "primary isolate" of HIV is an HIV isolate obtained from an individual infected with HIV without passage of cell culture.
"폴리누클레오티드"란 용어는 데옥시리보누클레오티드 또는 리보누클레오티드 중합체를 의미하며, 달리 한정되지 않는 한 천연 누클레오티드와 유사한 방식으로 기능할 수 있는 천연 누클레오티드의 공지된 유사체를 포함한다. "폴리누클레오티드"란 용어는 예를 들어 게놈 DNA; 통상적으로 메신저 RNA (mRNA)의 역전사 또는 증폭에 의해 수득되는 mRNA의 DNA 표현인 상보적 DNA (cDNA); 합성적으로 생성되거나 증폭에 의해 생성된 DNA 분자; 및 mRNA를 포함하는 임의의 형태의 DNA 또는 RNA를 의미한다. "폴리누클레오티드"란 용어는 이중 가닥 핵산 분자 뿐만 아니라 단일 가닥 분자를 포함한다. 이중 가닥 폴리누클레오티드에서, 폴리누클레오티드 가닥은 동연적(coextensive)일 필요가 없다 (즉,이중 가닥 폴리누클레오티드는 두 가닥 모두의 전체 길이를 따라 이중 가닥일 필요가 없다).The term "polynucleotide" refers to a deoxyribonucleotide or ribonucleotide polymer and includes known analogs of natural nucleotides that can function in a manner similar to natural nucleotides, unless otherwise defined. The term "polynucleotide" includes, for example, genomic DNA; Complementary DNA (cDNA), which is a DNA representation of mRNA, usually obtained by reverse transcription or amplification of messenger RNA (mRNA); DNA molecules produced synthetically or by amplification; And any form of DNA or RNA, including mRNA. The term "polynucleotide" includes single stranded molecules as well as double stranded nucleic acid molecules. In double stranded polynucleotides, the polynucleotide strands do not need to be coextensive (ie, the double stranded polynucleotides need not be double stranded along the entire length of both strands).
본원에 사용된 "벡터"란 용어는 폴리누클레오티드를 함유하는 DNA 구성물을 의미한다. 바람직한 벡터는 숙주 세포에서 안정적으로 또는 일시적으로 증식될 수 있다. 예를 들어, 벡터는 플라스미드, 코스미드, 세균 인공 염색체 (BAC), 효모 인공 염색체 (YAC), 또는 바이러스 벡터일 수 있다. 적합한 숙주내로 도입된 경우, 벡터는 숙주 게놈과 무관하게 복제하고 기능할 수 있거나, 일부 경우에는 숙주 게놈내로 삽입될 수 있다.As used herein, the term "vector" refers to a DNA construct containing a polynucleotide. Preferred vectors can be propagated stably or temporarily in host cells. For example, the vector can be a plasmid, cosmid, bacterial artificial chromosome (BAC), yeast artificial chromosome (YAC), or viral vector. When introduced into a suitable host, the vector can replicate and function independently of the host genome, or in some cases can be inserted into the host genome.
본원에 사용된 용어 "작동적으로 결합된"은 조절 서열 (전형적으로 프로모터)과 결합된 서열간의 작동적 결합을 의미한다. 예를 들어, 프로모터가 결합된 서열의 전사를 개시시킬 수 있는 경우, 프로모터는 서열에 작동적으로 결합된 것이다.As used herein, the term "operably linked" means operatively binding between a regulatory sequence (typically a promoter) and a linked sequence. For example, if a promoter can initiate transcription of a bound sequence, the promoter is operably linked to the sequence.
"발현 벡터"는 적합한 숙주에서 폴리누클레오티드의 발현을 수행할 수 있는 조절 서열에 작동적으로 결합된 폴리누클레오티드를 함유하는 DNA 작제물을 의미한다. 예시적 조절 서열은 전사를 수행하기 위한 프로모터, 전사를 조절하기 위한 임의의 오퍼레이터 서열, 적합한 mRNA 리보좀 결합 부위를 엔코딩하는 서열, 및 전사 및 번역의 종결을 조절하는 서열을 포함한다.By "expression vector" is meant a DNA construct containing a polynucleotide operably linked to a regulatory sequence capable of carrying out the expression of the polynucleotide in a suitable host. Exemplary regulatory sequences include promoters for performing transcription, any operator sequence for regulating transcription, sequences encoding suitable mRNA ribosomal binding sites, and sequences regulating termination of transcription and translation.
용어 "숙주 세포"는 벡터를 일시적으로 또는 안정적으로 유지할 수 있는 세포를 의미한다. 본 발명의 숙주 세포는 세균 세포, 효모 세포, 곤충 세포, 식물 세포 및 포유동물 세포를 포함하나 이에 제한되지 않는다. 당해분야에 공지된 기타 숙주 또한 본 발명에 사용하기에 적합하다.The term "host cell" refers to a cell capable of temporarily or stably maintaining a vector. Host cells of the invention include, but are not limited to, bacterial cells, yeast cells, insect cells, plant cells, and mammalian cells. Other hosts known in the art are also suitable for use in the present invention.
숙주 세포는 세포 배양액중에 존재할 수 있거나, 대안적으로 "생체내" 존재할 수 있다. 숙주 세포가 살아있는 유기체에 존재하는 경우 "생체내"라 한다.Host cells may be present in cell culture or, alternatively, may be “in vivo”. When a host cell is present in a living organism, it is called "in vivo."
본원에 사용된 바와 같은 용어 "상보적"은 2개의 누클레오티드가 정확하게 쌍을 이루는 능력을 의미한다. 즉, 핵산 분자의 제공된 위치의 누클레오티드가 또 다른 핵산 분자의 누클레오티드와 수소 결합할 수 있으면, 이러한 2개의 핵산 분자는 상기 위치에서 서로에 대해 상보적인 것으로 간주된다. 용어 "실질적으로 상보적"은 다른 서열에 충분히 상보적이어서 엄격한 하이브리드화 조건하에서 특이적으로 하이브리드화할 수 있는 서열을 나타낸다.The term “complementary,” as used herein, refers to the ability of two nucleotides to exactly pair. That is, if a nucleotide at a given position of a nucleic acid molecule can hydrogen bond with a nucleotide of another nucleic acid molecule, these two nucleic acid molecules are considered to be complementary to each other at that position. The term “substantially complementary” refers to a sequence that is sufficiently complementary to other sequences so that it can specifically hybridize under stringent hybridization conditions.
문구 "엄격한 하이브리드화 조건"은 일반적으로 규정된 이온 강도 및 pH에서 특이적 서열에 대한 융점 (Tm) 보다 약 5℃ 낮은 온도를 의미한다. 대부분의 서열의 특이적 하이브리드화를 달성하는데 적합한 예시적인 엄격한 조건은 약 60℃ 이상의 온도 및 pH 7.0에서 약 0.2 몰의 염 농도이다.The phrase “strict hybridization conditions” generally means a temperature about 5 ° C. below the melting point (T m ) for the specific sequence at the defined ionic strength and pH. Exemplary stringent conditions suitable for achieving specific hybridization of most sequences are salt concentrations of about 0.2 molar at temperatures of about 60 ° C. or higher and pH 7.0.
"특이적 하이브리드화"은 규정된 엄격한 조건하에 하이브리드화 혼합물중에 존재하는 기타 누클레오티드 서열로의 실질적인 결합의 부재하에 핵산 분자가 표적 누클레오티드 서열에 결합하는 것을 의미한다. 당업자는 하이브리드화 조건의 엄격성을 완화시키면 서열 미스매치가 용인될 수 있음을 인지하고 있다."Specific hybridization" means that a nucleic acid molecule binds to a target nucleotide sequence in the absence of substantial binding to other nucleotide sequences present in the hybridization mixture under defined stringent conditions. Those skilled in the art recognize that mitigating the stringency of hybridization conditions can tolerate sequence mismatches.
비정상적 이황화물 구조를 갖는 단리된 gp120 폴리펩티드Isolated gp120 polypeptide with abnormal disulfide structure
본 발명은 제 1 gp120 아미노산 서열을 포함하는 단리된 폴리펩티드를 제공한다. 단리된 폴리펩티드는 하나 이상의 추가적인 서열을 포함할 수 있으며, 따라서 단리된 폴리펩티드는 전장 gp160 서열 또는 gp160의 gp120 서열 함유 단편을 포함할 수 있다. 제 1 gp120 서열은 전장 gp120 서열 또는 이의 단편일 수 있다. 본 발명에 유용한 gp120 단편은 적어도 gp120의 V2, V3 및 C4 도메인을 포함한다. 바람직한 구체예에서, gp120 단편은 또한 V1 도메인을 포함한다. gp120 서열은 대부분의 gp120 서열에 존재하는 18개 시스테인 잔기 중 하나 이상이 결여되어 있거나, 하나 이상의 추가적인 시스테인을 포함한다.The present invention provides an isolated polypeptide comprising a first gp120 amino acid sequence. An isolated polypeptide may comprise one or more additional sequences and thus the isolated polypeptide may comprise a full length gp160 sequence or a gp120 sequence containing fragment of gp160. The first gp120 sequence may be the full length gp120 sequence or fragment thereof. Gp120 fragments useful in the present invention include at least the V2, V3 and C4 domains of gp120. In a preferred embodiment, the gp120 fragment also comprises a V1 domain. The gp120 sequence lacks one or more of the 18 cysteine residues present in most gp120 sequences, or includes one or more additional cysteines.
본 발명의 gp120 폴리펩티드는 본 발명의 면역원성 조성물의 성분으로 사용되어 gp120-특이적 면역 반응을 유도해낼 수 있다. 바람직한 구체예에서, gp120 폴리펩티드는 백신 항원으로써 사용된다. 폴리펩티드는 다양한 형태의 백신 예를 들어, 유리된 형태, 세포 특이적 부분에 공유적으로 결합된 형태 또는 면역원성 담체 단백질 형태로 백신중에 포함될 수 있거나, 바이러스 입자의 표면상에 디스플레이될 수 있다 (바이러스 유래된 백신). 또한, 약물을 수송하는 gp120 폴리펩티드는 약물을 CD4- 또는 케모킨 수용체-함유 세포로 표적화시키는데 사용될 수 있다.The gp120 polypeptide of the invention can be used as a component of an immunogenic composition of the invention to elicit a gp120-specific immune response. In a preferred embodiment, the gp120 polypeptide is used as a vaccine antigen. Polypeptides can be included in the vaccine in various forms, such as free forms, covalently bound to cell specific moieties, or immunogenic carrier proteins, or displayed on the surface of viral particles (viruses Derived vaccine). In addition, gp120 polypeptides that transport drugs can be used to target the drug to CD4- or chemokine receptor-containing cells.
본 발명의 gp120 폴리펩티드는 샘플이 비정상적 이황화물 구조를 갖는 제공된 gp120 폴리펩티드에 대해 특이적인 항체를 함유하는지의 여부를 측정하기 위한 진단 방법에 유용하다.The gp120 polypeptides of the invention are useful in diagnostic methods for determining whether a sample contains antibodies specific for a given gp120 polypeptide having an abnormal disulfide structure.
또한, 본 발명의 적합한 폴리펩티드는 gp120 면역검정에서 표준으로서 사용될 수 있다. 더욱 상세하게는, 상기 면역검정에 사용된 항-HIV-1 항체와 교차반응하는 본 발명의 폴리펩티드는 양성 대조군으로서 사용될 수 있으며, 이는 샘플을 gp120의 존재에 대해 검정할 경우 관찰된 결과와 비교될 수 있다.In addition, suitable polypeptides of the invention can be used as standards in the gp120 immunoassay. More specifically, polypeptides of the invention that cross react with anti-HIV-1 antibodies used in the immunoassay can be used as positive controls, which are compared with the results observed when the sample is assayed for the presence of gp120. Can be.
A. gp120 폴리펩티드의 유형A. Types of gp120 Polypeptides
제 1 구체예에서, 제 1 gp120 서열은 HIV gp120의 HXB-2 균주로부터의 gp120의 N-말단 메티오닌으로부터 넘버링하여 54, 74, 119, 126, 131, 157, 196, 205, 218, 228, 239, 247, 296, 331, 378, 385, 418 및 445번 중 하나 이상의 위치에서 하나 이상의 시스테인 잔기가 결여되어 있다. 본 구체예의 바람직한 변형에서, 제 1 gp120 서열은 54, 74, 119, 126, 157, 205, 218, 228, 239, 247, 331, 378 및 385번 중 하나 이상의 위치에서 하나 이상의 시스테인 잔기가 결여되어 있다.In a first embodiment, the first gp120 sequence is numbered from the N-terminal methionine of gp120 from the HXB-2 strain of HIV gp120 to 54, 74, 119, 126, 131, 157, 196, 205, 218, 228, 239 At least one of the
제 2 구체예에서, 제 1 gp120 서열은 HXB-2 gp120에 기초하여 넘버링하여 24, 29, 34, 54, 74, 119, 126, 131, 157, 196, 205, 218, 228, 239, 247, 296, 331, 378, 385, 418, 445, 493, 495, 499-501, 503-508 및 510번 이외의 위치에서 하나 이상의 추가의 시스테인 잔기를 포함한다. 그러나, 본 발명의 제 1 gp120 서열은 V1 도메인에서 하나 이상의 추가적인 시스테인을 갖는 서브타입 G gp120 서열이 아니며, 또한 V4 도메인에서 하나 이상의 추가적인 시스테인을 갖는 서브타입 E gp120 서열이 아니다. 본 구체예의 바람직한 변형에서, 제 1 gp120 서열은 54, 74, 119, 126, 131, 157, 196, 205, 218, 228, 239, 247, 297, 331, 378, 385, 418 및 445번 이외의 위치에서 하나 이상의 추가의 시스테인 잔기를 포함하며, 단, 하나 이상의 추가적인 시스테인 잔기는 gp120의 V1 도메인에 존재하지 않는다.In a second embodiment, the first gp120 sequence is numbered based on HXB-2 gp120 to be 24, 29, 34, 54, 74, 119, 126, 131, 157, 196, 205, 218, 228, 239, 247, One or more additional cysteine residues at positions other than 296, 331, 378, 385, 418, 445, 493, 495, 499-501, 503-508 and 510. However, the first gp120 sequence of the present invention is not a subtype G gp120 sequence with one or more additional cysteines in the V1 domain, nor is it a subtype E gp120 sequence with one or more additional cysteines in the V4 domain. In a preferred variation of this embodiment, the first gp120 sequence is other than 54, 74, 119, 126, 131, 157, 196, 205, 218, 228, 239, 247, 297, 331, 378, 385, 418 and 445 One or more additional cysteine residues in position, provided that one or more additional cysteine residues are not present in the V1 domain of gp120.
본 발명은 또한 이러한 두 구체예가 조합된 것 즉, 제 1 gp120 서열이 상기 제 1 구체예에서 기재된 위치에서 하나 이상의 시스테인이 결여되어 있으며, 상기 제 2 구체예에서 기재된 위치에서 하나 이상의 추가적인 시스테인을 포함하는 것을 포함한다.The present invention also combines these two embodiments, i.e., the first gp120 sequence lacks one or more cysteines at the positions described in the first embodiment, and includes one or more additional cysteines at the positions described in the second embodiment. It involves doing.
바람직한 구체예에서, 제 1 gp120 서열은 SEQ ID NO: 2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 32, 34, 36, 38, 40, 42, 44, 46, 48, 50, 52, 54, 56, 58, 60, 62, 64, 66, 68, 70, 72, 74, 76, 78, 80, 82, 84, 86, 88, 90, 92, 94, 96, 98, 100, 102, 104, 106, 108, 110, 112, 114, 116, 118, 120, 122, 124, 126, 128, 130, 132, 134 및 136으로 구성된 군으로부터 선택된 gp120 서열의 V2, V3 및 C4 도메인 각각에 대해 약 80%, 약 85%, 약 90%, 약 95% 또는 약 99% 이상의 동일성을 갖는다. 이들 gp120 서열은 북미, 푸에르토리코 및 네덜란드에 걸쳐 61개 임상지에서 수행된 HIV 백신의 대규모 임상 실험으로 수득되었다. 하기 표 1은 실험으로부터 SEQ ID NO과 gp120 샘플 설계간의 상응물을 나타낸다. HIV-1의 HXB-2 균주로부터의 gp120에 대한 아미노산 서열은 SEQ ID NO:138이다.In a preferred embodiment, the first gp120 sequence comprises SEQ ID NOs: 2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 32, 34, 36, 38, 40, 42, 44, 46, 48, 50, 52, 54, 56, 58, 60, 62, 64, 66, 68, 70, 72, 74, 76, 78, 80, 82, 84, 86, 88, 90, 92, 94, 96, 98, 100, 102, 104, 106, 108, 110, 112, 114, 116, 118, 120, 122, 124, 126, 128, 130, 132, 134 and 136 Have at least about 80%, about 85%, about 90%, about 95%, or about 99% identity to each of the V2, V3, and C4 domains of the gp120 sequence selected from the group consisting of: These gp120 sequences were obtained from large scale clinical trials of HIV vaccines carried out in 61 clinical sites across North America, Puerto Rico and the Netherlands. Table 1 below shows the corresponding between SEQ ID NO and gp120 sample design from the experiment. The amino acid sequence for gp120 from the HXB-2 strain of HIV-1 is SEQ ID NO: 138.
더욱 바람직한 구체예에서, 제 1 gp120 서열은 이들 서열중 하나로부터 선택된 gp120 서열의 V1, V2, V3 및 C4 도메인 각각에 약 80%, 약 85%, 약 90%, 약 95% 또는 약 99% 이상의 동일성을 갖는다. 이러한 요건은 이들 도메인중 각각에 대한 동일성 퍼센트가 약 80%, 약 85%, 약 90%, 약 95% 또는 약 99% 이상일 경우 충족된다. HXB-2의 N-말단 메티오닌으로부터 넘버링하여 이들 도메인의 종결점은 다음과 같다: V1:잔기 131 내지 157; V2:잔기 157 내지 196; V3:잔기 296 내지 331; C4:잔기 418 내지 445. 이러한 구체예의 특정 예에서, 제 1 gp120 서열은 SEQ ID NO: 2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 32, 34, 36, 38, 40, 42, 44, 46, 48, 50, 52, 54, 56, 58, 60, 62, 64, 66, 68, 70, 72, 74, 76, 78, 80, 82, 84, 86, 88, 90, 92, 94, 96, 98, 100, 102, 104, 106, 108, 110, 112, 114, 116, 118, 120, 122, 124, 126, 128, 130, 132, 134 및 136으로 구성된 군으로부터 선택된 gp120 서열의 적어도 V1, V2, V3 및 C4 도메인을 포함한다.In a more preferred embodiment, the first gp120 sequence is at least about 80%, about 85%, about 90%, about 95% or about 99% to each of the V1, V2, V3 and C4 domains of the gp120 sequence selected from one of these sequences Have identity. This requirement is met if the percent identity for each of these domains is at least about 80%, about 85%, about 90%, about 95% or about 99%. The endpoints of these domains, numbered from the N-terminal methionine of HXB-2, are as follows: V1: residues 131-157; V2:
표 1. HIV-1 gp120 DNA 및 단백질 서열에 대한 서열 ID 번호Table 1. Sequence ID numbers for HIV-1 gp120 DNA and protein sequences
gp120 아미노산 서열은 천연 (즉, 야생형) 아미노산 서열 또는 야생형 폴리펩티드의 상응하는 영역의 아미노산 서열 변이체일 수 있다. 한 구체예에서, gp120 아미노산 서열은 HIV-1의 1차 단리물로부터 유래된다. 본 발명의 바람직한 폴리펩티드는 일반적으로 야생형 gp120 아미노산 서열 또는 상기 정의된 바와 같은 보존적 아미노산 치환기를 함유하는 gp120 아미노산 서열을 포함한다.The gp120 amino acid sequence may be a native (ie wild type) amino acid sequence or an amino acid sequence variant of the corresponding region of a wild type polypeptide. In one embodiment, the gp120 amino acid sequence is derived from the primary isolate of HIV-1. Preferred polypeptides of the invention generally comprise a wild type gp120 amino acid sequence or a gp120 amino acid sequence containing conservative amino acid substituents as defined above.
본 발명의 폴리펩티드는 gp120 아미노산 서열 즉, 이종성 단백질로부터의 아미노산 서열 이외에 기타 아미노산 서열을 포함할 수 있다. 따라서, 본 발명은 gp120 아미노산 서열이 하나 이상의 이종성 단백질로부터 아미노산 서열의 한쪽 또는 양쪽 말단에 융합된 융합 폴리펩티드를 포함한다. 대상 단백질로 종종 혼입된 추가적인 아미노산 서열의 예로는 단백질의 분비를 촉진하는 시그널 서열, 및 면역학적 검출 또는 친화도 정제에 사용될 수 있는 에피토프 태그를 포함한다. 본 발명에 사용된 바람직한 시그널 서열은 단순포진 바이러스 당단백질 D (HSV gD-1) 시그널 서열 및 인간 조직 플라스미노겐 활성인자 시그널 서열을 포함하나 이에 제한되지 않는다. 예시적 에피토프 태그는 그린 형광 단백질, 헤마글루티닌, 또는 FLAG 에피토프 태그 및 헥사히스티딘 또는 유사한 금속 친화도 태그를 포함한다. N-말단 HSV gD-1 서열은 HSV gD-1 시그널 서열이 분비를 촉진하는데 사용될 경우 에피토프 태그로서 통상적으로 사용된다.Polypeptides of the invention may comprise other amino acid sequences in addition to the gp120 amino acid sequence, ie, the amino acid sequence from the heterologous protein. Thus, the invention includes fusion polypeptides in which the gp120 amino acid sequence is fused from one or more heterologous proteins to one or both ends of the amino acid sequence. Examples of additional amino acid sequences that are often incorporated into the protein of interest include signal sequences that promote secretion of the protein, and epitope tags that can be used for immunological detection or affinity purification. Preferred signal sequences used in the present invention include, but are not limited to, herpes simplex virus glycoprotein D (HSV gD-1) signal sequence and human tissue plasminogen activator signal sequence. Exemplary epitope tags include green fluorescent protein, hemagglutinin, or FLAG epitope tags and hexahistidine or similar metal affinity tags. N-terminal HSV gD-1 sequences are commonly used as epitope tags when the HSV gD-1 signal sequence is used to promote secretion.
본 발명의 폴리펩티드는 gp120-특이적 면역 반응을 유도할 수 있는 능력을 보유하는 유도체를 생성시키도록 변형될 수 있다. 예를 들어, 당해분야의 당업자에게는 다양한 기법이 상응하는 펩티드 화합물과 동일하거나 상이한 목적하는 생물학적 활성을 가지며, 예를 들어, 용해도, 안정도 및 가수분해 및 단백질분해에 대한 감수성에 있어서 펩티드보다 더욱 유리한 활성을 갖는 소위 "펩티드 미메틱(mimetic)"을 작제하는데 유용하다는 것이 자명하다. 예를 들어, 문헌[Morgan, et al., Ann. Rep. Med. Chem., 24:243-252 (1989)] 참조. 따라서, 본 발명의 폴리펩티드는 예를 들어, N-말단 아미노기, C-말단 카르복실기에서 변형되고/거나 펩티드의 하나 이상의 아미도 결합은 비아미도 결합으로 전환된 펩티드 미메틱를 포함한다.Polypeptides of the invention can be modified to produce derivatives that retain the ability to induce a gp120-specific immune response. For example, to those skilled in the art, various techniques have the same or different desired biological activities as the corresponding peptide compounds, and are more advantageous than peptides, for example in solubility, stability and sensitivity to hydrolysis and proteolysis. It is obvious that it is useful for constructing a so-called "peptide mimetic" having. See, eg, Morgan, et al., Ann. Rep. Med. Chem., 24: 243-252 (1989). Thus, polypeptides of the invention include peptide mimetics, for example, modified at N-terminal amino groups, C-terminal carboxyl groups and / or at least one amido bond of the peptide is converted to nonamido bonds.
특정 구체예에서, 본 발명의 폴리펩티드는 홀수의 시스테인 잔기를 갖는 제 1 gp120 서열을 포함한다. 이러한 서열은 하나 이상의 "유리된(free)" 시스테인 (즉, 분자간 이황화물 결합을 형성하지 않는 시스테인)을 갖는다. 유리된 시스테인은 기타 폴리펩티드에 공유적으로 결합할 수 있다. 예를 들어, 유리된 시스테인은 기타 폴리펩티드에서 유리된 시스테인을 갖는 이황화물 결합을 형성할 수 있다. 유리된 시스테인은 용이하게는 예를 들어, 부위특이적 돌연변이법을 이용하여 임의의 대상 폴리펩티드로 삽입될 수 있다. 기타 폴리펩티드는 제 1 gp120 서열을 포함할 수 있으며, 이는 제 1 gp120 서열과 동일하거나 상이할 수 있다. 두개의 동일한 gp120 서열간의 이황화물 결합은 동종이합체를 생성시킨다. 두개의 상이한 gp120 서열간의 이황화결합은 이종이합체를 생성시킨다.In certain embodiments, polypeptides of the invention comprise a first gp120 sequence having an odd cysteine residue. Such sequences have one or more "free" cysteines (ie, cysteines that do not form intermolecular disulfide bonds). Free cysteine can covalently bind other polypeptides. For example, free cysteine can form disulfide bonds with free cysteine in other polypeptides. The free cysteine can easily be inserted into any subject polypeptide, for example using site specific mutagenesis. The other polypeptide may comprise a first gp120 sequence, which may be the same as or different from the first gp120 sequence. Disulfide bonds between two identical gp120 sequences result in homodimers. Disulfide bonds between two different gp120 sequences produce heterodimers.
본 구체예의 변형에서, 제 1 gp120 서열에서 유리된 시스테인은 gp41 아미노산 서열을 포함하는 다른 폴리펩티드에 공유적으로 결합한다. 바람직하게는, 제 1 gp120 서열은 gp41 서열에서 유리된 시스테인에 이황화물 결합하여 gp120/gp41 복합체를 생성시킨다. 이러한 복합체는 항원으로서 유용한데, 그 이유는 이들이 HIV-1 바이러스 입자 또는 바이러스 감염된 세포의 표면상의 바이러스 스파이크에서 발견된 gp120/gp41 복합체를 모방할 수 있기 때문이다.In a variation of this embodiment, the cysteine liberated in the first gp120 sequence binds covalently to another polypeptide comprising the gp41 amino acid sequence. Preferably, the first gp120 sequence disulfides to the cysteine liberated in the gp41 sequence to produce a gp120 / gp41 complex. Such complexes are useful as antigens because they can mimic the gp120 / gp41 complex found in viral spikes on the surface of HIV-1 virus particles or virus infected cells.
기타 구체예에서, gp120 서열에서 유리된 시스테인은 세포 특이적 결합 부분, 약물, 면역자극 올리고누클레오티드 (예를 들어, CpG) 또는 면역원성 담체 단백질에 공유적으로 결합한다. 이러한 방식으로 유래된 gp120을 항원-제시 세포 (예컨대, 수지상 세포 또는 마크로파아지)로의 표적화는 면역 반응 (예를 들어, TH1 또는 TH2 면역 반응)의 효능 및 특질을 조절하는데 유용할 수 있다. 본 발명에 유용한 세포-특이적 결합 부분은 표적 세포 또는 세포들 (예를 들어, 백혈구)에 위치하나, 비표적 세포 (예를 들어, 간 또는 근육 세포)에서는 현저한 양으로 발견되지 않은 특이적 리간드-결합 부위에 결합할 수 있는 부분 (예를 들어, 리간드 또는 이들의 단편)을 포함한다. 일반적으로, 세포 특이적 결합 부분은 막-결합된 세포-표면 단백질, 탄수화물, 지질, 글리코사미노글리칸, 지질단백질, 항원 또는 수용체에 결합한다. 이와 같이, 예시적인 세포-특이적 결합 부분은 예를 들어, 리간드 예컨대, 호르몬 또는 사이토킨, 호르몬 또는 사이토킨의 수용체 결합 도메인, 부착 분자 및 항체를 포함할 수 있다. 본 발명에 사용하기에 적합한 일반적으로 사용된 면역원성 담체는 디프테리아 톡신, 키홀 림펫 헤모시아닌 (KLH), 티로글로불린, 우혈청 알부민 (BSA) 및 테타누스 독소를 포함한다.In other embodiments, the cysteine liberated in the gp120 sequence covalently binds to cell specific binding moieties, drugs, immunostimulatory oligonucleotides (eg, CpG) or immunogenic carrier proteins. Targeting gp120 derived in this manner to antigen-presenting cells (eg dendritic cells or macrophages) may be useful for modulating the efficacy and properties of an immune response (eg, a TH1 or TH2 immune response). Cell-specific binding moieties useful in the present invention are specific ligands located in target cells or cells (eg leukocytes) but not found in significant amounts in non-target cells (eg liver or muscle cells). -Moieties capable of binding to binding sites (eg ligands or fragments thereof). Generally, cell specific binding moieties bind to membrane-bound cell-surface proteins, carbohydrates, lipids, glycosaminoglycans, lipoproteins, antigens or receptors. As such, exemplary cell-specific binding moieties may include, for example, ligands such as hormones or cytokines, receptor binding domains of hormones or cytokines, adhesion molecules and antibodies. Commonly used immunogenic carriers suitable for use in the present invention include diphtheria toxin, keyhole limpet hemocyanin (KLH), tyroglobulin, bovine serum albumin (BSA) and tetanus toxin.
B. gp120 폴리펩티드의 생성B. Generation of gp120 Polypeptides
1. 합성 기법1. Synthesis Technique
본 발명에 따른 폴리펩티드는 당해분야에 공지된 방법 예를 들어, 배타적 고체상 합성법, 부분 고체상 합성법, 단편 응축, 및 고전 용액 합성법을 이용하여 합성될 수 있다. 예를 들어, 문헌 [Merrifield, J.Am.Chem.Soc., 85:2149 (1963)] 참조. 고체상 기법이 바람직하며, 예를 들어, 문헌 [John Morrow Stewart and Janis Dillaha Young, Solid Phase Peptide Syntheses (2nd Ed., Pierce Chemical Company, 1984)]에 기술되어 있다. 고체상 페이스에서, 합성은 전형적으로 알파-아미노 보호된 수지를 사용하여 펩티드의 C-말단으로부터 개시된다. 적합한 출발 물질은 예를 들어, 요구되는 알파-아미노산을 클로로메틸화된 수지, 히드록시메틸 수지 또는 벤즈히드릴아민 수지에 부착시키므로써 준비될 수 있다. 펩티드를 배열시키는 역할을 하는 것으로서 자동화된 펩티드 합성기가 시중에서 구입가능하다.Polypeptides according to the invention can be synthesized using methods known in the art such as exclusive solid phase synthesis, partial solid phase synthesis, fragment condensation, and classical solution synthesis. See, eg, Merrifield, J. Am. Chem. Soc., 85: 2149 (1963). Solid phase techniques are preferred and are described, for example, in John Morrow Stewart and Janis Dillaha Young, Solid Phase Peptide Syntheses (2nd Ed., Pierce Chemical Company, 1984). At the solid phase, the synthesis is typically initiated from the C-terminus of the peptide using alpha-amino protected resins. Suitable starting materials can be prepared, for example, by attaching the required alpha-amino acids to chloromethylated resins, hydroxymethyl resins or benzhydrylamine resins. Automated peptide synthesizers are commercially available as they serve to align peptides.
2. 재조합 기법2. Recombination Technique
본 발명에 따른 폴리펩티드는 또한, 재조합 기법을 이용하여 생성될 수 있다. gp120 폴리누클레오티드는 합성적으로 생성되거나, (PCT, RT-PCT, 롤링 서클 증폭 (RCA) 또는 기타 증폭법)에 의해 바이러스 RNA, 프로바이러스 DNA 또는 클로닝된 gp120 폴리누클레오티드 또는 HIV-1 단리물로부터 증폭되거나, HIV-1 단리물로부터 클로닝된다. 한편으론 제공된 gp120 폴리누클레오티드로, 목적하는 gp120 아미노산 서열을 엔코딩하는 폴리누클레오티드는 다양한 클로닝 및 돌연변이법에 의해 생성될 수 있다. 예를 들어, 문헌 [Sambrook, J., Fritsch, E.F., and Maniatis, T. (1989) in Molecular Cloning: A Laboratory Manual, Second Edition, Cold Spring Harbor, N.Y.] 참조. 널리 사용된 돌연변이법의 예로는 부위특이적 돌연변이법 (Kunkel et al., (1991) Methods Enzymol., 204:125-139; Carter, P., et al., (1986) Nucl. Acids Res. 10:6487), 카세트 돌연변이법 (Wells, J.A., et al., (1985) Gene 34:315), 및 제한 선택 돌연변이법 (Wells, J.A., et al., (1986) Philos. Trans. R.Soc., London Ser.A, 317:415)을 포함한다.Polypeptides according to the invention can also be produced using recombinant techniques. gp120 polynucleotides are synthetically generated or amplified from viral RNA, proviral DNA or cloned gp120 polynucleotides or HIV-1 isolates by (PCT, RT-PCT, rolling circle amplification (RCA) or other amplification methods) Or cloned from an HIV-1 isolate. On the other hand, with the provided gp120 polynucleotides, the polynucleotides encoding the desired gp120 amino acid sequence can be produced by various cloning and mutagenesis. See, eg, Sambrook, J., Fritsch, E.F., and Maniatis, T. (1989) in Molecular Cloning: A Laboratory Manual, Second Edition, Cold Spring Harbor, N.Y. Examples of widely used mutagenesis include site-specific mutagenesis (Kunkel et al., (1991) Methods Enzymol., 204: 125-139; Carter, P., et al., (1986) Nucl. Acids Res. 10 : 6487), cassette mutagenesis (Wells, JA, et al., (1985) Gene 34: 315), and restriction selection mutagenesis (Wells, JA, et al., (1986) Philos. Trans.R.Soc. , London Ser.A, 317: 415.
본 발명의 바람직한 구체예에서, gp120 코딩 영역의 서열은 목적하는 gp120 서열을 엔코딩하는 합성 폴리누클레오티드를 설계하기 위한 가이드로서 사용되며, 상기 합성 폴리누클레오티드는 본 발명의 벡터내로 혼입될 수 있다. 합성 유전자를 작제하기 위한 방법은 당업자에 널리 공지되어 있다. 예를 들어, 문헌 [Dennis, M.S., Carter, P. and Lazarus, R.A. (1993) Proteins; Struct. Funct. Genet., 15:312-321] 참조. 발현 및 정제 방법이 본 발명의 핵산, 벡터 및 숙주 세포와 관련하여 하기에 더욱 상세히 기술되어 있다.In a preferred embodiment of the invention, the sequence of the gp120 coding region is used as a guide for designing synthetic polynucleotides encoding the desired gp120 sequence, which synthetic polynucleotides can be incorporated into the vectors of the invention. Methods for constructing synthetic genes are well known to those skilled in the art. See, eg, Dennis, M.S., Carter, P. and Lazarus, R.A. (1993) Proteins; Struct. Funct. Genet., 15: 312-321. Expression and purification methods are described in more detail below in connection with the nucleic acids, vectors and host cells of the present invention.
3. 복합물 및 컨쥬게이트3. Composites and Conjugates
본 발명의 gp120 폴리펩티드는 폴리펩티드 착물 및 컨쥬게이트를 생성시키는데 일반적으로 사용된 기법에 의해 변형될 수 있다. 하나 이상의 gp120 폴리펩티드를 포함하는 이황화물-결합된 올리고머 복합물은 예를 들어, 온화한 환원제 예컨대, 글루타티온, 디티오트레이톨 (DTT) 또는 β-메르캅토에탄올 및 선택적으로 변성제 예컨대, 우레아 또는 구아니딘 히드로클로라이드를 함유하는 용액중에 복합물을 형성하게 될 폴리펩티드를 혼합하므로써 생성될 수 있다. 환원제 및 선택적 변성제는 투석에 의해 제거될 수 있다. 이황화물 결합은 공기중에서 변성 동안 형성되며, 이는 환원된 설피히드릴 기를 산화시킨다. 분자내 이황화물 결합은 올리고머를 생성시킨다. 용액이 단지 하나의 유형의 gp120 폴리펩티드를 함유하는 경우, 형성된 올리고머는 그러한 종만을 함유할 것이다. 용액이 다른 폴리펩티드 (예를 들어, gp41)과 함께 상이한 gp120 폴리펩티드 또는 gp120 폴리펩티드를 함유하는 경우, 형성된 올리고머는 다양한 종을 함유할 수 있다.The gp120 polypeptides of the invention can be modified by the techniques generally used to generate polypeptide complexes and conjugates. Disulfide-linked oligomeric complexes comprising one or more gp120 polypeptides may, for example, contain mild reducing agents such as glutathione, dithiothritol (DTT) or β-mercaptoethanol and optionally denaturing agents such as urea or guanidine hydrochloride. It can be produced by mixing polypeptides that will form complexes in the solution containing them. Reducing agents and selective denaturants can be removed by dialysis. Disulfide bonds are formed during denaturation in air, which oxidizes the reduced sulfhydryl group. Intramolecular disulfide bonds produce oligomers. If the solution contains only one type of gp120 polypeptide, the oligomers formed will contain only such species. If the solution contains different gp120 polypeptides or gp120 polypeptides along with other polypeptides (eg gp41), the oligomers formed may contain various species.
대안적으로, 본 발명의 하나 이상의 gp120 폴리펩티드를 포함하는 올리고머 복합체는 예를 들어, 표준 이작용기성 가교제를 사용하여 생성될 수 있다. 이러한 제제는 또한 gp120 폴리펩티드와 세포-특이적 결합 부분, 약물, 면역자극 올리고누클레오티드, 또는 면역원성 담체 단백질과의 컨쥬게이트를 생성시키는데 사용될 수 있다.Alternatively, oligomeric complexes comprising one or more gp120 polypeptides of the invention can be produced using, for example, standard difunctional crosslinkers. Such agents can also be used to generate conjugates of gp120 polypeptide with cell-specific binding moieties, drugs, immunostimulatory oligonucleotides, or immunogenic carrier proteins.
일부 적용 예컨대, 하기 기술된 임상법에 있어서, 검출가능한 라벨을 본 발명의 하나의 gp120 폴리펩티드 또는 gp120 폴리펩티드들에 부착시키는 것이 바람직하다. 본 발명에 사용하기에 적합한 검출가능한 라벨은 분광학적, 광화학적, 생화학적, 면역화학적, 전기적, 광학적 또는 화학적 수단에 의해 검출가능한 조성물을 포함한다. 예로는 라벨링된 스트렙타비딘 컨쥬게이트, 마그네틱 비드 (예를 들어, 다이나비드 : DynabeadsTM), 형광성 염료 (예를 들어, 플루오레신, 텍사스 레드, 로다민, 코우마린, 옥사진, 그린 형광 단백질 등 예를 들어, 문헌 [Molecular Probes, Eugene, Oregon, USA]), 라디오라벨 (예를 들어, 3H, 125I, 35S, 14C 또는 32P), 효소 (예를 들어, 홀스 라디쉬 퍼옥시다아제, 알카리 포스파타아제 및 ELISA에 통상적으로 사용되는 기타 효소), 및 비색 라벨 예컨대, 콜로이드 골드 (예를 들어, 40-80mm 직경의 높은 효율의 스캐터 녹색광을 갖는 골드 입자) 또는 착색된 유리 또는 플라스틱 (예를 들어, 폴리스티렌, 폴리프로필렌, 라텍스 등) 비드로 염색하기 위한 바이오틴을 포함한다. 이러한 라벨 사용법에 대해 교시하고 있는 특허는 U.S. 특허 3,817,837; 3,850,752; 3,939,350; 3,996,345; 4,277,437; 4,275,149; 및 4,366,241이다.In some applications, such as in the clinical methods described below, it is desirable to attach a detectable label to one gp120 polypeptide or gp120 polypeptides of the invention. Detectable labels suitable for use in the present invention include compositions detectable by spectroscopic, photochemical, biochemical, immunochemical, electrical, optical or chemical means. Examples include labeled streptavidin conjugates, magnetic beads (e.g., Dynabeads TM ), fluorescent dyes (e.g., fluorescein, Texas red, rhodamine, coumarin, oxazine, green fluorescent protein See, eg, Molecular Probes, Eugene, Oregon, USA), radiolabels (eg, 3 H, 125 I, 35 S, 14 C or 32 P), enzymes (eg horse radish) Peroxidase, alkaline phosphatase and other enzymes commonly used in ELISA), and colorimetric labels such as colloidal gold (eg, gold particles with high efficiency scatter green light of 40-80 mm diameter) or colored glass Or biotin for dyeing with plastic (eg, polystyrene, polypropylene, latex, etc.) beads. Patents teaching the use of such labels are described in US Pat. 3,850,752; 3,939,350; 3,996,345; 4,277,437; 4,275,149; And 4,366,241.
비정상적 이황화물 구조를 갖는 gp120 폴리펩티드를 엔코딩하는 단백질 폴리누클레오티드, 벡터 및 숙주 세포Protein polynucleotides, vectors, and host cells that encode gp120 polypeptides with abnormal disulfide structures
본 발명은 또한 본 발명의 폴리펩티드를 엔코딩하는 단리된 폴리누클레오티드를 제공한다. 따라서, 본 발명의 폴리누클레오티드는 gp120 아미노산 서열을 엔코딩하는 DNA 또는 RAN 부분을 포함한다. 본 발명의 폴리누클레오티드는 생체내 (즉, 유기체) 또는 실험관내 (예를 들어, 세포 배양액)에서 본 발명의 폴리펩티드의 재조합물 생성에 유용하다. 특정 구체예에서, 하기 기술된 바와 같이, 본 발명의 폴리누클레오티드는 면역원성 조성물 예컨대, DNA 백신, 재조합 바이러스 또는 바이러스 유래된 백신 (즉, 바이러스 입자의 성분으로서)에 사용될 수 있다. 또한, 본 발명의 gp120 폴리누클레오티드는 샘플이 비정상적 이황화물 구조를 갖는 gp120 폴리펩티드를 엔코딩하는 폴리누클레오티드를 함유하는지의 여부를 측정하기 위한 임상 방법에 사용될 수 있다.The invention also provides an isolated polynucleotide encoding a polypeptide of the invention. Thus, the polynucleotides of the present invention comprise DNA or RAN moieties that encode the gp120 amino acid sequence. Polynucleotides of the invention are useful for the production of recombinants of polypeptides of the invention in vivo (ie, organisms) or in vitro (eg, cell cultures). In certain embodiments, as described below, the polynucleotides of the present invention may be used in immunogenic compositions such as DNA vaccines, recombinant viruses or virus derived vaccines (ie, as components of viral particles). In addition, the gp120 polynucleotides of the present invention can be used in clinical methods to determine whether a sample contains a polynucleotide encoding a gp120 polypeptide having an abnormal disulfide structure.
A. 폴리누클레오티드A. Polynucleotides
특정 구체예에서, 폴리누클레오티드는 SEQ ID NO: 2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 32, 34, 36, 38, 40, 42, 44, 46, 48, 50, 52, 54, 56, 58, 60, 62, 64, 66, 68, 70, 72, 74, 76, 78, 80, 82, 84, 86, 88, 90, 92, 94, 96, 98, 100, 102, 104, 106, 108, 110, 112, 114, 116, 118, 120, 122, 124, 126, 128, 130, 132, 134 및 136으로 구성된 군으로부터 선택된 gp120의 V1, V2, V3 및 C4 도메인 각각에 대해 약 80%, 약 85%, 약 90%, 약 95% 또는 약 99% 이상의 동일성을 갖는 gp120 서열을 엔코딩한다. 바람직한 구체예에서, 폴리누클레오티드는 SEQ ID NO; 1, 3, 5, 7, 9, 11, 13, 15, 17, 19, 21, 23, 25, 27, 29, 31, 33, 35, 37, 39, 41, 43, 45, 47, 49, 51, 53, 55, 57, 59, 61, 63, 65, 67, 69, 71, 73, 75, 77, 79, 81, 83, 85, 87, 89, 91, 93, 95, 97, 99, 101, 103, 105, 107, 109, 111, 113, 115, 117, 119, 121, 123, 125, 127, 129, 131, 133, 135 또는 이들의 서브시퀀스로 구성된 군으로부터 선택된 gp120 누클레오티드 서열을 포함하며, 여기서 서열은 gp120의 적어도 V1, V2, V3 및 V4 도메인을 엔코딩한다. 상기 표 1은 상기 논의된 임상 실험으로부터의 gp120 샘플 설계와 SEQ ID NO 간의 상응물을 나타낸다. HIV-1의 HXB-2 균주로부터의 gp120에 대한 누클레오티드 서열은 SEQ ID NO:137이다.In certain embodiments, the polynucleotides comprise SEQ ID NOs: 2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 32, 34, 36, 38, 40, 42, 44, 46, 48, 50, 52, 54, 56, 58, 60, 62, 64, 66, 68, 70, 72, 74, 76, 78, 80, 82, 84, 86, 88, Group consisting of 90, 92, 94, 96, 98, 100, 102, 104, 106, 108, 110, 112, 114, 116, 118, 120, 122, 124, 126, 128, 130, 132, 134 and 136 Encode a gp120 sequence having at least about 80%, about 85%, about 90%, about 95% or about 99% identity to each of the V1, V2, V3 and C4 domains of gp120 selected from. In a preferred embodiment, the polynucleotides comprise SEQ ID NO; 1, 3, 5, 7, 9, 11, 13, 15, 17, 19, 21, 23, 25, 27, 29, 31, 33, 35, 37, 39, 41, 43, 45, 47, 49, 51, 53, 55, 57, 59, 61, 63, 65, 67, 69, 71, 73, 75, 77, 79, 81, 83, 85, 87, 89, 91, 93, 95, 97, 99, Gp120 nucleotide sequence selected from the group consisting of 101, 103, 105, 107, 109, 111, 113, 115, 117, 119, 121, 123, 125, 127, 129, 131, 133, 135 or subsequences thereof Wherein the sequence encodes at least the V1, V2, V3 and V4 domains of gp120. Table 1 above shows the correspondence between gp120 sample design and SEQ ID NO from the clinical trials discussed above. The nucleotide sequence for gp120 from the HXB-2 strain of HIV-1 is SEQ ID NO: 137.
상기 논의된 바와 같이, 엔코딩된 gp120 아미노산 서열은 야생형 서열 또는 변이체 서열일 수 있다. gp120 아미노산 서열이 야생형 서열인 경우, 이러한 서열을 엔코딩하는 누클레오티드 서열은 유전자 코드의 축중으로 인한 아미노산 서열 변형을 유도하지 않는 하나 이상의 "침묵" 변이를 함유하는 누클레오티드 서열 또는 야생형 누클레오티드 서열일 수 있다.As discussed above, the encoded gp120 amino acid sequence may be a wild type sequence or a variant sequence. If the gp120 amino acid sequence is a wild type sequence, the nucleotide sequence encoding such a sequence may be a nucleotide sequence or wild type nucleotide sequence containing one or more "silent" mutations that do not induce amino acid sequence modifications due to degeneracy of the genetic code.
예를 들어, 폴리누클레오티드가 엔코딩된 폴리펩티드를 발현하는데 사용하기 위한 경우, 침묵 변이체는 코돈을 숙주 세포에 의해 바람직한 것으로 최적화시키는 표준 돌연변이법에 의해 유입될 수 있다. 대안적으로, 침묵 변이체를 함유하는 폴리누클레오티드는 설계되고 합성될 수 있다. 더욱 특히, 당업자로서, 코돈 사용은 고도로 비불규칙적이고, 종간에 상이하다. 코돈 사용 패턴은 tRNA 이소억셉터(isoacceptor)의 상대적 풍부함과 관련된 것으로 공지되어 있다. gp120의 경우, 코돈 빈도에서의 변화는 포유동물 세포에서 gp120의 재조합 생성을 향상시키는 것으로 공지되어 있다 (Haas J et al. (1996) Curr Biol. 6(3):315-24: Andre S. et al. (1998) J Virol. 72(2):1497-503). 천연 gp120 코딩 서열은 일반적으로 60% A-T를 초과하며, 포유동물계에서 고도로 발현되는 유전자는 약 60% G-C를 초과하는 경향이 있다. 따라서, 본 발명의 폴리누클레오티드가 포유동물 세포에서 본 발명의 폴리펩티드를 발현하는데 유용한 경우, 폴리누클레오티드는 바람직하게는, 코돈 빈도가 포유동물 유전자 더욱 바람직하게는, 상대적으로 고도로 발현된 포유동물 유전자에서 발견되는 코돈 빈도와 더욱 밀접하게 매칭되도록 "코돈-최적화" 된다.For example, when the polynucleotide is for use in expressing an encoded polypeptide, the silent variant can be introduced by standard mutagenesis that optimizes the codon as desired by the host cell. Alternatively, polynucleotides containing silent variants can be designed and synthesized. More particularly, as those skilled in the art, the use of codons is highly irregular and differs between species. Codon usage patterns are known to be related to the relative abundance of tRNA isoacceptors. For gp120, changes in codon frequency are known to enhance recombinant production of gp120 in mammalian cells (Haas J et al. (1996) Curr Biol. 6 (3): 315-24: Andre S. et. al. (1998) J Virol. 72 (2): 1497-503). Native gp120 coding sequences generally exceed 60% A-T and highly expressed genes in mammalian systems tend to exceed about 60% G-C. Thus, where the polynucleotides of the invention are useful for expressing the polypeptides of the invention in mammalian cells, the polynucleotides are preferably found in mammalian genes with codon frequencies more preferably in relatively highly expressed mammalian genes. "Codon-optimized" to more closely match the codon frequency that is being obtained.
본 발명의 코돈-최적화된 폴리누클레오티드는 전형적으로, 이의 비최적화된 대응물로부터 누클레오티드 서열과 약 50% 내지 약 80% 상이하다. 따라서, 본 발명의 코돈-최적화된 폴리누클레오티드는 상응하는 비최적화된 폴리누클레오티드와 약 50%, 약 60%, 약 70%, 약 80%, 또는 약 90% 서열 동일성을 공유할 수 있다.Codon-optimized polynucleotides of the invention are typically about 50% to about 80% different from the nucleotide sequence from their unoptimized counterparts. Thus, the codon-optimized polynucleotides of the present invention may share about 50%, about 60%, about 70%, about 80%, or about 90% sequence identity with the corresponding non-optimized polynucleotide.
바람직한 구체예에서, 본 발명의 폴리누클레오티드는 포유동물 세포에서 발현을 위해 코돈-최적화된다. 특히 설치류 세포 (예를 들어, 마우스, 래트 및 햄스퍼 세포) 및 영장류 세포 (예를 들어, 원숭이 또는 인간 세포)가 유리하다. 예를 들어, 바람직한 구체예에서, 본 발명의 폴리누클레오티드는 차이니즈 햄스퍼 난소 (CHO) 세포에서 발현에 대해 코돈-최적화된다. 본 발명의 폴리누클레오티드는 발현이 요망되는 종으로부터의 임의의 유전자에 기초하여 코돈-최적화될 수 있으나, 코돈-최적화는 바람직하게는, 코돈 빈도를 비교적 고도로 발현된 유전자의 코돈 빈도에 근접하게 변화시키므로써 수행된다. 예를 들어, 본 발명의 폴리누클레오티드는 면역글로불린 유전자의 코돈 빈도에 가까운 코돈 빈도를 가질 수 있다. 이와 같이, 예를 들어, 인간 Ig 카파 및 Mu 불변 영역 유전자의 코돈 빈도가 결정될 수 있으며, 폴리누클레오티드는 Ig 카파 및 Mu 불변 영역 유전자의 조합된 코돈 빈도에 가까운 코돈 빈도를 갖도록 조작될 수 있다. 간단하게는, 이러한 유전자의 서열은 유전자뱅크 또는 유사한 데이타베이스로부터 다운로드될 수 있다. Ig 카파 C-영역은 유전자뱅크의 위치 HUMIGKC3로서 설계되며, Ig Mu C-영역 서열은 위치 HSIGMHCC로서 설계된다. 그 후, 이들 서열은 복사되고, 코돈 사용이 결정된다. 코돈 사용 정보는 조합될 수 있으며, 조합된 코돈 빈도가 계산된다. Ig 유전자에 대한 조합된 코돈 빈도에 사실상 매칭되는 코돈 빈도를 갖는 본 발명의 코돈-최적화된 폴리누클레오티드는 그 후, 합성적으로 편리하게 생성된다.In a preferred embodiment, the polynucleotides of the invention are codon-optimized for expression in mammalian cells. Particularly advantageous are rodent cells (eg mouse, rat and hamsper cells) and primate cells (eg monkey or human cells). For example, in a preferred embodiment, the polynucleotides of the invention are codon-optimized for expression in Chinese Hamsper Ovary (CHO) cells. The polynucleotides of the invention can be codon-optimized based on any gene from the species in which expression is desired, but codon-optimization preferably changes the codon frequency close to the codon frequency of a relatively highly expressed gene. Is performed. For example, the polynucleotides of the present invention may have a codon frequency close to the codon frequency of an immunoglobulin gene. As such, for example, codon frequencies of human Ig kappa and Mu constant region genes can be determined, and polynucleotides can be engineered to have codon frequencies close to the combined codon frequencies of Ig kappa and Mu constant region genes. For simplicity, the sequence of such genes can be downloaded from genebanks or similar databases. The Ig kappa C-region is designed as position HUMIGKC3 of the gene bank and the Ig Mu C-region sequence is designed as position HSIGMHCC. These sequences are then copied and codon usage is determined. Codon usage information can be combined and the combined codon frequency is calculated. Codon-optimized polynucleotides of the present invention having codon frequencies that substantially match the combined codon frequencies for the Ig gene are then produced synthetically conveniently.
일부 적용에서, 특정 적용에 더욱 적합하게 변형시킨 폴리누클레오티드를 생성시키거나 본원에 기술된 폴리누클레오티드를 안정화시키는 것이 유리하다. 이를 위해, 본 발명의 폴리누클레오티드는 포스포로티오에이트, 포스포트리에스테르, 메틸 포스포네이트, 단쇄 알킬 또는 시클로알킬 당간(intersugar) 결합 또는 단쇄 헤테로원자 또는 헤테로시클릭 당간 ("백본") 결합을 함유할 수 있다. 가장 바람직한 것은 포스포로티오에이트이며, CH2--NH--O--CH2, CH2--N(CH3)--O--CH2 (메틸렌(메틸이미노) 또는 MMI 백본으로서 공지됨) 및 CH2--O--N(CH3)--CH2, CH2--N(CH3)--N(CH3)--CH2, 및 O--N(CH3)--CH2--CH 백본을 갖는 포스포로티오에이트이다 (여기서, 포스포디에스테르는 O--P--O--CH2임). 또한 바람직한 것은 모르폴리노 백본 구조를 갖는 폴리누클레오티드이다 (Summerton, J.E. and Weller, D.D., U.S. Pat. No. 5,034,506). 기타 바람직한 구체예는 단백질-핵산 또는 펩티드-핵산 (PNA) 백본을 사용하며, 여기서 폴리누클레오티드의 포스포디에스테르 백본은 폴리아미드 백본과 대체되며, 염기는 폴리아미드 백본의 아자 질소 원자에 직접적으로 또는 간접적으로 결합된다 (P.E. Nielsen, M.Egholm, R.H.Berg, O.Buchardt, Science 1991, 254, 1497). 본 발명의 폴리누클레오티드는 알킬 및 할로겐-치환된 당 부분을 함유할 수 있고/거나 펜토푸라노실기 대신에 시클로부틸과 같은 당 미메틱을 가질 수 있다. 기타 바람직한 구체예에서, 폴리누클레오티드는 하나 이상의 변형된 염기 형태 또는 "일반적인 염기" 예컨대, 이노신을 포함할 수 있다. 폴리누클레오티드는 필요에 따라, RNA 절단기, 콜레스테릴기, 수용체기, 인터칼레이토, 폴리누클레오티드의 약동학적 특성을 증대시키기 위한 기 및/또는 폴리누클레오티드의 약역학적 특성을 증대시키기 위한 기를 포함할 수 있다.In some applications, it is advantageous to produce polynucleotides that have been modified to more suitably suit a particular application or to stabilize the polynucleotides described herein. To this end, the polynucleotides of the present invention contain phosphorothioates, phosphoesters, methyl phosphonates, short chain alkyl or cycloalkyl intersugar bonds or short chain heteroatoms or heterocyclic sugar ("backbone") bonds. can do. Most preferred are phosphorothioates, CH2--NH--O--CH2, CH2--N (CH3)-O--CH2 (known as methylene (methylimino) or MMI backbone) and CH2- Phosphorothioate with -O--N (CH3)-CH2, CH2--N (CH3)-N (CH3)-CH2, and O--N (CH3)-CH2--CH backbone Where the phosphodiester is O--P--O--CH2. Also preferred are polynucleotides having a morpholino backbone structure (Summerton, J.E. and Weller, D.D., U.S. Pat.No. 5,034,506). Other preferred embodiments use a protein-nucleic acid or peptide-nucleic acid (PNA) backbone, wherein the phosphodiester backbone of the polynucleotide is replaced with the polyamide backbone, and the base is directly or indirectly to the aza nitrogen atom of the polyamide backbone. (PE Nielsen, M.Egholm, RHBerg, O.Buchardt, Science 1991, 254, 1497). The polynucleotides of the present invention may contain alkyl and halogen-substituted sugar moieties and / or may have sugar mimetics such as cyclobutyl instead of pentofuranosyl groups. In other preferred embodiments, the polynucleotide may comprise one or more modified base forms or “common bases” such as inosine. Polynucleotides may include, as necessary, groups for enhancing the pharmacokinetic properties of RNA cutters, cholesteryl groups, acceptor groups, intercalators, polynucleotides, and / or groups for enhancing the pharmacokinetic properties of polynucleotides. .
또한, 본 발명의 폴리누클레오티드는 하이브리드화 검정에 사용하기 위한 다양한 이용가능한 라벨을 포함하도록 변형될 수 있다. 적합한 라벨은 본 발명의 gp120 폴리펩티드 라벨링에 있어서 상기 논의된 라벨을 포함한다. 형광 분자는 임상 방법에 사용하기 위한 폴리누클레오티드를 라벨링하는데 편리하게 사용된다. 폴리누클레오티드에 결합할 수 있는 형광 라벨은 플루오레세인, 텍사스 레드, 로다민, 코우마린, 옥사진, 그린 형광 단백질을 포함하나 이에 제한되지 않는다.In addition, the polynucleotides of the present invention may be modified to include various available labels for use in hybridization assays. Suitable labels include the labels discussed above for gp120 polypeptide labeling of the invention. Fluorescent molecules are conveniently used to label polynucleotides for use in clinical methods. Fluorescent labels capable of binding to polynucleotides include, but are not limited to, fluorescein, Texas red, rhodamine, coumarin, oxazine, green fluorescent protein.
당업자는 본 발명의 폴리누클레오티드의 코딩 가닥에 상보적이거나 실질적으로 상보적인 폴리누클레오티드가 본 발명의 폴리펩티드의 발현을 억제하는데 사용될 수 있으며, 이는 연구학적 또는 치료학적 목적에 유리할 수 있음을 이해한다. 따라서, 본 발명의 핵산은 이러한 "안티센스 폴리누클레오티드"를 포함하며, 문구 "본 발명의 폴리펩티드를 엔코딩하는 폴리누클레오티드"는 이러한 안티센스 분자를 포함하는 것으로 해석된다. 안티센스 폴리누클레오티드는 DNA 또는 RNA일 수 있으며, 각각 연구적 또는 치료학적 안티센스 또는 RNA 간섭 (RNAi) 적용에 유용하다.Those skilled in the art understand that polynucleotides complementary or substantially complementary to the coding strand of the polynucleotides of the present invention can be used to inhibit the expression of the polypeptides of the present invention, which may be advantageous for research or therapeutic purposes. Thus, nucleic acids of the present invention include such "antisense polynucleotides" and the phrase "polynucleotides encoding a polypeptide of the present invention" is interpreted to include such antisense molecules. Antisense polynucleotides can be DNA or RNA, and are useful for research or therapeutic antisense or RNA interference (RNAi) applications, respectively.
본 발명의 폴리누클레오티드는 합성적으로 생성되거나, 클로닝된 gp120 폴리누클레오티드 또는 HIV-1 단리물로부터 증폭시키거나, HIV-1 단리물로부터 수득될 수 있다. 필요에 따라, 이와 같이 수득된 폴리누클레오티드의 누클레오티드 서열은 목적하는 누클레오티드 서열에 도달하기 위한 다양한 클로닝 및 돌연변이법을 이용하여 변형될 수 있다. 예를 들어, 문헌 [Sambrook, J., Fritsch, E.F., and Maniatis, T. (1989) in Molecular Cloning: A Laboratory Manual, Second Edition, Cold Spring Harbor, N.Y.] 참조.Polynucleotides of the invention can be synthesized, amplified from cloned gp120 polynucleotides or HIV-1 isolates, or obtained from HIV-1 isolates. If desired, the nucleotide sequence of the polynucleotide thus obtained can be modified using various cloning and mutagenesis to reach the desired nucleotide sequence. See, eg, Sambrook, J., Fritsch, E.F., and Maniatis, T. (1989) in Molecular Cloning: A Laboratory Manual, Second Edition, Cold Spring Harbor, N.Y.
B. 벡터B. Vector
본 발명의 폴리누클레오티드는 숙주 세포에서 증식 및/또는 발현을 위한 벡터로 혼입될 수 있다. 이러한 벡터는 전형적으로 적합한 숙주 세포 (예를 들어, E.coli, 차이니즈 햄스터 난소 [CHO] 세포)에서 벡터의 복제를 수행할 수 있는 복제 서열 (즉, 복제 오리진) 및 선택가능한 마커 예컨대, 항체 내성 유전자를 엔코딩하는 서열을 함유한다. 적합한 숙주의 형질전환시, 벡터는 독립적으로 숙주 게놈과 무관하게 복제되고 기능할 수 있거나 숙주 게놈내로 삽입될 수 있다. 벡터 설계는 무엇보다도, 의도된 용도에 의존적이며, 벡터용 숙주 세포, 및 특정 사용 및 숙주 세포를 위한 본 발명의 벡터의 설계물이 당해분야의 범위내에 있다. Polynucleotides of the invention can be incorporated into vectors for propagation and / or expression in host cells. Such vectors are typically replicating sequences (ie, replication origins) and selectable markers such as antibody resistance that can perform replication of the vector in a suitable host cell (eg, E. coli, Chinese hamster ovary [CHO] cells). It contains a sequence that encodes a gene. Upon transformation of a suitable host, the vector may independently replicate and function independently of the host genome or may be inserted into the host genome. Vector design depends, among other things, on the intended use, and designs of host cells for vectors, and vectors of the invention for particular uses and host cells, are within the scope of the art.
벡터가 폴리펩티드의 발현을 위한 것인 경우, 벡터는 작동적으로 연결된 폴리펩티드-코딩 서열의 발현을 유도하고/거나 증대시킬 수 있는 하나 이상의 조절 서열을 포함한다. 예를 들어, 원핵생물의 발현에 적합한 조절 서열은 프로모터 서열, 오퍼레이터 서열, 및 리보좀 결합 부위를 포함할 수 있다. 진핵 세포에서 발현을 위한 조절 서열은 프로모터, 인핸서 및 전사 종결 서열 (즉, 폴리아데닐화 시그널)을 포함할 수 있다. 본 발명의 발현 벡터는 생체내 (예를 들어, DNA 또는 바이러스 유래된 백신 적용) 또는 시험관내 (예를 들어, 세포 배양액)에서 본 발명의 폴리펩티드를 발현하는데 유용하다.If the vector is for expression of a polypeptide, the vector comprises one or more regulatory sequences capable of inducing and / or augmenting the expression of an operably linked polypeptide-coding sequence. For example, regulatory sequences suitable for expression of prokaryotes may include promoter sequences, operator sequences, and ribosomal binding sites. Regulatory sequences for expression in eukaryotic cells may include promoters, enhancers and transcription termination sequences (ie, polyadenylation signals). Expression vectors of the invention are useful for expressing polypeptides of the invention in vivo (eg, DNA or virus derived vaccine applications) or in vitro (eg, cell culture).
본 발명에 따른 모든 발현 벡터는 또한 기타 서열 예컨대, 시그널 서열 또는 증식가능한 유전자를 엔코딩하는 핵산 서열을 포함할 수 있다. 시그널 서열은 이에 융합된 폴리펩티드를 단백질을 발현하는 세포에서 분비시킬 수 있다. 발현 벡터에서, 시그널 서열을 엔코딩하는 핵산은 코딩 서열의 리딩 프레임을 보존하도록 코딩 서열에 연결된다. 또한, 선택된 숙주 세포에서 영양요구성 결핍을 보완하는 유전자 벡터중의 함유는 벡터로 형질전환된 숙주 세포를 선택할 수 있게 한다.All expression vectors according to the invention may also comprise other sequences such as signal sequences or nucleic acid sequences encoding proliferable genes. The signal sequence can secrete the polypeptide fused to it in a cell expressing the protein. In expression vectors, the nucleic acid encoding the signal sequence is linked to the coding sequence to preserve the reading frame of the coding sequence. In addition, the inclusion of a gene vector in a selected host cell that compensates for dystrophic deficiency allows selection of a host cell transformed with the vector.
바이러스 벡터는 본 발명의 폴리누클레오티드를 세포 또는 유기체로 유도한 후, 엔코딩된 단백질을 발현시키는데 특히 유용하다. 바이러스 벡터는 폴리펩티드를 유기체에 전달하여 생리학적 질환을 개선시키는, 즉 "유전자 치료"를 위한 수단으로서 광범위하게 연구되었다. 유전자 치료 공정의 재검토에 있어서, 예를 들어, 문헌 [Anderson, Science (1992) 256:808-813; Nabel and Felgner (1993) TIBTECH 11:211-217; Mitani and Caskey (1993) TIBTECH 11:162-166; Mulligan (1993) Science, 926-932; Dillon (1993) TIBTECH 11: 167-175; Miller (1992) Nature 357: 455-460; Van Brunt (1988) Biotechnology 6(10): 1149-1154; Vigne (1995) Restorative Neurology and Neuroscience 8:35-36; Kremer and Perricaudet (1995) British Medical Bulletin 51(1) 31-44; Haddada et al. (1995) in Current Topics in Microbiology and Immunology, Doerfler and Boehm (eds) Springer-Verlag, Heidelberg Germany; and Yu et al., (1994) Gene Therapy, 1:13-26] 참조.Viral vectors are particularly useful for inducing the polynucleotides of the invention into cells or organisms and then expressing the encoded proteins. Viral vectors have been extensively studied as means for delivering polypeptides to organisms to ameliorate physiological diseases, ie, "gene therapy." For a review of gene therapy processes, see, eg, Anderson, Science (1992) 256: 808-813; Nabel and Felgner (1993) TIBTECH 11: 211-217; Mitani and Caskey (1993) TIBTECH 11: 162-166; Mulligan (1993) Science, 926-932; Dillon (1993) TIBTECH 11: 167-175; Miller (1992) Nature 357: 455-460; Van Brunt (1988) Biotechnology 6 (10): 1149-1154; Vigne (1995) Restorative Neurology and Neuroscience 8: 35-36; Kremer and Perricaudet (1995) British Medical Bulletin 51 (1) 31-44; Haddada et al. (1995) in Current Topics in Microbiology and Immunology, Doerfler and Boehm (eds) Springer-Verlag, Heidelberg Germany; and Yu et al., (1994) Gene Therapy, 1: 13-26.
널리 사용된 바이러스 벡터 시스템은 아데노바이러스, 아데노 관련 바이러스, 우두 바이러스, 카나리아두 바이러스, 헤르페스 바이러스, 및 다양한 레트로바이러스 발현 시스템을 포함하나, 이에 제한되지 않는다. 아데노바이러스 벡터의 용도는 당업자에 널리 공지되어 있으며, 예를 들어, WO 96/25507에 상세히 기술되어 있다. 특히 바람직한 아데노바이러스 벡터는 문헌[Wills et al. (1994) Hum. Gene Therap. 5:1079-1088]에 기술되어 있다. 본 발명에 사용하기에 적합한 아데노바이러스 벡터는 또한 시중에서 구입가능하다. 예를 들어, 클론테크(Clontech)에서 판매되는 Adeno-XTM Tet-OffTM 유전자 발현 시스템은 유도가능한 이종 유전자를 대부분의 포유동물 세포로 유입시키는데 효과적인 수단을 제공한다.Widely used viral vector systems include, but are not limited to, adenoviruses, adeno-associated viruses, vaccinia viruses, canariadu viruses, herpes viruses, and various retroviral expression systems. The use of adenovirus vectors is well known to those skilled in the art and is described in detail, for example, in WO 96/25507. Particularly preferred adenovirus vectors are described in Wills et al. (1994) Hum. Gene Therap. 5: 1079-1088. Adenovirus vectors suitable for use in the present invention are also commercially available. For example, the Adeno-X ™ Tet-Off ™ gene expression system sold by Clontech provides an effective means of introducing inducible heterologous genes into most mammalian cells.
예를 들어, 폴리누클레오티드 및 펩티드의 실험관내 생성, 및 생체내 및 생체외 유전자 치료 공정에서 표적 핵산으로 형질도입된 세포에 사용되는 아데노 관련 바이러스(AAV)-기재 벡터는 예를 들어, 문헌[West et al. (1987) Virology 160:38-47; Carter et al. (1989) U.S. Patent No. 4,797,368; Carter et al. WO 93/24641 (1993); Kotin (1994) Human Gene Therapy 5:793-801; Muzyezka (1994) J.Clin. Invst. 94:1351; Lebkowski, U.S. Pat. No. 5,173,414; Tratschin et al. (1985) Mol. Cell. Biol. 5(11): 3251-3260; Tratschin, et al. (1984) Mol.Cell.Biol., 4:2072-2081; Hermonat and Muzyczka (1984) Proc.Natl. Acad. Sci. USA, 81:6466-6470; McLaughlin et al. (1988) and Samulski et al. (1989) J. Virol., 63:03822-3828]에 기술되어 있다. rAAV에 의해 형질전환될 수 있는 세포주는 문헌 [Lebkowski et al. (1988) Mol. Cell. Biol., 8:3988-3996]에 기술된 세포주를 포함한다.For example, adeno-associated virus (AAV) -based vectors used in in vitro production of polynucleotides and peptides and in cells transduced with target nucleic acids in in vivo and ex vivo gene therapy processes are described, for example, in West. et al. (1987) Virology 160: 38-47; Carter et al. (1989) U.S. Patent No. 4,797,368; Carter et al. WO 93/24641 (1993); Kotin (1994) Human Gene Therapy 5: 793-801; Muzyezka (1994) J. Clin. Invst. 94: 1351; Lebkowski, U.S. Pat. No. 5,173,414; Tratschin et al. (1985) Mol. Cell. Biol. 5 (11): 3251-3260; Tratschin, et al. (1984) Mol. Cell. Biol., 4: 2072-2081; Hermonat and Muzyczka (1984) Proc. Natl. Acad. Sci. USA, 81: 6466-6470; McLaughlin et al. (1988) and Samulski et al. (1989) J. Virol., 63: 03822-3828. Cell lines that can be transformed with rAAV are described by Lebkowski et al. (1988) Mol. Cell. Biol., 8: 3988-3996.
널리 사용된 레트로바이러스 벡터는 뮤린 루케미아 바이러스 (MuLV), 긴팔원숭이-유인원 류케미아 바이러스 (GaLV), 원숭이 면역결핍 바이러스 (SIV), 인간 면역결핍 바이러스 (HIV), 알파바이러스, 및 이들의 조합에 기초한 벡터를 포함한다 (예를 들어, 문헌 [Buchscher et al. (1992) J. Virol. 66(5) 2731-2739; Johann et al. (1992) J.Virol. 66 (5): 1635-1640 (1992); Sommerfelt et al., (1990) Virol. 176:58-59; Wilson et al. (1989) J.Virol. 63:2374-2378; Miller et al., J.Virol. 65:2220-2224 (1991); Wong-Staal et al., PCT/US94/05700, and Rosenburg and Fauci (1993) in Fundamental Immunology, Third Edition Paul (ed) Raven Press, Ltd., New York and the reference therein, and Yu et al. (1994) Gene Therapy, supra; U.S. Patent 6,008,535, and the like] 참조). 기타 적합한 바이러스 벡터는 단순포진 바이러스 (HSV), 파필로마바이러스, 엡스테인 바르 바이러스 (EBV), 및 렌티바이러스를 포함하나, 이에 제한되지 않는다.Widely used retroviral vectors include murine Lucemia virus (MuLV), Gibbon-Apes Leukemia Virus (GaLV), monkey immunodeficiency virus (SIV), human immunodeficiency virus (HIV), alphavirus, and combinations thereof. Vectors based (see, eg, Buchscher et al. (1992) J. Virol. 66 (5) 2731-2739; Johann et al. (1992) J. Virol. 66 (5): 1635-1640). (1992); Sommerfelt et al., (1990) Virol. 176: 58-59; Wilson et al. (1989) J. Virol. 63: 2374-2378; Miller et al., J. Virol. 65: 2220- 2224 (1991); Wong-Staal et al., PCT / US94 / 05700, and Rosenburg and Fauci (1993) in Fundamental Immunology, Third Edition Paul (ed) Raven Press, Ltd., New York and the reference therein, and Yu et al. (1994) Gene Therapy, supra; US Patent 6,008,535, and the like). Other suitable viral vectors include, but are not limited to, herpes simplex virus (HSV), papillomavirus, Epstein Barr virus (EBV), and lentivirus.
한 구체예에서, 본 발명에 따른 벡터는 DNA 백신 및 재조합 바이러스 벡터로서 작용할 수 있는 이작용성 플라스미드이다. 정제된 벡터 DNA의 피검체로의 직접적인 주입 즉, DNA 백신은 엔코딩된 폴리펩티드에 대한 면역 반응을 유도한다. 벡터는 또한, 바이러스 유래된 백신에 사용하기 위한 살아있는 재조합 바이러스를 생성시키는데 사용될 수 있다. 이러한 벡터는 두개의 상이한 프로모터 즉, 동물 프로모터 (DNA 백신에서 벡터로서 사용하기 위한) 및 바이러스 프로모터 (바이러스 유래된 백신에 사용하기 위한)에 작동적으로 결합된 본 발명의 폴리펩티드를 엔코딩하는 누클레오티드 서열을 포함한다.In one embodiment, the vector according to the invention is a bifunctional plasmid that can act as a DNA vaccine and a recombinant viral vector. Direct injection of purified vector DNA into a subject, ie, a DNA vaccine, induces an immune response against the encoded polypeptide. Vectors can also be used to generate live recombinant viruses for use in virus derived vaccines. Such vectors contain nucleotide sequences encoding polypeptides of the invention that are operably linked to two different promoters, an animal promoter (for use as a vector in a DNA vaccine) and a viral promoter (for use in a virus derived vaccine). Include.
본 발명의 벡터는 알맞은 제한 부위에서 리게이션에 의해 목적하는 엘리먼트를 결합시키므로써 생성된다. 이러한 부위가 존재하지 않는 경우, 적합한 부위가 표준 돌연변이법 (예를 들어, 부위 특이적 또는 카세트 돌연변이법)에 의해 삽입될 수 있거나, 합성 올리고누클레오티드 어답터 또는 링커가 통상적인 관례에 따라 사용될 수 있다.The vector of the present invention is generated by binding the desired element by ligation at an appropriate restriction site. If such sites do not exist, suitable sites may be inserted by standard mutagenesis (eg, site specific or cassette mutagenesis), or synthetic oligonucleotide adapters or linkers may be used in accordance with conventional practice.
C. 숙주 세포C. Host Cells
본 발명은 또한 본 발명의 벡터를 함유하는 숙주 세포를 제공한다. 매우 다양한 숙주 세포는 벡터의 증식 및/또는 발현에 유용하다. 예로는 원핵 세포 (예컨대, E.coli 및 바실러스, 슈도모나스 및 기타 세균 균주), 효모 또는 기타 진륜류 세포 (S.세레베시아 및 P.파스토리스를 포함), 곤충 세포, 식물 세포, 및 파아지 뿐만 아니라, 포유동물 세포 (예컨대, 차이니즈 햄스터 난소 세포) 특히, 인간 세포 (예컨대, 인간 배아 신장 세포)를 포함하는 고등의 진핵 세포를 포함한다. 본 발명에 따른 숙주 세포는 배양 세포 및 살아있는 유기체에 존재하는 세포, 예컨대, DNA 백신 또는 바이러스 백터가 삽입된 유전자이식 식물 또는 동물 또는 세포를 포함한다.The invention also provides host cells containing the vectors of the invention. A wide variety of host cells are useful for propagation and / or expression of vectors. Examples include prokaryotic cells (e.g., E. coli and Bacillus, Pseudomonas and other bacterial strains), yeast or other fungal cells (including S. cerevisiae and P. pastoris), insect cells, plant cells, and phages. But also eukaryotic cells, including mammalian cells (eg, Chinese hamster ovary cells), in particular human cells (eg, human embryonic kidney cells). Host cells according to the present invention include culture cells and cells present in living organisms, such as transgenic plants or animals or cells into which a DNA vaccine or viral vector has been inserted.
본 발명의 벡터는 사용되는 벡터-숙주 시스템에 따라 변할 임의의 알맞은 방법에 의해 숙주 세포내로 유입된다. 일반적으로, 벡터는 벡터를 함유하는 바이러스로의 형질전환 (또한 "트랜스펙션"으로서 공지됨) 또는 감염에 의해 숙주 세포로 유입된다. 숙주 세포는 원핵 세포 (또는 세포벽을 갖는 담체 세포)인 경우, 알맞은 형질전환 방법은 문헌 [Cohen, et al. (1972) Proc. Natl. Acad. Sci., USA, 69:2100-14]에 기술된 칼슘 처리 방법을 포함한다. 원핵 세포가 숙주로서 사용되고, 벡터가 파아지미드 벡터인 경우, 각각 벡터는 감염에 의해 숙주 세포로 유입될 수 있다. 효모 세포는 예를 들어, 문헌 [Hinnen (1978) Proc. Natl. Acad. Sci, USA, 75:1929-33]에 교시된 바와 같이 폴리에틸렌 글리콜을 사용하여 형질전환될 수 있다. 포유동물 세포는 문헌 [Graham, et al. (1978) Virology, 52:546 and Gorman, et al. (1990) DNA and Prot.Eng. Tech., 2:3-10]에 기술된 칼슘 포스페이트 침전 방법을 이용하여 편리하게 형질전환될 수 있다. 그러나, DNA를 숙주 세포로 유입하기 위한 기타 공지된 방법 예컨대, 핵 주입, 일렉트로포레이션 및 원형질체 융합이 본 발명에 사용될 수 있다.The vector of the present invention is introduced into the host cell by any suitable method that will vary depending on the vector-host system used. In general, vectors are introduced into host cells by transformation (also known as "transfection") or infection with a virus containing the vector. If the host cell is a prokaryotic cell (or carrier cell with a cell wall), suitable transformation methods are described in Cohen, et al. (1972) Proc. Natl. Acad. Sci., USA, 69: 2100-14. When prokaryotic cells are used as hosts and the vector is a phagemid vector, each vector can be introduced into the host cell by infection. Yeast cells are described, for example, in Hinnen (1978) Proc. Natl. Acad. Sci, USA, 75: 1929-33, can be used to transform using polyethylene glycol. Mammalian cells are described in Graham, et al. (1978) Virology, 52: 546 and Gorman, et al. (1990) DNA and Prot. Eng. Tech., 2: 3-10] can be conveniently transformed using the calcium phosphate precipitation method described. However, other known methods for introducing DNA into host cells such as nuclear injection, electroporation and protoplast fusion can be used in the present invention.
본 발명은 생체내 세포 및 실험관내 세포 즉, 배양 세포로 본 발명의 벡터를 유입시키는 것을 포함한다. 살아있는 유기체에 존재하는 세포로 벡터를 삽입하는 기법은 널리 공지되어 있으며, 본 발명의 면역원성 조성물의 사용과 관련하여 하기에 더욱 자세히 설명되어 있다. 특정 구체예에서, 본 발명의 벡터는 DNA 또는 바이러스 유래된 백신중에서 피검체로 삽입된다.The present invention includes introducing a vector of the present invention into cells in vivo and cells in vitro, ie culture cells. Techniques for inserting vectors into cells present in living organisms are well known and described in more detail below in connection with the use of the immunogenic compositions of the invention. In certain embodiments, a vector of the invention is inserted into a subject in a DNA or virus derived vaccine.
재조합체 생성 방법Recombinant production method
발현 벡터로 형질전환된 숙주 세포는 본 발명의 폴리누클레오티드에 의해 엔코딩된 폴리펩티드를 발현하는데 사용될 수 있다. 발현은 세포 성장 및 발현에 적합한 조건하에 숙주 세포를 배양하고, 세포 용해물로부터, 또는 폴리펩티드가 분비되는 경우 배양 배지로부터 발현된 폴리펩티드를 회수하는 것을 포함한다. 특히, 배양 배지는 사용된 숙주 세포에 대한 적합한 영양분 및 성장 인자를 함유한다. 영양분 및 성장 인자는 많은 경우에 널리 공지되어 있거나, 당업자에 의해 용이하게 실험에 의해 결정될 수 있다. 예를 들어, 포유동물 숙주 세포에 적합한 배양 조건은 문헌 [Mammalian Cell Culture (Mather ed., Plenum Press 1984) and in Barnes and Sato (1980) Cell 22:649]에 기술되어 있다.Host cells transformed with expression vectors can be used to express polypeptides encoded by the polynucleotides of the invention. Expression involves culturing the host cell under conditions suitable for cell growth and expression, and recovering the expressed polypeptide from the cell lysate or from the culture medium when the polypeptide is secreted. In particular, the culture medium contains suitable nutrients and growth factors for the host cells used. Nutrients and growth factors are well known in many cases, or can be readily determined experimentally by one skilled in the art. For example, suitable culture conditions for mammalian host cells are described in Mammalian Cell Culture (Mather ed., Plenum Press 1984) and in Barnes and Sato (1980) Cell 22: 649.
또한, 배양 조건은 전사, 번역 및 세포 컴파트먼트간 단백질 수송을 가능하게 해야 한다. 이러한 공정에 영향을 주는 인자는 널리 공지되어 있으며, 예를 들어, DNA/RNA 복사체 수; DNA를 안정화시키는 인자; 배양 배지중에 존재하는 영양분, 보충제, 및 전사 유도자 또는 리프레서; 배양액의 온도, pH 및 삼투질 농도; 및 세포 밀도를 포함한다. 특정 벡터-숙주 세포 시스템에서 발현을 촉진하는 이들 인자의 조절는 당업자의 수준내에 있다. 예를 들어, 실험관내 포유동물 세포 배양의 생산성을 최대화시키기 위한 원리 및 실행 기법은 문헌 [Mammalian Cell Biotechnology: a Practical Approach (Butler ed., IRL Press (1991)]에서 발견할 수 있다.In addition, the culture conditions should allow for transcription, translation and protein transport between cell compartments. Factors affecting this process are well known and include, for example, the number of DNA / RNA copies; Factors that stabilize DNA; Nutrients, supplements, and transcription inducers or refreshers present in the culture medium; Temperature, pH and osmolality of the culture; And cell density. Regulation of these factors to promote expression in certain vector-host cell systems is within the level of those skilled in the art. For example, principles and practice techniques for maximizing the productivity of in vitro mammalian cell culture can be found in Mammalian Cell Biotechnology: a Practical Approach (Butler ed., IRL Press (1991)).
대규모 또는 소규모 단백질 생성을 위한 많은 널리 공지된 기법은 본 발명의 폴리펩티드를 발현하는데 사용될 수 있다. 이는 쉐이킹된 플라스크, 플루오르화된 베드 생반응기, 롤러 버틀 배양 시스템, 및 교반된 탱크 생반응기 시스템의 사용을 포함하나 이에 제한되지 않는다. 세포 배양은 회분식, 유가식 또는 연속 모드로 수행될 수 있다.Many well known techniques for large or small scale protein production can be used to express polypeptides of the invention. This includes, but is not limited to, the use of shaked flasks, fluorinated bed bioreactors, roller bottle culture systems, and stirred tank bioreactor systems. Cell culture can be performed in batch, fed-batch or continuous mode.
상기 기술된 바와 같이 생성된 재조합 단백질의 회수 방법은 널리공지되어 있으며, 이용된 발현 시스템에 따라 다르다. 시그널 서열을 포함하는 폴리펩티드는 배양 배지 또는 주변원형질로부터 회수될 수 있다. 폴리펩티드는 또한 세포내 발현되어 세포 용해물로부터 회수될 수 있다.Methods for recovering the recombinant protein produced as described above are well known and depend on the expression system used. Polypeptides comprising the signal sequence can be recovered from the culture medium or from the periplasm. Polypeptides can also be expressed intracellularly and recovered from cell lysates.
발현된 폴리펩티드는 숙주 세포 또는 배양 배지의 하나 이상의 성분으로부터 폴리펩티드를 분리할 수 있는 임의의 방법에 의해 세포 배양액 또는 세포 용해물로부터 정제될 수 있다. 전형적으로, 폴리펩티드는 폴리펩티드의 의도된 용도와 대립되는 숙주 세포 및/또는 배양 배지 성분으로부터 분리된다. 제 1 단계로서, 배양 배지 또는 세포 용해물은 일반적으로 원심분리되거나 여과되어 세포 부스러기를 제거한다. 그 후, 상청액을 전형적으로 원심분리하거나 원하는 부피로 희석하거나 추가의 정제를 위한 제조물을 컨디셔닝하는데 적합한 완충액으로 다이아필터링된다.The expressed polypeptide can be purified from the cell culture or cell lysate by any method capable of separating the polypeptide from one or more components of the host cell or culture medium. Typically, a polypeptide is isolated from host cell and / or culture medium components that are in conflict with the intended use of the polypeptide. As a first step, the culture medium or cell lysate is generally centrifuged or filtered to remove cell debris. The supernatant is then centrifuged, diluted to the desired volume, or diafiltered with a buffer suitable for conditioning the preparation for further purification.
그 후, 폴리펩티드는 널리 공지된 기법을 이용하여 추가로 정제될 수 있다. 선택된 기법은 발현된 폴리펩티드의 특성에 따라 다를 것이다. 예를 들어, 폴리펩티드가 에피토프 태그 또는 기타 친화성 도메인을 함유하는 융합 단백질로서 발현되는 경우, 정제는 전형적으로 코그네이트 결합 파트너를 함유하는 친화도 칼럼의 사용을 포함한다. 예를 들어, 그린 형광 단백질, 헤마글루티닌 또는 FLAG 에피토프 태그, 또는 헥사히스티딘 또는 유사한 금속 친화도 태그와 융합된 폴리펩티드는 친화된 칼럼에 대한 세분화에 의해 정제될 수 있다.The polypeptide can then be further purified using well known techniques. The technique chosen will depend on the nature of the expressed polypeptide. For example, when a polypeptide is expressed as a fusion protein containing an epitope tag or other affinity domain, purification typically involves the use of an affinity column containing a cognate binding partner. For example, a polypeptide fused with a green fluorescent protein, hemagglutinin or FLAG epitope tag, or hexahistidine or similar metal affinity tag can be purified by segmentation on an affinity column.
면역원성 조성물Immunogenic Compositions
A. 면역원성 조성물의 유형A. Types of Immunogenic Compositions
본 발명의 면역원성 조성물은 본 발명의 단리된 폴리펩티드, 단리된 폴리누클레오티드, 또는 이 둘 모두를 포함할 수 있다. 본 발명의 단리된 gp120 폴리펩티드는 적합한 용량으로 피검체에 투여시 항-gp120 면역 반응을 유도하기에 충분한 양으로 면역원성 조성물중에 존재한다. 본 발명의 단리된 gp120 폴리누클레오티드는, 피검체로의 적합한 용량의 투여가 엔코딩된 gp120 폴리펩티드의 발현을 유도하여, 항-gp120 면역 반응을 자극하기에 충분한 양으로 면역원성 조성물중에 존재한다.Immunogenic compositions of the invention may comprise an isolated polypeptide of the invention, an isolated polynucleotide, or both. The isolated gp120 polypeptide of the present invention is present in the immunogenic composition in an amount sufficient to induce an anti-gp120 immune response when administered to a subject at a suitable dose. The isolated gp120 polynucleotide of the present invention is present in the immunogenic composition in an amount sufficient for administration of a suitable dose to the subject induces expression of the encoded gp120 polypeptide, thereby stimulating an anti-gp120 immune response.
바람직한 구체예에서, 면역원성 조성물은 "다가"이며, 이는 두개 이상의 상이한 항원성 gp120 서열을 제공한다. 이와 같이, 폴리펩티드 기재 조성물은 2개 이상의 상이한 HIV 단리물로부터 유래된 gp120 서열을 포함하는 하나 이상의 폴리펩티드를 함유할 수 있다. 폴리누클레오티드 기재 조성물은 2개 이상의 상이한 HIV 단리물로부터 유래된 gp120 서열을 엔코딩하는 하나 이상의 폴리누클레오티드를 함유할 수 있다. 대안적으로, 본 발명의 면역원성 조성물은 하나의 HIV 단리물로부터 유래된 gp120 서열을 포함하는 하나 이상의 폴리펩티드 및 상이한 HIV 단리물로부터 유래된 gp120 서열을 엔코딩하는 하나 이상의 폴리누클레오티드를 함유할 수 있다. 이러한 구체예의 변형은 목적하는 대로 많은 상이한 항원성 gp120 서열 예를 들어, 3, 4, 5, 6, 7, 8, 9, 10, 20, 30, 40, 50 또는 100개 이상의 상이한 항원성 gp120 서열을 제공할 수 있다.In a preferred embodiment, the immunogenic composition is "multivalent", which provides two or more different antigenic gp120 sequences. As such, the polypeptide based composition may contain one or more polypeptides comprising a gp120 sequence derived from two or more different HIV isolates. The polynucleotide based composition may contain one or more polynucleotides encoding a gp120 sequence derived from two or more different HIV isolates. Alternatively, an immunogenic composition of the invention may contain one or more polypeptides comprising a gp120 sequence derived from one HIV isolate and one or more polynucleotides encoding a gp120 sequence derived from a different HIV isolate. Variations of this embodiment can be carried out as many different antigenic gp120 sequences as desired, for example 3, 4, 5, 6, 7, 8, 9, 10, 20, 30, 40, 50 or 100 or more different antigenic gp120 sequences. Can be provided.
1. 폴리펩티드를 함유하는 면역원성 조성물1. Immunogenic Compositions Containing Polypeptides
본 발명은 본 발명의 단리된 폴리펩티드를 함유하는 면역원성 조성물을 제공한다. 조성물은 예를 들어, 저장 용액 예컨대, 적합한 완충액 예를 들어, 생리학적 완충액을 포함하는 기타 성분을 선택적으로 함유한다. 바람직한 구체예에서, 기타 성분은 약제학적으로 허용되는 담체 예컨대, 문헌 [Remington's Pharmaceutical Sciences (1980) 16th editions, Osol, ed., 1980]에 기술된 바와 같은 담체이다.The present invention provides immunogenic compositions containing the isolated polypeptides of the present invention. The composition optionally contains other components, including, for example, stock solutions such as suitable buffers such as physiological buffers. In a preferred embodiment, the other component is a pharmaceutically acceptable carrier such as that described in Remington's Pharmaceutical Sciences (1980) 16th editions, Osol, ed., 1980.
면역원성 조성물은 유리된 형태 (즉, 기타 부분에 컨쥬게이팅되지 않은 형태) 또는 세포 특이적 결합 부위에 공유적으로 결합된 본 발명의 하나 이상의 폴리펩티드, 면역자극 올리고누클레오티드 또는 면역원성 담체 단백질을 포함할 수 있다.The immunogenic composition may comprise one or more polypeptides, immunostimulatory oligonucleotides or immunogenic carrier proteins of the invention covalently bound to a free form (ie, non-conjugated to other portions) or to a cell specific binding site. Can be.
한 구체예에서, 본 발명의 면역원성 조성물은 본 발명의 폴리펩티드를 발현하는 세포, 세포 용해물, 또는 폴리펩티드를 함유하는 이들의 분획물, 예컨대, 막 분획물을 포함한다.In one embodiment, the immunogenic compositions of the invention comprise cells, cell lysates, or fractions thereof containing the polypeptide, such as membrane fractions, that express the polypeptides of the invention.
또 다른 구체예에서, 면역원성 조성물은 본 발명의 하나 이상의 폴리펩티드를 나타내는 바이러스 입자 및/또는 바이러스 감염된 세포를 포함한다. 조작된 바이러스 입자 및/또는 감염된 세포를 갖는 피검체의 면역화는 당해분야에 널리 공지되어 있으며, 면역화를 위해 사용된 조성물은 일반적으로 "바이러스 유래된 백신"으로 일컬어진다. 바이러스 감염 성분이 B 림프구, 헬퍼 T 림프구 및 세포독성 T 림프구를 활성화시키는 강력한 면역 반응을 증진시킬 수 있기 때문에, 바이러스 유래된 백신이 유리할 수 있다. 많은 바이러스 종은 바이러스 유래된 백신에 유용한 재조합 바이러스를 생성시키는데 사용될 수 있다. 예로는 우두 바이러스 (International Patent Publication WO 87/06262, Oct. 22, 1987, by Moss et al.; Cooney et al., Proc. Natl. Acad. Sci. USA 90:1882-6(1993); Graham et al., J.Infect. Dis. 166:244-52 (1992); McElrath et al., J.Infect.Dis. 169:41-7(1994)) 및 카나리아두 바이러스 (Pialoux et al., AIDS Res. Hum. Retroviruses 11:373-81 (1995), erratum in AIDS Res. Hum. Retroviruses 11:875 (1995); Andersson et al., J.Infect.Dis. 174:977-85(1996); Fries et al., Vaccine 14:428-34 (1996); Gonczol et al., Vaccine 13:1080-5 (1995))을 포함한다. 바이러스 유래된 백신은 결함이 있는 아데노바이러스 또는 아데노바이러스를 사용하여 제조될 수 있다 (Gilardi-Hebenstreit et al., J.Gen.Virol.71:2425-31 (1990); Prevec et al., J.Infect. Dis. 161:27-30 (1990); Lubeck et al., Proc.Natl.Acad.Sci. USA 86:6763-7 (1989); Xiang et al., Virology 219:220-7 (1996)). 백신에 유용한 재조합 바이러스를 생성하도록 조작될 수 있는 기타 바이러스는 양생(amphotropic) 숙주 범위를 갖는 세포에 패키징되는 레트로바이러스 (예를 들어, 문헌 [Miller, Human Gene Ther. 1:5-14 (1990); Ausubel et al., Current Protocols in Molecular Biology, § 9] 참조), 및 약화되거나 결함이 있는 DNA 바이러스 예컨대, 단순포진 바이러스 (HSV) (예를 들어, 문헌[Kaplitt et al., Molec. Cell. Neurosci. 2:320-330 (1991) 참조), 파필로마바이러스, 엡스테인 바르 바이러스 (EBV), 아데노-관련 바이러스 (AAV) (예를 들어, 문헌 [Samulski et al., J.Virol. 61:3096-3101 (1987); Samulski et al., J. Virol. 63:3822-3828 (1989) 참조) 등을 포함하나 이에 제한되지 않는다.In another embodiment, the immunogenic composition comprises viral particles and / or virus infected cells representing one or more polypeptides of the invention. Immunization of subjects with engineered viral particles and / or infected cells is well known in the art, and compositions used for immunization are generally referred to as "virus derived vaccines". Virus-derived vaccines may be advantageous because viral infectious components can promote potent immune responses that activate B lymphocytes, helper T lymphocytes, and cytotoxic T lymphocytes. Many viral species can be used to produce recombinant viruses useful for virus derived vaccines. Examples include vaccinia virus (International Patent Publication WO 87/06262, Oct. 22, 1987, by Moss et al .; Cooney et al., Proc. Natl. Acad. Sci. USA 90: 1882-6 (1993); Graham et. al., J. Infect. Dis. 166: 244-52 (1992); McElrath et al., J. Infect. Dis. 169: 41-7 (1994)) and Canarydu virus (Pialoux et al., AIDS Res). Hum.Retroviruses 11: 373-81 (1995), erratum in AIDS Res.Hum.Retroviruses 11: 875 (1995); Andersson et al., J. Infect. Dis. 174: 977-85 (1996); Fries et. al., Vaccine 14: 428-34 (1996); Gonczol et al., Vaccine 13: 1080-5 (1995). Virus derived vaccines can be made using defective adenoviruses or adenoviruses (Gilardi-Hebenstreit et al., J. Gen. Virol. 71: 2425-31 (1990); Prevec et al., J. Infect. Dis. 161: 27-30 (1990); Lubeck et al., Proc. Natl. Acad. Sci. USA 86: 6763-7 (1989); Xiang et al., Virology 219: 220-7 (1996) ). Other viruses that can be engineered to produce recombinant viruses useful in vaccines include retroviruses that are packaged in cells with an amphotropic host range (eg, Miller, Human Gene Ther. 1: 5-14 (1990) Ausubel et al., Current Protocols in Molecular Biology, § 9), and attenuated or defective DNA viruses such as herpes simplex virus (HSV) (see, eg, Kaplitt et al., Molec. Cell. Neurosci. 2: 320-330 (1991)), papillomavirus, Epstein Barr virus (EBV), adeno-associated virus (AAV) (see, eg, Samulski et al., J. Virol. 61: 3096-3101 (1987); see Samulski et al., J. Virol. 63: 3822-3828 (1989), and the like.
본 발명에 사용하기에 적합한 약제학적으로 허용되는 담체는 사용되는 용량에서 세포, 조직 또는 피검체에 비독성이며, 완충제 (예컨대, 포스페이트 완충제, 시트레이트 완충제, 및 기타 유기산으로 제조된 완충제), 항산화제 (예를 들어, 아스코르브산), 저분자량 (약 10 잔기 미만) 펩티드, 폴리펩티드 (예컨대, 혈청 알부민, 젤라틴, 및 면역글로불린), 친수성 중합체 (예컨대, 폴리비닐피롤리돈), 아미노산 (예컨대, 글리신, 글루타민, 아스파라긴, 아르기닌 및/또는 리신), 모노사카라이드, 디사카라이드 및/또는 기타 탄수화물 (글루코오스, 만노스 및 덱스트린 포함), 킬레이팅제 (예를 들어, 에틸렌디아민테트라아세트산 [EDTA]), 당 알코올 (예컨대, 만니톨 및 소르비톨), 염 형성 반대이온 (예를 들어, 나트륨) 및/또는 음이온성 계면활성제 (예컨대, TweenTM, PluronicsTM, 및 PEG)를 포함할 수 있다. 한 구체예에서, 약제학적으로 허용되는 담체는 수성 pH-완충된 용액이다.Pharmaceutically acceptable carriers suitable for use in the present invention are nontoxic to cells, tissues or subjects at the dosages employed, and include buffers (eg, buffers made of phosphate buffers, citrate buffers, and other organic acids), antioxidants Agents (eg, ascorbic acid), low molecular weight (less than about 10 residues) peptides, polypeptides (eg, serum albumin, gelatin, and immunoglobulins), hydrophilic polymers (eg, polyvinylpyrrolidone), amino acids (eg, Glycine, glutamine, asparagine, arginine and / or lysine), monosaccharides, disaccharides and / or other carbohydrates (including glucose, mannose and dextrins), chelating agents (eg, ethylenediaminetetraacetic acid [EDTA]) Sugar alcohols (eg mannitol and sorbitol), salt-forming counterions (eg sodium) and / or anionic surfactants (eg Tween ™ , Pluronics ™ , and PEG). In one embodiment, the pharmaceutically acceptable carrier is an aqueous pH-buffered solution.
바람직한 구체예는 서방형 조성물을 포함한다. 예시적인 서방형 조성물은 폴리펩티드가 부착되거나, 폴리펩티드가 캡슐화된 고형의 소수성 중합체의 반침투성 매트릭스를 포함한다. 적합한 중합체의 예로는 폴리에스테르, 히드로겔, 폴리락티드, L-글루탐산과 T-에틸-L-글루타마아제의 공중합체, 비분해성인 에틸렌-비닐아세테이트, 분해가능한 락트산-글리콜산 공중합체, 및 폴리-D-(-)-3-히드록시부티르산을 포함한다. 이러한 물질은 구체화된 물품 예컨대, 필름 또는 마이크로캡슐 형태로 존재한다.Preferred embodiments include sustained release compositions. Exemplary sustained release compositions include semipermeable matrices of solid hydrophobic polymers to which the polypeptide is attached or the polypeptide is encapsulated. Examples of suitable polymers include polyesters, hydrogels, polylactides, copolymers of L-glutamic acid and T-ethyl-L-glutamease, nondegradable ethylene-vinylacetate, degradable lactic acid-glycolic acid copolymers, and Poly-D-(-)-3-hydroxybutyric acid. Such materials are present in the form of specified articles such as films or microcapsules.
서방형 조성물의 예로는 전형적으로, ε-아미노기를 통해 폴리알킬렌 글리콜 (예를 들어, 폴리에틸렌 글리콜 [PEG])에 부착된 폴리펩티드를 포함한다. 단백질로의 PEG 부착은 생체내 반감기를 연장하는 널리 공지된 수단이다 (예를 들어, 문헌 [Abuchowski, J., et al. (1977) J. Biol. Chem. 252:3582-86] 참조). 통상적인 "PEG화(pegylation)" 방법이 이용될 수 있으며, 단 "PEG화"된 변형물은 목적하는 기능을 보유한다.Examples of sustained release compositions typically include a polypeptide attached to a polyalkylene glycol (eg, polyethylene glycol [PEG]) via an ε-amino group. PEG attachment to proteins is a well known means of extending half-life in vivo (see, eg, Abuchowski, J., et al. (1977) J. Biol. Chem. 252: 3582-86). Conventional “PEGlation” methods may be used, provided that the “PEGylated” variant retains the desired function.
또 다른 구체예에서, 서방형 조성물은 리포좀으로 엔트랩핑된 폴리펩티드를 포함한다. 리포좀은 다양한 유형의 지질, 인지질 및/또는 계면활성제로 이루어진 작은 비히클이다. 이들 성분은 전형적으로 생물학적 막의 지질 배열과 유사하게 이중막 형태로 배열된다. 폴리펩티드를 함유하는 리포좀은 예를 들어, 문헌 [Epstein, et al. (1985) PNAS USA 82:3688-92, and Hwang, et al., (1980) PNAS USA, 77:4030-34]에 기술된 바와 같이 공지된 방법에 의해 제조된다. 보통, 이러한 제조물중의 리포좀은 작은 (약 200-800 Angstrom) 단층 유형이며, 여기에서 지질 함량은 약 30mol% 콜레스테롤 보다 크고, 이러한 특정 %는 최적 치료를 제공하기 위해 조절된다. 유용한 리포좀은 예를 들어, 포스파티딜콜린, 콜레스테롤 및 PEG-유래된 포스파디틸에탄올아민(PEG-PE)를 포함하는 지질 조성물을 이용한 역상 증발법에 의해 생성될 수 있다. 필요에 따라, 리포좀은 특정 직경의 리포좀을 생성하기 위해 규정된 포어 크기의 필터를 통해 압출된다.In another embodiment, the sustained release composition comprises a polypeptide entrapped with liposomes. Liposomes are small vehicles composed of various types of lipids, phospholipids and / or surfactants. These components are typically arranged in bilayer form, similar to the lipid arrangement of biological membranes. Liposomes containing polypeptides are described, for example, in Epstein, et al. (1985) PNAS USA 82: 3688-92, and Hwang, et al., (1980) PNAS USA, 77: 4030-34. Usually, liposomes in such preparations are of a small (about 200-800 Angstrom) monolayer type, where the lipid content is greater than about 30 mol% cholesterol, and this particular percentage is adjusted to provide optimal treatment. Useful liposomes can be produced, for example, by reverse phase evaporation using a lipid composition comprising phosphatidylcholine, cholesterol and PEG-derived phosphaditylethanolamine (PEG-PE). If desired, liposomes are extruded through filters of defined pore size to produce liposomes of a particular diameter.
본 발명의 조성물은 국제 공고 WO 91/04014에 기술된 바와 같이 삽입될 수 있는 막 예컨대, 실라스틱 막상에 흡착된 폴리펩티드를 포함할 수 있다.The composition of the present invention may comprise a polypeptide adsorbed onto a membrane, such as a silicone membrane, which can be inserted as described in International Publication WO 91/04014.
본 발명의 면역원성 조성물은 예를 들어, 수용액 또는 냉동건조된 케이크를 포함하는 표준 형태로 저장될 수 있다. 이러한 조성물은 전형적으로 피검체에 투여될 경우 멸균된다. 수용액의 멸균은 멸균 여과막을 통해 여과에 의해 용이하게 달성된다. 조성물이 냉동건조된 형태로 저장되는 경우, 조성물은 냉동건조 및 재구성 전 또는 후에 여과될 수 있다.The immunogenic compositions of the invention can be stored in standard form, including, for example, aqueous solutions or lyophilized cakes. Such compositions are typically sterilized when administered to a subject. Sterilization of the aqueous solution is readily accomplished by filtration through sterile filtration membranes. If the composition is stored in lyophilized form, the composition may be filtered before or after lyophilization and reconstitution.
2. 폴리누클레오티드를 함유하는 면역원성 조성물2. Immunogenic Compositions Containing Polynucleotides
본 발명은 본 발명의 폴리펩티드를 엔코딩하는 단리된 폴리누클레오티드를 함유하는 면역원성 조성물을 제공한다. 이러한 조성물은 기타 성분 예를 들어, 저장액 예컨대, 적합한 완충액 예를 들어, 생리학적 완충액을 포함한다. 바람직한 구체예에서, 기타 성분은 상기 기술된 바와 같이 약제학적으로 허용되는 담체이다.The present invention provides immunogenic compositions containing isolated polynucleotides encoding polypeptides of the invention. Such compositions include other components such as stock solutions such as suitable buffers such as physiological buffers. In a preferred embodiment, the other component is a pharmaceutically acceptable carrier as described above.
폴리펩티드 항원으로의 전형적인 면역화에 대한 대안으로는 피검체의 조직 세포에 의한 항원 발현을 위해 피검체의 조직으로 항원을 엔코딩하는 폴리누클레오티드를 직접 생체내 유입시키는 것을 포함한다. 피검체를 백신화시키는데 사용된 폴리누클레오티드-기재 조성물은 "DNA 백신" 또는 "핵산-기재 백신"으로 일컬어진다. DNA 백신은 국제특허공고 WO 95/20660 및 국제특허공고 WO 93/19183에 기술되어 있다. 면역 반응을 유도하기 위해 바이러스 단백질을 엔코딩하는 직접적으로 주입된 DNA의 능력은 다양한 실험 시스템을 통해 입증되었다 (Conry et al., Cancer Res., 54:1164-1168 (1994); Cox et al., Virol, 67:5664-5667 (1993); Davis et al., Hum.Mole.Genet., 2:1847-1851 (1993); Sedegah et al., Proc.Natl.Acad.Sci., 91:9866-9870 (1994); Montgomery et al., DNA Cell Bio., 12:777-783 (1993); Ulmer et al., Science, 259:1745-1749 (1993); Wang et al., Proc.Natl.Acad.Sci., 90:4156-4160 (1993); Xiang et al., Virology, 199:132-140 (1994)). 인플루엔자 바이러스의 중화에서 이러한 기법을 평가하기 위한 연구는 엔벨로프 및 내부 바이러스 단백질 둘 모두를 사용하여 항체 생성을 유도하였으며, 특히 바이러스 헤마글루티닌 단백질 (HA)에 초점을 맞추었다 (Fynan et al., DNA Cell.Biol., 12:785-789 (1993A); Fynan et al., Proc.Natl.Acad.Sci., 90:11478-11482 (1993B); Robinson et al., Vaccine, 11:957, (1993); Webster et al., Vaccine, 12:1495-1498 (1994)). HIV env 단백질을 엔코딩하는 DNA를 직접적으로 유입시켜 보호성 면역 반응을 유도하는 백신화는 세포 매개된 반응 및 체액성 반응 둘 모두를 유도하며, 이는 살아있는 바이러스로 수득된 것과 유사한 반응이다 (Raz et al., Proc. Natl. Acad. Sci., 91:9519-9523 (1994); Ulmer, 1993, supra; Wang, 1993, supra; Xiang, 1994, supra). 또한, 동물의 수명 동안 본질적으로 지속되는 핵단백질을 엔코딩하는 DNA에 대한 재현가능한 면역 반응은 마우스에서 보고되었다 (Yankauckas et al., DNA Cell Biol., 12:771-776(1993)). DNA 백신은 면역 시스템의 상이한 암을 자극하기 위해 고안될 수 있다. 예를 들어, 주조직적합 항원군 I (MHC-I) 반응은 단백질 항원의 세포내 발현에 의해 우수하게 자극된다. 이를 달성하기 위해, gp120 시그널 서열을 엔코딩하는 서열이 결실되고, 개시제 메티오닌 잔기에 대한 코돈으로 대체된다. 시그널 서열 결여된 gp120 유전자는 세포질에서 유리된 리보좀상에서 합성되고, N-결합된 탄수화물을 초래하지 않으며, 세포내에서 단백질분해 처리되며, MHC-I-제한된 면역 반응을 자극할 수 있다. MHC-I 반응은 CD8-함유 T 세포에 의해 매개된 세포독성 T 세포 반응을 촉진하는데 특히 효과적인 것으로 여겨진다. 반대로, 시그널 서열을 함유하는 gp120 유전자가 포유동물 세포에서 발현되는 경우, 시그널 서열은 막-결합된 리보좀상에서의 합성을 유도하며, "분비 경로"로 전위되며, 여기서 세포 표면 또는 세포외 컴파트먼트로 배출하도록 예정된 단백질은 많은 후번역 변형 (예를 들어, 글리코실화)을 겪게되며, 주조직적합 항원군 II (MHC-II) 단백질과 함께 면역 시스템에 제공된다. MHC-II 항원과 관련하여 제시된 단백질 항원은 특히 항체 반응 및 CD4-매개된 T 세포 반응을 촉진하는데 효과적이나, CD8-의존적 면역 반응을 자극하는데는 효과적이지 않다 (예를 들어, 세포독성 T 림프구 [CTL]).Alternatives to typical immunization with polypeptide antigens include direct in vivo incorporation of a polynucleotide encoding the antigen into the tissue of the subject for expression of the antigen by the tissue cell of the subject. The polynucleotide-based composition used to vaccinate a subject is referred to as a "DNA vaccine" or a "nucleic acid-based vaccine." DNA vaccines are described in International Patent Publication WO 95/20660 and International Patent Publication WO 93/19183. The ability of directly injected DNA to encode viral proteins to induce an immune response has been demonstrated through various experimental systems (Conry et al., Cancer Res., 54: 1164-1168 (1994); Cox et al., Virol, 67: 5664-5667 (1993); Davis et al., Hum.Mole.Genet., 2: 1847-1851 (1993); Sedegah et al., Proc. Natl. Acad. Sci., 91: 9866- 9870 (1994); Montgomery et al., DNA Cell Bio., 12: 777-783 (1993); Ulmer et al., Science, 259: 1745-1749 (1993); Wang et al., Proc. Natl. Acad Sci., 90: 4156-4160 (1993); Xiang et al., Virology, 199: 132-140 (1994). Studies to evaluate this technique in neutralization of influenza viruses have used both envelope and internal viral proteins to induce antibody production, with a particular focus on viral hemagglutinin protein (HA) (Fynan et al., DNA Cell. Biol., 12: 785-789 (1993A); Fynan et al., Proc. Natl. Acad. Sci., 90: 11478-11482 (1993B); Robinson et al., Vaccine, 11: 957, ( 1993); Webster et al., Vaccine, 12: 1495-1498 (1994)). Vaccination, which directly induces a protective immune response by direct influx of DNA encoding the HIV env protein, leads to both cell-mediated and humoral responses, similar to those obtained with live viruses (Raz et al. , Proc. Natl. Acad. Sci., 91: 9519-9523 (1994); Ulmer, 1993, supra; Wang, 1993, supra; Xiang, 1994, supra). In addition, reproducible immune responses to DNA encoding nuclear proteins that persist essentially over the life of the animal have been reported in mice (Yankauckas et al., DNA Cell Biol., 12: 771-776 (1993)). DNA vaccines can be designed to stimulate different cancers of the immune system. For example, histocompatibility antigen group I (MHC-I) responses are well stimulated by intracellular expression of protein antigens. To accomplish this, the sequence encoding the gp120 signal sequence is deleted and replaced with a codon for the initiator methionine residue. The gp120 gene lacking a signal sequence is synthesized on ribosomes liberated in the cytoplasm, does not result in N-linked carbohydrates, is proteolytically processed in the cell and can stimulate an MHC-I-limited immune response. The MHC-I response is believed to be particularly effective in promoting cytotoxic T cell responses mediated by CD8-containing T cells. Conversely, when the gp120 gene containing the signal sequence is expressed in mammalian cells, the signal sequence induces synthesis on membrane-bound ribosomes and is translocated into the "secretory pathway" where the cell surface or extracellular compartment Proteins scheduled to be excreted undergo a number of posttranslational modifications (eg, glycosylation) and are provided to the immune system along with major histocompatibility antigen group II (MHC-II) proteins. Protein antigens shown in connection with MHC-II antigens are particularly effective for promoting antibody responses and CD4-mediated T cell responses, but not for stimulating CD8-dependent immune responses (eg, cytotoxic T lymphocytes [ CTL]).
당업계에 잘 알려진 바와 같이, 수많은 인자들이 항원 유전자의 발현 효율 및/또는 DNA 백신의 면역원성에 영향을 미칠 수 있다. 이러한 인자의 예로는 벡터, 항원 유전자 발현을 유도하는데 사용되는 프로모터, 및 플라스미드 내에서 삽입된 유전자의 안정성이 포함된다. 프로모터는 그 기원에 따라 조직 특이성 및 mRNA 합성을 개시하는 효율이 상이하다(Xiang et al., Virology, 209: 564-579 (1994); Chapman et al., Nucle. Acids. Res., 19: 3979-3986 (1991) ). 포유동물계에서 많은 DNA 백신은 사이토메갈로바이러스(CMV)로부터 유래된 바이러스 프로모터에 의존하여 왔다. 이들은 많은 포유동물종에서 근육 및 피부 접종에 있어서 양호한 효율을 나타내었다. As is well known in the art, a number of factors can affect the expression efficiency of antigen genes and / or the immunogenicity of DNA vaccines. Examples of such factors include vectors, promoters used to induce antigen gene expression, and stability of the inserted gene in the plasmid. Promoters differ in efficiency of initiating tissue specificity and mRNA synthesis depending on their origin (Xiang et al., Virology, 209: 564-579 (1994); Chapman et al., Nucle. Acids. Res., 19: 3979). -3986 (1991)). Many mammalian vaccines in mammals have relied on viral promoters derived from cytomegalovirus (CMV). They have shown good efficiency in muscle and skin inoculation in many mammalian species.
약제학적 사용을 위해, 본 발명의 폴리누클레오티드는 특정 적응증에 적합한 방식으로 제형화된다. 미국 특허 제6,001,651호(Bennett et al.)에는 올리고누클레오티드 치료제와 함께 사용하기에 적합한 많은 약제 조성물 및 이러한 올리고누클레오티드를 투여하는 방법에 관하여 기술되어 있다. 바람직한 구체예에서, 본 발명의 치료 조성물은 상기 기술된 바와 같이 지질과 조합된 폴리누클레오티드를 포함한다. For pharmaceutical use, the polynucleotides of the present invention are formulated in a manner suitable for a particular indication. US Patent No. 6,001,651 (Bennett et al.) Describes a number of pharmaceutical compositions suitable for use with oligonucleotide therapeutics and methods of administering such oligonucleotides. In a preferred embodiment, the therapeutic composition of the invention comprises a polynucleotide in combination with a lipid as described above.
폴리누클레오티드를 함유하는 조성물은, 예를 들어 수용액 또는 동결건조된 케이크를 포함하는 표준 형태로 저장될 수 있다. 이러한 조성물은 일반적으로 세포 또는 피검체에 투여될 때 멸균된다. 수용액의 멸균은 멸균 여과막을 통한 여과에 의해 용이하게 수행된다. 조성물이 동결건조된 형태로 저장되는 경우, 조성물은 동결건조 및 재구성 이전 또는 이후에 여과될 수 있다.Compositions containing polynucleotides may be stored in standard form, including, for example, aqueous solutions or lyophilized cakes. Such compositions are generally sterile when administered to cells or subjects. Sterilization of the aqueous solution is easily carried out by filtration through sterile filtration membranes. If the composition is stored in lyophilized form, the composition may be filtered before or after lyophilization and reconstitution.
3. 다른 성분3. Other Ingredients
상기에서 기술된 성분에 부가하여, 본 발명의 면역원성 조성물은 하나 이상의 애쥬번트(adjuvant)를 포함할 수 있다. 전형적인 애쥬번트로는, 프로인트 완전 애쥬번트(complete Freund's adjuvant), 프로인트 불완전 애쥬번트(incomplete Freund's adjuvant), 사포닌, 광물성 겔, 예를 들어 수산화알루미늄, 계면활성물질, 예를 들어 라이소레시틴, 플루로닉 폴리올(pluronic polyols), 폴리음이온(polyanions), 펩티드, 오일 또는 탄화수소 에멀션, 키홀 림펫 헤모시아닌, 디니트로페놀, 및 BCG (bacille Calmette-Guerin) 및 코르네박테리움 파르붐(Corynebacterium parvum)이 포함되나, 이에 한정되지 않는다. 근년에, 새로운 부류의 애쥬번트인 면역자극성 올리고누클레오티드가 기술되었다. 이들은 구아닌 및 시토신을 다량 함유한(GC 풍부) 화학합성된 소형(약 15량체 내지 30량체) 올리고누클레오티드인 경향이 있다. 일부 경우에(예를 들어, B형 간염 표면 항원과 같은 미립자 항원), 이들 올리고누클레오티드는 면역자극 효과를 갖도록 바이러스 항원과 간단히 혼합될 수 있다. 다른 경우에는(예를 들어, gp120), 올리고누클레오티드를 단백질에 화학적으로 결합시킴으로써 우수한 활성이 얻어진다. 본 출원에서 기술된 gp120 분자는 올리고누클레오티드와 화학적으로 결합하는 데에 이용될 수 있는 쌍을 이루지 않는 시스테인으로 인해, 면역자극성 올리고누클레오티드에 의한 유도체화에 특히 적합하다. 애쥬번트의 선택은 백신접종 대상 피검체에 의존한다. 바람직하게는, 약제학적으로 허용되는 애쥬번트가 사용된다. 인간 피검체를 위한 바람직한 애쥬번트는 명반(알루미나 겔)이다.In addition to the components described above, the immunogenic compositions of the present invention may comprise one or more adjuvants. Typical adjuvants include Freund's complete adjuvant, Freund's incomplete adjuvant, saponins, mineral gels such as aluminum hydroxide, surfactants such as lysolecithin, Pluronic polyols, polyanions, peptides, oils or hydrocarbon emulsions, keyhole limpet hemocyanin, dinitrophenols, and BCG (bacille Calmette-Guerin) and Cornebacterium parvum ), But is not limited thereto. In recent years, a new class of adjuvant, immunostimulatory oligonucleotides, has been described. They tend to be small (about 15 to 30-mer) oligonucleotides that are chemically synthesized (GC rich) containing large amounts of guanine and cytosine. In some cases (eg, particulate antigens such as hepatitis B surface antigen), these oligonucleotides may be simply mixed with viral antigens to have an immunostimulatory effect. In other cases (eg gp120), good activity is obtained by chemically binding oligonucleotides to proteins. The gp120 molecules described in this application are particularly suitable for derivatization with immunostimulatory oligonucleotides due to the unpaired cysteines that can be used to chemically bind to the oligonucleotides. The choice of adjuvant depends on the subject to be vaccinated. Preferably, pharmaceutically acceptable adjuvants are used. Preferred adjuvants for human subjects are alum (alumina gel).
다른 항원, 예를 들어 다른 HIV-1 폴리펩티드(예를 들어, gp41)가 본 발명의 면역원성 조성물에 포함될 수 있다.Other antigens, such as other HIV-1 polypeptides (eg gp41), can be included in the immunogenic compositions of the invention.
면역원성 조성물은 본 발명의 폴리펩티드에 대한 항-gp120 면역 반응을 증강시키거나 다른 잇점을 제공하는 다른 폴리펩티드를 포함할 수 있다. 예를 들어, 특정 구체예에서, 본 발명의 면역원성 조성물중에 인터페론 또는 인터루킨과 같은 사이토카인 부류 중 하나 이상의 성분을 포함시키는 것이 유리할 수 있다.The immunogenic composition may comprise other polypeptides that enhance or provide other advantages for the anti-gp120 immune response against the polypeptides of the invention. For example, in certain embodiments, it may be advantageous to include one or more components of the cytokine class, such as interferon or interleukin, in the immunogenic compositions of the present invention.
바람직한 구체예에서, 본 발명의 단리된 폴리누클레오티드를 함유하는 조성물은 폴리누클레오티드가 세포내에 유입되는 것을 용이하게 하는 성분을 포함한다. 폴리누클레오티드의 세포내 전달을 용이하게 하는 성분은 잘 알려져 있으며, 예를 들어 지질, 리포좀, 유-수 에멀션, 폴리에틸렌 이민 및 덴드라이머(dendrimer)를 포함하며, 이들은 본 발명에 따른 조성물에 사용될 수 있다. 지질은 이러한 타입중 가장 널리 사용되는 성분이며, 이용될 수 있는 지질 또는 지질 제형은 본 발명의 폴리누클레오티드와 함께 사용될 수 있다. 일반적으로, 양이온성 지질이 바람직하다. 바람직한 양이온성 지질은 N-[l-(2,3-디올레일옥시)프로필]-n,n,n-트리메틸암모늄클로라이드 (DOTMA), 디올레일 포스파티딜에탄올아민 (DOPE), 및/또는 디올레일 포스파티딜콜린(DOPC)을 포함한다. 상기에서 폴리펩티드에 대해 기술된 바와 같이, 폴리누클레오티드도 리포좀에 포획될 수 있다.In a preferred embodiment, the composition containing the isolated polynucleotide of the present invention comprises a component that facilitates the introduction of the polynucleotide into the cell. Components that facilitate intracellular delivery of polynucleotides are well known and include, for example, lipids, liposomes, oil-in-water emulsions, polyethylene imines and dendrimers, which can be used in the compositions according to the invention. have. Lipids are the most widely used component of this type, and the lipids or lipid formulations that can be used can be used with the polynucleotides of the present invention. In general, cationic lipids are preferred. Preferred cationic lipids are N- [l- (2,3-dioleyloxy) propyl] -n, n, n-trimethylammoniumchloride (DOTMA), dioleyl phosphatidylethanolamine (DOPE), and / or dioleyl phosphatidylcholine (DOPC). As described for polypeptides above, polynucleotides can also be captured in liposomes.
다른 구체예에서, 폴리누클레오티드는 세포를 형질감염시키는 데에 사용될 수 있는 덴드라이머와 착체를 형성한다. 덴드라이머 폴리양이온은, 올리고머 및/또는 중합체를 추가하고 양(+)으로 변화된 외부 표면을 제공하는 반복 반응 서열에 의해 코어 분자 또는 고안된 개시물질상에 전형적으로 형성되는 3차원의 매우 정돈된 올리고머 및/또는 중합체 화합물이다. 적합한 덴드라이머에는 "스타버스트(starburst)" 덴드라이머 및 다양한 덴드라이머 폴리양이온이 포함되지만, 이에 한정되지 않는다. 덴드라이머의 제조 방법 및 생체 내에서 세포에 폴리누클레오티드를 도입하기 위해 덴드라이머를 사용하는 것은 당업자에게 잘 알려져 있으며, 예를 들어 PCT/US83/02052 및 미국 특허 제4,507,466호; 제4,558,120호; 제4,568,737호; 제4,587,329호; 제4,631,337호; 제4,694,064호; 제4,713,975호; 제4,737,550호; 제4,871,779호; 제4,857,599호; 및 제5,661,025호에 상세히 기술되어 있다. In another embodiment, the polynucleotides form a complex with a dendrimer that can be used to transfect cells. Dendrimer polycations are three-dimensional, highly ordered oligomers typically formed on core molecules or designed initiators by repeating reaction sequences that add oligomers and / or polymers and provide a positively altered outer surface. And / or polymeric compounds. Suitable dendrimers include, but are not limited to, "starburst" dendrimers and various dendrimer polycations. Methods of making dendrimers and the use of dendrimers to introduce polynucleotides into cells in vivo are well known to those skilled in the art, see, for example, PCT / US83 / 02052 and US Pat. No. 4,507,466; 4,558,120; 4,558,120; No. 4,568,737; 4,587,329; 4,631,337; 4,694,064; No. 4,713,975; No. 4,737,550; 4,871,779; 4,857,599; And 5,661,025.
B. 면역원성 조성물의 용도B. Use of Immunogenic Compositions
본 발명의 면역원성 조성물은 동물에서 gpl20-특이적 면역 반응을 일으키는 데에 사용될 수 있다. 본 발명의 면역원성 조성물은 이하에서 더 상세히 설명되는 바와 같이, 적합한 투여 경로에 의해 동물에게 투여될 수 있다.The immunogenic compositions of the invention can be used to elicit gpl20-specific immune responses in animals. The immunogenic compositions of the invention can be administered to the animal by a suitable route of administration, as described in more detail below.
한 구체예에서, 면역원성 조성물은 항-gpl20 항체, 예를 들어 HIV 연구에 유용한 항체를 생성하기 위하여 동물에 투여된다. 일반적으로, 동물은 항체 생성에 전형적으로 사용되는 것이다. 포유동물(예를 들어, 설치류, 토끼, 염소, 양 등)이 바람직하다. In one embodiment, the immunogenic composition is administered to an animal to produce an anti-gpl20 antibody, eg, an antibody useful for HIV studies. In general, animals are those typically used for antibody production. Mammals (eg, rodents, rabbits, goats, sheep, etc.) are preferred.
폴리클로날 항체는 항원성 폴리펩티드를 적합한 동물(예를 들어, 마우스 또는 토끼)에 주사(예를 들어, 피하 또는 근육내 주사)하여 생성된다. 그 다음, 동물로부터 채취한 혈액 샘플로부터 항체를 얻는다. 폴리클로날 항체를 생성시키는 데에 사용되는 기술은 문헌(참조: Methods of Enzymology,"Production of Antisera With Small Doses of Immunogen: Multiple Intradermal Injections", Langone, et al. eds. (Acad. Press, 1981))에 광범위하게 기술되어 있다. 동물에 의해 생성된 폴리클로날 항체를, 예를 들어 폴리펩티드(이 폴리펩티드에 대해 항체가 생성됨)가 결합되어 있는 매트릭스에 결합시키고, 용리시켜서, 추가로 정제시킬 수 있다. 당업자는 폴리클로날 및 모노클로날 항체의 정제 및/또는 농축에 대한 다양한 표준 기술을 숙지하고 있다(참조: Coligan, et al.(1991) Unit 9, Current Protocols in Immunology, Wiley Interscience). Polyclonal antibodies are produced by injection (eg subcutaneous or intramuscular injection) of an antigenic polypeptide into a suitable animal (eg mouse or rabbit). The antibody is then obtained from a blood sample taken from the animal. Techniques used to generate polyclonal antibodies are described in Methods of Enzymology, "Production of Antisera With Small Doses of Immunogen: Multiple Intradermal Injections", Langone, et al. Eds. (Acad. Press, 1981). ) Are extensively described. Polyclonal antibodies produced by an animal can be further purified, for example, by binding to the matrix to which the polypeptide (an antibody is generated for this polypeptide) is bound, eluting and so on. Those skilled in the art are familiar with various standard techniques for the purification and / or enrichment of polyclonal and monoclonal antibodies (Coligan, et al. (1991)
많은 용도에서, 모노클로날 항-gp120 항체가 바람직하다. mAb를 분비하는 하이브리도마의 생산에 사용되는 일반법은 잘 알려져 있다(Kohler and Milstein (1975) Nature, 256: 495). 간단하게는, 쾰러 및 밀스타인(Kohler and Milstein)에 의해 기술된 바와 같이, 상기 기술은 5명의 개개의 암환자의 국부 배출 림프절로부터 림프구를 단리하고, 이 세포를 풀링하고, 세포를 SHFP-1와 융합시키는 것을 필요로 하였다. 암세포주에 결합된 항체를 생산하기 위하여 하이브리도마를 스크리닝하였다. 통상적인 스크리닝 기술(예를 들어, 효소결합면역흡수검정 즉 "ELISA")를 사용하여 mAb간의 특이성을 확인하여 관심있는 mAb의 기본 반응 패턴을 결정할 수 있다. In many applications, monoclonal anti-gp120 antibodies are preferred. General methods used for the production of hybridomas that secrete mAbs are well known (Kohler and Milstein (1975) Nature, 256: 495). Briefly, as described by Kohler and Milstein, the technique isolates lymphocytes from local draining lymph nodes of five individual cancer patients, pools them, and stores the cells with SHFP-1. It was necessary to fuse. Hybridomas were screened to produce antibodies bound to cancer cell lines. Conventional screening techniques (eg, enzyme-linked immunosorbent assay, or “ELISA”) can be used to determine specificity between mAbs to determine the basic response pattern of the mAb of interest.
대안적으로, 본 발명의 면역원성 조성물은 인간 피검체에게 투여하기 위한 백신으로 사용될 수 있다. 특히, 조성물은 HIV-1으로 감염되지 않은 개체에게 투여되어, 감염증의 위험을 감소시키거나 감염증을 방지(HIV-1 감염의 예방)할 수 있다. 보건 전문가, 경찰관, 및 소방관과 같은 개인들은 본 발명의 백신의 예방적 투여로부터 이득을 얻을 수 있다. 조성물은 또한 HIV-1으로 이미 감염되어 있으나, 아직 면역 반응을 개시할 수 있는 개체에게 투여될 수도 있다(참조: Salk, Nature 327: 473-476 (1987); Salk et al., Science 195: 834-847 (1977) ). 소위 "치료용 백신"은 기존 감염증의 개선(예를 들어, 피검체의 상태를 개선시키거나, 질병 진행을 늦추거나 방지함으로써) 및/또는 추가의 HIV-1 균주에 의한 감염에 대한 예방을 제공할 수 있다. Alternatively, the immunogenic compositions of the present invention can be used as a vaccine for administration to a human subject. In particular, the composition can be administered to a subject not infected with HIV-1 to reduce the risk of infection or to prevent infection (prevention of HIV-1 infection). Individuals such as health professionals, police officers, and firefighters can benefit from the prophylactic administration of the vaccines of the present invention. The composition may also be administered to a subject already infected with HIV-1 but still capable of initiating an immune response (Salk, Nature 327: 473-476 (1987); Salk et al., Science 195: 834). -847 (1977)). So-called “therapeutic vaccines” provide for the improvement of existing infections (eg, by improving the subject's condition, by slowing or preventing disease progression) and / or by preventing further infection with HIV-1 strains. can do.
1. 폴리펩티드를 함유하는 면역원성 조성물1. Immunogenic Compositions Containing Polypeptides
폴리펩티드계 면역원성 조성물은 편리하게 주사로 투여될 수 있으나(예를 들어, 피하, 피내, 근육내, 복강내, 정맥내 등), 카테테르 또는 다른 수술 튜브를 통한 전달도 생각할 수 있다. 대체 경로는 경구 투여(정제 등) 및 흡입(예를 들어, 액체 제형 또는 동결건조 또는 에어로졸 제형을 위해 시판되는 네뷸라이저 사용) 투여를 포함한다. 폴리펩티드 조성물은 마이크로스피어, 리포좀, 면역자극복합체(immune-stimulating complexes: ISCOMs), 또는 적절한 조직(예를 들어, 혈액)에 도입된 다른 마이크로입자 전달 시스템 또는 서방출 제형에 의해 투여될 수도 있다. Polypeptide-based immunogenic compositions can be conveniently administered by injection (eg, subcutaneously, intradermal, intramuscular, intraperitoneal, intravenous, etc.), but delivery through catheters or other surgical tubes is also contemplated. Alternative routes include oral administration (tablets and the like) and inhalation (eg, using commercially available nebulizers for liquid formulations or lyophilization or aerosol formulations). The polypeptide composition may be administered by microspheres, liposomes, immuno-stimulating complexes (ISCOMs), or other microparticle delivery systems or sustained release formulations introduced into appropriate tissues (eg, blood).
면역원성 조성물로 투여되는 gpl20 폴리펩티드의 백신접종 용량은 특정 조성물의 성질, 예를 들어 특정 제형의 면역원성, 투여경로, 면역접종법, 피검체의 상태 등에 의존하며, 특정 상황에 적합한 용량의 결정은 당업자의 기술 수준내에 있다. 일반적으로, 투여당 300㎍의 gpl20 폴리펩티드 용량이 가장 바람직하지만, 바람직한 용량은 투여당 약 10㎍ 내지 1mg의 범위일 수 있고, 이 바람직한 범위 이외의 용량은 특정 제형, 투여 경로(예를 들어, 근육내 vs 피하), 및/또는 면역접종법에 따라 사용될 수 있다. 상이한 투여량이 연속적인 일련의 접종에 사용될 수 있다. 따라서, 개업의는 1차 접종에 비교적 고용량을 투여한 후, 비교적 소용량의 gp120 폴리펩티드로 추가 접종할 수 있다.Vaccination doses of gpl20 polypeptides administered in immunogenic compositions depend on the nature of the particular composition, eg, immunogenicity of the particular formulation, route of administration, immunization, the condition of the subject, and the like. Is within the technical level of In general, a dose of 300 μg gpl20 polypeptide per dose is most preferred, but a preferred dose may range from about 10 μg to 1 mg per dose, and doses outside this preferred range may be determined by a particular formulation, route of administration (eg, muscle Intra vs subcutaneous), and / or immunization. Different dosages may be used for successive series of inoculations. Thus, a practitioner can administer a relatively high dose at the first inoculation, followed by booster inoculation with a relatively small dose of gp120 polypeptide.
2. 폴리누클레오티드를 함유하는 면역원성 조성물2. Immunogenic Compositions Containing Polynucleotides
본 발명의 폴리누클레오티드계 면역원성 조성물은 코딩된 폴리펩티드를 피검체의 생체 내에서 발현시켜, 코딩된 폴리펩티드에 대한 면역 반응을 일으키는 데에 사용될 수 있다. 벤베니스티 등(Benvenisty, N., and Reshef, L. PNAS 83, 95519555, (1986))은 마우스에 복강내(i.p.), 정맥내(i.v.), 또는 근육내(i.m.)로 도입된 CaPO4-침강 DNA가 발현될 수 있었음을 입증하였다. 마우스에 DNA 발현 벡터를 i.m. 주입한 결과 근육 세포에 DNA가 흡수되고 DNA에 의해 코딩된 단백질이 발현되었다. 플라스미드는 에피좀에 유지되고 복제되지 않았다. 그 후, 랫트, 어류, 및 영장류의 골격근 및 랫트의 심근에 i.m. 주사후 지속적인 발현이 관찰되었다. W090/11092 (1990.10.4)에는 척추동물에 백신접종을 하기 위해 네이키드(naked) 폴리누클레오티드를 사용하는 것이 기술되어 있다.The polynucleotide based immunogenic composition of the present invention can be used to express a encoded polypeptide in vivo in a subject, thereby generating an immune response against the encoded polypeptide. Benvenisty et al. (Benvenisty, N., and Reshef, L. PNAS 83, 95519555, (1986)) show that CaPO 4 introduced into mice intraperitoneally (ip), intravenously (iv), or intramuscular (im). It was demonstrated that precipitated DNA could be expressed. Im injection of the DNA expression vector into the mouse resulted in uptake of DNA into muscle cells and expression of the protein encoded by the DNA. Plasmids were maintained in episomes and did not replicate. Subsequently, sustained expression was observed after im injection into rat, fish, and primate skeletal muscle and rat myocardium. WO 90/11092 (1990.10.4) describes the use of naked polynucleotides for vaccination of vertebrates.
다양한 방법들이 동물에 폴리누클레오티드를 도입하는 데에 사용될 수 있으며, 특정 폴리누클레오티드를 동물에 도입하는 데에 적합한 방법의 선택은 당업계의 기술 수준 내에 있다. 예를 들어, 마우스에서 항-BGH 항체를 생성하기 위해, 소의 성장 호르몬(BGH)을 코딩하는 DNA로 코팅된 금 마이크로프로젝틸(microprojectiles)을 마우스의 피부에 도입하였다. 살아있는 동물의 피부, 근육, 지방 및 유방 조직을 형질감염시키기 위해 제트 주입기가 사용되어 왔다. 마우스에 DNA:양이온성 리포좀 복합체를 정맥내 주사하는 것은 츄 등(Zhu et al., Science 261: 209-211 (Jul. 9,1993))에 의해 보고되었고, 클로닝된 형질전환 유전자가 전신 발현되었다. 울머 등(Ulmer et al., Science 259: 1745-1749, (1993))은 인플루엔자 바이러스 단백질을 코딩하는 DNA의 근육 주사에 의한 인플루엔자 바이러스 감염의 이종성 보호에 관하여 보고하였다. WO 93/17706호에는 바이러스에 대하여 동물에 백신접종을 하는 방법으로서, 담체 입자가 유전자 구성물로 코팅되고, 상기 코팅된 입자가 동물의 세포 내로 가속되는 방법이 기술되어 있다. "유전자총(gene-gun)을 사용한 플라스미드의 고속 접종은 마우스의 면역 반응을 증가시켰으며(Fynan, 1993B, supra; Eisenbraun et al. , DNA Cell Biol., 12: 791-797 (1993) ), 이는 아마도 DNA 형질감염 효율이 크고 수지상 세포에 의한 항원 제시가 더 효과적이기 때문일 것이다. 본 발명의 폴리누클레오티드는 당업계에 공지된 다른 방법, 예를 들어 형질감염(transfection), 일렉트로포레이션(electroporation), 마이크로주입(microinjection), 트랜스덕션(transduction), 세포 융합, DEAE 덱스트란, 인산 칼슘 침강, 리포펙션(lipofection)(리포좀 융합), 또는 DNA 벡터 운반기에 의해 도입될 수 있다(참조: Wu et al. , J. Biol. Chem. 267: 963-967 (1992); Wu and Wu, J. Biol. Chem. 263: 14621-14624(1988); Hartmut et al., 캐나다 특허 출원 제2,012,311호, 1990. 3. 15 출원). Various methods can be used to introduce polynucleotides into an animal, and the selection of a suitable method for introducing a specific polynucleotide into an animal is within the skill of the art. For example, to produce anti-BGH antibodies in mice, gold microprojectiles coated with DNA encoding bovine growth hormone (BGH) were introduced into the skin of mice. Jet injectors have been used to transfect skin, muscle, fat and breast tissue of living animals. Intravenous injection of DNA: cationic liposome complexes into mice was reported by Chu et al. (Zhu et al., Science 261: 209-211 (Jul. 9,1993)) and cloned transgenes were expressed systemically. . Ulmer et al., Science 259: 1745-1749, (1993) reported on the heterologous protection of influenza virus infection by intramuscular injection of DNA encoding influenza virus proteins. WO 93/17706 describes a method of vaccinating an animal against a virus, wherein the carrier particles are coated with a gene construct and the coated particles are accelerated into the cells of the animal. "High-speed inoculation of plasmids with a gene-gun increased the immune response in mice (Fynan, 1993B, supra; Eisenbraun et al., DNA Cell Biol., 12: 791-797 (1993)). This is probably because the efficiency of DNA transfection and antigen presentation by dendritic cells is more effective The polynucleotides of the present invention may be used in other methods known in the art, for example, transfection, electroporation. , Microinjection, transduction, cell fusion, DEAE dextran, calcium phosphate sedimentation, lipofection (liposome fusion), or DNA vector transporters (Wu et al.). , J. Biol. Chem. 267: 963-967 (1992); Wu and Wu, J. Biol. Chem. 263: 14621-14624 (1988); Hartmut et al., Canadian Patent Application No. 2,012,311, 1990. 3. 15 applications).
면역원성 조성물로 투여되는 gp120 폴리누클레오티드의 백신접종 용량은 특정 조성물의 성질, 예를 들어 특정 제형의 면역원성, 투여 경로, 면역접종법, 피검체의 상태 등에 의존하며, 특정 상황에 적합한 용량의 결정은 당업계의 기술 수준 내에 있다. 일반적으로, 투여당 약 1 내지 100mg의 gpl20 폴리펩티드 용량이 바람직하다. 상이한 투여량이 일련의 연속 접종에 사용될 수 있다. 따라서, 임상의는 1차 접종에 비교적 고용량을 투여한 다음, 비교적 소용량의 gp120 폴리누클레오티드를 추가 접종할 수 있다.Vaccination doses of gp120 polynucleotides administered as immunogenic compositions depend on the nature of the particular composition, eg, immunogenicity of the particular formulation, route of administration, immunization, the condition of the subject, and so on. Are within the skill of the art. Generally, a dosage of about 1 to 100 mg gpl20 polypeptide per administration is preferred. Different dosages may be used for a series of serial inoculations. Thus, the clinician may administer a relatively high dose at the first inoculation and then boost the relatively small dose of gp120 polynucleotide.
3. 면역접종법3. Immunization
gp120 특이적 면역 반응은 본 발명의 면역원성 조성물로 피검체를 1회 이상 접종함으로써 일어날 수 있다. 최초의 접종은 "1차 접종"으로 불리우고, 후속 접종은 "추가 접종"으로 불리운다. 추가 접종은 일반적으로 면역 반응을 증가시키며, 1회 이상의 추가 접종을 포함하는 면역접종법이 바람직하다. 상기에 설명된 모든 유형의 면역원성 조성물이 1차 또는 추가 면역접종에 사용될 수 있다. 따라서, 예를 들어 본 발명의 폴리누클레오티드를 함유하는 면역원성 조성물(예를 들어, 바이러스 유래 백신)를 1차 면역접종에 사용한 다음, 본 발명의 폴리펩티드를 함유하는 면역원성 조성물로 추가 접종할 수 있고, 역으로도 가능하다. 또한, 1차 면역접종 및 1회 이상의 추가 면역접종은 동일한 항원성 gp120 서열 및/또는 상이한 항원성 gp120 서열을 제공할 수 있다. The gp120 specific immune response may occur by inoculating one or more subjects with an immunogenic composition of the invention. The first inoculation is called "primary inoculation" and the subsequent inoculation is called "boost inoculation". A booster dose generally increases the immune response, and immunizations involving one or more booster doses are preferred. All types of immunogenic compositions described above can be used for primary or additional immunizations. Thus, for example, an immunogenic composition containing a polynucleotide of the invention (e.g., a virus derived vaccine) can be used for primary immunization followed by further inoculation with an immunogenic composition containing a polypeptide of the invention, The reverse is also possible. In addition, primary immunization and one or more additional immunizations may provide the same antigenic gp120 sequence and / or different antigenic gp120 sequences.
예시적 구체예에서, 적합한 면역접종법은 본 발명의 하나 이상의 면역원성 조성물에 의한 3회 이상의 별개의 접종을 포함하며, 2차 접종은 1차 접종 이후 약 2주 이상 후에, 바람직하게는 3주 내지 8주 후에, 더욱 바람직하게는 약 4주 후에 투여된다. 일반적으로, 3차 접종은 2차 접종 후 수개월 후에, 바람직하게는 1차 접종 후 약 5개월 이상 후에, 더 바람직하게는 1차 접종 후 약 6개월 내지 약 2년 후에, 더욱더 바람직하게는 1차 접종 후 약 8개월 내지 약 1년 후에 투여된다. 3회를 초과하는 주기적인 접종은 피검체의 "면역 기억"을 증가시키는 데에 바람직하다(참조: Anderson et al., J. Infectious Diseases 160 (6): 960-969 (Dec. 1989)). In an exemplary embodiment, suitable immunizations comprise three or more separate inoculations with one or more immunogenic compositions of the invention, wherein the second inoculation is at least about two weeks after the first inoculation, preferably from three weeks to After 8 weeks, more preferably after about 4 weeks. Generally, the third inoculation is months after the second inoculation, preferably at least about 5 months after the first inoculation, more preferably about 6 months to about 2 years after the first inoculation, even more preferably the first From about 8 months to about 1 year after inoculation. More than three periodic inoculations are desirable to increase the subject's "immune memory" (Anderson et al., J. Infectious Diseases 160 (6): 960-969 (Dec. 1989)).
선택된 면역접종 파라미터, 예를 들어 제형, 용량, 접종법 등의 적합도는 피검체로부터 혈청을 분취하여 면역접종 프로그램 과정 동안 항체 역가를 검정하여 결정될 수 있다. 대안적으로, T 세포 집단을 통상적인 방법으로 모니터링할 수 있다. 또한, 목적하는 효과, 예를 들어 HIV-1 감염 또는 AIDS로의 진행의 예방, 질병 상태의 개선(예를 들어, 바이러스 부하의 감소), 또는 감염되지 않은 상대에 대한 전염 빈도의 감소에 대해 피검체의 임상 상태를 모니터링한다. 이러한 모니터링이 백신접종이 최적이 아님을 나타내는 경우, 피검체는 추가 용량의 면역원성 조성물로 추가 접종될 수 있고, 백신접종 파라미터는 면역 반응을 증강시킬 것으로 예상되는 방식으로 변화될 수 있다. 따라서, 예를 들어, gp120 폴리펩티드 또는 폴리누클레오티드 및/또는 애쥬번트의 용량이 증가할 수 있거나, gp120 폴리펩티드가 면역원성 담체 단백질과 결합되거나 이와 복합체를 형성할 수 있거나, 투여 경로가 변화될 수 있다.The suitability of the selected immunization parameters, such as formulation, dose, inoculation, etc., can be determined by assaying antibody titers during the course of the immunization program by aliquoting serum from the subject. Alternatively, the T cell population can be monitored by conventional methods. In addition, subjects may be tested for the desired effect, such as prevention of HIV-1 infection or progression to AIDS, improvement of disease state (e.g., reduction of viral load), or reduction of transmission frequency to an uninfected partner. Monitor the clinical condition of the If such monitoring indicates that vaccination is not optimal, the subject may be boosted with additional doses of the immunogenic composition, and the vaccination parameters may be changed in a way that is expected to enhance the immune response. Thus, for example, the dose of a gp120 polypeptide or polynucleotide and / or adjuvant may be increased, the gp120 polypeptide may bind to or form a complex with an immunogenic carrier protein, or the route of administration may be altered.
진단 방법Diagnostic method
A. 비정상적 이황화물 구조를 갖는 gp120 폴리펩티드에 특이적인 항체의 검출A. Detection of Antibodies Specific to gp120 Polypeptides with Abnormal Disulfide Structures
본 발명은 피검체가 본 발명의 GP120에 특이적인 항체를 생성하였는지 여부를 결정하기 위한 진단 방법을 제공한다. 이 방법은 피검체로부터 얻은 생물학적 샘플을 관심있는 GP120 폴리펩티드와 접촉시키고, 샘플이 GP120 폴리펩티드와 특이적으로 결합하는 항체를 함유하고 있는지 여부를 결정하는 것을 필요로 한다. 사용된 샘플은 항체를 함유하는 임의의 조직을 포함할 수 있으나, 가장 편리한 것은 혈액이나 혈액 분획(예를 들어, 혈청 또는 혈장)이다.The present invention provides a diagnostic method for determining whether a subject produced an antibody specific for GP120 of the present invention. This method requires contacting a biological sample obtained from a subject with a GP120 polypeptide of interest and determining whether the sample contains an antibody that specifically binds to the GP120 polypeptide. The sample used may comprise any tissue containing the antibody, but the most convenient is the blood or blood fraction (eg serum or plasma).
항-GP120 항체는 공지된 수많은 면역검정을 사용하여 샘플 내에서 검출 및/또는 정량될 수 있다(참조: 미국 특허 제4,366,241호; 제4,376,110호; 제4,517,288호; 및 제4,837,168호). 면역검정에 대한 일반적 개관에 관하여는 문헌(Cell Biology Volume 37: Antibodies in Cell Biology, Asai, ed. Academic Press, Inc. New York (1993); Basic and Clinical Immunology 7th Edition, Stites & Terr, eds. (1991))을 참조할 수 있다. Anti-GP120 antibodies can be detected and / or quantified in a sample using a number of known immunoassays (see US Pat. Nos. 4,366,241; 4,376,110; 4,517,288; and 4,837,168). For a general overview of immunoassays, see Cell Biology Volume 37: Antibodies in Cell Biology, Asai, ed.Academic Press, Inc. New York (1993); Basic and Clinical Immunology 7th Edition, Stites & Terr, eds. 1991).
표준 고체상 형식으로, gp120 폴리펩티드에 특이적인 샘플 내의 항체를 고정하는 포획제로서 작용하는 고체상에 관심있는 gp120 폴리펩티드를 부착할 수 있다. 결합된 항체는 그 후 간단한 세척 단계에 의해 샘플 내의 유리된 항체와 분리될 수 있다. In standard solid phase format, one can attach the gp120 polypeptide of interest to a solid phase that acts as a capture agent to immobilize antibodies in a sample specific for the gp120 polypeptide. The bound antibody can then be separated from the free antibody in the sample by a simple washing step.
면역검정은 전형적으로 gp120 폴리펩티드 및 gp120 폴리펩티드에 특이적인 샘플 내의 항체에 의해 형성되는 결합 복합체에 특이적으로 결합하여 표지하는 표지용 시제를 사용한다. 직접적이거나 간접적인 임의의 적합한 표지 시스템이 사용될 수 있다. 예를 들어, 시험 대상인 피검체의 종에 특이적인 표지된 항체는 고체상에 결합한 항체를 표지하는 데 사용될 수 있다. 면역글로불린 불변 영역과 특이적으로 결합할 수 있는 다른 폴리펩티드, 예를 들어 폴리펩티드 A 또는 폴리펩티드 G 또한 표지용 시제로서 사용될 수 있다. 이러한 폴리펩티드는 연쇄구균의 세포벽의 통상적인 구성성분이다. 이들은 다양한 종으로부터의 면역글로불린 불변 영역과의 강한 비-면역원 반응성을 나타낸다(참조: Kronval, et al. (1973) J. Immunol., 111: 1401-1406, and Akerstrom (1985) J. Immunol., 135: 2589-2542). 적합한 표지는 본 발명의 gp120 폴리펩티드를 표지하는 것에 관하여 상기에서 논의된 것들을 포함한다.Immunoassays typically use a labeling reagent that specifically binds and labels a gp120 polypeptide and a binding complex formed by an antibody in a sample specific for the gp120 polypeptide. Any suitable labeling system, direct or indirect, can be used. For example, labeled antibodies specific for the species of the subject under test can be used to label antibodies bound to a solid phase. Other polypeptides that can specifically bind immunoglobulin constant regions, such as polypeptide A or polypeptide G, can also be used as labeling reagents. Such polypeptides are a common component of the cell wall of streptococci. They exhibit strong non-immunogen reactivity with immunoglobulin constant regions from various species (Kronval, et al. (1973) J. Immunol., 111: 1401-1406, and Akerstrom (1985) J. Immunol., 135: 2589-2542). Suitable labels include those discussed above with respect to labeling the gp120 polypeptide of the invention.
본 발명의 검정은 당업계에 잘 알려진 표준 방법에 따라 기록된다(포지티브 또는 네거티브, 또는 항-gp120 항체의 수량으로서). 특정 기록 방법은 검정 형식 및 표지의 선택에 의존한다. 예를 들어, 웨스턴 블롯 검정은 효소 표지에 의해 생성되는 발색된 생성물을 가시화하여 기록될 수 있다. 정확한 분자량에서 분명하게 보이는 발색된 밴드 또는 반점이 존재하면 포지티브 결과로서 기록되는 한편, 분명하게 보이는 반점이나 밴드가 부재하면 네거티브로 기록된다. 밴드 또는 반점의 강도는 항-gp120 항체 농도의 정량적 척도를 제공한다. Assays of the invention are recorded according to standard methods well known in the art (as positive or negative, or as quantities of anti-gp120 antibodies). The specific recording method depends on the assay format and the choice of label. For example, Western blot assays can be recorded by visualizing the colored product produced by enzyme labeling. The presence of colored bands or spots clearly visible at the correct molecular weight is recorded as a positive result, while the absence of clearly visible spots or bands is recorded negative. The intensity of the band or spot provides a quantitative measure of the anti-gp120 antibody concentration.
바람직한 구체예에서, 본 발명에 따른 면역검정은 마이크로일렉트로미케니칼(MicroElectroMechanical System: MEMS)를 이용하여 수행된다. MEMS는 기계, 광학, 및 유체 요소를 전자공학과 조합하여 관심있는 분석물의 편리한 검출을 가능하게 하는, 실리콘상에 집적된 현미경적 구조물이다. 본 발명에 사용하기에 적합한 예시적인 MEMS 장치는 프로티베리스 멀티캔티레버 어레이(Protiveris' multicantilever array)이다. 이 어레이는 특별하게 고안된 실리콘 마이크로캔티레버의 화학-기계적 작동 및 후속하는 마이크로캔티레버 편향의 광학적 검출에 근거한다. 한쪽 면이 단백질, 항체, 항원 또는 DNA 단편으로 코팅된 마이크로캔티레버를 상보적 분자를 함유하는 용액에 노출시키는 경우, 마이크로캔티레버가 구부러진다. 이러한 구부러짐은 결합 사건에 기인하는 표면 에너지의 변화로 인한 것이다. 구부러짐(편향) 정도의 광학적 검출은 마이크로캔티레버에 결합된 상보적인 분자의 양을 측정할 수 있도록 한다. In a preferred embodiment, the immunoassay according to the invention is carried out using a MicroElectroMechanical System (MEMS). MEMS are microscopic structures integrated on silicon that combine mechanical, optical, and fluid elements with electronics to enable convenient detection of analytes of interest. An exemplary MEMS device suitable for use with the present invention is a Protiveris' multicantilever array. This array is based on the chemical-mechanical actuation of specially designed silicon microcantilevers and the optical detection of subsequent microcantilever deflections. When a microcantilever coated on one side with a protein, antibody, antigen or DNA fragment is exposed to a solution containing complementary molecules, the microcantilever bends. This bending is due to the change in surface energy due to the binding event. Optical detection of the degree of bending (deflection) allows measuring the amount of complementary molecules bound to the microcantilever.
B. 비정상적 이황화물 구조를 갖는 gp120 서열의 검출B. Detection of gp120 Sequences with Abnormal Disulfide Structures
본 발명은 또한, 피검체로부터 얻은 생물학적 샘플이 비정상적 이황화물 구조를 특징으로 하는 gp120 아미노산 서열을 포함하는 폴리펩티드 및/또는 비정상적 이황화물 구조를 특징으로 하는 gp120 아미노산 서열을 코딩하는 폴리누클레오티드를 함유하는지 여부를 결정하기 위한 진단 방법을 제공한다. 비정상적 이황화물 구조는 북미에 유포되는 HIV-1의 새로운 주요 변종 또는 감염과 관련된 "전염 표현형(transmission phenotype)"을 나타낼 수 있다. 이 방법은 gp120 서열을 포함하는 폴리펩티드 또는 gp120 서열을 코딩하는 폴리누클레오티드에 대해 샘플을 검정하는 것을 필요로 하며, 여기서 gp120 서열은 HIV gp120의 HXB-2 균주로부터의 gp120의 N-말단 메티오닌부터 넘버링하여 (a) 54, 74, 119, 126, 131, 157, 196, 205, 218, 228, 239, 247, 296, 331, 378, 385, 418, 및 445번 중 하나 이상의 위치에서 하나 이상의 시스테인 잔기가 결여되어 있고/거나, (b) 24, 29, 34, 54, 74, 119, 126, 131, 157, 196, 205, 218, 228, 239, 247, 296, 331, 378, 385, 418, 445, 493, 495, 499-501, 503-508, 및 510번 이외의 위치에서 하나 이상의 추가의 시스테인 잔기를 포함한다. 바람직한 구체예에서, gp120 서열은 V1 도메인에 하나 이상의 추가의 시스테인을 가지는 서브타입 G gp120 서열 또는 V4 도메인에 하나 이상의 추가의 시스테인을 가지는 서브타입 E gp120 서열이 아니다. 본 발명의 진단 방법은 비정상적 이황화물 구조를 갖는 gp120 서열을 포함하는 폴리펩티드 및/또는 비정상적 이황화물 구조를 갖는 gp120 서열을 코딩하는 폴리누클레오티드의 존재를 검출할 수 있고, 임의로 정량할 수 있는 검정을 이용하여 수행될 수 있다. 사용된 샘플은 gp120 아미노산 서열을 포함하는 폴리펩티드 또는 gp120 아미노산 서열을 코딩하는 폴리누클레오티드를 함유하는 것으로 예상되는 조직을 포함할 수 있다. 편리하게는, 혈액 또는 혈액 분획(예를 들어, 혈청 또는 혈장)이 검정을 위해 샘플링된다.The invention also provides whether a biological sample obtained from a subject contains a polypeptide comprising a gp120 amino acid sequence characterized by an abnormal disulfide structure and / or a polynucleotide encoding a gp120 amino acid sequence characterized by an abnormal disulfide structure. It provides a diagnostic method for determining. Abnormal disulfide structure may represent a "transmission phenotype" associated with a new major strain or HIV infection spreading in North America. This method requires assaying a sample for a polypeptide comprising the gp120 sequence or a polynucleotide encoding the gp120 sequence, wherein the gp120 sequence is numbered from the N-terminal methionine of gp120 from the HXB-2 strain of HIV gp120 (a) at least one cysteine residue at one or more of
1. gpl20 폴리펩티드계 검정1. gpl20 polypeptide-based assay
면역검정은 일반적으로 비정상적 이황화물 구조를 특징으로 하는 gp120 폴리펩티드의 검출에 가장 편리하다. gp120 폴리펩티드를 검출하는 면역검정을 수행하기 위한 고려할 사항, 예를 들어 형식, 표지 시스템은 항-gp120 항체의 검출에 관하여 상기에서 실질적으로 설명되어 있다. Immunoassays are generally most convenient for the detection of gp120 polypeptides characterized by abnormal disulfide structures. Considerations for performing an immunoassay for detecting a gp120 polypeptide, e.g., format, labeling system, are described substantially above with respect to the detection of anti-gp120 antibodies.
gp120 폴리펩티드를 검출하는 바람직한 면역검정은 경쟁적 또는 비경쟁적 검정이다. 비경쟁적 면역검정은 특이적 항체에 결합된 gp120 폴리펩티드의 양이 직접 측정되는 검정이다. 경쟁적 검정에서는, 샘플 내의 gp120 폴리펩티드의 양은 특이적 항체로부터 치환되는(또는 특이적 항체와 경쟁된) 추가된(외인성) 폴리펩티드의 양을 측정하여 간접적으로 측정된다.Preferred immunoassays for detecting gp120 polypeptide are competitive or noncompetitive assays. Noncompetitive immunoassays are assays in which the amount of gp120 polypeptide bound to a specific antibody is directly measured. In a competitive assay, the amount of gp120 polypeptide in a sample is determined indirectly by measuring the amount of added (exogenous) polypeptide that is substituted from (or competed with) the specific antibody.
이러한 면역검정에 유용한 항체는, 예를 들어 상기에서 설명한 바와 같이 생성될 수 있는 폴리클로날 및 모노클로날 항체를 포함한다.Antibodies useful for such immunoassays include polyclonal and monoclonal antibodies, which may be generated, for example, as described above.
2. gp120 폴리누클레오티드계 검정2. gp120 polynucleotide based assay
비정상적 이황화물 구조를 특징으로 하는 gp120 서열을 코딩하는 gp120 폴리누클레오티드는 샘플 폴리누클레오티드에 대한 적합한 핵산 분자의 특이적 하이브리드화에 기초하여 검출되는 것이 일반적이다. 핵산 분자는 샘플 누클레오티드에서 검출하고자 하는 gp120 폴리누클레오티드에는 존재하고 다른 누클레오티드에는 존재하지 않는 표적 누클레오티드 서열과 특이적으로 하이브리드화한다. 바람직한 구체예에서, 핵산 분자는 표적 누클레오티드 서열에 실질적으로 상보적이다. A gp120 polynucleotide encoding a gp120 sequence characterized by an abnormal disulfide structure is generally detected based on specific hybridization of a suitable nucleic acid molecule to a sample polynucleotide. The nucleic acid molecule specifically hybridizes with a target nucleotide sequence that is present in the gp120 polynucleotide to be detected in the sample nucleotide and not in the other nucleotide. In a preferred embodiment, the nucleic acid molecule is substantially complementary to the target nucleotide sequence.
폴리누클레오티드는 당업계에 잘 알려진 많은 방법들 중 하나에 따라 샘플로부터 제조될 수 있다. 폴리누클레오티드의 단리 및 정제에 대한 일반법은 문헌(Tijssen ed. , (1993) Chapter 3 of Laboratory Techniques in Biochemistry and Molecular Biology : Hybridization With Nucleic Acid Probes, Part L Theory and Nucleic Acid Preparation, Elsevier, N. Y. and Tijssen ed.)에 상세히 설명되어 있다. 바람직한 구체예에서, gp120 폴리누클레오티드는 HIV-1 바이러스 RNA를 함유하는 샘플로부터 역전사에 의해 수득된 후 증폭될 수 있다.Polynucleotides can be prepared from samples according to one of many methods well known in the art. General methods for isolation and purification of polynucleotides are described in Tijssen ed., (1993)
i. 증폭계 검정i. Amplification test
한 구체예에서, 증폭계 검정은 관심있는 gp120 폴리펩티드를 검출하고, 임의로 정량하기 위해 사용될 수 있다. 이러한 증폭계 검정에서, 샘플 내의 gp120 폴리누클레오티드는 검출가능한 표지 또는 표지 시스템의 성분을 함유하는 핵산 프라이머를 사용하여 수행되는 증폭 반응에서 주형으로서 작용한다. 적합한 증폭 방법은, 중합효소 연쇄 반응(PCR), 역전사 PCR(RT-PCR); 리가아제 연쇄 반응(LCR)(참조: Wu and Wallace (1989) Genomics 4: 560, Landegren et al. (1988) Science 241: 1077, and Barringer et al. (1990) Gene 89: 117); 전사 증폭(Kwoh et al. (1989) Proc. Natl. Acad. Sci. USA 86: 1173), 자체 지속되는 서열 복제(self-sustained sequence replication) (Guatelli et al. (1990) Proc.Nat. Acad. Sci. USA 87: 1874); 도트(dot) PCR, 및 링커 어댑터(linker adapter) PCR 등을 포함하지만, 이에 한정되지 않는다.In one embodiment, an amplification assay can be used to detect and optionally quantify a gp120 polypeptide of interest. In this amplification assay, the gp120 polynucleotide in the sample acts as a template in an amplification reaction performed using nucleic acid primers containing a detectable label or component of a labeling system. Suitable amplification methods include polymerase chain reaction (PCR), reverse transcription PCR (RT-PCR); Ligase chain reaction (LCR) (Wu and Wallace (1989) Genomics 4: 560, Landegren et al. (1988) Science 241: 1077, and Barringer et al. (1990) Gene 89: 117); Transcription amplification (Kwoh et al. (1989) Proc. Natl. Acad. Sci. USA 86: 1173), self-sustained sequence replication (Guatelli et al. (1990) Proc. Nat. Acad. Sci. USA 87: 1874; Dot PCR, linker adapter PCR, and the like.
gp120 폴리누클레오티드의 수준을 결정하는 것이 바람직한 경우, 잘 알려진 수많은 "정량적" 증폭 방법이 사용될 수 있다. 정량적 PCR은 일반적으로 동일한 프라이머를 사용하여 기지량의 대조 서열을 동시에 공동증폭시키는 것을 포함한다. 이것은 PCR 반응을 캘리브레이션하는 데에 사용될 수 있는 내부 표준을 제공한다. 정량적 PCR에 대한 상세한 프로토콜은 문헌(PCR Protocols, A Guide to Methods and Applications, Innis et al., Academic Press, Inc. N. Y., (1990))에 제공되어 있다.If it is desired to determine the level of gp120 polynucleotides, a number of well known "quantitative" amplification methods can be used. Quantitative PCR generally involves co-amplifying a known amount of control sequence simultaneously using the same primers. This provides an internal standard that can be used to calibrate the PCR reaction. Detailed protocols for quantitative PCR are provided in PCR Protocols, A Guide to Methods and Applications, Innis et al., Academic Press, Inc. N. Y., (1990).
ii. 하이브리드화계 검정ii. Hybridization System Test
핵산 하이브리드화는 간단하게 프로브와 이의 상보적인 표적 누클레오티드 서열이 상보적인 염기 짝지음을 통해 안정한 하이브리드 듀플렉스를 형성할 수 있는 조건하에서 핵산 프로브를 샘플 폴리누클레오티드와 접촉시키는 것을 포함한다. 하이브리드 듀플렉스를 형성하지 않는 핵산은 그 후 세척 제거되고 하이브리드화된 핵산이 남아서 부착된 검출가능한 표지 또는 표지 시스템의 성분의 검출을 통해 검출된다. 핵산 하이브리드화 기술을 사용하여 폴리누클레오티드를 검출 및/또는 정량하는 방법은 당업자에게 공지되어 있다(참조: Sambrook et al. supra). 하이브리드화 기술은 문헌(Hames and Higgins (1985) Nucleic Acid Hybridization, A Practical Approach, IRL Press; Gall and Pardue (1969) Proc. Natl. Acad. Sci. USA 63: 378-383 ; John et al. (1969) Nature 223: 582-587)에 일반적으로 기술되어 있다. 하이브리드화 조건을 최적화하는 방법은 문헌(Tijssen (1993) Laboratory Techniques in Biochemistry and Molecular Biology, Vol. 24 : Hybridization With Nucleic Acid Probes, Elsevier, N. Y.)에 기술되어 있다. Nucleic acid hybridization simply involves contacting a nucleic acid probe with a sample polynucleotide under conditions such that the probe and its complementary target nucleotide sequence can form a stable hybrid duplex through complementary base pairing. Nucleic acid that does not form a hybrid duplex is then detected through detection of a component of a detectable label or labeling system that is washed off and remains attached to the hybridized nucleic acid. Methods of detecting and / or quantifying polynucleotides using nucleic acid hybridization techniques are known to those skilled in the art (Sambrook et al. Supra). Hybridization techniques are described in Hames and Higgins (1985) Nucleic Acid Hybridization, A Practical Approach, IRL Press; Gall and Pardue (1969) Proc. Natl. Acad. Sci. USA 63: 378-383; John et al. (1969). Nature 223: 582-587). Methods for optimizing hybridization conditions are described in Tijssen (1993) Laboratory Techniques in Biochemistry and Molecular Biology, Vol. 24: Hybridization With Nucleic Acid Probes, Elsevier, N. Y.
본원에서 gp120 폴리누클레오티드의 검출에 사용되는 핵산 프로브는 이들 폴리누클레오티드의 전체 길이 또는 전체 길이 미만일 수 있다. 더 짧은 프로브는 특이성을 위해 일반적으로 경험적으로 시험된다. 바람직하게는, 핵산 프로브는 길이가 약 15개 이상, 보다 바람직하게는 약 20개 이상인 염기이다. 핵산 하이브리드화에 사용하기 위한 핵산 프로브 서열을 선택하는 방법에 대해서 문헌(Sambrook et al.)을 참조할 수 있다. 하이브리드화된 프로브의 가시화는 관심있는 gp120 폴리누클레오티드의 존재 또는 부재를 정량적으로 결정할 수 있게 하며, 표준 방법(예를 들어, 핵산 프로브가 방사활성 표지되어 있는 밀도법)이 gp120 폴리누클레오티드의 수준을 정량하는 데에 사용될 수 있다.The nucleic acid probes used herein for the detection of gp120 polynucleotides may be less than or less than the full length of these polynucleotides. Shorter probes are generally tested empirically for specificity. Preferably, the nucleic acid probe is at least about 15 bases, more preferably at least about 20 bases in length. See Sambrook et al. For how to select nucleic acid probe sequences for use in nucleic acid hybridization. Visualization of the hybridized probes enables quantitative determination of the presence or absence of the gp120 polynucleotide of interest, and standard methods (eg, density methods in which nucleic acid probes are radioactively labeled) quantify the level of gp120 polynucleotides. Can be used to
다양한 추가의 핵산 하이브리드화 형식은 당업자에게 공지되어 있다. 표준 형식은 샌드위치 검정 및 경쟁 또는 치환 검정을 포함한다. 샌드위치 검정은 폴리누클레오티드를 검출하거나 단리하는 상업적으로 유용한 하이브리드화 검정이다. 이러한 검정은 고체 지지체에 공유적으로 고정된 "포획" 핵산 및 용액 내의 표지된 "신호" 핵산을 이용한다. 샘플은 표적 폴리누클레오티드를 제공한다. 포획 핵산 및 신호 핵산은 각각 표적 누클레오티드와 하이브리드화하여 "샌드위치" 하이브리드화 복합체를 형성한다. Various additional nucleic acid hybridization formats are known to those skilled in the art. Standard formats include sandwich assays and competition or substitution assays. Sandwich assays are commercially useful hybridization assays that detect or isolate polynucleotides. This assay utilizes "capture" nucleic acids covalently immobilized on a solid support and labeled "signal" nucleic acids in solution. The sample provides a target polynucleotide. Capture nucleic acids and signal nucleic acids each hybridize with a target nucleotide to form a "sandwich" hybridization complex.
한 구체예에서, 본 발명의 방법은 어레이계 하이브리드화 형식으로 이용될 수 있다. 어레이 형식에서, 다수의 상이한 하이브리드화 반응이 실질적으로 "병행적으로" 일어날 수 있다. 이것은 하나의 실험에서 많은 하이브리드화의 신속하고 실질적인 동시 평가를 제공한다. 어레이계 형식에서 하이브리드화 반응을 수행하는 방법은 당업자에게 잘 알려져 있다(참조: Pastinen (1997) Genome Res. 7: 606-614; Jackson (1996) Nature Biotechnology 14: 1685; Chee (1995) Science 274: 610; WO 96/17958, Pinkel et al. (1998) Nature Genetics 20 : 207-211). In one embodiment, the method of the present invention can be used in an array based hybridization format. In an array format, many different hybridization reactions can occur substantially "in parallel". This provides a quick and practical simultaneous evaluation of many hybridizations in one experiment. Methods of performing hybridization reactions in array based formats are well known to those skilled in the art. See Pastinen (1997) Genome Res. 7: 606-614; Jackson (1996) Nature Biotechnology 14: 1685; Chee (1995) Science 274: 610, WO 96/17958, Pinkel et al. (1998) Nature Genetics 20: 207-211).
어레이, 특히 핵산 어레이는 당업자에게 잘 알려진 다양한 방법에 따라 생성될 수 있다. 예를 들어, 간단한 구체예에서, "저밀도" 어레이는 고체 지지체(예를 들어, 유리 표면, 막 등) 상의 상이한 위치에 상이한 핵산의 반점을 찍어서(예를 들어, 피펫을 사용하여 손으로) 간단히 생성시킬 수 있다. 이러한 간단히 반점을 찍는 방법이 자동화되어 고밀도 반점의 마이크로어레이가 생산되었다. 예를 들어, 미국 특허 제5,807,522호는 작은 부피의 생물학적 샘플을 침착시키기 위해 표면에 대하여 마이크로모세관을 두드리는 자동 시스템의 사용을 기술하고 있다. 이 공정이 반복되어 고밀도 어레이를 생성시킨다. 어레이는 또한 올리고누클레오티드 합성 기술을 사용하여 생성될 수 있다. 따라서, 예를 들어, 미국 특허 제5,807,522호 및 PCT 특허 공보 제90/15070호 및 제92/10092호에서는 고밀도 올리고누클레오티드 어레이의 광-지시 조합 합성의 사용을 교시하고 있다. 고밀도 어레이의 합성은 미국 특허 제5,744,305호; 제5,800,992호; 및 제5,445,934호에 기술되어 있다. Arrays, particularly nucleic acid arrays, can be produced according to a variety of methods well known to those skilled in the art. For example, in a simple embodiment, a “low density” array is simply made by spotting different nucleic acids at different locations on a solid support (eg, glass surface, membrane, etc.) (eg by hand using a pipette). Can be generated. This simple spotting method was automated to produce a high density spotted microarray. For example, US Pat. No. 5,807,522 describes the use of an automated system that taps microcapillaries against a surface to deposit a small volume of biological sample. This process is repeated to create a high density array. Arrays can also be generated using oligonucleotide synthesis techniques. Thus, for example, US Pat. Nos. 5,807,522 and PCT Patent Publications 90/15070 and 92/10092 teach the use of light-directed combinatorial synthesis of high density oligonucleotide arrays. Synthesis of high density arrays is described in US Pat. No. 5,744,305; 5,800,992; 5,800,992; And 5,445,934.
바람직한 구체예에서, 본 발명에 사용되는 어레이는 "프로브" 핵산을 함유한다. 그 다음에, 이러한 프로브는 생물학적 샘플로부터 얻은 폴리누클레오티드에 존재하는 표적 누클레오티드 서열과 각각 하이브리드화한다. 대안적으로, 상이한 샘플로부터의 폴리누클레오티드가 어레이되고, 그 다음에 이 어레이가 구별되게 표지될 수 있는 하나 이상의 프로브로 프로빙되도록 형식을 바꿀 수 있다.In a preferred embodiment, the array used in the present invention contains a "probe" nucleic acid. These probes then each hybridize with the target nucleotide sequence present in the polynucleotide obtained from the biological sample. Alternatively, polynucleotides from different samples can be arrayed and then reformatted such that the array is probed with one or more probes that can be distinguishably labeled.
다양한 고체 표면상에 핵산을 고정시키는 많은 방법들이 당업계에 공지되어 있다. 광범위한 유기 및 무기 중합체 및 다른 물질들은 천연 및 합성 둘 모두, 고체 표면을 위한 물질로서 사용될 수 있다. 예시적인 고체 표면은, 예를 들어 니트로셀룰로오스, 나일론, 유리, 석영, 디아조화 막(종이 또는 나일론), 실리콘, 폴리포름알데히드, 셀룰로오스, 및 셀룰로오스 아세테이트를 포함한다. 또한, 폴리에틸렌, 폴리프로필렌, 폴리스티렌 등과 같은 플라스틱이 사용될 수 있다. 사용될 수 있는 다른 물질은 종이, 세라믹, 금속, 메탈로이드, 반도체 물질 등을 포함한다. 또한, 겔을 형성하는 물질이 사용될 수 있다. 이러한 물질은, 예를 들어 단백질(예를 들어, 젤라틴), 리포폴리사카라이드, 실리케이트, 아가로오스 및 폴리아크릴아미드를 포함한다. 고체 표면이 다공성인 경우, 다양한 공극 크기가 시스템의 성질에 따라 사용될 수 있다.Many methods for immobilizing nucleic acids on various solid surfaces are known in the art. A wide range of organic and inorganic polymers and other materials can be used as materials for solid surfaces, both natural and synthetic. Exemplary solid surfaces include, for example, nitrocellulose, nylon, glass, quartz, diazotized membranes (paper or nylon), silicon, polyformaldehyde, cellulose, and cellulose acetate. In addition, plastics such as polyethylene, polypropylene, polystyrene and the like can be used. Other materials that can be used include paper, ceramics, metals, metalloids, semiconductor materials, and the like. In addition, materials that form gels can be used. Such materials include, for example, proteins (eg gelatin), lipopolysaccharides, silicates, agarose and polyacrylamides. If the solid surface is porous, various pore sizes may be used depending on the nature of the system.
표면 제조에 있어서, 다앙한 성질을 얻기 위해 다수의 상이한 물질이 특히 라미네이트로서 사용될 수 있다. 예를 들어, 단백질(예를 들어, 소혈청알부민) 또는 거대분자의 혼합물(예를 들어, 덴하르트 용액)은 비특이적 결합을 피하고, 공유적 컨쥬게이션을 단순화하고/거나 신호 검출을 증가시키기 위해 사용될 수 있다. 화합물과 표면 사이의 공유 결합이 바람직한 경우에는, 표면은 다작용성이거나 다작용화될 수 있을 것이다. 표면상에 존재하고 결합에 사용될 수 있는 작용기는 카르복실산, 알데히드, 아미노기, 시아노기, 에틸렌기, 히드록실기, 메르캅토기 등을 포함할 수 있다. 다양한 화합물을 다양한 표면에 결합시키는 방식은 잘 알려져 있으며, 문헌에 충분히 예시되어 있다.In surface preparation, many different materials can be used, in particular, as laminates in order to obtain a variety of properties. For example, a protein (eg bovine serum albumin) or a mixture of macromolecules (eg denhardt solution) may be used to avoid nonspecific binding, simplify covalent conjugation and / or increase signal detection. Can be. If a covalent bond between the compound and the surface is desired, the surface may be multifunctional or polyfunctionalized. Functional groups present on the surface and which may be used for bonding may include carboxylic acids, aldehydes, amino groups, cyano groups, ethylene groups, hydroxyl groups, mercapto groups and the like. The manner in which the various compounds are bound to the various surfaces is well known and fully exemplified in the literature.
어레이는 약 1mm 직경에서 작게는 약 1㎛까지 다양한 크기의 표적 요소로 구성될 수 있다. 1㎠ 면적을 정량적으로 형광 이미지화할 수 있는 비교적 간단한 방법은 하나의 이미지에서 다수의 표적 요소들로부터 데이터의 획득을 허용한다고 기술되어 왔다(참조: Wittrup (1994) Cytometry 16: 206-213, Pinkel et al. (1998) Nature Genetics 20: 207-211). The array may consist of target elements of various sizes from about 1 mm in diameter to as small as about 1 μm. A relatively simple method of quantitatively fluorescence imaging of 1 cm 2 area has been described that allows acquisition of data from multiple target elements in one image (see Wittrup (1994) Cytometry 16: 206-213, Pinkel et. al. (1998) Nature Genetics 20: 207-211.
본 발명에 따른 하이브리드화 검정은 프로티베리스 멀티캔티레버 어레이(Protiveris' multicantilever array)와 같은 마이크로일렉트로미케니칼(MicroElectroMechanical System: MEMS)를 이용하여 수행될 수 있다.Hybridization assays according to the present invention can be performed using a MicroElectroMechanical System (MEMS), such as a Protiveris' multicantilever array.
iii. gp120 폴리누클레오티드의 검출iii. Detection of gp120 Polynucleotides
gp120 폴리누클레오티드는 상기 폴리누클레오티드계 검정으로 검출가능한 표지에 의해 검출된다. 상기에서 논의된 표지는 본 발명의 폴리누클레오티드계 검정에서 사용될 수 있다. 표지는 하이브리드화 또는 증폭 이전, 또는 이후에 프로브 또는 프라이머 또는 샘플 폴리누클레오티드에 첨가될 수 있다. 소위 "직접 표지"는 검정을 수행하기 전에 표지된 폴리누클레오티드에 직접 부착되거나 혼입된 검출가능한 표지이다. 대조적으로, 소위 "간접 표지"는 하이브리드화 이후 하이브리드 듀플렉스에 결합된다. 간접 표지의 경우, 하이브리드 듀플렉스 내의 폴리누클레오티드 중 하나가 검출가능한 표지가 결합하는 성분을 보유한다. 따라서, 예를 들어, 프로브 또는 프라이머는 하이브리드화 이전에 비오티닐화될 수 있다. 하이브리드화 이후에, 아비딘 컨쥬게이션된 형광발색단은 비오틴 보유 하이브리드 듀플렉스와 결합하여 용이하게 검출되는 표지를 제공한다. 폴리누클레오티드의 표지 및 검출 방법에 대한 상세한 개관에 관해서는, 문헌(Laboratory Techniques in Biochemistry and Molecular Biology, Vol. 24 : Hybridization With Nucleic Acid Probes, P. Tijssen, ed. Elsevier, N. Y. , (1993))을 참조할 수 있다. gp120 polynucleotides are detected by labels detectable with the polynucleotide-based assay. The labels discussed above can be used in the polynucleotide based assays of the present invention. The label can be added to the probe or primer or sample polynucleotide before or after hybridization or amplification. So-called "direct labels" are detectable labels that are directly attached or incorporated into the labeled polynucleotides prior to performing the assay. In contrast, the so-called "indirect label" binds to the hybrid duplex after hybridization. For indirect labels, one of the polynucleotides in the hybrid duplex carries the component to which the detectable label binds. Thus, for example, the probe or primer may be biotinylated prior to hybridization. After hybridization, the avidin conjugated fluorophore is combined with the biotin bearing hybrid duplex to provide a label that is easily detected. For a detailed overview of methods for labeling and detecting polynucleotides, see Laboratory Techniques in Biochemistry and Molecular Biology, Vol. 24: Hybridization With Nucleic Acid Probes, P. Tijssen, ed.Elsevier, NY, (1993). Reference may be made.
하이브리드화 검정의 감도는 검출되는 표적 폴리누클레오티드를 증가시키는 폴리누클레오티드 증폭 시스템을 사용하여 증가될 수 있다. 이러한 시스템의 예는 중합효소 연쇄 반응(PCR) 시스템 및 리가아제 연쇄 반응(LCR) 시스템을 포함한다. 종래 기술로 최근에 기술된 다른 방법은 핵산 서열 기초 증폭(nucleic acid sequence based amplification)(NASBAO, Cangene, Mississauga, Ontario) 및 큐 베타 레플리카제 시스템(Q Beta Replicase systems)이다. The sensitivity of the hybridization assay can be increased using a polynucleotide amplification system that increases the target polynucleotide detected. Examples of such systems include polymerase chain reaction (PCR) systems and ligase chain reaction (LCR) systems. Other methods recently described in the prior art are nucleic acid sequence based amplification (NASBAO, Cangene, Mississauga, Ontario) and Q Beta Replicase systems.
본 발명의 증폭계 검정에 사용하기에 적합한 바람직한 구체예에서, 프라이머는 2개의 형광 염료 "리포터 염료(reporter dye)" 및 "켄쳐 염료(quencher dye)를 함유한다. 손상되지 않은 경우, 프라이머는 켄쳐 염료 효과로 인해 매우 낮은 형광 수준을 나타낸다. 프라이머가 분열되거나 분해되는 경우(예를 들어, 중합효소의 엑소누클레아제 활성에 의해, 이하 참조), 리포터 염료가 형광을 나타내고 적합한 형광 검출 시스템에 의해 검출된다. 많은 기술(PCR, RT-PCR, RCA, 또는 다른 증폭 방법)에 의한 증폭은 중합효소 및 엑소누클레아제 활성을 갖는 적합한 DNA 중합효소(예를 들어, Taq DNA 중합효소)를 사용하여 수행된다. 이러한 중합효소는 새로운 DNA 가닥을 합성하고, 과정 중에서, 표지된 프라이머를 분해하는 결과 형광성을 증가시킨다. 이러한 타입의 상업적으로 시판되는 형광 검출 시스템은 어플라이드 바이오시스템즈(Applied Biosystems)의 ABI 프리즘 시스템(ABI Prism® Systems) 7000, 7300, 7500, 7700, 또는 7900(TaqMand3) 또는 로슈(Roche)의 라이트사이클러 시스템(LightCycler® System)을 포함한다.In a preferred embodiment suitable for use in the amplification assay of the present invention, the primer contains two fluorescent dyes "reporter dye" and "quencher dye. If undamaged, the primer is quencher. Very low fluorescence levels due to the dye effect: if the primer cleaves or degrades (e.g., by the exonuclease activity of the polymerase, see below), the reporter dye will fluoresce and by a suitable fluorescence detection system Amplification by many techniques (PCR, RT-PCR, RCA, or other amplification methods) can be accomplished using suitable DNA polymerases (eg, Taq DNA polymerase) with polymerase and exonuclease activity. This polymerase increases the fluorescence as a result of synthesizing new DNA strands and, in the process, digesting the labeled primers. Commercially available fluorescence detection systems are ABI Prism® Systems 7000, 7300, 7500, 7700, or 7900 (TaqMand3) or Roche's LightCycler® System from Applied Biosystems. ).
하기 실시예는 예시를 위하여 제공되며 본 발명을 제한하고자 하는 것은 아니다.The following examples are provided for purposes of illustration and are not intended to limit the invention.
실시예Example
실시예 1Example 1
gp120 폴리펩티드의 발현Expression of the gp120 Polypeptide
HIV-1 gp120 서열은 바람직하게는, 이종성 신호 서열 및 에피토프 태그를 함유하는 융합 단백질로서 발현된다. 예시적인 바람직한 신호 서열은 단순포진 바이러스-1(HSV1) gD 당단백질 및 인간 조직 플라스미노겐 활성화인자(tPA)로부터 얻은 것을 포함한다. 성숙한 HSV1 gD의 처음 29개 아미노산은 에피토프 태그로서 작용하며, 이것은 gp120의 HXB2 균주의 N-말단 메티오닌(성숙 gp120의 12번 잔기)으로부터 넘버링할 때 HIV gp120의 42번 잔기와 결합한다. The HIV-1 gp120 sequence is preferably expressed as a fusion protein containing a heterologous signal sequence and an epitope tag. Exemplary preferred signal sequences include those obtained from herpes simplex virus-1 (HSV1) gD glycoprotein and human tissue plasminogen activator (tPA). The first 29 amino acids of mature HSV1 gD act as epitope tags, which bind to residue 42 of HIV gp120 when numbered from the N-terminal methionine (
pCI.gD.gpl20은 포유동물 전사 유닛 및 pUC 벡터 골격을 함유한다(프로메가 코포레이션(Promega Corporation, Madison WI)으로부터 구입가능한 pCI 포유동물 발현 벡터). 전사 유닛은 사이토메갈로바이러스(CMV) 즉시형 조기(immediate early) 프로모터 및 인조 인트론에 이어서 신호 서열 및 KpnI 부위에서 HIV-1 gp120의 42번 잔기에 융합된 성숙한 HSV1 gD 당단백질의 처음 29개 아미노산에 대한 코딩 영역을 함유한다. gpl20 서열은 상기에서 언급한 바와 같이 정지 코돈 및 XhoI 제한 부위에서 종결된다. 벡터 내에서 그 다음에 SV40 폴리-A 서열 및 전자 종결자가 이어진다.pCI.gD.gpl20 contains the mammalian transcriptional unit and the pUC vector backbone (pCI mammalian expression vector available from Promega Corporation, Madison WI). The transcription unit comprises the cytomegalovirus (CMV) immediate early promoter and artificial intron followed by the first 29 amino acids of the mature HSV1 gD glycoprotein fused to residue 42 of HIV-1 gp120 at the signal sequence and KpnI site. Contains the coding region. The gpl20 sequence ends at the stop codon and the XhoI restriction site as mentioned above. Within the vector is followed by the SV40 poly-A sequence and the electron terminator.
pCI.tPA.gp120은 또한 포유동물 전사 유닛 및 pUC 벡터 골격을 함유한다(프로메가 코포레이션(Promega Corporation, Madison WI)으로부터 구입가능한 pCI 포유동물 발현 벡터). 전사 유닛은 사이토메갈로바이러스(CMV) 조기(immediate early) 프로모터 및 인조 인트론에 이어서 tPA 프리-프로 서열의 처음 36개 아미노산 및 KpnI 부위에서 HIV-1 gp120의 42번 잔기에 융합된 성숙한 HSV1 gD 당단백질의 처음 29개 아미노산에 대한 코딩 영역을 함유한다. gpl20 서열은 상기에서 언급한 바와 같이 정지 코돈 및 XhoI 제한 부위에서 종결된다. 벡터 내에서 그 다음에 SV40 폴리-A 서열 및 전자 종결자가 이어진다.pCI.tPA.gp120 also contains a mammalian transcriptional unit and a pUC vector backbone (pCI mammalian expression vector available from Promega Corporation, Madison WI). The transcription unit comprises a cytomegalovirus (CMV) early early promoter and an artificial intron followed by the first 36 amino acids of the tPA pre-pro sequence and a mature HSV1 gD glycoprotein fused to residue 42 of HIV-1 gp120 at the KpnI site. Contains the coding region for the first 29 amino acids. The gpl20 sequence ends at the stop codon and the XhoI restriction site as mentioned above. Within the vector is followed by the SV40 poly-A sequence and the electron terminator.
본 발명의 gp120 폴리펩티드를 코딩하는 서열을 중합효소 연쇄 반응(PCR)에 의해 증폭시켜 42번 아미노산 잔기에서 529번 아미노산 잔기까지 연장되는 DNA 단편을 생성시켰다. 단편은 5' 말단에서 KpnI 제한 부위 및 정지 코돈에 이어서 3' 말단에서 XhoI 제한 부위를 가진다. 이러한 제한 부위는 pCI.gD.gp120 벡터 또는 pCI.tPA.gpl20 벡터 내로의 클로닝을 용이하게 한다.The sequence encoding the gp120 polypeptide of the invention was amplified by polymerase chain reaction (PCR) to generate a DNA fragment extending from amino acid residues 42 to amino acid residues 529. The fragment has a KpnI restriction site at the 5 'end and a stop codon followed by an XhoI restriction site at the 3' end. Such restriction sites facilitate cloning into the pCI.gD.gp120 vector or pCI.tPA.gpl20 vector.
gpl20 증폭 생성물을 KpnI 및 XhoI 제한 효소로 분해시키고, 1.5 kb gpl20 단편을 QIAquick PCR 정제 키트(Qiagen, Valencia, CA)와 같은 시판되는 키트를 사용하여 단리하였다. PCI.gD.gpl20 또는 pCI.tPA.gpl20을 KpnI 및 XhoI로 분해시키고, 4.3 kb 벡터 단편을 QIAquick 겔 추출 키트(Qiagen, Valencia, CA)와 같은 시판되는 키트에 의해 단리시켰다. gp120 단편을 어느 한 쪽 벡터 내로 라이게이션시켰다. 라이게이션 생성물을 ToplO E. coli 내로 형질전환시켰다. DNA를 제조하고, KpnI 및 XhoI로 분해시키고 겔 전기영동 처리하여 형질전환체가 목적하는 구성물을 함유하는 것을 확인하였다. 그 다음, 유전자 발현을, 인산 칼슘 기술(Graham et al. , Virology 52: 456-467 (1973))을 이용하여 293T 배아 인간 신장 세포주(Graham et al. , J. Gen. Virol. 36: 59-77 (1977)) 내로 일시적으로 형질감염시키고, 후속하여 폴리클로날 토끼 항-gp120 항혈청을 이용하는 웨스턴 블롯에 의해 조건화된 배지를 검정하여 시험하였다.The gpl20 amplification product was digested with KpnI and XhoI restriction enzymes, and 1.5 kb gpl20 fragments were isolated using commercially available kits such as the QIAquick PCR Purification Kit (Qiagen, Valencia, Calif.). PCI.gD.gpl20 or pCI.tPA.gpl20 were digested with KpnI and XhoI and 4.3 kb vector fragments were isolated by commercially available kits such as the QIAquick gel extraction kit (Qiagen, Valencia, CA). gp120 fragment was ligated into either vector. Ligation products were transformed into ToplO E. coli. DNA was prepared, digested with KpnI and XhoI and subjected to gel electrophoresis to confirm that the transformants contained the desired construct. Gene expression was then determined using a calcium phosphate technique (Graham et al., Virology 52: 456-467 (1973)) using the 293T embryonic human kidney cell line (Graham et al., J. Gen. Virol. 36: 59- 77 (1977) were transiently transfected and subsequently tested by assaying medium conditioned by Western blot using polyclonal rabbit anti-gp120 antiserum.
실시예 2Example 2
바이러스 감염성 및 중화 검정Virus infectivity and neutralization assay
HIV-1의 시험관내 배양은 불가피하게 선택적 조건을 부과하기 때문에, HIV-1 감염 환자에서 순환하는 바이러스의 정확한 제시는 환자로부터 유래하는 물질(예를 들어, 혈장, 백혈구, 뇌)로부터 중간적인 시험관내 배양 단계 없이 분자적으로 클로닝되는 바이러스 서열을 검사하여서만 수득될 수 있다. 그 후, 레트로바이러스 RNA 게놈으로부터 얻은 클로닝된 상보적 DNA(cDNA)를 추가 분석을 위해 안정한 DNA계 발현 시스템으로 삽입할 수 있다. 이러한 발현 시스템의 하나가 페노센스(PhenoSenseTM) HIV 중화 검정이다(문헌(Richman et al., 2003)에 상세히 기재되어 있음). 간단하게, 검정은 클로닝된 HIV-1 엔벨로프 단백질을 발현하는 슈도타입 (pseudotype) 바이러스를 만든 다음, 이 바이러스를 바이러스 감염성 연구에 이용한다. 이 검정은 클로닝된 HIV 엔벨로프 당단백질의 표현형을 케모카인 수용체 사용(CXCR4 또는 CCR5) 및 가용성 CD4에 대한 감수성 및 바이러스 중화 항체에 관하여 결정하는 데에 사용되어 왔다. 이 검정은 활성화된 말초 혈액 단구(PBMCs)를 억제하는 항체의 능력이 측정되는 통상적인 바이러스 중화 검정과 상관성이 있는 것으로 나타났다.Since in vitro culture of HIV-1 inevitably imposes selective conditions, accurate presentation of the circulating virus in HIV-1 infected patients is an intermediate test from material derived from the patient (eg, plasma, white blood cells, brain). It can only be obtained by examining viral sequences that are molecularly cloned without an in vitro culture step. The cloned complementary DNA (cDNA) obtained from the retroviral RNA genome can then be inserted into a stable DNA based expression system for further analysis. One such expression system is the PhenoSense ™ HIV neutralization assay (as described in detail in Richman et al., 2003). In brief, the assay creates a pseudotype virus that expresses a cloned HIV-1 envelope protein and then uses the virus for viral infectivity studies. This assay has been used to determine the phenotype of cloned HIV envelope glycoproteins with respect to chemokine receptor use (CXCR4 or CCR5) and susceptibility and viral neutralizing antibodies to soluble CD4. This assay has been shown to correlate with conventional virus neutralization assays in which the ability of antibodies to inhibit activated peripheral blood monocytes (PBMCs) is measured.
페노센스(PhenoSenseTM) 검정은 보다 신속하고, 보다 감도가 높고, 한정된 바이러스를 사용한다는 점에서 통상적인 방법에 비해 장점이 있다. 이것은 시험관 내에서 바이러스 선택으로부터 일어나는 잠재적인 인공물을 피한다. 페노센스(PhenoSenseTM) 검정은 핵산 증폭(RT-PCR)을 사용하여 HIV-포지티브 환자 혈장 샘플로부터 HIV 엔벨로프 서열(gp160)을 얻는다. 증폭된 엔벨로프 서열을 통상적인 클로닝 방법을 이용하여 발현 벡터(pCXAS)내로 삽입시킨다. 발현 벡터는 단일 단리된 분자 클론으로부터 제조하거나, 샘플 수집시에 환자에서 각종 바이러스 유사종을 정확하게 나타내는 서열의 큰 풀(pool)로부터 제조할 수 있다. 바이러스 엔벨로프 단백질(풀링된 gp160 또는 개별적인 gp160)을 발현하는 재조합 HIV-1 원액을, HEK293 세포를, HIV-1 엔벨로프 단백질이 결핍된 결함있는 HIV-1 게놈 바이러스 벡터 및 관심있는 HIV 엔벨로프 단백질을 함유하는 제2의 발현 벡터를 동시형질감염시켜 제조한다. HIV-1 게놈 벡터는 복제 결함이 있고, 루시페라아제 발현 카세트를 HIV 엔벨로프 유전자의 결실된 영역 내에 함유한다. 환자 바이러스 엔벨로프 단백질을 갖는 슈도타입 재조합 바이러스, 및 CXCR4 및 CCR5 의존성 대조 바이러스 (NL4-3, JRCSF) 및 특이성 대조 양생(amphotropic) 뮤린 백혈병 바이러스(A-MLV)를 형질감염후 48시간에 수집하고 37℃에서 1시간 동안 일련의 모노클로날 항체 및/또는 혈장 대조 4배 희석액과 함께 인큐베이션하였다. CD4/CCR5를 발현하는 U87 세포 및 CD4/CXCR4를 발현하는 세포에 바이러스-항체 희석액을 접종하였다. 바이러스 감염성은 접종후 72시간 후에 감염된 세포에서 발현되는 루시페라아제 활성의 양을 측정하여 결정하고, 평형 상대 광단위(Balanced Relative Light Units: RLUs)로서 기록하였다. 중화 활성은 항체 네거티브 대조와 비교하여 각 항체 농도에서 바이러스 복제(루시페라아제 활성)의 억제율로서 표시하였다. IC50은 바이러스 감염성을 50% 억제시키기 위해 필요한 모노클로날 항체의 농도로서 정의된다. 바이러스는 IC50이 특이성 대조 바이러스 A-MLV에 있어서 동일한 시약의 IC50 보다 적어도 3배 높은 경우, 중화에 대해 감수성인 것으로 분류하였다. PhenoSense ™ assays have advantages over conventional methods in that they are faster, more sensitive, and use limited viruses. This avoids potential artifacts resulting from virus selection in vitro. The PhenoSense ™ assay uses nucleic acid amplification (RT-PCR) to obtain an HIV envelope sequence (gp160) from HIV-positive patient plasma samples. The amplified envelope sequences are inserted into expression vectors (pCXAS) using conventional cloning methods. Expression vectors can be prepared from single isolated molecular clones or from large pools of sequences that accurately represent various virus like species in the patient at the time of sample collection. Recombinant HIV-1 stocks expressing viral envelope proteins (pooled gp160 or individual gp160), HEK293 cells, a defective HIV-1 genomic viral vector lacking HIV-1 envelope protein and the HIV envelope protein of interest The second expression vector is prepared by cotransfection. The HIV-1 genomic vector is replication defective and contains a luciferase expression cassette in the deleted region of the HIV envelope gene. Pseudotype recombinant virus with patient virus envelope protein, and CXCR4 and CCR5 dependent control virus (NL4-3, JRCSF) and specific control murine leukemia virus (A-MLV) were collected 48 hours post-transfection and 37 Incubate with serial monoclonal antibodies and / or plasma control 4-fold dilutions for 1 hour at < RTI ID = 0.0 > U87 cells expressing CD4 / CCR5 and cells expressing CD4 / CXCR4 were inoculated with virus-antibody dilutions. Viral infectivity was determined by measuring the amount of luciferase activity expressed in infected cells 72 hours after inoculation and recorded as Balanced Relative Light Units (RLUs). Neutralizing activity was expressed as the rate of inhibition of viral replication (luciferase activity) at each antibody concentration compared to the antibody negative control. IC50 is defined as the concentration of monoclonal antibody required to inhibit 50% viral infectivity. Viruses were classified as susceptible to neutralization when the IC50 was at least three times higher than the IC50 of the same reagent for the specific control virus A-MLV.
실험 결과Experiment result
시스테인 돌연변이체중 3개 U-099 (19개 시스테인 잔기 보유), U-209 (19개 시스테인 잔기 보유) 및 U-210 (20개 시스테인 잔기 보유)를 페노센스(PhenoSenseTM) 감염성 및 중화 검정을 실시하고, 그 결과를 표 2에 제시하였다. 각 바이러스는 CD4 및 CCR5 케모카인 수용체를 발현하는 U87 세포에 결합하여 이를 감염시켰으나, CD4 및 CXCR4 공동수용체를 발현하는 U87 세포에는 그러하지 않았고, 이는 이들 돌연변이 바이러스가 오로지 R5 표현형이라는 것을 나타낸다. 바이러스를 gp41(4E10 및 2F5) 및 gpl20 (2G12)을 표적화하는 모노클로날 항체(MAbs)를 사용하여 차별적으로 중화시켰다. 19-시스테인 돌연변이 U-099는 gp41 MAbs에 대해 2 내지 8배 더 저항성을 나타내었으나, gpl20 MAb 2G12를 사용하는 다른 돌연변이체 정도 만큼 중화시켰다. 흥미롭게도, U-210는, 1차 HIV 단리물들에 대해 광범위하게 교차중화성을 나타내는 것으로 생각되는(참조, Trkola et al. , 1996) MAb인 2G12에 의해 중화되지 않았다. 그러나, 2G12 결합은 V4에서의 돌연변이에 대해 감수성을 나타내고 U-210은 이 영역에 2개의 추가 시스테인 잔기를 갖기 때문에, 중화 실패는 이들 시스테인 돌연변이에 의해 매개될 수 있다. 이 연구의 결과는 전형적인 18개 시스테인 잔기가 아닌 19개 및 20개 시스테인 잔기를 프로세싱하는 HIV 단리물은 작용성이 있고, CD4 및 CCR5 케모카인 수용체를 함유하는 세포의 감염을 매개할 수 있다는 것을 입증한다.Three U-099 (having 19 cysteine residues), U-209 (having 19 cysteine residues) and U-210 (having 20 cysteine residues) of the cysteine mutants were subjected to the PhenoSense ™ infectivity and neutralization assay And the results are shown in Table 2. Each virus binds to and infects U87 cells expressing CD4 and CCR5 chemokine receptors, but not to U87 cells expressing CD4 and CXCR4 co-receptors, indicating that these mutant viruses are only R5 phenotype. Virus was differentially neutralized using monoclonal antibodies (MAbs) targeting gp41 (4E10 and 2F5) and gpl20 (2G12). 19-cysteine mutant U-099 showed 2-8 times more resistance to gp41 MAbs, but neutralized by the extent of other mutants using gpl20 MAb 2G12. Interestingly, U-210 was not neutralized by 2G12, a MAb that is thought to exhibit extensive cross-neutralization for primary HIV isolates (see Trkola et al., 1996). However, since 2G12 binding is susceptible to mutations in V4 and U-210 has two additional cysteine residues in this region, neutralization failure can be mediated by these cysteine mutations. The results of this study demonstrate that HIV isolates
표 2TABLE 2
IC50은 바이러스 감염성을 50% 억제시키기 위해 필요한 모노클로날 항체의 농도로서 정의된다. MAbs 4E10 및 2F5는 gp41에서의 영역을 표적으로 하지만, MAb 2G12는 gpl20에서 에피토프를 인식한다. 92HT594는 이중성 주성(dual-tropic) 바이러스 대조이다. JRCSF 및 NL43는 각각 R5 및 X4 주성에 대한 대조이다. A-MLV는 어느 한쪽의 세포 타입(R5 또는 X4)을 감염시키지만, HIV MAbs에 의해서 억제되지 않는 비-HIV 바이러스이다. NA; 적용되지 않음.IC50 is defined as the concentration of monoclonal antibody required to inhibit 50% viral infectivity. MAbs 4E10 and 2F5 target the region at gp41, while MAb 2G12 recognizes the epitope at gpl20. 92HT594 is a dual-tropic virus control. JRCSF and NL43 are controls for R5 and X4 chemotaxis, respectively. A-MLV is a non-HIV virus that infects either cell type (R5 or X4) but is not inhibited by HIV MAbs. NA; Not applicable.
참고문헌references
1. Richman DD,Wrin T, Little SJ, Petropoulos CJ. (2003). Rapid evolution of the neutralizing antibody response to HIV type 1 infection. Proceedings of the National Academy of Sciences U S A. 100 (7): 4144-4149. Richman DD, Wrin T, Little SJ, Petropoulos CJ. (2003). Rapid evolution of the neutralizing antibody response to
2. Trkola A, Purtscher M, Muster T, Ballaun C, Buchacher A, Sullivan N, Srinivasan K, Sodroski J, Moore JP, and Katinger H. (1996). Human monoclonal antibody 2G12 defines a distinctive neutralization epitope on the gpl20 glycoprotein of Human Immunodeficiency Virus Type 1. Journal of Virology 70 (2): 1100-1108. 2. Trkola A, Purtscher M, Muster T, Ballaun C, Buchacher A, Sullivan N, Srinivasan K, Sodroski J, Moore JP, and Katinger H. (1996). Human monoclonal antibody 2G12 defines a distinctive neutralization epitope on the gpl20 glycoprotein of Human
본 명세서에서 언급된 모든 공보 및 특허는 각 공보 또는 특허가 구체적이고 개별적으로 참고문헌으로 인용된다고 표시된 것과 동일한 정도로 본원에서 참고문헌으로 인용된다.All publications and patents mentioned in this specification are herein incorporated by reference to the same extent as if each publication or patent was specifically and individually indicated to be incorporated by reference.
SEQUENCE LISTING
<110> VaxGen, Inc.
Berman, Phillip W
Jobes, David V
<120> HIV-1 ENVELOPE GLYCOPROTEINS HAVING UNUSUAL DISULFIDE STRUCTURE
<130> 409J-000210US/PC
<140> US 10/866,527 and PCT/US 04/018672
<141> 2004-06-10
<160> 140
<170> PatentIn version 3.2
<210> 1
<211> 1613
<212> DNA
<213> Human immunodeficiency virus type 1
<400> 1
agaaagagca gaagacagtg gcaatgagag tgaaggggat caggaggaat tgtcagcgct 60
ggtggagatg gggcatcatg ctccttggaa tgttgatgat ctgtaatgct gtaggacaat 120
tgtgggttac ggtctattat ggggtacctg tgtggaaaga agccaccacc actctattcc 180
gtgcatcaga tgctaaagca tatgatacag aggtacataa tgtctgggcc acacatgcct 240
gtgtacccac agaccccaac ccacaagaaa tagaattgga aaatgtgaca gaaactttta 300
acatgtggaa aaataacatg gtagaacaaa tgcatgagga tataatcagt ttatgggatc 360
aaggcctaaa accatgtgta aaattaaccc cactccgtgt tactttaaat tgcactgact 420
acaagaatgc taatagtacc aataataata gtaccagtga tagtagcaat ctagaagagg 480
agaaaggaga aataaaaaac tgctctttca atatcactac aagcataaaa gataggatgc 540
agaaagaata tgcacttttt tataaacttg atatagtacc aatagataat aataatacta 600
gatataggat gataagttgt aacacctcag tcattacaca ggcctgtcca aaggtatctt 660
ttgagccaat tcccatacat tattgtgccc cggctggttt tgcgattcta aaatgtaagg 720
ataagaagtt caatggaaca ggaccatgta gaaatgtcag cacagtacaa tgtacacatg 780
gaattaggcc agtagtatca actcaattgc tgttaaatgg cagtctagca gaagaagagg 840
tagtacttag atctgaaaat ttcacgaaca atgctaaaac cataatagta cagctaaaag 900
aacctataaa aatcaattgt acaagaccca acaacaatac aagaaaaagt atacatatag 960
gaccagggag agcattttat acaacagggg agataatagg agacataaga caagcacatt 1020
gtagcattag taaggtagaa tggaacaaca ctttgataca aatagttgaa aaattaagag 1080
aacaatttgg gactaaaaca ataaatttta ctaaaccctc aggaggggac ctagaaattg 1140
taacgcacag ctttaattgt agaggggaat ttttctactg taataccaca aaactgttta 1200
atagtacttg gcctgggaat attacttgga ctcggaataa taatgttact acagaaaata 1260
tcacactccc atgcagaata aaacaaattg tgaacagatg gcaggaagta ggaaaagcaa 1320
tgtatgcccc tcccatccaa ggacaaatta gatgttcatc aaatattaca gggctgctat 1380
taacaagaga tggtggtggg gaccagaata gcacagggga gatcttcaga cctggaggag 1440
gagatatgag ggacaattgg agaagtgaac tatacaaata taaagtagta caaattgaac 1500
cattaggaat agcacccacc aaggcaagga gaagagtggt gcagagagaa aaaagagcag 1560
tgggaacatt aggagctatg ttccttgggt tcttgggagc agcaggaagc act 1613
<210> 2
<211> 484
<212> PRT
<213> Human immunodeficiency virus type 1
<400> 2
Asn Ala Val Gly Gln Leu Trp Val Thr Val Tyr Tyr Gly Val Pro Val
1 5 10 15
Trp Lys Glu Ala Thr Thr Thr Leu Phe Arg Ala Ser Asp Ala Lys Ala
20 25 30
Tyr Asp Thr Glu Val His Asn Val Trp Ala Thr His Ala Cys Val Pro
35 40 45
Thr Asp Pro Asn Pro Gln Glu Ile Glu Leu Glu Asn Val Thr Glu Thr
50 55 60
Phe Asn Met Trp Lys Asn Asn Met Val Glu Gln Met His Glu Asp Ile
65 70 75 80
Ile Ser Leu Trp Asp Gln Gly Leu Lys Pro Cys Val Lys Leu Thr Pro
85 90 95
Leu Arg Val Thr Leu Asn Cys Thr Asp Tyr Lys Asn Ala Asn Ser Thr
100 105 110
Asn Asn Asn Ser Thr Ser Asp Ser Ser Asn Leu Glu Glu Glu Lys Gly
115 120 125
Glu Ile Lys Asn Cys Ser Phe Asn Ile Thr Thr Ser Ile Lys Asp Arg
130 135 140
Met Gln Lys Glu Tyr Ala Leu Phe Tyr Lys Leu Asp Ile Val Pro Ile
145 150 155 160
Asp Asn Asn Asn Thr Arg Tyr Arg Met Ile Ser Cys Asn Thr Ser Val
165 170 175
Ile Thr Gln Ala Cys Pro Lys Val Ser Phe Glu Pro Ile Pro Ile His
180 185 190
Tyr Cys Ala Pro Ala Gly Phe Ala Ile Leu Lys Cys Lys Asp Lys Lys
195 200 205
Phe Asn Gly Thr Gly Pro Cys Arg Asn Val Ser Thr Val Gln Cys Thr
210 215 220
His Gly Ile Arg Pro Val Val Ser Thr Gln Leu Leu Leu Asn Gly Ser
225 230 235 240
Leu Ala Glu Glu Glu Val Val Leu Arg Ser Glu Asn Phe Thr Asn Asn
245 250 255
Ala Lys Thr Ile Ile Val Gln Leu Lys Glu Pro Ile Lys Ile Asn Cys
260 265 270
Thr Arg Pro Asn Asn Asn Thr Arg Lys Ser Ile His Ile Gly Pro Gly
275 280 285
Arg Ala Phe Tyr Thr Thr Gly Glu Ile Ile Gly Asp Ile Arg Gln Ala
290 295 300
His Cys Ser Ile Ser Lys Val Glu Trp Asn Asn Thr Leu Ile Gln Ile
305 310 315 320
Val Glu Lys Leu Arg Glu Gln Phe Gly Thr Lys Thr Ile Asn Phe Thr
325 330 335
Lys Pro Ser Gly Gly Asp Leu Glu Ile Val Thr His Ser Phe Asn Cys
340 345 350
Arg Gly Glu Phe Phe Tyr Cys Asn Thr Thr Lys Leu Phe Asn Ser Thr
355 360 365
Trp Pro Gly Asn Ile Thr Trp Thr Arg Asn Asn Asn Val Thr Thr Glu
370 375 380
Asn Ile Thr Leu Pro Cys Arg Ile Lys Gln Ile Val Asn Arg Trp Gln
385 390 395 400
Glu Val Gly Lys Ala Met Tyr Ala Pro Pro Ile Gln Gly Gln Ile Arg
405 410 415
Cys Ser Ser Asn Ile Thr Gly Leu Leu Leu Thr Arg Asp Gly Gly Gly
420 425 430
Asp Gln Asn Ser Thr Gly Glu Ile Phe Arg Pro Gly Gly Gly Asp Met
435 440 445
Arg Asp Asn Trp Arg Ser Glu Leu Tyr Lys Tyr Lys Val Val Gln Ile
450 455 460
Glu Pro Leu Gly Ile Ala Pro Thr Lys Ala Arg Arg Arg Val Val Gln
465 470 475 480
Arg Glu Lys Arg
<210> 3
<211> 1604
<212> DNA
<213> Human immunodeficiency virus type 1
<400> 3
agaaagagca gaagacagtg gcaatgaaag tgacggggat caggaagaat tgtcagcgct 60
tgtggagatg gggcatgatg ctcctgggga tgttaatgat ctgtagtgct gcagaacaat 120
tgtgggtcac agtctattat ggggtacctg tgtggaaaga agcaaccacc actctatttt 180
gtgcatcaga tgctaaagca tatgacgcag agaaacataa tgtttgggcc acacatgcct 240
gcgtacccac agaccccaac ccacaagaaa tagtattgga aaatgtgaca gaatatttta 300
atgcttggaa aaataacatg gtagaacaga tgcatgagga tataatcagt ttatgggatc 360
aaagcctaaa accatgtgta aaattaaccc cactctgtgt tactttaaat cgcactgatt 420
tgaataatag tactaacacc actaatagta atagcagcgg ggggatgatg agagaagaaa 480
tgaaaaactg ctctttcaat atcaccacaa caataggtga taggaggcaa aaagaatatg 540
cactttttta taaacttgat atagcatcaa taaaggatga tgctaataat ttcacatata 600
ggttgataag ttgtaacacc tcagtcatta cacaagcctg tccaaagata tcctttgagc 660
caattcccat acattattgt gccccggctg gttttgcaat tctaaagtat aacgataaga 720
agttcaatgg agaagagcaa tgtaaaaatg tcagcacagt acaatgtaca catggaatta 780
agccagtagt atcaactcag ctgctgttaa atggtagtct agcagaagaa gagatagtaa 840
ttagatctga caatttcaca gacaatgcta aaaccataac agtacagctg aatgaatctg 900
tagtaattaa ttgtacaaga ccccacaaca atacaagaaa aagtataaat ataggaccag 960
ggagagcatg gtatacaaca ggagaaataa taggagatat aagacaagca cattgtaaca 1020
ttagtaaaac acaatggaat aacactttaa taaagatagt taaaaaatta agagaacaat 1080
ttaatacaaa caccataatc tttaatcaat ccacaggagg ggacctagaa attgtaatgc 1140
acagttttaa ttgtggaggg gaatttttct actgtgatac aacacaactg tttaatagta 1200
cttggaatat tactggagaa agtacttgga atagtactgg aaaaacaaat gaaactatca 1260
cactcccatg tagaataaaa caagttataa acatgtggca gcaagtaggg aaagcaatgt 1320
atgcccctcc catcaaaggg caaattagat gttcatcaaa tattacaggg ctgctattaa 1380
caagagatgg tggtaagaac agcagtaacg ggactgagac ctttagacct ggaggaggag 1440
atatgaggga caattggaga agtgaattat ataaatataa agtagtagaa attgaaccat 1500
taggaatagc acccactaag gcaaagagaa gagtggtgca gagagaaaga agagcagtaa 1560
taggagctat gttccttggg ttcttgggag cagcaggaag cact 1604
<210> 4
<211> 483
<212> PRT
<213> Human immunodeficiency virus type 1
<400> 4
Ser Ala Ala Glu Gln Leu Trp Val Thr Val Tyr Tyr Gly Val Pro Val
1 5 10 15
Trp Lys Glu Ala Thr Thr Thr Leu Phe Cys Ala Ser Asp Ala Lys Ala
20 25 30
Tyr Asp Ala Glu Lys His Asn Val Trp Ala Thr His Ala Cys Val Pro
35 40 45
Thr Asp Pro Asn Pro Gln Glu Ile Val Leu Glu Asn Val Thr Glu Tyr
50 55 60
Phe Asn Ala Trp Lys Asn Asn Met Val Glu Gln Met His Glu Asp Ile
65 70 75 80
Ile Ser Leu Trp Asp Gln Ser Leu Lys Pro Cys Val Lys Leu Thr Pro
85 90 95
Leu Cys Val Thr Leu Asn Arg Thr Asp Leu Asn Asn Ser Thr Asn Thr
100 105 110
Thr Asn Ser Asn Ser Ser Gly Gly Met Met Arg Glu Glu Met Lys Asn
115 120 125
Cys Ser Phe Asn Ile Thr Thr Thr Ile Gly Asp Arg Arg Gln Lys Glu
130 135 140
Tyr Ala Leu Phe Tyr Lys Leu Asp Ile Ala Ser Ile Lys Asp Asp Ala
145 150 155 160
Asn Asn Phe Thr Tyr Arg Leu Ile Ser Cys Asn Thr Ser Val Ile Thr
165 170 175
Gln Ala Cys Pro Lys Ile Ser Phe Glu Pro Ile Pro Ile His Tyr Cys
180 185 190
Ala Pro Ala Gly Phe Ala Ile Leu Lys Tyr Asn Asp Lys Lys Phe Asn
195 200 205
Gly Glu Glu Gln Cys Lys Asn Val Ser Thr Val Gln Cys Thr His Gly
210 215 220
Ile Lys Pro Val Val Ser Thr Gln Leu Leu Leu Asn Gly Ser Leu Ala
225 230 235 240
Glu Glu Glu Ile Val Ile Arg Ser Asp Asn Phe Thr Asp Asn Ala Lys
245 250 255
Thr Ile Thr Val Gln Leu Asn Glu Ser Val Val Ile Asn Cys Thr Arg
260 265 270
Pro His Asn Asn Thr Arg Lys Ser Ile Asn Ile Gly Pro Gly Arg Ala
275 280 285
Trp Tyr Thr Thr Gly Glu Ile Ile Gly Asp Ile Arg Gln Ala His Cys
290 295 300
Asn Ile Ser Lys Thr Gln Trp Asn Asn Thr Leu Ile Lys Ile Val Lys
305 310 315 320
Lys Leu Arg Glu Gln Phe Asn Thr Asn Thr Ile Ile Phe Asn Gln Ser
325 330 335
Thr Gly Gly Asp Leu Glu Ile Val Met His Ser Phe Asn Cys Gly Gly
340 345 350
Glu Phe Phe Tyr Cys Asp Thr Thr Gln Leu Phe Asn Ser Thr Trp Asn
355 360 365
Ile Thr Gly Glu Ser Thr Trp Asn Ser Thr Gly Lys Thr Asn Glu Thr
370 375 380
Ile Thr Leu Pro Cys Arg Ile Lys Gln Val Ile Asn Met Trp Gln Gln
385 390 395 400
Val Gly Lys Ala Met Tyr Ala Pro Pro Ile Lys Gly Gln Ile Arg Cys
405 410 415
Ser Ser Asn Ile Thr Gly Leu Leu Leu Thr Arg Asp Gly Gly Lys Asn
420 425 430
Ser Ser Asn Gly Thr Glu Thr Phe Arg Pro Gly Gly Gly Asp Met Arg
435 440 445
Asp Asn Trp Arg Ser Glu Leu Tyr Lys Tyr Lys Val Val Glu Ile Glu
450 455 460
Pro Leu Gly Ile Ala Pro Thr Lys Ala Lys Arg Arg Val Val Gln Arg
465 470 475 480
Glu Arg Arg
<210> 5
<211> 1613
<212> DNA
<213> Human immunodeficiency virus type 1
<400> 5
agaaagagca gaagacagtg gcaatgagag tgatggggat gaggaagaat tatcagcact 60
ggtggagagg gggcatcttg ctccttggga tgttaatgat cagtagtgct atagaaaatt 120
cgtgggtcac agtctattat ggggtacctg tgtggaaaga agctaccacc actttatttt 180
gtgcatcaga tgctaaagct tatgaaacag aggcacataa tgtttgggcc acacatgcct 240
gtgtacccac agaccccaac ccacaagaaa taaaattgga aaatgtgtca gaaaatttta 300
acatgtggaa aaataacatg gtagaccaaa tgcatgagga tataatcagt ttatgggatc 360
aaagcctaaa gccatgtgta aaattaaccc cactctgtgt tactttaaat tgcactgatt 420
attttgggaa tactactaat accaatacta ataccaccag tagtcccagc accaacagta 480
gtaatgaagg agaagtgaaa aaatgctctt tcaatatcac cacagaagta agggacaagg 540
tgcgaaaaga atttgcactt ttttataaac ttgatatagt acgaacaggt catgataata 600
ctagctatag gttgataagt tgtaacacct cagtcattac acaggcctgc ccaaagatat 660
cctttgatcc aattcccata cattattgtg ccccggctgg ttttgcgatt ctcaagtgta 720
gagataataa atttaatgga acaggaccat gtaaaaatgt cagcacagta caatgtactc 780
atggaattag gccagtaata tcaactcaac tactgttaaa tggcagtcta gcagaagaag 840
aggtagtagt tagatctaaa aatttcacaa acaatgctga agtcataata gtgcagctga 900
aagaatctgt acaaataaat cgtacaagac ccaacaacaa tacaaggaaa agtataccta 960
tgggtccagg gagagcatgg tatgctacag aagatatcat aggaaatata agacaggcac 1020
attgtaacat tagtggagta aaatggaata acactttaca gcaaatagtt aaaaaattaa 1080
gagagcaatt taaaaataaa acaataaagt ttcagccatc ctcaggaggg gacccagaaa 1140
ttgtaaggca cagttttaat tgtagagggg agtttttcta ctgtgataca acactactgt 1200
ttaatagtac ttggaatagt aatgatactt ggaatagtac tgaagggtca aatgacacta 1260
ttacactccc gtgtagaata aaacaaattg taaacatgtg gcaagaagta ggaaaagcaa 1320
tgtatgctcc tcccatcaaa ggacaactta actgttcatc aaatattaca gggccgatat 1380
taacaagaga tggtggtaag ggtgagaact cgaccgagaa caatactgag atattcagac 1440
ctggaggagg agatatgagg gacaattggc gaagtgaatt atataaatat aaagtagtac 1500
aaattgaacc attaggaata gcacccacta aggcaaagag aagagtggtg cagagagaaa 1560
aaagaggagc gggactgttt ttcctggggt tcttgggaac agcaggaagc act 1613
<210> 6
<211> 487
<212> PRT
<213> Human immunodeficiency virus type 1
<400> 6
Ser Ala Ile Glu Asn Ser Trp Val Thr Val Tyr Tyr Gly Val Pro Val
1 5 10 15
Trp Lys Glu Ala Thr Thr Thr Leu Phe Cys Ala Ser Asp Ala Lys Ala
20 25 30
Tyr Glu Thr Glu Ala His Asn Val Trp Ala Thr His Ala Cys Val Pro
35 40 45
Thr Asp Pro Asn Pro Gln Glu Ile Lys Leu Glu Asn Val Ser Glu Asn
50 55 60
Phe Asn Met Trp Lys Asn Asn Met Val Asp Gln Met His Glu Asp Ile
65 70 75 80
Ile Ser Leu Trp Asp Gln Ser Leu Lys Pro Cys Val Lys Leu Thr Pro
85 90 95
Leu Cys Val Thr Leu Asn Cys Thr Asp Tyr Phe Gly Asn Thr Thr Asn
100 105 110
Thr Asn Thr Asn Thr Thr Ser Ser Pro Ser Thr Asn Ser Ser Asn Glu
115 120 125
Gly Glu Val Lys Lys Cys Ser Phe Asn Ile Thr Thr Glu Val Arg Asp
130 135 140
Lys Val Arg Lys Glu Phe Ala Leu Phe Tyr Lys Leu Asp Ile Val Arg
145 150 155 160
Thr Gly His Asp Asn Thr Ser Tyr Arg Leu Ile Ser Cys Asn Thr Ser
165 170 175
Val Ile Thr Gln Ala Cys Pro Lys Ile Ser Phe Asp Pro Ile Pro Ile
180 185 190
His Tyr Cys Ala Pro Ala Gly Phe Ala Ile Leu Lys Cys Arg Asp Asn
195 200 205
Lys Phe Asn Gly Thr Gly Pro Cys Lys Asn Val Ser Thr Val Gln Cys
210 215 220
Thr His Gly Ile Arg Pro Val Ile Ser Thr Gln Leu Leu Leu Asn Gly
225 230 235 240
Ser Leu Ala Glu Glu Glu Val Val Val Arg Ser Lys Asn Phe Thr Asn
245 250 255
Asn Ala Glu Val Ile Ile Val Gln Leu Lys Glu Ser Val Gln Ile Asn
260 265 270
Arg Thr Arg Pro Asn Asn Asn Thr Arg Lys Ser Ile Pro Met Gly Pro
275 280 285
Gly Arg Ala Trp Tyr Ala Thr Glu Asp Ile Ile Gly Asn Ile Arg Gln
290 295 300
Ala His Cys Asn Ile Ser Gly Val Lys Trp Asn Asn Thr Leu Gln Gln
305 310 315 320
Ile Val Lys Lys Leu Arg Glu Gln Phe Lys Asn Lys Thr Ile Lys Phe
325 330 335
Gln Pro Ser Ser Gly Gly Asp Pro Glu Ile Val Arg His Ser Phe Asn
340 345 350
Cys Arg Gly Glu Phe Phe Tyr Cys Asp Thr Thr Leu Leu Phe Asn Ser
355 360 365
Thr Trp Asn Ser Asn Asp Thr Trp Asn Ser Thr Glu Gly Ser Asn Asp
370 375 380
Thr Ile Thr Leu Pro Cys Arg Ile Lys Gln Ile Val Asn Met Trp Gln
385 390 395 400
Glu Val Gly Lys Ala Met Tyr Ala Pro Pro Ile Lys Gly Gln Leu Asn
405 410 415
Cys Ser Ser Asn Ile Thr Gly Pro Ile Leu Thr Arg Asp Gly Gly Lys
420 425 430
Gly Glu Asn Ser Thr Glu Asn Asn Thr Glu Ile Phe Arg Pro Gly Gly
435 440 445
Gly Asp Met Arg Asp Asn Trp Arg Ser Glu Leu Tyr Lys Tyr Lys Val
450 455 460
Val Gln Ile Glu Pro Leu Gly Ile Ala Pro Thr Lys Ala Lys Arg Arg
465 470 475 480
Val Val Gln Arg Glu Lys Arg
485
<210> 7
<211> 1595
<212> DNA
<213> Human immunodeficiency virus type 1
<400> 7
agaaagagca gaagacagtg gcaatgagag tgaaggggat caggaagaat tatcagcact 60
tgtggagatg gggcatcatg ctccttggga tgttgatgat ctgtaatgct gcagaacagt 120
tgtgggtcac agtctattat ggggtacctg tgtggaggga tgcaaatacc actctatttt 180
gtgcatccga tgctaaagca tatgatacag aggtacataa tgtttgggcc acacatgcct 240
gtgtacccac agaccccaac ccacaagaag tagtattgga aaatgtgaca gaaagcttta 300
acatatggaa aaataacatg gtagaacaaa tgcatgagga tataatcagt ttgtgggatc 360
aaagcctaaa gccatgtgta aaattaaccc cactctgcgt tactttaaat tgcagtaatt 420
taagtaccac taataataat accattagtg gtaatgagac agcagtaaat aaaggagaaa 480
taaaaaaccg ctctttcaat gtcaccacaa acataagaga tagggtaaag aaagaatatg 540
cgctttttta taatcttgat ttagtacaaa taggtgattc taatactagc tatacaatgg 600
taaagtgtaa cacctcagtc attacacagg cctgtccaaa ggtacccttt gagccaattc 660
ccatacattt ttgtgcccca gctggttttg cgattctaaa gtgtaataat aagacgttca 720
gtggaaaagg agaatgtaca aatgtcagca cagtacaatg tacgcatgga attagaccag 780
tagtatcaac tcatctgctg ttaaatggca gcttagcaga agaagacata gtaattagat 840
ctgacaattt cacggacaat actaaaacca taatagtaca gctagacaat actataaaca 900
ttacttgtac cagacccaac aataatacaa ggaaaggtat acatatagga ccagggagag 960
cattttatgc aacaggggat ataataggaa atataagaca agcacattgt aaccttagta 1020
aaacacattg gaataacact ttaaaacaga tagttaaaaa attaagagaa caatttaaaa 1080
ataaaacaat agtctttaat caatctacag gaggggaccc agaaattgta cagcacactt 1140
ttaattgtag aggggaattt ttctattgta actcaacacc actgtttaat agtacttggt 1200
atcctaatag tacattggat gaaacaaaca gcacagacaa caatgaaact atcacactcc 1260
aatgcagaat aagacaaatt ataaacatgt ggcaggaagt aggaaaagca atgtatgccc 1320
ctcctatcag aggacaaatt acatgcacat caaatattac agggctgata ttaacaagag 1380
atggtggaga taacaatgaa actgagatct tcaggcctgg aggaggcaat atgaaggata 1440
attggagaag tgaattatat aaatataaag tagtaaaaat tgagccatta ggaatagcac 1500
ccactaaggc aaagagaaga gcggtgcaga gagaaaaaag agcagcggga ataggagctg 1560
tgttccttgg gttcttggga gcagcaggaa gcact 1595
<210> 8
<211> 479
<212> PRT
<213> Human immunodeficiency virus type 1
<400> 8
Asn Ala Ala Glu Gln Leu Trp Val Thr Val Tyr Tyr Gly Val Pro Val
1 5 10 15
Trp Arg Asp Ala Asn Thr Thr Leu Phe Cys Ala Ser Asp Ala Lys Ala
20 25 30
Tyr Asp Thr Glu Val His Asn Val Trp Ala Thr His Ala Cys Val Pro
35 40 45
Thr Asp Pro Asn Pro Gln Glu Val Val Leu Glu Asn Val Thr Glu Ser
50 55 60
Phe Asn Ile Trp Lys Asn Asn Met Val Glu Gln Met His Glu Asp Ile
65 70 75 80
Ile Ser Leu Trp Asp Gln Ser Leu Lys Pro Cys Val Lys Leu Thr Pro
85 90 95
Leu Cys Val Thr Leu Asn Cys Ser Asn Leu Ser Thr Thr Asn Asn Asn
100 105 110
Thr Ile Ser Gly Asn Glu Thr Ala Val Asn Lys Gly Glu Ile Lys Asn
115 120 125
Arg Ser Phe Asn Val Thr Thr Asn Ile Arg Asp Arg Val Lys Lys Glu
130 135 140
Tyr Ala Leu Phe Tyr Asn Leu Asp Leu Val Gln Ile Gly Asp Ser Asn
145 150 155 160
Thr Ser Tyr Thr Met Val Lys Cys Asn Thr Ser Val Ile Thr Gln Ala
165 170 175
Cys Pro Lys Val Pro Phe Glu Pro Ile Pro Ile His Phe Cys Ala Pro
180 185 190
Ala Gly Phe Ala Ile Leu Lys Cys Asn Asn Lys Thr Phe Ser Gly Lys
195 200 205
Gly Glu Cys Thr Asn Val Ser Thr Val Gln Cys Thr His Gly Ile Arg
210 215 220
Pro Val Val Ser Thr His Leu Leu Leu Asn Gly Ser Leu Ala Glu Glu
225 230 235 240
Asp Ile Val Ile Arg Ser Asp Asn Phe Thr Asp Asn Thr Lys Thr Ile
245 250 255
Ile Val Gln Leu Asp Asn Thr Ile Asn Ile Thr Cys Thr Arg Pro Asn
260 265 270
Asn Asn Thr Arg Lys Gly Ile His Ile Gly Pro Gly Arg Ala Phe Tyr
275 280 285
Ala Thr Gly Asp Ile Ile Gly Asn Ile Arg Gln Ala His Cys Asn Leu
290 295 300
Ser Lys Thr His Trp Asn Asn Thr Leu Lys Gln Ile Val Lys Lys Leu
305 310 315 320
Arg Glu Gln Phe Lys Asn Lys Thr Ile Val Phe Asn Gln Ser Thr Gly
325 330 335
Gly Asp Pro Glu Ile Val Gln His Thr Phe Asn Cys Arg Gly Glu Phe
340 345 350
Phe Tyr Cys Asn Ser Thr Pro Leu Phe Asn Ser Thr Trp Tyr Pro Asn
355 360 365
Ser Thr Leu Asp Glu Thr Asn Ser Thr Asp Asn Asn Glu Thr Ile Thr
370 375 380
Leu Gln Cys Arg Ile Arg Gln Ile Ile Asn Met Trp Gln Glu Val Gly
385 390 395 400
Lys Ala Met Tyr Ala Pro Pro Ile Arg Gly Gln Ile Thr Cys Thr Ser
405 410 415
Asn Ile Thr Gly Leu Ile Leu Thr Arg Asp Gly Gly Asp Asn Asn Glu
420 425 430
Thr Glu Ile Phe Arg Pro Gly Gly Gly Asn Met Lys Asp Asn Trp Arg
435 440 445
Ser Glu Leu Tyr Lys Tyr Lys Val Val Lys Ile Glu Pro Leu Gly Ile
450 455 460
Ala Pro Thr Lys Ala Lys Arg Arg Ala Val Gln Arg Glu Lys Arg
465 470 475
<210> 9
<211> 1676
<212> DNA
<213> Human immunodeficiency virus type 1
<400> 9
agaaagagca gaagacagtg gcaatgaaag tgaaggagac caggaagaat tatcaaagct 60
tgtggagagg gggcaccttg ttccttggaa tgttgatgat ctgtagtgtt acaggacaat 120
tgtgggttac agtctattat ggggtacctg tgtggaaaga ggcaaccacc actctatttt 180
gtgcatcaaa tgctaaagca tatgatacag aggtacataa tgtttgggcc acacatgcct 240
gtgtacccac agaccccaac gcacaagaag tagtattaga aaatgtgaca gaatattttg 300
acatgtggaa aaatgacatg gtagaacaaa tgcatgagga tgtaatcagt ttatgggatc 360
aaagcctaaa gccatgtgta gaattaaccc cactctgtgt tactttaaat tgcactgatg 420
tgaatattac taataccaat aatagtacca ttaacaatag tagtaataat accaatagta 480
gtgattggga acggatggag ccaggagaaa taaaaaactg ctctttcaat agcaccacaa 540
acatgagaga taggacgcag agagaatacg cactttttta taaacttgat atagaaccag 600
tagataataa aagtaataat aaaagtctta atgaaagtat tagtaaaagt attacttata 660
ggttaataag ttataacacc tcagtcatta aacaggcctg tccaaaagta tcttttgagc 720
caattcccat acattattgt gccccagctg gttttgcaat tctaaagtgt aataatgaga 780
cattcgatgg aaaaggagaa tgtagaaatg tcagcacagt acaatgtaca catggaatta 840
ggccaatagt gtcaactcaa ctgctgttaa atggcagtct agcagaaaag gacatagtaa 900
ttagatcaaa caatttctcg gacaatgcta aaaccataat agtacatctg aatgaatcta 960
taacaattaa gtgtataaga cccaacaata atacaagaaa aagtatacat atagcaccag 1020
gaagcgcatt ttatgcaaca ggagacataa taggagatat aaggcaagca cattgtaaca 1080
ttagtgcaaa aaattggatt aacactttaa aacagatagt tataaaacta aaaggaaaat 1140
ataatactag tacaaaaata gactttaagc catcctcagg aggggaccca gaaattgtaa 1200
tgcacagctt taattgtgga ggggagtttt tctactgtaa tacatcaaaa ctgtttaata 1260
atacttggaa ggagaataat actttagagt caaatgatac tatggagatc attaacgaaa 1320
ctattatact cccatgtaga ataaaacagt ttataaacat gtggcagaaa gtgggaaaag 1380
caatgtatgc ccctcccatc agaggacaaa ttaaatgtga atcaaatatt acagggctgc 1440
tattaacaag agatggtggt aatacaaata gcacgaacgg gaccgagacc ttcagacctg 1500
gaggaggaaa tatgaaagac aattggagaa gtgaattgta caaatataaa gtagtaaaaa 1560
ttgaaccaat aggaatagca cccaccaggg caaaaagaag agtggtgcag agagaaaaaa 1620
gagcagtggg aataggagct gtgttccttg ggttcttggg agcagcagga agcact 1676
<210> 10
<211> 506
<212> PRT
<213> Human immunodeficiency virus type 1
<400> 10
Ser Val Thr Gly Gln Leu Trp Val Thr Val Tyr Tyr Gly Val Pro Val
1 5 10 15
Trp Lys Glu Ala Thr Thr Thr Leu Phe Cys Ala Ser Asn Ala Lys Ala
20 25 30
Tyr Asp Thr Glu Val His Asn Val Trp Ala Thr His Ala Cys Val Pro
35 40 45
Thr Asp Pro Asn Ala Gln Glu Val Val Leu Glu Asn Val Thr Glu Tyr
50 55 60
Phe Asp Met Trp Lys Asn Asp Met Val Glu Gln Met His Glu Asp Val
65 70 75 80
Ile Ser Leu Trp Asp Gln Ser Leu Lys Pro Cys Val Glu Leu Thr Pro
85 90 95
Leu Cys Val Thr Leu Asn Cys Thr Asp Val Asn Ile Thr Asn Thr Asn
100 105 110
Asn Ser Thr Ile Asn Asn Ser Ser Asn Asn Thr Asn Ser Ser Asp Trp
115 120 125
Glu Arg Met Glu Pro Gly Glu Ile Lys Asn Cys Ser Phe Asn Ser Thr
130 135 140
Thr Asn Met Arg Asp Arg Thr Gln Arg Glu Tyr Ala Leu Phe Tyr Lys
145 150 155 160
Leu Asp Ile Glu Pro Val Asp Asn Lys Ser Asn Asn Lys Ser Leu Asn
165 170 175
Glu Ser Ile Ser Lys Ser Ile Thr Tyr Arg Leu Ile Ser Tyr Asn Thr
180 185 190
Ser Val Ile Lys Gln Ala Cys Pro Lys Val Ser Phe Glu Pro Ile Pro
195 200 205
Ile His Tyr Cys Ala Pro Ala Gly Phe Ala Ile Leu Lys Cys Asn Asn
210 215 220
Glu Thr Phe Asp Gly Lys Gly Glu Cys Arg Asn Val Ser Thr Val Gln
225 230 235 240
Cys Thr His Gly Ile Arg Pro Ile Val Ser Thr Gln Leu Leu Leu Asn
245 250 255
Gly Ser Leu Ala Glu Lys Asp Ile Val Ile Arg Ser Asn Asn Phe Ser
260 265 270
Asp Asn Ala Lys Thr Ile Ile Val His Leu Asn Glu Ser Ile Thr Ile
275 280 285
Lys Cys Ile Arg Pro Asn Asn Asn Thr Arg Lys Ser Ile His Ile Ala
290 295 300
Pro Gly Ser Ala Phe Tyr Ala Thr Gly Asp Ile Ile Gly Asp Ile Arg
305 310 315 320
Gln Ala His Cys Asn Ile Ser Ala Lys Asn Trp Ile Asn Thr Leu Lys
325 330 335
Gln Ile Val Ile Lys Leu Lys Gly Lys Tyr Asn Thr Ser Thr Lys Ile
340 345 350
Asp Phe Lys Pro Ser Ser Gly Gly Asp Pro Glu Ile Val Met His Ser
355 360 365
Phe Asn Cys Gly Gly Glu Phe Phe Tyr Cys Asn Thr Ser Lys Leu Phe
370 375 380
Asn Asn Thr Trp Lys Glu Asn Asn Thr Leu Glu Ser Asn Asp Thr Met
385 390 395 400
Glu Ile Ile Asn Glu Thr Ile Ile Leu Pro Cys Arg Ile Lys Gln Phe
405 410 415
Ile Asn Met Trp Gln Lys Val Gly Lys Ala Met Tyr Ala Pro Pro Ile
420 425 430
Arg Gly Gln Ile Lys Cys Glu Ser Asn Ile Thr Gly Leu Leu Leu Thr
435 440 445
Arg Asp Gly Gly Asn Thr Asn Ser Thr Asn Gly Thr Glu Thr Phe Arg
450 455 460
Pro Gly Gly Gly Asn Met Lys Asp Asn Trp Arg Ser Glu Leu Tyr Lys
465 470 475 480
Tyr Lys Val Val Lys Ile Glu Pro Ile Gly Ile Ala Pro Thr Arg Ala
485 490 495
Lys Arg Arg Val Val Gln Arg Glu Lys Arg
500 505
<210> 11
<211> 1646
<212> DNA
<213> Human immunodeficiency virus type 1
<400> 11
agaaagagca gaagacagtg gcaatgagag tgaaggggat caggaagagt tgcttgtgga 60
aatggggcac cttgttcctt ggaatgttga tgatctgtag tgctgtagaa caattgtggg 120
tcacagttta ttatggagta cctgtgtgga aagaagcaac caccactcta ttttgtgcat 180
cagatgctaa ggcatatatt ccagaggtac ataatgtatg ggccacacat gcctgtgtac 240
ccacagatcc caacccacaa gaagtagaat tgaaaaatgt gacagaggat tttaacatgt 300
ggaagaataa catggtagaa caaatgcatg aagatgtaat cagtttatgg gatcaaagcc 360
taaagccata tgtggaatta accccactct gtgttacgtt aaattgcact gattattggg 420
gggatactac tcgtgccgga aatactactg ctagtgtcac tagtactgct aatgtcacta 480
gtagtaaaga ggtacaaatg aaaaactgct ctttctatgt ctccacaaac atgatggata 540
agaaacagaa agaatacgca cttttttata aacttgatgt agtgccaata ggtaatgaga 600
ctaatggtaa ggagactaat aatagctata ggttaataag ttgtaacacc tcagtagtta 660
cccaagcctg tccaaaggta acctttgagc caattcccat acattattgt gccccggctg 720
gttttgtgat tctaaagtgc aaggataaga ggttcaatgg aacaggacca tgtacaaatg 780
tcagcacagt acaatgtaca catggaatta ggccagtagt atcaactcaa ctactgttaa 840
atggcagctt agcagaagaa gatatagtac ttagatctga aaatttctcg aacaatgcta 900
aaaacataat agtacagctg aatgaatctg tagtaattaa ttgtacaaga ctcaacaaca 960
atacaagaaa aagcatacat atggggccag ggaaagcatt ttatgcaaca ggagacacca 1020
taggagatat aagacaagca cattgtaaca ttagtgaaga ggcctggaat aaaactctaa 1080
gacgaatagc tataaaatta aaagaacaat ttaatataac agacaaagta atctttaaac 1140
cctcctcagg aggggacata gaaattgcaa tgcacagtgt taattgtgga ggggaatttt 1200
tctactgtaa tacaacacag ctgtttaata gtacttggaa tgaaacacag ctgaatagta 1260
gtactgtgaa taatattaca aggtcagaca acaacatcac actcccatgc aaaataaagc 1320
aaattgtaaa catgtggcag aaaataggaa aagcaatgta tgcccctccc atcagtggac 1380
taattagatg taaatcaaat attacaggga taatattagc aagagatggt ggtaataatg 1440
gcacaaatga tacgaggacc ttcagacctg taggaggaaa tatgaaggac aattggagaa 1500
gtgaattata taaatataaa gtagtaagaa ttgaaccatt aggagtagca cccaccaagg 1560
caaagagaag agtggtgcag agagaaaaaa gagcagtggg actaggagct atgttccttg 1620
ggttcttggg agcagcagga agcact 1646
<210> 12
<211> 498
<212> PRT
<213> Human immunodeficiency virus type 1
<400> 12
Ser Ala Val Glu Gln Leu Trp Val Thr Val Tyr Tyr Gly Val Pro Val
1 5 10 15
Trp Lys Glu Ala Thr Thr Thr Leu Phe Cys Ala Ser Asp Ala Lys Ala
20 25 30
Tyr Ile Pro Glu Val His Asn Val Trp Ala Thr His Ala Cys Val Pro
35 40 45
Thr Asp Pro Asn Pro Gln Glu Val Glu Leu Lys Asn Val Thr Glu Asp
50 55 60
Phe Asn Met Trp Lys Asn Asn Met Val Glu Gln Met His Glu Asp Val
65 70 75 80
Ile Ser Leu Trp Asp Gln Ser Leu Lys Pro Tyr Val Glu Leu Thr Pro
85 90 95
Leu Cys Val Thr Leu Asn Cys Thr Asp Tyr Trp Gly Asp Thr Thr Arg
100 105 110
Ala Gly Asn Thr Thr Ala Ser Val Thr Ser Thr Ala Asn Val Thr Ser
115 120 125
Ser Lys Glu Val Gln Met Lys Asn Cys Ser Phe Tyr Val Ser Thr Asn
130 135 140
Met Met Asp Lys Lys Gln Lys Glu Tyr Ala Leu Phe Tyr Lys Leu Asp
145 150 155 160
Val Val Pro Ile Gly Asn Glu Thr Asn Gly Lys Glu Thr Asn Asn Ser
165 170 175
Tyr Arg Leu Ile Ser Cys Asn Thr Ser Val Val Thr Gln Ala Cys Pro
180 185 190
Lys Val Thr Phe Glu Pro Ile Pro Ile His Tyr Cys Ala Pro Ala Gly
195 200 205
Phe Val Ile Leu Lys Cys Lys Asp Lys Arg Phe Asn Gly Thr Gly Pro
210 215 220
Cys Thr Asn Val Ser Thr Val Gln Cys Thr His Gly Ile Arg Pro Val
225 230 235 240
Val Ser Thr Gln Leu Leu Leu Asn Gly Ser Leu Ala Glu Glu Asp Ile
245 250 255
Val Leu Arg Ser Glu Asn Phe Ser Asn Asn Ala Lys Asn Ile Ile Val
260 265 270
Gln Leu Asn Glu Ser Val Val Ile Asn Cys Thr Arg Leu Asn Asn Asn
275 280 285
Thr Arg Lys Ser Ile His Met Gly Pro Gly Lys Ala Phe Tyr Ala Thr
290 295 300
Gly Asp Thr Ile Gly Asp Ile Arg Gln Ala His Cys Asn Ile Ser Glu
305 310 315 320
Glu Ala Trp Asn Lys Thr Leu Arg Arg Ile Ala Ile Lys Leu Lys Glu
325 330 335
Gln Phe Asn Ile Thr Asp Lys Val Ile Phe Lys Pro Ser Ser Gly Gly
340 345 350
Asp Ile Glu Ile Ala Met His Ser Val Asn Cys Gly Gly Glu Phe Phe
355 360 365
Tyr Cys Asn Thr Thr Gln Leu Phe Asn Ser Thr Trp Asn Glu Thr Gln
370 375 380
Leu Asn Ser Ser Thr Val Asn Asn Ile Thr Arg Ser Asp Asn Asn Ile
385 390 395 400
Thr Leu Pro Cys Lys Ile Lys Gln Ile Val Asn Met Trp Gln Lys Ile
405 410 415
Gly Lys Ala Met Tyr Ala Pro Pro Ile Ser Gly Leu Ile Arg Cys Lys
420 425 430
Ser Asn Ile Thr Gly Ile Ile Leu Ala Arg Asp Gly Gly Asn Asn Gly
435 440 445
Thr Asn Asp Thr Arg Thr Phe Arg Pro Val Gly Gly Asn Met Lys Asp
450 455 460
Asn Trp Arg Ser Glu Leu Tyr Lys Tyr Lys Val Val Arg Ile Glu Pro
465 470 475 480
Leu Gly Val Ala Pro Thr Lys Ala Lys Arg Arg Val Val Gln Arg Glu
485 490 495
Lys Arg
<210> 13
<211> 1586
<212> DNA
<213> Human immunodeficiency virus type 1
<400> 13
agaaagagca gaagacagtg gcaatgagag tgaaggggat caggaagaat tgtcagcgct 60
tgtggaaatg gggcaccatg ctccttggga tgttaatgat ctgtagggct gcagagcaat 120
tgtgggtcac agtctattat ggagtacctg tgtggagaga agcaaacacc actctatttt 180
gtgcatcaga tgctaaagca tatgatacag aggtacataa tgtttgggcc acacatgcct 240
gtgtacccac agaccctaac ccacaagaag tagtattgga aaatgtgaca gaaaatttta 300
acatgtggaa aaataacatg gtagaacaga tgcatgagga tataatcagt ctatgggatc 360
aaagcctaaa accatgtgta aaattaaccc cactctgtgt tactttaaat tgtaatacca 420
ttaatgccac taaagatatg ataggagaat taaaaaactg ctctttcaac atcaccacaa 480
gcataagaga taagtggcaa aaagaatatg cactttttta taaacttgat gtagtgccaa 540
tagatgataa tggtaatgat actggtaatg gtagctatag gctaataagt tgtaatacct 600
cagtcattac acaggcctgt ccaaagacat cctttgagcc aattcccata cattattgtg 660
ccccggctgg ttttgcgatt ctaaagtgta acaataaaaa gttcaatgga acaggaccac 720
gtaaaaatgt cagcacagta caatgtacac atggaattag gccagtagta tctactcaac 780
tgttgttaaa tggcagtcta gcagaagaag agatagtact tagatctgaa aatttctcaa 840
acaatgctaa aaccataata gtacaattga atgaatctat agtaattaat tgtacaagac 900
ccaacaacaa tacgagaaaa agtatacata taggaccagg gagagcattt tatgcagcag 960
gagaaataat aggagatata agaacagcac attgtaacat tagtggaaca aaatggaata 1020
acactttaaa acagatagtt gtaaaattaa gagaacaatt tggaaataaa acaatggtct 1080
ttagtcactc ctcaggaggg gacccggaaa ttgtaaggca cagttttaat tgtggagggg 1140
aatttttcca ttgcaataca acacaactgt ttaatagtag ttggccttgg aatggtactg 1200
aagggtcaaa taacactgaa ggaaatgaca caatcaccct cccatgcaga ataaaacaaa 1260
ttataaacat gtggcaggaa gtaggaaaag caatgtatgc ccctcccatc agaggggtaa 1320
ttaaatgttc atcaaatatt acagggctat tattaacaag agatgggggt actaacagga 1380
ccgacaatgg gagcgaggtc ttcagacctg ggggaggaga tatgagggac aattggagta 1440
gtgaattata taaaaataaa gtagtaagaa ttgaaccatt aggagtagca cccaccaagg 1500
caaagagaag agtggtgcag agagaaaaaa gagcagtggg actaggagct atgttccttg 1560
ggttcttggg agcagcagga agcact 1586
<210> 14
<211> 476
<212> PRT
<213> Human immunodeficiency virus type 1
<400> 14
Arg Ala Ala Glu Gln Leu Trp Val Thr Val Tyr Tyr Gly Val Pro Val
1 5 10 15
Trp Arg Glu Ala Asn Thr Thr Leu Phe Cys Ala Ser Asp Ala Lys Ala
20 25 30
Tyr Asp Thr Glu Val His Asn Val Trp Ala Thr His Ala Cys Val Pro
35 40 45
Thr Asp Pro Asn Pro Gln Glu Val Val Leu Glu Asn Val Thr Glu Asn
50 55 60
Phe Asn Met Trp Lys Asn Asn Met Val Glu Gln Met His Glu Asp Ile
65 70 75 80
Ile Ser Leu Trp Asp Gln Ser Leu Lys Pro Cys Val Lys Leu Thr Pro
85 90 95
Leu Cys Val Thr Leu Asn Cys Asn Thr Ile Asn Ala Thr Lys Asp Met
100 105 110
Ile Gly Glu Leu Lys Asn Cys Ser Phe Asn Ile Thr Thr Ser Ile Arg
115 120 125
Asp Lys Trp Gln Lys Glu Tyr Ala Leu Phe Tyr Lys Leu Asp Val Val
130 135 140
Pro Ile Asp Asp Asn Gly Asn Asp Thr Gly Asn Gly Ser Tyr Arg Leu
145 150 155 160
Ile Ser Cys Asn Thr Ser Val Ile Thr Gln Ala Cys Pro Lys Thr Ser
165 170 175
Phe Glu Pro Ile Pro Ile His Tyr Cys Ala Pro Ala Gly Phe Ala Ile
180 185 190
Leu Lys Cys Asn Asn Lys Lys Phe Asn Gly Thr Gly Pro Arg Lys Asn
195 200 205
Val Ser Thr Val Gln Cys Thr His Gly Ile Arg Pro Val Val Ser Thr
210 215 220
Gln Leu Leu Leu Asn Gly Ser Leu Ala Glu Glu Glu Ile Val Leu Arg
225 230 235 240
Ser Glu Asn Phe Ser Asn Asn Ala Lys Thr Ile Ile Val Gln Leu Asn
245 250 255
Glu Ser Ile Val Ile Asn Cys Thr Arg Pro Asn Asn Asn Thr Arg Lys
260 265 270
Ser Ile His Ile Gly Pro Gly Arg Ala Phe Tyr Ala Ala Gly Glu Ile
275 280 285
Ile Gly Asp Ile Arg Thr Ala His Cys Asn Ile Ser Gly Thr Lys Trp
290 295 300
Asn Asn Thr Leu Lys Gln Ile Val Val Lys Leu Arg Glu Gln Phe Gly
305 310 315 320
Asn Lys Thr Met Val Phe Ser His Ser Ser Gly Gly Asp Pro Glu Ile
325 330 335
Val Arg His Ser Phe Asn Cys Gly Gly Glu Phe Phe His Cys Asn Thr
340 345 350
Thr Gln Leu Phe Asn Ser Ser Trp Pro Trp Asn Gly Thr Glu Gly Ser
355 360 365
Asn Asn Thr Glu Gly Asn Asp Thr Ile Thr Leu Pro Cys Arg Ile Lys
370 375 380
Gln Ile Ile Asn Met Trp Gln Glu Val Gly Lys Ala Met Tyr Ala Pro
385 390 395 400
Pro Ile Arg Gly Val Ile Lys Cys Ser Ser Asn Ile Thr Gly Leu Leu
405 410 415
Leu Thr Arg Asp Gly Gly Thr Asn Arg Thr Asp Asn Gly Ser Glu Val
420 425 430
Phe Arg Pro Gly Gly Gly Asp Met Arg Asp Asn Trp Ser Ser Glu Leu
435 440 445
Tyr Lys Asn Lys Val Val Arg Ile Glu Pro Leu Gly Val Ala Pro Thr
450 455 460
Lys Ala Lys Arg Arg Val Val Gln Arg Glu Lys Arg
465 470 475
<210> 15
<211> 1618
<212> DNA
<213> Human immunodeficiency virus type 1
<400> 15
agaaagagcg aagacagtgg caatgagagt gagggggatc atgaggaatt atcagtactt 60
atggaaatgg ggcaccatgc tcctggggat attgatgatc tgtaatgcta gtgaaaaatt 120
gtgggtcaca gtctattatg gggtacctgt gtggaaagag gcaaacacca ctctattttg 180
tgcatcagat gccaaagctt atgatacaga agtacataat gtttgggcca cacatgcctg 240
tgtacccaca gacccccgcc ctcaagaagt actattggga aatgtgacag aaaattttaa 300
catgtggaaa aataacatgg tagaacaaat gcatgaggat ataatcagtt tatgggatca 360
aagcctaaag ccatgtgtaa aattaacccc actctgtgtt actttaaatt gcactaactt 420
gaatgatact aatatcagta gtagtaatgt tagtacccat aatagtagtg gcataggaga 480
aatgaaaaat tgctctttca atgttaccac aagtataaga gataagatga agaaagaata 540
tgcacttttt tatagacttg atatagttcc aatagataat agtaacacca gttatatgtt 600
aataagttgt aatacctcag tcattacaca ggcctgtcca aaggtatcct ttgaaccaat 660
tcccatacat tattgtgccc cggctggttt tgcgattcta aagtgtaatg ataagaagtt 720
caatggaaca ggaccatgta agaatgtcag cacagtacaa tgtacacatg gaattaggcc 780
agtagtatca actcaactgc tgttaaatgg cagtttagca gaagaagaga tagtaattag 840
atctgaaaat ttcacagaca atactaaaac cataatagtg catctgaacg aatctataca 900
aattaattgt acaagaccca acaacaatac aagaaaaagc atacatatag gaccaggaag 960
agcattttat gcaacaggag aaataatagg agatataaga caagcacatt gtaaccttag 1020
tagagcaaaa tggaataaca cgttaaaaca gatagttaaa aaattaagag tacaatttga 1080
aaataaaaca atagtcttta atcaatcttc aggaggggac ccagaaattg taatgcacag 1140
ctttaattgt ggaggggaat ttttctactg taatacaaca gcactgttta atagtacttg 1200
gaatagtaat agtactgaat ggtcaaatga cactgaaagc aatgacacag tgattacact 1260
cccatgcaga ataaaacaaa tagtaaacat gtggcaggaa gttggaaaag caatgtatgc 1320
ccctcccatc aagggacaaa ttaagtggat atcaaatatt acagggatac tattaacaag 1380
agatggggga agagatgagg ttaatagcac gaacgagaac aagaccgaga tcttcagacc 1440
tgcaggagga aatatgaagg acaattggag aagtgaatta tataaatata aagtagtaaa 1500
aattgaacca ttaggaatag cacccactag ggcaagaaga agagtggtgc agagagaaaa 1560
aagagcagtg acactaggag ctttgttcct tgggttcttg ggagcagcag gaagcact 1618
<210> 16
<211> 487
<212> PRT
<213> Human immunodeficiency virus type 1
<400> 16
Asn Ala Ser Glu Lys Leu Trp Val Thr Val Tyr Tyr Gly Val Pro Val
1 5 10 15
Trp Lys Glu Ala Asn Thr Thr Leu Phe Cys Ala Ser Asp Ala Lys Ala
20 25 30
Tyr Asp Thr Glu Val His Asn Val Trp Ala Thr His Ala Cys Val Pro
35 40 45
Thr Asp Pro Arg Pro Gln Glu Val Leu Leu Gly Asn Val Thr Glu Asn
50 55 60
Phe Asn Met Trp Lys Asn Asn Met Val Glu Gln Met His Glu Asp Ile
65 70 75 80
Ile Ser Leu Trp Asp Gln Ser Leu Lys Pro Cys Val Lys Leu Thr Pro
85 90 95
Leu Cys Val Thr Leu Asn Cys Thr Asn Leu Asn Asp Thr Asn Ile Ser
100 105 110
Ser Ser Asn Val Ser Thr His Asn Ser Ser Gly Ile Gly Glu Met Lys
115 120 125
Asn Cys Ser Phe Asn Val Thr Thr Ser Ile Arg Asp Lys Met Lys Lys
130 135 140
Glu Tyr Ala Leu Phe Tyr Arg Leu Asp Ile Val Pro Ile Asp Asn Ser
145 150 155 160
Asn Thr Ser Tyr Met Leu Ile Ser Cys Asn Thr Ser Val Ile Thr Gln
165 170 175
Ala Cys Pro Lys Val Ser Phe Glu Pro Ile Pro Ile His Tyr Cys Ala
180 185 190
Pro Ala Gly Phe Ala Ile Leu Lys Cys Asn Asp Lys Lys Phe Asn Gly
195 200 205
Thr Gly Pro Cys Lys Asn Val Ser Thr Val Gln Cys Thr His Gly Ile
210 215 220
Arg Pro Val Val Ser Thr Gln Leu Leu Leu Asn Gly Ser Leu Ala Glu
225 230 235 240
Glu Glu Ile Val Ile Arg Ser Glu Asn Phe Thr Asp Asn Thr Lys Thr
245 250 255
Ile Ile Val His Leu Asn Glu Ser Ile Gln Ile Asn Cys Thr Arg Pro
260 265 270
Asn Asn Asn Thr Arg Lys Ser Ile His Ile Gly Pro Gly Arg Ala Phe
275 280 285
Tyr Ala Thr Gly Glu Ile Ile Gly Asp Ile Arg Gln Ala His Cys Asn
290 295 300
Leu Ser Arg Ala Lys Trp Asn Asn Thr Leu Lys Gln Ile Val Lys Lys
305 310 315 320
Leu Arg Val Gln Phe Glu Asn Lys Thr Ile Val Phe Asn Gln Ser Ser
325 330 335
Gly Gly Asp Pro Glu Ile Val Met His Ser Phe Asn Cys Gly Gly Glu
340 345 350
Phe Phe Tyr Cys Asn Thr Thr Ala Leu Phe Asn Ser Thr Trp Asn Ser
355 360 365
Asn Ser Thr Glu Trp Ser Asn Asp Thr Glu Ser Asn Asp Thr Val Ile
370 375 380
Thr Leu Pro Cys Arg Ile Lys Gln Ile Val Asn Met Trp Gln Glu Val
385 390 395 400
Gly Lys Ala Met Tyr Ala Pro Pro Ile Lys Gly Gln Ile Lys Trp Ile
405 410 415
Ser Asn Ile Thr Gly Ile Leu Leu Thr Arg Asp Gly Gly Arg Asp Glu
420 425 430
Val Asn Ser Thr Asn Glu Asn Lys Thr Glu Ile Phe Arg Pro Ala Gly
435 440 445
Gly Asn Met Lys Asp Asn Trp Arg Ser Glu Leu Tyr Lys Tyr Lys Val
450 455 460
Val Lys Ile Glu Pro Leu Gly Ile Ala Pro Thr Arg Ala Arg Arg Arg
465 470 475 480
Val Val Gln Arg Glu Lys Arg
485
<210> 17
<211> 1625
<212> DNA
<213> Human immunodeficiency virus type 1
<400> 17
agaaagagca gaagacagtg gcaatgagag tgaaggagat caagaagaat tgtcagcgct 60
tgtggagatg gggcatcatg ctccttggga tattgatgat ctgtagtgct acagaaaaat 120
tgtgggtcac agtctattat ggggtacctg tgtggaagga agcaaacacc actctatttt 180
gtgcatctga tgctaaagca tatgatacag aggtacataa tgtttgggca acacatgcct 240
gtgtacccac agaccccaac ccacaagaag taagattaaa aaatgtgaca gaaaatttta 300
acatgtggag gaataacatg gtagaacaga tgcaggagga tataatcagt ttgtgggatc 360
aaagcctaaa gccatgtgta acattaactc cactatgtgt tactttaaat tgcactgatt 420
attggggcaa tgttactggg accaatacta ctagtaaccc tactggtact ggtgtgggtg 480
gtaccactaa caatggcgcg gaagtgatga agtgctcttt taatgtcacc acaagtgtaa 540
gagataaggt acaaaaagaa tctgctcttt tttatagact tgatgtagta aaaatagatg 600
agaaaacaaa tacaaccaat tataggttga taagttgtaa cacctcagtc attaaacagg 660
cccgtccaaa ggtaaacttt gagccaattc ccatacatta ttgtgccccg gctggttttg 720
cgattctaaa gtgtaatgat aagaagttca atggaacagg atcatgtaaa aatgtcagca 780
cagtacaatg tacacatgga attaagccag tagtatccac tcacttgctg ttaaatggca 840
gtctagcaga agatgagata gtaattagat ctgaaaattt cacgaacaat gctaaaacca 900
taatagtaca gctgaataat tatgtaaaaa ttaattgtat aagacccaat aataatacaa 960
gaaaaagtat atcactcgga ccaggaagag cattttatac aacaggagac ataataggaa 1020
atataagaca agcacattgc aaccttagtg gtacagaatg gaataacact ttaaaacagg 1080
tagctaacaa attaagagaa caatttaaca aaacaataat aaaatttaag caaccccccc 1140
cgggagggga cctagaaatc acaatgctca cttttaattg tggaggagaa tttttttact 1200
gtaattcatc agcactgttt aatagtactt tgacttggga tagtaaggca tgggcaaata 1260
cacttgaaga aaatatcaca ctcccatgca gaataaaaca aattgtaaac aagtggcagg 1320
aagtaggaaa agcaatatat gcccctccca tcagtggaca gattaattgt acatcaaata 1380
ttacagggat actattaaca agagatggtg gtaataacaa cgacactaac aacactgagg 1440
tcttcagacc tggaggagga gatatgaggg acaattggag aagtgagtta tataaatata 1500
aagtagtaaa aattgaacca ttaggaatag cacccaccag ggcaaagaga agagtggtgc 1560
agagagaaaa aagagcagca ataggagcta tgttccttgg gttcttggga gcagcaggaa 1620
gcact 1625
<210> 18
<211> 490
<212> PRT
<213> Human immunodeficiency virus type 1
<400> 18
Ser Ala Thr Glu Lys Leu Trp Val Thr Val Tyr Tyr Gly Val Pro Val
1 5 10 15
Trp Lys Glu Ala Asn Thr Thr Leu Phe Cys Ala Ser Asp Ala Lys Ala
20 25 30
Tyr Asp Thr Glu Val His Asn Val Trp Ala Thr His Ala Cys Val Pro
35 40 45
Thr Asp Pro Asn Pro Gln Glu Val Arg Leu Lys Asn Val Thr Glu Asn
50 55 60
Phe Asn Met Trp Arg Asn Asn Met Val Glu Gln Met Gln Glu Asp Ile
65 70 75 80
Ile Ser Leu Trp Asp Gln Ser Leu Lys Pro Cys Val Thr Leu Thr Pro
85 90 95
Leu Cys Val Thr Leu Asn Cys Thr Asp Tyr Trp Gly Asn Val Thr Gly
100 105 110
Thr Asn Thr Thr Ser Asn Pro Thr Gly Thr Gly Val Gly Gly Thr Thr
115 120 125
Asn Asn Gly Ala Glu Val Met Lys Cys Ser Phe Asn Val Thr Thr Ser
130 135 140
Val Arg Asp Lys Val Gln Lys Glu Ser Ala Leu Phe Tyr Arg Leu Asp
145 150 155 160
Val Val Lys Ile Asp Glu Lys Thr Asn Thr Thr Asn Tyr Arg Leu Ile
165 170 175
Ser Cys Asn Thr Ser Val Ile Lys Gln Ala Arg Pro Lys Val Asn Phe
180 185 190
Glu Pro Ile Pro Ile His Tyr Cys Ala Pro Ala Gly Phe Ala Ile Leu
195 200 205
Lys Cys Asn Asp Lys Lys Phe Asn Gly Thr Gly Ser Cys Lys Asn Val
210 215 220
Ser Thr Val Gln Cys Thr His Gly Ile Lys Pro Val Val Ser Thr His
225 230 235 240
Leu Leu Leu Asn Gly Ser Leu Ala Glu Asp Glu Ile Val Ile Arg Ser
245 250 255
Glu Asn Phe Thr Asn Asn Ala Lys Thr Ile Ile Val Gln Leu Asn Asn
260 265 270
Tyr Val Lys Ile Asn Cys Ile Arg Pro Asn Asn Asn Thr Arg Lys Ser
275 280 285
Ile Ser Leu Gly Pro Gly Arg Ala Phe Tyr Thr Thr Gly Asp Ile Ile
290 295 300
Gly Asn Ile Arg Gln Ala His Cys Asn Leu Ser Gly Thr Glu Trp Asn
305 310 315 320
Asn Thr Leu Lys Gln Val Ala Asn Lys Leu Arg Glu Gln Phe Asn Lys
325 330 335
Thr Ile Ile Lys Phe Lys Gln Pro Pro Pro Gly Gly Asp Leu Glu Ile
340 345 350
Thr Met Leu Thr Phe Asn Cys Gly Gly Glu Phe Phe Tyr Cys Asn Ser
355 360 365
Ser Ala Leu Phe Asn Ser Thr Leu Thr Trp Asp Ser Lys Ala Trp Ala
370 375 380
Asn Thr Leu Glu Glu Asn Ile Thr Leu Pro Cys Arg Ile Lys Gln Ile
385 390 395 400
Val Asn Lys Trp Gln Glu Val Gly Lys Ala Ile Tyr Ala Pro Pro Ile
405 410 415
Ser Gly Gln Ile Asn Cys Thr Ser Asn Ile Thr Gly Ile Leu Leu Thr
420 425 430
Arg Asp Gly Gly Asn Asn Asn Asp Thr Asn Asn Thr Glu Val Phe Arg
435 440 445
Pro Gly Gly Gly Asp Met Arg Asp Asn Trp Arg Ser Glu Leu Tyr Lys
450 455 460
Tyr Lys Val Val Lys Ile Glu Pro Leu Gly Ile Ala Pro Thr Arg Ala
465 470 475 480
Lys Arg Arg Val Val Gln Arg Glu Lys Arg
485 490
<210> 19
<211> 1643
<212> DNA
<213> Human immunodeficiency virus type 1
<400> 19
agaaagagca gaagacagtg gcaatgagag tgatggagat caggaagagt tatcagaact 60
tatggagagg gggcaccttg ctccttggga tgttaatgat gatctgtagt gctgcagaag 120
aatcgtgggt cacagtatat tatggggtac ctgtgtggaa agaagcaacc accactctat 180
tttgtgcatc agatgctaaa ggctatgata cagaaagaca taatgtttgg gccacacatg 240
cctgtgtacc cacagacccc aacccacaag aaattgaatt ggtaaatgtg acagaatatt 300
ttaacatggg aaaaaataac atggtagaac agatgcatga ggatataatc agtttatggg 360
atgaaagcct aaagccatgt gtaaaattaa ccccactctg tgttactcta aattgcacta 420
atttgaatat tactaatacc actggtatta ctaatagtag cctggaagaa atgaggagaa 480
taatgacaaa ctgttctttc aaggtcacca caaatataag agataaggtg cagaagcaat 540
atgcactgtt gtataaactt gatgtagtac aaatagatga tgagagtacc acaggtaata 600
ggagtaacag cgcctacagg ttgataagtt gtaacacctc agtcattaca caggcccgtc 660
caaaggtatc ctttgagcca attcccatac acttttgtgc cccggctggt tttgcgattc 720
taaaatgtaa ggataagaag ttcaatggaa caggactatg taaaaatgtc agcacagtac 780
aatgtacaca tggaattagg ccagtagtat caactcagct gctgttaaat ggcagtctag 840
cagaagaaga ggtagtaatt agatctgtaa atttcacaaa caatgctaaa actataatag 900
tacagctgaa caaatctata gaaattaatt gtacaagacc caacaacaat acaagaagag 960
gtataaatat aggacccggg agagcatttt acacaataaa ggacataaca ggagatataa 1020
gacaagcaca ttgtaacatt agtgcatcag actggaataa tactgtaaca caggtagttg 1080
caaaattaaa agagcaattt gggaataaaa caatagtctt taatcaatcc tcaggaggag 1140
acccagaaat tataatgcac acttttaatt gtggagggga atttttctac tgtaagacaa 1200
cacaactgtt taatagtact tggcctaata atggtacttg gcctaatagt aattggactg 1260
ataataatag aacttggaac ggtgctaaag gaactatcac actcccatgc agaataaaac 1320
aaattgtaaa catgtggcag gaagtaggaa aagcaatgta tgcccctccc atcgaaggga 1380
aaataaaatg tacatcaaat cttacaggat tgctattaac aagagatggt ggtaatgtga 1440
atggcaccac catcgagacc ttcagacctg gaggaggaga tatgagggac aattggagaa 1500
gtgaattata taaatataaa gtagtacaaa ttgaaccatt aggactggca cctaccaagg 1560
caaagagaag agtggtgcag agagaaaaaa ggggagtaat aggagctatg ttccttgggt 1620
tcttgggagc agcaggaagc act 1643
<210> 20
<211> 495
<212> PRT
<213> Human immunodeficiency virus type 1
<400> 20
Ser Ala Ala Glu Glu Ser Trp Val Thr Val Tyr Tyr Gly Val Pro Val
1 5 10 15
Trp Lys Glu Ala Thr Thr Thr Leu Phe Cys Ala Ser Asp Ala Lys Gly
20 25 30
Tyr Asp Thr Glu Arg His Asn Val Trp Ala Thr His Ala Cys Val Pro
35 40 45
Thr Asp Pro Asn Pro Gln Glu Ile Glu Leu Val Asn Val Thr Glu Tyr
50 55 60
Phe Asn Met Gly Lys Asn Asn Met Val Glu Gln Met His Glu Asp Ile
65 70 75 80
Ile Ser Leu Trp Asp Glu Ser Leu Lys Pro Cys Val Lys Leu Thr Pro
85 90 95
Leu Cys Val Thr Leu Asn Cys Thr Asn Leu Asn Ile Thr Asn Thr Thr
100 105 110
Gly Ile Thr Asn Ser Ser Leu Glu Glu Met Arg Arg Ile Met Thr Asn
115 120 125
Cys Ser Phe Lys Val Thr Thr Asn Ile Arg Asp Lys Val Gln Lys Gln
130 135 140
Tyr Ala Leu Leu Tyr Lys Leu Asp Val Val Gln Ile Asp Asp Glu Ser
145 150 155 160
Thr Thr Gly Asn Arg Ser Asn Ser Ala Tyr Arg Leu Ile Ser Cys Asn
165 170 175
Thr Ser Val Ile Thr Gln Ala Arg Pro Lys Val Ser Phe Glu Pro Ile
180 185 190
Pro Ile His Phe Cys Ala Pro Ala Gly Phe Ala Ile Leu Lys Cys Lys
195 200 205
Asp Lys Lys Phe Asn Gly Thr Gly Leu Cys Lys Asn Val Ser Thr Val
210 215 220
Gln Cys Thr His Gly Ile Arg Pro Val Val Ser Thr Gln Leu Leu Leu
225 230 235 240
Asn Gly Ser Leu Ala Glu Glu Glu Val Val Ile Arg Ser Val Asn Phe
245 250 255
Thr Asn Asn Ala Lys Thr Ile Ile Val Gln Leu Asn Lys Ser Ile Glu
260 265 270
Ile Asn Cys Thr Arg Pro Asn Asn Asn Thr Arg Arg Gly Ile Asn Ile
275 280 285
Gly Pro Gly Arg Ala Phe Tyr Thr Ile Lys Asp Ile Thr Gly Asp Ile
290 295 300
Arg Gln Ala His Cys Asn Ile Ser Ala Ser Asp Trp Asn Asn Thr Val
305 310 315 320
Thr Gln Val Val Ala Lys Leu Lys Glu Gln Phe Gly Asn Lys Thr Ile
325 330 335
Val Phe Asn Gln Ser Ser Gly Gly Asp Pro Glu Ile Ile Met His Thr
340 345 350
Phe Asn Cys Gly Gly Glu Phe Phe Tyr Cys Lys Thr Thr Gln Leu Phe
355 360 365
Asn Ser Thr Trp Pro Asn Asn Gly Thr Trp Pro Asn Ser Asn Trp Thr
370 375 380
Asp Asn Asn Arg Thr Trp Asn Gly Ala Lys Gly Thr Ile Thr Leu Pro
385 390 395 400
Cys Arg Ile Lys Gln Ile Val Asn Met Trp Gln Glu Val Gly Lys Ala
405 410 415
Met Tyr Ala Pro Pro Ile Glu Gly Lys Ile Lys Cys Thr Ser Asn Leu
420 425 430
Thr Gly Leu Leu Leu Thr Arg Asp Gly Gly Asn Val Asn Gly Thr Thr
435 440 445
Ile Glu Thr Phe Arg Pro Gly Gly Gly Asp Met Arg Asp Asn Trp Arg
450 455 460
Ser Glu Leu Tyr Lys Tyr Lys Val Val Gln Ile Glu Pro Leu Gly Leu
465 470 475 480
Ala Pro Thr Lys Ala Lys Arg Arg Val Val Gln Arg Glu Lys Arg
485 490 495
<210> 21
<211> 1626
<212> DNA
<213> Human immunodeficiency virus type 1
<400> 21
agaaagagca gaagacagtg gcaatgagag tgatggagat caggaggaat tatcagcgct 60
cgtggagatg gggcaccatg ctccttggga tgttgatgat ttatagtgct gcaggagagt 120
tatgggtcac agtttattat ggggtaccgg tgtggaaaga agcaaccact actttattct 180
gtgcatcaga tgctaaagca tatgacacag aggtacataa tgtttgggca acacatgcct 240
gtgtacccac agaccctaat ccacaagaag tattattgga aaatgtgaca gaaaatttta 300
acatgtggaa aaataacatg gtagaacaga tgcatgagga tataatcagt ctatgggatc 360
aaagcctaaa gccacgtgta aaattaaccc cactctgtgt tactttaaac tgtactaatt 420
tgagaaatgt tactaatttg aaaaatgtta ctaataacag taatattagt ggtactaata 480
acaatactag tagtgggggg ctgaagggag gagaaatgaa aaattgctct ttctatatca 540
ccacacacag aaaggataag gtgaagaaag aatatgcact tttttataac cttgatatag 600
tatcaacaga tgatgataat acaagctata tattgagaag ttgtaacacc tcagtcatta 660
cccaggcctg tccaaaggta acctttgaac caattcccat acattattgt accccagctg 720
gttttgcgat tctgaagtgt aacgataaga agttcaatgg aacaggacca tgtagaaatg 780
tcagtacagt acaatgtaca catggaatca agccagtagt gtcaacccaa ctgttgttaa 840
atggcagtct agcagaagaa gaggtagtaa ttagatctga aaatttcacg gacaatgtta 900
aaaccataat agtacagctg aatgaatctg taataattaa ttgtacaaga cccagcaaca 960
atacaagaaa aagtatacgt tttggaccag gggcggcatt ttatacaaca ggagacataa 1020
taggagatat aagacaagca cattgtaaca tcagtagagc agaatggaat aacactttaa 1080
aacaaatagt taaaaaatta caagaacaat ttgtgaataa aacaatagtc tttaatcaat 1140
ctgcaggagg ggacccagaa attgtaaggc acagtgtaaa ttgtggaggg gaatttttct 1200
actgcgatac aacacaactg tttaatagta cttggaatag tactggagag tcaaataaca 1260
ctcaagaaaa tgacctaatc acactcccat gcagaataaa acaaattata aacagatggc 1320
aggaaatagg aaaagcaatg tatgcccctc ccatccaagg acaaattagc tgtacatcaa 1380
atattacagg gctgctacta acaagagatg gtggtaataa taataacagc acagagacct 1440
tcagacctgg aggaggaaat atgaaggaca attggagaag tgaattatat aaatataaag 1500
tagtaaaaat tcagccatta ggggtagcac ccaccaaggc aaagagaaga gtggtgcaga 1560
gggaaaagag cagtgggagc actaggagct atgttccttg ggttcttggg agcagcagga 1620
agcact 1626
<210> 22
<211> 489
<212> PRT
<213> Human immunodeficiency virus type 1
<400> 22
Ser Ala Ala Gly Glu Leu Trp Val Thr Val Tyr Tyr Gly Val Pro Val
1 5 10 15
Trp Lys Glu Ala Thr Thr Thr Leu Phe Cys Ala Ser Asp Ala Lys Ala
20 25 30
Tyr Asp Thr Glu Val His Asn Val Trp Ala Thr His Ala Cys Val Pro
35 40 45
Thr Asp Pro Asn Pro Gln Glu Val Leu Leu Glu Asn Val Thr Glu Asn
50 55 60
Phe Asn Met Trp Lys Asn Asn Met Val Glu Gln Met His Glu Asp Ile
65 70 75 80
Ile Ser Leu Trp Asp Gln Ser Leu Lys Pro Arg Val Lys Leu Thr Pro
85 90 95
Leu Cys Val Thr Leu Asn Cys Thr Asn Leu Arg Asn Val Thr Asn Leu
100 105 110
Lys Asn Val Thr Asn Asn Ser Asn Ile Ser Gly Thr Asn Asn Asn Thr
115 120 125
Ser Ser Gly Gly Leu Lys Gly Gly Glu Met Lys Asn Cys Ser Phe Tyr
130 135 140
Ile Thr Thr His Arg Lys Asp Lys Val Lys Lys Glu Tyr Ala Leu Phe
145 150 155 160
Tyr Asn Leu Asp Ile Val Ser Thr Asp Asp Asp Asn Thr Ser Tyr Ile
165 170 175
Leu Arg Ser Cys Asn Thr Ser Val Ile Thr Gln Ala Cys Pro Lys Val
180 185 190
Thr Phe Glu Pro Ile Pro Ile His Tyr Cys Thr Pro Ala Gly Phe Ala
195 200 205
Ile Leu Lys Cys Asn Asp Lys Lys Phe Asn Gly Thr Gly Pro Cys Arg
210 215 220
Asn Val Ser Thr Val Gln Cys Thr His Gly Ile Lys Pro Val Val Ser
225 230 235 240
Thr Gln Leu Leu Leu Asn Gly Ser Leu Ala Glu Glu Glu Val Val Ile
245 250 255
Arg Ser Glu Asn Phe Thr Asp Asn Val Lys Thr Ile Ile Val Gln Leu
260 265 270
Asn Glu Ser Val Ile Ile Asn Cys Thr Arg Pro Ser Asn Asn Thr Arg
275 280 285
Lys Ser Ile Arg Phe Gly Pro Gly Ala Ala Phe Tyr Thr Thr Gly Asp
290 295 300
Ile Ile Gly Asp Ile Arg Gln Ala His Cys Asn Ile Ser Arg Ala Glu
305 310 315 320
Trp Asn Asn Thr Leu Lys Gln Ile Val Lys Lys Leu Gln Glu Gln Phe
325 330 335
Val Asn Lys Thr Ile Val Phe Asn Gln Ser Ala Gly Gly Asp Pro Glu
340 345 350
Ile Val Arg His Ser Val Asn Cys Gly Gly Glu Phe Phe Tyr Cys Asp
355 360 365
Thr Thr Gln Leu Phe Asn Ser Thr Trp Asn Ser Thr Gly Glu Ser Asn
370 375 380
Asn Thr Gln Glu Asn Asp Leu Ile Thr Leu Pro Cys Arg Ile Lys Gln
385 390 395 400
Ile Ile Asn Arg Trp Gln Glu Ile Gly Lys Ala Met Tyr Ala Pro Pro
405 410 415
Ile Gln Gly Gln Ile Ser Cys Thr Ser Asn Ile Thr Gly Leu Leu Leu
420 425 430
Thr Arg Asp Gly Gly Asn Asn Asn Asn Ser Thr Glu Thr Phe Arg Pro
435 440 445
Gly Gly Gly Asn Met Lys Asp Asn Trp Arg Ser Glu Leu Tyr Lys Tyr
450 455 460
Lys Val Val Lys Ile Gln Pro Leu Gly Val Ala Pro Thr Lys Ala Lys
465 470 475 480
Arg Arg Val Val Gln Arg Glu Lys Ser
485
<210> 23
<211> 1562
<212> DNA
<213> Human immunodeficiency virus type 1
<400> 23
agaaagagca gaagacagtg gcaatgagag tgacggggat gaggaacaat tatccgcact 60
tatggaaaga ggtcaccttg ctccttggaa tattgatgat atgtagtgct acagaaaatt 120
tgtgggtcac agtctattat ggggtacctg tgtggaaaga agcaaccacc actctattct 180
gtgcatcgga tgctaaggca tatgatacag aggcacataa tgtttgggcc acacatgcct 240
gtgtacccac agaccccaac ccacaagaaa tgagattgga aaatgtgaca gaaaatttta 300
acatgtggaa aaataacatg gtagaacaga tgcaggatga tataatcagt ttatgggatc 360
aaagcctaaa gccatgtgta aaattaaccc cactctgtgt tactttaaat tgcactaata 420
ccacaaatgc taatagtacc aataataata actgggacat gaaaaactgc tctttcaatg 480
tcacctcagg cataagagat aaggtgcgaa aagaacatgc actcttttat gcacttgatg 540
tagtaccaat agataatgag actaactata ggttgataag ttgtaacacc tcagtcatca 600
cacaggcctg tccaaaggta tcctttgagc caattcctat acattattat gccccggctg 660
gttttgcgat tctaaaatgt agggataaaa agttcaatgg aacaggacca tgtaaagatg 720
tcagcacagt acaatgtaca catggaatta agccagtagt atcaactcaa ctactgttaa 780
atggcagtct agcagaagaa gaggtagtaa tcagatctga aaacttcacg aacaatgcta 840
aaaccatatt agtacaactg aatgaatctg tagtaattaa ttgtacaaga cccaacaaca 900
atacaagaaa aagtataaat ataggaccag ggagagcatt ctatgcaaca ggagaaataa 960
taggagatat aagacaagca cattgtaacc ttagtaaggc acaatggaac aacactttaa 1020
aaaaggtagt tgtaaaatta agagaacaat ttccgaataa aacgatagtc tttactcatt 1080
cctcaggagg ggacccagaa attgtaatgc acagttttaa ttgtggagga gaatttttct 1140
actgtaattc aacaccactg tttaatagta cttggaagtt gaatggtact atggaatcaa 1200
atgacactga aggaaatctc acactccaat gcagaataaa acaaatcatg aacaagtggc 1260
aggaagtagg aaaggcaatg tatgcccctc ccatccaagg acagattaga tgttcatcaa 1320
atattacagg gctgttatta gtaagagatg gtggggtcaa cagcgccaac gagaccttca 1380
gaccaggagg aggagatatg agggacaatt ggagaagtga attatataaa tataaagtag 1440
taaaaattga accattagga atagcaccca ccaaggcaaa gagaagagtg gtgcagagag 1500
aaaaaagagc agtgggaata ggagctttgt tccttgggtt cttgggagca gcaggaagca 1560
ct 1562
<210> 24
<211> 468
<212> PRT
<213> Human immunodeficiency virus type 1
<400> 24
Ser Ala Thr Glu Asn Leu Trp Val Thr Val Tyr Tyr Gly Val Pro Val
1 5 10 15
Trp Lys Glu Ala Thr Thr Thr Leu Phe Cys Ala Ser Asp Ala Lys Ala
20 25 30
Tyr Asp Thr Glu Ala His Asn Val Trp Ala Thr His Ala Cys Val Pro
35 40 45
Thr Asp Pro Asn Pro Gln Glu Met Arg Leu Glu Asn Val Thr Glu Asn
50 55 60
Phe Asn Met Trp Lys Asn Asn Met Val Glu Gln Met Gln Asp Asp Ile
65 70 75 80
Ile Ser Leu Trp Asp Gln Ser Leu Lys Pro Cys Val Lys Leu Thr Pro
85 90 95
Leu Cys Val Thr Leu Asn Cys Thr Asn Thr Thr Asn Ala Asn Ser Thr
100 105 110
Asn Asn Asn Asn Trp Asp Met Lys Asn Cys Ser Phe Asn Val Thr Ser
115 120 125
Gly Ile Arg Asp Lys Val Arg Lys Glu His Ala Leu Phe Tyr Ala Leu
130 135 140
Asp Val Val Pro Ile Asp Asn Glu Thr Asn Tyr Arg Leu Ile Ser Cys
145 150 155 160
Asn Thr Ser Val Ile Thr Gln Ala Cys Pro Lys Val Ser Phe Glu Pro
165 170 175
Ile Pro Ile His Tyr Tyr Ala Pro Ala Gly Phe Ala Ile Leu Lys Cys
180 185 190
Arg Asp Lys Lys Phe Asn Gly Thr Gly Pro Cys Lys Asp Val Ser Thr
195 200 205
Val Gln Cys Thr His Gly Ile Lys Pro Val Val Ser Thr Gln Leu Leu
210 215 220
Leu Asn Gly Ser Leu Ala Glu Glu Glu Val Val Ile Arg Ser Glu Asn
225 230 235 240
Phe Thr Asn Asn Ala Lys Thr Ile Leu Val Gln Leu Asn Glu Ser Val
245 250 255
Val Ile Asn Cys Thr Arg Pro Asn Asn Asn Thr Arg Lys Ser Ile Asn
260 265 270
Ile Gly Pro Gly Arg Ala Phe Tyr Ala Thr Gly Glu Ile Ile Gly Asp
275 280 285
Ile Arg Gln Ala His Cys Asn Leu Ser Lys Ala Gln Trp Asn Asn Thr
290 295 300
Leu Lys Lys Val Val Val Lys Leu Arg Glu Gln Phe Pro Asn Lys Thr
305 310 315 320
Ile Val Phe Thr His Ser Ser Gly Gly Asp Pro Glu Ile Val Met His
325 330 335
Ser Phe Asn Cys Gly Gly Glu Phe Phe Tyr Cys Asn Ser Thr Pro Leu
340 345 350
Phe Asn Ser Thr Trp Lys Leu Asn Gly Thr Met Glu Ser Asn Asp Thr
355 360 365
Glu Gly Asn Leu Thr Leu Gln Cys Arg Ile Lys Gln Ile Met Asn Lys
370 375 380
Trp Gln Glu Val Gly Lys Ala Met Tyr Ala Pro Pro Ile Gln Gly Gln
385 390 395 400
Ile Arg Cys Ser Ser Asn Ile Thr Gly Leu Leu Leu Val Arg Asp Gly
405 410 415
Gly Val Asn Ser Ala Asn Glu Thr Phe Arg Pro Gly Gly Gly Asp Met
420 425 430
Arg Asp Asn Trp Arg Ser Glu Leu Tyr Lys Tyr Lys Val Val Lys Ile
435 440 445
Glu Pro Leu Gly Ile Ala Pro Thr Lys Ala Lys Arg Arg Val Val Gln
450 455 460
Arg Glu Lys Arg
465
<210> 25
<211> 1643
<212> DNA
<213> Human immunodeficiency virus type 1
<400> 25
agaaagagca gaagacagtg gcaatgagag cgaaggggat caggaggaat tggcagcgct 60
tgtgttggag atggggcacg atgctccttg gaatgttaat gatctgtagt gctacagaac 120
cattgtgggt aacagtctat tatggggtac ctgtgtggaa agaagcaacc accactctat 180
ttcgtgcatc agatgctaaa gcatatggta cagaggtaca taatgtttgg gccacgcatg 240
cctgtgtacc cacagacccc aacccacaag aagtagtatt ggaaaatgta acagaaaatt 300
ttaatgcgtg ggaaaataac atggtggaac aaatgcatga ggatataatc agtttatggg 360
atcaaagtct aaagccatgt gtaaagttaa ccccactctg tgttacttta aaatgcactg 420
ataatttggg gaatgatact aaaaccagta ataagagctg ggaaaagatg gagccaggag 480
aaataaaaaa ctgctccttc aacatcacca caagcatagg agataagacg caggaaacat 540
atgcattttt ttataaactt gatgtagtac caatagataa taagactaca atagataata 600
atactgcaag aaactatagc gactataggt tgataagttg taacacctca gtcattacac 660
aggcctgtcc aaaggtatct tttgaaccaa ttcccataca ttattgtgcc ccggctggtt 720
ttgcgattct aaagtgtaac aataagacat tcatgggaaa aggaccatgt acaaatgtca 780
gcacagtaca atgtacacat ggaattaagc cagtagtatc aactcaactg ctgttaaatg 840
gcagtctggc agaagaagag ataataatta gatctgaaaa tttcacggac aatgctaaaa 900
ccttaataat acatctgaac cactctgtag aaattaagtg tataagaccc aacaacaata 960
caagcgaagg tatacatata ggaccaggga gagcgtttta tccaacaaga ataataggag 1020
atataagaaa agcacattgt aacattaatg aaacagcatg gaagacaact ttagcacaga 1080
tagttacaaa attaagagaa caatttggga ataaaacaat agtctttagc caatcctcag 1140
gaggggaccc agaaattgta atgcacagtt ttaattgtgg aggggaattt ttctactgtg 1200
atacaacaaa actgtttaat agtacttgga atgttaatga tacttggaat ggtgctggag 1260
ggtcaaacag cactgaaaga aacaccacca tcatactccc atgcaaaata aaacaaatta 1320
taaacttgtg gcaggaggta ggaaaagcaa tgtatgcccc tcccatcaaa ggactaatta 1380
gatgttcatc aaatattaca gggctgctat taacaagaga tggtggtaat aacaatgaca 1440
caaacgggac agagatcttc agacctgggg gaggagatat gagggacaat tggagaagtg 1500
aattatataa atataaagta gtgaaaattg aaccattagg agtagcaccc actaaggcaa 1560
agagaagagt ggtgcagaga gaaagaagag cagtgggaat aggagctttg ttccttgggt 1620
tcttgggagc agcaggaagc act 1643
<210> 26
<211> 494
<212> PRT
<213> Human immunodeficiency virus type 1
<400> 26
Ser Ala Thr Glu Pro Leu Trp Val Thr Val Tyr Tyr Gly Val Pro Val
1 5 10 15
Trp Lys Glu Ala Thr Thr Thr Leu Phe Arg Ala Ser Asp Ala Lys Ala
20 25 30
Tyr Gly Thr Glu Val His Asn Val Trp Ala Thr His Ala Cys Val Pro
35 40 45
Thr Asp Pro Asn Pro Gln Glu Val Val Leu Glu Asn Val Thr Glu Asn
50 55 60
Phe Asn Ala Trp Glu Asn Asn Met Val Glu Gln Met His Glu Asp Ile
65 70 75 80
Ile Ser Leu Trp Asp Gln Ser Leu Lys Pro Cys Val Lys Leu Thr Pro
85 90 95
Leu Cys Val Thr Leu Lys Cys Thr Asp Asn Leu Gly Asn Asp Thr Lys
100 105 110
Thr Ser Asn Lys Ser Trp Glu Lys Met Glu Pro Gly Glu Ile Lys Asn
115 120 125
Cys Ser Phe Asn Ile Thr Thr Ser Ile Gly Asp Lys Thr Gln Glu Thr
130 135 140
Tyr Ala Phe Phe Tyr Lys Leu Asp Val Val Pro Ile Asp Asn Lys Thr
145 150 155 160
Thr Ile Asp Asn Asn Thr Ala Arg Asn Tyr Ser Asp Tyr Arg Leu Ile
165 170 175
Ser Cys Asn Thr Ser Val Ile Thr Gln Ala Cys Pro Lys Val Ser Phe
180 185 190
Glu Pro Ile Pro Ile His Tyr Cys Ala Pro Ala Gly Phe Ala Ile Leu
195 200 205
Lys Cys Asn Asn Lys Thr Phe Met Gly Lys Gly Pro Cys Thr Asn Val
210 215 220
Ser Thr Val Gln Cys Thr His Gly Ile Lys Pro Val Val Ser Thr Gln
225 230 235 240
Leu Leu Leu Asn Gly Ser Leu Ala Glu Glu Glu Ile Ile Ile Arg Ser
245 250 255
Glu Asn Phe Thr Asp Asn Ala Lys Thr Leu Ile Ile His Leu Asn His
260 265 270
Ser Val Glu Ile Lys Cys Ile Arg Pro Asn Asn Asn Thr Ser Glu Gly
275 280 285
Ile His Ile Gly Pro Gly Arg Ala Phe Tyr Pro Thr Arg Ile Ile Gly
290 295 300
Asp Ile Arg Lys Ala His Cys Asn Ile Asn Glu Thr Ala Trp Lys Thr
305 310 315 320
Thr Leu Ala Gln Ile Val Thr Lys Leu Arg Glu Gln Phe Gly Asn Lys
325 330 335
Thr Ile Val Phe Ser Gln Ser Ser Gly Gly Asp Pro Glu Ile Val Met
340 345 350
His Ser Phe Asn Cys Gly Gly Glu Phe Phe Tyr Cys Asp Thr Thr Lys
355 360 365
Leu Phe Asn Ser Thr Trp Asn Val Asn Asp Thr Trp Asn Gly Ala Gly
370 375 380
Gly Ser Asn Ser Thr Glu Arg Asn Thr Thr Ile Ile Leu Pro Cys Lys
385 390 395 400
Ile Lys Gln Ile Ile Asn Leu Trp Gln Glu Val Gly Lys Ala Met Tyr
405 410 415
Ala Pro Pro Ile Lys Gly Leu Ile Arg Cys Ser Ser Asn Ile Thr Gly
420 425 430
Leu Leu Leu Thr Arg Asp Gly Gly Asn Asn Asn Asp Thr Asn Gly Thr
435 440 445
Glu Ile Phe Arg Pro Gly Gly Gly Asp Met Arg Asp Asn Trp Arg Ser
450 455 460
Glu Leu Tyr Lys Tyr Lys Val Val Lys Ile Glu Pro Leu Gly Val Ala
465 470 475 480
Pro Thr Lys Ala Lys Arg Arg Val Val Gln Arg Glu Arg Arg
485 490
<210> 27
<211> 1613
<212> DNA
<213> Human immunodeficiency virus type 1
<400> 27
agaaagagca gaagacagtg gcaatgagag tgaaggggat caggaagaat tgtcagctct 60
tgtggaaatg gggcaccatg ctccttggga tgttgatgat ctgtagtgct gcagaacaac 120
tgtgggtcac agtctattat ggggtacctg tgtggaaaga tgcaaccacc actttatttt 180
gtgcatcaga tgctaaagca tacgacaaag aggcacataa tgtttgggcc acacatgcct 240
gtgtacccac agaccctaac ccacgagaaa taaaattgga aaatgtgaca gaaaatttta 300
acatgtggaa aaatgacatg gcagaccaga tgcatgagga tataatcagt ttatgggatc 360
aaagcctaaa gccatgtgta gaattaaccc cactctgtgt tactttaaat tgcactaata 420
ttagtttgaa tagtactaac aatgatacta ttaacagtag taatagtact gaaggaataa 480
atatgaggga agaaatgaaa aactgctctt tcaataccac cacaagtata ggagataaga 540
ataagagaga atatgcactt ttttataaac ttgatgtagt accaatagat aataagacaa 600
gctatacgtt gataaattgt aacacctcag tcattaaaca ggcctgtcca aaggtaacct 660
ttgaaccaat tcccatacat tattgtgccc cggctggttt tgcgattcta aagtgtctca 720
ataagacgtt cgatggaaat ggaacatgta caaatgtcag cacagtacag tgtacacatg 780
gcattagacc agtagtgtca acccaactac tgttaaatgg cagtctagca gaagaagagg 840
tagtaattag atatgagaat gtccaggaca atactaaaac cataatagta cagctgaacg 900
aaactgtaaa aattaattgt acaagaccca acaacaatac aagaaaaggt atacatgtgg 960
gatgggggag accaatttat gcaacaggag aaataatagg agatataaga caagcacatc 1020
gtaatctaag taaaaaagac tggggagaca ctttaaagaa gatagctata aaactacaag 1080
aacaatttaa tacaacaata atctttgagc aatcctcagg aggggaccca gaaattacaa 1140
tgcacagtct taattgtgga ggggaatttt tctactgtaa tacatcaaag ctgtttaatg 1200
gcacttggtc taatggtact tggactagtg gtatttggaa taatactgga gagtcagata 1260
gcacaatcac actcccatgc agaataaaac aaattataaa caggtggcag ggagtaggac 1320
aagcaatgta taaccctccc atcaacggac taattagctg ttcatcaaat attacaggac 1380
tgatattaac aagagatgga ggtaacaaca ggtccaacga gaccttcaga ccaagtggag 1440
gaaacatgag ggacaattgg agaagtgaat tatataaata tagagtagta aaaattgaac 1500
cattaggagt agcacccacc aaggcaaaga gaagagtggt gcagagagaa aaaagagcag 1560
tggggatgat aggagctgtg ttccttgggt tcttgggagc agcaggaagc act 1613
<210> 28
<211> 484
<212> PRT
<213> Human immunodeficiency virus type 1
<400> 28
Ser Ala Ala Glu Gln Leu Trp Val Thr Val Tyr Tyr Gly Val Pro Val
1 5 10 15
Trp Lys Asp Ala Thr Thr Thr Leu Phe Cys Ala Ser Asp Ala Lys Ala
20 25 30
Tyr Asp Lys Glu Ala His Asn Val Trp Ala Thr His Ala Cys Val Pro
35 40 45
Thr Asp Pro Asn Pro Arg Glu Ile Lys Leu Glu Asn Val Thr Glu Asn
50 55 60
Phe Asn Met Trp Lys Asn Asp Met Ala Asp Gln Met His Glu Asp Ile
65 70 75 80
Ile Ser Leu Trp Asp Gln Ser Leu Lys Pro Cys Val Glu Leu Thr Pro
85 90 95
Leu Cys Val Thr Leu Asn Cys Thr Asn Ile Ser Leu Asn Ser Thr Asn
100 105 110
Asn Asp Thr Ile Asn Ser Ser Asn Ser Thr Glu Gly Ile Asn Met Arg
115 120 125
Glu Glu Met Lys Asn Cys Ser Phe Asn Thr Thr Thr Ser Ile Gly Asp
130 135 140
Lys Asn Lys Arg Glu Tyr Ala Leu Phe Tyr Lys Leu Asp Val Val Pro
145 150 155 160
Ile Asp Asn Lys Thr Ser Tyr Thr Leu Ile Asn Cys Asn Thr Ser Val
165 170 175
Ile Lys Gln Ala Cys Pro Lys Val Thr Phe Glu Pro Ile Pro Ile His
180 185 190
Tyr Cys Ala Pro Ala Gly Phe Ala Ile Leu Lys Cys Leu Asn Lys Thr
195 200 205
Phe Asp Gly Asn Gly Thr Cys Thr Asn Val Ser Thr Val Gln Cys Thr
210 215 220
His Gly Ile Arg Pro Val Val Ser Thr Gln Leu Leu Leu Asn Gly Ser
225 230 235 240
Leu Ala Glu Glu Glu Val Val Ile Arg Tyr Glu Asn Val Gln Asp Asn
245 250 255
Thr Lys Thr Ile Ile Val Gln Leu Asn Glu Thr Val Lys Ile Asn Cys
260 265 270
Thr Arg Pro Asn Asn Asn Thr Arg Lys Gly Ile His Val Gly Trp Gly
275 280 285
Arg Pro Ile Tyr Ala Thr Gly Glu Ile Ile Gly Asp Ile Arg Gln Ala
290 295 300
His Arg Asn Leu Ser Lys Lys Asp Trp Gly Asp Thr Leu Lys Lys Ile
305 310 315 320
Ala Ile Lys Leu Gln Glu Gln Phe Asn Thr Thr Ile Ile Phe Glu Gln
325 330 335
Ser Ser Gly Gly Asp Pro Glu Ile Thr Met His Ser Leu Asn Cys Gly
340 345 350
Gly Glu Phe Phe Tyr Cys Asn Thr Ser Lys Leu Phe Asn Gly Thr Trp
355 360 365
Ser Asn Gly Thr Trp Thr Ser Gly Ile Trp Asn Asn Thr Gly Glu Ser
370 375 380
Asp Ser Thr Ile Thr Leu Pro Cys Arg Ile Lys Gln Ile Ile Asn Arg
385 390 395 400
Trp Gln Gly Val Gly Gln Ala Met Tyr Asn Pro Pro Ile Asn Gly Leu
405 410 415
Ile Ser Cys Ser Ser Asn Ile Thr Gly Leu Ile Leu Thr Arg Asp Gly
420 425 430
Gly Asn Asn Arg Ser Asn Glu Thr Phe Arg Pro Ser Gly Gly Asn Met
435 440 445
Arg Asp Asn Trp Arg Ser Glu Leu Tyr Lys Tyr Arg Val Val Lys Ile
450 455 460
Glu Pro Leu Gly Val Ala Pro Thr Lys Ala Lys Arg Arg Val Val Gln
465 470 475 480
Arg Glu Lys Arg
<210> 29
<211> 1646
<212> DNA
<213> Human immunodeficiency virus type 1
<400> 29
agaaagagca gaagacagtg gcaatgagag tgaaggggat caggaagaac tatcagcact 60
tgtggaaatg gggcaccttg ctccttggga tatcgatgat ctgtagtgct aaagaagaca 120
agttgtgggt cacagtctat tatggggtac ctgtgtggag agatgcaaac accactctat 180
tttgtgcatc aggtgctaaa gcatataaga cagaggtaca taatgtctgg gccacacatg 240
cctgtgtacc cacagacccc aacccacaag aagtggtatt gggaaatgtg acagaatatt 300
ttaatgcatg gaaaaatgac atggtagaac agatgcatga ggatataatc aatctatggg 360
atcaaagcct aaagccatgt gtaaaattaa ccccactctg tgtcacttta aattgcacta 420
acgttaaaaa caatgctacc aaaataaatg ataccaccac tacacctagt gaggaaatag 480
aaataaaaaa ctgctctttc aacatcaccg caggcataag agataagata cagaaagaat 540
atgcattgtt ttctaaattt gatttagtac aaatccatga agataataaa aataataata 600
atacaaacta tacagactat aggttgataa gttgtaacac ctcagtcatt acgcaggcct 660
gtccaaaagt atcctttgag ccaattccca tacatttttg taccccggct ggttttgcga 720
ttctaaagcg taataataag acattcaacg gaaaagggcc atgtacaaat gtcagtacag 780
tacagtgtac acatggaatt aggccagtag tatcaactca actgctgcta aatggcagtt 840
tagcagaaga ggatgtagta attagatctg aaaatttcac aaacaatgtt aaaaccataa 900
tagtacagct gaaagaagct gtacaaatta attgcacaag gcccaacaac aatacaagaa 960
aaagtatacc tataggacca gggagagcat tttatgcaac aggagacata ataggagata 1020
taagacaagc acattgtaac attagtggaa cacaatggaa taaaacttta ggaaagatag 1080
ttgaaaaatt aaaagaacaa tttgggaata aaacaataat ctttaaccaa cccgtaggag 1140
gggacccaga aattgtagcg cacactttta attgtggagg ggaatttttc tactgtaata 1200
caacacctct gtttaatagt acctggactt ggaatagtac ttggaatggt actacaagta 1260
ctgggaatgt tactaaaaaa attatcacac tccaatgcag aataagacaa attgtaaaca 1320
tgtggcagaa agtaggaaaa gcaatgtatg cccctcccat cagaggacag attggatgtt 1380
catcaaatat tacagggctg ctattaacaa gagatggtgg taatagtgag aacgggacca 1440
ataacacaga cacagagacc ttcagaccgg gaggaggaga tatgagggac aattggagaa 1500
gtgaattata taaatataaa gtagtaagaa ttgaaccatt aggaatagca cccactaagg 1560
caaggagaag agtggtgcag agagaaaaaa gagcagtggg aataggggct ttgttccttg 1620
ggttcttggg agcagcagga agcact 1646
<210> 30
<211> 496
<212> PRT
<213> Human immunodeficiency virus type 1
<400> 30
Ser Ala Lys Glu Asp Lys Leu Trp Val Thr Val Tyr Tyr Gly Val Pro
1 5 10 15
Val Trp Arg Asp Ala Asn Thr Thr Leu Phe Cys Ala Ser Gly Ala Lys
20 25 30
Ala Tyr Lys Thr Glu Val His Asn Val Trp Ala Thr His Ala Cys Val
35 40 45
Pro Thr Asp Pro Asn Pro Gln Glu Val Val Leu Gly Asn Val Thr Glu
50 55 60
Tyr Phe Asn Ala Trp Lys Asn Asp Met Val Glu Gln Met His Glu Asp
65 70 75 80
Ile Ile Asn Leu Trp Asp Gln Ser Leu Lys Pro Cys Val Lys Leu Thr
85 90 95
Pro Leu Cys Val Thr Leu Asn Cys Thr Asn Val Lys Asn Asn Ala Thr
100 105 110
Lys Ile Asn Asp Thr Thr Thr Thr Pro Ser Glu Glu Ile Glu Ile Lys
115 120 125
Asn Cys Ser Phe Asn Ile Thr Ala Gly Ile Arg Asp Lys Ile Gln Lys
130 135 140
Glu Tyr Ala Leu Phe Ser Lys Phe Asp Leu Val Gln Ile His Glu Asp
145 150 155 160
Asn Lys Asn Asn Asn Asn Thr Asn Tyr Thr Asp Tyr Arg Leu Ile Ser
165 170 175
Cys Asn Thr Ser Val Ile Thr Gln Ala Cys Pro Lys Val Ser Phe Glu
180 185 190
Pro Ile Pro Ile His Phe Cys Thr Pro Ala Gly Phe Ala Ile Leu Lys
195 200 205
Arg Asn Asn Lys Thr Phe Asn Gly Lys Gly Pro Cys Thr Asn Val Ser
210 215 220
Thr Val Gln Cys Thr His Gly Ile Arg Pro Val Val Ser Thr Gln Leu
225 230 235 240
Leu Leu Asn Gly Ser Leu Ala Glu Glu Asp Val Val Ile Arg Ser Glu
245 250 255
Asn Phe Thr Asn Asn Val Lys Thr Ile Ile Val Gln Leu Lys Glu Ala
260 265 270
Val Gln Ile Asn Cys Thr Arg Pro Asn Asn Asn Thr Arg Lys Ser Ile
275 280 285
Pro Ile Gly Pro Gly Arg Ala Phe Tyr Ala Thr Gly Asp Ile Ile Gly
290 295 300
Asp Ile Arg Gln Ala His Cys Asn Ile Ser Gly Thr Gln Trp Asn Lys
305 310 315 320
Thr Leu Gly Lys Ile Val Glu Lys Leu Lys Glu Gln Phe Gly Asn Lys
325 330 335
Thr Ile Ile Phe Asn Gln Pro Val Gly Gly Asp Pro Glu Ile Val Ala
340 345 350
His Thr Phe Asn Cys Gly Gly Glu Phe Phe Tyr Cys Asn Thr Thr Pro
355 360 365
Leu Phe Asn Ser Thr Trp Thr Trp Asn Ser Thr Trp Asn Gly Thr Thr
370 375 380
Ser Thr Gly Asn Val Thr Lys Lys Ile Ile Thr Leu Gln Cys Arg Ile
385 390 395 400
Arg Gln Ile Val Asn Met Trp Gln Lys Val Gly Lys Ala Met Tyr Ala
405 410 415
Pro Pro Ile Arg Gly Gln Ile Gly Cys Ser Ser Asn Ile Thr Gly Leu
420 425 430
Leu Leu Thr Arg Asp Gly Gly Asn Ser Glu Asn Gly Thr Asn Asn Thr
435 440 445
Asp Thr Glu Thr Phe Arg Pro Gly Gly Gly Asp Met Arg Asp Asn Trp
450 455 460
Arg Ser Glu Leu Tyr Lys Tyr Lys Val Val Arg Ile Glu Pro Leu Gly
465 470 475 480
Ile Ala Pro Thr Lys Ala Arg Arg Arg Val Val Gln Arg Glu Lys Arg
485 490 495
<210> 31
<211> 1606
<212> DNA
<213> Human immunodeficiency virus type 1
<400> 31
agaaagacag aagacagtgg caatgagagc gaaggggacc aggaagaatt gtcagcactt 60
gtggtggaga tggggcacca tgctctttgg gatgttgatg atctgtagtg ctgcaaaaga 120
aaaattgtgg gtcacagtct attatggggt gcctgtgtgg aaagaagcaa ccaccactct 180
attttgtgca tcagatgcta aagcatatga cacagaagca cataatgttt gggccacaca 240
tgcttgtgta cccacaaacc ctaacccaca agaagtatta ttgaaaaatg tgacagaaga 300
ttttaacatg tggaaaaata atatggtaga acagatgcat gaggatataa tcagtttatg 360
ggatcaaagc ctaaagccat gtgtgaaatt aaccccactc tgtgttactt tacattgcac 420
tgatgcgaac attactgcaa acagtactgc tactaacagt actgttagct ccattaaaga 480
agaagtgaaa aactgctctt tcaatatcac cacagaagta agagacaagg taaagaaaga 540
acatgcactt ttttatagac ttgatgtagt accaatagct aatgataata caagctatac 600
attggtaaat tgtaacacct caaccattac acaggcctgt ccaaaggtga cctttgaacc 660
aattcctata cattattgtg ccccggctgg ttttgcgatt ctaaaatgta atgataagaa 720
tttcaatgga acaggaccat gtaaaaatgt cagcacagta caatgcacac atggaattag 780
gccagtggtg tcaactcaac tactgttaaa tggcagtcta gcagaagatg aggtagtaat 840
tagatctgaa aatttcacaa acaatgcaaa aatcataata gtacagctaa atgaatctgt 900
aataattaat tatacaagac ctggcaacaa tacaagaaaa agtatacata taggaccggg 960
aagtgcattt tatgcaacag gagacataat aggagatata agacaagcac attgtaacat 1020
tagtaaagca gattgggaga aaacactaaa acaggtagtt aaaaaattac aggaacaata 1080
tgggaataaa acaataaact ttacccaatc ctcaggagga gacccagaaa ttgtaatgca 1140
cagtcttaat tgtggaggag aatttttcta ttgtaataca acaaagctgt ttaatagtac 1200
ttggcagaat ggtactattg taggatcaga aaatacgtca gacattatca tactcccatg 1260
cagaataaag caaattataa acaggtggca ggaagtagga aaagcaatgt atgcccctcc 1320
catcagcgga gacattagat gtacatcaaa tattacaggg ctgctattaa caagagatgg 1380
gggtataaag aacaagacca atgggacaga gacagagatc ttcagacctg caggaggaga 1440
tatgaaggac aattggagaa gtgaattata taaatataaa gtagtaaaaa ttgaaccgtt 1500
aggaatagca cccaccaggg caaggagaag agtggtgcag agggaaaaaa gagaagtgac 1560
gctgggagtt atgttccttg ggttcttggg agcagcagga agcact 1606
<210> 32
<211> 482
<212> PRT
<213> Human immunodeficiency virus type 1
<400> 32
Ser Ala Ala Lys Glu Lys Leu Trp Val Thr Val Tyr Tyr Gly Val Pro
1 5 10 15
Val Trp Lys Glu Ala Thr Thr Thr Leu Phe Cys Ala Ser Asp Ala Lys
20 25 30
Ala Tyr Asp Thr Glu Ala His Asn Val Trp Ala Thr His Ala Cys Val
35 40 45
Pro Thr Asn Pro Asn Pro Gln Glu Val Leu Leu Lys Asn Val Thr Glu
50 55 60
Asp Phe Asn Met Trp Lys Asn Asn Met Val Glu Gln Met His Glu Asp
65 70 75 80
Ile Ile Ser Leu Trp Asp Gln Ser Leu Lys Pro Cys Val Lys Leu Thr
85 90 95
Pro Leu Cys Val Thr Leu His Cys Thr Asp Ala Asn Ile Thr Ala Asn
100 105 110
Ser Thr Ala Thr Asn Ser Thr Val Ser Ser Ile Lys Glu Glu Val Lys
115 120 125
Asn Cys Ser Phe Asn Ile Thr Thr Glu Val Arg Asp Lys Val Lys Lys
130 135 140
Glu His Ala Leu Phe Tyr Arg Leu Asp Val Val Pro Ile Ala Asn Asp
145 150 155 160
Asn Thr Ser Tyr Thr Leu Val Asn Cys Asn Thr Ser Thr Ile Thr Gln
165 170 175
Ala Cys Pro Lys Val Thr Phe Glu Pro Ile Pro Ile His Tyr Cys Ala
180 185 190
Pro Ala Gly Phe Ala Ile Leu Lys Cys Asn Asp Lys Asn Phe Asn Gly
195 200 205
Thr Gly Pro Cys Lys Asn Val Ser Thr Val Gln Cys Thr His Gly Ile
210 215 220
Arg Pro Val Val Ser Thr Gln Leu Leu Leu Asn Gly Ser Leu Ala Glu
225 230 235 240
Asp Glu Val Val Ile Arg Ser Glu Asn Phe Thr Asn Asn Ala Lys Ile
245 250 255
Ile Ile Val Gln Leu Asn Glu Ser Val Ile Ile Asn Tyr Thr Arg Pro
260 265 270
Gly Asn Asn Thr Arg Lys Ser Ile His Ile Gly Pro Gly Ser Ala Phe
275 280 285
Tyr Ala Thr Gly Asp Ile Ile Gly Asp Ile Arg Gln Ala His Cys Asn
290 295 300
Ile Ser Lys Ala Asp Trp Glu Lys Thr Leu Lys Gln Val Val Lys Lys
305 310 315 320
Leu Gln Glu Gln Tyr Gly Asn Lys Thr Ile Asn Phe Thr Gln Ser Ser
325 330 335
Gly Gly Asp Pro Glu Ile Val Met His Ser Leu Asn Cys Gly Gly Glu
340 345 350
Phe Phe Tyr Cys Asn Thr Thr Lys Leu Phe Asn Ser Thr Trp Gln Asn
355 360 365
Gly Thr Ile Val Gly Ser Glu Asn Thr Ser Asp Ile Ile Ile Leu Pro
370 375 380
Cys Arg Ile Lys Gln Ile Ile Asn Arg Trp Gln Glu Val Gly Lys Ala
385 390 395 400
Met Tyr Ala Pro Pro Ile Ser Gly Asp Ile Arg Cys Thr Ser Asn Ile
405 410 415
Thr Gly Leu Leu Leu Thr Arg Asp Gly Gly Ile Lys Asn Lys Thr Asn
420 425 430
Gly Thr Glu Thr Glu Ile Phe Arg Pro Ala Gly Gly Asp Met Lys Asp
435 440 445
Asn Trp Arg Ser Glu Leu Tyr Lys Tyr Lys Val Val Lys Ile Glu Pro
450 455 460
Leu Gly Ile Ala Pro Thr Arg Ala Arg Arg Arg Val Val Gln Arg Glu
465 470 475 480
Lys Arg
<210> 33
<211> 1589
<212> DNA
<213> Human immunodeficiency virus type 1
<400> 33
agaaagagca gaagacagtg gcaatgaaag tgaaggggac caggaagagt tatcagcact 60
tgtggagatg gggcatcatg ccccttggga tgttgacgat ttgtagtgtt gcagaacaat 120
tgtgggtcac agtctattat ggggtacctg tgtggaaaga agccaccacc actctatttt 180
gtgcatcaga agctaaggca tatgttacag aggtacataa tatttgggcc acacatgcct 240
gtgtacccac agaccccaac ccacaagaag cagtattgga aaatgtgaca gaaaatttta 300
acatatggaa aaatgacatg gtagaccaga tgcatgagga tataatcagt gtatgggatc 360
aaagcctaaa gccatgtgtg aaattaaccc cgctctgtgt cactttaaat tgcactgatt 420
attttgggaa aactaatatt actaccactt ctagcagtgg tcccaataat gatagaggaa 480
tgaaaaactg ctctttcaat atcaccacaa gcataagaga taaggtaacg aaagaacatg 540
cactttttta tagagttgat gtagtcccaa tagatagtag taatagtagc tatagattga 600
taaattgtaa cacctcagtc attacacagg ccagtccaaa agtatccttt gagccaattc 660
ccatacatta ttgtaccccg gctggttttg cgattataaa gtgtaataat aagacattca 720
atggaacagg accatgtaga aatgtcagca cagtacaatg tacacatgga attaggccaa 780
tagtgtcaac tcagctgttg ttaaatggca gtctagcagt agaagaggta gtaattagat 840
ctgaaaatat cacgaacaat gctaaaacca taatagtaca attgaacgaa tctgtaagca 900
tcaattgtac aagacccagc aacaatacaa gaagggggat acatatggga ccagggagag 960
cattttggac aacaggtgaa gtaataggag atataaggaa agcacattgt aacattagta 1020
gaaaagaatg gaatgacact ttagacaagg tagtcaaaaa attaagggaa aaatttaatg 1080
caacaataat ctttaatcaa tccacaggag gggacccaga aattgtaatg cacactttta 1140
attgtggagg ggagtttttc tactgtaaca catcacaact gtttaatagt acttgggata 1200
ttaatggaaa tactactgga gggttagaag gcaatgacac aatcacactc caatgtagaa 1260
taaaacaaat tgtaaacatg tggcaggaag taggaaaagc aatgtatgcc cctcccatcc 1320
aaggaaaaat tagatgttca tcaaatatta cagggctgct attaacaaga gatggtggta 1380
ataacagtag taacaatgag accttcagac ctggaggagg agatatgagg gacaattgga 1440
gaagtgaact atataaatat aaagtagtaa aaattcaacc actaggaata gcacccacca 1500
aggcaaagag aagagtggtg cagagagaaa aaagagcagt gggaatagga gctttgttcc 1560
ttgggttctt gggagcagca ggaagcact 1589
<210> 34
<211> 477
<212> PRT
<213> Human immunodeficiency virus type 1
<400> 34
Ser Val Ala Glu Gln Leu Trp Val Thr Val Tyr Tyr Gly Val Pro Val
1 5 10 15
Trp Lys Glu Ala Thr Thr Thr Leu Phe Cys Ala Ser Glu Ala Lys Ala
20 25 30
Tyr Val Thr Glu Val His Asn Ile Trp Ala Thr His Ala Cys Val Pro
35 40 45
Thr Asp Pro Asn Pro Gln Glu Ala Val Leu Glu Asn Val Thr Glu Asn
50 55 60
Phe Asn Ile Trp Lys Asn Asp Met Val Asp Gln Met His Glu Asp Ile
65 70 75 80
Ile Ser Val Trp Asp Gln Ser Leu Lys Pro Cys Val Lys Leu Thr Pro
85 90 95
Leu Cys Val Thr Leu Asn Cys Thr Asp Tyr Phe Gly Lys Thr Asn Ile
100 105 110
Thr Thr Thr Ser Ser Ser Gly Pro Asn Asn Asp Arg Gly Met Lys Asn
115 120 125
Cys Ser Phe Asn Ile Thr Thr Ser Ile Arg Asp Lys Val Thr Lys Glu
130 135 140
His Ala Leu Phe Tyr Arg Val Asp Val Val Pro Ile Asp Ser Ser Asn
145 150 155 160
Ser Ser Tyr Arg Leu Ile Asn Cys Asn Thr Ser Val Ile Thr Gln Ala
165 170 175
Ser Pro Lys Val Ser Phe Glu Pro Ile Pro Ile His Tyr Cys Thr Pro
180 185 190
Ala Gly Phe Ala Ile Ile Lys Cys Asn Asn Lys Thr Phe Asn Gly Thr
195 200 205
Gly Pro Cys Arg Asn Val Ser Thr Val Gln Cys Thr His Gly Ile Arg
210 215 220
Pro Ile Val Ser Thr Gln Leu Leu Leu Asn Gly Ser Leu Ala Val Glu
225 230 235 240
Glu Val Val Ile Arg Ser Glu Asn Ile Thr Asn Asn Ala Lys Thr Ile
245 250 255
Ile Val Gln Leu Asn Glu Ser Val Ser Ile Asn Cys Thr Arg Pro Ser
260 265 270
Asn Asn Thr Arg Arg Gly Ile His Met Gly Pro Gly Arg Ala Phe Trp
275 280 285
Thr Thr Gly Glu Val Ile Gly Asp Ile Arg Lys Ala His Cys Asn Ile
290 295 300
Ser Arg Lys Glu Trp Asn Asp Thr Leu Asp Lys Val Val Lys Lys Leu
305 310 315 320
Arg Glu Lys Phe Asn Ala Thr Ile Ile Phe Asn Gln Ser Thr Gly Gly
325 330 335
Asp Pro Glu Ile Val Met His Thr Phe Asn Cys Gly Gly Glu Phe Phe
340 345 350
Tyr Cys Asn Thr Ser Gln Leu Phe Asn Ser Thr Trp Asp Ile Asn Gly
355 360 365
Asn Thr Thr Gly Gly Leu Glu Gly Asn Asp Thr Ile Thr Leu Gln Cys
370 375 380
Arg Ile Lys Gln Ile Val Asn Met Trp Gln Glu Val Gly Lys Ala Met
385 390 395 400
Tyr Ala Pro Pro Ile Gln Gly Lys Ile Arg Cys Ser Ser Asn Ile Thr
405 410 415
Gly Leu Leu Leu Thr Arg Asp Gly Gly Asn Asn Ser Ser Asn Asn Glu
420 425 430
Thr Phe Arg Pro Gly Gly Gly Asp Met Arg Asp Asn Trp Arg Ser Glu
435 440 445
Leu Tyr Lys Tyr Lys Val Val Lys Ile Gln Pro Leu Gly Ile Ala Pro
450 455 460
Thr Lys Ala Lys Arg Arg Val Val Gln Arg Glu Lys Arg
465 470 475
<210> 35
<211> 1703
<212> DNA
<213> Human immunodeficiency virus type 1
<400> 35
agaaagagca gaagacagtg gcaatgagag tgagggagac caggaagaat tatcagcact 60
tgtggagatg gggcaacata tggagatggg gcatgatgct ccttgggatg ttgatgatct 120
gtagtgctgc agaagatttg tgggtcacag tttattatgg ggtacctgtg tggaaagacg 180
caaagaccac tctattttgt gcatcagatg ctaaagcata taagacagag gtacataatg 240
tttgggccac gcatgcctgt gtacccacag accccaaccc acaagaagta gaaatgaaaa 300
atgtgacaga agattttaac atgtggaaaa ataatatggt agaacagatg catgaggata 360
taatcagttt atgggatcag agcctaaaac cacgtgtaaa attaacccca ctctgtgtta 420
ctttaaagtg ctttgatgtg aagaataaaa ccactactac cactactaat agtaccacat 480
ccactattag tactactacc actaagacgc ccactgttag taaagggaca gagaaatcag 540
aactgacaaa ctgctctttc aatatcacca caaacataag agataagttt cagaaaaact 600
atgcaatttt tgataaactt gatgtagtac caatagatga tgataatgat actactacta 660
acaataatac tagtaatgaa aaaagcttta ggttaataaa ttgtaacacc tcagtcatca 720
cacaggcctg cccaaagata tcatttgaac caattcccat acattattgt accccggctg 780
gttttgcgat tctaaagtgt aaagataaaa atttcaatgg aacaggaaaa tgtaaaagcg 840
tcagcacagt gcaatgtaca catggaatta ggccagtagt gtcaactcaa ctactgttaa 900
atggcagtct agcagaagaa gaggtagtaa ttagatctgc cgatttctcg gacaatacta 960
aaatcataat agtacagctg aataaatctg tagaaattaa ttgtacaaga cccaacaaca 1020
ataaaagaaa aagtataaat ataggaccag ggagagcaat gtttgcaaca ggagacataa 1080
taggagatat aagaaaagca tcttgtacca ttaatgaaac acaatggaat aacacgttac 1140
aacaggtagt tataaaatta aaagaacaat atggaaataa aacaatagtc tttgaccgcc 1200
cctcaggagg ggacccagaa attgtaatgc acagttttaa ttgtggagga gaatttttct 1260
attgtaattc aacacaactg tttaatagta gttgggggcc taatggtact cggaatggta 1320
ctacaacgat aaatggtact atcatactcc catgtagaat aaaacaaatt ataaacatgt 1380
ggcaggaagt aggaaaagca atgtatgccc ctcccatcga gggacttatt aactgtacat 1440
caaatatcac agggctacta ttaacaagag atggtggcca tgacaataat gacacaaaaa 1500
ataacaatac cgagatcttc agacctggag gaggagatat gagggacaat tggagaagtg 1560
aattatataa atataaagta gtaaaaattg aaccattagg aatagcaccc aacaggacaa 1620
aaagaatagt ggtgcaaaga gaaaaaagag cagtgggatt cggagctgtg ttccttgggt 1680
tcttgggagc agcaggaagc act 1703
<210> 36
<211> 509
<212> PRT
<213> Human immunodeficiency virus type 1
<400> 36
Ser Ala Ala Glu Asp Leu Trp Val Thr Val Tyr Tyr Gly Val Pro Val
1 5 10 15
Trp Lys Asp Ala Lys Thr Thr Leu Phe Cys Ala Ser Asp Ala Lys Ala
20 25 30
Tyr Lys Thr Glu Val His Asn Val Trp Ala Thr His Ala Cys Val Pro
35 40 45
Thr Asp Pro Asn Pro Gln Glu Val Glu Met Lys Asn Val Thr Glu Asp
50 55 60
Phe Asn Met Trp Lys Asn Asn Met Val Glu Gln Met His Glu Asp Ile
65 70 75 80
Ile Ser Leu Trp Asp Gln Ser Leu Lys Pro Arg Val Lys Leu Thr Pro
85 90 95
Leu Cys Val Thr Leu Lys Cys Phe Asp Val Lys Asn Lys Thr Thr Thr
100 105 110
Thr Thr Thr Asn Ser Thr Thr Ser Thr Ile Ser Thr Thr Thr Thr Lys
115 120 125
Thr Pro Thr Val Ser Lys Gly Thr Glu Lys Ser Glu Leu Thr Asn Cys
130 135 140
Ser Phe Asn Ile Thr Thr Asn Ile Arg Asp Lys Phe Gln Lys Asn Tyr
145 150 155 160
Ala Ile Phe Asp Lys Leu Asp Val Val Pro Ile Asp Asp Asp Asn Asp
165 170 175
Thr Thr Thr Asn Asn Asn Thr Ser Asn Glu Lys Ser Phe Arg Leu Ile
180 185 190
Asn Cys Asn Thr Ser Val Ile Thr Gln Ala Cys Pro Lys Ile Ser Phe
195 200 205
Glu Pro Ile Pro Ile His Tyr Cys Thr Pro Ala Gly Phe Ala Ile Leu
210 215 220
Lys Cys Lys Asp Lys Asn Phe Asn Gly Thr Gly Lys Cys Lys Ser Val
225 230 235 240
Ser Thr Val Gln Cys Thr His Gly Ile Arg Pro Val Val Ser Thr Gln
245 250 255
Leu Leu Leu Asn Gly Ser Leu Ala Glu Glu Glu Val Val Ile Arg Ser
260 265 270
Ala Asp Phe Ser Asp Asn Thr Lys Ile Ile Ile Val Gln Leu Asn Lys
275 280 285
Ser Val Glu Ile Asn Cys Thr Arg Pro Asn Asn Asn Lys Arg Lys Ser
290 295 300
Ile Asn Ile Gly Pro Gly Arg Ala Met Phe Ala Thr Gly Asp Ile Ile
305 310 315 320
Gly Asp Ile Arg Lys Ala Ser Cys Thr Ile Asn Glu Thr Gln Trp Asn
325 330 335
Asn Thr Leu Gln Gln Val Val Ile Lys Leu Lys Glu Gln Tyr Gly Asn
340 345 350
Lys Thr Ile Val Phe Asp Arg Pro Ser Gly Gly Asp Pro Glu Ile Val
355 360 365
Met His Ser Phe Asn Cys Gly Gly Glu Phe Phe Tyr Cys Asn Ser Thr
370 375 380
Gln Leu Phe Asn Ser Ser Trp Gly Pro Asn Gly Thr Arg Asn Gly Thr
385 390 395 400
Thr Thr Ile Asn Gly Thr Ile Ile Leu Pro Cys Arg Ile Lys Gln Ile
405 410 415
Ile Asn Met Trp Gln Glu Val Gly Lys Ala Met Tyr Ala Pro Pro Ile
420 425 430
Glu Gly Leu Ile Asn Cys Thr Ser Asn Ile Thr Gly Leu Leu Leu Thr
435 440 445
Arg Asp Gly Gly His Asp Asn Asn Asp Thr Lys Asn Asn Asn Thr Glu
450 455 460
Ile Phe Arg Pro Gly Gly Gly Asp Met Arg Asp Asn Trp Arg Ser Glu
465 470 475 480
Leu Tyr Lys Tyr Lys Val Val Lys Ile Glu Pro Leu Gly Ile Ala Pro
485 490 495
Asn Arg Thr Lys Arg Ile Val Val Gln Arg Glu Lys Arg
500 505
<210> 37
<211> 1658
<212> DNA
<213> Human immunodeficiency virus type 1
<400> 37
agaaagagca gaagacagtg gcaatgaaag tgaaggggat caagaagagt tatcagcact 60
tgttgagatg gggcgccatg ctccttggga tgttaatgat ctgtagtgct gcagaacaat 120
tgtgggtcac agtctattat ggggtacctg tgtggagaga agcaaacacc actctatttt 180
gtgcatcaga tgctaaagca tatgataaag aggtacataa tgtttgggcc acacatgcct 240
gtgtacccac agaccccaac ccacaagaag tagaattgga aaatgtgaca gaaaatttta 300
acatgtggaa aaatgacatg gtagaacaga tgcatgagga tataatcagt ttatgggatc 360
aaagcctaaa gccatgtgta aaattaaccc cactctgcgt tactttaaat tgtactgatt 420
taaggtcaca gaatgtgact tataccactg gtgctaatac cactatggct actaccacta 480
gtactaatac cactagtagt gggggagaga tgcaggtagg aatgaaaaac tgctctttca 540
atatcaccac aaacacacaa gataaggtga agggatatgc acattttgat aaccttgatc 600
tagtacaaat agaggatgaa aatcacagca ataacagcta taggttgata cattgtaaca 660
cctcagtaat tacacaggcc tgtccaaagg tatcctttga gccaattcct atacattatc 720
gtgccccggc tggttttgcg attctaaagt gtaaagataa gaagttcaat ggaacaggac 780
cctgtacaaa tgtcagcaca gtacagtgta cacatggaat taggccagta gtatccactc 840
aactgctgtt caatggcagt ctagcagaag aagaggtagt aattagatct gccaatttct 900
cagaaaatga taaaatcata atagtacagc tgaaagacgc tgtacaaatt aattgtacaa 960
gacccaacaa caacaccaga aaaggtatac atatgggacc agggaaagta ttttacgcaa 1020
cagaagtcat aggggacata aggcgagcac attgtaacat tagtaaagaa aattggaata 1080
atactttaaa acagatagct atacaattaa gagagcaaga gcagttcaag aataaaacaa 1140
tagtctttaa tcaatcctca ggaggggacc cagaaattgt aatgtctagt tttaattgtg 1200
gaggggaatt tttctactgt aatacaacac aactgtttaa tagtacttgg gagaatgata 1260
ctagtacttg gaatgatact gaagggtcga atggcactat cacactccca tgcagaataa 1320
aacaaattat caacatgtgg caggaggtag gaaaagcaat atatgcccct cccatcaaag 1380
gaccacttca ttgttcatca aatattacag ggctactatt aacaagagat ggtggtaata 1440
ctaatgagag caacaccacc gaggtcgagg tcttcagacc tttaggagga aacatgaggg 1500
acaattggag aagtgaatta tataaatata aagtagtaaa gattgaacca ttaggaatag 1560
cacccaccaa ggcaaagaga agagtggtgc agagagaaaa aagagcagtg ggaataggag 1620
ctgtgttcct tgggttcttg ggagcagcag gaagcact 1658
<210> 38
<211> 500
<212> PRT
<213> Human immunodeficiency virus type 1
<400> 38
Ser Ala Ala Glu Gln Leu Trp Val Thr Val Tyr Tyr Gly Val Pro Val
1 5 10 15
Trp Arg Glu Ala Asn Thr Thr Leu Phe Cys Ala Ser Asp Ala Lys Ala
20 25 30
Tyr Asp Lys Glu Val His Asn Val Trp Ala Thr His Ala Cys Val Pro
35 40 45
Thr Asp Pro Asn Pro Gln Glu Val Glu Leu Glu Asn Val Thr Glu Asn
50 55 60
Phe Asn Met Trp Lys Asn Asp Met Val Glu Gln Met His Glu Asp Ile
65 70 75 80
Ile Ser Leu Trp Asp Gln Ser Leu Lys Pro Cys Val Lys Leu Thr Pro
85 90 95
Leu Cys Val Thr Leu Asn Cys Thr Asp Leu Arg Ser Gln Asn Val Thr
100 105 110
Tyr Thr Thr Gly Ala Asn Thr Thr Met Ala Thr Thr Thr Ser Thr Asn
115 120 125
Thr Thr Ser Ser Gly Gly Glu Met Gln Val Gly Met Lys Asn Cys Ser
130 135 140
Phe Asn Ile Thr Thr Asn Thr Gln Asp Lys Val Lys Gly Tyr Ala His
145 150 155 160
Phe Asp Asn Leu Asp Leu Val Gln Ile Glu Asp Glu Asn His Ser Asn
165 170 175
Asn Ser Tyr Arg Leu Ile His Cys Asn Thr Ser Val Ile Thr Gln Ala
180 185 190
Cys Pro Lys Val Ser Phe Glu Pro Ile Pro Ile His Tyr Arg Ala Pro
195 200 205
Ala Gly Phe Ala Ile Leu Lys Cys Lys Asp Lys Lys Phe Asn Gly Thr
210 215 220
Gly Pro Cys Thr Asn Val Ser Thr Val Gln Cys Thr His Gly Ile Arg
225 230 235 240
Pro Val Val Ser Thr Gln Leu Leu Phe Asn Gly Ser Leu Ala Glu Glu
245 250 255
Glu Val Val Ile Arg Ser Ala Asn Phe Ser Glu Asn Asp Lys Ile Ile
260 265 270
Ile Val Gln Leu Lys Asp Ala Val Gln Ile Asn Cys Thr Arg Pro Asn
275 280 285
Asn Asn Thr Arg Lys Gly Ile His Met Gly Pro Gly Lys Val Phe Tyr
290 295 300
Ala Thr Glu Val Ile Gly Asp Ile Arg Arg Ala His Cys Asn Ile Ser
305 310 315 320
Lys Glu Asn Trp Asn Asn Thr Leu Lys Gln Ile Ala Ile Gln Leu Arg
325 330 335
Glu Gln Glu Gln Phe Lys Asn Lys Thr Ile Val Phe Asn Gln Ser Ser
340 345 350
Gly Gly Asp Pro Glu Ile Val Met Ser Ser Phe Asn Cys Gly Gly Glu
355 360 365
Phe Phe Tyr Cys Asn Thr Thr Gln Leu Phe Asn Ser Thr Trp Glu Asn
370 375 380
Asp Thr Ser Thr Trp Asn Asp Thr Glu Gly Ser Asn Gly Thr Ile Thr
385 390 395 400
Leu Pro Cys Arg Ile Lys Gln Ile Ile Asn Met Trp Gln Glu Val Gly
405 410 415
Lys Ala Ile Tyr Ala Pro Pro Ile Lys Gly Pro Leu His Cys Ser Ser
420 425 430
Asn Ile Thr Gly Leu Leu Leu Thr Arg Asp Gly Gly Asn Thr Asn Glu
435 440 445
Ser Asn Thr Thr Glu Val Glu Val Phe Arg Pro Leu Gly Gly Asn Met
450 455 460
Arg Asp Asn Trp Arg Ser Glu Leu Tyr Lys Tyr Lys Val Val Lys Ile
465 470 475 480
Glu Pro Leu Gly Ile Ala Pro Thr Lys Ala Lys Arg Arg Val Val Gln
485 490 495
Arg Glu Lys Arg
500
<210> 39
<211> 1568
<212> DNA
<213> Human immunodeficiency virus type 1
<400> 39
agaaagagca gaagacagtg gcaatgagag tgaaggggat catgaagaat tgtcagcact 60
tgtggtggag atggggcatg atgctccttg ggatgttgat gatctgtagt gctacagaac 120
aattgtgggt cacagtctat tatggggtac ctgtgtggaa agaagcaacc accactctat 180
tttgtgcatc agatgctaaa gcatatgaca cagaggcaca taatgtttgg gccacacatg 240
cctgtgtacc cacagaccct aacccacaag aagtagaatt gaaaaatgtg acagaaaatc 300
ttaacatgtg gaaaaatgac atggtagaac agatgcatga ggatataatc aatttatggg 360
atcaaagcct aaagccatgt gtaaaattaa ctccactctg tgtcacttta cattgcacta 420
atttgaatgt tactaccagt aatactacaa gttggggaga gatggaggca ggagaaataa 480
aaaactgctc tttcaatgtc accacacgca gaagaaataa gaaagaatat gcactttttt 540
ataaacttga tgtagtacct atagatagtg ataatgcaag ctatacgttg ataaattgta 600
acacttcagt cattacacaa gcctgtccaa aggtatcctt tgaaccaatt cccatacatt 660
attgtgcccc ggctggtttt gcgattctaa aatgtaatga taagaaattc aatggaacag 720
gaccatgtaa aaatgttagc acagtacaat gtacacatgg aattaggcca gtagtgtcaa 780
ttcaactgct gttaaatggc agtctagcag aagaagaggt agtaattaga tctgaaaatt 840
tctcgaacaa tgctaaagcc gtaatagtac agctgaatgc atctatagaa attaattgta 900
caagacccaa caacaataca agaaaagata tacatatagg accagggaga gcattatata 960
caacaggagg aataatagga gatataagac aagcacattg tagccttagg aaagcagaat 1020
ggaatgacac tttaaaacat gtagttacaa aattaagaga acaatttggg aataaaacaa 1080
tattctttaa tcaatcctca ggaggggacc cagagattgt aatgcacagt tttaattgtg 1140
gaggggaatt tttctactgt aatacaacaa tgctgtttaa tagtaatagt acttggaatg 1200
atactacagg accagataat aacactatca tactcccatg tagaataaaa caaattataa 1260
acaggtggca ggaagtagga aaagcaatgt atgcccctcc tatcagtgga ccaattaaac 1320
gcacatcaaa tattacaggg ctactattaa caagagatgg tggtagtaac accaccgaga 1380
ccttcagacc tggaggagga gatatgaggg acaattggag aagtgaatta tataaatata 1440
aagtagtaaa aattgagcca ttaggggtag cacccaccag ggcaaggaga agagtggtgc 1500
agagagaaaa aagagcagtg ggactgggag ctgtattcct tgggttcttg ggagcagcag 1560
gaagcact 1568
<210> 40
<211> 469
<212> PRT
<213> Human immunodeficiency virus type 1
<400> 40
Ser Ala Thr Glu Gln Leu Trp Val Thr Val Tyr Tyr Gly Val Pro Val
1 5 10 15
Trp Lys Glu Ala Thr Thr Thr Leu Phe Cys Ala Ser Asp Ala Lys Ala
20 25 30
Tyr Asp Thr Glu Ala His Asn Val Trp Ala Thr His Ala Cys Val Pro
35 40 45
Thr Asp Pro Asn Pro Gln Glu Val Glu Leu Lys Asn Val Thr Glu Asn
50 55 60
Leu Asn Met Trp Lys Asn Asp Met Val Glu Gln Met His Glu Asp Ile
65 70 75 80
Ile Asn Leu Trp Asp Gln Ser Leu Lys Pro Cys Val Lys Leu Thr Pro
85 90 95
Leu Cys Val Thr Leu His Cys Thr Asn Leu Asn Val Thr Thr Ser Asn
100 105 110
Thr Thr Ser Trp Gly Glu Met Glu Ala Gly Glu Ile Lys Asn Cys Ser
115 120 125
Phe Asn Val Thr Thr Arg Arg Arg Asn Lys Lys Glu Tyr Ala Leu Phe
130 135 140
Tyr Lys Leu Asp Val Val Pro Ile Asp Ser Asp Asn Ala Ser Tyr Thr
145 150 155 160
Leu Ile Asn Cys Asn Thr Ser Val Ile Thr Gln Ala Cys Pro Lys Val
165 170 175
Ser Phe Glu Pro Ile Pro Ile His Tyr Cys Ala Pro Ala Gly Phe Ala
180 185 190
Ile Leu Lys Cys Asn Asp Lys Lys Phe Asn Gly Thr Gly Pro Cys Lys
195 200 205
Asn Val Ser Thr Val Gln Cys Thr His Gly Ile Arg Pro Val Val Ser
210 215 220
Ile Gln Leu Leu Leu Asn Gly Ser Leu Ala Glu Glu Glu Val Val Ile
225 230 235 240
Arg Ser Glu Asn Phe Ser Asn Asn Ala Lys Ala Val Ile Val Gln Leu
245 250 255
Asn Ala Ser Ile Glu Ile Asn Cys Thr Arg Pro Asn Asn Asn Thr Arg
260 265 270
Lys Asp Ile His Ile Gly Pro Gly Arg Ala Leu Tyr Thr Thr Gly Gly
275 280 285
Ile Ile Gly Asp Ile Arg Gln Ala His Cys Ser Leu Arg Lys Ala Glu
290 295 300
Trp Asn Asp Thr Leu Lys His Val Val Thr Lys Leu Arg Glu Gln Phe
305 310 315 320
Gly Asn Lys Thr Ile Phe Phe Asn Gln Ser Ser Gly Gly Asp Pro Glu
325 330 335
Ile Val Met His Ser Phe Asn Cys Gly Gly Glu Phe Phe Tyr Cys Asn
340 345 350
Thr Thr Met Leu Phe Asn Ser Asn Ser Thr Trp Asn Asp Thr Thr Gly
355 360 365
Pro Asp Asn Asn Thr Ile Ile Leu Pro Cys Arg Ile Lys Gln Ile Ile
370 375 380
Asn Arg Trp Gln Glu Val Gly Lys Ala Met Tyr Ala Pro Pro Ile Ser
385 390 395 400
Gly Pro Ile Lys Arg Thr Ser Asn Ile Thr Gly Leu Leu Leu Thr Arg
405 410 415
Asp Gly Gly Ser Asn Thr Thr Glu Thr Phe Arg Pro Gly Gly Gly Asp
420 425 430
Met Arg Asp Asn Trp Arg Ser Glu Leu Tyr Lys Tyr Lys Val Val Lys
435 440 445
Ile Glu Pro Leu Gly Val Ala Pro Thr Arg Ala Arg Arg Arg Val Val
450 455 460
Gln Arg Glu Lys Arg
465
<210> 41
<211> 1541
<212> DNA
<213> Human immunodeficiency virus type 1
<400> 41
agaaagagca gaagacagtg gcaatgagag tgaaggagat gaggaagcac tggcagcact 60
ggtggacagg gggcatcatg ctccttggga tgctgatgat ctgtagtgct gtaaacaact 120
tgtgggtcac tgtatattat ggagtgcctg tgtggagaga agcaaccacc accctatttt 180
gtgcatcaga tgctaaagca tacaaaacag aggtacataa tgtctgggcc acacatgcct 240
gtgtacccac agaccccaac ccacaagaga tagatttggt aaatgtgaca gaaaatttta 300
acatgtggaa aaataacatg gtagaacaga tgcatgagga tataatcagt ttatgggatc 360
aaagcctaaa accatgcgta aaattaaccc cactctgtgt tgtgaaaaac tgctctttca 420
acaccaccac aatggtaagg gatagggagc ggaaagaata tgctctcttt tataaacttg 480
atgtagtaca aatgaatgat aataataata atagtaccca tggaacctat agattgataa 540
attgtaatac ctcagtcatt acacaggcct gtccaaaggt atcctttgag ccaattccca 600
tacattattg cgccccggct ggttttgcga ttctaaagtg taaagacaag aagttcaatg 660
gaacgggacc atgcaaaaat gtcagcacag tacagtgtac acatggaatt aggccagtag 720
tgtcaactca actgctgcta aatggcagtc tagcagaaga agaggtaata attagatctg 780
aaaacttcac aagcaatgct aaaaccataa tagtacagct aaatgaaact gtagagatta 840
attgcacaag acctagcaac aatacaagaa gaagtataca tataggacca gggagagcat 900
tttacacaac aggagacata ataggagata taagaaaagc acattgtaac attagtagaa 960
caaaatggaa taacacttta ggacagatag ttgaaaaatt acaagaacaa tttaagaata 1020
aaacaataat ctttaattca tcctcaggag gggacccaga aattgtatat cacagtttta 1080
attgtggagg ggaatttttc tactgtaata caacagaact gttcgatagt acctggtata 1140
gcccctggaa cagtactggt gggtcaaata acactgaagg gaatagcacg atcacactca 1200
aatgcagaat aaagcaaatt gtaaacaggt ggcaggaagt aggaaaagca atgtatgccc 1260
ctcccatcca gggaaaaatt aaatgttcat caaatattac agggctacta ttaacaagag 1320
acggtggtaa tagtaatagt agtaacgaga ccttcagacc aggaggagga gacatgagag 1380
acaattggag aagtgaatta tataaatata aagtaataaa aattgaacca ttaggagtag 1440
cacccaccaa ggcaaagaga agagtggtgc aaagagaaag aagagcagtg ggaacaatag 1500
gagctatgtt ccttgggttc ttgggagcag caggaagcac t 1541
<210> 42
<211> 460
<212> PRT
<213> Human immunodeficiency virus type 1
<400> 42
Ser Ala Val Asn Asn Leu Trp Val Thr Val Tyr Tyr Gly Val Pro Val
1 5 10 15
Trp Arg Glu Ala Thr Thr Thr Leu Phe Cys Ala Ser Asp Ala Lys Ala
20 25 30
Tyr Lys Thr Glu Val His Asn Val Trp Ala Thr His Ala Cys Val Pro
35 40 45
Thr Asp Pro Asn Pro Gln Glu Ile Asp Leu Val Asn Val Thr Glu Asn
50 55 60
Phe Asn Met Trp Lys Asn Asn Met Val Glu Gln Met His Glu Asp Ile
65 70 75 80
Ile Ser Leu Trp Asp Gln Ser Leu Lys Pro Cys Val Lys Leu Thr Pro
85 90 95
Leu Cys Val Val Lys Asn Cys Ser Phe Asn Thr Thr Thr Met Val Arg
100 105 110
Asp Arg Glu Arg Lys Glu Tyr Ala Leu Phe Tyr Lys Leu Asp Val Val
115 120 125
Gln Met Asn Asp Asn Asn Asn Asn Ser Thr His Gly Thr Tyr Arg Leu
130 135 140
Ile Asn Cys Asn Thr Ser Val Ile Thr Gln Ala Cys Pro Lys Val Ser
145 150 155 160
Phe Glu Pro Ile Pro Ile His Tyr Cys Ala Pro Ala Gly Phe Ala Ile
165 170 175
Leu Lys Cys Lys Asp Lys Lys Phe Asn Gly Thr Gly Pro Cys Lys Asn
180 185 190
Val Ser Thr Val Gln Cys Thr His Gly Ile Arg Pro Val Val Ser Thr
195 200 205
Gln Leu Leu Leu Asn Gly Ser Leu Ala Glu Glu Glu Val Ile Ile Arg
210 215 220
Ser Glu Asn Phe Thr Ser Asn Ala Lys Thr Ile Ile Val Gln Leu Asn
225 230 235 240
Glu Thr Val Glu Ile Asn Cys Thr Arg Pro Ser Asn Asn Thr Arg Arg
245 250 255
Ser Ile His Ile Gly Pro Gly Arg Ala Phe Tyr Thr Thr Gly Asp Ile
260 265 270
Ile Gly Asp Ile Arg Lys Ala His Cys Asn Ile Ser Arg Thr Lys Trp
275 280 285
Asn Asn Thr Leu Gly Gln Ile Val Glu Lys Leu Gln Glu Gln Phe Lys
290 295 300
Asn Lys Thr Ile Ile Phe Asn Ser Ser Ser Gly Gly Asp Pro Glu Ile
305 310 315 320
Val Tyr His Ser Phe Asn Cys Gly Gly Glu Phe Phe Tyr Cys Asn Thr
325 330 335
Thr Glu Leu Phe Asp Ser Thr Trp Tyr Ser Pro Trp Asn Ser Thr Gly
340 345 350
Gly Ser Asn Asn Thr Glu Gly Asn Ser Thr Ile Thr Leu Lys Cys Arg
355 360 365
Ile Lys Gln Ile Val Asn Arg Trp Gln Glu Val Gly Lys Ala Met Tyr
370 375 380
Ala Pro Pro Ile Gln Gly Lys Ile Lys Cys Ser Ser Asn Ile Thr Gly
385 390 395 400
Leu Leu Leu Thr Arg Asp Gly Gly Asn Ser Asn Ser Ser Asn Glu Thr
405 410 415
Phe Arg Pro Gly Gly Gly Asp Met Arg Asp Asn Trp Arg Ser Glu Leu
420 425 430
Tyr Lys Tyr Lys Val Ile Lys Ile Glu Pro Leu Gly Val Ala Pro Thr
435 440 445
Lys Ala Lys Arg Arg Val Val Gln Arg Glu Arg Arg
450 455 460
<210> 43
<211> 1625
<212> DNA
<213> Human immunodeficiency virus type 1
<400> 43
agaaagagca gaagacagtg gcaatgagag tgaaggggat caggaagagt tatcaaaact 60
tgtggaaatg gggcacattg ctccttggga tattgatgat tagtagtgct acagaacaat 120
tgtgggtcac agtctattat ggggtacctg tgtggaaaga agcaaccacc actctatttt 180
gtgcatcaga tgctaaagcc tataatacag aggttcataa tgtttgggcc acacatgcct 240
gtgtacccac agaccccaat ccacaagaag taatgttaaa tgtgacagag aattttaaca 300
tgtggaaaaa tgacatggta gaacagatgc aggaggatat aatcagttta tgggatcaaa 360
gcctaaagcc atgtgtaaaa ttaaccccac tctgtgttac tttaagctac actgatgcaa 420
atagtactga tgttaatcat accaaaaata gtagtgaggg aatgatggag ggagaaaaaa 480
tgaaaaactg ctctttcaat atcaccacaa gcatgggaaa taagatgcag aaagaatatg 540
cactttttca tagacttgat gtaataccaa tggataatga aagtgctagt gctaactatt 600
ctaactatag gttaataagc tgtaacacct cagtcactac acaggcttgt ccaaaaatat 660
catttgagcc aattcccata cattattgta ccccggctgg ttttgcgatt ctaaaatgta 720
atgataagaa attcagtgga aaaggaggat gtagaaatgt cagtacagta caatgcacac 780
atggaattaa gccagtagta tcgactcaac tactgttaaa tggcagtcta gcagaagaag 840
atgtggtaat taaatctgcc aatttctcgg acaatgctaa aaccataata gtacagctga 900
atgaatctgt aataattaat tgtacaagac ccaacaacaa tacaagaaag ggtatacata 960
tgggaccagg gaaaacattt tatgcaacag gagccataat aggagatata agacaagcac 1020
attgcaatgt tagtagaaca gaccggaata acactttaaa aagggtagct aaaaaactac 1080
aagaacaatt taatacaaca aaagttgtct ttaaacaatc ctcaggaggg gacccagaaa 1140
ttgtaatgca cagttttaat tgtggagggg aatttttcta ctgtaataca tcagggctgt 1200
ttaatagtac ttggccttgg aatgatacta aagaggcaaa taacactaac acaatcacac 1260
tcccatgcaa aataaaacaa atcataaaca tgtggcaggc agtagggaaa gcaatgtatg 1320
cccctcccat tagtgggata attaaatgtg aatctaatat tacagggctg ctactaacaa 1380
gagatggtgg tagtaagaac acaacagata gtaatgacac aaacataaca caagaggtct 1440
tcagaccagg aggaggagat atgagggaca attggagaag tgaattatat aaatataaag 1500
tagtaagaat tgaaccatta ggagtagcac ccactaaggc aaaaagaaga gtggtgcaga 1560
gagaaaaaag agcagtggga ataggagctg tgttccttgg gttcttggga gcagcaggaa 1620
gcact 1625
<210> 44
<211> 489
<212> PRT
<213> Human immunodeficiency virus type 1
<400> 44
Ser Ala Thr Glu Gln Leu Trp Val Thr Val Tyr Tyr Gly Val Pro Val
1 5 10 15
Trp Lys Glu Ala Thr Thr Thr Leu Phe Cys Ala Ser Asp Ala Lys Ala
20 25 30
Tyr Asn Thr Glu Val His Asn Val Trp Ala Thr His Ala Cys Val Pro
35 40 45
Thr Asp Pro Asn Pro Gln Glu Val Met Leu Asn Val Thr Glu Asn Phe
50 55 60
Asn Met Trp Lys Asn Asp Met Val Glu Gln Met Gln Glu Asp Ile Ile
65 70 75 80
Ser Leu Trp Asp Gln Ser Leu Lys Pro Cys Val Lys Leu Thr Pro Leu
85 90 95
Cys Val Thr Leu Ser Tyr Thr Asp Ala Asn Ser Thr Asp Val Asn His
100 105 110
Thr Lys Asn Ser Ser Glu Gly Met Met Glu Gly Glu Lys Met Lys Asn
115 120 125
Cys Ser Phe Asn Ile Thr Thr Ser Met Gly Asn Lys Met Gln Lys Glu
130 135 140
Tyr Ala Leu Phe His Arg Leu Asp Val Ile Pro Met Asp Asn Glu Ser
145 150 155 160
Ala Ser Ala Asn Tyr Ser Asn Tyr Arg Leu Ile Ser Cys Asn Thr Ser
165 170 175
Val Thr Thr Gln Ala Cys Pro Lys Ile Ser Phe Glu Pro Ile Pro Ile
180 185 190
His Tyr Cys Thr Pro Ala Gly Phe Ala Ile Leu Lys Cys Asn Asp Lys
195 200 205
Lys Phe Ser Gly Lys Gly Gly Cys Arg Asn Val Ser Thr Val Gln Cys
210 215 220
Thr His Gly Ile Lys Pro Val Val Ser Thr Gln Leu Leu Leu Asn Gly
225 230 235 240
Ser Leu Ala Glu Glu Asp Val Val Ile Lys Ser Ala Asn Phe Ser Asp
245 250 255
Asn Ala Lys Thr Ile Ile Val Gln Leu Asn Glu Ser Val Ile Ile Asn
260 265 270
Cys Thr Arg Pro Asn Asn Asn Thr Arg Lys Gly Ile His Met Gly Pro
275 280 285
Gly Lys Thr Phe Tyr Ala Thr Gly Ala Ile Ile Gly Asp Ile Arg Gln
290 295 300
Ala His Cys Asn Val Ser Arg Thr Asp Arg Asn Asn Thr Leu Lys Arg
305 310 315 320
Val Ala Lys Lys Leu Gln Glu Gln Phe Asn Thr Thr Lys Val Val Phe
325 330 335
Lys Gln Ser Ser Gly Gly Asp Pro Glu Ile Val Met His Ser Phe Asn
340 345 350
Cys Gly Gly Glu Phe Phe Tyr Cys Asn Thr Ser Gly Leu Phe Asn Ser
355 360 365
Thr Trp Pro Trp Asn Asp Thr Lys Glu Ala Asn Asn Thr Asn Thr Ile
370 375 380
Thr Leu Pro Cys Lys Ile Lys Gln Ile Ile Asn Met Trp Gln Ala Val
385 390 395 400
Gly Lys Ala Met Tyr Ala Pro Pro Ile Ser Gly Ile Ile Lys Cys Glu
405 410 415
Ser Asn Ile Thr Gly Leu Leu Leu Thr Arg Asp Gly Gly Ser Lys Asn
420 425 430
Thr Thr Asp Ser Asn Asp Thr Asn Ile Thr Gln Glu Val Phe Arg Pro
435 440 445
Gly Gly Gly Asp Met Arg Asp Asn Trp Arg Ser Glu Leu Tyr Lys Tyr
450 455 460
Lys Val Val Arg Ile Glu Pro Leu Gly Val Ala Pro Thr Lys Ala Lys
465 470 475 480
Arg Arg Val Val Gln Arg Glu Lys Arg
485
<210> 45
<211> 1606
<212> DNA
<213> Human immunodeficiency virus type 1
<400> 45
agaaagagca gaagacagtg gcaatgagag cgaaggagat caagaggaat tgtcagcact 60
cgtggagatg gggcatcatg ctccttggga tgttaatgat ctatagtact gcagaaaaaa 120
cgtgggtcac agtatattat ggggtacctg tgtggaagga agcaaacacc actctatttt 180
gtgcatcaga tgctaaagca tatgatacag aggcacataa tgtttgggcc acacatgcct 240
gtgtacccac agaccccaac ccacaagaaa taaagttgga aaatgtgaca gagaacttta 300
acatgtggaa aaacaacatg gtagaacaga tgcatgagga tattatcagt ttatgggatc 360
aaagcctaca gccaagtgta aaattaaccc cactttgtgt tactttaaat tgctctactg 420
cgaattttac taaaaggaat tttactaata gcactgagca tgaaaagccg agtgcagaaa 480
tgagaaactg ctctttcaat atcaccacaa tcgtaagaga taaggtaaca aaagaacatg 540
cactttttta tagagatgat gtagtaccaa tagataatgc tagtaataat accagttata 600
ggttaataaa ttgtaatacc tcagtcatta cacaggcctg tccaaagata tcctttgagc 660
caattcctat acattattgt gccccggctg gttttgcgat tctaaagtgt aatgataaga 720
catttaatgg aacagggcta tgtaaaaatg tcagcacagt acaatgtaca catggaatta 780
gaccagtagt gtcaactcaa ctgttgttaa atggcagtct agcagaaaaa aaggtagtag 840
ttagatctga ggagttttca gacaatgcta aatccataat agtacagctg aatacatctg 900
tagtaattaa ttgtacaaga cccggcaata atacaagaag aagcatacct atgggaccag 960
gaagagtatt ttatgcaaca gatataatag gagacataag acaggcacat tgtaacctta 1020
gtagagcagc atggaataac actttaaagc tgatagccgc agaattaaaa gaaatatata 1080
ataaaacaat agcctttaat cgatcctcag gaggggaccc agaaattgta atgcacactt 1140
ttaattgtgg gggggaattt ttctactgta atacaacaca actgtttaat agtgcttgga 1200
acagtactaa tatttatact gggaataatt ctactaatga caccatctct aatgacgcca 1260
tctcactccc atgtagaata aaacaaattg taaacatgtg gcaggaagta ggaaaagcaa 1320
tgtatgcccc tcccatcaga ggaaacatca gctgttcctc aaatattaca gggctgatat 1380
tgacaagaga tggcgggaat agtagtagta ataccgagat cttcagacct cagggaggga 1440
atataaagga caattggaga agtgaattat ataaatataa agtagtaaga attgaaccat 1500
taggattagc acccaccaag gcaaagagga gagtggtgcg gagagaaaaa gagcggtaac 1560
gttcggagct ttgttccttg ggttcttggg agcagcagga agcact 1606
<210> 46
<211> 483
<212> PRT
<213> Human immunodeficiency virus type 1
<400> 46
Ser Thr Ala Glu Lys Thr Trp Val Thr Val Tyr Tyr Gly Val Pro Val
1 5 10 15
Trp Lys Glu Ala Asn Thr Thr Leu Phe Cys Ala Ser Asp Ala Lys Ala
20 25 30
Tyr Asp Thr Glu Ala His Asn Val Trp Ala Thr His Ala Cys Val Pro
35 40 45
Thr Asp Pro Asn Pro Gln Glu Ile Lys Leu Glu Asn Val Thr Glu Asn
50 55 60
Phe Asn Met Trp Lys Asn Asn Met Val Glu Gln Met His Glu Asp Ile
65 70 75 80
Ile Ser Leu Trp Asp Gln Ser Leu Gln Pro Ser Val Lys Leu Thr Pro
85 90 95
Leu Cys Val Thr Leu Asn Cys Ser Thr Ala Asn Phe Thr Lys Arg Asn
100 105 110
Phe Thr Asn Ser Thr Glu His Glu Lys Pro Ser Ala Glu Met Arg Asn
115 120 125
Cys Ser Phe Asn Ile Thr Thr Ile Val Arg Asp Lys Val Thr Lys Glu
130 135 140
His Ala Leu Phe Tyr Arg Asp Asp Val Val Pro Ile Asp Asn Ala Ser
145 150 155 160
Asn Asn Thr Ser Tyr Arg Leu Ile Asn Cys Asn Thr Ser Val Ile Thr
165 170 175
Gln Ala Cys Pro Lys Ile Ser Phe Glu Pro Ile Pro Ile His Tyr Cys
180 185 190
Ala Pro Ala Gly Phe Ala Ile Leu Lys Cys Asn Asp Lys Thr Phe Asn
195 200 205
Gly Thr Gly Leu Cys Lys Asn Val Ser Thr Val Gln Cys Thr His Gly
210 215 220
Ile Arg Pro Val Val Ser Thr Gln Leu Leu Leu Asn Gly Ser Leu Ala
225 230 235 240
Glu Lys Lys Val Val Val Arg Ser Glu Glu Phe Ser Asp Asn Ala Lys
245 250 255
Ser Ile Ile Val Gln Leu Asn Thr Ser Val Val Ile Asn Cys Thr Arg
260 265 270
Pro Gly Asn Asn Thr Arg Arg Ser Ile Pro Met Gly Pro Gly Arg Val
275 280 285
Phe Tyr Ala Thr Asp Ile Ile Gly Asp Ile Arg Gln Ala His Cys Asn
290 295 300
Leu Ser Arg Ala Ala Trp Asn Asn Thr Leu Lys Leu Ile Ala Ala Glu
305 310 315 320
Leu Lys Glu Ile Tyr Asn Lys Thr Ile Ala Phe Asn Arg Ser Ser Gly
325 330 335
Gly Asp Pro Glu Ile Val Met His Thr Phe Asn Cys Gly Gly Glu Phe
340 345 350
Phe Tyr Cys Asn Thr Thr Gln Leu Phe Asn Ser Ala Trp Asn Ser Thr
355 360 365
Asn Ile Tyr Thr Gly Asn Asn Ser Thr Asn Asp Thr Ile Ser Asn Asp
370 375 380
Ala Ile Ser Leu Pro Cys Arg Ile Lys Gln Ile Val Asn Met Trp Gln
385 390 395 400
Glu Val Gly Lys Ala Met Tyr Ala Pro Pro Ile Arg Gly Asn Ile Ser
405 410 415
Cys Ser Ser Asn Ile Thr Gly Leu Ile Leu Thr Arg Asp Gly Gly Asn
420 425 430
Ser Ser Ser Asn Thr Glu Ile Phe Arg Pro Gln Gly Gly Asn Ile Lys
435 440 445
Asp Asn Trp Arg Ser Glu Leu Tyr Lys Tyr Lys Val Val Arg Ile Glu
450 455 460
Pro Leu Gly Leu Ala Pro Thr Lys Ala Lys Arg Arg Val Val Arg Arg
465 470 475 480
Glu Lys Glu
<210> 47
<211> 1613
<212> DNA
<213> Human immunodeficiency virus type 1
<400> 47
agaaagagca gaagacagtg gcaatgaaag tgaaggggat caggaagaat tatcagtgct 60
tgtggggatg gggcaccatg ctcctcggga tattgatgat ctgtagtgct gcagaaaatt 120
tgtgggtcac agtctactat ggggtacctg tgtggaaaga agcaaccacc actctatttc 180
gtgcatcaga tgctaaagca tatgatacag aggtacataa tgtttgggcc acacatgcct 240
gtgtacccac agaccccaac ccacaagaag tagtattgga aaatgttaca gaaaatttta 300
atatgtggaa aaatgacatg gtagaacaga tgcaggagga tatagtcagt ttatgggatc 360
aaagcctaaa gccatgtgta gaatcaactc cactctgtgt tactctaaat tgtactgatg 420
tgaagaagaa tgctaataat accactggta ataccactga tggtaacgtg gaaaggttgg 480
agaaagaaga aataaaaaac tgctctttca atatcaccac aagcataaga gataagatgc 540
ggaaagaata tgcacttttt tatagccttg atgtagtacc aatagataag gataatacaa 600
gctataggtt gataagttgt aacacctcag tcattacaca ggcctgtcca aaagtatcct 660
ttgagccaat tcccatacat tattgtgccc cggctggttt tgcgcttcta aaatgtaatg 720
ataaggagtt caatggaaca ggaccatgta ggaatgtcag cacagtccaa tgtacacatg 780
gaattaggcc agtagtatca actcaactgc tgttaaatgg cagtctagca gaagaaaaga 840
tagtaattag atctgagaat ttcacgagca atgctaaaac tataatagta cagctgaata 900
aatctataga aattaattgt ataagaccca acaacaacac aagaagaagt atacatatag 960
gaccaggggg agcattttat gcaacagaaa taataggaga tataagacaa gcacattgta 1020
ccctcaatag aacagaatgg aataacactc taggacagat agttaaaaaa ttaagagaac 1080
aatatgggaa taaaacaata aaatttacgc agccctcctc aggaggggac ccagaagttg 1140
taatgcacag ttttaattgt ggaggggaat ttttctactg taattcatca cagctgttta 1200
atagtacttg ggatgttact gaagggtcaa ataacactga aggaagcaac gatacaggaa 1260
tcatactccc gtgcagaata aaacaaataa taaacatgtg gcagaaagta ggaaaagcaa 1320
tgtatgcccc tcccatcaga ggacaaatta actgtacatc aaatattaca gggctactat 1380
taataagaga tggtggtaac aacgggaccg acaacgggac cgagaccttc agacctggag 1440
gaggagatat gagggacaat tggagaagtg aattatataa atataaagta gtaaaaattg 1500
aaccattagg agtagcaccc accaaggcaa agagaagagt ggtgcagaga gaaaaaagag 1560
cagtgggaat aggagctttg ttccttgggt tcttgggagc agcaggaagc act 1613
<210> 48
<211> 485
<212> PRT
<213> Human immunodeficiency virus type 1
<400> 48
Ser Ala Ala Glu Asn Leu Trp Val Thr Val Tyr Tyr Gly Val Pro Val
1 5 10 15
Trp Lys Glu Ala Thr Thr Thr Leu Phe Arg Ala Ser Asp Ala Lys Ala
20 25 30
Tyr Asp Thr Glu Val His Asn Val Trp Ala Thr His Ala Cys Val Pro
35 40 45
Thr Asp Pro Asn Pro Gln Glu Val Val Leu Glu Asn Val Thr Glu Asn
50 55 60
Phe Asn Met Trp Lys Asn Asp Met Val Glu Gln Met Gln Glu Asp Ile
65 70 75 80
Val Ser Leu Trp Asp Gln Ser Leu Lys Pro Cys Val Glu Ser Thr Pro
85 90 95
Leu Cys Val Thr Leu Asn Cys Thr Asp Val Lys Lys Asn Ala Asn Asn
100 105 110
Thr Thr Gly Asn Thr Thr Asp Gly Asn Val Glu Arg Leu Glu Lys Glu
115 120 125
Glu Ile Lys Asn Cys Ser Phe Asn Ile Thr Thr Ser Ile Arg Asp Lys
130 135 140
Met Arg Lys Glu Tyr Ala Leu Phe Tyr Ser Leu Asp Val Val Pro Ile
145 150 155 160
Asp Lys Asp Asn Thr Ser Tyr Arg Leu Ile Ser Cys Asn Thr Ser Val
165 170 175
Ile Thr Gln Ala Cys Pro Lys Val Ser Phe Glu Pro Ile Pro Ile His
180 185 190
Tyr Cys Ala Pro Ala Gly Phe Ala Leu Leu Lys Cys Asn Asp Lys Glu
195 200 205
Phe Asn Gly Thr Gly Pro Cys Arg Asn Val Ser Thr Val Gln Cys Thr
210 215 220
His Gly Ile Arg Pro Val Val Ser Thr Gln Leu Leu Leu Asn Gly Ser
225 230 235 240
Leu Ala Glu Glu Lys Ile Val Ile Arg Ser Glu Asn Phe Thr Ser Asn
245 250 255
Ala Lys Thr Ile Ile Val Gln Leu Asn Lys Ser Ile Glu Ile Asn Cys
260 265 270
Ile Arg Pro Asn Asn Asn Thr Arg Arg Ser Ile His Ile Gly Pro Gly
275 280 285
Gly Ala Phe Tyr Ala Thr Glu Ile Ile Gly Asp Ile Arg Gln Ala His
290 295 300
Cys Thr Leu Asn Arg Thr Glu Trp Asn Asn Thr Leu Gly Gln Ile Val
305 310 315 320
Lys Lys Leu Arg Glu Gln Tyr Gly Asn Lys Thr Ile Lys Phe Thr Gln
325 330 335
Pro Ser Ser Gly Gly Asp Pro Glu Val Val Met His Ser Phe Asn Cys
340 345 350
Gly Gly Glu Phe Phe Tyr Cys Asn Ser Ser Gln Leu Phe Asn Ser Thr
355 360 365
Trp Asp Val Thr Glu Gly Ser Asn Asn Thr Glu Gly Ser Asn Asp Thr
370 375 380
Gly Ile Ile Leu Pro Cys Arg Ile Lys Gln Ile Ile Asn Met Trp Gln
385 390 395 400
Lys Val Gly Lys Ala Met Tyr Ala Pro Pro Ile Arg Gly Gln Ile Asn
405 410 415
Cys Thr Ser Asn Ile Thr Gly Leu Leu Leu Ile Arg Asp Gly Gly Asn
420 425 430
Asn Gly Thr Asp Asn Gly Thr Glu Thr Phe Arg Pro Gly Gly Gly Asp
435 440 445
Met Arg Asp Asn Trp Arg Ser Glu Leu Tyr Lys Tyr Lys Val Val Lys
450 455 460
Ile Glu Pro Leu Gly Val Ala Pro Thr Lys Ala Lys Arg Arg Val Val
465 470 475 480
Gln Arg Glu Lys Arg
485
<210> 49
<211> 1616
<212> DNA
<213> Human immunodeficiency virus type 1
<400> 49
agaaagagca gaagacagtg gcaatgagag tgaaggggat caggaggaat tatcagtgct 60
tgtggacatg gggcacgatg ctccttggga tgttgatgat ctgtagtgct gtagaacaat 120
tgtgggtcac agtctattat ggggtgcctg tgtgggaaga agcgaccacc actctatttt 180
gtgcatcaga tgctaaaaca tatgatccag aggtacataa tgtttgggcc acacatgcct 240
gtgtacccac agaccccaac ccacaagaag tagtattggg aaatgtgaca gaaaacttta 300
acatgtggaa aaatgacatg gtaaaccaaa tgcatgagga tataatcagt ttatgggatc 360
aaagtttaaa gccatgtgca aaattaaccc cactctgtgt tactctaaat tgcactgata 420
agttgaatat taatactacc agtaccaata gtagtaccaa taatactact agtagtggag 480
tggatgaagg gggaatgaaa aactgctctt tcaatgtcac cacaagcata agagataggg 540
tgcagaaaga atatgcactt ttttataaac ctgatgtagt accaatagat gataatacta 600
ataatactag ctataggttg ataaattgta acacctcagt cattacacaa gcctgtccaa 660
aggtaacctt tgatccaatt cccatacatt attgtgcccc ggctggtttt gcgattctaa 720
aatgtaacaa taagacgttc aatggatcag gaccatgtac aaatgtcagc acagtacaat 780
gtacacatgg aattaaacca gtggtgtcga ctcaattgct gttaaatggc agtctagcag 840
aggaagaaat agtaatcagg tctgaagatt tcacggacaa tgatagaacc ataatagtac 900
agctgaatga atctgtagta attcattgta caagacccaa caacaataca agaaaaagta 960
tacacctagg accagggagt gcattttatg caacaggaga tataatagga gatataaaac 1020
aagcacattg taacattagt agagcaaatt ggactaacac cttaaaacag atagctggaa 1080
aattaaaaga acagtttgga aataagacaa tattctttaa tcaatcctca ggaggggatc 1140
cagaaattgt aacacacagt ttcaattgta gaggggaatt tttctactgt aatacatcac 1200
aattgtttaa cagtacttgg cttcataata atagtactgg gaatgatact gaaaagaatg 1260
gtaatatcac actcccacgc agaataaaac aaattataaa catgtggcag caagtaggaa 1320
aagcaatgta tgccccccct gtcaaaggac taattacatg ttcatcaaat attacaggac 1380
tgctattagt aagagatggt ggtaacaaca ccaacgccac cgacaccgag accttcagac 1440
ctggaggagg agatatgagg gacaattgga gaagtgaatt atataaatat aaagtagtaa 1500
aaatcaaacc attaggaata gcacccacca aggcaaaaag aagagtggtg cagagagaaa 1560
aaagagcagc actaggagct atgttccttg ggttcttggg agcagcagga agcact 1616
<210> 50
<211> 487
<212> PRT
<213> Human immunodeficiency virus type 1
<400> 50
Ser Ala Val Glu Gln Leu Trp Val Thr Val Tyr Tyr Gly Val Pro Val
1 5 10 15
Trp Glu Glu Ala Thr Thr Thr Leu Phe Cys Ala Ser Asp Ala Lys Thr
20 25 30
Tyr Asp Pro Glu Val His Asn Val Trp Ala Thr His Ala Cys Val Pro
35 40 45
Thr Asp Pro Asn Pro Gln Glu Val Val Leu Gly Asn Val Thr Glu Asn
50 55 60
Phe Asn Met Trp Lys Asn Asp Met Val Asn Gln Met His Glu Asp Ile
65 70 75 80
Ile Ser Leu Trp Asp Gln Ser Leu Lys Pro Cys Ala Lys Leu Thr Pro
85 90 95
Leu Cys Val Thr Leu Asn Cys Thr Asp Lys Leu Asn Ile Asn Thr Thr
100 105 110
Ser Thr Asn Ser Ser Thr Asn Asn Thr Thr Ser Ser Gly Val Asp Glu
115 120 125
Gly Gly Met Lys Asn Cys Ser Phe Asn Val Thr Thr Ser Ile Arg Asp
130 135 140
Arg Val Gln Lys Glu Tyr Ala Leu Phe Tyr Lys Pro Asp Val Val Pro
145 150 155 160
Ile Asp Asp Asn Thr Asn Asn Thr Ser Tyr Arg Leu Ile Asn Cys Asn
165 170 175
Thr Ser Val Ile Thr Gln Ala Cys Pro Lys Val Thr Phe Asp Pro Ile
180 185 190
Pro Ile His Tyr Cys Ala Pro Ala Gly Phe Ala Ile Leu Lys Cys Asn
195 200 205
Asn Lys Thr Phe Asn Gly Ser Gly Pro Cys Thr Asn Val Ser Thr Val
210 215 220
Gln Cys Thr His Gly Ile Lys Pro Val Val Ser Thr Gln Leu Leu Leu
225 230 235 240
Asn Gly Ser Leu Ala Glu Glu Glu Ile Val Ile Arg Ser Glu Asp Phe
245 250 255
Thr Asp Asn Asp Arg Thr Ile Ile Val Gln Leu Asn Glu Ser Val Val
260 265 270
Ile His Cys Thr Arg Pro Asn Asn Asn Thr Arg Lys Ser Ile His Leu
275 280 285
Gly Pro Gly Ser Ala Phe Tyr Ala Thr Gly Asp Ile Ile Gly Asp Ile
290 295 300
Lys Gln Ala His Cys Asn Ile Ser Arg Ala Asn Trp Thr Asn Thr Leu
305 310 315 320
Lys Gln Ile Ala Gly Lys Leu Lys Glu Gln Phe Gly Asn Lys Thr Ile
325 330 335
Phe Phe Asn Gln Ser Ser Gly Gly Asp Pro Glu Ile Val Thr His Ser
340 345 350
Phe Asn Cys Arg Gly Glu Phe Phe Tyr Cys Asn Thr Ser Gln Leu Phe
355 360 365
Asn Ser Thr Trp Leu His Asn Asn Ser Thr Gly Asn Asp Thr Glu Lys
370 375 380
Asn Gly Asn Ile Thr Leu Pro Arg Arg Ile Lys Gln Ile Ile Asn Met
385 390 395 400
Trp Gln Gln Val Gly Lys Ala Met Tyr Ala Pro Pro Val Lys Gly Leu
405 410 415
Ile Thr Cys Ser Ser Asn Ile Thr Gly Leu Leu Leu Val Arg Asp Gly
420 425 430
Gly Asn Asn Thr Asn Ala Thr Asp Thr Glu Thr Phe Arg Pro Gly Gly
435 440 445
Gly Asp Met Arg Asp Asn Trp Arg Ser Glu Leu Tyr Lys Tyr Lys Val
450 455 460
Val Lys Ile Lys Pro Leu Gly Ile Ala Pro Thr Lys Ala Lys Arg Arg
465 470 475 480
Val Val Gln Arg Glu Lys Arg
485
<210> 51
<211> 1613
<212> DNA
<213> Human immunodeficiency virus type 1
<400> 51
agaaagagca gaagacagtg gcaatgagag tgagggggac caggaagaat tatcagcact 60
tgtggagatg gggcatcatg ctccttggga tgtcaatgat ctgtaatgct acaaaagatt 120
tgtgggtcac agtttattat ggggtacctg tatggaaaga agcaaacacc agtctatttt 180
gtgcatcaga tgctaaagca tatgatacag aggtacataa tgtttgggcc acacatgcct 240
gtgtacccac agaccccaac ccacaagaag tattcatgaa caatgtgaca gaaaatttta 300
acatgtggaa aaataacatg gtagaacaaa tgcatgagga tataatcagt ttatgggatc 360
aaagcctaaa accatgtgta aaattaaccc cactcggtgt tactctagat tgcactaagg 420
ctaatattac caataatagt accactaata gtagcggggg aatagaggag ggaagagaca 480
tagaaaattg ctctttcaat atcaccacaa acataagaga taagataaag aaagaatatg 540
cactttttta tagccttgat gtgatagcaa tagatgatag tagtaatagt agtaatagaa 600
gctataggtt gagaggttgt aacacctcaa ccatcacaca ggcctgtcca aaggtaacct 660
ttgagccaat tccaatacat tattgtgccc cagctggttt tgcgattcta aagtgtaacg 720
ataagaagtt caatggaaca ggaccatgta aaaatgtcag tacagtacaa tgtacacatg 780
gaattaggcc agtagtatca actcaactgc tgctaaatgg cagtatagca gaaaaagagg 840
tagtaattag gtccgctaat ttcacggaca atgctaaaac cataatagta cagctgaata 900
actctgtaca aattaattgc acaagacccg gcaacaatac aagaaaaagt atacatatag 960
gaccaggcag agcattttat gcacatgaaa taatagggga gataagacaa gcacattgta 1020
cccttaacag aacacaatgg aataacactt taaaacagat agttataaaa ttaagagaac 1080
aatttaacaa taagacaata gtctttaatc actcctcagg aggggaccca gaaattgtaa 1140
cacacagttt taattgtgga ggggaatttt tctactgtaa tacatcacaa ttatttaata 1200
gtacctggag gagtaatgaa actgtaaatg acactatggg aaaggacaca aatgacacaa 1260
caatcacact cccatgcaga ataaaacaaa ttataaacat gtggcaggaa gtaggaaaag 1320
caatgtatgc cccgcccatc agaggacaaa ttagctgttc atcaaatatt acagggctgc 1380
tattaacaag agatggtggt gtgaacgaga ccaacgccac cgaggtcttc agacctggag 1440
gaggagatat gagggacaat tggagaagtg aattatataa atataaagta gtagaaattg 1500
aaccattagg aatagcaccc accaaggcaa agagaagagt ggtgcagaga gagaaaagag 1560
cagtgggaat aggagctgtg ttccttgggt tcttgggagc agcaggaagc act 1613
<210> 52
<211> 485
<212> PRT
<213> Human immunodeficiency virus type 1
<400> 52
Asn Ala Thr Lys Asp Leu Trp Val Thr Val Tyr Tyr Gly Val Pro Val
1 5 10 15
Trp Lys Glu Ala Asn Thr Ser Leu Phe Cys Ala Ser Asp Ala Lys Ala
20 25 30
Tyr Asp Thr Glu Val His Asn Val Trp Ala Thr His Ala Cys Val Pro
35 40 45
Thr Asp Pro Asn Pro Gln Glu Val Phe Met Asn Asn Val Thr Glu Asn
50 55 60
Phe Asn Met Trp Lys Asn Asn Met Val Glu Gln Met His Glu Asp Ile
65 70 75 80
Ile Ser Leu Trp Asp Gln Ser Leu Lys Pro Cys Val Lys Leu Thr Pro
85 90 95
Leu Gly Val Thr Leu Asp Cys Thr Lys Ala Asn Ile Thr Asn Asn Ser
100 105 110
Thr Thr Asn Ser Ser Gly Gly Ile Glu Glu Gly Arg Asp Ile Glu Asn
115 120 125
Cys Ser Phe Asn Ile Thr Thr Asn Ile Arg Asp Lys Ile Lys Lys Glu
130 135 140
Tyr Ala Leu Phe Tyr Ser Leu Asp Val Ile Ala Ile Asp Asp Ser Ser
145 150 155 160
Asn Ser Ser Asn Arg Ser Tyr Arg Leu Arg Gly Cys Asn Thr Ser Thr
165 170 175
Ile Thr Gln Ala Cys Pro Lys Val Thr Phe Glu Pro Ile Pro Ile His
180 185 190
Tyr Cys Ala Pro Ala Gly Phe Ala Ile Leu Lys Cys Asn Asp Lys Lys
195 200 205
Phe Asn Gly Thr Gly Pro Cys Lys Asn Val Ser Thr Val Gln Cys Thr
210 215 220
His Gly Ile Arg Pro Val Val Ser Thr Gln Leu Leu Leu Asn Gly Ser
225 230 235 240
Ile Ala Glu Lys Glu Val Val Ile Arg Ser Ala Asn Phe Thr Asp Asn
245 250 255
Ala Lys Thr Ile Ile Val Gln Leu Asn Asn Ser Val Gln Ile Asn Cys
260 265 270
Thr Arg Pro Gly Asn Asn Thr Arg Lys Ser Ile His Ile Gly Pro Gly
275 280 285
Arg Ala Phe Tyr Ala His Glu Ile Ile Gly Glu Ile Arg Gln Ala His
290 295 300
Cys Thr Leu Asn Arg Thr Gln Trp Asn Asn Thr Leu Lys Gln Ile Val
305 310 315 320
Ile Lys Leu Arg Glu Gln Phe Asn Asn Lys Thr Ile Val Phe Asn His
325 330 335
Ser Ser Gly Gly Asp Pro Glu Ile Val Thr His Ser Phe Asn Cys Gly
340 345 350
Gly Glu Phe Phe Tyr Cys Asn Thr Ser Gln Leu Phe Asn Ser Thr Trp
355 360 365
Arg Ser Asn Glu Thr Val Asn Asp Thr Met Gly Lys Asp Thr Asn Asp
370 375 380
Thr Thr Ile Thr Leu Pro Cys Arg Ile Lys Gln Ile Ile Asn Met Trp
385 390 395 400
Gln Glu Val Gly Lys Ala Met Tyr Ala Pro Pro Ile Arg Gly Gln Ile
405 410 415
Ser Cys Ser Ser Asn Ile Thr Gly Leu Leu Leu Thr Arg Asp Gly Gly
420 425 430
Val Asn Glu Thr Asn Ala Thr Glu Val Phe Arg Pro Gly Gly Gly Asp
435 440 445
Met Arg Asp Asn Trp Arg Ser Glu Leu Tyr Lys Tyr Lys Val Val Glu
450 455 460
Ile Glu Pro Leu Gly Ile Ala Pro Thr Lys Ala Lys Arg Arg Val Val
465 470 475 480
Gln Arg Glu Lys Arg
485
<210> 53
<211> 1658
<212> DNA
<213> Human immunodeficiency virus type 1
<400> 53
agaaagagca gaagacagtg gcaatgagag cgaaggggat caggaagaat tggcagcact 60
tgtggagatg gggcaccatg ctcccatggg gcaccatgct ccttgggatg ttaatgatga 120
tctgtagtgc tgcagaagaa aaatgggtca cagtctatta tggggtacct gtgtggaaag 180
aagcaaccac cactctattt tgtgcatcag atgctaaagc atatgacaca gaggtacata 240
atgtttgggc cacacatgcc tgtgtaccca cagaccctaa cccacaagaa gtagtattgg 300
gaaatgtgac agaaaatttt aatgtgtgga aaaatgacat ggtagaacag atgcatgaag 360
atataatcag cttatgggat caaagcctaa agccatgtgt aaaattaacc ccactctgtg 420
ttgctttaaa ttgcactaat gtgaatgata ctaggacaaa caatagtagt agtagtgata 480
aaaatgatgc taaaaccaat agtagtagta gttgggaaag gatggaagga gaagtaaaaa 540
actgctcttt caatgttacc acaagaataa gaaacaaggt gcagaaagaa tatgcacttt 600
tttataagct tgatgtagtg ccaatagaga aggataatgc aagctataca ttgataaatt 660
gtaacacctc agtcattaca caggcctgtc caaaggtatc ttttgaacca attcccatac 720
attattgtgc cccggctggt tttgcgattc taaagcgtaa tgataagaag ttcaatggaa 780
aaggcccatg tacaaatgtc agcacagtac gatgtacaca tggaattagg ccagtagtgt 840
caactcaact actgttaaat ggcagtctag cagaagaagg ggtagtaatt agatctgaaa 900
atctcacgaa caatgttaaa accataatag tacagctgaa caaatctgta aaaattaatt 960
gtacaagacc caacaacaat acaagaaaaa gtataaatat aggaccaggg agagcatttt 1020
atgcaacagg agcaataata ggaaatatga gacaagcaca ttgtaacctt aatggaacag 1080
aatggaagaa cactttaaga caggtagtta taagcttaag agagaaattt gggaagaaga 1140
caatagtctt caaccaatcc tcaggagggg atttagaaat tataatgcac aattttaatt 1200
gtggagggga atttttctac tgtgatacaa cacagctgtt taatagtact tggctgccta 1260
atgagactac agagtcaaat aacattactg gaggacctaa tgacacactc acgctcccat 1320
gtagaataaa acaaattata aacagatggc aggaagtagg aaaagcaatg tatgcccctc 1380
ccatcagtgg gcaaattaga tgctcatcaa atattacggg gctgctatta acaagagatg 1440
gtggtgagga gcagaatgac actgaggtct ttagacctgg aggaggagat atgagggaca 1500
attggagaag tgaattatat aaatataaag tagtaagaat tgagccatca ggagtagcac 1560
ccaccaaggc aaagagaaga gtggtgcaaa gagacaaaag agcagtggga gcactaggag 1620
ctatgttcct tgggttcttg ggagcagcag gaagcact 1658
<210> 54
<211> 490
<212> PRT
<213> Human immunodeficiency virus type 1
<400> 54
Ser Ala Ala Glu Glu Lys Trp Val Thr Val Tyr Tyr Gly Val Pro Val
1 5 10 15
Trp Lys Glu Ala Thr Thr Thr Leu Phe Cys Ala Ser Asp Ala Lys Ala
20 25 30
Tyr Asp Thr Glu Val His Asn Val Trp Ala Thr His Ala Cys Val Pro
35 40 45
Thr Asp Pro Asn Pro Gln Glu Val Val Leu Gly Asn Val Thr Glu Asn
50 55 60
Phe Asn Val Trp Lys Asn Asp Met Val Glu Gln Met His Glu Asp Ile
65 70 75 80
Ile Ser Leu Trp Asp Gln Ser Leu Lys Pro Cys Val Lys Leu Thr Pro
85 90 95
Leu Cys Val Ala Leu Asn Cys Thr Asn Val Asn Asp Thr Arg Thr Asn
100 105 110
Asn Ser Ser Ser Ser Asp Lys Asn Asp Ala Lys Thr Asn Ser Ser Ser
115 120 125
Ser Trp Glu Arg Met Glu Gly Glu Val Lys Asn Cys Ser Phe Asn Val
130 135 140
Thr Thr Arg Ile Arg Asn Lys Val Gln Lys Glu Tyr Ala Leu Phe Tyr
145 150 155 160
Lys Leu Asp Val Val Pro Ile Glu Lys Asp Asn Ala Ser Tyr Thr Leu
165 170 175
Ile Asn Cys Asn Thr Ser Val Ile Thr Gln Ala Cys Pro Lys Val Ser
180 185 190
Phe Glu Pro Ile Pro Ile His Tyr Cys Ala Pro Ala Gly Phe Ala Ile
195 200 205
Leu Lys Arg Asn Asp Lys Lys Phe Asn Gly Lys Gly Pro Cys Thr Asn
210 215 220
Val Ser Thr Val Arg Cys Thr His Gly Ile Arg Pro Val Val Ser Thr
225 230 235 240
Gln Leu Leu Leu Asn Gly Ser Leu Ala Glu Glu Gly Val Val Ile Arg
245 250 255
Ser Glu Asn Leu Thr Asn Asn Val Lys Thr Ile Ile Val Gln Leu Asn
260 265 270
Lys Ser Val Lys Ile Asn Cys Thr Arg Pro Asn Asn Asn Thr Arg Lys
275 280 285
Ser Ile Asn Ile Gly Pro Gly Arg Ala Phe Tyr Ala Thr Gly Ala Ile
290 295 300
Ile Gly Asn Met Arg Gln Ala His Cys Asn Leu Asn Gly Thr Glu Trp
305 310 315 320
Lys Asn Thr Leu Arg Gln Val Val Ile Ser Leu Arg Glu Lys Phe Gly
325 330 335
Lys Lys Thr Ile Val Phe Asn Gln Ser Ser Gly Gly Asp Leu Glu Ile
340 345 350
Ile Met His Asn Phe Asn Cys Gly Gly Glu Phe Phe Tyr Cys Asp Thr
355 360 365
Thr Gln Leu Phe Asn Ser Thr Trp Leu Pro Asn Glu Thr Thr Glu Ser
370 375 380
Asn Asn Ile Thr Gly Gly Pro Asn Asp Thr Leu Thr Leu Pro Cys Arg
385 390 395 400
Ile Lys Gln Ile Ile Asn Arg Trp Gln Glu Val Gly Lys Ala Met Tyr
405 410 415
Ala Pro Pro Ile Ser Gly Gln Ile Arg Cys Ser Ser Asn Ile Thr Gly
420 425 430
Leu Leu Leu Thr Arg Asp Gly Gly Glu Glu Gln Asn Asp Thr Glu Val
435 440 445
Phe Arg Pro Gly Gly Gly Asp Met Arg Asp Asn Trp Arg Ser Glu Leu
450 455 460
Tyr Lys Tyr Lys Val Val Arg Ile Glu Pro Ser Gly Val Ala Pro Thr
465 470 475 480
Lys Ala Lys Arg Arg Val Val Gln Arg Asp
485 490
<210> 55
<211> 1658
<212> DNA
<213> Human immunodeficiency virus type 1
<400> 55
agaaagagca gaagacagtg gcaatgaaag tgaaggggac caggaagaat tatcagcact 60
tatggagatg gggcaccatg ctcctatgga gatggggcac catgctcctt gggctattaa 120
tgatctgtaa tgctatagaa gaatcgtggg tcacagtcca ttatggagta cctgtgtgga 180
aagaagcaaa caccactctg ttttgtgcat cagatgctaa agcatatgat acagaggtac 240
ataatgtttg ggccacacat gcctgtgtac ccacaaaccc caacccacaa gaagtagact 300
tgggaaatgt gacagaaaat tttaatgcat ggaaaaatga catggtagaa caaatgcatg 360
aggatataat tagtttatgg gatcaaagcc taaagccatg tgtaaaatta actccactct 420
gtgttactct acagtgcact gatttgagga atgatactaa taccactagt agtcctaata 480
ccactagtgg taactggatg gataaaaggg aaatgaaaaa ctgctctttc aatatcacca 540
caagcataag agataagctg cagaaaacat ttgcactttt ttataaactt gatatagtac 600
caataaatga ggacaaaaac agtagtaata ttgataatac cagttatagg ttgataagtt 660
gtaatacctc agtcattaca caggcctgtc caaaggtatc ctttgagcca attcccatac 720
attattgtgc cccggctggt tttgcgattc taaaatgtaa agatgaggag ttcaatggaa 780
caggaccatg taaaaatgtc agtacagtac aatatacaca tggaattagg ccagtagtat 840
caactcaact gctgttaaat ggcagtctag cagaacaagg ggtagtactt agatctaaaa 900
atatctcaga caatactaaa accataatag tacagctaaa agaagctgta acaattaagt 960
gtacaagacc caacaacaat acaagaaaaa gtatacatat aggacctggg agagcatttt 1020
atgcaacagg agacataata ggagatataa gacaagccca ttgtaacatc agtgcaacaa 1080
agtggaatga cacgttacgt cagatagttg aaaaattaca aggatcattt aagaataaaa 1140
caataagctt caagcgatcc tcaggagggg atccagagat tgtaatgcac agttttaatt 1200
gtggagggga atttttctat tgtaattcaa caaaactgtt taatagtact tggtatagta 1260
atgggactag tacttttgat aatactactg aacgaacaaa tgacactatc atactcccat 1320
gcagaataaa acaaattata aacatgtggc aggaagtagg aaaagcaatg tatgcccctc 1380
ccatcccagg actaattaac tgttcatcaa atattacagg actgctatta ataagagatg 1440
gtggtaataa ctatactgac aatactgaga tcttcagacc tggaggagga gacatgaggg 1500
acaattggag aagtgaatta tataaatata aagtagtaaa agttgaacca ttaggtatag 1560
cacccaccaa ggcaaagaga agagtggtac agagagaaaa aagagcagtg ggaataggag 1620
cgtttttcct tgggttcttg ggagcagcag gaagcact 1658
<210> 56
<211> 490
<212> PRT
<213> Human immunodeficiency virus type 1
<400> 56
Asn Ala Ile Glu Glu Ser Trp Val Thr Val His Tyr Gly Val Pro Val
1 5 10 15
Trp Lys Glu Ala Asn Thr Thr Leu Phe Cys Ala Ser Asp Ala Lys Ala
20 25 30
Tyr Asp Thr Glu Val His Asn Val Trp Ala Thr His Ala Cys Val Pro
35 40 45
Thr Asn Pro Asn Pro Gln Glu Val Asp Leu Gly Asn Val Thr Glu Asn
50 55 60
Phe Asn Ala Trp Lys Asn Asp Met Val Glu Gln Met His Glu Asp Ile
65 70 75 80
Ile Ser Leu Trp Asp Gln Ser Leu Lys Pro Cys Val Lys Leu Thr Pro
85 90 95
Leu Cys Val Thr Leu Gln Cys Thr Asp Leu Arg Asn Asp Thr Asn Thr
100 105 110
Thr Ser Ser Pro Asn Thr Thr Ser Gly Asn Trp Met Asp Lys Arg Glu
115 120 125
Met Lys Asn Cys Ser Phe Asn Ile Thr Thr Ser Ile Arg Asp Lys Leu
130 135 140
Gln Lys Thr Phe Ala Leu Phe Tyr Lys Leu Asp Ile Val Pro Ile Asn
145 150 155 160
Glu Asp Lys Asn Ser Ser Asn Ile Asp Asn Thr Ser Tyr Arg Leu Ile
165 170 175
Ser Cys Asn Thr Ser Val Ile Thr Gln Ala Cys Pro Lys Val Ser Phe
180 185 190
Glu Pro Ile Pro Ile His Tyr Cys Ala Pro Ala Gly Phe Ala Ile Leu
195 200 205
Lys Cys Lys Asp Glu Glu Phe Asn Gly Thr Gly Pro Cys Lys Asn Val
210 215 220
Ser Thr Val Gln Tyr Thr His Gly Ile Arg Pro Val Val Ser Thr Gln
225 230 235 240
Leu Leu Leu Asn Gly Ser Leu Ala Glu Gln Gly Val Val Leu Arg Ser
245 250 255
Lys Asn Ile Ser Asp Asn Thr Lys Thr Ile Ile Val Gln Leu Lys Glu
260 265 270
Ala Val Thr Ile Lys Cys Thr Arg Pro Asn Asn Asn Thr Arg Lys Ser
275 280 285
Ile His Ile Gly Pro Gly Arg Ala Phe Tyr Ala Thr Gly Asp Ile Ile
290 295 300
Gly Asp Ile Arg Gln Ala His Cys Asn Ile Ser Ala Thr Lys Trp Asn
305 310 315 320
Asp Thr Leu Arg Gln Ile Val Glu Lys Leu Gln Gly Ser Phe Lys Asn
325 330 335
Lys Thr Ile Ser Phe Lys Arg Ser Ser Gly Gly Asp Pro Glu Ile Val
340 345 350
Met His Ser Phe Asn Cys Gly Gly Glu Phe Phe Tyr Cys Asn Ser Thr
355 360 365
Lys Leu Phe Asn Ser Thr Trp Tyr Ser Asn Gly Thr Ser Thr Phe Asp
370 375 380
Asn Thr Thr Glu Arg Thr Asn Asp Thr Ile Ile Leu Pro Cys Arg Ile
385 390 395 400
Lys Gln Ile Ile Asn Met Trp Gln Glu Val Gly Lys Ala Met Tyr Ala
405 410 415
Pro Pro Ile Pro Gly Leu Ile Asn Cys Ser Ser Asn Ile Thr Gly Leu
420 425 430
Leu Leu Ile Arg Asp Gly Gly Asn Asn Tyr Thr Asp Asn Thr Glu Ile
435 440 445
Phe Arg Pro Gly Gly Gly Asp Met Arg Asp Asn Trp Arg Ser Glu Leu
450 455 460
Tyr Lys Tyr Lys Val Val Lys Val Glu Pro Leu Gly Ile Ala Pro Thr
465 470 475 480
Lys Ala Lys Arg Arg Val Val Gln Arg Glu
485 490
<210> 57
<211> 1655
<212> DNA
<213> Human immunodeficiency virus type 1
<400> 57
agaaagagca gaagacagtg gcaatgagag tgaaggagat caggaagaat tgtcagcgct 60
tgtggacatg gggcaccatg ctccttggga tgttgatgat ctgtagtact gcagaacaac 120
tgtgggtcac agtctattat ggggtacctg tgtggaaaga agcaactacc actttatttt 180
gtgcatcaga tgctaaagca tatgacacag aagcacataa tgtttgggcc acacatgcct 240
gtgtacccac ggaccctaac ccacaagaag tagtaataaa tgtgacagaa aattttaaca 300
tgtggaaaaa tgacatggta gaacagatgc atgaggacat aatcagtgta tgggaccaga 360
gtctaaagcc atgtgtaaaa ttaaccccac tctgtgttac tttaaattgc actaattgga 420
atggtactaa taccaataat gctaatacta ccagtagtcc taatattacc agtactacta 480
ctgccaatat ttatgagaaa agaatggaag aaggagaaat acaaaactgc actttcaatg 540
tcaccacaag cataagggat aaggtaaaag aagaatatgc acttttttat agatctgatg 600
taggccaaat aggtaataat agtaataata catatacatt gataaattgt aattcctcag 660
tcattacaca ggctcgtcca aagatatcct ttgaaccaat tcccatacat tattgtaccc 720
cggctggttt tgcgattcta aaatgcaata ataagacctt caatggaaca ggaccatgta 780
acaatgtcag cacagtacag tgtacacatg gtattaggcc agtagtatca actcaattgt 840
tgttaaatgg cagtctagca gaagatgaga taatgattag atctgcaaat ctctcggaca 900
atactaaaaa cataatagta cagctgaata aatctgtaga aattaattgt acaagaccca 960
acaataatac aagaaaaagt ataaatatag gaccagggag agcattttat gcaacaggag 1020
atataatagg aaacataaga catgcatatt gtaccattaa tgaaacaaaa tggaatgaaa 1080
ctttaagaca gatagctaca aaattacaca aacaatttaa taaaacaata atctttgagc 1140
agtcctcagg aggagaccca gaaattacaa cgcacagttt taattgtgga ggggaatttt 1200
tctactgtaa tacaacaccg ctgtttaata gcacttgggt taagactcag aatgatacta 1260
tagggtctaa gactcagaat gctactacag ggttaaatgg cactatcata ctcccatgca 1320
gaataaaaca aatcataaac agatggcagg aagtaggaag agcaatgtat gcccctccca 1380
tcaaaggaat aattagatgt tcatcaaata ttacagggct gatattgaca agagatggtg 1440
gtggtaatga gagcagggaa aatgagacct tcagacctgg aggaggagat atgagggaca 1500
attggagaag tgaattatat aaatataaag tagtaaaaat tgagccaata ggactggcac 1560
ccaccaaggc aaagagaaga gtggtgcaga gagaaaaaag agcggtaacg ctgggagcta 1620
tgttccctgg gttcttggga gcagcaggaa gcact 1655
<210> 58
<211> 499
<212> PRT
<213> Human immunodeficiency virus type 1
<400> 58
Ser Thr Ala Glu Gln Leu Trp Val Thr Val Tyr Tyr Gly Val Pro Val
1 5 10 15
Trp Lys Glu Ala Thr Thr Thr Leu Phe Cys Ala Ser Asp Ala Lys Ala
20 25 30
Tyr Asp Thr Glu Ala His Asn Val Trp Ala Thr His Ala Cys Val Pro
35 40 45
Thr Asp Pro Asn Pro Gln Glu Val Val Ile Asn Val Thr Glu Asn Phe
50 55 60
Asn Met Trp Lys Asn Asp Met Val Glu Gln Met His Glu Asp Ile Ile
65 70 75 80
Ser Val Trp Asp Gln Ser Leu Lys Pro Cys Val Lys Leu Thr Pro Leu
85 90 95
Cys Val Thr Leu Asn Cys Thr Asn Trp Asn Gly Thr Asn Thr Asn Asn
100 105 110
Ala Asn Thr Thr Ser Ser Pro Asn Ile Thr Ser Thr Thr Thr Ala Asn
115 120 125
Ile Tyr Glu Lys Arg Met Glu Glu Gly Glu Ile Gln Asn Cys Thr Phe
130 135 140
Asn Val Thr Thr Ser Ile Arg Asp Lys Val Lys Glu Glu Tyr Ala Leu
145 150 155 160
Phe Tyr Arg Ser Asp Val Gly Gln Ile Gly Asn Asn Ser Asn Asn Thr
165 170 175
Tyr Thr Leu Ile Asn Cys Asn Ser Ser Val Ile Thr Gln Ala Arg Pro
180 185 190
Lys Ile Ser Phe Glu Pro Ile Pro Ile His Tyr Cys Thr Pro Ala Gly
195 200 205
Phe Ala Ile Leu Lys Cys Asn Asn Lys Thr Phe Asn Gly Thr Gly Pro
210 215 220
Cys Asn Asn Val Ser Thr Val Gln Cys Thr His Gly Ile Arg Pro Val
225 230 235 240
Val Ser Thr Gln Leu Leu Leu Asn Gly Ser Leu Ala Glu Asp Glu Ile
245 250 255
Met Ile Arg Ser Ala Asn Leu Ser Asp Asn Thr Lys Asn Ile Ile Val
260 265 270
Gln Leu Asn Lys Ser Val Glu Ile Asn Cys Thr Arg Pro Asn Asn Asn
275 280 285
Thr Arg Lys Ser Ile Asn Ile Gly Pro Gly Arg Ala Phe Tyr Ala Thr
290 295 300
Gly Asp Ile Ile Gly Asn Ile Arg His Ala Tyr Cys Thr Ile Asn Glu
305 310 315 320
Thr Lys Trp Asn Glu Thr Leu Arg Gln Ile Ala Thr Lys Leu His Lys
325 330 335
Gln Phe Asn Lys Thr Ile Ile Phe Glu Gln Ser Ser Gly Gly Asp Pro
340 345 350
Glu Ile Thr Thr His Ser Phe Asn Cys Gly Gly Glu Phe Phe Tyr Cys
355 360 365
Asn Thr Thr Pro Leu Phe Asn Ser Thr Trp Val Lys Thr Gln Asn Asp
370 375 380
Thr Ile Gly Ser Lys Thr Gln Asn Ala Thr Thr Gly Leu Asn Gly Thr
385 390 395 400
Ile Ile Leu Pro Cys Arg Ile Lys Gln Ile Ile Asn Arg Trp Gln Glu
405 410 415
Val Gly Arg Ala Met Tyr Ala Pro Pro Ile Lys Gly Ile Ile Arg Cys
420 425 430
Ser Ser Asn Ile Thr Gly Leu Ile Leu Thr Arg Asp Gly Gly Gly Asn
435 440 445
Glu Ser Arg Glu Asn Glu Thr Phe Arg Pro Gly Gly Gly Asp Met Arg
450 455 460
Asp Asn Trp Arg Ser Glu Leu Tyr Lys Tyr Lys Val Val Lys Ile Glu
465 470 475 480
Pro Ile Gly Leu Ala Pro Thr Lys Ala Lys Arg Arg Val Val Gln Arg
485 490 495
Glu Lys Arg
<210> 59
<211> 1637
<212> DNA
<213> Human immunodeficiency virus type 1
<400> 59
agaaagagca gaagacagtg gcaatgagag tgaaggggat caggaagaat tatcagcact 60
tgtggacatg gagcaacatg ctcacgatgc tccttgggat gttaatgatc tgtagtgctg 120
cagatcaatt gtgggtcaca gtctattatg gggtacctgt gtggaaagaa acaaccacca 180
ctctattttg tgcatcagat gctaaagcat atgataaaga ggtacataat gtttgggcca 240
cacatgcctg tgtacccaca gaccccaacc cacaagaagt aatattggaa aatgtgacag 300
aaaattttaa cgcgtggaaa aatgacatgg tagaacagat gcatgaggat ataattagtt 360
tatgggatca aagcttaaag ccatgtgtaa aattaacccc actctgtgtt actttaaatt 420
gcactgatgc taatattact aataccaatg ataatgagcc caatagtagt gtggtgaaac 480
tgatagagaa aggagaaata aaaaactgct ctttcaatat caccacaagc ataagagata 540
agatgcagaa agcatatgca cttttttata aacttgatgt agaaccaata gagaataata 600
ctactagcta taggttgata agttgtaaca cctcagtcat tacacaagcc tgtccaaagg 660
tatcctttga gccaattccc atacattttt gtgccccggc tggttttgcg attctaaagt 720
gtaacaataa gacattcgag ggaaaaggac catgtaaaaa tatcagcaca gtacaatgta 780
cacatggaat taggccagta gtatcaactc aattgctgtt aaatggcagt ctggcagaag 840
aagagatagt gattagatct gacaattttt caaacagtgc taaaaccata atagtacagt 900
tgaatgcatc tgtagaaatt aatcgtacaa gacccaacaa caatacgaga aaaggtatag 960
ttataggacc agggagaaaa gttattgcaa cagaaaaaat aataggagat gtaagacaag 1020
cacattgtaa cattagtata acaaaatgga ataatacttt aggccatata gttaataaat 1080
taagaaaaca atttggggaa aataaaacaa tagtctttaa gcaacactca ggaggggatc 1140
cagaagttat aatgcataat tttaattgtg caggggaatt tttctactgt aatacaacag 1200
gactgtttaa tagcacttgg cattggaatg gtacttggag tggtactgaa aggagaaata 1260
gcactgaagg aaatgacaca cttacactcc catgcagaat aaaacaaatt ataaacatgt 1320
ggcaggaagt aggaaaagca atgtatgccc ctcccgttaa cggacagatt agatgtttat 1380
caaatattac aggactgcta ttaacaagag atggtggtaa taacaataac acaaacgaca 1440
ccgaaacctt cagacctgaa ggaggagata tgagggacaa ttggagaagt gaattatata 1500
aatataaagt agtaagaatt gaaccattag gagtagcacc caccaaggca aagagaagag 1560
tggtgcagag agaaaaaaga gcactgggag taggagcagc tttgttcctt gggttcttgg 1620
gagcagcagg aagcact 1637
<210> 60
<211> 489
<212> PRT
<213> Human immunodeficiency virus type 1
<400> 60
Ser Ala Ala Asp Gln Leu Trp Val Thr Val Tyr Tyr Gly Val Pro Val
1 5 10 15
Trp Lys Glu Thr Thr Thr Thr Leu Phe Cys Ala Ser Asp Ala Lys Ala
20 25 30
Tyr Asp Lys Glu Val His Asn Val Trp Ala Thr His Ala Cys Val Pro
35 40 45
Thr Asp Pro Asn Pro Gln Glu Val Ile Leu Glu Asn Val Thr Glu Asn
50 55 60
Phe Asn Ala Trp Lys Asn Asp Met Val Glu Gln Met His Glu Asp Ile
65 70 75 80
Ile Ser Leu Trp Asp Gln Ser Leu Lys Pro Cys Val Lys Leu Thr Pro
85 90 95
Leu Cys Val Thr Leu Asn Cys Thr Asp Ala Asn Ile Thr Asn Thr Asn
100 105 110
Asp Asn Glu Pro Asn Ser Ser Val Val Lys Leu Ile Glu Lys Gly Glu
115 120 125
Ile Lys Asn Cys Ser Phe Asn Ile Thr Thr Ser Ile Arg Asp Lys Met
130 135 140
Gln Lys Ala Tyr Ala Leu Phe Tyr Lys Leu Asp Val Glu Pro Ile Glu
145 150 155 160
Asn Asn Thr Thr Ser Tyr Arg Leu Ile Ser Cys Asn Thr Ser Val Ile
165 170 175
Thr Gln Ala Cys Pro Lys Val Ser Phe Glu Pro Ile Pro Ile His Phe
180 185 190
Cys Ala Pro Ala Gly Phe Ala Ile Leu Lys Cys Asn Asn Lys Thr Phe
195 200 205
Glu Gly Lys Gly Pro Cys Lys Asn Ile Ser Thr Val Gln Cys Thr His
210 215 220
Gly Ile Arg Pro Val Val Ser Thr Gln Leu Leu Leu Asn Gly Ser Leu
225 230 235 240
Ala Glu Glu Glu Ile Val Ile Arg Ser Asp Asn Phe Ser Asn Ser Ala
245 250 255
Lys Thr Ile Ile Val Gln Leu Asn Ala Ser Val Glu Ile Asn Arg Thr
260 265 270
Arg Pro Asn Asn Asn Thr Arg Lys Gly Ile Val Ile Gly Pro Gly Arg
275 280 285
Lys Val Ile Ala Thr Glu Lys Ile Ile Gly Asp Val Arg Gln Ala His
290 295 300
Cys Asn Ile Ser Ile Thr Lys Trp Asn Asn Thr Leu Gly His Ile Val
305 310 315 320
Asn Lys Leu Arg Lys Gln Phe Gly Glu Asn Lys Thr Ile Val Phe Lys
325 330 335
Gln His Ser Gly Gly Asp Pro Glu Val Ile Met His Asn Phe Asn Cys
340 345 350
Ala Gly Glu Phe Phe Tyr Cys Asn Thr Thr Gly Leu Phe Asn Ser Thr
355 360 365
Trp His Trp Asn Gly Thr Trp Ser Gly Thr Glu Arg Arg Asn Ser Thr
370 375 380
Glu Gly Asn Asp Thr Leu Thr Leu Pro Cys Arg Ile Lys Gln Ile Ile
385 390 395 400
Asn Met Trp Gln Glu Val Gly Lys Ala Met Tyr Ala Pro Pro Val Asn
405 410 415
Gly Gln Ile Arg Cys Leu Ser Asn Ile Thr Gly Leu Leu Leu Thr Arg
420 425 430
Asp Gly Gly Asn Asn Asn Asn Thr Asn Asp Thr Glu Thr Phe Arg Pro
435 440 445
Glu Gly Gly Asp Met Arg Asp Asn Trp Arg Ser Glu Leu Tyr Lys Tyr
450 455 460
Lys Val Val Arg Ile Glu Pro Leu Gly Val Ala Pro Thr Lys Ala Lys
465 470 475 480
Arg Arg Val Val Gln Arg Glu Lys Arg
485
<210> 61
<211> 1678
<212> DNA
<213> Human immunodeficiency virus type 1
<400> 61
agaaagagca gaagacagtg gaatgagagt gaaggggatc aggaagaatt atcagcactt 60
gtggatatgg ggcatcttgc tccctgggat gttgatgatc tgtagtgctg cagacaagtt 120
gtgggtcaca gtctattatg gggtacctgt gtggaaagaa gcaaccacca ctctattttg 180
tgcatcagat gctaaagcat atagtgcaga ggtacataat gtctgggcca cacatgcctg 240
tgtacccaca gaccccgacc cacaggaaat agtattggaa aatgtaacag aaaattttaa 300
catgtggaaa aataacatgg tagaacagat gcaggaggat ataatcagtt tatgggatca 360
aagcctaaag ccatgtgtaa aattaacccc tctctgtgtc actttaaacc gcactgatga 420
attgcggact actaataaga ctactaataa gaccaatgat acagagacga atactactaa 480
taccaccagc tgggaaaaag gggaaatgaa aaactgctct tttgatgtca ccacaaacat 540
aagagataag tggcagagag aatatgcact tttttataag cttgatgtag taccaataga 600
taatgatggt aatggtaata gtagtaataa tgccactgat aataataata ctaccaaata 660
taccaactat aggttgataa gttgtaacac ctcagttatt acacaggcct gtccaaaggt 720
atcctttgag ccaattccca tacattattg tgccccagct ggttttgcga ttctaaagtg 780
taaagatgag aagttcagtg gaacaggacc atgtaaaaat gtcagcacag tacaatgcac 840
acatggaatt aggccagtag tatcaactca actgctgttg aatggcagtc tagcaaaaga 900
agagataata attagatctg aaaatctcac gaacaatgct aaaaccataa tagtacagct 960
gaatgaatct gtatcaatta attgtataag acccaacaat aatacaagaa gaggtatacc 1020
tataggacca gggcaagcat tttatgcaac aggggatata ataggggata taagaaaagc 1080
acattgtata gttaacagta cacaatggaa taacacttta gcacaggtag ccataaaatt 1140
aaatgaacac tttccaaata aaacaatagt ctttaagcag tcctcaggag gggacccaga 1200
aattgtaatg cacagtttta attgtggagg ggaatttttc tactgtgatt caacaccact 1260
gtttaacaat acttggaatg aaacacattt taataatact tgggatagta ttgaaaaggg 1320
aaaaatcata ctccaatgca gaataaaaca aattataaat atgtggcagg aagtaggaaa 1380
agcaatgtat gcccctccca tcagagggct gattaactgt acatcaaaca ttacagggct 1440
actattaaca agagatggtg gcaagaaaga gaatgagagt gatactatcg agatcttcag 1500
acctggagga ggagacatga ggaacaattg gagaagtgaa ttatataaat ataaagtagt 1560
aaaaattgaa ccattaggag tagcacccac caaggcaaaa agaagagtgg tgcagagaga 1620
aaaaagagca gcgctaggag ctttgttcct tgggttcttg ggagcagcag gaagcact 1678
<210> 62
<211> 508
<212> PRT
<213> Human immunodeficiency virus type 1
<400> 62
Ser Ala Ala Asp Lys Leu Trp Val Thr Val Tyr Tyr Gly Val Pro Val
1 5 10 15
Trp Lys Glu Ala Thr Thr Thr Leu Phe Cys Ala Ser Asp Ala Lys Ala
20 25 30
Tyr Ser Ala Glu Val His Asn Val Trp Ala Thr His Ala Cys Val Pro
35 40 45
Thr Asp Pro Asp Pro Gln Glu Ile Val Leu Glu Asn Val Thr Glu Asn
50 55 60
Phe Asn Met Trp Lys Asn Asn Met Val Glu Gln Met Gln Glu Asp Ile
65 70 75 80
Ile Ser Leu Trp Asp Gln Ser Leu Lys Pro Cys Val Lys Leu Thr Pro
85 90 95
Leu Cys Val Thr Leu Asn Arg Thr Asp Glu Leu Arg Thr Thr Asn Lys
100 105 110
Thr Thr Asn Lys Thr Asn Asp Thr Glu Thr Asn Thr Thr Asn Thr Thr
115 120 125
Ser Trp Glu Lys Gly Glu Met Lys Asn Cys Ser Phe Asp Val Thr Thr
130 135 140
Asn Ile Arg Asp Lys Trp Gln Arg Glu Tyr Ala Leu Phe Tyr Lys Leu
145 150 155 160
Asp Val Val Pro Ile Asp Asn Asp Gly Asn Gly Asn Ser Ser Asn Asn
165 170 175
Ala Thr Asp Asn Asn Asn Thr Thr Lys Tyr Thr Asn Tyr Arg Leu Ile
180 185 190
Ser Cys Asn Thr Ser Val Ile Thr Gln Ala Cys Pro Lys Val Ser Phe
195 200 205
Glu Pro Ile Pro Ile His Tyr Cys Ala Pro Ala Gly Phe Ala Ile Leu
210 215 220
Lys Cys Lys Asp Glu Lys Phe Ser Gly Thr Gly Pro Cys Lys Asn Val
225 230 235 240
Ser Thr Val Gln Cys Thr His Gly Ile Arg Pro Val Val Ser Thr Gln
245 250 255
Leu Leu Leu Asn Gly Ser Leu Ala Lys Glu Glu Ile Ile Ile Arg Ser
260 265 270
Glu Asn Leu Thr Asn Asn Ala Lys Thr Ile Ile Val Gln Leu Asn Glu
275 280 285
Ser Val Ser Ile Asn Cys Ile Arg Pro Asn Asn Asn Thr Arg Arg Gly
290 295 300
Ile Pro Ile Gly Pro Gly Gln Ala Phe Tyr Ala Thr Gly Asp Ile Ile
305 310 315 320
Gly Asp Ile Arg Lys Ala His Cys Ile Val Asn Ser Thr Gln Trp Asn
325 330 335
Asn Thr Leu Ala Gln Val Ala Ile Lys Leu Asn Glu His Phe Pro Asn
340 345 350
Lys Thr Ile Val Phe Lys Gln Ser Ser Gly Gly Asp Pro Glu Ile Val
355 360 365
Met His Ser Phe Asn Cys Gly Gly Glu Phe Phe Tyr Cys Asp Ser Thr
370 375 380
Pro Leu Phe Asn Asn Thr Trp Asn Glu Thr His Phe Asn Asn Thr Trp
385 390 395 400
Asp Ser Ile Glu Lys Gly Lys Ile Ile Leu Gln Cys Arg Ile Lys Gln
405 410 415
Ile Ile Asn Met Trp Gln Glu Val Gly Lys Ala Met Tyr Ala Pro Pro
420 425 430
Ile Arg Gly Leu Ile Asn Cys Thr Ser Asn Ile Thr Gly Leu Leu Leu
435 440 445
Thr Arg Asp Gly Gly Lys Lys Glu Asn Glu Ser Asp Thr Ile Glu Ile
450 455 460
Phe Arg Pro Gly Gly Gly Asp Met Arg Asn Asn Trp Arg Ser Glu Leu
465 470 475 480
Tyr Lys Tyr Lys Val Val Lys Ile Glu Pro Leu Gly Val Ala Pro Thr
485 490 495
Lys Ala Lys Arg Arg Val Val Gln Arg Glu Lys Arg
500 505
<210> 63
<211> 1595
<212> DNA
<213> Human immunodeficiency virus type 1
<400> 63
agaaagagca gaagacagtg gcaatgagag tgaaggagat catgaagaac tatcagaact 60
tatggagagg gggcatgatg ctccttggga tattcatgat ctgtagtgct acagaacaat 120
tgtgggtcac agtctattat ggggtacctg tgtggaaaga agcaaatacc actctatttt 180
gtgcatcaga tgctaaagca tataagacag aggtacataa tgtttgggcc acacatgcct 240
gtgtacccac agaccccaac ccacaagaag tattattgcc aaatatgaca gaagatttta 300
acatgtggaa aaataacatg gtagaacaga tgcatgagga tataatcagt ttgtgggatc 360
aaagcctaaa gccatgtgta aaattaaccc cactctgtgt tactttaaaa tgcactgact 420
tgaatactac taatactatc aatagtagtg acttgatgga gaagggagaa ataaagaact 480
gctctttcaa tatcaccaca aacataagag ataagatgca gaaagactat gcgctttttt 540
atagacttga tgtagtacca atagataatg ataatactag ctataggttg ataagttgta 600
acacctcagt cattacacag gcctgcccaa aggtatcttt tgagccaatt cccatacatt 660
gttgtgcccc ggctggtttt gcgattctaa agtgtaaaga taagaatttc aatggaacag 720
gaacatgtaa aaatgtcagc acagtacagt gtacacatgg aattagacca gtagtatcaa 780
ctcaactgtt gttaaatggt agtctggcag aagaagaggt agtaattaga tctgccaatt 840
tcagtgacaa tgctaaaaac ataatagtac agctgaacga aactgtagaa attaattgta 900
caagacccaa caacaataca atgaaaagca tacatatagg actagggaga gcattttata 960
caacaggaca aataatagga gatataagaa aagcacattg tagcattaat atgacaaaat 1020
ggaataacac cttaatacag gtagctaaaa agttaggaga acaatttaag aataaaacaa 1080
tagtctttaa ccaatcctca ggaggggaca cagaaattgt aatgcacagc tttaattgtg 1140
gaggggaatt tttctactgc aatacaacac aactgtttaa tggtagttgg aatccaaatg 1200
gtacttggaa ttatgctggg gggtcaaacg acactatcac actcccatgc agaataaaac 1260
aaattataaa tatgtggcag gaagtaggaa aagcaatgta tgcccctccc gtcaaaggac 1320
aaatcagatg tgtatcaaac attacagggt tgatattaac aagagatggt ggtaatggtg 1380
gtaatggcac agacaacacc accgagatct ttaggcctgc aggaggaaat atgaaggaca 1440
attggagaag tgaattatat aaatataaag tagtaagaat tgaaccatta ggagtagcac 1500
ccactaaggc aaagagaaga gtggtacaaa gagaaaaaag agcagtggga atgggagctc 1560
tgttccttgg gttcttggga gcagcaggaa gcact 1595
<210> 64
<211> 479
<212> PRT
<213> Human immunodeficiency virus type 1
<400> 64
Ser Ala Thr Glu Gln Leu Trp Val Thr Val Tyr Tyr Gly Val Pro Val
1 5 10 15
Trp Lys Glu Ala Asn Thr Thr Leu Phe Cys Ala Ser Asp Ala Lys Ala
20 25 30
Tyr Lys Thr Glu Val His Asn Val Trp Ala Thr His Ala Cys Val Pro
35 40 45
Thr Asp Pro Asn Pro Gln Glu Val Leu Leu Pro Asn Met Thr Glu Asp
50 55 60
Phe Asn Met Trp Lys Asn Asn Met Val Glu Gln Met His Glu Asp Ile
65 70 75 80
Ile Ser Leu Trp Asp Gln Ser Leu Lys Pro Cys Val Lys Leu Thr Pro
85 90 95
Leu Cys Val Thr Leu Lys Cys Thr Asp Leu Asn Thr Thr Asn Thr Ile
100 105 110
Asn Ser Ser Asp Leu Met Glu Lys Gly Glu Ile Lys Asn Cys Ser Phe
115 120 125
Asn Ile Thr Thr Asn Ile Arg Asp Lys Met Gln Lys Asp Tyr Ala Leu
130 135 140
Phe Tyr Arg Leu Asp Val Val Pro Ile Asp Asn Asp Asn Thr Ser Tyr
145 150 155 160
Arg Leu Ile Ser Cys Asn Thr Ser Val Ile Thr Gln Ala Cys Pro Lys
165 170 175
Val Ser Phe Glu Pro Ile Pro Ile His Cys Cys Ala Pro Ala Gly Phe
180 185 190
Ala Ile Leu Lys Cys Lys Asp Lys Asn Phe Asn Gly Thr Gly Thr Cys
195 200 205
Lys Asn Val Ser Thr Val Gln Cys Thr His Gly Ile Arg Pro Val Val
210 215 220
Ser Thr Gln Leu Leu Leu Asn Gly Ser Leu Ala Glu Glu Glu Val Val
225 230 235 240
Ile Arg Ser Ala Asn Phe Ser Asp Asn Ala Lys Asn Ile Ile Val Gln
245 250 255
Leu Asn Glu Thr Val Glu Ile Asn Cys Thr Arg Pro Asn Asn Asn Thr
260 265 270
Met Lys Ser Ile His Ile Gly Leu Gly Arg Ala Phe Tyr Thr Thr Gly
275 280 285
Gln Ile Ile Gly Asp Ile Arg Lys Ala His Cys Ser Ile Asn Met Thr
290 295 300
Lys Trp Asn Asn Thr Leu Ile Gln Val Ala Lys Lys Leu Gly Glu Gln
305 310 315 320
Phe Lys Asn Lys Thr Ile Val Phe Asn Gln Ser Ser Gly Gly Asp Thr
325 330 335
Glu Ile Val Met His Ser Phe Asn Cys Gly Gly Glu Phe Phe Tyr Cys
340 345 350
Asn Thr Thr Gln Leu Phe Asn Gly Ser Trp Asn Pro Asn Gly Thr Trp
355 360 365
Asn Tyr Ala Gly Gly Ser Asn Asp Thr Ile Thr Leu Pro Cys Arg Ile
370 375 380
Lys Gln Ile Ile Asn Met Trp Gln Glu Val Gly Lys Ala Met Tyr Ala
385 390 395 400
Pro Pro Val Lys Gly Gln Ile Arg Cys Val Ser Asn Ile Thr Gly Leu
405 410 415
Ile Leu Thr Arg Asp Gly Gly Asn Gly Gly Asn Gly Thr Asp Asn Thr
420 425 430
Thr Glu Ile Phe Arg Pro Ala Gly Gly Asn Met Lys Asp Asn Trp Arg
435 440 445
Ser Glu Leu Tyr Lys Tyr Lys Val Val Arg Ile Glu Pro Leu Gly Val
450 455 460
Ala Pro Thr Lys Ala Lys Arg Arg Val Val Gln Arg Glu Lys Arg
465 470 475
<210> 65
<211> 1592
<212> DNA
<213> Human immunodeficiency virus type 1
<400> 65
agaaagagca gaagacagtg gcaatgagag tgaaggggat caggaggaat tgtcagcact 60
tatggagatg gggcaccatg ctccttggga tgttaatgat ctgtagtgct acagaacaat 120
tgtgggtcac agtctattat ggggtacctg tgtggaaaga agcaactacc accctatttt 180
gtgcatcaga tgctaaagca tatgatacag agagacataa tgtttgggcc acacatgcct 240
gtgtacccac agacccctgc ccacaagaag taggattggg aaatgtgaca gagtatttta 300
acatgtggaa aaataacatg gtagaacaga tgcatgagga tataatcagt ttatgggatc 360
aaagcctaaa gccatgtgta aaattaacac cactctgtgt tactttaaat tgtaatgcgg 420
gaaagtttaa ttatacgaat aatactgata cactgaaaga agaagtagga gaaataaaaa 480
actgctcttt caatatcacc acaagcataa gagataaggt aaagaaagaa tatgcatttt 540
ttaataaact tgatgtagta ccaatagata atgagaatga tagctatagg ttgataagtt 600
gtaacacctc agtcattact caggcctgtc caaaggtatc atttgagcca attcctatac 660
attattgtgc cccagctggt tttgcgattc taaggtgtaa taataagaca ttcaatggga 720
caggaccatg tacaaatgtc agtacagtac aatgtacaca tggaattagg ccagtagtgt 780
caacccaact gctgttaaat ggcagtctag cagaagagga ggtaatgatt aggtctgaga 840
acttcacgaa caatgctaaa accataatag tacagctgaa tgaatctgta gtaattaatt 900
gtacaagacc caacaacaat acaagaaaaa gtatacacat aggaccaggg agagcatttt 960
atacaacagg agagataata ggagatataa gaaaagcaca ttgtaacatt agtaaagcaa 1020
aatgggatag cactttaaaa caagtagtta caaaattaag agaacaatat ggaaataaaa 1080
caatagcctt taagaactcc tcaggagggg acccagaaat tgtaatgcac agttttaatt 1140
gtggagggga atttttctac tgtaatacaa caaagctatt taatagtact tggaatagga 1200
cagaggtaga tactattgaa ggaaatacca ctataaatat cacactccca tgtagaataa 1260
aacaaattat aaacatgtgg caggaagtag gaaaagcaat gtatgcccct cccatcagag 1320
gaccaattag ctgcacatca aatattacag ggctgctgtt aataagagat ggtggtacag 1380
acaatagcac gaacgacacc gagatcttca gacctggagg aggagatatg agggacaatt 1440
ggagaagtga attatacaaa tataaagtag taaaaattga accattagga atagcaccca 1500
ccaaggcaaa gagaagagtg gtgcagagag aaaaaagagc aataggaata ggagctgtgt 1560
tccttgggtt cttgggagca gcaggaagca ct 1592
<210> 66
<211> 478
<212> PRT
<213> Human immunodeficiency virus type 1
<400> 66
Ser Ala Thr Glu Gln Leu Trp Val Thr Val Tyr Tyr Gly Val Pro Val
1 5 10 15
Trp Lys Glu Ala Thr Thr Thr Leu Phe Cys Ala Ser Asp Ala Lys Ala
20 25 30
Tyr Asp Thr Glu Arg His Asn Val Trp Ala Thr His Ala Cys Val Pro
35 40 45
Thr Asp Pro Cys Pro Gln Glu Val Gly Leu Gly Asn Val Thr Glu Tyr
50 55 60
Phe Asn Met Trp Lys Asn Asn Met Val Glu Gln Met His Glu Asp Ile
65 70 75 80
Ile Ser Leu Trp Asp Gln Ser Leu Lys Pro Cys Val Lys Leu Thr Pro
85 90 95
Leu Cys Val Thr Leu Asn Cys Asn Ala Gly Lys Phe Asn Tyr Thr Asn
100 105 110
Asn Thr Asp Thr Leu Lys Glu Glu Val Gly Glu Ile Lys Asn Cys Ser
115 120 125
Phe Asn Ile Thr Thr Ser Ile Arg Asp Lys Val Lys Lys Glu Tyr Ala
130 135 140
Phe Phe Asn Lys Leu Asp Val Val Pro Ile Asp Asn Glu Asn Asp Ser
145 150 155 160
Tyr Arg Leu Ile Ser Cys Asn Thr Ser Val Ile Thr Gln Ala Cys Pro
165 170 175
Lys Val Ser Phe Glu Pro Ile Pro Ile His Tyr Cys Ala Pro Ala Gly
180 185 190
Phe Ala Ile Leu Arg Cys Asn Asn Lys Thr Phe Asn Gly Thr Gly Pro
195 200 205
Cys Thr Asn Val Ser Thr Val Gln Cys Thr His Gly Ile Arg Pro Val
210 215 220
Val Ser Thr Gln Leu Leu Leu Asn Gly Ser Leu Ala Glu Glu Glu Val
225 230 235 240
Met Ile Arg Ser Glu Asn Phe Thr Asn Asn Ala Lys Thr Ile Ile Val
245 250 255
Gln Leu Asn Glu Ser Val Val Ile Asn Cys Thr Arg Pro Asn Asn Asn
260 265 270
Thr Arg Lys Ser Ile His Ile Gly Pro Gly Arg Ala Phe Tyr Thr Thr
275 280 285
Gly Glu Ile Ile Gly Asp Ile Arg Lys Ala His Cys Asn Ile Ser Lys
290 295 300
Ala Lys Trp Asp Ser Thr Leu Lys Gln Val Val Thr Lys Leu Arg Glu
305 310 315 320
Gln Tyr Gly Asn Lys Thr Ile Ala Phe Lys Asn Ser Ser Gly Gly Asp
325 330 335
Pro Glu Ile Val Met His Ser Phe Asn Cys Gly Gly Glu Phe Phe Tyr
340 345 350
Cys Asn Thr Thr Lys Leu Phe Asn Ser Thr Trp Asn Arg Thr Glu Val
355 360 365
Asp Thr Ile Glu Gly Asn Thr Thr Ile Asn Ile Thr Leu Pro Cys Arg
370 375 380
Ile Lys Gln Ile Ile Asn Met Trp Gln Glu Val Gly Lys Ala Met Tyr
385 390 395 400
Ala Pro Pro Ile Arg Gly Pro Ile Ser Cys Thr Ser Asn Ile Thr Gly
405 410 415
Leu Leu Leu Ile Arg Asp Gly Gly Thr Asp Asn Ser Thr Asn Asp Thr
420 425 430
Glu Ile Phe Arg Pro Gly Gly Gly Asp Met Arg Asp Asn Trp Arg Ser
435 440 445
Glu Leu Tyr Lys Tyr Lys Val Val Lys Ile Glu Pro Leu Gly Ile Ala
450 455 460
Pro Thr Lys Ala Lys Arg Arg Val Val Gln Arg Glu Lys Arg
465 470 475
<210> 67
<211> 1586
<212> DNA
<213> Human immunodeficiency virus type 1
<400> 67
agaaagagca gaagacagtg gcaatgaaag tgacggggat catgaagaat tatcagcact 60
tatggagatg gggcatcatg ctccttggga tgttgatgat ctatagtact gcagaacaac 120
aattgtgggt cacagtctat tatggggtac ctgtgtggaa agaagcaact actactctat 180
tctgtgcatc agatgctaaa gcatatgata cagaggtaca taatgtttgg gccacacatg 240
cctgtgtacc cacagacccc aacccacaag aagtagtatt ggggaatgtg acagaaaatt 300
ttaacatgtg gaaaaataac atggtagaac agatgcatga ggatataatc agtttatggg 360
atcaaagcct aaaaccatgt gtaaaactaa ccccactctg tgttacttta aattgcactg 420
actgggataa aacgaattgc actaatgggg gagatattac tgctaatgag gaaaaaggag 480
aactaaaaaa ttgctctttc aatatcacca caaacataag agataagata cggaaagaat 540
atgcactttt ttataaattg gatgtagtac caatagataa tgataatact agttataggt 600
tgataaattg taacacctca gtcattacac aagcctgtcc aaaggtatcc tttgagccaa 660
ttcccataca ttattgtgcc ccggctggtt ttgcgattct aaagtgtaac aataagacgt 720
tcaatggaaa aggaccatgt aaaaatgtca gcacagtaca atgcacacat ggaattaggc 780
cagtagtgtc aactcaactg ctgttaaatg gcagtttagc agaagaagag gtagtagtta 840
gatctgccaa tttctcggac agtgccaaaa ccatcatagt acaactaaat gaatctgtag 900
taattaattg tacaagaccc aacaacaata caagaaaaag catacatata ggaccaggga 960
gagcatttta tgcaacagga gaaataatag gagatataag acaagcacat tgtaatctta 1020
gtctaacaaa atggaatcaa actttatatc aggtagttag aaaattaaaa gaacaattta 1080
agaataaaac aatagccttt aatcactcct caggagggga cccagaaatt gtaatgcaca 1140
gttttaattg tggaggagaa tttttctact gtaatacaac acaattattt aatagtactt 1200
ggtatactaa tggtacttgg agtgatactg gaagtaatga cacagtcaca ctcccatgca 1260
gaataaaaca aattataaac aggtggcaag aagtaggaaa agcaatgtat gcccctccca 1320
tcaaaggaca aattagatgc tcatcaaata ttacagggct gttattaaca agagatggtg 1380
gtagtagcaa aaacgagacc gaggtcttca gacctggagg aggagatatg agggacaatt 1440
ggagaagtga attatataaa tataaagtag taaaaattga accattagga gtagcaccca 1500
ccagggcaaa gagaagagtg gtgcagagag aaaaaagagg aataggagct gtgttccttg 1560
ggttcttggg agcagcagga agcact 1586
<210> 68
<211> 478
<212> PRT
<213> Human immunodeficiency virus type 1
<400> 68
Ser Thr Ala Glu Gln Gln Leu Trp Val Thr Val Tyr Tyr Gly Val Pro
1 5 10 15
Val Trp Lys Glu Ala Thr Thr Thr Leu Phe Cys Ala Ser Asp Ala Lys
20 25 30
Ala Tyr Asp Thr Glu Val His Asn Val Trp Ala Thr His Ala Cys Val
35 40 45
Pro Thr Asp Pro Asn Pro Gln Glu Val Val Leu Gly Asn Val Thr Glu
50 55 60
Asn Phe Asn Met Trp Lys Asn Asn Met Val Glu Gln Met His Glu Asp
65 70 75 80
Ile Ile Ser Leu Trp Asp Gln Ser Leu Lys Pro Cys Val Lys Leu Thr
85 90 95
Pro Leu Cys Val Thr Leu Asn Cys Thr Asp Trp Asp Lys Thr Asn Cys
100 105 110
Thr Asn Gly Gly Asp Ile Thr Ala Asn Glu Glu Lys Gly Glu Leu Lys
115 120 125
Asn Cys Ser Phe Asn Ile Thr Thr Asn Ile Arg Asp Lys Ile Arg Lys
130 135 140
Glu Tyr Ala Leu Phe Tyr Lys Leu Asp Val Val Pro Ile Asp Asn Asp
145 150 155 160
Asn Thr Ser Tyr Arg Leu Ile Asn Cys Asn Thr Ser Val Ile Thr Gln
165 170 175
Ala Cys Pro Lys Val Ser Phe Glu Pro Ile Pro Ile His Tyr Cys Ala
180 185 190
Pro Ala Gly Phe Ala Ile Leu Lys Cys Asn Asn Lys Thr Phe Asn Gly
195 200 205
Lys Gly Pro Cys Lys Asn Val Ser Thr Val Gln Cys Thr His Gly Ile
210 215 220
Arg Pro Val Val Ser Thr Gln Leu Leu Leu Asn Gly Ser Leu Ala Glu
225 230 235 240
Glu Glu Val Val Val Arg Ser Ala Asn Phe Ser Asp Ser Ala Lys Thr
245 250 255
Ile Ile Val Gln Leu Asn Glu Ser Val Val Ile Asn Cys Thr Arg Pro
260 265 270
Asn Asn Asn Thr Arg Lys Ser Ile His Ile Gly Pro Gly Arg Ala Phe
275 280 285
Tyr Ala Thr Gly Glu Ile Ile Gly Asp Ile Arg Gln Ala His Cys Asn
290 295 300
Leu Ser Leu Thr Lys Trp Asn Gln Thr Leu Tyr Gln Val Val Arg Lys
305 310 315 320
Leu Lys Glu Gln Phe Lys Asn Lys Thr Ile Ala Phe Asn His Ser Ser
325 330 335
Gly Gly Asp Pro Glu Ile Val Met His Ser Phe Asn Cys Gly Gly Glu
340 345 350
Phe Phe Tyr Cys Asn Thr Thr Gln Leu Phe Asn Ser Thr Trp Tyr Thr
355 360 365
Asn Gly Thr Trp Ser Asp Thr Gly Ser Asn Asp Thr Val Thr Leu Pro
370 375 380
Cys Arg Ile Lys Gln Ile Ile Asn Arg Trp Gln Glu Val Gly Lys Ala
385 390 395 400
Met Tyr Ala Pro Pro Ile Lys Gly Gln Ile Arg Cys Ser Ser Asn Ile
405 410 415
Thr Gly Leu Leu Leu Thr Arg Asp Gly Gly Ser Ser Lys Asn Glu Thr
420 425 430
Glu Val Phe Arg Pro Gly Gly Gly Asp Met Arg Asp Asn Trp Arg Ser
435 440 445
Glu Leu Tyr Lys Tyr Lys Val Val Lys Ile Glu Pro Leu Gly Val Ala
450 455 460
Pro Thr Arg Ala Lys Arg Arg Val Val Gln Arg Glu Lys Arg
465 470 475
<210> 69
<211> 1628
<212> DNA
<213> Human immunodeficiency virus type 1
<400> 69
agaaagagca gaagacagtg gcaatgagag cgaaggggac caagaagaat tggcagcagc 60
acttatggaa atggggcacg atgctccttg ggatgttaat gatctgtagt gctgcggaac 120
aatggtgggt cacagtctat tatggagtac ctgtgtggaa ggacgcaaat accactctat 180
tttgtgcatc agatgctaaa gcatatgata cagaggcaca taatgtctgg gccacacatg 240
cctgtgtacc cacagatccc aacccacaag aaatagtatt ggaaaatgtg acagaagatt 300
ttaacatgtg gaaaaataac atggcagacc agatgcatga ggatataatc agtttatggg 360
atcaaagcct aaagccatgt gtaaaattaa ctccgctctg tgttacttta aattgtactg 420
attggaatgg taatactact agcaatagta ctataaacaa caatactagt actaaggcag 480
aaatgaaaaa atgctctttt aatatcacca caagcataag agataaagtg acaaaggaat 540
atgcactgtt ttatagagtt gatgtagtac caatagataa agaaaataat aataccaatt 600
ataccaatta tagattaata aattgtaaca cctcagtcat tacacaagcc tgtccaaaga 660
catcctttga gccaattcct atacattatt gtgccccagc tggttttgca attctaaagt 720
gtaacaataa gacattcaca ggaaaaggac tctgtacaag ggttagcaca ttacaatgta 780
cacatggaat tagaccagta gtgtcaactc aactgctgtt aaatggcagt ctagcagaag 840
aggaggtagt aattagatgt gaaaatatca cagacaatgc taaaaccata atagtacagc 900
tgaatgaatc tgtagcaatt aattgtacaa gacctaataa caatacaaga aaaagtatac 960
ctataggacc agggagagca ttttatgcaa caggagatat agtaggaaat ataagacaag 1020
cacattgtaa ccttagtgga acagagtggg aaaaaacttt agggaaaata gttggggaat 1080
taagaaaaaa ttttgagaat aagacaataa tttttaatca atcctcagga ggggacccag 1140
aaattgtatc gcaccttttt aattgtggag gagaattttt ctattgtaac tcaacacaac 1200
tgtttaatag tacttggaat actactgcaa aaattgatgg ttctgggaat gttactggaa 1260
aggtaaatag cactatcaca ctccaatgta aaataagaca aattgtaaac ctgtggcagg 1320
aagtaggaaa agcaatgtat gcccctccca tcagtggaat aatttactgt tcatcaaata 1380
ttacagggct gatactgatg agagatggtg gtaatgatag tagcacgaat ggaaacgaga 1440
ccttcagacc tggaggggga aatatgaaag ataattggag aagtgaatta tataaatata 1500
aagtagtaaa aattgaacca ttaggactag cacccaccaa ggcaaagagg agagtggtgc 1560
aaagagaaaa aagagcagta ggaataggag ctatgttcct tgggttcttg ggagcagcag 1620
gaagcact 1628
<210> 70
<211> 489
<212> PRT
<213> Human immunodeficiency virus type 1
<400> 70
Ser Ala Ala Glu Gln Trp Trp Val Thr Val Tyr Tyr Gly Val Pro Val
1 5 10 15
Trp Lys Asp Ala Asn Thr Thr Leu Phe Cys Ala Ser Asp Ala Lys Ala
20 25 30
Tyr Asp Thr Glu Ala His Asn Val Trp Ala Thr His Ala Cys Val Pro
35 40 45
Thr Asp Pro Asn Pro Gln Glu Ile Val Leu Glu Asn Val Thr Glu Asp
50 55 60
Phe Asn Met Trp Lys Asn Asn Met Ala Asp Gln Met His Glu Asp Ile
65 70 75 80
Ile Ser Leu Trp Asp Gln Ser Leu Lys Pro Cys Val Lys Leu Thr Pro
85 90 95
Leu Cys Val Thr Leu Asn Cys Thr Asp Trp Asn Gly Asn Thr Thr Ser
100 105 110
Asn Ser Thr Ile Asn Asn Asn Thr Ser Thr Lys Ala Glu Met Lys Lys
115 120 125
Cys Ser Phe Asn Ile Thr Thr Ser Ile Arg Asp Lys Val Thr Lys Glu
130 135 140
Tyr Ala Leu Phe Tyr Arg Val Asp Val Val Pro Ile Asp Lys Glu Asn
145 150 155 160
Asn Asn Thr Asn Tyr Thr Asn Tyr Arg Leu Ile Asn Cys Asn Thr Ser
165 170 175
Val Ile Thr Gln Ala Cys Pro Lys Thr Ser Phe Glu Pro Ile Pro Ile
180 185 190
His Tyr Cys Ala Pro Ala Gly Phe Ala Ile Leu Lys Cys Asn Asn Lys
195 200 205
Thr Phe Thr Gly Lys Gly Leu Cys Thr Arg Val Ser Thr Leu Gln Cys
210 215 220
Thr His Gly Ile Arg Pro Val Val Ser Thr Gln Leu Leu Leu Asn Gly
225 230 235 240
Ser Leu Ala Glu Glu Glu Val Val Ile Arg Cys Glu Asn Ile Thr Asp
245 250 255
Asn Ala Lys Thr Ile Ile Val Gln Leu Asn Glu Ser Val Ala Ile Asn
260 265 270
Cys Thr Arg Pro Asn Asn Asn Thr Arg Lys Ser Ile Pro Ile Gly Pro
275 280 285
Gly Arg Ala Phe Tyr Ala Thr Gly Asp Ile Val Gly Asn Ile Arg Gln
290 295 300
Ala His Cys Asn Leu Ser Gly Thr Glu Trp Glu Lys Thr Leu Gly Lys
305 310 315 320
Ile Val Gly Glu Leu Arg Lys Asn Phe Glu Asn Lys Thr Ile Ile Phe
325 330 335
Asn Gln Ser Ser Gly Gly Asp Pro Glu Ile Val Ser His Leu Phe Asn
340 345 350
Cys Gly Gly Glu Phe Phe Tyr Cys Asn Ser Thr Gln Leu Phe Asn Ser
355 360 365
Thr Trp Asn Thr Thr Ala Lys Ile Asp Gly Ser Gly Asn Val Thr Gly
370 375 380
Lys Val Asn Ser Thr Ile Thr Leu Gln Cys Lys Ile Arg Gln Ile Val
385 390 395 400
Asn Leu Trp Gln Glu Val Gly Lys Ala Met Tyr Ala Pro Pro Ile Ser
405 410 415
Gly Ile Ile Tyr Cys Ser Ser Asn Ile Thr Gly Leu Ile Leu Met Arg
420 425 430
Asp Gly Gly Asn Asp Ser Ser Thr Asn Gly Asn Glu Thr Phe Arg Pro
435 440 445
Gly Gly Gly Asn Met Lys Asp Asn Trp Arg Ser Glu Leu Tyr Lys Tyr
450 455 460
Lys Val Val Lys Ile Glu Pro Leu Gly Leu Ala Pro Thr Lys Ala Lys
465 470 475 480
Arg Arg Val Val Gln Arg Glu Lys Arg
485
<210> 71
<211> 1622
<212> DNA
<213> Human immunodeficiency virus type 1
<400> 71
agaaagagca gaagacagtg gcaatgagag tgaaggggat caggaagaat tggcggcgct 60
ggtggagatg gggcaccatg ctccttggga tgttaatgat ctgtagtgct acagaacaat 120
tgtgggttac agtctattat ggggtacctg tgtggaaaga agcaaccacc actctatttt 180
gtgcatcaga tgctaaagca tataatacag aggtacgtaa tgtatgggcc acacatgcct 240
gtgtgcccac aggccccaac ccacaagaaa tagtattggt aaatgtgaca gaagatttta 300
acatgtggaa aaatagcatg gtagaacaga tgcatgagga tataatcagt ttatgggatc 360
aaagcctaaa accttgtgta aaattaaccc cactctgtgt tactttaaac tgcactgatt 420
tgaggaatac tactaatagc actaatagtg acggggaaaa gatggagaga ggagaaataa 480
aaaactgctc tttcaatgtt accacaggca taagagataa ggttcagaga gaatatgcac 540
tcttttataa acttgatata gtaccaatag aggaaggtgg ggataatacc agctgtaggg 600
ataataccag ctataggttg ataagttgta atacctcagt cattacacaa gcctgtccaa 660
aggtatcctt tgagccaatt cccatacatt attgtgcccc agctggtttt gcgattctaa 720
agtgtaataa taagacgttc aatggaaaag gaccatgttc aaatgtcagc acagtacaat 780
gtacacatgg aattaggcca gtagtgtcaa ctcaactgct gttaaacggc agtctagcag 840
aaaaagaggt agtaattaga tctgaaaata tcacggacaa tactaaaaac ataatagtac 900
agttaaatga aactgtagaa attaattgta caagacccaa caacaataca agaaaaagta 960
tacatatagg accggggaga gcatttcatg caacaggaga aataatagga aatataagac 1020
aggcatattg taacattagt ggagcaaaat ggaataacac tttaaaacag atagtaaaaa 1080
gattaaaaga aaaatttccg aataagataa tagtctttaa tcactcctca ggaggggacc 1140
cagaaattgt aacacacagt tttaattgtg gaggggaatt tttctactgt aattcaacaa 1200
acctgtttaa tagtaattca acacaactgg ataattggac ttatactgaa gggtcaaatg 1260
acactgttat cacgctccca tgcagaataa aacaaattgt aaatatgtgg caggaagtag 1320
gaaaagcaat gtatgctcct cccatcagag gacaaattag atgttcttca aatattacag 1380
ggctgatatt aacaagagat ggtggtaata agactgacaa tgacaccacc gagaccttca 1440
gacctggagg aggaagcatg agggacaatt ggagaagtga attatataaa tataaaatag 1500
taaaggttga accactagga gtagcaccca ccaaggcaaa gaggagagtg gtgcagagag 1560
aaaaaagaac agtgggactg ggagccttgt ttcttgggtt cttgggagca gcaggaagca 1620
ct 1622
<210> 72
<211> 488
<212> PRT
<213> Human immunodeficiency virus type 1
<400> 72
Ser Ala Thr Glu Gln Leu Trp Val Thr Val Tyr Tyr Gly Val Pro Val
1 5 10 15
Trp Lys Glu Ala Thr Thr Thr Leu Phe Cys Ala Ser Asp Ala Lys Ala
20 25 30
Tyr Asn Thr Glu Val Arg Asn Val Trp Ala Thr His Ala Cys Val Pro
35 40 45
Thr Gly Pro Asn Pro Gln Glu Ile Val Leu Val Asn Val Thr Glu Asp
50 55 60
Phe Asn Met Trp Lys Asn Ser Met Val Glu Gln Met His Glu Asp Ile
65 70 75 80
Ile Ser Leu Trp Asp Gln Ser Leu Lys Pro Cys Val Lys Leu Thr Pro
85 90 95
Leu Cys Val Thr Leu Asn Cys Thr Asp Leu Arg Asn Thr Thr Asn Ser
100 105 110
Thr Asn Ser Asp Gly Glu Lys Met Glu Arg Gly Glu Ile Lys Asn Cys
115 120 125
Ser Phe Asn Val Thr Thr Gly Ile Arg Asp Lys Val Gln Arg Glu Tyr
130 135 140
Ala Leu Phe Tyr Lys Leu Asp Ile Val Pro Ile Glu Glu Gly Gly Asp
145 150 155 160
Asn Thr Ser Cys Arg Asp Asn Thr Ser Tyr Arg Leu Ile Ser Cys Asn
165 170 175
Thr Ser Val Ile Thr Gln Ala Cys Pro Lys Val Ser Phe Glu Pro Ile
180 185 190
Pro Ile His Tyr Cys Ala Pro Ala Gly Phe Ala Ile Leu Lys Cys Asn
195 200 205
Asn Lys Thr Phe Asn Gly Lys Gly Pro Cys Ser Asn Val Ser Thr Val
210 215 220
Gln Cys Thr His Gly Ile Arg Pro Val Val Ser Thr Gln Leu Leu Leu
225 230 235 240
Asn Gly Ser Leu Ala Glu Lys Glu Val Val Ile Arg Ser Glu Asn Ile
245 250 255
Thr Asp Asn Thr Lys Asn Ile Ile Val Gln Leu Asn Glu Thr Val Glu
260 265 270
Ile Asn Cys Thr Arg Pro Asn Asn Asn Thr Arg Lys Ser Ile His Ile
275 280 285
Gly Pro Gly Arg Ala Phe His Ala Thr Gly Glu Ile Ile Gly Asn Ile
290 295 300
Arg Gln Ala Tyr Cys Asn Ile Ser Gly Ala Lys Trp Asn Asn Thr Leu
305 310 315 320
Lys Gln Ile Val Lys Arg Leu Lys Glu Lys Phe Pro Asn Lys Ile Ile
325 330 335
Val Phe Asn His Ser Ser Gly Gly Asp Pro Glu Ile Val Thr His Ser
340 345 350
Phe Asn Cys Gly Gly Glu Phe Phe Tyr Cys Asn Ser Thr Asn Leu Phe
355 360 365
Asn Ser Asn Ser Thr Gln Leu Asp Asn Trp Thr Tyr Thr Glu Gly Ser
370 375 380
Asn Asp Thr Val Ile Thr Leu Pro Cys Arg Ile Lys Gln Ile Val Asn
385 390 395 400
Met Trp Gln Glu Val Gly Lys Ala Met Tyr Ala Pro Pro Ile Arg Gly
405 410 415
Gln Ile Arg Cys Ser Ser Asn Ile Thr Gly Leu Ile Leu Thr Arg Asp
420 425 430
Gly Gly Asn Lys Thr Asp Asn Asp Thr Thr Glu Thr Phe Arg Pro Gly
435 440 445
Gly Gly Ser Met Arg Asp Asn Trp Arg Ser Glu Leu Tyr Lys Tyr Lys
450 455 460
Ile Val Lys Val Glu Pro Leu Gly Val Ala Pro Thr Lys Ala Lys Arg
465 470 475 480
Arg Val Val Gln Arg Glu Lys Arg
485
<210> 73
<211> 1622
<212> DNA
<213> Human immunodeficiency virus type 1
<400> 73
agaaagagca gaagacagtg gcaatgagag tgaaggggat caggaagaat tatcagcact 60
tgtggagagg ggacaacttg tggagagggg gcatcatgct ccttgggata ttgatgatct 120
gtagtgctac agaaaaattg tgggtcacag tctattatgg ggtacctgtg tggaaaaatg 180
caaacaccac tctattttgt gcatcagatg ctaaagcata tgatacagag gtacataatg 240
tttgggccac acacgcctgt gtacccacag accccagccc acgagaatta atattggaaa 300
atgtgacaga agactttgac atatggaaaa ataacatggt agaacagatg caagaggata 360
taatcagttt atgggatcaa agcctaaagc catgtgtaaa attaacccct ctctgtgtta 420
ctttagaatg caagaatgcc actaaaatta gtaatagcag tgaaattgga gaaatgaaaa 480
actgctcttt taatgttacc acagacagga gagataaggt gaaaacagaa tatgcacttt 540
tttataacct tgatataata caaatagagg aggagaatac cagcagctgc agtaatacca 600
gcagctacag gttgataagt tgtaacacct caacccttac acaggcctgt ccaaagatat 660
cctttgagcc aattcccata cattattgtg ccccggctgg ttttgcaatt ctaaagtgta 720
ataataaaac attcgatgga aaaggatcat gtaaaaatgt cagcacagta caatgtacac 780
atggaattaa gccagtagta tcaactcaac tgctgctaaa cggcagtcta gcagaagaag 840
aggtagtaat tagatctgct aatctctcag acaatgctaa aaccataata gtacagctga 900
acatgtctgt acaaattaat tgtacaagac ccaacaacaa tacaagaaga ggtatacatt 960
taggaccagg gagagccttt tatggaacag acataatagg agatataaga caagcacatt 1020
gtaacattag tggaaaacaa tggaattaca ctttacaaca gatagttaaa aaattcagaa 1080
aacaatttga gaatagcaca gtgatcttta acagatcctc aggaggggac ccagaaattg 1140
taatgcacag ttttaattgt ggaggggaat ttttctactg taatacaaca gaactgttta 1200
atagtacttg gaacagtagt catcctttgg atgatacttg gcctcctttg gataatacaa 1260
gtgacactac tatcacactc ccatgcagaa taaaacaaat tataaacatg tggcaggaag 1320
taggaaaagc aatgtatgcc cctcccatca aaggaccaat tagatgtgaa tcaaatatta 1380
cagggctgct attaacaaga gatggtggcg ataccaatac cactaacggg actgagacct 1440
tcagacctgg aggaggagat atgagggaca attggagaag tgaattatat aaatataaag 1500
tagtaagaat taaaccatta ggaatagcac ccaccaaggc acagagaaga gtggtgcaaa 1560
gagaaaaaag agcagcacta ggagctatgt tccttgggtt cttgggagca gcaggaagca 1620
ct 1622
<210> 74
<211> 483
<212> PRT
<213> Human immunodeficiency virus type 1
<400> 74
Ser Ala Thr Glu Lys Leu Trp Val Thr Val Tyr Tyr Gly Val Pro Val
1 5 10 15
Trp Lys Asn Ala Asn Thr Thr Leu Phe Cys Ala Ser Asp Ala Lys Ala
20 25 30
Tyr Asp Thr Glu Val His Asn Val Trp Ala Thr His Ala Cys Val Pro
35 40 45
Thr Asp Pro Ser Pro Arg Glu Leu Ile Leu Glu Asn Val Thr Glu Asp
50 55 60
Phe Asp Ile Trp Lys Asn Asn Met Val Glu Gln Met Gln Glu Asp Ile
65 70 75 80
Ile Ser Leu Trp Asp Gln Ser Leu Lys Pro Cys Val Lys Leu Thr Pro
85 90 95
Leu Cys Val Thr Leu Glu Cys Lys Asn Ala Thr Lys Ile Ser Asn Ser
100 105 110
Ser Glu Ile Gly Glu Met Lys Asn Cys Ser Phe Asn Val Thr Thr Asp
115 120 125
Arg Arg Asp Lys Val Lys Thr Glu Tyr Ala Leu Phe Tyr Asn Leu Asp
130 135 140
Ile Ile Gln Ile Glu Glu Glu Asn Thr Ser Ser Cys Ser Asn Thr Ser
145 150 155 160
Ser Tyr Arg Leu Ile Ser Cys Asn Thr Ser Thr Leu Thr Gln Ala Cys
165 170 175
Pro Lys Ile Ser Phe Glu Pro Ile Pro Ile His Tyr Cys Ala Pro Ala
180 185 190
Gly Phe Ala Ile Leu Lys Cys Asn Asn Lys Thr Phe Asp Gly Lys Gly
195 200 205
Ser Cys Lys Asn Val Ser Thr Val Gln Cys Thr His Gly Ile Lys Pro
210 215 220
Val Val Ser Thr Gln Leu Leu Leu Asn Gly Ser Leu Ala Glu Glu Glu
225 230 235 240
Val Val Ile Arg Ser Ala Asn Leu Ser Asp Asn Ala Lys Thr Ile Ile
245 250 255
Val Gln Leu Asn Met Ser Val Gln Ile Asn Cys Thr Arg Pro Asn Asn
260 265 270
Asn Thr Arg Arg Gly Ile His Leu Gly Pro Gly Arg Ala Phe Tyr Gly
275 280 285
Thr Asp Ile Ile Gly Asp Ile Arg Gln Ala His Cys Asn Ile Ser Gly
290 295 300
Lys Gln Trp Asn Tyr Thr Leu Gln Gln Ile Val Lys Lys Phe Arg Lys
305 310 315 320
Gln Phe Glu Asn Ser Thr Val Ile Phe Asn Arg Ser Ser Gly Gly Asp
325 330 335
Pro Glu Ile Val Met His Ser Phe Asn Cys Gly Gly Glu Phe Phe Tyr
340 345 350
Cys Asn Thr Thr Glu Leu Phe Asn Ser Thr Trp Asn Ser Ser His Pro
355 360 365
Leu Asp Asp Thr Trp Pro Pro Leu Asp Asn Thr Ser Asp Thr Thr Ile
370 375 380
Thr Leu Pro Cys Arg Ile Lys Gln Ile Ile Asn Met Trp Gln Glu Val
385 390 395 400
Gly Lys Ala Met Tyr Ala Pro Pro Ile Lys Gly Pro Ile Arg Cys Glu
405 410 415
Ser Asn Ile Thr Gly Leu Leu Leu Thr Arg Asp Gly Gly Asp Thr Asn
420 425 430
Thr Thr Asn Gly Thr Glu Thr Phe Arg Pro Gly Gly Gly Asp Met Arg
435 440 445
Asp Asn Trp Arg Ser Glu Leu Tyr Lys Tyr Lys Val Val Arg Ile Lys
450 455 460
Pro Leu Gly Ile Ala Pro Thr Lys Ala Gln Arg Arg Val Val Gln Arg
465 470 475 480
Glu Lys Arg
<210> 75
<211> 1574
<212> DNA
<213> Human immunodeficiency virus type 1
<400> 75
agaaagagca gaagacagtg gcaatgacag tgatggggat caggaggaat tatcaatgct 60
tgtggaaatg gggcatgacg ctccttggga tgttgatgat ctgtagtgct gcacaattgt 120
gggtcacagt ctgttatggg gtaccggtgt ggaaagaagc aaccaccact ctattttgtg 180
catcagatgc taaagcatat gacacagagg tacataatgt ttgggccaca catgcctgtg 240
tacccacaga ccccaatcca ctagaattaa aattggataa tgtgacagaa aattttaaca 300
tgtggaaaaa taacatggta gaacaaatgc atgaggatat aatcagttta tgggatcaaa 360
gcctaaagcc atgtgtaaaa ttaaccccac tctgtgttac tttaaattgc actgactact 420
cgaagaatgg tactaataac actgctaata atgaaggaga aatgaaaaac tgctctttta 480
atatcaccac aaacataaga gataagatgc agaatgaata cgcacttttt tataaacatg 540
atatggtatc aatagataat agtagtacta gctataggtt gacaagttgt aacacctcag 600
tcattacaca ggcctgtcca aagataacct ttgaaccaat tcctatacat tattgtaccc 660
cggctggttt tgcgctttta aagtgtaata ataaaacgtt caatggaaca ggaccatgta 720
aaaatgtcag cacagtacaa tgtacacatg gaattaggcc agtagtttca actcaactgc 780
tgttaaatgg cagtctagca gaagaagagg tagtaattag atctgaaaat ttcacgaaca 840
atgcaaaaat cataatagta cagctaaatg aaactataca aattaattgt acaagaccca 900
acaacaatac aagaaaaagt atacatatag caccagggag agcattttat gcaacaggag 960
aaataatagg agatataaga caagcacatt gtaacattag cagagcaaaa tggaacaacg 1020
ctttaaaaca gatagttgaa aaattaagag aacaatttaa gaataaaaca atagaattta 1080
agtcatcctc aggaggggac ccagaaattg taatgcacag tttcaattgt ggaggggaat 1140
ttttctactg taattcaaca aaactgttta atagtatttg gtatccgaat ggtactgaag 1200
ggtcaaataa cactgaagga aatgacccaa tcacactccc atgcagaata agacaaattg 1260
taaacagatg gcaggaagta ggaaaagcaa tgtatgcccc tcccatcagg ggaccaatta 1320
gatgttcatc aaatattaca gggctgctat taacaagaga tggtggtgct aataatactg 1380
ataatgagac cttcagacct ggaggaggag atatgaggga caattggaga agtgaattat 1440
ataaatataa agtagtaaga attgaaccat taggaatagc gcccactacg gcaaggagaa 1500
gagtggtgca aagagaaaaa agggcagcca taggagctat gatccttggg ttcttgggag 1560
cagcaggaag cact 1574
<210> 76
<211> 473
<212> PRT
<213> Human immunodeficiency virus type 1
<400> 76
Ser Ala Ala Gln Leu Trp Val Thr Val Cys Tyr Gly Val Pro Val Trp
1 5 10 15
Lys Glu Ala Thr Thr Thr Leu Phe Cys Ala Ser Asp Ala Lys Ala Tyr
20 25 30
Asp Thr Glu Val His Asn Val Trp Ala Thr His Ala Cys Val Pro Thr
35 40 45
Asp Pro Asn Pro Leu Glu Leu Lys Leu Asp Asn Val Thr Glu Asn Phe
50 55 60
Asn Met Trp Lys Asn Asn Met Val Glu Gln Met His Glu Asp Ile Ile
65 70 75 80
Ser Leu Trp Asp Gln Ser Leu Lys Pro Cys Val Lys Leu Thr Pro Leu
85 90 95
Cys Val Thr Leu Asn Cys Thr Asp Tyr Ser Lys Asn Gly Thr Asn Asn
100 105 110
Thr Ala Asn Asn Glu Gly Glu Met Lys Asn Cys Ser Phe Asn Ile Thr
115 120 125
Thr Asn Ile Arg Asp Lys Met Gln Asn Glu Tyr Ala Leu Phe Tyr Lys
130 135 140
His Asp Met Val Ser Ile Asp Asn Ser Ser Thr Ser Tyr Arg Leu Thr
145 150 155 160
Ser Cys Asn Thr Ser Val Ile Thr Gln Ala Cys Pro Lys Ile Thr Phe
165 170 175
Glu Pro Ile Pro Ile His Tyr Cys Thr Pro Ala Gly Phe Ala Leu Leu
180 185 190
Lys Cys Asn Asn Lys Thr Phe Asn Gly Thr Gly Pro Cys Lys Asn Val
195 200 205
Ser Thr Val Gln Cys Thr His Gly Ile Arg Pro Val Val Ser Thr Gln
210 215 220
Leu Leu Leu Asn Gly Ser Leu Ala Glu Glu Glu Val Val Ile Arg Ser
225 230 235 240
Glu Asn Phe Thr Asn Asn Ala Lys Ile Ile Ile Val Gln Leu Asn Glu
245 250 255
Thr Ile Gln Ile Asn Cys Thr Arg Pro Asn Asn Asn Thr Arg Lys Ser
260 265 270
Ile His Ile Ala Pro Gly Arg Ala Phe Tyr Ala Thr Gly Glu Ile Ile
275 280 285
Gly Asp Ile Arg Gln Ala His Cys Asn Ile Ser Arg Ala Lys Trp Asn
290 295 300
Asn Ala Leu Lys Gln Ile Val Glu Lys Leu Arg Glu Gln Phe Lys Asn
305 310 315 320
Lys Thr Ile Glu Phe Lys Ser Ser Ser Gly Gly Asp Pro Glu Ile Val
325 330 335
Met His Ser Phe Asn Cys Gly Gly Glu Phe Phe Tyr Cys Asn Ser Thr
340 345 350
Lys Leu Phe Asn Ser Ile Trp Tyr Pro Asn Gly Thr Glu Gly Ser Asn
355 360 365
Asn Thr Glu Gly Asn Asp Pro Ile Thr Leu Pro Cys Arg Ile Arg Gln
370 375 380
Ile Val Asn Arg Trp Gln Glu Val Gly Lys Ala Met Tyr Ala Pro Pro
385 390 395 400
Ile Arg Gly Pro Ile Arg Cys Ser Ser Asn Ile Thr Gly Leu Leu Leu
405 410 415
Thr Arg Asp Gly Gly Ala Asn Asn Thr Asp Asn Glu Thr Phe Arg Pro
420 425 430
Gly Gly Gly Asp Met Arg Asp Asn Trp Arg Ser Glu Leu Tyr Lys Tyr
435 440 445
Lys Val Val Arg Ile Glu Pro Leu Gly Ile Ala Pro Thr Thr Ala Arg
450 455 460
Arg Arg Val Val Gln Arg Glu Lys Arg
465 470
<210> 77
<211> 1610
<212> DNA
<213> Human immunodeficiency virus type 1
<400> 77
agaaagagca gaagacagtg gcaatgaaag tgaaggggat caggaagaat tgtcagcgct 60
ggtggatagg gggcatcttg ctccttggaa tgttgatgat ctgtagtgct gcagaacaat 120
tgtgggtcac agtctattat ggggtacctg tgtggaaaga agcaaacatc actctatttt 180
gtgcatcaga tgctaaggga tatgatacag aagcacataa tgtctgggcc acacatgcct 240
gtgtacccac agaccccaac ccacaagagg tagcattgac aaatgtgaca gaaaacttta 300
acatgtggaa aaataacatg gtagaacaaa tgcatgagga tataattagt ttatgggatc 360
aaagcttaaa gccatgtgta aaattaaccc cactctgtgt tactttagat tgcaatgata 420
ctaatataaa tgtaactagc aaaaatgaga caatgatgga gcaaggagaa gcaaaaaact 480
gctctttcaa tatcaccaca aatttaagag ataaggtgca gaaagaatat tcagtttttt 540
ataaacttga tgtagtacca atagaagagg agaaaaataa tagtattaac aatagatata 600
ggttgataag ttgtaacacc tcagtcatta cacaagcctg tccaaagata tcctttgaac 660
caattcccat acattattgt gccccggctg gttttgcgat tctgaagtgt aacaataaga 720
cattcagtgg aaaaggacca tgcacaaatg tcagcacagt acaatgcaca catggaatta 780
ggccagtagt atcaactcaa ctgctgttaa atggcagcct agcagaagga gagatagtaa 840
ttagatctga caatttcaca gacaatacaa aaaccataat agtacagctg aatacatctg 900
tagcaattaa ttgtacaaga cccaacaaca atacaagaag aagtataact ataggaccag 960
ggagagcatt ttatgcaaca gacataatag gagatataag acaagcacat tgtaacatta 1020
gtagaacaca atggaataac actttaaaac aggtagctag aaaattaagt gaacaattta 1080
atgcaacaat agtttttaat aaatcctcag gaggggaccc agaaattgta atgcacagtt 1140
ttaattgtgg aggggaattt ttctactgta atacaacaca actgtttaat agtatttggt 1200
gtcctaataa tactggagag tcaaatagca ctaacaatga gacaatcata ctcccatgca 1260
gattaaaaca atttataaac atgtggcagg aggtaggaaa agcaatgtat gcccctccca 1320
tcagaggata cattaactgt tcatcaaata ttacggggct gctattaaca agagatggtg 1380
gtggtgataa taaaaacacg agtccagaga acaagacaga gaccttcaga cctggaggag 1440
gaaatatgaa ggacaattgg agaagtgaac tgtataaata taaagtagta agaattgaac 1500
cattaggagt agcacccacc aaggcaaaga gaagagtggt gcagagagaa aaaagagcag 1560
tgggaatagg agctatgttc cttgggttct tgggagcagc aggaagcact 1610
<210> 78
<211> 484
<212> PRT
<213> Human immunodeficiency virus type 1
<400> 78
Ser Ala Ala Glu Gln Leu Trp Val Thr Val Tyr Tyr Gly Val Pro Val
1 5 10 15
Trp Lys Glu Ala Asn Ile Thr Leu Phe Cys Ala Ser Asp Ala Lys Gly
20 25 30
Tyr Asp Thr Glu Ala His Asn Val Trp Ala Thr His Ala Cys Val Pro
35 40 45
Thr Asp Pro Asn Pro Gln Glu Val Ala Leu Thr Asn Val Thr Glu Asn
50 55 60
Phe Asn Met Trp Lys Asn Asn Met Val Glu Gln Met His Glu Asp Ile
65 70 75 80
Ile Ser Leu Trp Asp Gln Ser Leu Lys Pro Cys Val Lys Leu Thr Pro
85 90 95
Leu Cys Val Thr Leu Asp Cys Asn Asp Thr Asn Ile Asn Val Thr Ser
100 105 110
Lys Asn Glu Thr Met Met Glu Gln Gly Glu Ala Lys Asn Cys Ser Phe
115 120 125
Asn Ile Thr Thr Asn Leu Arg Asp Lys Val Gln Lys Glu Tyr Ser Val
130 135 140
Phe Tyr Lys Leu Asp Val Val Pro Ile Glu Glu Glu Lys Asn Asn Ser
145 150 155 160
Ile Asn Asn Arg Tyr Arg Leu Ile Ser Cys Asn Thr Ser Val Ile Thr
165 170 175
Gln Ala Cys Pro Lys Ile Ser Phe Glu Pro Ile Pro Ile His Tyr Cys
180 185 190
Ala Pro Ala Gly Phe Ala Ile Leu Lys Cys Asn Asn Lys Thr Phe Ser
195 200 205
Gly Lys Gly Pro Cys Thr Asn Val Ser Thr Val Gln Cys Thr His Gly
210 215 220
Ile Arg Pro Val Val Ser Thr Gln Leu Leu Leu Asn Gly Ser Leu Ala
225 230 235 240
Glu Gly Glu Ile Val Ile Arg Ser Asp Asn Phe Thr Asp Asn Thr Lys
245 250 255
Thr Ile Ile Val Gln Leu Asn Thr Ser Val Ala Ile Asn Cys Thr Arg
260 265 270
Pro Asn Asn Asn Thr Arg Arg Ser Ile Thr Ile Gly Pro Gly Arg Ala
275 280 285
Phe Tyr Ala Thr Asp Ile Ile Gly Asp Ile Arg Gln Ala His Cys Asn
290 295 300
Ile Ser Arg Thr Gln Trp Asn Asn Thr Leu Lys Gln Val Ala Arg Lys
305 310 315 320
Leu Ser Glu Gln Phe Asn Ala Thr Ile Val Phe Asn Lys Ser Ser Gly
325 330 335
Gly Asp Pro Glu Ile Val Met His Ser Phe Asn Cys Gly Gly Glu Phe
340 345 350
Phe Tyr Cys Asn Thr Thr Gln Leu Phe Asn Ser Ile Trp Cys Pro Asn
355 360 365
Asn Thr Gly Glu Ser Asn Ser Thr Asn Asn Glu Thr Ile Ile Leu Pro
370 375 380
Cys Arg Leu Lys Gln Phe Ile Asn Met Trp Gln Glu Val Gly Lys Ala
385 390 395 400
Met Tyr Ala Pro Pro Ile Arg Gly Tyr Ile Asn Cys Ser Ser Asn Ile
405 410 415
Thr Gly Leu Leu Leu Thr Arg Asp Gly Gly Gly Asp Asn Lys Asn Thr
420 425 430
Ser Pro Glu Asn Lys Thr Glu Thr Phe Arg Pro Gly Gly Gly Asn Met
435 440 445
Lys Asp Asn Trp Arg Ser Glu Leu Tyr Lys Tyr Lys Val Val Arg Ile
450 455 460
Glu Pro Leu Gly Val Ala Pro Thr Lys Ala Lys Arg Arg Val Val Gln
465 470 475 480
Arg Glu Lys Arg
<210> 79
<211> 1607
<212> DNA
<213> Human immunodeficiency virus type 1
<400> 79
agaaagagca gaagacagtg gcaatgaaag cgaaggggac caggaagaat tatcagcact 60
tgtggagatg gggtaccatg ctccttggga tgttgatgat ctgtagtgct acagagcaat 120
tgtgggtaac agtctattat ggggtacctg tgtggaaaga agcaaaaacc actctatttt 180
gtgcatcaga tgctaaagca tatgatacag agatgcataa tgtttgggcc acacatgcct 240
gtgtacccac agaccccaac ccacaagaaa tagtattgga aaatgtgaca gaaaatttta 300
acatgtggaa aaataacatg gtagatcaga tgcaggagga tgtaatcagt ctatgggatc 360
aaagcctaaa gccatgtgta aaattaaccc cactctgtgt tactttaaat tgcaatgata 420
cattgaggaa tgataatagc actaagaata atagtagtac tggttgggaa aagatggaga 480
aaggagaaat aaaaaattgc tctttcagtg ccaccacaac cgtgaaagat aagacacaga 540
aacaatatgc acttttttat aatcttgata tagtacatac aaatgatggt ggtaatagta 600
gctatatgtt aagaagttgt aacacctcag tcattacaca ggcctgtcca aaggtatcat 660
ttgagccaat tcccatacat tattgtgccc cggctggttt tgcgattcta aagtgtaatg 720
ataagaagtt caatgggaca ggaccatgta aaaatgtcag cacagtacaa tgtacacatg 780
gaattaggcc agtagtgtca actcaactgc tattaaatgg cagtctagca gaagaagagg 840
tagtaattag atctagcaat ttaacggaca atactaaaac cataatagta cagctgaagg 900
aatctgtaaa aattaattgt acaagaccca acaacaatac aagaaaaagt atatctatag 960
gaccagggag cgcattttat gcaacaggag acataatagg agatataaga caagcacatt 1020
gcaaccttag taaaacagaa tggggggaaa ctttaagaca gatagctaca aaattaagag 1080
aacaatttaa taataaaaca ataatcttta atagctcctc aggaggggac ccagaaattg 1140
taatgcacag ttttaattgt ggaggggaat ttttctactg taatacaaca aaactgttta 1200
atggtacttg gaatggtact tggaagaatg gtacttggag tacaaatgat actgaaaatg 1260
atactatcat actcccatgt agaataaaac aaattataaa catgtggcag gaagtaggaa 1320
aagcaatgtg tgcccctccc atcagaggac aaattaactg ctcatcacag attacagggc 1380
tgctattaac aagagatggt ggagataacc ctaaaaatga aaccttcaga cctggaggag 1440
gagatatgag ggacaattgg agaagtgaat tatataaata taaagtagta aaaattgaac 1500
cattaggaat agcacctacc agggcaaaga gaagagtggt gcagagagaa aaaagagcag 1560
caataggagc tatgatcctt gggttcttgg gagcagcagg aagcact 1607
<210> 80
<211> 484
<212> PRT
<213> Human immunodeficiency virus type 1
<400> 80
Ser Ala Thr Glu Gln Leu Trp Val Thr Val Tyr Tyr Gly Val Pro Val
1 5 10 15
Trp Lys Glu Ala Lys Thr Thr Leu Phe Cys Ala Ser Asp Ala Lys Ala
20 25 30
Tyr Asp Thr Glu Met His Asn Val Trp Ala Thr His Ala Cys Val Pro
35 40 45
Thr Asp Pro Asn Pro Gln Glu Ile Val Leu Glu Asn Val Thr Glu Asn
50 55 60
Phe Asn Met Trp Lys Asn Asn Met Val Asp Gln Met Gln Glu Asp Val
65 70 75 80
Ile Ser Leu Trp Asp Gln Ser Leu Lys Pro Cys Val Lys Leu Thr Pro
85 90 95
Leu Cys Val Thr Leu Asn Cys Asn Asp Thr Leu Arg Asn Asp Asn Ser
100 105 110
Thr Lys Asn Asn Ser Ser Thr Gly Trp Glu Lys Met Glu Lys Gly Glu
115 120 125
Ile Lys Asn Cys Ser Phe Ser Ala Thr Thr Thr Val Lys Asp Lys Thr
130 135 140
Gln Lys Gln Tyr Ala Leu Phe Tyr Asn Leu Asp Ile Val His Thr Asn
145 150 155 160
Asp Gly Gly Asn Ser Ser Tyr Met Leu Arg Ser Cys Asn Thr Ser Val
165 170 175
Ile Thr Gln Ala Cys Pro Lys Val Ser Phe Glu Pro Ile Pro Ile His
180 185 190
Tyr Cys Ala Pro Ala Gly Phe Ala Ile Leu Lys Cys Asn Asp Lys Lys
195 200 205
Phe Asn Gly Thr Gly Pro Cys Lys Asn Val Ser Thr Val Gln Cys Thr
210 215 220
His Gly Ile Arg Pro Val Val Ser Thr Gln Leu Leu Leu Asn Gly Ser
225 230 235 240
Leu Ala Glu Glu Glu Val Val Ile Arg Ser Ser Asn Leu Thr Asp Asn
245 250 255
Thr Lys Thr Ile Ile Val Gln Leu Lys Glu Ser Val Lys Ile Asn Cys
260 265 270
Thr Arg Pro Asn Asn Asn Thr Arg Lys Ser Ile Ser Ile Gly Pro Gly
275 280 285
Ser Ala Phe Tyr Ala Thr Gly Asp Ile Ile Gly Asp Ile Arg Gln Ala
290 295 300
His Cys Asn Leu Ser Lys Thr Glu Trp Gly Glu Thr Leu Arg Gln Ile
305 310 315 320
Ala Thr Lys Leu Arg Glu Gln Phe Asn Asn Lys Thr Ile Ile Phe Asn
325 330 335
Ser Ser Ser Gly Gly Asp Pro Glu Ile Val Met His Ser Phe Asn Cys
340 345 350
Gly Gly Glu Phe Phe Tyr Cys Asn Thr Thr Lys Leu Phe Asn Gly Thr
355 360 365
Trp Asn Gly Thr Trp Lys Asn Gly Thr Trp Ser Thr Asn Asp Thr Glu
370 375 380
Asn Asp Thr Ile Ile Leu Pro Cys Arg Ile Lys Gln Ile Ile Asn Met
385 390 395 400
Trp Gln Glu Val Gly Lys Ala Met Cys Ala Pro Pro Ile Arg Gly Gln
405 410 415
Ile Asn Cys Ser Ser Gln Ile Thr Gly Leu Leu Leu Thr Arg Asp Gly
420 425 430
Gly Asp Asn Pro Lys Asn Glu Thr Phe Arg Pro Gly Gly Gly Asp Met
435 440 445
Arg Asp Asn Trp Arg Ser Glu Leu Tyr Lys Tyr Lys Val Val Lys Ile
450 455 460
Glu Pro Leu Gly Ile Ala Pro Thr Arg Ala Lys Arg Arg Val Val Gln
465 470 475 480
Arg Glu Lys Arg
<210> 81
<211> 1580
<212> DNA
<213> Human immunodeficiency virus type 1
<400> 81
agaaagagca gaagacagtg gcaatgagag tgagggggat tatgaggaat tatcagcact 60
tgtggagatg gggcatgacg ctccttggga tgttaatgat cagtagtgct aatgaacaat 120
tgtgggtcac agtctgttat ggggtacctg tgtggaaaga agcaaccact actttatttt 180
gtgcatcaga tgctaaagca tatgctgcag agaaacataa tgtttgggcc acacatgcct 240
gtgtacccac agaccccaac ccacaagaag tagtaataaa tgtgacagaa aattttaaca 300
tgtggaaaaa taacatggta gagcagatgc atgaagatgt aactagttta tgggaccaaa 360
gcctaaagcc atgtgtaaaa ttaacccctc tctgtgttac tttaaattgc actgactatg 420
aggggaataa tatcactagt gggaataaga caggagaaat aaaaaactgc tctttcgaga 480
tcaccacaaa catgagagat aagatacaga aaacatatgc acttttttat agacttgatg 540
tagaaccaat aaatgatgat aatgttactt ataggttgat aagctgtaat acctcagtca 600
ttacacaagc ctgtccaaag gtaacctttg agccaattcc catacattat tgtgccccgg 660
ctggctttgc gattgtaaag tgtaacaata aaacgttcaa tggaacagga ccatgtaaaa 720
atgttagcac agtacaatgt acacatggaa ttaggccagt agtatcaact caactgctgt 780
taaatggcag tctagcggaa gaagagacaa tgattagatc tgagaatttc tcggacaatg 840
ctaaaatcat aatagtacag ctgaataaat ctgtaaaaat taattgtaca agacccaaca 900
ataatacaat aaaaggtata catataggac cagggagagc attttataca acaggacaaa 960
taataggaga cataagacaa gcatattgta ccattaataa aacagaatgg aataacactt 1020
tatcacagat agctaaaaaa ttaagtagac aatttgagaa taaaacaata gcctttaggc 1080
caccctcagg aggggaccca gaaattgtaa tgcacagttt taattgtgga ggggaatttt 1140
tctattgtaa tacaacacaa ctgtttaata gtaattggac tactaatgga gagtcaaatt 1200
acacaacggg aaacaatgag acaattatca cactcccatg cagaataaaa caatttataa 1260
acatgtggca ggaagtagga aaagcaatgt atgcccctcc cattagtgga ataattaatt 1320
gcttatcaaa tattacaggg ctgctattaa caagagatgg tggtaatagt agcagcgcca 1380
acagcaccga gatcttcaga cctggaggag gggatatgag ggataattgg agaagtgaac 1440
tatataaata taaagtagta caaattgaac cattaggatt agcacccacc aaggcaaaga 1500
gaagagtggt gcagagagaa aaaagagcag tgggactagg agctgtgttc cttgggttct 1560
tgggagcagc aggaagcact 1580
<210> 82
<211> 474
<212> PRT
<213> Human immunodeficiency virus type 1
<400> 82
Ser Ala Asn Glu Gln Leu Trp Val Thr Val Cys Tyr Gly Val Pro Val
1 5 10 15
Trp Lys Glu Ala Thr Thr Thr Leu Phe Cys Ala Ser Asp Ala Lys Ala
20 25 30
Tyr Ala Ala Glu Lys His Asn Val Trp Ala Thr His Ala Cys Val Pro
35 40 45
Thr Asp Pro Asn Pro Gln Glu Val Val Ile Asn Val Thr Glu Asn Phe
50 55 60
Asn Met Trp Lys Asn Asn Met Val Glu Gln Met His Glu Asp Val Thr
65 70 75 80
Ser Leu Trp Asp Gln Ser Leu Lys Pro Cys Val Lys Leu Thr Pro Leu
85 90 95
Cys Val Thr Leu Asn Cys Thr Asp Tyr Glu Gly Asn Asn Ile Thr Ser
100 105 110
Gly Asn Lys Thr Gly Glu Ile Lys Asn Cys Ser Phe Glu Ile Thr Thr
115 120 125
Asn Met Arg Asp Lys Ile Gln Lys Thr Tyr Ala Leu Phe Tyr Arg Leu
130 135 140
Asp Val Glu Pro Ile Asn Asp Asp Asn Val Thr Tyr Arg Leu Ile Ser
145 150 155 160
Cys Asn Thr Ser Val Ile Thr Gln Ala Cys Pro Lys Val Thr Phe Glu
165 170 175
Pro Ile Pro Ile His Tyr Cys Ala Pro Ala Gly Phe Ala Ile Val Lys
180 185 190
Cys Asn Asn Lys Thr Phe Asn Gly Thr Gly Pro Cys Lys Asn Val Ser
195 200 205
Thr Val Gln Cys Thr His Gly Ile Arg Pro Val Val Ser Thr Gln Leu
210 215 220
Leu Leu Asn Gly Ser Leu Ala Glu Glu Glu Thr Met Ile Arg Ser Glu
225 230 235 240
Asn Phe Ser Asp Asn Ala Lys Ile Ile Ile Val Gln Leu Asn Lys Ser
245 250 255
Val Lys Ile Asn Cys Thr Arg Pro Asn Asn Asn Thr Ile Lys Gly Ile
260 265 270
His Ile Gly Pro Gly Arg Ala Phe Tyr Thr Thr Gly Gln Ile Ile Gly
275 280 285
Asp Ile Arg Gln Ala Tyr Cys Thr Ile Asn Lys Thr Glu Trp Asn Asn
290 295 300
Thr Leu Ser Gln Ile Ala Lys Lys Leu Ser Arg Gln Phe Glu Asn Lys
305 310 315 320
Thr Ile Ala Phe Arg Pro Pro Ser Gly Gly Asp Pro Glu Ile Val Met
325 330 335
His Ser Phe Asn Cys Gly Gly Glu Phe Phe Tyr Cys Asn Thr Thr Gln
340 345 350
Leu Phe Asn Ser Asn Trp Thr Thr Asn Gly Glu Ser Asn Tyr Thr Thr
355 360 365
Gly Asn Asn Glu Thr Ile Ile Thr Leu Pro Cys Arg Ile Lys Gln Phe
370 375 380
Ile Asn Met Trp Gln Glu Val Gly Lys Ala Met Tyr Ala Pro Pro Ile
385 390 395 400
Ser Gly Ile Ile Asn Cys Leu Ser Asn Ile Thr Gly Leu Leu Leu Thr
405 410 415
Arg Asp Gly Gly Asn Ser Ser Ser Ala Asn Ser Thr Glu Ile Phe Arg
420 425 430
Pro Gly Gly Gly Asp Met Arg Asp Asn Trp Arg Ser Glu Leu Tyr Lys
435 440 445
Tyr Lys Val Val Gln Ile Glu Pro Leu Gly Leu Ala Pro Thr Lys Ala
450 455 460
Lys Arg Arg Val Val Gln Arg Glu Lys Arg
465 470
<210> 83
<211> 1643
<212> DNA
<213> Human immunodeficiency virus type 1
<400> 83
agaaagagca gaagacagtg gcaatgagag tgaaggagat caggaagaat tgtcagaaat 60
tctggaaatg gggcaccttg ctccttggga tgttaatgat gatctgtaga gctgcagagg 120
attcgtgggt cacagtctat tatggggtac ctgtgtggaa ggaagcaacc accactctat 180
tttgtgcatc agatgctaaa gcatatgaca cagaggtaca taatgtttgg gccacacatg 240
cctgtgtacc cacagaccct aacccacaag aagtagtatt ggaaaatgtg acagaaaatt 300
tcaatgcgtg gaaaaataat atggtagaac agatgcatga ggatataatc agtttatggg 360
atcaaagcct aaagccatgt gtaaaattaa cccctctttg tgttactcta aattgcactg 420
atgtgagaaa taatgctacc aatactacta ataccaatag tactaatacc tatagtacta 480
acatagaaaa gatgaaagaa ggagaaataa aaaactgctc tttcaatacc accccaagca 540
taacagacaa gatgcagaag gcatatgcat tgttttataa gcttgatata gtacagataa 600
ataatgataa taaagataat accagctata gattgataag ttgtaatacc tcagtcatta 660
cacaggcctg tccaaaggca tcctttgagc caattcccat acattattgt gccccagctg 720
gttttgcgat tctaaagtgt aatgataaga agttcaatgg aacaggacca tgcaaaaatg 780
tcagcacagt acaatgtaca catggaatta ggccagtagt atcaacccaa ctgctgttaa 840
atggcagtct agcagaagaa gaggtagtaa ttagatctga aaatttcaca aacaatgcta 900
aaaccataat agtacagctg aatgagactg tacatattaa ttgtacaaga cccaacaaca 960
atacaagaaa aagtataggt ataggaccag ggagaacatt ttttgcaaca ggggaaataa 1020
taggagacat aagacaagca cattgtaaca ttagtagaaa caactggaat aaaactttag 1080
aaagggtagt taaaaaatta agagaacaat ttgggaacaa caaaacaatt gtttttaatc 1140
aatcctcagg aggggaccca gaaattgtga tgcacagttt taattgtaga ggagaatttt 1200
tctactgtaa tacaacacaa ctgtttaata gtacttggaa tgctaatagt acttggaatg 1260
ctaacgagaa tactactgga atgccaagtg acaatatcac actcccctgc agaataaaac 1320
aagttataaa catgtggcag gaagtaggaa aagcaatgta tgcccctccc attaaaggac 1380
caattaagtg ttcatcaaat attacaggac tgctattaac aagggatggt ggtgtgaaca 1440
attctgaaac tgagaccttc agacctggag gaggagatat gaggaacaat tggagatgtg 1500
gattatataa atataaagta gtaaaaattg agccattagg agtagcaccc accagggcaa 1560
agagaagagt ggtgcaaaga gaaaaaagag cagtgggact aggagctatg ttccttgggt 1620
tcttgggagc agcaggaagc act 1643
<210> 84
<211> 494
<212> PRT
<213> Human immunodeficiency virus type 1
<400> 84
Arg Ala Ala Glu Asp Ser Trp Val Thr Val Tyr Tyr Gly Val Pro Val
1 5 10 15
Trp Lys Glu Ala Thr Thr Thr Leu Phe Cys Ala Ser Asp Ala Lys Ala
20 25 30
Tyr Asp Thr Glu Val His Asn Val Trp Ala Thr His Ala Cys Val Pro
35 40 45
Thr Asp Pro Asn Pro Gln Glu Val Val Leu Glu Asn Val Thr Glu Asn
50 55 60
Phe Asn Ala Trp Lys Asn Asn Met Val Glu Gln Met His Glu Asp Ile
65 70 75 80
Ile Ser Leu Trp Asp Gln Ser Leu Lys Pro Cys Val Lys Leu Thr Pro
85 90 95
Leu Cys Val Thr Leu Asn Cys Thr Asp Val Arg Asn Asn Ala Thr Asn
100 105 110
Thr Thr Asn Thr Asn Ser Thr Asn Thr Tyr Ser Thr Asn Ile Glu Lys
115 120 125
Met Lys Glu Gly Glu Ile Lys Asn Cys Ser Phe Asn Thr Thr Pro Ser
130 135 140
Ile Thr Asp Lys Met Gln Lys Ala Tyr Ala Leu Phe Tyr Lys Leu Asp
145 150 155 160
Ile Val Gln Ile Asn Asn Asp Asn Lys Asp Asn Thr Ser Tyr Arg Leu
165 170 175
Ile Ser Cys Asn Thr Ser Val Ile Thr Gln Ala Cys Pro Lys Ala Ser
180 185 190
Phe Glu Pro Ile Pro Ile His Tyr Cys Ala Pro Ala Gly Phe Ala Ile
195 200 205
Leu Lys Cys Asn Asp Lys Lys Phe Asn Gly Thr Gly Pro Cys Lys Asn
210 215 220
Val Ser Thr Val Gln Cys Thr His Gly Ile Arg Pro Val Val Ser Thr
225 230 235 240
Gln Leu Leu Leu Asn Gly Ser Leu Ala Glu Glu Glu Val Val Ile Arg
245 250 255
Ser Glu Asn Phe Thr Asn Asn Ala Lys Thr Ile Ile Val Gln Leu Asn
260 265 270
Glu Thr Val His Ile Asn Cys Thr Arg Pro Asn Asn Asn Thr Arg Lys
275 280 285
Ser Ile Gly Ile Gly Pro Gly Arg Thr Phe Phe Ala Thr Gly Glu Ile
290 295 300
Ile Gly Asp Ile Arg Gln Ala His Cys Asn Ile Ser Arg Asn Asn Trp
305 310 315 320
Asn Lys Thr Leu Glu Arg Val Val Lys Lys Leu Arg Glu Gln Phe Gly
325 330 335
Asn Asn Lys Thr Ile Val Phe Asn Gln Ser Ser Gly Gly Asp Pro Glu
340 345 350
Ile Val Met His Ser Phe Asn Cys Arg Gly Glu Phe Phe Tyr Cys Asn
355 360 365
Thr Thr Gln Leu Phe Asn Ser Thr Trp Asn Ala Asn Ser Thr Trp Asn
370 375 380
Ala Asn Glu Asn Thr Thr Gly Met Pro Ser Asp Asn Ile Thr Leu Pro
385 390 395 400
Cys Arg Ile Lys Gln Val Ile Asn Met Trp Gln Glu Val Gly Lys Ala
405 410 415
Met Tyr Ala Pro Pro Ile Lys Gly Pro Ile Lys Cys Ser Ser Asn Ile
420 425 430
Thr Gly Leu Leu Leu Thr Arg Asp Gly Gly Val Asn Asn Ser Glu Thr
435 440 445
Glu Thr Phe Arg Pro Gly Gly Gly Asp Met Arg Asn Asn Trp Arg Cys
450 455 460
Gly Leu Tyr Lys Tyr Lys Val Val Lys Ile Glu Pro Leu Gly Val Ala
465 470 475 480
Pro Thr Arg Ala Lys Arg Arg Val Val Gln Arg Glu Lys Arg
485 490
<210> 85
<211> 1603
<212> DNA
<213> Human immunodeficiency virus type 1
<400> 85
agaaagagca gagacagtgg caatgaaagt gaaggggatc aggaagaatt atcagcactt 60
gtggggatgg ggcatgatgc tccttgggat gttaatgatc tgtagtgcta cagaaaacct 120
gtgggtcaca gtatattgtg gggtacctgt gtggaaagaa gcagaaacca gtctattttg 180
tgcatcagat gctaacacat ataatacaga ggctcataat gtttgggcca ctcatgcctg 240
tgtacccacg gaccccaacc cacaagaaat atatttggaa aatgtgacag aaaattttaa 300
catgtggaaa aataacatgg tagaacagat gcatgaggat atagtaagtt tatgggatga 360
aagcctaaag ccatgtgtaa aaataacccc actctgtgtc actctaaatt gcactgattt 420
ggaaaatggc actagtagca ataatagtag ctatcaaagg ggggaagaag gagaaataaa 480
gaactgctct ttcaatatca ccacaagatt aagagaaaag gtacagaaag aatatgcact 540
tttttataaa cttgatataa tagcaatgga taataaaact aatgctacca gatataggtt 600
gataagttgt aacacctcaa ccattacaca ggcctgtcca aaagtatcct ttgagccaat 660
tcccatacat tattgtgccc cagctggttt tgcgcttttc aagtgtaatg ataagaagtt 720
caatggatca ggaacatgta acaatgtcag cacagtacaa tgtacacatg gaattaggcc 780
agtagtatca actcagctgt tgctaaatgg cagtctagca gaagaagagg tagtaattag 840
atctgaaaat ttcacaaaca gtgctaaaac cataatagta cagctaaaag aacctgtaaa 900
aattaattgt acaagaccca acaataatac aagaagaagt atacatatag gaccaggaaa 960
agcattttat gcaacaggag aaataatagg agatatagga caagcacatt gtaacattag 1020
tggacaagaa tggaataaaa ctttaattca gatagttaaa aaattgagag aacaatttgg 1080
gaataagacg ataaacttta ctaaaccagc aggaggggac ccagagattg taatgcacag 1140
ttttaattgt ggaggggaat ttttctactg tgatacaaca cgactgttta atagggcttg 1200
gaataatact gaagagttaa atagtactac tggagagtca aataacacta tcaccctccc 1260
atgcagaata aaacaaatta taaacatgtg gcaggaagta ggaaaagcaa tgtatgcccc 1320
tcccatccaa ggaacaatta gatgttcatc aaatattaca gggctgctac tagcaagaga 1380
tggtggcagt aacaatgaga ctaatactac tgaaatcttc agacctgcag gaggagatat 1440
gagggacaat tggagaagtg aattatataa atataaagta gtaaaaattg aaccattagg 1500
agtagcaccc accagggcaa agagaagagt ggtgcaaaga gaaagaagag caataggaat 1560
aggagctgtg ttccttgggt tcttgggagc agcaggaagc act 1603
<210> 86
<211> 482
<212> PRT
<213> Human immunodeficiency virus type 1
<400> 86
Ser Ala Thr Glu Asn Leu Trp Val Thr Val Tyr Cys Gly Val Pro Val
1 5 10 15
Trp Lys Glu Ala Glu Thr Ser Leu Phe Cys Ala Ser Asp Ala Asn Thr
20 25 30
Tyr Asn Thr Glu Ala His Asn Val Trp Ala Thr His Ala Cys Val Pro
35 40 45
Thr Asp Pro Asn Pro Gln Glu Ile Tyr Leu Glu Asn Val Thr Glu Asn
50 55 60
Phe Asn Met Trp Lys Asn Asn Met Val Glu Gln Met His Glu Asp Ile
65 70 75 80
Val Ser Leu Trp Asp Glu Ser Leu Lys Pro Cys Val Lys Ile Thr Pro
85 90 95
Leu Cys Val Thr Leu Asn Cys Thr Asp Leu Glu Asn Gly Thr Ser Ser
100 105 110
Asn Asn Ser Ser Tyr Gln Arg Gly Glu Glu Gly Glu Ile Lys Asn Cys
115 120 125
Ser Phe Asn Ile Thr Thr Arg Leu Arg Glu Lys Val Gln Lys Glu Tyr
130 135 140
Ala Leu Phe Tyr Lys Leu Asp Ile Ile Ala Met Asp Asn Lys Thr Asn
145 150 155 160
Ala Thr Arg Tyr Arg Leu Ile Ser Cys Asn Thr Ser Thr Ile Thr Gln
165 170 175
Ala Cys Pro Lys Val Ser Phe Glu Pro Ile Pro Ile His Tyr Cys Ala
180 185 190
Pro Ala Gly Phe Ala Leu Phe Lys Cys Asn Asp Lys Lys Phe Asn Gly
195 200 205
Ser Gly Thr Cys Asn Asn Val Ser Thr Val Gln Cys Thr His Gly Ile
210 215 220
Arg Pro Val Val Ser Thr Gln Leu Leu Leu Asn Gly Ser Leu Ala Glu
225 230 235 240
Glu Glu Val Val Ile Arg Ser Glu Asn Phe Thr Asn Ser Ala Lys Thr
245 250 255
Ile Ile Val Gln Leu Lys Glu Pro Val Lys Ile Asn Cys Thr Arg Pro
260 265 270
Asn Asn Asn Thr Arg Arg Ser Ile His Ile Gly Pro Gly Lys Ala Phe
275 280 285
Tyr Ala Thr Gly Glu Ile Ile Gly Asp Ile Gly Gln Ala His Cys Asn
290 295 300
Ile Ser Gly Gln Glu Trp Asn Lys Thr Leu Ile Gln Ile Val Lys Lys
305 310 315 320
Leu Arg Glu Gln Phe Gly Asn Lys Thr Ile Asn Phe Thr Lys Pro Ala
325 330 335
Gly Gly Asp Pro Glu Ile Val Met His Ser Phe Asn Cys Gly Gly Glu
340 345 350
Phe Phe Tyr Cys Asp Thr Thr Arg Leu Phe Asn Arg Ala Trp Asn Asn
355 360 365
Thr Glu Glu Leu Asn Ser Thr Thr Gly Glu Ser Asn Asn Thr Ile Thr
370 375 380
Leu Pro Cys Arg Ile Lys Gln Ile Ile Asn Met Trp Gln Glu Val Gly
385 390 395 400
Lys Ala Met Tyr Ala Pro Pro Ile Gln Gly Thr Ile Arg Cys Ser Ser
405 410 415
Asn Ile Thr Gly Leu Leu Leu Ala Arg Asp Gly Gly Ser Asn Asn Glu
420 425 430
Thr Asn Thr Thr Glu Ile Phe Arg Pro Ala Gly Gly Asp Met Arg Asp
435 440 445
Asn Trp Arg Ser Glu Leu Tyr Lys Tyr Lys Val Val Lys Ile Glu Pro
450 455 460
Leu Gly Val Ala Pro Thr Arg Ala Lys Arg Arg Val Val Gln Arg Glu
465 470 475 480
Arg Arg
<210> 87
<211> 1604
<212> DNA
<213> Human immunodeficiency virus type 1
<400> 87
agaaagagca gaagacagtg gcaatgagag tgaaggggat caggaagaat tgtcagcact 60
tatggagatg gggcatcatg ctccttggaa tgttaatgat ctgtagtgct gcaagcctgt 120
gggtcacagt ctattatggg gtacctgtgt ggaaagatgc aaacaccact ctattttgtg 180
catcagatgc taaagcatat gatacagagg tacataatgt gtgggccaca catgcctgtg 240
tacccacaga ccccaaccca caagaagtag tattggaaaa tgtgacagaa aattttaata 300
tgtggaaaaa taacatggta gaacagatgc atgaggacat aattagttta tgggaccaaa 360
gcctaaagcc atgtgtaaaa ctaaccccac tctgtgttac tttaaattgc actgagttga 420
tgttgaatac tactaccaat agtactacta ccaatagtac cagtagtcct cctaccagta 480
gtggattgac aaactgctct ttcaatatcg ccacagatct aagagataag gtgcagaaag 540
aatatgctct tttttctaca cttgatgtag tatcaatagg taataacagc tctaggctga 600
taagttgtaa cacctcaatc cttacacagg cctgtccaaa ggtatccttt gagccaattc 660
ccatacatta ttgtgccccg gctggttttg caattctaaa gtgtaacaat aagacattca 720
atggaaaagg actatgtaac aatgtcagca caatacaatg tacacatgga attaagccag 780
tagtatcaac tcaattactg ttaaatggca gtctagcaga gaaagacata gtaattagat 840
ctgacaattt ttcaaacaat gctaagacca taatagtaca gctgaagaag cctgtataca 900
tcaattgtac aagacccaac aacaatacga gaaaaggtat acacatagca ccagggagag 960
cattttatac aacaggacag ataataggag acataaggaa agcatattgt gaaattagtg 1020
gaaaaagctg gaataacact ttagaacaga tagctacaaa attaagagaa caatttggga 1080
gtaataaaac aatagtcttt aatcaatcct cgggagggga cccagaaatt gtaatgcaca 1140
gttttaattg tagaggagaa tttttctatt gtaattcaac acaattgttt aatagtactt 1200
ggccgggtaa cggtcctagc aataatacta ctgggaatgg tactgatact gttatcatcc 1260
ttccatgcag aataaaacaa atcataaaca tgtggcagga agtaggaaga gcaatgtgtg 1320
cccctcccat cgcaggacaa attaactgta caacaaaaat tacagggctg ttattaacaa 1380
gagatggtgg gaatagcaat gagaccaaag agactgaaat ctttagacct ggaggaggag 1440
atatgaggga caattggaga agtgaattat acaaatataa agtagtaaaa attgaaccat 1500
taggagtagc acccaccgag gcaaggagaa gagtggtgca acgagaaaag agcagtggga 1560
ctaggagcta tgttccttgg gttcttggga gcagcagaag cact 1604
<210> 88
<211> 483
<212> PRT
<213> Human immunodeficiency virus type 1
<400> 88
Ser Ala Ala Ser Leu Trp Val Thr Val Tyr Tyr Gly Val Pro Val Trp
1 5 10 15
Lys Asp Ala Asn Thr Thr Leu Phe Cys Ala Ser Asp Ala Lys Ala Tyr
20 25 30
Asp Thr Glu Val His Asn Val Trp Ala Thr His Ala Cys Val Pro Thr
35 40 45
Asp Pro Asn Pro Gln Glu Val Val Leu Glu Asn Val Thr Glu Asn Phe
50 55 60
Asn Met Trp Lys Asn Asn Met Val Glu Gln Met His Glu Asp Ile Ile
65 70 75 80
Ser Leu Trp Asp Gln Ser Leu Lys Pro Cys Val Lys Leu Thr Pro Leu
85 90 95
Cys Val Thr Leu Asn Cys Thr Glu Leu Met Leu Asn Thr Thr Thr Asn
100 105 110
Ser Thr Thr Thr Asn Ser Thr Ser Ser Pro Pro Thr Ser Ser Gly Leu
115 120 125
Thr Asn Cys Ser Phe Asn Ile Ala Thr Asp Leu Arg Asp Lys Val Gln
130 135 140
Lys Glu Tyr Ala Leu Phe Ser Thr Leu Asp Val Val Ser Ile Gly Asn
145 150 155 160
Asn Ser Ser Arg Leu Ile Ser Cys Asn Thr Ser Ile Leu Thr Gln Ala
165 170 175
Cys Pro Lys Val Ser Phe Glu Pro Ile Pro Ile His Tyr Cys Ala Pro
180 185 190
Ala Gly Phe Ala Ile Leu Lys Cys Asn Asn Lys Thr Phe Asn Gly Lys
195 200 205
Gly Leu Cys Asn Asn Val Ser Thr Ile Gln Cys Thr His Gly Ile Lys
210 215 220
Pro Val Val Ser Thr Gln Leu Leu Leu Asn Gly Ser Leu Ala Glu Lys
225 230 235 240
Asp Ile Val Ile Arg Ser Asp Asn Phe Ser Asn Asn Ala Lys Thr Ile
245 250 255
Ile Val Gln Leu Lys Lys Pro Val Tyr Ile Asn Cys Thr Arg Pro Asn
260 265 270
Asn Asn Thr Arg Lys Gly Ile His Ile Ala Pro Gly Arg Ala Phe Tyr
275 280 285
Thr Thr Gly Gln Ile Ile Gly Asp Ile Arg Lys Ala Tyr Cys Glu Ile
290 295 300
Ser Gly Lys Ser Trp Asn Asn Thr Leu Glu Gln Ile Ala Thr Lys Leu
305 310 315 320
Arg Glu Gln Phe Gly Ser Asn Lys Thr Ile Val Phe Asn Gln Ser Ser
325 330 335
Gly Gly Asp Pro Glu Ile Val Met His Ser Phe Asn Cys Arg Gly Glu
340 345 350
Phe Phe Tyr Cys Asn Ser Thr Gln Leu Phe Asn Ser Thr Trp Pro Gly
355 360 365
Asn Gly Pro Ser Asn Asn Thr Thr Gly Asn Gly Thr Asp Thr Val Ile
370 375 380
Ile Leu Pro Cys Arg Ile Lys Gln Ile Ile Asn Met Trp Gln Glu Val
385 390 395 400
Gly Arg Ala Met Cys Ala Pro Pro Ile Ala Gly Gln Ile Asn Cys Thr
405 410 415
Thr Lys Ile Thr Gly Leu Leu Leu Thr Arg Asp Gly Gly Asn Ser Asn
420 425 430
Glu Thr Lys Glu Thr Glu Ile Phe Arg Pro Gly Gly Gly Asp Met Arg
435 440 445
Asp Asn Trp Arg Ser Glu Leu Tyr Lys Tyr Lys Val Val Lys Ile Glu
450 455 460
Pro Leu Gly Val Ala Pro Thr Glu Ala Arg Arg Arg Val Val Gln Arg
465 470 475 480
Glu Lys Ser
<210> 89
<211> 1666
<212> DNA
<213> Human immunodeficiency virus type 1
<400> 89
agaaagagca gagacagtgg caatgagagc gaaggagacc aggaagaatt gtcagcactc 60
gtggggatgg ggaaccatgc tcctgtggag atggggcatc atgctccttg ggatgttaat 120
gatctgtagt gctaaagaaa atttgtgggt cacagtctat tatggggtac ctgtgtggaa 180
agaagcatcc accactctat tttgtgcatc agatgctaaa gcatatgata cagaggtaca 240
taatgtttgg gccacacatg cctgtgtacc cacagacccc agcccacaag aagtagtatt 300
gggaaatgtg acagaatgtt ttaacatgtg gaataacaac atggtagaac agatgcatga 360
ggatataatc agtttatggg accaaagtct aaaaccctgt gtaaaattaa ccccactctg 420
tgttacctta agttgcagtg acgttaatat taccaatatt atcaataaca ctattgctaa 480
aaataatagt ttaagaatgg aaacaggaga cataaaaaac tgctctttca atatcaccac 540
aaacataaga gataagatgc aaacagaata tgcacttttt tataaatttg atgtagtgcc 600
aatatatgat agcaatgatg atagcaatat tactagaaat gatagttata ggataataag 660
ttgtaatacc tcagtcatta cacaggcctg tccaaaggta acctttgagc caattcccat 720
acattattgt gccccggctg gttttgcgat tctaaagtgt aacaataaga cattcaatgg 780
aaaaggacca tgtacaaatg tcagcacagt acaatgtaca catggaatta ggccagtagt 840
gtcaactcaa ctactgttaa atggcagtct agcagaaaag gagatagtga ttagatctga 900
caatttctcg gacaatgcta aaactataat agtacagtta aatggaactg ttcaaattaa 960
ttgttcaaga cccggcaaca atacaagaaa aagtatacat ataggaccag ggagtgcatt 1020
ttatgcaaca ggagacataa taggagatat aagaaaagca cattgtaaca ttagtaaaac 1080
agactggaat aacactttag gaaagatagc aaaaaaatta agagaacaat ttggggaaaa 1140
taaaacaata gagtttgaga aatccacagg aggggaccca gaagttatga tgcatacttt 1200
taattgtgga ggggaatttt tctactgtaa ttcaacaccg ctgtttaatg gtagtacttt 1260
taataatact tggacacctt tgaatagtag tgctaaaggg ccaaatgaca ctctcatact 1320
ccaatgtaga ataaaacaaa tcataaacat gtggcaggaa gtaggaaaag caatgtatgc 1380
ccctcccatc agaggataca ttaattgttc atcaaatatt acagggctgc tattgacaag 1440
agatggtggt aataatactg gtaatgatag caataccgag accttcagac ctacaggagg 1500
aaatatgaag gacaattgga ggagtgaatt atataaatat aaagtagtac aaattgaacc 1560
attaggagta gcacccacca gggcaaaaag aagagtggtg cagagagaaa aaagagcagc 1620
gctgggggct atgttccttg ggttcttggg agcagcagga agcact 1666
<210> 90
<211> 496
<212> PRT
<213> Human immunodeficiency virus type 1
<400> 90
Ser Ala Lys Glu Asn Leu Trp Val Thr Val Tyr Tyr Gly Val Pro Val
1 5 10 15
Trp Lys Glu Ala Ser Thr Thr Leu Phe Cys Ala Ser Asp Ala Lys Ala
20 25 30
Tyr Asp Thr Glu Val His Asn Val Trp Ala Thr His Ala Cys Val Pro
35 40 45
Thr Asp Pro Ser Pro Gln Glu Val Val Leu Gly Asn Val Thr Glu Cys
50 55 60
Phe Asn Met Trp Asn Asn Asn Met Val Glu Gln Met His Glu Asp Ile
65 70 75 80
Ile Ser Leu Trp Asp Gln Ser Leu Lys Pro Cys Val Lys Leu Thr Pro
85 90 95
Leu Cys Val Thr Leu Ser Cys Ser Asp Val Asn Ile Thr Asn Ile Ile
100 105 110
Asn Asn Thr Ile Ala Lys Asn Asn Ser Leu Arg Met Glu Thr Gly Asp
115 120 125
Ile Lys Asn Cys Ser Phe Asn Ile Thr Thr Asn Ile Arg Asp Lys Met
130 135 140
Gln Thr Glu Tyr Ala Leu Phe Tyr Lys Phe Asp Val Val Pro Ile Tyr
145 150 155 160
Asp Ser Asn Asp Asp Ser Asn Ile Thr Arg Asn Asp Ser Tyr Arg Ile
165 170 175
Ile Ser Cys Asn Thr Ser Val Ile Thr Gln Ala Cys Pro Lys Val Thr
180 185 190
Phe Glu Pro Ile Pro Ile His Tyr Cys Ala Pro Ala Gly Phe Ala Ile
195 200 205
Leu Lys Cys Asn Asn Lys Thr Phe Asn Gly Lys Gly Pro Cys Thr Asn
210 215 220
Val Ser Thr Val Gln Cys Thr His Gly Ile Arg Pro Val Val Ser Thr
225 230 235 240
Gln Leu Leu Leu Asn Gly Ser Leu Ala Glu Lys Glu Ile Val Ile Arg
245 250 255
Ser Asp Asn Phe Ser Asp Asn Ala Lys Thr Ile Ile Val Gln Leu Asn
260 265 270
Gly Thr Val Gln Ile Asn Cys Ser Arg Pro Gly Asn Asn Thr Arg Lys
275 280 285
Ser Ile His Ile Gly Pro Gly Ser Ala Phe Tyr Ala Thr Gly Asp Ile
290 295 300
Ile Gly Asp Ile Arg Lys Ala His Cys Asn Ile Ser Lys Thr Asp Trp
305 310 315 320
Asn Asn Thr Leu Gly Lys Ile Ala Lys Lys Leu Arg Glu Gln Phe Gly
325 330 335
Glu Asn Lys Thr Ile Glu Phe Glu Lys Ser Thr Gly Gly Asp Pro Glu
340 345 350
Val Met Met His Thr Phe Asn Cys Gly Gly Glu Phe Phe Tyr Cys Asn
355 360 365
Ser Thr Pro Leu Phe Asn Gly Ser Thr Phe Asn Asn Thr Trp Thr Pro
370 375 380
Leu Asn Ser Ser Ala Lys Gly Pro Asn Asp Thr Leu Ile Leu Gln Cys
385 390 395 400
Arg Ile Lys Gln Ile Ile Asn Met Trp Gln Glu Val Gly Lys Ala Met
405 410 415
Tyr Ala Pro Pro Ile Arg Gly Tyr Ile Asn Cys Ser Ser Asn Ile Thr
420 425 430
Gly Leu Leu Leu Thr Arg Asp Gly Gly Asn Asn Thr Gly Asn Asp Ser
435 440 445
Asn Thr Glu Thr Phe Arg Pro Thr Gly Gly Asn Met Lys Asp Asn Trp
450 455 460
Arg Ser Glu Leu Tyr Lys Tyr Lys Val Val Gln Ile Glu Pro Leu Gly
465 470 475 480
Val Ala Pro Thr Arg Ala Lys Arg Arg Val Val Gln Arg Glu Lys Arg
485 490 495
<210> 91
<211> 1559
<212> DNA
<213> Human immunodeficiency virus type 1
<400> 91
agaaagagca gaagacagtg gcaatgagag tgaaggagat caggaagaac tatcagcact 60
tgtggagatg gggcatcatg ctctttggga tattaatgat ctgtagtgct gaagaaaagt 120
gggtcacagt ctattatggg gtacctgtgt ggaaagaagc aaagaccact ctattttgtg 180
catcagatgc taaagcatat gatacagagg cacataatgt ttgggccaca catgcctgtg 240
tacccacaga ccccaaccca caagaagtag tattggagaa tgtgacagaa aattttaaca 300
tgtggaaaaa tgacatggta gagcagatgc atgaggatgt aatcagttta tgggatcaaa 360
gcctaaagcc atgtgtagaa ttaacgccac tctgtgttac tctaaattgc actaatctaa 420
attgcactaa caacactagt agcgaaataa aaaactgttc tttctatgtc accacaagca 480
tggaaggtaa ggtgaaaaaa catgcaacgt tttatagcct tgatatagta cgaacaacag 540
agagtaatat cagctatagg ttgataagtt gtaacacctc agtcattaca caggcctgtc 600
caaaaatatc ctttgaacca attcccatac attattgtgc cccggctggt tttgcgatcc 660
taaagtgtaa caataagaca ttcaatggaa caggaccatg tacaaatgtc agcacagtgc 720
aatgtacaca tggaattaag ccagtagtat caactcaact actgttaaat ggcagtctag 780
cagaggaaga ggtagtaatt agatctgaaa atttcatgag aaatgataaa atcataatag 840
tacagctaaa tgaatctata gaaattaatt gtacaagacc caactataat acaagaaaag 900
gtttccatat aggaccaggg agagcaattt atacaggaca aataatagga gatatcagac 960
aagcacattg taacattagt ggaataaaat ggaagaaggc tttaaaacag atagttggaa 1020
aattaagaga acaatttggg aataaaacaa tagtatttaa tcagtcctca ggaggggacc 1080
tagaaattga aacacacagt tttaattgtg gaggggaatt tttctactgc aatacaacac 1140
aactgtttaa taatacttgg ccgtcaaata atactgacgg agatattaac gaaaacatca 1200
cactcacact cccatgcaga ataaaacaaa ttataaacat gtggcagaaa gtaggaaaag 1260
ccatgtatgc ccctcccatc aaaggacaaa ttagatgttc atcaaatatt acagggctgc 1320
tactaatgag agatggtggt aacgacacca ccaaggcaaa cgacaccgag gtcttcagac 1380
ctggaggagg ggagatgagg gacaattgga gaagtgaatt atataaatat aaagtagtaa 1440
aaattgaacc attaggaata gcacccacca aggcaaagag aagagtggtg cagagagaaa 1500
aaagaggagt aggattagga gctatgttcc ttgggttctt gggagcagca ggaagcact 1559
<210> 92
<211> 467
<212> PRT
<213> Human immunodeficiency virus type 1
<400> 92
Ser Ala Glu Glu Lys Trp Val Thr Val Tyr Tyr Gly Val Pro Val Trp
1 5 10 15
Lys Glu Ala Lys Thr Thr Leu Phe Cys Ala Ser Asp Ala Lys Ala Tyr
20 25 30
Asp Thr Glu Ala His Asn Val Trp Ala Thr His Ala Cys Val Pro Thr
35 40 45
Asp Pro Asn Pro Gln Glu Val Val Leu Glu Asn Val Thr Glu Asn Phe
50 55 60
Asn Met Trp Lys Asn Asp Met Val Glu Gln Met His Glu Asp Val Ile
65 70 75 80
Ser Leu Trp Asp Gln Ser Leu Lys Pro Cys Val Glu Leu Thr Pro Leu
85 90 95
Cys Val Thr Leu Asn Cys Thr Asn Leu Asn Cys Thr Asn Asn Thr Ser
100 105 110
Ser Glu Ile Lys Asn Cys Ser Phe Tyr Val Thr Thr Ser Met Glu Gly
115 120 125
Lys Val Lys Lys His Ala Thr Phe Tyr Ser Leu Asp Ile Val Arg Thr
130 135 140
Thr Glu Ser Asn Ile Ser Tyr Arg Leu Ile Ser Cys Asn Thr Ser Val
145 150 155 160
Ile Thr Gln Ala Cys Pro Lys Ile Ser Phe Glu Pro Ile Pro Ile His
165 170 175
Tyr Cys Ala Pro Ala Gly Phe Ala Ile Leu Lys Cys Asn Asn Lys Thr
180 185 190
Phe Asn Gly Thr Gly Pro Cys Thr Asn Val Ser Thr Val Gln Cys Thr
195 200 205
His Gly Ile Lys Pro Val Val Ser Thr Gln Leu Leu Leu Asn Gly Ser
210 215 220
Leu Ala Glu Glu Glu Val Val Ile Arg Ser Glu Asn Phe Met Arg Asn
225 230 235 240
Asp Lys Ile Ile Ile Val Gln Leu Asn Glu Ser Ile Glu Ile Asn Cys
245 250 255
Thr Arg Pro Asn Tyr Asn Thr Arg Lys Gly Phe His Ile Gly Pro Gly
260 265 270
Arg Ala Ile Tyr Thr Gly Gln Ile Ile Gly Asp Ile Arg Gln Ala His
275 280 285
Cys Asn Ile Ser Gly Ile Lys Trp Lys Lys Ala Leu Lys Gln Ile Val
290 295 300
Gly Lys Leu Arg Glu Gln Phe Gly Asn Lys Thr Ile Val Phe Asn Gln
305 310 315 320
Ser Ser Gly Gly Asp Leu Glu Ile Glu Thr His Ser Phe Asn Cys Gly
325 330 335
Gly Glu Phe Phe Tyr Cys Asn Thr Thr Gln Leu Phe Asn Asn Thr Trp
340 345 350
Pro Ser Asn Asn Thr Asp Gly Asp Ile Asn Glu Asn Ile Thr Leu Thr
355 360 365
Leu Pro Cys Arg Ile Lys Gln Ile Ile Asn Met Trp Gln Lys Val Gly
370 375 380
Lys Ala Met Tyr Ala Pro Pro Ile Lys Gly Gln Ile Arg Cys Ser Ser
385 390 395 400
Asn Ile Thr Gly Leu Leu Leu Met Arg Asp Gly Gly Asn Asp Thr Thr
405 410 415
Lys Ala Asn Asp Thr Glu Val Phe Arg Pro Gly Gly Gly Glu Met Arg
420 425 430
Asp Asn Trp Arg Ser Glu Leu Tyr Lys Tyr Lys Val Val Lys Ile Glu
435 440 445
Pro Leu Gly Ile Ala Pro Thr Lys Ala Lys Arg Arg Val Val Gln Arg
450 455 460
Glu Lys Arg
465
<210> 93
<211> 1691
<212> DNA
<213> Human immunodeficiency virus type 1
<400> 93
agaaagagca gaagacagtg gcaatgagag tgaaggagat caagaggagt tatcagctct 60
tgttgaaagg gggcatcttg ctccttggga tattgatgat ctgtagtgct acagacaact 120
tgtgggtcac agtatattat ggggtacctg tatggaaaga agcaaccacc actctatttt 180
gtgcatcaga tgctaaagcc cataatacag aggtacacaa tgtttgggcc acacatgcct 240
gtgtacccac agaccctgac ccacaagaag tagtattgga aaatgtgaca gaaaatttta 300
acatgtggaa aaatgacatg gtagaacaga tgcatgagga tataatcagt ttatgggatc 360
aaagcctaaa gccatgtgta aaattaaccc cactttgtgt tactttaaat tgcactaatg 420
ctaaaatgaa caatactgct gataccaatg ctactaatac tgttaatatc agcaaggaag 480
aaatggaaga aataaaaaac tgctctttca atgtcaccac aagcttaaga gataagatgc 540
agagccaata tgcattgttt tataaacttg atatagtacc aatagataat agtagtagta 600
tagataatag tagtaataca tgtaatagta atagtacaca taataatagt agtacatgta 660
ataattatgc taattataga ttgataagtt gtgacacctc agtcattaca caggcctgtc 720
caaaggtatc ctttgagcca attcccatac attattgtgc cccggctggt tttgcgattc 780
taaagtgtaa taataagacg ttcaatggat caggaccatg taaaaatgtc agcacagtac 840
aatgtacaca tggaatcagg ccagtagtat caactcaact gctgttaaat ggcagtctag 900
cagaagaagg ggtagttatt agatctgaga atttcacaaa caatgctaaa accataatag 960
tacagataca tgaacctata gaaattaatt gtacaagacc caacaacaat acaagaaaaa 1020
gtataactat aggaccagga agagcgtttt atgcaacagg agacataata ggagatataa 1080
gacaagcaca ttgtaacctt agtaaagcaa gatggaatga tactttaaaa cagatagtta 1140
caaaattaag agaacagttt agaaataaaa caataaattt tactcaatcc tcaggagggg 1200
acccagaaat tgtgatgcac agttttaatt gtggagggga atttttctac tgtaatacaa 1260
cacaactgtt taatagtact tggttgtcta atagtacttg gaatgatact gaaaggtcaa 1320
atggcactga aactattaca ctcccatgca gaataaagca agttataaac aggtggcagg 1380
aagtaggaaa agcaatgtat gcccctccca tcagtggaat aattagatgt tcatcaaata 1440
ttacagggct gctattaaca agagatggtg gtaatagtaa tgacactccg actgatactg 1500
agatcttcag acctggagga ggagatatga gggacaattg gagaagcgaa ttatataaat 1560
ataaagtagt aaaaattgaa ccattaggaa tagcacccac caaggcaaag agaagagtgg 1620
tgcagagaga aaaaagagca gcgggaatag gagctctgtt ccttgggttc ttgggagcag 1680
caggaagcac t 1691
<210> 94
<211> 511
<212> PRT
<213> Human immunodeficiency virus type 1
<400> 94
Ser Ala Thr Asp Asn Leu Trp Val Thr Val Tyr Tyr Gly Val Pro Val
1 5 10 15
Trp Lys Glu Ala Thr Thr Thr Leu Phe Cys Ala Ser Asp Ala Lys Ala
20 25 30
His Asn Thr Glu Val His Asn Val Trp Ala Thr His Ala Cys Val Pro
35 40 45
Thr Asp Pro Asp Pro Gln Glu Val Val Leu Glu Asn Val Thr Glu Asn
50 55 60
Phe Asn Met Trp Lys Asn Asp Met Val Glu Gln Met His Glu Asp Ile
65 70 75 80
Ile Ser Leu Trp Asp Gln Ser Leu Lys Pro Cys Val Lys Leu Thr Pro
85 90 95
Leu Cys Val Thr Leu Asn Cys Thr Asn Ala Lys Met Asn Asn Thr Ala
100 105 110
Asp Thr Asn Ala Thr Asn Thr Val Asn Ile Ser Lys Glu Glu Met Glu
115 120 125
Glu Ile Lys Asn Cys Ser Phe Asn Val Thr Thr Ser Leu Arg Asp Lys
130 135 140
Met Gln Ser Gln Tyr Ala Leu Phe Tyr Lys Leu Asp Ile Val Pro Ile
145 150 155 160
Asp Asn Ser Ser Ser Ile Asp Asn Ser Ser Asn Thr Cys Asn Ser Asn
165 170 175
Ser Thr His Asn Asn Ser Ser Thr Cys Asn Asn Tyr Ala Asn Tyr Arg
180 185 190
Leu Ile Ser Cys Asp Thr Ser Val Ile Thr Gln Ala Cys Pro Lys Val
195 200 205
Ser Phe Glu Pro Ile Pro Ile His Tyr Cys Ala Pro Ala Gly Phe Ala
210 215 220
Ile Leu Lys Cys Asn Asn Lys Thr Phe Asn Gly Ser Gly Pro Cys Lys
225 230 235 240
Asn Val Ser Thr Val Gln Cys Thr His Gly Ile Arg Pro Val Val Ser
245 250 255
Thr Gln Leu Leu Leu Asn Gly Ser Leu Ala Glu Glu Gly Val Val Ile
260 265 270
Arg Ser Glu Asn Phe Thr Asn Asn Ala Lys Thr Ile Ile Val Gln Ile
275 280 285
His Glu Pro Ile Glu Ile Asn Cys Thr Arg Pro Asn Asn Asn Thr Arg
290 295 300
Lys Ser Ile Thr Ile Gly Pro Gly Arg Ala Phe Tyr Ala Thr Gly Asp
305 310 315 320
Ile Ile Gly Asp Ile Arg Gln Ala His Cys Asn Leu Ser Lys Ala Arg
325 330 335
Trp Asn Asp Thr Leu Lys Gln Ile Val Thr Lys Leu Arg Glu Gln Phe
340 345 350
Arg Asn Lys Thr Ile Asn Phe Thr Gln Ser Ser Gly Gly Asp Pro Glu
355 360 365
Ile Val Met His Ser Phe Asn Cys Gly Gly Glu Phe Phe Tyr Cys Asn
370 375 380
Thr Thr Gln Leu Phe Asn Ser Thr Trp Leu Ser Asn Ser Thr Trp Asn
385 390 395 400
Asp Thr Glu Arg Ser Asn Gly Thr Glu Thr Ile Thr Leu Pro Cys Arg
405 410 415
Ile Lys Gln Val Ile Asn Arg Trp Gln Glu Val Gly Lys Ala Met Tyr
420 425 430
Ala Pro Pro Ile Ser Gly Ile Ile Arg Cys Ser Ser Asn Ile Thr Gly
435 440 445
Leu Leu Leu Thr Arg Asp Gly Gly Asn Ser Asn Asp Thr Pro Thr Asp
450 455 460
Thr Glu Ile Phe Arg Pro Gly Gly Gly Asp Met Arg Asp Asn Trp Arg
465 470 475 480
Ser Glu Leu Tyr Lys Tyr Lys Val Val Lys Ile Glu Pro Leu Gly Ile
485 490 495
Ala Pro Thr Lys Ala Lys Arg Arg Val Val Gln Arg Glu Lys Arg
500 505 510
<210> 95
<211> 1616
<212> DNA
<213> Human immunodeficiency virus type 1
<400> 95
agaaagagca gaagacagtg gcaatgagag tgaaggggat caggaagaat tgtcagcgct 60
ggtggagatg gggcatcatg ctccttggga tgttgatgat ctgtaaagct gcagaccaat 120
tgtgggtcac agtctattat ggggtacctg tgtggaagga agcaaccacc actttatttt 180
gtgcatcaga tgctaaagca tataagacag aggtacataa tgtttgggcc acacatgcct 240
gtgtacccac agaccccaat ccacaagaaa tagacttggc aaatgtgaca gaaaatttta 300
acatgtggaa aaataacatg gtagaacaga tgcatgagga tataatcagt ttatgggatc 360
aaagcctaaa gccatgtgta aaactaaccc cactctgtgt tactttaaat tgtactgatg 420
acttagaaaa ttgtaaaaat accactaata ataatgccgc taataataat aatacctgcg 480
acatgccagg agaaataaag aactgctctt tcaatatcac cgcaggtata agagataaga 540
tgcagaaaga atatgcactt tttaatacac ttgatgtagt accaataggt gatgagaatg 600
ataataccag ttataggtta ataagttgta atacctcagt cattacacag gcctgtccaa 660
aggtatcctt tgagccaatt cccatacatt attgtgcccc ggctggtttt gcgattctaa 720
agtgtaaaga taagaaattc aatgggacag gacagtgtaa aaatgtcagc acagtacaat 780
gtacacatgg aattaagcca gtagtatcaa ctcaattact gttaaatggc agtctagcag 840
aagaagaggt agtaattagg tctgaaaatt tcacaaacaa tgctaaaacc ataatagtac 900
agctgaaaga acctgtacaa attaattgta caaggcccaa caacaataca agaaaaagca 960
tacatatagg accagggaga gcattttatg caacaggaca aataatagga gatataagac 1020
aagcacattg taacattagt agtataaaat ggaataacac tttaaaacag atagttaaaa 1080
aattaagaga acaatttaat aaaacaataa tctttaatca atcctcagga ggggacccag 1140
aaattgtaat gcacattttt aattgtagag gggaattttt ctactgtaat acaacacaac 1200
tgtttaatag cacttggaat attactgaag ggtcaaatga caatattaca ggcaggtcaa 1260
atgacactat cacgctccca tgcaagataa gacaaattgt aaacatgtgg caggaagtag 1320
gaaaagcaat gtatgctcct cccatcagag gacaaattaa ctgtacatca aatattacag 1380
ggctgctgtt aacaagagat ggtggtaaaa atatcagcga gtccgaaacc ttcagacctg 1440
gaggaggaaa tatgaaggac aattggagaa gtgaattata taaatacaaa gtagtacaaa 1500
ttgaaccatt aggagtagca cccaccaagg caaagagaag agtggtgcag agagaaaaaa 1560
gagcagtggg aataggagct ctgttccttg ggttcttggg agcagcagga agcact 1616
<210> 96
<211> 486
<212> PRT
<213> Human immunodeficiency virus type 1
<400> 96
Lys Ala Ala Asp Gln Leu Trp Val Thr Val Tyr Tyr Gly Val Pro Val
1 5 10 15
Trp Lys Glu Ala Thr Thr Thr Leu Phe Cys Ala Ser Asp Ala Lys Ala
20 25 30
Tyr Lys Thr Glu Val His Asn Val Trp Ala Thr His Ala Cys Val Pro
35 40 45
Thr Asp Pro Asn Pro Gln Glu Ile Asp Leu Ala Asn Val Thr Glu Asn
50 55 60
Phe Asn Met Trp Lys Asn Asn Met Val Glu Gln Met His Glu Asp Ile
65 70 75 80
Ile Ser Leu Trp Asp Gln Ser Leu Lys Pro Cys Val Lys Leu Thr Pro
85 90 95
Leu Cys Val Thr Leu Asn Cys Thr Asp Asp Leu Glu Asn Cys Lys Asn
100 105 110
Thr Thr Asn Asn Asn Ala Ala Asn Asn Asn Asn Thr Cys Asp Met Pro
115 120 125
Gly Glu Ile Lys Asn Cys Ser Phe Asn Ile Thr Ala Gly Ile Arg Asp
130 135 140
Lys Met Gln Lys Glu Tyr Ala Leu Phe Asn Thr Leu Asp Val Val Pro
145 150 155 160
Ile Gly Asp Glu Asn Asp Asn Thr Ser Tyr Arg Leu Ile Ser Cys Asn
165 170 175
Thr Ser Val Ile Thr Gln Ala Cys Pro Lys Val Ser Phe Glu Pro Ile
180 185 190
Pro Ile His Tyr Cys Ala Pro Ala Gly Phe Ala Ile Leu Lys Cys Lys
195 200 205
Asp Lys Lys Phe Asn Gly Thr Gly Gln Cys Lys Asn Val Ser Thr Val
210 215 220
Gln Cys Thr His Gly Ile Lys Pro Val Val Ser Thr Gln Leu Leu Leu
225 230 235 240
Asn Gly Ser Leu Ala Glu Glu Glu Val Val Ile Arg Ser Glu Asn Phe
245 250 255
Thr Asn Asn Ala Lys Thr Ile Ile Val Gln Leu Lys Glu Pro Val Gln
260 265 270
Ile Asn Cys Thr Arg Pro Asn Asn Asn Thr Arg Lys Ser Ile His Ile
275 280 285
Gly Pro Gly Arg Ala Phe Tyr Ala Thr Gly Gln Ile Ile Gly Asp Ile
290 295 300
Arg Gln Ala His Cys Asn Ile Ser Ser Ile Lys Trp Asn Asn Thr Leu
305 310 315 320
Lys Gln Ile Val Lys Lys Leu Arg Glu Gln Phe Asn Lys Thr Ile Ile
325 330 335
Phe Asn Gln Ser Ser Gly Gly Asp Pro Glu Ile Val Met His Ile Phe
340 345 350
Asn Cys Arg Gly Glu Phe Phe Tyr Cys Asn Thr Thr Gln Leu Phe Asn
355 360 365
Ser Thr Trp Asn Ile Thr Glu Gly Ser Asn Asp Asn Ile Thr Gly Arg
370 375 380
Ser Asn Asp Thr Ile Thr Leu Pro Cys Lys Ile Arg Gln Ile Val Asn
385 390 395 400
Met Trp Gln Glu Val Gly Lys Ala Met Tyr Ala Pro Pro Ile Arg Gly
405 410 415
Gln Ile Asn Cys Thr Ser Asn Ile Thr Gly Leu Leu Leu Thr Arg Asp
420 425 430
Gly Gly Lys Asn Ile Ser Glu Ser Glu Thr Phe Arg Pro Gly Gly Gly
435 440 445
Asn Met Lys Asp Asn Trp Arg Ser Glu Leu Tyr Lys Tyr Lys Val Val
450 455 460
Gln Ile Glu Pro Leu Gly Val Ala Pro Thr Lys Ala Lys Arg Arg Val
465 470 475 480
Val Gln Arg Glu Lys Arg
485
<210> 97
<211> 1640
<212> DNA
<213> Human immunodeficiency virus type 1
<400> 97
agaaagagca gaagacagtg gcaatgagag tgaaggggat caggaagaat tatcagcact 60
ggtggagatg ggggaccatg ctcctttggt tattgatgat ctgtagtgct gcagaacaat 120
tgtgggtcac agtttactat ggggtacctg tgtggaaaga agcaaccacc actctatttt 180
gtgcatcaga tgccaaagca tatgatccag aggcacataa tgtttgggcc acacatgcct 240
gtgtacccac agaccccaac ccacaagaat tagtattggc aaatgtgaca gaaaatttta 300
acatgtggaa aaataacatg gtagaacaga tgcaggagga tataatcagt ttatgggatc 360
aaagcctaaa gccatgtaca aaattaaccc cactctgtgt tactttaaat tgtactgatg 420
tggagtataa taatgaaact gcttccaata atactgcttc caatactact attattaatg 480
ggggaacaat agagggagat ggaataaaaa actgctcttt caatatcacc accagcctaa 540
ataaggtgca gaaagaaaat gcatattttt ataatcttga tgtagtacaa atagataata 600
gtgataatac cagctatagg ttagtaagtt gtaacacctc agtcattaca caggcctgtc 660
caaagatatc ttttgagcca attcccatac attactgtac cccggctggt tttgcgattc 720
taaagtgtaa tgataaaaag ttcaatggaa caggaccatg taaaaatgtc agcacagtac 780
aatgtacaca tggaattaag ccagtagtat caactcaact gttgttaaat ggcagtctag 840
cagaagaaga ggtagtaatt agatcagaaa atttcacaga taatgcaaaa atcataatag 900
tacagctgaa tgaatctatg gaaattaatt gtgcaagacc caacaacaat acaagaaaag 960
gtatacatat gggaccaggg aaagcatttt atgcaacagg agccataata ggagatatac 1020
gaagagcaca ttgcaacatt agtcaaacaa agtggaacaa tgccctaaaa cagatagcta 1080
taaagttaag agaacaattt gggaataaaa caatagtctt tacaaactcc tcaggagggg 1140
acccagaaat tgtaatgcac agttttaact gtggagggga gtttttctac tgtaatacat 1200
cactcctgtt taatagtatc tggaatagta ctactttgtc aaatagcact acaggagatg 1260
gaaatatcac actcccatgc agaataaaac aaattataaa tatgtggcag aaagtaggga 1320
aagcaatgta tgcccctccc atccaaggac taattaaatg tacatcaaat atcacaggga 1380
tgttattaat aagagatggt ggtaacatca actgcactga gactaatacc accaactgca 1440
atgagactga gaccttcaga cctgtaggag gagatatgag ggacaattgg agaagtgaat 1500
tatataaata taaagtagta aaaattgaac cattaggaat agcacccact aaggcaaaga 1560
gaagagtggt gcagagagaa aaaagagcag cgggactagg agctttgttc cttgggttct 1620
tgggagcagc aggaagcact 1640
<210> 98
<211> 494
<212> PRT
<213> Human immunodeficiency virus type 1
<400> 98
Ser Ala Ala Glu Gln Leu Trp Val Thr Val Tyr Tyr Gly Val Pro Val
1 5 10 15
Trp Lys Glu Ala Thr Thr Thr Leu Phe Cys Ala Ser Asp Ala Lys Ala
20 25 30
Tyr Asp Pro Glu Ala His Asn Val Trp Ala Thr His Ala Cys Val Pro
35 40 45
Thr Asp Pro Asn Pro Gln Glu Leu Val Leu Ala Asn Val Thr Glu Asn
50 55 60
Phe Asn Met Trp Lys Asn Asn Met Val Glu Gln Met Gln Glu Asp Ile
65 70 75 80
Ile Ser Leu Trp Asp Gln Ser Leu Lys Pro Cys Thr Lys Leu Thr Pro
85 90 95
Leu Cys Val Thr Leu Asn Cys Thr Asp Val Glu Tyr Asn Asn Glu Thr
100 105 110
Ala Ser Asn Asn Thr Ala Ser Asn Thr Thr Ile Ile Asn Gly Gly Thr
115 120 125
Ile Glu Gly Asp Gly Ile Lys Asn Cys Ser Phe Asn Ile Thr Thr Ser
130 135 140
Leu Asn Lys Val Gln Lys Glu Asn Ala Tyr Phe Tyr Asn Leu Asp Val
145 150 155 160
Val Gln Ile Asp Asn Ser Asp Asn Thr Ser Tyr Arg Leu Val Ser Cys
165 170 175
Asn Thr Ser Val Ile Thr Gln Ala Cys Pro Lys Ile Ser Phe Glu Pro
180 185 190
Ile Pro Ile His Tyr Cys Thr Pro Ala Gly Phe Ala Ile Leu Lys Cys
195 200 205
Asn Asp Lys Lys Phe Asn Gly Thr Gly Pro Cys Lys Asn Val Ser Thr
210 215 220
Val Gln Cys Thr His Gly Ile Lys Pro Val Val Ser Thr Gln Leu Leu
225 230 235 240
Leu Asn Gly Ser Leu Ala Glu Glu Glu Val Val Ile Arg Ser Glu Asn
245 250 255
Phe Thr Asp Asn Ala Lys Ile Ile Ile Val Gln Leu Asn Glu Ser Met
260 265 270
Glu Ile Asn Cys Ala Arg Pro Asn Asn Asn Thr Arg Lys Gly Ile His
275 280 285
Met Gly Pro Gly Lys Ala Phe Tyr Ala Thr Gly Ala Ile Ile Gly Asp
290 295 300
Ile Arg Arg Ala His Cys Asn Ile Ser Gln Thr Lys Trp Asn Asn Ala
305 310 315 320
Leu Lys Gln Ile Ala Ile Lys Leu Arg Glu Gln Phe Gly Asn Lys Thr
325 330 335
Ile Val Phe Thr Asn Ser Ser Gly Gly Asp Pro Glu Ile Val Met His
340 345 350
Ser Phe Asn Cys Gly Gly Glu Phe Phe Tyr Cys Asn Thr Ser Leu Leu
355 360 365
Phe Asn Ser Ile Trp Asn Ser Thr Thr Leu Ser Asn Ser Thr Thr Gly
370 375 380
Asp Gly Asn Ile Thr Leu Pro Cys Arg Ile Lys Gln Ile Ile Asn Met
385 390 395 400
Trp Gln Lys Val Gly Lys Ala Met Tyr Ala Pro Pro Ile Gln Gly Leu
405 410 415
Ile Lys Cys Thr Ser Asn Ile Thr Gly Met Leu Leu Ile Arg Asp Gly
420 425 430
Gly Asn Ile Asn Cys Thr Glu Thr Asn Thr Thr Asn Cys Asn Glu Thr
435 440 445
Glu Thr Phe Arg Pro Val Gly Gly Asp Met Arg Asp Asn Trp Arg Ser
450 455 460
Glu Leu Tyr Lys Tyr Lys Val Val Lys Ile Glu Pro Leu Gly Ile Ala
465 470 475 480
Pro Thr Lys Ala Lys Arg Arg Val Val Gln Arg Glu Lys Arg
485 490
<210> 99
<211> 1649
<212> DNA
<213> Human immunodeficiency virus type 1
<400> 99
agaaagagca gaagacagtg gcaatgagag tgaaggagat caggaggaat tgtcagcact 60
ggtggtggaa atggggcacc atgctccttg ggatgttgat gatctgtagt gctaacgagc 120
aattgtgggt cacagtctat tatggggtac ctgtgtggaa agaagcaacc accactctat 180
tttgtgcatc agatgctaaa gcatatgaca cagaggcaca taatgtttgg gccacacatg 240
cctgtgtacc cacagaccct aacccacaag aagtagtatt ggtaaatgtg acagaaaatt 300
ttaatatgtg gaaaaataac atggtagaac agacgcagga ggatataatc agtttatggg 360
atcaaagcct aaagccatgt gtaaaattaa ccccactctg tgttactcta aattgcactg 420
attggaccct acattgcaat aataatgata ctaattgcac tactttaaaa aatgatacta 480
aaaccaataa taatagtagt ttgagaacaa tggagggagg agaagtaaaa aactgctctt 540
tcaatgtcac cacaagccta agagataagg agcgaaaaga atatgcactt ttttataaac 600
ttgatgtagt accaataggt aatgataata caagctatac gctgataaat tgtaacacct 660
caaccattac acaggcctgt ccaaaggtaa cctttgaacc aattcccata cattattgta 720
ccccggctgg ttttgcgctt ctaaagtgta atgataagaa gttcaatgga acaggacagt 780
gtaaaaatgt cagcacagta caatgtacac atggaattag gccagtagta tcaactcaat 840
tgctgctaaa tggcagtcta tcagaaggag aggtaatgat tagatctgaa aatttcacaa 900
acaatgctaa aaccataata gtacagctga atgaatctat agcaattaat tgtacaagac 960
ccaacaacaa tacaagaaaa agtataacta taggaccggg gagagcattt tttacaacag 1020
gagaaataac aggagatata agacaagcac attgtaacct tagtgcagta caatggaata 1080
acacattaaa acagatagtt gcaaaattaa gggaacaatt tgggaataat acaataagct 1140
ttaataaatc cgcaggaggg gacccagaaa ttgtaatgca cagttttaat tgtggagggg 1200
aatttttcta ctgtgataca acacagctgt ttaatagtac ttgggataat gacacagact 1260
taagtattaa gaatgagact acagaatcaa acaacaaaac tatcacactc ccgtgcagaa 1320
taaaacaaat tataaacaga tggcaggaag taggaaaagc aatgtatgcc cctcccatca 1380
gaggacaaat taaatgttca tcaaatatta cagggctact attaacaaga gatggtggta 1440
tgaacaatag cgccaacgag accttcagac ctggaggagg agatatgagg gacaattgga 1500
gaagtgaatt atataaatat aaagtagtaa aaattgagcc attaggggta gcacccacca 1560
aggcaaagag aagagcggtg caaagagaaa aaagagcagt gggaatagga gctgtgttcc 1620
ttgggttctt gggagcagca ggaagcact 1649
<210> 100
<211> 496
<212> PRT
<213> Human immunodeficiency virus type 1
<400> 100
Ser Ala Asn Glu Gln Leu Trp Val Thr Val Tyr Tyr Gly Val Pro Val
1 5 10 15
Trp Lys Glu Ala Thr Thr Thr Leu Phe Cys Ala Ser Asp Ala Lys Ala
20 25 30
Tyr Asp Thr Glu Ala His Asn Val Trp Ala Thr His Ala Cys Val Pro
35 40 45
Thr Asp Pro Asn Pro Gln Glu Val Val Leu Val Asn Val Thr Glu Asn
50 55 60
Phe Asn Met Trp Lys Asn Asn Met Val Glu Gln Thr Gln Glu Asp Ile
65 70 75 80
Ile Ser Leu Trp Asp Gln Ser Leu Lys Pro Cys Val Lys Leu Thr Pro
85 90 95
Leu Cys Val Thr Leu Asn Cys Thr Asp Trp Thr Leu His Cys Asn Asn
100 105 110
Asn Asp Thr Asn Cys Thr Thr Leu Lys Asn Asp Thr Lys Thr Asn Asn
115 120 125
Asn Ser Ser Leu Arg Thr Met Glu Gly Gly Glu Val Lys Asn Cys Ser
130 135 140
Phe Asn Val Thr Thr Ser Leu Arg Asp Lys Glu Arg Lys Glu Tyr Ala
145 150 155 160
Leu Phe Tyr Lys Leu Asp Val Val Pro Ile Gly Asn Asp Asn Thr Ser
165 170 175
Tyr Thr Leu Ile Asn Cys Asn Thr Ser Thr Ile Thr Gln Ala Cys Pro
180 185 190
Lys Val Thr Phe Glu Pro Ile Pro Ile His Tyr Cys Thr Pro Ala Gly
195 200 205
Phe Ala Leu Leu Lys Cys Asn Asp Lys Lys Phe Asn Gly Thr Gly Gln
210 215 220
Cys Lys Asn Val Ser Thr Val Gln Cys Thr His Gly Ile Arg Pro Val
225 230 235 240
Val Ser Thr Gln Leu Leu Leu Asn Gly Ser Leu Ser Glu Gly Glu Val
245 250 255
Met Ile Arg Ser Glu Asn Phe Thr Asn Asn Ala Lys Thr Ile Ile Val
260 265 270
Gln Leu Asn Glu Ser Ile Ala Ile Asn Cys Thr Arg Pro Asn Asn Asn
275 280 285
Thr Arg Lys Ser Ile Thr Ile Gly Pro Gly Arg Ala Phe Phe Thr Thr
290 295 300
Gly Glu Ile Thr Gly Asp Ile Arg Gln Ala His Cys Asn Leu Ser Ala
305 310 315 320
Val Gln Trp Asn Asn Thr Leu Lys Gln Ile Val Ala Lys Leu Arg Glu
325 330 335
Gln Phe Gly Asn Asn Thr Ile Ser Phe Asn Lys Ser Ala Gly Gly Asp
340 345 350
Pro Glu Ile Val Met His Ser Phe Asn Cys Gly Gly Glu Phe Phe Tyr
355 360 365
Cys Asp Thr Thr Gln Leu Phe Asn Ser Thr Trp Asp Asn Asp Thr Asp
370 375 380
Leu Ser Ile Lys Asn Glu Thr Thr Glu Ser Asn Asn Lys Thr Ile Thr
385 390 395 400
Leu Pro Cys Arg Ile Lys Gln Ile Ile Asn Arg Trp Gln Glu Val Gly
405 410 415
Lys Ala Met Tyr Ala Pro Pro Ile Arg Gly Gln Ile Lys Cys Ser Ser
420 425 430
Asn Ile Thr Gly Leu Leu Leu Thr Arg Asp Gly Gly Met Asn Asn Ser
435 440 445
Ala Asn Glu Thr Phe Arg Pro Gly Gly Gly Asp Met Arg Asp Asn Trp
450 455 460
Arg Ser Glu Leu Tyr Lys Tyr Lys Val Val Lys Ile Glu Pro Leu Gly
465 470 475 480
Val Ala Pro Thr Lys Ala Lys Arg Arg Ala Val Gln Arg Glu Lys Arg
485 490 495
<210> 101
<211> 1610
<212> DNA
<213> Human immunodeficiency virus type 1
<400> 101
agaaagagca gaagacagtg gcaatgagag tgaaggggat caggaagaat tgcttacgga 60
aatggggcac catgctcctt gggatattaa tgatctgtag tgctgcagga aatttgtggg 120
tcacagtcta ttatggggtg cctgtgtgga aagaagcaac caccactcta ttctgtgcat 180
cagatgctaa agcatatgct acagaggcac ataatgtttg ggccacacat gcctgtgtac 240
ccacagaccc taacccacaa gaagtagtaa tggaaaatgt gacagaaaat tttaacatgt 300
ggaaaaataa catggtagaa cagatgcatg aggatataat cagtttatgg gatcaaagcc 360
taaagccatg tgtaaaatta accccactct gtgttactct aaattgcact gactgtgtga 420
cttcaaattg cactaagttg aagaatgtca ctaataatgc taatattagc aagatggaga 480
tggaggaggg agaaatgaaa aactgctctt ttaacatcac ctcaggcatg agagataaga 540
tgaagaaaga atatgcattt ttttataaac ttgatatagt accaataagt aatgataata 600
ctagctatag attgataagt tgtaatacct cagtcattac acaggcatgt ccaaaagtat 660
cctttgagcc aattcccata cattattgtg ccccggctgg ttttgcgatt ctaaaatgta 720
atgataagaa gttcaatgga acaggaccat gtaaaaatgt cagcacagta caatgtacac 780
atggaattag gccagtagtg tcaactcaat tgctgttaaa tggcagccta gcagaagaag 840
aggtagtaat tagatctgaa aatttaacgg acaatggtaa aaccataata gtacagctga 900
acgaatctgt acacattaat tgtacaagac ccagcaacaa taccagaaaa agtatacata 960
taggaccagg gaaagcattt tatgcaacag gacaaataat aggagatata agacaagcac 1020
actgtaacat tagtgaaaaa caatggaata aaactttaag ccagatagtt aaaaaattaa 1080
gagaacaatt taaaaataaa acaatagtct ttaatcaatc ctcaggaggg gacccagaaa 1140
ttgtaatgca cagttttaat tgtggaggag aatttttcta ctgtaattca acacaactat 1200
ttaatactac ttggaattgg aatgatactg aagggtcaaa taacactgaa agaaatgaaa 1260
gaaatattac actcccatgc agaataaaac aaattgtaaa caggtggcag gaagtaggaa 1320
aagcaatgta tgcccctccc atcagcggac caattagctg ttcatcaaat attacagggc 1380
tgctattaac aagagatggt ggtctcccaa acaatactga gaccttcaga ccagaaggag 1440
gaaatatgaa ggacaattgg agaagtgaat tatataaata taaagtagta aaaattgaac 1500
cattaggaat agcacccacc aaggcaaaga gaagagtggt gcaaagagaa aaaagagcag 1560
tgacattagg agctatgttc cttgggttct tgggagcagc aggaagcact 1610
<210> 102
<211> 486
<212> PRT
<213> Human immunodeficiency virus type 1
<400> 102
Ser Ala Ala Gly Asn Leu Trp Val Thr Val Tyr Tyr Gly Val Pro Val
1 5 10 15
Trp Lys Glu Ala Thr Thr Thr Leu Phe Cys Ala Ser Asp Ala Lys Ala
20 25 30
Tyr Ala Thr Glu Ala His Asn Val Trp Ala Thr His Ala Cys Val Pro
35 40 45
Thr Asp Pro Asn Pro Gln Glu Val Val Met Glu Asn Val Thr Glu Asn
50 55 60
Phe Asn Met Trp Lys Asn Asn Met Val Glu Gln Met His Glu Asp Ile
65 70 75 80
Ile Ser Leu Trp Asp Gln Ser Leu Lys Pro Cys Val Lys Leu Thr Pro
85 90 95
Leu Cys Val Thr Leu Asn Cys Thr Asp Cys Val Thr Ser Asn Cys Thr
100 105 110
Lys Leu Lys Asn Val Thr Asn Asn Ala Asn Ile Ser Lys Met Glu Met
115 120 125
Glu Glu Gly Glu Met Lys Asn Cys Ser Phe Asn Ile Thr Ser Gly Met
130 135 140
Arg Asp Lys Met Lys Lys Glu Tyr Ala Phe Phe Tyr Lys Leu Asp Ile
145 150 155 160
Val Pro Ile Ser Asn Asp Asn Thr Ser Tyr Arg Leu Ile Ser Cys Asn
165 170 175
Thr Ser Val Ile Thr Gln Ala Cys Pro Lys Val Ser Phe Glu Pro Ile
180 185 190
Pro Ile His Tyr Cys Ala Pro Ala Gly Phe Ala Ile Leu Lys Cys Asn
195 200 205
Asp Lys Lys Phe Asn Gly Thr Gly Pro Cys Lys Asn Val Ser Thr Val
210 215 220
Gln Cys Thr His Gly Ile Arg Pro Val Val Ser Thr Gln Leu Leu Leu
225 230 235 240
Asn Gly Ser Leu Ala Glu Glu Glu Val Val Ile Arg Ser Glu Asn Leu
245 250 255
Thr Asp Asn Gly Lys Thr Ile Ile Val Gln Leu Asn Glu Ser Val His
260 265 270
Ile Asn Cys Thr Arg Pro Ser Asn Asn Thr Arg Lys Ser Ile His Ile
275 280 285
Gly Pro Gly Lys Ala Phe Tyr Ala Thr Gly Gln Ile Ile Gly Asp Ile
290 295 300
Arg Gln Ala His Cys Asn Ile Ser Glu Lys Gln Trp Asn Lys Thr Leu
305 310 315 320
Ser Gln Ile Val Lys Lys Leu Arg Glu Gln Phe Lys Asn Lys Thr Ile
325 330 335
Val Phe Asn Gln Ser Ser Gly Gly Asp Pro Glu Ile Val Met His Ser
340 345 350
Phe Asn Cys Gly Gly Glu Phe Phe Tyr Cys Asn Ser Thr Gln Leu Phe
355 360 365
Asn Thr Thr Trp Asn Trp Asn Asp Thr Glu Gly Ser Asn Asn Thr Glu
370 375 380
Arg Asn Glu Arg Asn Ile Thr Leu Pro Cys Arg Ile Lys Gln Ile Val
385 390 395 400
Asn Arg Trp Gln Glu Val Gly Lys Ala Met Tyr Ala Pro Pro Ile Ser
405 410 415
Gly Pro Ile Ser Cys Ser Ser Asn Ile Thr Gly Leu Leu Leu Thr Arg
420 425 430
Asp Gly Gly Leu Pro Asn Asn Thr Glu Thr Phe Arg Pro Glu Gly Gly
435 440 445
Asn Met Lys Asp Asn Trp Arg Ser Glu Leu Tyr Lys Tyr Lys Val Val
450 455 460
Lys Ile Glu Pro Leu Gly Ile Ala Pro Thr Lys Ala Lys Arg Arg Val
465 470 475 480
Val Gln Arg Glu Lys Arg
485
<210> 103
<211> 1610
<212> DNA
<213> Human immunodeficiency virus type 1
<400> 103
agaaagagca gaagacagtg gcaatgagag tgaaggggat caggaagaat tatcagcact 60
ggtggaaatg gggcacgatg ctccttggaa tgttgatgat ctgtagtgct gcagaacaat 120
tgtgggttac agtctattat ggggtacctg tgtggaagga agcaaagacc actctattct 180
gtgcatcaga tgctaaagca tatgaaacag aggtacataa tgtttgggcc gcacatgcct 240
gtgtacccac agaccccaac ccacaagaag tagtattggc aaatgtgaca gaaaatttta 300
atatgtggga caacaacaag gtagaacaga tgcaggagga tataatcagt ttatgggatg 360
aaggcctaaa gccatgtgta aaattaaccc cactctgtat tactctaaat tgcactgacc 420
cctgtgatgc tcaaaatagc actcgaagtt gcacttattt gaatgatacg gtggaagagg 480
agagaggaca aataagaaat tgctctttca atatctccac aaagctagaa aatagaaggc 540
agacaggata tgcagttttt gataaacttg atttagtacc agtagatggt ggtaataata 600
ctgtcagata taggttgata aattgtaaca cctcagtcat tacacaagca tgtccaaagg 660
tatcctttga accaattccc atacattatt gtaccccagc tggttttgcg attctaaagt 720
gtaatgataa gaagttcaat ggaacagggc catgcacaaa tgtcagcaca gtacaatgca 780
cacatggaat taggccagta gtgtcaactc aactgctgtt aaatggcagt ctggcagaag 840
aagaggtagt aattagatct aaaaatttca cagacaatac taaaactata atagtacagc 900
tgaatgaatc tgtacaaatt aattgtacaa gacccaacaa caatacaaga cagagtacac 960
ctatgggacc agggaaagca ctttacacaa cacagataat aggagacata agacaagcac 1020
attgtaacat tagtacaaaa gattggaatg acaccttaaa aaagatagtt acaaaattag 1080
aagaacaatt tgaaaataaa acaataagct ttaatcaatc ctcaggaggg gacccagaaa 1140
ttgtaaagca cagttttaat tgtggagggg aattcttcta ctgtgataca acaaaactat 1200
ttaatagtac ttggcctgca agtatcaatt ataccactgg agtaaatatc actggagtta 1260
tcacactccc atgtagaata aaacaaatta taaacaggtg gcagggagga ggaaaggcaa 1320
tgtatgctcc tcccatcagt ggaccaatta gatgttcatc aaatattaca gggctgctat 1380
taacaagaga tggtggtgat gacaataatg gggctgagac ctttagacct ggaggaggag 1440
atatgaggga caattggaga agtgaattat ataaatataa agtagtaaaa attgaaccat 1500
taggagtagc acccaccaag gcaaagagaa gagtggtgca gagagaaaaa agagcagtgg 1560
gaacactagg agctatgttc cttgggttct tgggagcagc aggaagcact 1610
<210> 104
<211> 483
<212> PRT
<213> Human immunodeficiency virus type 1
<400> 104
Ser Ala Ala Glu Gln Leu Trp Val Thr Val Tyr Tyr Gly Val Pro Val
1 5 10 15
Trp Lys Glu Ala Lys Thr Thr Leu Phe Cys Ala Ser Asp Ala Lys Ala
20 25 30
Tyr Glu Thr Glu Val His Asn Val Trp Ala Ala His Ala Cys Val Pro
35 40 45
Thr Asp Pro Asn Pro Gln Glu Val Val Leu Ala Asn Val Thr Glu Asn
50 55 60
Phe Asn Met Trp Asp Asn Asn Lys Val Glu Gln Met Gln Glu Asp Ile
65 70 75 80
Ile Ser Leu Trp Asp Glu Gly Leu Lys Pro Cys Val Lys Leu Thr Pro
85 90 95
Leu Cys Ile Thr Leu Asn Cys Thr Asp Pro Cys Asp Ala Gln Asn Ser
100 105 110
Thr Arg Ser Cys Thr Tyr Leu Asn Asp Thr Val Glu Glu Glu Arg Gly
115 120 125
Gln Ile Arg Asn Cys Ser Phe Asn Ile Ser Thr Lys Leu Glu Asn Arg
130 135 140
Arg Gln Thr Gly Tyr Ala Val Phe Asp Lys Leu Asp Leu Val Pro Val
145 150 155 160
Asp Gly Gly Asn Asn Thr Val Arg Tyr Arg Leu Ile Asn Cys Asn Thr
165 170 175
Ser Val Ile Thr Gln Ala Cys Pro Lys Val Ser Phe Glu Pro Ile Pro
180 185 190
Ile His Tyr Cys Thr Pro Ala Gly Phe Ala Ile Leu Lys Cys Asn Asp
195 200 205
Lys Lys Phe Asn Gly Thr Gly Pro Cys Thr Asn Val Ser Thr Val Gln
210 215 220
Cys Thr His Gly Ile Arg Pro Val Val Ser Thr Gln Leu Leu Leu Asn
225 230 235 240
Gly Ser Leu Ala Glu Glu Glu Val Val Ile Arg Ser Lys Asn Phe Thr
245 250 255
Asp Asn Thr Lys Thr Ile Ile Val Gln Leu Asn Glu Ser Val Gln Ile
260 265 270
Asn Cys Thr Arg Pro Asn Asn Asn Thr Arg Gln Ser Thr Pro Met Gly
275 280 285
Pro Gly Lys Ala Leu Tyr Thr Thr Gln Ile Ile Gly Asp Ile Arg Gln
290 295 300
Ala His Cys Asn Ile Ser Thr Lys Asp Trp Asn Asp Thr Leu Lys Lys
305 310 315 320
Ile Val Thr Lys Leu Glu Glu Gln Phe Glu Asn Lys Thr Ile Ser Phe
325 330 335
Asn Gln Ser Ser Gly Gly Asp Pro Glu Ile Val Lys His Ser Phe Asn
340 345 350
Cys Gly Gly Glu Phe Phe Tyr Cys Asp Thr Thr Lys Leu Phe Asn Ser
355 360 365
Thr Trp Pro Ala Ser Ile Asn Tyr Thr Thr Gly Val Asn Ile Thr Gly
370 375 380
Val Ile Thr Leu Pro Cys Arg Ile Lys Gln Ile Ile Asn Arg Trp Gln
385 390 395 400
Gly Gly Gly Lys Ala Met Tyr Ala Pro Pro Ile Ser Gly Pro Ile Arg
405 410 415
Cys Ser Ser Asn Ile Thr Gly Leu Leu Leu Thr Arg Asp Gly Gly Asp
420 425 430
Asp Asn Asn Gly Ala Glu Thr Phe Arg Pro Gly Gly Gly Asp Met Arg
435 440 445
Asp Asn Trp Arg Ser Glu Leu Tyr Lys Tyr Lys Val Val Lys Ile Glu
450 455 460
Pro Leu Gly Val Ala Pro Thr Lys Ala Lys Arg Arg Val Val Gln Arg
465 470 475 480
Glu Lys Arg
<210> 105
<211> 1589
<212> DNA
<213> Human immunodeficiency virus type 1
<400> 105
agaaagagca gaagacagtg gcaatgaaag tgagggagac caggaggaat tattgggact 60
tgtggagatg gggcacgatg ctccttggga tgttaatgat ctgtagtgct acagaacaat 120
tatgggtcac aatctattat ggggtacctg tgtggaaaga agcagatacc actctatttt 180
gtgcatcaga tgctaaagca tatgatacag agaaacataa tgtttgggcc acacatgcct 240
gtgtacccac agaccctaac ccacaagaag tagtaatggc aaatgtgaca gaaaatttta 300
acatgtggaa aaataacatg gtagaacaga tgcagaagga tattatcagt ctatgggatg 360
aaagcttaaa accatgtgta aaattaaccc cactctgtat cactttaaat tgcaatactt 420
taaattgcac acataacaat aattgtagta gtttgagaga agaaatgaca aactgctcct 480
tcaatgccac ctcaaaattg agagataagg ttcggaaaca atatgcaatt ttttctaaac 540
ttgatgtggt aaaactagat agtaataata atgaaacaga atataggttg ataaattgta 600
acacctcagt cgttacacag gcctgtccaa aggtatcatt tgagccaatt cccatacatt 660
attgtgcccc ggctggtttt gcgattctaa agtgtaacaa taagacattc gatggaaaag 720
gaccatgtac aaatgtcagc acagtacaat gtacacatgg aattaggcca gtagtgtcaa 780
ctcaactgct gttaaatggc agtctagcag aagaagggat agtaattaga tctgacaata 840
tctcagacaa tactaaaacc ataatagtac agctgaatga atctgtagca attaattgtt 900
caagacccca caacaataca agaaaaagta tacatatagc accagggaga gccttttatg 960
caacaggaga cataatagga gatataagac aagcacattg taacattagt agaaaacaat 1020
ggaataaaac tttagaacag gtagctaaaa aattaagaga aaaatttgta aataaagcaa 1080
ttatctttaa aaactcctca ggaggggacc cagaaattgt aatgcacagt tttaattgta 1140
gaggggaatt tttctactgt aatacaacac cactgtttaa tgggacttgg aatggtacta 1200
atttgagtac taataaggag tcaaatgaca caatcatact ccaatgtaga ataaaacaaa 1260
ttataaacat gtggcaggaa gtaggaaaag caatgtatgc ccctcccatt gcaggacaaa 1320
ttaactgttc atcaaaaatt acagggctac tattaacaag agatggtagt agcacaaatg 1380
ggacaaatga gactgagatc ttcagacctg gaggaggaga tatgagggac aattggagaa 1440
gcgaattata taaatataaa gtagtaagag ttgaaccaat aggaatagca cccaccaagg 1500
caaagagaag agtggtgcag agagaaaaaa gagcagtggg aacgttagga gctatgttcc 1560
ttgggttctt gggagcagca ggaagcact 1589
<210> 106
<211> 476
<212> PRT
<213> Human immunodeficiency virus type 1
<400> 106
Ser Ala Thr Glu Gln Leu Trp Val Thr Ile Tyr Tyr Gly Val Pro Val
1 5 10 15
Trp Lys Glu Ala Asp Thr Thr Leu Phe Cys Ala Ser Asp Ala Lys Ala
20 25 30
Tyr Asp Thr Glu Lys His Asn Val Trp Ala Thr His Ala Cys Val Pro
35 40 45
Thr Asp Pro Asn Pro Gln Glu Val Val Met Ala Asn Val Thr Glu Asn
50 55 60
Phe Asn Met Trp Lys Asn Asn Met Val Glu Gln Met Gln Lys Asp Ile
65 70 75 80
Ile Ser Leu Trp Asp Glu Ser Leu Lys Pro Cys Val Lys Leu Thr Pro
85 90 95
Leu Cys Ile Thr Leu Asn Cys Asn Thr Leu Asn Cys Thr His Asn Asn
100 105 110
Asn Cys Ser Ser Leu Arg Glu Glu Met Thr Asn Cys Ser Phe Asn Ala
115 120 125
Thr Ser Lys Leu Arg Asp Lys Val Arg Lys Gln Tyr Ala Ile Phe Ser
130 135 140
Lys Leu Asp Val Val Lys Leu Asp Ser Asn Asn Asn Glu Thr Glu Tyr
145 150 155 160
Arg Leu Ile Asn Cys Asn Thr Ser Val Val Thr Gln Ala Cys Pro Lys
165 170 175
Val Ser Phe Glu Pro Ile Pro Ile His Tyr Cys Ala Pro Ala Gly Phe
180 185 190
Ala Ile Leu Lys Cys Asn Asn Lys Thr Phe Asp Gly Lys Gly Pro Cys
195 200 205
Thr Asn Val Ser Thr Val Gln Cys Thr His Gly Ile Arg Pro Val Val
210 215 220
Ser Thr Gln Leu Leu Leu Asn Gly Ser Leu Ala Glu Glu Gly Ile Val
225 230 235 240
Ile Arg Ser Asp Asn Ile Ser Asp Asn Thr Lys Thr Ile Ile Val Gln
245 250 255
Leu Asn Glu Ser Val Ala Ile Asn Cys Ser Arg Pro His Asn Asn Thr
260 265 270
Arg Lys Ser Ile His Ile Ala Pro Gly Arg Ala Phe Tyr Ala Thr Gly
275 280 285
Asp Ile Ile Gly Asp Ile Arg Gln Ala His Cys Asn Ile Ser Arg Lys
290 295 300
Gln Trp Asn Lys Thr Leu Glu Gln Val Ala Lys Lys Leu Arg Glu Lys
305 310 315 320
Phe Val Asn Lys Ala Ile Ile Phe Lys Asn Ser Ser Gly Gly Asp Pro
325 330 335
Glu Ile Val Met His Ser Phe Asn Cys Arg Gly Glu Phe Phe Tyr Cys
340 345 350
Asn Thr Thr Pro Leu Phe Asn Gly Thr Trp Asn Gly Thr Asn Leu Ser
355 360 365
Thr Asn Lys Glu Ser Asn Asp Thr Ile Ile Leu Gln Cys Arg Ile Lys
370 375 380
Gln Ile Ile Asn Met Trp Gln Glu Val Gly Lys Ala Met Tyr Ala Pro
385 390 395 400
Pro Ile Ala Gly Gln Ile Asn Cys Ser Ser Lys Ile Thr Gly Leu Leu
405 410 415
Leu Thr Arg Asp Gly Ser Ser Thr Asn Gly Thr Asn Glu Thr Glu Ile
420 425 430
Phe Arg Pro Gly Gly Gly Asp Met Arg Asp Asn Trp Arg Ser Glu Leu
435 440 445
Tyr Lys Tyr Lys Val Val Arg Val Glu Pro Ile Gly Ile Ala Pro Thr
450 455 460
Lys Ala Lys Arg Arg Val Val Gln Arg Glu Lys Arg
465 470 475
<210> 107
<211> 1689
<212> DNA
<213> Human immunodeficiency virus type 1
<400> 107
agaaagagca gaagacagtg gcaatgagag tgacggggat caggaagaat tacttatgga 60
gatggggcac cttattcctg gggatattga tgatctgtaa ggctgcagaa aacttgtggg 120
tcacagtcta ttatggggta cctgtgtgga aagacgcaac caccactcta ttttgtgcat 180
cagatgctaa agctgttgat acagaagtac acaatgtgtg ggccacacat gcctgtgtac 240
ccacagaccc aaacccacaa gaagtagtat tgtataatgt gacagaaaat tttaacatgt 300
ggaaaaataa catggtagaa cagatgcatg aggatataat cagtttatgg gatgaaagtc 360
taaagccatg tgtaaagcta accccactct gtgttacttt gaattgcact gatgaattga 420
atatcaatac taataatacc tgtagtaata ccagtagtgg cactaataat accagtaatt 480
gcactaatgc tgagtcgact atcatcagta ccagtaatag tagcaatacc agaaatatca 540
gtgatagtag caagatagag aaaggagaaa taaaaaactg tactttcaat atcaccacaa 600
gcataagaga taaggtgcag aaagaatatg cactgtttta taaacttgat gtagtaccaa 660
caggtagtga caatactagc tataggttga taagttgtaa tacctcagtc attacacagg 720
cctgtccaaa ggtaaccttt gagccaattc ctatacatta ttgtgccccg gctggttatg 780
cgattgtaaa gtgcaacaat aagacattca gtggaaaagg accatgtaaa aatgtcagca 840
cagtacaatg cacacatggg attaagccag tagtatcaac tcagttgctg ttaaatggca 900
gtctggcaga aaaagatata gtaattagat ctgacaactt ctcaaacaat gctaaaacca 960
taatagtgca gctgaacaca tctgtagaaa ttaattgtac aagacccagc aacaatacaa 1020
gaaaaagtat tcatatagga ccagggagag cattttatgc aacagatata ataggagata 1080
taagacaagc acattgtaac attagtgaag aaaattggta taaaactcta aagcaggtag 1140
ctatgaaatt aggagaacag tttcagaata aaaaaatagt ctttaatcaa tcctcaggag 1200
gggacccaga aattgtaatg cacagtttta attgtggagg ggaatttttc ttctgtagta 1260
catcacaact gtttaatagt acttggaatt atagtacttg ggatgaaata gaaaataaaa 1320
ctcaaggaaa tggcacactc acactcccat gcagaataaa acaaattgta aacatgtggc 1380
aggaagtagg aaaagcaatg tatgcccctc ctatcaaagg aaatattaca tgttcatcaa 1440
agattacagg gttgctatta acaagagatg gtggtaagac taatatcaat ggcaccgaga 1500
tcttcagacc aggaggagga gatatgaggg acaattggag aagtgaacta tataaatata 1560
aagtagtaaa aattgaacca ttaggaatag cacccaccaa ggcaaagaga agagtggtgc 1620
agagagaaaa aagagcagtg ggaataggag ctatgttcct tgggttcttg ggaggcagca 1680
ggaagcact 1689
<210> 108
<211> 512
<212> PRT
<213> Human immunodeficiency virus type 1
<400> 108
Lys Ala Ala Glu Asn Leu Trp Val Thr Val Tyr Tyr Gly Val Pro Val
1 5 10 15
Trp Lys Asp Ala Thr Thr Thr Leu Phe Cys Ala Ser Asp Ala Lys Ala
20 25 30
Val Asp Thr Glu Val His Asn Val Trp Ala Thr His Ala Cys Val Pro
35 40 45
Thr Asp Pro Asn Pro Gln Glu Val Val Leu Tyr Asn Val Thr Glu Asn
50 55 60
Phe Asn Met Trp Lys Asn Asn Met Val Glu Gln Met His Glu Asp Ile
65 70 75 80
Ile Ser Leu Trp Asp Glu Ser Leu Lys Pro Cys Val Lys Leu Thr Pro
85 90 95
Leu Cys Val Thr Leu Asn Cys Thr Asp Glu Leu Asn Ile Asn Thr Asn
100 105 110
Asn Thr Cys Ser Asn Thr Ser Ser Gly Thr Asn Asn Thr Ser Asn Cys
115 120 125
Thr Asn Ala Glu Ser Thr Ile Ile Ser Thr Ser Asn Ser Ser Asn Thr
130 135 140
Arg Asn Ile Ser Asp Ser Ser Lys Ile Glu Lys Gly Glu Ile Lys Asn
145 150 155 160
Cys Thr Phe Asn Ile Thr Thr Ser Ile Arg Asp Lys Val Gln Lys Glu
165 170 175
Tyr Ala Leu Phe Tyr Lys Leu Asp Val Val Pro Thr Gly Ser Asp Asn
180 185 190
Thr Ser Tyr Arg Leu Ile Ser Cys Asn Thr Ser Val Ile Thr Gln Ala
195 200 205
Cys Pro Lys Val Thr Phe Glu Pro Ile Pro Ile His Tyr Cys Ala Pro
210 215 220
Ala Gly Tyr Ala Ile Val Lys Cys Asn Asn Lys Thr Phe Ser Gly Lys
225 230 235 240
Gly Pro Cys Lys Asn Val Ser Thr Val Gln Cys Thr His Gly Ile Lys
245 250 255
Pro Val Val Ser Thr Gln Leu Leu Leu Asn Gly Ser Leu Ala Glu Lys
260 265 270
Asp Ile Val Ile Arg Ser Asp Asn Phe Ser Asn Asn Ala Lys Thr Ile
275 280 285
Ile Val Gln Leu Asn Thr Ser Val Glu Ile Asn Cys Thr Arg Pro Ser
290 295 300
Asn Asn Thr Arg Lys Ser Ile His Ile Gly Pro Gly Arg Ala Phe Tyr
305 310 315 320
Ala Thr Asp Ile Ile Gly Asp Ile Arg Gln Ala His Cys Asn Ile Ser
325 330 335
Glu Glu Asn Trp Tyr Lys Thr Leu Lys Gln Val Ala Met Lys Leu Gly
340 345 350
Glu Gln Phe Gln Asn Lys Lys Ile Val Phe Asn Gln Ser Ser Gly Gly
355 360 365
Asp Pro Glu Ile Val Met His Ser Phe Asn Cys Gly Gly Glu Phe Phe
370 375 380
Phe Cys Ser Thr Ser Gln Leu Phe Asn Ser Thr Trp Asn Tyr Ser Thr
385 390 395 400
Trp Asp Glu Ile Glu Asn Lys Thr Gln Gly Asn Gly Thr Leu Thr Leu
405 410 415
Pro Cys Arg Ile Lys Gln Ile Val Asn Met Trp Gln Glu Val Gly Lys
420 425 430
Ala Met Tyr Ala Pro Pro Ile Lys Gly Asn Ile Thr Cys Ser Ser Lys
435 440 445
Ile Thr Gly Leu Leu Leu Thr Arg Asp Gly Gly Lys Thr Asn Ile Asn
450 455 460
Gly Thr Glu Ile Phe Arg Pro Gly Gly Gly Asp Met Arg Asp Asn Trp
465 470 475 480
Arg Ser Glu Leu Tyr Lys Tyr Lys Val Val Lys Ile Glu Pro Leu Gly
485 490 495
Ile Ala Pro Thr Lys Ala Lys Arg Arg Val Val Gln Arg Glu Lys Arg
500 505 510
<210> 109
<211> 1616
<212> DNA
<213> Human immunodeficiency virus type 1
<400> 109
agaaagagca gaagacagtg gcaatgagag tgatggggat caggaggaat tatcagcact 60
tgtggagatg gggcaccatg ctccttggga tgttgatgat ctgtaaaatc actacaggaa 120
aaacgtgggt cacagtctat tatggggtac ctgtgtggaa agaagcaact accactctat 180
tttgtgcatc agatgctaaa gcatatgata cagaggtaca taatgtttgg gccacacatg 240
cctgtgtacc cacagacccc aacccacaaa aaatagtaat ggtaaatgtg acagaggaat 300
ttaacatgtg gaaaaataac atggtagaac agatgcatga ggatataatc agtttatggg 360
atcagagcct aaagccatgt gtaaaattaa ccccactctg cgttacttta aattgcaatg 420
atactacgga gagaaattgc actggtcccg atggtagaaa aataaattgc actgaagtga 480
aaaattgctc tctcaatatc accacaaaca taagagataa ggtgcagaaa gaatatgcac 540
ttttttatag aactgatgtg gtgccaatag ataataataa tactagtacc agcagtcata 600
gctataggtt gataagttgt aatacctcag tccttacaca gacctgtcca aaagtatcct 660
ttcagccaat tcccatacat tattgtgccc cggctggttt tgcgattcta aaatgtaaca 720
ataggacatt caatggaaaa ggagaatgtg gtaatgtcag cacagtacaa tgtacacatg 780
gaattaagcc agtagtatca actcagctgc tgctaagtgg cagtctagca gaacaagaga 840
tagtgcttag atctgacaac ttctcagaca atgctaaaac cataatagta cagctgacta 900
aacctgtaga aattaattgt acacgaccca acaacaatac aagaaaaagt atagctatag 960
gaccagggag agcatttatt gcaacaggag acataatagg agatataaga caggcacatt 1020
gtaacattag taaagtagca tggaataaca ctatagaaga ggtagcaaga aaattaagca 1080
aacaatttgg gaatagatca ataaccttta atcaatcctc aggaggggac ccagaaattg 1140
taatgcacag ttttaattgt ggaggggaat ttttctactg taatacaaca ggactattta 1200
gtagtacttg gaatgttact gataattgga atgctactga agaggcaaat accactaggg 1260
taaatatcac actcccatgc agaataagac aaattgtaaa catgtggcag gaagtaggaa 1320
aagcaatgta tgcccctccc atcagaggac aaattagttg ctcatcaaat attacaggac 1380
tgatattaac aagagatggt ggtaaccaga gcaacgagac gacccctgag atctttagac 1440
ctggaggagg agatatgagg gacaattgga gaagtgaatt atataaatat aaagtagtaa 1500
aaattgaacc tataggaata gcacccacca aggcaaagag aagagtggtg cagagagaaa 1560
aaagaggggt actaggagct atgttccttg ggttcttggg agcagcagga agcact 1616
<210> 110
<211> 487
<212> PRT
<213> Human immunodeficiency virus type 1
<400> 110
Lys Ile Thr Thr Gly Lys Thr Trp Val Thr Val Tyr Tyr Gly Val Pro
1 5 10 15
Val Trp Lys Glu Ala Thr Thr Thr Leu Phe Cys Ala Ser Asp Ala Lys
20 25 30
Ala Tyr Asp Thr Glu Val His Asn Val Trp Ala Thr His Ala Cys Val
35 40 45
Pro Thr Asp Pro Asn Pro Gln Lys Ile Val Met Val Asn Val Thr Glu
50 55 60
Glu Phe Asn Met Trp Lys Asn Asn Met Val Glu Gln Met His Glu Asp
65 70 75 80
Ile Ile Ser Leu Trp Asp Gln Ser Leu Lys Pro Cys Val Lys Leu Thr
85 90 95
Pro Leu Cys Val Thr Leu Asn Cys Asn Asp Thr Thr Glu Arg Asn Cys
100 105 110
Thr Gly Pro Asp Gly Arg Lys Ile Asn Cys Thr Glu Val Lys Asn Cys
115 120 125
Ser Leu Asn Ile Thr Thr Asn Ile Arg Asp Lys Val Gln Lys Glu Tyr
130 135 140
Ala Leu Phe Tyr Arg Thr Asp Val Val Pro Ile Asp Asn Asn Asn Thr
145 150 155 160
Ser Thr Ser Ser His Ser Tyr Arg Leu Ile Ser Cys Asn Thr Ser Val
165 170 175
Leu Thr Gln Thr Cys Pro Lys Val Ser Phe Gln Pro Ile Pro Ile His
180 185 190
Tyr Cys Ala Pro Ala Gly Phe Ala Ile Leu Lys Cys Asn Asn Arg Thr
195 200 205
Phe Asn Gly Lys Gly Glu Cys Gly Asn Val Ser Thr Val Gln Cys Thr
210 215 220
His Gly Ile Lys Pro Val Val Ser Thr Gln Leu Leu Leu Ser Gly Ser
225 230 235 240
Leu Ala Glu Gln Glu Ile Val Leu Arg Ser Asp Asn Phe Ser Asp Asn
245 250 255
Ala Lys Thr Ile Ile Val Gln Leu Thr Lys Pro Val Glu Ile Asn Cys
260 265 270
Thr Arg Pro Asn Asn Asn Thr Arg Lys Ser Ile Ala Ile Gly Pro Gly
275 280 285
Arg Ala Phe Ile Ala Thr Gly Asp Ile Ile Gly Asp Ile Arg Gln Ala
290 295 300
His Cys Asn Ile Ser Lys Val Ala Trp Asn Asn Thr Ile Glu Glu Val
305 310 315 320
Ala Arg Lys Leu Ser Lys Gln Phe Gly Asn Arg Ser Ile Thr Phe Asn
325 330 335
Gln Ser Ser Gly Gly Asp Pro Glu Ile Val Met His Ser Phe Asn Cys
340 345 350
Gly Gly Glu Phe Phe Tyr Cys Asn Thr Thr Gly Leu Phe Ser Ser Thr
355 360 365
Trp Asn Val Thr Asp Asn Trp Asn Ala Thr Glu Glu Ala Asn Thr Thr
370 375 380
Arg Val Asn Ile Thr Leu Pro Cys Arg Ile Arg Gln Ile Val Asn Met
385 390 395 400
Trp Gln Glu Val Gly Lys Ala Met Tyr Ala Pro Pro Ile Arg Gly Gln
405 410 415
Ile Ser Cys Ser Ser Asn Ile Thr Gly Leu Ile Leu Thr Arg Asp Gly
420 425 430
Gly Asn Gln Ser Asn Glu Thr Thr Pro Glu Ile Phe Arg Pro Gly Gly
435 440 445
Gly Asp Met Arg Asp Asn Trp Arg Ser Glu Leu Tyr Lys Tyr Lys Val
450 455 460
Val Lys Ile Glu Pro Ile Gly Ile Ala Pro Thr Lys Ala Lys Arg Arg
465 470 475 480
Val Val Gln Arg Glu Lys Arg
485
<210> 111
<211> 1694
<212> DNA
<213> Human immunodeficiency virus type 1
<400> 111
agaaagagca gaagacagtg gcaatgagag tgaaggggat caggaagaat tgtcactcct 60
ggagatgggg caccatgctc ctatggagat ggggcaccat gctccttggg ttgttaatga 120
tgatctgtaa tgctaaagaa gaaaaaacgt gggtcacagt atattatggg gtacctgtgt 180
ggaaagaagc gaccaccact ctattttgtg catcagatgc taaagcatat gacacagagg 240
tacataatgt ttgggccaca catgcctgtg tacctacaga ccccgaccca caggaaatat 300
tcttggaaaa tgtgacagaa aattttaaca tgtggaaaaa taacatggta gaacagatgc 360
atgaggatat aatcagtttg tgggatcaaa gcctaaagcc atgtgtaaaa ttaaccccac 420
tctgtgttac gttaaattgc actaatgtga ggattactag cataaattgc actgatgcga 480
gtagtaatag cactggtgtg gaaaaaacta ctagcactaa tggggggaat gttactattt 540
gtaatactac tgaggagata aaaaactgct ccttcaacat caccaccaac atgaaagata 600
agatacagaa gacatatgca ctgttttata aacttgatgt agaaccaata gataagggga 660
atgagaatag caccagctat agggtgataa gttgtaacac ttcaaccatt acacaggcct 720
gtccaaaggt atcctttgag ccaattccta tacattattg tgccccggct ggttttgcga 780
ttctaaaatg tagagataaa aggtttaatg gaacaggacc atgtacaaat gtcagcacag 840
tacaatgtac acatggaatt aaaccagtag tatcaactca actgctgtta aatggcagtc 900
tagcagaaga agagatagta cttagatctg aaaatttctc gaacaatgct aaaaacatat 960
taatacagct aaaagaacct gtagaaatta attgtacaag acccaacaac aatacaagaa 1020
aaggtataca tataggacca ggaagagcat tttatggaac agatataata ggagatataa 1080
gacaagcaca ttgtaacatt agcagagaaa aatggaatag cactttaagt cagatagttg 1140
ataaattaag agaacaatat gggaaaaata aaacaataat ctttgatcaa ccatcaggag 1200
gggacccaga aattgtaacg cacagtttta attgtggagg agaatttttc tactgtaatt 1260
caacacaact gtttaatagc acttggtatg ataatagtac ttggaatgag aataaaaatc 1320
acactaaaaa tgacacaatc acgctcccat gcagaataaa gcaatttata aacatgtggc 1380
aggaagtagg aaaagcaatg tatgcccctc ccatcagagg acaaattaaa tgctcatcaa 1440
atattacagg gctaataata acaagagatg gggggaacaa caacagcgag accttcaaca 1500
acgagacctt cagacctgga ggaggaaata tgaaggacaa ttggagaagt gaattatata 1560
aatataaagt agtaagaatt gagccattag gagtagcacc caccagggca aagaggagag 1620
tggtgcagag agaaaaaaga gcagtgggaa taggagctgt gttccttggg ttcttgggag 1680
cagcaggaag cact 1694
<210> 112
<211> 504
<212> PRT
<213> Human immunodeficiency virus type 1
<400> 112
Asn Ala Lys Glu Glu Lys Thr Trp Val Thr Val Tyr Tyr Gly Val Pro
1 5 10 15
Val Trp Lys Glu Ala Thr Thr Thr Leu Phe Cys Ala Ser Asp Ala Lys
20 25 30
Ala Tyr Asp Thr Glu Val His Asn Val Trp Ala Thr His Ala Cys Val
35 40 45
Pro Thr Asp Pro Asp Pro Gln Glu Ile Phe Leu Glu Asn Val Thr Glu
50 55 60
Asn Phe Asn Met Trp Lys Asn Asn Met Val Glu Gln Met His Glu Asp
65 70 75 80
Ile Ile Ser Leu Trp Asp Gln Ser Leu Lys Pro Cys Val Lys Leu Thr
85 90 95
Pro Leu Cys Val Thr Leu Asn Cys Thr Asn Val Arg Ile Thr Ser Ile
100 105 110
Asn Cys Thr Asp Ala Ser Ser Asn Ser Thr Gly Val Glu Lys Thr Thr
115 120 125
Ser Thr Asn Gly Gly Asn Val Thr Ile Cys Asn Thr Thr Glu Glu Ile
130 135 140
Lys Asn Cys Ser Phe Asn Ile Thr Thr Asn Met Lys Asp Lys Ile Gln
145 150 155 160
Lys Thr Tyr Ala Leu Phe Tyr Lys Leu Asp Val Glu Pro Ile Asp Lys
165 170 175
Gly Asn Glu Asn Ser Thr Ser Tyr Arg Val Ile Ser Cys Asn Thr Ser
180 185 190
Thr Ile Thr Gln Ala Cys Pro Lys Val Ser Phe Glu Pro Ile Pro Ile
195 200 205
His Tyr Cys Ala Pro Ala Gly Phe Ala Ile Leu Lys Cys Arg Asp Lys
210 215 220
Arg Phe Asn Gly Thr Gly Pro Cys Thr Asn Val Ser Thr Val Gln Cys
225 230 235 240
Thr His Gly Ile Lys Pro Val Val Ser Thr Gln Leu Leu Leu Asn Gly
245 250 255
Ser Leu Ala Glu Glu Glu Ile Val Leu Arg Ser Glu Asn Phe Ser Asn
260 265 270
Asn Ala Lys Asn Ile Leu Ile Gln Leu Lys Glu Pro Val Glu Ile Asn
275 280 285
Cys Thr Arg Pro Asn Asn Asn Thr Arg Lys Gly Ile His Ile Gly Pro
290 295 300
Gly Arg Ala Phe Tyr Gly Thr Asp Ile Ile Gly Asp Ile Arg Gln Ala
305 310 315 320
His Cys Asn Ile Ser Arg Glu Lys Trp Asn Ser Thr Leu Ser Gln Ile
325 330 335
Val Asp Lys Leu Arg Glu Gln Tyr Gly Lys Asn Lys Thr Ile Ile Phe
340 345 350
Asp Gln Pro Ser Gly Gly Asp Pro Glu Ile Val Thr His Ser Phe Asn
355 360 365
Cys Gly Gly Glu Phe Phe Tyr Cys Asn Ser Thr Gln Leu Phe Asn Ser
370 375 380
Thr Trp Tyr Asp Asn Ser Thr Trp Asn Glu Asn Lys Asn His Thr Lys
385 390 395 400
Asn Asp Thr Ile Thr Leu Pro Cys Arg Ile Lys Gln Phe Ile Asn Met
405 410 415
Trp Gln Glu Val Gly Lys Ala Met Tyr Ala Pro Pro Ile Arg Gly Gln
420 425 430
Ile Lys Cys Ser Ser Asn Ile Thr Gly Leu Ile Ile Thr Arg Asp Gly
435 440 445
Gly Asn Asn Asn Ser Glu Thr Phe Asn Asn Glu Thr Phe Arg Pro Gly
450 455 460
Gly Gly Asn Met Lys Asp Asn Trp Arg Ser Glu Leu Tyr Lys Tyr Lys
465 470 475 480
Val Val Arg Ile Glu Pro Leu Gly Val Ala Pro Thr Arg Ala Lys Arg
485 490 495
Arg Val Val Gln Arg Glu Lys Arg
500
<210> 113
<211> 1685
<212> DNA
<213> Human immunodeficiency virus type 1
<400> 113
agaaagagca gaagacagtg gcaatgagag tgaaggggac caagacgaat tatcagcact 60
tatggagatg gggcatcatg ctcctgggga tattgatgat ctgtagtgct acagaacaat 120
ggtgggtcac agtctattat ggagtacctg tgtggagaga tgcaaatacc actctatttt 180
gtgcatcaga ttctaaagca tatgctacag aggtacataa tgtttgggcc acacatgcct 240
gtgtacccac agaccccaac ccacaagaaa taaacttgga aaatgtaaca gaagagttta 300
acatatggaa aaataacatg gtagaacaaa tgcatgagga tataatcagt ttatgggatc 360
aaagcctaaa gccatgtgta aaattaaccc cactctgtgt tactttaaat tgcactgatt 420
acaatggtac tcataccaat actactaata ccactagtat ttatggggaa aagatggaaa 480
taggagaagt aaagaaatgc tctttcaatg ctaccacaat cataagagat aaggtggaca 540
aagaagaagc acttttttat aaacttgata tagtaccaat agatggtaat aatgagacta 600
acatagttaa taatgggact aataatacta gtaccaacta taccagctat aggctaataa 660
attgtaacac ctcagtcatt acacaggcct gtccagaggt atcctttgag ccaattccca 720
tacattattg tgccccggct ggttttgcga ttctaaagtg taaagagaag gcgttcaatg 780
gaagtgggcc atgtaaaaat gtcagctcag tacaatgtac acatggaatt aagccagtag 840
tatcaactca attgccgtta aatggcagtc tagcagaaga agaagtagta attagatctg 900
aaaatttcac aaacaatgct aaaaccataa tagtacagct gaaagaagct gtaaacatta 960
gttgtataag gcccaacaac aatacaagaa aaagtatacc tataggacca gggagagcat 1020
tttatgcaac aggagacata ataggagata taggacaagc acattgtaac cttagtagaa 1080
caaattggaa taaaacttta caacagatag ctacaaaatt aggagaaaag tttaataaaa 1140
caacaataat ctttaatcaa tcctcaggag gggacccaga aattgtaatg cacagtttta 1200
tttgtggagg ggaatttttc tactgtaata caacacaact gtttaatagt acttggaact 1260
gtactgagaa tgggaattgt acactggtta ccggtacttg gcctgacagg ccaaatagaa 1320
ctggagaaaa tgacacaatc acactcccat gcagaataaa acaaatcata aacctgtggc 1380
aggaagtagg aaaagcaatg tatgcctctc ccatccaggg actaattaat tgtacatcaa 1440
atattacagg gctgctatta acaagagatg gtggtaccca tagtagacag aatgagacct 1500
tcagacctga aggaggaaat atgaaagaca attggagaag tgaattatat aaatataaag 1560
tagtaagaat tgaaccatta ggagtagcac ccaccaaggc aaagagaaga gtggtgcaga 1620
gagaaaaaag agcagtggga ttgggagcta tgatccttgg gttcttggga gcagcaggaa 1680
gcact 1685
<210> 114
<211> 509
<212> PRT
<213> Human immunodeficiency virus type 1
<400> 114
Ser Ala Thr Glu Gln Trp Trp Val Thr Val Tyr Tyr Gly Val Pro Val
1 5 10 15
Trp Arg Asp Ala Asn Thr Thr Leu Phe Cys Ala Ser Asp Ser Lys Ala
20 25 30
Tyr Ala Thr Glu Val His Asn Val Trp Ala Thr His Ala Cys Val Pro
35 40 45
Thr Asp Pro Asn Pro Gln Glu Ile Asn Leu Glu Asn Val Thr Glu Glu
50 55 60
Phe Asn Ile Trp Lys Asn Asn Met Val Glu Gln Met His Glu Asp Ile
65 70 75 80
Ile Ser Leu Trp Asp Gln Ser Leu Lys Pro Cys Val Lys Leu Thr Pro
85 90 95
Leu Cys Val Thr Leu Asn Cys Thr Asp Tyr Asn Gly Thr His Thr Asn
100 105 110
Thr Thr Asn Thr Thr Ser Ile Tyr Gly Glu Lys Met Glu Ile Gly Glu
115 120 125
Val Lys Lys Cys Ser Phe Asn Ala Thr Thr Ile Ile Arg Asp Lys Val
130 135 140
Asp Lys Glu Glu Ala Leu Phe Tyr Lys Leu Asp Ile Val Pro Ile Asp
145 150 155 160
Gly Asn Asn Glu Thr Asn Ile Val Asn Asn Gly Thr Asn Asn Thr Ser
165 170 175
Thr Asn Tyr Thr Ser Tyr Arg Leu Ile Asn Cys Asn Thr Ser Val Ile
180 185 190
Thr Gln Ala Cys Pro Glu Val Ser Phe Glu Pro Ile Pro Ile His Tyr
195 200 205
Cys Ala Pro Ala Gly Phe Ala Ile Leu Lys Cys Lys Glu Lys Ala Phe
210 215 220
Asn Gly Ser Gly Pro Cys Lys Asn Val Ser Ser Val Gln Cys Thr His
225 230 235 240
Gly Ile Lys Pro Val Val Ser Thr Gln Leu Pro Leu Asn Gly Ser Leu
245 250 255
Ala Glu Glu Glu Val Val Ile Arg Ser Glu Asn Phe Thr Asn Asn Ala
260 265 270
Lys Thr Ile Ile Val Gln Leu Lys Glu Ala Val Asn Ile Ser Cys Ile
275 280 285
Arg Pro Asn Asn Asn Thr Arg Lys Ser Ile Pro Ile Gly Pro Gly Arg
290 295 300
Ala Phe Tyr Ala Thr Gly Asp Ile Ile Gly Asp Ile Gly Gln Ala His
305 310 315 320
Cys Asn Leu Ser Arg Thr Asn Trp Asn Lys Thr Leu Gln Gln Ile Ala
325 330 335
Thr Lys Leu Gly Glu Lys Phe Asn Lys Thr Thr Ile Ile Phe Asn Gln
340 345 350
Ser Ser Gly Gly Asp Pro Glu Ile Val Met His Ser Phe Ile Cys Gly
355 360 365
Gly Glu Phe Phe Tyr Cys Asn Thr Thr Gln Leu Phe Asn Ser Thr Trp
370 375 380
Asn Cys Thr Glu Asn Gly Asn Cys Thr Leu Val Thr Gly Thr Trp Pro
385 390 395 400
Asp Arg Pro Asn Arg Thr Gly Glu Asn Asp Thr Ile Thr Leu Pro Cys
405 410 415
Arg Ile Lys Gln Ile Ile Asn Leu Trp Gln Glu Val Gly Lys Ala Met
420 425 430
Tyr Ala Ser Pro Ile Gln Gly Leu Ile Asn Cys Thr Ser Asn Ile Thr
435 440 445
Gly Leu Leu Leu Thr Arg Asp Gly Gly Thr His Ser Arg Gln Asn Glu
450 455 460
Thr Phe Arg Pro Glu Gly Gly Asn Met Lys Asp Asn Trp Arg Ser Glu
465 470 475 480
Leu Tyr Lys Tyr Lys Val Val Arg Ile Glu Pro Leu Gly Val Ala Pro
485 490 495
Thr Lys Ala Lys Arg Arg Val Val Gln Arg Glu Lys Arg
500 505
<210> 115
<211> 1632
<212> DNA
<213> Human immunodeficiency virus type 1
<400> 115
agaaagagca aaacagtggc aatgaaagtg aaggggacca ggatgaattg tcagcgctgg 60
tggtggacat ggggcacgat gctccttggg atgttgatga tctgtagtgc tgcagaaaag 120
ttgtgggtca cagtctatta tggggtacct gtgtggaaag aagcaaccac cactctattt 180
tgtgcatcag atgctaaagc atataagaca gaaaagcata atgtctgggc cacacatgcc 240
tgtgtaccca caaaccccaa cccacaagaa gtagtaatgg aaaatgtaac agaatatttt 300
aacatgtgga aaaataacat ggtagaacag atgcaggagg atataatcag tttatgggat 360
caaagcctaa agccatgtgt aaaattaacc ccactctgtg taactttaac ttgtgtgaat 420
attactaact gtaagaataa tactaactgt aataatgata ctaacagtaa gaatgatact 480
cttaaggagg agatagggga aataaaaaac tgctctttca acgtcaccac agccataaga 540
gataaggtgc agaaagaata tgcattattt cataaacttg atgtagtaca aatagataat 600
gataatacta gtagtaatac ttctaagcct tataggttga taagttgtaa cacctcagtc 660
attacacagg cctgtccaaa ggtaaccttt gagccaattc ccatacatta ttgtgcctcg 720
gctggttttg cgattctaaa gtgtaacaat aagactttca atggaacagg accatgtaca 780
aatgtcagca cagtacaatg tacacatgga attaggccag tagtatcaac tcaactgttg 840
ttaaatgaca gcctagcaga aaaagaggca atagttagat ctgaaaattt cacaaacaac 900
gctaaaatca taatagtaca gctaaatgaa tctgtagaga ttaattgtac aagacccaac 960
aacaatacaa gaagaagtat acctatagga ccagggaaag cattttttac atcagaaata 1020
ataggagata taagaaaagc acactgtaac attagtggaa caaagtggaa tgccactttg 1080
cataaaatag ctataaaatt aagagaacaa tatggaaata aaacaatagt ctttaatcaa 1140
ccttcaggag gggacccaga agttgtaatg cacagtttta actgtggagg ggaatttttc 1200
tactgtgata caacacaact gtttaatagt acttggttta atagtacttg gccaaatatc 1260
acacttgaag aaaatatcac actcccatgc aaaataaaac aatttataaa catgtggcag 1320
gaagtaggaa aagcaatgta tgcccctccc atcagaggac aaattaactg ttcatcacag 1380
attacagggc tgctattaac aagagatggt ggtcagggta acaatactaa caacgacact 1440
gagattttca gaccaggggg aggagatatg agggacaatt ggagaagtga attatacaaa 1500
tataaagtag taagaattga gccattggga gtagcaccca ccaaggcaat gagaagagtg 1560
gtgcagagag aaaaaagagc aataggacta ggagcttttt tccttgggtt cttgggagca 1620
gcaggaagca ct 1632
<210> 116
<211> 491
<212> PRT
<213> Human immunodeficiency virus type 1
<400> 116
Ser Ala Ala Glu Lys Leu Trp Val Thr Val Tyr Tyr Gly Val Pro Val
1 5 10 15
Trp Lys Glu Ala Thr Thr Thr Leu Phe Cys Ala Ser Asp Ala Lys Ala
20 25 30
Tyr Lys Thr Glu Lys His Asn Val Trp Ala Thr His Ala Cys Val Pro
35 40 45
Thr Asn Pro Asn Pro Gln Glu Val Val Met Glu Asn Val Thr Glu Tyr
50 55 60
Phe Asn Met Trp Lys Asn Asn Met Val Glu Gln Met Gln Glu Asp Ile
65 70 75 80
Ile Ser Leu Trp Asp Gln Ser Leu Lys Pro Cys Val Lys Leu Thr Pro
85 90 95
Leu Cys Val Thr Leu Thr Cys Val Asn Ile Thr Asn Cys Lys Asn Asn
100 105 110
Thr Asn Cys Asn Asn Asp Thr Asn Ser Lys Asn Asp Thr Leu Lys Glu
115 120 125
Glu Ile Gly Glu Ile Lys Asn Cys Ser Phe Asn Val Thr Thr Ala Ile
130 135 140
Arg Asp Lys Val Gln Lys Glu Tyr Ala Leu Phe His Lys Leu Asp Val
145 150 155 160
Val Gln Ile Asp Asn Asp Asn Thr Ser Ser Asn Thr Ser Lys Pro Tyr
165 170 175
Arg Leu Ile Ser Cys Asn Thr Ser Val Ile Thr Gln Ala Cys Pro Lys
180 185 190
Val Thr Phe Glu Pro Ile Pro Ile His Tyr Cys Ala Ser Ala Gly Phe
195 200 205
Ala Ile Leu Lys Cys Asn Asn Lys Thr Phe Asn Gly Thr Gly Pro Cys
210 215 220
Thr Asn Val Ser Thr Val Gln Cys Thr His Gly Ile Arg Pro Val Val
225 230 235 240
Ser Thr Gln Leu Leu Leu Asn Asp Ser Leu Ala Glu Lys Glu Ala Ile
245 250 255
Val Arg Ser Glu Asn Phe Thr Asn Asn Ala Lys Ile Ile Ile Val Gln
260 265 270
Leu Asn Glu Ser Val Glu Ile Asn Cys Thr Arg Pro Asn Asn Asn Thr
275 280 285
Arg Arg Ser Ile Pro Ile Gly Pro Gly Lys Ala Phe Phe Thr Ser Glu
290 295 300
Ile Ile Gly Asp Ile Arg Lys Ala His Cys Asn Ile Ser Gly Thr Lys
305 310 315 320
Trp Asn Ala Thr Leu His Lys Ile Ala Ile Lys Leu Arg Glu Gln Tyr
325 330 335
Gly Asn Lys Thr Ile Val Phe Asn Gln Pro Ser Gly Gly Asp Pro Glu
340 345 350
Val Val Met His Ser Phe Asn Cys Gly Gly Glu Phe Phe Tyr Cys Asp
355 360 365
Thr Thr Gln Leu Phe Asn Ser Thr Trp Phe Asn Ser Thr Trp Pro Asn
370 375 380
Ile Thr Leu Glu Glu Asn Ile Thr Leu Pro Cys Lys Ile Lys Gln Phe
385 390 395 400
Ile Asn Met Trp Gln Glu Val Gly Lys Ala Met Tyr Ala Pro Pro Ile
405 410 415
Arg Gly Gln Ile Asn Cys Ser Ser Gln Ile Thr Gly Leu Leu Leu Thr
420 425 430
Arg Asp Gly Gly Gln Gly Asn Asn Thr Asn Asn Asp Thr Glu Ile Phe
435 440 445
Arg Pro Gly Gly Gly Asp Met Arg Asp Asn Trp Arg Ser Glu Leu Tyr
450 455 460
Lys Tyr Lys Val Val Arg Ile Glu Pro Leu Gly Val Ala Pro Thr Lys
465 470 475 480
Ala Met Arg Arg Val Val Gln Arg Glu Lys Arg
485 490
<210> 117
<211> 1679
<212> DNA
<213> Human immunodeficiency virus type 1
<400> 117
agaaagagca gaagacagtg gcaatgagag tgatggggat caggaagagt tatcagcact 60
tgtggaaagg gggcaccttg ctccttggaa tattgatgat ctgtagtgct gcagaacaat 120
tgtgggtcac agtctattat ggggtacctg tgtggaaaga tgcaaccacc actttatttt 180
gtgcatcaga tgctaaagca tatgatacag aggtacacaa tgtttgggcc acacatgcct 240
gtgtacccac agaccccaac ccacaagaag tagtaatggg aaatgtgaca gaatatttta 300
acatgtggac aaataacatg gtagaacaga tgcatgagga tgtaatcagt ttatgggatc 360
aaagcctaaa gccatgtgta aaattaaccc cactctgtgt tactttaaat tgcactaatt 420
tggtgaatac tacctgtaat gggactacta acaataatac tacctgtact gggactgcta 480
acaatgatac taataccaat agtactaggt gggtgtatca agcgatggca ggagaaataa 540
aaaactgctc tttcaatatc accacaaaca taagagataa gataaaaaaa gaatatgcac 600
tttttaatag acttgatata gtaccaatag atgatgagaa taagaatact ggcaatacta 660
ctagctatag gttgataagt tgtaacacct cagtcattac acaggcctgt ccaaaggtaa 720
cctttgaacc aattcccata cattattgtg ccccggctgg ttttgcgatt ctcaagtgta 780
atgataagaa gttcaatgga acaggaccat gtacaaatgt cagcacagta caatgtacac 840
atggaattag gccagtagta tcaactcaac tactattaaa tggcagtcta gcagaagaag 900
agacagtaat tagatctagc aatttctcga acaatgctaa aatcataata gtacagctga 960
atgaaactgt acgaattaat tgtacaagac ccaacaacaa tacaagaaga agtatacata 1020
taggaccagg gagagcattt tatgcaacag gagacataat aggagatata agacaagcac 1080
attgtaacat tagtggagaa gaatggagga gaactttaaa acggataact ataaaattag 1140
gagaacaatt taataaaaca aaaataagct ataaccaatc ctcaggaggg gacccagaaa 1200
ttgtaaggca cagttttaat tgtcaagggg aatttttcta ctgtgataca tcaggactgt 1260
ttaatagtac ttgggtgaag aatgatactt ggaatgagag tagtattagc aatggaacta 1320
tcacactccc atgcagaata aaacaaattg taaacatgtg gcaggaagta ggaagagcaa 1380
tgtatgcccc tcctatcaaa ggacaaatta attgtacatc gaatattaca gggctgctac 1440
taacaagaga tggtggtcag actaatagca ccaacaacga cactgagacc ttcagaccta 1500
caggaggaga tataagggac aattggagga gtgaattata taaatataaa gtagtaaaaa 1560
ttgaaccatt aggaatagca cccaccaggg caaaaagaag agtggtgcaa agagaaaaaa 1620
gagcagtggg aacgatggga gcgttgttcc ttgggttctt gggagcagca ggaagcact 1679
<210> 118
<211> 506
<212> PRT
<213> Human immunodeficiency virus type 1
<400> 118
Ser Ala Ala Glu Gln Leu Trp Val Thr Val Tyr Tyr Gly Val Pro Val
1 5 10 15
Trp Lys Asp Ala Thr Thr Thr Leu Phe Cys Ala Ser Asp Ala Lys Ala
20 25 30
Tyr Asp Thr Glu Val His Asn Val Trp Ala Thr His Ala Cys Val Pro
35 40 45
Thr Asp Pro Asn Pro Gln Glu Val Val Met Gly Asn Val Thr Glu Tyr
50 55 60
Phe Asn Met Trp Thr Asn Asn Met Val Glu Gln Met His Glu Asp Val
65 70 75 80
Ile Ser Leu Trp Asp Gln Ser Leu Lys Pro Cys Val Lys Leu Thr Pro
85 90 95
Leu Cys Val Thr Leu Asn Cys Thr Asn Leu Val Asn Thr Thr Cys Asn
100 105 110
Gly Thr Thr Asn Asn Asn Thr Thr Cys Thr Gly Thr Ala Asn Asn Asp
115 120 125
Thr Asn Thr Asn Ser Thr Arg Trp Val Tyr Gln Ala Met Ala Gly Glu
130 135 140
Ile Lys Asn Cys Ser Phe Asn Ile Thr Thr Asn Ile Arg Asp Lys Ile
145 150 155 160
Lys Lys Glu Tyr Ala Leu Phe Asn Arg Leu Asp Ile Val Pro Ile Asp
165 170 175
Asp Glu Asn Lys Asn Thr Gly Asn Thr Thr Ser Tyr Arg Leu Ile Ser
180 185 190
Cys Asn Thr Ser Val Ile Thr Gln Ala Cys Pro Lys Val Thr Phe Glu
195 200 205
Pro Ile Pro Ile His Tyr Cys Ala Pro Ala Gly Phe Ala Ile Leu Lys
210 215 220
Cys Asn Asp Lys Lys Phe Asn Gly Thr Gly Pro Cys Thr Asn Val Ser
225 230 235 240
Thr Val Gln Cys Thr His Gly Ile Arg Pro Val Val Ser Thr Gln Leu
245 250 255
Leu Leu Asn Gly Ser Leu Ala Glu Glu Glu Thr Val Ile Arg Ser Ser
260 265 270
Asn Phe Ser Asn Asn Ala Lys Ile Ile Ile Val Gln Leu Asn Glu Thr
275 280 285
Val Arg Ile Asn Cys Thr Arg Pro Asn Asn Asn Thr Arg Arg Ser Ile
290 295 300
His Ile Gly Pro Gly Arg Ala Phe Tyr Ala Thr Gly Asp Ile Ile Gly
305 310 315 320
Asp Ile Arg Gln Ala His Cys Asn Ile Ser Gly Glu Glu Trp Arg Arg
325 330 335
Thr Leu Lys Arg Ile Thr Ile Lys Leu Gly Glu Gln Phe Asn Lys Thr
340 345 350
Lys Ile Ser Tyr Asn Gln Ser Ser Gly Gly Asp Pro Glu Ile Val Arg
355 360 365
His Ser Phe Asn Cys Gln Gly Glu Phe Phe Tyr Cys Asp Thr Ser Gly
370 375 380
Leu Phe Asn Ser Thr Trp Val Lys Asn Asp Thr Trp Asn Glu Ser Ser
385 390 395 400
Ile Ser Asn Gly Thr Ile Thr Leu Pro Cys Arg Ile Lys Gln Ile Val
405 410 415
Asn Met Trp Gln Glu Val Gly Arg Ala Met Tyr Ala Pro Pro Ile Lys
420 425 430
Gly Gln Ile Asn Cys Thr Ser Asn Ile Thr Gly Leu Leu Leu Thr Arg
435 440 445
Asp Gly Gly Gln Thr Asn Ser Thr Asn Asn Asp Thr Glu Thr Phe Arg
450 455 460
Pro Thr Gly Gly Asp Ile Arg Asp Asn Trp Arg Ser Glu Leu Tyr Lys
465 470 475 480
Tyr Lys Val Val Lys Ile Glu Pro Leu Gly Ile Ala Pro Thr Arg Ala
485 490 495
Lys Arg Arg Val Val Gln Arg Glu Lys Arg
500 505
<210> 119
<211> 1589
<212> DNA
<213> Human immunodeficiency virus type 1
<400> 119
agaaagagca gaagacagtg gcaatgagag tgagggggat catgaggaat tacttgtgga 60
aatggggcat catgctcctt gggatattga tgatctgtag tgctacagac aaattgtggg 120
tcacagtcta ttatggggtg cctgtgtgga aagaagcatc caccactcta ttttgtgcat 180
cagatgctaa agcctatgat acagaggtac ataatgtttg ggccacacat gcctgtgtac 240
ccacagatcc caatccacac gaattagaat tggaaaatgt gacagaagat tttaacatgt 300
ggaaaaatga catggtcgaa cagatgcatg aggatataat cagtttatgg gatcaaagcc 360
taaaaccatg tgtaaaatta accccactct gtgttacttt aaattgcagt gatgctttaa 420
cttgcaatag gacatcaaat agcagtagta cttcaaattg cagtaactgg gaaccgatag 480
aagaaataaa aaattgctct ttcaatatta ccacaagcat agaaaataag atgcagaaaa 540
agtctgcatt ttttgatgcc cttgatgtag tacaaataga tgatactagt tataggttga 600
taaattgtaa cacctcagtc attacacagg cctgtccaaa gatatccttt gagccaattc 660
ccatacatta ttgtgccccg gctggttttg cgcttctaaa gtgtaaggat ccgaaattca 720
atggaacagg gccatgtaaa tatgctagct cagtacagtg tacacatgga attaggccgg 780
tagtatcaac tcaactgctg ctaaatggca gtctagcaga agaagatata gtaattagat 840
ctgccaattt ctcggacaac accaaagcca taatagtaca actaaaagaa cctgtaataa 900
ttaattgcac aagacccaac aacaatacaa gacaaagtgt acatatagga ccagggagcg 960
cactttatac aacagatata ataggagata taagaaaagc acattgtaac attagtagag 1020
cagactggac taaagcttta aaccagacag tcataaaatt aagagaacaa tttaagaata 1080
aaacaatagt ctttaatcaa tcctcaggag gggatccaga aattgtaatg cacactttta 1140
attgtggagg ggaatttttc tattgtaatt caacaaaact gtttaatagt acttggaatg 1200
ggactgaacc aggagagtca aatgacactg taatcatact cccatgcaga ataaaacaaa 1260
ttataaatat gtggcaggaa gtaggaaaag caatgtatgc ccctcccatc agaggacaaa 1320
ttagatgtac atcaaatatt acagggctgc tactaacaag agatggggga aatgagacca 1380
ctaaaaacgg gactgagacc ttcagacctg gaggaggaaa tatgaaggac aattggagaa 1440
gtgaattata taaatataaa gtggtaaaaa ttgaaccatt aggagtagca cccaccaagg 1500
caaaaagaag agtggtgcag agagaaaaaa gagcaatagg ggcattcgga gctatgttcc 1560
ttgggttctt gggagcagca ggaagcact 1589
<210> 120
<211> 478
<212> PRT
<213> Human immunodeficiency virus type 1
<400> 120
Ser Ala Thr Asp Lys Leu Trp Val Thr Val Tyr Tyr Gly Val Pro Val
1 5 10 15
Trp Lys Glu Ala Ser Thr Thr Leu Phe Cys Ala Ser Asp Ala Lys Ala
20 25 30
Tyr Asp Thr Glu Val His Asn Val Trp Ala Thr His Ala Cys Val Pro
35 40 45
Thr Asp Pro Asn Pro His Glu Leu Glu Leu Glu Asn Val Thr Glu Asp
50 55 60
Phe Asn Met Trp Lys Asn Asp Met Val Glu Gln Met His Glu Asp Ile
65 70 75 80
Ile Ser Leu Trp Asp Gln Ser Leu Lys Pro Cys Val Lys Leu Thr Pro
85 90 95
Leu Cys Val Thr Leu Asn Cys Ser Asp Ala Leu Thr Cys Asn Arg Thr
100 105 110
Ser Asn Ser Ser Ser Thr Ser Asn Cys Ser Asn Trp Glu Pro Ile Glu
115 120 125
Glu Ile Lys Asn Cys Ser Phe Asn Ile Thr Thr Ser Ile Glu Asn Lys
130 135 140
Met Gln Lys Lys Ser Ala Phe Phe Asp Ala Leu Asp Val Val Gln Ile
145 150 155 160
Asp Asp Thr Ser Tyr Arg Leu Ile Asn Cys Asn Thr Ser Val Ile Thr
165 170 175
Gln Ala Cys Pro Lys Ile Ser Phe Glu Pro Ile Pro Ile His Tyr Cys
180 185 190
Ala Pro Ala Gly Phe Ala Leu Leu Lys Cys Lys Asp Pro Lys Phe Asn
195 200 205
Gly Thr Gly Pro Cys Lys Tyr Ala Ser Ser Val Gln Cys Thr His Gly
210 215 220
Ile Arg Pro Val Val Ser Thr Gln Leu Leu Leu Asn Gly Ser Leu Ala
225 230 235 240
Glu Glu Asp Ile Val Ile Arg Ser Ala Asn Phe Ser Asp Asn Thr Lys
245 250 255
Ala Ile Ile Val Gln Leu Lys Glu Pro Val Ile Ile Asn Cys Thr Arg
260 265 270
Pro Asn Asn Asn Thr Arg Gln Ser Val His Ile Gly Pro Gly Ser Ala
275 280 285
Leu Tyr Thr Thr Asp Ile Ile Gly Asp Ile Arg Lys Ala His Cys Asn
290 295 300
Ile Ser Arg Ala Asp Trp Thr Lys Ala Leu Asn Gln Thr Val Ile Lys
305 310 315 320
Leu Arg Glu Gln Phe Lys Asn Lys Thr Ile Val Phe Asn Gln Ser Ser
325 330 335
Gly Gly Asp Pro Glu Ile Val Met His Thr Phe Asn Cys Gly Gly Glu
340 345 350
Phe Phe Tyr Cys Asn Ser Thr Lys Leu Phe Asn Ser Thr Trp Asn Gly
355 360 365
Thr Glu Pro Gly Glu Ser Asn Asp Thr Val Ile Ile Leu Pro Cys Arg
370 375 380
Ile Lys Gln Ile Ile Asn Met Trp Gln Glu Val Gly Lys Ala Met Tyr
385 390 395 400
Ala Pro Pro Ile Arg Gly Gln Ile Arg Cys Thr Ser Asn Ile Thr Gly
405 410 415
Leu Leu Leu Thr Arg Asp Gly Gly Asn Glu Thr Thr Lys Asn Gly Thr
420 425 430
Glu Thr Phe Arg Pro Gly Gly Gly Asn Met Lys Asp Asn Trp Arg Ser
435 440 445
Glu Leu Tyr Lys Tyr Lys Val Val Lys Ile Glu Pro Leu Gly Val Ala
450 455 460
Pro Thr Lys Ala Lys Arg Arg Val Val Gln Arg Glu Lys Arg
465 470 475
<210> 121
<211> 1616
<212> DNA
<213> Human immunodeficiency virus type 1
<400> 121
agaaagagca gaagacagtg gcaatgaaag tgaaggggat caggaagaat tgtcagcgct 60
tgtggagatg gggcacgatg ctccttggga tgttaatgat atgtagtgct gcagagcaat 120
tgtgggtcac agtctattat ggggtacctg tgtggagaga agcaaacacc actctattct 180
gtgcctcaga tgctaaagca caggttgcag aggcacataa tgtatgggcc acacatgcct 240
gtgtacccac agaccctagc ccacaagaag tagtaatgga aaatgtgaca gaaaatttta 300
acatgtggaa aaataacatg gtagaacaga tgcatgagga tataatcagt ttatgggatc 360
aaagtctaaa gccatgtgtg aaattaaccc cactctgcgt tactttaaat tgcactaatg 420
tgggttgcac tggtaatact actggaccca attgtacttc tttgactgat cataatagta 480
atcttacttg gggaatggag aaaggagaaa taaaaaattg ctctttcaat gtcaccagta 540
taacaaataa gatgcagaaa gaatatgcac ttttttataa acttgatgta atgccaatgg 600
atagtacaga taatacaacg tatacactga taaattgtaa cccctcagtc attacacagg 660
cctgtccaaa ggtatctttt gaacccattc ctatacatta ttgtaccccg gctggttttg 720
cgattctaaa gtgtaatgat aagacattca atggatcagg accatgtaca aatgtcagta 780
cagtactatg tacacatgga attaggccag tagtgtcaac tcaactactg ttaaatggca 840
gtctagcaga agaggaggta atagtcaggt ccgagaattt ctcggacaat actaaaatca 900
taatagtaca gctgaataaa actgtagaaa ttaattgtac aagacccaat aacaatacaa 960
gaaaaagtat acatatagca ccaggaaaag cattctatgc aacaggtgat ataataggag 1020
atataagaca agcacattgt aacatcagtg aaacaaaatg ggtgaacact ttaaaacagg 1080
tagttacaaa attaagggaa caatatggga ataaaacaat agcctttaat caatcctcag 1140
gaggggatcc agaaattgta acgcatagtt ttaattgtgg aggagaattt ttctactgta 1200
atacatcacg gctgtttaat agtaattgga ctgggaatgg aacgactgag tcaggaaata 1260
gcacaatcat acttccatgc agaataaaac aaattataaa cagatggcag gaagtaggaa 1320
aagcaatgta tgccaatccc attagtggac caatcaactg ttcatcaaac attacagggc 1380
tgctattaac aagagatggt ggtaaagtga ccaatgacac caccgagacc ttcagacctt 1440
ggggtggaga tatgagggac aattggagaa gtgaactata taaatataaa gtagtaagaa 1500
ttgagccatt aggactagca cccaccaggg caaagagaag agtggtgcag agagaaaaga 1560
gagcaataac attaggagct atgttccttg ggttcttggg agcagcagga agcact 1616
<210> 122
<211> 486
<212> PRT
<213> Human immunodeficiency virus type 1
<400> 122
Ser Ala Ala Glu Gln Leu Trp Val Thr Val Tyr Tyr Gly Val Pro Val
1 5 10 15
Trp Arg Glu Ala Asn Thr Thr Leu Phe Cys Ala Ser Asp Ala Lys Ala
20 25 30
Gln Val Ala Glu Ala His Asn Val Trp Ala Thr His Ala Cys Val Pro
35 40 45
Thr Asp Pro Ser Pro Gln Glu Val Val Met Glu Asn Val Thr Glu Asn
50 55 60
Phe Asn Met Trp Lys Asn Asn Met Val Glu Gln Met His Glu Asp Ile
65 70 75 80
Ile Ser Leu Trp Asp Gln Ser Leu Lys Pro Cys Val Lys Leu Thr Pro
85 90 95
Leu Cys Val Thr Leu Asn Cys Thr Asn Val Gly Cys Thr Gly Asn Thr
100 105 110
Thr Gly Pro Asn Cys Thr Ser Leu Thr Asp His Asn Ser Asn Leu Thr
115 120 125
Trp Gly Met Glu Lys Gly Glu Ile Lys Asn Cys Ser Phe Asn Val Thr
130 135 140
Ser Ile Thr Asn Lys Met Gln Lys Glu Tyr Ala Leu Phe Tyr Lys Leu
145 150 155 160
Asp Val Met Pro Met Asp Ser Thr Asp Asn Thr Thr Tyr Thr Leu Ile
165 170 175
Asn Cys Asn Pro Ser Val Ile Thr Gln Ala Cys Pro Lys Val Ser Phe
180 185 190
Glu Pro Ile Pro Ile His Tyr Cys Thr Pro Ala Gly Phe Ala Ile Leu
195 200 205
Lys Cys Asn Asp Lys Thr Phe Asn Gly Ser Gly Pro Cys Thr Asn Val
210 215 220
Ser Thr Val Leu Cys Thr His Gly Ile Arg Pro Val Val Ser Thr Gln
225 230 235 240
Leu Leu Leu Asn Gly Ser Leu Ala Glu Glu Glu Val Ile Val Arg Ser
245 250 255
Glu Asn Phe Ser Asp Asn Thr Lys Ile Ile Ile Val Gln Leu Asn Lys
260 265 270
Thr Val Glu Ile Asn Cys Thr Arg Pro Asn Asn Asn Thr Arg Lys Ser
275 280 285
Ile His Ile Ala Pro Gly Lys Ala Phe Tyr Ala Thr Gly Asp Ile Ile
290 295 300
Gly Asp Ile Arg Gln Ala His Cys Asn Ile Ser Glu Thr Lys Trp Val
305 310 315 320
Asn Thr Leu Lys Gln Val Val Thr Lys Leu Arg Glu Gln Tyr Gly Asn
325 330 335
Lys Thr Ile Ala Phe Asn Gln Ser Ser Gly Gly Asp Pro Glu Ile Val
340 345 350
Thr His Ser Phe Asn Cys Gly Gly Glu Phe Phe Tyr Cys Asn Thr Ser
355 360 365
Arg Leu Phe Asn Ser Asn Trp Thr Gly Asn Gly Thr Thr Glu Ser Gly
370 375 380
Asn Ser Thr Ile Ile Leu Pro Cys Arg Ile Lys Gln Ile Ile Asn Arg
385 390 395 400
Trp Gln Glu Val Gly Lys Ala Met Tyr Ala Asn Pro Ile Ser Gly Pro
405 410 415
Ile Asn Cys Ser Ser Asn Ile Thr Gly Leu Leu Leu Thr Arg Asp Gly
420 425 430
Gly Lys Val Thr Asn Asp Thr Thr Glu Thr Phe Arg Pro Trp Gly Gly
435 440 445
Asp Met Arg Asp Asn Trp Arg Ser Glu Leu Tyr Lys Tyr Lys Val Val
450 455 460
Arg Ile Glu Pro Leu Gly Leu Ala Pro Thr Arg Ala Lys Arg Arg Val
465 470 475 480
Val Gln Arg Glu Lys Arg
485
<210> 123
<211> 1610
<212> DNA
<213> Human immunodeficiency virus type 1
<400> 123
agaaagagca gaagacagtg gcaatgagag tgaaggagat caggaagaat tggcagcgct 60
tgtggagatg gggcatgatg ctccttggga tgttgatgat cagtagtgct gaagaagatt 120
tgtgggtcac agtctattat ggggtacctg tgtggaaaga agcagagacc actttatttt 180
gtgcatcaga tgctaaagca tataacacag aggcacataa tgtgtgggcc acacatgcct 240
gtgtaccaac agaccctagc ccacaagaag tattattggt aaatgtgaca gaaaattata 300
acatgtggaa aaataacatg gtagaacaga tgcatgagga tataattagt ttatgggatc 360
aaagcctaaa gccatgtgta aaattaaccc cgctttgtgt tactttaaat tgcactaatg 420
tgaattgcac tcatgagaat ggtaccacta ccgagtgcgg taataatggg atacagatgg 480
agaaaggaga aatgaaaaac tgctctttca atattaccac aagcataaaa aataagatgc 540
agaaagaata tgcacttttg tataaactag atttagcatc aataggtaat gataatacaa 600
gctatacttt gataagttgt aacacctcag tcattacaca ggcctgtcca aagatatcct 660
ttgaaccaat tccaatacat tattgtgccc cggctggttt tgcgattcta aaatgtaatg 720
ataagaactt caagggaaca ggatcatgta aaaatgtcag cacagtacaa tgtacacatg 780
gaattaagcc agtagtgtca actcaattgt tgttaaatgg cactttagca gaaacagagg 840
tagtaattag atctgaaaat atcacagaca atgctaaaac cataatagta caactgaagg 900
accctgtaaa aattaattgt acaagacctg gcaacaatac agcaagaagc atacatatgg 960
gaccggggag agcattttct gcaacaggac aaataatagg aaatataaga caagcacatt 1020
gtaaccttag tagaacagaa tgggatgaca ctttaaaaaa gatagctaag aaattaggag 1080
aacaatttag gaataaaagt atagccttta atcaatcctc aggaggggac ccagaaattg 1140
taatgcacag ttttaattgt ggaggggaat ttttttactg taatacatca cagctgttta 1200
atagtacttg gtggaacaat ggtactagga atgatgctgc aaggtcaaat agcactgaac 1260
ctatcacact ccggtgcagt ataaagcaaa ttataaacag atggcaggaa gtaggaaaag 1320
caatgtatgc ccctcccatc aggggaaacg ttacatgtaa ctcaagtatt acagggctac 1380
tcttaataag agatggtggg aacagtaatg agtctactga gaccttcaga cctcagggag 1440
gaaatatgaa ggacaattgg agaagtgaat tatacaaata taaagtagta aaaattgagc 1500
cattaggagt agcacccacc aaggcaaaga gaagagtggt gcagagagaa aaaagagcag 1560
tgggactagg agctgtgttc cttgggttct tgggagcagc aggaagcact 1610
<210> 124
<211> 484
<212> PRT
<213> Human immunodeficiency virus type 1
<400> 124
Ser Ala Glu Glu Asp Leu Trp Val Thr Val Tyr Tyr Gly Val Pro Val
1 5 10 15
Trp Lys Glu Ala Glu Thr Thr Leu Phe Cys Ala Ser Asp Ala Lys Ala
20 25 30
Tyr Asn Thr Glu Ala His Asn Val Trp Ala Thr His Ala Cys Val Pro
35 40 45
Thr Asp Pro Ser Pro Gln Glu Val Leu Leu Val Asn Val Thr Glu Asn
50 55 60
Tyr Asn Met Trp Lys Asn Asn Met Val Glu Gln Met His Glu Asp Ile
65 70 75 80
Ile Ser Leu Trp Asp Gln Ser Leu Lys Pro Cys Val Lys Leu Thr Pro
85 90 95
Leu Cys Val Thr Leu Asn Cys Thr Asn Val Asn Cys Thr His Glu Asn
100 105 110
Gly Thr Thr Thr Glu Cys Gly Asn Asn Gly Ile Gln Met Glu Lys Gly
115 120 125
Glu Met Lys Asn Cys Ser Phe Asn Ile Thr Thr Ser Ile Lys Asn Lys
130 135 140
Met Gln Lys Glu Tyr Ala Leu Leu Tyr Lys Leu Asp Leu Ala Ser Ile
145 150 155 160
Gly Asn Asp Asn Thr Ser Tyr Thr Leu Ile Ser Cys Asn Thr Ser Val
165 170 175
Ile Thr Gln Ala Cys Pro Lys Ile Ser Phe Glu Pro Ile Pro Ile His
180 185 190
Tyr Cys Ala Pro Ala Gly Phe Ala Ile Leu Lys Cys Asn Asp Lys Asn
195 200 205
Phe Lys Gly Thr Gly Ser Cys Lys Asn Val Ser Thr Val Gln Cys Thr
210 215 220
His Gly Ile Lys Pro Val Val Ser Thr Gln Leu Leu Leu Asn Gly Thr
225 230 235 240
Leu Ala Glu Thr Glu Val Val Ile Arg Ser Glu Asn Ile Thr Asp Asn
245 250 255
Ala Lys Thr Ile Ile Val Gln Leu Lys Asp Pro Val Lys Ile Asn Cys
260 265 270
Thr Arg Pro Gly Asn Asn Thr Ala Arg Ser Ile His Met Gly Pro Gly
275 280 285
Arg Ala Phe Ser Ala Thr Gly Gln Ile Ile Gly Asn Ile Arg Gln Ala
290 295 300
His Cys Asn Leu Ser Arg Thr Glu Trp Asp Asp Thr Leu Lys Lys Ile
305 310 315 320
Ala Lys Lys Leu Gly Glu Gln Phe Arg Asn Lys Ser Ile Ala Phe Asn
325 330 335
Gln Ser Ser Gly Gly Asp Pro Glu Ile Val Met His Ser Phe Asn Cys
340 345 350
Gly Gly Glu Phe Phe Tyr Cys Asn Thr Ser Gln Leu Phe Asn Ser Thr
355 360 365
Trp Trp Asn Asn Gly Thr Arg Asn Asp Ala Ala Arg Ser Asn Ser Thr
370 375 380
Glu Pro Ile Thr Leu Arg Cys Ser Ile Lys Gln Ile Ile Asn Arg Trp
385 390 395 400
Gln Glu Val Gly Lys Ala Met Tyr Ala Pro Pro Ile Arg Gly Asn Val
405 410 415
Thr Cys Asn Ser Ser Ile Thr Gly Leu Leu Leu Ile Arg Asp Gly Gly
420 425 430
Asn Ser Asn Glu Ser Thr Glu Thr Phe Arg Pro Gln Gly Gly Asn Met
435 440 445
Lys Asp Asn Trp Arg Ser Glu Leu Tyr Lys Tyr Lys Val Val Lys Ile
450 455 460
Glu Pro Leu Gly Val Ala Pro Thr Lys Ala Lys Arg Arg Val Val Gln
465 470 475 480
Arg Glu Lys Arg
<210> 125
<211> 1619
<212> DNA
<213> Human immunodeficiency virus type 1
<400> 125
agaaagagca gaagacagtg gcaatgagag tgatggggat aaagaagaat tactggtgga 60
gatggggccc gatgctcctt gggatattga tgacctatag tgcagcagaa ttttgggtca 120
cagtctacta tggagtgcca gtgtggaaag aaacaaccac cactctattt tgtgcatcag 180
atgccaaagc atatgataca gaggcacata atgtttgggc cacacatgcc tgtgtaccca 240
cagaccccaa cccacaagaa gtagtattgg aaaaggtgac agaagagttt aacatgtgga 300
aaaatagcat ggtagaacag atgcatgagg atataatcag tttatgggat caaagtctaa 360
agccatgtgt aaaactaacc ccactctgtg ttactttaag ttgcactgat tgtaatggta 420
ctagccctga gtgtgcgaag aatgctagta ctactaccac tagtagtaag ggattgatag 480
ataaagggga aataaaaaac tgctctttca atgccaccac acacataatg gataaggtgc 540
agaaagaata tgcattattt tataacactg atttagtaca aatagagggt gagaaatctg 600
ataataatac tagatatagg ttaataagtt gtaacacctc agtcattaaa caggcctgtc 660
caaaggtatc ttttgagcca attcccatac attattgtgc cccggctggt tttgcgattc 720
taaagtgtaa agataagaat ttcaatggaa caggaaaatg ttacaatgtc agcacagtac 780
aatgtacaca tggaattagg ccagtaatgt caactcaact gctgttaaat ggcagcctag 840
cagaagaaga aatagtaatt agatctgcca atttctcgaa caatgctaaa accataatag 900
tacatctgaa tgaatctgta gaaattaact gcacaagacc caacaacgat acaaggaaaa 960
gtataaatat aggaccaggg agagcatggt atgcagcagg agaaataata ggaaatataa 1020
gaaaagcata ttgtaacatt agcagagcaa aatggaacaa cactttaaaa catgtagttg 1080
aaaaactaag aaaacaattt ggaaataaaa caataaactt tacacaacac gcaggagggg 1140
acctagaaat tgtgacgcat agttttaatt gtggagggga attcttctac tgcaacacaa 1200
cacagctgtt taatagtact tggcctaaga atggtacttg gaatggtact ggtagtgaca 1260
ttatcacact cccatgcaaa ataaaacaga ttataaacat gtggcaggag gtaggaaaag 1320
caatgtatgc ccctcccatc agcggactaa ttagatgttc atcaaatatt acagggctgc 1380
tattaacaag agatggtggt aagggtaatg gcacaaatga tacagagatc ttcagaccag 1440
gaggaggaga tatgagggac aattggagaa gtgaattata taaatataaa gtagtagaaa 1500
ttgagccaat aggactagca cccaccaagg caaagagaag agtggtgcag agagaaaaaa 1560
gagcagtggg aacgctggga gctatgttcc ttgggttctt gggagcagca ggaagcact 1619
<210> 126
<211> 488
<212> PRT
<213> Human immunodeficiency virus type 1
<400> 126
Ser Ala Ala Glu Phe Trp Val Thr Val Tyr Tyr Gly Val Pro Val Trp
1 5 10 15
Lys Glu Thr Thr Thr Thr Leu Phe Cys Ala Ser Asp Ala Lys Ala Tyr
20 25 30
Asp Thr Glu Ala His Asn Val Trp Ala Thr His Ala Cys Val Pro Thr
35 40 45
Asp Pro Asn Pro Gln Glu Val Val Leu Glu Lys Val Thr Glu Glu Phe
50 55 60
Asn Met Trp Lys Asn Ser Met Val Glu Gln Met His Glu Asp Ile Ile
65 70 75 80
Ser Leu Trp Asp Gln Ser Leu Lys Pro Cys Val Lys Leu Thr Pro Leu
85 90 95
Cys Val Thr Leu Ser Cys Thr Asp Cys Asn Gly Thr Ser Pro Glu Cys
100 105 110
Ala Lys Asn Ala Ser Thr Thr Thr Thr Ser Ser Lys Gly Leu Ile Asp
115 120 125
Lys Gly Glu Ile Lys Asn Cys Ser Phe Asn Ala Thr Thr His Ile Met
130 135 140
Asp Lys Val Gln Lys Glu Tyr Ala Leu Phe Tyr Asn Thr Asp Leu Val
145 150 155 160
Gln Ile Glu Gly Glu Lys Ser Asp Asn Asn Thr Arg Tyr Arg Leu Ile
165 170 175
Ser Cys Asn Thr Ser Val Ile Lys Gln Ala Cys Pro Lys Val Ser Phe
180 185 190
Glu Pro Ile Pro Ile His Tyr Cys Ala Pro Ala Gly Phe Ala Ile Leu
195 200 205
Lys Cys Lys Asp Lys Asn Phe Asn Gly Thr Gly Lys Cys Tyr Asn Val
210 215 220
Ser Thr Val Gln Cys Thr His Gly Ile Arg Pro Val Met Ser Thr Gln
225 230 235 240
Leu Leu Leu Asn Gly Ser Leu Ala Glu Glu Glu Ile Val Ile Arg Ser
245 250 255
Ala Asn Phe Ser Asn Asn Ala Lys Thr Ile Ile Val His Leu Asn Glu
260 265 270
Ser Val Glu Ile Asn Cys Thr Arg Pro Asn Asn Asp Thr Arg Lys Ser
275 280 285
Ile Asn Ile Gly Pro Gly Arg Ala Trp Tyr Ala Ala Gly Glu Ile Ile
290 295 300
Gly Asn Ile Arg Lys Ala Tyr Cys Asn Ile Ser Arg Ala Lys Trp Asn
305 310 315 320
Asn Thr Leu Lys His Val Val Glu Lys Leu Arg Lys Gln Phe Gly Asn
325 330 335
Lys Thr Ile Asn Phe Thr Gln His Ala Gly Gly Asp Leu Glu Ile Val
340 345 350
Thr His Ser Phe Asn Cys Gly Gly Glu Phe Phe Tyr Cys Asn Thr Thr
355 360 365
Gln Leu Phe Asn Ser Thr Trp Pro Lys Asn Gly Thr Trp Asn Gly Thr
370 375 380
Gly Ser Asp Ile Ile Thr Leu Pro Cys Lys Ile Lys Gln Ile Ile Asn
385 390 395 400
Met Trp Gln Glu Val Gly Lys Ala Met Tyr Ala Pro Pro Ile Ser Gly
405 410 415
Leu Ile Arg Cys Ser Ser Asn Ile Thr Gly Leu Leu Leu Thr Arg Asp
420 425 430
Gly Gly Lys Gly Asn Gly Thr Asn Asp Thr Glu Ile Phe Arg Pro Gly
435 440 445
Gly Gly Asp Met Arg Asp Asn Trp Arg Ser Glu Leu Tyr Lys Tyr Lys
450 455 460
Val Val Glu Ile Glu Pro Ile Gly Leu Ala Pro Thr Lys Ala Lys Arg
465 470 475 480
Arg Val Val Gln Arg Glu Lys Arg
485
<210> 127
<211> 1613
<212> DNA
<213> Human immunodeficiency virus type 1
<400> 127
agaaagagca gaagacagtg gcaatgagag cgaaggggat caggaagagt tgtcaacact 60
tatggagatg gggcaccatg ctccttggga tgttgatgat ttgtagtgct gcagaaaact 120
tgtgggtcac agtctactat ggggtacctg tgtggaaaga agcaaccacc actctatttt 180
gtgcatcgaa tgctaaagca tatgagacag aggtgcataa tgtttgggcc acacatgcct 240
gtgtacccac agaccccaac ccacaagaag tagtattggg aaatgtgaca gaaaatttta 300
acatgtggaa aaataacatg gtagaacaga tgcatgagga tgtaattagt ttgtgggacc 360
aaagcttaaa gccatgtgta aaattgaccc cactctgtgt tactttacat tgcactgatt 420
gtgagaatac tattactggg gggaataata ctaatagtaa atgcaatgag gataagggga 480
atactactgc cactatattg atagagaaag gagagatgaa aaactgctct tttaatgtca 540
ccacagacct aagagataag atgcagaaag aatatgcact tgatgtagta ccattagaca 600
gtactaatac cagctataag ttagtaagtt gtaacacctc agtcattaca caggcctgtc 660
caaaggtatc ttttgagcca attccaatac atttctgtgc cccagctggt tttgcgattc 720
taaagtgtaa caataaaacg tttgatggaa aaggaccatg tacaaatgtc agtacagtgc 780
gatgtacaca tggaattaaa ccagtagtgt caactcaact gctgttaaat ggcagtctag 840
cagaagaaga gatagtgatt agatctgaaa atttctcgaa caatgctaaa accataatag 900
tacagctaaa taaaactgta gaaattaatt gtacaagacc caacaacaac acaagcaaag 960
gtatacatat gggaccaggg agggcatttt atgcaacagg aagaatagta ggagatataa 1020
gacaagcaca ttgtaacatt agtaacgcag attggacaaa tactttaaaa caggtagcta 1080
ggaaattaag ggaacaatat gtgaataaaa caatagcctt taagccaccc tcaggagggg 1140
acccagaagt tgtactgcac acttttaatt gtagagggga atttttctac tgtaatttat 1200
caagaatgtt taatagtagt tttaattcaa cacaactgtc taattattca gaagatactg 1260
ggaccatcac agtcccatgc agaataaaac aatttataaa catgtggcag gaagtaggaa 1320
aagcaatgta tgcccctccc atcagaggag aaattaattg ttcatcaaag attacaggat 1380
tgttattaac aagagacggt ggcaatagca atgggactga gattttcaga cctggaggag 1440
gagatatgag ggacaattgg agaagtgaat tatacaaata taaagtagta agaattgaac 1500
cattaggatt agcacccacc aaggcaaaga gaagagtggt gcagagagaa aaaagagcag 1560
cagtgacaat gggagcaatg ttccctgggt tcttgggagc agcaggaagc act 1613
<210> 128
<211> 484
<212> PRT
<213> Human immunodeficiency virus type 1
<400> 128
Ser Ala Ala Glu Asn Leu Trp Val Thr Val Tyr Tyr Gly Val Pro Val
1 5 10 15
Trp Lys Glu Ala Thr Thr Thr Leu Phe Cys Ala Ser Asn Ala Lys Ala
20 25 30
Tyr Glu Thr Glu Val His Asn Val Trp Ala Thr His Ala Cys Val Pro
35 40 45
Thr Asp Pro Asn Pro Gln Glu Val Val Leu Gly Asn Val Thr Glu Asn
50 55 60
Phe Asn Met Trp Lys Asn Asn Met Val Glu Gln Met His Glu Asp Val
65 70 75 80
Ile Ser Leu Trp Asp Gln Ser Leu Lys Pro Cys Val Lys Leu Thr Pro
85 90 95
Leu Cys Val Thr Leu His Cys Thr Asp Cys Glu Asn Thr Ile Thr Gly
100 105 110
Gly Asn Asn Thr Asn Ser Lys Cys Asn Glu Asp Lys Gly Asn Thr Thr
115 120 125
Ala Thr Ile Leu Ile Glu Lys Gly Glu Met Lys Asn Cys Ser Phe Asn
130 135 140
Val Thr Thr Asp Leu Arg Asp Lys Met Gln Lys Glu Tyr Ala Leu Asp
145 150 155 160
Val Val Pro Leu Asp Ser Thr Asn Thr Ser Tyr Lys Leu Val Ser Cys
165 170 175
Asn Thr Ser Val Ile Thr Gln Ala Cys Pro Lys Val Ser Phe Glu Pro
180 185 190
Ile Pro Ile His Phe Cys Ala Pro Ala Gly Phe Ala Ile Leu Lys Cys
195 200 205
Asn Asn Lys Thr Phe Asp Gly Lys Gly Pro Cys Thr Asn Val Ser Thr
210 215 220
Val Arg Cys Thr His Gly Ile Lys Pro Val Val Ser Thr Gln Leu Leu
225 230 235 240
Leu Asn Gly Ser Leu Ala Glu Glu Glu Ile Val Ile Arg Ser Glu Asn
245 250 255
Phe Ser Asn Asn Ala Lys Thr Ile Ile Val Gln Leu Asn Lys Thr Val
260 265 270
Glu Ile Asn Cys Thr Arg Pro Asn Asn Asn Thr Ser Lys Gly Ile His
275 280 285
Met Gly Pro Gly Arg Ala Phe Tyr Ala Thr Gly Arg Ile Val Gly Asp
290 295 300
Ile Arg Gln Ala His Cys Asn Ile Ser Asn Ala Asp Trp Thr Asn Thr
305 310 315 320
Leu Lys Gln Val Ala Arg Lys Leu Arg Glu Gln Tyr Val Asn Lys Thr
325 330 335
Ile Ala Phe Lys Pro Pro Ser Gly Gly Asp Pro Glu Val Val Leu His
340 345 350
Thr Phe Asn Cys Arg Gly Glu Phe Phe Tyr Cys Asn Leu Ser Arg Met
355 360 365
Phe Asn Ser Ser Phe Asn Ser Thr Gln Leu Ser Asn Tyr Ser Glu Asp
370 375 380
Thr Gly Thr Ile Thr Val Pro Cys Arg Ile Lys Gln Phe Ile Asn Met
385 390 395 400
Trp Gln Glu Val Gly Lys Ala Met Tyr Ala Pro Pro Ile Arg Gly Glu
405 410 415
Ile Asn Cys Ser Ser Lys Ile Thr Gly Leu Leu Leu Thr Arg Asp Gly
420 425 430
Gly Asn Ser Asn Gly Thr Glu Ile Phe Arg Pro Gly Gly Gly Asp Met
435 440 445
Arg Asp Asn Trp Arg Ser Glu Leu Tyr Lys Tyr Lys Val Val Arg Ile
450 455 460
Glu Pro Leu Gly Leu Ala Pro Thr Lys Ala Lys Arg Arg Val Val Gln
465 470 475 480
Arg Glu Lys Arg
<210> 129
<211> 1640
<212> DNA
<213> Human immunodeficiency virus type 1
<400> 129
agaaagagca gaagacagtg gcaatgagag tgaaggggat caggaagaat tatcagcact 60
tatggagatg gggcaccgtg ctccttggga tgttgatgat ctgtagtgct gtagaacaat 120
tgtgggtcac agtctattat ggggtacctg tgtggaaaga agcaaccacc actttatttt 180
gtgcatcaga tgctaaagca tatgacacag aggcacataa tgtctgggcc acacatgcct 240
gtgtacctac agaccctaac ccacaagaag tagtattgga aaatgtgaca gaagattcta 300
acatgtggaa aaataacatg gtagaacaga tgcatgagga tataatcagt ttatgggatc 360
aaagcctaaa gccatgtgta aaattaaccc cactctgtgt tactctaaat tgcactgatt 420
tcaattgtac tagttccagc aatactacta atagcacttg cattggtacc catcggacta 480
ctaataccga tggtagggag aaattggaaa tggaggtagg agaaataaaa aactgctctt 540
tcaatgtcac cacaagcata aggaataagg tacagaaaga atatgcactt ttttataaac 600
ttgatgtaat gccaatagat agtacgagct atacattgat acattgcaac acttcaacca 660
ttacacaggc ctgtccaaag gtatcctttg aaccaattcc tatacattat tgtgccccgg 720
ctggttttgc gattctaaag tgtaacaata agacgttcag tggaaaagga ccatgtaaaa 780
atgtcagcac agttcaatgt acacatggaa ttaggccagt agtgtcaact caactgctgt 840
taaatggtag tctagcagaa gaagagatag taattaggtc tgacaatttc tcggacaatg 900
ctaaaatcat aatagtacac ctaaataaat ctatagaaat taattgtaca agacccaaca 960
ataatacaag aaaaagaata tcgatggggc cgggaagagt atattataca acaggacaaa 1020
taataggaga tataagaaaa gcacattgta atattagtgg agaagaatgg aatagaacgt 1080
taaaagggat agttataaaa ttaagagaac aatttgggaa gaataaaaca atcatctttg 1140
atagatcctc aggaggggac ctagaaattg aaatgcatag ttttaattgt ggaggagagt 1200
tcttctactg taatacaaca aaactattta atagtgcttg gaatgagtca ggttacaatg 1260
ggacaaattc taatggaact attacactcc catgcagaat aagacaaatt gtaaacaggt 1320
ggcaggaagt aggaaaagca atgtatgccc ctcccatcac aggacaaatt aggtgttcat 1380
caaatattac aggactaata ttaacaagag atggtggtaa cagtagcaat agtagtaatg 1440
tgaatgagac cttcagacct acaggaggag atatgaggga caattggaga agtgaattat 1500
ataaatataa agtaatacga attgagccaa taggagtagc acccaccaag gcaaagagaa 1560
gagtggtgca gagagagaaa agagcagtgg gaacgctagg agctatgttc cttgggttct 1620
tgggagcagc aggaagcact 1640
<210> 130
<211> 493
<212> PRT
<213> Human immunodeficiency virus type 1
<400> 130
Ser Ala Val Glu Gln Leu Trp Val Thr Val Tyr Tyr Gly Val Pro Val
1 5 10 15
Trp Lys Glu Ala Thr Thr Thr Leu Phe Cys Ala Ser Asp Ala Lys Ala
20 25 30
Tyr Asp Thr Glu Ala His Asn Val Trp Ala Thr His Ala Cys Val Pro
35 40 45
Thr Asp Pro Asn Pro Gln Glu Val Val Leu Glu Asn Val Thr Glu Asp
50 55 60
Ser Asn Met Trp Lys Asn Asn Met Val Glu Gln Met His Glu Asp Ile
65 70 75 80
Ile Ser Leu Trp Asp Gln Ser Leu Lys Pro Cys Val Lys Leu Thr Pro
85 90 95
Leu Cys Val Thr Leu Asn Cys Thr Asp Phe Asn Cys Thr Ser Ser Ser
100 105 110
Asn Thr Thr Asn Ser Thr Cys Ile Gly Thr His Arg Thr Thr Asn Thr
115 120 125
Asp Gly Arg Glu Lys Leu Glu Met Glu Val Gly Glu Ile Lys Asn Cys
130 135 140
Ser Phe Asn Val Thr Thr Ser Ile Arg Asn Lys Val Gln Lys Glu Tyr
145 150 155 160
Ala Leu Phe Tyr Lys Leu Asp Val Met Pro Ile Asp Ser Thr Ser Tyr
165 170 175
Thr Leu Ile His Cys Asn Thr Ser Thr Ile Thr Gln Ala Cys Pro Lys
180 185 190
Val Ser Phe Glu Pro Ile Pro Ile His Tyr Cys Ala Pro Ala Gly Phe
195 200 205
Ala Ile Leu Lys Cys Asn Asn Lys Thr Phe Ser Gly Lys Gly Pro Cys
210 215 220
Lys Asn Val Ser Thr Val Gln Cys Thr His Gly Ile Arg Pro Val Val
225 230 235 240
Ser Thr Gln Leu Leu Leu Asn Gly Ser Leu Ala Glu Glu Glu Ile Val
245 250 255
Ile Arg Ser Asp Asn Phe Ser Asp Asn Ala Lys Ile Ile Ile Val His
260 265 270
Leu Asn Lys Ser Ile Glu Ile Asn Cys Thr Arg Pro Asn Asn Asn Thr
275 280 285
Arg Lys Arg Ile Ser Met Gly Pro Gly Arg Val Tyr Tyr Thr Thr Gly
290 295 300
Gln Ile Ile Gly Asp Ile Arg Lys Ala His Cys Asn Ile Ser Gly Glu
305 310 315 320
Glu Trp Asn Arg Thr Leu Lys Gly Ile Val Ile Lys Leu Arg Glu Gln
325 330 335
Phe Gly Lys Asn Lys Thr Ile Ile Phe Asp Arg Ser Ser Gly Gly Asp
340 345 350
Leu Glu Ile Glu Met His Ser Phe Asn Cys Gly Gly Glu Phe Phe Tyr
355 360 365
Cys Asn Thr Thr Lys Leu Phe Asn Ser Ala Trp Asn Glu Ser Gly Tyr
370 375 380
Asn Gly Thr Asn Ser Asn Gly Thr Ile Thr Leu Pro Cys Arg Ile Arg
385 390 395 400
Gln Ile Val Asn Arg Trp Gln Glu Val Gly Lys Ala Met Tyr Ala Pro
405 410 415
Pro Ile Thr Gly Gln Ile Arg Cys Ser Ser Asn Ile Thr Gly Leu Ile
420 425 430
Leu Thr Arg Asp Gly Gly Asn Ser Ser Asn Ser Ser Asn Val Asn Glu
435 440 445
Thr Phe Arg Pro Thr Gly Gly Asp Met Arg Asp Asn Trp Arg Ser Glu
450 455 460
Leu Tyr Lys Tyr Lys Val Ile Arg Ile Glu Pro Ile Gly Val Ala Pro
465 470 475 480
Thr Lys Ala Lys Arg Arg Val Val Gln Arg Glu Lys Arg
485 490
<210> 131
<211> 1664
<212> DNA
<213> Human immunodeficiency virus type 1
<400> 131
agaaagagca gaagacagtg gcaatgaaag cgaaggagat gaagaagcat tggcagcact 60
tgtggaaagg gggcatcatg ctccttggga tgttaatgat ctgtagtgct gcaccaaact 120
tgtgggtcac agtctattat ggggtacctg tgtggaaaga agcaaccacc actctatttt 180
gtgcatcaga tgctaaagca tacaaaacag aggctcataa tgtctgggcc acacatgcct 240
gtgtacccac agaccccaac ccacaagaag tagtattgga aaatgtgaca gaaaatttta 300
acatgtggaa aaataacatg gtagaacaga tgcatgagga tataattagt ctatgggatc 360
aaagcctaaa gccatgtgta aaattaaccc cactctgtgt tactttaaat tgcactaagt 420
taaataattg cactgagttg cagaatgata ctacgaatag tactacgttg gagaatggta 480
cttattgcat taaggtggag aataaaacta atataaggga agaaatgaca aattgctctt 540
tcaatattac cacaagtgta agagataagg tgcataaaca atatgctctc ttctatagat 600
ttgatttagt accaatagag gatgaaaata agaatactag ctctaataat agctttagat 660
tgataaattg taatacctca atcattacac agtcctgtcc aaaggtaacc tttgagccaa 720
ttcccataca ttattgtacc ccagctggtt ttgcgattct aaagtgtaat gataagaagt 780
tcaatggaaa agggccatgt acaaatgtca gcacagtaca atgtacacat ggaattaaac 840
cagtagtgtc aactcaactg ctgttaaatg gcagtctagc agaagaagag gtagtaatta 900
gatctgaaaa cttcacaaac aatgctaaaa ccataatagt acagctgaac gagactgtag 960
aaattaattg tacaagacct agcaacaata caagaaaaag tataactata ggaccaggga 1020
gagcattttt tacaacaggg gatgtaatag gaaatataag gcaagcatat tgtaacgtta 1080
gtagggcaaa atggaataac actttaagac agatagttac aaaactaaga gaacaatttg 1140
agaataaaac aataattttt aagtcatcct cgggagggga cccagaaatt gtaactcaca 1200
cttttaattg tggaggagaa tttttctact gtaatacaac accactgttt aatagtacct 1260
gggataatag tacctgggat tggaataata ctgaagagtc aaacagcact cccattgtac 1320
tcacatgcag aataaaacaa attgtaaata tgtggcagga ggtaggaaaa gcaatgtatg 1380
cccctcccat cagaggacaa atttggtgtt catcaaatat tacaggactg ctattaacaa 1440
gagatggtgg aataaataat acgggaaatg agaccttcag acctgcagga ggagatatga 1500
gggacaattg gagaagtgaa tcatataaat ataaagtagt aagaattgaa ccattaggag 1560
tggcacccac caaggcaaag agaagagtgg tgcagagaga aaaaagagca gtgggaacaa 1620
taggagctat gttccttggg ttcttgggag cagcaggaag cact 1664
<210> 132
<211> 501
<212> PRT
<213> Human immunodeficiency virus type 1
<400> 132
Ser Ala Ala Pro Asn Leu Trp Val Thr Val Tyr Tyr Gly Val Pro Val
1 5 10 15
Trp Lys Glu Ala Thr Thr Thr Leu Phe Cys Ala Ser Asp Ala Lys Ala
20 25 30
Tyr Lys Thr Glu Ala His Asn Val Trp Ala Thr His Ala Cys Val Pro
35 40 45
Thr Asp Pro Asn Pro Gln Glu Val Val Leu Glu Asn Val Thr Glu Asn
50 55 60
Phe Asn Met Trp Lys Asn Asn Met Val Glu Gln Met His Glu Asp Ile
65 70 75 80
Ile Ser Leu Trp Asp Gln Ser Leu Lys Pro Cys Val Lys Leu Thr Pro
85 90 95
Leu Cys Val Thr Leu Asn Cys Thr Lys Leu Asn Asn Cys Thr Glu Leu
100 105 110
Gln Asn Asp Thr Thr Asn Ser Thr Thr Leu Glu Asn Gly Thr Tyr Cys
115 120 125
Ile Lys Val Glu Asn Lys Thr Asn Ile Arg Glu Glu Met Thr Asn Cys
130 135 140
Ser Phe Asn Ile Thr Thr Ser Val Arg Asp Lys Val His Lys Gln Tyr
145 150 155 160
Ala Leu Phe Tyr Arg Phe Asp Leu Val Pro Ile Glu Asp Glu Asn Lys
165 170 175
Asn Thr Ser Ser Asn Asn Ser Phe Arg Leu Ile Asn Cys Asn Thr Ser
180 185 190
Ile Ile Thr Gln Ser Cys Pro Lys Val Thr Phe Glu Pro Ile Pro Ile
195 200 205
His Tyr Cys Thr Pro Ala Gly Phe Ala Ile Leu Lys Cys Asn Asp Lys
210 215 220
Lys Phe Asn Gly Lys Gly Pro Cys Thr Asn Val Ser Thr Val Gln Cys
225 230 235 240
Thr His Gly Ile Lys Pro Val Val Ser Thr Gln Leu Leu Leu Asn Gly
245 250 255
Ser Leu Ala Glu Glu Glu Val Val Ile Arg Ser Glu Asn Phe Thr Asn
260 265 270
Asn Ala Lys Thr Ile Ile Val Gln Leu Asn Glu Thr Val Glu Ile Asn
275 280 285
Cys Thr Arg Pro Ser Asn Asn Thr Arg Lys Ser Ile Thr Ile Gly Pro
290 295 300
Gly Arg Ala Phe Phe Thr Thr Gly Asp Val Ile Gly Asn Ile Arg Gln
305 310 315 320
Ala Tyr Cys Asn Val Ser Arg Ala Lys Trp Asn Asn Thr Leu Arg Gln
325 330 335
Ile Val Thr Lys Leu Arg Glu Gln Phe Glu Asn Lys Thr Ile Ile Phe
340 345 350
Lys Ser Ser Ser Gly Gly Asp Pro Glu Ile Val Thr His Thr Phe Asn
355 360 365
Cys Gly Gly Glu Phe Phe Tyr Cys Asn Thr Thr Pro Leu Phe Asn Ser
370 375 380
Thr Trp Asp Asn Ser Thr Trp Asp Trp Asn Asn Thr Glu Glu Ser Asn
385 390 395 400
Ser Thr Pro Ile Val Leu Thr Cys Arg Ile Lys Gln Ile Val Asn Met
405 410 415
Trp Gln Glu Val Gly Lys Ala Met Tyr Ala Pro Pro Ile Arg Gly Gln
420 425 430
Ile Trp Cys Ser Ser Asn Ile Thr Gly Leu Leu Leu Thr Arg Asp Gly
435 440 445
Gly Ile Asn Asn Thr Gly Asn Glu Thr Phe Arg Pro Ala Gly Gly Asp
450 455 460
Met Arg Asp Asn Trp Arg Ser Glu Ser Tyr Lys Tyr Lys Val Val Arg
465 470 475 480
Ile Glu Pro Leu Gly Val Ala Pro Thr Lys Ala Lys Arg Arg Val Val
485 490 495
Gln Arg Glu Lys Arg
500
<210> 133
<211> 1640
<212> DNA
<213> Human immunodeficiency virus type 1
<400> 133
agaaagagca gaagacagtg gcaatgaaag cgaaggagac caggaagaac tatcagcgct 60
tgtggagatg gggcacgatg ctccttggga tgttgatgat ctgtagtgct acagaaaaat 120
tgtgggtcac agtctactat ggggtacctg tgtggaaaga agcaactacc actctatttt 180
gtgcatcaga tgctaaagca tatgatacgg aggtacataa tgtttgggcc acacatgcct 240
gtgtacccac agaccccaac ccacaagaag tagtattggt aaatgtaaca gaaaagttta 300
acatgtggaa aaataacatg gtagaacaaa tgcatgagga cataatcaat ctatgggatc 360
aaagcctaaa gccatgtgta aaattaaccc cgctctgtat tactttaaat tgctctgatg 420
ttagaaattg cactgagcag gggaatgata ctgccgctag tacttgtatt gattggaaga 480
ctaatggtag tgagaaagtg atggagaaag gagaaataaa aaactgctct ttcaatatta 540
caacaaacat aagggacaag gtgaaggaag agtatgcact tttttataaa attgatatag 600
caccaataga taatgatact actagctata ggttgataaa ttgtaacacc tcagtcatta 660
cacaggcctg tccaaaggta tcctttgagc caattcccat acattattgt gccccggctg 720
gttttgcgat tctaaaatgt ggagataaga agttcaatgg aacaggacta tgtaaaaatg 780
tcagcacagt acaatgtaca catggaatta ggccagtagt gtcaactcaa ctgctgttaa 840
acggcagtct agcagaagaa gatgtaataa ttagatctgc caatttcaca gacaatgcta 900
aaaacataat agtacagctg aaggaatctg tagaaattac ttgtataaga cccaacaata 960
caagaaaaag tattcatata ggaccaggga aaacattttt tacaacagaa ataataggaa 1020
atataagaca agcatattgt acccttgatg gaacaaaatg gaataacact ttagcacaga 1080
tagttgaaca attaagggga caatttggaa ataaaacaat agactttaag caaccctcag 1140
gaggggaccc agaagttata atgcacaagt ttaattgtgg aggggaattt ttctactgta 1200
attcaacaca actgtttaat agtacttggc cactaaatgg tactagatca ggtggcactg 1260
aaggaagcac tgaaggaaat atcacactcc catgtaaaat aaaacaaatt ataaacaggt 1320
ggcaggaagt aggaaaagca atgtatgccc ctcccatcaa aggaataatt agatgttcat 1380
cgaatattac agggctgata ttaacaagag atggtgggga gggcaagaac gataccaaca 1440
gtaccgagat cttcagacct ggaggaggag atatgaggga caattggaga agtgaattat 1500
ataaatataa agtagtacga attgaaccat taggagtagc acccaccaag gcaaagagaa 1560
gagtggtgca gagagaaaaa agagcagtgg ggatgctagg agctatgttc cttgggttct 1620
tgggagcagc aggaagcact 1640
<210> 134
<211> 493
<212> PRT
<213> Human immunodeficiency virus type 1
<400> 134
Ser Ala Thr Glu Lys Leu Trp Val Thr Val Tyr Tyr Gly Val Pro Val
1 5 10 15
Trp Lys Glu Ala Thr Thr Thr Leu Phe Cys Ala Ser Asp Ala Lys Ala
20 25 30
Tyr Asp Thr Glu Val His Asn Val Trp Ala Thr His Ala Cys Val Pro
35 40 45
Thr Asp Pro Asn Pro Gln Glu Val Val Leu Val Asn Val Thr Glu Lys
50 55 60
Phe Asn Met Trp Lys Asn Asn Met Val Glu Gln Met His Glu Asp Ile
65 70 75 80
Ile Asn Leu Trp Asp Gln Ser Leu Lys Pro Cys Val Lys Leu Thr Pro
85 90 95
Leu Cys Ile Thr Leu Asn Cys Ser Asp Val Arg Asn Cys Thr Glu Gln
100 105 110
Gly Asn Asp Thr Ala Ala Ser Thr Cys Ile Asp Trp Lys Thr Asn Gly
115 120 125
Ser Glu Lys Val Met Glu Lys Gly Glu Ile Lys Asn Cys Ser Phe Asn
130 135 140
Ile Thr Thr Asn Ile Arg Asp Lys Val Lys Glu Glu Tyr Ala Leu Phe
145 150 155 160
Tyr Lys Ile Asp Ile Ala Pro Ile Asp Asn Asp Thr Thr Ser Tyr Arg
165 170 175
Leu Ile Asn Cys Asn Thr Ser Val Ile Thr Gln Ala Cys Pro Lys Val
180 185 190
Ser Phe Glu Pro Ile Pro Ile His Tyr Cys Ala Pro Ala Gly Phe Ala
195 200 205
Ile Leu Lys Cys Gly Asp Lys Lys Phe Asn Gly Thr Gly Leu Cys Lys
210 215 220
Asn Val Ser Thr Val Gln Cys Thr His Gly Ile Arg Pro Val Val Ser
225 230 235 240
Thr Gln Leu Leu Leu Asn Gly Ser Leu Ala Glu Glu Asp Val Ile Ile
245 250 255
Arg Ser Ala Asn Phe Thr Asp Asn Ala Lys Asn Ile Ile Val Gln Leu
260 265 270
Lys Glu Ser Val Glu Ile Thr Cys Ile Arg Pro Asn Asn Thr Arg Lys
275 280 285
Ser Ile His Ile Gly Pro Gly Lys Thr Phe Phe Thr Thr Glu Ile Ile
290 295 300
Gly Asn Ile Arg Gln Ala Tyr Cys Thr Leu Asp Gly Thr Lys Trp Asn
305 310 315 320
Asn Thr Leu Ala Gln Ile Val Glu Gln Leu Arg Gly Gln Phe Gly Asn
325 330 335
Lys Thr Ile Asp Phe Lys Gln Pro Ser Gly Gly Asp Pro Glu Val Ile
340 345 350
Met His Lys Phe Asn Cys Gly Gly Glu Phe Phe Tyr Cys Asn Ser Thr
355 360 365
Gln Leu Phe Asn Ser Thr Trp Pro Leu Asn Gly Thr Arg Ser Gly Gly
370 375 380
Thr Glu Gly Ser Thr Glu Gly Asn Ile Thr Leu Pro Cys Lys Ile Lys
385 390 395 400
Gln Ile Ile Asn Arg Trp Gln Glu Val Gly Lys Ala Met Tyr Ala Pro
405 410 415
Pro Ile Lys Gly Ile Ile Arg Cys Ser Ser Asn Ile Thr Gly Leu Ile
420 425 430
Leu Thr Arg Asp Gly Gly Glu Gly Lys Asn Asp Thr Asn Ser Thr Glu
435 440 445
Ile Phe Arg Pro Gly Gly Gly Asp Met Arg Asp Asn Trp Arg Ser Glu
450 455 460
Leu Tyr Lys Tyr Lys Val Val Arg Ile Glu Pro Leu Gly Val Ala Pro
465 470 475 480
Thr Lys Ala Lys Arg Arg Val Val Gln Arg Glu Lys Arg
485 490
<210> 135
<211> 1672
<212> DNA
<213> Human immunodeficiency virus type 1
<400> 135
agaagagcag aagacagtgg caatgaaagt gaaggagacc aggaagaatt atcagaactt 60
atggagatgg ggcatcttgc tccttgggat attaatgatc tgtagtgctg cagaaaaatt 120
gtgggtaaca gtctattatg gggtacctgt gtggagagaa gcaaacacca ctttattttg 180
tgcatcagat gctaaagcat atgatacaga agtacataat gtctgggcca cacatgcctg 240
tgtgcccaca gaccccaacc cacaagaagt agtattggga aatgtgacag aatattttaa 300
tatgtggaaa aataacatgg tagaacagat gcatgaggat ataatcagtt tatgggatca 360
aagcctaaag ccatgtgtaa aattaactcc actctgtgtt actttaaatt gcactgatgc 420
agtctgtact tcaaattgca ctaattccac tggtacgtcc actcctattc ccaccactgt 480
tagcagtgag gacaaaggag aaataaaaaa ctgctctttc aatgtcacca caagcataaa 540
agataggata cagagagaat atgcaacttt ttataagctt gatgtagtac caatagatga 600
tgatgataat actagagatg ataataatac tagtaataat aatactagta accctagtaa 660
gactctctat aggttgataa attgtaacac ctcagccctt acacaggcct gtccaaaggt 720
atcctttgaa ccaattccca tacattattg tgccccagct ggttttgcga ttctaaagtg 780
taacaataag acattcgatg gatcgggacc atgtacaaat gtcagcacag tacaatgtac 840
acatggaatt aggccagtag tgtcaactca attgctgtta aatggcagtc tagcagaagg 900
agatatagta atcagatctg aaaatttctc gaacaatgct aaaaccataa tagtacagct 960
gaaggaatct ataagcatta attgtacaag acccaacaac aatacaagaa aaagtataca 1020
tataggacaa ggaagggcat tttatacaac aggagatata ataggagata taagaaaagc 1080
acattgtaac gttagtagag aaggttggaa taacgctgta aaccgactag ttgaaaaatt 1140
aaaagaacaa tttggaaaaa gaaaaacaat aaaatttaag ccatcctcag gaggggaccc 1200
agagattgta atgcacatgt ttaattgtgg aggggagttt ttctactgta atacatcaaa 1260
actgtttaat gttgttaatg atacttggat gggatcaaat gacactggag aaatcgagct 1320
cccatgtaga ataaaacaaa tcgtaagcat gtggcaggaa gtaggaaaag caatgtatgc 1380
ccctcccatc agaggacaaa ttagatgttc atcagatatc acagggctgc tattaacaag 1440
agatggtggt aaggatgaca acaacacaat tggaaatgaa accttcagac ctggaggagg 1500
agatatgagg gacaattgga gaagtgaatt atataaatat aaagtagtaa aaattcaacc 1560
actaggaata gcacccacca aggcaaagag aagagtggtg cagagagaaa aaagagcagt 1620
gggagcacta ggagctatgt tccttgggtt cttgggagca gcaggaagca ct 1672
<210> 136
<211> 504
<212> PRT
<213> Human immunodeficiency virus type 1
<400> 136
Ser Ala Ala Glu Lys Leu Trp Val Thr Val Tyr Tyr Gly Val Pro Val
1 5 10 15
Trp Arg Glu Ala Asn Thr Thr Leu Phe Cys Ala Ser Asp Ala Lys Ala
20 25 30
Tyr Asp Thr Glu Val His Asn Val Trp Ala Thr His Ala Cys Val Pro
35 40 45
Thr Asp Pro Asn Pro Gln Glu Val Val Leu Gly Asn Val Thr Glu Tyr
50 55 60
Phe Asn Met Trp Lys Asn Asn Met Val Glu Gln Met His Glu Asp Ile
65 70 75 80
Ile Ser Leu Trp Asp Gln Ser Leu Lys Pro Cys Val Lys Leu Thr Pro
85 90 95
Leu Cys Val Thr Leu Asn Cys Thr Asp Ala Val Cys Thr Ser Asn Cys
100 105 110
Thr Asn Ser Thr Gly Thr Ser Thr Pro Ile Pro Thr Thr Val Ser Ser
115 120 125
Glu Asp Lys Gly Glu Ile Lys Asn Cys Ser Phe Asn Val Thr Thr Ser
130 135 140
Ile Lys Asp Arg Ile Gln Arg Glu Tyr Ala Thr Phe Tyr Lys Leu Asp
145 150 155 160
Val Val Pro Ile Asp Asp Asp Asp Asn Thr Arg Asp Asp Asn Asn Thr
165 170 175
Ser Asn Asn Asn Thr Ser Asn Pro Ser Lys Thr Leu Tyr Arg Leu Ile
180 185 190
Asn Cys Asn Thr Ser Ala Leu Thr Gln Ala Cys Pro Lys Val Ser Phe
195 200 205
Glu Pro Ile Pro Ile His Tyr Cys Ala Pro Ala Gly Phe Ala Ile Leu
210 215 220
Lys Cys Asn Asn Lys Thr Phe Asp Gly Ser Gly Pro Cys Thr Asn Val
225 230 235 240
Ser Thr Val Gln Cys Thr His Gly Ile Arg Pro Val Val Ser Thr Gln
245 250 255
Leu Leu Leu Asn Gly Ser Leu Ala Glu Gly Asp Ile Val Ile Arg Ser
260 265 270
Glu Asn Phe Ser Asn Asn Ala Lys Thr Ile Ile Val Gln Leu Lys Glu
275 280 285
Ser Ile Ser Ile Asn Cys Thr Arg Pro Asn Asn Asn Thr Arg Lys Ser
290 295 300
Ile His Ile Gly Gln Gly Arg Ala Phe Tyr Thr Thr Gly Asp Ile Ile
305 310 315 320
Gly Asp Ile Arg Lys Ala His Cys Asn Val Ser Arg Glu Gly Trp Asn
325 330 335
Asn Ala Val Asn Arg Leu Val Glu Lys Leu Lys Glu Gln Phe Gly Lys
340 345 350
Arg Lys Thr Ile Lys Phe Lys Pro Ser Ser Gly Gly Asp Pro Glu Ile
355 360 365
Val Met His Met Phe Asn Cys Gly Gly Glu Phe Phe Tyr Cys Asn Thr
370 375 380
Ser Lys Leu Phe Asn Val Val Asn Asp Thr Trp Met Gly Ser Asn Asp
385 390 395 400
Thr Gly Glu Ile Glu Leu Pro Cys Arg Ile Lys Gln Ile Val Ser Met
405 410 415
Trp Gln Glu Val Gly Lys Ala Met Tyr Ala Pro Pro Ile Arg Gly Gln
420 425 430
Ile Arg Cys Ser Ser Asp Ile Thr Gly Leu Leu Leu Thr Arg Asp Gly
435 440 445
Gly Lys Asp Asp Asn Asn Thr Ile Gly Asn Glu Thr Phe Arg Pro Gly
450 455 460
Gly Gly Asp Met Arg Asp Asn Trp Arg Ser Glu Leu Tyr Lys Tyr Lys
465 470 475 480
Val Val Lys Ile Gln Pro Leu Gly Ile Ala Pro Thr Lys Ala Lys Arg
485 490 495
Arg Val Val Gln Arg Glu Lys Arg
500
<210> 137
<211> 1533
<212> DNA
<213> Human immunodeficiency virus type 1
<400> 137
atgagagtga aggagaaata tcagcacttg tggagatggg ggtggagatg gggcaccatg 60
ctccttggga tgttgatgat ctgtagtgct acagaaaaat tgtgggtcac agtctattat 120
ggggtacctg tgtggaagga agcaaccacc actctatttt gtgcatcaga tgctaaagca 180
tatgatacag aggtacataa tgtttgggcc acacatgcct gtgtacccac agaccccaac 240
ccacaagaag tagtattggt aaatgtgaca gaaaatttta acatgtggaa aaatgacatg 300
gtagaacaga tgcatgagga tataatcagt ttatgggatc aaagcctaaa gccatgtgta 360
aaattaaccc cactctgtgt tagtttaaag tgcactgatt tgaagaatga tactaatacc 420
aatagtagta gcgggagaat gataatggag aaaggagaga taaaaaactg ctctttcaat 480
atcagcacaa gcataagagg taaggtgcag aaagaatatg cattttttta taaacttgat 540
ataataccaa tagataatga tactaccagc tataagttga caagttgtaa cacctcagtc 600
attacacagg cctgtccaaa ggtatccttt gagccaattc ccatacatta ttgtgccccg 660
gctggttttg cgattctaaa atgtaataat aagacgttca atggaacagg accatgtaca 720
aatgtcagca cagtacaatg tacacatgga attaggccag tagtatcaac tcaactgctg 780
ttaaatggca gtctagcaga agaagaggta gtaattagat ctgtcaattt cacggacaat 840
gctaaaacca taatagtaca gctgaacaca tctgtagaaa ttaattgtac aagacccaac 900
aacaatacaa gaaaaagaat ccgtatccag agaggaccag ggagagcatt tgttacaata 960
ggaaaaatag gaaatatgag acaagcacat tgtaacatta gtagagcaaa atggaataac 1020
actttaaaac agatagctag caaattaaga gaacaatttg gaaataataa aacaataatc 1080
tttaagcaat cctcaggagg ggacccagaa attgtaacgc acagttttaa ttgtggaggg 1140
gaatttttct actgtaattc aacacaactg tttaatagta cttggtttaa tagtacttgg 1200
agtactgaag ggtcaaataa cactgaagga agtgacacaa tcaccctccc atgcagaata 1260
aaacaaatta taaacatgtg gcagaaagta ggaaaagcaa tgtatgcccc tcccatcagt 1320
ggacaaatta gatgttcatc aaatattaca gggctgctat taacaagaga tggtggtaat 1380
agcaacaatg agtccgagat cttcagacct ggaggaggag atatgaggga caattggaga 1440
agtgaattat ataaatataa agtagtaaaa attgaaccat taggagtagc acccaccaag 1500
gcaaagagaa gagtggtgca gagagaaaaa aga 1533
<210> 138
<211> 511
<212> PRT
<213> Human immunodeficiency virus type 1
<400> 138
Met Arg Val Lys Glu Lys Tyr Gln His Leu Trp Arg Trp Gly Trp Arg
1 5 10 15
Trp Gly Thr Met Leu Leu Gly Met Leu Met Ile Cys Ser Ala Thr Glu
20 25 30
Lys Leu Trp Val Thr Val Tyr Tyr Gly Val Pro Val Trp Lys Glu Ala
35 40 45
Thr Thr Thr Leu Phe Cys Ala Ser Asp Ala Lys Ala Tyr Asp Thr Glu
50 55 60
Val His Asn Val Trp Ala Thr His Ala Cys Val Pro Thr Asp Pro Asn
65 70 75 80
Pro Gln Glu Val Val Leu Val Asn Val Thr Glu Asn Phe Asn Met Trp
85 90 95
Lys Asn Asp Met Val Glu Gln Met His Glu Asp Ile Ile Ser Leu Trp
100 105 110
Asp Gln Ser Leu Lys Pro Cys Val Lys Leu Thr Pro Leu Cys Val Ser
115 120 125
Leu Lys Cys Thr Asp Leu Lys Asn Asp Thr Asn Thr Asn Ser Ser Ser
130 135 140
Gly Arg Met Ile Met Glu Lys Gly Glu Ile Lys Asn Cys Ser Phe Asn
145 150 155 160
Ile Ser Thr Ser Ile Arg Gly Lys Val Gln Lys Glu Tyr Ala Phe Phe
165 170 175
Tyr Lys Leu Asp Ile Ile Pro Ile Asp Asn Asp Thr Thr Ser Tyr Lys
180 185 190
Leu Thr Ser Cys Asn Thr Ser Val Ile Thr Gln Ala Cys Pro Lys Val
195 200 205
Ser Phe Glu Pro Ile Pro Ile His Tyr Cys Ala Pro Ala Gly Phe Ala
210 215 220
Ile Leu Lys Cys Asn Asn Lys Thr Phe Asn Gly Thr Gly Pro Cys Thr
225 230 235 240
Asn Val Ser Thr Val Gln Cys Thr His Gly Ile Arg Pro Val Val Ser
245 250 255
Thr Gln Leu Leu Leu Asn Gly Ser Leu Ala Glu Glu Glu Val Val Ile
260 265 270
Arg Ser Val Asn Phe Thr Asp Asn Ala Lys Thr Ile Ile Val Gln Leu
275 280 285
Asn Thr Ser Val Glu Ile Asn Cys Thr Arg Pro Asn Asn Asn Thr Arg
290 295 300
Lys Arg Ile Arg Ile Gln Arg Gly Pro Gly Arg Ala Phe Val Thr Ile
305 310 315 320
Gly Lys Ile Gly Asn Met Arg Gln Ala His Cys Asn Ile Ser Arg Ala
325 330 335
Lys Trp Asn Asn Thr Leu Lys Gln Ile Ala Ser Lys Leu Arg Glu Gln
340 345 350
Phe Gly Asn Asn Lys Thr Ile Ile Phe Lys Gln Ser Ser Gly Gly Asp
355 360 365
Pro Glu Ile Val Thr His Ser Phe Asn Cys Gly Gly Glu Phe Phe Tyr
370 375 380
Cys Asn Ser Thr Gln Leu Phe Asn Ser Thr Trp Phe Asn Ser Thr Trp
385 390 395 400
Ser Thr Glu Gly Ser Asn Asn Thr Glu Gly Ser Asp Thr Ile Thr Leu
405 410 415
Pro Cys Arg Ile Lys Gln Ile Ile Asn Met Trp Gln Lys Val Gly Lys
420 425 430
Ala Met Tyr Ala Pro Pro Ile Ser Gly Gln Ile Arg Cys Ser Ser Asn
435 440 445
Ile Thr Gly Leu Leu Leu Thr Arg Asp Gly Gly Asn Ser Asn Asn Glu
450 455 460
Ser Glu Ile Phe Arg Pro Gly Gly Gly Asp Met Arg Asp Asn Trp Arg
465 470 475 480
Ser Glu Leu Tyr Lys Tyr Lys Val Val Lys Ile Glu Pro Leu Gly Val
485 490 495
Ala Pro Thr Lys Ala Lys Arg Arg Val Val Gln Arg Glu Lys Arg
500 505 510
<210> 139
<211> 511
<212> PRT
<213> Artificial
<220>
<223> HIV-1 HXB2 gp120 with Cys mutations of U-101c1
<400> 139
Met Arg Val Lys Glu Lys Tyr Gln His Leu Trp Arg Trp Gly Trp Arg
1 5 10 15
Trp Gly Thr Met Leu Leu Gly Met Leu Met Ile Cys Ser Ala Thr Glu
20 25 30
Lys Leu Trp Val Thr Val Tyr Tyr Gly Val Pro Val Trp Lys Glu Ala
35 40 45
Thr Thr Thr Leu Phe Arg Ala Ser Asp Ala Lys Ala Tyr Asp Thr Glu
50 55 60
Val His Asn Val Trp Ala Thr His Ala Cys Val Pro Thr Asp Pro Asn
65 70 75 80
Pro Gln Glu Val Val Leu Val Asn Val Thr Glu Asn Phe Asn Met Trp
85 90 95
Lys Asn Asp Met Val Glu Gln Met His Glu Asp Ile Ile Ser Leu Trp
100 105 110
Asp Gln Ser Leu Lys Pro Cys Val Lys Leu Thr Pro Leu Arg Val Ser
115 120 125
Leu Lys Cys Thr Asp Leu Lys Asn Asp Thr Asn Thr Asn Ser Ser Ser
130 135 140
Gly Arg Met Ile Met Glu Lys Gly Glu Ile Lys Asn Cys Ser Phe Asn
145 150 155 160
Ile Ser Thr Ser Ile Arg Gly Lys Val Gln Lys Glu Tyr Ala Phe Phe
165 170 175
Tyr Lys Leu Asp Ile Ile Pro Ile Asp Asn Asp Thr Thr Ser Tyr Lys
180 185 190
Leu Thr Ser Cys Asn Thr Ser Val Ile Thr Gln Ala Cys Pro Lys Val
195 200 205
Ser Phe Glu Pro Ile Pro Ile His Tyr Cys Ala Pro Ala Gly Phe Ala
210 215 220
Ile Leu Lys Cys Asn Asn Lys Thr Phe Asn Gly Thr Gly Pro Cys Thr
225 230 235 240
Asn Val Ser Thr Val Gln Cys Thr His Gly Ile Arg Pro Val Val Ser
245 250 255
Thr Gln Leu Leu Leu Asn Gly Ser Leu Ala Glu Glu Glu Val Val Ile
260 265 270
Arg Ser Val Asn Phe Thr Asp Asn Ala Lys Thr Ile Ile Val Gln Leu
275 280 285
Asn Thr Ser Val Glu Ile Asn Cys Thr Arg Pro Asn Asn Asn Thr Arg
290 295 300
Lys Arg Ile Arg Ile Gln Arg Gly Pro Gly Arg Ala Phe Val Thr Ile
305 310 315 320
Gly Lys Ile Gly Asn Met Arg Gln Ala His Cys Asn Ile Ser Arg Ala
325 330 335
Lys Trp Asn Asn Thr Leu Lys Gln Ile Ala Ser Lys Leu Arg Glu Gln
340 345 350
Phe Gly Asn Asn Lys Thr Ile Ile Phe Lys Gln Ser Ser Gly Gly Asp
355 360 365
Pro Glu Ile Val Thr His Ser Phe Asn Cys Gly Gly Glu Phe Phe Tyr
370 375 380
Cys Asn Ser Thr Gln Leu Phe Asn Ser Thr Trp Phe Asn Ser Thr Trp
385 390 395 400
Ser Thr Glu Gly Ser Asn Asn Thr Glu Gly Ser Asp Thr Ile Thr Leu
405 410 415
Pro Cys Arg Ile Lys Gln Ile Ile Asn Met Trp Gln Lys Val Gly Lys
420 425 430
Ala Met Tyr Ala Pro Pro Ile Ser Gly Gln Ile Arg Cys Ser Ser Asn
435 440 445
Ile Thr Gly Leu Leu Leu Thr Arg Asp Gly Gly Asn Ser Asn Asn Glu
450 455 460
Ser Glu Ile Phe Arg Pro Gly Gly Gly Asp Met Arg Asp Asn Trp Arg
465 470 475 480
Ser Glu Leu Tyr Lys Tyr Lys Val Val Lys Ile Glu Pro Leu Gly Val
485 490 495
Ala Pro Thr Lys Ala Lys Arg Arg Val Val Gln Arg Glu Lys Arg
500 505 510
<210> 140
<211> 511
<212> PRT
<213> Artificial
<220>
<223> HIV-1 HXB2 gp120 with Cys mutations of U-178c13
<400> 140
Met Arg Val Lys Glu Lys Tyr Gln His Leu Trp Arg Trp Gly Trp Arg
1 5 10 15
Trp Gly Thr Met Leu Leu Gly Met Leu Met Ile Cys Ser Ala Thr Glu
20 25 30
Lys Leu Trp Val Thr Val Tyr Tyr Gly Val Pro Val Trp Lys Glu Ala
35 40 45
Thr Thr Thr Leu Phe Cys Ala Ser Asp Ala Lys Ala Tyr Asp Thr Glu
50 55 60
Val His Asn Val Trp Ala Thr His Ala Cys Val Pro Thr Asp Pro Asn
65 70 75 80
Pro Gln Glu Val Val Leu Val Asn Val Thr Glu Asn Phe Asn Met Trp
85 90 95
Lys Asn Asp Met Val Glu Gln Met His Glu Asp Ile Ile Ser Leu Trp
100 105 110
Asp Gln Ser Leu Lys Pro Cys Val Lys Leu Thr Pro Leu Cys Val Ser
115 120 125
Leu Lys Arg Thr Asp Leu Lys Asn Asp Thr Asn Thr Asn Ser Ser Ser
130 135 140
Gly Arg Met Ile Met Glu Lys Gly Glu Ile Lys Asn Cys Ser Phe Asn
145 150 155 160
Ile Ser Thr Ser Ile Arg Gly Lys Val Gln Lys Glu Tyr Ala Phe Phe
165 170 175
Tyr Lys Leu Asp Ile Ile Pro Ile Asp Asn Asp Thr Thr Ser Tyr Lys
180 185 190
Leu Thr Ser Cys Asn Thr Ser Val Ile Thr Gln Ala Cys Pro Lys Val
195 200 205
Ser Phe Glu Pro Ile Pro Ile His Tyr Cys Ala Pro Ala Gly Phe Ala
210 215 220
Ile Leu Lys Tyr Asn Asn Lys Thr Phe Asn Gly Thr Gly Pro Cys Thr
225 230 235 240
Asn Val Ser Thr Val Gln Cys Thr His Gly Ile Arg Pro Val Val Ser
245 250 255
Thr Gln Leu Leu Leu Asn Gly Ser Leu Ala Glu Glu Glu Val Val Ile
260 265 270
Arg Ser Val Asn Phe Thr Asp Asn Ala Lys Thr Ile Ile Val Gln Leu
275 280 285
Asn Thr Ser Val Glu Ile Asn Cys Thr Arg Pro Asn Asn Asn Thr Arg
290 295 300
Lys Arg Ile Arg Ile Gln Arg Gly Pro Gly Arg Ala Phe Val Thr Ile
305 310 315 320
Gly Lys Ile Gly Asn Met Arg Gln Ala His Cys Asn Ile Ser Arg Ala
325 330 335
Lys Trp Asn Asn Thr Leu Lys Gln Ile Ala Ser Lys Leu Arg Glu Gln
340 345 350
Phe Gly Asn Asn Lys Thr Ile Ile Phe Lys Gln Ser Ser Gly Gly Asp
355 360 365
Pro Glu Ile Val Thr His Ser Phe Asn Cys Gly Gly Glu Phe Phe Tyr
370 375 380
Cys Asn Ser Thr Gln Leu Phe Asn Ser Thr Trp Phe Asn Ser Thr Trp
385 390 395 400
Ser Thr Glu Gly Ser Asn Asn Thr Glu Gly Ser Asp Thr Ile Thr Leu
405 410 415
Pro Cys Arg Ile Lys Gln Ile Ile Asn Met Trp Gln Lys Val Gly Lys
420 425 430
Ala Met Tyr Ala Pro Pro Ile Ser Gly Gln Ile Arg Cys Ser Ser Asn
435 440 445
Ile Thr Gly Leu Leu Leu Thr Arg Asp Gly Gly Asn Ser Asn Asn Glu
450 455 460
Ser Glu Ile Phe Arg Pro Gly Gly Gly Asp Met Arg Asp Asn Trp Arg
465 470 475 480
Ser Glu Leu Tyr Lys Tyr Lys Val Val Lys Ile Glu Pro Leu Gly Val
485 490 495
Ala Pro Thr Lys Ala Lys Arg Arg Val Val Gln Arg Glu Lys Arg
500 505 510
SEQUENCE LISTING
<110> VaxGen, Inc.
Berman, Phillip W
Jobes, David V
<120> HIV-1 ENVELOPE GLYCOPROTEINS HAVING UNUSUAL DISULFIDE STRUCTURE
<130> 409J-000210US / PC
<140> US 10 / 866,527 and PCT / US 04/018672
<141> 2004-06-10
<160> 140
<170> PatentIn version 3.2
<210> 1
<211> 1613
<212> DNA
<213> Human immunodeficiency virus type 1
<400> 1
agaaagagca gaagacagtg gcaatgagag tgaaggggat caggaggaat tgtcagcgct 60
ggtggagatg gggcatcatg ctccttggaa tgttgatgat ctgtaatgct gtaggacaat 120
tgtgggttac ggtctattat ggggtacctg tgtggaaaga agccaccacc actctattcc 180
gtgcatcaga tgctaaagca tatgatacag aggtacataa tgtctgggcc acacatgcct 240
gtgtacccac agaccccaac ccacaagaaa tagaattgga aaatgtgaca gaaactttta 300
acatgtggaa aaataacatg gtagaacaaa tgcatgagga tataatcagt ttatgggatc 360
aaggcctaaa accatgtgta aaattaaccc cactccgtgt tactttaaat tgcactgact 420
acaagaatgc taatagtacc aataataata gtaccagtga tagtagcaat ctagaagagg 480
agaaaggaga aataaaaaac tgctctttca atatcactac aagcataaaa gataggatgc 540
agaaagaata tgcacttttt tataaacttg atatagtacc aatagataat aataatacta 600
gatataggat gataagttgt aacacctcag tcattacaca ggcctgtcca aaggtatctt 660
ttgagccaat tcccatacat tattgtgccc cggctggttt tgcgattcta aaatgtaagg 720
ataagaagtt caatggaaca ggaccatgta gaaatgtcag cacagtacaa tgtacacatg 780
gaattaggcc agtagtatca actcaattgc tgttaaatgg cagtctagca gaagaagagg 840
tagtacttag atctgaaaat ttcacgaaca atgctaaaac cataatagta cagctaaaag 900
aacctataaa aatcaattgt acaagaccca acaacaatac aagaaaaagt atacatatag 960
gaccagggag agcattttat acaacagggg agataatagg agacataaga caagcacatt 1020
gtagcattag taaggtagaa tggaacaaca ctttgataca aatagttgaa aaattaagag 1080
aacaatttgg gactaaaaca ataaatttta ctaaaccctc aggaggggac ctagaaattg 1140
taacgcacag ctttaattgt agaggggaat ttttctactg taataccaca aaactgttta 1200
atagtacttg gcctgggaat attacttgga ctcggaataa taatgttact acagaaaata 1260
tcacactccc atgcagaata aaacaaattg tgaacagatg gcaggaagta ggaaaagcaa 1320
tgtatgcccc tcccatccaa ggacaaatta gatgttcatc aaatattaca gggctgctat 1380
taacaagaga tggtggtggg gaccagaata gcacagggga gatcttcaga cctggaggag 1440
gagatatgag ggacaattgg agaagtgaac tatacaaata taaagtagta caaattgaac 1500
cattaggaat agcacccacc aaggcaagga gaagagtggt gcagagagaa aaaagagcag 1560
tgggaacatt aggagctatg ttccttgggt tcttgggagc agcaggaagc act 1613
<210> 2
<211> 484
<212> PRT
<213> Human immunodeficiency virus type 1
<400> 2
Asn Ala Val Gly Gln Leu Trp Val Thr Val Tyr Tyr Gly Val Pro Val
1 5 10 15
Trp Lys Glu Ala Thr Thr Thr Leu Phe Arg Ala Ser Asp Ala Lys Ala
20 25 30
Tyr Asp Thr Glu Val His Asn Val Trp Ala Thr His Ala Cys Val Pro
35 40 45
Thr Asp Pro Asn Pro Gln Glu Ile Glu Leu Glu Asn Val Thr Glu Thr
50 55 60
Phe Asn Met Trp Lys Asn Asn Met Val Glu Gln Met His Glu Asp Ile
65 70 75 80
Ile Ser Leu Trp Asp Gln Gly Leu Lys Pro Cys Val Lys Leu Thr Pro
85 90 95
Leu Arg Val Thr Leu Asn Cys Thr Asp Tyr Lys Asn Ala Asn Ser Thr
100 105 110
Asn Asn Asn Ser Thr Ser Asp Ser Ser Asn Leu Glu Glu Glu Lys Gly
115 120 125
Glu Ile Lys Asn Cys Ser Phe Asn Ile Thr Thr Ser Ile Lys Asp Arg
130 135 140
Met Gln Lys Glu Tyr Ala Leu Phe Tyr Lys Leu Asp Ile Val Pro Ile
145 150 155 160
Asp Asn Asn Asn Thr Arg Tyr Arg Met Ile Ser Cys Asn Thr Ser Val
165 170 175
Ile Thr Gln Ala Cys Pro Lys Val Ser Phe Glu Pro Ile Pro Ile His
180 185 190
Tyr Cys Ala Pro Ala Gly Phe Ala Ile Leu Lys Cys Lys Asp Lys Lys
195 200 205
Phe Asn Gly Thr Gly Pro Cys Arg Asn Val Ser Thr Val Gln Cys Thr
210 215 220
His Gly Ile Arg Pro Val Val Ser Thr Gln Leu Leu Leu Asn Gly Ser
225 230 235 240
Leu Ala Glu Glu Glu Val Val Leu Arg Ser Glu Asn Phe Thr Asn Asn
245 250 255
Ala Lys Thr Ile Ile Val Gln Leu Lys Glu Pro Ile Lys Ile Asn Cys
260 265 270
Thr Arg Pro Asn Asn Asn Thr Arg Lys Ser Ile His Ile Gly Pro Gly
275 280 285
Arg Ala Phe Tyr Thr Thr Gly Glu Ile Ile Gly Asp Ile Arg Gln Ala
290 295 300
His Cys Ser Ile Ser Lys Val Glu Trp Asn Asn Thr Leu Ile Gln Ile
305 310 315 320
Val Glu Lys Leu Arg Glu Gln Phe Gly Thr Lys Thr Ile Asn Phe Thr
325 330 335
Lys Pro Ser Gly Gly Asp Leu Glu Ile Val Thr His Ser Phe Asn Cys
340 345 350
Arg Gly Glu Phe Phe Tyr Cys Asn Thr Thr Lys Leu Phe Asn Ser Thr
355 360 365
Trp Pro Gly Asn Ile Thr Trp Thr Arg Asn Asn Asn Val Thr Thr Glu
370 375 380
Asn Ile Thr Leu Pro Cys Arg Ile Lys Gln Ile Val Asn Arg Trp Gln
385 390 395 400
Glu Val Gly Lys Ala Met Tyr Ala Pro Pro Ile Gln Gly Gln Ile Arg
405 410 415
Cys Ser Ser Asn Ile Thr Gly Leu Leu Leu Thr Arg Asp Gly Gly Gly
420 425 430
Asp Gln Asn Ser Thr Gly Glu Ile Phe Arg Pro Gly Gly Gly Asp Met
435 440 445
Arg Asp Asn Trp Arg Ser Glu Leu Tyr Lys Tyr Lys Val Val Gln Ile
450 455 460
Glu Pro Leu Gly Ile Ala Pro Thr Lys Ala Arg Arg Arg Val Val Gln
465 470 475 480
Arg Glu Lys Arg
<210> 3
<211> 1604
<212> DNA
<213> Human immunodeficiency virus type 1
<400> 3
agaaagagca gaagacagtg gcaatgaaag tgacggggat caggaagaat tgtcagcgct 60
tgtggagatg gggcatgatg ctcctgggga tgttaatgat ctgtagtgct gcagaacaat 120
tgtgggtcac agtctattat ggggtacctg tgtggaaaga agcaaccacc actctatttt 180
gtgcatcaga tgctaaagca tatgacgcag agaaacataa tgtttgggcc acacatgcct 240
gcgtacccac agaccccaac ccacaagaaa tagtattgga aaatgtgaca gaatatttta 300
atgcttggaa aaataacatg gtagaacaga tgcatgagga tataatcagt ttatgggatc 360
aaagcctaaa accatgtgta aaattaaccc cactctgtgt tactttaaat cgcactgatt 420
tgaataatag tactaacacc actaatagta atagcagcgg ggggatgatg agagaagaaa 480
tgaaaaactg ctctttcaat atcaccacaa caataggtga taggaggcaa aaagaatatg 540
cactttttta taaacttgat atagcatcaa taaaggatga tgctaataat ttcacatata 600
ggttgataag ttgtaacacc tcagtcatta cacaagcctg tccaaagata tcctttgagc 660
caattcccat acattattgt gccccggctg gttttgcaat tctaaagtat aacgataaga 720
agttcaatgg agaagagcaa tgtaaaaatg tcagcacagt acaatgtaca catggaatta 780
agccagtagt atcaactcag ctgctgttaa atggtagtct agcagaagaa gagatagtaa 840
ttagatctga caatttcaca gacaatgcta aaaccataac agtacagctg aatgaatctg 900
tagtaattaa ttgtacaaga ccccacaaca atacaagaaa aagtataaat ataggaccag 960
ggagagcatg gtatacaaca ggagaaataa taggagatat aagacaagca cattgtaaca 1020
ttagtaaaac acaatggaat aacactttaa taaagatagt taaaaaatta agagaacaat 1080
ttaatacaaa caccataatc tttaatcaat ccacaggagg ggacctagaa attgtaatgc 1140
acagttttaa ttgtggaggg gaatttttct actgtgatac aacacaactg tttaatagta 1200
cttggaatat tactggagaa agtacttgga atagtactgg aaaaacaaat gaaactatca 1260
cactcccatg tagaataaaa caagttataa acatgtggca gcaagtaggg aaagcaatgt 1320
atgcccctcc catcaaaggg caaattagat gttcatcaaa tattacaggg ctgctattaa 1380
caagagatgg tggtaagaac agcagtaacg ggactgagac ctttagacct ggaggaggag 1440
atatgaggga caattggaga agtgaattat ataaatataa agtagtagaa attgaaccat 1500
taggaatagc acccactaag gcaaagagaa gagtggtgca gagagaaaga agagcagtaa 1560
taggagctat gttccttggg ttcttgggag cagcaggaag cact 1604
<210> 4
<211> 483
<212> PRT
<213> Human immunodeficiency virus type 1
<400> 4
Ser Ala Ala Glu Gln Leu Trp Val Thr Val Tyr Tyr Gly Val Pro Val
1 5 10 15
Trp Lys Glu Ala Thr Thr Thr Leu Phe Cys Ala Ser Asp Ala Lys Ala
20 25 30
Tyr Asp Ala Glu Lys His Asn Val Trp Ala Thr His Ala Cys Val Pro
35 40 45
Thr Asp Pro Asn Pro Gln Glu Ile Val Leu Glu Asn Val Thr Glu Tyr
50 55 60
Phe Asn Ala Trp Lys Asn Asn Met Val Glu Gln Met His Glu Asp Ile
65 70 75 80
Ile Ser Leu Trp Asp Gln Ser Leu Lys Pro Cys Val Lys Leu Thr Pro
85 90 95
Leu Cys Val Thr Leu Asn Arg Thr Asp Leu Asn Asn Ser Thr Asn Thr
100 105 110
Thr Asn Ser Asn Ser Ser Gly Gly Met Met Arg Glu Glu Met Lys Asn
115 120 125
Cys Ser Phe Asn Ile Thr Thr Thr Ile Gly Asp Arg Arg Gln Lys Glu
130 135 140
Tyr Ala Leu Phe Tyr Lys Leu Asp Ile Ala Ser Ile Lys Asp Asp Ala
145 150 155 160
Asn Asn Phe Thr Tyr Arg Leu Ile Ser Cys Asn Thr Ser Val Ile Thr
165 170 175
Gln Ala Cys Pro Lys Ile Ser Phe Glu Pro Ile Pro Ile His Tyr Cys
180 185 190
Ala Pro Ala Gly Phe Ala Ile Leu Lys Tyr Asn Asp Lys Lys Phe Asn
195 200 205
Gly Glu Glu Gln Cys Lys Asn Val Ser Thr Val Gln Cys Thr His Gly
210 215 220
Ile Lys Pro Val Val Ser Thr Gln Leu Leu Leu Asn Gly Ser Leu Ala
225 230 235 240
Glu Glu Glu Ile Val Ile Arg Ser Asp Asn Phe Thr Asp Asn Ala Lys
245 250 255
Thr Ile Thr Val Gln Leu Asn Glu Ser Val Val Ile Asn Cys Thr Arg
260 265 270
Pro His Asn Asn Thr Arg Lys Ser Ile Asn Ile Gly Pro Gly Arg Ala
275 280 285
Trp Tyr Thr Thr Gly Glu Ile Ile Gly Asp Ile Arg Gln Ala His Cys
290 295 300
Asn Ile Ser Lys Thr Gln Trp Asn Asn Thr Leu Ile Lys Ile Val Lys
305 310 315 320
Lys Leu Arg Glu Gln Phe Asn Thr Asn Thr Ile Ile Phe Asn Gln Ser
325 330 335
Thr Gly Gly Asp Leu Glu Ile Val Met His Ser Phe Asn Cys Gly Gly
340 345 350
Glu Phe Phe Tyr Cys Asp Thr Thr Gln Leu Phe Asn Ser Thr Trp Asn
355 360 365
Ile Thr Gly Glu Ser Thr Trp Asn Ser Thr Gly Lys Thr Asn Glu Thr
370 375 380
Ile Thr Leu Pro Cys Arg Ile Lys Gln Val Ile Asn Met Trp Gln Gln
385 390 395 400
Val Gly Lys Ala Met Tyr Ala Pro Pro Ile Lys Gly Gln Ile Arg Cys
405 410 415
Ser Ser Asn Ile Thr Gly Leu Leu Leu Thr Arg Asp Gly Gly Lys Asn
420 425 430
Ser Ser Asn Gly Thr Glu Thr Phe Arg Pro Gly Gly Gly Asp Met Arg
435 440 445
Asp Asn Trp Arg Ser Glu Leu Tyr Lys Tyr Lys Val Val Glu Ile Glu
450 455 460
Pro Leu Gly Ile Ala Pro Thr Lys Ala Lys Arg Arg Val Val Gln Arg
465 470 475 480
Glu arg arg
<210> 5
<211> 1613
<212> DNA
<213> Human immunodeficiency virus type 1
<400> 5
agaaagagca gaagacagtg gcaatgagag tgatggggat gaggaagaat tatcagcact 60
ggtggagagg gggcatcttg ctccttggga tgttaatgat cagtagtgct atagaaaatt 120
cgtgggtcac agtctattat ggggtacctg tgtggaaaga agctaccacc actttatttt 180
gtgcatcaga tgctaaagct tatgaaacag aggcacataa tgtttgggcc acacatgcct 240
gtgtacccac agaccccaac ccacaagaaa taaaattgga aaatgtgtca gaaaatttta 300
acatgtggaa aaataacatg gtagaccaaa tgcatgagga tataatcagt ttatgggatc 360
aaagcctaaa gccatgtgta aaattaaccc cactctgtgt tactttaaat tgcactgatt 420
attttgggaa tactactaat accaatacta ataccaccag tagtcccagc accaacagta 480
gtaatgaagg agaagtgaaa aaatgctctt tcaatatcac cacagaagta agggacaagg 540
tgcgaaaaga atttgcactt ttttataaac ttgatatagt acgaacaggt catgataata 600
ctagctatag gttgataagt tgtaacacct cagtcattac acaggcctgc ccaaagatat 660
cctttgatcc aattcccata cattattgtg ccccggctgg ttttgcgatt ctcaagtgta 720
gagataataa atttaatgga acaggaccat gtaaaaatgt cagcacagta caatgtactc 780
atggaattag gccagtaata tcaactcaac tactgttaaa tggcagtcta gcagaagaag 840
aggtagtagt tagatctaaa aatttcacaa acaatgctga agtcataata gtgcagctga 900
aagaatctgt acaaataaat cgtacaagac ccaacaacaa tacaaggaaa agtataccta 960
tgggtccagg gagagcatgg tatgctacag aagatatcat aggaaatata agacaggcac 1020
attgtaacat tagtggagta aaatggaata acactttaca gcaaatagtt aaaaaattaa 1080
gagagcaatt taaaaataaa acaataaagt ttcagccatc ctcaggaggg gacccagaaa 1140
ttgtaaggca cagttttaat tgtagagggg agtttttcta ctgtgataca acactactgt 1200
ttaatagtac ttggaatagt aatgatactt ggaatagtac tgaagggtca aatgacacta 1260
ttacactccc gtgtagaata aaacaaattg taaacatgtg gcaagaagta ggaaaagcaa 1320
tgtatgctcc tcccatcaaa ggacaactta actgttcatc aaatattaca gggccgatat 1380
taacaagaga tggtggtaag ggtgagaact cgaccgagaa caatactgag atattcagac 1440
ctggaggagg agatatgagg gacaattggc gaagtgaatt atataaatat aaagtagtac 1500
aaattgaacc attaggaata gcacccacta aggcaaagag aagagtggtg cagagagaaa 1560
aaagaggagc gggactgttt ttcctggggt tcttgggaac agcaggaagc act 1613
<210> 6
<211> 487
<212> PRT
<213> Human immunodeficiency virus type 1
<400> 6
Ser Ala Ile Glu Asn Ser Trp Val Thr Val Tyr Tyr Gly Val Pro Val
1 5 10 15
Trp Lys Glu Ala Thr Thr Thr Leu Phe Cys Ala Ser Asp Ala Lys Ala
20 25 30
Tyr Glu Thr Glu Ala His Asn Val Trp Ala Thr His Ala Cys Val Pro
35 40 45
Thr Asp Pro Asn Pro Gln Glu Ile Lys Leu Glu Asn Val Ser Glu Asn
50 55 60
Phe Asn Met Trp Lys Asn Asn Met Val Asp Gln Met His Glu Asp Ile
65 70 75 80
Ile Ser Leu Trp Asp Gln Ser Leu Lys Pro Cys Val Lys Leu Thr Pro
85 90 95
Leu Cys Val Thr Leu Asn Cys Thr Asp Tyr Phe Gly Asn Thr Thr Asn
100 105 110
Thr Asn Thr Asn Thr Thr Ser Ser Pro Ser Thr Asn Ser Ser Asn Glu
115 120 125
Gly Glu Val Lys Lys Cys Ser Phe Asn Ile Thr Thr Glu Val Arg Asp
130 135 140
Lys Val Arg Lys Glu Phe Ala Leu Phe Tyr Lys Leu Asp Ile Val Arg
145 150 155 160
Thr Gly His Asp Asn Thr Ser Tyr Arg Leu Ile Ser Cys Asn Thr Ser
165 170 175
Val Ile Thr Gln Ala Cys Pro Lys Ile Ser Phe Asp Pro Ile Pro Ile
180 185 190
His Tyr Cys Ala Pro Ala Gly Phe Ala Ile Leu Lys Cys Arg Asp Asn
195 200 205
Lys Phe Asn Gly Thr Gly Pro Cys Lys Asn Val Ser Thr Val Gln Cys
210 215 220
Thr His Gly Ile Arg Pro Val Ile Ser Thr Gln Leu Leu Leu Asn Gly
225 230 235 240
Ser Leu Ala Glu Glu Glu Val Val Val Arg Ser Lys Asn Phe Thr Asn
245 250 255
Asn Ala Glu Val Ile Ile Val Gln Leu Lys Glu Ser Val Gln Ile Asn
260 265 270
Arg Thr Arg Pro Asn Asn Asn Thr Arg Lys Ser Ile Pro Met Gly Pro
275 280 285
Gly Arg Ala Trp Tyr Ala Thr Glu Asp Ile Ile Gly Asn Ile Arg Gln
290 295 300
Ala His Cys Asn Ile Ser Gly Val Lys Trp Asn Asn Thr Leu Gln Gln
305 310 315 320
Ile Val Lys Lys Leu Arg Glu Gln Phe Lys Asn Lys Thr Ile Lys Phe
325 330 335
Gln Pro Ser Ser Gly Gly Asp Pro Glu Ile Val Arg His Ser Phe Asn
340 345 350
Cys Arg Gly Glu Phe Phe Tyr Cys Asp Thr Thr Leu Leu Phe Asn Ser
355 360 365
Thr Trp Asn Ser Asn Asp Thr Trp Asn Ser Thr Glu Gly Ser Asn Asp
370 375 380
Thr Ile Thr Leu Pro Cys Arg Ile Lys Gln Ile Val Asn Met Trp Gln
385 390 395 400
Glu Val Gly Lys Ala Met Tyr Ala Pro Pro Ile Lys Gly Gln Leu Asn
405 410 415
Cys Ser Ser Asn Ile Thr Gly Pro Ile Leu Thr Arg Asp Gly Gly Lys
420 425 430
Gly Glu Asn Ser Thr Glu Asn Asn Thr Glu Ile Phe Arg Pro Gly Gly
435 440 445
Gly Asp Met Arg Asp Asn Trp Arg Ser Glu Leu Tyr Lys Tyr Lys Val
450 455 460
Val Gln Ile Glu Pro Leu Gly Ile Ala Pro Thr Lys Ala Lys Arg Arg
465 470 475 480
Val Val Gln Arg Glu Lys Arg
485
<210> 7
<211> 1595
<212> DNA
<213> Human immunodeficiency virus type 1
<400> 7
agaaagagca gaagacagtg gcaatgagag tgaaggggat caggaagaat tatcagcact 60
tgtggagatg gggcatcatg ctccttggga tgttgatgat ctgtaatgct gcagaacagt 120
tgtgggtcac agtctattat ggggtacctg tgtggaggga tgcaaatacc actctatttt 180
gtgcatccga tgctaaagca tatgatacag aggtacataa tgtttgggcc acacatgcct 240
gtgtacccac agaccccaac ccacaagaag tagtattgga aaatgtgaca gaaagcttta 300
acatatggaa aaataacatg gtagaacaaa tgcatgagga tataatcagt ttgtgggatc 360
aaagcctaaa gccatgtgta aaattaaccc cactctgcgt tactttaaat tgcagtaatt 420
taagtaccac taataataat accattagtg gtaatgagac agcagtaaat aaaggagaaa 480
taaaaaaccg ctctttcaat gtcaccacaa acataagaga tagggtaaag aaagaatatg 540
cgctttttta taatcttgat ttagtacaaa taggtgattc taatactagc tatacaatgg 600
taaagtgtaa cacctcagtc attacacagg cctgtccaaa ggtacccttt gagccaattc 660
ccatacattt ttgtgcccca gctggttttg cgattctaaa gtgtaataat aagacgttca 720
gtggaaaagg agaatgtaca aatgtcagca cagtacaatg tacgcatgga attagaccag 780
tagtatcaac tcatctgctg ttaaatggca gcttagcaga agaagacata gtaattagat 840
ctgacaattt cacggacaat actaaaacca taatagtaca gctagacaat actataaaca 900
ttacttgtac cagacccaac aataatacaa ggaaaggtat acatatagga ccagggagag 960
cattttatgc aacaggggat ataataggaa atataagaca agcacattgt aaccttagta 1020
aaacacattg gaataacact ttaaaacaga tagttaaaaa attaagagaa caatttaaaa 1080
ataaaacaat agtctttaat caatctacag gaggggaccc agaaattgta cagcacactt 1140
ttaattgtag aggggaattt ttctattgta actcaacacc actgtttaat agtacttggt 1200
atcctaatag tacattggat gaaacaaaca gcacagacaa caatgaaact atcacactcc 1260
aatgcagaat aagacaaatt ataaacatgt ggcaggaagt aggaaaagca atgtatgccc 1320
ctcctatcag aggacaaatt acatgcacat caaatattac agggctgata ttaacaagag 1380
atggtggaga taacaatgaa actgagatct tcaggcctgg aggaggcaat atgaaggata 1440
attggagaag tgaattatat aaatataaag tagtaaaaat tgagccatta ggaatagcac 1500
ccactaaggc aaagagaaga gcggtgcaga gagaaaaaag agcagcggga ataggagctg 1560
tgttccttgg gttcttggga gcagcaggaa gcact 1595
<210> 8
<211> 479
<212> PRT
<213> Human immunodeficiency virus type 1
<400> 8
Asn Ala Ala Glu Gln Leu Trp Val Thr Val Tyr Tyr Gly Val Pro Val
1 5 10 15
Trp Arg Asp Ala Asn Thr Thr Leu Phe Cys Ala Ser Asp Ala Lys Ala
20 25 30
Tyr Asp Thr Glu Val His Asn Val Trp Ala Thr His Ala Cys Val Pro
35 40 45
Thr Asp Pro Asn Pro Gln Glu Val Val Leu Glu Asn Val Thr Glu Ser
50 55 60
Phe Asn Ile Trp Lys Asn Asn Met Val Glu Gln Met His Glu Asp Ile
65 70 75 80
Ile Ser Leu Trp Asp Gln Ser Leu Lys Pro Cys Val Lys Leu Thr Pro
85 90 95
Leu Cys Val Thr Leu Asn Cys Ser Asn Leu Ser Thr Thr Asn Asn Asn
100 105 110
Thr Ile Ser Gly Asn Glu Thr Ala Val Asn Lys Gly Glu Ile Lys Asn
115 120 125
Arg Ser Phe Asn Val Thr Thr Asn Ile Arg Asp Arg Val Lys Lys Glu
130 135 140
Tyr Ala Leu Phe Tyr Asn Leu Asp Leu Val Gln Ile Gly Asp Ser Asn
145 150 155 160
Thr Ser Tyr Thr Met Val Lys Cys Asn Thr Ser Val Ile Thr Gln Ala
165 170 175
Cys Pro Lys Val Pro Phe Glu Pro Ile Pro Ile His Phe Cys Ala Pro
180 185 190
Ala Gly Phe Ala Ile Leu Lys Cys Asn Asn Lys Thr Phe Ser Gly Lys
195 200 205
Gly Glu Cys Thr Asn Val Ser Thr Val Gln Cys Thr His Gly Ile Arg
210 215 220
Pro Val Val Ser Thr His Leu Leu Leu Asn Gly Ser Leu Ala Glu Glu
225 230 235 240
Asp Ile Val Ile Arg Ser Asp Asn Phe Thr Asp Asn Thr Lys Thr Ile
245 250 255
Ile Val Gln Leu Asp Asn Thr Ile Asn Ile Thr Cys Thr Arg Pro Asn
260 265 270
Asn Asn Thr Arg Lys Gly Ile His Ile Gly Pro Gly Arg Ala Phe Tyr
275 280 285
Ala Thr Gly Asp Ile Ile Gly Asn Ile Arg Gln Ala His Cys Asn Leu
290 295 300
Ser Lys Thr His Trp Asn Asn Thr Leu Lys Gln Ile Val Lys Lys Leu
305 310 315 320
Arg Glu Gln Phe Lys Asn Lys Thr Ile Val Phe Asn Gln Ser Thr Gly
325 330 335
Gly Asp Pro Glu Ile Val Gln His Thr Phe Asn Cys Arg Gly Glu Phe
340 345 350
Phe Tyr Cys Asn Ser Thr Pro Leu Phe Asn Ser Thr Trp Tyr Pro Asn
355 360 365
Ser Thr Leu Asp Glu Thr Asn Ser Thr Asp Asn Asn Glu Thr Ile Thr
370 375 380
Leu Gln Cys Arg Ile Arg Gln Ile Ile Asn Met Trp Gln Glu Val Gly
385 390 395 400
Lys Ala Met Tyr Ala Pro Pro Ile Arg Gly Gln Ile Thr Cys Thr Ser
405 410 415
Asn Ile Thr Gly Leu Ile Leu Thr Arg Asp Gly Gly Asp Asn Asn Glu
420 425 430
Thr Glu Ile Phe Arg Pro Gly Gly Gly Asn Met Lys Asp Asn Trp Arg
435 440 445
Ser Glu Leu Tyr Lys Tyr Lys Val Val Lys Ile Glu Pro Leu Gly Ile
450 455 460
Ala Pro Thr Lys Ala Lys Arg Arg Ala Val Gln Arg Glu Lys Arg
465 470 475
<210> 9
<211> 1676
<212> DNA
<213> Human immunodeficiency virus type 1
<400> 9
agaaagagca gaagacagtg gcaatgaaag tgaaggagac caggaagaat tatcaaagct 60
tgtggagagg gggcaccttg ttccttggaa tgttgatgat ctgtagtgtt acaggacaat 120
tgtgggttac agtctattat ggggtacctg tgtggaaaga ggcaaccacc actctatttt 180
gtgcatcaaa tgctaaagca tatgatacag aggtacataa tgtttgggcc acacatgcct 240
gtgtacccac agaccccaac gcacaagaag tagtattaga aaatgtgaca gaatattttg 300
acatgtggaa aaatgacatg gtagaacaaa tgcatgagga tgtaatcagt ttatgggatc 360
aaagcctaaa gccatgtgta gaattaaccc cactctgtgt tactttaaat tgcactgatg 420
tgaatattac taataccaat aatagtacca ttaacaatag tagtaataat accaatagta 480
gtgattggga acggatggag ccaggagaaa taaaaaactg ctctttcaat agcaccacaa 540
acatgagaga taggacgcag agagaatacg cactttttta taaacttgat atagaaccag 600
tagataataa aagtaataat aaaagtctta atgaaagtat tagtaaaagt attacttata 660
ggttaataag ttataacacc tcagtcatta aacaggcctg tccaaaagta tcttttgagc 720
caattcccat acattattgt gccccagctg gttttgcaat tctaaagtgt aataatgaga 780
cattcgatgg aaaaggagaa tgtagaaatg tcagcacagt acaatgtaca catggaatta 840
ggccaatagt gtcaactcaa ctgctgttaa atggcagtct agcagaaaag gacatagtaa 900
ttagatcaaa caatttctcg gacaatgcta aaaccataat agtacatctg aatgaatcta 960
taacaattaa gtgtataaga cccaacaata atacaagaaa aagtatacat atagcaccag 1020
gaagcgcatt ttatgcaaca ggagacataa taggagatat aaggcaagca cattgtaaca 1080
ttagtgcaaa aaattggatt aacactttaa aacagatagt tataaaacta aaaggaaaat 1140
ataatactag tacaaaaata gactttaagc catcctcagg aggggaccca gaaattgtaa 1200
tgcacagctt taattgtgga ggggagtttt tctactgtaa tacatcaaaa ctgtttaata 1260
atacttggaa ggagaataat actttagagt caaatgatac tatggagatc attaacgaaa 1320
ctattatact cccatgtaga ataaaacagt ttataaacat gtggcagaaa gtgggaaaag 1380
caatgtatgc ccctcccatc agaggacaaa ttaaatgtga atcaaatatt acagggctgc 1440
tattaacaag agatggtggt aatacaaata gcacgaacgg gaccgagacc ttcagacctg 1500
gaggaggaaa tatgaaagac aattggagaa gtgaattgta caaatataaa gtagtaaaaa 1560
ttgaaccaat aggaatagca cccaccaggg caaaaagaag agtggtgcag agagaaaaaa 1620
gagcagtggg aataggagct gtgttccttg ggttcttggg agcagcagga agcact 1676
<210> 10
<211> 506
<212> PRT
<213> Human immunodeficiency virus type 1
<400> 10
Ser Val Thr Gly Gln Leu Trp Val Thr Val Tyr Tyr Gly Val Pro Val
1 5 10 15
Trp Lys Glu Ala Thr Thr Thr Leu Phe Cys Ala Ser Asn Ala Lys Ala
20 25 30
Tyr Asp Thr Glu Val His Asn Val Trp Ala Thr His Ala Cys Val Pro
35 40 45
Thr Asp Pro Asn Ala Gln Glu Val Val Leu Glu Asn Val Thr Glu Tyr
50 55 60
Phe Asp Met Trp Lys Asn Asp Met Val Glu Gln Met His Glu Asp Val
65 70 75 80
Ile Ser Leu Trp Asp Gln Ser Leu Lys Pro Cys Val Glu Leu Thr Pro
85 90 95
Leu Cys Val Thr Leu Asn Cys Thr Asp Val Asn Ile Thr Asn Thr Asn
100 105 110
Asn Ser Thr Ile Asn Asn Ser Ser Asn Asn Thr Asn Ser Ser Asp Trp
115 120 125
Glu Arg Met Glu Pro Gly Glu Ile Lys Asn Cys Ser Phe Asn Ser Thr
130 135 140
Thr Asn Met Arg Asp Arg Thr Gln Arg Glu Tyr Ala Leu Phe Tyr Lys
145 150 155 160
Leu Asp Ile Glu Pro Val Asp Asn Lys Ser Asn Asn Lys Ser Leu Asn
165 170 175
Glu Ser Ile Ser Lys Ser Ile Thr Tyr Arg Leu Ile Ser Tyr Asn Thr
180 185 190
Ser Val Ile Lys Gln Ala Cys Pro Lys Val Ser Phe Glu Pro Ile Pro
195 200 205
Ile His Tyr Cys Ala Pro Ala Gly Phe Ala Ile Leu Lys Cys Asn Asn
210 215 220
Glu Thr Phe Asp Gly Lys Gly Glu Cys Arg Asn Val Ser Thr Val Gln
225 230 235 240
Cys Thr His Gly Ile Arg Pro Ile Val Ser Thr Gln Leu Leu Leu Asn
245 250 255
Gly Ser Leu Ala Glu Lys Asp Ile Val Ile Arg Ser Asn Asn Phe Ser
260 265 270
Asp Asn Ala Lys Thr Ile Ile Val His Leu Asn Glu Ser Ile Thr Ile
275 280 285
Lys Cys Ile Arg Pro Asn Asn Asn Thr Arg Lys Ser Ile His Ile Ala
290 295 300
Pro Gly Ser Ala Phe Tyr Ala Thr Gly Asp Ile Ile Gly Asp Ile Arg
305 310 315 320
Gln Ala His Cys Asn Ile Ser Ala Lys Asn Trp Ile Asn Thr Leu Lys
325 330 335
Gln Ile Val Ile Lys Leu Lys Gly Lys Tyr Asn Thr Ser Thr Lys Ile
340 345 350
Asp Phe Lys Pro Ser Ser Gly Gly Asp Pro Glu Ile Val Met His Ser
355 360 365
Phe Asn Cys Gly Gly Glu Phe Phe Tyr Cys Asn Thr Ser Lys Leu Phe
370 375 380
Asn Asn Thr Trp Lys Glu Asn Asn Thr Leu Glu Ser Asn Asp Thr Met
385 390 395 400
Glu Ile Ile Asn Glu Thr Ile Ile Leu Pro Cys Arg Ile Lys Gln Phe
405 410 415
Ile Asn Met Trp Gln Lys Val Gly Lys Ala Met Tyr Ala Pro Pro Ile
420 425 430
Arg Gly Gln Ile Lys Cys Glu Ser Asn Ile Thr Gly Leu Leu Leu Thr
435 440 445
Arg Asp Gly Gly Asn Thr Asn Ser Thr Asn Gly Thr Glu Thr Phe Arg
450 455 460
Pro Gly Gly Gly Asn Met Lys Asp Asn Trp Arg Ser Glu Leu Tyr Lys
465 470 475 480
Tyr Lys Val Val Lys Ile Glu Pro Ile Gly Ile Ala Pro Thr Arg Ala
485 490 495
Lys Arg Arg Val Val Gln Arg Glu Lys Arg
500 505
<210> 11
<211> 1646
<212> DNA
<213> Human immunodeficiency virus type 1
<400> 11
agaaagagca gaagacagtg gcaatgagag tgaaggggat caggaagagt tgcttgtgga 60
aatggggcac cttgttcctt ggaatgttga tgatctgtag tgctgtagaa caattgtggg 120
tcacagttta ttatggagta cctgtgtgga aagaagcaac caccactcta ttttgtgcat 180
cagatgctaa ggcatatatt ccagaggtac ataatgtatg ggccacacat gcctgtgtac 240
ccacagatcc caacccacaa gaagtagaat tgaaaaatgt gacagaggat tttaacatgt 300
ggaagaataa catggtagaa caaatgcatg aagatgtaat cagtttatgg gatcaaagcc 360
taaagccata tgtggaatta accccactct gtgttacgtt aaattgcact gattattggg 420
gggatactac tcgtgccgga aatactactg ctagtgtcac tagtactgct aatgtcacta 480
gtagtaaaga ggtacaaatg aaaaactgct ctttctatgt ctccacaaac atgatggata 540
agaaacagaa agaatacgca cttttttata aacttgatgt agtgccaata ggtaatgaga 600
ctaatggtaa ggagactaat aatagctata ggttaataag ttgtaacacc tcagtagtta 660
cccaagcctg tccaaaggta acctttgagc caattcccat acattattgt gccccggctg 720
gttttgtgat tctaaagtgc aaggataaga ggttcaatgg aacaggacca tgtacaaatg 780
tcagcacagt acaatgtaca catggaatta ggccagtagt atcaactcaa ctactgttaa 840
atggcagctt agcagaagaa gatatagtac ttagatctga aaatttctcg aacaatgcta 900
aaaacataat agtacagctg aatgaatctg tagtaattaa ttgtacaaga ctcaacaaca 960
atacaagaaa aagcatacat atggggccag ggaaagcatt ttatgcaaca ggagacacca 1020
taggagatat aagacaagca cattgtaaca ttagtgaaga ggcctggaat aaaactctaa 1080
gacgaatagc tataaaatta aaagaacaat ttaatataac agacaaagta atctttaaac 1140
cctcctcagg aggggacata gaaattgcaa tgcacagtgt taattgtgga ggggaatttt 1200
tctactgtaa tacaacacag ctgtttaata gtacttggaa tgaaacacag ctgaatagta 1260
gtactgtgaa taatattaca aggtcagaca acaacatcac actcccatgc aaaataaagc 1320
aaattgtaaa catgtggcag aaaataggaa aagcaatgta tgcccctccc atcagtggac 1380
taattagatg taaatcaaat attacaggga taatattagc aagagatggt ggtaataatg 1440
gcacaaatga tacgaggacc ttcagacctg taggaggaaa tatgaaggac aattggagaa 1500
gtgaattata taaatataaa gtagtaagaa ttgaaccatt aggagtagca cccaccaagg 1560
caaagagaag agtggtgcag agagaaaaaa gagcagtggg actaggagct atgttccttg 1620
ggttcttggg agcagcagga agcact 1646
<210> 12
<211> 498
<212> PRT
<213> Human immunodeficiency virus type 1
<400> 12
Ser Ala Val Glu Gln Leu Trp Val Thr Val Tyr Tyr Gly Val Pro Val
1 5 10 15
Trp Lys Glu Ala Thr Thr Thr Leu Phe Cys Ala Ser Asp Ala Lys Ala
20 25 30
Tyr Ile Pro Glu Val His Asn Val Trp Ala Thr His Ala Cys Val Pro
35 40 45
Thr Asp Pro Asn Pro Gln Glu Val Glu Leu Lys Asn Val Thr Glu Asp
50 55 60
Phe Asn Met Trp Lys Asn Asn Met Val Glu Gln Met His Glu Asp Val
65 70 75 80
Ile Ser Leu Trp Asp Gln Ser Leu Lys Pro Tyr Val Glu Leu Thr Pro
85 90 95
Leu Cys Val Thr Leu Asn Cys Thr Asp Tyr Trp Gly Asp Thr Thr Arg
100 105 110
Ala Gly Asn Thr Thr Ala Ser Val Thr Ser Thr Ala Asn Val Thr Ser
115 120 125
Ser Lys Glu Val Gln Met Lys Asn Cys Ser Phe Tyr Val Ser Thr Asn
130 135 140
Met Met Asp Lys Lys Gln Lys Glu Tyr Ala Leu Phe Tyr Lys Leu Asp
145 150 155 160
Val Val Pro Ile Gly Asn Glu Thr Asn Gly Lys Glu Thr Asn Asn Ser
165 170 175
Tyr Arg Leu Ile Ser Cys Asn Thr Ser Val Val Thr Gln Ala Cys Pro
180 185 190
Lys Val Thr Phe Glu Pro Ile Pro Ile His Tyr Cys Ala Pro Ala Gly
195 200 205
Phe Val Ile Leu Lys Cys Lys Asp Lys Arg Phe Asn Gly Thr Gly Pro
210 215 220
Cys Thr Asn Val Ser Thr Val Gln Cys Thr His Gly Ile Arg Pro Val
225 230 235 240
Val Ser Thr Gln Leu Leu Leu Asn Gly Ser Leu Ala Glu Glu Asp Ile
245 250 255
Val Leu Arg Ser Glu Asn Phe Ser Asn Asn Ala Lys Asn Ile Val
260 265 270
Gln Leu Asn Glu Ser Val Val Ile Asn Cys Thr Arg Leu Asn Asn Asn
275 280 285
Thr Arg Lys Ser Ile His Met Gly Pro Gly Lys Ala Phe Tyr Ala Thr
290 295 300
Gly Asp Thr Ile Gly Asp Ile Arg Gln Ala His Cys Asn Ile Ser Glu
305 310 315 320
Glu Ala Trp Asn Lys Thr Leu Arg Arg Ile Ala Ile Lys Leu Lys Glu
325 330 335
Gln Phe Asn Ile Thr Asp Lys Val Ile Phe Lys Pro Ser Ser Gly Gly
340 345 350
Asp Ile Glu Ile Ala Met His Ser Val Asn Cys Gly Gly Glu Phe Phe
355 360 365
Tyr Cys Asn Thr Thr Gln Leu Phe Asn Ser Thr Trp Asn Glu Thr Gln
370 375 380
Leu Asn Ser Ser Thr Val Asn Asn Ile Thr Arg Ser Asp Asn Asn Ile
385 390 395 400
Thr Leu Pro Cys Lys Ile Lys Gln Ile Val Asn Met Trp Gln Lys Ile
405 410 415
Gly Lys Ala Met Tyr Ala Pro Pro Ile Ser Gly Leu Ile Arg Cys Lys
420 425 430
Ser Asn Ile Thr Gly Ile Ile Leu Ala Arg Asp Gly Gly Asn Asn Gly
435 440 445
Thr Asn Asp Thr Arg Thr Phe Arg Pro Val Gly Gly Asn Met Lys Asp
450 455 460
Asn Trp Arg Ser Glu Leu Tyr Lys Tyr Lys Val Val Arg Ile Glu Pro
465 470 475 480
Leu Gly Val Ala Pro Thr Lys Ala Lys Arg Arg Val Val Gln Arg Glu
485 490 495
Lys arg
<210> 13
<211> 1586
<212> DNA
<213> Human immunodeficiency virus type 1
<400> 13
agaaagagca gaagacagtg gcaatgagag tgaaggggat caggaagaat tgtcagcgct 60
tgtggaaatg gggcaccatg ctccttggga tgttaatgat ctgtagggct gcagagcaat 120
tgtgggtcac agtctattat ggagtacctg tgtggagaga agcaaacacc actctatttt 180
gtgcatcaga tgctaaagca tatgatacag aggtacataa tgtttgggcc acacatgcct 240
gtgtacccac agaccctaac ccacaagaag tagtattgga aaatgtgaca gaaaatttta 300
acatgtggaa aaataacatg gtagaacaga tgcatgagga tataatcagt ctatgggatc 360
aaagcctaaa accatgtgta aaattaaccc cactctgtgt tactttaaat tgtaatacca 420
ttaatgccac taaagatatg ataggagaat taaaaaactg ctctttcaac atcaccacaa 480
gcataagaga taagtggcaa aaagaatatg cactttttta taaacttgat gtagtgccaa 540
tagatgataa tggtaatgat actggtaatg gtagctatag gctaataagt tgtaatacct 600
cagtcattac acaggcctgt ccaaagacat cctttgagcc aattcccata cattattgtg 660
ccccggctgg ttttgcgatt ctaaagtgta acaataaaaa gttcaatgga acaggaccac 720
gtaaaaatgt cagcacagta caatgtacac atggaattag gccagtagta tctactcaac 780
tgttgttaaa tggcagtcta gcagaagaag agatagtact tagatctgaa aatttctcaa 840
acaatgctaa aaccataata gtacaattga atgaatctat agtaattaat tgtacaagac 900
ccaacaacaa tacgagaaaa agtatacata taggaccagg gagagcattt tatgcagcag 960
gagaaataat aggagatata agaacagcac attgtaacat tagtggaaca aaatggaata 1020
acactttaaa acagatagtt gtaaaattaa gagaacaatt tggaaataaa acaatggtct 1080
ttagtcactc ctcaggaggg gacccggaaa ttgtaaggca cagttttaat tgtggagggg 1140
aatttttcca ttgcaataca acacaactgt ttaatagtag ttggccttgg aatggtactg 1200
aagggtcaaa taacactgaa ggaaatgaca caatcaccct cccatgcaga ataaaacaaa 1260
ttataaacat gtggcaggaa gtaggaaaag caatgtatgc ccctcccatc agaggggtaa 1320
ttaaatgttc atcaaatatt acagggctat tattaacaag agatgggggt actaacagga 1380
ccgacaatgg gagcgaggtc ttcagacctg ggggaggaga tatgagggac aattggagta 1440
gtgaattata taaaaataaa gtagtaagaa ttgaaccatt aggagtagca cccaccaagg 1500
caaagagaag agtggtgcag agagaaaaaa gagcagtggg actaggagct atgttccttg 1560
ggttcttggg agcagcagga agcact 1586
<210> 14
<211> 476
<212> PRT
<213> Human immunodeficiency virus type 1
<400> 14
Arg Ala Ala Glu Gln Leu Trp Val Thr Val Tyr Tyr Gly Val Pro Val
1 5 10 15
Trp Arg Glu Ala Asn Thr Thr Leu Phe Cys Ala Ser Asp Ala Lys Ala
20 25 30
Tyr Asp Thr Glu Val His Asn Val Trp Ala Thr His Ala Cys Val Pro
35 40 45
Thr Asp Pro Asn Pro Gln Glu Val Val Leu Glu Asn Val Thr Glu Asn
50 55 60
Phe Asn Met Trp Lys Asn Asn Met Val Glu Gln Met His Glu Asp Ile
65 70 75 80
Ile Ser Leu Trp Asp Gln Ser Leu Lys Pro Cys Val Lys Leu Thr Pro
85 90 95
Leu Cys Val Thr Leu Asn Cys Asn Thr Ile Asn Ala Thr Lys Asp Met
100 105 110
Ile Gly Glu Leu Lys Asn Cys Ser Phe Asn Ile Thr Thr Ser Ile Arg
115 120 125
Asp Lys Trp Gln Lys Glu Tyr Ala Leu Phe Tyr Lys Leu Asp Val Val
130 135 140
Pro Ile Asp Asp Asn Gly Asn Asp Thr Gly Asn Gly Ser Tyr Arg Leu
145 150 155 160
Ile Ser Cys Asn Thr Ser Val Ile Thr Gln Ala Cys Pro Lys Thr Ser
165 170 175
Phe Glu Pro Ile Pro Ile His Tyr Cys Ala Pro Ala Gly Phe Ala Ile
180 185 190
Leu Lys Cys Asn Asn Lys Lys Phe Asn Gly Thr Gly Pro Arg Lys Asn
195 200 205
Val Ser Thr Val Gln Cys Thr His Gly Ile Arg Pro Val Val Ser Thr
210 215 220
Gln Leu Leu Leu Asn Gly Ser Leu Ala Glu Glu Glu Ile Val Leu Arg
225 230 235 240
Ser Glu Asn Phe Ser Asn Asn Ala Lys Thr Ile Ile Val Gln Leu Asn
245 250 255
Glu Ser Ile Val Ile Asn Cys Thr Arg Pro Asn Asn Asn Thr Arg Lys
260 265 270
Ser Ile His Ile Gly Pro Gly Arg Ala Phe Tyr Ala Ala Gly Glu Ile
275 280 285
Ile Gly Asp Ile Arg Thr Ala His Cys Asn Ile Ser Gly Thr Lys Trp
290 295 300
Asn Asn Thr Leu Lys Gln Ile Val Val Lys Leu Arg Glu Gln Phe Gly
305 310 315 320
Asn Lys Thr Met Val Phe Ser His Ser Ser Gly Gly Asp Pro Glu Ile
325 330 335
Val Arg His Ser Phe Asn Cys Gly Gly Glu Phe Phe His Cys Asn Thr
340 345 350
Thr Gln Leu Phe Asn Ser Ser Trp Pro Trp Asn Gly Thr Glu Gly Ser
355 360 365
Asn Asn Thr Glu Gly Asn Asp Thr Ile Thr Leu Pro Cys Arg Ile Lys
370 375 380
Gln Ile Ile Asn Met Trp Gln Glu Val Gly Lys Ala Met Tyr Ala Pro
385 390 395 400
Pro Ile Arg Gly Val Ile Lys Cys Ser Ser Asn Ile Thr Gly Leu Leu
405 410 415
Leu Thr Arg Asp Gly Gly Thr Asn Arg Thr Asp Asn Gly Ser Glu Val
420 425 430
Phe Arg Pro Gly Gly Gly Asp Met Arg Asp Asn Trp Ser Ser Glu Leu
435 440 445
Tyr Lys Asn Lys Val Val Arg Ile Glu Pro Leu Gly Val Ala Pro Thr
450 455 460
Lys Ala Lys Arg Arg Val Val Gln Arg Glu Lys Arg
465 470 475
<210> 15
<211> 1618
<212> DNA
<213> Human immunodeficiency virus type 1
<400> 15
agaaagagcg aagacagtgg caatgagagt gagggggatc atgaggaatt atcagtactt 60
atggaaatgg ggcaccatgc tcctggggat attgatgatc tgtaatgcta gtgaaaaatt 120
gtgggtcaca gtctattatg gggtacctgt gtggaaagag gcaaacacca ctctattttg 180
tgcatcagat gccaaagctt atgatacaga agtacataat gtttgggcca cacatgcctg 240
tgtacccaca gacccccgcc ctcaagaagt actattggga aatgtgacag aaaattttaa 300
catgtggaaa aataacatgg tagaacaaat gcatgaggat ataatcagtt tatgggatca 360
aagcctaaag ccatgtgtaa aattaacccc actctgtgtt actttaaatt gcactaactt 420
gaatgatact aatatcagta gtagtaatgt tagtacccat aatagtagtg gcataggaga 480
aatgaaaaat tgctctttca atgttaccac aagtataaga gataagatga agaaagaata 540
tgcacttttt tatagacttg atatagttcc aatagataat agtaacacca gttatatgtt 600
aataagttgt aatacctcag tcattacaca ggcctgtcca aaggtatcct ttgaaccaat 660
tcccatacat tattgtgccc cggctggttt tgcgattcta aagtgtaatg ataagaagtt 720
caatggaaca ggaccatgta agaatgtcag cacagtacaa tgtacacatg gaattaggcc 780
agtagtatca actcaactgc tgttaaatgg cagtttagca gaagaagaga tagtaattag 840
atctgaaaat ttcacagaca atactaaaac cataatagtg catctgaacg aatctataca 900
aattaattgt acaagaccca acaacaatac aagaaaaagc atacatatag gaccaggaag 960
agcattttat gcaacaggag aaataatagg agatataaga caagcacatt gtaaccttag 1020
tagagcaaaa tggaataaca cgttaaaaca gatagttaaa aaattaagag tacaatttga 1080
aaataaaaca atagtcttta atcaatcttc aggaggggac ccagaaattg taatgcacag 1140
ctttaattgt ggaggggaat ttttctactg taatacaaca gcactgttta atagtacttg 1200
gaatagtaat agtactgaat ggtcaaatga cactgaaagc aatgacacag tgattacact 1260
cccatgcaga ataaaacaaa tagtaaacat gtggcaggaa gttggaaaag caatgtatgc 1320
ccctcccatc aagggacaaa ttaagtggat atcaaatatt acagggatac tattaacaag 1380
agatggggga agagatgagg ttaatagcac gaacgagaac aagaccgaga tcttcagacc 1440
tgcaggagga aatatgaagg acaattggag aagtgaatta tataaatata aagtagtaaa 1500
aattgaacca ttaggaatag cacccactag ggcaagaaga agagtggtgc agagagaaaa 1560
aagagcagtg acactaggag ctttgttcct tgggttcttg ggagcagcag gaagcact 1618
<210> 16
<211> 487
<212> PRT
<213> Human immunodeficiency virus type 1
<400> 16
Asn Ala Ser Glu Lys Leu Trp Val Thr Val Tyr Tyr Gly Val Pro Val
1 5 10 15
Trp Lys Glu Ala Asn Thr Thr Leu Phe Cys Ala Ser Asp Ala Lys Ala
20 25 30
Tyr Asp Thr Glu Val His Asn Val Trp Ala Thr His Ala Cys Val Pro
35 40 45
Thr Asp Pro Arg Pro Gln Glu Val Leu Leu Gly Asn Val Thr Glu Asn
50 55 60
Phe Asn Met Trp Lys Asn Asn Met Val Glu Gln Met His Glu Asp Ile
65 70 75 80
Ile Ser Leu Trp Asp Gln Ser Leu Lys Pro Cys Val Lys Leu Thr Pro
85 90 95
Leu Cys Val Thr Leu Asn Cys Thr Asn Leu Asn Asp Thr Asn Ile Ser
100 105 110
Ser Ser Asn Val Ser Thr His Asn Ser Ser Gly Ile Gly Glu Met Lys
115 120 125
Asn Cys Ser Phe Asn Val Thr Thr Ser Ile Arg Asp Lys Met Lys Lys
130 135 140
Glu Tyr Ala Leu Phe Tyr Arg Leu Asp Ile Val Pro Ile Asp Asn Ser
145 150 155 160
Asn Thr Ser Tyr Met Leu Ile Ser Cys Asn Thr Ser Val Ile Thr Gln
165 170 175
Ala Cys Pro Lys Val Ser Phe Glu Pro Ile Pro Ile His Tyr Cys Ala
180 185 190
Pro Ala Gly Phe Ala Ile Leu Lys Cys Asn Asp Lys Lys Phe Asn Gly
195 200 205
Thr Gly Pro Cys Lys Asn Val Ser Thr Val Gln Cys Thr His Gly Ile
210 215 220
Arg Pro Val Val Ser Thr Gln Leu Leu Leu Asn Gly Ser Leu Ala Glu
225 230 235 240
Glu Glu Ile Val Ile Arg Ser Glu Asn Phe Thr Asp Asn Thr Lys Thr
245 250 255
Ile Ile Val His Leu Asn Glu Ser Ile Gln Ile Asn Cys Thr Arg Pro
260 265 270
Asn Asn Asn Thr Arg Lys Ser Ile His Ile Gly Pro Gly Arg Ala Phe
275 280 285
Tyr Ala Thr Gly Glu Ile Ile Gly Asp Ile Arg Gln Ala His Cys Asn
290 295 300
Leu Ser Arg Ala Lys Trp Asn Asn Thr Leu Lys Gln Ile Val Lys Lys
305 310 315 320
Leu Arg Val Gln Phe Glu Asn Lys Thr Ile Val Phe Asn Gln Ser Ser
325 330 335
Gly Gly Asp Pro Glu Ile Val Met His Ser Phe Asn Cys Gly Gly Glu
340 345 350
Phe Phe Tyr Cys Asn Thr Thr Ala Leu Phe Asn Ser Thr Trp Asn Ser
355 360 365
Asn Ser Thr Glu Trp Ser Asn Asp Thr Glu Ser Asn Asp Thr Val Ile
370 375 380
Thr Leu Pro Cys Arg Ile Lys Gln Ile Val Asn Met Trp Gln Glu Val
385 390 395 400
Gly Lys Ala Met Tyr Ala Pro Pro Ile Lys Gly Gln Ile Lys Trp Ile
405 410 415
Ser Asn Ile Thr Gly Ile Leu Leu Thr Arg Asp Gly Gly Arg Asp Glu
420 425 430
Val Asn Ser Thr Asn Glu Asn Lys Thr Glu Ile Phe Arg Pro Ala Gly
435 440 445
Gly Asn Met Lys Asp Asn Trp Arg Ser Glu Leu Tyr Lys Tyr Lys Val
450 455 460
Val Lys Ile Glu Pro Leu Gly Ile Ala Pro Thr Arg Ala Arg Arg Arg
465 470 475 480
Val Val Gln Arg Glu Lys Arg
485
<210> 17
<211> 1625
<212> DNA
<213> Human immunodeficiency virus type 1
<400> 17
agaaagagca gaagacagtg gcaatgagag tgaaggagat caagaagaat tgtcagcgct 60
tgtggagatg gggcatcatg ctccttggga tattgatgat ctgtagtgct acagaaaaat 120
tgtgggtcac agtctattat ggggtacctg tgtggaagga agcaaacacc actctatttt 180
gtgcatctga tgctaaagca tatgatacag aggtacataa tgtttgggca acacatgcct 240
gtgtacccac agaccccaac ccacaagaag taagattaaa aaatgtgaca gaaaatttta 300
acatgtggag gaataacatg gtagaacaga tgcaggagga tataatcagt ttgtgggatc 360
aaagcctaaa gccatgtgta acattaactc cactatgtgt tactttaaat tgcactgatt 420
attggggcaa tgttactggg accaatacta ctagtaaccc tactggtact ggtgtgggtg 480
gtaccactaa caatggcgcg gaagtgatga agtgctcttt taatgtcacc acaagtgtaa 540
gagataaggt acaaaaagaa tctgctcttt tttatagact tgatgtagta aaaatagatg 600
agaaaacaaa tacaaccaat tataggttga taagttgtaa cacctcagtc attaaacagg 660
cccgtccaaa ggtaaacttt gagccaattc ccatacatta ttgtgccccg gctggttttg 720
cgattctaaa gtgtaatgat aagaagttca atggaacagg atcatgtaaa aatgtcagca 780
cagtacaatg tacacatgga attaagccag tagtatccac tcacttgctg ttaaatggca 840
gtctagcaga agatgagata gtaattagat ctgaaaattt cacgaacaat gctaaaacca 900
taatagtaca gctgaataat tatgtaaaaa ttaattgtat aagacccaat aataatacaa 960
gaaaaagtat atcactcgga ccaggaagag cattttatac aacaggagac ataataggaa 1020
atataagaca agcacattgc aaccttagtg gtacagaatg gaataacact ttaaaacagg 1080
tagctaacaa attaagagaa caatttaaca aaacaataat aaaatttaag caaccccccc 1140
cgggagggga cctagaaatc acaatgctca cttttaattg tggaggagaa tttttttact 1200
gtaattcatc agcactgttt aatagtactt tgacttggga tagtaaggca tgggcaaata 1260
cacttgaaga aaatatcaca ctcccatgca gaataaaaca aattgtaaac aagtggcagg 1320
aagtaggaaa agcaatatat gcccctccca tcagtggaca gattaattgt acatcaaata 1380
ttacagggat actattaaca agagatggtg gtaataacaa cgacactaac aacactgagg 1440
tcttcagacc tggaggagga gatatgaggg acaattggag aagtgagtta tataaatata 1500
aagtagtaaa aattgaacca ttaggaatag cacccaccag ggcaaagaga agagtggtgc 1560
agagagaaaa aagagcagca ataggagcta tgttccttgg gttcttggga gcagcaggaa 1620
gcact 1625
<210> 18
<211> 490
<212> PRT
<213> Human immunodeficiency virus type 1
<400> 18
Ser Ala Thr Glu Lys Leu Trp Val Thr Val Tyr Tyr Gly Val Pro Val
1 5 10 15
Trp Lys Glu Ala Asn Thr Thr Leu Phe Cys Ala Ser Asp Ala Lys Ala
20 25 30
Tyr Asp Thr Glu Val His Asn Val Trp Ala Thr His Ala Cys Val Pro
35 40 45
Thr Asp Pro Asn Pro Gln Glu Val Arg Leu Lys Asn Val Thr Glu Asn
50 55 60
Phe Asn Met Trp Arg Asn Asn Met Val Glu Gln Met Gln Glu Asp Ile
65 70 75 80
Ile Ser Leu Trp Asp Gln Ser Leu Lys Pro Cys Val Thr Leu Thr Pro
85 90 95
Leu Cys Val Thr Leu Asn Cys Thr Asp Tyr Trp Gly Asn Val Thr Gly
100 105 110
Thr Asn Thr Thr Ser Asn Pro Thr Gly Thr Gly Val Gly Gly Thr Thr
115 120 125
Asn Asn Gly Ala Glu Val Met Lys Cys Ser Phe Asn Val Thr Thr Ser
130 135 140
Val Arg Asp Lys Val Gln Lys Glu Ser Ala Leu Phe Tyr Arg Leu Asp
145 150 155 160
Val Val Lys Ile Asp Glu Lys Thr Asn Thr Thr Asn Tyr Arg Leu Ile
165 170 175
Ser Cys Asn Thr Ser Val Ile Lys Gln Ala Arg Pro Lys Val Asn Phe
180 185 190
Glu Pro Ile Pro Ile His Tyr Cys Ala Pro Ala Gly Phe Ala Ile Leu
195 200 205
Lys Cys Asn Asp Lys Lys Phe Asn Gly Thr Gly Ser Cys Lys Asn Val
210 215 220
Ser Thr Val Gln Cys Thr His Gly Ile Lys Pro Val Val Ser Thr His
225 230 235 240
Leu Leu Leu Asn Gly Ser Leu Ala Glu Asp Glu Ile Val Ile Arg Ser
245 250 255
Glu Asn Phe Thr Asn Asn Ala Lys Thr Ile Ile Val Gln Leu Asn Asn
260 265 270
Tyr Val Lys Ile Asn Cys Ile Arg Pro Asn Asn Asn Thr Arg Lys Ser
275 280 285
Ile Ser Leu Gly Pro Gly Arg Ala Phe Tyr Thr Thr Gly Asp Ile Ile
290 295 300
Gly Asn Ile Arg Gln Ala His Cys Asn Leu Ser Gly Thr Glu Trp Asn
305 310 315 320
Asn Thr Leu Lys Gln Val Ala Asn Lys Leu Arg Glu Gln Phe Asn Lys
325 330 335
Thr Ile Ile Lys Phe Lys Gln Pro Pro Pro Gly Gly Asp Leu Glu Ile
340 345 350
Thr Met Leu Thr Phe Asn Cys Gly Gly Glu Phe Phe Tyr Cys Asn Ser
355 360 365
Ser Ala Leu Phe Asn Ser Thr Leu Thr Trp Asp Ser Lys Ala Trp Ala
370 375 380
Asn Thr Leu Glu Glu Asn Ile Thr Leu Pro Cys Arg Ile Lys Gln Ile
385 390 395 400
Val Asn Lys Trp Gln Glu Val Gly Lys Ala Ile Tyr Ala Pro Pro Ile
405 410 415
Ser Gly Gln Ile Asn Cys Thr Ser Asn Ile Thr Gly Ile Leu Leu Thr
420 425 430
Arg Asp Gly Gly Asn Asn Asn Asp Thr Asn Asn Thr Glu Val Phe Arg
435 440 445
Pro Gly Gly Gly Asp Met Arg Asp Asn Trp Arg Ser Glu Leu Tyr Lys
450 455 460
Tyr Lys Val Val Lys Ile Glu Pro Leu Gly Ile Ala Pro Thr Arg Ala
465 470 475 480
Lys Arg Arg Val Val Gln Arg Glu Lys Arg
485 490
<210> 19
<211> 1643
<212> DNA
<213> Human immunodeficiency virus type 1
<400> 19
agaaagagca gaagacagtg gcaatgagag tgatggagat caggaagagt tatcagaact 60
tatggagagg gggcaccttg ctccttggga tgttaatgat gatctgtagt gctgcagaag 120
aatcgtgggt cacagtatat tatggggtac ctgtgtggaa agaagcaacc accactctat 180
tttgtgcatc agatgctaaa ggctatgata cagaaagaca taatgtttgg gccacacatg 240
cctgtgtacc cacagacccc aacccacaag aaattgaatt ggtaaatgtg acagaatatt 300
ttaacatggg aaaaaataac atggtagaac agatgcatga ggatataatc agtttatggg 360
atgaaagcct aaagccatgt gtaaaattaa ccccactctg tgttactcta aattgcacta 420
atttgaatat tactaatacc actggtatta ctaatagtag cctggaagaa atgaggagaa 480
taatgacaaa ctgttctttc aaggtcacca caaatataag agataaggtg cagaagcaat 540
atgcactgtt gtataaactt gatgtagtac aaatagatga tgagagtacc acaggtaata 600
ggagtaacag cgcctacagg ttgataagtt gtaacacctc agtcattaca caggcccgtc 660
caaaggtatc ctttgagcca attcccatac acttttgtgc cccggctggt tttgcgattc 720
taaaatgtaa ggataagaag ttcaatggaa caggactatg taaaaatgtc agcacagtac 780
aatgtacaca tggaattagg ccagtagtat caactcagct gctgttaaat ggcagtctag 840
cagaagaaga ggtagtaatt agatctgtaa atttcacaaa caatgctaaa actataatag 900
tacagctgaa caaatctata gaaattaatt gtacaagacc caacaacaat acaagaagag 960
gtataaatat aggacccggg agagcatttt acacaataaa ggacataaca ggagatataa 1020
gacaagcaca ttgtaacatt agtgcatcag actggaataa tactgtaaca caggtagttg 1080
caaaattaaa agagcaattt gggaataaaa caatagtctt taatcaatcc tcaggaggag 1140
acccagaaat tataatgcac acttttaatt gtggagggga atttttctac tgtaagacaa 1200
cacaactgtt taatagtact tggcctaata atggtacttg gcctaatagt aattggactg 1260
ataataatag aacttggaac ggtgctaaag gaactatcac actcccatgc agaataaaac 1320
aaattgtaaa catgtggcag gaagtaggaa aagcaatgta tgcccctccc atcgaaggga 1380
aaataaaatg tacatcaaat cttacaggat tgctattaac aagagatggt ggtaatgtga 1440
atggcaccac catcgagacc ttcagacctg gaggaggaga tatgagggac aattggagaa 1500
gtgaattata taaatataaa gtagtacaaa ttgaaccatt aggactggca cctaccaagg 1560
caaagagaag agtggtgcag agagaaaaaa ggggagtaat aggagctatg ttccttgggt 1620
tcttgggagc agcaggaagc act 1643
<210> 20
<211> 495
<212> PRT
<213> Human immunodeficiency virus type 1
<400> 20
Ser Ala Ala Glu Glu Ser Trp Val Thr Val Tyr Tyr Gly Val Pro Val
1 5 10 15
Trp Lys Glu Ala Thr Thr Thr Leu Phe Cys Ala Ser Asp Ala Lys Gly
20 25 30
Tyr Asp Thr Glu Arg His Asn Val Trp Ala Thr His Ala Cys Val Pro
35 40 45
Thr Asp Pro Asn Pro Gln Glu Ile Glu Leu Val Asn Val Thr Glu Tyr
50 55 60
Phe Asn Met Gly Lys Asn Asn Met Val Glu Gln Met His Glu Asp Ile
65 70 75 80
Ile Ser Leu Trp Asp Glu Ser Leu Lys Pro Cys Val Lys Leu Thr Pro
85 90 95
Leu Cys Val Thr Leu Asn Cys Thr Asn Leu Asn Ile Thr Asn Thr Thr
100 105 110
Gly Ile Thr Asn Ser Ser Leu Glu Glu Met Arg Arg Ile Met Thr Asn
115 120 125
Cys Ser Phe Lys Val Thr Thr Asn Ile Arg Asp Lys Val Gln Lys Gln
130 135 140
Tyr Ala Leu Leu Tyr Lys Leu Asp Val Val Gln Ile Asp Asp Glu Ser
145 150 155 160
Thr Thr Gly Asn Arg Ser Asn Ser Ala Tyr Arg Leu Ile Ser Cys Asn
165 170 175
Thr Ser Val Ile Thr Gln Ala Arg Pro Lys Val Ser Phe Glu Pro Ile
180 185 190
Pro Ile His Phe Cys Ala Pro Ala Gly Phe Ala Ile Leu Lys Cys Lys
195 200 205
Asp Lys Lys Phe Asn Gly Thr Gly Leu Cys Lys Asn Val Ser Thr Val
210 215 220
Gln Cys Thr His Gly Ile Arg Pro Val Val Ser Thr Gln Leu Leu Leu
225 230 235 240
Asn Gly Ser Leu Ala Glu Glu Glu Val Val Ile Arg Ser Val Asn Phe
245 250 255
Thr Asn Asn Ala Lys Thr Ile Ile Val Gln Leu Asn Lys Ser Ile Glu
260 265 270
Ile Asn Cys Thr Arg Pro Asn Asn Asn Thr Arg Arg Gly Ile Asn Ile
275 280 285
Gly Pro Gly Arg Ala Phe Tyr Thr Ile Lys Asp Ile Thr Gly Asp Ile
290 295 300
Arg Gln Ala His Cys Asn Ile Ser Ala Ser Asp Trp Asn Asn Thr Val
305 310 315 320
Thr Gln Val Val Ala Lys Leu Lys Glu Gln Phe Gly Asn Lys Thr Ile
325 330 335
Val Phe Asn Gln Ser Ser Gly Gly Asp Pro Glu Ile Ile Met His Thr
340 345 350
Phe Asn Cys Gly Gly Glu Phe Phe Tyr Cys Lys Thr Thr Gln Leu Phe
355 360 365
Asn Ser Thr Trp Pro Asn Asn Gly Thr Trp Pro Asn Ser Asn Trp Thr
370 375 380
Asp Asn Asn Arg Thr Trp Asn Gly Ala Lys Gly Thr Ile Thr Leu Pro
385 390 395 400
Cys Arg Ile Lys Gln Ile Val Asn Met Trp Gln Glu Val Gly Lys Ala
405 410 415
Met Tyr Ala Pro Pro Ile Glu Gly Lys Ile Lys Cys Thr Ser Asn Leu
420 425 430
Thr Gly Leu Leu Leu Thr Arg Asp Gly Gly Asn Val Asn Gly Thr Thr
435 440 445
Ile Glu Thr Phe Arg Pro Gly Gly Gly Asp Met Arg Asp Asn Trp Arg
450 455 460
Ser Glu Leu Tyr Lys Tyr Lys Val Val Gln Ile Glu Pro Leu Gly Leu
465 470 475 480
Ala Pro Thr Lys Ala Lys Arg Arg Val Val Gln Arg Glu Lys Arg
485 490 495
<210> 21
<211> 1626
<212> DNA
<213> Human immunodeficiency virus type 1
<400> 21
agaaagagca gaagacagtg gcaatgagag tgatggagat caggaggaat tatcagcgct 60
cgtggagatg gggcaccatg ctccttggga tgttgatgat ttatagtgct gcaggagagt 120
tatgggtcac agtttattat ggggtaccgg tgtggaaaga agcaaccact actttattct 180
gtgcatcaga tgctaaagca tatgacacag aggtacataa tgtttgggca acacatgcct 240
gtgtacccac agaccctaat ccacaagaag tattattgga aaatgtgaca gaaaatttta 300
acatgtggaa aaataacatg gtagaacaga tgcatgagga tataatcagt ctatgggatc 360
aaagcctaaa gccacgtgta aaattaaccc cactctgtgt tactttaaac tgtactaatt 420
tgagaaatgt tactaatttg aaaaatgtta ctaataacag taatattagt ggtactaata 480
acaatactag tagtgggggg ctgaagggag gagaaatgaa aaattgctct ttctatatca 540
ccacacacag aaaggataag gtgaagaaag aatatgcact tttttataac cttgatatag 600
tatcaacaga tgatgataat acaagctata tattgagaag ttgtaacacc tcagtcatta 660
cccaggcctg tccaaaggta acctttgaac caattcccat acattattgt accccagctg 720
gttttgcgat tctgaagtgt aacgataaga agttcaatgg aacaggacca tgtagaaatg 780
tcagtacagt acaatgtaca catggaatca agccagtagt gtcaacccaa ctgttgttaa 840
atggcagtct agcagaagaa gaggtagtaa ttagatctga aaatttcacg gacaatgtta 900
aaaccataat agtacagctg aatgaatctg taataattaa ttgtacaaga cccagcaaca 960
atacaagaaa aagtatacgt tttggaccag gggcggcatt ttatacaaca ggagacataa 1020
taggagatat aagacaagca cattgtaaca tcagtagagc agaatggaat aacactttaa 1080
aacaaatagt taaaaaatta caagaacaat ttgtgaataa aacaatagtc tttaatcaat 1140
ctgcaggagg ggacccagaa attgtaaggc acagtgtaaa ttgtggaggg gaatttttct 1200
actgcgatac aacacaactg tttaatagta cttggaatag tactggagag tcaaataaca 1260
ctcaagaaaa tgacctaatc acactcccat gcagaataaa acaaattata aacagatggc 1320
aggaaatagg aaaagcaatg tatgcccctc ccatccaagg acaaattagc tgtacatcaa 1380
atattacagg gctgctacta acaagagatg gtggtaataa taataacagc acagagacct 1440
tcagacctgg aggaggaaat atgaaggaca attggagaag tgaattatat aaatataaag 1500
tagtaaaaat tcagccatta ggggtagcac ccaccaaggc aaagagaaga gtggtgcaga 1560
gggaaaagag cagtgggagc actaggagct atgttccttg ggttcttggg agcagcagga 1620
agcact 1626
<210> 22
<211> 489
<212> PRT
<213> Human immunodeficiency virus type 1
<400> 22
Ser Ala Ala Gly Glu Leu Trp Val Thr Val Tyr Tyr Gly Val Pro Val
1 5 10 15
Trp Lys Glu Ala Thr Thr Thr Leu Phe Cys Ala Ser Asp Ala Lys Ala
20 25 30
Tyr Asp Thr Glu Val His Asn Val Trp Ala Thr His Ala Cys Val Pro
35 40 45
Thr Asp Pro Asn Pro Gln Glu Val Leu Leu Glu Asn Val Thr Glu Asn
50 55 60
Phe Asn Met Trp Lys Asn Asn Met Val Glu Gln Met His Glu Asp Ile
65 70 75 80
Ile Ser Leu Trp Asp Gln Ser Leu Lys Pro Arg Val Lys Leu Thr Pro
85 90 95
Leu Cys Val Thr Leu Asn Cys Thr Asn Leu Arg Asn Val Thr Asn Leu
100 105 110
Lys Asn Val Thr Asn Asn Ser Asn Ile Ser Gly Thr Asn Asn Asn Thr
115 120 125
Ser Ser Gly Gly Leu Lys Gly Gly Glu Met Lys Asn Cys Ser Phe Tyr
130 135 140
Ile Thr Thr His Arg Lys Asp Lys Val Lys Lys Glu Tyr Ala Leu Phe
145 150 155 160
Tyr Asn Leu Asp Ile Val Ser Thr Asp Asp Asp Asn Thr Ser Tyr Ile
165 170 175
Leu Arg Ser Cys Asn Thr Ser Val Ile Thr Gln Ala Cys Pro Lys Val
180 185 190
Thr Phe Glu Pro Ile Pro Ile His Tyr Cys Thr Pro Ala Gly Phe Ala
195 200 205
Ile Leu Lys Cys Asn Asp Lys Lys Phe Asn Gly Thr Gly Pro Cys Arg
210 215 220
Asn Val Ser Thr Val Gln Cys Thr His Gly Ile Lys Pro Val Val Ser
225 230 235 240
Thr Gln Leu Leu Leu Asn Gly Ser Leu Ala Glu Glu Glu Val Val Ile
245 250 255
Arg Ser Glu Asn Phe Thr Asp Asn Val Lys Thr Ile Ile Val Gln Leu
260 265 270
Asn Glu Ser Val Ile Ile Asn Cys Thr Arg Pro Ser Asn Asn Thr Arg
275 280 285
Lys Ser Ile Arg Phe Gly Pro Gly Ala Ala Phe Tyr Thr Thr Gly Asp
290 295 300
Ile Ile Gly Asp Ile Arg Gln Ala His Cys Asn Ile Ser Arg Ala Glu
305 310 315 320
Trp Asn Asn Thr Leu Lys Gln Ile Val Lys Lys Leu Gln Glu Gln Phe
325 330 335
Val Asn Lys Thr Ile Val Phe Asn Gln Ser Ala Gly Gly Asp Pro Glu
340 345 350
Ile Val Arg His Ser Val Asn Cys Gly Gly Glu Phe Phe Tyr Cys Asp
355 360 365
Thr Thr Gln Leu Phe Asn Ser Thr Trp Asn Ser Thr Gly Glu Ser Asn
370 375 380
Asn Thr Gln Glu Asn Asp Leu Ile Thr Leu Pro Cys Arg Ile Lys Gln
385 390 395 400
Ile Ile Asn Arg Trp Gln Glu Ile Gly Lys Ala Met Tyr Ala Pro Pro
405 410 415
Ile Gln Gly Gln Ile Ser Cys Thr Ser Asn Ile Thr Gly Leu Leu Leu
420 425 430
Thr Arg Asp Gly Gly Asn Asn Asn Asn Ser Thr Glu Thr Phe Arg Pro
435 440 445
Gly Gly Gly Asn Met Lys Asp Asn Trp Arg Ser Glu Leu Tyr Lys Tyr
450 455 460
Lys Val Val Lys Ile Gln Pro Leu Gly Val Ala Pro Thr Lys Ala Lys
465 470 475 480
Arg Arg Val Val Gln Arg Glu Lys Ser
485
<210> 23
<211> 1562
<212> DNA
<213> Human immunodeficiency virus type 1
<400> 23
agaaagagca gaagacagtg gcaatgagag tgacggggat gaggaacaat tatccgcact 60
tatggaaaga ggtcaccttg ctccttggaa tattgatgat atgtagtgct acagaaaatt 120
tgtgggtcac agtctattat ggggtacctg tgtggaaaga agcaaccacc actctattct 180
gtgcatcgga tgctaaggca tatgatacag aggcacataa tgtttgggcc acacatgcct 240
gtgtacccac agaccccaac ccacaagaaa tgagattgga aaatgtgaca gaaaatttta 300
acatgtggaa aaataacatg gtagaacaga tgcaggatga tataatcagt ttatgggatc 360
aaagcctaaa gccatgtgta aaattaaccc cactctgtgt tactttaaat tgcactaata 420
ccacaaatgc taatagtacc aataataata actgggacat gaaaaactgc tctttcaatg 480
tcacctcagg cataagagat aaggtgcgaa aagaacatgc actcttttat gcacttgatg 540
tagtaccaat agataatgag actaactata ggttgataag ttgtaacacc tcagtcatca 600
cacaggcctg tccaaaggta tcctttgagc caattcctat acattattat gccccggctg 660
gttttgcgat tctaaaatgt agggataaaa agttcaatgg aacaggacca tgtaaagatg 720
tcagcacagt acaatgtaca catggaatta agccagtagt atcaactcaa ctactgttaa 780
atggcagtct agcagaagaa gaggtagtaa tcagatctga aaacttcacg aacaatgcta 840
aaaccatatt agtacaactg aatgaatctg tagtaattaa ttgtacaaga cccaacaaca 900
atacaagaaa aagtataaat ataggaccag ggagagcatt ctatgcaaca ggagaaataa 960
taggagatat aagacaagca cattgtaacc ttagtaaggc acaatggaac aacactttaa 1020
aaaaggtagt tgtaaaatta agagaacaat ttccgaataa aacgatagtc tttactcatt 1080
cctcaggagg ggacccagaa attgtaatgc acagttttaa ttgtggagga gaatttttct 1140
actgtaattc aacaccactg tttaatagta cttggaagtt gaatggtact atggaatcaa 1200
atgacactga aggaaatctc acactccaat gcagaataaa acaaatcatg aacaagtggc 1260
aggaagtagg aaaggcaatg tatgcccctc ccatccaagg acagattaga tgttcatcaa 1320
atattacagg gctgttatta gtaagagatg gtggggtcaa cagcgccaac gagaccttca 1380
gaccaggagg aggagatatg agggacaatt ggagaagtga attatataaa tataaagtag 1440
taaaaattga accattagga atagcaccca ccaaggcaaa gagaagagtg gtgcagagag 1500
aaaaaagagc agtgggaata ggagctttgt tccttgggtt cttgggagca gcaggaagca 1560
ct 1562
<210> 24
<211> 468
<212> PRT
<213> Human immunodeficiency virus type 1
<400> 24
Ser Ala Thr Glu Asn Leu Trp Val Thr Val Tyr Tyr Gly Val Pro Val
1 5 10 15
Trp Lys Glu Ala Thr Thr Thr Leu Phe Cys Ala Ser Asp Ala Lys Ala
20 25 30
Tyr Asp Thr Glu Ala His Asn Val Trp Ala Thr His Ala Cys Val Pro
35 40 45
Thr Asp Pro Asn Pro Gln Glu Met Arg Leu Glu Asn Val Thr Glu Asn
50 55 60
Phe Asn Met Trp Lys Asn Asn Met Val Glu Gln Met Gln Asp Asp Ile
65 70 75 80
Ile Ser Leu Trp Asp Gln Ser Leu Lys Pro Cys Val Lys Leu Thr Pro
85 90 95
Leu Cys Val Thr Leu Asn Cys Thr Asn Thr Thr Asn Ala Asn Ser Thr
100 105 110
Asn Asn Asn Asn Trp Asp Met Lys Asn Cys Ser Phe Asn Val Thr Ser
115 120 125
Gly Ile Arg Asp Lys Val Arg Lys Glu His Ala Leu Phe Tyr Ala Leu
130 135 140
Asp Val Val Pro Ile Asp Asn Glu Thr Asn Tyr Arg Leu Ile Ser Cys
145 150 155 160
Asn Thr Ser Val Ile Thr Gln Ala Cys Pro Lys Val Ser Phe Glu Pro
165 170 175
Ile Pro Ile His Tyr Tyr Ala Pro Ala Gly Phe Ala Ile Leu Lys Cys
180 185 190
Arg Asp Lys Lys Phe Asn Gly Thr Gly Pro Cys Lys Asp Val Ser Thr
195 200 205
Val Gln Cys Thr His Gly Ile Lys Pro Val Val Ser Thr Gln Leu Leu
210 215 220
Leu Asn Gly Ser Leu Ala Glu Glu Glu Val Val Ile Arg Ser Glu Asn
225 230 235 240
Phe Thr Asn Asn Ala Lys Thr Ile Leu Val Gln Leu Asn Glu Ser Val
245 250 255
Val Ile Asn Cys Thr Arg Pro Asn Asn Asn Thr Arg Lys Ser Ile Asn
260 265 270
Ile Gly Pro Gly Arg Ala Phe Tyr Ala Thr Gly Glu Ile Ile Gly Asp
275 280 285
Ile Arg Gln Ala His Cys Asn Leu Ser Lys Ala Gln Trp Asn Asn Thr
290 295 300
Leu Lys Lys Val Val Val Lys Leu Arg Glu Gln Phe Pro Asn Lys Thr
305 310 315 320
Ile Val Phe Thr His Ser Ser Gly Gly Asp Pro Glu Ile Val Met His
325 330 335
Ser Phe Asn Cys Gly Gly Glu Phe Phe Tyr Cys Asn Ser Thr Pro Leu
340 345 350
Phe Asn Ser Thr Trp Lys Leu Asn Gly Thr Met Glu Ser Asn Asp Thr
355 360 365
Glu Gly Asn Leu Thr Leu Gln Cys Arg Ile Lys Gln Ile Met Asn Lys
370 375 380
Trp Gln Glu Val Gly Lys Ala Met Tyr Ala Pro Pro Ile Gln Gly Gln
385 390 395 400
Ile Arg Cys Ser Ser Asn Ile Thr Gly Leu Leu Leu Val Arg Asp Gly
405 410 415
Gly Val Asn Ser Ala Asn Glu Thr Phe Arg Pro Gly Gly Gly Asp Met
420 425 430
Arg Asp Asn Trp Arg Ser Glu Leu Tyr Lys Tyr Lys Val Val Lys Ile
435 440 445
Glu Pro Leu Gly Ile Ala Pro Thr Lys Ala Lys Arg Arg Val Val Gln
450 455 460
Arg Glu Lys Arg
465
<210> 25
<211> 1643
<212> DNA
<213> Human immunodeficiency virus type 1
<400> 25
agaaagagca gaagacagtg gcaatgagag cgaaggggat caggaggaat tggcagcgct 60
tgtgttggag atggggcacg atgctccttg gaatgttaat gatctgtagt gctacagaac 120
cattgtgggt aacagtctat tatggggtac ctgtgtggaa agaagcaacc accactctat 180
ttcgtgcatc agatgctaaa gcatatggta cagaggtaca taatgtttgg gccacgcatg 240
cctgtgtacc cacagacccc aacccacaag aagtagtatt ggaaaatgta acagaaaatt 300
ttaatgcgtg ggaaaataac atggtggaac aaatgcatga ggatataatc agtttatggg 360
atcaaagtct aaagccatgt gtaaagttaa ccccactctg tgttacttta aaatgcactg 420
ataatttggg gaatgatact aaaaccagta ataagagctg ggaaaagatg gagccaggag 480
aaataaaaaa ctgctccttc aacatcacca caagcatagg agataagacg caggaaacat 540
atgcattttt ttataaactt gatgtagtac caatagataa taagactaca atagataata 600
atactgcaag aaactatagc gactataggt tgataagttg taacacctca gtcattacac 660
aggcctgtcc aaaggtatct tttgaaccaa ttcccataca ttattgtgcc ccggctggtt 720
ttgcgattct aaagtgtaac aataagacat tcatgggaaa aggaccatgt acaaatgtca 780
gcacagtaca atgtacacat ggaattaagc cagtagtatc aactcaactg ctgttaaatg 840
gcagtctggc agaagaagag ataataatta gatctgaaaa tttcacggac aatgctaaaa 900
ccttaataat acatctgaac cactctgtag aaattaagtg tataagaccc aacaacaata 960
caagcgaagg tatacatata ggaccaggga gagcgtttta tccaacaaga ataataggag 1020
atataagaaa agcacattgt aacattaatg aaacagcatg gaagacaact ttagcacaga 1080
tagttacaaa attaagagaa caatttggga ataaaacaat agtctttagc caatcctcag 1140
gaggggaccc agaaattgta atgcacagtt ttaattgtgg aggggaattt ttctactgtg 1200
atacaacaaa actgtttaat agtacttgga atgttaatga tacttggaat ggtgctggag 1260
ggtcaaacag cactgaaaga aacaccacca tcatactccc atgcaaaata aaacaaatta 1320
taaacttgtg gcaggaggta ggaaaagcaa tgtatgcccc tcccatcaaa ggactaatta 1380
gatgttcatc aaatattaca gggctgctat taacaagaga tggtggtaat aacaatgaca 1440
caaacgggac agagatcttc agacctgggg gaggagatat gagggacaat tggagaagtg 1500
aattatataa atataaagta gtgaaaattg aaccattagg agtagcaccc actaaggcaa 1560
agagaagagt ggtgcagaga gaaagaagag cagtgggaat aggagctttg ttccttgggt 1620
tcttgggagc agcaggaagc act 1643
<210> 26
<211> 494
<212> PRT
<213> Human immunodeficiency virus type 1
<400> 26
Ser Ala Thr Glu Pro Leu Trp Val Thr Val Tyr Tyr Gly Val Pro Val
1 5 10 15
Trp Lys Glu Ala Thr Thr Thr Leu Phe Arg Ala Ser Asp Ala Lys Ala
20 25 30
Tyr Gly Thr Glu Val His Asn Val Trp Ala Thr His Ala Cys Val Pro
35 40 45
Thr Asp Pro Asn Pro Gln Glu Val Val Leu Glu Asn Val Thr Glu Asn
50 55 60
Phe Asn Ala Trp Glu Asn Asn Met Val Glu Gln Met His Glu Asp Ile
65 70 75 80
Ile Ser Leu Trp Asp Gln Ser Leu Lys Pro Cys Val Lys Leu Thr Pro
85 90 95
Leu Cys Val Thr Leu Lys Cys Thr Asp Asn Leu Gly Asn Asp Thr Lys
100 105 110
Thr Ser Asn Lys Ser Trp Glu Lys Met Glu Pro Gly Glu Ile Lys Asn
115 120 125
Cys Ser Phe Asn Ile Thr Thr Ser Ile Gly Asp Lys Thr Gln Glu Thr
130 135 140
Tyr Ala Phe Phe Tyr Lys Leu Asp Val Val Pro Ile Asp Asn Lys Thr
145 150 155 160
Thr Ile Asp Asn Asn Thr Ala Arg Asn Tyr Ser Asp Tyr Arg Leu Ile
165 170 175
Ser Cys Asn Thr Ser Val Ile Thr Gln Ala Cys Pro Lys Val Ser Phe
180 185 190
Glu Pro Ile Pro Ile His Tyr Cys Ala Pro Ala Gly Phe Ala Ile Leu
195 200 205
Lys Cys Asn Asn Lys Thr Phe Met Gly Lys Gly Pro Cys Thr Asn Val
210 215 220
Ser Thr Val Gln Cys Thr His Gly Ile Lys Pro Val Val Ser Thr Gln
225 230 235 240
Leu Leu Leu Asn Gly Ser Leu Ala Glu Glu Glu Ile Ile Ile Arg Ser
245 250 255
Glu Asn Phe Thr Asp Asn Ala Lys Thr Leu Ile Ile His Leu Asn His
260 265 270
Ser Val Glu Ile Lys Cys Ile Arg Pro Asn Asn Asn Thr Ser Glu Gly
275 280 285
Ile His Ile Gly Pro Gly Arg Ala Phe Tyr Pro Thr Arg Ile Ile Gly
290 295 300
Asp Ile Arg Lys Ala His Cys Asn Ile Asn Glu Thr Ala Trp Lys Thr
305 310 315 320
Thr Leu Ala Gln Ile Val Thr Lys Leu Arg Glu Gln Phe Gly Asn Lys
325 330 335
Thr Ile Val Phe Ser Gln Ser Ser Gly Gly Asp Pro Glu Ile Val Met
340 345 350
His Ser Phe Asn Cys Gly Gly Glu Phe Phe Tyr Cys Asp Thr Thr Lys
355 360 365
Leu Phe Asn Ser Thr Trp Asn Val Asn Asp Thr Trp Asn Gly Ala Gly
370 375 380
Gly Ser Asn Ser Thr Glu Arg Asn Thr Thr Ile Ile Leu Pro Cys Lys
385 390 395 400
Ile Lys Gln Ile Ile Asn Leu Trp Gln Glu Val Gly Lys Ala Met Tyr
405 410 415
Ala Pro Pro Ile Lys Gly Leu Ile Arg Cys Ser Ser Asn Ile Thr Gly
420 425 430
Leu Leu Leu Thr Arg Asp Gly Gly Asn Asn Asn Asp Thr Asn Gly Thr
435 440 445
Glu Ile Phe Arg Pro Gly Gly Gly Asp Met Arg Asp Asn Trp Arg Ser
450 455 460
Glu Leu Tyr Lys Tyr Lys Val Val Lys Ile Glu Pro Leu Gly Val Ala
465 470 475 480
Pro Thr Lys Ala Lys Arg Arg Val Val Gln Arg Glu Arg Arg
485 490
<210> 27
<211> 1613
<212> DNA
<213> Human immunodeficiency virus type 1
<400> 27
agaaagagca gaagacagtg gcaatgagag tgaaggggat caggaagaat tgtcagctct 60
tgtggaaatg gggcaccatg ctccttggga tgttgatgat ctgtagtgct gcagaacaac 120
tgtgggtcac agtctattat ggggtacctg tgtggaaaga tgcaaccacc actttatttt 180
gtgcatcaga tgctaaagca tacgacaaag aggcacataa tgtttgggcc acacatgcct 240
gtgtacccac agaccctaac ccacgagaaa taaaattgga aaatgtgaca gaaaatttta 300
acatgtggaa aaatgacatg gcagaccaga tgcatgagga tataatcagt ttatgggatc 360
aaagcctaaa gccatgtgta gaattaaccc cactctgtgt tactttaaat tgcactaata 420
ttagtttgaa tagtactaac aatgatacta ttaacagtag taatagtact gaaggaataa 480
atatgaggga agaaatgaaa aactgctctt tcaataccac cacaagtata ggagataaga 540
ataagagaga atatgcactt ttttataaac ttgatgtagt accaatagat aataagacaa 600
gctatacgtt gataaattgt aacacctcag tcattaaaca ggcctgtcca aaggtaacct 660
ttgaaccaat tcccatacat tattgtgccc cggctggttt tgcgattcta aagtgtctca 720
ataagacgtt cgatggaaat ggaacatgta caaatgtcag cacagtacag tgtacacatg 780
gcattagacc agtagtgtca acccaactac tgttaaatgg cagtctagca gaagaagagg 840
tagtaattag atatgagaat gtccaggaca atactaaaac cataatagta cagctgaacg 900
aaactgtaaa aattaattgt acaagaccca acaacaatac aagaaaaggt atacatgtgg 960
gatgggggag accaatttat gcaacaggag aaataatagg agatataaga caagcacatc 1020
gtaatctaag taaaaaagac tggggagaca ctttaaagaa gatagctata aaactacaag 1080
aacaatttaa tacaacaata atctttgagc aatcctcagg aggggaccca gaaattacaa 1140
tgcacagtct taattgtgga ggggaatttt tctactgtaa tacatcaaag ctgtttaatg 1200
gcacttggtc taatggtact tggactagtg gtatttggaa taatactgga gagtcagata 1260
gcacaatcac actcccatgc agaataaaac aaattataaa caggtggcag ggagtaggac 1320
aagcaatgta taaccctccc atcaacggac taattagctg ttcatcaaat attacaggac 1380
tgatattaac aagagatgga ggtaacaaca ggtccaacga gaccttcaga ccaagtggag 1440
gaaacatgag ggacaattgg agaagtgaat tatataaata tagagtagta aaaattgaac 1500
cattaggagt agcacccacc aaggcaaaga gaagagtggt gcagagagaa aaaagagcag 1560
tggggatgat aggagctgtg ttccttgggt tcttgggagc agcaggaagc act 1613
<210> 28
<211> 484
<212> PRT
<213> Human immunodeficiency virus type 1
<400> 28
Ser Ala Ala Glu Gln Leu Trp Val Thr Val Tyr Tyr Gly Val Pro Val
1 5 10 15
Trp Lys Asp Ala Thr Thr Thr Leu Phe Cys Ala Ser Asp Ala Lys Ala
20 25 30
Tyr Asp Lys Glu Ala His Asn Val Trp Ala Thr His Ala Cys Val Pro
35 40 45
Thr Asp Pro Asn Pro Arg Glu Ile Lys Leu Glu Asn Val Thr Glu Asn
50 55 60
Phe Asn Met Trp Lys Asn Asp Met Ala Asp Gln Met His Glu Asp Ile
65 70 75 80
Ile Ser Leu Trp Asp Gln Ser Leu Lys Pro Cys Val Glu Leu Thr Pro
85 90 95
Leu Cys Val Thr Leu Asn Cys Thr Asn Ile Ser Leu Asn Ser Thr Asn
100 105 110
Asn Asp Thr Ile Asn Ser Ser Asn Ser Thr Glu Gly Ile Asn Met Arg
115 120 125
Glu Glu Met Lys Asn Cys Ser Phe Asn Thr Thr Thr Ser Ile Gly Asp
130 135 140
Lys Asn Lys Arg Glu Tyr Ala Leu Phe Tyr Lys Leu Asp Val Val Pro
145 150 155 160
Ile Asp Asn Lys Thr Ser Tyr Thr Leu Ile Asn Cys Asn Thr Ser Val
165 170 175
Ile Lys Gln Ala Cys Pro Lys Val Thr Phe Glu Pro Ile Pro Ile His
180 185 190
Tyr Cys Ala Pro Ala Gly Phe Ala Ile Leu Lys Cys Leu Asn Lys Thr
195 200 205
Phe Asp Gly Asn Gly Thr Cys Thr Asn Val Ser Thr Val Gln Cys Thr
210 215 220
His Gly Ile Arg Pro Val Val Ser Thr Gln Leu Leu Leu Asn Gly Ser
225 230 235 240
Leu Ala Glu Glu Glu Val Val Ile Arg Tyr Glu Asn Val Gln Asp Asn
245 250 255
Thr Lys Thr Ile Ile Val Gln Leu Asn Glu Thr Val Lys Ile Asn Cys
260 265 270
Thr Arg Pro Asn Asn Asn Thr Arg Lys Gly Ile His Val Gly Trp Gly
275 280 285
Arg Pro Ile Tyr Ala Thr Gly Glu Ile Ile Gly Asp Ile Arg Gln Ala
290 295 300
His Arg Asn Leu Ser Lys Lys Asp Trp Gly Asp Thr Leu Lys Lys Ile
305 310 315 320
Ala Ile Lys Leu Gln Glu Gln Phe Asn Thr Thr Ile Ile Phe Glu Gln
325 330 335
Ser Ser Gly Gly Asp Pro Glu Ile Thr Met His Ser Leu Asn Cys Gly
340 345 350
Gly Glu Phe Phe Tyr Cys Asn Thr Ser Lys Leu Phe Asn Gly Thr Trp
355 360 365
Ser Asn Gly Thr Trp Thr Ser Gly Ile Trp Asn Asn Thr Gly Glu Ser
370 375 380
Asp Ser Thr Ile Thr Leu Pro Cys Arg Ile Lys Gln Ile Ile Asn Arg
385 390 395 400
Trp Gln Gly Val Gly Gln Ala Met Tyr Asn Pro Pro Ile Asn Gly Leu
405 410 415
Ile Ser Cys Ser Ser Asn Ile Thr Gly Leu Ile Leu Thr Arg Asp Gly
420 425 430
Gly Asn Asn Arg Ser Asn Glu Thr Phe Arg Pro Ser Gly Gly Asn Met
435 440 445
Arg Asp Asn Trp Arg Ser Glu Leu Tyr Lys Tyr Arg Val Val Lys Ile
450 455 460
Glu Pro Leu Gly Val Ala Pro Thr Lys Ala Lys Arg Arg Val Val Gln
465 470 475 480
Arg Glu Lys Arg
<210> 29
<211> 1646
<212> DNA
<213> Human immunodeficiency virus type 1
<400> 29
agaaagagca gaagacagtg gcaatgagag tgaaggggat caggaagaac tatcagcact 60
tgtggaaatg gggcaccttg ctccttggga tatcgatgat ctgtagtgct aaagaagaca 120
agttgtgggt cacagtctat tatggggtac ctgtgtggag agatgcaaac accactctat 180
tttgtgcatc aggtgctaaa gcatataaga cagaggtaca taatgtctgg gccacacatg 240
cctgtgtacc cacagacccc aacccacaag aagtggtatt gggaaatgtg acagaatatt 300
ttaatgcatg gaaaaatgac atggtagaac agatgcatga ggatataatc aatctatggg 360
atcaaagcct aaagccatgt gtaaaattaa ccccactctg tgtcacttta aattgcacta 420
acgttaaaaa caatgctacc aaaataaatg ataccaccac tacacctagt gaggaaatag 480
aaataaaaaa ctgctctttc aacatcaccg caggcataag agataagata cagaaagaat 540
atgcattgtt ttctaaattt gatttagtac aaatccatga agataataaa aataataata 600
atacaaacta tacagactat aggttgataa gttgtaacac ctcagtcatt acgcaggcct 660
gtccaaaagt atcctttgag ccaattccca tacatttttg taccccggct ggttttgcga 720
ttctaaagcg taataataag acattcaacg gaaaagggcc atgtacaaat gtcagtacag 780
tacagtgtac acatggaatt aggccagtag tatcaactca actgctgcta aatggcagtt 840
tagcagaaga ggatgtagta attagatctg aaaatttcac aaacaatgtt aaaaccataa 900
tagtacagct gaaagaagct gtacaaatta attgcacaag gcccaacaac aatacaagaa 960
aaagtatacc tataggacca gggagagcat tttatgcaac aggagacata ataggagata 1020
taagacaagc acattgtaac attagtggaa cacaatggaa taaaacttta ggaaagatag 1080
ttgaaaaatt aaaagaacaa tttgggaata aaacaataat ctttaaccaa cccgtaggag 1140
gggacccaga aattgtagcg cacactttta attgtggagg ggaatttttc tactgtaata 1200
caacacctct gtttaatagt acctggactt ggaatagtac ttggaatggt actacaagta 1260
ctgggaatgt tactaaaaaa attatcacac tccaatgcag aataagacaa attgtaaaca 1320
tgtggcagaa agtaggaaaa gcaatgtatg cccctcccat cagaggacag attggatgtt 1380
catcaaatat tacagggctg ctattaacaa gagatggtgg taatagtgag aacgggacca 1440
ataacacaga cacagagacc ttcagaccgg gaggaggaga tatgagggac aattggagaa 1500
gtgaattata taaatataaa gtagtaagaa ttgaaccatt aggaatagca cccactaagg 1560
caaggagaag agtggtgcag agagaaaaaa gagcagtggg aataggggct ttgttccttg 1620
ggttcttggg agcagcagga agcact 1646
<210> 30
<211> 496
<212> PRT
<213> Human immunodeficiency virus type 1
<400> 30
Ser Ala Lys Glu Asp Lys Leu Trp Val Thr Val Tyr Tyr Gly Val Pro
1 5 10 15
Val Trp Arg Asp Ala Asn Thr Thr Leu Phe Cys Ala Ser Gly Ala Lys
20 25 30
Ala Tyr Lys Thr Glu Val His Asn Val Trp Ala Thr His Ala Cys Val
35 40 45
Pro Thr Asp Pro Asn Pro Gln Glu Val Val Leu Gly Asn Val Thr Glu
50 55 60
Tyr Phe Asn Ala Trp Lys Asn Asp Met Val Glu Gln Met His Glu Asp
65 70 75 80
Ile Ile Asn Leu Trp Asp Gln Ser Leu Lys Pro Cys Val Lys Leu Thr
85 90 95
Pro Leu Cys Val Thr Leu Asn Cys Thr Asn Val Lys Asn Asn Ala Thr
100 105 110
Lys Ile Asn Asp Thr Thr Thr Thr Pro Ser Glu Glu Ile Glu Ile Lys
115 120 125
Asn Cys Ser Phe Asn Ile Thr Ala Gly Ile Arg Asp Lys Ile Gln Lys
130 135 140
Glu Tyr Ala Leu Phe Ser Lys Phe Asp Leu Val Gln Ile His Glu Asp
145 150 155 160
Asn Lys Asn Asn Asn Asn Thr Asn Tyr Thr Asp Tyr Arg Leu Ile Ser
165 170 175
Cys Asn Thr Ser Val Ile Thr Gln Ala Cys Pro Lys Val Ser Phe Glu
180 185 190
Pro Ile Pro Ile His Phe Cys Thr Pro Ala Gly Phe Ala Ile Leu Lys
195 200 205
Arg Asn Asn Lys Thr Phe Asn Gly Lys Gly Pro Cys Thr Asn Val Ser
210 215 220
Thr Val Gln Cys Thr His Gly Ile Arg Pro Val Val Ser Thr Gln Leu
225 230 235 240
Leu Leu Asn Gly Ser Leu Ala Glu Glu Asp Val Val Ile Arg Ser Glu
245 250 255
Asn Phe Thr Asn Asn Val Lys Thr Ile Ile Val Gln Leu Lys Glu Ala
260 265 270
Val Gln Ile Asn Cys Thr Arg Pro Asn Asn Asn Thr Arg Lys Ser Ile
275 280 285
Pro Ile Gly Pro Gly Arg Ala Phe Tyr Ala Thr Gly Asp Ile Ile Gly
290 295 300
Asp Ile Arg Gln Ala His Cys Asn Ile Ser Gly Thr Gln Trp Asn Lys
305 310 315 320
Thr Leu Gly Lys Ile Val Glu Lys Leu Lys Glu Gln Phe Gly Asn Lys
325 330 335
Thr Ile Ile Phe Asn Gln Pro Val Gly Gly Asp Pro Glu Ile Val Ala
340 345 350
His Thr Phe Asn Cys Gly Gly Glu Phe Phe Tyr Cys Asn Thr Thr Pro
355 360 365
Leu Phe Asn Ser Thr Trp Thr Trp Asn Ser Thr Trp Asn Gly Thr Thr
370 375 380
Ser Thr Gly Asn Val Thr Lys Lys Ile Ile Thr Leu Gln Cys Arg Ile
385 390 395 400
Arg Gln Ile Val Asn Met Trp Gln Lys Val Gly Lys Ala Met Tyr Ala
405 410 415
Pro Pro Ile Arg Gly Gln Ile Gly Cys Ser Ser Asn Ile Thr Gly Leu
420 425 430
Leu Leu Thr Arg Asp Gly Gly Asn Ser Glu Asn Gly Thr Asn Asn Thr
435 440 445
Asp Thr Glu Thr Phe Arg Pro Gly Gly Gly Asp Met Arg Asp Asn Trp
450 455 460
Arg Ser Glu Leu Tyr Lys Tyr Lys Val Val Arg Ile Glu Pro Leu Gly
465 470 475 480
Ile Ala Pro Thr Lys Ala Arg Arg Arg Val Val Gln Arg Glu Lys Arg
485 490 495
<210> 31
<211> 1606
<212> DNA
<213> Human immunodeficiency virus type 1
<400> 31
agaaagacag aagacagtgg caatgagagc gaaggggacc aggaagaatt gtcagcactt 60
gtggtggaga tggggcacca tgctctttgg gatgttgatg atctgtagtg ctgcaaaaga 120
aaaattgtgg gtcacagtct attatggggt gcctgtgtgg aaagaagcaa ccaccactct 180
attttgtgca tcagatgcta aagcatatga cacagaagca cataatgttt gggccacaca 240
tgcttgtgta cccacaaacc ctaacccaca agaagtatta ttgaaaaatg tgacagaaga 300
ttttaacatg tggaaaaata atatggtaga acagatgcat gaggatataa tcagtttatg 360
ggatcaaagc ctaaagccat gtgtgaaatt aaccccactc tgtgttactt tacattgcac 420
tgatgcgaac attactgcaa acagtactgc tactaacagt actgttagct ccattaaaga 480
agaagtgaaa aactgctctt tcaatatcac cacagaagta agagacaagg taaagaaaga 540
acatgcactt ttttatagac ttgatgtagt accaatagct aatgataata caagctatac 600
attggtaaat tgtaacacct caaccattac acaggcctgt ccaaaggtga cctttgaacc 660
aattcctata cattattgtg ccccggctgg ttttgcgatt ctaaaatgta atgataagaa 720
tttcaatgga acaggaccat gtaaaaatgt cagcacagta caatgcacac atggaattag 780
gccagtggtg tcaactcaac tactgttaaa tggcagtcta gcagaagatg aggtagtaat 840
tagatctgaa aatttcacaa acaatgcaaa aatcataata gtacagctaa atgaatctgt 900
aataattaat tatacaagac ctggcaacaa tacaagaaaa agtatacata taggaccggg 960
aagtgcattt tatgcaacag gagacataat aggagatata agacaagcac attgtaacat 1020
tagtaaagca gattgggaga aaacactaaa acaggtagtt aaaaaattac aggaacaata 1080
tgggaataaa acaataaact ttacccaatc ctcaggagga gacccagaaa ttgtaatgca 1140
cagtcttaat tgtggaggag aatttttcta ttgtaataca acaaagctgt ttaatagtac 1200
ttggcagaat ggtactattg taggatcaga aaatacgtca gacattatca tactcccatg 1260
cagaataaag caaattataa acaggtggca ggaagtagga aaagcaatgt atgcccctcc 1320
catcagcgga gacattagat gtacatcaaa tattacaggg ctgctattaa caagagatgg 1380
gggtataaag aacaagacca atgggacaga gacagagatc ttcagacctg caggaggaga 1440
tatgaaggac aattggagaa gtgaattata taaatataaa gtagtaaaaa ttgaaccgtt 1500
aggaatagca cccaccaggg caaggagaag agtggtgcag agggaaaaaa gagaagtgac 1560
gctgggagtt atgttccttg ggttcttggg agcagcagga agcact 1606
<210> 32
<211> 482
<212> PRT
<213> Human immunodeficiency virus type 1
<400> 32
Ser Ala Ala Lys Glu Lys Leu Trp Val Thr Val Tyr Tyr Gly Val Pro
1 5 10 15
Val Trp Lys Glu Ala Thr Thr Thr Leu Phe Cys Ala Ser Asp Ala Lys
20 25 30
Ala Tyr Asp Thr Glu Ala His Asn Val Trp Ala Thr His Ala Cys Val
35 40 45
Pro Thr Asn Pro Asn Pro Gln Glu Val Leu Leu Lys Asn Val Thr Glu
50 55 60
Asp Phe Asn Met Trp Lys Asn Asn Met Val Glu Gln Met His Glu Asp
65 70 75 80
Ile Ile Ser Leu Trp Asp Gln Ser Leu Lys Pro Cys Val Lys Leu Thr
85 90 95
Pro Leu Cys Val Thr Leu His Cys Thr Asp Ala Asn Ile Thr Ala Asn
100 105 110
Ser Thr Ala Thr Asn Ser Thr Val Ser Ser Ile Lys Glu Glu Val Lys
115 120 125
Asn Cys Ser Phe Asn Ile Thr Thr Glu Val Arg Asp Lys Val Lys Lys
130 135 140
Glu His Ala Leu Phe Tyr Arg Leu Asp Val Val Pro Ile Ala Asn Asp
145 150 155 160
Asn Thr Ser Tyr Thr Leu Val Asn Cys Asn Thr Ser Thr Ile Thr Gln
165 170 175
Ala Cys Pro Lys Val Thr Phe Glu Pro Ile Pro Ile His Tyr Cys Ala
180 185 190
Pro Ala Gly Phe Ala Ile Leu Lys Cys Asn Asp Lys Asn Phe Asn Gly
195 200 205
Thr Gly Pro Cys Lys Asn Val Ser Thr Val Gln Cys Thr His Gly Ile
210 215 220
Arg Pro Val Val Ser Thr Gln Leu Leu Leu Asn Gly Ser Leu Ala Glu
225 230 235 240
Asp Glu Val Val Ile Arg Ser Glu Asn Phe Thr Asn Asn Ala Lys Ile
245 250 255
Ile Ile Val Gln Leu Asn Glu Ser Val Ile Ile Asn Tyr Thr Arg Pro
260 265 270
Gly Asn Asn Thr Arg Lys Ser Ile His Ile Gly Pro Gly Ser Ala Phe
275 280 285
Tyr Ala Thr Gly Asp Ile Ile Gly Asp Ile Arg Gln Ala His Cys Asn
290 295 300
Ile Ser Lys Ala Asp Trp Glu Lys Thr Leu Lys Gln Val Val Lys Lys
305 310 315 320
Leu Gln Glu Gln Tyr Gly Asn Lys Thr Ile Asn Phe Thr Gln Ser Ser
325 330 335
Gly Gly Asp Pro Glu Ile Val Met His Ser Leu Asn Cys Gly Gly Glu
340 345 350
Phe Phe Tyr Cys Asn Thr Thr Lys Leu Phe Asn Ser Thr Trp Gln Asn
355 360 365
Gly Thr Ile Val Gly Ser Glu Asn Thr Ser Asp Ile Ile Ile Leu Pro
370 375 380
Cys Arg Ile Lys Gln Ile Ile Asn Arg Trp Gln Glu Val Gly Lys Ala
385 390 395 400
Met Tyr Ala Pro Pro Ile Ser Gly Asp Ile Arg Cys Thr Ser Asn Ile
405 410 415
Thr Gly Leu Leu Leu Thr Arg Asp Gly Gly Ile Lys Asn Lys Thr Asn
420 425 430
Gly Thr Glu Thr Glu Ile Phe Arg Pro Ala Gly Gly Asp Met Lys Asp
435 440 445
Asn Trp Arg Ser Glu Leu Tyr Lys Tyr Lys Val Val Lys Ile Glu Pro
450 455 460
Leu Gly Ile Ala Pro Thr Arg Ala Arg Arg Arg Val Val Gln Arg Glu
465 470 475 480
Lys arg
<210> 33
<211> 1589
<212> DNA
<213> Human immunodeficiency virus type 1
<400> 33
agaaagagca gaagacagtg gcaatgaaag tgaaggggac caggaagagt tatcagcact 60
tgtggagatg gggcatcatg ccccttggga tgttgacgat ttgtagtgtt gcagaacaat 120
tgtgggtcac agtctattat ggggtacctg tgtggaaaga agccaccacc actctatttt 180
gtgcatcaga agctaaggca tatgttacag aggtacataa tatttgggcc acacatgcct 240
gtgtacccac agaccccaac ccacaagaag cagtattgga aaatgtgaca gaaaatttta 300
acatatggaa aaatgacatg gtagaccaga tgcatgagga tataatcagt gtatgggatc 360
aaagcctaaa gccatgtgtg aaattaaccc cgctctgtgt cactttaaat tgcactgatt 420
attttgggaa aactaatatt actaccactt ctagcagtgg tcccaataat gatagaggaa 480
tgaaaaactg ctctttcaat atcaccacaa gcataagaga taaggtaacg aaagaacatg 540
cactttttta tagagttgat gtagtcccaa tagatagtag taatagtagc tatagattga 600
taaattgtaa cacctcagtc attacacagg ccagtccaaa agtatccttt gagccaattc 660
ccatacatta ttgtaccccg gctggttttg cgattataaa gtgtaataat aagacattca 720
atggaacagg accatgtaga aatgtcagca cagtacaatg tacacatgga attaggccaa 780
tagtgtcaac tcagctgttg ttaaatggca gtctagcagt agaagaggta gtaattagat 840
ctgaaaatat cacgaacaat gctaaaacca taatagtaca attgaacgaa tctgtaagca 900
tcaattgtac aagacccagc aacaatacaa gaagggggat acatatggga ccagggagag 960
cattttggac aacaggtgaa gtaataggag atataaggaa agcacattgt aacattagta 1020
gaaaagaatg gaatgacact ttagacaagg tagtcaaaaa attaagggaa aaatttaatg 1080
caacaataat ctttaatcaa tccacaggag gggacccaga aattgtaatg cacactttta 1140
attgtggagg ggagtttttc tactgtaaca catcacaact gtttaatagt acttgggata 1200
ttaatggaaa tactactgga gggttagaag gcaatgacac aatcacactc caatgtagaa 1260
taaaacaaat tgtaaacatg tggcaggaag taggaaaagc aatgtatgcc cctcccatcc 1320
aaggaaaaat tagatgttca tcaaatatta cagggctgct attaacaaga gatggtggta 1380
ataacagtag taacaatgag accttcagac ctggaggagg agatatgagg gacaattgga 1440
gaagtgaact atataaatat aaagtagtaa aaattcaacc actaggaata gcacccacca 1500
aggcaaagag aagagtggtg cagagagaaa aaagagcagt gggaatagga gctttgttcc 1560
ttgggttctt gggagcagca ggaagcact 1589
<210> 34
<211> 477
<212> PRT
<213> Human immunodeficiency virus type 1
<400> 34
Ser Val Ala Glu Gln Leu Trp Val Thr Val Tyr Tyr Gly Val Pro Val
1 5 10 15
Trp Lys Glu Ala Thr Thr Thr Leu Phe Cys Ala Ser Glu Ala Lys Ala
20 25 30
Tyr Val Thr Glu Val His Asn Ile Trp Ala Thr His Ala Cys Val Pro
35 40 45
Thr Asp Pro Asn Pro Gln Glu Ala Val Leu Glu Asn Val Thr Glu Asn
50 55 60
Phe Asn Ile Trp Lys Asn Asp Met Val Asp Gln Met His Glu Asp Ile
65 70 75 80
Ile Ser Val Trp Asp Gln Ser Leu Lys Pro Cys Val Lys Leu Thr Pro
85 90 95
Leu Cys Val Thr Leu Asn Cys Thr Asp Tyr Phe Gly Lys Thr Asn Ile
100 105 110
Thr Thr Thr Ser Ser Ser Gly Pro Asn Asn Asp Arg Gly Met Lys Asn
115 120 125
Cys Ser Phe Asn Ile Thr Thr Ser Ile Arg Asp Lys Val Thr Lys Glu
130 135 140
His Ala Leu Phe Tyr Arg Val Asp Val Val Pro Ile Asp Ser Ser Asn
145 150 155 160
Ser Ser Tyr Arg Leu Ile Asn Cys Asn Thr Ser Val Ile Thr Gln Ala
165 170 175
Ser Pro Lys Val Ser Phe Glu Pro Ile Pro Ile His Tyr Cys Thr Pro
180 185 190
Ala Gly Phe Ala Ile Ile Lys Cys Asn Asn Lys Thr Phe Asn Gly Thr
195 200 205
Gly Pro Cys Arg Asn Val Ser Thr Val Gln Cys Thr His Gly Ile Arg
210 215 220
Pro Ile Val Ser Thr Gln Leu Leu Leu Asn Gly Ser Leu Ala Val Glu
225 230 235 240
Glu Val Val Ile Arg Ser Glu Asn Ile Thr Asn Asn Ala Lys Thr Ile
245 250 255
Ile Val Gln Leu Asn Glu Ser Val Ser Ile Asn Cys Thr Arg Pro Ser
260 265 270
Asn Asn Thr Arg Arg Gly Ile His Met Gly Pro Gly Arg Ala Phe Trp
275 280 285
Thr Thr Gly Glu Val Ile Gly Asp Ile Arg Lys Ala His Cys Asn Ile
290 295 300
Ser Arg Lys Glu Trp Asn Asp Thr Leu Asp Lys Val Val Lys Lys Leu
305 310 315 320
Arg Glu Lys Phe Asn Ala Thr Ile Ile Phe Asn Gln Ser Thr Gly Gly
325 330 335
Asp Pro Glu Ile Val Met His Thr Phe Asn Cys Gly Gly Glu Phe Phe
340 345 350
Tyr Cys Asn Thr Ser Gln Leu Phe Asn Ser Thr Trp Asp Ile Asn Gly
355 360 365
Asn Thr Thr Gly Gly Leu Glu Gly Asn Asp Thr Ile Thr Leu Gln Cys
370 375 380
Arg Ile Lys Gln Ile Val Asn Met Trp Gln Glu Val Gly Lys Ala Met
385 390 395 400
Tyr Ala Pro Pro Ile Gln Gly Lys Ile Arg Cys Ser Ser Asn Ile Thr
405 410 415
Gly Leu Leu Leu Thr Arg Asp Gly Gly Asn Asn Ser Ser Asn Asn Glu
420 425 430
Thr Phe Arg Pro Gly Gly Gly Asp Met Arg Asp Asn Trp Arg Ser Glu
435 440 445
Leu Tyr Lys Tyr Lys Val Val Lys Ile Gln Pro Leu Gly Ile Ala Pro
450 455 460
Thr Lys Ala Lys Arg Arg Val Val Gln Arg Glu Lys Arg
465 470 475
<210> 35
<211> 1703
<212> DNA
<213> Human immunodeficiency virus type 1
<400> 35
agaaagagca gaagacagtg gcaatgagag tgagggagac caggaagaat tatcagcact 60
tgtggagatg gggcaacata tggagatggg gcatgatgct ccttgggatg ttgatgatct 120
gtagtgctgc agaagatttg tgggtcacag tttattatgg ggtacctgtg tggaaagacg 180
caaagaccac tctattttgt gcatcagatg ctaaagcata taagacagag gtacataatg 240
tttgggccac gcatgcctgt gtacccacag accccaaccc acaagaagta gaaatgaaaa 300
atgtgacaga agattttaac atgtggaaaa ataatatggt agaacagatg catgaggata 360
taatcagttt atgggatcag agcctaaaac cacgtgtaaa attaacccca ctctgtgtta 420
ctttaaagtg ctttgatgtg aagaataaaa ccactactac cactactaat agtaccacat 480
ccactattag tactactacc actaagacgc ccactgttag taaagggaca gagaaatcag 540
aactgacaaa ctgctctttc aatatcacca caaacataag agataagttt cagaaaaact 600
atgcaatttt tgataaactt gatgtagtac caatagatga tgataatgat actactacta 660
acaataatac tagtaatgaa aaaagcttta ggttaataaa ttgtaacacc tcagtcatca 720
cacaggcctg cccaaagata tcatttgaac caattcccat acattattgt accccggctg 780
gttttgcgat tctaaagtgt aaagataaaa atttcaatgg aacaggaaaa tgtaaaagcg 840
tcagcacagt gcaatgtaca catggaatta ggccagtagt gtcaactcaa ctactgttaa 900
atggcagtct agcagaagaa gaggtagtaa ttagatctgc cgatttctcg gacaatacta 960
aaatcataat agtacagctg aataaatctg tagaaattaa ttgtacaaga cccaacaaca 1020
ataaaagaaa aagtataaat ataggaccag ggagagcaat gtttgcaaca ggagacataa 1080
taggagatat aagaaaagca tcttgtacca ttaatgaaac acaatggaat aacacgttac 1140
aacaggtagt tataaaatta aaagaacaat atggaaataa aacaatagtc tttgaccgcc 1200
cctcaggagg ggacccagaa attgtaatgc acagttttaa ttgtggagga gaatttttct 1260
attgtaattc aacacaactg tttaatagta gttgggggcc taatggtact cggaatggta 1320
ctacaacgat aaatggtact atcatactcc catgtagaat aaaacaaatt ataaacatgt 1380
ggcaggaagt aggaaaagca atgtatgccc ctcccatcga gggacttatt aactgtacat 1440
caaatatcac agggctacta ttaacaagag atggtggcca tgacaataat gacacaaaaa 1500
ataacaatac cgagatcttc agacctggag gaggagatat gagggacaat tggagaagtg 1560
aattatataa atataaagta gtaaaaattg aaccattagg aatagcaccc aacaggacaa 1620
aaagaatagt ggtgcaaaga gaaaaaagag cagtgggatt cggagctgtg ttccttgggt 1680
tcttgggagc agcaggaagc act 1703
<210> 36
<211> 509
<212> PRT
<213> Human immunodeficiency virus type 1
<400> 36
Ser Ala Ala Glu Asp Leu Trp Val Thr Val Tyr Tyr Gly Val Pro Val
1 5 10 15
Trp Lys Asp Ala Lys Thr Thr Leu Phe Cys Ala Ser Asp Ala Lys Ala
20 25 30
Tyr Lys Thr Glu Val His Asn Val Trp Ala Thr His Ala Cys Val Pro
35 40 45
Thr Asp Pro Asn Pro Gln Glu Val Glu Met Lys Asn Val Thr Glu Asp
50 55 60
Phe Asn Met Trp Lys Asn Asn Met Val Glu Gln Met His Glu Asp Ile
65 70 75 80
Ile Ser Leu Trp Asp Gln Ser Leu Lys Pro Arg Val Lys Leu Thr Pro
85 90 95
Leu Cys Val Thr Leu Lys Cys Phe Asp Val Lys Asn Lys Thr Thr Thr
100 105 110
Thr Thr Thr Asn Ser Thr Thr Ser Ser Thr Ile Ser Thr Thr Thr Thr Lys
115 120 125
Thr Pro Thr Val Ser Lys Gly Thr Glu Lys Ser Glu Leu Thr Asn Cys
130 135 140
Ser Phe Asn Ile Thr Thr Asn Ile Arg Asp Lys Phe Gln Lys Asn Tyr
145 150 155 160
Ala Ile Phe Asp Lys Leu Asp Val Val Pro Ile Asp Asp Asp Asn Asp
165 170 175
Thr Thr Thr Asn Asn Asn Thr Ser Asn Glu Lys Ser Phe Arg Leu Ile
180 185 190
Asn Cys Asn Thr Ser Val Ile Thr Gln Ala Cys Pro Lys Ile Ser Phe
195 200 205
Glu Pro Ile Pro Ile His Tyr Cys Thr Pro Ala Gly Phe Ala Ile Leu
210 215 220
Lys Cys Lys Asp Lys Asn Phe Asn Gly Thr Gly Lys Cys Lys Ser Val
225 230 235 240
Ser Thr Val Gln Cys Thr His Gly Ile Arg Pro Val Val Ser Thr Gln
245 250 255
Leu Leu Leu Asn Gly Ser Leu Ala Glu Glu Glu Val Val Ile Arg Ser
260 265 270
Ala Asp Phe Ser Asp Asn Thr Lys Ile Ile Ile Val Gln Leu Asn Lys
275 280 285
Ser Val Glu Ile Asn Cys Thr Arg Pro Asn Asn Asn Lys Arg Lys Ser
290 295 300
Ile Asn Ile Gly Pro Gly Arg Ala Met Phe Ala Thr Gly Asp Ile Ile
305 310 315 320
Gly Asp Ile Arg Lys Ala Ser Cys Thr Ile Asn Glu Thr Gln Trp Asn
325 330 335
Asn Thr Leu Gln Gln Val Val Ile Lys Leu Lys Glu Gln Tyr Gly Asn
340 345 350
Lys Thr Ile Val Phe Asp Arg Pro Ser Gly Gly Asp Pro Glu Ile Val
355 360 365
Met His Ser Phe Asn Cys Gly Gly Glu Phe Phe Tyr Cys Asn Ser Thr
370 375 380
Gln Leu Phe Asn Ser Ser Trp Gly Pro Asn Gly Thr Arg Asn Gly Thr
385 390 395 400
Thr Thr Ile Asn Gly Thr Ile Ile Leu Pro Cys Arg Ile Lys Gln Ile
405 410 415
Ile Asn Met Trp Gln Glu Val Gly Lys Ala Met Tyr Ala Pro Pro Ile
420 425 430
Glu Gly Leu Ile Asn Cys Thr Ser Asn Ile Thr Gly Leu Leu Leu Thr
435 440 445
Arg Asp Gly Gly His Asp Asn Asn Asp Thr Lys Asn Asn Asn Thr Glu
450 455 460
Ile Phe Arg Pro Gly Gly Gly Asp Met Arg Asp Asn Trp Arg Ser Glu
465 470 475 480
Leu Tyr Lys Tyr Lys Val Val Lys Ile Glu Pro Leu Gly Ile Ala Pro
485 490 495
Asn Arg Thr Lys Arg Ile Val Val Gln Arg Glu Lys Arg
500 505
<210> 37
<211> 1658
<212> DNA
<213> Human immunodeficiency virus type 1
<400> 37
agaaagagca gaagacagtg gcaatgaaag tgaaggggat caagaagagt tatcagcact 60
tgttgagatg gggcgccatg ctccttggga tgttaatgat ctgtagtgct gcagaacaat 120
tgtgggtcac agtctattat ggggtacctg tgtggagaga agcaaacacc actctatttt 180
gtgcatcaga tgctaaagca tatgataaag aggtacataa tgtttgggcc acacatgcct 240
gtgtacccac agaccccaac ccacaagaag tagaattgga aaatgtgaca gaaaatttta 300
acatgtggaa aaatgacatg gtagaacaga tgcatgagga tataatcagt ttatgggatc 360
aaagcctaaa gccatgtgta aaattaaccc cactctgcgt tactttaaat tgtactgatt 420
taaggtcaca gaatgtgact tataccactg gtgctaatac cactatggct actaccacta 480
gtactaatac cactagtagt gggggagaga tgcaggtagg aatgaaaaac tgctctttca 540
atatcaccac aaacacacaa gataaggtga agggatatgc acattttgat aaccttgatc 600
tagtacaaat agaggatgaa aatcacagca ataacagcta taggttgata cattgtaaca 660
cctcagtaat tacacaggcc tgtccaaagg tatcctttga gccaattcct atacattatc 720
gtgccccggc tggttttgcg attctaaagt gtaaagataa gaagttcaat ggaacaggac 780
cctgtacaaa tgtcagcaca gtacagtgta cacatggaat taggccagta gtatccactc 840
aactgctgtt caatggcagt ctagcagaag aagaggtagt aattagatct gccaatttct 900
cagaaaatga taaaatcata atagtacagc tgaaagacgc tgtacaaatt aattgtacaa 960
gacccaacaa caacaccaga aaaggtatac atatgggacc agggaaagta ttttacgcaa 1020
cagaagtcat aggggacata aggcgagcac attgtaacat tagtaaagaa aattggaata 1080
atactttaaa acagatagct atacaattaa gagagcaaga gcagttcaag aataaaacaa 1140
tagtctttaa tcaatcctca ggaggggacc cagaaattgt aatgtctagt tttaattgtg 1200
gaggggaatt tttctactgt aatacaacac aactgtttaa tagtacttgg gagaatgata 1260
ctagtacttg gaatgatact gaagggtcga atggcactat cacactccca tgcagaataa 1320
aacaaattat caacatgtgg caggaggtag gaaaagcaat atatgcccct cccatcaaag 1380
gaccacttca ttgttcatca aatattacag ggctactatt aacaagagat ggtggtaata 1440
ctaatgagag caacaccacc gaggtcgagg tcttcagacc tttaggagga aacatgaggg 1500
acaattggag aagtgaatta tataaatata aagtagtaaa gattgaacca ttaggaatag 1560
cacccaccaa ggcaaagaga agagtggtgc agagagaaaa aagagcagtg ggaataggag 1620
ctgtgttcct tgggttcttg ggagcagcag gaagcact 1658
<210> 38
<211> 500
<212> PRT
<213> Human immunodeficiency virus type 1
<400> 38
Ser Ala Ala Glu Gln Leu Trp Val Thr Val Tyr Tyr Gly Val Pro Val
1 5 10 15
Trp Arg Glu Ala Asn Thr Thr Leu Phe Cys Ala Ser Asp Ala Lys Ala
20 25 30
Tyr Asp Lys Glu Val His Asn Val Trp Ala Thr His Ala Cys Val Pro
35 40 45
Thr Asp Pro Asn Pro Gln Glu Val Glu Leu Glu Asn Val Thr Glu Asn
50 55 60
Phe Asn Met Trp Lys Asn Asp Met Val Glu Gln Met His Glu Asp Ile
65 70 75 80
Ile Ser Leu Trp Asp Gln Ser Leu Lys Pro Cys Val Lys Leu Thr Pro
85 90 95
Leu Cys Val Thr Leu Asn Cys Thr Asp Leu Arg Ser Gln Asn Val Thr
100 105 110
Tyr Thr Thr Gly Ala Asn Thr Thr Met Ala Thr Thr Thr Ser Thr Asn
115 120 125
Thr Thr Ser Ser Gly Gly Glu Met Gln Val Gly Met Lys Asn Cys Ser
130 135 140
Phe Asn Ile Thr Thr Asn Thr Gln Asp Lys Val Lys Gly Tyr Ala His
145 150 155 160
Phe Asp Asn Leu Asp Leu Val Gln Ile Glu Asp Glu Asn His Ser Asn
165 170 175
Asn Ser Tyr Arg Leu Ile His Cys Asn Thr Ser Val Ile Thr Gln Ala
180 185 190
Cys Pro Lys Val Ser Phe Glu Pro Ile Pro Ile His Tyr Arg Ala Pro
195 200 205
Ala Gly Phe Ala Ile Leu Lys Cys Lys Asp Lys Lys Phe Asn Gly Thr
210 215 220
Gly Pro Cys Thr Asn Val Ser Thr Val Gln Cys Thr His Gly Ile Arg
225 230 235 240
Pro Val Val Ser Thr Gln Leu Leu Phe Asn Gly Ser Leu Ala Glu Glu
245 250 255
Glu Val Val Ile Arg Ser Ala Asn Phe Ser Glu Asn Asp Lys Ile Ile
260 265 270
Ile Val Gln Leu Lys Asp Ala Val Gln Ile Asn Cys Thr Arg Pro Asn
275 280 285
Asn Asn Thr Arg Lys Gly Ile His Met Gly Pro Gly Lys Val Phe Tyr
290 295 300
Ala Thr Glu Val Ile Gly Asp Ile Arg Arg Ala His Cys Asn Ile Ser
305 310 315 320
Lys Glu Asn Trp Asn Asn Thr Leu Lys Gln Ile Ala Ile Gln Leu Arg
325 330 335
Glu Gln Glu Gln Phe Lys Asn Lys Thr Ile Val Phe Asn Gln Ser Ser
340 345 350
Gly Gly Asp Pro Glu Ile Val Met Ser Ser Phe Asn Cys Gly Gly Glu
355 360 365
Phe Phe Tyr Cys Asn Thr Thr Gln Leu Phe Asn Ser Thr Trp Glu Asn
370 375 380
Asp Thr Ser Thr Trp Asn Asp Thr Glu Gly Ser Asn Gly Thr Ile Thr
385 390 395 400
Leu Pro Cys Arg Ile Lys Gln Ile Ile Asn Met Trp Gln Glu Val Gly
405 410 415
Lys Ala Ile Tyr Ala Pro Pro Ile Lys Gly Pro Leu His Cys Ser Ser
420 425 430
Asn Ile Thr Gly Leu Leu Leu Thr Arg Asp Gly Gly Asn Thr Asn Glu
435 440 445
Ser Asn Thr Thr Glu Val Glu Val Phe Arg Pro Leu Gly Gly Asn Met
450 455 460
Arg Asp Asn Trp Arg Ser Glu Leu Tyr Lys Tyr Lys Val Val Lys Ile
465 470 475 480
Glu Pro Leu Gly Ile Ala Pro Thr Lys Ala Lys Arg Arg Val Val Gln
485 490 495
Arg Glu Lys Arg
500
<210> 39
<211> 1568
<212> DNA
<213> Human immunodeficiency virus type 1
<400> 39
agaaagagca gaagacagtg gcaatgagag tgaaggggat catgaagaat tgtcagcact 60
tgtggtggag atggggcatg atgctccttg ggatgttgat gatctgtagt gctacagaac 120
aattgtgggt cacagtctat tatggggtac ctgtgtggaa agaagcaacc accactctat 180
tttgtgcatc agatgctaaa gcatatgaca cagaggcaca taatgtttgg gccacacatg 240
cctgtgtacc cacagaccct aacccacaag aagtagaatt gaaaaatgtg acagaaaatc 300
ttaacatgtg gaaaaatgac atggtagaac agatgcatga ggatataatc aatttatggg 360
atcaaagcct aaagccatgt gtaaaattaa ctccactctg tgtcacttta cattgcacta 420
atttgaatgt tactaccagt aatactacaa gttggggaga gatggaggca ggagaaataa 480
aaaactgctc tttcaatgtc accacacgca gaagaaataa gaaagaatat gcactttttt 540
ataaacttga tgtagtacct atagatagtg ataatgcaag ctatacgttg ataaattgta 600
acacttcagt cattacacaa gcctgtccaa aggtatcctt tgaaccaatt cccatacatt 660
attgtgcccc ggctggtttt gcgattctaa aatgtaatga taagaaattc aatggaacag 720
gaccatgtaa aaatgttagc acagtacaat gtacacatgg aattaggcca gtagtgtcaa 780
ttcaactgct gttaaatggc agtctagcag aagaagaggt agtaattaga tctgaaaatt 840
tctcgaacaa tgctaaagcc gtaatagtac agctgaatgc atctatagaa attaattgta 900
caagacccaa caacaataca agaaaagata tacatatagg accagggaga gcattatata 960
caacaggagg aataatagga gatataagac aagcacattg tagccttagg aaagcagaat 1020
ggaatgacac tttaaaacat gtagttacaa aattaagaga acaatttggg aataaaacaa 1080
tattctttaa tcaatcctca ggaggggacc cagagattgt aatgcacagt tttaattgtg 1140
gaggggaatt tttctactgt aatacaacaa tgctgtttaa tagtaatagt acttggaatg 1200
atactacagg accagataat aacactatca tactcccatg tagaataaaa caaattataa 1260
acaggtggca ggaagtagga aaagcaatgt atgcccctcc tatcagtgga ccaattaaac 1320
gcacatcaaa tattacaggg ctactattaa caagagatgg tggtagtaac accaccgaga 1380
ccttcagacc tggaggagga gatatgaggg acaattggag aagtgaatta tataaatata 1440
aagtagtaaa aattgagcca ttaggggtag cacccaccag ggcaaggaga agagtggtgc 1500
agagagaaaa aagagcagtg ggactgggag ctgtattcct tgggttcttg ggagcagcag 1560
gaagcact 1568
<210> 40
<211> 469
<212> PRT
<213> Human immunodeficiency virus type 1
<400> 40
Ser Ala Thr Glu Gln Leu Trp Val Thr Val Tyr Tyr Gly Val Pro Val
1 5 10 15
Trp Lys Glu Ala Thr Thr Thr Leu Phe Cys Ala Ser Asp Ala Lys Ala
20 25 30
Tyr Asp Thr Glu Ala His Asn Val Trp Ala Thr His Ala Cys Val Pro
35 40 45
Thr Asp Pro Asn Pro Gln Glu Val Glu Leu Lys Asn Val Thr Glu Asn
50 55 60
Leu Asn Met Trp Lys Asn Asp Met Val Glu Gln Met His Glu Asp Ile
65 70 75 80
Ile Asn Leu Trp Asp Gln Ser Leu Lys Pro Cys Val Lys Leu Thr Pro
85 90 95
Leu Cys Val Thr Leu His Cys Thr Asn Leu Asn Val Thr Thr Ser Asn
100 105 110
Thr Thr Ser Trp Gly Glu Met Glu Ala Gly Glu Ile Lys Asn Cys Ser
115 120 125
Phe Asn Val Thr Thr Arg Arg Arg Asn Lys Lys Glu Tyr Ala Leu Phe
130 135 140
Tyr Lys Leu Asp Val Val Pro Ile Asp Ser Asp Asn Ala Ser Tyr Thr
145 150 155 160
Leu Ile Asn Cys Asn Thr Ser Val Ile Thr Gln Ala Cys Pro Lys Val
165 170 175
Ser Phe Glu Pro Ile Pro Ile His Tyr Cys Ala Pro Ala Gly Phe Ala
180 185 190
Ile Leu Lys Cys Asn Asp Lys Lys Phe Asn Gly Thr Gly Pro Cys Lys
195 200 205
Asn Val Ser Thr Val Gln Cys Thr His Gly Ile Arg Pro Val Val Ser
210 215 220
Ile Gln Leu Leu Leu Asn Gly Ser Leu Ala Glu Glu Glu Val Val Ile
225 230 235 240
Arg Ser Glu Asn Phe Ser Asn Asn Ala Lys Ala Val Ile Val Gln Leu
245 250 255
Asn Ala Ser Ile Glu Ile Asn Cys Thr Arg Pro Asn Asn Asn Thr Arg
260 265 270
Lys Asp Ile His Ile Gly Pro Gly Arg Ala Leu Tyr Thr Thr Gly Gly
275 280 285
Ile Ile Gly Asp Ile Arg Gln Ala His Cys Ser Leu Arg Lys Ala Glu
290 295 300
Trp Asn Asp Thr Leu Lys His Val Val Thr Lys Leu Arg Glu Gln Phe
305 310 315 320
Gly Asn Lys Thr Ile Phe Phe Asn Gln Ser Ser Gly Gly Asp Pro Glu
325 330 335
Ile Val Met His Ser Phe Asn Cys Gly Gly Glu Phe Phe Tyr Cys Asn
340 345 350
Thr Thr Met Leu Phe Asn Ser Asn Ser Thr Trp Asn Asp Thr Thr Gly
355 360 365
Pro Asp Asn Asn Thr Ile Ile Leu Pro Cys Arg Ile Lys Gln Ile Ile
370 375 380
Asn Arg Trp Gln Glu Val Gly Lys Ala Met Tyr Ala Pro Pro Ile Ser
385 390 395 400
Gly Pro Ile Lys Arg Thr Ser Asn Ile Thr Gly Leu Leu Leu Thr Arg
405 410 415
Asp Gly Gly Ser Asn Thr Thr Glu Thr Phe Arg Pro Gly Gly Gly Asp
420 425 430
Met Arg Asp Asn Trp Arg Ser Glu Leu Tyr Lys Tyr Lys Val Val Lys
435 440 445
Ile Glu Pro Leu Gly Val Ala Pro Thr Arg Ala Arg Arg Arg Val Val
450 455 460
Gln Arg Glu Lys Arg
465
<210> 41
<211> 1541
<212> DNA
<213> Human immunodeficiency virus type 1
<400> 41
agaaagagca gaagacagtg gcaatgagag tgaaggagat gaggaagcac tggcagcact 60
ggtggacagg gggcatcatg ctccttggga tgctgatgat ctgtagtgct gtaaacaact 120
tgtgggtcac tgtatattat ggagtgcctg tgtggagaga agcaaccacc accctatttt 180
gtgcatcaga tgctaaagca tacaaaacag aggtacataa tgtctgggcc acacatgcct 240
gtgtacccac agaccccaac ccacaagaga tagatttggt aaatgtgaca gaaaatttta 300
acatgtggaa aaataacatg gtagaacaga tgcatgagga tataatcagt ttatgggatc 360
aaagcctaaa accatgcgta aaattaaccc cactctgtgt tgtgaaaaac tgctctttca 420
acaccaccac aatggtaagg gatagggagc ggaaagaata tgctctcttt tataaacttg 480
atgtagtaca aatgaatgat aataataata atagtaccca tggaacctat agattgataa 540
attgtaatac ctcagtcatt acacaggcct gtccaaaggt atcctttgag ccaattccca 600
tacattattg cgccccggct ggttttgcga ttctaaagtg taaagacaag aagttcaatg 660
gaacgggacc atgcaaaaat gtcagcacag tacagtgtac acatggaatt aggccagtag 720
tgtcaactca actgctgcta aatggcagtc tagcagaaga agaggtaata attagatctg 780
aaaacttcac aagcaatgct aaaaccataa tagtacagct aaatgaaact gtagagatta 840
attgcacaag acctagcaac aatacaagaa gaagtataca tataggacca gggagagcat 900
tttacacaac aggagacata ataggagata taagaaaagc acattgtaac attagtagaa 960
caaaatggaa taacacttta ggacagatag ttgaaaaatt acaagaacaa tttaagaata 1020
aaacaataat ctttaattca tcctcaggag gggacccaga aattgtatat cacagtttta 1080
attgtggagg ggaatttttc tactgtaata caacagaact gttcgatagt acctggtata 1140
gcccctggaa cagtactggt gggtcaaata acactgaagg gaatagcacg atcacactca 1200
aatgcagaat aaagcaaatt gtaaacaggt ggcaggaagt aggaaaagca atgtatgccc 1260
ctcccatcca gggaaaaatt aaatgttcat caaatattac agggctacta ttaacaagag 1320
acggtggtaa tagtaatagt agtaacgaga ccttcagacc aggaggagga gacatgagag 1380
acaattggag aagtgaatta tataaatata aagtaataaa aattgaacca ttaggagtag 1440
cacccaccaa ggcaaagaga agagtggtgc aaagagaaag aagagcagtg ggaacaatag 1500
gagctatgtt ccttgggttc ttgggagcag caggaagcac t 1541
<210> 42
<211> 460
<212> PRT
<213> Human immunodeficiency virus type 1
<400> 42
Ser Ala Val Asn Asn Leu Trp Val Thr Val Tyr Tyr Gly Val Pro Val
1 5 10 15
Trp Arg Glu Ala Thr Thr Thr Leu Phe Cys Ala Ser Asp Ala Lys Ala
20 25 30
Tyr Lys Thr Glu Val His Asn Val Trp Ala Thr His Ala Cys Val Pro
35 40 45
Thr Asp Pro Asn Pro Gln Glu Ile Asp Leu Val Asn Val Thr Glu Asn
50 55 60
Phe Asn Met Trp Lys Asn Asn Met Val Glu Gln Met His Glu Asp Ile
65 70 75 80
Ile Ser Leu Trp Asp Gln Ser Leu Lys Pro Cys Val Lys Leu Thr Pro
85 90 95
Leu Cys Val Val Lys Asn Cys Ser Phe Asn Thr Thr Thr Met Val Arg
100 105 110
Asp Arg Glu Arg Lys Glu Tyr Ala Leu Phe Tyr Lys Leu Asp Val Val
115 120 125
Gln Met Asn Asp Asn Asn Asn Asn Ser Thr His Gly Thr Tyr Arg Leu
130 135 140
Ile Asn Cys Asn Thr Ser Val Ile Thr Gln Ala Cys Pro Lys Val Ser
145 150 155 160
Phe Glu Pro Ile Pro Ile His Tyr Cys Ala Pro Ala Gly Phe Ala Ile
165 170 175
Leu Lys Cys Lys Asp Lys Lys Phe Asn Gly Thr Gly Pro Cys Lys Asn
180 185 190
Val Ser Thr Val Gln Cys Thr His Gly Ile Arg Pro Val Val Ser Thr
195 200 205
Gln Leu Leu Leu Asn Gly Ser Leu Ala Glu Glu Glu Val Ile Ile Arg
210 215 220
Ser Glu Asn Phe Thr Ser Asn Ala Lys Thr Ile Ile Val Gln Leu Asn
225 230 235 240
Glu Thr Val Glu Ile Asn Cys Thr Arg Pro Ser Asn Asn Thr Arg Arg
245 250 255
Ser Ile His Ile Gly Pro Gly Arg Ala Phe Tyr Thr Thr Gly Asp Ile
260 265 270
Ile Gly Asp Ile Arg Lys Ala His Cys Asn Ile Ser Arg Thr Lys Trp
275 280 285
Asn Asn Thr Leu Gly Gln Ile Val Glu Lys Leu Gln Glu Gln Phe Lys
290 295 300
Asn Lys Thr Ile Ile Phe Asn Ser Ser Ser Gly Gly Asp Pro Glu Ile
305 310 315 320
Val Tyr His Ser Phe Asn Cys Gly Gly Glu Phe Phe Tyr Cys Asn Thr
325 330 335
Thr Glu Leu Phe Asp Ser Thr Trp Tyr Ser Pro Trp Asn Ser Thr Gly
340 345 350
Gly Ser Asn Asn Thr Glu Gly Asn Ser Thr Ile Thr Leu Lys Cys Arg
355 360 365
Ile Lys Gln Ile Val Asn Arg Trp Gln Glu Val Gly Lys Ala Met Tyr
370 375 380
Ala Pro Pro Ile Gln Gly Lys Ile Lys Cys Ser Ser Asn Ile Thr Gly
385 390 395 400
Leu Leu Leu Thr Arg Asp Gly Gly Asn Ser Asn Ser Ser Asn Glu Thr
405 410 415
Phe Arg Pro Gly Gly Gly Asp Met Arg Asp Asn Trp Arg Ser Glu Leu
420 425 430
Tyr Lys Tyr Lys Val Ile Lys Ile Glu Pro Leu Gly Val Ala Pro Thr
435 440 445
Lys Ala Lys Arg Arg Val Val Gln Arg Glu Arg Arg
450 455 460
<210> 43
<211> 1625
<212> DNA
<213> Human immunodeficiency virus type 1
<400> 43
agaaagagca gaagacagtg gcaatgagag tgaaggggat caggaagagt tatcaaaact 60
tgtggaaatg gggcacattg ctccttggga tattgatgat tagtagtgct acagaacaat 120
tgtgggtcac agtctattat ggggtacctg tgtggaaaga agcaaccacc actctatttt 180
gtgcatcaga tgctaaagcc tataatacag aggttcataa tgtttgggcc acacatgcct 240
gtgtacccac agaccccaat ccacaagaag taatgttaaa tgtgacagag aattttaaca 300
tgtggaaaaa tgacatggta gaacagatgc aggaggatat aatcagttta tgggatcaaa 360
gcctaaagcc atgtgtaaaa ttaaccccac tctgtgttac tttaagctac actgatgcaa 420
atagtactga tgttaatcat accaaaaata gtagtgaggg aatgatggag ggagaaaaaa 480
tgaaaaactg ctctttcaat atcaccacaa gcatgggaaa taagatgcag aaagaatatg 540
cactttttca tagacttgat gtaataccaa tggataatga aagtgctagt gctaactatt 600
ctaactatag gttaataagc tgtaacacct cagtcactac acaggcttgt ccaaaaatat 660
catttgagcc aattcccata cattattgta ccccggctgg ttttgcgatt ctaaaatgta 720
atgataagaa attcagtgga aaaggaggat gtagaaatgt cagtacagta caatgcacac 780
atggaattaa gccagtagta tcgactcaac tactgttaaa tggcagtcta gcagaagaag 840
atgtggtaat taaatctgcc aatttctcgg acaatgctaa aaccataata gtacagctga 900
atgaatctgt aataattaat tgtacaagac ccaacaacaa tacaagaaag ggtatacata 960
tgggaccagg gaaaacattt tatgcaacag gagccataat aggagatata agacaagcac 1020
attgcaatgt tagtagaaca gaccggaata acactttaaa aagggtagct aaaaaactac 1080
aagaacaatt taatacaaca aaagttgtct ttaaacaatc ctcaggaggg gacccagaaa 1140
ttgtaatgca cagttttaat tgtggagggg aatttttcta ctgtaataca tcagggctgt 1200
ttaatagtac ttggccttgg aatgatacta aagaggcaaa taacactaac acaatcacac 1260
tcccatgcaa aataaaacaa atcataaaca tgtggcaggc agtagggaaa gcaatgtatg 1320
cccctcccat tagtgggata attaaatgtg aatctaatat tacagggctg ctactaacaa 1380
gagatggtgg tagtaagaac acaacagata gtaatgacac aaacataaca caagaggtct 1440
tcagaccagg aggaggagat atgagggaca attggagaag tgaattatat aaatataaag 1500
tagtaagaat tgaaccatta ggagtagcac ccactaaggc aaaaagaaga gtggtgcaga 1560
gagaaaaaag agcagtggga ataggagctg tgttccttgg gttcttggga gcagcaggaa 1620
gcact 1625
<210> 44
<211> 489
<212> PRT
<213> Human immunodeficiency virus type 1
<400> 44
Ser Ala Thr Glu Gln Leu Trp Val Thr Val Tyr Tyr Gly Val Pro Val
1 5 10 15
Trp Lys Glu Ala Thr Thr Thr Leu Phe Cys Ala Ser Asp Ala Lys Ala
20 25 30
Tyr Asn Thr Glu Val His Asn Val Trp Ala Thr His Ala Cys Val Pro
35 40 45
Thr Asp Pro Asn Pro Gln Glu Val Met Leu Asn Val Thr Glu Asn Phe
50 55 60
Asn Met Trp Lys Asn Asp Met Val Glu Gln Met Gln Glu Asp Ile Ile
65 70 75 80
Ser Leu Trp Asp Gln Ser Leu Lys Pro Cys Val Lys Leu Thr Pro Leu
85 90 95
Cys Val Thr Leu Ser Tyr Thr Asp Ala Asn Ser Thr Asp Val Asn His
100 105 110
Thr Lys Asn Ser Ser Glu Gly Met Met Glu Gly Glu Lys Met Lys Asn
115 120 125
Cys Ser Phe Asn Ile Thr Thr Ser Met Gly Asn Lys Met Gln Lys Glu
130 135 140
Tyr Ala Leu Phe His Arg Leu Asp Val Ile Pro Met Asp Asn Glu Ser
145 150 155 160
Ala Ser Ala Asn Tyr Ser Asn Tyr Arg Leu Ile Ser Cys Asn Thr Ser
165 170 175
Val Thr Thr Gln Ala Cys Pro Lys Ile Ser Phe Glu Pro Ile Pro Ile
180 185 190
His Tyr Cys Thr Pro Ala Gly Phe Ala Ile Leu Lys Cys Asn Asp Lys
195 200 205
Lys Phe Ser Gly Lys Gly Gly Cys Arg Asn Val Ser Thr Val Gln Cys
210 215 220
Thr His Gly Ile Lys Pro Val Val Ser Thr Gln Leu Leu Leu Asn Gly
225 230 235 240
Ser Leu Ala Glu Glu Asp Val Val Ile Lys Ser Ala Asn Phe Ser Asp
245 250 255
Asn Ala Lys Thr Ile Ile Val Gln Leu Asn Glu Ser Val Ile Ile Asn
260 265 270
Cys Thr Arg Pro Asn Asn Asn Thr Arg Lys Gly Ile His Met Gly Pro
275 280 285
Gly Lys Thr Phe Tyr Ala Thr Gly Ala Ile Ile Gly Asp Ile Arg Gln
290 295 300
Ala His Cys Asn Val Ser Arg Thr Asp Arg Asn Asn Thr Leu Lys Arg
305 310 315 320
Val Ala Lys Lys Leu Gln Glu Gln Phe Asn Thr Thr Lys Val Val Phe
325 330 335
Lys Gln Ser Ser Gly Gly Asp Pro Glu Ile Val Met His Ser Phe Asn
340 345 350
Cys Gly Gly Glu Phe Phe Tyr Cys Asn Thr Ser Gly Leu Phe Asn Ser
355 360 365
Thr Trp Pro Trp Asn Asp Thr Lys Glu Ala Asn Asn Thr Asn Thr Ile
370 375 380
Thr Leu Pro Cys Lys Ile Lys Gln Ile Ile Asn Met Trp Gln Ala Val
385 390 395 400
Gly Lys Ala Met Tyr Ala Pro Pro Ile Ser Gly Ile Ile Lys Cys Glu
405 410 415
Ser Asn Ile Thr Gly Leu Leu Leu Thr Arg Asp Gly Gly Ser Lys Asn
420 425 430
Thr Thr Asp Ser Asn Asp Thr Asn Ile Thr Gln Glu Val Phe Arg Pro
435 440 445
Gly Gly Gly Asp Met Arg Asp Asn Trp Arg Ser Glu Leu Tyr Lys Tyr
450 455 460
Lys Val Val Arg Ile Glu Pro Leu Gly Val Ala Pro Thr Lys Ala Lys
465 470 475 480
Arg Arg Val Val Gln Arg Glu Lys Arg
485
<210> 45
<211> 1606
<212> DNA
<213> Human immunodeficiency virus type 1
<400> 45
agaaagagca gaagacagtg gcaatgagag cgaaggagat caagaggaat tgtcagcact 60
cgtggagatg gggcatcatg ctccttggga tgttaatgat ctatagtact gcagaaaaaa 120
cgtgggtcac agtatattat ggggtacctg tgtggaagga agcaaacacc actctatttt 180
gtgcatcaga tgctaaagca tatgatacag aggcacataa tgtttgggcc acacatgcct 240
gtgtacccac agaccccaac ccacaagaaa taaagttgga aaatgtgaca gagaacttta 300
acatgtggaa aaacaacatg gtagaacaga tgcatgagga tattatcagt ttatgggatc 360
aaagcctaca gccaagtgta aaattaaccc cactttgtgt tactttaaat tgctctactg 420
cgaattttac taaaaggaat tttactaata gcactgagca tgaaaagccg agtgcagaaa 480
tgagaaactg ctctttcaat atcaccacaa tcgtaagaga taaggtaaca aaagaacatg 540
cactttttta tagagatgat gtagtaccaa tagataatgc tagtaataat accagttata 600
ggttaataaa ttgtaatacc tcagtcatta cacaggcctg tccaaagata tcctttgagc 660
caattcctat acattattgt gccccggctg gttttgcgat tctaaagtgt aatgataaga 720
catttaatgg aacagggcta tgtaaaaatg tcagcacagt acaatgtaca catggaatta 780
gaccagtagt gtcaactcaa ctgttgttaa atggcagtct agcagaaaaa aaggtagtag 840
ttagatctga ggagttttca gacaatgcta aatccataat agtacagctg aatacatctg 900
tagtaattaa ttgtacaaga cccggcaata atacaagaag aagcatacct atgggaccag 960
gaagagtatt ttatgcaaca gatataatag gagacataag acaggcacat tgtaacctta 1020
gtagagcagc atggaataac actttaaagc tgatagccgc agaattaaaa gaaatatata 1080
ataaaacaat agcctttaat cgatcctcag gaggggaccc agaaattgta atgcacactt 1140
ttaattgtgg gggggaattt ttctactgta atacaacaca actgtttaat agtgcttgga 1200
acagtactaa tatttatact gggaataatt ctactaatga caccatctct aatgacgcca 1260
tctcactccc atgtagaata aaacaaattg taaacatgtg gcaggaagta ggaaaagcaa 1320
tgtatgcccc tcccatcaga ggaaacatca gctgttcctc aaatattaca gggctgatat 1380
tgacaagaga tggcgggaat agtagtagta ataccgagat cttcagacct cagggaggga 1440
atataaagga caattggaga agtgaattat ataaatataa agtagtaaga attgaaccat 1500
taggattagc acccaccaag gcaaagagga gagtggtgcg gagagaaaaa gagcggtaac 1560
gttcggagct ttgttccttg ggttcttggg agcagcagga agcact 1606
<210> 46
<211> 483
<212> PRT
<213> Human immunodeficiency virus type 1
<400> 46
Ser Thr Ala Glu Lys Thr Trp Val Thr Val Tyr Tyr Gly Val Pro Val
1 5 10 15
Trp Lys Glu Ala Asn Thr Thr Leu Phe Cys Ala Ser Asp Ala Lys Ala
20 25 30
Tyr Asp Thr Glu Ala His Asn Val Trp Ala Thr His Ala Cys Val Pro
35 40 45
Thr Asp Pro Asn Pro Gln Glu Ile Lys Leu Glu Asn Val Thr Glu Asn
50 55 60
Phe Asn Met Trp Lys Asn Asn Met Val Glu Gln Met His Glu Asp Ile
65 70 75 80
Ile Ser Leu Trp Asp Gln Ser Leu Gln Pro Ser Val Lys Leu Thr Pro
85 90 95
Leu Cys Val Thr Leu Asn Cys Ser Thr Ala Asn Phe Thr Lys Arg Asn
100 105 110
Phe Thr Asn Ser Thr Glu His Glu Lys Pro Ser Ala Glu Met Arg Asn
115 120 125
Cys Ser Phe Asn Ile Thr Thr Ile Val Arg Asp Lys Val Thr Lys Glu
130 135 140
His Ala Leu Phe Tyr Arg Asp Asp Val Val Pro Ile Asp Asn Ala Ser
145 150 155 160
Asn Asn Thr Ser Tyr Arg Leu Ile Asn Cys Asn Thr Ser Val Ile Thr
165 170 175
Gln Ala Cys Pro Lys Ile Ser Phe Glu Pro Ile Pro Ile His Tyr Cys
180 185 190
Ala Pro Ala Gly Phe Ala Ile Leu Lys Cys Asn Asp Lys Thr Phe Asn
195 200 205
Gly Thr Gly Leu Cys Lys Asn Val Ser Thr Val Gln Cys Thr His Gly
210 215 220
Ile Arg Pro Val Val Ser Thr Gln Leu Leu Leu Asn Gly Ser Leu Ala
225 230 235 240
Glu Lys Lys Val Val Val Arg Ser Glu Glu Phe Ser Asp Asn Ala Lys
245 250 255
Ser Ile Ile Val Gln Leu Asn Thr Ser Val Val Ile Asn Cys Thr Arg
260 265 270
Pro Gly Asn Asn Thr Arg Arg Ser Ile Pro Met Gly Pro Gly Arg Val
275 280 285
Phe Tyr Ala Thr Asp Ile Ile Gly Asp Ile Arg Gln Ala His Cys Asn
290 295 300
Leu Ser Arg Ala Ala Trp Asn Asn Thr Leu Lys Leu Ile Ala Ala Glu
305 310 315 320
Leu Lys Glu Ile Tyr Asn Lys Thr Ile Ala Phe Asn Arg Ser Ser Gly
325 330 335
Gly Asp Pro Glu Ile Val Met His Thr Phe Asn Cys Gly Gly Glu Phe
340 345 350
Phe Tyr Cys Asn Thr Thr Gln Leu Phe Asn Ser Ala Trp Asn Ser Thr
355 360 365
Asn Ile Tyr Thr Gly Asn Asn Ser Thr Asn Asp Thr Ile Ser Asn Asp
370 375 380
Ala Ile Ser Leu Pro Cys Arg Ile Lys Gln Ile Val Asn Met Trp Gln
385 390 395 400
Glu Val Gly Lys Ala Met Tyr Ala Pro Pro Ile Arg Gly Asn Ile Ser
405 410 415
Cys Ser Ser Asn Ile Thr Gly Leu Ile Leu Thr Arg Asp Gly Gly Asn
420 425 430
Ser Ser Ser Asn Thr Glu Ile Phe Arg Pro Gln Gly Gly Asn Ile Lys
435 440 445
Asp Asn Trp Arg Ser Glu Leu Tyr Lys Tyr Lys Val Val Arg Ile Glu
450 455 460
Pro Leu Gly Leu Ala Pro Thr Lys Ala Lys Arg Arg Val Val Arg Arg
465 470 475 480
Glu Lys Glu
<210> 47
<211> 1613
<212> DNA
<213> Human immunodeficiency virus type 1
<400> 47
agaaagagca gaagacagtg gcaatgaaag tgaaggggat caggaagaat tatcagtgct 60
tgtggggatg gggcaccatg ctcctcggga tattgatgat ctgtagtgct gcagaaaatt 120
tgtgggtcac agtctactat ggggtacctg tgtggaaaga agcaaccacc actctatttc 180
gtgcatcaga tgctaaagca tatgatacag aggtacataa tgtttgggcc acacatgcct 240
gtgtacccac agaccccaac ccacaagaag tagtattgga aaatgttaca gaaaatttta 300
atatgtggaa aaatgacatg gtagaacaga tgcaggagga tatagtcagt ttatgggatc 360
aaagcctaaa gccatgtgta gaatcaactc cactctgtgt tactctaaat tgtactgatg 420
tgaagaagaa tgctaataat accactggta ataccactga tggtaacgtg gaaaggttgg 480
agaaagaaga aataaaaaac tgctctttca atatcaccac aagcataaga gataagatgc 540
ggaaagaata tgcacttttt tatagccttg atgtagtacc aatagataag gataatacaa 600
gctataggtt gataagttgt aacacctcag tcattacaca ggcctgtcca aaagtatcct 660
ttgagccaat tcccatacat tattgtgccc cggctggttt tgcgcttcta aaatgtaatg 720
ataaggagtt caatggaaca ggaccatgta ggaatgtcag cacagtccaa tgtacacatg 780
gaattaggcc agtagtatca actcaactgc tgttaaatgg cagtctagca gaagaaaaga 840
tagtaattag atctgagaat ttcacgagca atgctaaaac tataatagta cagctgaata 900
aatctataga aattaattgt ataagaccca acaacaacac aagaagaagt atacatatag 960
gaccaggggg agcattttat gcaacagaaa taataggaga tataagacaa gcacattgta 1020
ccctcaatag aacagaatgg aataacactc taggacagat agttaaaaaa ttaagagaac 1080
aatatgggaa taaaacaata aaatttacgc agccctcctc aggaggggac ccagaagttg 1140
taatgcacag ttttaattgt ggaggggaat ttttctactg taattcatca cagctgttta 1200
atagtacttg ggatgttact gaagggtcaa ataacactga aggaagcaac gatacaggaa 1260
tcatactccc gtgcagaata aaacaaataa taaacatgtg gcagaaagta ggaaaagcaa 1320
tgtatgcccc tcccatcaga ggacaaatta actgtacatc aaatattaca gggctactat 1380
taataagaga tggtggtaac aacgggaccg acaacgggac cgagaccttc agacctggag 1440
gaggagatat gagggacaat tggagaagtg aattatataa atataaagta gtaaaaattg 1500
aaccattagg agtagcaccc accaaggcaa agagaagagt ggtgcagaga gaaaaaagag 1560
cagtgggaat aggagctttg ttccttgggt tcttgggagc agcaggaagc act 1613
<210> 48
<211> 485
<212> PRT
<213> Human immunodeficiency virus type 1
<400> 48
Ser Ala Ala Glu Asn Leu Trp Val Thr Val Tyr Tyr Gly Val Pro Val
1 5 10 15
Trp Lys Glu Ala Thr Thr Thr Leu Phe Arg Ala Ser Asp Ala Lys Ala
20 25 30
Tyr Asp Thr Glu Val His Asn Val Trp Ala Thr His Ala Cys Val Pro
35 40 45
Thr Asp Pro Asn Pro Gln Glu Val Val Leu Glu Asn Val Thr Glu Asn
50 55 60
Phe Asn Met Trp Lys Asn Asp Met Val Glu Gln Met Gln Glu Asp Ile
65 70 75 80
Val Ser Leu Trp Asp Gln Ser Leu Lys Pro Cys Val Glu Ser Thr Pro
85 90 95
Leu Cys Val Thr Leu Asn Cys Thr Asp Val Lys Lys Asn Ala Asn Asn
100 105 110
Thr Thr Gly Asn Thr Thr Asp Gly Asn Val Glu Arg Leu Glu Lys Glu
115 120 125
Glu Ile Lys Asn Cys Ser Phe Asn Ile Thr Thr Ser Ile Arg Asp Lys
130 135 140
Met Arg Lys Glu Tyr Ala Leu Phe Tyr Ser Leu Asp Val Val Pro Ile
145 150 155 160
Asp Lys Asp Asn Thr Ser Tyr Arg Leu Ile Ser Cys Asn Thr Ser Val
165 170 175
Ile Thr Gln Ala Cys Pro Lys Val Ser Phe Glu Pro Ile Pro Ile His
180 185 190
Tyr Cys Ala Pro Ala Gly Phe Ala Leu Leu Lys Cys Asn Asp Lys Glu
195 200 205
Phe Asn Gly Thr Gly Pro Cys Arg Asn Val Ser Thr Val Gln Cys Thr
210 215 220
His Gly Ile Arg Pro Val Val Ser Thr Gln Leu Leu Leu Asn Gly Ser
225 230 235 240
Leu Ala Glu Glu Lys Ile Val Ile Arg Ser Glu Asn Phe Thr Ser Asn
245 250 255
Ala Lys Thr Ile Ile Val Gln Leu Asn Lys Ser Ile Glu Ile Asn Cys
260 265 270
Ile Arg Pro Asn Asn Asn Thr Arg Arg Ser Ile His Ile Gly Pro Gly
275 280 285
Gly Ala Phe Tyr Ala Thr Glu Ile Gly Asp Ile Arg Gln Ala His
290 295 300
Cys Thr Leu Asn Arg Thr Glu Trp Asn Asn Thr Leu Gly Gln Ile Val
305 310 315 320
Lys Lys Leu Arg Glu Gln Tyr Gly Asn Lys Thr Ile Lys Phe Thr Gln
325 330 335
Pro Ser Ser Gly Gly Asp Pro Glu Val Val Met His Ser Phe Asn Cys
340 345 350
Gly Gly Glu Phe Phe Tyr Cys Asn Ser Ser Gln Leu Phe Asn Ser Thr
355 360 365
Trp Asp Val Thr Glu Gly Ser Asn Asn Thr Glu Gly Ser Asn Asp Thr
370 375 380
Gly Ile Ile Leu Pro Cys Arg Ile Lys Gln Ile Ile Asn Met Trp Gln
385 390 395 400
Lys Val Gly Lys Ala Met Tyr Ala Pro Pro Ile Arg Gly Gln Ile Asn
405 410 415
Cys Thr Ser Asn Ile Thr Gly Leu Leu Leu Ile Arg Asp Gly Gly Asn
420 425 430
Asn Gly Thr Asp Asn Gly Thr Glu Thr Phe Arg Pro Gly Gly Gly Asp
435 440 445
Met Arg Asp Asn Trp Arg Ser Glu Leu Tyr Lys Tyr Lys Val Val Lys
450 455 460
Ile Glu Pro Leu Gly Val Ala Pro Thr Lys Ala Lys Arg Arg Val Val
465 470 475 480
Gln Arg Glu Lys Arg
485
<210> 49
<211> 1616
<212> DNA
<213> Human immunodeficiency virus type 1
<400> 49
agaaagagca gaagacagtg gcaatgagag tgaaggggat caggaggaat tatcagtgct 60
tgtggacatg gggcacgatg ctccttggga tgttgatgat ctgtagtgct gtagaacaat 120
tgtgggtcac agtctattat ggggtgcctg tgtgggaaga agcgaccacc actctatttt 180
gtgcatcaga tgctaaaaca tatgatccag aggtacataa tgtttgggcc acacatgcct 240
gtgtacccac agaccccaac ccacaagaag tagtattggg aaatgtgaca gaaaacttta 300
acatgtggaa aaatgacatg gtaaaccaaa tgcatgagga tataatcagt ttatgggatc 360
aaagtttaaa gccatgtgca aaattaaccc cactctgtgt tactctaaat tgcactgata 420
agttgaatat taatactacc agtaccaata gtagtaccaa taatactact agtagtggag 480
tggatgaagg gggaatgaaa aactgctctt tcaatgtcac cacaagcata agagataggg 540
tgcagaaaga atatgcactt ttttataaac ctgatgtagt accaatagat gataatacta 600
ataatactag ctataggttg ataaattgta acacctcagt cattacacaa gcctgtccaa 660
aggtaacctt tgatccaatt cccatacatt attgtgcccc ggctggtttt gcgattctaa 720
aatgtaacaa taagacgttc aatggatcag gaccatgtac aaatgtcagc acagtacaat 780
gtacacatgg aattaaacca gtggtgtcga ctcaattgct gttaaatggc agtctagcag 840
aggaagaaat agtaatcagg tctgaagatt tcacggacaa tgatagaacc ataatagtac 900
agctgaatga atctgtagta attcattgta caagacccaa caacaataca agaaaaagta 960
tacacctagg accagggagt gcattttatg caacaggaga tataatagga gatataaaac 1020
aagcacattg taacattagt agagcaaatt ggactaacac cttaaaacag atagctggaa 1080
aattaaaaga acagtttgga aataagacaa tattctttaa tcaatcctca ggaggggatc 1140
cagaaattgt aacacacagt ttcaattgta gaggggaatt tttctactgt aatacatcac 1200
aattgtttaa cagtacttgg cttcataata atagtactgg gaatgatact gaaaagaatg 1260
gtaatatcac actcccacgc agaataaaac aaattataaa catgtggcag caagtaggaa 1320
aagcaatgta tgccccccct gtcaaaggac taattacatg ttcatcaaat attacaggac 1380
tgctattagt aagagatggt ggtaacaaca ccaacgccac cgacaccgag accttcagac 1440
ctggaggagg agatatgagg gacaattgga gaagtgaatt atataaatat aaagtagtaa 1500
aaatcaaacc attaggaata gcacccacca aggcaaaaag aagagtggtg cagagagaaa 1560
aaagagcagc actaggagct atgttccttg ggttcttggg agcagcagga agcact 1616
<210> 50
<211> 487
<212> PRT
<213> Human immunodeficiency virus type 1
<400> 50
Ser Ala Val Glu Gln Leu Trp Val Thr Val Tyr Tyr Gly Val Pro Val
1 5 10 15
Trp Glu Glu Ala Thr Thr Thr Leu Phe Cys Ala Ser Asp Ala Lys Thr
20 25 30
Tyr Asp Pro Glu Val His Asn Val Trp Ala Thr His Ala Cys Val Pro
35 40 45
Thr Asp Pro Asn Pro Gln Glu Val Val Leu Gly Asn Val Thr Glu Asn
50 55 60
Phe Asn Met Trp Lys Asn Asp Met Val Asn Gln Met His Glu Asp Ile
65 70 75 80
Ile Ser Leu Trp Asp Gln Ser Leu Lys Pro Cys Ala Lys Leu Thr Pro
85 90 95
Leu Cys Val Thr Leu Asn Cys Thr Asp Lys Leu Asn Ile Asn Thr Thr
100 105 110
Ser Thr Asn Ser Ser Thr Asn Asn Thr Thr Ser Ser Gly Val Asp Glu
115 120 125
Gly Gly Met Lys Asn Cys Ser Phe Asn Val Thr Thr Ser Ile Arg Asp
130 135 140
Arg Val Gln Lys Glu Tyr Ala Leu Phe Tyr Lys Pro Asp Val Val Pro
145 150 155 160
Ile Asp Asp Asn Thr Asn Asn Thr Ser Tyr Arg Leu Ile Asn Cys Asn
165 170 175
Thr Ser Val Ile Thr Gln Ala Cys Pro Lys Val Thr Phe Asp Pro Ile
180 185 190
Pro Ile His Tyr Cys Ala Pro Ala Gly Phe Ala Ile Leu Lys Cys Asn
195 200 205
Asn Lys Thr Phe Asn Gly Ser Gly Pro Cys Thr Asn Val Ser Thr Val
210 215 220
Gln Cys Thr His Gly Ile Lys Pro Val Val Ser Thr Gln Leu Leu Leu
225 230 235 240
Asn Gly Ser Leu Ala Glu Glu Glu Ile Val Ile Arg Ser Glu Asp Phe
245 250 255
Thr Asp Asn Asp Arg Thr Ile Ile Val Gln Leu Asn Glu Ser Val Val
260 265 270
Ile His Cys Thr Arg Pro Asn Asn Asn Thr Arg Lys Ser Ile His Leu
275 280 285
Gly Pro Gly Ser Ala Phe Tyr Ala Thr Gly Asp Ile Ile Gly Asp Ile
290 295 300
Lys Gln Ala His Cys Asn Ile Ser Arg Ala Asn Trp Thr Asn Thr Leu
305 310 315 320
Lys Gln Ile Ala Gly Lys Leu Lys Glu Gln Phe Gly Asn Lys Thr Ile
325 330 335
Phe Phe Asn Gln Ser Ser Gly Gly Asp Pro Glu Ile Val Thr His Ser
340 345 350
Phe Asn Cys Arg Gly Glu Phe Phe Tyr Cys Asn Thr Ser Gln Leu Phe
355 360 365
Asn Ser Thr Trp Leu His Asn Asn Ser Thr Gly Asn Asp Thr Glu Lys
370 375 380
Asn Gly Asn Ile Thr Leu Pro Arg Arg Ile Lys Gln Ile Ile Asn Met
385 390 395 400
Trp Gln Gln Val Gly Lys Ala Met Tyr Ala Pro Pro Val Lys Gly Leu
405 410 415
Ile Thr Cys Ser Ser Asn Ile Thr Gly Leu Leu Leu Val Arg Asp Gly
420 425 430
Gly Asn Asn Thr Asn Ala Thr Asp Thr Glu Thr Phe Arg Pro Gly Gly
435 440 445
Gly Asp Met Arg Asp Asn Trp Arg Ser Glu Leu Tyr Lys Tyr Lys Val
450 455 460
Val Lys Ile Lys Pro Leu Gly Ile Ala Pro Thr Lys Ala Lys Arg Arg
465 470 475 480
Val Val Gln Arg Glu Lys Arg
485
<210> 51
<211> 1613
<212> DNA
<213> Human immunodeficiency virus type 1
<400> 51
agaaagagca gaagacagtg gcaatgagag tgagggggac caggaagaat tatcagcact 60
tgtggagatg gggcatcatg ctccttggga tgtcaatgat ctgtaatgct acaaaagatt 120
tgtgggtcac agtttattat ggggtacctg tatggaaaga agcaaacacc agtctatttt 180
gtgcatcaga tgctaaagca tatgatacag aggtacataa tgtttgggcc acacatgcct 240
gtgtacccac agaccccaac ccacaagaag tattcatgaa caatgtgaca gaaaatttta 300
acatgtggaa aaataacatg gtagaacaaa tgcatgagga tataatcagt ttatgggatc 360
aaagcctaaa accatgtgta aaattaaccc cactcggtgt tactctagat tgcactaagg 420
ctaatattac caataatagt accactaata gtagcggggg aatagaggag ggaagagaca 480
tagaaaattg ctctttcaat atcaccacaa acataagaga taagataaag aaagaatatg 540
cactttttta tagccttgat gtgatagcaa tagatgatag tagtaatagt agtaatagaa 600
gctataggtt gagaggttgt aacacctcaa ccatcacaca ggcctgtcca aaggtaacct 660
ttgagccaat tccaatacat tattgtgccc cagctggttt tgcgattcta aagtgtaacg 720
ataagaagtt caatggaaca ggaccatgta aaaatgtcag tacagtacaa tgtacacatg 780
gaattaggcc agtagtatca actcaactgc tgctaaatgg cagtatagca gaaaaagagg 840
tagtaattag gtccgctaat ttcacggaca atgctaaaac cataatagta cagctgaata 900
actctgtaca aattaattgc acaagacccg gcaacaatac aagaaaaagt atacatatag 960
gaccaggcag agcattttat gcacatgaaa taatagggga gataagacaa gcacattgta 1020
cccttaacag aacacaatgg aataacactt taaaacagat agttataaaa ttaagagaac 1080
aatttaacaa taagacaata gtctttaatc actcctcagg aggggaccca gaaattgtaa 1140
cacacagttt taattgtgga ggggaatttt tctactgtaa tacatcacaa ttatttaata 1200
gtacctggag gagtaatgaa actgtaaatg acactatggg aaaggacaca aatgacacaa 1260
caatcacact cccatgcaga ataaaacaaa ttataaacat gtggcaggaa gtaggaaaag 1320
caatgtatgc cccgcccatc agaggacaaa ttagctgttc atcaaatatt acagggctgc 1380
tattaacaag agatggtggt gtgaacgaga ccaacgccac cgaggtcttc agacctggag 1440
gaggagatat gagggacaat tggagaagtg aattatataa atataaagta gtagaaattg 1500
aaccattagg aatagcaccc accaaggcaa agagaagagt ggtgcagaga gagaaaagag 1560
cagtgggaat aggagctgtg ttccttgggt tcttgggagc agcaggaagc act 1613
<210> 52
<211> 485
<212> PRT
<213> Human immunodeficiency virus type 1
<400> 52
Asn Ala Thr Lys Asp Leu Trp Val Thr Val Tyr Tyr Gly Val Pro Val
1 5 10 15
Trp Lys Glu Ala Asn Thr Ser Leu Phe Cys Ala Ser Asp Ala Lys Ala
20 25 30
Tyr Asp Thr Glu Val His Asn Val Trp Ala Thr His Ala Cys Val Pro
35 40 45
Thr Asp Pro Asn Pro Gln Glu Val Phe Met Asn Asn Val Thr Glu Asn
50 55 60
Phe Asn Met Trp Lys Asn Asn Met Val Glu Gln Met His Glu Asp Ile
65 70 75 80
Ile Ser Leu Trp Asp Gln Ser Leu Lys Pro Cys Val Lys Leu Thr Pro
85 90 95
Leu Gly Val Thr Leu Asp Cys Thr Lys Ala Asn Ile Thr Asn Asn Ser
100 105 110
Thr Thr Asn Ser Ser Gly Gly Ile Glu Glu Gly Arg Asp Ile Glu Asn
115 120 125
Cys Ser Phe Asn Ile Thr Thr Asn Ile Arg Asp Lys Ile Lys Lys Glu
130 135 140
Tyr Ala Leu Phe Tyr Ser Leu Asp Val Ile Ala Ile Asp Asp Ser Ser
145 150 155 160
Asn Ser Ser Asn Arg Ser Tyr Arg Leu Arg Gly Cys Asn Thr Ser Thr
165 170 175
Ile Thr Gln Ala Cys Pro Lys Val Thr Phe Glu Pro Ile Pro Ile His
180 185 190
Tyr Cys Ala Pro Ala Gly Phe Ala Ile Leu Lys Cys Asn Asp Lys Lys
195 200 205
Phe Asn Gly Thr Gly Pro Cys Lys Asn Val Ser Thr Val Gln Cys Thr
210 215 220
His Gly Ile Arg Pro Val Val Ser Thr Gln Leu Leu Leu Asn Gly Ser
225 230 235 240
Ile Ala Glu Lys Glu Val Val Ile Arg Ser Ala Asn Phe Thr Asp Asn
245 250 255
Ala Lys Thr Ile Ile Val Gln Leu Asn Asn Ser Val Gln Ile Asn Cys
260 265 270
Thr Arg Pro Gly Asn Asn Thr Arg Lys Ser Ile His Ile Gly Pro Gly
275 280 285
Arg Ala Phe Tyr Ala His Glu Ile Ile Gly Glu Ile Arg Gln Ala His
290 295 300
Cys Thr Leu Asn Arg Thr Gln Trp Asn Asn Thr Leu Lys Gln Ile Val
305 310 315 320
Ile Lys Leu Arg Glu Gln Phe Asn Asn Lys Thr Ile Val Phe Asn His
325 330 335
Ser Ser Gly Gly Asp Pro Glu Ile Val Thr His Ser Phe Asn Cys Gly
340 345 350
Gly Glu Phe Phe Tyr Cys Asn Thr Ser Gln Leu Phe Asn Ser Thr Trp
355 360 365
Arg Ser Asn Glu Thr Val Asn Asp Thr Met Gly Lys Asp Thr Asn Asp
370 375 380
Thr Thr Ile Thr Leu Pro Cys Arg Ile Lys Gln Ile Ile Asn Met Trp
385 390 395 400
Gln Glu Val Gly Lys Ala Met Tyr Ala Pro Pro Ile Arg Gly Gln Ile
405 410 415
Ser Cys Ser Ser Asn Ile Thr Gly Leu Leu Leu Thr Arg Asp Gly Gly
420 425 430
Val Asn Glu Thr Asn Ala Thr Glu Val Phe Arg Pro Gly Gly Gly Asp
435 440 445
Met Arg Asp Asn Trp Arg Ser Glu Leu Tyr Lys Tyr Lys Val Val Glu
450 455 460
Ile Glu Pro Leu Gly Ile Ala Pro Thr Lys Ala Lys Arg Arg Val Val
465 470 475 480
Gln Arg Glu Lys Arg
485
<210> 53
<211> 1658
<212> DNA
<213> Human immunodeficiency virus type 1
<400> 53
agaaagagca gaagacagtg gcaatgagag cgaaggggat caggaagaat tggcagcact 60
tgtggagatg gggcaccatg ctcccatggg gcaccatgct ccttgggatg ttaatgatga 120
tctgtagtgc tgcagaagaa aaatgggtca cagtctatta tggggtacct gtgtggaaag 180
aagcaaccac cactctattt tgtgcatcag atgctaaagc atatgacaca gaggtacata 240
atgtttgggc cacacatgcc tgtgtaccca cagaccctaa cccacaagaa gtagtattgg 300
gaaatgtgac agaaaatttt aatgtgtgga aaaatgacat ggtagaacag atgcatgaag 360
atataatcag cttatgggat caaagcctaa agccatgtgt aaaattaacc ccactctgtg 420
ttgctttaaa ttgcactaat gtgaatgata ctaggacaaa caatagtagt agtagtgata 480
aaaatgatgc taaaaccaat agtagtagta gttgggaaag gatggaagga gaagtaaaaa 540
actgctcttt caatgttacc acaagaataa gaaacaaggt gcagaaagaa tatgcacttt 600
tttataagct tgatgtagtg ccaatagaga aggataatgc aagctataca ttgataaatt 660
gtaacacctc agtcattaca caggcctgtc caaaggtatc ttttgaacca attcccatac 720
attattgtgc cccggctggt tttgcgattc taaagcgtaa tgataagaag ttcaatggaa 780
aaggcccatg tacaaatgtc agcacagtac gatgtacaca tggaattagg ccagtagtgt 840
caactcaact actgttaaat ggcagtctag cagaagaagg ggtagtaatt agatctgaaa 900
atctcacgaa caatgttaaa accataatag tacagctgaa caaatctgta aaaattaatt 960
gtacaagacc caacaacaat acaagaaaaa gtataaatat aggaccaggg agagcatttt 1020
atgcaacagg agcaataata ggaaatatga gacaagcaca ttgtaacctt aatggaacag 1080
aatggaagaa cactttaaga caggtagtta taagcttaag agagaaattt gggaagaaga 1140
caatagtctt caaccaatcc tcaggagggg atttagaaat tataatgcac aattttaatt 1200
gtggagggga atttttctac tgtgatacaa cacagctgtt taatagtact tggctgccta 1260
atgagactac agagtcaaat aacattactg gaggacctaa tgacacactc acgctcccat 1320
gtagaataaa acaaattata aacagatggc aggaagtagg aaaagcaatg tatgcccctc 1380
ccatcagtgg gcaaattaga tgctcatcaa atattacggg gctgctatta acaagagatg 1440
gtggtgagga gcagaatgac actgaggtct ttagacctgg aggaggagat atgagggaca 1500
attggagaag tgaattatat aaatataaag tagtaagaat tgagccatca ggagtagcac 1560
ccaccaaggc aaagagaaga gtggtgcaaa gagacaaaag agcagtggga gcactaggag 1620
ctatgttcct tgggttcttg ggagcagcag gaagcact 1658
<210> 54
<211> 490
<212> PRT
<213> Human immunodeficiency virus type 1
<400> 54
Ser Ala Ala Glu Glu Lys Trp Val Thr Val Tyr Tyr Gly Val Pro Val
1 5 10 15
Trp Lys Glu Ala Thr Thr Thr Leu Phe Cys Ala Ser Asp Ala Lys Ala
20 25 30
Tyr Asp Thr Glu Val His Asn Val Trp Ala Thr His Ala Cys Val Pro
35 40 45
Thr Asp Pro Asn Pro Gln Glu Val Val Leu Gly Asn Val Thr Glu Asn
50 55 60
Phe Asn Val Trp Lys Asn Asp Met Val Glu Gln Met His Glu Asp Ile
65 70 75 80
Ile Ser Leu Trp Asp Gln Ser Leu Lys Pro Cys Val Lys Leu Thr Pro
85 90 95
Leu Cys Val Ala Leu Asn Cys Thr Asn Val Asn Asp Thr Arg Thr Asn
100 105 110
Asn Ser Ser Ser Ser Asp Lys Asn Asp Ala Lys Thr Asn Ser Ser Ser
115 120 125
Ser Trp Glu Arg Met Glu Gly Glu Val Lys Asn Cys Ser Phe Asn Val
130 135 140
Thr Thr Arg Ile Arg Asn Lys Val Gln Lys Glu Tyr Ala Leu Phe Tyr
145 150 155 160
Lys Leu Asp Val Val Pro Ile Glu Lys Asp Asn Ala Ser Tyr Thr Leu
165 170 175
Ile Asn Cys Asn Thr Ser Val Ile Thr Gln Ala Cys Pro Lys Val Ser
180 185 190
Phe Glu Pro Ile Pro Ile His Tyr Cys Ala Pro Ala Gly Phe Ala Ile
195 200 205
Leu Lys Arg Asn Asp Lys Lys Phe Asn Gly Lys Gly Pro Cys Thr Asn
210 215 220
Val Ser Thr Val Arg Cys Thr His Gly Ile Arg Pro Val Val Ser Thr
225 230 235 240
Gln Leu Leu Leu Asn Gly Ser Leu Ala Glu Glu Gly Val Val Ile Arg
245 250 255
Ser Glu Asn Leu Thr Asn Asn Val Lys Thr Ile Ile Val Gln Leu Asn
260 265 270
Lys Ser Val Lys Ile Asn Cys Thr Arg Pro Asn Asn Asn Thr Arg Lys
275 280 285
Ser Ile Asn Ile Gly Pro Gly Arg Ala Phe Tyr Ala Thr Gly Ala Ile
290 295 300
Ile Gly Asn Met Arg Gln Ala His Cys Asn Leu Asn Gly Thr Glu Trp
305 310 315 320
Lys Asn Thr Leu Arg Gln Val Val Ile Ser Leu Arg Glu Lys Phe Gly
325 330 335
Lys Lys Thr Ile Val Phe Asn Gln Ser Ser Gly Gly Asp Leu Glu Ile
340 345 350
Ile Met His Asn Phe Asn Cys Gly Gly Glu Phe Phe Tyr Cys Asp Thr
355 360 365
Thr Gln Leu Phe Asn Ser Thr Trp Leu Pro Asn Glu Thr Thr Glu Ser
370 375 380
Asn Asn Ile Thr Gly Gly Pro Asn Asp Thr Leu Thr Leu Pro Cys Arg
385 390 395 400
Ile Lys Gln Ile Ile Asn Arg Trp Gln Glu Val Gly Lys Ala Met Tyr
405 410 415
Ala Pro Pro Ile Ser Gly Gln Ile Arg Cys Ser Ser Asn Ile Thr Gly
420 425 430
Leu Leu Leu Thr Arg Asp Gly Gly Glu Glu Gln Asn Asp Thr Glu Val
435 440 445
Phe Arg Pro Gly Gly Gly Asp Met Arg Asp Asn Trp Arg Ser Glu Leu
450 455 460
Tyr Lys Tyr Lys Val Val Arg Ile Glu Pro Ser Gly Val Ala Pro Thr
465 470 475 480
Lys Ala Lys Arg Arg Val Val Gln Arg Asp
485 490
<210> 55
<211> 1658
<212> DNA
<213> Human immunodeficiency virus type 1
<400> 55
agaaagagca gaagacagtg gcaatgaaag tgaaggggac caggaagaat tatcagcact 60
tatggagatg gggcaccatg ctcctatgga gatggggcac catgctcctt gggctattaa 120
tgatctgtaa tgctatagaa gaatcgtggg tcacagtcca ttatggagta cctgtgtgga 180
aagaagcaaa caccactctg ttttgtgcat cagatgctaa agcatatgat acagaggtac 240
ataatgtttg ggccacacat gcctgtgtac ccacaaaccc caacccacaa gaagtagact 300
tgggaaatgt gacagaaaat tttaatgcat ggaaaaatga catggtagaa caaatgcatg 360
aggatataat tagtttatgg gatcaaagcc taaagccatg tgtaaaatta actccactct 420
gtgttactct acagtgcact gatttgagga atgatactaa taccactagt agtcctaata 480
ccactagtgg taactggatg gataaaaggg aaatgaaaaa ctgctctttc aatatcacca 540
caagcataag agataagctg cagaaaacat ttgcactttt ttataaactt gatatagtac 600
caataaatga ggacaaaaac agtagtaata ttgataatac cagttatagg ttgataagtt 660
gtaatacctc agtcattaca caggcctgtc caaaggtatc ctttgagcca attcccatac 720
attattgtgc cccggctggt tttgcgattc taaaatgtaa agatgaggag ttcaatggaa 780
caggaccatg taaaaatgtc agtacagtac aatatacaca tggaattagg ccagtagtat 840
caactcaact gctgttaaat ggcagtctag cagaacaagg ggtagtactt agatctaaaa 900
atatctcaga caatactaaa accataatag tacagctaaa agaagctgta acaattaagt 960
gtacaagacc caacaacaat acaagaaaaa gtatacatat aggacctggg agagcatttt 1020
atgcaacagg agacataata ggagatataa gacaagccca ttgtaacatc agtgcaacaa 1080
agtggaatga cacgttacgt cagatagttg aaaaattaca aggatcattt aagaataaaa 1140
caataagctt caagcgatcc tcaggagggg atccagagat tgtaatgcac agttttaatt 1200
gtggagggga atttttctat tgtaattcaa caaaactgtt taatagtact tggtatagta 1260
atgggactag tacttttgat aatactactg aacgaacaaa tgacactatc atactcccat 1320
gcagaataaa acaaattata aacatgtggc aggaagtagg aaaagcaatg tatgcccctc 1380
ccatcccagg actaattaac tgttcatcaa atattacagg actgctatta ataagagatg 1440
gtggtaataa ctatactgac aatactgaga tcttcagacc tggaggagga gacatgaggg 1500
acaattggag aagtgaatta tataaatata aagtagtaaa agttgaacca ttaggtatag 1560
cacccaccaa ggcaaagaga agagtggtac agagagaaaa aagagcagtg ggaataggag 1620
cgtttttcct tgggttcttg ggagcagcag gaagcact 1658
<210> 56
<211> 490
<212> PRT
<213> Human immunodeficiency virus type 1
<400> 56
Asn Ala Ile Glu Glu Ser Trp Val Thr Val His Tyr Gly Val Pro Val
1 5 10 15
Trp Lys Glu Ala Asn Thr Thr Leu Phe Cys Ala Ser Asp Ala Lys Ala
20 25 30
Tyr Asp Thr Glu Val His Asn Val Trp Ala Thr His Ala Cys Val Pro
35 40 45
Thr Asn Pro Asn Pro Gln Glu Val Asp Leu Gly Asn Val Thr Glu Asn
50 55 60
Phe Asn Ala Trp Lys Asn Asp Met Val Glu Gln Met His Glu Asp Ile
65 70 75 80
Ile Ser Leu Trp Asp Gln Ser Leu Lys Pro Cys Val Lys Leu Thr Pro
85 90 95
Leu Cys Val Thr Leu Gln Cys Thr Asp Leu Arg Asn Asp Thr Asn Thr
100 105 110
Thr Ser Ser Pro Asn Thr Thr Ser Gly Asn Trp Met Asp Lys Arg Glu
115 120 125
Met Lys Asn Cys Ser Phe Asn Ile Thr Thr Ser Ile Arg Asp Lys Leu
130 135 140
Gln Lys Thr Phe Ala Leu Phe Tyr Lys Leu Asp Ile Val Pro Ile Asn
145 150 155 160
Glu Asp Lys Asn Ser Ser Asn Ile Asp Asn Thr Ser Tyr Arg Leu Ile
165 170 175
Ser Cys Asn Thr Ser Val Ile Thr Gln Ala Cys Pro Lys Val Ser Phe
180 185 190
Glu Pro Ile Pro Ile His Tyr Cys Ala Pro Ala Gly Phe Ala Ile Leu
195 200 205
Lys Cys Lys Asp Glu Glu Phe Asn Gly Thr Gly Pro Cys Lys Asn Val
210 215 220
Ser Thr Val Gln Tyr Thr His Gly Ile Arg Pro Val Val Ser Thr Gln
225 230 235 240
Leu Leu Leu Asn Gly Ser Leu Ala Glu Gln Gly Val Val Leu Arg Ser
245 250 255
Lys Asn Ile Ser Asp Asn Thr Lys Thr Ile Ile Val Gln Leu Lys Glu
260 265 270
Ala Val Thr Ile Lys Cys Thr Arg Pro Asn Asn Asn Thr Arg Lys Ser
275 280 285
Ile His Ile Gly Pro Gly Arg Ala Phe Tyr Ala Thr Gly Asp Ile Ile
290 295 300
Gly Asp Ile Arg Gln Ala His Cys Asn Ile Ser Ala Thr Lys Trp Asn
305 310 315 320
Asp Thr Leu Arg Gln Ile Val Glu Lys Leu Gln Gly Ser Phe Lys Asn
325 330 335
Lys Thr Ile Ser Phe Lys Arg Ser Ser Gly Gly Asp Pro Glu Ile Val
340 345 350
Met His Ser Phe Asn Cys Gly Gly Glu Phe Phe Tyr Cys Asn Ser Thr
355 360 365
Lys Leu Phe Asn Ser Thr Trp Tyr Ser Asn Gly Thr Ser Thr Phe Asp
370 375 380
Asn Thr Thr Glu Arg Thr Asn Asp Thr Ile Ile Leu Pro Cys Arg Ile
385 390 395 400
Lys Gln Ile Ile Asn Met Trp Gln Glu Val Gly Lys Ala Met Tyr Ala
405 410 415
Pro Pro Ile Pro Gly Leu Ile Asn Cys Ser Ser Asn Ile Thr Gly Leu
420 425 430
Leu Leu Ile Arg Asp Gly Gly Asn Asn Tyr Thr Asp Asn Thr Glu Ile
435 440 445
Phe Arg Pro Gly Gly Gly Asp Met Arg Asp Asn Trp Arg Ser Glu Leu
450 455 460
Tyr Lys Tyr Lys Val Val Lys Val Glu Pro Leu Gly Ile Ala Pro Thr
465 470 475 480
Lys Ala Lys Arg Arg Val Val Gln Arg Glu
485 490
<210> 57
<211> 1655
<212> DNA
<213> Human immunodeficiency virus type 1
<400> 57
agaaagagca gaagacagtg gcaatgagag tgaaggagat caggaagaat tgtcagcgct 60
tgtggacatg gggcaccatg ctccttggga tgttgatgat ctgtagtact gcagaacaac 120
tgtgggtcac agtctattat ggggtacctg tgtggaaaga agcaactacc actttatttt 180
gtgcatcaga tgctaaagca tatgacacag aagcacataa tgtttgggcc acacatgcct 240
gtgtacccac ggaccctaac ccacaagaag tagtaataaa tgtgacagaa aattttaaca 300
tgtggaaaaa tgacatggta gaacagatgc atgaggacat aatcagtgta tgggaccaga 360
gtctaaagcc atgtgtaaaa ttaaccccac tctgtgttac tttaaattgc actaattgga 420
atggtactaa taccaataat gctaatacta ccagtagtcc taatattacc agtactacta 480
ctgccaatat ttatgagaaa agaatggaag aaggagaaat acaaaactgc actttcaatg 540
tcaccacaag cataagggat aaggtaaaag aagaatatgc acttttttat agatctgatg 600
taggccaaat aggtaataat agtaataata catatacatt gataaattgt aattcctcag 660
tcattacaca ggctcgtcca aagatatcct ttgaaccaat tcccatacat tattgtaccc 720
cggctggttt tgcgattcta aaatgcaata ataagacctt caatggaaca ggaccatgta 780
acaatgtcag cacagtacag tgtacacatg gtattaggcc agtagtatca actcaattgt 840
tgttaaatgg cagtctagca gaagatgaga taatgattag atctgcaaat ctctcggaca 900
atactaaaaa cataatagta cagctgaata aatctgtaga aattaattgt acaagaccca 960
acaataatac aagaaaaagt ataaatatag gaccagggag agcattttat gcaacaggag 1020
atataatagg aaacataaga catgcatatt gtaccattaa tgaaacaaaa tggaatgaaa 1080
ctttaagaca gatagctaca aaattacaca aacaatttaa taaaacaata atctttgagc 1140
agtcctcagg aggagaccca gaaattacaa cgcacagttt taattgtgga ggggaatttt 1200
tctactgtaa tacaacaccg ctgtttaata gcacttgggt taagactcag aatgatacta 1260
tagggtctaa gactcagaat gctactacag ggttaaatgg cactatcata ctcccatgca 1320
gaataaaaca aatcataaac agatggcagg aagtaggaag agcaatgtat gcccctccca 1380
tcaaaggaat aattagatgt tcatcaaata ttacagggct gatattgaca agagatggtg 1440
gtggtaatga gagcagggaa aatgagacct tcagacctgg aggaggagat atgagggaca 1500
attggagaag tgaattatat aaatataaag tagtaaaaat tgagccaata ggactggcac 1560
ccaccaaggc aaagagaaga gtggtgcaga gagaaaaaag agcggtaacg ctgggagcta 1620
tgttccctgg gttcttggga gcagcaggaa gcact 1655
<210> 58
<211> 499
<212> PRT
<213> Human immunodeficiency virus type 1
<400> 58
Ser Thr Ala Glu Gln Leu Trp Val Thr Val Tyr Tyr Gly Val Pro Val
1 5 10 15
Trp Lys Glu Ala Thr Thr Thr Leu Phe Cys Ala Ser Asp Ala Lys Ala
20 25 30
Tyr Asp Thr Glu Ala His Asn Val Trp Ala Thr His Ala Cys Val Pro
35 40 45
Thr Asp Pro Asn Pro Gln Glu Val Val Ile Asn Val Thr Glu Asn Phe
50 55 60
Asn Met Trp Lys Asn Asp Met Val Glu Gln Met His Glu Asp Ile Ile
65 70 75 80
Ser Val Trp Asp Gln Ser Leu Lys Pro Cys Val Lys Leu Thr Pro Leu
85 90 95
Cys Val Thr Leu Asn Cys Thr Asn Trp Asn Gly Thr Asn Thr Asn Asn
100 105 110
Ala Asn Thr Thr Ser Ser Pro Asn Ile Thr Ser Thr Thr Thr Ala Asn
115 120 125
Ile Tyr Glu Lys Arg Met Glu Glu Gly Glu Ile Gln Asn Cys Thr Phe
130 135 140
Asn Val Thr Thr Ser Ile Arg Asp Lys Val Lys Glu Glu Tyr Ala Leu
145 150 155 160
Phe Tyr Arg Ser Asp Val Gly Gln Ile Gly Asn Asn Ser Asn Asn Thr
165 170 175
Tyr Thr Leu Ile Asn Cys Asn Ser Ser Val Ile Thr Gln Ala Arg Pro
180 185 190
Lys Ile Ser Phe Glu Pro Ile Pro Ile His Tyr Cys Thr Pro Ala Gly
195 200 205
Phe Ala Ile Leu Lys Cys Asn Asn Lys Thr Phe Asn Gly Thr Gly Pro
210 215 220
Cys Asn Asn Val Ser Thr Val Gln Cys Thr His Gly Ile Arg Pro Val
225 230 235 240
Val Ser Thr Gln Leu Leu Leu Asn Gly Ser Leu Ala Glu Asp Glu Ile
245 250 255
Met Ile Arg Ser Ala Asn Leu Ser Asp Asn Thr Lys Asn Ile Ile Val
260 265 270
Gln Leu Asn Lys Ser Val Glu Ile Asn Cys Thr Arg Pro Asn Asn Asn
275 280 285
Thr Arg Lys Ser Ile Asn Ile Gly Pro Gly Arg Ala Phe Tyr Ala Thr
290 295 300
Gly Asp Ile Ile Gly Asn Ile Arg His Ala Tyr Cys Thr Ile Asn Glu
305 310 315 320
Thr Lys Trp Asn Glu Thr Leu Arg Gln Ile Ala Thr Lys Leu His Lys
325 330 335
Gln Phe Asn Lys Thr Ile Ile Phe Glu Gln Ser Ser Gly Gly Asp Pro
340 345 350
Glu Ile Thr Thr His Ser Phe Asn Cys Gly Gly Glu Phe Phe Tyr Cys
355 360 365
Asn Thr Thr Pro Leu Phe Asn Ser Thr Trp Val Lys Thr Gln Asn Asp
370 375 380
Thr Ile Gly Ser Lys Thr Gln Asn Ala Thr Thr Gly Leu Asn Gly Thr
385 390 395 400
Ile Ile Leu Pro Cys Arg Ile Lys Gln Ile Ile Asn Arg Trp Gln Glu
405 410 415
Val Gly Arg Ala Met Tyr Ala Pro Pro Ile Lys Gly Ile Ile Arg Cys
420 425 430
Ser Ser Asn Ile Thr Gly Leu Ile Leu Thr Arg Asp Gly Gly Gly Asn
435 440 445
Glu Ser Arg Glu Asn Glu Thr Phe Arg Pro Gly Gly Gly Asp Met Arg
450 455 460
Asp Asn Trp Arg Ser Glu Leu Tyr Lys Tyr Lys Val Val Lys Ile Glu
465 470 475 480
Pro Ile Gly Leu Ala Pro Thr Lys Ala Lys Arg Arg Val Val Gln Arg
485 490 495
Glu lys arg
<210> 59
<211> 1637
<212> DNA
<213> Human immunodeficiency virus type 1
<400> 59
agaaagagca gaagacagtg gcaatgagag tgaaggggat caggaagaat tatcagcact 60
tgtggacatg gagcaacatg ctcacgatgc tccttgggat gttaatgatc tgtagtgctg 120
cagatcaatt gtgggtcaca gtctattatg gggtacctgt gtggaaagaa acaaccacca 180
ctctattttg tgcatcagat gctaaagcat atgataaaga ggtacataat gtttgggcca 240
cacatgcctg tgtacccaca gaccccaacc cacaagaagt aatattggaa aatgtgacag 300
aaaattttaa cgcgtggaaa aatgacatgg tagaacagat gcatgaggat ataattagtt 360
tatgggatca aagcttaaag ccatgtgtaa aattaacccc actctgtgtt actttaaatt 420
gcactgatgc taatattact aataccaatg ataatgagcc caatagtagt gtggtgaaac 480
tgatagagaa aggagaaata aaaaactgct ctttcaatat caccacaagc ataagagata 540
agatgcagaa agcatatgca cttttttata aacttgatgt agaaccaata gagaataata 600
ctactagcta taggttgata agttgtaaca cctcagtcat tacacaagcc tgtccaaagg 660
tatcctttga gccaattccc atacattttt gtgccccggc tggttttgcg attctaaagt 720
gtaacaataa gacattcgag ggaaaaggac catgtaaaaa tatcagcaca gtacaatgta 780
cacatggaat taggccagta gtatcaactc aattgctgtt aaatggcagt ctggcagaag 840
aagagatagt gattagatct gacaattttt caaacagtgc taaaaccata atagtacagt 900
tgaatgcatc tgtagaaatt aatcgtacaa gacccaacaa caatacgaga aaaggtatag 960
ttataggacc agggagaaaa gttattgcaa cagaaaaaat aataggagat gtaagacaag 1020
cacattgtaa cattagtata acaaaatgga ataatacttt aggccatata gttaataaat 1080
taagaaaaca atttggggaa aataaaacaa tagtctttaa gcaacactca ggaggggatc 1140
cagaagttat aatgcataat tttaattgtg caggggaatt tttctactgt aatacaacag 1200
gactgtttaa tagcacttgg cattggaatg gtacttggag tggtactgaa aggagaaata 1260
gcactgaagg aaatgacaca cttacactcc catgcagaat aaaacaaatt ataaacatgt 1320
ggcaggaagt aggaaaagca atgtatgccc ctcccgttaa cggacagatt agatgtttat 1380
caaatattac aggactgcta ttaacaagag atggtggtaa taacaataac acaaacgaca 1440
ccgaaacctt cagacctgaa ggaggagata tgagggacaa ttggagaagt gaattatata 1500
aatataaagt agtaagaatt gaaccattag gagtagcacc caccaaggca aagagaagag 1560
tggtgcagag agaaaaaaga gcactgggag taggagcagc tttgttcctt gggttcttgg 1620
gagcagcagg aagcact 1637
<210> 60
<211> 489
<212> PRT
<213> Human immunodeficiency virus type 1
<400> 60
Ser Ala Ala Asp Gln Leu Trp Val Thr Val Tyr Tyr Gly Val Pro Val
1 5 10 15
Trp Lys Glu Thr Thr Thr Thr Leu Phe Cys Ala Ser Asp Ala Lys Ala
20 25 30
Tyr Asp Lys Glu Val His Asn Val Trp Ala Thr His Ala Cys Val Pro
35 40 45
Thr Asp Pro Asn Pro Gln Glu Val Ile Leu Glu Asn Val Thr Glu Asn
50 55 60
Phe Asn Ala Trp Lys Asn Asp Met Val Glu Gln Met His Glu Asp Ile
65 70 75 80
Ile Ser Leu Trp Asp Gln Ser Leu Lys Pro Cys Val Lys Leu Thr Pro
85 90 95
Leu Cys Val Thr Leu Asn Cys Thr Asp Ala Asn Ile Thr Asn Thr Asn
100 105 110
Asp Asn Glu Pro Asn Ser Ser Val Val Lys Leu Ile Glu Lys Gly Glu
115 120 125
Ile Lys Asn Cys Ser Phe Asn Ile Thr Thr Ser Ile Arg Asp Lys Met
130 135 140
Gln Lys Ala Tyr Ala Leu Phe Tyr Lys Leu Asp Val Glu Pro Ile Glu
145 150 155 160
Asn Asn Thr Thr Ser Tyr Arg Leu Ile Ser Cys Asn Thr Ser Val Ile
165 170 175
Thr Gln Ala Cys Pro Lys Val Ser Phe Glu Pro Ile Pro Ile His Phe
180 185 190
Cys Ala Pro Ala Gly Phe Ala Ile Leu Lys Cys Asn Asn Lys Thr Phe
195 200 205
Glu Gly Lys Gly Pro Cys Lys Asn Ile Ser Thr Val Gln Cys Thr His
210 215 220
Gly Ile Arg Pro Val Val Ser Thr Gln Leu Leu Leu Asn Gly Ser Leu
225 230 235 240
Ala Glu Glu Glu Ile Val Ile Arg Ser Asp Asn Phe Ser Asn Ser Ala
245 250 255
Lys Thr Ile Ile Val Gln Leu Asn Ala Ser Val Glu Ile Asn Arg Thr
260 265 270
Arg Pro Asn Asn Asn Thr Arg Lys Gly Ile Val Ile Gly Pro Gly Arg
275 280 285
Lys Val Ile Ala Thr Glu Lys Ile Ile Gly Asp Val Arg Gln Ala His
290 295 300
Cys Asn Ile Ser Ile Thr Lys Trp Asn Asn Thr Leu Gly His Ile Val
305 310 315 320
Asn Lys Leu Arg Lys Gln Phe Gly Glu Asn Lys Thr Ile Val Phe Lys
325 330 335
Gln His Ser Gly Gly Asp Pro Glu Val Ile Met His Asn Phe Asn Cys
340 345 350
Ala Gly Glu Phe Phe Tyr Cys Asn Thr Thr Gly Leu Phe Asn Ser Thr
355 360 365
Trp His Trp Asn Gly Thr Trp Ser Gly Thr Glu Arg Arg Asn Ser Thr
370 375 380
Glu Gly Asn Asp Thr Leu Thr Leu Pro Cys Arg Ile Lys Gln Ile Ile
385 390 395 400
Asn Met Trp Gln Glu Val Gly Lys Ala Met Tyr Ala Pro Pro Val Asn
405 410 415
Gly Gln Ile Arg Cys Leu Ser Asn Ile Thr Gly Leu Leu Leu Thr Arg
420 425 430
Asp Gly Gly Asn Asn Asn Asn Thr Asn Asp Thr Glu Thr Phe Arg Pro
435 440 445
Glu Gly Gly Asp Met Arg Asp Asn Trp Arg Ser Glu Leu Tyr Lys Tyr
450 455 460
Lys Val Val Arg Ile Glu Pro Leu Gly Val Ala Pro Thr Lys Ala Lys
465 470 475 480
Arg Arg Val Val Gln Arg Glu Lys Arg
485
<210> 61
<211> 1678
<212> DNA
<213> Human immunodeficiency virus type 1
<400> 61
agaaagagca gaagacagtg gaatgagagt gaaggggatc aggaagaatt atcagcactt 60
gtggatatgg ggcatcttgc tccctgggat gttgatgatc tgtagtgctg cagacaagtt 120
gtgggtcaca gtctattatg gggtacctgt gtggaaagaa gcaaccacca ctctattttg 180
tgcatcagat gctaaagcat atagtgcaga ggtacataat gtctgggcca cacatgcctg 240
tgtacccaca gaccccgacc cacaggaaat agtattggaa aatgtaacag aaaattttaa 300
catgtggaaa aataacatgg tagaacagat gcaggaggat ataatcagtt tatgggatca 360
aagcctaaag ccatgtgtaa aattaacccc tctctgtgtc actttaaacc gcactgatga 420
attgcggact actaataaga ctactaataa gaccaatgat acagagacga atactactaa 480
taccaccagc tgggaaaaag gggaaatgaa aaactgctct tttgatgtca ccacaaacat 540
aagagataag tggcagagag aatatgcact tttttataag cttgatgtag taccaataga 600
taatgatggt aatggtaata gtagtaataa tgccactgat aataataata ctaccaaata 660
taccaactat aggttgataa gttgtaacac ctcagttatt acacaggcct gtccaaaggt 720
atcctttgag ccaattccca tacattattg tgccccagct ggttttgcga ttctaaagtg 780
taaagatgag aagttcagtg gaacaggacc atgtaaaaat gtcagcacag tacaatgcac 840
acatggaatt aggccagtag tatcaactca actgctgttg aatggcagtc tagcaaaaga 900
agagataata attagatctg aaaatctcac gaacaatgct aaaaccataa tagtacagct 960
gaatgaatct gtatcaatta attgtataag acccaacaat aatacaagaa gaggtatacc 1020
tataggacca gggcaagcat tttatgcaac aggggatata ataggggata taagaaaagc 1080
acattgtata gttaacagta cacaatggaa taacacttta gcacaggtag ccataaaatt 1140
aaatgaacac tttccaaata aaacaatagt ctttaagcag tcctcaggag gggacccaga 1200
aattgtaatg cacagtttta attgtggagg ggaatttttc tactgtgatt caacaccact 1260
gtttaacaat acttggaatg aaacacattt taataatact tgggatagta ttgaaaaggg 1320
aaaaatcata ctccaatgca gaataaaaca aattataaat atgtggcagg aagtaggaaa 1380
agcaatgtat gcccctccca tcagagggct gattaactgt acatcaaaca ttacagggct 1440
actattaaca agagatggtg gcaagaaaga gaatgagagt gatactatcg agatcttcag 1500
acctggagga ggagacatga ggaacaattg gagaagtgaa ttatataaat ataaagtagt 1560
aaaaattgaa ccattaggag tagcacccac caaggcaaaa agaagagtgg tgcagagaga 1620
aaaaagagca gcgctaggag ctttgttcct tgggttcttg ggagcagcag gaagcact 1678
<210> 62
<211> 508
<212> PRT
<213> Human immunodeficiency virus type 1
<400> 62
Ser Ala Ala Asp Lys Leu Trp Val Thr Val Tyr Tyr Gly Val Pro Val
1 5 10 15
Trp Lys Glu Ala Thr Thr Thr Leu Phe Cys Ala Ser Asp Ala Lys Ala
20 25 30
Tyr Ser Ala Glu Val His Asn Val Trp Ala Thr His Ala Cys Val Pro
35 40 45
Thr Asp Pro Asp Pro Gln Glu Ile Val Leu Glu Asn Val Thr Glu Asn
50 55 60
Phe Asn Met Trp Lys Asn Asn Met Val Glu Gln Met Gln Glu Asp Ile
65 70 75 80
Ile Ser Leu Trp Asp Gln Ser Leu Lys Pro Cys Val Lys Leu Thr Pro
85 90 95
Leu Cys Val Thr Leu Asn Arg Thr Asp Glu Leu Arg Thr Thr Asn Lys
100 105 110
Thr Thr Asn Lys Thr Asn Asp Thr Glu Thr Asn Thr Thr Asn Thr Thr
115 120 125
Ser Trp Glu Lys Gly Glu Met Lys Asn Cys Ser Phe Asp Val Thr Thr
130 135 140
Asn Ile Arg Asp Lys Trp Gln Arg Glu Tyr Ala Leu Phe Tyr Lys Leu
145 150 155 160
Asp Val Val Pro Ile Asp Asn Asp Gly Asn Gly Asn Ser Ser Asn Asn
165 170 175
Ala Thr Asp Asn Asn Asn Thr Thr Lys Tyr Thr Asn Tyr Arg Leu Ile
180 185 190
Ser Cys Asn Thr Ser Val Ile Thr Gln Ala Cys Pro Lys Val Ser Phe
195 200 205
Glu Pro Ile Pro Ile His Tyr Cys Ala Pro Ala Gly Phe Ala Ile Leu
210 215 220
Lys Cys Lys Asp Glu Lys Phe Ser Gly Thr Gly Pro Cys Lys Asn Val
225 230 235 240
Ser Thr Val Gln Cys Thr His Gly Ile Arg Pro Val Val Ser Thr Gln
245 250 255
Leu Leu Leu Asn Gly Ser Leu Ala Lys Glu Glu Ile Ile Ile Arg Ser
260 265 270
Glu Asn Leu Thr Asn Asn Ala Lys Thr Ile Ile Val Gln Leu Asn Glu
275 280 285
Ser Val Ser Ile Asn Cys Ile Arg Pro Asn Asn Asn Thr Arg Arg Gly
290 295 300
Ile Pro Ile Gly Pro Gly Gln Ala Phe Tyr Ala Thr Gly Asp Ile Ile
305 310 315 320
Gly Asp Ile Arg Lys Ala His Cys Ile Val Asn Ser Thr Gln Trp Asn
325 330 335
Asn Thr Leu Ala Gln Val Ala Ile Lys Leu Asn Glu His Phe Pro Asn
340 345 350
Lys Thr Ile Val Phe Lys Gln Ser Ser Gly Gly Asp Pro Glu Ile Val
355 360 365
Met His Ser Phe Asn Cys Gly Gly Glu Phe Phe Tyr Cys Asp Ser Thr
370 375 380
Pro Leu Phe Asn Asn Thr Trp Asn Glu Thr His Phe Asn Asn Thr Trp
385 390 395 400
Asp Ser Ile Glu Lys Gly Lys Ile Ile Leu Gln Cys Arg Ile Lys Gln
405 410 415
Ile Ile Asn Met Trp Gln Glu Val Gly Lys Ala Met Tyr Ala Pro Pro
420 425 430
Ile Arg Gly Leu Ile Asn Cys Thr Ser Asn Ile Thr Gly Leu Leu Leu
435 440 445
Thr Arg Asp Gly Gly Lys Lys Glu Asn Glu Ser Asp Thr Ile Glu Ile
450 455 460
Phe Arg Pro Gly Gly Gly Asp Met Arg Asn Asn Trp Arg Ser Glu Leu
465 470 475 480
Tyr Lys Tyr Lys Val Val Lys Ile Glu Pro Leu Gly Val Ala Pro Thr
485 490 495
Lys Ala Lys Arg Arg Val Val Gln Arg Glu Lys Arg
500 505
<210> 63
<211> 1595
<212> DNA
<213> Human immunodeficiency virus type 1
<400> 63
agaaagagca gaagacagtg gcaatgagag tgaaggagat catgaagaac tatcagaact 60
tatggagagg gggcatgatg ctccttggga tattcatgat ctgtagtgct acagaacaat 120
tgtgggtcac agtctattat ggggtacctg tgtggaaaga agcaaatacc actctatttt 180
gtgcatcaga tgctaaagca tataagacag aggtacataa tgtttgggcc acacatgcct 240
gtgtacccac agaccccaac ccacaagaag tattattgcc aaatatgaca gaagatttta 300
acatgtggaa aaataacatg gtagaacaga tgcatgagga tataatcagt ttgtgggatc 360
aaagcctaaa gccatgtgta aaattaaccc cactctgtgt tactttaaaa tgcactgact 420
tgaatactac taatactatc aatagtagtg acttgatgga gaagggagaa ataaagaact 480
gctctttcaa tatcaccaca aacataagag ataagatgca gaaagactat gcgctttttt 540
atagacttga tgtagtacca atagataatg ataatactag ctataggttg ataagttgta 600
acacctcagt cattacacag gcctgcccaa aggtatcttt tgagccaatt cccatacatt 660
gttgtgcccc ggctggtttt gcgattctaa agtgtaaaga taagaatttc aatggaacag 720
gaacatgtaa aaatgtcagc acagtacagt gtacacatgg aattagacca gtagtatcaa 780
ctcaactgtt gttaaatggt agtctggcag aagaagaggt agtaattaga tctgccaatt 840
tcagtgacaa tgctaaaaac ataatagtac agctgaacga aactgtagaa attaattgta 900
caagacccaa caacaataca atgaaaagca tacatatagg actagggaga gcattttata 960
caacaggaca aataatagga gatataagaa aagcacattg tagcattaat atgacaaaat 1020
ggaataacac cttaatacag gtagctaaaa agttaggaga acaatttaag aataaaacaa 1080
tagtctttaa ccaatcctca ggaggggaca cagaaattgt aatgcacagc tttaattgtg 1140
gaggggaatt tttctactgc aatacaacac aactgtttaa tggtagttgg aatccaaatg 1200
gtacttggaa ttatgctggg gggtcaaacg acactatcac actcccatgc agaataaaac 1260
aaattataaa tatgtggcag gaagtaggaa aagcaatgta tgcccctccc gtcaaaggac 1320
aaatcagatg tgtatcaaac attacagggt tgatattaac aagagatggt ggtaatggtg 1380
gtaatggcac agacaacacc accgagatct ttaggcctgc aggaggaaat atgaaggaca 1440
attggagaag tgaattatat aaatataaag tagtaagaat tgaaccatta ggagtagcac 1500
ccactaaggc aaagagaaga gtggtacaaa gagaaaaaag agcagtggga atgggagctc 1560
tgttccttgg gttcttggga gcagcaggaa gcact 1595
<210> 64
<211> 479
<212> PRT
<213> Human immunodeficiency virus type 1
<400> 64
Ser Ala Thr Glu Gln Leu Trp Val Thr Val Tyr Tyr Gly Val Pro Val
1 5 10 15
Trp Lys Glu Ala Asn Thr Thr Leu Phe Cys Ala Ser Asp Ala Lys Ala
20 25 30
Tyr Lys Thr Glu Val His Asn Val Trp Ala Thr His Ala Cys Val Pro
35 40 45
Thr Asp Pro Asn Pro Gln Glu Val Leu Leu Pro Asn Met Thr Glu Asp
50 55 60
Phe Asn Met Trp Lys Asn Asn Met Val Glu Gln Met His Glu Asp Ile
65 70 75 80
Ile Ser Leu Trp Asp Gln Ser Leu Lys Pro Cys Val Lys Leu Thr Pro
85 90 95
Leu Cys Val Thr Leu Lys Cys Thr Asp Leu Asn Thr Thr Asn Thr Ile
100 105 110
Asn Ser Ser Asp Leu Met Glu Lys Gly Glu Ile Lys Asn Cys Ser Phe
115 120 125
Asn Ile Thr Thr Asn Ile Arg Asp Lys Met Gln Lys Asp Tyr Ala Leu
130 135 140
Phe Tyr Arg Leu Asp Val Val Pro Ile Asp Asn Asp Asn Thr Ser Tyr
145 150 155 160
Arg Leu Ile Ser Cys Asn Thr Ser Val Ile Thr Gln Ala Cys Pro Lys
165 170 175
Val Ser Phe Glu Pro Ile Pro Ile His Cys Cys Ala Pro Ala Gly Phe
180 185 190
Ala Ile Leu Lys Cys Lys Asp Lys Asn Phe Asn Gly Thr Gly Thr Cys
195 200 205
Lys Asn Val Ser Thr Val Gln Cys Thr His Gly Ile Arg Pro Val Val
210 215 220
Ser Thr Gln Leu Leu Leu Asn Gly Ser Leu Ala Glu Glu Glu Val Val
225 230 235 240
Ile Arg Ser Ala Asn Phe Ser Asp Asn Ala Lys Asn Ile Ile Val Gln
245 250 255
Leu Asn Glu Thr Val Glu Ile Asn Cys Thr Arg Pro Asn Asn Asn Thr
260 265 270
Met Lys Ser Ile His Ile Gly Leu Gly Arg Ala Phe Tyr Thr Thr Gly
275 280 285
Gln Ile Ile Gly Asp Ile Arg Lys Ala His Cys Ser Ile Asn Met Thr
290 295 300
Lys Trp Asn Asn Thr Leu Ile Gln Val Ala Lys Lys Leu Gly Glu Gln
305 310 315 320
Phe Lys Asn Lys Thr Ile Val Phe Asn Gln Ser Ser Gly Gly Asp Thr
325 330 335
Glu Ile Val Met His Ser Phe Asn Cys Gly Gly Glu Phe Phe Tyr Cys
340 345 350
Asn Thr Thr Gln Leu Phe Asn Gly Ser Trp Asn Pro Asn Gly Thr Trp
355 360 365
Asn Tyr Ala Gly Gly Ser Asn Asp Thr Ile Thr Leu Pro Cys Arg Ile
370 375 380
Lys Gln Ile Ile Asn Met Trp Gln Glu Val Gly Lys Ala Met Tyr Ala
385 390 395 400
Pro Pro Val Lys Gly Gln Ile Arg Cys Val Ser Asn Ile Thr Gly Leu
405 410 415
Ile Leu Thr Arg Asp Gly Gly Asn Gly Gly Asn Gly Thr Asp Asn Thr
420 425 430
Thr Glu Ile Phe Arg Pro Ala Gly Gly Asn Met Lys Asp Asn Trp Arg
435 440 445
Ser Glu Leu Tyr Lys Tyr Lys Val Val Arg Ile Glu Pro Leu Gly Val
450 455 460
Ala Pro Thr Lys Ala Lys Arg Arg Val Val Gln Arg Glu Lys Arg
465 470 475
<210> 65
<211> 1592
<212> DNA
<213> Human immunodeficiency virus type 1
<400> 65
agaaagagca gaagacagtg gcaatgagag tgaaggggat caggaggaat tgtcagcact 60
tatggagatg gggcaccatg ctccttggga tgttaatgat ctgtagtgct acagaacaat 120
tgtgggtcac agtctattat ggggtacctg tgtggaaaga agcaactacc accctatttt 180
gtgcatcaga tgctaaagca tatgatacag agagacataa tgtttgggcc acacatgcct 240
gtgtacccac agacccctgc ccacaagaag taggattggg aaatgtgaca gagtatttta 300
acatgtggaa aaataacatg gtagaacaga tgcatgagga tataatcagt ttatgggatc 360
aaagcctaaa gccatgtgta aaattaacac cactctgtgt tactttaaat tgtaatgcgg 420
gaaagtttaa ttatacgaat aatactgata cactgaaaga agaagtagga gaaataaaaa 480
actgctcttt caatatcacc acaagcataa gagataaggt aaagaaagaa tatgcatttt 540
ttaataaact tgatgtagta ccaatagata atgagaatga tagctatagg ttgataagtt 600
gtaacacctc agtcattact caggcctgtc caaaggtatc atttgagcca attcctatac 660
attattgtgc cccagctggt tttgcgattc taaggtgtaa taataagaca ttcaatggga 720
caggaccatg tacaaatgtc agtacagtac aatgtacaca tggaattagg ccagtagtgt 780
caacccaact gctgttaaat ggcagtctag cagaagagga ggtaatgatt aggtctgaga 840
acttcacgaa caatgctaaa accataatag tacagctgaa tgaatctgta gtaattaatt 900
gtacaagacc caacaacaat acaagaaaaa gtatacacat aggaccaggg agagcatttt 960
atacaacagg agagataata ggagatataa gaaaagcaca ttgtaacatt agtaaagcaa 1020
aatgggatag cactttaaaa caagtagtta caaaattaag agaacaatat ggaaataaaa 1080
caatagcctt taagaactcc tcaggagggg acccagaaat tgtaatgcac agttttaatt 1140
gtggagggga atttttctac tgtaatacaa caaagctatt taatagtact tggaatagga 1200
cagaggtaga tactattgaa ggaaatacca ctataaatat cacactccca tgtagaataa 1260
aacaaattat aaacatgtgg caggaagtag gaaaagcaat gtatgcccct cccatcagag 1320
gaccaattag ctgcacatca aatattacag ggctgctgtt aataagagat ggtggtacag 1380
acaatagcac gaacgacacc gagatcttca gacctggagg aggagatatg agggacaatt 1440
ggagaagtga attatacaaa tataaagtag taaaaattga accattagga atagcaccca 1500
ccaaggcaaa gagaagagtg gtgcagagag aaaaaagagc aataggaata ggagctgtgt 1560
tccttgggtt cttgggagca gcaggaagca ct 1592
<210> 66
<211> 478
<212> PRT
<213> Human immunodeficiency virus type 1
<400> 66
Ser Ala Thr Glu Gln Leu Trp Val Thr Val Tyr Tyr Gly Val Pro Val
1 5 10 15
Trp Lys Glu Ala Thr Thr Thr Leu Phe Cys Ala Ser Asp Ala Lys Ala
20 25 30
Tyr Asp Thr Glu Arg His Asn Val Trp Ala Thr His Ala Cys Val Pro
35 40 45
Thr Asp Pro Cys Pro Gln Glu Val Gly Leu Gly Asn Val Thr Glu Tyr
50 55 60
Phe Asn Met Trp Lys Asn Asn Met Val Glu Gln Met His Glu Asp Ile
65 70 75 80
Ile Ser Leu Trp Asp Gln Ser Leu Lys Pro Cys Val Lys Leu Thr Pro
85 90 95
Leu Cys Val Thr Leu Asn Cys Asn Ala Gly Lys Phe Asn Tyr Thr Asn
100 105 110
Asn Thr Asp Thr Leu Lys Glu Glu Val Gly Glu Ile Lys Asn Cys Ser
115 120 125
Phe Asn Ile Thr Thr Ser Ser Ile Arg Asp Lys Val Lys Lys Glu Tyr Ala
130 135 140
Phe Phe Asn Lys Leu Asp Val Val Pro Ile Asp Asn Glu Asn Asp Ser
145 150 155 160
Tyr Arg Leu Ile Ser Cys Asn Thr Ser Val Ile Thr Gln Ala Cys Pro
165 170 175
Lys Val Ser Phe Glu Pro Ile Pro Ile His Tyr Cys Ala Pro Ala Gly
180 185 190
Phe Ala Ile Leu Arg Cys Asn Asn Lys Thr Phe Asn Gly Thr Gly Pro
195 200 205
Cys Thr Asn Val Ser Thr Val Gln Cys Thr His Gly Ile Arg Pro Val
210 215 220
Val Ser Thr Gln Leu Leu Leu Asn Gly Ser Leu Ala Glu Glu Glu Val
225 230 235 240
Met Ile Arg Ser Glu Asn Phe Thr Asn Asn Ala Lys Thr Ile Ile Val
245 250 255
Gln Leu Asn Glu Ser Val Val Ile Asn Cys Thr Arg Pro Asn Asn Asn
260 265 270
Thr Arg Lys Ser Ile His Ile Gly Pro Gly Arg Ala Phe Tyr Thr Thr
275 280 285
Gly Glu Ile Ile Gly Asp Ile Arg Lys Ala His Cys Asn Ile Ser Lys
290 295 300
Ala Lys Trp Asp Ser Thr Leu Lys Gln Val Val Thr Lys Leu Arg Glu
305 310 315 320
Gln Tyr Gly Asn Lys Thr Ile Ala Phe Lys Asn Ser Ser Gly Gly Asp
325 330 335
Pro Glu Ile Val Met His Ser Phe Asn Cys Gly Gly Glu Phe Phe Tyr
340 345 350
Cys Asn Thr Thr Lys Leu Phe Asn Ser Thr Trp Asn Arg Thr Glu Val
355 360 365
Asp Thr Ile Glu Gly Asn Thr Thr Ile Asn Ile Thr Leu Pro Cys Arg
370 375 380
Ile Lys Gln Ile Ile Asn Met Trp Gln Glu Val Gly Lys Ala Met Tyr
385 390 395 400
Ala Pro Pro Ile Arg Gly Pro Ile Ser Cys Thr Ser Asn Ile Thr Gly
405 410 415
Leu Leu Leu Ile Arg Asp Gly Gly Thr Asp Asn Ser Thr Asn Asp Thr
420 425 430
Glu Ile Phe Arg Pro Gly Gly Gly Asp Met Arg Asp Asn Trp Arg Ser
435 440 445
Glu Leu Tyr Lys Tyr Lys Val Val Lys Ile Glu Pro Leu Gly Ile Ala
450 455 460
Pro Thr Lys Ala Lys Arg Arg Val Val Gln Arg Glu Lys Arg
465 470 475
<210> 67
<211> 1586
<212> DNA
<213> Human immunodeficiency virus type 1
<400> 67
agaaagagca gaagacagtg gcaatgaaag tgacggggat catgaagaat tatcagcact 60
tatggagatg gggcatcatg ctccttggga tgttgatgat ctatagtact gcagaacaac 120
aattgtgggt cacagtctat tatggggtac ctgtgtggaa agaagcaact actactctat 180
tctgtgcatc agatgctaaa gcatatgata cagaggtaca taatgtttgg gccacacatg 240
cctgtgtacc cacagacccc aacccacaag aagtagtatt ggggaatgtg acagaaaatt 300
ttaacatgtg gaaaaataac atggtagaac agatgcatga ggatataatc agtttatggg 360
atcaaagcct aaaaccatgt gtaaaactaa ccccactctg tgttacttta aattgcactg 420
actgggataa aacgaattgc actaatgggg gagatattac tgctaatgag gaaaaaggag 480
aactaaaaaa ttgctctttc aatatcacca caaacataag agataagata cggaaagaat 540
atgcactttt ttataaattg gatgtagtac caatagataa tgataatact agttataggt 600
tgataaattg taacacctca gtcattacac aagcctgtcc aaaggtatcc tttgagccaa 660
ttcccataca ttattgtgcc ccggctggtt ttgcgattct aaagtgtaac aataagacgt 720
tcaatggaaa aggaccatgt aaaaatgtca gcacagtaca atgcacacat ggaattaggc 780
cagtagtgtc aactcaactg ctgttaaatg gcagtttagc agaagaagag gtagtagtta 840
gatctgccaa tttctcggac agtgccaaaa ccatcatagt acaactaaat gaatctgtag 900
taattaattg tacaagaccc aacaacaata caagaaaaag catacatata ggaccaggga 960
gagcatttta tgcaacagga gaaataatag gagatataag acaagcacat tgtaatctta 1020
gtctaacaaa atggaatcaa actttatatc aggtagttag aaaattaaaa gaacaattta 1080
agaataaaac aatagccttt aatcactcct caggagggga cccagaaatt gtaatgcaca 1140
gttttaattg tggaggagaa tttttctact gtaatacaac acaattattt aatagtactt 1200
ggtatactaa tggtacttgg agtgatactg gaagtaatga cacagtcaca ctcccatgca 1260
gaataaaaca aattataaac aggtggcaag aagtaggaaa agcaatgtat gcccctccca 1320
tcaaaggaca aattagatgc tcatcaaata ttacagggct gttattaaca agagatggtg 1380
gtagtagcaa aaacgagacc gaggtcttca gacctggagg aggagatatg agggacaatt 1440
ggagaagtga attatataaa tataaagtag taaaaattga accattagga gtagcaccca 1500
ccagggcaaa gagaagagtg gtgcagagag aaaaaagagg aataggagct gtgttccttg 1560
ggttcttggg agcagcagga agcact 1586
<210> 68
<211> 478
<212> PRT
<213> Human immunodeficiency virus type 1
<400> 68
Ser Thr Ala Glu Gln Gln Leu Trp Val Thr Val Tyr Tyr Gly Val Pro
1 5 10 15
Val Trp Lys Glu Ala Thr Thr Thr Leu Phe Cys Ala Ser Asp Ala Lys
20 25 30
Ala Tyr Asp Thr Glu Val His Asn Val Trp Ala Thr His Ala Cys Val
35 40 45
Pro Thr Asp Pro Asn Pro Gln Glu Val Val Leu Gly Asn Val Thr Glu
50 55 60
Asn Phe Asn Met Trp Lys Asn Asn Met Val Glu Gln Met His Glu Asp
65 70 75 80
Ile Ile Ser Leu Trp Asp Gln Ser Leu Lys Pro Cys Val Lys Leu Thr
85 90 95
Pro Leu Cys Val Thr Leu Asn Cys Thr Asp Trp Asp Lys Thr Asn Cys
100 105 110
Thr Asn Gly Gly Asp Ile Thr Ala Asn Glu Glu Lys Gly Glu Leu Lys
115 120 125
Asn Cys Ser Phe Asn Ile Thr Thr Asn Ile Arg Asp Lys Ile Arg Lys
130 135 140
Glu Tyr Ala Leu Phe Tyr Lys Leu Asp Val Val Pro Ile Asp Asn Asp
145 150 155 160
Asn Thr Ser Tyr Arg Leu Ile Asn Cys Asn Thr Ser Val Ile Thr Gln
165 170 175
Ala Cys Pro Lys Val Ser Phe Glu Pro Ile Pro Ile His Tyr Cys Ala
180 185 190
Pro Ala Gly Phe Ala Ile Leu Lys Cys Asn Asn Lys Thr Phe Asn Gly
195 200 205
Lys Gly Pro Cys Lys Asn Val Ser Thr Val Gln Cys Thr His Gly Ile
210 215 220
Arg Pro Val Val Ser Thr Gln Leu Leu Leu Asn Gly Ser Leu Ala Glu
225 230 235 240
Glu Glu Val Val Val Arg Ser Ala Asn Phe Ser Asp Ser Ala Lys Thr
245 250 255
Ile Ile Val Gln Leu Asn Glu Ser Val Val Ile Asn Cys Thr Arg Pro
260 265 270
Asn Asn Asn Thr Arg Lys Ser Ile His Ile Gly Pro Gly Arg Ala Phe
275 280 285
Tyr Ala Thr Gly Glu Ile Ile Gly Asp Ile Arg Gln Ala His Cys Asn
290 295 300
Leu Ser Leu Thr Lys Trp Asn Gln Thr Leu Tyr Gln Val Val Arg Lys
305 310 315 320
Leu Lys Glu Gln Phe Lys Asn Lys Thr Ile Ala Phe Asn His Ser Ser
325 330 335
Gly Gly Asp Pro Glu Ile Val Met His Ser Phe Asn Cys Gly Gly Glu
340 345 350
Phe Phe Tyr Cys Asn Thr Thr Gln Leu Phe Asn Ser Thr Trp Tyr Thr
355 360 365
Asn Gly Thr Trp Ser Asp Thr Gly Ser Asn Asp Thr Val Thr Leu Pro
370 375 380
Cys Arg Ile Lys Gln Ile Ile Asn Arg Trp Gln Glu Val Gly Lys Ala
385 390 395 400
Met Tyr Ala Pro Pro Ile Lys Gly Gln Ile Arg Cys Ser Ser Asn Ile
405 410 415
Thr Gly Leu Leu Leu Thr Arg Asp Gly Gly Ser Ser Lys Asn Glu Thr
420 425 430
Glu Val Phe Arg Pro Gly Gly Gly Asp Met Arg Asp Asn Trp Arg Ser
435 440 445
Glu Leu Tyr Lys Tyr Lys Val Val Lys Ile Glu Pro Leu Gly Val Ala
450 455 460
Pro Thr Arg Ala Lys Arg Arg Val Val Gln Arg Glu Lys Arg
465 470 475
<210> 69
<211> 1628
<212> DNA
<213> Human immunodeficiency virus type 1
<400> 69
agaaagagca gaagacagtg gcaatgagag cgaaggggac caagaagaat tggcagcagc 60
acttatggaa atggggcacg atgctccttg ggatgttaat gatctgtagt gctgcggaac 120
aatggtgggt cacagtctat tatggagtac ctgtgtggaa ggacgcaaat accactctat 180
tttgtgcatc agatgctaaa gcatatgata cagaggcaca taatgtctgg gccacacatg 240
cctgtgtacc cacagatccc aacccacaag aaatagtatt ggaaaatgtg acagaagatt 300
ttaacatgtg gaaaaataac atggcagacc agatgcatga ggatataatc agtttatggg 360
atcaaagcct aaagccatgt gtaaaattaa ctccgctctg tgttacttta aattgtactg 420
attggaatgg taatactact agcaatagta ctataaacaa caatactagt actaaggcag 480
aaatgaaaaa atgctctttt aatatcacca caagcataag agataaagtg acaaaggaat 540
atgcactgtt ttatagagtt gatgtagtac caatagataa agaaaataat aataccaatt 600
ataccaatta tagattaata aattgtaaca cctcagtcat tacacaagcc tgtccaaaga 660
catcctttga gccaattcct atacattatt gtgccccagc tggttttgca attctaaagt 720
gtaacaataa gacattcaca ggaaaaggac tctgtacaag ggttagcaca ttacaatgta 780
cacatggaat tagaccagta gtgtcaactc aactgctgtt aaatggcagt ctagcagaag 840
aggaggtagt aattagatgt gaaaatatca cagacaatgc taaaaccata atagtacagc 900
tgaatgaatc tgtagcaatt aattgtacaa gacctaataa caatacaaga aaaagtatac 960
ctataggacc agggagagca ttttatgcaa caggagatat agtaggaaat ataagacaag 1020
cacattgtaa ccttagtgga acagagtggg aaaaaacttt agggaaaata gttggggaat 1080
taagaaaaaa ttttgagaat aagacaataa tttttaatca atcctcagga ggggacccag 1140
aaattgtatc gcaccttttt aattgtggag gagaattttt ctattgtaac tcaacacaac 1200
tgtttaatag tacttggaat actactgcaa aaattgatgg ttctgggaat gttactggaa 1260
aggtaaatag cactatcaca ctccaatgta aaataagaca aattgtaaac ctgtggcagg 1320
aagtaggaaa agcaatgtat gcccctccca tcagtggaat aatttactgt tcatcaaata 1380
ttacagggct gatactgatg agagatggtg gtaatgatag tagcacgaat ggaaacgaga 1440
ccttcagacc tggaggggga aatatgaaag ataattggag aagtgaatta tataaatata 1500
aagtagtaaa aattgaacca ttaggactag cacccaccaa ggcaaagagg agagtggtgc 1560
aaagagaaaa aagagcagta ggaataggag ctatgttcct tgggttcttg ggagcagcag 1620
gaagcact 1628
<210> 70
<211> 489
<212> PRT
<213> Human immunodeficiency virus type 1
<400> 70
Ser Ala Ala Glu Gln Trp Trp Val Thr Val Tyr Tyr Gly Val Pro Val
1 5 10 15
Trp Lys Asp Ala Asn Thr Thr Leu Phe Cys Ala Ser Asp Ala Lys Ala
20 25 30
Tyr Asp Thr Glu Ala His Asn Val Trp Ala Thr His Ala Cys Val Pro
35 40 45
Thr Asp Pro Asn Pro Gln Glu Ile Val Leu Glu Asn Val Thr Glu Asp
50 55 60
Phe Asn Met Trp Lys Asn Asn Met Ala Asp Gln Met His Glu Asp Ile
65 70 75 80
Ile Ser Leu Trp Asp Gln Ser Leu Lys Pro Cys Val Lys Leu Thr Pro
85 90 95
Leu Cys Val Thr Leu Asn Cys Thr Asp Trp Asn Gly Asn Thr Thr Ser
100 105 110
Asn Ser Thr Ile Asn Asn Asn Thr Ser Thr Lys Ala Glu Met Lys Lys
115 120 125
Cys Ser Phe Asn Ile Thr Thr Ser Ile Arg Asp Lys Val Thr Lys Glu
130 135 140
Tyr Ala Leu Phe Tyr Arg Val Asp Val Val Pro Ile Asp Lys Glu Asn
145 150 155 160
Asn Asn Thr Asn Tyr Thr Asn Tyr Arg Leu Ile Asn Cys Asn Thr Ser
165 170 175
Val Ile Thr Gln Ala Cys Pro Lys Thr Ser Phe Glu Pro Ile Pro Ile
180 185 190
His Tyr Cys Ala Pro Ala Gly Phe Ala Ile Leu Lys Cys Asn Asn Lys
195 200 205
Thr Phe Thr Gly Lys Gly Leu Cys Thr Arg Val Ser Thr Leu Gln Cys
210 215 220
Thr His Gly Ile Arg Pro Val Val Ser Thr Gln Leu Leu Leu Asn Gly
225 230 235 240
Ser Leu Ala Glu Glu Glu Val Val Ile Arg Cys Glu Asn Ile Thr Asp
245 250 255
Asn Ala Lys Thr Ile Ile Val Gln Leu Asn Glu Ser Val Ala Ile Asn
260 265 270
Cys Thr Arg Pro Asn Asn Asn Thr Arg Lys Ser Ile Pro Ile Gly Pro
275 280 285
Gly Arg Ala Phe Tyr Ala Thr Gly Asp Ile Val Gly Asn Ile Arg Gln
290 295 300
Ala His Cys Asn Leu Ser Gly Thr Glu Trp Glu Lys Thr Leu Gly Lys
305 310 315 320
Ile Val Gly Glu Leu Arg Lys Asn Phe Glu Asn Lys Thr Ile Ile Phe
325 330 335
Asn Gln Ser Ser Gly Gly Asp Pro Glu Ile Val Ser His Leu Phe Asn
340 345 350
Cys Gly Gly Glu Phe Phe Tyr Cys Asn Ser Thr Gln Leu Phe Asn Ser
355 360 365
Thr Trp Asn Thr Thr Ala Lys Ile Asp Gly Ser Gly Asn Val Thr Gly
370 375 380
Lys Val Asn Ser Thr Ile Thr Leu Gln Cys Lys Ile Arg Gln Ile Val
385 390 395 400
Asn Leu Trp Gln Glu Val Gly Lys Ala Met Tyr Ala Pro Pro Ile Ser
405 410 415
Gly Ile Ile Tyr Cys Ser Ser Asn Ile Thr Gly Leu Ile Leu Met Arg
420 425 430
Asp Gly Gly Asn Asp Ser Ser Thr Asn Gly Asn Glu Thr Phe Arg Pro
435 440 445
Gly Gly Gly Asn Met Lys Asp Asn Trp Arg Ser Glu Leu Tyr Lys Tyr
450 455 460
Lys Val Val Lys Ile Glu Pro Leu Gly Leu Ala Pro Thr Lys Ala Lys
465 470 475 480
Arg Arg Val Val Gln Arg Glu Lys Arg
485
<210> 71
<211> 1622
<212> DNA
<213> Human immunodeficiency virus type 1
<400> 71
agaaagagca gaagacagtg gcaatgagag tgaaggggat caggaagaat tggcggcgct 60
ggtggagatg gggcaccatg ctccttggga tgttaatgat ctgtagtgct acagaacaat 120
tgtgggttac agtctattat ggggtacctg tgtggaaaga agcaaccacc actctatttt 180
gtgcatcaga tgctaaagca tataatacag aggtacgtaa tgtatgggcc acacatgcct 240
gtgtgcccac aggccccaac ccacaagaaa tagtattggt aaatgtgaca gaagatttta 300
acatgtggaa aaatagcatg gtagaacaga tgcatgagga tataatcagt ttatgggatc 360
aaagcctaaa accttgtgta aaattaaccc cactctgtgt tactttaaac tgcactgatt 420
tgaggaatac tactaatagc actaatagtg acggggaaaa gatggagaga ggagaaataa 480
aaaactgctc tttcaatgtt accacaggca taagagataa ggttcagaga gaatatgcac 540
tcttttataa acttgatata gtaccaatag aggaaggtgg ggataatacc agctgtaggg 600
ataataccag ctataggttg ataagttgta atacctcagt cattacacaa gcctgtccaa 660
aggtatcctt tgagccaatt cccatacatt attgtgcccc agctggtttt gcgattctaa 720
agtgtaataa taagacgttc aatggaaaag gaccatgttc aaatgtcagc acagtacaat 780
gtacacatgg aattaggcca gtagtgtcaa ctcaactgct gttaaacggc agtctagcag 840
aaaaagaggt agtaattaga tctgaaaata tcacggacaa tactaaaaac ataatagtac 900
agttaaatga aactgtagaa attaattgta caagacccaa caacaataca agaaaaagta 960
tacatatagg accggggaga gcatttcatg caacaggaga aataatagga aatataagac 1020
aggcatattg taacattagt ggagcaaaat ggaataacac tttaaaacag atagtaaaaa 1080
gattaaaaga aaaatttccg aataagataa tagtctttaa tcactcctca ggaggggacc 1140
cagaaattgt aacacacagt tttaattgtg gaggggaatt tttctactgt aattcaacaa 1200
acctgtttaa tagtaattca acacaactgg ataattggac ttatactgaa gggtcaaatg 1260
acactgttat cacgctccca tgcagaataa aacaaattgt aaatatgtgg caggaagtag 1320
gaaaagcaat gtatgctcct cccatcagag gacaaattag atgttcttca aatattacag 1380
ggctgatatt aacaagagat ggtggtaata agactgacaa tgacaccacc gagaccttca 1440
gacctggagg aggaagcatg agggacaatt ggagaagtga attatataaa tataaaatag 1500
taaaggttga accactagga gtagcaccca ccaaggcaaa gaggagagtg gtgcagagag 1560
aaaaaagaac agtgggactg ggagccttgt ttcttgggtt cttgggagca gcaggaagca 1620
ct 1622
<210> 72
<211> 488
<212> PRT
<213> Human immunodeficiency virus type 1
<400> 72
Ser Ala Thr Glu Gln Leu Trp Val Thr Val Tyr Tyr Gly Val Pro Val
1 5 10 15
Trp Lys Glu Ala Thr Thr Thr Leu Phe Cys Ala Ser Asp Ala Lys Ala
20 25 30
Tyr Asn Thr Glu Val Arg Asn Val Trp Ala Thr His Ala Cys Val Pro
35 40 45
Thr Gly Pro Asn Pro Gln Glu Ile Val Leu Val Asn Val Thr Glu Asp
50 55 60
Phe Asn Met Trp Lys Asn Ser Met Val Glu Gln Met His Glu Asp Ile
65 70 75 80
Ile Ser Leu Trp Asp Gln Ser Leu Lys Pro Cys Val Lys Leu Thr Pro
85 90 95
Leu Cys Val Thr Leu Asn Cys Thr Asp Leu Arg Asn Thr Thr Asn Ser
100 105 110
Thr Asn Ser Asp Gly Glu Lys Met Glu Arg Gly Glu Ile Lys Asn Cys
115 120 125
Ser Phe Asn Val Thr Thr Gly Ile Arg Asp Lys Val Gln Arg Glu Tyr
130 135 140
Ala Leu Phe Tyr Lys Leu Asp Ile Val Pro Ile Glu Glu Gly Gly Asp
145 150 155 160
Asn Thr Ser Cys Arg Asp Asn Thr Ser Tyr Arg Leu Ile Ser Cys Asn
165 170 175
Thr Ser Val Ile Thr Gln Ala Cys Pro Lys Val Ser Phe Glu Pro Ile
180 185 190
Pro Ile His Tyr Cys Ala Pro Ala Gly Phe Ala Ile Leu Lys Cys Asn
195 200 205
Asn Lys Thr Phe Asn Gly Lys Gly Pro Cys Ser Asn Val Ser Thr Val
210 215 220
Gln Cys Thr His Gly Ile Arg Pro Val Val Ser Thr Gln Leu Leu Leu
225 230 235 240
Asn Gly Ser Leu Ala Glu Lys Glu Val Val Ile Arg Ser Glu Asn Ile
245 250 255
Thr Asp Asn Thr Lys Asn Ile Ile Val Gln Leu Asn Glu Thr Val Glu
260 265 270
Ile Asn Cys Thr Arg Pro Asn Asn Asn Thr Arg Lys Ser Ile His Ile
275 280 285
Gly Pro Gly Arg Ala Phe His Ala Thr Gly Glu Ile Ile Gly Asn Ile
290 295 300
Arg Gln Ala Tyr Cys Asn Ile Ser Gly Ala Lys Trp Asn Asn Thr Leu
305 310 315 320
Lys Gln Ile Val Lys Arg Leu Lys Glu Lys Phe Pro Asn Lys Ile Ile
325 330 335
Val Phe Asn His Ser Ser Gly Gly Asp Pro Glu Ile Val Thr His Ser
340 345 350
Phe Asn Cys Gly Gly Glu Phe Phe Tyr Cys Asn Ser Thr Asn Leu Phe
355 360 365
Asn Ser Asn Ser Thr Gln Leu Asp Asn Trp Thr Tyr Thr Glu Gly Ser
370 375 380
Asn Asp Thr Val Ile Thr Leu Pro Cys Arg Ile Lys Gln Ile Val Asn
385 390 395 400
Met Trp Gln Glu Val Gly Lys Ala Met Tyr Ala Pro Pro Ile Arg Gly
405 410 415
Gln Ile Arg Cys Ser Ser Asn Ile Thr Gly Leu Ile Leu Thr Arg Asp
420 425 430
Gly Gly Asn Lys Thr Asp Asn Asp Thr Thr Glu Thr Phe Arg Pro Gly
435 440 445
Gly Gly Ser Met Arg Asp Asn Trp Arg Ser Glu Leu Tyr Lys Tyr Lys
450 455 460
Ile Val Lys Val Glu Pro Leu Gly Val Ala Pro Thr Lys Ala Lys Arg
465 470 475 480
Arg Val Val Gln Arg Glu Lys Arg
485
<210> 73
<211> 1622
<212> DNA
<213> Human immunodeficiency virus type 1
<400> 73
agaaagagca gaagacagtg gcaatgagag tgaaggggat caggaagaat tatcagcact 60
tgtggagagg ggacaacttg tggagagggg gcatcatgct ccttgggata ttgatgatct 120
gtagtgctac agaaaaattg tgggtcacag tctattatgg ggtacctgtg tggaaaaatg 180
caaacaccac tctattttgt gcatcagatg ctaaagcata tgatacagag gtacataatg 240
tttgggccac acacgcctgt gtacccacag accccagccc acgagaatta atattggaaa 300
atgtgacaga agactttgac atatggaaaa ataacatggt agaacagatg caagaggata 360
taatcagttt atgggatcaa agcctaaagc catgtgtaaa attaacccct ctctgtgtta 420
ctttagaatg caagaatgcc actaaaatta gtaatagcag tgaaattgga gaaatgaaaa 480
actgctcttt taatgttacc acagacagga gagataaggt gaaaacagaa tatgcacttt 540
tttataacct tgatataata caaatagagg aggagaatac cagcagctgc agtaatacca 600
gcagctacag gttgataagt tgtaacacct caacccttac acaggcctgt ccaaagatat 660
cctttgagcc aattcccata cattattgtg ccccggctgg ttttgcaatt ctaaagtgta 720
ataataaaac attcgatgga aaaggatcat gtaaaaatgt cagcacagta caatgtacac 780
atggaattaa gccagtagta tcaactcaac tgctgctaaa cggcagtcta gcagaagaag 840
aggtagtaat tagatctgct aatctctcag acaatgctaa aaccataata gtacagctga 900
acatgtctgt acaaattaat tgtacaagac ccaacaacaa tacaagaaga ggtatacatt 960
taggaccagg gagagccttt tatggaacag acataatagg agatataaga caagcacatt 1020
gtaacattag tggaaaacaa tggaattaca ctttacaaca gatagttaaa aaattcagaa 1080
aacaatttga gaatagcaca gtgatcttta acagatcctc aggaggggac ccagaaattg 1140
taatgcacag ttttaattgt ggaggggaat ttttctactg taatacaaca gaactgttta 1200
atagtacttg gaacagtagt catcctttgg atgatacttg gcctcctttg gataatacaa 1260
gtgacactac tatcacactc ccatgcagaa taaaacaaat tataaacatg tggcaggaag 1320
taggaaaagc aatgtatgcc cctcccatca aaggaccaat tagatgtgaa tcaaatatta 1380
cagggctgct attaacaaga gatggtggcg ataccaatac cactaacggg actgagacct 1440
tcagacctgg aggaggagat atgagggaca attggagaag tgaattatat aaatataaag 1500
tagtaagaat taaaccatta ggaatagcac ccaccaaggc acagagaaga gtggtgcaaa 1560
gagaaaaaag agcagcacta ggagctatgt tccttgggtt cttgggagca gcaggaagca 1620
ct 1622
<210> 74
<211> 483
<212> PRT
<213> Human immunodeficiency virus type 1
<400> 74
Ser Ala Thr Glu Lys Leu Trp Val Thr Val Tyr Tyr Gly Val Pro Val
1 5 10 15
Trp Lys Asn Ala Asn Thr Thr Leu Phe Cys Ala Ser Asp Ala Lys Ala
20 25 30
Tyr Asp Thr Glu Val His Asn Val Trp Ala Thr His Ala Cys Val Pro
35 40 45
Thr Asp Pro Ser Pro Arg Glu Leu Ile Leu Glu Asn Val Thr Glu Asp
50 55 60
Phe Asp Ile Trp Lys Asn Asn Met Val Glu Gln Met Gln Glu Asp Ile
65 70 75 80
Ile Ser Leu Trp Asp Gln Ser Leu Lys Pro Cys Val Lys Leu Thr Pro
85 90 95
Leu Cys Val Thr Leu Glu Cys Lys Asn Ala Thr Lys Ile Ser Asn Ser
100 105 110
Ser Glu Ile Gly Glu Met Lys Asn Cys Ser Phe Asn Val Thr Thr Asp
115 120 125
Arg Arg Asp Lys Val Lys Thr Glu Tyr Ala Leu Phe Tyr Asn Leu Asp
130 135 140
Ile Ile Gln Ile Glu Glu Glu Asn Thr Ser Ser Cys Ser Asn Thr Ser
145 150 155 160
Ser Tyr Arg Leu Ile Ser Cys Asn Thr Ser Thr Leu Thr Gln Ala Cys
165 170 175
Pro Lys Ile Ser Phe Glu Pro Ile Pro Ile His Tyr Cys Ala Pro Ala
180 185 190
Gly Phe Ala Ile Leu Lys Cys Asn Asn Lys Thr Phe Asp Gly Lys Gly
195 200 205
Ser Cys Lys Asn Val Ser Thr Val Gln Cys Thr His Gly Ile Lys Pro
210 215 220
Val Val Ser Thr Gln Leu Leu Leu Asn Gly Ser Leu Ala Glu Glu Glu
225 230 235 240
Val Val Ile Arg Ser Ala Asn Leu Ser Asp Asn Ala Lys Thr Ile Ile
245 250 255
Val Gln Leu Asn Met Ser Val Gln Ile Asn Cys Thr Arg Pro Asn Asn
260 265 270
Asn Thr Arg Arg Gly Ile His Leu Gly Pro Gly Arg Ala Phe Tyr Gly
275 280 285
Thr Asp Ile Ile Gly Asp Ile Arg Gln Ala His Cys Asn Ile Ser Gly
290 295 300
Lys Gln Trp Asn Tyr Thr Leu Gln Gln Ile Val Lys Lys Phe Arg Lys
305 310 315 320
Gln Phe Glu Asn Ser Thr Val Ile Phe Asn Arg Ser Ser Gly Gly Asp
325 330 335
Pro Glu Ile Val Met His Ser Phe Asn Cys Gly Gly Glu Phe Phe Tyr
340 345 350
Cys Asn Thr Thr Glu Leu Phe Asn Ser Thr Trp Asn Ser Ser His Pro
355 360 365
Leu Asp Asp Thr Trp Pro Pro Leu Asp Asn Thr Ser Asp Thr Thr Ile
370 375 380
Thr Leu Pro Cys Arg Ile Lys Gln Ile Ile Asn Met Trp Gln Glu Val
385 390 395 400
Gly Lys Ala Met Tyr Ala Pro Pro Ile Lys Gly Pro Ile Arg Cys Glu
405 410 415
Ser Asn Ile Thr Gly Leu Leu Leu Thr Arg Asp Gly Gly Asp Thr Asn
420 425 430
Thr Thr Asn Gly Thr Glu Thr Phe Arg Pro Gly Gly Gly Asp Met Arg
435 440 445
Asp Asn Trp Arg Ser Glu Leu Tyr Lys Tyr Lys Val Val Arg Ile Lys
450 455 460
Pro Leu Gly Ile Ala Pro Thr Lys Ala Gln Arg Arg Val Val Gln Arg
465 470 475 480
Glu lys arg
<210> 75
<211> 1574
<212> DNA
<213> Human immunodeficiency virus type 1
<400> 75
agaaagagca gaagacagtg gcaatgacag tgatggggat caggaggaat tatcaatgct 60
tgtggaaatg gggcatgacg ctccttggga tgttgatgat ctgtagtgct gcacaattgt 120
gggtcacagt ctgttatggg gtaccggtgt ggaaagaagc aaccaccact ctattttgtg 180
catcagatgc taaagcatat gacacagagg tacataatgt ttgggccaca catgcctgtg 240
tacccacaga ccccaatcca ctagaattaa aattggataa tgtgacagaa aattttaaca 300
tgtggaaaaa taacatggta gaacaaatgc atgaggatat aatcagttta tgggatcaaa 360
gcctaaagcc atgtgtaaaa ttaaccccac tctgtgttac tttaaattgc actgactact 420
cgaagaatgg tactaataac actgctaata atgaaggaga aatgaaaaac tgctctttta 480
atatcaccac aaacataaga gataagatgc agaatgaata cgcacttttt tataaacatg 540
atatggtatc aatagataat agtagtacta gctataggtt gacaagttgt aacacctcag 600
tcattacaca ggcctgtcca aagataacct ttgaaccaat tcctatacat tattgtaccc 660
cggctggttt tgcgctttta aagtgtaata ataaaacgtt caatggaaca ggaccatgta 720
aaaatgtcag cacagtacaa tgtacacatg gaattaggcc agtagtttca actcaactgc 780
tgttaaatgg cagtctagca gaagaagagg tagtaattag atctgaaaat ttcacgaaca 840
atgcaaaaat cataatagta cagctaaatg aaactataca aattaattgt acaagaccca 900
acaacaatac aagaaaaagt atacatatag caccagggag agcattttat gcaacaggag 960
aaataatagg agatataaga caagcacatt gtaacattag cagagcaaaa tggaacaacg 1020
ctttaaaaca gatagttgaa aaattaagag aacaatttaa gaataaaaca atagaattta 1080
agtcatcctc aggaggggac ccagaaattg taatgcacag tttcaattgt ggaggggaat 1140
ttttctactg taattcaaca aaactgttta atagtatttg gtatccgaat ggtactgaag 1200
ggtcaaataa cactgaagga aatgacccaa tcacactccc atgcagaata agacaaattg 1260
taaacagatg gcaggaagta ggaaaagcaa tgtatgcccc tcccatcagg ggaccaatta 1320
gatgttcatc aaatattaca gggctgctat taacaagaga tggtggtgct aataatactg 1380
ataatgagac cttcagacct ggaggaggag atatgaggga caattggaga agtgaattat 1440
ataaatataa agtagtaaga attgaaccat taggaatagc gcccactacg gcaaggagaa 1500
gagtggtgca aagagaaaaa agggcagcca taggagctat gatccttggg ttcttgggag 1560
cagcaggaag cact 1574
<210> 76
<211> 473
<212> PRT
<213> Human immunodeficiency virus type 1
<400> 76
Ser Ala Ala Gln Leu Trp Val Thr Val Cys Tyr Gly Val Pro Val Trp
1 5 10 15
Lys Glu Ala Thr Thr Thr Le Le Phe Cys Ala Ser Asp Ala Lys Ala Tyr
20 25 30
Asp Thr Glu Val His Asn Val Trp Ala Thr His Ala Cys Val Pro Thr
35 40 45
Asp Pro Asn Pro Leu Glu Leu Lys Leu Asp Asn Val Thr Glu Asn Phe
50 55 60
Asn Met Trp Lys Asn Asn Met Val Glu Gln Met His Glu Asp Ile Ile
65 70 75 80
Ser Leu Trp Asp Gln Ser Leu Lys Pro Cys Val Lys Leu Thr Pro Leu
85 90 95
Cys Val Thr Leu Asn Cys Thr Asp Tyr Ser Lys Asn Gly Thr Asn Asn
100 105 110
Thr Ala Asn Asn Glu Gly Glu Met Lys Asn Cys Ser Phe Asn Ile Thr
115 120 125
Thr Asn Ile Arg Asp Lys Met Gln Asn Glu Tyr Ala Leu Phe Tyr Lys
130 135 140
His Asp Met Val Ser Ile Asp Asn Ser Ser Thr Ser Tyr Arg Leu Thr
145 150 155 160
Ser Cys Asn Thr Ser Val Ile Thr Gln Ala Cys Pro Lys Ile Thr Phe
165 170 175
Glu Pro Ile Pro Ile His Tyr Cys Thr Pro Ala Gly Phe Ala Leu Leu
180 185 190
Lys Cys Asn Asn Lys Thr Phe Asn Gly Thr Gly Pro Cys Lys Asn Val
195 200 205
Ser Thr Val Gln Cys Thr His Gly Ile Arg Pro Val Val Ser Thr Gln
210 215 220
Leu Leu Leu Asn Gly Ser Leu Ala Glu Glu Glu Val Val Ile Arg Ser
225 230 235 240
Glu Asn Phe Thr Asn Asn Ala Lys Ile Ile Ile Val Gln Leu Asn Glu
245 250 255
Thr Ile Gln Ile Asn Cys Thr Arg Pro Asn Asn Asn Thr Arg Lys Ser
260 265 270
Ile His Ile Ala Pro Gly Arg Ala Phe Tyr Ala Thr Gly Glu Ile Ile
275 280 285
Gly Asp Ile Arg Gln Ala His Cys Asn Ile Ser Arg Ala Lys Trp Asn
290 295 300
Asn Ala Leu Lys Gln Ile Val Glu Lys Leu Arg Glu Gln Phe Lys Asn
305 310 315 320
Lys Thr Ile Glu Phe Lys Ser Ser Ser Gly Gly Asp Pro Glu Ile Val
325 330 335
Met His Ser Phe Asn Cys Gly Gly Glu Phe Phe Tyr Cys Asn Ser Thr
340 345 350
Lys Leu Phe Asn Ser Ile Trp Tyr Pro Asn Gly Thr Glu Gly Ser Asn
355 360 365
Asn Thr Glu Gly Asn Asp Pro Ile Thr Leu Pro Cys Arg Ile Arg Gln
370 375 380
Ile Val Asn Arg Trp Gln Glu Val Gly Lys Ala Met Tyr Ala Pro Pro
385 390 395 400
Ile Arg Gly Pro Ile Arg Cys Ser Ser Asn Ile Thr Gly Leu Leu Leu
405 410 415
Thr Arg Asp Gly Gly Ala Asn Asn Thr Asp Asn Glu Thr Phe Arg Pro
420 425 430
Gly Gly Gly Asp Met Arg Asp Asn Trp Arg Ser Glu Leu Tyr Lys Tyr
435 440 445
Lys Val Val Arg Ile Glu Pro Leu Gly Ile Ala Pro Thr Thr Ala Arg
450 455 460
Arg Arg Val Val Gln Arg Glu Lys Arg
465 470
<210> 77
<211> 1610
<212> DNA
<213> Human immunodeficiency virus type 1
<400> 77
agaaagagca gaagacagtg gcaatgaaag tgaaggggat caggaagaat tgtcagcgct 60
ggtggatagg gggcatcttg ctccttggaa tgttgatgat ctgtagtgct gcagaacaat 120
tgtgggtcac agtctattat ggggtacctg tgtggaaaga agcaaacatc actctatttt 180
gtgcatcaga tgctaaggga tatgatacag aagcacataa tgtctgggcc acacatgcct 240
gtgtacccac agaccccaac ccacaagagg tagcattgac aaatgtgaca gaaaacttta 300
acatgtggaa aaataacatg gtagaacaaa tgcatgagga tataattagt ttatgggatc 360
aaagcttaaa gccatgtgta aaattaaccc cactctgtgt tactttagat tgcaatgata 420
ctaatataaa tgtaactagc aaaaatgaga caatgatgga gcaaggagaa gcaaaaaact 480
gctctttcaa tatcaccaca aatttaagag ataaggtgca gaaagaatat tcagtttttt 540
ataaacttga tgtagtacca atagaagagg agaaaaataa tagtattaac aatagatata 600
ggttgataag ttgtaacacc tcagtcatta cacaagcctg tccaaagata tcctttgaac 660
caattcccat acattattgt gccccggctg gttttgcgat tctgaagtgt aacaataaga 720
cattcagtgg aaaaggacca tgcacaaatg tcagcacagt acaatgcaca catggaatta 780
ggccagtagt atcaactcaa ctgctgttaa atggcagcct agcagaagga gagatagtaa 840
ttagatctga caatttcaca gacaatacaa aaaccataat agtacagctg aatacatctg 900
tagcaattaa ttgtacaaga cccaacaaca atacaagaag aagtataact ataggaccag 960
ggagagcatt ttatgcaaca gacataatag gagatataag acaagcacat tgtaacatta 1020
gtagaacaca atggaataac actttaaaac aggtagctag aaaattaagt gaacaattta 1080
atgcaacaat agtttttaat aaatcctcag gaggggaccc agaaattgta atgcacagtt 1140
ttaattgtgg aggggaattt ttctactgta atacaacaca actgtttaat agtatttggt 1200
gtcctaataa tactggagag tcaaatagca ctaacaatga gacaatcata ctcccatgca 1260
gattaaaaca atttataaac atgtggcagg aggtaggaaa agcaatgtat gcccctccca 1320
tcagaggata cattaactgt tcatcaaata ttacggggct gctattaaca agagatggtg 1380
gtggtgataa taaaaacacg agtccagaga acaagacaga gaccttcaga cctggaggag 1440
gaaatatgaa ggacaattgg agaagtgaac tgtataaata taaagtagta agaattgaac 1500
cattaggagt agcacccacc aaggcaaaga gaagagtggt gcagagagaa aaaagagcag 1560
tgggaatagg agctatgttc cttgggttct tgggagcagc aggaagcact 1610
<210> 78
<211> 484
<212> PRT
<213> Human immunodeficiency virus type 1
<400> 78
Ser Ala Ala Glu Gln Leu Trp Val Thr Val Tyr Tyr Gly Val Pro Val
1 5 10 15
Trp Lys Glu Ala Asn Ile Thr Leu Phe Cys Ala Ser Asp Ala Lys Gly
20 25 30
Tyr Asp Thr Glu Ala His Asn Val Trp Ala Thr His Ala Cys Val Pro
35 40 45
Thr Asp Pro Asn Pro Gln Glu Val Ala Leu Thr Asn Val Thr Glu Asn
50 55 60
Phe Asn Met Trp Lys Asn Asn Met Val Glu Gln Met His Glu Asp Ile
65 70 75 80
Ile Ser Leu Trp Asp Gln Ser Leu Lys Pro Cys Val Lys Leu Thr Pro
85 90 95
Leu Cys Val Thr Leu Asp Cys Asn Asp Thr Asn Ile Asn Val Thr Ser
100 105 110
Lys Asn Glu Thr Met Met Glu Gln Gly Glu Ala Lys Asn Cys Ser Phe
115 120 125
Asn Ile Thr Thr Asn Leu Arg Asp Lys Val Gln Lys Glu Tyr Ser Val
130 135 140
Phe Tyr Lys Leu Asp Val Val Pro Ile Glu Glu Glu Lys Asn Asn Ser
145 150 155 160
Ile Asn Asn Arg Tyr Arg Leu Ile Ser Cys Asn Thr Ser Val Ile Thr
165 170 175
Gln Ala Cys Pro Lys Ile Ser Phe Glu Pro Ile Pro Ile His Tyr Cys
180 185 190
Ala Pro Ala Gly Phe Ala Ile Leu Lys Cys Asn Asn Lys Thr Phe Ser
195 200 205
Gly Lys Gly Pro Cys Thr Asn Val Ser Thr Val Gln Cys Thr His Gly
210 215 220
Ile Arg Pro Val Val Ser Thr Gln Leu Leu Leu Asn Gly Ser Leu Ala
225 230 235 240
Glu Gly Glu Ile Val Ile Arg Ser Asp Asn Phe Thr Asp Asn Thr Lys
245 250 255
Thr Ile Ile Val Gln Leu Asn Thr Ser Val Ala Ile Asn Cys Thr Arg
260 265 270
Pro Asn Asn Asn Thr Arg Arg Ser Ile Thr Ile Gly Pro Gly Arg Ala
275 280 285
Phe Tyr Ala Thr Asp Ile Ile Gly Asp Ile Arg Gln Ala His Cys Asn
290 295 300
Ile Ser Arg Thr Gln Trp Asn Asn Thr Leu Lys Gln Val Ala Arg Lys
305 310 315 320
Leu Ser Glu Gln Phe Asn Ala Thr Ile Val Phe Asn Lys Ser Ser Gly
325 330 335
Gly Asp Pro Glu Ile Val Met His Ser Phe Asn Cys Gly Gly Glu Phe
340 345 350
Phe Tyr Cys Asn Thr Thr Gln Leu Phe Asn Ser Ile Trp Cys Pro Asn
355 360 365
Asn Thr Gly Glu Ser Asn Ser Thr Asn Asn Glu Thr Ile Ile Leu Pro
370 375 380
Cys Arg Leu Lys Gln Phe Ile Asn Met Trp Gln Glu Val Gly Lys Ala
385 390 395 400
Met Tyr Ala Pro Pro Ile Arg Gly Tyr Ile Asn Cys Ser Ser Asn Ile
405 410 415
Thr Gly Leu Leu Leu Thr Arg Asp Gly Gly Gly Asp Asn Lys Asn Thr
420 425 430
Ser Pro Glu Asn Lys Thr Glu Thr Phe Arg Pro Gly Gly Gly Asn Met
435 440 445
Lys Asp Asn Trp Arg Ser Glu Leu Tyr Lys Tyr Lys Val Val Arg Ile
450 455 460
Glu Pro Leu Gly Val Ala Pro Thr Lys Ala Lys Arg Arg Val Val Gln
465 470 475 480
Arg Glu Lys Arg
<210> 79
<211> 1607
<212> DNA
<213> Human immunodeficiency virus type 1
<400> 79
agaaagagca gaagacagtg gcaatgaaag cgaaggggac caggaagaat tatcagcact 60
tgtggagatg gggtaccatg ctccttggga tgttgatgat ctgtagtgct acagagcaat 120
tgtgggtaac agtctattat ggggtacctg tgtggaaaga agcaaaaacc actctatttt 180
gtgcatcaga tgctaaagca tatgatacag agatgcataa tgtttgggcc acacatgcct 240
gtgtacccac agaccccaac ccacaagaaa tagtattgga aaatgtgaca gaaaatttta 300
acatgtggaa aaataacatg gtagatcaga tgcaggagga tgtaatcagt ctatgggatc 360
aaagcctaaa gccatgtgta aaattaaccc cactctgtgt tactttaaat tgcaatgata 420
cattgaggaa tgataatagc actaagaata atagtagtac tggttgggaa aagatggaga 480
aaggagaaat aaaaaattgc tctttcagtg ccaccacaac cgtgaaagat aagacacaga 540
aacaatatgc acttttttat aatcttgata tagtacatac aaatgatggt ggtaatagta 600
gctatatgtt aagaagttgt aacacctcag tcattacaca ggcctgtcca aaggtatcat 660
ttgagccaat tcccatacat tattgtgccc cggctggttt tgcgattcta aagtgtaatg 720
ataagaagtt caatgggaca ggaccatgta aaaatgtcag cacagtacaa tgtacacatg 780
gaattaggcc agtagtgtca actcaactgc tattaaatgg cagtctagca gaagaagagg 840
tagtaattag atctagcaat ttaacggaca atactaaaac cataatagta cagctgaagg 900
aatctgtaaa aattaattgt acaagaccca acaacaatac aagaaaaagt atatctatag 960
gaccagggag cgcattttat gcaacaggag acataatagg agatataaga caagcacatt 1020
gcaaccttag taaaacagaa tggggggaaa ctttaagaca gatagctaca aaattaagag 1080
aacaatttaa taataaaaca ataatcttta atagctcctc aggaggggac ccagaaattg 1140
taatgcacag ttttaattgt ggaggggaat ttttctactg taatacaaca aaactgttta 1200
atggtacttg gaatggtact tggaagaatg gtacttggag tacaaatgat actgaaaatg 1260
atactatcat actcccatgt agaataaaac aaattataaa catgtggcag gaagtaggaa 1320
aagcaatgtg tgcccctccc atcagaggac aaattaactg ctcatcacag attacagggc 1380
tgctattaac aagagatggt ggagataacc ctaaaaatga aaccttcaga cctggaggag 1440
gagatatgag ggacaattgg agaagtgaat tatataaata taaagtagta aaaattgaac 1500
cattaggaat agcacctacc agggcaaaga gaagagtggt gcagagagaa aaaagagcag 1560
caataggagc tatgatcctt gggttcttgg gagcagcagg aagcact 1607
<210> 80
<211> 484
<212> PRT
<213> Human immunodeficiency virus type 1
<400> 80
Ser Ala Thr Glu Gln Leu Trp Val Thr Val Tyr Tyr Gly Val Pro Val
1 5 10 15
Trp Lys Glu Ala Lys Thr Thr Leu Phe Cys Ala Ser Asp Ala Lys Ala
20 25 30
Tyr Asp Thr Glu Met His Asn Val Trp Ala Thr His Ala Cys Val Pro
35 40 45
Thr Asp Pro Asn Pro Gln Glu Ile Val Leu Glu Asn Val Thr Glu Asn
50 55 60
Phe Asn Met Trp Lys Asn Asn Met Val Asp Gln Met Gln Glu Asp Val
65 70 75 80
Ile Ser Leu Trp Asp Gln Ser Leu Lys Pro Cys Val Lys Leu Thr Pro
85 90 95
Leu Cys Val Thr Leu Asn Cys Asn Asp Thr Leu Arg Asn Asp Asn Ser
100 105 110
Thr Lys Asn Asn Ser Ser Thr Gly Trp Glu Lys Met Glu Lys Gly Glu
115 120 125
Ile Lys Asn Cys Ser Phe Ser Ala Thr Thr Thr Thr Val Lys Asp Lys Thr
130 135 140
Gln Lys Gln Tyr Ala Leu Phe Tyr Asn Leu Asp Ile Val His Thr Asn
145 150 155 160
Asp Gly Gly Asn Ser Ser Tyr Met Leu Arg Ser Cys Asn Thr Ser Val
165 170 175
Ile Thr Gln Ala Cys Pro Lys Val Ser Phe Glu Pro Ile Pro Ile His
180 185 190
Tyr Cys Ala Pro Ala Gly Phe Ala Ile Leu Lys Cys Asn Asp Lys Lys
195 200 205
Phe Asn Gly Thr Gly Pro Cys Lys Asn Val Ser Thr Val Gln Cys Thr
210 215 220
His Gly Ile Arg Pro Val Val Ser Thr Gln Leu Leu Leu Asn Gly Ser
225 230 235 240
Leu Ala Glu Glu Glu Val Val Ile Arg Ser Ser Asn Leu Thr Asp Asn
245 250 255
Thr Lys Thr Ile Ile Val Gln Leu Lys Glu Ser Val Lys Ile Asn Cys
260 265 270
Thr Arg Pro Asn Asn Asn Thr Arg Lys Ser Ile Ser Ile Gly Pro Gly
275 280 285
Ser Ala Phe Tyr Ala Thr Gly Asp Ile Ile Gly Asp Ile Arg Gln Ala
290 295 300
His Cys Asn Leu Ser Lys Thr Glu Trp Gly Glu Thr Leu Arg Gln Ile
305 310 315 320
Ala Thr Lys Leu Arg Glu Gln Phe Asn Asn Lys Thr Ile Ile Phe Asn
325 330 335
Ser Ser Ser Gly Gly Asp Pro Glu Ile Val Met His Ser Phe Asn Cys
340 345 350
Gly Gly Glu Phe Phe Tyr Cys Asn Thr Thr Lys Leu Phe Asn Gly Thr
355 360 365
Trp Asn Gly Thr Trp Lys Asn Gly Thr Trp Ser Thr Asn Asp Thr Glu
370 375 380
Asn Asp Thr Ile Ile Leu Pro Cys Arg Ile Lys Gln Ile Ile Asn Met
385 390 395 400
Trp Gln Glu Val Gly Lys Ala Met Cys Ala Pro Pro Ile Arg Gly Gln
405 410 415
Ile Asn Cys Ser Ser Gln Ile Thr Gly Leu Leu Leu Thr Arg Asp Gly
420 425 430
Gly Asp Asn Pro Lys Asn Glu Thr Phe Arg Pro Gly Gly Gly Asp Met
435 440 445
Arg Asp Asn Trp Arg Ser Glu Leu Tyr Lys Tyr Lys Val Val Lys Ile
450 455 460
Glu Pro Leu Gly Ile Ala Pro Thr Arg Ala Lys Arg Arg Val Val Gln
465 470 475 480
Arg Glu Lys Arg
<210> 81
<211> 1580
<212> DNA
<213> Human immunodeficiency virus type 1
<400> 81
agaaagagca gaagacagtg gcaatgagag tgagggggat tatgaggaat tatcagcact 60
tgtggagatg gggcatgacg ctccttggga tgttaatgat cagtagtgct aatgaacaat 120
tgtgggtcac agtctgttat ggggtacctg tgtggaaaga agcaaccact actttatttt 180
gtgcatcaga tgctaaagca tatgctgcag agaaacataa tgtttgggcc acacatgcct 240
gtgtacccac agaccccaac ccacaagaag tagtaataaa tgtgacagaa aattttaaca 300
tgtggaaaaa taacatggta gagcagatgc atgaagatgt aactagttta tgggaccaaa 360
gcctaaagcc atgtgtaaaa ttaacccctc tctgtgttac tttaaattgc actgactatg 420
aggggaataa tatcactagt gggaataaga caggagaaat aaaaaactgc tctttcgaga 480
tcaccacaaa catgagagat aagatacaga aaacatatgc acttttttat agacttgatg 540
tagaaccaat aaatgatgat aatgttactt ataggttgat aagctgtaat acctcagtca 600
ttacacaagc ctgtccaaag gtaacctttg agccaattcc catacattat tgtgccccgg 660
ctggctttgc gattgtaaag tgtaacaata aaacgttcaa tggaacagga ccatgtaaaa 720
atgttagcac agtacaatgt acacatggaa ttaggccagt agtatcaact caactgctgt 780
taaatggcag tctagcggaa gaagagacaa tgattagatc tgagaatttc tcggacaatg 840
ctaaaatcat aatagtacag ctgaataaat ctgtaaaaat taattgtaca agacccaaca 900
ataatacaat aaaaggtata catataggac cagggagagc attttataca acaggacaaa 960
taataggaga cataagacaa gcatattgta ccattaataa aacagaatgg aataacactt 1020
tatcacagat agctaaaaaa ttaagtagac aatttgagaa taaaacaata gcctttaggc 1080
caccctcagg aggggaccca gaaattgtaa tgcacagttt taattgtgga ggggaatttt 1140
tctattgtaa tacaacacaa ctgtttaata gtaattggac tactaatgga gagtcaaatt 1200
acacaacggg aaacaatgag acaattatca cactcccatg cagaataaaa caatttataa 1260
acatgtggca ggaagtagga aaagcaatgt atgcccctcc cattagtgga ataattaatt 1320
gcttatcaaa tattacaggg ctgctattaa caagagatgg tggtaatagt agcagcgcca 1380
acagcaccga gatcttcaga cctggaggag gggatatgag ggataattgg agaagtgaac 1440
tatataaata taaagtagta caaattgaac cattaggatt agcacccacc aaggcaaaga 1500
gaagagtggt gcagagagaa aaaagagcag tgggactagg agctgtgttc cttgggttct 1560
tgggagcagc aggaagcact 1580
<210> 82
<211> 474
<212> PRT
<213> Human immunodeficiency virus type 1
<400> 82
Ser Ala Asn Glu Gln Leu Trp Val Thr Val Cys Tyr Gly Val Pro Val
1 5 10 15
Trp Lys Glu Ala Thr Thr Thr Leu Phe Cys Ala Ser Asp Ala Lys Ala
20 25 30
Tyr Ala Ala Glu Lys His Asn Val Trp Ala Thr His Ala Cys Val Pro
35 40 45
Thr Asp Pro Asn Pro Gln Glu Val Val Ile Asn Val Thr Glu Asn Phe
50 55 60
Asn Met Trp Lys Asn Asn Met Val Glu Gln Met His Glu Asp Val Thr
65 70 75 80
Ser Leu Trp Asp Gln Ser Leu Lys Pro Cys Val Lys Leu Thr Pro Leu
85 90 95
Cys Val Thr Leu Asn Cys Thr Asp Tyr Glu Gly Asn Asn Ile Thr Ser
100 105 110
Gly Asn Lys Thr Gly Glu Ile Lys Asn Cys Ser Phe Glu Ile Thr Thr
115 120 125
Asn Met Arg Asp Lys Ile Gln Lys Thr Tyr Ala Leu Phe Tyr Arg Leu
130 135 140
Asp Val Glu Pro Ile Asn Asp Asp Asn Val Thr Tyr Arg Leu Ile Ser
145 150 155 160
Cys Asn Thr Ser Val Ile Thr Gln Ala Cys Pro Lys Val Thr Phe Glu
165 170 175
Pro Ile Pro Ile His Tyr Cys Ala Pro Ala Gly Phe Ala Ile Val Lys
180 185 190
Cys Asn Asn Lys Thr Phe Asn Gly Thr Gly Pro Cys Lys Asn Val Ser
195 200 205
Thr Val Gln Cys Thr His Gly Ile Arg Pro Val Val Ser Thr Gln Leu
210 215 220
Leu Leu Asn Gly Ser Leu Ala Glu Glu Glu Thr Met Ile Arg Ser Glu
225 230 235 240
Asn Phe Ser Asp Asn Ala Lys Ile Ile Ile Val Gln Leu Asn Lys Ser
245 250 255
Val Lys Ile Asn Cys Thr Arg Pro Asn Asn Asn Thr Ile Lys Gly Ile
260 265 270
His Ile Gly Pro Gly Arg Ala Phe Tyr Thr Thr Gly Gln Ile Ile Gly
275 280 285
Asp Ile Arg Gln Ala Tyr Cys Thr Ile Asn Lys Thr Glu Trp Asn Asn
290 295 300
Thr Leu Ser Gln Ile Ala Lys Lys Leu Ser Arg Gln Phe Glu Asn Lys
305 310 315 320
Thr Ile Ala Phe Arg Pro Pro Ser Gly Gly Asp Pro Glu Ile Val Met
325 330 335
His Ser Phe Asn Cys Gly Gly Glu Phe Phe Tyr Cys Asn Thr Thr Gln
340 345 350
Leu Phe Asn Ser Asn Trp Thr Thr Asn Gly Glu Ser Asn Tyr Thr Thr
355 360 365
Gly Asn Asn Glu Thr Ile Ile Thr Leu Pro Cys Arg Ile Lys Gln Phe
370 375 380
Ile Asn Met Trp Gln Glu Val Gly Lys Ala Met Tyr Ala Pro Pro Ile
385 390 395 400
Ser Gly Ile Ile Asn Cys Leu Ser Asn Ile Thr Gly Leu Leu Leu Thr
405 410 415
Arg Asp Gly Gly Asn Ser Ser Ser Ala Asn Ser Thr Glu Ile Phe Arg
420 425 430
Pro Gly Gly Gly Asp Met Arg Asp Asn Trp Arg Ser Glu Leu Tyr Lys
435 440 445
Tyr Lys Val Val Gln Ile Glu Pro Leu Gly Leu Ala Pro Thr Lys Ala
450 455 460
Lys Arg Arg Val Val Gln Arg Glu Lys Arg
465 470
<210> 83
<211> 1643
<212> DNA
<213> Human immunodeficiency virus type 1
<400> 83
agaaagagca gaagacagtg gcaatgagag tgaaggagat caggaagaat tgtcagaaat 60
tctggaaatg gggcaccttg ctccttggga tgttaatgat gatctgtaga gctgcagagg 120
attcgtgggt cacagtctat tatggggtac ctgtgtggaa ggaagcaacc accactctat 180
tttgtgcatc agatgctaaa gcatatgaca cagaggtaca taatgtttgg gccacacatg 240
cctgtgtacc cacagaccct aacccacaag aagtagtatt ggaaaatgtg acagaaaatt 300
tcaatgcgtg gaaaaataat atggtagaac agatgcatga ggatataatc agtttatggg 360
atcaaagcct aaagccatgt gtaaaattaa cccctctttg tgttactcta aattgcactg 420
atgtgagaaa taatgctacc aatactacta ataccaatag tactaatacc tatagtacta 480
acatagaaaa gatgaaagaa ggagaaataa aaaactgctc tttcaatacc accccaagca 540
taacagacaa gatgcagaag gcatatgcat tgttttataa gcttgatata gtacagataa 600
ataatgataa taaagataat accagctata gattgataag ttgtaatacc tcagtcatta 660
cacaggcctg tccaaaggca tcctttgagc caattcccat acattattgt gccccagctg 720
gttttgcgat tctaaagtgt aatgataaga agttcaatgg aacaggacca tgcaaaaatg 780
tcagcacagt acaatgtaca catggaatta ggccagtagt atcaacccaa ctgctgttaa 840
atggcagtct agcagaagaa gaggtagtaa ttagatctga aaatttcaca aacaatgcta 900
aaaccataat agtacagctg aatgagactg tacatattaa ttgtacaaga cccaacaaca 960
atacaagaaa aagtataggt ataggaccag ggagaacatt ttttgcaaca ggggaaataa 1020
taggagacat aagacaagca cattgtaaca ttagtagaaa caactggaat aaaactttag 1080
aaagggtagt taaaaaatta agagaacaat ttgggaacaa caaaacaatt gtttttaatc 1140
aatcctcagg aggggaccca gaaattgtga tgcacagttt taattgtaga ggagaatttt 1200
tctactgtaa tacaacacaa ctgtttaata gtacttggaa tgctaatagt acttggaatg 1260
ctaacgagaa tactactgga atgccaagtg acaatatcac actcccctgc agaataaaac 1320
aagttataaa catgtggcag gaagtaggaa aagcaatgta tgcccctccc attaaaggac 1380
caattaagtg ttcatcaaat attacaggac tgctattaac aagggatggt ggtgtgaaca 1440
attctgaaac tgagaccttc agacctggag gaggagatat gaggaacaat tggagatgtg 1500
gattatataa atataaagta gtaaaaattg agccattagg agtagcaccc accagggcaa 1560
agagaagagt ggtgcaaaga gaaaaaagag cagtgggact aggagctatg ttccttgggt 1620
tcttgggagc agcaggaagc act 1643
<210> 84
<211> 494
<212> PRT
<213> Human immunodeficiency virus type 1
<400> 84
Arg Ala Ala Glu Asp Ser Trp Val Thr Val Tyr Tyr Gly Val Pro Val
1 5 10 15
Trp Lys Glu Ala Thr Thr Thr Leu Phe Cys Ala Ser Asp Ala Lys Ala
20 25 30
Tyr Asp Thr Glu Val His Asn Val Trp Ala Thr His Ala Cys Val Pro
35 40 45
Thr Asp Pro Asn Pro Gln Glu Val Val Leu Glu Asn Val Thr Glu Asn
50 55 60
Phe Asn Ala Trp Lys Asn Asn Met Val Glu Gln Met His Glu Asp Ile
65 70 75 80
Ile Ser Leu Trp Asp Gln Ser Leu Lys Pro Cys Val Lys Leu Thr Pro
85 90 95
Leu Cys Val Thr Leu Asn Cys Thr Asp Val Arg Asn Asla Ala Thr Asn
100 105 110
Thr Thr Asn Thr Asn Ser Thr Asn Thr Tyr Ser Thr Asn Ile Glu Lys
115 120 125
Met Lys Glu Gly Glu Ile Lys Asn Cys Ser Phe Asn Thr Thr Pro Ser
130 135 140
Ile Thr Asp Lys Met Gln Lys Ala Tyr Ala Leu Phe Tyr Lys Leu Asp
145 150 155 160
Ile Val Gln Ile Asn Asn Asp Asn Lys Asp Asn Thr Ser Tyr Arg Leu
165 170 175
Ile Ser Cys Asn Thr Ser Val Ile Thr Gln Ala Cys Pro Lys Ala Ser
180 185 190
Phe Glu Pro Ile Pro Ile His Tyr Cys Ala Pro Ala Gly Phe Ala Ile
195 200 205
Leu Lys Cys Asn Asp Lys Lys Phe Asn Gly Thr Gly Pro Cys Lys Asn
210 215 220
Val Ser Thr Val Gln Cys Thr His Gly Ile Arg Pro Val Val Ser Thr
225 230 235 240
Gln Leu Leu Leu Asn Gly Ser Leu Ala Glu Glu Glu Val Val Ile Arg
245 250 255
Ser Glu Asn Phe Thr Asn Asn Ala Lys Thr Ile Ile Val Gln Leu Asn
260 265 270
Glu Thr Val His Ile Asn Cys Thr Arg Pro Asn Asn Asn Thr Arg Lys
275 280 285
Ser Ile Gly Ile Gly Pro Gly Arg Thr Phe Phe Ala Thr Gly Glu Ile
290 295 300
Ile Gly Asp Ile Arg Gln Ala His Cys Asn Ile Ser Arg Asn Asn Trp
305 310 315 320
Asn Lys Thr Leu Glu Arg Val Val Lys Lys Leu Arg Glu Gln Phe Gly
325 330 335
Asn Asn Lys Thr Ile Val Phe Asn Gln Ser Ser Gly Gly Asp Pro Glu
340 345 350
Ile Val Met His Ser Phe Asn Cys Arg Gly Glu Phe Phe Tyr Cys Asn
355 360 365
Thr Thr Gln Leu Phe Asn Ser Thr Trp Asn Ala Asn Ser Thr Trp Asn
370 375 380
Ala Asn Glu Asn Thr Thr Gly Met Pro Ser Asp Asn Ile Thr Leu Pro
385 390 395 400
Cys Arg Ile Lys Gln Val Ile Asn Met Trp Gln Glu Val Gly Lys Ala
405 410 415
Met Tyr Ala Pro Pro Ile Lys Gly Pro Ile Lys Cys Ser Ser Asn Ile
420 425 430
Thr Gly Leu Leu Leu Thr Arg Asp Gly Gly Val Asn Asn Ser Glu Thr
435 440 445
Glu Thr Phe Arg Pro Gly Gly Gly Asp Met Arg Asn Asn Trp Arg Cys
450 455 460
Gly Leu Tyr Lys Tyr Lys Val Val Lys Ile Glu Pro Leu Gly Val Ala
465 470 475 480
Pro Thr Arg Ala Lys Arg Arg Val Val Gln Arg Glu Lys Arg
485 490
<210> 85
<211> 1603
<212> DNA
<213> Human immunodeficiency virus type 1
<400> 85
agaaagagca gagacagtgg caatgaaagt gaaggggatc aggaagaatt atcagcactt 60
gtggggatgg ggcatgatgc tccttgggat gttaatgatc tgtagtgcta cagaaaacct 120
gtgggtcaca gtatattgtg gggtacctgt gtggaaagaa gcagaaacca gtctattttg 180
tgcatcagat gctaacacat ataatacaga ggctcataat gtttgggcca ctcatgcctg 240
tgtacccacg gaccccaacc cacaagaaat atatttggaa aatgtgacag aaaattttaa 300
catgtggaaa aataacatgg tagaacagat gcatgaggat atagtaagtt tatgggatga 360
aagcctaaag ccatgtgtaa aaataacccc actctgtgtc actctaaatt gcactgattt 420
ggaaaatggc actagtagca ataatagtag ctatcaaagg ggggaagaag gagaaataaa 480
gaactgctct ttcaatatca ccacaagatt aagagaaaag gtacagaaag aatatgcact 540
tttttataaa cttgatataa tagcaatgga taataaaact aatgctacca gatataggtt 600
gataagttgt aacacctcaa ccattacaca ggcctgtcca aaagtatcct ttgagccaat 660
tcccatacat tattgtgccc cagctggttt tgcgcttttc aagtgtaatg ataagaagtt 720
caatggatca ggaacatgta acaatgtcag cacagtacaa tgtacacatg gaattaggcc 780
agtagtatca actcagctgt tgctaaatgg cagtctagca gaagaagagg tagtaattag 840
atctgaaaat ttcacaaaca gtgctaaaac cataatagta cagctaaaag aacctgtaaa 900
aattaattgt acaagaccca acaataatac aagaagaagt atacatatag gaccaggaaa 960
agcattttat gcaacaggag aaataatagg agatatagga caagcacatt gtaacattag 1020
tggacaagaa tggaataaaa ctttaattca gatagttaaa aaattgagag aacaatttgg 1080
gaataagacg ataaacttta ctaaaccagc aggaggggac ccagagattg taatgcacag 1140
ttttaattgt ggaggggaat ttttctactg tgatacaaca cgactgttta atagggcttg 1200
gaataatact gaagagttaa atagtactac tggagagtca aataacacta tcaccctccc 1260
atgcagaata aaacaaatta taaacatgtg gcaggaagta ggaaaagcaa tgtatgcccc 1320
tcccatccaa ggaacaatta gatgttcatc aaatattaca gggctgctac tagcaagaga 1380
tggtggcagt aacaatgaga ctaatactac tgaaatcttc agacctgcag gaggagatat 1440
gagggacaat tggagaagtg aattatataa atataaagta gtaaaaattg aaccattagg 1500
agtagcaccc accagggcaa agagaagagt ggtgcaaaga gaaagaagag caataggaat 1560
aggagctgtg ttccttgggt tcttgggagc agcaggaagc act 1603
<210> 86
<211> 482
<212> PRT
<213> Human immunodeficiency virus type 1
<400> 86
Ser Ala Thr Glu Asn Leu Trp Val Thr Val Tyr Cys Gly Val Pro Val
1 5 10 15
Trp Lys Glu Ala Glu Thr Ser Leu Phe Cys Ala Ser Asp Ala Asn Thr
20 25 30
Tyr Asn Thr Glu Ala His Asn Val Trp Ala Thr His Ala Cys Val Pro
35 40 45
Thr Asp Pro Asn Pro Gln Glu Ile Tyr Leu Glu Asn Val Thr Glu Asn
50 55 60
Phe Asn Met Trp Lys Asn Asn Met Val Glu Gln Met His Glu Asp Ile
65 70 75 80
Val Ser Leu Trp Asp Glu Ser Leu Lys Pro Cys Val Lys Ile Thr Pro
85 90 95
Leu Cys Val Thr Leu Asn Cys Thr Asp Leu Glu Asn Gly Thr Ser Ser
100 105 110
Asn Asn Ser Ser Tyr Gln Arg Gly Glu Glu Gly Glu Ile Lys Asn Cys
115 120 125
Ser Phe Asn Ile Thr Thr Arg Leu Arg Glu Lys Val Gln Lys Glu Tyr
130 135 140
Ala Leu Phe Tyr Lys Leu Asp Ile Ile Ala Met Asp Asn Lys Thr Asn
145 150 155 160
Ala Thr Arg Tyr Arg Leu Ile Ser Cys Asn Thr Ser Thr Ile Thr Gln
165 170 175
Ala Cys Pro Lys Val Ser Phe Glu Pro Ile Pro Ile His Tyr Cys Ala
180 185 190
Pro Ala Gly Phe Ala Leu Phe Lys Cys Asn Asp Lys Lys Phe Asn Gly
195 200 205
Ser Gly Thr Cys Asn Asn Val Ser Thr Val Gln Cys Thr His Gly Ile
210 215 220
Arg Pro Val Val Ser Thr Gln Leu Leu Leu Asn Gly Ser Leu Ala Glu
225 230 235 240
Glu Glu Val Val Ile Arg Ser Glu Asn Phe Thr Asn Ser Ala Lys Thr
245 250 255
Ile Ile Val Gln Leu Lys Glu Pro Val Lys Ile Asn Cys Thr Arg Pro
260 265 270
Asn Asn Asn Thr Arg Arg Ser Ile His Ile Gly Pro Gly Lys Ala Phe
275 280 285
Tyr Ala Thr Gly Glu Ile Ile Gly Asp Ile Gly Gln Ala His Cys Asn
290 295 300
Ile Ser Gly Gln Glu Trp Asn Lys Thr Leu Ile Gln Ile Val Lys Lys
305 310 315 320
Leu Arg Glu Gln Phe Gly Asn Lys Thr Ile Asn Phe Thr Lys Pro Ala
325 330 335
Gly Gly Asp Pro Glu Ile Val Met His Ser Phe Asn Cys Gly Gly Glu
340 345 350
Phe Phe Tyr Cys Asp Thr Thr Arg Leu Phe Asn Arg Ala Trp Asn Asn
355 360 365
Thr Glu Glu Leu Asn Ser Thr Thr Gly Glu Ser Asn Asn Thr Ile Thr
370 375 380
Leu Pro Cys Arg Ile Lys Gln Ile Ile Asn Met Trp Gln Glu Val Gly
385 390 395 400
Lys Ala Met Tyr Ala Pro Pro Ile Gln Gly Thr Ile Arg Cys Ser Ser
405 410 415
Asn Ile Thr Gly Leu Leu Leu Ala Arg Asp Gly Gly Ser Asn Asn Glu
420 425 430
Thr Asn Thr Thr Glu Ile Phe Arg Pro Ala Gly Gly Asp Met Arg Asp
435 440 445
Asn Trp Arg Ser Glu Leu Tyr Lys Tyr Lys Val Val Lys Ile Glu Pro
450 455 460
Leu Gly Val Ala Pro Thr Arg Ala Lys Arg Arg Val Val Gln Arg Glu
465 470 475 480
Arg Arg
<210> 87
<211> 1604
<212> DNA
<213> Human immunodeficiency virus type 1
<400> 87
agaaagagca gaagacagtg gcaatgagag tgaaggggat caggaagaat tgtcagcact 60
tatggagatg gggcatcatg ctccttggaa tgttaatgat ctgtagtgct gcaagcctgt 120
gggtcacagt ctattatggg gtacctgtgt ggaaagatgc aaacaccact ctattttgtg 180
catcagatgc taaagcatat gatacagagg tacataatgt gtgggccaca catgcctgtg 240
tacccacaga ccccaaccca caagaagtag tattggaaaa tgtgacagaa aattttaata 300
tgtggaaaaa taacatggta gaacagatgc atgaggacat aattagttta tgggaccaaa 360
gcctaaagcc atgtgtaaaa ctaaccccac tctgtgttac tttaaattgc actgagttga 420
tgttgaatac tactaccaat agtactacta ccaatagtac cagtagtcct cctaccagta 480
gtggattgac aaactgctct ttcaatatcg ccacagatct aagagataag gtgcagaaag 540
aatatgctct tttttctaca cttgatgtag tatcaatagg taataacagc tctaggctga 600
taagttgtaa cacctcaatc cttacacagg cctgtccaaa ggtatccttt gagccaattc 660
ccatacatta ttgtgccccg gctggttttg caattctaaa gtgtaacaat aagacattca 720
atggaaaagg actatgtaac aatgtcagca caatacaatg tacacatgga attaagccag 780
tagtatcaac tcaattactg ttaaatggca gtctagcaga gaaagacata gtaattagat 840
ctgacaattt ttcaaacaat gctaagacca taatagtaca gctgaagaag cctgtataca 900
tcaattgtac aagacccaac aacaatacga gaaaaggtat acacatagca ccagggagag 960
cattttatac aacaggacag ataataggag acataaggaa agcatattgt gaaattagtg 1020
gaaaaagctg gaataacact ttagaacaga tagctacaaa attaagagaa caatttggga 1080
gtaataaaac aatagtcttt aatcaatcct cgggagggga cccagaaatt gtaatgcaca 1140
gttttaattg tagaggagaa tttttctatt gtaattcaac acaattgttt aatagtactt 1200
ggccgggtaa cggtcctagc aataatacta ctgggaatgg tactgatact gttatcatcc 1260
ttccatgcag aataaaacaa atcataaaca tgtggcagga agtaggaaga gcaatgtgtg 1320
cccctcccat cgcaggacaa attaactgta caacaaaaat tacagggctg ttattaacaa 1380
gagatggtgg gaatagcaat gagaccaaag agactgaaat ctttagacct ggaggaggag 1440
atatgaggga caattggaga agtgaattat acaaatataa agtagtaaaa attgaaccat 1500
taggagtagc acccaccgag gcaaggagaa gagtggtgca acgagaaaag agcagtggga 1560
ctaggagcta tgttccttgg gttcttggga gcagcagaag cact 1604
<210> 88
<211> 483
<212> PRT
<213> Human immunodeficiency virus type 1
<400> 88
Ser Ala Ala Ser Leu Trp Val Thr Val Tyr Tyr Gly Val Pro Val Trp
1 5 10 15
Lys Asp Ala Asn Thr Thr Leu Phe Cys Ala Ser Asp Ala Lys Ala Tyr
20 25 30
Asp Thr Glu Val His Asn Val Trp Ala Thr His Ala Cys Val Pro Thr
35 40 45
Asp Pro Asn Pro Gln Glu Val Val Leu Glu Asn Val Thr Glu Asn Phe
50 55 60
Asn Met Trp Lys Asn Asn Met Val Glu Gln Met His Glu Asp Ile Ile
65 70 75 80
Ser Leu Trp Asp Gln Ser Leu Lys Pro Cys Val Lys Leu Thr Pro Leu
85 90 95
Cys Val Thr Leu Asn Cys Thr Glu Leu Met Leu Asn Thr Thr Thr Asn
100 105 110
Ser Thr Thr Thr Asn Ser Thr Ser Ser Pro Pro Thr Ser Ser Gly Leu
115 120 125
Thr Asn Cys Ser Phe Asn Ile Ala Thr Asp Leu Arg Asp Lys Val Gln
130 135 140
Lys Glu Tyr Ala Leu Phe Ser Thr Leu Asp Val Val Ser Ile Gly Asn
145 150 155 160
Asn Ser Ser Arg Leu Ile Ser Cys Asn Thr Ser Ile Leu Thr Gln Ala
165 170 175
Cys Pro Lys Val Ser Phe Glu Pro Ile Pro Ile His Tyr Cys Ala Pro
180 185 190
Ala Gly Phe Ala Ile Leu Lys Cys Asn Asn Lys Thr Phe Asn Gly Lys
195 200 205
Gly Leu Cys Asn Asn Val Ser Thr Ile Gln Cys Thr His Gly Ile Lys
210 215 220
Pro Val Val Ser Thr Gln Leu Leu Leu Asn Gly Ser Leu Ala Glu Lys
225 230 235 240
Asp Ile Val Ile Arg Ser Asp Asn Phe Ser Asn Asn Ala Lys Thr Ile
245 250 255
Ile Val Gln Leu Lys Lys Pro Val Tyr Ile Asn Cys Thr Arg Pro Asn
260 265 270
Asn Asn Thr Arg Lys Gly Ile His Ile Ala Pro Gly Arg Ala Phe Tyr
275 280 285
Thr Thr Gly Gln Ile Ile Gly Asp Ile Arg Lys Ala Tyr Cys Glu Ile
290 295 300
Ser Gly Lys Ser Trp Asn Asn Thr Leu Glu Gln Ile Ala Thr Lys Leu
305 310 315 320
Arg Glu Gln Phe Gly Ser Asn Lys Thr Ile Val Phe Asn Gln Ser Ser
325 330 335
Gly Gly Asp Pro Glu Ile Val Met His Ser Phe Asn Cys Arg Gly Glu
340 345 350
Phe Phe Tyr Cys Asn Ser Thr Gln Leu Phe Asn Ser Thr Trp Pro Gly
355 360 365
Asn Gly Pro Ser Asn Asn Thr Thr Gly Asn Gly Thr Asp Thr Val Ile
370 375 380
Ile Leu Pro Cys Arg Ile Lys Gln Ile Ile Asn Met Trp Gln Glu Val
385 390 395 400
Gly Arg Ala Met Cys Ala Pro Pro Ile Ala Gly Gln Ile Asn Cys Thr
405 410 415
Thr Lys Ile Thr Gly Leu Leu Leu Thr Arg Asp Gly Gly Asn Ser Asn
420 425 430
Glu Thr Lys Glu Thr Glu Ile Phe Arg Pro Gly Gly Gly Asp Met Arg
435 440 445
Asp Asn Trp Arg Ser Glu Leu Tyr Lys Tyr Lys Val Val Lys Ile Glu
450 455 460
Pro Leu Gly Val Ala Pro Thr Glu Ala Arg Arg Arg Val Val Gln Arg
465 470 475 480
Glu lys ser
<210> 89
<211> 1666
<212> DNA
<213> Human immunodeficiency virus type 1
<400> 89
agaaagagca gagacagtgg caatgagagc gaaggagacc aggaagaatt gtcagcactc 60
gtggggatgg ggaaccatgc tcctgtggag atggggcatc atgctccttg ggatgttaat 120
gatctgtagt gctaaagaaa atttgtgggt cacagtctat tatggggtac ctgtgtggaa 180
agaagcatcc accactctat tttgtgcatc agatgctaaa gcatatgata cagaggtaca 240
taatgtttgg gccacacatg cctgtgtacc cacagacccc agcccacaag aagtagtatt 300
gggaaatgtg acagaatgtt ttaacatgtg gaataacaac atggtagaac agatgcatga 360
ggatataatc agtttatggg accaaagtct aaaaccctgt gtaaaattaa ccccactctg 420
tgttacctta agttgcagtg acgttaatat taccaatatt atcaataaca ctattgctaa 480
aaataatagt ttaagaatgg aaacaggaga cataaaaaac tgctctttca atatcaccac 540
aaacataaga gataagatgc aaacagaata tgcacttttt tataaatttg atgtagtgcc 600
aatatatgat agcaatgatg atagcaatat tactagaaat gatagttata ggataataag 660
ttgtaatacc tcagtcatta cacaggcctg tccaaaggta acctttgagc caattcccat 720
acattattgt gccccggctg gttttgcgat tctaaagtgt aacaataaga cattcaatgg 780
aaaaggacca tgtacaaatg tcagcacagt acaatgtaca catggaatta ggccagtagt 840
gtcaactcaa ctactgttaa atggcagtct agcagaaaag gagatagtga ttagatctga 900
caatttctcg gacaatgcta aaactataat agtacagtta aatggaactg ttcaaattaa 960
ttgttcaaga cccggcaaca atacaagaaa aagtatacat ataggaccag ggagtgcatt 1020
ttatgcaaca ggagacataa taggagatat aagaaaagca cattgtaaca ttagtaaaac 1080
agactggaat aacactttag gaaagatagc aaaaaaatta agagaacaat ttggggaaaa 1140
taaaacaata gagtttgaga aatccacagg aggggaccca gaagttatga tgcatacttt 1200
taattgtgga ggggaatttt tctactgtaa ttcaacaccg ctgtttaatg gtagtacttt 1260
taataatact tggacacctt tgaatagtag tgctaaaggg ccaaatgaca ctctcatact 1320
ccaatgtaga ataaaacaaa tcataaacat gtggcaggaa gtaggaaaag caatgtatgc 1380
ccctcccatc agaggataca ttaattgttc atcaaatatt acagggctgc tattgacaag 1440
agatggtggt aataatactg gtaatgatag caataccgag accttcagac ctacaggagg 1500
aaatatgaag gacaattgga ggagtgaatt atataaatat aaagtagtac aaattgaacc 1560
attaggagta gcacccacca gggcaaaaag aagagtggtg cagagagaaa aaagagcagc 1620
gctgggggct atgttccttg ggttcttggg agcagcagga agcact 1666
<210> 90
<211> 496
<212> PRT
<213> Human immunodeficiency virus type 1
<400> 90
Ser Ala Lys Glu Asn Leu Trp Val Thr Val Tyr Tyr Gly Val Pro Val
1 5 10 15
Trp Lys Glu Ala Ser Thr Thr Leu Phe Cys Ala Ser Asp Ala Lys Ala
20 25 30
Tyr Asp Thr Glu Val His Asn Val Trp Ala Thr His Ala Cys Val Pro
35 40 45
Thr Asp Pro Ser Pro Gln Glu Val Val Leu Gly Asn Val Thr Glu Cys
50 55 60
Phe Asn Met Trp Asn Asn Asn Met Val Glu Gln Met His Glu Asp Ile
65 70 75 80
Ile Ser Leu Trp Asp Gln Ser Leu Lys Pro Cys Val Lys Leu Thr Pro
85 90 95
Leu Cys Val Thr Leu Ser Cys Ser Asp Val Asn Ile Thr Asn Ile Ile
100 105 110
Asn Asn Thr Ile Ala Lys Asn Asn Ser Leu Arg Met Glu Thr Gly Asp
115 120 125
Ile Lys Asn Cys Ser Phe Asn Ile Thr Thr Asn Ile Arg Asp Lys Met
130 135 140
Gln Thr Glu Tyr Ala Leu Phe Tyr Lys Phe Asp Val Val Pro Ile Tyr
145 150 155 160
Asp Ser Asn Asp Asp Ser Asn Ile Thr Arg Asn Asp Ser Tyr Arg Ile
165 170 175
Ile Ser Cys Asn Thr Ser Val Ile Thr Gln Ala Cys Pro Lys Val Thr
180 185 190
Phe Glu Pro Ile Pro Ile His Tyr Cys Ala Pro Ala Gly Phe Ala Ile
195 200 205
Leu Lys Cys Asn Asn Lys Thr Phe Asn Gly Lys Gly Pro Cys Thr Asn
210 215 220
Val Ser Thr Val Gln Cys Thr His Gly Ile Arg Pro Val Val Ser Thr
225 230 235 240
Gln Leu Leu Leu Asn Gly Ser Leu Ala Glu Lys Glu Ile Val Ile Arg
245 250 255
Ser Asp Asn Phe Ser Asp Asn Ala Lys Thr Ile Ile Val Gln Leu Asn
260 265 270
Gly Thr Val Gln Ile Asn Cys Ser Arg Pro Gly Asn Asn Thr Arg Lys
275 280 285
Ser Ile His Ile Gly Pro Gly Ser Ala Phe Tyr Ala Thr Gly Asp Ile
290 295 300
Ile Gly Asp Ile Arg Lys Ala His Cys Asn Ile Ser Lys Thr Asp Trp
305 310 315 320
Asn Asn Thr Leu Gly Lys Ile Ala Lys Lys Leu Arg Glu Gln Phe Gly
325 330 335
Glu Asn Lys Thr Ile Glu Phe Glu Lys Ser Thr Gly Gly Asp Pro Glu
340 345 350
Val Met Met His Thr Phe Asn Cys Gly Gly Glu Phe Phe Tyr Cys Asn
355 360 365
Ser Thr Pro Leu Phe Asn Gly Ser Thr Phe Asn Asn Thr Trp Thr Pro
370 375 380
Leu Asn Ser Ser Ala Lys Gly Pro Asn Asp Thr Leu Ile Leu Gln Cys
385 390 395 400
Arg Ile Lys Gln Ile Ile Asn Met Trp Gln Glu Val Gly Lys Ala Met
405 410 415
Tyr Ala Pro Pro Ile Arg Gly Tyr Ile Asn Cys Ser Ser Asn Ile Thr
420 425 430
Gly Leu Leu Leu Thr Arg Asp Gly Gly Asn Asn Thr Gly Asn Asp Ser
435 440 445
Asn Thr Glu Thr Phe Arg Pro Thr Gly Gly Asn Met Lys Asp Asn Trp
450 455 460
Arg Ser Glu Leu Tyr Lys Tyr Lys Val Val Gln Ile Glu Pro Leu Gly
465 470 475 480
Val Ala Pro Thr Arg Ala Lys Arg Arg Val Val Gln Arg Glu Lys Arg
485 490 495
<210> 91
<211> 1559
<212> DNA
<213> Human immunodeficiency virus type 1
<400> 91
agaaagagca gaagacagtg gcaatgagag tgaaggagat caggaagaac tatcagcact 60
tgtggagatg gggcatcatg ctctttggga tattaatgat ctgtagtgct gaagaaaagt 120
gggtcacagt ctattatggg gtacctgtgt ggaaagaagc aaagaccact ctattttgtg 180
catcagatgc taaagcatat gatacagagg cacataatgt ttgggccaca catgcctgtg 240
tacccacaga ccccaaccca caagaagtag tattggagaa tgtgacagaa aattttaaca 300
tgtggaaaaa tgacatggta gagcagatgc atgaggatgt aatcagttta tgggatcaaa 360
gcctaaagcc atgtgtagaa ttaacgccac tctgtgttac tctaaattgc actaatctaa 420
attgcactaa caacactagt agcgaaataa aaaactgttc tttctatgtc accacaagca 480
tggaaggtaa ggtgaaaaaa catgcaacgt tttatagcct tgatatagta cgaacaacag 540
agagtaatat cagctatagg ttgataagtt gtaacacctc agtcattaca caggcctgtc 600
caaaaatatc ctttgaacca attcccatac attattgtgc cccggctggt tttgcgatcc 660
taaagtgtaa caataagaca ttcaatggaa caggaccatg tacaaatgtc agcacagtgc 720
aatgtacaca tggaattaag ccagtagtat caactcaact actgttaaat ggcagtctag 780
cagaggaaga ggtagtaatt agatctgaaa atttcatgag aaatgataaa atcataatag 840
tacagctaaa tgaatctata gaaattaatt gtacaagacc caactataat acaagaaaag 900
gtttccatat aggaccaggg agagcaattt atacaggaca aataatagga gatatcagac 960
aagcacattg taacattagt ggaataaaat ggaagaaggc tttaaaacag atagttggaa 1020
aattaagaga acaatttggg aataaaacaa tagtatttaa tcagtcctca ggaggggacc 1080
tagaaattga aacacacagt tttaattgtg gaggggaatt tttctactgc aatacaacac 1140
aactgtttaa taatacttgg ccgtcaaata atactgacgg agatattaac gaaaacatca 1200
cactcacact cccatgcaga ataaaacaaa ttataaacat gtggcagaaa gtaggaaaag 1260
ccatgtatgc ccctcccatc aaaggacaaa ttagatgttc atcaaatatt acagggctgc 1320
tactaatgag agatggtggt aacgacacca ccaaggcaaa cgacaccgag gtcttcagac 1380
ctggaggagg ggagatgagg gacaattgga gaagtgaatt atataaatat aaagtagtaa 1440
aaattgaacc attaggaata gcacccacca aggcaaagag aagagtggtg cagagagaaa 1500
aaagaggagt aggattagga gctatgttcc ttgggttctt gggagcagca ggaagcact 1559
<210> 92
<211> 467
<212> PRT
<213> Human immunodeficiency virus type 1
<400> 92
Ser Ala Glu Glu Lys Trp Val Thr Val Tyr Tyr Gly Val Pro Val Trp
1 5 10 15
Lys Glu Ala Lys Thr Thr Leu Phe Cys Ala Ser Asp Ala Lys Ala Tyr
20 25 30
Asp Thr Glu Ala His Asn Val Trp Ala Thr His Ala Cys Val Pro Thr
35 40 45
Asp Pro Asn Pro Gln Glu Val Val Leu Glu Asn Val Thr Glu Asn Phe
50 55 60
Asn Met Trp Lys Asn Asp Met Val Glu Gln Met His Glu Asp Val Ile
65 70 75 80
Ser Leu Trp Asp Gln Ser Leu Lys Pro Cys Val Glu Leu Thr Pro Leu
85 90 95
Cys Val Thr Leu Asn Cys Thr Asn Leu Asn Cys Thr Asn Asn Thr Ser
100 105 110
Ser Glu Ile Lys Asn Cys Ser Phe Tyr Val Thr Thr Ser Met Glu Gly
115 120 125
Lys Val Lys Lys His Ala Thr Phe Tyr Ser Leu Asp Ile Val Arg Thr
130 135 140
Thr Glu Ser Asn Ile Ser Tyr Arg Leu Ile Ser Cys Asn Thr Ser Val
145 150 155 160
Ile Thr Gln Ala Cys Pro Lys Ile Ser Phe Glu Pro Ile Pro Ile His
165 170 175
Tyr Cys Ala Pro Ala Gly Phe Ala Ile Leu Lys Cys Asn Asn Lys Thr
180 185 190
Phe Asn Gly Thr Gly Pro Cys Thr Asn Val Ser Thr Val Gln Cys Thr
195 200 205
His Gly Ile Lys Pro Val Val Ser Thr Gln Leu Leu Leu Asn Gly Ser
210 215 220
Leu Ala Glu Glu Glu Val Val Ile Arg Ser Glu Asn Phe Met Arg Asn
225 230 235 240
Asp Lys Ile Ile Ile Val Gln Leu Asn Glu Ser Ile Glu Ile Asn Cys
245 250 255
Thr Arg Pro Asn Tyr Asn Thr Arg Lys Gly Phe His Ile Gly Pro Gly
260 265 270
Arg Ala Ile Tyr Thr Gly Gln Ile Ile Gly Asp Ile Arg Gln Ala His
275 280 285
Cys Asn Ile Ser Gly Ile Lys Trp Lys Lys Ala Leu Lys Gln Ile Val
290 295 300
Gly Lys Leu Arg Glu Gln Phe Gly Asn Lys Thr Ile Val Phe Asn Gln
305 310 315 320
Ser Ser Gly Gly Asp Leu Glu Ile Glu Thr His Ser Phe Asn Cys Gly
325 330 335
Gly Glu Phe Phe Tyr Cys Asn Thr Thr Gln Leu Phe Asn Asn Thr Trp
340 345 350
Pro Ser Asn Asn Thr Asp Gly Asp Ile Asn Glu Asn Ile Thr Leu Thr
355 360 365
Leu Pro Cys Arg Ile Lys Gln Ile Ile Asn Met Trp Gln Lys Val Gly
370 375 380
Lys Ala Met Tyr Ala Pro Pro Ile Lys Gly Gln Ile Arg Cys Ser Ser
385 390 395 400
Asn Ile Thr Gly Leu Leu Leu Met Arg Asp Gly Gly Asn Asp Thr Thr
405 410 415
Lys Ala Asn Asp Thr Glu Val Phe Arg Pro Gly Gly Gly Glu Met Arg
420 425 430
Asp Asn Trp Arg Ser Glu Leu Tyr Lys Tyr Lys Val Val Lys Ile Glu
435 440 445
Pro Leu Gly Ile Ala Pro Thr Lys Ala Lys Arg Arg Val Val Gln Arg
450 455 460
Glu lys arg
465
<210> 93
<211> 1691
<212> DNA
<213> Human immunodeficiency virus type 1
<400> 93
agaaagagca gaagacagtg gcaatgagag tgaaggagat caagaggagt tatcagctct 60
tgttgaaagg gggcatcttg ctccttggga tattgatgat ctgtagtgct acagacaact 120
tgtgggtcac agtatattat ggggtacctg tatggaaaga agcaaccacc actctatttt 180
gtgcatcaga tgctaaagcc cataatacag aggtacacaa tgtttgggcc acacatgcct 240
gtgtacccac agaccctgac ccacaagaag tagtattgga aaatgtgaca gaaaatttta 300
acatgtggaa aaatgacatg gtagaacaga tgcatgagga tataatcagt ttatgggatc 360
aaagcctaaa gccatgtgta aaattaaccc cactttgtgt tactttaaat tgcactaatg 420
ctaaaatgaa caatactgct gataccaatg ctactaatac tgttaatatc agcaaggaag 480
aaatggaaga aataaaaaac tgctctttca atgtcaccac aagcttaaga gataagatgc 540
agagccaata tgcattgttt tataaacttg atatagtacc aatagataat agtagtagta 600
tagataatag tagtaataca tgtaatagta atagtacaca taataatagt agtacatgta 660
ataattatgc taattataga ttgataagtt gtgacacctc agtcattaca caggcctgtc 720
caaaggtatc ctttgagcca attcccatac attattgtgc cccggctggt tttgcgattc 780
taaagtgtaa taataagacg ttcaatggat caggaccatg taaaaatgtc agcacagtac 840
aatgtacaca tggaatcagg ccagtagtat caactcaact gctgttaaat ggcagtctag 900
cagaagaagg ggtagttatt agatctgaga atttcacaaa caatgctaaa accataatag 960
tacagataca tgaacctata gaaattaatt gtacaagacc caacaacaat acaagaaaaa 1020
gtataactat aggaccagga agagcgtttt atgcaacagg agacataata ggagatataa 1080
gacaagcaca ttgtaacctt agtaaagcaa gatggaatga tactttaaaa cagatagtta 1140
caaaattaag agaacagttt agaaataaaa caataaattt tactcaatcc tcaggagggg 1200
acccagaaat tgtgatgcac agttttaatt gtggagggga atttttctac tgtaatacaa 1260
cacaactgtt taatagtact tggttgtcta atagtacttg gaatgatact gaaaggtcaa 1320
atggcactga aactattaca ctcccatgca gaataaagca agttataaac aggtggcagg 1380
aagtaggaaa agcaatgtat gcccctccca tcagtggaat aattagatgt tcatcaaata 1440
ttacagggct gctattaaca agagatggtg gtaatagtaa tgacactccg actgatactg 1500
agatcttcag acctggagga ggagatatga gggacaattg gagaagcgaa ttatataaat 1560
ataaagtagt aaaaattgaa ccattaggaa tagcacccac caaggcaaag agaagagtgg 1620
tgcagagaga aaaaagagca gcgggaatag gagctctgtt ccttgggttc ttgggagcag 1680
caggaagcac t 1691
<210> 94
<211> 511
<212> PRT
<213> Human immunodeficiency virus type 1
<400> 94
Ser Ala Thr Asp Asn Leu Trp Val Thr Val Tyr Tyr Gly Val Pro Val
1 5 10 15
Trp Lys Glu Ala Thr Thr Thr Leu Phe Cys Ala Ser Asp Ala Lys Ala
20 25 30
His Asn Thr Glu Val His Asn Val Trp Ala Thr His Ala Cys Val Pro
35 40 45
Thr Asp Pro Asp Pro Gln Glu Val Val Leu Glu Asn Val Thr Glu Asn
50 55 60
Phe Asn Met Trp Lys Asn Asp Met Val Glu Gln Met His Glu Asp Ile
65 70 75 80
Ile Ser Leu Trp Asp Gln Ser Leu Lys Pro Cys Val Lys Leu Thr Pro
85 90 95
Leu Cys Val Thr Leu Asn Cys Thr Asn Ala Lys Met Asn Asn Thr Ala
100 105 110
Asp Thr Asn Ala Thr Asn Thr Val Asn Ile Ser Lys Glu Glu Met Glu
115 120 125
Glu Ile Lys Asn Cys Ser Phe Asn Val Thr Thr Ser Leu Arg Asp Lys
130 135 140
Met Gln Ser Gln Tyr Ala Leu Phe Tyr Lys Leu Asp Ile Val Pro Ile
145 150 155 160
Asp Asn Ser Ser Ser Ile Asp Asn Ser Ser Asn Thr Cys Asn Ser Asn
165 170 175
Ser Thr His Asn Asn Ser Ser Thr Cys Asn Asn Tyr Ala Asn Tyr Arg
180 185 190
Leu Ile Ser Cys Asp Thr Ser Val Ile Thr Gln Ala Cys Pro Lys Val
195 200 205
Ser Phe Glu Pro Ile Pro Ile His Tyr Cys Ala Pro Ala Gly Phe Ala
210 215 220
Ile Leu Lys Cys Asn Asn Lys Thr Phe Asn Gly Ser Gly Pro Cys Lys
225 230 235 240
Asn Val Ser Thr Val Gln Cys Thr His Gly Ile Arg Pro Val Val Ser
245 250 255
Thr Gln Leu Leu Leu Asn Gly Ser Leu Ala Glu Glu Gly Val Val Ile
260 265 270
Arg Ser Glu Asn Phe Thr Asn Asn Ala Lys Thr Ile Ile Val Gln Ile
275 280 285
His Glu Pro Ile Glu Ile Asn Cys Thr Arg Pro Asn Asn Asn Thr Arg
290 295 300
Lys Ser Ile Thr Ile Gly Pro Gly Arg Ala Phe Tyr Ala Thr Gly Asp
305 310 315 320
Ile Ile Gly Asp Ile Arg Gln Ala His Cys Asn Leu Ser Lys Ala Arg
325 330 335
Trp Asn Asp Thr Leu Lys Gln Ile Val Thr Lys Leu Arg Glu Gln Phe
340 345 350
Arg Asn Lys Thr Ile Asn Phe Thr Gln Ser Ser Gly Gly Asp Pro Glu
355 360 365
Ile Val Met His Ser Phe Asn Cys Gly Gly Glu Phe Phe Tyr Cys Asn
370 375 380
Thr Thr Gln Leu Phe Asn Ser Thr Trp Leu Ser Asn Ser Thr Trp Asn
385 390 395 400
Asp Thr Glu Arg Ser Asn Gly Thr Glu Thr Ile Thr Leu Pro Cys Arg
405 410 415
Ile Lys Gln Val Ile Asn Arg Trp Gln Glu Val Gly Lys Ala Met Tyr
420 425 430
Ala Pro Pro Ile Ser Gly Ile Ile Arg Cys Ser Ser Asn Ile Thr Gly
435 440 445
Leu Leu Leu Thr Arg Asp Gly Gly Asn Ser Asn Asp Thr Pro Thr Asp
450 455 460
Thr Glu Ile Phe Arg Pro Gly Gly Gly Asp Met Arg Asp Asn Trp Arg
465 470 475 480
Ser Glu Leu Tyr Lys Tyr Lys Val Val Lys Ile Glu Pro Leu Gly Ile
485 490 495
Ala Pro Thr Lys Ala Lys Arg Arg Val Val Gln Arg Glu Lys Arg
500 505 510
<210> 95
<211> 1616
<212> DNA
<213> Human immunodeficiency virus type 1
<400> 95
agaaagagca gaagacagtg gcaatgagag tgaaggggat caggaagaat tgtcagcgct 60
ggtggagatg gggcatcatg ctccttggga tgttgatgat ctgtaaagct gcagaccaat 120
tgtgggtcac agtctattat ggggtacctg tgtggaagga agcaaccacc actttatttt 180
gtgcatcaga tgctaaagca tataagacag aggtacataa tgtttgggcc acacatgcct 240
gtgtacccac agaccccaat ccacaagaaa tagacttggc aaatgtgaca gaaaatttta 300
acatgtggaa aaataacatg gtagaacaga tgcatgagga tataatcagt ttatgggatc 360
aaagcctaaa gccatgtgta aaactaaccc cactctgtgt tactttaaat tgtactgatg 420
acttagaaaa ttgtaaaaat accactaata ataatgccgc taataataat aatacctgcg 480
acatgccagg agaaataaag aactgctctt tcaatatcac cgcaggtata agagataaga 540
tgcagaaaga atatgcactt tttaatacac ttgatgtagt accaataggt gatgagaatg 600
ataataccag ttataggtta ataagttgta atacctcagt cattacacag gcctgtccaa 660
aggtatcctt tgagccaatt cccatacatt attgtgcccc ggctggtttt gcgattctaa 720
agtgtaaaga taagaaattc aatgggacag gacagtgtaa aaatgtcagc acagtacaat 780
gtacacatgg aattaagcca gtagtatcaa ctcaattact gttaaatggc agtctagcag 840
aagaagaggt agtaattagg tctgaaaatt tcacaaacaa tgctaaaacc ataatagtac 900
agctgaaaga acctgtacaa attaattgta caaggcccaa caacaataca agaaaaagca 960
tacatatagg accagggaga gcattttatg caacaggaca aataatagga gatataagac 1020
aagcacattg taacattagt agtataaaat ggaataacac tttaaaacag atagttaaaa 1080
aattaagaga acaatttaat aaaacaataa tctttaatca atcctcagga ggggacccag 1140
aaattgtaat gcacattttt aattgtagag gggaattttt ctactgtaat acaacacaac 1200
tgtttaatag cacttggaat attactgaag ggtcaaatga caatattaca ggcaggtcaa 1260
atgacactat cacgctccca tgcaagataa gacaaattgt aaacatgtgg caggaagtag 1320
gaaaagcaat gtatgctcct cccatcagag gacaaattaa ctgtacatca aatattacag 1380
ggctgctgtt aacaagagat ggtggtaaaa atatcagcga gtccgaaacc ttcagacctg 1440
gaggaggaaa tatgaaggac aattggagaa gtgaattata taaatacaaa gtagtacaaa 1500
ttgaaccatt aggagtagca cccaccaagg caaagagaag agtggtgcag agagaaaaaa 1560
gagcagtggg aataggagct ctgttccttg ggttcttggg agcagcagga agcact 1616
<210> 96
<211> 486
<212> PRT
<213> Human immunodeficiency virus type 1
<400> 96
Lys Ala Ala Asp Gln Leu Trp Val Thr Val Tyr Tyr Gly Val Pro Val
1 5 10 15
Trp Lys Glu Ala Thr Thr Thr Leu Phe Cys Ala Ser Asp Ala Lys Ala
20 25 30
Tyr Lys Thr Glu Val His Asn Val Trp Ala Thr His Ala Cys Val Pro
35 40 45
Thr Asp Pro Asn Pro Gln Glu Ile Asp Leu Ala Asn Val Thr Glu Asn
50 55 60
Phe Asn Met Trp Lys Asn Asn Met Val Glu Gln Met His Glu Asp Ile
65 70 75 80
Ile Ser Leu Trp Asp Gln Ser Leu Lys Pro Cys Val Lys Leu Thr Pro
85 90 95
Leu Cys Val Thr Leu Asn Cys Thr Asp Asp Leu Glu Asn Cys Lys Asn
100 105 110
Thr Thr Asn Asn Asn Ala Ala Asn Asn Asn Asn Thr Cys Asp Met Pro
115 120 125
Gly Glu Ile Lys Asn Cys Ser Phe Asn Ile Thr Ala Gly Ile Arg Asp
130 135 140
Lys Met Gln Lys Glu Tyr Ala Leu Phe Asn Thr Leu Asp Val Val Pro
145 150 155 160
Ile Gly Asp Glu Asn Asp Asn Thr Ser Tyr Arg Leu Ile Ser Cys Asn
165 170 175
Thr Ser Val Ile Thr Gln Ala Cys Pro Lys Val Ser Phe Glu Pro Ile
180 185 190
Pro Ile His Tyr Cys Ala Pro Ala Gly Phe Ala Ile Leu Lys Cys Lys
195 200 205
Asp Lys Lys Phe Asn Gly Thr Gly Gln Cys Lys Asn Val Ser Thr Val
210 215 220
Gln Cys Thr His Gly Ile Lys Pro Val Val Ser Thr Gln Leu Leu Leu
225 230 235 240
Asn Gly Ser Leu Ala Glu Glu Glu Val Val Ile Arg Ser Glu Asn Phe
245 250 255
Thr Asn Asn Ala Lys Thr Ile Ile Val Gln Leu Lys Glu Pro Val Gln
260 265 270
Ile Asn Cys Thr Arg Pro Asn Asn Asn Thr Arg Lys Ser Ile His Ile
275 280 285
Gly Pro Gly Arg Ala Phe Tyr Ala Thr Gly Gln Ile Ile Gly Asp Ile
290 295 300
Arg Gln Ala His Cys Asn Ile Ser Ser Ile Lys Trp Asn Asn Thr Leu
305 310 315 320
Lys Gln Ile Val Lys Lys Leu Arg Glu Gln Phe Asn Lys Thr Ile Ile
325 330 335
Phe Asn Gln Ser Ser Gly Gly Asp Pro Glu Ile Val Met His Ile Phe
340 345 350
Asn Cys Arg Gly Glu Phe Phe Tyr Cys Asn Thr Thr Gln Leu Phe Asn
355 360 365
Ser Thr Trp Asn Ile Thr Glu Gly Ser Asn Asp Asn Ile Thr Gly Arg
370 375 380
Ser Asn Asp Thr Ile Thr Leu Pro Cys Lys Ile Arg Gln Ile Val Asn
385 390 395 400
Met Trp Gln Glu Val Gly Lys Ala Met Tyr Ala Pro Pro Ile Arg Gly
405 410 415
Gln Ile Asn Cys Thr Ser Asn Ile Thr Gly Leu Leu Leu Thr Arg Asp
420 425 430
Gly Gly Lys Asn Ile Ser Glu Ser Glu Thr Phe Arg Pro Gly Gly Gly
435 440 445
Asn Met Lys Asp Asn Trp Arg Ser Glu Leu Tyr Lys Tyr Lys Val Val
450 455 460
Gln Ile Glu Pro Leu Gly Val Ala Pro Thr Lys Ala Lys Arg Arg Val
465 470 475 480
Val Gln Arg Glu Lys Arg
485
<210> 97
<211> 1640
<212> DNA
<213> Human immunodeficiency virus type 1
<400> 97
agaaagagca gaagacagtg gcaatgagag tgaaggggat caggaagaat tatcagcact 60
ggtggagatg ggggaccatg ctcctttggt tattgatgat ctgtagtgct gcagaacaat 120
tgtgggtcac agtttactat ggggtacctg tgtggaaaga agcaaccacc actctatttt 180
gtgcatcaga tgccaaagca tatgatccag aggcacataa tgtttgggcc acacatgcct 240
gtgtacccac agaccccaac ccacaagaat tagtattggc aaatgtgaca gaaaatttta 300
acatgtggaa aaataacatg gtagaacaga tgcaggagga tataatcagt ttatgggatc 360
aaagcctaaa gccatgtaca aaattaaccc cactctgtgt tactttaaat tgtactgatg 420
tggagtataa taatgaaact gcttccaata atactgcttc caatactact attattaatg 480
ggggaacaat agagggagat ggaataaaaa actgctcttt caatatcacc accagcctaa 540
ataaggtgca gaaagaaaat gcatattttt ataatcttga tgtagtacaa atagataata 600
gtgataatac cagctatagg ttagtaagtt gtaacacctc agtcattaca caggcctgtc 660
caaagatatc ttttgagcca attcccatac attactgtac cccggctggt tttgcgattc 720
taaagtgtaa tgataaaaag ttcaatggaa caggaccatg taaaaatgtc agcacagtac 780
aatgtacaca tggaattaag ccagtagtat caactcaact gttgttaaat ggcagtctag 840
cagaagaaga ggtagtaatt agatcagaaa atttcacaga taatgcaaaa atcataatag 900
tacagctgaa tgaatctatg gaaattaatt gtgcaagacc caacaacaat acaagaaaag 960
gtatacatat gggaccaggg aaagcatttt atgcaacagg agccataata ggagatatac 1020
gaagagcaca ttgcaacatt agtcaaacaa agtggaacaa tgccctaaaa cagatagcta 1080
taaagttaag agaacaattt gggaataaaa caatagtctt tacaaactcc tcaggagggg 1140
acccagaaat tgtaatgcac agttttaact gtggagggga gtttttctac tgtaatacat 1200
cactcctgtt taatagtatc tggaatagta ctactttgtc aaatagcact acaggagatg 1260
gaaatatcac actcccatgc agaataaaac aaattataaa tatgtggcag aaagtaggga 1320
aagcaatgta tgcccctccc atccaaggac taattaaatg tacatcaaat atcacaggga 1380
tgttattaat aagagatggt ggtaacatca actgcactga gactaatacc accaactgca 1440
atgagactga gaccttcaga cctgtaggag gagatatgag ggacaattgg agaagtgaat 1500
tatataaata taaagtagta aaaattgaac cattaggaat agcacccact aaggcaaaga 1560
gaagagtggt gcagagagaa aaaagagcag cgggactagg agctttgttc cttgggttct 1620
tgggagcagc aggaagcact 1640
<210> 98
<211> 494
<212> PRT
<213> Human immunodeficiency virus type 1
<400> 98
Ser Ala Ala Glu Gln Leu Trp Val Thr Val Tyr Tyr Gly Val Pro Val
1 5 10 15
Trp Lys Glu Ala Thr Thr Thr Leu Phe Cys Ala Ser Asp Ala Lys Ala
20 25 30
Tyr Asp Pro Glu Ala His Asn Val Trp Ala Thr His Ala Cys Val Pro
35 40 45
Thr Asp Pro Asn Pro Gln Glu Leu Val Leu Ala Asn Val Thr Glu Asn
50 55 60
Phe Asn Met Trp Lys Asn Asn Met Val Glu Gln Met Gln Glu Asp Ile
65 70 75 80
Ile Ser Leu Trp Asp Gln Ser Leu Lys Pro Cys Thr Lys Leu Thr Pro
85 90 95
Leu Cys Val Thr Leu Asn Cys Thr Asp Val Glu Tyr Asn Asn Glu Thr
100 105 110
Ala Ser Asn Asn Thr Ala Ser Asn Thr Thr Ile Ile Asn Gly Gly Thr
115 120 125
Ile Glu Gly Asp Gly Ile Lys Asn Cys Ser Phe Asn Ile Thr Thr Ser
130 135 140
Leu Asn Lys Val Gln Lys Glu Asn Ala Tyr Phe Tyr Asn Leu Asp Val
145 150 155 160
Val Gln Ile Asp Asn Ser Asp Asn Thr Ser Tyr Arg Leu Val Ser Cys
165 170 175
Asn Thr Ser Val Ile Thr Gln Ala Cys Pro Lys Ile Ser Phe Glu Pro
180 185 190
Ile Pro Ile His Tyr Cys Thr Pro Ala Gly Phe Ala Ile Leu Lys Cys
195 200 205
Asn Asp Lys Lys Phe Asn Gly Thr Gly Pro Cys Lys Asn Val Ser Thr
210 215 220
Val Gln Cys Thr His Gly Ile Lys Pro Val Val Ser Thr Gln Leu Leu
225 230 235 240
Leu Asn Gly Ser Leu Ala Glu Glu Glu Val Val Ile Arg Ser Glu Asn
245 250 255
Phe Thr Asp Asn Ala Lys Ile Ile Ile Val Gln Leu Asn Glu Ser Met
260 265 270
Glu Ile Asn Cys Ala Arg Pro Asn Asn Asn Thr Arg Lys Gly Ile His
275 280 285
Met Gly Pro Gly Lys Ala Phe Tyr Ala Thr Gly Ala Ile Gly Asp
290 295 300
Ile Arg Arg Ala His Cys Asn Ile Ser Gln Thr Lys Trp Asn Asn Ala
305 310 315 320
Leu Lys Gln Ile Ala Ile Lys Leu Arg Glu Gln Phe Gly Asn Lys Thr
325 330 335
Ile Val Phe Thr Asn Ser Ser Gly Gly Asp Pro Glu Ile Val Met His
340 345 350
Ser Phe Asn Cys Gly Gly Glu Phe Phe Tyr Cys Asn Thr Ser Leu Leu
355 360 365
Phe Asn Ser Ile Trp Asn Ser Thr Thr Leu Ser Asn Ser Thr Thr Gly
370 375 380
Asp Gly Asn Ile Thr Leu Pro Cys Arg Ile Lys Gln Ile Ile Asn Met
385 390 395 400
Trp Gln Lys Val Gly Lys Ala Met Tyr Ala Pro Pro Ile Gln Gly Leu
405 410 415
Ile Lys Cys Thr Ser Asn Ile Thr Gly Met Leu Leu Ile Arg Asp Gly
420 425 430
Gly Asn Ile Asn Cys Thr Glu Thr Asn Thr Thr Asn Cys Asn Glu Thr
435 440 445
Glu Thr Phe Arg Pro Val Gly Gly Asp Met Arg Asp Asn Trp Arg Ser
450 455 460
Glu Leu Tyr Lys Tyr Lys Val Val Lys Ile Glu Pro Leu Gly Ile Ala
465 470 475 480
Pro Thr Lys Ala Lys Arg Arg Val Val Gln Arg Glu Lys Arg
485 490
<210> 99
<211> 1649
<212> DNA
<213> Human immunodeficiency virus type 1
<400> 99
agaaagagca gaagacagtg gcaatgagag tgaaggagat caggaggaat tgtcagcact 60
ggtggtggaa atggggcacc atgctccttg ggatgttgat gatctgtagt gctaacgagc 120
aattgtgggt cacagtctat tatggggtac ctgtgtggaa agaagcaacc accactctat 180
tttgtgcatc agatgctaaa gcatatgaca cagaggcaca taatgtttgg gccacacatg 240
cctgtgtacc cacagaccct aacccacaag aagtagtatt ggtaaatgtg acagaaaatt 300
ttaatatgtg gaaaaataac atggtagaac agacgcagga ggatataatc agtttatggg 360
atcaaagcct aaagccatgt gtaaaattaa ccccactctg tgttactcta aattgcactg 420
attggaccct acattgcaat aataatgata ctaattgcac tactttaaaa aatgatacta 480
aaaccaataa taatagtagt ttgagaacaa tggagggagg agaagtaaaa aactgctctt 540
tcaatgtcac cacaagccta agagataagg agcgaaaaga atatgcactt ttttataaac 600
ttgatgtagt accaataggt aatgataata caagctatac gctgataaat tgtaacacct 660
caaccattac acaggcctgt ccaaaggtaa cctttgaacc aattcccata cattattgta 720
ccccggctgg ttttgcgctt ctaaagtgta atgataagaa gttcaatgga acaggacagt 780
gtaaaaatgt cagcacagta caatgtacac atggaattag gccagtagta tcaactcaat 840
tgctgctaaa tggcagtcta tcagaaggag aggtaatgat tagatctgaa aatttcacaa 900
acaatgctaa aaccataata gtacagctga atgaatctat agcaattaat tgtacaagac 960
ccaacaacaa tacaagaaaa agtataacta taggaccggg gagagcattt tttacaacag 1020
gagaaataac aggagatata agacaagcac attgtaacct tagtgcagta caatggaata 1080
acacattaaa acagatagtt gcaaaattaa gggaacaatt tgggaataat acaataagct 1140
ttaataaatc cgcaggaggg gacccagaaa ttgtaatgca cagttttaat tgtggagggg 1200
aatttttcta ctgtgataca acacagctgt ttaatagtac ttgggataat gacacagact 1260
taagtattaa gaatgagact acagaatcaa acaacaaaac tatcacactc ccgtgcagaa 1320
taaaacaaat tataaacaga tggcaggaag taggaaaagc aatgtatgcc cctcccatca 1380
gaggacaaat taaatgttca tcaaatatta cagggctact attaacaaga gatggtggta 1440
tgaacaatag cgccaacgag accttcagac ctggaggagg agatatgagg gacaattgga 1500
gaagtgaatt atataaatat aaagtagtaa aaattgagcc attaggggta gcacccacca 1560
aggcaaagag aagagcggtg caaagagaaa aaagagcagt gggaatagga gctgtgttcc 1620
ttgggttctt gggagcagca ggaagcact 1649
<210> 100
<211> 496
<212> PRT
<213> Human immunodeficiency virus type 1
<400> 100
Ser Ala Asn Glu Gln Leu Trp Val Thr Val Tyr Tyr Gly Val Pro Val
1 5 10 15
Trp Lys Glu Ala Thr Thr Thr Leu Phe Cys Ala Ser Asp Ala Lys Ala
20 25 30
Tyr Asp Thr Glu Ala His Asn Val Trp Ala Thr His Ala Cys Val Pro
35 40 45
Thr Asp Pro Asn Pro Gln Glu Val Val Leu Val Asn Val Thr Glu Asn
50 55 60
Phe Asn Met Trp Lys Asn Asn Met Val Glu Gln Thr Gln Glu Asp Ile
65 70 75 80
Ile Ser Leu Trp Asp Gln Ser Leu Lys Pro Cys Val Lys Leu Thr Pro
85 90 95
Leu Cys Val Thr Leu Asn Cys Thr Asp Trp Thr Leu His Cys Asn Asn
100 105 110
Asn Asp Thr Asn Cys Thr Thr Leu Lys Asn Asp Thr Lys Thr Asn Asn
115 120 125
Asn Ser Ser Leu Arg Thr Met Glu Gly Gly Glu Val Lys Asn Cys Ser
130 135 140
Phe Asn Val Thr Thr Ser Ser Leu Arg Asp Lys Glu Arg Lys Glu Tyr Ala
145 150 155 160
Leu Phe Tyr Lys Leu Asp Val Val Pro Ile Gly Asn Asp Asn Thr Ser
165 170 175
Tyr Thr Leu Ile Asn Cys Asn Thr Ser Thr Ile Thr Gln Ala Cys Pro
180 185 190
Lys Val Thr Phe Glu Pro Ile Pro Ile His Tyr Cys Thr Pro Ala Gly
195 200 205
Phe Ala Leu Leu Lys Cys Asn Asp Lys Lys Phe Asn Gly Thr Gly Gln
210 215 220
Cys Lys Asn Val Ser Thr Val Gln Cys Thr His Gly Ile Arg Pro Val
225 230 235 240
Val Ser Thr Gln Leu Leu Leu Asn Gly Ser Leu Ser Glu Gly Glu Val
245 250 255
Met Ile Arg Ser Glu Asn Phe Thr Asn Asn Ala Lys Thr Ile Ile Val
260 265 270
Gln Leu Asn Glu Ser Ile Ala Ile Asn Cys Thr Arg Pro Asn Asn Asn
275 280 285
Thr Arg Lys Ser Ile Thr Ile Gly Pro Gly Arg Ala Phe Phe Thr Thr
290 295 300
Gly Glu Ile Thr Gly Asp Ile Arg Gln Ala His Cys Asn Leu Ser Ala
305 310 315 320
Val Gln Trp Asn Asn Thr Leu Lys Gln Ile Val Ala Lys Leu Arg Glu
325 330 335
Gln Phe Gly Asn Asn Thr Ile Ser Phe Asn Lys Ser Ala Gly Gly Asp
340 345 350
Pro Glu Ile Val Met His Ser Phe Asn Cys Gly Gly Glu Phe Phe Tyr
355 360 365
Cys Asp Thr Thr Gln Leu Phe Asn Ser Thr Trp Asp Asn Asp Thr Asp
370 375 380
Leu Ser Ile Lys Asn Glu Thr Thr Glu Ser Asn Asn Lys Thr Ile Thr
385 390 395 400
Leu Pro Cys Arg Ile Lys Gln Ile Ile Asn Arg Trp Gln Glu Val Gly
405 410 415
Lys Ala Met Tyr Ala Pro Pro Ile Arg Gly Gln Ile Lys Cys Ser Ser
420 425 430
Asn Ile Thr Gly Leu Leu Leu Thr Arg Asp Gly Gly Met Asn Asn Ser
435 440 445
Ala Asn Glu Thr Phe Arg Pro Gly Gly Gly Asp Met Arg Asp Asn Trp
450 455 460
Arg Ser Glu Leu Tyr Lys Tyr Lys Val Val Lys Ile Glu Pro Leu Gly
465 470 475 480
Val Ala Pro Thr Lys Ala Lys Arg Arg Ala Val Gln Arg Glu Lys Arg
485 490 495
<210> 101
<211> 1610
<212> DNA
<213> Human immunodeficiency virus type 1
<400> 101
agaaagagca gaagacagtg gcaatgagag tgaaggggat caggaagaat tgcttacgga 60
aatggggcac catgctcctt gggatattaa tgatctgtag tgctgcagga aatttgtggg 120
tcacagtcta ttatggggtg cctgtgtgga aagaagcaac caccactcta ttctgtgcat 180
cagatgctaa agcatatgct acagaggcac ataatgtttg ggccacacat gcctgtgtac 240
ccacagaccc taacccacaa gaagtagtaa tggaaaatgt gacagaaaat tttaacatgt 300
ggaaaaataa catggtagaa cagatgcatg aggatataat cagtttatgg gatcaaagcc 360
taaagccatg tgtaaaatta accccactct gtgttactct aaattgcact gactgtgtga 420
cttcaaattg cactaagttg aagaatgtca ctaataatgc taatattagc aagatggaga 480
tggaggaggg agaaatgaaa aactgctctt ttaacatcac ctcaggcatg agagataaga 540
tgaagaaaga atatgcattt ttttataaac ttgatatagt accaataagt aatgataata 600
ctagctatag attgataagt tgtaatacct cagtcattac acaggcatgt ccaaaagtat 660
cctttgagcc aattcccata cattattgtg ccccggctgg ttttgcgatt ctaaaatgta 720
atgataagaa gttcaatgga acaggaccat gtaaaaatgt cagcacagta caatgtacac 780
atggaattag gccagtagtg tcaactcaat tgctgttaaa tggcagccta gcagaagaag 840
aggtagtaat tagatctgaa aatttaacgg acaatggtaa aaccataata gtacagctga 900
acgaatctgt acacattaat tgtacaagac ccagcaacaa taccagaaaa agtatacata 960
taggaccagg gaaagcattt tatgcaacag gacaaataat aggagatata agacaagcac 1020
actgtaacat tagtgaaaaa caatggaata aaactttaag ccagatagtt aaaaaattaa 1080
gagaacaatt taaaaataaa acaatagtct ttaatcaatc ctcaggaggg gacccagaaa 1140
ttgtaatgca cagttttaat tgtggaggag aatttttcta ctgtaattca acacaactat 1200
ttaatactac ttggaattgg aatgatactg aagggtcaaa taacactgaa agaaatgaaa 1260
gaaatattac actcccatgc agaataaaac aaattgtaaa caggtggcag gaagtaggaa 1320
aagcaatgta tgcccctccc atcagcggac caattagctg ttcatcaaat attacagggc 1380
tgctattaac aagagatggt ggtctcccaa acaatactga gaccttcaga ccagaaggag 1440
gaaatatgaa ggacaattgg agaagtgaat tatataaata taaagtagta aaaattgaac 1500
cattaggaat agcacccacc aaggcaaaga gaagagtggt gcaaagagaa aaaagagcag 1560
tgacattagg agctatgttc cttgggttct tgggagcagc aggaagcact 1610
<210> 102
<211> 486
<212> PRT
<213> Human immunodeficiency virus type 1
<400> 102
Ser Ala Ala Gly Asn Leu Trp Val Thr Val Tyr Tyr Gly Val Pro Val
1 5 10 15
Trp Lys Glu Ala Thr Thr Thr Leu Phe Cys Ala Ser Asp Ala Lys Ala
20 25 30
Tyr Ala Thr Glu Ala His Asn Val Trp Ala Thr His Ala Cys Val Pro
35 40 45
Thr Asp Pro Asn Pro Gln Glu Val Val Met Glu Asn Val Thr Glu Asn
50 55 60
Phe Asn Met Trp Lys Asn Asn Met Val Glu Gln Met His Glu Asp Ile
65 70 75 80
Ile Ser Leu Trp Asp Gln Ser Leu Lys Pro Cys Val Lys Leu Thr Pro
85 90 95
Leu Cys Val Thr Leu Asn Cys Thr Asp Cys Val Thr Ser Asn Cys Thr
100 105 110
Lys Leu Lys Asn Val Thr Asn Asn Ala Asn Ile Ser Lys Met Glu Met
115 120 125
Glu Glu Gly Glu Met Lys Asn Cys Ser Phe Asn Ile Thr Ser Gly Met
130 135 140
Arg Asp Lys Met Lys Lys Glu Tyr Ala Phe Phe Tyr Lys Leu Asp Ile
145 150 155 160
Val Pro Ile Ser Asn Asp Asn Thr Ser Tyr Arg Leu Ile Ser Cys Asn
165 170 175
Thr Ser Val Ile Thr Gln Ala Cys Pro Lys Val Ser Phe Glu Pro Ile
180 185 190
Pro Ile His Tyr Cys Ala Pro Ala Gly Phe Ala Ile Leu Lys Cys Asn
195 200 205
Asp Lys Lys Phe Asn Gly Thr Gly Pro Cys Lys Asn Val Ser Thr Val
210 215 220
Gln Cys Thr His Gly Ile Arg Pro Val Val Ser Thr Gln Leu Leu Leu
225 230 235 240
Asn Gly Ser Leu Ala Glu Glu Glu Val Val Ile Arg Ser Glu Asn Leu
245 250 255
Thr Asp Asn Gly Lys Thr Ile Ile Val Gln Leu Asn Glu Ser Val His
260 265 270
Ile Asn Cys Thr Arg Pro Ser Asn Asn Thr Arg Lys Ser Ile His Ile
275 280 285
Gly Pro Gly Lys Ala Phe Tyr Ala Thr Gly Gln Ile Ile Gly Asp Ile
290 295 300
Arg Gln Ala His Cys Asn Ile Ser Glu Lys Gln Trp Asn Lys Thr Leu
305 310 315 320
Ser Gln Ile Val Lys Lys Leu Arg Glu Gln Phe Lys Asn Lys Thr Ile
325 330 335
Val Phe Asn Gln Ser Ser Gly Gly Asp Pro Glu Ile Val Met His Ser
340 345 350
Phe Asn Cys Gly Gly Glu Phe Phe Tyr Cys Asn Ser Thr Gln Leu Phe
355 360 365
Asn Thr Thr Trp Asn Trp Asn Asp Thr Glu Gly Ser Asn Asn Thr Glu
370 375 380
Arg Asn Glu Arg Asn Ile Thr Leu Pro Cys Arg Ile Lys Gln Ile Val
385 390 395 400
Asn Arg Trp Gln Glu Val Gly Lys Ala Met Tyr Ala Pro Pro Ile Ser
405 410 415
Gly Pro Ile Ser Cys Ser Ser Asn Ile Thr Gly Leu Leu Leu Thr Arg
420 425 430
Asp Gly Gly Leu Pro Asn Asn Thr Glu Thr Phe Arg Pro Glu Gly Gly
435 440 445
Asn Met Lys Asp Asn Trp Arg Ser Glu Leu Tyr Lys Tyr Lys Val Val
450 455 460
Lys Ile Glu Pro Leu Gly Ile Ala Pro Thr Lys Ala Lys Arg Arg Val
465 470 475 480
Val Gln Arg Glu Lys Arg
485
<210> 103
<211> 1610
<212> DNA
<213> Human immunodeficiency virus type 1
<400> 103
agaaagagca gaagacagtg gcaatgagag tgaaggggat caggaagaat tatcagcact 60
ggtggaaatg gggcacgatg ctccttggaa tgttgatgat ctgtagtgct gcagaacaat 120
tgtgggttac agtctattat ggggtacctg tgtggaagga agcaaagacc actctattct 180
gtgcatcaga tgctaaagca tatgaaacag aggtacataa tgtttgggcc gcacatgcct 240
gtgtacccac agaccccaac ccacaagaag tagtattggc aaatgtgaca gaaaatttta 300
atatgtggga caacaacaag gtagaacaga tgcaggagga tataatcagt ttatgggatg 360
aaggcctaaa gccatgtgta aaattaaccc cactctgtat tactctaaat tgcactgacc 420
cctgtgatgc tcaaaatagc actcgaagtt gcacttattt gaatgatacg gtggaagagg 480
agagaggaca aataagaaat tgctctttca atatctccac aaagctagaa aatagaaggc 540
agacaggata tgcagttttt gataaacttg atttagtacc agtagatggt ggtaataata 600
ctgtcagata taggttgata aattgtaaca cctcagtcat tacacaagca tgtccaaagg 660
tatcctttga accaattccc atacattatt gtaccccagc tggttttgcg attctaaagt 720
gtaatgataa gaagttcaat ggaacagggc catgcacaaa tgtcagcaca gtacaatgca 780
cacatggaat taggccagta gtgtcaactc aactgctgtt aaatggcagt ctggcagaag 840
aagaggtagt aattagatct aaaaatttca cagacaatac taaaactata atagtacagc 900
tgaatgaatc tgtacaaatt aattgtacaa gacccaacaa caatacaaga cagagtacac 960
ctatgggacc agggaaagca ctttacacaa cacagataat aggagacata agacaagcac 1020
attgtaacat tagtacaaaa gattggaatg acaccttaaa aaagatagtt acaaaattag 1080
aagaacaatt tgaaaataaa acaataagct ttaatcaatc ctcaggaggg gacccagaaa 1140
ttgtaaagca cagttttaat tgtggagggg aattcttcta ctgtgataca acaaaactat 1200
ttaatagtac ttggcctgca agtatcaatt ataccactgg agtaaatatc actggagtta 1260
tcacactccc atgtagaata aaacaaatta taaacaggtg gcagggagga ggaaaggcaa 1320
tgtatgctcc tcccatcagt ggaccaatta gatgttcatc aaatattaca gggctgctat 1380
taacaagaga tggtggtgat gacaataatg gggctgagac ctttagacct ggaggaggag 1440
atatgaggga caattggaga agtgaattat ataaatataa agtagtaaaa attgaaccat 1500
taggagtagc acccaccaag gcaaagagaa gagtggtgca gagagaaaaa agagcagtgg 1560
gaacactagg agctatgttc cttgggttct tgggagcagc aggaagcact 1610
<210> 104
<211> 483
<212> PRT
<213> Human immunodeficiency virus type 1
<400> 104
Ser Ala Ala Glu Gln Leu Trp Val Thr Val Tyr Tyr Gly Val Pro Val
1 5 10 15
Trp Lys Glu Ala Lys Thr Thr Leu Phe Cys Ala Ser Asp Ala Lys Ala
20 25 30
Tyr Glu Thr Glu Val His Asn Val Trp Ala Ala His Ala Cys Val Pro
35 40 45
Thr Asp Pro Asn Pro Gln Glu Val Val Leu Ala Asn Val Thr Glu Asn
50 55 60
Phe Asn Met Trp Asp Asn Asn Lys Val Glu Gln Met Gln Glu Asp Ile
65 70 75 80
Ile Ser Leu Trp Asp Glu Gly Leu Lys Pro Cys Val Lys Leu Thr Pro
85 90 95
Leu Cys Ile Thr Leu Asn Cys Thr Asp Pro Cys Asp Ala Gln Asn Ser
100 105 110
Thr Arg Ser Cys Thr Tyr Leu Asn Asp Thr Val Glu Glu Glu Arg Gly
115 120 125
Gln Ile Arg Asn Cys Ser Phe Asn Ile Ser Thr Lys Leu Glu Asn Arg
130 135 140
Arg Gln Thr Gly Tyr Ala Val Phe Asp Lys Leu Asp Leu Val Pro Val
145 150 155 160
Asp Gly Gly Asn Asn Thr Val Arg Tyr Arg Leu Ile Asn Cys Asn Thr
165 170 175
Ser Val Ile Thr Gln Ala Cys Pro Lys Val Ser Phe Glu Pro Ile Pro
180 185 190
Ile His Tyr Cys Thr Pro Ala Gly Phe Ala Ile Leu Lys Cys Asn Asp
195 200 205
Lys Lys Phe Asn Gly Thr Gly Pro Cys Thr Asn Val Ser Thr Val Gln
210 215 220
Cys Thr His Gly Ile Arg Pro Val Val Ser Thr Gln Leu Leu Leu Asn
225 230 235 240
Gly Ser Leu Ala Glu Glu Glu Val Val Ile Arg Ser Lys Asn Phe Thr
245 250 255
Asp Asn Thr Lys Thr Ile Ile Val Gln Leu Asn Glu Ser Val Gln Ile
260 265 270
Asn Cys Thr Arg Pro Asn Asn Asn Thr Arg Gln Ser Thr Pro Met Gly
275 280 285
Pro Gly Lys Ala Leu Tyr Thr Thr Gln Ile Ile Gly Asp Ile Arg Gln
290 295 300
Ala His Cys Asn Ile Ser Thr Lys Asp Trp Asn Asp Thr Leu Lys Lys
305 310 315 320
Ile Val Thr Lys Leu Glu Glu Gln Phe Glu Asn Lys Thr Ile Ser Phe
325 330 335
Asn Gln Ser Ser Gly Gly Asp Pro Glu Ile Val Lys His Ser Phe Asn
340 345 350
Cys Gly Gly Glu Phe Phe Tyr Cys Asp Thr Thr Lys Leu Phe Asn Ser
355 360 365
Thr Trp Pro Ala Ser Ile Asn Tyr Thr Thr Gly Val Asn Ile Thr Gly
370 375 380
Val Ile Thr Leu Pro Cys Arg Ile Lys Gln Ile Ile Asn Arg Trp Gln
385 390 395 400
Gly Gly Gly Lys Ala Met Tyr Ala Pro Pro Ile Ser Gly Pro Ile Arg
405 410 415
Cys Ser Ser Asn Ile Thr Gly Leu Leu Leu Thr Arg Asp Gly Gly Asp
420 425 430
Asp Asn Asn Gly Ala Glu Thr Phe Arg Pro Gly Gly Gly Asp Met Arg
435 440 445
Asp Asn Trp Arg Ser Glu Leu Tyr Lys Tyr Lys Val Val Lys Ile Glu
450 455 460
Pro Leu Gly Val Ala Pro Thr Lys Ala Lys Arg Arg Val Val Gln Arg
465 470 475 480
Glu lys arg
<210> 105
<211> 1589
<212> DNA
<213> Human immunodeficiency virus type 1
<400> 105
agaaagagca gaagacagtg gcaatgaaag tgagggagac caggaggaat tattgggact 60
tgtggagatg gggcacgatg ctccttggga tgttaatgat ctgtagtgct acagaacaat 120
tatgggtcac aatctattat ggggtacctg tgtggaaaga agcagatacc actctatttt 180
gtgcatcaga tgctaaagca tatgatacag agaaacataa tgtttgggcc acacatgcct 240
gtgtacccac agaccctaac ccacaagaag tagtaatggc aaatgtgaca gaaaatttta 300
acatgtggaa aaataacatg gtagaacaga tgcagaagga tattatcagt ctatgggatg 360
aaagcttaaa accatgtgta aaattaaccc cactctgtat cactttaaat tgcaatactt 420
taaattgcac acataacaat aattgtagta gtttgagaga agaaatgaca aactgctcct 480
tcaatgccac ctcaaaattg agagataagg ttcggaaaca atatgcaatt ttttctaaac 540
ttgatgtggt aaaactagat agtaataata atgaaacaga atataggttg ataaattgta 600
acacctcagt cgttacacag gcctgtccaa aggtatcatt tgagccaatt cccatacatt 660
attgtgcccc ggctggtttt gcgattctaa agtgtaacaa taagacattc gatggaaaag 720
gaccatgtac aaatgtcagc acagtacaat gtacacatgg aattaggcca gtagtgtcaa 780
ctcaactgct gttaaatggc agtctagcag aagaagggat agtaattaga tctgacaata 840
tctcagacaa tactaaaacc ataatagtac agctgaatga atctgtagca attaattgtt 900
caagacccca caacaataca agaaaaagta tacatatagc accagggaga gccttttatg 960
caacaggaga cataatagga gatataagac aagcacattg taacattagt agaaaacaat 1020
ggaataaaac tttagaacag gtagctaaaa aattaagaga aaaatttgta aataaagcaa 1080
ttatctttaa aaactcctca ggaggggacc cagaaattgt aatgcacagt tttaattgta 1140
gaggggaatt tttctactgt aatacaacac cactgtttaa tgggacttgg aatggtacta 1200
atttgagtac taataaggag tcaaatgaca caatcatact ccaatgtaga ataaaacaaa 1260
ttataaacat gtggcaggaa gtaggaaaag caatgtatgc ccctcccatt gcaggacaaa 1320
ttaactgttc atcaaaaatt acagggctac tattaacaag agatggtagt agcacaaatg 1380
ggacaaatga gactgagatc ttcagacctg gaggaggaga tatgagggac aattggagaa 1440
gcgaattata taaatataaa gtagtaagag ttgaaccaat aggaatagca cccaccaagg 1500
caaagagaag agtggtgcag agagaaaaaa gagcagtggg aacgttagga gctatgttcc 1560
ttgggttctt gggagcagca ggaagcact 1589
<210> 106
<211> 476
<212> PRT
<213> Human immunodeficiency virus type 1
<400> 106
Ser Ala Thr Glu Gln Leu Trp Val Thr Ile Tyr Tyr Gly Val Pro Val
1 5 10 15
Trp Lys Glu Ala Asp Thr Thr Leu Phe Cys Ala Ser Asp Ala Lys Ala
20 25 30
Tyr Asp Thr Glu Lys His Asn Val Trp Ala Thr His Ala Cys Val Pro
35 40 45
Thr Asp Pro Asn Pro Gln Glu Val Val Met Ala Asn Val Thr Glu Asn
50 55 60
Phe Asn Met Trp Lys Asn Asn Met Val Glu Gln Met Gln Lys Asp Ile
65 70 75 80
Ile Ser Leu Trp Asp Glu Ser Leu Lys Pro Cys Val Lys Leu Thr Pro
85 90 95
Leu Cys Ile Thr Leu Asn Cys Asn Thr Leu Asn Cys Thr His Asn Asn
100 105 110
Asn Cys Ser Ser Leu Arg Glu Glu Met Thr Asn Cys Ser Phe Asn Ala
115 120 125
Thr Ser Lys Leu Arg Asp Lys Val Arg Lys Gln Tyr Ala Ile Phe Ser
130 135 140
Lys Leu Asp Val Val Lys Leu Asp Ser Asn Asn Asn Glu Thr Glu Tyr
145 150 155 160
Arg Leu Ile Asn Cys Asn Thr Ser Val Val Thr Gln Ala Cys Pro Lys
165 170 175
Val Ser Phe Glu Pro Ile Pro Ile His Tyr Cys Ala Pro Ala Gly Phe
180 185 190
Ala Ile Leu Lys Cys Asn Asn Lys Thr Phe Asp Gly Lys Gly Pro Cys
195 200 205
Thr Asn Val Ser Thr Val Gln Cys Thr His Gly Ile Arg Pro Val Val
210 215 220
Ser Thr Gln Leu Leu Leu Asn Gly Ser Leu Ala Glu Glu Gly Ile Val
225 230 235 240
Ile Arg Ser Asp Asn Ile Ser Asp Asn Thr Lys Thr Ile Ile Val Gln
245 250 255
Leu Asn Glu Ser Val Ala Ile Asn Cys Ser Arg Pro His Asn Asn Thr
260 265 270
Arg Lys Ser Ile His Ile Ala Pro Gly Arg Ala Phe Tyr Ala Thr Gly
275 280 285
Asp Ile Ile Gly Asp Ile Arg Gln Ala His Cys Asn Ile Ser Arg Lys
290 295 300
Gln Trp Asn Lys Thr Leu Glu Gln Val Ala Lys Lys Leu Arg Glu Lys
305 310 315 320
Phe Val Asn Lys Ala Ile Ile Phe Lys Asn Ser Ser Gly Gly Asp Pro
325 330 335
Glu Ile Val Met His Ser Phe Asn Cys Arg Gly Glu Phe Phe Tyr Cys
340 345 350
Asn Thr Thr Pro Leu Phe Asn Gly Thr Trp Asn Gly Thr Asn Leu Ser
355 360 365
Thr Asn Lys Glu Ser Asn Asp Thr Ile Ile Leu Gln Cys Arg Ile Lys
370 375 380
Gln Ile Ile Asn Met Trp Gln Glu Val Gly Lys Ala Met Tyr Ala Pro
385 390 395 400
Pro Ile Ala Gly Gln Ile Asn Cys Ser Ser Lys Ile Thr Gly Leu Leu
405 410 415
Leu Thr Arg Asp Gly Ser Ser Thr Asn Gly Thr Asn Glu Thr Glu Ile
420 425 430
Phe Arg Pro Gly Gly Gly Asp Met Arg Asp Asn Trp Arg Ser Glu Leu
435 440 445
Tyr Lys Tyr Lys Val Val Arg Val Glu Pro Ile Gly Ile Ala Pro Thr
450 455 460
Lys Ala Lys Arg Arg Val Val Gln Arg Glu Lys Arg
465 470 475
<210> 107
<211> 1689
<212> DNA
<213> Human immunodeficiency virus type 1
<400> 107
agaaagagca gaagacagtg gcaatgagag tgacggggat caggaagaat tacttatgga 60
gatggggcac cttattcctg gggatattga tgatctgtaa ggctgcagaa aacttgtggg 120
tcacagtcta ttatggggta cctgtgtgga aagacgcaac caccactcta ttttgtgcat 180
cagatgctaa agctgttgat acagaagtac acaatgtgtg ggccacacat gcctgtgtac 240
ccacagaccc aaacccacaa gaagtagtat tgtataatgt gacagaaaat tttaacatgt 300
ggaaaaataa catggtagaa cagatgcatg aggatataat cagtttatgg gatgaaagtc 360
taaagccatg tgtaaagcta accccactct gtgttacttt gaattgcact gatgaattga 420
atatcaatac taataatacc tgtagtaata ccagtagtgg cactaataat accagtaatt 480
gcactaatgc tgagtcgact atcatcagta ccagtaatag tagcaatacc agaaatatca 540
gtgatagtag caagatagag aaaggagaaa taaaaaactg tactttcaat atcaccacaa 600
gcataagaga taaggtgcag aaagaatatg cactgtttta taaacttgat gtagtaccaa 660
caggtagtga caatactagc tataggttga taagttgtaa tacctcagtc attacacagg 720
cctgtccaaa ggtaaccttt gagccaattc ctatacatta ttgtgccccg gctggttatg 780
cgattgtaaa gtgcaacaat aagacattca gtggaaaagg accatgtaaa aatgtcagca 840
cagtacaatg cacacatggg attaagccag tagtatcaac tcagttgctg ttaaatggca 900
gtctggcaga aaaagatata gtaattagat ctgacaactt ctcaaacaat gctaaaacca 960
taatagtgca gctgaacaca tctgtagaaa ttaattgtac aagacccagc aacaatacaa 1020
gaaaaagtat tcatatagga ccagggagag cattttatgc aacagatata ataggagata 1080
taagacaagc acattgtaac attagtgaag aaaattggta taaaactcta aagcaggtag 1140
ctatgaaatt aggagaacag tttcagaata aaaaaatagt ctttaatcaa tcctcaggag 1200
gggacccaga aattgtaatg cacagtttta attgtggagg ggaatttttc ttctgtagta 1260
catcacaact gtttaatagt acttggaatt atagtacttg ggatgaaata gaaaataaaa 1320
ctcaaggaaa tggcacactc acactcccat gcagaataaa acaaattgta aacatgtggc 1380
aggaagtagg aaaagcaatg tatgcccctc ctatcaaagg aaatattaca tgttcatcaa 1440
agattacagg gttgctatta acaagagatg gtggtaagac taatatcaat ggcaccgaga 1500
tcttcagacc aggaggagga gatatgaggg acaattggag aagtgaacta tataaatata 1560
aagtagtaaa aattgaacca ttaggaatag cacccaccaa ggcaaagaga agagtggtgc 1620
agagagaaaa aagagcagtg ggaataggag ctatgttcct tgggttcttg ggaggcagca 1680
ggaagcact 1689
<210> 108
<211> 512
<212> PRT
<213> Human immunodeficiency virus type 1
<400> 108
Lys Ala Ala Glu Asn Leu Trp Val Thr Val Tyr Tyr Gly Val Pro Val
1 5 10 15
Trp Lys Asp Ala Thr Thr Thr Leu Phe Cys Ala Ser Asp Ala Lys Ala
20 25 30
Val Asp Thr Glu Val His Asn Val Trp Ala Thr His Ala Cys Val Pro
35 40 45
Thr Asp Pro Asn Pro Gln Glu Val Val Leu Tyr Asn Val Thr Glu Asn
50 55 60
Phe Asn Met Trp Lys Asn Asn Met Val Glu Gln Met His Glu Asp Ile
65 70 75 80
Ile Ser Leu Trp Asp Glu Ser Leu Lys Pro Cys Val Lys Leu Thr Pro
85 90 95
Leu Cys Val Thr Leu Asn Cys Thr Asp Glu Leu Asn Ile Asn Thr Asn
100 105 110
Asn Thr Cys Ser Asn Thr Ser Ser Gly Thr Asn Asn Thr Ser Asn Cys
115 120 125
Thr Asn Ala Glu Ser Thr Ile Ile Ser Thr Ser Asn Ser Ser Asn Thr
130 135 140
Arg Asn Ile Ser Asp Ser Ser Lys Ile Glu Lys Gly Glu Ile Lys Asn
145 150 155 160
Cys Thr Phe Asn Ile Thr Thr Ser Ile Arg Asp Lys Val Gln Lys Glu
165 170 175
Tyr Ala Leu Phe Tyr Lys Leu Asp Val Val Pro Thr Gly Ser Asp Asn
180 185 190
Thr Ser Tyr Arg Leu Ile Ser Cys Asn Thr Ser Val Ile Thr Gln Ala
195 200 205
Cys Pro Lys Val Thr Phe Glu Pro Ile Pro Ile His Tyr Cys Ala Pro
210 215 220
Ala Gly Tyr Ala Ile Val Lys Cys Asn Asn Lys Thr Phe Ser Gly Lys
225 230 235 240
Gly Pro Cys Lys Asn Val Ser Thr Val Gln Cys Thr His Gly Ile Lys
245 250 255
Pro Val Val Ser Thr Gln Leu Leu Leu Asn Gly Ser Leu Ala Glu Lys
260 265 270
Asp Ile Val Ile Arg Ser Asp Asn Phe Ser Asn Asn Ala Lys Thr Ile
275 280 285
Ile Val Gln Leu Asn Thr Ser Val Glu Ile Asn Cys Thr Arg Pro Ser
290 295 300
Asn Asn Thr Arg Lys Ser Ile His Ile Gly Pro Gly Arg Ala Phe Tyr
305 310 315 320
Ala Thr Asp Ile Ile Gly Asp Ile Arg Gln Ala His Cys Asn Ile Ser
325 330 335
Glu Glu Asn Trp Tyr Lys Thr Leu Lys Gln Val Ala Met Lys Leu Gly
340 345 350
Glu Gln Phe Gln Asn Lys Lys Ile Val Phe Asn Gln Ser Ser Gly Gly
355 360 365
Asp Pro Glu Ile Val Met His Ser Phe Asn Cys Gly Gly Glu Phe Phe
370 375 380
Phe Cys Ser Thr Ser Gln Leu Phe Asn Ser Thr Trp Asn Tyr Ser Thr
385 390 395 400
Trp Asp Glu Ile Glu Asn Lys Thr Gln Gly Asn Gly Thr Leu Thr Leu
405 410 415
Pro Cys Arg Ile Lys Gln Ile Val Asn Met Trp Gln Glu Val Gly Lys
420 425 430
Ala Met Tyr Ala Pro Pro Ile Lys Gly Asn Ile Thr Cys Ser Ser Lys
435 440 445
Ile Thr Gly Leu Leu Leu Thr Arg Asp Gly Gly Lys Thr Asn Ile Asn
450 455 460
Gly Thr Glu Ile Phe Arg Pro Gly Gly Gly Asp Met Arg Asp Asn Trp
465 470 475 480
Arg Ser Glu Leu Tyr Lys Tyr Lys Val Val Lys Ile Glu Pro Leu Gly
485 490 495
Ile Ala Pro Thr Lys Ala Lys Arg Arg Val Val Gln Arg Glu Lys Arg
500 505 510
<210> 109
<211> 1616
<212> DNA
<213> Human immunodeficiency virus type 1
<400> 109
agaaagagca gaagacagtg gcaatgagag tgatggggat caggaggaat tatcagcact 60
tgtggagatg gggcaccatg ctccttggga tgttgatgat ctgtaaaatc actacaggaa 120
aaacgtgggt cacagtctat tatggggtac ctgtgtggaa agaagcaact accactctat 180
tttgtgcatc agatgctaaa gcatatgata cagaggtaca taatgtttgg gccacacatg 240
cctgtgtacc cacagacccc aacccacaaa aaatagtaat ggtaaatgtg acagaggaat 300
ttaacatgtg gaaaaataac atggtagaac agatgcatga ggatataatc agtttatggg 360
atcagagcct aaagccatgt gtaaaattaa ccccactctg cgttacttta aattgcaatg 420
atactacgga gagaaattgc actggtcccg atggtagaaa aataaattgc actgaagtga 480
aaaattgctc tctcaatatc accacaaaca taagagataa ggtgcagaaa gaatatgcac 540
ttttttatag aactgatgtg gtgccaatag ataataataa tactagtacc agcagtcata 600
gctataggtt gataagttgt aatacctcag tccttacaca gacctgtcca aaagtatcct 660
ttcagccaat tcccatacat tattgtgccc cggctggttt tgcgattcta aaatgtaaca 720
ataggacatt caatggaaaa ggagaatgtg gtaatgtcag cacagtacaa tgtacacatg 780
gaattaagcc agtagtatca actcagctgc tgctaagtgg cagtctagca gaacaagaga 840
tagtgcttag atctgacaac ttctcagaca atgctaaaac cataatagta cagctgacta 900
aacctgtaga aattaattgt acacgaccca acaacaatac aagaaaaagt atagctatag 960
gaccagggag agcatttatt gcaacaggag acataatagg agatataaga caggcacatt 1020
gtaacattag taaagtagca tggaataaca ctatagaaga ggtagcaaga aaattaagca 1080
aacaatttgg gaatagatca ataaccttta atcaatcctc aggaggggac ccagaaattg 1140
taatgcacag ttttaattgt ggaggggaat ttttctactg taatacaaca ggactattta 1200
gtagtacttg gaatgttact gataattgga atgctactga agaggcaaat accactaggg 1260
taaatatcac actcccatgc agaataagac aaattgtaaa catgtggcag gaagtaggaa 1320
aagcaatgta tgcccctccc atcagaggac aaattagttg ctcatcaaat attacaggac 1380
tgatattaac aagagatggt ggtaaccaga gcaacgagac gacccctgag atctttagac 1440
ctggaggagg agatatgagg gacaattgga gaagtgaatt atataaatat aaagtagtaa 1500
aaattgaacc tataggaata gcacccacca aggcaaagag aagagtggtg cagagagaaa 1560
aaagaggggt actaggagct atgttccttg ggttcttggg agcagcagga agcact 1616
<210> 110
<211> 487
<212> PRT
<213> Human immunodeficiency virus type 1
<400> 110
Lys Ile Thr Thr Gly Lys Thr Trp Val Thr Val Tyr Tyr Gly Val Pro
1 5 10 15
Val Trp Lys Glu Ala Thr Thr Thr Leu Phe Cys Ala Ser Asp Ala Lys
20 25 30
Ala Tyr Asp Thr Glu Val His Asn Val Trp Ala Thr His Ala Cys Val
35 40 45
Pro Thr Asp Pro Asn Pro Gln Lys Ile Val Met Val Asn Val Thr Glu
50 55 60
Glu Phe Asn Met Trp Lys Asn Asn Met Val Glu Gln Met His Glu Asp
65 70 75 80
Ile Ile Ser Leu Trp Asp Gln Ser Leu Lys Pro Cys Val Lys Leu Thr
85 90 95
Pro Leu Cys Val Thr Leu Asn Cys Asn Asp Thr Thr Glu Arg Asn Cys
100 105 110
Thr Gly Pro Asp Gly Arg Lys Ile Asn Cys Thr Glu Val Lys Asn Cys
115 120 125
Ser Leu Asn Ile Thr Thr Asn Ile Arg Asp Lys Val Gln Lys Glu Tyr
130 135 140
Ala Leu Phe Tyr Arg Thr Asp Val Val Pro Ile Asp Asn Asn Asn Thr
145 150 155 160
Ser Thr Ser Ser His Ser Tyr Arg Leu Ile Ser Cys Asn Thr Ser Val
165 170 175
Leu Thr Gln Thr Cys Pro Lys Val Ser Phe Gln Pro Ile Pro Ile His
180 185 190
Tyr Cys Ala Pro Ala Gly Phe Ala Ile Leu Lys Cys Asn Asn Arg Thr
195 200 205
Phe Asn Gly Lys Gly Glu Cys Gly Asn Val Ser Thr Val Gln Cys Thr
210 215 220
His Gly Ile Lys Pro Val Val Ser Thr Gln Leu Leu Leu Ser Gly Ser
225 230 235 240
Leu Ala Glu Gln Glu Ile Val Leu Arg Ser Asp Asn Phe Ser Asp Asn
245 250 255
Ala Lys Thr Ile Ile Val Gln Leu Thr Lys Pro Val Glu Ile Asn Cys
260 265 270
Thr Arg Pro Asn Asn Asn Thr Arg Lys Ser Ile Ala Ile Gly Pro Gly
275 280 285
Arg Ala Phe Ile Ala Thr Gly Asp Ile Ile Gly Asp Ile Arg Gln Ala
290 295 300
His Cys Asn Ile Ser Lys Val Ala Trp Asn Asn Thr Ile Glu Glu Val
305 310 315 320
Ala Arg Lys Leu Ser Lys Gln Phe Gly Asn Arg Ser Ile Thr Phe Asn
325 330 335
Gln Ser Ser Gly Gly Asp Pro Glu Ile Val Met His Ser Phe Asn Cys
340 345 350
Gly Gly Glu Phe Phe Tyr Cys Asn Thr Thr Gly Leu Phe Ser Ser Thr
355 360 365
Trp Asn Val Thr Asp Asn Trp Asn Ala Thr Glu Glu Ala Asn Thr Thr
370 375 380
Arg Val Asn Ile Thr Leu Pro Cys Arg Ile Arg Gln Ile Val Asn Met
385 390 395 400
Trp Gln Glu Val Gly Lys Ala Met Tyr Ala Pro Pro Ile Arg Gly Gln
405 410 415
Ile Ser Cys Ser Ser Asn Ile Thr Gly Leu Ile Leu Thr Arg Asp Gly
420 425 430
Gly Asn Gln Ser Asn Glu Thr Thr Pro Glu Ile Phe Arg Pro Gly Gly
435 440 445
Gly Asp Met Arg Asp Asn Trp Arg Ser Glu Leu Tyr Lys Tyr Lys Val
450 455 460
Val Lys Ile Glu Pro Ile Gly Ile Ala Pro Thr Lys Ala Lys Arg Arg
465 470 475 480
Val Val Gln Arg Glu Lys Arg
485
<210> 111
<211> 1694
<212> DNA
<213> Human immunodeficiency virus type 1
<400> 111
agaaagagca gaagacagtg gcaatgagag tgaaggggat caggaagaat tgtcactcct 60
ggagatgggg caccatgctc ctatggagat ggggcaccat gctccttggg ttgttaatga 120
tgatctgtaa tgctaaagaa gaaaaaacgt gggtcacagt atattatggg gtacctgtgt 180
ggaaagaagc gaccaccact ctattttgtg catcagatgc taaagcatat gacacagagg 240
tacataatgt ttgggccaca catgcctgtg tacctacaga ccccgaccca caggaaatat 300
tcttggaaaa tgtgacagaa aattttaaca tgtggaaaaa taacatggta gaacagatgc 360
atgaggatat aatcagtttg tgggatcaaa gcctaaagcc atgtgtaaaa ttaaccccac 420
tctgtgttac gttaaattgc actaatgtga ggattactag cataaattgc actgatgcga 480
gtagtaatag cactggtgtg gaaaaaacta ctagcactaa tggggggaat gttactattt 540
gtaatactac tgaggagata aaaaactgct ccttcaacat caccaccaac atgaaagata 600
agatacagaa gacatatgca ctgttttata aacttgatgt agaaccaata gataagggga 660
atgagaatag caccagctat agggtgataa gttgtaacac ttcaaccatt acacaggcct 720
gtccaaaggt atcctttgag ccaattccta tacattattg tgccccggct ggttttgcga 780
ttctaaaatg tagagataaa aggtttaatg gaacaggacc atgtacaaat gtcagcacag 840
tacaatgtac acatggaatt aaaccagtag tatcaactca actgctgtta aatggcagtc 900
tagcagaaga agagatagta cttagatctg aaaatttctc gaacaatgct aaaaacatat 960
taatacagct aaaagaacct gtagaaatta attgtacaag acccaacaac aatacaagaa 1020
aaggtataca tataggacca ggaagagcat tttatggaac agatataata ggagatataa 1080
gacaagcaca ttgtaacatt agcagagaaa aatggaatag cactttaagt cagatagttg 1140
ataaattaag agaacaatat gggaaaaata aaacaataat ctttgatcaa ccatcaggag 1200
gggacccaga aattgtaacg cacagtttta attgtggagg agaatttttc tactgtaatt 1260
caacacaact gtttaatagc acttggtatg ataatagtac ttggaatgag aataaaaatc 1320
acactaaaaa tgacacaatc acgctcccat gcagaataaa gcaatttata aacatgtggc 1380
aggaagtagg aaaagcaatg tatgcccctc ccatcagagg acaaattaaa tgctcatcaa 1440
atattacagg gctaataata acaagagatg gggggaacaa caacagcgag accttcaaca 1500
acgagacctt cagacctgga ggaggaaata tgaaggacaa ttggagaagt gaattatata 1560
aatataaagt agtaagaatt gagccattag gagtagcacc caccagggca aagaggagag 1620
tggtgcagag agaaaaaaga gcagtgggaa taggagctgt gttccttggg ttcttgggag 1680
cagcaggaag cact 1694
<210> 112
<211> 504
<212> PRT
<213> Human immunodeficiency virus type 1
<400> 112
Asn Ala Lys Glu Glu Lys Thr Trp Val Thr Val Tyr Tyr Gly Val Pro
1 5 10 15
Val Trp Lys Glu Ala Thr Thr Thr Leu Phe Cys Ala Ser Asp Ala Lys
20 25 30
Ala Tyr Asp Thr Glu Val His Asn Val Trp Ala Thr His Ala Cys Val
35 40 45
Pro Thr Asp Pro Asp Pro Gln Glu Ile Phe Leu Glu Asn Val Thr Glu
50 55 60
Asn Phe Asn Met Trp Lys Asn Asn Met Val Glu Gln Met His Glu Asp
65 70 75 80
Ile Ile Ser Leu Trp Asp Gln Ser Leu Lys Pro Cys Val Lys Leu Thr
85 90 95
Pro Leu Cys Val Thr Leu Asn Cys Thr Asn Val Arg Ile Thr Ser Ile
100 105 110
Asn Cys Thr Asp Ala Ser Ser Asn Ser Thr Gly Val Glu Lys Thr Thr
115 120 125
Ser Thr Asn Gly Gly Asn Val Thr Ile Cys Asn Thr Thr Glu Glu Ile
130 135 140
Lys Asn Cys Ser Phe Asn Ile Thr Thr Asn Met Lys Asp Lys Ile Gln
145 150 155 160
Lys Thr Tyr Ala Leu Phe Tyr Lys Leu Asp Val Glu Pro Ile Asp Lys
165 170 175
Gly Asn Glu Asn Ser Thr Ser Tyr Arg Val Ile Ser Cys Asn Thr Ser
180 185 190
Thr Ile Thr Gln Ala Cys Pro Lys Val Ser Phe Glu Pro Ile Pro Ile
195 200 205
His Tyr Cys Ala Pro Ala Gly Phe Ala Ile Leu Lys Cys Arg Asp Lys
210 215 220
Arg Phe Asn Gly Thr Gly Pro Cys Thr Asn Val Ser Thr Val Gln Cys
225 230 235 240
Thr His Gly Ile Lys Pro Val Val Ser Thr Gln Leu Leu Leu Asn Gly
245 250 255
Ser Leu Ala Glu Glu Glu Ile Val Leu Arg Ser Glu Asn Phe Ser Asn
260 265 270
Asn Ala Lys Asn Ile Leu Ile Gln Leu Lys Glu Pro Val Glu Ile Asn
275 280 285
Cys Thr Arg Pro Asn Asn Asn Thr Arg Lys Gly Ile His Ile Gly Pro
290 295 300
Gly Arg Ala Phe Tyr Gly Thr Asp Ile Ile Gly Asp Ile Arg Gln Ala
305 310 315 320
His Cys Asn Ile Ser Arg Glu Lys Trp Asn Ser Thr Leu Ser Gln Ile
325 330 335
Val Asp Lys Leu Arg Glu Gln Tyr Gly Lys Asn Lys Thr Ile Ile Phe
340 345 350
Asp Gln Pro Ser Gly Gly Asp Pro Glu Ile Val Thr His Ser Phe Asn
355 360 365
Cys Gly Gly Glu Phe Phe Tyr Cys Asn Ser Thr Gln Leu Phe Asn Ser
370 375 380
Thr Trp Tyr Asp Asn Ser Thr Trp Asn Glu Asn Lys Asn His Thr Lys
385 390 395 400
Asn Asp Thr Ile Thr Leu Pro Cys Arg Ile Lys Gln Phe Ile Asn Met
405 410 415
Trp Gln Glu Val Gly Lys Ala Met Tyr Ala Pro Pro Ile Arg Gly Gln
420 425 430
Ile Lys Cys Ser Ser Asn Ile Thr Gly Leu Ile Ile Thr Arg Asp Gly
435 440 445
Gly Asn Asn Asn Ser Glu Thr Phe Asn Asn Glu Thr Phe Arg Pro Gly
450 455 460
Gly Gly Asn Met Lys Asp Asn Trp Arg Ser Glu Leu Tyr Lys Tyr Lys
465 470 475 480
Val Val Arg Ile Glu Pro Leu Gly Val Ala Pro Thr Arg Ala Lys Arg
485 490 495
Arg Val Val Gln Arg Glu Lys Arg
500
<210> 113
<211> 1685
<212> DNA
<213> Human immunodeficiency virus type 1
<400> 113
agaaagagca gaagacagtg gcaatgagag tgaaggggac caagacgaat tatcagcact 60
tatggagatg gggcatcatg ctcctgggga tattgatgat ctgtagtgct acagaacaat 120
ggtgggtcac agtctattat ggagtacctg tgtggagaga tgcaaatacc actctatttt 180
gtgcatcaga ttctaaagca tatgctacag aggtacataa tgtttgggcc acacatgcct 240
gtgtacccac agaccccaac ccacaagaaa taaacttgga aaatgtaaca gaagagttta 300
acatatggaa aaataacatg gtagaacaaa tgcatgagga tataatcagt ttatgggatc 360
aaagcctaaa gccatgtgta aaattaaccc cactctgtgt tactttaaat tgcactgatt 420
acaatggtac tcataccaat actactaata ccactagtat ttatggggaa aagatggaaa 480
taggagaagt aaagaaatgc tctttcaatg ctaccacaat cataagagat aaggtggaca 540
aagaagaagc acttttttat aaacttgata tagtaccaat agatggtaat aatgagacta 600
acatagttaa taatgggact aataatacta gtaccaacta taccagctat aggctaataa 660
attgtaacac ctcagtcatt acacaggcct gtccagaggt atcctttgag ccaattccca 720
tacattattg tgccccggct ggttttgcga ttctaaagtg taaagagaag gcgttcaatg 780
gaagtgggcc atgtaaaaat gtcagctcag tacaatgtac acatggaatt aagccagtag 840
tatcaactca attgccgtta aatggcagtc tagcagaaga agaagtagta attagatctg 900
aaaatttcac aaacaatgct aaaaccataa tagtacagct gaaagaagct gtaaacatta 960
gttgtataag gcccaacaac aatacaagaa aaagtatacc tataggacca gggagagcat 1020
tttatgcaac aggagacata ataggagata taggacaagc acattgtaac cttagtagaa 1080
caaattggaa taaaacttta caacagatag ctacaaaatt aggagaaaag tttaataaaa 1140
caacaataat ctttaatcaa tcctcaggag gggacccaga aattgtaatg cacagtttta 1200
tttgtggagg ggaatttttc tactgtaata caacacaact gtttaatagt acttggaact 1260
gtactgagaa tgggaattgt acactggtta ccggtacttg gcctgacagg ccaaatagaa 1320
ctggagaaaa tgacacaatc acactcccat gcagaataaa acaaatcata aacctgtggc 1380
aggaagtagg aaaagcaatg tatgcctctc ccatccaggg actaattaat tgtacatcaa 1440
atattacagg gctgctatta acaagagatg gtggtaccca tagtagacag aatgagacct 1500
tcagacctga aggaggaaat atgaaagaca attggagaag tgaattatat aaatataaag 1560
tagtaagaat tgaaccatta ggagtagcac ccaccaaggc aaagagaaga gtggtgcaga 1620
gagaaaaaag agcagtggga ttgggagcta tgatccttgg gttcttggga gcagcaggaa 1680
gcact 1685
<210> 114
<211> 509
<212> PRT
<213> Human immunodeficiency virus type 1
<400> 114
Ser Ala Thr Glu Gln Trp Trp Val Thr Val Tyr Tyr Gly Val Pro Val
1 5 10 15
Trp Arg Asp Ala Asn Thr Thr Leu Phe Cys Ala Ser Asp Ser Lys Ala
20 25 30
Tyr Ala Thr Glu Val His Asn Val Trp Ala Thr His Ala Cys Val Pro
35 40 45
Thr Asp Pro Asn Pro Gln Glu Ile Asn Leu Glu Asn Val Thr Glu Glu
50 55 60
Phe Asn Ile Trp Lys Asn Asn Met Val Glu Gln Met His Glu Asp Ile
65 70 75 80
Ile Ser Leu Trp Asp Gln Ser Leu Lys Pro Cys Val Lys Leu Thr Pro
85 90 95
Leu Cys Val Thr Leu Asn Cys Thr Asp Tyr Asn Gly Thr His Thr Asn
100 105 110
Thr Thr Asn Thr Thr Ser Ile Tyr Gly Glu Lys Met Glu Ile Gly Glu
115 120 125
Val Lys Lys Cys Ser Phe Asn Ala Thr Thr Ile Ile Arg Asp Lys Val
130 135 140
Asp Lys Glu Glu Ala Leu Phe Tyr Lys Leu Asp Ile Val Pro Ile Asp
145 150 155 160
Gly Asn Asn Glu Thr Asn Ile Val Asn Asn Gly Thr Asn Asn Thr Ser
165 170 175
Thr Asn Tyr Thr Ser Tyr Arg Leu Ile Asn Cys Asn Thr Ser Val Ile
180 185 190
Thr Gln Ala Cys Pro Glu Val Ser Phe Glu Pro Ile Pro Ile His Tyr
195 200 205
Cys Ala Pro Ala Gly Phe Ala Ile Leu Lys Cys Lys Glu Lys Ala Phe
210 215 220
Asn Gly Ser Gly Pro Cys Lys Asn Val Ser Ser Val Gln Cys Thr His
225 230 235 240
Gly Ile Lys Pro Val Val Ser Thr Gln Leu Pro Leu Asn Gly Ser Leu
245 250 255
Ala Glu Glu Glu Val Val Ile Arg Ser Glu Asn Phe Thr Asn Asn Ala
260 265 270
Lys Thr Ile Ile Val Gln Leu Lys Glu Ala Val Asn Ile Ser Cys Ile
275 280 285
Arg Pro Asn Asn Asn Thr Arg Lys Ser Ile Pro Ile Gly Pro Gly Arg
290 295 300
Ala Phe Tyr Ala Thr Gly Asp Ile Ile Gly Asp Ile Gly Gln Ala His
305 310 315 320
Cys Asn Leu Ser Arg Thr Asn Trp Asn Lys Thr Leu Gln Gln Ile Ala
325 330 335
Thr Lys Leu Gly Glu Lys Phe Asn Lys Thr Thr Ile Ile Phe Asn Gln
340 345 350
Ser Ser Gly Gly Asp Pro Glu Ile Val Met His Ser Phe Ile Cys Gly
355 360 365
Gly Glu Phe Phe Tyr Cys Asn Thr Thr Gln Leu Phe Asn Ser Thr Trp
370 375 380
Asn Cys Thr Glu Asn Gly Asn Cys Thr Leu Val Thr Gly Thr Trp Pro
385 390 395 400
Asp Arg Pro Asn Arg Thr Gly Glu Asn Asp Thr Ile Thr Leu Pro Cys
405 410 415
Arg Ile Lys Gln Ile Ile Asn Leu Trp Gln Glu Val Gly Lys Ala Met
420 425 430
Tyr Ala Ser Pro Ile Gln Gly Leu Ile Asn Cys Thr Ser Asn Ile Thr
435 440 445
Gly Leu Leu Leu Thr Arg Asp Gly Gly Thr His Ser Arg Gln Asn Glu
450 455 460
Thr Phe Arg Pro Glu Gly Gly Asn Met Lys Asp Asn Trp Arg Ser Glu
465 470 475 480
Leu Tyr Lys Tyr Lys Val Val Arg Ile Glu Pro Leu Gly Val Ala Pro
485 490 495
Thr Lys Ala Lys Arg Arg Val Val Gln Arg Glu Lys Arg
500 505
<210> 115
<211> 1632
<212> DNA
<213> Human immunodeficiency virus type 1
<400> 115
agaaagagca aaacagtggc aatgaaagtg aaggggacca ggatgaattg tcagcgctgg 60
tggtggacat ggggcacgat gctccttggg atgttgatga tctgtagtgc tgcagaaaag 120
ttgtgggtca cagtctatta tggggtacct gtgtggaaag aagcaaccac cactctattt 180
tgtgcatcag atgctaaagc atataagaca gaaaagcata atgtctgggc cacacatgcc 240
tgtgtaccca caaaccccaa cccacaagaa gtagtaatgg aaaatgtaac agaatatttt 300
aacatgtgga aaaataacat ggtagaacag atgcaggagg atataatcag tttatgggat 360
caaagcctaa agccatgtgt aaaattaacc ccactctgtg taactttaac ttgtgtgaat 420
attactaact gtaagaataa tactaactgt aataatgata ctaacagtaa gaatgatact 480
cttaaggagg agatagggga aataaaaaac tgctctttca acgtcaccac agccataaga 540
gataaggtgc agaaagaata tgcattattt cataaacttg atgtagtaca aatagataat 600
gataatacta gtagtaatac ttctaagcct tataggttga taagttgtaa cacctcagtc 660
attacacagg cctgtccaaa ggtaaccttt gagccaattc ccatacatta ttgtgcctcg 720
gctggttttg cgattctaaa gtgtaacaat aagactttca atggaacagg accatgtaca 780
aatgtcagca cagtacaatg tacacatgga attaggccag tagtatcaac tcaactgttg 840
ttaaatgaca gcctagcaga aaaagaggca atagttagat ctgaaaattt cacaaacaac 900
gctaaaatca taatagtaca gctaaatgaa tctgtagaga ttaattgtac aagacccaac 960
aacaatacaa gaagaagtat acctatagga ccagggaaag cattttttac atcagaaata 1020
ataggagata taagaaaagc acactgtaac attagtggaa caaagtggaa tgccactttg 1080
cataaaatag ctataaaatt aagagaacaa tatggaaata aaacaatagt ctttaatcaa 1140
ccttcaggag gggacccaga agttgtaatg cacagtttta actgtggagg ggaatttttc 1200
tactgtgata caacacaact gtttaatagt acttggttta atagtacttg gccaaatatc 1260
acacttgaag aaaatatcac actcccatgc aaaataaaac aatttataaa catgtggcag 1320
gaagtaggaa aagcaatgta tgcccctccc atcagaggac aaattaactg ttcatcacag 1380
attacagggc tgctattaac aagagatggt ggtcagggta acaatactaa caacgacact 1440
gagattttca gaccaggggg aggagatatg agggacaatt ggagaagtga attatacaaa 1500
tataaagtag taagaattga gccattggga gtagcaccca ccaaggcaat gagaagagtg 1560
gtgcagagag aaaaaagagc aataggacta ggagcttttt tccttgggtt cttgggagca 1620
gcaggaagca ct 1632
<210> 116
<211> 491
<212> PRT
<213> Human immunodeficiency virus type 1
<400> 116
Ser Ala Ala Glu Lys Leu Trp Val Thr Val Tyr Tyr Gly Val Pro Val
1 5 10 15
Trp Lys Glu Ala Thr Thr Thr Leu Phe Cys Ala Ser Asp Ala Lys Ala
20 25 30
Tyr Lys Thr Glu Lys His Asn Val Trp Ala Thr His Ala Cys Val Pro
35 40 45
Thr Asn Pro Asn Pro Gln Glu Val Val Met Glu Asn Val Thr Glu Tyr
50 55 60
Phe Asn Met Trp Lys Asn Asn Met Val Glu Gln Met Gln Glu Asp Ile
65 70 75 80
Ile Ser Leu Trp Asp Gln Ser Leu Lys Pro Cys Val Lys Leu Thr Pro
85 90 95
Leu Cys Val Thr Leu Thr Cys Val Asn Ile Thr Asn Cys Lys Asn Asn
100 105 110
Thr Asn Cys Asn Asn Asp Thr Asn Ser Lys Asn Asp Thr Leu Lys Glu
115 120 125
Glu Ile Gly Glu Ile Lys Asn Cys Ser Phe Asn Val Thr Thr Ala Ile
130 135 140
Arg Asp Lys Val Gln Lys Glu Tyr Ala Leu Phe His Lys Leu Asp Val
145 150 155 160
Val Gln Ile Asp Asn Asp Asn Thr Ser Ser Asn Thr Ser Lys Pro Tyr
165 170 175
Arg Leu Ile Ser Cys Asn Thr Ser Val Ile Thr Gln Ala Cys Pro Lys
180 185 190
Val Thr Phe Glu Pro Ile Pro Ile His Tyr Cys Ala Ser Ala Gly Phe
195 200 205
Ala Ile Leu Lys Cys Asn Asn Lys Thr Phe Asn Gly Thr Gly Pro Cys
210 215 220
Thr Asn Val Ser Thr Val Gln Cys Thr His Gly Ile Arg Pro Val Val
225 230 235 240
Ser Thr Gln Leu Leu Leu Asn Asp Ser Leu Ala Glu Lys Glu Ala Ile
245 250 255
Val Arg Ser Glu Asn Phe Thr Asn Asn Ala Lys Ile Ile Ile Val Gln
260 265 270
Leu Asn Glu Ser Val Glu Ile Asn Cys Thr Arg Pro Asn Asn Asn Thr
275 280 285
Arg Arg Ser Ile Pro Ile Gly Pro Gly Lys Ala Phe Phe Thr Ser Glu
290 295 300
Ile Ile Gly Asp Ile Arg Lys Ala His Cys Asn Ile Ser Gly Thr Lys
305 310 315 320
Trp Asn Ala Thr Leu His Lys Ile Ala Ile Lys Leu Arg Glu Gln Tyr
325 330 335
Gly Asn Lys Thr Ile Val Phe Asn Gln Pro Ser Gly Gly Asp Pro Glu
340 345 350
Val Val Met His Ser Phe Asn Cys Gly Gly Glu Phe Phe Tyr Cys Asp
355 360 365
Thr Thr Gln Leu Phe Asn Ser Thr Trp Phe Asn Ser Thr Trp Pro Asn
370 375 380
Ile Thr Leu Glu Glu Asn Ile Thr Leu Pro Cys Lys Ile Lys Gln Phe
385 390 395 400
Ile Asn Met Trp Gln Glu Val Gly Lys Ala Met Tyr Ala Pro Pro Ile
405 410 415
Arg Gly Gln Ile Asn Cys Ser Ser Gln Ile Thr Gly Leu Leu Leu Thr
420 425 430
Arg Asp Gly Gly Gln Gly Asn Asn Thr Asn Asn Asp Thr Glu Ile Phe
435 440 445
Arg Pro Gly Gly Gly Asp Met Arg Asp Asn Trp Arg Ser Glu Leu Tyr
450 455 460
Lys Tyr Lys Val Val Arg Ile Glu Pro Leu Gly Val Ala Pro Thr Lys
465 470 475 480
Ala Met Arg Arg Val Val Gln Arg Glu Lys Arg
485 490
<210> 117
<211> 1679
<212> DNA
<213> Human immunodeficiency virus type 1
<400> 117
agaaagagca gaagacagtg gcaatgagag tgatggggat caggaagagt tatcagcact 60
tgtggaaagg gggcaccttg ctccttggaa tattgatgat ctgtagtgct gcagaacaat 120
tgtgggtcac agtctattat ggggtacctg tgtggaaaga tgcaaccacc actttatttt 180
gtgcatcaga tgctaaagca tatgatacag aggtacacaa tgtttgggcc acacatgcct 240
gtgtacccac agaccccaac ccacaagaag tagtaatggg aaatgtgaca gaatatttta 300
acatgtggac aaataacatg gtagaacaga tgcatgagga tgtaatcagt ttatgggatc 360
aaagcctaaa gccatgtgta aaattaaccc cactctgtgt tactttaaat tgcactaatt 420
tggtgaatac tacctgtaat gggactacta acaataatac tacctgtact gggactgcta 480
acaatgatac taataccaat agtactaggt gggtgtatca agcgatggca ggagaaataa 540
aaaactgctc tttcaatatc accacaaaca taagagataa gataaaaaaa gaatatgcac 600
tttttaatag acttgatata gtaccaatag atgatgagaa taagaatact ggcaatacta 660
ctagctatag gttgataagt tgtaacacct cagtcattac acaggcctgt ccaaaggtaa 720
cctttgaacc aattcccata cattattgtg ccccggctgg ttttgcgatt ctcaagtgta 780
atgataagaa gttcaatgga acaggaccat gtacaaatgt cagcacagta caatgtacac 840
atggaattag gccagtagta tcaactcaac tactattaaa tggcagtcta gcagaagaag 900
agacagtaat tagatctagc aatttctcga acaatgctaa aatcataata gtacagctga 960
atgaaactgt acgaattaat tgtacaagac ccaacaacaa tacaagaaga agtatacata 1020
taggaccagg gagagcattt tatgcaacag gagacataat aggagatata agacaagcac 1080
attgtaacat tagtggagaa gaatggagga gaactttaaa acggataact ataaaattag 1140
gagaacaatt taataaaaca aaaataagct ataaccaatc ctcaggaggg gacccagaaa 1200
ttgtaaggca cagttttaat tgtcaagggg aatttttcta ctgtgataca tcaggactgt 1260
ttaatagtac ttgggtgaag aatgatactt ggaatgagag tagtattagc aatggaacta 1320
tcacactccc atgcagaata aaacaaattg taaacatgtg gcaggaagta ggaagagcaa 1380
tgtatgcccc tcctatcaaa ggacaaatta attgtacatc gaatattaca gggctgctac 1440
taacaagaga tggtggtcag actaatagca ccaacaacga cactgagacc ttcagaccta 1500
caggaggaga tataagggac aattggagga gtgaattata taaatataaa gtagtaaaaa 1560
ttgaaccatt aggaatagca cccaccaggg caaaaagaag agtggtgcaa agagaaaaaa 1620
gagcagtggg aacgatggga gcgttgttcc ttgggttctt gggagcagca ggaagcact 1679
<210> 118
<211> 506
<212> PRT
<213> Human immunodeficiency virus type 1
<400> 118
Ser Ala Ala Glu Gln Leu Trp Val Thr Val Tyr Tyr Gly Val Pro Val
1 5 10 15
Trp Lys Asp Ala Thr Thr Thr Leu Phe Cys Ala Ser Asp Ala Lys Ala
20 25 30
Tyr Asp Thr Glu Val His Asn Val Trp Ala Thr His Ala Cys Val Pro
35 40 45
Thr Asp Pro Asn Pro Gln Glu Val Val Met Gly Asn Val Thr Glu Tyr
50 55 60
Phe Asn Met Trp Thr Asn Asn Met Val Glu Gln Met His Glu Asp Val
65 70 75 80
Ile Ser Leu Trp Asp Gln Ser Leu Lys Pro Cys Val Lys Leu Thr Pro
85 90 95
Leu Cys Val Thr Leu Asn Cys Thr Asn Leu Val Asn Thr Thr Cys Asn
100 105 110
Gly Thr Thr Asn Asn Asn Thr Thr Cys Thr Gly Thr Ala Asn Asn Asp
115 120 125
Thr Asn Thr Asn Ser Thr Arg Trp Val Tyr Gln Ala Met Ala Gly Glu
130 135 140
Ile Lys Asn Cys Ser Phe Asn Ile Thr Thr Asn Ile Arg Asp Lys Ile
145 150 155 160
Lys Lys Glu Tyr Ala Leu Phe Asn Arg Leu Asp Ile Val Pro Ile Asp
165 170 175
Asp Glu Asn Lys Asn Thr Gly Asn Thr Thr Ser Tyr Arg Leu Ile Ser
180 185 190
Cys Asn Thr Ser Val Ile Thr Gln Ala Cys Pro Lys Val Thr Phe Glu
195 200 205
Pro Ile Pro Ile His Tyr Cys Ala Pro Ala Gly Phe Ala Ile Leu Lys
210 215 220
Cys Asn Asp Lys Lys Phe Asn Gly Thr Gly Pro Cys Thr Asn Val Ser
225 230 235 240
Thr Val Gln Cys Thr His Gly Ile Arg Pro Val Val Ser Thr Gln Leu
245 250 255
Leu Leu Asn Gly Ser Leu Ala Glu Glu Glu Thr Val Ile Arg Ser Ser
260 265 270
Asn Phe Ser Asn Asn Ala Lys Ile Ile Ile Val Gln Leu Asn Glu Thr
275 280 285
Val Arg Ile Asn Cys Thr Arg Pro Asn Asn Asn Thr Arg Arg Ser Ile
290 295 300
His Ile Gly Pro Gly Arg Ala Phe Tyr Ala Thr Gly Asp Ile Ile Gly
305 310 315 320
Asp Ile Arg Gln Ala His Cys Asn Ile Ser Gly Glu Glu Trp Arg Arg
325 330 335
Thr Leu Lys Arg Ile Thr Ile Lys Leu Gly Glu Gln Phe Asn Lys Thr
340 345 350
Lys Ile Ser Tyr Asn Gln Ser Ser Gly Gly Asp Pro Glu Ile Val Arg
355 360 365
His Ser Phe Asn Cys Gln Gly Glu Phe Phe Tyr Cys Asp Thr Ser Gly
370 375 380
Leu Phe Asn Ser Thr Trp Val Lys Asn Asp Thr Trp Asn Glu Ser Ser
385 390 395 400
Ile Ser Asn Gly Thr Ile Thr Leu Pro Cys Arg Ile Lys Gln Ile Val
405 410 415
Asn Met Trp Gln Glu Val Gly Arg Ala Met Tyr Ala Pro Pro Ile Lys
420 425 430
Gly Gln Ile Asn Cys Thr Ser Asn Ile Thr Gly Leu Leu Leu Thr Arg
435 440 445
Asp Gly Gly Gln Thr Asn Ser Thr Asn Asn Asp Thr Glu Thr Phe Arg
450 455 460
Pro Thr Gly Gly Asp Ile Arg Asp Asn Trp Arg Ser Glu Leu Tyr Lys
465 470 475 480
Tyr Lys Val Val Lys Ile Glu Pro Leu Gly Ile Ala Pro Thr Arg Ala
485 490 495
Lys Arg Arg Val Val Gln Arg Glu Lys Arg
500 505
<210> 119
<211> 1589
<212> DNA
<213> Human immunodeficiency virus type 1
<400> 119
agaaagagca gaagacagtg gcaatgagag tgagggggat catgaggaat tacttgtgga 60
aatggggcat catgctcctt gggatattga tgatctgtag tgctacagac aaattgtggg 120
tcacagtcta ttatggggtg cctgtgtgga aagaagcatc caccactcta ttttgtgcat 180
cagatgctaa agcctatgat acagaggtac ataatgtttg ggccacacat gcctgtgtac 240
ccacagatcc caatccacac gaattagaat tggaaaatgt gacagaagat tttaacatgt 300
ggaaaaatga catggtcgaa cagatgcatg aggatataat cagtttatgg gatcaaagcc 360
taaaaccatg tgtaaaatta accccactct gtgttacttt aaattgcagt gatgctttaa 420
cttgcaatag gacatcaaat agcagtagta cttcaaattg cagtaactgg gaaccgatag 480
aagaaataaa aaattgctct ttcaatatta ccacaagcat agaaaataag atgcagaaaa 540
agtctgcatt ttttgatgcc cttgatgtag tacaaataga tgatactagt tataggttga 600
taaattgtaa cacctcagtc attacacagg cctgtccaaa gatatccttt gagccaattc 660
ccatacatta ttgtgccccg gctggttttg cgcttctaaa gtgtaaggat ccgaaattca 720
atggaacagg gccatgtaaa tatgctagct cagtacagtg tacacatgga attaggccgg 780
tagtatcaac tcaactgctg ctaaatggca gtctagcaga agaagatata gtaattagat 840
ctgccaattt ctcggacaac accaaagcca taatagtaca actaaaagaa cctgtaataa 900
ttaattgcac aagacccaac aacaatacaa gacaaagtgt acatatagga ccagggagcg 960
cactttatac aacagatata ataggagata taagaaaagc acattgtaac attagtagag 1020
cagactggac taaagcttta aaccagacag tcataaaatt aagagaacaa tttaagaata 1080
aaacaatagt ctttaatcaa tcctcaggag gggatccaga aattgtaatg cacactttta 1140
attgtggagg ggaatttttc tattgtaatt caacaaaact gtttaatagt acttggaatg 1200
ggactgaacc aggagagtca aatgacactg taatcatact cccatgcaga ataaaacaaa 1260
ttataaatat gtggcaggaa gtaggaaaag caatgtatgc ccctcccatc agaggacaaa 1320
ttagatgtac atcaaatatt acagggctgc tactaacaag agatggggga aatgagacca 1380
ctaaaaacgg gactgagacc ttcagacctg gaggaggaaa tatgaaggac aattggagaa 1440
gtgaattata taaatataaa gtggtaaaaa ttgaaccatt aggagtagca cccaccaagg 1500
caaaaagaag agtggtgcag agagaaaaaa gagcaatagg ggcattcgga gctatgttcc 1560
ttgggttctt gggagcagca ggaagcact 1589
<210> 120
<211> 478
<212> PRT
<213> Human immunodeficiency virus type 1
<400> 120
Ser Ala Thr Asp Lys Leu Trp Val Thr Val Tyr Tyr Gly Val Pro Val
1 5 10 15
Trp Lys Glu Ala Ser Thr Thr Leu Phe Cys Ala Ser Asp Ala Lys Ala
20 25 30
Tyr Asp Thr Glu Val His Asn Val Trp Ala Thr His Ala Cys Val Pro
35 40 45
Thr Asp Pro Asn Pro His Glu Leu Glu Leu Glu Asn Val Thr Glu Asp
50 55 60
Phe Asn Met Trp Lys Asn Asp Met Val Glu Gln Met His Glu Asp Ile
65 70 75 80
Ile Ser Leu Trp Asp Gln Ser Leu Lys Pro Cys Val Lys Leu Thr Pro
85 90 95
Leu Cys Val Thr Leu Asn Cys Ser Asp Ala Leu Thr Cys Asn Arg Thr
100 105 110
Ser Asn Ser Ser Ser Thr Ser Asn Cys Ser Asn Trp Glu Pro Ile Glu
115 120 125
Glu Ile Lys Asn Cys Ser Phe Asn Ile Thr Thr Ser Ile Glu Asn Lys
130 135 140
Met Gln Lys Lys Ser Ala Phe Phe Asp Ala Leu Asp Val Val Gln Ile
145 150 155 160
Asp Asp Thr Ser Tyr Arg Leu Ile Asn Cys Asn Thr Ser Val Ile Thr
165 170 175
Gln Ala Cys Pro Lys Ile Ser Phe Glu Pro Ile Pro Ile His Tyr Cys
180 185 190
Ala Pro Ala Gly Phe Ala Leu Leu Lys Cys Lys Asp Pro Lys Phe Asn
195 200 205
Gly Thr Gly Pro Cys Lys Tyr Ala Ser Ser Val Gln Cys Thr His Gly
210 215 220
Ile Arg Pro Val Val Ser Thr Gln Leu Leu Leu Asn Gly Ser Leu Ala
225 230 235 240
Glu Glu Asp Ile Val Ile Arg Ser Ala Asn Phe Ser Asp Asn Thr Lys
245 250 255
Ala Ile Ile Val Gln Leu Lys Glu Pro Val Ile Ile Asn Cys Thr Arg
260 265 270
Pro Asn Asn Asn Thr Arg Gln Ser Val His Ile Gly Pro Gly Ser Ala
275 280 285
Leu Tyr Thr Thr Asp Ile Ile Gly Asp Ile Arg Lys Ala His Cys Asn
290 295 300
Ile Ser Arg Ala Asp Trp Thr Lys Ala Leu Asn Gln Thr Val Ile Lys
305 310 315 320
Leu Arg Glu Gln Phe Lys Asn Lys Thr Ile Val Phe Asn Gln Ser Ser
325 330 335
Gly Gly Asp Pro Glu Ile Val Met His Thr Phe Asn Cys Gly Gly Glu
340 345 350
Phe Phe Tyr Cys Asn Ser Thr Lys Leu Phe Asn Ser Thr Trp Asn Gly
355 360 365
Thr Glu Pro Gly Glu Ser Asn Asp Thr Val Ile Ile Leu Pro Cys Arg
370 375 380
Ile Lys Gln Ile Ile Asn Met Trp Gln Glu Val Gly Lys Ala Met Tyr
385 390 395 400
Ala Pro Pro Ile Arg Gly Gln Ile Arg Cys Thr Ser Asn Ile Thr Gly
405 410 415
Leu Leu Leu Thr Arg Asp Gly Gly Asn Glu Thr Thr Lys Asn Gly Thr
420 425 430
Glu Thr Phe Arg Pro Gly Gly Gly Asn Met Lys Asp Asn Trp Arg Ser
435 440 445
Glu Leu Tyr Lys Tyr Lys Val Val Lys Ile Glu Pro Leu Gly Val Ala
450 455 460
Pro Thr Lys Ala Lys Arg Arg Val Val Gln Arg Glu Lys Arg
465 470 475
<210> 121
<211> 1616
<212> DNA
<213> Human immunodeficiency virus type 1
<400> 121
agaaagagca gaagacagtg gcaatgaaag tgaaggggat caggaagaat tgtcagcgct 60
tgtggagatg gggcacgatg ctccttggga tgttaatgat atgtagtgct gcagagcaat 120
tgtgggtcac agtctattat ggggtacctg tgtggagaga agcaaacacc actctattct 180
gtgcctcaga tgctaaagca caggttgcag aggcacataa tgtatgggcc acacatgcct 240
gtgtacccac agaccctagc ccacaagaag tagtaatgga aaatgtgaca gaaaatttta 300
acatgtggaa aaataacatg gtagaacaga tgcatgagga tataatcagt ttatgggatc 360
aaagtctaaa gccatgtgtg aaattaaccc cactctgcgt tactttaaat tgcactaatg 420
tgggttgcac tggtaatact actggaccca attgtacttc tttgactgat cataatagta 480
atcttacttg gggaatggag aaaggagaaa taaaaaattg ctctttcaat gtcaccagta 540
taacaaataa gatgcagaaa gaatatgcac ttttttataa acttgatgta atgccaatgg 600
atagtacaga taatacaacg tatacactga taaattgtaa cccctcagtc attacacagg 660
cctgtccaaa ggtatctttt gaacccattc ctatacatta ttgtaccccg gctggttttg 720
cgattctaaa gtgtaatgat aagacattca atggatcagg accatgtaca aatgtcagta 780
cagtactatg tacacatgga attaggccag tagtgtcaac tcaactactg ttaaatggca 840
gtctagcaga agaggaggta atagtcaggt ccgagaattt ctcggacaat actaaaatca 900
taatagtaca gctgaataaa actgtagaaa ttaattgtac aagacccaat aacaatacaa 960
gaaaaagtat acatatagca ccaggaaaag cattctatgc aacaggtgat ataataggag 1020
atataagaca agcacattgt aacatcagtg aaacaaaatg ggtgaacact ttaaaacagg 1080
tagttacaaa attaagggaa caatatggga ataaaacaat agcctttaat caatcctcag 1140
gaggggatcc agaaattgta acgcatagtt ttaattgtgg aggagaattt ttctactgta 1200
atacatcacg gctgtttaat agtaattgga ctgggaatgg aacgactgag tcaggaaata 1260
gcacaatcat acttccatgc agaataaaac aaattataaa cagatggcag gaagtaggaa 1320
aagcaatgta tgccaatccc attagtggac caatcaactg ttcatcaaac attacagggc 1380
tgctattaac aagagatggt ggtaaagtga ccaatgacac caccgagacc ttcagacctt 1440
ggggtggaga tatgagggac aattggagaa gtgaactata taaatataaa gtagtaagaa 1500
ttgagccatt aggactagca cccaccaggg caaagagaag agtggtgcag agagaaaaga 1560
gagcaataac attaggagct atgttccttg ggttcttggg agcagcagga agcact 1616
<210> 122
<211> 486
<212> PRT
<213> Human immunodeficiency virus type 1
<400> 122
Ser Ala Ala Glu Gln Leu Trp Val Thr Val Tyr Tyr Gly Val Pro Val
1 5 10 15
Trp Arg Glu Ala Asn Thr Thr Leu Phe Cys Ala Ser Asp Ala Lys Ala
20 25 30
Gln Val Ala Glu Ala His Asn Val Trp Ala Thr His Ala Cys Val Pro
35 40 45
Thr Asp Pro Ser Pro Gln Glu Val Val Met Glu Asn Val Thr Glu Asn
50 55 60
Phe Asn Met Trp Lys Asn Asn Met Val Glu Gln Met His Glu Asp Ile
65 70 75 80
Ile Ser Leu Trp Asp Gln Ser Leu Lys Pro Cys Val Lys Leu Thr Pro
85 90 95
Leu Cys Val Thr Leu Asn Cys Thr Asn Val Gly Cys Thr Gly Asn Thr
100 105 110
Thr Gly Pro Asn Cys Thr Ser Leu Thr Asp His Asn Ser Asn Leu Thr
115 120 125
Trp Gly Met Glu Lys Gly Glu Ile Lys Asn Cys Ser Phe Asn Val Thr
130 135 140
Ser Ile Thr Asn Lys Met Gln Lys Glu Tyr Ala Leu Phe Tyr Lys Leu
145 150 155 160
Asp Val Met Pro Met Asp Ser Thr Asp Asn Thr Thr Tyr Thr Leu Ile
165 170 175
Asn Cys Asn Pro Ser Val Ile Thr Gln Ala Cys Pro Lys Val Ser Phe
180 185 190
Glu Pro Ile Pro Ile His Tyr Cys Thr Pro Ala Gly Phe Ala Ile Leu
195 200 205
Lys Cys Asn Asp Lys Thr Phe Asn Gly Ser Gly Pro Cys Thr Asn Val
210 215 220
Ser Thr Val Leu Cys Thr His Gly Ile Arg Pro Val Val Ser Thr Gln
225 230 235 240
Leu Leu Leu Asn Gly Ser Leu Ala Glu Glu Glu Val Ile Val Arg Ser
245 250 255
Glu Asn Phe Ser Asp Asn Thr Lys Ile Ile Ile Val Gln Leu Asn Lys
260 265 270
Thr Val Glu Ile Asn Cys Thr Arg Pro Asn Asn Asn Thr Arg Lys Ser
275 280 285
Ile His Ile Ala Pro Gly Lys Ala Phe Tyr Ala Thr Gly Asp Ile Ile
290 295 300
Gly Asp Ile Arg Gln Ala His Cys Asn Ile Ser Glu Thr Lys Trp Val
305 310 315 320
Asn Thr Leu Lys Gln Val Val Thr Lys Leu Arg Glu Gln Tyr Gly Asn
325 330 335
Lys Thr Ile Ala Phe Asn Gln Ser Ser Gly Gly Asp Pro Glu Ile Val
340 345 350
Thr His Ser Phe Asn Cys Gly Gly Glu Phe Phe Tyr Cys Asn Thr Ser
355 360 365
Arg Leu Phe Asn Ser Asn Trp Thr Gly Asn Gly Thr Thr Glu Ser Gly
370 375 380
Asn Ser Thr Ile Ile Leu Pro Cys Arg Ile Lys Gln Ile Ile Asn Arg
385 390 395 400
Trp Gln Glu Val Gly Lys Ala Met Tyr Ala Asn Pro Ile Ser Gly Pro
405 410 415
Ile Asn Cys Ser Ser Asn Ile Thr Gly Leu Leu Leu Thr Arg Asp Gly
420 425 430
Gly Lys Val Thr Asn Asp Thr Thr Glu Thr Phe Arg Pro Trp Gly Gly
435 440 445
Asp Met Arg Asp Asn Trp Arg Ser Glu Leu Tyr Lys Tyr Lys Val Val
450 455 460
Arg Ile Glu Pro Leu Gly Leu Ala Pro Thr Arg Ala Lys Arg Arg Val
465 470 475 480
Val Gln Arg Glu Lys Arg
485
<210> 123
<211> 1610
<212> DNA
<213> Human immunodeficiency virus type 1
<400> 123
agaaagagca gaagacagtg gcaatgagag tgaaggagat caggaagaat tggcagcgct 60
tgtggagatg gggcatgatg ctccttggga tgttgatgat cagtagtgct gaagaagatt 120
tgtgggtcac agtctattat ggggtacctg tgtggaaaga agcagagacc actttatttt 180
gtgcatcaga tgctaaagca tataacacag aggcacataa tgtgtgggcc acacatgcct 240
gtgtaccaac agaccctagc ccacaagaag tattattggt aaatgtgaca gaaaattata 300
acatgtggaa aaataacatg gtagaacaga tgcatgagga tataattagt ttatgggatc 360
aaagcctaaa gccatgtgta aaattaaccc cgctttgtgt tactttaaat tgcactaatg 420
tgaattgcac tcatgagaat ggtaccacta ccgagtgcgg taataatggg atacagatgg 480
agaaaggaga aatgaaaaac tgctctttca atattaccac aagcataaaa aataagatgc 540
agaaagaata tgcacttttg tataaactag atttagcatc aataggtaat gataatacaa 600
gctatacttt gataagttgt aacacctcag tcattacaca ggcctgtcca aagatatcct 660
ttgaaccaat tccaatacat tattgtgccc cggctggttt tgcgattcta aaatgtaatg 720
ataagaactt caagggaaca ggatcatgta aaaatgtcag cacagtacaa tgtacacatg 780
gaattaagcc agtagtgtca actcaattgt tgttaaatgg cactttagca gaaacagagg 840
tagtaattag atctgaaaat atcacagaca atgctaaaac cataatagta caactgaagg 900
accctgtaaa aattaattgt acaagacctg gcaacaatac agcaagaagc atacatatgg 960
gaccggggag agcattttct gcaacaggac aaataatagg aaatataaga caagcacatt 1020
gtaaccttag tagaacagaa tgggatgaca ctttaaaaaa gatagctaag aaattaggag 1080
aacaatttag gaataaaagt atagccttta atcaatcctc aggaggggac ccagaaattg 1140
taatgcacag ttttaattgt ggaggggaat ttttttactg taatacatca cagctgttta 1200
atagtacttg gtggaacaat ggtactagga atgatgctgc aaggtcaaat agcactgaac 1260
ctatcacact ccggtgcagt ataaagcaaa ttataaacag atggcaggaa gtaggaaaag 1320
caatgtatgc ccctcccatc aggggaaacg ttacatgtaa ctcaagtatt acagggctac 1380
tcttaataag agatggtggg aacagtaatg agtctactga gaccttcaga cctcagggag 1440
gaaatatgaa ggacaattgg agaagtgaat tatacaaata taaagtagta aaaattgagc 1500
cattaggagt agcacccacc aaggcaaaga gaagagtggt gcagagagaa aaaagagcag 1560
tgggactagg agctgtgttc cttgggttct tgggagcagc aggaagcact 1610
<210> 124
<211> 484
<212> PRT
<213> Human immunodeficiency virus type 1
<400> 124
Ser Ala Glu Glu Asp Leu Trp Val Thr Val Tyr Tyr Gly Val Pro Val
1 5 10 15
Trp Lys Glu Ala Glu Thr Thr Leu Phe Cys Ala Ser Asp Ala Lys Ala
20 25 30
Tyr Asn Thr Glu Ala His Asn Val Trp Ala Thr His Ala Cys Val Pro
35 40 45
Thr Asp Pro Ser Pro Gln Glu Val Leu Leu Val Asn Val Thr Glu Asn
50 55 60
Tyr Asn Met Trp Lys Asn Asn Met Val Glu Gln Met His Glu Asp Ile
65 70 75 80
Ile Ser Leu Trp Asp Gln Ser Leu Lys Pro Cys Val Lys Leu Thr Pro
85 90 95
Leu Cys Val Thr Leu Asn Cys Thr Asn Val Asn Cys Thr His Glu Asn
100 105 110
Gly Thr Thr Thr Glu Cys Gly Asn Asn Gly Ile Gln Met Glu Lys Gly
115 120 125
Glu Met Lys Asn Cys Ser Phe Asn Ile Thr Thr Ser Ile Lys Asn Lys
130 135 140
Met Gln Lys Glu Tyr Ala Leu Leu Tyr Lys Leu Asp Leu Ala Ser Ile
145 150 155 160
Gly Asn Asp Asn Thr Ser Tyr Thr Leu Ile Ser Cys Asn Thr Ser Val
165 170 175
Ile Thr Gln Ala Cys Pro Lys Ile Ser Phe Glu Pro Ile Pro Ile His
180 185 190
Tyr Cys Ala Pro Ala Gly Phe Ala Ile Leu Lys Cys Asn Asp Lys Asn
195 200 205
Phe Lys Gly Thr Gly Ser Cys Lys Asn Val Ser Thr Val Gln Cys Thr
210 215 220
His Gly Ile Lys Pro Val Val Ser Thr Gln Leu Leu Leu Asn Gly Thr
225 230 235 240
Leu Ala Glu Thr Glu Val Val Ile Arg Ser Glu Asn Ile Thr Asp Asn
245 250 255
Ala Lys Thr Ile Ile Val Gln Leu Lys Asp Pro Val Lys Ile Asn Cys
260 265 270
Thr Arg Pro Gly Asn Asn Thr Ala Arg Ser Ile His Met Gly Pro Gly
275 280 285
Arg Ala Phe Ser Ala Thr Gly Gln Ile Ile Gly Asn Ile Arg Gln Ala
290 295 300
His Cys Asn Leu Ser Arg Thr Glu Trp Asp Asp Thr Leu Lys Lys Ile
305 310 315 320
Ala Lys Lys Leu Gly Glu Gln Phe Arg Asn Lys Ser Ile Ala Phe Asn
325 330 335
Gln Ser Ser Gly Gly Asp Pro Glu Ile Val Met His Ser Phe Asn Cys
340 345 350
Gly Gly Glu Phe Phe Tyr Cys Asn Thr Ser Gln Leu Phe Asn Ser Thr
355 360 365
Trp Trp Asn Asn Gly Thr Arg Asn Asp Ala Ala Arg Ser Asn Ser Thr
370 375 380
Glu Pro Ile Thr Leu Arg Cys Ser Ile Lys Gln Ile Ile Asn Arg Trp
385 390 395 400
Gln Glu Val Gly Lys Ala Met Tyr Ala Pro Pro Ile Arg Gly Asn Val
405 410 415
Thr Cys Asn Ser Ser Ile Thr Gly Leu Leu Leu Ile Arg Asp Gly Gly
420 425 430
Asn Ser Asn Glu Ser Thr Glu Thr Phe Arg Pro Gln Gly Gly Asn Met
435 440 445
Lys Asp Asn Trp Arg Ser Glu Leu Tyr Lys Tyr Lys Val Val Lys Ile
450 455 460
Glu Pro Leu Gly Val Ala Pro Thr Lys Ala Lys Arg Arg Val Val Gln
465 470 475 480
Arg Glu Lys Arg
<210> 125
<211> 1619
<212> DNA
<213> Human immunodeficiency virus type 1
<400> 125
agaaagagca gaagacagtg gcaatgagag tgatggggat aaagaagaat tactggtgga 60
gatggggccc gatgctcctt gggatattga tgacctatag tgcagcagaa ttttgggtca 120
cagtctacta tggagtgcca gtgtggaaag aaacaaccac cactctattt tgtgcatcag 180
atgccaaagc atatgataca gaggcacata atgtttgggc cacacatgcc tgtgtaccca 240
cagaccccaa cccacaagaa gtagtattgg aaaaggtgac agaagagttt aacatgtgga 300
aaaatagcat ggtagaacag atgcatgagg atataatcag tttatgggat caaagtctaa 360
agccatgtgt aaaactaacc ccactctgtg ttactttaag ttgcactgat tgtaatggta 420
ctagccctga gtgtgcgaag aatgctagta ctactaccac tagtagtaag ggattgatag 480
ataaagggga aataaaaaac tgctctttca atgccaccac acacataatg gataaggtgc 540
agaaagaata tgcattattt tataacactg atttagtaca aatagagggt gagaaatctg 600
ataataatac tagatatagg ttaataagtt gtaacacctc agtcattaaa caggcctgtc 660
caaaggtatc ttttgagcca attcccatac attattgtgc cccggctggt tttgcgattc 720
taaagtgtaa agataagaat ttcaatggaa caggaaaatg ttacaatgtc agcacagtac 780
aatgtacaca tggaattagg ccagtaatgt caactcaact gctgttaaat ggcagcctag 840
cagaagaaga aatagtaatt agatctgcca atttctcgaa caatgctaaa accataatag 900
tacatctgaa tgaatctgta gaaattaact gcacaagacc caacaacgat acaaggaaaa 960
gtataaatat aggaccaggg agagcatggt atgcagcagg agaaataata ggaaatataa 1020
gaaaagcata ttgtaacatt agcagagcaa aatggaacaa cactttaaaa catgtagttg 1080
aaaaactaag aaaacaattt ggaaataaaa caataaactt tacacaacac gcaggagggg 1140
acctagaaat tgtgacgcat agttttaatt gtggagggga attcttctac tgcaacacaa 1200
cacagctgtt taatagtact tggcctaaga atggtacttg gaatggtact ggtagtgaca 1260
ttatcacact cccatgcaaa ataaaacaga ttataaacat gtggcaggag gtaggaaaag 1320
caatgtatgc ccctcccatc agcggactaa ttagatgttc atcaaatatt acagggctgc 1380
tattaacaag agatggtggt aagggtaatg gcacaaatga tacagagatc ttcagaccag 1440
gaggaggaga tatgagggac aattggagaa gtgaattata taaatataaa gtagtagaaa 1500
ttgagccaat aggactagca cccaccaagg caaagagaag agtggtgcag agagaaaaaa 1560
gagcagtggg aacgctggga gctatgttcc ttgggttctt gggagcagca ggaagcact 1619
<210> 126
<211> 488
<212> PRT
<213> Human immunodeficiency virus type 1
<400> 126
Ser Ala Ala Glu Phe Trp Val Thr Val Tyr Tyr Gly Val Pro Val Trp
1 5 10 15
Lys Glu Thr Thr Thr Thr Leu Phe Cys Ala Ser Asp Ala Lys Ala Tyr
20 25 30
Asp Thr Glu Ala His Asn Val Trp Ala Thr His Ala Cys Val Pro Thr
35 40 45
Asp Pro Asn Pro Gln Glu Val Val Leu Glu Lys Val Thr Glu Glu Phe
50 55 60
Asn Met Trp Lys Asn Ser Met Val Glu Gln Met His Glu Asp Ile Ile
65 70 75 80
Ser Leu Trp Asp Gln Ser Leu Lys Pro Cys Val Lys Leu Thr Pro Leu
85 90 95
Cys Val Thr Leu Ser Cys Thr Asp Cys Asn Gly Thr Ser Pro Glu Cys
100 105 110
Ala Lys Asn Ala Ser Thr Thr Thr Thr Ser Ser Lys Gly Leu Ile Asp
115 120 125
Lys Gly Glu Ile Lys Asn Cys Ser Phe Asn Ala Thr Thr His Ile Met
130 135 140
Asp Lys Val Gln Lys Glu Tyr Ala Leu Phe Tyr Asn Thr Asp Leu Val
145 150 155 160
Gln Ile Glu Gly Glu Lys Ser Asp Asn Asn Thr Arg Tyr Arg Leu Ile
165 170 175
Ser Cys Asn Thr Ser Val Ile Lys Gln Ala Cys Pro Lys Val Ser Phe
180 185 190
Glu Pro Ile Pro Ile His Tyr Cys Ala Pro Ala Gly Phe Ala Ile Leu
195 200 205
Lys Cys Lys Asp Lys Asn Phe Asn Gly Thr Gly Lys Cys Tyr Asn Val
210 215 220
Ser Thr Val Gln Cys Thr His Gly Ile Arg Pro Val Met Ser Thr Gln
225 230 235 240
Leu Leu Leu Asn Gly Ser Leu Ala Glu Glu Glu Ile Val Ile Arg Ser
245 250 255
Ala Asn Phe Ser Asn Asn Ala Lys Thr Ile Ile Val His Leu Asn Glu
260 265 270
Ser Val Glu Ile Asn Cys Thr Arg Pro Asn Asn Asp Thr Arg Lys Ser
275 280 285
Ile Asn Ile Gly Pro Gly Arg Ala Trp Tyr Ala Ala Gly Glu Ile Ile
290 295 300
Gly Asn Ile Arg Lys Ala Tyr Cys Asn Ile Ser Arg Ala Lys Trp Asn
305 310 315 320
Asn Thr Leu Lys His Val Val Glu Lys Leu Arg Lys Gln Phe Gly Asn
325 330 335
Lys Thr Ile Asn Phe Thr Gln His Ala Gly Gly Asp Leu Glu Ile Val
340 345 350
Thr His Ser Phe Asn Cys Gly Gly Glu Phe Phe Tyr Cys Asn Thr Thr
355 360 365
Gln Leu Phe Asn Ser Thr Trp Pro Lys Asn Gly Thr Trp Asn Gly Thr
370 375 380
Gly Ser Asp Ile Ile Thr Leu Pro Cys Lys Ile Lys Gln Ile Ile Asn
385 390 395 400
Met Trp Gln Glu Val Gly Lys Ala Met Tyr Ala Pro Pro Ile Ser Gly
405 410 415
Leu Ile Arg Cys Ser Ser Asn Ile Thr Gly Leu Leu Leu Thr Arg Asp
420 425 430
Gly Gly Lys Gly Asn Gly Thr Asn Asp Thr Glu Ile Phe Arg Pro Gly
435 440 445
Gly Gly Asp Met Arg Asp Asn Trp Arg Ser Glu Leu Tyr Lys Tyr Lys
450 455 460
Val Val Glu Ile Glu Pro Ile Gly Leu Ala Pro Thr Lys Ala Lys Arg
465 470 475 480
Arg Val Val Gln Arg Glu Lys Arg
485
<210> 127
<211> 1613
<212> DNA
<213> Human immunodeficiency virus type 1
<400> 127
agaaagagca gaagacagtg gcaatgagag cgaaggggat caggaagagt tgtcaacact 60
tatggagatg gggcaccatg ctccttggga tgttgatgat ttgtagtgct gcagaaaact 120
tgtgggtcac agtctactat ggggtacctg tgtggaaaga agcaaccacc actctatttt 180
gtgcatcgaa tgctaaagca tatgagacag aggtgcataa tgtttgggcc acacatgcct 240
gtgtacccac agaccccaac ccacaagaag tagtattggg aaatgtgaca gaaaatttta 300
acatgtggaa aaataacatg gtagaacaga tgcatgagga tgtaattagt ttgtgggacc 360
aaagcttaaa gccatgtgta aaattgaccc cactctgtgt tactttacat tgcactgatt 420
gtgagaatac tattactggg gggaataata ctaatagtaa atgcaatgag gataagggga 480
atactactgc cactatattg atagagaaag gagagatgaa aaactgctct tttaatgtca 540
ccacagacct aagagataag atgcagaaag aatatgcact tgatgtagta ccattagaca 600
gtactaatac cagctataag ttagtaagtt gtaacacctc agtcattaca caggcctgtc 660
caaaggtatc ttttgagcca attccaatac atttctgtgc cccagctggt tttgcgattc 720
taaagtgtaa caataaaacg tttgatggaa aaggaccatg tacaaatgtc agtacagtgc 780
gatgtacaca tggaattaaa ccagtagtgt caactcaact gctgttaaat ggcagtctag 840
cagaagaaga gatagtgatt agatctgaaa atttctcgaa caatgctaaa accataatag 900
tacagctaaa taaaactgta gaaattaatt gtacaagacc caacaacaac acaagcaaag 960
gtatacatat gggaccaggg agggcatttt atgcaacagg aagaatagta ggagatataa 1020
gacaagcaca ttgtaacatt agtaacgcag attggacaaa tactttaaaa caggtagcta 1080
ggaaattaag ggaacaatat gtgaataaaa caatagcctt taagccaccc tcaggagggg 1140
acccagaagt tgtactgcac acttttaatt gtagagggga atttttctac tgtaatttat 1200
caagaatgtt taatagtagt tttaattcaa cacaactgtc taattattca gaagatactg 1260
ggaccatcac agtcccatgc agaataaaac aatttataaa catgtggcag gaagtaggaa 1320
aagcaatgta tgcccctccc atcagaggag aaattaattg ttcatcaaag attacaggat 1380
tgttattaac aagagacggt ggcaatagca atgggactga gattttcaga cctggaggag 1440
gagatatgag ggacaattgg agaagtgaat tatacaaata taaagtagta agaattgaac 1500
cattaggatt agcacccacc aaggcaaaga gaagagtggt gcagagagaa aaaagagcag 1560
cagtgacaat gggagcaatg ttccctgggt tcttgggagc agcaggaagc act 1613
<210> 128
<211> 484
<212> PRT
<213> Human
Claims (63)
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US47781503P | 2003-06-12 | 2003-06-12 | |
US60/477,815 | 2003-06-12 |
Publications (1)
Publication Number | Publication Date |
---|---|
KR20060041179A true KR20060041179A (en) | 2006-05-11 |
Family
ID=33551767
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
KR1020057023830A KR20060041179A (en) | 2003-06-12 | 2004-06-10 | Hiv-1 envelope glycoproteins having unusual disulfide structure |
Country Status (9)
Country | Link |
---|---|
US (1) | US20050025779A1 (en) |
EP (1) | EP1633308A4 (en) |
KR (1) | KR20060041179A (en) |
CN (1) | CN1809381A (en) |
AU (1) | AU2004247146A1 (en) |
CA (1) | CA2528005A1 (en) |
IL (1) | IL172273A0 (en) |
MX (1) | MXPA05013334A (en) |
WO (1) | WO2004110384A2 (en) |
Families Citing this family (8)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20050287068A1 (en) * | 2004-06-17 | 2005-12-29 | Bot Adrian I | Efficacy of active immunotherapy by integrating diagnostic with therapeutic methods |
WO2006122822A2 (en) | 2005-05-20 | 2006-11-23 | Lonza Biologics Plc. | High-level expression of recombinant antibody in a mammalian host cell |
WO2007030518A2 (en) * | 2005-09-06 | 2007-03-15 | The Government Of The United States Of America As Represented By The Secretary Of The Department Of Health And Human Services | Conformationally stabilized hiv envelope immunogens and triggering hiv-1 envelope to reveal cryptic v3-loop epitopes |
US20110195089A1 (en) * | 2008-10-08 | 2011-08-11 | The Adminstrators Of The Tulane Educational Fund | Compositions, methods, and kits for enhancing the immunogenicity of pathogenic antigens |
WO2012139099A2 (en) * | 2011-04-08 | 2012-10-11 | Duke University | Herpes simplex virus vaccine |
CA2841376C (en) | 2011-07-05 | 2023-02-28 | Duke University | N-terminal deleted gp120 immunogens |
US10092638B2 (en) | 2011-10-03 | 2018-10-09 | Duke University | GP120 immunogens and methods inducing neutralizing antibodies to human immunodeficiency virus |
CN108640977A (en) * | 2018-06-18 | 2018-10-12 | 上海大学 | Specifically bind the polypeptide of envelope glycoprotein gp120 and its application on HIV |
Family Cites Families (8)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US4376110A (en) * | 1980-08-04 | 1983-03-08 | Hybritech, Incorporated | Immunometric assays using monoclonal antibodies |
US4366241A (en) * | 1980-08-07 | 1982-12-28 | Syva Company | Concentrating zone method in heterogeneous immunoassays |
US4517288A (en) * | 1981-01-23 | 1985-05-14 | American Hospital Supply Corp. | Solid phase system for ligand assay |
CA1291031C (en) * | 1985-12-23 | 1991-10-22 | Nikolaas C.J. De Jaeger | Method for the detection of specific binding agents and their correspondingbindable substances |
DE69130741T2 (en) * | 1990-05-16 | 1999-07-29 | Dana-Farber Cancer Institute, Boston, Mass. | IMMUNOGENIC PEPTIDES, ANTIBODIES AND THEIR USE IN RELATION TO THE CD4 RECEPTOR BINDING |
ZA975889B (en) * | 1996-07-08 | 1998-02-23 | Genentech Inc | HIV envelope polypeptides and vaccine. |
US6716429B1 (en) * | 1997-10-01 | 2004-04-06 | Dana-Farber Cancer Institute, Inc. | Stabilization of envelope glycoprotein trimers by disulfide bonds introduced into a gp 41 glycoprotein ectodomain |
WO2001000648A1 (en) * | 1999-06-25 | 2001-01-04 | Progenics Pharmaceuticals, Inc. | Stabilized viral envelope proteins and uses thereof |
-
2004
- 2004-06-10 MX MXPA05013334A patent/MXPA05013334A/en not_active Application Discontinuation
- 2004-06-10 KR KR1020057023830A patent/KR20060041179A/en not_active Application Discontinuation
- 2004-06-10 AU AU2004247146A patent/AU2004247146A1/en not_active Abandoned
- 2004-06-10 EP EP04755049A patent/EP1633308A4/en not_active Withdrawn
- 2004-06-10 CA CA002528005A patent/CA2528005A1/en not_active Abandoned
- 2004-06-10 US US10/866,527 patent/US20050025779A1/en not_active Abandoned
- 2004-06-10 WO PCT/US2004/018672 patent/WO2004110384A2/en active Application Filing
- 2004-06-10 CN CNA2004800163060A patent/CN1809381A/en active Pending
-
2005
- 2005-11-30 IL IL172273A patent/IL172273A0/en unknown
Also Published As
Publication number | Publication date |
---|---|
MXPA05013334A (en) | 2006-05-19 |
IL172273A0 (en) | 2006-04-10 |
WO2004110384A2 (en) | 2004-12-23 |
CN1809381A (en) | 2006-07-26 |
EP1633308A2 (en) | 2006-03-15 |
EP1633308A4 (en) | 2008-06-25 |
CA2528005A1 (en) | 2004-12-23 |
AU2004247146A1 (en) | 2004-12-23 |
WO2004110384A3 (en) | 2005-06-02 |
US20050025779A1 (en) | 2005-02-03 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
EP0778888B1 (en) | Antigenically-marked non-infectious retrovirus-like particles | |
AP1282A (en) | HIV envelope polypeptides and vaccine. | |
US10081658B2 (en) | Truncated HIV envelope proteins (ENV), methods and compositions related thereto | |
US6121021A (en) | Constitutive expression of non-infectious HIV-like particles | |
US6585979B1 (en) | HIV envelope polypeptides and immunogenic composition | |
AU784635B2 (en) | The genome of the HIV-1 inter-subtype (C/B') and use thereof | |
US8017126B2 (en) | Modified HIV-1 envelope proteins | |
KR20060041179A (en) | Hiv-1 envelope glycoproteins having unusual disulfide structure | |
NZ543381A (en) | Materials and methods for immunizing against FIV infection | |
US20040115621A1 (en) | Ancestral viruses and vaccines | |
JP2003523188A (en) | AIDS ancestral virus and vaccine | |
US7700726B2 (en) | HIV antisense proteins | |
CA2320383A1 (en) | Hiv chemokines |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
WITN | Application deemed withdrawn, e.g. because no request for examination was filed or no examination fee was paid |