CN1244896B - 非-m、非-o hiv-1毒株、片段及应用 - Google Patents
非-m、非-o hiv-1毒株、片段及应用 Download PDFInfo
- Publication number
- CN1244896B CN1244896B CN971813809A CN97181380A CN1244896B CN 1244896 B CN1244896 B CN 1244896B CN 971813809 A CN971813809 A CN 971813809A CN 97181380 A CN97181380 A CN 97181380A CN 1244896 B CN1244896 B CN 1244896B
- Authority
- CN
- China
- Prior art keywords
- leu
- gly
- glu
- ile
- lys
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Expired - Lifetime
Links
- 241000713772 Human immunodeficiency virus 1 Species 0.000 title claims abstract description 68
- 239000012634 fragment Substances 0.000 title abstract description 13
- 150000007523 nucleic acids Chemical group 0.000 claims description 94
- 241001430294 unidentified retrovirus Species 0.000 claims description 10
- 239000002773 nucleotide Substances 0.000 claims description 9
- 125000003729 nucleotide group Chemical group 0.000 claims description 9
- 230000000813 microbial effect Effects 0.000 claims description 2
- 108090000623 proteins and genes Proteins 0.000 abstract description 18
- 102000004169 proteins and genes Human genes 0.000 abstract description 13
- 230000003321 amplification Effects 0.000 abstract description 11
- 238000003199 nucleic acid amplification method Methods 0.000 abstract description 11
- 230000000405 serological effect Effects 0.000 abstract description 10
- 208000031886 HIV Infections Diseases 0.000 abstract description 8
- 239000003153 chemical reaction reagent Substances 0.000 abstract description 8
- 241000713340 Human immunodeficiency virus 2 Species 0.000 abstract description 7
- 238000003745 diagnosis Methods 0.000 abstract description 5
- 239000003795 chemical substances by application Substances 0.000 abstract description 4
- 230000002163 immunogen Effects 0.000 abstract description 3
- 230000001177 retroviral effect Effects 0.000 abstract description 2
- 108020004707 nucleic acids Proteins 0.000 description 92
- 102000039446 nucleic acids Human genes 0.000 description 92
- 108090000765 processed proteins & peptides Proteins 0.000 description 49
- 102220023257 rs387907546 Human genes 0.000 description 42
- 102220369446 c.1274G>A Human genes 0.000 description 30
- 150000001413 amino acids Chemical class 0.000 description 29
- 210000002966 serum Anatomy 0.000 description 28
- 241000700605 Viruses Species 0.000 description 25
- 102220369447 c.1352G>A Human genes 0.000 description 23
- 102220369445 c.668T>C Human genes 0.000 description 22
- 238000012360 testing method Methods 0.000 description 22
- 208000030507 AIDS Diseases 0.000 description 21
- 108020004414 DNA Proteins 0.000 description 17
- 230000035772 mutation Effects 0.000 description 17
- 102220023258 rs387907548 Human genes 0.000 description 16
- 238000000034 method Methods 0.000 description 15
- 230000002068 genetic effect Effects 0.000 description 14
- 241000725303 Human immunodeficiency virus Species 0.000 description 13
- 238000004458 analytical method Methods 0.000 description 11
- 208000015181 infectious disease Diseases 0.000 description 11
- 239000000523 sample Substances 0.000 description 11
- 238000006243 chemical reaction Methods 0.000 description 10
- 102220023256 rs387907547 Human genes 0.000 description 10
- 101800001690 Transmembrane protein gp41 Proteins 0.000 description 9
- 230000004087 circulation Effects 0.000 description 8
- 210000004698 lymphocyte Anatomy 0.000 description 8
- 238000011160 research Methods 0.000 description 8
- 208000011580 syndromic disease Diseases 0.000 description 8
- 230000000890 antigenic effect Effects 0.000 description 7
- 230000009257 reactivity Effects 0.000 description 7
- 210000004369 blood Anatomy 0.000 description 6
- 239000008280 blood Substances 0.000 description 6
- 238000001514 detection method Methods 0.000 description 6
- 238000005516 engineering process Methods 0.000 description 6
- 102220004457 rs11567847 Human genes 0.000 description 6
- 238000001262 western blot Methods 0.000 description 6
- 241000282577 Pan troglodytes Species 0.000 description 5
- 210000004027 cell Anatomy 0.000 description 5
- 230000004069 differentiation Effects 0.000 description 5
- 239000000284 extract Substances 0.000 description 5
- 102000004196 processed proteins & peptides Human genes 0.000 description 5
- 239000000047 product Substances 0.000 description 5
- 102000004190 Enzymes Human genes 0.000 description 4
- 108090000790 Enzymes Proteins 0.000 description 4
- 239000000427 antigen Substances 0.000 description 4
- 102000036639 antigens Human genes 0.000 description 4
- 108091007433 antigens Proteins 0.000 description 4
- 238000009396 hybridization Methods 0.000 description 4
- 239000000203 mixture Substances 0.000 description 4
- 108700004029 pol Genes Proteins 0.000 description 4
- 101150088264 pol gene Proteins 0.000 description 4
- 108010072476 SIV envelope protein gp41 Proteins 0.000 description 3
- 239000013256 coordination polymer Substances 0.000 description 3
- 201000010099 disease Diseases 0.000 description 3
- 208000037265 diseases, disorders, signs and symptoms Diseases 0.000 description 3
- 238000013412 genome amplification Methods 0.000 description 3
- 238000000338 in vitro Methods 0.000 description 3
- 238000004519 manufacturing process Methods 0.000 description 3
- 238000000926 separation method Methods 0.000 description 3
- 230000002103 transcriptional effect Effects 0.000 description 3
- LGGRPYXPOUIMKG-OJICBBQQSA-N (2S)-2-[[(2S)-2-[[(2S,3S)-2-[[(2S,3R)-2-[[2-[[(2S,3S)-2-[[(2S,3S)-2-[[(2S)-4-amino-2-[[(2S)-6-amino-2-[[(2S,3R)-2-[[(2S,3R)-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-[[2-[[(2S)-1-[2-[[(2S,3S)-2-[[(2S)-2-[[(2S,3S)-2-[[(2S)-2-[[(2S)-6-amino-2-[[(2S)-2-[[(2S)-6-amino-2-[[(2S)-4-amino-2-[[(2S)-2-[[(2S)-4-amino-2-[[(2S)-1-[(2S)-2-[[(2S,3R)-2-amino-3-hydroxybutanoyl]amino]-5-(diaminomethylideneamino)pentanoyl]pyrrolidine-2-carbonyl]amino]-4-oxobutanoyl]amino]-3-(4-hydroxyphenyl)propanoyl]amino]-4-oxobutanoyl]amino]hexanoyl]amino]-5-(diaminomethylideneamino)pentanoyl]amino]hexanoyl]amino]-5-(diaminomethylideneamino)pentanoyl]amino]-3-methylpentanoyl]amino]-3-(1H-imidazol-4-yl)propanoyl]amino]-3-methylpentanoyl]amino]acetyl]pyrrolidine-2-carbonyl]amino]acetyl]amino]-5-(diaminomethylideneamino)pentanoyl]amino]propanoyl]amino]-3-phenylpropanoyl]amino]-3-(4-hydroxyphenyl)propanoyl]amino]-3-hydroxybutanoyl]amino]-3-hydroxybutanoyl]amino]hexanoyl]amino]-4-oxobutanoyl]amino]-3-methylpentanoyl]amino]-3-methylpentanoyl]amino]acetyl]amino]-3-hydroxybutanoyl]amino]-3-methylpentanoyl]amino]-5-(diaminomethylideneamino)pentanoyl]amino]-N-[(2S)-1-[[(2S)-1-amino-3-(1H-imidazol-4-yl)-1-oxopropan-2-yl]amino]-1-oxopropan-2-yl]pentanediamide Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(=O)N2CCC[C@H]2C(=O)NCC(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](C)C(=O)N[C@@H](CC3=CC=CC=C3)C(=O)N[C@@H](CC4=CC=C(C=C4)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](C)C(=O)N[C@@H](CC5=CNC=N5)C(=O)N)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC6=CC=C(C=C6)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@@H]7CCCN7C(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H]([C@@H](C)O)N LGGRPYXPOUIMKG-OJICBBQQSA-N 0.000 description 2
- IJGRMHOSHXDMSA-UHFFFAOYSA-N Atomic nitrogen Chemical compound N#N IJGRMHOSHXDMSA-UHFFFAOYSA-N 0.000 description 2
- 241000282693 Cercopithecidae Species 0.000 description 2
- HEDRZPFGACZZDS-UHFFFAOYSA-N Chloroform Chemical compound ClC(Cl)Cl HEDRZPFGACZZDS-UHFFFAOYSA-N 0.000 description 2
- 239000012472 biological sample Substances 0.000 description 2
- 239000011248 coating agent Substances 0.000 description 2
- 238000000576 coating method Methods 0.000 description 2
- 238000012790 confirmation Methods 0.000 description 2
- 238000010276 construction Methods 0.000 description 2
- 239000012228 culture supernatant Substances 0.000 description 2
- 238000013016 damping Methods 0.000 description 2
- 230000000694 effects Effects 0.000 description 2
- 108700004025 env Genes Proteins 0.000 description 2
- 101150030339 env gene Proteins 0.000 description 2
- 239000012530 fluid Substances 0.000 description 2
- 108700004026 gag Genes Proteins 0.000 description 2
- 101150098622 gag gene Proteins 0.000 description 2
- 239000007788 liquid Substances 0.000 description 2
- 238000002156 mixing Methods 0.000 description 2
- 108700004028 nef Genes Proteins 0.000 description 2
- 101150023385 nef gene Proteins 0.000 description 2
- 230000003287 optical effect Effects 0.000 description 2
- 239000002574 poison Substances 0.000 description 2
- 231100000614 poison Toxicity 0.000 description 2
- 229920001184 polypeptide Polymers 0.000 description 2
- 238000002360 preparation method Methods 0.000 description 2
- 108700004030 rev Genes Proteins 0.000 description 2
- 101150098213 rev gene Proteins 0.000 description 2
- 238000010839 reverse transcription Methods 0.000 description 2
- 238000005070 sampling Methods 0.000 description 2
- 239000007790 solid phase Substances 0.000 description 2
- 241000894007 species Species 0.000 description 2
- 108700004027 tat Genes Proteins 0.000 description 2
- 101150098170 tat gene Proteins 0.000 description 2
- 108700026220 vif Genes Proteins 0.000 description 2
- 101150059019 vif gene Proteins 0.000 description 2
- 230000003612 virological effect Effects 0.000 description 2
- 108700026215 vpr Genes Proteins 0.000 description 2
- 101150024249 vpr gene Proteins 0.000 description 2
- 108700026222 vpu Genes Proteins 0.000 description 2
- 101150090490 vpu gene Proteins 0.000 description 2
- 102000040350 B family Human genes 0.000 description 1
- 108091072128 B family Proteins 0.000 description 1
- 241001481710 Cerambycidae Species 0.000 description 1
- 241000282556 Cercocebus atys Species 0.000 description 1
- 108091035707 Consensus sequence Proteins 0.000 description 1
- 101710095468 Cyclase Proteins 0.000 description 1
- 230000004544 DNA amplification Effects 0.000 description 1
- 108010014303 DNA-directed DNA polymerase Proteins 0.000 description 1
- 102000016928 DNA-directed DNA polymerase Human genes 0.000 description 1
- 238000002965 ELISA Methods 0.000 description 1
- LFQSCWFLJHTTHZ-UHFFFAOYSA-N Ethanol Chemical compound CCO LFQSCWFLJHTTHZ-UHFFFAOYSA-N 0.000 description 1
- 241000282324 Felis Species 0.000 description 1
- XITLYYAIPBBHPX-ZKWXMUAHSA-N Gln-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H](N)CCC(N)=O XITLYYAIPBBHPX-ZKWXMUAHSA-N 0.000 description 1
- 208000037357 HIV infectious disease Diseases 0.000 description 1
- 241000714260 Human T-lymphotropic virus 1 Species 0.000 description 1
- 241000714259 Human T-lymphotropic virus 2 Species 0.000 description 1
- 206010020852 Hypertonia Diseases 0.000 description 1
- 102100034343 Integrase Human genes 0.000 description 1
- 150000008536 L-asparagines Chemical class 0.000 description 1
- 208000032420 Latent Infection Diseases 0.000 description 1
- AZLASBBHHSLQDB-GUBZILKMSA-N Leu-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H](N)CC(C)C AZLASBBHHSLQDB-GUBZILKMSA-N 0.000 description 1
- YSZNURNVYFUEHC-BQBZGAKWSA-N Lys-Ser Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CO)C(O)=O YSZNURNVYFUEHC-BQBZGAKWSA-N 0.000 description 1
- 241000124008 Mammalia Species 0.000 description 1
- 206010027336 Menstruation delayed Diseases 0.000 description 1
- QXOHLNCNYLGICT-YFKPBYRVSA-N Met-Gly Chemical compound CSCC[C@H](N)C(=O)NCC(O)=O QXOHLNCNYLGICT-YFKPBYRVSA-N 0.000 description 1
- 241000581002 Murex Species 0.000 description 1
- 108091028043 Nucleic acid sequence Proteins 0.000 description 1
- 108091034117 Oligonucleotide Proteins 0.000 description 1
- 241000282579 Pan Species 0.000 description 1
- 108010033276 Peptide Fragments Proteins 0.000 description 1
- 102000007079 Peptide Fragments Human genes 0.000 description 1
- 208000037581 Persistent Infection Diseases 0.000 description 1
- ISWSIDIOOBJBQZ-UHFFFAOYSA-N Phenol Chemical compound OC1=CC=CC=C1 ISWSIDIOOBJBQZ-UHFFFAOYSA-N 0.000 description 1
- 241000288906 Primates Species 0.000 description 1
- 101710149951 Protein Tat Proteins 0.000 description 1
- 101710149136 Protein Vpr Proteins 0.000 description 1
- 108010092799 RNA-directed DNA polymerase Proteins 0.000 description 1
- 229920002684 Sepharose Polymers 0.000 description 1
- VMHLLURERBWHNL-UHFFFAOYSA-M Sodium acetate Chemical compound [Na+].CC([O-])=O VMHLLURERBWHNL-UHFFFAOYSA-M 0.000 description 1
- 101710136739 Teichoic acid poly(glycerol phosphate) polymerase Proteins 0.000 description 1
- 241000532784 Thelia <leafhopper> Species 0.000 description 1
- 108010065667 Viral Matrix Proteins Proteins 0.000 description 1
- 101710201961 Virion infectivity factor Proteins 0.000 description 1
- 241000726445 Viroids Species 0.000 description 1
- 230000005856 abnormality Effects 0.000 description 1
- 230000036436 anti-hiv Effects 0.000 description 1
- 230000000798 anti-retroviral effect Effects 0.000 description 1
- 230000000692 anti-sense effect Effects 0.000 description 1
- OJYGBLRPYBAHRT-IPQSZEQASA-N chloralose Chemical compound O1[C@H](C(Cl)(Cl)Cl)O[C@@H]2[C@@H](O)[C@@H]([C@H](O)CO)O[C@@H]21 OJYGBLRPYBAHRT-IPQSZEQASA-N 0.000 description 1
- 230000000295 complement effect Effects 0.000 description 1
- 238000004925 denaturation Methods 0.000 description 1
- 230000036425 denaturation Effects 0.000 description 1
- 238000002405 diagnostic procedure Methods 0.000 description 1
- 208000037771 disease arising from reactivation of latent virus Diseases 0.000 description 1
- 239000012153 distilled water Substances 0.000 description 1
- 238000009826 distribution Methods 0.000 description 1
- 239000003814 drug Substances 0.000 description 1
- 229940079593 drug Drugs 0.000 description 1
- 238000001035 drying Methods 0.000 description 1
- 238000002474 experimental method Methods 0.000 description 1
- 238000000605 extraction Methods 0.000 description 1
- 238000009472 formulation Methods 0.000 description 1
- 238000003306 harvesting Methods 0.000 description 1
- 208000033519 human immunodeficiency virus infectious disease Diseases 0.000 description 1
- 230000005764 inhibitory process Effects 0.000 description 1
- 108010005942 methionylglycine Proteins 0.000 description 1
- LWGJTAZLEJHCPA-UHFFFAOYSA-N n-(2-chloroethyl)-n-nitrosomorpholine-4-carboxamide Chemical compound ClCCN(N=O)C(=O)N1CCOCC1 LWGJTAZLEJHCPA-UHFFFAOYSA-N 0.000 description 1
- 229910052757 nitrogen Inorganic materials 0.000 description 1
- 230000000803 paradoxical effect Effects 0.000 description 1
- 210000005105 peripheral blood lymphocyte Anatomy 0.000 description 1
- 210000003819 peripheral blood mononuclear cell Anatomy 0.000 description 1
- 239000012071 phase Substances 0.000 description 1
- 230000000379 polymerizing effect Effects 0.000 description 1
- 239000002244 precipitate Substances 0.000 description 1
- 239000002243 precursor Substances 0.000 description 1
- 230000001902 propagating effect Effects 0.000 description 1
- 230000008521 reorganization Effects 0.000 description 1
- 230000000452 restraining effect Effects 0.000 description 1
- 238000003757 reverse transcription PCR Methods 0.000 description 1
- 238000012216 screening Methods 0.000 description 1
- 230000035945 sensitivity Effects 0.000 description 1
- 238000012163 sequencing technique Methods 0.000 description 1
- 235000017281 sodium acetate Nutrition 0.000 description 1
- 239000001632 sodium acetate Substances 0.000 description 1
- 238000001228 spectrum Methods 0.000 description 1
- 208000024891 symptom Diseases 0.000 description 1
- 238000010189 synthetic method Methods 0.000 description 1
- 238000010257 thawing Methods 0.000 description 1
- 238000013519 translation Methods 0.000 description 1
- 230000029812 viral genome replication Effects 0.000 description 1
- 230000009385 viral infection Effects 0.000 description 1
- XLYOFNOQVPJJNP-UHFFFAOYSA-N water Chemical compound O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 description 1
Images
Classifications
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K14/00—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
- C07K14/005—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from viruses
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61P—SPECIFIC THERAPEUTIC ACTIVITY OF CHEMICAL COMPOUNDS OR MEDICINAL PREPARATIONS
- A61P31/00—Antiinfectives, i.e. antibiotics, antiseptics, chemotherapeutics
- A61P31/12—Antivirals
- A61P31/14—Antivirals for RNA viruses
- A61P31/18—Antivirals for RNA viruses for HIV
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N7/00—Viruses; Bacteriophages; Compositions thereof; Preparation or purification thereof
-
- G—PHYSICS
- G01—MEASURING; TESTING
- G01N—INVESTIGATING OR ANALYSING MATERIALS BY DETERMINING THEIR CHEMICAL OR PHYSICAL PROPERTIES
- G01N33/00—Investigating or analysing materials by specific methods not covered by groups G01N1/00 - G01N31/00
- G01N33/48—Biological material, e.g. blood, urine; Haemocytometers
- G01N33/50—Chemical analysis of biological material, e.g. blood, urine; Testing involving biospecific ligand binding methods; Immunological testing
- G01N33/53—Immunoassay; Biospecific binding assay; Materials therefor
- G01N33/569—Immunoassay; Biospecific binding assay; Materials therefor for microorganisms, e.g. protozoa, bacteria, viruses
- G01N33/56983—Viruses
- G01N33/56988—HIV or HTLV
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61K—PREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
- A61K39/00—Medicinal preparations containing antigens or antibodies
- A61K2039/51—Medicinal preparations containing antigens or antibodies comprising whole cells, viruses or DNA/RNA
- A61K2039/525—Virus
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61K—PREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
- A61K39/00—Medicinal preparations containing antigens or antibodies
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N2740/00—Reverse transcribing RNA viruses
- C12N2740/00011—Details
- C12N2740/10011—Retroviridae
- C12N2740/15011—Lentivirus, not HIV, e.g. FIV, SIV
- C12N2740/15022—New viral proteins or individual genes, new structural or functional aspects of known viral proteins or genes
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N2740/00—Reverse transcribing RNA viruses
- C12N2740/00011—Details
- C12N2740/10011—Retroviridae
- C12N2740/16011—Human Immunodeficiency Virus, HIV
- C12N2740/16021—Viruses as such, e.g. new isolates, mutants or their genomic sequences
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N2740/00—Reverse transcribing RNA viruses
- C12N2740/00011—Details
- C12N2740/10011—Retroviridae
- C12N2740/16011—Human Immunodeficiency Virus, HIV
- C12N2740/16111—Human Immunodeficiency Virus, HIV concerning HIV env
- C12N2740/16122—New viral proteins or individual genes, new structural or functional aspects of known viral proteins or genes
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N2740/00—Reverse transcribing RNA viruses
- C12N2740/00011—Details
- C12N2740/10011—Retroviridae
- C12N2740/16011—Human Immunodeficiency Virus, HIV
- C12N2740/16211—Human Immunodeficiency Virus, HIV concerning HIV gagpol
- C12N2740/16222—New viral proteins or individual genes, new structural or functional aspects of known viral proteins or genes
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N2740/00—Reverse transcribing RNA viruses
- C12N2740/00011—Details
- C12N2740/10011—Retroviridae
- C12N2740/16011—Human Immunodeficiency Virus, HIV
- C12N2740/16311—Human Immunodeficiency Virus, HIV concerning HIV regulatory proteins
- C12N2740/16322—New viral proteins or individual genes, new structural or functional aspects of known viral proteins or genes
-
- Y—GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y10—TECHNICAL SUBJECTS COVERED BY FORMER USPC
- Y10S—TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y10S435/00—Chemistry: molecular biology and microbiology
- Y10S435/975—Kit
Landscapes
- Health & Medical Sciences (AREA)
- Life Sciences & Earth Sciences (AREA)
- Chemical & Material Sciences (AREA)
- Virology (AREA)
- Engineering & Computer Science (AREA)
- Organic Chemistry (AREA)
- Immunology (AREA)
- Molecular Biology (AREA)
- Medicinal Chemistry (AREA)
- General Health & Medical Sciences (AREA)
- Genetics & Genomics (AREA)
- Biochemistry (AREA)
- Biomedical Technology (AREA)
- Microbiology (AREA)
- Wood Science & Technology (AREA)
- Zoology (AREA)
- Urology & Nephrology (AREA)
- Biotechnology (AREA)
- Hematology (AREA)
- Bioinformatics & Cheminformatics (AREA)
- AIDS & HIV (AREA)
- Tropical Medicine & Parasitology (AREA)
- Proteomics, Peptides & Aminoacids (AREA)
- Biophysics (AREA)
- Cell Biology (AREA)
- Gastroenterology & Hepatology (AREA)
- General Engineering & Computer Science (AREA)
- Food Science & Technology (AREA)
- Physics & Mathematics (AREA)
- Analytical Chemistry (AREA)
- General Physics & Mathematics (AREA)
- Pathology (AREA)
- Oncology (AREA)
- Public Health (AREA)
- Chemical Kinetics & Catalysis (AREA)
- General Chemical & Material Sciences (AREA)
- Nuclear Medicine, Radiotherapy & Molecular Imaging (AREA)
- Pharmacology & Pharmacy (AREA)
- Animal Behavior & Ethology (AREA)
- Communicable Diseases (AREA)
Abstract
本发明涉及HIV-1非-M族非-O族的逆转录病毒毒株,尤其涉及一种称为YBF 30的毒株、其片段及其作为诊断和免疫原性制剂的应用。与M族和O族不同的HIV-2具有以下特征:对M族和O族的蛋白质几乎没有或没有血清学反应,而对源于YBF 30毒株或SIVCPZGAB毒株的蛋白质具有强烈的血清学反应;无M族和O族HIV-1的env和gag区引物引发的基因组扩增;源于YBF 30毒株的引物存在时的基因组扩增;与YBF 30毒株相比,包膜基因产物的同源性大于70%。
Description
本发明涉及非-M、非-O HIV-1族的逆转录病毒毒株,尤其涉及一种称为YBF30的毒株,涉及其片段及其作为诊断试剂和免疫原性制剂的应用。
人类获得性免疫缺陷病毒HIV-1和HIV-2是逆转录慢病毒,这类病毒存在于许多非洲灵长类中。所有的这些病毒看来有一个共同的祖先;然而,要预先判断这些不同的病毒是在什么时期从这个前体分离是非常困难的。更为远缘但属于同族的其它病毒,存在于其它哺乳动物中(有蹄动物和猫科动物)。
所有这些病毒与长期感染有关;在天然感染的猴中症状的缺乏是普遍规律。
由于其与Sooty Mangabey(西部非洲)病毒很强的同源性,HIV-2的起源看来是较明确的,但在猴中没有发现与HIV-1密切关联的病毒。关系最为密切的病毒是在两只黑猩猩中发现的病毒(CPZGABSIV,ANT SIV)。
业已发现所有的慢病毒都显示出很大的遗传变异性,对这些获自许多不同地理位置的变种进行的种系发生学研究,已使得HIV-1的8个亚型(分化枝)得到区别分类,它们各自之间都是等距的。分化枝仅是一种变异性表达的数字表示;基于氨基酸而不是核酸的表型学分析得出了不同的结果(Korber等人,1994)。
亚型的证实与一项种系发生学分析是一致的,到目前为止,种系发生学分析还没有任何病理生理学交联,但相反,却有地理对应性。这是因为每一种亚型主要见于一个特定的地理区域。B亚型主要见于欧洲和美国,而两种亚型即E和B,见于泰国,且传染方式(事实上与特定人群相对应)与所发现的亚型之间有很强的相关性。在非洲发现的所有分化枝及其在世界其它地区的分布,反映了沉溺于高危险行为 的人们之间相互交流的可能性。主要的分化枝是分化枝A,之所以它是主要的,是因为在非洲的大部分地区都存在。在某些非洲国家已发现了很大程度的变异性(G.Myers,1994;P.M.Sharp等人,1994)。在西中部非洲国家如中非共和国(Murphy等人,1993)和喀麦隆(Nkengasong等人,1994),某些亚型的特征已得以表征。
最后,携带HIV-1的病毒变种的病人已得到表征,对在法国市场销售的特定试剂盒来说,他们的血清已提出了检测问题,而且他们的确证性Western印迹是不典型的(Loussert-Ajaka等人,1994;Simon等人,1994;PCT国际申请WO 96/27013)。
对这些变种的分析证实了这样一个事实,即:1型HIV病毒应被分为两个族,即M(主要)族和O(外围层)族,后者包括这些分离株,正如Charneau等人1994年提出的那样。对已知的O族病毒序列进行的同义突变/非同义突变比率分析表明,这一新族也是古老的,即使不比M族更为古老(Loussert-Ajaka等人,1995)。到目前为止,它的低流行率-即:在喀麦隆被HIV-1感染病人的8%(Zekeng等人,1994)以及在法国确定的18例,被认为是由于纯粹的流行病学因素。
HIV-1的这两个族形成了一株形状象双星形骨针的树(图9-19)。两个分离株-即从加蓬的一只黑猩猩身上得到的CPZGAB SIV(Huet等人,1990)以及从Antwerp动物园的一只黑猩猩身上得到的CPZANT SIV,具有的序列和遗传结构与HIV-1有非常密切的关联,但它们并不属于这两个族中的任何一族,且在进化系统树上形成两个新枝。
对于开发用于检测HIV感染的足够敏感和特异的试剂,也就是说不会产生假阴性或假阳性结果的试剂,以及对于开发对既不属于M族、也不属于O族的亚型有保护作用的组合物,新亚型的证实是非常重要的。
因此,申请者自己制定了一个目标,即提供一种非-M、非-O毒株,以及来源于该毒株的序列,后者适用于检测非-M、非-OHIV变种,而且不会产生假阴性或假阳性结果。为了达到这个目的, 发明者特地建立了一个规则系统,用于区别并确证M族和O族HIV-1感染,据此使他们能够选择非-M、非-O变种。
本发明涉及一种非-M、非-O HIV-1毒株,它显示也一种逆转录病毒的形态和免疫学特征,该逆转录病毒已于1996年7月2日保藏在国立微生物保藏中心(CNCM),保藏号I-1753(命名为YBF30),由巴斯德研究所保管。
非-M、非-O变种被认为是指一种1型HIV,从血清学和分子方面不能将其识别归属于这些族中的任何一族。
本发明还涉及以上定义的毒株(SEQ ID No.1)的完整核苷酸序列,以及涉及源自该毒株的、大小至少为10个核苷酸的核酸片段。
可能会提到的该类型的片段是:
-YBF 30LTR(SEQ ID No.2)
-YBF 30GAG(SEQ ID No.3)(gag基因)
-YBF 30POL(SEQ ID No.5)(pol基因)
-YBF 30VIF(SEQ ID No.7)(vif基因)
-YBF 30VPR(SEQ ID No.9)(vpr基因)
-YBF 30VPU(SEQ ID No.11)(vpu基因)
-YBF 30TAT(SEQ ID No.13)(tat基因)
-YBF 30REV(SEQ ID No.15)(rev基因)
-YBF 30ENV gp160(SEQ ID No.17)(env基因)
-YBF 30NEF(SEQ ID No.19)(nef基因)
-SEQ ID Nos.21-57,也被分别命名,YLG、LPBS.1、GAG YAS1.1、GAG Y AS1、GAG 6、GAG Y S1、GAG Y S1.1、GAG YS1.2、YRT AS1.3、YRT AS1.2、YRT AS1.1、YRT 2、YRT AS1、YRT 2.1、YRT2.2、YRT2.3、YRT2.4、4481-1、4481-2、4235.1、4235.2、4235.3、4235.4、SK69.6、SK69.5、SK69.4、SK69.3、SK69.2、SK69.1、SK68.1、SK68.2、SK68.3、LAI AS1.3、LSIAS1.2、LSI AS1.1、LSI A1、YLPA以及与以上核苷酸序列之一不同,或与这些序列之一不互补,但能够与来自一种非-M、非-O HIV-1病毒的核酸序列特异性杂交的任何序列。
这样的序列可被用于非-M、非-O HIV-1的特异性鉴别,且作为诊断试剂或单独或与其它试剂合在一起,用于任何HIV-1的特异性鉴别。
这些序列尤其可作为引物或探针用于诊断试验,包括或是与待测病毒序列的直接杂交,或是该病毒序列的扩增,这些试验应用一种包含至少10个核苷酸的寡核苷酸,后者包括在以上序列中的任何一个序列中,尤其是上述序列之一SEQ ID No.21-57。
本发明还涉及HIV-1病毒,其特征在于它们与M族和O族都不相同,而且显示以下特征:
*对M和O族的蛋白质几乎没有或没有血清学反应性,而对源自YBF30毒株或CPZGAB SIV毒株的蛋白质却有很强的血清学反应性;
*当应用来自M和O族HIV-1病毒env和gag区的引物时,无基因组扩增;
*当来自以上定义的YBF30毒株的引物存在时,基因组扩增;以及
*相对于YBF30毒株,包膜基因产物的同源性大于70%。
本发明还涉及上述序列的应用,以提供HIV-1型核酸序列的杂交和/或基因扩增的一种方法,这些方法可用于体外诊断非-M、非-OHIV-1型病毒在一个体的潜在感染。
这种体外诊断方法是应用一种生物样本(血清或循环淋巴细胞)进行的,它包括:
●提取待测核酸的步骤,核酸属于病毒的基因组,该病毒有可能存在于生物样本中,以及,在适当的时候,应用一种逆转录酶处理核酸的步骤,如果该核酸为RNA形式的话。
●至少一个循环,包括以下步骤:使核酸变性,与至少一个根据本发明的序列杂交,以及,在适当的时候,延伸杂交物,后者是在适当的试剂(聚合剂,如DNA聚合酶和dNTP)存在时形成的,以及
●检测属于非-M、非-O HIV-1族型病毒的基因组核酸的可能 存在的步骤。
应用以下条件用于利用来源于YBF30毒株的引物进行PCR:
-根据酚/氯仿技术提取淋巴细胞DNA,在260nm波长用分光度法对其定量。所有的扩增都是应用Perkin Elmer 2400热循环仪进行的。
-长片段(9kb)PCR的进行应用了XL PCR试剂盒(PerkinElmer)按照生产商的条件、以及应用了dNTP、提供的缓冲液以及Perkin Elmer的“热启动(hot start)”,这一长片段PCR的扩增循环是:
●94℃,变性2分钟一个循环,
●然后16个循环:94℃15秒钟,55℃15秒钟,68℃8分钟,
●然后24个循环:94℃15秒钟,55℃15秒钟,68℃8分钟,每个循环另外增加15秒钟(加长)。
-对长片段PCR的扩增产物进行巢式PCR。进行巢式PCR的条件如下:
●“扩展高度精确PCR系统(Expand High Fidelity PCRsystem)”Tag聚合酶缓冲液和酶来自Boehringer Mannheim,按照生产商的说明进行,dNTP和“热启动”来自Perkin Elmer,
●每dNTP各200μmol,根据本发明的每种引物各20pmol,DNA 5μl,10×PCR缓冲液10μl,以及Tag聚合酶2.6单位,体积为100μl,
●扩增:94℃2分钟一个循环,随后38个循环:94℃15秒钟,55℃15秒钟,72℃延伸一段时间-根据待扩增PCR产物的大小,时间长短不一(从30秒至2分钟),以及最后的延伸循环,72℃10分钟。
扩增产物优选应用直接测序检测。
本发明还涉及一种肽或一种肽片段,其特征在于它能被一种非-M、非-O HIV-1毒株表达,或应用一种上述核苷酸序列而被表达,而且它能够:(1)被上述非-M、非-O HIV-1病毒、尤其是YBF30毒株或该毒株的变种诱导产生的抗体所识别,这些抗体存在于用非-M、非-O HIV-1毒株感染后获得的生物标本中,和/或(2)诱导抗非-M、非-O HIV-1抗体的产生。
可能会提到的这种类型的肽,尤其是来自YBF30毒株的肽,有:gag基因表达的肽(SEQ ID No.4)、pol基因表达的肽(SEQ IDNo.6)、vif基因表达的肽(SEQ ID No.8)、vpr基因表达的肽(SEQID No.10)、vpu基因表达的肽(SEQ ID No.12)、tat基因表达的肽(SEQ ID No.14)、rev基因表达的肽(SEQ ID No.16)、env基因表达的肽(SEQ ID No.18)或其片段之一如V3环区的一个片段,即CTRPGNNTGGQVQIGPAMTFYNIEKIVGDIRQAYC(SEQ IDNo.58),以及nef基因表达的肽(SEQ ID No.20),或者能够识别上述非-M、非-O HIV-1感染期间产生的抗体的这些多肽的一个片段。
本发明还涉及免疫原性组合物,其包括根据本发明的核苷酸序列的一个或多个翻译产物,和/或上述、尤其是通过合成方法获得的肽中的一个。
本发明还涉及针对上述一个或多个肽的抗体,以及它们在提供用于在体外,尤其是特异性诊断HIV-1型病毒在一个体的感染的方法的用途,利用的方法是本领域熟练的技术人员所熟知的。
本发明包括所有的、能被从获自非-M、非-O HIV-1毒株感染后的感染血清中分离的抗体所识别的肽,以及能够被根据本发明的一种抗体所识别的肽。
本发明进一步涉及一种在体外诊断非-M、非-O HIV-1病毒的方法,该方法的特征在于:它包括使一种取自病人的生物标本接触根据权利要求10的抗体,其可能会与抗-CPZGAB SIV抗体结合,以及检测可能存在于该生物样本中的HIV-1抗原与该抗体之间形成的免疫复合物。
本发明还涉及一种用于诊断HIV-1的试剂盒,该试剂盒的特征在于它包括至少一种根据本发明的试剂。
除上述条款外,本发明还包括其它条款,这在以下描述中物是显而易见的,以下描述提及实施作为本发明之主题的方法的实例,并提及附图,其中:
-图1-7显示了不同的引物在YBF30毒株基因组上的位置;
-图8显示了YBF30毒株的基因组结构;
-图9-16描绘了与M族HIV-1和O族HIV-1相比,YBF30毒株不同基因的系统发生学分析(图9:1tr基因,图10:gag基因,图11:tat基因,图12:rev基因,图13:vif基因,图14:env gp120基因,图15:env gp41基因,图16:nef基因,图17:pol基因,图18:vpr基因,图19:vpu基因,);
-图20显示了YBF30与HIV-1/GPZGAB SIV之间的百分比遗传距离。
不过,理所当然应予理解的是,这些实施例只是以说明本发明的主题方式提供的,它们绝不对本发明构成限制。
实施例:获得根据本发明的非-M、非-O HIV-1变种(YBF30)及其应用
联系到在喀麦隆的对人类免疫缺陷病毒(HIV)感染进行的流行病学研究,这尤其是可能的,其流行病学是特别似非而是的。在这个国家,毒株的多样性非常突出,目前已知的HIV-1病毒M(主要)族的大多数亚型都有报道。O族(O代表外围层)高度多样性的HIV-1病毒的感染病例已有报道,几乎仅见于喀麦隆血统的病人。HIV-2、HTLV-1和HTLV-2亚型A和B感染病例也有报道。
以先前的血清学和基因型评估结果为基础,本发明者建立了一种区分并确证M族和O族HIV-1病毒感染的规则系统,以选择非-M、非-O变种。
这些方法被用到送至雅温得国家参比实验室检测HIV感染的标本上,使之有可能鉴定出一种高度多样性的HIV分离株,并明确用于鉴定一种新的HIV-1族的工具,其中将观察到的这种人类毒株YBF30与猿毒株CPZGAB SIV之间的同源性考虑在内。
I-在流行病学研究中从血清学上鉴定YBF30变种的方法
1)采集标本:
研究了1994年和1995年送至雅温得参比实验室以检测或确证HIV感染的全部成年病人血清(n=8831)。
2)从血清学上区别M族和O族HIV-1,并选择变种:
如果抗-HIV抗体检测为阳性(Genelvia Mixt间接混合HIV-1和HIV-2EIA,Sanofi-Pasteur,巴黎,法国),那么,根据与M族特异性抗原竞争的原理,将其与EIA试验结合(Wellcozyme Rec HIV-1,Murex,Dartford,英国)。
如果竞争Wellcozyme Rec HIV-1试验是阳性的,以光密度(OD)的反应性与阈值或截断值(CO)相比的比率大于5(CO/OD>5),那么,这份血清就被认为是HIV-1阳性,这一结果应在一种新标本上予以确证。
选择反应性比率大于5作为竞争试验是确证感染HIV-1的试验,是基于Bichat医院病毒实验室得到的经验:反应的比率>5的全部7200份样本都产生了强阳性HIV-1Western印迹(WB,New Lav Blot1,SDP,Marnes 1a Coquette)。除了HIV-1血清转换的病例外,确证为HIV阳性、而Wellcozyme比率<5的标本,相应于HIV-2感染或者O族HIV-1或其它HIV-1变种感染。
为了在进行混合EIA检测时消除假阳性反应,CO/OD比率<5的样本用第三代混合HIV-1/HIV-2EIA(Enzygnost Plus,Marburg,德国)的系统测试,包括M和O HIV-1族的抗原(MVP 5180毒株的重组gp41)。如果这项试验是阳性的,然后再进行一种区别HIV-1与HIV-2的快速试验(Multispot,SDP,Mrnes Ia Coquette)以及一种Western印迹(WB,New Lav Blot 1或2,SDP)。
3)从血清学上确证O族HIV-1与HIV-1变种感染
所有CO/DO比率<5、且被WB区别为阳性(阳性标准:2ENV+/-POL+/-GAG或1ENV+POL+/-GAG)和HIV-1的样本,都接受应用V3和跨膜区肽抗原的斑点印迹试验测试(InnoLia,Innogenetics,Ghent,比利时)。
4)O族和变种毒株的逆转录病毒分离
在喀麦隆用Ficoll-Hypaque梯度分离血清阳性病人的外周血单核细胞(PMBC),然后收集在液氮中保存并运往巴黎。
病人的PBMC在解冻后与血清阴性高加索供血者的淋巴细胞共同培养。在为期一个月的时间内,通过检测逆转录酶活性以及进行检测P24抗原的试验(Elavia P24 Polyclonal,SDP),以证实培养上清中的病毒复制。
5)序列
根据片段大小在1~1.4%浓度的琼脂糖凝胶上显现PCR产物,在3M乙酸钠(1∶10)和3体积的纯乙醇中沉淀,在-80℃孵育30分钟,然后13,000转/分离心20分钟。将沉淀干燥,然后溶于10μl蒸馏水(Sigma)中。根据生产商的说明,用“Qiauick凝胶提取试剂盒”(Qiagen)进行纯化;根据以往的描述(Loussert-Ajakat等人,1995),应用Applied Biosystem Dye Terminator试剂盒,在自动DNA测序仪上对产物进行测序;在Sequence Navigator软件(AppliedBiosystem)上分析核苷酸序列,应用GeneWorks软件(IntelligeneticsInc.)进行序列对比。
6)系统发生学分析:
应用CLUSTAL软件对序列进行多重对比,并将生物学和理论生物物理学实验室(Los Alamos,New Mexico,87545,美国)拥有的HIV序列汇集的序列对比作为参比模板。
应用PHYLIP软件进行系统发生学分析;首先应用DNADIST计算距离,然后应用NEIGBOR JOINTING或FITCH进行系统发生学分析:最后,应用DRAWTREE画系统树(图9~19)。图20还显示了遗传距离百分比。
首先应用SEQBOOT进行“引导程序(bootstrapping)”分析,随后DNADIST与NEIGHBOR JOINTING或FITCH。最后,应用CONSENS获得引导程序值。
II-检测O族和变种HIV病毒的研究结果:
在筛查中发现为阳性的3193份样本中,有174份样本被认为是有异常血清学反应的O族或M族,或被认为是变种。
III-显示异常血清学反应的非-O族和非-M族样本的检测
WB(蛋白质印迹)为HIV-1阳性、但在竞争EIA中反应性CO/CD比率<5的174份血清,应用来自M族、O族和CPZGAB SIV的V3肽,通过鉴别LIA斑点印迹进行了测试:
-7份没有与任何代表性肽(M,O或CPZGAB SIV)起反应。细胞收集的缺乏不能允许得出任何结论。
-82份与至少一种对应于O族毒株V3环的肽产生反应性。交叉反应的频率低,而且限于对应于共有V3区(11%)和CPZGAB SIV V3区(43%)的表位。
-84份血清没有与O族表位起反应。这些样本中的大多数获自表现艾滋病综合症的病人(75/84)。
-1份获自一位喀麦隆病人(NJ)的血清只与CPZGAB SIV肽起反应。这种与CPZAB SIV抗原的孤立反应性以往从未有描述。因为已从该病人身上取到淋巴细胞,所以有可能继续对这一被称为YBF30的毒株进行病毒学表征。
IV-对取自该病人的第一批标本(1995年5月)进行血清学和病毒学检测的结果(血清号:95-6295):
1)商品化ELISA试验(光密度/阈值)
阳性标准:OD/CO>1
Genelavia=>15
Wellcozyme CO/OD=1.55
Abbott Plus=>15
Behring Plus=4.2
2)Western印迹
New Lav 1 Oasteur WB:
160++,120++,68++,55+,41+,40+/-,34++,24++,18+
3)Inngenetics LIA斑点印迹
除CPZGAB SIV V3外,全部O族和B族条带均为阴性。
4)对M族和O族特异性肽进行的探索性血清学检查结果
对TOURS CHU病毒实验室Francis Barin教授开创的技术进行了修改(Barin F等人,1996);应用了合成的跨膜区肽(Biom*rieux)以建立一种区别M族和O族的试验。这项技术是以O族和M族跨膜gp41肽之间的抗体结合竞争为基础的,肽沉淀在固相,O族或是M族的gp41跨膜肽以更高的浓度存在于高渗性液体的反应相。结果如下表I所示,其中CP孔相当于100%抑制对照,CSP孔相当于0%抑制对照。
表I
6295份血清的族间O族M鉴别结果
这些结果证实,与固相肽(CSP)有很强的结合,以及由于M和O肽(CP)的联合加入而造成明显抑制,但无论是M肽还是O肽,都无明确的区别。因此这一血清学证据表明,感染毒株既不属于M族、也不属于O族。
考虑到InnoLia斑点印迹中CPZGAB SIV V3抗原的孤立反应性,同样依据肽之间的竞争,通过将gp41M、gp41O和gp41CPZGABSIV肽与其竞争,对该血清进行了研究。
取自名为‘Amandine’的黑猩猩的血清(M.Petters惠赠,他分离出了CPZGAB SIV毒株,AIDS 1992)之应用,初步使该技术得到证实。表II中,最低值(OD)表示与抗原的最高度结合。
表II应用Amandine黑猩猩血清和6295份血清
进行族间O-族M-CPZGAB SIV鉴别的结果
“Amandine”血清的反应性证实了根据本发明的试验及其正确性,并且显示,尽管该病人的血清同样与M和CPZGAB SIV肽起反应,但它与O肽并无交叉反应。
这些结果表明,M族gp41和CPZGAB SIV gp41肽对该病人血清有相似的抑制作用。感染毒株的抗原因此产生了抗体,后者以相似的方式识别M族和CPZBAG SIV gp41肽。
5)从淋巴细胞分离(1995年5月采样)中得出的结果
应用标准技术从1995年5月22日采集的淋巴细胞中分离了一种逆转录病毒。应用MT2细胞系培养,显示YBF30毒株没有形成任何合胞体(NSI)。
V-对第二份血液标本(1995年11月)(血清号95-3371)进行的血清学检查结果
1)Innogenetics LIA斑点印迹
除了CPZGAB SIV V3外,全部条带均为阴性。
2)对M族和O族特异性肽进行的探索性血清学检查结果
表III显示了应用3371份血清进行族间O-族M-CPZGAB SIVgp41鉴别的结果。
表III
应用3371份血清进行族间O-族M-CPZGAB SIV gp41鉴别的结果
对此新的血液标本(取自处于疾病终末期的同一病人)研究的结 果证实,CPZGAB SIV gp41肽显著抑制该病人的血清。
感染毒株的抗原因此诱导产生了优选地识别CPZGAB gp41肽的抗体。
3.淋巴细胞分离的结果(血样于95年11月采集(95-3371-YBF31))
应用标准技术从1995年11月采样的淋巴细胞中分离出一种逆转录病毒,命名为YBF31;其序列成分与YBF30相同。
VI-基因组扩增与YBF30的序列
用于全部PCR操作的DNA是从阳性培养结束时获得的细胞中提取的。
应用O族HIV-1引物进行的PCR在检测的不同区(gap、pol、env)都是阴性的。同样,应用M族HIV-1特异性引物进行的PCR也是阴性的。
O族PCR的扩增和杂交条件同Loussert-Ajaka,1995描述的一样。M族PCR的扩增和杂交条件同下文作者引用的一样。
这些M族引物的定位根据以下HIV-1HXB2序列:
-在env gp120:ED3/ED12(位置5956-5985;7822-7792);ED5/ED14(6556-6581;7960-7931);ED5/ED12;ED3/ED14;ES7/ES8(7001-7020;7667-7647)(Delwart等人,科学(Science)1993;262:1257-1261)。
-在env gp41:先是PCR、ED3/M29,随后是巢式PCR,M28/M29(7785-7808;8099-8124);M28/M29具有以下序列:M28:CGGTTCTT(AG)GGAGCAGC(ACT)GGAAGCA,M29:T(CT)T(ACGT)TCCCA(CT)T(AT)(CT)A(AGT)CCA(AGT)GTCAT;
SK68/SK69(Ou等人,科学,1988;239:295-297)。
-在gag:Amplicor Roche诊断系统;巢式gag引物(Loussert-Ajaka等人,柳叶刀(Lancet)1995346:912-913);SK38/SK39 (Ou等人,科学,1988;239:295-297)。
-在pol:A/NE1(Boucher等人,柳叶刀,1990;336:585-590);pol3/pol4(Laure等人,柳叶刀,1988;ii,538-541)。
只有应用H pol引物(4235/4538)进行的PCR是阳性的,随之应用引物4327/4481(Fransen等人,分子与细胞探针(Molecular andCellular Probes)1994;8:317-322)进行了巢式PCR。这一位于整合酶(260bp)中的H pol片段已被测序。由于病毒过量使之可以利用HPOL引物扩增。这是因为应用的DNA是从强阳性培养(逆转录酶>100,000cpm)结束时的细胞中提取的。对未经共同培养的新鲜细胞中提取的DNA进行扩增是不可能的,这是因为在HPOL引物(尤其是在3′区)与YBF30分离株的序列之间存在大量错配。该3′末端的保守对于Tag聚合酶的延伸活性是非常重要的。
1-pol基因序列:非常简并引物通过RT-PCR用于扩增从阳性培养上清中提取的RNA,产生了阳性扩增。这些引物是所有逆转录病毒共有的(Donehowert等人,病毒学方法杂志(J.Virol.Methods)1990;28:33-46),位于pol基因的逆转录酶区。测序后的片段分析使之有可能从YBF30分离株中产生一种特异性引物,即YRT2(SEQID No.32),并应用Hpol 4481引物(Fransen等人,1994,loc.cit.)作为反义引物,以扩增pol基因。通过按照所产生的每一片段所需来合成特异性引物,对片段进行测序(图1)。
2-env基因序列:第二个步骤是进行长片段PCR(XL-PCR,Perkin Elmer),以此扩增整个病毒(9000bp),应用的引物位于LTR:LPBS 1(SEQ ID No.22);LsiGi,随后应用YRT 2(SEQ IDNo.32)/SK69进行6000bp巢式PCR,并按照相同步骤测序整个包膜。gp41区的测序应用巢式PCR,使用引物SK 68/LSiGi。
3-gag基因序列:巢式PCR的应用是通过长片段PCR(LPBS1/LSiGi)进行的,使用的引物Gag5和Gag11i,按照需要由这种特异性引物产生,以便沿病毒基因组走行。
VII-测序结果
对YBF30毒株进行了全部测序(见序列表)。对1995年11月的YBF31毒株进行了部分测序,显著变异的缺乏证实了YBF30序列的正确性。
VIII-合成YBF30毒株V3环区的肽
研究V3环区的序列使之有可能合成相应的肽,并对YBF30毒株V3环区的氨基酸与其它M亚型和O毒株的氨基酸进行比较。
这些肽的序列是:
YBF30:SEQ ID No.58
CPZGAB SIV:CHRPGNNTRGEVQIGPGMTFYNIENVYGDT
RSAYC(SEQ ID No.59)
O族:CIRPGNRTYRNLQIGPGMTFYNVEIATGDIRKAFC
(ANT 70)(SEQ ID No.60)
M族:CTRPNNTRKSVRIGPGQAFYATGDIIGDIRQAHC
(SS-A型)(SEQ ID No.61)
从环的5′区的两个天冬酰胺开始合成肽,并根据以往描述的同一原理应用(见IV4),那就是供其与M族、O族和CPZGAB SIV的肽竞争。表IV显示的结果证实了这一毒株的原始本质以及这些株的可能传播,因为血清学结果支持在喀麦隆有YBF30型的感染。此外,一项对从喀麦隆选择的200份HIV-1阳性血清的研究,提供了一例新病例的证据,该病例显示的特征与YBF30相似。
表IV 200份血清的反应性研究
血清 起源 V3A V3cpz V3YBF3 CP CSP
0
953371 喀麦隆 1.66 0.38 1.39 0.39 1.64
956295 喀麦隆 1.72 0.37 1.16 0.51 1.73
967321 喀麦隆 0.07 0.17 0.5 0.05 0.27
Amandine GABSIV 1.74 0.14 1.48 0.19 1.74
NOA* ANTSIV 2.66 0.31 1.88 0.46 1.9
*来自CPZ ANT SIV的血清
血清953371和956295(与分离出YBF30毒株的病人相对应)与CPZ SIV肽的反应性在这一新试验中得到确证。与其自身V3抗原的反应性较低,在疾病的晚期是常见的。然而,这种反应性一直高于M肽引起的反应性。另一例喀麦隆病人(血清967321)显示出同样的肽反应性特征。
参考文献
*Barin F.等人,艾滋病研究与人类逆转录病毒,1996,12,13,1279-1289,抗体与代表HIV 1型基因型A至E共有序列的V3肽结合的多样性:用于HIV 1型血清学亚型分型的一种方法。
*Charneau P.,Borman AM.,Quillent C.,Guetard D.,ChamaretS.,Cohen J.,Remy G.,Montagnier L.,及F.Clavel,病毒学(Virology),1994,205,247-253,一种高度趋异的HIV-1分离株的分离及包膜序列:一种新的HIV-1族的定义。
*Descamps D.,Collin G.,Loussert-Ajaka I.,Saragosti S.,SimonF.及F.Brun-Vezinet.艾滋病(AIDS),1995,9,977-978,HIV-1O族对抗逆转录病毒药物的敏感性。
*Huet,T.,Cheynier R.,Meyerhans A.,Roelants,G.,及S.Wain-Hobson,自然(Nature),1990,345,356-359,一种与HIV-1相 关的黑猩猩慢病毒的遗传结构。
*Korber BTM.,MacInnes K.,Smith R.及G.Myers,病毒学杂志(J.Virol),1994,68-6730-6744,在HIV-1的不同遗传谱系中观察到的V3环蛋白质序列的突变趋势。
*Loussert-Ajaka I.,Ly TD.,Chaix Ml,Ingrand D.,Saragosti S.,Courouce AM.,Brun-Vezinet F.及F.Simon,柳叶刀(Lancet),1994,343,1393-1394,HIV-1亚型O感染病人中的HIV-1/HIV-2血清阴性。
*Loussert-Ajaka I.,Chairk.ML.,Korber B.,Letourneur F.,Gomans E.,Allen E.,Ly Td.,Brun-Vezinet F.,Simon F.及S.Saragosti,病毒学杂志,1995,69,5640-5649,从居信在法车的喀麦隆病人身上分离的HIV1型O族毒株的变异性。
*Murphy,E.,B.Korber,Georgs-Courbot,MC.,You B.,Pinter A.,cook D.,Loeml1/MP.,Georges A.,Mathiot C.,Barr*-Sinoussi F.,及M.Girard,艾滋病研究与人类逆转录病毒(AIDS Res。Hum.Retroviruses),1993,9997-1006,来自中非共和国的人类免疫缺陷病毒1型V3区序列的多样性。
*G.Myers,艾滋病研究与人类逆转录病毒,1994,10,11,1317-1324,艾滋病10周年评述。
*Nkengasong,J.N.,Janssens W.,Heyndrickx L.,Fransen K.,Ndumbe PM.,Motte J.,Leonaers A.,Ngolle M.,Ayuk J.,Poit P.,及G.Van der Groen,艾滋病1994,8,1405-1412,喀麦隆HIV-1的基因型亚型。
*Sharp P.M.,等人,艾滋病,1994,8,增刊1,S27-S42,人类免疫缺陷病毒的起源与多样性。
*Simon,F.,T.D.Ly,A.Baillou-Beaufils,V.Schneider-Fauveau,J.de Saint-Martin,I。L-Ajaka,M.L.Chaix,S.Saragosti,a.M.Courouc*,D.Ingrand,C.Janot,及F.Brun-Vezinet,艾滋病,1994,8,1628-1629,抗-HIV-1亚型O抗体筛查试剂盒的敏感性。
*Zekeng,L.,L.Guttlet,E.Afane Ze,A.Sam-Abbenyi,G,Mbouni,Essomba,E.Mpoudi-Noglle,M.Mouny-lobbe,J.B.Tapko,及L.Kaptue,艾滋病,1994,8,1626-1628,喀麦隆HIV-1亚型O的流行率:初步结果。
从上文可以看出,本发明绝不限于刚刚清楚地予以描述的实施方案;相反,它包括所有的、技术人员可能想到的变化,而不背离本发明的内容或范围。
序列表
(1)一般信息
(i)申请人:
(A)姓名:INSTUTUT NATIONAL DE LA SANTE ET DE LATRCHERCHE MEDICALE-INSERM
(B)街道:101rue DE TOLBIAC
(C)城市:巴黎
(E)国家:法国
(F)邮政编码(ZIP):75654CEDEX 13
(A)姓名:ASSISTANCE PUBLIQUE-HOPITAUS DE PARIS
(B)街道:3avenue Victoria
(C)城市:巴黎
(E)国家:法国
(F)邮政编码(ZIP):75100RP
(A)姓名:INSTITUT PASTEUR
(B)街道:28rue du Docteur Rous
(C)城市:巴黎
(E)国家:法国
(F)邮政编码(ZIP):75724Cedex 15
(A)姓名:MAUCLERE Philippe
(B)街道:2rue Buhan
(C)城市:Bordeaux
(E)国家:法国
(F)邮政编码(ZIP):33000
(A)姓名:LOUSSERT AJAKA Ibtissam
(B)街道:26avenue de la Republique
(C)城市:SARTROUVILLE
(E)国家:法国
(F)邮政编码(ZIP):78500
(A)姓名:STMON francois
(B)街道:8rue Germain Pilon
(C)城市:巴黎
(E)国家:法国
(F)邮政编码(ZIP):75018
(A)姓名:SARAGOSTI Sentob
(B)街道:69bis rue do Billancourt
(C)城市:BOULOGNE BILLANCOURT
(E)国家:法国
(F)邮政编码(ZIP):92100
(A)姓名:BARRE-SINOUSSL Francoise
(B)街道:104le capricorne,50rue d’Erevan
(C)城市:ISSY LES MOULINEAUX
(E)国家:法国
(F)邮政编码(ZIP):92130
(ii)发明名称:非-M、非-O HIV-1毒株、片段及应用
(iii)序列数目:61
(iv)计算机可读信息:
(A)介质类型:软盘
(B)计算机:IBM PC兼容机
(C)操作系统:PC-DOS/MS-DOS
(D)软件:PatentIn Release#1.0,Version#1.30(EPO)
(2)SEQ ID NO:1的信息:
(i)序列特征:
(A)长度:9183个碱基对
(B)类型:核酸
(C)链型:单链
(D)拓扑类型:线性
(ii)分子类型:DNA(基因组)
(xi)SEQ ID NO:1的序列描述:
CTTCTCGCTT GTACTGGGTC TCTCTTGCTG GACCAGATTA GAGCCTGGGA GCTCTCTGGC 60
TAGCAGGGAA CCCACTGCTT AAGCCTCAAT AAAGCTTGCC TTGAGTGCTA AAGTGGTGTG 120
TGCCCATCCA TTCGGTAACT CTGGTACCTA GAGATCCCTC AGACCATCTA GACTGAGTGA 180
AAAATCTCTA GCAGTGGCGC CCGAACAGGG ACTTGAAAAC GAAAGTAGAA CCGGAGGCTG 240
AATCTCTCGA CGCAGGACTC GGCTCGTTGG TGCACACAGC GAGAGGCGAG GCGGCGGAAG 300
TGTGAGTACG CAATTTTGAC TGGCGGTGGC CAGAAAGTAG GAGAGAGGAT GGGTGCGAGA 360
GCGTCAGTGT TAACAGGGGG AAAATTAGAT CAATGGGAAT CAATTTATTT GAGACCAGGG 420
GGAAAGAAAA AATACAGAAT GAAACATTTA GTATGGGCAA GCAGGGAGCT GGAAAGATTC 480
GCTTGTAACC CAGGTCTCAT GGACACAGCG GACGGCTGTG CCAAGTTACT AAATCAATTA 540
GAACCAGCTC TCAAGACAGG GTCAGAAGAA CTGCGCTCTT TATATAACGC TCTAGCAGTT 600
CTTTATTGTG TCCATAGTAG GATACAGATA CACAACACAC AGGAAGCTTT GGACAAGATA 660
AAAGAGAAAC AGGAACAGCA CAAGCCCGAG CCAAAAAACC CAGAAGCAGG GGCAGCGGCA 720
GCAACTGATA GCAATATCAG TAGGAATTAT CCTCTAGTCC AGACTGCTCA AGGACAAATG 780
GTACATCAGC CGCTGACACC CAGAACCTTA AATGCTTGGG TGAAAGTGAT AGAGGAGAAG 840
GCCTTTAGTC CAGAAGTAAT ACCAATGTTT ATGGCCTTGT CAGAAGGGGC AACGCCCTCA 900
GATCTAAATA CTATGTTAAA TACAGTAGGG GGACATCAGG CAGCAATGCA GATGCTGAAG 960
GAAGTCATCA ATGAGGAAGC AGCAGACTGG GATAGGACAC ATCCAGTCCC TGTGGGACCA 1020
CTACCCCCAG GGCAACTGAG AGACCCTAGA GGAAGTGATA TAGCAGGAAC AACTAGCACC 1080
CTGGCAGAAC AGGTGGCTTG GATGACTGCT AATCCTCCTG TTCCAGTAGG AGATATTTAT 1140
AGAAGATGGA TAGTCCTGGG GTTAAACAGA ATTGTGAGAA TGTATAGTCC TGTCAGCATT 1200
CTAGAGATCA AACAAGGACC AAAAGAACCC TTCAGAGACT ATGTAGACAG GTTCTACAAA 1260
ACTCTAAGAG CAGAGCAGGC AACACAGGAA GTAAAGAATT GGATGACAGA AACACTCTTA 1320
GTACAAAATG CAAACCCAGA TTGTAAACAG CTCCTAAAAG CATTAGGGCC AGGAGCTACC 1380
TTAGAAGAGA TGATGACGGC CTGCCAGGGA GTGGGGGGAC CAGCACATAA GGCAAGAGTG 1440
CTAGCAGAGG CTATGTCACA GGTGCAGCAG CCAACAACTA GTGTCTTTGC ACAAAGGGGA 1500
AACTTTAAAG GCATAAGGAA ACCCATTAAA TGTTTCAATT GTGGCAAAGA GGGCCATTTG 1560
GCAAGAAACT GTAAGGCCCC TAGAAGAGGA GGCTGTTGGA AGTGTGGGCA AGAAGGACAT 1620
CAAATGAAAG ATTGTAAAAA TGAAGGAAGA CAGGCTAATT TTTTAGGGAA GAGCTGGTCT 1680
CCCTTCAAAG GGAGACCAGG AAACTTCCCC CAGACAACAA CAAGGAAAGA GCCCACAGCC 1740
CCGCCACTAG AGAGTTATGG GTTTCAGGAG GAGAAGAGCA CACAGGGGAA GGAGATGCAG 1800
GAGAACCAGG AGAGGACAGA GAACTCTCTG TACCCACCTT TAACTTCCCT CAGATCACTC 1860
TTTGGCAACG ACCCGTCATC ACAGTAAAAA TAGGGAAAGA AGTAAGAGAA GCTCTTTTAG 1920
ATACAGGAGC TGATGATACA GTAATAGAAG AGCTACAATT AGAGGGAAAA TGGAAACCAA 1980
AAATGATAGG AGGAATTGGA GGATTTATCA AAGTGAGACA ATATGATAAT ATAACAGTAG 2040
ACATACAGGG AAGAAAAGCA GTTGGTACAG TATTAGTAGG ACCAACACCT GTTAATATTA 2100
TAGGAAGAAA TCTTTTAACC CAGATTGGCT GTACTTTAAA TTTTCCAATA AGTCCTATTG 2160
AAACTGTACC AGTAAAATTA AAACCAGGAA TGGATGGCCC AAAGGTAAAA CAATGGCCTT 2220
TGACAACAGA AAAAATAGAG GCATTAAGAG AAATTTGTAC AGAAATGGAA AAGGAAGGAA 2280
AAATTTCTAG AATAGGGCCT GAGAATCCAT ATAACACTCC AATTTTTGCT ATAAAAAAGA 2340
AAGATAGCAC TAAATGGAGA AAATTAGTAG ATTTCAGGGA ATTAAATAAA AGGACCCAAG 2400
ATTTTTGGGA AGTGCAGCTA GGAATTCCAC ATCCAGCAGG ATTAAAGCAG AAAAAATCAG 2460
TGACAGTTTT GGATGTAGGA GATGCTTATT TTTCATGTCC CTTGGACAAA GATTTTAGAA 2520
AGTATACAGC TTTTACCATA CCTAGTATAA ACAATGAGAC ACCTGGTATT AGATACCAGT 2580
ATAATGTGCT GCCACAAGGC TGGAAAGGGT CACCAGCAAT TTTTCAGAGT ACAATGACAA 2640
AAATTCTAGA ACCATTCAGA GAGAAACATC CAGAGATAAT CATTTACCAG TACATGGATG 2700
ACCTCTATGT GGGATCTGAC TTAGAACTAG CACAACATAG AGAGGCAGTA GAAGACCTTA 2760
GAGATCATCT TTTGAAGTGG GGCTTTACGA CCCCTGACAA AAAACATCAG AAGGAACCCC 2820
CGTTCCTCTG GATGGGATAT GAACTCCATC CAGACAAATG GACAGTCCAG CCAATAAAGT 2880
TACCAGAAAA GGATGTATGG ACTGTCAATG ATATACAGAA ATTAGTAGGA AAGTTAAATT 2940
GGGCAAGTCA GATCTATCCA GGAATCAGAG TAAAACAGCT CTGTAAATTA ATCAGAGGAA 3000
CCAAAGCTTT GACAGAAGTA GTCAACTTTA CAGAAGAAGC AGAATTAGAA CTAGCAGAAA 3060
ACAGGGAGAT ATTAAAAGAA CCCCTGCATG GAGTCTATTA TGACCCAGGA AAAGAATTAG 3120
TAGCAGAAAT TCAAAAGCAA GGACAAGGTC AGTGGACATA TCAGATTTAT CAGGAGTTAC 3180
ATAAAAATTT AAAAACAGGA AAGTATGCAA AAATGAGATC TGCCCATACT AATGATATAA 3240
AACAGTTAGT TGAAGTGGTA AGGAAAGTGG CAACAGAAAG TATAGTAATT TGGGGAAAGA 3300
CTCCTAAATT TAGATTACCA GTACAAAAGG AAGTGTGGGA GGCATGGTGG ACCGATCATT 3360
GGCAAGCAAC TTGGATTCCT GAGTGGGAAT TTGTCAACAC TCCTCCCCTT GTAAAATTAT 3420
GGTATCAGTT AGAAACAGAG CCAATCAGTG GGGCAGAAAC TTTCTATGTA GATGGAGCAG 3480
CTAATAGGGA AACAAAATTG GGAAAAGCAG GTTTTGTGAC AGATAGGGGA AGACAGAAAG 3540
TGGTCTCTAT TGCAGACACC ACCAATCAAA AGGCTGAGTT ACAAGCTATC CTTATGGCCT 3600
TACAAGAGTC AGGACGGGAT GTAAACATAG TCACTGACTC TCAGTATGCT ATGGGAATAA 3660
TTCATTCACA GCCAGATAAA AGTGAATCAG AATTGGTGAG CCAAATAATA GAAGAGCTCA 3720
TAAAAAAGGA AAGAGTTTAT CTCTCTTGGG TACCTGCACA TAAAGGTATT GGAGGAAATG 3780
AGCAGGTAGA CAAATTAGTT AGCTCAGGAA TTAGAAAAAT ATTATTCCTA GATGGTATAG 3840
AAAAAGCCCA AGAAGATCAT GACAGATATC ACAGCAATTG GAAAGCAATG GCCAGTGATT 3900
TTAACTTACC CCCCATAGTG GCAAAAGAAA TAGTAGCCAG CTGTGACAAA TGCCAGCTAA 3960
AAGGGGAAGC CATGCATGGA CAGGTCAATT GTAGTCCAGG AGTGTGGCAA TTAGATTGTA 4020
CACACTTAGA GGGAAAAATC ATCCTTGTGG CGGTCCATGT GGCCAGTGGC TACTTAGAAG 4080
CAGAAGTTAT TCCTGCAGAG ACAGGACAGG AAACAGCATA TTTTATTTTA AAGTTAGCTG 4140
GAAGATGGCC AGTAAAAGTT ATACACACTG ATAATGGATC CAATTTCACT AGTGCCACTG 4200
TAAAAGCAGC CTGTTGGTGG GCAAATATCA AACAGGAATT TGGGATACCC TACAATCCTC 4260
AAAGTCAGGG AGCAGTAGAG TCCATGAATA AAGAATTAAA GAAAATTATA GGACAAATCA 4320
GAGATCAAGC AGAACATCTA AAGACAGCAG TGCAAATGGC GGTTTTCATT CACAATTTTA 4380
AAAGAAAAGG GGGGATTGGG GGGTACACTG CAGGGGAAAG AATAATAGAC ATAATAGCAA 4440
CAGACATACA GACAACAAAT TTACAAACAC AAATTTTAAA AGTTCAAAAT TTTCGGGTTT 4500
ATTACAGAGA CAGCAGAGAT CCCATTTGGA AAGGACCAGC CAAACTTCTG TGGAAAGGAG 4560
AAGGGGCAGT GGTAATTCAA GATAACGGGG ATATAAAAGT AGTCCCACGT AGGAAAGCAA 4620
AAATAATTAG GGATTATGGA AAACAGATGG CAGGTGATGG TTGTGTGGCA AGTGGACAGG 4680
ATGAAAATCA GGAAATGGAA TAGCTTAGTA AAACATCATA TGTATGTGTC AAAAAAGGCA 4740
AAAGGATGGT ATTATAGACA TCATTATGAA ACACATCACC CAAAAATAAG TTCAGAAGTA 4800
CATATCCCAG TAGGTCAGGC AAGATTAGTG ACAGTCACTT ATTGGGGGCT AACAACAGGA 4860
GAACAGTCTT GGCATCTAGG ACATGGAGTA TCCATAGAAT GGAGACTAAG AAAATACAAG 4920
ACACAAGTTG ATCCTGAAAT GGCAGACAAG CTAATACATC TTCATTATTT TGATTGTTTT 4980
ACAGCCTCTG CCATAAGGCA AGCGGTCTTA GGGAGACCAG TATTACCTAG GTGTGAATAT 5040
CCAGCAGGGC ACAAACAGGT AGGCACCCTA CAATATCTAG CACTAACAGC CTGGGTGGGA 5100
GCAAAGAAGA GAAAGCCACC CTTACCTAGT GTGACTAAGC TAACAGAAGA TAGATGGAAC 5160
GAGCACCAGA AGATGCAGGG CCACAGAGGG AACCCTATAA TGAATGGGCA CTAGAATTAT 5220
TAGAAGAATT AAAAAATGAA GCTGTGCGCC ATTTTCCAAG GATTTGGCTA CATGGGTTAG 5280
GACAACACAT CTATAACACA TATGGAGACA CCTGGGAGGG GGTAGAGGCA ATTATCAGGA 5340
TACTACAACA ATTACTGTTT ATCCATTATA GGATTGGCTG CCAGCACAGC AGAATAGGGA 5400
TCACTCCTCA AAGGAGAAGG AATGGAACCA GTAGATCCTA GATTAGAGCC CTGGAATCAT 5460
CCAGGAAGCC AACCTAAAAC AGCTTGCAAT AATTGCTATT GTAAAAGATG TTGCTATCAC 5520
TGCTTATATT GCTTCACAAA GAAAGGCTTA GGCATCTCAT ATGGCAGGAA GAAGCGGAGT 5580
CAACGACGAA GAACTCCTCA GAGCAGTAAG AGTCATCAAG ATCTTATACC AGAGCAGTAA 5640
GTAAAACCTG TATATATGCT GTCATTGGGA TTCATAGCGT TAGGAGCAGC AGTTAGCATA 5700
GCAGTAATAG TCTGGGCATT ACTATATAGA GAATATAAGA AAATAAAATT GCAGGAAAAA 5760
ATAAAACACA TAAGACAGAG AATAAGAGAA AGAGAAGAAG ATAGTGGCAA TGAAAGTGAT 5820
GGGGATGCAG AGTGGTTGGA TGGGGATGAA GAGTGGTTGG TTACTCTTCT ATCTTCTAGT 5880
AAGCTTGATC AAGGTAATTG GGTCTGAACA ACATTGGGTA ACAGTGTACT ATGGGGTACC 5940
AGTATGGAGA GAAGCAGAGA CAACTCTTTT CTGTGCTTCA GATGCTAAAG CCCATAGTAC 6000
AGAGGCTCAC AACATCTGGG CCACACAAGC ATGTGTTCCT ACTGATCCCA ATCCACAAGA 6060
AGTGCTATTA CCCAATGTAA CTGAAAAATT TAATATGTGG GAAAATAAAA TGGCAGACCA 6120
AATGCAAGAG GATATTATCA GTCTGTGGGA ACAGAGCTTA AAGCCCTGTG TTAAATTAAC 6180
CCCATTATGT GTAACTATGC TTTGTAACGA TAGCTATGGG GAGGAAAGGA ACAATACAAA 6240
TATGACAACA AGAGAACCAG ACATAGGATA CAAACAAATG AAAAATTGCT CATTCAATGC 6300
AACCACTGAG CTAACAGATA AAAAGAAGCA AGTTTACTCT CTGTTTTATG TAGAAGATGT 6360
AGTACCAATC AATGCCTATA ATAAAACATA TAGGCTAATA AATTGTAATA CCACAGCTGT 6420
GACACAAGCT TGTCCTAAGA CTTCCTTTGA GCCAATTCCA ATACATTACT GTGCACCACC 6480
AGGCTTTGCC ATTATGAAAT GTAATGAAGG AAACTTTAGT GGAAATGGAA GCTGTACAAA 6540
TGTGAGTACT GTACAATGCA CACATGGAAT AAAGCCAGTG ATATCCACTC AGTTAATCCT 6600
AAATGGAAGC TTAAATACAG ATGGAATTGT TATTAGAAAT GATAGTCACA GTAATCTGTT 6660
GGTGCAATGG AATGAGACAG TGCCAATAAA TTGTACAAGG CCAGGAAATA ATACAGGAGG 6720
ACAGGTGCAG ATAGGACCTG CTATGACATT TTATAACATA GAAAAAATAG TAGGAGACAT 6780
TAGACAAGCA TACTGTAATG TCTCTAAAGA ACTATGGGAA CCAATGTGGA ATAGAACAAG 6840
AGAGGAAATA AAGAAAATCC TGGGGAAAAA CAACATAACC TTCAGGGCTC GAGAGAGGAA 6900
TGAAGGAGAC CTAGAAGTGA CACACTTAAT GTTCAATTGT AGAGGAGAGT TTTTCTATTG 6960
TAACACTTCC AAATTATTTA ATGAGGAATT ACTTAACGAG ACAGGTGAGC CTATTACTCT 7020
GCCTTGTAGA ATAAGACAGA TTGTAAATTT GTGGACAAGG GTAGGAAAAG GAATTTATGC 7080
ACCACCAATT CGGGGAGTTC TTAACTGTAC CTCCAATATT ACTGGACTGG TTCTAGAATA 7140
TAGTGGTGGG CCTGACACCA AGGAAACAAT AGTATATCCC TCAGGAGGAA ACATGGTTAA 7200
TCTCTGGAGA CAAGAGTTGT ATAAGTACAA AGTAGTTAGC ATAGAACCCA TAGGAGTAGC 7260
ACCAGGTAAA GCTAAAAGAC GCACAGTGAG TAGAGAAAAA AGAGCAGCCT TTGGACTAGG 7320
TGCGCTGTTT CTTGGGTTTC TTGGAGCAGC AGGGAGCACT ATGGGCGCAG CGTCAATAAC 7380
GCTGACGGTA CAGGCCCGGA CATTATTATC TGGGATAGTG CAACAGCAGA ATATTCTGTT 7440
GAGAGCAATA GAGGCGCAAC AACATTTGTT GCAACTCTCA ATCTGGGGCA TTAAACAGCT 7500
CCAGGCAAAA GTCCTTGCTA TAGAAAGATA CCTTAGGGAT CAGCAAATCC TAAGTCTATG 7560
GGGCTGCTCA GGAAAAACAA TATGCTATAC CACTGTGCCT TGGAATGAGA CTTGGAGCAA 7620
CAATACCTCT TATGATACAA TCTGGAATAA TTTAACCTGG CAACAATGGG ATGAGAAAGT 7680
AAGAAACTAT TCAGGTGTCA TTTTTGGACT TATAGAACAG GCACAAGAAC AACAGAACAC 7740
AAATGAGAAA TCACTCTTGG AATTGGATCA ATGGGACAGT CTGTGGAGCT GGTTTGGTAT 7800
TACAAAATGG CTGTGGTATA TAAAAATAGC TATAATGATA GTAGCAGGCA TTGTAGGCAT 7860
AAGAATCATA AGTATAGTAA TAACTATAAT AGCAAGAGTT AGGCAGGGAT ATTCTCCCCT 7920
TTCGTTGCAG ACCCTTATCC CAACAGCAAG GGGACCAGAC AGGCCAGAAG AAACAGAAGG 7980
AGGCGTTGGA GAGCAAGACA GAGGCAGATC CGTGCGATTA GTGAGCGGAT TCTCAGCTCT 8040
TGTCTGGGAG GACCTCCGGA ACCTGTTGAT CTTCCTCTAC CACCGCTTGA CAGACTCACT 8100
CTTGATACTG AGGAGGACTC TGGAACTCCT GGGACAGAGT CTCAGCAGGG GACTGCAACT 8160
ACTGAATGAA CTCAGAACAC ACTTGTGGGG AATACTTGCA TATTGGGGAA AAGAGTTAAG 8220
GGATAGTGCT ATCAGCTTGC TTAATACAAC AGCTATTGTA GTAGCAGAAG GAACAGATAG 8280
GATTATAGAA TTAGCACAAA GAATAGGAAG GGGAATATTA CACATACCTA GAAGAATCAG 8340
ACAAGGCCTA GAAAGAGCAC TGATATAAGA TGGGAAAGAT TTGGTCAAAG AGCAGCCTAG 8400
TAGGATGGCC AGAAATCAGA GAAAGAATGA GAAGACAAAC GCAAGAACCA GCAGTAGAGC 8460
CAGCAGTAGG AGCAGGAGCA GCTTCTCAAG ATCTAGCTAA TCGAGGGGCC ATCACCATAA 8520
GAAATACTAG AGACAATAAT GAAAGTATAG CTTGGCTAGA AGCACAAGAA GAAGAAGAGG 8580
AAGTAGGCTT TCCAGTACGC CCTCAGGTAC CATTAAGGCC AATAACCTAT AAACAGGCTT 8640
TTGATCTTTC CTTCTTTTTA AAAGATAAGG GGGGACTGGA AGGGCTAGTT TGGTCCAGAA 8700
AAAGGCAAGA TATTCTAGAC CTCTGGATGT ATCACACACA AGGCATCCTC CCTGACTGGC 8760
ATAACTACAC ACCAGGGCCA GGAATTAGAT ACCCCGTAAC CTTTGGATGG TGCTTCAAAC 8820
TAGTACCATT GTCAGCTGAA GAAGTAGAAG AGGCTAATGA AGGAGACAAC AATGCCCTCT 8880
TACACCCCAT ATGTCAACAT GGAGCAGATG ATGATCATAA AGAAGTGTTG GTGTGGCGAT 8940
TTGACAGCTC CCTAGCAAGA AGACATGTAG CAAGAGAGCT GCATCCGGAG TTTTACAAGA 9000
ACTGCTGACA AGGGACTTTA CTGCTGACAA GGGACTTTAT ACTTGGGGAC TTTCCGCCAG 9060
GGACTTTCCA GGGAGGTGTG GTTGGGGGAG TGGCTTGCCC TCAGAGCTGC ATAAAAGCAG 9120
CCGCTTCTCG CTTGTACTGG GTCTCTCTTG CTGGACCAGA TTAGAGTCTG GGAGCATATT 9180
GGG 9183
(2)SEQ ID NO:2的信息:
(i)序列特征:
(A)长度:813个碱基对
(B)类型:核酸
(C)链型:单链
(D)拓扑类型:线性
(ii)分子类型:DNA(基因组)
(xi)SEQ ID NO:2的序列描述:
TTGGAAGGGC TAGTTTGGTC CAGAAAAAGG CAAGATATTC TAGACCTCTG GATGTATCAC 60
ACACAAGGCA TCCTCCCTGA CTGGCATAAC TACACACCAG GGCCAGGAAT TAGATACCCC 120
GTAACCTTTG GATGGTGCTT CAAACTAGTA CCATTGTCAG CTGAAGAAGT AGAAGAGGCT 180
AATGAAGGAG ACAACAATGC CCTCTTACAC CCCATATGTC AACATGGAGC AGATGATGAT 240
CATAAAGAAG TGTTGGTGTG GCGATTTGAC AGCTCCCTAG CAAGAAGACA TGTAGCAAGA 300
GAGCTGCATC CGGAGTTTTA CAAGAACTGC TGACAAGGGA CTTTACTGCT GACAAGGGAC 360
TTTATACTTG GGGACTTTCC GCCAGGGACT TTCCAGGGAG GTGTGGTTGG GGGAGTGGCT 420
TGCCCTCAGA GCTGCATAAA AGCAGCCGCT TCTCGCTTGT ACTGGGTCTC TCTTGCTGGA 480
CTATACAGAT TAGAGCCTGG GAGCTCTCTG GCTAGCAGGG AACCCACTGC TTAAGCCTCA 540
ATAAATACAG CTTGCCTTGA GTGCTAAAGT GGTGTGTGCC CATCCATTCG GTAACTCTGG 600
TACCTAGAGA ATCCCTCAGA CCATCTAGAC TGAGTGAAAA ATCTCTAGCA GTGGCGCCCG 660
AACAGGGACT TAGTTGAAAA CGAAAGTAGA ACCGGAGGCT GAATCTCTCG ACGCAGGACT 720
CGGCTCGTTG GTGCACACAG CGAGAGGCGA GGCGGCGGAA GTGTGAGTAC GCAATTTTGA 780
CTGGCGGTGG CCAGAAAGTA GGAGAGAGGG AGG 813
(2)SEQ ID NO:3的信息:
(i)序列特征:
(A)长度:1539个碱基对
(B)类型:核酸
(C)链型:单链
(D)拓扑类型:线性
(ii)分子类型:DNA(基因组)
(ix)特征
(A)名称/关键词:CDS
(B)位置:1..1536
(xi)SEQ ID NO:3的序列描述:
ATG GGT GCG AGA GCG TCA GTG TTA ACA GGG GGA AAA TTA GAT CAA TGG 48
Met Gly Ala Arg Ala Ser Val Leu Thr Gly Gly Lys Leu Asp Gln Trp
1 5 10 15
GAA TCA ATT TAT TTG AGA CCA GGG GGA AAG AAA AAA TAC AGA ATG AAA 96
Glu Ser Ile Tyr Leu Arg Pro Gly Gly Lys Lys Lys Tyr Arg Met Lys
20 25 30
CAT TTA GTA TGG GCA AGC AGG GAG CTG GAA AGA TTC GCT TGT AAC CCA 144
His Leu Val Trp Ala Ser Arg Glu Leu Glu Arg Phe Ala Cys Asn Pro
35 40 45
GGT CTC ATG GAC ACA GCG GAC GGC TGT GCC AAG TTA CTA AAT CAA TTA 192
Gly Leu Met Asp Thr Ala Asp Gly Cys Ala Lys Leu Leu Asn Gln Leu
50 55 60
GAA CCA GCT CTC AAG ACA GGG TCA GAA GAA CTG CGC TCT TTA TAT AAC 240
Glu Pro Ala Leu Lys Thr Gly Ser Glu Glu Leu Arg Ser Leu Tyr Asn
65 70 75 80
GCT CTA GCA GTT CTT TAT TGT GTC CAT AGT AGG ATA CAG ATA CAC AAC 288
Ala Leu Ala Val Leu Tyr Cys Val His Ser Arg Ile GlnIle His Asn
85 90 95
ACA CAG GAA GCT TTG GAC AAG ATA AAA GAG AAA CAG GAA CAG CAC AAG 336
Thr Gln Glu Ala Leu Asp Lys Ile Lys Glu Lys Gln Glu Gln His Lys
100 105 110
CCC GAG CCA AAA AAC CCA GAA GCA GGG GCA GCG GCA GCA ACT GAT AGC 384
Pro Glu Pro Lys Asn Pro Glu Ala Gly Ala Ala Ala Ala Thr Asp Ser
115 120 125
AAT ATC AGT AGG AAT TAT CCT CTA GTC CAG ACT GCT CAA GGA CAA ATG 432
Asn Ile Ser Arg Asn Tyr Pro Leu Val Gln Thr Ala Gln Gly Gln Met
130 135 140
GTA CAT CAG CCG CTG ACA CCC AGA ACC TTA AAT GCT TGG GTG AAA GTG 480
Val His Gln Pro Leu Thr Pro Arg Thr Leu Asn Ala Trp Val Lys Val
145 150 155 160
ATA GAG GAG AAG GCC TTT AGT CCA GAA GTA ATA CCA ATG TTT ATG GCC 528
Ile Glu Glu Lys Ala Phe Ser Pro Glu Val Ile Pro Met Phe Met Ala
165 170 175
TTG TCA GAA GGG GCA ACG CCC TCA GAT CTA AAT ACT ATG TTA AAT ACA 576
Leu Ser Glu Gly Ala Thr Pro Ser Asp Leu Asn Thr Met Leu Asn Thr
180 185 190
GTA GGG GGA CAT CAG GCA GCA ATG CAG ATG CTG AAG GAA GTC ATC AAT 624
Val Gly Gly His Gln Ala Ala Met Gln Met Leu Lys Glu Val Ile Asn
195 200 205
GAG GAA GCA GCA GAC TGG GAT AGG ACA CAT CCA GTC CCT GTG GGA CCA 672
Glu Glu Ala Ala Asp Trp Asp Arg Thr His Pro Val Pro Val Gly Pro
210 215 220
CTA CCC CCA GGG CAA CTG AGA GAC CCT AGA GGA AGT GAT ATA GCA GGA 720
Leu Pro Pro Gly Gln Leu Arg Asp Pro Arg Gly Ser Asp Ile Ala Gly
225 230 235 240
ACA ACT AGC ACC CTG GCA GAA CAG GTG GCT TGG ATG ACT GCT AAT CCT 768
Thr Thr Ser Thr Leu Ala Glu Gln Val Ala Trp Met Thr Ala Asn Pro
245 250 255
CCT GTT CCA GTA GGA GAT ATT TAT AGA AGA TGG ATA GTC CTG GGG TTA 816
Pro Val Pro Val Gly Asp Ile Tyr Arg Arg Trp Ile Val Leu Gly Leu
260 265 270
AAC AGA ATT GTG AGA ATG TAT AGT CCT GTC AGC ATT CTA GAG ATC AAA 864
Asn Arg Ile Val Arg Met Tyr Ser Pro Val Ser Ile Leu Glu Ile Lys
275 280 285
CAA GGA CCA AAA GAA CCC TTC AGA GAC TAT GTA GAC AGG TTC TAC AAA 912
Gln Gly Pro Lys Glu Pro Phe Arg Asp Tyr Val Asp Arg Phe Tyr Lys
290 295 300
ACT CTA AGA GCA GAG CAG GCA ACA CAG GAA GTA AAG AAT TGG ATG ACA 960
Thr Leu Arg Ala Glu Gln Ala Thr Gln Glu Val Lys Asn Trp Met Thr
305 310 315 320
GAA ACA CTC TTA GTA CAA AAT GCA AAC CCA GAT TGT AAA CAG CTC CTA 1008
Glu Thr Leu Leu Val Gln Asn Ala Asn Pro Asp Cys Lys Gln Leu Leu
325 330 335
AAA GCA TTA GGG CCA GGA GCT ACC TTA GAA GAG ATG ATG ACG GCC TGC 1056
Lys Ala Leu Gly Pro Gly Ala Thr Leu Glu Glu Met Met Thr Ala Cys
340 345 350
CAG GGA GTG GGG GGA CCA GCA CAT AAG GCA AGA GTG CTA GCA GAG GCT 1104
Gln Gly Val Gly Gly Pro Ala His Lys Ala Arg Val Leu Ala Glu Ala
355 360 365
ATG TCA CAG GTG CAG CAG CCA ACA ACT AGT GTC TTT GCA CAA AGG GGA 1152
Met Ser Gln Val Gln Gln Pro Thr Thr Ser Val Phe Ala Gln Arg Gly
370 375 380
AAC TTT AAA GGC ATA AGG AAA CCC ATT AAA TGT TTC AAT TGT GGC AAA 1200
Asn Phe Lys Gly Ile Arg Lys Pro Ile Lys Cys Phe Asn Cys Gly Lys
385 390 395 400
GAG GGC CAT TTG GCA AGA AAC TGT AAG GCC CCT AGA AGA GGA GGC TGT 1248
Glu Gly His Leu Ala Arg Asn Cys Lys Ala Pro Arg Arg Gly Gly Cys
405 410 415
TGG AAG TGT GGG CAA GAA GGA CAT CAA ATG AAA GAT TGT AAA AAT GAA 1296
Trp Lys Cys Gly Gln Glu Gly His Gln Met Lys Asp Cys Lys Asn Glu
420 425 430
GGA AGA CAG GCT AAT TTT TTA GGG AAG AGC TGG TCT CCC TTC AAA GGG 1344
Gly Arg Gln Ala Asn Phe Leu Gly Lys Ser Trp Ser Pro Phe Lys Gly
435 440 445
AGA CCA GGA AAC TTC CCC CAG ACA ACA ACA AGG AAA GAG CCC ACA GCC 1392
Arg Pro Gly Asn Phe Pro Gln Thr Thr Thr Arg Lys Glu Pro Thr Ala
450 455 460
CCG CCA CTA GAG AGT TAT GGG TTT CAG GAG GAG AAG AGC ACA CAG GGG 1440
Pro Pro Leu Glu Ser Tyr Gly Phe Gln Glu Glu Lys Ser Thr Gln Gly
465 470 475 480
AAG GAG ATG CAG GAG AAC CAG GAG AGG ACA GAG AAC TCT CTG TAC CCA 1488
Lys Glu Met Gln Glu Asn Gln Glu Arg Thr Glu Asn Ser Leu Tyr Pro
485 490 495
CCT TTA ACT TCC CTC AGA TCA CTC TTT GGC AAC GAC CCG TCA TCA CAG 1536
Pro Leu Thr Ser Leu Arg Ser Leu Phe Gly Asn Asp Pro Ser Ser Gln
500 505 510
TAA 1539
(2)SEQ ID NO:4的信息:
(i)序列特征:
(A)长度:512个氨基酸
(B)类型:氨基酸
(D)拓扑类型:线性
(ii)分子类型:蛋白质
(xi)SEQ ID NO:4的序列描述:
Met Gly Ala Arg Ala Ser Val Leu Thr Gly Gly Lys Leu Asp Gln Trp
1 5 10 15
Glu Ser Ile Tyr Leu Arg Pro Gly Gly Lys Lys Lys Tyr Arg Met Lys
20 25 30
His Leu Val Trp Ala Ser Arg Glu Leu Glu Arg Phe Ala Cys Asn Pro
35 40 45
Gly Leu Met Asp Thr Ala Asp Gly Cys Ala Lys Leu Leu Asn Gln Leu
50 55 60
Glu Pro Ala Leu Lys Thr Gly Ser Glu Glu Leu Arg Ser Leu Tyr Asn
65 70 75 80
Ala Leu Ala Val Leu Tyr Cys Val His Ser Arg Ile Gln Ile His Asn
85 90 95
Thr Gln Glu Ala Leu Asp Lys Ile Lys Glu Lys Gln Glu Gln His Lys
100 105 110
Pro Glu Pro Lys Asn Pro Glu Ala Gly Ala Ala Ala Ala Thr Asp Ser
115 120 125
Asn Ile Ser Arg Asn Tyr Pro Leu Val Gln Thr Ala Gln Gly Gln Met
130 135 140
Val His Gln Pro Leu Thr Pro Arg Thr Leu Asn Ala Trp Val Lys Val
145 150 155 160
Ile Glu Glu Lys Ala Phe Ser Pro Glu Val Ile Pro Met Phe Met Ala
165 170 175
Leu Ser Glu Gly Ala Thr Pro Ser Asp Leu Asn Thr Met Leu Asn Thr
180 185 190
Val Gly Gly His Gln Ala Ala Met Gln Met Leu Lys Glu Val Ile Asn
195 200 205
Glu Glu Ala Ala Asp Trp Asp Arg Thr His Pro Val Pro Val Gly Pro
210 215 220
Leu Pro Pro Gly Gln Leu Arg Asp Pro Arg Gly Ser Asp Ile Ala Gly
225 230 235 240
Thr Thr Ser Thr Leu Ala Glu Gln Val Ala Trp Met Thr Ala Asn Pro
245 250 255
Pro Val Pro Val Gly Asp Ile Tyr Arg Arg Trp Ile Val Leu Gly Leu
260 265 270
Asn Arg Ile Val Arg Met Tyr Ser Pro Val Ser Ile Leu Glu Ile Lys
275 280 285
Gln Gly Pro Lys Glu Pro Phe Arg Asp Tyr Val Asp Arg Phe Tyr Lys
290 295 300
Thr Leu Arg Ala Glu Gln Ala Thr Gln Glu Val Lys Asn Trp Met Thr
305 310 315 320
Glu Thr Leu Leu Val Gln Asn Ala Asn Pro Asp Cys Lys Gln Leu Leu
325 330 335
Lys Ala Leu Gly Pro Gly Ala Thr Leu Glu Glu Met Met Thr Ala Cys
340 345 350
Gln Gly Val Gly Gly Pro Ala His Lys Ala Arg Val Leu Ala Glu Ala
355 360 365
Met Ser Gln Val Gln Gln Pro Thr Thr Ser Val Phe Ala Gln Arg Gly
370 375 380
Asn Phe Lys Gly Ile Arg Lys Pro Ile Lys Cys Phe Asn Cys Gly Lys
385 390 395 400
Glu Gly His Leu Ala Arg Asn Cys Lys Ala Pro Arg Arg Gly Gly Cys
405 410 415
Trp Lys Cys Gly Gln Glu Gly His Gln Met Lys Asp Cys Lys Asn Glu
420 425 430
Gly Arg Gln Ala Asn Phe Leu Gly Lys Ser Trp Ser Pro Phe Lys Gly
435 440 445
Arg Pro Gly Asn Phe Pro Gln Thr Thr Thr Arg Lys Glu Pro Thr Ala
450 455 460
Pro Pro Leu Glu Ser Tyr Gly Phe Gln Glu Glu Lys Ser Thr Gln Gly
465 470 475 480
Lys Glu Met Gln Glu Asn Gln Glu Arg Thr Glu Asn Ser Leu Tyr Pro
485 490 495
Pro Leu Thr Ser Leu Arg Ser Leu Phe Gly Asn Asp Pro Ser Ser Gln
500 505 510
(2)SEQ ID NO:5的信息:
(i)序列特征:
(A)长度:3045个碱基对
(B)类型:核酸
(C)链型:单链
(D)拓扑类型:线性
(ii)分子类型:DNA(基因组)
(ix)特征
(A)名称/关键词:CDS
(B)位置:1..3042
(xi)SEQ ID NO:5的序列描述:
TTT TTT AGG GAA GAG CTG GTC TCC CTT CAA AGG GAG ACC AGG AAA CTT 48
Phe Phe Arg Glu Glu Leu Val Ser Leu Gln Arg Glu Thr Arg Lys Leu
515 520 525
CCC CCA GAC AAC AAC AAG GAA AGA GCC CAC AGC CCC GCC ACT AGA GAG 96
Pro Pro Asp Asn Asn Lys Glu Arg Ala His Ser Pro Ala Thr Arg Glu
530 535 540
TTA TGG GTT TCA GGA GGA GAA GAG CAC ACA GGG GAA GGA GAT GCA GGA 144
Leu Trp Val Ser Gly Gly Glu Glu His Thr Gly Glu Gly Asp Ala Gly
545 550 555 560
GAA CCA GGA GAG GAC AGA GAA CTC TCT GTA CCC ACC TTT AAC TTC CCT 192
Glu Pro Gly Glu Asp Arg Glu Leu Ser Val Pro Thr Phe Asn Phe Pro
565 570 575
CAG ATC ACT CTT TGG CAA CGA CCC GTC ATC ACA GTA AAA ATA GGG AAA 240
Gln Ile Thr Leu Trp Gln Arg Pro Val Ile Thr Val Lys Ile Gly Lys
580 585 590
GAA GTA AGA GAA GCT CTT TTA GAT ACA GGA GCT GAT GAT ACA GTA ATA 288
Glu Val Arg Glu Ala Leu Leu Asp Thr Gly Ala Asp Asp Thr Val Ile
595 600 605
GAA GAG CTA CAA TTA GAG GGA AAA TGG AAA CCA AAA ATG ATA GGA GGA 336
Glu Glu Leu Gln Leu Glu Gly Lys Trp Lys Pro Lys Met Ile Gly Gly
610 615 620
ATT GGA GGA TTT ATC AAA GTG AGA CAA TAT GAT AAT ATA ACA GTA GAC 384
Ile Gly Gly Phe Ile Lys Val Arg Gln Tyr Asp Asn Ile Thr Val Asp
625 630 635 640
ATA CAG GGA AGA AAA GCA GTT GGT ACA GTA TTA GTA GGA CCA ACA CCT 432
Ile Gln Gly Arg Lys Ala Val Gly Thr Val Leu Val Gly Pro Thr Pro
645 650 655
GTT AAT ATT ATA GGA AGA AAT CTT TTA ACC CAG ATT GGC TGT ACT TTA 480
Val Asn Ile Ile Gly Arg Asn Leu Leu Thr Gln Ile Gly Cys Thr Leu
660 665 670
AAT TTT CCA ATA AGT CCT ATT GAA ACT GTA CCA GTA AAA TTA AAA CCA 528
Asn Phe Pro Ile Ser Pro Ile Glu Thr Val Pro Val Lys Leu Lys Pro
675 680 685
GGA ATG GAT GGC CCA AAG GTA AAA CAA TGG CCT TTG ACA ACA GAA AAA 576
Gly Met Asp Gly Pro Lys Val Lys Gln Trp Pro Leu Thr Thr Glu Lys
690 695 700
ATA GAG GCA TTA AGA GAA ATT TGT ACA GAA ATG GAA AAG GAA GGA AAA 624
Ile Glu Ala Leu Arg Glu Ile Cys Thr Glu Met Glu Lys Glu Gly Lys
705 710 715 720
ATT TCT AGA ATA GGG CCT GAG AAT CCA TAT AAC ACT CCA ATT TTT GCT 672
Ile Ser Arg Ile Gly Pro Glu Asn Pro Tyr Asn Thr Pro Ile Phe Ala
725 730 735
ATA AAA AAG AAA GAT AGC ACT AAA TGG AGA AAA TTA GTA GAT TTC AGG 720
Ile Lys Lys Lys Asp Ser Thr Lys Trp Arg Lys Leu Val Asp Phe Arg
740 745 750
GAA TTA AAT AAA AGG ACC CAA GAT TTT TGG GAA GTG CAG CTA GGA ATT 768
Glu Leu Asn Lys Arg Thr Gln Asp Phe Trp Glu Val Gln Leu Gly Ile
755 760 765
CCA CAT CCA GCA GGA TTA AAG CAG AAA AAA TCA GTG ACA GTT TTG GAT 816
Pro His Pro Ala Gly Leu Lys Gln Lys Lys Ser Val Thr Val Leu Asp
770 775 780
GTA GGA GAT GCT TAT TTT TCA TGT CCC TTG GAC AAA GAT TTT AGA AAG 864
Val Gly Asp Ala Tyr Phe Ser Cys Pro Leu Asp Lys Asp Phe Arg Lys
785 790 795 800
TAT ACA GCT TTT ACC ATA CCT AGT ATA AAC AAT GAG ACA CCT GGT ATT 912
Tyr Thr Ala Phe Thr Ile Pro Ser Ile Asn Asn Glu Thr Pro Gly Ile
805 810 815
AGA TAC CAG TAT AAT GTG CTG CCA CAA GGC TGG AAA GGG TCA CCA GCA 960
Arg Tyr Gln Tyr Asn Val Leu Pro Gln Gly Trp Lys Gly Ser Pro Ala
820 825 830
ATT TTT CAG AGT ACA ATG ACA AAA ATT CTA GAA CCA TTC AGA GAG AAA 1008
Ile Phe Gln Ser Thr Met Thr Lys Ile Leu Glu Pro Phe Arg Glu Lys
835 840 845
CAT CCA GAG ATA ATC ATT TAC CAG TAC ATG GAT GAC CTC TAT GTG GGA 1056
His Pro Glu Ile Ile Ile Tyr Gln Tyr Met Asp Asp Leu Tyr Val Gly
850 855 860
TCT GAC TTA GAA CTA GCA CAA CAT AGA GAG GCA GTA GAA GAC CTC AGA 1104
Ser Asp Leu Glu Leu Ala Gln His Arg Glu Ala Val Glu Asp Leu Arg
865 870 875 880
GAT CAT CTT TTG AAG TGG GGC TTT ACG ACC CCT GAC AAA AAA CAT CAG 1152
Asp His Leu Leu Lys Trp Gly Phe Thr Thr Pro Asp Lys Lys His Gln
885 890 895
AAG GAG CCC CCG TTC CTC TGG ATG GGA TAT GAA CTC CAT CCA GAC AAA 1200
Lys Glu Pro Pro Phe Leu Trp Met Gly Tyr Glu Leu His Pro Asp Lys
900 905 910
TGG ACA GTC CAG CCA ATA AAG TTA CCA GAA AAG GAT GTA TGG ACT GTC 1248
Trp Thr Val Gln Pro Ile Lys Leu Pro Glu Lys Asp Val Trp Thr Val
915 920 925
AAT GAT ATA CAG AAA TTA GTA GGA AAG TTA AAT TGG GCA AGT CAG ATC 1296
Asn Asp Ile Gln Lys Leu Val Gly Lys Leu Asn Trp Ala Ser Gln Ile
930 935 940
TAT CCA GGA ATC AGA GTA AAA CAG CTC TGT AAA TTA ATC AGA GGA GCC 1344
Tyr Pro Gly Ile Arg Val Lys Gln Leu Cys Lys Leu Ile Arg Gly Ala
945 950 955 960
AGA GCT TTG ACA GAA GTA GTC AAC TTT ACA GAA GAA GCA GAA TTA GAA 1392
Arg Ala Leu Thr Glu Val Val Asn Phe Thr Glu Glu Ala Glu Leu Glu
965 970 975
CTA GCA GAA AAC AGG GAG ATA TTA AAA GAA CCC CTG CAT GGA GTC TAT 1440
Leu Ala Glu Asn Arg Glu Ile Leu Lys Glu Pro Leu His Gly Val Tyr
980 985 990
TAT GAC CCA GGA AAA GAA TTA GTA GCA GAA ATT CAA AAG CAA GGA CAA 1488
Tyr Asp Pro Gly Lys Glu Leu Val Ala Glu Ile Gln Lys Gln Gly Gln
995 1000 1005
GGT CAG TGG ACA TAT CAG ATT TAT CAG GAG TTA CAT AAA AAT TTA AAA 1536
Gly Gln Trp Thr Tyr Gln Ile Tyr Gln Glu Leu His Lys Asn Leu Lys
1010 1015 1020
ACA GGA AAG TAT GCA AAA ATG AGA TCT GCC CAT ACT AAT GAT ATA AAA 1584
Thr Gly Lys Tyr Ala Lys Met Arg Ser Ala His Thr Asn Asp Ile Lys
1025 1030 1035 1040
CAG TTA GTT GAA GTG GTA AGG AAA GTG GCA ACA GAA AGT ATA GTA ATT 1632
Gln Leu Val Glu Val Val Arg Lys Val Ala Thr Glu Ser Ile Val Ile
1045 1050 1055
TGG GGA AAG ACT CCT AAA TTT AGA TTA CCA GTA CAA AAG GAA GTG TGG 1680
Trp Gly Lys Thr Pro Lys Phe Arg Leu Pro Val Gln Lys Glu Val Trp
1060 1065 1070
GAG GCA TGG TGG ACC GAT CAT TGG CAA GCA ACT TGG ATT CCT GAG TGG 1728
Glu Ala Trp Trp Thr Asp His Trp Gln Ala Thr Trp Ile Pro Glu Trp
1075 1080 1085
GAA TTT GTC AAC ACT CCT CCC CTT GTA AAA TTA TGG TAT CAG TTA GAA 1776
Glu Phe Val Asn Thr Pro Pro Leu Val Lys Leu Trp Tyr Gln Leu Glu
1090 1095 1100
ACA GAG CCA ATC AGT GGG GCA GAA ACT TTC TAT GTA GAT GGA GCA GCT 1824
Thr Glu Pro Ile Ser Gly Ala Glu Thr Phe Tyr Val Asp Gly Ala Ala
1105 1110 1115 1120
AAT AGG GAA ACA AAA TTG GGA AAA GCA GGT TTT GTG ACA GAT AGG GGA 1872
Asn Arg Glu Thr Lys Leu Gly Lys Ala Gly Phe Val Thr Asp Arg Gly
1125 1130 1135
AGA CAG AAA GTG GTC TCT ATT GCA GAC ACC ACC AAT CAA AAG GCT GAG 1920
Arg Gln Lys Val Val Ser Ile Ala Asp Thr Thr Asn Gln Lys Ala Glu
1140 1145 1150
TTA CAA GCT ATC CTT ATG GCC TTA CAA GAG TCA GGA CGG GAT GTA AAC 1968
Leu Gln Ala Ile Leu Met Ala Leu Gln Glu Ser Gly Arg Asp Val Asn
1155 1160 1165
ATA GTC ACT GAC TCT CAG TAT GCT ATG GGA ATA ATT CAT TCA CAG CCA 2016
Ile Val Thr Asp Ser Gln Tyr Ala Met Gly Ile Ile His Ser Gln Pro
1170 1175 1180
GAT AAA AGT GAA TCA GAA TTG GTG AGC CAA ATA ATA GAA GAG CTC ATA 2064
Asp Lys Ser Glu Ser Glu Leu Val Ser Gln Ile Ile Glu Glu Leu Ile
1185 1190 1195 1200
AAA AAG GAA AGA GTT TAT CTC TCT TGG GTA CCT GCA CAT AAA GGT ATT 2112
Lys Lys Glu Arg Val Tyr Leu Ser Trp Val Pro Ala His Lys Gly Ile
1205 1210 1215
GGA GGA AAT GAG CAG GTA GAC AAA TTA GTT AGC TCA GGA ATT AGA AAA 2160
Gly Gly Asn Glu Gln Val Asp Lys Leu Val Ser Ser Gly Ile Arg Lys
1220 1225 1230
ATA TTA TTC CTA GAT GGT ATA GAA AAA GCC CAA GAA GAT CAT GAC AGA 2208
Ile Leu Phe Leu Asp Gly Ile Glu Lys Ala Gln Glu Asp His Asp Arg
1235 1240 1245
TAT CAC AGC AAT TGG AAA GCA ATG GCC AGT GAT TTT AAC TTA CCC CCC 2256
Tyr His Ser Asn Trp Lys Ala Met Ala Ser Asp Phe Asn Leu Pro Pro
1250 1255 1260
ATA GTG GCA AAA GAA ATA GTA GCC AGC TGT GAC AAA TGC CAG CTA AAA 2304
Ile Val Ala Lys Glu Ile Val Ala Ser Cys Asp Lys Cys Gln Leu Lys
1265 1270 1275 1280
GGG GAA GCC ATG CAT GGA CAG GTC AAT TGT AGT CCA GGA GTG TGG CAA 2352
Gly Glu Ala Met His Gly Gln Val Asn Cys Ser Pro Gly Val Trp Gln
1285 1290 1295
TTA GAT TGT ACA CAC TTA GAG GGA AAA ATC ATC CTT GTG GCG GTC CAT 2400
Leu Asp Cys Thr His Leu Glu Gly Lys Ile Ile Leu Val Ala Val His
1300 1305 1310
GTG GCC AGT GGC TAC TTA GAA GCA GAA GTT ATT CCT GCA GAG ACA GGA 2448
Val Ala Ser Gly Tyr Leu Glu Ala Glu Val Ile Pro Ala Glu Thr Gly
1315 1320 1325
CAG GAA ACA GCA TAT TTT ATT TTA AAG TTA GCT GGA AGA TGG CCA GTA 2496
Gln Glu Thr Ala Tyr Phe Ile Leu Lys Leu Ala Gly Arg Trp Pro Val
1330 1335 1340
AAA GTT ATA CAC ACT GAT AAT GGA TCC AAT TTC ACT AGT GCC ACT GTA 2544
Lys Val Ile His Thr Asp Asn Gly Ser Asn Phe Thr Ser Ala Thr Val
1345 1350 1355 1360
AAA GCA GCC TGT TGG TGG GCA AAT ATC AAA CAG GAA TTT GGG ATA CCC 2592
Lys Ala Ala Cys Trp Trp Ala Asn Ile Lys Gln Glu Phe Gly Ile Pro
1365 1370 1375
TAC AAT CCT CAA AGT CAG GGA GCA GTA GAG TCC ATG AAT AAA GAA TTA 2640
Tyr Asn Pro Gln Ser Gln Gly Ala Val Glu Ser Met Asn Lys Glu Leu
1380 1385 1390
AAG AAA ATT ATA GGA CAA ATC AGA GAT CAA GCA GAA CAT CTA AAG ACA 2688
Lys Lys Ile Ile Gly Gln Ile Arg Asp Gln Ala Glu His Leu Lys Thr
1395 1400 1405
GCA GTG CAA ATG GCG GTT TTC ATT CAC AAT TTT AAA AGA AAA GGG GGG 2736
Ala Val Gln Met Ala Val Phe Ile His Asn Phe Lys Arg Lys Gly Gly
1410 1415 1420
ATT GGG GGG TAC ACT GCA GGG GAA AGA ATA ATA GAC ATA ATA GCA ACA 2784
Ile Gly Gly Tyr Thr Ala Gly Glu Arg Ile Ile Asp Ile Ile Ala Thr
1425 1430 1435 1440
GAC ATA CAG ACA ACA AAT TTA CAA ACA CAA ATT TTA AAA GTT CAA AAT 2832
Asp Ile Gln Thr Thr Asn Leu Gln Thr Gln Ile Leu Lys Val Gln Asn
1445 1450 1455
TTT CGG GTT TAT TAC AGA GAC AGC AGA GAT CCC ATT TGG AAA GGA CCA 2880
Phe Arg Val Tyr Tyr Arg Asp Ser Arg Asp Pro Ile Trp Lys Gly Pro
1460 1465 1470
GCC AAA CTT CTG TGG AAA GGA GAA GGG GCA GTG GTA ATT CAA GAT AAC 2928
Ala Lys Leu Leu Trp Lys Gly Glu Gly Ala Val Val Ile Gln Asp Asn
1475 1480 1485
GGG GAT ATA AAA GTA GTC CCA CGT AGG AAA GCA AAA ATA ATT AGG GAT 2976
Gly Asp Ile Lys Val Val Pro Arg Arg Lys Ala Lys Ile Ile Arg Asp
1490 1495 1500
TAT GGA AAA CAG ATG GCA GGT GAT GGT TGT GTG GCA AGT GGA CAG GAT 3024
Tyr Gly Lys Gln Met Ala Gly Asp Gly Cys Val Ala Ser Gly Gln Asp
1505 1510 1515 1520
GAA AAT CAG GAA ATG GAA TAG 3045
Glu Asn Gln Glu Met Glu
1525
(2)SEQ ID NO:6的信息:
(i)序列特征:
(A)长度:1014个氨基酸
(B)类型:氨基酸
(D)拓扑类型:线性
(ii)分子类型:蛋白质
(xi)SEQ ID NO:6的序列描述:
Phe Phe Arg Glu Glu Leu Val Ser Leu Gln Arg Glu Thr Arg Lys Leu
1 5 10 15
Pro Pro Asp Asn Asn Lys Glu Arg Ala His Ser Pro Ala Thr Arg Glu
20 25 30
Leu Trp Val Ser Gly Gly Glu Glu His Thr Gly Glu Gly Asp Ala Gly
35 40 45
Glu Pro Gly Glu Asp Arg Glu Leu Ser Val Pro Thr Phe Asn Phe Pro
50 55 60
Gln Ile Thr Leu Trp Gln Arg Pro Val Ile Thr Val Lys Ile Gly Lys
65 70 75 80
Glu Val Arg Glu Ala Leu Leu Asp Thr Gly Ala Asp Asp Thr Val Ile
85 90 95
Glu Glu Leu Gln Leu Glu Gly Lys Trp Lys Pro Lys Met Ile Gly Gly
100 105 110
Ile Gly Gly Phe Ile Lys Val Arg Gln Tyr Asp Asn Ile Thr Val Asp
115 120 125
Ile Gln Gly Arg Lys Ala Val Gly Thr Val Leu Val Gly Pro Thr Pro
130 135 140
Val Asn Ile Ile Gly Arg Asn Leu Leu Thr Gln Ile Gly Cys Thr Leu
145 150 155 160
Asn Phe Pro Ile Ser Pro Ile Glu Thr Val Pro Val Lys Leu Lys Pro
165 170 175
Gly Met Asp Gly Pro Lys Val Lys Gln Trp Pro Leu Thr Thr Glu Lys
180 185 190
Ile Glu Ala Leu Arg Glu Ile Cys Thr Glu Met Glu Lys Glu Gly Lys
195 200 205
Ile Ser Arg Ile Gly Pro Glu Asn Pro Tyr Asn Thr Pro Ile Phe Ala
210 215 220
Ile Lys Lys Lys Asp Ser Thr Lys Trp Arg Lys Leu Val Asp Phe Arg
225 230 235 240
Glu Leu Asn Lys Arg Thr Gln Asp Phe Trp Glu Val Gln Leu Gly Ile
245 250 255
Pro His Pro Ala Gly Leu Lys Gln Lys Lys Ser Val Thr Val Leu Asp
260 265 270
Val Gly Asp Ala Tyr Phe Ser Cys Pro Leu Asp Lys Asp Phe Arg Lys
275 280 285
Tyr Thr Ala Phe Thr Ile Pro Ser Ile Asn Asn Glu Thr Pro Gly Ile
290 295 300
Arg Tyr Gln Tyr Asn Val Leu Pro Gln Gly Trp Lys Gly Ser Pro Ala
305 310 315 320
Ile Phe Gln Ser Thr Met Thr Lys Ile Leu Glu Pro Phe Arg Glu Lys
325 330 335
His Pro Glu Ile Ile Ile Tyr Gln Tyr Met Asp Asp Leu Tyr Val Gly
340 345 350
Ser Asp Leu Glu Leu Ala Gln His Arg Glu Ala Val Glu Asp Leu Arg
355 360 365
Asp His Leu Leu Lys Trp Gly Phe Thr Thr Pro Asp Lys Lys His Gln
370 375 380
Lys Glu Pro Pro Phe Leu Trp Met Gly Tyr Glu Leu His Pro Asp Lys
385 390 395 400
Trp Thr Val Gln Pro Ile Lys Leu Pro Glu Lys Asp Val Trp Thr Val
405 410 415
Asn Asp Ile Gln Lys Leu Val Gly Lys Leu Asn Trp Ala Ser Gln Ile
420 425 430
Tyr Pro Gly Ile Arg Val Lys Gln Leu Cys Lys Leu Ile Arg Gly Ala
435 440 445
Arg Ala Leu Thr Glu Val Val Asn Phe Thr Glu Glu Ala Glu Leu Glu
450 455 460
Leu Ala Glu Asn Arg Glu Ile Leu Lys Glu Pro Leu His Gly Val Tyr
465 470 475 480
Tyr Asp Pro Gly Lys Glu Leu Val Ala Glu Ile Gln Lys Gln Gly Gln
485 490 495
Gly Gln Trp Thr Tyr Gln Ile Tyr Gln Glu Leu His Lys Asn Leu Lys
500 505 510
Thr Gly Lys Tyr Ala Lys Met Arg Ser Ala His Thr Asn Asp Ile Lys
515 520 525
Gln Leu Val Glu Val Val Arg Lys Val Ala Thr Glu Ser Ile Val Ile
530 535 540
Trp Gly Lys Thr Pro Lys Phe Arg Leu Pro Val Gln Lys Glu Val Trp
545 550 555 560
Glu Ala Trp Trp Thr Asp His Trp Gln Ala Thr Trp Ile Pro Glu Trp
565 570 575
Glu Phe Val Asn Thr Pro Pro Leu Val Lys Leu Trp Tyr Gln Leu Glu
580 585 590
Thr Glu Pro Ile Ser Gly Ala Glu Thr Phe Tyr Val Asp Gly Ala Ala
595 600 605
Asn Arg Glu Thr Lys Leu Gly Lys Ala Gly Phe Val Thr Asp Arg Gly
610 615 620
Arg Gln Lys Val Val Ser Ile Ala Asp Thr Thr Asn Gln Lys Ala Glu
625 630 635 640
Leu Gln Ala Ile Leu Met Ala Leu Gln Glu Ser Gly Arg Asp Val Asn
645 650 655
Ile Val Thr Asp Ser Gln Tyr Ala Met Gly Ile Ile His Ser Gln Pro
660 665 670
Asp Lys Ser Glu Ser Glu Leu Val Ser Gln Ile Ile Glu Glu Leu Ile
675 680 685
Lys Lys Glu Arg Val Tyr Leu Ser Trp Val Pro Ala His Lys Gly Ile
690 695 700
Gly Gly Asn Glu Gln Val Asp Lys Leu Val Ser Ser Gly Ile Arg Lys
705 710 715 720
Ile Leu Phe Leu Asp Gly Ile Glu Lys Ala Gln Glu Asp His Asp Arg
725 730 735
Tyr His Ser Asn Trp Lys Ala Met Ala Ser Asp Phe Asn Leu Pro Pro
740 745 750
Ile Val Ala Lys Glu Ile Val Ala Ser Cys Asp Lys Cys Gln Leu Lys
755 760 765
Gly Glu Ala Met His Gly Gln Val Asn Cys Ser Pro Gly Val Trp Gln
770 775 780
Leu Asp Cys Thr His Leu Glu Gly Lys Ile Ile Leu Val Ala Val His
785 790 795 800
Val Ala Ser Gly Tyr Leu Glu Ala Glu Val Ile Pro Ala Glu Thr Gly
805 810 815
Gln Glu Thr Ala Tyr Phe Ile Leu Lys Leu Ala Gly Arg Trp Pro Val
820 825 830
Lys Val Ile His Thr Asp Asn Gly Ser Asn Phe Thr Ser Ala Thr Val
835 840 845
Lys Ala Ala Cys Trp Trp Ala Asn Ile Lys Gln Glu Phe Gly Ile Pro
850 855 860
Tyr Asn Pro Gln Ser Gln Gly Ala Val Glu Ser Met Asn Lys Glu Leu
865 870 875 880
Lys Lys Ile Ile Gly Gln Ile Arg Asp Gln Ala Glu His Leu Lys Thr
885 890 895
Ala Val Gln Met Ala Val Phe Ile His Asn Phe Lys Arg Lys Gly Gly
900 905 910
Ile Gly Gly Tyr Thr Ala Gly Glu Arg Ile Ile Asp Ile Ile Ala Thr
915 920 925
Asp Ile Gln Thr Thr Asn Leu Gln Thr Gln Ile Leu Lys Val Gln Asn
930 935 940
Phe Arg Val Tyr Tyr Arg Asp Ser Arg Asp Pro Ile Trp Lys Gly Pro
945 950 955 960
Ala Lys Leu Leu Trp Lys Gly Glu Gly Ala Val Val Ile Gln Asp Asn
965 970 975
Gly Asp Ile Lys Val Val Pro Arg Arg Lys Ala Lys Ile Ile Arg Asp
980 985 990
Tyr Gly Lys Gln Met Ala Gly Asp Gly Cys Val Ala Ser Gly Gln Asp
995 1000 1005
Glu Asn Gln Glu Met Glu
1010
(2)SEQ ID NO:7的信息:
(i)序列特征:
(A)长度:579个碱基对
(B)类型:核酸
(C)链型:单链
(D)拓扑类型:线性
(ii)分子类型:DNA(基因组)
(ix)特征
(A)名称/关键词:CDS
(B)位置:1..576
(xi)SEQ ID NO:7的序列描述:
ATG GAA AAC AGA TGG CAG GTG ATG GTT GTG TGG CAA GTG GAC AGG ATG 48
Met Glu Asn Arg Trp Gln Val Met Val Val Trp Gln Val Asp Arg Met
1015 1020 1025 1030
AAA ATC AGG AAA TGG AAT AGC TTA GTA AAA CAT CAT ATG TAT GTG TCA 96
Lys Ile Arg Lys Trp Asn Ser Leu Val Lys His His Met Tyr Val Ser
1035 1040 1045
AAA AAG GCA AAA GGA TGG TAT TAT AGA CAT CAT TAT GAA ACA CAT CAC 144
Lys Lys Ala Lys Gly Trp Tyr Tyr Arg His His Tyr Glu Thr His His
1050 1055 1060
CCA AAA ATA AGT TCA GAA GTA CAT ATC CCA GTA GGT CAG GCA AGA TTA 192
Pro Lys Ile Ser Ser Glu Val His Ile Pro Val Gly Gln Ala Arg Leu
1065 1070 1075
GTG ACA GTC ACT TAT TGG GGG CTA ACA ACA GGA GAA CAG TCT TGG CAT 240
Val Thr Val Thr Tyr Trp Gly Leu Thr Thr Gly Glu Gln Ser Trp His
1080 1085 1090
CTA GGA CAT GGA GTA TCC ATA GAA TGG AGA CTA AGA AAA TAC AAG ACA 288
Leu Gly His Gly Val Ser Ile Glu Trp Arg Leu Arg Lys Tyr Lys Thr
1095 1100 1105 1110
CAA GTT GAT CCT GAA ATG GCA GAC AAG CTA ATA CAT CTT CAT TAT TTT 336
Gln Val Asp Pro Glu Met Ala Asp Lys Leu Ile His Leu His Tyr Phe
1115 1120 1125
GAT TGT TTT ACA GCC TCT GCC ATA AGG CAA GCG GTC TTA GGG AGA CCA 384
Asp Cys Phe Thr Ala Ser Ala Ile Arg Gln Ala Val Leu Gly Arg Pro
1130 1135 1140
GTA TTA CCT AGG TGT GAA TAT CCA GCA GGG CAC AAA CAG GTA GGC ACC 432
Val Leu Pro Arg Cys Glu Tyr Pro Ala Gly His Lys Gln Val Gly Thr
1145 1150 1155
CTA CAA TAT CTA GCA CTA ACA GCC TGG GTG GGA GCA AAG AAG AGA AAG 480
Leu Gln Tyr Leu Ala Leu Thr Ala Trp Val Gly Ala Lys Lys Arg Lys
1160 1165 1170
CCA CCC TTA CCT AGT GTG ACT AAG CTA ACA GAA GAT AGA TGG AAC GAG 528
Pro Pro Leu Pro Ser Val Thr Lys Leu Thr Glu Asp Arg Trp Asn Glu
1175 1180 1185 1190
CAC CAG AAG ATG CAG GGC CAC AGA GGG AAC CCT ATA ATG AAT GGG CAC 576
His Gln Lys Met Gln Gly His Arg Gly Asn Pro Ile Met Asn Gly His
1195 1200 1205
TAG 579
(2)SEQ ID NO:8的信息:
(i)序列特征:
(A)长度:192个氨基酸
(B)类型:氨基酸
(D)拓扑类型:线性
(ii)分子类型:蛋白质
(xi)SEQ ID NO:8的序列描述:
Met Glu Asn Arg Trp Gln Val Met Val Val Trp Gln Val Asp Arg Met
1 5 10 15
Lys Ile Arg Lys Trp Asn Ser Leu Val Lys His His Met Tyr Val Ser
20 25 30
Lys Lys Ala Lys Gly Trp Tyr Tyr Arg His His Tyr Glu Thr His His
35 40 45
Pro Lys Ile Ser Ser Glu Val His Ile Pro Val Gly Gln Ala Arg Leu
50 55 60
Val Thr Val Thr Tyr Trp Gly Leu Thr Thr Gly Glu Gln Ser Trp His
65 70 75 80
Leu Gly His Gly Val Ser Ile Glu Trp Arg Leu Arg Lys Tyr Lys Thr
85 90 95
Gln Val Asp Pro Glu Met Ala Asp Lys Leu Ile His Leu His Tyr Phe
100 105 110
Asp Cys Phe Thr Ala Ser Ala Ile Arg Gln Ala Val Leu Gly Arg Pro
115 120 125
Val Leu Pro Arg Cys Glu Tyr Pro Ala Gly His Lys Gln Val Gly Thr
130 135 140
Leu Gln Tyr Leu Ala Leu Thr Ala Trp Val Gly Ala Lys Lys Arg Lys
145 150 155 160
Pro Pro Leu Pro Ser Val Thr Lys Leu Thr Glu Asp Arg Trp Asn Glu
165 170 175
His Gln Lys Met Gln Gly His Arg Gly Asn Pro Ile Met Asn Gly His
180 185 190
(2)SEQ ID NO:9的信息:
(i)序列特征:
(A)长度:288个碱基对
(B)类型:核酸
(C)链型:单链
(D)拓扑类型:线性
(ii)分子类型:DNA(基因组)
(ix)特征
(A)名称/关键词:CDS
(B)位置:1..285
(xi)SEQ ID NO:9的序列描述:
ATG GAA CGA GCA CCA GAA GAT GCA GGG CCA CAG AGG GAA CCC TAT AAT 48
Met Glu Arg Ala Pro Glu Asp Ala Gly Pro Gln Arg Glu Pro Tyr Asn
195 200 205
GAA TGG GCA CTA GAA TTA TTA GAA GAA TTA AAA AAT GAA GCT GTG CGC 96
Glu Trp Ala Leu Glu Leu Leu Glu Glu Leu Lys Asn Glu Ala Val Arg
210 215 220
CAT TTT CCA AGG ATT TGG CTA CAT GGG TTA GGA CAA CAC ATC TAT AAC 144
His Phe Pro Arg Ile Trp Leu His Gly Leu Gly Gln His Ile Tyr Asn
225 230 235 240
ACA TAT GGA GAC ACC TGG GAG GGG GTA GAG GCA ATT ATC AGG ATA CTA 192
Thr Tyr Gly Asp Thr Trp Glu Gly Val Glu Ala Ile Ile Arg Ile Leu
245 250 255
CAA CAA TTA CTG TTT ATC CAT TAT AGG ATT GGC TGC CAG CAC AGC AGA 240
Gln Gln Leu Leu Phe Ile His Tyr Arg Ile Gly Cys Gln His Ser Arg
260 265 270
ATA GGG ATC ACT CCT CAA AGG AGA AGG AAT GGA ACC AGT AGA TCC 285
Ile Gly Ile Thr Pro Gln Arg Arg Arg Asn Gly Thr Ser Arg Ser
275 280 285
TAG 288
(2)SEQ ID NO:10的信息:
(i)序列特征:
(A)长度:95个氨基酸
(B)类型:氨基酸
(D)拓扑类型:线性
(ii)分子类型:蛋白质
(xi)SEQ ID NO:10的序列描述:
Met Glu Arg Ala Pro Glu Asp Ala Gly Pro Gln Arg Glu Pro Tyr Asn
1 5 10 15
Glu Trp Ala Leu Glu Leu Leu Glu Glu Leu Lys Asn Glu Ala Val Arg
20 25 30
His Phe Pro Arg Ile Trp Leu His Gly Leu Gly Gln His Ile Tyr Asn
35 40 45
Thr Tyr Gly Asp Thr Trp Glu Gly Val Glu Ala Ile Ile Arg Ile Leu
50 55 60
Gln Gln Leu Leu Phe Ile His Tyr Arg Ile Gly Cys Gln His Ser Arg
65 70 75 80
Ile Gly Ile Thr Pro Gln Arg Arg Arg Asn Gly Thr Ser Arg Ser
85 90 95
(2)SEQ ID NO:11的信息:
(i)序列特征:
(A)长度:252个碱基对
(B)类型:核酸
(C)链型:单链
(D)拓扑类型:线性
(C)链型:单链
(D)拓扑类型:线性
(ii)分子类型:DNA(基因组)
(ix)特征
(A)名称/关键词:CDS
(B)位置:1..249
(xi)SEQ ID NO:11的序列描述:
ATG CTG TCA TTG GGA TTC ATA GCG TTA GGA GCA GCA GTT AGC ATA GCA 48
Met Leu Ser Leu Gly Phe Ile Ala Leu Gly Ala Ala Val Ser Ile Ala
100 105 110
GTA ATA GTC TGG GCA TTA CTA TAT AGA GAA TAT AAG AAA ATA AAA TTG 96
Val Ile Val Trp Ala Leu Leu Tyr Arg Glu Tyr Lys Lys Ile Lys Leu
115 120 125
CAG GAA AAA ATA AAA CAC ATA AGA CAG AGA ATA AGA GAA AGA GAA GAA 144
Gln Glu Lys Ile Lys His Ile Arg Gln Arg Ile Arg Glu Arg Glu Glu
130 135 140
GAT AGT GGC AAT GAA AGT GAT GGG GAT GCA GAG TGG TTG GAT GGG GAT 192
Asp Ser Gly Asn Glu Ser Asp Gly Asp Ala Glu Trp Leu Asp Gly Asp
145 150 155
GAA GAG TGG TTG GTT ACT CTT CTA TCT TCT AGT AAG CTT GAT CAA GGT 240
Glu Glu Trp Leu Val Thr Leu Leu Ser Ser Ser Lys Leu Asp Gln Gly
160 165 170 175
AAT TGG GTC TGA 252
Asn Trp Val
(2)SEQ ID NO:12的信息:
(i)序列特征:
(A)长度:83个氨基酸
(B)类型:氨基酸
(D)拓扑类型:线性
(ii)分子类型:蛋白质
(xi)SEQ ID NO:12的序列描述:
Met Leu Ser Leu Gly Phe Ile Ala Leu Gly Ala Ala Val Ser Ile Ala
1 5 10 15
Val Ile Val Trp Ala Leu Leu Tyr Arg Glu Tyr Lys Lys Ile Lys Leu
20 25 30
Gln Glu Lys Ile Lys His Ile Arg Gln Arg Ile Arg Glu Arg Glu Glu
35 40 45
Asp Ser Gly Asn Glu Ser Asp Gly Asp Ala Glu Trp Leu Asp Gly Asp
50 55 60
Glu Glu Trp Leu Val Thr Leu Leu Ser Ser Ser Lys Leu Asp Gln Gly
65 70 75 80
Asn Trp Val
(2)SEQ ID NO:13的信息:
(i)序列特征:
(A)长度:306个碱基对
(B)类型:核酸
(C)链型:单链
(D)拓扑类型:线性
(ii)分子类型:DNA(基因组)
(ix)特征
(A)名称/关键词:CDS
(B)位置:1..303
(xi)SEQ ID NO:13的序列描述:
ATG GAA CCA GTA GAT CCT AGA TTA GAG CCC TGG AAT CAT CCA GGA AGC 48
Met Glu Pro Val Asp Pro Arg Leu Glu Pro Trp Asn His Pro Gly Ser
85 90 95
CAA CCT AAA ACA GCT TGC AAT AAT TGC TAT TGT AAA AGA TGT TGC TAT 96
Gln Pro Lys Thr Ala Cys Asn Asn Cys Tyr Cys Lys Arg Cys Cys Tyr
100 105 110 115
CAC TGC TTA TAT TGC TTC ACA AAG AAA GGC TTA GGC ATC TCA TAT GGC 144
His Cys Leu Tyr Cys Phe Thr Lys Lys Gly Leu Gly Ile Ser Tyr Gly
120 125 130
AGG AAG AAG CGG AGT CAA CGA CGA AGA ACT CCT CAG AGC AGT AAG AGT 192
Arg Lys Lys Arg Ser Gln Arg Arg Arg Thr Pro Gln Ser Ser Lys Ser
135 140 145
CAT CAA GAT CTT ATA CCA GAG CAG CCC TTA TCC CAA CAG CAA GGG GAC 240
His Gln Asp Leu Ile Pro Glu Gln Pro Leu Ser Gln Gln Gln Gly Asp
150 155 160
CAG ACA GGC CAG AAG AAA CAG AAG GAG GCG TTG GAG AGC AAG ACA GAG 288
Gln Thr Gly Gln Lys Lys Gln Lys Glu Ala Leu Glu Ser Lys Thr Glu
165 170 175
GCA GAT CCG TGC GAT TAG 306
Ala Asp Pro Cys Asp
180
(2)SEQ ID NO:14的信息:
(i)序列特征:
(A)长度:101个氨基酸
(B)类型:氨基酸
(D)拓扑类型:线性
(ii)分子类型:蛋白质
(xi)SEQ ID NO:14的序列描述:
Met Glu Pro Val Asp Pro Arg Leu Glu Pro Trp Asn His Pro Gly Ser
1 5 10 15
Gln Pro Lys Thr Ala Cys Asn Asn Cys Tyr Cys Lys Arg Cys Cys Tyr
20 25 30
His Cys Leu Tyr Cys Phe Thr Lys Lys Gly Leu Gly Ile Ser Tyr Gly
35 40 45
Arg Lys Lys Arg Ser Gln Arg Arg Arg Thr Pro Gln Ser Ser Lys Ser
50 55 60
His Gln Asp Leu Ile Pro Glu Gln Pro Leu Ser Gln Gln Gln Gly Asp
65 70 75 80
Gln Thr Gly Gln Lys Lys Gln Lys Glu Ala Leu Glu Ser Lys Thr Glu
85 90 95
Ala Asp Pro Cys Asp
100
(2)SEQ ID NO:15的信息:
(i)序列特征:
(A)长度:369个碱基对
(B)类型:核酸
(C)链型:单链
(D)拓扑类型:线性
(ii)分子类型:DNA(基因组)
(ix)特征
(A)名称/关键词:CDS
(B)位置:1..366
(xi)SEQ ID NO:15的序列描述:
ATG GCA GGA AGA AGC GGA GTC AAC GAC GAA GAA CTC CTC AGA GCA GTA 48
Met Ala Gly Arg Ser Gly Val Asn Asp Glu Glu Leu Leu Arg Ala Val
105 110 115
AGA GTC ATC AAG ATC TTA TAC CAG AGC AGT TAT CCC AAC AGC AAG GGG 96
Arg Val Ile Lys Ile Leu Tyr Gln Ser Ser Tyr Pro Asn Ser Lys Gly
120 125 130
ACC AGA CAG GCC AGA AGA AAC AGA AGG AGG CGT TGG AGA GCA AGA CAG 144
Thr Arg Gln Ala Arg Arg Asn Arg Arg Arg Arg Trp Arg Ala Arg Gln
135 140 145
AGG CAG ATC CGT GCG ATT AGT GAG CGG ATT CTC AGC TCT TGT CTG GGA 192
Arg Gln Ile Arg Ala Ile Ser Glu Arg Ile Leu Ser Ser Cys Leu Gly
150 155 160 165
GGA CCT CCG GAA CCT GTT GAT CTT CCT CTA CCA CCG CTT GAC AGA CTC 240
Gly Pro Pro Glu Pro Val Asp Leu Pro Leu Pro Pro Leu Asp Arg Leu
170 175 180
ACT CTT GAT ACT GAG GAG GAC TCT GGA ACT CCT GGG ACA GAG TCT CAG 288
Thr Leu Asp Thr Glu Glu Asp Ser Gly Thr Pro Gly Thr Glu Ser Gln
185 190 195
CAG GGG ACT GCA ACT ACT GAA TGA ACT CAG AAC ACA CTT GTG GGG AAT 336
Gln Gly Thr Ala Thr Thr Glu * Thr Gln Asn Thr Leu Val Gly Asn
200 205 210
ACT TGC ATA TTG GGG AAA AGA GTT AAG GGA TAG 369
Thr Cys Ile Leu Gly Lys Arg Val Lys Gly
215 220
(2)SEQ ID NO:16的信息:
(i)序列特征:
(A)长度:122个氨基酸
(B)类型:氨基酸
(D)拓扑类型:线性
(ii)分子类型:蛋白质
(xi)SEQ ID NO:16的序列描述:
Met Ala Gly Arg Ser Gly Val Asn Asp Glu Glu Leu Leu Arg Ala Val
1 5 10 15
Arg Val Ile Lys Ile Leu Tyr Gln Ser Ser Tyr Pro Asn Ser Lys Gly
20 25 30
Thr Arg Gln Ala Arg Arg Asn Arg Arg Arg Arg Trp Arg Ala Arg Gln
35 40 45
Arg Gln Ile Arg Ala Ile Ser Glu Arg Ile Leu Ser Ser Cys Leu Gly
50 55 60
Gly Pro Pro Glu Pro Val Asp Leu Pro Leu Pro Pro Leu Asp Arg Leu
65 70 75 80
Thr Leu Asp Thr Glu Glu Asp Ser Gly Thr Pro Gly Thr Glu Ser Gln
85 90 95
Gln Gly Thr Ala Thr Thr Glu * Thr Gln Asn Thr Leu Val Gly Asn
100 105 110
Thr Cys Ile Leu Gly Lys Arg Val Lys Gly
115 120
(2)SEQ ID NO:17的信息:
(i)序列特征:
(A)长度:2559个碱基对
(B)类型:核酸
(C)链型:单链
(D)拓扑类型:线性
(ii)分子类型:DNA(基因组)
(ix)特征
(A)名称/关键词:CDS
(B)位置:1..2556
(xi)SEQ ID NO:17的序列描述:
ATG AAA GTG ATG GGG ATG CAG AGT GGT TGG ATG GGG ATG AAG AGT GGT 48
Met Lys Val Met Gly Met Gln Ser Gly Trp MetGly Met Lys Ser Gly
125 130 135
TGG TTA CTC TTC TAT CTT CTA GTA AGC TTG ATC AAG GTA ATT GGG TCT 96
Trp Leu Leu Phe Tyr Leu Leu Val Ser Leu Ile Lys Val Ile Gly Ser
140 145 150
GAA CAA CAT TGG GTA ACA GTG TAC TAT GGG GTA CCA GTA TGG AGA GAA 144
Glu Gln His Trp Val Thr Val Tyr Tyr Gly Val Pro Val Trp Arg Glu
155 160 165 170
GCA GAG ACA ACT CTT TTC TGT GCT TCA GAT GCT AAA GCC CAT AGT ACA 192
Ala Glu Thr Thr Leu Phe Cys Ala Ser Asp Ala Lys Ala His Ser Thr
175 180 185
GAG GCT CAC AAC ATC TGG GCC ACA CAA GCA TGT GTT CCT ACT GAT CCC 240
Glu Ala His Asn Ile Trp Ala Thr Gln Ala Cys Val Pro Thr Asp Pro
190 195 200
AAT CCA CAA GAA GTG CTA TTA CCC AAT GTA ACT GAA AAA TTT AAT ATG 288
Asn Pro Gln Glu Val Leu Leu Pro Asn Val Thr Glu Lys Phe Asn Met
205 210 215
TGG GAA AAT AAA ATG GCA GAC CAA ATG CAA GAG GAT ATT ATC AGT CTG 336
Trp Glu Asn Lys Met Ala Asp Gln Met Gln Glu Asp Ile Ile Ser Leu
220 225 230
TGG GAA CAG AGC TTA AAG CCC TGT GTT AAA TTA ACC CCA TTA TGT GTA 384
Trp Glu Gln Ser Leu Lys Pro Cys Val Lys Leu Thr Pro Leu Cys Val
235 240 245 250
ACT ATG CTT TGT AAC GAT AGC TAT GGG GAG GAA AGG AAC AAT ACA AAT 432
Thr Met Leu Cys Asn Asp Ser Tyr Gly Glu Glu Arg Asn Asn Thr Asn
255 260 265
ATG ACA ACA AGA GAA CCA GAC ATA GGA TAC AAA CAA ATG AAA AAT TGC 480
Met Thr Thr Arg Glu Pro Asp Ile Gly Tyr Lys Gln Met Lys Asn Cys
270 275 280
TCA TTC AAT GCA ACC ACT GAG CTA ACA GAT AAA AAG AAG CAA GTT TAC 528
Ser Phe Asn Ala Thr Thr Glu Leu Thr Asp Lys Lys Lys Gln Val Tyr
285 290 295
TCT CTG TTT TAT GTA GAA GAT GTA GTA CCA ATC AAT GCC TAT AAT AAA 576
Ser Leu Phe Tyr Val Glu Asp Val Val Pro Ile Asn Ala Tyr Asn Lys
300 305 310
ACA TAT AGG CTA ATA AAT TGT AAT ACC ACA GCT GTG ACA CAA GCT TGT 624
Thr Tyr Arg Leu Ile Asn Cys Asn Thr Thr Ala Val Thr Gln Ala Cys
315 320 325 330
CCT AAG ACT TCC TTT GAG CCA ATT CCA ATA CAT TAC TGT GCA CCA CCA 672
Pro Lys Thr Ser Phe Glu Pro Ile Pro Ile His Tyr Cys Ala Pro Pro
335 340 345
GGC TTT GCC ATT ATG AAA TGT AAT GAA GGA AAC TTT AGT GGA AAT GGA 720
Gly Phe Ala Ile Met Lys Cys Asn Glu Gly Asn Phe Ser Gly Asn Gly
350 355 360
AGC TGT ACA AAT GTG AGT ACT GTA CAA TGC ACA CAT GGA ATA AAG CCA 768
Ser Cys Thr Asn Val Ser Thr Val Gln Cys Thr His Gly Ile Lys Pro
365 370 375
GTG ATA TCC ACT CAG TTA ATC CTA AAT GGA AGC TTA AAT ACA GAT GGA 816
Val Ile Ser Thr Gln Leu Ile Leu Asn Gly Ser Leu Asn Thr Asp Gly
380 385 390
ATT GTT ATT AGA AAT GAT AGT CAC AGT AAT CTG TTG GTG CAA TGG AAT 864
Ile Val Ile Arg Asn Asp Ser His Ser Asn Leu Leu Val Gln Trp Asn
395 400 405 410
GAG ACA GTG CCA ATA AAT TGT ACA AGG CCA GGA AAT AAT ACA GGA GGA 912
Glu Thr Val Pro Ile Asn Cys Thr Arg Pro Gly Asn Asn Thr Gly Gly
415 420 425
CAG GTG CAG ATA GGA CCT GCT ATG ACA TTT TAT AAC ATA GAA AAA ATA 960
Gln Val Gln Ile Gly Pro Ala Met Thr Phe Tyr Asn Ile Glu Lys Ile
430 435 440
GTA GGA GAC ATT AGA CAA GCA TAC TGT AAT GTC TCT AAA GAA CTA TGG 1008
Val Gly Asp Ile Arg Gln Ala Tyr Cys Asn Val Ser Lys Glu Leu Trp
445 450 455
GAA CCA ATG TGG AAT AGA ACA AGA GAG GAA ATA AAG AAA ATC CTG GGG 1056
Glu Pro Met Trp Asn Arg Thr Arg Glu Glu Ile Lys Lys Ile Leu Gly
460 465 470
AAA AAC AAC ATA ACC TTC AGG GCT CGA GAG AGG AAT GAA GGA GAC CTA 1104
Lys Asn Asn Ile Thr Phe Arg Ala Arg Glu Arg Asn Glu Gly Asp Leu
475 480 485 490
GAA GTG ACA CAC TTA ATG TTC AAT TGT AGA GGA GAG TTT TTC TAT TGT 1152
Glu Val Thr His Leu Met Phe Asn Cys Arg Gly Glu Phe Phe Tyr Cys
495 500 505
AAC ACT TCC AAA TTA TTT AAT GAG GAA TTA CTT AAC GAG ACA GGT GAG 1200
Asn Thr Ser Lys Leu Phe Asn Glu Glu Leu Leu Asn Glu Thr Gly Glu
510 515 520
CCT ATT ACT CTG CCT TGT AGA ATA AGA CAG ATT GTA AAT TTG TGG ACA 1248
Pro Ile Thr Leu Pro Cys Arg Ile Arg Gln Ile Val Asn Leu Trp Thr
525 530 535
AGG GTA GGA AAA GGA ATT TAT GCA CCA CCA ATT CGG GGA GTT CTT AAC 1296
Arg Val Gly Lys Gly Ile Tyr Ala Pro Pro Ile Arg Gly Val Leu Asn
540 545 550
TGT ACC TCC AAT ATT ACT GGA CTG GTT CTA GAA TAT AGT GGT GGG CCT 1344
Cys Thr Ser Asn Ile Thr Gly Leu Val Leu Glu Tyr Ser Gly Gly Pro
555 560 565 570
GAC ACC AAG GAA ACA ATA GTA TAT CCC TCA GGA GGA AAC ATG GTT AAT 1392
Asp Thr Lys Glu Thr Ile Val Tyr Pro Ser Gly Gly Asn Met Val Asn
575 580 585
CTC TGG AGA CAA GAG TTG TAT AAG TAC AAA GTA GTT AGC ATA GAA CCC 1440
Leu Trp Arg Gln Glu Leu Tyr Lys Tyr Lys Val Val Ser Ile Glu Pro
590 595 600
ATA GGA GTA GCA CCA GGT AAA GCT AAA AGA CGC ACA GTG AGT AGA GAA 1488
Ile Gly Val Ala Pro Gly Lys Ala Lys Arg Arg Thr Val Ser Arg Glu
605 610 615
AAA AGA GCA GCC TTT GGA CTA GGT GCG CTG TTT CTT GGG TTT CTT GGA 1536
Lys Arg Ala Ala Phe Gly Leu Gly Ala Leu Phe Leu Gly Phe Leu Gly
620 625 630
GCA GCA GGG AGC ACT ATG GGC GCA GCG TCA ATA ACG CTG ACG GTA CAG 1584
Ala Ala Gly Ser Thr Met Gly Ala Ala Ser Ile Thr Leu Thr Val Gln
635 640 645 650
GCC CGG ACA TTA TTA TCT GGG ATA GTG CAA CAG CAG AAT ATT CTG TTG 1632
Ala Arg Thr Leu Leu Ser Gly Ile Val Gln Gln Gln Asn Ile Leu Leu
655 660 665
AGA GCA ATA GAG GCG CAA CAA CAT TTG TTG CAA CTC TCA ATC TGG GGC 1680
Arg Ala Ile Glu Ala Gln Gln His Leu Leu Gln Leu Ser Ile Trp Gly
670 675 680
ATT AAA CAG CTC CAG GCA AAA GTC CTT GCT ATA GAA AGA TAC CTT AGG 1728
Ile Lys Gln Leu Gln Ala Lys Val Leu Ala Ile Glu Arg Tyr Leu Arg
685 690 695
GAT CAG CAA ATC CTA AGT CTA TGG GGC TGC TCA GGA AAA ACA ATA TGC 1776
Asp Gln Gln Ile Leu Ser Leu Trp Gly Cys Ser Gly Lys Thr Ile Cys
700 705 710
TAT ACC ACT GTG CCT TGG AAT GAG ACT TGG AGC AAC AAT ACC TCT TAT 1824
Tyr Thr Thr Val Pro Trp Asn Glu Thr Trp Ser Asn Asn Thr Ser Tyr
715 720 725 730
GAT ACA ATC TGG AAT AAT TTA ACC TGG CAA CAA TGG GAT GAG AAA GTA 1872
Asp Thr Ile Trp Asn Asn Leu Thr Trp Gln Gln Trp Asp Glu Lys Val
735 740 745
AGA AAC TAT TCA GGT GTC ATT TTT GGA CTT ATA GAA CAG GCA CAA GAA 1920
Arg Asn Tyr Ser Gly Val Ile Phe Gly Leu Ile Glu Gln Ala Gln Glu
750 755 760
CAA CAG AAC ACA AAT GAG AAA TCA CTC TTG GAA TTG GAT CAA TGG GAC 1968
Gln Gln Asn Thr Asn Glu Lys Ser Leu Leu Glu Leu Asp Gln Trp Asp
765 770 775
AGT CTG TGG AGC TGG TTT GGT ATT ACA AAA TGG CTG TGG TAT ATA AAA 2016
Ser Leu Trp Ser Trp Phe Gly Ile Thr Lys Trp Leu Trp Tyr Ile Lys
780 785 790
ATA GCT ATA ATG ATA GTA GCA GGC ATT GTA GGC ATA AGA ATC ATA AGT 2064
Ile Ala Ile Met Ile Val Ala Gly Ile Val Gly Ile Arg Ile Ile Ser
795 800 805 810
ATA GTA ATA ACT ATA ATA GCA AGA GTT AGG CAG GGA TAT TCT CCC CTT 2112
Ile Val Ile Thr Ile Ile Ala Arg Val Arg Gln Gly Tyr Ser Pro Leu
815 820 825
TCG TTG CAG ACC CTT ATC CCA ACA GCA AGG GGA CCA GAC AGG CCA GAA 2160
Ser Leu Gln Thr Leu Ile Pro Thr Ala Arg Gly Pro Asp Arg Pro Glu
830 835 840
GAA ACA GAA GGA GGC GTT GGA GAG CAA GAC AGA GGC AGA TCC GTG CGA 2208
Glu Thr Glu Gly Gly Val Gly Glu Gln Asp Arg Gly Arg Ser Val Arg
845 850 855
TTA GTG AGC GGA TTC TCA GCT CTT GTC TGG GAG GAC CTC CGG AAC CTG 2256
Leu Val Ser Gly Phe Ser Ala Leu Val Trp Glu Asp Leu Arg Asn Leu
860 865 870
TTG ATC TTC CTC TAC CAC CGC TTG ACA GAC TCA CTC TTG ATA CTG AGG 2304
Leu Ile Phe Leu Tyr His Arg Leu Thr Asp Ser Leu Leu Ile Leu Arg
875 880 885 890
AGG ACT CTG GAA CTC CTG GGA CAG AGT CTC AGC AGG GGA CTG CAA CTA 2352
Arg Thr Leu Glu Leu Leu Gly Gln Ser Leu Ser Arg Gly Leu Gln Leu
895 900 905
CTG AAT GAA CTC AGA ACA CAC TTG TGG GGA ATA CTT GCA TAT TGG GGA 2400
Leu Asn Glu Leu Arg Thr His Leu Trp Gly Ile Leu Ala Tyr Trp Gly
910 915 920
AAA GAG TTA AGG GAT AGT GCT ATC AGC TTG CTT AAT ACA ACA GCT ATT 2448
Lys Glu Leu Arg Asp Ser Ala Ile Ser Leu Leu Asn Thr Thr Ala Ile
925 930 935
GTA GTA GCA GAA GGA ACA GAT AGG ATT ATA GAA TTA GCA CAA AGA ATA 2496
Val Val Ala Glu Gly Thr Asp Arg Ile Ile Glu Leu Ala Gln Arg Ile
940 945 950
GGA AGG GGA ATA TTA CAC ATA CCT AGA AGA ATC AGA CAA GGC CTA GAA 2544
Gly Arg Gly Ile Leu His Ile Pro Arg Arg Ile Arg Gln Gly Leu Glu
955 960 965 970
2559
AGA GCA CTG ATA TAA
Arg Ala Leu Ile
(2)SEQ ID NO:18的信息:
(i)序列特征:
(A)长度:852个氨基酸
(B)类型:氨基酸
(D)拓扑类型:线性
(ii)分子类型:蛋白质
(xi)SEQ ID NO:18的序列描述:
Met Lys Val Met Gly Met Gln Ser Gly Trp Met Gly Met Lys Ser Gly
1 5 10 15
Trp Leu Leu Phe Tyr Leu Leu Val Ser Leu Ile Lys Val Ile Gly Ser
20 25 30
Glu Gln His Trp Val Thr Val Tyr Tyr Gly Val Pro Val Trp Arg Glu
35 40 45
Ala Glu Thr Thr Leu Phe Cys Ala Ser Asp Ala Lys Ala His Ser Thr
50 55 60
Glu Ala His Asn Ile Trp Ala Thr Gln Ala Cys Val Pro Thr Asp Pro
65 70 75 80
Asn Pro Gln Glu Val Leu Leu Pro Asn Val Thr Glu Lys Phe Asn Met
85 90 95
Trp Glu Asn Lys Met Ala Asp Gln Met Gln Glu Asp Ile Ile Ser Leu
100 105 110
Trp Glu Gln Ser Leu Lys Pro Cys Val Lys Leu Thr Pro Leu Cys Val
115 120 125
Thr Met Leu Cys Asn Asp Ser Tyr Gly Glu Glu Arg Asn Asn Thr Asn
130 135 140
Met Thr Thr Arg Glu Pro Asp Ile Gly Tyr Lys Gln Met Lys Asn Cys
145 150 155 160
Ser Phe Asn Ala Thr Thr Glu Leu Thr Asp Lys Lys Lys Gln Val Tyr
165 170 175
Ser Leu Phe Tyr Val Glu Asp Val Val Pro Ile Asn Ala Tyr Asn Lys
180 185 190
Thr Tyr Arg Leu Ile Asn Cys Asn Thr Thr Ala Val Thr Gln Ala Cys
195 200 205
Pro Lys Thr Ser Phe Glu Pro Ile Pro Ile His Tyr Cys Ala Pro Pro
210 215 220
Gly Phe Ala Ile Met Lys Cys Asn Glu Gly Asn Phe Ser Gly Asn Gly
225 230 235 240
Ser Cys Thr Asn Val Ser Thr Val Gln Cys Thr His Gly Ile Lys Pro
245 250 255
Val Ile Ser Thr Gln Leu Ile Leu Asn Gly Ser Leu Asn Thr Asp Gly
260 265 270
Ile Val Ile Arg Asn Asp Ser His Ser Asn Leu Leu Val Gln Trp Asn
275 280 285
Glu Thr Val Pro Ile Asn Cys Thr Arg Pro Gly Asn Asn Thr Gly Gly
290 295 300
Gln Val Gln Ile Gly Pro Ala Met Thr Phe Tyr Asn Ile Glu Lys Ile
305 310 315 320
Val Gly Asp Ile Arg Gln Ala Tyr Cys Asn Val Ser Lys Glu Leu Trp
325 330 335
Glu Pro Met Trp Asn Arg Thr Arg Glu Glu Ile Lys Lys Ile Leu Gly
340 345 350
Lys Asn Asn Ile Thr Phe Arg Ala Arg Glu Arg Asn Glu Gly Asp Leu
355 360 365
Glu Val Thr His Leu Met Phe Asn Cys Arg Gly Glu Phe Phe Tyr Cys
370 375 380
Asn Thr Ser Lys Leu Phe Asn Glu Glu Leu Leu Asn Glu Thr Gly Glu
385 390 395 400
Pro Ile Thr Leu Pro Cys Arg Ile Arg Gln Ile Val Asn Leu Trp Thr
405 410 415
Arg Val Gly Lys Gly Ile Tyr Ala Pro Pro Ile Arg Gly Val Leu Asn
420 425 430
Cys Thr Ser Asn Ile Thr Gly Leu Val Leu Glu Tyr Ser Gly Gly Pro
435 440 445
Asp Thr Lys Glu Thr Ile Val Tyr Pro Ser Gly Gly Asn Met Val Asn
450 455 460
Leu Trp Arg Gln Glu Leu Tyr Lys Tyr Lys Val Val Ser Ile Glu Pro
465 470 475 480
Ile Gly Val Ala Pro Gly Lys Ala Lys Arg Arg Thr Val Ser Arg Glu
485 490 495
Lys Arg Ala Ala Phe Gly Leu Gly Ala Leu Phe Leu Gly Phe Leu Gly
500 505 510
Ala Ala Gly Ser Thr Met Gly Ala Ala Ser Ile Thr Leu Thr Val Gln
515 520 525
Ala Arg Thr Leu Leu Ser Gly Ile Val Gln Gln Gln Asn Ile Leu Leu
530 535 540
Arg Ala Ile Glu Ala Gln Gln His Leu Leu Gln Leu Ser Ile Trp Gly
545 550 555 560
Ile Lys Gln Leu Gln Ala Lys Val Leu Ala Ile Glu Arg Tyr Leu Arg
565 570 575
Asp Gln Gln Ile Leu Ser Leu Trp Gly Cys Ser Gly Lys Thr Ile Cys
580 585 590
Tyr Thr Thr Val Pro Trp Asn Glu Thr Trp Ser Asn Asn Thr Ser Tyr
595 600 605
Asp Thr Ile Trp Asn Asn Leu Thr Trp Gln Gln Trp Asp Glu Lys Val
610 615 620
Arg Asn Tyr Ser Gly Val Ile Phe Gly Leu Ile Glu Gln Ala Gln Glu
625 630 635 640
Gln Gln Asn Thr Asn Glu Lys Ser Leu Leu Glu Leu Asp Gln Trp Asp
645 650 655
Ser Leu Trp Ser Trp Phe Gly Ile Thr Lys Trp Leu Trp Tyr Ile Lys
660 665 670
Ile Ala Ile Met Ile Val Ala Gly Ile Val Gly Ile Arg Ile Ile Ser
675 680 685
Ile Val Ile Thr Ile Ile Ala Arg Val Arg Gln Gly Tyr Ser Pro Leu
690 695 700
Ser Leu Gln Thr Leu Ile Pro Thr Ala Arg Gly Pro Asp Arg Pro Glu
705 710 715 720
Glu Thr Glu Gly Gly Val Gly Glu Gln Asp Arg Gly Arg Ser Val Arg
725 730 735
Leu Val Ser Gly Phe Ser Ala Leu Val Trp Glu Asp Leu Arg Asn Leu
740 745 750
Leu Ile Phe Leu Tyr His Arg Leu Thr Asp Ser Leu Leu Ile Leu Arg
755 760 765
Arg Thr Leu Glu Leu Leu Gly Gln Ser Leu Ser Arg Gly Leu Gln Leu
770 775 780
Leu Asn Glu Leu Arg Thr His Leu Trp Gly Ile Leu Ala Tyr Trp Gly
785 790 795 800
Lys Glu Leu Arg Asp Ser Ala Ile Ser Leu Leu Asn Thr Thr Ala Ile
805 810 815
Val Val Ala Glu Gly Thr Asp Arg Ile Ile Glu Leu Ala Gln Arg Ile
820 825 830
Gly Arg Gly Ile Leu His Ile Pro Arg Arg Ile Arg Gln Gly Leu Glu
835 840 845
Arg Ala LeuIle
850
(2)SEQ ID NO:19的信息:
(i)序列特征:
(A)长度:639个碱基对
(B)类型:核酸
(C)链型:单链
(D)拓扑类型:线性
(ii)分子类型:DNA(基因组)
(ix)特征
(A)名称/关键词:CDS
(B)位置:1..636
(xi)SEQ ID NO:19的序列描述:
ATG GGA AAG ATT TGG TCA AAG AGC AGC CTA GTA GGA TGG CCA GAA ATC 48
Met Gly Lys Ile Trp Ser Lys Ser Ser Leu Val Gly Trp Pro Glu Ile
855 860 865
AGA GAA AGA ATG AGA AGA CAA ACG CAA GAA CCA GCA GTA GAG CCA GCA 96
Arg Glu Arg Met Arg Arg Gln Thr Gln Glu Pro Ala Val Glu Pro Ala
870 875 880
GTA GGA GCA GGA GCA GCT TCT CAA GAT CTA GCT AAT CGA GGG GCC ATC 144
Val Gly Ala Gly Ala Ala Ser Gln Asp Leu Ala Asn Arg Gly Ala Ile
885 890 895 900
ACC ATA AGA AAT ACT AGA GAC AAT AAT GAA AGT ATA GCT TGG CTA GAA 192
Thr Ile Arg Asn Thr Arg Asp Asn Asn Glu Ser Ile Ala Trp Leu Glu
905 910 915
GCA CAA GAA GAA GAA GAG GAA GTA GGC TTT CCA GTA CGC CCT CAG GTA 240
Ala Gln Glu Glu Glu Glu Glu Val Gly Phe Pro Val Arg Pro Gln Val
920 925 930
CCA TTA AGG CCA ATA ACC TAT AAA CAG GCT TTT GAT CTT TCC TTC TTT 288
Pro Leu Arg Pro Ile Thr Tyr Lys Gln Ala Phe Asp Leu Ser Phe Phe
935 940 945
TTA AAA GAT AAG GGG GGA CTG GAA GGG CTA GTT TGG TCC AGA AAA AGG 336
Leu Lys Asp Lys Gly Gly Leu Glu Gly Leu Val Trp Ser Arg Lys Arg
950 955 960
CAA GAT ATT CTA GAC CTC TGG ATG TAT CAC ACA CAA GGC ATC CTC CCT 384
Gln Asp Ile Leu Asp Leu Trp Met Tyr His Thr Gln Gly Ile Leu Pro
965 970 975 980
GAC TGG CAT AAC TAC ACA CCA GGG CCA GGA ATT AGA TAC CCC GTA ACC 432
Asp Trp His Asn Tyr Thr Pro Gly Pro Gly Ile Arg Tyr Pro Val Thr
985 990 995
TTT GGA TGG TGC TTC AAA CTA GTA CCA TTG TCA GCT GAA GAA GTA GAA 480
Phe Gly Trp Cys Phe Lys Leu Val Pro Leu Ser Ala Glu Glu Val Glu
1000 1005 1010
GAG GCT AAT GAA GGA GAC AAC AAT GCC CTC TTA CAC CCC ATA TGT CAA 528
Glu Ala Asn Glu Gly Asp Asn Asn Ala Leu Leu His Pro Ile Cys Gln
1015 1020 1025
CAT GGA GCA GAT GAT GAT CAT AAA GAA GTG TTG GTG TGG CGA TTT GAC 576
His Gly Ala Asp Asp Asp His Lys Glu Val Leu Val Trp Arg Phe Asp
1030 1035 1040
AGC TCC CTA GCA AGA AGA CAT GTA GCA AGA GAG CTG CAT CCG GAG TTT 624
Ser Ser Leu Ala Arg Arg His Val Ala Arg Glu Leu His Pro Glu Phe
1045 1050 1055 1060
TAC AAG AAC TGC TGA 639
Tyr Lys Asn Cys
(2)SEQ ID NO:20的信息:
(i)序列特征:
(A)长度:212个氨基酸
(B)类型:氨基酸
(D)拓扑类型:线性
(ii)分子类型:蛋白质
(xi)SEQ ID NO:20的序列描述:
Met Gly Lys Ile Trp Ser Lys Ser Ser Leu Val Gly Trp Pro Glu Ile
1 5 10 15
Arg Glu Arg Met Arg Arg Gln Thr Gln Glu Pro Ala Val Glu Pro Ala
20 25 30
Val Gly Ala Gly Ala Ala Ser Gln Asp Leu Ala Asn Arg Gly Ala Ile
35 40 45
Thr Ile Arg Asn Thr Arg Asp Asn Asn Glu Ser Ile Ala Trp Leu Glu
50 55 60
Ala Gln Glu Glu Glu Glu Glu Val Gly Phe Pro Val Arg Pro Gln Val
65 70 75 80
Pro Leu Arg Pro Ile Thr Tyr Lys Gln Ala Phe Asp Leu Ser Phe Phe
85 90 95
Leu Lys Asp Lys Gly Gly Leu Glu Gly Leu Val Trp Ser Arg Lys Arg
100 105 110
Gln Asp Ile Leu Asp Leu Trp Met Tyr His Thr Gln Gly Ile Leu Pro
115 120 125
Asp Trp His Asn Tyr Thr Pro Gly Pro Gly Ile Arg Tyr Pro Val Thr
130 135 140
Phe Gly Trp Cys Phe Lys Leu Val Pro Leu Ser Ala Glu Glu Val Glu
145 150 155 160
Glu Ala Asn Glu Gly Asp Asn Asn Ala Leu Leu His Pro Ile Cys Gln
165 170 175
His Gly Ala Asp Asp Asp His Lys Glu Val Leu Val Trp Arg Phe Asp
180 185 190
Ser Ser Leu Ala Arg Arg His Val Ala Arg Glu Leu His Pro Glu Phe
195 200 205
Tyr Lys Asn Cys
210
(2)SEQ ID NO:21的信息:
(i)序列特征:
(A)长度:20个碱基对
(B)类型:核酸
(C)链型:单链
(D)拓扑类型:线性
(ii)分子类型:其它核酸
(A)描述:/desc=“引物”
(xi)SEQ ID NO:21的序列描述:
ATTGCGTACT CACACTTCCG 20
(2)SEQ ID NO:22的信息:
(i)序列特征:
(A)长度:17个碱基对
(B)类型:核酸
(C)链型:单链
(D)拓扑类型:线性
(ii)分子类型:其它核酸
(A)描述:/desc=“引物”
(xi)SEQ ID NO:22的序列描述:
GGCAAGCAGG GAGCTGG 17
(2)SEQ ID NO:23的信息:
(i)序列特征:
(A)长度:18个碱基对
(B)类型:核酸
(C)链型:单链
(D)拓扑类型:线性
(ii)分子类型:其它核酸
(A)描述:/desc=“引物”
(xi)SEQ ID NO:23的序列描述:
TCCTTGAGCA GTCTGGAC 18
(2)SEQ ID NO:24的信息:
(i)序列特征:
(A)长度:18个碱基对
(B)类型:核酸
(C)链型:单链
(D)拓扑类型:线性
(ii)分子类型:其它核酸
(A)描述:/desc=“引物”
(xi)SEQ ID NO:24的序列描述:
GAACAGGAGG ATTAGCAG 18
(2)SEQ ID NO:25的信息:
(i)序列特征:
(A)长度:18个碱基对
(B)类型:核酸
(C)链型:单链
(D)拓扑类型:线性
(ii)分子类型:其它核酸
(A)描述:/desc=“引物”
(xi)SEQ ID NO:25的序列描述:
AGCAGAGGCT ATGTCACA 18
(2)SEQ ID NO:26的信息:
(i)序列特征:
(A)长度:19个碱基对
(B)类型:核酸
(C)链型:单链
(D)拓扑类型:线性
(ii)分子类型:其它核酸
(A)描述:/desc=“引物”
(xi)SEQ ID NO:26的序列描述:
TGTAAGGCCC CTAGAAGAG 19
(2)SEQ ID NO:27的信息:
(i)序列特征:
(A)长度:18个碱基对
(B)类型:核酸
(C)链型:单链
(D)拓扑类型:线性
(ii)分子类型:其它核酸
(A)描述:/desc=“引物”
(xi)SEQ ID NO:27的序列描述:
ACAGAGAACT CTCTGTAC 18
(2)SEQ ID NO:28的信息:
(i)序列特征:
(A)长度:18个碱基对
(B)类型:核酸
(C)链型:单链
(D)拓扑类型:线性
(ii)分子类型:其它核酸
(A)描述:/desc=“引物”
(xi)SEQ ID NO:28的序列描述:
AAGAAAAGCA GTTGGTAC 18
(2)SEQ ID NO:29的信息:
(i)序列特征:
(A)长度:17个碱基对
(B)类型:核酸
(C)链型:单链
(D)拓扑类型:线性
(ii)分子类型:其它核酸
(A)描述:/desc=“引物”
(xi)SEQ ID NO:29的序列描述:
TTTCTTCCCT GTATGTC 17
(2)SEQ ID NO:30的信息:
(i)序列特征:
(A)长度:18个碱基对
(B)类型:核酸
(C)链型:单链
(D)拓扑类型:线性
(ii)分子类型:其它核酸
(A)描述:/desc=“引物”
(xi)SEQ ID NO:30的序列描述:
GTTATATGGA TTCTCAGG 18
(2)SEQ ID NO:31的信息:
(i)序列特征:
(A)长度:19个碱基对
(B)类型:核酸
(C)链型:单链
(D)拓扑类型:线性
(ii)分子类型:其它核酸
(A)描述:/desc=“引物”
(xi)SEQ ID NO:31的序列描述:
TGGCAGCACA TTATACTGG 19
(2)SEQ ID NO:32的信息:
(i)序列特征:
(A)长度:23个碱基对
(B)类型:核酸
(C)链型:单链
(D)拓扑类型:线性
(ii)分子类型:其它核酸
(A)描述:/desc=“引物”
(xi)SEQ ID NO:32的序列描述:
ATCATTTACC AGTACATGGA CGA 23
(2)SEQ ID NO:33的信息:
(i)序列特征:
(A)长度:18个碱基对
(B)类型:核酸
(C)链型:单链
(D)拓扑类型:线性
(ii)分子类型:其它核酸
(A)描述:/desc=“引物”
(xi)SEQ ID NO:33的序列描述:
TGTCAGGGGT CGTAAAGC 18
(2)SEQ ID NO:34的信息:
(i)序列特征:
(A)长度:18个碱基对
(B)类型:核酸
(C)链型:单链
(D)拓扑类型:线性
(ii)分子类型:其它核酸
(A)描述:/desc=“引物”
(xi)SEQ ID NO:34的序列描述:
TCCTCTGGAT GGGATATG 18
(2)SEQ ID NO:35的信息:
(i)序列特征:
(A)长度:18个碱基对
(B)类型:核酸
(C)链型:单链
(D)拓扑类型:线性
(ii)分子类型:其它核酸
(A)描述:/desc=“引物”
(xi)SEQ ID NO:35的序列描述:
TCTATCCAGG AATCAGAG 18
(2)SEQ ID NO:36的信息:
(i)序列特征:
(A)长度:18个碱基对
(B)类型:核酸
(C)链型:单链
(D)拓扑类型:线性
(ii)分子类型:其它核酸
(A)描述:/desc=“引物”
(xi)SEQ ID NO:36的序列描述:
AATGAGATCT GCCCATAC 18
(2)SEQ ID NO:37的信息:
(i)序列特征:
(A)长度:18个碱基对
(B)类型:核酸
(C)链型:单链
(D)拓扑类型:线性
(ii)分子类型:其它核酸
(A)描述:/desc=“引物”
(xi)SEQ ID NO:37的序列描述:
TGACAGATAG GGGAAGAC 18
(2)SEQ ID NO:38的信息:
(i)序列特征:
(A)长度:18个碱基对
(B)类型:核酸
(C)链型:单链
(D)拓扑类型:线性
(ii)分子类型:其它核酸
(A)描述:/desc=“引物”
(xi)SEQ ID NO:38的序列描述:
AACCGCCATT TGCACTGC 18
(2)SEQ ID NO:39的信息:
(i)序列特征:
(A)长度:18个碱基对
(B)类型:核酸
(C)链型:单链
(D)拓扑类型:线性
(ii)分子类型:其它核酸
(A)描述:/desc=“引物”
(xi)SEQ ID NO:39的序列描述:
ACATGGACCG CCACAAGG 18
(2)SEQ ID NO:40的信息:
(i)序列特征:
(A)长度:18个碱基对
(B)类型:核酸
(C)链型:单链
(D)拓扑类型:线性
(ii)分子类型:其它核酸
(A)描述:/desc=“引物”
(xi)SEQ ID NO:40的序列描述:
AGCAACAGAC ATACAGAC 18
(2)SEQ ID NO:41的信息:
(i)序列特征:
(A)长度:18个碱基对
(B)类型:核酸
(C)链型:单链
(D)拓扑类型:线性
(ii)分子类型:其它核酸
(A)描述:/desc=“引物”
(xi)SEQ ID NO:41的序列描述:
AAAGTAGTCC CACGTAGG 18
(2)SEQ ID NO:42的信息:
(i)序列特征:
(A)长度:18个碱基对
(B)类型:核酸
(C)链型:单链
(D)拓扑类型:线性
(ii)分子类型:其它核酸
(A)描述:/desc=“引物”
(xi)SEQ ID NO:42的序列描述:
ATATCCCAGT AGGTCAGG 18
(2)SEQ ID NO:43的信息:
(i)序列特征:
(A)长度:18个碱基对
(B)类型:核酸
(C)链型:单链
(D)拓扑类型:线性
(ii)分子类型:其它核酸
(A)描述:/desc=“引物”
(xi)SEQ ID NO:43的序列描述:
TCTAGCACTA ACAGCCTG 18
(2)SEQ ID NO:44的信息:
(i)序列特征:
(A)长度:18个碱基对
(B)类型:核酸
(C)链型:单链
(D)拓扑类型:线性
(ii)分子类型:其它核酸
(A)描述:/desc=“引物”
(xi)SEQ ID NO:44的序列描述:
ACTCTTACTG CTCTGAGG 18
(2)SEQ ID NO:45的信息:
(i)序列特征:
(A)长度:18个碱基对
(B)类型:核酸
(C)链型:单链
(D)拓扑类型:线性
(ii)分子类型:其它核酸
(A)描述:/desc=“引物”
(xi)SEQ ID NO:45的序列描述:
CCATAGTACA CTGTTACC 18
(2)SEQ ID NO:46的信息:
(i)序列特征:
(A)长度:20个碱基对
(B)类型:核酸
(C)链型:单链
(D)拓扑类型:线性
(ii)分子类型:其它核酸
(A)描述:/desc=“引物”
(xi)SEQ ID NO:46的序列描述:
CATAGCTATC GTTACAAAGC 20
(2)SEQ ID NO:47的信息:
(i)序列特征:
(A)长度:18个碱基对
(B)类型:核酸
(C)链型:单链
(D)拓扑类型:线性
(ii)分子类型:其它核酸
(A)描述:/desc=“引物”
(xi)SEQ ID NO:47的序列描述:
TCATAATGGC AAAGCCTG 18
(2)SEQ ID NO:48的信息:
(i)序列特征:
(A)长度:18个碱基对
(B)类型:核酸
(C)链型:单链
(D)拓扑类型:线性
(ii)分子类型:其它核酸
(A)描述:/desc=“引物”
(xi)SEQ ID NO:48的序列描述:
CTATTCCACA TTGGTTCC 18
(2)SEQ ID NO:49的信息:
(i)序列特征:
(A)长度:18个碱基对
(B)类型:核酸
(C)链型:单链
(D)拓扑类型:线性
(ii)分子类型:其它核酸
(A)描述:/desc=“引物”
(xi)SEQ ID NO:49的序列描述:
ATTCTAGAAC CAGTCCAG 18
(2)SEQ ID NO:50的信息:
(i)序列特征:
(A)长度:20个碱基对
(B)类型:核酸
(C)链型:单链
(D)拓扑类型:线性
(ii)分子类型:其它核酸
(A)描述:/desc=“引物”
(xi)SEQ ID NO:50的序列描述:
CCTTAGGGAT CAGCAAATCC 20
(2)SEQ ID NO:51的信息:
(i)序列特征:
(A)长度:18个碱基对
(B)类型:核酸
(C)链型:单链
(D)拓扑类型:线性
(ii)分子类型:其它核酸
(A)描述:/desc=“引物”
(xi)SEQ ID NO:51的序列描述:
TGGGACAGTC TGTGGAGC 18
(2)SEQ ID NO:52的信息:
(i)序列特征:
(A)长度:18个碱基对
(B)类型:核酸
(C)链型:单链
(D)拓扑类型:线性
(ii)分子类型:其它核酸
(A)描述:/desc=“引物”
(xi)SEQ ID NO:52的序列描述:
TTCTCAGCTC TTGTCTGG 18
(2)SEQ ID NO:53的信息:
(i)序列特征:
(A)长度:18个碱基对
(B)类型:核酸
(C)链型:单链
(D)拓扑类型:线性
(ii)分子类型:其它核酸
(A)描述:/desc=“引物”
(xi)SEQ ID NO:53的序列描述:
ATTAAGCAAG CTGATAGC 18
(2)SEQ ID NO:54的信息:
(i)序列特征:
(A)长度:16个碱基对
(B)类型:核酸
(C)链型:单链
(D)拓扑类型:线性
(ii)分子类型:其它核酸
(A)描述:/desc=“引物”
(xi)SEQ ID NO:54的序列描述:
TGTGCTTCTA GCCAAG 16
(2)SEQ ID NO:55的信息:
(i)序列特征:
(A)长度:18个碱基对
(B)类型:核酸
(C)链型:单链
(D)拓扑类型:线性
(ii)分子类型:其它核酸
(A)描述:/desc=“引物”
(xi)SEQ ID NO:55的序列描述:
GCTCCATGTT GACATATG 18
(2)SEQ ID NO:56的信息:
(i)序列特征:
(A)长度:18个碱基对
(B)类型:核酸
(C)链型:单链
(D)拓扑类型:线性
(ii)分子类型:其它核酸
(A)描述:/desc=“引物”
(xi)SEQ ID NO:56的序列描述:
AGAGAGACCC AGTACAAG 18
(2)SEQ ID NO:57的信息:
(i)序列特征:
(A)长度:20个碱基对
(B)类型:核酸
(C)链型:单链
(D)拓扑类型:线性
(ii)分子类型:其它核酸
(A)描述:/desc=“引物”
(xi)SEQ ID NO:57的序列描述:
ATAAAAGCAG CCGCTTCTCG 20
(2)SEQ ID NO:58的信息:
(i)序列特征:
(A)长度:35个氨基酸
(B)类型:氨基酸
(C)链型:单链
(D)拓扑类型:线性
(ii)分子类型:肽
(xi)SEQ ID NO:58的序列描述:
Cys Thr Arg Pro Gly Asn Asn Thr Gly Gly Gln Val Gln Ile Gly Pro
1 5 10 15
Ala Met Thr Phe Tyr Asn Ile Glu Lys Ile Val Gly Asp Ile Arg Gln
20 25 30
Ala Tyr Cys
35
(2)SEQ ID NO:59的信息:
(i)序列特征:
(A)长度:35个氨基酸
(B)类型:氨基酸
(C)链型:单链
(D)拓扑类型:线性
(ii)分子类型:肽
(xi)SEQ ID NO:59的序列描述:
Cys His Arg Pro Gly Asn Asn Thr Arg Gly Glu Val Gln Ile Gly Pro
1 5 10 15
Gly Met Thr Phe Tyr Asn Ile Glu Asn Val Tyr Gly Asp Thr Arg Ser
20 25 30
Ala Tyr Cys
35
(2)SEQ ID NO:60的信息:
(i)序列特征:
(A)长度:35个氨基酸
(B)类型:氨基酸
(C)链型:单链
(D)拓扑类型:线性
(ii)分子类型:肽
(xi)SEQ ID NO:60的序列描述:
Cys Ile Arg Pro Gly Asn Arg Thr Tyr Arg Asn Leu Gln Ile Gly Pro
1 5 10 15
Gly Met Thr Phe Tyr Asn Val Glu Ile Ala Thr Gly Asp Ile Arg Lys
20 25 30
Ala Phe Cys
35
(2)SEQ ID NO:61的信息:
(i)序列特征:
(A)长度:35个氨基酸
(B)类型:氨基酸
(C)链型:单链
(D)拓扑类型:线性
(ii)分子类型:肽
(xi)SEQ ID NO:61的序列描述:
Cys Thr Arg Pro Asn Asn Asn Thr Arg Lys Ser Val Arg Ile Gly Pro
1 5 10 15
Gly Gln Ala Phe Tyr Ala Thr Gly Asp Ile Ile Gly Asp Ile Arg Gln
20 25 30
Ala His Cys
35
Claims (2)
1.非-M、非-O HIV-1毒株,这种逆转录病毒于1996年7月2日保藏在国立微生物保藏中心,保藏号为I-1753,命名为YBF30,由巴斯德研究所保存。
2.分离的核酸序列,其特征在于为权利要求1所述毒株的完整核苷酸序列且特征在于其由序列SEQ ID NO:1定义。
Applications Claiming Priority (3)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
FR96/15087 | 1996-12-09 | ||
FR9615087A FR2756843B1 (fr) | 1996-12-09 | 1996-12-09 | Souches de vih-1 non-m non-o, fragments et applications |
PCT/FR1997/002227 WO1998026075A1 (fr) | 1996-12-09 | 1997-12-08 | Souches de vih-1 non-m non-o, fragments et applications |
Publications (2)
Publication Number | Publication Date |
---|---|
CN1244896A CN1244896A (zh) | 2000-02-16 |
CN1244896B true CN1244896B (zh) | 2011-06-22 |
Family
ID=9498464
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN971813809A Expired - Lifetime CN1244896B (zh) | 1996-12-09 | 1997-12-08 | 非-m、非-o hiv-1毒株、片段及应用 |
Country Status (14)
Country | Link |
---|---|
US (4) | US6509018B1 (zh) |
EP (1) | EP0946731B1 (zh) |
JP (1) | JP2000515768A (zh) |
CN (1) | CN1244896B (zh) |
AT (1) | ATE422544T1 (zh) |
AU (1) | AU737857B2 (zh) |
BR (1) | BRPI9713891B8 (zh) |
CA (1) | CA2274490C (zh) |
DE (1) | DE69739251D1 (zh) |
DK (1) | DK0946731T3 (zh) |
ES (1) | ES2323604T3 (zh) |
FR (1) | FR2756843B1 (zh) |
PT (1) | PT946731E (zh) |
WO (1) | WO1998026075A1 (zh) |
Families Citing this family (15)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US7935805B1 (en) * | 1998-12-31 | 2011-05-03 | Novartis Vaccines & Diagnostics, Inc | Polynucleotides encoding antigenic HIV Type C polypeptides, polypeptides and uses thereof |
EP1422298B1 (en) * | 1999-07-09 | 2010-01-27 | Gen-Probe Incorporated | Detection of hiv-1 by nucleic acid amplification |
DE19936003A1 (de) * | 1999-08-03 | 2001-02-15 | Dade Behring Marburg Gmbh | Lentivirus aus der Gruppe der Immunschwächeviren der Drillaffen(mandrillus leucophaeus) und ihre Verwendung |
WO2004092724A2 (en) * | 2003-04-11 | 2004-10-28 | The Government Of The United States Of America As Represented By The Secretary, Department Of Health | Multiple antigenic peptide assay for detection of hiv or siv type retroviruses |
EP1778282B1 (en) * | 2004-08-17 | 2016-03-30 | Institut Gustave Roussy | Mutated hiv nef for modulating immunity |
WO2007084568A2 (en) | 2006-01-17 | 2007-07-26 | Health Research, Inc. | Heteroduplex tracking assay |
RU2355763C2 (ru) * | 2006-09-13 | 2009-05-20 | Закрытое акционерное общество "Научно-исследовательский институт Аджиномото-Генетика" (ЗАО АГРИ) | Мутантная ацетолактатсинтаза и способ продукции разветвленных l-аминокислот |
EP2121984A2 (en) * | 2007-03-16 | 2009-11-25 | 454 Life Sciences Corporation | System and method for detection of hiv drug resistant variants |
US20090258342A1 (en) * | 2008-04-09 | 2009-10-15 | Intelligent Medical Devices, Inc. | Optimized probes and primers and methods of using same for the detection, quantification and grouping of hiv-1 |
US9650668B2 (en) | 2008-09-03 | 2017-05-16 | Nabsys 2.0 Llc | Use of longitudinally displaced nanoscale electrodes for voltage sensing of biomolecules and other analytes in fluidic channels |
US8262879B2 (en) | 2008-09-03 | 2012-09-11 | Nabsys, Inc. | Devices and methods for determining the length of biopolymers and distances between probes bound thereto |
US8859201B2 (en) | 2010-11-16 | 2014-10-14 | Nabsys, Inc. | Methods for sequencing a biomolecule by detecting relative positions of hybridized probes |
WO2012109574A2 (en) | 2011-02-11 | 2012-08-16 | Nabsys, Inc. | Assay methods using dna binding proteins |
US9914966B1 (en) | 2012-12-20 | 2018-03-13 | Nabsys 2.0 Llc | Apparatus and methods for analysis of biomolecules using high frequency alternating current excitation |
WO2014113557A1 (en) | 2013-01-18 | 2014-07-24 | Nabsys, Inc. | Enhanced probe binding |
Citations (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO1986002383A1 (en) * | 1984-10-18 | 1986-04-24 | Institut Pasteur | Envelope antigens of lymphadenopathy associated virus and their applications |
Family Cites Families (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JP2767078B2 (ja) * | 1990-10-17 | 1998-06-18 | アメリカ合衆国 | Hiv―1の分子クローンとその使用法 |
FR2707170A1 (fr) * | 1993-06-04 | 1995-01-13 | Pasteur Institut | Expression des récepteurs CD4 et CD26 dans des cellules recombinantes, inhibiteurs du récepteur CD26. |
US5945301A (en) * | 1995-09-29 | 1999-08-31 | Chugai Seiyaku Kabushiki Kaisha | Kinase in TGF-β family signal transduction system |
US5994123A (en) * | 1996-08-09 | 1999-11-30 | Regents Of University Of Minnesota | Sugarcane bacilliform virus promoter |
-
1996
- 1996-12-09 FR FR9615087A patent/FR2756843B1/fr not_active Expired - Lifetime
-
1997
- 1997-12-08 EP EP97950245A patent/EP0946731B1/fr not_active Expired - Lifetime
- 1997-12-08 DE DE69739251T patent/DE69739251D1/de not_active Expired - Lifetime
- 1997-12-08 ES ES97950245T patent/ES2323604T3/es not_active Expired - Lifetime
- 1997-12-08 CN CN971813809A patent/CN1244896B/zh not_active Expired - Lifetime
- 1997-12-08 BR BRPI9713891A patent/BRPI9713891B8/pt not_active IP Right Cessation
- 1997-12-08 AU AU53272/98A patent/AU737857B2/en not_active Expired
- 1997-12-08 US US09/319,588 patent/US6509018B1/en not_active Expired - Lifetime
- 1997-12-08 AT AT97950245T patent/ATE422544T1/de active
- 1997-12-08 JP JP10526284A patent/JP2000515768A/ja not_active Ceased
- 1997-12-08 DK DK97950245T patent/DK0946731T3/da active
- 1997-12-08 PT PT97950245T patent/PT946731E/pt unknown
- 1997-12-08 WO PCT/FR1997/002227 patent/WO1998026075A1/fr active IP Right Grant
- 1997-12-08 CA CA2274490A patent/CA2274490C/fr not_active Expired - Lifetime
-
2002
- 2002-11-22 US US10/301,661 patent/US7030234B2/en not_active Expired - Lifetime
-
2005
- 2005-09-15 US US11/226,240 patent/US7534877B2/en not_active Expired - Fee Related
-
2009
- 2009-04-24 US US12/385,973 patent/US8227185B2/en not_active Expired - Fee Related
Patent Citations (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO1986002383A1 (en) * | 1984-10-18 | 1986-04-24 | Institut Pasteur | Envelope antigens of lymphadenopathy associated virus and their applications |
Also Published As
Publication number | Publication date |
---|---|
FR2756843A1 (fr) | 1998-06-12 |
BRPI9713891B8 (pt) | 2021-07-06 |
AU5327298A (en) | 1998-07-03 |
US20070190524A1 (en) | 2007-08-16 |
DK0946731T3 (da) | 2009-06-22 |
AU737857B2 (en) | 2001-08-30 |
BRPI9713891A (pt) | 2000-02-29 |
CA2274490A1 (fr) | 1998-06-18 |
WO1998026075A1 (fr) | 1998-06-18 |
US7030234B2 (en) | 2006-04-18 |
US20030157660A1 (en) | 2003-08-21 |
FR2756843B1 (fr) | 1999-01-22 |
US7534877B2 (en) | 2009-05-19 |
EP0946731A1 (fr) | 1999-10-06 |
US6509018B1 (en) | 2003-01-21 |
US20100184014A1 (en) | 2010-07-22 |
DE69739251D1 (de) | 2009-03-26 |
JP2000515768A (ja) | 2000-11-28 |
US8227185B2 (en) | 2012-07-24 |
PT946731E (pt) | 2009-05-14 |
CN1244896A (zh) | 2000-02-16 |
CA2274490C (fr) | 2013-02-12 |
ES2323604T3 (es) | 2009-07-21 |
BRPI9713891B1 (pt) | 2015-08-18 |
EP0946731B1 (fr) | 2009-02-11 |
ATE422544T1 (de) | 2009-02-15 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN1244896B (zh) | 非-m、非-o hiv-1毒株、片段及应用 | |
Gelderblom et al. | Morphogenesis and morphology of HIV structure-function relations | |
Park et al. | The nonmyristylated Pr160gag-pol polyprotein of human immunodeficiency virus type 1 interacts with Pr55gag and is incorporated into viruslike particles | |
JP4029107B2 (ja) | Hiv−グループの免疫不全ウイルスに対する抗体を検出するキット及び方法 | |
Willey et al. | Increase in soluble CD4 binding to and CD4-induced dissociation of gp120 from virions correlates with infectivity of human immunodeficiency virus type 1 | |
AU621031B2 (en) | Variants of lav viruses, their dna- and protein-components and their uses, particularly for diagnostic purposes and for the preparation of immunogenic compositions | |
CN1651457B (zh) | Hiv-1 o组(或亚组)逆转录病毒抗原的核苷酸序列 | |
US7029679B2 (en) | Variant of LAV viruses | |
US6037165A (en) | Methods for the preparation of human immunodeficiency virus type 2 (HIV-2) and antigens encoped thereby | |
EHRLICH et al. | Expression in Escherichia coli and purification of human immunodeficiency virus type 1 capsid protein (p24) | |
US6511801B1 (en) | HIV-1 group O antigens and uses thereof | |
US5824482A (en) | Purification, cloning, and characterization of a novel human immunodeficiency virus LAVMAL | |
Gregersen et al. | Detection of human immunodeficiency virus and other retroviruses in cell culture supernatants by a reverse transcriptase microassay | |
Buonaguro et al. | Heteroduplex mobility assay and phylogenetic analysis of V3 region sequences of human immunodeficiency virus type 1 isolates from Gulu, northern Uganda. The Italian-Ugandan Cooperation AIDS Program | |
Manca et al. | Human T helper cells specific for HIV reverse transcriptase: possible role in intrastructural help for HIV envelope‐specific antibodies | |
MXPA99005320A (en) | Non-m non-o hiv strains, fragments and applications | |
Park et al. | Targeting a foreign protein into virion particles by fusion with the Vpx protein of simian immunodeficiency virus | |
Gorse et al. | Recombinant gp160 vaccination schedule and MHC HLA type as factors influencing cellular responses to HIV-1 envelope glycoprotein | |
DK176153B1 (da) | Protein- eller glycoproteinfragment | |
CA1340875C (en) | Variants of lav viruses their dna- and protein- components and their uses, particularly for diagnostic purposes and for the preparation of immunogenic compositions | |
AU780017B2 (en) | Nucleotide sequences of HIV-1 type (or subtype) O retrovirus antigens |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
C10 | Entry into substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
C14 | Grant of patent or utility model | ||
GR01 | Patent grant | ||
CX01 | Expiry of patent term | ||
CX01 | Expiry of patent term |
Granted publication date: 20110622 |