AU668355C - HCV genomic sequences for diagnostics and therapeutics - Google Patents

HCV genomic sequences for diagnostics and therapeutics

Info

Publication number
AU668355C
AU668355C AU21558/92A AU2155892A AU668355C AU 668355 C AU668355 C AU 668355C AU 21558/92 A AU21558/92 A AU 21558/92A AU 2155892 A AU2155892 A AU 2155892A AU 668355 C AU668355 C AU 668355C
Authority
AU
Australia
Prior art keywords
εequence
region
nucleic acid
hcv
sequence
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Expired
Application number
AU21558/92A
Other versions
AU2155892A (en
AU668355B2 (en
Inventor
Eileen Beall
Tai-An Cha
Bruce Irvine
Janice Kolberg
Michael S. Urdea
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Novartis Vaccines and Diagnostics Inc
Original Assignee
Novartis Vaccines and Diagnostics Inc
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Novartis Vaccines and Diagnostics Inc filed Critical Novartis Vaccines and Diagnostics Inc
Priority claimed from PCT/US1992/004036 external-priority patent/WO1992019743A2/en
Publication of AU2155892A publication Critical patent/AU2155892A/en
Publication of AU668355B2 publication Critical patent/AU668355B2/en
Application granted granted Critical
Publication of AU668355C publication Critical patent/AU668355C/en
Assigned to NOVARTIS VACCINES AND DIAGNOSTICS, INC. reassignment NOVARTIS VACCINES AND DIAGNOSTICS, INC. Request to Amend Deed and Register Assignors: CHIRON CORPORATION
Anticipated expiration legal-status Critical
Expired legal-status Critical Current

Links

Description

HCV GENOMIC SEQUENCES FOR DIAGNOSTICS AND THERAPEUTICS
This application is a continuation-in-part of U.S. Serial No. 07/697,326 entitled "Polynucleotide Probes Useful for Screening for Hepatitis C Virus, filed May 8, 1991.
Technical Field The invention relates to compositions and methods for the detection and treatment of hepatitis C virus, (HCV) infection, formerly referred to as blood-borne non-A, non-B hepatitis virus (NANBV) infection. More εpecifically, embodiments of the present invention feature compositions and methods for the detection of HCV, and for the development of vaccines for the prophylactic treatment of infections of HCV, and development of antibody products for conveying passive immunity to HCV.
Background of the Invention
The prototype isolate of HCV was characterized in U.S. Patent Application Serial No. 122,714 (See also EPO Publication No. 318,216}. As used herein, the term "HCV" includes new isolates of the same viral species. The term "HCV-1" referred to in U.S. Patent Application Serial No. 122,714.
SUBSTITUTE SHEET HCV is a transmissible disease distinguishable from other forms of viral-associated liver diseases, including that caused by the known hepatitis viruses, i.e., hepatitis A virus (HAV), hepatitis B virus (HBV), and delta hepatitis virus (HDV), as well aε the hepatitis induced by cytomegalovirus (CMV) or Epstein-Barr virus (EBV) . HCV was firεt identified in individuals who had received blood transfuεions.
The demand for sensitive, specific methods for screening and identifying carriers of HCV and HCV contaminated blood or blood products is significant. Post-transfusion hepatitis (PTH) occurs in approximately 10% of transfused patients, and HCV accounts for up to 90% of these cases. The diseaεe frequently progresses to chronic liver damage (25-55%). Patient care as well as the prevention of transmission of HCV by blood and blood products or by close personal contact require reliable screening, diagnostic and prognostic tools to detect nucleic acids, antigens and antibodies related to HCV.
Information in this application suggests the HCV has several genotypes. That iε, the genetic information of the HCV viruε may not be totally identical for all HCV, but encompasses groups with differing genetic information.
Genetic information is stored in thread-like molecules of DNA and RNA. DNA consistε of covalently
SUBSTITUTE SHEET linked chains of deoxyribonucleotides and RNA consists of covalently linked chains of ribonucleotides. Each nucleotide is characterized by one of four bases: adenine (A), guanine (G), thy ine (T), and cytosine (C) . The bases are complementary in the sense that, due to the orientation of functional groups, certain base pairs attract and bond to each other through hydrogen bonding and ir-stacking interactions. Adenine in one strand of DNA pairs with thymine in an opposing complementary strand. Guanine in one strand of DNA pairs with cytosine in an opposing complementary strand. In RNA, the thymine base is replaced by uracil (U) which pairs with adenine in an opposing complementary strand. The genetic code of living organism iε carried in the sequence of base pairs. Living cells interpret, transcribe and translate the information of nucleic acid to make proteins and peptides.
The HCV genome is comprised of a single positive strand of RNA. The HCV genome possesses a continuous, translational open reading frame (ORF) that encodes a polyprotein of about 3,000 amino acids. In the ORF, the structural protein(s) appear to be encoded in approximately the first quarter of the N-terminus region, with the majority of the polyprotein responsible for non-structural proteins.
SUBSTITUTE SHEET The HCV polyprotein compriεes, from the amino terminus to the carboxy terminus, the nucleocapsid protein (C), the envelope protein (E), and the non-structural proteinε (NS) 1, 2 (b), 3, 4 (b), and 5. HCV of differing genotypes may encode for proteins which present an altered responεe to host immune εyεtemε. HCV of differing genotypeε may be difficult to detect by immuno diagnostic techniques and nucleic acid probe techniques which are not εpecifically directed to εuch genotype.
Definitions for εelected terms used in the application are set forth below to facilitate an understanding of the invention. The term "corresponding" means homologous to or complementary to a particular sequence of nucleic acid. As between nucleic acids and peptides, corresponding refers to amino acids of a peptide in an order derived from the sequence of a nucleic acid or its complement.
The term "non-naturally occurring nucleic acid" refers to a portion of genomic nucleic acid, cDNA, semisynthetic nucleic acid, or synthetic origin nucleic acid which, by virtue of its origin or manipulation: (1) is not associated with all of a nucleic acid with which it is associated in nature, (2) is linked to a nucleic acid or other chemical agent other than that to
SUBSTITUTE SHEET which it iε linked in nature, or (3) does not occur in nature.
Similarly the term, "a non-naturally occurring peptide" refers to a portion of a large naturally occurring peptide or protein, or εemi-εynthetic or εynthetic peptide, which by virtue of itε origin or manipulation (1) iε not associated with all of a peptide with which it is associated in nature, (2) is linked to peptides, functional groups or chemical agents other than that to which it is linked in nature, or (3) does not occur in nature.
The term "primer" refers to a nucleic acid which is capable of initiating the synthesis of a larger nucleic acid when placed under appropriate conditions. The primer will be completely or substantially complementary to a region of the nucleic acid to be copied. Thus, under conditions conducive to hybridization, the primer will anneal to a complementary region of a larger nucleic acid. Upon addition of suitable reactantε, the primer is extended by the polymerizing agent to form a copy of the larger nucleic acid.
The term "binding pair" refers to any pair of molecules which exhibit mutual affinity or binding capacity. For the purposes of the present application, the term "ligand" will refer to one molecule of the » binding pair, and the term "antiligand" or "receptor"
SUBSTITUTE SHEET or "target" will refer to the opposite molecule of the binding pair. For example, with respect to nucleic acids, a binding pair may comprise two complementary nucleic acids. One of the nucleic acids may be designated the ligand and the other strand iε designated the antiligand receptor or target. The designation of ligand or antiligand is a matter of arbitrary convenience. Other binding pairs comprise, by way of example, antigens and antibodies, drugε and drug receptor εiteε and enzymes and enzyme substrates, to name a few.
The term "label" referε to a molecular moiety capable of detection including, by way of example, without limitation, radioactive isotopes, enzymes, luminescent agents, precipitating agents, and dyes.
The term "support" includes conventional εupportε such aε filters and membranes aε well aε retrievable supports which can be substantially disperεed within a medium and removed or εeparated from the medium by immobilization, filtering, partitioning, or the like. The term "εupport meanε" referε to supports capable of being associated to nucleic acids, peptideε or antibodieε by binding partnerε, or covalent or noncovalent linkageε. A number of HCV strains and isolates have been identified. When compared with the εequence of the original iεolate derived from the USA ("HCV-l"; εee
SUBSTITUTE SHEET Q.-L. Choo et al. (1989) Science 244:359-362, Q.-L. Choo et al. (1990) Brit. Med. Bull. £6:423-441, Q--L- Choo et al., Proc. Natl. Acad. Sci. 88:2451-2455 (1991), and E.P.O. Patent Publication No. 318,216, cited supra), it was found that a Japanese isolate ("HCV Jl") differed significantly in both nucleotide and polypeptide sequence within the NS3 and NS4 regions. This conclusion was later extended to the NS5 and envelope (El/S and E2/NS1) regions (see K. Takeuchi et al., J. Gen. Virol. (1990) 71:3027-3033, Y. Kubo, Nucl. Acidε. Res. (1989) 1/7:10367-10372, and K. Takeuchi et al.. Gene (1990) .91:287-291). The former group of isolates, originally identified in the United States, iε termed "Genotype I" throughout the preεent diεcloεure, while the latter group of iεolateε, initially identified in Japan, iε termed "Genotype II" herein.
Brief Description of the Invention The present invention features compositions of matter comprising nucleic acids and peptideε corresponding to the HCV viral genome which define different genotypes. The present invention also features methods of uεing the compositions corresponding to sequences of the HCV viral genome which define different genotypeε described herein.
SUBSTITUTE SHEET A. Nucleic acid compoεitionε The nucleic acid of the present invention, correεponding to the HCV viral genome which define different genotypes, have utility as probes in nucleic acid hybridization asεayε, aε primers for reactionε involving the synthesis of nucleic acid, as binding partners for separating HCV viral nucleic acid from other constituents which may be present, and aε anti-εenεe nucleic acid for preventing the transcription or translation of viral nucleic acid.
One embodiment of the present invention features a compoεition compriεing a non-naturally occurring nucleic acid having a nucleic acid εequence of at leaεt eight nucleotideε corresponding to a non-HCV-l nucleotide sequence of the hepatitis C viral genome. Preferably, the nucleotide εequence iε εelected from a εequence preεent in at leaεt one region conεiεting of the NS5 region, envelope 1 region, 5'UT region, and the core region. Preferably, with reεpect to sequences which correspond to the NS5 region, the sequence is selected from a sequence within a sequence numbered 2-22. The εequence numbered 1 corresponds to HCV-1. Sequences numbered 1-22 are defined in the Sequence Listing of the application.
Preferably, with respect to εequences corresponding to the envelope 1 region, the sequence iε
SUBSTITUTE SHEET εelected from a sequence within sequences numbered 24-32. Sequence No. 23 correspondε to HCV- . Sequences numbered 23-32 are set forth in the Sequence Listing of the application. Preferably, with respect to the sequences which correspond to the 5'UT regions, the εequence is εelected from a εequence within sequences numbered 34-51. Sequence No. 33 corresponds to HCV-1. Sequence No. 33-51 are set forth in the Sequence Listing of this application.
Preferably, with respect to the sequenceε which correspond to the core region, the sequence is εelected from a sequence within the sequenceε numbered 53-66. Sequence No. 52 corresponds to HCV-1. Sequences 52-66 are set forth in the Sequence Liεting of this application.
The compositionε of the present invention form hybridization products with nucleic acid corresponding to different genotypes of HCV. HCV has at least five genotypes, which will be referred to in this application by the designations GI-GV. The first genotype, GI, iε exemplified by εequenceε numbered 1-6, 23-25, 33-38 and 52-57. The second genotype, Gil, is exemplified by the sequenceε numbered 7-12, 26-28, 39-45 and 58-64. The third genotype, GUI, iε exemplified by sequences numbered 13-17, 32, 46-47 and 65-66. The fourth genotype, GIV,
SUBSTITUTE SHEET iε exemplified by sequences numbered 20-22, and 29-31 and 48-49. The fifth genotype, GV, is exemplified by εequenceε numbered 18, 19, 50 and 51.
One embodiment of the preεent invention features compositionε comprising a nucleic acid having a εequence correεponding to one or more sequenceε which exemplify a genotype of HCV.
B. Method of forming a Hybridization Product Embodiments of the present invention also feature a method of forming a hybridization product with nucleic acid having a sequence corresponding to HCV nucleic acid. One method compriεeε the εtepε of placing a non-naturally occurring nucleic acid having a non-HCV-l εequence correεponding to HCV nucleic acid under conditionε in which hybridization may occur. The non-naturally occurring nucleic acid is capable of forming a hybridization product with HCV nucleic acid, under hybridization conditions. The method further comprises the step of imposing hybridization conditions to form a hybridization product in the presence of nucleic acid corresponding to a region of the HCV genome.
The formation of a hybridization product haε utility for detecting the presence of one or more genotypes of HCV. Preferably, the non-naturally occurring nucleic acid forms a hybridization product
SUBSTITUTE SHEET with nucleic acid of HCV in one or more regionε compriεing the NS5 region, envelope 1 region, 5'UT region and the core region. To detect the hybridization product, it is useful to associate the non-naturally occurring nucleic acid with a label. The formation of the hybridization product is detected by separating the hybridization product from labeled non-naturally occurring nucleic acid, which has not formed a hybridization product. The formation of a hybridization product haε utility aε a meanε of εeparating one or more genotypes of HCV nucleic acid from other constituents potentially present. For such applicationε, it iε useful to associate the non-naturally occurring nucleic acid with a support for εeparating the reεultant hybridization product from the the other conεtituents.
Nucleic acid "sandwich assays" employ one nucleic acid associated with a label and a εecond nucleic acid associated with a support. An embodiment of the present invention features a sandwich assay comprising two nucleic acids, both have sequenceε which correspond to HCV nucleic acids; however, at least one non-naturally occurring nucleic acid has a sequence corresponding to non-HCV-1 HCV nucleic acid. At least one nucleic acid is capable of asεociating with a label, and the other iε capable of associating with a support. The support associated non-naturally
SUBSTITUTE SHEET occurring nucleic acid is uεed to εeparate the hybridization productε which include an HCV nucleic acid and the non-naturally occurring nucleic acid having a non-HCV-1 εequence. One embodiment of the preεent invention features a method of detecting one or more genotypes of HCV. The method compriseε the εtepε of placing a non-naturally occurring nucleic acid under conditions which hybridization may occur. The non-naturally occurring nucleic acid is capable of forming a hybridization product with nucleic acid from one or more genotypes of HCV. The firεt genotype, GI, iε exemplified by εequenceε numbered 1-6, 23-25, 33-38 and 52-57. The εecond genotype, GII, iε exemplified by the εequenceε numbered 7-12, 26-28, 39-45 and 58-64. The third genotype, GUI, is exemplified by εequenceε numbered 13-17, 32, 46-47 and 65-66. The fourth genotype, GIV, iε exemplified εequenceε numbered 20-22 and 29-31. The fifth genotype, GV, iε exemplified by εequenceε numbered 18, 19, 50 and 51.
The hybridization product of HCV nucleic acid with a non-naturally occurring nucleic acid having non-HCV-1 εequence correεponding to εequenceε within the HCV genome has utility for priming a reaction for the synthesis of nucleic acid.
The hybridization product of HCV nucleic acid with a non-naturally occurring nucleic acid having a
SUBSTITUTE SHEET εequence correεponding to a particular genotype of HCV haε utility for priming a reaction for the εyntheεiε of nucleic acid of εuch genotype. In one embodiment, the εynthesized nucleic acid is indicative of the presence of one or more genotypes of HCV.
The synthesis of nucleic acid may also facilitate cloning of the nucleic acid into expression vectors which synthesize viral proteins.
Embodiments of the present methodε have utility aε anti-εense agents for preventing the transcription or translation of viral nucleic acid. The formation of a hybridization product of a non-naturally occurring nucleic acid having εequenceε which correεpond to a particular genotype of HCV genomic εequencing with HCV nucleic acid may block tranεlation or transcription of εuch genotype. Therapeutic agents can be engineered to include all five genotypeε for incluεivity.
C. Peptide and antibody composition
A further embodiment of the present invention features a composition of matter compriεing a non-naturally occurring peptide of three or more amino acids correεponding to a nucleic acid having a non-HCV-l εequence. Preferably, the non-HCV-1 εequence corresponds with a sequence within one or more regions consisting of the NS5 region, the envelope 1 region, the 5'UT region, and* the core region.
SUBSTITUTE SHEET Preferably, with respect to peptides correεponding to a nucleic acid having a non-HCV-l εequence of the NS5 region, the εequence iε within εequenceε numbered 2-22. The εequence numbered 1 correεpondε to HCV-l. Sequenceε numbered 1-22 are εet forth in the Sequence Liεting.
Preferably, with reεpect to peptideε correεponding to a nucleic acid having a non-HCV-l εequence of the envelope 1 region, the εequence iε within εequenceε numbered 24-32. The εequence numbered 23 correεpondε to HCV-1. Sequenceε numbered 23-32 are εet forth in the Sequence Listing.
Preferably, with reεpect to peptideε correεponding to a nucleic acid having a non-HCV-l εequence directed to the core region, the εequence iε within εequenceε numbered 53-66. Sequence numbered 52 correεpondε to HCV-1. Sequences numbered 52-66 are εet forth in the Sequence Listing.
The further embodiment of the present invention features peptide compositionε correεponding to nucleic acid εequenceε of a genotype of HCV. The firεt genotype, GI, iε exemplified by εequenceε numbered 1-6, 23-25, 33-38 and 52-57. The εecond genotype, GII, iε exemplified by the εequenceε numbered 7-12, 26-28, 39-45 and 58-64. The third genotype, GUI, is exemplified by εequenceε numbered 13-17, 32, 46-47 and 65-66. The fourth genotype, GIV, iε exemplified
SUBSTITUTE SHEET εequenceε numbered 20-22, 29-31, 48 and 49. The fifth genotype, GV, iε exemplified by εequenceε numbered 18, 19, 50 and 51.
The non-naturally occurring peptides of the present invention are useful as a component of a vaccine. The εequence information of the preεent invention permits the design of vaccines which are inclusive for all or εome of the different genotypeε of HCV. Directing a vaccine to a particular genotype allowε prophylactic treatment to be tailored to maximize the protection to thoεe agentε likely to be encountered. Directing a vaccine to more than one genotype allowε the vaccine to be more inclusive.
The peptide compositions are also useful for the development of εpecific antibodieε to the HCV proteinε. One embodiment of the preεent invention features as a composition of matter, an antibody to peptideε corresponding to a non-HCV-l sequence of the HCV genome. Preferably, the non-HCV-l sequence is εelected from the εequence within a region conεiεting of the NS5 region, the envelope 1 region, and the core region. There are no peptideε associated with the untranslated 5'UT region.
Preferably, with respect to antibodies directed to peptideε of the NS5 region, the peptide correεpondε to a εequence within εequenceε numbered 2-22. Preferably, with respect to antibodieε directed to a peptide
SUBSTITUTE SHEET correεponding to the envelope 1 region, the peptide correεpondε to a εequence within εequenceε numbered 24-32. Preferably, with reεpect to the antibodieε directed to peptideε correεponding to the core region, the peptide correεpondε to a εequence within εequenceε numbered 53-66.
Antibodies directed to peptides which reflect a particular genotype have utility for the detection of εuch genotypes of HCV and therapeutic agents.. One embodiment of the preεent invention featureε an antibody directed to a peptide correεponding to nucleic acid having εequenceε of a particular genotype. The firεt genotype, GI, iε exemplified by εequenceε numbered 1-6, 23-25, 33-38 and 52-57. The εecond genotype, GII, iε exemplified by the sequences numbered 7-12, 26-28, 39-45 and 58-64. The third genotype, GUI, is exemplified by sequenceε numbered 13-17, 32, 46-47 and 65-66. The fourth genotype, GIV, iε exemplified εequenceε numbered 20-22, 29-31, 48 and 49. The fifth genotype, GV, iε exemplified by εequenceε numbered 18, 19, 50 and 51.
Individuals skilled in the art will readily recognize that the compositionε of the present invention can be packaged with instructions for use in the form of a kit for performing nucleic acid hybridizationε or immunochemical reactionε.
SUBSTITUTE SHEET The preεent invention iε further described in the following figures which illustrate εequenceε demonstrating genotypes of HCV. The εequences are deεignated by numerals 1-145, which numerals and εequences are conεistent with the numeralε and εequenceε εet forth in the Sequence Liεting. Sequenceε 146 and 147 facilitate the diεcuεεion of an aεεay which numeralε and εequences are consiεtent with the numeralε and εequenceε εet forth in- the Sequence Liεting.
Brief eεcription of the Figureε and Sequence Liεting
Figure 1 depicts schematically the genetic organization of HCV;
Figure 2 setε forth nucleic acid εequences numbered 1-22 which εequenceε are derived from the NS5 region of the HCV viral genome;
Figure 3 εetε forth nucleic acid sequenceε numbered 23-32 which εequenceε are derived from the envelope 1 region of the HCV viral genome; Figure 4 εetε forth nucleic acid εequenceε numbered 33-51 which εequenceε are derived from the 5'UT region of the HCV viral genome; and.
Figure 5 εetε forth nucleic acid εequenceε numbered 52-66 which εequenceε are derived from the core region of the HCV viral genome.
The Sequence Liεting εetε forth the εe uences of sequenceε numbered 1-147.
SUBSTITUTE SHEET Detailed Deεcription of the Invention
The preεent invention will be deεcribed in detail aε aε nucleic acid having εequenceε correεponding to the HCV genome and related peptideε and binding partnerε, for diagnostic and therapeutic applications. The practice of the present invention will employ, unless otherwise indicated, conventional techniques of chemistry, molecular biology, microbiology, recombinant DNA, and immunology, which are within the skill of the art. Such techniques are explained fully in the literature. See e.g., Maniatis, Fitsch & Sambrook, Molecular Cloning; A Laboratory Manual (1982); DNA Cloning, Volumes I and II (D.N Glover ed. 1985); Oligonucleotide Synthesiε (M.J. Gait ed, 1984); Nucleic Acid Hybridization (B.D. Hames & S.J. Higgins edε. 1984); the series, Methods in Enzymology (Academic Preεε, Inc.), particularly Vol. 154 and Vol. 155 (Wu and Grossman, edε.).
The cDNA libraries are derived from nucleic acid sequenceε preεent in the plasma of an HCV-infected chimpanzee. The construction of one of these libraries, the "c" library (ATCC No. 40394), is described in PCT Pub. No. WO90/14436. The sequences of the library relevant to the present invention are εet forth herein as εequence numberε 1, 23, 33 and 52. Nucleic acids iεolated or synthesized in accordance with featureε of the preεent invention are
SUBSTITUTE SHEET uεeful, by way of example without limitation aε probeε, primerε, anti-εenεe geneε and for developing expreεsion εystems for the synthesis of peptides corresponding to such sequences. The nucleic acid εequenceε described define genotypeε of HCV with reεpect to four regionε of the viral genome. Figure 1 depicts schematically the organization of HCV. The four regions of particular interest are the NS5 region, the envelope 1 region, the 5'UT region and the core region.
The sequences εet forth in the preεent application aε εequenceε numbered 1-22 suggest at least five genotypes in the NS5 region. Sequences numbered 1-22 are depicted in Figure 2 as well as the Sequence Liεting. Each εequence numbered 1-22 iε derived from nucleic acid having 340 nucleotideε from the NS5 region. The five genotypes are defined by groupings of the εequenceε defined by εequence numbered 1-22. For convenience, in the preεent application, the different genotypeε will be assigned roman numeralε and the letter "G".
The firεt genotype (GI) iε exemplified by εequenceε within εequenceε numbered 1-6. A second genotype (GII) is exemplified by sequences within εequenceε numbered 7-12. A third genotype (GUI) iε exemplified by the sequences within sequences numbered 13-17. A fourth genotype (GIV) is exemplified by
SUBSTITUTE SHEET εequenceε within εequenceε numbered 20-22. A fifth genotype (GV) iε exemplified by εequenceε within εequenceε numbered 18 and 19.
The εequenceε set forth in the preεent application aε εequenceε numbered 23-32 εuggest at least four genotypes in the envelope 1 region of HCV. Sequences numbered 23-32 are depicted in Figure 3 as well aε in the Sequence Listing. Each εequence numbered 23-32 iε - derived from nucleic acid having 100 nucleotides from the envelope 1 region.
A firεt envelope 1 genotype group (GI) iε exemplified by the εequenceε within the εequenceε numbered 23-25. A εecond envelope 1 genotype (GII) region iε exemplified by εequenceε within εequenceε numbered 26-28. A third envelope 1 genotype (GUI) iε exemplified by the εequenceε within εequenceε numbered 32. A fourth envelope 1 genotype (GIV) iε exemplified by the sequenceε within εequence numbered 29-31.
The εequenceε εet forth in the preεent application aε sequences numbered 33-51 εuggeεt at leaεt three genotypeε in the 5'UT region of HCV. Sequenceε numbered 33-51 are depicted in Figure 4 as well as in the Sequence Liεting. Each εequence numbered 33-51 iε derived from the nucleic acid having 252 nucleotides from the 5'UT region, .although εequences 50 and 51 are somewhat shorter at approximately 180 nucleotideε.
SUBSTITUTE SHEET The first 5'UT genotype (GI) iε exemplified by the εequences within sequenceε numbered 33-38. A εecond 5'UT genotype (GII) iε exemplified by the εequenceε within εequences numbered 39-45. A third 5'UT genotype (GUI) is exemplified by the sequenceε within εequenceε numbered 46-47. A fourth 5'UT genotype (GIV) iε exemplified by εequenceε within sequenceε humbered 48 and 49. A fifth 5'UT genotype (GV) iε exemplified by εequenceε within εequences numbered 50 and 51. The εequences numbered 48-62 εuggeεt at leaεt three genotypeε in the core region of HCV. The εequences numbered 52-66 are depicted in Figure 5 as well as in the Sequence Listing.
The first core region genotype (GI) is exemplified by the εequenceε within εequences numbered 52-57. The εecond core region genotype (GII) is exemplified by εequences within sequences numbered 58-64. The third core region genotype (GUI) is exemplified by sequenceε within sequences numbered 65 and 66. Sequences numbered 52-65 are comprised of 549 nucleotides.
Sequence numbered 66 iε comprised of 510 nucleotides. The variouε genotypeε described with respect to each region are consistent. That is, HCV having features of the first genotype with respect to the NS5 region will subεtantially conform to featureε of the firεt genotype of the envelope 1 region, the 5'UT region and the core region.
SUBSTITUTE SHEET - 22 -
Nucleic acid iεolated or synthesized in accordance with the εequenceε εet forth in εequence numberε 1-66 are uεeful aε probes, primers, capture ligands and anti-sense agents. As probeε, primerε, capture ligands and anti-sense agentε, the nucleic acid wil normally compriεe approximately eight or more nucleotideε for specificity as well aε the ability to form stable hybridization products.
Probeε
A nucleic acid iεolated or synthesized in accordance with a sequence defining a particular genotype of a region of the HCV genome can be used as a probe to detect εuch genotype or uεed in combination with other nucleic acid probeε to detect εubεtantially all genotypeε of HCV.
With the εequence information εet forth in the preεent application, εequenceε of eight or more nucleotideε are identified which provide the deεired incluεivity and excluεivity with reεpect to variouε genotypes within HCV, and extraneous nucleic acid εequences likely to be encountered during hybridization conditions.
Individuals εkilled in the art will readily recognize that the nucleic acid εequenceε, for uεe aε probeε, can be provided with a label to facilitate detection of a hybridization product.
SUBSTITUTE SHEET Capture Ligand
For uεe aε a capture ligand, the nucleic acid εelected in the manner deεcribed above with reεpect to probes, can be readily associated with supports. The manner in which nucleic acid iε aεεociated with εupportε iε well known. Nucleic acid having εequenceε corresponding to a sequence within sequenceε numbered 1-66 have utility to εeparate viral nucleic acid of one genotype from the nucleic acid of HCV of a different genotype. Nucleic acid iεolated or synthesized in accordance with εequenceε within εequenceε numbered 1-66, used in combinationε, have utility to capture εubεtantially all nucleic acid of all HCV genotypeε.
Primerε
Nucleic acid iεolated or synthesized in accordance with the sequenceε deεcribed herein have utility aε primerε for the amplification of HCV εequenceε. With reεpect to polymeraεe chain reaction (PCR) techniqueε, nucleic acid εequenceε of eight or more nucleotideε corresponding to one or more sequenceε of εequenceε numbered 1-66 have utility in conjunction with εuitable enzymeε and reagentε to create copieε of the viral nucleic acid. A plurality of primerε having different εequenceε corresponding to more than one genotype can be used to create copies of viral nucleic acid for such genotypes.
SUBSTITUTE SHEET The copies can be used in diagnostic assays to detect HCV virus. The copieε can also be incorporated into cloning and expresεion vectors to generate polypeptideε correεponding to the nucleic acid εyntheεized by PCR, aε will be deεcribed in greater detail below.
Anti-εenεe
Nucleic acid iεolated or εyntheεized in accordance with the εequenceε deεcribed herein have utility aε anti-εenεe geneε to prevent the expreεεion of HCV.
Nucleic acid correεponding to a genotype of HCV iε loaded into a εuitable carrier εuch aε a lipoεo e for introduction into a cell infected with HCV. A nucleic acid having eight or more nucleotideε iε capable of binding to viral nucleic acid or viral messenger RNA.
Preferably, the anti-εenεe nucleic acid iε compriεed of
30 or more nucleotideε to provide neceεεary εtability of a hybridization product of viral nucleic acid or viral messenger RNA. Methods for loading anti-εenεe nucleic acid iε known in the art aε exemplified by U.S.
Patent 4,241,046 issued December 23, 1980 to
Papahadjopoulos et al.
Peptide Synthesis
Nucleic acid iεolated or εyntheεized in accordance with the sequences described herein have utility to
SUBSTITUTE SHEET generate peptides. The εequenceε exemplified by εequences numbered 1-32 and 52-66 can be cloned into εuitable vectorε or uεed to iεolate nucleic acid. The iεolated nucleic acid iε combined with suitable DNA linkerε and cloned into a εuitable vector. The vector can be uεed to tranεform a εuitable hoεt organiεm εuch as E_j_ coli and the peptide encoded by the sequenceε iεolated.
Molecular cloning techniques are described in the text Molecular Cloning: A Laboratory Manual, Maniatis et al., Coldspring Harbor Laboratory (1982).
The isolated peptide has utility as an antigenic substance for the development of vaccines and antibodies directed to the particular genotype of HCV.
Vaccineε and Antibodieε
The peptide materials of the present invention have utility for the development of antibodies and vaccineε. The availability of cDNA εequenceε, or nucleotide εequenceε derived therefrom (including segments and mod fications of the sequence), permitε the construction of expression vectors encoding antigenically active regions of the peptide encoded in either strand. The antigenically active regions may be derived from the NS5 region, envelope 1 regions, and the core region.
SUBSTITUTE SHEET Fragments encoding the desired peptides are derived from the cDNA clones using conventional restriction digestion or by synthetic methods, and are ligated into vectors which may, for example, contain portions of fuεion εequenceε εuch aε beta galactoεidaεe or εuperoxide diεmutaεe (SOD), preferably SOD. Methodε and vectorε which are uεeful for the production of polypeptideε which contain fusion εequences of SOD are described in European Patent Office Publication number 0196056, publiεhed October 1, 1986.
Any desired portion of the HCV cDNA containing an open reading frame, in either sense strand, can be obtained as a recombinant peptide, such as a mature or fuεion protein; alternatively, a peptide encoded in the cDNA can be provided by chemical synthesis.
The DNA encoding the desired peptide, whether in fused or mature form, and whether or not containing a signal sequence to permit secretion, may be ligated into expresεion vectorε εuitable for any convenient hoεt. Both eukaryotic and prokaryotic host syεtemε are preεently uεed in forming recombinant peptideε. The peptide iε then iεolated from lyεed cellε or from the culture medium and purified to the extent needed for its intended use. Purification may be by techniques known in the art, for example, differential extraction, εalt fractionation, chromatography on ion exchange resins, affinity chromatography, centrifugation, and
SUBSTITUTE SHEET the like. See, for example. Methods in Enzymology for a variety of methodε for purifying proteinε. Such peptideε can be uεed as diagnostics, or those which give riεe to neutralizing antibodies may be formulated into vaccines. Antibodies raised againεt these peptides can alεo be uεed aε diagnostics, or for passive immunotherapy or for isolating and identifying HCV.
An antigenic region of a peptide iε generally relatively εmall—typically 8 to 10 amino acidε or leεε in length. Fragmentε of aε few aε 5 amino acidε may characterize an antigenic region. These segments may correspond to NS5 region, envelope 1 region, and the core region of the HCV genome. The 5'UT region is not known to be tranεlated. Accordingly, uεing the cDNAs of εuch regionε, DNAε encoding short segments of HCV peptides correεponding to εuch regions can be expressed recombinantly either as fusion proteinε, or aε iεolated peptideε. In addition, εhort amino acid sequences can be conveniently obtained by chemical syntheεis. In instanceε wherein the εyntheεized peptide iε correctly configured εo as to provide the correct epitope, but is too εmall to be immunogenic, the peptide may be linked to a εuitable carrier. A number of techniqueε for obtaining such linkage are known in the art, including the formation of disulfide linkages using N-succinimidyl-3-(2-
SUBSTITUTE SHEET pyridylthio)propionate (SPDP) and εuccinimidyl 4-(N-maleimido-methyl)eyelohexane-1-carboxylate (SMCC) obtained from Pierce Company, Rockford, Illinoiε, (if the peptide lackε a εulfhydryl group, this can be provided by addition of a cysteine reεidue) . Theεe reagents create a disulfide linkage between themεelveε and peptide cyεteine reεidueε on one protein and an amide linkage through the epεilon-amino on a lyεine, or other free amino group in the other. A variety of εuch diεulfide/amide-forming agentε are known. See, for example, Immun Rev (1982) 62:185. Other bifunctional coupling agentε form a thioether rather than a diεulfide linkage. Many of theεe thio-ether-forming agentε are commercially available and include reactive eεterε of 6-maleimidocaprioc acid, 2-bromoacetic acid, 2-iodoacetic acid, 4-N-maleimido-methyl)cyclohexane-l- carboxylic acid, and the like. The ca box 1 groupε can be activated by combining them with εuccinimide or l-hydroxyl-2 nitro-4-εulfonic acid, εodium εalt. Additional methodε of coupling antigens employs the rotavirus/"binding peptide" εystem described in EPO Pub. No. 259,149, the disclosure of which iε incorporated herein by reference. The foregoing liεt iε not meant to be exhaustive, and modifications of the named compounds can clearly be used.
Any carrier may be uεed which does not itself induce the production of antibodieε harmful to the
SUBSTITUTE SHEET hoεt. Suitable carrierε are typically large, εlowly metabolized macromolecules such aε proteinε; polyεaccharideε, εuch as latex functionalized Sepharoεe, agaroεe, cellulose, cellulose beads and the like; polymeric amino acids, such aε polyglutamic acid, polylyεine, and the like; amino acid copolymers; and inactive virus particles. Especially useful protein substrates are serum albumins, keyhole limpet hemocyanin, immunoglobulin molecules, thyroglobulin, ovalbumin, tetanuε toxoid, and other proteinε well known to those skilled in the art.
Peptides comprising HCV amino acid sequences encoding at least one viral epitope derived from the NS5, envelope 1, and core region are useful immunological reagents. The 5'UT region iε not known to be translated. For example, peptides comprising such truncated sequenceε can be uεed as reagents in an immunoasεay. These peptides also are candidate subunit antigens in compositions for antiserum production or vaccines. While the truncated εequenceε can be produced by variouε known treatmentε of native viral protein, it iε generally preferred to make synthetic or recombinant peptides comprising HCV sequence. Peptides comprising these truncated HCV sequenceε can be made up entirely of HCV εequences (one or more epitopes, either contiguous or noncontiguouε), or HCV εequenceε and heterologous sequenceε in a fuεion protein. Useful
SUBSTITUTE SHEET heterologouε εequenceε include εequenceε that provide for secretion from a recombinant hoεt, enhance the immunological reactivity of the HCV epitope(ε), or facilitate the coupling of the polypeptide to an immunoasεay εupport or a vaccine carrier. See, E.G., EPO Pub. No. 116,201; U.S. Pat. No. 4,722,840; EPO Pub. No. 259,149 U.S. Pat. No. 4,629,783.
The εize of peptideε compriεing the truncated HCV εequenceε can vary widely, the minimum εize being a εequence of εufficient εize to provide an HCV epitope, while the maximum εize iε not critical. For convenience, the maximum εize usually is not εubεtantially greater than that required to provide the desired HCV epitopes and function(s) of the heterologous εequence, if any. Typically, the truncated HCV amino acid sequence will range from about 5 to about 100 amino acids in length. More typically, however, the HCV εequence will be a maximum of about 50 amino acidε in length, preferably a maximum of about 30 amino acidε. It iε usually desirable to select HCV εequenceε of at leaεt about 10, 12 or 15 amino acids, up to a maximum of about 20 or 25 amino acids.
HCV amino acid εequences compriεing epitopes can be identified in a number of wayε. For example, the entire protein εequence correεponding to each of the NS5, envelope 1, and core regionε can be εcreened by preparing a εerieε of short peptideε that together εpan
SUBSTITUTE SHEET the entire protein εequence of εuch regionε. By εtarting with, for example, peptideε of approximately 100 amino acidε, it would be routine to teεt each peptide for the preεence of epitope(ε) showing a desired reactivity, and then testing progressively smaller and overlapping fragments from an identified peptides of 100 amino acids to map the epitope of interest. Screening εuch peptides in an immunoasεay iε within the εkill of the art. It iε alεo known to carry out a computer analysis of a protein sequence to identify potential epitopes, and then prepare peptides comprising the identified regions for screening.
The immunogenicity of the epitopes of HCV may also be enhanced by preparing them in mammalian or yeast εyεtemε fuεed with or aεεembled with particle-forming proteinε εuch as, for example, that associated with hepatitiε B εurface antigen. See, e.g. , US 4,722,840. Constructs wherein the HCV epitope is linked directly to the particle-forming protein coding sequenceε produce hybridε which are immunogenic with reεpect to the HCV epitope. In addition, all of the vectorε prepared include epitopeε εpecific to HBV, having various degrees of immunogenicity, such as, for example, the pre-S peptide. Thus, particleε conεtructed from particle forming protein which include HCV εequences are immunogenic with respect to HCV^and HBV.
SUBSTITUTE SHEET Hepatitiε surface antigen (HBSAg) has been εhown to be formed and aεεembled into particleε in S. cereviεiae (P. Valenzuela et al. (1982)), aε well aε in, for example, mammalian cells (P. Valenzuela et al. 1984)). The formation of εuch particleε haε been εhown to enhance the immunogenicity of the monomer εubunit. The constructs may also include the immunodominant epitope of HBSAg, compriεing the 55 amino acidε of the presurface (pre-S) region. Neurath et al. (1984). Constructs of the pre-S-HBSAg particle expressible in yeast are discloεed in EPO 174,444, published March 19, 1986; hybridε including heterologouε viral εequenceε for yeaεt expreεεion are diεcloεed in EPO 175,261, published March 26, 1966. These constructε may alεo be expressed in mammalian cells such aε Chineεe hamster ovary (CHO) cells uεing an SV40-dihydrofolate reductaεe vector (Michelle et al. (1984)).
In addition, portionε of the particle-forming protein coding εequence may be replaced with codonε encoding an HCV epitope. In this replacement, regions which are not required to mediate the aggregation of the units to form immunogenic particles in yeaεt of mammals can be deleted, thus eliminating additional HBV antigenic εiteε from competition with the HCV epitope.
Vaccines
Vaccines may be prepared from one or more
SUBSTITUTE SHEET immunogenic peptideε derived from HCV. The obεerved homology between HCV and Flaviviruεes provides information concerning the peptides which are likely to be most effective as vaccines, as well as the regionε of the genome in which they are encoded.
Multivalent vaccineε againεt HCV may be compriεed of one or more epitopeε from one or more proteinε derived from the NS5, envelope 1, and core regionε. In particular, vaccineε are contemplated compriεing one or more HCV proteinε or εubunit antigens derived from the NS5, envelope 1, and core regions. The 5'UT region iε not known to be tranεlated.
The preparation of vaccineε which contain an immunogenic peptide aε an active ingredient, iε known to one skilled in the art. Typically, such vaccines are prepared as injectables, either as liquid solutions or suspensions; solid forms suitable for solution in, or suspension in, liquid prior to injection may also be prepared. The preparation may also be emulsified, or the protein encapsulated in liposomeε. The active immunogenic ingredients are often mixed with excipients which are pharmaceutically acceptable and compatible with the active ingredient. Suitable excipientε are, for example, water, εaline, dextroεe, glycerol, ethanol, or the like and combinations thereof. In addition, if desired, the vaccine may contain minor amountε of auxiliary εubstances εuch aε wetting or
SUBSTITUTE SHEET emulεifying agentε, pH buffering agentε, and/or adjuvants which enhance the effectiveneεε of the vaccine. Exampleε of adjuvants which may be effective include but are not limited to: aluminum hydroxide, N-acetyl-muramyl-L-theronyl-D- isoglutamine (thr-MDP), N-acetyl-nor-muramyl-L-alanyl- D-iεoglutamine (CGP 11637, referred to aε nor-MDP), N- acetylmuramyl-L- alanyl-D-iεoglutaminyl-L-alanine-2-(1- 2-dipalmitoyl -εn-glycero-3-hydroxyphoεphoryloxy)- ethylamine (CGP 19835A, referred to aε MTP-PE), and RIBI, which containε three componentε extracted from bacteria, monophoεphoryl lipid A, trehaloεe dimycolate and cell wall skeleton (MPL+TDM+CWS) in a 2% εqualene/Tween 80 emulsion. The effectiveneεε of an adjuvant may be determined by meaεuring the amount of antibodieε directed againεt an immunogenic peptide containing an HCV antigenic εequence reεulting from adminiεtration of thiε peptide in vaccines which are also comprised of the variouε adjuvantε. The vaccineε are conventionally adminiεtered parenterally, by injection, for example, either subcutaneously or intramuscularly. Additional formulationε which are εuitable for other modes of administration include εuppoεitorieε and, in εome cases, oral formulations. For εuppoεitorieε, traditional binderε and carrierε may include, for example, polyalkylene glycolε or triglycerides; εuch
SUBSTITUTE SHEET εuppoεitorieε may be formed from mixtures containing the active ingredient in the range of 0/5% to 10%, preferably l%-2%. Oral formulationε include εuch normally employed excipientε as, for example, pharmaceutical grades of mannitol, lactose, starch, magnesium εtearate, sodium saccharine, cellulose, magnesium carbonate, and the like.
The exampleε below are provided for illustrative purposeε and are not intended to limit the εcope of the present invention.
I. Detection of HCV RNA from Serum
RNA was extracted from serum uεing guanidinium εalt, phenol and chloroform according to the instructions of the kit manufacturer (RNAzol"* B kit, Cinna/Biotecx) . Extracted RNA was precipitated with isopropanol and waεhed with ethanol. A total of 25 μl serum was processed for RNA isolation, and the purified RNA waε reεuεpended in 5 μl diethyl pyrocarbonate treated water for subsequent cDNA synthesis.
II. cDNA Synthesis and Polymerase Chain Reaction (PCR) Amplification Table 1 listε the εequence and poεition (with reference to HCVl) of all the PCR*primerε and probeε uεed in theεe examples. Letter designations for
SUBSTITUTE SHEET - 36 -
nucleotides are consiεtent with 37 C.F.R. SSI.821- 1.825. Thuε, the letters A, C G, T, and U are uεed in the ordinary εenεe of adenine, cytoεine, guanine, thymine, and uracil. The letter M means A or C; R meanε A or G; W meanε A or T/U; S meanε C or G; Y meanε C or T/U; K meanε G or T/U; V means A or C or G, not T/U; H means A or C or T/U, not G; D means A or G or T/ϋ, not C; B meanε C or G or T/U, not A; N meanε (A or C or G or T/U) or (unknown or other). Table 1 iε εet forth below:
Table 1 Seq. No. Sequence (5'-3') Nucleotide Poεition
67 CAAACGTAACACCAACCGRCGCCCACAGG 374-402 68 ACAGAYCCGCAKAGRTCCCCCACG 1192-1169
69 GCAACCTCGAGGTAGACGTCAGCCTATCCC 509-538
70 GCAACCTCGTGGAAGGCGACAACCTATCCC 509-538
71 GTCACCAATGATTGCCCTAACTCGAGTATT 948-977
72 GTCACGAACGACTGCTCCAACTCAAG 948-973 73 TGGACATGATCGCTGGWGCYCACTGGGG 1375-1402
74 TGGAYATGGTGGYGGGGGCYCACTGGGG 1375-1402
75 ATGATGAACTGGTCVCCYAC 1308-1327
76 ACCTTVGCCCAGTTSCCCRCCATGGA 1453-1428
77 AACCCACTCTATGYCCGGYCAT 205-226 78 GAATCGCTGGGGTGACCG 171-188
79 CCATGAATCACTCCCCTGTGA<3GAACTA 30-57
80 TTGCGGGGGCACGCCCAA 244-227
SUBSTITUTE SHEET - 37 -
For cDNA synthesis and PCR amplification, a protocol developed by Perkin-Elmer/Cetus (GeneAmp® RNA PCR kit) was uεed. Both random hexa er and primerε with εpecific complementary εequences to HCV were employed to prime the reverse transcription (RT) reaction. All processes, except for adding and mixing reaction components, were performed in a thermal cycler (MJ Research, Inc.). The firεt strand cDNA synthesis reaction was inactivated at 99°C for 5 min, and then cooled at 50°C for 5 min before adding reaction components for subsequent amplification. After an initial 5 cycles of 97βC for 1 min, 50°C for 2 min, and 72βC for 3 min, 30 cycles of 94°C for 1 min, 55βC for 2 min, and 72°C for 3 min followed, and then a final 7 min of elongation at 72°C.
For the genotyping analysis, sequences 67 and 68 were used as primers in the PCR reaction. These primers amplify a εegment correεponding to the core and envelope regions. After amplification, the reaction productε were separated on an agaroεe gel and then transferred to a nylon membrane. The immobilized reaction products were allowed to hybridize with a
32
P-labelled nucleic acid corresponding to either Genotype I (core or envelope 1) or Genotype II (core or envelope 1). Nucleic acid corresponding to Genotype 1 comprised sequenceε numbered 69 (core), 71 (envelope), and 73 (envelope). Nucleic acid corresponding to
SUBSTITUTE SHEET Genotype II comprised sequences numbered 70 (core), 72 (envelope), and 74 (envelope).
The Genotype I probes only hybridized to the product amplified from isolateε which had Genotype I εequence. Similarly, Genotype II probeε only hybridized to the product amplified from iεolates which had Genotype II sequence.
In another experiment, PCR products were generated using sequenceε 79 and 80. The products were analyzed as described above except Sequence No. 73 was used to detect Genotype I, Sequence No. 74 was used to detect Genotype II, Sequence No. 77 (5'UT) was uεed to detect Genotype III, and Sequence No. 78 (5'UT) waε uεed to detect Genotype IV. Each εequence hybridized in a genotype εpecific manner.
III. Detection of HCV GI-GIV uεing a εandwich hybridization assay for HCV RNA An amplified solution phase nucleic acid sandwich hybridization assay format is described in this example. The assay format employs several nucleic acid probes to effect capture and detection. A capture probe nucleic acid is capable of associating a complementary probe bound to a solid support and HCV nucleic acid to effect capture. A detection probe nucleic acid has a first εegment (A) that binds to HCV nucleic acid and a second segment (B) that hybridizes to a second amplifier nucleic acid.
SUBSTITUTE SHEET The amplifier'nucleic acid has a firεt εegment (B*) that hybridizes to εegment (B) of the probe nucleic acid and alεo comprises fifteen iterations of a εegment (C). Segment C of the amplifier nucleic acid is capable of hybridizing to three labeled nucleic acids. Nucleic acid sequenceε which correεpond to nucleotide εequenceε of the envelope 1 gene of Group I HCV iεolateε are εet forth in εequenceε numbered 81-99. Table 2 εetε forth the area of the HCV genome to which the nucleic acid εequenceε correεpond and a preferred uεe of the εequenceε.
Complement of Nucleotide Numbers
879-911 912-944 945-977 978-1010
1011-1043 1044-1076 1077-1109 1110-1142 1143-1175
SUBSTITUTE SHEET Table 2 continued
Nucleic acid sequences which correspond to nucleotide sequences of the envelope 1 gene of Group II HCV isolates are set forth in sequences 100-118. Table 3 sets forth the area of the HCV genome to which the nucleic acid corresponds and the preferred use of the sequences.
SUBSTITUTE SHEET Table 3
Probe Type Sequence No. Complement of Nucleotide Numbers
879- -911 912- -944 945- -977 978- -1010 1011- -1043 1044- 1076 1077- 1109 1110- -1142 1143- 1175 1176- 1208 1209- 1241 1242= 1274 1275- 1307 1308- 1340 1341- 1373 1374- 1406 1407- 1439 1440- 1472 1473- 1505
Nucleic acid sequences which correεpond to nucleotide εequenceε in the C gene and the 5'UT region
SUBSTITUTE SHEET are εet forth in εequenceε 119-145. Table 4 identifieε the εequence with a preferred uεe.
Table 4
Probe Type Sequence No.
SUBSTITUTE SHEET - 43 -
Table 4 continued
The detection and capture probe HCV-εpecific segments, and their respective names aε uεed in this aεsay were as follows. Capture sequences are sequenceε numbered
119-122 and 141-144.
Detection εequenceε are εequenceε numbered 119-140. Each detection εequence contained, in addition to the sequences substantially complementary to the HCV sequenceε, a 5' extenεion (B) which extenεion (B) iε complementary to a εegment of the εecond amplifier nucleic acid. The extension (B) εequence iε identified in the Sequence Liεting aε Sequence No. 146, and iε reproduced below.
AGGCATAGGACCCGTGTCTT
SUBSTITUTE SHEET - 44 -
Each capture εequence contained, in addition to the εequences substantially complementary to HCV εequenceε, a εequence complementary to DNA bound to a solid phaεe. The εequence complementary to DNA bound to a solid εupport was carried downstream from the capture εequence. The sequence complementary to the DNA bound to the support iε εet forth aε Sequence No. 147 and is reproduced below.
CTTCTTTGGAGAAAGTGGTG Microtiter plates were prepared aε follows. White Microlite 1 Re ovawell εtripε (polyεtyrene microtiter plates, 96 wells/plate) were purchased from Dynatech Inc. -
Each well was filled with 200 μl 1 N HC1 and incubated at room temperature for 15-20 min. The plates were then washed 4 times with IX PBS and the wellε aspirated to remove liquid. The wellε were then filled with 200 μl 1 N NaOH and incubated at room temperature for 15-20 min. The plateε were again washed 4 times with IX PBS and the wells aspirated to remove liquid.
Poly(phe-lys) was purchaεed from Sigma Chemicals, Inc. This polypeptide haε a 1:1 molar ratio of phe:lyε and an average m.w. of 47,900 gm/mole. It haε an average length of 309 amino acids and contains 155 amines/mole. A 1 mg/ml εolution of the polypeptide was mixed with 2M NaCl/lX PBS to a final concentration of
SUBSTITUTE - 45 -
0.1 mg/ml (pH 6.0). A volume of 200 μl of this εolution waε added to each well. The plate was wrapped in plastic to prevent drying and incubated at 30βC overnight. The plate was then washed 4 times with IX PBS and the wells aspirated to remove liquid.
The following procedure was uεed to couple the nucleic acid, a complementary εequence to Sequence No. 147, to the plates, hereinafter referred to as immobilized nucleic acid. Synthesiε of immobilized nucleic acid having a εequence complementary to
Sequence No. 133 waε deεcribed in EPA 883096976. A quantity of 20 mg diεuccinimidyl εuberate waε dissolved in 300 μl dimethyl formamide (DMF). A quantity of 26 O -gQ units of immobilized nucleic acid waε added to 100 μl coupling buffer (50 mM sodium phoεphate, pH 7.8). The coupling mixture waε then added to the DSS-DMF εolution and stirred with a magnetic stirrer for 30 min. An NAP-25 column was equilibrated with 10 mM sodium phoεphate, pH 6.5. The coupling mixture DSS-DMF εolution was added to 2 ml 10 mM sodium phosphate, pH 6.5, at 4°C. The mixture was vortexed to mix and loaded onto the equilibrated NAP-25 column. DSS-activated immobilized nucleic acid DNA was eluted from the column with 3.5 ml 10 mM sodium phosphate, pH 6.5. A quantity of 5.6 OD2g. units of eluted
DSS-activated immobilized nucleic acid DNA was added to 1500 ml 50 mM sodium phosphate, pH 7.8. A volume of 50
SUBSTITUTE SHEET μl of thiε εolution was added to each well and the plates were incubated overnight. The plate was then waεhed 4 timeε with IX PBS and the wellε aspirated to remove liquid. Final stripping of plateε waε accompliεhed aε follows. A volume of 200 μl of 0.2N NaOH containing 0.5% (w/v) SDS was added to each well. The plate waε wrapped in plaεtic and incubated at 65°C for 60 min. The plate was then washed 4 times with IX PBS and the wells aspirated to remove liquid. The stripped plate was εtored with deεiccant beads at 2-8°C.
Serum samples to be asεayed were analyzed uεing PCR followed by εequence analyεiε to determine the genotype. Sample preparation conεisted of delivering 50 μl of the serum sample and 150 μl P-K Buffer (2 mg/ml proteinase K in 53 mM Triε-HCl, pH 8.0/0.6 M NaCl/0.06 M sodium citrate/8 mM EDTA, pH 8.0/1.3%SDS/16μg/ml sonicated salmon sperm DNA/7% formamide/50 fmoles capture probes/160 fmoles detection probes) to each well. Plates were agitated to mix the contents in the well, covered and incubated for 16 hr at 62°C. After a further 10 minute period at room temperature, the contents of each well were aspirated to remove all fluid, and the wells waεhed 2X with washing buffer (0.1% SDS/0.015 M NaCl/ 0.0015 M sodium citrate). The amplifier nucleic acid waε then added to
SUBSTITUTE SHEET each well (50 μl of 0.7 fmole/μl solution in 0..48 M NaCl/0.048 M sodium citrate/0.1% SDS/0.5% "blocking reagent" (Boehringer Mannheim, catalog No. 1096 176)). After covering the plateε and agitating to mix the contentε in the wellε, the plateε were incubated for 30 min. at 52°C.
After a further 10 min period at room temperature, the wellε were waεhed aε deεcribed above.
Alkaline phoεphatase label nucleic acid, discloεed in EP 883096976, waε then added to each well (50 μl/well of 2.66 fmoleε/μl). After incubation at 52°C for 15 min., and 10 min. at room temperature, the wells were waεhed twice aε above and then 3X with 0.015 M NaCl/0.0015 M sodium citrate. An enzyme-triggered dioxetane (Schaap et al., Tet. Lett. (1987) 28:1159-1162 and EPA Pub. No. 0254051), obtained from Lumigen, Inc., was employed. A quantity of 50 μl Lumiphos 530 (Lumigen) waε added to each well. The wellε were tapped lightly so that the reagent would fall to the bottom and gently swirled to diεtribute the reagent evenly over the bottom. The wells were covered and incubated at 37βC for 20-40 min.
Plates were then read on a Dynatech ML 1000 luminometer. Output was given as the full integral of the light produced during the reaction.
The aεεay poεitively detected each of the εerum samples, regardless of genotype.
SUBSTITUTE SHEET US92/040
- 48 -
IV. Expression of the Polypeptide Encoded in Sequences Defined by Differing Genotypes HCV polypeptides encoded by a εequence within εequences 1-66 are expresεed aε a fusion polypeptide with superoxide diεmutaεe (SOD) . A cDNA carrying εuch εequences is εubcloned into the expression vector pSODcfl (Steimer et al. 1986)).
Firεt, DNA iεolated from pSODcfl iε treated with BamHI and EcoRI, and the following linker waε ligated into the linear DNA created by the reεtriction enzymeε: 5 GAT CCT GGA ATT CTG ATA AGA
CCT TAA GAC TAT TTT AA 3 After cloning, the plaεmid containing the insert iε iεolated. Plaεmid containing the inεert is restricted with EcoRI. The HCV cDNA iε ligated into thiε EcoRI linearized plaεmid DNA. The DNA mixture iε uεed to transform E. coli strain D1210 (Sadler et al. (1980)). Polypeptides are isolated on gelε.
V. Antiqenicity of Polypeptideε
The antigenicity of polypeptideε formed in Section IV iε evaluated in the following manner. Polyethylene pins arranged on a block in an 8 12 array (Coselco Mimetopes, Victoria, Australia) are prepared by placing the pins in a bath (20% v/v piperidine in dimethylformamide (DMF)) for 30 minutes at room
SUBSTITUTE SHEET - 49 -
temperature. The pinε are removed, waεhed in DMF for 5 minutes, then waεhed in methanol four timeε (2 min/wash) . The pins are allowed to air dry for at least 10 minutes, then washed a final time in DMF (5Min). 1-Hydroxybenzotriazole (HOBt, 367 mg) is dissolved in DMF (80 μL) for use in coupling Fmoc-protected polypeptides prepared in Section IV.
The protected amino acids are placed in micro-titer plate wells with HOBt, and the pin block placed over the plate, immersing the pinε in the wellε. The assembly is then sealed in a plastic bag and allowed to react at 25βC for 18 hourε to couple the firεt amino acids to the pins. The block iε then removed, and the pinε washed with DMF (2 min.), MeOH (4 x, 2 min.), and again with DMF (2 min.) to clean and deprotect the bound amino acids. The procedure iε repeated for each additional amino acid coupled, until all octamerε are prepared.
The free N-termini are then acetylated to compensate for the free amide, as most of the epitopeε are not found at the N-terminuε and thuε would not have the associated positive charge. Acetylation is accomplished by filling the wellε of a microtiter plate with DMF/acetic anhydride/triethylamine (5:2:1 v/v/v) and allowing the pins to react in the wells for 90 minutes at 20βC. The pins are then washed with DMF (2
SUBSTITUTE SHEET min.) and MeOH (4 x, 2 min.), and air dried for at leaεt 10 minutes.
The side chain protecting groups are removed by treating the pinε with trifluoroacetic acid/phenol/ dithioethane (95:2.5:1.5, v/v/v) in polypropylene bags for 4 hours at room temperature. The pinε are then washed in dichloromethane (2 x, 2 min.), 5% di-isopropylethylamine/dichloromethane (2 x, 5 min.), dichloromethane. (5 min.), and air-dried for at leaεt 10 minutes. The pinε are then waεhed in water (2 min.), MeOH (18 hourε), dried in vacuo, and εtored in εealed plastic bags over silica gel. IV.B.lS.b Aεεay of Peptideε.
Octamer-bearing pinε are treated by εonicating for 30 minuteε in a diεruption buffer (1% sodium dodecylsulfate, 0.1% 2-mercaptoethanol, 0.1 M NaH2P04) at 60βC. The pinε are then immerεed εeveral times in water (60βC), followed by boiling MeOH (2 min.), and allowed to air dry. The pins are then precoated for 1 hour at 25βC in microtiter wells containing 200 μL blocking buffer (1% ovalbumin, 1% BSA, 0.1% Tween, and 0.05% NaN3 in PBS), with agitation. The pins are then immersed in microtiter wells containing 175 μL antisera obtained from human patients diagnoεed aε having HCV and allowed to incubate at 4°C overnight. The formation of a complex between polyclonal antibodies of the serum and
SUBSTITUTE SHEET - 51 -
the polypeptide initiates that the peptides give rise to an immune response in vivo. Such peptides are candidates for the development of vaccines.
Thus, this invention has been described and illustrated. It will be apparent to those skilled in the art that many variations and modifications can be made without departing from the purview of the appended claims and without departing from the teaching and scope of the present invention.
SUBSTITUTE SHEET - 52 -
SEQUENCE LISTING
(1) GENERAL INFORMATION:
(i) APPLICANT: Tai-An Cha
(ii) TITLE OF INVENTION: HCV GENOMIC SEQUENCES FOR DIAGNOSTICS AND THERAPEUTICS
(iii) NUMBER OF SEQUENCES: 147
(iv) CORRESPONDENCE ADDRESS:
(A) ADDRESSEE: Wolf, Greenfield & Sackε, P.C.
(B) STREET: 600 Atlantic Avenue (C) CITY: Boston
(D) STATE: Massachusetts
(E) COUNTRY: USA
(F) ZIP: 02210
(v) COMPUTER READABLE FORM:
(A) MEDIUM TYPE: Diskette, 5.25 inch
(B) COMPUTER: IBM compatible
(C) OPERATING SYSTEM: MS-DOS Version 3.3
(D) SOFTWARE: WordPerfect 5.1
SUBSTITUTE SHEET (vi) CURRENT APPLICATION DATA:
(A) APPLICATION NUMBER: Not Available
(B) FILING DATE: Not Available
(C) CLASSIFICATION: Not Available
(vii) PRIOR APPLICATION DATA:
(A) APPLICATION NUMBER: 07/697,326
(B) FILING DATE: 8 May 1991
(viii) ATTORNEY/AGENT INFORMATION:
(A) NAME: Janiuk, Anthony J.
(B) REGISTRATION NUMBER: 29,809
(C) REFERENCE/DOCKET NUMBER: C0772/7000
(ix) TELECOMMUNICATION INFORMATION:
(A) TELEPHONE: (617) 720-3500
(B) TELEFAX: (617) 720-2441
(C) TELEX: EZEKIEL
(2) INFORMATION FOR SEQ ID NO: 1:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 340 nucleotides
(B) TYPE: nucleic acid (C) STRANDEDNESS: single
(D) TOPOLOGY: linear
SUBSTITUTE SHEET - 54 -
(ii) MOLECULE TYPE: DNA
(vi) ORIGINAL SOURCE: (ATCC # 40394)
(C) INDIVIDUAL ISOLATE: nsδhcvl
( i) SEQUENCE DESCRIPTION: SEQ ID NO: 1
CTCCACAGTC ACTGAGAGCG ACATCCGTAC GGAGGAGGCA 40
ATCTACCAAT GTTGTGACCT CGACCCCCAA GCCCGCGTGG 80
CCATCAAGTC CCTCACCGAG AGGCTTTATG TTGGGGGCCC 120 TCTTACCAAT TCAAGGGGGG AGAACTGCGG CTATCGCAGG 160
TGCCGCGCGA GCGGCGTACT GACAACTAGC TGTGGTAACA 200
CCCTCACTTG CTACATCAAG GCCCGGGCAG CCTGTCGAGC 240
CGCAGGGCTC CAGGACTGCA CCATGCTCGT GTGTGGCGAC 280
GACTTAGTCG TTATCTGTGA AAGCGCGGGG GTCCAGGAGG 320 ACGCGGCGAG CCTGAGAGCC 340
(2) INFORMATION FOR SEQ ID NO: 2:
(i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 340 nucleotides
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: DNA
SUBSTITUTE SHE (vi) ORIGINAL SOURCE:
(C) INDIVIDUAL ISOLATE: ns5i21
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2 CTCCACAGTC ACTGAGAGCG ACATCCGTAC GGAGGAGGCA 40
ATTTACCAAT GTTGTGACCT GGACCCCCAA GCCCGCATGG 80
CCATCAAGTC CCTCACTGAG AGGCTTTATG TCGGGGGCCC 120
TCTTACCAAT TCAAGGGGGG AGAACTGCGG CTACCGCAGG 160
TGCCGCGCGA GCGGCGTACT GACAACTAGC TGTGGTAACA 200 CCCTCACTTG CTACATCAAG GCCCGGGCAG CCTGTCGAGC 240
CGCAGGGCTC CAGGACTGCA CCATGCTTGT GTGTGGCGAC 280
GACTTAGTCG TTATCTGTGA AAGTGCGGGG GTCCAGGAGG 320
ACGCGGCGAG CCTGAGAGCC 340
(2) INFORMATION FOR SEQ ID NO: 3:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 340 nucleotideε
(B) TYPE: nucleic acid (C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: DNA
(vi) ORIGINAL SOURCE:
(C) individual iεolate: nεδptl
SUBSTITUTE SHEET (Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3
CTCCACAGTC ACTGAGAGCG ACATCCGTAC GGAGGAGGCA 40
ATCTACCAAT GTTGTGATCT GGACCCCCAA GCCCGCGTGG 80
CCATCAAGTC CCTCACTGAG AGGCTTTACG TTGGGGGCCC 120 TCTTACCAAT TCAAGGGGGG AGAACTGCGG CTACCGCAGG 160
TGCCGGGCGA GCGGCGTACT GACAACTAGC TGTGGTAATA 200
CCCTCACTTG CTACATCAAG GCCCGGGCAG CCTGTCGAGC 240
CGCAGGGCTC CGGGACTGCA CCATGCTCGT GTGTGGTGAC 280
GACTTGGTCG TTATCTGTGA GAGTGCGGGG GTCCAGGAGG 320 ACGCGGCGAG CCTGAGAGCC 340
(2) INFORMATION FOR SEQ ID NO: 4
(i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 340 nucleotideε
(B) TYPE: nucleic acid
(C) STRANDEDNESS: εingle
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: DNA
(vi) ORIGINAL SOURCE:
(C) INDIVIDUAL ISOLATE: nε5gm2
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4
CTCTACAGTC ACTGAGAACG ACATCCGTAC GGAGGAGGCA 40 ATTTACCAAT GTTGTGACCT GGACCCCCAA GCCCGCGTGG 80
SUBSTITUTE SHEET - 57 -
CCATCAAGTC CCTCACTGAG AGGCTTTATG TTGGGGGCCC 120
CCTTACCAAT TCAAGGGGGG AAAACTGCGG CTATCGCAGG 160
TGCCGCGCGA GCGGCGTACT GACAACTAGC TGTGGTAACA 200
CCCTCACTTG CTACATTAAG GCCCGGGCAG CCTGTCGAGC 240
CGCAGGGCTC CAGGACTGCA CCATGCTCGT GTGTGGCGAC 280
GACTTAGTCG TTATCTGTGA GAGTGCGGGA GTCCAGGAGG 320
ACGCGGCGAA CTTGAGAGCC 340
(2) INFORMATION FOR SEQ ID NO: 5
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 340 nucleotides
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single (D) TOPOLOGY: linear
(ii) MOLECULE TYPE: DNA
(vi) ORIGINAL SOURCE: (C) INDIVIDUAL ISOLATE: ns5usl7
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5
CTCCACAGTC ACTGAGAGCG ATATCCGTAC GGAGGAGGCA 40
ATCTACCAGT GTTGTGACCT GGACCCCCAA GCCCGCGTGG 80 CCATCAAGTC CCTCACCGAG AGGCTTTATG TCGGGGGCCC 120
TCTTACCAAT TCAAGGGGGG AAAACTGCGG CTATCGCAGG 160
TGCCGCGCAA GCGGCGTACT GACAACTAGC TGTGGTAACA 200
SUBSTITUTE SHEET CCCTCACTTG TTACATCAAG GCCCAAGCAG CCTGTCGAGC 240
CGCAGGGCTC CGGGACTGCA CCATGCTCGT GTGTGGCGAC 280
GACTTAGTCG TTATCTGTGA AAGTCAGGGA GTCCAGGAGG 320
ATGCAGCGAA CCTGAGAGCC 340
(2) INFORMATION FOR SEQ ID NO: 6
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 340 nucleotideε (B) TYPE: nucleic acid
(C) STRANDEDNESS: εingle
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: DNA
(vi) ORIGINAL SOURCE:
(C) INDIVIDUAL ISOLATE: nε5εp2
( i) SEQUENCE DESCRIPTION: SEQ ID NO: 6 CTCTACAGTC ACTGAGAGCG ATATCCGTAC GGAGGAGGCA 40
ATCTACCAAT GTTGTGACCT GGACCCCGAA GCCCGTGTGG 80
CCATCAAGTC CCTCACTGAG AGGCTTTATG TTGGGGGCCC 120
TCTTACCAAT TCAAGGGGGG AGAACTGCGG CTACCGCAGG 160
TGCCGCGCAA GCGGCGTACT GACGACTAGC TGTGGTAATA 200 CCCTCACTTG TTACATCAAG GCCCGGGCAG CCTGTCGAGC 240
CGCAGGGCTC CAGGACTGCA CCATGCTCGT GTGTGGCGAC 280
SUBSTITUTE SHEET GACCTAGTCG TTATCTGCGA AAGTGCGGGG GTCCAGGAGG 320 ACGCGGCGAG CCTGAGAGCC 340
(2) INFORMATION FOR SEQ ID NO: 7
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 340 nucleotideε
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single (D) TOPOLOGY: linear
(ii) MOLECULE TYPE: DNA
(vi) ORIGINAL SOURCE: (C) INDIVIDUAL ISOLATE: nε5jl
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7
CTCCACAGTC ACTGAGAATG ACACCCGTGT TGAGGAGTCA 40
ATTTACCAAT GTTGTGACTT GGCCCCCGAA GCCAGACAGG 80 CCATAAGGTC GCTCACAGAG CGGCTCTATG TCGGGGGTCC 120
TATGACTAAC TCCAAAGGGC AGAACTGCGG CTATCGCCGG 160
TGCCGCGCGA GCGGCGTGCT GACGACTAGC TGCGGTAATA 200
CCCTCACATG CTACCTGAAG GCCACAGCGG CCTGTCGAGC 240
TGCCAAGCTC CAGGACTGCA CGATGCTCGT GAACGGAGAC 280 GACCTTGTCG TTATCTGTGA AAGCGCGGGG AACCAAGAGG 320
ACGCGGCAAG CCTACGAGCC 340
SUBSTITUTE SHEET - 60 -
(2) INFORMATION FOR SEQ ID NO: 8
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 340 nucleotideε
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: DNA
(vi) ORIGINAL SOURCE:
(C) INDIVIDUAL ISOLATE: ns5kl
( i) SEQUENCE DESCRIPTION: SEQ ID NO: 8 CTCAACGGTC ACTGAGAATG ACATCCGTGT TGAGGAGTCA 40
ATTTACCAAA GTTGTGACTT GGCCCCCGAG GCCAGACAAG 80
CCATAAGGTC GCTCACAGAG CGGCTTTACA TCGGGGGCCC 120
CCTGACTAAT TCAAAAGGGC AGAACTGCGG CTATCGCCGA 160
TGCCGCGCCA GCGGTGTGCT GACGACTAGC TGCGGTAATA 200 CCCTCACATG TTACTTGAAG GCCACTGCGG CCTGTAGAGC 240
TGCGAAGCTC CAGGACTGCA CGATGCTCGT GTGCGGAGAC 280
GACCTTGTCG TTATCTGTGA AAGCGCGGGA ACCCAGGAGG 320
ATGCGGCGAG CCTACGAGTC 340
(2) INFORMATION FOR SEQ ID NO: 9
SUBSTITUTE SHEET - 61 -
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 340 nucleotideε
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single (D) TOPOLOGY: linear
(ii) MOLECULE TYPE: DNA
(vi) ORIGINAL SOURCE: (C) INDIVIDUAL ISOLATE: nsδkl.l
( i) SEQUENCE DESCRIPTION: SEQ ID NO: 9
CTCAACGGTC ACCGAGAATG ACATCCGTGT TGAGGAGTCA 40
ATTTATCAAT GTTGTGCCTT GGCCCCCGAG GCTAGACAGG 80 CCATAAGGTC GCTCACAGAG CGGCTTTATA TCGGGGGCCC 120
CCTGACCAAT TCAAAGGGGC AGAACTGCGG TTATCGCCGG 160
TGCCGCGCCA GCGGCGTACT GACGACCAGC TGCGGTAATA 200
CCCTTACATG TTACTTGAAG GCCTCTGCAG CCTGTCGAGC 240
CGCGAAGCTC CAGGACTGCA CGATGCTCGT GTGTGGGGAC 280 GACCTTGTCG TTATCTGTGA AAGCGCGGGA ACCCAGGAGG 320
ACGCGGCGAA CCTACGAGTC 340
(2) INFORMATION FOR SEQ ID NO: 10
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 340 nucleotides
(B) TYPE: nucleic acid
SUBSTITUTE SHEET (C) STRANDEDNESS: εingle
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: DNA
(vi) ORIGINAL SOURCE:
(C) INDIVIDUAL ISOLATE: nε5gh6
( i) SEQUENCE DESCRIPTION: SEQ ID NO: 10 CTCAACGGTC ACTGAGAGTG ACATCCGTGT CGAGGAGTCG 40
ATTTACCAAT GTTGTGACTT GGCCCCCGAA GCCAGGCAGG 80
CCATAAGGTC GCTCACCGAG CGACTTTATA TCGGGGGCCC 120
CCTGACTAAT TCAAAAGGGC AGAACTGCGG TTATCGCCGG 160
TGCCGCGCGA GCGGCGTGCT GACGACTAGC TGCGGTAATA 200 CCCTCACATG TTACTTGAAG GCCTCTGCAG CCTGTCGAGC 240
TGCAAAGCTC CAGGACTGCA CGATGCTCGT GAACGGGGAC 280
GACCTTGTCG TTATCTGCGA GAGCGCGGGA ACCCAAGAGG 320
ACGCGGCGAG CCTACGAGTC 340
(2) INFORMATION FOR SEQ ID NO: 11
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 340 nucleotides
(B) TYPE: nucleic acid (C) STRANDEDNESS: single
(D) TOPOLOGY: linear
SUBSTITUTE SHEET - 63 -
(ii) MOLECULE TYPE: DNA
(vi) ORIGINAL SOURCE:
(C) INDIVIDUAL ISOLATE: nsδεpl
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11
CTCCACAGTC ACTGAGAGTG ACATCCGTGT TGAGGAGTCA 40
ATTTACCAAT GTTGTGACTT GGCCCCCGAA GCCAGACAGG 80
CTATAAGGTC GCTCACAGAG CGGCTGTACA TCGGGGGTCC 120 CCTGACTAAT TCAAAAGGGC AGAACTGCGG CTATCGCCGG 160
TGCCGCGCAA GCGGCGTGCT GACGACTAGC TGCGGTAACA 200
CCCTCACATG TTACTTGAAG GCCTCTGCGG CCTGTCGAGC 240
TGCGAAGCTC CAGGACTGCA CGATGCTCGT GTGCGGTGAC 280
GACCTTGTCG TTATCTGTGA GAGCGCGGGA ACCCAAGAGG 320 ACGCGGCGAG CCTACGAGTC 3 0
(2) INFORMATION FOR SEQ ID NO: 12
(i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 340 nucleotideε
(B) TYPE: nucleic acid
(C) STRANDEDNESS: εingle
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: DNA
(vi) ORIGINAL SOURCE:
SUBSTITUTE SHEET (C) individual iεolate: nε5εp3
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12
CTCAACAGTC ACTGAGAGTG ACATCCGTGT TGAGGAGTCA 40 ATCTACCAAT GTTGTGACTT GGCCCCCGAA GCCAGACAGG 80
CTATAAGGTC GCTCACAGAG CGGCTTTACA TCGGGGGTCC 120
CCTGACTAAT TCAAAAGGGC AGAACTGCGG CTATCGCCGG 160
TGCCGCGCAA GCGGCGTGCT GACGACTAGC TGCGGTAATA 200
CCCTCACATG TTACCTGAAG GCCAGTGCGG CCTGTCGAGC 240 TGCGAAGCTC CAGGACTGCA CAATGCTCGT GTGCGGTGAC 280
GACCTTGTCG TTATCTGTGA GAGCGCGGGG ACCCAAGAGG 320
ACGCGGCGAG CCTACGAGTC 340
(2) INFORMATION FOR SEQ ID NO: 13
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 340 nucleotideε
(B) TYPE: nucleic acid
(C) STRANDEDNESS: εingle (D) TOPOLOGY: linear
(ii) MOLECULE TYPE: DNA
(vi) ORIGINAL SOURCE: (C) INDIVIDUAL ISOLATE: nε5k2
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13
SUBSTITUTE SHEET CTCAACCGTC ACTGAGAGAG ACATCAGAAC TGAGGAGTCC 40
ATATACCGAG CCTGCTCCCT GCCTGAGGAG GCTCACATTG 80
CCATACACTC GCTGACTGAG AGGCTCTACG TGGGAGGGCC 120
CATGTTCAAC AGCAAGGGCC AGACCTGCGG GTACAGGCGT 160
TGCCGCGCCA GCGGGGTGCT CACCACTAGC ATGGGGAACA 200
CCATCACATG CTATGTAAAA GCCCTAGCGG CTTGCAAGGC 240
TGCAGGGATA GTTGCACCCT CAATGCTGGT ATGCGGCGAC 280
GACTTAGTTG TCATCTCAGA AAGCCAGGGG ACTGAGGAGG 320
ACGAGCGGAA CCTGAGAGCT . 340
(2) INFORMATION FOR SEQ ID NO: 14
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 340 nucleotideε (B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: DNA
(vi) ORIGINAL SOURCE:
(C) INDIVIDUAL ISOLATE: ns5arg8
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14 CTCTACAGTC ACGTAAAAGG ACATCACATC CTAGGAGTCC 40
ATCTACCAGT CCTGTTCACT GCCCGAGGAG GCTCGAACTG 80
CTATACACTC ACTGACTGAG AGACTATACG TAGGGGGGCC 120
SUBSTITUTE SHEET - 66 -
CATGACAAAC AGCAAGGGCC AATCCTGCGG GTACAGGCGT 160
TGCCGCGCGA GCGCAGTGCT CACCACCAGC ATGGGCAACA 200
CACTCACGTG CTACGTAAAA GCCAGGGCGG CGTGTAACGC 240
CGCGGGGATT GTTGCTCCCA CCATGCTGGT GTGCGGTGAC 280 GACCTGGTCG TCATCTCAGA GAGTCAAGGG GCTGAGGAGG 320
ACGAGCAGAA CCTGAGAGTC 340
(2) INFORMATION FOR SEQ ID NO: 15
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 340 nucleotides
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: DNA
(vi) ORIGINAL SOURCE:
(C) INDIVIDUAL ISOLATE: nsδilO
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15
CTCTACAGTC ACAGAGAGGG ACATCAGAAC CGAGGAGTCC 40
ATCTATCTGT CCTGCTCACT GCCTGAGGAG GCCCGAACTG 80
CTATACACTC ACTGACTGAG AGACTGTACG TAGGGGGGCC 120 CATGACAAAC AGCAAGGGGC AATCCTGCGG GTACAGGCGT 160
TGCCGCGCGA GCGGAGTGCT CACCACCAGC ATGGGCAACA 200
CGCTCACGTG CTACGTGAAA GCCAGAGCGG CGTGTAACGC 240
SUBSTITUTE SHEET CGCGGGCATT GTTGCTCCCA CCATGTTGGT GTGCGGCGAC 280 GACCTGGTTG TCATCTCAGA GAGTCAGGGG GTCGAGGAAG 320 ATGAGCGGAA CCTGAGAGTC 340
(2) INFORMATION FOR SEQ ID NO: 16
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 340 nucleotides
(B) TYPE: nucleic acid (C) STRANDEDNESS: εingle
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: DNA
(vi) ORIGINAL SOURCE:
(C) INDIVIDUAL ISOLATE: nε5arg6
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16
CTCTACAGTC ACGGAGAGGG ACATCAGAAC CGAGGAGTCC 40 ATCTATCTGT CCTGTTCACT GCCTGAGGAG GCTCGAACTG 80
CCATACACTC ACTGACTGAG AGGCTGTACG TAGGGGGGCC 120
CATGACAAAC AGCAAAGGGC AATCCTGCGG GTACAGGCGT 160
TGCCGCGCGA GCGGAGTGCT CACCACCAGC ATGGGTAACA 200
CACTCACGTG CTACGTGAAA GCTAAAGCGG CATGTAACGC 240 CGCGGGCATT GTTGCCCCCA CCATGTTGGT GTGCGGCGAC 280
GACCTAGTCG TCATCTCAGA GAGTCAAGGG GTCGAGGAGG 320
ATGAGCGAAA CCTGAGAGCT 3 0
SUBSTITUTE SHEET - 68 -
(2) INFORMATION FOR SEQ ID NO: 17
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 340 nucleotides
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: DNA
(vi) ORIGINAL SOURCE:
(C) INDIVIDUAL ISOLATE: ns5k2b
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17 CTCAACCGTC ACGGAGAGGG ACATAAGAAC AGAAGAATCC 40
ATATATCAGG GTTGTTCCCT GCCTCAGGAG GCTAGAACTG 80
CTATCCACTC GCTCACTGAG AGACTGTACG TAGGAGGGCC 120
CATGACAAAC AGCAAGGGAC AATCCTGCGG TTACAGGCGT 160
TGCCGCGCCA GCGGGGTCTT CACCACCAGC ATGGGGAATA 200 CCATGACATG CTACATCAAA GCCCTTGCAG CGTGCAAAGC 240
TGCAGGGATC GTGGACCCTA TCATGCTGGT GTGTGGAGAC 280
GACCTGGTCG TCATCTCGGA GAGCGAAGGT AACGAGGAGG 320
ACGAGCGAAA CCTGAGAGCT 340
(2) INFORMATION FOR SEQ ID NO: 18
(i) SEQUENCE CHARACTERISTICS:
SUBSTITUTE SHEET (A) LENGTH: 340 nucleotideε
(B) TYPE: nucleic acid
(C) STRANDEDNESS: εingle
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: DNA
(vi) ORIGINAL SOURCE:
(C) INDIVIDUAL ISOLATE: nε5sa283
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18
CTCGACCGTT ACCGAACATG ACATAATGAC TGAAGAGTCT 40
ATTTACCAAT CATTGTACTT GCAGCCTGAG GCGCGTGTGG 80
CAATACGGTC ACTCACCCAA CGCCTGTACT GTGGAGGCCC 120 CATGTATAAC AGCAAGGGGC AACAATGTGG TTATCGTAGA 160
TGCCGCGCCA GCGGCGTCTT CACCACTAGT ATGGGCAACA 200
CCATGACGTG CTACATTAAG GCTTTAGCCT CCTGTAGAGC 240
CGCAAAGCTC CAGGACTGCA CGCTCCTGGT GTGTGGTGAT 320
GATAAAGCGA CCTGAGAGCC 340
(2) INFORMATION FOR SEQ ID NO: 19
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 340 nucleotides (B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
SUBSTITUTE SHEET (ii) MOLECULE TYPE: DNA
(vi) ORIGINAL SOURCE:
(C) INDIVIDUAL ISOLATE: ns5εal56
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19
CTCGACCGTT ACCGAACATG ACATAATGAC TGAAGAGTCC 40
ATTTACCAAT CATTGTACTT GCAGCCTGAG GCACGCGCGG 80
CAATACGGTC ACTCACCCAA CGCCTGTACT GTGGAGGCCC 120 CATGTATAAC AGCAAGGGGC AACAATGTGG TTACCGTAGA 160
TGCCGCGCCA GCGGCGTCTT CACCACCAGT ATGGGCAACA 200
CCATGACGTG CTACATCAAG GCTTCAGCCG CCTGTAGAGC 240
TGCAAAGCTC CAGGACTGCA CGCTCCTGGT GTGTGGTGTG 280
ACCTTGGTGG CCATTTGCGA GAGCCAAGGG ACGCACGAGG 320 ATGAAGCGTG CCTGAGAGTC 340
(2) INFORMATION FOR SEQ ID NO: 20
(i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 340 nucleotideε
(B) TYPE: nucleic acid
(C) STRANDEDNESS: εingle
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: DNA
(vi) ORIGINAL SOURCE:
SUBSTITUTE SHEET - 71 -
(C) INDIVIDUAL ISOLATE: ns5ill
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20
CTCTACTGTC ACTGAACAGG ACATCAGGGT GGAAGAGGAG 40 ATATACCAGT GCTGTAACCT TGAACCGGAG GCCAGGAAAG 80
TGATCTCCTC CCTCACGGAG CGGCTTTACT GCGGGGGCCC 120
TATGTTCAAC AGCAAGGGGG CCCAGTGTGG TTATCGCCGT 160
TGCCGTGCTA GTGGAGTCCT GCCTACCAGC TTCGGCAACA 200
CAATCACTTG TTACATCAAG GCTAGAGCGG CTTCGAAGGC 240 CGCAGGCCTC CGGAACCCGG ACTTTCTTGT CTGCGGAGAT 280
GATCTGGTCG TGGTGGCTGA GAGTGATGGC GTCGACGAGG 320
ATAGAGCAGC CCTGAGAGCC 340
(2) INFORMATION FOR SEQ ID NO: 21
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 340 nucleotides
(B) TYPE: nucleic acid
(C) STRANDEDNESS: εingle (D) TOPOLOGY: linear
(ii) MOLECULE TYPE: DNA
(vi) ORIGINAL SOURCE: (C) INDIVIDUAL ISOLATE: nε5i4
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21
SUBSTITUTE SHEET CTCGACTGTC ACTGAACAGG ACATCAGGGT GGAAGAGGAG 40
ATATACCAAT GCTGTAACCT TGAACCGGAG GCCAGGAAAG 80
TGATCTCCTC CCTCACGGAG CGGCTTTACT GCGGGGGCCC 120
TATGTTCAAT AGCAAGGGGG CCCAGTGTGG TTATCGCCGT 160
TGCCGTGCTA GTGGAGTTCT GCCTACCAGC TTCGGCAACA 200
CAATCACTTG TTACATCAAG GCTAGAGCGG CTGCGAAGGC 240
CGCAGGGCTC CGGACCCCGG ACTTTCTCGT CTGCGGAGAT 280
GATCTGGTTG TGGTGGCTGA GAGTGATGGC GTCGACGAGG 320
ATAGAACAGC CCTGCGAGCC 340
(2) INFORMATION FOR SEQ ID NO: 22
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 340 nucleotideε (B) TYPE: nucleic acid
(C) STRANDEDNESS: εingle
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: DNA
(vi) ORIGINAL SOURCE:
(C) INDIVIDUAL ISOLATE: ns5gh8
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22 CTCAACTGTC ACTGAACAGG ACATCAGGGT GGAAGAGGAG 40
ATATACCAAT GCTGTAACCT TGAACCGGAG GCCAGGAAAG 80
TGATCTCCTC CCTCACGGAA CGGCTTTACT GCGGGGGCCC 120
SUBSTITUTE SHEET - 73 -
TATGTTCAAC AGCAAGGGGG CCCAGTGTGG TTATCGCCGT 160
TGCCGTGCCA GTGGAGTTCT GCCTACCAGC TTCGGCAACA 200
CAATCACTTG TTACATCAAA GCTAGAGCGG CTGCCGAAGC 240
CGCAGGCCTC CGGAACCCGG ACTTTCTTGT CTGCGGAGAT 280 GATCTGGTTG TGGTGGCTGA GAGTGATGGC GTCAATGAGG 320
ATAGAGCAGC CCTGGGAGCC 340
(2) INFORMATION FOR SEQ ID NO: 23
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 100 nucleotides
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: DNA
(vi) ORIGINAL SOURCE: (ATCC * 40394) (C) INDIVIDUAL ISOLATE: hcvl
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23
GACGGCGTTG GTAATGGCTC AGCTGCTCCG GATCCCACAA 40
GCCATCTTGG ACATGATCGC TGGTGCTCAC TGGGGAGTCC 80
TGGCGGGCAT AGCGTATTTC 100
(2)* INFORMATION FOR SEQ ID NO: 24
SUBSTITUTE SHEET (i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 100 nucleotides
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single (D) TOPOLOGY: linear
(ii) MOLECULE TYPE: DNA
(vi) ORIGINAL SOURCE: (C) INDIVIDUAL ISOLATE: US5
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24
GACGGCGTTG GTGGTAGCTC AGGTACTCCG GATCCCACAA 40
GCCATCATGG ACATGATCGC TGGAGCCCAC TGGGGAGTCC 80 TGGCGGGCAT AGCGTATTTC 100
(2) INFORMATION FOR SEQ ID NO: 25
(i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 100 nucleotides
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: DNA
(vi) ORIGINAL SOURCE:
SUBSTITUTE SHEET - 75 -
(C) INDIVIDUAL ISOLATE: AUS5
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25
AACGGCGCTG GTAGTAGCTC AGCTGCTCAG GGTCCCGCAA 40 GCCATCGTGG ACATGATCGC TGGTGCCCAC TGGGGAGTCC 80
TAGCGGGCAT AGCGTATTTT 100
(2) INFORMATION FOR SEQ ID NO: 26
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 100 nucleotides
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: DNA
(vi) ORIGINAL SOURCE:
(C) INDIVIDUAL ISOLATE: US4
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26
GACAGCCCTA GTGGTATCGC AGTTACTCCG GATCCCACAA 40
GCCGTCATGG ATATGGTGGC GGGGGCCCAC TGGGGAGTCC 80
TGGCGGGCCT TGCCTACTAT 100
(2) INFORMATION FOR SEQ ID NO: 27
SUBSTITUTE SHEET (i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 100 nucleotides
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single (D) TOPOLOGY: linear
(ii) MOLECULE TYPE: DNA
(vi) ORIGINAL SOURCE: (C) INDIVIDUAL ISOLATE: ARG2
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27
AGCAGCCCTA GTGGTGTCGC AGTTACTCCG GATCCCACAA 40
AGCATCGTGG ACATGGTGGC GGGGGCCCAC TGGGGAGTCC 80 TGGCGGGCCT TGCTTACTAT 100
(2) INFORMATION FOR SEQ ID NO: 28
(i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 100 nucleotides
(B) TYPE: nucleic acid
(C) STRANDEDNESS: εingle
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: DNA
(vi) ORIGINAL SOURCE:
SUBSTITUTE SHEET (C) INDIVIDUAL ISOLATE: 115
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28
GGCAGCCCTA GTGGTGTCGC AGTTACTCCG GATCCCGCAA 40 GCTGTCGTGG ACATGGTGGC GGGGGCCCAC TGGGGAATCC 80
TAGCGGGTCT TGCCTACTAT 100
(2) INFORMATION FOR SEQ ID NO: 29
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 100 nucleotides
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: DNA
(vi) ORIGINAL SOURCE:
(C) INDIVIDUAL ISOLATE: GH8
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29
TGTGGGTATG GTGGTGGCGC ACGTCCTGCG TTTGCCCCAG 40
ACCTTGTTCG ACATAATAGC CGGGGCCCAT TGGGGCATCT 80
TGGCGGGCTT GGCCTATTAC 100
(2) INFORMATION FOR SEQ ID NO: 30
SUBSTITUTE SHEET (i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 100 nucleotides
(B) TYPE: nucleic acid
(C) STRANDEDNESS: εingle (D) TOPOLOGY: linear
(ii) MOLECULE TYPE: DNA
(vi) ORIGINAL SOURCE: (C) INDIVIDUAL ISOLATE: 14
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30
TGTGGGTATG GTGGTAGCAC ACGTCCTGCG TCTGCCCCAG 40
ACCTTGTTCG ACATAATAGC CGGGGCCCAT TGGGGCATCT 80 TGGCAGGCCT AGCCTATTAC 100
(2) INFORMATION FOR SEQ ID NO: 31
(i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 100 nucleotides
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: DNA
(vi) ORIGINAL SOURCE:
SUBSTITUTE SHEET - 79 -
. (C) INDIVIDUAL ISOLATE: 111
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31
TGTGGGTATG GTGGTGGCGC AAGTCCTGCG TTTGCCCCAG 40 ACCTTGTTCG ACGTGCTAGC CGGGGCCCAT TGGGGCATCT 80
TGGCGGGCCT GGCCTATTAC 100
(2) INFORMATION FOR SEQ ID NO: 32
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 100 nucleotides
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: DNA
(vi) ORIGINAL SOURCE:
(C) INDIVIDUAL ISOLATE: 110
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32
TACCACTATG CTCCTGGCAT ACTTGGTGCG CATCCCGGAG 40
GTCATCCTGG ACATTATCAC GGGAGGACAC TGGGGCGTGA 80
TGTTTGGCCT GGCTTATTTC 100
(2) INFORMATION FOR SEQ ID NO: 33
SUBSTITUTE SHEET (i). SEQUENCE CHARACTERISTICS:
(A) LENGTH: 252 nucleotides
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single (D) TOPOLOGY: linear
(ii) MOLECULE TYPE: DNA
(vi) ORIGINAL SOURCE: (ATCC # 40394) (C) INDIVIDUAL ISOLATE: hcvl
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33
GTTAGTATGA GTGTCGTGCA GCCTCCAGGA CCCCCCCTCC 40
CGGGAGAGCC ATAGTGGTCT GCGGAACCGG TGAGTACACC 80 GGAATTGCCA GGACGACCGG GTCCTTTCTT GGATCAACCC 120
GCTCAATGCC TGGAGATTTG GGCGTGCCCC CGCAAGACTG 160
CTAGCCGAGT AGTGTTGGGT CGCGAAAGGC CTTGTGGTAC 200
TGCCTGATAG GGTGCTTGCG AGTGCCCCGG GAGGTCTCGT 240
AGACCGTGCA CC 252
(2) INFORMATION FOR SEQ ID NO: 34
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 252 nucleotides (B) TYPE: nucleic acid
(C) STRANDEDNESS: εingle
(D) TOPOLOGY: linear
SUBSTITUTE SHEET - 81 -
(ii) MOLECULE TYPE: DNA
(vi) ORIGINAL SOURCE:
(C) INDIVIDUAL ISOLATE: uε5
( i) SEQUENCE DESCRIPTION: SEQ ID NO: 34
GTTAGTATGA GTGTCGTGCA GCCTCCAGGA CCCCCCCTCC 40
CGGGAGAGCC ATAGTGGTCT GCGGAACCGG TGAGTACACC 80
GGAATTGCCA GGACGACCGG GTCCTTTCTT GGATCAACCC 120 GCTCAATGCC TGGAGATTTG GGCGTGCCCC CGCAAGACTG 160
CTAGCCGAGT AGTGTTGGGT CGCGAAAGGC CTTGTGGTAC 200
TGCCTGATAG GGTGCTTGCG AGTGCCCCGG GAGGTCTCGT 240
AGACCGTGCA CC 252
(2) INFORMATION FOR SEQ ID NO: 35
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 252 nucleotides
(B) TYPE: nucleic acid (C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: DNA
(vi) ORIGINAL SOURCE:
(C) INDIVIDUAL ISOLATE: ausl
SUBSTITUTE SHEET (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35
GTTAGTATGA GTGTCGTGCA GCCTCCAGGA CCCCCCCTCC 40
CGGGAGAGCC ATAGTGGTCT GCGGAACCGG TGAGTACACC 80
GGAATTGCCA GGACGACCGG GTCCTTTCTT GGATCAACCC 120 GCTCAATGCC TGGAGATTTG GGCACGCCCC CGCAAGATCA 160
CTAGCCGAGT AGTGTTGGGT CGCGAAAGGC CTTGTGGTAC 200
TGCCTGATAG GGTGCTTGCG AGTGCCCCGG GAGGTCTCGT 2 0
AGACCGTGCA CC 252
(2) INFORMATION FOR SEQ ID NO: 36
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 252 nucleotideε
(B) TYPE: nucleic acid (C) STRANDEDNESS: εingle
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: DNA
(vi) ORIGINAL SOURCE:
(C) INDIVIDUAL ISOLATE: εp2
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 36
GTTAGTATGA GTGTCGTGCA GCCTCCAGGA CCCCCCCTCC 40 CGGGAGAGCC ATAGTGGTCT GCGGAACCGG TGAGTACACC 80
GGAATTGCCA GGACGACCGG GTCCTTTCTT GGATAAACCC 120
GCTCAATGCC TGGAGATTTG GGCGTGCCCC CGCGAGACTG 160
SUBSTITUTE SHEET CTAGCCGAGT AGTGTTGGGT CGCGAAAGGC CTTGTGGTAC 200 TGCCTGATAG GGTGCTTGCG AGTGCCCCGG GAGGTCTCGT 240 AGACCGTGCA CC 252
(2) INFORMATION FOR SEQ ID NO: 37
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 252 nucleotideε
(B) TYPE: nucleic acid (C) STRANDEDNESS: εingle
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: DNA
(vi) ORIGINAL SOURCE:
(C) INDIVIDUAL ISOLATE: gm2
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 37
GTTAGTATGA GTGTCGTGCA GCCTCCAGGA CCCCCCCTCC 40 CGGGAGAGCC ATAGTGGTCT GCGGAACCGG TGAGTACACC 80
GGAATTGCCA GGACGACCGG GTCCTTTCTT GGATCAACCC 120
GCTCAATGCC TGGAGATTTG GGCGTGCCCC CGCAAGACTG 160
CTAGCCGAGT AGTGTTGGGT CGCGAAAGGC CTTGTGGTAC 200
TGCCTGATAG GGTGCTTGCG AGTGCCCCGG GAGGTCTCGT 240 AGACCGTGCA CC 252
(2) INFORMATION FOR SEQ ID NO: 38
SUBSTITUTE SHEET (i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 252 nucleotideε
(B) TYPE: nucleic acid
(C) STRANDEDNESS: εingle (D) TOPOLOGY: linear
(ii) MOLECULE TYPE: DNA
(vi) ORIGINAL SOURCE: (C) INDIVIDUAL ISOLATE: i21
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 38
GTTAGTATGA GTGTCGTGCA GCCTCCAGGA CCCCCCCTCC 40
CGGGAGAGCC ATAGTGGTCT GCGGAACCGG TGAGTACACC 80 GGAATTGCCA GGACGACCGG GTCCTTTCTT GGATAAACCC 120
GCTCAATGCC TGGAGATTTG GGCGTGCCCC CGCAAGACTG 160
CTAGCCGAGT AGTGTTGGGT CGCGAAAGGC CTTGTGGTAC 200
TGCCTGATAG GGTGCTTGCG AGTGCCCCGG GAGGTCTCGT 240
AGACCGTGCA CC 252
(2) INFORMATION FOR SEQ ID NO: 39
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 252 nucleotideε (B) TYPE: nucleic acid
(C) STRANDEDNESS: εingle
(D) TOPOLOGY: linear
SUBSTITUTE SHEET (ii) MOLECULE TYPE: DNA
(vi) ORIGINAL SOURCE:
(C) INDIVIDUAL ISOLATE: uε4
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 39
GTTAGTATGA GTGTCGTGCA GCCTCCAGGA CCCCCCCTCC 40
CGGGAGAGCC ATAGTGGTCT GCGGAACCGG TGAGTACACC 80
GGAATTGCCA GGACGACCGG GTCCTTTCTT GGATCAACCC 120 GCTCAATGCC TGGAGATTTG GGCGTGCCCC CGCGAGACTG 160
CTAGCCGAGT AGTGTTGGGT CGCGAAAGGC CTTGTGGTAC 200
TGCCTGATAG GGTGCTTGCG AGTGCCCCGG GAGGTCTCGT 240
AGACCGTGCA CC 252
(2) INFORMATION FOR SEQ ID NO: 40
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 252 nucleotideε
(B) TYPE: nucleic acid (C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: DNA
(vi) ORIGINAL SOURCE:
(C) INDIVIDUAL ISOLATE: jhl
SUBSTITUTE SHEET (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 40
GTTAGTATGA GTGTCGTGCA GCCTCCAGGA CCCCCCCTCC 40
CGGGAGAGCC ATAGTGGTCT GCGGAACCGG TGAGTACACC 80
GGAATTGCCA GGACGACCGG GTCCTTTCTT GGATCAACCC 120 GCTCAATGCC TGGAGATTTG GGCGTGCCCC CGCGAGACTG 160
CTAGCCGAGT AGTGTTGGGT CGCGAAAGGC CTTGTGGTAC 200
TGCCTGATAG GGTGCTTGCG AGTGCCCCGG GAGGTCTCGT 240
AGACCGTGCA TC 252
(2) INFORMATION FOR SEQ ID NO: 41
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 252 nucleotides
(B) TYPE: nucleic acid (C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: DNA
(vi) ORIGINAL SOURCE:
(C) INDIVIDUAL ISOLATE: nac5
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 41
GTTAGTATGA GTGTCGTGCA GCCTCCAGGA CCCCCCCTCC 40 CGGGAGAGCC ATAGTGGTCT GCGGAACCGG TGAGTACACC 80
GGAATTGCCA GGACGACCGG GTCCTTTCTT GGATCAACCC 120
GCTCAATGCC TGGAGATTTG GGCGTGCCCC CGCGAGACTG 160
SUBSTITUTE SHEET CTAGCCGAGT AGTGTTGGGT CGCGAAAGGC CTTGTGGTAC 200 TGCCTGATAG GGTGCTTGCG AGTGCCCCGG GAGGTCTCGT 240 AGACCGTGCA CC 252
(2) INFORMATION FOR SEQ ID NO: 42
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 252 nucleotides
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single (D) TOPOLOGY: linear
(ii) MOLECULE TYPE: DNA
(vi) ORIGINAL SOURCE: (C) INDIVIDUAL ISOLATE: arg2
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 42
GTTAGTATGA GTGTCGTGCA GCCTCCAGGA CCCCCCCTCC 40
CGGGAGAGCC ATAGTGGTCT GCGGAACCGG TGAGTACACC 80 GGAATTGCCA GGACGACCGG GTCCTTTCTT GGATCAACCC 120
GCTCAATGCC TGGAGATTTG GGCGTGCCCC CGCGAGACTG 160
CTAGCCGAGT AGTGTTGGGT CGCGAAAGGC CTTGTGGTAC 200
TGCCTGATAG GGTGCTTGCG AGTGCCCCGG GAGGTCTCGT 240
AGACCGTGCA CC 252
(2 ) INFORMATION FOR SEQ ID NO : 43
SUBSTITUTE SHEET (i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 252 nucleotides
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single (D) TOPOLOGY: linear
(ii) MOLECULE TYPE: DNA
(vi) ORIGINAL SOURCE: (C) INDIVIDUAL ISOLATE: εpl
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 43
GTTAGTATGA GTGTCGTGCA GCCTCCAGGA CCCCCCCTCC 40
CGGGAGAGCC ATAGTGGTCT GCGGAACCGG TGAGTACACC 80 GGAATTGCCA GGACGACCGG GTCCTTTCTT GGATCAACCC 120
GCTCAATGCC TGGAGATTTG GGCGTGCCCC CGCGAGACTG 160
CTAGCCGAGT AGTGTTGGGT CGCGAAAGGC CTTGTGGTAC 200
TGCCTGATAG GGTGCTTGCG AGTGCCCCGG GAGGTCTCGT 240
AGACCGTGCA CC 252
(2) INFORMATION FOR SEQ ID NO: 44
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 252 nucleotideε (B) TYPE: nucleic acid
(C) STRANDEDNESS: εingle
(D) TOPOLOGY: linear
SUBSTITUTE SHEET (ii) MOLECULE TYPE: DNA
(vi) ORIGINAL SOURCE:
(C) INDIVIDUAL ISOLATE: ghl
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 44
GTTAGTATGA GTGTCGTGCA GCCTCCAGGA CCCCCCCTCC 40
CGGGAGAGCC ATAGTGGTCT GCGGAACCGG TGAGTACACC 80
GGAATTGCCA GGACGACCGG GTCCTTTCTT GGATCAACCC 120 GCTCAATGCC TGGAGATTTG GGCGTGCCCC CGCGAGACTG 160
CTAGCCGAGT AGTGTTGGGT CGCGAAAGGC CTTGTGGTAC 200
TGCCTGATAG GGTGCTTGCG AGTGCCCCGG GAGGTCTCGT 240
AGACCGTGCA CC 252
(2) INFORMATION FOR SEQ ID NO: 45
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 252 nucleotides
(B) TYPE: nucleic acid (C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: DNA
(vi) ORIGINAL SOURCE:
(C) INDIVIDUAL ISOLATE: il5
SUBSTITUTE SHEET (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 45
GTTAGTATGA GTGTCGTGCA GCCTCCAGGA CCCCCCCTCC 40
CGGGAGAGCC ATAGTGGTCT GCGGAACCGG TGAGTACACC 80
GGAATTGCCA GGACGACCGG GTCCTTTCTT GGATCAACCC 120 GCTCAATGCC TGGAGATTTG GGCGTGCCCC CGCGAGACTG 160
CTAGCCGAGT AGTGTTGGGT CGCGAAAGGC CTTGTGGTAC 200
TGCCTGATAG GGTGCTTGCG AGTGCCCCGG GAGGTCTCGT. 240
AGACCGTGCA CC 252
(2) INFORMATION FOR SEQ ID NO: 46
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 252 nucleotideε
(B) TYPE: nucleic acid (C) STRANDEDNESS: εingle
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: DNA
(vi) ORIGINAL SOURCE:
(C) INDIVIDUAL ISOLATE: ilO
SUBSTITUTE SHEET (Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 46
GCTAGTATCA GTGTCGTACA GCCTCCAGGC CCCCCCCTCC 40
CGGGAGAGCC ATAGTGGTCT GCGGAACCGG TGAGTACACC 80
GGAATTGCCG GGAAGACTGG GTCCTTTCTT GGATAAACCC 120 ACTCTATGCC CGGCCATTTG GGCGTGCCCC CGCAAGACTG 160
CTAGCCGAGT AGCGTTGGGT TGCGAAAGGC CTTGTGGTAC 200
TGCCTGATAG GGTGCTTGCG AGTGCCCCGG GAGGTCTCGT 240
AGACCGTGCA TC 252
(2) INFORMATION FOR SEQ ID NO: 47
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 252 nucleotideε
(B) TYPE: nucleic acid (C) STRANDEDNESS: εingle
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: DNA
(vi) ORIGINAL SOURCE:
(C) INDIVIDUAL ISOLATE: arg6
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 47
GTTAGTATGA GTCTCGTACA GCCTCCAGGC CCCCCCCTCC 40 CGGGAGAGCC ATAGTGGTCT GCGGAACCGG TGAGTACACC 80
GGAATTGCTG GGAAGACTGG GTCCTTTCTT GGATAAACCC 120
ACTCTATGCC CAGCCATTTG GGCGTGCCCC CGCAAGACTG 160
SUBSTITUTE SHEET CTAGCCGAGT AGCGTTGGGT TGCGAAAGGC CTTGTGGTAC 200 TGCCTGATAG GGTGCTTGCG AGTGCCCCGG GAGGTCTCGT 240 AGACCGTGCA TC 252
(2) INFORMATION FOR SEQ ID NO: 48
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 252 nucleotideε
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single (D) TOPOLOGY: linear
(ii) MOLECULE TYPE: DNA
(vi) ORIGINAL SOURCE: (C) INDIVIDUAL ISOLATE: ε21
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 48
GTTAGTACGA GTGTCGTGCA GCCTCCAGGA CTCCCCCTCC 40 CGGGAGAGCC ATAGTGGTCT GCGGAACCGG TGAGTACACC 80
GGAATCGCTG GGGTGACCGG GTCCTTTCTT GGAGCAACCC 120
GCTCAATACC CAGAAATTTG GGCGTGCCCC CGCGAGATCA 160
CTAGCCGAGT AGTGTTGGGT CGCGAAAGGC CTTGTGGTAC 200
TGCCTGATAG GGTGCTTGCG AGTGCCCCGG GAGGTCTCGT 240 AGACCGTGCA AC 252
(2) INFORMATION FOR SEQ ID NO: 49
SUBSTITUTE SHEET (i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 252 nucleotideε
(B) TYPE: nucleic acid (C) STRANDEDNESS: εingle
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: DNA
(vi) ORIGINAL SOURCE:
(C) INDIVIDUAL ISOLATE: gj61329
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 49
GTTAGTACGA GTGTCGTGCA GCCTCCAGGA CCCCCCCTCC 40
CGGGAGAGCC ATAGTGGTCT GCGGAACCGG TGAGTACACC 80
GGAATCGCTG GGGTGACCGG GTCCTTTCTT GGAGTAACCC 120
GCTCAATACC CAGAAATTTG GGCGTGCCCC CGCGAGATCA 160
CTAGCCGAGT AGTGTTGGGT CGCGAAAGGC CTTGTGGTAC 200 TGCCTGATAG GGTGCTTGCG AGTGCCCCGG GAGGTCTCGT 240
AGACCGTGCA AC 252
(2) INFORMATION FOR SEQ ID NO: 50
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 180 nucleotideε
SUBSTITUTE SHEET (B) TYPE: nucleic acid
(C) STRANDEDNESS: εingle
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: DNA (vi) ORIGINAL SOURCE:
(C) INDIVIDUAL ISOLATE: εa3
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 50
GTTAGTATGA GTGTCGAACA GCCTCCAGGA CCCCCCCTCC 40
CGGGAGAGCC ATAGTGGTCT GCGGAACCGG TGAGTACACC 80
GGAATTGCCG GGATGACCGG GTCCTTTCTT GGATAAACCC 120
GCTCAATGCC CGGAGATTTG GGCGTGCCCC CGCGAGACTG 160 CTAGCCGAGT AGTGTTGGGT 180
(2) INFORMATION FOR SEQ ID NO: 51
(i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 180 nucleotideε
(B) TYPE: nucleic acid
(C) STRANDEDNESS: εingle
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: DNA
(vi) ORIGINAL SOURCE:
SUBSTITUTE SHEET (C) INDIVIDUAL ISOLATE: sa4
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 51 GTTAGTATGA GTGTCGAACA GCCTCCAGGA CCCCCCCTCC 40 CGGGAGAGCC ATAGTGGTCT GCGGAACCGG TGAGTACACC 80
GGAATTGCCG GGATGACCGG GTCCTTTCTT GGATAAACCC 120 GCTCAATGCC CGGAGATTTG GGCGTGCCCC CGCGAGACTG 160 CTAGCCGAGT AGTGTTGGGT 180
(2) INFORMATION FOR SEQ ID NO: 52
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 549 nucleotideε (B) TYPE: nucleic acid
(C) STRANDEDNESS: εingle
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: DNA
(vi) ORIGINAL SOURCE: (ATCC # 40394) (C) INDIVIDUAL ISOLATE: hcvl
SUBSTITUTE SHEET (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 52
ATGAGCACGA ATCCTAAACC TCAAAAAAAA AACAAACGTA 40
ACACCAACCG TCGCCCACAG GACGTCAAGT TCCCGGGTGG 80
CGGTCAGATC GTTGGTGGAG TTTACTTGTT GCCGCGCAGG 120 GGCCCTAGAT TGGGTGTGCG CGCGACGAGA AAGACTTCCG 160
AGCGGTCGCA ACCTCGAGGT AGACGTCAGC CTATCCCCAA 200
GGCTCGTCGG CCCGAGGGCA GGACCTGGGC TCAGCCCGGG 240
TACCCTTGGC CCCTCTATGG CAATGAGGGC TGCGGGTGGG 280
CGGGATGGCT CCTGTCTCCC CGTGGCTCTC GGCCTAGCTG 320 GGGCCCCACA GACCCCCGGC GTAGGTCGCG CAATTTGGGT 360
AAGGTCATCG ATACCCTTAC GTGCGGCTTC GCCGACCTCA 400
TGGGGTACAT ACCGCTCGTC GGCGCCCCTC TTGGAGGCGC 440
TGCCAGGGCC CTGGCGCATG GCGTCCGGGT TCTGGAAGAC 480
GGCGTGAACT ATGCAACAGG GAACCTTCCT GGTTGCTCTT 520 TCTCTATCTT CCTTCTGGCC CTGCTCTCT 549
(2) INFORMATION FOR SEQ ID NO: 53
(i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 549 nucleotideε
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: DNA
(vi) ORIGINAL SOURCE:
SUBSTITUTE SHEET (C) INDIVIDUAL ISOLATE: US5
( i) SEQUENCE DESCRIPTION: SEQ ID NO: 53
ATGAGCACGA ATCCTAAACC TCAAAGAAAA ACCAAACGTA 40 ACACCAACCG TCGCCCACAG GACGTCAAGT TCCCGGGTGG 80
CGGTCAGATC GTTGGTGGAG TTTACTTGTT GCCGCGCAGG 120
GGCCCTAGAT TGGGTGTGCG CGCGACGAGG AAGACTTCCG 160
AGCGGTCGCA ACCTCGAGGT AGACGTCAGC CTATCCCCAA 200
GGCGCGTCGG CCCGAGGGCA GGACCTGGGC TCAGCCCGGG 240 TACCCTTGGC CCCTCTATGG CAATGAGGGT TGCGGGTGGG 280
CGGGATGGCT CCTGTCTCCC CGTGGCTCTC GGCCTAGTTG 320
GGGCCCCACA GACCCCCGGC GTAGGTCGCG CAATTTGGGT 360
AAGGTCATCG ATACCCTTAC GTGCGGCTTC GCCGACCACA 400
TGGGGTACAT ACCGCTCGTC GGCGCCCCTC TTGGAGGCGC 440 TGCCAGGGCT CTGGCGCATG GCGTCCGGGT TCTGGAAGAC 480
GGCGTGAACT ATGCAACAGG GAACCTTCCT GGTTGCTCTT 520
TCTCTATCTT CCTTCTGGCC CTGCTCTCT 549
(2) INFORMATION FOR SEQ ID NO: 54
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 549 nucleotides
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single (D) TOPOLOGY: linear
(ii) MOLECULE TYPE: DNA
SUBSTITUTE SHEET (vi) ORIGINAL SOURCE:
(C) INDIVIDUAL ISOLATE: auεl
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 54
ATGAGCACGA ATCCTAAACC TCAAAGAAAA ACCAAACGTA 40
ACACCAACCG TCGCCCACAG GACGTTAAGT TCCCGGGTGG 80
CGGTCAGATC GTTGGTGGAG TTTACTTGTT GCCGCGCAGG 120
GGCCCTAGAT TGGGTGTGCG CGCGACGAGG AAGACTTCCG 160 AGCGGTCGCA ACCTCGAGGT AGACGTCAGC CTATCCCTAA 200
GGCGCGTCGG CCCGAGGGCA GGACCTGGGC TCAGCCCGGG 240
TACCCCTGGC CCCTCTATGG TAATGAGGGT TGCGGATGGG 280
CGGGATGGCT CCTGTCCCCC CGTGGCTCTC GGCCTAGTTG 320
GGGCCCTACA GACCCCCGGC GTAGGTCGCG CAATTTGGGT 360 AAGGTCATCG ATACCCTCAC GTGCGGCTTC GCCGACCACA 400
TGGGGTACAT TCCGCTCGTT GGCGCCCCTC TTGGGGGCGC 440
TGCCAGGGCC CTGGCGCATG GCGTCCGGGT TCTGGAAGAC 480
GGCGTGAACT ATGCAACAGG GAATCTTCCT GGTTGCTCTT 520
TCTCTATCTT CCTTCTGGCC CTTCTCTCT 549
(2) INFORMATION FOR SEQ ID NO: 55
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 549 nucleotideε (B) TYPE: nucleic acid
(C) STRANDEDNESS: εingle
SUBSTITUTE SHEET (D) TOPOLOGY: linear
(ii) MOLECULE TYPE: DNA
(vi) ORIGINAL SOURCE:
(C) INDIVIDUAL ISOLATE: εp2 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 55
ATGAGCACGA ATCCTAAACC TCAAAGAAAA ACCAAACGTA 40
ACACCAACCG TCGCCCACAG GACGTCAAGT TCCCGGGTGG 80 CGGTCAGATC GTTGGTGGAG TTTACTTGTT GCCGCGCAGG 120
GGCCCTAGAT TGGGTGTGCG CACGACGAGG AAGACTTCCG 160
AGCGGTCGCA ACCTCGAGGT AGACGTCAGC CCATCCCCAA 200
GGCTCGTCGA CCCGAGGGCA GGACCTGGGC TCAGCCCGGG 240
TACCCTTGGC CCCTCTATGG CAATGAGGGC TGCGGGTGGG 280 CGGGATGGCT CCTGTCTCCC CGTGGCTCTC GGCCTAGCTG 320
GGGCCCCACA GACCCCCGGC GTAGGTCGCG CAATTTGGGT 360
AAGGTCATCG ATACCCTTAC GTGCGGCTTC GCCGACCTCA 400
TGGGGTACAT ACCGCTCGTC GGCGCCCCTC TTGGAGGCGC 440
TGCCAGAGCC CTGGCGCATG GCGTCCGGGT TCTGGAAGAC 480 GGCGTGAACT ATGCAACAGG GAACCTTCCC GGTTGCTCTT 520
TCTCTATCTT CCTTCTGGCC CTGCTCTCT 549
(2) INFORMATION FOR SEQ ID NO: 56
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 549 nucleotides
(B) TYPE: nucleic acid
SUBSTITUTE SHEET (C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: DNA
(vi) ORIGINAL SOURCE:
(C) INDIVIDUAL ISOLATE: gm2
(xi) SEQUENCE DESCRIPTION: SEQ ATGAGCACGA ATCCTAAACC TCAAAGAAGA ACACCAACCG TCGCCCACAG GACGTCAAGT CGGTCAGATC GTTGGTGGAG TTTACTTGTT GGCCCTAGAT TGGGTGTGCG CGCGACGAGG AGCGGTCGCA ACCTCGAGGT AGACGTCAGC GGCACGTCGG CCCGAGGGTA GGACCTGGGC TACCCTTGGC CCCTCTATGG CAATGAGGGT CGGGATGGCT CCTGTCTCCC CGCGGCTCTC GGGCCCCACA GACCCCCGGC GTAGGTCGCG AAGGTCATCG ATACCCTTAC GTGCGGCTTC TGGGGTACAT ACCGCTCGTC GGCGCCCCTC TGCCAGGGCC CTGGCGCATG GCGTCCGGGT GGCGTGAACT ATGCAACAGG GAACCTTCCT TCTCTATCTT CCTTCTGGCC CTGCTCTCT
(2) INFORMATION FOR SEQ ID NO: 57
(i) SEQUENCE CHARACTERISTICS:
SUBSTITUTE SHEET (A) LENGTH: 549 nucleotideε
(B) TYPE: nucleic acid
(C) STRANDEDNESS: εingle
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: DNA (vi) ORIGINAL SOURCE:
(C) INDIVIDUAL ISOLATE: i21
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 57
ATGAGCACGA ATCCTAAACC TCAAAGAAAA ACCAAACGTA 40
ACACCAACCG TCGCCCACAG GACGTCAAGT TCCCGGGTGG 80
CGGTCAGATC GTTGGTGGAG TTTACTTGTT GCCGCGCAGG 120
GGCCCTAGAT TGGGTGTGCG CGCGACGAGG AAGACTTCCG 160 AGCGGTCGCA ACCTCGTGGT AGACGCCAGC CTATCCCCAA 200
GGCGCGTCGG CCCGAGGGCA GGACCTGGGC TCAGCCCGGG 240
TACCCTTGGC CCCTCTATGG CAATGAGGGT TGCGGGTGGG 280
CGGGATGGCT CCTGTCTCCC CGTGGCTCTC GGCCTAGCTG 320
GGGCCCCACA GACCCCCGGC GTAGGTCGCG CAATTTGGGT 360 AAGGTCATCG ATACCCTTAC GTGCGGCTTC GCCGACCTCA 400
TGGGGTACAT ACCGCTCGTC GGCGCCCCTC TTGGAGGCGC 440
TGCCAGGGCC CTGGCGCATG GCGTCCGGGT TCTGGAAGAC 480
GGCGTGAACT ATGCAACAGG GAACCTTCCT GGTTGCTCTT 520
TTTCTATTTT CCTTCTGGCC CTGCTCTCT 549
(2 ) INFORMATION FOR SEQ ID NO : 58
SUBSTITUTE SHEET (i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 549 nucleotideε
(B) TYPE: nucleic acid (C) STRANDEDNESS: εingle
(D) TOPOLOGY: linear (ii) MOLECULE TYPE: DNA
(vi) ORIGINAL SOURCE: (C) INDIVIDUAL ISOLATE: uε4
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 58
ATGAGCACGA ATCCTAAACC TCAAAGAAAA ACCAAACGTA 40
ACACCAACCG CCGCCCACAG GACGTTAAGT TCCCGGGCGG 80 TGGCCAGGTC GTTGGTGGAG TTTACCTGTT GCCGCGCAGG 120
GGCCCCAGGT TGGGTGTGCG CGCGACTAGG AAGACTTCCG 160
AGCGGTCGCA ACCTCGTGGA AGGCGACAAC CTATCCCCAA 200
GGCTCGCCAG CCCGAGGGCA GGGCCTGGGC TCAGCCCGGG 240
TACCCTTGGC CCCTCTATGG CAATGAGGGT ATGGGGTGGG 280 CAGGATGGCT CCTGTCACCC CGTGGCTCTC GGCCTAGTTG 320
GGGCCCCACG GACCCCCGGC GTAGGTCGCG TAATTTGGGT 360
AAGGTCATCG ATACCCTCAC ATGCGGCTTC GCCGACCTCA 400
TGGGGTACAT TCCGCTCGTC GGCGCCCCCC TTAGGGGCGC 440
TGCCAGGGCC TTGGCGCATG GCGTCCGGGT TCTGGAGGAC 480 GGCGTGAACT ACGCAACAGG GAATCTGCCC GGTTGCTCCT 520
TTTCTATCTT CCTCTTGGCT CTGCTGTCC 549
SUBSTITUTE SHEET (2) INFORMATION FOR SEQ ID NO: 59
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 549 nucleotideε
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: DNA
(Vi) ORIGINAL SOURCE:
(C) INDIVIDUAL ISOLATE: jhl
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 59 ATGAGCACAA ATCCTAAACC TCAAAGAAAA ACCAAACGTA 40
ACACCAACCG CCGCCCACAG GACGTCAAGT TCCCGGGCGG 80
TGGTCAGATC GTTGGTGGAG TTTACCTGTT GCCGCGCAGG 120
GGCCCCAGGT TGGGTGTGCG CGCGACTAGG AAGACTTCCG 160
AGCGGTCGCA ACCTCGTGGA AGGCGACAAC CTATCCCCAA 200 GGCTCGCCAG CCCGAGGGCA GGGCCTGGGC TCAGCCCGGG 2 0
TACCCTTGGC CCCTCTATGG CAACGAGGGT ATGGGGTGGG 280
CAGGATGGCT CCTGTCACCC CGTGGCTCTC GGCCTAGTTG 320
GGGCCCCACG GACCCCCGGC GTAGGTCGCG TAATTTGGGT 360
AAGGTCATCG ATACCCTCAC ATGCGGCTTC GCCGACCTCA 400 TGGGGTACAT TCCGCTTGTC GGCGCCCCCC TAGGGGGCGC 440
TGCCAGGGCC CTGGCACATG GTGTCCGGGT TCTGGAGGAC 480
GGCGTGAACT ATGCAACAGG GAATTTGCCC GGTTGCTCTT 520
SUBSTITUTE SHEET TCTCTATCTT CCTCTTGGCT CTGCTGTCC 549
(2) INFORMATION FOR SEQ ID NO: 60
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 549 nucleotides
(B) TYPE: nucleic acid
(C) STRANDEDNESS: εingle
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: DNA
(vi) ORIGINAL SOURCE:
(C) INDIVIDUAL ISOLATE: nac5
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 60
ATGAGCACAA ATCCTAAACC CCAAAGAAAA ACCAAACGTA 40
ACACCAACCG TCGCCCACAG GACGTCAAGT TCCCGGGCGG 80
TGGTCAGATC GTTGGTGGAG TTTACCTGTT GCCGCGCAGG 120 GGCCCCAGGT TGGGTGTGCG CGCGACTAGG AAGACTTCCG 160
AGCGGTCGCA ACCTCGTGGA AGGCGACAAC CTATCCCCAA 200
GGCTCGCCGG CCCGAGGGCA GGTCCTGGGC TCAGCCCGGG 240
TACCCTTGGC CCCTCTATGG CAACGAGGGT ATGGGGTGGG 280
CAGGATGGCT CCTGTCACCC CGCGGCTCCC GGCCTAGTTG 320 GGGCCCCACG GACCCCCGGC GTAGGTCGCG TAATTTGGGT 360
AAGGTCATCG ATACCCTCAC ATGCGGCTTC GCCGACCTCA 400
SUBSTITUTE SHEET TGGGGTACAT TCCGCTCGTC GGCGCCCCCC TAGGGGGCGC 440 TGCCAGGGCC CTGGCACATG GTGTCCGGGT TCTGGAGGAC 480 GGCGTGAACT ATGCAACAGG GAATTTGCCT GGTTGCTCTT 520 TCTCTATCTT CCTCTTGGCT CTGCTGTCC 549
(2) INFORMATION FOR SEQ ID NO: 61
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 549 nucleotideε
(B) TYPE: nucleic acid (C) STRANDEDNESS: εingle
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: DNA
(vi) ORIGINAL SOURCE:
(C) INDIVIDUAL ISOLATE: arg2
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 61
ATGAGCACGA ATCCTAAACC TCAAAGAAAA ACCAAACGTA 40 ACACCAACCG CCGCCCACAG GACGTCAAGT TCCCGGGCGG 80
TGGTCAGATC GTTGGTGGAG TTTACTTGTT GCCGCGCAGG 120
GGCCCCAGGT TGGGTGTGCG CGCGACTAGG AAGACTTCCG 160
AGCGGTCGCA ACCTCGTGGA AGGCGACAAC CTATCCCCAA 200
GGCTCGCCAG CCCGAGGGTA GGGCCTGGGC TCAGCCCGGG 240 TACCCTTGGC CCCTCTATGG CAATGAGGGT ATGGGGTGGG 280
CAGGGTGGCT CCTGTCCCCC CGCGGCTCCC GGCCTAGTTG 320
SUBSTITUTE SHEET GGGCCCCACA GACCCCCGGC GTAGGTCGCG TAATTTGGGT 360
AAGGTCATCG ATACCCTCAC ATGCGGCTTC GCCGACCTCA 400
TGGGGTACAT TCCGCTCGTC GGCGCCCCCC TAGGGGGCGC 440
TGCCAGGGCC CTGGCGCATG GCGTCCGGGT TCTGGAGGAC 480 GGCGTGAACT ATGCAACAGG GAATCTGCCC GGTTGCTCTT 520
TCTCTATCTT CCTCTTGGCT TTGCTGTCC 549 (2) INFORMATION FOR SEQ ID NO: 62
(i). SEQUENCE CHARACTERISTICS: (A) LENGTH: 549 nucleotideε
(B) TYPE: nucleic acid
(C) STRANDEDNESS: εingle
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: DNA
(vi) ORIGINAL SOURCE:
(C) INDIVIDUAL ISOLATE: εpl
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 62
ATGAGCACGA ATCCTAAACC TCAAAGAAAA ACCAAACGTA 40
ACACCAACCG CCGCCCACAG GACGTCAAGT TCCCGGGCGG 80
TGGTCAGATC GTTGGTGGAG TTTACCTGTT GCCGCGCAGG 120
GGCCCCAGGT TGGGTGTGCG CGCGACTAGG AAGACTTCCG 160 AGCGGTCGCA ACCTCGTGGA AGGCGACAAC CTATCCCCAA 200
GGCTCGCCGG CCCGAGGGCA GGGCCTGGGC TCAGCCCGGG 240
TATCCTTGGC CCCTCTATGG CAATGAGGGT CTGGGGTGGG 280
SUBSTITUTE SHEET CAGGATGGCT CCTGTCACCC CGCGGCTCTC GGCCTAGCTG 320
GGGCCCTACC GACCCCCGGC GTAGGTCGCG CAACTTGGGT 360
AAGGTCATCG ATACCCTTAC GTGCGGCTTC GCCGACCTCA 400
TGGGGTACAT TCCGCTCGTC GGCGCCCCCC TTAGGGGCGC 440
TGCCAGGGCC CTGGCGCATG GCGTCCGGGT TCTGGAGGAC 480
GGCGTGAACT ATGCAACAGG GAATTTGCCC GGTTGCTCTT 520
TCTCTATCTT CCTCTTGGCT TTGCTGTCC 549
(2) INFORMATION FOR SEQ ID NO: 63
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 549 nucleotideε
(B) TYPE: nucleic acid
(C) STRANDEDNESS: εingle (D) TOPOLOGY: linear
(ii) MOLECULE TYPE: DNA
(vi) ORIGINAL SOURCE: (C) INDIVIDUAL ISOLATE: ghl
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 63
ATGAGCACGA ATCCTAAACC TCAAAGAAAA ACCAAACGTA 40
ACACCAACCG CCGCCCACAG GACGTCAAGT TCCCGGGCGG 80 TGGTCAGATC GTTGGTGGAG TTTACTTGTT GCCGCGCAGG 120
GGCCCCAGGT TGGGTGTGCG CGCGACTAGG AAGACTTCCG 160
AGCGGTCGCA ACCTCGTGGA AGGCGACAAC CTATCCCCAA 200
SUBSTITUTE SHEET GGCTCGCCGG CCCGAGGGCA GGGCCTGGGC TCAGCCCGGG 240
TACCCTTGGC CCCTCTATGG CAATGAGGGT ATGGGGTGGG 280
CAGGATGGCT CCTGTCACCC CGTGGTTCTC GGCCTAGTTG 320
GGGCCCCACG GACCCCCGGC GTAGGTCGCG CAATTTGGGT 360
AAGATCATCG ATACCCTCAC GTGCGGCTTC GCCGACCTCA 400
TGGGGTACAT TCCGCTCGTC GGCGCCCCCC TAGGGGGCGC 440
TGCCAGGGCC CTGGCGCATG GCGTCCGGGT TCTGGAGGAC 480
GGCGTGAACT ATGCAACAGG GAATCTGCCC GGTTGCTCCT 520
TTTCTATCTT CCTTCTGGCT TTGCTGTCC 549
(2) INFORMATION FOR SEQ ID NO: 64
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 549 nucleotideε (B) TYPE: nucleic acid
(C) STRANDEDNESS: εingle
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: DNA
(vi) ORIGINAL SOURCE:
(C) INDIVIDUAL ISOLATE: il5
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 64 ATGAGCACGA ATCCTAAACC TCAAAGAAAA ACCAAACGTA 40
ACACCAACCG CCGCCCACAG GACGTCAAGT TCCCGGGCGG 80
TGGTCAGATC GTTGGTGGAG TTTACCTGTT GCCGCGCAGG 120
SUBSTITUTE SHEET GGCCCCAGGT TGGGTGTGCG CGCGACTAGG AAGACTTCCG 160
AGCGGTCGCA ACCTCGTGGA AGGCGACAAC CTATCCCCAA 200
GGCTCGCCAG CCCGAGGGCA GGGCCTGGGC TCAGCCCGGG 240
TACCCCTGGC CCCTCTATGG CAATGAGGGT ATGGGGTGGG 280 CAGGATGGCT CCTGTCACCC CGCGGCTCCC GGCCTAGTTG 320
GGGCCCCAAA GACCCCCGGC GTAGGTCGCG TAATTTGGGT 360
AAGGTCATCG ATACCCTCAC ATGCGGCTTC GCCGACCTCA 400
TGGGGTACAT TCCGCTCGTC GGCGCCCCCT TAGGGGGCGC 440
TGCCAGGGCC CTGGCGCATG GCGTCCGGGT TCTGGAGGAC 480 GGCGTGAACT ATGCAACAGG GAATCTACCC GGTTGCTCTT 520
TCTCTATCTT CCTCTTGGCT TTGCTGTCC 549
(2) INFORMATION FOR SEQ ID NO: 65
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 549 nucleotideε
(B) TYPE: nucleic acid
(C) STRANDEDNESS: εingle
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: DNA
(vi) ORIGINAL SOURCE:
(C) INDIVIDUAL ISOLATE: ilO
( i) SEQUENCE DESCRIPTION: SEQ ID NO: 65 ATGAGCACAA ATCCTAAACC TCAAAGAAAA ACCAAAAGAA 40
SUBSTITUTE SHEET ACACTAACCG CCGCCCACAG GACGTCAAGT TCCCGGGCGG 80
TGGCCAGATC GTTGGCGGAG TATACTTGCT GCCGCGCAGG 120
GGCCCGAGAT TGGGTGTGCG CGCGACGAGG AAAACTTCCG 160
AACGATCCCA GCCACGCGGA AGGCGTCAGC CCATCCCTAA 200 AGATCGTCGC ACCGCTGGCA AGTCCTGGGG AAGGCCAGGA 240
TATCCTTGGC CCCTGTATGG GAATGAGGGT CTCGGCTGGG 280
CAGGGTGGCT CCTGTCCCCC CGTGGCTCTC GCCCTTCATG 320
GGGCCCCACT GACCCCCGGC ATAGATCGCG CAACTTGGGT 360
AAGGTCATCG ATACCCTAAC GTGCGGTTTT GCCGACCTCA 400 TGGGGTACAT TCCCGTCATC GGCGCCCCCG TTGGAGGCGT 440
TGCCAGAGCT CTCGCCCACG GAGTGAGGGT TCTGGAGGAT 480
GGGGTAAATT ATGCAACAGG GAATTTGCCC GGTTGCTCTT 520
TCTCTATCTT TCTCTTAGCC CTCTTGTCT 549
(2) INFORMATION FOR SEQ ID NO: 66
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 510 nucleotides
(B) TYPE: nucleic acid (C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: DNA
(vi) ORIGINAL SOURCE:
(C) INDIVIDUAL ISOLATE: arg6
SUBSTITUTE SHEET - Ill -
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 66
ATGAGCACAA ATCCTCAACC TCAAAGAAAA ACCAAAAGAA 40
ACACTAACCG CCGCCCACAG GACGTCAAGT TCCCGGGCGG 80
TGGTCAGATC GTTGGCGGAG TATACTTGTT GCCGCGCAGG 120 GGCCCCAGGT TGGGTGTGCG CGCGACGAGG AAAACTTCCG 160
AACGGTCCCA GCCACGTGGG AGGCGCCAGC CCATCCCCAA 200
AGATCGGCGC ACCACTGGCA.AGTCCTGGGG GAAGCCAGGA 240
TACCCTTGGC CCCTGTATGG GAATGAGGGT CTCGGCTGGG 280
CAGGGTGGCT CCTGTCCCCC CGCGGTTCTC GCCCTTCATG 320 GGGCCCCACT GACCCCCGGC ATAGATCACG CAACTTGGGT 360
AAGGTCATCG ATACCCTAAC GTGTGGTTTT GCCGACCTCA 400
TGGGGTACAT TCCCGTCGGT GGTGCCCCCG TTGGTGGTGT 4 0
CGCCAGAGCC CTTGCCCATG GGGTGAGGGT TCTGGAAGAC 480
GGGATAAATT ATGCAACAGG GAATCTGCCC 510
(2) INFORMATION FOR SEQ ID NO: 67
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 29 nucleotideε (B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: DNA
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 67 CAAACGTAAC ACCAACCGRC GCCCACAGG 29
SUBSTITUTE SHEET (2) INFORMATION FOR SEQ ID NO: 68
(i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 24 nucleotides
(B) TYPE: nucleic acid
(C) STRANDEDNESS:.. single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: DNA
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 68 ACAGAYCCGC AKAGRTCCCC CACG 24
(2) INFORMATION FOR SEQ ID NO: 69
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 30 nucleotides
(B) TYPE: nucleic acid (C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: DNA
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 69
CGAACCTCGA GGTAGACGTC AGCCTATCCC 30
SUBSTITUTE SHEET (2) INFORMATION FOR SEQ ID NO: 70
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 30 nucleotides
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: DNA
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 70 GCAACCTCGT GGAAGGCGAC AACCTATCCC 30
(2) INFORMATION FOR SEQ ID NO: 71
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 30 nucleotides
(B) TYPE: nucleic acid
(C) STRANDEDNESS: εingle (D) TOPOLOGY: linear
(ii) MOLECULE TYPE: DNA
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 71 GTCACCAATG ATTGCCCTAA CTCGAGTATT 30
(2) INFORMATION FOR SEQ ID NO: 72
SUBSTITUTE SHEET (i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 26 nucleotideε
(B) TYPE: nucleic acid (C) STRANDEDNESS: εingle
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: DNA
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 72
GTCACGAACG ACTGCTCCAA CTCAAG 26
(2) INFORMATION FOR SEQ ID NO: 73
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 28 nucleotideε
(B) TYPE: nucleic acid
(C) STRANDEDNESS: εingle
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: DNA
( i) SEQUENCE DESCRIPTION: SEQ ID NO: 73 TGGACATGAT CGCTGGWGCY CACTGGGG 28
(2) INFORMATION FOR SEQ ID NO: 74
SUBSTITUTE SHEET (i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 28 nucleotideε
(B) TYPE: nucleic acid (C) STRANDEDNESS: single
(D) TOPOLOGY: linear (ii) MOLECULE TYPE: DNA
( i) SEQUENCE DESCRIPTION: SEQ ID NO: 74 TGGAYATGGT GGYGGGGGCY CACTGGGG 28
(2) INFORMATION FOR SEQ ID NO: 75
(i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 20 nucleotides
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: DNA
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 75 ATGATGAACT GGTCVCCYAC 20
(2) INFORMATION FOR SEQ ID NO: 76
(i) SEQUENCE CHARACTERISTICS:
SUBSTITUTE SHEET (A) LENGTH: 26 nucleotides
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: DNA
( i) SEQUENCE DESCRIPTION: SEQ ID NO: 76
ACCTTVGCCC AGTTSCCCRC CATGGA 26
(2) INFORMATION FOR SEQ ID NO: 77
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 22 nucleotides
(B) TYPE: nucleic acid (C) STRANDEDNESS: εingle
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: DNA
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 77
AACCCACTCT ATGYCCGGYC AT 22
(2) INFORMATION FOR SEQ ID NO: 78
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 18 nucleotideε
(B) TYPE: nucleic acid
SUBSTITUTE SHEET (C) STRANDEDNESS: εingle
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: DNA
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 78 GAATCGCTGG GGTGACCG 18
(2) INFORMATION FOR SEQ ID NO: 79
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 28 nucleotideε
(B) TYPE: nucleic acid
(C) STRANDEDNESS: εingle (D) TOPOLOGY: linear
(ii) MOLECULE TYPE: DNA
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 75 CCATGAATCA CTCCCCTGTG AGGAACTA 28
(2) INFORMATION FOR SEQ ID NO: 80
(i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 18 nucleotides
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
SUBSTITUTE SHEET (D) TOPOLOGY: linear
(ii) MOLECULE TYPE: DNA
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 80
TTGCGGGGGC ACGCCCAA 18
(2) INFORMATION FOR SEQ ID NO: 81
(i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 33 nucleotideε
(B) TYPE: nucleic acid
(C) STRANDEDNESS: εingle
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: DNA
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 81 YGAAGCGGGC ACAGTCARRC AAGARAGCAG GGC 33
(2) INFORMATION FOR SEQ ID NO: 82
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 33 nucleotideε (B) TYPE: nucleic acid
(C) STRANDEDNESS: single
SUBSTITUTE SHEET (D) TOPOLOGY: linear
(ii) MOLECULE TYPE: DNA
( i) SEQUENCE DESCRIPTION: SEQ ID NO: 82
RTARAGCCCY GWGGAGTTGC GCACTTGGTR GGC 33
(2) INFORMATION FOR SEQ ID NO: 83
(i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 33 nucleotides
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: DNA
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 83 RATACTCGAG TTAGGGCAAT CATTGGTGAC RTG 33
(2) INFORMATION FOR SEQ ID NO: 84
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 33 nucleotides
(B) TYPE: nucleic acid (C) STRANDEDNESS: single
(D) TOPOLOGY: .linear
SUBSTITUTE SHEET (ii) MOLECULE TYPE: DNA
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 84 AGYRTGCAGG ATGGYATCRK BCGYCTCGTA CAC 33
(2) INFORMATION FOR SEQ ID NO: 85
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 33 nucleotides
(B) TYPE: nucleic acid (C) STRANDEDNESS: εingle
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: DNA
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 85
GTTRCCCTCR CGAACGCAAG GGACRCACCC CGG 33
(2) INFORMATION FOR SEQ ID NO: 86
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 33 nucleotideε
(B) TYPE: nucleic acid
(C) STRANDEDNESS: εingle
(D) TOPOLOGY: linear
( i i ) MOLECULE- TYPE : DNA
SUBSTITUTE SHEET (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 86 CGTRGGGGTY AYCGCCACCC AACACCTCGA GRC 33
(2) INFORMATION FOR SEQ ID NO: 87
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 33 nucleotides
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single (D) TOPOLOGY: linear
(ii) MOLECULE TYPE: DNA
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 87 CGTYGYGGGG AGTTTGCCRT CCCTGGTGGC YAC 33
(2) INFORMATION FOR SEQ ID NO: 88
(i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 33 nucleotides
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: DNA
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 88
SUBSTITUTE SHEET CCCGACAAGC AGATCGATGT GACGTCGAAG CTG 33
(2) INFORMATION FOR SEQ ID NO: 89
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 33 nucleotides
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: DNA
( i) SEQUENCE DESCRIPTION: SEQ ID NO: 89 CCCCACGTAG ARGGCCGARC AGAGRGTGGC GCY 33
(2) INFORMATION FOR SEQ ID NO: 90
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 33 nucleotides (B) TYPE: nucleic acid
(C) STRANDEDNESS: εingle
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: DNA
( i) SEQUENCE DESCRIPTION: SEQ ID NO: 90 YTGRCCGACA AGAAAGACAG ACCCGCAYAR GTC 33
SUBSTITUTE SHEET (2) INFORMATION FOR SEQ ID NO: 91
(i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 33 nucleotides
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: DNA
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 91 CGTCCAGTGG YGCCTGGGAG AGAAGGTGAA CAG 33
(2) INFORMATION FOR SEQ ID NO: 92
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 33 nucleotideε
(B) TYPE: nucleic acid (C) STRANDEDNESS: εingle
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: DNA
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 92
GCCGGGATAG ATRGARCAAT TGCARYCTTG CGT 33
SUBSTITUTE SHEET (2) INFORMATION FOR SEQ ID NO: 93
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 33 nucleotideε
(B) TYPE: nucleic acid
(C) STRANDEDNESS: εingle
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: DNA
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 93 CATATCCCAT GCCATGCGGT GACCCGTTAY ATG 33
(2) INFORMATION FOR SEQ ID NO: 94
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 33 nucleotides
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single (D) TOPOLOGY: linear
(ii) MOLECULE TYPE: DNA
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 94 YACCAAYGCC GTCGTAGGGG ACCARTTCAT CAT 33
(2) INFORMATION FOR SEQ ID NO: 95
SUBSTITUTE SHEET (i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 33 nucleotides
(B) TYPE: nucleic acid (C) STRANDEDNESS: single
(D) TOPOLOGY: linear (ii) MOLECULE TYPE: DNA
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 95 GATGGCTTGT GGGATCCGGA GYASCTGAGC YAY 33
(2) INFORMATION FOR SEQ ID NO: 96
(i) SEQUENCE CHARACTERISTICS: (A)' LENGTH: 33 nucleotides
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: DNA
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 96 GACTCCCCAG TGRGCWCCAG CGATCATRTC CAW 33
(2) INFORMATION FOR SEQ ID NO: 97
(i) SEQUENCE CHARACTERISTICS:
SUBSTITUTE SHEET (A) LENGTH: 33 nucleotides
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: DNA
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 97
CCCCACCATG GAGAAATACG CTATGCCCGC YAG 33
(2) INFORMATION FOR SEQ ID NO: 98
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 33 nucleotides
(B) TYPE: nucleic acid (C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: DNA
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 98
TAGYAGCAGY ACTACYARGA CCTTCGCCCA GTT 33
(2) INFORMATION FOR SEQ ID NO: 99
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 33 nucleotides
(B) TYPE: nucleic acid
SUBSTITUTE SHEET (C) STRANDEDNESS: εingle
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: DNA
( i) SEQUENCE DESCRIPTION: SEQ ID NO: 99 GSTGACGTGR GTKTCYGCGT CRACGCCGGC RAA 33
(2) INFORMATION FOR SEQ ID NO: 100
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 33 nucleotideε
(B) TYPE: nucleic acid
(C) STRANDEDNESS: εingle (D) TOPOLOGY: linear
(ii) MOLECULE TYPE: DNA
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 100 GGAAGYTGGG ATGGTYARRC ARGASAGCAR AGC 33
(2) INFORMATION FOR SEQ ID NO: 101
(i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 33 nucleotideε
(B) TYPE: nucleic acid
(C) STRANDEDNESS: εingle
SUBSTITUTE SHEET (D) TOPOLOGY: linear
(ii) MOLECULE TYPE: DNA
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 101
GTAYAYYCCG GACRCGTTGC GCACTTCRTA AGC 33
(2) INFORMATION FOR SEQ ID NO: 102
(i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 33 nucleotideε
(B) TYPE: nucleic acid
(C) STRANDEDNESS: εingle
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: DNA
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 102 AATRCTTGMG TTGGAGCART CGTTYGTGAC ATG 33
(2) INFORMATION FOR SEQ ID NO: 103
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 33 nucleotideε
(B) TYPE: nucleic acid (C) STRANDEDNESS: εingle
(D) TOPOLOGY: linear
SUBSTITUTE SHEET (ii) MOLECULE TYPE: DNA
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 103 RGYRTGCATG ATCAYGTCCG YYGCCTCATA CAC 33
(2) INFORMATION FOR SEQ ID NO: 104
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 33 nucleotideε
(B) TYPE: nucleic acid (C) STRANDEDNESS: εingle
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: DNA
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 104
RTTGTYYTCC CGRACGCARG GCACGCACCC RGG 33
(2) INFORMATION FOR SEQ ID NO: 105
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 33 nucleotideε
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: DNA
SUBSTITUTE SHEET ( i) SEQUENCE DESCRIPTION: SEQ ID NO: 105 CGTGGGRGTS AGCGCYACCC AGCARCGGGA GSW 33
(2) INFORMATION FOR SEQ ID NO: 106
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 33 nucleotides
(B) TYPE: nucleic acid
(C) STRANDEDNESS: εingle (D) TOPOLOGY: linear
(ii) MOLECULE TYPE: DNA
( i) SEQUENCE DESCRIPTION: SEQ ID NO: 106 YGTRGTGGGG AYGCTGKHRT TCCTGGCCGC VAR 33
(2) INFORMATION FOR SEQ ID NO: 107
(i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 33 nucleotideε
(B) TYPE: nucleic acid
(C) STRANDEDNESS: εingle
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: DNA
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 107
SUBSTITUTE SHEET CCCRACGAGC AARTCGACRT GRCGTCGTAW TGT 33
(2) INFORMATION FOR SEQ ID NO: 108
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 33 nucleotideε
(B) TYPE: nucleic acid
(C) STRANDEDNESS: εingle
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: DNA
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 108 YCCCACGTAC ATAGCSGAMS AGARRGYAGC CGY 33
(2) INFORMATION FOR SEQ ID NO: 109
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 33 nucleotideε (B) TYPE: nucleic acid
(C) STRANDEDNESS: εingle
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: DNA
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 109 CTGGGAGAYR AGRAAAACAG ATCCGCARAG RTC 33
SUBSTITUTE SHEET (2) INFORMATION FOR SEQ ID NO: 110
(i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 33 nucleotideε
(5) TYPE: nucleic acid
(C) STRANDEDNESS: εingle
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: DNA
( i) SEQUENCE DESCRIPTION: SEQ ID NO: 110 YGTCTCRTGC CGGCCAGSBG AGAAGGTGAA YAG 33
(2) INFORMATION FOR SEQ ID NO: 111
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 33 nucleotides
(B) TYPE: nucleic acid (C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: DNA
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 111
GCCGGGATAG AKKGAGCART TGCAKTCCTG YAC 33
SUBSTITUTE SHEET (2) INFORMATION FOR SEQ ID NO: 112
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 33 nucleotideε
(B) TYPE: nucleic acid
(C) STRANDEDNESS: εingle
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: DNA
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 112 CATATCCCAA GCCATRCGRT GGCCTGAYAC CTG 33
(2) INFORMATION FOR SEQ ID NO: 113
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 33 nucleotideε
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single (D) TOPOLOGY: linear
(ii) MOLECULE TYPE: DNA
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 113 CACTARGGCT GYYGTRGGYG ACCAGTTCAT CAT 33
(2) INFORMATION FOR SEQ ID NO: 114
SUBSTITUTE SHEET (i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 33 nucleotides
(B) TYPE: nucleic acid (C) STRANDEDNESS: single
(D) TOPOLOGY: linear (ii) MOLECULE TYPE: DNA
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 114 GACRGCTTGT GGGATCCGGA GTAACTGCGA YAC 33
(2) INFORMATION FOR SEQ ID NO: 115
(i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 33 nucleotides
(B) TYPE: nucleic acid
(C) STRANDEDNESS: εingle
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: DNA
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 115 GACTCCCCAG TGRGCCCCCG CCACCATRTC CAT 33
(2) INFORMATION FOR SEQ ID NO: 116
(i) SEQUENCE CHARACTERISTICS:
SUBSTITUTE SHEET (A) LENGTH: 33 nucleotides
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: DNA
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 116
SCCCACCATG GAWWAGTAGG CAAGGCCCGC YAG 33
(2) INFORMATION FOR SEQ ID NO: 117
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 33 nucleotides
(B) TYPE: nucleic acid (C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: DNA
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 117
GAGTAGCATC ACAATCAADA CCTTAGCCCA GTT 33
(2) INFORMATION FOR SEQ ID NO: 118
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 33 nucleotides
(B) TYPE: nucleic acid
SUBSTITUTE SHEET (C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: DNA
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 118 YGWCRYGYRG GTRTKCCCGT CAACGCCGGC AAA 33
(2) INFORMATION FOR SEQ ID NO: 119
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 33 nucleotideε
(B) TYPE: nucleic acid
(C) STRANDEDNESS: εingle (D) TOPOLOGY: linear
(ii) MOLECULE TYPE: DNA
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 119 TCCTCACAGG GGAGTGATTC ATGGTGGAGT GTC 33
(2) INFORMATION FOR SEQ ID NO: 120
(i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 33 nucleotideε
(B) TYPE: nucleic acid
(C) STRANDEDNESS: εingle
SUBSTITUTE SHEET (D) TOPOLOGY: linear
(ii) MOLECULE TYPE: DNA
( i) SEQUENCE DESCRIPTION: SEQ ID NO: 120
ATGGCTAGAC GCTTTCTGCG TGAAGACAGT AGT 33
(2) INFORMATION FOR SEQ ID NO: 121
(i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 33 nucleotideε
(B) TYPE: nucleic acid
(C) STRANDEDNESS: εingle
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: DNA
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 121 GCCTGGAGGC TGCACGRCAC TCATACTAAC GCC 33
(2) INFORMATION FOR SEQ ID NO: 122
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 33 nucleotideε
(B) TYPE: nucleic acid (C) STRANDEDNESS: εingle
(D) TOPOLOGY: linear
SUBSTITUTE SHEET - 138 -
(ii) MOLECULE TYPE: DNA
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 122 CGCAGACCAC TATGGCTCTY CCGGGAGGGG GGG 33
(2) INFORMATION FOR SEQ ID NO: 123 (i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 33 nucleotideε
(B) TYPE: nucleic acid (C) STRANDEDNESS: εingle
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: DNA
( i) SEQUENCE DESCRIPTION: SEQ ID NO: 123
TCRTCCYGGC AATTCCGGTG TACTCACCGG TTC 33
(2) INFORMATION FOR SEQ ID NO: 124
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 33 nucleotideε
(B) TYPE: nucleic acid
(C) STRANDEDNESS: εingle
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: DNA
SUBSTITUTE SHEET (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 124 GCATIGAGCG GGTTDATCCA AGAAAGGACC CGG 33
(2) INFORMATION FOR SEQ ID NO: 125
(i) SEQUENCE CHARACTERISTICS:
(A) -LENGTH: 33 nucleotideε
(B) TYPE: nucleic acid
(C) STRANDEDNESS: εingle (D) TOPOLOGY: linear
(ii) MOLECULE TYPE: DNA
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 125 AGCAGTCTYG CGGGGGCACG CCCAARTCTC CAG 33
(2) INFORMATION FOR SEQ ID NO: 126
(i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 33 nucleotides
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: DNA
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 126
SUBSTITUTE SHEET ACAAGGCCTT TCGCGACCCA ACACTACTCG GCT 33
(2) INFORMATION FOR SEQ ID NO: 127
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 33 nucleotideε
(B) TYPE: nucleic acid
(C) STRANDEDNESS: εingle
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: DNA
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 127 GGGGCACTCG CAAGCACCCT ATCAGGCAGT ACC 33
(2) INFORMATION FOR SEQ ID NO: 128
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 33 nucleotideε (B) TYPE: nucleic acid
(C) STRANDEDNESS: εingle
(D) TOPOLOGY: linear
SUBSTITUTE SHEET (ii) MOLECULE TYPE: DNA
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 128 YGTGCTCATG RTGCACGGTC TACGAGACCT CCC 33
(2) INFORMATION FOR SEQ ID NO: 129
(i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 33 nucleotideε
(B) TYPE: nucleic acid
(C) STRANDEDNESS: εingle
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: DNA
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 129 GTTACGTTTG KTTYTTYTTT GRGGTTTRGG AWT 33
(2) INFORMATION FOR SEQ ID NO: 130
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 33 nucleotideε
(B) TYPE: nucleic acid (C) STRANDEDNESS: εingle
(D) TOPOLOGY: linear
SUBSTITUTE SHEET (ii) MOLECULE TYPE: DNA
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 130 CGGGAACTTR ACGTCCTGTG GGCGRCGGTT GGT 33
(2) INFORMATION FOR SEQ ID NO: 131
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 33 nucleotideε (B) TYPE: nucleic acid
(C) STRANDEDNESS: εingle
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: DNA
( i) SEQUENCE DESCRIPTION: SEQ ID NO: 131 CARGTAAACT CCACCRACGA TCTGRCCRCC RCC 33
(2) INFORMATION FOR SEQ ID NO: 132
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 33 nucleotideε
(B) TYPE: nucleic acid
(C) STRANDEDNESS: εingle (D) TOPOLOGY: linear
(ii) MOLECULE TYPE: DNA
SUBSTITUTE SHEET ( i) SEQUENCE DESCRIPTION: SEQ ID NO: 132 RCGCACACCC AAYCTRGGGC CCCTGCGCGG CAA 33
(2) INFORMATION FOR SEQ ID NO: 133
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 33 nucleotideε
(B) TYPE: nucleic acid (C) STRANDEDNESS: εingle
(D) TOPOLOGY: linear (ii) MOLECULE TYPE: DNA
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 133 AGGTTGCGAC CGCTCGGAAG TCTTYCTRGT CGC 33
(2) INFORMATION FOR SEQ ID NO: 134
(i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 33 nucleotideε
(B) TYPE: nucleic acid
(C) STRANDEDNESS: εingle
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: DNA
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 134
SUBSTITUTE SHEET RCGHRCCTTG GGGATAGGCT GACGTCWACC TCG 33
(2) INFORMATION FOR SEQ ID NO: 135
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 33 nucleotideε
(B) TYPE: nucleic acid
(C) STRANDEDNESS: εingle
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: DNA
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 135
RCGHRCCTTG GGGATAGGTT GTCGCCWTCC ACG 33
(2) INFORMATION FOR SEQ ID NO: 136
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 33 nucleotideε
(B) TYPE: nucleic acid (C) STRANDEDNESS: εingle
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: DNA
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 136
YCCRGGCTGR GCCCAGRYCC TRCCCTCGGR YYG 33
SUBSTITUTE SHEET (2) INFORMATION FOR SEQ ID NO: 137
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 33 nucleotideε
(B) TYPE: nucleic acid
(C) STRANDEDNESS: εingle
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: DNA
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 137 BSHRCCCTCR TTRCCRTAGA GGGGCCADGG RTA 33
(2) INFORMATION FOR SEQ ID NO: 138
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 33 nucleotideε
(B) TYPE: nucleic acid
(C) STRANDEDNESS: εingle (D) TOPOLOGY: linear
(ii) MOLECULE TYPE: DNA
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 138 GCCRCGGGGW GACAGGAGCC ATCCYGCCCA CCC 33
(2) INFORMATION FOR SEQ ID NO: 139
SUBSTITUTE SHEET (i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 33 nucleotideε
(B) TYPE: nucleic acid (C) STRANDEDNESS: εingle
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: DNA
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 139
CCGGGGGTCY GTGGGGCCCC AYCTAGGCCG RGA 33
(2) INFORMATION FOR SEQ ID NO: 140
(i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 33 nucleotideε
(B) TYPE: nucleic acid
(C) STRANDEDNESS: εingle
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: DNA
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 140 ATCGATGACC TTACCCAART TRCGCGACCT RCG 33
(2) INFORMATION FOR SEQ ID NO: 141
(i) SEQUENCE CHARACTERISTICS:
SUBSTITUTE SHEET (A) LENGTH: 33 nucleotideε
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: DNA
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 141 CCCCATGAGR TCGGCGAAGC CGCAYGTRAG GGT 33
(2) INFORMATION FOR SEQ ID NO: 142
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 33 nucleotideε
(B) TYPE: nucleic acid (C) STRANDEDNESS: εingle
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: DNA
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 142
GCCYCCWARR GGGGCGCCGA CGAGCGGWAT RTA 33
(2) INFORMATION FOR SEQ ID NO: 143
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 33 nucleotides
(B) TYPE: nucleic acid
SUBSTITUTE SHEET (C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: DNA (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 143
AACCCGGACR CCRTGYGCCA RGGCCCTGGC AGC 33
(2) INFORMATION FOR SEQ ID NO: 144
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 33 nucleotides
(B) TYPE: nucleic acid
(C) STRANDEDNESS: εingle
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: DNA
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 144 RTTCCCTGTT GCATAGTTCA CGCCGTCYTC CAG 33
(2) INFORMATION FOR SEQ ID NO: 145
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 33 nucleotideε (B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
SUBSTITUTE SHEET - 149 -
(ii) MOLECULE TYPE: DNA
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 145 CARRAGGAAG AKAGAGAAAG AGCAACCRGG MAR 33
(2) INFORMATION FOR SEQ ID NO: 146
(i) SEQUENCE CHARACTERISTICS: .(A) LENGTH: 20 nucleotides
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: DNA
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 146 AGGCATAGGA CCCGTGTCTT 20
(2) INFORMATION FOR SEQ ID NO: 147
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 20 nucleotideε
(B) TYPE: nucleic acid (C) STRANDEDNESS: εingle
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: DNA
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 147 CTTCTTTGGA GAAAGTGGTG 20
SUBSTITUTE SHEET

Claims (67)

1. As a composition of matter, a non-naturally occurring nucleic acid having a non-HCV-1 nucleotide εequence of eight or more nucleotides corresponding to a nucleotide sequence within the hepatitiε C viruε genome.
2. The compoεition of claim 1 wherein εaid nucleotide εequence corresponding to a non-HCV-1 nucleotide εequence within the hepatitiε C virus genome is selected from the regions conεiεting of the NS5 region, envelope 1 region, 5'UT region, and the core region.
3. The composition of claim 1 wherein εaid nucleotide εequence corresponding to a non-HCV-1 nucleotide sequence within the hepatitis C virus genome correspondε to a εequence in the NS5 region.
4. The composition of claim 3 wherein εaid nucleotide sequence corresponding to a non-HCV-1 sequence within the hepatitis C virus genome iε εelected from a εequence within εequences numbered 2-22.
SUBSTITUTE SHEET
5. The composition .of claim 1 wherein said nucleotide sequence correεponding to a non-HCV-1 nucleotide εequence within the hepatitiε C viruε genome correεpondε to a εequence in the envelope 1 region.
6. The compoεition of claim 5 wherein εaid nucleotide εequence correεponding to a non-HCV-1 εequence within the hepatitiε C viruε genome correεponds to a εequence within εequence numberε 24-32.
7. The composition of claim 1 wherein at least one εequence correεponding to a non-HCV-1 nucleotide εequence within the hepatitiε C viruε genome corresponds to a sequence in the 5'UT region.
8. The composition of claim 7 wherein said nucleotide εequence correεponding to a non-HCV-1 εequence within the hepatitiε C virus genome corresponds to a sequence within sequenceε numbered 34-51.
9. The composition of claim 1 wherein said nucleotide sequence corresponding to a non-HCV-1 nucleotide εequence within the hepatitis C virus genome corresponds to a sequence in the core region.
SUBSTITUTE SHEET
10. The composition of claim 9 wherein εaid nucleotide εequence correεponding to a non-HCV-1 εequence within the hepatitiε C viruε genome correεpondε to a within εequenceε numbered 53-66.
11. The compoεition of claim 1 wherein εaid non-naturally occurring nucleic acid haε a nucleotide εequence correεponding to one or more genotypeε of hepatitiε C viruε.
12. The compoεition of claim 11 wherein εaid non-naturally occurring nucleic acid haε a εequence correεponding to a εequence of a firεt genotype which firεt genotype iε defined εubεtantially by εequenceε numbered 1-6 in the NS5 region, 23-25 in the envelope 1 region, 33-38 in the 5'UT region, and 52-57 in the core region.
13. The compoεition of claim 11 wherein εaid non-naturally occurring nucleic acid has a sequence corresponding to a sequence of a εecond genotype which εecond genotype iε defined εubεtantially by εequenceε numbered 7-12 in the NS5 region, 26-28 in the envelope 1 region, 39-45 in the 5'UT region, and 58-64 in the core region.
SUBSTITUTE SHEET
14. The compoεition of claim 11 wherein εaid non-naturally occurring nucleic acid haε a εequence corresponding to a sequence of a third genotype which third genotype iε defined substantially by sequences numbered 13-17 in the NS5 region, 32 in the envelope 1 region, 46-47 in the 5'UT region and 65-66 in the core region.
15. The composition of claim 11 wherein said non-naturally occurring nucleic acid has a sequence corresponding to a sequence of a fourth genotype which fourth genotype is defined subεtantially by sequences numbered 20-22 in the NS5 region, 29-31 in the envelope 1 region and 48-49 in the 5'UT region.
16. The composition of claim 11 wherein εaid non-naturally occurring nucleic acid haε a εequence corresponding to a sequence of a fifth genotype which fifth genotype is defined εubεtantially by εequences numbered 18-19 in the NS5 region and 50-51 in the 5'UT region.
17. The composition of claim' 1 wherein said non-naturally occurring nucleic acid is capable of priming a reaction for the syntheεis of nucleic acid to form a nucleic acid having a nucleotide sequence corresponding to hepatitis C virus.
SUBSTITUTE SHEET
18. The compoεition of claim 1 wherein εaid non-naturally occurring nucleic acid haε label meanε for detecting a hybridization product.
19. The compoεition of claim 1 wherein εaid non-naturally occurring nucleic acid haε εupport meanε for εeparating a hybridization product from εolution.
20. The compoεition of claim 1 wherein εaid non-naturally occurring nucleic acid prevents the transcription or translation of viral nucleic acid.
21. A method of forming a hybridization product with a hepatitis C virus nucleic acid comprising the following εtepε: a. placing a non-naturally occurring nucleic acid having a nucleotide εequence of eight or more nucleotideε correεponding to a non-HCV-l εequence in the hepatitiε C viral genome into conditionε in which hybridization conditionε can be impoεed εaid non-naturally occurring nucleic acid capable of forming a hybridization product with εaid hepatitiε C viruε nucleic acid under hybridization conditionε; and
SUBSTITUTE SHEET b. . imposing hybridization conditionε to form a hybridization product in the preεence of hepatitiε C virus nucleic acid.
22. The method of claim 21 wherein εaid nucleotide εequence correεponding to a non-HCV-1 sequence in the hepatitis C virus genome corresponds to a sequence within at least one of the regions consisting esεentially of NS5 region, envelope 1 region, 5'UT region, and the core region.
23. The method of claim 21 wherein εaid nucleotide εequence correεpondε to a non-HCV-l εequence correεponds to a sequence within the NS5 region.
24. The method of claim 23 wherein εaid nucleotide εequence corresponds to a non-HCV-l sequence correεponds to a sequence within sequences numbered 2-22.
25. The method of claim 21 wherein said nucleotide εequence corresponds to a non-HCV-l sequence corresponds to a sequence within the envelope 1 region.
SUBSTITUTE SHEET
26. The method of claim 25 wherein εaid nucleotide εequence correεpondε to a non-HCV-1 εequence is εelected from a εequence within εequenceε numbered 24-32.
27. The method of claim 21 wherein εaid nucleotide εequence correεpondε to a non-HCV-l εequence correεponding to a εequence within the 5'UT region.
28. The method of claim 27 wherein εaid nucleotide εequence correεpondε to a non-HCV-1 εequence εelected from a εequence within εequenceε numbered 34-51.
29. The method of claim 21 wherein εaid nucleotide εequence correεponds to a non-HCV-l εequence correεponding to a εequence within the core region.
30. The method of claim 29 wherein εaid nucleotide εequence correεpondε to a non-HCV-1 εequence εelected from a εequence within εequenceε numbered 53-66.
31. The method of claim 21 wherein εaid nucleotide εequence correεpondε to a non-HCV-1 nucleotide εequence corresponding to one or more genotypes of hepatitis C virus.
SUBSTITUTE SHEET
32. The method of claim 21 wherein εaid non-naturally occurring nucleic acid haε a εequence correεponding to a sequence of a first genotype which first genotype is defined subεtantially by εequenceε numbered 1-6 in the NS5 region, 23-25 in the envelope 1 region, 33-38 in the 5'UT region, and 52-57 in the core region.
33. The method of claim 21 wherein εaid non-naturally occurring nucleic acid haε a εequence correεponding to a sequence of a εecond genotype which εecond genotype iε defined εubεtantially by εequenceε numbered 7-12 in the NS5 region, 26-28 in the envelope 1 region, 39-45 in the 5'UT region, and 58-64 in the core region.
34. The method of claim 21 wherein εaid non-naturally occurring nucleic acid haε a εequence correεponding to a sequence of a third genotype which third genotype is defined substantially by sequenceε numbered 13-17 in the NS5 region, 32 in the envelope 1 region, 46-47 in the 5'UT region and 65-66 in the core region.
35. The method of claim 21 wherein εaid non-naturally occurring nucleic acid haε a εequence correεponding to a εequence of a fourth genotype which fourth genotype iε defined substantially by sequences numbered 20-22 in the NS5 region, 29-31 in the envelope 1 region and 48-49 in the 5'UT region.
SUBSTITUTE SHEET
36. The method of claim 21 wherein εaid non-naturally occurring nucleic acid has a sequence corresponding to a εequence of a fifth genotype which fifth genotype iε defined εubstantially by sequences numbered 18-19 in the NS5 region and 50-51 in the 5'UT region.
37. The method of claim 21 wherein said hybridization product iε capable of priming a reaction for the synthesis of nucleic acid.
38. The method of claim 21 wherein said non-naturally occurring nucleic acid has label means for detecting a hybridization product.
39. The method of claim 21 wherein εaid non-naturally occurring nucleic acid haε εupport meanε for εeparating the hybridization product from εolution.
40. The method of claim 21 wherein εaid non-naturally occurring nucleic acid preventε the tranεcription or tranεlation of viral nucleic acid.
41. Aε a compoεition of matter, a non-naturally occurring polypeptide correεponding to a non-HCV-1 nucleotide sequence of nine or more nucleotides which sequence of nine or more nucleotides correspondε to a εequence within hepatitiε C viruε genomic εequenceε.
SUBSTITUTE SHEET
42. The composition of claim 41 wherein said non-HCV-1 εequence iε selected from one of the regionε conεiεting of NS5 region, envelope 1 region, and the core region.
43. The compoεition of claim 41 wherein εaid non-HCV-1 nucleotide εequence correεponds to a εequence in the NS5 region.
44. The compoεition of claim 43 wherein εaid non-HCV-1 εequence iε εelected from a εequence within εequenceε numbered 2-22.
45. The compoεition of claim 41 wherein εaid non-HCV-1 εequence correεponds to a sequence in the envelope 1 region.
46. The composition of claim 45 wherein said non-HCV-1 sequence is εelected from a εequence within sequences numbered 24-32.
47. The composition of claim 41 wherein said non-HCV-1 sequence corresponds to a εequence in the core region.
48. The composition of claim 47 wherein said non-HCV-l εequence iε εelected from a εequence within εequenceε numbered 52-66.
SUBSTITUTE SHEET
49. The composition of claim 41 wherein εaid non-HCV-l nucleotide εequence haε a nucleotide εequence correεponding to one or more genotypeε of hepatitiε C viruε.
50. The compoεition of claim 41 wherein εaid non-HCV-1 nucleotide εequence haε a εequence correεponding to a εequence of a firεt genotype which firεt genotype iε defined εubstantially by sequences numbered 1-6 in the NS5 region, 23-25 in the envelope 1 region, and 52-57 in the core region.
51. The composition of claim 41 wherein said non-HCV-1 nucleotide εequence haε a εequence correεponding to a εequence of a εecond genotype which εecond genotype iε defined εubstantially by sequences numbered 7-12 in the NS5 region, 26-28 in the envelope 1 region, and 58-64 in the core region.
52. The composition of claim 41 wherein εaid non-HCV-1 nucleotide εequence haε a εequence correεponding to a εequence of a third genotype which third genotype iε defined substantially by sequences numbered 13-17 in the NS5 region, 32 in the envelope 1 region, and 65-66 in the core region.
SUBSTITUTE SHEET
53. The compoεition of claim 41 wherein εaid non-HCV-1 nucleotide εequence haε a εequence correεponding to a εequence of a fourth genotype which fourth genotype iε defined εubstantially by εequenceε numbered 20-22 in the NS5 region, 29-31 in the envelope 1 region and 48-49 in the 5'UT region.
54. The compoεition of claim 41 wherein εaid non-HCV-1 nucleotide εequence haε a εequence correεponding to a εequence of a fifth genotype which fifth genotype iε defined εubεtantially by εequenceε numbered 18-19 in the NS5 region and 50-51 in the 5'UT region.
55. The compoεition of claim 41 wherein εaid polypeptide iε capable of generating an immune reaction in a hoεt.
56. An antibody capable of εelectively binding to the compoεition of claim 41.
57. A method of detecting one or more genotypeε of hepatitiε C viruε compriεing the following εtepε: a) placing a non-naturally occurring nucleic acid having a nucleotide εequence of eight or more nucleotides corresponding to one or more genotypes of hepatitis C virus under conditions where hybridization conditions can be imposed,
SUBSTITUTE SHEET b) impoεing hybridization conditionε to form a hybridization product in the preεence of hepatitiε C viruε nucleic acid; and c) monitoring the non-naturally occurring nucleic acid for the formation of a hybridization product, which hybridization product iε indicative of the preεence of the genotype of hepatitiε C viruε.
58. The method of claim 57 wherein εaid non-naturally occurring nucleic acid haε a εequence correεponding to a εequence of a firεt genotype which firεt genotype iε defined εubεtantially by εequenceε numbered 1-6 in the NS5 region, 23-25 in the envelope 1 region, 33-38 in the 5'UT region, and 52-57 in the core region.
59. The method of claim 57 wherein εaid non-naturally occurring nucleic acid haε a sequence corresponding to a sequence of a second genotype which εecond genotype is defined substantially by sequences numbered 7-12 in the NS5 region, 26-28 in the envelope 1 region, 39-45 in the 5'UT region, and 58-64 in the core region.
SUBSTITUTE SHEET
60. The method of claim 57 wherein said non-naturally occurring nucleic acid has a sequence corresponding to a εequence of a third genotype which third genotype is defined εubεtantially by εequences numbered 13-17 in the NS5 region, 32 in the envelope 1 region, 46-47 in the 5'UT region and 65-66 in the core region.
61. The method of claim 57 wherein said non-naturally occurring nucleic acid haε a εequence corresponding to a sequence of a fourth genotype which fourth genotype is defined subεtantially by sequences numbered 20-22 in the NS5 region, 29-31 in the envelope 1 region and 48-49 in the 5'UT region.
62. The method of claim 57 wherein said non-naturally occurring nucleic acid has a sequence corresponding to a sequence of a fifth genotype which fifth genotype is defined subεtantially by εequences. numbered 18-19 in the NS5 region.
63. The method of claim 57 wherein said non-naturally occurring nucleic acid has a εequence corresponding to a εequence numbered 67-145.
SUBSTITUTE SHEET
64. The method of claim 57 wherein εaid non-naturally occurring nucleic acid has a sequence corresponding to a εequence numbered 69, 71, 73 and 81-99 to identify Group I genotypeε in the core and region of the HCV genome.
65. The method of claim 57 wherein εaid non-naturally occurring nucleic acid haε a εequence correεponding to a εequence numbered 70, 72, 70 and 100-118 to identify Group II genotypeε in the core and envelope regionε of the HCV genome.
66. The method of claim 57 wherein said non-naturally occurring nucleic acid has a sequence corresponding to a εequence numbered 77 to identify Group III genotypeε in the 5' UT region of the HCV genome.
67. The method of claim 57 wherein εaid non-naturally occurring nucleic acid haε a sequence numbered 79 to identify Group IV genotypes in the 5' UT region of the HCV genome.
SUBSTITUTE SHEET
AU21558/92A 1991-05-08 1992-05-08 HCV genomic sequences for diagnostics and therapeutics Expired AU668355C (en)

Applications Claiming Priority (3)

Application Number Priority Date Filing Date Title
US69732691A 1991-05-08 1991-05-08
US697326 1991-05-08
PCT/US1992/004036 WO1992019743A2 (en) 1991-05-08 1992-05-08 Hcv genomic sequences for diagnostics and therapeutics

Publications (3)

Publication Number Publication Date
AU2155892A AU2155892A (en) 1992-12-21
AU668355B2 AU668355B2 (en) 1996-05-02
AU668355C true AU668355C (en) 1997-05-08

Family

ID=

Similar Documents

Publication Publication Date Title
WO1992019743A2 (en) Hcv genomic sequences for diagnostics and therapeutics
US6214583B1 (en) HCV genomic sequences for diagnostics and therapeutics
KR0138776B1 (en) Nanbv diagnostics and vaccines
AU666576B2 (en) NANBV diagnostics and vaccines
IE83867B1 (en) HCV genomic sequences for diagnostics and therapeutics
WO1995032291A2 (en) Hepatitis g virus and molecular cloning thereof
WO1995032290A2 (en) Non-a/non-b/non-c/non-d/non-e hepatitis agents and molecular cloning thereof
AU668355C (en) HCV genomic sequences for diagnostics and therapeutics
EP0832114A2 (en) Nucleotide and amino acid sequences of hypervariable region 1 of the envelope 2 gene of hepatitis c virus
AU624105B2 (en) Nanbv diagnostics and vaccines
US5437974A (en) DNA sequence and encoded polypeptide useful in the diagnosis of hepatitis disease
RU2162894C2 (en) Polypeptide showing antigenic property of hepatitis c virus and its fragment (variants), diagnostic reagent (variants), set for antibody detection (variants), method of antibody detection (variants)
AU684177C (en) Hepatitis G virus and molecular cloning thereof
AU624105C (en) NANBV diagnostics and vaccines