EP1825272A2 - Fungal signalling and metabolic enzymes - Google Patents

Fungal signalling and metabolic enzymes

Info

Publication number
EP1825272A2
EP1825272A2 EP05811554A EP05811554A EP1825272A2 EP 1825272 A2 EP1825272 A2 EP 1825272A2 EP 05811554 A EP05811554 A EP 05811554A EP 05811554 A EP05811554 A EP 05811554A EP 1825272 A2 EP1825272 A2 EP 1825272A2
Authority
EP
European Patent Office
Prior art keywords
protein
polynucleotide
candidate substance
seq
fragment
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Withdrawn
Application number
EP05811554A
Other languages
German (de)
French (fr)
Inventor
Nina F2G Limited BROGDEN
Michael John F2G Limited BROMLEY
Paul David F2G Limited CARR
Sarah Jane F2G Limited KAYE
Jason David F2G Limited OLIVER
Daniel Scott F2G Limited TUCKWELL
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
F2G Ltd
Original Assignee
F2G Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Priority claimed from GB0426390A external-priority patent/GB0426390D0/en
Priority claimed from GB0521062A external-priority patent/GB0521062D0/en
Application filed by F2G Ltd filed Critical F2G Ltd
Publication of EP1825272A2 publication Critical patent/EP1825272A2/en
Withdrawn legal-status Critical Current

Links

Classifications

    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N9/00Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
    • C12N9/88Lyases (4.)
    • AHUMAN NECESSITIES
    • A61MEDICAL OR VETERINARY SCIENCE; HYGIENE
    • A61PSPECIFIC THERAPEUTIC ACTIVITY OF CHEMICAL COMPOUNDS OR MEDICINAL PREPARATIONS
    • A61P31/00Antiinfectives, i.e. antibiotics, antiseptics, chemotherapeutics
    • A61P31/04Antibacterial agents
    • AHUMAN NECESSITIES
    • A61MEDICAL OR VETERINARY SCIENCE; HYGIENE
    • A61PSPECIFIC THERAPEUTIC ACTIVITY OF CHEMICAL COMPOUNDS OR MEDICINAL PREPARATIONS
    • A61P31/00Antiinfectives, i.e. antibiotics, antiseptics, chemotherapeutics
    • A61P31/10Antimycotics
    • CCHEMISTRY; METALLURGY
    • C07ORGANIC CHEMISTRY
    • C07KPEPTIDES
    • C07K14/00Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
    • C07K14/37Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from fungi
    • C07K14/38Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from fungi from Aspergillus
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12QMEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
    • C12Q1/00Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
    • C12Q1/02Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving viable microorganisms
    • C12Q1/18Testing for antimicrobial activity of a material

Definitions

  • the present invention relates to a method of screening for an anti-fungal agent and to fungal genes involved in signalling and metabolism.
  • Invasive fungal infections are well recognised as diseases of the immunocompromised host. Over the last twenty years there have been significant rises in the number of recorded instances of fungal infection (Groll et al., 1996, J Infect 33, 23-32). hi part this is due to increased awareness and improved diagnosis of fungal infection. However, the primary cause of this increased incidence is the vast rise in the number of susceptible individuals. This is due to a number of factors including new and aggressive immunosuppressive therapies, increased survival in intensive care, increased numbers of transplant procedures and the greater use of antibiotics worldwide.
  • fungal infection occurs at high frequency; lung transplant recipients have a frequency of up to 20% colonisation and infection with a fungal organism and fungal infection in allogenic hoemopoetic stem transplant recipients is as high as 15% (Ribaud et al., 1999, Clin Infect Dis. 28:322-30).
  • polyenes e.g., amphotericin B
  • azoles e.g., ketoconazole or itraconazole
  • echinocandins e.g., caspofungin
  • flucytosine e.g., flucytosine
  • the polyenes are the oldest class of antifungal agent being first introduced in the 1950's. The exact mode of action remains unclear but polyenes are only effective against organisms that contain sterols in their outer membranes. It has been proposed that amphotericin B interacts with membrane sterols to produce pores allowing leakage of cytoplasmic components and subsequent cell death.
  • Azoles function by the inhibition of 14 ⁇ -demethylase via a cytochrome P450-dependent mechanism. This leads to a depletion of the membrane sterol ergosterol and the accumulation of sterol precursors resulting in a plasma membrane with altered fluidity and structure. Echinocandins work by inhibiting the cell wall synthesis enzyme ⁇ -glucan synthase, leading to abnormal cell wall formation, osmotic sensitivity and cell lysis.
  • Flucytosine is a pyrimidine analogue interfering with cellular pyrimidine metabolism as well DNA, RNA and protein synthesis. However widespread resistance to flucyotosine limits its therapeutic use.
  • Novel fungal-specific genes are likely to present the best opportunity for the development of effective novel anti-fungal agents.
  • target genes are present in a range of fungi, but absent from humans, and fungal- specific genes involved in metabolism and signalling would be valuable candidates.
  • the inventors have exploited the availability of fungal and mammalian genomes to identify such genes which are thus suitable as targets for the development of anti- fungal drugs.
  • the inventors have found a set of twelve genes which are present in fungi but not humans. This finding allows the identification of anti-fungal agents based on their ability to target these genes.
  • the invention provides a set of twelve proteins which can be used to screen for anti-fungal agents.
  • a set of twelve proteins from Aspergillus fumigatus (see Table I) is provided.
  • the inventors have found two Aspergillus fumigatus genes which resemble the single S. cerevisiae ILV3 gene.
  • ILV3 is essential in S. cerevisiae for the biosynthesis of the branched amino acids leucine, iso leucine and valine, but this enzyme is absent from animals, making it a good target for an antifungal.
  • This gene has not been used before as a target for the discovery of an antifungal agent, nor have recombinant ILV3 proteins been synthesised.
  • two A. fumigatus ILV3-like genes have to be knocked out to render the organism inviable.
  • the invention therefore provides ILV3-like genes of fungi (see Tables I and II) which can be used either individually or together (as pairs) to screen for antifungal agents.
  • a polynucleotide that comprises sequence which encodes (i), (ii) or (iii), (v) a polynucleotide comprising sequence which has at least 70% identity with the coding sequence of (iv), and determining whether the candidate substance binds or modulates (i), (ii), (iii), (iv) or (v), wherein binding or modulation of (i), (ii), (iii), (iv) or (v) indicates that the candidate substance is an anti-fungal agent, - use of (i), (ii), (iii), (iv) or (v) as defined above to identify or obtain an antifungal agent,
  • an antibody which is specific for a protein of the invention - a method for preventing or treating a fungal infection comprising administering an anti-fungal agent identified by the screening method of the invention, and
  • fungus which has been killed, or whose growth has been impaired, by inhibition of the expression or activity of a protein or polynucleotide of the invention.
  • ⁇ Numbers after SEQ ED Nos. correspond to bases of genomic DNA encoding the protein in cases where introns are present.
  • proteins of the invention and polynucleotide sequences (termed “proteins of the invention” and “polynucleotides of the invention” herein), including homologues and/or fragments of the fungal proteins and polynucleotides, to identify anti-fungal agents.
  • a protein or polynucleotide of the invention may be defined by similarity in sequence to a another member of the family. This similarity may be based on percentage identity (for example to the sequences shown in the sequence listing).
  • a protein or polynucleotide of the invention may be an ILV3 or ILVD protein, defined as a dihydroxy acid dehydratase, or as a protein which shows homology to SEQ E) No. 21, or as a protein which matches the ILVD_EDD Pfam profile.
  • the protein or polynucleotide of the invention may align with other proteins or polynucleotides of the invention (as shown in SEQ ID Nos. 1-63).
  • the protein or polynucleotide of the invention may be in isolated form (such as non-cellular form), or, in the case of membrane-associated proteins, as a membrane preparation, for example when used in the method of the invention.
  • the polynucleotide may comprise native, synthetic or recombinant polynucleotide, and the protein may comprise native, synthetic or recombinant protein.
  • the polynucleotide or protein may comprise combinations of native, synthetic or recombinant polynucleotide or protein, respectively.
  • the polynucleotides and proteins of the invention may have a sequence which is the same as, or different from, naturally occurring polynucleotides and proteins.
  • references to polynucleotides and proteins being “isolated from” a particular organism include polynucleotides and proteins which were prepared by means other than obtaining them from the organism, such as synthetically or recombinantly.
  • the polynucleotide or protein is isolated from a fungus, more preferably a filamentous fungus, even more preferably an Ascomycete.
  • the polynucleotide or protein is isolated from an organism selected from Aspergillus; Blumeria; Candida; Colletotrichium; Cryptococcus; Encephalitozoon; Fusarium; Histoplasma, Leptosphaeria; Magnaporthe; Mycosphaerella; Neurospora; Phytophthora; Plasmopara; Pneumocystis; Pyricularia; Pythium; Puccinia; Rhizoctonia; Saccharomyces, Schizosaccharomyces, Trichophyton; and Ustilago.
  • the polynucleotide or protein is isolated from Aspergillus.
  • the polynucleotide or protein is isolated from an organism selected from the species Aspergillus flavus; Aspergillus fumigatus; Aspergillus nidulans;
  • the polynucleotide or protein is isolated from Aspergillus fumigatus, preferably the protein, may be isolated from A. fumigatus AF293. Variants of the above mentioned polynucleotides and proteins are also provided, and are discussed below.
  • the protein of the invention may comprise an amino acid sequence substantially as set out and independently selected from any of SEQ ID Nos: 3, 6, 9, 12, 15, 18, 21, 24, 27, 30, 33, 36, 39, 42, 45, 48, 50, 53, 56, 59, 61, 63 or variants thereof.
  • the polynucleotide of the invention may comprise DNA, such as genomic DNA.
  • the polynucleotide may comprise a sequence substantially as set out and independently selected from any of SEQ ID Nos. 1, 4, 7, 10, 13, 16, 19, 22, 25, 28, 31, 34, 37, 40, 43, 46, 49, 51, 54, 57, 60, 62 or complements, or variants thereof.
  • the polynucleotide may comprise RNA, preferably mRNA, preferably spliced mRNA.
  • the polynucleotide comprises substantially the sequence shown as SEQ ID Nos 2, 5, 8, 11, 14, 17, 20, 23, 26, 29, 32, 35, 38, 41, 44, 47, 49, 52, 55, 58, 60, 62, or a complement, or a variant thereof.
  • the protein is encoded by the regions of sequences SEQ ID Nos. 1, 4, 7, 10, 13, 16, 19, 22, 25, 28, 31, 34, 37, 40, 43, 46, 49, 51, 54, 57, 60 or 62 as described in the column "gDNA” in Tables I or II, or a complement, or a variant thereof.
  • the isolated polynucleotide comprises substantially a nucleotide sequence independently selected from the regions and sequences given in the column "gDNA” in Tables I or II.
  • the protein is encoded by a polynucleotide which polynucleotide comprises substantially a sequence independently selected from at least one of the the regions and sequences given in the column "gDNA" in Tables I or II, or a complement or, a variant thereof.
  • the polynucleotide encodes a protein which comprises substantially the amino acid sequences SEQ ID Nos: 1, 4, 7, 10, 13, 16, 19, 22, 25, 28, 31, 34, 39, 42, 45, 48, 50, 53, 56, 59, 61, 63 or a variant thereof.
  • amino acid/polynucleotide/protein an amino acid, polynucleotide or protein produced naturally from biological sources either in vivo or in vitro.
  • synthetic amino acid/polynucleotide/protein is meant an amino acid, polynucleotide or protein which has been produced artificially or de novo using a DNA or protein synthesis machine known in the art.
  • recombinant amino acid/polynucleotide /protein is meant an amino acid, polynucleotide or protein which has been produced using recombinant DNA or protein technology or methodologies which are known to the skilled technician.
  • variant and the terms “substantially the amino acid/polynucleotide/protein sequence” are used herein to refer to related sequences. As discussed below such related sequences are typically homologous to (share percentage identity with) a given sequence, for example over the entire length of the sequence or over a portion of a given length.
  • the related sequence may also be a fragment of the sequence or of a homologous sequence.
  • a variant protein may be encoded by a variant polynucleotide.
  • variant and the terms “substantially the amino acid/polynucleotide/protein sequence” we mean that the sequence has at least 30%, preferably 40%, more preferably 50%, and even more preferably, 60% sequence identity with the amino acid/polynucleotide/protein sequences of any one of the sequences referred to.
  • Calculation of percentage identities between different amino acid/polynucleotide/protein sequences may be carried out as follows.
  • a multiple alignment is first generated by the ClustalX program (pairwise parameters: gap opeining 10.0, gap extension 0.1, protein matrix Gonnet 250, DNA matrix IUB; multiple parameters: gap opening 10.0, gap extension 0.2, delay divergent sequences 30%, DNA transition weight 0.5, negative matrix off, protein matrix gonnet series, DNA weight IUB; Protein gap parameters, residue-specific penalties on, hydrophilic penalties on, hydrophilic residues GPSNDQERK, gap separation distance 4, end gap separation off).
  • the percentage identity is then calcluated from the multiple alignment as (N/T)*100, where N is the number of positions at which the two sequences share an identical residue, and T is the total number of positions compared.
  • percentage identity can be calculated as (N/S)* 100 where S is the length of the shorter sequence being compared.
  • the amino acid/polynucleotide/protein seqences may be synthesised de novo, or may be native amino acid/polynucleotide/protein sequence, or a derivative thereof.
  • An amino acid/polynucleotide/protein sequence with a greater identity than 65% to any of the sequences referred to is also envisaged.
  • An amino acid/polynucleotide/protein sequence with a greater identity than 70% to any of the sequences referred to is also envisaged.
  • An amino acid/polynucleotide/protein sequence with a greater identity than 75% to any of the sequences referred to is also envisaged.
  • An amino acid/polynucleotide/protein sequence with a greater identity than 80% to any of the sequences referred to is also envisaged.
  • the amino acid/polynucleotide/protein sequence has 85% identity with any of the sequences referred to, more preferably 90% identity, even more preferably 92% identity, even more preferably 95% identity, even more preferably 97% identity, even more preferably 98% identity and, most preferably, 99% identity with any of the referred to sequences.
  • percentage identities may be measured over the entire length of the original sequence or over a region of 15, 20, 50 or 100 amino acids/bases of the original sequence.
  • percentage identity is measured with reference to SEQ ED Nos. 3, 6, 9, 12, 15, 18, 21, 24, 27, 30, 33, 36, 39, 42, 45, 48, 50, 53, 56, 59 61 or 63.
  • the variant protein has at least 40% identity, such as at least 60% or at least 80% identity with SEQ ID Nos. 3, 6, 9, 12, 15, 18, 21, 24, 27, 30, 33, 36, 39, 42, 45, 48, 50, 53, 56, 59, 61 or 63 or a portion of one of these.
  • a substantially similar nucleotide sequence will be encoded by a sequence which hybridizes to the sequences shown in SEQ ID Nos. 1, 2, 4, 5, 7, 8, 10, 11, 13, 14, 16, 17, 19, 20, 22, 23, 25, 26, 28, 29, 31, 32, 34, 35, 37, 38, 40, 41, 43, 44, 46, 47, 49, 51, 52, 54, 55, 57, 58, 60, 62, or their complements under stringent conditions.
  • stringent conditions we mean the nucleotide hybridises to filter- bound DNA or RNA in 6x sodium chloride/sodium citrate (SSC) at approxmiately 45 0 C followed by at least one wash in 0.2x SSC/0.1% SDS at approximately 5-65°C.
  • a substantially similar protein may differ by at least 1, but less than 5, 10, 20, 50 or 100 amino acids from the sequences shown in SEQ ID Nos. 3, 6, 9, 12, 15, 18, 21, 24, 27, 30, 33,. 36, 39, 42, 45, 48, 50, 53, 56, 59, 61 or 63. Such differences may each be additions, deletions or substitutions.
  • nucleic acid sequence could be varied or changed without substantially affecting the sequence of the protein encoded thereby, to provide a functional variant thereof.
  • Suitable nucleotide variants are those having a sequence altered by the substitution of different codons that encode the same amino acid within the sequence, thus producing a silent change.
  • suitable variants are those having homologous nucleotide sequences but comprising all, or portions of, sequence which are altered by the substitution of different codons that encode an amino acid with a side chain of similar biophysical properties to the amino acid it substitutes, to produce a conservative change.
  • small non-polar, hydrophobic amino acids include glycine, alanine, leucine, isoleucine, valine, proline, and methionine.
  • Large non-polar, hydrophobic amino acids include phenylalanine, tryptophan and tyrosine.
  • the polar neutral amino acids include serine, threonine, cysteine, asparagine and glutamine.
  • the positively charged (basic) amino acids include lysine, arginine and histidine.
  • the negatively charged (acidic) amino acids include aspartic acid and glutamic acid.
  • Certain organisms, including Candida are known to use non-standard codons compared to those used in the majority of eukaryotes. Any comparisons of polynucleotides and proteins from such organisms with the sequences given here should take these differences into account.
  • DNA/RNA an identical option exists within the DNADIST program of PHYLIP.
  • Other modifications in protein sequences are also envisaged and within the scope of the claimed invention, i.e. those which occur during or after translation, e.g. by acetylation, amidation, carboxylation, GPI-linkage, myristoylation, phosphorylation, proteolytic cleavage or linkage to a ligand.
  • variants also include a fragment of the relevant polynucleotide or protein sequences, including a fragment of the homologous sequences (which have percentage identity to a specified sequence) referred to above.
  • a polynucleotide fragment will typically comprise at least 10 bases, such as at least 20, 30, 50, 100, 200, 500 or 1000 bases.
  • a protein fragment will typically comprise at least 10 amino acids, such as at least 20, 30, 50, 80, 100, 150, 200, 300, 400 or 500 amino acids.
  • the fragments may lack at least 3 amino acids, such as at least 10, 20 or 30 amino acids of the amino acids from either end of the protein.
  • the invention provides methods of screening which may be used to identify modulators of the proteins or polynucleotides of the invention, such as inhibitors of expression or activity of the proteins or polynucleotides of the invention.
  • a candidate substance is contacted with a protein or polynucleotide of the invention and whether or not the candidate substance binds or modulates the protein or polynucleotide is determined.
  • the modulator may promote (agonise) or inhibit (antagonise) the activity of the protein.
  • a therapeutic modulator (against fungal infection) will inhibit the expression or activity of protein or polynucleotide of the invention.
  • the method may be carried out in vitro (inside or outside a cell) or in vivo.
  • the method may be carried out on a cell, or cell culture extract, or cell extract or cell- membrane fraction.
  • the cell may or may not be a cell in which the polynucleotide or protein is naturally present.
  • the cell may or may not be a fungal cell, or may or may not be a cell of any of the fungi mentioned herein.
  • the protein or polynucleotide may be present in a non-cellular form in the method, thus the protein may be in the form of a recombinant protein purified from a cell.
  • Methods which determine whether a candidate substance is able to bind the protein or polynucleotide may comprise providing the protein or polynucleotide to a candidate substance and determining whether binding occurs, for example by measuring the amount of the candidate substance which binds the protein or polynucleotide.
  • the binding may be determined by measuring a characteristic of the protein or polynucleotide that changes upon binding, such as spectroscopic changes.
  • the binding may be determined by measuring reaction substrate or product levels in the presence and absence of the candidate and comparing the levels.
  • the assay format may be a 'band shift' system. This involves determining whether a test candidate advances or retards the protein or polynucleotide on gel electrophoresis relative to the absence of the compound.
  • the method may be a competitive binding method. This determines whether the candidate is able to inhibit the binding of the protein or polynucleotide to an agent which is known to bind to the protein or polynucleotide, such as an antibody specific for the protein, or a substrate of the protein.
  • Whether or not a candidate substance modulates the activity of the protein may be determined by providing the candidate substance to the protein under conditions that permit activity of the protein, and determining whether the candidate substance is able to modulate the activity of the product.
  • the activity which is measured may be any of the activities of the proteins of the invention mentioned herein, including; endonuclease, exonuclease, exoribonuclease, G-protein coupled receptor, ILV3/dihydroxyacid dehydratase, kinase, phosphatase, phosphatididylinositol-specific phospholipase C, phosphodiesetrase, protein tyrosine phosphatase, ion transport or small molecule transport/permease activities.
  • the screening method comprising carrying out a reaction in the presence and absence of the candidate substance to determine whether the candidate substance inhibits the activity of the protein of the invention.
  • ILV3 activity can be measured as follows: An ILV3 protein is incubated with a substrate molecule such as dihydroxy valeric acid, dihydroxy methylvaleric acid, another dihydroxy acid, or a polyhydroxy acid (such as threonic acid or 2,3,4,5- tetrahydroxy pentanoic acid), and the appearance of a keto acid product measured either directly or indirectly.
  • a substrate molecule such as dihydroxy valeric acid, dihydroxy methylvaleric acid, another dihydroxy acid, or a polyhydroxy acid (such as threonic acid or 2,3,4,5- tetrahydroxy pentanoic acid)
  • Direct measurement can be carried out by means of spectrophotometry, for example at 240 nm, whereas indirect measurement can be carried out by reacting the keto acid with semicarbazide and measuring the appearance of product by spectrophotometry, for exmaple at 250 nm, or by reacting the keto acid with 2,4-dinitrophenylhydrazine and measuring the reaction products by spectrophotometry at 540-550 nm.
  • This assay may be used as a screen for inhibitors of filamentous fungal ILV3s by (a) adding to the assay putative inhibitor compounds and looking for a decrease in product, and (b) carrying out the assay firstly with a group I ILV3 (Table II) and then carrying out the assay with a group II ILV3 (or vice versa) and identifying compounds that inhibit in both assays.
  • the assay can be carried out with recombinant A.fumigatus ILV34 and ILV1352 (Table II).
  • ILV3 inhibitors may also be identified by the above assay using a single ILV3 protein such as from any of the following species: organism selected from the species Aspergillus flavus; Aspergillus fumigatus; Aspergillus nidulans; Aspergillus niger; Aspergillus parasiticus; Aspergillus terreus; Blumeria graminis; Candida albicans; Candida cruzei; Candida glabrata; Candida parapsilosis; Candida tropicalis; Colletotrichium trifolii; Cryptococcus neoformans; Encephalitozoon cuniculi; Fusarium graminarium; Fusarium solani; Fusarium sporotrichoides; Histoplasma capsulata; Leptosphaeria nodorum; Magnaporthe grisea; Mycosphaerella graminicola; Neurospora crassa; Phytophthora capsici; Phytophthora in
  • a candidate substance is contacted with a cell heterozygous for an underexpressed, mutated, disrupted or deleted copy or copies of the gene or genes, and the extent to which the candidate substance inhibits growth of the cell is determined by any suitable means and compared to the effects of the candidate substance on cells homozygous for unaltered copies of the gene.
  • the heterozygous cell will show greater sensitivity to substances that inhibit the gene or its gene product.
  • Suitable candidate substances which can tested in the above methods include antibody products (for example, monoclonal and polyclonal antibodies, single chain antibodies, chimeric antibodies and CDR-grafted antibodies). Furthermore, combinatorial libraries, defined chemical identities, peptide and peptide mimetics, oligonucleotides and natural product libraries, such as display libraries (e.g. phage display libraries) may also be tested.
  • the candidate substances may be chemical compounds. Batches of the candidate substances may be used in an initial screen of, for example, ten substances per reaction, and the substances from batches which show inhibition tested individually.
  • a polynucleotide or protein of the invention for use as a medicament or in diagnosis.
  • the polynucleotide or protein may be modified prior to use, preferably to produce a derivative or variant thereof.
  • the polynucleotide or protein may be derivatised.
  • the protein may be modified by epitope tagging, addition of fusion partners or purification tags such as glutathione S-transferase, multiple histidines or maltose binding protein, addition of green fluorescent protein, covalent attachment of molecules including biotin or fluorescent tags, incorporation of selenomethionine, inclusion or attachment of radioisotopes or fluorescent/non-fluorescent lanthanide chelates.
  • the polynucleotide may be modified by methylation or attachment of digoxygenin (DIG) or by addition of sequence encoding the above tags, proteins or epitopes.
  • DIG digoxygenin
  • the medicament is adapted to retard or prevent a fungal infection.
  • the fungal infection may be in human, animal or plant.
  • the polynucleotide or protein may be used for the development of a drug.
  • the polynucleotide or protein may be used in, or for the generation of, a molecular model of said polynucleotide or said protein.
  • a polynucleotide or protein of the invention for the preparation of a medicament for the treatment of a fungal infection.
  • the polynucleotide or protein may be modified prior to use, preferably to produce a derivative or variant thereof.
  • the polynucleotide or protein may be derivatised.
  • the polynucleotide or protein may not be modified or derivatised.
  • the medicament is adapted to retard or prevent a fungal infection.
  • the treatment may comprise retarding or preventing fungal infection.
  • the drug and/or medicament comprises an inhibitor.
  • the drug or medicament is adapted to inhibit expression and/or activity of the polynucleotide or a fragment thereof, and/or the function of the protein or a fragment thereof.
  • the fungal infection comprises an infection by a fungus, more preferably an Ascomycete, and even more preferably, an organism selected from the genera Aspergillus; Blume ⁇ a; Candida; Colletotrichium; Cryptococcus; Encephalitozoon; Fusarium; Histoplasma, Leptosphaeria; Magnaporthe; Mycosphaerella; Neurospora; Phytophthora; Plasmopara; Pneumocystis; Pyricularia; Pythium; Puccinia; Rhizoctonia, Trichophyton; and Ustilago.
  • a fungus more preferably an Ascomycete, and even more preferably, an organism selected from the genera Aspergillus; Blume ⁇ a; Candida; Colletotrichium; Cryptococcus; Encephalitozoon; Fusarium; Histoplasma, Leptosphaeria; Magnaporthe; Mycosphaerella; Neurospora; Phytophthora
  • the fungal infection comprises an infection by an organism selected from the genera Aspergillus.
  • the fungal infection comprises an infection by an organism selected from the species Aspergillus flavus; Aspergillus fumigatus; Aspergillus nidulans; Aspergillus niger; Aspergillus parasiticus; Aspergillus terreus; Blumeria graminis; Candida albicans; Candida cruzei; Candida glabrata; Candida parapsilosis; Candida tropicalis; Colletotrichium trifolii; Cryptococcus neoformans; Encephalitozoon cuniculi; Fusarium graminarium; Fusarium solani; Fusarium sporotrichoides; Histoplasma capsulata; Leptosphaeria nodorum; Magnaporthe grisea; Mycosphaerella graminicola; Neurospora crassa; Phytophthora capsici; Phytophthora infestans; Plasmopara viticola; Pneumocy
  • a recombinant DNA molecule or vector comprising a polynucleotide of the invention.
  • the recombinant DNA molecule or vector may comprise an expression cassette.
  • the recombinant DNA molecule or vector comprises an expression vector.
  • the polynucleotide sequence is operatively linked to an expression control sequence.
  • a suitable control sequence may comprise a promoter, an enhancer etc.
  • a cell containing a polynucleotide, recombinant DNA molecule or vector of the invention a polynucleotide, recombinant DNA molecule or vector of the invention.
  • the cell may be transformed or transfected with the polynucleotide, recombinant DNA molecule or vector by suitable means.
  • the cell produces a recombinant protein of the invention.
  • the invention also provides an organism which is transgenic for the polynucleotide of the invention (whose cells may be the same as the cells of the invention mentioned herein).
  • Such an organism is typically a fungus, such as any genera or species of fungus mentioned herein.
  • the organism may be a microorganism, such as a bacterium, virus or yeast.
  • the organism may be a plant, or animal (including birds and mammals), such as any of the animals mentioned herein.
  • the organism may be produced by introduction of the polynucleotide of the invention into a cell of the organism, and in the case of a multicellular organism allowing the cell to grow into a whole organism.
  • a cell in which a polynucleotide or protein of the invention is non-functional and/or inhibited.
  • the cell may be of, or present in, a multicellular organism.
  • the cell may be a mutant cell.
  • the cell is typically a fungal cell, such as of any genera or species of fungus mentioned herein.
  • a preferred means of generating the cell is to modify the polynucleotide of the invention, such that the polynucleotide is non-functional. This modification may be to cause a mutation, which disrupts the expression or function of a gene product. Such mutations may be to the nucleic acid sequences that act as 5' or 3' regulatory sequences for the polynucleotide, or may be a mutation introduced into the coding sequence of the polynucleotide. Functional deletion of the polynucleotide may be, for example, by mutation of the polynucleotide in the form of nucleotide substitution, addition or, preferably, nucleotide deletion.
  • the polynucleotide may be made non- functional and/or inhibited by: (i) shifting the reading frame of the coding sequence of the polynucleotide;
  • a preferred means of introducing a mutation into a polynucleotide is to utilize molecular biology techniques specifically to target the polynucleotide which is to be mutated. Mutations may be induced using a DNA molecule.
  • a most preferred means of introducing a mutation is to use a DNA molecule that has been especially prepared such that homologous recombination occurs between the target polynucleotide and the DNA molecule. When this is the case, the DNA molecule, which may be double stranded, may contain base sequences similar or identical to the target polynucleotide to allow the DNA molecule to hybridize to (and subsequently recombine with) the target.
  • the mutant cell may contain mutations of two different ILV3 genes, where the function of either or both gene products may be inhibited or abolished.
  • inhibitors include agents that prevent transcription of the polynucleotide, or prevent translation, expression or disrupt post-translational modification.
  • the inhibitor may be an agent that increases degradation of the gene product (e.g. a specific proteolytic enzyme).
  • the inhibitor may be an agent which prevents the polynucleotide product from functioning, such as neutralizing antibodies.
  • the inhibitor may also be an antisense oligonucleotide, or any synthetic chemical capable of inhibiting expression of the gene or the stability and/or function of the protein.
  • the inhibitor may also be a protein which interacts with a proetin of the invention prevent its function.
  • the inhibitor may also be an RNA molecule which causes inhibition by RNA interference.
  • the antisense polynucleotide or RNA molecule which causes RNA interference is an example of a polynucleotide of the invention.
  • an antibody exhibiting immunospecificity for a protein of the invention may be used as a diagnostic reagent.
  • the antibody may be monoclonal or polyclonal, and may be raised in mouse, rat, rabbit, chicken, turkey, horse, goat or donkey.
  • the antibody may be raised against one of the proteins of the invention, or may be raised against proteolytic or recombinant fragments.
  • antibody includes fragments which bind a protein of the invention. Such fragments include Fv, F(ab') and F(ab') 2 fragments, as well as single chain antibodies.
  • the antibodies and fragment thereof may be chimeric antibodies, CDR- grafted antibodies or humanised antibodies.
  • any of the therapeutic substances e.g. proteins, polynucleotides or modulators
  • Any such substance may be administered in a variety of dosage forms. It may be administered orally (e.g. as tablets, troches, lozenges, aqueous or oily suspensions, dispersible powders or granules), parenterally, subcutaneously, intravenously, intramuscularly, intrasternally, transdermally or by infusion techniques. The substance may also be administered as suppositories. A physician will be able to determine the required route of administration for each particular patient.
  • the substance is formulated for use with a pharmaceutically acceptable carrier or diluent.
  • the pharmaceutical carrier or diluent may be, for example, an isotonic solution.
  • solid oral forms may contain, together with the active compound, diluents, e.g. lactose, dextrose, saccharose, cellulose, corn starch or potato starch; lubricants, e.g. silica, talc, stearic acid, magnesium or calcium stearate, and/or polyethylene glycols; binding agents; e.g. starches, arabic gums, gelatin, methylcellulose, carboxymethylcellulose or polyvinyl pyrrolidone; disaggregating agents, e.g.
  • Such pharmaceutical preparations may be manufactured in known manner, for example, by means of mixing, granulating, tabletting, sugar-coating, or film coating processes.
  • Liquid dispersions for oral administration may be syrups, emulsions and suspensions.
  • the syrups may contain as carriers, for example, saccharose or saccharose with glycerine and/or mannitol and/or sorbitol.
  • Suspensions and emulsions may contain as carrier, for example a natural gum, agar, sodium alginate, pectin, methylcellulose, carboxymethylcellulose, or polyvinyl alcohol.
  • the suspensions or solutions for intramuscular injections may contain, together with the active compound, a pharmaceutically acceptable carrier, e.g. sterile water, olive oil, ethyl oleate, glycols, e.g.
  • Solutions for intravenous or infusions may contain as carrier, for example, sterile water or preferably they may be in the form of sterile, aqueous, isotonic saline solutions.
  • a therapeutically effective non-toxic amount of substance is administered.
  • the dose may be determined according to various parameters, especially according to the substance used; the age, weight and condition of the patient to be treated; the route of administration; and the required regimen. Again, a physician will be able to determine the required route of administration and dosage for any particular patient.
  • a typical daily dose is from about 0.1 to 50 mg per kg, preferably from about O.lmg/kg to lOmg/kg of body weight, according to the activity of the specific inhibitor, the age, weight and conditions of the subject to be treated, the type and severity of the disease and the frequency and route of administration.
  • daily dosage levels are from 5 mg to 2 g.
  • Modulators identified by the method of the invention may be administered to plants in order to prevent or treat fungal infections.
  • the modulators are normally applied in the form of compositions together with one or more agriculturally acceptable carriers or diluents and can be applied to the crop area or plant to be treated, simultaneously or in succession with further compounds.
  • the modulators of the invention can be applied together with carriers, surfactants or application-promoting adjuvants customarily employed in the art of formulation.
  • Suitable carriers and diluents correspond to substances ordinarily employed in formulation technology, e.g. natural or regenerated mineral substances, solvents, dispersants, wetting agents, tackifiers, binders or fertilizers.
  • a preferred method of applying the modulators of the present invention or an agrochemical composition which contains them is leaf application.
  • the number of applications and the rate of application depend on the intensity of infection by the fungus.
  • the active ingredients can also penetrate the plant through the roots via the soil (systemic action) by impregnating the locus of the plant with a liquid composition, or by applying the compounds in solid form to the soil, e.g. in granular form (soil application).
  • the active ingredients may also be applied to seeds (coating) by impregnating the seeds either with a liquid formulation containing active ingredients, or coating them with a solid formulation. In special cases, further types of application are also possible, for example, selective treatment of the plant stems or buds.
  • the active ingredients are used in unmodified form or, preferably, together with the adjuvants conventionally employed in the art of formulation, and are therefore formulated in known manner to emulsifiable concentrates, coatable pastes, directly sprayable or dilutable solutions, dilute emulsions, wettable powders, soluble powders, dusts, granulates, and also encapsulations, for example, in polymer substances.
  • the methods of application such as spraying, atomizing, dusting, scattering or pouring, are chosen in accordance with the intended objectives and the prevailing circumstances.
  • Advantageous rates of application are normally from 5Og to 5kg of active ingredient (a.i.) per hectare ("ha", approximately 2.471 acres), preferably from lOOg to 2kg a.i./ha, most preferably from 20Og to 50Og a.i./ha.
  • the formulations, compositions or preparations containing the active ingredients and, where appropriate, a solid or liquid adjuvant are prepared in known manner, for example by homogeneously mixing and/or grinding active ingredients with extenders, for example solvents, solid carriers and, where appropriate, surface- active compounds (surfactants).
  • Suitable solvents include aromatic hydrocarbons, preferably the fractions having 8 to 12 carbon atoms, for example, xylene mixtures or substituted naphthalenes, phthalates such as dibutyl phthalate or dioctyl phthalate, aliphatic hydrocarbons such as cyclohexane or paraffins, alcohols and glycols and their ethers and esters, such as ethanol, ethylene glycol, monomethyl or monoethyl ether, ketones such as cyclohexanone, strongly polar solvents such as N-methyl-2-pyrrolidone, dimethyl sulfoxide or dimethyl formamide, as well as epoxidized vegetable oils such as epoxidized coconut oil or soybean oil; or water.
  • aromatic hydrocarbons preferably the fractions having 8 to 12 carbon atoms, for example, xylene mixtures or substituted naphthalenes, phthalates such as dibutyl phthalate or dioctyl phthal
  • the solid carriers used e.g. for dusts and dispersible powders are normally natural mineral fillers such as calcite, talcum, kaolin, montmorillonite or attapulgite.
  • Suitable granulated adsorptive carriers are porous types, for example pumice, broken brick, sepiolite or bentonite; and suitable nonsorbent carriers are materials such as calcite or sand, hi addition, a great number of pregranulated materials of inorganic or organic nature can be used, e.g. especially dolomite or pulverized plant residues.
  • suitable surface-active compounds are nonionic, cationic and/or anionic surfactants having good emulsifying, dispersing and wetting properties.
  • surfactants will also be understood as comprising mixtures of surfactants.
  • Suitable anionic surfactants can be both water-soluble soaps and water-soluble synthetic surface-active compounds.
  • Suitable soaps are the alkali metal salts, alkaline earth metal salts or unsubstituted or substituted ammonium salts of higher fatty acids (chains of 10 to 22 carbon atoms), for example the sodium or potassium salts of oleic or stearic acid, or of natural fatty acid mixtures which can be obtained for example from coconut oil or tallow oil.
  • the fatty acid methyltaurin salts may also be used.
  • fatty sulfonates especially fatty sulfonates, fatty sulfates, sulfonated benzimidazole derivatives or alkylarylsulfonates.
  • the fatty sulfonates or sulfates are usually in the form of alkali metal salts, alkaline earth metal salts or unsubstituted or substituted ammoniums salts and have a 8 to 22 carbon alkyl radical which also includes the alkyl moiety of alkyl radicals, for example, the sodium or calcium salt of lignonsulfonic acid, of dodecylsulfate or of a mixture of fatty alcohol sulfates obtained from natural fatty acids.
  • These compounds also comprise the salts of sulfuric acid esters and sulfonic acids of fatty alcohol/ethylene oxide adducts.
  • the sulfonated benzimidazole derivatives preferably contain 2 sulfonic acid groups and one fatty acid radical containing 8 to 22 carbon atoms.
  • alkylarylsulfonates are the sodium, calcium or triethanolamine salts of dodecylbenzenesulfonic acid, dibutylnaphthalenesulfonic acid, or of a naphthalenesulfonic acid/formaldehyde condensation product.
  • corresponding phosphates e.g. salts of the phosphoric acid ester of an adduct of p-nonylphenol with 4 to 14 moles of ethylene oxide.
  • Non-ionic surfactants are preferably polyglycol ether derivatives of aliphatic or cycloaliphatic alcohols, or saturated or unsaturated fatty acids and alkylphenols, said derivatives containing 3 to 30 glycol ether groups and 8 to 20 carbon atoms in the (aliphatic) hydrocarbon moiety and 6 to 18 carbon atoms in the alkyl moiety of the alkylphenols.
  • non-ionic surfactants are the water-soluble adducts of polyethylene oxide with polypropylene glycol, ethylenediamine propylene glycol and alkylpolypropylene glycol containing 1 to 10 carbon atoms in the alkyl chain, which adducts contain 20 to 250 ethylene glycol ether groups and 10 to 100 propylene glycol ether groups. These compounds usually contain 1 to 5 ethylene glycol units per propylene glycol unit.
  • non-ionic surfactants are nonylphenolpolyethoxyethanols, castor oil polyglycol ethers, polypropylene/polyethylene oxide adducts, tributylphenoxypolyethoxyethanol, polyethylene glycol and octylphenoxyethoxyethanol.
  • Fatty acid esters of polyoxyethylene sorbitan and polyoxyethylene sorbitan trioleate are also suitable non-ionic surfactants.
  • Cationic surfactants are preferably quaternary ammonium salts which have, as N-substituent, at least one C 8 -C 22 alkyl radical and, as further substituents, lower unsubstituted or halogenated alkyl, benzyl or lower hydroxyalkyl radicals.
  • the salts are preferably in the form of halides, methylsulfates or ethylsulfates, e.g. stearyltrimethylammonium chloride or benzyldi(2-chloroethyl)ethylammonium bromide.
  • the agrochemical compositions usually contain from about 0.1 to about 99% preferably about 0.1 to about 95%, and most preferably from about 3 to about 90% of the active ingredient, from about 1 to about 99.9%, preferably from about 1 to 99%, and most preferably from about 5 to about 95% of a solid or liquid adjuvant, and from about 0 to about 25%, preferably about 0.1 to about 25%, and most preferably from about 0.1 to about 20% of a surfactant.
  • a surfactant preferably formulated as concentrates, the end user will normally employ dilute formulations. All of the features described herein may be combined with any of the above aspects, in any combination.
  • fungal target genes should be present in as broad a range of fungi as possible, but absent from humans.
  • a bioinformatics strategy was devised to identify such potential targets exploiting the availability of fungal and human genomes. Programs were written in PERL, and used publicly available downloaded databases and the BLAST algorithm (Altschul et al., 1990, J. MoI. Biol. 215:403-410). Predicted proteins from the A.
  • nidulans genome http://www.broad.mit.edu/ftp/pub/armotation/aspergillus/assemblyl/release3.1/asper gillus_nidulans_l_r3.1_proteins.fasta.gz
  • ftp://ftp.ncbi.nih.gov/refseq/H_sapiens/H_sapiens/protein/ were kept (i.e. E-value > le-4). This set was then blasted against N.
  • albicans orfs http://www-sequence.stanford.edu/group/candida/download.html
  • a set of 819 proteins with good homologs E-value ⁇ le-10
  • pan- fungal proteins
  • 2184 proteins which can be thought of as "filamentous-only” proteins.
  • the pan-fungal set was examined for enzymes or enzyme families. Surprisingly, four ILV3-like genes were identified, AN4058, AN6346, (Tables I and II), AN5138 and AN7358, each of which had an A. fumigatus ortholog. This contrasts with the presence of a single ILV3 gene in S. cerevisiae. Alignment of the four ILV3 genes with ILV3 genes from other organisms, followed by phylogentic analysis identified the two ILV3 genes given in table I as the closest to the S. cerevisiae ILV3 gene. This was supported by percentage identity values given in Table III., A phosphoinositol phospholipase C was also identified (see Table I).
  • ILV3_Sc ILV3 from S. cerevisiae
  • the A. fumigatus genes corresponding to the A. nidulans genes were identified as follows: The A. nidulans protein was blasted against the A. fumigatus genome (ftp://ftp.sanger.ac.uk/pub/pathogens/AJomigatus/AF.contigs.031704) to identify the matching region.
  • Suboptimal exon cutoff 1.00
  • WISE2 http://www.ebi.ac.uk/Wise2 ⁇ .
  • the predicted genes were compared with similar sequences using blast, the multiple alignment programs ClustalX (Thompson et al., 1997, Nucleic Acids Research, 24:4876-4882) and QAlign (Sameth et al., 2003, Bioinformatics 19, 1592-1593; http://www.ridom.de/qalign), and the alignment editor/viewer Align (Hepperle, D., 2001: Multicolor Sequence Alignment Editor. Institute of Freshwater Ecology and Inland Fisheries, 16775 Stechlin, Germany).
  • a gene of interest For a gene of interest to be suitable as a anti-fungal drug target, it is necessary to show that it is an essential gene by generating a knock-out strain in which the gene is disabled.
  • the genomic DNA is then used as the substrate for a tansposition reaction using the Epicentre Tn5 bacterial transposon into which fungal and bacterial selection markers have been inserted.
  • Suitable fungal selection genes are PyrG, hygromycin or zeomycin; suitable bacterial markers are kanamycin or zeomycin.
  • the transposed constructs are then screened by PCR to identify those where the transposon has inserted into the gene.
  • PCR primers are designed either to cover the whole gene, such that insertion of the transposon results in the appearance of a product of higher molecular weight, or to extend from the start or end of the gene into the transposon, such that a product is only obtained when the transposon has inserted.
  • the genomic DNA ⁇ ransposon construct is excised from pGEMT-easy with a restriction enzyme which cuts only in the vector (e.g., Notl or Dral) and then used to transform haploid fungal protoplasts by means of PEG-mediated transformation.
  • the fungi are grown under selective conditions, determined by the marker used, and transformants are picked. These are then screened by PCR using primers specific for the gene of interest: Replacement of the endogenous gene with the transposon-modified gene results in a single band of higher molecular weigh by PCR. Therefore, if the modified gene is observed, the gene is not essential. However, if none of the transformants show gene replacement, the gene of interest may be an essential gene. In this case, the transformation is then carried out on diploids using the same method and essentiality of the gene is tested by rehaploidisation followed by examination of the segregation pattern in haploids.
  • ILV34A mutant construct A PCR was set up with Extensor master mix, A. fumigatus genomic DNA, and primers ILV34A_F1 and ILV34A_R1 (SEQ ID Nos. 72 and 73). The resulting 5889 bp PCR product was gel purified (Qiaquick gel purification kit, Qiagen) and ligated into pGEMTeasy overnight at 4 0 C (Promega). 1 ⁇ l of the ligation mix was transformed into Electrocompetent E. coli Genehogs (Invitrogen) by electroporation. Transformed cells were plated on LB-Ampicillin-IPTG-Xgal agar plates and incubated at 37 0 C overnight.
  • Plasmid DNA was isolated by Qiaprep miniprep DNA isolation (Qiagen). Notl digestion of the plasmid DNA indicated whether a 5.9 kb insert was present and the presence of ILV34 DNA was confirmed by PCR reactions using the following PCR primer sets: a) SEQ ID Nos 72 and 73; b) SEQ ID No. 74 and 75. Plasmids yielding the following size PCR products were deemed to be pGEMTEasy_ILV34A: a) 5889 bp, b) 1930 bp.
  • a plasmid, pMB4zeo was constructed that contained the mosaic ends recognised by the TN5 transposase, an Aspergillus fumigatus pyr G sequence and a bacterial zeocin resistance gene.
  • the pyrG cassette was prepared with EcoRI sites flanking the genomic pyrG sequence. This cassette was introduced into the EcoRI site of pMOD2 (Epicentre).
  • a zeocin resistance cassette was sub-cloned from an Xbal-NheI fragment of pEMzeo (Invitrogen) into the Xbal site.
  • pMB4zeo was digested with PshAI and Xmnl and the 2551 bp fragment obtained was gel purified.
  • This fragment contained mosaic ends for transposition, an Aspergillus pyrG cassette and a bacterial zeocin resistance marker.
  • pGEMTEasy_ILV34A was mutated by transposition with the EZ::TN transposase kit (Epicentre) using PshAI- MB4zeo. The following were assembled in a microcentrifuge: 1 ⁇ l EZ::TN 1OX Reaction Buffer, 1 ⁇ l pGEMTEasy_ILV34A, 1 ⁇ l PshAI-MB4zeo, 6 ⁇ l sterile water, 1 ⁇ l EZ::TN Transposase.
  • the reaction mixture was incubated for 2 hours at 37 0 C. 1 ⁇ l EZ::TN 1OX Stop Solution was added, mixed and heated for 10 minutes at 7O 0 C. 1 ⁇ l of the stopped reaction was transformed into Electrocompetent E. coli Genehogs (Invitrogen) by electroporation. Transformed cells were plated on LB-Ampicillin- zeocin agar plates and incubated at 37°C overnight. Colonies were picked into LB- ampicillin broth and incubated at 37 0 C overnight with shaking at 220 rpm. Plasmid DNA was isolated by Qiaprep miniprep DNA isolation (Qiagen). Plasmids were screened by PCR using primer SEQ ID No. 74 and 80.
  • a plasmid was selected that gave a PCR product of approximately 600bp indicating that the transposon PshAI- MB4zeo had inserted approximately 600 bp from the ATG start site of the coding sequence, thus disrupting the gene.
  • This plasmid was designated ILV34A_KO33.
  • the plasmid was digested with Notl and the 8.4kb fragment gel purified. This fragment was used for fungal transformation.
  • a BAC containing a genomic copy of ILV1352 was was isolated and used as a template for a PCR with Extensor master mix and primers SEQ ID Nos. 76 and 77.
  • the resulting 5958 bp PCR product was purified (Qiaquick gel purification kit,
  • a plasmid was constructed for transposition of a hygromycin resistance cassette. Firstly, The bacterial zeocin resistance cassette from pEMzeo was introduced into the EcoRI site of pMOD2 between the mosaic ends. Then, the zeocin resistance cassette together with the mosaic ends were amplified by PCR including Spel sites on the primers. The product was then digested with Spel and ligated into the Spel site of pGEMTeasy. The hygromycin resistance cassette was then cloned into the Xba I site. The resulting plasmid (named pPH8) was digested with Spe I and Xmn I to yield a 3649 bp fragment which was gel purified.
  • This fragment contained mosaic ends for transposition, an Aspergillus hygromycin resistance cassette and a bacterial zeocin resistance marker.
  • pGEMTEasy_ILV1352 was mutated by transposition with the EZ::TN transposase kit (Epicentre) using (SpeI_PH8). The following were assembled in a microcentrifuge: 1 ⁇ l EZ::TN 1OX Reaction Buffer, 1 ⁇ l pGEMTEasy_ILV1352, 2 ⁇ l (SpeI_PH8), 5 ⁇ l sterile water, 1 ⁇ l EZ::TN Transposase. The reaction mixture was incubated for 2 hours at 37 0 C.
  • a PCR product of approximately 900bp indicated that the transposon PshAI-MB4zeo had inserted approximately 900 bp from the ATG start site of the coding sequence, thus disrupting the gene.
  • the mutant plasmid was designated ILV1352_KO21.
  • the plasmid was digested with Dral and the ⁇ 12kb fragment was gel purified. This fragment was used for fungal transformation.
  • the diploid ILV1352 knockout was then transformed with the ILV34A mutant construct and resulting colonies screened by PCR with primers SEQ ED Nos. 80 and 81. Positive clones were checked extensively by PCR and Southern blotting. The diploid was haploidised on benomyl SAB plus uridine and uracil. Haploid spores were assessed for the presence of the hygromycin and pyrG selective markers. No growth was seen when haploid spores were plated on media without uridine and uracil but with hygromycin, indicating that the double knockout was lethal.
  • E. coli Select96 cells (Promega) are used in accordance with manufacturers' instructions.
  • A.fumigatus clinical isolate AF293 (ref. No. NCPF7367; available to the public from the NCPF repository; Bristol, U.K.); the CBS repository (Belgium) or from Dr. David Denning' s clinical isolate culture collection, Hope Hospital, Salford. U.K.) is the preferred strain according to the present invention.
  • AF293 was isolated in 1993 from the lung biopsy of a patient with invasive aspergillosis and aplastic anaemia. It was donated by Shrewsbury PHLS.
  • the mycelium (fresh or freeze dried) is ground to a powder using liquid nitrogen in a mortar cooled to -2O 0 C.
  • the ground biomass is transferred to 50 ml tubes on ice up to the 10 ml mark.
  • An equal volume of extraction buffer (0.7 M NaCl; 0.1 M Na 2 SO 3 ; 0.1 M Tris-HCl pH 7.5; 0.05 M EDTA; l%(w/v) SDS; pre- warmed to 65 0 C) is then added to each tube, mixed thoroughly with a pipette tip and incubated at 65 0 C for 20 minutes in a water bath.
  • a volume of chloroform/isoamyl alcohol (24:1) equivalent to the volume of the original biomass is then added to each tube, tubes are mixed thoroughly and incubated on ice for 30 min. Tubes are then centrifuged at 3,500 x g for 30 min and the aqueous phase carefully transferred to fresh 50 ml tubes without disturbing the interface.
  • the pellet is suspended in 2 ml sterile water. 1 ml of 7.5 M ammonium acetate is added, mixed and incubated on ice for 1 hour. Tubes are centrifuged at 12,000 x g for 30 min, the supernatants transferred to a fresh tube and 0.54 volumes of isopropanol are added, mixed and incubated at room temperature for at least 15 minutes. Tubes are then centrifuged at 5,930 x g for 10 min, the supernatant is removed and the pellet washed in 1 ml of 70% ethanol. Tubes are centrifuged at 5,930 x g for 10 min and all the ethanol is removed.
  • the pellet is air dried for 20-30 minutes at room temperature and suspended in 0.5-1.0 ml of TE (10 mM Tris-HCl pH 7.5; 1 mM EDTA) Finally, the DNA is treated with RNase A (5 ⁇ l of lmg/ml stock).
  • Primers pairs are designed to the upstream and downstream regions of the A. fumigatus AF293 genes: The 200-base regions flanking the gene of interest are used as input sequence for Primer3 (http://frodo.wi.mit.edu/cgi- bin/primer3/primer3_www.cgi) to provide a primer pair that spans the gene. If the gene is particularly long it may be necesssary to design primer pairs with internal sequences and thus sequence the gene in parts. The following reagents and conditions are used:
  • PCR cycles are as follows: (1) 95° C, 2 min; (2) 95° C, 30 sec; (3) 54° C, 30 sec; (4) 72° C, 2 min; (5) 72° C, 10 min; (6) 8° C, hold. 40 cycles of steps 2-4 are carried out and the PCR products are run on a gel.
  • the product band is excised from the gel and purified using QIAquick Gel Extraction Kit (Qiagen Ltd, Boundary Court, Gatwick Road, Crawley, West Wales, RHlO 9AX, UK) according to the manufacturers instructions and eluted into 30 ⁇ l of sterile water (BDH molecular biology grade/filter sterile).
  • the PCR product is then ligated into pGEM-Teasy (Promega) using the following ligation mixture: 2x Buffer, 5 ⁇ l; pGEM Teasy, 1 ⁇ l; PCR product, 3 ⁇ l; T4 DNA Ligase,l ⁇ l. The reaction is incubated overnight at 4° C.
  • Plasmid DNA is extracted using Qiagen miniprep kit according to the manufacturers instructions. 1 ⁇ l of plasmid DNA is digested with restriction enzymes for 1 hour at 37° C. Results are compared with the predicted sizes for constructs and clones showing the correct restriction digest pattern are sequenced at MWG Biotech UK Ltd, Waterside House, Peartree Bridge, Milton Keynes, MK6 3BY.
  • the internal sequences of the genes of interest are experimentally determined by cloning and sequencing cDNA, and the 5' and 3' ends of the genes are determined by RACE (Rapid Amplification of cDNA Ends).
  • step 3 RLC was used as the lysis buffer of choice;
  • step 7 the Rneasy column was incubated for 5 min at room temperature after addition of RWl;
  • step 9a was carried out;
  • step 10 30 ⁇ l RNase-free water was added, the samples incubated for 10 min at room temperature, and then centrifuged;
  • step 11 the elution step was repeated to give a total volume of 60 ⁇ l RNA.
  • RNA contamination was removed from the RNA by the addition of Dnase, using 2 ⁇ l DNase per ⁇ g RNA, in the presence of 1OX DNase buffer and incubating at 37°C for 2h.
  • DNase-treated RNA was cleaned up using the RNeasy Plant Mini Kit following the RNeasy Mini Protocol for RNA Cleanup (RNeasy Mini Handbook 06/2001, pages 79-81).
  • RNA-free RNA 100 ng-1 ⁇ g of DNA-free RNA, 3 ⁇ l oligo (dT) (100 ng/ ⁇ l), and DEPC- treated water to a total volume of 42 ⁇ l. Samples were incubated in a heat block at 65 0 C for 5 min after which they were allowed to cool slowly to room temperature. Then 2 ⁇ l Ultrapure dNTPs, 1 ⁇ l reverse transcriptase (Stratascript) and 5 ⁇ l 1OX reverse transcriptase reaction buffer (Stratascript) were added. Samples were incubated at 42°C for Ih, denatured at 9O 0 C for 5 min and then cooled on ice.
  • PCR is carried out using the cDNA above to generate cDNA fragments. Primers are designed based on the 5' and 3' ends of the predicted genes. PCR reactions are carried out using the following reagents and conditions:
  • PCR cycles are run as follows; (1) 94° C, 5 min; (2) 94° C, 30 sec; (3) 53° C, 30 sec; (4) 68° C, 90 sec; (5) 68° C, 10 min; (6) 8° C, pause. Cycles 2-4 are run 40 times.
  • the PCR products are purified using QIAquick PCR Purification Kit (Qiagen Ltd, Boundary Court, Gatwick Road, Crawley, Westshire, RHlO 9AX, UK) according to the manufacturers instructions and run on agarose gels. PCR products are ligated into pGEM-Teasy, used to transform Select 96 cells, and sequenced as described in Example 3 above.
  • RACE Rapid Amplification of cDNA Ends
  • RNA was prepared using the FastRNA kit (QBIOgene) following the manufacturer's instructions (Revision 6030-999-1 J05) with the following amendments: At step 1, 40 mg of biomass was used per extraction; At step 2, samples were processed for 20 seconds at speed 5, incubated on ice for 3 minutes, and processed again for 20 seconds at speed 5; At step 3 samples were centrifuged for 5 minutes; At step 5, 500 ⁇ l DIPS were added, mixed, and incubated at room temperature for 2 minutes. Samples were mixed again and incubated for a further 2 minutes; At step 6 two washes in 250 ⁇ l SEWS were carried out; At step 7, the pellet was disolved in 50 ⁇ l SAFE buffer.
  • RNA 1 ⁇ g total RNA prepared as described above was de-phosphorylated in a 10 ⁇ l reaction using 10 units of calf intestinal phosphate (CIP), 1 ⁇ l 1OX CIP buffer and 4OU RNaseOutTM (made up to 10 ⁇ l in DEPC water) at 5O 0 C for 1 hour. Samples were then made up to 100 ⁇ l with DEPC water and the RNA extracted with 100 ⁇ l (25:24:1) phenol:chloroform:isoamyl alcohol. RNA was then precipitated by the addition of 2 ⁇ l mussel glycogen (10 mg/ml), 10 ⁇ l 3M sodium acetate, pH 5.2 and 220 ⁇ l 95% ethanol and the sample frozen on dry ice for 10 minutes. RNA was pelleted by centrifugation at 14,500 rpm for 20 minutes at 4 0 C, washed with 70% ethanol, air dried and re-suspended in 8 ⁇ l DEPC water.
  • CIP calf intestinal phosphate
  • RNA was de-capped in a 10 ⁇ l reaction with 0.5 U tobacco acid pyrophosphatase (TAP), 1 ⁇ l 10x TAP buffer and 40 U RnaseOutTM for 1 hour at 37 0 C. RNA was extracted with phenolxhloroform and precipitated as above, and then re-suspended in 7 ⁇ l DEPC-treated water.
  • TAP tobacco acid pyrophosphatase
  • First-strand cDNA is prepared by the addition of 1 ⁇ l GeneRacerTM Oligo dT primer and 1 ⁇ l dNTP mix (1OmM each) to 10 ⁇ l ligated RNA and incubated at 65 0 C for 5 minutes. The following reagents were added to the 12 ⁇ l ligated RNA and primer mix; 4 ⁇ l 5x first strand buffer, 2 ⁇ l 0.1 M DTT, 1 ⁇ l RNaseOutTM and 1 ⁇ l SuperscriptTM II RT (200 U/ ⁇ l) and incubated first at 42 0 C for 50 minutes and then, to stop the reaction, at 7O 0 C for 15 minutes. 2 U RNase H was added to the reaction mix and incubated at 37 0 C for 20 minutes. To amplify the 5'cDNA ends a 50 ⁇ l PCR reaction is set up using 1 ⁇ l of the
  • RACE-ready cDNA prepared above, 1 ⁇ l GeneRacerTM 5' primer, -1 ⁇ l reverse gene- specific primer (designed against the complementary strand of the coding sequence: 5 pmol/ ⁇ l stock), 1 ⁇ l dNTP solution (10 mM each), 2 ⁇ l 50 mM MgSO 4 , 5 ⁇ l High Fidelity PCR buffer, 0.5 ⁇ l Platinum® Taq DNA Polymerase High Fidelity (5 U/ ⁇ l) and 38.5 ⁇ l sterile water. Cycling parameters are given in Table V below.
  • a second, nested PCR stage may also be carried out. This is set up using 1 ⁇ l of the RACE cDNA from the first stage above, 1 ⁇ l Nested 5' primer (supplied with kit), 1 ⁇ l second reverse gene-specific primer (designed against the complementary strand of the coding sequence and nested with respect to the above primer: 5 pmol/ ⁇ l stock), 1 ⁇ l dNTP solution (10 mM each), 2 ⁇ l 50 niM MgSO 4 , 5 ⁇ l High Fidelity PCR buffer, 0.5 ⁇ l Platinum® Taq DNA Polymerase High Fidelity (5 U/ ⁇ l) and 38.5 ⁇ l sterile water. Cycling parameters are given in Table V below.
  • a 50 ⁇ l PCR reaction is set up using 1 ⁇ l of the RACE- ready cDNA prepared above, 1 ⁇ l GeneRacerTM 3' primer (10 ⁇ M), 1 ⁇ l forward gene-specific primer (designed against the coding strand of the coding sequence : 5 pmol/ ⁇ l stock), 1 ⁇ l dNTP solution (10 mM each), 2 ⁇ l 50 mM MgSO 4 , 5 ⁇ l High Fidelity PCR buffer, 0.5 ⁇ l Platinum® Taq DNA Polymerase High Fidelity (5 U/ ⁇ l) and 38.5 ⁇ l sterile water. Cycling parameters are given in Table V below:
  • a second, nested PCR stage may also be carried out. This is set up using 1 ⁇ l of the 3' RACE cDNA from the first stage above, 1 ⁇ l Nested 3' primer (supplied with kit), 1 ⁇ l reverse gene-specific primer (designed against the coding strand of the coding sequence and nested with respect to the above primer: 5 pmol/ ⁇ l stock), 1 ⁇ l dNTP solution (10 mM each), 2 ⁇ l 50 mM MgSO 4 , 5 ⁇ l High Fidelity PCR buffer, 0.5 ⁇ l Platinum® Taq DNA Polymerase High Fidelity (5 U/ ⁇ l) and 38.5 ⁇ l sterile water. Cycling parameters are given in Table V below.
  • 5' and 3' RACE identify the 5' ATG and 3' stop codons as well as giving the 5' and 3' untranslated regions of the genes.
  • a 50 ⁇ l PCR reaction was set up using 1.5 ⁇ l of RACE-ready cDNA prepared as described above, 3 ⁇ l GeneRacerTM 5' primer, 1 ⁇ l reverse gene-specific primer (designed against the complementary strand of the coding sequence; SEQ ID No. 67: 10 pmol/ ⁇ l stock), 1 ⁇ l dNTP solution (10 mM each), 2 ⁇ l 50 mM MgSO 4 , 5 ⁇ l High Fidelity PCR buffer, 1 ⁇ l Platinum® Taq DNA Polymerase High Fidelity (5 U/ ⁇ l) and 36 ⁇ l sterile water. Cycling parameters are given in Table VI below. 5' RACE confirmed the predicted 5' start site and first intron ofILV34.
  • a 50 ⁇ l PCR reaction was set up using 1 ⁇ l of the RACE-ready cDNA prepared above, 1 ⁇ l GeneRacerTM 5' primer, 1 ⁇ l reverse gene-specific primer (designed against the complementary strand of the coding sequence: SEQ ID No. 68; 5 pmol/ ⁇ l stock), 1 ⁇ l dNTP solution (10 mM each), 2 ⁇ l 50 mM MgSO 4 , 5 ⁇ l High Fidelity PCR buffer, 0.5 ⁇ l Platinum® Taq DNA Polymerase High Fidelity (5 U/ ⁇ l) and 38.5 ⁇ l sterile water. Cycling parameters are given in Table VI below. A 550 b.p. product was cloned into pCR4- Topo as per manufacturers instructions and sequenced using T7 and T3 sequncing primers. 5' RACE confirmed the predicted 5' start site of ILV1352.
  • Homo logs of the proteins or polynucleotides of the invention can be identified in other fungi by means of bio informatics analysis. Sequences identified by bioinformatics can be used to design primers which in turn can be used in PCR to generate DNA coding for the homologs. Alternatively, degenerate PCR can be used to obtain sequence, which can then be used to generate probes for screening cDNA or genomic libraries of the organism of interest to identify clones containing the homologs. As a further alternative Southern blots, using fragments of genes from one species as probes, can be used to identify the presence of a homolog in the genome of a second species. The same probe can then be used to screen cDNA or genomic DNA libraries. Once clones corresponding to the novel genes have been identified they can be expressed for functional characterisation of the protein.
  • Homologs of the proteins and polynucleotides of the invention can be identified by searching locally held databases, as detailed in Table VII, using BLAST with SEQ ID Nos: 3, 6, 9, 12, 15, 18, 21, 24, 27, 30, 33, 36, 39, 42, 45, 48, 50, 53, 56, 59, 61 or 63 as the query sequence. Where necessary, matching contigs are down-loaded and genes predicted from genomic DNA as described in Example 1. Alternatively, BLAST searches can be carried out over the web.
  • SEQ ID Nos: 3, 6, 9, 12, 15, 18, 21, 24, 27, 30, 33, 36, 39, 42, 45, 48, 50, 53, 56, 59, 61 or 63, and hits identified from blast searches above can be clarified by phylogenetic analysis, for example using the PHYLIP suite of programs (Felsenstein, Felsenstein, J., 2002. PHYLIP (Phytogeny Inference Package) version 3.6a3. Distributed by the author. Department of Genome Sciences, University of Washington, Seattle). A distance matrix is generated using PROTDIST with the Jones-Taylor-Thornton model and a tree inferred using FITCH with global rearrangements and 10 jumbles of input order.
  • ILV3 sequences in filamentous fungi other than A. nidulans and A. fumigatus were identified by means of the methods described above and by BLAST searches against the NCBI nr database. Protein sequences were aligned with Aspergillus ILV3 proteins and gene predictions improved where necessary. The resulting sequences (SEQ ID Nos 37-63) are summarised in Table II. From the alignment, it was possible to cluster the ILV3 sequences into two groups of orthologs, indicated in the table as group I, clustering with A. fumigatus sequence SEQ ED No. 21, and group II, clustering with A. fumigatus SEQ ID No. 12.
  • Fungal cultures are prepared using methods suitable for particular species. For example, Aspergillus and Candida species, Cryptococcus neoformans, Fusarium solani and Trichophyton species are maintained on Sabouraud dextrose agar at 30- 35°C; Leptosphaeria nodorum on Malt agar medium (30 g/L malt extract; 15 g/L Bacto-agar, pH 5.5), 24.0 0 C; Magnaporthe grisea on oatmeal agar (6.1 g/L agar, 53.3 g/L instant oatmeal) 25.O 0 C, or Cornmeal agar (Difco 0386), 26.O 0 C; Phytophthora capsici cultures are maintained on on V-8 agar at 24°C; Pyricularia oryzae cultures are maintained on rice polish agar at 24 0 C under white fluorescent lights (12 hr artificial day), and are subcultured every 7 - 14 days by the transfer of mycelial plugs to fresh plates;
  • Primers are designed to correspond to regions conserved between the gene of interest and its homologs (identified as described above). Those skilled in the art will appreciate that it may be necessary to try a range of primer pairs. PCR reactions using the primer pairs are set up as follows: 2x ReddyMix PCR mastermix (ABgene) 12.5 ⁇ l
  • the reactions are run using the following conditions on a Biometra personal PCR cycler (Thistle Scientific Ltd, DFDS House, Goldie Road, Uddington, Glasgow, G71 6NZ): (1) 95 0 C, 5min; (2) 95 0 C, lmin; (3) 53 0 C, lmin 30sec; (4) 68°C, 2min 30sec; (5) 72 0 C, lOmin; (6) 4 0 C, Hold. 30 cycles of steps 2-4 are carried out.
  • a Biometra personal PCR cycler Thistle Scientific Ltd, DFDS House, Goldie Road, Uddington, Glasgow, G71 6NZ
  • the PCR products are purified (to remove residual enzymes and nucleotides) using Qiagen's QIAquick PCR Purification Kit (Qiagen Ltd, Boundary Court, Gatwick Road, Crawley, Westshire, RHlO 9AX, UK) according to the manufacturers instructions and eluted into 40 ⁇ l of sterile water (BDH molecular biology grade/filter sterile).
  • the purified PCR products are examined on 1% agarose gels.
  • degenerate PCR may require variations in a number of parameters in the attempt to generate a product. These include primer concentration, template concentration, concentration OfMg 2+ ions, elongation and annealing times, and annealing temperature.
  • Variations in temperature can be accomodated by the use of a gradient PCR machine.
  • the purified PCR products are cloned into pPEM-Teasy (Promega) and then transformed into XL 10-Go Id ® Kan ultracompetent E. coli cells according to the manufacturers instructions.
  • the transformation reactions are then plated onto LB agar plates containing ampicillin (100 ⁇ g/ml), 50 ⁇ l X-gal (4%) and 10 ⁇ l IPTG (100 mM). Following overnight incubation at 37 0 C, individual white colonies from each transformation are sub-cultured into LB broth containing ampicillin (100 ⁇ g/ml). After overnight incubation at 37 0 C with shaking, plasmids are extracted using Qiagen spin mini plasmid extraction kits according to the manufacturers instructions and sent away for full-length sequencing.
  • Genomic DNA from the fungi of interest are digested with the appropriate restriction enzyme and run on 0.8 % agarose gel. The gel is then submerged in 250 mM HCl for no more than 10 mins, with shaking, at room temperature, after which the gel is rinsed with sterilised RO water.
  • Transfer of the DNA onto nylon membrane is carried out using 0.4 M NaOH. Transfer protocols and apparatus are well known and are described in e.g. Sambrook et al., (1989), Molecular Cloning, 2 nd Edition., Cold Spring Harbor Laboratory Press. After transfer, the DNA is fixed to the membrane by baking at 12O 0 C for 30 min. The membrane can then be used immediately, or stored dry for future use.
  • Probes are generated either by restriction digests of DNA or by PCR of an appropriate region.
  • a suitable probe can be generated by PCR using a primer pair designed using Primer3 (http://frodo.wi.mit.edu/cgi-bin/primer3/primer3_www.cgi) and A. fumigatus genomic DNA.
  • 1 ⁇ g DNA template is diluted in molecular biology water to a total volume of 16 ⁇ l, denatured in a boiling water bath for 10 mins, and quickly chilled on ice.
  • 4 ⁇ l DIG-High Prime (1 mM dATP, 1 mM dCTP, 1 mM dGTP, 0.65 mM dTTP, 0.35 mM alkali-labile-digoxygenin-11-dUTP, 1 U/ ⁇ l labelling grade Klenow enzyme, 5 x reaction buffer, in 50% (v/v) glycerol) is then added and the reaction incubated at 37 0 C for 20 hours, after which 2 ⁇ l of 200 mM EDTA pH 8.0 is added to terminate the labelling reaction.
  • the labelling efficiency is estimated by comparison with DIG-labelled control DNA.
  • the membrane is placed in a hybridisation tube containing 20 ml of prehybridisation solution (DIG Easy Hyb, Roche) per 100cm 2 of membrane surface area and prehybridised at 42°C for 2 hours in a hybridisation oven.
  • the DIG- labelled probe is denatured by heating in a boiling water bath for 10 min and then chilled directly on ice.
  • the probe is then diluted to -200 ng/mL in hybridisation solution (Easy Hyb, Roche; at least 5 mL of hybridisation solution is required per hybridisation).
  • the prehybridisation solution is discarded from the hybridization tube and the hybridisation solution containing the DIG-labelled probe added quickly.
  • the hybridisation then proceeds overnight at a 42°C in the hybridisation oven.
  • the optimum temperature is dependant on probe size and homology with target sequence and is determined empirically.
  • the membrane is washed twice at 42 0 C, 5 mins per wash, with 50 mL of stringency wash solution (3 x SSC, 0.1% SDS; where 20 x SSC buffer is 3 M NaCl, 300 mM sodium citrate, pH 7.0), followed by two washes at RT, 15 min per wash, in 50 mL stringency wash solution.
  • stringency wash solution 3 x SSC, 0.1% SDS; where 20 x SSC buffer is 3 M NaCl, 300 mM sodium citrate, pH 7.0
  • the stringency of these washes can be decreased by increasing the SSC concentration to 6 x SSC, 0.1% SDS and/or decreasing the wash temperatures.
  • the membrane is washed in 20 mL washing buffer (100 mM maleic acid, 150 mM NaCl; pH 7.5; 0.3% v/v Tween 20), and then incubated successively with the following; 20 mL blocking solution (1% w/v blocking reagent for nucleic acid hybridisation, Roche, dissolved in 100 mM maleic acid, 150 mM NaCl, pH 7), for 30 min at room temperature; Anti-DIG- alkaline phosphatase (Roche) diluted 1 : 5,000 in blocking buffer, 30 min at room temperature; Washing buffer, two washes each of 15 min at room temperature; Detection buffer (10OmM Tris-HCl, 100 mM NaCl; pH 9.5), 2 min at room temperature.
  • 20 mL washing buffer 100 mM maleic acid, 150 mM NaCl; pH 7.5; 0.3% v/v Tween 20
  • 20 mL blocking solution 1% w/v blocking reagent for nucleic
  • the membrane is then removed, placed on top of an acetate sheet, and ⁇ 0.5 ml (per 100cm 2 ) of CSPD or CDP-star added to the top of the membrane.
  • a second sheet of acetate is then placed over the surface of the membrane, the assembly incubated for 5 min at room temperature and then sealed in a plastic bag.
  • the assembly is then exposed to X-ray film for between 15 min and 1 hour. Optimal exposure time is determined empirically by increasing exposure time up to 24 hours.
  • the presence of a band on the gel is evidence of a gene in the genomic DNA of interest. The molecular weight of the band depends on the size of the restriction ⁇ fragment that contains the gene.
  • Example 6 Expression of recombinant proteins and/or fragments Recombinant proteins or fragments are expressed to enable detailed study of function and for the development of an in vitro high-throughput screen for inhibitory compounds.
  • PCR is carried out using cDNA, prepared as described above, to generate polynucleotides encoding protein sequence essentially corresponding to SEQ ID Nos. 3, 6, 9, 12, 15, 18, 21, 24, 27, 30, 33, 36, 39, 42, 45, 48, 50, 53, 56, 59, 61 or 63.
  • Primers are designed to encode the 5' and 3' ends of the coding sequences, with the addition of bases necessary to anneal with the pET-30 Xa/LIC vector (5' additional sequence, GGTATTGAGGGTCGC; 3' additional sequence,
  • AGAGGAGAGTTAGAGCC AGAGGAGAGTTAGAGCC. If the protein has an N-terminal leader peptide, this should be excluded. If the protein is made up of multiple domains, it may be desirable or necessary to express only a limited number of domains, or even a single domain. In these cases, primers are designed to correspond to domain boundaries. PCR reactions are carried out using the following reaction mixture and conditions. All Reagents are present in the KOD kit (Novagen).
  • PCR reactions are run using the following conditions: (1) 94 0 C, 5 min; (2) 94 0 C, 1 min;
  • Bacteria are harvested by centrifugation at 4500 rpm for 10 minutes and the pellets lysed in lysis buffer (10 ml Bugbuster (Novagen), 10 ⁇ l Benzonase (Novagen), 0.4 ⁇ l lysozyme (Novagen) and 100 ⁇ l IM imadazole for 20 minutes at room temperature. Cells are then spun down at 1600Og for 20' at 4° C and the supernatant, containing soluble recombinant protein, removed to a clean tube.
  • lysis buffer 10 ml Bugbuster (Novagen), 10 ⁇ l Benzonase (Novagen), 0.4 ⁇ l lysozyme (Novagen) and 100 ⁇ l IM imadazole for 20 minutes at room temperature.
  • Cells are then spun down at 1600Og for 20' at 4° C and the supernatant, containing soluble recombinant protein, removed to a clean tube.
  • Alternative expression systems can be used for expression in bacteria, such as the glutathione S-transferase or mannose-binding fusion-protein system.
  • PCRs wre run using the following conditions: (1) 94 0 C, 5 min; (2) 94 0 C, 1 min; (3) 59 0 C, 1 min 30 sec; (4) 68 0 C, 1 min 30 sec; (5) 68°C, 10 min; (6) 8°C, hold. 40 cycles of steps 2-4 were carried out and the PCR products purified using QIAquick PCR Purification Kit (Qiagen Ltd, Boundary Court, Gatwick Road,
  • Bacteria were harvested by centrifugation at 3500g for 10 minutes and the pellets lysed in lysis buffer (24 ml Bugbuster, Novagen; 24 ⁇ l Benzonase, Novagen; 0.4 ⁇ l rLysozyme, Novagen; and 1200 ⁇ l IM imidazole) for 20 minutes with mixing at room temperature. Cell debris was then removed by centrifuging the sample at 1600Og for 20 minutes at 4°C and the supernatant, containing soluble protein, removed to a clean tube. Supernatant was added to pre-washed Ni-NTA resin at a concentration of approximately 25 mg protein per ml of resin and allowed to bind for 1 hour at 4°C with mixing.
  • lysis buffer 24 ml Bugbuster, Novagen; 24 ⁇ l Benzonase, Novagen; 0.4 ⁇ l rLysozyme, Novagen; and 1200 ⁇ l IM imidazole
  • Protein/resin mix was then poured into a large disposable plastic column, washed twice in 7.5 ml wash/bind buffer (2.5 ml IM Na 2 HPO 4 pH8.0, 6.25 ml 4 M NaCl, 1 ml 1 M imidazole pH8.0, 0.5 ml 10% Tween 20, made up to 50 ml with dH 2 O) and then eluted in 6.5 ml elution buffer (1 ml 1 M Na 2 HPO 4 , pH8.0, 2.5 ml 4 M NaCl, 5 ml 1 M imidazole pH8.0, 200 ⁇ l 10% Tween 20, 200 ⁇ l protease inhibitor cocktail III, made up to 20 ml with dH 2 O).
  • wash/bind buffer 2.5 ml IM Na 2 HPO 4 pH8.0, 6.25 ml 4 M NaCl, 1 ml 1 M imidazole pH8.0, 0.5 ml 10% Tween 20, made up to 50 ml with dH 2 O
  • DNA Polymerase kit (Novagen); 2.5 ⁇ l 1Ox PCR buffer, 2.5 ⁇ l dNTPs (2 mM), 1 ⁇ l MgSO 4 (25 mM), 1.5 ⁇ l each primer (5 pmol/ ⁇ l), 1 ⁇ l template cDNA, 15 ⁇ l nuclease-free water, 0.5 ⁇ l KOD Hot Start polymerase.
  • PCRs were run using the following conditions: (1) 95°C, 5 min; (2) 95 0 C, 1 min; (3) 56 0 C, 1 min 30 sec; (4) 68 0 C, 2 min 30 sec; (5) 68°C, 10 min; (6) 8°C, hold. 45 cycles of steps 2-4 were carried out and the PCR products purified using QIAquick PCR Purification Kit (Qiagen Ltd, Boundary Court, Gatwick Road, Crawley, West Wales, RHlO 9AX, UK) according to the manufacturers instructions. The purified PCR products were examined on agarose gels. cDNA fragments were then cloned into the pET30 Ek/LIC vector (Novagen), transformed into Nova Blue chemically competent E.
  • QIAquick PCR Purification Kit QIAquick PCR Purification Kit
  • coli cells and plated on to a pre-warmed kanamycin (+) selection plate. After an overnight incubation at 37 0 C, a kanamycin-resistant colony was selected and grown up in kanamycin-containing LB medium (30 ⁇ g/ml). A glycerol stock was produced from the culture, the remains of which were used to purify plasmid DNA using Qiagen's Plasmid Mini Kit. Confirmation of the presence and correct " sequence and orientation of the inserts was determined by PCR and sequencing of the construct. Purified plasmid DNA was transformed into chemically competent BL21 Star (DE3) One Shot E. coli cells.
  • the inclusion bodies were purified and truncated ILVl 352 solubilised and re- folded as follows:
  • the glycerol stock produced from BL21 cells containing truncated ILV1352 in pET30 Ek/LIC, was used to inoculate 10 ml LB, 30 ⁇ g/ml kanamycin broth. The broth was incubated overnight at 37 0 C, with shaking at 220 rpm. The culture was added to 90 ml LB kanamycin broth and incubated at 37 0 C, until the OD 600 had reached between 0.4-1.0 (approximately 1.5 hr).
  • IPTG (0.1 mM) was added to the culture which was incubated at 3O 0 C for 5 hr. Cells were harvested by centrifugation at 8,500 rpm for 10 min. Pellets were resuspended in Bugbuster Master Mix (5 ml per 100 ml culture) and incubated at room temperature for 20 min with shaking.
  • the cell suspension was centrifuged at 11,000 rpm for 20 min at 4 0 C. After removal of the supernatant, the pellet was resuspended in 5 ml Bugbuster Master Mix. Six volumes (30 ml) 1:10 Bugbuster Protein Extraction Reagent was added to the cell suspension and inclusion bodies were collected by centrifugation at 6,000 rpm for 15 min at 4 0 C.
  • inclusion bodies were resuspended in 50 ml 1:10 Bugbuster reagent.
  • the cell suspension was centrifuged at 6,000 rpm for 15 min at 4 0 C. This wash step was repeated two further times, but in the final step centrifugation speed was increased to 1 l,000rpm.
  • the final pellet of purified inclusion bodies was resuspended in a 0.1 culture volume (10 ml) of Ix IB Wash Buffer (Novagen). Inclusion bodies were collected by centrifugation at 8,500 rpm for 10 min. The pellet was resuspended in 0.1 culture volume of Ix IB Wash Buffer and inclusion bodies were collected by centrifugation at 8,500 rpm for 10 min.
  • Recombinant proteins can be assayed using an assay type specific for the particular protein. For example:
  • Endonucleases can be asayed by incubating the protein with DNA, such as Lamdba or pBR322, and observing whether the DNA is cleaved by running on an agrose gel.
  • DNA such as Lamdba or pBR322
  • Exonucleases can be assayed by incubating the protein with fluorescently or radio-labelled DNA, such as Lamdba or pBR322, and observing whether the fluorescent or labelled nucleotides are released.
  • Exoribonucleases can be assayed by incubating the protein with fluorescent or radiolabeled RNA and observing whether fluorescent or labeled ribonucleotides are released.
  • GPCRs G-protein coupled receptors
  • GPCR membrane fractions derivatised with FlashBlueTM beads (Perkin Elmer) and measuring emitted light.
  • GPCR membrane fractions are prepared from cells expressing, or over-expressing the GPCR of interest.
  • ILV3/ILV3/dihydroxyacid dehydratases can be assayed by measuring the formation of the keto acid reaction products, either directly, at 313 nm, or by derivatising the ketone with 2,4-dinitrophenyl hydrazine and measuring at 530 nm.
  • Kinases can be assayed by incubating the kinase with [ 32 P]-ATP and substrate, and measuring the incorporation of 32 P-label into the substrate.
  • Suitable substrates may include myelin basic protein, glycogen synthase and enolase.
  • fluorescence quencing technology such as QTL Lightspeed TM kinase assays (QTL biosystems, Reigate, Surrey) can be used.
  • Phosphatases can be assayed by incubating the protein with [ 32 P]-ATP-labelled substrate and measuring the release of 32 P-label.
  • Suitable substrates may include myelin basic protein, glycogen synthase and enolase.
  • phosphatase assays exploiting fluorescence quenching technology, such as IQ phosphatase assays (Pierce, Cramlington, Northumberland) can be used.
  • Phosphatididylinositol-specific phospholipase Cs can be assayed by using the chromogenic substrate 5 -bromo-4-chloro-3-indoxyl-myoinositol-l -phosphate or the fluorogenic substrate 4-methylumbelliferyl-myo-inositol-l -phosphate (Restaino et al., 1999, J. FoodProt. 62, 244-251 ; Reissbrodt, 2004, Int. J. Food Microbiol. 15, 1-
  • Phosphodiesterases such as 3 '5' cyclic nucleotide phosphodiesterases can be assayed as described by Wera et al. (FEBS Lett. 1997, 420, 147-150) by following the time-dependent degradation of cAMP. Samples and controls are incubated in 50 niM Tris-HCl (pH 8), 0.1 mMEDTA, and 500 mM cAMP at 30°C. The reaction is stopped by heating, and cAMP is measured using the cAMP [ 3 H] assay system (Amersham, Arlington Heights, IL).
  • Protein tyrosine phosphatases can be assayed using substrate protein (such as myelin basic protein) where the tyrosines have been labelled with [ 32 P], and measuring released label after incubation with the enzyme.
  • substrate protein such as myelin basic protein
  • the nonradioactive ProFluorTM assay kit Promega
  • assays are modified for the identification of an inhibitor by including a candidate substance in the incubation and measuring the extent to which the enzyme activity is inhibited.
  • inhibitors can be identified using a generic genetic screen. Heterozygous knock-out mutants are generated, for instance as described in Example 2. In most this should result in less gene product being made by the heterozygote than the wild type diploid. If the gene is essential for growth then the heterozygote should be more sensitive to a compound that targets the product of that gene. This phenomenon is called haploinsufficiency and has been demonstrated in yeast (Genomic profiling of drug sensitivities via induced haploinsufficiency. Giaever G, Shoemaker DD, Jones TW, Liang H,
  • the primary screen for genes of unknown function involves monitoring the growth of the heterozygous mutant versus the growth of the wild type diploid strain of Aspergillus fumigatus, in the presence and absence of a panel of compounds. Spore suspensions of these strains are set up in RPMI 1640 medium in 96-well plates. 1x10 4 cfu/ml is the inoculum used. Potential inhibitors are added to give a final concentration of 32 ⁇ g/ml. The plates are then incubated at 37 0 C for 48h. The OD485 of the cultures is then measured using a plate reading spectrophotometer.
  • the Minimal Inhibitory Concentration (MIC) for the compound in each strain is determined as follows: The heterozygote mutant and the wild type diploid are incubated in the presence of a range of concentrations of the chemical. The lowest concentration of chemical that prevents growth of the organism (the Minimal Inhibitory Concentration, MIC) is calculated for both strains. Doubling dilutions of the compound of interest are prepared in RPMI 1640 medium in 96-well plates starting at 50 ⁇ g/ml down to 0.1 ⁇ g/ml in duplicate. Each well is inoculated with either wild type or mutant Aspergillus fumigatus and the plate incubated at 37 0 C for 24/48h prior to measuring the OD485.
  • An inhibitor of the product of the gene of unknown function will have a lower MIC in the mutant strain than in the wild type strain, i.e., a 2-fold or more difference in MIC between the 2 strains.
  • This anti-fungal compound can then be used as the basis for chemistry approaches to improve the specificity, potency and other properties of the compound.
  • ILV3 Assay The assay for ILV34 is based upon the ability of this enzyme to dehydrate dihydroxyacid substrates to a keto acid.
  • the natural substrates are 2,3- dihydroxy-3- methylbutyrate and 2,3- dihydroxy-3-ethylbutyrate; an alternative substrate which is commercially available is L-threonic acid.
  • the appearance of the keto acid product can be monitored directly at 240 run; alternatively it can be reacted with semicarbazide and sodium acetate and monitored at 250 nm.
  • the semicarbazide/sodium acetate effectively stops the enzymatic reaction and develops it giving an increased absorbance, which is stable for at least 24 hours (Kanamori and Wixom, 1963, J. Biol. Chem. 238:998-1005; Kiritani and Wagner, 1970, Meth. Enzymol. 17:755-764; Limberg et al., 1995, Bioorg. Med. Chem. 3:487-494).
  • Assays were carried out in 96- or 384-well plates. To each well of a 384- well plate was added 0-8000 ng recombinant truncated ILV34 and 25 ⁇ l 0-5OmM threonate (dissolved in 50 raM Tris-HCl, 10 mM MgCl 2 , pH8.0), and the volume made up to 50 ⁇ l with 50 mM Tris-HCl, 10 mM MgCl 2 (pH8.0). Samples were incubated at room temperature and at suitable intervals the reaction was stopped and developed by the addition of 25 ⁇ l semicarbazide solution (1.26% w/v semicarbazide in 1.89% w/v sodium acetate solution). The samples were incubated for 15 mins after the final semicarbazide/sodium acetate addition and then read at 250nm.
  • ILV34 had a Km of approximately 10 mM for threonate, and was most active at pH 8.0. Magnesium ion concentration had no effect on ILV34 activity in the range 50 ⁇ M-10 mM.
  • Screens for inhibitors of ILV34 were based on the assay described above.
  • the screen described is for a 384 format but the protocol can be adapted to run 1536 or other formats as required.
  • L-threonic acid (hemicalcium salt [Aldrich 380644-5G]; 2OmM in 62.5 mM Tris-HCl, 12 mM MgCl 2 pH ⁇ .O) was prepared prior to use on the day of the screen.
  • the solution was sonicated at room temperature until clear, a glass rod was used to crush material which was slow to dissolve.
  • the final concentration of L-threonic acid in the assay wells was 8mM.
  • the stop/signal amplification reagent (semicarbazide HCl [Aldrich S220-1]; sodium acetate, anhydrous [BDH 301045M]); 1.26% w/v semicarbazide, 1.89% w/v sodium acetate in deionised water) was also prepared prior to use on the day of the screen.
  • Recombinant ILV34 enzyme prepared as described above was made up in 62.5 mM Tris-HCl, 12 mM MgCl 2 buffer (pH8.0). The final buffer concentration in the assay was 50 mM Tris-HCl, 9.6 mM MgCl 2 buffer (pH ⁇ .O).
  • Assays were carried out using Tecan Freedom, Tecan TeMo and PerkinElmer Minitrak robots together with a ThermoLabsystems multidrop 384 and a Tecan Safire automated plate reader.
  • 20 ⁇ l of enzyme typically around 2 ⁇ g/well, depending on specific activity of the batch
  • 20 ⁇ l L-threonic acid solution were added to wells of the microtitre plates containing test compounds.
  • 20 ⁇ l of 62.5 mM Tris-HCl, 12 mM MgCl 2 buffer (pH8.0) was used for a duplicate set of plates (i.e. for background no- enzyme controls); DMSO (diluted in the same way as solubilised compound stocks) was used for no-compound controls. Plates were incubated at room temperature for 40 minutes after which 25 ⁇ l of stop/amplification reagent was added. After 15 minutes at room temperature plates were read at 250 nm and data processed using Excel spreadsheets to convert raw data into percent inhibition data.
  • Secondary screens can be carried out to measure dose response data for selected compounds, using essentially the same protocol as the pimary screen.
  • the secondary screen uses the Excelf ⁇ t version 3 software (IDBS), with sigmoidal model 606, to plot appropriate inhibition values and determine IC50 data for compounds.
  • IDBS Excelf ⁇ t version 3 software
  • ILVl 352 can be assayed using a similar assay to that employed for ILV34, and ILV1352 inhibitors are identified in a similar way to ILV34 inhibitors.
  • Compounds identified as inhibitors from the ILV34 assay can be tested in a similar assay using recombinant ILV 1352 (or vice versa) and compounds showing inhibition in both assays are candidates for antifungal agents.
  • compounds showing inhibition of one of the ILV3 proteins may be ILV3 inhibitors.
  • Example 8 Production of an antibody Recombinant protein may be used as an immunogen, (as described in Example 6). Alternatively, synthetic proteins or polypeptides encoding regions either unique to the individual proteins, or likely to provide cross-reactivity within a set of homologs are used. Peptides may need to be conjugated to carrier proteins before immunization. Preimmune sera from animals to be immunised are screened against the immunogen to ensure that there is no endogenous cross reactivity. Animals (typically sheep, rabbits or mice) are then immunised. For polyclonal antibody production, the resulting sera is affinity purified using the immunogen cross-linked to a chromatography matrix. Alternatively, purification of the antibody fraction from the serum, e.g. using protein G or protein A cross-linked to a matrix, may be sufficient. Monoclonal antibody production proceeds by methods familiar to those skilled in the art.
  • the specificities of the resulting polyclonal and/or monoclonal antibodies are checked by ELISA and/or western blotting using the immunogen, related constructs or whole cell lysates and extracts as targets.
  • Negative controls such as paralogous proteins, different constructs or different species are also employed to test specificity and/or to determine the range of species and/or genus cross-reactivity.

Abstract

Method of identifying an anti-fungal agent which targets as an essential protein or gene of a fungus comprising contacting a candidate substance with (i) a protein which comprises the sequence shown by SEQ ID NOS: 3, 6, 9, 12, 15, 18, 21, 24, 27, 30, 33, 36, 39, 42, 45, 48, 50, 53, 56, 59, 61 or 63, or (ii) a protein which has 60% identity with (i), or (iii) a protein comprising a fragment of (i) or (ii) which fragment has a length of at least 50 amino acids, or (iv) a polynucleotide that comprises a sequence which encodes (i), (ii) or (iii), or (v) a polynucleotide comprising a sequence which has at least 70% identity with the coding sequence of (iv), and determining whether the candidate substance binds or modulates (i), (ii), (iii), (iv), or (v), wherein binding or modulation of (i), (ii), (iii), (iv), or (v) indicates that the candidate substance is an anti-fungal agent.

Description

FUNGAL SIGNALLING AND METABOLIC ENZYMES
Field of the invention
The present invention relates to a method of screening for an anti-fungal agent and to fungal genes involved in signalling and metabolism.
Backeround of the invention
Invasive fungal infections are well recognised as diseases of the immunocompromised host. Over the last twenty years there have been significant rises in the number of recorded instances of fungal infection (Groll et al., 1996, J Infect 33, 23-32). hi part this is due to increased awareness and improved diagnosis of fungal infection. However, the primary cause of this increased incidence is the vast rise in the number of susceptible individuals. This is due to a number of factors including new and aggressive immunosuppressive therapies, increased survival in intensive care, increased numbers of transplant procedures and the greater use of antibiotics worldwide.
In certain patient groups, fungal infection occurs at high frequency; lung transplant recipients have a frequency of up to 20% colonisation and infection with a fungal organism and fungal infection in allogenic hoemopoetic stem transplant recipients is as high as 15% (Ribaud et al., 1999, Clin Infect Dis. 28:322-30).
Currently only four classes of antifungal drug are available to treat systemic fungal infections. These are the polyenes (e.g., amphotericin B), the azoles (e.g., ketoconazole or itraconazole) the echinocandins (e.g., caspofungin) and flucytosine. The polyenes are the oldest class of antifungal agent being first introduced in the 1950's. The exact mode of action remains unclear but polyenes are only effective against organisms that contain sterols in their outer membranes. It has been proposed that amphotericin B interacts with membrane sterols to produce pores allowing leakage of cytoplasmic components and subsequent cell death.
Azoles function by the inhibition of 14α-demethylase via a cytochrome P450- dependent mechanism. This leads to a depletion of the membrane sterol ergosterol and the accumulation of sterol precursors resulting in a plasma membrane with altered fluidity and structure. Echinocandins work by inhibiting the cell wall synthesis enzyme β-glucan synthase, leading to abnormal cell wall formation, osmotic sensitivity and cell lysis.
Flucytosine is a pyrimidine analogue interfering with cellular pyrimidine metabolism as well DNA, RNA and protein synthesis. However widespread resistance to flucyotosine limits its therapeutic use.
It can be seen that, to date, the currently available antifungal agents act primarily against only two cellular targets; membrane sterols (ployenes and azoles) and β-glucan synthase (echinocandins).
Resistance to both azoles and polyenes has been widely reported leaving only the recently introduced echinocandins to combat invasive fungal infections. As the use of echinocandins increases, resistance by fungi will inevitably occur.
The identification of new classes of anti-fungal agent with novel modes of action is required to ensure positive therapeutic outcomes for patients in the future. Novel fungal-specific genes are likely to present the best opportunity for the development of effective novel anti-fungal agents. In particular it is highly desirable that target genes are present in a range of fungi, but absent from humans, and fungal- specific genes involved in metabolism and signalling would be valuable candidates. The inventors have exploited the availability of fungal and mammalian genomes to identify such genes which are thus suitable as targets for the development of anti- fungal drugs.
Summary of the invention
The inventors have found a set of twelve genes which are present in fungi but not humans. This finding allows the identification of anti-fungal agents based on their ability to target these genes.
The invention provides a set of twelve proteins which can be used to screen for anti-fungal agents. In particular a set of twelve proteins from Aspergillus fumigatus (see Table I) is provided.
The inventors have found two Aspergillus fumigatus genes which resemble the single S. cerevisiae ILV3 gene. ILV3 is essential in S. cerevisiae for the biosynthesis of the branched amino acids leucine, iso leucine and valine, but this enzyme is absent from animals, making it a good target for an antifungal. This gene has not been used before as a target for the discovery of an antifungal agent, nor have recombinant ILV3 proteins been synthesised. Surprisingly the inventors have found that two A. fumigatus ILV3-like genes have to be knocked out to render the organism inviable.
The invention therefore provides ILV3-like genes of fungi (see Tables I and II) which can be used either individually or together (as pairs) to screen for antifungal agents.
Accordingly the invention provides the following:
- a method of identifying an anti-fungal agent which targets a protein or gene of a fungus comprising contacting a candidate substance with
(i) a protein which comprises the sequence shown by SEQ ID NOs: 3, 6, 9, 12, 15, 18, 21, 24, 27, 30, 33 or 36
(ii) a protein which has 60% identity with (i),
(iii) a protein comprising a fragment of (i) or (ii) which fragment has a length of at least 50 amino acids,
(iv) a polynucleotide that comprises sequence which encodes (i), (ii) or (iii), (v) a polynucleotide comprising sequence which has at least 70% identity with the coding sequence of (iv), and determining whether the candidate substance binds or modulates (i), (ii), (iii), (iv) or (v), wherein binding or modulation of (i), (ii), (iii), (iv) or (v) indicates that the candidate substance is an anti-fungal agent, - use of (i), (ii), (iii), (iv) or (v) as defined above to identify or obtain an antifungal agent,
- a method of identifying an anti-fungal agent which targets ILV3 genes of fungi comprising contacting a candidate substance with
(i) a protein which comprises the sequence shown by SEQ ID NOs: 12, 21, 39, 42, 45, 48, 50, 53, 56, 59, 61 or 63
(ii) a protein which has at least 60% identity with (i), (iii) a protein comprising a fragment of (i) or (ii) which fragment has a length of at least 50 amino acids, and determining whether the candidate substance binds or modulates (i), (ii) or (iii), wherein binding or modulation of (i), (ii) or (iii) indicates that the candidate substance is an anti-fungal agent, use of (i), (ii), or (iii) as defined above to identify or obtain an anti-fungal agent,
- a method of identifying an anti-fungal agent which targets ILV3 genes of fungi comprising contacting a candidate substance with
(i) a protein which comprises the sequence shown by SEQ ED NOs: 21, 42, 45, 53 or 56
(ii) a protein which has at least 60% identity with (i),(iϋ) a protein comprising a fragment of (i) or (ii) which fragment has a length of at least 50 amino acids, contacting the same substance with
(iv) a protein which comprises the sequence shown by SEQ ID NOs: 12, 39, 48, 50 or 59
(v) a protein which has at least 60% identity with (iv), (vi) a protein comprising a fragment of (i) or (ii) which fragment has a length of at least 50 amino acids, and determining whether the candidate substance binds or modulates (i), (ii) or (iii), and (iv), (v) or (vi), wherein binding or modulation of (i), (ii) or (iii), and (iv), (v) or (vi) indicates that the candidate substance is an anti-fungal agent,
- the above method wherein the first screen in carried out with SEQ ED NOs: 12, 39, 48, 50 or 59 and the second screen with SEQ ED NOs: 21, 42, 45, 53 or 56.
- use of (i), (ii), or (iii), and (iv), (v) or (vi) as defined above to identify or obtain an anti-fungal agent, - use of an anti-fungal agent identified by the method of the invention in the manufacture of a medicament for prevention or treatment of fungal infection,
- an isolated protein or polynucleotide of the invention,
- an organism which is transgenic for a polynucleotide of the invention, - an organism which has been genetically engineered to render a polynucleotide or protein of the invention non-functional or inhibited,
- an antibody which is specific for a protein of the invention, - a method for preventing or treating a fungal infection comprising administering an anti-fungal agent identified by the screening method of the invention, and
- a fungus which has been killed, or whose growth has been impaired, by inhibition of the expression or activity of a protein or polynucleotide of the invention.
Table I. A. fumieatus sequences claimed and their relationship to sequences given in the sequence listing
^^Numbers after SEQ ED Nos. correspond to bases of genomic DNA encoding the protein in cases where introns are present.
(2*RNA sequences are given in the sequence listing with Thymidine (T), although it is understood that in vivo Uridine (U) would be present.
'Groups I sequences cluster with A. fumigatus sequence SEQ ID No. 21; group II sequences cluster with A. fumigatus SEQ ID No. 12.
2 AN4058 sequence differs from that deposited in the publicly available database and was repredicted from genomic DNA based on alignment with other ILV genes. 3OnIy the C-terminal sequence of this gene could be predicted.
Detailed description of the invention
As mentioned above the invention relates to use of particular proteins and polynucleotide sequences (termed "proteins of the invention" and "polynucleotides of the invention" herein), including homologues and/or fragments of the fungal proteins and polynucleotides, to identify anti-fungal agents.
A protein or polynucleotide of the invention may be defined by similarity in sequence to a another member of the family. This similarity may be based on percentage identity (for example to the sequences shown in the sequence listing).
A protein or polynucleotide of the invention may be an ILV3 or ILVD protein, defined as a dihydroxy acid dehydratase, or as a protein which shows homology to SEQ E) No. 21, or as a protein which matches the ILVD_EDD Pfam profile.
The protein or polynucleotide of the invention may align with other proteins or polynucleotides of the invention (as shown in SEQ ID Nos. 1-63).
The protein or polynucleotide of the invention may be in isolated form (such as non-cellular form), or, in the case of membrane-associated proteins, as a membrane preparation, for example when used in the method of the invention. The polynucleotide may comprise native, synthetic or recombinant polynucleotide, and the protein may comprise native, synthetic or recombinant protein. The polynucleotide or protein may comprise combinations of native, synthetic or recombinant polynucleotide or protein, respectively. The polynucleotides and proteins of the invention may have a sequence which is the same as, or different from, naturally occurring polynucleotides and proteins.
It is to be understood that the term "isolated from" may be read as "of herein. Therefore references to polynucleotides and proteins being "isolated from" a particular organism include polynucleotides and proteins which were prepared by means other than obtaining them from the organism, such as synthetically or recombinantly.
Preferably, the polynucleotide or protein, is isolated from a fungus, more preferably a filamentous fungus, even more preferably an Ascomycete. Preferably, the polynucleotide or protein, is isolated from an organism selected from Aspergillus; Blumeria; Candida; Colletotrichium; Cryptococcus; Encephalitozoon; Fusarium; Histoplasma, Leptosphaeria; Magnaporthe; Mycosphaerella; Neurospora; Phytophthora; Plasmopara; Pneumocystis; Pyricularia; Pythium; Puccinia; Rhizoctonia; Saccharomyces, Schizosaccharomyces, Trichophyton; and Ustilago.
Preferably, the polynucleotide or protein, is isolated from Aspergillus. Preferably, the polynucleotide or protein, is isolated from an organism selected from the species Aspergillus flavus; Aspergillus fumigatus; Aspergillus nidulans;
Aspergillus niger; Aspergillus parasiticus; Aspergillus terreus; Blumeria graminis; Candida albicans; Candida cruzei; Candida glabrata; Candida parapsilosis; Candida tropicalis; Colletotrichium trifolii; Cryptococcus neoformans; Encephalitozoon cuniculi; Fusarium graminarium; Fusarium solani; Fusarium sporotrichoides; Histoplasma capsulata; Leptosphaeria nodorum; Magnaporthe grisea; Mycosphaerella graminicola; Neurospora crassa; Phytophthora capsici; Phytophthora infestans; Plasmopara viticola; Pneumocystis jiroveci; Puccinia coronata; Puccinia graminis; Pyricularia oryzae; Pythium ultimum; Rhizoctonia solani; Saccharomyces cerevisiae; Schizosaccharomyces pombe; Trichophyton inter digitale; Trichophyton rubrum; and Ustilago maydis.
Preferably, the polynucleotide or protein, is isolated from Aspergillus fumigatus, preferably the protein, may be isolated from A. fumigatus AF293. Variants of the above mentioned polynucleotides and proteins are also provided, and are discussed below. In one embodiment, the protein of the invention may comprise an amino acid sequence substantially as set out and independently selected from any of SEQ ID Nos: 3, 6, 9, 12, 15, 18, 21, 24, 27, 30, 33, 36, 39, 42, 45, 48, 50, 53, 56, 59, 61, 63 or variants thereof.
The polynucleotide of the invention may comprise DNA, such as genomic DNA. The polynucleotide may comprise a sequence substantially as set out and independently selected from any of SEQ ID Nos. 1, 4, 7, 10, 13, 16, 19, 22, 25, 28, 31, 34, 37, 40, 43, 46, 49, 51, 54, 57, 60, 62 or complements, or variants thereof.
The polynucleotide may comprise RNA, preferably mRNA, preferably spliced mRNA. Preferably, the polynucleotide comprises substantially the sequence shown as SEQ ID Nos 2, 5, 8, 11, 14, 17, 20, 23, 26, 29, 32, 35, 38, 41, 44, 47, 49, 52, 55, 58, 60, 62, or a complement, or a variant thereof. '
Preferably, the protein is encoded by the regions of sequences SEQ ID Nos. 1, 4, 7, 10, 13, 16, 19, 22, 25, 28, 31, 34, 37, 40, 43, 46, 49, 51, 54, 57, 60 or 62 as described in the column "gDNA" in Tables I or II, or a complement, or a variant thereof.
Preferably, the isolated polynucleotide comprises substantially a nucleotide sequence independently selected from the regions and sequences given in the column "gDNA" in Tables I or II.
Preferably, the protein is encoded by a polynucleotide which polynucleotide comprises substantially a sequence independently selected from at least one of the the regions and sequences given in the column "gDNA" in Tables I or II, or a complement or, a variant thereof. Preferably, the polynucleotide encodes a protein which comprises substantially the amino acid sequences SEQ ID Nos: 1, 4, 7, 10, 13, 16, 19, 22, 25, 28, 31, 34, 39, 42, 45, 48, 50, 53, 56, 59, 61, 63 or a variant thereof.
By the term "native amino acid/polynucleotide/protein", is meant an amino acid, polynucleotide or protein produced naturally from biological sources either in vivo or in vitro.
By the term "synthetic amino acid/polynucleotide/protein", is meant an amino acid, polynucleotide or protein which has been produced artificially or de novo using a DNA or protein synthesis machine known in the art.
By the term "recombinant amino acid/polynucleotide /protein", is meant an amino acid, polynucleotide or protein which has been produced using recombinant DNA or protein technology or methodologies which are known to the skilled technician.
The term "variant", and the terms "substantially the amino acid/polynucleotide/protein sequence" are used herein to refer to related sequences. As discussed below such related sequences are typically homologous to (share percentage identity with) a given sequence, for example over the entire length of the sequence or over a portion of a given length. The related sequence may also be a fragment of the sequence or of a homologous sequence. A variant protein may be encoded by a variant polynucleotide.
By the term "variant", and the terms "substantially the amino acid/polynucleotide/protein sequence", we mean that the sequence has at least 30%, preferably 40%, more preferably 50%, and even more preferably, 60% sequence identity with the amino acid/polynucleotide/protein sequences of any one of the sequences referred to. A sequence which is "substantially the amino acid/polynucleotide/peptide sequence" may be the same as the relevant sequence.
Calculation of percentage identities between different amino acid/polynucleotide/protein sequences may be carried out as follows. A multiple alignment is first generated by the ClustalX program (pairwise parameters: gap opeining 10.0, gap extension 0.1, protein matrix Gonnet 250, DNA matrix IUB; multiple parameters: gap opening 10.0, gap extension 0.2, delay divergent sequences 30%, DNA transition weight 0.5, negative matrix off, protein matrix gonnet series, DNA weight IUB; Protein gap parameters, residue-specific penalties on, hydrophilic penalties on, hydrophilic residues GPSNDQERK, gap separation distance 4, end gap separation off). The percentage identity is then calcluated from the multiple alignment as (N/T)*100, where N is the number of positions at which the two sequences share an identical residue, and T is the total number of positions compared. Alternatively, percentage identity can be calculated as (N/S)* 100 where S is the length of the shorter sequence being compared. The amino acid/polynucleotide/protein seqences may be synthesised de novo, or may be native amino acid/polynucleotide/protein sequence, or a derivative thereof.
An amino acid/polynucleotide/protein sequence with a greater identity than 65% to any of the sequences referred to is also envisaged. An amino acid/polynucleotide/protein sequence with a greater identity than 70% to any of the sequences referred to is also envisaged. An amino acid/polynucleotide/protein sequence with a greater identity than 75% to any of the sequences referred to is also envisaged. An amino acid/polynucleotide/protein sequence with a greater identity than 80% to any of the sequences referred to is also envisaged. Preferably, the amino acid/polynucleotide/protein sequence has 85% identity with any of the sequences referred to, more preferably 90% identity, even more preferably 92% identity, even more preferably 95% identity, even more preferably 97% identity, even more preferably 98% identity and, most preferably, 99% identity with any of the referred to sequences.
The above mentioned percentage identities may be measured over the entire length of the original sequence or over a region of 15, 20, 50 or 100 amino acids/bases of the original sequence. In a preferred embodiment percentage identity is measured with reference to SEQ ED Nos. 3, 6, 9, 12, 15, 18, 21, 24, 27, 30, 33, 36, 39, 42, 45, 48, 50, 53, 56, 59 61 or 63. Preferably the variant protein has at least 40% identity, such as at least 60% or at least 80% identity with SEQ ID Nos. 3, 6, 9, 12, 15, 18, 21, 24, 27, 30, 33, 36, 39, 42, 45, 48, 50, 53, 56, 59, 61 or 63 or a portion of one of these.
Alternatively, a substantially similar nucleotide sequence will be encoded by a sequence which hybridizes to the sequences shown in SEQ ID Nos. 1, 2, 4, 5, 7, 8, 10, 11, 13, 14, 16, 17, 19, 20, 22, 23, 25, 26, 28, 29, 31, 32, 34, 35, 37, 38, 40, 41, 43, 44, 46, 47, 49, 51, 52, 54, 55, 57, 58, 60, 62, or their complements under stringent conditions. By stringent conditions, we mean the nucleotide hybridises to filter- bound DNA or RNA in 6x sodium chloride/sodium citrate (SSC) at approxmiately 450C followed by at least one wash in 0.2x SSC/0.1% SDS at approximately 5-65°C. Alternatively, a substantially similar protein may differ by at least 1, but less than 5, 10, 20, 50 or 100 amino acids from the sequences shown in SEQ ID Nos. 3, 6, 9, 12, 15, 18, 21, 24, 27, 30, 33,. 36, 39, 42, 45, 48, 50, 53, 56, 59, 61 or 63. Such differences may each be additions, deletions or substitutions.
Due to the degeneracy of the genetic code, it is clear that any nucleic acid sequence could be varied or changed without substantially affecting the sequence of the protein encoded thereby, to provide a functional variant thereof. Suitable nucleotide variants are those having a sequence altered by the substitution of different codons that encode the same amino acid within the sequence, thus producing a silent change.
Other suitable variants are those having homologous nucleotide sequences but comprising all, or portions of, sequence which are altered by the substitution of different codons that encode an amino acid with a side chain of similar biophysical properties to the amino acid it substitutes, to produce a conservative change. For example small non-polar, hydrophobic amino acids include glycine, alanine, leucine, isoleucine, valine, proline, and methionine. Large non-polar, hydrophobic amino acids include phenylalanine, tryptophan and tyrosine. The polar neutral amino acids include serine, threonine, cysteine, asparagine and glutamine. The positively charged (basic) amino acids include lysine, arginine and histidine. The negatively charged (acidic) amino acids include aspartic acid and glutamic acid. Certain organisms, including Candida are known to use non-standard codons compared to those used in the majority of eukaryotes. Any comparisons of polynucleotides and proteins from such organisms with the sequences given here should take these differences into account.
In accurate alignment of protein or DNA sequences the trade-off between optimal matching of sequences and the introduction of gaps to obtain such a match is important, hi the case of proteins, the means by which matches are scored is also of significance. The family of PAM matrices (e.g., Dayhoff, M. et al., 1978, Atlas of protein sequence and structure, Natl. Biomed. Res. Found.) and BLOSUM matrices quantitate the nature and likelihood of conservative substitutions and are used in multiple alignment algorithms, although other, equally applicable matrices will be known to those skilled in the art. The popular multiple alignment program ClustalW, and its windows version ClustalX (Thompson et al., 1994, Nucleic Acids Research, 22, 4673-4680; Thompson et al., 1997, Nucleic Acids Research, 24, 4876-4882) are efficient ways to generate multiple alignments of proteins and DNA. Use of the Align program is also preferred (Hepperle, D., 2001 : Multicolor
Sequence Alignment Editor. Institute of Freshwater Ecology and Inland Fisheries, 16775 Stechlin, Germany), although others, such as JalView or Cinema are also suitable.
Calculation of percentage identities between proteins occurs during the generation of multiple alignments by Clustal. However, these values need to be recalculated if the alignment has been manually improved, or for the deliberate comparison of two sequences. Programs that calculate this value for pairs of protein sequences within an alignment include PROTDIST within the PHYLIP phylogeny package (Felsenstein; http://evolution.gs.washington.edu/phylip.html) using the "Similarity Table" option as the model for amino acid substitution (P). For
DNA/RNA, an identical option exists within the DNADIST program of PHYLIP. Other modifications in protein sequences are also envisaged and within the scope of the claimed invention, i.e. those which occur during or after translation, e.g. by acetylation, amidation, carboxylation, GPI-linkage, myristoylation, phosphorylation, proteolytic cleavage or linkage to a ligand.
The term "variant", and the terms "substantially the amino acid/polynucleotide/protein sequence" also include a fragment of the relevant polynucleotide or protein sequences, including a fragment of the homologous sequences (which have percentage identity to a specified sequence) referred to above. A polynucleotide fragment will typically comprise at least 10 bases, such as at least 20, 30, 50, 100, 200, 500 or 1000 bases. A protein fragment will typically comprise at least 10 amino acids, such as at least 20, 30, 50, 80, 100, 150, 200, 300, 400 or 500 amino acids. The fragments may lack at least 3 amino acids, such as at least 10, 20 or 30 amino acids of the amino acids from either end of the protein.
The invention provides methods of screening which may be used to identify modulators of the proteins or polynucleotides of the invention, such as inhibitors of expression or activity of the proteins or polynucleotides of the invention. In one embodiment of the method a candidate substance is contacted with a protein or polynucleotide of the invention and whether or not the candidate substance binds or modulates the protein or polynucleotide is determined.
The modulator may promote (agonise) or inhibit (antagonise) the activity of the protein. A therapeutic modulator (against fungal infection) will inhibit the expression or activity of protein or polynucleotide of the invention.
The method may be carried out in vitro (inside or outside a cell) or in vivo. The method may be carried out on a cell, or cell culture extract, or cell extract or cell- membrane fraction. The cell may or may not be a cell in which the polynucleotide or protein is naturally present. The cell may or may not be a fungal cell, or may or may not be a cell of any of the fungi mentioned herein. The protein or polynucleotide may be present in a non-cellular form in the method, thus the protein may be in the form of a recombinant protein purified from a cell.
Any suitable binding or activity assay may be used. Methods which determine whether a candidate substance is able to bind the protein or polynucleotide may comprise providing the protein or polynucleotide to a candidate substance and determining whether binding occurs, for example by measuring the amount of the candidate substance which binds the protein or polynucleotide. The binding may be determined by measuring a characteristic of the protein or polynucleotide that changes upon binding, such as spectroscopic changes. The binding may be determined by measuring reaction substrate or product levels in the presence and absence of the candidate and comparing the levels. The assay format may be a 'band shift' system. This involves determining whether a test candidate advances or retards the protein or polynucleotide on gel electrophoresis relative to the absence of the compound.
The method may be a competitive binding method. This determines whether the candidate is able to inhibit the binding of the protein or polynucleotide to an agent which is known to bind to the protein or polynucleotide, such as an antibody specific for the protein, or a substrate of the protein.
Whether or not a candidate substance modulates the activity of the protein may be determined by providing the candidate substance to the protein under conditions that permit activity of the protein, and determining whether the candidate substance is able to modulate the activity of the product.
The activity which is measured may be any of the activities of the proteins of the invention mentioned herein, including; endonuclease, exonuclease, exoribonuclease, G-protein coupled receptor, ILV3/dihydroxyacid dehydratase, kinase, phosphatase, phosphatididylinositol-specific phospholipase C, phosphodiesetrase, protein tyrosine phosphatase, ion transport or small molecule transport/permease activities. In one embodiment the screening method comprising carrying out a reaction in the presence and absence of the candidate substance to determine whether the candidate substance inhibits the activity of the protein of the invention.
ILV3 activity can be measured as follows: An ILV3 protein is incubated with a substrate molecule such as dihydroxy valeric acid, dihydroxy methylvaleric acid, another dihydroxy acid, or a polyhydroxy acid (such as threonic acid or 2,3,4,5- tetrahydroxy pentanoic acid), and the appearance of a keto acid product measured either directly or indirectly. Direct measurement can be carried out by means of spectrophotometry, for example at 240 nm, whereas indirect measurement can be carried out by reacting the keto acid with semicarbazide and measuring the appearance of product by spectrophotometry, for exmaple at 250 nm, or by reacting the keto acid with 2,4-dinitrophenylhydrazine and measuring the reaction products by spectrophotometry at 540-550 nm. This assay may be used as a screen for inhibitors of filamentous fungal ILV3s by (a) adding to the assay putative inhibitor compounds and looking for a decrease in product, and (b) carrying out the assay firstly with a group I ILV3 (Table II) and then carrying out the assay with a group II ILV3 (or vice versa) and identifying compounds that inhibit in both assays. The assay can be carried out with recombinant A.fumigatus ILV34 and ILV1352 (Table II).
ILV3 inhibitors may also be identified by the above assay using a single ILV3 protein such as from any of the following species: organism selected from the species Aspergillus flavus; Aspergillus fumigatus; Aspergillus nidulans; Aspergillus niger; Aspergillus parasiticus; Aspergillus terreus; Blumeria graminis; Candida albicans; Candida cruzei; Candida glabrata; Candida parapsilosis; Candida tropicalis; Colletotrichium trifolii; Cryptococcus neoformans; Encephalitozoon cuniculi; Fusarium graminarium; Fusarium solani; Fusarium sporotrichoides; Histoplasma capsulata; Leptosphaeria nodorum; Magnaporthe grisea; Mycosphaerella graminicola; Neurospora crassa; Phytophthora capsici; Phytophthora infestans; Plasmopara viticola; Pneumocystis jiroveci; Puccinia coronata; Puccinia graminis; Pyricularia oryzae; Pythium ultimum; Rhizoctonia solani; Saccharomyces cerevisiae; Schizosaccharomyces pombe; Trichophyton inter digitale; Trichophyton rubrum; and Ustilago maydis.
In a further embodiment of the method, a candidate substance is contacted with a cell heterozygous for an underexpressed, mutated, disrupted or deleted copy or copies of the gene or genes, and the extent to which the candidate substance inhibits growth of the cell is determined by any suitable means and compared to the effects of the candidate substance on cells homozygous for unaltered copies of the gene. The heterozygous cell will show greater sensitivity to substances that inhibit the gene or its gene product.
Suitable candidate substances which can tested in the above methods include antibody products (for example, monoclonal and polyclonal antibodies, single chain antibodies, chimeric antibodies and CDR-grafted antibodies). Furthermore, combinatorial libraries, defined chemical identities, peptide and peptide mimetics, oligonucleotides and natural product libraries, such as display libraries (e.g. phage display libraries) may also be tested. The candidate substances may be chemical compounds. Batches of the candidate substances may be used in an initial screen of, for example, ten substances per reaction, and the substances from batches which show inhibition tested individually. According to a further aspect of the present invention, there is provided a polynucleotide or protein of the invention for use as a medicament or in diagnosis.
The polynucleotide or protein may be modified prior to use, preferably to produce a derivative or variant thereof. The polynucleotide or protein may be derivatised. The protein may be modified by epitope tagging, addition of fusion partners or purification tags such as glutathione S-transferase, multiple histidines or maltose binding protein, addition of green fluorescent protein, covalent attachment of molecules including biotin or fluorescent tags, incorporation of selenomethionine, inclusion or attachment of radioisotopes or fluorescent/non-fluorescent lanthanide chelates. The polynucleotide may be modified by methylation or attachment of digoxygenin (DIG) or by addition of sequence encoding the above tags, proteins or epitopes.
Preferably, the medicament is adapted to retard or prevent a fungal infection. The fungal infection may be in human, animal or plant. The polynucleotide or protein may be used for the development of a drug. The polynucleotide or protein may be used in, or for the generation of, a molecular model of said polynucleotide or said protein.
According to a further aspect of the present invention, there is provided use of a polynucleotide or protein of the invention for the preparation of a medicament for the treatment of a fungal infection.
The polynucleotide or protein may be modified prior to use, preferably to produce a derivative or variant thereof. The polynucleotide or protein may be derivatised. The polynucleotide or protein may not be modified or derivatised.
Preferably, the medicament is adapted to retard or prevent a fungal infection. The treatment may comprise retarding or preventing fungal infection. Preferably, the drug and/or medicament comprises an inhibitor. Preferably, the drug or medicament is adapted to inhibit expression and/or activity of the polynucleotide or a fragment thereof, and/or the function of the protein or a fragment thereof.
Preferably, the fungal infection comprises an infection by a fungus, more preferably an Ascomycete, and even more preferably, an organism selected from the genera Aspergillus; Blumeήa; Candida; Colletotrichium; Cryptococcus; Encephalitozoon; Fusarium; Histoplasma, Leptosphaeria; Magnaporthe; Mycosphaerella; Neurospora; Phytophthora; Plasmopara; Pneumocystis; Pyricularia; Pythium; Puccinia; Rhizoctonia, Trichophyton; and Ustilago.
Preferably, the fungal infection comprises an infection by an organism selected from the genera Aspergillus.
Preferably, the fungal infection comprises an infection by an organism selected from the species Aspergillus flavus; Aspergillus fumigatus; Aspergillus nidulans; Aspergillus niger; Aspergillus parasiticus; Aspergillus terreus; Blumeria graminis; Candida albicans; Candida cruzei; Candida glabrata; Candida parapsilosis; Candida tropicalis; Colletotrichium trifolii; Cryptococcus neoformans; Encephalitozoon cuniculi; Fusarium graminarium; Fusarium solani; Fusarium sporotrichoides; Histoplasma capsulata; Leptosphaeria nodorum; Magnaporthe grisea; Mycosphaerella graminicola; Neurospora crassa; Phytophthora capsici; Phytophthora infestans; Plasmopara viticola; Pneumocystis jiroveci; Puccinia coronata; Puccinia graminis; Pyricularia oryzae; Pythium ultimum; Rhizoctonia solani; Trichophyton inter digitale; Trichophyton rubrum; and Ustilago maydis. Preferably, the fungal infection comprises an infection by Aspergillus fumigatus.
According to a further aspect of the present invention, there is provided a recombinant DNA molecule or vector comprising a polynucleotide of the invention.
The recombinant DNA molecule or vector may comprise an expression cassette. Preferably, the recombinant DNA molecule or vector comprises an expression vector. Preferably, the polynucleotide sequence is operatively linked to an expression control sequence. A suitable control sequence may comprise a promoter, an enhancer etc.
According to another aspect of the present invention, there is provided a cell containing a polynucleotide, recombinant DNA molecule or vector of the invention.
The cell may be transformed or transfected with the polynucleotide, recombinant DNA molecule or vector by suitable means. Preferably, the cell produces a recombinant protein of the invention.
The invention also provides an organism which is transgenic for the polynucleotide of the invention (whose cells may be the same as the cells of the invention mentioned herein). Such an organism is typically a fungus, such as any genera or species of fungus mentioned herein. The organism may be a microorganism, such as a bacterium, virus or yeast. The organism may be a plant, or animal (including birds and mammals), such as any of the animals mentioned herein.
The organism may be produced by introduction of the polynucleotide of the invention into a cell of the organism, and in the case of a multicellular organism allowing the cell to grow into a whole organism.
According to a further aspect of the present invention, there is provided a cell in which a polynucleotide or protein of the invention is non-functional and/or inhibited. The cell may be of, or present in, a multicellular organism.
The cell may be a mutant cell. The cell is typically a fungal cell, such as of any genera or species of fungus mentioned herein. A preferred means of generating the cell is to modify the polynucleotide of the invention, such that the polynucleotide is non-functional. This modification may be to cause a mutation, which disrupts the expression or function of a gene product. Such mutations may be to the nucleic acid sequences that act as 5' or 3' regulatory sequences for the polynucleotide, or may be a mutation introduced into the coding sequence of the polynucleotide. Functional deletion of the polynucleotide may be, for example, by mutation of the polynucleotide in the form of nucleotide substitution, addition or, preferably, nucleotide deletion.
The polynucleotide may be made non- functional and/or inhibited by: (i) shifting the reading frame of the coding sequence of the polynucleotide;
(ii) adding, substituting or deleting amino acids in the protein encoded by the polynucleotide; or
(iii) partially or entirely deleting the DNA coding for the polynucleotide and/or the upstream and downstream regulatory sequences associated with the polynucleotide.
(iv) inserting DNA into the coding or non-coding regions.
A preferred means of introducing a mutation into a polynucleotide is to utilize molecular biology techniques specifically to target the polynucleotide which is to be mutated. Mutations may be induced using a DNA molecule. A most preferred means of introducing a mutation is to use a DNA molecule that has been especially prepared such that homologous recombination occurs between the target polynucleotide and the DNA molecule. When this is the case, the DNA molecule, which may be double stranded, may contain base sequences similar or identical to the target polynucleotide to allow the DNA molecule to hybridize to (and subsequently recombine with) the target.
In the case of ILV3 proteins the mutant cell may contain mutations of two different ILV3 genes, where the function of either or both gene products may be inhibited or abolished.
It is also possible to provide a cell in which the polynucleotide is nonfunctional and/or inhibited without introducing a mutation into the gene or its regulatory regions. This may be done by using specific inhibitors. Examples of such inhibitors include agents that prevent transcription of the polynucleotide, or prevent translation, expression or disrupt post-translational modification. Alternatively, the inhibitor may be an agent that increases degradation of the gene product (e.g. a specific proteolytic enzyme). Equally, the inhibitor may be an agent which prevents the polynucleotide product from functioning, such as neutralizing antibodies. The inhibitor may also be an antisense oligonucleotide, or any synthetic chemical capable of inhibiting expression of the gene or the stability and/or function of the protein. The inhibitor may also be a protein which interacts with a proetin of the invention prevent its function. The inhibitor may also be an RNA molecule which causes inhibition by RNA interference. In one embodiment the antisense polynucleotide or RNA molecule which causes RNA interference is an example of a polynucleotide of the invention.
According to a further aspect, there is provided an antibody exhibiting immunospecificity for a protein of the invention. The antibody may be used as a diagnostic reagent.
The antibody may be monoclonal or polyclonal, and may be raised in mouse, rat, rabbit, chicken, turkey, horse, goat or donkey. The antibody may be raised against one of the proteins of the invention, or may be raised against proteolytic or recombinant fragments.
For the purposes of this invention, the term "antibody", unless specified to the contrary, includes fragments which bind a protein of the invention. Such fragments include Fv, F(ab') and F(ab')2 fragments, as well as single chain antibodies.
Furthermore, the antibodies and fragment thereof may be chimeric antibodies, CDR- grafted antibodies or humanised antibodies. Administration
The formulation of any of the therapeutic substances (e.g. proteins, polynucleotides or modulators) mentioned herein will depend upon factors such as the nature of the substance and the condition to be treated. Any such substance may be administered in a variety of dosage forms. It may be administered orally (e.g. as tablets, troches, lozenges, aqueous or oily suspensions, dispersible powders or granules), parenterally, subcutaneously, intravenously, intramuscularly, intrasternally, transdermally or by infusion techniques. The substance may also be administered as suppositories. A physician will be able to determine the required route of administration for each particular patient.
Typically the substance is formulated for use with a pharmaceutically acceptable carrier or diluent. The pharmaceutical carrier or diluent may be, for example, an isotonic solution. For example, solid oral forms may contain, together with the active compound, diluents, e.g. lactose, dextrose, saccharose, cellulose, corn starch or potato starch; lubricants, e.g. silica, talc, stearic acid, magnesium or calcium stearate, and/or polyethylene glycols; binding agents; e.g. starches, arabic gums, gelatin, methylcellulose, carboxymethylcellulose or polyvinyl pyrrolidone; disaggregating agents, e.g. starch, alginic acid, alginates or sodium starch glycolate; effervescing mixtures; dyestuffs; sweeteners; wetting agents, such as lecithin, polysorbates, laurylsulphates; and, in general, non-toxic and pharmacologically inactive substances used in pharmaceutical formulations. Such pharmaceutical preparations may be manufactured in known manner, for example, by means of mixing, granulating, tabletting, sugar-coating, or film coating processes.
Liquid dispersions for oral administration may be syrups, emulsions and suspensions. The syrups may contain as carriers, for example, saccharose or saccharose with glycerine and/or mannitol and/or sorbitol. Suspensions and emulsions may contain as carrier, for example a natural gum, agar, sodium alginate, pectin, methylcellulose, carboxymethylcellulose, or polyvinyl alcohol. The suspensions or solutions for intramuscular injections may contain, together with the active compound, a pharmaceutically acceptable carrier, e.g. sterile water, olive oil, ethyl oleate, glycols, e.g. propylene glycol, and if desired, a suitable amount of lidocaine hydrochloride. Solutions for intravenous or infusions may contain as carrier, for example, sterile water or preferably they may be in the form of sterile, aqueous, isotonic saline solutions.
A therapeutically effective non-toxic amount of substance is administered. The dose may be determined according to various parameters, especially according to the substance used; the age, weight and condition of the patient to be treated; the route of administration; and the required regimen. Again, a physician will be able to determine the required route of administration and dosage for any particular patient. A typical daily dose is from about 0.1 to 50 mg per kg, preferably from about O.lmg/kg to lOmg/kg of body weight, according to the activity of the specific inhibitor, the age, weight and conditions of the subject to be treated, the type and severity of the disease and the frequency and route of administration. Preferably, daily dosage levels are from 5 mg to 2 g.
Agricultural use
Modulators identified by the method of the invention may be administered to plants in order to prevent or treat fungal infections. The modulators are normally applied in the form of compositions together with one or more agriculturally acceptable carriers or diluents and can be applied to the crop area or plant to be treated, simultaneously or in succession with further compounds.
The modulators of the invention can be applied together with carriers, surfactants or application-promoting adjuvants customarily employed in the art of formulation. Suitable carriers and diluents correspond to substances ordinarily employed in formulation technology, e.g. natural or regenerated mineral substances, solvents, dispersants, wetting agents, tackifiers, binders or fertilizers.
A preferred method of applying the modulators of the present invention or an agrochemical composition which contains them is leaf application. The number of applications and the rate of application depend on the intensity of infection by the fungus. However, the active ingredients can also penetrate the plant through the roots via the soil (systemic action) by impregnating the locus of the plant with a liquid composition, or by applying the compounds in solid form to the soil, e.g. in granular form (soil application). The active ingredients may also be applied to seeds (coating) by impregnating the seeds either with a liquid formulation containing active ingredients, or coating them with a solid formulation. In special cases, further types of application are also possible, for example, selective treatment of the plant stems or buds.
The active ingredients are used in unmodified form or, preferably, together with the adjuvants conventionally employed in the art of formulation, and are therefore formulated in known manner to emulsifiable concentrates, coatable pastes, directly sprayable or dilutable solutions, dilute emulsions, wettable powders, soluble powders, dusts, granulates, and also encapsulations, for example, in polymer substances. Like the nature of the compositions, the methods of application, such as spraying, atomizing, dusting, scattering or pouring, are chosen in accordance with the intended objectives and the prevailing circumstances. Advantageous rates of application are normally from 5Og to 5kg of active ingredient (a.i.) per hectare ("ha", approximately 2.471 acres), preferably from lOOg to 2kg a.i./ha, most preferably from 20Og to 50Og a.i./ha. The formulations, compositions or preparations containing the active ingredients and, where appropriate, a solid or liquid adjuvant, are prepared in known manner, for example by homogeneously mixing and/or grinding active ingredients with extenders, for example solvents, solid carriers and, where appropriate, surface- active compounds (surfactants). Suitable solvents include aromatic hydrocarbons, preferably the fractions having 8 to 12 carbon atoms, for example, xylene mixtures or substituted naphthalenes, phthalates such as dibutyl phthalate or dioctyl phthalate, aliphatic hydrocarbons such as cyclohexane or paraffins, alcohols and glycols and their ethers and esters, such as ethanol, ethylene glycol, monomethyl or monoethyl ether, ketones such as cyclohexanone, strongly polar solvents such as N-methyl-2-pyrrolidone, dimethyl sulfoxide or dimethyl formamide, as well as epoxidized vegetable oils such as epoxidized coconut oil or soybean oil; or water.
The solid carriers used e.g. for dusts and dispersible powders, are normally natural mineral fillers such as calcite, talcum, kaolin, montmorillonite or attapulgite. In order to improve the physical properties it is also possible to add highly dispersed silicic acid or highly dispersed absorbent polymers. Suitable granulated adsorptive carriers are porous types, for example pumice, broken brick, sepiolite or bentonite; and suitable nonsorbent carriers are materials such as calcite or sand, hi addition, a great number of pregranulated materials of inorganic or organic nature can be used, e.g. especially dolomite or pulverized plant residues.
Depending on the nature of the active ingredient to be used in the formulation, suitable surface-active compounds are nonionic, cationic and/or anionic surfactants having good emulsifying, dispersing and wetting properties. The term "surfactants" will also be understood as comprising mixtures of surfactants.
Suitable anionic surfactants can be both water-soluble soaps and water-soluble synthetic surface-active compounds. Suitable soaps are the alkali metal salts, alkaline earth metal salts or unsubstituted or substituted ammonium salts of higher fatty acids (chains of 10 to 22 carbon atoms), for example the sodium or potassium salts of oleic or stearic acid, or of natural fatty acid mixtures which can be obtained for example from coconut oil or tallow oil. The fatty acid methyltaurin salts may also be used.
More frequently, however, so-called synthetic surfactants are used, especially fatty sulfonates, fatty sulfates, sulfonated benzimidazole derivatives or alkylarylsulfonates. The fatty sulfonates or sulfates are usually in the form of alkali metal salts, alkaline earth metal salts or unsubstituted or substituted ammoniums salts and have a 8 to 22 carbon alkyl radical which also includes the alkyl moiety of alkyl radicals, for example, the sodium or calcium salt of lignonsulfonic acid, of dodecylsulfate or of a mixture of fatty alcohol sulfates obtained from natural fatty acids. These compounds also comprise the salts of sulfuric acid esters and sulfonic acids of fatty alcohol/ethylene oxide adducts. The sulfonated benzimidazole derivatives preferably contain 2 sulfonic acid groups and one fatty acid radical containing 8 to 22 carbon atoms. Examples of alkylarylsulfonates are the sodium, calcium or triethanolamine salts of dodecylbenzenesulfonic acid, dibutylnaphthalenesulfonic acid, or of a naphthalenesulfonic acid/formaldehyde condensation product. Also suitable are corresponding phosphates, e.g. salts of the phosphoric acid ester of an adduct of p-nonylphenol with 4 to 14 moles of ethylene oxide.
Non-ionic surfactants are preferably polyglycol ether derivatives of aliphatic or cycloaliphatic alcohols, or saturated or unsaturated fatty acids and alkylphenols, said derivatives containing 3 to 30 glycol ether groups and 8 to 20 carbon atoms in the (aliphatic) hydrocarbon moiety and 6 to 18 carbon atoms in the alkyl moiety of the alkylphenols. Further suitable non-ionic surfactants are the water-soluble adducts of polyethylene oxide with polypropylene glycol, ethylenediamine propylene glycol and alkylpolypropylene glycol containing 1 to 10 carbon atoms in the alkyl chain, which adducts contain 20 to 250 ethylene glycol ether groups and 10 to 100 propylene glycol ether groups. These compounds usually contain 1 to 5 ethylene glycol units per propylene glycol unit.
Representative examples of non-ionic surfactants are nonylphenolpolyethoxyethanols, castor oil polyglycol ethers, polypropylene/polyethylene oxide adducts, tributylphenoxypolyethoxyethanol, polyethylene glycol and octylphenoxyethoxyethanol. Fatty acid esters of polyoxyethylene sorbitan and polyoxyethylene sorbitan trioleate are also suitable non-ionic surfactants.
Cationic surfactants are preferably quaternary ammonium salts which have, as N-substituent, at least one C8-C22 alkyl radical and, as further substituents, lower unsubstituted or halogenated alkyl, benzyl or lower hydroxyalkyl radicals. The salts are preferably in the form of halides, methylsulfates or ethylsulfates, e.g. stearyltrimethylammonium chloride or benzyldi(2-chloroethyl)ethylammonium bromide.
The surfactants customarily employed in the art of formulation are described, for example, in "McCutcheon's Detergents and Emulsifiers Annual", MC Publishing Corp. Ringwood, New Jersey, 1979, and Sisely and Wood, "Encyclopaedia of Surface Active Agents," Chemical Publishing Co., Inc. New York, 1980.
The agrochemical compositions usually contain from about 0.1 to about 99% preferably about 0.1 to about 95%, and most preferably from about 3 to about 90% of the active ingredient, from about 1 to about 99.9%, preferably from about 1 to 99%, and most preferably from about 5 to about 95% of a solid or liquid adjuvant, and from about 0 to about 25%, preferably about 0.1 to about 25%, and most preferably from about 0.1 to about 20% of a surfactant. Whereas commercial products are preferably formulated as concentrates, the end user will normally employ dilute formulations. All of the features described herein may be combined with any of the above aspects, in any combination.
Embodiments of the invention will now be described by way of example.
EXAMPLES
Example 1. Identification fungal-specific genes in Aspergillus fumisatus
Ideally, fungal target genes should be present in as broad a range of fungi as possible, but absent from humans. A bioinformatics strategy was devised to identify such potential targets exploiting the availability of fungal and human genomes. Programs were written in PERL, and used publicly available downloaded databases and the BLAST algorithm (Altschul et al., 1990, J. MoI. Biol. 215:403-410). Predicted proteins from the A. nidulans genome (http://www.broad.mit.edu/ftp/pub/armotation/aspergillus/assemblyl/release3.1/asper gillus_nidulans_l_r3.1_proteins.fasta.gz) were blasted against the human refseq proteins (ftp://ftp.ncbi.nih.gov/refseq/H_sapiens/H_sapiens/protein/), and only those proteins without a matching human sequence were kept (i.e. E-value > le-4). This set was then blasted against N. crassa predicted proteins (http ://www.broad.mit.edu/ftp/pub/annotation/neurospora/assembly3/neurospora_3_ protein.gz ) and only those proteins with a good match (i.e., E-value < le-10) were kept. The resulting set of 2993 proteins therefore contained genes conserved between filamentous fungi but absent from humans. This set was then blasted against C. albicans orfs (http://www-sequence.stanford.edu/group/candida/download.html) and thereby separated into a set of 819 proteins with good homologs (E-value < le-10), which can be though of as "pan- fungal" proteins, and the other 2184 proteins, which can be thought of as "filamentous-only" proteins.
The pan-fungal set was examined for enzymes or enzyme families. Surprisingly, four ILV3-like genes were identified, AN4058, AN6346, (Tables I and II), AN5138 and AN7358, each of which had an A. fumigatus ortholog. This contrasts with the presence of a single ILV3 gene in S. cerevisiae. Alignment of the four ILV3 genes with ILV3 genes from other organisms, followed by phylogentic analysis identified the two ILV3 genes given in table I as the closest to the S. cerevisiae ILV3 gene. This was supported by percentage identity values given in Table III., A phosphoinositol phospholipase C was also identified (see Table I).
Table III. Percentage identities between ILV3 homo logs of Aspergilli
j\fl346, A.fumigatus ortholog of AN5138, from contig 1346; Af34_B, A.fumigatus ortholog of AN7358, from contig 34.
ILV3_Sc; ILV3 from S. cerevisiae
The "pan-fungal" and "filamentous-only" sequence sets were also analysed to identify signalling and metabolic molecules, by searching the data sets with PFAM HMMs (Bateman et al., 2004, Nucl. Acids Res. 32, D138-D141; http://www.sanger.ac.uk/Software/Pfam/), using a PERL script and downloaded HMMs. The HMMs used and the proteins identified in this way are given in Table rv. TablelV. Identification of target molecules by HMM
The A. fumigatus genes corresponding to the A. nidulans genes were identified as follows: The A. nidulans protein was blasted against the A. fumigatus genome (ftp://ftp.sanger.ac.uk/pub/pathogens/AJomigatus/AF.contigs.031704) to identify the matching region. The matching gene was predicted from this sequence using Genscan (genes.mit.edu/GENSCAN.html; Settings; organism = vertebrate;
Suboptimal exon cutoff = 1.00) and/or WISE2 (http://www.ebi.ac.uk/Wise2Λ. The predicted genes were compared with similar sequences using blast, the multiple alignment programs ClustalX (Thompson et al., 1997, Nucleic Acids Research, 24:4876-4882) and QAlign (Sameth et al., 2003, Bioinformatics 19, 1592-1593; http://www.ridom.de/qalign), and the alignment editor/viewer Align (Hepperle, D., 2001: Multicolor Sequence Alignment Editor. Institute of Freshwater Ecology and Inland Fisheries, 16775 Stechlin, Germany). Gene structures were visualised and modified using Artemis (http://www.sanger.ac.uk/Software/Artemis/; Rutherford et al., 2000, Bioinformatics 16, 944-945). It was necessary to carefully examine predictions and to compare predicted genes with homologous proteins to arrive at an informed prediction. The resulting genes are given in Tables I and II.
Example 2. Production of Gene Knockouts in A. fumisatus
For a gene of interest to be suitable as a anti-fungal drug target, it is necessary to show that it is an essential gene by generating a knock-out strain in which the gene is disabled. First a section of genomic DNA is synthesised by PCR5 corresponding to the gene of interest and the 2-3 kb on either side, and the PCR products cloned into pGEMT-easy (Promega). The genomic DNA is then used as the substrate for a tansposition reaction using the Epicentre Tn5 bacterial transposon into which fungal and bacterial selection markers have been inserted. Suitable fungal selection genes are PyrG, hygromycin or zeomycin; suitable bacterial markers are kanamycin or zeomycin. The transposed constructs are then screened by PCR to identify those where the transposon has inserted into the gene. PCR primers are designed either to cover the whole gene, such that insertion of the transposon results in the appearance of a product of higher molecular weight, or to extend from the start or end of the gene into the transposon, such that a product is only obtained when the transposon has inserted. Once a transposed copy has been identified, the genomic DNAΛransposon construct is excised from pGEMT-easy with a restriction enzyme which cuts only in the vector (e.g., Notl or Dral) and then used to transform haploid fungal protoplasts by means of PEG-mediated transformation. The fungi are grown under selective conditions, determined by the marker used, and transformants are picked. These are then screened by PCR using primers specific for the gene of interest: Replacement of the endogenous gene with the transposon-modified gene results in a single band of higher molecular weigh by PCR. Therefore, if the modified gene is observed, the gene is not essential. However, if none of the transformants show gene replacement, the gene of interest may be an essential gene. In this case, the transformation is then carried out on diploids using the same method and essentiality of the gene is tested by rehaploidisation followed by examination of the segregation pattern in haploids.
2.1 Gene disruption of Aspergillus fumigatus ILV3 genes Two Aspergillus fumigatus ILV3-like genes ILV34A and ILV1352 (Tables 1 and II) were knocked out as follows. Initially a ~6 kbp fragment of genomic sequence was generated for each gene follows.
2.1.1. ILV34A mutant construct A PCR was set up with Extensor master mix, A. fumigatus genomic DNA, and primers ILV34A_F1 and ILV34A_R1 (SEQ ID Nos. 72 and 73). The resulting 5889 bp PCR product was gel purified (Qiaquick gel purification kit, Qiagen) and ligated into pGEMTeasy overnight at 40C (Promega). 1 μl of the ligation mix was transformed into Electrocompetent E. coli Genehogs (Invitrogen) by electroporation. Transformed cells were plated on LB-Ampicillin-IPTG-Xgal agar plates and incubated at 370C overnight. White colonies were picked into LB-ampicillin broth and incubated at 370C overnight with shaking at 220 rpm. Plasmid DNA was isolated by Qiaprep miniprep DNA isolation (Qiagen). Notl digestion of the plasmid DNA indicated whether a 5.9 kb insert was present and the presence of ILV34 DNA was confirmed by PCR reactions using the following PCR primer sets: a) SEQ ID Nos 72 and 73; b) SEQ ID No. 74 and 75. Plasmids yielding the following size PCR products were deemed to be pGEMTEasy_ILV34A: a) 5889 bp, b) 1930 bp. A plasmid, pMB4zeo, was constructed that contained the mosaic ends recognised by the TN5 transposase, an Aspergillus fumigatus pyr G sequence and a bacterial zeocin resistance gene. The pyrG cassette was prepared with EcoRI sites flanking the genomic pyrG sequence. This cassette was introduced into the EcoRI site of pMOD2 (Epicentre). A zeocin resistance cassette was sub-cloned from an Xbal-NheI fragment of pEMzeo (Invitrogen) into the Xbal site. pMB4zeo was digested with PshAI and Xmnl and the 2551 bp fragment obtained was gel purified. This fragment (PshAI-MB4zeo) contained mosaic ends for transposition, an Aspergillus pyrG cassette and a bacterial zeocin resistance marker. pGEMTEasy_ILV34A was mutated by transposition with the EZ::TN transposase kit (Epicentre) using PshAI- MB4zeo. The following were assembled in a microcentrifuge: 1 μl EZ::TN 1OX Reaction Buffer, 1 μl pGEMTEasy_ILV34A, 1 μl PshAI-MB4zeo, 6 μl sterile water, 1 μl EZ::TN Transposase. The reaction mixture was incubated for 2 hours at 370C. 1 μl EZ::TN 1OX Stop Solution was added, mixed and heated for 10 minutes at 7O0C. 1 μl of the stopped reaction was transformed into Electrocompetent E. coli Genehogs (Invitrogen) by electroporation. Transformed cells were plated on LB-Ampicillin- zeocin agar plates and incubated at 37°C overnight. Colonies were picked into LB- ampicillin broth and incubated at 370C overnight with shaking at 220 rpm. Plasmid DNA was isolated by Qiaprep miniprep DNA isolation (Qiagen). Plasmids were screened by PCR using primer SEQ ID No. 74 and 80. A plasmid was selected that gave a PCR product of approximately 600bp indicating that the transposon PshAI- MB4zeo had inserted approximately 600 bp from the ATG start site of the coding sequence, thus disrupting the gene. This plasmid was designated ILV34A_KO33. The plasmid was digested with Notl and the 8.4kb fragment gel purified. This fragment was used for fungal transformation.
2.1.2 ILVl 352 mutant construct
A BAC containing a genomic copy of ILV1352 was was isolated and used as a template for a PCR with Extensor master mix and primers SEQ ID Nos. 76 and 77. The resulting 5958 bp PCR product was purified (Qiaquick gel purification kit,
Qiagen) and ligated into pGEMTeasy overnight at 40C (Promega). 1 μl of the ligation mix was transformed into Electrocompetent E. coli Genehogs (Invitrogen) by electroporation. Transformed cells were plated on LB-Ampicillin-IPTG-Xgal agar plates and incubated at 370C overnight. White colonies were picked into LB- ampicillin broth and incubated at 37°C overnight with shaking at 220 rpm. Plasmid DNA was isolated by Qiaprep miniprep DNA isolation (Qiagen). Notl digestion of the plasmid DNA indicated whether a 6 kb insert was present and the presence of ILV 1352 DNA was confirmed by PCR reactions using the following PCR primer sets: a) SEQ ID Nos. 76 and 77; b) SEQ ID Nos. 78 and 79. Plasmids yielding the following size PCR products were deemed to be pGEMTEasy_ILV1352: a) 5958 bp, b) 1923 bp.
A plasmid was constructed for transposition of a hygromycin resistance cassette. Firstly, The bacterial zeocin resistance cassette from pEMzeo was introduced into the EcoRI site of pMOD2 between the mosaic ends. Then, the zeocin resistance cassette together with the mosaic ends were amplified by PCR including Spel sites on the primers. The product was then digested with Spel and ligated into the Spel site of pGEMTeasy. The hygromycin resistance cassette was then cloned into the Xba I site. The resulting plasmid (named pPH8) was digested with Spe I and Xmn I to yield a 3649 bp fragment which was gel purified. This fragment (SpeI_PH8) contained mosaic ends for transposition, an Aspergillus hygromycin resistance cassette and a bacterial zeocin resistance marker. pGEMTEasy_ILV1352 was mutated by transposition with the EZ::TN transposase kit (Epicentre) using (SpeI_PH8). The following were assembled in a microcentrifuge: 1 μl EZ::TN 1OX Reaction Buffer, 1 μl pGEMTEasy_ILV1352, 2 μl (SpeI_PH8), 5 μl sterile water, 1 μl EZ::TN Transposase. The reaction mixture was incubated for 2 hours at 370C. 1 μl EZ::TN 1OX Stop Solution was added, mixed and heated for 10 minutes at 7O0C. 1 μl of the stopped reaction was transformed into Electrocompetent E. coli Genehogs (Invitrogen) by electroporation. Transformed cells were plated on LB-Ampicillin-zeocin agar plates and incubated at 370C overnight. Colonies were picked into LB-ampicillin broth and incubated at 370C overnight with shaking at 220 rpm. Plasmid DNA was isolated by Qiaprep miniprep DNA isolation (Qiagen). Plasmids were screened by PCR using primers SEQ ID Nos. 78 and 80. A PCR product of approximately 900bp indicated that the transposon PshAI-MB4zeo had inserted approximately 900 bp from the ATG start site of the coding sequence, thus disrupting the gene. The mutant plasmid was designated ILV1352_KO21. The plasmid was digested with Dral and the ~12kb fragment was gel purified. This fragment was used for fungal transformation.
2.1.3 Fungal transformation Initial studies demonstrated that a single knockout of ILV34A was not lethal, but did result in a strain with reduced growth. The effect of knocking out both ILV34A and ILV 1352 was therefore investigated. A brown/white colour diploid pyrG" strain of Aspergillus fumigatus (CDP3.1) was transformed with the ILV1352_knockout construct. Transformants were selected on hygromycin and screened by PCR using primers SEQ ID Nos. 80 and 83. Positive clones were checked by Southern blotting to confirm that there was a single knockout. No growth phenotype was observed for the ILV1352 single knockout. The diploid ILV1352 knockout was then transformed with the ILV34A mutant construct and resulting colonies screened by PCR with primers SEQ ED Nos. 80 and 81. Positive clones were checked extensively by PCR and Southern blotting. The diploid was haploidised on benomyl SAB plus uridine and uracil. Haploid spores were assessed for the presence of the hygromycin and pyrG selective markers. No growth was seen when haploid spores were plated on media without uridine and uracil but with hygromycin, indicating that the double knockout was lethal.
Example 3. Genomic Sequencing of Genes
The genomic sequences of the genes identified in Example 1 above can be determined experimentally as follows:
3.1 Bacterial and Fungal Strains
For bacterial cloning, E. coli Select96 cells (Promega) are used in accordance with manufacturers' instructions.
A.fumigatus clinical isolate AF293 (ref. No. NCPF7367; available to the public from the NCPF repository; Bristol, U.K.); the CBS repository (Belgium) or from Dr. David Denning' s clinical isolate culture collection, Hope Hospital, Salford. U.K.) is the preferred strain according to the present invention. AF293 was isolated in 1993 from the lung biopsy of a patient with invasive aspergillosis and aplastic anaemia. It was donated by Shrewsbury PHLS.
3.2 Purification of A. fumigatus genomic DNA
To obtain mycelial material for genomic DNA isolation, approximately 107 A. fumigatus conidia are inoculated in 50 ml of Vogel's minimal medium and incubated with shaking at 200 rpm until late exponential phase (18-24 h) at 370C. Mycelium is dried down onto Whatmann 54 paper using a Buchner funnel and a side-arm flask attached to a vacuum pump and washed with PBS/Tween. At this point, the mycelium can be freeze-dried for extraction at a later date.
The mycelium (fresh or freeze dried) is ground to a powder using liquid nitrogen in a mortar cooled to -2O0C. The ground biomass is transferred to 50 ml tubes on ice up to the 10 ml mark. An equal volume of extraction buffer (0.7 M NaCl; 0.1 M Na2SO3; 0.1 M Tris-HCl pH 7.5; 0.05 M EDTA; l%(w/v) SDS; pre- warmed to 650C) is then added to each tube, mixed thoroughly with a pipette tip and incubated at 650C for 20 minutes in a water bath. A volume of chloroform/isoamyl alcohol (24:1) equivalent to the volume of the original biomass is then added to each tube, tubes are mixed thoroughly and incubated on ice for 30 min. Tubes are then centrifuged at 3,500 x g for 30 min and the aqueous phase carefully transferred to fresh 50 ml tubes without disturbing the interface.
An equal volume of chloroform/isoamyl alcohol (24:1) is added, the tubes vortexed and incubated on ice for 15 minutes. Tubes are then spun at 3,500 x g for 15 minutes. After this spin, if large amounts of precipitate are still present, the supernatant is removed and the chloroform:isoamyl alcohol step repeated. The supernatant is removed and placed in clean sterile Oak Ridge tubes. An equal volume of isopropanol is added and mixed gently. Tubes are incubated at room temperature for at least 15 minutes. Tubes are then centrifuged at 3,030 x g for 10 minutes at 40C to pellet the DNA. The supernatant is removed and the pellet allowed to air dry for 10-25 minutes. The pellet is suspended in 2 ml sterile water. 1 ml of 7.5 M ammonium acetate is added, mixed and incubated on ice for 1 hour. Tubes are centrifuged at 12,000 x g for 30 min, the supernatants transferred to a fresh tube and 0.54 volumes of isopropanol are added, mixed and incubated at room temperature for at least 15 minutes. Tubes are then centrifuged at 5,930 x g for 10 min, the supernatant is removed and the pellet washed in 1 ml of 70% ethanol. Tubes are centrifuged at 5,930 x g for 10 min and all the ethanol is removed. The pellet is air dried for 20-30 minutes at room temperature and suspended in 0.5-1.0 ml of TE (10 mM Tris-HCl pH 7.5; 1 mM EDTA) Finally, the DNA is treated with RNase A (5 μl of lmg/ml stock).
3.3 PCR Reactions Primers pairs are designed to the upstream and downstream regions of the A. fumigatus AF293 genes: The 200-base regions flanking the gene of interest are used as input sequence for Primer3 (http://frodo.wi.mit.edu/cgi- bin/primer3/primer3_www.cgi) to provide a primer pair that spans the gene. If the gene is particularly long it may be necesssary to design primer pairs with internal sequences and thus sequence the gene in parts. The following reagents and conditions are used:
1 Ox high fidelity PCR buffer 5 μl dNTP (Clontech: 10 mM) 1 μl nH20 39 μl
Pfu Ultra Polmerase (2.5 U/μl) 1 μl
Primer pairs (10 pmol/μl stock) 1 μl each gDNA
PCR cycles are as follows: (1) 95° C, 2 min; (2) 95° C, 30 sec; (3) 54° C, 30 sec; (4) 72° C, 2 min; (5) 72° C, 10 min; (6) 8° C, hold. 40 cycles of steps 2-4 are carried out and the PCR products are run on a gel. The product band is excised from the gel and purified using QIAquick Gel Extraction Kit (Qiagen Ltd, Boundary Court, Gatwick Road, Crawley, West Sussex, RHlO 9AX, UK) according to the manufacturers instructions and eluted into 30 μl of sterile water (BDH molecular biology grade/filter sterile).
(1 :30 dilution of stock) 2 μl
3.4 Genomic DNA Cloning and Sequencing Since the gDNA is amplified using Pfu ultra polymerase which produces blunt ends, it is necessary to add 'A' overhangs before ligating in to pGEM Teasy. 12.5 μl of purified PCR product is incubated with 12.5 μl 2x PCR Reddy Mix (ABGene) at 70° C for 30 minutes. The sample is then purified using Qigen Qiaquick gel extraction kit and eluted with 30 μl of molecular biology grade water.
The PCR product is then ligated into pGEM-Teasy (Promega) using the following ligation mixture: 2x Buffer, 5 μl; pGEM Teasy, 1 μl; PCR product, 3 μl; T4 DNA Ligase,l μl. The reaction is incubated overnight at 4° C.
2 μl of the ligation mix are then added to Select 96 cells (Promega) and incubated for 20 min on ice. Cells are then heat shocked at 42° C for 45 sec and placed back on ice. 250 μl of room temp. SOC medium are then added and the cells incubated for 1 hour at 37° C, with shaking at 220 rpm. 50 and 200 μl amounts are then plated on to LB agar plates containing ampicillin (100 μg/ml), 50 μl X-gal (4%) and 10 μl PTG (100 mM) and incubated over night at 37° C.
Individual white colonies are picked from each transformation are inoculated into LB with ampicillin (100 μg/ml) and incubated overnight at 370C with shaking at 220 rpm. Plasmid DNA is extracted using Qiagen miniprep kit according to the manufacturers instructions. 1 μl of plasmid DNA is digested with restriction enzymes for 1 hour at 37° C. Results are compared with the predicted sizes for constructs and clones showing the correct restriction digest pattern are sequenced at MWG Biotech UK Ltd, Waterside House, Peartree Bridge, Milton Keynes, MK6 3BY.
Example 4. cDNA sequencing and RACE
The internal sequences of the genes of interest are experimentally determined by cloning and sequencing cDNA, and the 5' and 3' ends of the genes are determined by RACE (Rapid Amplification of cDNA Ends).
4.1 cDNA cloning and sequencing
4.1.1 Preparation of A. fumigatus RNA and cDNA
Fungal cultures were prepared as described in Example 3. Cultures were harvested by filtration, then washed twice with DEPC-treated water and transferred to a 50 ml Falcon tube. Samples were frozen in liquid nitrogen and stored at -8O0C until required. To prepare RNA, fungal samples were ground to a fine powder under liquid nitrogen. RNA was then extracted using the Qiagen RNeasy Plant Mini Kit following the protocol for isolation of total RNA from filamentous fungi in the RNeasy Mini Handbook (06/2001, Pages 75-78, http://www.qiagen.com/literature/ handbooks/rna/rnamini/1016272HBRNY_062001WW.pdf). The following modifications were used: At step 3, RLC was used as the lysis buffer of choice; At step 7, the Rneasy column was incubated for 5 min at room temperature after addition of RWl; The optional step 9a was carried out; At step 10, 30μl RNase-free water was added, the samples incubated for 10 min at room temperature, and then centrifuged; At step 11 , the elution step was repeated to give a total volume of 60 μl RNA.
DNA contamination was removed from the RNA by the addition of Dnase, using 2 μl DNase per μg RNA, in the presence of 1OX DNase buffer and incubating at 37°C for 2h. DNase-treated RNA was cleaned up using the RNeasy Plant Mini Kit following the RNeasy Mini Protocol for RNA Cleanup (RNeasy Mini Handbook 06/2001, pages 79-81).
To synthesise cDNA from the above RNA the following reaction mixture was prepared: 100 ng-1 μg of DNA-free RNA, 3 μl oligo (dT) (100 ng/μl), and DEPC- treated water to a total volume of 42 μl. Samples were incubated in a heat block at 650C for 5 min after which they were allowed to cool slowly to room temperature. Then 2 μl Ultrapure dNTPs, 1 μl reverse transcriptase (Stratascript) and 5 μl 1OX reverse transcriptase reaction buffer (Stratascript) were added. Samples were incubated at 42°C for Ih, denatured at 9O0C for 5 min and then cooled on ice.
4.1.2 Production of cDNA constructs
PCR is carried out using the cDNA above to generate cDNA fragments. Primers are designed based on the 5' and 3' ends of the predicted genes. PCR reactions are carried out using the following reagents and conditions:
1 Ox high fidelity PCR buffer 5 μl dNTP (clontech: 10 mM) 1 μl
MgSO4 (50 mM) 2 μl nH2O 37.8 μl
Platinum TAQ Polmerase (5 U/μl) 0.2 μl
Primer pairs (10 pmol/μl stock) 1 μl each cDNA 2 μl
PCR cycles are run as follows; (1) 94° C, 5 min; (2) 94° C, 30 sec; (3) 53° C, 30 sec; (4) 68° C, 90 sec; (5) 68° C, 10 min; (6) 8° C, pause. Cycles 2-4 are run 40 times. The PCR products are purified using QIAquick PCR Purification Kit (Qiagen Ltd, Boundary Court, Gatwick Road, Crawley, West Sussex, RHlO 9AX, UK) according to the manufacturers instructions and run on agarose gels. PCR products are ligated into pGEM-Teasy, used to transform Select 96 cells, and sequenced as described in Example 3 above.
4.2 RACE To determine the 5' and/or 3' ends of the genes, RACE (Rapid Amplification of cDNA Ends) was carried out, using the GeneRacer™ Kit (Invitrogen; cat. No. Ll 502-01), essentially as per manufacturers instructions.
4.2.1 Preparation of RNA A. fumigatus biomass was prepared as described in Example 3. RNA was prepared using the FastRNA kit (QBIOgene) following the manufacturer's instructions (Revision 6030-999-1 J05) with the following amendments: At step 1, 40 mg of biomass was used per extraction; At step 2, samples were processed for 20 seconds at speed 5, incubated on ice for 3 minutes, and processed again for 20 seconds at speed 5; At step 3 samples were centrifuged for 5 minutes; At step 5, 500 μl DIPS were added, mixed, and incubated at room temperature for 2 minutes. Samples were mixed again and incubated for a further 2 minutes; At step 6 two washes in 250 μl SEWS were carried out; At step 7, the pellet was disolved in 50 μl SAFE buffer.
4.2.2 RACE
1 μg total RNA prepared as described above was de-phosphorylated in a 10 μl reaction using 10 units of calf intestinal phosphate (CIP), 1 μl 1OX CIP buffer and 4OU RNaseOut™ (made up to 10 μl in DEPC water) at 5O0C for 1 hour. Samples were then made up to 100 μl with DEPC water and the RNA extracted with 100 μl (25:24:1) phenol:chloroform:isoamyl alcohol. RNA was then precipitated by the addition of 2 μl mussel glycogen (10 mg/ml), 10 μl 3M sodium acetate, pH 5.2 and 220 μl 95% ethanol and the sample frozen on dry ice for 10 minutes. RNA was pelleted by centrifugation at 14,500 rpm for 20 minutes at 40C, washed with 70% ethanol, air dried and re-suspended in 8 μl DEPC water.
De-phosphorylated RNA (7 μl) was de-capped in a 10 μl reaction with 0.5 U tobacco acid pyrophosphatase (TAP), 1 μl 10x TAP buffer and 40 U RnaseOut™ for 1 hour at 370C. RNA was extracted with phenolxhloroform and precipitated as above, and then re-suspended in 7 μl DEPC-treated water.
De-phosphorylated, de-capped RNA (7 μl) was added to the pre-aliquoted GeneRacer™ RNA Oligo (0.25 μg) and incubated at 65°C for 5 minutes. A 10 μl ligation reaction is then set up by the addition of 1 μl 10x ligase buffer, 1 μl 10 mM ATP, 40 U RnaseOut™ and 5 U T4 RNA ligase and incubated at 370C for 1 hour. RNA was extracted and precipitated as described previously and re-suspended in 11 μl DEPC-treated water.
First-strand cDNA is prepared by the addition of 1 μl GeneRacer™ Oligo dT primer and 1 μl dNTP mix (1OmM each) to 10 μl ligated RNA and incubated at 650C for 5 minutes. The following reagents were added to the 12 μl ligated RNA and primer mix; 4 μl 5x first strand buffer, 2 μl 0.1 M DTT, 1 μl RNaseOut™ and 1 μl Superscript™ II RT (200 U/μl) and incubated first at 420C for 50 minutes and then, to stop the reaction, at 7O0C for 15 minutes. 2 U RNase H was added to the reaction mix and incubated at 370C for 20 minutes. To amplify the 5'cDNA ends a 50 μl PCR reaction is set up using 1 μl of the
RACE-ready cDNA prepared above, 1 μl GeneRacer™ 5' primer, -1 μl reverse gene- specific primer (designed against the complementary strand of the coding sequence: 5 pmol/μl stock), 1 μl dNTP solution (10 mM each), 2 μl 50 mM MgSO4, 5 μl High Fidelity PCR buffer, 0.5 μl Platinum® Taq DNA Polymerase High Fidelity (5 U/μl) and 38.5 μl sterile water. Cycling parameters are given in Table V below.
A second, nested PCR stage may also be carried out. This is set up using 1 μl of the RACE cDNA from the first stage above, 1 μl Nested 5' primer (supplied with kit), 1 μl second reverse gene-specific primer (designed against the complementary strand of the coding sequence and nested with respect to the above primer: 5 pmol/μl stock), 1 μl dNTP solution (10 mM each), 2 μl 50 niM MgSO4, 5 μl High Fidelity PCR buffer, 0.5 μl Platinum® Taq DNA Polymerase High Fidelity (5 U/μl) and 38.5 μl sterile water. Cycling parameters are given in Table V below.
To amplify 3' ends a 50 μl PCR reaction is set up using 1 μl of the RACE- ready cDNA prepared above, 1 μl GeneRacer™ 3' primer (10 μM), 1 μl forward gene-specific primer (designed against the coding strand of the coding sequence : 5 pmol/μl stock), 1 μl dNTP solution (10 mM each), 2 μl 50 mM MgSO4, 5 μl High Fidelity PCR buffer, 0.5 μl Platinum® Taq DNA Polymerase High Fidelity (5 U/μl) and 38.5 μl sterile water. Cycling parameters are given in Table V below:
A second, nested PCR stage may also be carried out. This is set up using 1 μl of the 3' RACE cDNA from the first stage above, 1 μl Nested 3' primer (supplied with kit), 1 μl reverse gene-specific primer (designed against the coding strand of the coding sequence and nested with respect to the above primer: 5 pmol/μl stock), 1 μl dNTP solution (10 mM each), 2 μl 50 mM MgSO4, 5 μl High Fidelity PCR buffer, 0.5 μl Platinum® Taq DNA Polymerase High Fidelity (5 U/μl) and 38.5 μl sterile water. Cycling parameters are given in Table V below.
5' and 3' RACE identify the 5' ATG and 3' stop codons as well as giving the 5' and 3' untranslated regions of the genes.
Table V. Cycling parameters for 5' and 3'RACE
5' and 3' RACE Nested PCR
94 0C 2 min 1 cycle 940C 2 min 1 cycle
940C 30 sec 5 cycles 94°C 30 sec 25 cycles
720C 1 min 670C 30 sec
680C 1 min
940C 30 sec 5 cycles
7O 0C 1 min
68°C 10 min 1 cycle
940C 30 sec 25 cycles 8°C Hold
640C 30 sec
680C 1 min
680C 10 min 1 cycle
80C Hold To determine the 5' end of ILV34 a 50 μl PCR reaction was set up using 1.5 μl of RACE-ready cDNA prepared as described above, 3 μl GeneRacer™ 5' primer, 1 μl reverse gene-specific primer (designed against the complementary strand of the coding sequence; SEQ ID No. 67: 10 pmol/μl stock), 1 μl dNTP solution (10 mM each), 2 μl 50 mM MgSO4, 5 μl High Fidelity PCR buffer, 1 μl Platinum® Taq DNA Polymerase High Fidelity (5 U/μl) and 36 μl sterile water. Cycling parameters are given in Table VI below. 5' RACE confirmed the predicted 5' start site and first intron ofILV34.
To amplify the 5'cDNA end of ILV1352 a 50 μl PCR reaction was set up using 1 μl of the RACE-ready cDNA prepared above, 1 μl GeneRacer™ 5' primer, 1 μl reverse gene-specific primer (designed against the complementary strand of the coding sequence: SEQ ID No. 68; 5 pmol/μl stock), 1 μl dNTP solution (10 mM each), 2 μl 50 mM MgSO4, 5 μl High Fidelity PCR buffer, 0.5 μl Platinum® Taq DNA Polymerase High Fidelity (5 U/μl) and 38.5 μl sterile water. Cycling parameters are given in Table VI below. A 550 b.p. product was cloned into pCR4- Topo as per manufacturers instructions and sequenced using T7 and T3 sequncing primers. 5' RACE confirmed the predicted 5' start site of ILV1352.
Table VI. Cycling parameters for 5' RACE
ILV3^ \ ILV1352
940C 2 min, 1 cycle 94 0C 2 min 1 cycle
940C 30 sec, 720C l min, 4 cycles 940C 30 sec, 720C l min, 5 cycles
940C 30 sec, 70 0C 1 min, 4 cycles 940C 30 sec, 70 0C 1 min, 5 cycles
940C 30 sec, 640C 30 sec, 680C 1 940C 30 sec, 640C 30, sec 680C 1 min, 29 cycle- min, 25 cycles
680C 10 min , 1 cycle 680C 10 min, 1 cycle
80C, hold 80C Hold Example 5. Identification of fungal homologs of genes of interest Homo logs of the proteins or polynucleotides of the invention can be identified in other fungi by means of bio informatics analysis. Sequences identified by bioinformatics can be used to design primers which in turn can be used in PCR to generate DNA coding for the homologs. Alternatively, degenerate PCR can be used to obtain sequence, which can then be used to generate probes for screening cDNA or genomic libraries of the organism of interest to identify clones containing the homologs. As a further alternative Southern blots, using fragments of genes from one species as probes, can be used to identify the presence of a homolog in the genome of a second species. The same probe can then be used to screen cDNA or genomic DNA libraries. Once clones corresponding to the novel genes have been identified they can be expressed for functional characterisation of the protein.
5.1 Identification of homologs by bioinformatics Homologs of the proteins and polynucleotides of the invention can be identified by searching locally held databases, as detailed in Table VII, using BLAST with SEQ ID Nos: 3, 6, 9, 12, 15, 18, 21, 24, 27, 30, 33, 36, 39, 42, 45, 48, 50, 53, 56, 59, 61 or 63 as the query sequence. Where necessary, matching contigs are down-loaded and genes predicted from genomic DNA as described in Example 1. Alternatively, BLAST searches can be carried out over the web.
Table VII. Sources of data for local BLAST searches
ESTs1 I
'This dataset contains ESTs from the following plant pathogen fungi: Blumeria graminis, Botrγotinia, Cladosporium fulvum, Colletotrichum trifolii, Cryphonectria parasitica, Fusarium sporotrichioides, Gibberella zeae, Leptosphaeria maculans, Magnaporthe grisea, Mycosphaerella graminicola, Phytophthora infestans, Phytophthora sojae, Sclerotinia sclerotiorum, Ustilago maydis and Verticillium dahliae.
The relationships between SEQ ID Nos: 3, 6, 9, 12, 15, 18, 21, 24, 27, 30, 33, 36, 39, 42, 45, 48, 50, 53, 56, 59, 61 or 63, and hits identified from blast searches above can be clarified by phylogenetic analysis, for example using the PHYLIP suite of programs (Felsenstein, Felsenstein, J., 2002. PHYLIP (Phytogeny Inference Package) version 3.6a3. Distributed by the author. Department of Genome Sciences, University of Washington, Seattle). A distance matrix is generated using PROTDIST with the Jones-Taylor-Thornton model and a tree inferred using FITCH with global rearrangements and 10 jumbles of input order. 100 bootstrap replicates are generated using SEQBOOT, distance matrices generated using PROTDIST as above, trees inferred using NEIGHBOUR, and then bootstrap values and the consensus trees are calculated using CONSENSE. Trees are viewed using TREEVIEW (Page, 1996 Page, R. D. M., 1996. TREEVIEW: An application to display phylogenetic trees on personal computers. Computer Applications in the Biosciences 12, 357-358.). Preliminary phylogenetic trees can be generated "on the fly" by the multiple alignment package QAlign (Sameth et al., 2003, Bioinformatics 19, 1592-1593; http://www.ridom.de/qalign). Alternatively , the relationship between SEQ ID Nos: 3, 6, 9, 12, 15, 18, 21, 24,
27, 30, 33 36, 39, 42, 45, 48, 50, 53, 56, 59, 61 or 63 and homologs can be clarified using reciprocal blast hits as described by e.g., Wall et al. {Bioinformatics 19, 1710- 1711).
ILV3 sequences in filamentous fungi other than A. nidulans and A. fumigatus were identified by means of the methods described above and by BLAST searches against the NCBI nr database. Protein sequences were aligned with Aspergillus ILV3 proteins and gene predictions improved where necessary. The resulting sequences (SEQ ID Nos 37-63) are summarised in Table II. From the alignment, it was possible to cluster the ILV3 sequences into two groups of orthologs, indicated in the table as group I, clustering with A. fumigatus sequence SEQ ED No. 21, and group II, clustering with A. fumigatus SEQ ID No. 12.
5.2 Identification of homologs by defienerate PCR
5.2.1. Preparation of genomic DNA from organism of interest
Fungal cultures are prepared using methods suitable for particular species. For example, Aspergillus and Candida species, Cryptococcus neoformans, Fusarium solani and Trichophyton species are maintained on Sabouraud dextrose agar at 30- 35°C; Leptosphaeria nodorum on Malt agar medium (30 g/L malt extract; 15 g/L Bacto-agar, pH 5.5), 24.00C; Magnaporthe grisea on oatmeal agar (6.1 g/L agar, 53.3 g/L instant oatmeal) 25.O0C, or Cornmeal agar (Difco 0386), 26.O0C; Phytophthora capsici cultures are maintained on on V-8 agar at 24°C; Pyricularia oryzae cultures are maintained on rice polish agar at 240C under white fluorescent lights (12 hr artificial day), and are subcultured every 7 - 14 days by the transfer of mycelial plugs to fresh plates; Pythium ultimum cultures are maintained on PDA at 240C, and subcultured every 7 days by the transfer of aerial mycelium to fresh plates with an inoculating needle; Rhizoctonia solani cultures are maintained on PDA at 240C under fluorescent lights (12 h artificial day), and subcultured every 7 days by the transfer of mycelial plugs to fresh plates; Ustilago maydis cultures are maintained on PDY agar at 3O0C in the dark, and subcultured by re-streaking. Genomic DNA is prepared from cultures using standard methodologies, e.g. using the Qiagen DNeasy Plant Kit, or using methods described in Example 3.
5.2.2 PCR
Primers are designed to correspond to regions conserved between the gene of interest and its homologs (identified as described above). Those skilled in the art will appreciate that it may be necessary to try a range of primer pairs. PCR reactions using the primer pairs are set up as follows: 2x ReddyMix PCR mastermix (ABgene) 12.5 μl
Primers (5 pmol) 1 μl each template gDNA 1.5-4 μg/ml nuclease-free water to final volume of 25 μl
The reactions are run using the following conditions on a Biometra personal PCR cycler (Thistle Scientific Ltd, DFDS House, Goldie Road, Uddington, Glasgow, G71 6NZ): (1) 950C, 5min; (2) 950C, lmin; (3) 530C, lmin 30sec; (4) 68°C, 2min 30sec; (5) 720C, lOmin; (6) 40C, Hold. 30 cycles of steps 2-4 are carried out. The PCR products are purified (to remove residual enzymes and nucleotides) using Qiagen's QIAquick PCR Purification Kit (Qiagen Ltd, Boundary Court, Gatwick Road, Crawley, West Sussex, RHlO 9AX, UK) according to the manufacturers instructions and eluted into 40 μl of sterile water (BDH molecular biology grade/filter sterile). The purified PCR products are examined on 1% agarose gels. Those skilled in the art will appreciate that degenerate PCR may require variations in a number of parameters in the attempt to generate a product. These include primer concentration, template concentration, concentration OfMg2+ ions, elongation and annealing times, and annealing temperature. Variations in temperature can be accomodated by the use of a gradient PCR machine. The purified PCR products are cloned into pPEM-Teasy (Promega) and then transformed into XL 10-Go Id® Kan ultracompetent E. coli cells according to the manufacturers instructions. The transformation reactions are then plated onto LB agar plates containing ampicillin (100 μg/ml), 50 μl X-gal (4%) and 10 μl IPTG (100 mM). Following overnight incubation at 370C, individual white colonies from each transformation are sub-cultured into LB broth containing ampicillin (100 μg/ml). After overnight incubation at 370C with shaking, plasmids are extracted using Qiagen spin mini plasmid extraction kits according to the manufacturers instructions and sent away for full-length sequencing.
5.3 Identification of homologs by Southern Blotting
5.3.1 Digestion of genomic DNA and transfer to nylon membranes
Genomic DNA from the fungi of interest are digested with the appropriate restriction enzyme and run on 0.8 % agarose gel. The gel is then submerged in 250 mM HCl for no more than 10 mins, with shaking, at room temperature, after which the gel is rinsed with sterilised RO water.
Transfer of the DNA onto nylon membrane is carried out using 0.4 M NaOH. Transfer protocols and apparatus are well known and are described in e.g. Sambrook et al., (1989), Molecular Cloning, 2nd Edition., Cold Spring Harbor Laboratory Press. After transfer, the DNA is fixed to the membrane by baking at 12O0C for 30 min. The membrane can then be used immediately, or stored dry for future use.
5.3.2. Preparation of probe Probes are generated either by restriction digests of DNA or by PCR of an appropriate region. A suitable probe can be generated by PCR using a primer pair designed using Primer3 (http://frodo.wi.mit.edu/cgi-bin/primer3/primer3_www.cgi) and A. fumigatus genomic DNA.
1 μg DNA template is diluted in molecular biology water to a total volume of 16 μl, denatured in a boiling water bath for 10 mins, and quickly chilled on ice. 4 μl DIG-High Prime (1 mM dATP, 1 mM dCTP, 1 mM dGTP, 0.65 mM dTTP, 0.35 mM alkali-labile-digoxygenin-11-dUTP, 1 U/μl labelling grade Klenow enzyme, 5 x reaction buffer, in 50% (v/v) glycerol) is then added and the reaction incubated at 370C for 20 hours, after which 2 μl of 200 mM EDTA pH 8.0 is added to terminate the labelling reaction. The labelling efficiency is estimated by comparison with DIG-labelled control DNA.
5.3.3.Prehybridisation and Hybridisation
The membrane is placed in a hybridisation tube containing 20 ml of prehybridisation solution (DIG Easy Hyb, Roche) per 100cm2 of membrane surface area and prehybridised at 42°C for 2 hours in a hybridisation oven. The DIG- labelled probe is denatured by heating in a boiling water bath for 10 min and then chilled directly on ice. The probe is then diluted to -200 ng/mL in hybridisation solution (Easy Hyb, Roche; at least 5 mL of hybridisation solution is required per hybridisation). The prehybridisation solution is discarded from the hybridization tube and the hybridisation solution containing the DIG-labelled probe added quickly. The hybridisation then proceeds overnight at a 42°C in the hybridisation oven. The optimum temperature is dependant on probe size and homology with target sequence and is determined empirically.
After hybridisation, the membrane is washed twice at 420C, 5 mins per wash, with 50 mL of stringency wash solution (3 x SSC, 0.1% SDS; where 20 x SSC buffer is 3 M NaCl, 300 mM sodium citrate, pH 7.0), followed by two washes at RT, 15 min per wash, in 50 mL stringency wash solution. The stringency of these washes can be decreased by increasing the SSC concentration to 6 x SSC, 0.1% SDS and/or decreasing the wash temperatures.
5.3.4. Detection
The membrane is washed in 20 mL washing buffer (100 mM maleic acid, 150 mM NaCl; pH 7.5; 0.3% v/v Tween 20), and then incubated successively with the following; 20 mL blocking solution (1% w/v blocking reagent for nucleic acid hybridisation, Roche, dissolved in 100 mM maleic acid, 150 mM NaCl, pH 7), for 30 min at room temperature; Anti-DIG- alkaline phosphatase (Roche) diluted 1 : 5,000 in blocking buffer, 30 min at room temperature; Washing buffer, two washes each of 15 min at room temperature; Detection buffer (10OmM Tris-HCl, 100 mM NaCl; pH 9.5), 2 min at room temperature. The membrane is then removed, placed on top of an acetate sheet, and ~ 0.5 ml (per 100cm2) of CSPD or CDP-star added to the top of the membrane. A second sheet of acetate is then placed over the surface of the membrane, the assembly incubated for 5 min at room temperature and then sealed in a plastic bag. The assembly is then exposed to X-ray film for between 15 min and 1 hour. Optimal exposure time is determined empirically by increasing exposure time up to 24 hours. The presence of a band on the gel is evidence of a gene in the genomic DNA of interest. The molecular weight of the band depends on the size of the restriction fragment that contains the gene.
Example 6. Expression of recombinant proteins and/or fragments Recombinant proteins or fragments are expressed to enable detailed study of function and for the development of an in vitro high-throughput screen for inhibitory compounds. PCR is carried out using cDNA, prepared as described above, to generate polynucleotides encoding protein sequence essentially corresponding to SEQ ID Nos. 3, 6, 9, 12, 15, 18, 21, 24, 27, 30, 33, 36, 39, 42, 45, 48, 50, 53, 56, 59, 61 or 63.
Primers are designed to encode the 5' and 3' ends of the coding sequences, with the addition of bases necessary to anneal with the pET-30 Xa/LIC vector (5' additional sequence, GGTATTGAGGGTCGC; 3' additional sequence,
AGAGGAGAGTTAGAGCC). If the protein has an N-terminal leader peptide, this should be excluded. If the protein is made up of multiple domains, it may be desirable or necessary to express only a limited number of domains, or even a single domain. In these cases, primers are designed to correspond to domain boundaries. PCR reactions are carried out using the following reaction mixture and conditions. All Reagents are present in the KOD kit (Novagen).
2.5 μl 1Ox PCR Buffer
5 μl dNTPs (2mM) 2 μl MgSO4 (25mM)
1 μl each primer (5 pmol each)
1 μl template cDNA
11.5 μl nuclease-free water
1 μl KOD Polymerase
PCR reactions are run using the following conditions: (1) 940C, 5 min; (2) 940C, 1 min;
(3) 59.30C, 1 min; (4) 680C, 1 min 30sec; (5) 680C, 10 min; (6) 1O0C, hold. 40 cycles of steps 2-4 are carried out and the PCR products purified using QIAquick PCR Purification
Kit (Qiagen Ltd, Boundary Court, Gatwick Road, Crawley, West Sussex, RHlO 9AX, UK) according to the manufacturers instructions. The purified PCR products are examined on agarose gels. cDNA fragments are then cloned in to the pET30 Xa/LIC vector (Novagen), transformed into Nova Blue chemically competent E. coli cells, and plated on to a prewarmed kanamycin (+) selection plate. After an overnight incubation at 37° C, kanamycin-resistant colonies are selected and grown up in kanamycin containing LB medium. Plasmid DNA is isolated using the Plasmid Mini Kit (Qiagen). Confirmation of the presence and correct orientation of the inserts is determined by restriction analysis and sequencing of the construct. Purified plasmid DNA, which is been confirmed to be of the correct sequence and orientation, is transformed into chemically competent BL21 Star (DE3) One Shot E. coli cells and grown overnight at 37° C. 2 ml of an over-night culture are used to innoculate 100 ml of LB, 30 μg/ml kanamycin, and the cultures incubated at 37° C, 220 rpm until the cell density reaches an optical density of 0.6 (approximately 3 hours). Expression of the recombinant protein is then induced with IPTG (ImM) for 5 hours.
Bacteria are harvested by centrifugation at 4500 rpm for 10 minutes and the pellets lysed in lysis buffer (10 ml Bugbuster (Novagen), 10 μl Benzonase (Novagen), 0.4 μl lysozyme (Novagen) and 100 μl IM imadazole for 20 minutes at room temperature. Cells are then spun down at 1600Og for 20' at 4° C and the supernatant, containing soluble recombinant protein, removed to a clean tube.
Supernatant is added to prewashed Ni-Nta resin at a concentration of 5-10 mg protein per ml of resin and allowed to bind for 1 hour at 4° C. Protein-resin mix is then poured into a column, washed twice in 4 ml of wash buffer (2.5 ml IM phosphate buffer ρH8 , 6.25 ml 4M NaCl, 1 ml IM Imidazole pH8, 0.5 ml 10%
Tween 20; made up to 50 mis in n.H20) and then eluted in 4x 0.5 ml fractions with elution buffer (250 μl IM Phosphate Buffer pH8, 625 μl 4M NaCl, 1.25 ml IM Imidazole pH8, 50 μl 10% Tween 20, Made up to 5 mis in n.H2O). Fractions containing purified protein are identified by SDS-Page and Western blotting using an S-tag HRP conjugate (Novagen), pooled and then desalted using a PDlO column (Amersham) equilibrated with 25 ml of 0.1 M KPO4 pH7. Fractions containing purified recombinant protein can be concentrated using YMlO columns (Millipore) and stored at -8O0C.
Alternative expression systems can be used for expression in bacteria, such as the glutathione S-transferase or mannose-binding fusion-protein system.
6.1 Expression of recombinant ILV34
A full-length and a truncated version of ILV34 (Table II) were expressed. The truncated version lacked the 29 amino acid N-terminal mitochondrial targeting sequence. Primers were designed to encode the 5' and 3' ends of the coding sequence, with the addition of bases necessary to anneal with the pET-30 Ek/LIC vector (full length construct 5' primer SEQ ID No. 64; truncated construct 5' primer SEQ ID No. 65; common 3' primer, SEQID No. 66). PCRs were carried out using the following reaction mixture and conditions (all reagents were present in the KOD Hot Start DNA Polymerase kit, Novagen):
5μl 1Ox PCR buffer; 5μl dNTPs (2mM); 3μl MgSO4 (25mM); 1.5μl each primer (15 pmol each); lμl template cDNA; 29.5μl nuclease-free water; 2.5μl DMSO; lμl KOD Hot Start polymerase
PCRs wre run using the following conditions: (1) 940C, 5 min; (2) 940C, 1 min; (3) 590C, 1 min 30 sec; (4) 680C, 1 min 30 sec; (5) 68°C, 10 min; (6) 8°C, hold. 40 cycles of steps 2-4 were carried out and the PCR products purified using QIAquick PCR Purification Kit (Qiagen Ltd, Boundary Court, Gatwick Road,
Crawley, West Sussex, RHlO 9AX, UK) according to the manufacturers instructions. The purified PCR products were examined on agarose gels. cDNA fragments were then cloned into the pET30 Ek/LIC vector (Novagen), transformed into Nova Blue chemically competent E. coli cells, and plated on to a pre-warmed kanamycin (+) selection plate. After an overnight incubation at 37°C, kanamycin-resistant colonies were selected and grown up in kanamycin-containing LB medium (30μg/ml). Plasmid DNA was isolated using the Plasmid Mini Kit (Qiagen). Confirmation of the presence and correct sequence and orientation of the inserts was determined by restriction analysis, PCRs and sequencing of the construct. Purified plasmid DNA with of the correct sequence and orientation was transformed into chemically competent BL21 Star (DE3) One Shot E. coli cells and grown overnight at 37°C. 6 ml of an overnight culture were used to inoculate 200 ml of LB, 30 μg/ml kanamycin, and the cultures incubated at 37°C, 220 rpm until the cell density reached an optical density of 0.5-0.7 (approximately 2 hours). Expression of the recombinant protein was then induced with IPTG (0.5 mM) for 20 hours at 200C. Bacteria were harvested by centrifugation at 3500g for 10 minutes and the pellets lysed in lysis buffer (24 ml Bugbuster, Novagen; 24 μl Benzonase, Novagen; 0.4 μl rLysozyme, Novagen; and 1200 μl IM imidazole) for 20 minutes with mixing at room temperature. Cell debris was then removed by centrifuging the sample at 1600Og for 20 minutes at 4°C and the supernatant, containing soluble protein, removed to a clean tube. Supernatant was added to pre-washed Ni-NTA resin at a concentration of approximately 25 mg protein per ml of resin and allowed to bind for 1 hour at 4°C with mixing. Protein/resin mix was then poured into a large disposable plastic column, washed twice in 7.5 ml wash/bind buffer (2.5 ml IM Na2HPO4 pH8.0, 6.25 ml 4 M NaCl, 1 ml 1 M imidazole pH8.0, 0.5 ml 10% Tween 20, made up to 50 ml with dH2O) and then eluted in 6.5 ml elution buffer (1 ml 1 M Na2HPO4, pH8.0, 2.5 ml 4 M NaCl, 5 ml 1 M imidazole pH8.0, 200μl 10% Tween 20, 200 μl protease inhibitor cocktail III, made up to 20 ml with dH2O). The presence of purified ILV34 protein in the eluate was confirmed by SDS-PAGE, and the eluate was then desalted using PDlO columns equilibrated with buffer containing 50 mM Tris-HCl, 10 mM MgCl2, pH 8.0. Aliquots were stored at -80°C.
6.2 Expression of recombinant ILV1352
Constructs encoding full-length and truncated versions of ILV 1352 (Table II) were produced. The truncated version lacks the first 84 base pairs of the ILV1352 DNA sequence. Primers were designed to encode the 5' and 3' ends of the coding sequence, with the addition of bases necessary to anneal with the pET-30 Ek/LIC vector; (5' primer, full length, SEQ ID No. 68; 5' primer truncated, SEQ ID No. 69; common 3' primer, SEQ ID No. 70). PCRs were carried out using the following reaction mixture and conditions. All reagents were present in the KOD Hot Start
DNA Polymerase kit (Novagen); 2.5 μl 1Ox PCR buffer, 2.5 μl dNTPs (2 mM), 1 μl MgSO4 (25 mM), 1.5 μl each primer (5 pmol/μl), 1 μl template cDNA, 15 μl nuclease-free water, 0.5 μl KOD Hot Start polymerase.
PCRs were run using the following conditions: (1) 95°C, 5 min; (2) 950C, 1 min; (3) 560C, 1 min 30 sec; (4) 680C, 2 min 30 sec; (5) 68°C, 10 min; (6) 8°C, hold. 45 cycles of steps 2-4 were carried out and the PCR products purified using QIAquick PCR Purification Kit (Qiagen Ltd, Boundary Court, Gatwick Road, Crawley, West Sussex, RHlO 9AX, UK) according to the manufacturers instructions. The purified PCR products were examined on agarose gels. cDNA fragments were then cloned into the pET30 Ek/LIC vector (Novagen), transformed into Nova Blue chemically competent E. coli cells, and plated on to a pre-warmed kanamycin (+) selection plate. After an overnight incubation at 370C, a kanamycin-resistant colony was selected and grown up in kanamycin-containing LB medium (30 μg/ml). A glycerol stock was produced from the culture, the remains of which were used to purify plasmid DNA using Qiagen's Plasmid Mini Kit. Confirmation of the presence and correct" sequence and orientation of the inserts was determined by PCR and sequencing of the construct. Purified plasmid DNA was transformed into chemically competent BL21 Star (DE3) One Shot E. coli cells.
Preliminary studies showed that recombinant truncated ILV 1352 accumulated in inclusion bodies. The inclusion bodies were purified and truncated ILVl 352 solubilised and re- folded as follows: The glycerol stock, produced from BL21 cells containing truncated ILV1352 in pET30 Ek/LIC, was used to inoculate 10 ml LB, 30 μg/ml kanamycin broth. The broth was incubated overnight at 370C, with shaking at 220 rpm. The culture was added to 90 ml LB kanamycin broth and incubated at 370C, until the OD600 had reached between 0.4-1.0 (approximately 1.5 hr). At this point, IPTG (0.1 mM) was added to the culture which was incubated at 3O0C for 5 hr. Cells were harvested by centrifugation at 8,500 rpm for 10 min. Pellets were resuspended in Bugbuster Master Mix (5 ml per 100 ml culture) and incubated at room temperature for 20 min with shaking.
The cell suspension was centrifuged at 11,000 rpm for 20 min at 40C. After removal of the supernatant, the pellet was resuspended in 5 ml Bugbuster Master Mix. Six volumes (30 ml) 1:10 Bugbuster Protein Extraction Reagent was added to the cell suspension and inclusion bodies were collected by centrifugation at 6,000 rpm for 15 min at 40C.
The supernatant was removed and inclusion bodies were resuspended in 50 ml 1:10 Bugbuster reagent. The cell suspension was centrifuged at 6,000 rpm for 15 min at 40C. This wash step was repeated two further times, but in the final step centrifugation speed was increased to 1 l,000rpm. The final pellet of purified inclusion bodies was resuspended in a 0.1 culture volume (10 ml) of Ix IB Wash Buffer (Novagen). Inclusion bodies were collected by centrifugation at 8,500 rpm for 10 min. The pellet was resuspended in 0.1 culture volume of Ix IB Wash Buffer and inclusion bodies were collected by centrifugation at 8,500 rpm for 10 min. The supernatant was removed and the inclusion bodies were resuspended in Ix IB Solubilisation Buffer plus 0.3% N-lauroylsarcosine and ImM DTT (Novagen) at a concentration of 10 mg/ml. The sample was mixed gently, incubated for 15 min at room temperature and centrifuged at 8,500 rpm for 10 min. The solubilised fraction of ILV1352 was dialyzed against three changes of neutral pH buffer (2OmM Tris- HCl, pH 8.5) + O.lmM DTT using a Slide- A-Lyzer 7K MWCO dialysis cassette (Pierce), with DTT omitted from the final dialysis step. Example 7. Assays for the identification of inhibitors
7.1 Biochemical assays for the identification of inhibitors
Recombinant proteins can be assayed using an assay type specific for the particular protein. For example:
Endonucleases can be asayed by incubating the protein with DNA, such as Lamdba or pBR322, and observing whether the DNA is cleaved by running on an agrose gel.
Exonucleases can be assayed by incubating the protein with fluorescently or radio-labelled DNA, such as Lamdba or pBR322, and observing whether the fluorescent or labelled nucleotides are released. Exoribonucleases can be assayed by incubating the protein with fluorescent or radiolabeled RNA and observing whether fluorescent or labeled ribonucleotides are released.
GPCRs (G-protein coupled receptors) can be assayed by incubating radiolabeled ligand with GPCR membrane fractions derivatised with FlashBlue™ beads (Perkin Elmer) and measuring emitted light. GPCR membrane fractions are prepared from cells expressing, or over-expressing the GPCR of interest.
ILV3/ILV3/dihydroxyacid dehydratases can be assayed by measuring the formation of the keto acid reaction products, either directly, at 313 nm, or by derivatising the ketone with 2,4-dinitrophenyl hydrazine and measuring at 530 nm. Kinases can be assayed by incubating the kinase with [32P]-ATP and substrate, and measuring the incorporation of 32P-label into the substrate. Suitable substrates may include myelin basic protein, glycogen synthase and enolase. Alternatively, fluorescence quencing technology such as QTL Lightspeed ™ kinase assays (QTL biosystems, Reigate, Surrey) can be used. Phosphatases can be assayed by incubating the protein with [32P]-ATP-labelled substrate and measuring the release of 32P-label. Suitable substrates may include myelin basic protein, glycogen synthase and enolase. Alternatively, phosphatase assays exploiting fluorescence quenching technology, such as IQ phosphatase assays (Pierce, Cramlington, Northumberland) can be used.
Phosphatididylinositol-specific phospholipase Cs can be assayed by using the chromogenic substrate 5 -bromo-4-chloro-3-indoxyl-myoinositol-l -phosphate or the fluorogenic substrate 4-methylumbelliferyl-myo-inositol-l -phosphate (Restaino et al., 1999, J. FoodProt. 62, 244-251 ; Reissbrodt, 2004, Int. J. Food Microbiol. 15, 1-
9).
Phosphodiesterases such as 3 '5' cyclic nucleotide phosphodiesterases can be assayed as described by Wera et al. (FEBS Lett. 1997, 420, 147-150) by following the time-dependent degradation of cAMP. Samples and controls are incubated in 50 niM Tris-HCl (pH 8), 0.1 mMEDTA, and 500 mM cAMP at 30°C. The reaction is stopped by heating, and cAMP is measured using the cAMP [3H] assay system (Amersham, Arlington Heights, IL).
Protein tyrosine phosphatases can be assayed using substrate protein (such as myelin basic protein) where the tyrosines have been labelled with [32P], and measuring released label after incubation with the enzyme. Alternatively, the nonradioactive ProFluor™ assay kit (Promega) can be used.
These assays are modified for the identification of an inhibitor by including a candidate substance in the incubation and measuring the extent to which the enzyme activity is inhibited.
7.2 Genetic screen for the identification of inhibitors
In the case of proteins for which a function is not known or obvious, inhibitors can be identified using a generic genetic screen. Heterozygous knock-out mutants are generated, for instance as described in Example 2. In most this should result in less gene product being made by the heterozygote than the wild type diploid. If the gene is essential for growth then the heterozygote should be more sensitive to a compound that targets the product of that gene. This phenomenon is called haploinsufficiency and has been demonstrated in yeast (Genomic profiling of drug sensitivities via induced haploinsufficiency. Giaever G, Shoemaker DD, Jones TW, Liang H,
Winzeler EA, Astromoff A, Davis RW. Nat Genet. 1999 21:278-83.)
The primary screen for genes of unknown function involves monitoring the growth of the heterozygous mutant versus the growth of the wild type diploid strain of Aspergillus fumigatus, in the presence and absence of a panel of compounds. Spore suspensions of these strains are set up in RPMI 1640 medium in 96-well plates. 1x104 cfu/ml is the inoculum used. Potential inhibitors are added to give a final concentration of 32 μg/ml. The plates are then incubated at 370C for 48h. The OD485 of the cultures is then measured using a plate reading spectrophotometer.
If both heretozygote and wild-type are unaffected no further work is carried out on the compound. If there is (a) growth of the wild type but no growth of the heterozygote, or (b) no growth of both strains, the Minimal Inhibitory Concentration (MIC) for the compound in each strain is determined as follows: The heterozygote mutant and the wild type diploid are incubated in the presence of a range of concentrations of the chemical. The lowest concentration of chemical that prevents growth of the organism (the Minimal Inhibitory Concentration, MIC) is calculated for both strains. Doubling dilutions of the compound of interest are prepared in RPMI 1640 medium in 96-well plates starting at 50 μg/ml down to 0.1 μg/ml in duplicate. Each well is inoculated with either wild type or mutant Aspergillus fumigatus and the plate incubated at 370C for 24/48h prior to measuring the OD485.
An inhibitor of the product of the gene of unknown function will have a lower MIC in the mutant strain than in the wild type strain, i.e., a 2-fold or more difference in MIC between the 2 strains. This anti-fungal compound can then be used as the basis for chemistry approaches to improve the specificity, potency and other properties of the compound.
7.3 ILV3 Assay The assay for ILV34 is based upon the ability of this enzyme to dehydrate dihydroxyacid substrates to a keto acid. The natural substrates are 2,3- dihydroxy-3- methylbutyrate and 2,3- dihydroxy-3-ethylbutyrate; an alternative substrate which is commercially available is L-threonic acid. The appearance of the keto acid product can be monitored directly at 240 run; alternatively it can be reacted with semicarbazide and sodium acetate and monitored at 250 nm. The semicarbazide/sodium acetate effectively stops the enzymatic reaction and develops it giving an increased absorbance, which is stable for at least 24 hours (Kanamori and Wixom, 1963, J. Biol. Chem. 238:998-1005; Kiritani and Wagner, 1970, Meth. Enzymol. 17:755-764; Limberg et al., 1995, Bioorg. Med. Chem. 3:487-494).
Assays were carried out in 96- or 384-well plates. To each well of a 384- well plate was added 0-8000 ng recombinant truncated ILV34 and 25 μl 0-5OmM threonate (dissolved in 50 raM Tris-HCl, 10 mM MgCl2, pH8.0), and the volume made up to 50 μl with 50 mM Tris-HCl, 10 mM MgCl2 (pH8.0). Samples were incubated at room temperature and at suitable intervals the reaction was stopped and developed by the addition of 25 μl semicarbazide solution (1.26% w/v semicarbazide in 1.89% w/v sodium acetate solution). The samples were incubated for 15 mins after the final semicarbazide/sodium acetate addition and then read at 250nm.
Rate of reaction (change in absorbance per minute) was linear over different ILV34 concentrations but became saturated at high substrate concentrations. ILV34 had a Km of approximately 10 mM for threonate, and was most active at pH 8.0. Magnesium ion concentration had no effect on ILV34 activity in the range 50 μM-10 mM. An inhibitor of ILV34, 2-hydroxy-3-methylbutyric acid (Sigma 219835), was tested and the IC5ofound to be approximately 10 mM.
7.4 High-throughput screen for the identification of ILV34 inhibitors
Screens for inhibitors of ILV34 were based on the assay described above. The screen described is for a 384 format but the protocol can be adapted to run 1536 or other formats as required.
Compounds to be tested were dissolved in 100% DMSO, diluted in water and loaded into 384 square well polystyrene plates (eg. 'Greiner bio-one' UV-Star 384 Microplates; lOμl/well). The final DMSO concentration in all assay wells was 5%v/v.
The substrate, L-threonic acid (hemicalcium salt [Aldrich 380644-5G]; 2OmM in 62.5 mM Tris-HCl, 12 mM MgCl2 pHδ.O) was prepared prior to use on the day of the screen. The solution was sonicated at room temperature until clear, a glass rod was used to crush material which was slow to dissolve. The final concentration of L-threonic acid in the assay wells was 8mM.
The stop/signal amplification reagent (semicarbazide HCl [Aldrich S220-1]; sodium acetate, anhydrous [BDH 301045M]); 1.26% w/v semicarbazide, 1.89% w/v sodium acetate in deionised water) was also prepared prior to use on the day of the screen.
Recombinant ILV34 enzyme prepared as described above was made up in 62.5 mM Tris-HCl, 12 mM MgCl2 buffer (pH8.0). The final buffer concentration in the assay was 50 mM Tris-HCl, 9.6 mM MgCl2 buffer (pHδ.O).
Assays were carried out using Tecan Freedom, Tecan TeMo and PerkinElmer Minitrak robots together with a ThermoLabsystems multidrop 384 and a Tecan Safire automated plate reader. 20 μl of enzyme (typically around 2 μg/well, depending on specific activity of the batch) followed by 20 μl L-threonic acid solution were added to wells of the microtitre plates containing test compounds. 20 μl of 62.5 mM Tris-HCl, 12 mM MgCl2 buffer (pH8.0) was used for a duplicate set of plates (i.e. for background no- enzyme controls); DMSO (diluted in the same way as solubilised compound stocks) was used for no-compound controls. Plates were incubated at room temperature for 40 minutes after which 25 μl of stop/amplification reagent was added. After 15 minutes at room temperature plates were read at 250 nm and data processed using Excel spreadsheets to convert raw data into percent inhibition data.
The kinetics of the screen over the incubation time were such that reaction progress curves were both linear with time and protein concentration. The Z' value for the screen was equal to 0.83 and thus fully acceptable (Zhang et al., 1999, J. Biomolecular Screening, 4, 67-73). Consistency of signal between wells on plates, plate to plate, and screen run to screen run were also acceptable for an HTS regime.
Secondary screens can be carried out to measure dose response data for selected compounds, using essentially the same protocol as the pimary screen. The secondary screen uses the Excelfϊt version 3 software (IDBS), with sigmoidal model 606, to plot appropriate inhibition values and determine IC50 data for compounds.
ILVl 352 can be assayed using a similar assay to that employed for ILV34, and ILV1352 inhibitors are identified in a similar way to ILV34 inhibitors. Compounds identified as inhibitors from the ILV34 assay can be tested in a similar assay using recombinant ILV 1352 (or vice versa) and compounds showing inhibition in both assays are candidates for antifungal agents. Alternatively, compounds showing inhibition of one of the ILV3 proteins may be ILV3 inhibitors.
Example 8. Production of an antibody Recombinant protein may be used as an immunogen, (as described in Example 6). Alternatively, synthetic proteins or polypeptides encoding regions either unique to the individual proteins, or likely to provide cross-reactivity within a set of homologs are used. Peptides may need to be conjugated to carrier proteins before immunization. Preimmune sera from animals to be immunised are screened against the immunogen to ensure that there is no endogenous cross reactivity. Animals (typically sheep, rabbits or mice) are then immunised. For polyclonal antibody production, the resulting sera is affinity purified using the immunogen cross-linked to a chromatography matrix. Alternatively, purification of the antibody fraction from the serum, e.g. using protein G or protein A cross-linked to a matrix, may be sufficient. Monoclonal antibody production proceeds by methods familiar to those skilled in the art.
The specificities of the resulting polyclonal and/or monoclonal antibodies are checked by ELISA and/or western blotting using the immunogen, related constructs or whole cell lysates and extracts as targets. Negative controls, such as paralogous proteins, different constructs or different species are also employed to test specificity and/or to determine the range of species and/or genus cross-reactivity.
The reader's attention is directed to all papers and documents which are filed concurrently with or previous to this specification in connection with this application and which are open to public inspection with this specification, and the contents of all such papers and documents are incorporated herein by reference.
All of the features disclosed in this specification (including any accompanying claims, abstract and drawings), and/or all of the steps of any method or process so disclosed, may be combined in any combination, except combinations where at least some of such features and/or steps are mutually exclusive.
Each feature disclosed in this specification (including any accompanying claims, abstract and drawings), may be replaced by alternative features serving the same, equivalent or similar purpose, unless expressly stated otherwise. Thus, unless expressly stated otherwise, each feature disclosed is one example only of a generic series of equivalent or similar features.
The invention is not restricted to the details of the foregoing embodiment(s). The invention extends to any novel one, or any novel combination, of the features disclosed in this specification (including any accompanying claims, abstract and drawings), or to any novel one, or any novel combination, of the steps of any method or process so disclosed.
Sequence Listing
SEQ ID No. 1
ATGCGATCCTTTCCTCTTGTCCTTGCCGCAGGCATTCCTGCCCTGGCTGCCACCGCCGGCGAGTTCAA CATACTCGCCTTGAACGTCGCAGGACTCCCCCCCATCCTGAACGGCAACGACGTGCCCGGCGACAAGT CCGACAACTCTCGGCAGATTGGCAGGAAATTCGCCGAGTACGGATACGACGTGATTCATGTGCAGGAG GTGAGTCCGTCCCTGCGATCGCCCTGATCTCGCTGACGCAACAGGACTTCAACTACCACGCGTACATC TACGAAACCGACAACCATCCCTACCGCACTCCCACGTCGGGCGGCGCGGGGATCGGTTCCGGCTTGAA TACGCTCTCCAACTTTGAATTCACCAACTTTGTGCGGACCAAGTGGGCGACCTGCTCGAATGCCGAGG GGGCCGACTGTCTCACCCCCAAGGGGTTCACCTCGATGCGGGTGCGCGTTGAGGAGGGGGTCTACGTG GATTTTTACAATCTCCACGCGGACGCCGGGTATTTCTTTCCTGCCTTCGTCCGTTCTGTAGCAAAGCT GACTTGCACAGATCCAAAGACGACGATGTCAAGGCTCGCAGCGCCAATCTGCAGCAGCTGGCCGATTA CATCAAGGTCAACTCGGCCGGCAATGCGGTGCTGGTGTTTGGCGACACCAATGCACGGTATACAAGGA CTGGCGACAATATCCGTGTCTTCCAGACGCAGAATGGCATGGTCAACCCGTGGGTGGAGCTGATTCTC CAGGGAGCCGCGCCGGCCGAGGGGAGCAATGCCTTGCTCTGCCAGAATCCTAGCACAACCAGTGACTG TGAAACGGTCGATAAGATCTTGTAACCGATCCCCGAGCCGTTTGTGACTGCACGACAGGAGGGCTGAC GGGAATTAGCTACCGGGGAAGCCGTGCGGTCGACCTCAGGGCCGTCTTCTGGAACTACGAGAGCAACA AGTTCCTCAGCGACAAGGGGACGATCCTCTCCGATCACAATCCCATCACGACCAACTTCACCTGGACC TTGTCCAATGCCTTCCGCCAGAGTGACATCTTCGGTGGCCCACACGGGTAAGGCAGATTTTTTACATC AGAAACTCCCAACGCGGCCAGCTAACACAAGAAGCAGAACCTGGTTTAACGACCTCGACTCCCTCCCT ACGGCGTTCACCGGCCAAAACAAGCCAAGCAAGATCTCCCTGCGCGGTGCCGAGCGACTCGACAGCGT CGGCCTCACCGTGGCCTCGGGCAAGAGCTACACCCACGGGGGCACCGGGGGCCAGCTCTCGGAACTCC CCCTCGCGGCCAACGAGTACTGGACCAAGGCAAAGCTGTGCCAGGGGCAGTACCGCGGCCACACGCGC AACTTCTACCTGCTGGCGACGACGAGCAGCGGCCGGACGGTGAGTGCGGGGACGGCGACGTCAGACTG CAAGGAGTTTGCGGCGGCGCCAGGGTGGCAGATTGTGGGATTCTACGGGCAGGACGGGGACGAGATTG ATCAGCTGGGATTTCTCTACGGGTTGATT
SEQ ID No. 2 ATGCGATCCTTTCCTCTTGTCCTTGCCGCAGGCATTCCTGCCCTGGCTGCCACCGCCGGCGAGTTCAA CATACTCGCCTTGAACGTCGCAGGACTCCCCCCCATCCTGAACGGCAACGACGTGCCCGGCGACAAGT CCGACAACTCTCGGCAGATTGGCAGGAAATTCGCCGAGTACGGATACGACGTGATTCATGTGCAGGAG GACTTCAACTACCACGCGTACATCTACGAAACCGACAACCATCCCTACCGCACTCCCACGTCGGGCGG CGCGGGGATCGGTTCCGGCTTGAATACGCTCTCCAACTTTGAATTCACCAACTTTGTGCGGACCAAGT GGGCGACCTGCTCGAATGCCGAGGGGGCCGACTGTCTCACCCCCAAGGGGTTCACCTCGATGCGGGTG CGCGTTGAGGAGGGGGTCTACGTGGATTTTTACAATCTCCACGCGGACGCCGGATCCAAAGACGACGA TGTCAAGGCTCGCAGCGCCAATCTGCAGCAGCTGGCCGATTACATCAAGGTCAACTCGGCCGGCAATG CGGTGCTGGTGTTTGGCGACACCAATGCACGGTATACAAGGACTGGCGACAATATCCGTGTCTTCCAG ACGCAGAATGGCATGGTCAACCCGTGGGTGGAGCTGATTCTCCAGGGAGCCGCGCCGGCCGAGGGGAG CAATGCCTTGCTCTGCCAGAATCCTAGCACAACCAGTGACTGTGAAACGGTCGATAAGATCTTCTACC GGGGAAGCCGTGCGGTCGACCTCAGGGCCGTCTTCTGGAACTACGAGAGCAACAAGTTCCTCAGCGAC AAGGGGACGATCCTCTCCGATCACAATCCCATCACGACCAACTTCACCTGGACCTTGTCCAATGCCTT CCGCCAGAGTGACATCTTCGGTGGCCCACACGGCAGAACCTGGTTTAACGACCTCGACTCCCTCCCTA CGGCGTTCACCGGCCAAAACAAGCCAAGCAAGATCTCCCTGCGCGGTGCCGAGCGACTCGACAGCGTC GGCCTCACCGTGGCCTCGGGCAAGAGCTACACCCACGGGGGCACCGGGGGCCAGCTCTCGGAACTCCC CCTCGCGGCCAACGAGTACTGGACCAAGGCAAAGCTGTGCCAGGGGCAGTACCGCGGCCACACGCGCA ACTTCTACCTGCTGGCGACGACGAGCAGCGGCCGGACGGTGAGTGCGGGGACGGCGACGTCAGACTGC AAGGAGTTTGCGGCGGCGCCAGGGTGGCAGATTGTGGGATTCTACGGGCAGGACGGGGACGAGATTGA TCAGCTGGGATTTCTCTACGGGTTGATT SEQ ID No. 3
MRSFPLVLAAGIPALAATAGEFNILALNVAGLPPILNGNDVPGDKSDNSRQIGRKFAEYGYDVIHVQE DFNYHAYIYETDNHPYRTPTSGGAGIGSGLNTLSNFEFTNFVRTKWATCSNAEGADCLTPKGFTSMRV RVEEGVYVDFYNLHADAGSKDDDVKARSANLQQLADYIKVNSAGNAVLVFGDTNARYTRTGDNIRVFQ TQNGMVNPWVELILQGAAPAEGSNALLCQNPSTTSDCETVDKIFYRGSRAVDLRAVFWNYESNKFLSD KGTILSDHNPITTNFTWTLSNAFRQSDIFGGPHGRTWFNDLDSLPTAFTGQNKPSKISLRGAERLDSV GLTVASGKSYTHGGTGGQLSELPLAANEYWTKAKLCQGQYRGHTRNFYLLATTSSGRTVSAGTATSDC KEFAAAPGWQIVGFYGQDGDEIDQLGFLYGLI
SEQ ID No. 4 ATGAAGGGTCCCACGGGAGGTCCTCGTGAAGACAGCCTCACTGGACTTCTTGTCCGATCAACGTCGAC GAACTGGTCGACGAATTCGGTGATTGCGGTTGACGCGGGTACCCTTGTTTCAGGGATTATTCACACTC TGGAGCAATACAATGCGGAATTAAGAGATGGCTCGTATATGATGAATGAAGGGCCTTTTGCCGGCCTC AGAATGCCCTACAAGTCTGCTCAAGCCAATGCTGCACATATTTTCCGAGACATCATTGGAGCTGTCCT GATCACACATGCTCACTTGGATCACCTGTCCGGCCTGGCGATCAACACGCCCATGCTTGAAGCAGGAA ACGGGCCCAAACCTTTGGCCGCTCTACCGTCAATTGTCGCTGCCATAAAAAACCATATGCTTAACGAC GTGATCTGGCCAAATCTGTCCGATGAAGATGGCGGGGCGGGTCTACTGACTTATCAACGTCTTGTTGA AGGGGGGAATCCAAGGTTCGGGCGTGGAGATGCGAAGGGGTATATCCGAGCCTGCGACGGTCTTCTAG CCCGGTGCCTCGGCGTCAGTCATGGACGATGCAGGCAACGCTTTCACCCGGAGTCTGGTACTCACCAC CGCGTGGGTAGCTCAGTCTTTGCTGCTGATCCACTGATGCTGCCTTCCAGAGCTATATCCGTTGATCA CTCTACGGACGCAGGGTAAGTCTTCCACACTGGAACGCGATTGGAATTCACTGACATTGAAACTATTT GAAGGATATATTCTCCAGCTCGCTCTCCCCGAATGCACCCAGCAAACACGAAAGATCCTATTTGGGCA ACAGTGGAGAGCTCTGCATTCTTCATCCGTGATCACCATACTGGCAATGAGATTATTATCTTCGGTGA CGTCGAACCCGACTGTGTCTCGCTTGACCCACGCAATAAACGCGTCTGGGAGGCAGCAGCACCGAAAA TTGCGACCGGTAAATTGCGCGCCATCTTCATTGAATGCTCATATGACGATTCTGTCGAGGATGCAACT TTATACGGTCATCTATGTCCGCGACACCTAATTGCGGAACTGACGTTGCTGGCCGGGAAGGTTATGGA AGCCCGGCACCCGCGCATGGCGATATCGAGTATCGGAAAGCGTAAACACTCAGACTCTAACAGGTATG GCGGCAGTGGTGGCCAAGTGAGCCCGAAATCCAAGCGTTTCCAAAGCTTCTCCGTTGCAGCTGGCAAG AGGGCCGACATGGATTCTGTTACGGCCGACATGCGACCACGATTCTACTCAACCGCGGAAGGCTTGGC TGAGATCCCTGAGCTGGTTCATGACGAGCCGACACCGAGCTACCACGAGCCTGACGATGTAAGCGATA CTGAAAATTCGGTGGATCCTGCACCACAGTCCACCAGCCCTGGAGAAGCTCAGACCAGTGACTCTGGG CAGCTTCCGCTGTCGGGTTTATCTATCTACATCATACACATCAAAGAAGATCTGACCGATGATATCCC TCCCCGAGAACGAATTTTACAACAGCTTCGATCCCGAGGCGAAGCCGCTAGACTAGGCTGCGAGTTCT ACGCTCCCCATCGCGGAGAAGGCATTTGGGTTTGA
SEQ ID No. 5
ATGAAGGGTCCCACGGGAGGTCCTCGTGAAGACAGCCTCACTGGACTTCTTGTCCGATCAACGTCGAC GAACTGGTCGACGAATTCGGTGATTGCGGTTGACGCGGGTACCCTTGTTTCAGGGATTATTCACACTC TGGAGCAATACAATGCGGAATTAAGAGATGGCTCGTATATGATGAATGAAGGGCCTTTTGCCGGCCTC AGAATGCCCTACAAGTCTGCTCAAGCCAATGCTGCACATATTTTCCGAGACATCATTGGAGCTGTCCT GATCACACATGCTCACTTGGATCACCTGTCCGGCCTGGCGATCAACACGCCCATGCTTGAAGCAGGAA ACGGGCCCAAACCTTTGGCCGCTCTACCGTCAATTGTCGCTGCCATAAAAAACCATATGCTTAACGAC GTGATCTGGCCAAATCTGTCCGATGAAGATGGCGGGGCGGGTCTACTGACTTATCAACGTCTTGTTGA AGGGGGGAATCCAAGGTTCGGGCGTGGAGATGCGAAGGGGTATATCCGAGCCTGCGACGGTCTTCTAG CCCGGTGCCTCGGCGTCAGTCATGGACGATGCAGGCAACGCTTTCACCCGGAGTCTGGTACTCACCAC CGCGTGGCTCGCTCTCCCCGAATGCACCCAGCAAACACGAAAGATCCTATTTGGGCAACAGTGGAGAG CTCTGCATTCTTCATCCGTGATCACCATACTGGCAATGAGATTATTATCTTCGGTGACGTCGAACCCG ACTGTGTCTCGCTTGACCCACGCAATAAACGCGTCTGGGAGGCAGCAGCACCGAAAATTGCGACCGGT AAATTGCGCGCCATCTTCATTGAATGCTCATATGACGATTCTGTCGAGGATGCAACTTTATACGGTCA TCTATGTCCGCGACACCTAATTGCGGAACTGACGTTGCTGGCCGGGAAGGTTATGGAAGCCCGGCACC CGCGCATGGCGATATCGAGTATCGGAAAGCGTAAACACTCAGACTCTAACAGGTATGGCGGCAGTGGT GGCCAAGTGAGCCCGAAATCCAAGCGTTTCCAAAGCTTCTCCGTTGCAGCTGGCAAGAGGGCCGACAT GGATTCTGTTACGGCCGACATGCGACCACGATTCTACTCAACCGCGGAAGGCTTGGCTGAGATCCCTG AGCTGGTTCATGACGAGCCGACACCGAGCTACCACGAGCCTGACGATGTAAGCGATACTGAAAATTCG GTGGATCCTGCACCACAGTCCACCAGCCCTGGAGAAGCTCAGACCAGTGACTCTGGGCAGCTTCCGCT GTCGGGTTTATCTATCTACATCATACACATCAAAGAAGATCTGACCGATGATATCCCTCCCCGAGAAC GAATTTTACAACAGCTTCGATCCCGAGGCGAAGCCGCTAGACTAGGCTGCGAGTTCTACGCTCCCCAT CGCGGAGAAGGCATTTGGGTTTGA SEQ ID No. 6
MKGPTGGPREDSLTGLLVRSTSTNWSTNSVIAVDAGTLVSGIIHTLEQYNAELRDGSYMMNEGPFAGL RMPYKSAQANAAHIFRDIIGAVLITHAHLDHLSGLAINTPMLEAGNGPKPLAALPSIVAAIKNHMLND VIWPNLSDEDGGAGLLTYQRLVEGGNPRFGRGDAKGYIRACDGLLARCLGVSHGRCRQRFHPESGTHH RVARSPRMHPANTKDPIWATVESSAFFIRDHHTGNEIIIFGDVEPDCVSLDPRNKRVWEAAAPKIATG KLRAIFIECSYDDSVEDATLYGHLCPRHLIAELTLLAGKVMEARHPRMAISSIGKRKHSDSNRYGGSG GQVSPKSKRFQSFSVAAGKRADMDSVTADMRPRFYSTAEGLAEIPELVHDEPTPSYHEPDDVSDTENS VDPAPQSTSPGEAQTSDSGQLPLSGLSIYIIHIKEDLTDDIPPRERILQQLRSRGEAARLGCEFYAPH
RGEGIWV SEQ ID No. 7
ATGGTGGCCGAACACTTGACGATTCGCAACCTTACCACAACCCCAATAATCCTCAAGCTCATTGAGCG TTTTCACCCCCATAAGGATCCTCGAGATGACATACATTCCTTGGCCAGGAATTTTACTCGAATTCTAA GCAATGTCACTCGCACCAATGAGACTGTCGCCGCGATTACTGATGACAATGAACCATTTGCTCATGAA GAGTATGATATTCACGTGGAGCCTTTCCAGACAGTCCAGACCGAAGTTCGTGCATTCATCGATTCGGA CAAAGAGCGCCTGCGCTGGACCTTTGAAGCCGAGGGAGAGCGACATCAGATTCAGACACCGGTTCCTA CGACGGAGTCTGCCGGAATGAAGGCGTTGTGTGATAATCCTCGATTTCGCTTCACTGGCGTCTACGTG ACGCCTGAATCCCACCTTGCGATCTACTCCTCGGCCAATCTGCAAGCATGGATGGGAGAGCTCAAAGA TAGCACATTGCTCTCCTCGCTATCCATACCAGGAACACACAATTCTCCTACGTGCCATGTCGCGGCCC CGTCTGTCCGCTGCCAAGCTGTCAGTCCGCGCGAACAGCTTCGAAATGGTGTCCGTTTCTTCGACATC CGTGTGCAGCCCCAGTTCCCAGAGGACCCATCCAAAGATGAGCTTATCCTAGTGCACAGCGTGTTCCC CATCTCCTTGACGGGCAACAAGTACTTTCGCGATCTGATGCGAGACGTTAATGAGTTCCTCAATGAGA ATCCATCCGAGACGCTCATCATCTCTCTGAAGCGAGAAGGCCCAGGAAACCATACCGATGAACAATTG AGTCGGATTGTGCGCGACCATTATGCTCGTCCAGATAGCCGGTGGTACACAGAACCCAAAATCCCTAC TCTGGGCGAGGTGCGCGGGAAAGTCGTCCTGCTTCGCCGGTTCAACATCATTGAAGAGCTAAAGCATG AACACGACGGCCGCGGCTGGGGCATAGACGGAAGCGATTGGGCAGATAATACTCCAAACGCTACCTGC AGCAGTGGTCAACTCTGCATCCAGGACTTCTACGAGGTCCTCGAGACGAAGAACATCGATGTCAAAAT CAAATATGTAACGGAGCACTGTGAGCGCTCCAGCGGGCACTGTTACCCGTTCGGTGCCCTCCCGGATC CCGAAGCTAGCAAAGCGCATCCATTCTATATAAATTTCCTCAGTGCAAGCAACTTCTGGAAGGTGGGG ACGTGGCCCGAGAAGATTGCAGCAAAGCTGAACCCGGCAACTGTTGACTATCTCTGCCGGAGACACAG TCACCCGGACGGTGACTGGTCGACTGGTATTCTTGTTACGGACTGGGTTGGCCACGAAGGCGACTGGG ACCTTGTGCGTTGCATTGTTGGCATGAATGCTAAGCTGAAGATGAGACAGATGAGAGAGGAGCAGGAA CACTAA SEQ ID No. 8
ATGGTGGCCGAACACTTGACGATTCGCAACCTTACCACAACCCCAATAATCCTCAAGCTCATTGAGCG TTTTCACCCCCATAAGGATCCTCGAGATGACATACATTCCTTGGCCAGGAATTTTACTCGAATTCTAA GCAATGTCACTCGCACCAATGAGACTGTCGCCGCGATTACTGATGACAATGAACCATTTGCTCATGAA GAGTATGATATTCACGTGGAGCCTTTCCAGACAGTCCAGACCGAAGTTCGTGCATTCATCGATTCGGA CAAAGAGCGCCTGCGCTGGACCTTTGAAGCCGAGGGAGAGCGACATCAGATTCAGACACCGGTTCCTA CGACGGAGTCTGCCGGAATGAAGGCGTTGTGTGATAATCCTCGATTTCGCTTCACTGGCGTCTACGTG ACGCCTGAATCCCACCTTGCGATCTACTCCTCGGCCAATCTGCAAGCATGGATGGGAGAGCTCAAAGA TAGCACATTGCTCTCCTCGCTATCCATACCAGGAACACACAATTCTCCTACGTGCCATGTCGCGGCCC CGTCTGTCCGCTGCCAAGCTGTCAGTCCGCGCGAACAGCTTCGAAATGGTGTCCGTTTCTTCGACATC CGTGTGCAGCCCCAGTTCCCAGAGGACCCATCCAAAGATGAGCTTATCCTAGTGCACAGCGTGTTCCC CATCTCCTTGACGGGCAACAAGTACTTTCGCGATCTGATGCGAGACGTTAATGAGTTCCTCAATGAGA ATCCATCCGAGACGCTCATCATCTCTCTGAAGCGAGAAGGCCCAGGAAACCATACCGATGAACAATTG AGTCGGATTGTGCGCGACCATTATGCTCGTCCAGATAGCCGGTGGTACACAGAACCCAAAATCCCTAC TCTGGGCGAGGTGCGCGGGAAAGTCGTCCTGCTTCGCCGGTTCAACATCATTGAAGAGCTAAAGCATG AACACGACGGCCGCGGCTGGGGCATAGACGGAAGCGATTGGGCAGATAATACTCCAAACGCTACCTGC AGCAGTGGTCAACTCTGCATCCAGGACTTCTACGAGGTCCTCGAGACGAAGAACATCGATGTCAAAAT CAAATATGTAACGGAGCACTGTGAGCGCTCCAGCGGGCACTGTTACCCGTTCGGTGCCCTCCCGGATC CCGAAGCTAGCAAAGCGCATCCATTCTATATAAATTTCCTCAGTGCAAGCAACTTCTGGAAGGTGGGG ACGTGGCCCGAGAAGATTGCAGCAAAGCTGAACCCGGCAACTGTTGACTATCTCTGCCGGAGACACAG TCACCCGGACGGTGACTGGTCGACTGGTATTCTTGTTACGGACTGGGTTGGCCACGAAGGCGACTGGG ACCTTGTGCGTTGCATTGTTGGCATGAATGCTAAGCTGAAGATGAGACAGATGAGAGAGGAGCAGGAA CACTAA
SEQ ID No. 9 MVAEHLTIRNLTTTPIILKLIERFHPHKDPRDDIHSLARNFTRILSNVTRTNETVAAITDDNEPFAHE EYDIHVEPFQTVQTEVRAFIDSDKERLRWTFEAEGERHQIQTPVPTTESAGMKALCDNPRFRFTGVYV TPESHLAIYSSANLQAWMGELKDSTLLSSLSIPGTHNSPTCHVAAPSVRCQAVSPREQLRNGVRFFDI RVQPQFPEDPSKDELILVHSVFPISLTGNKYFRDLMRDVNEFLNENPSETLIISLKREGPGNHTDEQL SRIVRDHYARPDSRWYTEPKIPTLGEVRGKVVLLRRFNIIEELKHEHDGRGWGIDGSDWADNTPNATC SSGQLCIQDFYEVLETKNIDVKIKYVTEHCERSSGHCYPFGALPDPEASKAHPFYINFLSASNFWKVG TWPEKIAAKLNPATVDYLCRRHSHPDGDWSTGILVTDWVGHEGDWDLVRCIVGMNAKLKMRQMREEQE H
SEQ ID No. 10 ATGGACTCCTCTACCTCCGCGTCGTCCAACACCCACGGCGAAGCCAAGTATATCAACTTCCCCACTCT TCCTGACGACGCAAAGCATGAAGATGGCACCACTGCGCTGAACAGATATTCTTCTTATATCACTCGGG GCCATGACTTTCCTGGTGCTCGGGTTTGTTACCCTCATTGAGCTGTCTATGAGATGTGGGGGCTGACA ATCTGCTCTTGTATGACTGAATAGGCTATGCTTTTTGCAGCGGGGATCCCGGATCGCGAAGCGATGGC TAAGAGCCCACAGGTAGGAATTGCCAGTGTCTGGTGGGAGGGAAATCCTTGTAATATGCATCTGCTGG ACTTGGGCAAGACCGTGAAGAAGGCCGTTACAGATCAGGGTATGATCGGTTGGCAGTATAATACCATT GGAGTTTCAGATGCCATTTCAATGGGTAGTGAGGGTGAGCGTATTCAGGGCTTGAGGAGCATTGCTAC TCTCTGCGTGAGATATTCCTGACGTGTATTCCCAGGCATGAGATTTTCTCTCCAGACGCGTGAGATCA TTGCAGACAGCGTCGAGACTGTGACTTGCGCGCAGTATCATGATGCATGCATTGCAATTCCTGGGTGC GACAAGAATATGCCTGGAGTAGTAATGGGTATGGCCAGACACAATCGGCCTTCGCTTATGATTTACGG TGGAACAATTCAGGTTGGATACTCGAACCTGCTGCGGAAGCGGGTCAACGTGTCGACTTGCTTTGAAG CGGCTGGTGCCTATGCTTATGATACTTTGCGTCAACCGGACGATGGGGGTGACACCAGTAAAAGCAAG GACGAGATTATGGATGACATTGAGAGACATGCTTGTCCCAGTGCGGGTGCATGTGGAGGCATGTTTAC TGCAAACACAATGGCCACGGCGATTGAGTCTATGGGCCTGTCCCTACCAGGGTCATCGTCAACGCCTG CCTCGTCTCCATCGAAGATGCGAGAATGTGTTAAAGCGGCAGAAGCCATCAAGACCTGTATGGAGAAG AACATTAGGCCTCGGGATCTTTTGACCAAGCGCTCCTTCGAGAATGCCCTCGTCATGACGATGGCTCT GGGAGGAAGTACCAATGGTGTCTTGCATTTCCTTGCCATGGCTCGGACGGCGGATGTGAACCTGACCC TAGATGATGTCCAACGGGTCAGCAACAAGATCCCTTTCATTGCTGACTTGGCCCCCAGTGGGAAGTAC TACATGGCAGACCTGTACGATATCGGAGGGATCCCGTCCGTGCAGAAGTTGCTGATCGCGGCGGGACT TCTTGACGGTGACATCCCGACGGTCACCGGCAAGACCTTGGCTGAGAATGTTGCATCTTTCCCATCTC TACCTCAGGACCAAGTCATCATCCGGCCCCTGGACAACCCAATCAAGACGACTGGCCACCTGCAGATT CTACGCGGGAACCTGGCGCCTGGCGGAGCGGTGGCCAAGATCACTGGCAAGGAGGGCACCAAGTTCAC AGGCAAAGCACGTGTTTTCGATAAAGAATATCAGCTCAACGATGCTCTGACCCAAGGCAAGATTCCTC GAGGCGAAAACTTAGTGCTCATCGTCCGCTACGAAGGACCCAAGGGTGGGCCAGGCATGCCGGAGCAG CTCAAAGCGAGCGCGGCGCTGATGGGAGCTAAGCTCAACAATGTGGCCCTAATCACAGATGGAAGATA TTCAGGGGCTAGTCATGGATTCATCGTGGGTCATATCGTCCCAGAAGCTGCGGTCGGAGGGCCCATTG CCATTGTTCGCGATGACGATGTGATCACCATTGATGCGGAAACCAACACGATAAACATGCATGTCTCA GATGAGGAAATCCAGCAGCGACTGAAAGAGTGGAAGCCCCCAGTGCCTCATGTCACACGTGGTGTACT CGCCAAGTATGCAAGGCTGGTTGGGGATGCCTCTCATGGTGCAATGACGGATTTGTTCTAG
SEQ ID No. 11.
ATGGACTCCTCTACCTCCGCGTCGTCCAACACCCACGGCGAAGCCAAGTATATCAACTTCCCCACTCT TCCTGACGACGCAAAGCATGAAGATGGCACCACTGCGCTGAACAGATATTCTTCTTATATCACTCGGG GCCATGACTTTCCTGGTGCTCGGGCTATGCTTTTTGCAGCGGGGATCCCGGATCGCGAAGCGATGGCT AAGAGCCCACAGGTAGGAATTGCCAGTGTCTGGTGGGAGGGAAATCCTTGTAATATGCATCTGCTGGA CTTGGGCAAGACCGTGAAGAAGGCCGTTACAGATCAGGGTATGATCGGTTGGCAGTATAATACCATTG GAGTTTCAGATGCCATTTCAATGGGTAGTGAGGGCATGAGATTTTCTCTCCAGACGCGTGAGATCATT GCAGACAGCGTCGAGACTGTGACTTGCGCGCAGTATCATGATGCATGCATTGCAATTCCTGGGTGCGA CAAGAATATGCCTGGAGTAGTAATGGGTATGGCCAGACACAATCGGCCTTCGCTTATGATTTACGGTG GAACAATTCAGGTTGGATACTCGAACCTGCTGCGGAAGCGGGTCAACGTGTCGACTTGCTTTGAAGCG GCTGGTGCCTATGCTTATGATACTTTGCGTCAACCGGACGATGGGGGTGACACCAGTAAAAGCAAGGA CGAGATTATGGATGACATTGAGAGACATGCTTGTCCCAGTGCGGGTGCATGTGGAGGCATGTTTACTG CAAACACAATGGCCACGGCGATTGAGTCTATGGGCCTGTCCCTACCAGGGTCATCGTCAACGCCTGCC TCGTCTCCATCGAAGATGCGAGAATGTGTTAAAGCGGCAGAAGCCATCAAGACCTGTATGGAGAAGAA CATTAGGCCTCGGGATCTTTTGACCAAGCGCTCCTTCGAGAATGCCCTCGTCATGACGATGGCTCTGG GAGGAAGTACCAATGGTGTCTTGCATTTCCTTGCCATGGCTCGGACGGCGGATGTGAACCTGACCCTA GATGATGTCCAACGGGTCAGCAACAAGATCCCTTTCATTGCTGACTTGGCCCCCAGTGGGAAGTACTA CATGGCAGACCTGTACGATATCGGAGGGATCCCGTCCGTGCAGAAGTTGCTGATCGCGGCGGGACTTC TTGACGGTGACATCCCGACGGTCACCGGCAAGACCTTGGCTGAGAATGTTGCATCTTTCCCATCTCTA CCTCAGGACCAAGTCATCATCCGGCCCCTGGACAACCCAATCAAGACGACTGGCCACCTGCAGATTCT ACGCGGGAACCTGGCGCCTGGCGGAGCGGTGGCCAAGATCACTGGCAAGGAGGGCACCAAGTTCACAG GCAAAGCACGTGTTTTCGATAAAGAATATCAGCTCAACGATGCTCTGACCCAAGGCAAGATTCCTCGA GGCGAAAACTTAGTGCTCATCGTCCGCTACGAAGGACCCAAGGGTGGGCCAGGCATGCCGGAGCAGCT CAAAGCGAGCGCGGCGCTGATGGGAGCTAAGCTCAACAATGTGGCCCTAATCACAGATGGAAGATATT CAGGGGCTAGTCATGGATTCATCGTGGGTCATATCGTCCCAGAAGCTGCGGTCGGAGGGCCCATTGCC ATTGTTCGCGATGACGATGTGATCACCATTGATGCGGAAACCAACACGATAAACATGCATGTCTCAGA TGAGGAAATCCAGCAGCGACTGAAAGAGTGGAAGCCCCCAGTGCCTCATGTCACACGTGGTGTACTCG CCAAGTATGCAAGGCTGGTTGGGGATGCCTCTCATGGTGCAATGACGGATTTGTTCTAG SEQ ID No. 12
MDSSTSASSNTHGEAKYINFPTLPDDAKHEDGTTALNRYSSYITRGHDFPGARAMLFAAGIPDREAMA
KSPQVGIASVWWEGNPCNMHLLDLGKTVKKAVTDQGMIGWQYNTIGVSDAISMGSEGMRFSLQTREII
ADSVETVTCAQYHDACIAIPGCDKNMPGVVMGMARHNRPSLMIYGGTIQVGYSNLLRKRVNVSTCFEA AGAYAYDTLRQPDDGGDTSKSKDEIMDDIERHACPSAGACGGMFTANTMATAIESMGLSLPGSSSTPA
SSPSKMRECVKAAEAIKTCMEKNIRPRDLLTKRSFENALVMTMALGGSTNGVLHFLAMARTADVNLTL
DDVQRVSNKIPFIADLAPSGKYYMADLYDIGGIPSVQKLLIAAGLLDGDIPTVTGKTLAENVASFPSL
PQDQVIIRPLDNPIKTTGHLQILRGNLAPGGAVAKITGKEGTKFTGKARVFDKEYQLNDALTQGKIPR
GENLVLIVRYEGPKGGPGMPEQLKASAALMGAKLNNVALITDGRYSGASHGFIVGHIVPEAAVGGPIA IVRDDDVITIDAETNTINMHVSDEEIQQRLKEWKPPVPHVTRGVLAKYARLVGDASHGAMTDLF
SEQ ID No. 13
ATGGCTAAACCACTTAGTGAGAAGTCTTCTAATAGCTCCCTCAGGGCTAAAGCGAGTGAGTCTTCTTC
CCTTGGTTATCGACCTTTGCAGTGAGCTGTTTACTGATTAGACCGCGTTAACCAGGCGAAAGGACTCG TCTTAGAGAAACATCATTGGAGTCAGAGATCATTCACACTGAAAAGTTGCCTCCCAATTTCGGAGATG TTGTCAAAGGAGTCTATAGAAGCTCTTTCCCGCAACCTTGGCATTTCCAAGCGCTAAAGAAGCTGGGA CTCAGGATGATTGTGTAAGCGGCTGCCATATAAGATCATTACCCCTTTCTTACGCTTGTCCACATAGT ACGTTAGTTGAAGGGGACTACACCCAGGACCACCAAGTCTTTCTCAAAGAGAATGGTATTGAACATCG TCGCATTCTTATCCTGGCAAACAAGGATCCCACGATTCGAACTCCGGACCATGTCGTGAACCGAGTCT TGGAAATCATGCTCAACAAGACCAATCACCCACTCCTTCTACACTGCAACAAGGGAAAGGTGAGCGCA AATGCCGATAATACAAATTCGACCCGACTAACAAGACTTAATAGCATCGGACTGGATGCATTGTCGGC TGCTTTCGGAAGGTTCAGGGCTGGGACATGCCAGCTATTCGCAAGGAATACCTCAATTTTTCGTTGCC GAAATCAAGGCCTCTCGACGAACGATTCATTGAACTTTTCGATGACACCAGACTCGGGCCCCTCGCTG TTTCTTCTGGTGCAAGCTCCTGGCCTGCTGGTGTAATGCTCGATCCGCTTCGCGAAGAAGTAGTCGAG GACGAAAATACCCCG
SEQ ID No. 14
ATGGCTAAACCACTTAGTGAGAAGTCTTCTAATAGCTCCCTCAGGGCTAAAGCGAGTGAGTCTTCTTC
CCTTGGTTATCGACCTTTGCAAGAAACATCATTGGAGTCAGAGATCATTCACACTGAAAAGTTGCCTC CCAATTTCGGAGATGTTGTCAAAGGAGTCTATAGAAGCTCTTTCCCGCAACCTTGGCATTTCCAAGCG CTAAAGAAGCTGGGACTCAGGATGATTGTTACGTTAGTTGAAGGGGACTACACCCAGGACCACCAAGT CTTTCTCAAAGAGAATGGTATTGAACATCGTCGCATTCTTATCCTGGCAAACAAGGATCCCACGATTC GAACTCCGGACCATGTCGTGAACCGAGTCTTGGAAATCATGCTCAACAAGACCAATCACCCACTCCTT CTACACTGCAACAAGGGAAAGCATCGGACTGGATGCATTGTCGGCTGCTTTCGGAAGGTTCAGGGCTG GGACATGCCAGCTATTCGCAAGGAATACCTCAATTTTTCGTTGCCGAAATCAAGGCCTCTCGACGAAC GATTCATTGAACTTTTCGATGACACCAGACTCGGGCCCCTCGCTGTTTCTTCTGGTGCAAGCTCCTGG CCTGCTGGTGTAATGCTCGATCCGCTTCGCGAAGAAGTAGTCGAGGACGAAAATACCCCG
SEQ ID No. 15 MAKPLSEKSSNSSLRAKASESSSLGYRPLQETSLESEIIHTEKLPPNFGDVVKGVYRSSFPQPWHFQA LKKLGLRMIVTLVEGDYTQDHQVFLKENGIEHRRILILANKDPTIRTPDHVVNRVLEIMLNKTNHPLL LHCNKGKHRTGCIVGCFRKVQGWDMPAIRKEYLNFSLPKSRPLDERFIELFDDTRLGPLAVSSGASSW PAGVMLDPLREEVVEDENTP SEQ ID No. 16
ATGGACACCCCCGATCTCAAATGCACCCATCTCACCTCACCAGACTATATGAACTTCGTCCTGTCTAT GTACAACCCTGACGTTGCCGCAATAGCTCTCGTGAACTTCAACTAACGAACACAGTCTTATCATATTT GGGATTCTCCTGTCGTATCTCCCTCAGCACATTCGCATAATCAACCTCAAAAGCTCCTTCGGGATCTC CCCATACTTCGTCCTCCTTGGAACCACATCAGGTACCTCGGCGTTGGCAAATGTTGTAACACAACAGC AGAGTTTACATGATGTGGAATGCTGCAAGAACATCAATGGACTAGCTTGCTTTGGGGGACTACTCGGC ATCTTCCAAGTCGGGACACAGTGGCTTTGCTTTGTTATCATGTAAGCTCACCTACTTCTCCGCCTGAT AACATTCACGTGATTAATGCAAGATCGTGTAGTCTGCTTTTGTTCGTTATTTACTTCCCCCGAGCCAC CTCTCCAATCTCGCCTACAGAATCAGAGTCTTCGGCGAGAAATGGCCCATCATATACAACAGCGTTGG TCGTCAGCGGTATTTGTATCCTCCACGCCGTGGTGATGTTCATCACCTCTGCCGCAATCGCGGTCAAC CGGCCATCCCAGCTTCAGGCATGGTCCAACTTTTCAGGGGTGGTTGCTGCCATTCTCGCTTCGATCCA GTATTTCCCTCAGATCTACACTACGTTAAGGTTGCGGTGTGTGGGTAGCCTGAGCATCCCAATGATGT GTATCCAAACCCCGGGAAGTCTTGTGTGGGCAGGTAGTTTAGCTGCACGACTGGGACCAAAAGGATGG AGTACATGGGGCGTGTTGATTGTGACGGCATGCTTACAAGGCACGCTCTTGGCAATGGCCATCTTTTT TGAATATTTTGGGCCCAACAAGCAGCGCAACCATCGCCATGGCAAAGATCTTCCTCCGAACGGTAGTG GAGAAGGCCCAGAGGAAAGAGACCATGAGCAGCCGTCTGAGGAAACGCCACTTCTCCAA
SEQ ID No. 17
ATGGACACCCCCGATCTCAAATGCACCCATCTCACCTCACCAGACTATATGAACTTCGTCCTGTCTAT TCTTATCATATTTGGGATTCTCCTGTCGTATCTCCCTCAGCACATTCGCATAATCAACCTCAAAAGCT CCTTCGGGATCTCCCCATACTTCGTCCTCCTTGGAACCACATCAGGTACCTCGGCGTTGGCAAATGTT GTAACACAACAGCAGAGTTTACATGATGTGGAATGCTGCAAGAACATCAATGGACTAGCTTGCTTTGG GGGACTACTCGGCATCTTCCAAGTCGGGACACAGTGGCTTTGCTTTGTTATCATTCTGCTTTTGTTCG TTATTTACTTCCCCCGAGCCACCTCTCCAATCTCGCCTACAGAATCAGAGTCTTCGGCGAGAAATGGC CCATCATATACAACAGCGTTGGTCGTCAGCGGTATTTGTATCCTCCACGCCGTGGTGATGTTCATCAC CTCTGCCGCAATCGCGGTCAACCGGCCATCCCAGCTTCAGGCATGGTCCAACTTTTCAGGGGTGGTTG CTGCCATTCTCGCTTCGATCCAGTATTTCCCTCAGATCTACACTACGTTAAGGTTGCGGTGTGTGGGT AGCCTGAGCATCCCAATGATGTGTATCCAAACCCCGGGAAGTCTTGTGTGGGCAGGTAGTTTAGCTGC ACGACTGGGACCAAAAGGATGGAGTACATGGGGCGTGTTGATTGTGACGGCATGCTTACAAGGCACGC TCTTGGCAATGGCCATCTTTTTTGAATATTTTGGGCCCAACAAGCAGCGCAACCATCGCCATGGCAAA GATCTTCCTCCGAACGGTAGTGGAGAAGGCCCAGAGGAAAGAGACCATGAGCAGCCGTCTGAGGAAAC GCCACTTCTCCAA
SEQ ID No. 18 MDTPDLKCTHLTSPDYMNFVLSILIIFGILLSYLPQHIRIINLKSSFGISPYFVLLGTTSGTSALANV VTQQQSLHDVECCKNINGLACFGGLLGIFQVGTQWLCFVIILLLFVIYFPRATSPISPTESESSARNG PSYTTALVVSGICILHAVVMFITSAAIAVNRPSQLQAWSNFSGVVAAILASIQYFPQIYTTLRLRCVG
SLSIPMMCIQTPGSLVWAGSLAARLGPKGWSTWGVLIVTACLQGTLLAMAIFFEYFGPNKQRNHRHGK DLPPNGSGEGPEERDHEQPSEETPLLQ
SEQ ID No. 19
ATGCTTCTCTCTCAGACCAGAGGGCGTCTGCCCTCGACTCTCCGGAGCTTCTCTCGGTAGGCCACGCT GCTCTCCAAATAGGTCCTTTGAACTAAGAAGGATCGCCGTTAACGATATATTTCAATCCAGCCGTGCC CTTTCGACTACTCTCCCCAGAGGCAAAGATAGCGAAGAAACGGCTTTGAATAAGGTCTCCCGCAATGT CACACAACCCATCTCGCAGGGTGCTTCCCAGGCTATGCTGTACGCTACAGGCCTCACAGAGGAGGATA TGAACAAGGCGCAGGTCGGTATTTCGTCTGTCTGGTACAACGGCAACCCCTGTAACATGCACCTGCTG GATCTCAGCAACCGGGTGCGTGAAGGTGTGCAAAAGGCCGGTCTGGTCGGCTTCCAGTTCAACACCGT TGGTGTCAGCGACGCCATCAGTATGGGTACCAAGGGAATGCGATACTCTCTGCAGAGCCGTGATCTGA TCGCCGATTCCATCGAAACCGTCATGGGTGGCCAGTGGTACGACGCGAACATCAGTATCCCCGGTTGC GACAAGAACATGCCCGGTGTGTTGATGGCCATGGGTCGTGTCAACCGCCCCAGTCTGATGGTGTACGG CGGTACCATCAAGCCCGGCTGCGCCAGGACTCAGAACAACGCAGACATCGATATCGTTTCGGCCTTCC AGGCGTACGGACAGTTCCTGACCGGCGAGATCACCGAGAACCAGCGCTTTGACATCATCCGCAACGCC TGCCCCGGCGGTGGTGCCTGTGGCGGTATGTACACGGCCAACACCATGGCGACCGCCATCGAGGTCAT GGGTATGACGCTGCCCGGCTCCTCGTCGAACCCGGCCGAGTCCAAGGCCAAGGACCTCGAATGCTTGG CGGCCGGTGAGGCCATCAAGCGGCTGCTCAAGGAGGACATTCGGCCGTCGGACATCCTGACTCGCCAG GCCTTCGAGAACGCCATGATCGTCGTCAACATCACCGGTGGCTCGACCAATGCCGTCCTCCACCTGAT CGCCATCGCCGACTCGGTTGGCATCAAGCTCGACATCGAGGACTTCCAGAAGGTCTCGGACCGCACCC CTTTCCTGGCCGACCTGAAGCCATCGGGCAAGTACGTCATGGCTGACCTGCACAACATCGGCGGCACC CCCTCCCTGCTCAAGTTCCTGCTCAAGGAGGGCGTCATCGACGGCTCCGGCATGACCGTCACGGGTGA GACCCTTGCCAAGAACCTCGAGAAGGTTCCCGACTTCCCAGCCGACCAGAAGATCATCCGGCCCCTGT
CCAACCCCATCAAGAAGACCGGCCACATTCAGATCCTCCGCGGCTCCCTGGCCCCCGGCGGCTCCGTC GGCAAGATCACCGGCAAGGAAGGCACCCGCTTCGTCGGCAAGGCCCGCGTCTTCGACGACGAGGACGA TTTCATCGCCGCTCTCGAGCGCAACGAGATCAAAAAGGAGGAAAAGACCGTCGTTGTCATCCGCTACA CCGGCCCCAAGGGCGGACCCGGCATGCCCGAGATGCTCAAGCCCTCCTCCGCACTCATGGGCGCCGGC CTCGGCTCCTCCTGCGCTCTCATCACCGACGGCCGCTTCTCCGGCGGCTCTCACGGCTTCCTGATCGG TCACATCGTGCCCGAGGCCGCGGTCGGTGGTCCCATCGGTCTCGTCAAGGATGGCGACACCATCACCA TCGACGCCGAGAAGCGCCTGCTCGACCTCGACGTTGACGAGACCGAGCTTGCTCGTCGAAGGAAGGAG TGGGAGGCTCTCCGGGATGCCGGCAAGTTGCCTCAAACTGGTCTTACGATGAGGGGTACCCTGGGTAA ATACGCTAGGTATGTTCCGCCTCACCTATTTTCCCCTGGTCTTTGGTTTCCTAACGATAGCATACTGA CTGGCTGTAGAACTGTCAAGGATGCCAGCCACGGCTGCATCACTGACTCTGTAGAATGA
SEQ ID No. 20
ATGCTTCTCTCTCAGACCAGAGGGCGTCTGCCCTCGACTCTCCGGAGCTTCTCTCGCCGTGCCCTTTC
GACTACTCTCCCCAGAGGCAAAGATAGCGAAGAAACGGCTTTGAATAAGGTCTCCCGCAATGTCACAC AACCCATCTCGCAGGGTGCTTCCCAGGCTATGCTGTACGCTACAGGCCTCACAGAGGAGGATATGAAC AAGGCGCAGGTCGGTATTTCGTCTGTCTGGTACAACGGCAACCCCTGTAACATGCACCTGCTGGATCT CAGCAACCGGGTGCGTGAAGGTGTGCAAAAGGCCGGTCTGGTCGGCTTCCAGTTCAACACCGTTGGTG TCAGCGACGCCATCAGTATGGGTACCAAGGGAATGCGATACTCTCTGCAGAGCCGTGATCTGATCGCC GATTCCATCGAAACCGTCATGGGTGGCCAGTGGTACGACGCGAACATCAGTATCCCCGGTTGCGACAA GAACATGCCCGGTGTGTTGATGGCCATGGGTCGTGTCAACCGCCCCAGTCTGATGGTGTACGGCGGTA CCATCAAGCCCGGCTGCGCCAGGACTCAGAACAACGCAGACATCGATATCGTTTCGGCCTTCCAGGCG TACGGACAGTTCCTGACCGGCGAGATCACCGAGAACCAGCGCTTTGACATCATCCGCAACGCCTGCCC CGGCGGTGGTGCCTGTGGCGGTATGTACACGGCCAACACCATGGCGACCGCCATCGAGGTCATGGGTA TGACGCTGCCCGGCTCCTCGTCGAACCCGGCCGAGTCCAAGGCCAAGGACCTCGAATGCTTGGCGGCC GGTGAGGCCATCAAGCGGCTGCTCAAGGAGGACATTCGGCCGTCGGACATCCTGACTCGCCAGGCCTT CGAGAACGCCATGATCGTCGTCAACATCACCGGTGGCTCGACCAATGCCGTCCTCCACCTGATCGCCA TCGCCGACTCGGTTGGCATCAAGCTCGACATCGAGGACTTCCAGAAGGTCTCGGACCGCACCCCTTTC CTGGCCGACCTGAAGCCATCGGGCAAGTACGTCATGGCTGACCTGCACAACATCGGCGGCACCCCCTC CCTGCTCAAGTTCCTGCTCAAGGAGGGCGTCATCGACGGCTCCGGCATGACCGTCACGGGTGAGACCC TTGCCAAGAACCTCGAGAAGGTTCCCGACTTCCCAGCCGACCAGAAGATCATCCGGCCCCTGTCCAAC CCCATCAAGAAGACCGGCCACATTCAGATCCTCCGCGGCTCCCTGGCCCCCGGCGGCTCCGTCGGCAA GATCACCGGCAAGGAAGGCACCCGCTTCGTCGGCAAGGCCCGCGTCTTCGACGACGAGGACGATTTCA TCGCCGCTCTCGAGCGCAACGAGATCAAAAAGGAGGAAAAGACCGTCGTTGTCATCCGCTACACCGGC CCCAAGGGCGGACCCGGCATGCCCGAGATGCTCAAGCCCTCCTCCGCACTCATGGGCGCCGGCCTCGG CTCCTCCTGCGCTCTCATCACCGACGGCCGCTTCTCCGGCGGCTCTCACGGCTTCCTGATCGGTCACA TCGTGCCCGAGGCCGCGGTCGGTGGTCCCATCGGTCTCGTCAAGGATGGCGACACCATCACCATCGAC GCCGAGAAGCGCCTGCTCGACCTCGACGTTGACGAGACCGAGCTTGCTCGTCGAAGGAAGGAGTGGGA GGCTCTCCGGGATGCCGGCAAGTTGCCTCAAACTGGTCTTACGATGAGGGGTACCCTGGGTAAATACG CTAGAACTGTCAAGGATGCCAGCCACGGCTGCATCACTGACTCTGTAGAATGA
SEQ ID No. 21
MLLSQTRGRLPSTLRSFSRRALSTTLPRGKDSEETALNKVSRNVTQPISQGASQAMLYATGLTEEDMN
KAQVGISSVWYNGNPCNMHLLDLSNRVREGVQKAGLVGFQFNTVGVSDAISMGTKGMRYSLQSRDLIA DSIETVMGGQWYDANISIPGCDKNMPGVLMAMGRVNRPSLMVYGGTIKPGCARTQNNADIDIVSAFQA YGQFLTGEITENQRFDIIRNACPGGGACGGMYTANTMATAIEVMGMTLPGSSSNPAESKAKDLECLAA GEAIKRLLKEDIRPSDILTRQAFENAMIVVNITGGSTNAVLHLIAIADSVGIKLDIEDFQKVSDRTPF LADLKPSGKYVMADLHNIGGTPSLLKFLLKEGVIDGSGMTVTGETLAKNLEKVPDFPADQKIIRPLSN PIKKTGHIQILRGSLAPGGSVGKITGKEGTRFVGKARVFDDEDDFIAALERNEIKKEEKTVVVIRYTG PKGGPGMPEMLKPSSALMGAGLGSSCALITDGRFSGGSHGFLIGHIVPEAAVGGPIGLVKDGDTITID AEKRLLDLDVDETELARRRKEWEALRDAGKLPQTGLTMRGTLGKYARTVKDASHGCITDSVE
SEQ ID No. 22 ATGACGAATCGTACATCATTGAACGGACGGTGTCCGGTACCATTCCTTCAGGAAGATCTCTTCCCCCC CACTGGTGGATGTACGCTCCCCTGCTGATTGCTGCCGTGCTGTTGTCGACATTCGCTGATCGTGTTGG TTCTTTGCTTGCTAGTCATCGGAGGTCGTTATTGTCAACCTGTAGGCGATATTTCGTGCTGTTTACCC TGTCCGATAGTATCATGGACGTACGGTGATGGTAAGTTCACAGTGTGTTCGAGTTGCAGAGTCACCCA TCATTCTAACCTCGTTCACTTTTTCAAGGGTTGTTCGGGAAGGCGAGCTCTGCCAGCTGGATCAGTGT TGCTATCTTGCCGCTCTGTATCTTCCTGCTTGTGTCGTACGCAGTGCTCCCGGTCAAATTTACGCATC GCCACTACCTCAGCGTCTGCTTCACATTGGGTATCTGTTTCATGGAGGCAAGTAAAGTGGCGCGTTGG AGGGACCTACGACCTGTTATTAACCGACATGCAGATTGCGTTCATTATCCCTCTAGGGGTAAAACCAG ATCAATGCTATAACCAGATCACCCCCAACGATATGCATTCTAGTCTCTCCTGCGCTTTTACAGGCTCC CTTCTTCTTCTCGGAGGCTGGATGGTGGTAGTATGGAGTATGTTGGGAACAATTCTGAGCTTATTTGG ATTGCCCTCGTTCTAACGATCTCCAGGTTTTCTCCGTACGGTGGCCTTCCATCTTCAGGTATGCTGGG AAGTCATACTGGGTCCGAAATTCATGTGGGGAGCGTTGATCTTCGGTTGGGTCGTCCCGGCTGTCGGC CTCACGGTGATGCTGATCTTGACGGGCGTATCTTTTCGATTCGGCACCGTTTGTCATATCAACATCGA TGGTGCCCTGCAGGATTATTGGATTCCCATCATCTCGTTCGCGGTTGCCGCACTGATCCTCCAACTGG CGACAATGGCGTACTGCATCCACGTCTACGTCAAGTCTTTGTTCGACACCGATTCGACGACAAACAGC TCGGGATTGCCGTCCTACTCTGCCAGCGTCCGCACCGTGTCCGCTCGTCAAGCATACCGTCGCATCCG CAGAGTCCTCCAATTACAGTGGCGAGGCGTAACTCTGGTCTTGATCATCATCGCGAATGTGATTTTCT TCTCCGTGACCTTCATCGAGCTCGACAGCTCCCTCAAACCGACCGCGGAGAATATGGAAAAGGCGCTT CCATGGGTTGCATGTCTGGCAGCCACCAATGGTGACCGAGAAAAATGCGACCCCGAGGCGGCAAAGTT CCGGCCTAGCGAAGGGTTACTCCTCGCCGTCCTGGTTCTCCTTTCCCTGGTCGGATTCTGGAACTTCA TCCTCTTTGCCCGCCCCTCGATCTTCCACGGCTGGGTCGACTTCTTCCAGAACAAGTTTGGCACGGGC GACGGCCGTCTCGAGTTCGTGTCGGCTGATGCCCGCACGCGGCTGGGCGACACGCGGTCCTACGAGAT GCTCAACAGCACGGGCCTCCCCTCCTACAAATCGCCATCGCCCATGGTCCGATCGCCAAGTCCTGCGC GCATGGGGGGCACGAAGAGTCCCGAGAATGGCCACTTTGGGCGAGACGCCAGGTATGTGCGGCCGTCC ATGAGTTTCTCCAGCCCGCGGCCGCCCAGCGCATCACAAGGGCGCGGCTGGGATCCCAAGACAACGTT TGCC
SEQ ID No. 23
ATGACGAATCGTACATCATTGAACGGACGGTGTCCGGTACCATTCCTTCAGGAAGATCTCTTCCCCCC
CACTGGTGGATTCATCGGAGGTCGTTATTGTCAACCTGTAGGCGATATTTCGTGCTGTTTACCCTGTC CGATAGTATCATGGACGTACGGTGATGGGTTGTTCGGGAAGGCGAGCTCTGCCAGCTGGATCAGTGTT GCTATCTTGCCGCTCTGTATCTTCCTGCTTGTGTCGTACGCAGTGCTCCCGGTCAAATTTACGCATCG CCACTACCTCAGCGTCTGCTTCACATTGGGTATCTGTTTCATGGAGGCAAGTAAAATTGCGTTCATTA TCCCTCTAGGGGTAAAACCAGATCAATGCTATAACCAGATCACCCCCAACGATATGCATTCTAGTCTC TCCTGCGCTTTTACAGGCTCCCTTCTTCTTCTCGGAGGCTGGATGGTGGTAGTATGGAGTTTTCTCCG TACGGTGGCCTTCCATCTTCAGGTATGCTGGGAAGTCATACTGGGTCCGAAATTCATGTGGGGAGCGT TGATCTTCGGTTGGGTCGTCCCGGCTGTCGGCCTCACGGTGATGCTGATCTTGACGGGCGTATCTTTT CGATTCGGCACCGTTTGTCATATCAACATCGATGGTGCCCTGCAGGATTATTGGATTCCCATCATCTC GTTCGCGGTTGCCGCACTGATCCTCCAACTGGCGACAATGGCGTACTGCATCCACGTCTACGTCAAGT CTTTGTTCGACACCGATTCGACGACAAACAGCTCGGGATTGCCGTCCTACTCTGCCAGCGTCCGCACC GTGTCCGCTCGTCAAGCATACCGTCGCATCCGCAGAGTCCTCCAATTACAGTGGCGAGGCGTAACTCT GGTCTTGATCATCATCGCGAATGTGATTTTCTTCTCCGTGACCTTCATCGAGCTCGACAGCTCCCTCA AACCGACCGCGGAGAATATGGAAAAGGCGCTTCCATGGGTTGCATGTCTGGCAGCCACCAATGGTGAC CGAGAAAAATGCGACCCCGAGGCGGCAAAGTTCCGGCCTAGCGAAGGGTTACTCCTCGCCGTCCTGGT TCTCCTTTCCCTGGTCGGATTCTGGAACTTCATCCTCTTTGCCCGCCCCTCGATCTTCCACGGCTGGG TCGACTTCTTCCAGAACAAGTTTGGCACGGGCGACGGCCGTCTCGAGTTCGTGTCGGCTGATGCCCGC ACGCGGCTGGGCGACACGCGGTCCTACGAGATGCTCAACAGCACGGGCCTCCCCTCCTACAAATCGCC ATCGCCCATGGTCCGATCGCCAAGTCCTGCGCGCATGGGGGGCACGAAGAGTCCCGAGAATGGCCACT TTGGGCGAGACGCCAGGTATGTGCGGCCGTCCATGAGTTTCTCCAGCCCGCGGCCGCCCAGCGCATCA CAAGGGCGCGGCTGGGATCCCAAGACAACGTTTGCC
SEQ ID No. 24
MTNRTSLNGRCPVPFLQEDLFPPTGGFIGGRYCQPVGDISCCLPCPIVSWTYGDGLFGKASSASWISV AILPLCIFLLVSYAVLPVKFTHRHYLSVCFTLGICFMEASKIAFIIPLGVKPDQCYNQITPNDMHSSL SCAFTGSLLLLGGWMVVVWSFLRTVAFHLQVCWEVILGPKFMWGALIFGWVVPAVGLTVMLILTGVSF RFGTVCHINIDGALQDYWIPIISFAVAALILQLATMAYCIHVYVKSLFDTDSTTNSSGLPSYSASVRT VSARQAYRRIRRVLQLQWRGVTLVLIIIANVIFFSVTFIELDSSLKPTAENMEKALPWVACLAATNGD REKCDPEAAKFRPSEGLLLAVLVLLSLVGFWNFILFARPSIFHGWVDFFQNKFGTGDGRLEFVSADAR TRLGDTRSYEMLNSTGLPSYKSPSPMVRSPSPARMGGTKSPENGHFGRDARYVRPSMSFSSPRPPSAS QGRGWDPKTTFA
SEQ ID No. 25
ATGAAGACAACTACCGAACTTCCACTGCGTATACTAACACACAATATCCGCTATGCAACCAGTTCCCC ATTCAAGGGTGAGCTGCCCTGGAACGATCGCAAGCAGCCTCTCTTAAATGAACTGCTCTTCAATACGC GCAATCAGGATGCATTCATCTGCTTACAAGAAGTGCTCCATAATCAACTGGTCGATGTTCTATCCGGA CTGAAACAACCTCCTTCTACTATTCCCAAAAGCGCCTCCGAGACACAACAATGGGAATACATCGGAGT CGGTCGAGACGACGGGCACAAAGCAGGAGAGTACTCGCCCATCTTCTATCAACCGTCTGTTTGGCAGC TGTGTCATTGGGAAAGTGTCTGGTTGTCGGAAACGCCGAACAAACCATCCAAAGGCTGGGATGCGGCG TCTATAAGGATTCTGGTAAGACACTAGGCGTGTGACGAAATCCTCTTCATTAAGTTGACGCCAAACTA GACCATCGGTGTCTTCACACACAACATAACTCGCCATACCGTCCTTGTGATGAATACACATCTAGACG ACCAGGGATCTCAGTCACGCTTCGAGGCTGCCAAAATCATCCTTCAGAAGATCGATGAATACCGAAGC GGCAAATTTGGGACACTTATCGCAGGGGTATTCCTCGCAGGTGATTTTAATAGTCAAGAGACGCAAGA AGCCTATAATGTCTTGACAGGGTCGGAATCCTCCTTGGTCGACACAGCAAAGGTTGTGGAACCTAGCC AGCACTACGGCAATTATTATACATGGACAGGGTTCGGATATGAGGGAGAGGATCCCACGCGCATTGAT TATATTTTAATCGGACCTGGGAAGAACAAACTTGGGTCCTGGATAGTCAACGGATACGCCGTGTTAGC GAACCGATTCGATTCTGGCGTGTTTCTGTCAGACCACAGGGCCGTTGTCGCAGACATCACCTTGTACA ACTGA
SEQ ID No. 26 ATGAAGACAACTACCGAACTTCCACTGCGTATACTAACACACAATATCCGCTATGCAACCAGTTCCCC ATTCAAGGGTGAGCTGCCCTGGAACGATCGCAAGCAGCCTCTCTTAAATGAACTGCTCTTCAATACGC GCAATCAGGATGCATTCATCTGCTTACAAGAAGTGCTCCATAATCAACTGGTCGATGTTCTATCCGGA CTGAAACAACCTCCTTCTACTATTCCCAAAAGCGCCTCCGAGACACAACAATGGGAATACATCGGAGT CGGTCGAGACGACGGGCACAAAGCAGGAGAGTACTCGCCCATCTTCTATCAACCGTCTGTTTGGCAGC TGTGTCATTGGGAAAGTGTCTGGTTGTCGGAAACGCCGAACAAACCATCCAAAGGCTGGGATGCGGCG TCTATAAGGATTCTGACCATCGGTGTCTTCACACACAACATAACTCGCCATACCGTCCTTGTGATGAA TACACATCTAGACGACCAGGGATCTCAGTCACGCTTCGAGGCTGCCAAAATCATCCTTCAGAAGATCG ATGAATACCGAAGCGGCAAATTTGGGACACTTATCGCAGGGGTATTCCTCGCAGGTGATTTTAATAGT CAAGAGACGCAAGAAGCCTATAATGTCTTGACAGGGTCGGAATCCTCCTTGGTCGACACAGCAAAGGT TGTGGAACCTAGCCAGCACTACGGCAATTATTATACATGGACAGGGTTCGGATATGAGGGAGAGGATC CCACGCGCATTGATTATATTTTAATCGGACCTGGGAAGAACAAACTTGGGTCCTGGATAGTCAACGGA TACGCCGTGTTAGCGAACCGATTCGATTCTGGCGTGTTTCTGTCAGACCACAGGGCCGTTGTCGCAGA CATCACCTTGTACAACTGA
SEQ ID No. 27
MKTTTELPLRILTHNIRYATSSPFKGELPWNDRKQPLLNELLFNTRNQDAFICLQEVLHNQLVDVLSG LKQPPSTIPKSASETQQWEYIGVGRDDGHKAGEYSPIFYQPSVWQLCHWESVWLSETPNKPSKGWDAA SIRILTIGVFTHNITRHTVLVMNTHLDDQGSQSRFEAAKIILQKIDEYRSGKFGTLIAGVFLAGDFNS QETQEAYNVLTGSESSLVDTAKVVEPSQHYGNYYTWTGFGYEGEDPTRIDYILIGPGKNKLGSWIVNG YAVLANRFDSGVFLSDHRAVVADITLYN
SEQ ID No. 28 ATGTCGCTCTCACCGCGAGAATTGTTCGCCATATCGACGACAGAACGCATATGCTCTGCGATCTCTCT CGCTGGCACAAGTATCATAATCATCTCTTTCCTCTCATCTAGATCCTTTCGCAAGCCGATCAATCGTC TCGTATTTTACGCGTCATGGGGAAATATCATGGCAAATGTAGCCACGATGATCTCTCAGTCGGGCATC GCCTACGGGACAAGCAGCTGCCTTTGTCAGTTCCAGGCGTTTCTGATTCAATGGTCAGTAGCTCGGTC GGATTGCCCGTGTCTGATGCCCATGACTTTTATACTGATGGTTGCTGTTCGCATGTCGCAGGTTCATG CCCGCTGATGCGTTGTGGACTCTTGCCATGGCATGCAATGTCTACTTGACATTCTTCCACAAATATAA CTCGGAACAGCTGCGACAACTCGAGTGGAAATACGTGCTTTTCTGCTATGGCCTCCCCTTTATTCCGG CATTCGTTTATTTCTTCATAGAAACCGAGGCTCGGGGAAAGGTTTACGGCTCAGCCATAGTAAGTGCA ACGTTTGTCGTAGGGGAATCAGTCATATGCAGAGTTACCCTCTAACTCGTTGTTGCTTGACAGCTCTG GTGCTGGGTGTCGCTCCCCTGGGATTTTCTTCGCATTGCGGTCTTTTATGGTCCAGTCTGGTTCGTCA TTTTCCTAACATTTGCCATTTACCTGCGTGCTGGCAAAGTAATCTTCGAGAAACGACGGCAACTCGAG GAGGCTGAGTGTCCGGAGTCTTCCGGCGAAATTGACAGTCCTATGGAGCCCGCTGTTTCCAAAAAGAC AGAAATCCACGTCACCAGCGAGATCACACATTCTGGGAGTGAATCCAATCGATTGTCCATCATGAGTG CCAGCCTTATACCGCATCGATACCTCAGCCCATACTCGCCCTACTCTGTAACCATTGAAGGTGGCAGC GCAGCTGGCAACGACGAACATGTGCCAATGCGGACGTTGAGAAGCAGCCAGCACGATCCATACGCTCA GCATCACGCTATGGCGAGAGACGTGAACTCGGCCGCCTGGGCTTATACCAAATATGCGATGCTTTTCT TCATCGCACTGCTTGTCACATGGGTCAGTTCATCATCATCAGCAGCAGCAGAAGCAATGGTGTCAGTA ACAGTCTGGACTGATATTGATATTTGGAATAGGTACCCTCCACGATTAACAGATTGTACGCGCTGATC TATCCGCGCAACTTCAACTTCGGCATGAACTATACGTCCAGTTTCGTCCTTCCGCTGCAAGGCTTCTG GAATAGTATCATTTACGTATCAATCTCTTGGCCTGCTTTCAAAGAGGCTTTTGCGAAGATCAAATGGC GACATTCCCTGCAAAGAGGCCCATCCCAACACATAATCACTGGACATGGGGCTGTCGGAAGTCTCCAT TCGCATGGGAACGACAGCACCCGCGCGTTGACG
SEQ ID No. 29
ATGTCGCTCTCACCGCGAGAATTGTTCGCCATATCGACGACAGAACGCATATGCTCTGCGATCTCTCT
CGCTGGCACAAGTATCATAATCATCTCTTTCCTCTCATCTAGATCCTTTCGCAAGCCGATCAATCGTC TCGTATTTTACGCGTCATGGGGAAATATCATGGCAAATGTAGCCACGATGATCTCTCAGTCGGGCATC GCCTACGGGACAAGCAGCTGCCTTTGTCAGTTCCAGGCGTTTCTGATTCAATGGTTCATGCCCGCTGA TGCGTTGTGGACTCTTGCCATGGCATGCAATGTCTACTTGACATTCTTCCACAAATATAACTCGGAAC AGCTGCGACAACTCGAGTGGAAATACGTGCTTTTCTGCTATGGCCTCCCCTTTATTCCGGCATTCGTT TATTTCTTCATAGAAACCGAGGCTCGGGGAAAGGTTTACGGCTCAGCCATACTCTGGTGCTGGGTGTC GCTCCCCTGGGATTTTCTTCGCATTGCGGTCTTTTATGGTCCAGTCTGGTTCGTCATTTTCCTAACAT TTGCCATTTACCTGCGTGCTGGCAAAGTAATCTTCGAGAAACGACGGCAACTCGAGGAGGCTGAGTGT CCGGAGTCTTCCGGCGAAATTGACAGTCCTATGGAGCCCGCTGTTTCCAAAAAGACAGAAATCCACGT CACCAGCGAGATCACACATTCTGGGAGTGAATCCAATCGATTGTCCATCATGAGTGCCAGCCTTATAC CGCATCGATACCTCAGCCCATACTCGCCCTACTCTGTAACCATTGAAGGTGGCAGCGCAGCTGGCAAC GACGAACATGTGCCAATGCGGACGTTGAGAAGCAGCCAGCACGATCCATACGCTCAGCATCACGCTAT GGCGAGAGACGTGAACTCGGCCGCCTGGGCTTATACCAAATATGCGATGCTTTTCTTCATCGCACTGC TTGTCACATGGGTACCCTCCACGATTAACAGATTGTACGCGCTGATCTATCCGCGCAACTTCAACTTC GGCATGAACTATACGTCCAGTTTCGTCCTTCCGCTGCAAGGCTTCTGGAATAGTATCATTTACGTATC AATCTCTTGGCCTGCTTTCAAAGAGGCTTTTGCGAAGATCAAATGGCGACATTCCCTGCAAAGAGGCC CATCCCAACACATAATCACTGGACATGGGGCTGTCGGAAGTCTCCATTCGCATGGGAACGACAGCACC CGCGCGTTGACG
SEQ ID No. 30 MSLSPRELFAISTTERICSAISLAGTSIIIISFLSSRSFRKPINRLVFYASWGNIMANVATMISQSGI AYGTSSCLCQFQAFLIQWFMPADALWTLAMACNVYLTFFHKYNSEQLRQLEWKYVLFCYGLPFIPAFV YFFIETEARGKVYGSAILWCWVSLPWDFLRIAVFYGPVWFVIFLTFAIYLRAGKVIFEKRRQLEEAEC PESSGEIDSPMEPAVSKKTEIHVTSEITHSGSESNRLSIMSASLIPHRYLSPYSPYSVTIEGGSAAGN DEHVPMRTLRSSQHDPYAQHHAMARDVNSAAWAYTKYAMLFFIALLVTWVPSTINRLYALIYPRNFNF GMNYTSSFVLPLQGFWNSIIYVSISWPAFKEAFAKIKWRHSLQRGPSQHIITGHGAVGSLHSHGNDST RALT
SEQ ID No. 31 ATGGAGACTGGCACCGACAAAGGCTCGGCGGACGCCATCCTCGAGCATCTAGGCTATACTCCCGAACT GTCCCGCAACCGGTCCGTCCTGCAGGTCGCCTTTATGTGCTTTATCCTCTCCTCCGTCCCCTACGGTC TGGCTACCACCTTCTTCTACCCACTCGCCGCCGGAGGCCCGTCCACCATCGTCTGGGGCTGGATTATC GTCTCCCTCGTTATCCTGTGTGTAGCCATCTCGCTGGCCGAGATCACCTCCGTCTATCCCACTGCCGG TGGCGTCTACTACCAGACCTTTGCCCTATCGCCCCCCTGGTGCCGTCGCGCCGCCGCCTGGATCTGTG GATGGGCCTATGTCCTCGGCAATATCACCATTACCCTGGCTGTAAACTTTGGGACCACCCTGTTCTTT GTCGCCTGTCTCAATGTCTTCGAATCCGAGCCCGGTGTCGGGATTGTAGATGACATGCAGACCTACCA GATCTACCTCATCTTCCTCGCCATCACCCTGTTGACCCATGCGATCTCGTCCCTAGGCAACAAATGGC TCCCCAGTCTGGAGGTATGTTTCTTTTCTTTTCTTTTCTTTTTTTTTTTTATTTATTTTCCAATGTTT TCCCTCCGTTGCGACCCGTGTGTCTTGGCTCCCAATGCGACTTCTCATGCGTTTCGTCTCCCCATATA GATTTCAGCCATCTTCTTGACCCTCATCGGCCTGATTGCCCTGATCATCTCCGTCCTTGTGGTCGCCA AACATGGTCGACACTCTGGTAAGTGGGTCTTCGCCGACTTTGAGCCCCAGTCGGGCTGGCCGGCCGGC TGGTCTTTCTGTATCGGTCTGTTGCAGGCCGCATACGCCACCTCGGCGACTGGCATGATCATCTCGTG AGCAATCCCGTTTCGCCCTTTTGCATGATCGGTCCATTAAGATTTTGATAGGATGTGCGAGGAAGTCC GGGAGCCTGCCATTCAGGTGCCCAAAGCCATGGTGGGCACCATCGTGCTCAACTTCGTGGCCGGCTTG GGCTTCTTGCTGCCGCTGACCTTTGTCCTGCCCGATATCACGATGCTGGTGAACCTGGCCTCGGGGCA GCCCACCCCCGTGATCCTGAAAGATGCGCTGGGCAGCTCGACCGGAGCCTTCCTGCTGCTTCTCCCTC TGCTCATTCTCGGCGTCATCTGCGGCGTGGGATGTGTGACGGCCGCCTCCCGCTGCACCTGGGCGTTT GCGCGCGACGGCGGTATCCCCGGCTCCAAGTGGTGGAAGACCGTCAACGCGACGCTGGACATTCCCCT CAACGCCATGATGCTGGGCATGACGGTGGAGATTGCCCTCGGTGCCATCTACTTTGGCTCCACGGCCG CGTTCAACGCCTTCTCCGGTGTCGGTGTGATTTTCCTGACCCTCAGCTACGCGTGCCCCATCGCGGTC TCGTTCTTCTTCCGTCGTCGCTCGGAGATCGCCAACGCCAGATTCAACCTGGGGATCATCGGCAGTAT CTGCAACGTGGTTGCTCTGGGTAAGTCTCCGCCCCGCTCGCCCACTCCGCAACTCACTCCTCCTGGGC AACCTACTGACGGGTCTACAGCATGGAGTCTTCTGGCCATCCCTCTGTTCTGTATGCCGACGTACAAG GTCGTCACCCTGGAGACCATGAACTACGCCTGTGTCGTCTTCGTCGGGTTCACGACCATCGCCGGACT CTGGTACTTGGTCTGGGGCTACCGCAACTACGACGGTCCCCCCAAGGAGGGTATCGATGGAGTGGAGG CCGATTTCCCCGATCTGCCCGCCAAGTCTGGGTAA
SEQ ID No. 32
ATGGAGACTGGCACCGACAAAGGCTCGGCGGACGCCATCCTCGAGCATCTAGGCTATACTCCCGAACT
GTCCCGCAACCGGTCCGTCCTGCAGGTCGCCTTTATGTGCTTTATCCTCTCCTCCGTCCCCTACGGTC TGGCTACCACCTTCTTCTACCCACTCGCCGCCGGAGGCCCGTCCACCATCGTCTGGGGCTGGATTATC GTCTCCCTCGTTATCCTGTGTGTAGCCATCTCGCTGGCCGAGATCACCTCCGTCTATCCCACTGCCGG TGGCGTCTACTACCAGACCTTTGCCCTATCGCCCCCCTGGTGCCGTCGCGCCGCCGCCTGGATCTGTG GATGGGCCTATGTCCTCGGCAATATCACCATTACCCTGGCTGTAAACTTTGGGACCACCCTGTTCTTT GTCGCCTGTCTCAATGTCTTCGAATCCGAGCCCGGTGTCGGGATTGTAGATGACATGCAGACCTACCA GATCTACCTCATCTTCCTCGCCATCACCCTGTTGACCCATGCGATCTCGTCCCTAGGCAACAAATGGC TCCCCAGTCTGGAGATTTCAGCCATCTTCTTGACCCTCATCGGCCTGATTGCCCTGATCATCTCCGTC CTTGTGGTCGCCAAACATGGTCGACACTCTGGTAAGTGGGTCTTCGCCGACTTTGAGCCCCAGTCGGG CTGGCCGGCCGGCTGGTCTTTCTGTATCGGTCTGTTGCAGGCCGCATACGCCACCTCGGCGACTGGCA TGATCATCTCGATGTGCGAGGAAGTCCGGGAGCCTGCCATTCAGGTGCCCAAAGCCATGGTGGGCACC ATCGTGCTCAACTTCGTGGCCGGCTTGGGCTTCTTGCTGCCGCTGACCTTTGTCCTGCCCGATATCAC GATGCTGGTGAACCTGGCCTCGGGGCAGCCCACCCCCGTGATCCTGAAAGATGCGCTGGGCAGCTCGA CCGGAGCCTTCCTGCTGCTTCTCCCTCTGCTCATTCTCGGCGTCATCTGCGGCGTGGGATGTGTGACG GCCGCCTCCCGCTGCACCTGGGCGTTTGCGCGCGACGGCGGTATCCCCGGCTCCAAGTGGTGGAAGAC CGTCAACGCGACGCTGGACATTCCCCTCAACGCCATGATGCTGGGCATGACGGTGGAGATTGCCCTCG GTGCCATCTACTTTGGCTCCACGGCCGCGTTCAACGCCTTCTCCGGTGTCGGTGTGATTTTCCTGACC CTCAGCTACGCGTGCCCCATCGCGGTCTCGTTCTTCTTCCGTCGTCGCTCGGAGATCGCCAACGCCAG ATTCAACCTGGGGATCATCGGCAGTATCTGCAACGTGGTTGCTCTGGCATGGAGTCTTCTGGCCATCC CTCTGTTCTGTATGCCGACGTACAAGGTCGTCACCCTGGAGACCATGAACTACGCCTGTGTCGTCTTC GTCGGGTTCACGACCATCGCCGGACTCTGGTACTTGGTCTGGGGCTACCGCAACTACGACGGTCCCCC CAAGGAGGGTATCGATGGAGTGGAGGCCGATTTCCCCGATCTGCCCGCCAAGTCTGGGTAA
SEQ ID No. 33
METGTDKGSADAILEHLGYTPELSRNRSVLQVAFMCFILSSVPYGLATTFFYPLAAGGPSTIVWGWII VSLVILCVAISLAEITSVYPTAGGVYYQTFALSPPWCRRAAAWICGWAYVLGNITITLAVNFGTTLFF VACLNVFESEPGVGIVDDMQTYQIYLIFLAITLLTHAISSLGNKWLPSLEISAIFLTLIGLIALIISV LVVAKHGRHSGKWVFADFEPQSGWPAGWSFCIGLLQAAYATSATGMIISMCEEVREPAIQVPKAMVGT IVLNFVAGLGFLLPLTFVLPDITMLVNLASGQPTPVILKDALGSSTGAFLLLLPLLILGVICGVGCVT AASRCTWAFARDGGIPGSKWWKTVNATLDIPLNAMMLGMTVEIALGAIYFGSTAAFNAFSGVGVIFLT LSYACPIAVSFFFRRRSEIANARFNLGIIGSICNVVALAWSLLAIPLFCMPTYKVVTLETMNYACVVF VGFTTIAGLWYLVWGYRNYDGPPKEGIDGVEADFPDLPAKSG
SEQ ID No. 34 ATGAAGACAAAAGCAACCTCCCTCTTCCTCTGCGTCTCGGCGCTTGTATCCTCAATCTCTGCCCTCAC CATCAGCCGAATCAATGGAAACAAATACCTCTCGCCCTACGCCGGCGAAACCGTCTCCAATATCCAAG GCCTGGTAACAGCCAAAGGCCCCTCTGGCTTTTACCTACGGTCAACAACCCCAGACGACGACGACGCC ACCTCGGAATCCATCTACGTCTACGGCAGCACCGCTGTCTCCAAAGTCTCCGTCGGCGACATCATCTC CCTTTCGGGGAAAGTCTCCGAGTACCGCTCCTCGGCCTCATACGTGTATCTAACGGAACTCACCTCGC CATCTGCCATCTCCGTCGTCTCGCGCGGGAATGCCGTCGTGCCCGTGGTGGTCGGCAAGGGCGGCCGG GCCCCTCCGACGGAGCAGTTCTCCGTGCTGGACGGCGGGGATGTGCTTGCTGTGCCGAATAATGTGAG CACAGTCAGCAAGACGAATCCAGTGCTGCGGCCGGGGACGTACGGGATGGACTTTTGGGAGAGTCTGA GCGGGGAGTTGGTGAGGGTGTCGAACGTAAGGGCGATTGCGAGGCCGAATTCGTACGGTGATACGTGG GTGCGTGGGGATTGGAAGGTAAGTGGGGAGAATGAGAGGGGTGGGTTGACGATGCGGGAGAAGGGTAT GCATGCATGTTCGTCAGGATGTTTAGACGAGGCTGATGAACGTACCTAGACTCGAACCCCGAGGCTGT CATTGTTGGGTCGCCGCTGGATGGTAGTAAAAATCCTACTGATACCAAGCTGGGGGATTTGGTTGGGG ATATCACAGGCGTGATCACGACGGCGTACGGGTACTATGTGCTGCTGCCCCTGACAGCCTTGACTGTT ACGGGGTCAAACACGACGGCTGCTGCTCCTACAGATTTGGAGTCTGCTGGAACATGCGGTGGTGTGAC GGTTGGCTCGTACAATGTCAACAACCTGGCTCCCAACTCGACGACGCTGCCCAAGATCGCGCAGCATA TTGCGCAGTACCTGAAGAGTCCGACGCTGGTCTTTCTGCAAGAAATCCAGGATGATGACGGGCCGACT GATGACGGCGGTATGCTTCCTCTCAGGCTGACTGTGGCCATCGGCTAATGCATGCGCCAGTGGTATCC GCCAACAAGACGCTCTCGACCCTCGTGCAGTCCATCGCCGACCAAGGCGGCATCCGGTACTCGTTTGT CGATATTGCCCCGGTCAACAAGGAAGACGGCGGCCAGCCAGGCGGGAATATCCGCGTCGCCTACCTGT ACGATCCGTCTGTGATTCGCCTGCGTGACGCCAACCCTGGCTCCAACACCGATGCCAATGAGGTCCTC CCCGGCCCCGAACTCAAGTATAACCCTGGCCTGATCGACCCGTCCAACAACGCGTGGCTCAACAGTCG CAAGCCGCTAGCCGCAGCCTGGGAAACGCTCGACGGCAAGAACAAGTTCTTCACGGTGAATGTGCACT TCACCAGTAAGGGCGGTGGATCGTCAATTGAGGGCGATGCCCGGCCGCCAGTCAACGGAGGCGTTGCC ACACGCGAGGCGCAGGCCAAGCTCGTTGCTGTACGTTGTCCTCGTTCTCAATCCTTGAGCGGTACTAA CGTCAGAAGGAATTTACATCGTCCATCCTAGCCGAAGACTCCACCGCCAAGATCATTGTTTCCGGCGA CTTTAACGAGTTCACCTTTGCACAGCCGCTTGAAACGTTCCTGGCTGAATCTGGGTTGGAGGATCTCG ACGAGGTCGCTGGGATTGCAGCCACGGAGCGGTACACGTATCTGTACGATATGAACTGCCAACAGCTG GACCACATGTTTGTGAGCCCGGCGCTGGCAACAGGAGCGCAGATGCACCATCTGCATGTCAACACCTG GGTTTCGTTCGACGACCAGGCAAGCGACCATGATCCTACGGTGGCGTTGCTGAACGTGTGTAGCTAG
SEQ ID No. 35 ATGAAGACAAAAGCAACCTCCCTCTTCCTCTGCGTCTCGGCGCTTGTATCCTCAATCTCTGCCCTCAC CATCAGCCGAATCAATGGAAACAAATACCTCTCGCCCTACGCCGGCGAAACCGTCTCCAATATCCAAG GCCTGGTAACAGCCAAAGGCCCCTCTGGCTTTTACCTACGGTCAACAACCCCAGACGACGACGACGCC ACCTCGGAATCCATCTACGTCTACGGCAGCACCGCTGTCTCCAAAGTCTCCGTCGGCGACATCATCTC CCTTTCGGGGAAAGTCTCCGAGTACCGCTCCTCGGCCTCATACGTGTATCTAACGGAACTCACCTCGC CATCTGCCATCTCCGTCGTCTCGCGCGGGAATGCCGTCGTGCCCGTGGTGGTCGGCAAGGGCGGCCGG GCCCCTCCGACGGAGCAGTTCTCCGTGCTGGACGGCGGGGATGTGCTTGCTGTGCCGAATAATGTGAG CACAGTCAGCAAGACGAATCCAGTGCTGCGGCCGGGGACGTACGGGATGGACTTTTGGGAGAGTCTGA GCGGGGAGTTGGTGAGGGTGTCGAACGTAAGGGCGATTGCGAGGCCGAATTCGTACGGTGATACGTGG GTGCGTGGGGATTGGAAGGTAAGTGGGGAGAATGAGAGGGGTGGGTTGACGATGCGGGAGAAGGACTC GAACCCCGAGGCTGTCATTGTTGGGTCGCCGCTGGATGGTAGTAAAAATCCTACTGATACCAAGCTGG GGGATTTGGTTGGGGATATCACAGGCGTGATCACGACGGCGTACGGGTACTATGTGCTGCTGCCCCTG ACAGCCTTGACTGTTACGGGGTCAAACACGACGGCTGCTGCTCCTACAGATTTGGAGTCTGCTGGAAC ATGCGGTGGTGTGACGGTTGGCTCGTACAATGTCAACAACCTGGCTCCCAACTCGACGACGCTGCCCA AGATCGCGCAGCATATTGCGCAGTACCTGAAGAGTCCGACGCTGGTCTTTCTGCAAGAAATCCAGGAT GATGACGGGCCGACTGATGACGGCGTGGTATCCGCCAACAAGACGCTCTCGACCCTCGTGCAGTCCAT CGCCGACCAAGGCGGCATCCGGTACTCGTTTGTCGATATTGCCCCGGTCAACAAGGAAGACGGCGGCC AGCCAGGCGGGAATATCCGCGTCGCCTACCTGTACGATCCGTCTGTGATTCGCCTGCGTGACGCCAAC CCTGGCTCCAACACCGATGCCAATGAGGTCCTCCCCGGCCCCGAACTCAAGTATAACCCTGGCCTGAT CGACCCGTCCAACAACGCGTGGCTCAACAGTCGCAAGCCGCTAGCCGCAGCCTGGGAAACGCTCGACG GCAAGAACAAGTTCTTCACGGTGAATGTGCACTTCACCAGTAAGGGCGGTGGATCGTCAATTGAGGGC GATGCCCGGCCGCCAGTCAACGGAGGCGTTGCCACACGCGAGGCGCAGGCCAAGCTCGTTGCTAAGGA ATTTACATCGTCCATCCTAGCCGAAGACTCCACCGCCAAGATCATTGTTTCCGGCGACTTTAACGAGT TCACCTTTGCACAGCCGCTTGAAACGTTCCTGGCTGAATCTGGGTTGGAGGATCTCGACGAGGTCGCT GGGATTGCAGCCACGGAGCGGTACACGTATCTGTACGATATGAACTGCCAACAGCTGGACCACATGTT TGTGAGCCCGGCGCTGGCAACAGGAGCGCAGATGCACCATCTGCATGTCAACACCTGGGTTTCGTTCG ACGACCAGGCAAGCGACCATGATCCTACGGTGGCGTTGCTGAACGTGTGTAGCTAG
SEQ ID No. 36 MKTKATSLFLCVSALVSSISALTISRINGNKYLSPYAGETVSNIQGLVTAKGPSGFYLRSTTPDDDDA TSESIYVYGSTAVSKVSVGDIISLSGKVSEYRSSASYVYLTELTSPSAISVVSRGNAVVPWVGKGGR APPTEQFSVLDGGDVLAVPNNVSTVSKTNPVLRPGTYGMDFWESLSGELVRVSNVRAIARPNSYGDTW VRGDWKVSGENERGGLTMREKDSNPEAVIVGSPLDGSKNPTDTKLGDLVGDITGVITTAYGYYVLLPL TALTVTGSNTTAAAPTDLESAGTCGGVTVGSYNVNNLAPNSTTLPKIAQHIAQYLKSPTLVFLQEIQD DDGPTDDGVVSANKTLSTLVQSIADQGGIRYSFVDIAPVNKEDGGQPGGNIRVAYLYDPSVIRLRDAN PGSNTDANEVLPGPELKYNPGLIDPSNNAWLNSRKPLAAAWETLDGKNKFFTVNVHFTSKGGGSSIEG DARPPVNGGVATREAQAKLVAKEFTSSILAEDSTAKIIVSGDFNEFTFAQPLETFLAESGLEDLDEVA GIAATERYTYLYDMNCQQLDHMFVSPALATGAQMHHLHVNTWVSFDDQASDHDPTVALLNVCS SEQ ID No 37
ATGGATTCCACGAAACCTACAGACAACCCTTCGCTCCAGGATCCCAAATACATCGAGTTTCCTGCCCT TCCGAGCGATGCAAAGCATGCAGATGGTACCCTTGCACTGAATCGGCACTCAACACATATAACCCGTG GTCATGATTTCCCTGGTGCAAAGGTTTGTCTAGGATACCGAGCCTGCGGGTTTCATGGACTAACTGGC ATGCATTCTTAGGCCATGCTTTATGCTGCCGGCGTCCCGGATAAGGAGTCTATGGCGAAGAGTCCCCA TGTTGGAATCGCAGGTTCAACCTCCTCAACCCATTTCGCAGACTTTCCGCTAACTGATAAGGTGTTTG GTGGGAGGGAAACCCATGCAATATGCACCTCCTGGACCTTGCTACGACCGCTAAGAAAGCCGTGATTG ACCGGGGTATGCTTGGATGGCAATACAATACCATTGGAGTTTCAGACGCAATCTCTATGGGTAGTGAA GGTACGCTCTTAGACTGGCATATCCAAACCGGTACCGCGTTGTGGAAGTGGAGGCTTATTATTGCGCA GGCATGAGATTCTCGCTTCAATCCCGTGAAATCATTGCCGATAGTGTCGAAACCGTTACGTGTGCGCA ATATCACGATGCCTGTATCGCCATTCCGGGTTGTGATAAGAACATGCCCGGTGTTGTTATGGGTATGG CCAGACATAACCGACCGTCCCTGATGATATACGGTGGTACAATTCAGAAGGGTTACTCGCAGTCACTT CGGAAGAACATCAGCGTGTCTTCGTGCTTCGAAGCTGCGGGCGCATACGCATATGATACCCTGCGCCA GCCTGACGATGGAGGCGACACGAGCCTAACAAAGGACGAAATCATGGACGATCTGGAAAAGCACGCTT GTCCTAGCGCAGGTGCATGTGGAGGAATGTTTACCGCAAACACAATGGCCACCGCAATTGAATCCATG GGGTTGACTCTGCCGGGTTCATCGTCGACGCCTGCTTCGTCGCCGACCAAGATGCGAGAATGTGTCAA GGCAGCCGATGCTATTAAAACCTGCCTGGAGAAAAACATTCGTCCGCGTGACTTGCTCACCAAAAGGT CCTTTGAGAACGCTCTTGTTATGACTATGGCATTAGGAGGAAGCACCAATGGTGTGCTTCATTTCCTG GCTATGGCCAGAACCGCGGGCGTTGATCTCACCTTGGATGACGTCCAAAGAGTTAGCAACAAGATTCC ATTCATTGCAGACCTTTCCCCAAGTGGAAAATACTTTATGGCCGACCTTTACGAAATCGGAGGCATTC CCTCCGTCCATAAGCTGTTGATTGCGGCTGGCCTCATCGACGGTGGTATTCCCACCGTAACCGGCAAG ACTCTAGCTGAAAATGTGGCCTCATATCCATCTCTTCCCGACGATCAGGTCATTATCCGTCCTTTGAA CAACCCCATCAAGCCGACTGGTCATCTCCAGATCCTAAAGGGTAATCTTGCTCCAGGTGGCGCGGTGG CCAAGATTACCGGCAAAGAAGGAACCAAATTCACCGGAAAAGCGCGCGTTTTCGACAAGGAGTACCAG CTTAACGATGCCTTGACGCAGGGCAAGATCCCCCGCGGAGAAAACCTTGTGCTCATCGTTCGCTACGA AGGTCCAAAGGGAGGACCAGGTATGCCAGAACAGCTCAAAGCCAGTGCGGCGCTTATGGGTGCAAAAC TGACCAACGTGGCATTGATTACGGATGGTCGGTACTCTGGGGCCTCGCATGGCTTTATCGTTGGGCAT ATTGTGCCAGAGGCAGCAGTGGGAGGCCCCATTGCCGTTGTTCGTGACGGGGACTCTGTCACTATCAA CGCGGAGACCAACGAACTCAGCATGGATGTTTCGGATGAGGAGATTCAACGGCGGTTGAAAGAATGGA AGCCACCGGCTCCTACTGTGACGCGCGGTGTGCTTGCGAAGTATGCCCGGCTGGTAGGGGATGCTTCG CACGGTGCCATGACAGATCTGTTTTGA
SEQ ID No. 38 ATGGATTCCACGAAACCTACAGACAACCCTTCGCTCCAGGATCCCAAATACATCGAGTTTCCTGCCCT TCCGAGCGATGCAAAGCATGCAGATGGTACCCTTGCACTGAATCGGCACTCAACACATATAACCCGTG GTCATGATTTCCCTGGTGCAAAGGCCATGCTTTATGCTGCCGGCGTCCCGGATAAGGAGTCTATGGCG AAGAGTCCCCATGTTGGAATCGCAGGTGTTTGGTGGGAGGGAAACCCATGCAATATGCACCTCCTGGA CCTTGCTACGACCGCTAAGAAAGCCGTGATTGACCGGGGTATGCTTGGATGGCAATACAATACCATTG GAGTTTCAGACGCAATCTCTATGGGTAGTGAAGGCATGAGATTCTCGCTTCAATCCCGTGAAATCATT GCCGATAGTGTCGAAACCGTTACGTGTGCGCAATATCACGATGCCTGTATCGCCATTCCGGGTTGTGA TAAGAACATGCCCGGTGTTGTTATGGGTATGGCCAGACATAACCGACCGTCCCTGATGATATACGGTG GTACAATTCAGAAGGGTTACTCGCAGTCACTTCGGAAGAACATCAGCGTGTCTTCGTGCTTCGAAGCT GCGGGCGCATACGCATATGATACCCTGCGCCAGCCTGACGATGGAGGCGACACGAGCCTAACAAAGGA CGAAATCATGGACGATCTGGAAAAGCACGCTTGTCCTAGCGCAGGTGCATGTGGAGGAATGTTTACCG CAAACACAATGGCCACCGCAATTGAATCCATGGGGTTGACTCTGCCGGGTTCATCGTCGACGCCTGCT TCGTCGCCGACCAAGATGCGAGAATGTGTCAAGGCAGCCGATGCTATTAAAACCTGCCTGGAGAAAAA CATTCGTCCGCGTGACTTGCTCACCAAAAGGTCCTTTGAGAACGCTCTTGTTATGACTATGGCATTAG GAGGAAGCACCAATGGTGTGCTTCATTTCCTGGCTATGGCCAGAACCGCGGGCGTTGATCTCACCTTG GATGACGTCCAAAGAGTTAGCAACAAGATTCCATTCATTGCAGACCTTTCCCCAAGTGGAAAATACTT TATGGCCGACCTTTACGAAATCGGAGGCATTCCCTCCGTCCATAAGCTGTTGATTGCGGCTGGCCTCA TCGACGGTGGTATTCCCACCGTAACCGGCAAGACTCTAGCTGAAAATGTGGCCTCATATCCATCTCTT CCCGACGATCAGGTCATTATCCGTCCTTTGAACAACCCCATCAAGCCGACTGGTCATCTCCAGATCCT AAAGGGTAATCTTGCTCCAGGTGGCGCGGTGGCCAAGATTACCGGCAAAGAAGGAACCAAATTCACCG GAAAAGCGCGCGTTTTCGACAAGGAGTACCAGCTTAACGATGCCTTGACGCAGGGCAAGATCCCCCGC GGAGAAAACCTTGTGCTCATCGTTCGCTACGAAGGTCCAAAGGGAGGACCAGGTATGCCAGAACAGCT CAAAGCCAGTGCGGCGCTTATGGGTGCAAAACTGACCAACGTGGCATTGATTACGGATGGTCGGTACT CTGGGGCCTCGCATGGCTTTATCGTTGGGCATATTGTGCCAGAGGCAGCAGTGGGAGGCCCCATTGCC GTTGTTCGTGACGGGGACTCTGTCACTATCAACGCGGAGACCAACGAACTCAGCATGGATGTTTCGGA TGAGGAGATTCAACGGCGGTTGAAAGAATGGAAGCCACCGGCTCCTACTGTGACGCGCGGTGTGCTTG CGAAGTATGCCCGGCTGGTAGGGGATGCTTCGCACGGTGCCATGACAGATCTGTTTTGA
SEQ ID No. 39
MDSTKPTDNPSLQDPKYIEFPALPSDAKHADGTLALNRHSTHITRGHDFPGAKAMLYAAGVPDKESMA
KSPHVGIAGVWWEGNPCNMHLLDLATTAKKAVIDRGMLGWQYNTIGVSDAISMGSEGMRFSLQSREII ADSVETVTCAQYHDACIAIPGCDKNMPGVVMGMARHNRPSLMIYGGTIQKGYSQSLRKNISVSSCFEA
AGAYAYDTLRQPDDGGDTSLTKDEIMDDLEKHACPSAGACGGMFTANTMATAIESMGLTLPGSSSTPA SSPTKMRECVKAADAIKTCLEKNIRPRDLLTKRSFENALVMTMALGGSTNGVLHFLAMARTAGVDLTL
DDVQRVSNKIPFIADLSPSGKYFMADLYEIGGIPSVHKLLIAAGLIDGGIPTVTGKTLAENVASYPSL PDDQVIIRPLNNPIKPTGHLQILKGNLAPGGAVAKITGKEGTKFTGKARVFDKEYQLNDALTQGKIPR GENLVLIVRYEGPKGGPGMPEQLKASAALMGAKLTNVALITDGRYSGASHGFIVGHIVPEAAVGGPIA VVRDGDSVTINAETNELSMDVSDEEIQRRLKEWKPPAPTVTRGVLAKYARLVGDASHGAMTDLF
SEQ ID No. 40 ATGCTTCTTTCACAGACTCGGGGCCGCCTGCCCTCTGCTCTCCGCAGCTTGGCCAACAGAGCTGCTAT GTAGGTCACCTCCCATTGTACAGCTAATCAATGGTTTTACGTCTTGGTTGTACGTCTGCTAAGTGAAA
ATGCGTCCAGTCGTCCAATCTCTACTACACTCCCCCGCCAAAAAGCGTCGCCAAAAGATGATGAGCCC GTCCTCAACAAGGTCTCCCGCCATATCACACAGCCGGTGTCCCAGGGTGCTTCCCAGGCGATGCTGTA CGCTACGGGTCTTACTGAGGCCGACATGAACAAGGCCCAGGTTGGTATTTCCTCGGTCTGGTACAACG GCAACCCTTGCAACATGCACCTCCTCGACCTGAACAACCGTGTCCGCGAGGGTGTGCAAAAGGCTGGC CTTATCGGGTACCAGTTCAACACCATTGGTGTCAGTGATGGAATCAGTATGGGTACCAGTGGTATGCG TTACTCGCTTCAGAGCCGTGACCTCATCGCCGACTCTATCGAGACCGTCATGGGTGGTCAGTGGTACG ACGCCAACATCAGCATCCCCGGTTGCGACAAGAACATGCCCGGTGTTTTGATGGCTATGGGACGAGTC AACCGCCCCAGTTTGATGGTTTACGGTGGAACCATCAAGCCCGGCTGCGCATCCATGCAGGGCAACGC TGACATCGATATCGTCTCTGCCTTCCAAGCCTACGGCCAGTTCATCAGCGGTGAGATCAACGAGCCCC AGCGCTTCGATATCATCCGCCACGCCTGCCCCGGTGGCGGCGCTTGCGGTGGAATGTACACTGCCAAC ACCATGGCCACGGCCATCGAAGTCATGGGTATGACCCTCACGGGCTCCTCGTCCAACCCGGCCGAATC GCAAGCCAAATACGACGAATGTCTTCGCGCTGGTGAAGCCATCAAGCGCCTCCTCGTCGAAGACATCC GCCCCTCCGACATCATGACTCGGCAAGCCTTCGAGAACGCCATGGTCGTTGTCAACATCACCGGCGGC TCCACCAATGCTGTCCTTCACCTCATCGCCATTGCCGACTCCGTCGGCATCAAGCTCACAATTGACGA CTTCCAAGCCGTCTCTGACCGCACCCCCTTCCTCGCAGACCTCAAGCCATCCGGCAAATACGTTATGG CCGACCTCCACAACATCGGCGGCACCCCCTCCCTCCTCAAATTCCTCCTCAAGGAAGGCGTCATTGAT GGCTCCGGCATCACAGTTACCGGTGAAACTCTCGCCAAGAACCTCGAGAAAGTCCCCGATTTCCCCGA GGACCAGAAAATCATTCGCCCCTTCTCCAACCCCATCAAGGAAACAGGCCACATCCAGATCCTGCGCG GTTCGCTCGCGCCGGGCGGTTGCGTTGGTAAGATTACCGGTAAGGAGGGAACCGTTTTCACGGGCAAG GCCCGCGTCTTTAACCACGAAGACGACTTCATTGCCGCCCTGGAGCGCAAGGAAATCACCAAGGATGA GCAAACTGTCGTTGTGATTCGCTACACCGGTCCCAAGGGTGGTCCCGGTATGCCTGGTATGCACTCTC TAGCAACCTGACCCTTCAATTCTCATCCAACCTGCTAATTCACCTTTCTAGAAATGCTCAAGCCTTCA AGCGCCCTCATGGGTGCTGGCCTCGGCCAAACCTGCGCCCTGATCACAGACGGACGCTTCTCCGGTGG TTCGCACGGCTTCCTTATTGGACACATCGTCCCAGAGGCTGCCGTCGGTGGGCCGATTGGTCTCGTAC ACGACGGCGACGTGATCACCATTGATGCCGAGAAGCGCGTTTTGGACCTTGACGTTGACGAGGCGGAA CTCGCTAAGCGACGGAAGCAGTGGGAGGCTGATAAGGCGGCGGGCAAGTTGCCCCAGACCGGGTTGAA CTTGCGCGGGACGCTTGGAAAGTATGCCCGGAATGTCAAGGATGCTAGTTCCGGGTGTATTACGGACG CTTTCGATTAA
SEQ ID No. 41
ATGCTTCTTTCACAGACTCGGGGCCGCCTGCCCTCTGCTCTCCGCAGCTTGGCCAACAGAGCTGCTAT TCGTCCAATCTCTACTACACTCCCCCGCCAAAAAGCGTCGCCAAAAGATGATGAGCCCGTCCTCAACA AGGTCTCCCGCCATATCACACAGCCGGTGTCCCAGGGTGCTTCCCAGGCGATGCTGTACGCTACGGGT CTTACTGAGGCCGACATGAACAAGGCCCAGGTTGGTATTTCCTCGGTCTGGTACAACGGCAACCCTTG CAACATGCACCTCCTCGACCTGAACAACCGTGTCCGCGAGGGTGTGCAAAAGGCTGGCCTTATCGGGT ACCAGTTCAACACCATTGGTGTCAGTGATGGAATCAGTATGGGTACCAGTGGTATGCGTTACTCGCTT CAGAGCCGTGACCTCATCGCCGACTCTATCGAGACCGTCATGGGTGGTCAGTGGTACGACGCCAACAT CAGCATCCCCGGTTGCGACAAGAACATGCCCGGTGTTTTGATGGCTATGGGACGAGTCAACCGCCCCA GTTTGATGGTTTACGGTGGAACCATCAAGCCCGGCTGCGCATCCATGCAGGGCAACGCTGACATCGAT ATCGTCTCTGCCTTCCAAGCCTACGGCCAGTTCATCAGCGGTGAGATCAACGAGCCCCAGCGCTTCGA TATCATCCGCCACGCCTGCCCCGGTGGCGGCGCTTGCGGTGGAATGTACACTGCCAACACCATGGCCA CGGCCATCGAAGTCATGGGTATGACCCTCACGGGCTCCTCGTCCAACCCGGCCGAATCGCAAGCCAAA TACGACGAATGTCTTCGCGCTGGTGAAGCCATCAAGCGCCTCCTCGTCGAAGACATCCGCCCCTCCGA CATCATGACTCGGCAAGCCTTCGAGAACGCCATGGTCGTTGTCAACATCACCGGCGGCTCCACCAATG CTGTCCTTCACCTCATCGCCATTGCCGACTCCGTCGGCATCAAGCTCACAATTGACGACTTCCAAGCC GTCTCTGACCGCACCCCCTTCCTCGCAGACCTCAAGCCATCCGGCAAATACGTTATGGCCGACCTCCA CAACATCGGCGGCACCCCCTCCCTCCTCAAATTCCTCCTCAAGGAAGGCGTCATTGATGGCTCCGGCA TCACAGTTACCGGTGAAACTCTCGCCAAGAACCTCGAGAAAGTCCCCGATTTCCCCGAGGACCAGAAA ATCATTCGCCCCTTCTCCAACCCCATCAAGGAAACAGGCCACATCCAGATCCTGCGCGGTTCGCTCGC GCCGGGCGGTTGCGTTGGTAAGATTACCGGTAAGGAGGGAACCGTTTTCACGGGCAAGGCCCGCGTCT TTAACCACGAAGACGACTTCATTGCCGCCCTGGAGCGCAAGGAAATCACCAAGGATGAGCAAACTGTC GTTGTGATTCGCTACACCGGTCCCAAGGGTGGTCCCGGTATGCCTGAAATGCTCAAGCCTTCAAGCGC CCTCATGGGTGCTGGCCTCGGCCAAACCTGCGCCCTGATCACAGACGGACGCTTCTCCGGTGGTTCGC ACGGCTTCCTTATTGGACACATCGTCCCAGAGGCTGCCGTCGGTGGGCCGATTGGTCTCGTACACGAC GGCGACGTGATCACCATTGATGCCGAGAAGCGCGTTTTGGACCTTGACGTTGACGAGGCGGAACTCGC TAAGCGACGGAAGCAGTGGGAGGCTGATAAGGCGGCGGGCAAGTTGCCCCAGACCGGGTTGAACTTGC GCGGGACGCTTGGAAAGTATGCCCGGAATGTCAAGGATGCTAGTTCCGGGTGTATTACGGACGCTTTC GATTAA
SEQ ID No. 42
MLLSQTRGRLPSALRSLANRAAIRPISTTLPRQKASPKDDEPVLNKVSRHITQPVSQGASQAMLYATG LTEADMNKAQVGISSVWYNGNPCNMHLLDLNNRVREGVQKAGLIGYQFNTIGVSDGISMGTSGMRYSL QSRDLIADSIETVMGGQWYDANISIPGCDKNMPGVLMAMGRVNRPSLMVYGGTIKPGCASMQGNADID IVSAFQAYGQFISGEINEPQRFDIIRHACPGGGACGGMYTANTMATAIEVMGMTLTGSSSNPAESQAK YDECLRAGEAIKRLLVEDIRPSDIMTRQAFENAMVVVNITGGSTNAVLHLIAIADSVGIKLTIDDFQA VSDRTPFLADLKPSGKYVMADLHNIGGTPSLLKFLLKEGVIDGSGITVTGETLAKNLEKVPDFPEDQK IIRPFSNPIKETGHIQILRGSLAPGGCVGKITGKEGTVFTGKARVFNHEDDFIAALERKEITKDEQTV VVIRYTGPKGGPGMPEMLKPSSALMGAGLGQTCALITDGRFSGGSHGFLIGHIVPEAAVGGPIGLVHD GDVITIDAEKRVLDLDVDEAELAKRRKQWEADKAAGKLPQTGLNLRGTLGKYARNVKDASSGCITDAF D
SEQ ID No. 43 ATGCTTTCCCGGTCACTGCTGCGCTCTAGAGCTGTGGGAGCTTTCCCCCTCTCTGCCAGGAATCATGG GTATGTCTACCATCATCCGTTGCCAACGATCAACAACATCCTGCACAAGGCCAATTCACTTCTGAATA CTAACCCATCTTCCATTACAGCCGCTTCCTGTCTACCACCTCCATCCGCTCCGATGACAAGCTCAACA AGATCTCCTCCAACATCACTCAGCCCAAGGCCCAGGGTGCTTCCCAGGCTATGCTCTACGCCACCGGC CTCTCTGAAGCTGACATGAACAAGGCTCAAGTTGGCATCTCGTCCGTCTGGTACGAGGGAAACCCTTG CAACATGCACCTTATGGACCTCTCTGCCCACGTCAAGGAGTCTGTCGCCAAGGCCGGTCTTATTCCCT ACCGATTCAACACCATCGGTGTCTCTGATGGTATCTCCATGGGTACCACTGGTATGCGATATTCCCTC CAGAGCCGAGAGATTATTGCCGATAGTGTTGAGACTGTCATGAACGGTCAATGGTATGACGCCAACGT CAGCTTGCCCGGTTGCGACAAGAACATGCCCGGTGTTGCCATTGCCATGGGTCGTGTCAACCGTCCCA GCATCATGGTTTACGGTGGTACCATTAAGCCCGGTTGCACCAAGCAGGGCGAGTCTATCGATATCGTC TCTGCTTTCCAGGCCTACGGTCAATACATCACCGGCGAGATCACCGAGGAGCAGCGATTCGACATTAT CCGAAATGCCTGCCCTGGTGGTGGTGCTTGTGGTGGCATGTACACTGCCAACACCATGGCTACTGCCA TTGAGACTCTGGGACTTACCCTCCCTGGTAGCAGCAGCAGCCCTGCTGAGGACCCCAGCAAGATCGCC GAGTGTGAGGCTGTTGGACCTGCTATCCGCAACATTCTCAAGGAAGATATCCGACCTCGTGACATCAT GACTCGCCAAGCCTTTGAGAATGCCATGATCGTCACTACCATCCTTGGTGGCAGCACCAACGCTGTTC TGCATCTTATCGCAATTGCCGACTCTGTCGGTATCAAGCTCGACATCGAGGACTTCCAAAAGGTTTCC GACCGCACTCCCTTCCTTGCCGACCTGAAGCCCTCTGGAAAGTGGGTCATGGCCGATATGCACAAGAT TGGTGGTACTCCTGCTCTTCTCAAGTTCCTCTTGAAGGAGGGTATCATTGACGGCTCTGGTATCACTG TCACTGGCAAGACCATGAAGCAGAACGTCGAGGAGTTGCCTGGATTCCCCGAGGATCAAACCATCATT CGCCCCCTTAGCAACCCCATTAAGCCTACCGGTCACATCCAGATTCTCCGTGGATCCCTGGCTCCTGG TGGCTGTGTTGGTAAGATCACTGGCAAGGAGGGTCTCCGATTCGAGGGTAAGGCCCGTGTCTACGACT CCGAGCCCGCCTTCATCTCTAGCCTTGAGGCTGGTGAGATCAAGAAGGGTGAGAAGACTGTCGTTATC ATCCGATATGATGGACCCAAGGGTGGCCCCGGTATGCCTGAGATGCTGAAGCCTTCTTCTGCCATTAT GGGTGCTGGCCTTGGACAGGATGTCGCCCTTCTCACTGACGGTCGCTTCTCTGGTGGTTCTCACGGTT TCATTATTGGTCACATTGTCCCCGAGGCAATGGAGGGTGGCCCTATCGCCCTTGTCGAGGACGGTGAC ACCATCGTTATCGACGCCGAGTCTCGTGCTATCGACCTCGTTGTTCCCGAGGCAGAGGTTGATCGCCG TCGCAAGGCCTGGAAGGCTCCCGCTCCCCGATACACCAAGGGCACACTCAGCAAGTACGCTCGACTGG TGACCAACGCCAGTGAGGGCTGTGTCACCGATAGCGGTCTCAAGAACTAA
SEQ ID No. 44 ATGCTTTCCCGGTCACTGCTGCGCTCTAGAGCTGTGGGAGCTTTCCCCCTCTCTGCCAGGAATCATGG CCGCTTCCTGTCTACCACCTCCATCCGCTCCGATGACAAGCTCAACAAGATCTCCTCCAACATCACTC AGCCCAAGGCCCAGGGTGCTTCCCAGGCTATGCTCTACGCCACCGGCCTCTCTGAAGCTGACATGAAC AAGGCTCAAGTTGGCATCTCGTCCGTCTGGTACGAGGGAAACCCTTGCAACATGCACCTTATGGACCT CTCTGCCCACGTCAAGGAGTCTGTCGCCAAGGCCGGTCTTATTCCCTACCGATTCAACACCATCGGTG TCTCTGATGGTATCTCCATGGGTACCACTGGTATGCGATATTCCCTCCAGAGCCGAGAGATTATTGCC GATAGTGTTGAGACTGTCATGAACGGTCAATGGTATGACGCCAACGTCAGCTTGCCCGGTTGCGACAA GAACATGCCCGGTGTTGCCATTGCCATGGGTCGTGTCAACCGTCCCAGCATCATGGTTTACGGTGGTA CCATTAAGCCCGGTTGCACCAAGCAGGGCGAGTCTATCGATATCGTCTCTGCTTTCCAGGCCTACGGT CAATACATCACCGGCGAGATCACCGAGGAGCAGCGATTCGACATTATCCGAAATGCCTGCCCTGGTGG TGGTGCTTGTGGTGGCATGTACACTGCCAACACCATGGCTACTGCCATTGAGACTCTGGGACTTACCC TCCCTGGTAGCAGCAGCAGCCCTGCTGAGGACCCCAGCAAGATCGCCGAGTGTGAGGCTGTTGGACCT GCTATCCGCAACATTCTCAAGGAAGATATCCGACCTCGTGACATCATGACTCGCCAAGCCTTTGAGAA TGCCATGATCGTCACTACCATCCTTGGTGGCAGCACCAACGCTGTTCTGCATCTTATCGCAATTGCCG ACTCTGTCGGTATCAAGCTCGACATCGAGGACTTCCAAAAGGTTTCCGACCGCACTCCCTTCCTTGCC GACCTGAAGCCCTCTGGAAAGTGGGTCATGGCCGATATGCACAAGATTGGTGGTACTCCTGCTCTTCT CAAGTTCCTCTTGAAGGAGGGTATCATTGACGGCTCTGGTATCACTGTCACTGGCAAGACCATGAAGC AGAACGTCGAGGAGTTGCCTGGATTCCCCGAGGATCAAACCATCATTCGCCCCCTTAGCAACCCCATT AAGCCTACCGGTCACATCCAGATTCTCCGTGGATCCCTGGCTCCTGGTGGCTGTGTTGGTAAGATCAC TGGCAAGGAGGGTCTCCGATTCGAGGGTAAGGCCCGTGTCTACGACTCCGAGCCCGCCTTCATCTCTA GCCTTGAGGCTGGTGAGATCAAGAAGGGTGAGAAGACTGTCGTTATCATCCGATATGATGGACCCAAG GGTGGCCCCGGTATGCCTGAGATGCTGAAGCCTTCTTCTGCCATTATGGGTGCTGGCCTTGGACAGGA TGTCGCCCTTCTCACTGACGGTCGCTTCTCTGGTGGTTCTCACGGTTTCATTATTGGTCACATTGTCC CCGAGGCAATGGAGGGTGGCCCTATCGCCCTTGTCGAGGACGGTGACACCATCGTTATCGACGCCGAG TCTCGTGCTATCGACCTCGTTGTTCCCGAGGCAGAGGTTGATCGCCGTCGCAAGGCCTGGAAGGCTCC CGCTCCCCGATACACCAAGGGCACACTCAGCAAGTACGCTCGACTGGTGACCAACGCCAGTGAGGGCT GTGTCACCGATAGCGGTCTCAAGAACTAA
SEQ ID No. 45 MLSRSLLRSRAVGAFPLSARNHGRFLSTTSIRSDDKLNKISSNITQPKAQGASQAMLYATGLSEADMN KAQVGISSVWYEGNPCNMHLMDLSAHVKESVAKAGLIPYRFNTIGVSDGISMGTTGMRYSLQSREIIA DSVETVMNGQWYDANVSLPGCDKNMPGVAIAMGRVNRPSIMVYGGTIKPGCTKQGESIDIVSAFQAYG QYITGEITEEQRFDIIRNACPGGGACGGMYTANTMATAIETLGLTLPGSSSSPAEDPSKIAECEAVGP AIRNILKEDIRPRDIMTRQAFENAMIVTTILGGSTNAVLHLIAIADSVGIKLDIEDFQKVSDRTPFLA DLKPSGKWVMADMHKIGGTPALLKFLLKEGIIDGSGITVTGKTMKQNVEELPGFPEDQTIIRPLSNPI KPTGHIQILRGSLAPGGCVGKITGKEGLRFEGKARVYDSEPAFISSLEAGEIKKGEKTVVIIRYDGPK GGPGMPEMLKPSSAIMGAGLGQDVALLTDGRFSGGSHGFIIGHIVPEAMEGGPIALVEDGDTIVIDAE SRAIDLVVPEAEVDRRRKAWKAPAPRYTKGTLSKYARLVTNASEGCVTDSGLKN
SEQ ID No. 46
ATGGCGGACCAAGTCACTCACGATCCCAAGCAGTCAAGCGACTACATCCCCTTCCCTTGCCTTCCTCC CGGCGGAGCTCTCAACCGTTGGTCTACAAAGATCACCCGCGAGCATGACTACCCCGGAGCTCAGGTGC GTTATATCTCCCAAACTCGCTGCTCGCCACTGACATTGCTTGATGTATAGGCTATGCTCTACGGAGCT GGTGTCAAGGACCAGCACACAATGAAGAACGCGCCCCAGGTTGGTGTTGCTACCGTCTGGTGGCAAGG AAACCCGTGCAAGTGAGTTTGAAGTCAATTGAATATGTACATTAAGATGAGATGCTAACGAGTTTGCG CAGTACCCATCGTAAGTGACATAGATAAATAATTTAGGAAAACATACTTATGAATGATAGTCCTTGAT CTTGGCCAGATCGTCAAGAACTCCATCGAGAAGGAAGGCATGATCGGCTGGCAGTTCAACACCGTTGG TGTATCTGACGCCATCACCATGGGCGGCGAGGGTAAGTAAACACTCGCAATACATAACAATCTCACTC GCTCACTCACCCCACAGGCATGCGCTTCTCTCTTCAAACTCGAGAAATCATTGCTGATTCTATCGAGT CCGTAACCTGCGCCCAGCATCATGACGCCAACATCTCCATTCCCGGCTGTGACAAGAACATGCCCGGC ACAGTCATGGCCGCCGCTCGTCACAACCGCCCCTTCATCATGATCTACGGCGGTACCATCCGCAAGGG CCACTCCAACCTCCTTGAGAAGCCCATCAACATCAGCACCTGCTACGAGGCCTCGGGTGCTTTCAACT ACGGTCGTCTGCATGCCAAGACGAACCCCGGCGAGCCTGGTCGCGAGAGCTCCGATGTCATGGATGAT ATCGAGAAGCACGCTTGTCCCGGCGCCGGAGCTTGTGGTGGCATGTATACAGCCAACACCATGGCTAC CGCTATCGAGGCCATGGGCCTTACTCTGCCTGGTTCATCATCGTACCCTGCCGAGTCTCCTGAGAAGC GTCGTGAGTGTGAGCGCGCTGCCCAGGTTATCCGAACTACCATGGAGAAGGACCTTCGTCCTCGCGAT ATCATGACCCGTGCCTCTTTCGAGAACGCTCTTGTCCTGACCATGATTCTCGGTGGTTCAACAAACGG TGTTCTTCACTTCCTCGCCATGGCCAACACCGCCGATGTTCCCCTGACCATTGACGACATCCAGCGTG CCAGTGACCGCACCCCCTTCCTCGCTGATCTCGCCCCCAGTGGAAAGTACTACATGGAGGATCTCTAC AAGGTCGGCGGTACACCCTCCGTCATCAAGATGCTCGTCGCCCGTGGTCTTCTCGACGGTAGCATCAT GACCATTACCGGCAAGACTCTCGCCGAGAACGTCGCGGACTGGCCTAGTCTGGACCCCGGCCAGGACA TCATCCGTCCTCTTGAGAACCCCATCAAGGACTCTGGCCACATCCGCATCCTAAAGGGTAACTTTGCC CCCGGCGGCGCCGTCGCTAAGATCACTGGAAAGGAGGGTCTGTCCTTCACCGGCAAGGCCCGTGTCTT CAACACCGAGAAGGAGCTCAACGGCGCACTGAACCGAAGCGAGATCAAGCAGTCCGATGGTAACCTCG TCGTCATCGTCCGATACGAGGGCCCCAAGGGCGGTCCCGGTATGCCTGAACAGCTCAAGGCTTCTGCA GCCATCATGGGTGCCGGCCTCTCCAACCTGGCACTTGTCACCGACGGACGATACAGTGGTGCTTCTCA CGGTTTCATCGTGGGCCACGTTGTGCCTGAGGCTATGGTCGGAGGTCCCATCGCTCTGGTCAAGGATG GAGACGAGATCACTATCGATGCGATTAACAACCGAATCGATGTCGACATCACTGACGAGGAGATGGAG AAGCGAAGGAGTGAGTGGAAGCCTCCTGCGCCCCGTGTTACGAGGGGTGTGTTGGCCAAGTATGCCCG CTTGGTCGGTGATGCTTCCCACGGTGCTGTAACAGATCAGTGGTAG
SEQ ID No. 47 ATGGCGGACCAAGTCACTCACGATCCCAAGCAGTCAAGCGACTACATCCCCTTCCCTTGCCTTCCTCC CGGCGGAGCTCTCAACCGTTGGTCTACAAAGATCACCCGCGAGCATGACTACCCCGGAGCTCAGGCTA TGCTCTACGGAGCTGGTGTCAAGGACCAGCACACAATGAAGAACGCGCCCCAGGTTGGTGTTGCTACC GTCTGGTGGCAAGGAAACCCGTGCAATACCCATCTCCTTGATCTTGGCCAGATCGTCAAGAACTCCAT CGAGAAGGAAGGCATGATCGGCTGGCAGTTCAACACCGTTGGTGTATCTGACGCCATCACCATGGGCG GCGAGGGCATGCGCTTCTCTCTTCAAACTCGAGAAATCATTGCTGATTCTATCGAGTCCGTAACCTGC GCCCAGCATCATGACGCCAACATCTCCATTCCCGGCTGTGACAAGAACATGCCCGGCACAGTCATGGC CGCCGCTCGTCACAACCGCCCCTTCATCATGATCTACGGCGGTACCATCCGCAAGGGCCACTCCAACC TCCTTGAGAAGCCCATCAACATCAGCACCTGCTACGAGGCCTCGGGTGCTTTCAACTACGGTCGTCTG CATGCCAAGACGAACCCCGGCGAGCCTGGTCGCGAGAGCTCCGATGTCATGGATGATATCGAGAAGCA CGCTTGTCCCGGCGCCGGAGCTTGTGGTGGCATGTATACAGCCAACACCATGGCTACCGCTATCGAGG CCATGGGCCTTACTCTGCCTGGTTCATCATCGTACCCTGCCGAGTCTCCTGAGAAGCGTCGTGAGTGT GAGCGCGCTGCCCAGGTTATCCGAACTACCATGGAGAAGGACCTTCGTCCTCGCGATATCATGACCCG TGCCTCTTTCGAGAACGCTCTTGTCCTGACCATGATTCTCGGTGGTTCAACAAACGGTGTTCTTCACT TCCTCGCCATGGCCAACACCGCCGATGTTCCCCTGACCATTGACGACATCCAGCGTGCCAGTGACCGC ACCCCCTTCCTCGCTGATCTCGCCCCCAGTGGAAAGTACTACATGGAGGATCTCTACAAGGTCGGCGG TACACCCTCCGTCATCAAGATGCTCGTCGCCCGTGGTCTTCTCGACGGTAGCATCATGACCATTACCG GCAAGACTCTCGCCGAGAACGTCGCGGACTGGCCTAGTCTGGACCCCGGCCAGGACATCATCCGTCCT CTTGAGAACCCCATCAAGGACTCTGGCCACATCCGCATCCTAAAGGGTAACTTTGCCCCCGGCGGCGC CGTCGCTAAGATCACTGGAAAGGAGGGTCTGTCCTTCACCGGCAAGGCCCGTGTCTTCAACACCGAGA AGGAGCTCAACGGCGCACTGAACCGAAGCGAGATCAAGCAGTCCGATGGTAACCTCGTCGTCATCGTC CGATACGAGGGCCCCAAGGGCGGTCCCGGTATGCCTGAACAGCTCAAGGCTTCTGCAGCCATCATGGG TGCCGGCCTCTCCAACCTGGCACTTGTCACCGACGGACGATACAGTGGTGCTTCTCACGGTTTCATCG TGGGCCACGTTGTGCCTGAGGCTATGGTCGGAGGTCCCATCGCTCTGGTCAAGGATGGAGACGAGATC ACTATCGATGCGATTAACAACCGAATCGATGTCGACATCACTGACGAGGAGATGGAGAAGCGAAGGAG TGAGTGGAAGCCTCCTGCGCCCCGTGTTACGAGGGGTGTGTTGGCCAAGTATGCCCGCTTGGTCGGTG ATGCTTCCCACGGTGCTGTAACAGATCAGTGGTAG
SEQ ID No. 48
MADQVTHDPKQSSDYIPFPCLPPGGALNRWSTKITREHDYPGAQAMLYGAGVKDQHTMKNAPQVGVAT VWWQGNPCNTHLLDLGQIVKNSIEKEGMIGWQFNTVGVSDAITMGGEGMRFSLQTREIIADSIESVTC
AQHHDANISIPGCDKNMPGTVMAAARHNRPFIMIYGGTIRKGHSNLLEKPINISTCYEASGAFNYGRL
HAKTNPGEPGRESSDVMDDIEKHACPGAGACGGMYTANTMATAIEAMGLTLPGSSSYPAESPEKRREC
ERAAQVIRTTMEKDLRPRDIMTRASFENALVLTMILGGSTNGVLHFLAMANTADVPLTIDDIQRASDR
TPFLADLAPSGKYYMEDLYKVGGTPSVIKMLVARGLLDGSIMTITGKTLAENVADWPSLDPGQDIIRP LENPIKDSGHIRILKGNFAPGGAVAKITGKEGLSFTGKARVFNTEKELNGALNRSEIKQSDGNLVVIV
RYEGPKGGPGMPEQLKASAAIMGAGLSNLALVTDGRYSGASHGFIVGHVVPEAMVGGPIALVKDGDEI
TIDAINNRIDVDITDEEMEKRRSEWKPPAPRVTRGVLAKYARLVGDASHGAVTDQW
SEQ ID No. 49 ATGCAACCCAGCGGCCGCTACATGATGGAGGACCTGTACCGCGTCGGAGGCACCCCGTCCGTGCTCAA GATGCTTATCGCCGCAGGGTTGATCGACGGCACGATCCCGACGGTGACGGGTAAAACCCTCGCCGAGA ACGTCGAGTCCTGGCCGTCCCTCGACCCCGGGCAGGACATTATCCGCCCGCTCTCGGACCCCGTCAAG GCCACGGGACACATTCGCATCCTGCGCGGCAACCTGGCCCCCGGCGGCGCCGTGGCCAAAATCACTGG CAAGGAGGGTATTTCCTTTACAGGCCGGGCACGCGTCTTCAACAAGGAGCACGAGCTGGATCACGCCC TGTCCACGAGCCAGATCAAGGCGAGCGACGGCAACCTCGTCGTCATTGTGCGCTACGAGGGCCCCAAG GGCGGACCGGGCATGCCCGAGCAACTGCGCGCCTCGGCAGCCATCATGGGCGCCGGCCTCTCCAACGT CGCCCTCGTAACGGACGGCCGCTACAGCGGCGCCAGCCACGGATTCATCGTCGGCCACGTCGTGCCCG AGGCTGCCACCGGTGGGCCCATCGGGCTGGTCAAGGACGGTGACTTTGTGCGCATCGATGCCGAGACC AACAGGATCGACATTATAGGCATCGACGGCGTTGCGGCCGAGGGCGATCTGGACGCCGTCGACAAGGA GCTGGAGAGGCGCAGGGCCGAGTGGAAGAAGCCTGTCATGAAGCCGCTGAGGGGTGTCCTGGCCAAGT ATGCGAGGTTGGTCGGTGACGCGAGCCACGGTGCTGTGACGGATCAGGAGGACCCGAGCTGGTGA
SEQ ID No. 50
MQPSGRYMMEDLYRVGGTPSVLKMLIAAGLIDGTIPTVTGKTLAENVESWPSLDPGQDIIRPLSDPVK ATGHIRILRGNLAPGGAVAKITGKEGISFTGRARVFNKEHELDHALSTSQIKASDGNLVVIVRYEGPK GGPGMPEQLRASAAIMGAGLSNVALVTDGRYSGASHGFIVGHVVPEAATGGPIGLVKDGDFVRIDAET NRIDIIGIDGVAAEGDLDAVDKELERRRAEWKKPVMKPLRGVLAKYARLVGDASHGAVTDQEDPSW
SEQ ID No. 51 ATGTCCTCAAACCTGCTGCGCGCCAGGGTGCCCAAGGCCCTGGCTGCCACCAGGAGCCATGCGTATGT CATTACACAGCCAATTCTCGATACAGACATTGATATATCAAACTGTCTATACCACATCGGTTGATATA ACGATTAGCAAACTAACATACTCCAATGTCACCAGGGCCCTCTTCTCCACAACCTCGCGCCGCGCTGA GCAACTGAACAAGACTTCGGCCAAGATCACACAGCCCAAGTCCCAGGGTGCGTCACAGGCCATGCTGT ACGCCACCGGCTTGACCGAGGAGGACATGAACAAGCCGCAGGTCGGCATCTCGTCGGTCTGGTACGAG GGCAACCCCTGCAACATGCACATCCTCAAGCTGTCGGAGCGGATCCGCGACTCCGTCAAGGCCGCCAA CCTGGTCCCCATGCGCTTCAACACCATTGGAGTCTCGGACGGTATCAGCATGGGCACGACCGGCATGC GCTATTCGCTGCAGAGCCGTGAGATCATCGCCGACAGTATCGAGACCGTCATGCAGGGCCAGTGGTAC GATGCCAACATCTCCATCCCGGGCTGCGACAAGAACATGCCCGGTGTTCTCATCGCCCTCGGCCGTGT CAACCGCCCCAGTTTGATCGTTTACGGCGGTACCATCAAGCCCGGCTGCAGCCAAAAGGGCGAGCCCA TCGACATCGTCAGCGCCTTCCAGGCCTACGGCCAGTACCTGACTGGTGAGATCACCGAGGAGCAGCGG TTCGACATCATCCGGAACGCGTGCCCCGGCGGTGGTGCATGCGGTGGCATGTACACTGCCAACACTTT GGCGTCGGCTATCGAGACTCTGGGCATGAGCTTGCCGGGAAGCAGCAGCAACCCCGCAGAGCACCCCA GCAAGCTGGCCGAATGTGACCAGGTTGGCGAGGCCATCAAGAACATCCTCCGTGAGGACGTGCGCCCC CGCGACATCATGACGAGGCAGGCCTTTGAGAACGCCATGGTTGTAGTCAGCATCCTTGGTGGAAGCAC CAACGCCGTCCTGCACCTGCTTGCCGTTGCCGATGCGGTCGGTGTCAAGCTTACCATTGACGACTTCC AAGCCGTTTCCGACCGGACACCGCTCCTGGCAGACCTCAAGCCTTCGGGCAAATACGTCATGGAGGAT GTACACAAGATTGGTGGCACGCCATCTCTCCTGCGCTTCCTGGCCAAGGAGGGCTTGATCGATGCTTC TGGCATCACCGTCACTGGCAAGACCATGAAGGAGAACTTGGACAAGTACCCCGACTTCCCGGCCGACC AGCCCATCATCCGGCCTCTCAGCAACCCCCTCAAGTCCACGGGCCACATCCAGATCCTCCGCGGTAGT CTCGCACCCGGTGGCTCTGTCGGCAAGATTACCGGCAAGGAGGGTCTTCAGTTCACTGGCAAGGCCCG CTGCTACGACTGCGAGGACGACTTTATCGAGTCGCTCGAGCGTGGCGAGATCAAGAAGGGTGAGAAGA CGGTTGTCATCATCCGTTACGAGGGCCCCAAGGGTGGTCCCGGCATGCCCGAGATGCTGAAGCCCAGC TCGGCCATCATGGGTGCCGGTCTCGGCAAGGACGTTGCCCTCATCACTGACGGTCGCTTCTCTGGCGG TTCGCACGGATTCCTCATCGGACACGTTGTGCCCGAGGCCATGGAGGGCGGACCTATCGCACTTGTTC GCGATGGCGACACCATTACTATTGATGCGGAGAAGCGGGTGATCGACACGGACGTGTCGGACAAGACC ATGGCGGAGCGCCGCGCCGAGTGGAAGGCTCCCCCGATCAGGGAGACCAGGGGAACGCTGGCCAAGTA CGCAGCGCTGGTTAGCGACGCGAGCAGTGGTTGCGTGACGGACAAGGTCGCGCGATAA SEQ ID No. 52
ATGTCCTCAAACCTGCTGCGCGCCAGGGTGCCCAAGGCCCTGGCTGCCACCAGGAGCCATGCGGCCCT CTTCTCCACAACCTCGCGCCGCGCTGAGCAACTGAACAAGACTTCGGCCAAGATCACACAGCCCAAGT CCCAGGGTGCGTCACAGGCCATGCTGTACGCCACCGGCTTGACCGAGGAGGACATGAACAAGCCGCAG GTCGGCATCTCGTCGGTCTGGTACGAGGGCAACCCCTGCAACATGCACATCCTCAAGCTGTCGGAGCG GATCCGCGACTCCGTCAAGGCCGCCAACCTGGTCCCCATGCGCTTCAACACCATTGGAGTCTCGGACG GTATCAGCATGGGCACGACCGGCATGCGCTATTCGCTGCAGAGCCGTGAGATCATCGCCGACAGTATC GAGACCGTCATGCAGGGCCAGTGGTACGATGCCAACATCTCCATCCCGGGCTGCGACAAGAACATGCC CGGTGTTCTCATCGCCCTCGGCCGTGTCAACCGCCCCAGTTTGATCGTTTACGGCGGTACCATCAAGC CCGGCTGCAGCCAAAAGGGCGAGCCCATCGACATCGTCAGCGCCTTCCAGGCCTACGGCCAGTACCTG ACTGGTGAGATCACCGAGGAGCAGCGGTTCGACATCATCCGGAACGCGTGCCCCGGCGGTGGTGCATG CGGTGGCATGTACACTGCCAACACTTTGGCGTCGGCTATCGAGACTCTGGGCATGAGCTTGCCGGGAA GCAGCAGCAACCCCGCAGAGCACCCCAGCAAGCTGGCCGAATGTGACCAGGTTGGCGAGGCCATCAAG AACATCCTCCGTGAGGACGTGCGCCCCCGCGACATCATGACGAGGCAGGCCTTTGAGAACGCCATGGT TGTAGTCAGCATCCTTGGTGGAAGCACCAACGCCGTCCTGCACCTGCTTGCCGTTGCCGATGCGGTCG GTGTCAAGCTTACCATTGACGACTTCCAAGCCGTTTCCGACCGGACACCGCTCCTGGCAGACCTCAAG CCTTCGGGCAAATACGTCATGGAGGATGTACACAAGATTGGTGGCACGCCATCTCTCCTGCGCTTCCT GGCCAAGGAGGGCTTGATCGATGCTTCTGGCATCACCGTCACTGGCAAGACCATGAAGGAGAACTTGG ACAAGTACCCCGACTTCCCGGCCGACCAGCCCATCATCCGGCCTCTCAGCAACCCCCTCAAGTCCACG GGCCACATCCAGATCCTCCGCGGTAGTCTCGCACCCGGTGGCTCTGTCGGCAAGATTACCGGCAAGGA GGGTCTTCAGTTCACTGGCAAGGCCCGCTGCTACGACTGCGAGGACGACTTTATCGAGTCGCTCGAGC GTGGCGAGATCAAGAAGGGTGAGAAGACGGTTGTCATCATCCGTTACGAGGGCCCCAAGGGTGGTCCC GGCATGCCCGAGATGCTGAAGCCCAGCTCGGCCATCATGGGTGCCGGTCTCGGCAAGGACGTTGCCCT CATCACTGACGGTCGCTTCTCTGGCGGTTCGCACGGATTCCTCATCGGACACGTTGTGCCCGAGGCCA TGGAGGGCGGACCTATCGCACTTGTTCGCGATGGCGACACCATTACTATTGATGCGGAGAAGCGGGTG ATCGACACGGACGTGTCGGACAAGACCATGGCGGAGCGCCGCGCCGAGTGGAAGGCTCCCCCGATCAG GGAGACCAGGGGAACGCTGGCCAAGTACGCAGCGCTGGTTAGCGACGCGAGCAGTGGTTGCGTGACGG ACAAGGTCGCGCGATAA
SEQ ID No. 53 MSSNLLRARVPKALAATRSHAALFSTTSRRAEQLNKTSAKITQPKSQGASQAMLYATGLTEEDMNKPQ VGISSVWYEGNPCNMHILKLSERIRDSVKAANLVPMRFNTIGVSDGISMGTTGMRYSLQSREIIADSI ETVMQGQWYDANISIPGCDKNMPGVLIALGRVNRPSLIVYGGTIKPGCSQKGEPIDIVSAFQAYGQYL TGEITEEQRFDIIRNACPGGGACGGMYTANTLASAIETLGMSLPGSSSNPAEHPSKLAECDQVGEAIK NILREDVRPRDIMTRQAFENAMVVVSILGGSTNAVLHLLAVADAVGVKLTIDDFQAVSDRTPLLADLK PSGKYVMEDVHKIGGTPSLLRFLAKEGLIDASGITVTGKTMKENLDKYPDFPADQPIIRPLSNPLKST GHIQILRGSLAPGGSVGKITGKEGLQFTGKARCYDCEDDFIESLERGEIKKGEKTVVIIRYEGPKGGP GMPEMLKPSSAIMGAGLGKDVALITDGRFSGGSHGFLIGHVVPEAMEGGPIALVRDGDTITIDAEKRV IDTDVSDKTMAERRAEWKAPPIRETRGTLAKYAALVSDASSGCVTDKVAR SEQ ID No. 54
ATGCTCGCTCCCTCCCTTCTGCGGGCTCAGGCCCCCAGAGCTCTGGCTTCTGCCCGTCTCTCCCTGTG AGTTTTCAGCCCAGTGACAGGCAACCTTGATGTCCAAGCGCAAGTGAAGCTATTCAGGAAGCGATACT GACCAAGGTCTTTCCTCTCCCCTCCAGCCGCTCCCTCTCCACCACTCCCCGCCGCTACAATGCCCAGG AGAAGCAGCTCAACAAGGTGTCGGCCAACATCACCCAGCCCAAGTCTCAGGGTGCTTCCCAGGCCATG CTCTACGCCACCGGCCTCAACGAGGACGACATGAACAAGGCCCAGGTCGGTATCTCGTCTGTCTGGTA TGAGGGCAACCCTTGCAACATGCACCTTCTCGACCTCTCCGGCCTCGTCAAGGAGTCCGTTGCCAAGG CTGGCCTCGTCCCCATGCGCTTCAACACCATCGGTGTCTCGGACGGTATCTCCATGGGTACCACCGGT ATGCGCTACAGCTTGCAGTCCCGTGAGATTATTGCCGACTCCATCGAGACCGTCATGAACGGCCAGTG GTACGACGCCAACATCTCCCTCCCCGGTTGCGACAAGAACATGCCCGGTGTCCTCATCGCCATGGGCC GTGTCAACCGCCCCTCCATCATGGTCTACGGTGGTACCATCAAGCCCGGCTGCAACATGAAGGGCGAG AACATCGATATCGTTTCCGCCTTCCAGGCATACGGCCAGTACATCTCCGGTGAGATCGACGAGAAGCA GCGCTTCGATATCATCCGCAACGCCTGCCCTGGTGGTGGCGCCTGCGGTGGCATGTACACTGCCAACA CCATGGCCACCGCCATCGAGACCCTCGGCATGACCCTTCCCGGTTCCTCCTCCTACCCCGCCGAGTCT CCCGAGAAGAAGAACGAGTGCTTGAGCGTCGGTGAGGCCATCAAGAACCTCCTCCGCGAGGACATCCG CCCCACCGATATCCTTACTCGCCAGGCCTTCGAGAACGCCATGATCGTCGTCAACATTCTTGGCGGTT CCACCAACGCCGTCCTCCACTTGATTGCCATCGCCGACTCCGTTGGCATCAAGCTCACCATCGATGAC TTCCAGGCCGTCTCCGACCGCACTCCTTTCCTTGCCGACCTCAAGCCTTCCGGCAAGTGGGTCATGGA GGACCTTAGCAAGATTGGCGGCACCCCCGCCCTTCTCAAATTCCTCCTCAAGGAGGGAATCCTCGACG GCTCCGGCATCACCTCGACCGGTAAGACCATGAAGGAGAACGTCGAGAAGTTCCCCGACTTCCCCACC GACCAGGCCATCATCCGCCCTCTGTCGAACCCCATCAAGGAAACCGGACACATTCAGATCCTCCGCGG CTCTCTCGCTCCCGGCGGTTCCGTCGGCAAGATCACCGGCAAGGAGGGTCTCCGTTTCGAGGGTAAGG CTCGCTGCTTCGACTACGAGGACGGCTTTATTGAGGCTCTCGAGCGTGGCGAGATCAAGAAGGGCGAG AAGACCGTCGTCGTCATCCGCTACGAGGGTCCCAAGGGCGGCCCTGGCATGCCCGAAATGCTCAAGCC TTCGTCCGCCATTATGGGCTACGGTCTCGGCAAGGACGTTGCCCTCATCACTGACGGCCGCTTCTCCG GTGGTTCTCACGGCTTCCTCATCGGCCACATTGTGCCTGAGGCTATGGAGGGTGGCCCCATTGGTCTC GTCCGCGATGGCGACACCATCGTCATCGACGCCGAGAAGAAGGTTCTTGACCTTGAGGTTCCTGAGGA GGAGCTCGCCAAGCGCCGCAAGGAGTGGAAGGCTCCTGAGCCCAAGGCTAAGCGTGGTACTCTCAGGA AGTACGCCCAGCTCGTCAAGGATGCTAGCTCTGGCTGCGTCACTGACGCTTAA
SEQ ID No. 55
ATGCTCGCTCCCTCCCTTCTGCGGGCTCAGGCCCCCAGAGCTCTGGCTTCTGCCCGTCTCTCCCTCCG CTCCCTCTCCACCACTCCCCGCCGCTACAATGCCCAGGAGAAGCAGCTCAACAAGGTGTCGGCCAACA TCACCCAGCCCAAGTCTCAGGGTGCTTCCCAGGCCATGCTCTACGCCACCGGCCTCAACGAGGACGAC ATGAACAAGGCCCAGGTCGGTATCTCGTCTGTCTGGTATGAGGGCAACCCTTGCAACATGCACCTTCT CGACCTCTCCGGCCTCGTCAAGGAGTCCGTTGCCAAGGCTGGCCTCGTCCCCATGCGCTTCAACACCA TCGGTGTCTCGGACGGTATCTCCATGGGTACCACCGGTATGCGCTACAGCTTGCAGTCCCGTGAGATT ATTGCCGACTCCATCGAGACCGTCATGAACGGCCAGTGGTACGACGCCAACATCTCCCTCCCCGGTTG CGACAAGAACATGCCCGGTGTCCTCATCGCCATGGGCCGTGTCAACCGCCCCTCCATCATGGTCTACG GTGGTACCATCAAGCCCGGCTGCAACATGAAGGGCGAGAACATCGATATCGTTTCCGCCTTCCAGGCA TACGGCCAGTACATCTCCGGTGAGATCGACGAGAAGCAGCGCTTCGATATCATCCGCAACGCCTGCCC TGGTGGTGGCGCCTGCGGTGGCATGTACACTGCCAACACCATGGCCACCGCCATCGAGACCCTCGGCA TGACCCTTCCCGGTTCCTCCTCCTACCCCGCCGAGTCTCCCGAGAAGAAGAACGAGTGCTTGAGCGTC GGTGAGGCCATCAAGAACCTCCTCCGCGAGGACATCCGCCCCACCGATATCCTTACTCGCCAGGCCTT CGAGAACGCCATGATCGTCGTCAACATTCTTGGCGGTTCCACCAACGCCGTCCTCCACTTGATTGCCA TCGCCGACTCCGTTGGCATCAAGCTCACCATCGATGACTTCCAGGCCGTCTCCGACCGCACTCCTTTC CTTGCCGACCTCAAGCCTTCCGGCAAGTGGGTCATGGAGGACCTTAGCAAGATTGGCGGCACCCCCGC CCTTCTCAAATTCCTCCTCAAGGAGGGAATCCTCGACGGCTCCGGCATCACCTCGACCGGTAAGACCA TGAAGGAGAACGTCGAGAAGTTCCCCGACTTCCCCACCGACCAGGCCATCATCCGCCCTCTGTCGAAC CCCATCAAGGAAACCGGACACATTCAGATCCTCCGCGGCTCTCTCGCTCCCGGCGGTTCCGTCGGCAA GATCACCGGCAAGGAGGGTCTCCGTTTCGAGGGTAAGGCTCGCTGCTTCGACTACGAGGACGGCTTTA TTGAGGCTCTCGAGCGTGGCGAGATCAAGAAGGGCGAGAAGACCGTCGTCGTCATCCGCTACGAGGGT CCCAAGGGCGGCCCTGGCATGCCCGAAATGCTCAAGCCTTCGTCCGCCATTATGGGCTACGGTCTCGG CAAGGACGTTGCCCTCATCACTGACGGCCGCTTCTCCGGTGGTTCTCACGGCTTCCTCATCGGCCACA TTGTGCCTGAGGCTATGGAGGGTGGCCCCATTGGTCTCGTCCGCGATGGCGACACCATCGTCATCGAC GCCGAGAAGAAGGTTCTTGACCTTGAGGTTCCTGAGGAGGAGCTCGCCAAGCGCCGCAAGGAGTGGAA GGCTCCTGAGCCCAAGGCTAAGCGTGGTACTCTCAGGAAGTACGCCCAGCTCGTCAAGGATGCTAGCT CTGGCTGCGTCACTGACGCTTAA SEQ ID No . 56
MLAPSLLRAQAPRALASARLSLRSLSTTPRRYNAQEKQLNKVSANITQPKSQGASQAMLYATGLNEDD
MNKAQVGISSVWYEGNPCNMHLLDLSGLVKESVAKAGLVPMRFNTIGVSDGISMGTTGMRYSLQSREI
IADSIETVMNGQWYDANISLPGCDKNMPGVLIAMGRVNRPSIMVYGGTIKPGCNMKGENIDIVSAFQA YGQYISGEIDEKQRFDIIRNACPGGGACGGMYTANTMATAIETLGMTLPGSSSYPAESPEKKNECLSV
GEAIKNLLREDIRPTDILTRQAFENAMIVVNILGGSTNAVLHLIAIADSVGIKLTIDDFQAVSDRTPF
LADLKPSGKWVMEDLSKIGGTPALLKFLLKEGILDGSGITSTGKTMKENVEKFPDFPTDQAIIRPLSN
PIKETGHIQILRGSLAPGGSVGKITGKEGLRFEGKARCFDYEDGFIEALERGEIKKGEKTVVVIRYEG
PKGGPGMPEMLKPSSAIMGYGLGKDVALITDGRFSGGSHGFLIGHIVPEAMEGGPIGLVRDGDTIVID AEKKVLDLEVPEEELAKRRKEWKAPEPKAKRGTLRKYAQLVKDASSGCVTDA
SEQ ID No. 57
ATGGCTTCCAACCAAGATAATAAGGCTGTCGCTCCCGACGCCGCCGCTCCCGCCGGCCAGTCCACCAC
CACCACGACCACCAACGACAACAGCGAGCGCAACTTGCCCAAAGAGGGCGAATACATCCAATGGCGTA CTCTCCCAGCTGGCAACCCCGACCAGTTGAACCGCTGGTCGCACTTTTTGACTCGTGAGCATGAGTTT CCTGGAGCTCAGGTAAGTAATGAAGCTATTGCCATTGCCATCATTTGCTCTTGATGCTTCCTACCTAC CTACCTACCTACCTACCTACCCTCACGTCACGTCATGACATGACATGACAAATTCCGCTGCTCTCACT TATTCATTCACTGCCGTGCTTAAGCGGATCCCACACACACTACACACATACCCAACACCTCGGGTTCA AAATTGGGGTATCGTTCGGCAAAAGAATCCCGAGCCCGTCCGTCACTCGCTTCGTCATTCATATGCCT TATCTGGATGCGGAGATGGTTCATGAGGGGTAGCTTCTGGGCTGCTGTGGGGTAGGTAGTACAGCTTT TGCGTGGTACCTACCTCTAGACAACAAAGTGGAAAGGGGCATGGACCCCCCCTGACGACGATCATGAA TTACATACAACGGGTTGTACCAAAGCAACATGAAAAGCTCGGAATAACTAACATGGCGATTACCCAAC CATGTAGGCCATGCTCTACGGCGCCGGTGTCCCGAATAAGGACATGATGAAGAAGGCGCCGCATGTCG GCATTGCGACGGTCTGGTGGGAGGGCAACCCCTGCAAGTGAGTTGTACCTACCTACCCAAAGGAATGA ATGAGATGATAGGTGGTGAGAGAGGGTTGAGGACTAACGTGGTGGTTATAGCACCCATTTGCTGGATC TTGGCCAAAAGGTCAAGAAGGCCGTTGAGCGCGAGAAGATGCTTGCGTGGCAGTTCAACACCATTGGT GTCTCGGATGGTATCACCATGGGCGGAGAGGGTGAGTTTTTGTCACCTCCATGTCTCAATGCGTAATA TGAGGTTGAATGCTAAAACCACATCACATCAACCAGGCATGCGCTACTCGCTTCAGTCTCGCGAGATC ATCGCCGACTCCATCGAGACCGTGACGTGCGCTCAGCATCATGACGCCAACATCTCCATTCCCGGCTG CGACAAGAACATGCCCGGTGTCATCATGGCCGCGGCCCGCCACAACCGTCCCTTCGTCATGATCTACG GCGGCACCATGCGCGGCGGCCACTCGGAGCTCCTCGACCGCCCCATCAACATCGTCACTTGCTACGAG GCTTCCGGCGCTTACACGTATGGCCGTCTCAAGCCCGCTTGCCCCAACAGCACCGCCACCCCTTCGGA CGTCATGGACGACATCGAGCAGCACGCCTGCCCCGGCGCCGGCGCCTGCGGCGGCATGTACACCGCCA ACACCATGGCGACCGCCATCGAAGCCATGGGCCTCACCGCACCGGGCTCCTCTTCTTTCCCTGCCTCT TCGCCCGAAAAGTTCCGCGAGTGCGAAAAAGCCGCCGAGTACATCAAGATCTGTATGGAGAAGGACAT
TCGCCCGCGCGACCTCTTGACCAAGGCCTCGTTCGAGAACGCCCTCGTGCTCACCATGATCCTCGGCG GTTCTACCAATGGCGTTCTGCACTACCTCGCCATGGCCAACTCGGCCGACGTCGACTTGACTCTGGAC GACATCAACCGCGTCTCAGCCAAGACGCCCTTTTTGGCTGACATGGCCCCCTCCGGCAGGTACTACAT GGAAGACCTGTACAAAGTCGGCGGCACCCCCGCGGTCCTCAAGATGCTCATCGCCGCAGGCTATATCG ATGGCACCATCCCCACTATCACCGGCAAGTCTTTGGCGGAAAACGTATCCGACTGGCCTAGTCTCGAC
CCCGATCAAAAAATCATCCGCCCCCTTGACAACCCCATCAAGTCCCAGGGGCATATCCGTGTCTTGTA CGGAAACTTCTCCCCCGGTGGTGCGGTCGCCAAGATTACAGGCAAGGAAGGTCTCTCCTTCACCGGCA AAGCCCGTTGTTTCAACAAGGAATTCGAGCTCGACGCCGCCTTGAAGAACTCGGAAATTACCCTCGAG CAAGGAAACCAGGTCCTCATCGTGCGGTACGAAGGACCCAAAGGCGGTCCAGGTATGCCAGAGCAACT GAAGGCTTCAGCGGCGATCATGGGCGCGGGCCTGACGAATGTCGCGCTGGTGACGGATGGTCGGTACT CTGGTGCGTCGCATGGGTTTATTGTGGGCCATGTGGTCCCCGAGGCGGCGACGGGGGGGCCGATTGCG CTGGTGAAGGACGGGGATTTGATTACGATTGATGCGGTTAGGAACCGGATTGATGTCGTCAAGACTGT GGAAGGCGTCGAGGGTGAGGAAGAGATTGCCAAGGTGTTGGAGGAGAGAAAGAAGGGGTGGAAGGCGC CGAAGATGAAGCCGACGAGGGGGGCGTTGGCCAAGTATGCGAGGTTGGTGGGGGATGCGTCGCATGGG GCTGTTACTGATTTGGGTGGGGATGCCTATTAG
SEQ ID No. 58
ATGGCTTCCAACCAAGATAATAAGGCTGTCGCTCCCGACGCCGCCGCTCCCGCCGGCCAGTCCACCAC CACCACGACCACCAACGACAACAGCGAGCGCAACTTGCCCAAAGAGGGCGAATACATCCAATGGCGTA CTCTCCCAGCTGGCAACCCCGACCAGTTGAACCGCTGGTCGCACTTTTTGACTCGTGAGCATGAGTTT CCTGGAGCTCAGGCCATGCTCTACGGCGCCGGTGTCCCGAATAAGGACATGATGAAGAAGGCGCCGCA TGTCGGCATTGCGACGGTCTGGTGGGAGGGCAACCCCTGCAACACCCATTTGCTGGATCTTGGCCAAA AGGTCAAGAAGGCCGTTGAGCGCGAGAAGATGCTTGCGTGGCAGTTCAACACCATTGGTGTCTCGGAT GGTATCACCATGGGCGGAGAGGGCATGCGCTACTCGCTTCAGTCTCGCGAGATCATCGCCGACTCCAT CGAGACCGTGACGTGCGCTCAGCATCATGACGCCAACATCTCCATTCCCGGCTGCGACAAGAACATGC CCGGTGTCATCATGGCCGCGGCCCGCCACAACCGTCCCTTCGTCATGATCTACGGCGGCACCATGCGC GGCGGCCACTCGGAGCTCCTCGACCGCCCCATCAACATCGTCACTTGCTACGAGGCTTCCGGCGCTTA CACGTATGGCCGTCTCAAGCCCGCTTGCCCCAACAGCACCGCCACCCCTTCGGACGTCATGGACGACA TCGAGCAGCACGCCTGCCCCGGCGCCGGCGCCTGCGGCGGCATGTACACCGCCAACACCATGGCGACC GCCATCGAAGCCATGGGCCTCACCGCACCGGGCTCCTCTTCTTTCCCTGCCTCTTCGCCCGAAAAGTT CCGCGAGTGCGAAAAAGCCGCCGAGTACATCAAGATCTGTATGGAGAAGGACATTCGCCCGCGCGACC TCTTGACCAAGGCCTCGTTCGAGAACGCCCTCGTGCTCACCATGATCCTCGGCGGTTCTACCAATGGC GTTCTGCACTACCTCGCCATGGCCAACTCGGCCGACGTCGACTTGACTCTGGACGACATCAACCGCGT CTCAGCCAAGACGCCCTTTTTGGCTGACATGGCCCCCTCCGGCAGGTACTACATGGAAGACCTGTACA AAGTCGGCGGCACCCCCGCGGTCCTCAAGATGCTCATCGCCGCAGGCTATATCGATGGCACCATCCCC ACTATCACCGGCAAGTCTTTGGCGGAAAACGTATCCGACTGGCCTAGTCTCGACCCCGATCAAAAAAT CATCCGCCCCCTTGACAACCCCATCAAGTCCCAGGGGCATATCCGTGTCTTGTACGGAAACTTCTCCC CCGGTGGTGCGGTCGCCAAGATTACAGGCAAGGAAGGTCTCTCCTTCACCGGCAAAGCCCGTTGTTTC AACAAGGAATTCGAGCTCGACGCCGCCTTGAAGAACTCGGAAATTACCCTCGAGCAAGGAAACCAGGT CCTCATCGTGCGGTACGAAGGACCCAAAGGCGGTCCAGGTATGCCAGAGCAACTGAAGGCTTCAGCGG CGATCATGGGCGCGGGCCTGACGAATGTCGCGCTGGTGACGGATGGTCGGTACTCTGGTGCGTCGCAT GGGTTTATTGTGGGCCATGTGGTCCCCGAGGCGGCGACGGGGGGGCCGATTGCGCTGGTGAAGGACGG GGATTTGATTACGATTGATGCGGTTAGGAACCGGATTGATGTCGTCAAGACTGTGGAAGGCGTCGAGG GTGAGGAAGAGATTGCCAAGGTGTTGGAGGAGAGAAAGAAGGGGTGGAAGGCGCCGAAGATGAAGCCG ACGAGGGGGGCGTTGGCCAAGTATGCGAGGTTGGTGGGGGATGCGTCGCATGGGGCTGTTACTGATTT GGGTGGGGATGCCTATTAG
SEQ ID No. 59 MASNQDNKAVAPDAAAPAGQSTTTTTTNDNSERNLPKEGEYIQWRTLPAGNPDQLNRWSHFLTREHEF PGAQAMLYGAGVPNKDMMKKAPHVGIATVWWEGNPCNTHLLDLGQKVKKAVEREKMLAWQFNTIGVSD GITMGGEGMRYSLQSREIIADSIETVTCAQHHDANISIPGCDKNMPGVIMAAARHNRPFVMIYGGTMR GGHSELLDRPINIVTCYEASGAYTYGRLKPACPNSTATPSDVMDDIEQHACPGAGACGGMYTANTMAT AIEAMGLTAPGSSSFPASSPEKFRECEKAAEYIKICMEKDIRPRDLLTKASFENALVLTMILGGSTNG VLHYLAMANSADVDLTLDDINRVSAKTPFLADMAPSGRYYMEDLYKVGGTPAVLKMLIAAGYIDGTIP TITGKSLAENVSDWPSLDPDQKIIRPLDNPIKSQGHIRVLYGNFSPGGAVAKITGKEGLSFTGKARCF NKEFELDAALKNSEITLEQGNQVLIVRYEGPKGGPGMPEQLKASAAIMGAGLTNVALVTDGRYSGASH GFIVGHVVPEAATGGPIALVKDGDLITIDAVRNRIDVVKTVEGVEGEEEIAKVLEERKKGWKAPKMKP TRGALAKYARLVGDASHGAVTDLGGDAY
SEQ ID No. 60
ATGAGTTTTGTCAAATCTTGCAGAGGGTGTCTTAGGACATTTGCTACATCCACAATCAAGCATGAGAA AAAGTTAAACAAGTACTCGTCTATTGTCACAGGCGACCCATCTCAGGGTGCTTCCCAAGCAATGCTTT ATGCCACTGGTTTCAGTGATGAAGATTTCGATCGTGCACAAATCGGTGTCGGTTCTGTTTGGTGGTCA GGTAACCCATGTAACATGCACTTGATGGAGTTGAACAACAGGTGTTCCGAATCAGTCAACAAGGCCGG CTTAAAAGCCATGCAATTCAACTCCATTGGTGTGTCGGACGGTATCACCAACGGTACTGAAGGTATGA AGTACTCTTTACAGCTGAGAGAAATTATCGCCGACTCCTTTGAAACCATGACCATGGCCCAACTATAT GACGGTAACATTGCCATTCCTTCATGTGATAAAAATATGCCCGGGGTATTGATGGCTATGGGAAGACA CAACAGACCTGCCATTATGGTCTATGGTGGTACTATCTTGCCTGGATCTCCAACTTGCGGAACCCAAA ACCCTGCTGTAGCCGACAAAATCGATATCATTAGTGCTTTCCAATCCTACGGACAATACTTGTCAAAA
CAAATCAACAACGAAGAAAGAATAGATATTGTCAAACATGCCTGTCCAGGGCCCGGTGCATGTGGTGG TATGTACACTGCCAACACTATGGCCTCTGCCTCTGAAGTCTTGGGGTTGACCTTACCATTTTCGTCCT CGTCCCCGGCAGTCTCCAAAGAAAAAGCCGAAGAGTGTGCCAACGTCGGGTTCGCCTTAAAGAATTTG TTAGAGTTGGACTTGAAACCAAGAGATATTGTCACCAAAAAATCATTTGAAAACGCTATTGCTTATAT CATTGCCACTGGTGGTTCTACTAATGCCGTTTTACATCTTATTGCCATTGCCTCTTCATTTGACATTA CTATTACTGTTGACGATTTCCAAAGAATCTCCGACAACACTCCCTTGTTGGCCGATTTCAAACCATCG GGTAAATACGTTATGGCCGACTTGCAAAATGTCGGCGGTACACCTGCTGTTATGAAATACTTGATCAA AGAAGGCATTATCGACGGTACCCAATTGAGTGTCACCGGTAAAACCATCAACGAAAACTTGGCTAAAC TTGCTGATTTGCCTGAGGGCCAAGACATTGTTAGACCAGTATCAAACCCATTGAAGCCAAGTGGCCAC TTACAAATCTTGAAAGGTACTTTGGCTCCAGGGTCTGCTGTCGCTAAAATCACCGGTAAAGAAGGTAC
TTATTTCAAAGGTAAAGCTAGAGTATTCAACGACGAAGGTGCATTTATTGTTGCCTTGGAAAATGGCG AGATCAAAAAAGGCGAAAAAACAGTTTGTGTGATCAGATACGAAGGTCCAAAAGGTGGTCCAGGTATG CCAGAAATGTTGAAACCTTCGTCTGCATTAATGGGTTACGGGTTAGGTAAAGACGTTGCTTTGTTGAC TGATGGTAGATTTTCGGGTGGTTCCCACGGGTTCCTTATTGGCCACATTGTTCCTGAAGCCGCTGAAG GTGGTCCAATTGCTTTGGTTGAAGATGGCGATATTATTGTCATCGACGCAGACAATAACAAAATCGAT TTGTTGGTTGAGCCAGACGTCTTGACCGAAAGAAGAAAACACTGGACGCCTCCAGAACCAAGATACAA AAGAGGTACTTTGGCCAAATACGCCAAGTTGGTCAGCGATGCATCTAAGGGATGTGTTACAGATTTAT AA
SEQ ID No. 61
MSFVKSCRGCLRTFATSTIKHEKKLNKYSSIVTGDPSQGASQAMLYATGFSDEDFDRAQIGVGSVWWS GNPCNMHLMELNNRCSESVNKAGLKAMQFNSIGVSDGITNGTEGMKYSLQSREIIADSFETMTMAQLY DGNIAIPSCDKNMPGVLMAMGRHNRPAIMVYGGTILPGSPTCGTQNPAVADKIDIISAFQSYGQYLSK QINNEERIDIVKHACPGPGACGGMYTANTMASASEVLGLTLPFSSSSPAVSKEKAEECANVGFALKNL LELDLKPRDIVTKKSFENAIAYIIATGGSTNAVLHLIAIASSFDITITVDDFQRISDNTPLLADFKPS GKYVMADLQNVGGTPAVMKYLIKEGIIDGTQLSVTGKTINENLAKLADLPEGQDIVRPVSNPLKPSGH LQILKGTLAPGSAVAKITGKEGTYFKGKARVFNDEGAFIVALENGEIKKGEKTVCVIRYEGPKGGPGM PEMLKPSSALMGYGLGKDVALLTDGRFSGGSHGFLIGHIVPEAAEGGPIALVEDGDIIVIDADNNKID LLVEPDVLTERRKHWTPPEPRYKRGTLAKYAKLVSDASKGCVTDL
SEQ ID No. 62
ATGGGCTTGTTAACGAAAGTTGCTACATCTAGACAATTCTCTACAACGAGATGCGTTGCAAAGAAGCT
CAACAAGTACTCGTATATCATCACTGAACCTAAGGGCCAAGGTGCGTCCCAGGCCATGCTTTATGCCA CCGGTTTCAAGAAGGAAGATTTCAAGAAGCCTCAAGTCGGGGTTGGTTCCTGTTGGTGGTCCGGTAAC CCATGTAACATGCATCTATTGGACTTGAATAACAGATGTTCTCAATCCATTGAAAAAGCGGGTTTGAA AGCTATGCAGTTCAACACCATCGGTGTTTCAGACGGTATCTCTATGGGTACTAAAGGTATGAGATACT CGTTACAAAGTAGAGAAATCATTGCAGACTCCTTTGAAACCATCATGATGGCACAACACTACGATGCT AACATCGCCATCCCATCATGTGACAAAAACATGCCCGGTGTCATGATGGCCATGGGTAGACATAACAG ACCTTCCATCATGGTATATGGTGGTACTATCTTGCCCGGTCATCCAACATGTGGTTCTTCGAAGATCT CTAAAAACATCGATATCGTCTCTGCGTTCCAATCCTACGGTGAATATATTTCCAAGCAATTCACTGAA GAAGAAAGAGAAGATGTTGTGGAACATGCATGCCCAGGTCCTGGTTCTTGTGGTGGTATGTATACTGC CAACACAATGGCTTCTGCCGCTGAAGTGCTAGGTTTGACCATTCCAAACTCCTCTTCCTTCCCAGCCG TTTCCAAGGAGAAGTTAGCTGAGTGTGACAACATTGGTGAATACATCAAGAAGACAATGGAATTGGGT ATTTTACCTCGTGATATCCTCACAAAAGAGGCTTTTGAAAACGCCATTACTTATGTCGTTGCAACCGG TGGGTCCACTAATGCTGTTTTGCATTTGGTGGCTGTTGCTCACTCTGCGGGTGTCAAGTTGTCACCAG ATGATTTCCAAAGAATCAGTGATACTACACCATTGATCGGTGACTTCAAACCTTCTGGTAAATACGTC ATGGCCGATTTGATTAACGTTGGTGGTACCCAATCTGTGATTAAGTATCTATATGAAAACAACATGTT GCACGGTAACACAATGACTGTTACCGGTGACACTTTGGCAGAACGTGCAAAGAAAGCACCAAGCCTAC CTGAAGGACAAGAGATTATTAAGCCACTCTCCCACCCAATCAAGGCCAACGGTCACTTGCAAATTCTG TACGGTTCATTGGCACCAGGTGGAGCTGTGGGTAAAATTACCGGTAAGGAAGGTACTTACTTCAAGGG TAGAGCACGTGTGTTCGAAGAGGAAGGTGCCTTTATTGAAGCCTTGGAAAGAGGTGAAATCAAGAAGG GTGAAAAAACCGTTGTTGTTATCAGATATGAAGGTCCAAGAGGTGCACCAGGTATGCCTGAAATGCTA AAGCCTTCCTCTGCTCTGATGGGTTACGGTTTGGGTAAAGATGTTGCATTGTTGACTGATGGTAGATT CTCTGGTGGTTCTCACGGGTTCTTAATCGGCCACATTGTTCCCGAAGCCGCTGAAGGTGGTCCTATCG GGTTGGTCAGAGACGGCGATGAGATTATCATTGATGCTGATAATAACAAGATTGACCTATTAGTCTCT GATAAGGAAATGGCTCAACGTAAACAAAGTTGGGTTGCACCTCCACCTCGTTACACAAGAGGTACTCT ATCCAAGTATGCTAAGTTGGTTTCCAACGCTTCCAACGGTTGTGTTTTAGATGCTTGA SEQ ID No. 63
MGLLTKVATSRQFSTTRCVAKKLNKYSYIITEPKGQGASQAMLYATGFKKEDFKKPQVGVGSCWWSGN PCNMHLLDLNNRCSQSIEKAGLKAMQFNTIGVSDGISMGTKGMRYSLQSREIIADSFETIMMAQHYDA NIAIPSCDKNMPGVMMAMGRHNRPSIMVYGGTILPGHPTCGSSKISKNIDIVSAFQSYGEYISKQFTE EEREDVVEHACPGPGSCGGMYTANTMASAAEVLGLTIPNSSSFPAVSKEKLAECDNIGEYIKKTMELG ILPRDILTKEAFENAITYVVATGGSTNAVLHLVAVAHSAGVKLSPDDFQRISDTTPLIGDFKPSGKYV MADLINVGGTQSVIKYLYENNMLHGNTMTVTGDTLAERAKKAPSLPEGQEIIKPLSHPIKANGHLQIL YGSLAPGGAVGKITGKEGTYFKGRARVFEEEGAFIEALERGEIKKGEKTVVVIRYEGPRGAPGMPEML KPSSALMGYGLGKDVALLTDGRFSGGSHGFLIGHIVPEAAEGGPIGLVRDGDEIIIDADNNKIDLLVS DKEMAQRKQSWVAPPPRYTRGTLSKYAKLVSNASNGCVLDA
SEQ ID No. 64
GAC GAC GAC AAG ATG CTT CTC TCT CAG ACC AGA
SEQ ID No. 65 GAC GAC GAC AAG ATG AAA GAT AGC GAA ACG GCT T
SEQ ID No. 66
GAG GAG AAG CCC GGT TCA TTC TAC AGA GTC AGT GAT GC
SEQ ID No. 67
TGG GTT GTG TGA CAT TGC GGG AGA C
SEQ ID No. 68 GACGACGACAAGATGGACTCCTCTACCTCCGCGTCGTCCAA;
SEQ ID No. 69 GACGACGACAAGATGCATGAAGATGGCACCACTGCGCT; SEQ ID No. 70
GAGGAGAAGCCCGGTCTAGAACAAATCCGTCATTGCACCATG
SEQ ID No. 71
ACTTGCGCGCAGTATCATGATGCATGCATTG
SEQ ID No. 72 Caccgattggtggtatgacttc
SEQ ID No. 73. cctaattatcgcttcgctcagc
SEQ ID No. 74. Atgcttctctctcagaccagag SEQ ID No. 75.
GGCATCCTTGACAGTTCTACAG
SEQ ID No. 76. GTTAGCAGCCGAGATCTTCAAG
SEQ ID No 77. ATATACCTTGGGTCAGAAGCGC
SEQ ID No. 78. CACGGCGAAGCCAAGTATATCA
SEQ ID No. 79. ATACGTCATTGCACCATGAGAG SEQ ID No. 80
GAGCCAATATGCGAGAACACCCG
SEQ ID No. 81 GTCAGTATCATCCCCGCAAT
SEQ ID No. 82 GGCACTTCTTCCTCAACTGC
SEQ ID NO. 83 ILV TGGCTGAGGGGAGAACTCTA

Claims

1. Method of identifying an anti-fungal agent which targets an essential protein or gene of a fungus comprising contacting a candidate substance with (i) a protein which comprises the sequence shown by SEQ ID NOS: 3, 6, 9, 12,
15, 18, 21, 24, 27, 30, 33, 36, 39, 42, 45, 48, 50, 53, 56, 59, 61 or 63, or (ii) a protein which has 60% identity with (i), or
(iii) a protein comprising a fragment of (i) or (ii) which fragment has a length of at least 50 amino acids, or (iv) a polynucleotide that comprises a sequence which encodes (i), (ii) or (iii), or
(v) a polynucleotide comprising a sequence which has at least 70% identity with the coding sequence of (iv), and determining whether the candidate substance binds or modulates (i), (ii), (iii), (iv), or (v), wherein binding or modulation of (i), (ii), (iii), (iv), or (v) indicates that the candidate substance is an anti-fungal agent.
2. Method according to claim 1 comprising carrying out a reaction in the presence and absence of the candidate substance to determine whether the candidate substance inhibits the activity of the protein as defined in claim 1.
3. Method of identifying an anti-fungal agent which targets ILV3 genes of fungi comprising contacting a candidate substance with
(i) a protein which comprises the sequence shown by SEQ ID NOs: 12, 21, 39, 42, 45, 48, 50, 53, 56, 59, 61 or 63, or
(ii) a protein which has at least 60% identity with (i), or (iii) a protein comprising a fragment of (i) or (ii) which fragment has a length of at least 50 amino acids, and determining whether the candidate substance binds or modulates (i), (ii) or (iii), wherein binding or modulation of (i), (ii) or (iii) indicates that the candidate substance is an anti-fungal agent,
4. Method according to claim 3 comprising carrying out a reaction in the presence and absence of the candidate substance to determine whether the candidate substance inhibits the activity of the protein as defined in claim 3
5. Method of identifying an anti-fungal agent which targets ILV3 genes of fungi comprising contacting a candidate substance with
(i) a protein which comprises the sequence shown by SEQ ID NOs: 21, 42, 45, 53 or 56, or
(ii) a protein which has at least 60% identity with (i), or (iii) a protein comprising a fragment of (i) or (ii) which fragment has a length of at least 50 amino acids, and also contacting said candidate substance with
(iv) a protein which comprises the sequence shown by SEQ ID NOs: 12, 39, 48, 50 or 59 , or (v) a protein which has at least 60% identity with (i), or
(vi) a protein comprising a fragment of (i) or (ii) which fragment has a length of at least 50 amino acids, and determining whether the candidate substance binds or modulates (i), (ii) or (iii), and whether the candidate substance binds or modulates (iv), (v) or (vi), wherein binding or modulation of (i), (ii) or (iii), and (iv), (v) or (vi) indicates that the candidate substance is an anti-fungal agent.
6. Method according to claim 5 comprising carrying out a reaction in the presence and absence of the candidate substance to determine whether the candidate substance inhibits the activity of the proteins as defined in claim 5
7. Method of identifying an anti-fungal agent which targets ILV3 genes of fungi comprising contacting a candidate substance with
(i) a protein which comprises the sequence shown by SEQ ID NOs: 12, 39, 48, 50 or 59, or
(ii) a protein which has at least 60% identity with (i), or (iii) a protein comprising a fragment of (i) or (ii) which fragment has a length of at least 50 amino acids, and also contacting said candidate substance with
(iv) a protein which comprises the sequence shown by SEQ ID NOs: 21, 42, 45, 53 or 56, or
(v) a protein which has at least 60% identity with (iv), or (vi) a protein comprising a fragment of (iv) or (v) which fragment has a length of at least 50 amino acids, and determining whether the candidate substance binds or modulates (i), (ii) or (iii), and whether the candidate substance binds or modulates (iv), (v) or (vi), wherein binding or modulation of (i), (ii) or (iii), and (iv), (v) or (vi) indicates that the candidate substance is an anti-fungal agent,
8. Method according to claim 7 comprising carrying out a reaction in the presence and absence of the candidate substance to determine whether the candidate substance inhibits the activity of the proteins as defined in claim 7.
9. Method according to claims 1 to 8 wherein the protein or polynucleotide is from Aspergillus flavus; Aspergillus fumigatus; Aspergillus nidulans; Aspergillus niger; Aspergillus parasiticus; Aspergillus terreus; Blumeria graminis; Candida albicans; Candida cruzei; Candida glabrata; Candida parapsilosis; Candida tropicalis; Colletotrichium trifolii; Cryptococcus neoformans; Encephalitozoon cuniculi; Fusarium graminarium; Fusarium solani; Fusarium sporotrichoides;
Histoplasma capsulata; Leptosphaeria nodorum; Magnaporthe grisea;
Mycosphaerella graminicola; Neurospora crassa; Phytophthora capsici;
Phytophthora infestans; Plasmopara viticola; Pneumocystis jiroveci; Puccinia coronata; Puccinia graminis; Pyήcularia oryzae; Pyihium ultimum; Rhizoctonia solani; Saccharomyces cerevisiae; Schizosaccharomyces pombe; Trichophyton interdigitale; Trichophyton rubrum; or Ustilago maydis.
10. Method according to any one of the preceding claims which further comprises formulating the identified anti-fungal agent into an agricultural or a pharmaceutical composition.
11. Method according to any one of the preceding claims which further comprises killing or impairing the growth of a fungus by contacting the fungus with the identified anti-fungal agent.
12. Use of a protein or polynucleotide as defined in claim 1, 3, 5 or 7 to identify or obtain an anti-fungal agent.
13. Use of an anti-fungal agent identified by the method of any one of claims 1 to 8 in the manufacture of a medicament for prevention or treatment of fungal infection.
14. An isolated protein or polynucleotide as defined in claim 1, 3, 5 or 7
15. A vector comprising a polynucleotide as defined in claim 1, 3, 5 or 7.
16. A recombinant cell comprising a polynucleotide as defined in claim 1, 3, 5 or 7or a vector according to claim 15.
17. A method of obtaining a protein as defined in claim 1, 3, 5 or 7comprising expressing the protein from a polynucleotide as defined in claim 1, 3, 5 or 7or a vector according to claim 15.
18. A method of obtaining a polynucleotide as defined in claim 1, 3, 5 or 7comprising replication of a vector as defined in claim 15 or synthesis of the polynucleotide by condensation of nucleotides.
19. An organism which is transgenic for a polynucleotide as defined in claim 1, 3, 5 or 7.
20. An organism which has been genetically engineered to render a polynucleotide or protein as defined, in claim 1, 3, 5 or 7non- functional or inhibited.
21. An antibody which is specific for a protein as defined in claim 1, 3, 5 or 7.
22. A method for preventing or treating a fungal infection comprising administering an anti-fungal agent identified by the method of any one of claims 1, 3, 5 or 7.
23. A method for preventing or treating a fungal infection comprising administering a protein or polynucleotide as defined in claim 1, 3, 5 or 7.
24. A method of killing, or impairing the growth of, a fungus comprising inhibiting the expression or activity of a polynucleotide or protein as defined in claim 1, 3, 5 or 7.
25. A method according to claim 24 wherein the fungus has infected a human, animal or plant individual.
26. A fungus which has been killed, or whose growth has been impaired, by inhibition of the expression or activity of a protein or polynucleotide as defined in claim 1, 3, 5 or 7.
EP05811554A 2004-12-01 2005-12-01 Fungal signalling and metabolic enzymes Withdrawn EP1825272A2 (en)

Applications Claiming Priority (3)

Application Number Priority Date Filing Date Title
GB0426390A GB0426390D0 (en) 2004-12-01 2004-12-01 Fungal signalling and metabolic enzymes
GB0521062A GB0521062D0 (en) 2005-10-17 2005-10-17 Fungal signalling and metabolic enzymes
PCT/GB2005/004604 WO2006059111A2 (en) 2004-12-01 2005-12-01 Fungal signalling and metabolic enzymes

Publications (1)

Publication Number Publication Date
EP1825272A2 true EP1825272A2 (en) 2007-08-29

Family

ID=35883555

Family Applications (1)

Application Number Title Priority Date Filing Date
EP05811554A Withdrawn EP1825272A2 (en) 2004-12-01 2005-12-01 Fungal signalling and metabolic enzymes

Country Status (4)

Country Link
US (1) US20110081348A1 (en)
EP (1) EP1825272A2 (en)
JP (1) JP2008521427A (en)
WO (1) WO2006059111A2 (en)

Families Citing this family (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2008133141A1 (en) * 2007-04-24 2008-11-06 Toyo Boseki Kabushiki Kaisha Osmotin recombinant protein, method for production of the same, and use of the same
KR20110063576A (en) 2008-09-29 2011-06-10 부타맥스 어드밴스드 바이오퓨얼스 엘엘씨 Increased heterologous fe-s enzyme actiivty in yeast
CN102186973B (en) 2008-09-29 2017-08-15 布特马斯先进生物燃料有限责任公司 The identification of [2Fe 2S] dihydroxyacid dehydratase of bacterium and purposes
CA2770842A1 (en) 2009-08-12 2011-02-17 Gevo, Inc. Cytosolic isobutanol pathway localization for the production of isobutanol
MY156003A (en) 2009-11-24 2015-12-31 Gevo Inc Methods of increasing dihydroxy acid dehydratase activity to improve production of fuels, chemicals, and amino acids
KR20130027063A (en) 2010-02-17 2013-03-14 부타맥스 어드밴스드 바이오퓨얼스 엘엘씨 Improving activity of fe-s cluster requiring proteins
US9650624B2 (en) 2012-12-28 2017-05-16 Butamax Advanced Biofuels Llc DHAD variants for butanol production
US9580705B2 (en) 2013-03-15 2017-02-28 Butamax Advanced Biofuels Llc DHAD variants and methods of screening
EP3610044B1 (en) * 2017-04-12 2022-01-05 Momentum Bioscience Limited Detection and delineation of microorganisms using the ilv3 gene

Family Cites Families (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5998420A (en) * 1996-04-08 1999-12-07 University Of Medicine & Dentistry Of New Jersey Method for treating Mycobacterium tuberculosis
US6280963B1 (en) * 1997-11-07 2001-08-28 Millennium Pharmaceuticals, Inc. Essential fungal genes and their use
DE10261834A1 (en) * 2002-12-20 2004-07-08 Phenion Gmbh & Co. Kg High-throughput suitable screening process for the identification of active substances

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
See references of WO2006059111A3 *

Also Published As

Publication number Publication date
WO2006059111A3 (en) 2006-08-31
WO2006059111A2 (en) 2006-06-08
JP2008521427A (en) 2008-06-26
US20110081348A1 (en) 2011-04-07

Similar Documents

Publication Publication Date Title
US20110081348A1 (en) Fungal signalling and metabolic enzymes
Kong et al. The activation of Phytophthora effector Avr3b by plant cyclophilin is required for the nudix hydrolase activity of Avr3b
Pagliuso et al. An RNA-binding protein secreted by a bacterial pathogen modulates RIG-I signaling
Mills et al. Kinetoplastid PPEF phosphatases: dual acylated proteins expressed in the endomembrane system of Leishmania
Wang et al. Analysis of the dermatophyte Trichophyton rubrum expressed sequence tags
CA2523875A1 (en) Sars virus nucleotide and amino acid sequences and uses thereof
US20080200374A1 (en) Mutational derivatives of microcin j25
Peng et al. Pathogen hijacks programmed cell death signaling by arginine ADPR-deacylization of caspases
Liang et al. SUSA2 is an F-box protein required for autoimmunity mediated by paired NLRs SOC3-CHS1 and SOC3-TN2
Hoi et al. Clp-targeting BacPROTACs impair mycobacterial proteostasis and survival
US20040167066A1 (en) Cleavage and polyadenylation complex of precursor mrna
Shi et al. The ANIP1-OsWRKY62 module regulates both basal defense and Pi9-mediated immunity against Magnaporthe oryzae in rice
Ye et al. Characterization and expression analysis of a caspase-2 in an invertebrate echinoderm sea cumber Apostichopus japonicus
JP2002541782A (en) Fungal β-tubulin gene
WO2010079998A2 (en) Use of hog, ras, and camp signal pathway genes for fungal infection treatment
WO2004104193A2 (en) Phospholipase c
WO1996039527A1 (en) Cell-cycle regulatory proteins from human pathogens, and uses related thereto
Kim et al. A histone deacetylase, MoHDA1 regulates asexual development and virulence in the rice blast fungus
WO2005095975A2 (en) Anti-fungal target genes
US20080206220A1 (en) 2031 Oxidoreductase
Wang et al. Characterization of a novel otubain-like protease with deubiquitination activity from Nosema bombycis (Microsporidia)
Fang et al. Sensitivity of Magnaporthe grisea to the sterol demethylation inhibitor fungicide propiconazole
Onchieku et al. Artemisinin acts by inhibiting Plasmodium falciparum Ddi1, a retropepsin, resulting into the accumulation of ubiquitinated proteins
KR101683002B1 (en) USE OF sppA GENE AND SppA PROTEIN FOR TREATMENT OF ASPERGILLOSIS
Gu et al. UvATG6 Interacts with BAX Inhibitor 1 Proteins and Plays Critical Roles in Growth, Conidiation, and Virulence in Ustilaginoidea virens

Legal Events

Date Code Title Description
PUAI Public reference made under article 153(3) epc to a published international application that has entered the european phase

Free format text: ORIGINAL CODE: 0009012

17P Request for examination filed

Effective date: 20070611

AK Designated contracting states

Kind code of ref document: A2

Designated state(s): AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HU IE IS IT LI LT LU LV MC NL PL PT RO SE SI SK TR

17Q First examination report despatched

Effective date: 20080205

DAX Request for extension of the european patent (deleted)
GRAP Despatch of communication of intention to grant a patent

Free format text: ORIGINAL CODE: EPIDOSNIGR1

STAA Information on the status of an ep patent application or granted ep patent

Free format text: STATUS: THE APPLICATION IS DEEMED TO BE WITHDRAWN

18D Application deemed to be withdrawn

Effective date: 20091229