CN1177048C - Human protein with function of suppressing cancer cell growth and its coding sequence - Google Patents

Human protein with function of suppressing cancer cell growth and its coding sequence

Info

Publication number
CN1177048C
CN1177048C CNB011267267A CN01126726A CN1177048C CN 1177048 C CN1177048 C CN 1177048C CN B011267267 A CNB011267267 A CN B011267267A CN 01126726 A CN01126726 A CN 01126726A CN 1177048 C CN1177048 C CN 1177048C
Authority
CN
China
Prior art keywords
leu
ccc
ctg
gcc
pro
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Expired - Fee Related
Application number
CNB011267267A
Other languages
Chinese (zh)
Other versions
CN1403478A (en
Inventor
顾健人
杨胜利
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
SHANGHAI XINSHIJIE GENE TECHN DEVELOPMENT Co Ltd
Original Assignee
SHANGHAI XINSHIJIE GENE TECHN DEVELOPMENT Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by SHANGHAI XINSHIJIE GENE TECHN DEVELOPMENT Co Ltd filed Critical SHANGHAI XINSHIJIE GENE TECHN DEVELOPMENT Co Ltd
Priority to CNB011267267A priority Critical patent/CN1177048C/en
Publication of CN1403478A publication Critical patent/CN1403478A/en
Application granted granted Critical
Publication of CN1177048C publication Critical patent/CN1177048C/en
Anticipated expiration legal-status Critical
Expired - Fee Related legal-status Critical Current

Links

Landscapes

  • Peptides Or Proteins (AREA)
  • Medicines That Contain Protein Lipid Enzymes And Other Medicines (AREA)

Abstract

The present invention discloses a novel human protein with the function of inhibiting cancer, polynucleotide for encoding the polypeptide and a method for preparing the polypeptide by a recombinant technology. The present invention also discloses a method of using the polypeptide to treat various diseases, such as cancers. The present invention also discloses an antagonist of the polypeptide and a therapeutic effect thereof. The present invention also discloses the application of the polynucleotide for encoding the human protein with the function of inhibiting cancer.

Description

The proteic polynucleotide of people that coding has anticancer growth function
Technical field
The invention belongs to biological technical field, specifically, the present invention relates to new coding and have the proteic polynucleotide of people of cancer suppressing function and the polypeptide of this polynucleotide encoding.The invention still further relates to the purposes and the preparation of these polynucleotide and polypeptide.
Background technology
The research of people's gene group is international focus at present, removes human chromosome DNA large scale sequencing, outside the method for expressed sequence order-checking (EST), also lacks the screening that begins from function and has the high-throughout method of functional gene.
Cancer is one of principal disease of harm humans health.In order to treat effectively and prophylaxis of tumours, people more and more pay close attention to genetic treatment of tumor at present.Therefore, this area presses for people's albumen and the agonist/inhibitor thereof that development research has cancer suppressing function.
Summary of the invention
The purpose of this invention is to provide the new people's protein polypeptide of a class with cancer suppressing function with and fragment, analogue and derivative.
Another object of the present invention provides the polynucleotide of these polypeptide of coding.
Another object of the present invention provides the method for these polypeptide of production and the purposes of this polypeptide and encoding sequence.
In a first aspect of the present invention, novel isolated protein polypeptide with cancer suppressing function is provided, it comprises the polypeptide of the aminoacid sequence with the group of being selected from down: SEQ ID NO:2,5,8,11,14,17,20,23,26,29,32,35; Or its conservative property variation polypeptide or its active fragments or its reactive derivative.
Preferably, this polypeptide is the polypeptide with aminoacid sequence of the group of being selected from down: SEQ ID NO:2,5,8,11,14,17,20,23,26,29,32,35.
In a second aspect of the present invention, a kind of isolating polynucleotide are provided, it comprises a nucleotide sequence, and this nucleotide sequence is shown at least 85% homogeny with a kind of nucleotides sequence that is selected from down group: the polynucleotide of the above-mentioned protein polypeptide with cancer suppressing function of (a) encoding; (b) with polynucleotide (a) complementary polynucleotide.Preferably, the polypeptide of this polynucleotide encoding has the aminoacid sequence of the group of being selected from down: SEQ ID NO:2,5,8,11,14,17,20,23,26,29,32,35.More preferably, the sequence of these polynucleotide is selected from down group: SEQ ID NO:3,6,9,12,15,18,21,24,27,30,33,36 coding region sequence or full length sequence.
In a third aspect of the present invention, the carrier that contains above-mentioned polynucleotide is provided, and has been transformed or host cell of transduceing or the host cell that is directly transformed or transduce by above-mentioned polynucleotide by this carrier.
In a fourth aspect of the present invention, the preparation method who prepares the polypeptide of the protein-active with cancer suppressing function is provided, this method comprises: (a) have under the proteic condition of cancer suppressing function suitable the expression, cultivate the above-mentioned host cell that is transformed or transduce; (b) from culture, isolate the polypeptide of protein-active with cancer suppressing function.
In a fifth aspect of the present invention, provide and above-mentioned protein polypeptide specificity bonded antibody with cancer suppressing function.The nucleic acid molecule that can be used for detecting also is provided, and it contains, and continuous 10 Nucleotide are to full length nucleotide in the above-mentioned polynucleotide, and preferably it contains the about 10-800 of a successive Nucleotide.
In a sixth aspect of the present invention, a kind of pharmaceutical composition is provided, it contains the protein polypeptide and the pharmaceutically acceptable carrier with cancer suppressing function of the present invention of safe and effective amount.These pharmaceutical compositions can be treated illnesss such as cancer and cellular abnormality propagation.
Others of the present invention are because the disclosure of this paper is conspicuous to those skilled in the art.
Embodiment
The present invention adopts large-scale cDNA clone transfection cancer cells, has on the basis of cancer suppressing action in acquisition, proves new gene through order-checking, further obtains full length cDNA clone.DNA transfection evidence, the albumen with cancer suppressing function of the present invention has the effect that suppresses clone's formation to cancer cells (liver cancer cell), and its inhibiting rate is more than 50% or 50%.
As used herein, " isolating " is meant that material separates (if natural substance, primal environment promptly is a natural surroundings) from its primal environment.Do not have separation and purification as polynucleotide under the native state in the active somatic cell and polypeptide, but same polynucleotide or polypeptide as from native state with in other materials that exist separately, then for separation and purification.
As used herein, " isolating albumen or polypeptide with cancer suppressing function " is meant that the protein polypeptide with cancer suppressing function is substantially free of natural relative other albumen, lipid, carbohydrate or other material.Those skilled in the art can have the albumen of cancer suppressing function with the purified technology of protein purifying of standard.Basically pure polypeptide can produce single master tape on non-reduced polyacrylamide gel.
Polypeptide of the present invention can be recombinant polypeptide, natural polypeptides, synthetic polypeptide, preferred recombinant polypeptide.Polypeptide of the present invention can be the product of natural purifying, or the product of chemosynthesis, or uses recombinant technology to produce from protokaryon or eucaryon host (for example, bacterium, yeast, higher plant, insect and mammalian cell).The host used according to the recombinant production scheme, polypeptide of the present invention can be glycosylated, maybe can be nonglycosylated.Polypeptide of the present invention also can comprise or not comprise initial methionine residues.
The present invention also comprises the proteic fragment of the people with cancer suppressing function, derivative and analogue.As used herein, term " fragment ", " derivative " are meant with " analogue " and keep natural identical biological function or the active polypeptide of people's albumen with cancer suppressing function of the present invention basically.Polypeptide fragment of the present invention, derivative or analogue can be that (i) has one or more conservative or substituted polypeptide of non-conservation amino-acid residue (preferred conservative amino acid residue), and the amino-acid residue of such replacement can be also can not encoded by genetic code, or (ii) in one or more amino-acid residues, has a polypeptide of substituted radical, or (iii) mature polypeptide and another compound (such as the compound that prolongs the polypeptide transformation period, polyoxyethylene glycol for example) merge formed polypeptide, or (iv) additional aminoacid sequence is fused to this peptide sequence and the polypeptide that forms (as leader sequence or secretion sequence or be used for the sequence or the proteinogen sequence of this polypeptide of purifying).According to the instruction of this paper, these fragments, derivative and analogue belong to the known scope of those skilled in the art.
Polynucleotide of the present invention can be dna form or rna form.Dna form comprises the DNA of cDNA, genomic dna or synthetic.DNA can be strand or double-stranded.DNA can be coding strand or noncoding strand.Be example with FP977 albumen (in this application, its clone numbering is adopted in proteinic name), the coding region sequence of encoding mature polypeptide can be identical with the coding region sequence shown in the SEQ ID NO:3 or the varient of degeneracy.As used herein, " varient of degeneracy " is meant that in the present invention coding has the protein of SEQ ID NO:2, but with the differentiated nucleotide sequence of coding region sequence shown in the SEQ IDNO:3.Be example with FP1147 albumen (in this application, its clone numbering is adopted in proteinic name) again, the coding region sequence of encoding mature polypeptide can be identical with the coding region sequence shown in the SEQ ID NO:6 or the varient of degeneracy.As used herein, " varient of degeneracy " is meant that in the present invention coding has the protein of SEQ ID NO:5, but with the differentiated nucleotide sequence of coding region sequence shown in the SEQ ID NO:6.Have the albumen of cancer suppressing function for of the present invention other, can the rest may be inferred.
The polynucleotide of encoding mature polypeptide comprise: the encoding sequence of an encoding mature polypeptide; The encoding sequence of mature polypeptide and various additional code sequence; Encoding sequence of mature polypeptide (with optional additional code sequence) and non-coding sequence.
Term " polynucleotide of coded polypeptide " can be the polynucleotide that comprise this polypeptide of encoding, and also can be the polynucleotide that also comprise additional code and/or non-coding sequence.
The invention still further relates to the varient of above-mentioned polynucleotide, its coding has the polypeptide of identical aminoacid sequence or fragment, analogue and the derivative of polypeptide with the present invention.The varient of these polynucleotide can be the allelic variant of natural generation or the varient that non-natural takes place.These nucleotide diversity bodies comprise and replace varient, deletion mutation body and insert varient.As known in the art, allelic variant is the replacement form of polynucleotide, and it may be replacement, disappearance or the insertion of one or more Nucleotide, but can be from not changing the function of its encoded polypeptides in fact.
The invention still further relates to and above-mentioned sequence hybridization and two sequences between have at least 50%, preferably at least 70%, the polynucleotide of at least 80% homogeny more preferably.The present invention be more particularly directed under stringent condition and the interfertile polynucleotide of polynucleotide of the present invention.In the present invention, " stringent condition " is meant: (1) than hybridization under low ionic strength and the comparatively high temps and wash-out, as 0.2 * SSC, and 0.1%SDS, 60 ℃; Or (2) hybridization the time is added with denaturing agent, as 50% (v/v) methane amide, 0.1% calf serum/0.1%Ficoll, 42 ℃ etc.; Or (3) only at the homogeny between the two sequences at least more than 95%, be more preferably 97% and just hybridize when above.And the polypeptide of interfertile polynucleotide encoding has identical biological function (is example with FP977 albumen) and activity with the mature polypeptide shown in the SEQ IDNO:2.
The invention still further relates to nucleic acid fragment with above-mentioned sequence hybridization.As used herein, the length of " nucleic acid fragment " contains 15 Nucleotide at least, better is at least 30 Nucleotide, is more preferably at least 50 Nucleotide, preferably more than at least 100 Nucleotide.The amplification technique (as PCR) that nucleic acid fragment can be used for nucleic acid has the proteic polynucleotide of cancer suppressing function to determine and/or to separate to encode.
Polypeptide among the present invention and polynucleotide preferably provide with isolating form, more preferably are purified to homogeneous.
Dna sequence dna of the present invention can obtain with several method.For example, with hybridization technique DNA isolation well known in the art.These technology including, but not limited to: 1) with probe and genome or the hybridization of cDNA library to detect homology nucleotide sequence and 2) antibody screening of expression library to be to detect the dna fragmentation of the clone with common structure feature.
The proteic specific DNA fragment sequence that coding has cancer suppressing function produces also and can obtain with following method: 1) separate double chain DNA sequence from genomic dna; 2) the chemical synthesising DNA sequence is to obtain the double-stranded DNA of required polypeptide.
In the above-mentioned method of mentioning, isolation of genomic DNA is least commonly used.When the whole aminoacid sequence of the polypeptide product of needs was known, the direct chemical of dna sequence dna is synthetic to be the method for often selecting for use.When if required amino acid whose whole sequence is not known, the direct chemical of dna sequence dna is synthetic to be impossible, and the method for selecting for use is the separation of cDNA sequence.The standard method that separates interested cDNA is from the donorcells separating mRNA of this gene of high expression level and carries out reverse transcription, forms plasmid or phage cDNA library.Extract the existing multiple proven technique of method of mRNA, test kit also can obtain (Qiagene) from commercial channels.And the construction cDNA library also is usual method (Sambrook, et al., Molecular Cloning, A Laboratory Manual, Cold Spring Harbor Laboratory.New York, 1989).Also can obtain the cDNA library of commercial offers, as the different cDNA library of Clontech company.When being used in combination the polymeric enzyme reaction technology, even few expression product also can be cloned.
Available ordinary method is screened gene of the present invention from these cDNA libraries.These methods include, but is not limited to: (1) DNA-DNA or DNA-RNA hybridization; (2) function of marker gene occurs or forfeiture; (3) mensuration has the level of the proteic transcript of cancer suppressing function; (4), detect the protein product of genetic expression by immunological technique or mensuration biologic activity.Aforesaid method can singly be used, but also several different methods combined utilization.
In (1) kind method, hybridizing used probe is and any a part of homology of polynucleotide of the present invention that at least 15 Nucleotide of its length better are at least 30 Nucleotide, are more preferably at least 50 Nucleotide, preferably at least 100 Nucleotide.In addition, the length of probe within 2kb, preferably is within the 1kb usually.Probe used herein is the dna sequence dna of chemosynthesis on the basis of gene DNA sequence information of the present invention normally.Gene of the present invention itself or fragment are certainly as probe.The mark of dna probe can be used radio isotope, fluorescein or enzyme (as alkaline phosphatase) etc.
In (4) kind method, detect the protein product of protein gene expression and can use immunological technique such as Western blotting, radioimmunoprecipitation, enzyme-linked immunosorbent assay (ELISA) etc. with cancer suppressing function.
Use method (Saiki, the et al.Science 1985 of round pcr DNA amplification/RNA; 230:1350-1354) be optimized for acquisition gene of the present invention.When particularly being difficult to obtain the cDNA of total length from the library, can preferably use RACE method (the terminal rapid amplifying method of RACE-cDNA), the primer that is used for PCR can suitably be selected according to sequence information of the present invention disclosed herein, and available ordinary method is synthetic.Available ordinary method is as the DNA/RNA fragment by gel electrophoresis separation and purifying amplification.
The gene of the present invention that obtains as mentioned above, perhaps the available ordinary method of mensuration of the nucleotide sequence of various dna fragmentations etc. such as dideoxy chain termination (Sanger et al.PNAS, 1977,74:5463-5467).This class nucleotide sequencing is available commercial sequencing kit etc. also.In order to obtain the cDNA sequence of total length, order-checking need be carried out repeatedly.Sometimes need to measure a plurality of clones' cDNA sequence, just can be spliced into the cDNA sequence of total length.
The present invention also relates to comprise the carrier of polynucleotide of the present invention, and the host cell that produces through genetically engineered with carrier of the present invention or albumen coded sequence with cancer suppressing function, and the method that produces polypeptide of the present invention through recombinant technology.
Recombinant DNA technology (Science, 1984 by routine; 224:1431), can utilize polymerized nucleoside acid sequence of the present invention to can be used to express or produce the protein polypeptide with cancer suppressing function of reorganization.In general following steps are arranged:
(1). have the proteic polynucleotide of people (or varient) of cancer suppressing function with coding of the present invention, or transform or the transduction proper host cell with the recombinant expression vector that contains these polynucleotide;
(2). the host cell of in suitable medium, cultivating;
(3). separation, protein purification from substratum or cell.
Among the present invention, the people's albumen polynucleotide sequence with cancer suppressing function can be inserted in the recombinant expression vector.Term " recombinant expression vector " refers to that bacterial plasmid well known in the art, phage, yeast plasmid, vegetable cell virus, mammalian cell virus are as adenovirus, retrovirus or other carriers.The carrier of Shi Yonging includes but not limited in the present invention: and the expression vector based on T7 of in bacterium, expressing (Rosenberg, et al.Gene, 1987,56:125); The pMSXND expression vector of in mammalian cell, expressing (Lee and Nathans, J Bio Chem.263:3521,1988) and at the carrier that derives from baculovirus of expressed in insect cells.In a word, as long as can duplicate in host and stablize, any plasmid and carrier can be used.A key character of expression vector is to contain replication orgin, promotor, marker gene and translation controlling elements usually.
Method well-known to those having ordinary skill in the art can be used to make up and contains people's encoding histone dna sequence dna with cancer suppressing function and suitable transcribing/the translate expression vector of control signal.These methods comprise (Sambroook, et al.) such as extracorporeal recombinant DNA technology, DNA synthetic technology, the interior recombinant technologys of body.Described dna sequence dna can effectively be connected on the suitable promotor in the expression vector, and is synthetic to instruct mRNA.The representative example of these promotors has: colibacillary lac or trp promotor; Lambda particles phage P LPromotor; Eukaryotic promoter comprises LTRs and some other known may command gene expression promoter in protokaryon or eukaryotic cell or its virus of CMV immediate early promoter, early stage and late period SV40 promotor, retrovirus.Expression vector also comprises ribosome bind site and the transcription terminator that translation initiation is used.
In addition, expression vector preferably comprises one or more selected markers, to be provided for selecting the phenotypic character of transformed host cells, cultivate Tetrahydrofolate dehydrogenase, neomycin resistance and the green fluorescent protein (GFP) of usefulness as eukaryotic cell, or be used for colibacillary tsiklomitsin or amicillin resistance.
Comprise the carrier of above-mentioned suitable dna sequence dna and suitable promotor or control sequence, can be used to transform appropriate host cell, so that it can marking protein.
Host cell can be a prokaryotic cell prokaryocyte, as bacterial cell; Or eukaryotic cell such as low, as yeast cell; Or higher eucaryotic cells, as mammalian cell.Representative example has: intestinal bacteria, streptomyces; The bacterial cell of Salmonella typhimurium; Fungal cell such as yeast; Vegetable cell; The insect cell of fruit bat S2 or Sf9; The zooblast of CHO, COS or Bowes melanoma cells etc.
When polynucleotide of the present invention are expressed in higher eucaryotic cells, be enhanced if will make to transcribe when in carrier, inserting enhancer sequence.Enhanser is the cis acting factor of DNA, and nearly 10 to 300 base pairs act on promotor transcribing with enhancing gene usually.Can for example be included in the SV40 enhanser of 100 to 270 base pairs of replication origin side in late period one, at the polyoma enhanser of replication origin side in late period one and adenovirus enhanser etc.
Persons skilled in the art all know how to select appropriate carriers, promotor, enhanser and host cell.
Can carry out with routine techniques well known to those skilled in the art with the recombinant DNA transformed host cell.When the host was prokaryotic organism such as intestinal bacteria, the competent cell that can absorb DNA can be used CaCl in exponential growth after date results 2Method is handled, and used step is well-known in this area.Alternative is to use MgCl 2If desired, transforming also the method for available electroporation carries out.When the host is an eukaryote, can select following DNA transfection method for use: coprecipitation of calcium phosphate method, conventional mechanical method such as microinjection, electroporation, liposome packing etc.
The transformant that obtains can be cultivated with ordinary method, expresses the polypeptide of coded by said gene of the present invention.According to used host cell, used substratum can be selected from various conventional substratum in the cultivation.Under the condition that is suitable for the host cell growth, cultivate.After host cell grows into suitable cell density, induce the promotor of selection with suitable method (as temperature transition or chemical induction), cell is cultivated for some time again.
Recombinant polypeptide in the above methods can wrap by in cell, extracellular or on cytolemma, express or be secreted into the extracellular.If desired, can utilize its physics, the separating by various separation methods with other characteristic and the albumen of purification of Recombinant of chemistry.These methods are well-known to those skilled in the art.The example of these methods includes, but are not limited to: conventional renaturation handles, with protein precipitant handle (salt analysis method), centrifugal, the broken bacterium of infiltration, superly handle, the combination of super centrifugal, sieve chromatography (gel-filtration), adsorption chromatography, ion exchange chromatography, high performance liquid chromatography (HPLC) and other various liquid chromatography (LC) technology and these methods.
The people's albumen or the polypeptide with cancer suppressing function of reorganization are of use in many ways.These purposes include, but is not limited to: directly have the disease (as cancer) due to the low or forfeiture of the protein function of cancer suppressing function as pharmacological agent and be used to screen and promote or antagonism has antibody, polypeptide or other part of the protein function of cancer suppressing function.For example, antibody can be used for activating or suppressing to have the proteic function of people of cancer suppressing function.The people's protein screening peptide library that has a cancer suppressing function with the reorganization of expressing can be used for seeking the peptide molecule that can suppress or stimulate the people's protein function with cancer suppressing function of therapeutic value.
The present invention also provides screening of medicaments to improve (agonist) or check the method that (antagonist) has the proteic medicament of people of cancer suppressing function to identify.Agonist improves the biological function such as stimulate cellular proliferation of the people's albumen with cancer suppressing function, and antagonist prevention disorder such as the various cancer relevant with cell hyperproliferation with treatment.For example, can be in the presence of medicine, the proteic film preparation of people that mammalian cell or expression is had cancer suppressing function is cultivated with the people's albumen with cancer suppressing function of mark.Measure the medicine raising then or check this interactional ability.
The proteic antagonist of people with cancer suppressing function comprises antibody, compound, acceptor disappearance thing and the analogue etc. that filter out.Described antagonist can and be eliminated its function with the people's protein binding with cancer suppressing function, or suppresses to have the proteic generation of people of cancer suppressing function, or combines with the avtive spot of polypeptide and to make polypeptide can not bring into play biological function.The proteic antagonist of people with cancer suppressing function can be used for therepic use.
In screening during as the compound of antagonist, albumen of the present invention can be added during bioanalysis measures, determine by measuring albumen and the interaction between its acceptor that compounds affect has cancer suppressing function whether compound is antagonist.With the same quadrat method of above-mentioned SCREENED COMPOUND, can filter out the acceptor disappearance thing and the analogue of antagonist action.
Polypeptide of the present invention can be directly used in disease treatment, for example, and various malignant tumours and cellular abnormality propagation etc.
Polypeptide of the present invention, and fragment, derivative, analogue or their cell can be used as antigen to produce antibody.These antibody can be polyclone or monoclonal antibody.Polyclonal antibody can obtain by the method with this polypeptide direct injection animal.The technology of preparation monoclonal antibody comprises hybridoma technology, three knurl technology, people B-quadroma technology, EBV-hybridoma technology etc.
Can be with polypeptide of the present invention and antagonist and suitable pharmaceutical carrier combination back use.These carriers can be water, glucose, ethanol, salt, damping fluid, glycerine and their combination.Composition comprises the polypeptide or the antagonist of safe and effective amount and carrier and the vehicle that does not influence effect of drugs.These compositions can be used as medicine and are used for disease treatment.
The present invention also provides medicine box or the test kit that contains one or more containers, and one or more medicinal compositions compositions of the present invention are housed in the container.With these containers, can have by the given indicative prompting of government authorities of making, using or selling medicine or biological products, the government authorities that this prompting reflects production, uses or sells permits it to use on human body.In addition, polypeptide of the present invention can be used in combination with other treatment compound.
Pharmaceutical composition can be with mode administration easily, as by in part, intravenously, intraperitoneal, intramuscular, subcutaneous, the nose or the route of administration of intracutaneous.Albumen with cancer suppressing function comes administration with the amount that treats and/or prevents concrete indication effectively.The proteic amount with cancer suppressing function and the dosage range that are applied to the patient will depend on many factors, as administering mode, person's to be treated healthiness condition and diagnostician's judgement.
The proteic polynucleotide of people with cancer suppressing function also can be used for multiple therapeutic purpose.Gene therapy technology can be used for treating since have that the proteic nothing of cancer suppressing function is expressed or the proteic expression with cancer suppressing function of unusual/non-activity due to cell proliferation, growth or metabolic disturbance.The gene therapy vector of reorganization can be used for treating the protein expression with cancer suppressing function or the disease of active caused by abnormal.Deriving from the expression vector of virus such as protein gene that retrovirus, adenovirus, adeno-associated virus (AAV), hsv, parvovirus etc. can be used for having cancer suppressing function is transferred in the cell.The method that structure carries the recombinant viral vector of the protein gene with cancer suppressing function be found in existing document (Sambrook, etal.).The people protein gene of reorganization with cancer suppressing function can be packaged in the liposome and be transferred in the cell in addition.
Suppress to have cancer suppressing function people's protein mRNA oligonucleotide (comprising sense-rna and DNA) and ribozyme also within the scope of the invention.Ribozyme is the enzyme sample RNA molecule that a kind of energy specificity is decomposed specific RNA, and its mechanism of action is to carry out the endonuclease effect after ribozyme molecule and the hybridization of complementary target RNA-specific.The RNA of antisense and DNA and ribozyme can obtain with existing any RNA or DNA synthetic technology, as the technology widespread use of solid phase phosphoamide chemical synthesis synthetic oligonucleotide.Antisense rna molecule can be transcribed acquisition by the dna sequence dna of this RNA that encodes in external or body.This dna sequence dna has been incorporated into the downstream of rna polymerase promoter of carrier.In order to increase the stability of nucleic acid molecule, available several different methods is modified it, and as increasing the sequence length of both sides, the connection between the ribonucleoside is used phosphoric acid thioester bond or peptide bond but not phosphodiester bond.
Polynucleotide imports tissue or intracellular method comprises: directly be injected into polynucleotide in the in-vivo tissue; Or external by carrier (as virus, phage or plasmid etc.) earlier with the polynucleotide transfered cell in, again cell is transplanted in the body etc.
Polypeptide of the present invention also can be used as the peptide spectrum analysis, for example, the polypeptide available physical, chemistry or enzyme carry out the specificity cutting, and carry out the two-dimentional or three-dimensional gel electrophoresis analysis of one dimension.
The present invention also provides the antibody at the people's proteantigen determinant with cancer suppressing function.These antibody include, but is not limited to: the fragment that polyclonal antibody, monoclonal antibody, chimeric antibody, single-chain antibody, Fab fragment and Fab expression library produce.These antibody can prepare with ordinary method.The anti-proteic antibody of people with cancer suppressing function can be used in the immunohistochemistry technology, detects the people's albumen with cancer suppressing function in the biopsy specimen.
With the also available labelled with radioisotope of the protein bound monoclonal antibody of the people with cancer suppressing function, inject in the body and can follow the tracks of its position and distribution.Antibody among the present invention can be used for treating or prevents and the relevant disease of people's albumen with cancer suppressing function.The antibody that gives suitable dosage can stimulate or block proteic generation of the people with cancer suppressing function or activity.
Antibody also can be used for designing the immunotoxin at a certain privileged sites in the body.As have cancer suppressing function people's albumen high-affinity monoclonal antibody can with bacterium or plant poison (as diphtheria toxin, ricin, abrine etc.) covalent attachment.
Available people's albumen or the polypeptide immune animal of the production of polyclonal antibody with cancer suppressing function, as rabbit, mouse, rat etc.Multiple adjuvant can be used for the enhancing immunity reaction, includes but not limited to freund's adjuvant etc.
Have cancer suppressing function people's protein monoclonal antibody can with hybridoma technology production (Kohler and Milstein.Nature, 1975,256:495-497).With the variable region bonded chimeric antibody in human constant region and inhuman source can with existing technology production (Morrison et al, PNAS, 1985,81:6851).And the technology of existing manufacture order chain antibody (U.S.PatNo.4946778) also can be used for producing the anti-proteic single-chain antibody of people with cancer suppressing function.
Can be incorporated into the rondom polypeptide storehouse that solid formation forms by the various amino acid that may make up by screening with the protein bound peptide molecule of the present invention obtains.During screening, must carry out mark to people's protein molecular with cancer suppressing function.
The invention still further relates to quantitatively and detection and localization has the diagnostic testing process of people's protein level of cancer suppressing function.These tests are known in the art, and comprise that FISH measures and radioimmunoassay.The people's protein level that is detected in the test with cancer suppressing function, the disease that can have the importance of people's albumen in various diseases of cancer suppressing function with laying down a definition and be used to diagnose albumen to work with cancer suppressing function.
Proteic polynucleotide with cancer suppressing function can be used for having the diagnosis and the treatment of the protein related diseases of cancer suppressing function.Aspect diagnosis, the proteic polynucleotide with cancer suppressing function can be used for detecting have cancer suppressing function proteic expression whether or under morbid state, have an abnormal exprssion of cancer suppressing function.As the protein D NA sequence with cancer suppressing function can be used for the hybridization of biopsy specimen is had with judgement the proteic abnormal expression of cancer suppressing function.Hybridization technique comprises the Southern blotting, Northern blotting, in situ hybridization etc.These technological methods all are disclosed mature technologies, and relevant test kit all can obtain from commercial channels.Part or all of polynucleotide of the present invention can be used as probe stationary on microarray (Microarray) or DNA chip (being called " gene chip " again), is used for analyzing the differential expression analysis and the gene diagnosis of tissue gene.Carry out RNA-polymerase chain reaction (RT-PCR) amplification in vitro with the special primer of the albumen with cancer suppressing function and also can detect proteic transcription product with cancer suppressing function.
The sudden change that detection has the protein gene of cancer suppressing function also can be used for diagnosing the relevant disease of albumen with cancer suppressing function.Form with protein mutation of cancer suppressing function comprises that to have point mutation that the protein D NA sequence of cancer suppressing function compares, transposition, disappearance, reorganization and other any unusual etc. with normal wild type.Available existing technology such as Southern blotting, dna sequence analysis, PCR and in situ hybridization detect sudden change.In addition, sudden change might influence proteic expression, therefore can judge indirectly that with Northern blotting, Western blotting gene has or not sudden change.
Sequence of the present invention identifies it also is valuable to karyomit(e).These sequences can be specifically at certain bar human chromosome particular location and and can with its hybridization.At present, need to identify the concrete site of each gene on the karyomit(e).Yet have only chromosomal marker thing seldom to can be used for the marker chromosomes position now based on actual sequence data (repetition polymorphism).For these sequences are associated with disease related gene.The first step is positioned dna sequence dna of the present invention on the karyomit(e) exactly.
In brief, prepare PCR primer (preferred 15-35bp), sequence can be positioned on the karyomit(e) according to cDNA.Then, these primers are used for the somatocyte hybrid cell that the PCR screening contains each bar human chromosome.Have only those hybrid cells that contain corresponding to the people's gene of primer can produce the fragment of amplification.
The PCR localization method of somatocyte hybrid cell is that DNA is navigated to concrete chromosomal quick method.Use Oligonucleolide primers of the present invention,, can utilize one group to realize inferior location from specific chromosomal fragment or a large amount of genomic clone by similar approach.Other the similar strategy that can be used for chromosomal localization comprises in situ hybridization, uses the karyomit(e) prescreen and the hybridization preliminary election of the airflow classification of mark, thereby makes up the special cDNA storehouse of karyomit(e).
The cDNA clone is carried out fluorescence in situ hybridization (FISH) with Metaphase Chromosome, can in a step, accurately carry out chromosomal localization.The summary of this technology is referring to Verma etc., Human Chromosomes:a Manual of BasicTechniques, Pergamon Press, New York (1988).
In case sequence is positioned to chromosome position accurately, the physical location of this sequence on karyomit(e) just can be associated with the gene map data.These data for example are found in, V.Mckusick, Mendelian Inheritance in Man (can by with the online acquisition of Johns Hopkins University Welch Medical Library).Can pass through linkage analysis then, determine gene and navigated to relation between the disease on the chromosomal region already.
Then, need to measure ill and not cDNA between diseased individuals or genome sequence difference.If observe certain sudden change in some or all of diseased individuals, and this sudden change is not observed in any normal individual, then this sudden change may be the cause of disease of disease.More ill and diseased individuals not is usually directed at first seek the variation of structure in the karyomit(e), as from the horizontal visible of karyomit(e) or use based on detectable disappearance of the PCR of cDNA sequence or transposition.
Pyrenoids thuja acid full length sequence or its fragment with cancer suppressing function of the present invention can obtain with the method for pcr amplification method, recombination method or synthetic usually.For the pcr amplification method, can be disclosed according to the present invention about nucleotide sequence, especially open reading frame sequence designs primer, and with commercially available cDNA storehouse or by the prepared cDNA storehouse of ordinary method well known by persons skilled in the art as template, amplification and must relevant sequence.When sequence is longer, usually needs to carry out twice or pcr amplification repeatedly, and then the fragment that each time amplifies is stitched together by proper order.
In case obtained relevant sequence, just can obtain relevant sequence in large quantity with recombination method.This normally is cloned into carrier with it, changes cell again over to, separates obtaining relevant sequence then from the host cell after the propagation by ordinary method.
In addition, also the method for available synthetic is synthesized relevant sequence, especially fragment length more in short-term.Usually, by first synthetic a plurality of small segments, and then connect and to obtain the very long fragment of sequence.
At present, can be fully come the dna sequence dna of code book invention albumen (or its fragment, or derivatives thereof) by chemosynthesis.This dna sequence dna can be introduced then in the various dna moleculars (as carrier) and cell in this area.In addition, also can will suddenly change and introduce in the protein sequence of the present invention by chemosynthesis.
In addition, because the albumen with cancer suppressing function of the present invention has the natural acid sequence that is derived from the people, therefore, compare with the albumen of the same clan that derives from other species, estimate to have higher active and/or lower side effect (for example in the intravital immunogenicity of people lower or do not have) being applied to man-hour.
Below in conjunction with specific embodiment, further set forth the present invention.Should be understood that these embodiment only to be used to the present invention is described and be not used in and limit the scope of the invention.The experimental technique of unreceipted actual conditions in the following example, usually according to people such as normal condition such as Sambrook, molecular cloning: laboratory manual (New York:Cold Spring Harbor LaboratoryPress, 1989) condition described in, or the condition of advising according to manufacturer.
The acquisition of embodiment 1:cDNA gene and the restraining effect that the cancer cells clone is formed
FP977, FP1147, FP1188, FP1193, FP1477, FP1572, FP2568, FP2653, FP2823, FP2860, FP2926 and FP3235 come from self-built human fetal cDNA library.Get the fetal tissue at 3,6,9 monthly ages, (GIBCO BRL company) extracts total RNA by manufacturer's specification sheets with Trizol reagent, extracts mRNA with the mRNA test kit (Pharmacia company) of purifying.Make up the cDNA library of above-mentioned mRNA with pCMV-script TMXR cDNA library construction test kit (Stratagene company).Wherein ThermoScript II is used MMLV-RT-Superscript II (GIBCO BRL) instead, and reverse transcription reaction carries out at 42 ℃.Transform XL 10-Gold recipient cell, obtained 1 * 10 6The cDNA library of cfu/ μ g cDNA titre.The first round is picking cDNA clone at random, is probe with high abundance cDNA clone with the cDNA clone who has proved cancer inhibitor cell growth function thereafter, screening by hybridization cDNA library, weak positive and negative clone of picking.With Qiagen 96 orifice plate plasmid extraction test kits, carry out the extraction of plasmid DNA by shop instruction.Plasmid DNA and empty carrier transfection simultaneously hepatoma cell line 7721.After the 100ng DNA alcohol precipitation drying, add 6 μ l H 2Transfection is treated in the O dissolving.Add 0.74 μ l liposome and 9.3 μ l serum-free mediums in every part of DNA sample, behind the mixing, room temperature was placed 10 minutes.Add 150 μ l serum-free mediums in every pipe, divide equally and add 3 holes and grow in 7721 cells of 96 orifice plates, placed 2 hours for 37 ℃, every hole adds 50 μ l serum-free mediums again, 37 ℃ 24 hours.Every hole is changed 100 μ l and is trained liquid entirely, 37 ℃ 24 hours, change the full training liquid 100 μ l that contain G418,37 ℃ 24-48 hour, the limit is observed, the training liquid that G418 concentration does not wait is changed on the limit.After about 2-3 time, there is the clone to form up to the microscopy cell, counting.Find that above-mentioned clone has anticancer clone formation effect, the result is as shown in the table.
CDNA clone's transfectional cell (7721) clone formation situation
CDNA clones title CDNA clones number (three repetitions) Empty carrier clone number (three repetitions)
FP977 0 0 0 15 18 13
FP1147 12 10 11 15 18 13
FP1188 2 1 4 15 18 13
FP1193 4 4 4 15 18 13
FP1477 2 2 3 15 18 13
FP1572 7 5 9 15 18 13
FP2568 0 0 0 15 18 13
FP2653 8 11 10 15 18 13
FP2823 5 4 8 15 18 13
FP2860 13 10 4 15 18 13
FP2926 7 4 10 15 18 13
FP3235 2 5 7 15 18 13
The cDNA clone is adopted two deoxidation cessation method, on the ABI377 automatic dna sequencer, measure the nucleotide sequence of the nearly 500bp of one end.After the analysis, be defined as novel gene cloning, carry out the other end order-checking, do not obtain full length cDNA sequence yet, the design primer checks order once more, up to obtaining full length sequence (SEQ ID NO:1,4,7,10,13,16,19,22,25,28,31,34).
Embodiment 2:PCR method obtains the acquisition of full-length gene and recombinant protein
Get fetal tissue, (GIBCO BRL company) extracts total RNA by manufacturer's specification sheets with Trizol reagent, extracts mRNA with the mRNA test kit (Pharmacia company) of purifying.With MMLV-RT-Superscript II (GIBCO BRL), ThermoScript II is carried out reverse transcription reaction at 42 ℃, obtains fetus cDNA.Utilize the special primer (as shown in the table) of each gene, by 97 ℃ of 3 ' 1 circulation.94 ℃ 30 " 60 ℃ 30 " 72 ℃ 1 ' totally 35 circulations, 72 ℃ 10 ', pcr amplification is carried out in 1 circulation, and acquisition contains the amplified production of each protein gene of complete open reading frame sequence.Amplified production is through sequence verification, and the sequence that records with embodiment 1 conforms to, and changes amplified production over to host cell with routine techniques subsequently, obtains recombinant protein (SEQ ID NO:2,5,8,11,14,17,20,23,26,29,32,35).
Gene specific primer
Clone's title Special primer 1 (5 ' → 3 ') Special primer 2 (5 ' → 3 ')
FP977 (338)CACAGTACCCAAACAACGC GACGATAGGGAACCGATT(2549)
FP1147 (212)GAGCTAGAATGGAATCCCTGAA GTCATCGAAGTGTTTCACGGTT(1326)
FP1188 (101)CTCCCGACTCCCCAGTTT CTAGACGCACTTGCGGTT(1872)
FP1193 (107)GCCCCACTGACCCAAGTC CGAAGACGGCATAGGGAT(2181)
FP1477 (252)CCCACATCTTCCATCACTTT CTTCCCTCGGTTATGCTCA(3222)
FP1572 (107)CCAATCCCAGACATCCTCA ACTTTCTGAGGCAACCCT(2403)
FP2568 (163)AGGAGGAAGATGGTCCAGG GTTGCCCTACATACGCCT(1647)
FP2653 (228)TGAGGGACAGAGCAAAGAC GACGAGGGCATCAGGAGT(2244)
FP2823 (60)GATTTGTTTGCCTTTTACCA TTACCTCTTTTACTTTGCTT(2627)
FP2860 (48)TGTTTAGGTTGTTTCCAGT TTGTATCACTCTGGAGGGT(1822)
FP2926 (231)AGCTGATGTGGCGCTGAG GGTTCTAGTGCGGTAACG(1867)
FP3235 (3)GGCGAGGCTACCCAGGCT CCGACTGGACCGTAACCC(1426)
Embodiment 3:cDNA cloned sequence is analyzed
1.FP977
A: nucleotide sequence (SEQ ID NO:1) length: 2657 bases
1 GGTAATTTGT CAAACTGATG TCTCAGGTAA GGTGCCACTC TACGCATTGC AGAGTTGTGG
61 CAGATACCCC AGACCTGTTC TCTGTTTCAT TTTCAAAAAT GCTTTCCAAC ATTGAACTTC
121 TCTCTCCTGC TAGCCCTAAA AGAAGATCCC CCTAGAAATC TCCTGGAAGA GAGGAAATCA
181 GATCAACTGG GGCTGCCTCA GACCTTGCAG CAGGAATTCT CCCTGATCAA TGTGCAAATC
241 CGGAATGTCA ATGTGGAGAT GGATGCGGCA GACAGGAGCT GCACAGTGTC TGTGCACTGC
301 AGCAACCATC GTGTCAAGAT GCTGGTGAAG TTCCCTGCAC AGTACCCAAA CAACGCCGCC
361 CCTTCCTTCC AGTTTATTAA CCCCACAACC ATCACATCCA CCATGAAAGC TAAGCTGCTG
421 AAGATCCTGA AGGACACAGC CCTGCAGAAA GTGAAGCGTG GCCAGAGCTG CCTGGAGCCC
481 TGCCTGCGCC AGCTCGTCTC CTGCCTTGAG TCCTTTGTGA ACCAGGAAGA CAGCGCTTCC
541 AGCAACCCGT TTGCACTCCC CAACTCTGTC ACTCCCCCCT TACCGACGTT TGCGCGGGTG
601 ACCACGGCTT ACGGGTCGTA CCAGGACGCC AACATTCCCT TTCCTAGGAC TTCTGGGGCC
661 AGGTTCTGCG GAGCAGGTTA CCTGGTATAT TTCACAAGGC CCATGACAAT GCATCGGGCG
721 GTGTCTCCCA CAGAGCCTAC TCCGAGATCT CTCTCAGCCT TGTCTGCTTA TCACACTGGC
781 TTGATCGCGC CCATGAAGAT CCGCACAGAG GCCCCTGGGA ACCTTCGTTT ATACAGTGGG
841 AGCCCCACTC GCAGCGAGAA AGAGCAGGTC TCCATCAGCT CCTTCTACTA CAAGGAGCGG
901 AAATCAAGAC GATGGAAAAG TAAGCGTGAG GGATCAGACT CTGGCAATCG ACAGATCAAG
961 GCTGCTGGGA AAGTCATCAT CCAGGATATT GCTTGCCTCC TGCCTGTTCA CAAATCGCTG
1021 GGAGAGCTGT ACATATTGAA TGTGAATGAT ATTCAGGAAA CATGTCAGAA GAATGCCGCC
1081 TCTGCCTTGC TCGTTGGAAG AAAGGATCTT GTCCAGGTTT GGTCGCTGGC TACGGTAGCT
1141 ACAGATCTTT GCCTTGGTCC GAAATCTGAC CCAGATTTGG AAACACCCTG GGCTCGACAT
1201 CCATTTGGGC GGCAGCTGCT GGAGTCCCTG TTGGCTCACT ATTGCCGGCT CCGGGATGTT
1261 CAGACACTGG CGATGCTCTG TAGCGTGTTT GAAGCCCAGT CTCGGCCTCA GGGGCTACCA
1321 AACCCCTTTG GGCCTTTTCC TAACCGTTCT TCTAATCTTG TGGTGTCCCA TAGTCGATAT
1381 CCTAGCTTTA CCTCTTCTGG TTCCTGCTCC AGTATGTCAG ACCCAGGGCT CAACACTGGC
1441 GGCTGGAACA TAGCGGGAAG AGAGGCAGAG CACTTGTCCT CCCCTTGGGG AGAATCCTCA
1501 CCAGAAGAGC TCCGCTTTGG GAGTCTGACC TACAGTGATC CCCGTGAGCG AGAACGCGAC
1561 CAGCATGATA AAAATAAAAG GCTCCTGGAC CCCGCCAATA CCCAGCAATT TGATGACTTT
1621 AAGAAATGCT ATGGGGAAAT CCTCTACCGT TGGGGTCTGA GAGAGAAGCG AGCTGAAGTG
1681 TTGAAGTTTG TCTCCTGTCC TCCTGACCCT CACAAAGGGA TCGAGTTCGG CGTGTACTGC
1741 AGCCACTGCC GGAGTGAGGT CCGTGGCACG CAGTGTGCCA TCTGCAAAGG CTTCACGTTC
1801 CAGTGTGCCA TCTGTCACGT GGCTGTGCGG GGATCGTCCA ATTTCTGCCT GACCTGTGGG
1861 CACGGTGGCC ACACCAGCCA CATGATGGAG TGGTTTCGGA CCCAGGAGGT GTGTCCCACC
1921 GGGTGTGGGT GCCACTGCCT GCTTGAAAGC ACTTTCTGAA CCTACAGAAG TTGGGTATTG
1981 TCTGAAATCC CAGAGGACCC ATAAGTGCCG GTGACAAGCT GTCTGTCAGG GGAGAGGCTC
2041 CAGAACCTGG GTTCGTCCCC AGTGAGACCG GAGGATGATC CCCCAAGGAC TGCGCAGCAT
2101 CAGCTCTTGG TGGGCCTCTG CCTTCTCTTC TGTTTGGCCA CCTGGTGTGG ATGTCACTGT
2161 GTGAAGATAA GGACAGAAGT GCAGAGCTGC GCTTTGTGTG TTGTCTATGT CGGCTGAGCT
2221 ACCAAGGTGG AAGTTTTCAT GGAGAAAAGC ACCTGGCTCC AGGGCCAGTG TTACAGTGTT
2281 ACCCTGTAAG GTGTTAGCCT TAAACCACCG AGCAGCGTTC TCTTGATGCC AGTGCAGAGA
2341 CCAGAGTCAG ATGCCCGAGG ACAGTGGGTA GGAATTTCAT CAACAAATGG ACCTATGGCA
2401 TCATGGCTTT AGAAGCTGGT ACATTTACTG AGCTGATGGA CAGTGGCCTT CTAAAATATG
2461 ACACTTAAAT TGTAAATATG CACTGTACTT AAGGATTCTT AAGATGTATT TTTTTGTTAT
2521 TTCTCCTCCA GCTGCTATCC CTTGGCTAAT AAAATTCTAG TAATTTGAAA AAAAAAAAAA
2581 AGAGAGAAAG TTAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA
2641 AAAAAAAAAA AAAAAAA
B: nucleotide sequence (SEQ ID NO:2) length: 566 amino acid
1 MDAADRSCTV SVHCSNHRVK MLVKFPAQYP NNAAPSFQFI NPTTITSTMK AKLLKILKDT
61 ALQKVKRGQS CLEPCLRQLV SCLESFVNQE DSASSNPFAL PNSVTPPLPT FARVTTAYGS
121 YQDANIPFPR TSGARFCGAG YLVYFTRPMT MHRAVSPTEP TPRSLSALSA YHTGLIAPMK
181 IRTEAPGNLR LYSGSPTRSE KEQVSISSFY YKERKSRRWK SKREGSDSGN RQIKAAGKVI
241 IQDIACLLPV HKSLGELYIL NVNDIQETCQ KNAASALLVG RKDLVQVWSL ATVATDLCLG
301 PKSDPDLETP WARHPFGRQL LESLLAHYCR LRDVQTLAML CSVFEAQSRP QGLPNPFGPF
361 PNRSSNLVVS HSRYPSFTSS GSCSSMSDPG LNTGGWNIAG REAEHLSSPW GESSPEELRF
421 GSLTYSDPRE RERDQHDKNK RLLDPANTQQ FDDFKKCYGE ILYRWGLREK RAEVLKFVSC
481 PPDPHKGIEF GVYCSHCRSE VRGTQCAICK GFTFQCAICH VAVRGSSNFC LTCGHGGHTS
541 HMMEWFRTQE VCPTGCGCHC LLESTF
C. Nucleotide and amino acid composite sequence (SEQ ID NO:3) clone number: FP977
Start code: 259ATG stops coding: 1957TGA protein molecular weight: 62904.43
(annotate: what (1) provided is the position that initial sum stops first Nucleotide of coding, and (2) molecular weight unit is dalton)
1 GGT AAT TTG TCA AAC TGA TGT CTC AGG TAA GGT GCC ACT CTA CGC ATT 48
49 GCA GAG TTG TGG CAG ATA CCC CAG ACC TGT TCT CTG TTT CAT TTT CAA 96
97 AAA TGC TTT CCA ACA TTG AAC TTC TCT CTC CTG CTA GCC CTA AAA GAA 144
145 GAT CCC CCT AGA AAT CTC CTG GAA GAG AGG AAA TCA GAT CAA CTG GGG 192
193 CTG CCT CAG ACC TTG CAG CAG GAA TTC TCC CTG ATC AAT GTG CAA ATC 240
241 CGG AAT GTC AAT GTG GAG ATG GAT GCG GCA GAC AGG AGC TGC ACA GTG 288
1 Met Asp Ala Ala Asp Arg Ser Cys Thr Val 10
89 TCT GTG CAC TGC AGC AAC CAT CGT GTC AAG ATG CTG GTG AAG TTC CCT 336
11 Ser Val His Cys Ser Asn His Arg Val Lys Met Leu Val Lys Phe Pro 26
337 GCA CAG TAC CCA AAC AAC GCC GCC CCT TCC TTC CAG TTT ATT AAC CCC 384
27 Ala Gln Tyr Pro Asn Asn Ala Ala Pro Ser Phe Gln Phe Ile Asn Pro 42
385 ACA ACC ATC ACA TCC ACC ATG AAA GCT AAG CTG CTG AAG ATC CTG AAG 432
43 Thr Thr Ile Thr Ser Thr Met Lys Ala Lys Leu Leu Lys Ile Leu Lys 58
433 GAC ACA GCC CTG CAG AAA GTG AAG CGT GGC CAG AGC TGC CTG GAG CCC 480
59 Asp Thr Ala Leu Gln Lys Val Lys Arg Gly Gln Ser Cys Leu Glu Pro 74
481 TGC CTG CGC CAG CTC GTC TCC TGC CTT GAG TCC TTT GTG AAC CAG GAA 528
75 Cys Leu Arg Gln Leu Val Ser Cys Leu Glu Ser Phe Val Asn Gln Glu 90
529 GAC AGC GCT TCC AGC AAC CCG TTT GCA CTC CCC AAC TCT GTC ACT CCC 576
91 Asp Ser Ala Ser Ser Asn Pro Phe Ala Leu Pro Asn Ser Val Thr Pro 106
577 CCC TTA CCG ACG TTT GCG CGG GTG ACC ACG GCT TAC GGG TCG TAC CAG 624
107 Pro Leu Pro Thr Phe Ala Arg Val Thr Thr Ala Tyr Gly Ser Tyr Gln 122
625 GAC GCC AAC ATT CCC TTT CCT AGG ACT TCT GGG GCC AGG TTC TGC GGA 672
123 Asp Ala Asn Ile Pro Phe Pro Arg Thr Ser Gly Ala Arg Phe Cys Gly 138
673 GCA GGT TAC CTG GTA TAT TTC ACA AGG CCC ATG ACA ATG CAT CGG GCG 720
139 Ala Gly Tyr Leu Val Tyr Phe Thr Arg Pro Met Thr Met His Arg Ala 154
721 GTG TCT CCC ACA GAG CCT ACT CCG AGA TCT CTC TCA GCC TTG TCT GCT 768
155 Val Ser Pro Thr Glu Pro Thr Pro Arg Ser Leu Ser Ala Leu Ser Ala 170
769 TAT CAC ACT GGC TTG ATC GCG CCC ATG AAG ATC CGC ACA GAG GCC CCT 816
171 Tyr His Thr Gly Leu Ile Ala Pro Met Lys Ile Arg Thr Glu Ala Pro 186
817 GGG AAC CTT CGT TTA TAC AGT GGG AGC CCC ACT CGC AGC GAG AAA GAG 864
187 Gly Asn Leu Arg Leu Tyr Ser Gly Ser Pro Thr Arg Ser Glu Lys Glu 202
865 CAG GTC TCC ATC AGC TCC TTC TAC TAC AAG GAG CGG AAA TCA AGA CGA 912
203 Gln Val Ser Ile Ser Ser Phe Tyr Tyr Lys Glu Arg Lys Ser Arg Arg 218
913 TGG AAA AGT AAG CGT GAG GGA TCA GAC TCT GGC AAT CGA CAG ATC AAG 960
219 Trp Lys Ser Lys Arg Glu Gly Ser Asp Ser Gly Asn Arg Gln Ile Lys 234
961 GCT GCT GGG AAA GTC ATC ATC CAG GAT ATT GCT TGC CTC CTG CCT GTT 1008
235 Ala Ala Gly Lys Val Ile Ile Gln Asp Ile Ala Cys Leu Leu Pro Val 250
1009 CAC AAA TCG CTG GGA GAG CTG TAC ATA TTG AAT GTG AAT GAT ATT CAG 1056
251 His Lys Ser Leu Gly Glu Leu Tyr Ile Leu Asn Val Asn Asp Ile Gln 266
1057 GAA ACA TGT CAG AAG AAT GCC GCC TCT GCC TTG CTC GTT GGA AGA AAG 1104
267 Glu Thr Cys Gln Lys Asn Ala Ala Ser Ala Leu Leu Val Gly Arg Lys 282
1105 GAT CTT GTC CAG GTT TGG TCG CTG GCT ACG GTA GCT ACA GAT CTT TGC 1152
283 Asp Leu Val Gln Val Trp Ser Leu Ala Thr Val Ala Thr Asp Leu Cys 298
1153 CTT GGT CCG AAA TCT GAC CCA GAT TTG GAA ACA CCC TGG GCT CGA CAT 1200
299 Leu Gly Pro Lys Ser Asp Pro Asp Leu Glu Thr Pro Trp Ala Arg His 314
1201 CCA TTT GGG CGG CAG CTG CTG GAG TCC CTG TTG GCT CAC TAT TGC CGG 1248
315 Pro Phe Gly Arg Gln Leu Leu Glu Ser Leu Leu Ala His Tyr Cys Arg 330
1249 CTC CGG GAT GTT CAG ACA CTG GCG ATG CTC TGT AGC GTG TTT GAA GCC 1296
331 Leu Arg Asp Val Gln Thr Leu Ala Met Leu Cys Ser Val Phe Glu Ala 346
1297 CAG TCT CGG CCT CAG GGG CTA CCA AAC CCC TTT GGG CCT TTT CCT AAC 1344
347 Gln Ser Arg Pro Gln Gly Leu Pro Asn Pro Phe Gly Pro Phe Pro Asn 362
1345 CGT TCT TCT AAT CTT GTG GTG TCC CAT AGT CGA TAT CCT AGC TTT ACC 1392
363 Arg Ser Ser Asn Leu Val Val Ser His Ser Arg Tyr Pro Ser Phe Thr 378
1393 TCT TCT GGT TCC TGC TCC AGT ATG TCA GAC CCA GGG CTC AAC ACT GGC 1440
379 Ser Ser Gly Ser Cys Ser Ser Met Ser Asp Pro Gly Leu Asn Thr Gly 394
1441 GGC TGG AAC ATA GCG GGA AGA GAG GCA GAG CAC TTG TCC TCC CCT TGG 1488
395 Gly Trp Asn Ile Ala Gly Arg Glu Ala Glu His Leu Ser Ser Pro Trp 410
1489 GGA GAA TCC TCA CCA GAA GAG CTC CGC TTT GGG AGT CTG ACC TAC AGT 1536
411 Gly Glu Ser Ser Pro Glu Glu Leu Arg Phe Gly Ser Leu Thr Tyr Ser 426
1537 GAT CCC CGT GAG CGA GAA CGC GAC CAG CAT GAT AAA AAT AAA AGG CTC 1584
427 Asp Pro Arg Glu Arg Glu Arg Asp Gln His Asp Lys Asn Lys Arg Leu 442
1585 CTG GAC CCC GCC AAT ACC CAG CAA TTT GAT GAC TTT AAG AAA TGC TAT 1632
443 Leu Asp Pro Ala Asn Thr Gln Gln Phe Asp Asp Phe Lys Lys Cys Tyr 458
1633 GGG GAA ATC CTC TAC CGT TGG GGT CTG AGA GAG AAG CGA GCT GAA GTG 1680
459 Gly Glu Ile Leu Tyr Arg Trp Gly Leu Arg Glu Lys Arg Ala Glu Val 474
1681 TTG AAG TTT GTC TCC TGT CCT CCT GAC CCT CAC AAA GGG ATC GAG TTC 1728
475 Leu Lys Phe Val Ser Cys Pro Pro Asp Pro His Lys Gly Ile Glu Phe 490
1729 GGC GTG TAC TGC AGC CAC TGC CGG AGT GAG GTC CGT GGC ACG CAG TGT 1776
491 Gly Val Tyr Cys Ser His Cys Arg Ser Glu Val Arg Gly Thr Gln Cys 506
1777 GCC ATC TGC AAA GGC TTC ACG TTC CAG TGT GCC ATC TGT CAC GTG GCT 1824
507 Ala Ile Cys Lys Gly Phe Thr Phe Gln Cys Ala Ile Cys His Val Ala 522
1825 GTG CGG GGA TCG TCC AAT TTC TGC CTG ACC TGT GGG CAC GGT GGC CAC 1872
523 Val Arg Gly Ser Ser Asn Phe Cys Leu Thr Cys Gly His Gly Gly His 538
1873 ACC AGC CAC ATG ATG GAG TGG TTT CGG ACC CAG GAG GTG TGT CCC ACC 1920
539 Thr Ser His Met Met Glu Trp Phe Arg Thr Gln Glu Val Cys Pro Thr 554
1921 GGG TGT GGG TGC CAC TGC CTG CTT GAA AGC ACT TTC TGA ACC TAC AGA 1968
555 Gly Cys Gly Cys His Cys Leu Leu Glu Ser Thr Phe *** 567
1969 AGT TGG GTA TTG TCT GAA ATC CCA GAG GAC CCA TAA GTG CCG GTG ACA 2016
2017 AGC TGT CTG TCA GGG GAG AGG CTC CAG AAC CTG GGT TCG TCC CCA GTG 2064
2065 AGA CCG GAG GAT GAT CCC CCA AGG ACT GCG CAG CAT CAG CTC TTG GTG 2112
2113 GGC CTC TGC CTT CTC TTC TGT TTG GCC ACC TGG TGT GGA TGT CAC TGT 2208
2209 GTC GGC TGA GCT ACC AAG GTG GAA GTT TTC ATG GAG AAA AGC ACC TGG 2256
2257 CTC CAG GGC CAG TGT TAC AGT GTT ACC CTG TAA GGT GTT AGC CTT AAA 2304
2305 CCA CCG AGC AGC GTT CTC TTG ATG CCA GTG CAG AGA CCA GAG TCA GAT 2352
2353 GCC CGA GGA CAG TGG GTA GGA ATT TCA TCA ACA AAT GGA CCT ATG GCA 2400
2401 TCA TGG CTT TAG AAG CTG GTA CAT TTA CTG AGC TGA TGG ACA GTG GCC 2448
2449 TTC TAA AAT ATG ACA CTT AAA TTG TAA ATA TGC ACT GTA CTT AAG GAT 2496
2497 TCT TAA GAT GTA TTT TTT TGT TAT TTC TCC TCC AGC TGC TAT CCC TTG 2544
2545 GCT AAT AAA ATT CTA GTA ATT TGA AAA AAA AAA AAA AGA GAG AAA GTT 2592
2593 AAA AAA AAA AAA AAA AAA AAA AAA AAA AAA AAA AAA AAA AAA AAA AAA 2640
2641 AAA AAA AAA AAA AAA AA 2657
2.FP1147
A: nucleotide sequence (SEQ ID NO:4) length: 2450 bases
1 GGGTATATAT AACTTTTAAA TTATGTCAAC TTTTCTGTTT GCCAACTAGT ATGTAAGTTC
61 CACAAGGGCA ATGATTTTTT ATCTGTTCTC TAGTTATCCT AAGTAATTAG AATAGCGCCC
121 AGCCCATAGC TGGAACTTAT GAGGAAATGA GGAAACGAGC TTCAGATAAT TTAAATAACT
181 TCCCAAGACT AAGTTTTATA GTTAGTTGGT GGAGCTAGAA TGGAATCCCT GAAGAGTGGC
241 TCAAAGGCCT TTCTCTTTAC CATGGCCTGT GCTGCCTTGC TATACTGATG CCATGAAGTT
301 AGTCCATCCT GAGGTCTAAC CCCTTTTGAT GGTGTGGAAC TCACATGGGG AAGGCGGACA
361 TTTGCCCACC TGGGCCTTCA ACTCCTTCAC TTGGAGCTGG CCTCTCCCCA TCATGCCAGC
421 TTCCCTGTGT GTGAATGGGT GTGTGTGTGT GTGTGAATGG GGTGTGTGAA GTGTGTATGT
481 GAATGGGGGT GTGTGTGAAT GGGGTGTGTG TGTGTGAATG ATGTGTGTGT GAATGAGGTG
541 TGTGTGAATG GGGTGTGTGT GTGTGAATGA GGTGTGTGTG TGAATGGGGT GTGTGTGTGA
601 ATGGGGTGTG TGTGTGCGGT TGATGACTAT CCGTGTGAGA ACACGGGAGT TGGGGCTGCT
661 CCTTCTCTTT GCCTCCTTGC CCAGGCACCT GAGGTCTTGG ACTCCTGTCC TAACTCTCAG
721 CCCATTAGAG GCTGCTGCCT CTGGACTAGA CTCAGGCTCT GACCTCACAC TCCATGCCTA
781 GGATTTCAGT CTGGGGTCCC AGTTCCCATC TGCCATTCCG GGCTTGGTCA TCAGCCTCTT
841 CCCAGAGGCC CAGGATGGGT GGATTTGGCA GGAGTATGGG GAAGGAAGGA AGAGCTTAGC
901 TTCCTTCCCT GTGGGGACCC TGTGAGGCAC AGCAGACTGG GCTGGGCCTG GTCCTGGGCT
961 CCAGCCTCCA GCCTCCACTC ACACCCTCCT CTTCCTCAGG ATCATCTTCT CCACGCCCCT
1021 GGCCGTCATT GCCTACTTCC TCATCTGGTT CGTGCCCGAC TTCCCACACG GCCAGACCTA
1081 TTGGTACCTG CTTTTCTATT GCCTCTTTGA AACAATGGTC ACGTGTTTCC ATGTTCCCTA
1141 CTCGGCTCTC ACCATGTTCA TCAGCACCGA GCAGACTGAG CGGGATTCTG CCACCGCCTA
1201 TCGGATGACT GTGGAAGTGC TGGGCACAGT GCTGGGCACG GCGATCCAGG GACAAATCGT
1261 GGGCCAAGCA GACACGCCTT GTTTCCAGGA CCTCAATAGC TCTACAGTAG CTTCACAAAG
1321 TGCCAACCAT ACACATGGCA CCACCTCACA CAGGGAAACG CAAAAGGCAT ACCTGCTGGC
1381 AGCGGGGGTC ATTGTCTGTA TCTATATAAT CTGTGCTGTC ATCCTGATCC TGGGGCGTGC
1441 GGGAGCAGAG AGCTCTCGGC CACTTTAACC ATTCCCATCT GGCAGTGGTT CTTGACCCGG
1501 TTTGGCAAGA AGACAGCTGT ATATGTTGGG ATCTCATCAG CAGTGCCATT TCTCATCTTG
1561 GTGGCCCTCA TGGAGAGTAA CCTCATCATT ACATATGCGG TAGCTGTGGC AGCTGGCATC
1621 AGTGTGGCAG CTGCCTTCTT ACTACCCTGG TCCATGCTGC CTGATGTCAT TGACGACTTC
1681 CATCTGAAGC AGCCCCACTT CCATGGAACC GAGCCCATCT TCTTCTCCTT CTATGTCTTC
1741 TTCACCAAGT TTGCCTCTGG AGTGTCACTG GGCATTTCTA CCCTCAGTCT GGACTTTGCA
1801 GGGTACCAGA CCCGTGGCTG CTCGCAGCCG GAACGTGTCA AGTTTACACT GAACATGCTC
1861 GTGACCATGG CTCCCATAGT TCTCATCCTG CTGGGCCTGC TGCTCTTCAA AATGTACCCC
1921 ATTGATGAGG AGAGGCGGCG GCAGAATAAG AAGGCCCTGC AGGCACTGAG GGACGAGGCC
1981 AGCAGCTCTG GCTGCTCAGA AACAGACTCC ACAGAGCTGG CTAGCATCCT CTAGGGCCCG
2041 CCACGTTGCC CGAAGCCACC ATGCAGAAGG CCACAGAAGG GATCAGGACC TGTCTGCCGG
2101 CTTGCTGAGC AGCTGGACTG CAGGTGCTAG GAAGGGAACT GAAGACTCAA GGAGGTGGCC
2161 CAGGACACTT GCTGTGCTCA CTGTGGGGCC GGCTGCTCTG TGGCCTCCTG CCTCCCCTCT
2221 GCCTGCCTGT GGGGCCAAGC CCTGGGGCTG CCACTGTGAA TATGCCAAGG ACTGATCGGG
2281 CCTAGCCCGG AACACTAATG TAGAAACCTT TTTTTACAGA GCCTAATTAA TAACTTAATG
2341 ACTGTGTACA TAGCAATGTG TGTGTATGTA TATGTCTGTG AGCTATTAAT GTTATTAATT
2401 TTCATAAAAG CTGGAAAGCA GCTGCCTGTT TCAAAAAAAA AAAAAAAAAA
B: nucleotide sequence (SEQ ID NO:5) length: 154 amino acid
1 MESNLIITYA VAVAAGISVA AAFLLPWSML PDVIDDFHLK QPHFHGTEPI FFSFYVFFTK
61 FASGVSLGIS TLSLDFAGYQ TRGCSQPERV KFTLNMLVTM APIVLILLGL LLFKMYPIDE
121 ERRRQNKKAL QALRDEASSS GCSETDSTEL ASIL
C. Nucleotide and amino acid composite sequence (SEQ ID NO:6) clone number: FP1147
Start code: 1570ATG stops coding: 2032TAG protein molecular weight: 17057.85
1 GGG TAT ATA TAA CTT TTA AAT TAT GTC AAC TTT TCT GTT TGC CAA CTA 48
49 GTA TGT AAG TTC CAC AAG GGC AAT GAT TTT TTA TCT GTT CTC TAG TTA 96
97 TCC TAA GTA ATT AGA ATA GCG CCC AGC CCA TAG CTG GAA CTT ATG AGG 144
145 AAA TGA GGA AAC GAG CTT CAG ATA ATT TAA ATA ACT TCC CAA GAC TAA 192
193 GTT TTA TAG TTA GTT GGT GGA GCT AGA ATG GAA TCC CTG AAG AGT GGC 240
241 TCA AAG GCC TTT CTC TTT ACC ATG GCC TGT GCT GCC TTG CTA TAC TGA 288
289 TGC CAT GAA GTT AGT CCA TCC TGA GGT CTA ACC CCT TTT GAT GGT GTG 336
337 GAA CTC ACA TGG GGA AGG CGG ACA TTT GCC CAC CTG GGC CTT CAA CTC 384
385 CTT CAC TTG GAG CTG GCC TCT CCC CAT CAT GCC AGC TTC CCT GTG TGT 432
433 GAA TGG GTG TGT GTG TGT GTG TGA ATG GGG TGT GTG AAG TGT GTA TGT 480
481 GAA TGG GGG TGT GTG TGA ATG GGG TGT GTG TGT GTG AAT GAT GTG TGT 528
529 GTG AAT GAG GTG TGT GTG AAT GGG GTG TGT GTG TGT GAA TGA GGT GTG 576
577 TGT GTG AAT GGG GTG TGT GTG TGA ATG GGG TGT GTG TGT GCG GTT GAT 624
625 GAC TAT CCG TGT GAG AAC ACG GGA GTT GGG GCT GCT CCT TCT CTT TGC 672
673 CTC CTT GCC CAG GCA CCT GAG GTC TTG GAC TCC TGT CCT AAC TCT CAG 720
721 CCC ATT AGA GGC TGC TGC CTC TGG ACT AGA CTC AGG CTC TGA CCT CAC 768
769 ACT CCA TGC CTA GGA TTT CAG TCT GGG GTC CCA GTT CCC ATC TGC CAT 816
817 TCC GGG CTT GGT CAT CAG CCT CTT CCC AGA GGC CCA GGA TGG GTG GAT 864
865 TTG GCA GGA GTA TGG GGA AGG AAG GAA GAG CTT AGC TTC CTT CCC TGT 912
913 GGG GAC CCT GTG AGG CAC AGC AGA CTG GGC TGG GCC TGG TCC TGG GCT 960
961 CCA GCC TCC AGC CTC CAC TCA CAC CCT CCT CTT CCT CAG GAT CAT CTT 1008
1009 CTC CAC GCC CCT GGC CGT CAT TGC CTA CTT CCT CAT CTG GTT CGT GCC 1056
1057 CGA CTT CCC ACA CGG CCA GAC CTA TTG GTA CCT GCT TTT CTA TTG CCT 1104
1105 CTT TGA AAC AAT GGT CAC GTG TTT CCA TGT TCC CTA CTC GGC TCT CAC 1152
1153 CAT GTT CAT CAG CAC CGA GCA GAC TGA GCG GGA TTC TGC CAC CGC CTA 1200
1201 TCG GAT GAC TGT GGA AGT GCT GGG CAC AGT GCT GGG CAC GGC GAT CCA 1248
1249 GGG ACA AAT CGT GGG CCA AGC AGA CAC GCC TTG TTT CCA GGA CCT CAA 1296
1297 TAG CTC TAC AGT AGC TTC ACA AAG TGC CAA CCA TAC ACA TGG CAC CAC 1344
1345 CTC ACA CAG GGA AAC GCA AAA GGC ATA CCT GCT GGC AGC GGG GGT CAT 1392
1393 TGT CTG TAT CTA TAT AAT CTG TGC TGT CAT CCT GAT CCT GGG GCG TGC 1440
1441 GGG AGC AGA GAG CTC TCG GCC ACT TTA ACC ATT CCC ATC TGG CAG TGG 1488
1489 TTC TTG ACC CGG TTT GGC AAG AAG ACA GCT GTA TAT GTT GGG ATC TCA 1536
1537 TCA GCA GTG CCA TTT CTC ATC TTG GTG GCC CTC ATG GAG AGT AAC CTC 1584
1 Met Glu Ser Asn Leu 5
1585 ATC ATT ACA TAT GCG GTA GCT GTG GCA GCT GGC ATC AGT GTG GCA GCT 1632
6 Ile Ile Thr Tyr Ala Val Ala Val Ala Ala Gly Ile Ser Val Ala Ala 21
1633 GCC TTC TTA CTA CCC TGG TCC ATG CTG CCT GAT GTC ATT GAC GAC TTC 1680
22 Ala Phe Leu Leu Pro Trp Ser Met Leu Pro Asp Val Ile Asp Asp Phe 37
1681 CAT CTG AAG CAG CCC CAC TTC CAT GGA ACC GAG CCC ATC TTC TTC TCC 1728
38 His Leu Lys Gln Pro His Phe His Gly Thr Glu Pro Ile Phe Phe Ser 53
1729 TTC TAT GTC TTC TTC ACC AAG TTT GCC TCT GGA GTG TCA CTG GGC ATT 1776
54 Phe Tyr Val Phe Phe Thr Lys Phe Ala Ser Gly Val Ser Leu Gly Ile 69
1777 TCT ACC CTC AGT CTG GAC TTT GCA GGG TAC CAG ACC CGT GGC TGC TCG 1824
70 Ser Thr Leu Ser Leu Asp Phe Ala Gly Tyr Gln Thr Arg Gly Cys Ser 85
1825 CAG CCG GAA CGT GTC AAG TTT ACA CTG AAC ATG CTC GTG ACC ATG GCT 1872
86 Gln Pro Glu Arg Val Lys Phe Thr Leu Asn Met Leu Val Thr Met Ala 101
1873 CCC ATA GTT CTC ATC CTG CTG GGC CTG CTG CTC TTC AAA ATG TAC CCC 1920
102 Pro Ile Val Leu Ile Leu Leu Gly Leu Leu Leu Phe Lys Met Tyr Pro 117
1921 ATT GAT GAG GAG AGG CGG CGG CAG AAT AAG AAG GCC CTG CAG GCA CTG 1968
118 Ile Asp Glu Glu Arg Arg Arg Gln Asn Lys Lys Ala Leu Gln Ala Leu 133
1969 AGG GAC GAG GCC AGC AGC TCT GGC TGC TCA GAA ACA GAC TCC ACA GAG 2016
134 Arg Asp Glu Ala Ser Ser Ser Gly Cys Ser Glu Thr Asp Ser Thr Glu 149
2017 CTG GCT AGC ATC CTC TAG GGC CCG CCA CGT TGC CCG AAG CCA CCA TGC 2064
150 Leu Ala Ser Ile Leu *** 155
2065 AGA AGG CCA CAG AAG GGA TCA GGA CCT GTC TGC CGG CTT GCT GAG CAG 2112
2113 CTG GAC TGC AGG TGC TAG GAA GGG AAC TGA AGA CTC AAG GAG GTG GCC 2160
2161 CAG GAC ACT TGC TGT GCT CAC TGT GGG GCC GGC TGC TCT GTG GCC TCC 2208
2209 TGC CTC CCC TCT GCC TGC CTG TGG GGC CAA GCC CTG GGG CTG CCA CTG 2256
2257 TGA ATA TGC CAA GGA CTG ATC GGG CCT AGC CCG GAA CAC TAA TGT AGA 2304
2305 AAC CTT TTT TTA CAG AGC CTA ATT AAT AAC TTA ATG ACT GTG TAC ATA 2352
2353 GCA ATG TGT GTG TAT GTA TAT GTC TGT GAG CTA TTA ATG TTA TTA ATT 2400
2401 TTC ATA AAA GCT GGA AAG CAG CTG CCT GTT TCA AAA AAA AAA AAA AAA 2448
2449 AA 2450
3.FP1188
A: nucleotide sequence (SEQ ID NO:7) length: 1902 bases
1 GACCAAGCCA TCCCCTCTTT GCCTCTCCCC ATCCCTCGCT GGCCTGCTCC TGCTCCCCTT
61 CTCCCATCCT CCCCTCCCCC GTCTCTGCCC AGCCAGCCCC CTCCCGACTC CCCAGTTTCA
121 TCGGACTCCC TGGCCCCATC CCGTCCCCGC CCTGGCCCCT TTGTGCCCCT TCCCATCGTT
181 TTCTCCCTCC TTCCCGGGCT TGGCGTCCCT TCTCCACCCC TAACTCCTTC CACTCGGCCT
241 CCCTGCCCCT TCACGGCCCG CCTGCCTCCC TGCCCAAGTC CTGAGCCACC ATGCTGACCC
301 CGATGGTGGC CGGGGGGGTG GTGTTCCCCG GACTCTTCCT CCTCTCCAAG AACACGCTCC
361 AGCGGCTGCC CCAGCTACGC TGGGAGGAGG CCGACGCAGT CATTGTCTCA GCCAGGCTGG
421 TGTCCTCTGT CCAGGCCATC ATGGCCTCCA CTGCCGGCTA CATCGTCTCC ACCTCCTGCA
481 AGCACATCAT TGATGACCAA CACTGGCTGT CCTCTGCCTA CACGCAATTT GCTGTGCCCT
541 ACTTCATCTA CGACATCTAC GCCATGTTCC TCTGTCACTG GCACAAGCAC CAGGTCAAAG
601 GGCATGGAGG GGACGACGGA GCGGCCAGAG CCCCGGGCAG CACGTGGGCC ATAGCGCGTG
661 GCTACCTGCA CAAGGAGTTC CTCATGGTGC TCCACCATGC CGCCATGGTG CTGGTGTGCT
721 TCCCACTCTC AGTGGTGTGG CGACAGGGGA AGGGAGACTT CTTTCTGGGT TGCATGTTGA
781 TGGCAGAGGT CAGCACGCCC TTCGTCTGCC TTGGCAAGAT CCTCATCCAG TACAAGCAGC
841 AGCACACACT GCTGCACAAG GTGAACGGGG CCCTGATGCT GCTCAGCTTC CTCTGCTGCC
901 GGGTGCTGCT CTTTCCCTAC CTGTACTGGG CCTACGGGCG CCATGCCGGC CTGCCCCTGC
961 TGGCCGTGCC CCTGGCCATC CCTGCCCACG TCAACCTGGG CGCTGCGCTG CTCCTGGCCC
1021 CTCAGCTCTA CTGGTTCTTC CTCATCTGCC GTGGGGCCTG CCGCCTCTTC TGGCCCCGCT
1081 CCCGGCCGCC CCCGGCCTGC CAGGCCCAGG ACTGAGGCCG GGGGCCGGGA CCCTCCCCCT
1141 CCCCACCCCC ACCCCCGTGG AGACAGGGCT CTGGGGCTGA TGGCTGGGGT TGGGAGCCAG
1201 GGTCCTCTTG CCCGGACAAC CCCAGGACTG ACGATGACCC CGAAAGGGAA GAGGCCCCAT
1261 CTCTCGGGGA CTGAGGGGGT GGAGAGAGGG GACCTCTTCC CCCTACTCTG CCCCCTTCCT
1321 GCACACCCTT GCGCTGGAGG AGGGGAGGGG GCACCGCCTC CCACCCACTG AGGGCAGGAG
1381 GGCTTGTGGG GAGGGACACC AACAGGGTTT CAAGGGGACC AGGAGTCAGA ATGTGGGGAG
1441 ACGCCTCTGC CAAGGCCATC CCAGCCCCTA TGCTGCCATC CCCCAGGGCT CCCCATCACC
1501 CGAGAGGAGA GGACGCCCCA ACTAACCCCC GCTGGCCCTC GGGCCTCCCG AGTGGCCGGC
1561 TGCAACCACA GCTCCTCTCC AGGGTAGGCC AGCTTGAGGA ATCTTATTTA TTTTATTTAT
1621 TTACCCAAAT TTGAACTAGT CTGTTGGGTT GGGGGAAGGA GGTGGCTGCT ACCCCCAAGC
1681 CTTCCCAGTG CTGACAACCC CGGGGGCAGG CGAGGGCGCC CAGTCCCTCA CCATCGGCTG
1741 CACATCGCGC CCTCGGGCCC TGCCATGTCC CTGGTGCTAC TGACCTCTCA AGGCTTCCTC
1801 CAATCTGGGG TCGGGGGACC CTGGGAGGTG CTTTACAGAC CGCTAATAAA AGACGATCTG
1861 CGTGAACGCC AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AA
B: nucleotide sequence (SEQ ID NO:8) length: 274 amino acid
1 MLTPMVAGGV VFPGLFLLSK NTLQRLPQLR WEEADAVIVS ARLVSSVQAI MASTAGYIVS
61 TSCKHIIDDQ HWLSSAYTQF AVPYFIYDIY AMFLCHWHKH QVKGHGGDDG AARAPGSTWA
121 IARGYLHKEF LMVLHHAAMV LVCFPLSVVW RQGKGDFFLG CMLMAEVSTP FVCLGKILIQ
181 YKQQHTLLHK VNGALMLLSF LCCRVLLFPY LYWAYGRHAG LPLLAVPLAI PAHVNLGAAL
241 LLAPQLYWFF LICRGACRLF WPRSRPPPAC QAQD
C. Nucleotide and amino acid composite sequence (SEQ ID NO:9) clone number: FP1188
Start code: 291ATG stops coding: 1113TGA protein molecular weight: 30627.70
1 GA CCA AGC CAT CCC CTC TTT GCC TCT CCC CAT CCC TCG CTG GCC TGC 47
48 TCC TGC TCC CCT TCT CCC ATC CTC CCC TCC CCC GTC TCT GCC CAG CCA 95
96 GCC CCC TCC CGA CTC CCC AGT TTC ATC GGA CTC CCT GGC CCC ATC CCG 143
144 TCC CCG CCC TGG CCC CTT TGT GCC CCT TCC CAT CGT TTT CTC CCT CCT 191
192 TCC CGG GCT TGG CGT CCC TTC TCC ACC CCT AAC TCC TTC CAC TCG GCC 239
240 TCC CTG CCC CTT CAC GGC CCG CCT GCC TCC CTG CCC AAG TCC TGA GCC 287
288 ACC ATG CTG ACC CCG ATG GTG GCC GGG GGG GTG GTG TTC CCC GGA CTC 335
1 Met Leu Thr Pro Met Val Ala Gly Gly Val Val Phe Pro Gly Leu 15
336 TTC CTC CTC TCC AAG AAC ACG CTC CAG CGG CTG CCC CAG CTA CGC TGG 383
16 Phe Leu Leu Ser Lys Asn Thr Leu Gln Arg Leu Pro Gln Leu Arg Trp 31
384 GAG GAG GCC GAC GCA GTC ATT GTC TCA GCC AGG CTG GTG TCC TCT GTC 431
32 Glu Glu Ala Asp Ala Val Ile Val Ser Ala Arg Leu Val Ser Ser Val 47
432 CAG GCC ATC ATG GCC TCC ACT GCC GGC TAC ATC GTC TCC ACC TCC TGC 479
48 Gln Ala Ile Met Ala Ser Thr Ala Gly Tyr Ile Val Ser Thr Ser Cys 63
480 AAG CAC ATC ATT GAT GAC CAA CAC TGG CTG TCC TCT GCC TAC ACG CAA 527
64 Lys His Ile Ile Asp Asp Gln His Trp Leu Ser Ser Ala Tyr Thr Gln 79
528 TTT GCT GTG CCC TAC TTC ATC TAC GAC ATC TAC GCC ATG TTC CTC TGT 575
80 Phe Ala Val Pro Tyr Phe Ile Tyr Asp Ile Tyr Ala Met Phe Leu Cys 95
576 CAC TGG CAC AAG CAC CAG GTC AAA GGG CAT GGA GGG GAC GAC GGA GCG 623
96 His Trp His Lys His Gln Val Lys Gly His Gly Gly Asp Asp Gly Ala 111
624 GCC AGA GCC CCG GGC AGC ACG TGG GCC ATA GCG CGT GGC TAC CTG CAC 671
112 Ala Arg Ala Pro Gly Ser Thr Trp Ala Ile Ala Arg Gly Tyr Leu His 127
672 AAG GAG TTC CTC ATG GTG CTC CAC CAT GCC GCC ATG GTG CTG GTG TGC 719
128 Lys Glu Phe Leu Met Val Leu His His Ala Ala Met Val Leu Val Cys 143
720 TTC CCA CTC TCA GTG GTG TGG CGA CAG GGG AAG GGA GAC TTC TTT CTG 767
144 Phe Pro Leu Ser Val Val Trp Arg Gln Gly Lys Gly Asp Phe Phe Leu 159
768 GGT TGC ATG TTG ATG GCA GAG GTC AGC ACG CCC TTC GTC TGC CTT GGC 815
160 Gly Cys Met Leu Met Ala Glu Val Ser Thr Pro Phe Val Cys Leu Gly 175
816 AAG ATC CTC ATC CAG TAC AAG CAG CAG CAC ACA CTG CTG CAC AAG GTG 863
176 Lys Ile Leu Ile Gln Tyr Lys Gln Gln His Thr Leu Leu His Lys Val 191
864 AAC GGG GCC CTG ATG CTG CTC AGC TTC CTC TGC TGC CGG GTG CTG CTC 911
192 Asn Gly Ala Leu Met Leu Leu Ser Phe Leu Cys Cys Arg Val Leu Leu 207
912 TTT CCC TAC CTG TAC TGG GCC TAC GGG CGC CAT GCC GGC CTG CCC CTG 959
208 Phe Pro Tyr Leu Tyr Trp Ala Tyr Gly Arg His Ala Gly Leu Pro Leu 223
960 CTG GCC GTG CCC CTG GCC ATC CCT GCC CAC GTC AAC CTG GGC GCT GCG 1007
224 Leu Ala Val Pro Leu Ala Ile Pro Ala His Val Asn Leu Gly Ala Ala 239
1008 CTG CTC CTG GCC CCT CAG CTC TAC TGG TTC TTC CTC ATC TGC CGT GGG 1055
240 Leu Leu Leu Ala Pro Gln Leu Tyr Trp Phe Phe Leu Ile Cys Arg Gly 255
1056 GCC TGC CGC CTC TTC TGG CCC CGC TCC CGG CCG CCC CCG GCC TGC CAG 1103
256 Ala Cys Arg Leu Phe Trp Pro Arg Ser Arg Pro Pro Pro Ala Cys Gln 271
1104 GCC CAG GAC TGA GGC CGG GGG CCG GGA CCC TCC CCC TCC CCA CCC CCA 1151
272 Ala Gln Asp *** 275
1152 CCC CCG TGG AGA CAG GGC TCT GGG GCT GAT GGC TGG GGT TGG GAG CCA 1199
1200 GGG TCC TCT TGC CCG GAC AAC CCC AGG ACT GAC GAT GAC CCC GAA AGG 1247
1248 GAA GAG GCC CCA TCT CTC GGG GAC TGA GGG GGT GGA GAG AGG GGA CCT 1295
1296 CTT CCC CCT ACT CTG CCC CCT TCC TGC ACA CCC TTG CGC TGG AGG AGG 1343
1344 GGA GGG GGC ACC GCC TCC CAC CCA CTG AGG GCA GGA GGG CTT GTG GGG 1391
1392 AGG GAC ACC AAC AGG GTT TCA AGG GGA CCA GGA GTC AGA ATG TGG GGA 1439
1440 GAC GCC TCT GCC AAG GCC ATC CCA GCC CCT ATG CTG CCA TCC CCC AGG 1487
1488 GCT CCC CAT CAC CCG AGA GGA GAG GAC GCC CCA ACT AAC CCC CGC TGG 1535
1536 CCC TCG GGC CTC CCG AGT GGC CGG CTG CAA CCA CAG CTC CTC TCC AGG 1583
1584 GTA GGC CAG CTT GAG GAA TCT TAT TTA TTT TAT TTA TTT ACC CAA ATT 1631
1632 TGA ACT AGT CTG TTG GGT TGG GGG AAG GAG GTG GCT GCT ACC CCC AAG 1679
1680 CCT TCC CAG TGC TGA CAA CCC CGG GGG CAG GCG AGG GCG CCC AGT CCC 1727
1728 TCA CCA TCG GCT GCA CAT CGC GCC CTC GGG CCC TGC CAT GTC CCT GGT 1775
1776 GCT ACT GAC CTC TCA AGG CTT CCT CCA ATC TGG GGT CGG GGG ACC CTG 1823
1824 GGA GGT GCT TTA CAG ACC GCT AAT AAA AGA CGA TCT GCG TGA ACG CCA 1871
1872 AAA AAA AAA AAA AAA AAA AAA AAA AAA AAA A 1902
4.FP1193
A: nucleotide sequence (SEQ ID NO:10) length: 2222 bases
1 GGGGCTCCGC TAGTCACCAA GGAGTCCCCC AAGCCTGACA AAGGGAAGGG CCCTCCCTGG
61 GCAGACTGTG GTAGTACCAC GGCCCAGTCC ACACCCCTAG TACCTGGCCC CACTGACCCA
121 AGTCAGGGCC CTGAGGGGCT GGCCCCACAC TCAGCCATCG AGGAGAAGGT GATGAAGGGC
181 ATTGAGGAGA ACGTGCTGCG GCTCCAGGGC CAGGAGCGAG CCCCTGGCGC CGAGGTCAAG
241 CACCGCAACA CCAGCAGCAT CGCCAGCTGG TTCGGCCTTA AGAAGAGCAA GCTGCCAGCG
301 CTGAACCGCC GCACAGAGGC CACCAAGAAC AAGGAGGGGG CTGGCGGGGG CTCCCCGCTC
361 CGGAGGGAAG TCAAGATGGA AGCCCGGAAG CTGGAGGCCG AGAGCCTCAA CATCTCCAAG
421 CTGATGGCCA AGGCGGAAGA CCTGCGTCGG GCACTGGAAG AGGAGAGGCC TACCTAAGCA
481 GCCGGGCCCG CCACGCCTGG TGGCCCAGCC CCAGGGCCCA ACACGGGGCT GGGGCAGGTG
541 CAGGGCCAGC TGGCTGGCAT GTACCAAGGT GCAGACACCT TCATGCAGCA GCTGCCTAAC
601 AGGGTGGATG GCAAGGAGCT GCCATCCAAG AGCTGGCGGG AGCCCAAGCC TGAGTACGGG
661 GATTTCCAGC CGGTGTCTTC TGACCCCAAG AGCCCCTGGC AGCCTGTGGG CCCCGGAATG
721 GCCTGGTGGG CCCTCTTCAG GGCTGCGGAA AACCTCCTGG AAAGCCCGAG CAGCGAGCCA
781 GGGAGGCGGG AAGAGATGCC CTCGGAGGAC AGCCTGGCCG AGCAGTGCCC ACCTCACACT
841 TCACAGCCTG TGGCTCCTTG ACTCGAACCT TGGACAGTGG CATTGGGACC TTCCCACCCC
901 CAGACCATGG TAGCAGTGGG ACCCCCAGCA AGAATCTTCC TAAGACCAAG CCACCGCGGC
961 TGGATCCCCC ACCTGGGTAC CCCCAGCTCG GCCCCCACCC CTTACCAAAG TCCCCCGCCG
1021 CGCCCACACA CTGGAGCGGG AGGTGCCAGG CATAGAGGAG CTGCTGGTGA GTGGGCGGCA
1081 CCCCAGCATG CCAGCCTTCC CTGCACTGCT ACCCGCTGCT CCGGGCCACC GGGGCCATGA
1141 GACCTGTCCT GATGATCCCT GTGAAGACCC AGGCCCCACC CCTCCTGTCC AGCTGGCCAA
1201 GAACTGGACC TTCCCCAATA CTAGGGCAGC CGGCAGCTCC TCGGACCCTC TCATGTGCCC
1261 ACCCCGACAA CTGGAGGGGC TGCCCAGGAC CCCCATGGCC CTGCCCGTGG ACCGGAAGCG
1321 AAGCCAGGAG CCCAGCCGCC CGTCCCCTAC GCCCCAGGGC CCACCTTTCG GGGGTAGCCG
1381 CACCCCCAGC ACTTCGGACA TGGCCGAGGA AGGCAGAGTG GCCAGCGGGG GCCCCCCAGG
1441 GCTGGAGACC TCGGAGTCTC TCAGTGACTC ACTCTACGAC TCGCTGTCCT CTTGTGGGAG
1501 TCAGGGCTGA GGGGCTGCGC CACGCCACGG CCCCGCTGGA GCTGGGGACC ACAGACTGGA
1561 CCGGCTCTCT TCATGCCCAG CCCCCGGAGA CGGGGACCCC TCCCTGAAGG GACCAAGGAG
1621 GCAGGTGGAT AAGAAGGTGA AAAGGGGGTC CCTGGCACAC CCCACCACCC ACTGCTTCGG
1681 CGGATGAGAT GACCGTGCTC AGCTCAGGGA GAGACCCCGC CCTTGGTCCC TTCTCTCACC
1741 CAGAGTAAGG CTCTTCCTGG AAGGGACTGG GGGTTAAAGG CCACTGTGTC GCAGCCCCCC
1801 AGTCCCTACT TCAGGCTGAG CCATCTTGTG GTGCTGGGCT TCCTGCCCAC CAGCCGTGCC
1861 ATCTCTGCCC AACCCGGCTG CTCCTCTGCC CCGAAGCCCT CGCGAGGCCC TCCTGGAGGC
1921 CCCCGTGCTG GTGGAGTTTG GGGGCCAGGG GGACAAGTTG CCTTCTCTCT CTGCCCTGGT
1981 CCTCCCTGCT GTCTGGATGG TGCTGCCCTC CTCTGCCCCA TGCCTTTGGG GTCTGTTCGT
2041 CCGTCTTTTT TGTTGTTGTT TTTATATATT GAAGCGCCTG GCCCAGGCCC CCAGGCCCCC
2101 AGGCCCCGCA CTGCGGTTAA TTTATGTGTT GTTTAAAATG CGGCTGCTCT GCTTCCTGCC
2161 TCTGCTTCTG CCGTATCCCT AATAAAATGT GGAGGCCCCC TCAAAAAAAA AAAAAAAAAA
2221 AA
B: nucleotide sequence (SEQ ID NO:11) length: 156 amino acid
1 MRPVLMIPVK TQAPPLLSSW PRTGPSPILG QPAAPRTLSC AHPDNWRGCP GPPWPCPWTG
61 SEARSPAARP LRPRAHLSGV AAPPALRTWP RKAEWPAGAP QGWRPRSLSV THSTTRCPLV
121 GVRAEGLRHA TAPLELGTTD WTGSLHAQPP ETGTPP
C. Nucleotide and amino acid composite sequence (SEQ ID NO:12) clone number: FP1193
Start code: 1137ATG stops coding: 1605TGA protein molecular weight: 16682.19
1 GG GGC TCC GCT AGT CAC CAA GGA GTC CCC CAA GCC TGA CAA AGG GAA 47
48 GGG CCC TCC CTG GGC AGA CTG TGG TAG TAC CAC GGC CCA GTC CAC ACC 95
96 CCT AGT ACC TGG CCC CAC TGA CCC AAG TCA GGG CCC TGA GGG GCT GGC 143
144 CCC ACA CTC AGC CAT CGA GGA GAA GGT GAT GAA GGG CAT TGA GGA GAA 191
192 CGT GCT GCG GCT CCA GGG CCA GGA GCG AGC CCC TGG CGC CGA GGT CAA 239
240 GCA CCG CAA CAC CAG CAG CAT CGC CAG CTG GTT CGG CCT TAA GAA GAG 287
288 CAA GCT GCC AGC GCT GAA CCG CCG CAC AGA GGC CAC CAA GAA CAA GGA 335
336 GGG GGC TGG CGG GGG CTC CCC GCT CCG GAG GGA AGT CAA GAT GGA AGC 383
384 CCG GAA GCT GGA GGC CGA GAG CCT CAA CAT CTC CAA GCT GAT GGC CAA 431
432 GGC GGA AGA CCT GCG TCG GGC ACT GGA AGA GGA GAG GCC TAC CTA AGC 479
480 AGC CGG GCC CGC CAC GCC TGG TGG CCC AGC CCC AGG GCC CAA CAC GGG 527
528 GCT GGG GCA GGT GCA GGG CCA GCT GGC TGG CAT GTA CCA AGG TGC AGA 575
576 CAC CTT CAT GCA GCA GCT GCC TAA CAG GGT GGA TGG CAA GGA GCT GCC 623
624 ATC CAA GAG CTG GCG GGA GCC CAA GCC TGA GTA CGG GGA TTT CCA GCC 671
672 GGT GTC TTC TGA CCC CAA GAG CCC CTG GCA GCC TGT GGG CCC CGG AAT 719
720 GGC CTG GTG GGC CCT CTT CAG GGC TGC GGA AAA CCT CCT GGA AAG CCC 767
768 GAG CAG CGA GCC AGG GAG GCG GGA AGA GAT GCC CTC GGA GGA CAG CCT 815
816 GGC CGA GCA GTG CCC ACC TCA CAC TTC ACA GCC TGT GGC TCC TTG ACT 863
864 CGA ACC TTG GAC AGT GGC ATT GGG ACC TTC CCA CCC CCA GAC CAT GGT 911
912 AGC AGT GGG ACC CCC AGC AAG AAT CTT CCT AAG ACC AAG CCA CCG CGG 959
960 CTG GAT CCC CCA CCT GGG TAC CCC CAG CTC GGC CCC CAC CCC TTA CCA 1007
1008 AAG TCC CCC GCC GCG CCC ACA CAC TGG AGC GGG AGG TGC CAG GCA TAG 1055
1056 AGG AGC TGC TGG TGA GTG GGC GGC ACC CCA GCA TGC CAG CCT TCC CTG 1103
1104 CAC TGC TAC CCG CTG CTC CGG GCC ACC GGG GCC ATG AGA CCT GTC CTG 1151
1 Met Arg Pro Val Leu 5
1152 ATG ATC CCT GTG AAG ACC CAG GCC CCA CCC CTC CTG TCC AGC TGG CCA 1199
6 Met Ile Pro Val Lys Thr Gln Ala Pro Pro Leu Leu Ser Ser Trp Pro 21
1200 AGA ACT GGA CCT TCC CCA ATA CTA GGG CAG CCG GCA GCT CCT CGG ACC 1247
22 Arg Thr Gly Pro Ser Pro Ile Leu Gly Gln Pro Ala Ala Pro Arg Thr 37
1248 CTC TCA TGT GCC CAC CCC GAC AAC TGG AGG GGC TGC CCA GGA CCC CCA 1295
38 Leu Ser Cys Ala His Pro Asp Asn Trp Arg Gly Cys Pro Gly Pro Pro 53
1296 TGG CCC TGC CCG TGG ACC GGA AGC GAA GCC AGG AGC CCA GCC GCC CGT 1343
54 Trp Pro Cys Pro Trp Thr Gly Ser Glu Ala Arg Ser Pro Ala Ala Arg 69
1344 CCC CTA CGC CCC AGG GCC CAC CTT TCG GGG GTA GCC GCA CCC CCA GCA 1391
70 Pro Leu Arg Pro Arg Ala His Leu Ser Gly Val Ala Ala Pro Pro Ala 85
1392 CTT CGG ACA TGG CCG AGG AAG GCA GAG TGG CCA GCG GGG GCC CCC CAG 1439
86 Leu Arg Thr Trp Pro Arg Lys Ala Glu Trp Pro Ala Gly Ala Pro Gln 101
1440 GGC TGG AGA CCT CGG AGT CTC TCA GTG ACT CAC TCT ACG ACT CGC TGT 1487
102 Gly Trp Arg Pro Arg Ser Leu Ser Val Thr His Ser Thr Thr Arg Cys 117
1488 CCT CTT GTG GGA GTC AGG GCT GAG GGG CTG CGC CAC GCC ACG GCC CCG 1535
118 Pro Leu Val Gly Val Arg Ala Glu Gly Leu Arg His Ala Thr Ala Pro 133
1536 CTG GAG CTG GGG ACC ACA GAC TGG ACC GGC TCT CTT CAT GCC CAG CCC 1583
134 Leu Glu Leu Gly Thr Thr Asp Trp Thr Gly Ser Leu His Ala Gln Pro 149
1584 CCG GAG ACG GGG ACC CCT CCC TGA AGG GAC CAA GGA GGC AGG TGG ATA 1631
150 Pro Glu Thr Gly Thr Pro Pro *** 157
1632 AGA AGG TGA AAA GGG GGT CCC TGG CAC ACC CCA CCA CCC ACT GCT TCG 1679
1680 GCG GAT GAG ATG ACC GTG CTC AGC TCA GGG AGA GAC CCC GCC CTT GGT 1727
1728 CCC TTC TCT CAC CCA GAG TAA GGC TCT TCC TGG AAG GGA CTG GGG GTT 1775
1776 AAA GGC CAC TGT GTC GCA GCC CCC CAG TCC CTA CTT CAG GCT GAG CCA 1823
1824 TCT TGT GGT GCT GGG CTT CCT GCC CAC CAG CCG TGC CAT CTC TGC CCA 1871
1872 ACC CGG CTG CTC CTC TGC CCC GAA GCC CTC GCG AGG CCC TCC TGG AGG 1919
1920 CCC CCG TGC TGG TGG AGT TTG GGG GCC AGG GGG ACA AGT TGC CTT CTC 1967
1968 TCT CTG CCC TGG TCC TCC CTG CTG TCT GGA TGG TGC TGC CCT CCT CTG 2015
2016 CCC CAT GCC TTT GGG GTC TGT TCG TCC GTC TTT TTT GTT GTT GTT TTT 2063
2064 ATA TAT TGA AGC GCC TGG CCC AGG CCC CCA GGC CCC CAG GCC CCG CAC 2111
2112 TGC GGT TAA TTT ATG TGT TGT TTA AAA TGC GGC TGC TCT GCT TCC TGC 2159
2160 CTC TGC TTC TGC CGT ATC CCT AAT AAA ATG TGG AGG CCC CCT CAA AAA 2207
2208 AAA AAA AAA AAA AAA 2222
5.FP1477
A: nucleotide sequence (SEQ ID NO:13) length: 3502 bases
1 GCCAGCCATC AGCCTCTCTT TCAAGGAGTC CGTGGCCTTC AACTTTGGCA GCCGTCCTCT
61 GCGCTACCCA GTGGCAGGCT ACCGGCCCCT GCAGGACCCA CCGAGTGCTG ACCTGGTGCG
121 GGCACAGAGG TTGCTGGGCT GCTTCCGGGC AGTGCTGAGT GTGGAGCTGG ACCCTGTGGA
181 GGGGCGGCTG TTGGACAAGG AGAGCTCCAA GTGGCGGTTG CGGGGCCAGC CCACCGTCCT
241 CCTCACACTG GCCCACATCT TCCATCACTT TGCACCGCTT CTGCGCAAGG TGTATCTGGT
301 GGAGGCTGTG CTCATGAGCT TCTTGCTGGG CATCGTGGAG AAGGGCACAC CCACACAGGC
361 ACAGTCCGTG GTGCACCAGG TCCTGGACCT CTTGTGGCTC TTCATGGAGG ACTACGAGGT
421 ACAAGATTGC CTCAAGCAGT TGATGATGTC TCTGCTTCGG CTGTACCGAT TCTCACCCAT
481 TGTCCCAGAC CTGGGCCTAC AGATCCATTA CCTGCGGCTC ACTATCGCCA TCCTGAGGCA
541 TGAGAAGTCC CGCAAGTTTC TGCTTAGCAA TGTCCTCTTC GACGTGCTCC GCTCCGTTGT
601 CTTCTTTTAC ATCAAGAGCC CCCTGCGTGT GGAGGAGGCC GGCCTGCAGG AGCTCATTCC
661 CACCACCTGG TGGCCCCACT GCTCCAGTAG GGAGGGCAAA GAGAGCACGG AGATGAAGGA
721 GGAGACCGCA GAGGAGCGGC TGCGGCGGCG AGCCTACGAA CGGGGCTGTC AGCGGCTCAG
781 AAGCGCATCG AAGTGGTGGA AGAACTACAG GTCCAGATCC TGAAGCTGCT GCTGGACAAT
841 AAAGATGACA ATGGGGGTGA AGCTTCTAGG TATATCTTCC TGACCAAGTT TCGCAAGTTT
901 CTGCAGGAGA ACGCCAGTGG CCGGGGGAAC ATGCCCATGC TCTGCCCCCC TGAGTACATG
961 GTCTGCTTCT TACACCGGCT GATCTCTGCC CTGCGCTACT ATTGGGATGA ATACAAGGCT
1021 TCCAATCCTC ATGCTTCCTT CAGTGAGGAG GCCTACATCC CGCCCCAGGT CTTCTATAAT
1081 GGCAAGGTGG ACTACTTTGA CCTGCAGCGC CTGGGGGGCC TCCTCTCGCA CCTGCGGAAG
1141 ACCCTCAAAG ATGACCTTGC TTCCAAAGCC AACATTGTGA TCGACCCACT GGAGCTCCAG
1201 TCAACCGCCA TGGATGACCT AGATGAGGAT GAGGAGCCAG CCCCAGCTAT GGCCCAGCGC
1261 CCCATGCAGG CCCTGGCTGT TGGGGGGCCA CTGCCCCTGC CCCGGCCCGG CTGGCTCAGT
1321 TCTCCAACTT TGGGCCGAGC CAACCGCTTC CTCAGCACAG CGGCTGTGAG CCTCATGACC
1381 CCACGGCGGC CTCTGAGCAC CTCGGAGAAA GTGAAGGTCC GCACGCTGAG CGTGGAGCAG
1441 AGGACCCGTG AGGACATTGA AGGCAGCCAC TGGAATGAGG GCTTGCTGCT GGGGCGGCCC
1501 CCCGAGGAGC CTGAGCAGCC CCTCACCGAG AACTCGCTGC TGGAAGTCCT GGATGGGGCG
1561 GTCATGATGT ACAACCTCAG CGTACACCAG CAGCTGGGCA AGATGGTGGG TGTCTCCGAT
1621 GATGTCAATG AATACGCTAT GGCTCTGAGG GACACAGAGG ACAAGCTCCG CCGGTGCCCC
1681 AAGAGGAGGA AGGACATCCT TGCAGAGTTG ACCAAGAGCC AGAAGGTTTT CTCAGAAAAG
1741 CTGGACCACC TGAGCCGCCG TCTTGCCTGG GTCCATGCCA CTGTCTACTC CCAGGAGAAG
1801 ATGCTGGACA TCTACTGGCT GCTGCGCGTC TGCCTGCGGA CCATTGAGCA CGGTGATCGC
1861 ACAGGGTCTC TCTTTGCCTT CATGCCCGAG TTCTACCTGA GCGTGGCCAT CAACAGCTAC
1921 AGTGCTCTCA AGAATTACTT TGGTCCCGTG CACAGCATGG AGGAGCTCCC AGGCTATGAA
1981 GAGACCCTGA CCCGCCTGGC TGCCATTCTC GCCAAACACT TTGCCGACGC ACGCATTGTG
2041 GGCACTGACA TCCGAGACTC ACTGATGCAG GCCCTGGCCA GCTACGTGTG CTACCCACAC
2101 TCCCTGCGGG CTGTGGAGCG AATCCCCGAG GAGCAGCGTA TCGCCATGGT GAGGAACCTC
2161 CTGGCGCCCT ATGAGCAGCG GCCCTGGGCC CAGACCAACT GGATCCTGGT GCGGCTCTGG
2221 AGGGGCTGTG GCTTCGGGTA CCGCTATACA CGGCTGCCAC ATCTGCTGAA AACCAAACTT
2281 GAGGATGCCA ATTTGCCCAG CCTCCAGAAG CCCTGCCCTT CCACCCTGCT GCAGCAGCAC
2341 ATGGCGGACC TCCTACAGCA GGGTCCTGAT GTGGCACCCA GCTTCCTCAA CAGCGTCCTC
2401 AATCAGCTCA ACTGGGCCTT CTCTGAATTC ATTGGCATGA TCCAAGAGAT CCAGCAGGCT
2461 GCTGAGCGCC TGGAGCGGAA CTTTGTGGAC AGCCGGCAGC TCAAGGTATG TGCCACCTGC
2521 TTTGACCTCT CGGTCAGCCT GCTGCGTGTC TTGGAGATGA CTATCACACT GGTGCCTGAG
2581 ATATTCCTTG ACTGGACCCG GCCTACCTCT GAGATGCTGC TGCGGCGTCT TGCACAGCTG
2641 CTAAACCAGG TGCTGAACCG GGTGACAGCT GAGAGGAACC TGTTTGATCG TGTGGTCACC
2701 CTACGGCTGC CTGGCCTAGA GAGCGTGGAC CACTATCCCA TTCTGGTGGC AGTGACGGGC
2761 ATCCTGGTGC AGCTCCTGGT GCGTGGCCCA GCCTCAGAGA GAGAGCAAGC CACATCAGTG
2821 CTCCTGGCAG ATCCCTGCTT CCAGCTACGC TCAATATGCT ATCTCCTGGG ACAGCCAGAG
2881 CCCCCAGCAC CTGGCACTGC TCTGCCAGCC CCTGACCGGA AGCGCTTCTC CCTGCAGAGC
2941 TATGCGGATT ATATCAGTGC CGATGAGCTG GCCCAAGTGG AACAGATGCT GGCGCACCTG
3001 ACCTCTGCAT CTGCCCAGGC AGCAGCTGCC TCCCTGCCCA CCAGTGAGGA GGACCTCTGC
3061 CCCATTTGCT ATGCCCACCC CATCTCTGCT GTGTTCCAGC CCTGTGGCCA CAAGTCCTGC
3121 AAAGCCTGTA TCAACCAGCA CCTGATGAAC AACAAGGACT GCTTCTTCTG CAAAACCACC
3181 ATCGTGTCTG TAGAGGACTG GGAGAAGGGA GCCAATACGA GTACTACCTC CTCAGCTGCC
3241 TAGCCCTCAC AGCCTGTGCC ATCCTGGAGC CTCCACCTTT GAACCCAGAG CCAGGCTGGG
3301 CCCTATTTAT GAGCTCCCTT TGCCCTTCTC CTGTATCCCA CACCACCACA TCCAACCTCC
3361 TTGCCTGCCT GTATCCTCAT TGGTGGGAGG CCCAGCCATG GCCCTAATTG TGCCTGAGCT
3421 TGACTTTCAG TCAGGGCCAC AGTGAGCATT AAATTATTAT TCCATACAAA AAAAAAAAAA
3481 AAAAAAAAAA AAAAAAAAAA AA
B: nucleotide sequence (SEQ ID NO:14) length: 770 amino acid
1 MPMLCPPEYM VCFLHRLISA LRYYWDEYKA SNPHASFSEE AYIPPQVFYN GKVDYFDLQR
61 LGGLLSHLRK TLKDDLASKA NIVIDPLELQ STAMDDLDED EEPAPAMAQR PMQALAVGGP
121 LPLPRPGWLS SPTLGRANRF LSTAAVSLMT PRRPLSTSEK VKVRTLSVEQ RTREDIEGSH
181 WNEGLLLGRP PEEPEQPLTE NSLLEVLDGA VMMYNLSVHQ QLGKMVGVSD DVNEYAMALR
241 DTEDKLRRCP KRRKDILAEL TKSQKVFSEK LDHLSRRLAW VHATVYSQEK MLDIYWLLRV
301 CLRTIEHGDR TGSLFAFMPE FYLSVAINSY SALKNYFGPV HSMEELPGYE ETLTRLAAIL
361 AKHFADARIV GTDIRDSLMQ ALASYVCYPH SLRAVERIPE EQRIAMVRNL LAPYEQRPWA
421 QTNWILVRLW RGCGFGYRYT RLPHLLKTKL EDANLPSLQK PCPSTLLQQH MADLLQQGPD
481 VAPSFLNSVL NQLNWAFSEF IGMIQEIQQA AERLERNFVD SRQLKVCATC FDLSVSLLRV
541 LEMTITLVPE IFLDWTRPTS EMLLRRLAQL LNQVLNRVTA ERNLFDRVVT LRLPGLESVD
601 HYPILVAVTG ILVQLLVRGP ASEREQATSV LLADPCFQLR SICYLLGQPE PPAPGTALPA
661 PDRKRFSLQS YADYISADEL AQVEQMLAHL TSASAQAAAA SLPTSEEDLC PICYAHPISA
721 VFQPCGHKSC KACINQHLMN NKDCFFCKTT IVSVEDWEKG ANTSTTSSAA
C. Nucleotide and amino acid composite sequence (SEQ ID NO:15) clone number: FP1477
Start code: 931ATG stops coding: 3241TAG protein molecular weight: 86888.27
1 GCC AGC CAT CAG CCT CTC TTT CAA GGA GTC CGT GGC CTT CAA CTT TGG 48
49 CAG CCG TCC TCT GCG CTA CCC AGT GGC AGG CTA CCG GCC CCT GCA GGA 96
97 CCC ACC GAG TGC TGA CCT GGT GCG GGC ACA GAG GTT GCT GGG CTG CTT 144
145 CCG GGC AGT GCT GAG TGT GGA GCT GGA CCC TGT GGA GGG GCG GCT GTT 192
193 GGA CAA GGA GAG CTC CAA GTG GCG GTT GCG GGG CCA GCC CAC CGT CCT 240
241 CCT CAC ACT GGC CCA CAT CTT CCA TCA CTT TGC ACC GCT TCT GCG CAA 288
289 GGT GTA TCT GGT GGA GGC TGT GCT CAT GAG CTT CTT GCT GGG CAT CGT 336
337 GGA GAA GGG CAC ACC CAC ACA GGC ACA GTC CGT GGT GCA CCA GGT CCT 384
385 GGA CCT CTT GTG GCT CTT CAT GGA GGA CTA CGA GGT ACA AGA TTG CCT 432
433 CAA GCA GTT GAT GAT GTC TCT GCT TCG GCT GTA CCG ATT CTC ACC CAT 480
481 TGT CCC AGA CCT GGG CCT ACA GAT CCA TTA CCT GCG GCT CAC TAT CGC 528
529 CAT CCT GAG GCA TGA GAA GTC CCG CAA GTT TCT GCT TAG CAA TGT CCT 576
577 CTT CGA CGT GCT CCG CTC CGT TGT CTT CTT TTA CAT CAA GAG CCC CCT 624
625 GCG TGT GGA GGA GGC CGG CCT GCA GGA GCT CAT TCC CAC CAC CTG GTG 672
673 GCC CCA CTG CTC CAG TAG GGA GGG CAA AGA GAG CAC GGA GAT GAA GGA 720
721 GGA GAC CGC AGA GGA GCG GCT GCG GCG GCG AGC CTA CGA ACG GGG CTG 768
769 TCA GCG GCT CAG AAG CGC ATC GAA GTG GTG GAA GAA CTA CAG GTC CAG 816
817 ATC CTG AAG CTG CTG CTG GAC AAT AAA GAT GAC AAT GGG GGT GAA GCT 864
865 TCT AGG TAT ATC TTC CTG ACC AAG TTT CGC AAG TTT CTG CAG GAG AAC 912
913 GCC AGT GGC CGG GGG AACATG CCC ATG CTC TGC CCC CCT GAG TAC ATG 960
1 Met Pro Met Leu Cys Pro Pro Glu Tyr Met 10
961 GTC TGC TTC TTA CAC CGG CTG ATC TCT GCC CTG CGC TAC TAT TGG GAT 1008
11 Val Cys Phe Leu His Arg Leu Ile Ser Ala Leu Arg Tyr Tyr Trp Asp 26
1009 GAA TAC AAG GCT TCC AAT CCT CAT GCT TCC TTC AGT GAG GAG GCC TAC 1056
27 Glu Tyr Lys Ala Ser Asn Pro His Ala Ser Phe Ser Glu Glu Ala Tyr 42
1057 ATC CCG CCC CAG GTC TTC TAT AAT GGC AAG GTG GAC TAC TTT GAC CTG 1104
43 Ile Pro Pro Gln Val Phe Tyr Asn Gly Lys Val Asp Tyr Phe Asp Leu 58
1105 CAG CGC CTG GGG GGC CTC CTC TCG CAC CTG CGG AAG ACC CTC AAA GAT 1152
59 Gln Arg Leu Gly Gly Leu Leu Ser His Leu Arg Lys Thr Leu Lys Asp 74
1153 GAC CTT GCT TCC AAA GCC AAC ATT GTG ATC GAC CCA CTG GAG CTC CAG 1200
75 Asp Leu Ala Ser Lys Ala Asn Ile Val Ile Asp Pro Leu Glu Leu Gln 90
1201 TCA ACC GCC ATG GAT GAC CTA GAT GAG GAT GAG GAG CCA GCC CCA GCT 1248
91 Ser Thr Ala Met Asp Asp Leu Asp Glu Asp Glu Glu Pro Ala Pro Ala 106
1249 ATG GCC CAG CGC CCC ATG CAG GCC CTG GCT GTT GGG GGG CCA CTG CCC 1296
107 Met Ala Gln Arg Pro Met Gln Ala Leu Ala Val Gly Gly Pro Leu Pro 122
1297 CTG CCC CGG CCC GGC TGG CTC AGT TCT CCA ACT TTG GGC CGA GCC AAC 1344
123 Leu Pro Arg Pro Gly Trp Leu Ser Ser Pro Thr Leu Gly Arg Ala Asn 138
1345 CGC TTC CTC AGC ACA GCG GCT GTG AGC CTC ATG ACC CCA CGG CGG CCT 1392
139 Arg Phe Leu Ser Thr Ala Ala Val Ser Leu Met Thr Pro Arg Arg Pro 154
1393 CTG AGC ACC TCG GAG AAA GTG AAG GTC CGC ACG CTG AGC GTG GAG CAG 1440
155 Leu Ser Thr Ser Glu Lys Val Lys Val Arg Thr Leu Ser Val Glu Gln 170
1441 AGG ACC CGT GAG GAC ATT GAA GGC AGC CAC TGG AAT GAG GGC TTG CTG 1488
171 Arg Thr Arg Glu Asp Ile Glu Gly Ser His Trp Asn Glu Gly Leu Leu 186
1489 CTG GGG CGG CCC CCC GAG GAG CCT GAG CAG CCC CTC ACC GAG AAC TCG 1536
187 Leu Gly Arg Pro Pro Glu Glu Pro Glu Gln Pro Leu Thr Glu Asn Ser 202
1537 CTG CTG GAA GTC CTG GAT GGG GCG GTC ATG ATG TAC AAC CTC AGC GTA 1584
203 Leu Leu Glu Val Leu Asp Gly Ala Val Met Met Tyr Asn Leu Ser Val 218
1585 CAC CAG CAG CTG GGC AAG ATG GTG GGT GTC TCC GAT GAT GTC AAT GAA 1632
219 His Gln Gln Leu Gly Lys Met Val Gly Val Ser Asp Asp Val Asn Glu 234
1633 TAC GCT ATG GCT CTG AGG GAC ACA GAG GAC AAG CTC CGC CGG TGC CCC 1680
235 Tyr Ala Met Ala Leu Arg Asp Thr Glu Asp Lys Leu Arg Arg Cys Pro 250
1681 AAG AGG AGG AAG GAC ATC CTT GCA GAG TTG ACC AAG AGC CAG AAG GTT 1728
251 Lys Arg Arg Lys Asp Ile Leu Ala Glu Leu Thr Lys Ser Gln Lys Val 266
1729 TTC TCA GAA AAG CTG GAC CAC CTG AGC CGC CGT CTT GCC TGG GTC CAT 1776
267 Phe Ser Glu Lys Leu Asp His Leu Ser Arg Arg Leu Ala Trp Val His 282
1777 GCC ACT GTC TAC TCC CAG GAG AAG ATG CTG GAC ATC TAC TGG CTG CTG 1824
283 Ala Thr Val Tyr Ser Gln Glu Lys Met Leu Asp Ile Tyr Trp Leu Leu 298
1825 CGC GTC TGC CTG CGG ACC ATT GAG CAC GGT GAT CGC ACA GGG TCT CTC 1872
299 Arg Val Cys Leu Arg Thr Ile Glu His Gly Asp Arg Thr Gly Ser Leu 314
1873 TTT GCC TTC ATG CCC GAG TTC TAC CTG AGC GTG GCC ATC AAC AGC TAC 1920
315 Phe Ala Phe Met Pro Glu Phe Tyr Leu Ser Val Ala Ile Asn Ser Tyr 330
1921 AGT GCT CTC AAG AAT TAC TTT GGT CCC GTG CAC AGC ATG GAG GAG CTC 1968
331 Ser Ala Leu Lys Asn Tyr Phe Gly Pro Val His Ser Met Glu Glu Leu 346
1969 CCA GGC TAT GAA GAG ACC CTG ACC CGC CTG GCT GCC ATT CTC GCC AAA 2016
347 Pro Gly Tyr Glu Glu Thr Leu Thr Arg Leu Ala Ala Ile Leu Ala Lys 362
2017 CAC TTT GCC GAC GCA CGC ATT GTG GGC ACT GAC ATC CGA GAC TCA CTG 2064
363 His Phe Ala Asp Ala Arg Ile Val Gly Thr Asp Ile Arg Asp Ser Leu 378
2065 ATG CAG GCC CTG GCC AGC TAC GTG TGC TAC CCA CAC TCC CTG CGG GCT 2112
379 Met Gln Ala Leu Ala Ser Tyr Val Cys Tyr Pro His Ser Leu Arg Ala 394
2113 GTG GAG CGA ATC CCC GAG GAG CAG CGT ATC GCC ATG GTG AGG AAC CTC 2160
395 Val Glu Arg Ile Pro Glu Glu Gln Arg Ile Ala Met Val Arg Asn Leu 410
2161 CTG GCG CCC TAT GAG CAG CGG CCC TGG GCC CAG ACC AAC TGG ATC CTG 2208
411 Leu Ala Pro Tyr Glu Gln Arg Pro Trp Ala Gln Thr Asn Trp Ile Leu 426
2209 GTG CGG CTC TGG AGG GGC TGT GGC TTC GGG TAC CGC TAT ACA CGG CTG 2256
427 Val Arg Leu Trp Arg Gly Cys Gly Phe Gly Tyr Arg Tyr Thr Arg Leu 442
2257 CCA CAT CTG CTG AAA ACC AAA CTT GAG GAT GCC AAT TTG CCC AGC CTC 2304
443 Pro His Leu Leu Lys Thr Lys Leu Glu Asp Ala Asn Leu Pro Ser Leu 458
2305 CAG AAG CCC TGC CCT TCC ACC CTG CTG CAG CAG CAC ATG GCG GAC CTC 2352
459 Gln Lys Pro Cys Pro Ser Thr Leu Leu Gln Gln His Met Ala Asp Leu 474
2353 CTA CAG CAG GGT CCT GAT GTG GCA CCC AGC TTC CTC AAC AGC GTC CTC 2400
475 Leu Gln Gln Gly Pro Asp Val Ala Pro Ser Phe Leu Asn Ser Val Leu 490
2401 AAT CAG CTC AAC TGG GCC TTC TCT GAA TTC ATT GGC ATG ATC CAA GAG 2448
491 Asn Gln Leu Asn Trp Ala Phe Ser Glu Phe Ile Gly Met Ile Gln Glu 506
2449 ATC CAG CAG GCT GCT GAG CGC CTG GAG CGG AAC TTT GTG GAC AGC CGG 2496
507 Ile Gln Gln Ala Ala Glu Arg Leu Glu Arg Asn Phe Val Asp Ser Arg 522
2497 CAG CTC AAG GTA TGT GCC ACC TGC TTT GAC CTC TCG GTC AGC CTG CTG 2544
523 Gln Leu Lys Val Cys Ala Thr Cys Phe Asp Leu Ser Val Ser Leu Leu 538
2545 CGT GTC TTG GAG ATG ACT ATC ACA CTG GTG CCT GAG ATA TTC CTT GAC 2592
539 Arg Val Leu Glu Met Thr Ile Thr Leu Val Pro Glu Ile Phe Leu Asp 554
2593 TGG ACC CGG CCT ACC TCT GAG ATG CTG CTG CGG CGT CTT GCA CAG CTG 2640
555 Trp Thr Arg Pro Thr Ser Glu Met Leu Leu Arg Arg Leu Ala Gln Leu 570
2641 CTA AAC CAG GTG CTG AAC CGG GTG ACA GCT GAG AGG AAC CTG TTT GAT 2688
571 Leu Asn Gln Val Leu Asn Arg Val Thr Ala Glu Arg Asn Leu Phe Asp 586
2689 CGT GTG GTC ACC CTA CGG CTG CCT GGC CTA GAG AGC GTG GAC CAC TAT 2736
587 Arg Val Val Thr Leu Arg Leu Pro Gly Leu Glu Ser Val Asp His Tyr 602
2737 CCC ATT CTG GTG GCA GTG ACG GGC ATC CTG GTG CAG CTC CTG GTG CGT 2784
603 Pro Ile Leu Val Ala Val Thr Gly Ile Leu Val Gln Leu Leu Val Arg 618
2785 GGC CCA GCC TCA GAG AGA GAG CAA GCC ACA TCA GTG CTC CTG GCA GAT 2832
619 Gly Pro Ala Ser Glu Arg Glu Gln Ala Thr Ser Val Leu Leu Ala Asp 634
2833 CCC TGC TTC CAG CTA CGC TCA ATA TGC TAT CTC CTG GGA CAG CCA GAG 2880
635 Pro Cys Phe Gln Leu Arg Ser Ile Cys Tyr Leu Leu Gly Gln Pro Glu 650
2881 CCC CCA GCA CCT GGC ACT GCT CTG CCA GCC CCT GAC CGG AAG CGC TTC 2928
651 Pro Pro Ala Pro Gly Thr Ala Leu Pro Ala Pro Asp Arg Lys Arg Phe 666
2929 TCC CTG CAG AGC TAT GCG GAT TAT ATC AGT GCC GAT GAG CTG GCC CAA 2976
667 Ser Leu Gln Ser Tyr Ala Asp Tyr Ile Ser Ala Asp Glu Leu Ala Gln 682
2977 GTG GAA CAG ATG CTG GCG CAC CTG ACC TCT GCA TCT GCC CAG GCA GCA 3024
683 Val Glu Gln Met Leu Ala His Leu Thr Ser Ala Ser Ala Gln Ala Ala 698
3025 GCT GCC TCC CTG CCC ACC AGT GAG GAG GAC CTC TGC CCC ATT TGC TAT 3072
699 Ala Ala Ser Leu Pro Thr Ser Glu Glu Asp Leu Cys Pro Ile Cys Tyr 714
3073 GCC CAC CCC ATC TCT GCT GTG TTC CAG CCC TGT GGC CAC AAG TCC TGC 3120
715 Ala His Pro Ile Ser Ala Val Phe Gln Pro Cys Gly His Lys Ser Cys 730
3121 AAA GCC TGT ATC AAC CAG CAC CTG ATG AAC AAC AAG GAC TGC TTC TTC 3168
731 Lys Ala Cys Ile Asn Gln His Leu Met Asn Asn Lys Asp Cys Phe Phe 746
3169 TGC AAA ACC ACC ATC GTG TCT GTA GAG GAC TGG GAG AAG GGA GCC AAT 3216
747 Cys Lys Thr Thr Ile Val Ser Val Glu Asp Trp Glu Lys Gly Ala Asn 762
3217 ACG AGT ACT ACC TCC TCA GCT GCC TAG CCC TCA CAG CCT GTG CCA TCC 3264
763 Thr Ser Thr Thr Ser Ser Ala Ala *** 771
3265 TGG AGC CTC CAC CTT TGA ACC CAG AGC CAG GCT GGG CCC TAT TTA TGA 3312
3313 GCT CCC TTT GCC CTT CTC CTG TAT CCC ACA CCA CCA CAT CCA ACC TCC 3360
3361 TTG CCT GCC TGT ATC CTC ATT GGT GGG AGG CCC AGC CAT GGC CCT AAT 3408
3409 TGT GCC TGA GCT TGA CTT TCA GTC AGG GCC ACA GTG AGC ATT AAA TTA 3456
3457 TTA TTC CAT ACA AAA AAA AAA AAA AAA AAA AAA AAA AAA AAA AAA A 3502
6.FP1572
A: nucleotide sequence (SEQ ID NO:16) length: 2445 bases
1 GCACAGGCTC CCCCAGATGC CCTGGCCAGG CCAGGCATGT GCCCACTTGC AAGTCTTTAT
61 CCTTCTCTTA GGTTCCCTCT GGCATGCACT TTTCTCCTCC ACCTGCCCAA TCCCAGACAT
121 CCTCAGAGTC TAGCATCTGG TGACACTTCT GTGACACTTT CCCTGATCCC CAGCCAGAGG
181 GAACTGCTCC CCGCTCCTGA GCTCCCTGAG CCCCTTTCCA CACAGAACAC TTAGACTGTT
241 GTATTGCCCC TGGTTGTGTT GGGCCCATGA CCCTCACTGG ACTGTGAGCT CCTTGAGGAC
301 AGGGGCACCC ACTTTTTCAT CTTGGGGAAT CAGAGCCCAC ATGGTGCTGC GTGCGCTGTT
361 GAGGCCTCTG TGCAGCCCGG GCCCATGTCT GCTGGGATGG GGGGAAAGGT AGGGAGGTAG
421 GTGGGGAGGA GGCAGGGGGT CTGATGGAGT AGAATACAAC AGGCAGCTTT CACAGTGGAT
481 TGAAGCTGTG CTTTCAGTTC AGCATATGCT ACTGAGCCCC TGCCATGCCA GCCCCTGGGT
541 CAGCACTGGA GAAGATGCCC TGTCCCTTCC TCATGGGGCA GCAAGGAAAG GGGATGCTTT
601 GCAGATGAAG ACAGATCCTC CTGGGGAGGC GACGTGTGGC ACATATGCAG TTAGACAGGG
661 ACAGATTGGG CCCAGAAGCC AGCTGTCCTG TTTCCAGCAG TCACCAGGAC CCTTGACCCA
721 GGGCCTGGTC TCCATGGGTG GGGCAAAGCC CTGCTGACTT TATTGTGGGG CCGGGGGGGG
781 GGGGGGCGGC GCTGGTTTCC AGGGCAGCGA AGGGATTCGA GGCCCATCAG GCCTGCCTGG
841 CTCCCCTGGG CCACCGGGAC CTCCTGGGAT TCAGGGCCCC GCCGGTCTGG ATGGTTTGGA
901 TGGGAAGGAC GGCAAGCCTG GCTTGAGGGG GGACCCTGGT CCTGCTGGCC CCCCTGGACT
961 CATGGGACCA CCGGGCTTTA AGGGGAAAAC AGGACATCCT GGCCTCCCAG GACCTAAGGG
1021 TGACTGTGGC AAACCAGGTC CCCCTGGCAG CACTGGCCGG CCTGGCGCAG AGGGTGAACC
1081 TGGTGCCATG GGACCCCAGG GAAGACCCGG TCCCCCGGGA CACGTTGGGC CACCAGGGCC
1141 TCCAGGCCAG CCAGGACCAG CTGGGATCTC TGCAGTGGGT CTGAAAGGAG ACCGAGGAGC
1201 CACCGGAGAA AGGGGCCTTG CAGGCCTCCC AGGCCAGCCC GGCCCCCCTG GACACCCTGG
1261 CCCCCCAGGC GAACCTGGTA CGGATGGTGC AGCTGGCAAA GAGGGACCCC CTGGAAAGCA
1321 GGGATTCTAT GGACCTCCTG GTCCCAAGGT GAGTGGGCAC TGGCTGGGCT TGAGAGGAGA
1381 AGCTCACGAG TGGATGAAGC CCAGCTGCAA GTTACGGCTT TAGCCACAGA GAGGAGACAC
1441 AGGAAGGCCA GTGGGCAGGG GGCAAAAATG CAGAACTTGG AGCAGGATAG TGTCTGAAAA
1501 TTTCTCACAT GGGGTCATTA GCTGTGGTTG GGGGGAAGGA AATCTCCAGT TCATTTACTC
1561 ACTGATTGAC TGATTGATTC ATTCATTGTT TCCATTTCCC GAGCACTCCC TTGCTCCTGC
1621 TCACATTCCA GGCCCTTGTC CTGGCCTCAG GGTGATCCAG GAGCTGCAGG ACAGAAGGGC
1681 CAGGCAGGAG AGAAGGGGAG AGCCGGCATG CCTGGTGGAC CTGGCAAGAG TGGTTCCATG
1741 GGGCCTGTTG GGCCACCGGG CCCTGCAGGA GAGAGAGGCC ACCCTGGAGC TCCGGGGCCT
1801 TCGGGGAGCC CTGGCTTGCC TGGTGTGCCT GGCTCCATGG GAGACATGGT GAATTATGAT
1861 GAAATCAAGA GGTTCATCAG ACAAGAGATC ATTAAAATGT TTGATGAGAG AATGGCTTAC
1921 TACACCTCCA GGATGCAGTT CCCCATGGAG ATGGCGGCAG CTCCGGGACG ACCAGGGCCT
1981 CCAGGGAAGG ATGGTGCTCC GGGCAGGCCA GGTGCTCCAG GGTCACCTGG GCTCCCTGGT
2041 CAGATTGGCA GAGAAGGACG GCAGGGCTTG CCAGGAGTAA GAGGATTGCC TGGTACCAAA
2101 GGTGAAAAAG GGGACATTGG TATTGGCATT GCAGGAGAAA ATGGTCTTCC CGGCCCCCCA
2161 GGTCCTCAAG GTCCTCCAGG CTATGGCAAG ATGGGTGCAA CAGGACCAAT GGGCCAGCAA
2221 GGCATCCCTG GCATCCCTGG GCCCCCGGGT CCCATGGGCC AGCCAGGCAA GGCTGGCCAC
2281 TGTAATCCCT CTGACTGCTT TGGGGCCATG CCGATGGAGC AGCAGTACCC ACCCATGAAA
2341 ACCATGAAGG GGCCTTTTGG CTGAAATTCC CCACCTGCCT TTGGATGAAA GACTCCGTTG
2401 GGAATAAATG GCCAAAGCTT ATAGGACAAA AAAAAAAAAA AAAAA
B: nucleotide sequence (SEQ ID NO:17) length: 218 amino acid
1 MPGGPGKSGS MGPVGPPGPA GERGHPGAPG PSGSPGLPGV PGSMGDMVNY DEIKRFIRQE
61 IIKMFDERMA YYTSRMQFPM EMAAAPGRPG PPGKDGAPGR PGAPGSPGLP GQIGREGRQG
121 LPGVRGLPGT KGEKGDIGIG IAGENGLPGP PGPQGPPGYG KMGATGPMGQ QGIPGIPGPP
181 GPMGQPGKAG HCNPSDCFGA MPMEQQYPPM KTMKGPFG
C. Nucleotide and amino acid composite sequence (SEQ ID NO:18) clone number: FP1572
Start code: 1708ATG stops coding: 2362TGA protein molecular weight: 21732.83
1 GCA CAG GCT CCC CCA GAT GCC CTG GCC AGG CCA GGC ATG TGC CCA CTT 48
49 GCA AGT CTT TAT CCT TCT CTT AGG TTC CCT CTG GCA TGC ACT TTT CTC 96
97 GTC CAC CTG CCC AAT CCC AGA CAT CCT CAG AGT CTA GCA TCT GGT GAC 144
145 ACT TCT GTG ACA CTT TCC CTG ATC CCC AGC CAG AGG GAA CTG CTC CCC 192
193 GCT CCT GAG GTC CCT GAG CCC CTT TCC ACA CAG AAC ACT TAG ACT GTT 240
241 GTA TTG CCC CTG GTT GTG TTG GGC CCA TGA CCC TCA CTG GAC TGT GAG 288
289 CTC CTT GAG GAC AGG GGC ACC CAC TTT TTC ATC TTG GGG AAT CAG AGC 336
337 CCA CAT GGT GCT GCG TGC GCT GTT GAG GCC TCT GTG CAG CCC GGG CCC 384
385 ATG TCT GCT GGG ATG GGG GGA AAG GTA GGG AGG TAG GTG GGG AGG AGG 432
433 CAG GGG GTC TGA TGG AGT AGA ATA CAA CAG GCA GCT TTC ACA GTG GAT 480
481 TGA AGC TGT GCT TTC AGT TCA GCA TAT GCT ACT GAG CCC CTG CCA TGC 528
529 CAG CCC CTG GGT CAG CAC TGG AGA AGA TGC CCT GTC CCT TCC TCA TGG 576
577 GGC AGC AAG GAA AGG GGA TGC TTT GCA GAT GAA GAC AGA TCC TCC TGG 624
625 GGA GGC GAC GTG TGG CAC ATA TGC AGT TAG ACA GGG ACA GAT TGG GCC 672
673 CAG AAG CCA GCT GTC CTG TTT CCA GCA GTC ACC AGG ACC CTT GAC CCA 720
721 GGG CCT GGT CTC CAT GGG TGG GGC AAA GCC CTG CTG ACT TTA TTG TGG 768
769 GGC CGG GGG GGG GGG GGG CGG CGC TGG TTT CCA GGG CAG CGA AGG GAT 816
817 TCG AGG CCC ATC AGG CCT GCC TGG CTC CCC TGG GCC ACC GGG ACC TCC 864
865 TGG GAT TCA GGG CCC CGC CGG TCT GGA TGG TTT GGA TGG GAA GGA CGG 912
913 CAA GCC TGG CTT GAG GGG GGA CCC TGG TCC TGC TGG CCC CCC TGG ACT 960
961 CAT GGG ACC ACC GGG CTT TAA GGG GAA AAC AGG ACA TCC TGG CCT CCC 1008
1009 AGG ACC TAA GGG TGA CTG TGG CAA ACC AGG TCC CCC TGG CAG CAC TGG 1056
1057 CCG GCC TGG CGC AGA GGG TGA ACC TGG TGC CAT GGG ACC CCA GGG AAG 1104
1105 ACC CGG TCC CCC GGG ACA CGT TGG GCC ACC AGG GCC TCC AGG CCA GCC 1152
1153 AGG ACC AGC TGG GAT CTC TGC AGT GGG TCT GAA AGG AGA CCG AGG AGC 1200
1201 CAC CGG AGA AAG GGG CCT TGC AGG CCT CCC AGG CCA GCC CGG CCC CCC 1248
1249 TGG ACA CCC TGG CCC CCC AGG CGA ACC TGG TAC GGA TGG TGC AGC TGG 1296
1297 CAA AGA GGG ACC CCC TGG AAA GCA GGG ATT CTA TGG ACC TCC TGG TCC 1344
1345 CAA GGT GAG TGG GCA CTG GCT GGG CTT GAG AGG AGA AGC TCA CGA GTG 1392
1393 GAT GAA GCC CAG CTG CAA GTT ACG GCT TTA GCC ACA GAG AGG AGA CAC 1440
1441 AGG AAG GCC AGT GGG CAG GGG GCA AAA ATG CAG AAC TTG GAG CAG GAT 1488
1489 AGT GTC TGA AAA TTT CTC ACA TGG GGT CAT TAG CTG TGG TTG GGG GGA 1536
1537 AGG AAA TCT CCA GTT CAT TTA CTC ACT GAT TGA CTG ATT GAT TCA TTC 1584
1585 ATT GTT TCC ATT TCC CGA GCA CTC CCT TGC TCC TGC TCA CAT TCC AGG 1632
1633 CCC TTG TCC TGG CCT CAG GGT GAT CCA GGA GCT GCA GGA CAG AAG GGC 1680
1681 CAG GCA GGA GAG AAG GGG AGA GCC GGC ATG CCT GGT GGA CCT GGC AAG 1728
1 Met Pro Gly Gly Pro Gly Lys 7
1729 AGT GGT TCC ATG GGG CCT GTT GGG CCA CCG GGC CCT GCA GGA GAG AGA 1776
8 Ser Gly Ser Met Gly Pro Val Gly Pro Pro Gly Pro Ala Gly Glu Arg 23
1777 GGC CAC CCT GGA GCT CCG GGG CCT TCG GGG AGC CCT GGC TTG CCT GGT 1824
24 Gly His Pro Gly Ala Pro Gly Pro Ser Gly Ser Pro Gly Leu Pro Gly 39
1825 GTG CCT GGC TCC ATG GGA GAC ATG GTG AAT TAT GAT GAA ATC AAG AGG 1872
40 Val Pro Gly Ser Met Gly Asp Met Val Asn Tyr Asp Glu Ile Lys Arg 55
1873 TTC ATC AGA CAA GAG ATC ATT AAA ATG TTT GAT GAG AGA ATG GCT TAC 1920
56 Phe Ile Arg Gln Glu Ile Ile Lys Met Phe Asp Glu Arg Met Ala Tyr 71
1921 TAC ACC TCC AGG ATG CAG TTC CCC ATG GAG ATG GCG GCA GCT CCG GGA 1968
72 Tyr Thr Ser Arg Met Gln Phe Pro Met Glu Met Ala Ala Ala Pro Gly 87
1969 CGA CCA GGG CCT CCA GGG AAG GAT GGT GCT CCG GGC AGG CCA GGT GCT 2016
88 Arg Pro Gly Pro Pro Gly Lys Asp Gly Ala Pro Gly Arg Pro Gly Ala 103
2017 CCA GGG TCA CCT GGG CTC CCT GGT CAG ATT GGC AGA GAA GGA CGG CAG 2064
104 Pro Gly Ser Pro Gly Leu Pro Gly Gln Ile Gly Arg Glu Gly Arg Gln 119
2065 GGC TTG CCA GGA GTA AGA GGA TTG CCT GGT ACC AAA GGT GAA AAA GGG 2112
120 Gly Leu Pro Gly Val Arg Gly Leu Pro Gly Thr Lys Gly Glu Lys Gly 135
2113 GAC ATT GGT ATT GGC ATT GCA GGA GAA AAT GGT CTT CCC GGC CCC CCA 2160
136 Asp Ile Gly Ile Gly Ile Ala Gly Glu Asn Gly Leu Pro Gly Pro Pro 151
2161 GGT CCT CAA GGT CCT CCA GGC TAT GGC AAG ATG GGT GCA ACA GGA CCA 2208
152 Gly Pro Gln Gly Pro Pro Gly Tyr Gly Lys Met Gly Ala Thr Gly Pro 167
2209 ATG GGC CAG CAA GGC ATC CCT GGC ATC CCT GGG CCC CCG GGT CCC ATG 2256
168 Met Gly Gln Gln Gly Ile Pro Gly Ile Pro Gly Pro Pro Gly Pro Met 183
2257 GGC CAG CCA GGC AAG GCT GGC CAC TGT AAT CCC TCT GAC TGC TTT GGG 2304
184 Gly Gln Pro Gly Lys Ala Gly His Cys Asn Pro Ser Asp Cys Phe Gly 199
2305 GCC ATG CCG ATG GAG CAG CAG TAC CCA CCC ATG AAA ACC ATG AAG GGG 2352
200 Ala Met Pro Met Glu Gln Gln Tyr Pro Pro Met Lys Thr Met Lys Gly 215
2353 CCT TTT GGC TGA AAT TCC CCA CCT GCC TTT GGA TGA AAG ACT CCG TTG 2400
216 Pro Phe Gly *** 219
2401 GGA ATA AAT GGC CAA AGC TTA TAG GAC AAA AAA AAA AAA AAA AAA 2445
7.FP2568
A: nucleotide sequence (SEQ ID NO:19) length: 1987 bases
1 GTGCTGCGCC CGGATGCGTC TGTTACTAGA GTGGAGAGTC TACCTTCGTC TCACATGTGC
61 CACAAAGGAT GGCATGGCCC GGGAGTGCCC CACCACGTGG CTTTCACCCC CTGCAAAGCC
121 AGACTTCGCC CAGCGACACA GTGTCAAGCC CACAGCTCTC CAAGGAGGAA GATGGTCCAG
181 GCTGGGAGCA TCCCCTTAGC AGCAGCCTCT GATCCCTTGG CCAAGCAGGA GGGAACCATT
241 AGCAGCCTGA GGAGCTGGCT GGCTGGGAGC CTCGGGGACC GCCCAGCCTT GCTCCCAGCT
301 CACCCACAAG ATGTGGACAG CTCTTGTGCT CATTTGGATT TTCTCCTTGT CCTTATCTGA
361 AAGCCATGCG GCATCCAACG ATCCACGCAA CTTTGTGCCT AACAAAATGT GGAAGGGATT
421 AGTCAAGAGG AATGCATCTG TGGAAACAGT TGATAATAAA ACGTCTGAGG ATGTAACCAT
481 GGCAGCAGCT TCTCCTGTCA CATTGACCAA AGGGACTTCG GCAGCCCACC TCAACTCTAT
541 GGAAGTCACA ACAGAGGACA CAAGCAGGAC AGATGTGAGT GAACCAGCAA CTTCAGGAGT
601 TGCAGCTGAT GGTGTGACCT CCATTGCTCC CACGGCTGTG GCCTCCAGTA CGACTGCGGC
661 CTCCATTACG ACTGCGGCCT CCAGTATGAC TGTGGCCTCC AGTGCTCCCA CGACTGCAGC
721 CTCCAGTACA ACTGTGGCCT CCATTGCTCC CACGACTGCA GCCTCCAGTA TGACTGCGGC
781 CTCCAGCACT CCCATGACAC TTGCACTCCC CGCGCCCACG TCCACTTCCA CAGGGCGGAC
841 CCCGTCCACT ACCGCCACTG GGCATCCATC TCTCAGCACA GCCCTCGCAC AAGTGCCAAA
901 GAGCAGCGCG TTGCCAAGAA CAGCAACCCT GGCCACATTG GCCACACGTG CTCAGACTGT
961 AGCGACCACA GCAAACACAA GCAGCCCCAT GAGCACTCGT CCAAGTCCTT CCAAGCACAT
1021 GCCCAGTGAC ACCGCGGCAA GCCCTGTACC CCCTATGCGT CCCCAAGCAC AAGGTCCCAT
1081 TAGCCAGGTG TCAGTGGACC AGCCTGTGGT TAACACAACA AATAAATCCA CACCCATGCC
1141 CTCAAACACA ACCCCAGAGC CCGCCCCCAC CCCCACAGTG GTGACCACCA CCAAGGCACA
1201 AGCCAGGGAG CCAACTGCCA GCCCAGTGCC AGTACCTCAC ACCAGCCCAA TCCCTGAGAT
1261 GGAGGCCATG TCCCCCACGA CACAGCCAAG CCCCATGCCA TATACCCAGA GGGCCGCTGG
1321 GCCAGGCACA TCCCAGGCAC CGGAGCAGGT AGAGACTGAA GCCACACCAG GTACTGATTC
1381 CACTGGGCCA ACACCCAGGA GCTCAGGGGG CACTAAGATG CCAGCCACGG ACTCGTGCCA
1441 GCCCAGCACC CAAGGCCAGT ACATGGTGGT CACCACTGAG CCCCTCACCC AGGCCGTGGT
1501 AGACAAAACT CTCCTTCTGG TGGTGCTGTT ACTCGGGGTG ACCCTTTTCA TCACAGTCTT
1561 GGTTTTGTTT GCCCTGCAGG CCTATGAGAG CTACAAGAAG AAGGACTACA CCCAGGTGGA
1621 CTACTTAATC AACGGGATGT ATGCGGACTC AGAAATGTGA GGGGGGCGGG GGCCTGGCGG
1681 GAGGCCTGGC CCCTTCCTCG TCCTTTCCTT TTGCCTTTGA GACCAAACCA AGTGCTTCCA
1741 AATTCTTTTG GTGCAATTGA GGAGATATGC CAGATGCTTA AACACATTTA ATTGCTGTCA
1801 GATTAATTCC ATGATCACTA AAGAGTTGCT GCTTTTTTCA TATTTATTTT TGTAAATGAT
1861 TCTGTGCCCA GGAGCAGCTG GGGGTTCCAC CTCAGGGTGG GGCGGGCAGG ACCCCGTCTC
1921 CCCAGGTGTC GGAGCCTGAC CTGAATTAAA GTACTGACTG CTCGCCAAAA AAAAAAAAAA
1981 AAAAAAA
B: nucleotide sequence (SEQ ID NO:20) length: 449 amino acid
1 MWTALVLIWI FSLSLSESHA ASNDPRNFVP NKMWKGLVKR NASVETVDNK TSEDVTMAAA
61 SPVTLTKGTS AAHLNSMEVT TEDTSRTDVS EPATSGVAAD GVTSIAPTAV ASSTTAASIT
121 TAASSMTVAS SAPTTAASST TVASIAPTTA ASSMTAASST PMTLALPAPT STSTGRTPST
181 TATGHPSLST ALAQVPKSSA LPRTATLATL ATRAQTVATT ANTSSPMSTR PSPSKHMPSD
241 TAASPVPPMR PQAQGPISQV SVDQPVVNTT NKSTPMPSNT TPEPAPTPTV VTTTKAQARE
301 PTASPVPVPH TSPIPEMEAM SPTTQPSPMP YTQRAAGPGT SQAPEQVETE ATPGTDSTGP
361 TPRSSGGTKM PATDSCQPST QGQYMVVTTE PLTQAVVDKT LLLVVLLLGV TLFITVLVLF
421 ALQAYESYKK KDYTQVDYLI NGMYADSEM
C. Nucleotide and amino acid composite sequence (SEQ ID NO:21) clone number: FP2568
Start code: 311ATG stops coding: 1658TGA protein molecular weight: 46140.13
1 G TGC TGC GCC CGG ATG CGT CTG TTA CTA GAG TGG AGA GTC TAC CTT 46
47 CGT CTC ACA TGT GCC ACA AAG GAT GGC ATG GCC CGG GAG TGC CCC ACC 94
95 ACG TGG CTT TCA CCC CCT GCA AAG CCA GAC TTC GCC CAG CGA CAC AGT 142
143 GTC AAG CCC ACA GCT CTC CAA GGA GGA AGA TGG TCC AGG CTG GGA GCA 190
191 TCC CCT TAG CAG CAG CCT CTG ATC CCT TGG CCA AGC AGG AGG GAA CCA 238
239 TTA GCA GCC TGA GGA GCT GGC TGG CTG GGA GCC TCG GGG ACC GCC CAG 286
287 CCT TGC TCC CAG CTC ACC CAC AAG ATG TGG ACA GCT CTT GTG CTC ATT 334
1 Met Trp Thr Ala Leu Val Leu Ile 8
335 TGG ATT TTC TCC TTG TCC TTA TCT GAA AGC CAT GCG GCA TCC AAC GAT 382
9 Trp Ile Phe Ser Leu Ser Leu Ser Glu Ser His Ala Ala Ser Asn Asp 24
383 CCA CGC AAC TTT GTG CCT AAC AAA ATG TGG AAG GGA TTA GTC AAG AGG 430
25 Pro Arg Asn Phe Val Pro Asn Lys Met Trp Lys Gly Leu Val Lys Arg 40
431 AAT GCA TCT GTG GAA ACA GTT GAT AAT AAA ACG TCT GAG GAT GTA ACC 478
41 Asn Ala Ser Val Glu Thr Val Asp Asn Lys Thr Ser Glu Asp Val Thr 56
479 ATG GCA GCA GCT TCT CCT GTC ACA TTG ACC AAA GGG ACT TCG GCA GCC 526
57 Met Ala Ala Ala Ser Pro Val Thr Leu Thr Lys Gly Thr Ser Ala Ala 72
527 CAC CTC AAC TCT ATG GAA GTC ACA ACA GAG GAC ACA AGC AGG ACA GAT 574
73 His Leu Asn Ser Met Glu Val Thr Thr Glu Asp Thr Ser Arg Thr Asp 88
575 GTG AGT GAA CCA GCA ACT TCA GGA GTT GCA GCT GAT GGT GTG ACC TCC 622
89 Val Ser Glu Pro Ala Thr Ser Gly Val Ala Ala Asp Gly Val Thr Ser 104
623 ATT GCT CCC ACG GCT GTG GCC TCC AGT ACG ACT GCG GCC TCC ATT ACG 670
105 Ile Ala Pro Thr Ala Val Ala Ser Ser Thr Thr Ala Ala Ser Ile Thr 120
671 ACT GCG GCC TCC AGT ATG ACT GTG GCC TCC AGT GCT CCC ACG ACT GCA 718
121 Thr Ala Ala Ser Ser Met Thr Val Ala Ser Ser Ala Pro Thr Thr Ala 136
719 GCC TCC AGT ACA ACT GTG GCC TCC ATT GCT CCC ACG ACT GCA GCC TCC 766
137 Ala Ser Ser Thr Thr Val Ala Ser Ile Ala Pro Thr Thr Ala Ala Ser 152
767 AGT ATG ACT GCG GCC TCC AGC ACT CCC ATG ACA CTT GCA CTC CCC GCG 814
153 Ser Met Thr Ala Ala Ser Ser Thr Pro Met Thr Leu Ala Leu Pro Ala 168
815 CCC ACG TCC ACT TCC ACA GGG CGG ACC CCG TCC ACT ACC GCC ACT GGG 862
169 Pro Thr Ser Thr Ser Thr Gly Arg Thr Pro Ser Thr Thr Ala Thr Gly 184
863CAT CCA TCT CTC AGC ACA GCC CTC GCA CAA GTG CCA AAG AGC AGC GCG 910
185 His Pro Ser Leu Ser Thr Ala Leu Ala Gln Val Pro Lys Ser Ser Ala 200
911 TTG CCA AGA ACA GCA ACC CTG GCC ACA TTG GCC ACA CGT GCT CAG ACT 958
201 Leu Pro Arg Thr Ala Thr Leu Ala Thr Leu Ala Thr Arg Ala Gln Thr 216
959 GTA GCG ACC ACA GCA AAC ACA AGC AGC CCC ATG AGC ACT CGT CCA AGT 1006
217 Val Ala Thr Thr Ala Asn Thr Ser Ser Pro Met Ser Thr Arg Pro Ser 232
1007 CCT TCC AAG CAC ATG CCC AGT GAC ACC GCG GCA AGC CCT GTA CCC CCT 1054
233 Pro Ser Lys His Met Pro Ser Asp Thr Ala Ala Ser Pro Val Pro Pro 248
1055 ATG CGT CCC CAA GCA CAA GGT CCC ATT AGC CAG GTG TCA GTG GAC CAG 1102
249 Met Arg Pro Gln Ala Gln Gly Pro Ile Ser Gln Val Ser Val Asp Gln 264
1103 CCT GTG GTT AAC ACA ACA AAT AAA TCC ACA CCC ATG CCC TCA AAC ACA 1150
265 Pro Val Val Asn Thr Thr Asn Lys Ser Thr Pro Met Pro Ser Asn Thr 280
1151 ACC CCA GAG CCC GCC CCC ACC CCC ACA GTG GTG ACC ACC ACC AAG GCA 1198
281 Thr Pro Glu Pro Ala Pro Thr Pro Thr Val Val Thr Thr Thr Lys Ala 296
1199 CAA GCC AGG GAG CCA ACT GCC AGC CCA GTG CCA GTA CCT CAC ACC AGC 1246
297 Gln Ala Arg Glu Pro Thr Ala Ser Pro Val Pro Val Pro His Thr Ser 312
1247 CCA ATC CCT GAG ATG GAG GCC ATG TCC CCC ACG ACA CAG CCA AGC CCC 1294
313 Pro Ile Pro Glu Met Glu Ala Met Ser Pro Thr Thr Gln Pro Ser Pro 328
1295 ATG CCA TAT ACC CAG AGG GCC GCT GGG CCA GGC ACA TCC CAG GCA CCG 1342
329 Met Pro Tyr Thr Gln Arg Ala Ala Gly Pro Gly Thr Ser Gln Ala Pro 344
1343 GAG CAG GTA GAG ACT GAA GCC ACA CCA GGT ACT GAT TCC ACT GGG CCA 1390
345 Glu Gln Val Glu Thr Glu Ala Thr Pro Gly Thr Asp Ser Thr Gly Pro 360
1391 ACA CCC AGG AGC TCA GGG GGC ACT AAG ATG CCA GCC ACG GAC TCG TGC 1438
361 Thr Pro Arg Ser Ser Gly Gly Thr Lys Met Pro Ala Thr Asp Ser Cys 376
1439 CAG CCC AGC ACC CAA GGC CAG TAC ATG GTG GTC ACC ACT GAG CCC CTC 1486
377 Gln Pro Ser Thr Gln Gly Gln Tyr Met yal yal Thr Thr Glu Pro Leu 392
1487 ACC CAG GCC GTG GTA GAC AAA ACT CTC CTT CTG GTG GTG CTG TTA CTC 1534
393 Thr Gln Ala Val Val Asp Lys Thr Leu Leu Leu Val Val Leu Leu Leu 408
1535 GGG GTG ACC CTT TTC ATC ACA GTC TTG GTT TTG TTT GCC CTG CAG GCC 1582
409 Gly Val Thr Leu Phe Ile Thr Val Leu Val Leu Phe Ala Leu Gln Ala 424
1583 TAT GAG AGC TAC AAG AAG AAG GAC TAC ACC CAG GTG GAC TAC TTA ATC 1630
425 Tyr Glu Ser Tyr Lys Lys Lys Asp Tyr Thr Gln Val Asp Tyr Leu Ile 440
1631 AAC GGG ATG TAT GCG GAC TCA GAA ATG TGA GGG GGG CGG GGG CCT GGC 1678
441 Asn Gly Met Tyr Ala Asp Ser Glu Met *** 450
1679 GGG AGG CCT GGC CCC TTC CTC GTC CTT TCC TTT TGC CTT TGA GAC CAA 1726
1727 ACC AAG TGC TTC CAA ATT CTT TTG GTG CAA TTG AGG AGA TAT GCC AGA 1774
1775 TGC TTA AAC ACA TTT AAT TGC TGT CAG ATT AAT TCC ATG ATC ACT AAA 1822
1823 GAG TTG CTG CTT TTT TCA TAT TTA TTT TTG TAA ATG ATT CTG TGC CCA 1870
1871 GGA GCA GCT GGG GGT TCC ACC TCA GGG TGG GGC GGG CAG GAC CCC GTC 1918
1919 TCC CCA GGT GTC GGA GCC TGA CCT GAA TTA AAG TAC TGA CTG CTC GCC 1966
1967 AAA AAA AAA AAA AAA AAA AAA 1987
8.FP2653
A: nucleotide sequence (SEQ ID NO:22) length: 2437 bases
1 GGGACGGTTT GTGACCCCCT TAGCCGACCC TACTCCTCAC TGGCCGGGAC AACTGGTCTT
61 ATCACGGAGG CTGGGGCCAG GCAGCCCTTC GGTTCGGGTG GGCCCATGGA CCCCAGTCCA
121 ACGCCGAGGG AATAGGACCA TCCAAAAGCG GAACCTTCGC CTCAGAAAAA GGGTGCGGGA
181 CCCCTCCTCA CCGTGCGGTC ACGGTACGGA CAGGGTAGAT CACAGGCTGA GGGACAGAGC
241 AAAGACCCCT GAGGCCGGAC ACCTGGGGTC CTGCCGGGCC CCTCCCCACG AGAGTTCCCT
301 GTGTCTGTGC CAATCGTTTT CGTCTTTCTT TGCCGCAGTT TCTTTTCCTG TAAATCATGG
361 TTAATGACAT TAACCTTCTT ACCATCAGGG GTTAGTTGTG GTTGTGATAA ATAATTACTA
421 CCGTTATTAA GCAATTGCAA TTTGCAGTGT GCCAGGCACT GTGCCAAGTA TTTTGCTTCG
481 ATGATTTCAC GTCATCTTCA AAACAACCTC ATGAGGGCTG GGTCAGTTAG AACCTAAACA
541 AACTAGAGAC CTGGTTGCAA CCCCTCAGGC TCTGCTGATG CTGTCCCCCT TTGTTCCTGC
601 AGCGTGGACC CTGCCAGCAG CCAGGCCATG GAGCTCTCTG ATGTCACCCT CATTGAGGGT
661 GTGGGTAATG AGGTGATGGT GGTGGCAGGT GTGGTGGTGC TGATTCTAGC CTTGGTCCTA
721 GCTTGGCTCT CTACCTACGT AGCAGACAGC GGTAGCAACC AGCTCCTGGG CGCTATTGTG
781 TCAGCAGGCG ACACATCCGT CCTCCACCTG GGGCATGTGG ACCACCTGGT GGCAGGCCAA
841 GGCAACCCCG AGCCAACTGA ACTCCCCCAT CCATCAGAGG GTAATGATGA GAAGGCTGAA
901 GAGGCGGGTG AAGGTCGGGG AGACTCCACT GGGGAGGCTG GAGCTGGGGG TGGTGTTGAG
961 CCCAGCCTTG AGCATCTCCT TGACATCCAA GGCCTGCCCA AAAGACAAGC AGGTGCAGGC
1021 AGCAGCAGTC CAGAGGCCCC CCTGAGATCT GAGGATAGCA CCTGCCTCCC TCCCAGCCCT
1081 GGCCTCATCA CTGTGCGGCT CAAATTCCTC AATGATACCG AGGAGCTGGC TGTGGCTAGG
1141 CCAGAGGATA CCGTGGGTGC CCTGAAGAGC AAATACTTCC CTGGACAAGA AAGCCAGATG
1201 AAACTGATCT ACCAGGGCCG CCTGCTACAA GACCCAGCCC GCACACTGCG TTCTCTGAAC
1261 ATTACCGACA ACTGTGTGAT TCACTGCCAC CGCTCACCCC CAGGGTCAGC TGTTCCAGGC
1321 CCCTCAGCCT CCTTGGCCCC CTCGGCCACT GAGCCACCCA GCCTTGGTGT CAATGTGGGC
1381 AGCCTCATGG TGCCTGTCTT TGTGGTGCTG TTGGGTGTGG TCTGGTACTT CCGAATCAAT
1441 TACCGCCAAT TCTTCACAGC ACCTGCCACT GTCTCCCTGG TGGGAGTCAC CGTCTTCTTC
1501 AGCTTCCTAG TATTTGGGAT GTATGGACGA TAAGGACATA GGAAGAAAAT GAAAGGCATG
1561 GTCTTTCTCC TTTATGGCCT CCCCACTTTT CCTGGCCAGA GCTGGGCCCA AGGGCCGGGG
1621 AGGGAGGGGT GGAAAAGGAT GTGATGGAAA TCTCCTCCAT AGGACACAGG AGGCAAGTAT
1681 GCGGCCTCCC CTTCTCATCC ACAGGAGTAC AGATGTCCCT CCCGTGCGAG CACAACTCAG
1741 GTAGAAATGA GGATGTCATC TTCCTTCACT TTTAGGGTCC TCTGAAGGAG TTCAAAGCTG
1801 CTGGCCAAGC TCAGTGGGGA GCCTGGGCTC TGAGATTCCC TCCCACCTGT GGTTCTGACT
1861 CTTCCCAGTG TCCTGCATGT CTGCCCCCAG CACCCAGGGC TGCCTGCAAG GGCAGCTCAG
1921 CATGGCCCCA GCACAACTCC GTAGGGAGCC TGGAGTATCC TTCCATTTCT CAGCCAAATA
1981 CTCATCTTTT GAGACTGAAA TCACACTGGC GGGAATGAAG ATTGTGCCAG CCTTCTCTTA
2041 TGGGCACCTA GCCGCCTTCA CCTTCTTCCT CTACCCCTTA GCAGGAATAG GGTGTCCTCC
2101 CTTCTTTCAA AGCACTTTGC TTGCATTTTA TTTTATTTTT TTAAGAGTCC TTCATAGAGC
2161 TCAGTCAGGA AGGGGATGGG GCACCAAGCC AAGCCCCCAG CATTGGGAGC GGCCAGGCCA
2221 CAGCTGCTGC TCCCGTAGTC CTCAGGCTGT AAGCAAGAGA CAGCACTGGC CCTTGGCCAG
2281 CGTCCTACCC TGCCCAACTC CAAGGACTGG GTATGGATCG CTGGGCCCTA GGCTCTTGCT
2341 TCTGGGGCTA TTGGAGGGTC AGTGTCTGTG ACTGAATAAA GTTCCATTTT GTGGTCCTGC
2401 AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAA
B: nucleotide sequence (SEQ ID NO:23) length: 301 amino acid
1 MELSDVTLIE GVGNEVMVVA GVVVLILALV LAWLSTYVAD SGSNQLLGAI VSAGDTSVLH
61 LGHVDHLVAG QGNPEPTELP HPSEGNDEKA EEAGEGRGDS TGEAGAGGGV EPSLEHLLDI
121 QGLPKRQAGA GSSSPEAPLR SEDSTCLPPS PGLITVRLKF LNDTEELAVA RPEDTVGALK
181 SKYFPGQESQ MKLIYQGRLL QDPARTLRSL NITDNCVIHC HRSPPGSAVP GPSASLAPSA
241 TEPPSLGVNV GSLMVPVFVV LLGVVWYFRI NYRQFFTAPA TVSLVGVTVF FSFLVFGMYG
301 R
C. Nucleotide and amino acid composite sequence (SEQ ID NO:24) clone number: FP2653
Start code: 628 ATG stop coding: 1531 TAA protein molecular weights: 31605.19
1 GGG ACG GTT TGT GAC CCC CTT AGC CGA CCC TAC TCC TCA CTG GCC GGG 48
49 ACA ACT GGT CTT ATC ACG GAG GCT GGG GCC AGG CAG CCC TTC GGT TCG 96
97 GGT GGG CCC ATG GAC CCC AGT CCA ACG CCG AGG GAA TAG GAC CAT CCA 144
145 AAA GCG GAA CCT TCG CCT CAG AAA AAG GGT GCG GGA CCC CTC CTC ACC 192
193 GTG CGG TCA CGG TAC GGA CAG GGT AGA TCA CAG GCT GAG GGA CAG AGC 240
241 AAA GAC CCC TGA GGC CGG ACA CCT GGG GTC CTG CCG GGC CCC TCC CCA 288
289 CGA GAG TTC CCT GTG TCT GTG CCA ATC GTT TTC GTC TTT CTT TGC CGC 336
337 AGT TTC TTT TCC TGT AAA TCA TGG TTA ATG ACA TTA ACC TTC TTA CCA 384
385 TCA GGG GTT AGT TGT GGT TGT GAT AAA TAA TTA CTA CCG TTA TTA AGC 432
433 AAT TGC AAT TTG CAG TGT GCC AGG CAC TGT GCC AAG TAT TTT GCT TCG 480
481 ATG ATT TCA CGT CAT CTT CAA AAC AAC CTC ATG AGG GCT GGG TCA GTT 528
529 AGA ACC TAA ACA AAC TAG AGA CCT GGT TGC AAC CCC TCA GGC TCT GCT 576
577 GAT GCT GTC CCC CTT TGT TCC TGC AGC GTG GAC CCT GCC AGC AGC CAG 624
625 GCC ATG GAG CTC TCT GAT GTC ACC CTC ATT GAG GGT GTG GGT AAT GAG 672
1 Met Glu Leu Ser Asp Val Thr Leu Ile Glu Gly Val Gly Asn Glu 15
673 GTG ATG GTG GTC GCA GGT GTG GTG GTG CTG ATT CTA GCC TTG GTC CTA 720
16 Val Met Val Val Ala Gly Val Val Val Leu Ile Leu Ala Leu Val Leu 31
721 GCT TGG CTC TCT ACC TAC GTA GCA GAC AGC GGT AGC AAC CAG CTC CTG 768
32 Ala Trp Leu Ser Thr Tyr Val Ala Asp Ser Gly Ser Asn Gln Leu Leu 47
769 GGC GCT ATT GTG TCA GCA GGC GAC ACA TCC GTC CTC CAC CTG GGG CAT 816
48 Gly Ala Ile Val Ser Ala Gly Asp Thr Ser Val Leu His Leu Gly His 63
817 GTG GAC CAC CTG GTG GCA GGC CAA GGC AAC CCC GAG CCA ACT GAA CTC 864
64 Val Asp His Leu Val Ala Gly Gln Gly Asn Pro Glu Pro Thr Glu Leu 79
865 CCC CAT CCA TCA GAG GGT AAT GAT GAG AAG GCT GAA GAG GCG GGT GAA 912
80 Pro His Pro Ser Glu Gly Asn Asp Glu Lys Ala Glu Glu Ala Gly Glu 95
913 GGT CGG GGA GAC TCC ACT GGG GAG GCT GGA GCT GGG GGT GGT GTT GAG 960
96 Gly Arg Gly Asp Ser Thr Gly Glu Ala Gly Ala Gly Gly Gly Val Glu 111
961 CCC AGC CTT GAG CAT CTC CTT GAC ATC CAA GGC CTG CCC AAA AGA CAA 1008
112 Pro Ser Leu Glu His Leu Leu Asp Ile Gln Gly Leu Pro Lys Arg Gln 127
1009 GCA GGT GCA GGC AGC AGC AGT CCA GAG GCC CCC CTG AGA TCT GAG GAT 1056
128 Ala Gly Ala Gly Ser Ser Ser Pro Glu Ala Pro Leu Arg Ser Glu Asp 143
1057 AGC ACC TGC CTC CCT CCC AGC CCT GGC CTC ATC ACT GTG CGG CTC AAA 1104
144 Ser Thr Cys Leu Pro Pro Ser Pro Gly Leu Ile Thr Val Arg Leu Lys 159
1105 TTC CTC AAT GAT ACC GAG GAG CTG GCT GTG GCT AGG CCA GAG GAT ACC 1152
160 Phe Leu Asn Asp Thr Glu Glu Leu Ala Val Ala Arg Pro Glu Asp Thr 175
1153 GTG GGT GCC CTG AAG AGC AAA TAC TTC CCT GGA CAA GAA AGC CAG ATG 1200
176 Val Gly Ala Leu Lys Ser Lys Tyr Phe Pro Gly Gln Glu Ser Gln Met 191
1201 AAA CTG ATC TAC CAG GGC CGC CTG CTA CAA GAC CCA GCC CGC ACA CTG 1248
192 Lys Leu Ile Tyr Gln Gly Arg Leu Leu Gln Asp Pro Ala Arg Thr Leu 207
1249 CGT TCT CTG AAC ATT ACC GAC AAC TGT GTG ATT CAC TGC CAC CGC TCA 1296
208 Arg Ser Leu Asn Ile Thr Asp Asn Cys Val Ile His Cys His Arg Ser 223
1297 CCC CCA GGG TCA GCT GTT CCA GGC CCC TCA GCC TCC TTG GCC CCC TCG 1344
224 Pro Pro Gly Ser Ala Val Pro Gly Pro Ser Ala Ser Leu Ala Pro Ser 239
1345 GCC ACT GAG CCA CCC AGC CTT GGT GTC AAT GTG GGC AGC CTC ATG GTG 1392
240 Ala Thr Glu Pro Pro Ser Leu Gly Val Asn Val Gly Ser Leu Met Val 255
1393 CCT GTC TTT GTG GTG CTG TTG GGT GTG GTC TGG TAC TTC CGA ATC AAT 1440
256 Pro Val Phe Val Val Leu Leu Gly Val Val Trp Tyr Phe Arg Ile Asn 271
1441 TAC CGC CAA TTC TTC ACA GCA CCT GCC ACT GTC TCC CTG GTG GGA GTC 1488
272 Tyr Arg Gln Phe Phe Thr Ala Pro Ala Thr Val Ser Leu Val Gly Val 287
1489 ACC GTC TTC TTC AGC TTC CTA GTA TTT GGG ATG TAT GGA CGA TAA GGA 1536
288 Thr Val Phe Phe Ser Phe Leu Val Phe Gly Met Tyr Gly Arg *** 302
1537 CAT AGG AAG AAA ATG AAA GGC ATG GTC TTT CTC CTT TAT GGC CTC CCC 1584
1585 ACT TTT CCT GGC CAG AGC TGG GCC CAA GGG CCG GGG AGG GAG GGG TGG 1632
1633 AAA AGG ATG TGA TGG AAA TCT CCT CCA TAG GAC ACA GGA GGC AAG TAT 1680
1681 GCG GCC TCC CCT TCT CAT CCA CAG GAG TAC AGA TGT CCC TCC CGT GCG 1728
1729 AGC ACA ACT CAG GTA GAA ATG AGG ATG TCA TCT TCC TTC ACT TTT AGG 1776
1777 GTC CTC TGA AGG AGT TCA AAG CTG CTG GCC AAG CTC AGT GGG GAG CCT 1824
1825 GGG CTC TGA GAT TCC CTC CCA CCT GTG GTT CTG ACT CTT CCC AGT GTC 1872
1873 CTG CAT GTC TGC CCC CAG CAC CCA GGG CTG CCT GCA AGG GCA GCT CAG 1920
1921 CAT GGC CCC AGC ACA ACT CCG TAG GGA GCC TGG AGT ATC CTT CCA TTT 1968
1969 CTC AGC CAA ATA CTC ATC TTT TGA GAC TGA AAT CAC ACT GGC GGG AAT 2016
2017 GAA GAT TGT GCC AGC CTT CTC TTA TGG GCA CCT AGC CGC CTT CAC CTT 2064
2065 CTT CCT CTA CCC CTT AGC AGG AAT AGG GTG TCC TCC CTT CTT TCA AAG 2112
2113 CAC TTT GCT TGC ATT TTA TTT TAT TTT TTT AAG AGT CCT TCA TAG AGC 2160
2161 TCA GTC AGG AAG GGG ATG GGG CAC CAA GCC AAG CCC CCA GCA TTG GGA 2208
2209 GCG GCC AGG CCA CAG CTG CTG CTC CCG TAG TCC TCA GGC TGT AAG CAA 2256
2257 GAG ACA GCA CTG GCC CTT GGC CAG CGT CCT ACC CTG CCC AAC TCC AAG 2304
2305 GAC TGG GTA TGG ATC GCT GGG CCC TAG GCT CTT GCT TCT GGG GCT ATT 2352
2353 GGA GGG TCA GTG TCT GTG ACT GAA TAA AGT TCC ATT TTG TGG TCC TGC 2400
2401 AAA AAA AAA AAA AAA AAA AAA AAA AAA AAA AAA AAAA 2437
9.FP2823
A: nucleotide sequence (SEQ ID NO:25) length: 2643 bases
1 GGTTCCTAGT TGCCTTGTGC AATACTCTTG TGGCATGTTT GCTACACCTC CTGAACTTTG
61 ATTTGTTTGC CTTTTACCAG CTATTATGAC TCAAAATTGT CCCCTAGAAC ATGGAATAAT
121 GGCAGAAAGA AAGTGTGTGG TTGAATAAAC ACACAGATTG GCATCCACCG TTGAAACAGG
181 AAAACATCTT ATGTTATGCT GCTGCTGTTG TGAGGGCTGA TGGGCCTTGA AATGTATTTC
241 CTGCACTATG TGTGTGTGAG TGTGTGTGAT TATACTTTTT GGCCTCACAG CCCCATCATC
301 CCTTTCTAAT AACGTCACGT CGATAAGGGG CTTAGGATTG CATCTGGCCT GTGTAAGCCC
361 TCTGAGTTCT GCGGTTCTTA GAGTTCCCTT TTCAGCACTA TAGCTCTGCC TTGTTCCCTT
421 GTTCCTCCTT CTGGCGCCCC GTGCTGTGCC CCCTGCAGGA GTCCAAGCTG TCCCCATGCT
481 GCGTTCTGGT CCGGCCGCCC CTCCCGTGGT GTGGCCCTGG CCGACCCCCC TCCTGCGCCC
541 CGCTTTTCTC GCAGAAGCTG CTCTTTGCCG GCTCCCGCTC TCAGCTGGTG CAGCTGCCCG
601 TGGCCGACTG CATGAAGTAT CGCTCCTGTG CAGACTGTGT CCTCGCCCGG GACCCCTATT
661 GCGCCTGGAG CGTCAACACC AGCCGCTGTG TGGCCGTGGG TGGCCACTCT GGATCTCTAC
721 TGATCCAGCA TGTGATGACC TCGGACACTT CAGGCATCTG CAACCTCCGT GGCAGTAAGA
781 AAGGTGAGCT TTTTCATTCC CGTCGCATCG GGCTGAGCCC TGGACCAGAG CTGGAGTTTC
841 TGTTCTCCTC TTCCCCAGCC CTGCTTTCCT GCACTAACAT GCACTCTGTT TTCTCTGCCA
901 CTACAGTCAG GCCCACTCCC AAAAACATCA CGGTGGTGGC GGGCACAGAC CTGGTGCTGC
961 CCTGCCACCT CTCCTCCAAC TTGGCCCATG CCCGCTGGAC CTTTGGGGGC CGGGACCTGC
1021 CTGCGGAACA GCCCGGGTCC TTCCTCTACG ATGCCCGGCT CCAGGCCCTG GTTGTGATGG
1081 CTGCCCAGCC CCGCCATGCC GGGGCCTACC ACTGCTTTTC AGAGGAGCAG GGGGCGCGGC
1141 TGGCTGCTGA AGGCTACCTT GTGGCTGTCG TGGCAGGCCC GTCGGTGACC TTGGAGGCCC
1201 GGGCCCCCCT GGAAAACCTG GGGCTGGTGT GGCTGGCGGT GGTGGCCCTG GGGGCTGTGT
1261 GCCTGGTGCT GCTGCTGCTG GTGCTGTCAT TGCGCCGGCG GCTGCGGGAA GAGCTGGAGA
1321 AAGGGGCCAA GGCTACTGAG AGGACCTTGG TGTACCCCCT GGAGCTGCCC AAGGAGCCCA
1381 CCAGTCCCCC CTTCCGGCCC TGTCCTGAAC CAGATGAGAA ACTTTGGGAT CCTGTCGGTT
1441 ACTACTATTC AGATGGCTCC CTTAAGATAG TACCTGGGCA TGCCCGGTGC CAGCCCGGTG
1501 GGGGGCCCCC TTCGCCACCT CCAGGCATCC CAGGCCAGCC TCTGCCTTCT CCAACTCGGC
1561 TTCACCTGGG GGGTGGGCGG AACTCAAATG CCAATGGTTA CGTGCGCTTA CAACTAGGAG
1621 GGGAGGACCG GGGAGGGCTC GGGCACCCCC TGCCTGAGCT CGCGGATGAA CTGAGACGCA
1681 AACTGCAGCA ACGCCAGCCA CTGCCCGACT CCAACCCCGA GGAGTCATCA GTATGAGGGG
1741 AACCCCCACC GCGTCGGCGG GAAGCGTGGG AGGTGTAGCT CCTACTTTTG CACAGGCACC
1801 AGCTACCTCA GGGACATGGC ACGGGCACCT GCTCTGTCTG GGACAGATAC TGCCCAGCAC
1861 CCACCCGGCC ATGAGGACCT GCTCTGCTCA GCACGGGCAC TGCCACTTGG TGTGGCTCAC
1921 CAGGGCACCA GCCTCGCAGA AGGCATCTTC CTCCTCTCTG TGAATCACAG ACACGCGGGA
1981 CCCCAGCCGC CAAAACTTTT CAAGGCAGAA GTTTCAAGAT GTGTGTTTGT CTGTATTTGC
2041 ACATGTGTTT GTGTGTGTGT GTATGTGTGT GTGCACGCGC GTGCGCGCTT GTGGCATAGC
2101 CTTCCTGTTT CTGTCAAGTC TTCCCTTGGC CTGGGTCCTC CTGGTGAGTC ATTGGAGCTA
2161 TGAAGGGGAA GGGGTCGTAT CACTTTGTCT CTCCTACCCC CACTGCCCCG AGTGTCGGGC
2221 AGCGATGTAC ATATGGAGGT GGGGTGGACA GGGTGCTGTG CCCCTTCAGA GGGAGTGCAG
2281 GGCTTGGGGT GGGCCTAGTC CTGCTCCTAG GGCTGTGAAT GTTTTCAGGG TGGGGGGAGG
2341 GAGATGGAGC CTCCTGTGTG TTTGGGGGGA AGGGTGGGTG GGGCCTCCCA CTTGGCCCCG
2401 GGGTTCAGTG GTATTTTATA CTTGCCTTCT TCCTGTACAG GGCTGGGAAA GGCTGTGTGA
2461 GGGGAGAGAA GGGAGAGGGT GGGCCTGCTG TGGACAATGG CATACTCTCT TCCAGCCCTA
2521 GGAGGAGGGC TCCTAACAGT GTAACTTATT GTGTCCCCGC GTATTTATTT GTTGTAAATA
2581 TTTGAGTATT TTTATATTGA CAAATAAAAT GGAGAAAATG AAACGAAAAA AAAAAAAAAA
2641 AAA
B: nucleotide sequence (SEQ ID NO:26) length: 374 amino acid
1 MKYRSCADCV LARDPYCAWS VNTSRCVAVG GHSGSLLIQH VMTSDTSGIC NLRGSKKGEL
61 FHSRRIGLSP GPELEFLFSS SPALLSCTNM HSVFSATTVR PTPKNITVVA GTDLVLPCHL
121 SSNLAHARWT FGGRDLPAEQ PGSFLYDARL QALVVMAAQP RHAGAYHCFS EEQGARLAAE
181 GYLVAVVAGP SVTLEARAPL ENLGLVWLAV VALGAVCLVL LLLVLSLRRR LREELEKGAK
241 ATERTLVYPL ELPKEPTSPP FRPCPEPDEK LWDPVGYYYS DGSLKIVPGH ARCQPGGGPP
301 SPPPGIPGQP LPSPTRLHLG GGRNSNANGY VRLQLGGEDR GGLGHPLPEL ADELRRKLQQ
361 RQPLPDSNPE ESSV
C. Nucleotide and amino acid composite sequence (SEQ ID NO:27) clone number: FP2823
Start code: 612ATG stops coding: 1734TGA protein molecular weight: 40201.86
1 GG TTC CTA GTT GCC TTG TGC AAT ACT CTT GTG GCA TGT TTG CTA CAC 47
48 CTC CTG AAC TTT GAT TTG TTT GCC TTT TAC CAG CTA TTA TGA CTC AAA 95
96 ATT GTC CCC TAG AAC ATG GAA TAA TGG CAG AAA GAA AGT GTG TGG TTG 143
144 AAT AAA CAC ACA GAT TGG CAT CCA CCG TTG AAA CAG GAA AAC ATC TTA 191
192 TGT TAT GCT GCT GCT GTT GTG AGG GCT GAT GGG CCT TGA AAT GTA TTT 239
240 CCT GCA CTA TGT GTG TGT GAG TGT GTG TGA TTA TAC TTT TTG GCC TCA 287
288 CAG CCC CAT CAT CCC TTT CTA ATA ACG TCA CGT CGA TAA GGG GCT TAG 335
336 GAT TGC ATC TGG CCT GTG TAA GCC CTC TGA GTT CTG CGG TTC TTA GAG 383
384 TTC CCT TTT CAG CAC TAT AGC TCT GCC TTG TTC CCT TGT TCC TCC TTC 431
432 TGG CGC CCC GTG CTG TGC CCC CTG CAG GAG TCC AAG CTG TCC CCA TGC 479
480 TGC GTT CTG GTC CGG CCG CCC CTC CCG TGG TGT GGC CCT GGC CGA CCC 527
528 CCC TCC TGC GCC CCG CTT TTC TCG CAG AAG CTG CTC TTT GCC GGC TCC 575
576 CGC TCT CAG CTG GTG CAG CTG CCC GTG GCC GAC TGC ATG AAG TAT CGC 623
1 Met Lys Tyr Arg 4
624 TCC TGT GCA GAC TGT GTC CTC GCC CGG GAC CCC TAT TGC GCC TGG AGC 671
5 Ser Cys Ala Asp Cys Val Leu Ala Arg Asp Pro Tyr Cys Ala Trp Ser 20
672 GTC AAC ACC AGC CGC TGT GTG GCC GTG GGT GGC CAC TCT GGA TCT CTA 719
21 Val Asn Thr Ser Arg Cys Val Ala Val Gly Gly His Ser Gly Ser Leu 36
720 CTG ATC CAG CAT GTG ATG ACC TCG GAC ACT TCA GGC ATC TGC AAC CTC 767
37 Leu Ile Gln His Val Met Thr Ser Asp Thr Ser Gly Ile Cys Asn Leu 52
768 CGT GGC AGT AAG AAA GGT GAG CTT TTT CAT TCC CGT CGC ATC GGG CTG 815
53 Arg Gly Ser Lys Lys Gly Glu Leu Phe His Ser Arg Arg Ile Gly Leu 68
816 AGC CCT GGA CCA GAG CTG GAG TTT CTG TTC TCC TCT TCC CCA GCC CTG 863
69 Ser Pro Gly Pro Glu Leu Glu Phe Leu Phe Ser Ser Ser Pro Ala Leu 84
864 CTT TCC TGC ACT AAC ATG CAC TCT GTT TTC TCT GCC ACT ACA GTC AGG 911
85 Leu Ser Cys Thr Asn Met His Ser Val Phe Ser Ala Thr Thr Val Arg 100
912 CCC ACT CCC AAA AAC ATC ACG GTG GTG GCG GGC ACA GAC CTG GTG CTG 959
101 Pro Thr Pro Lys Asn Ile Thr Val Val Ala Gly Thr Asp Leu Val Leu 116
960 CCC TGC CAC CTC TCC TCC AAC TTG GCC CAT GCC CGC TGG ACC TTT GGG 1007
117 Pro Cys His Leu Ser Ser Asn Leu Ala His Ala Arg Trp Thr Phe Gly 132
1008 GGC CGG GAC CTG CCT GCG GAA CAG CCC GGG TCC TTC CTC TAC GAT GCC 1055
133 Gly Arg Asp Leu Pro Ala Glu Gln Pro Gly Ser Phe Leu Tyr Asp Ala 148
1056 CGG CTC CAG GCC CTG GTT GTG ATG GCT GCC CAG CCC CGC CAT GCC GGG 1103
149 Arg Leu Gln Ala Leu Val Val Met Ala Ala Gln Pro Arg His Ala Gly 164
1104 GCC TAC CAC TGC TTT TCA GAG GAG CAG GGG GCG CGG CTG GCT GCT GAA 1151
165 Ala Tyr His Cys Phe Ser Glu Glu Gln Gly Ala Arg Leu Ala Ala Glu 180
1152 GGC TAC CTT GTG GCT GTC GTG GCA GGC CCG TCG GTG ACC TTG GAG GCC 1199
181 Gly Tyr Leu Val Ala Val Val Ala Gly Pro Ser Val Thr Leu Glu Ala 196
1200 CGG GCC CCC CTG GAA AAC CTG GGG CTG GTG TGG CTG GCG GTG GTG GCC 1247
197 Arg Ala Pro Leu Glu Asn Leu Gly Leu Val Trp Leu Ala Val Val Ala 212
1248 CTG GGG GCT GTG TGC CTG GTG CTG CTG CTG CTG GTG CTG TCA TTG CGC 1295
213 Leu Gly Ala Val Cys Leu Val Leu Leu Leu Leu Val Leu Ser Leu Arg 228
1296 CGG CGG CTG CGG GAA GAG CTG GAG AAA GGG GCC AAG GCT ACT GAG AGG 1343
229 Arg Arg Leu Arg Glu Glu Leu Glu Lys Gly Ala Lys Ala Thr Glu Arg 244
1344 ACC TTG GTG TAC CCC CTG GAG CTG CCC AAG GAG CCC ACC AGT CCC CCC 1391
245 Thr Leu Val Tyr Pro Leu Glu Leu Pro Lys Glu Pro Thr Ser Pro Pro 260
1392 TTC CGG CCC TGT CCT GAA CCA GAT GAG AAA CTT TGG GAT CCT GTC GGT 1439
261 Phe Arg Pro Cys Pro Glu Pro Asp Glu Lys Leu Trp Asp Pro Val Gly 276
1440 TAC TAC TAT TCA GAT GGC TCC CTT AAG ATA GTA CCT GGG CAT GCC CGG 1487
277 Tyr Tyr Tyr Ser Asp Gly Ser Leu Lys Ile Val Pro Gly His Ala Arg 292
1488 TGC CAG CCC GGT GGG GGG CCC CCT TCG CCA CCT CCA GGC ATC CCA GGC 1535
293 Cys Gln Pro Gly Gly Gly Pro Pro Ser Pro Pro Pro Gly Ile Pro Gly 308
1536 CAG CCT CTG CCT TCT CCA ACT CGG CTT CAC CTG GGG GGT GGG CGG AAC 1583
309 Gln Pro Leu Pro Ser Pro Thr Arg Leu His Leu Gly Gly Gly Arg Asn 324
1584 TCA AAT GCC AAT GGT TAC GTG CGC TTA CAA CTA GGA GGG GAG GAC CGG 1631
325 Ser Asn Ala Asn Gly Tyr Val Arg Leu Gln Leu Gly Gly Glu Asp Arg 340
1632 GGA GGG CTC GGG CAC CCC CTG CCT GAG CTC GCG GAT GAA CTG AGA CGC 1679
341 Gly Gly Leu Gly His Pro Leu Pro Glu Leu Ala Asp Glu Leu Arg Arg 356
1680 AAA CTG CAG CAA CGC CAG CCA CTG CCC GAC TCC AAC CCC GAG GAG TCA 1727
357 Lys Leu Gln Gln Arg Gln Pro Leu Pro Asp Ser Asn Pro Glu Glu Ser 372
1728 TCA GTA TGA GGG GAA CCC CCA CCG CGT CGG CGG GAA GCG TGG GAG GTG 1775
373 Ser Val *** 375
1776 TAG CTC CTA CTT TTG CAC AGG CAC CAG CTA CCT CAG GGA CAT GGC ACG 1823
1824 GGC ACC TGC TCT GTC TGG GAC AGA TAC TGC CCA GCA CCC ACC CGG CCA 1871
1872 TGA GGA CCT GCT CTG CTC AGC ACG GGC ACT GCC ACT TGG TGT GGC TCA 1919
1920 CCA GGG CAC CAG CCT CGC AGA AGG CAT CTT CCT CCT CTC TGT GAA TCA 1967
1968 CAG ACA CGC GGG ACC CCA GCC GCC AAA ACT TTT CAA GGC AGA AGT TTC 2015
2016 AAG ATG TGT GTT TGT CTG TAT TTG CAC ATG TGT TTG TGT GTG TGT GTA 2063
2064 TGT GTG TGT GCA CGC GCG TGC GCG CTT GTG GCA TAG CCT TCC TGT TTC 2111
2112 TGT CAA GTC TTC CCT TGG CCT GGG TCC TCC TGG TGA GTC ATT GGA GCT 2159
2160 ATG AAG GGG AAG GGG TCG TAT CAC TTT GTC TCT CCT ACC CCC ACT GCC 2207
2208 CCG AGT GTC GGG CAG CGA TGT ACA TAT GGA GGT GGG GTG GAC AGG GTG 2255
2256 CTG TGC CCC TTC AGA GGG AGT GCA GGG CTT GGG GTG GGC CTA GTC CTG 2303
2304 CTC CTA GGG CTG TGA ATG TTT TCA GGG TGG GGG GAG GGA GAT GGA GCC 2351
2352 TCC TGT GTG TTT GGG GGG AAG GGT GGG TGG GGC CTC CCA CTT GGC CCC 2399
2400 GGG GTT CAG TGG TAT TTT ATA CTT GCC TTC TTC CTG TAC AGG GCT GGG 2447
2448 AAA GGC TGT GTG AGG GGA GAG AAG GGA GAG GGT GGG CCT GCT GTG GAC 2495
2496 AAT GGC ATA CTC TCT TCC AGC CCT AGG AGG AGG GCT CCT AAC AGT GTA 2543
2544 ACT TAT TGT GTC CCC GCG TAT TTA TTT GTT GTA AAT ATT TGA GTA TTT 2591
2592 TTA TAT TGA CAA ATA AAA TGG AGA AAA TGA AAC GAA AAA AAA AAA AAA 2639
2640 AAA A 2643
10.FP2860
A: nucleotide sequence (SEQ ID NO:28) length: 1848 bases
1 GTTGATAAAT TTCACGTAAT TTACTCAACC GTTTCTCTAC TGTTGGATGT TTAGGTTGTT
61 TCCAGTTTTT TGATATAATC ACAACCCATG TTTGTTGAGC ATGGACTGTG TGCTCAGCAC
121 TGTCATGGAC ATTTCCCTTG CAGTAATTTG TTTACCCCTA AAAACATCCT GGTGGGAAAA
181 GTGGTTTTAT TATCCCCATT TTACAGATGA AAAAACTGAG GCCCAAAGTT AGAGCAGGAT
241 TTAAACCCAT GAAGTTTGAC CCAGAGCCTG TTTTGTAATC ATGCCTCTGC TATTTGTCAC
301 ATCATCATGC CTGCCCATAT TCTGGATGGT TTCCTTAGGA TAGAATCCCG GAAGTGAAAA
361 TGGGTCAGTG CAGCCATAGT GTACACCTAG GAGGGAGTGG CTGGGGTCAG GAGAGGGCGT
421 TGGCTGAGAG GGTTAAGGAA TGGAGGGGCC AAGGTAATGG GAGGATCATC CACATGGAAG
481 TTGAAGTCAC CAAGAACTGA ACCAGGAGTG ATTTTGGAGA AAGTGCTGGT GAGCCAGGAG
541 CTAAAAACCC TTACAGATGA CAGGGTGCAG CCTGGGAGCC TTCCAGAAGT AGAGGTTGGT
601 ACAGAAGCAT GATGTAAGGG GAAGAACAGT CCAGGGAACA GCAGTGAGGA GCAAGGAGGA
661 CACATACCAG CCTCCAGGTC CATGGTCCAT GGGATACGGG AGGGGAAATC GCCACCCATG
721 GGGCCATGGG AAGTTTCAGA GATTAGAGAG GGGTGGGAGA TGAGTCAGAC CAGGCAAGCG
781 CCCAGCTGTG AGGTCTGGAG ACCGGGCTCA CACAGGAGCC TGCACTTGTG GAGGTTACCA
841 AGGCACAGAC AAAGGACGTC GTGATTTAGC TGCTAATGGG CCAAGACAGA TAGTGATGGT
901 GAGGCATACG TTCTGGGGAC AGGATGGGTG GAGCCTGCGC GGATGTGGTG CCATCTTCAC
961 TTCTGCTGGT GGGACCGATG AAACTGTGAA ATAAAAAGTC CAGGGACCGG GGCAGGGCCC
1021 TGGGGTTCCC GGAGATGACT TCCAAGGGGA AGTTCATGGA GAAGTCAGGG AAGGAGTGGG
1081 GAGAGCCTGT TCGGTGGTGA CAGAGGTATA GAGGCCATGG GCTCTCTTGG ACACCATTTG
1141 ATTCTCAGGT CACACTGAAG TACAGTAAGC ATTAAGTAGA GTAAACATCA CTATTGTTGC
1201 TTGGGAAAGA ATTGCAGTTT CCATCCAGAT AATTTGTTTC ATGAGGAATT TTGTACTAGA
1261 AAAAATTACT TTGTATTATT AAAAAATAGT AAAACAAACA AAAATGGTGC AAAGGGCTTA
1321 GTACTTGTCC CACGACCGGG GTCTGACTTT CCCCCTCACG ATCCTGTGGT GCCCACAGCG
1381 AACCCATCCT ACGTGCCTCC ACCCCAGTGC CAGCCAGGCG AGTTTGCCTG TGCCAACAGC
1441 CGCTGCATCC AGGAGCGCTG GAAGTGTGAC GGAGACAACG ATTGCCTGGA CAACAGTGAT
1501 GAGGCCCCAG CCCTCTGCCA TCAGCACACC TGCCCCTCGG ACCGATTCAA GTGCGAGAAC
1561 AACCGGTGCA TCCCCAACCG CTGGCTCTGC GACGGGGACA ATGACTGTGG GAACAGTGAA
1621 GATGAGTCCA ATGCCACTTG TTCAGGTGTG GAGCGGGGCT CAGATCACAC GAGGCACCCC
1681 TCAGTCAAGG AGGCAGGGGA GACGAGGGGG CCAGGTTCAG TGGCTCATGC CTATAATCTC
1741 AGCACTTTGG GAGGCCAAGG TGAGAGGATC ACTTGAGCCC AGGAGCTCAA GACCAGCCTG
1801 GCCAACATAG TGAGACCTCC CACTCTTTAC AAAAAAAAAA AAAAAAAA
B: nucleotide sequence (SEQ ID NO:29) length: 103 amino acid
1 MRPQPSAIST PAPRTDSSAR TTGASPTAGS ATGTMTVGTV KMSPMPLVQV WSGAQITRGT
61 PQSRRQGRRG GQVQWLMPII SALWEAKVRG SLEPRSSRPA WPT
C. Nucleotide and amino acid composite sequence (SEQ ID NO:30) clone number: FP2860
Start code: 1499ATG stops coding: 1808TAG protein molecular weight: 10954.91
1 G TTG ATA AAT TTC ACG TAA TTT ACT CAA CCG TTT CTC TAC TGT TGG 46
47 ATG TTT AGG TTG TTT CCA GTT TTT TGA TAT AAT CAC AAC CCA TGT TTG 94
95 TTG AGC ATG GAC TGT GTG CTC AGC ACT GTC ATG GAC ATT TCC CTT GCA 142
143 GTA ATT TGT TTA CCC CTA AAA ACA TCC TGG TGG GAA AAG TGG TTT TAT 190
191 TAT CCC CAT TTT ACA GAT GAA AAA ACT GAG GCC CAA AGT TAG AGC AGG 238
239 ATT TAA ACC CAT GAA GTT TGA CCC AGA GCC TGT TTT GTA ATC ATG CCT 286
287 CTG CTA TTT GTC ACA TCA TCA TGC CTG CCC ATA TTC TGG ATG GTT TCC 334
335 TTA GGA TAG AAT CCC GGA AGT GAA AAT GGG TCA GTG CAG CCA TAG TGT 382
383 ACA CCT AGG AGG GAG TGG CTG GGG TCA GGA GAG GGC GTT GGC TGA GAG 430
431 GGT TAA GGA ATG GAG GGG CCA AGG TAA TGG GAG GAT CAT CCA CAT GGA 478
479 AGT TGA AGT CAC CAA GAA CTG AAC CAG GAG TGA TTT TGG AGA AAG TGC 526
527 TGG TGA GCC AGG AGC TAA AAA CCC TTA CAG ATG ACA GGG TGC AGC CTG 574
575 GGA GCC TTC CAG AAG TAG AGG TTG GTA CAG AAG CAT GAT GTA AGG GGA 622
623 AGA ACA GTC CAG GGA ACA GCA GTG AGG AGC AAG GAG GAC ACA TAC CAG 670
671 CCT CCA GGT CCA TGG TCC ATG GGA TAC GGG AGG GGA AAT CGC CAC CCA 718
719 TGG GGC CAT GGG AAG TTT CAG AGA TTA GAG AGG GGT GGG AGA TGA GTC 766
767 AGA CCA GGC AAG CGC CCA GCT GTG AGG TCT GGA GAC CGG GCT CAC ACA 814
815 GGA GCC TGC ACT TGT GGA GGT TAC CAA GGC ACA GAC AAA GGA CGT CGT 862
863 GAT TTA GCT GCT AAT GGG CCA AGA CAG ATA GTG ATG GTG AGG CAT ACG 910
911 TTC TGG GGA CAG GAT GGG TGG AGC CTG CGC GGA TGT GGT GCC ATC TTC 958
959 ACT TCT GCT GGT GGG ACC GAT GAA ACT GTG AAA TAA AAA GTC CAG GGA 1006
1007 CCG GGG CAG GGC CCT GGG GTT CCC GGA GAT GAC TTC CAA GGG GAA GTT 1054
1055 CAT GGA GAA GTC AGG GAA GGA GTG GGG AGA GCC TGT TCG GTG GTG ACA 1102
1103 GAG GTA TAG AGG CCA TGG GCT CTC TTG GAC ACC ATT TGA TTC TCA GGT 1150
1151 CAC ACT GAA GTA CAG TAA GCA TTA AGT AGA GTA AAC ATC ACT ATT GTT 1198
1199 GCT TGG GAA AGA ATT GCA GTT TCC ATC CAG ATA ATT TGT TTC ATG AGG 1246
1247 AAT TTT GTA CTA GAA AAA ATT ACT TTG TAT TAT TAA AAA ATA GTA AAA 1294
1295 CAA ACA AAA ATG GTG CAA AGG GCT TAG TAC TTG TCC CAC GAC CGG GGT 1342
1343 CTG ACT TTC CCC CTC ACG ATC CTG TGG TGC CCA CAG CGA ACC CAT CCT 1390
1391 ACG TGC CTC CAC CCC AGT GCC AGC CAG GCG AGT TTG CCT GTG CCA ACA 1438
1439 GCC GCT GCA TCC AGG AGC GCT GGA AGT GTG ACG GAG ACA ACG ATT GCC 1486
1487 TGG ACA ACA GTG ATG AGG CCC CAG CCC TCT GCC ATC AGC ACA CCT GCC 1534
1 Met Arg Pro Gln Pro Ser Ala Ile Ser Thr Pro Ala 12
1535 CCT CGG ACC GAT TCA AGT GCG AGA ACA ACC GGT GCA TCC CCA ACC GCT 1582
13 Pro Arg Thr Asp Ser Ser Ala Arg Thr Thr Gly Ala Ser Pro Thr Ala 28
1583 GGC TCT GCG ACG GGG ACA ATG ACT GTG GGA ACA GTG AAG ATG AGT CCA 1630
29 Gly Ser Ala Thr Gly Thr Met Thr Val Gly Thr Val Lys Met Ser Pro 44
1631 ATG CCA CTT GTT CAG GTG TGG AGC GGG GCT CAG ATC ACA CGA GGC ACC 1678
45 Met Pro Leu Val Gln Val Trp Ser Gly Ala Gln Ile Thr Arg Gly Thr 60
1679 CCT CAG TCA AGG AGG CAG GGG AGA CGA GGG GGC CAG GTT CAG TGG CTC 1726
61 Pro Gln Ser Arg Arg Gln Gly Arg Arg Gly Gly Gln Val Gln Trp Leu 76
1727 ATG CCT ATA ATC TCA GCA CTT TGG GAG GCC AAG GTG AGA GGA TCA CTT 1774
77 Met Pro Ile Ile Ser Ala Leu Trp Glu Ala Lys Val Arg Gly Ser Leu 92
1775 GAG CCC AGG AGC TCA AGA CCA GCC TGG CCA ACA TAG TGA GAC CTC CCA 1822
93 Glu Pro Arg Ser Ser Arg Pro Ala Trp Pro Thr *** 104
1823 CTC TTT ACA AAA AAA AAA AAA AAA AA 1848
11.FP2926
A: nucleotide sequence (SEQ ID NO:31) length: 1921 bases
1 GCACGGTGAA ACCCCGTCTC TACTAAAAAT ACAAAAAATT AGCCGGGCAC GGTGGCGGGC
61 ACCTGTAGTC CCAGCTACTC GGGAGGCTGA GGCAGGAGAA TGGCATGAAC CTGGGAGGCG
121 GAGCTTGCAG TGAGCCGAGA TTGCGCCACT GCACTCCAGC CTGGGCCACA GAGCGAGACT
181 CCATCTCAAA AAAAAAAAAA GAACCCTGGG GTTTGGGCAG AGAGAGTTGG AGCTGATGTG
241 GCGCTGAGGG GGCTGCTCCC TCCCATCTGA GTCTCCCATC TCTGCCTGCA CTCTTCTGGC
301 TGGCACTGTG CCAGCCTGCT AACCTCCCTG GGCCTCAGTT TCCTCCTCTG TCAAATGAGA
361 GAGGATCTTC TCTGGGTGTA GAAAAGGACG AGGTGGTGAG TGGGTCTGAA GGCCTCTGGT
421 GTCCCATAAA GCGACTCTCC TCACCATCTT TGCCACCCAT TGGGGTGTCC AGCACCCATG
481 GAACTCTGTC TGTGCCTCTG TCCTGGAGGG AGACTTGACC TCCTGCTCAG GAAAGGCTCT
541 CCAAGCCCTT GTTGTGAAAT TCCTGCCTGC TGTCCGGAAC TCAGTCTTCC CATCCGAGGG
601 ACGAAGGTTT CGGGAAGAGA GGTGGACAGG AAGGGGTCCT CATCAGCGGT CCCACCCTCC
661 TCTCCTTCCT TCGCCCTCTC CAGGCCAGAA ATCAGCGAGG AGCTCAAGGA CCTGATCCTG
721 AAGATGTTAG ACAAGAATCC CGAGACGAGA ATTGGGGTGC CAGACATCAA GGTCGGGGAA
781 CTGGGGGTCT TGGGCTGGGC TGGGACACAG AAAACAGGAG TCACTTTCCC TTTCTGGAGG
841 GATCAACACC AGGATGCATG TGTGTTGGGT TTGAGTCTGT GGACTTTGGA CCCCTCCAGG
901 TGATTCTGGT AATGGCCTGA CCTCTCCCCC TCTCCCTGCC CTCCCGGCCC CGACAGTTGC
961 ACCCTTGGGT GACCAAGAAC GGGGAGGAGC CCCTTCCTTC GGAGGAGGAG CACTGCAGCG
1021 TGGTGGAGGT GACAGAGGAG GAGGTTAAGA ACTCAGTCAG GCTCATCCCC AGCTGGACCA
1081 CGGTGGTAAG AGAGCCGGGG TAGATGCTCC CTTGTCCTGG AGGGCCTGGG GGACCTGAGC
1141 CTTGCTCTGT GCCTGGCTCC TTGGGGGGAC AGAGGCCTGC CTGGCCAGCC AGCTGTGATC
1201 CTGGGCCACT GGAGCCAGCC ATTCTGATGG AGGCCCATGG AGAGGGAGGT CTTGTGGTCG
1261 GGAGACCAGG AGGCTTGGTG AGGAGAGTGA CTGATTTAAA GAAATAGCGG GCGTGGGGCC
1321 GGGCGCGGTG GCTCACGCCT GTAATCCCAG CACTTTGGGA GGCCAAGGCG GGCAGATCAC
1381 GAGGTCAGGA GATCGAGACC ATCCTTGAAA CCCCGTCTCT ACTAAAAATA TAGAAAATTA
1441 GCCGGGCGTG GTGGCGGGCG CGTGTAGTCC CAGCTACTCG GGAGGCTGAG GCAGGAGAAT
1501 GGTGTGAACC CGGGAGGTGG AGTTTGCCGT GAGCCGAGAT CGCGCCACTG CACTCCAGCC
1561 TGGGCCACAG AGCGAGACTG CGTCTCAAAA AAAAAAAAAG AAGAAAAGAA AAGAAAGAAA
1621 TACCAGGCGC GGTGGCTCAC GCCTGGAATC CCAGCACTTT GGGAGGCCGA GGCGGGTGGA
1681 TCACGAGGTC AGGAGATCGA GACCATCCTG GCTAATACGG CGAAACCCCA CCTCTACTAA
1741 AAATACAAAA AAATTAGCCG GGCGCAGTGG TGGGCACCTG TAGTCCCAGC TACTGGGGAG
1801 GCCGAGGCAG GAGAATCGCT TGAACCTGGG AGGTGGAGGT TGTAGTGAGC CAAGATCACG
1861 CCATTGCACT CCAGCCTGGT TGACAGAACG AGACTCCATC TCAAAAAAAA AAAAAAAAAA
1921 A
B: nucleotide sequence (SEQ ID NO:32) length: 141 amino acid
1 MELCLCLCPG GRLDLLLRKG SPSPCCEIPA CCPELSLPIR GTKVSGREVD RKGSSSAVPP
61 SSPSFALSRP EISEELKDLI LKMLDKNPET RIGVPDIKVG ELGVLGWAGT QKTGVTFPFW
121 RDQHQDACVL GLSLWTLDPS R
C. Nucleotide and amino acid composite sequence (SEQ ID NO:33) clone number: FP2926
Start code: 478ATG stops coding: 901TGA protein molecular weight: 15271.93
1 GCA CGG TGA AAC CCC GTC TCT ACT AAA AAT ACA AAA AAT TAG CCG GGC 48
49 ACG GTG GCG GGC ACC TGT AGT CCC AGC TAC TCG GGA GGC TGA GGC AGG 96
97 AGA ATG GCA TGA ACC TGG GAG GCG GAG CTT GCA GTG AGC CGA GAT TGC 144
145 GCC ACT GCA CTC CAG CCT GGG CCA CAG AGC GAG ACT CCA TCT CAA AAA 192
193 AAA AAA AAG AAC CCT GGG GTT TGG GCA GAG AGA GTT GGA GCT GAT GTG 240
241 GCG CTG AGG GGG CTG CTC CCT CCC ATC TGA GTC TCC CAT CTC TGC CTG 288
289 CAC TCT TCT GGC TGG CAC TGT GCC AGC CTG CTA ACC TCC CTG GGC CTC 336
337 AGT TTC CTC CTC TGT CAA ATG AGA GAG GAT CTT CTC TGG GTG TAG AAA 384
385 AGG ACG AGG TGG TGA GTG GGT CTG AAG GCC TCT GGT GTC CCA TAA AGC 432
433 GAC TCT CCT CAC CAT CTT TGC CAC CCA TTG GGG TGT CCA GCA CCC ATG 480
1 Met 1
481 GAA CTC TGT CTG TGC CTC TGT CCT GGA GGG AGA CTT GAC CTC CTG CTC 528
2 Glu Leu Cys Leu Cys Leu Cys Pro Gly Gly Arg Leu Asp Leu Leu Leu 17
529 AGG AAA GGC TCT CCA AGC CCT TGT TGT GAA ATT CCT GCC TGC TGT CCG 576
18 Arg Lys Gly Ser Pro Ser Pro Cys Cys Glu Ile Pro Ala Cys Cys Pro 33
577 GAA CTC AGT CTT CCC ATC CGA GGG ACG AAG GTT TCG GGA AGA GAG GTG 624
34 Glu Leu Ser Leu Pro Ile Arg Gly Thr Lys Val Ser Gly Arg Glu Val 49
625 GAC AGG AAG GGG TCC TCA TCA GCG GTC CCA CCC TCC TCT CCT TCC TTC 672
50 Asp Arg Lys Gly Ser Ser Ser Ala Val Pro Pro Ser Ser Pro Ser Phe 65
673 GCC CTC TCC AGG CCA GAA ATC AGC GAG GAG CTC AAG GAC CTG ATC CTG 720
66 Ala Leu Ser Arg Pro Glu Ile Ser Glu Glu Leu Lys Asp Leu Ile Leu 81
721 AAG ATG TTA GAC AAG AAT CCC GAG ACG AGA ATT GGG GTG CCA GAC ATC 768
82 Lys Met Leu Asp Lys Asn Pro Glu Thr Arg Ile Gly Val Pro Asp Ile 97
769 AAG GTC GGG GAA CTG GGG GTC TTG GGC TGG GCT GGG ACA CAG AAA ACA 816
98 Lys Val Gly Glu Leu Gly Val Leu Gly Trp Ala Gly Thr Gln Lys Thr 113
817 GGA GTC ACT TTC CCT TTC TGG AGG GAT CAA CAC CAG GAT GCA TGT GTG 864
114 Gly Val Thr Phe Pro Phe Trp Arg Asp Gln His Gln Asp Ala Cys Val 129
865 TTG GGT TTG AGT CTG TGG ACT TTG GAC CCC TCC AGG TGA TTC TGG TAA 912
130 Leu Gly Leu Ser Leu Trp Thr Leu Asp Pro Ser Arg *** 142
913 TGG CCT GAC CTC TCC CCC TCT CCC TGC CCT CCC GGC CCC GAC AGT TGC 960
961 ACC CTT GGG TGA CCA AGA ACG GGG AGG AGC CCC TTC CTT CGG AGG AGG 1008
1009 AGC ACT GCA GCG TGG TGG AGG TGA CAG AGG AGG AGG TTA AGA ACT CAG 1056
1057 TCA GGC TCA TCC CCA GCT GGA CCA CGG TGG TAA GAG AGC CGG GGT AGA 1104
1105 TGC TCC CTT GTC CTG GAG GGC CTG GGG GAC CTG AGC CTT GCT CTG TGC 1152
1153 CTG GCT CCT TGG GGG GAC AGA GGC CTG CCT GGC CAG CCA GCT GTG ATC 1200
1201 CTG GGC CAC TGG AGC CAG CCA TTC TGA TGG AGG CCC ATG GAG AGG GAG 1248
1249 GTC TTG TGG TCG GGA GAC CAG GAG GCT TGG TGA GGA GAG TGA CTG ATT 1296
1297 TAA AGA AAT AGC GGG CGT GGG GCC GGG CGC GGT GGC TCA CGC CTG TAA 1344
1345 TCC CAG CAC TTT GGG AGG CCA AGG CGG GCA GAT CAC GAG GTC AGG AGA 1392
1393 TCG AGA CCA TCC TTG AAA CCC CGT CTC TAC TAA AAA TAT AGA AAA TTA 1440
1441 GCC GGG CGT GGT GGC GGG CGC GTG TAG TCC CAG CTA CTC GGG AGG CTG 1488
1489 AGG CAG GAG AAT GGT GTG AAC CCG GGA GGT GGA GTT TGC CGT GAG CCG 1536
1537 AGA TCG CGC CAC TGC ACT CCA GCC TGG GCC ACA GAG CGA GAC TGC GTC 1584
1585 TCA AAA AAA AAA AAA GAA GAA AAG AAA AGA AAG AAA TAC CAG GCG CGG 1632
1633 TGG CTC ACG CCT GGA ATC CCA GCA CTT TGG GAG GCC GAG GCG GGT GGA 1680
1681 TCA CGA GGT CAG GAG ATC GAG ACC ATC CTG GCT AAT ACG GCG AAA CCC 1728
1729 CAC CTC TAC TAA AAA TAC AAA AAA ATT AGC CGG GCG CAG TGG TGG GCA 1776
1777 CCT GTA GTC CCA GCT ACT GGG GAG GCC GAG GCA GGA GAA TCG CTT GAA 1824
1825 CCT GGG AGG TGG AGG TTG TAG TGA GCC AAG ATC ACG CCA TTG CAC TCC 1872
1873 AGC CTG GTT GAC AGA ACG AGA CTC CAT CTC AAA AAA AAA AAA AAA AAA 1920
1921 A 1921
12.FP3235
A: nucleotide sequence (SEQ ID NO:34) length: 1529 bases
1 GCGGCGAGGC TACCCAGGCT TCCCTGGAGT CGGCCCCACG GATCATGCGG CTGGTGGCCG
61 AATGCAGCCG CTCCAGGGCC CGGGCAGGCG AGCTGTGGCT GCCGCATGGG ACAGTGGCCA
121 CTCCTGTGTT CATGCCAGTG GGCACGCAGG CCACCATGAA GGGCATCACG ACCGAACAGC
181 TGGACGCTCT GGGGACCCGA GCTGATCCAG AAAGCCAACG GTCTCCACGG CTTCATGAAT
241 TGGCCTCATA ATCTGCTAAC GGACAGCGGC GGTTTCCAGA TGGTGTCGCT GGTGTCTCTG
301 TCCGAGGTGA CGGAGGAGGG CGTCCGCTTC CGCTCCCCCT ACGACGGCAA TGAGACCCTG
361 CTGAGCCCGG AGAAATCCGT GCAGATCCAG AATGCGCTGG GCTCGGACAT CATCATGCAG
421 CTGGACGACG TGGTTAGCAG TACTGTGACT GGGCCACGTG TGGAGGAGGC CATGTACAGG
481 TCAATCCGCT GGCTGGACCG GTGCATTGCA GCCCATCAGC GGCCGGACAA GCAGAACCTC
541 TTCGCCATTA TCCAGGGTGG GCTGGACGCA GATCTCCGGG CCACCTGCCT TGAAGAGATG
601 ACCAAGCGAG ACGTGCCTGG CTTCGCCATC GGGGGCCTGA GCGGGGGTGA GAGCAAGTCG
661 CAGTTCTGGC GGATGGTGGC GCTGAGCACC TCTCGGCTGC CGAAGGACAA GCCCCGATAT
721 CTGATGGGGG TTGGGTATGT TGTGGATAGG GAAGCCAGAG CCCTACCTGT GGGAAGTGGA
781 TTCCTGGGGA CCCCCTACCC TGCTTGGGGA GGTGGCATTT GGGGGAAACG GACACAGGTC
841 TGATCTGAGG AGACTAGGAA GACATGGCTG TCCCTTGGGG GCCATTCTGA GGGAATATGG
901 CCCAGTCTGG GGCAGTGTGA GGGTTGGAAG GGGCCCTGGG AAGCCCCTGA GGTTCTCTGC
961 CCCCTCCCGT CATGGCTGCA ACCCCAGCTA TGCCACTGAT CTGGTAGTCT GCGTGGCTCT
1021 TGGATGTGAC ATGTTCGACT GCGTCTTCCC CACACGGACA GCGCGCTTTG GCTCTGCCCT
1081 GGTGCCCACT GGGAACCTGC AGTTGAGGAA GAAGGTGTTT GAGAAGGACT TCGGCCCCAT
1141 AGACCCGGAG TGCACCTGCC CCACGTGCCA AAAGCACAGC CGCGCCTTCC TGCACGCACT
1201 GCTGCACAGT GACAACACGG CCGCGCTGCA CCACCTCACG GTCCACAACA TCGCCTACCA
1261 GCTGCAGCTC ATGAGCGCCG TCCGCACCAG CATCGTGGAG AAGCGCTTCC CGGACTTCGT
1321 GCGGGACTTC ATGGGCGCCA TGTACGGGGA TCCCACCCTC TGTCCCACCT GGGCCACTGA
1381 CGCTCTGGCC TCTGTGGGAA TCACACTGGG CTGACCTGGC ATTGGGAGAG GGAGGGAGGA
1441 AGGAAGGGAG GGAGGGGCTG GAAGATACTG AAGGATTCCT TTTTGAAAGG TTTTTTTTAT
1501 TGTAACTTAC AAAAAAAAAA AAAAAAAAA
B: nucleotide sequence (SEQ ID NO:35) length: 225 amino acid
1 MLWIGKPEPY LWEVDSWGPP TLLGEVAFGG NGHRSDLRRL GRHGCPLGAI LREYGPVWGS
61 VRVGRGPGKP LRFSAPSRHG CNPSYATDLV VCVALGCDMF DCVFPTRTAR FGSALVPTGN
121 LQLRKKVFEK DFGPIDPECT CPTCQKHSRA FLHALLHSDN TAALHHLTVH NIAYQLQLMS
181 AVRTSIVEKR FPDFVRDFMG AMYGDPTLCP TWATDALASV GITLG
C. Nucleotide and amino acid composite sequence (SEQ ID NO:36) clone number: FP3235
Start code: 737ATG stops coding: 1412TGA protein molecular weight: 24675.17
1 G CGG CGA GGC TAC CCA GGC TTC CCT GGA GTC GGC CCC ACG GAT CAT 46
47 GCG GCT GGT GGC CGA ATG CAG CCG CTC CAG GGC CCG GGC AGG CGA GCT 94
95 GTG GCT GCC GCA TGG GAC AGT GGC CAC TCC TGT GTT CAT GCC AGT GGG 142
143 CAC GCA GGC CAC CAT GAA GGG CAT CAC GAC CGA ACA GCT GGA CGC TCT 190
191 GGG GAC CCG AGC TGA TCC AGA AAG CCA ACG GTC TCC ACG GCT TCA TGA 238
239 ATT GGC CTC ATA ATC TGC TAA CGG ACA GCG GCG GTT TCC AGA TGG TGT 286
287 CGC TGG TGT CTC TGT CCG AGG TGA CGG AGG AGG GCG TCC GCT TCC GCT 334
335 CCC CCT ACG ACG GCA ATG AGA CCC TGC TGA GCC CGG AGA AAT CCG TGC 382
383 AGA TCC AGA ATG CGC TGG GCT CGG ACA TCA TCA TGC AGC TGG ACG ACG 430
431 TGG TTA GCA GTA CTG TGA CTG GGC CAC GTG TGG AGG AGG CCA TGT ACA 478
479 GGT CAA TCC GCT GGC TGG ACC GGT GCA TTG CAG CCC ATC AGC GGC CGG 526
527 ACA AGC AGA ACC TCT TCG CCA TTA TCC AGG GTG GGC TGG ACG CAG ATC 574
575 TCC GGG CCA CCT GCC TTG AAG AGA TGA CCA AGC GAG ACG TGC CTG GCT 622
623 TCG CCA TCG GGG GCC TGA GCG GGG GTG AGA GCA AGT CGC AGT TCT GGC 670
671 GGA TGG TGG CGC TGA GCA CCT CTC GGC TGC CGA AGG ACA AGC CCC GAT 718
719 ATC TGA TGG GGG TTG GGT ATG TTG TGG ATA GGG AAG CCA GAG CCC TAC 766
1 Met Leu Trp Ile Gly Lys Pro Glu Pro Tyr 10
767 CTG TGG GAA GTG GAT TCC TGG GGA CCC CCT ACC CTG CTT GGG GAG GTG 814
11 Leu Trp Glu Val Asp Ser Trp Gly Pro Pro Thr Leu Leu Gly Glu Val 26
815 GCA TTT GGG GGA AAC GGA CAC AGG TCT GAT CTG AGG AGA CTA GGA AGA 862
27 Ala Phe Gly Gly Asn Gly His Arg Ser Asp Leu Arg Arg Leu Gly Arg 42
863 CAT GGC TGT CCC TTG GGG GCC ATT CTG AGG GAA TAT GGC CCA GTC TGG 910
43 His Gly Cys Pro Leu Gly Ala Ile Leu Arg Glu Tyr Gly Pro Val Trp 58
911 GGC AGT GTG AGG GTT GGA AGG GGC CCT GGG AAG CCC CTG AGG TTC TCT 958
59 Gly Ser Val Arg Val Gly Arg Gly Pro Gly Lys Pro Leu Arg Phe Ser 74
959 GCC CCC TCC CGT CAT GGC TGC AAC CCC AGC TAT GCC ACT GAT CTG GTA 1006
75 Ala Pro Ser Arg His Gly Cys Asn Pro Ser Tyr Ala Thr Asp Leu Val 90
1007 GTC TGC GTG GCT CTT GGA TGT GAC ATG TTC GAC TGC GTC TTC CCC ACA 1054
91 Val Cys Val Ala Leu Gly Cys Asp Met Phe Asp Cys Val Phe Pro Thr 106
1055 CGG ACA GCG CGC TTT GGC TCT GCC CTG GTG CCC ACT GGG AAC CTG CAG 1102
107 Arg Thr Ala Arg Phe Gly Ser Ala Leu Val Pro Thr Gly Asn Leu Gln 122
1103 TTG AGG AAG AAG GTG TTT GAG AAG GAC TTC GGC CCC ATA GAC CCG GAG 1150
123 Leu Arg Lys Lys Val Phe Glu Lys Asp Phe Gly Pro Ile Asp Pro Glu 138
1151 TGC ACC TGC CCC ACG TGC CAA AAG CAC AGC CGC GCC TTC CTG CAC GCA 1198
139 Cys Thr Cys Pro Thr Cys Gln Lys His Ser Arg Ala Phe Leu His Ala 154
1199 CTG CTG CAC AGT GAC AAC ACG GCC GCG CTG CAC CAC CTC ACG GTC CAC 1246
155 Leu Leu His Ser Asp Asn Thr Ala Ala Leu His His Leu Thr Val His 170
1247 AAC ATC GCC TAC CAG CTG CAG CTC ATG AGC GCC GTC CGC ACC AGC ATC 1294
171 Asn Ile Ala Tyr Gln Leu Gln Leu Met Ser Ala Val Arg Thr Ser Ile 186
1295 GTG GAG AAG CGC TTC CCG GAC TTC GTG CGG GAC TTC ATG GGC GCC ATG 1342
187 Val Glu Lys Arg Phe Pro Asp Phe Val Arg Asp Phe Met Gly Ala Met 202
1343 TAC GGG GAT CCC ACC CTC TGT CCC ACC TGG GCC ACT GAC GCT CTG GCC 1390
203 Tyr Gly Asp Pro Thr Leu Cys Pro Thr Trp Ala Thr Asp Ala Leu Ala 218
1391 TCT GTG GGA ATC ACA CTG GGC TGA CCT GGC ATT GGG AGA GGG AGG GAG 1438
219 Ser Val Gly Ile Thr Leu Gly *** 226
1439 GAA GGA AGG GAG GGA GGG GCT GGA AGA TAC TGA AGG ATT CCT TTT TGA 1486
1487 AAG GTT TTT TTT ATT GTA ACT TAC AAA AAA AAA AAA AAA AAA A 1529

Claims (4)

1. isolating polynucleotide is characterized in that, it is selected from down group:
(a) polynucleotide of the following polypeptide of coding, described polypeptide has the aminoacid sequence of the group of being selected from down: SEQ ID NO:2,5,8,11,14,17,20,23,26,29,32,35;
(b) with polynucleotide (a) complementary polynucleotide.
2. polynucleotide as claimed in claim 1 is characterized in that, the polypeptide of this polynucleotide encoding has the aminoacid sequence of the group of being selected from down: SEQ ID NO:2,5,8,11,14,17,20,23,26,29,32,35.
3. polynucleotide as claimed in claim 1 is characterized in that, the sequence of these polynucleotide is selected from down group:
SEQ ID NO:3,6,9,12,15,18,21,24,27,30,33,36 coding region sequence or full length sequence.
4. a carrier is characterized in that, it contains the described polynucleotide of claim 1.
5. a genetically engineered host cell is characterized in that, it is a kind of host cell that is selected from down group:
(a) host cell that transforms or transduce with the described carrier of claim 4;
(b) host cell that transforms or transduce with the described polynucleotide of claim 1.
CNB011267267A 2001-09-12 2001-09-12 Human protein with function of suppressing cancer cell growth and its coding sequence Expired - Fee Related CN1177048C (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CNB011267267A CN1177048C (en) 2001-09-12 2001-09-12 Human protein with function of suppressing cancer cell growth and its coding sequence

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CNB011267267A CN1177048C (en) 2001-09-12 2001-09-12 Human protein with function of suppressing cancer cell growth and its coding sequence

Publications (2)

Publication Number Publication Date
CN1403478A CN1403478A (en) 2003-03-19
CN1177048C true CN1177048C (en) 2004-11-24

Family

ID=4666733

Family Applications (1)

Application Number Title Priority Date Filing Date
CNB011267267A Expired - Fee Related CN1177048C (en) 2001-09-12 2001-09-12 Human protein with function of suppressing cancer cell growth and its coding sequence

Country Status (1)

Country Link
CN (1) CN1177048C (en)

Families Citing this family (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104878103B (en) * 2015-06-01 2018-08-24 北京泱深生物信息技术有限公司 The target of FAM57B genes and its expression product as cholangiocarcinoma diagnosis and treatment
US20210348152A1 (en) * 2018-11-29 2021-11-11 Technion Research & Development Foundation Limited A peptide, a complex and a method for treating cancer

Also Published As

Publication number Publication date
CN1403478A (en) 2003-03-19

Similar Documents

Publication Publication Date Title
CN1170850C (en) Human angiogenin-like protein and coding sequence and application thereof
CN1177048C (en) Human protein with function of suppressing cancer cell growth and its coding sequence
CN1169954C (en) Human protein able to suppress growth of cancer cells and its coding sequence
CN1199998C (en) Human protein with suppression to cancer cell growth and its coding sequence
CN1177864C (en) Novel human protein with expression difference in liver cancer tissue and its code sequence
CN1209373C (en) Human protein with suppression to cancer cell growth and its coding sequence
CN1177049C (en) Human protein with function of suppressing cancer cell growth and its coding sequence
CN1155615C (en) Human protein with cancer cell growth suppressing function and its coding sequence
CN1199997C (en) New human protein having mouse NIH/3T3 cell conversion promoting function and its code sequence
CN1194989C (en) Novel human protein able to suppress cancer cell growth and its coding sequence
CN1160370C (en) A novel human cell cysle control related protein and a sequence encoding the same
CN1177050C (en) Human protein with function of suppressing cancer cell growth and its coding sequence
CN1155616C (en) Human protein with cancer cell growth promoting function and its coding sequence
CN1169958C (en) Human protein able to suppress growth of cancer cells and its coding sequence
CN1169831C (en) Human protein with cancer call growth suppressing function and its coding sequence
CN1169955C (en) Human protein able to suppress growth of cancer cells and its coding sequence
CN1199994C (en) New human protein with cancer cell growth inhibiting function and its coding sequence
CN1230445C (en) Novel human protein with function for promoting mouse NIH/313 cell transformation and coding sequence thereof
CN1190446C (en) New human protein with function of improving mouse NIH/3T3 cell transformation and its encoding sequence
CN1194010C (en) New human protein with the function of inhibiting cancer cell growth and its coding sequence
CN1222616C (en) Novel human protein with cancer-inhibiting function and coding sequence thereof
CN1231496C (en) Human protein with cancer cell growth suppressing function and its coding sequence
CN1169833C (en) Human Protein with cancer inhibiting function and its coding sequence
CN1205225C (en) Human protein with cancer inhibiting function and its coding sequence
CN1229387C (en) Novel human protein with cancer-suppressing function and coding sequence thereof

Legal Events

Date Code Title Description
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C14 Grant of patent or utility model
GR01 Patent grant
C19 Lapse of patent right due to non-payment of the annual fee
CF01 Termination of patent right due to non-payment of annual fee