CN1209370C

CN1209370C - New human protein with cancer inhibiting function and its encoding sequence

Info

Publication number: CN1209370C
Application number: CN 02111646
Authority: CN
Inventors: 顾健人; 杨胜利
Original assignee: SHANGHAI XINSHIJIE GENE TECHN DEVELOPMENT Co Ltd
Current assignee: SHANGHAI XINSHIJIE GENE TECHN DEVELOPMENT Co Ltd
Priority date: 2002-05-13
Filing date: 2002-05-13
Publication date: 2005-07-06
Anticipated expiration: 2022-05-13
Also published as: CN1458170A

Abstract

The present invention discloses new human protein with the function of inhibiting cancer, a polynucleotide for encode the polypeptide and a method for producing the polypeptide by a recombinant technology. The present invention also discloses a method of using the polypeptide in treating various diseases, such as cancer. The present invention also discloses an antagonist of the polypeptide and therapeutic effects thereof. The present invention also discloses an application of the polynucleotide for encoding the human protein with the function of inhibiting cancer.

Description

New people's albumen and encoding sequence thereof with cancer suppressing function

Technical field

The invention belongs to biological technical field, specifically, the present invention relates to new coding and have the proteic polynucleotide of people of cancer suppressing function and the polypeptide of this polynucleotide encoding.The invention still further relates to the purposes and the preparation of these polynucleotide and polypeptide.

Background technology

The research of people's gene group is international focus at present, removes human chromosome DNA large scale sequencing, outside the method for expressed sequence order-checking (EST), also lacks the screening that begins from function and has the high-throughout method of functional gene.

Cancer is one of principal disease of harm humans health.In order to treat effectively and prophylaxis of tumours, people more and more pay close attention to genetic treatment of tumor at present.Therefore, this area presses for people's albumen and the agonist/inhibitor thereof that development research has cancer suppressing function.

Summary of the invention

The purpose of this invention is to provide the new people's protein polypeptide of a class with cancer suppressing function with and fragment, analogue and derivative.

Another object of the present invention provides the polynucleotide of these polypeptide of coding.

Another object of the present invention provides the method for these polypeptide of production and the purposes of this polypeptide and encoding sequence.

In a first aspect of the present invention, novel isolated protein polypeptide with cancer suppressing function is provided, it comprises the polypeptide of the aminoacid sequence with the group of being selected from down: SEQ ID NO:3,6,9; Or its conservative property variation polypeptide or its active fragments or its reactive derivative.

Preferably, this polypeptide is the polypeptide with aminoacid sequence of the group of being selected from down: SEQ ID NO:3,6,9.

In a second aspect of the present invention, a kind of isolating polynucleotide are provided, it comprises a nucleotide sequence, and this nucleotide sequence is shown at least 85% homogeny with a kind of nucleotides sequence that is selected from down group: the polynucleotide of the above-mentioned protein polypeptide with cancer suppressing function of (a) encoding; (b) with polynucleotide (a) complementary polynucleotide.Preferably, the polypeptide of this polynucleotide encoding has the aminoacid sequence of the group of being selected from down: SEQ ID NO:3,6,9.More preferably, the sequence of these polynucleotide is selected from down group: SEQ ID NO:2,5,8 coding region sequence or full length sequence.

In a third aspect of the present invention, the carrier that contains above-mentioned polynucleotide is provided, and has been transformed or host cell of transduceing or the host cell that is directly transformed or transduce by above-mentioned polynucleotide by this carrier.

In a fourth aspect of the present invention, the preparation method who prepares the polypeptide of the protein-active with cancer suppressing function is provided, this method comprises: (a) have under the proteic condition of cancer suppressing function suitable the expression, cultivate the above-mentioned host cell that is transformed or transduce; (b) from culture, isolate the polypeptide of protein-active with cancer suppressing function.

In a fifth aspect of the present invention, provide and above-mentioned protein polypeptide specificity bonded antibody with cancer suppressing function.The nucleic acid molecule that can be used for detecting also is provided, and it contains, and continuous 10 Nucleotide are to full length nucleotide in the above-mentioned polynucleotide, and preferably it contains the about 15-1000 of a successive Nucleotide.

In a sixth aspect of the present invention, a kind of pharmaceutical composition is provided, it contains the protein polypeptide and the pharmaceutically acceptable carrier with cancer suppressing function of the present invention of safe and effective amount.These pharmaceutical compositions can be treated illnesss such as cancer and cellular abnormality propagation.

Others of the present invention are because the disclosure of this paper is conspicuous to those skilled in the art.

Embodiment

The 3T3 cell is a kind of l cell (J.Cell.Biol., 17:299,1963) (being also referred to as the NIH/3T3 cell).In the cancer research field, often foreign gene (especially people's gene) is introduced the 3T3 cell, observe its situation that influences to the growth of 3T3 cell.Now generally believing, is cancer related gene to the 3T3 cell influential gene of growing, wherein to the 3T3 cell growth inhibiting gene is arranged is cancer suppressor gene mostly, and to the 3T3 cell growth (former) oncogene that has the gene of promoter action to be mostly.

The present invention adopts large-scale cDNA clone transfection mouse embryo fibroblasts, has on the basis of cancer suppressing action in acquisition, proves new gene through order-checking, further obtains full length cDNA clone.DNA transfection evidence, the albumen with cancer suppressing function of the present invention has the effect that suppresses clone's formation, its inhibiting rate 〉=50% to the 3T3 cell.

As used herein, " isolating " is meant that material separates (if natural substance, primal environment promptly is a natural surroundings) from its primal environment.Do not have separation and purification as polynucleotide under the native state in the active somatic cell and polypeptide, but same polynucleotide or polypeptide as from native state with in other materials that exist separately, then for separation and purification.

As used herein, " isolating albumen or polypeptide with cancer suppressing function " is meant that the protein polypeptide with cancer suppressing function is substantially free of natural relative other albumen, lipid, carbohydrate or other material.Those skilled in the art can have the albumen of cancer suppressing function with the purified technology of protein purifying of standard.Basically pure polypeptide can produce single master tape on non-reduced polyacrylamide gel.

Polypeptide of the present invention can be recombinant polypeptide, natural polypeptides, synthetic polypeptide, preferred recombinant polypeptide.Polypeptide of the present invention can be the product of natural purifying, or the product of chemosynthesis, or uses recombinant technology to produce from protokaryon or eucaryon host (for example, bacterium, yeast, higher plant, insect and mammalian cell).The host used according to the recombinant production scheme, polypeptide of the present invention can be glycosylated, maybe can be nonglycosylated.Polypeptide of the present invention also can comprise or not comprise initial methionine residues.

The present invention also comprises the proteic fragment of the people with cancer suppressing function, derivative and analogue.As used herein, term " fragment ", " derivative " are meant with " analogue " and keep natural identical biological function or the active polypeptide of people's albumen with cancer suppressing function of the present invention basically.Polypeptide fragment of the present invention, derivative or analogue can be that (i) has one or more conservative or substituted polypeptide of non-conservation amino-acid residue (preferred conservative amino acid residue), and the amino-acid residue of such replacement can be also can not encoded by genetic code, or (ii) in one or more amino-acid residues, has a polypeptide of substituted radical, or (iii) mature polypeptide and another compound (such as the compound that prolongs the polypeptide transformation period, polyoxyethylene glycol for example) merge formed polypeptide, or (iv) additional aminoacid sequence is fused to this peptide sequence and the polypeptide that forms (as leader sequence or secretion sequence or be used for the sequence or the proteinogen sequence of this polypeptide of purifying).According to the instruction of this paper, these fragments, derivative and analogue belong to the known scope of those skilled in the art.

Polynucleotide of the present invention can be dna form or rna form.Dna form comprises the DNA of cDNA, genomic dna or synthetic.DNA can be strand or double-stranded.DNA can be coding strand or noncoding strand.Be example with FP3361 albumen (in this application, its clone numbering is adopted in proteinic name), the coding region sequence of encoding mature polypeptide can be identical with the coding region sequence shown in the SEQ ID NO:2 or the varient of degeneracy.As used herein, " varient of degeneracy " is meant that for FP3361 coding has the protein of SEQ ID NO:3, but with the differentiated nucleotide sequence of coding region sequence shown in the SEQ ID NO:2.Be example with FP7072 albumen again, the coding region sequence of encoding mature polypeptide can be identical with the coding region sequence shown in the SEQ ID NO:5 or the varient of degeneracy; " varient of degeneracy " is meant that for FP7072 coding has the protein of SEQ ID NO:6, but with the differentiated nucleotide sequence of coding region sequence shown in the SEQ ID NO:5.Have the albumen of cancer suppressing function for of the present invention other, can the rest may be inferred.

The polynucleotide of encoding mature polypeptide comprise: the encoding sequence of an encoding mature polypeptide; The encoding sequence of mature polypeptide and various additional code sequence; Encoding sequence of mature polypeptide (with optional additional code sequence) and non-coding sequence.

Term " polynucleotide of coded polypeptide " can be the polynucleotide that comprise this polypeptide of encoding, and also can be the polynucleotide that also comprise additional code and/or non-coding sequence.

The invention still further relates to the varient of above-mentioned polynucleotide, its coding has the polypeptide of identical aminoacid sequence or fragment, analogue and the derivative of polypeptide with the present invention.The varient of these polynucleotide can be the allelic variant of natural generation or the varient that non-natural takes place.These nucleotide diversity bodies comprise and replace varient, deletion mutation body and insert varient.As known in the art, allelic variant is the replacement form of polynucleotide, and it may be replacement, disappearance or the insertion of one or more Nucleotide, but can be from not changing the function of its encoded polypeptides in fact.

The invention still further relates to and above-mentioned sequence hybridization and two sequences between have at least 50%, preferably at least 70%, the polynucleotide of at least 80% homogeny more preferably.The present invention be more particularly directed under stringent condition and the interfertile polynucleotide of polynucleotide of the present invention.In the present invention, " stringent condition " is meant: (1) than hybridization under low ionic strength and the comparatively high temps and wash-out, as 0.2 * SSC, and 0.1%SDS, 60 ℃; Or (2) hybridization the time is added with denaturing agent, as 50% (v/v) methane amide, 0.1% calf serum/0.1%Ficoll, 42 ℃ etc.; Or (3) only at the homogeny between the two sequences at least more than 95%, be more preferably 97% and just hybridize when above.And the polypeptide of interfertile polynucleotide encoding has identical biological function (is example with FP3361 albumen) and activity with the mature polypeptide shown in the SEQ IDNO:3.

The invention still further relates to nucleic acid fragment with above-mentioned sequence hybridization.As used herein, the length of " nucleic acid fragment " contains 15 Nucleotide at least, better is at least 30 Nucleotide, is more preferably at least 50 Nucleotide, preferably more than at least 100 Nucleotide.The amplification technique (as PCR) that nucleic acid fragment can be used for nucleic acid has the proteic polynucleotide of cancer suppressing function to determine and/or to separate to encode.

Polypeptide among the present invention and polynucleotide preferably provide with isolating form, more preferably are purified to homogeneous.

Dna sequence dna of the present invention can obtain with several method.For example, with hybridization technique DNA isolation well known in the art.These technology including, but not limited to: 1) with probe and genome or the hybridization of cDNA library to detect homology nucleotide sequence and 2) antibody screening of expression library to be to detect the dna fragmentation of the clone with common structure feature.

The proteic specific DNA fragment sequence that coding has cancer suppressing function produces also and can obtain with following method: 1) separate double chain DNA sequence from genomic dna; 2) the chemical synthesising DNA sequence is to obtain the double-stranded DNA of required polypeptide.

When the whole aminoacid sequence of the polypeptide product of needs was known, the direct chemical of dna sequence dna is synthetic to be the method for often selecting for use.When if required amino acid whose whole sequence is not known, the direct chemical of dna sequence dna is synthetic to be impossible, and the method for selecting for use is the separation of cDNA sequence.The standard method that separates interested cDNA is from the donorcells separating mRNA of this gene of high expression level and carries out reverse transcription, forms plasmid or phage cDNA library.Extract the existing multiple proven technique of method of mRNA, test kit also can obtain (Qiagene) from commercial channels.And the construction cDNA library also is usual method (Sambrook, et al., Molecular Cloning, A Laboratory Manual, Cold SpringHarbor Laboratory.New York, 1989).Also can obtain the cDNA library of commercial offers, as the different cDNA library of Clontech company.When being used in combination the polymeric enzyme reaction technology, even few expression product also can be cloned.

Available ordinary method is screened gene of the present invention from these cDNA libraries.These methods include, but is not limited to: (1) DNA-DNA or DNA-RNA hybridization; (2) function of marker gene occurs or forfeiture; (3) mensuration has the level of the proteic transcript of cancer suppressing function; (4), detect the protein product of genetic expression by immunological technique or mensuration biologic activity.Aforesaid method can singly be used, but also several different methods combined utilization.

In (1) kind method, hybridizing used probe is and any a part of homology of polynucleotide of the present invention that at least 15 Nucleotide of its length better are at least 30 Nucleotide, are more preferably at least 50 Nucleotide, preferably at least 100 Nucleotide.In addition, the length of probe within 2kb, preferably is within the 1kb usually.Probe used herein is the dna sequence dna of chemosynthesis on the basis of gene DNA sequence information of the present invention normally.Gene of the present invention itself or fragment are certainly as probe.The mark of dna probe can be used radio isotope, fluorescein or enzyme (as alkaline phosphatase) etc.

In (4) kind method, detect the protein product of protein gene expression and can use immunological technique such as Western blotting, radioimmunoprecipitation, enzyme-linked immunosorbent assay (ELISA) etc. with cancer suppressing function.

Use method (Saiki, the et al.Science 1985 of round pcr DNA amplification/RNA; 230:1350-1354) be optimized for acquisition gene of the present invention.When particularly being difficult to obtain the cDNA of total length from the library, can preferably use RACE method (the terminal rapid amplifying method of RACE-cDNA), the primer that is used for PCR can suitably be selected according to sequence information of the present invention disclosed herein, and available ordinary method is synthetic.Available ordinary method is as the DNA/RNA fragment by gel electrophoresis separation and purifying amplification.

The gene of the present invention that obtains as mentioned above, perhaps the available ordinary method of mensuration of the nucleotide sequence of various dna fragmentations etc. such as dideoxy chain termination (Sanger et al.PNAS, 1977,74:5463-5467).This class nucleotide sequencing is available commercial sequencing kit etc. also.In order to obtain the cDNA sequence of total length, order-checking need be carried out repeatedly.Sometimes need to measure a plurality of clones' cDNA sequence, just can be spliced into the cDNA sequence of total length.

The present invention also relates to comprise the carrier of polynucleotide of the present invention, and the host cell that produces through genetically engineered with carrier of the present invention or albumen coded sequence with cancer suppressing function, and the method that produces polypeptide of the present invention through recombinant technology.

By the recombinant DNA technology of routine, can utilize polymerized nucleoside acid sequence of the present invention to can be used to express or produce the protein polypeptide with cancer suppressing function of reorganization.In general following steps are arranged:

(1). have the proteic polynucleotide of people (or varient) of cancer suppressing function with coding of the present invention, or transform or the transduction proper host cell with the recombinant expression vector that contains these polynucleotide;

(2). the host cell of in suitable medium, cultivating;

(3). separation, protein purification from substratum or cell.

Among the present invention, the people's albumen polynucleotide sequence with cancer suppressing function can be inserted in the recombinant expression vector.Term " recombinant expression vector " refers to that bacterial plasmid well known in the art, phage, yeast plasmid, vegetable cell virus, mammalian cell virus are as adenovirus, retrovirus or other carriers.The carrier of Shi Yonging includes but not limited in the present invention: the expression vector based on T7 of expressing in bacterium; The pMSXND expression vector of in mammalian cell, expressing and at the carrier that derives from baculovirus of expressed in insect cells.In a word, as long as can duplicate in host and stablize, any plasmid and carrier can be used.A key character of expression vector is to contain replication orgin, promotor, marker gene and translation controlling elements usually.

Method well-known to those having ordinary skill in the art can be used to make up and contains people's encoding histone dna sequence dna with cancer suppressing function and suitable transcribing/the translate expression vector of control signal.These methods comprise extracorporeal recombinant DNA technology, DNA synthetic technology, the interior recombinant technology of body etc.Described dna sequence dna can effectively be connected on the suitable promotor in the expression vector, and is synthetic to instruct mRNA.The representative example of these promotors has: colibacillary 1ac or trp promotor; Lambda particles phage PL promotor; Eukaryotic promoter comprises LTRs and some other known may command gene expression promoter in protokaryon or eukaryotic cell or its virus of CMV immediate early promoter, early stage and late period SV40 promotor, retrovirus.Expression vector also comprises ribosome bind site and the transcription terminator that translation initiation is used.

In addition, expression vector preferably comprises one or more selected markers, to be provided for selecting the phenotypic character of transformed host cells, cultivate Tetrahydrofolate dehydrogenase, neomycin resistance and the green fluorescent protein (GFP) of usefulness as eukaryotic cell, or be used for colibacillary tsiklomitsin or amicillin resistance.

Comprise the carrier of above-mentioned suitable dna sequence dna and suitable promotor or control sequence, can be used to transform appropriate host cell, so that it can marking protein.

Host cell can be a prokaryotic cell prokaryocyte, as bacterial cell; Or eukaryotic cell such as low, as yeast cell; Or higher eucaryotic cells, as mammalian cell.Representative example has: intestinal bacteria, streptomyces; The bacterial cell of Salmonella typhimurium; Fungal cell such as yeast; Vegetable cell; The insect cell of fruit bat S2 or Sf9; The zooblast of CHO, COS or Bowes melanoma cells etc.

When polynucleotide of the present invention are expressed in higher eucaryotic cells, be enhanced if will make to transcribe when in carrier, inserting enhancer sequence.Enhanser is the cis acting factor of DNA, and nearly 10 to 300 base pairs act on promotor transcribing with enhancing gene usually.Can for example be included in the SV40 enhanser of 100 to 270 base pairs of replication origin side in late period one, at the polyoma enhanser of replication origin side in late period one and adenovirus enhanser etc.

Persons skilled in the art all know how to select appropriate carriers, promotor, enhanser and host cell.

Can carry out with routine techniques well known to those skilled in the art with the recombinant DNA transformed host cell.When the host was prokaryotic organism such as intestinal bacteria, the competent cell that can absorb DNA can be used CaCl in exponential growth after date results ₂Method is handled, and used step is well-known in this area.Alternative is to use MgCl ₂If desired, transforming also the method for available electroporation carries out.When the host is an eukaryote, can select following DNA transfection method for use: coprecipitation of calcium phosphate method, conventional mechanical method such as microinjection, electroporation, liposome packing etc.

The transformant that obtains can be cultivated with ordinary method, expresses the polypeptide of coded by said gene of the present invention.According to used host cell, used substratum can be selected from various conventional substratum in the cultivation.Under the condition that is suitable for the host cell growth, cultivate.After host cell grows into suitable cell density, induce the promotor of selection with suitable method (as temperature transition or chemical induction), cell is cultivated for some time again.

Recombinant polypeptide in the above methods can wrap by in cell, extracellular or on cytolemma, express or be secreted into the extracellular.If desired, can utilize its physics, the separating by various separation methods with other characteristic and the albumen of purification of Recombinant of chemistry.These methods are well-known to those skilled in the art.The example of these methods includes, but are not limited to: conventional renaturation handles, with protein precipitant handle (salt analysis method), centrifugal, the broken bacterium of infiltration, superly handle, the combination of super centrifugal, sieve chromatography (gel-filtration), adsorption chromatography, ion exchange chromatography, high performance liquid chromatography (HPLC) and other various liquid chromatography (LC) technology and these methods.

The people's albumen or the polypeptide with cancer suppressing function of reorganization are of use in many ways.These purposes include, but is not limited to: directly have the disease due to the low or forfeiture of the protein function of cancer suppressing function as pharmacological agent and be used to screen and promote or antagonism has antibody, polypeptide or other part of the protein function of cancer suppressing function.For example, antibody can be used for activating or suppressing to have the proteic function of people of cancer suppressing function.The people's protein screening peptide library that has a cancer suppressing function with the reorganization of expressing can be used for seeking the peptide molecule that can suppress or stimulate the people's protein function with cancer suppressing function of therapeutic value.

The present invention also provides screening of medicaments to improve (agonist) or check the method that (antagonist) has the proteic medicament of people of cancer suppressing function to identify.Agonist improves the biological function such as stimulate cellular proliferation of the people's albumen with cancer suppressing function, and antagonist prevention disorder such as the various cancer relevant with cell hyperproliferation with treatment.For example, can be in the presence of medicine, the proteic film preparation of people that mammalian cell or expression is had cancer suppressing function is cultivated with the people's albumen with cancer suppressing function of mark.Measure the medicine raising then or check this interactional ability.

The proteic antagonist of people with cancer suppressing function comprises antibody, compound, acceptor disappearance thing and the analogue etc. that filter out.The proteic antagonist of people with cancer suppressing function can and be eliminated its function with the people's protein binding with cancer suppressing function, or suppresses to have the proteic generation of people of cancer suppressing function, or combines with the avtive spot of polypeptide and to make polypeptide can not bring into play biological function.The proteic antagonist of people with cancer suppressing function can be used for therepic use.

In screening during as the compound of antagonist, albumen of the present invention can be added during bioanalysis measures, determine by measuring albumen and the interaction between its acceptor that compounds affect has cancer suppressing function whether compound is antagonist.With the same quadrat method of above-mentioned SCREENED COMPOUND, can filter out the acceptor disappearance thing and the analogue of antagonist action.

Polypeptide of the present invention can be directly used in disease treatment, for example, and various malignant tumours and cellular abnormality propagation etc.

Polypeptide of the present invention, and fragment, derivative, analogue or their cell can be used as antigen to produce antibody.These antibody can be polyclone or monoclonal antibody.Polyclonal antibody can obtain by the method with this polypeptide direct injection animal.The technology of preparation monoclonal antibody comprises hybridoma technology, three knurl technology, people B-quadroma technology, EBV-hybridoma technology etc.

Can be with polypeptide of the present invention and antagonist and suitable pharmaceutical carrier combination back use.These carriers can be water, glucose, ethanol, salt, damping fluid, glycerine and their combination.Composition comprises the polypeptide or the antagonist of safe and effective amount and carrier and the vehicle that does not influence effect of drugs.These compositions can be used as medicine and are used for disease treatment.

The present invention also provides medicine box or the test kit that contains one or more containers, and one or more medicinal compositions compositions of the present invention are housed in the container.With these containers, can have by the given indicative prompting of government authorities of making, using or selling medicine or biological products, the government authorities that this prompting reflects production, uses or sells permits it to use on human body.In addition, polypeptide of the present invention can be used in combination with other treatment compound.

Pharmaceutical composition can be with mode administration easily, as by in part, intravenously, intraperitoneal, intramuscular, subcutaneous, the nose or the route of administration of intracutaneous.Albumen with cancer suppressing function comes administration with the amount that treats and/or prevents concrete indication effectively.The proteic amount with cancer suppressing function and the dosage range that are applied to the patient will depend on many factors, as administering mode, person's to be treated healthiness condition and diagnostician's judgement.

The proteic polynucleotide of people with cancer suppressing function also can be used for multiple therapeutic purpose.Gene therapy technology can be used for treating since have that the proteic nothing of cancer suppressing function is expressed or the proteic expression with cancer suppressing function of unusual/non-activity due to cell proliferation, growth or metabolic disturbance.The gene therapy vector of reorganization can be used for treating the protein expression with cancer suppressing function or the disease of active caused by abnormal.Deriving from the expression vector of virus such as protein gene that retrovirus, adenovirus, adeno-associated virus (AAV), hsv, parvovirus etc. can be used for having cancer suppressing function is transferred in the cell.The method that structure carries the recombinant viral vector of the protein gene with cancer suppressing function be found in existing document (Sambrook, etal.).The people protein gene of reorganization with cancer suppressing function can be packaged in the liposome and be transferred in the cell in addition.

Suppress to have cancer suppressing function people's protein mRNA oligonucleotide (comprising sense-rna and DNA) and ribozyme also within the scope of the invention.Ribozyme is the enzyme sample RNA molecule that a kind of energy specificity is decomposed specific RNA, and its mechanism of action is to carry out the endonuclease effect after ribozyme molecule and the hybridization of complementary target RNA-specific.The RNA of antisense and DNA and ribozyme can obtain with existing any RNA or DNA synthetic technology, as the technology widespread use of solid phase phosphoamide chemical synthesis synthetic oligonucleotide.Antisense rna molecule can be transcribed acquisition by the dna sequence dna of this RNA that encodes in external or body.This dna sequence dna has been incorporated into the downstream of rna polymerase promoter of carrier.In order to increase the stability of nucleic acid molecule, available several different methods is modified it, and as increasing the sequence length of both sides, the connection between the ribonucleoside is used phosphoric acid thioester bond or peptide bond but not phosphodiester bond.

Polynucleotide imports tissue or intracellular method comprises: directly be injected into polynucleotide in the in-vivo tissue; Or external by carrier (as virus, phage or plasmid etc.) earlier with the polynucleotide transfered cell in, again cell is transplanted in the body etc.

Polypeptide of the present invention also can be used as the peptide spectrum analysis, for example, the polypeptide available physical, chemistry or enzyme carry out the specificity cutting, and carry out the two-dimentional or three-dimensional gel electrophoresis analysis of one dimension.

The present invention also provides the antibody at the people's proteantigen determinant with cancer suppressing function.These antibody include, but is not limited to: the fragment that polyclonal antibody, monoclonal antibody, chimeric antibody, single-chain antibody, Fab fragment and Fab expression library produce.These antibody can prepare with ordinary method.The anti-proteic antibody of people with cancer suppressing function can be used in the immunohistochemistry technology, detects the people's albumen with cancer suppressing function in the biopsy specimen.

With the also available labelled with radioisotope of the protein bound monoclonal antibody of the people with cancer suppressing function, inject in the body and can follow the tracks of its position and distribution.Antibody among the present invention can be used for treating or prevents and the relevant disease of people's albumen with cancer suppressing function.The antibody that gives suitable dosage can stimulate or block proteic generation of the people with cancer suppressing function or activity.

Antibody also can be used for designing the immunotoxin at a certain privileged sites in the body.As have cancer suppressing function people's albumen high-affinity monoclonal antibody can with bacterium or plant poison (as diphtheria toxin, ricin, abrine etc.) covalent attachment.

Available people's albumen or the polypeptide immune animal of the production of polyclonal antibody with cancer suppressing function, as rabbit, mouse, rat etc.Multiple adjuvant can be used for the enhancing immunity reaction, includes but not limited to freund's adjuvant etc.

Have cancer suppressing function people's protein monoclonal antibody can with hybridoma technology production (Kohler and Milstein.Nature, 1975,256:495-497).With the variable region bonded chimeric antibody in human constant region and inhuman source can with existing technology production (Morrison et al, PNAS, 1985,81:6851).And the technology of existing manufacture order chain antibody (U.S.PatNo.4946778) also can be used for producing the anti-proteic single-chain antibody of people with cancer suppressing function.

Can be incorporated into the rondom polypeptide storehouse that solid formation forms by the various amino acid that may make up by screening with the protein bound peptide molecule of the present invention obtains.During screening, must carry out mark to people's protein molecular with cancer suppressing function.

The invention still further relates to quantitatively and detection and localization has the diagnostic testing process of people's protein level of cancer suppressing function.These tests are known in the art, and comprise that FISH measures and radioimmunoassay.The people's protein level that is detected in the test with cancer suppressing function, the disease that can have the importance of people's albumen in various diseases of cancer suppressing function with laying down a definition and be used to diagnose albumen to work with cancer suppressing function.

Proteic polynucleotide with cancer suppressing function can be used for having the diagnosis and the treatment of the protein related diseases of cancer suppressing function.Aspect diagnosis, the proteic polynucleotide with cancer suppressing function can be used for detecting have cancer suppressing function proteic expression whether or under morbid state, have an abnormal exprssion of cancer suppressing function.As the protein D NA sequence with cancer suppressing function can be used for the hybridization of biopsy specimen is had with judgement the proteic abnormal expression of cancer suppressing function.Hybridization technique comprises the Southern blotting, Northern blotting, in situ hybridization etc.These technological methods all are disclosed mature technologies, and relevant test kit all can obtain from commercial channels.Part or all of polynucleotide of the present invention can be used as probe stationary on microarray (Microarray) or DNA chip (being called " gene chip " again), is used for analyzing the differential expression analysis and the gene diagnosis of tissue gene.Carry out RNA-polymerase chain reaction (RT-PCR) amplification in vitro with the special primer of the albumen with cancer suppressing function and also can detect proteic transcription product with cancer suppressing function.

The sudden change that detection has the protein gene of cancer suppressing function also can be used for diagnosing the relevant disease of albumen with cancer suppressing function.Form with protein mutation of cancer suppressing function comprises that to have point mutation that the protein D NA sequence of cancer suppressing function compares, transposition, disappearance, reorganization and other any unusual etc. with normal wild type.Available existing technology such as Southern blotting, dna sequence analysis, PCR and in situ hybridization detect sudden change.In addition, sudden change might influence proteic expression, therefore can judge indirectly that with Northern blotting, Western blotting gene has or not sudden change.

Sequence of the present invention identifies it also is valuable to karyomit(e).These sequences can be specifically at certain bar human chromosome particular location and and can with its hybridization.At present, need to identify the concrete site of each gene on the karyomit(e).Yet have only chromosomal marker thing seldom to can be used for the marker chromosomes position now based on actual sequence data (repetition polymorphism).For these sequences are associated with disease related gene.The first step is positioned dna sequence dna of the present invention on the karyomit(e) exactly.

In brief, prepare PCR primer (preferred 15-35bp), sequence can be positioned on the karyomit(e) according to cDNA.Then, these primers are used for the somatocyte hybrid cell that the PCR screening contains each bar human chromosome.Have only those hybrid cells that contain corresponding to the people's gene of primer can produce the fragment of amplification.

The PCR localization method of somatocyte hybrid cell is that DNA is navigated to concrete chromosomal quick method.Use Oligonucleolide primers of the present invention,, can utilize one group to realize inferior location from specific chromosomal fragment or a large amount of genomic clone by similar approach.Other the similar strategy that can be used for chromosomal localization comprises in situ hybridization, uses the karyomit(e) prescreen and the hybridization preliminary election of the airflow classification of mark, thereby makes up the special cDNA storehouse of karyomit(e).

The cDNA clone is carried out fluorescence in situ hybridization (FISH) with Metaphase Chromosome, can in a step, accurately carry out chromosomal localization.The summary of this technology is referring to Verma etc., Human Chromosomes:a Manual of BasicTechniques, Pergamon Press, New York (1988).

In case sequence is positioned to chromosome position accurately, the physical location of this sequence on karyomit(e) just can be associated with the gene map data.These data for example are found in, V.Mckusick, Mendelian Inheritance in Man (can by with the online acquisition of Johns Hopkins University Welch Medical Library).Can pass through linkage analysis then, determine gene and navigated to relation between the disease on the chromosomal region already.

Then, need to measure ill and not cDNA between diseased individuals or genome sequence difference.If observe certain sudden change in some or all of diseased individuals, and this sudden change is not observed in any normal individual, then this sudden change may be the cause of disease of disease.More ill and diseased individuals not is usually directed at first seek the variation of structure in the karyomit(e), as from the horizontal visible of karyomit(e) or use based on detectable disappearance of the PCR of cDNA sequence or transposition.

Pyrenoids thuja acid full length sequence or its fragment with cancer suppressing function of the present invention can obtain with the method for pcr amplification method, recombination method or synthetic usually.For the pcr amplification method, can be disclosed according to the present invention about nucleotide sequence, especially open reading frame sequence designs primer, and with commercially available cDNA storehouse or by the prepared cDNA storehouse of ordinary method well known by persons skilled in the art as template, amplification and must relevant sequence.When sequence is longer, usually needs to carry out twice or pcr amplification repeatedly, and then the fragment that each time amplifies is stitched together by proper order.

In case obtained relevant sequence, just can obtain relevant sequence in large quantity with recombination method.This normally is cloned into carrier with it, changes cell again over to, separates obtaining relevant sequence then from the host cell after the propagation by ordinary method.

In addition, also the method for available synthetic is synthesized relevant sequence, especially fragment length more in short-term.Usually, by first synthetic a plurality of small segments, and then connect and to obtain the very long fragment of sequence.

At present, can be fully come the dna sequence dna of code book invention albumen (or its fragment, or derivatives thereof) by chemosynthesis.This dna sequence dna can be introduced then in the various dna moleculars (as carrier) and cell in this area.In addition, also can will suddenly change and introduce in the protein sequence of the present invention by chemosynthesis.

In addition, because the albumen with cancer suppressing function of the present invention has the natural acid sequence that is derived from the people, therefore, compare with the albumen of the same clan that derives from other species, estimate to have higher active and/or lower side effect (for example in the intravital immunogenicity of people lower or do not have) being applied to man-hour.

Below in conjunction with specific embodiment, further set forth the present invention.Should be understood that these embodiment only to be used to the present invention is described and be not used in and limit the scope of the invention.The experimental technique of unreceipted actual conditions in the following example, usually according to people such as normal condition such as Sambrook, molecular cloning: laboratory manual (New York:Cold Spring Harbor LaboratoryPress, 1989) condition described in, or the condition of advising according to manufacturer.

The acquisition of embodiment 1:cDNA gene and the restraining effect that mouse NIH/3T3 cell clone is formed

FP3361, FP7072 and FP8080 come from the human fetal cDNA library that makes up with ordinary method.Get the fetal tissue at 3,6,9 monthly ages, (GIBCO BRL company) extracts total RNA by manufacturer's specification sheets with Trizol reagent, extracts mRNA with the mRNA test kit (Pharmacia company) of purifying.Make up the cDNA library of above-mentioned mRNA with pCMV-script TMXR cDNA library construction test kit (Stratagene company).Wherein ThermoScript II is used MMLV-RT-Superscript II (GIBCO BRL) instead, and reverse transcription reaction carries out at 42 ℃.Transform XL 10-Gold recipient cell, obtained the cDNA library of 1 * 106cfu/ μ g titre.The first round is picking cDNA clone at random, is probe with high abundance cDNA clone with the cDNA clone who has proved anticancer growth function thereafter, screening by hybridization cDNA library, weak positive and negative clone of picking.With Qiagen 96 orifice plate plasmid extraction test kits, carry out the extraction of plasmid DNA by shop instruction.Plasmid DNA and empty carrier transfection simultaneously mouse NIH/3T3 cell.After the 100ng DNA alcohol precipitation drying, add 6 μ lH2O dissolving, treat transfection.Add 0.74 μ l liposome and 9.3 μ l serum-free mediums in every part of DNA sample, behind the mixing, room temperature was placed 10 minutes.Add 150 μ l serum-free mediums in every pipe, divide equally and add 3 holes and grow in the mouse NIH/3T3 cell of 96 orifice plates, placed 2 hours for 37 ℃, every hole adds 50 μ l serum-free mediums again, 37 ℃ 24 hours.Every hole is changed 100 μ l and is trained liquid entirely, 37 ℃ 24 hours, change the full training liquid 100 μ l that contain G418,37 ℃ 24-48 hour, the limit is observed, the training liquid that G418 concentration does not wait is changed on the limit.After about 2-3 time, there is the clone to form up to the microscopy cell, counting.Find that above-mentioned clone has the cell clone of inhibition formation effect, the result is as shown in the table.

CDNA clone's transfectional cell (3T3) clone formation situation

CDNA clones title	CDNA clones number (three repetitions)			Empty carrier clone number (three repetitions)
CDNA clones title	CDNA clones number (three repetitions)			Empty carrier clone number (three repetitions)			FP3361	7	6	3	30	32	29
FP7072	2	0	5	21	28	20	FP3361	7	6	3	30	32	29
FP7072	2	0	5	21	28	20	FP8080	3	1	2	21	25	27

The cDNA clone is adopted two deoxidation cessation method, on the ABI377 automatic dna sequencer, measure the nucleotide sequence of the nearly 500bp of one end.After the analysis, be defined as novel gene cloning, carry out the other end order-checking,, then design primer, check order once more, up to obtaining full length sequence (SEQ ID NO:1,4,7) as obtaining full length cDNA sequence not yet.

Embodiment 2: PCR obtains full-length gene from placenta or fetus cDNA:

Get the fetal tissue (FP clone) at 3,6,9 monthly ages, (GIBCO BRL company) extracts total RNA by manufacturer's specification sheets with Trizol reagent, extracts mRNA with the mRNA test kit (Pharmacia company) of purifying.With MMLV-RT-Superscript II (GIBCO BRL), ThermoScript II is carried out reverse transcription reaction at 42 ℃, obtains placenta or fetus cDNA.Utilize the special primer (as shown in the table) of each gene, by 97 ℃ 3 ', 1 circulation.94 ℃ 30 ", 60 ℃ 30 ", 72 ℃ 1 ', totally 35 circulations; 72 ℃ 10 ', pcr amplification is carried out in 1 circulation, and acquisition contains the amplified production of each protein gene of complete open reading frame sequence.Amplified production is through sequence verification, and the sequence that records with embodiment 1 conforms to, and changes amplified production over to host cell with routine techniques subsequently, obtains recombinant protein (SEQ ID NO:3,6,9).

Gene specific primer

Clone's title	Special primer 1 (5 ' → 3 ')	SEQ ID NO：	Special primer 2 (3 ' → 5 ')	SEQ ID NO：
Clone's title	Special primer 1 (5 ' → 3 ')	SEQ ID NO：	Special primer 2 (3 ' → 5 ')	SEQ ID NO：	FP3361	(10)atgatggtcaggctggtcaa	10	(2947)tcgaaagtacaacgctttattgaa	11
FP7072	(27)ggccgttgacctggtgac	12	(2192)aaagcaaatgtattcttcctcca	13	FP3361	(10)atgatggtcaggctggtcaa	10	(2947)tcgaaagtacaacgctttattgaa	11
FP7072	(27)ggccgttgacctggtgac	12	(2192)aaagcaaatgtattcttcctcca	13	FP8080	(44)tttcaatcccagtctgcctc	14	(2886)ggctggcgctattgtttatt	15

Annotate: in the bracket is the correspondence position of primer in each gene DNA sequence.

Embodiment 3:cDNA cloned sequence is analyzed

1.FP3361 albumen

A: nucleotide sequence (SEQ ID NO:1) length: 3046 bases

1 GGTTTCACCA TGATGGTCAG GCTGGTCAAG ATCTCCTGAC CTCAAGTGAT CTGCCCGCCT

61 CGGCCTCCCA AAGTGCTGGG ATTACAGACA TGAACCACAA CACCTGCCAA GCCCAGGTGT

121 TTTGAATCCA GCACGGGCAA TGGCAAGACC CTGTCCCAAC AAACAAAAAA ACACTGGTGC

181 CTGAGGCCCA CCCCGAGAGA TTCTGATTGA GTTGATTTGG ATTTGAATTA TCTTTAAAAT

241 TTTGGATGTG AATTTTTTTT TTTTTTCTTG GAATGGAGTT TCACTCTTGT TCCCCAGGCT

301 GGAGTGCAAC GGGACGATCT TGGCTCACCG CAACCTCCGC CTACTGGGTT CAAGCCATTC

361 TCCTGCCTCA GCTTCCCAAG TAGCTGGGAT TACAGGCATG TGCTACCATG CCCAGCTAAT

421 TTTGTATTTT TAGTAGAGAC AGGGTTTCTC CATGTTGCTC AGGCTGGTCT TGAACTCCTG

481 ACCTCAGGTG ATCCGCCCGC CTCGGACTCC CAAAGTGCTG GGATTACAGG CATGAGCCAC

541 CATGCCTGGC CGGATGTGAA TTATCTTAAA AATTTTCAGG TAATTCTAAT GGGCCAAGGT

601 TGAGAACCCC TGCTCTGGGC CCATCCGAGC ACCAGGCTGT CACAAACGCA TGCATGCACT

661 CACGCCCGTG GGCTTGGGGG GCTTCGGAAA TGTGCTTCTG CTTTTTTGAG ATGGGGTCTT

721 TCTGTTGCCC ATCCCGGAGC ACAGTGGCAC GATCACAGCT CACTGCAGCC TCGACCTCCT

781 GGGCTCAGGT GATCCTCCCG CCTCAGCCTC CTGAGTGTCT GCTTCTGGTT TTCATGATGA

841 CCTGGGGCCC AGGCATACTA CACTTGTGCT GTTCAGGGGC CAGTCCTGCA CCAGGAGCCC

901 ATCAGCCACA GCTCCGCCGA GAAGCACTGA TATGCAGAGC TAAGCAGCTT TGTTTCCACG

961 TGGATCCTGC GTAGGTTTTC TTGGTCCATC CGTAGACACC GCACTCCTGC AGAGGATCTT

1021 CTCGGGATGC CCCACTGTCT CTGTTTTCCC TCTTCACTGA ACACTCAGTC GGGGCTCGCC

1081 ATGATGCCTC TGTGTCTGCT GGCTTCTCCC CCATTGGAAC AGCCTTCTTG GCACGCCACA

1141 CTGCTAGCTG CTGGGCACTG TGCTTTCTGC CTTTACCGTT CTGCCGTGAT GTTGCCAAAA

1201 TAGCAGCAAC AACAACAACA ACAAAGGCTG GGCACCTGGC TCATGCCTGT ATTCCCAGCA

1261 GTTCGAGATC AGCCTGGGCA ACATGGTGAG ACCCTATCTC TACAAAAATA AAAAATGAAA

1321 TGAGCTGGGT GGGGTGGCGC ATGCATGCCT GTGGTCCCAG CTACTTGGGA GGCTGAGGTC

1381 GGAAGCTCGC TGGTCGGAAG ATCGCTGGAG CTTAGCCTTG AGGTCAAGGT TGCAGTGAGC

1441 CGAGATTACA TCACTGCACT CCAGCGTGGG AGACAGAGAC CCTGTATTAA CAAACAAAAA

1501 CACAAACCAC AAAGGGCAGG TCTGAAACTG CCATTTAAAA AAAAATTTGA TAAACTTAAA

1561 AAAATATATA TCCACAGATG CAGGTGAAGA ACCTGTTGTC TTCCTCAAGC CTCTTTTTCA

1621 CCCATGGGTG GAAATGGTGC CCTGGACACC CAGGCCCACG AGGTCTTTGC GTGGGGTCCC

1681 TACACAGGGC TTTAGCTTAC ACTGTGCTGC CCTCCTGTCC CCCGAGTTCC CAGTCTGTCA

1741 AAATCCAACC TGGTCTCCCA GGCCCAGGGC AAATGCCACC TCCTCCATGA AGCCTGCCAC

1801 ATCCTTTGCA CACCCTTGGG CGCTGACCTT GTTCTCCCAG CGCACAGGCA CGGGTAGTTT

1861 GCCCCTGTAG TAGTAACTCA GGCACAAAAC GAACTCTTGC TGAGGCTCGG CCGCGCAGAG

1921 CTGAGGGTTG CCGCTTCCAG GTTCAAGTGC ATTTTGAGTT TCATTCCCAG CTTCCTTCTT

1981 TTTCTGGTCT TTAATTTCTT CTCCGGATTA GGTCCCACTC AATGCTTTCC TTCTCAATTT

2041 CCAAAAGAGT ATGGTCAGAG CCAGCAGCAC ACCACCTTCC CCATGGGTGG GGGGGGGGCC

2101 AGCCTGTGGC GGGGGTGCGG GTCCCATCTT TTCGAAGGAA TTGACCCACA GTGGGCGGGT

2161 CCACCTTTGA CCTTGCCCCA GGGAGCGCAG ACAGAAAAAA GATCCTTGCT TAGTTTGAGG

2221 GGCCGCTGGG GTGCTCGGTT TGTCTTCAGA GGCCTGTCTG TAACACCAAT GCCAACCCGG

2281 TGGCACTGAC TGGTCACCCT GAAGGCCACG GCCAGTGTCC TAGGAAGGGA CTCAATTTCT

2341 AGCTGTGCCA CCTGAGATTC TGGGGTTAGG CTGGTTGTGC TTCTGAAGTT CCACTGTGCT

2401 CAAAGTGCTT GGTGAAAGTT AGCGAAGGTG ATTTTACAAA AATAGATGCA TAAAATGTCT

2461 AGGAAACACA AAAAATCCTC ATTACTCTTC TCTCCAAATA TTTTTTAAGC CCCAACTGGA

2521 CCCTAGGCAA AAGTGAGTGG CACTCCTCTG CCAGGACTCC AGGCAAGCCC CGGCATCTTC

2581 TTGCTGCCGT CCCAGACAAC AGAAGTTACC AGATGAACAG ACTTGGATGG GCCACGGGGG

2641 TGGAGAGCTG GAAAGCTTGG CTGTGCCTCT CGATGATGAT TAAGATTTCA ATATTTACAG

2701 CAAAACCACA AAGCAAATGA TAGAATAAAG CAAAACAATG GAAAATGTGA GTTCACTCGT

2761 GAGAGAGGTA CGTATGTGAG CTCTGAGGAA ATTACAGAGG GAACGCATGC AGCGGGACAG

2821 CTCTCCCAAT CGCAGCGTGC AAAGTAGACA TCCATAGTGT CTTTTGAAAA ATGAAAAACA

2881 CATTACTTTG AACAGCCAAG AAAAAAATTG CAATTTATTA AGATTCAATA AAGCGTTGTA

2941 CTTTCGAAAG CCAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA

3001 AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAA

B: aminoacid sequence (SEQ ID NO:3) length: 106 amino acid

1 MHSRPWAWGA SEMCFCFFEM GSFCCPSRST VARSQLTAAS TSWAQVILPP QPPECLLLVF

61 MMTWGPGILH LCCSGASPAP GAHQPQLRRE ALICRAKQLC FHVDPA

C. Nucleotide and amino acid composite sequence (SEQ ID NO:2) clone number and protein name: FP3361 start code: 654 ATG stop coding: 972 TAG protein molecular weight: 11650.08Da

1 GG TTT CAC CAT GAT GGT CAG GCT GGT CAA GAT CTC CTG ACC TCA AGT GAT CTG CCC GCC 59

60 TCG GCC TCC CAA AGT GCT GGG ATT ACA GAC ATG AAC CAC AAC ACC TGC CAA GCC CAG GTG 119

120 TTT TGA ATC CAG CAC GGG CAA TGG CAA GAC CCT GTC CCA ACA AAC AAA AAA ACA CTG GTG 179

180 CCT GAG GCC CAC CCC GAG AGA TTC TGA TTG AGT TGA TTT GGA TTT GAA TTA TCT TTA AAA 239

240 TTT TGG ATG TGA ATT TTT TTT TTT TTT CTT GGA ATG GAG TTT CAC TCC TGT TCC CCA GGC 299

300 TGG AGT GCA ACG GGA CGA TCC TGG CTC ACC GCA ACC TCC GCC TAC TGG GTT CAA GCC ATT 359

360 CTC CTG CCC CAG CTT CCC AAG TAG CTG GGA TTA CAG GCA TGT GCT ACC ATG CCC AGC TAA 419

420 TTT TGT ATT TTT AGT AGA GAC AGG GTT TCT CCA TGT TGC TCA GGC TGG TCT TGA ACT CCT 479

480 GAC CCC AGG TGA TCC GCC CGC CCC GGA CTC CCA AAG TGC TGG GAT TAC AGG CAT GAG CCA 539

540 CCA TGC CTG GCC GGA TGT GAA TTA TCT TAA AAA TTT TCA GGT AAT TCT AAT GGG CCA AGG 599

600 TTG AGA ACC CCT GCT CTG GGC CCA TCC GAG CAC CAG GCC GTC ACA AAC GCA TGC ATG CAC 659

1 Met His 2

660 TCA CGC CCG TGG GCT TGG GGG GCT TCG GAA ATG TGC TTC TGC TTT TTT GAG ATG GGG TCT 719

3 Ser Arg Pro Trp Ala Trp Gly Ala Ser Glu Met Cys Phe Cys Phe Phe Glu Met Gly Ser 22

720 TTC TGT TGC CCA TCC CGG AGC ACA GTG GCA CGA TCA CAG CTC ACT GCA GCC TCG ACC TCC 779

23 Phe Cys Cys Pro Ser Arg Ser Thr Val Ala Arg Ser Gln Leu Thr Ala Ala Ser Thr Ser 42

780 TGG GCT CAG GTG ATC CTC CCG CCC CAG CCT CCT GAG TGT CTG CTT CTG GTT TTC ATG ATG 839

43 Trp Ala Gln Val Ile Leu Pro Pro Gln Pro Pro Glu Cys Leu Leu Leu Val Phe Met Met 62

840 ACC TGG GGC CCA GGC ATA CTA CAC TTG TGC TGT TCA GGG GCC AGT CCT GCA CCA GGA GCC 899

63 Thr Trp Gly Pro Gly Ile Leu His Leu Cys Cys Ser Gly Ala Ser Pro Ala Pro Gly Ala 82

900 CAT CAG CCA CAG CTC CGC CGA GAA GCA CTG ATA TGC AGA GCT AAG CAG CTT TGT TTC CAC 959

83 His Gln Pro Gln Leu Arg Arg Glu Ala Leu Ile Cys Arg Ala Lys Gln Leu Cys Phe His 102

960 GTG GAT CCT GCG TAG GTT TTC TTG GTC CAT CCG TAG ACA CCG CAC TCC TGC AGA GGA TCT 1019

103 Val Asp Pro Ala *** 107

1020 TCC CGG GAT GCC CCA CCG TCT CTG TTT TCC CCC TTC ACT GAA CAC TCA GTC GGG GCT CGC 1079

1080 CAT GAT GCC TCT GTG TCT GCT GGC TTC TCC CCC ATT GGA ACA GCC TTC TTG GCA CGC CAC 1139

1140 ACT GCT AGC TGC TGG GCA CTG TGC TTT CTG CCT TTA CCG TTC TGC CGT GAT GTT GCC AAA 1199

1200 ATA GCA GCA ACA ACA ACA ACA ACA AAG GCT GGG CAC CTG GCT CAT GCC TGT ATT CCC AGC 1259

1260 AGT TCG AGA TCA GCC TGG GCA ACA TGG TGA GAC CCT ATC TCT ACA AAA ATA AAA AAT GAA 1319

1320 ATG AGC TGG GTG GGG TGG CGC ATG CAT GCC TGT GGT CCC AGC TAC TTG GGA GGC TGA GGT 1379

1380 CGG AAG CTC GCT GGT CGG AAG ATC GCT GGA GCT TAG CCT TGA GGT CAA GGT TGC AGT GAG 1439

1440 CCG AGA TTA CAT CAC TGC ACT CCA GCG TGG GAG ACA GAG ACC CTG TAT TAA CAA ACA AAA 1499

1500 ACA CAA ACC ACA AAG GGC AGG TCT GAA ACT GCC ATT TAA AAA AAA ATT TGA TAA ACT TAA 1559

1560 AAA AAT ATA TAT CCA CAG ATG CAG GTG AAG AAC CTG TTG TCT TCC TCA AGC CTC TTT TTC 1619

1620 ACC CAT GGG TGG AAA TGG TGC CCT GGA CAC CCA GGC CCA CGA GGT CTT TGC GTG GGG TCC 1679

1680 CTA CAC AGG GCT TTA GCT TAC ACT GTG CTG CCC TCC TGT CCC CCG AGT TCC CAG TCT GTC 1739

1740 AAA ATC CAA CCT GGT CTC CCA GGC CCA GGG CAA ATG CCA CCT CCT CCA TGA AGC CTG CCA 1799

1800 CAT CCT TTG CAC ACC CTT GGG CGC TGA CCT TGT TCT CCC AGC GCA CAG GCA CGG GTA GTT 1859

1860 TGC CCC TGT AGT AGT AAC TCA GGC ACA AAA CGA ACT CTT GCT GAG GCT CGG CCG CGC AGA 1919

1920 GCT GAG GGT TGC CGC TTC CAG GTT CAA GTG CAT TTT GAG TTT CAT TCC CAG CTT CCT TCT 1979

1980 TTT TCT GGT CTT TAA TTT CTT CTC CGG ATT AGG TCC CAC TCA ATG CTT TCC TTC TCA ATT 2039

2040 TCC AAA AGA GTA TGG TCA GAG CCA GCA GCA CAC CAC CTT CCC CAT GGG TGG GGG GGG GGC 2099

2100 CAG CCT GTG GCG GGG GTG CGG GTC CCA TCT TTT CGA AGG AAT TGA CCC ACA GTG GGC GGG 2159

2160 TCC ACC TTT GAC CTT GCC CCA GGG AGC GCA GAG AGA AAA AAG ATC CTT GCT TAG TTT GAG 2219

2220 GGG CCG CTG GGG TGC TCG GTT TGT CTT CAG AGG CCT GTC TGT AAC ACC AAT GCC AAC CCG 2279

2280 GTG GCA CTG ACT GGT CAC CCT GAA GGC CAC GGC CAG TGT CCT AGG AAG GGA CTC AAT TTC 2339

2340 TAG CTG TGC CAC CTG AGA TTC TGG GGT TAG GCT GGT TGT GCT TCT GAA GTT CCA CTG TGC 2399

2400 TCA AAG TGC TTG GTG AAA GTT AGC GAA GGT GAT TTT ACA AAA ATA GAT GCA TAA AAT GTC 2459

2460 TAG GAA ACA CAA AAA ATC CTC ATT ACT CTT CTC TCC AAA TAT TTT TTA AGC CCC AAC TGG 2519

2520 ACC CTA GGC AAA AGT GAG TGG CAC TCC TCT GCC AGG ACT CCA GGC AAG CCC CGG CAT CTT 2579

2580 CTT GCT GCC GTC CCA GAC AAC AGA AGT TAC CAG ATG AAC AGA CTT GGA TGG GCC ACG GGG 2639

2640 GTG GAG AGC TGG AAA GCT TGG CTG TGC CTC TCG ATG ATG ATT AAG ATT TCA ATA TTT ACA 2699

2700 GCA AAA CCA CAA AGC AAA TGA TAG AAT AAA GCA AAA CAA TGG AAA ATG TGA GTT CAC TCG 2759

2760 TGA GAG AGG TAC GTA TGT GAG CTC TCA GGA AAT TAC AGA GGG AAC GCA TGC AGC GGG ACA 2819

2820 GCT CTC CCA ATC GCA GCG TGC AAA GTA GAC ATC CAT AGT GTC TTT TGA AAA ATG AAA AAC 2879

2880 ACA TTA CTT TGA ACA GCC AAG AAA AAA ATT GCA ATT TAT TAA GAT TCA ATA AAG CGT TGT 2939

2940 ACT TTC GAA AGC CAA AAA AAA AAA AAA AAA AAA AAA AAA AAA AAA AAA AAA AAA AAA AAA 2999

3000 AAA AAA AAA AAA AAA AAA AAA AAA AAA AAA AAA AAA AAA AAA AAA AA 3046

2. FP7072

A: nucleotide sequence (SEQ ID NO:4) length: 2224 bases

1 GGGACCTGGG GGCTGTGGCC GGGGGCGGCC GTTGACCTGG TGACCGCGGC GCCGCCCCAG

61 ACCGGGGGCG CAGTCCCACT CGCTCCGAGC CCCGGTCCCC CAAGCCTCCC TCCCGGGTAC

121 CTGGGGCCGC GCCCGCCCTG CGCCCAGCTC CGCCCTCCGT CGGCCCAGGC CTGACAGAGC

181 CCGGCAGCCA TGAGTGCCAA CCCCCGGTGG GACATCAGCA GGGCGCTGGG GGTGGCCAAG

241 CTCTTCCACC TGGTGTGCGG GGTGCGGGAA GCCTGCGTGA CCCCGTTCCT GACCCTTTAC

301 CTGAGGCAGC TGGGCTTGGC CGCGCCCTGG GTGGGCACCC TAATGGGAAC CAAGCACCTA

361 ATCGCTGCCT TCTGGGCTCC CGTCTGTGCC TTCCTGGCCA AAAGCTACCG GAAAAGGAGA

421 GCGCTTCTGA TCGGCTCCCT GCTCGGCTCG GTGGGGGCCA GCCTGCTGAT GGTCCTGGTC

481 CCACCGGTAG ACAAAAATCG GGTGCACTTC CCTTGTAATG GAAGCAGCGG CCTGACCAGC

541 ACAGACGCAC TCCCGGGGGT CACGCTACCT GTGAACATCA CCTCGGCCCA AGAGTCTGCC

601 TCCAGCCACC CAGCCAAGAG GACTGCAGAG GTGGAAATGC CTGGCTTCAG AAACCCACCT

661 GGTGAAAGTG ACCGAGAAAC TTTCCGTGAT CTGCACGTCT ACTTAGCGCC CTCCGTTGAA

721 GGAGCTAGGA CCACATCCCA AGCTCTCCTC CATCCTGTCA CTTCGGGGCT GAAAGATCAT

781 CCCTGGGAAG TTACTTTTGA GGTGGTCAAG ACAGCCCTCC CCTTGCTTCC TGGGGGGAAA

841 GGGCCCGGGA ATCCAGCCAA TTTGTCAGGG ACCAAGGGGA AAGCCTGGGC TTTTGACCTG

901 TCCTTGGAGG CGTTGCGGCG GACTTTTATC CTCTCCTTGG GGTCCGTGGC GTTCTGGGAG

961 CTGCTGACAG CGCCTCTGGA GCAGGTGGCA GATGACAGCC TTTATGAGTT CCTGGATTTT

1021 GTGGATGCCA CTGACCGATA CAGAAGCCTG TGGGTCTGGA GGTTGCTGGG CATGTCGGCA

1081 GGCGTGTGTG GCATCACAGC CTTGGTGGGG CAGCTGGACT GCTTCCTGAT GACCAGTGGC

1141 CCCCGAGGTG TGGTCCACTT CTATGGGTAC TCGGTGGTCA GCACCCTGGC CTTACTGGTG

1201 AGCATTGCCT TTCCCATTCC CATCTGTCAG CAGTGGGAGC CCAGCTACAA AAGGGTCAAA

1261 GCACTGTCCA TTGTGGGGGG GGTGACCCCC ACCTCATTCT CCTCGCCTCC ACCACTGTTT

1321 TGGTAGGAGC CATCGTCAGT ACTGTCCAGA ACTTTCTGTT CTGGCACATG AAGGACCATG

1381 GGAGCGGCGA GCTGGTCATG GGTTTCTCGG TCGCCCTCAG CTTGCTGGGG GAAATTCTGC

1441 TTCATCCGTT CAAAGCTACA TTGCTTAGGA AACTGTCCAG GACGGGCCTG GTGGGGCTGG

1501 GGCTGAGCTG CCTCGCTGGG CAGCTGCTGT ACTACTCTTT CCTCTGGAGC TGGTGGTCCG

1561 TCCTCCCCAT TCAGATCTTG AGTGCCATTA GCAACAGAGC TTTGTGGTGG GCTGTGGGGG

1621 CCTCAGTAGA GGACCTGGCC ACTCCCCGCA TGGAGAGGGC TCTGAGTGCC TTGTTCCGAG

1681 GCCACTTTTA CGGGAGTGGC TGTAGCCTGG GCAGCTTTGT CGGGGGCTTC GTGGTGATGC

1741 GCTTCAGCCT GGCTGTGCTC TACCAGGCCT GCTGTGTGGC CCTGTTGCTC TGGTTGGCCT

1801 TGCTCCTGTC CATACAGCGG AGGCTGCCCC GAGAGCGGAA AATCAAGTAC TCGAAGCTGC

1861 TGTCCATGGA GGTGAGTGAC ACCAGTGACT CTGAGCAGGG GACAGAACAG GACTGGCTTG

1921 TGAAGGCCAT GAGGGAGGAA CACTCAGACT GAAAGGGCTG AGAAATCCAG AGTGTGCTGA

1981 TCCAGCAAGG AACGAATGGA CTGAACAAAA CTCAGCCTGC TGAGGACAGA AACCTGCCCT

2041 GGACTGCTGG GAGCCGGGGA AGAGAGGATG GGTCTGTGCT GAAGGCCCAA CAGGATCATC

2101 TCATTGCATG ATTTTCTTTA CTTTTGAAGT AAAAGGAGAT TTAACTTTTT GCCAATCTTT

2161 TTTAGATAAT GGAGGAAGAA TACATTTGCT TTTTAAAAAG TTAAAAAAAA AAAAAAAAAA

2221 AAAA

B: aminoacid sequence (SEQ ID NO:6) length: 378 amino acid

1 MSANPRWDIS RALGVAKLFH LVCGVREACV TPFLTLYLRQ LGLAAPWVGT LMGTKHLIAA

61 FWAPVCAFLA KSYRKRRALL IGSLLGSVGA SLLMVLVPPV DKNRVHFPCN GSSGLTSTDA

121 LPGVTLPVNI TSAQESASSH PAKRTAEVEM PGFRNPPGES DRETFRDLHV YLAPSVEGAR

181 TTSQALLHPV TSGLKDHPWE VTFEVVKTAL PLLPGGKGPG NPANLSGTKG KAWAFDLSLE

241 ALRRTFILSL GSVAFWELLT APLEQVADDS LYEFLDFVDA TDRYRSLWVW RLLGMSAGVC

301 GITALVGQLD CFLMTSGPRG VVHFYGYSVV STLALLVSIA FPIPICQQWE PSYKRVKALS

361 IVGGVTPTSF SSPPPLFW

C. Nucleotide and amino acid composite sequence (SEQ ID NO:5) clone number and protein name: FP7072 start code: 190 ATG stop coding: 1324 TAG protein molecular weight: 40874.15Da

1 GGG ACC TGG GGG CTG TGG CCG GGG GCG GCC GTT GAC CTG GTG ACC GCG GCG CCG CCC CAG 60

61 ACC GGG GGC GCA GTC CCA CTC GCT CCG AGC CCC GGT CCC CCA AGC CTC CCT CCC GGG TAC 120

121 CTG GGG CCG CGC CCG CCC TGC GCC CAG CTC CGC CCT CCG TCG GCC CAG GCC TGA CAG AGC 180

181 CCG GCA GCC ATG AGT GCC AAC CCC CGG TGG GAC ATC AGC AGG GCG CTG GGG GTG GCC AAG 240

1 Met Ser Ala Asn Pro Arg Trp Asp Ile Ser Arg Ala Leu Gly Val Ala Lys 17

241 CTC TTC CAC CTG GTG TGC GGG GTG CGG GAA GCC TGC GTG ACC CCG TTC CTG ACC CTT TAC 300

18 Leu Phe His Leu Val Cys Gly Val Arg Glu Ala Cys Val Thr Pro Phe Leu Thr Leu Tyr 37

301 CTG AGG CAG CTG GGC TTG GCC GCG CCC TGG GTG GGC ACC CTA ATG GGA ACC AAG CAC CTA 360

38 Leu Arg Gln Leu Gly Leu Ala Ala Pro Trp Val Gly Thr Leu Met Gly Thr Lys His Leu 57

361 ATC GCT GCC TTC TGG GCT CCC GTC TGT GCC TTC CTG GCC AAA AGC TAC CGG AAA AGG AGA 420

58 Ile Ala Ala Phe Trp Ala Pro Val Cys Ala Phe Leu Ala Lys Ser Tyr Arg Lys Arg Arg 77

421 GCG CTT CTG ATC GGC TCC CTG CTC GGC TCG GTG GGG GCC AGC CTG CTG ATG GTC CTG GTC 480

78 Ala Leu Leu Ile Gly Ser Leu Leu Gly Ser Val Gly Ala Ser Leu Leu Met Val Leu Val 97

481 CCA CCG GTA GAC AAA AAT CGG GTG CAC TTC CCT TGT AAT GGA AGC AGC GGC CTG ACC AGC 540

98 Pro Pro Val Asp Lys Asn Arg Val His Phe Pro Cys Asn Gly Ser Ser Gly Leu Thr Ser 117

541 ACA GAC GCA CTC CCG GGG GTC ACG CTA CCT GTG AAC ATC ACC TCG GCC CAA GAG TCT GCC 600

118 Thr Asp Ala Leu Pro Gly Val Thr Leu Pro Val Asn Ile Thr Ser Ala Gln Glu Ser Ala 137

601 TCC AGC CAC CCA GCC AAG AGG ACT GCA GAG GTG GAA ATG CCT GGC TTC AGA AAC CCA CCT 660

138 Ser Ser His Pro Ala Lys Arg Thr Ala Glu Val Glu Met Pro Gly Phe Arg Asn Pro Pro 157

661 GGT GAA AGT GAC CGA GAA ACT TTC CGT GAT CTG CAC GTC TAC TTA GCG CCC TCC GTT GAA 720

158 Gly Glu Ser Asp Arg Glu Thr Phe Arg Asp Leu His Val Tyr Leu Ala Pro Ser Val Glu 177

721 GGA GCT AGG ACC ACA TCC CAA GCT CTC CTC CAT CCT GTC ACT TCG GGG CTG AAA GAT CAT 780

178 Gly Ala Arg Thr Thr Ser Gln Ala Leu Leu His Pro Val Thr Ser Gly Leu Lys Asp His 197

781 CCC TGG GAA GTT ACT TTT GAG GTG GTC AAG ACA GCC CTC CCC TTG CTT CCT GGG GGG AAA 840

198 Pro Trp Glu Val Thr Phe Glu Val Val Lys Thr Ala Leu Pro Leu Leu Pro Gly Gly Lys 217

841 GGG CCC GGG AAT CCA GCC AAT TTG TCA GGG ACC AAG GGG AAA GCC TGG GCT TTT GAC CTG 900

218 Gly Pro Gly Asn Pro Ala Asn Leu Ser Gly Thr Lys Gly Lys Ala Trp Ala Phe Asp Leu 237

901 TCC TTG GAG GCG TTG CGG CGG ACT TTT ATC CTC TCC TTG GGG TCC GTG GCG TTC TGG GAG 960

238 Ser Leu Glu Ala Leu Arg Arg Thr Phe Ile Leu Ser Leu Gly Ser Val Ala Phe Trp Glu 257

961 CTG CTG ACA GCG CCT CTG GAG CAG GTG GCA GAT GAC AGC CTT TAT GAG TTC CTG GAT TTT 1020

258 Leu Leu Thr Ala Pro Leu Glu Gln Val Ala Asp Asp Ser Leu Tyr Glu Phe Leu Asp Phe 277

1021 GTG GAT GCC ACT GAC CGA TAC AGA AGC CTG TGG GTC TGG AGG TTG CTG GGC ATG TCG GCA 1080

278 Val Asp Ala Thr Asp Arg Tyr Arg Ser Leu Trp Val Trp Arg Leu Leu Gly Met Ser Ala 297

1081 GGC GTG TGT GGC ATC ACA GCC TTG GTG GGG CAG CTG GAC TGC TTC CTG ATG ACC AGT GGC 1140

298 Gly Val Cys Gly Ile Thr Ala Leu Val Gly Gln Leu Asp Cys Phe Leu Met Thr Ser Gly 317

1141 CCC CGA GGT GTG GTC CAC TTC TAT GGG TAC TCG GTG GTC AGC ACC CTG GCC TTA CTG GTG 1200

318 Pro Arg Gly Val Val His Phe Tyr Gly Tyr Ser Val Val Ser Thr Leu Ala Leu Leu Val 337

1201 AGC ATT GCC TTT CCC ATT CCC ATC TGT CAG CAG TGG GAG CCC AGC TAC AAA AGG GTC AAA 1260

338 Ser Ile Ala Phe Pro Ile Pro Ile Cys Gln Gln Trp Glu Pro Ser Tyr Lys Arg Val Lys 357

1261 GCA CTG TCC ATT GTG GGG GGG GTG ACC CCC ACC TCA TTC TCC TCG CCT CCA CCA CTG TTT 1320

358 Ala Leu Ser Ile Val Gly Gly Val Thr Pro Thr Ser Phe Ser Ser Pro Pro Pro Leu Phe 377

1321 TGG TAG GAG CCA TCG TCA GTA CTG TCC AGA ACT TTC TGT TCT GGC ACA TGA AGG ACC ATG 1380

378 Trp *** 379

1381 GGA GCG GCG AGC TGG TCA TGG GTT TCT CGG TCG CCC TCA GCT TGC TGG GGG AAA TTC TGC 1440

1441 TTC ATC CGT TCA AAG CTA CAT TGC TTA GGA AAC TGT CCA GGA CGG GCC TGG TGG GGC TGG 1500

1501 GGC TGA GCT GCC TCG CTG GGC AGC TGC TGT ACT ACT CTT TCC TCT GGA GCT GGT GGT CCG 1560

1561 TCC TCC CCA TTC AGA TCT TGA GTG CCA TTA GCA ACA GAG CTT TGT GGT GGG CTG TGG GGG 1620

1621 CCT CAG TAG AGG ACC TGG CCA CTC CCC GCA TGG AGA GGG CTC TGA GTG CCT TGT TCC GAG 1680

1681 GCC ACT TTT ACG GGA GTG GCT GTA GCC TGG GCA GCT TTG TCG GGG GCT TCG TGG TGA TGC 1740

1741 GCT TCA GCC TGG CTG TGC TCT ACC AGG CCT GCT GTG TGG CCC TGT TGC TCT GGT TGG CCT 1800

1801 TGC TCC TGT CCA TAC AGC GGA GGC TGC CCC GAG AGC GGA AAA TCA AGT ACT CGA AGC TGC 1860

1861 TGT CCA TGG AGG TGA GTG ACA CCA GTG ACT CTG AGC AGG GGA CAG AAC AGG ACT GGC TTG 1920

1921 TGA AGG CCA TGA GGG AGG AAC ACT CAG ACT GAA AGG GCT GAG AAA TCC AGA GTG TGC TGA 1980

1981 TCC AGC AAG GAA CGA ATG GAC TGA ACA AAA CTC AGC CTG CTG AGG ACA GAA ACC TGC CCT 2040

2041 GGA CTG CTG GGA GCC GGG GAA GAG AGG ATG GGT CTG TGC TGA AGG CCC AAC AGG ATC ATC 2100

2101 TCA TTG CAT GAT TTT CTT TAC TTT TGA AGT AAA AGG AGA TTT AAC TTT TTG CCA ATC TTT 2160

2161 TTT AGA TAA TGG AGG AAG AAT ACA TTT GCT TTT TAA AAA GTT AAA AAA AAA AAA AAA AAA 2220

2221 AAA A 2224

3.FP8080 albumen

A: nucleotide sequence (SEQ ID NO:7) length: 2916 bases

1 GAAAAGACGT TTAATAAAAC AGCTTTACCT TTTAAAAATA AAATTTCAAT CCCAGTCTGC

61 CTCAGGTACG GGTGGGGGTG TGCAGCGTGA CTCCTGGCTG TGTTCCTGTA CCGTTTGTGG

121 CCTCCGGGGT CTTGCGCGTC CAGGGCCCAG ATACCGAGTA ACTGCCATGT GGTGTCGTGG

181 TGGCGTCCAT AGGCCTCCGT GGTGGGGCGG GGGCGGGGCT GTGGGTGTGC AGGGCAGCTG

241 CATTTTCCAA GAGTGGAGAT GGAGGATGGA GATTTGCTTC GTAAAGGCCT GGGAGCTACA

301 TGGGAGGTAG GGAGGGGCCA GCCACACCCA GGGCTGGGGG GCCGGTGGAC TCTGCCCAGT

361 GAAGCCAGGG GAGTGTCAGC AATGAGGGGC CTGGACAGCG ACGCGGGACC CGCACCTGGG

421 AGCGGCAGAG ATGGAGCCCA GCTCGCTAAA CGTACCCTAA GGGTGCACGC TGACCGTCTT

481 TGATCCCTGG ACAGTGTCCA TAGAGTTGTC TCCCCTTCGG TCACATTGTC CCTGGCCTTG

541 GAGTTTCTGC CTCAGGGAAA GACCCACTAA AACCACCCGG GCTCTGGCGG CCTCCTCTTC

601 TCCAGAGCAT GGTGGCGCTG CCTCCTGAGC AGAGTCACTG AGAGTGGGTC CAAGGAAGGG

661 ACCCACTTAG CAAAGGTTGT TGTGTGGTGC TGTGCAGGCA CGGCGCCTGG AGTTGGCCCC

721 GCCAGCATCG GGGTGGCTTG GGGCTCTGGT TCCTAAGCTT CTGCTTCTGA GTTGGTCAGC

781 ACGTTGGCTC TGCGCCCTGC AGTGCCTGTC TTGATGGTGG TCCCTGGATG TGCTGAAGGG

841 CTCCAGGAGC AAGGGGCCAG ACCACACTTT GGCTGAGTCC TTGGGTAGTT TCAGAGGGTC

901 ACCTGCACAG TCTGCATTGA GCACCCTGGG GCTGAGGCCG TGGGTCAAGT TCTGCCTGAC

961 AGTGCTTTGA TTATATGGTG ATGTGATTCT GCAAACATAA CCCCCCAACT GAGTGCCAGC

1021 TGCTGGGTGT CCCCACCTCA AGAGCCAAGG GGAGGCCACC CTGGAACTGG TGCTTTGCCC

1081 TTGGTGTCAT GGAGGTGTCT GGGTCCACAC GTGGTGGGTG GGCTGGGCCT GCAGTCACCC

1141 AGCATGGCCG GCGGGATTGG ACCCAGCTGC AACCTGGCAG GTTCCATGGG GCTTCCTGTG

1201 CCTGGGCCTG CCTGTGTCTC ATGCCTGTCA ATAAAGGAGA AAGCAACCTG GGAAAAACAT

1261 TCCTCTCTGT ACCCAGGGGA CTGCCAAGTG TGCTTTATAG AAGGCTGGAA GACCGACTGA

1321 AGTAGGAAGC TGTCACATGT TTGAGCCAAC TGAGCAAAGC AGCTGTTGCC TGCCAGACAG

1381 CCACCCCAAG ACCAAGGGAG CCAGCCTGTC GTTTGTCCTT CTAGCCTGGA AGACCAGTTC

1441 TTGGCTCTTG GGCACTCACA TTGGCCACTG GCCGCAACTG CTGAATGGCT TCACTAATGT

1501 CCCTGGGGTC TGTATCTGCC CTCTCTTCTA TTCCCTAGAA ACTCTGGCCG GGCATGGCGG

1561 CTCACGCCTG TCATCCCAGC ACTTAGGGAG GCCGAGGCAG GTGGATCACA AGGTCAAGAG

1621 ATCGACACCA TCCTGGCCAA CATGGTGAAA CCCCATCTCT ACTAAAAATA CAAAAATTAG

1681 CTGGACATGG TGGTGGGCGC CTGTAGTCCC AGCTACTTGG GAGGCTGAGG CAGGAGAATT

1741 GCTTGAACTC GGGAGGCGTA GGTTGCAGTG AGCCAAGATC GCACCACTGC ACTCCAGCCT

1801 GGCAACAGAG CAAGACTCCT CAAAAAACAA ACAAACAAAC AAACAAACTC TGATCATTCA

1861 CCCGGTGGGA TCCCCGGCAG CTCCTTCTCC TTCCCGAGCT GTGGCTCATG GGCCATTGGA

1921 TTCAGAGACC TTGAGCTGGG TGCTGGCTGC CCTAAAGGAG CACGACCCCT CCAGCCTGGT

1981 GACCGGGAGG CAGAGAAGTA CAGCTGTCCT TGACAAAGGC CCAGCCTCTG CTGCCTCAGG

2041 CCAGCCTGGC CACACCCCGA TGCCACAGTC ATCTGCCCCG CCCAGGCATA AGCCTGACTC

2101 AGGTGATACA GGCGAGGCTT CTCCCATGGG TCCTCCCCTT GTGAGAAAGG GTCTGTGCTG

2161 TGTACCTTCT CTCTGCACAG CCATTGTCAG TGTGGTTTGG AGTGAGTGAG CCTCCCGCAC

2221 TGCTCAGGGG AGAGCTGGTG AGCATGACTC CATGGCCTCC TGCTGGTTGG GGTCTTCTTC

2281 TGATTCTCAC CCGCAGGAGC ACCCAGACCC CCCCTTTTCA GAAATGATTA CATTTTCCCT

2341 TCAAGAGTCA TCTGTGACCT GGTGGTGTGG AGGGACTGTC CCCTCCTACA GCCTGGCCGA

2401 CCCCCTTCCT CCAGGGCTGT GTGGAATCAC CTCTCAGAAG CTCACTCTGA AAAGCCACTC

2461 ACAGGGATTT CACCAGGATG TTGCTGTTCC TGCCGACACC TGGGCTTTGT GACAAATGCT

2521 GTTTGCTTCA AGCTGTTCAC AAGCAACTTC ACCAGAATCC CAAGCTGGTT CGTACTTAAT

2581 TTGTATACAT ACATATGCAT AATATACATG ACCATGTATG AGATACAGAC ACCATGGCTT

2641 CTGTCCATTG GCTCATTCGC CTGGTCGCAT GTTCACTCCA TGAGGTTAGG ACACTGCGGG

2701 TGGCTTTGGG ATGGGTGGTT TCAGGGACAA GGTGTCGCTG TGATCTCTGT ATTTGCTAAC

2761 TAGTCTTGTT TTGATTTCTA ACGGTCTAGT GGAGTACAGT TAATGTTGTG TTTATTTCTG

2821 TAACTGTAGG GGTGGAGTAT AAACCTAACT TTTGGAAACA TGAAGAAATA AACAATAGCG

2881 CCAGCCGTTA AAAAAAAAAA AAAAAAAAAA AAAAAA

B: aminoacid sequence (SEQ ID NO:9) length: 103 amino acid

1 MFEPTEQSSC CLPDSHPKTK GASLSFVLLA WKTSSWLLGT HIGHWPQLLN GFTNVPGVCI

61 CPLFYSLETL AGHGGSRLSS QHLGRPRQVD HKVKRSTPSW PTW

C. Nucleotide and amino acid composite sequence (SEQ ID NO:8) clone number and protein name: FP8080 start code: 1337 ATG stop coding: 1646 TGA protein molecular weight: 11425.50Da

1 G AAA AGA CGT TTA ATA AAA CAG CTT TAC CTT TTA AAA ATA AAA TTT CAA TCC CAG TCT 58

59 GCC TCA GGT ACG GGT GGG GGT GTG CAG CGT GAC TCC TGG CTG TGT TCC TGT ACC GTT TGT 118

119 GGC CTC CGG GGT CTT GCG CGT CCA GGG CCC AGA TAC CGA GTA ACT GCC ATG TGG TGT CGT 178

179 GGT GGC GTC CAT AGG CCT CCG TGG TGG GGC GGG GGC GGG GCT GTG GGT GTG CAG GGC AGC 238

239 TGC ATT TTC CAA GAG TGG AGA TGG AGG ATG GAG ATT TGC TTC GTA AAG GCC TGG GAG CTA 298

299 CAT GGG AGG TAG GGA GGG GCC AGC CAC ACC CAG GGC TGG GGG GCC GGT GGA CTC TGC CCA 358

359 GTG AAG CCA GGG GAG TGT CAG CAA TGA GGG GCC TGG ACA GCG ACG CGG GAC CCG CAC CTG 418

419 GGA GCG GCA GAG ATG GAG CCC AGC TCG CTA AAC GTA CCC TAA GGG TGC ACG CTG ACC GTC 478

479 TTT GAT CCC TGG ACA GTG TCC ATA GAG TTG TCT CCC CTT CGG TCA CAT TGT CCC TGG CCT 538

539 TGG AGT TTC TGC CTC AGG GAA AGA CCC ACT AAA ACC ACC CGG GCT CTG GCG GCC TCC TCT 598

599 TCT CCA GAG CAT GGT GGC GCT GCC TCC TGA GCA GAG TCA CTG AGA GTG GGT CCA AGG AAG 658

659 GGA CCC ACT TAG CAA AGG TTG TTG TGT GGT GCT GTG CAG GCA CGG CGC CTG GAG TTG GCC 718

719 CCG CCA GCA TCG GGG TGG CTT GGG GCT CTG GTT CCT AAG CTT CTG CTT CTG AGT TGG TCA 778

779 GCA CGT TGG CTC TGC GCC CTG CAG TGC CTG TCT TGA TGG TGG TCC CTG GAT GTG CTG AAG 838

839 GGC TCC AGG AGC AAG GGG CCA GAC CAC ACT TTG GCT GAG TCC TTG GGT AGT TTC AGA GGG 898

899 TCA CCT GCA CAG TCT GCA TTG AGC ACC CTG GGG CTG AGG CCG TGG GTC AAG TTC TGC CTG 958

959 ACA GTG CTT TGA TTA TAT GGT GAT GTG ATT CTG CAA ACA TAA CCC CCC AAC TGA GTG CCA 1018

1019 GCT GCT GGG TGT CCC CAC CTC AAG AGC CAA GGG GAG GCC ACC CTG GAA CTG GTG CTT TGC 1078

1079 CCT TGG TGT CAT GGA GGT GTC TGG GTC CAC ACG TGG TGG GTG GGC TGG GCC TGC AGT CAC 1138

1139 CCA GCA TGG CCG GCG GGA TTG GAC CCA GCT GCA ACC TGG CAG GTT CCA TGG GGC TTC CTG 1198

1199 TGC CTG GGC CTG CCT GTG TCT CAT GCC TGT CAA TAA AGG AGA AAG CAA CCT GGG AAA AAC 1258

1259 ATT CCT CTC TGT ACC CAG GGG ACT GCC AAG TGT GCT TTA TAG AAG GCT GGA AGA CCG ACT 1318

1319 GAA GTA GGA AGC TGT CAC ATG TTT GAG CCA ACT GAG CAA AGC AGC TGT TGC CTG CCA GAC 1378

1 Met Phe Glu Pro Thr Glu Gln Ser Ser Cys Cys Leu Pro Asp 14

1379 AGC CAC CCC AAG ACC AAG GGA GCC AGC CTG TCG TTT GTC CTT CTA GCC TGG AAG ACC AGT 1438

15 Ser His Pro Lys Thr Lys Gly Ala Ser Leu Ser Phe Val Leu Leu Ala Trp Lys Thr Ser 34

1439 TCT TGG CTC TTG GGC ACT CAC ATT GGC CAC TGG CCG CAA CTG CTG AAT GGC TTC ACT AAT 1498

35 Ser Trp Leu Leu Gly Thr His Ile Gly His Trp Pro Gln Leu Leu Asn Gly Phe Thr Asn 54

1499 GTC CCT GGG GTC TGT ATC TGC CCT CTC TTC TAT TCC CTA GAA ACT CTG GCC GGG CAT GGC 1558

55 Val Pro Gly Val Cys Ile Cys Pro Leu Phe Tyr Ser Leu Glu Thr Leu Ala Gly His G1y 74

1559 GGC TCA CGC CTG TCA TCC CAG CAC TTA GGG AGG CCG AGG CAG GTG GAT CAC AAG GTC AAG 1618

75 Gly Ser Arg Leu Ser Ser Gln His Leu Gly Arg Pro Arg Gln Val Asp His Lys Val Lys 94

1619 AGA TCG ACA CCA TCC TGG CCA ACA TGG TGA AAC CCC ATC TCT ACT AAA AAT ACA AAA ATT 1678

95 Arg Ser Thr Pro Ser Trp Pro Thr Trp *** 104

1679 AGC TGG ACA TGG TGG TGG GCG CCT GTA GTC CCA GCT ACT TGG GAG GCT GAG GCA GGA GAA 1738

1739 TTG CTT GAA CTC GGG AGG CGT AGG TTG CAG TGA GCC AAG ATC GCA CCA CTG CAC TCC AGC 1798

1799 CTG GCA ACA GAG CAA GAC TCC TCA AAA AAC AAA CAA ACA AAC AAA CAA ACT CTG ATC ATT 1858

1859 CAC CCG GTG GGA TCC CCG GCA GCT CCT TCT CCT TCC CGA GCT GTG GCT CAT GGG CCA TTG 1918

1919 GAT TCA GAG ACC TTG AGC TGG GTG CTG GCT GCC CTA AAG GAG CAC GAC CCC TCC AGC CTG 1978

1979 GTG ACC GGG AGG CAG AGA AGT ACA GCT GTC CTT GAC AAA GGC CCA GCC TCT GCT GCC TCA 2038

2039 GGC CAG CCT GGC CAC ACC CCG ATG CCA CAG TCA TCT GCC CCG CCC AGG CAT AAG CCT GAC 2098

2099 TCA GGT GAT ACA GGC GAG GCT TCT CCC ATG GGT CCT CCC CTT GTG AGA AAG GGT CTG TGC 2158

2159 TGT GTA CCT TCT CTC TGC ACA GCC ATT GTC AGT GTG GTT TGG AGT GAG TGA GCC TCC CGC 2218

2219 ACT GCT CAG GGG AGA GCT GGT GAG CAT GAC TCC ATG GCC TCC TGC TGG TTG GGG TCT TCT 2278

2279 TCT GAT TCT CAC CCG CAG GAG CAC CCA GAC CCC CCC TTT TCA GAA ATG ATT ACA TTT TCC 2338

2339 CTT CAA GAG TCA TCT GTG ACC TGG TGG TGT GGA GGG ACT GTC CCC TCC TAC AGC CTG GCC 2398

2399 GAC CCC CTT CCT CCA GGG CTG TGT GGA ATC ACC TCT CAG AAG CTC ACT CTG AAA AGC CAC 2458

2459 TCA CAG GGA TTT CAC CAG GAT GTT GCT GTT CCT GCC GAC ACC TGG GCT TTG TGA CAA ATG 2518

2519 CTG TTT GCT TCA AGC TGT TCA CAA GCA ACT TCA CCA GAA TCC CAA GCT GGT TCG TAC TTA 2578

2579 ATT TGT ATA CAT ACA TAT GCA TAA TAT ACA TGA CCA TGT ATG AGA TAC AGA CAC CAT GGC 2638

2639 TTC TGT CCA TTG GCT CAT TCG CCT GGT CGC ATG TTC ACT CCA TGA GGT TAG GAC ACT GCG 2698

2699 GGT GGC TTT GGG ATG GGT GGT TTC AGG GAC AAG GTG TCG CTG TGA TCT CTG TAT TTG CTA 2758

2759 ACT AGT CTT GTT TTG ATT TCT AAC GGT CTA GTG GAG TAC AGT TAA TGT TGT GTT TAT TTC 2818

2819 TGT AAC TGT AGG GGT GGA GTA TAA ACC TAA CTT TTG GAA ACA TGA AGA AAT AAA CAA TAG 2878

2879 CGC CAG CCG TTA AAA AAA AAA AAAAAA AAA AAA AAAAA 2916

All quote in this application as a reference at all documents that the present invention mentions, just quoted as a reference separately as each piece document.Should be understood that in addition those skilled in the art can make various changes or modifications the present invention after having read above-mentioned teachings of the present invention, these equivalent form of values fall within the application's appended claims institute restricted portion equally.

Sequence table

＜110〉Shanghai Xinshijie Gene Techn Development Co., Ltd.

＜120〉have new the people's albumen and the encoding sequence thereof of cancer suppressing function

<130>022535

<160>15

<170>PatentIn version 3.0

<210>1

<211>3046

<212>DNA

＜213〉homo sapiens (Homo sapiens)

<400>1

ggtttcacca tgatggtcag gctggtcaag atctcctgac ctcaagtgat ctgcccgcct 60

cggcctccca aagtgctggg attacagaca tgaaccacaa cacctgccaa gcccaggtgt 120

tttgaatcca gcacgggcaa tggcaagacc ctgtcccaac aaacaaaaaa acactggtgc 180

ctgaggccca ccccgagaga ttctgattga gttgatttgg atttgaatta tctttaaaat 240

tttggatgtg aatttttttt ttttttcttg gaatggagtt tcactcttgt tccccaggct 300

ggagtgcaac gggacgatct tggctcaccg caacctccgc ctactgggtt caagccattc 360

tcctgcctca gcttcccaag tagctgggat tacaggcatg tgctaccatg cccagctaat 420

tttgtatttt tagtagagac agggtttctc catgttgctc aggctggtct tgaactcctg 480

acctcaggtg atccgcccgc ctcggactcc caaagtgctg ggattacagg catgagccac 540

catgcctggc cggatgtgaa ttatcttaaa aattttcagg taattctaat gggccaaggt 600

tgagaacccc tgctctgggc ccatccgagc accaggctgt cacaaacgca tgcatgcact 660

cacgcccgtg ggcttggggg gcttcggaaa tgtgcttctg cttttttgag atggggtctt 720

tctgttgccc atcccggagc acagtggcac gatcacagct cactgcagcc tcgacctcct 780

gggctcaggt gatcctcccg cctcagcctc ctgagtgtct gcttctggtt ttcatgatga 840

cctggggccc aggcatacta cacttgtgct gttcaggggc cagtcctgca ccaggagccc 900

atcagccaca gctccgccga gaagcactga tatgcagagc taagcagctt tgtttccacg 960

tggatcctgc gtaggttttc ttggtccatc cgtagacacc gcactcctgc agaggatctt 1020

ctcgggatgc cccactgtct ctgttttccc tcttcactga acactcagtc ggggctcgcc 1080

atgatgcctc tgtgtctgct ggcttctccc ccattggaac agccttcttg gcacgccaca 1140

ctgctagctg ctgggcactg tgctttctgc ctttaccgtt ctgccgtgat gttgccaaaa 1200

tagcagcaac aacaacaaca acaaaggctg ggcacctggc tcatgcctgt attcccagca 1260

gttcgagatc agcctgggca acatggtgag accctatctc tacaaaaata aaaaatgaaa 1320

tgagctgggt ggggtggcgc atgcatgcct gtggtcccag ctacttggga ggctgaggtc 1380

ggaagctcgc tggtcggaag atcgctggag cttagccttg aggtcaaggt tgcagtgagc 1440

cgagattaca tcactgcact ccagcgtggg agacagagac cctgtattaa caaacaaaaa 1500

cacaaaccac aaagggcagg tctgaaactg ccatttaaaa aaaaatttga taaacttaaa 1560

aaaatatata tccacagatg caggtgaaga acctgttgtc ttcctcaagc ctctttttca 1620

cccatgggtg gaaatggtgc cctggacacc caggcccacg aggtctttgc gtggggtccc 1680

tacacagggc tttagcttac actgtgctgc cctcctgtcc cccgagttcc cagtctgtca 1740

aaatccaacc tggtctccca ggcccagggc aaatgccacc tcctccatga agcctgccac 1800

atcctttgca cacccttggg cgctgacctt gttctcccag cgcacaggca cgggtagttt 1860

gcccctgtag tagtaactca ggcacaaaac gaactcttgc tgaggctcgg ccgcgcagag 1920

ctgagggttg ccgcttccag gttcaagtgc attttgagtt tcattcccag cttccttctt 1980

tttctggtct ttaatttctt ctccggatta ggtcccactc aatgctttcc ttctcaattt 2040

ccaaaagagt atggtcagag ccagcagcac accaccttcc ccatgggtgg ggggggggcc 2100

agcctgtggc gggggtgcgg gtcccatctt ttcgaaggaa ttgacccaca gtgggcgggt 2160

ccacctttga ccttgcccca gggagcgcag acagaaaaaa gatccttgct tagtttgagg 2220

ggccgctggg gtgctcggtt tgtcttcaga ggcctgtctg taacaccaat gccaacccgg 2280

tggcactgac tggtcaccct gaaggccacg gccagtgtcc taggaaggga ctcaatttct 2340

agctgtgcca cctgagattc tggggttagg ctggttgtgc ttctgaagtt ccactgtgct 2400

caaagtgctt ggtgaaagtt agcgaaggtg attttacaaa aatagatgca taaaatgtct 2460

aggaaacaca aaaaatcctc attactcttc tctccaaata ttttttaagc cccaactgga 2520

ccctaggcaa aagtgagtgg cactcctctg ccaggactcc aggcaagccc cggcatcttc 2580

ttgctgccgt cccagacaac agaagttacc agatgaacag acttggatgg gccacggggg 2640

tggagagctg gaaagcttgg ctgtgcctct cgatgatgat taagatttca atatttacag 2700

caaaaccaca aagcaaatga tagaataaag caaaacaatg gaaaatgtga gttcactcgt 2760

gagagaggta cgtatgtgag ctctgaggaa attacagagg gaacgcatgc agcgggacag 2820

ctctcccaat cgcagcgtgc aaagtagaca tccatagtgt cttttgaaaa atgaaaaaca 2880

cattactttg aacagccaag aaaaaaattg caatttatta agattcaata aagcgttgta 2940

ctttcgaaag ccaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 3000

aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaa 3046

<210>2

<211>3046

<212>DNA

＜213〉homo sapiens (Homo sapiens)

<220>

<221>CDS

<222>(654)..(971)

<400>2

ggtttcacca tgatggtcag gctggtcaag atctcctgac ctcaagtgat ctgcccgcct 60

cggcctccca aagtgctggg attacagaca tgaaccacaa cacctgccaa gcccaggtgt 120

tttgaatcca gcacgggcaa tggcaagacc ctgtcccaac aaacaaaaaa acactggtgc 180

ctgaggccca ccccgagaga ttctgattga gttgatttgg atttgaatta tctttaaaat 240

tttggatgtg aatttttttt ttttttcttg gaatggagtt tcactcttgt tccccaggct 300

ggagtgcaac gggacgatct tggctcaccg caacctccgc ctactgggtt caagccattc 360

tcctgcctca gcttcccaag tagctgggat tacaggcatg tgctaccatg cccagctaat 420

tttgtatttt tagtagagac agggtttctc catgttgctc aggctggtct tgaactcctg 480

acctcaggtg atccgcccgc ctcggactcc caaagtgctg ggattacagg catgagccac 540

catgcctggc cggatgtgaa ttatcttaaa aattttcagg taattctaat gggccaaggt 600

tgagaacccc tgctctgggc ccatccgagc accaggctgt cacaaacgca tgc atg 656

Met

1

cac tca cgc ccg tgg gct tgg ggg gct tcg gaa atg tgc ttc tgc ttt 704

His Ser Arg Pro Trp Ala Trp Gly Ala Ser Glu Met Cys Phe Cys Phe

5 10 15

ttt gag atg ggg tct ttc tgt tgc cca tcc cgg agc aca gtg gca cga 752

Phe Glu Met Gly Ser Phe Cys Cys Pro Ser Arg Ser Thr Val Ala Arg

20 25 30

tca cag ctc act gca gcc tcg acc tcc tgg gct cag gtg atc ctc ccg 800

Ser Gln Leu Thr Ala Ala Ser Thr Ser Trp Ala Gln Val Ile Leu Pro

35 40 45

cct cag cct cct gag tgt ctg ctt ctg gtt ttc atg atg acc tgg ggc 848

Pro Gln Pro Pro Glu Cys Leu Leu Leu Val Phe Met Met Thr Trp Gly

50 55 60 65

cca ggc ata cta cac ttg tgc tgt tca ggg gcc agt cct gca cca gga 896

Pro Gly Ile Leu His Leu Cys Cys Ser Gly Ala Ser Pro Ala Pro Gly

70 75 80

gcc cat cag cca cag ctc cgc cga gaa gca ctg ata tgc aga gct aag 944

Ala His Gln Pro Gln Leu Arg Arg Glu Ala Leu Ile Cys Arg Ala Lys

85 90 95

cag ctt tgt ttc cac gtg gat cct gcg taggttttct tggtccatcc 991

Gln Leu Cys Phe His Val Asp Pro Ala

100 105

gtagacaccg cactcctgca gaggatcttc tcgggatgcc ccactgtctc tgttttccct 1051

cttcactgaa cactcagtcg gggctcgcca tgatgcctct gtgtctgctg gcttctcccc 1111

cattggaaca gccttcttgg cacgccacac tgctagctgc tgggcactgt gctttctgcc 1171

tttaccgttc tgccgtgatg ttgccaaaat agcagcaaca acaacaacaa caaaggctgg 1231

gcacctggct catgcctgta ttcccagcag ttcgagatca gcctgggcaa catggtgaga 1291

ccctatctct acaaaaataa aaaatgaaat gagctgggtg gggtggcgca tgcatgcctg 1351

tggtcccagc tacttgggag gctgaggtcg gaagctcgct ggtcggaaga tcgctggagc 1411

ttagccttga ggtcaaggtt gcagtgagcc gagattacat cactgcactc cagcgtggga 1471

gacagagacc ctgtattaac aaacaaaaac acaaaccaca aagggcaggt ctgaaactgc 1531

catttaaaaa aaaatttgat aaacttaaaa aaatatatat ccacagatgc aggtgaagaa 1591

cctgttgtct tcctcaagcc tctttttcac ccatgggtgg aaatggtgcc ctggacaccc 1651

aggcccacga ggtctttgcg tggggtccct acacagggct ttagcttaca ctgtgctgcc 1711

ctcctgtccc ccgagttccc agtctgtcaa aatccaacct ggtctcccag gcccagggca 1771

aatgccacct cctccatgaa gcctgccaca tcctttgcac acccttgggc gctgaccttg 1831

ttctcccagc gcacaggcac gggtagtttg cccctgtagt agtaactcag gcacaaaacg 1891

aactcttgct gaggctcggc cgcgcagagc tgagggttgc cgcttccagg ttcaagtgca 1951

ttttgagttt cattcccagc ttccttcttt ttctggtctt taatttcttc tccggattag 2011

gtcccactca atgctttcct tctcaatttc caaaagagta tggtcagagc cagcagcaca 2071

ccaccttccc catgggtggg gggggggcca gcctgtggcg ggggtgcggg tcccatcttt 2131

tcgaaggaat tgacccacag tgggcgggtc cacctttgac cttgccccag ggagcgcaga 2191

cagaaaaaag atccttgctt agtttgaggg gccgctgggg tgctcggttt gtcttcagag 2251

gcctgtctgt aacaccaatg ccaacccggt ggcactgact ggtcaccctg aaggccacgg 2311

ccagtgtcct aggaagggac tcaatttcta gctgtgccac ctgagattct ggggttaggc 2371

tggttgtgct tctgaagttc cactgtgctc aaagtgcttg gtgaaagtta gcgaaggtga 2431

ttttacaaaa atagatgcat aaaatgtcta ggaaacacaa aaaatcctca ttactcttct 2491

ctccaaatat tttttaagcc ccaactggac cctaggcaaa agtgagtggc actcctctgc 2551

caggactcca ggcaagcccc ggcatcttct tgctgccgtc ccagacaaca gaagttacca 2611

gatgaacaga cttggatggg ccacgggggt ggagagctgg aaagcttggc tgtgcctctc 2671

gatgatgatt aagatttcaa tatttacagc aaaaccacaa agcaaatgat agaataaagc 2731

aaaacaatgg aaaatgtgag ttcactcgtg agagaggtac gtatgtgagc tctgaggaaa 2791

ttacagaggg aacgcatgca gcgggacagc tctcccaatc gcagcgtgca aagtagacat 2851

ccatagtgtc ttttgaaaaa tgaaaaacac attactttga acagccaaga aaaaaattgc 2911

aatttattaa gattcaataa agcgttgtac tttcgaaagc caaaaaaaaa aaaaaaaaaa 2971

aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 3031

aaaaaaaaaa aaaaa 3046

<210>3

<211>106

<212>PRT

＜213〉homo sapiens (Homo sapiens)

<400>3

Met His Ser Arg Pro Trp Ala Trp Gly Ala Ser Glu Met Cys Phe Cys

1 5 10 15

Phe Phe Glu Met Gly Ser Phe Cys Cys Pro Ser Arg Ser Thr Val Ala

20 25 30

Arg Ser Gln Leu Thr Ala Ala Ser Thr Ser Trp Ala Gln Val Ile Leu

35 40 45

Pro Pro Gln Pro Pro Glu Cys Leu Leu Leu Val Phe Met Met Thr Trp

50 55 60

Gly Pro Gly Ile Leu His Leu Cys Cys Ser Gly Ala Ser Pro Ala Pro

65 70 75 80

Gly Ala His Gln Pro Gln Leu Arg Arg Glu Ala Leu Ile Cys Arg Ala

85 90 95

Lys Gln Leu Cys Phe His Val Asp Pro Ala

100 105

<210>4

<211>2224

<212>DNA

＜213〉homo sapiens (Homo sapiens)

<400>4

gggacctggg ggctgtggcc gggggcggcc gttgacctgg tgaccgcggc gccgccccag 60

accgggggcg cagtcccact cgctccgagc cccggtcccc caagcctccc tcccgggtac 120

ctggggccgc gcccgccctg cgcccagctc cgccctccgt cggcccaggc ctgacagagc 180

ccggcagcca tgagtgccaa cccccggtgg gacatcagca gggcgctggg ggtggccaag 240

ctcttccacc tggtgtgcgg ggtgcgggaa gcctgcgtga ccccgttcct gaccctttac 300

ctgaggcagc tgggcttggc cgcgccctgg gtgggcaccc taatgggaac caagcaccta 360

atcgctgcct tctgggctcc cgtctgtgcc ttcctggcca aaagctaccg gaaaaggaga 420

gcgcttctga tcggctccct gctcggctcg gtgggggcca gcctgctgat ggtcctggtc 480

ccaccggtag acaaaaatcg ggtgcacttc ccttgtaatg gaagcagcgg cctgaccagc 540

acagacgcac tcccgggggt cacgctacct gtgaacatca cctcggccca agagtctgcc 600

tccagccacc cagccaagag gactgcagag gtggaaatgc ctggcttcag aaacccacct 660

ggtgaaagtg accgagaaac tttccgtgat ctgcacgtct acttagcgcc ctccgttgaa 720

ggagctagga ccacatccca agctctcctc catcctgtca cttcggggct gaaagatcat 780

ccctgggaag ttacttttga ggtggtcaag acagccctcc ccttgcttcc tggggggaaa 840

gggcccggga atccagccaa tttgtcaggg accaagggga aagcctgggc ttttgacctg 900

tccttggagg cgttgcggcg gacttttatc ctctccttgg ggtccgtggc gttctgggag 960

ctgctgacag cgcctctgga gcaggtggca gatgacagcc tttatgagtt cctggatttt 1020

gtggatgcca ctgaccgata cagaagcctg tgggtctgga ggttgctggg catgtcggca 1080

ggcgtgtgtg gcatcacagc cttggtgggg cagctggact gcttcctgat gaccagtggc 1140

ccccgaggtg tggtccactt ctatgggtac tcggtggtca gcaccctggc cttactggtg 1200

agcattgcct ttcccattcc catctgtcag cagtgggagc ccagctacaa aagggtcaaa 1260

gcactgtcca ttgtgggggg ggtgaccccc acctcattct cctcgcctcc accactgttt 1320

tggtaggagc catcgtcagt actgtccaga actttctgtt ctggcacatg aaggaccatg 1380

ggagcggcga gctggtcatg ggtttctcgg tcgccctcag cttgctgggg gaaattctgc 1440

ttcatccgtt caaagctaca ttgcttagga aactgtccag gacgggcctg gtggggctgg 1500

ggctgagctg cctcgctggg cagctgctgt actactcttt cctctggagc tggtggtccg 1560

tcctccccat tcagatcttg agtgccatta gcaacagagc tttgtggtgg gctgtggggg 1620

cctcagtaga ggacctggcc actccccgca tggagagggc tctgagtgcc ttgttccgag 1680

gccactttta cgggagtggc tgtagcctgg gcagctttgt cgggggcttc gtggtgatgc 1740

gcttcagcct ggctgtgctc taccaggcct gctgtgtggc cctgttgctc tggttggcct 1800

tgctcctgtc catacagcgg aggctgcccc gagagcggaa aatcaagtac tcgaagctgc 1860

tgtccatgga ggtgagtgac accagtgact ctgagcaggg gacagaacag gactggcttg 1920

tgaaggccat gagggaggaa cactcagact gaaagggctg agaaatccag agtgtgctga 1980

tccagcaagg aacgaatgga ctgaacaaaa ctcagcctgc tgaggacaga aacctgccct 2040

ggactgctgg gagccgggga agagaggatg ggtctgtgct gaaggcccaa caggatcatc 2100

tcattgcatg attttcttta cttttgaagt aaaaggagat ttaacttttt gccaatcttt 2160

tttagataat ggaggaagaa tacatttgct ttttaaaaag ttaaaaaaaa aaaaaaaaaa 2220

aaaa 2224

<210>5

<211>2224

<212>DNA

＜213〉homo sapiens (Homo sapiens)

<220>

<221>CDS

<222>(190)..(1323)

<400>5

gggacctggg ggctgtggcc gggggcggcc gttgacctgg tgaccgcggc gccgccccag 60

accgggggcg cagtcccact cgctccgagc cccggtcccc caagcctccc tcccgggtac 120

ctggggccgc gcccgccctg cgcccagctc cgccctccgt cggcccaggc ctgacagagc 180

ccggcagcc atg agt gcc aac ccc cgg tgg gac atc agc agg gcg ctg ggg 231

Met Ser Ala Asn Pro Arg Trp Asp Ile Ser Arg Ala Leu Gly

1 5 10

gtg gcc aag ctc ttc cac ctg gtg tgc ggg gtg cgg gaa gcc tgc gtg 279

Val Ala Lys Leu Phe His Leu Val Cys Gly Val Arg Glu Ala Cys Val

15 20 25 30

acc ccg ttc ctg acc ctt tac ctg agg cag ctg ggc ttg gcc gcg ccc 327

Thr Pro Phe Leu Thr Leu Tyr Leu Arg Gln Leu Gly Leu Ala Ala Pro

35 40 45

tgg gtg ggc acc cta atg gga acc aag cac cta atc gct gcc ttc tgg 375

Trp Val Gly Thr Leu Met Gly Thr Lys His Leu Ile Ala Ala Phe Trp

50 55 60

gct ccc gtc tgt gcc ttc ctg gcc aaa agc tac cgg aaa agg aga gcg 423

Ala Pro Val Cys Ala Phe Leu Ala Lys Ser Tyr Arg Lys Arg Arg Ala

65 70 75

ctt ctg atc ggc tcc ctg ctc ggc tcg gtg ggg gcc agc ctg ctg atg 471

Leu Leu Ile Gly Ser Leu Leu Gly Ser Val Gly Ala Ser Leu Leu Met

80 85 90

gtc ctg gtc cca ccg gta gac aaa aat cgg gtg cac ttc cct tgt aat 519

Val Leu Val Pro Pro Val Asp Lys Asn Arg Val His Phe Pro Cys Asn

95 100 105 110

gga agc agc ggc ctg acc agc aca gac gca ctc ccg ggg gtc acg cta 567

Gly Ser Ser Gly Leu Thr Ser Thr Asp Ala Leu Pro Gly Val Thr Leu

115 120 125

cct gtg aac atc acc tcg gcc caa gag tct gcc tcc agc cac cca gcc 615

Pro Val Asn Ile Thr Ser Ala Gln Glu Ser Ala Ser Ser His Pro Ala

130 135 140

aag agg act gca gag gtg gaa atg cct ggc ttc aga aac cca cct ggt 663

Lys Arg Thr Ala Glu Val Glu Met Pro Gly Phe Arg Asn Pro Pro Gly

145 150 155

gaa agt gac cga gaa act ttc cgt gat ctg cac gtc tac tta gcg ccc 711

Glu Ser Asp Arg Glu Thr Phe Arg Asp Leu His Val Tyr Leu Ala Pro

160 165 170

tcc gtt gaa gga gct agg acc aca tcc caa gct ctc ctc cat cct gtc 759

Ser Val Glu Gly Ala Arg Thr Thr Ser Gln Ala Leu Leu His Pro Val

175 180 185 190

act tcg ggg ctg aaa gat cat ccc tgg gaa gtt act ttt gag gtg gtc 807

Thr Ser Gly Leu Lys Asp His Pro Trp Glu Val Thr Phe Glu Val Val

195 200 205

aag aca gcc ctc ccc ttg ctt cct ggg ggg aaa ggg ccc ggg aat cca 855

Lys Thr Ala Leu Pro Leu Leu Pro Gly Gly Lys Gly Pro Gly Asn Pro

210 215 220

gcc aat ttg tca ggg acc aag ggg aaa gcc tgg gct ttt gac ctg tcc 903

Ala Asn Leu Ser Gly Thr Lys Gly Lys Ala Trp Ala Phe Asp Leu Ser

225 230 235

ttg gag gcg ttg cgg cgg act ttt atc ctc tcc ttg ggg tcc gtg gcg 951

Leu Glu Ala Leu Arg Arg Thr Phe Ile Leu Ser Leu Gly Ser Val Ala

240 245 250

ttc tgg gag ctg ctg aca gcg cct ctg gag cag gtg gca gat gac agc 999

Phe Trp Glu Leu Leu Thr Ala Pro Leu Glu Gln Val Ala Asp Asp Ser

255 260 265 270

ctt tat gag ttc ctg gat ttt gtg gat gcc act gac cga tac aga agc 1047

Leu Tyr Glu Phe Leu Asp Phe Val Asp Ala Thr Asp Arg Tyr Arg Ser

275 280 285

ctg tgg gtc tgg agg ttg ctg ggc atg tcg gca ggc gtg tgt ggc atc 1095

Leu Trp Val Trp Arg Leu Leu Gly Met Ser Ala Gly Val Cys Gly Ile

290 295 300

aca gcc ttg gtg ggg cag ctg gac tgc ttc ctg atg acc agt ggc ccc 1143

Thr Ala Leu Val Gly Gln Leu Asp Cys Phe Leu Met Thr Ser Gly Pro

305 310 315

cga ggt gtg gtc cac ttc tat ggg tac tcg gtg gtc agc acc ctg gcc 1191

Arg Gly Val Val His Phe Tyr Gly Tyr Ser Val Val Ser Thr Leu Ala

320 325 330

tta ctg gtg agc att gcc ttt ccc att ccc atc tgt cag cag tgg gag 1239

Leu Leu Val Ser Ile Ala Phe Pro Ile Pro Ile Cys Gln Gln Trp Glu

335 340 345 350

ccc agc tac aaa agg gtc aaa gca ctg tcc att gtg ggg ggg gtg acc 1287

Pro Ser Tyr Lys Arg Val Lys Ala Leu Ser Ile Val Gly Gly Val Thr

355 360 365

ccc acc tca ttc tcc tcg cct cca cca ctg ttt tgg taggagccat 1333

Pro Thr Ser Phe Ser Ser Pro Pro Pro Leu Phe Trp

370 375

cgtcagtact gtccagaact ttctgttctg gcacatgaag gaccatggga gcggcgagct 1393

ggtcatgggt ttctcggtcg ccctcagctt gctgggggaa attctgcttc atccgttcaa 1453

agctacattg cttaggaaac tgtccaggac gggcctggtg gggctggggc tgagctgcct 1513

cgctgggcag ctgctgtact actctttcct ctggagctgg tggtccgtcc tccccattca 1573

gatcttgagt gccattagca acagagcttt gtggtgggct gtgggggcct cagtagagga 1633

cctggccact ccccgcatgg agagggctct gagtgccttg ttccgaggcc acttttacgg 1693

gagtggctgt agcctgggca gctttgtcgg gggcttcgtg gtgatgcgct tcagcctggc 1753

tgtgctctac caggcctgct gtgtggccct gttgctctgg ttggccttgc tcctgtccat 1813

acagcggagg ctgccccgag agcggaaaat caagtactcg aagctgctgt ccatggaggt 1873

gagtgacacc agtgactctg agcaggggac agaacaggac tggcttgtga aggccatgag 1933

ggaggaacac tcagactgaa agggctgaga aatccagagt gtgctgatcc agcaaggaac 1993

gaatggactg aacaaaactc agcctgctga ggacagaaac ctgccctgga ctgctgggag 2053

ccggggaaga gaggatgggt ctgtgctgaa ggcccaacag gatcatctca ttgcatgatt 2113

ttctttactt ttgaagtaaa aggagattta actttttgcc aatctttttt agataatgga 2173

ggaagaatac atttgctttt taaaaagtta aaaaaaaaaa aaaaaaaaaa a 2224

<210>6

<211>378

<212>PRT

＜213〉homo sapiens (Homo sapiens)

<400>6

Met Ser Ala Asn Pro Arg Trp Asp Ile Ser Arg Ala Leu Gly Val Ala

1 5 10 15

Lys Leu Phe His Leu Val Cys Gly Val Arg Glu Ala Cys Val Thr Pro

20 25 30

Phe Leu Thr Leu Tyr Leu Arg Gln Leu Gly Leu Ala Ala Pro Trp Val

35 40 45

Gly Thr Leu Met Gly Thr Lys His Leu Ile Ala Ala Phe Trp Ala Pro

50 55 60

Val Cys Ala Phe Leu Ala Lys Ser Tyr Arg Lys Arg Arg Ala Leu Leu

65 70 75 80

Ile Gly Ser Leu Leu Gly Ser Val Gly Ala Ser Leu Leu Met Val Leu

85 90 95

Val Pro Pro Val Asp Lys Asn Arg Val His Phe Pro Cys Asn Gly Ser

100 105 110

Ser Gly Leu Thr Ser Thr Asp Ala Leu Pro Gly Val Thr Leu Pro Val

115 120 125

Asn Ile Thr Ser Ala Gln Glu Ser Ala Ser Ser His Pro Ala Lys Arg

130 135 140

Thr Ala Glu Val Glu Met Pro Gly Phe Arg Asn Pro Pro Gly Glu Ser

145 150 155 160

Asp Arg Glu Thr Phe Arg Asp Leu His Val Tyr Leu Ala Pro Ser Val

165 170 175

Glu Gly Ala Arg Thr Thr Ser Gln Ala Leu Leu His Pro Val Thr Ser

180 185 190

Gly Leu Lys Asp His Pro Trp Glu Val Thr Phe Glu Val Val Lys Thr

195 200 205

Ala Leu Pro Leu Leu Pro Gly Gly Lys Gly Pro Gly Asn Pro Ala Asn

210 215 220

Leu Ser Gly Thr Lys Gly Lys Ala Trp Ala Phe Asp Leu Ser Leu Glu

225 230 235 240

Ala Leu Arg Arg Thr Phe Ile Leu Ser Leu Gly Ser Val Ala Phe Trp

245 250 255

Glu Leu Leu Thr Ala Pro Leu Glu Gln Val Ala Asp Asp Ser Leu Tyr

260 265 270

Glu Phe Leu Asp Phe Val Asp Ala Thr Asp Arg Tyr Arg Ser Leu Trp

275 280 285

Val Trp Arg Leu Leu Gly Met Ser Ala Gly Val Cys Gly Ile Thr Ala

290 295 300

Leu Val Gly Gln Leu Asp Cys Phe Leu Met Thr Ser Gly Pro Arg Gly

305 310 315 320

Val Val His Phe Tyr Gly Tyr Ser Val Val Ser Thr Leu Ala Leu Leu

325 330 335

Val Ser Ile Ala Phe Pro Ile Pro Ile Cys Gln Gln Trp Glu Pro Ser

340 345 350

Tyr Lys Arg Val Lys Ala Leu Ser Ile Val Gly Gly Val Thr Pro Thr

355 360 365

Ser Phe Ser Ser Pro Pro Pro Leu Phe Trp

370 375

<210>7

<211>2916

<212>DNA

＜213〉homo sapiens (Homo sapiens)

<400>7

gaaaagacgt ttaataaaac agctttacct tttaaaaata aaatttcaat cccagtctgc 60

ctcaggtacg ggtgggggtg tgcagcgtga ctcctggctg tgttcctgta ccgtttgtgg 120

cctccggggt cttgcgcgtc cagggcccag ataccgagta actgccatgt ggtgtcgtgg 180

tggcgtccat aggcctccgt ggtggggcgg gggcggggct gtgggtgtgc agggcagctg 240

cattttccaa gagtggagat ggaggatgga gatttgcttc gtaaaggcct gggagctaca 300

tgggaggtag ggaggggcca gccacaccca gggctggggg gccggtggac tctgcccagt 360

gaagccaggg gagtgtcagc aatgaggggc ctggacagcg acgcgggacc cgcacctggg 420

agcggcagag atggagccca gctcgctaaa cgtaccctaa gggtgcacgc tgaccgtctt 480

tgatccctgg acagtgtcca tagagttgtc tccccttcgg tcacattgtc cctggccttg 540

gagtttctgc ctcagggaaa gacccactaa aaccacccgg gctctggcgg cctcctcttc 600

tccagagcat ggtggcgctg cctcctgagc agagtcactg agagtgggtc caaggaaggg 660

acccacttag caaaggttgt tgtgtggtgc tgtgcaggca cggcgcctgg agttggcccc 720

gccagcatcg gggtggcttg gggctctggt tcctaagctt ctgcttctga gttggtcagc 780

acgttggctc tgcgccctgc agtgcctgtc ttgatggtgg tccctggatg tgctgaaggg 840

ctccaggagc aaggggccag accacacttt ggctgagtcc ttgggtagtt tcagagggtc 900

acctgcacag tctgcattga gcaccctggg gctgaggccg tgggtcaagt tctgcctgac 960

agtgctttga ttatatggtg atgtgattct gcaaacataa ccccccaact gagtgccagc 1020

tgctgggtgt ccccacctca agagccaagg ggaggccacc ctggaactgg tgctttgccc 1080

ttggtgtcat ggaggtgtct gggtccacac gtggtgggtg ggctgggcct gcagtcaccc 1140

agcatggccg gcgggattgg acccagctgc aacctggcag gttccatggg gcttcctgtg 1200

cctgggcctg cctgtgtctc atgcctgtca ataaaggaga aagcaacctg ggaaaaacat 1260

tcctctctgt acccagggga ctgccaagtg tgctttatag aaggctggaa gaccgactga 1320

agtaggaagc tgtcacatgt ttgagccaac tgagcaaagc agctgttgcc tgccagacag 1380

ccaccccaag accaagggag ccagcctgtc gtttgtcctt ctagcctgga agaccagttc 1440

ttggctcttg ggcactcaca ttggccactg gccgcaactg ctgaatggct tcactaatgt 1500

ccctggggtc tgtatctgcc ctctcttcta ttccctagaa actctggccg ggcatggcgg 1560

ctcacgcctg tcatcccagc acttagggag gccgaggcag gtggatcaca aggtcaagag 1620

atcgacacca tcctggccaa catggtgaaa ccccatctct actaaaaata caaaaattag 1680

ctggacatgg tggtgggcgc ctgtagtccc agctacttgg gaggctgagg caggagaatt 1740

gcttgaactc gggaggcgta ggttgcagtg agccaagatc gcaccactgc actccagcct 1800

ggcaacagag caagactcct caaaaaacaa acaaacaaac aaacaaactc tgatcattca 1860

cccggtggga tccccggcag ctccttctcc ttcccgagct gtggctcatg ggccattgga 1920

ttcagagacc ttgagctggg tgctggctgc cctaaaggag cacgacccct ccagcctggt 1980

gaccgggagg cagagaagta cagctgtcct tgacaaaggc ccagcctctg ctgcctcagg 2040

ccagcctggc cacaccccga tgccacagtc atctgccccg cccaggcata agcctgactc 2100

aggtgataca ggcgaggctt ctcccatggg tcctcccctt gtgagaaagg gtctgtgctg 2160

tgtaccttct ctctgcacag ccattgtcag tgtggtttgg agtgagtgag cctcccgcac 2220

tgctcagggg agagctggtg agcatgactc catggcctcc tgctggttgg ggtcttcttc 2280

tgattctcac ccgcaggagc acccagaccc ccccttttca gaaatgatta cattttccct 2340

tcaagagtca tctgtgacct ggtggtgtgg agggactgtc ccctcctaca gcctggccga 2400

cccccttcct ccagggctgt gtggaatcac ctctcagaag ctcactctga aaagccactc 2460

acagggattt caccaggatg ttgctgttcc tgccgacacc tgggctttgt gacaaatgct 2520

gtttgcttca agctgttcac aagcaacttc accagaatcc caagctggtt cgtacttaat 2580

ttgtatacat acatatgcat aatatacatg accatgtatg agatacagac accatggctt 2640

ctgtccattg gctcattcgc ctggtcgcat gttcactcca tgaggttagg acactgcggg 2700

tggctttggg atgggtggtt tcagggacaa ggtgtcgctg tgatctctgt atttgctaac 2760

tagtcttgtt ttgatttcta acggtctagt ggagtacagt taatgttgtg tttatttctg 2820

taactgtagg ggtggagtat aaacctaact tttggaaaca tgaagaaata aacaatagcg 2880

ccagccgtta aaaaaaaaaa aaaaaaaaaa aaaaaa 2916

<210>8

<211>2916

<212>DNA

＜213〉homo sapiens (Homo sapiens)

<220>

<221>CDS

<222>(1337)..(1645)

<400>8

gaaaagacgt ttaataaaac agctttacct tttaaaaata aaatttcaat cccagtctgc 60

ctcaggtacg ggtgggggtg tgcagcgtga ctcctggctg tgttcctgta ccgtttgtgg 120

cctccggggt cttgcgcgtc cagggcccag ataccgagta actgccatgt ggtgtcgtgg 180

tggcgtccat aggcctccgt ggtggggcgg gggcggggct gtgggtgtgc agggcagctg 240

cattttccaa gagtggagat ggaggatgga gatttgcttc gtaaaggcct gggagctaca 300

tgggaggtag ggaggggcca gccacaccca gggctggggg gccggtggac tctgcccagt 360

gaagccaggg gagtgtcagc aatgaggggc ctggacagcg acgcgggacc cgcacctggg 420

agcggcagag atggagccca gctcgctaaa cgtaccctaa gggtgcacgc tgaccgtctt 480

tgatccctgg acagtgtcca tagagttgtc tccccttcgg tcacattgtc cctggccttg 540

gagtttctgc ctcagggaaa gacccactaa aaccacccgg gctctggcgg cctcctcttc 600

tccagagcat ggtggcgctg cctcctgagc agagtcactg agagtgggtc caaggaaggg 660

acccacttag caaaggttgt tgtgtggtgc tgtgcaggca cggcgcctgg agttggcccc 720

gccagcatcg gggtggcttg gggctctggt tcctaagctt ctgcttctga gttggtcagc 780

acgttggctc tgcgccctgc agtgcctgtc ttgatggtgg tccctggatg tgctgaaggg 840

ctccaggagc aaggggccag accacacttt ggctgagtcc ttgggtagtt tcagagggtc 900

acctgcacag tctgcattga gcaccctggg gctgaggccg tgggtcaagt tctgcctgac 960

agtgctttga ttatatggtg atgtgattct gcaaacataa ccccccaact gagtgccagc 1020

tgctgggtgt ccccacctca agagccaagg ggaggccacc ctggaactgg tgctttgccc 1080

ttggtgtcat ggaggtgtct gggtccacac gtggtgggtg ggctgggcct gcagtcaccc 1140

agcatggccg gcgggattgg acccagctgc aacctggcag gttccatggg gcttcctgtg 1200

cctgggcctg cctgtgtctc atgcctgtca ataaaggaga aagcaacctg ggaaaaacat 1260

tcctctctgt acccagggga ctgccaagtg tgctttatag aaggctggaa gaccgactga 1320

agtaggaagc tgtcac atg ttt gag cca act gag caa agc agc tgt tgc ctg 1372

Met Phe Glu Pro Thr Glu Gln Ser Ser Cys Cys Leu

1 5 10

cca gac agc cac ccc aag acc aag gga gcc agc ctg tcg ttt gtc ctt 1420

Pro Asp Ser His Pro Lys Thr Lys Gly Ala Ser Leu Ser Phe Val Leu

15 20 25

cta gcc tgg aag acc agt tct tgg ctc ttg ggc act cac att ggc cac 1468

Leu Ala Trp Lys Thr Ser Ser Trp Leu Leu Gly Thr His Ile Gly His

30 35 40

tgg ccg caa ctg ctg aat ggc ttc act aat gtc cct ggg gtc tgt atc 1516

Trp Pro Gln Leu Leu Asn Gly Phe Thr Asn Val Pro Gly Val Cys Ile

45 50 55 60

tgc cct ctc ttc tat tcc cta gaa act ctg gcc ggg cat ggc ggc tca 1564

Cys Pro Leu Phe Tyr Ser Leu Glu Thr Leu Ala Gly His Gly Gly Ser

65 70 75

cgc ctg tca tcc cag cac tta ggg agg ccg agg cag gtg gat cac aag 1612

Arg Leu Ser Ser Gln His Leu Gly Arg Pro Arg Gln Val Asp His Lys

80 85 90

gtc aag aga tcg aca cca tcc tgg cca aca tgg tgaaacccca tctctactaa 1665

Val Lys Arg Ser Thr Pro Ser Trp Pro Thr Trp

95 100

aaatacaaaa attagctgga catggtggtg ggcgcctgta gtcccagcta cttgggaggc 1725

tgaggcagga gaattgcttg aactcgggag gcgtaggttg cagtgagcca agatcgcacc 1785

actgcactcc agcctggcaa cagagcaaga ctcctcaaaa aacaaacaaa caaacaaaca 1845

aactctgatc attcacccgg tgggatcccc ggcagctcct tctccttccc gagctgtggc 1905

tcatgggcca ttggattcag agaccttgag ctgggtgctg gctgccctaa aggagcacga 1965

cccctccagc ctggtgaccg ggaggcagag aagtacagct gtccttgaca aaggcccagc 2025

ctctgctgcc tcaggccagc ctggccacac cccgatgcca cagtcatctg ccccgcccag 2085

gcataagcct gactcaggtg atacaggcga ggcttctccc atgggtcctc cccttgtgag 2145

aaagggtctg tgctgtgtac cttctctctg cacagccatt gtcagtgtgg tttggagtga 2205

gtgagcctcc cgcactgctc aggggagagc tggtgagcat gactccatgg cctcctgctg 2265

gttggggtct tcttctgatt ctcacccgca ggagcaccca gaccccccct tttcagaaat 2325

gattacattt tcccttcaag agtcatctgt gacctggtgg tgtggaggga ctgtcccctc 2385

ctacagcctg gccgaccccc ttcctccagg gctgtgtgga atcacctctc agaagctcac 2445

tctgaaaagc cactcacagg gatttcacca ggatgttgct gttcctgccg acacctgggc 2505

tttgtgacaa atgctgtttg cttcaagctg ttcacaagca acttcaccag aatcccaagc 2565

tggttcgtac ttaatttgta tacatacata tgcataatat acatgaccat gtatgagata 2625

cagacaccat ggcttctgtc cattggctca ttcgcctggt cgcatgttca ctccatgagg 2685

ttaggacact gcgggtggct ttgggatggg tggtttcagg gacaaggtgt cgctgtgatc 2745

tctgtatttg ctaactagtc ttgttttgat ttctaacggt ctagtggagt acagttaatg 2805

ttgtgtttat ttctgtaact gtaggggtgg agtataaacc taacttttgg aaacatgaag 2865

aaataaacaa tagcgccagc cgttaaaaaa aaaaaaaaaa aaaaaaaaaa a 2916

<210>9

<211>103

<212>PRT

＜213〉homo sapiens (Homo sapiens)

<400>9

Met Phe Glu Pro Thr Glu Gln Ser Ser Cys Cys Leu Pro Asp Ser His

1 5 10 15

Pro Lys Thr Lys Gly Ala Ser Leu Ser Phe Val Leu Leu Ala Trp Lys

20 25 30

Thr Ser Ser Trp Leu Leu Gly Thr His Ile Gly His Trp Pro Gln Leu

35 40 45

Leu Asn Gly Phe Thr Asn Val Pro Gly Val Cys Ile Cys Pro Leu Phe

50 55 60

Tyr Ser Leu Glu Thr Leu Ala Gly His Gly Gly Ser Arg Leu Ser Ser

65 70 75 80

Gln His Leu Gly Arg Pro Arg Gln Val Asp His Lys Val Lys Arg Ser

85 90 95

Thr Pro Ser Trp Pro Thr Trp

100

<210>10

<211>20

<212>DNA

＜213〉artificial sequence

<220>

<221>misc_feature

＜223〉primer

<400>10

atgatggtca ggctggtcaa 20

<210>11

<211>24

<212>DNA

＜213〉artificial sequence

<220>

<221>misc_feature

＜223〉primer

<400>11

tcgaaagtac aacgctttat tgaa 24

<210>12

<211>18

<212>DNA

＜213〉artificial sequence

<220>

<221>misc_feature

＜223〉primer

<400>12

ggccgttgac ctggtgac 18

<210>13

<211>23

<212>DNA

＜213〉artificial sequence

<220>

<221>misc_feature

＜223〉primer

<400>13

aaagcaaatg tattcttcct cca 23

<210>14

<211>20

<212>DNA

＜213〉artificial sequence

<220>

<221>misc_feature

＜223〉primer

<400>14

tttcaatccc agtctgcctc 20

<210>15

<211>20

<212>DNA

＜213〉artificial sequence

<220>

<221>misc_feature

＜223〉primer

<400>15

ggctggcgct attgtttatt 20

Claims

1. isolating polynucleotide is characterized in that, it comprises a nucleotide sequence, and this nucleotide sequence is selected from down group:

(a) coding has the proteic polynucleotide of people of cancer suppressing function, and described albumen has and contains following aminoacid sequence: SEQ ID NO:9;

(b) with the complete complementary polynucleotide of polynucleotide (a).

2. polynucleotide as claimed in claim 1 is characterized in that, the polypeptide of this polynucleotide encoding has following aminoacid sequence: SEQ ID NO:9.

3. polynucleotide as claimed in claim 1 is characterized in that, the sequence of these polynucleotide is selected from down group:

Coding region sequence or the full length sequence of SEQ ID NO:8.

4. a carrier is characterized in that, it contains the described polynucleotide of claim 1.

5. a genetically engineered host cell is characterized in that, it is a kind of host cell that is selected from down group:

(a) host cell that transforms or transduce with the described carrier of claim 4;

(b) host cell that transforms or transduce with the described polynucleotide of claim 1.