CN1209370C - New human protein with cancer inhibiting function and its encoding sequence - Google Patents
New human protein with cancer inhibiting function and its encoding sequence Download PDFInfo
- Publication number
- CN1209370C CN1209370C CN 02111646 CN02111646A CN1209370C CN 1209370 C CN1209370 C CN 1209370C CN 02111646 CN02111646 CN 02111646 CN 02111646 A CN02111646 A CN 02111646A CN 1209370 C CN1209370 C CN 1209370C
- Authority
- CN
- China
- Prior art keywords
- leu
- ser
- ctg
- pro
- ala
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Expired - Fee Related
Links
Landscapes
- Peptides Or Proteins (AREA)
- Medicines That Contain Protein Lipid Enzymes And Other Medicines (AREA)
Abstract
The present invention discloses new human protein with the function of inhibiting cancer, a polynucleotide for encode the polypeptide and a method for producing the polypeptide by a recombinant technology. The present invention also discloses a method of using the polypeptide in treating various diseases, such as cancer. The present invention also discloses an antagonist of the polypeptide and therapeutic effects thereof. The present invention also discloses an application of the polynucleotide for encoding the human protein with the function of inhibiting cancer.
Description
Technical field
The invention belongs to biological technical field, specifically, the present invention relates to new coding and have the proteic polynucleotide of people of cancer suppressing function and the polypeptide of this polynucleotide encoding.The invention still further relates to the purposes and the preparation of these polynucleotide and polypeptide.
Background technology
The research of people's gene group is international focus at present, removes human chromosome DNA large scale sequencing, outside the method for expressed sequence order-checking (EST), also lacks the screening that begins from function and has the high-throughout method of functional gene.
Cancer is one of principal disease of harm humans health.In order to treat effectively and prophylaxis of tumours, people more and more pay close attention to genetic treatment of tumor at present.Therefore, this area presses for people's albumen and the agonist/inhibitor thereof that development research has cancer suppressing function.
Summary of the invention
The purpose of this invention is to provide the new people's protein polypeptide of a class with cancer suppressing function with and fragment, analogue and derivative.
Another object of the present invention provides the polynucleotide of these polypeptide of coding.
Another object of the present invention provides the method for these polypeptide of production and the purposes of this polypeptide and encoding sequence.
In a first aspect of the present invention, novel isolated protein polypeptide with cancer suppressing function is provided, it comprises the polypeptide of the aminoacid sequence with the group of being selected from down: SEQ ID NO:3,6,9; Or its conservative property variation polypeptide or its active fragments or its reactive derivative.
Preferably, this polypeptide is the polypeptide with aminoacid sequence of the group of being selected from down: SEQ ID NO:3,6,9.
In a second aspect of the present invention, a kind of isolating polynucleotide are provided, it comprises a nucleotide sequence, and this nucleotide sequence is shown at least 85% homogeny with a kind of nucleotides sequence that is selected from down group: the polynucleotide of the above-mentioned protein polypeptide with cancer suppressing function of (a) encoding; (b) with polynucleotide (a) complementary polynucleotide.Preferably, the polypeptide of this polynucleotide encoding has the aminoacid sequence of the group of being selected from down: SEQ ID NO:3,6,9.More preferably, the sequence of these polynucleotide is selected from down group: SEQ ID NO:2,5,8 coding region sequence or full length sequence.
In a third aspect of the present invention, the carrier that contains above-mentioned polynucleotide is provided, and has been transformed or host cell of transduceing or the host cell that is directly transformed or transduce by above-mentioned polynucleotide by this carrier.
In a fourth aspect of the present invention, the preparation method who prepares the polypeptide of the protein-active with cancer suppressing function is provided, this method comprises: (a) have under the proteic condition of cancer suppressing function suitable the expression, cultivate the above-mentioned host cell that is transformed or transduce; (b) from culture, isolate the polypeptide of protein-active with cancer suppressing function.
In a fifth aspect of the present invention, provide and above-mentioned protein polypeptide specificity bonded antibody with cancer suppressing function.The nucleic acid molecule that can be used for detecting also is provided, and it contains, and continuous 10 Nucleotide are to full length nucleotide in the above-mentioned polynucleotide, and preferably it contains the about 15-1000 of a successive Nucleotide.
In a sixth aspect of the present invention, a kind of pharmaceutical composition is provided, it contains the protein polypeptide and the pharmaceutically acceptable carrier with cancer suppressing function of the present invention of safe and effective amount.These pharmaceutical compositions can be treated illnesss such as cancer and cellular abnormality propagation.
Others of the present invention are because the disclosure of this paper is conspicuous to those skilled in the art.
Embodiment
The 3T3 cell is a kind of l cell (J.Cell.Biol., 17:299,1963) (being also referred to as the NIH/3T3 cell).In the cancer research field, often foreign gene (especially people's gene) is introduced the 3T3 cell, observe its situation that influences to the growth of 3T3 cell.Now generally believing, is cancer related gene to the 3T3 cell influential gene of growing, wherein to the 3T3 cell growth inhibiting gene is arranged is cancer suppressor gene mostly, and to the 3T3 cell growth (former) oncogene that has the gene of promoter action to be mostly.
The present invention adopts large-scale cDNA clone transfection mouse embryo fibroblasts, has on the basis of cancer suppressing action in acquisition, proves new gene through order-checking, further obtains full length cDNA clone.DNA transfection evidence, the albumen with cancer suppressing function of the present invention has the effect that suppresses clone's formation, its inhibiting rate 〉=50% to the 3T3 cell.
As used herein, " isolating " is meant that material separates (if natural substance, primal environment promptly is a natural surroundings) from its primal environment.Do not have separation and purification as polynucleotide under the native state in the active somatic cell and polypeptide, but same polynucleotide or polypeptide as from native state with in other materials that exist separately, then for separation and purification.
As used herein, " isolating albumen or polypeptide with cancer suppressing function " is meant that the protein polypeptide with cancer suppressing function is substantially free of natural relative other albumen, lipid, carbohydrate or other material.Those skilled in the art can have the albumen of cancer suppressing function with the purified technology of protein purifying of standard.Basically pure polypeptide can produce single master tape on non-reduced polyacrylamide gel.
Polypeptide of the present invention can be recombinant polypeptide, natural polypeptides, synthetic polypeptide, preferred recombinant polypeptide.Polypeptide of the present invention can be the product of natural purifying, or the product of chemosynthesis, or uses recombinant technology to produce from protokaryon or eucaryon host (for example, bacterium, yeast, higher plant, insect and mammalian cell).The host used according to the recombinant production scheme, polypeptide of the present invention can be glycosylated, maybe can be nonglycosylated.Polypeptide of the present invention also can comprise or not comprise initial methionine residues.
The present invention also comprises the proteic fragment of the people with cancer suppressing function, derivative and analogue.As used herein, term " fragment ", " derivative " are meant with " analogue " and keep natural identical biological function or the active polypeptide of people's albumen with cancer suppressing function of the present invention basically.Polypeptide fragment of the present invention, derivative or analogue can be that (i) has one or more conservative or substituted polypeptide of non-conservation amino-acid residue (preferred conservative amino acid residue), and the amino-acid residue of such replacement can be also can not encoded by genetic code, or (ii) in one or more amino-acid residues, has a polypeptide of substituted radical, or (iii) mature polypeptide and another compound (such as the compound that prolongs the polypeptide transformation period, polyoxyethylene glycol for example) merge formed polypeptide, or (iv) additional aminoacid sequence is fused to this peptide sequence and the polypeptide that forms (as leader sequence or secretion sequence or be used for the sequence or the proteinogen sequence of this polypeptide of purifying).According to the instruction of this paper, these fragments, derivative and analogue belong to the known scope of those skilled in the art.
Polynucleotide of the present invention can be dna form or rna form.Dna form comprises the DNA of cDNA, genomic dna or synthetic.DNA can be strand or double-stranded.DNA can be coding strand or noncoding strand.Be example with FP3361 albumen (in this application, its clone numbering is adopted in proteinic name), the coding region sequence of encoding mature polypeptide can be identical with the coding region sequence shown in the SEQ ID NO:2 or the varient of degeneracy.As used herein, " varient of degeneracy " is meant that for FP3361 coding has the protein of SEQ ID NO:3, but with the differentiated nucleotide sequence of coding region sequence shown in the SEQ ID NO:2.Be example with FP7072 albumen again, the coding region sequence of encoding mature polypeptide can be identical with the coding region sequence shown in the SEQ ID NO:5 or the varient of degeneracy; " varient of degeneracy " is meant that for FP7072 coding has the protein of SEQ ID NO:6, but with the differentiated nucleotide sequence of coding region sequence shown in the SEQ ID NO:5.Have the albumen of cancer suppressing function for of the present invention other, can the rest may be inferred.
The polynucleotide of encoding mature polypeptide comprise: the encoding sequence of an encoding mature polypeptide; The encoding sequence of mature polypeptide and various additional code sequence; Encoding sequence of mature polypeptide (with optional additional code sequence) and non-coding sequence.
Term " polynucleotide of coded polypeptide " can be the polynucleotide that comprise this polypeptide of encoding, and also can be the polynucleotide that also comprise additional code and/or non-coding sequence.
The invention still further relates to the varient of above-mentioned polynucleotide, its coding has the polypeptide of identical aminoacid sequence or fragment, analogue and the derivative of polypeptide with the present invention.The varient of these polynucleotide can be the allelic variant of natural generation or the varient that non-natural takes place.These nucleotide diversity bodies comprise and replace varient, deletion mutation body and insert varient.As known in the art, allelic variant is the replacement form of polynucleotide, and it may be replacement, disappearance or the insertion of one or more Nucleotide, but can be from not changing the function of its encoded polypeptides in fact.
The invention still further relates to and above-mentioned sequence hybridization and two sequences between have at least 50%, preferably at least 70%, the polynucleotide of at least 80% homogeny more preferably.The present invention be more particularly directed under stringent condition and the interfertile polynucleotide of polynucleotide of the present invention.In the present invention, " stringent condition " is meant: (1) than hybridization under low ionic strength and the comparatively high temps and wash-out, as 0.2 * SSC, and 0.1%SDS, 60 ℃; Or (2) hybridization the time is added with denaturing agent, as 50% (v/v) methane amide, 0.1% calf serum/0.1%Ficoll, 42 ℃ etc.; Or (3) only at the homogeny between the two sequences at least more than 95%, be more preferably 97% and just hybridize when above.And the polypeptide of interfertile polynucleotide encoding has identical biological function (is example with FP3361 albumen) and activity with the mature polypeptide shown in the SEQ IDNO:3.
The invention still further relates to nucleic acid fragment with above-mentioned sequence hybridization.As used herein, the length of " nucleic acid fragment " contains 15 Nucleotide at least, better is at least 30 Nucleotide, is more preferably at least 50 Nucleotide, preferably more than at least 100 Nucleotide.The amplification technique (as PCR) that nucleic acid fragment can be used for nucleic acid has the proteic polynucleotide of cancer suppressing function to determine and/or to separate to encode.
Polypeptide among the present invention and polynucleotide preferably provide with isolating form, more preferably are purified to homogeneous.
Dna sequence dna of the present invention can obtain with several method.For example, with hybridization technique DNA isolation well known in the art.These technology including, but not limited to: 1) with probe and genome or the hybridization of cDNA library to detect homology nucleotide sequence and 2) antibody screening of expression library to be to detect the dna fragmentation of the clone with common structure feature.
The proteic specific DNA fragment sequence that coding has cancer suppressing function produces also and can obtain with following method: 1) separate double chain DNA sequence from genomic dna; 2) the chemical synthesising DNA sequence is to obtain the double-stranded DNA of required polypeptide.
When the whole aminoacid sequence of the polypeptide product of needs was known, the direct chemical of dna sequence dna is synthetic to be the method for often selecting for use.When if required amino acid whose whole sequence is not known, the direct chemical of dna sequence dna is synthetic to be impossible, and the method for selecting for use is the separation of cDNA sequence.The standard method that separates interested cDNA is from the donorcells separating mRNA of this gene of high expression level and carries out reverse transcription, forms plasmid or phage cDNA library.Extract the existing multiple proven technique of method of mRNA, test kit also can obtain (Qiagene) from commercial channels.And the construction cDNA library also is usual method (Sambrook, et al., Molecular Cloning, A Laboratory Manual, Cold SpringHarbor Laboratory.New York, 1989).Also can obtain the cDNA library of commercial offers, as the different cDNA library of Clontech company.When being used in combination the polymeric enzyme reaction technology, even few expression product also can be cloned.
Available ordinary method is screened gene of the present invention from these cDNA libraries.These methods include, but is not limited to: (1) DNA-DNA or DNA-RNA hybridization; (2) function of marker gene occurs or forfeiture; (3) mensuration has the level of the proteic transcript of cancer suppressing function; (4), detect the protein product of genetic expression by immunological technique or mensuration biologic activity.Aforesaid method can singly be used, but also several different methods combined utilization.
In (1) kind method, hybridizing used probe is and any a part of homology of polynucleotide of the present invention that at least 15 Nucleotide of its length better are at least 30 Nucleotide, are more preferably at least 50 Nucleotide, preferably at least 100 Nucleotide.In addition, the length of probe within 2kb, preferably is within the 1kb usually.Probe used herein is the dna sequence dna of chemosynthesis on the basis of gene DNA sequence information of the present invention normally.Gene of the present invention itself or fragment are certainly as probe.The mark of dna probe can be used radio isotope, fluorescein or enzyme (as alkaline phosphatase) etc.
In (4) kind method, detect the protein product of protein gene expression and can use immunological technique such as Western blotting, radioimmunoprecipitation, enzyme-linked immunosorbent assay (ELISA) etc. with cancer suppressing function.
Use method (Saiki, the et al.Science 1985 of round pcr DNA amplification/RNA; 230:1350-1354) be optimized for acquisition gene of the present invention.When particularly being difficult to obtain the cDNA of total length from the library, can preferably use RACE method (the terminal rapid amplifying method of RACE-cDNA), the primer that is used for PCR can suitably be selected according to sequence information of the present invention disclosed herein, and available ordinary method is synthetic.Available ordinary method is as the DNA/RNA fragment by gel electrophoresis separation and purifying amplification.
The gene of the present invention that obtains as mentioned above, perhaps the available ordinary method of mensuration of the nucleotide sequence of various dna fragmentations etc. such as dideoxy chain termination (Sanger et al.PNAS, 1977,74:5463-5467).This class nucleotide sequencing is available commercial sequencing kit etc. also.In order to obtain the cDNA sequence of total length, order-checking need be carried out repeatedly.Sometimes need to measure a plurality of clones' cDNA sequence, just can be spliced into the cDNA sequence of total length.
The present invention also relates to comprise the carrier of polynucleotide of the present invention, and the host cell that produces through genetically engineered with carrier of the present invention or albumen coded sequence with cancer suppressing function, and the method that produces polypeptide of the present invention through recombinant technology.
By the recombinant DNA technology of routine, can utilize polymerized nucleoside acid sequence of the present invention to can be used to express or produce the protein polypeptide with cancer suppressing function of reorganization.In general following steps are arranged:
(1). have the proteic polynucleotide of people (or varient) of cancer suppressing function with coding of the present invention, or transform or the transduction proper host cell with the recombinant expression vector that contains these polynucleotide;
(2). the host cell of in suitable medium, cultivating;
(3). separation, protein purification from substratum or cell.
Among the present invention, the people's albumen polynucleotide sequence with cancer suppressing function can be inserted in the recombinant expression vector.Term " recombinant expression vector " refers to that bacterial plasmid well known in the art, phage, yeast plasmid, vegetable cell virus, mammalian cell virus are as adenovirus, retrovirus or other carriers.The carrier of Shi Yonging includes but not limited in the present invention: the expression vector based on T7 of expressing in bacterium; The pMSXND expression vector of in mammalian cell, expressing and at the carrier that derives from baculovirus of expressed in insect cells.In a word, as long as can duplicate in host and stablize, any plasmid and carrier can be used.A key character of expression vector is to contain replication orgin, promotor, marker gene and translation controlling elements usually.
Method well-known to those having ordinary skill in the art can be used to make up and contains people's encoding histone dna sequence dna with cancer suppressing function and suitable transcribing/the translate expression vector of control signal.These methods comprise extracorporeal recombinant DNA technology, DNA synthetic technology, the interior recombinant technology of body etc.Described dna sequence dna can effectively be connected on the suitable promotor in the expression vector, and is synthetic to instruct mRNA.The representative example of these promotors has: colibacillary 1ac or trp promotor; Lambda particles phage PL promotor; Eukaryotic promoter comprises LTRs and some other known may command gene expression promoter in protokaryon or eukaryotic cell or its virus of CMV immediate early promoter, early stage and late period SV40 promotor, retrovirus.Expression vector also comprises ribosome bind site and the transcription terminator that translation initiation is used.
In addition, expression vector preferably comprises one or more selected markers, to be provided for selecting the phenotypic character of transformed host cells, cultivate Tetrahydrofolate dehydrogenase, neomycin resistance and the green fluorescent protein (GFP) of usefulness as eukaryotic cell, or be used for colibacillary tsiklomitsin or amicillin resistance.
Comprise the carrier of above-mentioned suitable dna sequence dna and suitable promotor or control sequence, can be used to transform appropriate host cell, so that it can marking protein.
Host cell can be a prokaryotic cell prokaryocyte, as bacterial cell; Or eukaryotic cell such as low, as yeast cell; Or higher eucaryotic cells, as mammalian cell.Representative example has: intestinal bacteria, streptomyces; The bacterial cell of Salmonella typhimurium; Fungal cell such as yeast; Vegetable cell; The insect cell of fruit bat S2 or Sf9; The zooblast of CHO, COS or Bowes melanoma cells etc.
When polynucleotide of the present invention are expressed in higher eucaryotic cells, be enhanced if will make to transcribe when in carrier, inserting enhancer sequence.Enhanser is the cis acting factor of DNA, and nearly 10 to 300 base pairs act on promotor transcribing with enhancing gene usually.Can for example be included in the SV40 enhanser of 100 to 270 base pairs of replication origin side in late period one, at the polyoma enhanser of replication origin side in late period one and adenovirus enhanser etc.
Persons skilled in the art all know how to select appropriate carriers, promotor, enhanser and host cell.
Can carry out with routine techniques well known to those skilled in the art with the recombinant DNA transformed host cell.When the host was prokaryotic organism such as intestinal bacteria, the competent cell that can absorb DNA can be used CaCl in exponential growth after date results
2Method is handled, and used step is well-known in this area.Alternative is to use MgCl
2If desired, transforming also the method for available electroporation carries out.When the host is an eukaryote, can select following DNA transfection method for use: coprecipitation of calcium phosphate method, conventional mechanical method such as microinjection, electroporation, liposome packing etc.
The transformant that obtains can be cultivated with ordinary method, expresses the polypeptide of coded by said gene of the present invention.According to used host cell, used substratum can be selected from various conventional substratum in the cultivation.Under the condition that is suitable for the host cell growth, cultivate.After host cell grows into suitable cell density, induce the promotor of selection with suitable method (as temperature transition or chemical induction), cell is cultivated for some time again.
Recombinant polypeptide in the above methods can wrap by in cell, extracellular or on cytolemma, express or be secreted into the extracellular.If desired, can utilize its physics, the separating by various separation methods with other characteristic and the albumen of purification of Recombinant of chemistry.These methods are well-known to those skilled in the art.The example of these methods includes, but are not limited to: conventional renaturation handles, with protein precipitant handle (salt analysis method), centrifugal, the broken bacterium of infiltration, superly handle, the combination of super centrifugal, sieve chromatography (gel-filtration), adsorption chromatography, ion exchange chromatography, high performance liquid chromatography (HPLC) and other various liquid chromatography (LC) technology and these methods.
The people's albumen or the polypeptide with cancer suppressing function of reorganization are of use in many ways.These purposes include, but is not limited to: directly have the disease due to the low or forfeiture of the protein function of cancer suppressing function as pharmacological agent and be used to screen and promote or antagonism has antibody, polypeptide or other part of the protein function of cancer suppressing function.For example, antibody can be used for activating or suppressing to have the proteic function of people of cancer suppressing function.The people's protein screening peptide library that has a cancer suppressing function with the reorganization of expressing can be used for seeking the peptide molecule that can suppress or stimulate the people's protein function with cancer suppressing function of therapeutic value.
The present invention also provides screening of medicaments to improve (agonist) or check the method that (antagonist) has the proteic medicament of people of cancer suppressing function to identify.Agonist improves the biological function such as stimulate cellular proliferation of the people's albumen with cancer suppressing function, and antagonist prevention disorder such as the various cancer relevant with cell hyperproliferation with treatment.For example, can be in the presence of medicine, the proteic film preparation of people that mammalian cell or expression is had cancer suppressing function is cultivated with the people's albumen with cancer suppressing function of mark.Measure the medicine raising then or check this interactional ability.
The proteic antagonist of people with cancer suppressing function comprises antibody, compound, acceptor disappearance thing and the analogue etc. that filter out.The proteic antagonist of people with cancer suppressing function can and be eliminated its function with the people's protein binding with cancer suppressing function, or suppresses to have the proteic generation of people of cancer suppressing function, or combines with the avtive spot of polypeptide and to make polypeptide can not bring into play biological function.The proteic antagonist of people with cancer suppressing function can be used for therepic use.
In screening during as the compound of antagonist, albumen of the present invention can be added during bioanalysis measures, determine by measuring albumen and the interaction between its acceptor that compounds affect has cancer suppressing function whether compound is antagonist.With the same quadrat method of above-mentioned SCREENED COMPOUND, can filter out the acceptor disappearance thing and the analogue of antagonist action.
Polypeptide of the present invention can be directly used in disease treatment, for example, and various malignant tumours and cellular abnormality propagation etc.
Polypeptide of the present invention, and fragment, derivative, analogue or their cell can be used as antigen to produce antibody.These antibody can be polyclone or monoclonal antibody.Polyclonal antibody can obtain by the method with this polypeptide direct injection animal.The technology of preparation monoclonal antibody comprises hybridoma technology, three knurl technology, people B-quadroma technology, EBV-hybridoma technology etc.
Can be with polypeptide of the present invention and antagonist and suitable pharmaceutical carrier combination back use.These carriers can be water, glucose, ethanol, salt, damping fluid, glycerine and their combination.Composition comprises the polypeptide or the antagonist of safe and effective amount and carrier and the vehicle that does not influence effect of drugs.These compositions can be used as medicine and are used for disease treatment.
The present invention also provides medicine box or the test kit that contains one or more containers, and one or more medicinal compositions compositions of the present invention are housed in the container.With these containers, can have by the given indicative prompting of government authorities of making, using or selling medicine or biological products, the government authorities that this prompting reflects production, uses or sells permits it to use on human body.In addition, polypeptide of the present invention can be used in combination with other treatment compound.
Pharmaceutical composition can be with mode administration easily, as by in part, intravenously, intraperitoneal, intramuscular, subcutaneous, the nose or the route of administration of intracutaneous.Albumen with cancer suppressing function comes administration with the amount that treats and/or prevents concrete indication effectively.The proteic amount with cancer suppressing function and the dosage range that are applied to the patient will depend on many factors, as administering mode, person's to be treated healthiness condition and diagnostician's judgement.
The proteic polynucleotide of people with cancer suppressing function also can be used for multiple therapeutic purpose.Gene therapy technology can be used for treating since have that the proteic nothing of cancer suppressing function is expressed or the proteic expression with cancer suppressing function of unusual/non-activity due to cell proliferation, growth or metabolic disturbance.The gene therapy vector of reorganization can be used for treating the protein expression with cancer suppressing function or the disease of active caused by abnormal.Deriving from the expression vector of virus such as protein gene that retrovirus, adenovirus, adeno-associated virus (AAV), hsv, parvovirus etc. can be used for having cancer suppressing function is transferred in the cell.The method that structure carries the recombinant viral vector of the protein gene with cancer suppressing function be found in existing document (Sambrook, etal.).The people protein gene of reorganization with cancer suppressing function can be packaged in the liposome and be transferred in the cell in addition.
Suppress to have cancer suppressing function people's protein mRNA oligonucleotide (comprising sense-rna and DNA) and ribozyme also within the scope of the invention.Ribozyme is the enzyme sample RNA molecule that a kind of energy specificity is decomposed specific RNA, and its mechanism of action is to carry out the endonuclease effect after ribozyme molecule and the hybridization of complementary target RNA-specific.The RNA of antisense and DNA and ribozyme can obtain with existing any RNA or DNA synthetic technology, as the technology widespread use of solid phase phosphoamide chemical synthesis synthetic oligonucleotide.Antisense rna molecule can be transcribed acquisition by the dna sequence dna of this RNA that encodes in external or body.This dna sequence dna has been incorporated into the downstream of rna polymerase promoter of carrier.In order to increase the stability of nucleic acid molecule, available several different methods is modified it, and as increasing the sequence length of both sides, the connection between the ribonucleoside is used phosphoric acid thioester bond or peptide bond but not phosphodiester bond.
Polynucleotide imports tissue or intracellular method comprises: directly be injected into polynucleotide in the in-vivo tissue; Or external by carrier (as virus, phage or plasmid etc.) earlier with the polynucleotide transfered cell in, again cell is transplanted in the body etc.
Polypeptide of the present invention also can be used as the peptide spectrum analysis, for example, the polypeptide available physical, chemistry or enzyme carry out the specificity cutting, and carry out the two-dimentional or three-dimensional gel electrophoresis analysis of one dimension.
The present invention also provides the antibody at the people's proteantigen determinant with cancer suppressing function.These antibody include, but is not limited to: the fragment that polyclonal antibody, monoclonal antibody, chimeric antibody, single-chain antibody, Fab fragment and Fab expression library produce.These antibody can prepare with ordinary method.The anti-proteic antibody of people with cancer suppressing function can be used in the immunohistochemistry technology, detects the people's albumen with cancer suppressing function in the biopsy specimen.
With the also available labelled with radioisotope of the protein bound monoclonal antibody of the people with cancer suppressing function, inject in the body and can follow the tracks of its position and distribution.Antibody among the present invention can be used for treating or prevents and the relevant disease of people's albumen with cancer suppressing function.The antibody that gives suitable dosage can stimulate or block proteic generation of the people with cancer suppressing function or activity.
Antibody also can be used for designing the immunotoxin at a certain privileged sites in the body.As have cancer suppressing function people's albumen high-affinity monoclonal antibody can with bacterium or plant poison (as diphtheria toxin, ricin, abrine etc.) covalent attachment.
Available people's albumen or the polypeptide immune animal of the production of polyclonal antibody with cancer suppressing function, as rabbit, mouse, rat etc.Multiple adjuvant can be used for the enhancing immunity reaction, includes but not limited to freund's adjuvant etc.
Have cancer suppressing function people's protein monoclonal antibody can with hybridoma technology production (Kohler and Milstein.Nature, 1975,256:495-497).With the variable region bonded chimeric antibody in human constant region and inhuman source can with existing technology production (Morrison et al, PNAS, 1985,81:6851).And the technology of existing manufacture order chain antibody (U.S.PatNo.4946778) also can be used for producing the anti-proteic single-chain antibody of people with cancer suppressing function.
Can be incorporated into the rondom polypeptide storehouse that solid formation forms by the various amino acid that may make up by screening with the protein bound peptide molecule of the present invention obtains.During screening, must carry out mark to people's protein molecular with cancer suppressing function.
The invention still further relates to quantitatively and detection and localization has the diagnostic testing process of people's protein level of cancer suppressing function.These tests are known in the art, and comprise that FISH measures and radioimmunoassay.The people's protein level that is detected in the test with cancer suppressing function, the disease that can have the importance of people's albumen in various diseases of cancer suppressing function with laying down a definition and be used to diagnose albumen to work with cancer suppressing function.
Proteic polynucleotide with cancer suppressing function can be used for having the diagnosis and the treatment of the protein related diseases of cancer suppressing function.Aspect diagnosis, the proteic polynucleotide with cancer suppressing function can be used for detecting have cancer suppressing function proteic expression whether or under morbid state, have an abnormal exprssion of cancer suppressing function.As the protein D NA sequence with cancer suppressing function can be used for the hybridization of biopsy specimen is had with judgement the proteic abnormal expression of cancer suppressing function.Hybridization technique comprises the Southern blotting, Northern blotting, in situ hybridization etc.These technological methods all are disclosed mature technologies, and relevant test kit all can obtain from commercial channels.Part or all of polynucleotide of the present invention can be used as probe stationary on microarray (Microarray) or DNA chip (being called " gene chip " again), is used for analyzing the differential expression analysis and the gene diagnosis of tissue gene.Carry out RNA-polymerase chain reaction (RT-PCR) amplification in vitro with the special primer of the albumen with cancer suppressing function and also can detect proteic transcription product with cancer suppressing function.
The sudden change that detection has the protein gene of cancer suppressing function also can be used for diagnosing the relevant disease of albumen with cancer suppressing function.Form with protein mutation of cancer suppressing function comprises that to have point mutation that the protein D NA sequence of cancer suppressing function compares, transposition, disappearance, reorganization and other any unusual etc. with normal wild type.Available existing technology such as Southern blotting, dna sequence analysis, PCR and in situ hybridization detect sudden change.In addition, sudden change might influence proteic expression, therefore can judge indirectly that with Northern blotting, Western blotting gene has or not sudden change.
Sequence of the present invention identifies it also is valuable to karyomit(e).These sequences can be specifically at certain bar human chromosome particular location and and can with its hybridization.At present, need to identify the concrete site of each gene on the karyomit(e).Yet have only chromosomal marker thing seldom to can be used for the marker chromosomes position now based on actual sequence data (repetition polymorphism).For these sequences are associated with disease related gene.The first step is positioned dna sequence dna of the present invention on the karyomit(e) exactly.
In brief, prepare PCR primer (preferred 15-35bp), sequence can be positioned on the karyomit(e) according to cDNA.Then, these primers are used for the somatocyte hybrid cell that the PCR screening contains each bar human chromosome.Have only those hybrid cells that contain corresponding to the people's gene of primer can produce the fragment of amplification.
The PCR localization method of somatocyte hybrid cell is that DNA is navigated to concrete chromosomal quick method.Use Oligonucleolide primers of the present invention,, can utilize one group to realize inferior location from specific chromosomal fragment or a large amount of genomic clone by similar approach.Other the similar strategy that can be used for chromosomal localization comprises in situ hybridization, uses the karyomit(e) prescreen and the hybridization preliminary election of the airflow classification of mark, thereby makes up the special cDNA storehouse of karyomit(e).
The cDNA clone is carried out fluorescence in situ hybridization (FISH) with Metaphase Chromosome, can in a step, accurately carry out chromosomal localization.The summary of this technology is referring to Verma etc., Human Chromosomes:a Manual of BasicTechniques, Pergamon Press, New York (1988).
In case sequence is positioned to chromosome position accurately, the physical location of this sequence on karyomit(e) just can be associated with the gene map data.These data for example are found in, V.Mckusick, Mendelian Inheritance in Man (can by with the online acquisition of Johns Hopkins University Welch Medical Library).Can pass through linkage analysis then, determine gene and navigated to relation between the disease on the chromosomal region already.
Then, need to measure ill and not cDNA between diseased individuals or genome sequence difference.If observe certain sudden change in some or all of diseased individuals, and this sudden change is not observed in any normal individual, then this sudden change may be the cause of disease of disease.More ill and diseased individuals not is usually directed at first seek the variation of structure in the karyomit(e), as from the horizontal visible of karyomit(e) or use based on detectable disappearance of the PCR of cDNA sequence or transposition.
Pyrenoids thuja acid full length sequence or its fragment with cancer suppressing function of the present invention can obtain with the method for pcr amplification method, recombination method or synthetic usually.For the pcr amplification method, can be disclosed according to the present invention about nucleotide sequence, especially open reading frame sequence designs primer, and with commercially available cDNA storehouse or by the prepared cDNA storehouse of ordinary method well known by persons skilled in the art as template, amplification and must relevant sequence.When sequence is longer, usually needs to carry out twice or pcr amplification repeatedly, and then the fragment that each time amplifies is stitched together by proper order.
In case obtained relevant sequence, just can obtain relevant sequence in large quantity with recombination method.This normally is cloned into carrier with it, changes cell again over to, separates obtaining relevant sequence then from the host cell after the propagation by ordinary method.
In addition, also the method for available synthetic is synthesized relevant sequence, especially fragment length more in short-term.Usually, by first synthetic a plurality of small segments, and then connect and to obtain the very long fragment of sequence.
At present, can be fully come the dna sequence dna of code book invention albumen (or its fragment, or derivatives thereof) by chemosynthesis.This dna sequence dna can be introduced then in the various dna moleculars (as carrier) and cell in this area.In addition, also can will suddenly change and introduce in the protein sequence of the present invention by chemosynthesis.
In addition, because the albumen with cancer suppressing function of the present invention has the natural acid sequence that is derived from the people, therefore, compare with the albumen of the same clan that derives from other species, estimate to have higher active and/or lower side effect (for example in the intravital immunogenicity of people lower or do not have) being applied to man-hour.
Below in conjunction with specific embodiment, further set forth the present invention.Should be understood that these embodiment only to be used to the present invention is described and be not used in and limit the scope of the invention.The experimental technique of unreceipted actual conditions in the following example, usually according to people such as normal condition such as Sambrook, molecular cloning: laboratory manual (New York:Cold Spring Harbor LaboratoryPress, 1989) condition described in, or the condition of advising according to manufacturer.
The acquisition of embodiment 1:cDNA gene and the restraining effect that mouse NIH/3T3 cell clone is formed
FP3361, FP7072 and FP8080 come from the human fetal cDNA library that makes up with ordinary method.Get the fetal tissue at 3,6,9 monthly ages, (GIBCO BRL company) extracts total RNA by manufacturer's specification sheets with Trizol reagent, extracts mRNA with the mRNA test kit (Pharmacia company) of purifying.Make up the cDNA library of above-mentioned mRNA with pCMV-script TMXR cDNA library construction test kit (Stratagene company).Wherein ThermoScript II is used MMLV-RT-Superscript II (GIBCO BRL) instead, and reverse transcription reaction carries out at 42 ℃.Transform XL 10-Gold recipient cell, obtained the cDNA library of 1 * 106cfu/ μ g titre.The first round is picking cDNA clone at random, is probe with high abundance cDNA clone with the cDNA clone who has proved anticancer growth function thereafter, screening by hybridization cDNA library, weak positive and negative clone of picking.With Qiagen 96 orifice plate plasmid extraction test kits, carry out the extraction of plasmid DNA by shop instruction.Plasmid DNA and empty carrier transfection simultaneously mouse NIH/3T3 cell.After the 100ng DNA alcohol precipitation drying, add 6 μ lH2O dissolving, treat transfection.Add 0.74 μ l liposome and 9.3 μ l serum-free mediums in every part of DNA sample, behind the mixing, room temperature was placed 10 minutes.Add 150 μ l serum-free mediums in every pipe, divide equally and add 3 holes and grow in the mouse NIH/3T3 cell of 96 orifice plates, placed 2 hours for 37 ℃, every hole adds 50 μ l serum-free mediums again, 37 ℃ 24 hours.Every hole is changed 100 μ l and is trained liquid entirely, 37 ℃ 24 hours, change the full training liquid 100 μ l that contain G418,37 ℃ 24-48 hour, the limit is observed, the training liquid that G418 concentration does not wait is changed on the limit.After about 2-3 time, there is the clone to form up to the microscopy cell, counting.Find that above-mentioned clone has the cell clone of inhibition formation effect, the result is as shown in the table.
CDNA clone's transfectional cell (3T3) clone formation situation
CDNA clones title | CDNA clones number (three repetitions) | Empty carrier clone number (three repetitions) | ||||
FP3361 | 7 | 6 | 3 | 30 | 32 | 29 |
FP7072 | 2 | 0 | 5 | 21 | 28 | 20 |
FP8080 | 3 | 1 | 2 | 21 | 25 | 27 |
The cDNA clone is adopted two deoxidation cessation method, on the ABI377 automatic dna sequencer, measure the nucleotide sequence of the nearly 500bp of one end.After the analysis, be defined as novel gene cloning, carry out the other end order-checking,, then design primer, check order once more, up to obtaining full length sequence (SEQ ID NO:1,4,7) as obtaining full length cDNA sequence not yet.
Embodiment 2: PCR obtains full-length gene from placenta or fetus cDNA:
Get the fetal tissue (FP clone) at 3,6,9 monthly ages, (GIBCO BRL company) extracts total RNA by manufacturer's specification sheets with Trizol reagent, extracts mRNA with the mRNA test kit (Pharmacia company) of purifying.With MMLV-RT-Superscript II (GIBCO BRL), ThermoScript II is carried out reverse transcription reaction at 42 ℃, obtains placenta or fetus cDNA.Utilize the special primer (as shown in the table) of each gene, by 97 ℃ 3 ', 1 circulation.94 ℃ 30 ", 60 ℃ 30 ", 72 ℃ 1 ', totally 35 circulations; 72 ℃ 10 ', pcr amplification is carried out in 1 circulation, and acquisition contains the amplified production of each protein gene of complete open reading frame sequence.Amplified production is through sequence verification, and the sequence that records with embodiment 1 conforms to, and changes amplified production over to host cell with routine techniques subsequently, obtains recombinant protein (SEQ ID NO:3,6,9).
Gene specific primer
Clone's title | Special primer 1 (5 ' → 3 ') | SEQ ID NO: | Special primer 2 (3 ' → 5 ') | SEQ ID NO: |
FP3361 | (10)atgatggtcaggctggtcaa | 10 | (2947)tcgaaagtacaacgctttattgaa | 11 |
FP7072 | (27)ggccgttgacctggtgac | 12 | (2192)aaagcaaatgtattcttcctcca | 13 |
FP8080 | (44)tttcaatcccagtctgcctc | 14 | (2886)ggctggcgctattgtttatt | 15 |
Annotate: in the bracket is the correspondence position of primer in each gene DNA sequence.
Embodiment 3:cDNA cloned sequence is analyzed
1.FP3361 albumen
A: nucleotide sequence (SEQ ID NO:1) length: 3046 bases
1 GGTTTCACCA TGATGGTCAG GCTGGTCAAG ATCTCCTGAC CTCAAGTGAT CTGCCCGCCT
61 CGGCCTCCCA AAGTGCTGGG ATTACAGACA TGAACCACAA CACCTGCCAA GCCCAGGTGT
121 TTTGAATCCA GCACGGGCAA TGGCAAGACC CTGTCCCAAC AAACAAAAAA ACACTGGTGC
181 CTGAGGCCCA CCCCGAGAGA TTCTGATTGA GTTGATTTGG ATTTGAATTA TCTTTAAAAT
241 TTTGGATGTG AATTTTTTTT TTTTTTCTTG GAATGGAGTT TCACTCTTGT TCCCCAGGCT
301 GGAGTGCAAC GGGACGATCT TGGCTCACCG CAACCTCCGC CTACTGGGTT CAAGCCATTC
361 TCCTGCCTCA GCTTCCCAAG TAGCTGGGAT TACAGGCATG TGCTACCATG CCCAGCTAAT
421 TTTGTATTTT TAGTAGAGAC AGGGTTTCTC CATGTTGCTC AGGCTGGTCT TGAACTCCTG
481 ACCTCAGGTG ATCCGCCCGC CTCGGACTCC CAAAGTGCTG GGATTACAGG CATGAGCCAC
541 CATGCCTGGC CGGATGTGAA TTATCTTAAA AATTTTCAGG TAATTCTAAT GGGCCAAGGT
601 TGAGAACCCC TGCTCTGGGC CCATCCGAGC ACCAGGCTGT CACAAACGCA TGCATGCACT
661 CACGCCCGTG GGCTTGGGGG GCTTCGGAAA TGTGCTTCTG CTTTTTTGAG ATGGGGTCTT
721 TCTGTTGCCC ATCCCGGAGC ACAGTGGCAC GATCACAGCT CACTGCAGCC TCGACCTCCT
781 GGGCTCAGGT GATCCTCCCG CCTCAGCCTC CTGAGTGTCT GCTTCTGGTT TTCATGATGA
841 CCTGGGGCCC AGGCATACTA CACTTGTGCT GTTCAGGGGC CAGTCCTGCA CCAGGAGCCC
901 ATCAGCCACA GCTCCGCCGA GAAGCACTGA TATGCAGAGC TAAGCAGCTT TGTTTCCACG
961 TGGATCCTGC GTAGGTTTTC TTGGTCCATC CGTAGACACC GCACTCCTGC AGAGGATCTT
1021 CTCGGGATGC CCCACTGTCT CTGTTTTCCC TCTTCACTGA ACACTCAGTC GGGGCTCGCC
1081 ATGATGCCTC TGTGTCTGCT GGCTTCTCCC CCATTGGAAC AGCCTTCTTG GCACGCCACA
1141 CTGCTAGCTG CTGGGCACTG TGCTTTCTGC CTTTACCGTT CTGCCGTGAT GTTGCCAAAA
1201 TAGCAGCAAC AACAACAACA ACAAAGGCTG GGCACCTGGC TCATGCCTGT ATTCCCAGCA
1261 GTTCGAGATC AGCCTGGGCA ACATGGTGAG ACCCTATCTC TACAAAAATA AAAAATGAAA
1321 TGAGCTGGGT GGGGTGGCGC ATGCATGCCT GTGGTCCCAG CTACTTGGGA GGCTGAGGTC
1381 GGAAGCTCGC TGGTCGGAAG ATCGCTGGAG CTTAGCCTTG AGGTCAAGGT TGCAGTGAGC
1441 CGAGATTACA TCACTGCACT CCAGCGTGGG AGACAGAGAC CCTGTATTAA CAAACAAAAA
1501 CACAAACCAC AAAGGGCAGG TCTGAAACTG CCATTTAAAA AAAAATTTGA TAAACTTAAA
1561 AAAATATATA TCCACAGATG CAGGTGAAGA ACCTGTTGTC TTCCTCAAGC CTCTTTTTCA
1621 CCCATGGGTG GAAATGGTGC CCTGGACACC CAGGCCCACG AGGTCTTTGC GTGGGGTCCC
1681 TACACAGGGC TTTAGCTTAC ACTGTGCTGC CCTCCTGTCC CCCGAGTTCC CAGTCTGTCA
1741 AAATCCAACC TGGTCTCCCA GGCCCAGGGC AAATGCCACC TCCTCCATGA AGCCTGCCAC
1801 ATCCTTTGCA CACCCTTGGG CGCTGACCTT GTTCTCCCAG CGCACAGGCA CGGGTAGTTT
1861 GCCCCTGTAG TAGTAACTCA GGCACAAAAC GAACTCTTGC TGAGGCTCGG CCGCGCAGAG
1921 CTGAGGGTTG CCGCTTCCAG GTTCAAGTGC ATTTTGAGTT TCATTCCCAG CTTCCTTCTT
1981 TTTCTGGTCT TTAATTTCTT CTCCGGATTA GGTCCCACTC AATGCTTTCC TTCTCAATTT
2041 CCAAAAGAGT ATGGTCAGAG CCAGCAGCAC ACCACCTTCC CCATGGGTGG GGGGGGGGCC
2101 AGCCTGTGGC GGGGGTGCGG GTCCCATCTT TTCGAAGGAA TTGACCCACA GTGGGCGGGT
2161 CCACCTTTGA CCTTGCCCCA GGGAGCGCAG ACAGAAAAAA GATCCTTGCT TAGTTTGAGG
2221 GGCCGCTGGG GTGCTCGGTT TGTCTTCAGA GGCCTGTCTG TAACACCAAT GCCAACCCGG
2281 TGGCACTGAC TGGTCACCCT GAAGGCCACG GCCAGTGTCC TAGGAAGGGA CTCAATTTCT
2341 AGCTGTGCCA CCTGAGATTC TGGGGTTAGG CTGGTTGTGC TTCTGAAGTT CCACTGTGCT
2401 CAAAGTGCTT GGTGAAAGTT AGCGAAGGTG ATTTTACAAA AATAGATGCA TAAAATGTCT
2461 AGGAAACACA AAAAATCCTC ATTACTCTTC TCTCCAAATA TTTTTTAAGC CCCAACTGGA
2521 CCCTAGGCAA AAGTGAGTGG CACTCCTCTG CCAGGACTCC AGGCAAGCCC CGGCATCTTC
2581 TTGCTGCCGT CCCAGACAAC AGAAGTTACC AGATGAACAG ACTTGGATGG GCCACGGGGG
2641 TGGAGAGCTG GAAAGCTTGG CTGTGCCTCT CGATGATGAT TAAGATTTCA ATATTTACAG
2701 CAAAACCACA AAGCAAATGA TAGAATAAAG CAAAACAATG GAAAATGTGA GTTCACTCGT
2761 GAGAGAGGTA CGTATGTGAG CTCTGAGGAA ATTACAGAGG GAACGCATGC AGCGGGACAG
2821 CTCTCCCAAT CGCAGCGTGC AAAGTAGACA TCCATAGTGT CTTTTGAAAA ATGAAAAACA
2881 CATTACTTTG AACAGCCAAG AAAAAAATTG CAATTTATTA AGATTCAATA AAGCGTTGTA
2941 CTTTCGAAAG CCAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA
3001 AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAA
B: aminoacid sequence (SEQ ID NO:3) length: 106 amino acid
1 MHSRPWAWGA SEMCFCFFEM GSFCCPSRST VARSQLTAAS TSWAQVILPP QPPECLLLVF
61 MMTWGPGILH LCCSGASPAP GAHQPQLRRE ALICRAKQLC FHVDPA
C. Nucleotide and amino acid composite sequence (SEQ ID NO:2) clone number and protein name: FP3361 start code: 654 ATG stop coding: 972 TAG protein molecular weight: 11650.08Da
1 GG TTT CAC CAT GAT GGT CAG GCT GGT CAA GAT CTC CTG ACC TCA AGT GAT CTG CCC GCC 59
60 TCG GCC TCC CAA AGT GCT GGG ATT ACA GAC ATG AAC CAC AAC ACC TGC CAA GCC CAG GTG 119
120 TTT TGA ATC CAG CAC GGG CAA TGG CAA GAC CCT GTC CCA ACA AAC AAA AAA ACA CTG GTG 179
180 CCT GAG GCC CAC CCC GAG AGA TTC TGA TTG AGT TGA TTT GGA TTT GAA TTA TCT TTA AAA 239
240 TTT TGG ATG TGA ATT TTT TTT TTT TTT CTT GGA ATG GAG TTT CAC TCC TGT TCC CCA GGC 299
300 TGG AGT GCA ACG GGA CGA TCC TGG CTC ACC GCA ACC TCC GCC TAC TGG GTT CAA GCC ATT 359
360 CTC CTG CCC CAG CTT CCC AAG TAG CTG GGA TTA CAG GCA TGT GCT ACC ATG CCC AGC TAA 419
420 TTT TGT ATT TTT AGT AGA GAC AGG GTT TCT CCA TGT TGC TCA GGC TGG TCT TGA ACT CCT 479
480 GAC CCC AGG TGA TCC GCC CGC CCC GGA CTC CCA AAG TGC TGG GAT TAC AGG CAT GAG CCA 539
540 CCA TGC CTG GCC GGA TGT GAA TTA TCT TAA AAA TTT TCA GGT AAT TCT AAT GGG CCA AGG 599
600 TTG AGA ACC CCT GCT CTG GGC CCA TCC GAG CAC CAG GCC GTC ACA AAC GCA TGC ATG CAC 659
1 Met His 2
660 TCA CGC CCG TGG GCT TGG GGG GCT TCG GAA ATG TGC TTC TGC TTT TTT GAG ATG GGG TCT 719
3 Ser Arg Pro Trp Ala Trp Gly Ala Ser Glu Met Cys Phe Cys Phe Phe Glu Met Gly Ser 22
720 TTC TGT TGC CCA TCC CGG AGC ACA GTG GCA CGA TCA CAG CTC ACT GCA GCC TCG ACC TCC 779
23 Phe Cys Cys Pro Ser Arg Ser Thr Val Ala Arg Ser Gln Leu Thr Ala Ala Ser Thr Ser 42
780 TGG GCT CAG GTG ATC CTC CCG CCC CAG CCT CCT GAG TGT CTG CTT CTG GTT TTC ATG ATG 839
43 Trp Ala Gln Val Ile Leu Pro Pro Gln Pro Pro Glu Cys Leu Leu Leu Val Phe Met Met 62
840 ACC TGG GGC CCA GGC ATA CTA CAC TTG TGC TGT TCA GGG GCC AGT CCT GCA CCA GGA GCC 899
63 Thr Trp Gly Pro Gly Ile Leu His Leu Cys Cys Ser Gly Ala Ser Pro Ala Pro Gly Ala 82
900 CAT CAG CCA CAG CTC CGC CGA GAA GCA CTG ATA TGC AGA GCT AAG CAG CTT TGT TTC CAC 959
83 His Gln Pro Gln Leu Arg Arg Glu Ala Leu Ile Cys Arg Ala Lys Gln Leu Cys Phe His 102
960 GTG GAT CCT GCG TAG GTT TTC TTG GTC CAT CCG TAG ACA CCG CAC TCC TGC AGA GGA TCT 1019
103 Val Asp Pro Ala *** 107
1020 TCC CGG GAT GCC CCA CCG TCT CTG TTT TCC CCC TTC ACT GAA CAC TCA GTC GGG GCT CGC 1079
1080 CAT GAT GCC TCT GTG TCT GCT GGC TTC TCC CCC ATT GGA ACA GCC TTC TTG GCA CGC CAC 1139
1140 ACT GCT AGC TGC TGG GCA CTG TGC TTT CTG CCT TTA CCG TTC TGC CGT GAT GTT GCC AAA 1199
1200 ATA GCA GCA ACA ACA ACA ACA ACA AAG GCT GGG CAC CTG GCT CAT GCC TGT ATT CCC AGC 1259
1260 AGT TCG AGA TCA GCC TGG GCA ACA TGG TGA GAC CCT ATC TCT ACA AAA ATA AAA AAT GAA 1319
1320 ATG AGC TGG GTG GGG TGG CGC ATG CAT GCC TGT GGT CCC AGC TAC TTG GGA GGC TGA GGT 1379
1380 CGG AAG CTC GCT GGT CGG AAG ATC GCT GGA GCT TAG CCT TGA GGT CAA GGT TGC AGT GAG 1439
1440 CCG AGA TTA CAT CAC TGC ACT CCA GCG TGG GAG ACA GAG ACC CTG TAT TAA CAA ACA AAA 1499
1500 ACA CAA ACC ACA AAG GGC AGG TCT GAA ACT GCC ATT TAA AAA AAA ATT TGA TAA ACT TAA 1559
1560 AAA AAT ATA TAT CCA CAG ATG CAG GTG AAG AAC CTG TTG TCT TCC TCA AGC CTC TTT TTC 1619
1620 ACC CAT GGG TGG AAA TGG TGC CCT GGA CAC CCA GGC CCA CGA GGT CTT TGC GTG GGG TCC 1679
1680 CTA CAC AGG GCT TTA GCT TAC ACT GTG CTG CCC TCC TGT CCC CCG AGT TCC CAG TCT GTC 1739
1740 AAA ATC CAA CCT GGT CTC CCA GGC CCA GGG CAA ATG CCA CCT CCT CCA TGA AGC CTG CCA 1799
1800 CAT CCT TTG CAC ACC CTT GGG CGC TGA CCT TGT TCT CCC AGC GCA CAG GCA CGG GTA GTT 1859
1860 TGC CCC TGT AGT AGT AAC TCA GGC ACA AAA CGA ACT CTT GCT GAG GCT CGG CCG CGC AGA 1919
1920 GCT GAG GGT TGC CGC TTC CAG GTT CAA GTG CAT TTT GAG TTT CAT TCC CAG CTT CCT TCT 1979
1980 TTT TCT GGT CTT TAA TTT CTT CTC CGG ATT AGG TCC CAC TCA ATG CTT TCC TTC TCA ATT 2039
2040 TCC AAA AGA GTA TGG TCA GAG CCA GCA GCA CAC CAC CTT CCC CAT GGG TGG GGG GGG GGC 2099
2100 CAG CCT GTG GCG GGG GTG CGG GTC CCA TCT TTT CGA AGG AAT TGA CCC ACA GTG GGC GGG 2159
2160 TCC ACC TTT GAC CTT GCC CCA GGG AGC GCA GAG AGA AAA AAG ATC CTT GCT TAG TTT GAG 2219
2220 GGG CCG CTG GGG TGC TCG GTT TGT CTT CAG AGG CCT GTC TGT AAC ACC AAT GCC AAC CCG 2279
2280 GTG GCA CTG ACT GGT CAC CCT GAA GGC CAC GGC CAG TGT CCT AGG AAG GGA CTC AAT TTC 2339
2340 TAG CTG TGC CAC CTG AGA TTC TGG GGT TAG GCT GGT TGT GCT TCT GAA GTT CCA CTG TGC 2399
2400 TCA AAG TGC TTG GTG AAA GTT AGC GAA GGT GAT TTT ACA AAA ATA GAT GCA TAA AAT GTC 2459
2460 TAG GAA ACA CAA AAA ATC CTC ATT ACT CTT CTC TCC AAA TAT TTT TTA AGC CCC AAC TGG 2519
2520 ACC CTA GGC AAA AGT GAG TGG CAC TCC TCT GCC AGG ACT CCA GGC AAG CCC CGG CAT CTT 2579
2580 CTT GCT GCC GTC CCA GAC AAC AGA AGT TAC CAG ATG AAC AGA CTT GGA TGG GCC ACG GGG 2639
2640 GTG GAG AGC TGG AAA GCT TGG CTG TGC CTC TCG ATG ATG ATT AAG ATT TCA ATA TTT ACA 2699
2700 GCA AAA CCA CAA AGC AAA TGA TAG AAT AAA GCA AAA CAA TGG AAA ATG TGA GTT CAC TCG 2759
2760 TGA GAG AGG TAC GTA TGT GAG CTC TCA GGA AAT TAC AGA GGG AAC GCA TGC AGC GGG ACA 2819
2820 GCT CTC CCA ATC GCA GCG TGC AAA GTA GAC ATC CAT AGT GTC TTT TGA AAA ATG AAA AAC 2879
2880 ACA TTA CTT TGA ACA GCC AAG AAA AAA ATT GCA ATT TAT TAA GAT TCA ATA AAG CGT TGT 2939
2940 ACT TTC GAA AGC CAA AAA AAA AAA AAA AAA AAA AAA AAA AAA AAA AAA AAA AAA AAA AAA 2999
3000 AAA AAA AAA AAA AAA AAA AAA AAA AAA AAA AAA AAA AAA AAA AAA AA 3046
2. FP7072
A: nucleotide sequence (SEQ ID NO:4) length: 2224 bases
1 GGGACCTGGG GGCTGTGGCC GGGGGCGGCC GTTGACCTGG TGACCGCGGC GCCGCCCCAG
61 ACCGGGGGCG CAGTCCCACT CGCTCCGAGC CCCGGTCCCC CAAGCCTCCC TCCCGGGTAC
121 CTGGGGCCGC GCCCGCCCTG CGCCCAGCTC CGCCCTCCGT CGGCCCAGGC CTGACAGAGC
181 CCGGCAGCCA TGAGTGCCAA CCCCCGGTGG GACATCAGCA GGGCGCTGGG GGTGGCCAAG
241 CTCTTCCACC TGGTGTGCGG GGTGCGGGAA GCCTGCGTGA CCCCGTTCCT GACCCTTTAC
301 CTGAGGCAGC TGGGCTTGGC CGCGCCCTGG GTGGGCACCC TAATGGGAAC CAAGCACCTA
361 ATCGCTGCCT TCTGGGCTCC CGTCTGTGCC TTCCTGGCCA AAAGCTACCG GAAAAGGAGA
421 GCGCTTCTGA TCGGCTCCCT GCTCGGCTCG GTGGGGGCCA GCCTGCTGAT GGTCCTGGTC
481 CCACCGGTAG ACAAAAATCG GGTGCACTTC CCTTGTAATG GAAGCAGCGG CCTGACCAGC
541 ACAGACGCAC TCCCGGGGGT CACGCTACCT GTGAACATCA CCTCGGCCCA AGAGTCTGCC
601 TCCAGCCACC CAGCCAAGAG GACTGCAGAG GTGGAAATGC CTGGCTTCAG AAACCCACCT
661 GGTGAAAGTG ACCGAGAAAC TTTCCGTGAT CTGCACGTCT ACTTAGCGCC CTCCGTTGAA
721 GGAGCTAGGA CCACATCCCA AGCTCTCCTC CATCCTGTCA CTTCGGGGCT GAAAGATCAT
781 CCCTGGGAAG TTACTTTTGA GGTGGTCAAG ACAGCCCTCC CCTTGCTTCC TGGGGGGAAA
841 GGGCCCGGGA ATCCAGCCAA TTTGTCAGGG ACCAAGGGGA AAGCCTGGGC TTTTGACCTG
901 TCCTTGGAGG CGTTGCGGCG GACTTTTATC CTCTCCTTGG GGTCCGTGGC GTTCTGGGAG
961 CTGCTGACAG CGCCTCTGGA GCAGGTGGCA GATGACAGCC TTTATGAGTT CCTGGATTTT
1021 GTGGATGCCA CTGACCGATA CAGAAGCCTG TGGGTCTGGA GGTTGCTGGG CATGTCGGCA
1081 GGCGTGTGTG GCATCACAGC CTTGGTGGGG CAGCTGGACT GCTTCCTGAT GACCAGTGGC
1141 CCCCGAGGTG TGGTCCACTT CTATGGGTAC TCGGTGGTCA GCACCCTGGC CTTACTGGTG
1201 AGCATTGCCT TTCCCATTCC CATCTGTCAG CAGTGGGAGC CCAGCTACAA AAGGGTCAAA
1261 GCACTGTCCA TTGTGGGGGG GGTGACCCCC ACCTCATTCT CCTCGCCTCC ACCACTGTTT
1321 TGGTAGGAGC CATCGTCAGT ACTGTCCAGA ACTTTCTGTT CTGGCACATG AAGGACCATG
1381 GGAGCGGCGA GCTGGTCATG GGTTTCTCGG TCGCCCTCAG CTTGCTGGGG GAAATTCTGC
1441 TTCATCCGTT CAAAGCTACA TTGCTTAGGA AACTGTCCAG GACGGGCCTG GTGGGGCTGG
1501 GGCTGAGCTG CCTCGCTGGG CAGCTGCTGT ACTACTCTTT CCTCTGGAGC TGGTGGTCCG
1561 TCCTCCCCAT TCAGATCTTG AGTGCCATTA GCAACAGAGC TTTGTGGTGG GCTGTGGGGG
1621 CCTCAGTAGA GGACCTGGCC ACTCCCCGCA TGGAGAGGGC TCTGAGTGCC TTGTTCCGAG
1681 GCCACTTTTA CGGGAGTGGC TGTAGCCTGG GCAGCTTTGT CGGGGGCTTC GTGGTGATGC
1741 GCTTCAGCCT GGCTGTGCTC TACCAGGCCT GCTGTGTGGC CCTGTTGCTC TGGTTGGCCT
1801 TGCTCCTGTC CATACAGCGG AGGCTGCCCC GAGAGCGGAA AATCAAGTAC TCGAAGCTGC
1861 TGTCCATGGA GGTGAGTGAC ACCAGTGACT CTGAGCAGGG GACAGAACAG GACTGGCTTG
1921 TGAAGGCCAT GAGGGAGGAA CACTCAGACT GAAAGGGCTG AGAAATCCAG AGTGTGCTGA
1981 TCCAGCAAGG AACGAATGGA CTGAACAAAA CTCAGCCTGC TGAGGACAGA AACCTGCCCT
2041 GGACTGCTGG GAGCCGGGGA AGAGAGGATG GGTCTGTGCT GAAGGCCCAA CAGGATCATC
2101 TCATTGCATG ATTTTCTTTA CTTTTGAAGT AAAAGGAGAT TTAACTTTTT GCCAATCTTT
2161 TTTAGATAAT GGAGGAAGAA TACATTTGCT TTTTAAAAAG TTAAAAAAAA AAAAAAAAAA
2221 AAAA
B: aminoacid sequence (SEQ ID NO:6) length: 378 amino acid
1 MSANPRWDIS RALGVAKLFH LVCGVREACV TPFLTLYLRQ LGLAAPWVGT LMGTKHLIAA
61 FWAPVCAFLA KSYRKRRALL IGSLLGSVGA SLLMVLVPPV DKNRVHFPCN GSSGLTSTDA
121 LPGVTLPVNI TSAQESASSH PAKRTAEVEM PGFRNPPGES DRETFRDLHV YLAPSVEGAR
181 TTSQALLHPV TSGLKDHPWE VTFEVVKTAL PLLPGGKGPG NPANLSGTKG KAWAFDLSLE
241 ALRRTFILSL GSVAFWELLT APLEQVADDS LYEFLDFVDA TDRYRSLWVW RLLGMSAGVC
301 GITALVGQLD CFLMTSGPRG VVHFYGYSVV STLALLVSIA FPIPICQQWE PSYKRVKALS
361 IVGGVTPTSF SSPPPLFW
C. Nucleotide and amino acid composite sequence (SEQ ID NO:5) clone number and protein name: FP7072 start code: 190 ATG stop coding: 1324 TAG protein molecular weight: 40874.15Da
1 GGG ACC TGG GGG CTG TGG CCG GGG GCG GCC GTT GAC CTG GTG ACC GCG GCG CCG CCC CAG 60
61 ACC GGG GGC GCA GTC CCA CTC GCT CCG AGC CCC GGT CCC CCA AGC CTC CCT CCC GGG TAC 120
121 CTG GGG CCG CGC CCG CCC TGC GCC CAG CTC CGC CCT CCG TCG GCC CAG GCC TGA CAG AGC 180
181 CCG GCA GCC ATG AGT GCC AAC CCC CGG TGG GAC ATC AGC AGG GCG CTG GGG GTG GCC AAG 240
1 Met Ser Ala Asn Pro Arg Trp Asp Ile Ser Arg Ala Leu Gly Val Ala Lys 17
241 CTC TTC CAC CTG GTG TGC GGG GTG CGG GAA GCC TGC GTG ACC CCG TTC CTG ACC CTT TAC 300
18 Leu Phe His Leu Val Cys Gly Val Arg Glu Ala Cys Val Thr Pro Phe Leu Thr Leu Tyr 37
301 CTG AGG CAG CTG GGC TTG GCC GCG CCC TGG GTG GGC ACC CTA ATG GGA ACC AAG CAC CTA 360
38 Leu Arg Gln Leu Gly Leu Ala Ala Pro Trp Val Gly Thr Leu Met Gly Thr Lys His Leu 57
361 ATC GCT GCC TTC TGG GCT CCC GTC TGT GCC TTC CTG GCC AAA AGC TAC CGG AAA AGG AGA 420
58 Ile Ala Ala Phe Trp Ala Pro Val Cys Ala Phe Leu Ala Lys Ser Tyr Arg Lys Arg Arg 77
421 GCG CTT CTG ATC GGC TCC CTG CTC GGC TCG GTG GGG GCC AGC CTG CTG ATG GTC CTG GTC 480
78 Ala Leu Leu Ile Gly Ser Leu Leu Gly Ser Val Gly Ala Ser Leu Leu Met Val Leu Val 97
481 CCA CCG GTA GAC AAA AAT CGG GTG CAC TTC CCT TGT AAT GGA AGC AGC GGC CTG ACC AGC 540
98 Pro Pro Val Asp Lys Asn Arg Val His Phe Pro Cys Asn Gly Ser Ser Gly Leu Thr Ser 117
541 ACA GAC GCA CTC CCG GGG GTC ACG CTA CCT GTG AAC ATC ACC TCG GCC CAA GAG TCT GCC 600
118 Thr Asp Ala Leu Pro Gly Val Thr Leu Pro Val Asn Ile Thr Ser Ala Gln Glu Ser Ala 137
601 TCC AGC CAC CCA GCC AAG AGG ACT GCA GAG GTG GAA ATG CCT GGC TTC AGA AAC CCA CCT 660
138 Ser Ser His Pro Ala Lys Arg Thr Ala Glu Val Glu Met Pro Gly Phe Arg Asn Pro Pro 157
661 GGT GAA AGT GAC CGA GAA ACT TTC CGT GAT CTG CAC GTC TAC TTA GCG CCC TCC GTT GAA 720
158 Gly Glu Ser Asp Arg Glu Thr Phe Arg Asp Leu His Val Tyr Leu Ala Pro Ser Val Glu 177
721 GGA GCT AGG ACC ACA TCC CAA GCT CTC CTC CAT CCT GTC ACT TCG GGG CTG AAA GAT CAT 780
178 Gly Ala Arg Thr Thr Ser Gln Ala Leu Leu His Pro Val Thr Ser Gly Leu Lys Asp His 197
781 CCC TGG GAA GTT ACT TTT GAG GTG GTC AAG ACA GCC CTC CCC TTG CTT CCT GGG GGG AAA 840
198 Pro Trp Glu Val Thr Phe Glu Val Val Lys Thr Ala Leu Pro Leu Leu Pro Gly Gly Lys 217
841 GGG CCC GGG AAT CCA GCC AAT TTG TCA GGG ACC AAG GGG AAA GCC TGG GCT TTT GAC CTG 900
218 Gly Pro Gly Asn Pro Ala Asn Leu Ser Gly Thr Lys Gly Lys Ala Trp Ala Phe Asp Leu 237
901 TCC TTG GAG GCG TTG CGG CGG ACT TTT ATC CTC TCC TTG GGG TCC GTG GCG TTC TGG GAG 960
238 Ser Leu Glu Ala Leu Arg Arg Thr Phe Ile Leu Ser Leu Gly Ser Val Ala Phe Trp Glu 257
961 CTG CTG ACA GCG CCT CTG GAG CAG GTG GCA GAT GAC AGC CTT TAT GAG TTC CTG GAT TTT 1020
258 Leu Leu Thr Ala Pro Leu Glu Gln Val Ala Asp Asp Ser Leu Tyr Glu Phe Leu Asp Phe 277
1021 GTG GAT GCC ACT GAC CGA TAC AGA AGC CTG TGG GTC TGG AGG TTG CTG GGC ATG TCG GCA 1080
278 Val Asp Ala Thr Asp Arg Tyr Arg Ser Leu Trp Val Trp Arg Leu Leu Gly Met Ser Ala 297
1081 GGC GTG TGT GGC ATC ACA GCC TTG GTG GGG CAG CTG GAC TGC TTC CTG ATG ACC AGT GGC 1140
298 Gly Val Cys Gly Ile Thr Ala Leu Val Gly Gln Leu Asp Cys Phe Leu Met Thr Ser Gly 317
1141 CCC CGA GGT GTG GTC CAC TTC TAT GGG TAC TCG GTG GTC AGC ACC CTG GCC TTA CTG GTG 1200
318 Pro Arg Gly Val Val His Phe Tyr Gly Tyr Ser Val Val Ser Thr Leu Ala Leu Leu Val 337
1201 AGC ATT GCC TTT CCC ATT CCC ATC TGT CAG CAG TGG GAG CCC AGC TAC AAA AGG GTC AAA 1260
338 Ser Ile Ala Phe Pro Ile Pro Ile Cys Gln Gln Trp Glu Pro Ser Tyr Lys Arg Val Lys 357
1261 GCA CTG TCC ATT GTG GGG GGG GTG ACC CCC ACC TCA TTC TCC TCG CCT CCA CCA CTG TTT 1320
358 Ala Leu Ser Ile Val Gly Gly Val Thr Pro Thr Ser Phe Ser Ser Pro Pro Pro Leu Phe 377
1321 TGG TAG GAG CCA TCG TCA GTA CTG TCC AGA ACT TTC TGT TCT GGC ACA TGA AGG ACC ATG 1380
378 Trp *** 379
1381 GGA GCG GCG AGC TGG TCA TGG GTT TCT CGG TCG CCC TCA GCT TGC TGG GGG AAA TTC TGC 1440
1441 TTC ATC CGT TCA AAG CTA CAT TGC TTA GGA AAC TGT CCA GGA CGG GCC TGG TGG GGC TGG 1500
1501 GGC TGA GCT GCC TCG CTG GGC AGC TGC TGT ACT ACT CTT TCC TCT GGA GCT GGT GGT CCG 1560
1561 TCC TCC CCA TTC AGA TCT TGA GTG CCA TTA GCA ACA GAG CTT TGT GGT GGG CTG TGG GGG 1620
1621 CCT CAG TAG AGG ACC TGG CCA CTC CCC GCA TGG AGA GGG CTC TGA GTG CCT TGT TCC GAG 1680
1681 GCC ACT TTT ACG GGA GTG GCT GTA GCC TGG GCA GCT TTG TCG GGG GCT TCG TGG TGA TGC 1740
1741 GCT TCA GCC TGG CTG TGC TCT ACC AGG CCT GCT GTG TGG CCC TGT TGC TCT GGT TGG CCT 1800
1801 TGC TCC TGT CCA TAC AGC GGA GGC TGC CCC GAG AGC GGA AAA TCA AGT ACT CGA AGC TGC 1860
1861 TGT CCA TGG AGG TGA GTG ACA CCA GTG ACT CTG AGC AGG GGA CAG AAC AGG ACT GGC TTG 1920
1921 TGA AGG CCA TGA GGG AGG AAC ACT CAG ACT GAA AGG GCT GAG AAA TCC AGA GTG TGC TGA 1980
1981 TCC AGC AAG GAA CGA ATG GAC TGA ACA AAA CTC AGC CTG CTG AGG ACA GAA ACC TGC CCT 2040
2041 GGA CTG CTG GGA GCC GGG GAA GAG AGG ATG GGT CTG TGC TGA AGG CCC AAC AGG ATC ATC 2100
2101 TCA TTG CAT GAT TTT CTT TAC TTT TGA AGT AAA AGG AGA TTT AAC TTT TTG CCA ATC TTT 2160
2161 TTT AGA TAA TGG AGG AAG AAT ACA TTT GCT TTT TAA AAA GTT AAA AAA AAA AAA AAA AAA 2220
2221 AAA A 2224
3.FP8080 albumen
A: nucleotide sequence (SEQ ID NO:7) length: 2916 bases
1 GAAAAGACGT TTAATAAAAC AGCTTTACCT TTTAAAAATA AAATTTCAAT CCCAGTCTGC
61 CTCAGGTACG GGTGGGGGTG TGCAGCGTGA CTCCTGGCTG TGTTCCTGTA CCGTTTGTGG
121 CCTCCGGGGT CTTGCGCGTC CAGGGCCCAG ATACCGAGTA ACTGCCATGT GGTGTCGTGG
181 TGGCGTCCAT AGGCCTCCGT GGTGGGGCGG GGGCGGGGCT GTGGGTGTGC AGGGCAGCTG
241 CATTTTCCAA GAGTGGAGAT GGAGGATGGA GATTTGCTTC GTAAAGGCCT GGGAGCTACA
301 TGGGAGGTAG GGAGGGGCCA GCCACACCCA GGGCTGGGGG GCCGGTGGAC TCTGCCCAGT
361 GAAGCCAGGG GAGTGTCAGC AATGAGGGGC CTGGACAGCG ACGCGGGACC CGCACCTGGG
421 AGCGGCAGAG ATGGAGCCCA GCTCGCTAAA CGTACCCTAA GGGTGCACGC TGACCGTCTT
481 TGATCCCTGG ACAGTGTCCA TAGAGTTGTC TCCCCTTCGG TCACATTGTC CCTGGCCTTG
541 GAGTTTCTGC CTCAGGGAAA GACCCACTAA AACCACCCGG GCTCTGGCGG CCTCCTCTTC
601 TCCAGAGCAT GGTGGCGCTG CCTCCTGAGC AGAGTCACTG AGAGTGGGTC CAAGGAAGGG
661 ACCCACTTAG CAAAGGTTGT TGTGTGGTGC TGTGCAGGCA CGGCGCCTGG AGTTGGCCCC
721 GCCAGCATCG GGGTGGCTTG GGGCTCTGGT TCCTAAGCTT CTGCTTCTGA GTTGGTCAGC
781 ACGTTGGCTC TGCGCCCTGC AGTGCCTGTC TTGATGGTGG TCCCTGGATG TGCTGAAGGG
841 CTCCAGGAGC AAGGGGCCAG ACCACACTTT GGCTGAGTCC TTGGGTAGTT TCAGAGGGTC
901 ACCTGCACAG TCTGCATTGA GCACCCTGGG GCTGAGGCCG TGGGTCAAGT TCTGCCTGAC
961 AGTGCTTTGA TTATATGGTG ATGTGATTCT GCAAACATAA CCCCCCAACT GAGTGCCAGC
1021 TGCTGGGTGT CCCCACCTCA AGAGCCAAGG GGAGGCCACC CTGGAACTGG TGCTTTGCCC
1081 TTGGTGTCAT GGAGGTGTCT GGGTCCACAC GTGGTGGGTG GGCTGGGCCT GCAGTCACCC
1141 AGCATGGCCG GCGGGATTGG ACCCAGCTGC AACCTGGCAG GTTCCATGGG GCTTCCTGTG
1201 CCTGGGCCTG CCTGTGTCTC ATGCCTGTCA ATAAAGGAGA AAGCAACCTG GGAAAAACAT
1261 TCCTCTCTGT ACCCAGGGGA CTGCCAAGTG TGCTTTATAG AAGGCTGGAA GACCGACTGA
1321 AGTAGGAAGC TGTCACATGT TTGAGCCAAC TGAGCAAAGC AGCTGTTGCC TGCCAGACAG
1381 CCACCCCAAG ACCAAGGGAG CCAGCCTGTC GTTTGTCCTT CTAGCCTGGA AGACCAGTTC
1441 TTGGCTCTTG GGCACTCACA TTGGCCACTG GCCGCAACTG CTGAATGGCT TCACTAATGT
1501 CCCTGGGGTC TGTATCTGCC CTCTCTTCTA TTCCCTAGAA ACTCTGGCCG GGCATGGCGG
1561 CTCACGCCTG TCATCCCAGC ACTTAGGGAG GCCGAGGCAG GTGGATCACA AGGTCAAGAG
1621 ATCGACACCA TCCTGGCCAA CATGGTGAAA CCCCATCTCT ACTAAAAATA CAAAAATTAG
1681 CTGGACATGG TGGTGGGCGC CTGTAGTCCC AGCTACTTGG GAGGCTGAGG CAGGAGAATT
1741 GCTTGAACTC GGGAGGCGTA GGTTGCAGTG AGCCAAGATC GCACCACTGC ACTCCAGCCT
1801 GGCAACAGAG CAAGACTCCT CAAAAAACAA ACAAACAAAC AAACAAACTC TGATCATTCA
1861 CCCGGTGGGA TCCCCGGCAG CTCCTTCTCC TTCCCGAGCT GTGGCTCATG GGCCATTGGA
1921 TTCAGAGACC TTGAGCTGGG TGCTGGCTGC CCTAAAGGAG CACGACCCCT CCAGCCTGGT
1981 GACCGGGAGG CAGAGAAGTA CAGCTGTCCT TGACAAAGGC CCAGCCTCTG CTGCCTCAGG
2041 CCAGCCTGGC CACACCCCGA TGCCACAGTC ATCTGCCCCG CCCAGGCATA AGCCTGACTC
2101 AGGTGATACA GGCGAGGCTT CTCCCATGGG TCCTCCCCTT GTGAGAAAGG GTCTGTGCTG
2161 TGTACCTTCT CTCTGCACAG CCATTGTCAG TGTGGTTTGG AGTGAGTGAG CCTCCCGCAC
2221 TGCTCAGGGG AGAGCTGGTG AGCATGACTC CATGGCCTCC TGCTGGTTGG GGTCTTCTTC
2281 TGATTCTCAC CCGCAGGAGC ACCCAGACCC CCCCTTTTCA GAAATGATTA CATTTTCCCT
2341 TCAAGAGTCA TCTGTGACCT GGTGGTGTGG AGGGACTGTC CCCTCCTACA GCCTGGCCGA
2401 CCCCCTTCCT CCAGGGCTGT GTGGAATCAC CTCTCAGAAG CTCACTCTGA AAAGCCACTC
2461 ACAGGGATTT CACCAGGATG TTGCTGTTCC TGCCGACACC TGGGCTTTGT GACAAATGCT
2521 GTTTGCTTCA AGCTGTTCAC AAGCAACTTC ACCAGAATCC CAAGCTGGTT CGTACTTAAT
2581 TTGTATACAT ACATATGCAT AATATACATG ACCATGTATG AGATACAGAC ACCATGGCTT
2641 CTGTCCATTG GCTCATTCGC CTGGTCGCAT GTTCACTCCA TGAGGTTAGG ACACTGCGGG
2701 TGGCTTTGGG ATGGGTGGTT TCAGGGACAA GGTGTCGCTG TGATCTCTGT ATTTGCTAAC
2761 TAGTCTTGTT TTGATTTCTA ACGGTCTAGT GGAGTACAGT TAATGTTGTG TTTATTTCTG
2821 TAACTGTAGG GGTGGAGTAT AAACCTAACT TTTGGAAACA TGAAGAAATA AACAATAGCG
2881 CCAGCCGTTA AAAAAAAAAA AAAAAAAAAA AAAAAA
B: aminoacid sequence (SEQ ID NO:9) length: 103 amino acid
1 MFEPTEQSSC CLPDSHPKTK GASLSFVLLA WKTSSWLLGT HIGHWPQLLN GFTNVPGVCI
61 CPLFYSLETL AGHGGSRLSS QHLGRPRQVD HKVKRSTPSW PTW
C. Nucleotide and amino acid composite sequence (SEQ ID NO:8) clone number and protein name: FP8080 start code: 1337 ATG stop coding: 1646 TGA protein molecular weight: 11425.50Da
1 G AAA AGA CGT TTA ATA AAA CAG CTT TAC CTT TTA AAA ATA AAA TTT CAA TCC CAG TCT 58
59 GCC TCA GGT ACG GGT GGG GGT GTG CAG CGT GAC TCC TGG CTG TGT TCC TGT ACC GTT TGT 118
119 GGC CTC CGG GGT CTT GCG CGT CCA GGG CCC AGA TAC CGA GTA ACT GCC ATG TGG TGT CGT 178
179 GGT GGC GTC CAT AGG CCT CCG TGG TGG GGC GGG GGC GGG GCT GTG GGT GTG CAG GGC AGC 238
239 TGC ATT TTC CAA GAG TGG AGA TGG AGG ATG GAG ATT TGC TTC GTA AAG GCC TGG GAG CTA 298
299 CAT GGG AGG TAG GGA GGG GCC AGC CAC ACC CAG GGC TGG GGG GCC GGT GGA CTC TGC CCA 358
359 GTG AAG CCA GGG GAG TGT CAG CAA TGA GGG GCC TGG ACA GCG ACG CGG GAC CCG CAC CTG 418
419 GGA GCG GCA GAG ATG GAG CCC AGC TCG CTA AAC GTA CCC TAA GGG TGC ACG CTG ACC GTC 478
479 TTT GAT CCC TGG ACA GTG TCC ATA GAG TTG TCT CCC CTT CGG TCA CAT TGT CCC TGG CCT 538
539 TGG AGT TTC TGC CTC AGG GAA AGA CCC ACT AAA ACC ACC CGG GCT CTG GCG GCC TCC TCT 598
599 TCT CCA GAG CAT GGT GGC GCT GCC TCC TGA GCA GAG TCA CTG AGA GTG GGT CCA AGG AAG 658
659 GGA CCC ACT TAG CAA AGG TTG TTG TGT GGT GCT GTG CAG GCA CGG CGC CTG GAG TTG GCC 718
719 CCG CCA GCA TCG GGG TGG CTT GGG GCT CTG GTT CCT AAG CTT CTG CTT CTG AGT TGG TCA 778
779 GCA CGT TGG CTC TGC GCC CTG CAG TGC CTG TCT TGA TGG TGG TCC CTG GAT GTG CTG AAG 838
839 GGC TCC AGG AGC AAG GGG CCA GAC CAC ACT TTG GCT GAG TCC TTG GGT AGT TTC AGA GGG 898
899 TCA CCT GCA CAG TCT GCA TTG AGC ACC CTG GGG CTG AGG CCG TGG GTC AAG TTC TGC CTG 958
959 ACA GTG CTT TGA TTA TAT GGT GAT GTG ATT CTG CAA ACA TAA CCC CCC AAC TGA GTG CCA 1018
1019 GCT GCT GGG TGT CCC CAC CTC AAG AGC CAA GGG GAG GCC ACC CTG GAA CTG GTG CTT TGC 1078
1079 CCT TGG TGT CAT GGA GGT GTC TGG GTC CAC ACG TGG TGG GTG GGC TGG GCC TGC AGT CAC 1138
1139 CCA GCA TGG CCG GCG GGA TTG GAC CCA GCT GCA ACC TGG CAG GTT CCA TGG GGC TTC CTG 1198
1199 TGC CTG GGC CTG CCT GTG TCT CAT GCC TGT CAA TAA AGG AGA AAG CAA CCT GGG AAA AAC 1258
1259 ATT CCT CTC TGT ACC CAG GGG ACT GCC AAG TGT GCT TTA TAG AAG GCT GGA AGA CCG ACT 1318
1319 GAA GTA GGA AGC TGT CAC ATG TTT GAG CCA ACT GAG CAA AGC AGC TGT TGC CTG CCA GAC 1378
1 Met Phe Glu Pro Thr Glu Gln Ser Ser Cys Cys Leu Pro Asp 14
1379 AGC CAC CCC AAG ACC AAG GGA GCC AGC CTG TCG TTT GTC CTT CTA GCC TGG AAG ACC AGT 1438
15 Ser His Pro Lys Thr Lys Gly Ala Ser Leu Ser Phe Val Leu Leu Ala Trp Lys Thr Ser 34
1439 TCT TGG CTC TTG GGC ACT CAC ATT GGC CAC TGG CCG CAA CTG CTG AAT GGC TTC ACT AAT 1498
35 Ser Trp Leu Leu Gly Thr His Ile Gly His Trp Pro Gln Leu Leu Asn Gly Phe Thr Asn 54
1499 GTC CCT GGG GTC TGT ATC TGC CCT CTC TTC TAT TCC CTA GAA ACT CTG GCC GGG CAT GGC 1558
55 Val Pro Gly Val Cys Ile Cys Pro Leu Phe Tyr Ser Leu Glu Thr Leu Ala Gly His G1y 74
1559 GGC TCA CGC CTG TCA TCC CAG CAC TTA GGG AGG CCG AGG CAG GTG GAT CAC AAG GTC AAG 1618
75 Gly Ser Arg Leu Ser Ser Gln His Leu Gly Arg Pro Arg Gln Val Asp His Lys Val Lys 94
1619 AGA TCG ACA CCA TCC TGG CCA ACA TGG TGA AAC CCC ATC TCT ACT AAA AAT ACA AAA ATT 1678
95 Arg Ser Thr Pro Ser Trp Pro Thr Trp *** 104
1679 AGC TGG ACA TGG TGG TGG GCG CCT GTA GTC CCA GCT ACT TGG GAG GCT GAG GCA GGA GAA 1738
1739 TTG CTT GAA CTC GGG AGG CGT AGG TTG CAG TGA GCC AAG ATC GCA CCA CTG CAC TCC AGC 1798
1799 CTG GCA ACA GAG CAA GAC TCC TCA AAA AAC AAA CAA ACA AAC AAA CAA ACT CTG ATC ATT 1858
1859 CAC CCG GTG GGA TCC CCG GCA GCT CCT TCT CCT TCC CGA GCT GTG GCT CAT GGG CCA TTG 1918
1919 GAT TCA GAG ACC TTG AGC TGG GTG CTG GCT GCC CTA AAG GAG CAC GAC CCC TCC AGC CTG 1978
1979 GTG ACC GGG AGG CAG AGA AGT ACA GCT GTC CTT GAC AAA GGC CCA GCC TCT GCT GCC TCA 2038
2039 GGC CAG CCT GGC CAC ACC CCG ATG CCA CAG TCA TCT GCC CCG CCC AGG CAT AAG CCT GAC 2098
2099 TCA GGT GAT ACA GGC GAG GCT TCT CCC ATG GGT CCT CCC CTT GTG AGA AAG GGT CTG TGC 2158
2159 TGT GTA CCT TCT CTC TGC ACA GCC ATT GTC AGT GTG GTT TGG AGT GAG TGA GCC TCC CGC 2218
2219 ACT GCT CAG GGG AGA GCT GGT GAG CAT GAC TCC ATG GCC TCC TGC TGG TTG GGG TCT TCT 2278
2279 TCT GAT TCT CAC CCG CAG GAG CAC CCA GAC CCC CCC TTT TCA GAA ATG ATT ACA TTT TCC 2338
2339 CTT CAA GAG TCA TCT GTG ACC TGG TGG TGT GGA GGG ACT GTC CCC TCC TAC AGC CTG GCC 2398
2399 GAC CCC CTT CCT CCA GGG CTG TGT GGA ATC ACC TCT CAG AAG CTC ACT CTG AAA AGC CAC 2458
2459 TCA CAG GGA TTT CAC CAG GAT GTT GCT GTT CCT GCC GAC ACC TGG GCT TTG TGA CAA ATG 2518
2519 CTG TTT GCT TCA AGC TGT TCA CAA GCA ACT TCA CCA GAA TCC CAA GCT GGT TCG TAC TTA 2578
2579 ATT TGT ATA CAT ACA TAT GCA TAA TAT ACA TGA CCA TGT ATG AGA TAC AGA CAC CAT GGC 2638
2639 TTC TGT CCA TTG GCT CAT TCG CCT GGT CGC ATG TTC ACT CCA TGA GGT TAG GAC ACT GCG 2698
2699 GGT GGC TTT GGG ATG GGT GGT TTC AGG GAC AAG GTG TCG CTG TGA TCT CTG TAT TTG CTA 2758
2759 ACT AGT CTT GTT TTG ATT TCT AAC GGT CTA GTG GAG TAC AGT TAA TGT TGT GTT TAT TTC 2818
2819 TGT AAC TGT AGG GGT GGA GTA TAA ACC TAA CTT TTG GAA ACA TGA AGA AAT AAA CAA TAG 2878
2879 CGC CAG CCG TTA AAA AAA AAA AAAAAA AAA AAA AAAAA 2916
All quote in this application as a reference at all documents that the present invention mentions, just quoted as a reference separately as each piece document.Should be understood that in addition those skilled in the art can make various changes or modifications the present invention after having read above-mentioned teachings of the present invention, these equivalent form of values fall within the application's appended claims institute restricted portion equally.
Sequence table
<110〉Shanghai Xinshijie Gene Techn Development Co., Ltd.
<120〉have new the people's albumen and the encoding sequence thereof of cancer suppressing function
<130>022535
<160>15
<170>PatentIn version 3.0
<210>1
<211>3046
<212>DNA
<213〉homo sapiens (Homo sapiens)
<400>1
ggtttcacca tgatggtcag gctggtcaag atctcctgac ctcaagtgat ctgcccgcct 60
cggcctccca aagtgctggg attacagaca tgaaccacaa cacctgccaa gcccaggtgt 120
tttgaatcca gcacgggcaa tggcaagacc ctgtcccaac aaacaaaaaa acactggtgc 180
ctgaggccca ccccgagaga ttctgattga gttgatttgg atttgaatta tctttaaaat 240
tttggatgtg aatttttttt ttttttcttg gaatggagtt tcactcttgt tccccaggct 300
ggagtgcaac gggacgatct tggctcaccg caacctccgc ctactgggtt caagccattc 360
tcctgcctca gcttcccaag tagctgggat tacaggcatg tgctaccatg cccagctaat 420
tttgtatttt tagtagagac agggtttctc catgttgctc aggctggtct tgaactcctg 480
acctcaggtg atccgcccgc ctcggactcc caaagtgctg ggattacagg catgagccac 540
catgcctggc cggatgtgaa ttatcttaaa aattttcagg taattctaat gggccaaggt 600
tgagaacccc tgctctgggc ccatccgagc accaggctgt cacaaacgca tgcatgcact 660
cacgcccgtg ggcttggggg gcttcggaaa tgtgcttctg cttttttgag atggggtctt 720
tctgttgccc atcccggagc acagtggcac gatcacagct cactgcagcc tcgacctcct 780
gggctcaggt gatcctcccg cctcagcctc ctgagtgtct gcttctggtt ttcatgatga 840
cctggggccc aggcatacta cacttgtgct gttcaggggc cagtcctgca ccaggagccc 900
atcagccaca gctccgccga gaagcactga tatgcagagc taagcagctt tgtttccacg 960
tggatcctgc gtaggttttc ttggtccatc cgtagacacc gcactcctgc agaggatctt 1020
ctcgggatgc cccactgtct ctgttttccc tcttcactga acactcagtc ggggctcgcc 1080
atgatgcctc tgtgtctgct ggcttctccc ccattggaac agccttcttg gcacgccaca 1140
ctgctagctg ctgggcactg tgctttctgc ctttaccgtt ctgccgtgat gttgccaaaa 1200
tagcagcaac aacaacaaca acaaaggctg ggcacctggc tcatgcctgt attcccagca 1260
gttcgagatc agcctgggca acatggtgag accctatctc tacaaaaata aaaaatgaaa 1320
tgagctgggt ggggtggcgc atgcatgcct gtggtcccag ctacttggga ggctgaggtc 1380
ggaagctcgc tggtcggaag atcgctggag cttagccttg aggtcaaggt tgcagtgagc 1440
cgagattaca tcactgcact ccagcgtggg agacagagac cctgtattaa caaacaaaaa 1500
cacaaaccac aaagggcagg tctgaaactg ccatttaaaa aaaaatttga taaacttaaa 1560
aaaatatata tccacagatg caggtgaaga acctgttgtc ttcctcaagc ctctttttca 1620
cccatgggtg gaaatggtgc cctggacacc caggcccacg aggtctttgc gtggggtccc 1680
tacacagggc tttagcttac actgtgctgc cctcctgtcc cccgagttcc cagtctgtca 1740
aaatccaacc tggtctccca ggcccagggc aaatgccacc tcctccatga agcctgccac 1800
atcctttgca cacccttggg cgctgacctt gttctcccag cgcacaggca cgggtagttt 1860
gcccctgtag tagtaactca ggcacaaaac gaactcttgc tgaggctcgg ccgcgcagag 1920
ctgagggttg ccgcttccag gttcaagtgc attttgagtt tcattcccag cttccttctt 1980
tttctggtct ttaatttctt ctccggatta ggtcccactc aatgctttcc ttctcaattt 2040
ccaaaagagt atggtcagag ccagcagcac accaccttcc ccatgggtgg ggggggggcc 2100
agcctgtggc gggggtgcgg gtcccatctt ttcgaaggaa ttgacccaca gtgggcgggt 2160
ccacctttga ccttgcccca gggagcgcag acagaaaaaa gatccttgct tagtttgagg 2220
ggccgctggg gtgctcggtt tgtcttcaga ggcctgtctg taacaccaat gccaacccgg 2280
tggcactgac tggtcaccct gaaggccacg gccagtgtcc taggaaggga ctcaatttct 2340
agctgtgcca cctgagattc tggggttagg ctggttgtgc ttctgaagtt ccactgtgct 2400
caaagtgctt ggtgaaagtt agcgaaggtg attttacaaa aatagatgca taaaatgtct 2460
aggaaacaca aaaaatcctc attactcttc tctccaaata ttttttaagc cccaactgga 2520
ccctaggcaa aagtgagtgg cactcctctg ccaggactcc aggcaagccc cggcatcttc 2580
ttgctgccgt cccagacaac agaagttacc agatgaacag acttggatgg gccacggggg 2640
tggagagctg gaaagcttgg ctgtgcctct cgatgatgat taagatttca atatttacag 2700
caaaaccaca aagcaaatga tagaataaag caaaacaatg gaaaatgtga gttcactcgt 2760
gagagaggta cgtatgtgag ctctgaggaa attacagagg gaacgcatgc agcgggacag 2820
ctctcccaat cgcagcgtgc aaagtagaca tccatagtgt cttttgaaaa atgaaaaaca 2880
cattactttg aacagccaag aaaaaaattg caatttatta agattcaata aagcgttgta 2940
ctttcgaaag ccaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 3000
aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaa 3046
<210>2
<211>3046
<212>DNA
<213〉homo sapiens (Homo sapiens)
<220>
<221>CDS
<222>(654)..(971)
<400>2
ggtttcacca tgatggtcag gctggtcaag atctcctgac ctcaagtgat ctgcccgcct 60
cggcctccca aagtgctggg attacagaca tgaaccacaa cacctgccaa gcccaggtgt 120
tttgaatcca gcacgggcaa tggcaagacc ctgtcccaac aaacaaaaaa acactggtgc 180
ctgaggccca ccccgagaga ttctgattga gttgatttgg atttgaatta tctttaaaat 240
tttggatgtg aatttttttt ttttttcttg gaatggagtt tcactcttgt tccccaggct 300
ggagtgcaac gggacgatct tggctcaccg caacctccgc ctactgggtt caagccattc 360
tcctgcctca gcttcccaag tagctgggat tacaggcatg tgctaccatg cccagctaat 420
tttgtatttt tagtagagac agggtttctc catgttgctc aggctggtct tgaactcctg 480
acctcaggtg atccgcccgc ctcggactcc caaagtgctg ggattacagg catgagccac 540
catgcctggc cggatgtgaa ttatcttaaa aattttcagg taattctaat gggccaaggt 600
tgagaacccc tgctctgggc ccatccgagc accaggctgt cacaaacgca tgc atg 656
Met
1
cac tca cgc ccg tgg gct tgg ggg gct tcg gaa atg tgc ttc tgc ttt 704
His Ser Arg Pro Trp Ala Trp Gly Ala Ser Glu Met Cys Phe Cys Phe
5 10 15
ttt gag atg ggg tct ttc tgt tgc cca tcc cgg agc aca gtg gca cga 752
Phe Glu Met Gly Ser Phe Cys Cys Pro Ser Arg Ser Thr Val Ala Arg
20 25 30
tca cag ctc act gca gcc tcg acc tcc tgg gct cag gtg atc ctc ccg 800
Ser Gln Leu Thr Ala Ala Ser Thr Ser Trp Ala Gln Val Ile Leu Pro
35 40 45
cct cag cct cct gag tgt ctg ctt ctg gtt ttc atg atg acc tgg ggc 848
Pro Gln Pro Pro Glu Cys Leu Leu Leu Val Phe Met Met Thr Trp Gly
50 55 60 65
cca ggc ata cta cac ttg tgc tgt tca ggg gcc agt cct gca cca gga 896
Pro Gly Ile Leu His Leu Cys Cys Ser Gly Ala Ser Pro Ala Pro Gly
70 75 80
gcc cat cag cca cag ctc cgc cga gaa gca ctg ata tgc aga gct aag 944
Ala His Gln Pro Gln Leu Arg Arg Glu Ala Leu Ile Cys Arg Ala Lys
85 90 95
cag ctt tgt ttc cac gtg gat cct gcg taggttttct tggtccatcc 991
Gln Leu Cys Phe His Val Asp Pro Ala
100 105
gtagacaccg cactcctgca gaggatcttc tcgggatgcc ccactgtctc tgttttccct 1051
cttcactgaa cactcagtcg gggctcgcca tgatgcctct gtgtctgctg gcttctcccc 1111
cattggaaca gccttcttgg cacgccacac tgctagctgc tgggcactgt gctttctgcc 1171
tttaccgttc tgccgtgatg ttgccaaaat agcagcaaca acaacaacaa caaaggctgg 1231
gcacctggct catgcctgta ttcccagcag ttcgagatca gcctgggcaa catggtgaga 1291
ccctatctct acaaaaataa aaaatgaaat gagctgggtg gggtggcgca tgcatgcctg 1351
tggtcccagc tacttgggag gctgaggtcg gaagctcgct ggtcggaaga tcgctggagc 1411
ttagccttga ggtcaaggtt gcagtgagcc gagattacat cactgcactc cagcgtggga 1471
gacagagacc ctgtattaac aaacaaaaac acaaaccaca aagggcaggt ctgaaactgc 1531
catttaaaaa aaaatttgat aaacttaaaa aaatatatat ccacagatgc aggtgaagaa 1591
cctgttgtct tcctcaagcc tctttttcac ccatgggtgg aaatggtgcc ctggacaccc 1651
aggcccacga ggtctttgcg tggggtccct acacagggct ttagcttaca ctgtgctgcc 1711
ctcctgtccc ccgagttccc agtctgtcaa aatccaacct ggtctcccag gcccagggca 1771
aatgccacct cctccatgaa gcctgccaca tcctttgcac acccttgggc gctgaccttg 1831
ttctcccagc gcacaggcac gggtagtttg cccctgtagt agtaactcag gcacaaaacg 1891
aactcttgct gaggctcggc cgcgcagagc tgagggttgc cgcttccagg ttcaagtgca 1951
ttttgagttt cattcccagc ttccttcttt ttctggtctt taatttcttc tccggattag 2011
gtcccactca atgctttcct tctcaatttc caaaagagta tggtcagagc cagcagcaca 2071
ccaccttccc catgggtggg gggggggcca gcctgtggcg ggggtgcggg tcccatcttt 2131
tcgaaggaat tgacccacag tgggcgggtc cacctttgac cttgccccag ggagcgcaga 2191
cagaaaaaag atccttgctt agtttgaggg gccgctgggg tgctcggttt gtcttcagag 2251
gcctgtctgt aacaccaatg ccaacccggt ggcactgact ggtcaccctg aaggccacgg 2311
ccagtgtcct aggaagggac tcaatttcta gctgtgccac ctgagattct ggggttaggc 2371
tggttgtgct tctgaagttc cactgtgctc aaagtgcttg gtgaaagtta gcgaaggtga 2431
ttttacaaaa atagatgcat aaaatgtcta ggaaacacaa aaaatcctca ttactcttct 2491
ctccaaatat tttttaagcc ccaactggac cctaggcaaa agtgagtggc actcctctgc 2551
caggactcca ggcaagcccc ggcatcttct tgctgccgtc ccagacaaca gaagttacca 2611
gatgaacaga cttggatggg ccacgggggt ggagagctgg aaagcttggc tgtgcctctc 2671
gatgatgatt aagatttcaa tatttacagc aaaaccacaa agcaaatgat agaataaagc 2731
aaaacaatgg aaaatgtgag ttcactcgtg agagaggtac gtatgtgagc tctgaggaaa 2791
ttacagaggg aacgcatgca gcgggacagc tctcccaatc gcagcgtgca aagtagacat 2851
ccatagtgtc ttttgaaaaa tgaaaaacac attactttga acagccaaga aaaaaattgc 2911
aatttattaa gattcaataa agcgttgtac tttcgaaagc caaaaaaaaa aaaaaaaaaa 2971
aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 3031
aaaaaaaaaa aaaaa 3046
<210>3
<211>106
<212>PRT
<213〉homo sapiens (Homo sapiens)
<400>3
Met His Ser Arg Pro Trp Ala Trp Gly Ala Ser Glu Met Cys Phe Cys
1 5 10 15
Phe Phe Glu Met Gly Ser Phe Cys Cys Pro Ser Arg Ser Thr Val Ala
20 25 30
Arg Ser Gln Leu Thr Ala Ala Ser Thr Ser Trp Ala Gln Val Ile Leu
35 40 45
Pro Pro Gln Pro Pro Glu Cys Leu Leu Leu Val Phe Met Met Thr Trp
50 55 60
Gly Pro Gly Ile Leu His Leu Cys Cys Ser Gly Ala Ser Pro Ala Pro
65 70 75 80
Gly Ala His Gln Pro Gln Leu Arg Arg Glu Ala Leu Ile Cys Arg Ala
85 90 95
Lys Gln Leu Cys Phe His Val Asp Pro Ala
100 105
<210>4
<211>2224
<212>DNA
<213〉homo sapiens (Homo sapiens)
<400>4
gggacctggg ggctgtggcc gggggcggcc gttgacctgg tgaccgcggc gccgccccag 60
accgggggcg cagtcccact cgctccgagc cccggtcccc caagcctccc tcccgggtac 120
ctggggccgc gcccgccctg cgcccagctc cgccctccgt cggcccaggc ctgacagagc 180
ccggcagcca tgagtgccaa cccccggtgg gacatcagca gggcgctggg ggtggccaag 240
ctcttccacc tggtgtgcgg ggtgcgggaa gcctgcgtga ccccgttcct gaccctttac 300
ctgaggcagc tgggcttggc cgcgccctgg gtgggcaccc taatgggaac caagcaccta 360
atcgctgcct tctgggctcc cgtctgtgcc ttcctggcca aaagctaccg gaaaaggaga 420
gcgcttctga tcggctccct gctcggctcg gtgggggcca gcctgctgat ggtcctggtc 480
ccaccggtag acaaaaatcg ggtgcacttc ccttgtaatg gaagcagcgg cctgaccagc 540
acagacgcac tcccgggggt cacgctacct gtgaacatca cctcggccca agagtctgcc 600
tccagccacc cagccaagag gactgcagag gtggaaatgc ctggcttcag aaacccacct 660
ggtgaaagtg accgagaaac tttccgtgat ctgcacgtct acttagcgcc ctccgttgaa 720
ggagctagga ccacatccca agctctcctc catcctgtca cttcggggct gaaagatcat 780
ccctgggaag ttacttttga ggtggtcaag acagccctcc ccttgcttcc tggggggaaa 840
gggcccggga atccagccaa tttgtcaggg accaagggga aagcctgggc ttttgacctg 900
tccttggagg cgttgcggcg gacttttatc ctctccttgg ggtccgtggc gttctgggag 960
ctgctgacag cgcctctgga gcaggtggca gatgacagcc tttatgagtt cctggatttt 1020
gtggatgcca ctgaccgata cagaagcctg tgggtctgga ggttgctggg catgtcggca 1080
ggcgtgtgtg gcatcacagc cttggtgggg cagctggact gcttcctgat gaccagtggc 1140
ccccgaggtg tggtccactt ctatgggtac tcggtggtca gcaccctggc cttactggtg 1200
agcattgcct ttcccattcc catctgtcag cagtgggagc ccagctacaa aagggtcaaa 1260
gcactgtcca ttgtgggggg ggtgaccccc acctcattct cctcgcctcc accactgttt 1320
tggtaggagc catcgtcagt actgtccaga actttctgtt ctggcacatg aaggaccatg 1380
ggagcggcga gctggtcatg ggtttctcgg tcgccctcag cttgctgggg gaaattctgc 1440
ttcatccgtt caaagctaca ttgcttagga aactgtccag gacgggcctg gtggggctgg 1500
ggctgagctg cctcgctggg cagctgctgt actactcttt cctctggagc tggtggtccg 1560
tcctccccat tcagatcttg agtgccatta gcaacagagc tttgtggtgg gctgtggggg 1620
cctcagtaga ggacctggcc actccccgca tggagagggc tctgagtgcc ttgttccgag 1680
gccactttta cgggagtggc tgtagcctgg gcagctttgt cgggggcttc gtggtgatgc 1740
gcttcagcct ggctgtgctc taccaggcct gctgtgtggc cctgttgctc tggttggcct 1800
tgctcctgtc catacagcgg aggctgcccc gagagcggaa aatcaagtac tcgaagctgc 1860
tgtccatgga ggtgagtgac accagtgact ctgagcaggg gacagaacag gactggcttg 1920
tgaaggccat gagggaggaa cactcagact gaaagggctg agaaatccag agtgtgctga 1980
tccagcaagg aacgaatgga ctgaacaaaa ctcagcctgc tgaggacaga aacctgccct 2040
ggactgctgg gagccgggga agagaggatg ggtctgtgct gaaggcccaa caggatcatc 2100
tcattgcatg attttcttta cttttgaagt aaaaggagat ttaacttttt gccaatcttt 2160
tttagataat ggaggaagaa tacatttgct ttttaaaaag ttaaaaaaaa aaaaaaaaaa 2220
aaaa 2224
<210>5
<211>2224
<212>DNA
<213〉homo sapiens (Homo sapiens)
<220>
<221>CDS
<222>(190)..(1323)
<400>5
gggacctggg ggctgtggcc gggggcggcc gttgacctgg tgaccgcggc gccgccccag 60
accgggggcg cagtcccact cgctccgagc cccggtcccc caagcctccc tcccgggtac 120
ctggggccgc gcccgccctg cgcccagctc cgccctccgt cggcccaggc ctgacagagc 180
ccggcagcc atg agt gcc aac ccc cgg tgg gac atc agc agg gcg ctg ggg 231
Met Ser Ala Asn Pro Arg Trp Asp Ile Ser Arg Ala Leu Gly
1 5 10
gtg gcc aag ctc ttc cac ctg gtg tgc ggg gtg cgg gaa gcc tgc gtg 279
Val Ala Lys Leu Phe His Leu Val Cys Gly Val Arg Glu Ala Cys Val
15 20 25 30
acc ccg ttc ctg acc ctt tac ctg agg cag ctg ggc ttg gcc gcg ccc 327
Thr Pro Phe Leu Thr Leu Tyr Leu Arg Gln Leu Gly Leu Ala Ala Pro
35 40 45
tgg gtg ggc acc cta atg gga acc aag cac cta atc gct gcc ttc tgg 375
Trp Val Gly Thr Leu Met Gly Thr Lys His Leu Ile Ala Ala Phe Trp
50 55 60
gct ccc gtc tgt gcc ttc ctg gcc aaa agc tac cgg aaa agg aga gcg 423
Ala Pro Val Cys Ala Phe Leu Ala Lys Ser Tyr Arg Lys Arg Arg Ala
65 70 75
ctt ctg atc ggc tcc ctg ctc ggc tcg gtg ggg gcc agc ctg ctg atg 471
Leu Leu Ile Gly Ser Leu Leu Gly Ser Val Gly Ala Ser Leu Leu Met
80 85 90
gtc ctg gtc cca ccg gta gac aaa aat cgg gtg cac ttc cct tgt aat 519
Val Leu Val Pro Pro Val Asp Lys Asn Arg Val His Phe Pro Cys Asn
95 100 105 110
gga agc agc ggc ctg acc agc aca gac gca ctc ccg ggg gtc acg cta 567
Gly Ser Ser Gly Leu Thr Ser Thr Asp Ala Leu Pro Gly Val Thr Leu
115 120 125
cct gtg aac atc acc tcg gcc caa gag tct gcc tcc agc cac cca gcc 615
Pro Val Asn Ile Thr Ser Ala Gln Glu Ser Ala Ser Ser His Pro Ala
130 135 140
aag agg act gca gag gtg gaa atg cct ggc ttc aga aac cca cct ggt 663
Lys Arg Thr Ala Glu Val Glu Met Pro Gly Phe Arg Asn Pro Pro Gly
145 150 155
gaa agt gac cga gaa act ttc cgt gat ctg cac gtc tac tta gcg ccc 711
Glu Ser Asp Arg Glu Thr Phe Arg Asp Leu His Val Tyr Leu Ala Pro
160 165 170
tcc gtt gaa gga gct agg acc aca tcc caa gct ctc ctc cat cct gtc 759
Ser Val Glu Gly Ala Arg Thr Thr Ser Gln Ala Leu Leu His Pro Val
175 180 185 190
act tcg ggg ctg aaa gat cat ccc tgg gaa gtt act ttt gag gtg gtc 807
Thr Ser Gly Leu Lys Asp His Pro Trp Glu Val Thr Phe Glu Val Val
195 200 205
aag aca gcc ctc ccc ttg ctt cct ggg ggg aaa ggg ccc ggg aat cca 855
Lys Thr Ala Leu Pro Leu Leu Pro Gly Gly Lys Gly Pro Gly Asn Pro
210 215 220
gcc aat ttg tca ggg acc aag ggg aaa gcc tgg gct ttt gac ctg tcc 903
Ala Asn Leu Ser Gly Thr Lys Gly Lys Ala Trp Ala Phe Asp Leu Ser
225 230 235
ttg gag gcg ttg cgg cgg act ttt atc ctc tcc ttg ggg tcc gtg gcg 951
Leu Glu Ala Leu Arg Arg Thr Phe Ile Leu Ser Leu Gly Ser Val Ala
240 245 250
ttc tgg gag ctg ctg aca gcg cct ctg gag cag gtg gca gat gac agc 999
Phe Trp Glu Leu Leu Thr Ala Pro Leu Glu Gln Val Ala Asp Asp Ser
255 260 265 270
ctt tat gag ttc ctg gat ttt gtg gat gcc act gac cga tac aga agc 1047
Leu Tyr Glu Phe Leu Asp Phe Val Asp Ala Thr Asp Arg Tyr Arg Ser
275 280 285
ctg tgg gtc tgg agg ttg ctg ggc atg tcg gca ggc gtg tgt ggc atc 1095
Leu Trp Val Trp Arg Leu Leu Gly Met Ser Ala Gly Val Cys Gly Ile
290 295 300
aca gcc ttg gtg ggg cag ctg gac tgc ttc ctg atg acc agt ggc ccc 1143
Thr Ala Leu Val Gly Gln Leu Asp Cys Phe Leu Met Thr Ser Gly Pro
305 310 315
cga ggt gtg gtc cac ttc tat ggg tac tcg gtg gtc agc acc ctg gcc 1191
Arg Gly Val Val His Phe Tyr Gly Tyr Ser Val Val Ser Thr Leu Ala
320 325 330
tta ctg gtg agc att gcc ttt ccc att ccc atc tgt cag cag tgg gag 1239
Leu Leu Val Ser Ile Ala Phe Pro Ile Pro Ile Cys Gln Gln Trp Glu
335 340 345 350
ccc agc tac aaa agg gtc aaa gca ctg tcc att gtg ggg ggg gtg acc 1287
Pro Ser Tyr Lys Arg Val Lys Ala Leu Ser Ile Val Gly Gly Val Thr
355 360 365
ccc acc tca ttc tcc tcg cct cca cca ctg ttt tgg taggagccat 1333
Pro Thr Ser Phe Ser Ser Pro Pro Pro Leu Phe Trp
370 375
cgtcagtact gtccagaact ttctgttctg gcacatgaag gaccatggga gcggcgagct 1393
ggtcatgggt ttctcggtcg ccctcagctt gctgggggaa attctgcttc atccgttcaa 1453
agctacattg cttaggaaac tgtccaggac gggcctggtg gggctggggc tgagctgcct 1513
cgctgggcag ctgctgtact actctttcct ctggagctgg tggtccgtcc tccccattca 1573
gatcttgagt gccattagca acagagcttt gtggtgggct gtgggggcct cagtagagga 1633
cctggccact ccccgcatgg agagggctct gagtgccttg ttccgaggcc acttttacgg 1693
gagtggctgt agcctgggca gctttgtcgg gggcttcgtg gtgatgcgct tcagcctggc 1753
tgtgctctac caggcctgct gtgtggccct gttgctctgg ttggccttgc tcctgtccat 1813
acagcggagg ctgccccgag agcggaaaat caagtactcg aagctgctgt ccatggaggt 1873
gagtgacacc agtgactctg agcaggggac agaacaggac tggcttgtga aggccatgag 1933
ggaggaacac tcagactgaa agggctgaga aatccagagt gtgctgatcc agcaaggaac 1993
gaatggactg aacaaaactc agcctgctga ggacagaaac ctgccctgga ctgctgggag 2053
ccggggaaga gaggatgggt ctgtgctgaa ggcccaacag gatcatctca ttgcatgatt 2113
ttctttactt ttgaagtaaa aggagattta actttttgcc aatctttttt agataatgga 2173
ggaagaatac atttgctttt taaaaagtta aaaaaaaaaa aaaaaaaaaa a 2224
<210>6
<211>378
<212>PRT
<213〉homo sapiens (Homo sapiens)
<400>6
Met Ser Ala Asn Pro Arg Trp Asp Ile Ser Arg Ala Leu Gly Val Ala
1 5 10 15
Lys Leu Phe His Leu Val Cys Gly Val Arg Glu Ala Cys Val Thr Pro
20 25 30
Phe Leu Thr Leu Tyr Leu Arg Gln Leu Gly Leu Ala Ala Pro Trp Val
35 40 45
Gly Thr Leu Met Gly Thr Lys His Leu Ile Ala Ala Phe Trp Ala Pro
50 55 60
Val Cys Ala Phe Leu Ala Lys Ser Tyr Arg Lys Arg Arg Ala Leu Leu
65 70 75 80
Ile Gly Ser Leu Leu Gly Ser Val Gly Ala Ser Leu Leu Met Val Leu
85 90 95
Val Pro Pro Val Asp Lys Asn Arg Val His Phe Pro Cys Asn Gly Ser
100 105 110
Ser Gly Leu Thr Ser Thr Asp Ala Leu Pro Gly Val Thr Leu Pro Val
115 120 125
Asn Ile Thr Ser Ala Gln Glu Ser Ala Ser Ser His Pro Ala Lys Arg
130 135 140
Thr Ala Glu Val Glu Met Pro Gly Phe Arg Asn Pro Pro Gly Glu Ser
145 150 155 160
Asp Arg Glu Thr Phe Arg Asp Leu His Val Tyr Leu Ala Pro Ser Val
165 170 175
Glu Gly Ala Arg Thr Thr Ser Gln Ala Leu Leu His Pro Val Thr Ser
180 185 190
Gly Leu Lys Asp His Pro Trp Glu Val Thr Phe Glu Val Val Lys Thr
195 200 205
Ala Leu Pro Leu Leu Pro Gly Gly Lys Gly Pro Gly Asn Pro Ala Asn
210 215 220
Leu Ser Gly Thr Lys Gly Lys Ala Trp Ala Phe Asp Leu Ser Leu Glu
225 230 235 240
Ala Leu Arg Arg Thr Phe Ile Leu Ser Leu Gly Ser Val Ala Phe Trp
245 250 255
Glu Leu Leu Thr Ala Pro Leu Glu Gln Val Ala Asp Asp Ser Leu Tyr
260 265 270
Glu Phe Leu Asp Phe Val Asp Ala Thr Asp Arg Tyr Arg Ser Leu Trp
275 280 285
Val Trp Arg Leu Leu Gly Met Ser Ala Gly Val Cys Gly Ile Thr Ala
290 295 300
Leu Val Gly Gln Leu Asp Cys Phe Leu Met Thr Ser Gly Pro Arg Gly
305 310 315 320
Val Val His Phe Tyr Gly Tyr Ser Val Val Ser Thr Leu Ala Leu Leu
325 330 335
Val Ser Ile Ala Phe Pro Ile Pro Ile Cys Gln Gln Trp Glu Pro Ser
340 345 350
Tyr Lys Arg Val Lys Ala Leu Ser Ile Val Gly Gly Val Thr Pro Thr
355 360 365
Ser Phe Ser Ser Pro Pro Pro Leu Phe Trp
370 375
<210>7
<211>2916
<212>DNA
<213〉homo sapiens (Homo sapiens)
<400>7
gaaaagacgt ttaataaaac agctttacct tttaaaaata aaatttcaat cccagtctgc 60
ctcaggtacg ggtgggggtg tgcagcgtga ctcctggctg tgttcctgta ccgtttgtgg 120
cctccggggt cttgcgcgtc cagggcccag ataccgagta actgccatgt ggtgtcgtgg 180
tggcgtccat aggcctccgt ggtggggcgg gggcggggct gtgggtgtgc agggcagctg 240
cattttccaa gagtggagat ggaggatgga gatttgcttc gtaaaggcct gggagctaca 300
tgggaggtag ggaggggcca gccacaccca gggctggggg gccggtggac tctgcccagt 360
gaagccaggg gagtgtcagc aatgaggggc ctggacagcg acgcgggacc cgcacctggg 420
agcggcagag atggagccca gctcgctaaa cgtaccctaa gggtgcacgc tgaccgtctt 480
tgatccctgg acagtgtcca tagagttgtc tccccttcgg tcacattgtc cctggccttg 540
gagtttctgc ctcagggaaa gacccactaa aaccacccgg gctctggcgg cctcctcttc 600
tccagagcat ggtggcgctg cctcctgagc agagtcactg agagtgggtc caaggaaggg 660
acccacttag caaaggttgt tgtgtggtgc tgtgcaggca cggcgcctgg agttggcccc 720
gccagcatcg gggtggcttg gggctctggt tcctaagctt ctgcttctga gttggtcagc 780
acgttggctc tgcgccctgc agtgcctgtc ttgatggtgg tccctggatg tgctgaaggg 840
ctccaggagc aaggggccag accacacttt ggctgagtcc ttgggtagtt tcagagggtc 900
acctgcacag tctgcattga gcaccctggg gctgaggccg tgggtcaagt tctgcctgac 960
agtgctttga ttatatggtg atgtgattct gcaaacataa ccccccaact gagtgccagc 1020
tgctgggtgt ccccacctca agagccaagg ggaggccacc ctggaactgg tgctttgccc 1080
ttggtgtcat ggaggtgtct gggtccacac gtggtgggtg ggctgggcct gcagtcaccc 1140
agcatggccg gcgggattgg acccagctgc aacctggcag gttccatggg gcttcctgtg 1200
cctgggcctg cctgtgtctc atgcctgtca ataaaggaga aagcaacctg ggaaaaacat 1260
tcctctctgt acccagggga ctgccaagtg tgctttatag aaggctggaa gaccgactga 1320
agtaggaagc tgtcacatgt ttgagccaac tgagcaaagc agctgttgcc tgccagacag 1380
ccaccccaag accaagggag ccagcctgtc gtttgtcctt ctagcctgga agaccagttc 1440
ttggctcttg ggcactcaca ttggccactg gccgcaactg ctgaatggct tcactaatgt 1500
ccctggggtc tgtatctgcc ctctcttcta ttccctagaa actctggccg ggcatggcgg 1560
ctcacgcctg tcatcccagc acttagggag gccgaggcag gtggatcaca aggtcaagag 1620
atcgacacca tcctggccaa catggtgaaa ccccatctct actaaaaata caaaaattag 1680
ctggacatgg tggtgggcgc ctgtagtccc agctacttgg gaggctgagg caggagaatt 1740
gcttgaactc gggaggcgta ggttgcagtg agccaagatc gcaccactgc actccagcct 1800
ggcaacagag caagactcct caaaaaacaa acaaacaaac aaacaaactc tgatcattca 1860
cccggtggga tccccggcag ctccttctcc ttcccgagct gtggctcatg ggccattgga 1920
ttcagagacc ttgagctggg tgctggctgc cctaaaggag cacgacccct ccagcctggt 1980
gaccgggagg cagagaagta cagctgtcct tgacaaaggc ccagcctctg ctgcctcagg 2040
ccagcctggc cacaccccga tgccacagtc atctgccccg cccaggcata agcctgactc 2100
aggtgataca ggcgaggctt ctcccatggg tcctcccctt gtgagaaagg gtctgtgctg 2160
tgtaccttct ctctgcacag ccattgtcag tgtggtttgg agtgagtgag cctcccgcac 2220
tgctcagggg agagctggtg agcatgactc catggcctcc tgctggttgg ggtcttcttc 2280
tgattctcac ccgcaggagc acccagaccc ccccttttca gaaatgatta cattttccct 2340
tcaagagtca tctgtgacct ggtggtgtgg agggactgtc ccctcctaca gcctggccga 2400
cccccttcct ccagggctgt gtggaatcac ctctcagaag ctcactctga aaagccactc 2460
acagggattt caccaggatg ttgctgttcc tgccgacacc tgggctttgt gacaaatgct 2520
gtttgcttca agctgttcac aagcaacttc accagaatcc caagctggtt cgtacttaat 2580
ttgtatacat acatatgcat aatatacatg accatgtatg agatacagac accatggctt 2640
ctgtccattg gctcattcgc ctggtcgcat gttcactcca tgaggttagg acactgcggg 2700
tggctttggg atgggtggtt tcagggacaa ggtgtcgctg tgatctctgt atttgctaac 2760
tagtcttgtt ttgatttcta acggtctagt ggagtacagt taatgttgtg tttatttctg 2820
taactgtagg ggtggagtat aaacctaact tttggaaaca tgaagaaata aacaatagcg 2880
ccagccgtta aaaaaaaaaa aaaaaaaaaa aaaaaa 2916
<210>8
<211>2916
<212>DNA
<213〉homo sapiens (Homo sapiens)
<220>
<221>CDS
<222>(1337)..(1645)
<400>8
gaaaagacgt ttaataaaac agctttacct tttaaaaata aaatttcaat cccagtctgc 60
ctcaggtacg ggtgggggtg tgcagcgtga ctcctggctg tgttcctgta ccgtttgtgg 120
cctccggggt cttgcgcgtc cagggcccag ataccgagta actgccatgt ggtgtcgtgg 180
tggcgtccat aggcctccgt ggtggggcgg gggcggggct gtgggtgtgc agggcagctg 240
cattttccaa gagtggagat ggaggatgga gatttgcttc gtaaaggcct gggagctaca 300
tgggaggtag ggaggggcca gccacaccca gggctggggg gccggtggac tctgcccagt 360
gaagccaggg gagtgtcagc aatgaggggc ctggacagcg acgcgggacc cgcacctggg 420
agcggcagag atggagccca gctcgctaaa cgtaccctaa gggtgcacgc tgaccgtctt 480
tgatccctgg acagtgtcca tagagttgtc tccccttcgg tcacattgtc cctggccttg 540
gagtttctgc ctcagggaaa gacccactaa aaccacccgg gctctggcgg cctcctcttc 600
tccagagcat ggtggcgctg cctcctgagc agagtcactg agagtgggtc caaggaaggg 660
acccacttag caaaggttgt tgtgtggtgc tgtgcaggca cggcgcctgg agttggcccc 720
gccagcatcg gggtggcttg gggctctggt tcctaagctt ctgcttctga gttggtcagc 780
acgttggctc tgcgccctgc agtgcctgtc ttgatggtgg tccctggatg tgctgaaggg 840
ctccaggagc aaggggccag accacacttt ggctgagtcc ttgggtagtt tcagagggtc 900
acctgcacag tctgcattga gcaccctggg gctgaggccg tgggtcaagt tctgcctgac 960
agtgctttga ttatatggtg atgtgattct gcaaacataa ccccccaact gagtgccagc 1020
tgctgggtgt ccccacctca agagccaagg ggaggccacc ctggaactgg tgctttgccc 1080
ttggtgtcat ggaggtgtct gggtccacac gtggtgggtg ggctgggcct gcagtcaccc 1140
agcatggccg gcgggattgg acccagctgc aacctggcag gttccatggg gcttcctgtg 1200
cctgggcctg cctgtgtctc atgcctgtca ataaaggaga aagcaacctg ggaaaaacat 1260
tcctctctgt acccagggga ctgccaagtg tgctttatag aaggctggaa gaccgactga 1320
agtaggaagc tgtcac atg ttt gag cca act gag caa agc agc tgt tgc ctg 1372
Met Phe Glu Pro Thr Glu Gln Ser Ser Cys Cys Leu
1 5 10
cca gac agc cac ccc aag acc aag gga gcc agc ctg tcg ttt gtc ctt 1420
Pro Asp Ser His Pro Lys Thr Lys Gly Ala Ser Leu Ser Phe Val Leu
15 20 25
cta gcc tgg aag acc agt tct tgg ctc ttg ggc act cac att ggc cac 1468
Leu Ala Trp Lys Thr Ser Ser Trp Leu Leu Gly Thr His Ile Gly His
30 35 40
tgg ccg caa ctg ctg aat ggc ttc act aat gtc cct ggg gtc tgt atc 1516
Trp Pro Gln Leu Leu Asn Gly Phe Thr Asn Val Pro Gly Val Cys Ile
45 50 55 60
tgc cct ctc ttc tat tcc cta gaa act ctg gcc ggg cat ggc ggc tca 1564
Cys Pro Leu Phe Tyr Ser Leu Glu Thr Leu Ala Gly His Gly Gly Ser
65 70 75
cgc ctg tca tcc cag cac tta ggg agg ccg agg cag gtg gat cac aag 1612
Arg Leu Ser Ser Gln His Leu Gly Arg Pro Arg Gln Val Asp His Lys
80 85 90
gtc aag aga tcg aca cca tcc tgg cca aca tgg tgaaacccca tctctactaa 1665
Val Lys Arg Ser Thr Pro Ser Trp Pro Thr Trp
95 100
aaatacaaaa attagctgga catggtggtg ggcgcctgta gtcccagcta cttgggaggc 1725
tgaggcagga gaattgcttg aactcgggag gcgtaggttg cagtgagcca agatcgcacc 1785
actgcactcc agcctggcaa cagagcaaga ctcctcaaaa aacaaacaaa caaacaaaca 1845
aactctgatc attcacccgg tgggatcccc ggcagctcct tctccttccc gagctgtggc 1905
tcatgggcca ttggattcag agaccttgag ctgggtgctg gctgccctaa aggagcacga 1965
cccctccagc ctggtgaccg ggaggcagag aagtacagct gtccttgaca aaggcccagc 2025
ctctgctgcc tcaggccagc ctggccacac cccgatgcca cagtcatctg ccccgcccag 2085
gcataagcct gactcaggtg atacaggcga ggcttctccc atgggtcctc cccttgtgag 2145
aaagggtctg tgctgtgtac cttctctctg cacagccatt gtcagtgtgg tttggagtga 2205
gtgagcctcc cgcactgctc aggggagagc tggtgagcat gactccatgg cctcctgctg 2265
gttggggtct tcttctgatt ctcacccgca ggagcaccca gaccccccct tttcagaaat 2325
gattacattt tcccttcaag agtcatctgt gacctggtgg tgtggaggga ctgtcccctc 2385
ctacagcctg gccgaccccc ttcctccagg gctgtgtgga atcacctctc agaagctcac 2445
tctgaaaagc cactcacagg gatttcacca ggatgttgct gttcctgccg acacctgggc 2505
tttgtgacaa atgctgtttg cttcaagctg ttcacaagca acttcaccag aatcccaagc 2565
tggttcgtac ttaatttgta tacatacata tgcataatat acatgaccat gtatgagata 2625
cagacaccat ggcttctgtc cattggctca ttcgcctggt cgcatgttca ctccatgagg 2685
ttaggacact gcgggtggct ttgggatggg tggtttcagg gacaaggtgt cgctgtgatc 2745
tctgtatttg ctaactagtc ttgttttgat ttctaacggt ctagtggagt acagttaatg 2805
ttgtgtttat ttctgtaact gtaggggtgg agtataaacc taacttttgg aaacatgaag 2865
aaataaacaa tagcgccagc cgttaaaaaa aaaaaaaaaa aaaaaaaaaa a 2916
<210>9
<211>103
<212>PRT
<213〉homo sapiens (Homo sapiens)
<400>9
Met Phe Glu Pro Thr Glu Gln Ser Ser Cys Cys Leu Pro Asp Ser His
1 5 10 15
Pro Lys Thr Lys Gly Ala Ser Leu Ser Phe Val Leu Leu Ala Trp Lys
20 25 30
Thr Ser Ser Trp Leu Leu Gly Thr His Ile Gly His Trp Pro Gln Leu
35 40 45
Leu Asn Gly Phe Thr Asn Val Pro Gly Val Cys Ile Cys Pro Leu Phe
50 55 60
Tyr Ser Leu Glu Thr Leu Ala Gly His Gly Gly Ser Arg Leu Ser Ser
65 70 75 80
Gln His Leu Gly Arg Pro Arg Gln Val Asp His Lys Val Lys Arg Ser
85 90 95
Thr Pro Ser Trp Pro Thr Trp
100
<210>10
<211>20
<212>DNA
<213〉artificial sequence
<220>
<221>misc_feature
<223〉primer
<400>10
atgatggtca ggctggtcaa 20
<210>11
<211>24
<212>DNA
<213〉artificial sequence
<220>
<221>misc_feature
<223〉primer
<400>11
tcgaaagtac aacgctttat tgaa 24
<210>12
<211>18
<212>DNA
<213〉artificial sequence
<220>
<221>misc_feature
<223〉primer
<400>12
ggccgttgac ctggtgac 18
<210>13
<211>23
<212>DNA
<213〉artificial sequence
<220>
<221>misc_feature
<223〉primer
<400>13
aaagcaaatg tattcttcct cca 23
<210>14
<211>20
<212>DNA
<213〉artificial sequence
<220>
<221>misc_feature
<223〉primer
<400>14
tttcaatccc agtctgcctc 20
<210>15
<211>20
<212>DNA
<213〉artificial sequence
<220>
<221>misc_feature
<223〉primer
<400>15
ggctggcgct attgtttatt 20
Claims (5)
1. isolating polynucleotide is characterized in that, it comprises a nucleotide sequence, and this nucleotide sequence is selected from down group:
(a) coding has the proteic polynucleotide of people of cancer suppressing function, and described albumen has and contains following aminoacid sequence: SEQ ID NO:9;
(b) with the complete complementary polynucleotide of polynucleotide (a).
2. polynucleotide as claimed in claim 1 is characterized in that, the polypeptide of this polynucleotide encoding has following aminoacid sequence: SEQ ID NO:9.
3. polynucleotide as claimed in claim 1 is characterized in that, the sequence of these polynucleotide is selected from down group:
Coding region sequence or the full length sequence of SEQ ID NO:8.
4. a carrier is characterized in that, it contains the described polynucleotide of claim 1.
5. a genetically engineered host cell is characterized in that, it is a kind of host cell that is selected from down group:
(a) host cell that transforms or transduce with the described carrier of claim 4;
(b) host cell that transforms or transduce with the described polynucleotide of claim 1.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN 02111646 CN1209370C (en) | 2002-05-13 | 2002-05-13 | New human protein with cancer inhibiting function and its encoding sequence |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN 02111646 CN1209370C (en) | 2002-05-13 | 2002-05-13 | New human protein with cancer inhibiting function and its encoding sequence |
Publications (2)
Publication Number | Publication Date |
---|---|
CN1458170A CN1458170A (en) | 2003-11-26 |
CN1209370C true CN1209370C (en) | 2005-07-06 |
Family
ID=29426328
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN 02111646 Expired - Fee Related CN1209370C (en) | 2002-05-13 | 2002-05-13 | New human protein with cancer inhibiting function and its encoding sequence |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN1209370C (en) |
-
2002
- 2002-05-13 CN CN 02111646 patent/CN1209370C/en not_active Expired - Fee Related
Also Published As
Publication number | Publication date |
---|---|
CN1458170A (en) | 2003-11-26 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN1170850C (en) | Human angiogenin-like protein and coding sequence and application thereof | |
CN1209370C (en) | New human protein with cancer inhibiting function and its encoding sequence | |
CN1169954C (en) | Human protein able to suppress growth of cancer cells and its coding sequence | |
CN1177048C (en) | Human protein with function of suppressing cancer cell growth and its coding sequence | |
CN1177049C (en) | Human protein with function of suppressing cancer cell growth and its coding sequence | |
CN1177864C (en) | Novel human protein with expression difference in liver cancer tissue and its code sequence | |
CN1199998C (en) | Human protein with suppression to cancer cell growth and its coding sequence | |
CN1199997C (en) | New human protein having mouse NIH/3T3 cell conversion promoting function and its code sequence | |
CN1177050C (en) | Human protein with function of suppressing cancer cell growth and its coding sequence | |
CN1155616C (en) | Human protein with cancer cell growth promoting function and its coding sequence | |
CN1169958C (en) | Human protein able to suppress growth of cancer cells and its coding sequence | |
CN1155615C (en) | Human protein with cancer cell growth suppressing function and its coding sequence | |
CN1194989C (en) | Novel human protein able to suppress cancer cell growth and its coding sequence | |
CN1193041C (en) | New human protein with the function of inhibiting cancer cell growth and its encoding sequence | |
CN1194010C (en) | New human protein with the function of inhibiting cancer cell growth and its coding sequence | |
CN1199996C (en) | New human protein having cancer cell growth in hibiting function and its code sequence | |
CN1209374C (en) | Human Protein for promoting transform of 3T3 cell and its coding sequence | |
CN1169833C (en) | Human Protein with cancer inhibiting function and its coding sequence | |
CN1170848C (en) | Novel human hepatoma associated protein and coding sequence thereof | |
CN1246457C (en) | Human TSC403 gene and human INGIL gene | |
CN1209373C (en) | Human protein with suppression to cancer cell growth and its coding sequence | |
CN1190446C (en) | New human protein with function of improving mouse NIH/3T3 cell transformation and its encoding sequence | |
CN1169831C (en) | Human protein with cancer call growth suppressing function and its coding sequence | |
CN1155614C (en) | Human protein with cancer cell growth suppressing function and its coding sequence | |
CN1169955C (en) | Human protein able to suppress growth of cancer cells and its coding sequence |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C10 | Entry into substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
C06 | Publication | ||
PB01 | Publication | ||
C10 | Entry into substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
C14 | Grant of patent or utility model | ||
GR01 | Patent grant | ||
C19 | Lapse of patent right due to non-payment of the annual fee | ||
CF01 | Termination of patent right due to non-payment of annual fee |